nohup: ignoring input master_addr is only used for static rdzv_backend and when rdzv_endpoint is not specified. WARNING:torch.distributed.run: ***************************************** Setting OMP_NUM_THREADS environment variable for each process to be 1 in default, to avoid your system being overloaded, please further tune the variable for optimal performance in your application as needed. ***************************************** [2024-01-02 17:39:28,646][distributed_c10d.py][INFO] Added key: store_based_barrier_key:1 to store for rank: 2 [2024-01-02 17:39:28,651][distributed_c10d.py][INFO] Added key: store_based_barrier_key:1 to store for rank: 1 [2024-01-02 17:39:28,680][distributed_c10d.py][INFO] Added key: store_based_barrier_key:1 to store for rank: 6 [2024-01-02 17:39:28,712][distributed_c10d.py][INFO] Added key: store_based_barrier_key:1 to store for rank: 0 [2024-01-02 17:39:28,713][distributed_c10d.py][INFO] Added key: store_based_barrier_key:1 to store for rank: 7 [2024-01-02 17:39:28,720][distributed_c10d.py][INFO] Added key: store_based_barrier_key:1 to store for rank: 3 [2024-01-02 17:39:28,722][distributed_c10d.py][INFO] Added key: store_based_barrier_key:1 to store for rank: 4 [2024-01-02 17:39:28,728][distributed_c10d.py][INFO] Added key: store_based_barrier_key:1 to store for rank: 5 [2024-01-02 17:39:28,728][distributed_c10d.py][INFO] Rank 5: Completed store-based barrier for key:store_based_barrier_key:1 with 8 nodes. [2024-01-02 17:39:28,728][distributed_c10d.py][INFO] Rank 2: Completed store-based barrier for key:store_based_barrier_key:1 with 8 nodes. [2024-01-02 17:39:28,731][distributed_c10d.py][INFO] Rank 3: Completed store-based barrier for key:store_based_barrier_key:1 with 8 nodes. [2024-01-02 17:39:28,731][distributed_c10d.py][INFO] Rank 6: Completed store-based barrier for key:store_based_barrier_key:1 with 8 nodes. 
[2024-01-02 17:39:28,733][distributed_c10d.py][INFO] Rank 4: Completed store-based barrier for key:store_based_barrier_key:1 with 8 nodes. [2024-01-02 17:39:28,733][distributed_c10d.py][INFO] Rank 0: Completed store-based barrier for key:store_based_barrier_key:1 with 8 nodes. [2024-01-02 17:39:28,734][distributed_c10d.py][INFO] Rank 1: Completed store-based barrier for key:store_based_barrier_key:1 with 8 nodes. [2024-01-02 17:39:28,734][distributed_c10d.py][INFO] Rank 7: Completed store-based barrier for key:store_based_barrier_key:1 with 8 nodes. memmap:True train data.shape:(293670029, 512) downloading finished..... Initializing a new model from scratch memmap:True train data.shape:(293670029, 512) downloading finished..... Initializing a new model from scratch memmap:True train data.shape:(293670029, 512) downloading finished..... Initializing a new model from scratch memmap:True train data.shape:(293670029, 512) downloading finished..... Initializing a new model from scratch memmap:True train data.shape:(293670029, 512) downloading finished..... Initializing a new model from scratch memmap:True train data.shape:(293670029, 512) downloading finished..... Initializing a new model from scratch tokens per iteration will be: 32,768 breaks down as: 1 grad accum steps * 8 processes * 8 batch size * 512 max seq len memmap:True train data.shape:(293670029, 512) downloading finished..... Initializing a new model from scratch memmap:True train data.shape:(293670029, 512) downloading finished..... 
Initializing a new model from scratch Total parameter count: 1005591552 Total parameter count: 1005591552 Total parameter count: 1005591552 Total parameter count: 1005591552 Total parameter count: 1005591552 Total parameter count: 1005591552 Total parameter count: 1005591552 Total parameter count: 1005591552 num decayed parameter tensors: 225, with 1,005,491,712 parameters num non-decayed parameter tensors: 65, with 99,840 parameters using fused AdamW: True num decayed parameter tensors: 225, with 1,005,491,712 parameters num non-decayed parameter tensors: 65, with 99,840 parameters using fused AdamW: True num decayed parameter tensors: 225, with 1,005,491,712 parameters num non-decayed parameter tensors: 65, with 99,840 parameters using fused AdamW: True num decayed parameter tensors: 225, with 1,005,491,712 parameters num non-decayed parameter tensors: 65, with 99,840 parameters using fused AdamW: True num decayed parameter tensors: 225, with 1,005,491,712 parameters num non-decayed parameter tensors: 65, with 99,840 parameters using fused AdamW: True num decayed parameter tensors: 225, with 1,005,491,712 parameters num non-decayed parameter tensors: 65, with 99,840 parameters using fused AdamW: True num decayed parameter tensors: 225, with 1,005,491,712 parameters num non-decayed parameter tensors: 65, with 99,840 parameters using fused AdamW: True num decayed parameter tensors: 225, with 1,005,491,712 parameters num non-decayed parameter tensors: 65, with 99,840 parameters using fused AdamW: True [2024-01-02 17:40:18,090][model8_pretrain.py][INFO] Epoch:[0/2](0/4588595) loss:11.339 lr:0.0000000 epoch_Time:2333086.0min: [2024-01-02 17:40:18,093][model8_pretrain.py][INFO] Epoch:[0/2](0/4588595) loss:11.359 lr:0.0000000 epoch_Time:2330913.0min: [2024-01-02 17:40:18,093][model8_pretrain.py][INFO] Epoch:[0/2](0/4588595) loss:11.385 lr:0.0000000 epoch_Time:2332028.0min: [2024-01-02 17:40:18,096][model8_pretrain.py][INFO] Epoch:[0/2](0/4588595) loss:11.342 lr:0.0000000 epoch_Time:2332113.0min: [2024-01-02 17:40:18,097][model8_pretrain.py][INFO] Epoch:[0/2](0/4588595) loss:11.393 lr:0.0000000 
epoch_Time:2334802.0min: [2024-01-02 17:40:18,098][model8_pretrain.py][INFO] Epoch:[0/2](0/4588595) loss:11.395 lr:0.0000000 epoch_Time:2332419.0min: [2024-01-02 17:40:18,100][distributed.py][INFO] Reducer buckets have been rebuilt in this iteration. [2024-01-02 17:40:18,101][distributed.py][INFO] Reducer buckets have been rebuilt in this iteration. [2024-01-02 17:40:18,101][distributed.py][INFO] Reducer buckets have been rebuilt in this iteration. [2024-01-02 17:40:23,492][model8_pretrain.py][INFO] Epoch:[0/2](0/4588595) loss:11.396 lr:0.0000000 epoch_Time:2742879.0min: [2024-01-02 17:40:23,494][distributed.py][INFO] Reducer buckets have been rebuilt in this iteration. [2024-01-02 17:40:23,496][distributed.py][INFO] Reducer buckets have been rebuilt in this iteration. [2024-01-02 17:40:24,796][model8_pretrain.py][INFO] Epoch:[0/2](0/4588595) loss:11.414 lr:0.0000000 epoch_Time:2844006.0min: [2024-01-02 17:40:24,799][distributed.py][INFO] Reducer buckets have been rebuilt in this iteration. [2024-01-02 17:40:24,799][distributed.py][INFO] Reducer buckets have been rebuilt in this iteration. [2024-01-02 17:40:24,799][distributed.py][INFO] Reducer buckets have been rebuilt in this iteration. 
[2024-01-02 17:41:01,758][model8_pretrain.py][INFO] Epoch:[0/2](100/4588595) loss:8.210 lr:0.0000300 epoch_Time:56164.0min: [2024-01-02 17:41:01,758][model8_pretrain.py][INFO] Epoch:[0/2](100/4588595) loss:8.346 lr:0.0000300 epoch_Time:56141.0min: [2024-01-02 17:41:01,758][model8_pretrain.py][INFO] Epoch:[0/2](100/4588595) loss:8.100 lr:0.0000300 epoch_Time:56140.0min: [2024-01-02 17:41:01,758][model8_pretrain.py][INFO] Epoch:[0/2](100/4588595) loss:8.115 lr:0.0000300 epoch_Time:56152.0min: [2024-01-02 17:41:01,758][model8_pretrain.py][INFO] Epoch:[0/2](100/4588595) loss:8.118 lr:0.0000300 epoch_Time:56152.0min: [2024-01-02 17:41:01,759][model8_pretrain.py][INFO] Epoch:[0/2](100/4588595) loss:8.180 lr:0.0000300 epoch_Time:56152.0min: [2024-01-02 17:41:01,759][model8_pretrain.py][INFO] Epoch:[0/2](100/4588595) loss:8.233 lr:0.0000300 epoch_Time:56179.0min: [2024-01-02 17:41:01,759][model8_pretrain.py][INFO] Epoch:[0/2](100/4588595) loss:8.211 lr:0.0000300 epoch_Time:56150.0min: [2024-01-02 17:41:38,670][model8_pretrain.py][INFO] Epoch:[0/2](200/4588595) loss:7.767 lr:0.0000600 epoch_Time:42253.0min: [2024-01-02 17:41:38,670][model8_pretrain.py][INFO] Epoch:[0/2](200/4588595) loss:7.478 lr:0.0000600 epoch_Time:42265.0min: [2024-01-02 17:41:38,670][model8_pretrain.py][INFO] Epoch:[0/2](200/4588595) loss:7.602 lr:0.0000600 epoch_Time:42259.0min: [2024-01-02 17:41:38,671][model8_pretrain.py][INFO] Epoch:[0/2](200/4588595) loss:7.547 lr:0.0000600 epoch_Time:42259.0min: [2024-01-02 17:41:38,671][model8_pretrain.py][INFO] Epoch:[0/2](200/4588595) loss:7.386 lr:0.0000600 epoch_Time:42272.0min: [2024-01-02 17:41:38,671][model8_pretrain.py][INFO] Epoch:[0/2](200/4588595) loss:7.533 lr:0.0000600 epoch_Time:42254.0min: [2024-01-02 17:41:38,671][model8_pretrain.py][INFO] Epoch:[0/2](200/4588595) loss:7.719 lr:0.0000600 epoch_Time:42258.0min: [2024-01-02 17:41:38,671][model8_pretrain.py][INFO] Epoch:[0/2](200/4588595) loss:7.560 lr:0.0000600 epoch_Time:42259.0min: [2024-01-02 
17:42:15,584][model8_pretrain.py][INFO] Epoch:[0/2](300/4588595) loss:7.410 lr:0.0000900 epoch_Time:37597.0min: [2024-01-02 17:42:15,584][model8_pretrain.py][INFO] Epoch:[0/2](300/4588595) loss:7.087 lr:0.0000900 epoch_Time:37593.0min: [2024-01-02 17:42:15,584][model8_pretrain.py][INFO] Epoch:[0/2](300/4588595) loss:6.866 lr:0.0000900 epoch_Time:37593.0min: [2024-01-02 17:42:15,584][model8_pretrain.py][INFO] Epoch:[0/2](300/4588595) loss:7.182 lr:0.0000900 epoch_Time:37601.0min: [2024-01-02 17:42:15,584][model8_pretrain.py][INFO] Epoch:[0/2](300/4588595) loss:6.785 lr:0.0000900 epoch_Time:37596.0min: [2024-01-02 17:42:15,584][model8_pretrain.py][INFO] Epoch:[0/2](300/4588595) loss:7.290 lr:0.0000900 epoch_Time:37605.0min: [2024-01-02 17:42:15,584][model8_pretrain.py][INFO] Epoch:[0/2](300/4588595) loss:6.996 lr:0.0000900 epoch_Time:37597.0min: [2024-01-02 17:42:15,585][model8_pretrain.py][INFO] Epoch:[0/2](300/4588595) loss:6.967 lr:0.0000900 epoch_Time:37597.0min: [2024-01-02 17:43:00,307][model8_pretrain.py][INFO] Epoch:[0/2](400/4588595) loss:6.753 lr:0.0001200 epoch_Time:36746.0min: [2024-01-02 17:43:00,307][model8_pretrain.py][INFO] Epoch:[0/2](400/4588595) loss:6.811 lr:0.0001200 epoch_Time:36749.0min: [2024-01-02 17:43:00,307][model8_pretrain.py][INFO] Epoch:[0/2](400/4588595) loss:6.687 lr:0.0001200 epoch_Time:36752.0min: [2024-01-02 17:43:00,307][model8_pretrain.py][INFO] Epoch:[0/2](400/4588595) loss:6.655 lr:0.0001200 epoch_Time:36746.0min: [2024-01-02 17:43:00,307][model8_pretrain.py][INFO] Epoch:[0/2](400/4588595) loss:6.974 lr:0.0001200 epoch_Time:36748.0min: [2024-01-02 17:43:00,307][model8_pretrain.py][INFO] Epoch:[0/2](400/4588595) loss:6.632 lr:0.0001200 epoch_Time:36749.0min: [2024-01-02 17:43:00,307][model8_pretrain.py][INFO] Epoch:[0/2](400/4588595) loss:6.642 lr:0.0001200 epoch_Time:36755.0min: [2024-01-02 17:43:00,307][model8_pretrain.py][INFO] Epoch:[0/2](400/4588595) loss:6.716 lr:0.0001200 epoch_Time:36749.0min: [2024-01-02 
17:43:37,235][model8_pretrain.py][INFO] Epoch:[0/2](500/4588595) loss:6.493 lr:0.0001500 epoch_Time:35050.0min: [2024-01-02 17:43:37,235][model8_pretrain.py][INFO] Epoch:[0/2](500/4588595) loss:6.015 lr:0.0001500 epoch_Time:35048.0min: [2024-01-02 17:43:37,235][model8_pretrain.py][INFO] Epoch:[0/2](500/4588595) loss:6.041 lr:0.0001500 epoch_Time:35053.0min: [2024-01-02 17:43:37,235][model8_pretrain.py][INFO] Epoch:[0/2](500/4588595) loss:6.653 lr:0.0001500 epoch_Time:35048.0min: [2024-01-02 17:43:37,235][model8_pretrain.py][INFO] Epoch:[0/2](500/4588595) loss:6.189 lr:0.0001500 epoch_Time:35050.0min: [2024-01-02 17:43:37,236][model8_pretrain.py][INFO] Epoch:[0/2](500/4588595) loss:6.283 lr:0.0001500 epoch_Time:35050.0min: [2024-01-02 17:43:37,236][model8_pretrain.py][INFO] Epoch:[0/2](500/4588595) loss:6.149 lr:0.0001500 epoch_Time:35055.0min: [2024-01-02 17:43:37,236][model8_pretrain.py][INFO] Epoch:[0/2](500/4588595) loss:6.319 lr:0.0001500 epoch_Time:35050.0min: [2024-01-02 17:44:14,144][model8_pretrain.py][INFO] Epoch:[0/2](600/4588595) loss:5.909 lr:0.0001800 epoch_Time:33911.0min: [2024-01-02 17:44:14,144][model8_pretrain.py][INFO] Epoch:[0/2](600/4588595) loss:6.013 lr:0.0001800 epoch_Time:33915.0min: [2024-01-02 17:44:14,144][model8_pretrain.py][INFO] Epoch:[0/2](600/4588595) loss:5.877 lr:0.0001800 epoch_Time:33917.0min: [2024-01-02 17:44:14,144][model8_pretrain.py][INFO] Epoch:[0/2](600/4588595) loss:6.153 lr:0.0001800 epoch_Time:33911.0min: [2024-01-02 17:44:14,144][model8_pretrain.py][INFO] Epoch:[0/2](600/4588595) loss:6.032 lr:0.0001800 epoch_Time:33913.0min: [2024-01-02 17:44:14,144][model8_pretrain.py][INFO] Epoch:[0/2](600/4588595) loss:5.923 lr:0.0001800 epoch_Time:33913.0min: [2024-01-02 17:44:14,144][model8_pretrain.py][INFO] Epoch:[0/2](600/4588595) loss:5.827 lr:0.0001800 epoch_Time:33913.0min: [2024-01-02 17:44:14,144][model8_pretrain.py][INFO] Epoch:[0/2](600/4588595) loss:5.634 lr:0.0001800 epoch_Time:33913.0min: [2024-01-02 
17:44:51,052][model8_pretrain.py][INFO] Epoch:[0/2](700/4588595) loss:6.019 lr:0.0002100 epoch_Time:33098.0min: [2024-01-02 17:44:51,052][model8_pretrain.py][INFO] Epoch:[0/2](700/4588595) loss:5.657 lr:0.0002100 epoch_Time:33100.0min: [2024-01-02 17:44:51,052][model8_pretrain.py][INFO] Epoch:[0/2](700/4588595) loss:5.606 lr:0.0002100 epoch_Time:33100.0min: [2024-01-02 17:44:51,052][model8_pretrain.py][INFO] Epoch:[0/2](700/4588595) loss:5.410 lr:0.0002100 epoch_Time:33098.0min: [2024-01-02 17:44:51,052][model8_pretrain.py][INFO] Epoch:[0/2](700/4588595) loss:5.606 lr:0.0002100 epoch_Time:33104.0min: [2024-01-02 17:44:51,052][model8_pretrain.py][INFO] Epoch:[0/2](700/4588595) loss:5.873 lr:0.0002100 epoch_Time:33102.0min: [2024-01-02 17:44:51,052][model8_pretrain.py][INFO] Epoch:[0/2](700/4588595) loss:5.719 lr:0.0002100 epoch_Time:33100.0min: [2024-01-02 17:44:51,052][model8_pretrain.py][INFO] Epoch:[0/2](700/4588595) loss:6.092 lr:0.0002100 epoch_Time:33100.0min: [2024-01-02 17:45:27,988][model8_pretrain.py][INFO] Epoch:[0/2](800/4588595) loss:5.251 lr:0.0002400 epoch_Time:32492.0min: [2024-01-02 17:45:27,988][model8_pretrain.py][INFO] Epoch:[0/2](800/4588595) loss:5.469 lr:0.0002400 epoch_Time:32494.0min: [2024-01-02 17:45:27,988][model8_pretrain.py][INFO] Epoch:[0/2](800/4588595) loss:5.254 lr:0.0002400 epoch_Time:32493.0min: [2024-01-02 17:45:27,988][model8_pretrain.py][INFO] Epoch:[0/2](800/4588595) loss:5.536 lr:0.0002400 epoch_Time:32494.0min: [2024-01-02 17:45:27,988][model8_pretrain.py][INFO] Epoch:[0/2](800/4588595) loss:5.269 lr:0.0002400 epoch_Time:32492.0min: [2024-01-02 17:45:27,988][model8_pretrain.py][INFO] Epoch:[0/2](800/4588595) loss:5.615 lr:0.0002400 epoch_Time:32495.0min: [2024-01-02 17:45:27,988][model8_pretrain.py][INFO] Epoch:[0/2](800/4588595) loss:5.228 lr:0.0002400 epoch_Time:32494.0min: [2024-01-02 17:45:27,992][model8_pretrain.py][INFO] Epoch:[0/2](800/4588595) loss:5.535 lr:0.0002400 epoch_Time:32497.0min: [2024-01-02 
17:46:04,951][model8_pretrain.py][INFO] Epoch:[0/2](900/4588595) loss:5.372 lr:0.0002700 epoch_Time:32022.0min: [2024-01-02 17:46:04,951][model8_pretrain.py][INFO] Epoch:[0/2](900/4588595) loss:5.445 lr:0.0002700 epoch_Time:32023.0min: [2024-01-02 17:46:04,951][model8_pretrain.py][INFO] Epoch:[0/2](900/4588595) loss:5.731 lr:0.0002700 epoch_Time:32023.0min: [2024-01-02 17:46:04,951][model8_pretrain.py][INFO] Epoch:[0/2](900/4588595) loss:5.999 lr:0.0002700 epoch_Time:32026.0min: [2024-01-02 17:46:04,951][model8_pretrain.py][INFO] Epoch:[0/2](900/4588595) loss:5.171 lr:0.0002700 epoch_Time:32023.0min: [2024-01-02 17:46:04,951][model8_pretrain.py][INFO] Epoch:[0/2](900/4588595) loss:5.595 lr:0.0002700 epoch_Time:32024.0min: [2024-01-02 17:46:04,951][model8_pretrain.py][INFO] Epoch:[0/2](900/4588595) loss:5.122 lr:0.0002700 epoch_Time:32023.0min: [2024-01-02 17:46:04,951][model8_pretrain.py][INFO] Epoch:[0/2](900/4588595) loss:5.550 lr:0.0002700 epoch_Time:32022.0min: [2024-01-02 17:46:41,889][model8_pretrain.py][INFO] Epoch:[0/2](1000/4588595) loss:5.277 lr:0.0003000 epoch_Time:31644.0min: [2024-01-02 17:46:41,889][model8_pretrain.py][INFO] Epoch:[0/2](1000/4588595) loss:5.118 lr:0.0003000 epoch_Time:31645.0min: [2024-01-02 17:46:41,889][model8_pretrain.py][INFO] Epoch:[0/2](1000/4588595) loss:5.034 lr:0.0003000 epoch_Time:31644.0min: [2024-01-02 17:46:41,889][model8_pretrain.py][INFO] Epoch:[0/2](1000/4588595) loss:5.279 lr:0.0003000 epoch_Time:31645.0min: [2024-01-02 17:46:41,889][model8_pretrain.py][INFO] Epoch:[0/2](1000/4588595) loss:5.184 lr:0.0003000 epoch_Time:31645.0min: [2024-01-02 17:46:41,889][model8_pretrain.py][INFO] Epoch:[0/2](1000/4588595) loss:5.347 lr:0.0003000 epoch_Time:31647.0min: [2024-01-02 17:46:41,889][model8_pretrain.py][INFO] Epoch:[0/2](1000/4588595) loss:5.359 lr:0.0003000 epoch_Time:31648.0min: [2024-01-02 17:46:41,889][model8_pretrain.py][INFO] Epoch:[0/2](1000/4588595) loss:4.929 lr:0.0003000 epoch_Time:31645.0min: [2024-01-02 
17:47:18,817][model8_pretrain.py][INFO] Epoch:[0/2](1100/4588595) loss:5.343 lr:0.0003000 epoch_Time:31333.0min: [2024-01-02 17:47:18,817][model8_pretrain.py][INFO] Epoch:[0/2](1100/4588595) loss:5.150 lr:0.0003000 epoch_Time:31334.0min: [2024-01-02 17:47:18,817][model8_pretrain.py][INFO] Epoch:[0/2](1100/4588595) loss:5.248 lr:0.0003000 epoch_Time:31334.0min: [2024-01-02 17:47:18,817][model8_pretrain.py][INFO] Epoch:[0/2](1100/4588595) loss:4.960 lr:0.0003000 epoch_Time:31337.0min: [2024-01-02 17:47:18,817][model8_pretrain.py][INFO] Epoch:[0/2](1100/4588595) loss:5.045 lr:0.0003000 epoch_Time:31336.0min: [2024-01-02 17:47:18,817][model8_pretrain.py][INFO] Epoch:[0/2](1100/4588595) loss:5.098 lr:0.0003000 epoch_Time:31335.0min: [2024-01-02 17:47:18,817][model8_pretrain.py][INFO] Epoch:[0/2](1100/4588595) loss:4.868 lr:0.0003000 epoch_Time:31335.0min: [2024-01-02 17:47:18,817][model8_pretrain.py][INFO] Epoch:[0/2](1100/4588595) loss:5.231 lr:0.0003000 epoch_Time:31335.0min: [2024-01-02 17:48:01,751][model8_pretrain.py][INFO] Epoch:[0/2](1200/4588595) loss:5.233 lr:0.0003000 epoch_Time:31457.0min: [2024-01-02 17:48:01,751][model8_pretrain.py][INFO] Epoch:[0/2](1200/4588595) loss:4.877 lr:0.0003000 epoch_Time:31457.0min: [2024-01-02 17:48:01,751][model8_pretrain.py][INFO] Epoch:[0/2](1200/4588595) loss:4.971 lr:0.0003000 epoch_Time:31459.0min: [2024-01-02 17:48:01,751][model8_pretrain.py][INFO] Epoch:[0/2](1200/4588595) loss:5.188 lr:0.0003000 epoch_Time:31458.0min: [2024-01-02 17:48:01,752][model8_pretrain.py][INFO] Epoch:[0/2](1200/4588595) loss:4.788 lr:0.0003000 epoch_Time:31458.0min: [2024-01-02 17:48:01,752][model8_pretrain.py][INFO] Epoch:[0/2](1200/4588595) loss:5.222 lr:0.0003000 epoch_Time:31458.0min: [2024-01-02 17:48:01,752][model8_pretrain.py][INFO] Epoch:[0/2](1200/4588595) loss:5.326 lr:0.0003000 epoch_Time:31458.0min: [2024-01-02 17:48:01,752][model8_pretrain.py][INFO] Epoch:[0/2](1200/4588595) loss:5.419 lr:0.0003000 epoch_Time:31460.0min: [2024-01-02 
17:48:38,659][model8_pretrain.py][INFO] Epoch:[0/2](1300/4588595) loss:4.898 lr:0.0003000 epoch_Time:31208.0min: [2024-01-02 17:48:38,659][model8_pretrain.py][INFO] Epoch:[0/2](1300/4588595) loss:4.894 lr:0.0003000 epoch_Time:31208.0min: [2024-01-02 17:48:38,659][model8_pretrain.py][INFO] Epoch:[0/2](1300/4588595) loss:4.979 lr:0.0003000 epoch_Time:31209.0min: [2024-01-02 17:48:38,659][model8_pretrain.py][INFO] Epoch:[0/2](1300/4588595) loss:4.862 lr:0.0003000 epoch_Time:31211.0min: [2024-01-02 17:48:38,659][model8_pretrain.py][INFO] Epoch:[0/2](1300/4588595) loss:5.247 lr:0.0003000 epoch_Time:31209.0min: [2024-01-02 17:48:38,659][model8_pretrain.py][INFO] Epoch:[0/2](1300/4588595) loss:4.664 lr:0.0003000 epoch_Time:31209.0min: [2024-01-02 17:48:38,659][model8_pretrain.py][INFO] Epoch:[0/2](1300/4588595) loss:5.091 lr:0.0003000 epoch_Time:31210.0min: [2024-01-02 17:48:38,659][model8_pretrain.py][INFO] Epoch:[0/2](1300/4588595) loss:4.737 lr:0.0003000 epoch_Time:31209.0min: [2024-01-02 17:49:15,575][model8_pretrain.py][INFO] Epoch:[0/2](1400/4588595) loss:4.742 lr:0.0003000 epoch_Time:30995.0min: [2024-01-02 17:49:15,575][model8_pretrain.py][INFO] Epoch:[0/2](1400/4588595) loss:4.526 lr:0.0003000 epoch_Time:30994.0min: [2024-01-02 17:49:15,575][model8_pretrain.py][INFO] Epoch:[0/2](1400/4588595) loss:5.117 lr:0.0003000 epoch_Time:30995.0min: [2024-01-02 17:49:15,575][model8_pretrain.py][INFO] Epoch:[0/2](1400/4588595) loss:4.708 lr:0.0003000 epoch_Time:30995.0min: [2024-01-02 17:49:15,575][model8_pretrain.py][INFO] Epoch:[0/2](1400/4588595) loss:4.795 lr:0.0003000 epoch_Time:30996.0min: [2024-01-02 17:49:15,575][model8_pretrain.py][INFO] Epoch:[0/2](1400/4588595) loss:5.003 lr:0.0003000 epoch_Time:30996.0min: [2024-01-02 17:49:15,576][model8_pretrain.py][INFO] Epoch:[0/2](1400/4588595) loss:5.020 lr:0.0003000 epoch_Time:30994.0min: [2024-01-02 17:49:15,576][model8_pretrain.py][INFO] Epoch:[0/2](1400/4588595) loss:4.594 lr:0.0003000 epoch_Time:30995.0min: [2024-01-02 
17:49:52,497][model8_pretrain.py][INFO] Epoch:[0/2](1500/4588595) loss:5.045 lr:0.0003000 epoch_Time:30809.0min: [2024-01-02 17:49:52,497][model8_pretrain.py][INFO] Epoch:[0/2](1500/4588595) loss:5.140 lr:0.0003000 epoch_Time:30809.0min: [2024-01-02 17:49:52,497][model8_pretrain.py][INFO] Epoch:[0/2](1500/4588595) loss:4.964 lr:0.0003000 epoch_Time:30809.0min: [2024-01-02 17:49:52,497][model8_pretrain.py][INFO] Epoch:[0/2](1500/4588595) loss:4.750 lr:0.0003000 epoch_Time:30811.0min: [2024-01-02 17:49:52,497][model8_pretrain.py][INFO] Epoch:[0/2](1500/4588595) loss:4.741 lr:0.0003000 epoch_Time:30809.0min: [2024-01-02 17:49:52,497][model8_pretrain.py][INFO] Epoch:[0/2](1500/4588595) loss:4.763 lr:0.0003000 epoch_Time:30810.0min: [2024-01-02 17:49:52,497][model8_pretrain.py][INFO] Epoch:[0/2](1500/4588595) loss:4.329 lr:0.0003000 epoch_Time:30809.0min: [2024-01-02 17:49:52,497][model8_pretrain.py][INFO] Epoch:[0/2](1500/4588595) loss:4.972 lr:0.0003000 epoch_Time:30809.0min: [2024-01-02 17:50:29,435][model8_pretrain.py][INFO] Epoch:[0/2](1600/4588595) loss:4.798 lr:0.0003000 epoch_Time:30648.0min: [2024-01-02 17:50:29,435][model8_pretrain.py][INFO] Epoch:[0/2](1600/4588595) loss:4.808 lr:0.0003000 epoch_Time:30649.0min: [2024-01-02 17:50:29,435][model8_pretrain.py][INFO] Epoch:[0/2](1600/4588595) loss:4.498 lr:0.0003000 epoch_Time:30650.0min: [2024-01-02 17:50:29,435][model8_pretrain.py][INFO] Epoch:[0/2](1600/4588595) loss:4.073 lr:0.0003000 epoch_Time:30649.0min: [2024-01-02 17:50:29,435][model8_pretrain.py][INFO] Epoch:[0/2](1600/4588595) loss:4.625 lr:0.0003000 epoch_Time:30650.0min: [2024-01-02 17:50:29,435][model8_pretrain.py][INFO] Epoch:[0/2](1600/4588595) loss:4.473 lr:0.0003000 epoch_Time:30648.0min: [2024-01-02 17:50:29,435][model8_pretrain.py][INFO] Epoch:[0/2](1600/4588595) loss:4.966 lr:0.0003000 epoch_Time:30649.0min: [2024-01-02 17:50:29,435][model8_pretrain.py][INFO] Epoch:[0/2](1600/4588595) loss:4.826 lr:0.0003000 epoch_Time:30649.0min: [2024-01-02 
17:51:06,374][model8_pretrain.py][INFO] Epoch:[0/2](1700/4588595) loss:4.383 lr:0.0002999 epoch_Time:30506.0min: [2024-01-02 17:51:06,374][model8_pretrain.py][INFO] Epoch:[0/2](1700/4588595) loss:4.672 lr:0.0002999 epoch_Time:30505.0min: [2024-01-02 17:51:06,375][model8_pretrain.py][INFO] Epoch:[0/2](1700/4588595) loss:4.314 lr:0.0002999 epoch_Time:30507.0min: [2024-01-02 17:51:06,375][model8_pretrain.py][INFO] Epoch:[0/2](1700/4588595) loss:4.578 lr:0.0002999 epoch_Time:30506.0min: [2024-01-02 17:51:06,375][model8_pretrain.py][INFO] Epoch:[0/2](1700/4588595) loss:4.431 lr:0.0002999 epoch_Time:30508.0min: [2024-01-02 17:51:06,374][model8_pretrain.py][INFO] Epoch:[0/2](1700/4588595) loss:4.799 lr:0.0002999 epoch_Time:30505.0min: [2024-01-02 17:51:06,375][model8_pretrain.py][INFO] Epoch:[0/2](1700/4588595) loss:4.558 lr:0.0002999 epoch_Time:30506.0min: [2024-01-02 17:51:06,375][model8_pretrain.py][INFO] Epoch:[0/2](1700/4588595) loss:4.574 lr:0.0002999 epoch_Time:30506.0min: [2024-01-02 17:51:43,291][model8_pretrain.py][INFO] Epoch:[0/2](1800/4588595) loss:4.637 lr:0.0002999 epoch_Time:30379.0min: [2024-01-02 17:51:43,291][model8_pretrain.py][INFO] Epoch:[0/2](1800/4588595) loss:4.934 lr:0.0002999 epoch_Time:30380.0min: [2024-01-02 17:51:43,291][model8_pretrain.py][INFO] Epoch:[0/2](1800/4588595) loss:4.907 lr:0.0002999 epoch_Time:30379.0min: [2024-01-02 17:51:43,291][model8_pretrain.py][INFO] Epoch:[0/2](1800/4588595) loss:4.340 lr:0.0002999 epoch_Time:30379.0min: [2024-01-02 17:51:43,291][model8_pretrain.py][INFO] Epoch:[0/2](1800/4588595) loss:4.838 lr:0.0002999 epoch_Time:30381.0min: [2024-01-02 17:51:43,291][model8_pretrain.py][INFO] Epoch:[0/2](1800/4588595) loss:4.712 lr:0.0002999 epoch_Time:30379.0min: [2024-01-02 17:51:43,291][model8_pretrain.py][INFO] Epoch:[0/2](1800/4588595) loss:4.162 lr:0.0002999 epoch_Time:30379.0min: [2024-01-02 17:51:43,291][model8_pretrain.py][INFO] Epoch:[0/2](1800/4588595) loss:4.411 lr:0.0002999 epoch_Time:30379.0min: [2024-01-02 
17:52:20,213][model8_pretrain.py][INFO] Epoch:[0/2](1900/4588595) loss:4.522 lr:0.0002999 epoch_Time:30265.0min: [2024-01-02 17:52:20,213][model8_pretrain.py][INFO] Epoch:[0/2](1900/4588595) loss:4.627 lr:0.0002999 epoch_Time:30266.0min: [2024-01-02 17:52:20,213][model8_pretrain.py][INFO] Epoch:[0/2](1900/4588595) loss:4.601 lr:0.0002999 epoch_Time:30265.0min: [2024-01-02 17:52:20,213][model8_pretrain.py][INFO] Epoch:[0/2](1900/4588595) loss:4.348 lr:0.0002999 epoch_Time:30264.0min: [2024-01-02 17:52:20,213][model8_pretrain.py][INFO] Epoch:[0/2](1900/4588595) loss:4.338 lr:0.0002999 epoch_Time:30264.0min: [2024-01-02 17:52:20,213][model8_pretrain.py][INFO] Epoch:[0/2](1900/4588595) loss:4.453 lr:0.0002999 epoch_Time:30266.0min: [2024-01-02 17:52:20,213][model8_pretrain.py][INFO] Epoch:[0/2](1900/4588595) loss:4.691 lr:0.0002999 epoch_Time:30265.0min: [2024-01-02 17:52:20,213][model8_pretrain.py][INFO] Epoch:[0/2](1900/4588595) loss:4.731 lr:0.0002999 epoch_Time:30265.0min: [2024-01-02 17:53:03,964][model8_pretrain.py][INFO] Epoch:[0/2](2000/4588595) loss:4.637 lr:0.0002999 epoch_Time:30422.0min: [2024-01-02 17:53:03,964][model8_pretrain.py][INFO] Epoch:[0/2](2000/4588595) loss:4.691 lr:0.0002999 epoch_Time:30424.0min: [2024-01-02 17:53:03,964][model8_pretrain.py][INFO] Epoch:[0/2](2000/4588595) loss:4.873 lr:0.0002999 epoch_Time:30423.0min: [2024-01-02 17:53:03,964][model8_pretrain.py][INFO] Epoch:[0/2](2000/4588595) loss:4.789 lr:0.0002999 epoch_Time:30424.0min: [2024-01-02 17:53:03,964][model8_pretrain.py][INFO] Epoch:[0/2](2000/4588595) loss:4.570 lr:0.0002999 epoch_Time:30423.0min: [2024-01-02 17:53:03,964][model8_pretrain.py][INFO] Epoch:[0/2](2000/4588595) loss:4.739 lr:0.0002999 epoch_Time:30423.0min: [2024-01-02 17:53:03,964][model8_pretrain.py][INFO] Epoch:[0/2](2000/4588595) loss:4.036 lr:0.0002999 epoch_Time:30422.0min: [2024-01-02 17:53:03,964][model8_pretrain.py][INFO] Epoch:[0/2](2000/4588595) loss:4.885 lr:0.0002999 epoch_Time:30423.0min: [2024-01-02 
17:53:40,880][model8_pretrain.py][INFO] Epoch:[0/2](2100/4588595) loss:4.311 lr:0.0002999 epoch_Time:30318.0min: [2024-01-02 17:53:40,880][model8_pretrain.py][INFO] Epoch:[0/2](2100/4588595) loss:4.478 lr:0.0002999 epoch_Time:30317.0min: [2024-01-02 17:53:40,880][model8_pretrain.py][INFO] Epoch:[0/2](2100/4588595) loss:4.100 lr:0.0002999 epoch_Time:30318.0min: [2024-01-02 17:53:40,880][model8_pretrain.py][INFO] Epoch:[0/2](2100/4588595) loss:4.535 lr:0.0002999 epoch_Time:30318.0min: [2024-01-02 17:53:40,880][model8_pretrain.py][INFO] Epoch:[0/2](2100/4588595) loss:4.445 lr:0.0002999 epoch_Time:30319.0min: [2024-01-02 17:53:40,880][model8_pretrain.py][INFO] Epoch:[0/2](2100/4588595) loss:4.298 lr:0.0002999 epoch_Time:30318.0min: [2024-01-02 17:53:40,880][model8_pretrain.py][INFO] Epoch:[0/2](2100/4588595) loss:4.357 lr:0.0002999 epoch_Time:30319.0min: [2024-01-02 17:53:40,880][model8_pretrain.py][INFO] Epoch:[0/2](2100/4588595) loss:4.564 lr:0.0002999 epoch_Time:30317.0min: [2024-01-02 17:54:17,798][model8_pretrain.py][INFO] Epoch:[0/2](2200/4588595) loss:4.378 lr:0.0002998 epoch_Time:30221.0min: [2024-01-02 17:54:17,798][model8_pretrain.py][INFO] Epoch:[0/2](2200/4588595) loss:4.359 lr:0.0002998 epoch_Time:30221.0min: [2024-01-02 17:54:17,798][model8_pretrain.py][INFO] Epoch:[0/2](2200/4588595) loss:4.522 lr:0.0002998 epoch_Time:30222.0min: [2024-01-02 17:54:17,798][model8_pretrain.py][INFO] Epoch:[0/2](2200/4588595) loss:4.459 lr:0.0002998 epoch_Time:30223.0min: [2024-01-02 17:54:17,798][model8_pretrain.py][INFO] Epoch:[0/2](2200/4588595) loss:3.819 lr:0.0002998 epoch_Time:30222.0min: [2024-01-02 17:54:17,798][model8_pretrain.py][INFO] Epoch:[0/2](2200/4588595) loss:4.431 lr:0.0002998 epoch_Time:30222.0min: [2024-01-02 17:54:17,798][model8_pretrain.py][INFO] Epoch:[0/2](2200/4588595) loss:4.566 lr:0.0002998 epoch_Time:30222.0min: [2024-01-02 17:54:17,799][model8_pretrain.py][INFO] Epoch:[0/2](2200/4588595) loss:4.500 lr:0.0002998 epoch_Time:30222.0min: [2024-01-02 
17:54:54,717][model8_pretrain.py][INFO] Epoch:[0/2](2300/4588595) loss:4.922 lr:0.0002998 epoch_Time:30133.0min: [2024-01-02 17:54:54,717][model8_pretrain.py][INFO] Epoch:[0/2](2300/4588595) loss:4.481 lr:0.0002998 epoch_Time:30133.0min: [2024-01-02 17:54:54,717][model8_pretrain.py][INFO] Epoch:[0/2](2300/4588595) loss:3.888 lr:0.0002998 epoch_Time:30134.0min: [2024-01-02 17:54:54,717][model8_pretrain.py][INFO] Epoch:[0/2](2300/4588595) loss:4.337 lr:0.0002998 epoch_Time:30134.0min: [2024-01-02 17:54:54,717][model8_pretrain.py][INFO] Epoch:[0/2](2300/4588595) loss:3.912 lr:0.0002998 epoch_Time:30135.0min: [2024-01-02 17:54:54,717][model8_pretrain.py][INFO] Epoch:[0/2](2300/4588595) loss:4.213 lr:0.0002998 epoch_Time:30134.0min: [2024-01-02 17:54:54,717][model8_pretrain.py][INFO] Epoch:[0/2](2300/4588595) loss:4.301 lr:0.0002998 epoch_Time:30134.0min: [2024-01-02 17:54:54,719][model8_pretrain.py][INFO] Epoch:[0/2](2300/4588595) loss:4.133 lr:0.0002998 epoch_Time:30134.0min: [2024-01-02 17:55:31,654][model8_pretrain.py][INFO] Epoch:[0/2](2400/4588595) loss:4.765 lr:0.0002998 epoch_Time:30054.0min: [2024-01-02 17:55:31,654][model8_pretrain.py][INFO] Epoch:[0/2](2400/4588595) loss:4.332 lr:0.0002998 epoch_Time:30055.0min: [2024-01-02 17:55:31,654][model8_pretrain.py][INFO] Epoch:[0/2](2400/4588595) loss:4.119 lr:0.0002998 epoch_Time:30055.0min: [2024-01-02 17:55:31,654][model8_pretrain.py][INFO] Epoch:[0/2](2400/4588595) loss:4.404 lr:0.0002998 epoch_Time:30055.0min: [2024-01-02 17:55:31,654][model8_pretrain.py][INFO] Epoch:[0/2](2400/4588595) loss:4.079 lr:0.0002998 epoch_Time:30056.0min: [2024-01-02 17:55:31,654][model8_pretrain.py][INFO] Epoch:[0/2](2400/4588595) loss:4.094 lr:0.0002998 epoch_Time:30055.0min: [2024-01-02 17:55:31,654][model8_pretrain.py][INFO] Epoch:[0/2](2400/4588595) loss:3.934 lr:0.0002998 epoch_Time:30054.0min: [2024-01-02 17:55:31,655][model8_pretrain.py][INFO] Epoch:[0/2](2400/4588595) loss:4.675 lr:0.0002998 epoch_Time:30054.0min: [2024-01-02 
17:56:08,574][model8_pretrain.py][INFO] Epoch:[0/2](2500/4588595) loss:3.712 lr:0.0002997 epoch_Time:29980.0min: [2024-01-02 17:56:08,574][model8_pretrain.py][INFO] Epoch:[0/2](2500/4588595) loss:4.186 lr:0.0002997 epoch_Time:29980.0min: [2024-01-02 17:56:08,574][model8_pretrain.py][INFO] Epoch:[0/2](2500/4588595) loss:4.460 lr:0.0002997 epoch_Time:29980.0min: [2024-01-02 17:56:08,575][model8_pretrain.py][INFO] Epoch:[0/2](2500/4588595) loss:4.348 lr:0.0002997 epoch_Time:29980.0min: [2024-01-02 17:56:08,575][model8_pretrain.py][INFO] Epoch:[0/2](2500/4588595) loss:4.201 lr:0.0002997 epoch_Time:29980.0min: [2024-01-02 17:56:08,576][model8_pretrain.py][INFO] Epoch:[0/2](2500/4588595) loss:4.542 lr:0.0002997 epoch_Time:29980.0min: [2024-01-02 17:56:08,576][model8_pretrain.py][INFO] Epoch:[0/2](2500/4588595) loss:3.868 lr:0.0002997 epoch_Time:29981.0min: [2024-01-02 17:56:08,576][model8_pretrain.py][INFO] Epoch:[0/2](2500/4588595) loss:4.830 lr:0.0002997 epoch_Time:29981.0min: [2024-01-02 17:56:45,495][model8_pretrain.py][INFO] Epoch:[0/2](2600/4588595) loss:5.182 lr:0.0002997 epoch_Time:29912.0min: [2024-01-02 17:56:45,495][model8_pretrain.py][INFO] Epoch:[0/2](2600/4588595) loss:4.663 lr:0.0002997 epoch_Time:29912.0min: [2024-01-02 17:56:45,495][model8_pretrain.py][INFO] Epoch:[0/2](2600/4588595) loss:4.527 lr:0.0002997 epoch_Time:29913.0min: [2024-01-02 17:56:45,495][model8_pretrain.py][INFO] Epoch:[0/2](2600/4588595) loss:4.432 lr:0.0002997 epoch_Time:29912.0min: [2024-01-02 17:56:45,495][model8_pretrain.py][INFO] Epoch:[0/2](2600/4588595) loss:4.070 lr:0.0002997 epoch_Time:29912.0min: [2024-01-02 17:56:45,496][model8_pretrain.py][INFO] Epoch:[0/2](2600/4588595) loss:4.143 lr:0.0002997 epoch_Time:29912.0min: [2024-01-02 17:56:45,496][model8_pretrain.py][INFO] Epoch:[0/2](2600/4588595) loss:4.054 lr:0.0002997 epoch_Time:29913.0min: [2024-01-02 17:56:45,496][model8_pretrain.py][INFO] Epoch:[0/2](2600/4588595) loss:4.112 lr:0.0002997 epoch_Time:29912.0min: [2024-01-02 
17:57:22,433][model8_pretrain.py][INFO] Epoch:[0/2](2700/4588595) loss:4.302 lr:0.0002997 epoch_Time:29849.0min: [2024-01-02 17:57:22,433][model8_pretrain.py][INFO] Epoch:[0/2](2700/4588595) loss:4.582 lr:0.0002997 epoch_Time:29849.0min: [2024-01-02 17:57:22,433][model8_pretrain.py][INFO] Epoch:[0/2](2700/4588595) loss:4.071 lr:0.0002997 epoch_Time:29850.0min: [2024-01-02 17:57:22,433][model8_pretrain.py][INFO] Epoch:[0/2](2700/4588595) loss:4.197 lr:0.0002997 epoch_Time:29849.0min: [2024-01-02 17:57:22,433][model8_pretrain.py][INFO] Epoch:[0/2](2700/4588595) loss:4.527 lr:0.0002997 epoch_Time:29849.0min: [2024-01-02 17:57:22,433][model8_pretrain.py][INFO] Epoch:[0/2](2700/4588595) loss:4.116 lr:0.0002997 epoch_Time:29850.0min: [2024-01-02 17:57:22,433][model8_pretrain.py][INFO] Epoch:[0/2](2700/4588595) loss:3.884 lr:0.0002997 epoch_Time:29849.0min: [2024-01-02 17:57:22,434][model8_pretrain.py][INFO] Epoch:[0/2](2700/4588595) loss:4.187 lr:0.0002997 epoch_Time:29849.0min: [2024-01-02 17:58:03,009][model8_pretrain.py][INFO] Epoch:[0/2](2800/4588595) loss:4.612 lr:0.0002996 epoch_Time:29889.0min: [2024-01-02 17:58:03,009][model8_pretrain.py][INFO] Epoch:[0/2](2800/4588595) loss:4.330 lr:0.0002996 epoch_Time:29890.0min: [2024-01-02 17:58:03,009][model8_pretrain.py][INFO] Epoch:[0/2](2800/4588595) loss:4.169 lr:0.0002996 epoch_Time:29891.0min: [2024-01-02 17:58:03,009][model8_pretrain.py][INFO] Epoch:[0/2](2800/4588595) loss:4.226 lr:0.0002996 epoch_Time:29890.0min: [2024-01-02 17:58:03,009][model8_pretrain.py][INFO] Epoch:[0/2](2800/4588595) loss:4.245 lr:0.0002996 epoch_Time:29890.0min: [2024-01-02 17:58:03,009][model8_pretrain.py][INFO] Epoch:[0/2](2800/4588595) loss:3.950 lr:0.0002996 epoch_Time:29889.0min: [2024-01-02 17:58:03,009][model8_pretrain.py][INFO] Epoch:[0/2](2800/4588595) loss:4.310 lr:0.0002996 epoch_Time:29890.0min: [2024-01-02 17:58:03,009][model8_pretrain.py][INFO] Epoch:[0/2](2800/4588595) loss:4.307 lr:0.0002996 epoch_Time:29890.0min: [2024-01-02 
17:58:39,961][model8_pretrain.py][INFO] Epoch:[0/2](2900/4588595) loss:4.247 lr:0.0002996 epoch_Time:29833.0min: [2024-01-02 17:58:39,961][model8_pretrain.py][INFO] Epoch:[0/2](2900/4588595) loss:4.161 lr:0.0002996 epoch_Time:29833.0min: [2024-01-02 17:58:39,961][model8_pretrain.py][INFO] Epoch:[0/2](2900/4588595) loss:4.164 lr:0.0002996 epoch_Time:29833.0min: [2024-01-02 17:58:39,961][model8_pretrain.py][INFO] Epoch:[0/2](2900/4588595) loss:3.785 lr:0.0002996 epoch_Time:29833.0min: [2024-01-02 17:58:39,961][model8_pretrain.py][INFO] Epoch:[0/2](2900/4588595) loss:3.842 lr:0.0002996 epoch_Time:29833.0min: [2024-01-02 17:58:39,961][model8_pretrain.py][INFO] Epoch:[0/2](2900/4588595) loss:4.036 lr:0.0002996 epoch_Time:29833.0min: [2024-01-02 17:58:39,961][model8_pretrain.py][INFO] Epoch:[0/2](2900/4588595) loss:3.985 lr:0.0002996 epoch_Time:29834.0min: [2024-01-02 17:58:39,961][model8_pretrain.py][INFO] Epoch:[0/2](2900/4588595) loss:4.182 lr:0.0002996 epoch_Time:29833.0min: [2024-01-02 17:59:16,922][model8_pretrain.py][INFO] Epoch:[0/2](3000/4588595) loss:4.230 lr:0.0002995 epoch_Time:29779.0min: [2024-01-02 17:59:16,922][model8_pretrain.py][INFO] Epoch:[0/2](3000/4588595) loss:4.475 lr:0.0002995 epoch_Time:29779.0min: [2024-01-02 17:59:16,922][model8_pretrain.py][INFO] Epoch:[0/2](3000/4588595) loss:3.955 lr:0.0002995 epoch_Time:29779.0min: [2024-01-02 17:59:16,922][model8_pretrain.py][INFO] Epoch:[0/2](3000/4588595) loss:3.735 lr:0.0002995 epoch_Time:29780.0min: [2024-01-02 17:59:16,922][model8_pretrain.py][INFO] Epoch:[0/2](3000/4588595) loss:4.096 lr:0.0002995 epoch_Time:29780.0min: [2024-01-02 17:59:16,922][model8_pretrain.py][INFO] Epoch:[0/2](3000/4588595) loss:4.366 lr:0.0002995 epoch_Time:29779.0min: [2024-01-02 17:59:16,922][model8_pretrain.py][INFO] Epoch:[0/2](3000/4588595) loss:4.175 lr:0.0002995 epoch_Time:29779.0min: [2024-01-02 17:59:16,922][model8_pretrain.py][INFO] Epoch:[0/2](3000/4588595) loss:3.893 lr:0.0002995 epoch_Time:29779.0min: [2024-01-02 
17:59:53,865][model8_pretrain.py][INFO] Epoch:[0/2](3100/4588595) loss:4.105 lr:0.0002995 epoch_Time:29728.0min: [2024-01-02 17:59:53,865][model8_pretrain.py][INFO] Epoch:[0/2](3100/4588595) loss:4.174 lr:0.0002995 epoch_Time:29729.0min: [2024-01-02 17:59:53,865][model8_pretrain.py][INFO] Epoch:[0/2](3100/4588595) loss:3.920 lr:0.0002995 epoch_Time:29728.0min: [2024-01-02 17:59:53,865][model8_pretrain.py][INFO] Epoch:[0/2](3100/4588595) loss:4.207 lr:0.0002995 epoch_Time:29728.0min: [2024-01-02 17:59:53,865][model8_pretrain.py][INFO] Epoch:[0/2](3100/4588595) loss:4.414 lr:0.0002995 epoch_Time:29728.0min: [2024-01-02 17:59:53,866][model8_pretrain.py][INFO] Epoch:[0/2](3100/4588595) loss:4.271 lr:0.0002995 epoch_Time:29728.0min: [2024-01-02 17:59:53,865][model8_pretrain.py][INFO] Epoch:[0/2](3100/4588595) loss:4.222 lr:0.0002995 epoch_Time:29729.0min: [2024-01-02 17:59:53,866][model8_pretrain.py][INFO] Epoch:[0/2](3100/4588595) loss:4.220 lr:0.0002995 epoch_Time:29728.0min: [2024-01-02 18:00:30,801][model8_pretrain.py][INFO] Epoch:[0/2](3200/4588595) loss:4.481 lr:0.0002994 epoch_Time:29681.0min: [2024-01-02 18:00:30,801][model8_pretrain.py][INFO] Epoch:[0/2](3200/4588595) loss:3.560 lr:0.0002994 epoch_Time:29681.0min: [2024-01-02 18:00:30,801][model8_pretrain.py][INFO] Epoch:[0/2](3200/4588595) loss:3.977 lr:0.0002994 epoch_Time:29682.0min: [2024-01-02 18:00:30,801][model8_pretrain.py][INFO] Epoch:[0/2](3200/4588595) loss:4.355 lr:0.0002994 epoch_Time:29681.0min: [2024-01-02 18:00:30,801][model8_pretrain.py][INFO] Epoch:[0/2](3200/4588595) loss:3.991 lr:0.0002994 epoch_Time:29681.0min: [2024-01-02 18:00:30,801][model8_pretrain.py][INFO] Epoch:[0/2](3200/4588595) loss:4.432 lr:0.0002994 epoch_Time:29681.0min: [2024-01-02 18:00:30,801][model8_pretrain.py][INFO] Epoch:[0/2](3200/4588595) loss:4.112 lr:0.0002994 epoch_Time:29682.0min: [2024-01-02 18:00:30,801][model8_pretrain.py][INFO] Epoch:[0/2](3200/4588595) loss:4.592 lr:0.0002994 epoch_Time:29681.0min: [2024-01-02 
18:01:07,751][model8_pretrain.py][INFO] Epoch:[0/2](3300/4588595) loss:3.988 lr:0.0002994 epoch_Time:29636.0min: [2024-01-02 18:01:07,751][model8_pretrain.py][INFO] Epoch:[0/2](3300/4588595) loss:4.269 lr:0.0002994 epoch_Time:29637.0min: [2024-01-02 18:01:07,751][model8_pretrain.py][INFO] Epoch:[0/2](3300/4588595) loss:4.257 lr:0.0002994 epoch_Time:29636.0min: [2024-01-02 18:01:07,751][model8_pretrain.py][INFO] Epoch:[0/2](3300/4588595) loss:4.261 lr:0.0002994 epoch_Time:29637.0min: [2024-01-02 18:01:07,751][model8_pretrain.py][INFO] Epoch:[0/2](3300/4588595) loss:4.318 lr:0.0002994 epoch_Time:29637.0min: [2024-01-02 18:01:07,751][model8_pretrain.py][INFO] Epoch:[0/2](3300/4588595) loss:4.489 lr:0.0002994 epoch_Time:29637.0min: [2024-01-02 18:01:07,751][model8_pretrain.py][INFO] Epoch:[0/2](3300/4588595) loss:4.242 lr:0.0002994 epoch_Time:29637.0min: [2024-01-02 18:01:07,751][model8_pretrain.py][INFO] Epoch:[0/2](3300/4588595) loss:4.238 lr:0.0002994 epoch_Time:29637.0min: [2024-01-02 18:01:44,687][model8_pretrain.py][INFO] Epoch:[0/2](3400/4588595) loss:4.158 lr:0.0002993 epoch_Time:29595.0min: [2024-01-02 18:01:44,687][model8_pretrain.py][INFO] Epoch:[0/2](3400/4588595) loss:4.759 lr:0.0002993 epoch_Time:29595.0min: [2024-01-02 18:01:44,687][model8_pretrain.py][INFO] Epoch:[0/2](3400/4588595) loss:4.349 lr:0.0002993 epoch_Time:29596.0min: [2024-01-02 18:01:44,687][model8_pretrain.py][INFO] Epoch:[0/2](3400/4588595) loss:4.051 lr:0.0002993 epoch_Time:29595.0min: [2024-01-02 18:01:44,688][model8_pretrain.py][INFO] Epoch:[0/2](3400/4588595) loss:3.925 lr:0.0002993 epoch_Time:29595.0min: [2024-01-02 18:01:44,688][model8_pretrain.py][INFO] Epoch:[0/2](3400/4588595) loss:4.221 lr:0.0002993 epoch_Time:29596.0min: [2024-01-02 18:01:44,688][model8_pretrain.py][INFO] Epoch:[0/2](3400/4588595) loss:4.140 lr:0.0002993 epoch_Time:29595.0min: [2024-01-02 18:01:44,688][model8_pretrain.py][INFO] Epoch:[0/2](3400/4588595) loss:4.084 lr:0.0002993 epoch_Time:29595.0min: [2024-01-02 
18:02:21,660][model8_pretrain.py][INFO] Epoch:[0/2](3500/4588595) loss:4.264 lr:0.0002993 epoch_Time:29556.0min: [2024-01-02 18:02:21,660][model8_pretrain.py][INFO] Epoch:[0/2](3500/4588595) loss:4.239 lr:0.0002993 epoch_Time:29556.0min: [2024-01-02 18:02:21,660][model8_pretrain.py][INFO] Epoch:[0/2](3500/4588595) loss:4.062 lr:0.0002993 epoch_Time:29557.0min: [2024-01-02 18:02:21,660][model8_pretrain.py][INFO] Epoch:[0/2](3500/4588595) loss:3.977 lr:0.0002993 epoch_Time:29556.0min: [2024-01-02 18:02:21,660][model8_pretrain.py][INFO] Epoch:[0/2](3500/4588595) loss:3.715 lr:0.0002993 epoch_Time:29556.0min: [2024-01-02 18:02:21,660][model8_pretrain.py][INFO] Epoch:[0/2](3500/4588595) loss:4.431 lr:0.0002993 epoch_Time:29556.0min: [2024-01-02 18:02:21,660][model8_pretrain.py][INFO] Epoch:[0/2](3500/4588595) loss:4.191 lr:0.0002993 epoch_Time:29556.0min: [2024-01-02 18:02:21,660][model8_pretrain.py][INFO] Epoch:[0/2](3500/4588595) loss:3.782 lr:0.0002993 epoch_Time:29556.0min: [2024-01-02 18:03:02,231][model8_pretrain.py][INFO] Epoch:[0/2](3600/4588595) loss:4.352 lr:0.0002992 epoch_Time:29595.0min: [2024-01-02 18:03:02,231][model8_pretrain.py][INFO] Epoch:[0/2](3600/4588595) loss:4.441 lr:0.0002992 epoch_Time:29595.0min: [2024-01-02 18:03:02,231][model8_pretrain.py][INFO] Epoch:[0/2](3600/4588595) loss:4.025 lr:0.0002992 epoch_Time:29595.0min: [2024-01-02 18:03:02,231][model8_pretrain.py][INFO] Epoch:[0/2](3600/4588595) loss:4.014 lr:0.0002992 epoch_Time:29595.0min: [2024-01-02 18:03:02,231][model8_pretrain.py][INFO] Epoch:[0/2](3600/4588595) loss:4.568 lr:0.0002992 epoch_Time:29595.0min: [2024-01-02 18:03:02,231][model8_pretrain.py][INFO] Epoch:[0/2](3600/4588595) loss:4.035 lr:0.0002992 epoch_Time:29596.0min: [2024-01-02 18:03:02,231][model8_pretrain.py][INFO] Epoch:[0/2](3600/4588595) loss:3.877 lr:0.0002992 epoch_Time:29595.0min: [2024-01-02 18:03:02,231][model8_pretrain.py][INFO] Epoch:[0/2](3600/4588595) loss:3.624 lr:0.0002992 epoch_Time:29595.0min: [2024-01-02 
18:03:39,151][model8_pretrain.py][INFO] Epoch:[0/2](3700/4588595) loss:4.364 lr:0.0002992 epoch_Time:29557.0min: [2024-01-02 18:03:39,151][model8_pretrain.py][INFO] Epoch:[0/2](3700/4588595) loss:4.217 lr:0.0002992 epoch_Time:29558.0min: [2024-01-02 18:03:39,151][model8_pretrain.py][INFO] Epoch:[0/2](3700/4588595) loss:4.208 lr:0.0002992 epoch_Time:29558.0min: [2024-01-02 18:03:39,151][model8_pretrain.py][INFO] Epoch:[0/2](3700/4588595) loss:3.706 lr:0.0002992 epoch_Time:29558.0min: [2024-01-02 18:03:39,152][model8_pretrain.py][INFO] Epoch:[0/2](3700/4588595) loss:4.307 lr:0.0002992 epoch_Time:29558.0min: [2024-01-02 18:03:39,152][model8_pretrain.py][INFO] Epoch:[0/2](3700/4588595) loss:4.187 lr:0.0002992 epoch_Time:29557.0min: [2024-01-02 18:03:39,152][model8_pretrain.py][INFO] Epoch:[0/2](3700/4588595) loss:4.251 lr:0.0002992 epoch_Time:29558.0min: [2024-01-02 18:03:39,152][model8_pretrain.py][INFO] Epoch:[0/2](3700/4588595) loss:3.621 lr:0.0002992 epoch_Time:29558.0min: [2024-01-02 18:04:16,074][model8_pretrain.py][INFO] Epoch:[0/2](3800/4588595) loss:4.212 lr:0.0002991 epoch_Time:29521.0min: [2024-01-02 18:04:16,074][model8_pretrain.py][INFO] Epoch:[0/2](3800/4588595) loss:4.264 lr:0.0002991 epoch_Time:29522.0min: [2024-01-02 18:04:16,074][model8_pretrain.py][INFO] Epoch:[0/2](3800/4588595) loss:4.002 lr:0.0002991 epoch_Time:29521.0min: [2024-01-02 18:04:16,074][model8_pretrain.py][INFO] Epoch:[0/2](3800/4588595) loss:4.032 lr:0.0002991 epoch_Time:29522.0min: [2024-01-02 18:04:16,074][model8_pretrain.py][INFO] Epoch:[0/2](3800/4588595) loss:4.064 lr:0.0002991 epoch_Time:29521.0min: [2024-01-02 18:04:16,074][model8_pretrain.py][INFO] Epoch:[0/2](3800/4588595) loss:3.778 lr:0.0002991 epoch_Time:29521.0min: [2024-01-02 18:04:16,074][model8_pretrain.py][INFO] Epoch:[0/2](3800/4588595) loss:4.183 lr:0.0002991 epoch_Time:29521.0min: [2024-01-02 18:04:16,074][model8_pretrain.py][INFO] Epoch:[0/2](3800/4588595) loss:4.246 lr:0.0002991 epoch_Time:29521.0min: [2024-01-02 
18:04:53,003][model8_pretrain.py][INFO] Epoch:[0/2](3900/4588595) loss:4.212 lr:0.0002990 epoch_Time:29487.0min: [2024-01-02 18:04:53,003][model8_pretrain.py][INFO] Epoch:[0/2](3900/4588595) loss:3.375 lr:0.0002990 epoch_Time:29488.0min: [2024-01-02 18:04:53,003][model8_pretrain.py][INFO] Epoch:[0/2](3900/4588595) loss:4.457 lr:0.0002990 epoch_Time:29487.0min: [2024-01-02 18:04:53,003][model8_pretrain.py][INFO] Epoch:[0/2](3900/4588595) loss:3.836 lr:0.0002990 epoch_Time:29487.0min: [2024-01-02 18:04:53,003][model8_pretrain.py][INFO] Epoch:[0/2](3900/4588595) loss:4.184 lr:0.0002990 epoch_Time:29487.0min: [2024-01-02 18:04:53,003][model8_pretrain.py][INFO] Epoch:[0/2](3900/4588595) loss:4.087 lr:0.0002990 epoch_Time:29487.0min: [2024-01-02 18:04:53,004][model8_pretrain.py][INFO] Epoch:[0/2](3900/4588595) loss:4.357 lr:0.0002990 epoch_Time:29487.0min: [2024-01-02 18:04:53,004][model8_pretrain.py][INFO] Epoch:[0/2](3900/4588595) loss:4.618 lr:0.0002990 epoch_Time:29487.0min: [2024-01-02 18:05:29,950][model8_pretrain.py][INFO] Epoch:[0/2](4000/4588595) loss:3.857 lr:0.0002990 epoch_Time:29456.0min: [2024-01-02 18:05:29,950][model8_pretrain.py][INFO] Epoch:[0/2](4000/4588595) loss:4.159 lr:0.0002990 epoch_Time:29456.0min: [2024-01-02 18:05:29,950][model8_pretrain.py][INFO] Epoch:[0/2](4000/4588595) loss:4.226 lr:0.0002990 epoch_Time:29456.0min: [2024-01-02 18:05:29,950][model8_pretrain.py][INFO] Epoch:[0/2](4000/4588595) loss:4.273 lr:0.0002990 epoch_Time:29455.0min: [2024-01-02 18:05:29,950][model8_pretrain.py][INFO] Epoch:[0/2](4000/4588595) loss:4.178 lr:0.0002990 epoch_Time:29456.0min: [2024-01-02 18:05:29,950][model8_pretrain.py][INFO] Epoch:[0/2](4000/4588595) loss:4.127 lr:0.0002990 epoch_Time:29456.0min: [2024-01-02 18:05:29,950][model8_pretrain.py][INFO] Epoch:[0/2](4000/4588595) loss:4.352 lr:0.0002990 epoch_Time:29456.0min: [2024-01-02 18:05:29,950][model8_pretrain.py][INFO] Epoch:[0/2](4000/4588595) loss:3.897 lr:0.0002990 epoch_Time:29455.0min: [2024-01-02 
18:06:06,917][model8_pretrain.py][INFO] Epoch:[0/2](4100/4588595) loss:4.322 lr:0.0002989 epoch_Time:29425.0min: [2024-01-02 18:06:06,917][model8_pretrain.py][INFO] Epoch:[0/2](4100/4588595) loss:4.286 lr:0.0002989 epoch_Time:29425.0min: [2024-01-02 18:06:06,917][model8_pretrain.py][INFO] Epoch:[0/2](4100/4588595) loss:4.018 lr:0.0002989 epoch_Time:29425.0min: [2024-01-02 18:06:06,917][model8_pretrain.py][INFO] Epoch:[0/2](4100/4588595) loss:4.082 lr:0.0002989 epoch_Time:29425.0min: [2024-01-02 18:06:06,918][model8_pretrain.py][INFO] Epoch:[0/2](4100/4588595) loss:4.296 lr:0.0002989 epoch_Time:29425.0min: [2024-01-02 18:06:06,918][model8_pretrain.py][INFO] Epoch:[0/2](4100/4588595) loss:4.018 lr:0.0002989 epoch_Time:29425.0min: [2024-01-02 18:06:06,917][model8_pretrain.py][INFO] Epoch:[0/2](4100/4588595) loss:3.975 lr:0.0002989 epoch_Time:29425.0min: [2024-01-02 18:06:06,918][model8_pretrain.py][INFO] Epoch:[0/2](4100/4588595) loss:4.142 lr:0.0002989 epoch_Time:29426.0min: [2024-01-02 18:06:43,859][model8_pretrain.py][INFO] Epoch:[0/2](4200/4588595) loss:3.932 lr:0.0002988 epoch_Time:29396.0min: [2024-01-02 18:06:43,859][model8_pretrain.py][INFO] Epoch:[0/2](4200/4588595) loss:4.165 lr:0.0002988 epoch_Time:29397.0min: [2024-01-02 18:06:43,859][model8_pretrain.py][INFO] Epoch:[0/2](4200/4588595) loss:3.896 lr:0.0002988 epoch_Time:29397.0min: [2024-01-02 18:06:43,859][model8_pretrain.py][INFO] Epoch:[0/2](4200/4588595) loss:4.066 lr:0.0002988 epoch_Time:29396.0min: [2024-01-02 18:06:43,859][model8_pretrain.py][INFO] Epoch:[0/2](4200/4588595) loss:4.102 lr:0.0002988 epoch_Time:29397.0min: [2024-01-02 18:06:43,859][model8_pretrain.py][INFO] Epoch:[0/2](4200/4588595) loss:4.227 lr:0.0002988 epoch_Time:29397.0min: [2024-01-02 18:06:43,859][model8_pretrain.py][INFO] Epoch:[0/2](4200/4588595) loss:3.849 lr:0.0002988 epoch_Time:29396.0min: [2024-01-02 18:06:43,859][model8_pretrain.py][INFO] Epoch:[0/2](4200/4588595) loss:4.459 lr:0.0002988 epoch_Time:29397.0min: [2024-01-02 
18:07:20,785][model8_pretrain.py][INFO] Epoch:[0/2](4300/4588595) loss:4.373 lr:0.0002988 epoch_Time:29368.0min: [2024-01-02 18:07:20,785][model8_pretrain.py][INFO] Epoch:[0/2](4300/4588595) loss:3.882 lr:0.0002988 epoch_Time:29368.0min: [2024-01-02 18:07:20,785][model8_pretrain.py][INFO] Epoch:[0/2](4300/4588595) loss:4.404 lr:0.0002988 epoch_Time:29369.0min: [2024-01-02 18:07:20,785][model8_pretrain.py][INFO] Epoch:[0/2](4300/4588595) loss:4.392 lr:0.0002988 epoch_Time:29368.0min: [2024-01-02 18:07:20,785][model8_pretrain.py][INFO] Epoch:[0/2](4300/4588595) loss:3.649 lr:0.0002988 epoch_Time:29368.0min: [2024-01-02 18:07:20,785][model8_pretrain.py][INFO] Epoch:[0/2](4300/4588595) loss:3.843 lr:0.0002988 epoch_Time:29368.0min: [2024-01-02 18:07:20,785][model8_pretrain.py][INFO] Epoch:[0/2](4300/4588595) loss:3.993 lr:0.0002988 epoch_Time:29368.0min: [2024-01-02 18:07:20,785][model8_pretrain.py][INFO] Epoch:[0/2](4300/4588595) loss:3.998 lr:0.0002988 epoch_Time:29368.0min: [2024-01-02 18:08:01,297][model8_pretrain.py][INFO] Epoch:[0/2](4400/4588595) loss:4.181 lr:0.0002987 epoch_Time:29403.0min: [2024-01-02 18:08:01,297][model8_pretrain.py][INFO] Epoch:[0/2](4400/4588595) loss:3.960 lr:0.0002987 epoch_Time:29403.0min: [2024-01-02 18:08:01,297][model8_pretrain.py][INFO] Epoch:[0/2](4400/4588595) loss:4.224 lr:0.0002987 epoch_Time:29404.0min: [2024-01-02 18:08:01,297][model8_pretrain.py][INFO] Epoch:[0/2](4400/4588595) loss:3.731 lr:0.0002987 epoch_Time:29403.0min: [2024-01-02 18:08:01,297][model8_pretrain.py][INFO] Epoch:[0/2](4400/4588595) loss:3.707 lr:0.0002987 epoch_Time:29403.0min: [2024-01-02 18:08:01,297][model8_pretrain.py][INFO] Epoch:[0/2](4400/4588595) loss:3.738 lr:0.0002987 epoch_Time:29403.0min: [2024-01-02 18:08:01,297][model8_pretrain.py][INFO] Epoch:[0/2](4400/4588595) loss:4.200 lr:0.0002987 epoch_Time:29403.0min: [2024-01-02 18:08:01,297][model8_pretrain.py][INFO] Epoch:[0/2](4400/4588595) loss:3.649 lr:0.0002987 epoch_Time:29403.0min: [2024-01-02 
18:08:38,202][model8_pretrain.py][INFO] Epoch:[0/2](4500/4588595) loss:3.569 lr:0.0002986 epoch_Time:29376.0min: [2024-01-02 18:08:38,202][model8_pretrain.py][INFO] Epoch:[0/2](4500/4588595) loss:3.982 lr:0.0002986 epoch_Time:29376.0min: [2024-01-02 18:08:38,202][model8_pretrain.py][INFO] Epoch:[0/2](4500/4588595) loss:3.840 lr:0.0002986 epoch_Time:29376.0min: [2024-01-02 18:08:38,202][model8_pretrain.py][INFO] Epoch:[0/2](4500/4588595) loss:4.313 lr:0.0002986 epoch_Time:29377.0min: [2024-01-02 18:08:38,202][model8_pretrain.py][INFO] Epoch:[0/2](4500/4588595) loss:3.473 lr:0.0002986 epoch_Time:29376.0min: [2024-01-02 18:08:38,202][model8_pretrain.py][INFO] Epoch:[0/2](4500/4588595) loss:4.256 lr:0.0002986 epoch_Time:29376.0min: [2024-01-02 18:08:38,202][model8_pretrain.py][INFO] Epoch:[0/2](4500/4588595) loss:3.768 lr:0.0002986 epoch_Time:29376.0min: [2024-01-02 18:08:38,202][model8_pretrain.py][INFO] Epoch:[0/2](4500/4588595) loss:3.804 lr:0.0002986 epoch_Time:29376.0min: [2024-01-02 18:09:15,123][model8_pretrain.py][INFO] Epoch:[0/2](4600/4588595) loss:4.059 lr:0.0002985 epoch_Time:29350.0min: [2024-01-02 18:09:15,123][model8_pretrain.py][INFO] Epoch:[0/2](4600/4588595) loss:3.963 lr:0.0002985 epoch_Time:29350.0min: [2024-01-02 18:09:15,123][model8_pretrain.py][INFO] Epoch:[0/2](4600/4588595) loss:3.498 lr:0.0002985 epoch_Time:29350.0min: [2024-01-02 18:09:15,123][model8_pretrain.py][INFO] Epoch:[0/2](4600/4588595) loss:4.064 lr:0.0002985 epoch_Time:29350.0min: [2024-01-02 18:09:15,124][model8_pretrain.py][INFO] Epoch:[0/2](4600/4588595) loss:3.930 lr:0.0002985 epoch_Time:29350.0min: [2024-01-02 18:09:15,124][model8_pretrain.py][INFO] Epoch:[0/2](4600/4588595) loss:4.243 lr:0.0002985 epoch_Time:29350.0min: [2024-01-02 18:09:15,124][model8_pretrain.py][INFO] Epoch:[0/2](4600/4588595) loss:4.072 lr:0.0002985 epoch_Time:29350.0min: [2024-01-02 18:09:15,124][model8_pretrain.py][INFO] Epoch:[0/2](4600/4588595) loss:4.171 lr:0.0002985 epoch_Time:29350.0min: [2024-01-02 
18:09:52,068][model8_pretrain.py][INFO] Epoch:[0/2](4700/4588595) loss:4.578 lr:0.0002984 epoch_Time:29325.0min: [2024-01-02 18:09:52,068][model8_pretrain.py][INFO] Epoch:[0/2](4700/4588595) loss:4.345 lr:0.0002984 epoch_Time:29325.0min: [2024-01-02 18:09:52,068][model8_pretrain.py][INFO] Epoch:[0/2](4700/4588595) loss:4.069 lr:0.0002984 epoch_Time:29325.0min: [2024-01-02 18:09:52,068][model8_pretrain.py][INFO] Epoch:[0/2](4700/4588595) loss:4.200 lr:0.0002984 epoch_Time:29325.0min: [2024-01-02 18:09:52,068][model8_pretrain.py][INFO] Epoch:[0/2](4700/4588595) loss:3.928 lr:0.0002984 epoch_Time:29325.0min: [2024-01-02 18:09:52,068][model8_pretrain.py][INFO] Epoch:[0/2](4700/4588595) loss:3.794 lr:0.0002984 epoch_Time:29325.0min: [2024-01-02 18:09:52,068][model8_pretrain.py][INFO] Epoch:[0/2](4700/4588595) loss:4.278 lr:0.0002984 epoch_Time:29325.0min: [2024-01-02 18:09:52,070][model8_pretrain.py][INFO] Epoch:[0/2](4700/4588595) loss:3.849 lr:0.0002984 epoch_Time:29325.0min: [2024-01-02 18:10:29,039][model8_pretrain.py][INFO] Epoch:[0/2](4800/4588595) loss:4.186 lr:0.0002983 epoch_Time:29302.0min: [2024-01-02 18:10:29,039][model8_pretrain.py][INFO] Epoch:[0/2](4800/4588595) loss:4.271 lr:0.0002983 epoch_Time:29302.0min: [2024-01-02 18:10:29,039][model8_pretrain.py][INFO] Epoch:[0/2](4800/4588595) loss:4.309 lr:0.0002983 epoch_Time:29303.0min: [2024-01-02 18:10:29,039][model8_pretrain.py][INFO] Epoch:[0/2](4800/4588595) loss:3.794 lr:0.0002983 epoch_Time:29302.0min: [2024-01-02 18:10:29,040][model8_pretrain.py][INFO] Epoch:[0/2](4800/4588595) loss:4.063 lr:0.0002983 epoch_Time:29302.0min: [2024-01-02 18:10:29,039][model8_pretrain.py][INFO] Epoch:[0/2](4800/4588595) loss:3.689 lr:0.0002983 epoch_Time:29302.0min: [2024-01-02 18:10:29,040][model8_pretrain.py][INFO] Epoch:[0/2](4800/4588595) loss:4.026 lr:0.0002983 epoch_Time:29302.0min: [2024-01-02 18:10:29,039][model8_pretrain.py][INFO] Epoch:[0/2](4800/4588595) loss:4.292 lr:0.0002983 epoch_Time:29303.0min: [2024-01-02 
18:11:05,990][model8_pretrain.py][INFO] Epoch:[0/2](4900/4588595) loss:4.063 lr:0.0002983 epoch_Time:29279.0min: [2024-01-02 18:11:05,990][model8_pretrain.py][INFO] Epoch:[0/2](4900/4588595) loss:3.829 lr:0.0002983 epoch_Time:29279.0min: [2024-01-02 18:11:05,990][model8_pretrain.py][INFO] Epoch:[0/2](4900/4588595) loss:4.190 lr:0.0002983 epoch_Time:29280.0min: [2024-01-02 18:11:05,990][model8_pretrain.py][INFO] Epoch:[0/2](4900/4588595) loss:4.118 lr:0.0002983 epoch_Time:29279.0min: [2024-01-02 18:11:05,990][model8_pretrain.py][INFO] Epoch:[0/2](4900/4588595) loss:3.858 lr:0.0002983 epoch_Time:29280.0min: [2024-01-02 18:11:05,990][model8_pretrain.py][INFO] Epoch:[0/2](4900/4588595) loss:4.275 lr:0.0002983 epoch_Time:29279.0min: [2024-01-02 18:11:05,990][model8_pretrain.py][INFO] Epoch:[0/2](4900/4588595) loss:3.787 lr:0.0002983 epoch_Time:29279.0min: [2024-01-02 18:11:05,990][model8_pretrain.py][INFO] Epoch:[0/2](4900/4588595) loss:4.298 lr:0.0002983 epoch_Time:29279.0min: [2024-01-02 18:11:42,946][model8_pretrain.py][INFO] Epoch:[0/2](5000/4588595) loss:4.206 lr:0.0002982 epoch_Time:29258.0min: [2024-01-02 18:11:42,946][model8_pretrain.py][INFO] Epoch:[0/2](5000/4588595) loss:3.875 lr:0.0002982 epoch_Time:29259.0min: [2024-01-02 18:11:42,946][model8_pretrain.py][INFO] Epoch:[0/2](5000/4588595) loss:4.157 lr:0.0002982 epoch_Time:29258.0min: [2024-01-02 18:11:42,946][model8_pretrain.py][INFO] Epoch:[0/2](5000/4588595) loss:3.974 lr:0.0002982 epoch_Time:29258.0min: [2024-01-02 18:11:42,946][model8_pretrain.py][INFO] Epoch:[0/2](5000/4588595) loss:4.078 lr:0.0002982 epoch_Time:29259.0min: [2024-01-02 18:11:42,946][model8_pretrain.py][INFO] Epoch:[0/2](5000/4588595) loss:4.254 lr:0.0002982 epoch_Time:29258.0min: [2024-01-02 18:11:42,946][model8_pretrain.py][INFO] Epoch:[0/2](5000/4588595) loss:3.662 lr:0.0002982 epoch_Time:29258.0min: [2024-01-02 18:11:42,946][model8_pretrain.py][INFO] Epoch:[0/2](5000/4588595) loss:4.226 lr:0.0002982 epoch_Time:29258.0min: [2024-01-02 
18:12:19,872][model8_pretrain.py][INFO] Epoch:[0/2](5100/4588595) loss:4.074 lr:0.0002981 epoch_Time:29237.0min: [2024-01-02 18:12:19,872][model8_pretrain.py][INFO] Epoch:[0/2](5100/4588595) loss:3.889 lr:0.0002981 epoch_Time:29237.0min: [2024-01-02 18:12:19,872][model8_pretrain.py][INFO] Epoch:[0/2](5100/4588595) loss:3.809 lr:0.0002981 epoch_Time:29237.0min: [2024-01-02 18:12:19,872][model8_pretrain.py][INFO] Epoch:[0/2](5100/4588595) loss:3.828 lr:0.0002981 epoch_Time:29237.0min: [2024-01-02 18:12:19,872][model8_pretrain.py][INFO] Epoch:[0/2](5100/4588595) loss:3.924 lr:0.0002981 epoch_Time:29237.0min: [2024-01-02 18:12:19,872][model8_pretrain.py][INFO] Epoch:[0/2](5100/4588595) loss:3.735 lr:0.0002981 epoch_Time:29237.0min: [2024-01-02 18:12:19,873][model8_pretrain.py][INFO] Epoch:[0/2](5100/4588595) loss:4.116 lr:0.0002981 epoch_Time:29237.0min: [2024-01-02 18:12:19,873][model8_pretrain.py][INFO] Epoch:[0/2](5100/4588595) loss:3.626 lr:0.0002981 epoch_Time:29237.0min: [2024-01-02 18:13:00,432][model8_pretrain.py][INFO] Epoch:[0/2](5200/4588595) loss:4.180 lr:0.0002980 epoch_Time:29269.0min: [2024-01-02 18:13:00,432][model8_pretrain.py][INFO] Epoch:[0/2](5200/4588595) loss:3.897 lr:0.0002980 epoch_Time:29270.0min: [2024-01-02 18:13:00,432][model8_pretrain.py][INFO] Epoch:[0/2](5200/4588595) loss:3.677 lr:0.0002980 epoch_Time:29270.0min: [2024-01-02 18:13:00,432][model8_pretrain.py][INFO] Epoch:[0/2](5200/4588595) loss:4.277 lr:0.0002980 epoch_Time:29269.0min: [2024-01-02 18:13:00,432][model8_pretrain.py][INFO] Epoch:[0/2](5200/4588595) loss:3.749 lr:0.0002980 epoch_Time:29270.0min: [2024-01-02 18:13:00,432][model8_pretrain.py][INFO] Epoch:[0/2](5200/4588595) loss:3.867 lr:0.0002980 epoch_Time:29270.0min: [2024-01-02 18:13:00,432][model8_pretrain.py][INFO] Epoch:[0/2](5200/4588595) loss:4.330 lr:0.0002980 epoch_Time:29270.0min: [2024-01-02 18:13:00,432][model8_pretrain.py][INFO] Epoch:[0/2](5200/4588595) loss:4.142 lr:0.0002980 epoch_Time:29269.0min: [2024-01-02 
18:13:37,337][model8_pretrain.py][INFO] Epoch:[0/2](5300/4588595) loss:3.981 lr:0.0002979 epoch_Time:29249.0min: [2024-01-02 18:13:37,337][model8_pretrain.py][INFO] Epoch:[0/2](5300/4588595) loss:3.806 lr:0.0002979 epoch_Time:29249.0min: [2024-01-02 18:13:37,337][model8_pretrain.py][INFO] Epoch:[0/2](5300/4588595) loss:4.241 lr:0.0002979 epoch_Time:29249.0min: [2024-01-02 18:13:37,337][model8_pretrain.py][INFO] Epoch:[0/2](5300/4588595) loss:3.577 lr:0.0002979 epoch_Time:29250.0min: [2024-01-02 18:13:37,337][model8_pretrain.py][INFO] Epoch:[0/2](5300/4588595) loss:4.232 lr:0.0002979 epoch_Time:29249.0min: [2024-01-02 18:13:37,337][model8_pretrain.py][INFO] Epoch:[0/2](5300/4588595) loss:4.297 lr:0.0002979 epoch_Time:29249.0min: [2024-01-02 18:13:37,337][model8_pretrain.py][INFO] Epoch:[0/2](5300/4588595) loss:4.352 lr:0.0002979 epoch_Time:29249.0min: [2024-01-02 18:13:37,337][model8_pretrain.py][INFO] Epoch:[0/2](5300/4588595) loss:3.649 lr:0.0002979 epoch_Time:29249.0min: [2024-01-02 18:14:14,256][model8_pretrain.py][INFO] Epoch:[0/2](5400/4588595) loss:3.862 lr:0.0002978 epoch_Time:29229.0min: [2024-01-02 18:14:14,256][model8_pretrain.py][INFO] Epoch:[0/2](5400/4588595) loss:3.927 lr:0.0002978 epoch_Time:29229.0min: [2024-01-02 18:14:14,256][model8_pretrain.py][INFO] Epoch:[0/2](5400/4588595) loss:3.942 lr:0.0002978 epoch_Time:29229.0min: [2024-01-02 18:14:14,256][model8_pretrain.py][INFO] Epoch:[0/2](5400/4588595) loss:3.671 lr:0.0002978 epoch_Time:29229.0min: [2024-01-02 18:14:14,256][model8_pretrain.py][INFO] Epoch:[0/2](5400/4588595) loss:3.881 lr:0.0002978 epoch_Time:29229.0min: [2024-01-02 18:14:14,256][model8_pretrain.py][INFO] Epoch:[0/2](5400/4588595) loss:4.142 lr:0.0002978 epoch_Time:29229.0min: [2024-01-02 18:14:14,256][model8_pretrain.py][INFO] Epoch:[0/2](5400/4588595) loss:3.954 lr:0.0002978 epoch_Time:29229.0min: [2024-01-02 18:14:14,256][model8_pretrain.py][INFO] Epoch:[0/2](5400/4588595) loss:3.824 lr:0.0002978 epoch_Time:29229.0min: [2024-01-02 
18:14:51,178][model8_pretrain.py][INFO] Epoch:[0/2](5500/4588595) loss:4.084 lr:0.0002977 epoch_Time:29209.0min: [2024-01-02 18:14:51,178][model8_pretrain.py][INFO] Epoch:[0/2](5500/4588595) loss:4.182 lr:0.0002977 epoch_Time:29209.0min: [2024-01-02 18:14:51,178][model8_pretrain.py][INFO] Epoch:[0/2](5500/4588595) loss:4.055 lr:0.0002977 epoch_Time:29209.0min: [2024-01-02 18:14:51,178][model8_pretrain.py][INFO] Epoch:[0/2](5500/4588595) loss:3.769 lr:0.0002977 epoch_Time:29209.0min: [2024-01-02 18:14:51,178][model8_pretrain.py][INFO] Epoch:[0/2](5500/4588595) loss:3.605 lr:0.0002977 epoch_Time:29209.0min: [2024-01-02 18:14:51,178][model8_pretrain.py][INFO] Epoch:[0/2](5500/4588595) loss:3.904 lr:0.0002977 epoch_Time:29209.0min: [2024-01-02 18:14:51,178][model8_pretrain.py][INFO] Epoch:[0/2](5500/4588595) loss:3.940 lr:0.0002977 epoch_Time:29209.0min: [2024-01-02 18:14:51,178][model8_pretrain.py][INFO] Epoch:[0/2](5500/4588595) loss:4.290 lr:0.0002977 epoch_Time:29210.0min: [2024-01-02 18:15:28,135][model8_pretrain.py][INFO] Epoch:[0/2](5600/4588595) loss:3.913 lr:0.0002976 epoch_Time:29192.0min: [2024-01-02 18:15:28,135][model8_pretrain.py][INFO] Epoch:[0/2](5600/4588595) loss:4.232 lr:0.0002976 epoch_Time:29191.0min: [2024-01-02 18:15:28,135][model8_pretrain.py][INFO] Epoch:[0/2](5600/4588595) loss:4.207 lr:0.0002976 epoch_Time:29192.0min: [2024-01-02 18:15:28,135][model8_pretrain.py][INFO] Epoch:[0/2](5600/4588595) loss:3.930 lr:0.0002976 epoch_Time:29192.0min: [2024-01-02 18:15:28,135][model8_pretrain.py][INFO] Epoch:[0/2](5600/4588595) loss:4.311 lr:0.0002976 epoch_Time:29192.0min: [2024-01-02 18:15:28,135][model8_pretrain.py][INFO] Epoch:[0/2](5600/4588595) loss:3.189 lr:0.0002976 epoch_Time:29191.0min: [2024-01-02 18:15:28,136][model8_pretrain.py][INFO] Epoch:[0/2](5600/4588595) loss:3.944 lr:0.0002976 epoch_Time:29192.0min: [2024-01-02 18:15:28,136][model8_pretrain.py][INFO] Epoch:[0/2](5600/4588595) loss:3.928 lr:0.0002976 epoch_Time:29192.0min: [2024-01-02 
18:16:05,071][model8_pretrain.py][INFO] Epoch:[0/2](5700/4588595) loss:3.972 lr:0.0002975 epoch_Time:29173.0min: [2024-01-02 18:16:05,071][model8_pretrain.py][INFO] Epoch:[0/2](5700/4588595) loss:3.645 lr:0.0002975 epoch_Time:29173.0min: [2024-01-02 18:16:05,071][model8_pretrain.py][INFO] Epoch:[0/2](5700/4588595) loss:3.812 lr:0.0002975 epoch_Time:29174.0min: [2024-01-02 18:16:05,071][model8_pretrain.py][INFO] Epoch:[0/2](5700/4588595) loss:3.570 lr:0.0002975 epoch_Time:29173.0min: [2024-01-02 18:16:05,071][model8_pretrain.py][INFO] Epoch:[0/2](5700/4588595) loss:4.423 lr:0.0002975 epoch_Time:29173.0min: [2024-01-02 18:16:05,071][model8_pretrain.py][INFO] Epoch:[0/2](5700/4588595) loss:3.726 lr:0.0002975 epoch_Time:29173.0min: [2024-01-02 18:16:05,071][model8_pretrain.py][INFO] Epoch:[0/2](5700/4588595) loss:3.947 lr:0.0002975 epoch_Time:29173.0min: [2024-01-02 18:16:05,071][model8_pretrain.py][INFO] Epoch:[0/2](5700/4588595) loss:3.854 lr:0.0002975 epoch_Time:29174.0min: [2024-01-02 18:16:42,002][model8_pretrain.py][INFO] Epoch:[0/2](5800/4588595) loss:4.301 lr:0.0002974 epoch_Time:29157.0min: [2024-01-02 18:16:42,002][model8_pretrain.py][INFO] Epoch:[0/2](5800/4588595) loss:3.780 lr:0.0002974 epoch_Time:29157.0min: [2024-01-02 18:16:42,002][model8_pretrain.py][INFO] Epoch:[0/2](5800/4588595) loss:4.294 lr:0.0002974 epoch_Time:29157.0min: [2024-01-02 18:16:42,002][model8_pretrain.py][INFO] Epoch:[0/2](5800/4588595) loss:3.965 lr:0.0002974 epoch_Time:29157.0min: [2024-01-02 18:16:42,002][model8_pretrain.py][INFO] Epoch:[0/2](5800/4588595) loss:4.610 lr:0.0002974 epoch_Time:29157.0min: [2024-01-02 18:16:42,003][model8_pretrain.py][INFO] Epoch:[0/2](5800/4588595) loss:4.370 lr:0.0002974 epoch_Time:29157.0min: [2024-01-02 18:16:42,003][model8_pretrain.py][INFO] Epoch:[0/2](5800/4588595) loss:3.909 lr:0.0002974 epoch_Time:29157.0min: [2024-01-02 18:16:42,003][model8_pretrain.py][INFO] Epoch:[0/2](5800/4588595) loss:3.793 lr:0.0002974 epoch_Time:29157.0min: [2024-01-02 
18:17:18,929][model8_pretrain.py][INFO] Epoch:[0/2](5900/4588595) loss:4.123 lr:0.0002973 epoch_Time:29140.0min: [2024-01-02 18:17:18,929][model8_pretrain.py][INFO] Epoch:[0/2](5900/4588595) loss:3.951 lr:0.0002973 epoch_Time:29140.0min: [2024-01-02 18:17:18,929][model8_pretrain.py][INFO] Epoch:[0/2](5900/4588595) loss:3.533 lr:0.0002973 epoch_Time:29139.0min: [2024-01-02 18:17:18,929][model8_pretrain.py][INFO] Epoch:[0/2](5900/4588595) loss:4.278 lr:0.0002973 epoch_Time:29139.0min: [2024-01-02 18:17:18,929][model8_pretrain.py][INFO] Epoch:[0/2](5900/4588595) loss:4.387 lr:0.0002973 epoch_Time:29140.0min: [2024-01-02 18:17:18,929][model8_pretrain.py][INFO] Epoch:[0/2](5900/4588595) loss:3.521 lr:0.0002973 epoch_Time:29140.0min: [2024-01-02 18:17:18,929][model8_pretrain.py][INFO] Epoch:[0/2](5900/4588595) loss:3.828 lr:0.0002973 epoch_Time:29140.0min: [2024-01-02 18:17:18,929][model8_pretrain.py][INFO] Epoch:[0/2](5900/4588595) loss:3.462 lr:0.0002973 epoch_Time:29140.0min: [2024-01-02 18:17:59,507][model8_pretrain.py][INFO] Epoch:[0/2](6000/4588595) loss:4.049 lr:0.0002971 epoch_Time:29169.0min: [2024-01-02 18:17:59,507][model8_pretrain.py][INFO] Epoch:[0/2](6000/4588595) loss:3.860 lr:0.0002971 epoch_Time:29170.0min: [2024-01-02 18:17:59,507][model8_pretrain.py][INFO] Epoch:[0/2](6000/4588595) loss:3.876 lr:0.0002971 epoch_Time:29170.0min: [2024-01-02 18:17:59,507][model8_pretrain.py][INFO] Epoch:[0/2](6000/4588595) loss:3.448 lr:0.0002971 epoch_Time:29170.0min: [2024-01-02 18:17:59,507][model8_pretrain.py][INFO] Epoch:[0/2](6000/4588595) loss:3.583 lr:0.0002971 epoch_Time:29170.0min: [2024-01-02 18:17:59,512][model8_pretrain.py][INFO] Epoch:[0/2](6000/4588595) loss:3.887 lr:0.0002971 epoch_Time:29169.0min: [2024-01-02 18:17:59,512][model8_pretrain.py][INFO] Epoch:[0/2](6000/4588595) loss:4.159 lr:0.0002971 epoch_Time:29170.0min: [2024-01-02 18:17:59,567][model8_pretrain.py][INFO] Epoch:[0/2](6000/4588595) loss:4.021 lr:0.0002971 epoch_Time:29170.0min: [2024-01-02 
18:18:36,488][model8_pretrain.py][INFO] Epoch:[0/2](6100/4588595) loss:4.196 lr:0.0002970 epoch_Time:29154.0min: [2024-01-02 18:18:36,488][model8_pretrain.py][INFO] Epoch:[0/2](6100/4588595) loss:3.847 lr:0.0002970 epoch_Time:29154.0min: [2024-01-02 18:18:36,489][model8_pretrain.py][INFO] Epoch:[0/2](6100/4588595) loss:3.528 lr:0.0002970 epoch_Time:29154.0min: [2024-01-02 18:18:36,489][model8_pretrain.py][INFO] Epoch:[0/2](6100/4588595) loss:4.139 lr:0.0002970 epoch_Time:29154.0min: [2024-01-02 18:18:36,489][model8_pretrain.py][INFO] Epoch:[0/2](6100/4588595) loss:4.498 lr:0.0002970 epoch_Time:29155.0min: [2024-01-02 18:18:36,489][model8_pretrain.py][INFO] Epoch:[0/2](6100/4588595) loss:3.877 lr:0.0002970 epoch_Time:29154.0min: [2024-01-02 18:18:36,489][model8_pretrain.py][INFO] Epoch:[0/2](6100/4588595) loss:4.204 lr:0.0002970 epoch_Time:29155.0min: [2024-01-02 18:18:36,489][model8_pretrain.py][INFO] Epoch:[0/2](6100/4588595) loss:3.991 lr:0.0002970 epoch_Time:29154.0min: [2024-01-02 18:19:13,438][model8_pretrain.py][INFO] Epoch:[0/2](6200/4588595) loss:4.059 lr:0.0002969 epoch_Time:29138.0min: [2024-01-02 18:19:13,438][model8_pretrain.py][INFO] Epoch:[0/2](6200/4588595) loss:3.978 lr:0.0002969 epoch_Time:29138.0min: [2024-01-02 18:19:13,438][model8_pretrain.py][INFO] Epoch:[0/2](6200/4588595) loss:3.869 lr:0.0002969 epoch_Time:29139.0min: [2024-01-02 18:19:13,438][model8_pretrain.py][INFO] Epoch:[0/2](6200/4588595) loss:4.132 lr:0.0002969 epoch_Time:29138.0min: [2024-01-02 18:19:13,438][model8_pretrain.py][INFO] Epoch:[0/2](6200/4588595) loss:4.086 lr:0.0002969 epoch_Time:29138.0min: [2024-01-02 18:19:13,438][model8_pretrain.py][INFO] Epoch:[0/2](6200/4588595) loss:4.162 lr:0.0002969 epoch_Time:29138.0min: [2024-01-02 18:19:13,438][model8_pretrain.py][INFO] Epoch:[0/2](6200/4588595) loss:3.975 lr:0.0002969 epoch_Time:29138.0min: [2024-01-02 18:19:13,438][model8_pretrain.py][INFO] Epoch:[0/2](6200/4588595) loss:4.034 lr:0.0002969 epoch_Time:29138.0min: [2024-01-02 
18:19:50,349][model8_pretrain.py][INFO] Epoch:[0/2](6300/4588595) loss:3.768 lr:0.0002968 epoch_Time:29122.0min: [2024-01-02 18:19:50,349][model8_pretrain.py][INFO] Epoch:[0/2](6300/4588595) loss:4.095 lr:0.0002968 epoch_Time:29122.0min: [2024-01-02 18:19:50,350][model8_pretrain.py][INFO] Epoch:[0/2](6300/4588595) loss:3.990 lr:0.0002968 epoch_Time:29122.0min: [2024-01-02 18:19:50,350][model8_pretrain.py][INFO] Epoch:[0/2](6300/4588595) loss:3.721 lr:0.0002968 epoch_Time:29122.0min: [2024-01-02 18:19:50,350][model8_pretrain.py][INFO] Epoch:[0/2](6300/4588595) loss:3.987 lr:0.0002968 epoch_Time:29122.0min: [2024-01-02 18:19:50,350][model8_pretrain.py][INFO] Epoch:[0/2](6300/4588595) loss:3.466 lr:0.0002968 epoch_Time:29123.0min: [2024-01-02 18:19:50,350][model8_pretrain.py][INFO] Epoch:[0/2](6300/4588595) loss:4.001 lr:0.0002968 epoch_Time:29122.0min: [2024-01-02 18:19:50,350][model8_pretrain.py][INFO] Epoch:[0/2](6300/4588595) loss:4.251 lr:0.0002968 epoch_Time:29122.0min: [2024-01-02 18:20:27,271][model8_pretrain.py][INFO] Epoch:[0/2](6400/4588595) loss:4.052 lr:0.0002967 epoch_Time:29108.0min: [2024-01-02 18:20:27,271][model8_pretrain.py][INFO] Epoch:[0/2](6400/4588595) loss:3.873 lr:0.0002967 epoch_Time:29108.0min: [2024-01-02 18:20:27,271][model8_pretrain.py][INFO] Epoch:[0/2](6400/4588595) loss:3.449 lr:0.0002967 epoch_Time:29108.0min: [2024-01-02 18:20:27,271][model8_pretrain.py][INFO] Epoch:[0/2](6400/4588595) loss:3.950 lr:0.0002967 epoch_Time:29108.0min: [2024-01-02 18:20:27,271][model8_pretrain.py][INFO] Epoch:[0/2](6400/4588595) loss:3.929 lr:0.0002967 epoch_Time:29108.0min: [2024-01-02 18:20:27,271][model8_pretrain.py][INFO] Epoch:[0/2](6400/4588595) loss:3.811 lr:0.0002967 epoch_Time:29108.0min: [2024-01-02 18:20:27,271][model8_pretrain.py][INFO] Epoch:[0/2](6400/4588595) loss:4.052 lr:0.0002967 epoch_Time:29108.0min: [2024-01-02 18:20:27,271][model8_pretrain.py][INFO] Epoch:[0/2](6400/4588595) loss:4.173 lr:0.0002967 epoch_Time:29108.0min: [2024-01-02 
18:21:04,180][model8_pretrain.py][INFO] Epoch:[0/2](6500/4588595) loss:3.633 lr:0.0002965 epoch_Time:29092.0min: [2024-01-02 18:21:04,180][model8_pretrain.py][INFO] Epoch:[0/2](6500/4588595) loss:4.592 lr:0.0002965 epoch_Time:29093.0min: [2024-01-02 18:21:04,180][model8_pretrain.py][INFO] Epoch:[0/2](6500/4588595) loss:4.068 lr:0.0002965 epoch_Time:29093.0min: [2024-01-02 18:21:04,180][model8_pretrain.py][INFO] Epoch:[0/2](6500/4588595) loss:3.554 lr:0.0002965 epoch_Time:29093.0min: [2024-01-02 18:21:04,180][model8_pretrain.py][INFO] Epoch:[0/2](6500/4588595) loss:4.158 lr:0.0002965 epoch_Time:29093.0min: [2024-01-02 18:21:04,180][model8_pretrain.py][INFO] Epoch:[0/2](6500/4588595) loss:3.445 lr:0.0002965 epoch_Time:29093.0min: [2024-01-02 18:21:04,180][model8_pretrain.py][INFO] Epoch:[0/2](6500/4588595) loss:3.344 lr:0.0002965 epoch_Time:29092.0min: [2024-01-02 18:21:04,180][model8_pretrain.py][INFO] Epoch:[0/2](6500/4588595) loss:4.173 lr:0.0002965 epoch_Time:29093.0min: [2024-01-02 18:21:41,100][model8_pretrain.py][INFO] Epoch:[0/2](6600/4588595) loss:3.752 lr:0.0002964 epoch_Time:29079.0min: [2024-01-02 18:21:41,100][model8_pretrain.py][INFO] Epoch:[0/2](6600/4588595) loss:4.081 lr:0.0002964 epoch_Time:29079.0min: [2024-01-02 18:21:41,100][model8_pretrain.py][INFO] Epoch:[0/2](6600/4588595) loss:3.944 lr:0.0002964 epoch_Time:29079.0min: [2024-01-02 18:21:41,100][model8_pretrain.py][INFO] Epoch:[0/2](6600/4588595) loss:3.908 lr:0.0002964 epoch_Time:29079.0min: [2024-01-02 18:21:41,100][model8_pretrain.py][INFO] Epoch:[0/2](6600/4588595) loss:3.617 lr:0.0002964 epoch_Time:29079.0min: [2024-01-02 18:21:41,101][model8_pretrain.py][INFO] Epoch:[0/2](6600/4588595) loss:3.874 lr:0.0002964 epoch_Time:29079.0min: [2024-01-02 18:21:41,101][model8_pretrain.py][INFO] Epoch:[0/2](6600/4588595) loss:4.061 lr:0.0002964 epoch_Time:29079.0min: [2024-01-02 18:21:41,101][model8_pretrain.py][INFO] Epoch:[0/2](6600/4588595) loss:3.926 lr:0.0002964 epoch_Time:29079.0min: [2024-01-02 
18:22:18,019][model8_pretrain.py][INFO] Epoch:[0/2](6700/4588595) loss:3.835 lr:0.0002963 epoch_Time:29065.0min: [2024-01-02 18:22:18,019][model8_pretrain.py][INFO] Epoch:[0/2](6700/4588595) loss:3.811 lr:0.0002963 epoch_Time:29065.0min: [2024-01-02 18:22:18,020][model8_pretrain.py][INFO] Epoch:[0/2](6700/4588595) loss:4.071 lr:0.0002963 epoch_Time:29065.0min: [2024-01-02 18:22:18,019][model8_pretrain.py][INFO] Epoch:[0/2](6700/4588595) loss:3.849 lr:0.0002963 epoch_Time:29065.0min: [2024-01-02 18:22:18,019][model8_pretrain.py][INFO] Epoch:[0/2](6700/4588595) loss:3.993 lr:0.0002963 epoch_Time:29065.0min: [2024-01-02 18:22:18,020][model8_pretrain.py][INFO] Epoch:[0/2](6700/4588595) loss:3.532 lr:0.0002963 epoch_Time:29065.0min: [2024-01-02 18:22:18,020][model8_pretrain.py][INFO] Epoch:[0/2](6700/4588595) loss:3.992 lr:0.0002963 epoch_Time:29065.0min: [2024-01-02 18:22:18,020][model8_pretrain.py][INFO] Epoch:[0/2](6700/4588595) loss:3.198 lr:0.0002963 epoch_Time:29065.0min: [2024-01-02 18:22:54,950][model8_pretrain.py][INFO] Epoch:[0/2](6800/4588595) loss:3.985 lr:0.0002962 epoch_Time:29051.0min: [2024-01-02 18:22:54,950][model8_pretrain.py][INFO] Epoch:[0/2](6800/4588595) loss:3.453 lr:0.0002962 epoch_Time:29051.0min: [2024-01-02 18:22:54,950][model8_pretrain.py][INFO] Epoch:[0/2](6800/4588595) loss:3.893 lr:0.0002962 epoch_Time:29051.0min: [2024-01-02 18:22:54,950][model8_pretrain.py][INFO] Epoch:[0/2](6800/4588595) loss:3.250 lr:0.0002962 epoch_Time:29051.0min: [2024-01-02 18:22:54,950][model8_pretrain.py][INFO] Epoch:[0/2](6800/4588595) loss:3.366 lr:0.0002962 epoch_Time:29051.0min: [2024-01-02 18:22:54,950][model8_pretrain.py][INFO] Epoch:[0/2](6800/4588595) loss:3.962 lr:0.0002962 epoch_Time:29051.0min: [2024-01-02 18:22:54,951][model8_pretrain.py][INFO] Epoch:[0/2](6800/4588595) loss:4.206 lr:0.0002962 epoch_Time:29051.0min: [2024-01-02 18:22:54,951][model8_pretrain.py][INFO] Epoch:[0/2](6800/4588595) loss:3.654 lr:0.0002962 epoch_Time:29051.0min: [2024-01-02 
18:23:35,504][model8_pretrain.py][INFO] Epoch:[0/2](6900/4588595) loss:4.035 lr:0.0002960 epoch_Time:29079.0min: [2024-01-02 18:23:35,504][model8_pretrain.py][INFO] Epoch:[0/2](6900/4588595) loss:3.839 lr:0.0002960 epoch_Time:29079.0min: [2024-01-02 18:23:35,504][model8_pretrain.py][INFO] Epoch:[0/2](6900/4588595) loss:3.825 lr:0.0002960 epoch_Time:29079.0min: [2024-01-02 18:23:35,504][model8_pretrain.py][INFO] Epoch:[0/2](6900/4588595) loss:4.401 lr:0.0002960 epoch_Time:29079.0min: [2024-01-02 18:23:35,504][model8_pretrain.py][INFO] Epoch:[0/2](6900/4588595) loss:3.858 lr:0.0002960 epoch_Time:29079.0min: [2024-01-02 18:23:35,504][model8_pretrain.py][INFO] Epoch:[0/2](6900/4588595) loss:3.853 lr:0.0002960 epoch_Time:29079.0min: [2024-01-02 18:23:35,504][model8_pretrain.py][INFO] Epoch:[0/2](6900/4588595) loss:3.885 lr:0.0002960 epoch_Time:29079.0min: [2024-01-02 18:23:35,504][model8_pretrain.py][INFO] Epoch:[0/2](6900/4588595) loss:3.939 lr:0.0002960 epoch_Time:29079.0min: [2024-01-02 18:24:12,457][model8_pretrain.py][INFO] Epoch:[0/2](7000/4588595) loss:3.139 lr:0.0002959 epoch_Time:29065.0min: [2024-01-02 18:24:12,457][model8_pretrain.py][INFO] Epoch:[0/2](7000/4588595) loss:3.998 lr:0.0002959 epoch_Time:29065.0min: [2024-01-02 18:24:12,457][model8_pretrain.py][INFO] Epoch:[0/2](7000/4588595) loss:4.198 lr:0.0002959 epoch_Time:29066.0min: [2024-01-02 18:24:12,457][model8_pretrain.py][INFO] Epoch:[0/2](7000/4588595) loss:3.711 lr:0.0002959 epoch_Time:29065.0min: [2024-01-02 18:24:12,457][model8_pretrain.py][INFO] Epoch:[0/2](7000/4588595) loss:3.916 lr:0.0002959 epoch_Time:29066.0min: [2024-01-02 18:24:12,457][model8_pretrain.py][INFO] Epoch:[0/2](7000/4588595) loss:3.949 lr:0.0002959 epoch_Time:29066.0min: [2024-01-02 18:24:12,457][model8_pretrain.py][INFO] Epoch:[0/2](7000/4588595) loss:3.662 lr:0.0002959 epoch_Time:29066.0min: [2024-01-02 18:24:12,457][model8_pretrain.py][INFO] Epoch:[0/2](7000/4588595) loss:3.812 lr:0.0002959 epoch_Time:29066.0min: [2024-01-02 
18:24:49,392][model8_pretrain.py][INFO] Epoch:[0/2](7100/4588595) loss:3.920 lr:0.0002958 epoch_Time:29052.0min: [2024-01-02 18:24:49,392][model8_pretrain.py][INFO] Epoch:[0/2](7100/4588595) loss:3.599 lr:0.0002958 epoch_Time:29052.0min: [2024-01-02 18:24:49,392][model8_pretrain.py][INFO] Epoch:[0/2](7100/4588595) loss:3.289 lr:0.0002958 epoch_Time:29052.0min: [2024-01-02 18:24:49,392][model8_pretrain.py][INFO] Epoch:[0/2](7100/4588595) loss:3.956 lr:0.0002958 epoch_Time:29052.0min: [2024-01-02 18:24:49,392][model8_pretrain.py][INFO] Epoch:[0/2](7100/4588595) loss:3.480 lr:0.0002958 epoch_Time:29053.0min: [2024-01-02 18:24:49,392][model8_pretrain.py][INFO] Epoch:[0/2](7100/4588595) loss:3.896 lr:0.0002958 epoch_Time:29053.0min: [2024-01-02 18:24:49,393][model8_pretrain.py][INFO] Epoch:[0/2](7100/4588595) loss:3.801 lr:0.0002958 epoch_Time:29052.0min: [2024-01-02 18:24:49,393][model8_pretrain.py][INFO] Epoch:[0/2](7100/4588595) loss:3.606 lr:0.0002958 epoch_Time:29052.0min: [2024-01-02 18:25:26,324][model8_pretrain.py][INFO] Epoch:[0/2](7200/4588595) loss:3.492 lr:0.0002956 epoch_Time:29040.0min: [2024-01-02 18:25:26,324][model8_pretrain.py][INFO] Epoch:[0/2](7200/4588595) loss:3.787 lr:0.0002956 epoch_Time:29041.0min: [2024-01-02 18:25:26,324][model8_pretrain.py][INFO] Epoch:[0/2](7200/4588595) loss:3.521 lr:0.0002956 epoch_Time:29041.0min: [2024-01-02 18:25:26,324][model8_pretrain.py][INFO] Epoch:[0/2](7200/4588595) loss:3.751 lr:0.0002956 epoch_Time:29040.0min: [2024-01-02 18:25:26,324][model8_pretrain.py][INFO] Epoch:[0/2](7200/4588595) loss:3.979 lr:0.0002956 epoch_Time:29041.0min: [2024-01-02 18:25:26,324][model8_pretrain.py][INFO] Epoch:[0/2](7200/4588595) loss:3.673 lr:0.0002956 epoch_Time:29041.0min: [2024-01-02 18:25:26,324][model8_pretrain.py][INFO] Epoch:[0/2](7200/4588595) loss:4.189 lr:0.0002956 epoch_Time:29041.0min: [2024-01-02 18:25:26,324][model8_pretrain.py][INFO] Epoch:[0/2](7200/4588595) loss:3.588 lr:0.0002956 epoch_Time:29040.0min: [2024-01-02 
18:26:03,246][model8_pretrain.py][INFO] Epoch:[0/2](7300/4588595) loss:3.507 lr:0.0002955 epoch_Time:29028.0min: [2024-01-02 18:26:03,246][model8_pretrain.py][INFO] Epoch:[0/2](7300/4588595) loss:4.049 lr:0.0002955 epoch_Time:29028.0min: [2024-01-02 18:26:03,246][model8_pretrain.py][INFO] Epoch:[0/2](7300/4588595) loss:3.921 lr:0.0002955 epoch_Time:29028.0min: [2024-01-02 18:26:03,246][model8_pretrain.py][INFO] Epoch:[0/2](7300/4588595) loss:3.932 lr:0.0002955 epoch_Time:29028.0min: [2024-01-02 18:26:03,246][model8_pretrain.py][INFO] Epoch:[0/2](7300/4588595) loss:3.369 lr:0.0002955 epoch_Time:29028.0min: [2024-01-02 18:26:03,246][model8_pretrain.py][INFO] Epoch:[0/2](7300/4588595) loss:3.864 lr:0.0002955 epoch_Time:29028.0min: [2024-01-02 18:26:03,246][model8_pretrain.py][INFO] Epoch:[0/2](7300/4588595) loss:3.334 lr:0.0002955 epoch_Time:29028.0min: [2024-01-02 18:26:03,246][model8_pretrain.py][INFO] Epoch:[0/2](7300/4588595) loss:4.175 lr:0.0002955 epoch_Time:29028.0min: [2024-01-02 18:26:40,163][model8_pretrain.py][INFO] Epoch:[0/2](7400/4588595) loss:3.799 lr:0.0002953 epoch_Time:29016.0min: [2024-01-02 18:26:40,163][model8_pretrain.py][INFO] Epoch:[0/2](7400/4588595) loss:4.010 lr:0.0002953 epoch_Time:29017.0min: [2024-01-02 18:26:40,163][model8_pretrain.py][INFO] Epoch:[0/2](7400/4588595) loss:3.659 lr:0.0002953 epoch_Time:29016.0min: [2024-01-02 18:26:40,163][model8_pretrain.py][INFO] Epoch:[0/2](7400/4588595) loss:4.279 lr:0.0002953 epoch_Time:29016.0min: [2024-01-02 18:26:40,163][model8_pretrain.py][INFO] Epoch:[0/2](7400/4588595) loss:3.828 lr:0.0002953 epoch_Time:29017.0min: [2024-01-02 18:26:40,163][model8_pretrain.py][INFO] Epoch:[0/2](7400/4588595) loss:4.161 lr:0.0002953 epoch_Time:29016.0min: [2024-01-02 18:26:40,163][model8_pretrain.py][INFO] Epoch:[0/2](7400/4588595) loss:3.962 lr:0.0002953 epoch_Time:29016.0min: [2024-01-02 18:26:40,163][model8_pretrain.py][INFO] Epoch:[0/2](7400/4588595) loss:4.015 lr:0.0002953 epoch_Time:29017.0min: [2024-01-02 
18:27:17,090][model8_pretrain.py][INFO] Epoch:[0/2](7500/4588595) loss:3.705 lr:0.0002952 epoch_Time:29004.0min: [2024-01-02 18:27:17,090][model8_pretrain.py][INFO] Epoch:[0/2](7500/4588595) loss:3.666 lr:0.0002952 epoch_Time:29005.0min: [2024-01-02 18:27:17,090][model8_pretrain.py][INFO] Epoch:[0/2](7500/4588595) loss:3.670 lr:0.0002952 epoch_Time:29005.0min: [2024-01-02 18:27:17,090][model8_pretrain.py][INFO] Epoch:[0/2](7500/4588595) loss:3.764 lr:0.0002952 epoch_Time:29004.0min: [2024-01-02 18:27:17,090][model8_pretrain.py][INFO] Epoch:[0/2](7500/4588595) loss:4.074 lr:0.0002952 epoch_Time:29005.0min: [2024-01-02 18:27:17,090][model8_pretrain.py][INFO] Epoch:[0/2](7500/4588595) loss:4.133 lr:0.0002952 epoch_Time:29005.0min: [2024-01-02 18:27:17,090][model8_pretrain.py][INFO] Epoch:[0/2](7500/4588595) loss:3.494 lr:0.0002952 epoch_Time:29005.0min: [2024-01-02 18:27:17,090][model8_pretrain.py][INFO] Epoch:[0/2](7500/4588595) loss:3.784 lr:0.0002952 epoch_Time:29005.0min: [2024-01-02 18:27:53,986][model8_pretrain.py][INFO] Epoch:[0/2](7600/4588595) loss:3.256 lr:0.0002950 epoch_Time:28993.0min: [2024-01-02 18:27:53,986][model8_pretrain.py][INFO] Epoch:[0/2](7600/4588595) loss:3.736 lr:0.0002950 epoch_Time:28992.0min: [2024-01-02 18:27:53,986][model8_pretrain.py][INFO] Epoch:[0/2](7600/4588595) loss:3.716 lr:0.0002950 epoch_Time:28993.0min: [2024-01-02 18:27:53,986][model8_pretrain.py][INFO] Epoch:[0/2](7600/4588595) loss:3.439 lr:0.0002950 epoch_Time:28993.0min: [2024-01-02 18:27:53,986][model8_pretrain.py][INFO] Epoch:[0/2](7600/4588595) loss:3.832 lr:0.0002950 epoch_Time:28992.0min: [2024-01-02 18:27:53,986][model8_pretrain.py][INFO] Epoch:[0/2](7600/4588595) loss:3.756 lr:0.0002950 epoch_Time:28993.0min: [2024-01-02 18:27:53,986][model8_pretrain.py][INFO] Epoch:[0/2](7600/4588595) loss:3.926 lr:0.0002950 epoch_Time:28993.0min: [2024-01-02 18:27:53,986][model8_pretrain.py][INFO] Epoch:[0/2](7600/4588595) loss:3.996 lr:0.0002950 epoch_Time:28993.0min: [2024-01-02 
18:28:34,496][model8_pretrain.py][INFO] Epoch:[0/2](7700/4588595) loss:3.857 lr:0.0002949 epoch_Time:29018.0min: [2024-01-02 18:28:34,496][model8_pretrain.py][INFO] Epoch:[0/2](7700/4588595) loss:3.938 lr:0.0002949 epoch_Time:29018.0min: [2024-01-02 18:28:34,496][model8_pretrain.py][INFO] Epoch:[0/2](7700/4588595) loss:3.535 lr:0.0002949 epoch_Time:29018.0min: [2024-01-02 18:28:34,496][model8_pretrain.py][INFO] Epoch:[0/2](7700/4588595) loss:3.621 lr:0.0002949 epoch_Time:29018.0min: [2024-01-02 18:28:34,497][model8_pretrain.py][INFO] Epoch:[0/2](7700/4588595) loss:4.197 lr:0.0002949 epoch_Time:29018.0min: [2024-01-02 18:28:34,497][model8_pretrain.py][INFO] Epoch:[0/2](7700/4588595) loss:3.915 lr:0.0002949 epoch_Time:29018.0min: [2024-01-02 18:28:34,497][model8_pretrain.py][INFO] Epoch:[0/2](7700/4588595) loss:3.774 lr:0.0002949 epoch_Time:29018.0min: [2024-01-02 18:28:34,497][model8_pretrain.py][INFO] Epoch:[0/2](7700/4588595) loss:3.989 lr:0.0002949 epoch_Time:29018.0min: [2024-01-02 18:29:11,432][model8_pretrain.py][INFO] Epoch:[0/2](7800/4588595) loss:3.620 lr:0.0002947 epoch_Time:29007.0min: [2024-01-02 18:29:11,432][model8_pretrain.py][INFO] Epoch:[0/2](7800/4588595) loss:3.514 lr:0.0002947 epoch_Time:29006.0min: [2024-01-02 18:29:11,432][model8_pretrain.py][INFO] Epoch:[0/2](7800/4588595) loss:3.894 lr:0.0002947 epoch_Time:29006.0min: [2024-01-02 18:29:11,432][model8_pretrain.py][INFO] Epoch:[0/2](7800/4588595) loss:3.167 lr:0.0002947 epoch_Time:29006.0min: [2024-01-02 18:29:11,432][model8_pretrain.py][INFO] Epoch:[0/2](7800/4588595) loss:3.726 lr:0.0002947 epoch_Time:29006.0min: [2024-01-02 18:29:11,432][model8_pretrain.py][INFO] Epoch:[0/2](7800/4588595) loss:3.682 lr:0.0002947 epoch_Time:29006.0min: [2024-01-02 18:29:11,432][model8_pretrain.py][INFO] Epoch:[0/2](7800/4588595) loss:3.547 lr:0.0002947 epoch_Time:29006.0min: [2024-01-02 18:29:11,432][model8_pretrain.py][INFO] Epoch:[0/2](7800/4588595) loss:4.107 lr:0.0002947 epoch_Time:29006.0min: [2024-01-02 
18:29:48,330][model8_pretrain.py][INFO] Epoch:[0/2](7900/4588595) loss:4.058 lr:0.0002946 epoch_Time:28995.0min: [2024-01-02 18:29:48,330][model8_pretrain.py][INFO] Epoch:[0/2](7900/4588595) loss:3.849 lr:0.0002946 epoch_Time:28995.0min: [2024-01-02 18:29:48,330][model8_pretrain.py][INFO] Epoch:[0/2](7900/4588595) loss:4.063 lr:0.0002946 epoch_Time:28995.0min: [2024-01-02 18:29:48,330][model8_pretrain.py][INFO] Epoch:[0/2](7900/4588595) loss:3.222 lr:0.0002946 epoch_Time:28995.0min: [2024-01-02 18:29:48,330][model8_pretrain.py][INFO] Epoch:[0/2](7900/4588595) loss:3.717 lr:0.0002946 epoch_Time:28994.0min: [2024-01-02 18:29:48,330][model8_pretrain.py][INFO] Epoch:[0/2](7900/4588595) loss:3.323 lr:0.0002946 epoch_Time:28995.0min: [2024-01-02 18:29:48,330][model8_pretrain.py][INFO] Epoch:[0/2](7900/4588595) loss:3.559 lr:0.0002946 epoch_Time:28995.0min: [2024-01-02 18:29:48,330][model8_pretrain.py][INFO] Epoch:[0/2](7900/4588595) loss:4.106 lr:0.0002946 epoch_Time:28995.0min: [2024-01-02 18:30:25,272][model8_pretrain.py][INFO] Epoch:[0/2](8000/4588595) loss:3.686 lr:0.0002944 epoch_Time:28985.0min: [2024-01-02 18:30:25,272][model8_pretrain.py][INFO] Epoch:[0/2](8000/4588595) loss:3.320 lr:0.0002944 epoch_Time:28985.0min: [2024-01-02 18:30:25,272][model8_pretrain.py][INFO] Epoch:[0/2](8000/4588595) loss:4.100 lr:0.0002944 epoch_Time:28985.0min: [2024-01-02 18:30:25,272][model8_pretrain.py][INFO] Epoch:[0/2](8000/4588595) loss:3.714 lr:0.0002944 epoch_Time:28985.0min: [2024-01-02 18:30:25,273][model8_pretrain.py][INFO] Epoch:[0/2](8000/4588595) loss:3.359 lr:0.0002944 epoch_Time:28985.0min: [2024-01-02 18:30:25,273][model8_pretrain.py][INFO] Epoch:[0/2](8000/4588595) loss:3.622 lr:0.0002944 epoch_Time:28985.0min: [2024-01-02 18:30:25,273][model8_pretrain.py][INFO] Epoch:[0/2](8000/4588595) loss:3.691 lr:0.0002944 epoch_Time:28985.0min: [2024-01-02 18:30:25,273][model8_pretrain.py][INFO] Epoch:[0/2](8000/4588595) loss:3.247 lr:0.0002944 epoch_Time:28985.0min: [2024-01-02 
18:31:02,203][model8_pretrain.py][INFO] Epoch:[0/2](8100/4588595) loss:3.963 lr:0.0002943 epoch_Time:28974.0min: [2024-01-02 18:31:02,203][model8_pretrain.py][INFO] Epoch:[0/2](8100/4588595) loss:3.917 lr:0.0002943 epoch_Time:28974.0min: [2024-01-02 18:31:02,203][model8_pretrain.py][INFO] Epoch:[0/2](8100/4588595) loss:3.364 lr:0.0002943 epoch_Time:28974.0min: [2024-01-02 18:31:02,203][model8_pretrain.py][INFO] Epoch:[0/2](8100/4588595) loss:3.726 lr:0.0002943 epoch_Time:28974.0min: [2024-01-02 18:31:02,203][model8_pretrain.py][INFO] Epoch:[0/2](8100/4588595) loss:3.884 lr:0.0002943 epoch_Time:28974.0min: [2024-01-02 18:31:02,203][model8_pretrain.py][INFO] Epoch:[0/2](8100/4588595) loss:3.120 lr:0.0002943 epoch_Time:28974.0min: [2024-01-02 18:31:02,203][model8_pretrain.py][INFO] Epoch:[0/2](8100/4588595) loss:3.711 lr:0.0002943 epoch_Time:28974.0min: [2024-01-02 18:31:02,203][model8_pretrain.py][INFO] Epoch:[0/2](8100/4588595) loss:3.832 lr:0.0002943 epoch_Time:28974.0min: [2024-01-02 18:31:39,149][model8_pretrain.py][INFO] Epoch:[0/2](8200/4588595) loss:3.952 lr:0.0002941 epoch_Time:28965.0min: [2024-01-02 18:31:39,149][model8_pretrain.py][INFO] Epoch:[0/2](8200/4588595) loss:3.590 lr:0.0002941 epoch_Time:28965.0min: [2024-01-02 18:31:39,149][model8_pretrain.py][INFO] Epoch:[0/2](8200/4588595) loss:3.829 lr:0.0002941 epoch_Time:28964.0min: [2024-01-02 18:31:39,149][model8_pretrain.py][INFO] Epoch:[0/2](8200/4588595) loss:3.341 lr:0.0002941 epoch_Time:28964.0min: [2024-01-02 18:31:39,149][model8_pretrain.py][INFO] Epoch:[0/2](8200/4588595) loss:3.338 lr:0.0002941 epoch_Time:28965.0min: [2024-01-02 18:31:39,150][model8_pretrain.py][INFO] Epoch:[0/2](8200/4588595) loss:3.435 lr:0.0002941 epoch_Time:28965.0min: [2024-01-02 18:31:39,150][model8_pretrain.py][INFO] Epoch:[0/2](8200/4588595) loss:3.920 lr:0.0002941 epoch_Time:28965.0min: [2024-01-02 18:31:39,150][model8_pretrain.py][INFO] Epoch:[0/2](8200/4588595) loss:3.306 lr:0.0002941 epoch_Time:28965.0min: [2024-01-02 
18:32:16,096][model8_pretrain.py][INFO] Epoch:[0/2](8300/4588595) loss:4.068 lr:0.0002939 epoch_Time:28954.0min: [2024-01-02 18:32:16,096][model8_pretrain.py][INFO] Epoch:[0/2](8300/4588595) loss:3.993 lr:0.0002939 epoch_Time:28955.0min: [2024-01-02 18:32:16,096][model8_pretrain.py][INFO] Epoch:[0/2](8300/4588595) loss:3.984 lr:0.0002939 epoch_Time:28954.0min: [2024-01-02 18:32:16,096][model8_pretrain.py][INFO] Epoch:[0/2](8300/4588595) loss:2.905 lr:0.0002939 epoch_Time:28954.0min: [2024-01-02 18:32:16,096][model8_pretrain.py][INFO] Epoch:[0/2](8300/4588595) loss:3.466 lr:0.0002939 epoch_Time:28954.0min: [2024-01-02 18:32:16,096][model8_pretrain.py][INFO] Epoch:[0/2](8300/4588595) loss:3.888 lr:0.0002939 epoch_Time:28955.0min: [2024-01-02 18:32:16,096][model8_pretrain.py][INFO] Epoch:[0/2](8300/4588595) loss:3.691 lr:0.0002939 epoch_Time:28954.0min: [2024-01-02 18:32:16,096][model8_pretrain.py][INFO] Epoch:[0/2](8300/4588595) loss:3.701 lr:0.0002939 epoch_Time:28954.0min: [2024-01-02 18:32:53,024][model8_pretrain.py][INFO] Epoch:[0/2](8400/4588595) loss:3.714 lr:0.0002938 epoch_Time:28944.0min: [2024-01-02 18:32:53,024][model8_pretrain.py][INFO] Epoch:[0/2](8400/4588595) loss:4.136 lr:0.0002938 epoch_Time:28944.0min: [2024-01-02 18:32:53,024][model8_pretrain.py][INFO] Epoch:[0/2](8400/4588595) loss:3.608 lr:0.0002938 epoch_Time:28944.0min: [2024-01-02 18:32:53,024][model8_pretrain.py][INFO] Epoch:[0/2](8400/4588595) loss:3.630 lr:0.0002938 epoch_Time:28945.0min: [2024-01-02 18:32:53,024][model8_pretrain.py][INFO] Epoch:[0/2](8400/4588595) loss:3.590 lr:0.0002938 epoch_Time:28944.0min: [2024-01-02 18:32:53,024][model8_pretrain.py][INFO] Epoch:[0/2](8400/4588595) loss:3.899 lr:0.0002938 epoch_Time:28944.0min: [2024-01-02 18:32:53,024][model8_pretrain.py][INFO] Epoch:[0/2](8400/4588595) loss:4.098 lr:0.0002938 epoch_Time:28944.0min: [2024-01-02 18:32:53,024][model8_pretrain.py][INFO] Epoch:[0/2](8400/4588595) loss:4.066 lr:0.0002938 epoch_Time:28944.0min: [2024-01-02 
18:33:33,535][model8_pretrain.py][INFO] Epoch:[0/2](8500/4588595) loss:3.603 lr:0.0002936 epoch_Time:28967.0min: [2024-01-02 18:33:33,535][model8_pretrain.py][INFO] Epoch:[0/2](8500/4588595) loss:3.641 lr:0.0002936 epoch_Time:28968.0min: [2024-01-02 18:33:33,535][model8_pretrain.py][INFO] Epoch:[0/2](8500/4588595) loss:3.284 lr:0.0002936 epoch_Time:28968.0min: [2024-01-02 18:33:33,535][model8_pretrain.py][INFO] Epoch:[0/2](8500/4588595) loss:3.639 lr:0.0002936 epoch_Time:28968.0min: [2024-01-02 18:33:33,535][model8_pretrain.py][INFO] Epoch:[0/2](8500/4588595) loss:4.025 lr:0.0002936 epoch_Time:28968.0min: [2024-01-02 18:33:33,535][model8_pretrain.py][INFO] Epoch:[0/2](8500/4588595) loss:3.309 lr:0.0002936 epoch_Time:28968.0min: [2024-01-02 18:33:33,535][model8_pretrain.py][INFO] Epoch:[0/2](8500/4588595) loss:3.742 lr:0.0002936 epoch_Time:28967.0min: [2024-01-02 18:33:33,535][model8_pretrain.py][INFO] Epoch:[0/2](8500/4588595) loss:3.552 lr:0.0002936 epoch_Time:28968.0min: [2024-01-02 18:34:10,470][model8_pretrain.py][INFO] Epoch:[0/2](8600/4588595) loss:4.202 lr:0.0002934 epoch_Time:28957.0min: [2024-01-02 18:34:10,470][model8_pretrain.py][INFO] Epoch:[0/2](8600/4588595) loss:3.735 lr:0.0002934 epoch_Time:28958.0min: [2024-01-02 18:34:10,470][model8_pretrain.py][INFO] Epoch:[0/2](8600/4588595) loss:3.561 lr:0.0002934 epoch_Time:28957.0min: [2024-01-02 18:34:10,470][model8_pretrain.py][INFO] Epoch:[0/2](8600/4588595) loss:3.557 lr:0.0002934 epoch_Time:28958.0min: [2024-01-02 18:34:10,471][model8_pretrain.py][INFO] Epoch:[0/2](8600/4588595) loss:4.034 lr:0.0002934 epoch_Time:28958.0min: [2024-01-02 18:34:10,470][model8_pretrain.py][INFO] Epoch:[0/2](8600/4588595) loss:3.867 lr:0.0002934 epoch_Time:28958.0min: [2024-01-02 18:34:10,471][model8_pretrain.py][INFO] Epoch:[0/2](8600/4588595) loss:3.840 lr:0.0002934 epoch_Time:28958.0min: [2024-01-02 18:34:10,471][model8_pretrain.py][INFO] Epoch:[0/2](8600/4588595) loss:3.771 lr:0.0002934 epoch_Time:28958.0min: [2024-01-02 
18:34:47,403][model8_pretrain.py][INFO] Epoch:[0/2](8700/4588595) loss:3.902 lr:0.0002933 epoch_Time:28949.0min: [2024-01-02 18:34:47,403][model8_pretrain.py][INFO] Epoch:[0/2](8700/4588595) loss:3.731 lr:0.0002933 epoch_Time:28949.0min: [2024-01-02 18:34:47,403][model8_pretrain.py][INFO] Epoch:[0/2](8700/4588595) loss:3.838 lr:0.0002933 epoch_Time:28949.0min: [2024-01-02 18:34:47,403][model8_pretrain.py][INFO] Epoch:[0/2](8700/4588595) loss:3.836 lr:0.0002933 epoch_Time:28949.0min: [2024-01-02 18:34:47,403][model8_pretrain.py][INFO] Epoch:[0/2](8700/4588595) loss:3.823 lr:0.0002933 epoch_Time:28949.0min: [2024-01-02 18:34:47,403][model8_pretrain.py][INFO] Epoch:[0/2](8700/4588595) loss:3.765 lr:0.0002933 epoch_Time:28949.0min: [2024-01-02 18:34:47,403][model8_pretrain.py][INFO] Epoch:[0/2](8700/4588595) loss:3.295 lr:0.0002933 epoch_Time:28949.0min: [2024-01-02 18:34:47,403][model8_pretrain.py][INFO] Epoch:[0/2](8700/4588595) loss:3.829 lr:0.0002933 epoch_Time:28949.0min: [2024-01-02 18:35:24,351][model8_pretrain.py][INFO] Epoch:[0/2](8800/4588595) loss:3.797 lr:0.0002931 epoch_Time:28939.0min: [2024-01-02 18:35:24,351][model8_pretrain.py][INFO] Epoch:[0/2](8800/4588595) loss:3.843 lr:0.0002931 epoch_Time:28939.0min: [2024-01-02 18:35:24,351][model8_pretrain.py][INFO] Epoch:[0/2](8800/4588595) loss:3.789 lr:0.0002931 epoch_Time:28939.0min: [2024-01-02 18:35:24,351][model8_pretrain.py][INFO] Epoch:[0/2](8800/4588595) loss:3.351 lr:0.0002931 epoch_Time:28939.0min: [2024-01-02 18:35:24,351][model8_pretrain.py][INFO] Epoch:[0/2](8800/4588595) loss:3.710 lr:0.0002931 epoch_Time:28939.0min: [2024-01-02 18:35:24,351][model8_pretrain.py][INFO] Epoch:[0/2](8800/4588595) loss:3.744 lr:0.0002931 epoch_Time:28939.0min: [2024-01-02 18:35:24,351][model8_pretrain.py][INFO] Epoch:[0/2](8800/4588595) loss:3.740 lr:0.0002931 epoch_Time:28940.0min: [2024-01-02 18:35:24,351][model8_pretrain.py][INFO] Epoch:[0/2](8800/4588595) loss:2.967 lr:0.0002931 epoch_Time:28939.0min: [2024-01-02 
18:36:01,287][model8_pretrain.py][INFO] Epoch:[0/2](8900/4588595) loss:3.563 lr:0.0002929 epoch_Time:28930.0min: [2024-01-02 18:36:01,287][model8_pretrain.py][INFO] Epoch:[0/2](8900/4588595) loss:3.689 lr:0.0002929 epoch_Time:28930.0min: [2024-01-02 18:36:01,287][model8_pretrain.py][INFO] Epoch:[0/2](8900/4588595) loss:3.467 lr:0.0002929 epoch_Time:28930.0min: [2024-01-02 18:36:01,287][model8_pretrain.py][INFO] Epoch:[0/2](8900/4588595) loss:4.222 lr:0.0002929 epoch_Time:28930.0min: [2024-01-02 18:36:01,287][model8_pretrain.py][INFO] Epoch:[0/2](8900/4588595) loss:3.606 lr:0.0002929 epoch_Time:28930.0min: [2024-01-02 18:36:01,287][model8_pretrain.py][INFO] Epoch:[0/2](8900/4588595) loss:3.691 lr:0.0002929 epoch_Time:28930.0min: [2024-01-02 18:36:01,287][model8_pretrain.py][INFO] Epoch:[0/2](8900/4588595) loss:3.809 lr:0.0002929 epoch_Time:28930.0min: [2024-01-02 18:36:01,288][model8_pretrain.py][INFO] Epoch:[0/2](8900/4588595) loss:3.574 lr:0.0002929 epoch_Time:28930.0min: [2024-01-02 18:36:38,215][model8_pretrain.py][INFO] Epoch:[0/2](9000/4588595) loss:3.752 lr:0.0002927 epoch_Time:28922.0min: [2024-01-02 18:36:38,215][model8_pretrain.py][INFO] Epoch:[0/2](9000/4588595) loss:3.771 lr:0.0002927 epoch_Time:28922.0min: [2024-01-02 18:36:38,215][model8_pretrain.py][INFO] Epoch:[0/2](9000/4588595) loss:3.828 lr:0.0002927 epoch_Time:28922.0min: [2024-01-02 18:36:38,215][model8_pretrain.py][INFO] Epoch:[0/2](9000/4588595) loss:3.761 lr:0.0002927 epoch_Time:28922.0min: [2024-01-02 18:36:38,215][model8_pretrain.py][INFO] Epoch:[0/2](9000/4588595) loss:3.863 lr:0.0002927 epoch_Time:28922.0min: [2024-01-02 18:36:38,215][model8_pretrain.py][INFO] Epoch:[0/2](9000/4588595) loss:4.197 lr:0.0002927 epoch_Time:28922.0min: [2024-01-02 18:36:38,215][model8_pretrain.py][INFO] Epoch:[0/2](9000/4588595) loss:3.897 lr:0.0002927 epoch_Time:28922.0min: [2024-01-02 18:36:38,215][model8_pretrain.py][INFO] Epoch:[0/2](9000/4588595) loss:3.940 lr:0.0002927 epoch_Time:28922.0min: [2024-01-02 
18:37:15,181][model8_pretrain.py][INFO] Epoch:[0/2](9100/4588595) loss:3.677 lr:0.0002925 epoch_Time:28913.0min: [2024-01-02 18:37:15,181][model8_pretrain.py][INFO] Epoch:[0/2](9100/4588595) loss:3.714 lr:0.0002925 epoch_Time:28913.0min: [2024-01-02 18:37:15,181][model8_pretrain.py][INFO] Epoch:[0/2](9100/4588595) loss:3.938 lr:0.0002925 epoch_Time:28913.0min: [2024-01-02 18:37:15,181][model8_pretrain.py][INFO] Epoch:[0/2](9100/4588595) loss:3.636 lr:0.0002925 epoch_Time:28913.0min: [2024-01-02 18:37:15,181][model8_pretrain.py][INFO] Epoch:[0/2](9100/4588595) loss:3.764 lr:0.0002925 epoch_Time:28913.0min: [2024-01-02 18:37:15,181][model8_pretrain.py][INFO] Epoch:[0/2](9100/4588595) loss:3.303 lr:0.0002925 epoch_Time:28913.0min: [2024-01-02 18:37:15,181][model8_pretrain.py][INFO] Epoch:[0/2](9100/4588595) loss:3.653 lr:0.0002925 epoch_Time:28913.0min: [2024-01-02 18:37:15,181][model8_pretrain.py][INFO] Epoch:[0/2](9100/4588595) loss:3.656 lr:0.0002925 epoch_Time:28913.0min: [2024-01-02 18:37:52,124][model8_pretrain.py][INFO] Epoch:[0/2](9200/4588595) loss:3.910 lr:0.0002924 epoch_Time:28904.0min: [2024-01-02 18:37:52,124][model8_pretrain.py][INFO] Epoch:[0/2](9200/4588595) loss:3.959 lr:0.0002924 epoch_Time:28904.0min: [2024-01-02 18:37:52,124][model8_pretrain.py][INFO] Epoch:[0/2](9200/4588595) loss:4.115 lr:0.0002924 epoch_Time:28904.0min: [2024-01-02 18:37:52,124][model8_pretrain.py][INFO] Epoch:[0/2](9200/4588595) loss:3.401 lr:0.0002924 epoch_Time:28904.0min: [2024-01-02 18:37:52,124][model8_pretrain.py][INFO] Epoch:[0/2](9200/4588595) loss:3.771 lr:0.0002924 epoch_Time:28904.0min: [2024-01-02 18:37:52,124][model8_pretrain.py][INFO] Epoch:[0/2](9200/4588595) loss:3.797 lr:0.0002924 epoch_Time:28904.0min: [2024-01-02 18:37:52,125][model8_pretrain.py][INFO] Epoch:[0/2](9200/4588595) loss:4.079 lr:0.0002924 epoch_Time:28904.0min: [2024-01-02 18:37:52,126][model8_pretrain.py][INFO] Epoch:[0/2](9200/4588595) loss:3.624 lr:0.0002924 epoch_Time:28904.0min: [2024-01-02 
18:38:32,671][model8_pretrain.py][INFO] Epoch:[0/2](9300/4588595) loss:4.019 lr:0.0002922 epoch_Time:28926.0min: [2024-01-02 18:38:32,671][model8_pretrain.py][INFO] Epoch:[0/2](9300/4588595) loss:3.756 lr:0.0002922 epoch_Time:28926.0min: [2024-01-02 18:38:32,671][model8_pretrain.py][INFO] Epoch:[0/2](9300/4588595) loss:3.375 lr:0.0002922 epoch_Time:28926.0min: [2024-01-02 18:38:32,671][model8_pretrain.py][INFO] Epoch:[0/2](9300/4588595) loss:3.909 lr:0.0002922 epoch_Time:28926.0min: [2024-01-02 18:38:32,671][model8_pretrain.py][INFO] Epoch:[0/2](9300/4588595) loss:3.501 lr:0.0002922 epoch_Time:28926.0min: [2024-01-02 18:38:32,671][model8_pretrain.py][INFO] Epoch:[0/2](9300/4588595) loss:4.206 lr:0.0002922 epoch_Time:28926.0min: [2024-01-02 18:38:32,671][model8_pretrain.py][INFO] Epoch:[0/2](9300/4588595) loss:3.761 lr:0.0002922 epoch_Time:28926.0min: [2024-01-02 18:38:32,671][model8_pretrain.py][INFO] Epoch:[0/2](9300/4588595) loss:4.338 lr:0.0002922 epoch_Time:28926.0min: [2024-01-02 18:39:09,591][model8_pretrain.py][INFO] Epoch:[0/2](9400/4588595) loss:3.856 lr:0.0002920 epoch_Time:28917.0min: [2024-01-02 18:39:09,591][model8_pretrain.py][INFO] Epoch:[0/2](9400/4588595) loss:3.540 lr:0.0002920 epoch_Time:28917.0min: [2024-01-02 18:39:09,591][model8_pretrain.py][INFO] Epoch:[0/2](9400/4588595) loss:3.842 lr:0.0002920 epoch_Time:28917.0min: [2024-01-02 18:39:09,591][model8_pretrain.py][INFO] Epoch:[0/2](9400/4588595) loss:3.661 lr:0.0002920 epoch_Time:28917.0min: [2024-01-02 18:39:09,591][model8_pretrain.py][INFO] Epoch:[0/2](9400/4588595) loss:3.959 lr:0.0002920 epoch_Time:28917.0min: [2024-01-02 18:39:09,592][model8_pretrain.py][INFO] Epoch:[0/2](9400/4588595) loss:3.285 lr:0.0002920 epoch_Time:28917.0min: [2024-01-02 18:39:09,592][model8_pretrain.py][INFO] Epoch:[0/2](9400/4588595) loss:3.735 lr:0.0002920 epoch_Time:28917.0min: [2024-01-02 18:39:09,592][model8_pretrain.py][INFO] Epoch:[0/2](9400/4588595) loss:3.701 lr:0.0002920 epoch_Time:28917.0min: [2024-01-02 
18:39:46,504][model8_pretrain.py][INFO] Epoch:[0/2](9500/4588595) loss:3.593 lr:0.0002918 epoch_Time:28909.0min: [2024-01-02 18:39:46,505][model8_pretrain.py][INFO] Epoch:[0/2](9500/4588595) loss:4.021 lr:0.0002918 epoch_Time:28909.0min: [2024-01-02 18:39:46,505][model8_pretrain.py][INFO] Epoch:[0/2](9500/4588595) loss:3.921 lr:0.0002918 epoch_Time:28910.0min: [2024-01-02 18:39:46,505][model8_pretrain.py][INFO] Epoch:[0/2](9500/4588595) loss:3.854 lr:0.0002918 epoch_Time:28909.0min: [2024-01-02 18:39:46,505][model8_pretrain.py][INFO] Epoch:[0/2](9500/4588595) loss:3.896 lr:0.0002918 epoch_Time:28909.0min: [2024-01-02 18:39:46,505][model8_pretrain.py][INFO] Epoch:[0/2](9500/4588595) loss:3.833 lr:0.0002918 epoch_Time:28909.0min: [2024-01-02 18:39:46,505][model8_pretrain.py][INFO] Epoch:[0/2](9500/4588595) loss:4.070 lr:0.0002918 epoch_Time:28909.0min: [2024-01-02 18:39:46,508][model8_pretrain.py][INFO] Epoch:[0/2](9500/4588595) loss:4.288 lr:0.0002918 epoch_Time:28909.0min: [2024-01-02 18:40:23,429][model8_pretrain.py][INFO] Epoch:[0/2](9600/4588595) loss:3.301 lr:0.0002916 epoch_Time:28901.0min: [2024-01-02 18:40:23,429][model8_pretrain.py][INFO] Epoch:[0/2](9600/4588595) loss:3.455 lr:0.0002916 epoch_Time:28901.0min: [2024-01-02 18:40:23,429][model8_pretrain.py][INFO] Epoch:[0/2](9600/4588595) loss:3.910 lr:0.0002916 epoch_Time:28901.0min: [2024-01-02 18:40:23,429][model8_pretrain.py][INFO] Epoch:[0/2](9600/4588595) loss:3.751 lr:0.0002916 epoch_Time:28901.0min: [2024-01-02 18:40:23,429][model8_pretrain.py][INFO] Epoch:[0/2](9600/4588595) loss:3.817 lr:0.0002916 epoch_Time:28901.0min: [2024-01-02 18:40:23,429][model8_pretrain.py][INFO] Epoch:[0/2](9600/4588595) loss:3.889 lr:0.0002916 epoch_Time:28901.0min: [2024-01-02 18:40:23,429][model8_pretrain.py][INFO] Epoch:[0/2](9600/4588595) loss:4.018 lr:0.0002916 epoch_Time:28901.0min: [2024-01-02 18:40:23,429][model8_pretrain.py][INFO] Epoch:[0/2](9600/4588595) loss:3.714 lr:0.0002916 epoch_Time:28901.0min: [2024-01-02 
18:41:00,377][model8_pretrain.py][INFO] Epoch:[0/2](9700/4588595) loss:3.378 lr:0.0002914 epoch_Time:28892.0min: [2024-01-02 18:41:00,377][model8_pretrain.py][INFO] Epoch:[0/2](9700/4588595) loss:3.329 lr:0.0002914 epoch_Time:28892.0min: [2024-01-02 18:41:00,377][model8_pretrain.py][INFO] Epoch:[0/2](9700/4588595) loss:3.830 lr:0.0002914 epoch_Time:28893.0min: [2024-01-02 18:41:00,377][model8_pretrain.py][INFO] Epoch:[0/2](9700/4588595) loss:3.596 lr:0.0002914 epoch_Time:28892.0min: [2024-01-02 18:41:00,377][model8_pretrain.py][INFO] Epoch:[0/2](9700/4588595) loss:3.569 lr:0.0002914 epoch_Time:28892.0min: [2024-01-02 18:41:00,377][model8_pretrain.py][INFO] Epoch:[0/2](9700/4588595) loss:4.068 lr:0.0002914 epoch_Time:28892.0min: [2024-01-02 18:41:00,377][model8_pretrain.py][INFO] Epoch:[0/2](9700/4588595) loss:4.128 lr:0.0002914 epoch_Time:28892.0min: [2024-01-02 18:41:00,377][model8_pretrain.py][INFO] Epoch:[0/2](9700/4588595) loss:3.699 lr:0.0002914 epoch_Time:28893.0min: [2024-01-02 18:41:37,295][model8_pretrain.py][INFO] Epoch:[0/2](9800/4588595) loss:3.545 lr:0.0002912 epoch_Time:28885.0min: [2024-01-02 18:41:37,295][model8_pretrain.py][INFO] Epoch:[0/2](9800/4588595) loss:4.121 lr:0.0002912 epoch_Time:28885.0min: [2024-01-02 18:41:37,295][model8_pretrain.py][INFO] Epoch:[0/2](9800/4588595) loss:3.119 lr:0.0002912 epoch_Time:28885.0min: [2024-01-02 18:41:37,295][model8_pretrain.py][INFO] Epoch:[0/2](9800/4588595) loss:3.506 lr:0.0002912 epoch_Time:28885.0min: [2024-01-02 18:41:37,295][model8_pretrain.py][INFO] Epoch:[0/2](9800/4588595) loss:4.055 lr:0.0002912 epoch_Time:28885.0min: [2024-01-02 18:41:37,295][model8_pretrain.py][INFO] Epoch:[0/2](9800/4588595) loss:3.545 lr:0.0002912 epoch_Time:28885.0min: [2024-01-02 18:41:37,295][model8_pretrain.py][INFO] Epoch:[0/2](9800/4588595) loss:3.472 lr:0.0002912 epoch_Time:28885.0min: [2024-01-02 18:41:37,295][model8_pretrain.py][INFO] Epoch:[0/2](9800/4588595) loss:3.895 lr:0.0002912 epoch_Time:28885.0min: [2024-01-02 
18:42:14,216][model8_pretrain.py][INFO] Epoch:[0/2](9900/4588595) loss:3.890 lr:0.0002910 epoch_Time:28877.0min: [2024-01-02 18:42:14,216][model8_pretrain.py][INFO] Epoch:[0/2](9900/4588595) loss:3.443 lr:0.0002910 epoch_Time:28877.0min: [2024-01-02 18:42:14,216][model8_pretrain.py][INFO] Epoch:[0/2](9900/4588595) loss:3.932 lr:0.0002910 epoch_Time:28877.0min: [2024-01-02 18:42:14,216][model8_pretrain.py][INFO] Epoch:[0/2](9900/4588595) loss:3.698 lr:0.0002910 epoch_Time:28877.0min: [2024-01-02 18:42:14,216][model8_pretrain.py][INFO] Epoch:[0/2](9900/4588595) loss:3.459 lr:0.0002910 epoch_Time:28877.0min: [2024-01-02 18:42:14,216][model8_pretrain.py][INFO] Epoch:[0/2](9900/4588595) loss:3.873 lr:0.0002910 epoch_Time:28877.0min: [2024-01-02 18:42:14,216][model8_pretrain.py][INFO] Epoch:[0/2](9900/4588595) loss:3.951 lr:0.0002910 epoch_Time:28877.0min: [2024-01-02 18:42:14,216][model8_pretrain.py][INFO] Epoch:[0/2](9900/4588595) loss:3.498 lr:0.0002910 epoch_Time:28877.0min: [2024-01-02 18:42:51,144][model8_pretrain.py][INFO] Epoch:[0/2](10000/4588595) loss:3.792 lr:0.0002908 epoch_Time:28869.0min: [2024-01-02 18:42:51,144][model8_pretrain.py][INFO] Epoch:[0/2](10000/4588595) loss:3.507 lr:0.0002908 epoch_Time:28869.0min: [2024-01-02 18:42:51,144][model8_pretrain.py][INFO] Epoch:[0/2](10000/4588595) loss:3.585 lr:0.0002908 epoch_Time:28869.0min: [2024-01-02 18:42:51,144][model8_pretrain.py][INFO] Epoch:[0/2](10000/4588595) loss:3.384 lr:0.0002908 epoch_Time:28869.0min: [2024-01-02 18:42:51,144][model8_pretrain.py][INFO] Epoch:[0/2](10000/4588595) loss:3.439 lr:0.0002908 epoch_Time:28869.0min: [2024-01-02 18:42:51,144][model8_pretrain.py][INFO] Epoch:[0/2](10000/4588595) loss:4.211 lr:0.0002908 epoch_Time:28869.0min: [2024-01-02 18:42:51,144][model8_pretrain.py][INFO] Epoch:[0/2](10000/4588595) loss:3.423 lr:0.0002908 epoch_Time:28869.0min: [2024-01-02 18:42:51,144][model8_pretrain.py][INFO] Epoch:[0/2](10000/4588595) loss:3.315 lr:0.0002908 epoch_Time:28869.0min: 
[2024-01-02 18:43:31,670][model8_pretrain.py][INFO] Epoch:[0/2](10100/4588595) loss:4.037 lr:0.0002906 epoch_Time:28889.0min: [2024-01-02 18:43:31,670][model8_pretrain.py][INFO] Epoch:[0/2](10100/4588595) loss:3.991 lr:0.0002906 epoch_Time:28889.0min: [2024-01-02 18:43:31,670][model8_pretrain.py][INFO] Epoch:[0/2](10100/4588595) loss:4.083 lr:0.0002906 epoch_Time:28889.0min: [2024-01-02 18:43:31,670][model8_pretrain.py][INFO] Epoch:[0/2](10100/4588595) loss:4.336 lr:0.0002906 epoch_Time:28889.0min: [2024-01-02 18:43:31,670][model8_pretrain.py][INFO] Epoch:[0/2](10100/4588595) loss:3.546 lr:0.0002906 epoch_Time:28890.0min: [2024-01-02 18:43:31,670][model8_pretrain.py][INFO] Epoch:[0/2](10100/4588595) loss:3.940 lr:0.0002906 epoch_Time:28889.0min: [2024-01-02 18:43:31,670][model8_pretrain.py][INFO] Epoch:[0/2](10100/4588595) loss:3.781 lr:0.0002906 epoch_Time:28889.0min: [2024-01-02 18:43:31,670][model8_pretrain.py][INFO] Epoch:[0/2](10100/4588595) loss:3.752 lr:0.0002906 epoch_Time:28889.0min: [2024-01-02 18:44:08,592][model8_pretrain.py][INFO] Epoch:[0/2](10200/4588595) loss:3.379 lr:0.0002904 epoch_Time:28881.0min: [2024-01-02 18:44:08,592][model8_pretrain.py][INFO] Epoch:[0/2](10200/4588595) loss:3.727 lr:0.0002904 epoch_Time:28881.0min: [2024-01-02 18:44:08,592][model8_pretrain.py][INFO] Epoch:[0/2](10200/4588595) loss:3.326 lr:0.0002904 epoch_Time:28881.0min: [2024-01-02 18:44:08,592][model8_pretrain.py][INFO] Epoch:[0/2](10200/4588595) loss:3.597 lr:0.0002904 epoch_Time:28881.0min: [2024-01-02 18:44:08,592][model8_pretrain.py][INFO] Epoch:[0/2](10200/4588595) loss:3.569 lr:0.0002904 epoch_Time:28881.0min: [2024-01-02 18:44:08,592][model8_pretrain.py][INFO] Epoch:[0/2](10200/4588595) loss:4.085 lr:0.0002904 epoch_Time:28881.0min: [2024-01-02 18:44:08,592][model8_pretrain.py][INFO] Epoch:[0/2](10200/4588595) loss:3.851 lr:0.0002904 epoch_Time:28881.0min: [2024-01-02 18:44:08,592][model8_pretrain.py][INFO] Epoch:[0/2](10200/4588595) loss:3.973 lr:0.0002904 
epoch_Time:28881.0min: [2024-01-02 18:44:45,514][model8_pretrain.py][INFO] Epoch:[0/2](10300/4588595) loss:3.714 lr:0.0002902 epoch_Time:28874.0min: [2024-01-02 18:44:45,514][model8_pretrain.py][INFO] Epoch:[0/2](10300/4588595) loss:2.949 lr:0.0002902 epoch_Time:28874.0min: [2024-01-02 18:44:45,514][model8_pretrain.py][INFO] Epoch:[0/2](10300/4588595) loss:3.172 lr:0.0002902 epoch_Time:28874.0min: [2024-01-02 18:44:45,514][model8_pretrain.py][INFO] Epoch:[0/2](10300/4588595) loss:4.058 lr:0.0002902 epoch_Time:28874.0min: [2024-01-02 18:44:45,514][model8_pretrain.py][INFO] Epoch:[0/2](10300/4588595) loss:3.466 lr:0.0002902 epoch_Time:28875.0min: [2024-01-02 18:44:45,514][model8_pretrain.py][INFO] Epoch:[0/2](10300/4588595) loss:3.749 lr:0.0002902 epoch_Time:28874.0min: [2024-01-02 18:44:45,514][model8_pretrain.py][INFO] Epoch:[0/2](10300/4588595) loss:3.876 lr:0.0002902 epoch_Time:28874.0min: [2024-01-02 18:44:45,514][model8_pretrain.py][INFO] Epoch:[0/2](10300/4588595) loss:3.299 lr:0.0002902 epoch_Time:28874.0min: [2024-01-02 18:45:22,453][model8_pretrain.py][INFO] Epoch:[0/2](10400/4588595) loss:3.642 lr:0.0002900 epoch_Time:28867.0min: [2024-01-02 18:45:22,453][model8_pretrain.py][INFO] Epoch:[0/2](10400/4588595) loss:3.176 lr:0.0002900 epoch_Time:28867.0min: [2024-01-02 18:45:22,453][model8_pretrain.py][INFO] Epoch:[0/2](10400/4588595) loss:3.977 lr:0.0002900 epoch_Time:28867.0min: [2024-01-02 18:45:22,453][model8_pretrain.py][INFO] Epoch:[0/2](10400/4588595) loss:3.553 lr:0.0002900 epoch_Time:28867.0min: [2024-01-02 18:45:22,453][model8_pretrain.py][INFO] Epoch:[0/2](10400/4588595) loss:3.416 lr:0.0002900 epoch_Time:28867.0min: [2024-01-02 18:45:22,453][model8_pretrain.py][INFO] Epoch:[0/2](10400/4588595) loss:3.594 lr:0.0002900 epoch_Time:28867.0min: [2024-01-02 18:45:22,453][model8_pretrain.py][INFO] Epoch:[0/2](10400/4588595) loss:4.030 lr:0.0002900 epoch_Time:28867.0min: [2024-01-02 18:45:22,453][model8_pretrain.py][INFO] Epoch:[0/2](10400/4588595) 
loss:3.959 lr:0.0002900 epoch_Time:28867.0min: [2024-01-02 18:45:59,376][model8_pretrain.py][INFO] Epoch:[0/2](10500/4588595) loss:3.559 lr:0.0002898 epoch_Time:28859.0min: [2024-01-02 18:45:59,377][model8_pretrain.py][INFO] Epoch:[0/2](10500/4588595) loss:4.087 lr:0.0002898 epoch_Time:28859.0min: [2024-01-02 18:45:59,376][model8_pretrain.py][INFO] Epoch:[0/2](10500/4588595) loss:3.434 lr:0.0002898 epoch_Time:28859.0min: [2024-01-02 18:45:59,377][model8_pretrain.py][INFO] Epoch:[0/2](10500/4588595) loss:3.943 lr:0.0002898 epoch_Time:28859.0min: [2024-01-02 18:45:59,377][model8_pretrain.py][INFO] Epoch:[0/2](10500/4588595) loss:3.417 lr:0.0002898 epoch_Time:28859.0min: [2024-01-02 18:45:59,377][model8_pretrain.py][INFO] Epoch:[0/2](10500/4588595) loss:3.440 lr:0.0002898 epoch_Time:28859.0min: [2024-01-02 18:45:59,377][model8_pretrain.py][INFO] Epoch:[0/2](10500/4588595) loss:4.004 lr:0.0002898 epoch_Time:28859.0min: [2024-01-02 18:45:59,377][model8_pretrain.py][INFO] Epoch:[0/2](10500/4588595) loss:3.921 lr:0.0002898 epoch_Time:28859.0min: [2024-01-02 18:46:36,298][model8_pretrain.py][INFO] Epoch:[0/2](10600/4588595) loss:3.944 lr:0.0002896 epoch_Time:28853.0min: [2024-01-02 18:46:36,298][model8_pretrain.py][INFO] Epoch:[0/2](10600/4588595) loss:3.130 lr:0.0002896 epoch_Time:28853.0min: [2024-01-02 18:46:36,298][model8_pretrain.py][INFO] Epoch:[0/2](10600/4588595) loss:3.882 lr:0.0002896 epoch_Time:28853.0min: [2024-01-02 18:46:36,298][model8_pretrain.py][INFO] Epoch:[0/2](10600/4588595) loss:4.133 lr:0.0002896 epoch_Time:28853.0min: [2024-01-02 18:46:36,298][model8_pretrain.py][INFO] Epoch:[0/2](10600/4588595) loss:3.670 lr:0.0002896 epoch_Time:28853.0min: [2024-01-02 18:46:36,299][model8_pretrain.py][INFO] Epoch:[0/2](10600/4588595) loss:3.682 lr:0.0002896 epoch_Time:28853.0min: [2024-01-02 18:46:36,299][model8_pretrain.py][INFO] Epoch:[0/2](10600/4588595) loss:3.660 lr:0.0002896 epoch_Time:28853.0min: [2024-01-02 18:46:36,299][model8_pretrain.py][INFO] 
Epoch:[0/2](10600/4588595) loss:3.104 lr:0.0002896 epoch_Time:28853.0min: [2024-01-02 18:47:13,210][model8_pretrain.py][INFO] Epoch:[0/2](10700/4588595) loss:3.479 lr:0.0002893 epoch_Time:28845.0min: [2024-01-02 18:47:13,210][model8_pretrain.py][INFO] Epoch:[0/2](10700/4588595) loss:3.982 lr:0.0002893 epoch_Time:28845.0min: [2024-01-02 18:47:13,210][model8_pretrain.py][INFO] Epoch:[0/2](10700/4588595) loss:3.527 lr:0.0002893 epoch_Time:28845.0min: [2024-01-02 18:47:13,210][model8_pretrain.py][INFO] Epoch:[0/2](10700/4588595) loss:3.014 lr:0.0002893 epoch_Time:28845.0min: [2024-01-02 18:47:13,210][model8_pretrain.py][INFO] Epoch:[0/2](10700/4588595) loss:3.457 lr:0.0002893 epoch_Time:28845.0min: [2024-01-02 18:47:13,210][model8_pretrain.py][INFO] Epoch:[0/2](10700/4588595) loss:3.673 lr:0.0002893 epoch_Time:28845.0min: [2024-01-02 18:47:13,210][model8_pretrain.py][INFO] Epoch:[0/2](10700/4588595) loss:3.694 lr:0.0002893 epoch_Time:28845.0min: [2024-01-02 18:47:13,210][model8_pretrain.py][INFO] Epoch:[0/2](10700/4588595) loss:4.203 lr:0.0002893 epoch_Time:28845.0min: [2024-01-02 18:47:50,124][model8_pretrain.py][INFO] Epoch:[0/2](10800/4588595) loss:3.845 lr:0.0002891 epoch_Time:28838.0min: [2024-01-02 18:47:50,124][model8_pretrain.py][INFO] Epoch:[0/2](10800/4588595) loss:3.795 lr:0.0002891 epoch_Time:28838.0min: [2024-01-02 18:47:50,124][model8_pretrain.py][INFO] Epoch:[0/2](10800/4588595) loss:3.773 lr:0.0002891 epoch_Time:28838.0min: [2024-01-02 18:47:50,124][model8_pretrain.py][INFO] Epoch:[0/2](10800/4588595) loss:3.494 lr:0.0002891 epoch_Time:28838.0min: [2024-01-02 18:47:50,124][model8_pretrain.py][INFO] Epoch:[0/2](10800/4588595) loss:4.113 lr:0.0002891 epoch_Time:28838.0min: [2024-01-02 18:47:50,125][model8_pretrain.py][INFO] Epoch:[0/2](10800/4588595) loss:3.422 lr:0.0002891 epoch_Time:28838.0min: [2024-01-02 18:47:50,125][model8_pretrain.py][INFO] Epoch:[0/2](10800/4588595) loss:3.826 lr:0.0002891 epoch_Time:28838.0min: [2024-01-02 
18:47:50,125][model8_pretrain.py][INFO] Epoch:[0/2](10800/4588595) loss:3.976 lr:0.0002891 epoch_Time:28838.0min: [2024-01-02 18:48:30,639][model8_pretrain.py][INFO] Epoch:[0/2](10900/4588595) loss:3.422 lr:0.0002889 epoch_Time:28857.0min: [2024-01-02 18:48:30,639][model8_pretrain.py][INFO] Epoch:[0/2](10900/4588595) loss:3.454 lr:0.0002889 epoch_Time:28857.0min: [2024-01-02 18:48:30,639][model8_pretrain.py][INFO] Epoch:[0/2](10900/4588595) loss:3.834 lr:0.0002889 epoch_Time:28857.0min: [2024-01-02 18:48:30,639][model8_pretrain.py][INFO] Epoch:[0/2](10900/4588595) loss:3.882 lr:0.0002889 epoch_Time:28857.0min: [2024-01-02 18:48:30,639][model8_pretrain.py][INFO] Epoch:[0/2](10900/4588595) loss:3.399 lr:0.0002889 epoch_Time:28857.0min: [2024-01-02 18:48:30,639][model8_pretrain.py][INFO] Epoch:[0/2](10900/4588595) loss:3.874 lr:0.0002889 epoch_Time:28857.0min: [2024-01-02 18:48:30,639][model8_pretrain.py][INFO] Epoch:[0/2](10900/4588595) loss:3.769 lr:0.0002889 epoch_Time:28857.0min: [2024-01-02 18:48:30,639][model8_pretrain.py][INFO] Epoch:[0/2](10900/4588595) loss:3.955 lr:0.0002889 epoch_Time:28857.0min: [2024-01-02 18:49:07,573][model8_pretrain.py][INFO] Epoch:[0/2](11000/4588595) loss:3.860 lr:0.0002887 epoch_Time:28850.0min: [2024-01-02 18:49:07,574][model8_pretrain.py][INFO] Epoch:[0/2](11000/4588595) loss:3.490 lr:0.0002887 epoch_Time:28850.0min: [2024-01-02 18:49:07,574][model8_pretrain.py][INFO] Epoch:[0/2](11000/4588595) loss:3.421 lr:0.0002887 epoch_Time:28850.0min: [2024-01-02 18:49:07,574][model8_pretrain.py][INFO] Epoch:[0/2](11000/4588595) loss:3.467 lr:0.0002887 epoch_Time:28850.0min: [2024-01-02 18:49:07,574][model8_pretrain.py][INFO] Epoch:[0/2](11000/4588595) loss:3.837 lr:0.0002887 epoch_Time:28850.0min: [2024-01-02 18:49:07,574][model8_pretrain.py][INFO] Epoch:[0/2](11000/4588595) loss:3.767 lr:0.0002887 epoch_Time:28850.0min: [2024-01-02 18:49:07,574][model8_pretrain.py][INFO] Epoch:[0/2](11000/4588595) loss:3.846 lr:0.0002887 
epoch_Time:28850.0min: [2024-01-02 18:49:07,574][model8_pretrain.py][INFO] Epoch:[0/2](11000/4588595) loss:2.651 lr:0.0002887 epoch_Time:28850.0min: [2024-01-02 18:49:44,488][model8_pretrain.py][INFO] Epoch:[0/2](11100/4588595) loss:3.469 lr:0.0002885 epoch_Time:28843.0min: [2024-01-02 18:49:44,488][model8_pretrain.py][INFO] Epoch:[0/2](11100/4588595) loss:3.309 lr:0.0002885 epoch_Time:28844.0min: [2024-01-02 18:49:44,488][model8_pretrain.py][INFO] Epoch:[0/2](11100/4588595) loss:3.796 lr:0.0002885 epoch_Time:28843.0min: [2024-01-02 18:49:44,488][model8_pretrain.py][INFO] Epoch:[0/2](11100/4588595) loss:3.775 lr:0.0002885 epoch_Time:28844.0min: [2024-01-02 18:49:44,488][model8_pretrain.py][INFO] Epoch:[0/2](11100/4588595) loss:3.796 lr:0.0002885 epoch_Time:28844.0min: [2024-01-02 18:49:44,488][model8_pretrain.py][INFO] Epoch:[0/2](11100/4588595) loss:3.885 lr:0.0002885 epoch_Time:28844.0min: [2024-01-02 18:49:44,488][model8_pretrain.py][INFO] Epoch:[0/2](11100/4588595) loss:3.743 lr:0.0002885 epoch_Time:28844.0min: [2024-01-02 18:49:44,489][model8_pretrain.py][INFO] Epoch:[0/2](11100/4588595) loss:3.498 lr:0.0002885 epoch_Time:28844.0min: [2024-01-02 18:50:21,412][model8_pretrain.py][INFO] Epoch:[0/2](11200/4588595) loss:3.568 lr:0.0002882 epoch_Time:28836.0min: [2024-01-02 18:50:21,412][model8_pretrain.py][INFO] Epoch:[0/2](11200/4588595) loss:3.779 lr:0.0002882 epoch_Time:28837.0min: [2024-01-02 18:50:21,412][model8_pretrain.py][INFO] Epoch:[0/2](11200/4588595) loss:3.390 lr:0.0002882 epoch_Time:28837.0min: [2024-01-02 18:50:21,412][model8_pretrain.py][INFO] Epoch:[0/2](11200/4588595) loss:3.396 lr:0.0002882 epoch_Time:28837.0min: [2024-01-02 18:50:21,412][model8_pretrain.py][INFO] Epoch:[0/2](11200/4588595) loss:3.749 lr:0.0002882 epoch_Time:28837.0min: [2024-01-02 18:50:21,412][model8_pretrain.py][INFO] Epoch:[0/2](11200/4588595) loss:3.874 lr:0.0002882 epoch_Time:28836.0min: [2024-01-02 18:50:21,412][model8_pretrain.py][INFO] Epoch:[0/2](11200/4588595) 
loss:3.214 lr:0.0002882 epoch_Time:28837.0min: [2024-01-02 18:50:21,412][model8_pretrain.py][INFO] Epoch:[0/2](11200/4588595) loss:3.877 lr:0.0002882 epoch_Time:28837.0min: [2024-01-02 18:50:58,337][model8_pretrain.py][INFO] Epoch:[0/2](11300/4588595) loss:3.694 lr:0.0002880 epoch_Time:28830.0min: [2024-01-02 18:50:58,337][model8_pretrain.py][INFO] Epoch:[0/2](11300/4588595) loss:3.773 lr:0.0002880 epoch_Time:28830.0min: [2024-01-02 18:50:58,337][model8_pretrain.py][INFO] Epoch:[0/2](11300/4588595) loss:4.020 lr:0.0002880 epoch_Time:28830.0min: [2024-01-02 18:50:58,337][model8_pretrain.py][INFO] Epoch:[0/2](11300/4588595) loss:3.576 lr:0.0002880 epoch_Time:28830.0min: [2024-01-02 18:50:58,337][model8_pretrain.py][INFO] Epoch:[0/2](11300/4588595) loss:3.649 lr:0.0002880 epoch_Time:28830.0min: [2024-01-02 18:50:58,337][model8_pretrain.py][INFO] Epoch:[0/2](11300/4588595) loss:3.698 lr:0.0002880 epoch_Time:28830.0min: [2024-01-02 18:50:58,337][model8_pretrain.py][INFO] Epoch:[0/2](11300/4588595) loss:3.662 lr:0.0002880 epoch_Time:28830.0min: [2024-01-02 18:50:58,337][model8_pretrain.py][INFO] Epoch:[0/2](11300/4588595) loss:3.449 lr:0.0002880 epoch_Time:28830.0min: [2024-01-02 18:51:35,273][model8_pretrain.py][INFO] Epoch:[0/2](11400/4588595) loss:3.761 lr:0.0002878 epoch_Time:28824.0min: [2024-01-02 18:51:35,273][model8_pretrain.py][INFO] Epoch:[0/2](11400/4588595) loss:3.449 lr:0.0002878 epoch_Time:28824.0min: [2024-01-02 18:51:35,273][model8_pretrain.py][INFO] Epoch:[0/2](11400/4588595) loss:3.843 lr:0.0002878 epoch_Time:28824.0min: [2024-01-02 18:51:35,273][model8_pretrain.py][INFO] Epoch:[0/2](11400/4588595) loss:4.159 lr:0.0002878 epoch_Time:28824.0min: [2024-01-02 18:51:35,273][model8_pretrain.py][INFO] Epoch:[0/2](11400/4588595) loss:3.301 lr:0.0002878 epoch_Time:28824.0min: [2024-01-02 18:51:35,273][model8_pretrain.py][INFO] Epoch:[0/2](11400/4588595) loss:3.270 lr:0.0002878 epoch_Time:28824.0min: [2024-01-02 18:51:35,274][model8_pretrain.py][INFO] 
Epoch:[0/2](11400/4588595) loss:3.362 lr:0.0002878 epoch_Time:28824.0min: [2024-01-02 18:51:35,274][model8_pretrain.py][INFO] Epoch:[0/2](11400/4588595) loss:3.674 lr:0.0002878 epoch_Time:28824.0min: [2024-01-02 18:52:12,184][model8_pretrain.py][INFO] Epoch:[0/2](11500/4588595) loss:3.605 lr:0.0002875 epoch_Time:28817.0min: [2024-01-02 18:52:12,184][model8_pretrain.py][INFO] Epoch:[0/2](11500/4588595) loss:3.825 lr:0.0002875 epoch_Time:28817.0min: [2024-01-02 18:52:12,184][model8_pretrain.py][INFO] Epoch:[0/2](11500/4588595) loss:3.505 lr:0.0002875 epoch_Time:28817.0min: [2024-01-02 18:52:12,184][model8_pretrain.py][INFO] Epoch:[0/2](11500/4588595) loss:4.026 lr:0.0002875 epoch_Time:28817.0min: [2024-01-02 18:52:12,184][model8_pretrain.py][INFO] Epoch:[0/2](11500/4588595) loss:3.973 lr:0.0002875 epoch_Time:28817.0min: [2024-01-02 18:52:12,184][model8_pretrain.py][INFO] Epoch:[0/2](11500/4588595) loss:3.903 lr:0.0002875 epoch_Time:28817.0min: [2024-01-02 18:52:12,184][model8_pretrain.py][INFO] Epoch:[0/2](11500/4588595) loss:2.985 lr:0.0002875 epoch_Time:28817.0min: [2024-01-02 18:52:12,184][model8_pretrain.py][INFO] Epoch:[0/2](11500/4588595) loss:3.911 lr:0.0002875 epoch_Time:28817.0min: [2024-01-02 18:52:49,109][model8_pretrain.py][INFO] Epoch:[0/2](11600/4588595) loss:3.607 lr:0.0002873 epoch_Time:28810.0min: [2024-01-02 18:52:49,109][model8_pretrain.py][INFO] Epoch:[0/2](11600/4588595) loss:3.243 lr:0.0002873 epoch_Time:28810.0min: [2024-01-02 18:52:49,109][model8_pretrain.py][INFO] Epoch:[0/2](11600/4588595) loss:3.616 lr:0.0002873 epoch_Time:28810.0min: [2024-01-02 18:52:49,109][model8_pretrain.py][INFO] Epoch:[0/2](11600/4588595) loss:3.998 lr:0.0002873 epoch_Time:28811.0min: [2024-01-02 18:52:49,109][model8_pretrain.py][INFO] Epoch:[0/2](11600/4588595) loss:3.424 lr:0.0002873 epoch_Time:28811.0min: [2024-01-02 18:52:49,109][model8_pretrain.py][INFO] Epoch:[0/2](11600/4588595) loss:3.268 lr:0.0002873 epoch_Time:28810.0min: [2024-01-02 
18:52:49,109][model8_pretrain.py][INFO] Epoch:[0/2](11600/4588595) loss:3.834 lr:0.0002873 epoch_Time:28810.0min: [2024-01-02 18:52:49,110][model8_pretrain.py][INFO] Epoch:[0/2](11600/4588595) loss:3.168 lr:0.0002873 epoch_Time:28810.0min: [2024-01-02 18:53:29,680][model8_pretrain.py][INFO] Epoch:[0/2](11700/4588595) loss:3.441 lr:0.0002871 epoch_Time:28829.0min: [2024-01-02 18:53:29,680][model8_pretrain.py][INFO] Epoch:[0/2](11700/4588595) loss:3.932 lr:0.0002871 epoch_Time:28829.0min: [2024-01-02 18:53:29,680][model8_pretrain.py][INFO] Epoch:[0/2](11700/4588595) loss:3.616 lr:0.0002871 epoch_Time:28829.0min: [2024-01-02 18:53:29,680][model8_pretrain.py][INFO] Epoch:[0/2](11700/4588595) loss:3.615 lr:0.0002871 epoch_Time:28829.0min: [2024-01-02 18:53:29,680][model8_pretrain.py][INFO] Epoch:[0/2](11700/4588595) loss:3.622 lr:0.0002871 epoch_Time:28829.0min: [2024-01-02 18:53:29,680][model8_pretrain.py][INFO] Epoch:[0/2](11700/4588595) loss:3.843 lr:0.0002871 epoch_Time:28829.0min: [2024-01-02 18:53:29,680][model8_pretrain.py][INFO] Epoch:[0/2](11700/4588595) loss:3.981 lr:0.0002871 epoch_Time:28829.0min: [2024-01-02 18:53:29,680][model8_pretrain.py][INFO] Epoch:[0/2](11700/4588595) loss:3.113 lr:0.0002871 epoch_Time:28829.0min: [2024-01-02 18:54:06,636][model8_pretrain.py][INFO] Epoch:[0/2](11800/4588595) loss:3.529 lr:0.0002868 epoch_Time:28822.0min: [2024-01-02 18:54:06,636][model8_pretrain.py][INFO] Epoch:[0/2](11800/4588595) loss:3.627 lr:0.0002868 epoch_Time:28822.0min: [2024-01-02 18:54:06,636][model8_pretrain.py][INFO] Epoch:[0/2](11800/4588595) loss:3.424 lr:0.0002868 epoch_Time:28823.0min: [2024-01-02 18:54:06,636][model8_pretrain.py][INFO] Epoch:[0/2](11800/4588595) loss:3.481 lr:0.0002868 epoch_Time:28822.0min: [2024-01-02 18:54:06,636][model8_pretrain.py][INFO] Epoch:[0/2](11800/4588595) loss:3.595 lr:0.0002868 epoch_Time:28822.0min: [2024-01-02 18:54:06,636][model8_pretrain.py][INFO] Epoch:[0/2](11800/4588595) loss:3.837 lr:0.0002868 
epoch_Time:28822.0min: [2024-01-02 18:54:06,636][model8_pretrain.py][INFO] Epoch:[0/2](11800/4588595) loss:3.471 lr:0.0002868 epoch_Time:28822.0min: [2024-01-02 18:54:06,636][model8_pretrain.py][INFO] Epoch:[0/2](11800/4588595) loss:4.196 lr:0.0002868 epoch_Time:28822.0min: [2024-01-02 18:54:43,565][model8_pretrain.py][INFO] Epoch:[0/2](11900/4588595) loss:3.548 lr:0.0002866 epoch_Time:28817.0min: [2024-01-02 18:54:43,565][model8_pretrain.py][INFO] Epoch:[0/2](11900/4588595) loss:3.423 lr:0.0002866 epoch_Time:28817.0min: [2024-01-02 18:54:43,565][model8_pretrain.py][INFO] Epoch:[0/2](11900/4588595) loss:4.143 lr:0.0002866 epoch_Time:28817.0min: [2024-01-02 18:54:43,565][model8_pretrain.py][INFO] Epoch:[0/2](11900/4588595) loss:3.327 lr:0.0002866 epoch_Time:28817.0min: [2024-01-02 18:54:43,565][model8_pretrain.py][INFO] Epoch:[0/2](11900/4588595) loss:3.793 lr:0.0002866 epoch_Time:28817.0min: [2024-01-02 18:54:43,565][model8_pretrain.py][INFO] Epoch:[0/2](11900/4588595) loss:3.650 lr:0.0002866 epoch_Time:28817.0min: [2024-01-02 18:54:43,565][model8_pretrain.py][INFO] Epoch:[0/2](11900/4588595) loss:3.600 lr:0.0002866 epoch_Time:28817.0min: [2024-01-02 18:54:43,565][model8_pretrain.py][INFO] Epoch:[0/2](11900/4588595) loss:4.326 lr:0.0002866 epoch_Time:28817.0min: [2024-01-02 18:55:20,496][model8_pretrain.py][INFO] Epoch:[0/2](12000/4588595) loss:3.577 lr:0.0002863 epoch_Time:28810.0min: [2024-01-02 18:55:20,496][model8_pretrain.py][INFO] Epoch:[0/2](12000/4588595) loss:3.673 lr:0.0002863 epoch_Time:28810.0min: [2024-01-02 18:55:20,496][model8_pretrain.py][INFO] Epoch:[0/2](12000/4588595) loss:3.843 lr:0.0002863 epoch_Time:28811.0min: [2024-01-02 18:55:20,496][model8_pretrain.py][INFO] Epoch:[0/2](12000/4588595) loss:3.914 lr:0.0002863 epoch_Time:28810.0min: [2024-01-02 18:55:20,496][model8_pretrain.py][INFO] Epoch:[0/2](12000/4588595) loss:3.631 lr:0.0002863 epoch_Time:28810.0min: [2024-01-02 18:55:20,496][model8_pretrain.py][INFO] Epoch:[0/2](12000/4588595) 
loss:3.768 lr:0.0002863 epoch_Time:28810.0min: [2024-01-02 18:55:20,496][model8_pretrain.py][INFO] Epoch:[0/2](12000/4588595) loss:3.998 lr:0.0002863 epoch_Time:28810.0min: [2024-01-02 18:55:20,497][model8_pretrain.py][INFO] Epoch:[0/2](12000/4588595) loss:3.635 lr:0.0002863 epoch_Time:28811.0min: [2024-01-02 18:55:57,411][model8_pretrain.py][INFO] Epoch:[0/2](12100/4588595) loss:3.436 lr:0.0002861 epoch_Time:28804.0min: [2024-01-02 18:55:57,411][model8_pretrain.py][INFO] Epoch:[0/2](12100/4588595) loss:3.342 lr:0.0002861 epoch_Time:28804.0min: [2024-01-02 18:55:57,411][model8_pretrain.py][INFO] Epoch:[0/2](12100/4588595) loss:3.663 lr:0.0002861 epoch_Time:28804.0min: [2024-01-02 18:55:57,411][model8_pretrain.py][INFO] Epoch:[0/2](12100/4588595) loss:3.772 lr:0.0002861 epoch_Time:28804.0min: [2024-01-02 18:55:57,411][model8_pretrain.py][INFO] Epoch:[0/2](12100/4588595) loss:2.897 lr:0.0002861 epoch_Time:28804.0min: [2024-01-02 18:55:57,411][model8_pretrain.py][INFO] Epoch:[0/2](12100/4588595) loss:3.316 lr:0.0002861 epoch_Time:28804.0min: [2024-01-02 18:55:57,411][model8_pretrain.py][INFO] Epoch:[0/2](12100/4588595) loss:3.756 lr:0.0002861 epoch_Time:28804.0min: [2024-01-02 18:55:57,411][model8_pretrain.py][INFO] Epoch:[0/2](12100/4588595) loss:3.733 lr:0.0002861 epoch_Time:28804.0min: [2024-01-02 18:56:34,349][model8_pretrain.py][INFO] Epoch:[0/2](12200/4588595) loss:3.701 lr:0.0002859 epoch_Time:28799.0min: [2024-01-02 18:56:34,349][model8_pretrain.py][INFO] Epoch:[0/2](12200/4588595) loss:3.394 lr:0.0002859 epoch_Time:28799.0min: [2024-01-02 18:56:34,349][model8_pretrain.py][INFO] Epoch:[0/2](12200/4588595) loss:3.593 lr:0.0002859 epoch_Time:28799.0min: [2024-01-02 18:56:34,349][model8_pretrain.py][INFO] Epoch:[0/2](12200/4588595) loss:3.788 lr:0.0002859 epoch_Time:28799.0min: [2024-01-02 18:56:34,349][model8_pretrain.py][INFO] Epoch:[0/2](12200/4588595) loss:4.177 lr:0.0002859 epoch_Time:28799.0min: [2024-01-02 18:56:34,349][model8_pretrain.py][INFO] 
Epoch:[0/2](12200/4588595) loss:3.800 lr:0.0002859 epoch_Time:28799.0min: [2024-01-02 18:56:34,349][model8_pretrain.py][INFO] Epoch:[0/2](12200/4588595) loss:3.383 lr:0.0002859 epoch_Time:28799.0min: [2024-01-02 18:56:34,349][model8_pretrain.py][INFO] Epoch:[0/2](12200/4588595) loss:3.625 lr:0.0002859 epoch_Time:28799.0min: [2024-01-02 18:57:11,269][model8_pretrain.py][INFO] Epoch:[0/2](12300/4588595) loss:3.165 lr:0.0002856 epoch_Time:28793.0min: [2024-01-02 18:57:11,269][model8_pretrain.py][INFO] Epoch:[0/2](12300/4588595) loss:3.407 lr:0.0002856 epoch_Time:28793.0min: [2024-01-02 18:57:11,269][model8_pretrain.py][INFO] Epoch:[0/2](12300/4588595) loss:3.575 lr:0.0002856 epoch_Time:28793.0min: [2024-01-02 18:57:11,269][model8_pretrain.py][INFO] Epoch:[0/2](12300/4588595) loss:3.657 lr:0.0002856 epoch_Time:28793.0min: [2024-01-02 18:57:11,269][model8_pretrain.py][INFO] Epoch:[0/2](12300/4588595) loss:3.543 lr:0.0002856 epoch_Time:28793.0min: [2024-01-02 18:57:11,269][model8_pretrain.py][INFO] Epoch:[0/2](12300/4588595) loss:3.622 lr:0.0002856 epoch_Time:28793.0min: [2024-01-02 18:57:11,269][model8_pretrain.py][INFO] Epoch:[0/2](12300/4588595) loss:3.654 lr:0.0002856 epoch_Time:28793.0min: [2024-01-02 18:57:11,269][model8_pretrain.py][INFO] Epoch:[0/2](12300/4588595) loss:3.454 lr:0.0002856 epoch_Time:28793.0min: [2024-01-02 18:57:48,208][model8_pretrain.py][INFO] Epoch:[0/2](12400/4588595) loss:3.439 lr:0.0002854 epoch_Time:28787.0min: [2024-01-02 18:57:48,208][model8_pretrain.py][INFO] Epoch:[0/2](12400/4588595) loss:3.846 lr:0.0002854 epoch_Time:28787.0min: [2024-01-02 18:57:48,208][model8_pretrain.py][INFO] Epoch:[0/2](12400/4588595) loss:3.509 lr:0.0002854 epoch_Time:28787.0min: [2024-01-02 18:57:48,208][model8_pretrain.py][INFO] Epoch:[0/2](12400/4588595) loss:3.509 lr:0.0002854 epoch_Time:28787.0min: [2024-01-02 18:57:48,208][model8_pretrain.py][INFO] Epoch:[0/2](12400/4588595) loss:3.739 lr:0.0002854 epoch_Time:28787.0min: [2024-01-02 
18:57:48,208][model8_pretrain.py][INFO] Epoch:[0/2](12400/4588595) loss:3.346 lr:0.0002854 epoch_Time:28787.0min: [2024-01-02 18:57:48,208][model8_pretrain.py][INFO] Epoch:[0/2](12400/4588595) loss:3.552 lr:0.0002854 epoch_Time:28787.0min: [2024-01-02 18:57:48,208][model8_pretrain.py][INFO] Epoch:[0/2](12400/4588595) loss:3.634 lr:0.0002854 epoch_Time:28787.0min: [2024-01-02 18:58:28,737][model8_pretrain.py][INFO] Epoch:[0/2](12500/4588595) loss:3.796 lr:0.0002851 epoch_Time:28804.0min: [2024-01-02 18:58:28,737][model8_pretrain.py][INFO] Epoch:[0/2](12500/4588595) loss:3.487 lr:0.0002851 epoch_Time:28804.0min: [2024-01-02 18:58:28,737][model8_pretrain.py][INFO] Epoch:[0/2](12500/4588595) loss:3.717 lr:0.0002851 epoch_Time:28804.0min: [2024-01-02 18:58:28,737][model8_pretrain.py][INFO] Epoch:[0/2](12500/4588595) loss:3.706 lr:0.0002851 epoch_Time:28804.0min: [2024-01-02 18:58:28,737][model8_pretrain.py][INFO] Epoch:[0/2](12500/4588595) loss:3.189 lr:0.0002851 epoch_Time:28804.0min: [2024-01-02 18:58:28,737][model8_pretrain.py][INFO] Epoch:[0/2](12500/4588595) loss:3.981 lr:0.0002851 epoch_Time:28804.0min: [2024-01-02 18:58:28,737][model8_pretrain.py][INFO] Epoch:[0/2](12500/4588595) loss:3.690 lr:0.0002851 epoch_Time:28804.0min: [2024-01-02 18:58:28,737][model8_pretrain.py][INFO] Epoch:[0/2](12500/4588595) loss:3.317 lr:0.0002851 epoch_Time:28804.0min: [2024-01-02 18:59:05,661][model8_pretrain.py][INFO] Epoch:[0/2](12600/4588595) loss:3.590 lr:0.0002848 epoch_Time:28798.0min: [2024-01-02 18:59:05,662][model8_pretrain.py][INFO] Epoch:[0/2](12600/4588595) loss:3.898 lr:0.0002848 epoch_Time:28798.0min: [2024-01-02 18:59:05,662][model8_pretrain.py][INFO] Epoch:[0/2](12600/4588595) loss:4.008 lr:0.0002848 epoch_Time:28798.0min: [2024-01-02 18:59:05,662][model8_pretrain.py][INFO] Epoch:[0/2](12600/4588595) loss:3.719 lr:0.0002848 epoch_Time:28798.0min: [2024-01-02 18:59:05,662][model8_pretrain.py][INFO] Epoch:[0/2](12600/4588595) loss:3.493 lr:0.0002848 
epoch_Time:28798.0min: [2024-01-02 18:59:05,662][model8_pretrain.py][INFO] Epoch:[0/2](12600/4588595) loss:3.737 lr:0.0002848 epoch_Time:28798.0min: [2024-01-02 18:59:05,662][model8_pretrain.py][INFO] Epoch:[0/2](12600/4588595) loss:3.645 lr:0.0002848 epoch_Time:28798.0min: [2024-01-02 18:59:05,662][model8_pretrain.py][INFO] Epoch:[0/2](12600/4588595) loss:3.770 lr:0.0002848 epoch_Time:28798.0min: [2024-01-02 18:59:42,589][model8_pretrain.py][INFO] Epoch:[0/2](12700/4588595) loss:3.082 lr:0.0002846 epoch_Time:28793.0min: [2024-01-02 18:59:42,589][model8_pretrain.py][INFO] Epoch:[0/2](12700/4588595) loss:3.880 lr:0.0002846 epoch_Time:28793.0min: [2024-01-02 18:59:42,589][model8_pretrain.py][INFO] Epoch:[0/2](12700/4588595) loss:4.042 lr:0.0002846 epoch_Time:28793.0min: [2024-01-02 18:59:42,589][model8_pretrain.py][INFO] Epoch:[0/2](12700/4588595) loss:3.883 lr:0.0002846 epoch_Time:28793.0min: [2024-01-02 18:59:42,589][model8_pretrain.py][INFO] Epoch:[0/2](12700/4588595) loss:3.678 lr:0.0002846 epoch_Time:28793.0min: [2024-01-02 18:59:42,589][model8_pretrain.py][INFO] Epoch:[0/2](12700/4588595) loss:3.554 lr:0.0002846 epoch_Time:28793.0min: [2024-01-02 18:59:42,589][model8_pretrain.py][INFO] Epoch:[0/2](12700/4588595) loss:3.672 lr:0.0002846 epoch_Time:28793.0min: [2024-01-02 18:59:42,589][model8_pretrain.py][INFO] Epoch:[0/2](12700/4588595) loss:3.684 lr:0.0002846 epoch_Time:28793.0min: [2024-01-02 19:00:19,514][model8_pretrain.py][INFO] Epoch:[0/2](12800/4588595) loss:3.661 lr:0.0002843 epoch_Time:28787.0min: [2024-01-02 19:00:19,514][model8_pretrain.py][INFO] Epoch:[0/2](12800/4588595) loss:3.362 lr:0.0002843 epoch_Time:28787.0min: [2024-01-02 19:00:19,514][model8_pretrain.py][INFO] Epoch:[0/2](12800/4588595) loss:3.704 lr:0.0002843 epoch_Time:28787.0min: [2024-01-02 19:00:19,514][model8_pretrain.py][INFO] Epoch:[0/2](12800/4588595) loss:3.374 lr:0.0002843 epoch_Time:28787.0min: [2024-01-02 19:00:19,514][model8_pretrain.py][INFO] Epoch:[0/2](12800/4588595) 
loss:3.428 lr:0.0002843 epoch_Time:28787.0min: [2024-01-02 19:00:19,514][model8_pretrain.py][INFO] Epoch:[0/2](12800/4588595) loss:3.630 lr:0.0002843 epoch_Time:28787.0min: [2024-01-02 19:00:19,514][model8_pretrain.py][INFO] Epoch:[0/2](12800/4588595) loss:3.560 lr:0.0002843 epoch_Time:28787.0min: [2024-01-02 19:00:19,515][model8_pretrain.py][INFO] Epoch:[0/2](12800/4588595) loss:3.445 lr:0.0002843 epoch_Time:28787.0min: [2024-01-02 19:00:56,429][model8_pretrain.py][INFO] Epoch:[0/2](12900/4588595) loss:3.883 lr:0.0002841 epoch_Time:28781.0min: [2024-01-02 19:00:56,429][model8_pretrain.py][INFO] Epoch:[0/2](12900/4588595) loss:3.320 lr:0.0002841 epoch_Time:28781.0min: [2024-01-02 19:00:56,429][model8_pretrain.py][INFO] Epoch:[0/2](12900/4588595) loss:3.545 lr:0.0002841 epoch_Time:28781.0min: [2024-01-02 19:00:56,429][model8_pretrain.py][INFO] Epoch:[0/2](12900/4588595) loss:3.500 lr:0.0002841 epoch_Time:28781.0min: [2024-01-02 19:00:56,429][model8_pretrain.py][INFO] Epoch:[0/2](12900/4588595) loss:3.463 lr:0.0002841 epoch_Time:28781.0min: [2024-01-02 19:00:56,429][model8_pretrain.py][INFO] Epoch:[0/2](12900/4588595) loss:3.351 lr:0.0002841 epoch_Time:28781.0min: [2024-01-02 19:00:56,429][model8_pretrain.py][INFO] Epoch:[0/2](12900/4588595) loss:3.776 lr:0.0002841 epoch_Time:28781.0min: [2024-01-02 19:00:56,429][model8_pretrain.py][INFO] Epoch:[0/2](12900/4588595) loss:3.754 lr:0.0002841 epoch_Time:28781.0min: [2024-01-02 19:01:33,357][model8_pretrain.py][INFO] Epoch:[0/2](13000/4588595) loss:3.834 lr:0.0002838 epoch_Time:28776.0min: [2024-01-02 19:01:33,357][model8_pretrain.py][INFO] Epoch:[0/2](13000/4588595) loss:3.103 lr:0.0002838 epoch_Time:28776.0min: [2024-01-02 19:01:33,357][model8_pretrain.py][INFO] Epoch:[0/2](13000/4588595) loss:3.680 lr:0.0002838 epoch_Time:28776.0min: [2024-01-02 19:01:33,357][model8_pretrain.py][INFO] Epoch:[0/2](13000/4588595) loss:3.213 lr:0.0002838 epoch_Time:28776.0min: [2024-01-02 19:01:33,357][model8_pretrain.py][INFO] 
Epoch:[0/2](13000/4588595) loss:3.801 lr:0.0002838 epoch_Time:28776.0min: [2024-01-02 19:01:33,357][model8_pretrain.py][INFO] Epoch:[0/2](13000/4588595) loss:4.137 lr:0.0002838 epoch_Time:28776.0min: [2024-01-02 19:01:33,357][model8_pretrain.py][INFO] Epoch:[0/2](13000/4588595) loss:3.992 lr:0.0002838 epoch_Time:28776.0min: [2024-01-02 19:01:33,358][model8_pretrain.py][INFO] Epoch:[0/2](13000/4588595) loss:3.773 lr:0.0002838 epoch_Time:28776.0min: [2024-01-02 19:02:10,308][model8_pretrain.py][INFO] Epoch:[0/2](13100/4588595) loss:3.325 lr:0.0002835 epoch_Time:28770.0min: [2024-01-02 19:02:10,308][model8_pretrain.py][INFO] Epoch:[0/2](13100/4588595) loss:3.927 lr:0.0002835 epoch_Time:28770.0min: [2024-01-02 19:02:10,308][model8_pretrain.py][INFO] Epoch:[0/2](13100/4588595) loss:3.211 lr:0.0002835 epoch_Time:28770.0min: [2024-01-02 19:02:10,308][model8_pretrain.py][INFO] Epoch:[0/2](13100/4588595) loss:3.737 lr:0.0002835 epoch_Time:28770.0min: [2024-01-02 19:02:10,308][model8_pretrain.py][INFO] Epoch:[0/2](13100/4588595) loss:3.707 lr:0.0002835 epoch_Time:28770.0min: [2024-01-02 19:02:10,308][model8_pretrain.py][INFO] Epoch:[0/2](13100/4588595) loss:3.567 lr:0.0002835 epoch_Time:28771.0min: [2024-01-02 19:02:10,309][model8_pretrain.py][INFO] Epoch:[0/2](13100/4588595) loss:3.488 lr:0.0002835 epoch_Time:28770.0min: [2024-01-02 19:02:10,309][model8_pretrain.py][INFO] Epoch:[0/2](13100/4588595) loss:3.466 lr:0.0002835 epoch_Time:28770.0min: [2024-01-02 19:02:47,297][model8_pretrain.py][INFO] Epoch:[0/2](13200/4588595) loss:3.364 lr:0.0002833 epoch_Time:28766.0min: [2024-01-02 19:02:47,297][model8_pretrain.py][INFO] Epoch:[0/2](13200/4588595) loss:3.306 lr:0.0002833 epoch_Time:28766.0min: [2024-01-02 19:02:47,297][model8_pretrain.py][INFO] Epoch:[0/2](13200/4588595) loss:3.614 lr:0.0002833 epoch_Time:28766.0min: [2024-01-02 19:02:47,297][model8_pretrain.py][INFO] Epoch:[0/2](13200/4588595) loss:3.235 lr:0.0002833 epoch_Time:28766.0min: [2024-01-02 
19:02:47,297][model8_pretrain.py][INFO] Epoch:[0/2](13200/4588595) loss:4.036 lr:0.0002833 epoch_Time:28766.0min: [2024-01-02 19:02:47,297][model8_pretrain.py][INFO] Epoch:[0/2](13200/4588595) loss:3.652 lr:0.0002833 epoch_Time:28766.0min: [2024-01-02 19:02:47,297][model8_pretrain.py][INFO] Epoch:[0/2](13200/4588595) loss:3.943 lr:0.0002833 epoch_Time:28766.0min: [2024-01-02 19:02:47,297][model8_pretrain.py][INFO] Epoch:[0/2](13200/4588595) loss:3.630 lr:0.0002833 epoch_Time:28766.0min: [2024-01-02 19:03:28,511][model8_pretrain.py][INFO] Epoch:[0/2](13300/4588595) loss:3.477 lr:0.0002830 epoch_Time:28785.0min: [2024-01-02 19:03:28,511][model8_pretrain.py][INFO] Epoch:[0/2](13300/4588595) loss:3.827 lr:0.0002830 epoch_Time:28785.0min: [2024-01-02 19:03:28,511][model8_pretrain.py][INFO] Epoch:[0/2](13300/4588595) loss:3.713 lr:0.0002830 epoch_Time:28785.0min: [2024-01-02 19:03:28,511][model8_pretrain.py][INFO] Epoch:[0/2](13300/4588595) loss:3.361 lr:0.0002830 epoch_Time:28785.0min: [2024-01-02 19:03:28,511][model8_pretrain.py][INFO] Epoch:[0/2](13300/4588595) loss:3.512 lr:0.0002830 epoch_Time:28785.0min: [2024-01-02 19:03:28,511][model8_pretrain.py][INFO] Epoch:[0/2](13300/4588595) loss:3.443 lr:0.0002830 epoch_Time:28785.0min: [2024-01-02 19:03:28,511][model8_pretrain.py][INFO] Epoch:[0/2](13300/4588595) loss:3.476 lr:0.0002830 epoch_Time:28785.0min: [2024-01-02 19:03:28,511][model8_pretrain.py][INFO] Epoch:[0/2](13300/4588595) loss:3.202 lr:0.0002830 epoch_Time:28785.0min: [2024-01-02 19:04:05,438][model8_pretrain.py][INFO] Epoch:[0/2](13400/4588595) loss:2.858 lr:0.0002827 epoch_Time:28779.0min: [2024-01-02 19:04:05,438][model8_pretrain.py][INFO] Epoch:[0/2](13400/4588595) loss:3.499 lr:0.0002827 epoch_Time:28779.0min: [2024-01-02 19:04:05,438][model8_pretrain.py][INFO] Epoch:[0/2](13400/4588595) loss:3.704 lr:0.0002827 epoch_Time:28780.0min: [2024-01-02 19:04:05,438][model8_pretrain.py][INFO] Epoch:[0/2](13400/4588595) loss:3.487 lr:0.0002827 
epoch_Time:28779.0min: [2024-01-02 19:04:05,438][model8_pretrain.py][INFO] Epoch:[0/2](13400/4588595) loss:3.953 lr:0.0002827 epoch_Time:28780.0min: [2024-01-02 19:04:05,438][model8_pretrain.py][INFO] Epoch:[0/2](13400/4588595) loss:3.521 lr:0.0002827 epoch_Time:28779.0min: [2024-01-02 19:04:05,438][model8_pretrain.py][INFO] Epoch:[0/2](13400/4588595) loss:3.867 lr:0.0002827 epoch_Time:28779.0min: [2024-01-02 19:04:05,438][model8_pretrain.py][INFO] Epoch:[0/2](13400/4588595) loss:3.916 lr:0.0002827 epoch_Time:28779.0min: [2024-01-02 19:04:42,351][model8_pretrain.py][INFO] Epoch:[0/2](13500/4588595) loss:3.427 lr:0.0002825 epoch_Time:28775.0min: [2024-01-02 19:04:42,351][model8_pretrain.py][INFO] Epoch:[0/2](13500/4588595) loss:3.960 lr:0.0002825 epoch_Time:28775.0min: [2024-01-02 19:04:42,351][model8_pretrain.py][INFO] Epoch:[0/2](13500/4588595) loss:3.208 lr:0.0002825 epoch_Time:28775.0min: [2024-01-02 19:04:42,351][model8_pretrain.py][INFO] Epoch:[0/2](13500/4588595) loss:2.960 lr:0.0002825 epoch_Time:28775.0min: [2024-01-02 19:04:42,351][model8_pretrain.py][INFO] Epoch:[0/2](13500/4588595) loss:3.260 lr:0.0002825 epoch_Time:28775.0min: [2024-01-02 19:04:42,351][model8_pretrain.py][INFO] Epoch:[0/2](13500/4588595) loss:3.564 lr:0.0002825 epoch_Time:28775.0min: [2024-01-02 19:04:42,351][model8_pretrain.py][INFO] Epoch:[0/2](13500/4588595) loss:3.733 lr:0.0002825 epoch_Time:28775.0min: [2024-01-02 19:04:42,352][model8_pretrain.py][INFO] Epoch:[0/2](13500/4588595) loss:3.294 lr:0.0002825 epoch_Time:28775.0min: [2024-01-02 19:05:19,263][model8_pretrain.py][INFO] Epoch:[0/2](13600/4588595) loss:3.674 lr:0.0002822 epoch_Time:28769.0min: [2024-01-02 19:05:19,263][model8_pretrain.py][INFO] Epoch:[0/2](13600/4588595) loss:3.336 lr:0.0002822 epoch_Time:28769.0min: [2024-01-02 19:05:19,263][model8_pretrain.py][INFO] Epoch:[0/2](13600/4588595) loss:3.627 lr:0.0002822 epoch_Time:28769.0min: [2024-01-02 19:05:19,263][model8_pretrain.py][INFO] Epoch:[0/2](13600/4588595) 
loss:3.710 lr:0.0002822 epoch_Time:28769.0min: [2024-01-02 19:05:19,263][model8_pretrain.py][INFO] Epoch:[0/2](13600/4588595) loss:3.254 lr:0.0002822 epoch_Time:28769.0min: [2024-01-02 19:05:19,263][model8_pretrain.py][INFO] Epoch:[0/2](13600/4588595) loss:3.835 lr:0.0002822 epoch_Time:28769.0min: [2024-01-02 19:05:19,263][model8_pretrain.py][INFO] Epoch:[0/2](13600/4588595) loss:3.857 lr:0.0002822 epoch_Time:28769.0min: [2024-01-02 19:05:19,263][model8_pretrain.py][INFO] Epoch:[0/2](13600/4588595) loss:3.312 lr:0.0002822 epoch_Time:28769.0min: [2024-01-02 19:05:56,174][model8_pretrain.py][INFO] Epoch:[0/2](13700/4588595) loss:4.016 lr:0.0002819 epoch_Time:28764.0min: [2024-01-02 19:05:56,174][model8_pretrain.py][INFO] Epoch:[0/2](13700/4588595) loss:3.584 lr:0.0002819 epoch_Time:28764.0min: [2024-01-02 19:05:56,174][model8_pretrain.py][INFO] Epoch:[0/2](13700/4588595) loss:3.477 lr:0.0002819 epoch_Time:28764.0min: [2024-01-02 19:05:56,175][model8_pretrain.py][INFO] Epoch:[0/2](13700/4588595) loss:3.695 lr:0.0002819 epoch_Time:28764.0min: [2024-01-02 19:05:56,175][model8_pretrain.py][INFO] Epoch:[0/2](13700/4588595) loss:3.849 lr:0.0002819 epoch_Time:28764.0min: [2024-01-02 19:05:56,175][model8_pretrain.py][INFO] Epoch:[0/2](13700/4588595) loss:3.932 lr:0.0002819 epoch_Time:28764.0min: [2024-01-02 19:05:56,175][model8_pretrain.py][INFO] Epoch:[0/2](13700/4588595) loss:3.680 lr:0.0002819 epoch_Time:28763.0min: [2024-01-02 19:05:56,175][model8_pretrain.py][INFO] Epoch:[0/2](13700/4588595) loss:3.390 lr:0.0002819 epoch_Time:28764.0min: [2024-01-02 19:06:33,104][model8_pretrain.py][INFO] Epoch:[0/2](13800/4588595) loss:4.095 lr:0.0002816 epoch_Time:28759.0min: [2024-01-02 19:06:33,104][model8_pretrain.py][INFO] Epoch:[0/2](13800/4588595) loss:3.310 lr:0.0002816 epoch_Time:28759.0min: [2024-01-02 19:06:33,104][model8_pretrain.py][INFO] Epoch:[0/2](13800/4588595) loss:3.617 lr:0.0002816 epoch_Time:28759.0min: [2024-01-02 19:06:33,104][model8_pretrain.py][INFO] 
Epoch:[0/2](13800/4588595) loss:3.790 lr:0.0002816 epoch_Time:28759.0min: [2024-01-02 19:06:33,104][model8_pretrain.py][INFO] Epoch:[0/2](13800/4588595) loss:3.713 lr:0.0002816 epoch_Time:28759.0min: [2024-01-02 19:06:33,104][model8_pretrain.py][INFO] Epoch:[0/2](13800/4588595) loss:4.145 lr:0.0002816 epoch_Time:28759.0min: [2024-01-02 19:06:33,104][model8_pretrain.py][INFO] Epoch:[0/2](13800/4588595) loss:3.189 lr:0.0002816 epoch_Time:28759.0min: [2024-01-02 19:06:33,104][model8_pretrain.py][INFO] Epoch:[0/2](13800/4588595) loss:2.952 lr:0.0002816 epoch_Time:28759.0min: [2024-01-02 19:07:10,058][model8_pretrain.py][INFO] Epoch:[0/2](13900/4588595) loss:3.186 lr:0.0002813 epoch_Time:28754.0min: [2024-01-02 19:07:10,058][model8_pretrain.py][INFO] Epoch:[0/2](13900/4588595) loss:2.752 lr:0.0002813 epoch_Time:28754.0min: [2024-01-02 19:07:10,058][model8_pretrain.py][INFO] Epoch:[0/2](13900/4588595) loss:3.259 lr:0.0002813 epoch_Time:28754.0min: [2024-01-02 19:07:10,058][model8_pretrain.py][INFO] Epoch:[0/2](13900/4588595) loss:3.186 lr:0.0002813 epoch_Time:28754.0min: [2024-01-02 19:07:10,058][model8_pretrain.py][INFO] Epoch:[0/2](13900/4588595) loss:3.802 lr:0.0002813 epoch_Time:28754.0min: [2024-01-02 19:07:10,058][model8_pretrain.py][INFO] Epoch:[0/2](13900/4588595) loss:3.372 lr:0.0002813 epoch_Time:28754.0min: [2024-01-02 19:07:10,058][model8_pretrain.py][INFO] Epoch:[0/2](13900/4588595) loss:3.609 lr:0.0002813 epoch_Time:28754.0min: [2024-01-02 19:07:10,058][model8_pretrain.py][INFO] Epoch:[0/2](13900/4588595) loss:3.700 lr:0.0002813 epoch_Time:28754.0min: [2024-01-02 19:07:47,005][model8_pretrain.py][INFO] Epoch:[0/2](14000/4588595) loss:3.704 lr:0.0002811 epoch_Time:28750.0min: [2024-01-02 19:07:47,005][model8_pretrain.py][INFO] Epoch:[0/2](14000/4588595) loss:3.427 lr:0.0002811 epoch_Time:28750.0min: [2024-01-02 19:07:47,005][model8_pretrain.py][INFO] Epoch:[0/2](14000/4588595) loss:3.500 lr:0.0002811 epoch_Time:28750.0min: [2024-01-02 
19:07:47,005][model8_pretrain.py][INFO] Epoch:[0/2](14000/4588595) loss:3.816 lr:0.0002811 epoch_Time:28750.0min: [2024-01-02 19:07:47,005][model8_pretrain.py][INFO] Epoch:[0/2](14000/4588595) loss:2.920 lr:0.0002811 epoch_Time:28750.0min: [2024-01-02 19:07:47,005][model8_pretrain.py][INFO] Epoch:[0/2](14000/4588595) loss:3.164 lr:0.0002811 epoch_Time:28750.0min: [2024-01-02 19:07:47,005][model8_pretrain.py][INFO] Epoch:[0/2](14000/4588595) loss:3.799 lr:0.0002811 epoch_Time:28750.0min: [2024-01-02 19:07:47,006][model8_pretrain.py][INFO] Epoch:[0/2](14000/4588595) loss:3.857 lr:0.0002811 epoch_Time:28750.0min: [2024-01-02 19:08:27,734][model8_pretrain.py][INFO] Epoch:[0/2](14100/4588595) loss:3.585 lr:0.0002808 epoch_Time:28765.0min: [2024-01-02 19:08:27,734][model8_pretrain.py][INFO] Epoch:[0/2](14100/4588595) loss:3.554 lr:0.0002808 epoch_Time:28765.0min: [2024-01-02 19:08:27,734][model8_pretrain.py][INFO] Epoch:[0/2](14100/4588595) loss:4.032 lr:0.0002808 epoch_Time:28765.0min: [2024-01-02 19:08:27,734][model8_pretrain.py][INFO] Epoch:[0/2](14100/4588595) loss:3.627 lr:0.0002808 epoch_Time:28765.0min: [2024-01-02 19:08:27,734][model8_pretrain.py][INFO] Epoch:[0/2](14100/4588595) loss:3.730 lr:0.0002808 epoch_Time:28765.0min: [2024-01-02 19:08:27,734][model8_pretrain.py][INFO] Epoch:[0/2](14100/4588595) loss:3.633 lr:0.0002808 epoch_Time:28765.0min: [2024-01-02 19:08:27,734][model8_pretrain.py][INFO] Epoch:[0/2](14100/4588595) loss:3.679 lr:0.0002808 epoch_Time:28765.0min: [2024-01-02 19:08:27,734][model8_pretrain.py][INFO] Epoch:[0/2](14100/4588595) loss:3.462 lr:0.0002808 epoch_Time:28765.0min: [2024-01-02 19:09:04,674][model8_pretrain.py][INFO] Epoch:[0/2](14200/4588595) loss:3.818 lr:0.0002805 epoch_Time:28760.0min: [2024-01-02 19:09:04,674][model8_pretrain.py][INFO] Epoch:[0/2](14200/4588595) loss:3.574 lr:0.0002805 epoch_Time:28760.0min: [2024-01-02 19:09:04,674][model8_pretrain.py][INFO] Epoch:[0/2](14200/4588595) loss:3.621 lr:0.0002805 
epoch_Time:28760.0min: [2024-01-02 19:09:04,674][model8_pretrain.py][INFO] Epoch:[0/2](14200/4588595) loss:3.594 lr:0.0002805 epoch_Time:28760.0min: [2024-01-02 19:09:04,674][model8_pretrain.py][INFO] Epoch:[0/2](14200/4588595) loss:3.081 lr:0.0002805 epoch_Time:28760.0min: [2024-01-02 19:09:04,674][model8_pretrain.py][INFO] Epoch:[0/2](14200/4588595) loss:3.092 lr:0.0002805 epoch_Time:28760.0min: [2024-01-02 19:09:04,674][model8_pretrain.py][INFO] Epoch:[0/2](14200/4588595) loss:3.592 lr:0.0002805 epoch_Time:28760.0min: [2024-01-02 19:09:04,675][model8_pretrain.py][INFO] Epoch:[0/2](14200/4588595) loss:3.839 lr:0.0002805 epoch_Time:28760.0min: [2024-01-02 19:09:41,609][model8_pretrain.py][INFO] Epoch:[0/2](14300/4588595) loss:3.651 lr:0.0002802 epoch_Time:28756.0min: [2024-01-02 19:09:41,609][model8_pretrain.py][INFO] Epoch:[0/2](14300/4588595) loss:3.553 lr:0.0002802 epoch_Time:28756.0min: [2024-01-02 19:09:41,609][model8_pretrain.py][INFO] Epoch:[0/2](14300/4588595) loss:3.599 lr:0.0002802 epoch_Time:28756.0min: [2024-01-02 19:09:41,609][model8_pretrain.py][INFO] Epoch:[0/2](14300/4588595) loss:3.612 lr:0.0002802 epoch_Time:28756.0min: [2024-01-02 19:09:41,609][model8_pretrain.py][INFO] Epoch:[0/2](14300/4588595) loss:3.593 lr:0.0002802 epoch_Time:28756.0min: [2024-01-02 19:09:41,609][model8_pretrain.py][INFO] Epoch:[0/2](14300/4588595) loss:3.778 lr:0.0002802 epoch_Time:28756.0min: [2024-01-02 19:09:41,610][model8_pretrain.py][INFO] Epoch:[0/2](14300/4588595) loss:3.764 lr:0.0002802 epoch_Time:28756.0min: [2024-01-02 19:09:41,610][model8_pretrain.py][INFO] Epoch:[0/2](14300/4588595) loss:4.044 lr:0.0002802 epoch_Time:28756.0min: [2024-01-02 19:10:18,542][model8_pretrain.py][INFO] Epoch:[0/2](14400/4588595) loss:3.498 lr:0.0002799 epoch_Time:28751.0min: [2024-01-02 19:10:18,542][model8_pretrain.py][INFO] Epoch:[0/2](14400/4588595) loss:3.502 lr:0.0002799 epoch_Time:28751.0min: [2024-01-02 19:10:18,542][model8_pretrain.py][INFO] Epoch:[0/2](14400/4588595) 
loss:4.040 lr:0.0002799 epoch_Time:28751.0min: [2024-01-02 19:10:18,542][model8_pretrain.py][INFO] Epoch:[0/2](14400/4588595) loss:3.404 lr:0.0002799 epoch_Time:28751.0min: [2024-01-02 19:10:18,542][model8_pretrain.py][INFO] Epoch:[0/2](14400/4588595) loss:3.224 lr:0.0002799 epoch_Time:28751.0min: [2024-01-02 19:10:18,542][model8_pretrain.py][INFO] Epoch:[0/2](14400/4588595) loss:3.362 lr:0.0002799 epoch_Time:28750.0min: [2024-01-02 19:10:18,542][model8_pretrain.py][INFO] Epoch:[0/2](14400/4588595) loss:3.648 lr:0.0002799 epoch_Time:28751.0min: [2024-01-02 19:10:18,542][model8_pretrain.py][INFO] Epoch:[0/2](14400/4588595) loss:3.469 lr:0.0002799 epoch_Time:28750.0min: [2024-01-02 19:10:55,473][model8_pretrain.py][INFO] Epoch:[0/2](14500/4588595) loss:3.487 lr:0.0002796 epoch_Time:28745.0min: [2024-01-02 19:10:55,473][model8_pretrain.py][INFO] Epoch:[0/2](14500/4588595) loss:3.603 lr:0.0002796 epoch_Time:28746.0min: [2024-01-02 19:10:55,473][model8_pretrain.py][INFO] Epoch:[0/2](14500/4588595) loss:3.971 lr:0.0002796 epoch_Time:28745.0min: [2024-01-02 19:10:55,473][model8_pretrain.py][INFO] Epoch:[0/2](14500/4588595) loss:3.576 lr:0.0002796 epoch_Time:28745.0min: [2024-01-02 19:10:55,473][model8_pretrain.py][INFO] Epoch:[0/2](14500/4588595) loss:2.651 lr:0.0002796 epoch_Time:28745.0min: [2024-01-02 19:10:55,473][model8_pretrain.py][INFO] Epoch:[0/2](14500/4588595) loss:3.850 lr:0.0002796 epoch_Time:28746.0min: [2024-01-02 19:10:55,473][model8_pretrain.py][INFO] Epoch:[0/2](14500/4588595) loss:3.519 lr:0.0002796 epoch_Time:28745.0min: [2024-01-02 19:10:55,474][model8_pretrain.py][INFO] Epoch:[0/2](14500/4588595) loss:3.310 lr:0.0002796 epoch_Time:28745.0min: [2024-01-02 19:11:32,440][model8_pretrain.py][INFO] Epoch:[0/2](14600/4588595) loss:3.642 lr:0.0002793 epoch_Time:28742.0min: [2024-01-02 19:11:32,440][model8_pretrain.py][INFO] Epoch:[0/2](14600/4588595) loss:3.228 lr:0.0002793 epoch_Time:28742.0min: [2024-01-02 19:11:32,440][model8_pretrain.py][INFO] 
Epoch:[0/2](14600/4588595) loss:3.291 lr:0.0002793 epoch_Time:28741.0min: [2024-01-02 19:11:32,440][model8_pretrain.py][INFO] Epoch:[0/2](14600/4588595) loss:3.706 lr:0.0002793 epoch_Time:28742.0min: [2024-01-02 19:11:32,440][model8_pretrain.py][INFO] Epoch:[0/2](14600/4588595) loss:3.613 lr:0.0002793 epoch_Time:28741.0min: [2024-01-02 19:11:32,440][model8_pretrain.py][INFO] Epoch:[0/2](14600/4588595) loss:3.466 lr:0.0002793 epoch_Time:28742.0min: [2024-01-02 19:11:32,440][model8_pretrain.py][INFO] Epoch:[0/2](14600/4588595) loss:3.431 lr:0.0002793 epoch_Time:28742.0min: [2024-01-02 19:11:32,440][model8_pretrain.py][INFO] Epoch:[0/2](14600/4588595) loss:3.622 lr:0.0002793 epoch_Time:28742.0min: [2024-01-02 19:12:09,402][model8_pretrain.py][INFO] Epoch:[0/2](14700/4588595) loss:3.708 lr:0.0002790 epoch_Time:28737.0min: [2024-01-02 19:12:09,402][model8_pretrain.py][INFO] Epoch:[0/2](14700/4588595) loss:4.075 lr:0.0002790 epoch_Time:28737.0min: [2024-01-02 19:12:09,402][model8_pretrain.py][INFO] Epoch:[0/2](14700/4588595) loss:3.098 lr:0.0002790 epoch_Time:28737.0min: [2024-01-02 19:12:09,402][model8_pretrain.py][INFO] Epoch:[0/2](14700/4588595) loss:3.385 lr:0.0002790 epoch_Time:28737.0min: [2024-01-02 19:12:09,402][model8_pretrain.py][INFO] Epoch:[0/2](14700/4588595) loss:3.235 lr:0.0002790 epoch_Time:28737.0min: [2024-01-02 19:12:09,402][model8_pretrain.py][INFO] Epoch:[0/2](14700/4588595) loss:3.282 lr:0.0002790 epoch_Time:28737.0min: [2024-01-02 19:12:09,402][model8_pretrain.py][INFO] Epoch:[0/2](14700/4588595) loss:3.463 lr:0.0002790 epoch_Time:28737.0min: [2024-01-02 19:12:09,403][model8_pretrain.py][INFO] Epoch:[0/2](14700/4588595) loss:3.460 lr:0.0002790 epoch_Time:28737.0min: [2024-01-02 19:12:46,340][model8_pretrain.py][INFO] Epoch:[0/2](14800/4588595) loss:3.290 lr:0.0002787 epoch_Time:28733.0min: [2024-01-02 19:12:46,340][model8_pretrain.py][INFO] Epoch:[0/2](14800/4588595) loss:3.358 lr:0.0002787 epoch_Time:28733.0min: [2024-01-02 
19:12:46,340][model8_pretrain.py][INFO] Epoch:[0/2](14800/4588595) loss:3.923 lr:0.0002787 epoch_Time:28733.0min: [2024-01-02 19:12:46,340][model8_pretrain.py][INFO] Epoch:[0/2](14800/4588595) loss:3.547 lr:0.0002787 epoch_Time:28733.0min: [2024-01-02 19:12:46,340][model8_pretrain.py][INFO] Epoch:[0/2](14800/4588595) loss:3.623 lr:0.0002787 epoch_Time:28733.0min: [2024-01-02 19:12:46,340][model8_pretrain.py][INFO] Epoch:[0/2](14800/4588595) loss:3.804 lr:0.0002787 epoch_Time:28733.0min: [2024-01-02 19:12:46,341][model8_pretrain.py][INFO] Epoch:[0/2](14800/4588595) loss:3.338 lr:0.0002787 epoch_Time:28733.0min: [2024-01-02 19:12:46,342][model8_pretrain.py][INFO] Epoch:[0/2](14800/4588595) loss:3.473 lr:0.0002787 epoch_Time:28733.0min: [2024-01-02 19:13:26,905][model8_pretrain.py][INFO] Epoch:[0/2](14900/4588595) loss:3.789 lr:0.0002784 epoch_Time:28747.0min: [2024-01-02 19:13:26,905][model8_pretrain.py][INFO] Epoch:[0/2](14900/4588595) loss:3.613 lr:0.0002784 epoch_Time:28746.0min: [2024-01-02 19:13:26,905][model8_pretrain.py][INFO] Epoch:[0/2](14900/4588595) loss:3.647 lr:0.0002784 epoch_Time:28747.0min: [2024-01-02 19:13:26,905][model8_pretrain.py][INFO] Epoch:[0/2](14900/4588595) loss:3.751 lr:0.0002784 epoch_Time:28747.0min: [2024-01-02 19:13:26,905][model8_pretrain.py][INFO] Epoch:[0/2](14900/4588595) loss:3.411 lr:0.0002784 epoch_Time:28747.0min: [2024-01-02 19:13:26,905][model8_pretrain.py][INFO] Epoch:[0/2](14900/4588595) loss:3.449 lr:0.0002784 epoch_Time:28746.0min: [2024-01-02 19:13:26,905][model8_pretrain.py][INFO] Epoch:[0/2](14900/4588595) loss:3.924 lr:0.0002784 epoch_Time:28747.0min: [2024-01-02 19:13:26,905][model8_pretrain.py][INFO] Epoch:[0/2](14900/4588595) loss:3.575 lr:0.0002784 epoch_Time:28747.0min: [2024-01-02 19:14:03,843][model8_pretrain.py][INFO] Epoch:[0/2](15000/4588595) loss:3.532 lr:0.0002781 epoch_Time:28742.0min: [2024-01-02 19:14:03,843][model8_pretrain.py][INFO] Epoch:[0/2](15000/4588595) loss:3.759 lr:0.0002781 
epoch_Time:28742.0min: [2024-01-02 19:14:03,843][model8_pretrain.py][INFO] Epoch:[0/2](15000/4588595) loss:3.763 lr:0.0002781 epoch_Time:28742.0min: [2024-01-02 19:14:03,843][model8_pretrain.py][INFO] Epoch:[0/2](15000/4588595) loss:3.755 lr:0.0002781 epoch_Time:28742.0min: [2024-01-02 19:14:03,843][model8_pretrain.py][INFO] Epoch:[0/2](15000/4588595) loss:3.650 lr:0.0002781 epoch_Time:28742.0min: [2024-01-02 19:14:03,843][model8_pretrain.py][INFO] Epoch:[0/2](15000/4588595) loss:3.858 lr:0.0002781 epoch_Time:28742.0min: [2024-01-02 19:14:03,843][model8_pretrain.py][INFO] Epoch:[0/2](15000/4588595) loss:3.172 lr:0.0002781 epoch_Time:28742.0min: [2024-01-02 19:14:03,844][model8_pretrain.py][INFO] Epoch:[0/2](15000/4588595) loss:3.438 lr:0.0002781 epoch_Time:28742.0min: [2024-01-02 19:14:40,779][model8_pretrain.py][INFO] Epoch:[0/2](15100/4588595) loss:3.734 lr:0.0002778 epoch_Time:28738.0min: [2024-01-02 19:14:40,779][model8_pretrain.py][INFO] Epoch:[0/2](15100/4588595) loss:3.236 lr:0.0002778 epoch_Time:28738.0min: [2024-01-02 19:14:40,779][model8_pretrain.py][INFO] Epoch:[0/2](15100/4588595) loss:3.523 lr:0.0002778 epoch_Time:28738.0min: [2024-01-02 19:14:40,779][model8_pretrain.py][INFO] Epoch:[0/2](15100/4588595) loss:3.916 lr:0.0002778 epoch_Time:28738.0min: [2024-01-02 19:14:40,779][model8_pretrain.py][INFO] Epoch:[0/2](15100/4588595) loss:3.189 lr:0.0002778 epoch_Time:28738.0min: [2024-01-02 19:14:40,779][model8_pretrain.py][INFO] Epoch:[0/2](15100/4588595) loss:3.664 lr:0.0002778 epoch_Time:28738.0min: [2024-01-02 19:14:40,779][model8_pretrain.py][INFO] Epoch:[0/2](15100/4588595) loss:3.477 lr:0.0002778 epoch_Time:28738.0min: [2024-01-02 19:14:40,779][model8_pretrain.py][INFO] Epoch:[0/2](15100/4588595) loss:3.032 lr:0.0002778 epoch_Time:28738.0min: [2024-01-02 19:15:17,722][model8_pretrain.py][INFO] Epoch:[0/2](15200/4588595) loss:3.265 lr:0.0002775 epoch_Time:28733.0min: [2024-01-02 19:15:17,722][model8_pretrain.py][INFO] Epoch:[0/2](15200/4588595) 
loss:3.657 lr:0.0002775 epoch_Time:28733.0min: [2024-01-02 19:15:17,722][model8_pretrain.py][INFO] Epoch:[0/2](15200/4588595) loss:3.613 lr:0.0002775 epoch_Time:28733.0min: [2024-01-02 19:15:17,722][model8_pretrain.py][INFO] Epoch:[0/2](15200/4588595) loss:3.458 lr:0.0002775 epoch_Time:28733.0min: [2024-01-02 19:15:17,722][model8_pretrain.py][INFO] Epoch:[0/2](15200/4588595) loss:3.366 lr:0.0002775 epoch_Time:28733.0min: [2024-01-02 19:15:17,722][model8_pretrain.py][INFO] Epoch:[0/2](15200/4588595) loss:3.431 lr:0.0002775 epoch_Time:28733.0min: [2024-01-02 19:15:17,723][model8_pretrain.py][INFO] Epoch:[0/2](15200/4588595) loss:3.651 lr:0.0002775 epoch_Time:28733.0min: [2024-01-02 19:15:17,723][model8_pretrain.py][INFO] Epoch:[0/2](15200/4588595) loss:3.185 lr:0.0002775 epoch_Time:28733.0min: [2024-01-02 19:15:54,675][model8_pretrain.py][INFO] Epoch:[0/2](15300/4588595) loss:3.631 lr:0.0002772 epoch_Time:28728.0min: [2024-01-02 19:15:54,675][model8_pretrain.py][INFO] Epoch:[0/2](15300/4588595) loss:3.500 lr:0.0002772 epoch_Time:28728.0min: [2024-01-02 19:15:54,675][model8_pretrain.py][INFO] Epoch:[0/2](15300/4588595) loss:3.316 lr:0.0002772 epoch_Time:28728.0min: [2024-01-02 19:15:54,675][model8_pretrain.py][INFO] Epoch:[0/2](15300/4588595) loss:3.417 lr:0.0002772 epoch_Time:28728.0min: [2024-01-02 19:15:54,675][model8_pretrain.py][INFO] Epoch:[0/2](15300/4588595) loss:3.654 lr:0.0002772 epoch_Time:28728.0min: [2024-01-02 19:15:54,675][model8_pretrain.py][INFO] Epoch:[0/2](15300/4588595) loss:3.957 lr:0.0002772 epoch_Time:28728.0min: [2024-01-02 19:15:54,676][model8_pretrain.py][INFO] Epoch:[0/2](15300/4588595) loss:3.364 lr:0.0002772 epoch_Time:28728.0min: [2024-01-02 19:15:54,676][model8_pretrain.py][INFO] Epoch:[0/2](15300/4588595) loss:3.428 lr:0.0002772 epoch_Time:28728.0min: [2024-01-02 19:16:31,604][model8_pretrain.py][INFO] Epoch:[0/2](15400/4588595) loss:3.294 lr:0.0002769 epoch_Time:28724.0min: [2024-01-02 19:16:31,604][model8_pretrain.py][INFO] 
Epoch:[0/2](15400/4588595) loss:3.586 lr:0.0002769 epoch_Time:28724.0min: [2024-01-02 19:16:31,605][model8_pretrain.py][INFO] Epoch:[0/2](15400/4588595) loss:3.636 lr:0.0002769 epoch_Time:28724.0min: [2024-01-02 19:16:31,605][model8_pretrain.py][INFO] Epoch:[0/2](15400/4588595) loss:3.744 lr:0.0002769 epoch_Time:28724.0min: [2024-01-02 19:16:31,605][model8_pretrain.py][INFO] Epoch:[0/2](15400/4588595) loss:3.624 lr:0.0002769 epoch_Time:28725.0min: [2024-01-02 19:16:31,605][model8_pretrain.py][INFO] Epoch:[0/2](15400/4588595) loss:3.266 lr:0.0002769 epoch_Time:28724.0min: [2024-01-02 19:16:31,605][model8_pretrain.py][INFO] Epoch:[0/2](15400/4588595) loss:3.536 lr:0.0002769 epoch_Time:28724.0min: [2024-01-02 19:16:31,605][model8_pretrain.py][INFO] Epoch:[0/2](15400/4588595) loss:3.960 lr:0.0002769 epoch_Time:28724.0min: [2024-01-02 19:17:08,537][model8_pretrain.py][INFO] Epoch:[0/2](15500/4588595) loss:3.256 lr:0.0002766 epoch_Time:28720.0min: [2024-01-02 19:17:08,537][model8_pretrain.py][INFO] Epoch:[0/2](15500/4588595) loss:3.618 lr:0.0002766 epoch_Time:28720.0min: [2024-01-02 19:17:08,537][model8_pretrain.py][INFO] Epoch:[0/2](15500/4588595) loss:2.876 lr:0.0002766 epoch_Time:28720.0min: [2024-01-02 19:17:08,537][model8_pretrain.py][INFO] Epoch:[0/2](15500/4588595) loss:3.724 lr:0.0002766 epoch_Time:28720.0min: [2024-01-02 19:17:08,537][model8_pretrain.py][INFO] Epoch:[0/2](15500/4588595) loss:3.532 lr:0.0002766 epoch_Time:28720.0min: [2024-01-02 19:17:08,537][model8_pretrain.py][INFO] Epoch:[0/2](15500/4588595) loss:3.041 lr:0.0002766 epoch_Time:28720.0min: [2024-01-02 19:17:08,537][model8_pretrain.py][INFO] Epoch:[0/2](15500/4588595) loss:3.398 lr:0.0002766 epoch_Time:28720.0min: [2024-01-02 19:17:08,537][model8_pretrain.py][INFO] Epoch:[0/2](15500/4588595) loss:3.649 lr:0.0002766 epoch_Time:28720.0min: [2024-01-02 19:17:45,486][model8_pretrain.py][INFO] Epoch:[0/2](15600/4588595) loss:3.575 lr:0.0002762 epoch_Time:28716.0min: [2024-01-02 
19:17:45,486][model8_pretrain.py][INFO] Epoch:[0/2](15600/4588595) loss:3.250 lr:0.0002762 epoch_Time:28716.0min: [2024-01-02 19:17:45,486][model8_pretrain.py][INFO] Epoch:[0/2](15600/4588595) loss:2.999 lr:0.0002762 epoch_Time:28716.0min: [2024-01-02 19:17:45,486][model8_pretrain.py][INFO] Epoch:[0/2](15600/4588595) loss:3.364 lr:0.0002762 epoch_Time:28716.0min: [2024-01-02 19:17:45,486][model8_pretrain.py][INFO] Epoch:[0/2](15600/4588595) loss:3.436 lr:0.0002762 epoch_Time:28716.0min: [2024-01-02 19:17:45,486][model8_pretrain.py][INFO] Epoch:[0/2](15600/4588595) loss:3.869 lr:0.0002762 epoch_Time:28716.0min: [2024-01-02 19:17:45,486][model8_pretrain.py][INFO] Epoch:[0/2](15600/4588595) loss:3.321 lr:0.0002762 epoch_Time:28716.0min: [2024-01-02 19:17:45,486][model8_pretrain.py][INFO] Epoch:[0/2](15600/4588595) loss:3.397 lr:0.0002762 epoch_Time:28716.0min: [2024-01-02 19:18:26,033][model8_pretrain.py][INFO] Epoch:[0/2](15700/4588595) loss:3.591 lr:0.0002759 epoch_Time:28729.0min: [2024-01-02 19:18:26,033][model8_pretrain.py][INFO] Epoch:[0/2](15700/4588595) loss:3.511 lr:0.0002759 epoch_Time:28729.0min: [2024-01-02 19:18:26,033][model8_pretrain.py][INFO] Epoch:[0/2](15700/4588595) loss:3.404 lr:0.0002759 epoch_Time:28729.0min: [2024-01-02 19:18:26,033][model8_pretrain.py][INFO] Epoch:[0/2](15700/4588595) loss:3.355 lr:0.0002759 epoch_Time:28729.0min: [2024-01-02 19:18:26,033][model8_pretrain.py][INFO] Epoch:[0/2](15700/4588595) loss:3.764 lr:0.0002759 epoch_Time:28729.0min: [2024-01-02 19:18:26,034][model8_pretrain.py][INFO] Epoch:[0/2](15700/4588595) loss:3.426 lr:0.0002759 epoch_Time:28729.0min: [2024-01-02 19:18:26,034][model8_pretrain.py][INFO] Epoch:[0/2](15700/4588595) loss:3.699 lr:0.0002759 epoch_Time:28729.0min: [2024-01-02 19:18:26,034][model8_pretrain.py][INFO] Epoch:[0/2](15700/4588595) loss:2.992 lr:0.0002759 epoch_Time:28729.0min: [2024-01-02 19:19:02,959][model8_pretrain.py][INFO] Epoch:[0/2](15800/4588595) loss:3.796 lr:0.0002756 
epoch_Time:28724.0min: [2024-01-02 19:19:02,959][model8_pretrain.py][INFO] Epoch:[0/2](15800/4588595) loss:3.603 lr:0.0002756 epoch_Time:28724.0min: [2024-01-02 19:19:02,959][model8_pretrain.py][INFO] Epoch:[0/2](15800/4588595) loss:3.332 lr:0.0002756 epoch_Time:28725.0min: [2024-01-02 19:19:02,959][model8_pretrain.py][INFO] Epoch:[0/2](15800/4588595) loss:3.648 lr:0.0002756 epoch_Time:28724.0min: [2024-01-02 19:19:02,959][model8_pretrain.py][INFO] Epoch:[0/2](15800/4588595) loss:3.693 lr:0.0002756 epoch_Time:28724.0min: [2024-01-02 19:19:02,959][model8_pretrain.py][INFO] Epoch:[0/2](15800/4588595) loss:3.647 lr:0.0002756 epoch_Time:28724.0min: [2024-01-02 19:19:02,959][model8_pretrain.py][INFO] Epoch:[0/2](15800/4588595) loss:3.536 lr:0.0002756 epoch_Time:28724.0min: [2024-01-02 19:19:02,959][model8_pretrain.py][INFO] Epoch:[0/2](15800/4588595) loss:3.937 lr:0.0002756 epoch_Time:28724.0min: [2024-01-02 19:19:39,904][model8_pretrain.py][INFO] Epoch:[0/2](15900/4588595) loss:3.712 lr:0.0002753 epoch_Time:28721.0min: [2024-01-02 19:19:39,904][model8_pretrain.py][INFO] Epoch:[0/2](15900/4588595) loss:3.428 lr:0.0002753 epoch_Time:28721.0min: [2024-01-02 19:19:39,904][model8_pretrain.py][INFO] Epoch:[0/2](15900/4588595) loss:3.590 lr:0.0002753 epoch_Time:28721.0min: [2024-01-02 19:19:39,904][model8_pretrain.py][INFO] Epoch:[0/2](15900/4588595) loss:2.993 lr:0.0002753 epoch_Time:28721.0min: [2024-01-02 19:19:39,904][model8_pretrain.py][INFO] Epoch:[0/2](15900/4588595) loss:3.570 lr:0.0002753 epoch_Time:28721.0min: [2024-01-02 19:19:39,904][model8_pretrain.py][INFO] Epoch:[0/2](15900/4588595) loss:3.451 lr:0.0002753 epoch_Time:28721.0min: [2024-01-02 19:19:39,905][model8_pretrain.py][INFO] Epoch:[0/2](15900/4588595) loss:3.548 lr:0.0002753 epoch_Time:28721.0min: [2024-01-02 19:19:39,905][model8_pretrain.py][INFO] Epoch:[0/2](15900/4588595) loss:3.516 lr:0.0002753 epoch_Time:28721.0min: [2024-01-02 19:20:16,864][model8_pretrain.py][INFO] Epoch:[0/2](16000/4588595) 
loss:3.474 lr:0.0002750 epoch_Time:28716.0min: [2024-01-02 19:20:16,864][model8_pretrain.py][INFO] Epoch:[0/2](16000/4588595) loss:3.736 lr:0.0002750 epoch_Time:28716.0min: [2024-01-02 19:20:16,864][model8_pretrain.py][INFO] Epoch:[0/2](16000/4588595) loss:3.773 lr:0.0002750 epoch_Time:28716.0min: [2024-01-02 19:20:16,864][model8_pretrain.py][INFO] Epoch:[0/2](16000/4588595) loss:3.526 lr:0.0002750 epoch_Time:28716.0min: [2024-01-02 19:20:16,864][model8_pretrain.py][INFO] Epoch:[0/2](16000/4588595) loss:3.615 lr:0.0002750 epoch_Time:28716.0min: [2024-01-02 19:20:16,864][model8_pretrain.py][INFO] Epoch:[0/2](16000/4588595) loss:4.060 lr:0.0002750 epoch_Time:28716.0min: [2024-01-02 19:20:16,864][model8_pretrain.py][INFO] Epoch:[0/2](16000/4588595) loss:3.835 lr:0.0002750 epoch_Time:28716.0min: [2024-01-02 19:20:16,865][model8_pretrain.py][INFO] Epoch:[0/2](16000/4588595) loss:3.358 lr:0.0002750 epoch_Time:28716.0min: [2024-01-02 19:20:53,800][model8_pretrain.py][INFO] Epoch:[0/2](16100/4588595) loss:3.696 lr:0.0002746 epoch_Time:28712.0min: [2024-01-02 19:20:53,800][model8_pretrain.py][INFO] Epoch:[0/2](16100/4588595) loss:3.549 lr:0.0002746 epoch_Time:28712.0min: [2024-01-02 19:20:53,800][model8_pretrain.py][INFO] Epoch:[0/2](16100/4588595) loss:3.649 lr:0.0002746 epoch_Time:28712.0min: [2024-01-02 19:20:53,800][model8_pretrain.py][INFO] Epoch:[0/2](16100/4588595) loss:3.099 lr:0.0002746 epoch_Time:28712.0min: [2024-01-02 19:20:53,800][model8_pretrain.py][INFO] Epoch:[0/2](16100/4588595) loss:3.547 lr:0.0002746 epoch_Time:28712.0min: [2024-01-02 19:20:53,800][model8_pretrain.py][INFO] Epoch:[0/2](16100/4588595) loss:3.130 lr:0.0002746 epoch_Time:28712.0min: [2024-01-02 19:20:53,800][model8_pretrain.py][INFO] Epoch:[0/2](16100/4588595) loss:3.479 lr:0.0002746 epoch_Time:28712.0min: [2024-01-02 19:20:53,800][model8_pretrain.py][INFO] Epoch:[0/2](16100/4588595) loss:3.550 lr:0.0002746 epoch_Time:28712.0min: [2024-01-02 19:21:30,741][model8_pretrain.py][INFO] 
Epoch:[0/2](16200/4588595) loss:3.932 lr:0.0002743 epoch_Time:28708.0min: [2024-01-02 19:21:30,741][model8_pretrain.py][INFO] Epoch:[0/2](16200/4588595) loss:3.809 lr:0.0002743 epoch_Time:28708.0min: [2024-01-02 19:21:30,741][model8_pretrain.py][INFO] Epoch:[0/2](16200/4588595) loss:3.835 lr:0.0002743 epoch_Time:28708.0min: [2024-01-02 19:21:30,741][model8_pretrain.py][INFO] Epoch:[0/2](16200/4588595) loss:3.717 lr:0.0002743 epoch_Time:28708.0min: [2024-01-02 19:21:30,741][model8_pretrain.py][INFO] Epoch:[0/2](16200/4588595) loss:3.844 lr:0.0002743 epoch_Time:28708.0min: [2024-01-02 19:21:30,741][model8_pretrain.py][INFO] Epoch:[0/2](16200/4588595) loss:3.829 lr:0.0002743 epoch_Time:28708.0min: [2024-01-02 19:21:30,741][model8_pretrain.py][INFO] Epoch:[0/2](16200/4588595) loss:3.282 lr:0.0002743 epoch_Time:28708.0min: [2024-01-02 19:21:30,741][model8_pretrain.py][INFO] Epoch:[0/2](16200/4588595) loss:3.721 lr:0.0002743 epoch_Time:28708.0min: [2024-01-02 19:22:07,699][model8_pretrain.py][INFO] Epoch:[0/2](16300/4588595) loss:3.124 lr:0.0002740 epoch_Time:28704.0min: [2024-01-02 19:22:07,699][model8_pretrain.py][INFO] Epoch:[0/2](16300/4588595) loss:3.681 lr:0.0002740 epoch_Time:28704.0min: [2024-01-02 19:22:07,699][model8_pretrain.py][INFO] Epoch:[0/2](16300/4588595) loss:3.681 lr:0.0002740 epoch_Time:28704.0min: [2024-01-02 19:22:07,699][model8_pretrain.py][INFO] Epoch:[0/2](16300/4588595) loss:3.543 lr:0.0002740 epoch_Time:28704.0min: [2024-01-02 19:22:07,699][model8_pretrain.py][INFO] Epoch:[0/2](16300/4588595) loss:2.737 lr:0.0002740 epoch_Time:28704.0min: [2024-01-02 19:22:07,699][model8_pretrain.py][INFO] Epoch:[0/2](16300/4588595) loss:3.305 lr:0.0002740 epoch_Time:28704.0min: [2024-01-02 19:22:07,699][model8_pretrain.py][INFO] Epoch:[0/2](16300/4588595) loss:3.984 lr:0.0002740 epoch_Time:28704.0min: [2024-01-02 19:22:07,699][model8_pretrain.py][INFO] Epoch:[0/2](16300/4588595) loss:3.612 lr:0.0002740 epoch_Time:28704.0min: [2024-01-02 
19:22:44,643][model8_pretrain.py][INFO] Epoch:[0/2](16400/4588595) loss:3.537 lr:0.0002736 epoch_Time:28701.0min: [2024-01-02 19:22:44,643][model8_pretrain.py][INFO] Epoch:[0/2](16400/4588595) loss:3.244 lr:0.0002736 epoch_Time:28700.0min: [2024-01-02 19:22:44,643][model8_pretrain.py][INFO] Epoch:[0/2](16400/4588595) loss:3.279 lr:0.0002736 epoch_Time:28701.0min: [2024-01-02 19:22:44,643][model8_pretrain.py][INFO] Epoch:[0/2](16400/4588595) loss:3.381 lr:0.0002736 epoch_Time:28701.0min: [2024-01-02 19:22:44,643][model8_pretrain.py][INFO] Epoch:[0/2](16400/4588595) loss:3.344 lr:0.0002736 epoch_Time:28701.0min: [2024-01-02 19:22:44,643][model8_pretrain.py][INFO] Epoch:[0/2](16400/4588595) loss:3.334 lr:0.0002736 epoch_Time:28701.0min: [2024-01-02 19:22:44,643][model8_pretrain.py][INFO] Epoch:[0/2](16400/4588595) loss:3.555 lr:0.0002736 epoch_Time:28701.0min: [2024-01-02 19:22:44,643][model8_pretrain.py][INFO] Epoch:[0/2](16400/4588595) loss:3.086 lr:0.0002736 epoch_Time:28701.0min: [2024-01-02 19:23:25,369][model8_pretrain.py][INFO] Epoch:[0/2](16500/4588595) loss:3.898 lr:0.0002733 epoch_Time:28714.0min: [2024-01-02 19:23:25,369][model8_pretrain.py][INFO] Epoch:[0/2](16500/4588595) loss:3.388 lr:0.0002733 epoch_Time:28714.0min: [2024-01-02 19:23:25,369][model8_pretrain.py][INFO] Epoch:[0/2](16500/4588595) loss:3.641 lr:0.0002733 epoch_Time:28714.0min: [2024-01-02 19:23:25,369][model8_pretrain.py][INFO] Epoch:[0/2](16500/4588595) loss:3.541 lr:0.0002733 epoch_Time:28714.0min: [2024-01-02 19:23:25,369][model8_pretrain.py][INFO] Epoch:[0/2](16500/4588595) loss:3.357 lr:0.0002733 epoch_Time:28714.0min: [2024-01-02 19:23:25,369][model8_pretrain.py][INFO] Epoch:[0/2](16500/4588595) loss:3.835 lr:0.0002733 epoch_Time:28714.0min: [2024-01-02 19:23:25,369][model8_pretrain.py][INFO] Epoch:[0/2](16500/4588595) loss:3.631 lr:0.0002733 epoch_Time:28714.0min: [2024-01-02 19:23:25,369][model8_pretrain.py][INFO] Epoch:[0/2](16500/4588595) loss:3.600 lr:0.0002733 
epoch_Time:28714.0min: [2024-01-02 19:24:02,310][model8_pretrain.py][INFO] Epoch:[0/2](16600/4588595) loss:3.687 lr:0.0002730 epoch_Time:28709.0min: [2024-01-02 19:24:02,310][model8_pretrain.py][INFO] Epoch:[0/2](16600/4588595) loss:3.334 lr:0.0002730 epoch_Time:28709.0min: [2024-01-02 19:24:02,310][model8_pretrain.py][INFO] Epoch:[0/2](16600/4588595) loss:3.627 lr:0.0002730 epoch_Time:28709.0min: [2024-01-02 19:24:02,310][model8_pretrain.py][INFO] Epoch:[0/2](16600/4588595) loss:3.359 lr:0.0002730 epoch_Time:28709.0min: [2024-01-02 19:24:02,310][model8_pretrain.py][INFO] Epoch:[0/2](16600/4588595) loss:3.385 lr:0.0002730 epoch_Time:28709.0min: [2024-01-02 19:24:02,310][model8_pretrain.py][INFO] Epoch:[0/2](16600/4588595) loss:3.400 lr:0.0002730 epoch_Time:28709.0min: [2024-01-02 19:24:02,310][model8_pretrain.py][INFO] Epoch:[0/2](16600/4588595) loss:3.630 lr:0.0002730 epoch_Time:28710.0min: [2024-01-02 19:24:02,311][model8_pretrain.py][INFO] Epoch:[0/2](16600/4588595) loss:3.323 lr:0.0002730 epoch_Time:28709.0min: [2024-01-02 19:24:39,264][model8_pretrain.py][INFO] Epoch:[0/2](16700/4588595) loss:3.522 lr:0.0002726 epoch_Time:28706.0min: [2024-01-02 19:24:39,264][model8_pretrain.py][INFO] Epoch:[0/2](16700/4588595) loss:3.888 lr:0.0002726 epoch_Time:28706.0min: [2024-01-02 19:24:39,264][model8_pretrain.py][INFO] Epoch:[0/2](16700/4588595) loss:3.418 lr:0.0002726 epoch_Time:28706.0min: [2024-01-02 19:24:39,264][model8_pretrain.py][INFO] Epoch:[0/2](16700/4588595) loss:3.109 lr:0.0002726 epoch_Time:28706.0min: [2024-01-02 19:24:39,264][model8_pretrain.py][INFO] Epoch:[0/2](16700/4588595) loss:4.009 lr:0.0002726 epoch_Time:28706.0min: [2024-01-02 19:24:39,264][model8_pretrain.py][INFO] Epoch:[0/2](16700/4588595) loss:3.698 lr:0.0002726 epoch_Time:28706.0min: [2024-01-02 19:24:39,264][model8_pretrain.py][INFO] Epoch:[0/2](16700/4588595) loss:3.404 lr:0.0002726 epoch_Time:28706.0min: [2024-01-02 19:24:39,264][model8_pretrain.py][INFO] Epoch:[0/2](16700/4588595) 
loss:3.724 lr:0.0002726 epoch_Time:28706.0min: [2024-01-02 19:25:16,204][model8_pretrain.py][INFO] Epoch:[0/2](16800/4588595) loss:3.799 lr:0.0002723 epoch_Time:28702.0min: [2024-01-02 19:25:16,204][model8_pretrain.py][INFO] Epoch:[0/2](16800/4588595) loss:2.777 lr:0.0002723 epoch_Time:28702.0min: [2024-01-02 19:25:16,204][model8_pretrain.py][INFO] Epoch:[0/2](16800/4588595) loss:2.924 lr:0.0002723 epoch_Time:28702.0min: [2024-01-02 19:25:16,204][model8_pretrain.py][INFO] Epoch:[0/2](16800/4588595) loss:3.615 lr:0.0002723 epoch_Time:28702.0min: [2024-01-02 19:25:16,204][model8_pretrain.py][INFO] Epoch:[0/2](16800/4588595) loss:3.274 lr:0.0002723 epoch_Time:28702.0min: [2024-01-02 19:25:16,204][model8_pretrain.py][INFO] Epoch:[0/2](16800/4588595) loss:3.839 lr:0.0002723 epoch_Time:28702.0min: [2024-01-02 19:25:16,204][model8_pretrain.py][INFO] Epoch:[0/2](16800/4588595) loss:3.873 lr:0.0002723 epoch_Time:28702.0min: [2024-01-02 19:25:16,204][model8_pretrain.py][INFO] Epoch:[0/2](16800/4588595) loss:3.384 lr:0.0002723 epoch_Time:28702.0min: [2024-01-02 19:25:53,152][model8_pretrain.py][INFO] Epoch:[0/2](16900/4588595) loss:3.282 lr:0.0002720 epoch_Time:28698.0min: [2024-01-02 19:25:53,152][model8_pretrain.py][INFO] Epoch:[0/2](16900/4588595) loss:3.775 lr:0.0002720 epoch_Time:28697.0min: [2024-01-02 19:25:53,152][model8_pretrain.py][INFO] Epoch:[0/2](16900/4588595) loss:3.779 lr:0.0002720 epoch_Time:28698.0min: [2024-01-02 19:25:53,152][model8_pretrain.py][INFO] Epoch:[0/2](16900/4588595) loss:3.463 lr:0.0002720 epoch_Time:28697.0min: [2024-01-02 19:25:53,152][model8_pretrain.py][INFO] Epoch:[0/2](16900/4588595) loss:3.738 lr:0.0002720 epoch_Time:28697.0min: [2024-01-02 19:25:53,152][model8_pretrain.py][INFO] Epoch:[0/2](16900/4588595) loss:3.587 lr:0.0002720 epoch_Time:28697.0min: [2024-01-02 19:25:53,152][model8_pretrain.py][INFO] Epoch:[0/2](16900/4588595) loss:3.315 lr:0.0002720 epoch_Time:28697.0min: [2024-01-02 19:25:53,152][model8_pretrain.py][INFO] 
Epoch:[0/2](16900/4588595) loss:3.336 lr:0.0002720 epoch_Time:28697.0min: [2024-01-02 19:26:30,118][model8_pretrain.py][INFO] Epoch:[0/2](17000/4588595) loss:3.481 lr:0.0002716 epoch_Time:28694.0min: [2024-01-02 19:26:30,118][model8_pretrain.py][INFO] Epoch:[0/2](17000/4588595) loss:3.392 lr:0.0002716 epoch_Time:28694.0min: [2024-01-02 19:26:30,118][model8_pretrain.py][INFO] Epoch:[0/2](17000/4588595) loss:3.618 lr:0.0002716 epoch_Time:28694.0min: [2024-01-02 19:26:30,118][model8_pretrain.py][INFO] Epoch:[0/2](17000/4588595) loss:3.598 lr:0.0002716 epoch_Time:28694.0min: [2024-01-02 19:26:30,118][model8_pretrain.py][INFO] Epoch:[0/2](17000/4588595) loss:3.417 lr:0.0002716 epoch_Time:28694.0min: [2024-01-02 19:26:30,118][model8_pretrain.py][INFO] Epoch:[0/2](17000/4588595) loss:3.028 lr:0.0002716 epoch_Time:28694.0min: [2024-01-02 19:26:30,118][model8_pretrain.py][INFO] Epoch:[0/2](17000/4588595) loss:3.488 lr:0.0002716 epoch_Time:28694.0min: [2024-01-02 19:26:30,118][model8_pretrain.py][INFO] Epoch:[0/2](17000/4588595) loss:4.001 lr:0.0002716 epoch_Time:28694.0min: [2024-01-02 19:27:07,074][model8_pretrain.py][INFO] Epoch:[0/2](17100/4588595) loss:3.795 lr:0.0002713 epoch_Time:28690.0min: [2024-01-02 19:27:07,074][model8_pretrain.py][INFO] Epoch:[0/2](17100/4588595) loss:3.766 lr:0.0002713 epoch_Time:28690.0min: [2024-01-02 19:27:07,074][model8_pretrain.py][INFO] Epoch:[0/2](17100/4588595) loss:3.420 lr:0.0002713 epoch_Time:28690.0min: [2024-01-02 19:27:07,074][model8_pretrain.py][INFO] Epoch:[0/2](17100/4588595) loss:3.741 lr:0.0002713 epoch_Time:28690.0min: [2024-01-02 19:27:07,074][model8_pretrain.py][INFO] Epoch:[0/2](17100/4588595) loss:3.721 lr:0.0002713 epoch_Time:28690.0min: [2024-01-02 19:27:07,074][model8_pretrain.py][INFO] Epoch:[0/2](17100/4588595) loss:3.724 lr:0.0002713 epoch_Time:28690.0min: [2024-01-02 19:27:07,074][model8_pretrain.py][INFO] Epoch:[0/2](17100/4588595) loss:3.455 lr:0.0002713 epoch_Time:28690.0min: [2024-01-02 
19:27:07,074][model8_pretrain.py][INFO] Epoch:[0/2](17100/4588595) loss:2.740 lr:0.0002713 epoch_Time:28690.0min: [2024-01-02 19:27:44,012][model8_pretrain.py][INFO] Epoch:[0/2](17200/4588595) loss:3.498 lr:0.0002709 epoch_Time:28687.0min: [2024-01-02 19:27:44,012][model8_pretrain.py][INFO] Epoch:[0/2](17200/4588595) loss:3.619 lr:0.0002709 epoch_Time:28687.0min: [2024-01-02 19:27:44,012][model8_pretrain.py][INFO] Epoch:[0/2](17200/4588595) loss:3.849 lr:0.0002709 epoch_Time:28687.0min: [2024-01-02 19:27:44,012][model8_pretrain.py][INFO] Epoch:[0/2](17200/4588595) loss:3.436 lr:0.0002709 epoch_Time:28687.0min: [2024-01-02 19:27:44,012][model8_pretrain.py][INFO] Epoch:[0/2](17200/4588595) loss:3.360 lr:0.0002709 epoch_Time:28687.0min: [2024-01-02 19:27:44,012][model8_pretrain.py][INFO] Epoch:[0/2](17200/4588595) loss:3.570 lr:0.0002709 epoch_Time:28687.0min: [2024-01-02 19:27:44,012][model8_pretrain.py][INFO] Epoch:[0/2](17200/4588595) loss:3.926 lr:0.0002709 epoch_Time:28687.0min: [2024-01-02 19:27:44,012][model8_pretrain.py][INFO] Epoch:[0/2](17200/4588595) loss:3.768 lr:0.0002709 epoch_Time:28687.0min: [2024-01-02 19:28:24,558][model8_pretrain.py][INFO] Epoch:[0/2](17300/4588595) loss:3.363 lr:0.0002706 epoch_Time:28699.0min: [2024-01-02 19:28:24,558][model8_pretrain.py][INFO] Epoch:[0/2](17300/4588595) loss:3.498 lr:0.0002706 epoch_Time:28699.0min: [2024-01-02 19:28:24,558][model8_pretrain.py][INFO] Epoch:[0/2](17300/4588595) loss:3.766 lr:0.0002706 epoch_Time:28699.0min: [2024-01-02 19:28:24,558][model8_pretrain.py][INFO] Epoch:[0/2](17300/4588595) loss:3.333 lr:0.0002706 epoch_Time:28699.0min: [2024-01-02 19:28:24,558][model8_pretrain.py][INFO] Epoch:[0/2](17300/4588595) loss:3.648 lr:0.0002706 epoch_Time:28699.0min: [2024-01-02 19:28:24,558][model8_pretrain.py][INFO] Epoch:[0/2](17300/4588595) loss:3.688 lr:0.0002706 epoch_Time:28699.0min: [2024-01-02 19:28:24,558][model8_pretrain.py][INFO] Epoch:[0/2](17300/4588595) loss:3.847 lr:0.0002706 
epoch_Time:28699.0min: [2024-01-02 19:28:24,558][model8_pretrain.py][INFO] Epoch:[0/2](17300/4588595) loss:3.240 lr:0.0002706 epoch_Time:28699.0min: [2024-01-02 19:29:01,498][model8_pretrain.py][INFO] Epoch:[0/2](17400/4588595) loss:3.955 lr:0.0002702 epoch_Time:28695.0min: [2024-01-02 19:29:01,498][model8_pretrain.py][INFO] Epoch:[0/2](17400/4588595) loss:3.649 lr:0.0002702 epoch_Time:28695.0min: [2024-01-02 19:29:01,498][model8_pretrain.py][INFO] Epoch:[0/2](17400/4588595) loss:3.554 lr:0.0002702 epoch_Time:28695.0min: [2024-01-02 19:29:01,498][model8_pretrain.py][INFO] Epoch:[0/2](17400/4588595) loss:3.459 lr:0.0002702 epoch_Time:28695.0min: [2024-01-02 19:29:01,498][model8_pretrain.py][INFO] Epoch:[0/2](17400/4588595) loss:3.187 lr:0.0002702 epoch_Time:28695.0min: [2024-01-02 19:29:01,498][model8_pretrain.py][INFO] Epoch:[0/2](17400/4588595) loss:3.289 lr:0.0002702 epoch_Time:28695.0min: [2024-01-02 19:29:01,498][model8_pretrain.py][INFO] Epoch:[0/2](17400/4588595) loss:3.232 lr:0.0002702 epoch_Time:28695.0min: [2024-01-02 19:29:01,499][model8_pretrain.py][INFO] Epoch:[0/2](17400/4588595) loss:3.854 lr:0.0002702 epoch_Time:28695.0min: [2024-01-02 19:29:38,440][model8_pretrain.py][INFO] Epoch:[0/2](17500/4588595) loss:3.359 lr:0.0002699 epoch_Time:28691.0min: [2024-01-02 19:29:38,440][model8_pretrain.py][INFO] Epoch:[0/2](17500/4588595) loss:3.323 lr:0.0002699 epoch_Time:28691.0min: [2024-01-02 19:29:38,440][model8_pretrain.py][INFO] Epoch:[0/2](17500/4588595) loss:3.648 lr:0.0002699 epoch_Time:28691.0min: [2024-01-02 19:29:38,440][model8_pretrain.py][INFO] Epoch:[0/2](17500/4588595) loss:3.635 lr:0.0002699 epoch_Time:28692.0min: [2024-01-02 19:29:38,440][model8_pretrain.py][INFO] Epoch:[0/2](17500/4588595) loss:3.965 lr:0.0002699 epoch_Time:28691.0min: [2024-01-02 19:29:38,440][model8_pretrain.py][INFO] Epoch:[0/2](17500/4588595) loss:3.458 lr:0.0002699 epoch_Time:28691.0min: [2024-01-02 19:29:38,440][model8_pretrain.py][INFO] Epoch:[0/2](17500/4588595) 
loss:3.597 lr:0.0002699 epoch_Time:28691.0min: [2024-01-02 19:29:38,440][model8_pretrain.py][INFO] Epoch:[0/2](17500/4588595) loss:3.741 lr:0.0002699 epoch_Time:28691.0min: [2024-01-02 19:30:15,396][model8_pretrain.py][INFO] Epoch:[0/2](17600/4588595) loss:3.752 lr:0.0002695 epoch_Time:28687.0min: [2024-01-02 19:30:15,396][model8_pretrain.py][INFO] Epoch:[0/2](17600/4588595) loss:3.583 lr:0.0002695 epoch_Time:28687.0min: [2024-01-02 19:30:15,396][model8_pretrain.py][INFO] Epoch:[0/2](17600/4588595) loss:3.457 lr:0.0002695 epoch_Time:28687.0min: [2024-01-02 19:30:15,396][model8_pretrain.py][INFO] Epoch:[0/2](17600/4588595) loss:3.687 lr:0.0002695 epoch_Time:28687.0min: [2024-01-02 19:30:15,396][model8_pretrain.py][INFO] Epoch:[0/2](17600/4588595) loss:3.664 lr:0.0002695 epoch_Time:28687.0min: [2024-01-02 19:30:15,396][model8_pretrain.py][INFO] Epoch:[0/2](17600/4588595) loss:3.552 lr:0.0002695 epoch_Time:28687.0min: [2024-01-02 19:30:15,396][model8_pretrain.py][INFO] Epoch:[0/2](17600/4588595) loss:3.770 lr:0.0002695 epoch_Time:28688.0min: [2024-01-02 19:30:15,396][model8_pretrain.py][INFO] Epoch:[0/2](17600/4588595) loss:3.356 lr:0.0002695 epoch_Time:28687.0min: [2024-01-02 19:30:52,333][model8_pretrain.py][INFO] Epoch:[0/2](17700/4588595) loss:3.589 lr:0.0002692 epoch_Time:28683.0min: [2024-01-02 19:30:52,333][model8_pretrain.py][INFO] Epoch:[0/2](17700/4588595) loss:3.448 lr:0.0002692 epoch_Time:28683.0min: [2024-01-02 19:30:52,333][model8_pretrain.py][INFO] Epoch:[0/2](17700/4588595) loss:3.340 lr:0.0002692 epoch_Time:28683.0min: [2024-01-02 19:30:52,333][model8_pretrain.py][INFO] Epoch:[0/2](17700/4588595) loss:3.388 lr:0.0002692 epoch_Time:28683.0min: [2024-01-02 19:30:52,333][model8_pretrain.py][INFO] Epoch:[0/2](17700/4588595) loss:3.738 lr:0.0002692 epoch_Time:28683.0min: [2024-01-02 19:30:52,333][model8_pretrain.py][INFO] Epoch:[0/2](17700/4588595) loss:3.657 lr:0.0002692 epoch_Time:28683.0min: [2024-01-02 19:30:52,333][model8_pretrain.py][INFO] 
Epoch:[0/2](17700/4588595) loss:3.065 lr:0.0002692 epoch_Time:28683.0min: [2024-01-02 19:30:52,333][model8_pretrain.py][INFO] Epoch:[0/2](17700/4588595) loss:3.113 lr:0.0002692 epoch_Time:28683.0min: [2024-01-02 19:31:29,281][model8_pretrain.py][INFO] Epoch:[0/2](17800/4588595) loss:3.443 lr:0.0002688 epoch_Time:28680.0min: [2024-01-02 19:31:29,281][model8_pretrain.py][INFO] Epoch:[0/2](17800/4588595) loss:3.692 lr:0.0002688 epoch_Time:28680.0min: [2024-01-02 19:31:29,281][model8_pretrain.py][INFO] Epoch:[0/2](17800/4588595) loss:2.999 lr:0.0002688 epoch_Time:28680.0min: [2024-01-02 19:31:29,281][model8_pretrain.py][INFO] Epoch:[0/2](17800/4588595) loss:3.500 lr:0.0002688 epoch_Time:28680.0min: [2024-01-02 19:31:29,281][model8_pretrain.py][INFO] Epoch:[0/2](17800/4588595) loss:3.585 lr:0.0002688 epoch_Time:28680.0min: [2024-01-02 19:31:29,281][model8_pretrain.py][INFO] Epoch:[0/2](17800/4588595) loss:3.868 lr:0.0002688 epoch_Time:28680.0min: [2024-01-02 19:31:29,281][model8_pretrain.py][INFO] Epoch:[0/2](17800/4588595) loss:4.052 lr:0.0002688 epoch_Time:28680.0min: [2024-01-02 19:31:29,281][model8_pretrain.py][INFO] Epoch:[0/2](17800/4588595) loss:3.501 lr:0.0002688 epoch_Time:28680.0min: [2024-01-02 19:32:06,220][model8_pretrain.py][INFO] Epoch:[0/2](17900/4588595) loss:2.586 lr:0.0002685 epoch_Time:28676.0min: [2024-01-02 19:32:06,220][model8_pretrain.py][INFO] Epoch:[0/2](17900/4588595) loss:3.073 lr:0.0002685 epoch_Time:28676.0min: [2024-01-02 19:32:06,220][model8_pretrain.py][INFO] Epoch:[0/2](17900/4588595) loss:3.706 lr:0.0002685 epoch_Time:28676.0min: [2024-01-02 19:32:06,220][model8_pretrain.py][INFO] Epoch:[0/2](17900/4588595) loss:3.957 lr:0.0002685 epoch_Time:28676.0min: [2024-01-02 19:32:06,220][model8_pretrain.py][INFO] Epoch:[0/2](17900/4588595) loss:3.179 lr:0.0002685 epoch_Time:28676.0min: [2024-01-02 19:32:06,220][model8_pretrain.py][INFO] Epoch:[0/2](17900/4588595) loss:3.394 lr:0.0002685 epoch_Time:28676.0min: [2024-01-02 
19:32:06,220][model8_pretrain.py][INFO] Epoch:[0/2](17900/4588595) loss:3.932 lr:0.0002685 epoch_Time:28676.0min: [2024-01-02 19:32:06,220][model8_pretrain.py][INFO] Epoch:[0/2](17900/4588595) loss:3.705 lr:0.0002685 epoch_Time:28676.0min: [2024-01-02 19:32:43,169][model8_pretrain.py][INFO] Epoch:[0/2](18000/4588595) loss:3.410 lr:0.0002681 epoch_Time:28673.0min: [2024-01-02 19:32:43,169][model8_pretrain.py][INFO] Epoch:[0/2](18000/4588595) loss:3.782 lr:0.0002681 epoch_Time:28673.0min: [2024-01-02 19:32:43,169][model8_pretrain.py][INFO] Epoch:[0/2](18000/4588595) loss:3.424 lr:0.0002681 epoch_Time:28673.0min: [2024-01-02 19:32:43,169][model8_pretrain.py][INFO] Epoch:[0/2](18000/4588595) loss:3.453 lr:0.0002681 epoch_Time:28673.0min: [2024-01-02 19:32:43,169][model8_pretrain.py][INFO] Epoch:[0/2](18000/4588595) loss:3.530 lr:0.0002681 epoch_Time:28673.0min: [2024-01-02 19:32:43,169][model8_pretrain.py][INFO] Epoch:[0/2](18000/4588595) loss:3.935 lr:0.0002681 epoch_Time:28673.0min: [2024-01-02 19:32:43,169][model8_pretrain.py][INFO] Epoch:[0/2](18000/4588595) loss:3.695 lr:0.0002681 epoch_Time:28673.0min: [2024-01-02 19:32:43,170][model8_pretrain.py][INFO] Epoch:[0/2](18000/4588595) loss:3.656 lr:0.0002681 epoch_Time:28673.0min: [2024-01-02 19:33:23,720][model8_pretrain.py][INFO] Epoch:[0/2](18100/4588595) loss:3.564 lr:0.0002677 epoch_Time:28685.0min: [2024-01-02 19:33:23,720][model8_pretrain.py][INFO] Epoch:[0/2](18100/4588595) loss:3.376 lr:0.0002677 epoch_Time:28684.0min: [2024-01-02 19:33:23,720][model8_pretrain.py][INFO] Epoch:[0/2](18100/4588595) loss:3.753 lr:0.0002677 epoch_Time:28684.0min: [2024-01-02 19:33:23,720][model8_pretrain.py][INFO] Epoch:[0/2](18100/4588595) loss:3.916 lr:0.0002677 epoch_Time:28685.0min: [2024-01-02 19:33:23,720][model8_pretrain.py][INFO] Epoch:[0/2](18100/4588595) loss:3.339 lr:0.0002677 epoch_Time:28685.0min: [2024-01-02 19:33:23,720][model8_pretrain.py][INFO] Epoch:[0/2](18100/4588595) loss:3.012 lr:0.0002677 
epoch_Time:28685.0min: [2024-01-02 19:33:23,720][model8_pretrain.py][INFO] Epoch:[0/2](18100/4588595) loss:3.457 lr:0.0002677 epoch_Time:28685.0min: [2024-01-02 19:33:23,720][model8_pretrain.py][INFO] Epoch:[0/2](18100/4588595) loss:3.781 lr:0.0002677 epoch_Time:28685.0min: [2024-01-02 19:34:00,674][model8_pretrain.py][INFO] Epoch:[0/2](18200/4588595) loss:3.574 lr:0.0002674 epoch_Time:28681.0min: [2024-01-02 19:34:00,674][model8_pretrain.py][INFO] Epoch:[0/2](18200/4588595) loss:3.419 lr:0.0002674 epoch_Time:28681.0min: [2024-01-02 19:34:00,674][model8_pretrain.py][INFO] Epoch:[0/2](18200/4588595) loss:3.438 lr:0.0002674 epoch_Time:28681.0min: [2024-01-02 19:34:00,674][model8_pretrain.py][INFO] Epoch:[0/2](18200/4588595) loss:3.350 lr:0.0002674 epoch_Time:28681.0min: [2024-01-02 19:34:00,674][model8_pretrain.py][INFO] Epoch:[0/2](18200/4588595) loss:3.690 lr:0.0002674 epoch_Time:28681.0min: [2024-01-02 19:34:00,674][model8_pretrain.py][INFO] Epoch:[0/2](18200/4588595) loss:3.611 lr:0.0002674 epoch_Time:28681.0min: [2024-01-02 19:34:00,674][model8_pretrain.py][INFO] Epoch:[0/2](18200/4588595) loss:2.974 lr:0.0002674 epoch_Time:28681.0min: [2024-01-02 19:34:00,674][model8_pretrain.py][INFO] Epoch:[0/2](18200/4588595) loss:3.154 lr:0.0002674 epoch_Time:28681.0min: [2024-01-02 19:34:37,630][model8_pretrain.py][INFO] Epoch:[0/2](18300/4588595) loss:3.968 lr:0.0002670 epoch_Time:28678.0min: [2024-01-02 19:34:37,630][model8_pretrain.py][INFO] Epoch:[0/2](18300/4588595) loss:3.533 lr:0.0002670 epoch_Time:28678.0min: [2024-01-02 19:34:37,630][model8_pretrain.py][INFO] Epoch:[0/2](18300/4588595) loss:3.915 lr:0.0002670 epoch_Time:28678.0min: [2024-01-02 19:34:37,630][model8_pretrain.py][INFO] Epoch:[0/2](18300/4588595) loss:3.664 lr:0.0002670 epoch_Time:28678.0min: [2024-01-02 19:34:37,630][model8_pretrain.py][INFO] Epoch:[0/2](18300/4588595) loss:3.670 lr:0.0002670 epoch_Time:28678.0min: [2024-01-02 19:34:37,630][model8_pretrain.py][INFO] Epoch:[0/2](18300/4588595) 
loss:3.500 lr:0.0002670 epoch_Time:28678.0min: [2024-01-02 19:34:37,630][model8_pretrain.py][INFO] Epoch:[0/2](18300/4588595) loss:3.755 lr:0.0002670 epoch_Time:28678.0min: [2024-01-02 19:34:37,630][model8_pretrain.py][INFO] Epoch:[0/2](18300/4588595) loss:3.869 lr:0.0002670 epoch_Time:28678.0min: [2024-01-02 19:35:14,597][model8_pretrain.py][INFO] Epoch:[0/2](18400/4588595) loss:3.290 lr:0.0002667 epoch_Time:28674.0min: [2024-01-02 19:35:14,597][model8_pretrain.py][INFO] Epoch:[0/2](18400/4588595) loss:3.883 lr:0.0002667 epoch_Time:28674.0min: [2024-01-02 19:35:14,597][model8_pretrain.py][INFO] Epoch:[0/2](18400/4588595) loss:3.700 lr:0.0002667 epoch_Time:28674.0min: [2024-01-02 19:35:14,597][model8_pretrain.py][INFO] Epoch:[0/2](18400/4588595) loss:3.340 lr:0.0002667 epoch_Time:28674.0min: [2024-01-02 19:35:14,597][model8_pretrain.py][INFO] Epoch:[0/2](18400/4588595) loss:3.656 lr:0.0002667 epoch_Time:28674.0min: [2024-01-02 19:35:14,597][model8_pretrain.py][INFO] Epoch:[0/2](18400/4588595) loss:3.449 lr:0.0002667 epoch_Time:28674.0min: [2024-01-02 19:35:14,597][model8_pretrain.py][INFO] Epoch:[0/2](18400/4588595) loss:3.008 lr:0.0002667 epoch_Time:28674.0min: [2024-01-02 19:35:14,597][model8_pretrain.py][INFO] Epoch:[0/2](18400/4588595) loss:3.915 lr:0.0002667 epoch_Time:28674.0min: [2024-01-02 19:35:51,553][model8_pretrain.py][INFO] Epoch:[0/2](18500/4588595) loss:3.946 lr:0.0002663 epoch_Time:28670.0min: [2024-01-02 19:35:51,553][model8_pretrain.py][INFO] Epoch:[0/2](18500/4588595) loss:3.871 lr:0.0002663 epoch_Time:28670.0min: [2024-01-02 19:35:51,553][model8_pretrain.py][INFO] Epoch:[0/2](18500/4588595) loss:3.373 lr:0.0002663 epoch_Time:28670.0min: [2024-01-02 19:35:51,553][model8_pretrain.py][INFO] Epoch:[0/2](18500/4588595) loss:3.267 lr:0.0002663 epoch_Time:28670.0min: [2024-01-02 19:35:51,553][model8_pretrain.py][INFO] Epoch:[0/2](18500/4588595) loss:3.654 lr:0.0002663 epoch_Time:28670.0min: [2024-01-02 19:35:51,553][model8_pretrain.py][INFO] 
Epoch:[0/2](18500/4588595) loss:3.371 lr:0.0002663 epoch_Time:28670.0min: [2024-01-02 19:35:51,553][model8_pretrain.py][INFO] Epoch:[0/2](18500/4588595) loss:3.289 lr:0.0002663 epoch_Time:28670.0min: [2024-01-02 19:35:51,553][model8_pretrain.py][INFO] Epoch:[0/2](18500/4588595) loss:4.029 lr:0.0002663 epoch_Time:28670.0min: [2024-01-02 19:36:28,504][model8_pretrain.py][INFO] Epoch:[0/2](18600/4588595) loss:3.015 lr:0.0002659 epoch_Time:28667.0min: [2024-01-02 19:36:28,504][model8_pretrain.py][INFO] Epoch:[0/2](18600/4588595) loss:3.380 lr:0.0002659 epoch_Time:28667.0min: [2024-01-02 19:36:28,504][model8_pretrain.py][INFO] Epoch:[0/2](18600/4588595) loss:3.996 lr:0.0002659 epoch_Time:28667.0min: [2024-01-02 19:36:28,504][model8_pretrain.py][INFO] Epoch:[0/2](18600/4588595) loss:3.834 lr:0.0002659 epoch_Time:28667.0min: [2024-01-02 19:36:28,504][model8_pretrain.py][INFO] Epoch:[0/2](18600/4588595) loss:3.714 lr:0.0002659 epoch_Time:28667.0min: [2024-01-02 19:36:28,504][model8_pretrain.py][INFO] Epoch:[0/2](18600/4588595) loss:3.806 lr:0.0002659 epoch_Time:28667.0min: [2024-01-02 19:36:28,504][model8_pretrain.py][INFO] Epoch:[0/2](18600/4588595) loss:3.532 lr:0.0002659 epoch_Time:28667.0min: [2024-01-02 19:36:28,504][model8_pretrain.py][INFO] Epoch:[0/2](18600/4588595) loss:2.817 lr:0.0002659 epoch_Time:28667.0min: [2024-01-02 19:37:05,446][model8_pretrain.py][INFO] Epoch:[0/2](18700/4588595) loss:3.923 lr:0.0002655 epoch_Time:28663.0min: [2024-01-02 19:37:05,446][model8_pretrain.py][INFO] Epoch:[0/2](18700/4588595) loss:3.271 lr:0.0002655 epoch_Time:28663.0min: [2024-01-02 19:37:05,446][model8_pretrain.py][INFO] Epoch:[0/2](18700/4588595) loss:3.518 lr:0.0002655 epoch_Time:28663.0min: [2024-01-02 19:37:05,447][model8_pretrain.py][INFO] Epoch:[0/2](18700/4588595) loss:3.613 lr:0.0002655 epoch_Time:28663.0min: [2024-01-02 19:37:05,447][model8_pretrain.py][INFO] Epoch:[0/2](18700/4588595) loss:3.494 lr:0.0002655 epoch_Time:28663.0min: [2024-01-02 
19:37:05,447][model8_pretrain.py][INFO] Epoch:[0/2](18700/4588595) loss:3.713 lr:0.0002655 epoch_Time:28663.0min: [2024-01-02 19:37:05,447][model8_pretrain.py][INFO] Epoch:[0/2](18700/4588595) loss:3.648 lr:0.0002655 epoch_Time:28663.0min: [2024-01-02 19:37:05,447][model8_pretrain.py][INFO] Epoch:[0/2](18700/4588595) loss:3.437 lr:0.0002655 epoch_Time:28663.0min: [2024-01-02 19:37:42,394][model8_pretrain.py][INFO] Epoch:[0/2](18800/4588595) loss:3.795 lr:0.0002652 epoch_Time:28661.0min: [2024-01-02 19:37:42,394][model8_pretrain.py][INFO] Epoch:[0/2](18800/4588595) loss:3.546 lr:0.0002652 epoch_Time:28660.0min: [2024-01-02 19:37:42,394][model8_pretrain.py][INFO] Epoch:[0/2](18800/4588595) loss:3.562 lr:0.0002652 epoch_Time:28661.0min: [2024-01-02 19:37:42,394][model8_pretrain.py][INFO] Epoch:[0/2](18800/4588595) loss:3.835 lr:0.0002652 epoch_Time:28661.0min: [2024-01-02 19:37:42,394][model8_pretrain.py][INFO] Epoch:[0/2](18800/4588595) loss:3.509 lr:0.0002652 epoch_Time:28661.0min: [2024-01-02 19:37:42,394][model8_pretrain.py][INFO] Epoch:[0/2](18800/4588595) loss:3.104 lr:0.0002652 epoch_Time:28661.0min: [2024-01-02 19:37:42,394][model8_pretrain.py][INFO] Epoch:[0/2](18800/4588595) loss:3.289 lr:0.0002652 epoch_Time:28660.0min: [2024-01-02 19:37:42,394][model8_pretrain.py][INFO] Epoch:[0/2](18800/4588595) loss:4.028 lr:0.0002652 epoch_Time:28661.0min: [2024-01-02 19:38:22,965][model8_pretrain.py][INFO] Epoch:[0/2](18900/4588595) loss:3.613 lr:0.0002648 epoch_Time:28671.0min: [2024-01-02 19:38:22,965][model8_pretrain.py][INFO] Epoch:[0/2](18900/4588595) loss:3.589 lr:0.0002648 epoch_Time:28671.0min: [2024-01-02 19:38:22,965][model8_pretrain.py][INFO] Epoch:[0/2](18900/4588595) loss:3.316 lr:0.0002648 epoch_Time:28672.0min: [2024-01-02 19:38:22,965][model8_pretrain.py][INFO] Epoch:[0/2](18900/4588595) loss:3.442 lr:0.0002648 epoch_Time:28671.0min: [2024-01-02 19:38:22,965][model8_pretrain.py][INFO] Epoch:[0/2](18900/4588595) loss:3.883 lr:0.0002648 
epoch_Time:28672.0min: [2024-01-02 19:38:22,965][model8_pretrain.py][INFO] Epoch:[0/2](18900/4588595) loss:3.477 lr:0.0002648 epoch_Time:28671.0min: [2024-01-02 19:38:22,965][model8_pretrain.py][INFO] Epoch:[0/2](18900/4588595) loss:3.326 lr:0.0002648 epoch_Time:28671.0min: [2024-01-02 19:38:22,965][model8_pretrain.py][INFO] Epoch:[0/2](18900/4588595) loss:3.396 lr:0.0002648 epoch_Time:28671.0min: [2024-01-02 19:38:59,883][model8_pretrain.py][INFO] Epoch:[0/2](19000/4588595) loss:3.585 lr:0.0002644 epoch_Time:28668.0min: [2024-01-02 19:38:59,883][model8_pretrain.py][INFO] Epoch:[0/2](19000/4588595) loss:3.535 lr:0.0002644 epoch_Time:28668.0min: [2024-01-02 19:38:59,883][model8_pretrain.py][INFO] Epoch:[0/2](19000/4588595) loss:3.345 lr:0.0002644 epoch_Time:28667.0min: [2024-01-02 19:38:59,883][model8_pretrain.py][INFO] Epoch:[0/2](19000/4588595) loss:3.683 lr:0.0002644 epoch_Time:28667.0min: [2024-01-02 19:38:59,883][model8_pretrain.py][INFO] Epoch:[0/2](19000/4588595) loss:3.709 lr:0.0002644 epoch_Time:28668.0min: [2024-01-02 19:38:59,884][model8_pretrain.py][INFO] Epoch:[0/2](19000/4588595) loss:3.820 lr:0.0002644 epoch_Time:28668.0min: [2024-01-02 19:38:59,884][model8_pretrain.py][INFO] Epoch:[0/2](19000/4588595) loss:3.098 lr:0.0002644 epoch_Time:28668.0min: [2024-01-02 19:38:59,884][model8_pretrain.py][INFO] Epoch:[0/2](19000/4588595) loss:3.542 lr:0.0002644 epoch_Time:28668.0min: [2024-01-02 19:39:36,826][model8_pretrain.py][INFO] Epoch:[0/2](19100/4588595) loss:3.517 lr:0.0002640 epoch_Time:28665.0min: [2024-01-02 19:39:36,826][model8_pretrain.py][INFO] Epoch:[0/2](19100/4588595) loss:3.871 lr:0.0002640 epoch_Time:28665.0min: [2024-01-02 19:39:36,826][model8_pretrain.py][INFO] Epoch:[0/2](19100/4588595) loss:2.976 lr:0.0002640 epoch_Time:28665.0min: [2024-01-02 19:39:36,826][model8_pretrain.py][INFO] Epoch:[0/2](19100/4588595) loss:3.364 lr:0.0002640 epoch_Time:28665.0min: [2024-01-02 19:39:36,826][model8_pretrain.py][INFO] Epoch:[0/2](19100/4588595) 
loss:3.226 lr:0.0002640 epoch_Time:28665.0min: [2024-01-02 19:39:36,826][model8_pretrain.py][INFO] Epoch:[0/2](19100/4588595) loss:2.969 lr:0.0002640 epoch_Time:28665.0min: [2024-01-02 19:39:36,826][model8_pretrain.py][INFO] Epoch:[0/2](19100/4588595) loss:3.490 lr:0.0002640 epoch_Time:28665.0min: [2024-01-02 19:39:36,826][model8_pretrain.py][INFO] Epoch:[0/2](19100/4588595) loss:3.694 lr:0.0002640 epoch_Time:28665.0min: [2024-01-02 19:40:13,766][model8_pretrain.py][INFO] Epoch:[0/2](19200/4588595) loss:3.532 lr:0.0002637 epoch_Time:28661.0min: [2024-01-02 19:40:13,766][model8_pretrain.py][INFO] Epoch:[0/2](19200/4588595) loss:3.825 lr:0.0002637 epoch_Time:28661.0min: [2024-01-02 19:40:13,766][model8_pretrain.py][INFO] Epoch:[0/2](19200/4588595) loss:3.105 lr:0.0002637 epoch_Time:28661.0min: [2024-01-02 19:40:13,766][model8_pretrain.py][INFO] Epoch:[0/2](19200/4588595) loss:3.646 lr:0.0002637 epoch_Time:28661.0min: [2024-01-02 19:40:13,766][model8_pretrain.py][INFO] Epoch:[0/2](19200/4588595) loss:3.621 lr:0.0002637 epoch_Time:28661.0min: [2024-01-02 19:40:13,766][model8_pretrain.py][INFO] Epoch:[0/2](19200/4588595) loss:3.676 lr:0.0002637 epoch_Time:28661.0min: [2024-01-02 19:40:13,766][model8_pretrain.py][INFO] Epoch:[0/2](19200/4588595) loss:3.668 lr:0.0002637 epoch_Time:28661.0min: [2024-01-02 19:40:13,767][model8_pretrain.py][INFO] Epoch:[0/2](19200/4588595) loss:3.402 lr:0.0002637 epoch_Time:28661.0min: [2024-01-02 19:40:50,714][model8_pretrain.py][INFO] Epoch:[0/2](19300/4588595) loss:3.541 lr:0.0002633 epoch_Time:28657.0min: [2024-01-02 19:40:50,714][model8_pretrain.py][INFO] Epoch:[0/2](19300/4588595) loss:3.790 lr:0.0002633 epoch_Time:28657.0min: [2024-01-02 19:40:50,715][model8_pretrain.py][INFO] Epoch:[0/2](19300/4588595) loss:3.378 lr:0.0002633 epoch_Time:28657.0min: [2024-01-02 19:40:50,715][model8_pretrain.py][INFO] Epoch:[0/2](19300/4588595) loss:3.592 lr:0.0002633 epoch_Time:28657.0min: [2024-01-02 19:40:50,715][model8_pretrain.py][INFO] 
Epoch:[0/2](19300/4588595) loss:3.663 lr:0.0002633 epoch_Time:28657.0min: [2024-01-02 19:40:50,715][model8_pretrain.py][INFO] Epoch:[0/2](19300/4588595) loss:3.696 lr:0.0002633 epoch_Time:28657.0min: [2024-01-02 19:40:50,715][model8_pretrain.py][INFO] Epoch:[0/2](19300/4588595) loss:2.472 lr:0.0002633 epoch_Time:28657.0min: [2024-01-02 19:40:50,715][model8_pretrain.py][INFO] Epoch:[0/2](19300/4588595) loss:3.924 lr:0.0002633 epoch_Time:28657.0min: [2024-01-02 19:41:27,662][model8_pretrain.py][INFO] Epoch:[0/2](19400/4588595) loss:3.159 lr:0.0002629 epoch_Time:28654.0min: [2024-01-02 19:41:27,662][model8_pretrain.py][INFO] Epoch:[0/2](19400/4588595) loss:3.525 lr:0.0002629 epoch_Time:28655.0min: [2024-01-02 19:41:27,662][model8_pretrain.py][INFO] Epoch:[0/2](19400/4588595) loss:3.025 lr:0.0002629 epoch_Time:28655.0min: [2024-01-02 19:41:27,662][model8_pretrain.py][INFO] Epoch:[0/2](19400/4588595) loss:3.126 lr:0.0002629 epoch_Time:28655.0min: [2024-01-02 19:41:27,662][model8_pretrain.py][INFO] Epoch:[0/2](19400/4588595) loss:3.583 lr:0.0002629 epoch_Time:28655.0min: [2024-01-02 19:41:27,662][model8_pretrain.py][INFO] Epoch:[0/2](19400/4588595) loss:3.370 lr:0.0002629 epoch_Time:28655.0min: [2024-01-02 19:41:27,662][model8_pretrain.py][INFO] Epoch:[0/2](19400/4588595) loss:3.633 lr:0.0002629 epoch_Time:28654.0min: [2024-01-02 19:41:27,663][model8_pretrain.py][INFO] Epoch:[0/2](19400/4588595) loss:3.391 lr:0.0002629 epoch_Time:28655.0min: [2024-01-02 19:42:04,619][model8_pretrain.py][INFO] Epoch:[0/2](19500/4588595) loss:3.275 lr:0.0002625 epoch_Time:28651.0min: [2024-01-02 19:42:04,619][model8_pretrain.py][INFO] Epoch:[0/2](19500/4588595) loss:3.529 lr:0.0002625 epoch_Time:28651.0min: [2024-01-02 19:42:04,619][model8_pretrain.py][INFO] Epoch:[0/2](19500/4588595) loss:3.431 lr:0.0002625 epoch_Time:28651.0min: [2024-01-02 19:42:04,619][model8_pretrain.py][INFO] Epoch:[0/2](19500/4588595) loss:3.562 lr:0.0002625 epoch_Time:28651.0min: [2024-01-02 
19:42:04,619][model8_pretrain.py][INFO] Epoch:[0/2](19500/4588595) loss:3.433 lr:0.0002625 epoch_Time:28651.0min: [2024-01-02 19:42:04,619][model8_pretrain.py][INFO] Epoch:[0/2](19500/4588595) loss:3.641 lr:0.0002625 epoch_Time:28651.0min: [2024-01-02 19:42:04,619][model8_pretrain.py][INFO] Epoch:[0/2](19500/4588595) loss:3.467 lr:0.0002625 epoch_Time:28651.0min: [2024-01-02 19:42:04,619][model8_pretrain.py][INFO] Epoch:[0/2](19500/4588595) loss:3.703 lr:0.0002625 epoch_Time:28651.0min: [2024-01-02 19:42:41,590][model8_pretrain.py][INFO] Epoch:[0/2](19600/4588595) loss:3.764 lr:0.0002621 epoch_Time:28648.0min: [2024-01-02 19:42:41,590][model8_pretrain.py][INFO] Epoch:[0/2](19600/4588595) loss:3.387 lr:0.0002621 epoch_Time:28648.0min: [2024-01-02 19:42:41,590][model8_pretrain.py][INFO] Epoch:[0/2](19600/4588595) loss:3.631 lr:0.0002621 epoch_Time:28648.0min: [2024-01-02 19:42:41,590][model8_pretrain.py][INFO] Epoch:[0/2](19600/4588595) loss:3.152 lr:0.0002621 epoch_Time:28648.0min: [2024-01-02 19:42:41,590][model8_pretrain.py][INFO] Epoch:[0/2](19600/4588595) loss:3.417 lr:0.0002621 epoch_Time:28648.0min: [2024-01-02 19:42:41,590][model8_pretrain.py][INFO] Epoch:[0/2](19600/4588595) loss:3.339 lr:0.0002621 epoch_Time:28648.0min: [2024-01-02 19:42:41,590][model8_pretrain.py][INFO] Epoch:[0/2](19600/4588595) loss:3.551 lr:0.0002621 epoch_Time:28648.0min: [2024-01-02 19:42:41,590][model8_pretrain.py][INFO] Epoch:[0/2](19600/4588595) loss:3.519 lr:0.0002621 epoch_Time:28648.0min: [2024-01-02 19:43:22,147][model8_pretrain.py][INFO] Epoch:[0/2](19700/4588595) loss:3.567 lr:0.0002617 epoch_Time:28659.0min: [2024-01-02 19:43:22,147][model8_pretrain.py][INFO] Epoch:[0/2](19700/4588595) loss:3.651 lr:0.0002617 epoch_Time:28659.0min: [2024-01-02 19:43:22,147][model8_pretrain.py][INFO] Epoch:[0/2](19700/4588595) loss:3.770 lr:0.0002617 epoch_Time:28659.0min: [2024-01-02 19:43:22,147][model8_pretrain.py][INFO] Epoch:[0/2](19700/4588595) loss:3.170 lr:0.0002617 
epoch_Time:28659.0min: [2024-01-02 19:43:22,147][model8_pretrain.py][INFO] Epoch:[0/2](19700/4588595) loss:3.270 lr:0.0002617 epoch_Time:28659.0min: [2024-01-02 19:43:22,147][model8_pretrain.py][INFO] Epoch:[0/2](19700/4588595) loss:3.474 lr:0.0002617 epoch_Time:28659.0min: [2024-01-02 19:43:22,147][model8_pretrain.py][INFO] Epoch:[0/2](19700/4588595) loss:3.762 lr:0.0002617 epoch_Time:28659.0min: [2024-01-02 19:43:22,147][model8_pretrain.py][INFO] Epoch:[0/2](19700/4588595) loss:3.139 lr:0.0002617 epoch_Time:28659.0min: [2024-01-02 19:43:59,101][model8_pretrain.py][INFO] Epoch:[0/2](19800/4588595) loss:3.749 lr:0.0002613 epoch_Time:28655.0min: [2024-01-02 19:43:59,101][model8_pretrain.py][INFO] Epoch:[0/2](19800/4588595) loss:3.774 lr:0.0002613 epoch_Time:28655.0min: [2024-01-02 19:43:59,101][model8_pretrain.py][INFO] Epoch:[0/2](19800/4588595) loss:3.755 lr:0.0002613 epoch_Time:28655.0min: [2024-01-02 19:43:59,101][model8_pretrain.py][INFO] Epoch:[0/2](19800/4588595) loss:3.710 lr:0.0002613 epoch_Time:28655.0min: [2024-01-02 19:43:59,101][model8_pretrain.py][INFO] Epoch:[0/2](19800/4588595) loss:3.431 lr:0.0002613 epoch_Time:28655.0min: [2024-01-02 19:43:59,101][model8_pretrain.py][INFO] Epoch:[0/2](19800/4588595) loss:3.449 lr:0.0002613 epoch_Time:28655.0min: [2024-01-02 19:43:59,101][model8_pretrain.py][INFO] Epoch:[0/2](19800/4588595) loss:3.588 lr:0.0002613 epoch_Time:28655.0min: [2024-01-02 19:43:59,101][model8_pretrain.py][INFO] Epoch:[0/2](19800/4588595) loss:3.389 lr:0.0002613 epoch_Time:28655.0min: [2024-01-02 19:44:36,052][model8_pretrain.py][INFO] Epoch:[0/2](19900/4588595) loss:3.847 lr:0.0002609 epoch_Time:28652.0min: [2024-01-02 19:44:36,053][model8_pretrain.py][INFO] Epoch:[0/2](19900/4588595) loss:3.423 lr:0.0002609 epoch_Time:28652.0min: [2024-01-02 19:44:36,053][model8_pretrain.py][INFO] Epoch:[0/2](19900/4588595) loss:3.356 lr:0.0002609 epoch_Time:28653.0min: [2024-01-02 19:44:36,053][model8_pretrain.py][INFO] Epoch:[0/2](19900/4588595) 
loss:3.843 lr:0.0002609 epoch_Time:28653.0min: [2024-01-02 19:44:36,053][model8_pretrain.py][INFO] Epoch:[0/2](19900/4588595) loss:3.424 lr:0.0002609 epoch_Time:28653.0min: [2024-01-02 19:44:36,053][model8_pretrain.py][INFO] Epoch:[0/2](19900/4588595) loss:3.346 lr:0.0002609 epoch_Time:28653.0min: [2024-01-02 19:44:36,053][model8_pretrain.py][INFO] Epoch:[0/2](19900/4588595) loss:3.451 lr:0.0002609 epoch_Time:28653.0min: [2024-01-02 19:44:36,053][model8_pretrain.py][INFO] Epoch:[0/2](19900/4588595) loss:3.804 lr:0.0002609 epoch_Time:28653.0min: [2024-01-02 19:45:12,999][model8_pretrain.py][INFO] Epoch:[0/2](20000/4588595) loss:3.419 lr:0.0002605 epoch_Time:28649.0min: [2024-01-02 19:45:12,999][model8_pretrain.py][INFO] Epoch:[0/2](20000/4588595) loss:3.793 lr:0.0002605 epoch_Time:28649.0min: [2024-01-02 19:45:12,999][model8_pretrain.py][INFO] Epoch:[0/2](20000/4588595) loss:3.169 lr:0.0002605 epoch_Time:28649.0min: [2024-01-02 19:45:12,999][model8_pretrain.py][INFO] Epoch:[0/2](20000/4588595) loss:3.302 lr:0.0002605 epoch_Time:28649.0min: [2024-01-02 19:45:12,999][model8_pretrain.py][INFO] Epoch:[0/2](20000/4588595) loss:3.556 lr:0.0002605 epoch_Time:28649.0min: [2024-01-02 19:45:12,999][model8_pretrain.py][INFO] Epoch:[0/2](20000/4588595) loss:3.891 lr:0.0002605 epoch_Time:28649.0min: [2024-01-02 19:45:13,000][model8_pretrain.py][INFO] Epoch:[0/2](20000/4588595) loss:3.375 lr:0.0002605 epoch_Time:28649.0min: [2024-01-02 19:45:13,000][model8_pretrain.py][INFO] Epoch:[0/2](20000/4588595) loss:3.384 lr:0.0002605 epoch_Time:28649.0min: [2024-01-02 19:45:49,946][model8_pretrain.py][INFO] Epoch:[0/2](20100/4588595) loss:3.024 lr:0.0002601 epoch_Time:28645.0min: [2024-01-02 19:45:49,946][model8_pretrain.py][INFO] Epoch:[0/2](20100/4588595) loss:3.672 lr:0.0002601 epoch_Time:28645.0min: [2024-01-02 19:45:49,946][model8_pretrain.py][INFO] Epoch:[0/2](20100/4588595) loss:3.669 lr:0.0002601 epoch_Time:28645.0min: [2024-01-02 19:45:49,946][model8_pretrain.py][INFO] 
Epoch:[0/2](20100/4588595) loss:3.824 lr:0.0002601 epoch_Time:28645.0min: [2024-01-02 19:45:49,946][model8_pretrain.py][INFO] Epoch:[0/2](20100/4588595) loss:4.071 lr:0.0002601 epoch_Time:28645.0min: [2024-01-02 19:45:49,946][model8_pretrain.py][INFO] Epoch:[0/2](20100/4588595) loss:3.961 lr:0.0002601 epoch_Time:28645.0min: [2024-01-02 19:45:49,946][model8_pretrain.py][INFO] Epoch:[0/2](20100/4588595) loss:2.663 lr:0.0002601 epoch_Time:28645.0min: [2024-01-02 19:45:49,947][model8_pretrain.py][INFO] Epoch:[0/2](20100/4588595) loss:3.966 lr:0.0002601 epoch_Time:28645.0min: [2024-01-02 19:46:26,908][model8_pretrain.py][INFO] Epoch:[0/2](20200/4588595) loss:3.009 lr:0.0002597 epoch_Time:28643.0min: [2024-01-02 19:46:26,908][model8_pretrain.py][INFO] Epoch:[0/2](20200/4588595) loss:3.025 lr:0.0002597 epoch_Time:28643.0min: [2024-01-02 19:46:26,908][model8_pretrain.py][INFO] Epoch:[0/2](20200/4588595) loss:3.729 lr:0.0002597 epoch_Time:28643.0min: [2024-01-02 19:46:26,908][model8_pretrain.py][INFO] Epoch:[0/2](20200/4588595) loss:2.972 lr:0.0002597 epoch_Time:28643.0min: [2024-01-02 19:46:26,908][model8_pretrain.py][INFO] Epoch:[0/2](20200/4588595) loss:3.504 lr:0.0002597 epoch_Time:28643.0min: [2024-01-02 19:46:26,908][model8_pretrain.py][INFO] Epoch:[0/2](20200/4588595) loss:3.389 lr:0.0002597 epoch_Time:28643.0min: [2024-01-02 19:46:26,908][model8_pretrain.py][INFO] Epoch:[0/2](20200/4588595) loss:3.276 lr:0.0002597 epoch_Time:28643.0min: [2024-01-02 19:46:26,908][model8_pretrain.py][INFO] Epoch:[0/2](20200/4588595) loss:3.024 lr:0.0002597 epoch_Time:28643.0min: [2024-01-02 19:47:03,866][model8_pretrain.py][INFO] Epoch:[0/2](20300/4588595) loss:2.927 lr:0.0002593 epoch_Time:28639.0min: [2024-01-02 19:47:03,866][model8_pretrain.py][INFO] Epoch:[0/2](20300/4588595) loss:3.739 lr:0.0002593 epoch_Time:28639.0min: [2024-01-02 19:47:03,866][model8_pretrain.py][INFO] Epoch:[0/2](20300/4588595) loss:3.324 lr:0.0002593 epoch_Time:28639.0min: [2024-01-02 
19:47:03,866][model8_pretrain.py][INFO] Epoch:[0/2](20300/4588595) loss:3.283 lr:0.0002593 epoch_Time:28639.0min: [2024-01-02 19:47:03,866][model8_pretrain.py][INFO] Epoch:[0/2](20300/4588595) loss:3.499 lr:0.0002593 epoch_Time:28639.0min: [2024-01-02 19:47:03,866][model8_pretrain.py][INFO] Epoch:[0/2](20300/4588595) loss:3.607 lr:0.0002593 epoch_Time:28639.0min: [2024-01-02 19:47:03,866][model8_pretrain.py][INFO] Epoch:[0/2](20300/4588595) loss:3.119 lr:0.0002593 epoch_Time:28639.0min: [2024-01-02 19:47:03,866][model8_pretrain.py][INFO] Epoch:[0/2](20300/4588595) loss:3.641 lr:0.0002593 epoch_Time:28639.0min: [2024-01-02 19:47:40,821][model8_pretrain.py][INFO] Epoch:[0/2](20400/4588595) loss:3.773 lr:0.0002589 epoch_Time:28637.0min: [2024-01-02 19:47:40,821][model8_pretrain.py][INFO] Epoch:[0/2](20400/4588595) loss:3.658 lr:0.0002589 epoch_Time:28637.0min: [2024-01-02 19:47:40,821][model8_pretrain.py][INFO] Epoch:[0/2](20400/4588595) loss:3.527 lr:0.0002589 epoch_Time:28637.0min: [2024-01-02 19:47:40,821][model8_pretrain.py][INFO] Epoch:[0/2](20400/4588595) loss:3.459 lr:0.0002589 epoch_Time:28637.0min: [2024-01-02 19:47:40,821][model8_pretrain.py][INFO] Epoch:[0/2](20400/4588595) loss:3.495 lr:0.0002589 epoch_Time:28637.0min: [2024-01-02 19:47:40,821][model8_pretrain.py][INFO] Epoch:[0/2](20400/4588595) loss:3.450 lr:0.0002589 epoch_Time:28637.0min: [2024-01-02 19:47:40,822][model8_pretrain.py][INFO] Epoch:[0/2](20400/4588595) loss:3.677 lr:0.0002589 epoch_Time:28637.0min: [2024-01-02 19:47:40,822][model8_pretrain.py][INFO] Epoch:[0/2](20400/4588595) loss:3.529 lr:0.0002589 epoch_Time:28637.0min: [2024-01-02 19:48:21,503][model8_pretrain.py][INFO] Epoch:[0/2](20500/4588595) loss:3.633 lr:0.0002585 epoch_Time:28647.0min: [2024-01-02 19:48:21,503][model8_pretrain.py][INFO] Epoch:[0/2](20500/4588595) loss:3.397 lr:0.0002585 epoch_Time:28647.0min: [2024-01-02 19:48:21,503][model8_pretrain.py][INFO] Epoch:[0/2](20500/4588595) loss:3.424 lr:0.0002585 
epoch_Time:28647.0min: [2024-01-02 19:48:21,503][model8_pretrain.py][INFO] Epoch:[0/2](20500/4588595) loss:3.063 lr:0.0002585 epoch_Time:28647.0min: [2024-01-02 19:48:21,503][model8_pretrain.py][INFO] Epoch:[0/2](20500/4588595) loss:3.680 lr:0.0002585 epoch_Time:28647.0min: [2024-01-02 19:48:21,503][model8_pretrain.py][INFO] Epoch:[0/2](20500/4588595) loss:3.104 lr:0.0002585 epoch_Time:28647.0min: [2024-01-02 19:48:21,503][model8_pretrain.py][INFO] Epoch:[0/2](20500/4588595) loss:3.332 lr:0.0002585 epoch_Time:28647.0min: [2024-01-02 19:48:21,503][model8_pretrain.py][INFO] Epoch:[0/2](20500/4588595) loss:3.681 lr:0.0002585 epoch_Time:28647.0min: [2024-01-02 19:48:58,434][model8_pretrain.py][INFO] Epoch:[0/2](20600/4588595) loss:3.990 lr:0.0002581 epoch_Time:28644.0min: [2024-01-02 19:48:58,434][model8_pretrain.py][INFO] Epoch:[0/2](20600/4588595) loss:3.093 lr:0.0002581 epoch_Time:28644.0min: [2024-01-02 19:48:58,434][model8_pretrain.py][INFO] Epoch:[0/2](20600/4588595) loss:3.660 lr:0.0002581 epoch_Time:28644.0min: [2024-01-02 19:48:58,434][model8_pretrain.py][INFO] Epoch:[0/2](20600/4588595) loss:3.598 lr:0.0002581 epoch_Time:28644.0min: [2024-01-02 19:48:58,434][model8_pretrain.py][INFO] Epoch:[0/2](20600/4588595) loss:3.492 lr:0.0002581 epoch_Time:28644.0min: [2024-01-02 19:48:58,434][model8_pretrain.py][INFO] Epoch:[0/2](20600/4588595) loss:3.387 lr:0.0002581 epoch_Time:28644.0min: [2024-01-02 19:48:58,434][model8_pretrain.py][INFO] Epoch:[0/2](20600/4588595) loss:3.633 lr:0.0002581 epoch_Time:28644.0min: [2024-01-02 19:48:58,434][model8_pretrain.py][INFO] Epoch:[0/2](20600/4588595) loss:3.766 lr:0.0002581 epoch_Time:28644.0min: [2024-01-02 19:49:35,369][model8_pretrain.py][INFO] Epoch:[0/2](20700/4588595) loss:3.790 lr:0.0002577 epoch_Time:28641.0min: [2024-01-02 19:49:35,369][model8_pretrain.py][INFO] Epoch:[0/2](20700/4588595) loss:3.411 lr:0.0002577 epoch_Time:28641.0min: [2024-01-02 19:49:35,369][model8_pretrain.py][INFO] Epoch:[0/2](20700/4588595) 
loss:3.288 lr:0.0002577 epoch_Time:28641.0min: [2024-01-02 19:49:35,369][model8_pretrain.py][INFO] Epoch:[0/2](20700/4588595) loss:3.327 lr:0.0002577 epoch_Time:28641.0min: [2024-01-02 19:49:35,369][model8_pretrain.py][INFO] Epoch:[0/2](20700/4588595) loss:3.653 lr:0.0002577 epoch_Time:28641.0min: [2024-01-02 19:49:35,369][model8_pretrain.py][INFO] Epoch:[0/2](20700/4588595) loss:3.480 lr:0.0002577 epoch_Time:28641.0min: [2024-01-02 19:49:35,369][model8_pretrain.py][INFO] Epoch:[0/2](20700/4588595) loss:3.864 lr:0.0002577 epoch_Time:28641.0min: [2024-01-02 19:49:35,369][model8_pretrain.py][INFO] Epoch:[0/2](20700/4588595) loss:2.905 lr:0.0002577 epoch_Time:28641.0min: [2024-01-02 19:50:12,312][model8_pretrain.py][INFO] Epoch:[0/2](20800/4588595) loss:3.107 lr:0.0002573 epoch_Time:28638.0min: [2024-01-02 19:50:12,312][model8_pretrain.py][INFO] Epoch:[0/2](20800/4588595) loss:3.607 lr:0.0002573 epoch_Time:28638.0min: [2024-01-02 19:50:12,312][model8_pretrain.py][INFO] Epoch:[0/2](20800/4588595) loss:3.745 lr:0.0002573 epoch_Time:28638.0min: [2024-01-02 19:50:12,312][model8_pretrain.py][INFO] Epoch:[0/2](20800/4588595) loss:3.553 lr:0.0002573 epoch_Time:28638.0min: [2024-01-02 19:50:12,312][model8_pretrain.py][INFO] Epoch:[0/2](20800/4588595) loss:3.542 lr:0.0002573 epoch_Time:28638.0min: [2024-01-02 19:50:12,312][model8_pretrain.py][INFO] Epoch:[0/2](20800/4588595) loss:3.583 lr:0.0002573 epoch_Time:28638.0min: [2024-01-02 19:50:12,312][model8_pretrain.py][INFO] Epoch:[0/2](20800/4588595) loss:2.963 lr:0.0002573 epoch_Time:28638.0min: [2024-01-02 19:50:12,312][model8_pretrain.py][INFO] Epoch:[0/2](20800/4588595) loss:4.226 lr:0.0002573 epoch_Time:28638.0min: [2024-01-02 19:50:49,263][model8_pretrain.py][INFO] Epoch:[0/2](20900/4588595) loss:2.851 lr:0.0002569 epoch_Time:28634.0min: [2024-01-02 19:50:49,263][model8_pretrain.py][INFO] Epoch:[0/2](20900/4588595) loss:3.810 lr:0.0002569 epoch_Time:28634.0min: [2024-01-02 19:50:49,263][model8_pretrain.py][INFO] 
Epoch:[0/2](20900/4588595) loss:3.319 lr:0.0002569 epoch_Time:28634.0min: [2024-01-02 19:50:49,263][model8_pretrain.py][INFO] Epoch:[0/2](20900/4588595) loss:2.881 lr:0.0002569 epoch_Time:28634.0min: [2024-01-02 19:50:49,263][model8_pretrain.py][INFO] Epoch:[0/2](20900/4588595) loss:3.601 lr:0.0002569 epoch_Time:28634.0min: [2024-01-02 19:50:49,263][model8_pretrain.py][INFO] Epoch:[0/2](20900/4588595) loss:3.225 lr:0.0002569 epoch_Time:28634.0min: [2024-01-02 19:50:49,263][model8_pretrain.py][INFO] Epoch:[0/2](20900/4588595) loss:3.608 lr:0.0002569 epoch_Time:28634.0min: [2024-01-02 19:50:49,263][model8_pretrain.py][INFO] Epoch:[0/2](20900/4588595) loss:3.380 lr:0.0002569 epoch_Time:28634.0min: [2024-01-02 19:51:26,209][model8_pretrain.py][INFO] Epoch:[0/2](21000/4588595) loss:3.418 lr:0.0002565 epoch_Time:28632.0min: [2024-01-02 19:51:26,209][model8_pretrain.py][INFO] Epoch:[0/2](21000/4588595) loss:3.699 lr:0.0002565 epoch_Time:28632.0min: [2024-01-02 19:51:26,209][model8_pretrain.py][INFO] Epoch:[0/2](21000/4588595) loss:3.418 lr:0.0002565 epoch_Time:28632.0min: [2024-01-02 19:51:26,209][model8_pretrain.py][INFO] Epoch:[0/2](21000/4588595) loss:3.648 lr:0.0002565 epoch_Time:28632.0min: [2024-01-02 19:51:26,209][model8_pretrain.py][INFO] Epoch:[0/2](21000/4588595) loss:3.387 lr:0.0002565 epoch_Time:28632.0min: [2024-01-02 19:51:26,209][model8_pretrain.py][INFO] Epoch:[0/2](21000/4588595) loss:3.558 lr:0.0002565 epoch_Time:28632.0min: [2024-01-02 19:51:26,209][model8_pretrain.py][INFO] Epoch:[0/2](21000/4588595) loss:3.512 lr:0.0002565 epoch_Time:28632.0min: [2024-01-02 19:51:26,210][model8_pretrain.py][INFO] Epoch:[0/2](21000/4588595) loss:3.512 lr:0.0002565 epoch_Time:28632.0min: [2024-01-02 19:52:03,164][model8_pretrain.py][INFO] Epoch:[0/2](21100/4588595) loss:3.389 lr:0.0002561 epoch_Time:28629.0min: [2024-01-02 19:52:03,164][model8_pretrain.py][INFO] Epoch:[0/2](21100/4588595) loss:3.549 lr:0.0002561 epoch_Time:28628.0min: [2024-01-02 
19:52:03,164][model8_pretrain.py][INFO] Epoch:[0/2](21100/4588595) loss:4.120 lr:0.0002561 epoch_Time:28628.0min: [2024-01-02 19:52:03,164][model8_pretrain.py][INFO] Epoch:[0/2](21100/4588595) loss:3.872 lr:0.0002561 epoch_Time:28629.0min: [2024-01-02 19:52:03,164][model8_pretrain.py][INFO] Epoch:[0/2](21100/4588595) loss:3.944 lr:0.0002561 epoch_Time:28628.0min: [2024-01-02 19:52:03,164][model8_pretrain.py][INFO] Epoch:[0/2](21100/4588595) loss:3.319 lr:0.0002561 epoch_Time:28628.0min: [2024-01-02 19:52:03,164][model8_pretrain.py][INFO] Epoch:[0/2](21100/4588595) loss:3.258 lr:0.0002561 epoch_Time:28628.0min: [2024-01-02 19:52:03,164][model8_pretrain.py][INFO] Epoch:[0/2](21100/4588595) loss:2.945 lr:0.0002561 epoch_Time:28628.0min: [2024-01-02 19:52:40,109][model8_pretrain.py][INFO] Epoch:[0/2](21200/4588595) loss:3.772 lr:0.0002557 epoch_Time:28626.0min: [2024-01-02 19:52:40,109][model8_pretrain.py][INFO] Epoch:[0/2](21200/4588595) loss:3.738 lr:0.0002557 epoch_Time:28626.0min: [2024-01-02 19:52:40,109][model8_pretrain.py][INFO] Epoch:[0/2](21200/4588595) loss:3.537 lr:0.0002557 epoch_Time:28626.0min: [2024-01-02 19:52:40,109][model8_pretrain.py][INFO] Epoch:[0/2](21200/4588595) loss:3.675 lr:0.0002557 epoch_Time:28626.0min: [2024-01-02 19:52:40,109][model8_pretrain.py][INFO] Epoch:[0/2](21200/4588595) loss:3.841 lr:0.0002557 epoch_Time:28626.0min: [2024-01-02 19:52:40,109][model8_pretrain.py][INFO] Epoch:[0/2](21200/4588595) loss:3.174 lr:0.0002557 epoch_Time:28626.0min: [2024-01-02 19:52:40,109][model8_pretrain.py][INFO] Epoch:[0/2](21200/4588595) loss:3.230 lr:0.0002557 epoch_Time:28626.0min: [2024-01-02 19:52:40,109][model8_pretrain.py][INFO] Epoch:[0/2](21200/4588595) loss:3.207 lr:0.0002557 epoch_Time:28626.0min: [2024-01-02 19:53:20,622][model8_pretrain.py][INFO] Epoch:[0/2](21300/4588595) loss:3.308 lr:0.0002553 epoch_Time:28635.0min: [2024-01-02 19:53:20,622][model8_pretrain.py][INFO] Epoch:[0/2](21300/4588595) loss:3.057 lr:0.0002553 
epoch_Time:28636.0min: [2024-01-02 19:53:20,622][model8_pretrain.py][INFO] Epoch:[0/2](21300/4588595) loss:3.700 lr:0.0002553 epoch_Time:28635.0min: [2024-01-02 19:53:20,622][model8_pretrain.py][INFO] Epoch:[0/2](21300/4588595) loss:3.139 lr:0.0002553 epoch_Time:28635.0min: [2024-01-02 19:53:20,622][model8_pretrain.py][INFO] Epoch:[0/2](21300/4588595) loss:4.015 lr:0.0002553 epoch_Time:28636.0min: [2024-01-02 19:53:20,622][model8_pretrain.py][INFO] Epoch:[0/2](21300/4588595) loss:3.404 lr:0.0002553 epoch_Time:28636.0min: [2024-01-02 19:53:20,622][model8_pretrain.py][INFO] Epoch:[0/2](21300/4588595) loss:3.512 lr:0.0002553 epoch_Time:28636.0min: [2024-01-02 19:53:20,622][model8_pretrain.py][INFO] Epoch:[0/2](21300/4588595) loss:3.312 lr:0.0002553 epoch_Time:28636.0min: [2024-01-02 19:53:57,563][model8_pretrain.py][INFO] Epoch:[0/2](21400/4588595) loss:3.799 lr:0.0002548 epoch_Time:28632.0min: [2024-01-02 19:53:57,563][model8_pretrain.py][INFO] Epoch:[0/2](21400/4588595) loss:3.029 lr:0.0002548 epoch_Time:28632.0min: [2024-01-02 19:53:57,563][model8_pretrain.py][INFO] Epoch:[0/2](21400/4588595) loss:3.232 lr:0.0002548 epoch_Time:28632.0min: [2024-01-02 19:53:57,563][model8_pretrain.py][INFO] Epoch:[0/2](21400/4588595) loss:3.338 lr:0.0002548 epoch_Time:28632.0min: [2024-01-02 19:53:57,563][model8_pretrain.py][INFO] Epoch:[0/2](21400/4588595) loss:3.527 lr:0.0002548 epoch_Time:28632.0min: [2024-01-02 19:53:57,563][model8_pretrain.py][INFO] Epoch:[0/2](21400/4588595) loss:3.630 lr:0.0002548 epoch_Time:28632.0min: [2024-01-02 19:53:57,563][model8_pretrain.py][INFO] Epoch:[0/2](21400/4588595) loss:3.251 lr:0.0002548 epoch_Time:28632.0min: [2024-01-02 19:53:57,564][model8_pretrain.py][INFO] Epoch:[0/2](21400/4588595) loss:2.912 lr:0.0002548 epoch_Time:28632.0min: [2024-01-02 19:54:34,497][model8_pretrain.py][INFO] Epoch:[0/2](21500/4588595) loss:3.242 lr:0.0002544 epoch_Time:28630.0min: [2024-01-02 19:54:34,497][model8_pretrain.py][INFO] Epoch:[0/2](21500/4588595) 
loss:3.451 lr:0.0002544 epoch_Time:28630.0min: [2024-01-02 19:54:34,497][model8_pretrain.py][INFO] Epoch:[0/2](21500/4588595) loss:3.925 lr:0.0002544 epoch_Time:28630.0min: [2024-01-02 19:54:34,497][model8_pretrain.py][INFO] Epoch:[0/2](21500/4588595) loss:3.126 lr:0.0002544 epoch_Time:28630.0min: [2024-01-02 19:54:34,497][model8_pretrain.py][INFO] Epoch:[0/2](21500/4588595) loss:3.254 lr:0.0002544 epoch_Time:28630.0min: [2024-01-02 19:54:34,497][model8_pretrain.py][INFO] Epoch:[0/2](21500/4588595) loss:3.652 lr:0.0002544 epoch_Time:28630.0min: [2024-01-02 19:54:34,497][model8_pretrain.py][INFO] Epoch:[0/2](21500/4588595) loss:3.175 lr:0.0002544 epoch_Time:28630.0min: [2024-01-02 19:54:34,497][model8_pretrain.py][INFO] Epoch:[0/2](21500/4588595) loss:3.556 lr:0.0002544 epoch_Time:28630.0min: [2024-01-02 19:55:11,440][model8_pretrain.py][INFO] Epoch:[0/2](21600/4588595) loss:3.509 lr:0.0002540 epoch_Time:28626.0min: [2024-01-02 19:55:11,440][model8_pretrain.py][INFO] Epoch:[0/2](21600/4588595) loss:3.231 lr:0.0002540 epoch_Time:28626.0min: [2024-01-02 19:55:11,440][model8_pretrain.py][INFO] Epoch:[0/2](21600/4588595) loss:3.410 lr:0.0002540 epoch_Time:28626.0min: [2024-01-02 19:55:11,440][model8_pretrain.py][INFO] Epoch:[0/2](21600/4588595) loss:4.017 lr:0.0002540 epoch_Time:28626.0min: [2024-01-02 19:55:11,440][model8_pretrain.py][INFO] Epoch:[0/2](21600/4588595) loss:3.286 lr:0.0002540 epoch_Time:28626.0min: [2024-01-02 19:55:11,440][model8_pretrain.py][INFO] Epoch:[0/2](21600/4588595) loss:3.542 lr:0.0002540 epoch_Time:28626.0min: [2024-01-02 19:55:11,440][model8_pretrain.py][INFO] Epoch:[0/2](21600/4588595) loss:3.570 lr:0.0002540 epoch_Time:28626.0min: [2024-01-02 19:55:11,440][model8_pretrain.py][INFO] Epoch:[0/2](21600/4588595) loss:3.512 lr:0.0002540 epoch_Time:28626.0min: [2024-01-02 19:55:48,390][model8_pretrain.py][INFO] Epoch:[0/2](21700/4588595) loss:3.553 lr:0.0002536 epoch_Time:28623.0min: [2024-01-02 19:55:48,390][model8_pretrain.py][INFO] 
Epoch:[0/2](21700/4588595) loss:3.812 lr:0.0002536 epoch_Time:28623.0min: [2024-01-02 19:55:48,390][model8_pretrain.py][INFO] Epoch:[0/2](21700/4588595) loss:3.629 lr:0.0002536 epoch_Time:28623.0min: [2024-01-02 19:55:48,390][model8_pretrain.py][INFO] Epoch:[0/2](21700/4588595) loss:3.498 lr:0.0002536 epoch_Time:28623.0min: [2024-01-02 19:55:48,390][model8_pretrain.py][INFO] Epoch:[0/2](21700/4588595) loss:3.710 lr:0.0002536 epoch_Time:28623.0min: [2024-01-02 19:55:48,390][model8_pretrain.py][INFO] Epoch:[0/2](21700/4588595) loss:3.113 lr:0.0002536 epoch_Time:28623.0min: [2024-01-02 19:55:48,391][model8_pretrain.py][INFO] Epoch:[0/2](21700/4588595) loss:3.422 lr:0.0002536 epoch_Time:28623.0min: [2024-01-02 19:55:48,391][model8_pretrain.py][INFO] Epoch:[0/2](21700/4588595) loss:3.101 lr:0.0002536 epoch_Time:28623.0min: [2024-01-02 19:56:25,350][model8_pretrain.py][INFO] Epoch:[0/2](21800/4588595) loss:3.204 lr:0.0002532 epoch_Time:28621.0min: [2024-01-02 19:56:25,350][model8_pretrain.py][INFO] Epoch:[0/2](21800/4588595) loss:3.440 lr:0.0002532 epoch_Time:28621.0min: [2024-01-02 19:56:25,350][model8_pretrain.py][INFO] Epoch:[0/2](21800/4588595) loss:3.271 lr:0.0002532 epoch_Time:28621.0min: [2024-01-02 19:56:25,350][model8_pretrain.py][INFO] Epoch:[0/2](21800/4588595) loss:3.351 lr:0.0002532 epoch_Time:28621.0min: [2024-01-02 19:56:25,350][model8_pretrain.py][INFO] Epoch:[0/2](21800/4588595) loss:3.383 lr:0.0002532 epoch_Time:28621.0min: [2024-01-02 19:56:25,350][model8_pretrain.py][INFO] Epoch:[0/2](21800/4588595) loss:3.744 lr:0.0002532 epoch_Time:28621.0min: [2024-01-02 19:56:25,350][model8_pretrain.py][INFO] Epoch:[0/2](21800/4588595) loss:3.509 lr:0.0002532 epoch_Time:28621.0min: [2024-01-02 19:56:25,350][model8_pretrain.py][INFO] Epoch:[0/2](21800/4588595) loss:3.602 lr:0.0002532 epoch_Time:28621.0min: [2024-01-02 19:57:02,299][model8_pretrain.py][INFO] Epoch:[0/2](21900/4588595) loss:3.568 lr:0.0002527 epoch_Time:28617.0min: [2024-01-02 
19:57:02,299][model8_pretrain.py][INFO] Epoch:[0/2](21900/4588595) loss:3.726 lr:0.0002527 epoch_Time:28617.0min: [2024-01-02 19:57:02,299][model8_pretrain.py][INFO] Epoch:[0/2](21900/4588595) loss:3.791 lr:0.0002527 epoch_Time:28618.0min: [2024-01-02 19:57:02,299][model8_pretrain.py][INFO] Epoch:[0/2](21900/4588595) loss:3.010 lr:0.0002527 epoch_Time:28617.0min: [2024-01-02 19:57:02,299][model8_pretrain.py][INFO] Epoch:[0/2](21900/4588595) loss:3.913 lr:0.0002527 epoch_Time:28617.0min: [2024-01-02 19:57:02,299][model8_pretrain.py][INFO] Epoch:[0/2](21900/4588595) loss:3.793 lr:0.0002527 epoch_Time:28617.0min: [2024-01-02 19:57:02,299][model8_pretrain.py][INFO] Epoch:[0/2](21900/4588595) loss:3.126 lr:0.0002527 epoch_Time:28617.0min: [2024-01-02 19:57:02,299][model8_pretrain.py][INFO] Epoch:[0/2](21900/4588595) loss:3.374 lr:0.0002527 epoch_Time:28617.0min: [2024-01-02 19:57:39,263][model8_pretrain.py][INFO] Epoch:[0/2](22000/4588595) loss:3.350 lr:0.0002523 epoch_Time:28615.0min: [2024-01-02 19:57:39,263][model8_pretrain.py][INFO] Epoch:[0/2](22000/4588595) loss:3.459 lr:0.0002523 epoch_Time:28615.0min: [2024-01-02 19:57:39,263][model8_pretrain.py][INFO] Epoch:[0/2](22000/4588595) loss:3.494 lr:0.0002523 epoch_Time:28615.0min: [2024-01-02 19:57:39,263][model8_pretrain.py][INFO] Epoch:[0/2](22000/4588595) loss:3.342 lr:0.0002523 epoch_Time:28615.0min: [2024-01-02 19:57:39,263][model8_pretrain.py][INFO] Epoch:[0/2](22000/4588595) loss:3.470 lr:0.0002523 epoch_Time:28615.0min: [2024-01-02 19:57:39,263][model8_pretrain.py][INFO] Epoch:[0/2](22000/4588595) loss:3.338 lr:0.0002523 epoch_Time:28615.0min: [2024-01-02 19:57:39,263][model8_pretrain.py][INFO] Epoch:[0/2](22000/4588595) loss:3.763 lr:0.0002523 epoch_Time:28615.0min: [2024-01-02 19:57:39,263][model8_pretrain.py][INFO] Epoch:[0/2](22000/4588595) loss:3.594 lr:0.0002523 epoch_Time:28615.0min: [2024-01-02 19:58:19,796][model8_pretrain.py][INFO] Epoch:[0/2](22100/4588595) loss:3.059 lr:0.0002519 
epoch_Time:28624.0min: [2024-01-02 19:58:19,797][model8_pretrain.py][INFO] Epoch:[0/2](22100/4588595) loss:3.403 lr:0.0002519 epoch_Time:28624.0min: [2024-01-02 19:58:19,797][model8_pretrain.py][INFO] Epoch:[0/2](22100/4588595) loss:3.396 lr:0.0002519 epoch_Time:28624.0min: [2024-01-02 19:58:19,797][model8_pretrain.py][INFO] Epoch:[0/2](22100/4588595) loss:3.486 lr:0.0002519 epoch_Time:28624.0min: [2024-01-02 19:58:19,797][model8_pretrain.py][INFO] Epoch:[0/2](22100/4588595) loss:3.347 lr:0.0002519 epoch_Time:28624.0min: [2024-01-02 19:58:19,797][model8_pretrain.py][INFO] Epoch:[0/2](22100/4588595) loss:3.469 lr:0.0002519 epoch_Time:28624.0min: [2024-01-02 19:58:19,797][model8_pretrain.py][INFO] Epoch:[0/2](22100/4588595) loss:3.456 lr:0.0002519 epoch_Time:28624.0min: [2024-01-02 19:58:19,797][model8_pretrain.py][INFO] Epoch:[0/2](22100/4588595) loss:3.491 lr:0.0002519 epoch_Time:28624.0min: [2024-01-02 19:58:56,751][model8_pretrain.py][INFO] Epoch:[0/2](22200/4588595) loss:3.508 lr:0.0002515 epoch_Time:28621.0min: [2024-01-02 19:58:56,751][model8_pretrain.py][INFO] Epoch:[0/2](22200/4588595) loss:3.335 lr:0.0002515 epoch_Time:28621.0min: [2024-01-02 19:58:56,751][model8_pretrain.py][INFO] Epoch:[0/2](22200/4588595) loss:3.895 lr:0.0002515 epoch_Time:28621.0min: [2024-01-02 19:58:56,751][model8_pretrain.py][INFO] Epoch:[0/2](22200/4588595) loss:3.680 lr:0.0002515 epoch_Time:28621.0min: [2024-01-02 19:58:56,751][model8_pretrain.py][INFO] Epoch:[0/2](22200/4588595) loss:3.532 lr:0.0002515 epoch_Time:28621.0min: [2024-01-02 19:58:56,751][model8_pretrain.py][INFO] Epoch:[0/2](22200/4588595) loss:3.462 lr:0.0002515 epoch_Time:28621.0min: [2024-01-02 19:58:56,751][model8_pretrain.py][INFO] Epoch:[0/2](22200/4588595) loss:3.824 lr:0.0002515 epoch_Time:28621.0min: [2024-01-02 19:58:56,751][model8_pretrain.py][INFO] Epoch:[0/2](22200/4588595) loss:3.201 lr:0.0002515 epoch_Time:28621.0min: [2024-01-02 19:59:33,700][model8_pretrain.py][INFO] Epoch:[0/2](22300/4588595) 
loss:3.560 lr:0.0002510 epoch_Time:28619.0min: [2024-01-02 19:59:33,700][model8_pretrain.py][INFO] Epoch:[0/2](22300/4588595) loss:3.681 lr:0.0002510 epoch_Time:28619.0min: [2024-01-02 19:59:33,700][model8_pretrain.py][INFO] Epoch:[0/2](22300/4588595) loss:3.509 lr:0.0002510 epoch_Time:28619.0min: [2024-01-02 19:59:33,700][model8_pretrain.py][INFO] Epoch:[0/2](22300/4588595) loss:3.408 lr:0.0002510 epoch_Time:28619.0min: [2024-01-02 19:59:33,700][model8_pretrain.py][INFO] Epoch:[0/2](22300/4588595) loss:3.025 lr:0.0002510 epoch_Time:28619.0min: [2024-01-02 19:59:33,700][model8_pretrain.py][INFO] Epoch:[0/2](22300/4588595) loss:3.755 lr:0.0002510 epoch_Time:28619.0min: [2024-01-02 19:59:33,700][model8_pretrain.py][INFO] Epoch:[0/2](22300/4588595) loss:2.911 lr:0.0002510 epoch_Time:28619.0min: [2024-01-02 19:59:33,700][model8_pretrain.py][INFO] Epoch:[0/2](22300/4588595) loss:3.707 lr:0.0002510 epoch_Time:28619.0min: [2024-01-02 20:00:10,648][model8_pretrain.py][INFO] Epoch:[0/2](22400/4588595) loss:3.383 lr:0.0002506 epoch_Time:28616.0min: [2024-01-02 20:00:10,648][model8_pretrain.py][INFO] Epoch:[0/2](22400/4588595) loss:3.564 lr:0.0002506 epoch_Time:28616.0min: [2024-01-02 20:00:10,648][model8_pretrain.py][INFO] Epoch:[0/2](22400/4588595) loss:3.494 lr:0.0002506 epoch_Time:28616.0min: [2024-01-02 20:00:10,648][model8_pretrain.py][INFO] Epoch:[0/2](22400/4588595) loss:3.570 lr:0.0002506 epoch_Time:28616.0min: [2024-01-02 20:00:10,648][model8_pretrain.py][INFO] Epoch:[0/2](22400/4588595) loss:3.633 lr:0.0002506 epoch_Time:28616.0min: [2024-01-02 20:00:10,648][model8_pretrain.py][INFO] Epoch:[0/2](22400/4588595) loss:3.282 lr:0.0002506 epoch_Time:28616.0min: [2024-01-02 20:00:10,648][model8_pretrain.py][INFO] Epoch:[0/2](22400/4588595) loss:3.656 lr:0.0002506 epoch_Time:28616.0min: [2024-01-02 20:00:10,648][model8_pretrain.py][INFO] Epoch:[0/2](22400/4588595) loss:4.035 lr:0.0002506 epoch_Time:28616.0min: [2024-01-02 20:00:47,604][model8_pretrain.py][INFO] 
Epoch:[0/2](22500/4588595) loss:3.344 lr:0.0002502 epoch_Time:28612.0min: [2024-01-02 20:00:47,604][model8_pretrain.py][INFO] Epoch:[0/2](22500/4588595) loss:3.533 lr:0.0002502 epoch_Time:28613.0min: [2024-01-02 20:00:47,604][model8_pretrain.py][INFO] Epoch:[0/2](22500/4588595) loss:3.348 lr:0.0002502 epoch_Time:28613.0min: [2024-01-02 20:00:47,604][model8_pretrain.py][INFO] Epoch:[0/2](22500/4588595) loss:3.789 lr:0.0002502 epoch_Time:28612.0min: [2024-01-02 20:00:47,604][model8_pretrain.py][INFO] Epoch:[0/2](22500/4588595) loss:3.333 lr:0.0002502 epoch_Time:28613.0min: [2024-01-02 20:00:47,604][model8_pretrain.py][INFO] Epoch:[0/2](22500/4588595) loss:3.730 lr:0.0002502 epoch_Time:28612.0min: [2024-01-02 20:00:47,604][model8_pretrain.py][INFO] Epoch:[0/2](22500/4588595) loss:3.346 lr:0.0002502 epoch_Time:28612.0min: [2024-01-02 20:00:47,604][model8_pretrain.py][INFO] Epoch:[0/2](22500/4588595) loss:4.040 lr:0.0002502 epoch_Time:28612.0min: [2024-01-02 20:01:24,545][model8_pretrain.py][INFO] Epoch:[0/2](22600/4588595) loss:3.855 lr:0.0002497 epoch_Time:28610.0min: [2024-01-02 20:01:24,545][model8_pretrain.py][INFO] Epoch:[0/2](22600/4588595) loss:3.819 lr:0.0002497 epoch_Time:28610.0min: [2024-01-02 20:01:24,545][model8_pretrain.py][INFO] Epoch:[0/2](22600/4588595) loss:3.833 lr:0.0002497 epoch_Time:28610.0min: [2024-01-02 20:01:24,545][model8_pretrain.py][INFO] Epoch:[0/2](22600/4588595) loss:3.311 lr:0.0002497 epoch_Time:28610.0min: [2024-01-02 20:01:24,545][model8_pretrain.py][INFO] Epoch:[0/2](22600/4588595) loss:3.432 lr:0.0002497 epoch_Time:28610.0min: [2024-01-02 20:01:24,546][model8_pretrain.py][INFO] Epoch:[0/2](22600/4588595) loss:3.231 lr:0.0002497 epoch_Time:28610.0min: [2024-01-02 20:01:24,546][model8_pretrain.py][INFO] Epoch:[0/2](22600/4588595) loss:3.421 lr:0.0002497 epoch_Time:28610.0min: [2024-01-02 20:01:24,546][model8_pretrain.py][INFO] Epoch:[0/2](22600/4588595) loss:3.661 lr:0.0002497 epoch_Time:28610.0min: [2024-01-02 
20:02:01,537][model8_pretrain.py][INFO] Epoch:[0/2](22700/4588595) loss:3.270 lr:0.0002493 epoch_Time:28607.0min: [2024-01-02 20:02:01,537][model8_pretrain.py][INFO] Epoch:[0/2](22700/4588595) loss:3.650 lr:0.0002493 epoch_Time:28607.0min: [2024-01-02 20:02:01,537][model8_pretrain.py][INFO] Epoch:[0/2](22700/4588595) loss:3.650 lr:0.0002493 epoch_Time:28607.0min: [2024-01-02 20:02:01,537][model8_pretrain.py][INFO] Epoch:[0/2](22700/4588595) loss:3.611 lr:0.0002493 epoch_Time:28607.0min: [2024-01-02 20:02:01,537][model8_pretrain.py][INFO] Epoch:[0/2](22700/4588595) loss:3.442 lr:0.0002493 epoch_Time:28607.0min: [2024-01-02 20:02:01,537][model8_pretrain.py][INFO] Epoch:[0/2](22700/4588595) loss:3.230 lr:0.0002493 epoch_Time:28607.0min: [2024-01-02 20:02:01,537][model8_pretrain.py][INFO] Epoch:[0/2](22700/4588595) loss:3.496 lr:0.0002493 epoch_Time:28607.0min: [2024-01-02 20:02:01,538][model8_pretrain.py][INFO] Epoch:[0/2](22700/4588595) loss:2.948 lr:0.0002493 epoch_Time:28607.0min: [2024-01-02 20:02:38,501][model8_pretrain.py][INFO] Epoch:[0/2](22800/4588595) loss:3.295 lr:0.0002488 epoch_Time:28605.0min: [2024-01-02 20:02:38,501][model8_pretrain.py][INFO] Epoch:[0/2](22800/4588595) loss:3.750 lr:0.0002488 epoch_Time:28605.0min: [2024-01-02 20:02:38,501][model8_pretrain.py][INFO] Epoch:[0/2](22800/4588595) loss:3.626 lr:0.0002488 epoch_Time:28605.0min: [2024-01-02 20:02:38,501][model8_pretrain.py][INFO] Epoch:[0/2](22800/4588595) loss:2.710 lr:0.0002488 epoch_Time:28605.0min: [2024-01-02 20:02:38,501][model8_pretrain.py][INFO] Epoch:[0/2](22800/4588595) loss:3.317 lr:0.0002488 epoch_Time:28605.0min: [2024-01-02 20:02:38,501][model8_pretrain.py][INFO] Epoch:[0/2](22800/4588595) loss:3.400 lr:0.0002488 epoch_Time:28605.0min: [2024-01-02 20:02:38,501][model8_pretrain.py][INFO] Epoch:[0/2](22800/4588595) loss:3.610 lr:0.0002488 epoch_Time:28605.0min: [2024-01-02 20:02:38,501][model8_pretrain.py][INFO] Epoch:[0/2](22800/4588595) loss:3.418 lr:0.0002488 
epoch_Time:28605.0min: [2024-01-02 20:03:19,047][model8_pretrain.py][INFO] Epoch:[0/2](22900/4588595) loss:3.730 lr:0.0002484 epoch_Time:28614.0min: [2024-01-02 20:03:19,047][model8_pretrain.py][INFO] Epoch:[0/2](22900/4588595) loss:3.535 lr:0.0002484 epoch_Time:28614.0min: [2024-01-02 20:03:19,047][model8_pretrain.py][INFO] Epoch:[0/2](22900/4588595) loss:3.315 lr:0.0002484 epoch_Time:28614.0min: [2024-01-02 20:03:19,047][model8_pretrain.py][INFO] Epoch:[0/2](22900/4588595) loss:3.592 lr:0.0002484 epoch_Time:28614.0min: [2024-01-02 20:03:19,047][model8_pretrain.py][INFO] Epoch:[0/2](22900/4588595) loss:3.641 lr:0.0002484 epoch_Time:28614.0min: [2024-01-02 20:03:19,048][model8_pretrain.py][INFO] Epoch:[0/2](22900/4588595) loss:3.456 lr:0.0002484 epoch_Time:28614.0min: [2024-01-02 20:03:19,048][model8_pretrain.py][INFO] Epoch:[0/2](22900/4588595) loss:3.585 lr:0.0002484 epoch_Time:28614.0min: [2024-01-02 20:03:19,048][model8_pretrain.py][INFO] Epoch:[0/2](22900/4588595) loss:3.340 lr:0.0002484 epoch_Time:28614.0min: [2024-01-02 20:03:55,965][model8_pretrain.py][INFO] Epoch:[0/2](23000/4588595) loss:2.863 lr:0.0002480 epoch_Time:28611.0min: [2024-01-02 20:03:55,965][model8_pretrain.py][INFO] Epoch:[0/2](23000/4588595) loss:3.404 lr:0.0002480 epoch_Time:28611.0min: [2024-01-02 20:03:55,965][model8_pretrain.py][INFO] Epoch:[0/2](23000/4588595) loss:3.667 lr:0.0002480 epoch_Time:28611.0min: [2024-01-02 20:03:55,965][model8_pretrain.py][INFO] Epoch:[0/2](23000/4588595) loss:3.326 lr:0.0002480 epoch_Time:28611.0min: [2024-01-02 20:03:55,965][model8_pretrain.py][INFO] Epoch:[0/2](23000/4588595) loss:3.320 lr:0.0002480 epoch_Time:28611.0min: [2024-01-02 20:03:55,965][model8_pretrain.py][INFO] Epoch:[0/2](23000/4588595) loss:3.552 lr:0.0002480 epoch_Time:28611.0min: [2024-01-02 20:03:55,965][model8_pretrain.py][INFO] Epoch:[0/2](23000/4588595) loss:3.667 lr:0.0002480 epoch_Time:28611.0min: [2024-01-02 20:03:55,965][model8_pretrain.py][INFO] Epoch:[0/2](23000/4588595) 
loss:3.690 lr:0.0002480 epoch_Time:28611.0min: [2024-01-02 20:04:32,916][model8_pretrain.py][INFO] Epoch:[0/2](23100/4588595) loss:3.542 lr:0.0002475 epoch_Time:28608.0min: [2024-01-02 20:04:32,916][model8_pretrain.py][INFO] Epoch:[0/2](23100/4588595) loss:3.305 lr:0.0002475 epoch_Time:28609.0min: [2024-01-02 20:04:32,916][model8_pretrain.py][INFO] Epoch:[0/2](23100/4588595) loss:3.540 lr:0.0002475 epoch_Time:28608.0min: [2024-01-02 20:04:32,916][model8_pretrain.py][INFO] Epoch:[0/2](23100/4588595) loss:4.107 lr:0.0002475 epoch_Time:28609.0min: [2024-01-02 20:04:32,916][model8_pretrain.py][INFO] Epoch:[0/2](23100/4588595) loss:3.165 lr:0.0002475 epoch_Time:28609.0min: [2024-01-02 20:04:32,916][model8_pretrain.py][INFO] Epoch:[0/2](23100/4588595) loss:3.213 lr:0.0002475 epoch_Time:28609.0min: [2024-01-02 20:04:32,916][model8_pretrain.py][INFO] Epoch:[0/2](23100/4588595) loss:3.550 lr:0.0002475 epoch_Time:28609.0min: [2024-01-02 20:04:32,916][model8_pretrain.py][INFO] Epoch:[0/2](23100/4588595) loss:3.365 lr:0.0002475 epoch_Time:28609.0min: [2024-01-02 20:05:09,856][model8_pretrain.py][INFO] Epoch:[0/2](23200/4588595) loss:3.382 lr:0.0002471 epoch_Time:28605.0min: [2024-01-02 20:05:09,856][model8_pretrain.py][INFO] Epoch:[0/2](23200/4588595) loss:3.320 lr:0.0002471 epoch_Time:28605.0min: [2024-01-02 20:05:09,856][model8_pretrain.py][INFO] Epoch:[0/2](23200/4588595) loss:3.558 lr:0.0002471 epoch_Time:28605.0min: [2024-01-02 20:05:09,856][model8_pretrain.py][INFO] Epoch:[0/2](23200/4588595) loss:2.641 lr:0.0002471 epoch_Time:28605.0min: [2024-01-02 20:05:09,856][model8_pretrain.py][INFO] Epoch:[0/2](23200/4588595) loss:3.567 lr:0.0002471 epoch_Time:28605.0min: [2024-01-02 20:05:09,856][model8_pretrain.py][INFO] Epoch:[0/2](23200/4588595) loss:3.568 lr:0.0002471 epoch_Time:28605.0min: [2024-01-02 20:05:09,856][model8_pretrain.py][INFO] Epoch:[0/2](23200/4588595) loss:3.279 lr:0.0002471 epoch_Time:28605.0min: [2024-01-02 20:05:09,856][model8_pretrain.py][INFO] 
Epoch:[0/2](23200/4588595) loss:3.363 lr:0.0002471 epoch_Time:28605.0min: [2024-01-02 20:05:46,790][model8_pretrain.py][INFO] Epoch:[0/2](23300/4588595) loss:3.338 lr:0.0002466 epoch_Time:28603.0min: [2024-01-02 20:05:46,790][model8_pretrain.py][INFO] Epoch:[0/2](23300/4588595) loss:3.167 lr:0.0002466 epoch_Time:28603.0min: [2024-01-02 20:05:46,790][model8_pretrain.py][INFO] Epoch:[0/2](23300/4588595) loss:3.553 lr:0.0002466 epoch_Time:28603.0min: [2024-01-02 20:05:46,790][model8_pretrain.py][INFO] Epoch:[0/2](23300/4588595) loss:3.846 lr:0.0002466 epoch_Time:28603.0min: [2024-01-02 20:05:46,790][model8_pretrain.py][INFO] Epoch:[0/2](23300/4588595) loss:3.506 lr:0.0002466 epoch_Time:28603.0min: [2024-01-02 20:05:46,790][model8_pretrain.py][INFO] Epoch:[0/2](23300/4588595) loss:3.606 lr:0.0002466 epoch_Time:28603.0min: [2024-01-02 20:05:46,790][model8_pretrain.py][INFO] Epoch:[0/2](23300/4588595) loss:3.610 lr:0.0002466 epoch_Time:28603.0min: [2024-01-02 20:05:46,790][model8_pretrain.py][INFO] Epoch:[0/2](23300/4588595) loss:3.434 lr:0.0002466 epoch_Time:28603.0min: [2024-01-02 20:06:23,733][model8_pretrain.py][INFO] Epoch:[0/2](23400/4588595) loss:3.647 lr:0.0002462 epoch_Time:28600.0min: [2024-01-02 20:06:23,733][model8_pretrain.py][INFO] Epoch:[0/2](23400/4588595) loss:3.366 lr:0.0002462 epoch_Time:28600.0min: [2024-01-02 20:06:23,733][model8_pretrain.py][INFO] Epoch:[0/2](23400/4588595) loss:3.815 lr:0.0002462 epoch_Time:28600.0min: [2024-01-02 20:06:23,733][model8_pretrain.py][INFO] Epoch:[0/2](23400/4588595) loss:3.402 lr:0.0002462 epoch_Time:28600.0min: [2024-01-02 20:06:23,733][model8_pretrain.py][INFO] Epoch:[0/2](23400/4588595) loss:3.909 lr:0.0002462 epoch_Time:28600.0min: [2024-01-02 20:06:23,733][model8_pretrain.py][INFO] Epoch:[0/2](23400/4588595) loss:3.651 lr:0.0002462 epoch_Time:28600.0min: [2024-01-02 20:06:23,733][model8_pretrain.py][INFO] Epoch:[0/2](23400/4588595) loss:3.280 lr:0.0002462 epoch_Time:28600.0min: [2024-01-02 
20:06:23,733][model8_pretrain.py][INFO] Epoch:[0/2](23400/4588595) loss:3.681 lr:0.0002462 epoch_Time:28600.0min: [2024-01-02 20:07:00,667][model8_pretrain.py][INFO] Epoch:[0/2](23500/4588595) loss:3.484 lr:0.0002457 epoch_Time:28597.0min: [2024-01-02 20:07:00,667][model8_pretrain.py][INFO] Epoch:[0/2](23500/4588595) loss:3.357 lr:0.0002457 epoch_Time:28597.0min: [2024-01-02 20:07:00,667][model8_pretrain.py][INFO] Epoch:[0/2](23500/4588595) loss:3.581 lr:0.0002457 epoch_Time:28597.0min: [2024-01-02 20:07:00,667][model8_pretrain.py][INFO] Epoch:[0/2](23500/4588595) loss:3.280 lr:0.0002457 epoch_Time:28597.0min: [2024-01-02 20:07:00,667][model8_pretrain.py][INFO] Epoch:[0/2](23500/4588595) loss:3.765 lr:0.0002457 epoch_Time:28597.0min: [2024-01-02 20:07:00,667][model8_pretrain.py][INFO] Epoch:[0/2](23500/4588595) loss:3.409 lr:0.0002457 epoch_Time:28597.0min: [2024-01-02 20:07:00,667][model8_pretrain.py][INFO] Epoch:[0/2](23500/4588595) loss:3.405 lr:0.0002457 epoch_Time:28597.0min: [2024-01-02 20:07:00,668][model8_pretrain.py][INFO] Epoch:[0/2](23500/4588595) loss:3.384 lr:0.0002457 epoch_Time:28597.0min: [2024-01-02 20:07:37,604][model8_pretrain.py][INFO] Epoch:[0/2](23600/4588595) loss:3.560 lr:0.0002453 epoch_Time:28595.0min: [2024-01-02 20:07:37,604][model8_pretrain.py][INFO] Epoch:[0/2](23600/4588595) loss:3.553 lr:0.0002453 epoch_Time:28595.0min: [2024-01-02 20:07:37,604][model8_pretrain.py][INFO] Epoch:[0/2](23600/4588595) loss:3.664 lr:0.0002453 epoch_Time:28595.0min: [2024-01-02 20:07:37,604][model8_pretrain.py][INFO] Epoch:[0/2](23600/4588595) loss:3.876 lr:0.0002453 epoch_Time:28595.0min: [2024-01-02 20:07:37,604][model8_pretrain.py][INFO] Epoch:[0/2](23600/4588595) loss:3.225 lr:0.0002453 epoch_Time:28595.0min: [2024-01-02 20:07:37,604][model8_pretrain.py][INFO] Epoch:[0/2](23600/4588595) loss:3.375 lr:0.0002453 epoch_Time:28595.0min: [2024-01-02 20:07:37,604][model8_pretrain.py][INFO] Epoch:[0/2](23600/4588595) loss:3.388 lr:0.0002453 
epoch_Time:28595.0min: [2024-01-02 20:07:37,604][model8_pretrain.py][INFO] Epoch:[0/2](23600/4588595) loss:3.687 lr:0.0002453 epoch_Time:28595.0min: [2024-01-02 20:08:18,153][model8_pretrain.py][INFO] Epoch:[0/2](23700/4588595) loss:3.663 lr:0.0002448 epoch_Time:28603.0min: [2024-01-02 20:08:18,153][model8_pretrain.py][INFO] Epoch:[0/2](23700/4588595) loss:2.973 lr:0.0002448 epoch_Time:28603.0min: [2024-01-02 20:08:18,153][model8_pretrain.py][INFO] Epoch:[0/2](23700/4588595) loss:2.994 lr:0.0002448 epoch_Time:28604.0min: [2024-01-02 20:08:18,153][model8_pretrain.py][INFO] Epoch:[0/2](23700/4588595) loss:3.063 lr:0.0002448 epoch_Time:28603.0min: [2024-01-02 20:08:18,153][model8_pretrain.py][INFO] Epoch:[0/2](23700/4588595) loss:3.139 lr:0.0002448 epoch_Time:28603.0min: [2024-01-02 20:08:18,153][model8_pretrain.py][INFO] Epoch:[0/2](23700/4588595) loss:3.323 lr:0.0002448 epoch_Time:28603.0min: [2024-01-02 20:08:18,153][model8_pretrain.py][INFO] Epoch:[0/2](23700/4588595) loss:3.368 lr:0.0002448 epoch_Time:28603.0min: [2024-01-02 20:08:18,153][model8_pretrain.py][INFO] Epoch:[0/2](23700/4588595) loss:3.533 lr:0.0002448 epoch_Time:28603.0min: [2024-01-02 20:08:55,077][model8_pretrain.py][INFO] Epoch:[0/2](23800/4588595) loss:3.588 lr:0.0002444 epoch_Time:28600.0min: [2024-01-02 20:08:55,077][model8_pretrain.py][INFO] Epoch:[0/2](23800/4588595) loss:3.422 lr:0.0002444 epoch_Time:28600.0min: [2024-01-02 20:08:55,077][model8_pretrain.py][INFO] Epoch:[0/2](23800/4588595) loss:3.681 lr:0.0002444 epoch_Time:28600.0min: [2024-01-02 20:08:55,077][model8_pretrain.py][INFO] Epoch:[0/2](23800/4588595) loss:3.635 lr:0.0002444 epoch_Time:28600.0min: [2024-01-02 20:08:55,077][model8_pretrain.py][INFO] Epoch:[0/2](23800/4588595) loss:3.476 lr:0.0002444 epoch_Time:28600.0min: [2024-01-02 20:08:55,077][model8_pretrain.py][INFO] Epoch:[0/2](23800/4588595) loss:3.725 lr:0.0002444 epoch_Time:28600.0min: [2024-01-02 20:08:55,077][model8_pretrain.py][INFO] Epoch:[0/2](23800/4588595) 
loss:3.433 lr:0.0002444 epoch_Time:28600.0min: [2024-01-02 20:08:55,077][model8_pretrain.py][INFO] Epoch:[0/2](23800/4588595) loss:3.198 lr:0.0002444 epoch_Time:28600.0min: [2024-01-02 20:09:32,021][model8_pretrain.py][INFO] Epoch:[0/2](23900/4588595) loss:3.438 lr:0.0002439 epoch_Time:28598.0min: [2024-01-02 20:09:32,021][model8_pretrain.py][INFO] Epoch:[0/2](23900/4588595) loss:3.184 lr:0.0002439 epoch_Time:28598.0min: [2024-01-02 20:09:32,021][model8_pretrain.py][INFO] Epoch:[0/2](23900/4588595) loss:3.666 lr:0.0002439 epoch_Time:28598.0min: [2024-01-02 20:09:32,021][model8_pretrain.py][INFO] Epoch:[0/2](23900/4588595) loss:3.348 lr:0.0002439 epoch_Time:28598.0min: [2024-01-02 20:09:32,021][model8_pretrain.py][INFO] Epoch:[0/2](23900/4588595) loss:3.311 lr:0.0002439 epoch_Time:28598.0min: [2024-01-02 20:09:32,021][model8_pretrain.py][INFO] Epoch:[0/2](23900/4588595) loss:3.451 lr:0.0002439 epoch_Time:28598.0min: [2024-01-02 20:09:32,021][model8_pretrain.py][INFO] Epoch:[0/2](23900/4588595) loss:3.509 lr:0.0002439 epoch_Time:28598.0min: [2024-01-02 20:09:32,021][model8_pretrain.py][INFO] Epoch:[0/2](23900/4588595) loss:3.648 lr:0.0002439 epoch_Time:28598.0min: [2024-01-02 20:10:08,990][model8_pretrain.py][INFO] Epoch:[0/2](24000/4588595) loss:3.642 lr:0.0002435 epoch_Time:28595.0min: [2024-01-02 20:10:08,991][model8_pretrain.py][INFO] Epoch:[0/2](24000/4588595) loss:3.613 lr:0.0002435 epoch_Time:28595.0min: [2024-01-02 20:10:08,991][model8_pretrain.py][INFO] Epoch:[0/2](24000/4588595) loss:4.083 lr:0.0002435 epoch_Time:28595.0min: [2024-01-02 20:10:08,991][model8_pretrain.py][INFO] Epoch:[0/2](24000/4588595) loss:3.647 lr:0.0002435 epoch_Time:28595.0min: [2024-01-02 20:10:08,991][model8_pretrain.py][INFO] Epoch:[0/2](24000/4588595) loss:3.365 lr:0.0002435 epoch_Time:28595.0min: [2024-01-02 20:10:08,991][model8_pretrain.py][INFO] Epoch:[0/2](24000/4588595) loss:3.613 lr:0.0002435 epoch_Time:28595.0min: [2024-01-02 20:10:08,991][model8_pretrain.py][INFO] 
Epoch:[0/2](24000/4588595) loss:3.783 lr:0.0002435 epoch_Time:28595.0min: [2024-01-02 20:10:08,992][model8_pretrain.py][INFO] Epoch:[0/2](24000/4588595) loss:3.888 lr:0.0002435 epoch_Time:28595.0min: [2024-01-02 20:10:45,941][model8_pretrain.py][INFO] Epoch:[0/2](24100/4588595) loss:3.113 lr:0.0002430 epoch_Time:28593.0min: [2024-01-02 20:10:45,941][model8_pretrain.py][INFO] Epoch:[0/2](24100/4588595) loss:3.383 lr:0.0002430 epoch_Time:28593.0min: [2024-01-02 20:10:45,941][model8_pretrain.py][INFO] Epoch:[0/2](24100/4588595) loss:3.533 lr:0.0002430 epoch_Time:28593.0min: [2024-01-02 20:10:45,942][model8_pretrain.py][INFO] Epoch:[0/2](24100/4588595) loss:3.526 lr:0.0002430 epoch_Time:28593.0min: [2024-01-02 20:10:45,942][model8_pretrain.py][INFO] Epoch:[0/2](24100/4588595) loss:2.997 lr:0.0002430 epoch_Time:28593.0min: [2024-01-02 20:10:45,942][model8_pretrain.py][INFO] Epoch:[0/2](24100/4588595) loss:4.042 lr:0.0002430 epoch_Time:28593.0min: [2024-01-02 20:10:45,942][model8_pretrain.py][INFO] Epoch:[0/2](24100/4588595) loss:3.799 lr:0.0002430 epoch_Time:28593.0min: [2024-01-02 20:10:45,942][model8_pretrain.py][INFO] Epoch:[0/2](24100/4588595) loss:3.398 lr:0.0002430 epoch_Time:28593.0min: [2024-01-02 20:11:22,876][model8_pretrain.py][INFO] Epoch:[0/2](24200/4588595) loss:3.756 lr:0.0002425 epoch_Time:28590.0min: [2024-01-02 20:11:22,876][model8_pretrain.py][INFO] Epoch:[0/2](24200/4588595) loss:3.291 lr:0.0002425 epoch_Time:28590.0min: [2024-01-02 20:11:22,876][model8_pretrain.py][INFO] Epoch:[0/2](24200/4588595) loss:3.599 lr:0.0002425 epoch_Time:28590.0min: [2024-01-02 20:11:22,876][model8_pretrain.py][INFO] Epoch:[0/2](24200/4588595) loss:3.963 lr:0.0002425 epoch_Time:28590.0min: [2024-01-02 20:11:22,876][model8_pretrain.py][INFO] Epoch:[0/2](24200/4588595) loss:3.668 lr:0.0002425 epoch_Time:28590.0min: [2024-01-02 20:11:22,876][model8_pretrain.py][INFO] Epoch:[0/2](24200/4588595) loss:3.499 lr:0.0002425 epoch_Time:28590.0min: [2024-01-02 
20:11:22,876][model8_pretrain.py][INFO] Epoch:[0/2](24200/4588595) loss:3.262 lr:0.0002425 epoch_Time:28590.0min: [2024-01-02 20:11:22,876][model8_pretrain.py][INFO] Epoch:[0/2](24200/4588595) loss:3.441 lr:0.0002425 epoch_Time:28590.0min: [2024-01-02 20:11:59,808][model8_pretrain.py][INFO] Epoch:[0/2](24300/4588595) loss:3.930 lr:0.0002421 epoch_Time:28587.0min: [2024-01-02 20:11:59,808][model8_pretrain.py][INFO] Epoch:[0/2](24300/4588595) loss:3.042 lr:0.0002421 epoch_Time:28587.0min: [2024-01-02 20:11:59,808][model8_pretrain.py][INFO] Epoch:[0/2](24300/4588595) loss:3.092 lr:0.0002421 epoch_Time:28587.0min: [2024-01-02 20:11:59,808][model8_pretrain.py][INFO] Epoch:[0/2](24300/4588595) loss:3.650 lr:0.0002421 epoch_Time:28587.0min: [2024-01-02 20:11:59,808][model8_pretrain.py][INFO] Epoch:[0/2](24300/4588595) loss:3.379 lr:0.0002421 epoch_Time:28587.0min: [2024-01-02 20:11:59,808][model8_pretrain.py][INFO] Epoch:[0/2](24300/4588595) loss:3.741 lr:0.0002421 epoch_Time:28587.0min: [2024-01-02 20:11:59,808][model8_pretrain.py][INFO] Epoch:[0/2](24300/4588595) loss:3.205 lr:0.0002421 epoch_Time:28587.0min: [2024-01-02 20:11:59,808][model8_pretrain.py][INFO] Epoch:[0/2](24300/4588595) loss:3.682 lr:0.0002421 epoch_Time:28587.0min: [2024-01-02 20:12:36,740][model8_pretrain.py][INFO] Epoch:[0/2](24400/4588595) loss:3.451 lr:0.0002416 epoch_Time:28585.0min: [2024-01-02 20:12:36,740][model8_pretrain.py][INFO] Epoch:[0/2](24400/4588595) loss:3.455 lr:0.0002416 epoch_Time:28585.0min: [2024-01-02 20:12:36,740][model8_pretrain.py][INFO] Epoch:[0/2](24400/4588595) loss:3.785 lr:0.0002416 epoch_Time:28585.0min: [2024-01-02 20:12:36,740][model8_pretrain.py][INFO] Epoch:[0/2](24400/4588595) loss:3.510 lr:0.0002416 epoch_Time:28585.0min: [2024-01-02 20:12:36,740][model8_pretrain.py][INFO] Epoch:[0/2](24400/4588595) loss:2.606 lr:0.0002416 epoch_Time:28585.0min: [2024-01-02 20:12:36,740][model8_pretrain.py][INFO] Epoch:[0/2](24400/4588595) loss:3.435 lr:0.0002416 
epoch_Time:28585.0min: [2024-01-02 20:12:36,740][model8_pretrain.py][INFO] Epoch:[0/2](24400/4588595) loss:3.383 lr:0.0002416 epoch_Time:28585.0min: [2024-01-02 20:12:36,740][model8_pretrain.py][INFO] Epoch:[0/2](24400/4588595) loss:3.484 lr:0.0002416 epoch_Time:28585.0min: [2024-01-02 20:13:17,242][model8_pretrain.py][INFO] Epoch:[0/2](24500/4588595) loss:3.426 lr:0.0002412 epoch_Time:28593.0min: [2024-01-02 20:13:17,242][model8_pretrain.py][INFO] Epoch:[0/2](24500/4588595) loss:3.662 lr:0.0002412 epoch_Time:28593.0min: [2024-01-02 20:13:17,242][model8_pretrain.py][INFO] Epoch:[0/2](24500/4588595) loss:3.375 lr:0.0002412 epoch_Time:28593.0min: [2024-01-02 20:13:17,242][model8_pretrain.py][INFO] Epoch:[0/2](24500/4588595) loss:3.391 lr:0.0002412 epoch_Time:28593.0min: [2024-01-02 20:13:17,242][model8_pretrain.py][INFO] Epoch:[0/2](24500/4588595) loss:3.561 lr:0.0002412 epoch_Time:28593.0min: [2024-01-02 20:13:17,242][model8_pretrain.py][INFO] Epoch:[0/2](24500/4588595) loss:3.561 lr:0.0002412 epoch_Time:28593.0min: [2024-01-02 20:13:17,242][model8_pretrain.py][INFO] Epoch:[0/2](24500/4588595) loss:3.506 lr:0.0002412 epoch_Time:28593.0min: [2024-01-02 20:13:17,242][model8_pretrain.py][INFO] Epoch:[0/2](24500/4588595) loss:3.793 lr:0.0002412 epoch_Time:28593.0min: [2024-01-02 20:13:54,161][model8_pretrain.py][INFO] Epoch:[0/2](24600/4588595) loss:3.726 lr:0.0002407 epoch_Time:28590.0min: [2024-01-02 20:13:54,161][model8_pretrain.py][INFO] Epoch:[0/2](24600/4588595) loss:3.490 lr:0.0002407 epoch_Time:28590.0min: [2024-01-02 20:13:54,161][model8_pretrain.py][INFO] Epoch:[0/2](24600/4588595) loss:2.780 lr:0.0002407 epoch_Time:28590.0min: [2024-01-02 20:13:54,162][model8_pretrain.py][INFO] Epoch:[0/2](24600/4588595) loss:3.708 lr:0.0002407 epoch_Time:28590.0min: [2024-01-02 20:13:54,162][model8_pretrain.py][INFO] Epoch:[0/2](24600/4588595) loss:3.743 lr:0.0002407 epoch_Time:28590.0min: [2024-01-02 20:13:54,162][model8_pretrain.py][INFO] Epoch:[0/2](24600/4588595) 
loss:2.649 lr:0.0002407 epoch_Time:28590.0min: [2024-01-02 20:13:54,162][model8_pretrain.py][INFO] Epoch:[0/2](24600/4588595) loss:3.973 lr:0.0002407 epoch_Time:28590.0min: [2024-01-02 20:13:54,162][model8_pretrain.py][INFO] Epoch:[0/2](24600/4588595) loss:2.933 lr:0.0002407 epoch_Time:28590.0min: [2024-01-02 20:14:31,117][model8_pretrain.py][INFO] Epoch:[0/2](24700/4588595) loss:3.171 lr:0.0002402 epoch_Time:28588.0min: [2024-01-02 20:14:31,117][model8_pretrain.py][INFO] Epoch:[0/2](24700/4588595) loss:3.342 lr:0.0002402 epoch_Time:28588.0min: [2024-01-02 20:14:31,117][model8_pretrain.py][INFO] Epoch:[0/2](24700/4588595) loss:3.318 lr:0.0002402 epoch_Time:28588.0min: [2024-01-02 20:14:31,117][model8_pretrain.py][INFO] Epoch:[0/2](24700/4588595) loss:2.987 lr:0.0002402 epoch_Time:28588.0min: [2024-01-02 20:14:31,117][model8_pretrain.py][INFO] Epoch:[0/2](24700/4588595) loss:3.380 lr:0.0002402 epoch_Time:28588.0min: [2024-01-02 20:14:31,117][model8_pretrain.py][INFO] Epoch:[0/2](24700/4588595) loss:3.744 lr:0.0002402 epoch_Time:28588.0min: [2024-01-02 20:14:31,117][model8_pretrain.py][INFO] Epoch:[0/2](24700/4588595) loss:3.433 lr:0.0002402 epoch_Time:28588.0min: [2024-01-02 20:14:31,117][model8_pretrain.py][INFO] Epoch:[0/2](24700/4588595) loss:3.204 lr:0.0002402 epoch_Time:28588.0min: [2024-01-02 20:15:08,062][model8_pretrain.py][INFO] Epoch:[0/2](24800/4588595) loss:3.558 lr:0.0002398 epoch_Time:28585.0min: [2024-01-02 20:15:08,062][model8_pretrain.py][INFO] Epoch:[0/2](24800/4588595) loss:3.451 lr:0.0002398 epoch_Time:28585.0min: [2024-01-02 20:15:08,062][model8_pretrain.py][INFO] Epoch:[0/2](24800/4588595) loss:3.511 lr:0.0002398 epoch_Time:28585.0min: [2024-01-02 20:15:08,062][model8_pretrain.py][INFO] Epoch:[0/2](24800/4588595) loss:3.560 lr:0.0002398 epoch_Time:28585.0min: [2024-01-02 20:15:08,062][model8_pretrain.py][INFO] Epoch:[0/2](24800/4588595) loss:3.594 lr:0.0002398 epoch_Time:28585.0min: [2024-01-02 20:15:08,062][model8_pretrain.py][INFO] 
Epoch:[0/2](24800/4588595) loss:3.542 lr:0.0002398 epoch_Time:28585.0min: [2024-01-02 20:15:08,062][model8_pretrain.py][INFO] Epoch:[0/2](24800/4588595) loss:3.843 lr:0.0002398 epoch_Time:28585.0min: [2024-01-02 20:15:08,062][model8_pretrain.py][INFO] Epoch:[0/2](24800/4588595) loss:2.986 lr:0.0002398 epoch_Time:28585.0min: [2024-01-02 20:15:44,987][model8_pretrain.py][INFO] Epoch:[0/2](24900/4588595) loss:3.198 lr:0.0002393 epoch_Time:28583.0min: [2024-01-02 20:15:44,987][model8_pretrain.py][INFO] Epoch:[0/2](24900/4588595) loss:3.658 lr:0.0002393 epoch_Time:28583.0min: [2024-01-02 20:15:44,987][model8_pretrain.py][INFO] Epoch:[0/2](24900/4588595) loss:3.477 lr:0.0002393 epoch_Time:28583.0min: [2024-01-02 20:15:44,987][model8_pretrain.py][INFO] Epoch:[0/2](24900/4588595) loss:3.196 lr:0.0002393 epoch_Time:28583.0min: [2024-01-02 20:15:44,987][model8_pretrain.py][INFO] Epoch:[0/2](24900/4588595) loss:3.475 lr:0.0002393 epoch_Time:28583.0min: [2024-01-02 20:15:44,987][model8_pretrain.py][INFO] Epoch:[0/2](24900/4588595) loss:3.503 lr:0.0002393 epoch_Time:28583.0min: [2024-01-02 20:15:44,987][model8_pretrain.py][INFO] Epoch:[0/2](24900/4588595) loss:3.153 lr:0.0002393 epoch_Time:28583.0min: [2024-01-02 20:15:44,987][model8_pretrain.py][INFO] Epoch:[0/2](24900/4588595) loss:3.472 lr:0.0002393 epoch_Time:28583.0min: [2024-01-02 20:16:21,915][model8_pretrain.py][INFO] Epoch:[0/2](25000/4588595) loss:3.578 lr:0.0002388 epoch_Time:28580.0min: [2024-01-02 20:16:21,915][model8_pretrain.py][INFO] Epoch:[0/2](25000/4588595) loss:3.600 lr:0.0002388 epoch_Time:28580.0min: [2024-01-02 20:16:21,915][model8_pretrain.py][INFO] Epoch:[0/2](25000/4588595) loss:4.054 lr:0.0002388 epoch_Time:28580.0min: [2024-01-02 20:16:21,915][model8_pretrain.py][INFO] Epoch:[0/2](25000/4588595) loss:3.342 lr:0.0002388 epoch_Time:28580.0min: [2024-01-02 20:16:21,915][model8_pretrain.py][INFO] Epoch:[0/2](25000/4588595) loss:3.467 lr:0.0002388 epoch_Time:28580.0min: [2024-01-02 
20:16:21,915][model8_pretrain.py][INFO] Epoch:[0/2](25000/4588595) loss:3.189 lr:0.0002388 epoch_Time:28580.0min: [2024-01-02 20:16:21,915][model8_pretrain.py][INFO] Epoch:[0/2](25000/4588595) loss:3.484 lr:0.0002388 epoch_Time:28580.0min: [2024-01-02 20:16:21,916][model8_pretrain.py][INFO] Epoch:[0/2](25000/4588595) loss:3.526 lr:0.0002388 epoch_Time:28580.0min: [2024-01-02 20:16:58,859][model8_pretrain.py][INFO] Epoch:[0/2](25100/4588595) loss:2.803 lr:0.0002384 epoch_Time:28577.0min: [2024-01-02 20:16:58,859][model8_pretrain.py][INFO] Epoch:[0/2](25100/4588595) loss:3.591 lr:0.0002384 epoch_Time:28577.0min: [2024-01-02 20:16:58,859][model8_pretrain.py][INFO] Epoch:[0/2](25100/4588595) loss:3.120 lr:0.0002384 epoch_Time:28577.0min: [2024-01-02 20:16:58,859][model8_pretrain.py][INFO] Epoch:[0/2](25100/4588595) loss:3.330 lr:0.0002384 epoch_Time:28577.0min: [2024-01-02 20:16:58,859][model8_pretrain.py][INFO] Epoch:[0/2](25100/4588595) loss:3.638 lr:0.0002384 epoch_Time:28577.0min: [2024-01-02 20:16:58,859][model8_pretrain.py][INFO] Epoch:[0/2](25100/4588595) loss:3.703 lr:0.0002384 epoch_Time:28577.0min: [2024-01-02 20:16:58,859][model8_pretrain.py][INFO] Epoch:[0/2](25100/4588595) loss:3.505 lr:0.0002384 epoch_Time:28577.0min: [2024-01-02 20:16:58,859][model8_pretrain.py][INFO] Epoch:[0/2](25100/4588595) loss:3.883 lr:0.0002384 epoch_Time:28577.0min: [2024-01-02 20:17:35,787][model8_pretrain.py][INFO] Epoch:[0/2](25200/4588595) loss:3.418 lr:0.0002379 epoch_Time:28575.0min: [2024-01-02 20:17:35,787][model8_pretrain.py][INFO] Epoch:[0/2](25200/4588595) loss:3.270 lr:0.0002379 epoch_Time:28575.0min: [2024-01-02 20:17:35,787][model8_pretrain.py][INFO] Epoch:[0/2](25200/4588595) loss:3.361 lr:0.0002379 epoch_Time:28575.0min: [2024-01-02 20:17:35,787][model8_pretrain.py][INFO] Epoch:[0/2](25200/4588595) loss:3.726 lr:0.0002379 epoch_Time:28575.0min: [2024-01-02 20:17:35,787][model8_pretrain.py][INFO] Epoch:[0/2](25200/4588595) loss:3.536 lr:0.0002379 
epoch_Time:28575.0min: [2024-01-02 20:17:35,787][model8_pretrain.py][INFO] Epoch:[0/2](25200/4588595) loss:3.158 lr:0.0002379 epoch_Time:28575.0min: [2024-01-02 20:17:35,787][model8_pretrain.py][INFO] Epoch:[0/2](25200/4588595) loss:3.736 lr:0.0002379 epoch_Time:28575.0min: [2024-01-02 20:17:35,787][model8_pretrain.py][INFO] Epoch:[0/2](25200/4588595) loss:3.411 lr:0.0002379 epoch_Time:28575.0min: [2024-01-02 20:18:16,369][model8_pretrain.py][INFO] Epoch:[0/2](25300/4588595) loss:3.654 lr:0.0002374 epoch_Time:28583.0min: [2024-01-02 20:18:16,369][model8_pretrain.py][INFO] Epoch:[0/2](25300/4588595) loss:3.919 lr:0.0002374 epoch_Time:28583.0min: [2024-01-02 20:18:16,369][model8_pretrain.py][INFO] Epoch:[0/2](25300/4588595) loss:2.828 lr:0.0002374 epoch_Time:28583.0min: [2024-01-02 20:18:16,369][model8_pretrain.py][INFO] Epoch:[0/2](25300/4588595) loss:3.663 lr:0.0002374 epoch_Time:28583.0min: [2024-01-02 20:18:16,369][model8_pretrain.py][INFO] Epoch:[0/2](25300/4588595) loss:3.525 lr:0.0002374 epoch_Time:28583.0min: [2024-01-02 20:18:16,369][model8_pretrain.py][INFO] Epoch:[0/2](25300/4588595) loss:3.434 lr:0.0002374 epoch_Time:28583.0min: [2024-01-02 20:18:16,369][model8_pretrain.py][INFO] Epoch:[0/2](25300/4588595) loss:3.705 lr:0.0002374 epoch_Time:28583.0min: [2024-01-02 20:18:16,370][model8_pretrain.py][INFO] Epoch:[0/2](25300/4588595) loss:3.414 lr:0.0002374 epoch_Time:28583.0min: [2024-01-02 20:18:53,316][model8_pretrain.py][INFO] Epoch:[0/2](25400/4588595) loss:3.284 lr:0.0002369 epoch_Time:28581.0min: [2024-01-02 20:18:53,316][model8_pretrain.py][INFO] Epoch:[0/2](25400/4588595) loss:3.520 lr:0.0002369 epoch_Time:28580.0min: [2024-01-02 20:18:53,316][model8_pretrain.py][INFO] Epoch:[0/2](25400/4588595) loss:3.601 lr:0.0002369 epoch_Time:28580.0min: [2024-01-02 20:18:53,316][model8_pretrain.py][INFO] Epoch:[0/2](25400/4588595) loss:3.524 lr:0.0002369 epoch_Time:28580.0min: [2024-01-02 20:18:53,316][model8_pretrain.py][INFO] Epoch:[0/2](25400/4588595) 
loss:3.459 lr:0.0002369 epoch_Time:28580.0min: [2024-01-02 20:18:53,316][model8_pretrain.py][INFO] Epoch:[0/2](25400/4588595) loss:4.106 lr:0.0002369 epoch_Time:28580.0min: [2024-01-02 20:18:53,317][model8_pretrain.py][INFO] Epoch:[0/2](25400/4588595) loss:3.494 lr:0.0002369 epoch_Time:28580.0min: [2024-01-02 20:18:53,317][model8_pretrain.py][INFO] Epoch:[0/2](25400/4588595) loss:3.560 lr:0.0002369 epoch_Time:28581.0min: [2024-01-02 20:19:30,266][model8_pretrain.py][INFO] Epoch:[0/2](25500/4588595) loss:3.545 lr:0.0002365 epoch_Time:28579.0min: [2024-01-02 20:19:30,266][model8_pretrain.py][INFO] Epoch:[0/2](25500/4588595) loss:3.375 lr:0.0002365 epoch_Time:28579.0min: [2024-01-02 20:19:30,266][model8_pretrain.py][INFO] Epoch:[0/2](25500/4588595) loss:3.700 lr:0.0002365 epoch_Time:28579.0min: [2024-01-02 20:19:30,266][model8_pretrain.py][INFO] Epoch:[0/2](25500/4588595) loss:3.442 lr:0.0002365 epoch_Time:28579.0min: [2024-01-02 20:19:30,266][model8_pretrain.py][INFO] Epoch:[0/2](25500/4588595) loss:3.810 lr:0.0002365 epoch_Time:28579.0min: [2024-01-02 20:19:30,266][model8_pretrain.py][INFO] Epoch:[0/2](25500/4588595) loss:3.304 lr:0.0002365 epoch_Time:28579.0min: [2024-01-02 20:19:30,266][model8_pretrain.py][INFO] Epoch:[0/2](25500/4588595) loss:3.669 lr:0.0002365 epoch_Time:28579.0min: [2024-01-02 20:19:30,266][model8_pretrain.py][INFO] Epoch:[0/2](25500/4588595) loss:3.448 lr:0.0002365 epoch_Time:28579.0min: [2024-01-02 20:20:07,206][model8_pretrain.py][INFO] Epoch:[0/2](25600/4588595) loss:3.647 lr:0.0002360 epoch_Time:28576.0min: [2024-01-02 20:20:07,206][model8_pretrain.py][INFO] Epoch:[0/2](25600/4588595) loss:3.144 lr:0.0002360 epoch_Time:28576.0min: [2024-01-02 20:20:07,206][model8_pretrain.py][INFO] Epoch:[0/2](25600/4588595) loss:3.656 lr:0.0002360 epoch_Time:28576.0min: [2024-01-02 20:20:07,206][model8_pretrain.py][INFO] Epoch:[0/2](25600/4588595) loss:2.793 lr:0.0002360 epoch_Time:28576.0min: [2024-01-02 20:20:07,206][model8_pretrain.py][INFO] 
Epoch:[0/2](25600/4588595) loss:3.446 lr:0.0002360 epoch_Time:28576.0min: [2024-01-02 20:20:07,206][model8_pretrain.py][INFO] Epoch:[0/2](25600/4588595) loss:2.886 lr:0.0002360 epoch_Time:28576.0min: [2024-01-02 20:20:07,207][model8_pretrain.py][INFO] Epoch:[0/2](25600/4588595) loss:3.350 lr:0.0002360 epoch_Time:28576.0min: [2024-01-02 20:20:07,207][model8_pretrain.py][INFO] Epoch:[0/2](25600/4588595) loss:3.607 lr:0.0002360 epoch_Time:28576.0min: [2024-01-02 20:20:44,128][model8_pretrain.py][INFO] Epoch:[0/2](25700/4588595) loss:3.172 lr:0.0002355 epoch_Time:28574.0min: [2024-01-02 20:20:44,128][model8_pretrain.py][INFO] Epoch:[0/2](25700/4588595) loss:3.549 lr:0.0002355 epoch_Time:28574.0min: [2024-01-02 20:20:44,128][model8_pretrain.py][INFO] Epoch:[0/2](25700/4588595) loss:3.278 lr:0.0002355 epoch_Time:28574.0min: [2024-01-02 20:20:44,128][model8_pretrain.py][INFO] Epoch:[0/2](25700/4588595) loss:3.710 lr:0.0002355 epoch_Time:28574.0min: [2024-01-02 20:20:44,128][model8_pretrain.py][INFO] Epoch:[0/2](25700/4588595) loss:2.886 lr:0.0002355 epoch_Time:28574.0min: [2024-01-02 20:20:44,128][model8_pretrain.py][INFO] Epoch:[0/2](25700/4588595) loss:3.536 lr:0.0002355 epoch_Time:28574.0min: [2024-01-02 20:20:44,128][model8_pretrain.py][INFO] Epoch:[0/2](25700/4588595) loss:3.773 lr:0.0002355 epoch_Time:28574.0min: [2024-01-02 20:20:44,128][model8_pretrain.py][INFO] Epoch:[0/2](25700/4588595) loss:3.549 lr:0.0002355 epoch_Time:28574.0min: [2024-01-02 20:21:21,057][model8_pretrain.py][INFO] Epoch:[0/2](25800/4588595) loss:3.289 lr:0.0002350 epoch_Time:28571.0min: [2024-01-02 20:21:21,057][model8_pretrain.py][INFO] Epoch:[0/2](25800/4588595) loss:3.361 lr:0.0002350 epoch_Time:28571.0min: [2024-01-02 20:21:21,057][model8_pretrain.py][INFO] Epoch:[0/2](25800/4588595) loss:3.705 lr:0.0002350 epoch_Time:28571.0min: [2024-01-02 20:21:21,057][model8_pretrain.py][INFO] Epoch:[0/2](25800/4588595) loss:3.394 lr:0.0002350 epoch_Time:28571.0min: [2024-01-02 
20:21:21,057][model8_pretrain.py][INFO] Epoch:[0/2](25800/4588595) loss:3.270 lr:0.0002350 epoch_Time:28571.0min: [2024-01-02 20:21:21,057][model8_pretrain.py][INFO] Epoch:[0/2](25800/4588595) loss:3.245 lr:0.0002350 epoch_Time:28571.0min: [2024-01-02 20:21:21,058][model8_pretrain.py][INFO] Epoch:[0/2](25800/4588595) loss:2.826 lr:0.0002350 epoch_Time:28571.0min: [2024-01-02 20:21:21,058][model8_pretrain.py][INFO] Epoch:[0/2](25800/4588595) loss:3.558 lr:0.0002350 epoch_Time:28571.0min: [2024-01-02 20:21:57,961][model8_pretrain.py][INFO] Epoch:[0/2](25900/4588595) loss:3.438 lr:0.0002345 epoch_Time:28568.0min: [2024-01-02 20:21:57,961][model8_pretrain.py][INFO] Epoch:[0/2](25900/4588595) loss:3.589 lr:0.0002345 epoch_Time:28568.0min: [2024-01-02 20:21:57,961][model8_pretrain.py][INFO] Epoch:[0/2](25900/4588595) loss:3.587 lr:0.0002345 epoch_Time:28568.0min: [2024-01-02 20:21:57,961][model8_pretrain.py][INFO] Epoch:[0/2](25900/4588595) loss:3.670 lr:0.0002345 epoch_Time:28568.0min: [2024-01-02 20:21:57,961][model8_pretrain.py][INFO] Epoch:[0/2](25900/4588595) loss:3.611 lr:0.0002345 epoch_Time:28568.0min: [2024-01-02 20:21:57,961][model8_pretrain.py][INFO] Epoch:[0/2](25900/4588595) loss:3.552 lr:0.0002345 epoch_Time:28568.0min: [2024-01-02 20:21:57,961][model8_pretrain.py][INFO] Epoch:[0/2](25900/4588595) loss:3.656 lr:0.0002345 epoch_Time:28568.0min: [2024-01-02 20:21:57,961][model8_pretrain.py][INFO] Epoch:[0/2](25900/4588595) loss:3.412 lr:0.0002345 epoch_Time:28568.0min: [2024-01-02 20:22:34,896][model8_pretrain.py][INFO] Epoch:[0/2](26000/4588595) loss:3.807 lr:0.0002341 epoch_Time:28566.0min: [2024-01-02 20:22:34,897][model8_pretrain.py][INFO] Epoch:[0/2](26000/4588595) loss:3.344 lr:0.0002341 epoch_Time:28566.0min: [2024-01-02 20:22:34,897][model8_pretrain.py][INFO] Epoch:[0/2](26000/4588595) loss:3.409 lr:0.0002341 epoch_Time:28566.0min: [2024-01-02 20:22:34,897][model8_pretrain.py][INFO] Epoch:[0/2](26000/4588595) loss:3.805 lr:0.0002341 
epoch_Time:28566.0min: [2024-01-02 20:22:34,897][model8_pretrain.py][INFO] Epoch:[0/2](26000/4588595) loss:3.144 lr:0.0002341 epoch_Time:28566.0min: [2024-01-02 20:22:34,897][model8_pretrain.py][INFO] Epoch:[0/2](26000/4588595) loss:3.694 lr:0.0002341 epoch_Time:28566.0min: [2024-01-02 20:22:34,897][model8_pretrain.py][INFO] Epoch:[0/2](26000/4588595) loss:3.521 lr:0.0002341 epoch_Time:28566.0min: [2024-01-02 20:22:34,897][model8_pretrain.py][INFO] Epoch:[0/2](26000/4588595) loss:3.070 lr:0.0002341 epoch_Time:28566.0min: [2024-01-02 20:23:15,350][model8_pretrain.py][INFO] Epoch:[0/2](26100/4588595) loss:2.984 lr:0.0002336 epoch_Time:28573.0min: [2024-01-02 20:23:15,350][model8_pretrain.py][INFO] Epoch:[0/2](26100/4588595) loss:3.442 lr:0.0002336 epoch_Time:28573.0min: [2024-01-02 20:23:15,350][model8_pretrain.py][INFO] Epoch:[0/2](26100/4588595) loss:3.808 lr:0.0002336 epoch_Time:28573.0min: [2024-01-02 20:23:15,350][model8_pretrain.py][INFO] Epoch:[0/2](26100/4588595) loss:3.301 lr:0.0002336 epoch_Time:28574.0min: [2024-01-02 20:23:15,350][model8_pretrain.py][INFO] Epoch:[0/2](26100/4588595) loss:3.653 lr:0.0002336 epoch_Time:28574.0min: [2024-01-02 20:23:15,354][model8_pretrain.py][INFO] Epoch:[0/2](26100/4588595) loss:3.243 lr:0.0002336 epoch_Time:28573.0min: [2024-01-02 20:23:15,355][model8_pretrain.py][INFO] Epoch:[0/2](26100/4588595) loss:3.432 lr:0.0002336 epoch_Time:28573.0min: [2024-01-02 20:23:15,355][model8_pretrain.py][INFO] Epoch:[0/2](26100/4588595) loss:3.425 lr:0.0002336 epoch_Time:28573.0min: [2024-01-02 20:23:52,280][model8_pretrain.py][INFO] Epoch:[0/2](26200/4588595) loss:3.704 lr:0.0002331 epoch_Time:28571.0min: [2024-01-02 20:23:52,280][model8_pretrain.py][INFO] Epoch:[0/2](26200/4588595) loss:3.324 lr:0.0002331 epoch_Time:28571.0min: [2024-01-02 20:23:52,280][model8_pretrain.py][INFO] Epoch:[0/2](26200/4588595) loss:3.507 lr:0.0002331 epoch_Time:28571.0min: [2024-01-02 20:23:52,280][model8_pretrain.py][INFO] Epoch:[0/2](26200/4588595) 
loss:3.965 lr:0.0002331 epoch_Time:28571.0min: [2024-01-02 20:23:52,280][model8_pretrain.py][INFO] Epoch:[0/2](26200/4588595) loss:3.611 lr:0.0002331 epoch_Time:28571.0min: [2024-01-02 20:23:52,280][model8_pretrain.py][INFO] Epoch:[0/2](26200/4588595) loss:3.755 lr:0.0002331 epoch_Time:28571.0min: [2024-01-02 20:23:52,280][model8_pretrain.py][INFO] Epoch:[0/2](26200/4588595) loss:3.157 lr:0.0002331 epoch_Time:28571.0min: [2024-01-02 20:23:52,280][model8_pretrain.py][INFO] Epoch:[0/2](26200/4588595) loss:2.977 lr:0.0002331 epoch_Time:28571.0min: [2024-01-02 20:24:29,244][model8_pretrain.py][INFO] Epoch:[0/2](26300/4588595) loss:3.162 lr:0.0002326 epoch_Time:28569.0min: [2024-01-02 20:24:29,244][model8_pretrain.py][INFO] Epoch:[0/2](26300/4588595) loss:3.159 lr:0.0002326 epoch_Time:28569.0min: [2024-01-02 20:24:29,244][model8_pretrain.py][INFO] Epoch:[0/2](26300/4588595) loss:3.662 lr:0.0002326 epoch_Time:28569.0min: [2024-01-02 20:24:29,244][model8_pretrain.py][INFO] Epoch:[0/2](26300/4588595) loss:3.315 lr:0.0002326 epoch_Time:28569.0min: [2024-01-02 20:24:29,244][model8_pretrain.py][INFO] Epoch:[0/2](26300/4588595) loss:3.502 lr:0.0002326 epoch_Time:28569.0min: [2024-01-02 20:24:29,244][model8_pretrain.py][INFO] Epoch:[0/2](26300/4588595) loss:3.306 lr:0.0002326 epoch_Time:28569.0min: [2024-01-02 20:24:29,244][model8_pretrain.py][INFO] Epoch:[0/2](26300/4588595) loss:3.618 lr:0.0002326 epoch_Time:28569.0min: [2024-01-02 20:24:29,244][model8_pretrain.py][INFO] Epoch:[0/2](26300/4588595) loss:3.682 lr:0.0002326 epoch_Time:28569.0min: [2024-01-02 20:25:06,201][model8_pretrain.py][INFO] Epoch:[0/2](26400/4588595) loss:3.214 lr:0.0002321 epoch_Time:28566.0min: [2024-01-02 20:25:06,201][model8_pretrain.py][INFO] Epoch:[0/2](26400/4588595) loss:2.879 lr:0.0002321 epoch_Time:28566.0min: [2024-01-02 20:25:06,201][model8_pretrain.py][INFO] Epoch:[0/2](26400/4588595) loss:3.821 lr:0.0002321 epoch_Time:28566.0min: [2024-01-02 20:25:06,201][model8_pretrain.py][INFO] 
Epoch:[0/2](26400/4588595) loss:3.450 lr:0.0002321 epoch_Time:28566.0min: [2024-01-02 20:25:06,201][model8_pretrain.py][INFO] Epoch:[0/2](26400/4588595) loss:3.606 lr:0.0002321 epoch_Time:28566.0min: [2024-01-02 20:25:06,201][model8_pretrain.py][INFO] Epoch:[0/2](26400/4588595) loss:2.895 lr:0.0002321 epoch_Time:28566.0min: [2024-01-02 20:25:06,201][model8_pretrain.py][INFO] Epoch:[0/2](26400/4588595) loss:3.567 lr:0.0002321 epoch_Time:28566.0min: [2024-01-02 20:25:06,202][model8_pretrain.py][INFO] Epoch:[0/2](26400/4588595) loss:3.388 lr:0.0002321 epoch_Time:28566.0min: [2024-01-02 20:25:43,156][model8_pretrain.py][INFO] Epoch:[0/2](26500/4588595) loss:3.710 lr:0.0002316 epoch_Time:28564.0min: [2024-01-02 20:25:43,156][model8_pretrain.py][INFO] Epoch:[0/2](26500/4588595) loss:3.310 lr:0.0002316 epoch_Time:28564.0min: [2024-01-02 20:25:43,156][model8_pretrain.py][INFO] Epoch:[0/2](26500/4588595) loss:3.199 lr:0.0002316 epoch_Time:28564.0min: [2024-01-02 20:25:43,156][model8_pretrain.py][INFO] Epoch:[0/2](26500/4588595) loss:3.369 lr:0.0002316 epoch_Time:28564.0min: [2024-01-02 20:25:43,156][model8_pretrain.py][INFO] Epoch:[0/2](26500/4588595) loss:3.316 lr:0.0002316 epoch_Time:28564.0min: [2024-01-02 20:25:43,156][model8_pretrain.py][INFO] Epoch:[0/2](26500/4588595) loss:3.748 lr:0.0002316 epoch_Time:28564.0min: [2024-01-02 20:25:43,156][model8_pretrain.py][INFO] Epoch:[0/2](26500/4588595) loss:3.532 lr:0.0002316 epoch_Time:28564.0min: [2024-01-02 20:25:43,157][model8_pretrain.py][INFO] Epoch:[0/2](26500/4588595) loss:3.749 lr:0.0002316 epoch_Time:28564.0min: [2024-01-02 20:26:20,112][model8_pretrain.py][INFO] Epoch:[0/2](26600/4588595) loss:3.216 lr:0.0002311 epoch_Time:28561.0min: [2024-01-02 20:26:20,112][model8_pretrain.py][INFO] Epoch:[0/2](26600/4588595) loss:3.261 lr:0.0002311 epoch_Time:28562.0min: [2024-01-02 20:26:20,112][model8_pretrain.py][INFO] Epoch:[0/2](26600/4588595) loss:3.535 lr:0.0002311 epoch_Time:28561.0min: [2024-01-02 
20:26:20,112][model8_pretrain.py][INFO] Epoch:[0/2](26600/4588595) loss:3.622 lr:0.0002311 epoch_Time:28561.0min: [2024-01-02 20:26:20,112][model8_pretrain.py][INFO] Epoch:[0/2](26600/4588595) loss:3.349 lr:0.0002311 epoch_Time:28561.0min: [2024-01-02 20:26:20,112][model8_pretrain.py][INFO] Epoch:[0/2](26600/4588595) loss:3.897 lr:0.0002311 epoch_Time:28562.0min: [2024-01-02 20:26:20,112][model8_pretrain.py][INFO] Epoch:[0/2](26600/4588595) loss:3.413 lr:0.0002311 epoch_Time:28561.0min: [2024-01-02 20:26:20,112][model8_pretrain.py][INFO] Epoch:[0/2](26600/4588595) loss:3.763 lr:0.0002311 epoch_Time:28561.0min: [2024-01-02 20:26:57,063][model8_pretrain.py][INFO] Epoch:[0/2](26700/4588595) loss:3.092 lr:0.0002306 epoch_Time:28559.0min: [2024-01-02 20:26:57,063][model8_pretrain.py][INFO] Epoch:[0/2](26700/4588595) loss:3.095 lr:0.0002306 epoch_Time:28559.0min: [2024-01-02 20:26:57,063][model8_pretrain.py][INFO] Epoch:[0/2](26700/4588595) loss:3.667 lr:0.0002306 epoch_Time:28559.0min: [2024-01-02 20:26:57,063][model8_pretrain.py][INFO] Epoch:[0/2](26700/4588595) loss:3.851 lr:0.0002306 epoch_Time:28559.0min: [2024-01-02 20:26:57,063][model8_pretrain.py][INFO] Epoch:[0/2](26700/4588595) loss:3.408 lr:0.0002306 epoch_Time:28559.0min: [2024-01-02 20:26:57,063][model8_pretrain.py][INFO] Epoch:[0/2](26700/4588595) loss:3.514 lr:0.0002306 epoch_Time:28559.0min: [2024-01-02 20:26:57,064][model8_pretrain.py][INFO] Epoch:[0/2](26700/4588595) loss:3.410 lr:0.0002306 epoch_Time:28559.0min: [2024-01-02 20:26:57,067][model8_pretrain.py][INFO] Epoch:[0/2](26700/4588595) loss:3.164 lr:0.0002306 epoch_Time:28559.0min: [2024-01-02 20:27:34,004][model8_pretrain.py][INFO] Epoch:[0/2](26800/4588595) loss:2.969 lr:0.0002301 epoch_Time:28557.0min: [2024-01-02 20:27:34,004][model8_pretrain.py][INFO] Epoch:[0/2](26800/4588595) loss:3.456 lr:0.0002301 epoch_Time:28557.0min: [2024-01-02 20:27:34,004][model8_pretrain.py][INFO] Epoch:[0/2](26800/4588595) loss:3.694 lr:0.0002301 
epoch_Time:28557.0min: [2024-01-02 20:27:34,004][model8_pretrain.py][INFO] Epoch:[0/2](26800/4588595) loss:3.337 lr:0.0002301 epoch_Time:28557.0min: [2024-01-02 20:27:34,004][model8_pretrain.py][INFO] Epoch:[0/2](26800/4588595) loss:3.356 lr:0.0002301 epoch_Time:28557.0min: [2024-01-02 20:27:34,004][model8_pretrain.py][INFO] Epoch:[0/2](26800/4588595) loss:3.344 lr:0.0002301 epoch_Time:28557.0min: [2024-01-02 20:27:34,004][model8_pretrain.py][INFO] Epoch:[0/2](26800/4588595) loss:3.794 lr:0.0002301 epoch_Time:28557.0min: [2024-01-02 20:27:34,004][model8_pretrain.py][INFO] Epoch:[0/2](26800/4588595) loss:3.611 lr:0.0002301 epoch_Time:28557.0min: [2024-01-02 20:28:12,683][model8_pretrain.py][INFO] Epoch:[0/2](26900/4588595) loss:3.297 lr:0.0002297 epoch_Time:28559.0min: [2024-01-02 20:28:12,683][model8_pretrain.py][INFO] Epoch:[0/2](26900/4588595) loss:3.304 lr:0.0002297 epoch_Time:28559.0min: [2024-01-02 20:28:12,683][model8_pretrain.py][INFO] Epoch:[0/2](26900/4588595) loss:3.217 lr:0.0002297 epoch_Time:28559.0min: [2024-01-02 20:28:12,683][model8_pretrain.py][INFO] Epoch:[0/2](26900/4588595) loss:3.606 lr:0.0002297 epoch_Time:28559.0min: [2024-01-02 20:28:12,683][model8_pretrain.py][INFO] Epoch:[0/2](26900/4588595) loss:3.524 lr:0.0002297 epoch_Time:28559.0min: [2024-01-02 20:28:12,683][model8_pretrain.py][INFO] Epoch:[0/2](26900/4588595) loss:3.570 lr:0.0002297 epoch_Time:28559.0min: [2024-01-02 20:28:12,683][model8_pretrain.py][INFO] Epoch:[0/2](26900/4588595) loss:3.726 lr:0.0002297 epoch_Time:28559.0min: [2024-01-02 20:28:12,683][model8_pretrain.py][INFO] Epoch:[0/2](26900/4588595) loss:3.822 lr:0.0002297 epoch_Time:28559.0min: [2024-01-02 20:28:53,234][model8_pretrain.py][INFO] Epoch:[0/2](27000/4588595) loss:3.247 lr:0.0002292 epoch_Time:28567.0min: [2024-01-02 20:28:53,235][model8_pretrain.py][INFO] Epoch:[0/2](27000/4588595) loss:3.393 lr:0.0002292 epoch_Time:28567.0min: [2024-01-02 20:28:53,235][model8_pretrain.py][INFO] Epoch:[0/2](27000/4588595) 
loss:3.777 lr:0.0002292 epoch_Time:28567.0min: [2024-01-02 20:28:53,235][model8_pretrain.py][INFO] Epoch:[0/2](27000/4588595) loss:3.487 lr:0.0002292 epoch_Time:28567.0min: [2024-01-02 20:28:53,235][model8_pretrain.py][INFO] Epoch:[0/2](27000/4588595) loss:3.595 lr:0.0002292 epoch_Time:28567.0min: [2024-01-02 20:28:53,235][model8_pretrain.py][INFO] Epoch:[0/2](27000/4588595) loss:3.993 lr:0.0002292 epoch_Time:28567.0min: [2024-01-02 20:28:53,235][model8_pretrain.py][INFO] Epoch:[0/2](27000/4588595) loss:3.058 lr:0.0002292 epoch_Time:28567.0min: [2024-01-02 20:28:53,235][model8_pretrain.py][INFO] Epoch:[0/2](27000/4588595) loss:3.266 lr:0.0002292 epoch_Time:28567.0min: [2024-01-02 20:29:30,177][model8_pretrain.py][INFO] Epoch:[0/2](27100/4588595) loss:3.480 lr:0.0002287 epoch_Time:28565.0min: [2024-01-02 20:29:30,177][model8_pretrain.py][INFO] Epoch:[0/2](27100/4588595) loss:3.507 lr:0.0002287 epoch_Time:28565.0min: [2024-01-02 20:29:30,177][model8_pretrain.py][INFO] Epoch:[0/2](27100/4588595) loss:3.979 lr:0.0002287 epoch_Time:28565.0min: [2024-01-02 20:29:30,177][model8_pretrain.py][INFO] Epoch:[0/2](27100/4588595) loss:3.484 lr:0.0002287 epoch_Time:28565.0min: [2024-01-02 20:29:30,177][model8_pretrain.py][INFO] Epoch:[0/2](27100/4588595) loss:3.450 lr:0.0002287 epoch_Time:28565.0min: [2024-01-02 20:29:30,177][model8_pretrain.py][INFO] Epoch:[0/2](27100/4588595) loss:3.503 lr:0.0002287 epoch_Time:28565.0min: [2024-01-02 20:29:30,177][model8_pretrain.py][INFO] Epoch:[0/2](27100/4588595) loss:3.140 lr:0.0002287 epoch_Time:28565.0min: [2024-01-02 20:29:30,177][model8_pretrain.py][INFO] Epoch:[0/2](27100/4588595) loss:3.680 lr:0.0002287 epoch_Time:28565.0min: [2024-01-02 20:30:07,095][model8_pretrain.py][INFO] Epoch:[0/2](27200/4588595) loss:3.589 lr:0.0002282 epoch_Time:28562.0min: [2024-01-02 20:30:07,095][model8_pretrain.py][INFO] Epoch:[0/2](27200/4588595) loss:3.616 lr:0.0002282 epoch_Time:28562.0min: [2024-01-02 20:30:07,095][model8_pretrain.py][INFO] 
Epoch:[0/2](27200/4588595) loss:3.757 lr:0.0002282 epoch_Time:28562.0min: [2024-01-02 20:30:07,095][model8_pretrain.py][INFO] Epoch:[0/2](27200/4588595) loss:3.464 lr:0.0002282 epoch_Time:28562.0min: [2024-01-02 20:30:07,095][model8_pretrain.py][INFO] Epoch:[0/2](27200/4588595) loss:3.376 lr:0.0002282 epoch_Time:28562.0min: [2024-01-02 20:30:07,096][model8_pretrain.py][INFO] Epoch:[0/2](27200/4588595) loss:3.757 lr:0.0002282 epoch_Time:28562.0min: [2024-01-02 20:30:07,096][model8_pretrain.py][INFO] Epoch:[0/2](27200/4588595) loss:3.693 lr:0.0002282 epoch_Time:28562.0min: [2024-01-02 20:30:07,096][model8_pretrain.py][INFO] Epoch:[0/2](27200/4588595) loss:3.521 lr:0.0002282 epoch_Time:28562.0min: [2024-01-02 20:30:44,027][model8_pretrain.py][INFO] Epoch:[0/2](27300/4588595) loss:3.213 lr:0.0002277 epoch_Time:28560.0min: [2024-01-02 20:30:44,027][model8_pretrain.py][INFO] Epoch:[0/2](27300/4588595) loss:3.097 lr:0.0002277 epoch_Time:28560.0min: [2024-01-02 20:30:44,027][model8_pretrain.py][INFO] Epoch:[0/2](27300/4588595) loss:2.927 lr:0.0002277 epoch_Time:28560.0min: [2024-01-02 20:30:44,027][model8_pretrain.py][INFO] Epoch:[0/2](27300/4588595) loss:3.156 lr:0.0002277 epoch_Time:28560.0min: [2024-01-02 20:30:44,027][model8_pretrain.py][INFO] Epoch:[0/2](27300/4588595) loss:3.504 lr:0.0002277 epoch_Time:28560.0min: [2024-01-02 20:30:44,027][model8_pretrain.py][INFO] Epoch:[0/2](27300/4588595) loss:3.233 lr:0.0002277 epoch_Time:28560.0min: [2024-01-02 20:30:44,027][model8_pretrain.py][INFO] Epoch:[0/2](27300/4588595) loss:2.984 lr:0.0002277 epoch_Time:28560.0min: [2024-01-02 20:30:44,027][model8_pretrain.py][INFO] Epoch:[0/2](27300/4588595) loss:3.622 lr:0.0002277 epoch_Time:28560.0min: [2024-01-02 20:31:20,977][model8_pretrain.py][INFO] Epoch:[0/2](27400/4588595) loss:3.440 lr:0.0002272 epoch_Time:28558.0min: [2024-01-02 20:31:20,977][model8_pretrain.py][INFO] Epoch:[0/2](27400/4588595) loss:3.445 lr:0.0002272 epoch_Time:28557.0min: [2024-01-02 
20:31:20,977][model8_pretrain.py][INFO] Epoch:[0/2](27400/4588595) loss:3.263 lr:0.0002272 epoch_Time:28558.0min: [2024-01-02 20:31:20,977][model8_pretrain.py][INFO] Epoch:[0/2](27400/4588595) loss:3.344 lr:0.0002272 epoch_Time:28557.0min: [2024-01-02 20:31:20,977][model8_pretrain.py][INFO] Epoch:[0/2](27400/4588595) loss:3.364 lr:0.0002272 epoch_Time:28557.0min: [2024-01-02 20:31:20,977][model8_pretrain.py][INFO] Epoch:[0/2](27400/4588595) loss:3.858 lr:0.0002272 epoch_Time:28557.0min: [2024-01-02 20:31:20,977][model8_pretrain.py][INFO] Epoch:[0/2](27400/4588595) loss:3.797 lr:0.0002272 epoch_Time:28557.0min: [2024-01-02 20:31:20,977][model8_pretrain.py][INFO] Epoch:[0/2](27400/4588595) loss:2.848 lr:0.0002272 epoch_Time:28557.0min: [2024-01-02 20:31:57,905][model8_pretrain.py][INFO] Epoch:[0/2](27500/4588595) loss:3.248 lr:0.0002267 epoch_Time:28555.0min: [2024-01-02 20:31:57,905][model8_pretrain.py][INFO] Epoch:[0/2](27500/4588595) loss:3.430 lr:0.0002267 epoch_Time:28555.0min: [2024-01-02 20:31:57,905][model8_pretrain.py][INFO] Epoch:[0/2](27500/4588595) loss:3.877 lr:0.0002267 epoch_Time:28555.0min: [2024-01-02 20:31:57,905][model8_pretrain.py][INFO] Epoch:[0/2](27500/4588595) loss:2.814 lr:0.0002267 epoch_Time:28555.0min: [2024-01-02 20:31:57,905][model8_pretrain.py][INFO] Epoch:[0/2](27500/4588595) loss:3.276 lr:0.0002267 epoch_Time:28555.0min: [2024-01-02 20:31:57,905][model8_pretrain.py][INFO] Epoch:[0/2](27500/4588595) loss:3.569 lr:0.0002267 epoch_Time:28555.0min: [2024-01-02 20:31:57,905][model8_pretrain.py][INFO] Epoch:[0/2](27500/4588595) loss:3.561 lr:0.0002267 epoch_Time:28555.0min: [2024-01-02 20:31:57,905][model8_pretrain.py][INFO] Epoch:[0/2](27500/4588595) loss:3.678 lr:0.0002267 epoch_Time:28555.0min: [2024-01-02 20:32:34,829][model8_pretrain.py][INFO] Epoch:[0/2](27600/4588595) loss:3.448 lr:0.0002262 epoch_Time:28553.0min: [2024-01-02 20:32:34,829][model8_pretrain.py][INFO] Epoch:[0/2](27600/4588595) loss:3.472 lr:0.0002262 
epoch_Time:28553.0min: [2024-01-02 20:32:34,829][model8_pretrain.py][INFO] Epoch:[0/2](27600/4588595) loss:3.288 lr:0.0002262 epoch_Time:28553.0min: [2024-01-02 20:32:34,829][model8_pretrain.py][INFO] Epoch:[0/2](27600/4588595) loss:3.270 lr:0.0002262 epoch_Time:28553.0min: [2024-01-02 20:32:34,829][model8_pretrain.py][INFO] Epoch:[0/2](27600/4588595) loss:3.866 lr:0.0002262 epoch_Time:28553.0min: [2024-01-02 20:32:34,829][model8_pretrain.py][INFO] Epoch:[0/2](27600/4588595) loss:2.997 lr:0.0002262 epoch_Time:28553.0min: [2024-01-02 20:32:34,830][model8_pretrain.py][INFO] Epoch:[0/2](27600/4588595) loss:2.857 lr:0.0002262 epoch_Time:28553.0min: [2024-01-02 20:32:34,830][model8_pretrain.py][INFO] Epoch:[0/2](27600/4588595) loss:3.696 lr:0.0002262 epoch_Time:28553.0min: [2024-01-02 20:33:13,428][model8_pretrain.py][INFO] Epoch:[0/2](27700/4588595) loss:3.016 lr:0.0002257 epoch_Time:28555.0min: [2024-01-02 20:33:13,428][model8_pretrain.py][INFO] Epoch:[0/2](27700/4588595) loss:3.713 lr:0.0002257 epoch_Time:28555.0min: [2024-01-02 20:33:13,428][model8_pretrain.py][INFO] Epoch:[0/2](27700/4588595) loss:2.923 lr:0.0002257 epoch_Time:28555.0min: [2024-01-02 20:33:13,428][model8_pretrain.py][INFO] Epoch:[0/2](27700/4588595) loss:3.085 lr:0.0002257 epoch_Time:28555.0min: [2024-01-02 20:33:13,428][model8_pretrain.py][INFO] Epoch:[0/2](27700/4588595) loss:3.571 lr:0.0002257 epoch_Time:28555.0min: [2024-01-02 20:33:13,428][model8_pretrain.py][INFO] Epoch:[0/2](27700/4588595) loss:2.922 lr:0.0002257 epoch_Time:28555.0min: [2024-01-02 20:33:13,428][model8_pretrain.py][INFO] Epoch:[0/2](27700/4588595) loss:3.414 lr:0.0002257 epoch_Time:28555.0min: [2024-01-02 20:33:13,428][model8_pretrain.py][INFO] Epoch:[0/2](27700/4588595) loss:3.952 lr:0.0002257 epoch_Time:28555.0min: [2024-01-02 20:33:55,635][model8_pretrain.py][INFO] Epoch:[0/2](27800/4588595) loss:3.443 lr:0.0002252 epoch_Time:28567.0min: [2024-01-02 20:33:55,635][model8_pretrain.py][INFO] Epoch:[0/2](27800/4588595) 
loss:3.996 lr:0.0002252 epoch_Time:28567.0min: [2024-01-02 20:33:55,635][model8_pretrain.py][INFO] Epoch:[0/2](27800/4588595) loss:3.156 lr:0.0002252 epoch_Time:28567.0min: [2024-01-02 20:33:55,635][model8_pretrain.py][INFO] Epoch:[0/2](27800/4588595) loss:3.380 lr:0.0002252 epoch_Time:28567.0min: [2024-01-02 20:33:55,636][model8_pretrain.py][INFO] Epoch:[0/2](27800/4588595) loss:3.474 lr:0.0002252 epoch_Time:28567.0min: [2024-01-02 20:33:55,636][model8_pretrain.py][INFO] Epoch:[0/2](27800/4588595) loss:3.308 lr:0.0002252 epoch_Time:28567.0min: [2024-01-02 20:33:55,636][model8_pretrain.py][INFO] Epoch:[0/2](27800/4588595) loss:3.863 lr:0.0002252 epoch_Time:28567.0min: [2024-01-02 20:33:55,636][model8_pretrain.py][INFO] Epoch:[0/2](27800/4588595) loss:3.541 lr:0.0002252 epoch_Time:28567.0min: [2024-01-02 20:34:32,571][model8_pretrain.py][INFO] Epoch:[0/2](27900/4588595) loss:3.214 lr:0.0002247 epoch_Time:28565.0min: [2024-01-02 20:34:32,571][model8_pretrain.py][INFO] Epoch:[0/2](27900/4588595) loss:3.339 lr:0.0002247 epoch_Time:28565.0min: [2024-01-02 20:34:32,571][model8_pretrain.py][INFO] Epoch:[0/2](27900/4588595) loss:2.793 lr:0.0002247 epoch_Time:28565.0min: [2024-01-02 20:34:32,571][model8_pretrain.py][INFO] Epoch:[0/2](27900/4588595) loss:3.784 lr:0.0002247 epoch_Time:28565.0min: [2024-01-02 20:34:32,571][model8_pretrain.py][INFO] Epoch:[0/2](27900/4588595) loss:3.350 lr:0.0002247 epoch_Time:28565.0min: [2024-01-02 20:34:32,571][model8_pretrain.py][INFO] Epoch:[0/2](27900/4588595) loss:3.371 lr:0.0002247 epoch_Time:28565.0min: [2024-01-02 20:34:32,571][model8_pretrain.py][INFO] Epoch:[0/2](27900/4588595) loss:2.552 lr:0.0002247 epoch_Time:28565.0min: [2024-01-02 20:34:32,571][model8_pretrain.py][INFO] Epoch:[0/2](27900/4588595) loss:3.725 lr:0.0002247 epoch_Time:28565.0min: [2024-01-02 20:35:09,497][model8_pretrain.py][INFO] Epoch:[0/2](28000/4588595) loss:3.569 lr:0.0002241 epoch_Time:28562.0min: [2024-01-02 20:35:09,497][model8_pretrain.py][INFO] 
Epoch:[0/2](28000/4588595) loss:3.542 lr:0.0002241 epoch_Time:28562.0min: [2024-01-02 20:35:09,497][model8_pretrain.py][INFO] Epoch:[0/2](28000/4588595) loss:3.650 lr:0.0002241 epoch_Time:28562.0min: [2024-01-02 20:35:09,497][model8_pretrain.py][INFO] Epoch:[0/2](28000/4588595) loss:3.664 lr:0.0002241 epoch_Time:28562.0min: [2024-01-02 20:35:09,497][model8_pretrain.py][INFO] Epoch:[0/2](28000/4588595) loss:3.375 lr:0.0002241 epoch_Time:28562.0min: [2024-01-02 20:35:09,497][model8_pretrain.py][INFO] Epoch:[0/2](28000/4588595) loss:3.672 lr:0.0002241 epoch_Time:28562.0min: [2024-01-02 20:35:09,497][model8_pretrain.py][INFO] Epoch:[0/2](28000/4588595) loss:3.695 lr:0.0002241 epoch_Time:28562.0min: [2024-01-02 20:35:09,497][model8_pretrain.py][INFO] Epoch:[0/2](28000/4588595) loss:3.594 lr:0.0002241 epoch_Time:28562.0min: [2024-01-02 20:35:46,429][model8_pretrain.py][INFO] Epoch:[0/2](28100/4588595) loss:3.520 lr:0.0002236 epoch_Time:28560.0min: [2024-01-02 20:35:46,429][model8_pretrain.py][INFO] Epoch:[0/2](28100/4588595) loss:3.413 lr:0.0002236 epoch_Time:28560.0min: [2024-01-02 20:35:46,429][model8_pretrain.py][INFO] Epoch:[0/2](28100/4588595) loss:3.444 lr:0.0002236 epoch_Time:28560.0min: [2024-01-02 20:35:46,429][model8_pretrain.py][INFO] Epoch:[0/2](28100/4588595) loss:3.374 lr:0.0002236 epoch_Time:28560.0min: [2024-01-02 20:35:46,429][model8_pretrain.py][INFO] Epoch:[0/2](28100/4588595) loss:3.191 lr:0.0002236 epoch_Time:28560.0min: [2024-01-02 20:35:46,429][model8_pretrain.py][INFO] Epoch:[0/2](28100/4588595) loss:3.903 lr:0.0002236 epoch_Time:28560.0min: [2024-01-02 20:35:46,429][model8_pretrain.py][INFO] Epoch:[0/2](28100/4588595) loss:3.034 lr:0.0002236 epoch_Time:28560.0min: [2024-01-02 20:35:46,430][model8_pretrain.py][INFO] Epoch:[0/2](28100/4588595) loss:3.344 lr:0.0002236 epoch_Time:28560.0min: [2024-01-02 20:36:23,359][model8_pretrain.py][INFO] Epoch:[0/2](28200/4588595) loss:3.553 lr:0.0002231 epoch_Time:28557.0min: [2024-01-02 
20:36:23,359][model8_pretrain.py][INFO] Epoch:[0/2](28200/4588595) loss:3.456 lr:0.0002231 epoch_Time:28557.0min: [2024-01-02 20:36:23,359][model8_pretrain.py][INFO] Epoch:[0/2](28200/4588595) loss:3.578 lr:0.0002231 epoch_Time:28558.0min: [2024-01-02 20:36:23,359][model8_pretrain.py][INFO] Epoch:[0/2](28200/4588595) loss:3.361 lr:0.0002231 epoch_Time:28557.0min: [2024-01-02 20:36:23,359][model8_pretrain.py][INFO] Epoch:[0/2](28200/4588595) loss:3.380 lr:0.0002231 epoch_Time:28557.0min: [2024-01-02 20:36:23,359][model8_pretrain.py][INFO] Epoch:[0/2](28200/4588595) loss:3.512 lr:0.0002231 epoch_Time:28557.0min: [2024-01-02 20:36:23,359][model8_pretrain.py][INFO] Epoch:[0/2](28200/4588595) loss:3.000 lr:0.0002231 epoch_Time:28557.0min: [2024-01-02 20:36:23,359][model8_pretrain.py][INFO] Epoch:[0/2](28200/4588595) loss:3.273 lr:0.0002231 epoch_Time:28558.0min: [2024-01-02 20:37:00,285][model8_pretrain.py][INFO] Epoch:[0/2](28300/4588595) loss:3.560 lr:0.0002226 epoch_Time:28555.0min: [2024-01-02 20:37:00,285][model8_pretrain.py][INFO] Epoch:[0/2](28300/4588595) loss:3.203 lr:0.0002226 epoch_Time:28555.0min: [2024-01-02 20:37:00,285][model8_pretrain.py][INFO] Epoch:[0/2](28300/4588595) loss:3.055 lr:0.0002226 epoch_Time:28555.0min: [2024-01-02 20:37:00,285][model8_pretrain.py][INFO] Epoch:[0/2](28300/4588595) loss:3.278 lr:0.0002226 epoch_Time:28555.0min: [2024-01-02 20:37:00,285][model8_pretrain.py][INFO] Epoch:[0/2](28300/4588595) loss:3.044 lr:0.0002226 epoch_Time:28555.0min: [2024-01-02 20:37:00,285][model8_pretrain.py][INFO] Epoch:[0/2](28300/4588595) loss:3.418 lr:0.0002226 epoch_Time:28555.0min: [2024-01-02 20:37:00,285][model8_pretrain.py][INFO] Epoch:[0/2](28300/4588595) loss:3.505 lr:0.0002226 epoch_Time:28555.0min: [2024-01-02 20:37:00,285][model8_pretrain.py][INFO] Epoch:[0/2](28300/4588595) loss:3.395 lr:0.0002226 epoch_Time:28555.0min: [2024-01-02 20:37:37,216][model8_pretrain.py][INFO] Epoch:[0/2](28400/4588595) loss:3.458 lr:0.0002221 
epoch_Time:28553.0min: [2024-01-02 20:37:37,216][model8_pretrain.py][INFO] Epoch:[0/2](28400/4588595) loss:2.961 lr:0.0002221 epoch_Time:28553.0min: [2024-01-02 20:37:37,216][model8_pretrain.py][INFO] Epoch:[0/2](28400/4588595) loss:3.035 lr:0.0002221 epoch_Time:28553.0min: [2024-01-02 20:37:37,216][model8_pretrain.py][INFO] Epoch:[0/2](28400/4588595) loss:3.370 lr:0.0002221 epoch_Time:28553.0min: [2024-01-02 20:37:37,216][model8_pretrain.py][INFO] Epoch:[0/2](28400/4588595) loss:3.131 lr:0.0002221 epoch_Time:28553.0min: [2024-01-02 20:37:37,216][model8_pretrain.py][INFO] Epoch:[0/2](28400/4588595) loss:3.426 lr:0.0002221 epoch_Time:28553.0min: [2024-01-02 20:37:37,216][model8_pretrain.py][INFO] Epoch:[0/2](28400/4588595) loss:3.440 lr:0.0002221 epoch_Time:28553.0min: [2024-01-02 20:37:37,216][model8_pretrain.py][INFO] Epoch:[0/2](28400/4588595) loss:2.744 lr:0.0002221 epoch_Time:28553.0min: [2024-01-02 20:38:15,913][model8_pretrain.py][INFO] Epoch:[0/2](28500/4588595) loss:3.071 lr:0.0002216 epoch_Time:28555.0min: [2024-01-02 20:38:15,913][model8_pretrain.py][INFO] Epoch:[0/2](28500/4588595) loss:3.763 lr:0.0002216 epoch_Time:28555.0min: [2024-01-02 20:38:15,913][model8_pretrain.py][INFO] Epoch:[0/2](28500/4588595) loss:3.536 lr:0.0002216 epoch_Time:28555.0min: [2024-01-02 20:38:15,913][model8_pretrain.py][INFO] Epoch:[0/2](28500/4588595) loss:3.398 lr:0.0002216 epoch_Time:28555.0min: [2024-01-02 20:38:15,913][model8_pretrain.py][INFO] Epoch:[0/2](28500/4588595) loss:3.480 lr:0.0002216 epoch_Time:28555.0min: [2024-01-02 20:38:15,913][model8_pretrain.py][INFO] Epoch:[0/2](28500/4588595) loss:3.669 lr:0.0002216 epoch_Time:28555.0min: [2024-01-02 20:38:15,913][model8_pretrain.py][INFO] Epoch:[0/2](28500/4588595) loss:3.275 lr:0.0002216 epoch_Time:28555.0min: [2024-01-02 20:38:15,913][model8_pretrain.py][INFO] Epoch:[0/2](28500/4588595) loss:3.785 lr:0.0002216 epoch_Time:28555.0min: [2024-01-02 20:38:58,172][model8_pretrain.py][INFO] Epoch:[0/2](28600/4588595) 
loss:3.484 lr:0.0002211 epoch_Time:28567.0min: [2024-01-02 20:38:58,172][model8_pretrain.py][INFO] Epoch:[0/2](28600/4588595) loss:3.673 lr:0.0002211 epoch_Time:28567.0min: [2024-01-02 20:38:58,172][model8_pretrain.py][INFO] Epoch:[0/2](28600/4588595) loss:3.508 lr:0.0002211 epoch_Time:28567.0min: [2024-01-02 20:38:58,172][model8_pretrain.py][INFO] Epoch:[0/2](28600/4588595) loss:3.595 lr:0.0002211 epoch_Time:28567.0min: [2024-01-02 20:38:58,172][model8_pretrain.py][INFO] Epoch:[0/2](28600/4588595) loss:3.147 lr:0.0002211 epoch_Time:28567.0min: [2024-01-02 20:38:58,172][model8_pretrain.py][INFO] Epoch:[0/2](28600/4588595) loss:3.083 lr:0.0002211 epoch_Time:28567.0min: [2024-01-02 20:38:58,172][model8_pretrain.py][INFO] Epoch:[0/2](28600/4588595) loss:3.449 lr:0.0002211 epoch_Time:28567.0min: [2024-01-02 20:38:58,172][model8_pretrain.py][INFO] Epoch:[0/2](28600/4588595) loss:3.650 lr:0.0002211 epoch_Time:28567.0min: [2024-01-02 20:39:35,122][model8_pretrain.py][INFO] Epoch:[0/2](28700/4588595) loss:3.689 lr:0.0002206 epoch_Time:28565.0min: [2024-01-02 20:39:35,122][model8_pretrain.py][INFO] Epoch:[0/2](28700/4588595) loss:3.597 lr:0.0002206 epoch_Time:28565.0min: [2024-01-02 20:39:35,122][model8_pretrain.py][INFO] Epoch:[0/2](28700/4588595) loss:3.387 lr:0.0002206 epoch_Time:28565.0min: [2024-01-02 20:39:35,122][model8_pretrain.py][INFO] Epoch:[0/2](28700/4588595) loss:3.563 lr:0.0002206 epoch_Time:28565.0min: [2024-01-02 20:39:35,122][model8_pretrain.py][INFO] Epoch:[0/2](28700/4588595) loss:3.365 lr:0.0002206 epoch_Time:28565.0min: [2024-01-02 20:39:35,122][model8_pretrain.py][INFO] Epoch:[0/2](28700/4588595) loss:3.402 lr:0.0002206 epoch_Time:28565.0min: [2024-01-02 20:39:35,122][model8_pretrain.py][INFO] Epoch:[0/2](28700/4588595) loss:3.166 lr:0.0002206 epoch_Time:28565.0min: [2024-01-02 20:39:35,122][model8_pretrain.py][INFO] Epoch:[0/2](28700/4588595) loss:3.449 lr:0.0002206 epoch_Time:28565.0min: [2024-01-02 20:40:12,061][model8_pretrain.py][INFO] 
Epoch:[0/2](28800/4588595) loss:2.779 lr:0.0002201 epoch_Time:28562.0min: [2024-01-02 20:40:12,061][model8_pretrain.py][INFO] Epoch:[0/2](28800/4588595) loss:3.308 lr:0.0002201 epoch_Time:28562.0min: [2024-01-02 20:40:12,061][model8_pretrain.py][INFO] Epoch:[0/2](28800/4588595) loss:3.714 lr:0.0002201 epoch_Time:28562.0min: [2024-01-02 20:40:12,061][model8_pretrain.py][INFO] Epoch:[0/2](28800/4588595) loss:3.533 lr:0.0002201 epoch_Time:28562.0min: [2024-01-02 20:40:12,061][model8_pretrain.py][INFO] Epoch:[0/2](28800/4588595) loss:3.317 lr:0.0002201 epoch_Time:28562.0min: [2024-01-02 20:40:12,061][model8_pretrain.py][INFO] Epoch:[0/2](28800/4588595) loss:2.745 lr:0.0002201 epoch_Time:28562.0min: [2024-01-02 20:40:12,061][model8_pretrain.py][INFO] Epoch:[0/2](28800/4588595) loss:3.431 lr:0.0002201 epoch_Time:28562.0min: [2024-01-02 20:40:12,061][model8_pretrain.py][INFO] Epoch:[0/2](28800/4588595) loss:3.430 lr:0.0002201 epoch_Time:28562.0min: [2024-01-02 20:40:49,004][model8_pretrain.py][INFO] Epoch:[0/2](28900/4588595) loss:3.682 lr:0.0002195 epoch_Time:28559.0min: [2024-01-02 20:40:49,004][model8_pretrain.py][INFO] Epoch:[0/2](28900/4588595) loss:3.021 lr:0.0002195 epoch_Time:28559.0min: [2024-01-02 20:40:49,004][model8_pretrain.py][INFO] Epoch:[0/2](28900/4588595) loss:3.373 lr:0.0002195 epoch_Time:28559.0min: [2024-01-02 20:40:49,004][model8_pretrain.py][INFO] Epoch:[0/2](28900/4588595) loss:3.402 lr:0.0002195 epoch_Time:28560.0min: [2024-01-02 20:40:49,004][model8_pretrain.py][INFO] Epoch:[0/2](28900/4588595) loss:2.831 lr:0.0002195 epoch_Time:28560.0min: [2024-01-02 20:40:49,004][model8_pretrain.py][INFO] Epoch:[0/2](28900/4588595) loss:3.083 lr:0.0002195 epoch_Time:28559.0min: [2024-01-02 20:40:49,004][model8_pretrain.py][INFO] Epoch:[0/2](28900/4588595) loss:3.416 lr:0.0002195 epoch_Time:28559.0min: [2024-01-02 20:40:49,004][model8_pretrain.py][INFO] Epoch:[0/2](28900/4588595) loss:3.526 lr:0.0002195 epoch_Time:28559.0min: [2024-01-02 
20:41:25,929][model8_pretrain.py][INFO] Epoch:[0/2](29000/4588595) loss:3.762 lr:0.0002190 epoch_Time:28558.0min: [2024-01-02 20:41:25,929][model8_pretrain.py][INFO] Epoch:[0/2](29000/4588595) loss:2.782 lr:0.0002190 epoch_Time:28558.0min: [2024-01-02 20:41:25,929][model8_pretrain.py][INFO] Epoch:[0/2](29000/4588595) loss:2.776 lr:0.0002190 epoch_Time:28558.0min: [2024-01-02 20:41:25,929][model8_pretrain.py][INFO] Epoch:[0/2](29000/4588595) loss:3.301 lr:0.0002190 epoch_Time:28558.0min: [2024-01-02 20:41:25,929][model8_pretrain.py][INFO] Epoch:[0/2](29000/4588595) loss:3.020 lr:0.0002190 epoch_Time:28558.0min: [2024-01-02 20:41:25,929][model8_pretrain.py][INFO] Epoch:[0/2](29000/4588595) loss:3.376 lr:0.0002190 epoch_Time:28558.0min: [2024-01-02 20:41:25,929][model8_pretrain.py][INFO] Epoch:[0/2](29000/4588595) loss:3.394 lr:0.0002190 epoch_Time:28558.0min: [2024-01-02 20:41:25,929][model8_pretrain.py][INFO] Epoch:[0/2](29000/4588595) loss:3.526 lr:0.0002190 epoch_Time:28558.0min: [2024-01-02 20:42:02,873][model8_pretrain.py][INFO] Epoch:[0/2](29100/4588595) loss:3.357 lr:0.0002185 epoch_Time:28555.0min: [2024-01-02 20:42:02,873][model8_pretrain.py][INFO] Epoch:[0/2](29100/4588595) loss:3.291 lr:0.0002185 epoch_Time:28555.0min: [2024-01-02 20:42:02,873][model8_pretrain.py][INFO] Epoch:[0/2](29100/4588595) loss:3.425 lr:0.0002185 epoch_Time:28555.0min: [2024-01-02 20:42:02,873][model8_pretrain.py][INFO] Epoch:[0/2](29100/4588595) loss:3.857 lr:0.0002185 epoch_Time:28555.0min: [2024-01-02 20:42:02,873][model8_pretrain.py][INFO] Epoch:[0/2](29100/4588595) loss:3.659 lr:0.0002185 epoch_Time:28555.0min: [2024-01-02 20:42:02,873][model8_pretrain.py][INFO] Epoch:[0/2](29100/4588595) loss:3.248 lr:0.0002185 epoch_Time:28555.0min: [2024-01-02 20:42:02,874][model8_pretrain.py][INFO] Epoch:[0/2](29100/4588595) loss:3.639 lr:0.0002185 epoch_Time:28555.0min: [2024-01-02 20:42:02,874][model8_pretrain.py][INFO] Epoch:[0/2](29100/4588595) loss:3.715 lr:0.0002185 
epoch_Time:28555.0min: [2024-01-02 20:42:39,827][model8_pretrain.py][INFO] Epoch:[0/2](29200/4588595) loss:4.004 lr:0.0002180 epoch_Time:28553.0min: [2024-01-02 20:42:39,827][model8_pretrain.py][INFO] Epoch:[0/2](29200/4588595) loss:3.663 lr:0.0002180 epoch_Time:28553.0min: [2024-01-02 20:42:39,827][model8_pretrain.py][INFO] Epoch:[0/2](29200/4588595) loss:3.330 lr:0.0002180 epoch_Time:28553.0min: [2024-01-02 20:42:39,827][model8_pretrain.py][INFO] Epoch:[0/2](29200/4588595) loss:3.463 lr:0.0002180 epoch_Time:28553.0min: [2024-01-02 20:42:39,828][model8_pretrain.py][INFO] Epoch:[0/2](29200/4588595) loss:2.484 lr:0.0002180 epoch_Time:28553.0min: [2024-01-02 20:42:39,828][model8_pretrain.py][INFO] Epoch:[0/2](29200/4588595) loss:3.142 lr:0.0002180 epoch_Time:28554.0min: [2024-01-02 20:42:39,828][model8_pretrain.py][INFO] Epoch:[0/2](29200/4588595) loss:3.500 lr:0.0002180 epoch_Time:28553.0min: [2024-01-02 20:42:39,828][model8_pretrain.py][INFO] Epoch:[0/2](29200/4588595) loss:3.528 lr:0.0002180 epoch_Time:28553.0min: [2024-01-02 20:43:16,764][model8_pretrain.py][INFO] Epoch:[0/2](29300/4588595) loss:3.619 lr:0.0002175 epoch_Time:28551.0min: [2024-01-02 20:43:16,764][model8_pretrain.py][INFO] Epoch:[0/2](29300/4588595) loss:3.455 lr:0.0002175 epoch_Time:28551.0min: [2024-01-02 20:43:16,764][model8_pretrain.py][INFO] Epoch:[0/2](29300/4588595) loss:3.740 lr:0.0002175 epoch_Time:28551.0min: [2024-01-02 20:43:16,764][model8_pretrain.py][INFO] Epoch:[0/2](29300/4588595) loss:3.326 lr:0.0002175 epoch_Time:28551.0min: [2024-01-02 20:43:16,764][model8_pretrain.py][INFO] Epoch:[0/2](29300/4588595) loss:3.446 lr:0.0002175 epoch_Time:28551.0min: [2024-01-02 20:43:16,764][model8_pretrain.py][INFO] Epoch:[0/2](29300/4588595) loss:3.387 lr:0.0002175 epoch_Time:28551.0min: [2024-01-02 20:43:16,764][model8_pretrain.py][INFO] Epoch:[0/2](29300/4588595) loss:3.500 lr:0.0002175 epoch_Time:28551.0min: [2024-01-02 20:43:16,764][model8_pretrain.py][INFO] Epoch:[0/2](29300/4588595) 
loss:3.537 lr:0.0002175 epoch_Time:28551.0min: [2024-01-02 20:44:00,732][model8_pretrain.py][INFO] Epoch:[0/2](29400/4588595) loss:2.827 lr:0.0002169 epoch_Time:28566.0min: [2024-01-02 20:44:00,732][model8_pretrain.py][INFO] Epoch:[0/2](29400/4588595) loss:3.792 lr:0.0002169 epoch_Time:28566.0min: [2024-01-02 20:44:00,732][model8_pretrain.py][INFO] Epoch:[0/2](29400/4588595) loss:3.537 lr:0.0002169 epoch_Time:28566.0min: [2024-01-02 20:44:00,732][model8_pretrain.py][INFO] Epoch:[0/2](29400/4588595) loss:3.085 lr:0.0002169 epoch_Time:28566.0min: [2024-01-02 20:44:00,732][model8_pretrain.py][INFO] Epoch:[0/2](29400/4588595) loss:3.548 lr:0.0002169 epoch_Time:28567.0min: [2024-01-02 20:44:00,732][model8_pretrain.py][INFO] Epoch:[0/2](29400/4588595) loss:3.416 lr:0.0002169 epoch_Time:28566.0min: [2024-01-02 20:44:00,732][model8_pretrain.py][INFO] Epoch:[0/2](29400/4588595) loss:3.819 lr:0.0002169 epoch_Time:28566.0min: [2024-01-02 20:44:00,732][model8_pretrain.py][INFO] Epoch:[0/2](29400/4588595) loss:3.545 lr:0.0002169 epoch_Time:28566.0min: [2024-01-02 20:44:37,673][model8_pretrain.py][INFO] Epoch:[0/2](29500/4588595) loss:3.019 lr:0.0002164 epoch_Time:28565.0min: [2024-01-02 20:44:37,673][model8_pretrain.py][INFO] Epoch:[0/2](29500/4588595) loss:3.604 lr:0.0002164 epoch_Time:28565.0min: [2024-01-02 20:44:37,673][model8_pretrain.py][INFO] Epoch:[0/2](29500/4588595) loss:3.558 lr:0.0002164 epoch_Time:28565.0min: [2024-01-02 20:44:37,673][model8_pretrain.py][INFO] Epoch:[0/2](29500/4588595) loss:3.141 lr:0.0002164 epoch_Time:28565.0min: [2024-01-02 20:44:37,673][model8_pretrain.py][INFO] Epoch:[0/2](29500/4588595) loss:3.645 lr:0.0002164 epoch_Time:28565.0min: [2024-01-02 20:44:37,673][model8_pretrain.py][INFO] Epoch:[0/2](29500/4588595) loss:3.664 lr:0.0002164 epoch_Time:28565.0min: [2024-01-02 20:44:37,673][model8_pretrain.py][INFO] Epoch:[0/2](29500/4588595) loss:3.261 lr:0.0002164 epoch_Time:28565.0min: [2024-01-02 20:44:37,673][model8_pretrain.py][INFO] 
Epoch:[0/2](29500/4588595) loss:3.691 lr:0.0002164 epoch_Time:28565.0min: [2024-01-02 20:45:14,602][model8_pretrain.py][INFO] Epoch:[0/2](29600/4588595) loss:3.354 lr:0.0002159 epoch_Time:28562.0min: [2024-01-02 20:45:14,602][model8_pretrain.py][INFO] Epoch:[0/2](29600/4588595) loss:3.162 lr:0.0002159 epoch_Time:28562.0min: [2024-01-02 20:45:14,602][model8_pretrain.py][INFO] Epoch:[0/2](29600/4588595) loss:3.370 lr:0.0002159 epoch_Time:28562.0min: [2024-01-02 20:45:14,602][model8_pretrain.py][INFO] Epoch:[0/2](29600/4588595) loss:2.292 lr:0.0002159 epoch_Time:28562.0min: [2024-01-02 20:45:14,602][model8_pretrain.py][INFO] Epoch:[0/2](29600/4588595) loss:2.801 lr:0.0002159 epoch_Time:28562.0min: [2024-01-02 20:45:14,602][model8_pretrain.py][INFO] Epoch:[0/2](29600/4588595) loss:3.412 lr:0.0002159 epoch_Time:28562.0min: [2024-01-02 20:45:14,602][model8_pretrain.py][INFO] Epoch:[0/2](29600/4588595) loss:3.351 lr:0.0002159 epoch_Time:28562.0min: [2024-01-02 20:45:14,602][model8_pretrain.py][INFO] Epoch:[0/2](29600/4588595) loss:3.565 lr:0.0002159 epoch_Time:28562.0min: [2024-01-02 20:45:51,531][model8_pretrain.py][INFO] Epoch:[0/2](29700/4588595) loss:3.501 lr:0.0002154 epoch_Time:28559.0min: [2024-01-02 20:45:51,531][model8_pretrain.py][INFO] Epoch:[0/2](29700/4588595) loss:3.286 lr:0.0002154 epoch_Time:28559.0min: [2024-01-02 20:45:51,531][model8_pretrain.py][INFO] Epoch:[0/2](29700/4588595) loss:3.069 lr:0.0002154 epoch_Time:28559.0min: [2024-01-02 20:45:51,531][model8_pretrain.py][INFO] Epoch:[0/2](29700/4588595) loss:3.678 lr:0.0002154 epoch_Time:28559.0min: [2024-01-02 20:45:51,531][model8_pretrain.py][INFO] Epoch:[0/2](29700/4588595) loss:3.558 lr:0.0002154 epoch_Time:28559.0min: [2024-01-02 20:45:51,531][model8_pretrain.py][INFO] Epoch:[0/2](29700/4588595) loss:2.973 lr:0.0002154 epoch_Time:28559.0min: [2024-01-02 20:45:51,531][model8_pretrain.py][INFO] Epoch:[0/2](29700/4588595) loss:3.469 lr:0.0002154 epoch_Time:28559.0min: [2024-01-02 
20:45:51,531][model8_pretrain.py][INFO] Epoch:[0/2](29700/4588595) loss:3.393 lr:0.0002154 epoch_Time:28559.0min: [2024-01-02 20:46:28,494][model8_pretrain.py][INFO] Epoch:[0/2](29800/4588595) loss:3.734 lr:0.0002149 epoch_Time:28558.0min: [2024-01-02 20:46:28,494][model8_pretrain.py][INFO] Epoch:[0/2](29800/4588595) loss:3.640 lr:0.0002149 epoch_Time:28558.0min: [2024-01-02 20:46:28,495][model8_pretrain.py][INFO] Epoch:[0/2](29800/4588595) loss:3.242 lr:0.0002149 epoch_Time:28558.0min: [2024-01-02 20:46:28,495][model8_pretrain.py][INFO] Epoch:[0/2](29800/4588595) loss:3.421 lr:0.0002149 epoch_Time:28558.0min: [2024-01-02 20:46:28,495][model8_pretrain.py][INFO] Epoch:[0/2](29800/4588595) loss:3.880 lr:0.0002149 epoch_Time:28558.0min: [2024-01-02 20:46:28,495][model8_pretrain.py][INFO] Epoch:[0/2](29800/4588595) loss:3.425 lr:0.0002149 epoch_Time:28558.0min: [2024-01-02 20:46:28,495][model8_pretrain.py][INFO] Epoch:[0/2](29800/4588595) loss:3.466 lr:0.0002149 epoch_Time:28558.0min: [2024-01-02 20:46:28,495][model8_pretrain.py][INFO] Epoch:[0/2](29800/4588595) loss:2.747 lr:0.0002149 epoch_Time:28558.0min: [2024-01-02 20:47:05,424][model8_pretrain.py][INFO] Epoch:[0/2](29900/4588595) loss:3.431 lr:0.0002143 epoch_Time:28555.0min: [2024-01-02 20:47:05,424][model8_pretrain.py][INFO] Epoch:[0/2](29900/4588595) loss:3.506 lr:0.0002143 epoch_Time:28555.0min: [2024-01-02 20:47:05,424][model8_pretrain.py][INFO] Epoch:[0/2](29900/4588595) loss:3.508 lr:0.0002143 epoch_Time:28555.0min: [2024-01-02 20:47:05,424][model8_pretrain.py][INFO] Epoch:[0/2](29900/4588595) loss:3.714 lr:0.0002143 epoch_Time:28555.0min: [2024-01-02 20:47:05,424][model8_pretrain.py][INFO] Epoch:[0/2](29900/4588595) loss:3.738 lr:0.0002143 epoch_Time:28555.0min: [2024-01-02 20:47:05,424][model8_pretrain.py][INFO] Epoch:[0/2](29900/4588595) loss:3.450 lr:0.0002143 epoch_Time:28555.0min: [2024-01-02 20:47:05,424][model8_pretrain.py][INFO] Epoch:[0/2](29900/4588595) loss:3.738 lr:0.0002143 
epoch_Time:28555.0min: [2024-01-02 20:47:05,424][model8_pretrain.py][INFO] Epoch:[0/2](29900/4588595) loss:3.202 lr:0.0002143 epoch_Time:28555.0min: [2024-01-02 20:47:42,358][model8_pretrain.py][INFO] Epoch:[0/2](30000/4588595) loss:3.509 lr:0.0002138 epoch_Time:28553.0min: [2024-01-02 20:47:42,359][model8_pretrain.py][INFO] Epoch:[0/2](30000/4588595) loss:3.216 lr:0.0002138 epoch_Time:28553.0min: [2024-01-02 20:47:42,359][model8_pretrain.py][INFO] Epoch:[0/2](30000/4588595) loss:3.173 lr:0.0002138 epoch_Time:28553.0min: [2024-01-02 20:47:42,359][model8_pretrain.py][INFO] Epoch:[0/2](30000/4588595) loss:3.761 lr:0.0002138 epoch_Time:28553.0min: [2024-01-02 20:47:42,359][model8_pretrain.py][INFO] Epoch:[0/2](30000/4588595) loss:3.272 lr:0.0002138 epoch_Time:28553.0min: [2024-01-02 20:47:42,359][model8_pretrain.py][INFO] Epoch:[0/2](30000/4588595) loss:3.516 lr:0.0002138 epoch_Time:28553.0min: [2024-01-02 20:47:42,359][model8_pretrain.py][INFO] Epoch:[0/2](30000/4588595) loss:3.634 lr:0.0002138 epoch_Time:28553.0min: [2024-01-02 20:47:42,359][model8_pretrain.py][INFO] Epoch:[0/2](30000/4588595) loss:2.937 lr:0.0002138 epoch_Time:28553.0min: [2024-01-02 20:48:19,289][model8_pretrain.py][INFO] Epoch:[0/2](30100/4588595) loss:3.007 lr:0.0002133 epoch_Time:28551.0min: [2024-01-02 20:48:19,289][model8_pretrain.py][INFO] Epoch:[0/2](30100/4588595) loss:3.109 lr:0.0002133 epoch_Time:28551.0min: [2024-01-02 20:48:19,289][model8_pretrain.py][INFO] Epoch:[0/2](30100/4588595) loss:3.933 lr:0.0002133 epoch_Time:28551.0min: [2024-01-02 20:48:19,289][model8_pretrain.py][INFO] Epoch:[0/2](30100/4588595) loss:3.513 lr:0.0002133 epoch_Time:28551.0min: [2024-01-02 20:48:19,289][model8_pretrain.py][INFO] Epoch:[0/2](30100/4588595) loss:3.145 lr:0.0002133 epoch_Time:28551.0min: [2024-01-02 20:48:19,289][model8_pretrain.py][INFO] Epoch:[0/2](30100/4588595) loss:3.392 lr:0.0002133 epoch_Time:28551.0min: [2024-01-02 20:48:19,289][model8_pretrain.py][INFO] Epoch:[0/2](30100/4588595) 
loss:3.386 lr:0.0002133 epoch_Time:28551.0min: [2024-01-02 20:48:19,289][model8_pretrain.py][INFO] Epoch:[0/2](30100/4588595) loss:3.494 lr:0.0002133 epoch_Time:28551.0min: [2024-01-02 20:49:03,275][model8_pretrain.py][INFO] Epoch:[0/2](30200/4588595) loss:3.832 lr:0.0002127 epoch_Time:28566.0min: [2024-01-02 20:49:03,275][model8_pretrain.py][INFO] Epoch:[0/2](30200/4588595) loss:2.799 lr:0.0002127 epoch_Time:28566.0min: [2024-01-02 20:49:03,275][model8_pretrain.py][INFO] Epoch:[0/2](30200/4588595) loss:3.378 lr:0.0002127 epoch_Time:28566.0min: [2024-01-02 20:49:03,275][model8_pretrain.py][INFO] Epoch:[0/2](30200/4588595) loss:3.844 lr:0.0002127 epoch_Time:28566.0min: [2024-01-02 20:49:03,275][model8_pretrain.py][INFO] Epoch:[0/2](30200/4588595) loss:3.328 lr:0.0002127 epoch_Time:28566.0min: [2024-01-02 20:49:03,275][model8_pretrain.py][INFO] Epoch:[0/2](30200/4588595) loss:3.351 lr:0.0002127 epoch_Time:28566.0min: [2024-01-02 20:49:03,275][model8_pretrain.py][INFO] Epoch:[0/2](30200/4588595) loss:3.817 lr:0.0002127 epoch_Time:28566.0min: [2024-01-02 20:49:03,275][model8_pretrain.py][INFO] Epoch:[0/2](30200/4588595) loss:3.301 lr:0.0002127 epoch_Time:28566.0min: [2024-01-02 20:49:40,226][model8_pretrain.py][INFO] Epoch:[0/2](30300/4588595) loss:3.385 lr:0.0002122 epoch_Time:28564.0min: [2024-01-02 20:49:40,226][model8_pretrain.py][INFO] Epoch:[0/2](30300/4588595) loss:3.278 lr:0.0002122 epoch_Time:28564.0min: [2024-01-02 20:49:40,226][model8_pretrain.py][INFO] Epoch:[0/2](30300/4588595) loss:3.843 lr:0.0002122 epoch_Time:28564.0min: [2024-01-02 20:49:40,226][model8_pretrain.py][INFO] Epoch:[0/2](30300/4588595) loss:2.990 lr:0.0002122 epoch_Time:28564.0min: [2024-01-02 20:49:40,226][model8_pretrain.py][INFO] Epoch:[0/2](30300/4588595) loss:3.525 lr:0.0002122 epoch_Time:28564.0min: [2024-01-02 20:49:40,226][model8_pretrain.py][INFO] Epoch:[0/2](30300/4588595) loss:3.525 lr:0.0002122 epoch_Time:28564.0min: [2024-01-02 20:49:40,226][model8_pretrain.py][INFO] 
Epoch:[0/2](30300/4588595) loss:3.729 lr:0.0002122 epoch_Time:28564.0min: [2024-01-02 20:49:40,226][model8_pretrain.py][INFO] Epoch:[0/2](30300/4588595) loss:3.672 lr:0.0002122 epoch_Time:28564.0min: [2024-01-02 20:50:17,160][model8_pretrain.py][INFO] Epoch:[0/2](30400/4588595) loss:3.665 lr:0.0002117 epoch_Time:28562.0min: [2024-01-02 20:50:17,160][model8_pretrain.py][INFO] Epoch:[0/2](30400/4588595) loss:3.779 lr:0.0002117 epoch_Time:28562.0min: [2024-01-02 20:50:17,160][model8_pretrain.py][INFO] Epoch:[0/2](30400/4588595) loss:3.611 lr:0.0002117 epoch_Time:28562.0min: [2024-01-02 20:50:17,160][model8_pretrain.py][INFO] Epoch:[0/2](30400/4588595) loss:3.700 lr:0.0002117 epoch_Time:28562.0min: [2024-01-02 20:50:17,160][model8_pretrain.py][INFO] Epoch:[0/2](30400/4588595) loss:3.453 lr:0.0002117 epoch_Time:28562.0min: [2024-01-02 20:50:17,160][model8_pretrain.py][INFO] Epoch:[0/2](30400/4588595) loss:3.128 lr:0.0002117 epoch_Time:28562.0min: [2024-01-02 20:50:17,160][model8_pretrain.py][INFO] Epoch:[0/2](30400/4588595) loss:3.409 lr:0.0002117 epoch_Time:28562.0min: [2024-01-02 20:50:17,161][model8_pretrain.py][INFO] Epoch:[0/2](30400/4588595) loss:3.735 lr:0.0002117 epoch_Time:28562.0min: [2024-01-02 20:50:54,096][model8_pretrain.py][INFO] Epoch:[0/2](30500/4588595) loss:3.531 lr:0.0002112 epoch_Time:28559.0min: [2024-01-02 20:50:54,096][model8_pretrain.py][INFO] Epoch:[0/2](30500/4588595) loss:3.072 lr:0.0002112 epoch_Time:28559.0min: [2024-01-02 20:50:54,096][model8_pretrain.py][INFO] Epoch:[0/2](30500/4588595) loss:3.077 lr:0.0002112 epoch_Time:28559.0min: [2024-01-02 20:50:54,096][model8_pretrain.py][INFO] Epoch:[0/2](30500/4588595) loss:2.862 lr:0.0002112 epoch_Time:28559.0min: [2024-01-02 20:50:54,096][model8_pretrain.py][INFO] Epoch:[0/2](30500/4588595) loss:3.513 lr:0.0002112 epoch_Time:28559.0min: [2024-01-02 20:50:54,096][model8_pretrain.py][INFO] Epoch:[0/2](30500/4588595) loss:3.006 lr:0.0002112 epoch_Time:28559.0min: [2024-01-02 
20:50:54,096][model8_pretrain.py][INFO] Epoch:[0/2](30500/4588595) loss:3.339 lr:0.0002112 epoch_Time:28559.0min: [2024-01-02 20:50:54,096][model8_pretrain.py][INFO] Epoch:[0/2](30500/4588595) loss:3.815 lr:0.0002112 epoch_Time:28559.0min: [2024-01-02 20:51:31,041][model8_pretrain.py][INFO] Epoch:[0/2](30600/4588595) loss:3.461 lr:0.0002106 epoch_Time:28557.0min: [2024-01-02 20:51:31,041][model8_pretrain.py][INFO] Epoch:[0/2](30600/4588595) loss:3.253 lr:0.0002106 epoch_Time:28557.0min: [2024-01-02 20:51:31,041][model8_pretrain.py][INFO] Epoch:[0/2](30600/4588595) loss:3.353 lr:0.0002106 epoch_Time:28557.0min: [2024-01-02 20:51:31,041][model8_pretrain.py][INFO] Epoch:[0/2](30600/4588595) loss:3.597 lr:0.0002106 epoch_Time:28557.0min: [2024-01-02 20:51:31,041][model8_pretrain.py][INFO] Epoch:[0/2](30600/4588595) loss:3.250 lr:0.0002106 epoch_Time:28557.0min: [2024-01-02 20:51:31,041][model8_pretrain.py][INFO] Epoch:[0/2](30600/4588595) loss:3.619 lr:0.0002106 epoch_Time:28557.0min: [2024-01-02 20:51:31,041][model8_pretrain.py][INFO] Epoch:[0/2](30600/4588595) loss:3.799 lr:0.0002106 epoch_Time:28557.0min: [2024-01-02 20:51:31,041][model8_pretrain.py][INFO] Epoch:[0/2](30600/4588595) loss:3.432 lr:0.0002106 epoch_Time:28557.0min: [2024-01-02 20:52:07,976][model8_pretrain.py][INFO] Epoch:[0/2](30700/4588595) loss:2.844 lr:0.0002101 epoch_Time:28555.0min: [2024-01-02 20:52:07,976][model8_pretrain.py][INFO] Epoch:[0/2](30700/4588595) loss:3.421 lr:0.0002101 epoch_Time:28555.0min: [2024-01-02 20:52:07,976][model8_pretrain.py][INFO] Epoch:[0/2](30700/4588595) loss:3.526 lr:0.0002101 epoch_Time:28555.0min: [2024-01-02 20:52:07,976][model8_pretrain.py][INFO] Epoch:[0/2](30700/4588595) loss:3.520 lr:0.0002101 epoch_Time:28555.0min: [2024-01-02 20:52:07,976][model8_pretrain.py][INFO] Epoch:[0/2](30700/4588595) loss:3.509 lr:0.0002101 epoch_Time:28555.0min: [2024-01-02 20:52:07,976][model8_pretrain.py][INFO] Epoch:[0/2](30700/4588595) loss:3.399 lr:0.0002101 
epoch_Time:28555.0min: [2024-01-02 20:52:07,976][model8_pretrain.py][INFO] Epoch:[0/2](30700/4588595) loss:3.456 lr:0.0002101 epoch_Time:28555.0min: [2024-01-02 20:52:07,977][model8_pretrain.py][INFO] Epoch:[0/2](30700/4588595) loss:3.430 lr:0.0002101 epoch_Time:28555.0min: [2024-01-02 20:52:44,903][model8_pretrain.py][INFO] Epoch:[0/2](30800/4588595) loss:3.435 lr:0.0002096 epoch_Time:28553.0min: [2024-01-02 20:52:44,903][model8_pretrain.py][INFO] Epoch:[0/2](30800/4588595) loss:3.444 lr:0.0002096 epoch_Time:28553.0min: [2024-01-02 20:52:44,903][model8_pretrain.py][INFO] Epoch:[0/2](30800/4588595) loss:3.518 lr:0.0002096 epoch_Time:28553.0min: [2024-01-02 20:52:44,903][model8_pretrain.py][INFO] Epoch:[0/2](30800/4588595) loss:3.068 lr:0.0002096 epoch_Time:28553.0min: [2024-01-02 20:52:44,903][model8_pretrain.py][INFO] Epoch:[0/2](30800/4588595) loss:3.307 lr:0.0002096 epoch_Time:28553.0min: [2024-01-02 20:52:44,903][model8_pretrain.py][INFO] Epoch:[0/2](30800/4588595) loss:3.159 lr:0.0002096 epoch_Time:28553.0min: [2024-01-02 20:52:44,903][model8_pretrain.py][INFO] Epoch:[0/2](30800/4588595) loss:3.201 lr:0.0002096 epoch_Time:28553.0min: [2024-01-02 20:52:44,903][model8_pretrain.py][INFO] Epoch:[0/2](30800/4588595) loss:3.718 lr:0.0002096 epoch_Time:28553.0min: [2024-01-02 20:53:21,851][model8_pretrain.py][INFO] Epoch:[0/2](30900/4588595) loss:3.216 lr:0.0002090 epoch_Time:28550.0min: [2024-01-02 20:53:21,851][model8_pretrain.py][INFO] Epoch:[0/2](30900/4588595) loss:3.793 lr:0.0002090 epoch_Time:28551.0min: [2024-01-02 20:53:21,852][model8_pretrain.py][INFO] Epoch:[0/2](30900/4588595) loss:3.051 lr:0.0002090 epoch_Time:28550.0min: [2024-01-02 20:53:21,852][model8_pretrain.py][INFO] Epoch:[0/2](30900/4588595) loss:2.988 lr:0.0002090 epoch_Time:28551.0min: [2024-01-02 20:53:21,852][model8_pretrain.py][INFO] Epoch:[0/2](30900/4588595) loss:3.328 lr:0.0002090 epoch_Time:28550.0min: [2024-01-02 20:53:21,852][model8_pretrain.py][INFO] Epoch:[0/2](30900/4588595) 
loss:3.320 lr:0.0002090 epoch_Time:28550.0min: [2024-01-02 20:53:21,852][model8_pretrain.py][INFO] Epoch:[0/2](30900/4588595) loss:3.738 lr:0.0002090 epoch_Time:28551.0min: [2024-01-02 20:53:21,852][model8_pretrain.py][INFO] Epoch:[0/2](30900/4588595) loss:3.973 lr:0.0002090 epoch_Time:28550.0min: [2024-01-02 20:54:05,946][model8_pretrain.py][INFO] Epoch:[0/2](31000/4588595) loss:3.570 lr:0.0002085 epoch_Time:28566.0min: [2024-01-02 20:54:05,946][model8_pretrain.py][INFO] Epoch:[0/2](31000/4588595) loss:3.273 lr:0.0002085 epoch_Time:28566.0min: [2024-01-02 20:54:05,946][model8_pretrain.py][INFO] Epoch:[0/2](31000/4588595) loss:3.274 lr:0.0002085 epoch_Time:28566.0min: [2024-01-02 20:54:05,946][model8_pretrain.py][INFO] Epoch:[0/2](31000/4588595) loss:3.660 lr:0.0002085 epoch_Time:28566.0min: [2024-01-02 20:54:05,946][model8_pretrain.py][INFO] Epoch:[0/2](31000/4588595) loss:3.054 lr:0.0002085 epoch_Time:28566.0min: [2024-01-02 20:54:05,946][model8_pretrain.py][INFO] Epoch:[0/2](31000/4588595) loss:3.777 lr:0.0002085 epoch_Time:28566.0min: [2024-01-02 20:54:05,946][model8_pretrain.py][INFO] Epoch:[0/2](31000/4588595) loss:3.298 lr:0.0002085 epoch_Time:28566.0min: [2024-01-02 20:54:05,946][model8_pretrain.py][INFO] Epoch:[0/2](31000/4588595) loss:3.135 lr:0.0002085 epoch_Time:28566.0min: [2024-01-02 20:54:42,887][model8_pretrain.py][INFO] Epoch:[0/2](31100/4588595) loss:3.419 lr:0.0002079 epoch_Time:28564.0min: [2024-01-02 20:54:42,887][model8_pretrain.py][INFO] Epoch:[0/2](31100/4588595) loss:2.688 lr:0.0002079 epoch_Time:28564.0min: [2024-01-02 20:54:42,887][model8_pretrain.py][INFO] Epoch:[0/2](31100/4588595) loss:3.332 lr:0.0002079 epoch_Time:28564.0min: [2024-01-02 20:54:42,887][model8_pretrain.py][INFO] Epoch:[0/2](31100/4588595) loss:2.814 lr:0.0002079 epoch_Time:28564.0min: [2024-01-02 20:54:42,887][model8_pretrain.py][INFO] Epoch:[0/2](31100/4588595) loss:3.611 lr:0.0002079 epoch_Time:28564.0min: [2024-01-02 20:54:42,887][model8_pretrain.py][INFO] 
Epoch:[0/2](31100/4588595) loss:3.515 lr:0.0002079 epoch_Time:28564.0min: [2024-01-02 20:54:42,887][model8_pretrain.py][INFO] Epoch:[0/2](31100/4588595) loss:3.520 lr:0.0002079 epoch_Time:28564.0min: [2024-01-02 20:54:42,887][model8_pretrain.py][INFO] Epoch:[0/2](31100/4588595) loss:3.167 lr:0.0002079 epoch_Time:28564.0min: [2024-01-02 20:55:19,829][model8_pretrain.py][INFO] Epoch:[0/2](31200/4588595) loss:3.538 lr:0.0002074 epoch_Time:28561.0min: [2024-01-02 20:55:19,829][model8_pretrain.py][INFO] Epoch:[0/2](31200/4588595) loss:3.518 lr:0.0002074 epoch_Time:28561.0min: [2024-01-02 20:55:19,829][model8_pretrain.py][INFO] Epoch:[0/2](31200/4588595) loss:3.226 lr:0.0002074 epoch_Time:28561.0min: [2024-01-02 20:55:19,829][model8_pretrain.py][INFO] Epoch:[0/2](31200/4588595) loss:3.327 lr:0.0002074 epoch_Time:28561.0min: [2024-01-02 20:55:19,829][model8_pretrain.py][INFO] Epoch:[0/2](31200/4588595) loss:2.535 lr:0.0002074 epoch_Time:28561.0min: [2024-01-02 20:55:19,829][model8_pretrain.py][INFO] Epoch:[0/2](31200/4588595) loss:3.423 lr:0.0002074 epoch_Time:28561.0min: [2024-01-02 20:55:19,830][model8_pretrain.py][INFO] Epoch:[0/2](31200/4588595) loss:3.681 lr:0.0002074 epoch_Time:28561.0min: [2024-01-02 20:55:19,830][model8_pretrain.py][INFO] Epoch:[0/2](31200/4588595) loss:3.240 lr:0.0002074 epoch_Time:28561.0min: [2024-01-02 20:55:56,764][model8_pretrain.py][INFO] Epoch:[0/2](31300/4588595) loss:3.413 lr:0.0002069 epoch_Time:28559.0min: [2024-01-02 20:55:56,764][model8_pretrain.py][INFO] Epoch:[0/2](31300/4588595) loss:3.164 lr:0.0002069 epoch_Time:28559.0min: [2024-01-02 20:55:56,764][model8_pretrain.py][INFO] Epoch:[0/2](31300/4588595) loss:3.323 lr:0.0002069 epoch_Time:28559.0min: [2024-01-02 20:55:56,764][model8_pretrain.py][INFO] Epoch:[0/2](31300/4588595) loss:3.772 lr:0.0002069 epoch_Time:28559.0min: [2024-01-02 20:55:56,764][model8_pretrain.py][INFO] Epoch:[0/2](31300/4588595) loss:3.185 lr:0.0002069 epoch_Time:28559.0min: [2024-01-02 
20:55:56,764][model8_pretrain.py][INFO] Epoch:[0/2](31300/4588595) loss:3.092 lr:0.0002069 epoch_Time:28559.0min: [2024-01-02 20:55:56,764][model8_pretrain.py][INFO] Epoch:[0/2](31300/4588595) loss:3.849 lr:0.0002069 epoch_Time:28559.0min: [2024-01-02 20:55:56,764][model8_pretrain.py][INFO] Epoch:[0/2](31300/4588595) loss:3.219 lr:0.0002069 epoch_Time:28559.0min: [2024-01-02 20:56:33,700][model8_pretrain.py][INFO] Epoch:[0/2](31400/4588595) loss:3.733 lr:0.0002063 epoch_Time:28557.0min: [2024-01-02 20:56:33,700][model8_pretrain.py][INFO] Epoch:[0/2](31400/4588595) loss:3.411 lr:0.0002063 epoch_Time:28557.0min: [2024-01-02 20:56:33,700][model8_pretrain.py][INFO] Epoch:[0/2](31400/4588595) loss:3.189 lr:0.0002063 epoch_Time:28557.0min: [2024-01-02 20:56:33,700][model8_pretrain.py][INFO] Epoch:[0/2](31400/4588595) loss:2.743 lr:0.0002063 epoch_Time:28557.0min: [2024-01-02 20:56:33,700][model8_pretrain.py][INFO] Epoch:[0/2](31400/4588595) loss:3.265 lr:0.0002063 epoch_Time:28557.0min: [2024-01-02 20:56:33,700][model8_pretrain.py][INFO] Epoch:[0/2](31400/4588595) loss:3.339 lr:0.0002063 epoch_Time:28557.0min: [2024-01-02 20:56:33,700][model8_pretrain.py][INFO] Epoch:[0/2](31400/4588595) loss:3.273 lr:0.0002063 epoch_Time:28557.0min: [2024-01-02 20:56:33,701][model8_pretrain.py][INFO] Epoch:[0/2](31400/4588595) loss:3.597 lr:0.0002063 epoch_Time:28557.0min: [2024-01-02 20:57:10,650][model8_pretrain.py][INFO] Epoch:[0/2](31500/4588595) loss:3.407 lr:0.0002058 epoch_Time:28554.0min: [2024-01-02 20:57:10,650][model8_pretrain.py][INFO] Epoch:[0/2](31500/4588595) loss:3.542 lr:0.0002058 epoch_Time:28555.0min: [2024-01-02 20:57:10,650][model8_pretrain.py][INFO] Epoch:[0/2](31500/4588595) loss:3.536 lr:0.0002058 epoch_Time:28554.0min: [2024-01-02 20:57:10,650][model8_pretrain.py][INFO] Epoch:[0/2](31500/4588595) loss:3.502 lr:0.0002058 epoch_Time:28554.0min: [2024-01-02 20:57:10,650][model8_pretrain.py][INFO] Epoch:[0/2](31500/4588595) loss:3.687 lr:0.0002058 
epoch_Time:28555.0min: [2024-01-02 20:57:10,650][model8_pretrain.py][INFO] Epoch:[0/2](31500/4588595) loss:3.351 lr:0.0002058 epoch_Time:28554.0min: [2024-01-02 20:57:10,650][model8_pretrain.py][INFO] Epoch:[0/2](31500/4588595) loss:3.548 lr:0.0002058 epoch_Time:28554.0min: [2024-01-02 20:57:10,650][model8_pretrain.py][INFO] Epoch:[0/2](31500/4588595) loss:3.204 lr:0.0002058 epoch_Time:28554.0min: [2024-01-02 20:57:47,590][model8_pretrain.py][INFO] Epoch:[0/2](31600/4588595) loss:3.305 lr:0.0002053 epoch_Time:28552.0min: [2024-01-02 20:57:47,590][model8_pretrain.py][INFO] Epoch:[0/2](31600/4588595) loss:3.660 lr:0.0002053 epoch_Time:28552.0min: [2024-01-02 20:57:47,590][model8_pretrain.py][INFO] Epoch:[0/2](31600/4588595) loss:3.437 lr:0.0002053 epoch_Time:28553.0min: [2024-01-02 20:57:47,590][model8_pretrain.py][INFO] Epoch:[0/2](31600/4588595) loss:3.489 lr:0.0002053 epoch_Time:28553.0min: [2024-01-02 20:57:47,590][model8_pretrain.py][INFO] Epoch:[0/2](31600/4588595) loss:2.922 lr:0.0002053 epoch_Time:28553.0min: [2024-01-02 20:57:47,590][model8_pretrain.py][INFO] Epoch:[0/2](31600/4588595) loss:3.279 lr:0.0002053 epoch_Time:28553.0min: [2024-01-02 20:57:47,590][model8_pretrain.py][INFO] Epoch:[0/2](31600/4588595) loss:2.959 lr:0.0002053 epoch_Time:28553.0min: [2024-01-02 20:57:47,590][model8_pretrain.py][INFO] Epoch:[0/2](31600/4588595) loss:3.288 lr:0.0002053 epoch_Time:28553.0min: [2024-01-02 20:58:24,533][model8_pretrain.py][INFO] Epoch:[0/2](31700/4588595) loss:3.567 lr:0.0002047 epoch_Time:28550.0min: [2024-01-02 20:58:24,533][model8_pretrain.py][INFO] Epoch:[0/2](31700/4588595) loss:3.457 lr:0.0002047 epoch_Time:28550.0min: [2024-01-02 20:58:24,533][model8_pretrain.py][INFO] Epoch:[0/2](31700/4588595) loss:3.411 lr:0.0002047 epoch_Time:28550.0min: [2024-01-02 20:58:24,533][model8_pretrain.py][INFO] Epoch:[0/2](31700/4588595) loss:3.559 lr:0.0002047 epoch_Time:28550.0min: [2024-01-02 20:58:24,533][model8_pretrain.py][INFO] Epoch:[0/2](31700/4588595) 
loss:3.732 lr:0.0002047 epoch_Time:28550.0min: [2024-01-02 20:58:24,533][model8_pretrain.py][INFO] Epoch:[0/2](31700/4588595) loss:3.483 lr:0.0002047 epoch_Time:28550.0min: [2024-01-02 20:58:24,533][model8_pretrain.py][INFO] Epoch:[0/2](31700/4588595) loss:3.558 lr:0.0002047 epoch_Time:28550.0min: [2024-01-02 20:58:24,533][model8_pretrain.py][INFO] Epoch:[0/2](31700/4588595) loss:3.298 lr:0.0002047 epoch_Time:28550.0min: [2024-01-02 20:59:08,533][model8_pretrain.py][INFO] Epoch:[0/2](31800/4588595) loss:3.338 lr:0.0002042 epoch_Time:28565.0min: [2024-01-02 20:59:08,533][model8_pretrain.py][INFO] Epoch:[0/2](31800/4588595) loss:3.153 lr:0.0002042 epoch_Time:28565.0min: [2024-01-02 20:59:08,533][model8_pretrain.py][INFO] Epoch:[0/2](31800/4588595) loss:3.241 lr:0.0002042 epoch_Time:28565.0min: [2024-01-02 20:59:08,533][model8_pretrain.py][INFO] Epoch:[0/2](31800/4588595) loss:3.393 lr:0.0002042 epoch_Time:28565.0min: [2024-01-02 20:59:08,533][model8_pretrain.py][INFO] Epoch:[0/2](31800/4588595) loss:3.169 lr:0.0002042 epoch_Time:28565.0min: [2024-01-02 20:59:08,533][model8_pretrain.py][INFO] Epoch:[0/2](31800/4588595) loss:3.312 lr:0.0002042 epoch_Time:28565.0min: [2024-01-02 20:59:08,533][model8_pretrain.py][INFO] Epoch:[0/2](31800/4588595) loss:3.340 lr:0.0002042 epoch_Time:28565.0min: [2024-01-02 20:59:08,533][model8_pretrain.py][INFO] Epoch:[0/2](31800/4588595) loss:3.392 lr:0.0002042 epoch_Time:28565.0min: [2024-01-02 20:59:45,474][model8_pretrain.py][INFO] Epoch:[0/2](31900/4588595) loss:3.618 lr:0.0002036 epoch_Time:28563.0min: [2024-01-02 20:59:45,474][model8_pretrain.py][INFO] Epoch:[0/2](31900/4588595) loss:2.660 lr:0.0002036 epoch_Time:28563.0min: [2024-01-02 20:59:45,474][model8_pretrain.py][INFO] Epoch:[0/2](31900/4588595) loss:3.476 lr:0.0002036 epoch_Time:28563.0min: [2024-01-02 20:59:45,474][model8_pretrain.py][INFO] Epoch:[0/2](31900/4588595) loss:3.700 lr:0.0002036 epoch_Time:28563.0min: [2024-01-02 20:59:45,474][model8_pretrain.py][INFO] 
Epoch:[0/2](31900/4588595) loss:3.434 lr:0.0002036 epoch_Time:28563.0min: [2024-01-02 20:59:45,474][model8_pretrain.py][INFO] Epoch:[0/2](31900/4588595) loss:3.638 lr:0.0002036 epoch_Time:28563.0min: [2024-01-02 20:59:45,475][model8_pretrain.py][INFO] Epoch:[0/2](31900/4588595) loss:3.137 lr:0.0002036 epoch_Time:28563.0min: [2024-01-02 20:59:45,475][model8_pretrain.py][INFO] Epoch:[0/2](31900/4588595) loss:3.300 lr:0.0002036 epoch_Time:28563.0min: [2024-01-02 21:00:22,420][model8_pretrain.py][INFO] Epoch:[0/2](32000/4588595) loss:3.425 lr:0.0002031 epoch_Time:28561.0min: [2024-01-02 21:00:22,420][model8_pretrain.py][INFO] Epoch:[0/2](32000/4588595) loss:2.551 lr:0.0002031 epoch_Time:28561.0min: [2024-01-02 21:00:22,420][model8_pretrain.py][INFO] Epoch:[0/2](32000/4588595) loss:3.937 lr:0.0002031 epoch_Time:28561.0min: [2024-01-02 21:00:22,420][model8_pretrain.py][INFO] Epoch:[0/2](32000/4588595) loss:3.573 lr:0.0002031 epoch_Time:28561.0min: [2024-01-02 21:00:22,420][model8_pretrain.py][INFO] Epoch:[0/2](32000/4588595) loss:3.398 lr:0.0002031 epoch_Time:28561.0min: [2024-01-02 21:00:22,420][model8_pretrain.py][INFO] Epoch:[0/2](32000/4588595) loss:2.971 lr:0.0002031 epoch_Time:28561.0min: [2024-01-02 21:00:22,420][model8_pretrain.py][INFO] Epoch:[0/2](32000/4588595) loss:3.397 lr:0.0002031 epoch_Time:28561.0min: [2024-01-02 21:00:22,420][model8_pretrain.py][INFO] Epoch:[0/2](32000/4588595) loss:2.702 lr:0.0002031 epoch_Time:28560.0min: [2024-01-02 21:00:59,359][model8_pretrain.py][INFO] Epoch:[0/2](32100/4588595) loss:3.135 lr:0.0002025 epoch_Time:28558.0min: [2024-01-02 21:00:59,359][model8_pretrain.py][INFO] Epoch:[0/2](32100/4588595) loss:3.470 lr:0.0002025 epoch_Time:28558.0min: [2024-01-02 21:00:59,359][model8_pretrain.py][INFO] Epoch:[0/2](32100/4588595) loss:3.389 lr:0.0002025 epoch_Time:28558.0min: [2024-01-02 21:00:59,359][model8_pretrain.py][INFO] Epoch:[0/2](32100/4588595) loss:3.199 lr:0.0002025 epoch_Time:28558.0min: [2024-01-02 
21:00:59,359][model8_pretrain.py][INFO] Epoch:[0/2](32100/4588595) loss:3.744 lr:0.0002025 epoch_Time:28558.0min: [2024-01-02 21:00:59,359][model8_pretrain.py][INFO] Epoch:[0/2](32100/4588595) loss:3.530 lr:0.0002025 epoch_Time:28558.0min: [2024-01-02 21:00:59,359][model8_pretrain.py][INFO] Epoch:[0/2](32100/4588595) loss:3.287 lr:0.0002025 epoch_Time:28558.0min: [2024-01-02 21:00:59,359][model8_pretrain.py][INFO] Epoch:[0/2](32100/4588595) loss:3.239 lr:0.0002025 epoch_Time:28558.0min: [2024-01-02 21:01:36,289][model8_pretrain.py][INFO] Epoch:[0/2](32200/4588595) loss:3.280 lr:0.0002020 epoch_Time:28556.0min: [2024-01-02 21:01:36,290][model8_pretrain.py][INFO] Epoch:[0/2](32200/4588595) loss:3.725 lr:0.0002020 epoch_Time:28556.0min: [2024-01-02 21:01:36,290][model8_pretrain.py][INFO] Epoch:[0/2](32200/4588595) loss:3.494 lr:0.0002020 epoch_Time:28556.0min: [2024-01-02 21:01:36,290][model8_pretrain.py][INFO] Epoch:[0/2](32200/4588595) loss:3.090 lr:0.0002020 epoch_Time:28556.0min: [2024-01-02 21:01:36,290][model8_pretrain.py][INFO] Epoch:[0/2](32200/4588595) loss:3.602 lr:0.0002020 epoch_Time:28556.0min: [2024-01-02 21:01:36,290][model8_pretrain.py][INFO] Epoch:[0/2](32200/4588595) loss:3.314 lr:0.0002020 epoch_Time:28556.0min: [2024-01-02 21:01:36,290][model8_pretrain.py][INFO] Epoch:[0/2](32200/4588595) loss:3.121 lr:0.0002020 epoch_Time:28556.0min: [2024-01-02 21:01:36,290][model8_pretrain.py][INFO] Epoch:[0/2](32200/4588595) loss:3.109 lr:0.0002020 epoch_Time:28556.0min: [2024-01-02 21:02:13,245][model8_pretrain.py][INFO] Epoch:[0/2](32300/4588595) loss:3.033 lr:0.0002014 epoch_Time:28554.0min: [2024-01-02 21:02:13,245][model8_pretrain.py][INFO] Epoch:[0/2](32300/4588595) loss:3.613 lr:0.0002014 epoch_Time:28554.0min: [2024-01-02 21:02:13,245][model8_pretrain.py][INFO] Epoch:[0/2](32300/4588595) loss:3.314 lr:0.0002014 epoch_Time:28554.0min: [2024-01-02 21:02:13,245][model8_pretrain.py][INFO] Epoch:[0/2](32300/4588595) loss:3.219 lr:0.0002014 
epoch_Time:28554.0min: [2024-01-02 21:02:13,245][model8_pretrain.py][INFO] Epoch:[0/2](32300/4588595) loss:3.855 lr:0.0002014 epoch_Time:28554.0min: [2024-01-02 21:02:13,245][model8_pretrain.py][INFO] Epoch:[0/2](32300/4588595) loss:3.108 lr:0.0002014 epoch_Time:28554.0min: [2024-01-02 21:02:13,245][model8_pretrain.py][INFO] Epoch:[0/2](32300/4588595) loss:3.444 lr:0.0002014 epoch_Time:28554.0min: [2024-01-02 21:02:13,246][model8_pretrain.py][INFO] Epoch:[0/2](32300/4588595) loss:3.765 lr:0.0002014 epoch_Time:28554.0min: [2024-01-02 21:02:50,187][model8_pretrain.py][INFO] Epoch:[0/2](32400/4588595) loss:3.514 lr:0.0002009 epoch_Time:28551.0min: [2024-01-02 21:02:50,187][model8_pretrain.py][INFO] Epoch:[0/2](32400/4588595) loss:3.575 lr:0.0002009 epoch_Time:28551.0min: [2024-01-02 21:02:50,187][model8_pretrain.py][INFO] Epoch:[0/2](32400/4588595) loss:3.533 lr:0.0002009 epoch_Time:28551.0min: [2024-01-02 21:02:50,187][model8_pretrain.py][INFO] Epoch:[0/2](32400/4588595) loss:3.686 lr:0.0002009 epoch_Time:28551.0min: [2024-01-02 21:02:50,187][model8_pretrain.py][INFO] Epoch:[0/2](32400/4588595) loss:3.037 lr:0.0002009 epoch_Time:28551.0min: [2024-01-02 21:02:50,187][model8_pretrain.py][INFO] Epoch:[0/2](32400/4588595) loss:3.539 lr:0.0002009 epoch_Time:28551.0min: [2024-01-02 21:02:50,187][model8_pretrain.py][INFO] Epoch:[0/2](32400/4588595) loss:3.383 lr:0.0002009 epoch_Time:28551.0min: [2024-01-02 21:02:50,187][model8_pretrain.py][INFO] Epoch:[0/2](32400/4588595) loss:2.977 lr:0.0002009 epoch_Time:28551.0min: [2024-01-02 21:03:27,123][model8_pretrain.py][INFO] Epoch:[0/2](32500/4588595) loss:3.491 lr:0.0002004 epoch_Time:28550.0min: [2024-01-02 21:03:27,123][model8_pretrain.py][INFO] Epoch:[0/2](32500/4588595) loss:3.327 lr:0.0002004 epoch_Time:28550.0min: [2024-01-02 21:03:27,123][model8_pretrain.py][INFO] Epoch:[0/2](32500/4588595) loss:3.016 lr:0.0002004 epoch_Time:28550.0min: [2024-01-02 21:03:27,123][model8_pretrain.py][INFO] Epoch:[0/2](32500/4588595) 
loss:3.376 lr:0.0002004 epoch_Time:28550.0min: [2024-01-02 21:03:27,123][model8_pretrain.py][INFO] Epoch:[0/2](32500/4588595) loss:3.512 lr:0.0002004 epoch_Time:28550.0min: [2024-01-02 21:03:27,123][model8_pretrain.py][INFO] Epoch:[0/2](32500/4588595) loss:3.547 lr:0.0002004 epoch_Time:28550.0min: [2024-01-02 21:03:27,123][model8_pretrain.py][INFO] Epoch:[0/2](32500/4588595) loss:3.146 lr:0.0002004 epoch_Time:28550.0min: [2024-01-02 21:03:27,124][model8_pretrain.py][INFO] Epoch:[0/2](32500/4588595) loss:3.618 lr:0.0002004 epoch_Time:28550.0min: [2024-01-02 21:04:11,071][model8_pretrain.py][INFO] Epoch:[0/2](32600/4588595) loss:3.259 lr:0.0001998 epoch_Time:28564.0min: [2024-01-02 21:04:11,071][model8_pretrain.py][INFO] Epoch:[0/2](32600/4588595) loss:3.697 lr:0.0001998 epoch_Time:28564.0min: [2024-01-02 21:04:11,072][model8_pretrain.py][INFO] Epoch:[0/2](32600/4588595) loss:3.305 lr:0.0001998 epoch_Time:28564.0min: [2024-01-02 21:04:11,072][model8_pretrain.py][INFO] Epoch:[0/2](32600/4588595) loss:3.622 lr:0.0001998 epoch_Time:28564.0min: [2024-01-02 21:04:11,072][model8_pretrain.py][INFO] Epoch:[0/2](32600/4588595) loss:3.563 lr:0.0001998 epoch_Time:28564.0min: [2024-01-02 21:04:11,072][model8_pretrain.py][INFO] Epoch:[0/2](32600/4588595) loss:3.263 lr:0.0001998 epoch_Time:28564.0min: [2024-01-02 21:04:11,072][model8_pretrain.py][INFO] Epoch:[0/2](32600/4588595) loss:3.124 lr:0.0001998 epoch_Time:28564.0min: [2024-01-02 21:04:11,072][model8_pretrain.py][INFO] Epoch:[0/2](32600/4588595) loss:2.978 lr:0.0001998 epoch_Time:28564.0min: [2024-01-02 21:04:48,011][model8_pretrain.py][INFO] Epoch:[0/2](32700/4588595) loss:3.199 lr:0.0001993 epoch_Time:28561.0min: [2024-01-02 21:04:48,011][model8_pretrain.py][INFO] Epoch:[0/2](32700/4588595) loss:3.253 lr:0.0001993 epoch_Time:28561.0min: [2024-01-02 21:04:48,011][model8_pretrain.py][INFO] Epoch:[0/2](32700/4588595) loss:3.279 lr:0.0001993 epoch_Time:28561.0min: [2024-01-02 21:04:48,011][model8_pretrain.py][INFO] 
Epoch:[0/2](32700/4588595) loss:3.485 lr:0.0001993 epoch_Time:28561.0min: [2024-01-02 21:04:48,011][model8_pretrain.py][INFO] Epoch:[0/2](32700/4588595) loss:3.316 lr:0.0001993 epoch_Time:28561.0min: [2024-01-02 21:04:48,011][model8_pretrain.py][INFO] Epoch:[0/2](32700/4588595) loss:3.290 lr:0.0001993 epoch_Time:28561.0min: [2024-01-02 21:04:48,011][model8_pretrain.py][INFO] Epoch:[0/2](32700/4588595) loss:3.386 lr:0.0001993 epoch_Time:28561.0min: [2024-01-02 21:04:48,011][model8_pretrain.py][INFO] Epoch:[0/2](32700/4588595) loss:3.359 lr:0.0001993 epoch_Time:28561.0min: [2024-01-02 21:05:24,918][model8_pretrain.py][INFO] Epoch:[0/2](32800/4588595) loss:3.359 lr:0.0001987 epoch_Time:28559.0min: [2024-01-02 21:05:24,918][model8_pretrain.py][INFO] Epoch:[0/2](32800/4588595) loss:3.716 lr:0.0001987 epoch_Time:28559.0min: [2024-01-02 21:05:24,918][model8_pretrain.py][INFO] Epoch:[0/2](32800/4588595) loss:3.496 lr:0.0001987 epoch_Time:28559.0min: [2024-01-02 21:05:24,918][model8_pretrain.py][INFO] Epoch:[0/2](32800/4588595) loss:3.243 lr:0.0001987 epoch_Time:28559.0min: [2024-01-02 21:05:24,918][model8_pretrain.py][INFO] Epoch:[0/2](32800/4588595) loss:3.736 lr:0.0001987 epoch_Time:28559.0min: [2024-01-02 21:05:24,918][model8_pretrain.py][INFO] Epoch:[0/2](32800/4588595) loss:3.442 lr:0.0001987 epoch_Time:28559.0min: [2024-01-02 21:05:24,918][model8_pretrain.py][INFO] Epoch:[0/2](32800/4588595) loss:3.684 lr:0.0001987 epoch_Time:28559.0min: [2024-01-02 21:05:24,918][model8_pretrain.py][INFO] Epoch:[0/2](32800/4588595) loss:3.201 lr:0.0001987 epoch_Time:28559.0min: [2024-01-02 21:06:01,852][model8_pretrain.py][INFO] Epoch:[0/2](32900/4588595) loss:3.136 lr:0.0001982 epoch_Time:28557.0min: [2024-01-02 21:06:01,852][model8_pretrain.py][INFO] Epoch:[0/2](32900/4588595) loss:3.618 lr:0.0001982 epoch_Time:28557.0min: [2024-01-02 21:06:01,852][model8_pretrain.py][INFO] Epoch:[0/2](32900/4588595) loss:3.169 lr:0.0001982 epoch_Time:28557.0min: [2024-01-02 
21:06:01,852][model8_pretrain.py][INFO] Epoch:[0/2](32900/4588595) loss:2.985 lr:0.0001982 epoch_Time:28557.0min: [2024-01-02 21:06:01,852][model8_pretrain.py][INFO] Epoch:[0/2](32900/4588595) loss:3.143 lr:0.0001982 epoch_Time:28557.0min: [2024-01-02 21:06:01,852][model8_pretrain.py][INFO] Epoch:[0/2](32900/4588595) loss:3.553 lr:0.0001982 epoch_Time:28557.0min: [2024-01-02 21:06:01,852][model8_pretrain.py][INFO] Epoch:[0/2](32900/4588595) loss:3.600 lr:0.0001982 epoch_Time:28557.0min: [2024-01-02 21:06:01,852][model8_pretrain.py][INFO] Epoch:[0/2](32900/4588595) loss:3.556 lr:0.0001982 epoch_Time:28557.0min: [2024-01-02 21:06:38,809][model8_pretrain.py][INFO] Epoch:[0/2](33000/4588595) loss:3.373 lr:0.0001976 epoch_Time:28555.0min: [2024-01-02 21:06:38,809][model8_pretrain.py][INFO] Epoch:[0/2](33000/4588595) loss:3.845 lr:0.0001976 epoch_Time:28555.0min: [2024-01-02 21:06:38,809][model8_pretrain.py][INFO] Epoch:[0/2](33000/4588595) loss:3.410 lr:0.0001976 epoch_Time:28555.0min: [2024-01-02 21:06:38,809][model8_pretrain.py][INFO] Epoch:[0/2](33000/4588595) loss:3.471 lr:0.0001976 epoch_Time:28555.0min: [2024-01-02 21:06:38,809][model8_pretrain.py][INFO] Epoch:[0/2](33000/4588595) loss:3.842 lr:0.0001976 epoch_Time:28555.0min: [2024-01-02 21:06:38,809][model8_pretrain.py][INFO] Epoch:[0/2](33000/4588595) loss:3.097 lr:0.0001976 epoch_Time:28555.0min: [2024-01-02 21:06:38,809][model8_pretrain.py][INFO] Epoch:[0/2](33000/4588595) loss:3.548 lr:0.0001976 epoch_Time:28555.0min: [2024-01-02 21:06:38,809][model8_pretrain.py][INFO] Epoch:[0/2](33000/4588595) loss:3.430 lr:0.0001976 epoch_Time:28555.0min: [2024-01-02 21:07:15,759][model8_pretrain.py][INFO] Epoch:[0/2](33100/4588595) loss:3.250 lr:0.0001971 epoch_Time:28553.0min: [2024-01-02 21:07:15,759][model8_pretrain.py][INFO] Epoch:[0/2](33100/4588595) loss:3.135 lr:0.0001971 epoch_Time:28553.0min: [2024-01-02 21:07:15,759][model8_pretrain.py][INFO] Epoch:[0/2](33100/4588595) loss:3.283 lr:0.0001971 
epoch_Time:28553.0min: [2024-01-02 21:07:15,759][model8_pretrain.py][INFO] Epoch:[0/2](33100/4588595) loss:3.261 lr:0.0001971 epoch_Time:28553.0min: [2024-01-02 21:07:15,759][model8_pretrain.py][INFO] Epoch:[0/2](33100/4588595) loss:3.282 lr:0.0001971 epoch_Time:28553.0min: [2024-01-02 21:07:15,759][model8_pretrain.py][INFO] Epoch:[0/2](33100/4588595) loss:3.785 lr:0.0001971 epoch_Time:28553.0min: [2024-01-02 21:07:15,759][model8_pretrain.py][INFO] Epoch:[0/2](33100/4588595) loss:2.871 lr:0.0001971 epoch_Time:28553.0min: [2024-01-02 21:07:15,760][model8_pretrain.py][INFO] Epoch:[0/2](33100/4588595) loss:3.601 lr:0.0001971 epoch_Time:28553.0min: [2024-01-02 21:07:52,696][model8_pretrain.py][INFO] Epoch:[0/2](33200/4588595) loss:3.066 lr:0.0001965 epoch_Time:28550.0min: [2024-01-02 21:07:52,696][model8_pretrain.py][INFO] Epoch:[0/2](33200/4588595) loss:3.165 lr:0.0001965 epoch_Time:28550.0min: [2024-01-02 21:07:52,696][model8_pretrain.py][INFO] Epoch:[0/2](33200/4588595) loss:3.244 lr:0.0001965 epoch_Time:28550.0min: [2024-01-02 21:07:52,696][model8_pretrain.py][INFO] Epoch:[0/2](33200/4588595) loss:3.532 lr:0.0001965 epoch_Time:28550.0min: [2024-01-02 21:07:52,696][model8_pretrain.py][INFO] Epoch:[0/2](33200/4588595) loss:3.604 lr:0.0001965 epoch_Time:28550.0min: [2024-01-02 21:07:52,696][model8_pretrain.py][INFO] Epoch:[0/2](33200/4588595) loss:3.338 lr:0.0001965 epoch_Time:28550.0min: [2024-01-02 21:07:52,696][model8_pretrain.py][INFO] Epoch:[0/2](33200/4588595) loss:3.178 lr:0.0001965 epoch_Time:28550.0min: [2024-01-02 21:07:52,696][model8_pretrain.py][INFO] Epoch:[0/2](33200/4588595) loss:3.468 lr:0.0001965 epoch_Time:28550.0min: [2024-01-02 21:08:29,635][model8_pretrain.py][INFO] Epoch:[0/2](33300/4588595) loss:3.583 lr:0.0001960 epoch_Time:28549.0min: [2024-01-02 21:08:29,635][model8_pretrain.py][INFO] Epoch:[0/2](33300/4588595) loss:3.526 lr:0.0001960 epoch_Time:28549.0min: [2024-01-02 21:08:29,635][model8_pretrain.py][INFO] Epoch:[0/2](33300/4588595) 
loss:3.288 lr:0.0001960 epoch_Time:28549.0min: [2024-01-02 21:08:29,635][model8_pretrain.py][INFO] Epoch:[0/2](33300/4588595) loss:3.451 lr:0.0001960 epoch_Time:28549.0min: [2024-01-02 21:08:29,635][model8_pretrain.py][INFO] Epoch:[0/2](33300/4588595) loss:3.478 lr:0.0001960 epoch_Time:28549.0min: [2024-01-02 21:08:29,635][model8_pretrain.py][INFO] Epoch:[0/2](33300/4588595) loss:3.380 lr:0.0001960 epoch_Time:28549.0min: [2024-01-02 21:08:29,635][model8_pretrain.py][INFO] Epoch:[0/2](33300/4588595) loss:3.411 lr:0.0001960 epoch_Time:28549.0min: [2024-01-02 21:08:29,635][model8_pretrain.py][INFO] Epoch:[0/2](33300/4588595) loss:3.579 lr:0.0001960 epoch_Time:28549.0min: [2024-01-02 21:09:13,655][model8_pretrain.py][INFO] Epoch:[0/2](33400/4588595) loss:3.664 lr:0.0001954 epoch_Time:28562.0min: [2024-01-02 21:09:13,655][model8_pretrain.py][INFO] Epoch:[0/2](33400/4588595) loss:3.419 lr:0.0001954 epoch_Time:28562.0min: [2024-01-02 21:09:13,655][model8_pretrain.py][INFO] Epoch:[0/2](33400/4588595) loss:3.114 lr:0.0001954 epoch_Time:28562.0min: [2024-01-02 21:09:13,655][model8_pretrain.py][INFO] Epoch:[0/2](33400/4588595) loss:3.360 lr:0.0001954 epoch_Time:28562.0min: [2024-01-02 21:09:13,655][model8_pretrain.py][INFO] Epoch:[0/2](33400/4588595) loss:3.428 lr:0.0001954 epoch_Time:28562.0min: [2024-01-02 21:09:13,655][model8_pretrain.py][INFO] Epoch:[0/2](33400/4588595) loss:3.287 lr:0.0001954 epoch_Time:28562.0min: [2024-01-02 21:09:13,655][model8_pretrain.py][INFO] Epoch:[0/2](33400/4588595) loss:3.369 lr:0.0001954 epoch_Time:28562.0min: [2024-01-02 21:09:13,655][model8_pretrain.py][INFO] Epoch:[0/2](33400/4588595) loss:3.078 lr:0.0001954 epoch_Time:28562.0min: [2024-01-02 21:09:50,600][model8_pretrain.py][INFO] Epoch:[0/2](33500/4588595) loss:3.655 lr:0.0001948 epoch_Time:28560.0min: [2024-01-02 21:09:50,600][model8_pretrain.py][INFO] Epoch:[0/2](33500/4588595) loss:3.693 lr:0.0001948 epoch_Time:28560.0min: [2024-01-02 21:09:50,600][model8_pretrain.py][INFO] 
Epoch:[0/2](33500/4588595) loss:3.796 lr:0.0001948 epoch_Time:28560.0min: [2024-01-02 21:09:50,600][model8_pretrain.py][INFO] Epoch:[0/2](33500/4588595) loss:3.302 lr:0.0001948 epoch_Time:28560.0min: [2024-01-02 21:09:50,600][model8_pretrain.py][INFO] Epoch:[0/2](33500/4588595) loss:3.074 lr:0.0001948 epoch_Time:28560.0min: [2024-01-02 21:09:50,600][model8_pretrain.py][INFO] Epoch:[0/2](33500/4588595) loss:3.498 lr:0.0001948 epoch_Time:28560.0min: [2024-01-02 21:09:50,600][model8_pretrain.py][INFO] Epoch:[0/2](33500/4588595) loss:3.529 lr:0.0001948 epoch_Time:28560.0min: [2024-01-02 21:09:50,600][model8_pretrain.py][INFO] Epoch:[0/2](33500/4588595) loss:3.173 lr:0.0001948 epoch_Time:28560.0min: [2024-01-02 21:10:27,560][model8_pretrain.py][INFO] Epoch:[0/2](33600/4588595) loss:3.234 lr:0.0001943 epoch_Time:28558.0min: [2024-01-02 21:10:27,560][model8_pretrain.py][INFO] Epoch:[0/2](33600/4588595) loss:2.860 lr:0.0001943 epoch_Time:28558.0min: [2024-01-02 21:10:27,560][model8_pretrain.py][INFO] Epoch:[0/2](33600/4588595) loss:2.925 lr:0.0001943 epoch_Time:28558.0min: [2024-01-02 21:10:27,560][model8_pretrain.py][INFO] Epoch:[0/2](33600/4588595) loss:2.940 lr:0.0001943 epoch_Time:28558.0min: [2024-01-02 21:10:27,560][model8_pretrain.py][INFO] Epoch:[0/2](33600/4588595) loss:2.891 lr:0.0001943 epoch_Time:28558.0min: [2024-01-02 21:10:27,560][model8_pretrain.py][INFO] Epoch:[0/2](33600/4588595) loss:3.306 lr:0.0001943 epoch_Time:28558.0min: [2024-01-02 21:10:27,560][model8_pretrain.py][INFO] Epoch:[0/2](33600/4588595) loss:3.565 lr:0.0001943 epoch_Time:28558.0min: [2024-01-02 21:10:27,560][model8_pretrain.py][INFO] Epoch:[0/2](33600/4588595) loss:3.489 lr:0.0001943 epoch_Time:28558.0min: [2024-01-02 21:11:04,508][model8_pretrain.py][INFO] Epoch:[0/2](33700/4588595) loss:3.031 lr:0.0001937 epoch_Time:28556.0min: [2024-01-02 21:11:04,508][model8_pretrain.py][INFO] Epoch:[0/2](33700/4588595) loss:3.581 lr:0.0001937 epoch_Time:28556.0min: [2024-01-02 
21:11:04,508][model8_pretrain.py][INFO] Epoch:[0/2](33700/4588595) loss:3.738 lr:0.0001937 epoch_Time:28556.0min: [2024-01-02 21:11:04,508][model8_pretrain.py][INFO] Epoch:[0/2](33700/4588595) loss:3.333 lr:0.0001937 epoch_Time:28556.0min: [2024-01-02 21:11:04,508][model8_pretrain.py][INFO] Epoch:[0/2](33700/4588595) loss:3.647 lr:0.0001937 epoch_Time:28556.0min: [2024-01-02 21:11:04,508][model8_pretrain.py][INFO] Epoch:[0/2](33700/4588595) loss:3.237 lr:0.0001937 epoch_Time:28556.0min: [2024-01-02 21:11:04,508][model8_pretrain.py][INFO] Epoch:[0/2](33700/4588595) loss:2.491 lr:0.0001937 epoch_Time:28556.0min: [2024-01-02 21:11:04,508][model8_pretrain.py][INFO] Epoch:[0/2](33700/4588595) loss:3.412 lr:0.0001937 epoch_Time:28556.0min: [2024-01-02 21:11:41,460][model8_pretrain.py][INFO] Epoch:[0/2](33800/4588595) loss:3.450 lr:0.0001932 epoch_Time:28554.0min: [2024-01-02 21:11:41,460][model8_pretrain.py][INFO] Epoch:[0/2](33800/4588595) loss:3.446 lr:0.0001932 epoch_Time:28554.0min: [2024-01-02 21:11:41,460][model8_pretrain.py][INFO] Epoch:[0/2](33800/4588595) loss:3.761 lr:0.0001932 epoch_Time:28554.0min: [2024-01-02 21:11:41,460][model8_pretrain.py][INFO] Epoch:[0/2](33800/4588595) loss:3.774 lr:0.0001932 epoch_Time:28554.0min: [2024-01-02 21:11:41,460][model8_pretrain.py][INFO] Epoch:[0/2](33800/4588595) loss:3.792 lr:0.0001932 epoch_Time:28554.0min: [2024-01-02 21:11:41,460][model8_pretrain.py][INFO] Epoch:[0/2](33800/4588595) loss:3.167 lr:0.0001932 epoch_Time:28554.0min: [2024-01-02 21:11:41,460][model8_pretrain.py][INFO] Epoch:[0/2](33800/4588595) loss:3.065 lr:0.0001932 epoch_Time:28554.0min: [2024-01-02 21:11:41,460][model8_pretrain.py][INFO] Epoch:[0/2](33800/4588595) loss:3.296 lr:0.0001932 epoch_Time:28554.0min: [2024-01-02 21:12:18,394][model8_pretrain.py][INFO] Epoch:[0/2](33900/4588595) loss:3.904 lr:0.0001926 epoch_Time:28552.0min: [2024-01-02 21:12:18,394][model8_pretrain.py][INFO] Epoch:[0/2](33900/4588595) loss:3.581 lr:0.0001926 
epoch_Time:28552.0min: [2024-01-02 21:12:18,394][model8_pretrain.py][INFO] Epoch:[0/2](33900/4588595) loss:3.136 lr:0.0001926 epoch_Time:28552.0min: [2024-01-02 21:12:18,394][model8_pretrain.py][INFO] Epoch:[0/2](33900/4588595) loss:3.129 lr:0.0001926 epoch_Time:28552.0min: [2024-01-02 21:12:18,394][model8_pretrain.py][INFO] Epoch:[0/2](33900/4588595) loss:3.246 lr:0.0001926 epoch_Time:28552.0min: [2024-01-02 21:12:18,394][model8_pretrain.py][INFO] Epoch:[0/2](33900/4588595) loss:3.157 lr:0.0001926 epoch_Time:28552.0min: [2024-01-02 21:12:18,395][model8_pretrain.py][INFO] Epoch:[0/2](33900/4588595) loss:3.571 lr:0.0001926 epoch_Time:28552.0min: [2024-01-02 21:12:18,395][model8_pretrain.py][INFO] Epoch:[0/2](33900/4588595) loss:3.419 lr:0.0001926 epoch_Time:28552.0min: [2024-01-02 21:12:55,320][model8_pretrain.py][INFO] Epoch:[0/2](34000/4588595) loss:3.231 lr:0.0001921 epoch_Time:28549.0min: [2024-01-02 21:12:55,320][model8_pretrain.py][INFO] Epoch:[0/2](34000/4588595) loss:3.405 lr:0.0001921 epoch_Time:28549.0min: [2024-01-02 21:12:55,320][model8_pretrain.py][INFO] Epoch:[0/2](34000/4588595) loss:3.005 lr:0.0001921 epoch_Time:28549.0min: [2024-01-02 21:12:55,320][model8_pretrain.py][INFO] Epoch:[0/2](34000/4588595) loss:3.526 lr:0.0001921 epoch_Time:28549.0min: [2024-01-02 21:12:55,320][model8_pretrain.py][INFO] Epoch:[0/2](34000/4588595) loss:3.529 lr:0.0001921 epoch_Time:28549.0min: [2024-01-02 21:12:55,320][model8_pretrain.py][INFO] Epoch:[0/2](34000/4588595) loss:3.133 lr:0.0001921 epoch_Time:28549.0min: [2024-01-02 21:12:55,320][model8_pretrain.py][INFO] Epoch:[0/2](34000/4588595) loss:3.512 lr:0.0001921 epoch_Time:28549.0min: [2024-01-02 21:12:55,320][model8_pretrain.py][INFO] Epoch:[0/2](34000/4588595) loss:3.212 lr:0.0001921 epoch_Time:28549.0min: [2024-01-02 21:13:32,258][model8_pretrain.py][INFO] Epoch:[0/2](34100/4588595) loss:3.585 lr:0.0001915 epoch_Time:28548.0min: [2024-01-02 21:13:32,258][model8_pretrain.py][INFO] Epoch:[0/2](34100/4588595) 
loss:3.259 lr:0.0001915 epoch_Time:28548.0min: [2024-01-02 21:13:32,258][model8_pretrain.py][INFO] Epoch:[0/2](34100/4588595) loss:3.488 lr:0.0001915 epoch_Time:28548.0min: [2024-01-02 21:13:32,259][model8_pretrain.py][INFO] Epoch:[0/2](34100/4588595) loss:3.124 lr:0.0001915 epoch_Time:28548.0min: [2024-01-02 21:13:32,259][model8_pretrain.py][INFO] Epoch:[0/2](34100/4588595) loss:3.485 lr:0.0001915 epoch_Time:28548.0min: [2024-01-02 21:13:32,259][model8_pretrain.py][INFO] Epoch:[0/2](34100/4588595) loss:3.320 lr:0.0001915 epoch_Time:28548.0min: [2024-01-02 21:13:32,259][model8_pretrain.py][INFO] Epoch:[0/2](34100/4588595) loss:3.966 lr:0.0001915 epoch_Time:28548.0min: [2024-01-02 21:13:32,259][model8_pretrain.py][INFO] Epoch:[0/2](34100/4588595) loss:3.568 lr:0.0001915 epoch_Time:28548.0min: [2024-01-02 21:14:16,224][model8_pretrain.py][INFO] Epoch:[0/2](34200/4588595) loss:3.484 lr:0.0001909 epoch_Time:28561.0min: [2024-01-02 21:14:16,224][model8_pretrain.py][INFO] Epoch:[0/2](34200/4588595) loss:3.067 lr:0.0001909 epoch_Time:28561.0min: [2024-01-02 21:14:16,224][model8_pretrain.py][INFO] Epoch:[0/2](34200/4588595) loss:3.031 lr:0.0001909 epoch_Time:28561.0min: [2024-01-02 21:14:16,224][model8_pretrain.py][INFO] Epoch:[0/2](34200/4588595) loss:3.668 lr:0.0001909 epoch_Time:28561.0min: [2024-01-02 21:14:16,224][model8_pretrain.py][INFO] Epoch:[0/2](34200/4588595) loss:3.114 lr:0.0001909 epoch_Time:28561.0min: [2024-01-02 21:14:16,224][model8_pretrain.py][INFO] Epoch:[0/2](34200/4588595) loss:3.470 lr:0.0001909 epoch_Time:28561.0min: [2024-01-02 21:14:16,224][model8_pretrain.py][INFO] Epoch:[0/2](34200/4588595) loss:3.370 lr:0.0001909 epoch_Time:28561.0min: [2024-01-02 21:14:16,224][model8_pretrain.py][INFO] Epoch:[0/2](34200/4588595) loss:2.571 lr:0.0001909 epoch_Time:28561.0min: [2024-01-02 21:14:53,159][model8_pretrain.py][INFO] Epoch:[0/2](34300/4588595) loss:3.514 lr:0.0001904 epoch_Time:28558.0min: [2024-01-02 21:14:53,159][model8_pretrain.py][INFO] 
Epoch:[0/2](34300/4588595) loss:3.756 lr:0.0001904 epoch_Time:28558.0min: [2024-01-02 21:14:53,159][model8_pretrain.py][INFO] Epoch:[0/2](34300/4588595) loss:3.379 lr:0.0001904 epoch_Time:28558.0min: [2024-01-02 21:14:53,159][model8_pretrain.py][INFO] Epoch:[0/2](34300/4588595) loss:3.551 lr:0.0001904 epoch_Time:28558.0min: [2024-01-02 21:14:53,159][model8_pretrain.py][INFO] Epoch:[0/2](34300/4588595) loss:3.109 lr:0.0001904 epoch_Time:28558.0min: [2024-01-02 21:14:53,159][model8_pretrain.py][INFO] Epoch:[0/2](34300/4588595) loss:3.317 lr:0.0001904 epoch_Time:28558.0min: [2024-01-02 21:14:53,159][model8_pretrain.py][INFO] Epoch:[0/2](34300/4588595) loss:3.363 lr:0.0001904 epoch_Time:28558.0min: [2024-01-02 21:14:53,159][model8_pretrain.py][INFO] Epoch:[0/2](34300/4588595) loss:2.961 lr:0.0001904 epoch_Time:28558.0min: [2024-01-02 21:15:30,098][model8_pretrain.py][INFO] Epoch:[0/2](34400/4588595) loss:3.342 lr:0.0001898 epoch_Time:28557.0min: [2024-01-02 21:15:30,098][model8_pretrain.py][INFO] Epoch:[0/2](34400/4588595) loss:3.424 lr:0.0001898 epoch_Time:28557.0min: [2024-01-02 21:15:30,098][model8_pretrain.py][INFO] Epoch:[0/2](34400/4588595) loss:3.128 lr:0.0001898 epoch_Time:28557.0min: [2024-01-02 21:15:30,098][model8_pretrain.py][INFO] Epoch:[0/2](34400/4588595) loss:3.699 lr:0.0001898 epoch_Time:28557.0min: [2024-01-02 21:15:30,098][model8_pretrain.py][INFO] Epoch:[0/2](34400/4588595) loss:3.364 lr:0.0001898 epoch_Time:28557.0min: [2024-01-02 21:15:30,098][model8_pretrain.py][INFO] Epoch:[0/2](34400/4588595) loss:3.421 lr:0.0001898 epoch_Time:28557.0min: [2024-01-02 21:15:30,098][model8_pretrain.py][INFO] Epoch:[0/2](34400/4588595) loss:3.950 lr:0.0001898 epoch_Time:28557.0min: [2024-01-02 21:15:30,099][model8_pretrain.py][INFO] Epoch:[0/2](34400/4588595) loss:3.369 lr:0.0001898 epoch_Time:28557.0min: [2024-01-02 21:16:07,047][model8_pretrain.py][INFO] Epoch:[0/2](34500/4588595) loss:3.368 lr:0.0001893 epoch_Time:28554.0min: [2024-01-02 
21:16:07,047][model8_pretrain.py][INFO] Epoch:[0/2](34500/4588595) loss:3.187 lr:0.0001893 epoch_Time:28554.0min: [2024-01-02 21:16:07,047][model8_pretrain.py][INFO] Epoch:[0/2](34500/4588595) loss:2.718 lr:0.0001893 epoch_Time:28554.0min: [2024-01-02 21:16:07,047][model8_pretrain.py][INFO] Epoch:[0/2](34500/4588595) loss:4.136 lr:0.0001893 epoch_Time:28554.0min: [2024-01-02 21:16:07,047][model8_pretrain.py][INFO] Epoch:[0/2](34500/4588595) loss:3.503 lr:0.0001893 epoch_Time:28554.0min: [2024-01-02 21:16:07,047][model8_pretrain.py][INFO] Epoch:[0/2](34500/4588595) loss:3.225 lr:0.0001893 epoch_Time:28554.0min: [2024-01-02 21:16:07,047][model8_pretrain.py][INFO] Epoch:[0/2](34500/4588595) loss:3.442 lr:0.0001893 epoch_Time:28554.0min: [2024-01-02 21:16:07,047][model8_pretrain.py][INFO] Epoch:[0/2](34500/4588595) loss:3.438 lr:0.0001893 epoch_Time:28554.0min: [2024-01-02 21:16:43,985][model8_pretrain.py][INFO] Epoch:[0/2](34600/4588595) loss:3.769 lr:0.0001887 epoch_Time:28553.0min: [2024-01-02 21:16:43,985][model8_pretrain.py][INFO] Epoch:[0/2](34600/4588595) loss:3.652 lr:0.0001887 epoch_Time:28553.0min: [2024-01-02 21:16:43,985][model8_pretrain.py][INFO] Epoch:[0/2](34600/4588595) loss:3.464 lr:0.0001887 epoch_Time:28553.0min: [2024-01-02 21:16:43,985][model8_pretrain.py][INFO] Epoch:[0/2](34600/4588595) loss:2.662 lr:0.0001887 epoch_Time:28553.0min: [2024-01-02 21:16:43,985][model8_pretrain.py][INFO] Epoch:[0/2](34600/4588595) loss:3.235 lr:0.0001887 epoch_Time:28553.0min: [2024-01-02 21:16:43,985][model8_pretrain.py][INFO] Epoch:[0/2](34600/4588595) loss:3.091 lr:0.0001887 epoch_Time:28553.0min: [2024-01-02 21:16:43,985][model8_pretrain.py][INFO] Epoch:[0/2](34600/4588595) loss:3.370 lr:0.0001887 epoch_Time:28553.0min: [2024-01-02 21:16:43,986][model8_pretrain.py][INFO] Epoch:[0/2](34600/4588595) loss:3.480 lr:0.0001887 epoch_Time:28553.0min: [2024-01-02 21:17:20,943][model8_pretrain.py][INFO] Epoch:[0/2](34700/4588595) loss:3.441 lr:0.0001881 
epoch_Time:28550.0min: [2024-01-02 21:17:20,943][model8_pretrain.py][INFO] Epoch:[0/2](34700/4588595) loss:3.214 lr:0.0001881 epoch_Time:28550.0min: [2024-01-02 21:17:20,943][model8_pretrain.py][INFO] Epoch:[0/2](34700/4588595) loss:3.388 lr:0.0001881 epoch_Time:28550.0min: [2024-01-02 21:17:20,943][model8_pretrain.py][INFO] Epoch:[0/2](34700/4588595) loss:3.058 lr:0.0001881 epoch_Time:28550.0min: [2024-01-02 21:17:20,943][model8_pretrain.py][INFO] Epoch:[0/2](34700/4588595) loss:3.734 lr:0.0001881 epoch_Time:28550.0min: [2024-01-02 21:17:20,943][model8_pretrain.py][INFO] Epoch:[0/2](34700/4588595) loss:3.333 lr:0.0001881 epoch_Time:28550.0min: [2024-01-02 21:17:20,943][model8_pretrain.py][INFO] Epoch:[0/2](34700/4588595) loss:3.562 lr:0.0001881 epoch_Time:28550.0min: [2024-01-02 21:17:20,943][model8_pretrain.py][INFO] Epoch:[0/2](34700/4588595) loss:3.319 lr:0.0001881 epoch_Time:28550.0min: [2024-01-02 21:17:57,873][model8_pretrain.py][INFO] Epoch:[0/2](34800/4588595) loss:3.189 lr:0.0001876 epoch_Time:28548.0min: [2024-01-02 21:17:57,873][model8_pretrain.py][INFO] Epoch:[0/2](34800/4588595) loss:3.551 lr:0.0001876 epoch_Time:28548.0min: [2024-01-02 21:17:57,873][model8_pretrain.py][INFO] Epoch:[0/2](34800/4588595) loss:3.169 lr:0.0001876 epoch_Time:28548.0min: [2024-01-02 21:17:57,873][model8_pretrain.py][INFO] Epoch:[0/2](34800/4588595) loss:3.395 lr:0.0001876 epoch_Time:28548.0min: [2024-01-02 21:17:57,874][model8_pretrain.py][INFO] Epoch:[0/2](34800/4588595) loss:3.692 lr:0.0001876 epoch_Time:28548.0min: [2024-01-02 21:17:57,874][model8_pretrain.py][INFO] Epoch:[0/2](34800/4588595) loss:2.992 lr:0.0001876 epoch_Time:28548.0min: [2024-01-02 21:17:57,873][model8_pretrain.py][INFO] Epoch:[0/2](34800/4588595) loss:3.348 lr:0.0001876 epoch_Time:28548.0min: [2024-01-02 21:17:57,874][model8_pretrain.py][INFO] Epoch:[0/2](34800/4588595) loss:3.095 lr:0.0001876 epoch_Time:28548.0min: [2024-01-02 21:18:34,829][model8_pretrain.py][INFO] Epoch:[0/2](34900/4588595) 
loss:3.147 lr:0.0001870 epoch_Time:28546.0min: [2024-01-02 21:18:34,829][model8_pretrain.py][INFO] Epoch:[0/2](34900/4588595) loss:3.583 lr:0.0001870 epoch_Time:28546.0min: [2024-01-02 21:18:34,829][model8_pretrain.py][INFO] Epoch:[0/2](34900/4588595) loss:2.725 lr:0.0001870 epoch_Time:28546.0min: [2024-01-02 21:18:34,829][model8_pretrain.py][INFO] Epoch:[0/2](34900/4588595) loss:3.617 lr:0.0001870 epoch_Time:28546.0min: [2024-01-02 21:18:34,829][model8_pretrain.py][INFO] Epoch:[0/2](34900/4588595) loss:2.791 lr:0.0001870 epoch_Time:28546.0min: [2024-01-02 21:18:34,830][model8_pretrain.py][INFO] Epoch:[0/2](34900/4588595) loss:3.297 lr:0.0001870 epoch_Time:28546.0min: [2024-01-02 21:18:34,830][model8_pretrain.py][INFO] Epoch:[0/2](34900/4588595) loss:3.085 lr:0.0001870 epoch_Time:28547.0min: [2024-01-02 21:18:34,830][model8_pretrain.py][INFO] Epoch:[0/2](34900/4588595) loss:3.602 lr:0.0001870 epoch_Time:28546.0min: [2024-01-02 21:19:18,977][model8_pretrain.py][INFO] Epoch:[0/2](35000/4588595) loss:3.652 lr:0.0001865 epoch_Time:28560.0min: [2024-01-02 21:19:18,977][model8_pretrain.py][INFO] Epoch:[0/2](35000/4588595) loss:3.048 lr:0.0001865 epoch_Time:28560.0min: [2024-01-02 21:19:18,977][model8_pretrain.py][INFO] Epoch:[0/2](35000/4588595) loss:3.818 lr:0.0001865 epoch_Time:28560.0min: [2024-01-02 21:19:18,977][model8_pretrain.py][INFO] Epoch:[0/2](35000/4588595) loss:3.498 lr:0.0001865 epoch_Time:28560.0min: [2024-01-02 21:19:18,977][model8_pretrain.py][INFO] Epoch:[0/2](35000/4588595) loss:2.960 lr:0.0001865 epoch_Time:28560.0min: [2024-01-02 21:19:18,977][model8_pretrain.py][INFO] Epoch:[0/2](35000/4588595) loss:3.057 lr:0.0001865 epoch_Time:28560.0min: [2024-01-02 21:19:18,977][model8_pretrain.py][INFO] Epoch:[0/2](35000/4588595) loss:2.984 lr:0.0001865 epoch_Time:28560.0min: [2024-01-02 21:19:18,977][model8_pretrain.py][INFO] Epoch:[0/2](35000/4588595) loss:3.360 lr:0.0001865 epoch_Time:28560.0min: [2024-01-02 21:19:55,904][model8_pretrain.py][INFO] 
Epoch:[0/2](35100/4588595) loss:3.601 lr:0.0001859 epoch_Time:28557.0min: [2024-01-02 21:19:55,904][model8_pretrain.py][INFO] Epoch:[0/2](35100/4588595) loss:3.236 lr:0.0001859 epoch_Time:28557.0min: [2024-01-02 21:19:55,904][model8_pretrain.py][INFO] Epoch:[0/2](35100/4588595) loss:3.418 lr:0.0001859 epoch_Time:28557.0min: [2024-01-02 21:19:55,904][model8_pretrain.py][INFO] Epoch:[0/2](35100/4588595) loss:2.902 lr:0.0001859 epoch_Time:28557.0min: [2024-01-02 21:19:55,904][model8_pretrain.py][INFO] Epoch:[0/2](35100/4588595) loss:3.237 lr:0.0001859 epoch_Time:28557.0min: [2024-01-02 21:19:55,904][model8_pretrain.py][INFO] Epoch:[0/2](35100/4588595) loss:3.285 lr:0.0001859 epoch_Time:28557.0min: [2024-01-02 21:19:55,905][model8_pretrain.py][INFO] Epoch:[0/2](35100/4588595) loss:3.218 lr:0.0001859 epoch_Time:28557.0min: [2024-01-02 21:19:55,905][model8_pretrain.py][INFO] Epoch:[0/2](35100/4588595) loss:3.702 lr:0.0001859 epoch_Time:28557.0min: [2024-01-02 21:20:32,841][model8_pretrain.py][INFO] Epoch:[0/2](35200/4588595) loss:3.501 lr:0.0001853 epoch_Time:28556.0min: [2024-01-02 21:20:32,841][model8_pretrain.py][INFO] Epoch:[0/2](35200/4588595) loss:3.172 lr:0.0001853 epoch_Time:28556.0min: [2024-01-02 21:20:32,841][model8_pretrain.py][INFO] Epoch:[0/2](35200/4588595) loss:3.068 lr:0.0001853 epoch_Time:28556.0min: [2024-01-02 21:20:32,841][model8_pretrain.py][INFO] Epoch:[0/2](35200/4588595) loss:3.312 lr:0.0001853 epoch_Time:28556.0min: [2024-01-02 21:20:32,841][model8_pretrain.py][INFO] Epoch:[0/2](35200/4588595) loss:3.200 lr:0.0001853 epoch_Time:28556.0min: [2024-01-02 21:20:32,841][model8_pretrain.py][INFO] Epoch:[0/2](35200/4588595) loss:3.710 lr:0.0001853 epoch_Time:28556.0min: [2024-01-02 21:20:32,841][model8_pretrain.py][INFO] Epoch:[0/2](35200/4588595) loss:3.618 lr:0.0001853 epoch_Time:28556.0min: [2024-01-02 21:20:32,841][model8_pretrain.py][INFO] Epoch:[0/2](35200/4588595) loss:3.178 lr:0.0001853 epoch_Time:28556.0min: [2024-01-02 
21:21:09,785][model8_pretrain.py][INFO] Epoch:[0/2](35300/4588595) loss:3.414 lr:0.0001848 epoch_Time:28553.0min: [2024-01-02 21:21:09,785][model8_pretrain.py][INFO] Epoch:[0/2](35300/4588595) loss:3.124 lr:0.0001848 epoch_Time:28553.0min: [2024-01-02 21:21:09,785][model8_pretrain.py][INFO] Epoch:[0/2](35300/4588595) loss:3.265 lr:0.0001848 epoch_Time:28553.0min: [2024-01-02 21:21:09,785][model8_pretrain.py][INFO] Epoch:[0/2](35300/4588595) loss:2.455 lr:0.0001848 epoch_Time:28553.0min: [2024-01-02 21:21:09,785][model8_pretrain.py][INFO] Epoch:[0/2](35300/4588595) loss:3.393 lr:0.0001848 epoch_Time:28553.0min: [2024-01-02 21:21:09,785][model8_pretrain.py][INFO] Epoch:[0/2](35300/4588595) loss:2.763 lr:0.0001848 epoch_Time:28553.0min: [2024-01-02 21:21:09,785][model8_pretrain.py][INFO] Epoch:[0/2](35300/4588595) loss:3.290 lr:0.0001848 epoch_Time:28553.0min: [2024-01-02 21:21:09,786][model8_pretrain.py][INFO] Epoch:[0/2](35300/4588595) loss:3.955 lr:0.0001848 epoch_Time:28553.0min: [2024-01-02 21:21:46,745][model8_pretrain.py][INFO] Epoch:[0/2](35400/4588595) loss:2.992 lr:0.0001842 epoch_Time:28552.0min: [2024-01-02 21:21:46,745][model8_pretrain.py][INFO] Epoch:[0/2](35400/4588595) loss:3.703 lr:0.0001842 epoch_Time:28552.0min: [2024-01-02 21:21:46,745][model8_pretrain.py][INFO] Epoch:[0/2](35400/4588595) loss:3.564 lr:0.0001842 epoch_Time:28552.0min: [2024-01-02 21:21:46,745][model8_pretrain.py][INFO] Epoch:[0/2](35400/4588595) loss:2.867 lr:0.0001842 epoch_Time:28552.0min: [2024-01-02 21:21:46,745][model8_pretrain.py][INFO] Epoch:[0/2](35400/4588595) loss:3.352 lr:0.0001842 epoch_Time:28552.0min: [2024-01-02 21:21:46,745][model8_pretrain.py][INFO] Epoch:[0/2](35400/4588595) loss:3.664 lr:0.0001842 epoch_Time:28552.0min: [2024-01-02 21:21:46,745][model8_pretrain.py][INFO] Epoch:[0/2](35400/4588595) loss:3.024 lr:0.0001842 epoch_Time:28552.0min: [2024-01-02 21:21:46,745][model8_pretrain.py][INFO] Epoch:[0/2](35400/4588595) loss:3.407 lr:0.0001842 
epoch_Time:28552.0min: [2024-01-02 21:22:23,717][model8_pretrain.py][INFO] Epoch:[0/2](35500/4588595) loss:3.398 lr:0.0001836 epoch_Time:28549.0min: [2024-01-02 21:22:23,717][model8_pretrain.py][INFO] Epoch:[0/2](35500/4588595) loss:3.794 lr:0.0001836 epoch_Time:28549.0min: [2024-01-02 21:22:23,717][model8_pretrain.py][INFO] Epoch:[0/2](35500/4588595) loss:3.543 lr:0.0001836 epoch_Time:28549.0min: [2024-01-02 21:22:23,717][model8_pretrain.py][INFO] Epoch:[0/2](35500/4588595) loss:3.574 lr:0.0001836 epoch_Time:28549.0min: [2024-01-02 21:22:23,717][model8_pretrain.py][INFO] Epoch:[0/2](35500/4588595) loss:3.309 lr:0.0001836 epoch_Time:28549.0min: [2024-01-02 21:22:23,717][model8_pretrain.py][INFO] Epoch:[0/2](35500/4588595) loss:3.483 lr:0.0001836 epoch_Time:28549.0min: [2024-01-02 21:22:23,717][model8_pretrain.py][INFO] Epoch:[0/2](35500/4588595) loss:3.273 lr:0.0001836 epoch_Time:28549.0min: [2024-01-02 21:22:23,717][model8_pretrain.py][INFO] Epoch:[0/2](35500/4588595) loss:3.338 lr:0.0001836 epoch_Time:28549.0min: [2024-01-02 21:23:00,681][model8_pretrain.py][INFO] Epoch:[0/2](35600/4588595) loss:3.811 lr:0.0001831 epoch_Time:28547.0min: [2024-01-02 21:23:00,681][model8_pretrain.py][INFO] Epoch:[0/2](35600/4588595) loss:3.388 lr:0.0001831 epoch_Time:28547.0min: [2024-01-02 21:23:00,681][model8_pretrain.py][INFO] Epoch:[0/2](35600/4588595) loss:3.490 lr:0.0001831 epoch_Time:28547.0min: [2024-01-02 21:23:00,681][model8_pretrain.py][INFO] Epoch:[0/2](35600/4588595) loss:3.596 lr:0.0001831 epoch_Time:28547.0min: [2024-01-02 21:23:00,681][model8_pretrain.py][INFO] Epoch:[0/2](35600/4588595) loss:2.945 lr:0.0001831 epoch_Time:28547.0min: [2024-01-02 21:23:00,681][model8_pretrain.py][INFO] Epoch:[0/2](35600/4588595) loss:2.996 lr:0.0001831 epoch_Time:28547.0min: [2024-01-02 21:23:00,681][model8_pretrain.py][INFO] Epoch:[0/2](35600/4588595) loss:2.959 lr:0.0001831 epoch_Time:28547.0min: [2024-01-02 21:23:00,682][model8_pretrain.py][INFO] Epoch:[0/2](35600/4588595) 
loss:3.197 lr:0.0001831 epoch_Time:28547.0min: [2024-01-02 21:23:37,632][model8_pretrain.py][INFO] Epoch:[0/2](35700/4588595) loss:3.736 lr:0.0001825 epoch_Time:28546.0min: [2024-01-02 21:23:37,632][model8_pretrain.py][INFO] Epoch:[0/2](35700/4588595) loss:3.140 lr:0.0001825 epoch_Time:28546.0min: [2024-01-02 21:23:37,632][model8_pretrain.py][INFO] Epoch:[0/2](35700/4588595) loss:3.483 lr:0.0001825 epoch_Time:28545.0min: [2024-01-02 21:23:37,632][model8_pretrain.py][INFO] Epoch:[0/2](35700/4588595) loss:2.518 lr:0.0001825 epoch_Time:28546.0min: [2024-01-02 21:23:37,632][model8_pretrain.py][INFO] Epoch:[0/2](35700/4588595) loss:2.831 lr:0.0001825 epoch_Time:28546.0min: [2024-01-02 21:23:37,632][model8_pretrain.py][INFO] Epoch:[0/2](35700/4588595) loss:3.548 lr:0.0001825 epoch_Time:28546.0min: [2024-01-02 21:23:37,632][model8_pretrain.py][INFO] Epoch:[0/2](35700/4588595) loss:3.348 lr:0.0001825 epoch_Time:28546.0min: [2024-01-02 21:23:37,632][model8_pretrain.py][INFO] Epoch:[0/2](35700/4588595) loss:3.120 lr:0.0001825 epoch_Time:28545.0min: [2024-01-02 21:24:21,856][model8_pretrain.py][INFO] Epoch:[0/2](35800/4588595) loss:2.935 lr:0.0001819 epoch_Time:28559.0min: [2024-01-02 21:24:21,856][model8_pretrain.py][INFO] Epoch:[0/2](35800/4588595) loss:3.705 lr:0.0001819 epoch_Time:28559.0min: [2024-01-02 21:24:21,856][model8_pretrain.py][INFO] Epoch:[0/2](35800/4588595) loss:3.377 lr:0.0001819 epoch_Time:28559.0min: [2024-01-02 21:24:21,856][model8_pretrain.py][INFO] Epoch:[0/2](35800/4588595) loss:2.896 lr:0.0001819 epoch_Time:28559.0min: [2024-01-02 21:24:21,856][model8_pretrain.py][INFO] Epoch:[0/2](35800/4588595) loss:3.095 lr:0.0001819 epoch_Time:28559.0min: [2024-01-02 21:24:21,856][model8_pretrain.py][INFO] Epoch:[0/2](35800/4588595) loss:3.587 lr:0.0001819 epoch_Time:28559.0min: [2024-01-02 21:24:21,856][model8_pretrain.py][INFO] Epoch:[0/2](35800/4588595) loss:3.332 lr:0.0001819 epoch_Time:28559.0min: [2024-01-02 21:24:21,856][model8_pretrain.py][INFO] 
Epoch:[0/2](35800/4588595) loss:3.322 lr:0.0001819 epoch_Time:28559.0min: [2024-01-02 21:24:58,784][model8_pretrain.py][INFO] Epoch:[0/2](35900/4588595) loss:3.746 lr:0.0001814 epoch_Time:28556.0min: [2024-01-02 21:24:58,785][model8_pretrain.py][INFO] Epoch:[0/2](35900/4588595) loss:3.509 lr:0.0001814 epoch_Time:28556.0min: [2024-01-02 21:24:58,785][model8_pretrain.py][INFO] Epoch:[0/2](35900/4588595) loss:3.061 lr:0.0001814 epoch_Time:28556.0min: [2024-01-02 21:24:58,785][model8_pretrain.py][INFO] Epoch:[0/2](35900/4588595) loss:3.570 lr:0.0001814 epoch_Time:28556.0min: [2024-01-02 21:24:58,785][model8_pretrain.py][INFO] Epoch:[0/2](35900/4588595) loss:3.471 lr:0.0001814 epoch_Time:28556.0min: [2024-01-02 21:24:58,785][model8_pretrain.py][INFO] Epoch:[0/2](35900/4588595) loss:2.963 lr:0.0001814 epoch_Time:28556.0min: [2024-01-02 21:24:58,785][model8_pretrain.py][INFO] Epoch:[0/2](35900/4588595) loss:3.595 lr:0.0001814 epoch_Time:28556.0min: [2024-01-02 21:24:58,785][model8_pretrain.py][INFO] Epoch:[0/2](35900/4588595) loss:3.413 lr:0.0001814 epoch_Time:28556.0min: [2024-01-02 21:25:35,734][model8_pretrain.py][INFO] Epoch:[0/2](36000/4588595) loss:3.517 lr:0.0001808 epoch_Time:28555.0min: [2024-01-02 21:25:35,734][model8_pretrain.py][INFO] Epoch:[0/2](36000/4588595) loss:3.676 lr:0.0001808 epoch_Time:28555.0min: [2024-01-02 21:25:35,734][model8_pretrain.py][INFO] Epoch:[0/2](36000/4588595) loss:3.307 lr:0.0001808 epoch_Time:28555.0min: [2024-01-02 21:25:35,734][model8_pretrain.py][INFO] Epoch:[0/2](36000/4588595) loss:3.354 lr:0.0001808 epoch_Time:28555.0min: [2024-01-02 21:25:35,734][model8_pretrain.py][INFO] Epoch:[0/2](36000/4588595) loss:3.093 lr:0.0001808 epoch_Time:28555.0min: [2024-01-02 21:25:35,734][model8_pretrain.py][INFO] Epoch:[0/2](36000/4588595) loss:3.850 lr:0.0001808 epoch_Time:28555.0min: [2024-01-02 21:25:35,734][model8_pretrain.py][INFO] Epoch:[0/2](36000/4588595) loss:3.908 lr:0.0001808 epoch_Time:28555.0min: [2024-01-02 
21:25:35,734][model8_pretrain.py][INFO] Epoch:[0/2](36000/4588595) loss:3.631 lr:0.0001808 epoch_Time:28555.0min: [2024-01-02 21:26:12,686][model8_pretrain.py][INFO] Epoch:[0/2](36100/4588595) loss:3.330 lr:0.0001802 epoch_Time:28552.0min: [2024-01-02 21:26:12,686][model8_pretrain.py][INFO] Epoch:[0/2](36100/4588595) loss:3.293 lr:0.0001802 epoch_Time:28552.0min: [2024-01-02 21:26:12,686][model8_pretrain.py][INFO] Epoch:[0/2](36100/4588595) loss:3.499 lr:0.0001802 epoch_Time:28552.0min: [2024-01-02 21:26:12,686][model8_pretrain.py][INFO] Epoch:[0/2](36100/4588595) loss:3.052 lr:0.0001802 epoch_Time:28552.0min: [2024-01-02 21:26:12,686][model8_pretrain.py][INFO] Epoch:[0/2](36100/4588595) loss:2.805 lr:0.0001802 epoch_Time:28552.0min: [2024-01-02 21:26:12,686][model8_pretrain.py][INFO] Epoch:[0/2](36100/4588595) loss:3.100 lr:0.0001802 epoch_Time:28552.0min: [2024-01-02 21:26:12,686][model8_pretrain.py][INFO] Epoch:[0/2](36100/4588595) loss:3.210 lr:0.0001802 epoch_Time:28552.0min: [2024-01-02 21:26:12,686][model8_pretrain.py][INFO] Epoch:[0/2](36100/4588595) loss:3.327 lr:0.0001802 epoch_Time:28552.0min: [2024-01-02 21:26:49,631][model8_pretrain.py][INFO] Epoch:[0/2](36200/4588595) loss:3.566 lr:0.0001797 epoch_Time:28550.0min: [2024-01-02 21:26:49,631][model8_pretrain.py][INFO] Epoch:[0/2](36200/4588595) loss:3.334 lr:0.0001797 epoch_Time:28550.0min: [2024-01-02 21:26:49,631][model8_pretrain.py][INFO] Epoch:[0/2](36200/4588595) loss:3.704 lr:0.0001797 epoch_Time:28550.0min: [2024-01-02 21:26:49,631][model8_pretrain.py][INFO] Epoch:[0/2](36200/4588595) loss:3.466 lr:0.0001797 epoch_Time:28550.0min: [2024-01-02 21:26:49,631][model8_pretrain.py][INFO] Epoch:[0/2](36200/4588595) loss:3.295 lr:0.0001797 epoch_Time:28550.0min: [2024-01-02 21:26:49,631][model8_pretrain.py][INFO] Epoch:[0/2](36200/4588595) loss:3.583 lr:0.0001797 epoch_Time:28550.0min: [2024-01-02 21:26:49,632][model8_pretrain.py][INFO] Epoch:[0/2](36200/4588595) loss:3.594 lr:0.0001797 
epoch_Time:28550.0min: [2024-01-02 21:26:49,632][model8_pretrain.py][INFO] Epoch:[0/2](36200/4588595) loss:3.565 lr:0.0001797 epoch_Time:28550.0min: [2024-01-02 21:27:26,572][model8_pretrain.py][INFO] Epoch:[0/2](36300/4588595) loss:3.737 lr:0.0001791 epoch_Time:28548.0min: [2024-01-02 21:27:26,572][model8_pretrain.py][INFO] Epoch:[0/2](36300/4588595) loss:2.948 lr:0.0001791 epoch_Time:28548.0min: [2024-01-02 21:27:26,572][model8_pretrain.py][INFO] Epoch:[0/2](36300/4588595) loss:3.291 lr:0.0001791 epoch_Time:28548.0min: [2024-01-02 21:27:26,572][model8_pretrain.py][INFO] Epoch:[0/2](36300/4588595) loss:3.512 lr:0.0001791 epoch_Time:28548.0min: [2024-01-02 21:27:26,572][model8_pretrain.py][INFO] Epoch:[0/2](36300/4588595) loss:3.144 lr:0.0001791 epoch_Time:28548.0min: [2024-01-02 21:27:26,572][model8_pretrain.py][INFO] Epoch:[0/2](36300/4588595) loss:3.546 lr:0.0001791 epoch_Time:28548.0min: [2024-01-02 21:27:26,572][model8_pretrain.py][INFO] Epoch:[0/2](36300/4588595) loss:3.450 lr:0.0001791 epoch_Time:28548.0min: [2024-01-02 21:27:26,572][model8_pretrain.py][INFO] Epoch:[0/2](36300/4588595) loss:3.491 lr:0.0001791 epoch_Time:28548.0min: [2024-01-02 21:28:03,518][model8_pretrain.py][INFO] Epoch:[0/2](36400/4588595) loss:3.759 lr:0.0001785 epoch_Time:28546.0min: [2024-01-02 21:28:03,518][model8_pretrain.py][INFO] Epoch:[0/2](36400/4588595) loss:3.183 lr:0.0001785 epoch_Time:28546.0min: [2024-01-02 21:28:03,518][model8_pretrain.py][INFO] Epoch:[0/2](36400/4588595) loss:3.145 lr:0.0001785 epoch_Time:28546.0min: [2024-01-02 21:28:03,518][model8_pretrain.py][INFO] Epoch:[0/2](36400/4588595) loss:3.509 lr:0.0001785 epoch_Time:28546.0min: [2024-01-02 21:28:03,518][model8_pretrain.py][INFO] Epoch:[0/2](36400/4588595) loss:3.740 lr:0.0001785 epoch_Time:28546.0min: [2024-01-02 21:28:03,518][model8_pretrain.py][INFO] Epoch:[0/2](36400/4588595) loss:3.412 lr:0.0001785 epoch_Time:28546.0min: [2024-01-02 21:28:03,518][model8_pretrain.py][INFO] Epoch:[0/2](36400/4588595) 
loss:3.220 lr:0.0001785 epoch_Time:28546.0min: [2024-01-02 21:28:03,518][model8_pretrain.py][INFO] Epoch:[0/2](36400/4588595) loss:3.039 lr:0.0001785 epoch_Time:28546.0min: [2024-01-02 21:28:40,454][model8_pretrain.py][INFO] Epoch:[0/2](36500/4588595) loss:3.327 lr:0.0001780 epoch_Time:28544.0min: [2024-01-02 21:28:40,454][model8_pretrain.py][INFO] Epoch:[0/2](36500/4588595) loss:3.429 lr:0.0001780 epoch_Time:28544.0min: [2024-01-02 21:28:40,454][model8_pretrain.py][INFO] Epoch:[0/2](36500/4588595) loss:3.562 lr:0.0001780 epoch_Time:28545.0min: [2024-01-02 21:28:40,454][model8_pretrain.py][INFO] Epoch:[0/2](36500/4588595) loss:3.784 lr:0.0001780 epoch_Time:28544.0min: [2024-01-02 21:28:40,454][model8_pretrain.py][INFO] Epoch:[0/2](36500/4588595) loss:3.264 lr:0.0001780 epoch_Time:28544.0min: [2024-01-02 21:28:40,454][model8_pretrain.py][INFO] Epoch:[0/2](36500/4588595) loss:2.964 lr:0.0001780 epoch_Time:28544.0min: [2024-01-02 21:28:40,454][model8_pretrain.py][INFO] Epoch:[0/2](36500/4588595) loss:3.307 lr:0.0001780 epoch_Time:28544.0min: [2024-01-02 21:28:40,457][model8_pretrain.py][INFO] Epoch:[0/2](36500/4588595) loss:4.143 lr:0.0001780 epoch_Time:28544.0min: [2024-01-02 21:29:24,437][model8_pretrain.py][INFO] Epoch:[0/2](36600/4588595) loss:3.203 lr:0.0001774 epoch_Time:28557.0min: [2024-01-02 21:29:24,437][model8_pretrain.py][INFO] Epoch:[0/2](36600/4588595) loss:3.786 lr:0.0001774 epoch_Time:28557.0min: [2024-01-02 21:29:24,437][model8_pretrain.py][INFO] Epoch:[0/2](36600/4588595) loss:2.681 lr:0.0001774 epoch_Time:28557.0min: [2024-01-02 21:29:24,437][model8_pretrain.py][INFO] Epoch:[0/2](36600/4588595) loss:3.184 lr:0.0001774 epoch_Time:28557.0min: [2024-01-02 21:29:24,437][model8_pretrain.py][INFO] Epoch:[0/2](36600/4588595) loss:2.746 lr:0.0001774 epoch_Time:28557.0min: [2024-01-02 21:29:24,437][model8_pretrain.py][INFO] Epoch:[0/2](36600/4588595) loss:3.199 lr:0.0001774 epoch_Time:28557.0min: [2024-01-02 21:29:24,437][model8_pretrain.py][INFO] 
Epoch:[0/2](36600/4588595) loss:3.239 lr:0.0001774 epoch_Time:28557.0min: [2024-01-02 21:29:24,438][model8_pretrain.py][INFO] Epoch:[0/2](36600/4588595) loss:3.392 lr:0.0001774 epoch_Time:28557.0min: [2024-01-02 21:30:01,387][model8_pretrain.py][INFO] Epoch:[0/2](36700/4588595) loss:2.987 lr:0.0001768 epoch_Time:28554.0min: [2024-01-02 21:30:01,387][model8_pretrain.py][INFO] Epoch:[0/2](36700/4588595) loss:3.253 lr:0.0001768 epoch_Time:28554.0min: [2024-01-02 21:30:01,387][model8_pretrain.py][INFO] Epoch:[0/2](36700/4588595) loss:3.161 lr:0.0001768 epoch_Time:28554.0min: [2024-01-02 21:30:01,387][model8_pretrain.py][INFO] Epoch:[0/2](36700/4588595) loss:3.360 lr:0.0001768 epoch_Time:28554.0min: [2024-01-02 21:30:01,387][model8_pretrain.py][INFO] Epoch:[0/2](36700/4588595) loss:3.220 lr:0.0001768 epoch_Time:28554.0min: [2024-01-02 21:30:01,387][model8_pretrain.py][INFO] Epoch:[0/2](36700/4588595) loss:3.280 lr:0.0001768 epoch_Time:28554.0min: [2024-01-02 21:30:01,387][model8_pretrain.py][INFO] Epoch:[0/2](36700/4588595) loss:2.922 lr:0.0001768 epoch_Time:28554.0min: [2024-01-02 21:30:01,387][model8_pretrain.py][INFO] Epoch:[0/2](36700/4588595) loss:3.447 lr:0.0001768 epoch_Time:28554.0min: [2024-01-02 21:30:38,339][model8_pretrain.py][INFO] Epoch:[0/2](36800/4588595) loss:3.395 lr:0.0001763 epoch_Time:28553.0min: [2024-01-02 21:30:38,339][model8_pretrain.py][INFO] Epoch:[0/2](36800/4588595) loss:3.624 lr:0.0001763 epoch_Time:28553.0min: [2024-01-02 21:30:38,339][model8_pretrain.py][INFO] Epoch:[0/2](36800/4588595) loss:3.357 lr:0.0001763 epoch_Time:28553.0min: [2024-01-02 21:30:38,339][model8_pretrain.py][INFO] Epoch:[0/2](36800/4588595) loss:3.342 lr:0.0001763 epoch_Time:28553.0min: [2024-01-02 21:30:38,339][model8_pretrain.py][INFO] Epoch:[0/2](36800/4588595) loss:3.404 lr:0.0001763 epoch_Time:28553.0min: [2024-01-02 21:30:38,339][model8_pretrain.py][INFO] Epoch:[0/2](36800/4588595) loss:3.392 lr:0.0001763 epoch_Time:28553.0min: [2024-01-02 
21:30:38,339][model8_pretrain.py][INFO] Epoch:[0/2](36800/4588595) loss:3.687 lr:0.0001763 epoch_Time:28553.0min: [2024-01-02 21:30:38,339][model8_pretrain.py][INFO] Epoch:[0/2](36800/4588595) loss:2.897 lr:0.0001763 epoch_Time:28553.0min: [2024-01-02 21:31:15,292][model8_pretrain.py][INFO] Epoch:[0/2](36900/4588595) loss:3.705 lr:0.0001757 epoch_Time:28550.0min: [2024-01-02 21:31:15,292][model8_pretrain.py][INFO] Epoch:[0/2](36900/4588595) loss:3.491 lr:0.0001757 epoch_Time:28551.0min: [2024-01-02 21:31:15,292][model8_pretrain.py][INFO] Epoch:[0/2](36900/4588595) loss:3.169 lr:0.0001757 epoch_Time:28550.0min: [2024-01-02 21:31:15,292][model8_pretrain.py][INFO] Epoch:[0/2](36900/4588595) loss:2.950 lr:0.0001757 epoch_Time:28550.0min: [2024-01-02 21:31:15,292][model8_pretrain.py][INFO] Epoch:[0/2](36900/4588595) loss:3.247 lr:0.0001757 epoch_Time:28550.0min: [2024-01-02 21:31:15,292][model8_pretrain.py][INFO] Epoch:[0/2](36900/4588595) loss:3.684 lr:0.0001757 epoch_Time:28550.0min: [2024-01-02 21:31:15,292][model8_pretrain.py][INFO] Epoch:[0/2](36900/4588595) loss:3.721 lr:0.0001757 epoch_Time:28550.0min: [2024-01-02 21:31:15,292][model8_pretrain.py][INFO] Epoch:[0/2](36900/4588595) loss:3.369 lr:0.0001757 epoch_Time:28550.0min: [2024-01-02 21:31:52,236][model8_pretrain.py][INFO] Epoch:[0/2](37000/4588595) loss:3.091 lr:0.0001751 epoch_Time:28548.0min: [2024-01-02 21:31:52,236][model8_pretrain.py][INFO] Epoch:[0/2](37000/4588595) loss:3.384 lr:0.0001751 epoch_Time:28548.0min: [2024-01-02 21:31:52,236][model8_pretrain.py][INFO] Epoch:[0/2](37000/4588595) loss:3.471 lr:0.0001751 epoch_Time:28548.0min: [2024-01-02 21:31:52,236][model8_pretrain.py][INFO] Epoch:[0/2](37000/4588595) loss:3.234 lr:0.0001751 epoch_Time:28548.0min: [2024-01-02 21:31:52,236][model8_pretrain.py][INFO] Epoch:[0/2](37000/4588595) loss:3.663 lr:0.0001751 epoch_Time:28548.0min: [2024-01-02 21:31:52,236][model8_pretrain.py][INFO] Epoch:[0/2](37000/4588595) loss:3.175 lr:0.0001751 
epoch_Time:28548.0min: [2024-01-02 21:31:52,236][model8_pretrain.py][INFO] Epoch:[0/2](37000/4588595) loss:3.364 lr:0.0001751 epoch_Time:28548.0min: [2024-01-02 21:31:52,236][model8_pretrain.py][INFO] Epoch:[0/2](37000/4588595) loss:3.432 lr:0.0001751 epoch_Time:28548.0min: [2024-01-02 21:32:29,179][model8_pretrain.py][INFO] Epoch:[0/2](37100/4588595) loss:3.618 lr:0.0001745 epoch_Time:28547.0min: [2024-01-02 21:32:29,179][model8_pretrain.py][INFO] Epoch:[0/2](37100/4588595) loss:3.313 lr:0.0001745 epoch_Time:28547.0min: [2024-01-02 21:32:29,179][model8_pretrain.py][INFO] Epoch:[0/2](37100/4588595) loss:3.859 lr:0.0001745 epoch_Time:28547.0min: [2024-01-02 21:32:29,179][model8_pretrain.py][INFO] Epoch:[0/2](37100/4588595) loss:3.659 lr:0.0001745 epoch_Time:28547.0min: [2024-01-02 21:32:29,179][model8_pretrain.py][INFO] Epoch:[0/2](37100/4588595) loss:3.622 lr:0.0001745 epoch_Time:28547.0min: [2024-01-02 21:32:29,179][model8_pretrain.py][INFO] Epoch:[0/2](37100/4588595) loss:3.713 lr:0.0001745 epoch_Time:28547.0min: [2024-01-02 21:32:29,180][model8_pretrain.py][INFO] Epoch:[0/2](37100/4588595) loss:3.503 lr:0.0001745 epoch_Time:28547.0min: [2024-01-02 21:32:29,180][model8_pretrain.py][INFO] Epoch:[0/2](37100/4588595) loss:3.520 lr:0.0001745 epoch_Time:28547.0min: [2024-01-02 21:33:06,196][model8_pretrain.py][INFO] Epoch:[0/2](37200/4588595) loss:3.321 lr:0.0001740 epoch_Time:28544.0min: [2024-01-02 21:33:06,196][model8_pretrain.py][INFO] Epoch:[0/2](37200/4588595) loss:2.980 lr:0.0001740 epoch_Time:28544.0min: [2024-01-02 21:33:06,196][model8_pretrain.py][INFO] Epoch:[0/2](37200/4588595) loss:3.358 lr:0.0001740 epoch_Time:28544.0min: [2024-01-02 21:33:06,196][model8_pretrain.py][INFO] Epoch:[0/2](37200/4588595) loss:2.816 lr:0.0001740 epoch_Time:28544.0min: [2024-01-02 21:33:06,196][model8_pretrain.py][INFO] Epoch:[0/2](37200/4588595) loss:3.028 lr:0.0001740 epoch_Time:28544.0min: [2024-01-02 21:33:06,196][model8_pretrain.py][INFO] Epoch:[0/2](37200/4588595) 
loss:3.477 lr:0.0001740 epoch_Time:28544.0min: [2024-01-02 21:33:06,196][model8_pretrain.py][INFO] Epoch:[0/2](37200/4588595) loss:3.366 lr:0.0001740 epoch_Time:28544.0min: [2024-01-02 21:33:06,196][model8_pretrain.py][INFO] Epoch:[0/2](37200/4588595) loss:3.241 lr:0.0001740 epoch_Time:28544.0min: [2024-01-02 21:33:43,160][model8_pretrain.py][INFO] Epoch:[0/2](37300/4588595) loss:3.442 lr:0.0001734 epoch_Time:28543.0min: [2024-01-02 21:33:43,160][model8_pretrain.py][INFO] Epoch:[0/2](37300/4588595) loss:3.473 lr:0.0001734 epoch_Time:28543.0min: [2024-01-02 21:33:43,160][model8_pretrain.py][INFO] Epoch:[0/2](37300/4588595) loss:3.354 lr:0.0001734 epoch_Time:28543.0min: [2024-01-02 21:33:43,160][model8_pretrain.py][INFO] Epoch:[0/2](37300/4588595) loss:3.068 lr:0.0001734 epoch_Time:28543.0min: [2024-01-02 21:33:43,160][model8_pretrain.py][INFO] Epoch:[0/2](37300/4588595) loss:2.532 lr:0.0001734 epoch_Time:28543.0min: [2024-01-02 21:33:43,160][model8_pretrain.py][INFO] Epoch:[0/2](37300/4588595) loss:3.266 lr:0.0001734 epoch_Time:28543.0min: [2024-01-02 21:33:43,160][model8_pretrain.py][INFO] Epoch:[0/2](37300/4588595) loss:3.002 lr:0.0001734 epoch_Time:28543.0min: [2024-01-02 21:33:43,160][model8_pretrain.py][INFO] Epoch:[0/2](37300/4588595) loss:3.691 lr:0.0001734 epoch_Time:28543.0min: [2024-01-02 21:34:27,062][model8_pretrain.py][INFO] Epoch:[0/2](37400/4588595) loss:3.429 lr:0.0001728 epoch_Time:28555.0min: [2024-01-02 21:34:27,062][model8_pretrain.py][INFO] Epoch:[0/2](37400/4588595) loss:3.862 lr:0.0001728 epoch_Time:28555.0min: [2024-01-02 21:34:27,062][model8_pretrain.py][INFO] Epoch:[0/2](37400/4588595) loss:3.419 lr:0.0001728 epoch_Time:28555.0min: [2024-01-02 21:34:27,062][model8_pretrain.py][INFO] Epoch:[0/2](37400/4588595) loss:3.195 lr:0.0001728 epoch_Time:28555.0min: [2024-01-02 21:34:27,062][model8_pretrain.py][INFO] Epoch:[0/2](37400/4588595) loss:3.293 lr:0.0001728 epoch_Time:28555.0min: [2024-01-02 21:34:27,062][model8_pretrain.py][INFO] 
Epoch:[0/2](37400/4588595) loss:2.951 lr:0.0001728 epoch_Time:28555.0min: [2024-01-02 21:34:27,062][model8_pretrain.py][INFO] Epoch:[0/2](37400/4588595) loss:3.516 lr:0.0001728 epoch_Time:28555.0min: [2024-01-02 21:34:27,062][model8_pretrain.py][INFO] Epoch:[0/2](37400/4588595) loss:3.698 lr:0.0001728 epoch_Time:28555.0min: [2024-01-02 21:35:04,015][model8_pretrain.py][INFO] Epoch:[0/2](37500/4588595) loss:3.294 lr:0.0001723 epoch_Time:28552.0min: [2024-01-02 21:35:04,015][model8_pretrain.py][INFO] Epoch:[0/2](37500/4588595) loss:3.505 lr:0.0001723 epoch_Time:28552.0min: [2024-01-02 21:35:04,015][model8_pretrain.py][INFO] Epoch:[0/2](37500/4588595) loss:3.416 lr:0.0001723 epoch_Time:28552.0min: [2024-01-02 21:35:04,015][model8_pretrain.py][INFO] Epoch:[0/2](37500/4588595) loss:3.628 lr:0.0001723 epoch_Time:28552.0min: [2024-01-02 21:35:04,015][model8_pretrain.py][INFO] Epoch:[0/2](37500/4588595) loss:3.705 lr:0.0001723 epoch_Time:28552.0min: [2024-01-02 21:35:04,015][model8_pretrain.py][INFO] Epoch:[0/2](37500/4588595) loss:3.197 lr:0.0001723 epoch_Time:28552.0min: [2024-01-02 21:35:04,015][model8_pretrain.py][INFO] Epoch:[0/2](37500/4588595) loss:3.122 lr:0.0001723 epoch_Time:28552.0min: [2024-01-02 21:35:04,016][model8_pretrain.py][INFO] Epoch:[0/2](37500/4588595) loss:2.976 lr:0.0001723 epoch_Time:28552.0min: [2024-01-02 21:35:40,960][model8_pretrain.py][INFO] Epoch:[0/2](37600/4588595) loss:3.525 lr:0.0001717 epoch_Time:28551.0min: [2024-01-02 21:35:40,960][model8_pretrain.py][INFO] Epoch:[0/2](37600/4588595) loss:3.336 lr:0.0001717 epoch_Time:28551.0min: [2024-01-02 21:35:40,960][model8_pretrain.py][INFO] Epoch:[0/2](37600/4588595) loss:3.476 lr:0.0001717 epoch_Time:28551.0min: [2024-01-02 21:35:40,960][model8_pretrain.py][INFO] Epoch:[0/2](37600/4588595) loss:3.759 lr:0.0001717 epoch_Time:28551.0min: [2024-01-02 21:35:40,960][model8_pretrain.py][INFO] Epoch:[0/2](37600/4588595) loss:3.365 lr:0.0001717 epoch_Time:28551.0min: [2024-01-02 
21:35:40,960][model8_pretrain.py][INFO] Epoch:[0/2](37600/4588595) loss:3.484 lr:0.0001717 epoch_Time:28551.0min: [2024-01-02 21:35:40,960][model8_pretrain.py][INFO] Epoch:[0/2](37600/4588595) loss:3.322 lr:0.0001717 epoch_Time:28551.0min: [2024-01-02 21:35:40,960][model8_pretrain.py][INFO] Epoch:[0/2](37600/4588595) loss:3.613 lr:0.0001717 epoch_Time:28551.0min: [2024-01-02 21:36:17,902][model8_pretrain.py][INFO] Epoch:[0/2](37700/4588595) loss:3.204 lr:0.0001711 epoch_Time:28549.0min: [2024-01-02 21:36:17,902][model8_pretrain.py][INFO] Epoch:[0/2](37700/4588595) loss:3.564 lr:0.0001711 epoch_Time:28549.0min: [2024-01-02 21:36:17,902][model8_pretrain.py][INFO] Epoch:[0/2](37700/4588595) loss:2.869 lr:0.0001711 epoch_Time:28549.0min: [2024-01-02 21:36:17,902][model8_pretrain.py][INFO] Epoch:[0/2](37700/4588595) loss:3.004 lr:0.0001711 epoch_Time:28549.0min: [2024-01-02 21:36:17,902][model8_pretrain.py][INFO] Epoch:[0/2](37700/4588595) loss:3.232 lr:0.0001711 epoch_Time:28549.0min: [2024-01-02 21:36:17,902][model8_pretrain.py][INFO] Epoch:[0/2](37700/4588595) loss:3.574 lr:0.0001711 epoch_Time:28549.0min: [2024-01-02 21:36:17,902][model8_pretrain.py][INFO] Epoch:[0/2](37700/4588595) loss:3.422 lr:0.0001711 epoch_Time:28549.0min: [2024-01-02 21:36:17,902][model8_pretrain.py][INFO] Epoch:[0/2](37700/4588595) loss:2.881 lr:0.0001711 epoch_Time:28549.0min: [2024-01-02 21:36:54,840][model8_pretrain.py][INFO] Epoch:[0/2](37800/4588595) loss:3.328 lr:0.0001705 epoch_Time:28546.0min: [2024-01-02 21:36:54,841][model8_pretrain.py][INFO] Epoch:[0/2](37800/4588595) loss:3.381 lr:0.0001705 epoch_Time:28546.0min: [2024-01-02 21:36:54,841][model8_pretrain.py][INFO] Epoch:[0/2](37800/4588595) loss:3.524 lr:0.0001705 epoch_Time:28546.0min: [2024-01-02 21:36:54,841][model8_pretrain.py][INFO] Epoch:[0/2](37800/4588595) loss:3.568 lr:0.0001705 epoch_Time:28546.0min: [2024-01-02 21:36:54,841][model8_pretrain.py][INFO] Epoch:[0/2](37800/4588595) loss:2.856 lr:0.0001705 
epoch_Time:28546.0min: [2024-01-02 21:36:54,841][model8_pretrain.py][INFO] Epoch:[0/2](37800/4588595) loss:3.120 lr:0.0001705 epoch_Time:28546.0min: [2024-01-02 21:36:54,841][model8_pretrain.py][INFO] Epoch:[0/2](37800/4588595) loss:3.591 lr:0.0001705 epoch_Time:28546.0min: [2024-01-02 21:36:54,841][model8_pretrain.py][INFO] Epoch:[0/2](37800/4588595) loss:2.965 lr:0.0001705 epoch_Time:28546.0min: [2024-01-02 21:37:31,803][model8_pretrain.py][INFO] Epoch:[0/2](37900/4588595) loss:3.210 lr:0.0001700 epoch_Time:28545.0min: [2024-01-02 21:37:31,803][model8_pretrain.py][INFO] Epoch:[0/2](37900/4588595) loss:3.375 lr:0.0001700 epoch_Time:28545.0min: [2024-01-02 21:37:31,803][model8_pretrain.py][INFO] Epoch:[0/2](37900/4588595) loss:3.345 lr:0.0001700 epoch_Time:28545.0min: [2024-01-02 21:37:31,803][model8_pretrain.py][INFO] Epoch:[0/2](37900/4588595) loss:2.663 lr:0.0001700 epoch_Time:28545.0min: [2024-01-02 21:37:31,803][model8_pretrain.py][INFO] Epoch:[0/2](37900/4588595) loss:3.368 lr:0.0001700 epoch_Time:28545.0min: [2024-01-02 21:37:31,803][model8_pretrain.py][INFO] Epoch:[0/2](37900/4588595) loss:3.442 lr:0.0001700 epoch_Time:28545.0min: [2024-01-02 21:37:31,803][model8_pretrain.py][INFO] Epoch:[0/2](37900/4588595) loss:2.775 lr:0.0001700 epoch_Time:28545.0min: [2024-01-02 21:37:31,804][model8_pretrain.py][INFO] Epoch:[0/2](37900/4588595) loss:3.557 lr:0.0001700 epoch_Time:28545.0min: [2024-01-02 21:38:08,777][model8_pretrain.py][INFO] Epoch:[0/2](38000/4588595) loss:3.375 lr:0.0001694 epoch_Time:28542.0min: [2024-01-02 21:38:08,777][model8_pretrain.py][INFO] Epoch:[0/2](38000/4588595) loss:3.570 lr:0.0001694 epoch_Time:28542.0min: [2024-01-02 21:38:08,777][model8_pretrain.py][INFO] Epoch:[0/2](38000/4588595) loss:3.355 lr:0.0001694 epoch_Time:28542.0min: [2024-01-02 21:38:08,777][model8_pretrain.py][INFO] Epoch:[0/2](38000/4588595) loss:3.553 lr:0.0001694 epoch_Time:28542.0min: [2024-01-02 21:38:08,777][model8_pretrain.py][INFO] Epoch:[0/2](38000/4588595) 
loss:3.463 lr:0.0001694 epoch_Time:28543.0min: [2024-01-02 21:38:08,778][model8_pretrain.py][INFO] Epoch:[0/2](38000/4588595) loss:3.115 lr:0.0001694 epoch_Time:28542.0min: [2024-01-02 21:38:08,778][model8_pretrain.py][INFO] Epoch:[0/2](38000/4588595) loss:2.836 lr:0.0001694 epoch_Time:28542.0min: [2024-01-02 21:38:08,779][model8_pretrain.py][INFO] Epoch:[0/2](38000/4588595) loss:2.761 lr:0.0001694 epoch_Time:28542.0min: [2024-01-02 21:38:45,737][model8_pretrain.py][INFO] Epoch:[0/2](38100/4588595) loss:3.341 lr:0.0001688 epoch_Time:28541.0min: [2024-01-02 21:38:45,737][model8_pretrain.py][INFO] Epoch:[0/2](38100/4588595) loss:3.139 lr:0.0001688 epoch_Time:28541.0min: [2024-01-02 21:38:45,737][model8_pretrain.py][INFO] Epoch:[0/2](38100/4588595) loss:3.621 lr:0.0001688 epoch_Time:28541.0min: [2024-01-02 21:38:45,737][model8_pretrain.py][INFO] Epoch:[0/2](38100/4588595) loss:3.458 lr:0.0001688 epoch_Time:28541.0min: [2024-01-02 21:38:45,737][model8_pretrain.py][INFO] Epoch:[0/2](38100/4588595) loss:3.673 lr:0.0001688 epoch_Time:28541.0min: [2024-01-02 21:38:45,737][model8_pretrain.py][INFO] Epoch:[0/2](38100/4588595) loss:3.826 lr:0.0001688 epoch_Time:28541.0min: [2024-01-02 21:38:45,737][model8_pretrain.py][INFO] Epoch:[0/2](38100/4588595) loss:2.704 lr:0.0001688 epoch_Time:28541.0min: [2024-01-02 21:38:45,737][model8_pretrain.py][INFO] Epoch:[0/2](38100/4588595) loss:3.224 lr:0.0001688 epoch_Time:28541.0min: [2024-01-02 21:39:29,704][model8_pretrain.py][INFO] Epoch:[0/2](38200/4588595) loss:3.396 lr:0.0001682 epoch_Time:28553.0min: [2024-01-02 21:39:29,703][model8_pretrain.py][INFO] Epoch:[0/2](38200/4588595) loss:3.276 lr:0.0001682 epoch_Time:28553.0min: [2024-01-02 21:39:29,704][model8_pretrain.py][INFO] Epoch:[0/2](38200/4588595) loss:3.770 lr:0.0001682 epoch_Time:28553.0min: [2024-01-02 21:39:29,704][model8_pretrain.py][INFO] Epoch:[0/2](38200/4588595) loss:3.293 lr:0.0001682 epoch_Time:28553.0min: [2024-01-02 21:39:29,704][model8_pretrain.py][INFO] 
Epoch:[0/2](38200/4588595) loss:2.823 lr:0.0001682 epoch_Time:28553.0min: [2024-01-02 21:39:29,704][model8_pretrain.py][INFO] Epoch:[0/2](38200/4588595) loss:3.457 lr:0.0001682 epoch_Time:28553.0min: [2024-01-02 21:39:29,704][model8_pretrain.py][INFO] Epoch:[0/2](38200/4588595) loss:3.232 lr:0.0001682 epoch_Time:28553.0min: [2024-01-02 21:39:29,704][model8_pretrain.py][INFO] Epoch:[0/2](38200/4588595) loss:3.581 lr:0.0001682 epoch_Time:28553.0min: [2024-01-02 21:40:06,640][model8_pretrain.py][INFO] Epoch:[0/2](38300/4588595) loss:3.655 lr:0.0001677 epoch_Time:28550.0min: [2024-01-02 21:40:06,640][model8_pretrain.py][INFO] Epoch:[0/2](38300/4588595) loss:3.240 lr:0.0001677 epoch_Time:28550.0min: [2024-01-02 21:40:06,640][model8_pretrain.py][INFO] Epoch:[0/2](38300/4588595) loss:3.221 lr:0.0001677 epoch_Time:28550.0min: [2024-01-02 21:40:06,640][model8_pretrain.py][INFO] Epoch:[0/2](38300/4588595) loss:3.068 lr:0.0001677 epoch_Time:28550.0min: [2024-01-02 21:40:06,640][model8_pretrain.py][INFO] Epoch:[0/2](38300/4588595) loss:3.075 lr:0.0001677 epoch_Time:28550.0min: [2024-01-02 21:40:06,640][model8_pretrain.py][INFO] Epoch:[0/2](38300/4588595) loss:3.744 lr:0.0001677 epoch_Time:28550.0min: [2024-01-02 21:40:06,640][model8_pretrain.py][INFO] Epoch:[0/2](38300/4588595) loss:3.075 lr:0.0001677 epoch_Time:28550.0min: [2024-01-02 21:40:06,640][model8_pretrain.py][INFO] Epoch:[0/2](38300/4588595) loss:3.232 lr:0.0001677 epoch_Time:28550.0min: [2024-01-02 21:40:43,587][model8_pretrain.py][INFO] Epoch:[0/2](38400/4588595) loss:3.136 lr:0.0001671 epoch_Time:28549.0min: [2024-01-02 21:40:43,587][model8_pretrain.py][INFO] Epoch:[0/2](38400/4588595) loss:3.609 lr:0.0001671 epoch_Time:28549.0min: [2024-01-02 21:40:43,587][model8_pretrain.py][INFO] Epoch:[0/2](38400/4588595) loss:3.560 lr:0.0001671 epoch_Time:28549.0min: [2024-01-02 21:40:43,587][model8_pretrain.py][INFO] Epoch:[0/2](38400/4588595) loss:3.886 lr:0.0001671 epoch_Time:28549.0min: [2024-01-02 
21:40:43,587][model8_pretrain.py][INFO] Epoch:[0/2](38400/4588595) loss:3.141 lr:0.0001671 epoch_Time:28549.0min: [2024-01-02 21:40:43,587][model8_pretrain.py][INFO] Epoch:[0/2](38400/4588595) loss:3.540 lr:0.0001671 epoch_Time:28549.0min: [2024-01-02 21:40:43,587][model8_pretrain.py][INFO] Epoch:[0/2](38400/4588595) loss:3.305 lr:0.0001671 epoch_Time:28549.0min: [2024-01-02 21:40:43,588][model8_pretrain.py][INFO] Epoch:[0/2](38400/4588595) loss:3.253 lr:0.0001671 epoch_Time:28549.0min: [2024-01-02 21:41:20,542][model8_pretrain.py][INFO] Epoch:[0/2](38500/4588595) loss:2.996 lr:0.0001665 epoch_Time:28547.0min: [2024-01-02 21:41:20,543][model8_pretrain.py][INFO] Epoch:[0/2](38500/4588595) loss:3.275 lr:0.0001665 epoch_Time:28547.0min: [2024-01-02 21:41:20,543][model8_pretrain.py][INFO] Epoch:[0/2](38500/4588595) loss:3.604 lr:0.0001665 epoch_Time:28547.0min: [2024-01-02 21:41:20,543][model8_pretrain.py][INFO] Epoch:[0/2](38500/4588595) loss:2.958 lr:0.0001665 epoch_Time:28547.0min: [2024-01-02 21:41:20,543][model8_pretrain.py][INFO] Epoch:[0/2](38500/4588595) loss:3.388 lr:0.0001665 epoch_Time:28547.0min: [2024-01-02 21:41:20,543][model8_pretrain.py][INFO] Epoch:[0/2](38500/4588595) loss:3.042 lr:0.0001665 epoch_Time:28547.0min: [2024-01-02 21:41:20,543][model8_pretrain.py][INFO] Epoch:[0/2](38500/4588595) loss:3.352 lr:0.0001665 epoch_Time:28547.0min: [2024-01-02 21:41:20,543][model8_pretrain.py][INFO] Epoch:[0/2](38500/4588595) loss:3.444 lr:0.0001665 epoch_Time:28547.0min: [2024-01-02 21:41:57,480][model8_pretrain.py][INFO] Epoch:[0/2](38600/4588595) loss:3.113 lr:0.0001659 epoch_Time:28544.0min: [2024-01-02 21:41:57,480][model8_pretrain.py][INFO] Epoch:[0/2](38600/4588595) loss:3.812 lr:0.0001659 epoch_Time:28544.0min: [2024-01-02 21:41:57,480][model8_pretrain.py][INFO] Epoch:[0/2](38600/4588595) loss:3.204 lr:0.0001659 epoch_Time:28544.0min: [2024-01-02 21:41:57,480][model8_pretrain.py][INFO] Epoch:[0/2](38600/4588595) loss:3.643 lr:0.0001659 
epoch_Time:28544.0min: [2024-01-02 21:41:57,480][model8_pretrain.py][INFO] Epoch:[0/2](38600/4588595) loss:3.298 lr:0.0001659 epoch_Time:28544.0min: [2024-01-02 21:41:57,480][model8_pretrain.py][INFO] Epoch:[0/2](38600/4588595) loss:3.495 lr:0.0001659 epoch_Time:28544.0min: [2024-01-02 21:41:57,480][model8_pretrain.py][INFO] Epoch:[0/2](38600/4588595) loss:3.594 lr:0.0001659 epoch_Time:28544.0min: [2024-01-02 21:41:57,480][model8_pretrain.py][INFO] Epoch:[0/2](38600/4588595) loss:3.255 lr:0.0001659 epoch_Time:28544.0min: [2024-01-02 21:42:34,424][model8_pretrain.py][INFO] Epoch:[0/2](38700/4588595) loss:3.647 lr:0.0001654 epoch_Time:28543.0min: [2024-01-02 21:42:34,424][model8_pretrain.py][INFO] Epoch:[0/2](38700/4588595) loss:3.351 lr:0.0001654 epoch_Time:28543.0min: [2024-01-02 21:42:34,424][model8_pretrain.py][INFO] Epoch:[0/2](38700/4588595) loss:3.541 lr:0.0001654 epoch_Time:28543.0min: [2024-01-02 21:42:34,425][model8_pretrain.py][INFO] Epoch:[0/2](38700/4588595) loss:3.471 lr:0.0001654 epoch_Time:28543.0min: [2024-01-02 21:42:34,425][model8_pretrain.py][INFO] Epoch:[0/2](38700/4588595) loss:3.482 lr:0.0001654 epoch_Time:28543.0min: [2024-01-02 21:42:34,425][model8_pretrain.py][INFO] Epoch:[0/2](38700/4588595) loss:3.262 lr:0.0001654 epoch_Time:28543.0min: [2024-01-02 21:42:34,424][model8_pretrain.py][INFO] Epoch:[0/2](38700/4588595) loss:3.036 lr:0.0001654 epoch_Time:28543.0min: [2024-01-02 21:42:34,425][model8_pretrain.py][INFO] Epoch:[0/2](38700/4588595) loss:3.210 lr:0.0001654 epoch_Time:28543.0min: [2024-01-02 21:43:11,329][model8_pretrain.py][INFO] Epoch:[0/2](38800/4588595) loss:3.427 lr:0.0001648 epoch_Time:28540.0min: [2024-01-02 21:43:11,329][model8_pretrain.py][INFO] Epoch:[0/2](38800/4588595) loss:3.370 lr:0.0001648 epoch_Time:28540.0min: [2024-01-02 21:43:11,329][model8_pretrain.py][INFO] Epoch:[0/2](38800/4588595) loss:3.090 lr:0.0001648 epoch_Time:28540.0min: [2024-01-02 21:43:11,329][model8_pretrain.py][INFO] Epoch:[0/2](38800/4588595) 
loss:3.419 lr:0.0001648 epoch_Time:28540.0min: [2024-01-02 21:43:11,329][model8_pretrain.py][INFO] Epoch:[0/2](38800/4588595) loss:3.113 lr:0.0001648 epoch_Time:28540.0min: [2024-01-02 21:43:11,329][model8_pretrain.py][INFO] Epoch:[0/2](38800/4588595) loss:3.662 lr:0.0001648 epoch_Time:28540.0min: [2024-01-02 21:43:11,329][model8_pretrain.py][INFO] Epoch:[0/2](38800/4588595) loss:3.380 lr:0.0001648 epoch_Time:28540.0min: [2024-01-02 21:43:11,329][model8_pretrain.py][INFO] Epoch:[0/2](38800/4588595) loss:3.296 lr:0.0001648 epoch_Time:28540.0min: [2024-01-02 21:43:48,250][model8_pretrain.py][INFO] Epoch:[0/2](38900/4588595) loss:3.562 lr:0.0001642 epoch_Time:28538.0min: [2024-01-02 21:43:48,251][model8_pretrain.py][INFO] Epoch:[0/2](38900/4588595) loss:3.731 lr:0.0001642 epoch_Time:28538.0min: [2024-01-02 21:43:48,251][model8_pretrain.py][INFO] Epoch:[0/2](38900/4588595) loss:3.687 lr:0.0001642 epoch_Time:28538.0min: [2024-01-02 21:43:48,251][model8_pretrain.py][INFO] Epoch:[0/2](38900/4588595) loss:3.334 lr:0.0001642 epoch_Time:28538.0min: [2024-01-02 21:43:48,251][model8_pretrain.py][INFO] Epoch:[0/2](38900/4588595) loss:3.050 lr:0.0001642 epoch_Time:28538.0min: [2024-01-02 21:43:48,251][model8_pretrain.py][INFO] Epoch:[0/2](38900/4588595) loss:3.521 lr:0.0001642 epoch_Time:28538.0min: [2024-01-02 21:43:48,251][model8_pretrain.py][INFO] Epoch:[0/2](38900/4588595) loss:3.130 lr:0.0001642 epoch_Time:28538.0min: [2024-01-02 21:43:48,251][model8_pretrain.py][INFO] Epoch:[0/2](38900/4588595) loss:3.419 lr:0.0001642 epoch_Time:28538.0min: [2024-01-02 21:44:32,271][model8_pretrain.py][INFO] Epoch:[0/2](39000/4588595) loss:3.519 lr:0.0001636 epoch_Time:28550.0min: [2024-01-02 21:44:32,271][model8_pretrain.py][INFO] Epoch:[0/2](39000/4588595) loss:3.258 lr:0.0001636 epoch_Time:28551.0min: [2024-01-02 21:44:32,271][model8_pretrain.py][INFO] Epoch:[0/2](39000/4588595) loss:3.562 lr:0.0001636 epoch_Time:28550.0min: [2024-01-02 21:44:32,271][model8_pretrain.py][INFO] 
Epoch:[0/2](39000/4588595) loss:2.979 lr:0.0001636 epoch_Time:28550.0min: [2024-01-02 21:44:32,271][model8_pretrain.py][INFO] Epoch:[0/2](39000/4588595) loss:3.509 lr:0.0001636 epoch_Time:28550.0min: [2024-01-02 21:44:32,271][model8_pretrain.py][INFO] Epoch:[0/2](39000/4588595) loss:3.338 lr:0.0001636 epoch_Time:28551.0min: [2024-01-02 21:44:32,271][model8_pretrain.py][INFO] Epoch:[0/2](39000/4588595) loss:2.835 lr:0.0001636 epoch_Time:28550.0min: [2024-01-02 21:44:32,271][model8_pretrain.py][INFO] Epoch:[0/2](39000/4588595) loss:3.326 lr:0.0001636 epoch_Time:28550.0min: [2024-01-02 21:45:09,199][model8_pretrain.py][INFO] Epoch:[0/2](39100/4588595) loss:3.266 lr:0.0001631 epoch_Time:28548.0min: [2024-01-02 21:45:09,199][model8_pretrain.py][INFO] Epoch:[0/2](39100/4588595) loss:3.500 lr:0.0001631 epoch_Time:28548.0min: [2024-01-02 21:45:09,199][model8_pretrain.py][INFO] Epoch:[0/2](39100/4588595) loss:3.254 lr:0.0001631 epoch_Time:28548.0min: [2024-01-02 21:45:09,199][model8_pretrain.py][INFO] Epoch:[0/2](39100/4588595) loss:3.615 lr:0.0001631 epoch_Time:28548.0min: [2024-01-02 21:45:09,199][model8_pretrain.py][INFO] Epoch:[0/2](39100/4588595) loss:3.264 lr:0.0001631 epoch_Time:28548.0min: [2024-01-02 21:45:09,199][model8_pretrain.py][INFO] Epoch:[0/2](39100/4588595) loss:3.269 lr:0.0001631 epoch_Time:28548.0min: [2024-01-02 21:45:09,199][model8_pretrain.py][INFO] Epoch:[0/2](39100/4588595) loss:3.552 lr:0.0001631 epoch_Time:28548.0min: [2024-01-02 21:45:09,199][model8_pretrain.py][INFO] Epoch:[0/2](39100/4588595) loss:2.715 lr:0.0001631 epoch_Time:28548.0min: [2024-01-02 21:45:46,151][model8_pretrain.py][INFO] Epoch:[0/2](39200/4588595) loss:3.377 lr:0.0001625 epoch_Time:28547.0min: [2024-01-02 21:45:46,151][model8_pretrain.py][INFO] Epoch:[0/2](39200/4588595) loss:3.192 lr:0.0001625 epoch_Time:28547.0min: [2024-01-02 21:45:46,151][model8_pretrain.py][INFO] Epoch:[0/2](39200/4588595) loss:3.257 lr:0.0001625 epoch_Time:28547.0min: [2024-01-02 
21:45:46,151][model8_pretrain.py][INFO] Epoch:[0/2](39200/4588595) loss:3.790 lr:0.0001625 epoch_Time:28547.0min: [2024-01-02 21:45:46,151][model8_pretrain.py][INFO] Epoch:[0/2](39200/4588595) loss:3.341 lr:0.0001625 epoch_Time:28547.0min: [2024-01-02 21:45:46,151][model8_pretrain.py][INFO] Epoch:[0/2](39200/4588595) loss:3.351 lr:0.0001625 epoch_Time:28547.0min: [2024-01-02 21:45:46,151][model8_pretrain.py][INFO] Epoch:[0/2](39200/4588595) loss:3.307 lr:0.0001625 epoch_Time:28547.0min: [2024-01-02 21:45:46,151][model8_pretrain.py][INFO] Epoch:[0/2](39200/4588595) loss:3.341 lr:0.0001625 epoch_Time:28547.0min: [2024-01-02 21:46:23,113][model8_pretrain.py][INFO] Epoch:[0/2](39300/4588595) loss:3.856 lr:0.0001619 epoch_Time:28544.0min: [2024-01-02 21:46:23,113][model8_pretrain.py][INFO] Epoch:[0/2](39300/4588595) loss:3.456 lr:0.0001619 epoch_Time:28544.0min: [2024-01-02 21:46:23,113][model8_pretrain.py][INFO] Epoch:[0/2](39300/4588595) loss:3.524 lr:0.0001619 epoch_Time:28544.0min: [2024-01-02 21:46:23,113][model8_pretrain.py][INFO] Epoch:[0/2](39300/4588595) loss:3.472 lr:0.0001619 epoch_Time:28544.0min: [2024-01-02 21:46:23,113][model8_pretrain.py][INFO] Epoch:[0/2](39300/4588595) loss:3.213 lr:0.0001619 epoch_Time:28544.0min: [2024-01-02 21:46:23,113][model8_pretrain.py][INFO] Epoch:[0/2](39300/4588595) loss:3.869 lr:0.0001619 epoch_Time:28544.0min: [2024-01-02 21:46:23,113][model8_pretrain.py][INFO] Epoch:[0/2](39300/4588595) loss:3.301 lr:0.0001619 epoch_Time:28544.0min: [2024-01-02 21:46:23,113][model8_pretrain.py][INFO] Epoch:[0/2](39300/4588595) loss:2.998 lr:0.0001619 epoch_Time:28544.0min: [2024-01-02 21:47:00,055][model8_pretrain.py][INFO] Epoch:[0/2](39400/4588595) loss:3.455 lr:0.0001613 epoch_Time:28542.0min: [2024-01-02 21:47:00,055][model8_pretrain.py][INFO] Epoch:[0/2](39400/4588595) loss:3.323 lr:0.0001613 epoch_Time:28542.0min: [2024-01-02 21:47:00,055][model8_pretrain.py][INFO] Epoch:[0/2](39400/4588595) loss:3.452 lr:0.0001613 
epoch_Time:28542.0min: [2024-01-02 21:47:00,055][model8_pretrain.py][INFO] Epoch:[0/2](39400/4588595) loss:3.649 lr:0.0001613 epoch_Time:28542.0min: [2024-01-02 21:47:00,055][model8_pretrain.py][INFO] Epoch:[0/2](39400/4588595) loss:3.237 lr:0.0001613 epoch_Time:28542.0min: [2024-01-02 21:47:00,055][model8_pretrain.py][INFO] Epoch:[0/2](39400/4588595) loss:3.502 lr:0.0001613 epoch_Time:28542.0min: [2024-01-02 21:47:00,055][model8_pretrain.py][INFO] Epoch:[0/2](39400/4588595) loss:3.710 lr:0.0001613 epoch_Time:28542.0min: [2024-01-02 21:47:00,055][model8_pretrain.py][INFO] Epoch:[0/2](39400/4588595) loss:3.232 lr:0.0001613 epoch_Time:28542.0min: [2024-01-02 21:47:36,993][model8_pretrain.py][INFO] Epoch:[0/2](39500/4588595) loss:3.277 lr:0.0001608 epoch_Time:28541.0min: [2024-01-02 21:47:36,993][model8_pretrain.py][INFO] Epoch:[0/2](39500/4588595) loss:3.651 lr:0.0001608 epoch_Time:28541.0min: [2024-01-02 21:47:36,993][model8_pretrain.py][INFO] Epoch:[0/2](39500/4588595) loss:3.187 lr:0.0001608 epoch_Time:28541.0min: [2024-01-02 21:47:36,993][model8_pretrain.py][INFO] Epoch:[0/2](39500/4588595) loss:2.954 lr:0.0001608 epoch_Time:28541.0min: [2024-01-02 21:47:36,993][model8_pretrain.py][INFO] Epoch:[0/2](39500/4588595) loss:3.559 lr:0.0001608 epoch_Time:28541.0min: [2024-01-02 21:47:36,993][model8_pretrain.py][INFO] Epoch:[0/2](39500/4588595) loss:3.157 lr:0.0001608 epoch_Time:28541.0min: [2024-01-02 21:47:36,993][model8_pretrain.py][INFO] Epoch:[0/2](39500/4588595) loss:3.313 lr:0.0001608 epoch_Time:28541.0min: [2024-01-02 21:47:36,993][model8_pretrain.py][INFO] Epoch:[0/2](39500/4588595) loss:2.947 lr:0.0001608 epoch_Time:28541.0min: [2024-01-02 21:48:13,922][model8_pretrain.py][INFO] Epoch:[0/2](39600/4588595) loss:3.005 lr:0.0001602 epoch_Time:28538.0min: [2024-01-02 21:48:13,922][model8_pretrain.py][INFO] Epoch:[0/2](39600/4588595) loss:3.598 lr:0.0001602 epoch_Time:28538.0min: [2024-01-02 21:48:13,922][model8_pretrain.py][INFO] Epoch:[0/2](39600/4588595) 
loss:2.883 lr:0.0001602 epoch_Time:28538.0min: [2024-01-02 21:48:13,922][model8_pretrain.py][INFO] Epoch:[0/2](39600/4588595) loss:2.202 lr:0.0001602 epoch_Time:28538.0min: [2024-01-02 21:48:13,922][model8_pretrain.py][INFO] Epoch:[0/2](39600/4588595) loss:3.481 lr:0.0001602 epoch_Time:28538.0min: [2024-01-02 21:48:13,922][model8_pretrain.py][INFO] Epoch:[0/2](39600/4588595) loss:2.695 lr:0.0001602 epoch_Time:28538.0min: [2024-01-02 21:48:13,922][model8_pretrain.py][INFO] Epoch:[0/2](39600/4588595) loss:3.557 lr:0.0001602 epoch_Time:28538.0min: [2024-01-02 21:48:13,922][model8_pretrain.py][INFO] Epoch:[0/2](39600/4588595) loss:3.219 lr:0.0001602 epoch_Time:28538.0min: [2024-01-02 21:48:50,841][model8_pretrain.py][INFO] Epoch:[0/2](39700/4588595) loss:2.848 lr:0.0001596 epoch_Time:28536.0min: [2024-01-02 21:48:50,841][model8_pretrain.py][INFO] Epoch:[0/2](39700/4588595) loss:2.788 lr:0.0001596 epoch_Time:28536.0min: [2024-01-02 21:48:50,841][model8_pretrain.py][INFO] Epoch:[0/2](39700/4588595) loss:3.667 lr:0.0001596 epoch_Time:28536.0min: [2024-01-02 21:48:50,841][model8_pretrain.py][INFO] Epoch:[0/2](39700/4588595) loss:4.017 lr:0.0001596 epoch_Time:28536.0min: [2024-01-02 21:48:50,841][model8_pretrain.py][INFO] Epoch:[0/2](39700/4588595) loss:3.908 lr:0.0001596 epoch_Time:28536.0min: [2024-01-02 21:48:50,841][model8_pretrain.py][INFO] Epoch:[0/2](39700/4588595) loss:3.417 lr:0.0001596 epoch_Time:28536.0min: [2024-01-02 21:48:50,841][model8_pretrain.py][INFO] Epoch:[0/2](39700/4588595) loss:3.379 lr:0.0001596 epoch_Time:28536.0min: [2024-01-02 21:48:50,841][model8_pretrain.py][INFO] Epoch:[0/2](39700/4588595) loss:3.064 lr:0.0001596 epoch_Time:28536.0min: [2024-01-02 21:49:34,812][model8_pretrain.py][INFO] Epoch:[0/2](39800/4588595) loss:2.719 lr:0.0001590 epoch_Time:28548.0min: [2024-01-02 21:49:34,812][model8_pretrain.py][INFO] Epoch:[0/2](39800/4588595) loss:3.127 lr:0.0001590 epoch_Time:28548.0min: [2024-01-02 21:49:34,812][model8_pretrain.py][INFO] 
Epoch:[0/2](39800/4588595) loss:3.625 lr:0.0001590 epoch_Time:28548.0min: [2024-01-02 21:49:34,812][model8_pretrain.py][INFO] Epoch:[0/2](39800/4588595) loss:3.420 lr:0.0001590 epoch_Time:28548.0min: [2024-01-02 21:49:34,812][model8_pretrain.py][INFO] Epoch:[0/2](39800/4588595) loss:3.397 lr:0.0001590 epoch_Time:28548.0min: [2024-01-02 21:49:34,812][model8_pretrain.py][INFO] Epoch:[0/2](39800/4588595) loss:3.470 lr:0.0001590 epoch_Time:28548.0min: [2024-01-02 21:49:34,812][model8_pretrain.py][INFO] Epoch:[0/2](39800/4588595) loss:3.138 lr:0.0001590 epoch_Time:28548.0min: [2024-01-02 21:49:34,813][model8_pretrain.py][INFO] Epoch:[0/2](39800/4588595) loss:3.566 lr:0.0001590 epoch_Time:28548.0min: [2024-01-02 21:50:11,765][model8_pretrain.py][INFO] Epoch:[0/2](39900/4588595) loss:3.371 lr:0.0001585 epoch_Time:28546.0min: [2024-01-02 21:50:11,765][model8_pretrain.py][INFO] Epoch:[0/2](39900/4588595) loss:3.212 lr:0.0001585 epoch_Time:28546.0min: [2024-01-02 21:50:11,765][model8_pretrain.py][INFO] Epoch:[0/2](39900/4588595) loss:3.408 lr:0.0001585 epoch_Time:28546.0min: [2024-01-02 21:50:11,765][model8_pretrain.py][INFO] Epoch:[0/2](39900/4588595) loss:3.293 lr:0.0001585 epoch_Time:28546.0min: [2024-01-02 21:50:11,765][model8_pretrain.py][INFO] Epoch:[0/2](39900/4588595) loss:3.190 lr:0.0001585 epoch_Time:28546.0min: [2024-01-02 21:50:11,765][model8_pretrain.py][INFO] Epoch:[0/2](39900/4588595) loss:2.805 lr:0.0001585 epoch_Time:28546.0min: [2024-01-02 21:50:11,765][model8_pretrain.py][INFO] Epoch:[0/2](39900/4588595) loss:3.311 lr:0.0001585 epoch_Time:28546.0min: [2024-01-02 21:50:11,765][model8_pretrain.py][INFO] Epoch:[0/2](39900/4588595) loss:3.008 lr:0.0001585 epoch_Time:28546.0min: [2024-01-02 21:50:48,719][model8_pretrain.py][INFO] Epoch:[0/2](40000/4588595) loss:3.372 lr:0.0001579 epoch_Time:28543.0min: [2024-01-02 21:50:48,719][model8_pretrain.py][INFO] Epoch:[0/2](40000/4588595) loss:3.507 lr:0.0001579 epoch_Time:28543.0min: [2024-01-02 
21:50:48,719][model8_pretrain.py][INFO] Epoch:[0/2](40000/4588595) loss:2.649 lr:0.0001579 epoch_Time:28543.0min: [2024-01-02 21:50:48,719][model8_pretrain.py][INFO] Epoch:[0/2](40000/4588595) loss:3.504 lr:0.0001579 epoch_Time:28543.0min: [2024-01-02 21:50:48,719][model8_pretrain.py][INFO] Epoch:[0/2](40000/4588595) loss:2.672 lr:0.0001579 epoch_Time:28543.0min: [2024-01-02 21:50:48,719][model8_pretrain.py][INFO] Epoch:[0/2](40000/4588595) loss:3.565 lr:0.0001579 epoch_Time:28543.0min: [2024-01-02 21:50:48,719][model8_pretrain.py][INFO] Epoch:[0/2](40000/4588595) loss:3.678 lr:0.0001579 epoch_Time:28543.0min: [2024-01-02 21:50:48,720][model8_pretrain.py][INFO] Epoch:[0/2](40000/4588595) loss:3.272 lr:0.0001579 epoch_Time:28543.0min: [2024-01-02 21:51:25,650][model8_pretrain.py][INFO] Epoch:[0/2](40100/4588595) loss:2.850 lr:0.0001573 epoch_Time:28542.0min: [2024-01-02 21:51:25,650][model8_pretrain.py][INFO] Epoch:[0/2](40100/4588595) loss:3.568 lr:0.0001573 epoch_Time:28542.0min: [2024-01-02 21:51:25,650][model8_pretrain.py][INFO] Epoch:[0/2](40100/4588595) loss:3.650 lr:0.0001573 epoch_Time:28542.0min: [2024-01-02 21:51:25,650][model8_pretrain.py][INFO] Epoch:[0/2](40100/4588595) loss:3.139 lr:0.0001573 epoch_Time:28542.0min: [2024-01-02 21:51:25,650][model8_pretrain.py][INFO] Epoch:[0/2](40100/4588595) loss:2.964 lr:0.0001573 epoch_Time:28542.0min: [2024-01-02 21:51:25,650][model8_pretrain.py][INFO] Epoch:[0/2](40100/4588595) loss:3.522 lr:0.0001573 epoch_Time:28542.0min: [2024-01-02 21:51:25,650][model8_pretrain.py][INFO] Epoch:[0/2](40100/4588595) loss:2.980 lr:0.0001573 epoch_Time:28542.0min: [2024-01-02 21:51:25,650][model8_pretrain.py][INFO] Epoch:[0/2](40100/4588595) loss:3.431 lr:0.0001573 epoch_Time:28542.0min: [2024-01-02 21:52:02,591][model8_pretrain.py][INFO] Epoch:[0/2](40200/4588595) loss:3.307 lr:0.0001567 epoch_Time:28540.0min: [2024-01-02 21:52:02,591][model8_pretrain.py][INFO] Epoch:[0/2](40200/4588595) loss:2.821 lr:0.0001567 
epoch_Time:28540.0min: [2024-01-02 21:52:02,591][model8_pretrain.py][INFO] Epoch:[0/2](40200/4588595) loss:3.295 lr:0.0001567 epoch_Time:28540.0min: [2024-01-02 21:52:02,591][model8_pretrain.py][INFO] Epoch:[0/2](40200/4588595) loss:3.254 lr:0.0001567 epoch_Time:28540.0min: [2024-01-02 21:52:02,591][model8_pretrain.py][INFO] Epoch:[0/2](40200/4588595) loss:3.597 lr:0.0001567 epoch_Time:28540.0min: [2024-01-02 21:52:02,591][model8_pretrain.py][INFO] Epoch:[0/2](40200/4588595) loss:3.446 lr:0.0001567 epoch_Time:28540.0min: [2024-01-02 21:52:02,591][model8_pretrain.py][INFO] Epoch:[0/2](40200/4588595) loss:3.269 lr:0.0001567 epoch_Time:28540.0min: [2024-01-02 21:52:02,591][model8_pretrain.py][INFO] Epoch:[0/2](40200/4588595) loss:3.012 lr:0.0001567 epoch_Time:28540.0min: [2024-01-02 21:52:39,534][model8_pretrain.py][INFO] Epoch:[0/2](40300/4588595) loss:3.309 lr:0.0001562 epoch_Time:28538.0min: [2024-01-02 21:52:39,534][model8_pretrain.py][INFO] Epoch:[0/2](40300/4588595) loss:2.705 lr:0.0001562 epoch_Time:28538.0min: [2024-01-02 21:52:39,534][model8_pretrain.py][INFO] Epoch:[0/2](40300/4588595) loss:3.043 lr:0.0001562 epoch_Time:28538.0min: [2024-01-02 21:52:39,534][model8_pretrain.py][INFO] Epoch:[0/2](40300/4588595) loss:3.052 lr:0.0001562 epoch_Time:28538.0min: [2024-01-02 21:52:39,534][model8_pretrain.py][INFO] Epoch:[0/2](40300/4588595) loss:3.530 lr:0.0001562 epoch_Time:28538.0min: [2024-01-02 21:52:39,534][model8_pretrain.py][INFO] Epoch:[0/2](40300/4588595) loss:3.061 lr:0.0001562 epoch_Time:28538.0min: [2024-01-02 21:52:39,535][model8_pretrain.py][INFO] Epoch:[0/2](40300/4588595) loss:3.665 lr:0.0001562 epoch_Time:28538.0min: [2024-01-02 21:52:39,543][model8_pretrain.py][INFO] Epoch:[0/2](40300/4588595) loss:3.421 lr:0.0001562 epoch_Time:28538.0min: [2024-01-02 21:53:16,503][model8_pretrain.py][INFO] Epoch:[0/2](40400/4588595) loss:3.398 lr:0.0001556 epoch_Time:28536.0min: [2024-01-02 21:53:16,503][model8_pretrain.py][INFO] Epoch:[0/2](40400/4588595) 
loss:2.842 lr:0.0001556 epoch_Time:28536.0min: [2024-01-02 21:53:16,503][model8_pretrain.py][INFO] Epoch:[0/2](40400/4588595) loss:3.606 lr:0.0001556 epoch_Time:28536.0min: [2024-01-02 21:53:16,503][model8_pretrain.py][INFO] Epoch:[0/2](40400/4588595) loss:3.138 lr:0.0001556 epoch_Time:28536.0min: [2024-01-02 21:53:16,503][model8_pretrain.py][INFO] Epoch:[0/2](40400/4588595) loss:3.499 lr:0.0001556 epoch_Time:28536.0min: [2024-01-02 21:53:16,504][model8_pretrain.py][INFO] Epoch:[0/2](40400/4588595) loss:3.160 lr:0.0001556 epoch_Time:28536.0min: [2024-01-02 21:53:16,505][model8_pretrain.py][INFO] Epoch:[0/2](40400/4588595) loss:3.483 lr:0.0001556 epoch_Time:28536.0min: [2024-01-02 21:53:16,505][model8_pretrain.py][INFO] Epoch:[0/2](40400/4588595) loss:3.512 lr:0.0001556 epoch_Time:28536.0min: [2024-01-02 21:53:53,461][model8_pretrain.py][INFO] Epoch:[0/2](40500/4588595) loss:3.494 lr:0.0001550 epoch_Time:28534.0min: [2024-01-02 21:53:53,461][model8_pretrain.py][INFO] Epoch:[0/2](40500/4588595) loss:3.228 lr:0.0001550 epoch_Time:28534.0min: [2024-01-02 21:53:53,461][model8_pretrain.py][INFO] Epoch:[0/2](40500/4588595) loss:3.344 lr:0.0001550 epoch_Time:28534.0min: [2024-01-02 21:53:53,461][model8_pretrain.py][INFO] Epoch:[0/2](40500/4588595) loss:3.333 lr:0.0001550 epoch_Time:28534.0min: [2024-01-02 21:53:53,461][model8_pretrain.py][INFO] Epoch:[0/2](40500/4588595) loss:3.039 lr:0.0001550 epoch_Time:28534.0min: [2024-01-02 21:53:53,461][model8_pretrain.py][INFO] Epoch:[0/2](40500/4588595) loss:3.254 lr:0.0001550 epoch_Time:28534.0min: [2024-01-02 21:53:53,461][model8_pretrain.py][INFO] Epoch:[0/2](40500/4588595) loss:3.384 lr:0.0001550 epoch_Time:28534.0min: [2024-01-02 21:53:53,461][model8_pretrain.py][INFO] Epoch:[0/2](40500/4588595) loss:3.095 lr:0.0001550 epoch_Time:28534.0min: [2024-01-02 21:54:37,527][model8_pretrain.py][INFO] Epoch:[0/2](40600/4588595) loss:2.891 lr:0.0001544 epoch_Time:28546.0min: [2024-01-02 21:54:37,527][model8_pretrain.py][INFO] 
Epoch:[0/2](40600/4588595) loss:3.121 lr:0.0001544 epoch_Time:28546.0min: [2024-01-02 21:54:37,527][model8_pretrain.py][INFO] Epoch:[0/2](40600/4588595) loss:3.218 lr:0.0001544 epoch_Time:28546.0min: [2024-01-02 21:54:37,527][model8_pretrain.py][INFO] Epoch:[0/2](40600/4588595) loss:3.068 lr:0.0001544 epoch_Time:28546.0min: [2024-01-02 21:54:37,527][model8_pretrain.py][INFO] Epoch:[0/2](40600/4588595) loss:3.243 lr:0.0001544 epoch_Time:28546.0min: [2024-01-02 21:54:37,527][model8_pretrain.py][INFO] Epoch:[0/2](40600/4588595) loss:3.362 lr:0.0001544 epoch_Time:28546.0min: [2024-01-02 21:54:37,527][model8_pretrain.py][INFO] Epoch:[0/2](40600/4588595) loss:3.436 lr:0.0001544 epoch_Time:28546.0min: [2024-01-02 21:54:37,528][model8_pretrain.py][INFO] Epoch:[0/2](40600/4588595) loss:3.507 lr:0.0001544 epoch_Time:28546.0min: [2024-01-02 21:55:14,469][model8_pretrain.py][INFO] Epoch:[0/2](40700/4588595) loss:3.520 lr:0.0001538 epoch_Time:28543.0min: [2024-01-02 21:55:14,469][model8_pretrain.py][INFO] Epoch:[0/2](40700/4588595) loss:3.390 lr:0.0001538 epoch_Time:28543.0min: [2024-01-02 21:55:14,469][model8_pretrain.py][INFO] Epoch:[0/2](40700/4588595) loss:3.498 lr:0.0001538 epoch_Time:28543.0min: [2024-01-02 21:55:14,469][model8_pretrain.py][INFO] Epoch:[0/2](40700/4588595) loss:3.123 lr:0.0001538 epoch_Time:28543.0min: [2024-01-02 21:55:14,469][model8_pretrain.py][INFO] Epoch:[0/2](40700/4588595) loss:3.282 lr:0.0001538 epoch_Time:28543.0min: [2024-01-02 21:55:14,469][model8_pretrain.py][INFO] Epoch:[0/2](40700/4588595) loss:3.111 lr:0.0001538 epoch_Time:28544.0min: [2024-01-02 21:55:14,469][model8_pretrain.py][INFO] Epoch:[0/2](40700/4588595) loss:3.382 lr:0.0001538 epoch_Time:28543.0min: [2024-01-02 21:55:14,469][model8_pretrain.py][INFO] Epoch:[0/2](40700/4588595) loss:3.496 lr:0.0001538 epoch_Time:28543.0min: [2024-01-02 21:55:51,399][model8_pretrain.py][INFO] Epoch:[0/2](40800/4588595) loss:3.244 lr:0.0001533 epoch_Time:28541.0min: [2024-01-02 
21:55:51,399][model8_pretrain.py][INFO] Epoch:[0/2](40800/4588595) loss:2.824 lr:0.0001533 epoch_Time:28541.0min: [2024-01-02 21:55:51,399][model8_pretrain.py][INFO] Epoch:[0/2](40800/4588595) loss:3.651 lr:0.0001533 epoch_Time:28541.0min: [2024-01-02 21:55:51,399][model8_pretrain.py][INFO] Epoch:[0/2](40800/4588595) loss:3.284 lr:0.0001533 epoch_Time:28541.0min: [2024-01-02 21:55:51,399][model8_pretrain.py][INFO] Epoch:[0/2](40800/4588595) loss:3.224 lr:0.0001533 epoch_Time:28541.0min: [2024-01-02 21:55:51,399][model8_pretrain.py][INFO] Epoch:[0/2](40800/4588595) loss:3.194 lr:0.0001533 epoch_Time:28541.0min: [2024-01-02 21:55:51,399][model8_pretrain.py][INFO] Epoch:[0/2](40800/4588595) loss:2.613 lr:0.0001533 epoch_Time:28541.0min: [2024-01-02 21:55:51,399][model8_pretrain.py][INFO] Epoch:[0/2](40800/4588595) loss:3.037 lr:0.0001533 epoch_Time:28541.0min: [2024-01-02 21:56:28,347][model8_pretrain.py][INFO] Epoch:[0/2](40900/4588595) loss:3.334 lr:0.0001527 epoch_Time:28540.0min: [2024-01-02 21:56:28,347][model8_pretrain.py][INFO] Epoch:[0/2](40900/4588595) loss:3.248 lr:0.0001527 epoch_Time:28540.0min: [2024-01-02 21:56:28,347][model8_pretrain.py][INFO] Epoch:[0/2](40900/4588595) loss:3.038 lr:0.0001527 epoch_Time:28540.0min: [2024-01-02 21:56:28,347][model8_pretrain.py][INFO] Epoch:[0/2](40900/4588595) loss:3.666 lr:0.0001527 epoch_Time:28540.0min: [2024-01-02 21:56:28,347][model8_pretrain.py][INFO] Epoch:[0/2](40900/4588595) loss:3.487 lr:0.0001527 epoch_Time:28540.0min: [2024-01-02 21:56:28,347][model8_pretrain.py][INFO] Epoch:[0/2](40900/4588595) loss:3.475 lr:0.0001527 epoch_Time:28540.0min: [2024-01-02 21:56:28,347][model8_pretrain.py][INFO] Epoch:[0/2](40900/4588595) loss:3.245 lr:0.0001527 epoch_Time:28540.0min: [2024-01-02 21:56:28,347][model8_pretrain.py][INFO] Epoch:[0/2](40900/4588595) loss:3.432 lr:0.0001527 epoch_Time:28540.0min: [2024-01-02 21:57:05,301][model8_pretrain.py][INFO] Epoch:[0/2](41000/4588595) loss:3.094 lr:0.0001521 
epoch_Time:28537.0min: [2024-01-02 21:57:05,302][model8_pretrain.py][INFO] Epoch:[0/2](41000/4588595) loss:3.415 lr:0.0001521 epoch_Time:28537.0min: [2024-01-02 21:57:05,302][model8_pretrain.py][INFO] Epoch:[0/2](41000/4588595) loss:3.463 lr:0.0001521 epoch_Time:28538.0min: [2024-01-02 21:57:05,302][model8_pretrain.py][INFO] Epoch:[0/2](41000/4588595) loss:3.465 lr:0.0001521 epoch_Time:28537.0min: [2024-01-02 21:57:05,302][model8_pretrain.py][INFO] Epoch:[0/2](41000/4588595) loss:3.144 lr:0.0001521 epoch_Time:28537.0min: [2024-01-02 21:57:05,302][model8_pretrain.py][INFO] Epoch:[0/2](41000/4588595) loss:3.245 lr:0.0001521 epoch_Time:28537.0min: [2024-01-02 21:57:05,302][model8_pretrain.py][INFO] Epoch:[0/2](41000/4588595) loss:3.607 lr:0.0001521 epoch_Time:28537.0min: [2024-01-02 21:57:05,302][model8_pretrain.py][INFO] Epoch:[0/2](41000/4588595) loss:3.060 lr:0.0001521 epoch_Time:28538.0min: [2024-01-02 21:57:42,257][model8_pretrain.py][INFO] Epoch:[0/2](41100/4588595) loss:3.229 lr:0.0001515 epoch_Time:28536.0min: [2024-01-02 21:57:42,258][model8_pretrain.py][INFO] Epoch:[0/2](41100/4588595) loss:3.362 lr:0.0001515 epoch_Time:28536.0min: [2024-01-02 21:57:42,257][model8_pretrain.py][INFO] Epoch:[0/2](41100/4588595) loss:3.243 lr:0.0001515 epoch_Time:28536.0min: [2024-01-02 21:57:42,258][model8_pretrain.py][INFO] Epoch:[0/2](41100/4588595) loss:3.385 lr:0.0001515 epoch_Time:28536.0min: [2024-01-02 21:57:42,258][model8_pretrain.py][INFO] Epoch:[0/2](41100/4588595) loss:3.518 lr:0.0001515 epoch_Time:28536.0min: [2024-01-02 21:57:42,258][model8_pretrain.py][INFO] Epoch:[0/2](41100/4588595) loss:3.489 lr:0.0001515 epoch_Time:28536.0min: [2024-01-02 21:57:42,258][model8_pretrain.py][INFO] Epoch:[0/2](41100/4588595) loss:3.654 lr:0.0001515 epoch_Time:28536.0min: [2024-01-02 21:57:42,258][model8_pretrain.py][INFO] Epoch:[0/2](41100/4588595) loss:3.294 lr:0.0001515 epoch_Time:28536.0min: [2024-01-02 21:58:19,224][model8_pretrain.py][INFO] Epoch:[0/2](41200/4588595) 
loss:3.223 lr:0.0001510 epoch_Time:28534.0min: [2024-01-02 21:58:19,224][model8_pretrain.py][INFO] Epoch:[0/2](41200/4588595) loss:3.110 lr:0.0001510 epoch_Time:28534.0min: [2024-01-02 21:58:19,224][model8_pretrain.py][INFO] Epoch:[0/2](41200/4588595) loss:3.166 lr:0.0001510 epoch_Time:28534.0min: [2024-01-02 21:58:19,224][model8_pretrain.py][INFO] Epoch:[0/2](41200/4588595) loss:3.613 lr:0.0001510 epoch_Time:28534.0min: [2024-01-02 21:58:19,224][model8_pretrain.py][INFO] Epoch:[0/2](41200/4588595) loss:3.045 lr:0.0001510 epoch_Time:28534.0min: [2024-01-02 21:58:19,224][model8_pretrain.py][INFO] Epoch:[0/2](41200/4588595) loss:2.786 lr:0.0001510 epoch_Time:28534.0min: [2024-01-02 21:58:19,224][model8_pretrain.py][INFO] Epoch:[0/2](41200/4588595) loss:3.665 lr:0.0001510 epoch_Time:28534.0min: [2024-01-02 21:58:19,224][model8_pretrain.py][INFO] Epoch:[0/2](41200/4588595) loss:3.210 lr:0.0001510 epoch_Time:28534.0min: [2024-01-02 21:58:56,164][model8_pretrain.py][INFO] Epoch:[0/2](41300/4588595) loss:3.219 lr:0.0001504 epoch_Time:28532.0min: [2024-01-02 21:58:56,164][model8_pretrain.py][INFO] Epoch:[0/2](41300/4588595) loss:3.359 lr:0.0001504 epoch_Time:28532.0min: [2024-01-02 21:58:56,164][model8_pretrain.py][INFO] Epoch:[0/2](41300/4588595) loss:3.698 lr:0.0001504 epoch_Time:28532.0min: [2024-01-02 21:58:56,164][model8_pretrain.py][INFO] Epoch:[0/2](41300/4588595) loss:3.587 lr:0.0001504 epoch_Time:28532.0min: [2024-01-02 21:58:56,164][model8_pretrain.py][INFO] Epoch:[0/2](41300/4588595) loss:2.635 lr:0.0001504 epoch_Time:28532.0min: [2024-01-02 21:58:56,164][model8_pretrain.py][INFO] Epoch:[0/2](41300/4588595) loss:3.587 lr:0.0001504 epoch_Time:28532.0min: [2024-01-02 21:58:56,164][model8_pretrain.py][INFO] Epoch:[0/2](41300/4588595) loss:3.201 lr:0.0001504 epoch_Time:28532.0min: [2024-01-02 21:58:56,164][model8_pretrain.py][INFO] Epoch:[0/2](41300/4588595) loss:3.229 lr:0.0001504 epoch_Time:28532.0min: [2024-01-02 21:59:40,349][model8_pretrain.py][INFO] 
Epoch:[0/2](41400/4588595) loss:2.663 lr:0.0001498 epoch_Time:28544.0min: [2024-01-02 21:59:40,349][model8_pretrain.py][INFO] Epoch:[0/2](41400/4588595) loss:3.405 lr:0.0001498 epoch_Time:28544.0min: [2024-01-02 21:59:40,349][model8_pretrain.py][INFO] Epoch:[0/2](41400/4588595) loss:3.551 lr:0.0001498 epoch_Time:28544.0min: [2024-01-02 21:59:40,349][model8_pretrain.py][INFO] Epoch:[0/2](41400/4588595) loss:3.324 lr:0.0001498 epoch_Time:28544.0min: [2024-01-02 21:59:40,349][model8_pretrain.py][INFO] Epoch:[0/2](41400/4588595) loss:3.580 lr:0.0001498 epoch_Time:28544.0min: [2024-01-02 21:59:40,349][model8_pretrain.py][INFO] Epoch:[0/2](41400/4588595) loss:2.946 lr:0.0001498 epoch_Time:28544.0min: [2024-01-02 21:59:40,349][model8_pretrain.py][INFO] Epoch:[0/2](41400/4588595) loss:3.008 lr:0.0001498 epoch_Time:28544.0min: [2024-01-02 21:59:40,349][model8_pretrain.py][INFO] Epoch:[0/2](41400/4588595) loss:3.740 lr:0.0001498 epoch_Time:28544.0min: [2024-01-02 22:00:17,288][model8_pretrain.py][INFO] Epoch:[0/2](41500/4588595) loss:3.248 lr:0.0001492 epoch_Time:28541.0min: [2024-01-02 22:00:17,288][model8_pretrain.py][INFO] Epoch:[0/2](41500/4588595) loss:3.551 lr:0.0001492 epoch_Time:28541.0min: [2024-01-02 22:00:17,288][model8_pretrain.py][INFO] Epoch:[0/2](41500/4588595) loss:3.245 lr:0.0001492 epoch_Time:28541.0min: [2024-01-02 22:00:17,288][model8_pretrain.py][INFO] Epoch:[0/2](41500/4588595) loss:2.908 lr:0.0001492 epoch_Time:28541.0min: [2024-01-02 22:00:17,288][model8_pretrain.py][INFO] Epoch:[0/2](41500/4588595) loss:3.084 lr:0.0001492 epoch_Time:28541.0min: [2024-01-02 22:00:17,288][model8_pretrain.py][INFO] Epoch:[0/2](41500/4588595) loss:2.998 lr:0.0001492 epoch_Time:28541.0min: [2024-01-02 22:00:17,288][model8_pretrain.py][INFO] Epoch:[0/2](41500/4588595) loss:3.600 lr:0.0001492 epoch_Time:28541.0min: [2024-01-02 22:00:17,288][model8_pretrain.py][INFO] Epoch:[0/2](41500/4588595) loss:3.140 lr:0.0001492 epoch_Time:28541.0min: [2024-01-02 
22:00:54,244][model8_pretrain.py][INFO] Epoch:[0/2](41600/4588595) loss:3.219 lr:0.0001487 epoch_Time:28539.0min: [2024-01-02 22:00:54,244][model8_pretrain.py][INFO] Epoch:[0/2](41600/4588595) loss:3.184 lr:0.0001487 epoch_Time:28539.0min: [2024-01-02 22:00:54,244][model8_pretrain.py][INFO] Epoch:[0/2](41600/4588595) loss:3.516 lr:0.0001487 epoch_Time:28539.0min: [2024-01-02 22:00:54,244][model8_pretrain.py][INFO] Epoch:[0/2](41600/4588595) loss:2.955 lr:0.0001487 epoch_Time:28539.0min: [2024-01-02 22:00:54,244][model8_pretrain.py][INFO] Epoch:[0/2](41600/4588595) loss:3.345 lr:0.0001487 epoch_Time:28539.0min: [2024-01-02 22:00:54,244][model8_pretrain.py][INFO] Epoch:[0/2](41600/4588595) loss:3.256 lr:0.0001487 epoch_Time:28539.0min: [2024-01-02 22:00:54,244][model8_pretrain.py][INFO] Epoch:[0/2](41600/4588595) loss:3.187 lr:0.0001487 epoch_Time:28539.0min: [2024-01-02 22:00:54,244][model8_pretrain.py][INFO] Epoch:[0/2](41600/4588595) loss:2.919 lr:0.0001487 epoch_Time:28539.0min: [2024-01-02 22:01:31,179][model8_pretrain.py][INFO] Epoch:[0/2](41700/4588595) loss:3.653 lr:0.0001481 epoch_Time:28538.0min: [2024-01-02 22:01:31,179][model8_pretrain.py][INFO] Epoch:[0/2](41700/4588595) loss:3.445 lr:0.0001481 epoch_Time:28538.0min: [2024-01-02 22:01:31,179][model8_pretrain.py][INFO] Epoch:[0/2](41700/4588595) loss:3.073 lr:0.0001481 epoch_Time:28538.0min: [2024-01-02 22:01:31,179][model8_pretrain.py][INFO] Epoch:[0/2](41700/4588595) loss:2.978 lr:0.0001481 epoch_Time:28538.0min: [2024-01-02 22:01:31,179][model8_pretrain.py][INFO] Epoch:[0/2](41700/4588595) loss:2.961 lr:0.0001481 epoch_Time:28538.0min: [2024-01-02 22:01:31,179][model8_pretrain.py][INFO] Epoch:[0/2](41700/4588595) loss:2.918 lr:0.0001481 epoch_Time:28538.0min: [2024-01-02 22:01:31,179][model8_pretrain.py][INFO] Epoch:[0/2](41700/4588595) loss:3.401 lr:0.0001481 epoch_Time:28538.0min: [2024-01-02 22:01:31,179][model8_pretrain.py][INFO] Epoch:[0/2](41700/4588595) loss:3.612 lr:0.0001481 
epoch_Time:28538.0min: [2024-01-02 22:02:08,119][model8_pretrain.py][INFO] Epoch:[0/2](41800/4588595) loss:3.049 lr:0.0001475 epoch_Time:28535.0min: [2024-01-02 22:02:08,119][model8_pretrain.py][INFO] Epoch:[0/2](41800/4588595) loss:3.213 lr:0.0001475 epoch_Time:28535.0min: [2024-01-02 22:02:08,119][model8_pretrain.py][INFO] Epoch:[0/2](41800/4588595) loss:3.599 lr:0.0001475 epoch_Time:28535.0min: [2024-01-02 22:02:08,119][model8_pretrain.py][INFO] Epoch:[0/2](41800/4588595) loss:3.323 lr:0.0001475 epoch_Time:28535.0min: [2024-01-02 22:02:08,119][model8_pretrain.py][INFO] Epoch:[0/2](41800/4588595) loss:2.836 lr:0.0001475 epoch_Time:28535.0min: [2024-01-02 22:02:08,119][model8_pretrain.py][INFO] Epoch:[0/2](41800/4588595) loss:3.496 lr:0.0001475 epoch_Time:28535.0min: [2024-01-02 22:02:08,119][model8_pretrain.py][INFO] Epoch:[0/2](41800/4588595) loss:2.913 lr:0.0001475 epoch_Time:28535.0min: [2024-01-02 22:02:08,119][model8_pretrain.py][INFO] Epoch:[0/2](41800/4588595) loss:3.215 lr:0.0001475 epoch_Time:28535.0min: [2024-01-02 22:02:45,065][model8_pretrain.py][INFO] Epoch:[0/2](41900/4588595) loss:3.295 lr:0.0001469 epoch_Time:28534.0min: [2024-01-02 22:02:45,065][model8_pretrain.py][INFO] Epoch:[0/2](41900/4588595) loss:3.742 lr:0.0001469 epoch_Time:28534.0min: [2024-01-02 22:02:45,065][model8_pretrain.py][INFO] Epoch:[0/2](41900/4588595) loss:2.986 lr:0.0001469 epoch_Time:28534.0min: [2024-01-02 22:02:45,065][model8_pretrain.py][INFO] Epoch:[0/2](41900/4588595) loss:3.032 lr:0.0001469 epoch_Time:28534.0min: [2024-01-02 22:02:45,065][model8_pretrain.py][INFO] Epoch:[0/2](41900/4588595) loss:3.651 lr:0.0001469 epoch_Time:28534.0min: [2024-01-02 22:02:45,065][model8_pretrain.py][INFO] Epoch:[0/2](41900/4588595) loss:3.442 lr:0.0001469 epoch_Time:28534.0min: [2024-01-02 22:02:45,065][model8_pretrain.py][INFO] Epoch:[0/2](41900/4588595) loss:2.673 lr:0.0001469 epoch_Time:28534.0min: [2024-01-02 22:02:45,065][model8_pretrain.py][INFO] Epoch:[0/2](41900/4588595) 
loss:3.581 lr:0.0001469 epoch_Time:28534.0min: [2024-01-02 22:03:22,009][model8_pretrain.py][INFO] Epoch:[0/2](42000/4588595) loss:2.413 lr:0.0001464 epoch_Time:28532.0min: [2024-01-02 22:03:22,009][model8_pretrain.py][INFO] Epoch:[0/2](42000/4588595) loss:3.429 lr:0.0001464 epoch_Time:28532.0min: [2024-01-02 22:03:22,009][model8_pretrain.py][INFO] Epoch:[0/2](42000/4588595) loss:2.843 lr:0.0001464 epoch_Time:28532.0min: [2024-01-02 22:03:22,009][model8_pretrain.py][INFO] Epoch:[0/2](42000/4588595) loss:3.038 lr:0.0001464 epoch_Time:28532.0min: [2024-01-02 22:03:22,009][model8_pretrain.py][INFO] Epoch:[0/2](42000/4588595) loss:3.534 lr:0.0001464 epoch_Time:28532.0min: [2024-01-02 22:03:22,009][model8_pretrain.py][INFO] Epoch:[0/2](42000/4588595) loss:3.159 lr:0.0001464 epoch_Time:28532.0min: [2024-01-02 22:03:22,009][model8_pretrain.py][INFO] Epoch:[0/2](42000/4588595) loss:3.294 lr:0.0001464 epoch_Time:28532.0min: [2024-01-02 22:03:22,009][model8_pretrain.py][INFO] Epoch:[0/2](42000/4588595) loss:3.478 lr:0.0001464 epoch_Time:28532.0min: [2024-01-02 22:03:58,937][model8_pretrain.py][INFO] Epoch:[0/2](42100/4588595) loss:3.223 lr:0.0001458 epoch_Time:28530.0min: [2024-01-02 22:03:58,937][model8_pretrain.py][INFO] Epoch:[0/2](42100/4588595) loss:3.077 lr:0.0001458 epoch_Time:28529.0min: [2024-01-02 22:03:58,937][model8_pretrain.py][INFO] Epoch:[0/2](42100/4588595) loss:3.572 lr:0.0001458 epoch_Time:28530.0min: [2024-01-02 22:03:58,938][model8_pretrain.py][INFO] Epoch:[0/2](42100/4588595) loss:3.508 lr:0.0001458 epoch_Time:28530.0min: [2024-01-02 22:03:58,938][model8_pretrain.py][INFO] Epoch:[0/2](42100/4588595) loss:3.435 lr:0.0001458 epoch_Time:28529.0min: [2024-01-02 22:03:58,938][model8_pretrain.py][INFO] Epoch:[0/2](42100/4588595) loss:3.038 lr:0.0001458 epoch_Time:28529.0min: [2024-01-02 22:03:58,937][model8_pretrain.py][INFO] Epoch:[0/2](42100/4588595) loss:3.659 lr:0.0001458 epoch_Time:28529.0min: [2024-01-02 22:03:58,938][model8_pretrain.py][INFO] 
Epoch:[0/2](42100/4588595) loss:2.897 lr:0.0001458 epoch_Time:28529.0min: [2024-01-02 22:04:42,878][model8_pretrain.py][INFO] Epoch:[0/2](42200/4588595) loss:3.509 lr:0.0001452 epoch_Time:28541.0min: [2024-01-02 22:04:42,878][model8_pretrain.py][INFO] Epoch:[0/2](42200/4588595) loss:3.442 lr:0.0001452 epoch_Time:28541.0min: [2024-01-02 22:04:42,878][model8_pretrain.py][INFO] Epoch:[0/2](42200/4588595) loss:3.143 lr:0.0001452 epoch_Time:28541.0min: [2024-01-02 22:04:42,878][model8_pretrain.py][INFO] Epoch:[0/2](42200/4588595) loss:3.105 lr:0.0001452 epoch_Time:28541.0min: [2024-01-02 22:04:42,878][model8_pretrain.py][INFO] Epoch:[0/2](42200/4588595) loss:3.150 lr:0.0001452 epoch_Time:28541.0min: [2024-01-02 22:04:42,878][model8_pretrain.py][INFO] Epoch:[0/2](42200/4588595) loss:3.757 lr:0.0001452 epoch_Time:28541.0min: [2024-01-02 22:04:42,878][model8_pretrain.py][INFO] Epoch:[0/2](42200/4588595) loss:3.558 lr:0.0001452 epoch_Time:28541.0min: [2024-01-02 22:04:42,878][model8_pretrain.py][INFO] Epoch:[0/2](42200/4588595) loss:3.036 lr:0.0001452 epoch_Time:28541.0min: [2024-01-02 22:05:19,818][model8_pretrain.py][INFO] Epoch:[0/2](42300/4588595) loss:3.284 lr:0.0001446 epoch_Time:28539.0min: [2024-01-02 22:05:19,818][model8_pretrain.py][INFO] Epoch:[0/2](42300/4588595) loss:3.287 lr:0.0001446 epoch_Time:28539.0min: [2024-01-02 22:05:19,818][model8_pretrain.py][INFO] Epoch:[0/2](42300/4588595) loss:3.424 lr:0.0001446 epoch_Time:28539.0min: [2024-01-02 22:05:19,818][model8_pretrain.py][INFO] Epoch:[0/2](42300/4588595) loss:3.474 lr:0.0001446 epoch_Time:28539.0min: [2024-01-02 22:05:19,818][model8_pretrain.py][INFO] Epoch:[0/2](42300/4588595) loss:3.325 lr:0.0001446 epoch_Time:28539.0min: [2024-01-02 22:05:19,818][model8_pretrain.py][INFO] Epoch:[0/2](42300/4588595) loss:3.398 lr:0.0001446 epoch_Time:28539.0min: [2024-01-02 22:05:19,818][model8_pretrain.py][INFO] Epoch:[0/2](42300/4588595) loss:3.044 lr:0.0001446 epoch_Time:28539.0min: [2024-01-02 
22:05:19,818][model8_pretrain.py][INFO] Epoch:[0/2](42300/4588595) loss:3.302 lr:0.0001446 epoch_Time:28539.0min: [2024-01-02 22:05:56,753][model8_pretrain.py][INFO] Epoch:[0/2](42400/4588595) loss:3.298 lr:0.0001441 epoch_Time:28536.0min: [2024-01-02 22:05:56,753][model8_pretrain.py][INFO] Epoch:[0/2](42400/4588595) loss:3.366 lr:0.0001441 epoch_Time:28536.0min: [2024-01-02 22:05:56,753][model8_pretrain.py][INFO] Epoch:[0/2](42400/4588595) loss:3.362 lr:0.0001441 epoch_Time:28536.0min: [2024-01-02 22:05:56,753][model8_pretrain.py][INFO] Epoch:[0/2](42400/4588595) loss:3.286 lr:0.0001441 epoch_Time:28536.0min: [2024-01-02 22:05:56,753][model8_pretrain.py][INFO] Epoch:[0/2](42400/4588595) loss:3.034 lr:0.0001441 epoch_Time:28536.0min: [2024-01-02 22:05:56,753][model8_pretrain.py][INFO] Epoch:[0/2](42400/4588595) loss:3.507 lr:0.0001441 epoch_Time:28536.0min: [2024-01-02 22:05:56,753][model8_pretrain.py][INFO] Epoch:[0/2](42400/4588595) loss:2.562 lr:0.0001441 epoch_Time:28536.0min: [2024-01-02 22:05:56,753][model8_pretrain.py][INFO] Epoch:[0/2](42400/4588595) loss:3.620 lr:0.0001441 epoch_Time:28536.0min: [2024-01-02 22:06:33,697][model8_pretrain.py][INFO] Epoch:[0/2](42500/4588595) loss:3.278 lr:0.0001435 epoch_Time:28535.0min: [2024-01-02 22:06:33,697][model8_pretrain.py][INFO] Epoch:[0/2](42500/4588595) loss:3.550 lr:0.0001435 epoch_Time:28535.0min: [2024-01-02 22:06:33,697][model8_pretrain.py][INFO] Epoch:[0/2](42500/4588595) loss:3.241 lr:0.0001435 epoch_Time:28535.0min: [2024-01-02 22:06:33,697][model8_pretrain.py][INFO] Epoch:[0/2](42500/4588595) loss:3.354 lr:0.0001435 epoch_Time:28535.0min: [2024-01-02 22:06:33,697][model8_pretrain.py][INFO] Epoch:[0/2](42500/4588595) loss:2.832 lr:0.0001435 epoch_Time:28535.0min: [2024-01-02 22:06:33,697][model8_pretrain.py][INFO] Epoch:[0/2](42500/4588595) loss:3.140 lr:0.0001435 epoch_Time:28535.0min: [2024-01-02 22:06:33,697][model8_pretrain.py][INFO] Epoch:[0/2](42500/4588595) loss:3.019 lr:0.0001435 
epoch_Time:28535.0min: [2024-01-02 22:06:33,698][model8_pretrain.py][INFO] Epoch:[0/2](42500/4588595) loss:3.062 lr:0.0001435 epoch_Time:28535.0min: [2024-01-02 22:07:10,651][model8_pretrain.py][INFO] Epoch:[0/2](42600/4588595) loss:3.537 lr:0.0001429 epoch_Time:28533.0min: [2024-01-02 22:07:10,651][model8_pretrain.py][INFO] Epoch:[0/2](42600/4588595) loss:3.097 lr:0.0001429 epoch_Time:28533.0min: [2024-01-02 22:07:10,651][model8_pretrain.py][INFO] Epoch:[0/2](42600/4588595) loss:3.574 lr:0.0001429 epoch_Time:28533.0min: [2024-01-02 22:07:10,651][model8_pretrain.py][INFO] Epoch:[0/2](42600/4588595) loss:2.775 lr:0.0001429 epoch_Time:28533.0min: [2024-01-02 22:07:10,651][model8_pretrain.py][INFO] Epoch:[0/2](42600/4588595) loss:2.797 lr:0.0001429 epoch_Time:28533.0min: [2024-01-02 22:07:10,651][model8_pretrain.py][INFO] Epoch:[0/2](42600/4588595) loss:3.276 lr:0.0001429 epoch_Time:28533.0min: [2024-01-02 22:07:10,651][model8_pretrain.py][INFO] Epoch:[0/2](42600/4588595) loss:3.178 lr:0.0001429 epoch_Time:28533.0min: [2024-01-02 22:07:10,651][model8_pretrain.py][INFO] Epoch:[0/2](42600/4588595) loss:3.196 lr:0.0001429 epoch_Time:28533.0min: [2024-01-02 22:07:47,602][model8_pretrain.py][INFO] Epoch:[0/2](42700/4588595) loss:3.004 lr:0.0001423 epoch_Time:28531.0min: [2024-01-02 22:07:47,602][model8_pretrain.py][INFO] Epoch:[0/2](42700/4588595) loss:3.409 lr:0.0001423 epoch_Time:28530.0min: [2024-01-02 22:07:47,602][model8_pretrain.py][INFO] Epoch:[0/2](42700/4588595) loss:2.990 lr:0.0001423 epoch_Time:28530.0min: [2024-01-02 22:07:47,602][model8_pretrain.py][INFO] Epoch:[0/2](42700/4588595) loss:3.080 lr:0.0001423 epoch_Time:28531.0min: [2024-01-02 22:07:47,602][model8_pretrain.py][INFO] Epoch:[0/2](42700/4588595) loss:3.093 lr:0.0001423 epoch_Time:28530.0min: [2024-01-02 22:07:47,602][model8_pretrain.py][INFO] Epoch:[0/2](42700/4588595) loss:3.464 lr:0.0001423 epoch_Time:28531.0min: [2024-01-02 22:07:47,602][model8_pretrain.py][INFO] Epoch:[0/2](42700/4588595) 
loss:3.572 lr:0.0001423 epoch_Time:28530.0min: [2024-01-02 22:07:47,602][model8_pretrain.py][INFO] Epoch:[0/2](42700/4588595) loss:3.185 lr:0.0001423 epoch_Time:28530.0min: [2024-01-02 22:08:24,535][model8_pretrain.py][INFO] Epoch:[0/2](42800/4588595) loss:3.256 lr:0.0001418 epoch_Time:28529.0min: [2024-01-02 22:08:24,535][model8_pretrain.py][INFO] Epoch:[0/2](42800/4588595) loss:3.734 lr:0.0001418 epoch_Time:28529.0min: [2024-01-02 22:08:24,535][model8_pretrain.py][INFO] Epoch:[0/2](42800/4588595) loss:3.679 lr:0.0001418 epoch_Time:28529.0min: [2024-01-02 22:08:24,535][model8_pretrain.py][INFO] Epoch:[0/2](42800/4588595) loss:3.442 lr:0.0001418 epoch_Time:28529.0min: [2024-01-02 22:08:24,535][model8_pretrain.py][INFO] Epoch:[0/2](42800/4588595) loss:3.159 lr:0.0001418 epoch_Time:28529.0min: [2024-01-02 22:08:24,535][model8_pretrain.py][INFO] Epoch:[0/2](42800/4588595) loss:3.116 lr:0.0001418 epoch_Time:28529.0min: [2024-01-02 22:08:24,535][model8_pretrain.py][INFO] Epoch:[0/2](42800/4588595) loss:2.975 lr:0.0001418 epoch_Time:28529.0min: [2024-01-02 22:08:24,535][model8_pretrain.py][INFO] Epoch:[0/2](42800/4588595) loss:3.375 lr:0.0001418 epoch_Time:28529.0min: [2024-01-02 22:09:01,478][model8_pretrain.py][INFO] Epoch:[0/2](42900/4588595) loss:2.941 lr:0.0001412 epoch_Time:28527.0min: [2024-01-02 22:09:01,478][model8_pretrain.py][INFO] Epoch:[0/2](42900/4588595) loss:2.992 lr:0.0001412 epoch_Time:28527.0min: [2024-01-02 22:09:01,478][model8_pretrain.py][INFO] Epoch:[0/2](42900/4588595) loss:3.409 lr:0.0001412 epoch_Time:28527.0min: [2024-01-02 22:09:01,478][model8_pretrain.py][INFO] Epoch:[0/2](42900/4588595) loss:2.912 lr:0.0001412 epoch_Time:28527.0min: [2024-01-02 22:09:01,478][model8_pretrain.py][INFO] Epoch:[0/2](42900/4588595) loss:3.049 lr:0.0001412 epoch_Time:28527.0min: [2024-01-02 22:09:01,478][model8_pretrain.py][INFO] Epoch:[0/2](42900/4588595) loss:2.740 lr:0.0001412 epoch_Time:28527.0min: [2024-01-02 22:09:01,478][model8_pretrain.py][INFO] 
Epoch:[0/2](42900/4588595) loss:3.053 lr:0.0001412 epoch_Time:28527.0min: [2024-01-02 22:09:01,478][model8_pretrain.py][INFO] Epoch:[0/2](42900/4588595) loss:3.739 lr:0.0001412 epoch_Time:28527.0min: [2024-01-02 22:09:45,497][model8_pretrain.py][INFO] Epoch:[0/2](43000/4588595) loss:2.603 lr:0.0001406 epoch_Time:28538.0min: [2024-01-02 22:09:45,497][model8_pretrain.py][INFO] Epoch:[0/2](43000/4588595) loss:3.481 lr:0.0001406 epoch_Time:28538.0min: [2024-01-02 22:09:45,497][model8_pretrain.py][INFO] Epoch:[0/2](43000/4588595) loss:3.313 lr:0.0001406 epoch_Time:28538.0min: [2024-01-02 22:09:45,497][model8_pretrain.py][INFO] Epoch:[0/2](43000/4588595) loss:3.102 lr:0.0001406 epoch_Time:28538.0min: [2024-01-02 22:09:45,497][model8_pretrain.py][INFO] Epoch:[0/2](43000/4588595) loss:3.467 lr:0.0001406 epoch_Time:28538.0min: [2024-01-02 22:09:45,497][model8_pretrain.py][INFO] Epoch:[0/2](43000/4588595) loss:3.394 lr:0.0001406 epoch_Time:28538.0min: [2024-01-02 22:09:45,497][model8_pretrain.py][INFO] Epoch:[0/2](43000/4588595) loss:3.485 lr:0.0001406 epoch_Time:28538.0min: [2024-01-02 22:09:45,498][model8_pretrain.py][INFO] Epoch:[0/2](43000/4588595) loss:2.451 lr:0.0001406 epoch_Time:28538.0min: [2024-01-02 22:10:22,421][model8_pretrain.py][INFO] Epoch:[0/2](43100/4588595) loss:2.788 lr:0.0001400 epoch_Time:28536.0min: [2024-01-02 22:10:22,421][model8_pretrain.py][INFO] Epoch:[0/2](43100/4588595) loss:2.803 lr:0.0001400 epoch_Time:28536.0min: [2024-01-02 22:10:22,421][model8_pretrain.py][INFO] Epoch:[0/2](43100/4588595) loss:3.432 lr:0.0001400 epoch_Time:28536.0min: [2024-01-02 22:10:22,421][model8_pretrain.py][INFO] Epoch:[0/2](43100/4588595) loss:3.195 lr:0.0001400 epoch_Time:28536.0min: [2024-01-02 22:10:22,421][model8_pretrain.py][INFO] Epoch:[0/2](43100/4588595) loss:3.126 lr:0.0001400 epoch_Time:28536.0min: [2024-01-02 22:10:22,421][model8_pretrain.py][INFO] Epoch:[0/2](43100/4588595) loss:3.130 lr:0.0001400 epoch_Time:28536.0min: [2024-01-02 
22:10:22,421][model8_pretrain.py][INFO] Epoch:[0/2](43100/4588595) loss:3.996 lr:0.0001400 epoch_Time:28536.0min: [2024-01-02 22:10:22,421][model8_pretrain.py][INFO] Epoch:[0/2](43100/4588595) loss:3.516 lr:0.0001400 epoch_Time:28536.0min: [2024-01-02 22:10:59,352][model8_pretrain.py][INFO] Epoch:[0/2](43200/4588595) loss:3.106 lr:0.0001395 epoch_Time:28534.0min: [2024-01-02 22:10:59,352][model8_pretrain.py][INFO] Epoch:[0/2](43200/4588595) loss:3.378 lr:0.0001395 epoch_Time:28534.0min: [2024-01-02 22:10:59,352][model8_pretrain.py][INFO] Epoch:[0/2](43200/4588595) loss:3.127 lr:0.0001395 epoch_Time:28534.0min: [2024-01-02 22:10:59,352][model8_pretrain.py][INFO] Epoch:[0/2](43200/4588595) loss:3.318 lr:0.0001395 epoch_Time:28534.0min: [2024-01-02 22:10:59,353][model8_pretrain.py][INFO] Epoch:[0/2](43200/4588595) loss:3.027 lr:0.0001395 epoch_Time:28534.0min: [2024-01-02 22:10:59,353][model8_pretrain.py][INFO] Epoch:[0/2](43200/4588595) loss:3.291 lr:0.0001395 epoch_Time:28534.0min: [2024-01-02 22:10:59,353][model8_pretrain.py][INFO] Epoch:[0/2](43200/4588595) loss:2.703 lr:0.0001395 epoch_Time:28534.0min: [2024-01-02 22:10:59,353][model8_pretrain.py][INFO] Epoch:[0/2](43200/4588595) loss:3.152 lr:0.0001395 epoch_Time:28534.0min: [2024-01-02 22:11:36,288][model8_pretrain.py][INFO] Epoch:[0/2](43300/4588595) loss:3.569 lr:0.0001389 epoch_Time:28532.0min: [2024-01-02 22:11:36,288][model8_pretrain.py][INFO] Epoch:[0/2](43300/4588595) loss:3.503 lr:0.0001389 epoch_Time:28532.0min: [2024-01-02 22:11:36,288][model8_pretrain.py][INFO] Epoch:[0/2](43300/4588595) loss:3.105 lr:0.0001389 epoch_Time:28532.0min: [2024-01-02 22:11:36,288][model8_pretrain.py][INFO] Epoch:[0/2](43300/4588595) loss:3.268 lr:0.0001389 epoch_Time:28532.0min: [2024-01-02 22:11:36,288][model8_pretrain.py][INFO] Epoch:[0/2](43300/4588595) loss:3.661 lr:0.0001389 epoch_Time:28532.0min: [2024-01-02 22:11:36,288][model8_pretrain.py][INFO] Epoch:[0/2](43300/4588595) loss:3.498 lr:0.0001389 
epoch_Time:28532.0min: [2024-01-02 22:11:36,288][model8_pretrain.py][INFO] Epoch:[0/2](43300/4588595) loss:3.143 lr:0.0001389 epoch_Time:28532.0min: [2024-01-02 22:11:36,288][model8_pretrain.py][INFO] Epoch:[0/2](43300/4588595) loss:3.114 lr:0.0001389 epoch_Time:28532.0min: [2024-01-02 22:12:13,197][model8_pretrain.py][INFO] Epoch:[0/2](43400/4588595) loss:3.449 lr:0.0001383 epoch_Time:28530.0min: [2024-01-02 22:12:13,198][model8_pretrain.py][INFO] Epoch:[0/2](43400/4588595) loss:3.552 lr:0.0001383 epoch_Time:28530.0min: [2024-01-02 22:12:13,198][model8_pretrain.py][INFO] Epoch:[0/2](43400/4588595) loss:2.777 lr:0.0001383 epoch_Time:28530.0min: [2024-01-02 22:12:13,198][model8_pretrain.py][INFO] Epoch:[0/2](43400/4588595) loss:3.199 lr:0.0001383 epoch_Time:28530.0min: [2024-01-02 22:12:13,198][model8_pretrain.py][INFO] Epoch:[0/2](43400/4588595) loss:3.306 lr:0.0001383 epoch_Time:28530.0min: [2024-01-02 22:12:13,198][model8_pretrain.py][INFO] Epoch:[0/2](43400/4588595) loss:2.904 lr:0.0001383 epoch_Time:28530.0min: [2024-01-02 22:12:13,198][model8_pretrain.py][INFO] Epoch:[0/2](43400/4588595) loss:3.555 lr:0.0001383 epoch_Time:28530.0min: [2024-01-02 22:12:13,198][model8_pretrain.py][INFO] Epoch:[0/2](43400/4588595) loss:3.099 lr:0.0001383 epoch_Time:28530.0min: [2024-01-02 22:12:50,152][model8_pretrain.py][INFO] Epoch:[0/2](43500/4588595) loss:3.428 lr:0.0001377 epoch_Time:28528.0min: [2024-01-02 22:12:50,152][model8_pretrain.py][INFO] Epoch:[0/2](43500/4588595) loss:2.919 lr:0.0001377 epoch_Time:28528.0min: [2024-01-02 22:12:50,152][model8_pretrain.py][INFO] Epoch:[0/2](43500/4588595) loss:3.419 lr:0.0001377 epoch_Time:28528.0min: [2024-01-02 22:12:50,152][model8_pretrain.py][INFO] Epoch:[0/2](43500/4588595) loss:2.908 lr:0.0001377 epoch_Time:28528.0min: [2024-01-02 22:12:50,152][model8_pretrain.py][INFO] Epoch:[0/2](43500/4588595) loss:3.136 lr:0.0001377 epoch_Time:28528.0min: [2024-01-02 22:12:50,152][model8_pretrain.py][INFO] Epoch:[0/2](43500/4588595) 
loss:2.983 lr:0.0001377 epoch_Time:28528.0min: [2024-01-02 22:12:50,153][model8_pretrain.py][INFO] Epoch:[0/2](43500/4588595) loss:3.402 lr:0.0001377 epoch_Time:28528.0min: [2024-01-02 22:12:50,153][model8_pretrain.py][INFO] Epoch:[0/2](43500/4588595) loss:3.374 lr:0.0001377 epoch_Time:28528.0min: [2024-01-02 22:13:27,092][model8_pretrain.py][INFO] Epoch:[0/2](43600/4588595) loss:3.342 lr:0.0001372 epoch_Time:28526.0min: [2024-01-02 22:13:27,092][model8_pretrain.py][INFO] Epoch:[0/2](43600/4588595) loss:3.473 lr:0.0001372 epoch_Time:28526.0min: [2024-01-02 22:13:27,092][model8_pretrain.py][INFO] Epoch:[0/2](43600/4588595) loss:3.065 lr:0.0001372 epoch_Time:28527.0min: [2024-01-02 22:13:27,092][model8_pretrain.py][INFO] Epoch:[0/2](43600/4588595) loss:3.240 lr:0.0001372 epoch_Time:28526.0min: [2024-01-02 22:13:27,092][model8_pretrain.py][INFO] Epoch:[0/2](43600/4588595) loss:3.418 lr:0.0001372 epoch_Time:28526.0min: [2024-01-02 22:13:27,092][model8_pretrain.py][INFO] Epoch:[0/2](43600/4588595) loss:2.907 lr:0.0001372 epoch_Time:28526.0min: [2024-01-02 22:13:27,092][model8_pretrain.py][INFO] Epoch:[0/2](43600/4588595) loss:3.279 lr:0.0001372 epoch_Time:28526.0min: [2024-01-02 22:13:27,092][model8_pretrain.py][INFO] Epoch:[0/2](43600/4588595) loss:3.006 lr:0.0001372 epoch_Time:28526.0min: [2024-01-02 22:14:04,028][model8_pretrain.py][INFO] Epoch:[0/2](43700/4588595) loss:2.963 lr:0.0001366 epoch_Time:28524.0min: [2024-01-02 22:14:04,028][model8_pretrain.py][INFO] Epoch:[0/2](43700/4588595) loss:3.458 lr:0.0001366 epoch_Time:28524.0min: [2024-01-02 22:14:04,028][model8_pretrain.py][INFO] Epoch:[0/2](43700/4588595) loss:3.163 lr:0.0001366 epoch_Time:28524.0min: [2024-01-02 22:14:04,028][model8_pretrain.py][INFO] Epoch:[0/2](43700/4588595) loss:3.606 lr:0.0001366 epoch_Time:28524.0min: [2024-01-02 22:14:04,028][model8_pretrain.py][INFO] Epoch:[0/2](43700/4588595) loss:3.214 lr:0.0001366 epoch_Time:28524.0min: [2024-01-02 22:14:04,028][model8_pretrain.py][INFO] 
Epoch:[0/2](43700/4588595) loss:3.501 lr:0.0001366 epoch_Time:28524.0min: [2024-01-02 22:14:04,029][model8_pretrain.py][INFO] Epoch:[0/2](43700/4588595) loss:3.635 lr:0.0001366 epoch_Time:28524.0min: [2024-01-02 22:14:04,029][model8_pretrain.py][INFO] Epoch:[0/2](43700/4588595) loss:2.909 lr:0.0001366 epoch_Time:28524.0min: [2024-01-02 22:14:48,067][model8_pretrain.py][INFO] Epoch:[0/2](43800/4588595) loss:3.400 lr:0.0001360 epoch_Time:28534.0min: [2024-01-02 22:14:48,067][model8_pretrain.py][INFO] Epoch:[0/2](43800/4588595) loss:2.740 lr:0.0001360 epoch_Time:28534.0min: [2024-01-02 22:14:48,067][model8_pretrain.py][INFO] Epoch:[0/2](43800/4588595) loss:3.249 lr:0.0001360 epoch_Time:28534.0min: [2024-01-02 22:14:48,067][model8_pretrain.py][INFO] Epoch:[0/2](43800/4588595) loss:3.161 lr:0.0001360 epoch_Time:28534.0min: [2024-01-02 22:14:48,067][model8_pretrain.py][INFO] Epoch:[0/2](43800/4588595) loss:3.347 lr:0.0001360 epoch_Time:28534.0min: [2024-01-02 22:14:48,067][model8_pretrain.py][INFO] Epoch:[0/2](43800/4588595) loss:3.233 lr:0.0001360 epoch_Time:28534.0min: [2024-01-02 22:14:48,067][model8_pretrain.py][INFO] Epoch:[0/2](43800/4588595) loss:3.571 lr:0.0001360 epoch_Time:28534.0min: [2024-01-02 22:14:48,067][model8_pretrain.py][INFO] Epoch:[0/2](43800/4588595) loss:3.615 lr:0.0001360 epoch_Time:28534.0min: [2024-01-02 22:15:24,962][model8_pretrain.py][INFO] Epoch:[0/2](43900/4588595) loss:3.524 lr:0.0001355 epoch_Time:28533.0min: [2024-01-02 22:15:24,962][model8_pretrain.py][INFO] Epoch:[0/2](43900/4588595) loss:2.875 lr:0.0001355 epoch_Time:28533.0min: [2024-01-02 22:15:24,962][model8_pretrain.py][INFO] Epoch:[0/2](43900/4588595) loss:3.390 lr:0.0001355 epoch_Time:28533.0min: [2024-01-02 22:15:24,962][model8_pretrain.py][INFO] Epoch:[0/2](43900/4588595) loss:3.546 lr:0.0001355 epoch_Time:28533.0min: [2024-01-02 22:15:24,962][model8_pretrain.py][INFO] Epoch:[0/2](43900/4588595) loss:3.566 lr:0.0001355 epoch_Time:28533.0min: [2024-01-02 
22:15:24,962][model8_pretrain.py][INFO] Epoch:[0/2](43900/4588595) loss:3.500 lr:0.0001355 epoch_Time:28533.0min: [2024-01-02 22:15:24,962][model8_pretrain.py][INFO] Epoch:[0/2](43900/4588595) loss:3.802 lr:0.0001355 epoch_Time:28533.0min: [2024-01-02 22:15:24,962][model8_pretrain.py][INFO] Epoch:[0/2](43900/4588595) loss:2.721 lr:0.0001355 epoch_Time:28533.0min: [2024-01-02 22:16:01,888][model8_pretrain.py][INFO] Epoch:[0/2](44000/4588595) loss:3.657 lr:0.0001349 epoch_Time:28531.0min: [2024-01-02 22:16:01,888][model8_pretrain.py][INFO] Epoch:[0/2](44000/4588595) loss:3.572 lr:0.0001349 epoch_Time:28531.0min: [2024-01-02 22:16:01,888][model8_pretrain.py][INFO] Epoch:[0/2](44000/4588595) loss:3.300 lr:0.0001349 epoch_Time:28531.0min: [2024-01-02 22:16:01,888][model8_pretrain.py][INFO] Epoch:[0/2](44000/4588595) loss:3.349 lr:0.0001349 epoch_Time:28531.0min: [2024-01-02 22:16:01,888][model8_pretrain.py][INFO] Epoch:[0/2](44000/4588595) loss:3.402 lr:0.0001349 epoch_Time:28531.0min: [2024-01-02 22:16:01,888][model8_pretrain.py][INFO] Epoch:[0/2](44000/4588595) loss:3.267 lr:0.0001349 epoch_Time:28531.0min: [2024-01-02 22:16:01,889][model8_pretrain.py][INFO] Epoch:[0/2](44000/4588595) loss:2.758 lr:0.0001349 epoch_Time:28531.0min: [2024-01-02 22:16:01,889][model8_pretrain.py][INFO] Epoch:[0/2](44000/4588595) loss:3.382 lr:0.0001349 epoch_Time:28531.0min: [2024-01-02 22:16:38,831][model8_pretrain.py][INFO] Epoch:[0/2](44100/4588595) loss:2.654 lr:0.0001343 epoch_Time:28529.0min: [2024-01-02 22:16:38,831][model8_pretrain.py][INFO] Epoch:[0/2](44100/4588595) loss:3.488 lr:0.0001343 epoch_Time:28529.0min: [2024-01-02 22:16:38,832][model8_pretrain.py][INFO] Epoch:[0/2](44100/4588595) loss:3.457 lr:0.0001343 epoch_Time:28529.0min: [2024-01-02 22:16:38,832][model8_pretrain.py][INFO] Epoch:[0/2](44100/4588595) loss:3.224 lr:0.0001343 epoch_Time:28529.0min: [2024-01-02 22:16:38,832][model8_pretrain.py][INFO] Epoch:[0/2](44100/4588595) loss:2.856 lr:0.0001343 
epoch_Time:28529.0min: [2024-01-02 22:16:38,832][model8_pretrain.py][INFO] Epoch:[0/2](44100/4588595) loss:3.720 lr:0.0001343 epoch_Time:28529.0min: [2024-01-02 22:16:38,832][model8_pretrain.py][INFO] Epoch:[0/2](44100/4588595) loss:3.079 lr:0.0001343 epoch_Time:28529.0min: [2024-01-02 22:16:38,832][model8_pretrain.py][INFO] Epoch:[0/2](44100/4588595) loss:2.855 lr:0.0001343 epoch_Time:28529.0min: [2024-01-02 22:17:15,774][model8_pretrain.py][INFO] Epoch:[0/2](44200/4588595) loss:2.898 lr:0.0001337 epoch_Time:28527.0min: [2024-01-02 22:17:15,774][model8_pretrain.py][INFO] Epoch:[0/2](44200/4588595) loss:3.607 lr:0.0001337 epoch_Time:28527.0min: [2024-01-02 22:17:15,774][model8_pretrain.py][INFO] Epoch:[0/2](44200/4588595) loss:3.113 lr:0.0001337 epoch_Time:28527.0min: [2024-01-02 22:17:15,774][model8_pretrain.py][INFO] Epoch:[0/2](44200/4588595) loss:2.850 lr:0.0001337 epoch_Time:28527.0min: [2024-01-02 22:17:15,774][model8_pretrain.py][INFO] Epoch:[0/2](44200/4588595) loss:3.482 lr:0.0001337 epoch_Time:28527.0min: [2024-01-02 22:17:15,774][model8_pretrain.py][INFO] Epoch:[0/2](44200/4588595) loss:3.211 lr:0.0001337 epoch_Time:28527.0min: [2024-01-02 22:17:15,774][model8_pretrain.py][INFO] Epoch:[0/2](44200/4588595) loss:3.291 lr:0.0001337 epoch_Time:28527.0min: [2024-01-02 22:17:15,775][model8_pretrain.py][INFO] Epoch:[0/2](44200/4588595) loss:3.270 lr:0.0001337 epoch_Time:28527.0min: [2024-01-02 22:17:52,720][model8_pretrain.py][INFO] Epoch:[0/2](44300/4588595) loss:3.419 lr:0.0001332 epoch_Time:28525.0min: [2024-01-02 22:17:52,720][model8_pretrain.py][INFO] Epoch:[0/2](44300/4588595) loss:2.916 lr:0.0001332 epoch_Time:28525.0min: [2024-01-02 22:17:52,720][model8_pretrain.py][INFO] Epoch:[0/2](44300/4588595) loss:3.390 lr:0.0001332 epoch_Time:28525.0min: [2024-01-02 22:17:52,720][model8_pretrain.py][INFO] Epoch:[0/2](44300/4588595) loss:3.178 lr:0.0001332 epoch_Time:28525.0min: [2024-01-02 22:17:52,720][model8_pretrain.py][INFO] Epoch:[0/2](44300/4588595) 
loss:2.858 lr:0.0001332 epoch_Time:28525.0min: [2024-01-02 22:17:52,720][model8_pretrain.py][INFO] Epoch:[0/2](44300/4588595) loss:3.301 lr:0.0001332 epoch_Time:28525.0min: [2024-01-02 22:17:52,720][model8_pretrain.py][INFO] Epoch:[0/2](44300/4588595) loss:3.401 lr:0.0001332 epoch_Time:28525.0min: [2024-01-02 22:17:52,720][model8_pretrain.py][INFO] Epoch:[0/2](44300/4588595) loss:3.616 lr:0.0001332 epoch_Time:28525.0min: [2024-01-02 22:18:29,660][model8_pretrain.py][INFO] Epoch:[0/2](44400/4588595) loss:3.289 lr:0.0001326 epoch_Time:28524.0min: [2024-01-02 22:18:29,660][model8_pretrain.py][INFO] Epoch:[0/2](44400/4588595) loss:3.216 lr:0.0001326 epoch_Time:28524.0min: [2024-01-02 22:18:29,660][model8_pretrain.py][INFO] Epoch:[0/2](44400/4588595) loss:2.842 lr:0.0001326 epoch_Time:28524.0min: [2024-01-02 22:18:29,660][model8_pretrain.py][INFO] Epoch:[0/2](44400/4588595) loss:3.152 lr:0.0001326 epoch_Time:28524.0min: [2024-01-02 22:18:29,660][model8_pretrain.py][INFO] Epoch:[0/2](44400/4588595) loss:3.246 lr:0.0001326 epoch_Time:28524.0min: [2024-01-02 22:18:29,660][model8_pretrain.py][INFO] Epoch:[0/2](44400/4588595) loss:3.245 lr:0.0001326 epoch_Time:28524.0min: [2024-01-02 22:18:29,661][model8_pretrain.py][INFO] Epoch:[0/2](44400/4588595) loss:3.430 lr:0.0001326 epoch_Time:28524.0min: [2024-01-02 22:18:29,661][model8_pretrain.py][INFO] Epoch:[0/2](44400/4588595) loss:3.429 lr:0.0001326 epoch_Time:28524.0min: [2024-01-02 22:19:06,622][model8_pretrain.py][INFO] Epoch:[0/2](44500/4588595) loss:3.617 lr:0.0001320 epoch_Time:28521.0min: [2024-01-02 22:19:06,623][model8_pretrain.py][INFO] Epoch:[0/2](44500/4588595) loss:3.494 lr:0.0001320 epoch_Time:28521.0min: [2024-01-02 22:19:06,623][model8_pretrain.py][INFO] Epoch:[0/2](44500/4588595) loss:3.162 lr:0.0001320 epoch_Time:28522.0min: [2024-01-02 22:19:06,623][model8_pretrain.py][INFO] Epoch:[0/2](44500/4588595) loss:3.564 lr:0.0001320 epoch_Time:28521.0min: [2024-01-02 22:19:06,623][model8_pretrain.py][INFO] 
Epoch:[0/2](44500/4588595) loss:3.677 lr:0.0001320 epoch_Time:28522.0min: [2024-01-02 22:19:06,623][model8_pretrain.py][INFO] Epoch:[0/2](44500/4588595) loss:2.973 lr:0.0001320 epoch_Time:28521.0min: [2024-01-02 22:19:06,623][model8_pretrain.py][INFO] Epoch:[0/2](44500/4588595) loss:3.581 lr:0.0001320 epoch_Time:28521.0min: [2024-01-02 22:19:06,623][model8_pretrain.py][INFO] Epoch:[0/2](44500/4588595) loss:3.242 lr:0.0001320 epoch_Time:28521.0min: [2024-01-02 22:19:50,860][model8_pretrain.py][INFO] Epoch:[0/2](44600/4588595) loss:3.693 lr:0.0001315 epoch_Time:28532.0min: [2024-01-02 22:19:50,860][model8_pretrain.py][INFO] Epoch:[0/2](44600/4588595) loss:3.595 lr:0.0001315 epoch_Time:28532.0min: [2024-01-02 22:19:50,860][model8_pretrain.py][INFO] Epoch:[0/2](44600/4588595) loss:2.703 lr:0.0001315 epoch_Time:28532.0min: [2024-01-02 22:19:50,860][model8_pretrain.py][INFO] Epoch:[0/2](44600/4588595) loss:2.945 lr:0.0001315 epoch_Time:28532.0min: [2024-01-02 22:19:50,861][model8_pretrain.py][INFO] Epoch:[0/2](44600/4588595) loss:3.316 lr:0.0001315 epoch_Time:28532.0min: [2024-01-02 22:19:50,861][model8_pretrain.py][INFO] Epoch:[0/2](44600/4588595) loss:2.930 lr:0.0001315 epoch_Time:28532.0min: [2024-01-02 22:19:50,861][model8_pretrain.py][INFO] Epoch:[0/2](44600/4588595) loss:3.023 lr:0.0001315 epoch_Time:28532.0min: [2024-01-02 22:19:50,861][model8_pretrain.py][INFO] Epoch:[0/2](44600/4588595) loss:3.144 lr:0.0001315 epoch_Time:28532.0min: [2024-01-02 22:20:27,787][model8_pretrain.py][INFO] Epoch:[0/2](44700/4588595) loss:3.320 lr:0.0001309 epoch_Time:28530.0min: [2024-01-02 22:20:27,787][model8_pretrain.py][INFO] Epoch:[0/2](44700/4588595) loss:3.246 lr:0.0001309 epoch_Time:28531.0min: [2024-01-02 22:20:27,787][model8_pretrain.py][INFO] Epoch:[0/2](44700/4588595) loss:3.855 lr:0.0001309 epoch_Time:28530.0min: [2024-01-02 22:20:27,787][model8_pretrain.py][INFO] Epoch:[0/2](44700/4588595) loss:3.283 lr:0.0001309 epoch_Time:28530.0min: [2024-01-02 
22:20:27,787][model8_pretrain.py][INFO] Epoch:[0/2](44700/4588595) loss:3.633 lr:0.0001309 epoch_Time:28530.0min: [2024-01-02 22:20:27,787][model8_pretrain.py][INFO] Epoch:[0/2](44700/4588595) loss:3.018 lr:0.0001309 epoch_Time:28530.0min: [2024-01-02 22:20:27,787][model8_pretrain.py][INFO] Epoch:[0/2](44700/4588595) loss:2.943 lr:0.0001309 epoch_Time:28531.0min: [2024-01-02 22:20:27,787][model8_pretrain.py][INFO] Epoch:[0/2](44700/4588595) loss:3.232 lr:0.0001309 epoch_Time:28530.0min: [2024-01-02 22:21:04,712][model8_pretrain.py][INFO] Epoch:[0/2](44800/4588595) loss:3.144 lr:0.0001303 epoch_Time:28528.0min: [2024-01-02 22:21:04,712][model8_pretrain.py][INFO] Epoch:[0/2](44800/4588595) loss:3.218 lr:0.0001303 epoch_Time:28528.0min: [2024-01-02 22:21:04,712][model8_pretrain.py][INFO] Epoch:[0/2](44800/4588595) loss:3.450 lr:0.0001303 epoch_Time:28528.0min: [2024-01-02 22:21:04,712][model8_pretrain.py][INFO] Epoch:[0/2](44800/4588595) loss:3.334 lr:0.0001303 epoch_Time:28528.0min: [2024-01-02 22:21:04,712][model8_pretrain.py][INFO] Epoch:[0/2](44800/4588595) loss:3.377 lr:0.0001303 epoch_Time:28528.0min: [2024-01-02 22:21:04,712][model8_pretrain.py][INFO] Epoch:[0/2](44800/4588595) loss:3.190 lr:0.0001303 epoch_Time:28528.0min: [2024-01-02 22:21:04,712][model8_pretrain.py][INFO] Epoch:[0/2](44800/4588595) loss:3.384 lr:0.0001303 epoch_Time:28528.0min: [2024-01-02 22:21:04,712][model8_pretrain.py][INFO] Epoch:[0/2](44800/4588595) loss:3.305 lr:0.0001303 epoch_Time:28528.0min: [2024-01-02 22:21:41,641][model8_pretrain.py][INFO] Epoch:[0/2](44900/4588595) loss:2.995 lr:0.0001298 epoch_Time:28527.0min: [2024-01-02 22:21:41,641][model8_pretrain.py][INFO] Epoch:[0/2](44900/4588595) loss:3.086 lr:0.0001298 epoch_Time:28527.0min: [2024-01-02 22:21:41,641][model8_pretrain.py][INFO] Epoch:[0/2](44900/4588595) loss:3.650 lr:0.0001298 epoch_Time:28527.0min: [2024-01-02 22:21:41,641][model8_pretrain.py][INFO] Epoch:[0/2](44900/4588595) loss:3.058 lr:0.0001298 
epoch_Time:28527.0min: [2024-01-02 22:21:41,641][model8_pretrain.py][INFO] Epoch:[0/2](44900/4588595) loss:2.526 lr:0.0001298 epoch_Time:28527.0min: [2024-01-02 22:21:41,641][model8_pretrain.py][INFO] Epoch:[0/2](44900/4588595) loss:2.873 lr:0.0001298 epoch_Time:28527.0min: [2024-01-02 22:21:41,641][model8_pretrain.py][INFO] Epoch:[0/2](44900/4588595) loss:3.375 lr:0.0001298 epoch_Time:28527.0min: [2024-01-02 22:21:41,641][model8_pretrain.py][INFO] Epoch:[0/2](44900/4588595) loss:3.254 lr:0.0001298 epoch_Time:28527.0min: [2024-01-02 22:22:18,586][model8_pretrain.py][INFO] Epoch:[0/2](45000/4588595) loss:2.738 lr:0.0001292 epoch_Time:28525.0min: [2024-01-02 22:22:18,586][model8_pretrain.py][INFO] Epoch:[0/2](45000/4588595) loss:3.121 lr:0.0001292 epoch_Time:28525.0min: [2024-01-02 22:22:18,586][model8_pretrain.py][INFO] Epoch:[0/2](45000/4588595) loss:3.508 lr:0.0001292 epoch_Time:28525.0min: [2024-01-02 22:22:18,586][model8_pretrain.py][INFO] Epoch:[0/2](45000/4588595) loss:3.505 lr:0.0001292 epoch_Time:28525.0min: [2024-01-02 22:22:18,586][model8_pretrain.py][INFO] Epoch:[0/2](45000/4588595) loss:3.342 lr:0.0001292 epoch_Time:28525.0min: [2024-01-02 22:22:18,586][model8_pretrain.py][INFO] Epoch:[0/2](45000/4588595) loss:2.376 lr:0.0001292 epoch_Time:28525.0min: [2024-01-02 22:22:18,586][model8_pretrain.py][INFO] Epoch:[0/2](45000/4588595) loss:2.944 lr:0.0001292 epoch_Time:28525.0min: [2024-01-02 22:22:18,586][model8_pretrain.py][INFO] Epoch:[0/2](45000/4588595) loss:3.459 lr:0.0001292 epoch_Time:28525.0min: [2024-01-02 22:22:55,532][model8_pretrain.py][INFO] Epoch:[0/2](45100/4588595) loss:3.127 lr:0.0001286 epoch_Time:28522.0min: [2024-01-02 22:22:55,532][model8_pretrain.py][INFO] Epoch:[0/2](45100/4588595) loss:3.352 lr:0.0001286 epoch_Time:28523.0min: [2024-01-02 22:22:55,532][model8_pretrain.py][INFO] Epoch:[0/2](45100/4588595) loss:3.158 lr:0.0001286 epoch_Time:28522.0min: [2024-01-02 22:22:55,532][model8_pretrain.py][INFO] Epoch:[0/2](45100/4588595) 
loss:3.647 lr:0.0001286 epoch_Time:28523.0min: [2024-01-02 22:22:55,532][model8_pretrain.py][INFO] Epoch:[0/2](45100/4588595) loss:3.040 lr:0.0001286 epoch_Time:28522.0min: [2024-01-02 22:22:55,532][model8_pretrain.py][INFO] Epoch:[0/2](45100/4588595) loss:3.321 lr:0.0001286 epoch_Time:28522.0min: [2024-01-02 22:22:55,532][model8_pretrain.py][INFO] Epoch:[0/2](45100/4588595) loss:3.344 lr:0.0001286 epoch_Time:28522.0min: [2024-01-02 22:22:55,532][model8_pretrain.py][INFO] Epoch:[0/2](45100/4588595) loss:3.494 lr:0.0001286 epoch_Time:28522.0min: [2024-01-02 22:23:32,481][model8_pretrain.py][INFO] Epoch:[0/2](45200/4588595) loss:3.028 lr:0.0001281 epoch_Time:28521.0min: [2024-01-02 22:23:32,481][model8_pretrain.py][INFO] Epoch:[0/2](45200/4588595) loss:3.116 lr:0.0001281 epoch_Time:28521.0min: [2024-01-02 22:23:32,481][model8_pretrain.py][INFO] Epoch:[0/2](45200/4588595) loss:3.044 lr:0.0001281 epoch_Time:28521.0min: [2024-01-02 22:23:32,481][model8_pretrain.py][INFO] Epoch:[0/2](45200/4588595) loss:3.697 lr:0.0001281 epoch_Time:28521.0min: [2024-01-02 22:23:32,481][model8_pretrain.py][INFO] Epoch:[0/2](45200/4588595) loss:3.388 lr:0.0001281 epoch_Time:28521.0min: [2024-01-02 22:23:32,481][model8_pretrain.py][INFO] Epoch:[0/2](45200/4588595) loss:3.430 lr:0.0001281 epoch_Time:28521.0min: [2024-01-02 22:23:32,482][model8_pretrain.py][INFO] Epoch:[0/2](45200/4588595) loss:3.333 lr:0.0001281 epoch_Time:28521.0min: [2024-01-02 22:23:32,484][model8_pretrain.py][INFO] Epoch:[0/2](45200/4588595) loss:3.036 lr:0.0001281 epoch_Time:28521.0min: [2024-01-02 22:24:09,448][model8_pretrain.py][INFO] Epoch:[0/2](45300/4588595) loss:3.137 lr:0.0001275 epoch_Time:28519.0min: [2024-01-02 22:24:09,448][model8_pretrain.py][INFO] Epoch:[0/2](45300/4588595) loss:2.940 lr:0.0001275 epoch_Time:28519.0min: [2024-01-02 22:24:09,448][model8_pretrain.py][INFO] Epoch:[0/2](45300/4588595) loss:3.082 lr:0.0001275 epoch_Time:28519.0min: [2024-01-02 22:24:09,448][model8_pretrain.py][INFO] 
Epoch:[0/2](45300/4588595) loss:3.342 lr:0.0001275 epoch_Time:28519.0min: [2024-01-02 22:24:09,448][model8_pretrain.py][INFO] Epoch:[0/2](45300/4588595) loss:3.382 lr:0.0001275 epoch_Time:28519.0min: [2024-01-02 22:24:09,448][model8_pretrain.py][INFO] Epoch:[0/2](45300/4588595) loss:3.390 lr:0.0001275 epoch_Time:28519.0min: [2024-01-02 22:24:09,448][model8_pretrain.py][INFO] Epoch:[0/2](45300/4588595) loss:3.146 lr:0.0001275 epoch_Time:28519.0min: [2024-01-02 22:24:09,448][model8_pretrain.py][INFO] Epoch:[0/2](45300/4588595) loss:2.797 lr:0.0001275 epoch_Time:28519.0min: [2024-01-02 22:24:53,424][model8_pretrain.py][INFO] Epoch:[0/2](45400/4588595) loss:3.295 lr:0.0001269 epoch_Time:28529.0min: [2024-01-02 22:24:53,424][model8_pretrain.py][INFO] Epoch:[0/2](45400/4588595) loss:3.759 lr:0.0001269 epoch_Time:28529.0min: [2024-01-02 22:24:53,424][model8_pretrain.py][INFO] Epoch:[0/2](45400/4588595) loss:3.448 lr:0.0001269 epoch_Time:28529.0min: [2024-01-02 22:24:53,424][model8_pretrain.py][INFO] Epoch:[0/2](45400/4588595) loss:3.791 lr:0.0001269 epoch_Time:28529.0min: [2024-01-02 22:24:53,424][model8_pretrain.py][INFO] Epoch:[0/2](45400/4588595) loss:3.376 lr:0.0001269 epoch_Time:28529.0min: [2024-01-02 22:24:53,424][model8_pretrain.py][INFO] Epoch:[0/2](45400/4588595) loss:3.579 lr:0.0001269 epoch_Time:28529.0min: [2024-01-02 22:24:53,424][model8_pretrain.py][INFO] Epoch:[0/2](45400/4588595) loss:3.450 lr:0.0001269 epoch_Time:28529.0min: [2024-01-02 22:24:53,424][model8_pretrain.py][INFO] Epoch:[0/2](45400/4588595) loss:3.096 lr:0.0001269 epoch_Time:28529.0min: [2024-01-02 22:25:30,351][model8_pretrain.py][INFO] Epoch:[0/2](45500/4588595) loss:3.191 lr:0.0001264 epoch_Time:28527.0min: [2024-01-02 22:25:30,351][model8_pretrain.py][INFO] Epoch:[0/2](45500/4588595) loss:3.387 lr:0.0001264 epoch_Time:28527.0min: [2024-01-02 22:25:30,351][model8_pretrain.py][INFO] Epoch:[0/2](45500/4588595) loss:3.460 lr:0.0001264 epoch_Time:28527.0min: [2024-01-02 
22:25:30,351][model8_pretrain.py][INFO] Epoch:[0/2](45500/4588595) loss:2.903 lr:0.0001264 epoch_Time:28528.0min: [2024-01-02 22:25:30,351][model8_pretrain.py][INFO] Epoch:[0/2](45500/4588595) loss:3.087 lr:0.0001264 epoch_Time:28527.0min: [2024-01-02 22:25:30,351][model8_pretrain.py][INFO] Epoch:[0/2](45500/4588595) loss:3.484 lr:0.0001264 epoch_Time:28527.0min: [2024-01-02 22:25:30,351][model8_pretrain.py][INFO] Epoch:[0/2](45500/4588595) loss:3.139 lr:0.0001264 epoch_Time:28527.0min: [2024-01-02 22:25:30,351][model8_pretrain.py][INFO] Epoch:[0/2](45500/4588595) loss:2.978 lr:0.0001264 epoch_Time:28527.0min: [2024-01-02 22:26:07,300][model8_pretrain.py][INFO] Epoch:[0/2](45600/4588595) loss:3.372 lr:0.0001258 epoch_Time:28525.0min: [2024-01-02 22:26:07,300][model8_pretrain.py][INFO] Epoch:[0/2](45600/4588595) loss:2.825 lr:0.0001258 epoch_Time:28525.0min: [2024-01-02 22:26:07,300][model8_pretrain.py][INFO] Epoch:[0/2](45600/4588595) loss:3.260 lr:0.0001258 epoch_Time:28525.0min: [2024-01-02 22:26:07,300][model8_pretrain.py][INFO] Epoch:[0/2](45600/4588595) loss:2.686 lr:0.0001258 epoch_Time:28525.0min: [2024-01-02 22:26:07,300][model8_pretrain.py][INFO] Epoch:[0/2](45600/4588595) loss:3.172 lr:0.0001258 epoch_Time:28525.0min: [2024-01-02 22:26:07,300][model8_pretrain.py][INFO] Epoch:[0/2](45600/4588595) loss:3.463 lr:0.0001258 epoch_Time:28525.0min: [2024-01-02 22:26:07,300][model8_pretrain.py][INFO] Epoch:[0/2](45600/4588595) loss:3.341 lr:0.0001258 epoch_Time:28525.0min: [2024-01-02 22:26:07,301][model8_pretrain.py][INFO] Epoch:[0/2](45600/4588595) loss:3.166 lr:0.0001258 epoch_Time:28525.0min: [2024-01-02 22:26:44,235][model8_pretrain.py][INFO] Epoch:[0/2](45700/4588595) loss:3.259 lr:0.0001252 epoch_Time:28524.0min: [2024-01-02 22:26:44,235][model8_pretrain.py][INFO] Epoch:[0/2](45700/4588595) loss:3.410 lr:0.0001252 epoch_Time:28524.0min: [2024-01-02 22:26:44,235][model8_pretrain.py][INFO] Epoch:[0/2](45700/4588595) loss:3.090 lr:0.0001252 
epoch_Time:28524.0min: [2024-01-02 22:26:44,235][model8_pretrain.py][INFO] Epoch:[0/2](45700/4588595) loss:3.220 lr:0.0001252 epoch_Time:28524.0min: [2024-01-02 22:26:44,235][model8_pretrain.py][INFO] Epoch:[0/2](45700/4588595) loss:3.588 lr:0.0001252 epoch_Time:28524.0min: [2024-01-02 22:26:44,235][model8_pretrain.py][INFO] Epoch:[0/2](45700/4588595) loss:3.370 lr:0.0001252 epoch_Time:28524.0min: [2024-01-02 22:26:44,235][model8_pretrain.py][INFO] Epoch:[0/2](45700/4588595) loss:3.272 lr:0.0001252 epoch_Time:28524.0min: [2024-01-02 22:26:44,235][model8_pretrain.py][INFO] Epoch:[0/2](45700/4588595) loss:3.335 lr:0.0001252 epoch_Time:28524.0min: [2024-01-02 22:27:21,173][model8_pretrain.py][INFO] Epoch:[0/2](45800/4588595) loss:3.281 lr:0.0001247 epoch_Time:28522.0min: [2024-01-02 22:27:21,173][model8_pretrain.py][INFO] Epoch:[0/2](45800/4588595) loss:2.997 lr:0.0001247 epoch_Time:28522.0min: [2024-01-02 22:27:21,173][model8_pretrain.py][INFO] Epoch:[0/2](45800/4588595) loss:2.310 lr:0.0001247 epoch_Time:28522.0min: [2024-01-02 22:27:21,173][model8_pretrain.py][INFO] Epoch:[0/2](45800/4588595) loss:2.987 lr:0.0001247 epoch_Time:28522.0min: [2024-01-02 22:27:21,173][model8_pretrain.py][INFO] Epoch:[0/2](45800/4588595) loss:3.361 lr:0.0001247 epoch_Time:28522.0min: [2024-01-02 22:27:21,173][model8_pretrain.py][INFO] Epoch:[0/2](45800/4588595) loss:3.467 lr:0.0001247 epoch_Time:28522.0min: [2024-01-02 22:27:21,173][model8_pretrain.py][INFO] Epoch:[0/2](45800/4588595) loss:3.412 lr:0.0001247 epoch_Time:28522.0min: [2024-01-02 22:27:21,173][model8_pretrain.py][INFO] Epoch:[0/2](45800/4588595) loss:3.130 lr:0.0001247 epoch_Time:28522.0min: [2024-01-02 22:27:58,123][model8_pretrain.py][INFO] Epoch:[0/2](45900/4588595) loss:3.404 lr:0.0001241 epoch_Time:28520.0min: [2024-01-02 22:27:58,123][model8_pretrain.py][INFO] Epoch:[0/2](45900/4588595) loss:3.130 lr:0.0001241 epoch_Time:28520.0min: [2024-01-02 22:27:58,123][model8_pretrain.py][INFO] Epoch:[0/2](45900/4588595) 
loss:3.607 lr:0.0001241 epoch_Time:28520.0min: [2024-01-02 22:27:58,123][model8_pretrain.py][INFO] Epoch:[0/2](45900/4588595) loss:3.621 lr:0.0001241 epoch_Time:28520.0min: [2024-01-02 22:27:58,123][model8_pretrain.py][INFO] Epoch:[0/2](45900/4588595) loss:3.042 lr:0.0001241 epoch_Time:28520.0min: [2024-01-02 22:27:58,123][model8_pretrain.py][INFO] Epoch:[0/2](45900/4588595) loss:3.163 lr:0.0001241 epoch_Time:28520.0min: [2024-01-02 22:27:58,123][model8_pretrain.py][INFO] Epoch:[0/2](45900/4588595) loss:3.002 lr:0.0001241 epoch_Time:28520.0min: [2024-01-02 22:27:58,123][model8_pretrain.py][INFO] Epoch:[0/2](45900/4588595) loss:3.209 lr:0.0001241 epoch_Time:28520.0min: [2024-01-02 22:28:35,069][model8_pretrain.py][INFO] Epoch:[0/2](46000/4588595) loss:2.468 lr:0.0001235 epoch_Time:28518.0min: [2024-01-02 22:28:35,069][model8_pretrain.py][INFO] Epoch:[0/2](46000/4588595) loss:3.579 lr:0.0001235 epoch_Time:28518.0min: [2024-01-02 22:28:35,069][model8_pretrain.py][INFO] Epoch:[0/2](46000/4588595) loss:3.400 lr:0.0001235 epoch_Time:28518.0min: [2024-01-02 22:28:35,069][model8_pretrain.py][INFO] Epoch:[0/2](46000/4588595) loss:3.549 lr:0.0001235 epoch_Time:28518.0min: [2024-01-02 22:28:35,069][model8_pretrain.py][INFO] Epoch:[0/2](46000/4588595) loss:3.424 lr:0.0001235 epoch_Time:28518.0min: [2024-01-02 22:28:35,069][model8_pretrain.py][INFO] Epoch:[0/2](46000/4588595) loss:3.614 lr:0.0001235 epoch_Time:28518.0min: [2024-01-02 22:28:35,069][model8_pretrain.py][INFO] Epoch:[0/2](46000/4588595) loss:3.040 lr:0.0001235 epoch_Time:28518.0min: [2024-01-02 22:28:35,069][model8_pretrain.py][INFO] Epoch:[0/2](46000/4588595) loss:3.022 lr:0.0001235 epoch_Time:28518.0min: [2024-01-02 22:29:12,018][model8_pretrain.py][INFO] Epoch:[0/2](46100/4588595) loss:3.512 lr:0.0001230 epoch_Time:28516.0min: [2024-01-02 22:29:12,019][model8_pretrain.py][INFO] Epoch:[0/2](46100/4588595) loss:3.354 lr:0.0001230 epoch_Time:28516.0min: [2024-01-02 22:29:12,019][model8_pretrain.py][INFO] 
Epoch:[0/2](46100/4588595) loss:2.869 lr:0.0001230 epoch_Time:28516.0min: [2024-01-02 22:29:12,019][model8_pretrain.py][INFO] Epoch:[0/2](46100/4588595) loss:3.144 lr:0.0001230 epoch_Time:28516.0min: [2024-01-02 22:29:12,019][model8_pretrain.py][INFO] Epoch:[0/2](46100/4588595) loss:3.023 lr:0.0001230 epoch_Time:28516.0min: [2024-01-02 22:29:12,019][model8_pretrain.py][INFO] Epoch:[0/2](46100/4588595) loss:3.316 lr:0.0001230 epoch_Time:28516.0min: [2024-01-02 22:29:12,019][model8_pretrain.py][INFO] Epoch:[0/2](46100/4588595) loss:3.497 lr:0.0001230 epoch_Time:28516.0min: [2024-01-02 22:29:12,019][model8_pretrain.py][INFO] Epoch:[0/2](46100/4588595) loss:3.675 lr:0.0001230 epoch_Time:28516.0min: [2024-01-02 22:29:54,258][model8_pretrain.py][INFO] Epoch:[0/2](46200/4588595) loss:3.458 lr:0.0001224 epoch_Time:28523.0min: [2024-01-02 22:29:54,258][model8_pretrain.py][INFO] Epoch:[0/2](46200/4588595) loss:3.426 lr:0.0001224 epoch_Time:28523.0min: [2024-01-02 22:29:54,258][model8_pretrain.py][INFO] Epoch:[0/2](46200/4588595) loss:3.500 lr:0.0001224 epoch_Time:28523.0min: [2024-01-02 22:29:54,258][model8_pretrain.py][INFO] Epoch:[0/2](46200/4588595) loss:3.097 lr:0.0001224 epoch_Time:28523.0min: [2024-01-02 22:29:54,262][model8_pretrain.py][INFO] Epoch:[0/2](46200/4588595) loss:2.647 lr:0.0001224 epoch_Time:28523.0min: [2024-01-02 22:29:54,263][model8_pretrain.py][INFO] Epoch:[0/2](46200/4588595) loss:3.433 lr:0.0001224 epoch_Time:28523.0min: [2024-01-02 22:29:54,263][model8_pretrain.py][INFO] Epoch:[0/2](46200/4588595) loss:2.968 lr:0.0001224 epoch_Time:28523.0min: [2024-01-02 22:29:54,263][model8_pretrain.py][INFO] Epoch:[0/2](46200/4588595) loss:3.047 lr:0.0001224 epoch_Time:28523.0min: [2024-01-02 22:30:32,877][model8_pretrain.py][INFO] Epoch:[0/2](46300/4588595) loss:3.586 lr:0.0001219 epoch_Time:28524.0min: [2024-01-02 22:30:32,877][model8_pretrain.py][INFO] Epoch:[0/2](46300/4588595) loss:3.597 lr:0.0001219 epoch_Time:28524.0min: [2024-01-02 
22:30:32,877][model8_pretrain.py][INFO] Epoch:[0/2](46300/4588595) loss:3.551 lr:0.0001219 epoch_Time:28524.0min: [2024-01-02 22:30:32,877][model8_pretrain.py][INFO] Epoch:[0/2](46300/4588595) loss:3.218 lr:0.0001219 epoch_Time:28524.0min: [2024-01-02 22:30:32,877][model8_pretrain.py][INFO] Epoch:[0/2](46300/4588595) loss:3.594 lr:0.0001219 epoch_Time:28524.0min: [2024-01-02 22:30:32,877][model8_pretrain.py][INFO] Epoch:[0/2](46300/4588595) loss:3.318 lr:0.0001219 epoch_Time:28524.0min: [2024-01-02 22:30:32,877][model8_pretrain.py][INFO] Epoch:[0/2](46300/4588595) loss:3.155 lr:0.0001219 epoch_Time:28524.0min: [2024-01-02 22:30:32,877][model8_pretrain.py][INFO] Epoch:[0/2](46300/4588595) loss:3.339 lr:0.0001219 epoch_Time:28524.0min: [2024-01-02 22:31:09,831][model8_pretrain.py][INFO] Epoch:[0/2](46400/4588595) loss:2.998 lr:0.0001213 epoch_Time:28522.0min: [2024-01-02 22:31:09,831][model8_pretrain.py][INFO] Epoch:[0/2](46400/4588595) loss:3.593 lr:0.0001213 epoch_Time:28522.0min: [2024-01-02 22:31:09,831][model8_pretrain.py][INFO] Epoch:[0/2](46400/4588595) loss:3.163 lr:0.0001213 epoch_Time:28522.0min: [2024-01-02 22:31:09,831][model8_pretrain.py][INFO] Epoch:[0/2](46400/4588595) loss:3.225 lr:0.0001213 epoch_Time:28522.0min: [2024-01-02 22:31:09,831][model8_pretrain.py][INFO] Epoch:[0/2](46400/4588595) loss:3.478 lr:0.0001213 epoch_Time:28522.0min: [2024-01-02 22:31:09,831][model8_pretrain.py][INFO] Epoch:[0/2](46400/4588595) loss:3.363 lr:0.0001213 epoch_Time:28522.0min: [2024-01-02 22:31:09,831][model8_pretrain.py][INFO] Epoch:[0/2](46400/4588595) loss:3.659 lr:0.0001213 epoch_Time:28522.0min: [2024-01-02 22:31:09,831][model8_pretrain.py][INFO] Epoch:[0/2](46400/4588595) loss:3.140 lr:0.0001213 epoch_Time:28522.0min: [2024-01-02 22:31:46,772][model8_pretrain.py][INFO] Epoch:[0/2](46500/4588595) loss:3.241 lr:0.0001207 epoch_Time:28521.0min: [2024-01-02 22:31:46,772][model8_pretrain.py][INFO] Epoch:[0/2](46500/4588595) loss:3.355 lr:0.0001207 
epoch_Time:28521.0min: [2024-01-02 22:31:46,772][model8_pretrain.py][INFO] Epoch:[0/2](46500/4588595) loss:3.228 lr:0.0001207 epoch_Time:28521.0min: [2024-01-02 22:31:46,772][model8_pretrain.py][INFO] Epoch:[0/2](46500/4588595) loss:3.103 lr:0.0001207 epoch_Time:28521.0min: [2024-01-02 22:31:46,772][model8_pretrain.py][INFO] Epoch:[0/2](46500/4588595) loss:3.114 lr:0.0001207 epoch_Time:28521.0min: [2024-01-02 22:31:46,772][model8_pretrain.py][INFO] Epoch:[0/2](46500/4588595) loss:3.527 lr:0.0001207 epoch_Time:28521.0min: [2024-01-02 22:31:46,772][model8_pretrain.py][INFO] Epoch:[0/2](46500/4588595) loss:3.106 lr:0.0001207 epoch_Time:28521.0min: [2024-01-02 22:31:46,772][model8_pretrain.py][INFO] Epoch:[0/2](46500/4588595) loss:3.462 lr:0.0001207 epoch_Time:28521.0min: [2024-01-02 22:32:23,735][model8_pretrain.py][INFO] Epoch:[0/2](46600/4588595) loss:3.628 lr:0.0001202 epoch_Time:28519.0min: [2024-01-02 22:32:23,735][model8_pretrain.py][INFO] Epoch:[0/2](46600/4588595) loss:2.772 lr:0.0001202 epoch_Time:28519.0min: [2024-01-02 22:32:23,735][model8_pretrain.py][INFO] Epoch:[0/2](46600/4588595) loss:3.409 lr:0.0001202 epoch_Time:28519.0min: [2024-01-02 22:32:23,735][model8_pretrain.py][INFO] Epoch:[0/2](46600/4588595) loss:3.411 lr:0.0001202 epoch_Time:28519.0min: [2024-01-02 22:32:23,735][model8_pretrain.py][INFO] Epoch:[0/2](46600/4588595) loss:3.225 lr:0.0001202 epoch_Time:28519.0min: [2024-01-02 22:32:23,735][model8_pretrain.py][INFO] Epoch:[0/2](46600/4588595) loss:3.640 lr:0.0001202 epoch_Time:28519.0min: [2024-01-02 22:32:23,735][model8_pretrain.py][INFO] Epoch:[0/2](46600/4588595) loss:3.409 lr:0.0001202 epoch_Time:28519.0min: [2024-01-02 22:32:23,736][model8_pretrain.py][INFO] Epoch:[0/2](46600/4588595) loss:3.593 lr:0.0001202 epoch_Time:28519.0min: [2024-01-02 22:33:00,691][model8_pretrain.py][INFO] Epoch:[0/2](46700/4588595) loss:2.826 lr:0.0001196 epoch_Time:28517.0min: [2024-01-02 22:33:00,691][model8_pretrain.py][INFO] Epoch:[0/2](46700/4588595) 
loss:3.511 lr:0.0001196 epoch_Time:28517.0min: [2024-01-02 22:33:00,691][model8_pretrain.py][INFO] Epoch:[0/2](46700/4588595) loss:3.675 lr:0.0001196 epoch_Time:28517.0min: [2024-01-02 22:33:00,691][model8_pretrain.py][INFO] Epoch:[0/2](46700/4588595) loss:3.117 lr:0.0001196 epoch_Time:28517.0min: [2024-01-02 22:33:00,691][model8_pretrain.py][INFO] Epoch:[0/2](46700/4588595) loss:2.954 lr:0.0001196 epoch_Time:28517.0min: [2024-01-02 22:33:00,691][model8_pretrain.py][INFO] Epoch:[0/2](46700/4588595) loss:3.587 lr:0.0001196 epoch_Time:28517.0min: [2024-01-02 22:33:00,691][model8_pretrain.py][INFO] Epoch:[0/2](46700/4588595) loss:3.230 lr:0.0001196 epoch_Time:28517.0min: [2024-01-02 22:33:00,691][model8_pretrain.py][INFO] Epoch:[0/2](46700/4588595) loss:3.717 lr:0.0001196 epoch_Time:28517.0min: [2024-01-02 22:33:37,636][model8_pretrain.py][INFO] Epoch:[0/2](46800/4588595) loss:3.555 lr:0.0001191 epoch_Time:28515.0min: [2024-01-02 22:33:37,636][model8_pretrain.py][INFO] Epoch:[0/2](46800/4588595) loss:3.336 lr:0.0001191 epoch_Time:28515.0min: [2024-01-02 22:33:37,636][model8_pretrain.py][INFO] Epoch:[0/2](46800/4588595) loss:3.243 lr:0.0001191 epoch_Time:28515.0min: [2024-01-02 22:33:37,636][model8_pretrain.py][INFO] Epoch:[0/2](46800/4588595) loss:3.175 lr:0.0001191 epoch_Time:28515.0min: [2024-01-02 22:33:37,636][model8_pretrain.py][INFO] Epoch:[0/2](46800/4588595) loss:3.459 lr:0.0001191 epoch_Time:28515.0min: [2024-01-02 22:33:37,636][model8_pretrain.py][INFO] Epoch:[0/2](46800/4588595) loss:3.419 lr:0.0001191 epoch_Time:28515.0min: [2024-01-02 22:33:37,636][model8_pretrain.py][INFO] Epoch:[0/2](46800/4588595) loss:2.801 lr:0.0001191 epoch_Time:28515.0min: [2024-01-02 22:33:37,636][model8_pretrain.py][INFO] Epoch:[0/2](46800/4588595) loss:3.282 lr:0.0001191 epoch_Time:28515.0min: [2024-01-02 22:34:14,595][model8_pretrain.py][INFO] Epoch:[0/2](46900/4588595) loss:3.539 lr:0.0001185 epoch_Time:28513.0min: [2024-01-02 22:34:14,595][model8_pretrain.py][INFO] 
Epoch:[0/2](46900/4588595) loss:3.273 lr:0.0001185 epoch_Time:28513.0min: [2024-01-02 22:34:14,595][model8_pretrain.py][INFO] Epoch:[0/2](46900/4588595) loss:3.191 lr:0.0001185 epoch_Time:28513.0min: [2024-01-02 22:34:14,595][model8_pretrain.py][INFO] Epoch:[0/2](46900/4588595) loss:3.075 lr:0.0001185 epoch_Time:28513.0min: [2024-01-02 22:34:14,595][model8_pretrain.py][INFO] Epoch:[0/2](46900/4588595) loss:3.510 lr:0.0001185 epoch_Time:28513.0min: [2024-01-02 22:34:14,595][model8_pretrain.py][INFO] Epoch:[0/2](46900/4588595) loss:3.081 lr:0.0001185 epoch_Time:28513.0min: [2024-01-02 22:34:14,595][model8_pretrain.py][INFO] Epoch:[0/2](46900/4588595) loss:3.501 lr:0.0001185 epoch_Time:28513.0min: [2024-01-02 22:34:14,595][model8_pretrain.py][INFO] Epoch:[0/2](46900/4588595) loss:3.590 lr:0.0001185 epoch_Time:28513.0min: [2024-01-02 22:34:53,252][model8_pretrain.py][INFO] Epoch:[0/2](47000/4588595) loss:3.131 lr:0.0001179 epoch_Time:28514.0min: [2024-01-02 22:34:53,252][model8_pretrain.py][INFO] Epoch:[0/2](47000/4588595) loss:2.943 lr:0.0001179 epoch_Time:28514.0min: [2024-01-02 22:34:53,252][model8_pretrain.py][INFO] Epoch:[0/2](47000/4588595) loss:3.172 lr:0.0001179 epoch_Time:28514.0min: [2024-01-02 22:34:53,252][model8_pretrain.py][INFO] Epoch:[0/2](47000/4588595) loss:3.163 lr:0.0001179 epoch_Time:28514.0min: [2024-01-02 22:34:53,252][model8_pretrain.py][INFO] Epoch:[0/2](47000/4588595) loss:3.246 lr:0.0001179 epoch_Time:28514.0min: [2024-01-02 22:34:53,252][model8_pretrain.py][INFO] Epoch:[0/2](47000/4588595) loss:3.059 lr:0.0001179 epoch_Time:28514.0min: [2024-01-02 22:34:53,252][model8_pretrain.py][INFO] Epoch:[0/2](47000/4588595) loss:2.766 lr:0.0001179 epoch_Time:28514.0min: [2024-01-02 22:34:53,252][model8_pretrain.py][INFO] Epoch:[0/2](47000/4588595) loss:2.781 lr:0.0001179 epoch_Time:28514.0min: [2024-01-02 22:35:35,511][model8_pretrain.py][INFO] Epoch:[0/2](47100/4588595) loss:3.347 lr:0.0001174 epoch_Time:28521.0min: [2024-01-02 
22:35:35,511][model8_pretrain.py][INFO] Epoch:[0/2](47100/4588595) loss:3.124 lr:0.0001174 epoch_Time:28521.0min: [2024-01-02 22:35:35,511][model8_pretrain.py][INFO] Epoch:[0/2](47100/4588595) loss:2.732 lr:0.0001174 epoch_Time:28521.0min: [2024-01-02 22:35:35,511][model8_pretrain.py][INFO] Epoch:[0/2](47100/4588595) loss:3.329 lr:0.0001174 epoch_Time:28521.0min: [2024-01-02 22:35:35,511][model8_pretrain.py][INFO] Epoch:[0/2](47100/4588595) loss:2.840 lr:0.0001174 epoch_Time:28521.0min: [2024-01-02 22:35:35,511][model8_pretrain.py][INFO] Epoch:[0/2](47100/4588595) loss:3.330 lr:0.0001174 epoch_Time:28521.0min: [2024-01-02 22:35:35,511][model8_pretrain.py][INFO] Epoch:[0/2](47100/4588595) loss:3.178 lr:0.0001174 epoch_Time:28521.0min: [2024-01-02 22:35:35,512][model8_pretrain.py][INFO] Epoch:[0/2](47100/4588595) loss:2.554 lr:0.0001174 epoch_Time:28521.0min: [2024-01-02 22:36:12,460][model8_pretrain.py][INFO] Epoch:[0/2](47200/4588595) loss:2.993 lr:0.0001168 epoch_Time:28519.0min: [2024-01-02 22:36:12,460][model8_pretrain.py][INFO] Epoch:[0/2](47200/4588595) loss:3.270 lr:0.0001168 epoch_Time:28519.0min: [2024-01-02 22:36:12,460][model8_pretrain.py][INFO] Epoch:[0/2](47200/4588595) loss:3.351 lr:0.0001168 epoch_Time:28519.0min: [2024-01-02 22:36:12,460][model8_pretrain.py][INFO] Epoch:[0/2](47200/4588595) loss:3.459 lr:0.0001168 epoch_Time:28519.0min: [2024-01-02 22:36:12,460][model8_pretrain.py][INFO] Epoch:[0/2](47200/4588595) loss:3.708 lr:0.0001168 epoch_Time:28519.0min: [2024-01-02 22:36:12,461][model8_pretrain.py][INFO] Epoch:[0/2](47200/4588595) loss:3.420 lr:0.0001168 epoch_Time:28519.0min: [2024-01-02 22:36:12,461][model8_pretrain.py][INFO] Epoch:[0/2](47200/4588595) loss:3.234 lr:0.0001168 epoch_Time:28519.0min: [2024-01-02 22:36:12,461][model8_pretrain.py][INFO] Epoch:[0/2](47200/4588595) loss:3.990 lr:0.0001168 epoch_Time:28519.0min: [2024-01-02 22:36:49,392][model8_pretrain.py][INFO] Epoch:[0/2](47300/4588595) loss:3.133 lr:0.0001163 
epoch_Time:28517.0min: [2024-01-02 22:36:49,392][model8_pretrain.py][INFO] Epoch:[0/2](47300/4588595) loss:3.214 lr:0.0001163 epoch_Time:28517.0min: [2024-01-02 22:36:49,392][model8_pretrain.py][INFO] Epoch:[0/2](47300/4588595) loss:2.805 lr:0.0001163 epoch_Time:28517.0min: [2024-01-02 22:36:49,392][model8_pretrain.py][INFO] Epoch:[0/2](47300/4588595) loss:3.443 lr:0.0001163 epoch_Time:28517.0min: [2024-01-02 22:36:49,392][model8_pretrain.py][INFO] Epoch:[0/2](47300/4588595) loss:3.655 lr:0.0001163 epoch_Time:28517.0min: [2024-01-02 22:36:49,392][model8_pretrain.py][INFO] Epoch:[0/2](47300/4588595) loss:3.238 lr:0.0001163 epoch_Time:28517.0min: [2024-01-02 22:36:49,392][model8_pretrain.py][INFO] Epoch:[0/2](47300/4588595) loss:3.100 lr:0.0001163 epoch_Time:28517.0min: [2024-01-02 22:36:49,392][model8_pretrain.py][INFO] Epoch:[0/2](47300/4588595) loss:2.895 lr:0.0001163 epoch_Time:28517.0min: [2024-01-02 22:37:26,327][model8_pretrain.py][INFO] Epoch:[0/2](47400/4588595) loss:2.788 lr:0.0001157 epoch_Time:28516.0min: [2024-01-02 22:37:26,327][model8_pretrain.py][INFO] Epoch:[0/2](47400/4588595) loss:3.218 lr:0.0001157 epoch_Time:28516.0min: [2024-01-02 22:37:26,327][model8_pretrain.py][INFO] Epoch:[0/2](47400/4588595) loss:2.544 lr:0.0001157 epoch_Time:28516.0min: [2024-01-02 22:37:26,327][model8_pretrain.py][INFO] Epoch:[0/2](47400/4588595) loss:3.161 lr:0.0001157 epoch_Time:28516.0min: [2024-01-02 22:37:26,327][model8_pretrain.py][INFO] Epoch:[0/2](47400/4588595) loss:3.478 lr:0.0001157 epoch_Time:28516.0min: [2024-01-02 22:37:26,327][model8_pretrain.py][INFO] Epoch:[0/2](47400/4588595) loss:3.378 lr:0.0001157 epoch_Time:28516.0min: [2024-01-02 22:37:26,327][model8_pretrain.py][INFO] Epoch:[0/2](47400/4588595) loss:3.629 lr:0.0001157 epoch_Time:28516.0min: [2024-01-02 22:37:26,327][model8_pretrain.py][INFO] Epoch:[0/2](47400/4588595) loss:3.395 lr:0.0001157 epoch_Time:28516.0min: [2024-01-02 22:38:03,277][model8_pretrain.py][INFO] Epoch:[0/2](47500/4588595) 
loss:2.928 lr:0.0001152 epoch_Time:28514.0min: [2024-01-02 22:38:03,277][model8_pretrain.py][INFO] Epoch:[0/2](47500/4588595) loss:2.937 lr:0.0001152 epoch_Time:28514.0min: [2024-01-02 22:38:03,277][model8_pretrain.py][INFO] Epoch:[0/2](47500/4588595) loss:3.612 lr:0.0001152 epoch_Time:28514.0min: [2024-01-02 22:38:03,277][model8_pretrain.py][INFO] Epoch:[0/2](47500/4588595) loss:3.265 lr:0.0001152 epoch_Time:28514.0min: [2024-01-02 22:38:03,277][model8_pretrain.py][INFO] Epoch:[0/2](47500/4588595) loss:3.279 lr:0.0001152 epoch_Time:28514.0min: [2024-01-02 22:38:03,277][model8_pretrain.py][INFO] Epoch:[0/2](47500/4588595) loss:3.492 lr:0.0001152 epoch_Time:28514.0min: [2024-01-02 22:38:03,277][model8_pretrain.py][INFO] Epoch:[0/2](47500/4588595) loss:3.492 lr:0.0001152 epoch_Time:28514.0min: [2024-01-02 22:38:03,277][model8_pretrain.py][INFO] Epoch:[0/2](47500/4588595) loss:2.885 lr:0.0001152 epoch_Time:28514.0min: [2024-01-02 22:38:40,229][model8_pretrain.py][INFO] Epoch:[0/2](47600/4588595) loss:3.069 lr:0.0001146 epoch_Time:28512.0min: [2024-01-02 22:38:40,229][model8_pretrain.py][INFO] Epoch:[0/2](47600/4588595) loss:3.162 lr:0.0001146 epoch_Time:28512.0min: [2024-01-02 22:38:40,229][model8_pretrain.py][INFO] Epoch:[0/2](47600/4588595) loss:3.732 lr:0.0001146 epoch_Time:28512.0min: [2024-01-02 22:38:40,229][model8_pretrain.py][INFO] Epoch:[0/2](47600/4588595) loss:3.135 lr:0.0001146 epoch_Time:28512.0min: [2024-01-02 22:38:40,229][model8_pretrain.py][INFO] Epoch:[0/2](47600/4588595) loss:3.490 lr:0.0001146 epoch_Time:28512.0min: [2024-01-02 22:38:40,229][model8_pretrain.py][INFO] Epoch:[0/2](47600/4588595) loss:3.344 lr:0.0001146 epoch_Time:28512.0min: [2024-01-02 22:38:40,229][model8_pretrain.py][INFO] Epoch:[0/2](47600/4588595) loss:3.609 lr:0.0001146 epoch_Time:28512.0min: [2024-01-02 22:38:40,230][model8_pretrain.py][INFO] Epoch:[0/2](47600/4588595) loss:3.507 lr:0.0001146 epoch_Time:28512.0min: [2024-01-02 22:39:17,173][model8_pretrain.py][INFO] 
Epoch:[0/2](47700/4588595) loss:2.833 lr:0.0001140 epoch_Time:28510.0min: [2024-01-02 22:39:17,173][model8_pretrain.py][INFO] Epoch:[0/2](47700/4588595) loss:3.362 lr:0.0001140 epoch_Time:28510.0min: [2024-01-02 22:39:17,173][model8_pretrain.py][INFO] Epoch:[0/2](47700/4588595) loss:2.732 lr:0.0001140 epoch_Time:28510.0min: [2024-01-02 22:39:17,173][model8_pretrain.py][INFO] Epoch:[0/2](47700/4588595) loss:3.050 lr:0.0001140 epoch_Time:28510.0min: [2024-01-02 22:39:17,173][model8_pretrain.py][INFO] Epoch:[0/2](47700/4588595) loss:3.034 lr:0.0001140 epoch_Time:28510.0min: [2024-01-02 22:39:17,173][model8_pretrain.py][INFO] Epoch:[0/2](47700/4588595) loss:3.322 lr:0.0001140 epoch_Time:28510.0min: [2024-01-02 22:39:17,173][model8_pretrain.py][INFO] Epoch:[0/2](47700/4588595) loss:3.381 lr:0.0001140 epoch_Time:28510.0min: [2024-01-02 22:39:17,173][model8_pretrain.py][INFO] Epoch:[0/2](47700/4588595) loss:3.482 lr:0.0001140 epoch_Time:28510.0min: [2024-01-02 22:39:55,844][model8_pretrain.py][INFO] Epoch:[0/2](47800/4588595) loss:3.132 lr:0.0001135 epoch_Time:28511.0min: [2024-01-02 22:39:55,844][model8_pretrain.py][INFO] Epoch:[0/2](47800/4588595) loss:2.946 lr:0.0001135 epoch_Time:28511.0min: [2024-01-02 22:39:55,844][model8_pretrain.py][INFO] Epoch:[0/2](47800/4588595) loss:3.214 lr:0.0001135 epoch_Time:28511.0min: [2024-01-02 22:39:55,844][model8_pretrain.py][INFO] Epoch:[0/2](47800/4588595) loss:2.704 lr:0.0001135 epoch_Time:28511.0min: [2024-01-02 22:39:55,844][model8_pretrain.py][INFO] Epoch:[0/2](47800/4588595) loss:3.202 lr:0.0001135 epoch_Time:28511.0min: [2024-01-02 22:39:55,844][model8_pretrain.py][INFO] Epoch:[0/2](47800/4588595) loss:3.149 lr:0.0001135 epoch_Time:28511.0min: [2024-01-02 22:39:55,844][model8_pretrain.py][INFO] Epoch:[0/2](47800/4588595) loss:3.265 lr:0.0001135 epoch_Time:28511.0min: [2024-01-02 22:39:55,844][model8_pretrain.py][INFO] Epoch:[0/2](47800/4588595) loss:3.182 lr:0.0001135 epoch_Time:28511.0min: [2024-01-02 
22:40:38,012][model8_pretrain.py][INFO] Epoch:[0/2](47900/4588595) loss:3.042 lr:0.0001129 epoch_Time:28518.0min: [2024-01-02 22:40:38,012][model8_pretrain.py][INFO] Epoch:[0/2](47900/4588595) loss:3.873 lr:0.0001129 epoch_Time:28518.0min: [2024-01-02 22:40:38,012][model8_pretrain.py][INFO] Epoch:[0/2](47900/4588595) loss:2.843 lr:0.0001129 epoch_Time:28518.0min: [2024-01-02 22:40:38,012][model8_pretrain.py][INFO] Epoch:[0/2](47900/4588595) loss:3.141 lr:0.0001129 epoch_Time:28518.0min: [2024-01-02 22:40:38,012][model8_pretrain.py][INFO] Epoch:[0/2](47900/4588595) loss:3.118 lr:0.0001129 epoch_Time:28518.0min: [2024-01-02 22:40:38,012][model8_pretrain.py][INFO] Epoch:[0/2](47900/4588595) loss:2.989 lr:0.0001129 epoch_Time:28518.0min: [2024-01-02 22:40:38,012][model8_pretrain.py][INFO] Epoch:[0/2](47900/4588595) loss:2.804 lr:0.0001129 epoch_Time:28518.0min: [2024-01-02 22:40:38,012][model8_pretrain.py][INFO] Epoch:[0/2](47900/4588595) loss:2.989 lr:0.0001129 epoch_Time:28518.0min: [2024-01-02 22:41:14,945][model8_pretrain.py][INFO] Epoch:[0/2](48000/4588595) loss:2.763 lr:0.0001124 epoch_Time:28516.0min: [2024-01-02 22:41:14,945][model8_pretrain.py][INFO] Epoch:[0/2](48000/4588595) loss:2.706 lr:0.0001124 epoch_Time:28516.0min: [2024-01-02 22:41:14,945][model8_pretrain.py][INFO] Epoch:[0/2](48000/4588595) loss:3.177 lr:0.0001124 epoch_Time:28516.0min: [2024-01-02 22:41:14,945][model8_pretrain.py][INFO] Epoch:[0/2](48000/4588595) loss:3.397 lr:0.0001124 epoch_Time:28516.0min: [2024-01-02 22:41:14,945][model8_pretrain.py][INFO] Epoch:[0/2](48000/4588595) loss:3.485 lr:0.0001124 epoch_Time:28516.0min: [2024-01-02 22:41:14,945][model8_pretrain.py][INFO] Epoch:[0/2](48000/4588595) loss:3.516 lr:0.0001124 epoch_Time:28516.0min: [2024-01-02 22:41:14,945][model8_pretrain.py][INFO] Epoch:[0/2](48000/4588595) loss:3.440 lr:0.0001124 epoch_Time:28516.0min: [2024-01-02 22:41:14,945][model8_pretrain.py][INFO] Epoch:[0/2](48000/4588595) loss:2.953 lr:0.0001124 
epoch_Time:28516.0min: [2024-01-02 22:41:51,884][model8_pretrain.py][INFO] Epoch:[0/2](48100/4588595) loss:2.793 lr:0.0001118 epoch_Time:28514.0min: [2024-01-02 22:41:51,884][model8_pretrain.py][INFO] Epoch:[0/2](48100/4588595) loss:3.521 lr:0.0001118 epoch_Time:28514.0min: [2024-01-02 22:41:51,884][model8_pretrain.py][INFO] Epoch:[0/2](48100/4588595) loss:3.014 lr:0.0001118 epoch_Time:28514.0min: [2024-01-02 22:41:51,884][model8_pretrain.py][INFO] Epoch:[0/2](48100/4588595) loss:3.372 lr:0.0001118 epoch_Time:28514.0min: [2024-01-02 22:41:51,884][model8_pretrain.py][INFO] Epoch:[0/2](48100/4588595) loss:2.946 lr:0.0001118 epoch_Time:28514.0min: [2024-01-02 22:41:51,884][model8_pretrain.py][INFO] Epoch:[0/2](48100/4588595) loss:3.408 lr:0.0001118 epoch_Time:28514.0min: [2024-01-02 22:41:51,884][model8_pretrain.py][INFO] Epoch:[0/2](48100/4588595) loss:3.360 lr:0.0001118 epoch_Time:28514.0min: [2024-01-02 22:41:51,884][model8_pretrain.py][INFO] Epoch:[0/2](48100/4588595) loss:3.515 lr:0.0001118 epoch_Time:28514.0min: [2024-01-02 22:42:28,819][model8_pretrain.py][INFO] Epoch:[0/2](48200/4588595) loss:3.689 lr:0.0001113 epoch_Time:28512.0min: [2024-01-02 22:42:28,819][model8_pretrain.py][INFO] Epoch:[0/2](48200/4588595) loss:2.762 lr:0.0001113 epoch_Time:28512.0min: [2024-01-02 22:42:28,819][model8_pretrain.py][INFO] Epoch:[0/2](48200/4588595) loss:3.408 lr:0.0001113 epoch_Time:28512.0min: [2024-01-02 22:42:28,819][model8_pretrain.py][INFO] Epoch:[0/2](48200/4588595) loss:3.490 lr:0.0001113 epoch_Time:28512.0min: [2024-01-02 22:42:28,819][model8_pretrain.py][INFO] Epoch:[0/2](48200/4588595) loss:2.325 lr:0.0001113 epoch_Time:28512.0min: [2024-01-02 22:42:28,819][model8_pretrain.py][INFO] Epoch:[0/2](48200/4588595) loss:3.033 lr:0.0001113 epoch_Time:28512.0min: [2024-01-02 22:42:28,819][model8_pretrain.py][INFO] Epoch:[0/2](48200/4588595) loss:3.660 lr:0.0001113 epoch_Time:28512.0min: [2024-01-02 22:42:28,819][model8_pretrain.py][INFO] Epoch:[0/2](48200/4588595) 
loss:2.717 lr:0.0001113 epoch_Time:28512.0min: [2024-01-02 22:43:05,770][model8_pretrain.py][INFO] Epoch:[0/2](48300/4588595) loss:3.051 lr:0.0001107 epoch_Time:28510.0min: [2024-01-02 22:43:05,770][model8_pretrain.py][INFO] Epoch:[0/2](48300/4588595) loss:3.537 lr:0.0001107 epoch_Time:28510.0min: [2024-01-02 22:43:05,770][model8_pretrain.py][INFO] Epoch:[0/2](48300/4588595) loss:3.297 lr:0.0001107 epoch_Time:28510.0min: [2024-01-02 22:43:05,770][model8_pretrain.py][INFO] Epoch:[0/2](48300/4588595) loss:3.789 lr:0.0001107 epoch_Time:28510.0min: [2024-01-02 22:43:05,770][model8_pretrain.py][INFO] Epoch:[0/2](48300/4588595) loss:2.958 lr:0.0001107 epoch_Time:28510.0min: [2024-01-02 22:43:05,770][model8_pretrain.py][INFO] Epoch:[0/2](48300/4588595) loss:3.474 lr:0.0001107 epoch_Time:28510.0min: [2024-01-02 22:43:05,770][model8_pretrain.py][INFO] Epoch:[0/2](48300/4588595) loss:3.226 lr:0.0001107 epoch_Time:28510.0min: [2024-01-02 22:43:05,770][model8_pretrain.py][INFO] Epoch:[0/2](48300/4588595) loss:3.530 lr:0.0001107 epoch_Time:28510.0min: [2024-01-02 22:43:42,714][model8_pretrain.py][INFO] Epoch:[0/2](48400/4588595) loss:3.233 lr:0.0001102 epoch_Time:28509.0min: [2024-01-02 22:43:42,714][model8_pretrain.py][INFO] Epoch:[0/2](48400/4588595) loss:3.254 lr:0.0001102 epoch_Time:28509.0min: [2024-01-02 22:43:42,714][model8_pretrain.py][INFO] Epoch:[0/2](48400/4588595) loss:3.559 lr:0.0001102 epoch_Time:28509.0min: [2024-01-02 22:43:42,714][model8_pretrain.py][INFO] Epoch:[0/2](48400/4588595) loss:3.457 lr:0.0001102 epoch_Time:28509.0min: [2024-01-02 22:43:42,714][model8_pretrain.py][INFO] Epoch:[0/2](48400/4588595) loss:3.044 lr:0.0001102 epoch_Time:28509.0min: [2024-01-02 22:43:42,714][model8_pretrain.py][INFO] Epoch:[0/2](48400/4588595) loss:2.810 lr:0.0001102 epoch_Time:28509.0min: [2024-01-02 22:43:42,714][model8_pretrain.py][INFO] Epoch:[0/2](48400/4588595) loss:3.083 lr:0.0001102 epoch_Time:28509.0min: [2024-01-02 22:43:42,714][model8_pretrain.py][INFO] 
Epoch:[0/2](48400/4588595) loss:3.303 lr:0.0001102 epoch_Time:28509.0min: [2024-01-02 22:44:19,708][model8_pretrain.py][INFO] Epoch:[0/2](48500/4588595) loss:2.789 lr:0.0001096 epoch_Time:28507.0min: [2024-01-02 22:44:19,708][model8_pretrain.py][INFO] Epoch:[0/2](48500/4588595) loss:3.008 lr:0.0001096 epoch_Time:28507.0min: [2024-01-02 22:44:19,708][model8_pretrain.py][INFO] Epoch:[0/2](48500/4588595) loss:3.037 lr:0.0001096 epoch_Time:28507.0min: [2024-01-02 22:44:19,708][model8_pretrain.py][INFO] Epoch:[0/2](48500/4588595) loss:3.274 lr:0.0001096 epoch_Time:28507.0min: [2024-01-02 22:44:19,708][model8_pretrain.py][INFO] Epoch:[0/2](48500/4588595) loss:3.506 lr:0.0001096 epoch_Time:28507.0min: [2024-01-02 22:44:19,708][model8_pretrain.py][INFO] Epoch:[0/2](48500/4588595) loss:3.515 lr:0.0001096 epoch_Time:28507.0min: [2024-01-02 22:44:19,708][model8_pretrain.py][INFO] Epoch:[0/2](48500/4588595) loss:3.415 lr:0.0001096 epoch_Time:28507.0min: [2024-01-02 22:44:19,710][model8_pretrain.py][INFO] Epoch:[0/2](48500/4588595) loss:2.666 lr:0.0001096 epoch_Time:28507.0min: [2024-01-02 22:44:58,496][model8_pretrain.py][INFO] Epoch:[0/2](48600/4588595) loss:3.458 lr:0.0001091 epoch_Time:28508.0min: [2024-01-02 22:44:58,496][model8_pretrain.py][INFO] Epoch:[0/2](48600/4588595) loss:2.917 lr:0.0001091 epoch_Time:28508.0min: [2024-01-02 22:44:58,496][model8_pretrain.py][INFO] Epoch:[0/2](48600/4588595) loss:3.716 lr:0.0001091 epoch_Time:28508.0min: [2024-01-02 22:44:58,496][model8_pretrain.py][INFO] Epoch:[0/2](48600/4588595) loss:3.139 lr:0.0001091 epoch_Time:28508.0min: [2024-01-02 22:44:58,496][model8_pretrain.py][INFO] Epoch:[0/2](48600/4588595) loss:3.259 lr:0.0001091 epoch_Time:28508.0min: [2024-01-02 22:44:58,496][model8_pretrain.py][INFO] Epoch:[0/2](48600/4588595) loss:3.197 lr:0.0001091 epoch_Time:28508.0min: [2024-01-02 22:44:58,496][model8_pretrain.py][INFO] Epoch:[0/2](48600/4588595) loss:3.542 lr:0.0001091 epoch_Time:28508.0min: [2024-01-02 
22:44:58,496][model8_pretrain.py][INFO] Epoch:[0/2](48600/4588595) loss:3.125 lr:0.0001091 epoch_Time:28508.0min: [2024-01-02 22:45:40,675][model8_pretrain.py][INFO] Epoch:[0/2](48700/4588595) loss:2.979 lr:0.0001086 epoch_Time:28515.0min: [2024-01-02 22:45:40,675][model8_pretrain.py][INFO] Epoch:[0/2](48700/4588595) loss:3.322 lr:0.0001086 epoch_Time:28515.0min: [2024-01-02 22:45:40,675][model8_pretrain.py][INFO] Epoch:[0/2](48700/4588595) loss:3.226 lr:0.0001086 epoch_Time:28515.0min: [2024-01-02 22:45:40,675][model8_pretrain.py][INFO] Epoch:[0/2](48700/4588595) loss:3.008 lr:0.0001086 epoch_Time:28515.0min: [2024-01-02 22:45:40,675][model8_pretrain.py][INFO] Epoch:[0/2](48700/4588595) loss:3.174 lr:0.0001086 epoch_Time:28515.0min: [2024-01-02 22:45:40,675][model8_pretrain.py][INFO] Epoch:[0/2](48700/4588595) loss:3.159 lr:0.0001086 epoch_Time:28515.0min: [2024-01-02 22:45:40,675][model8_pretrain.py][INFO] Epoch:[0/2](48700/4588595) loss:2.612 lr:0.0001086 epoch_Time:28515.0min: [2024-01-02 22:45:40,675][model8_pretrain.py][INFO] Epoch:[0/2](48700/4588595) loss:2.913 lr:0.0001086 epoch_Time:28515.0min: [2024-01-02 22:46:17,620][model8_pretrain.py][INFO] Epoch:[0/2](48800/4588595) loss:3.083 lr:0.0001080 epoch_Time:28513.0min: [2024-01-02 22:46:17,620][model8_pretrain.py][INFO] Epoch:[0/2](48800/4588595) loss:3.014 lr:0.0001080 epoch_Time:28513.0min: [2024-01-02 22:46:17,620][model8_pretrain.py][INFO] Epoch:[0/2](48800/4588595) loss:3.154 lr:0.0001080 epoch_Time:28513.0min: [2024-01-02 22:46:17,620][model8_pretrain.py][INFO] Epoch:[0/2](48800/4588595) loss:3.206 lr:0.0001080 epoch_Time:28513.0min: [2024-01-02 22:46:17,620][model8_pretrain.py][INFO] Epoch:[0/2](48800/4588595) loss:3.365 lr:0.0001080 epoch_Time:28513.0min: [2024-01-02 22:46:17,620][model8_pretrain.py][INFO] Epoch:[0/2](48800/4588595) loss:3.319 lr:0.0001080 epoch_Time:28513.0min: [2024-01-02 22:46:17,620][model8_pretrain.py][INFO] Epoch:[0/2](48800/4588595) loss:3.223 lr:0.0001080 
epoch_Time:28513.0min: [2024-01-02 22:46:17,620][model8_pretrain.py][INFO] Epoch:[0/2](48800/4588595) loss:3.443 lr:0.0001080 epoch_Time:28513.0min: [2024-01-02 22:46:54,562][model8_pretrain.py][INFO] Epoch:[0/2](48900/4588595) loss:3.192 lr:0.0001075 epoch_Time:28511.0min: [2024-01-02 22:46:54,562][model8_pretrain.py][INFO] Epoch:[0/2](48900/4588595) loss:3.136 lr:0.0001075 epoch_Time:28511.0min: [2024-01-02 22:46:54,562][model8_pretrain.py][INFO] Epoch:[0/2](48900/4588595) loss:3.744 lr:0.0001075 epoch_Time:28511.0min: [2024-01-02 22:46:54,562][model8_pretrain.py][INFO] Epoch:[0/2](48900/4588595) loss:3.536 lr:0.0001075 epoch_Time:28511.0min: [2024-01-02 22:46:54,562][model8_pretrain.py][INFO] Epoch:[0/2](48900/4588595) loss:3.132 lr:0.0001075 epoch_Time:28511.0min: [2024-01-02 22:46:54,562][model8_pretrain.py][INFO] Epoch:[0/2](48900/4588595) loss:3.585 lr:0.0001075 epoch_Time:28511.0min: [2024-01-02 22:46:54,562][model8_pretrain.py][INFO] Epoch:[0/2](48900/4588595) loss:3.212 lr:0.0001075 epoch_Time:28511.0min: [2024-01-02 22:46:54,562][model8_pretrain.py][INFO] Epoch:[0/2](48900/4588595) loss:3.560 lr:0.0001075 epoch_Time:28511.0min: [2024-01-02 22:47:31,509][model8_pretrain.py][INFO] Epoch:[0/2](49000/4588595) loss:3.249 lr:0.0001069 epoch_Time:28509.0min: [2024-01-02 22:47:31,509][model8_pretrain.py][INFO] Epoch:[0/2](49000/4588595) loss:3.095 lr:0.0001069 epoch_Time:28509.0min: [2024-01-02 22:47:31,509][model8_pretrain.py][INFO] Epoch:[0/2](49000/4588595) loss:3.175 lr:0.0001069 epoch_Time:28509.0min: [2024-01-02 22:47:31,509][model8_pretrain.py][INFO] Epoch:[0/2](49000/4588595) loss:3.331 lr:0.0001069 epoch_Time:28509.0min: [2024-01-02 22:47:31,509][model8_pretrain.py][INFO] Epoch:[0/2](49000/4588595) loss:3.308 lr:0.0001069 epoch_Time:28509.0min: [2024-01-02 22:47:31,509][model8_pretrain.py][INFO] Epoch:[0/2](49000/4588595) loss:3.335 lr:0.0001069 epoch_Time:28509.0min: [2024-01-02 22:47:31,509][model8_pretrain.py][INFO] Epoch:[0/2](49000/4588595) 
loss:3.154 lr:0.0001069 epoch_Time:28509.0min: [2024-01-02 22:47:31,509][model8_pretrain.py][INFO] Epoch:[0/2](49000/4588595) loss:3.136 lr:0.0001069 epoch_Time:28509.0min: [2024-01-02 22:48:08,464][model8_pretrain.py][INFO] Epoch:[0/2](49100/4588595) loss:3.072 lr:0.0001064 epoch_Time:28507.0min: [2024-01-02 22:48:08,464][model8_pretrain.py][INFO] Epoch:[0/2](49100/4588595) loss:3.256 lr:0.0001064 epoch_Time:28507.0min: [2024-01-02 22:48:08,464][model8_pretrain.py][INFO] Epoch:[0/2](49100/4588595) loss:2.953 lr:0.0001064 epoch_Time:28507.0min: [2024-01-02 22:48:08,464][model8_pretrain.py][INFO] Epoch:[0/2](49100/4588595) loss:3.601 lr:0.0001064 epoch_Time:28507.0min: [2024-01-02 22:48:08,464][model8_pretrain.py][INFO] Epoch:[0/2](49100/4588595) loss:3.470 lr:0.0001064 epoch_Time:28507.0min: [2024-01-02 22:48:08,464][model8_pretrain.py][INFO] Epoch:[0/2](49100/4588595) loss:3.531 lr:0.0001064 epoch_Time:28507.0min: [2024-01-02 22:48:08,464][model8_pretrain.py][INFO] Epoch:[0/2](49100/4588595) loss:2.904 lr:0.0001064 epoch_Time:28507.0min: [2024-01-02 22:48:08,465][model8_pretrain.py][INFO] Epoch:[0/2](49100/4588595) loss:3.237 lr:0.0001064 epoch_Time:28507.0min: [2024-01-02 22:48:45,419][model8_pretrain.py][INFO] Epoch:[0/2](49200/4588595) loss:3.151 lr:0.0001058 epoch_Time:28506.0min: [2024-01-02 22:48:45,419][model8_pretrain.py][INFO] Epoch:[0/2](49200/4588595) loss:3.392 lr:0.0001058 epoch_Time:28506.0min: [2024-01-02 22:48:45,419][model8_pretrain.py][INFO] Epoch:[0/2](49200/4588595) loss:3.216 lr:0.0001058 epoch_Time:28506.0min: [2024-01-02 22:48:45,419][model8_pretrain.py][INFO] Epoch:[0/2](49200/4588595) loss:3.252 lr:0.0001058 epoch_Time:28506.0min: [2024-01-02 22:48:45,419][model8_pretrain.py][INFO] Epoch:[0/2](49200/4588595) loss:3.324 lr:0.0001058 epoch_Time:28506.0min: [2024-01-02 22:48:45,419][model8_pretrain.py][INFO] Epoch:[0/2](49200/4588595) loss:2.976 lr:0.0001058 epoch_Time:28506.0min: [2024-01-02 22:48:45,419][model8_pretrain.py][INFO] 
Epoch:[0/2](49200/4588595) loss:2.417 lr:0.0001058 epoch_Time:28506.0min: [2024-01-02 22:48:45,420][model8_pretrain.py][INFO] Epoch:[0/2](49200/4588595) loss:2.883 lr:0.0001058 epoch_Time:28506.0min: [2024-01-02 22:49:22,373][model8_pretrain.py][INFO] Epoch:[0/2](49300/4588595) loss:3.634 lr:0.0001053 epoch_Time:28504.0min: [2024-01-02 22:49:22,373][model8_pretrain.py][INFO] Epoch:[0/2](49300/4588595) loss:3.152 lr:0.0001053 epoch_Time:28504.0min: [2024-01-02 22:49:22,373][model8_pretrain.py][INFO] Epoch:[0/2](49300/4588595) loss:3.472 lr:0.0001053 epoch_Time:28504.0min: [2024-01-02 22:49:22,373][model8_pretrain.py][INFO] Epoch:[0/2](49300/4588595) loss:3.304 lr:0.0001053 epoch_Time:28504.0min: [2024-01-02 22:49:22,373][model8_pretrain.py][INFO] Epoch:[0/2](49300/4588595) loss:3.476 lr:0.0001053 epoch_Time:28504.0min: [2024-01-02 22:49:22,373][model8_pretrain.py][INFO] Epoch:[0/2](49300/4588595) loss:3.343 lr:0.0001053 epoch_Time:28504.0min: [2024-01-02 22:49:22,373][model8_pretrain.py][INFO] Epoch:[0/2](49300/4588595) loss:2.758 lr:0.0001053 epoch_Time:28504.0min: [2024-01-02 22:49:22,373][model8_pretrain.py][INFO] Epoch:[0/2](49300/4588595) loss:3.166 lr:0.0001053 epoch_Time:28504.0min: [2024-01-02 22:49:59,329][model8_pretrain.py][INFO] Epoch:[0/2](49400/4588595) loss:3.318 lr:0.0001047 epoch_Time:28502.0min: [2024-01-02 22:49:59,329][model8_pretrain.py][INFO] Epoch:[0/2](49400/4588595) loss:3.188 lr:0.0001047 epoch_Time:28502.0min: [2024-01-02 22:49:59,329][model8_pretrain.py][INFO] Epoch:[0/2](49400/4588595) loss:3.431 lr:0.0001047 epoch_Time:28502.0min: [2024-01-02 22:49:59,329][model8_pretrain.py][INFO] Epoch:[0/2](49400/4588595) loss:3.236 lr:0.0001047 epoch_Time:28502.0min: [2024-01-02 22:49:59,329][model8_pretrain.py][INFO] Epoch:[0/2](49400/4588595) loss:3.363 lr:0.0001047 epoch_Time:28502.0min: [2024-01-02 22:49:59,329][model8_pretrain.py][INFO] Epoch:[0/2](49400/4588595) loss:3.611 lr:0.0001047 epoch_Time:28502.0min: [2024-01-02 
22:49:59,329][model8_pretrain.py][INFO] Epoch:[0/2](49400/4588595) loss:3.435 lr:0.0001047 epoch_Time:28502.0min: [2024-01-02 22:49:59,329][model8_pretrain.py][INFO] Epoch:[0/2](49400/4588595) loss:3.128 lr:0.0001047 epoch_Time:28502.0min: [2024-01-02 22:50:43,286][model8_pretrain.py][INFO] Epoch:[0/2](49500/4588595) loss:2.483 lr:0.0001042 epoch_Time:28512.0min: [2024-01-02 22:50:43,286][model8_pretrain.py][INFO] Epoch:[0/2](49500/4588595) loss:3.408 lr:0.0001042 epoch_Time:28512.0min: [2024-01-02 22:50:43,286][model8_pretrain.py][INFO] Epoch:[0/2](49500/4588595) loss:3.578 lr:0.0001042 epoch_Time:28512.0min: [2024-01-02 22:50:43,287][model8_pretrain.py][INFO] Epoch:[0/2](49500/4588595) loss:2.811 lr:0.0001042 epoch_Time:28512.0min: [2024-01-02 22:50:43,286][model8_pretrain.py][INFO] Epoch:[0/2](49500/4588595) loss:3.369 lr:0.0001042 epoch_Time:28512.0min: [2024-01-02 22:50:43,286][model8_pretrain.py][INFO] Epoch:[0/2](49500/4588595) loss:3.544 lr:0.0001042 epoch_Time:28512.0min: [2024-01-02 22:50:43,287][model8_pretrain.py][INFO] Epoch:[0/2](49500/4588595) loss:2.858 lr:0.0001042 epoch_Time:28512.0min: [2024-01-02 22:50:43,287][model8_pretrain.py][INFO] Epoch:[0/2](49500/4588595) loss:3.101 lr:0.0001042 epoch_Time:28512.0min: [2024-01-02 22:51:20,229][model8_pretrain.py][INFO] Epoch:[0/2](49600/4588595) loss:3.395 lr:0.0001037 epoch_Time:28509.0min: [2024-01-02 22:51:20,229][model8_pretrain.py][INFO] Epoch:[0/2](49600/4588595) loss:2.612 lr:0.0001037 epoch_Time:28509.0min: [2024-01-02 22:51:20,229][model8_pretrain.py][INFO] Epoch:[0/2](49600/4588595) loss:3.191 lr:0.0001037 epoch_Time:28509.0min: [2024-01-02 22:51:20,229][model8_pretrain.py][INFO] Epoch:[0/2](49600/4588595) loss:2.812 lr:0.0001037 epoch_Time:28509.0min: [2024-01-02 22:51:20,229][model8_pretrain.py][INFO] Epoch:[0/2](49600/4588595) loss:3.286 lr:0.0001037 epoch_Time:28509.0min: [2024-01-02 22:51:20,229][model8_pretrain.py][INFO] Epoch:[0/2](49600/4588595) loss:3.445 lr:0.0001037 
epoch_Time:28510.0min: [2024-01-02 22:51:20,229][model8_pretrain.py][INFO] Epoch:[0/2](49600/4588595) loss:3.478 lr:0.0001037 epoch_Time:28509.0min: [2024-01-02 22:51:20,229][model8_pretrain.py][INFO] Epoch:[0/2](49600/4588595) loss:2.915 lr:0.0001037 epoch_Time:28509.0min: [2024-01-02 22:51:57,166][model8_pretrain.py][INFO] Epoch:[0/2](49700/4588595) loss:2.543 lr:0.0001031 epoch_Time:28507.0min: [2024-01-02 22:51:57,166][model8_pretrain.py][INFO] Epoch:[0/2](49700/4588595) loss:3.616 lr:0.0001031 epoch_Time:28507.0min: [2024-01-02 22:51:57,166][model8_pretrain.py][INFO] Epoch:[0/2](49700/4588595) loss:3.597 lr:0.0001031 epoch_Time:28507.0min: [2024-01-02 22:51:57,166][model8_pretrain.py][INFO] Epoch:[0/2](49700/4588595) loss:3.125 lr:0.0001031 epoch_Time:28507.0min: [2024-01-02 22:51:57,166][model8_pretrain.py][INFO] Epoch:[0/2](49700/4588595) loss:2.943 lr:0.0001031 epoch_Time:28507.0min: [2024-01-02 22:51:57,166][model8_pretrain.py][INFO] Epoch:[0/2](49700/4588595) loss:3.088 lr:0.0001031 epoch_Time:28507.0min: [2024-01-02 22:51:57,166][model8_pretrain.py][INFO] Epoch:[0/2](49700/4588595) loss:2.915 lr:0.0001031 epoch_Time:28507.0min: [2024-01-02 22:51:57,166][model8_pretrain.py][INFO] Epoch:[0/2](49700/4588595) loss:2.854 lr:0.0001031 epoch_Time:28507.0min: [2024-01-02 22:52:34,102][model8_pretrain.py][INFO] Epoch:[0/2](49800/4588595) loss:3.247 lr:0.0001026 epoch_Time:28506.0min: [2024-01-02 22:52:34,102][model8_pretrain.py][INFO] Epoch:[0/2](49800/4588595) loss:3.432 lr:0.0001026 epoch_Time:28506.0min: [2024-01-02 22:52:34,102][model8_pretrain.py][INFO] Epoch:[0/2](49800/4588595) loss:3.277 lr:0.0001026 epoch_Time:28506.0min: [2024-01-02 22:52:34,102][model8_pretrain.py][INFO] Epoch:[0/2](49800/4588595) loss:3.141 lr:0.0001026 epoch_Time:28506.0min: [2024-01-02 22:52:34,102][model8_pretrain.py][INFO] Epoch:[0/2](49800/4588595) loss:3.195 lr:0.0001026 epoch_Time:28506.0min: [2024-01-02 22:52:34,102][model8_pretrain.py][INFO] Epoch:[0/2](49800/4588595) 
loss:2.548 lr:0.0001026 epoch_Time:28506.0min: [2024-01-02 22:52:34,102][model8_pretrain.py][INFO] Epoch:[0/2](49800/4588595) loss:3.046 lr:0.0001026 epoch_Time:28506.0min: [2024-01-02 22:52:34,102][model8_pretrain.py][INFO] Epoch:[0/2](49800/4588595) loss:3.152 lr:0.0001026 epoch_Time:28506.0min: [2024-01-02 22:53:11,037][model8_pretrain.py][INFO] Epoch:[0/2](49900/4588595) loss:3.434 lr:0.0001021 epoch_Time:28504.0min: [2024-01-02 22:53:11,037][model8_pretrain.py][INFO] Epoch:[0/2](49900/4588595) loss:3.034 lr:0.0001021 epoch_Time:28504.0min: [2024-01-02 22:53:11,037][model8_pretrain.py][INFO] Epoch:[0/2](49900/4588595) loss:3.256 lr:0.0001021 epoch_Time:28504.0min: [2024-01-02 22:53:11,037][model8_pretrain.py][INFO] Epoch:[0/2](49900/4588595) loss:3.182 lr:0.0001021 epoch_Time:28504.0min: [2024-01-02 22:53:11,037][model8_pretrain.py][INFO] Epoch:[0/2](49900/4588595) loss:2.571 lr:0.0001021 epoch_Time:28504.0min: [2024-01-02 22:53:11,037][model8_pretrain.py][INFO] Epoch:[0/2](49900/4588595) loss:3.230 lr:0.0001021 epoch_Time:28504.0min: [2024-01-02 22:53:11,037][model8_pretrain.py][INFO] Epoch:[0/2](49900/4588595) loss:3.711 lr:0.0001021 epoch_Time:28504.0min: [2024-01-02 22:53:11,037][model8_pretrain.py][INFO] Epoch:[0/2](49900/4588595) loss:3.528 lr:0.0001021 epoch_Time:28504.0min: [2024-01-02 22:53:47,975][model8_pretrain.py][INFO] Epoch:[0/2](50000/4588595) loss:2.860 lr:0.0001015 epoch_Time:28502.0min: [2024-01-02 22:53:47,975][model8_pretrain.py][INFO] Epoch:[0/2](50000/4588595) loss:3.334 lr:0.0001015 epoch_Time:28502.0min: [2024-01-02 22:53:47,975][model8_pretrain.py][INFO] Epoch:[0/2](50000/4588595) loss:3.267 lr:0.0001015 epoch_Time:28502.0min: [2024-01-02 22:53:47,975][model8_pretrain.py][INFO] Epoch:[0/2](50000/4588595) loss:3.155 lr:0.0001015 epoch_Time:28502.0min: [2024-01-02 22:53:47,975][model8_pretrain.py][INFO] Epoch:[0/2](50000/4588595) loss:2.948 lr:0.0001015 epoch_Time:28502.0min: [2024-01-02 22:53:47,975][model8_pretrain.py][INFO] 
Epoch:[0/2](50000/4588595) loss:2.918 lr:0.0001015 epoch_Time:28502.0min: [2024-01-02 22:53:47,975][model8_pretrain.py][INFO] Epoch:[0/2](50000/4588595) loss:3.260 lr:0.0001015 epoch_Time:28502.0min: [2024-01-02 22:53:47,979][model8_pretrain.py][INFO] Epoch:[0/2](50000/4588595) loss:3.499 lr:0.0001015 epoch_Time:28502.0min: [2024-01-02 22:54:24,920][model8_pretrain.py][INFO] Epoch:[0/2](50100/4588595) loss:3.081 lr:0.0001010 epoch_Time:28501.0min: [2024-01-02 22:54:24,920][model8_pretrain.py][INFO] Epoch:[0/2](50100/4588595) loss:3.640 lr:0.0001010 epoch_Time:28501.0min: [2024-01-02 22:54:24,920][model8_pretrain.py][INFO] Epoch:[0/2](50100/4588595) loss:3.414 lr:0.0001010 epoch_Time:28501.0min: [2024-01-02 22:54:24,921][model8_pretrain.py][INFO] Epoch:[0/2](50100/4588595) loss:3.569 lr:0.0001010 epoch_Time:28501.0min: [2024-01-02 22:54:24,921][model8_pretrain.py][INFO] Epoch:[0/2](50100/4588595) loss:3.239 lr:0.0001010 epoch_Time:28501.0min: [2024-01-02 22:54:24,921][model8_pretrain.py][INFO] Epoch:[0/2](50100/4588595) loss:3.547 lr:0.0001010 epoch_Time:28501.0min: [2024-01-02 22:54:24,921][model8_pretrain.py][INFO] Epoch:[0/2](50100/4588595) loss:2.491 lr:0.0001010 epoch_Time:28501.0min: [2024-01-02 22:54:24,921][model8_pretrain.py][INFO] Epoch:[0/2](50100/4588595) loss:3.124 lr:0.0001010 epoch_Time:28501.0min: [2024-01-02 22:55:01,863][model8_pretrain.py][INFO] Epoch:[0/2](50200/4588595) loss:2.686 lr:0.0001004 epoch_Time:28499.0min: [2024-01-02 22:55:01,863][model8_pretrain.py][INFO] Epoch:[0/2](50200/4588595) loss:2.992 lr:0.0001004 epoch_Time:28499.0min: [2024-01-02 22:55:01,863][model8_pretrain.py][INFO] Epoch:[0/2](50200/4588595) loss:3.540 lr:0.0001004 epoch_Time:28499.0min: [2024-01-02 22:55:01,863][model8_pretrain.py][INFO] Epoch:[0/2](50200/4588595) loss:3.331 lr:0.0001004 epoch_Time:28499.0min: [2024-01-02 22:55:01,863][model8_pretrain.py][INFO] Epoch:[0/2](50200/4588595) loss:2.834 lr:0.0001004 epoch_Time:28499.0min: [2024-01-02 
22:55:01,863][model8_pretrain.py][INFO] Epoch:[0/2](50200/4588595) loss:3.535 lr:0.0001004 epoch_Time:28499.0min: [2024-01-02 22:55:01,863][model8_pretrain.py][INFO] Epoch:[0/2](50200/4588595) loss:3.155 lr:0.0001004 epoch_Time:28499.0min: [2024-01-02 22:55:01,863][model8_pretrain.py][INFO] Epoch:[0/2](50200/4588595) loss:3.675 lr:0.0001004 epoch_Time:28499.0min: [2024-01-02 22:55:45,884][model8_pretrain.py][INFO] Epoch:[0/2](50300/4588595) loss:3.525 lr:0.0000999 epoch_Time:28508.0min: [2024-01-02 22:55:45,884][model8_pretrain.py][INFO] Epoch:[0/2](50300/4588595) loss:3.178 lr:0.0000999 epoch_Time:28508.0min: [2024-01-02 22:55:45,884][model8_pretrain.py][INFO] Epoch:[0/2](50300/4588595) loss:3.520 lr:0.0000999 epoch_Time:28508.0min: [2024-01-02 22:55:45,884][model8_pretrain.py][INFO] Epoch:[0/2](50300/4588595) loss:3.471 lr:0.0000999 epoch_Time:28508.0min: [2024-01-02 22:55:45,884][model8_pretrain.py][INFO] Epoch:[0/2](50300/4588595) loss:2.991 lr:0.0000999 epoch_Time:28508.0min: [2024-01-02 22:55:45,884][model8_pretrain.py][INFO] Epoch:[0/2](50300/4588595) loss:3.639 lr:0.0000999 epoch_Time:28508.0min: [2024-01-02 22:55:45,884][model8_pretrain.py][INFO] Epoch:[0/2](50300/4588595) loss:3.178 lr:0.0000999 epoch_Time:28508.0min: [2024-01-02 22:55:45,884][model8_pretrain.py][INFO] Epoch:[0/2](50300/4588595) loss:2.380 lr:0.0000999 epoch_Time:28508.0min: [2024-01-02 22:56:22,835][model8_pretrain.py][INFO] Epoch:[0/2](50400/4588595) loss:3.249 lr:0.0000994 epoch_Time:28506.0min: [2024-01-02 22:56:22,835][model8_pretrain.py][INFO] Epoch:[0/2](50400/4588595) loss:3.496 lr:0.0000994 epoch_Time:28506.0min: [2024-01-02 22:56:22,835][model8_pretrain.py][INFO] Epoch:[0/2](50400/4588595) loss:3.276 lr:0.0000994 epoch_Time:28506.0min: [2024-01-02 22:56:22,835][model8_pretrain.py][INFO] Epoch:[0/2](50400/4588595) loss:3.029 lr:0.0000994 epoch_Time:28506.0min: [2024-01-02 22:56:22,835][model8_pretrain.py][INFO] Epoch:[0/2](50400/4588595) loss:3.327 lr:0.0000994 
epoch_Time:28506.0min: [2024-01-02 22:56:22,835][model8_pretrain.py][INFO] Epoch:[0/2](50400/4588595) loss:3.088 lr:0.0000994 epoch_Time:28506.0min: [2024-01-02 22:56:22,835][model8_pretrain.py][INFO] Epoch:[0/2](50400/4588595) loss:3.484 lr:0.0000994 epoch_Time:28506.0min: [2024-01-02 22:56:22,835][model8_pretrain.py][INFO] Epoch:[0/2](50400/4588595) loss:3.279 lr:0.0000994 epoch_Time:28506.0min: [2024-01-02 22:56:59,781][model8_pretrain.py][INFO] Epoch:[0/2](50500/4588595) loss:3.247 lr:0.0000988 epoch_Time:28504.0min: [2024-01-02 22:56:59,781][model8_pretrain.py][INFO] Epoch:[0/2](50500/4588595) loss:3.153 lr:0.0000988 epoch_Time:28504.0min: [2024-01-02 22:56:59,781][model8_pretrain.py][INFO] Epoch:[0/2](50500/4588595) loss:2.989 lr:0.0000988 epoch_Time:28504.0min: [2024-01-02 22:56:59,781][model8_pretrain.py][INFO] Epoch:[0/2](50500/4588595) loss:3.379 lr:0.0000988 epoch_Time:28504.0min: [2024-01-02 22:56:59,781][model8_pretrain.py][INFO] Epoch:[0/2](50500/4588595) loss:2.935 lr:0.0000988 epoch_Time:28504.0min: [2024-01-02 22:56:59,781][model8_pretrain.py][INFO] Epoch:[0/2](50500/4588595) loss:3.482 lr:0.0000988 epoch_Time:28504.0min: [2024-01-02 22:56:59,781][model8_pretrain.py][INFO] Epoch:[0/2](50500/4588595) loss:3.250 lr:0.0000988 epoch_Time:28504.0min: [2024-01-02 22:56:59,781][model8_pretrain.py][INFO] Epoch:[0/2](50500/4588595) loss:2.780 lr:0.0000988 epoch_Time:28504.0min: [2024-01-02 22:57:36,727][model8_pretrain.py][INFO] Epoch:[0/2](50600/4588595) loss:3.066 lr:0.0000983 epoch_Time:28503.0min: [2024-01-02 22:57:36,727][model8_pretrain.py][INFO] Epoch:[0/2](50600/4588595) loss:2.582 lr:0.0000983 epoch_Time:28503.0min: [2024-01-02 22:57:36,727][model8_pretrain.py][INFO] Epoch:[0/2](50600/4588595) loss:3.429 lr:0.0000983 epoch_Time:28503.0min: [2024-01-02 22:57:36,727][model8_pretrain.py][INFO] Epoch:[0/2](50600/4588595) loss:3.404 lr:0.0000983 epoch_Time:28503.0min: [2024-01-02 22:57:36,727][model8_pretrain.py][INFO] Epoch:[0/2](50600/4588595) 
loss:3.085 lr:0.0000983 epoch_Time:28503.0min: [2024-01-02 22:57:36,727][model8_pretrain.py][INFO] Epoch:[0/2](50600/4588595) loss:3.540 lr:0.0000983 epoch_Time:28503.0min: [2024-01-02 22:57:36,727][model8_pretrain.py][INFO] Epoch:[0/2](50600/4588595) loss:3.171 lr:0.0000983 epoch_Time:28503.0min: [2024-01-02 22:57:36,727][model8_pretrain.py][INFO] Epoch:[0/2](50600/4588595) loss:2.863 lr:0.0000983 epoch_Time:28503.0min: [2024-01-02 22:58:13,672][model8_pretrain.py][INFO] Epoch:[0/2](50700/4588595) loss:2.994 lr:0.0000978 epoch_Time:28501.0min: [2024-01-02 22:58:13,672][model8_pretrain.py][INFO] Epoch:[0/2](50700/4588595) loss:3.267 lr:0.0000978 epoch_Time:28501.0min: [2024-01-02 22:58:13,672][model8_pretrain.py][INFO] Epoch:[0/2](50700/4588595) loss:2.904 lr:0.0000978 epoch_Time:28501.0min: [2024-01-02 22:58:13,672][model8_pretrain.py][INFO] Epoch:[0/2](50700/4588595) loss:3.116 lr:0.0000978 epoch_Time:28501.0min: [2024-01-02 22:58:13,672][model8_pretrain.py][INFO] Epoch:[0/2](50700/4588595) loss:3.246 lr:0.0000978 epoch_Time:28501.0min: [2024-01-02 22:58:13,672][model8_pretrain.py][INFO] Epoch:[0/2](50700/4588595) loss:3.587 lr:0.0000978 epoch_Time:28501.0min: [2024-01-02 22:58:13,672][model8_pretrain.py][INFO] Epoch:[0/2](50700/4588595) loss:3.278 lr:0.0000978 epoch_Time:28501.0min: [2024-01-02 22:58:13,672][model8_pretrain.py][INFO] Epoch:[0/2](50700/4588595) loss:3.373 lr:0.0000978 epoch_Time:28501.0min: [2024-01-02 22:58:50,618][model8_pretrain.py][INFO] Epoch:[0/2](50800/4588595) loss:2.575 lr:0.0000973 epoch_Time:28499.0min: [2024-01-02 22:58:50,618][model8_pretrain.py][INFO] Epoch:[0/2](50800/4588595) loss:2.521 lr:0.0000973 epoch_Time:28499.0min: [2024-01-02 22:58:50,618][model8_pretrain.py][INFO] Epoch:[0/2](50800/4588595) loss:3.537 lr:0.0000973 epoch_Time:28499.0min: [2024-01-02 22:58:50,618][model8_pretrain.py][INFO] Epoch:[0/2](50800/4588595) loss:3.114 lr:0.0000973 epoch_Time:28499.0min: [2024-01-02 22:58:50,618][model8_pretrain.py][INFO] 
Epoch:[0/2](50800/4588595) loss:3.287 lr:0.0000973 epoch_Time:28499.0min: [2024-01-02 22:58:50,618][model8_pretrain.py][INFO] Epoch:[0/2](50800/4588595) loss:3.290 lr:0.0000973 epoch_Time:28499.0min: [2024-01-02 22:58:50,618][model8_pretrain.py][INFO] Epoch:[0/2](50800/4588595) loss:3.426 lr:0.0000973 epoch_Time:28499.0min: [2024-01-02 22:58:50,618][model8_pretrain.py][INFO] Epoch:[0/2](50800/4588595) loss:3.020 lr:0.0000973 epoch_Time:28499.0min: [2024-01-02 22:59:27,530][model8_pretrain.py][INFO] Epoch:[0/2](50900/4588595) loss:3.421 lr:0.0000967 epoch_Time:28498.0min: [2024-01-02 22:59:27,530][model8_pretrain.py][INFO] Epoch:[0/2](50900/4588595) loss:3.303 lr:0.0000967 epoch_Time:28498.0min: [2024-01-02 22:59:27,530][model8_pretrain.py][INFO] Epoch:[0/2](50900/4588595) loss:2.980 lr:0.0000967 epoch_Time:28498.0min: [2024-01-02 22:59:27,530][model8_pretrain.py][INFO] Epoch:[0/2](50900/4588595) loss:2.978 lr:0.0000967 epoch_Time:28498.0min: [2024-01-02 22:59:27,530][model8_pretrain.py][INFO] Epoch:[0/2](50900/4588595) loss:3.170 lr:0.0000967 epoch_Time:28498.0min: [2024-01-02 22:59:27,530][model8_pretrain.py][INFO] Epoch:[0/2](50900/4588595) loss:2.726 lr:0.0000967 epoch_Time:28498.0min: [2024-01-02 22:59:27,530][model8_pretrain.py][INFO] Epoch:[0/2](50900/4588595) loss:3.235 lr:0.0000967 epoch_Time:28498.0min: [2024-01-02 22:59:27,531][model8_pretrain.py][INFO] Epoch:[0/2](50900/4588595) loss:3.544 lr:0.0000967 epoch_Time:28498.0min: [2024-01-02 23:00:04,474][model8_pretrain.py][INFO] Epoch:[0/2](51000/4588595) loss:3.565 lr:0.0000962 epoch_Time:28495.0min: [2024-01-02 23:00:04,474][model8_pretrain.py][INFO] Epoch:[0/2](51000/4588595) loss:3.745 lr:0.0000962 epoch_Time:28495.0min: [2024-01-02 23:00:04,474][model8_pretrain.py][INFO] Epoch:[0/2](51000/4588595) loss:3.672 lr:0.0000962 epoch_Time:28495.0min: [2024-01-02 23:00:04,474][model8_pretrain.py][INFO] Epoch:[0/2](51000/4588595) loss:3.459 lr:0.0000962 epoch_Time:28495.0min: [2024-01-02 
23:00:04,474][model8_pretrain.py][INFO] Epoch:[0/2](51000/4588595) loss:2.701 lr:0.0000962 epoch_Time:28495.0min: [2024-01-02 23:00:04,474][model8_pretrain.py][INFO] Epoch:[0/2](51000/4588595) loss:3.237 lr:0.0000962 epoch_Time:28495.0min: [2024-01-02 23:00:04,474][model8_pretrain.py][INFO] Epoch:[0/2](51000/4588595) loss:3.516 lr:0.0000962 epoch_Time:28495.0min: [2024-01-02 23:00:04,474][model8_pretrain.py][INFO] Epoch:[0/2](51000/4588595) loss:2.938 lr:0.0000962 epoch_Time:28495.0min: [2024-01-02 23:00:48,494][model8_pretrain.py][INFO] Epoch:[0/2](51100/4588595) loss:2.887 lr:0.0000957 epoch_Time:28504.0min: [2024-01-02 23:00:48,494][model8_pretrain.py][INFO] Epoch:[0/2](51100/4588595) loss:3.493 lr:0.0000957 epoch_Time:28504.0min: [2024-01-02 23:00:48,494][model8_pretrain.py][INFO] Epoch:[0/2](51100/4588595) loss:3.430 lr:0.0000957 epoch_Time:28504.0min: [2024-01-02 23:00:48,494][model8_pretrain.py][INFO] Epoch:[0/2](51100/4588595) loss:3.484 lr:0.0000957 epoch_Time:28504.0min: [2024-01-02 23:00:48,494][model8_pretrain.py][INFO] Epoch:[0/2](51100/4588595) loss:3.368 lr:0.0000957 epoch_Time:28504.0min: [2024-01-02 23:00:48,494][model8_pretrain.py][INFO] Epoch:[0/2](51100/4588595) loss:2.881 lr:0.0000957 epoch_Time:28504.0min: [2024-01-02 23:00:48,495][model8_pretrain.py][INFO] Epoch:[0/2](51100/4588595) loss:3.024 lr:0.0000957 epoch_Time:28504.0min: [2024-01-02 23:00:48,495][model8_pretrain.py][INFO] Epoch:[0/2](51100/4588595) loss:3.453 lr:0.0000957 epoch_Time:28504.0min: [2024-01-02 23:01:25,436][model8_pretrain.py][INFO] Epoch:[0/2](51200/4588595) loss:3.385 lr:0.0000951 epoch_Time:28503.0min: [2024-01-02 23:01:25,436][model8_pretrain.py][INFO] Epoch:[0/2](51200/4588595) loss:3.479 lr:0.0000951 epoch_Time:28503.0min: [2024-01-02 23:01:25,436][model8_pretrain.py][INFO] Epoch:[0/2](51200/4588595) loss:3.187 lr:0.0000951 epoch_Time:28503.0min: [2024-01-02 23:01:25,436][model8_pretrain.py][INFO] Epoch:[0/2](51200/4588595) loss:3.089 lr:0.0000951 
epoch_Time:28503.0min: [2024-01-02 23:01:25,436][model8_pretrain.py][INFO] Epoch:[0/2](51200/4588595) loss:3.046 lr:0.0000951 epoch_Time:28503.0min: [2024-01-02 23:01:25,436][model8_pretrain.py][INFO] Epoch:[0/2](51200/4588595) loss:3.140 lr:0.0000951 epoch_Time:28503.0min: [2024-01-02 23:01:25,436][model8_pretrain.py][INFO] Epoch:[0/2](51200/4588595) loss:3.223 lr:0.0000951 epoch_Time:28503.0min: [2024-01-02 23:01:25,436][model8_pretrain.py][INFO] Epoch:[0/2](51200/4588595) loss:2.845 lr:0.0000951 epoch_Time:28503.0min: [2024-01-02 23:02:02,365][model8_pretrain.py][INFO] Epoch:[0/2](51300/4588595) loss:3.470 lr:0.0000946 epoch_Time:28501.0min: [2024-01-02 23:02:02,365][model8_pretrain.py][INFO] Epoch:[0/2](51300/4588595) loss:2.888 lr:0.0000946 epoch_Time:28501.0min: [2024-01-02 23:02:02,365][model8_pretrain.py][INFO] Epoch:[0/2](51300/4588595) loss:3.226 lr:0.0000946 epoch_Time:28501.0min: [2024-01-02 23:02:02,365][model8_pretrain.py][INFO] Epoch:[0/2](51300/4588595) loss:2.871 lr:0.0000946 epoch_Time:28501.0min: [2024-01-02 23:02:02,365][model8_pretrain.py][INFO] Epoch:[0/2](51300/4588595) loss:3.160 lr:0.0000946 epoch_Time:28501.0min: [2024-01-02 23:02:02,365][model8_pretrain.py][INFO] Epoch:[0/2](51300/4588595) loss:3.226 lr:0.0000946 epoch_Time:28501.0min: [2024-01-02 23:02:02,365][model8_pretrain.py][INFO] Epoch:[0/2](51300/4588595) loss:3.209 lr:0.0000946 epoch_Time:28501.0min: [2024-01-02 23:02:02,365][model8_pretrain.py][INFO] Epoch:[0/2](51300/4588595) loss:2.903 lr:0.0000946 epoch_Time:28501.0min: [2024-01-02 23:02:39,297][model8_pretrain.py][INFO] Epoch:[0/2](51400/4588595) loss:3.263 lr:0.0000941 epoch_Time:28500.0min: [2024-01-02 23:02:39,297][model8_pretrain.py][INFO] Epoch:[0/2](51400/4588595) loss:3.000 lr:0.0000941 epoch_Time:28500.0min: [2024-01-02 23:02:39,297][model8_pretrain.py][INFO] Epoch:[0/2](51400/4588595) loss:3.352 lr:0.0000941 epoch_Time:28500.0min: [2024-01-02 23:02:39,297][model8_pretrain.py][INFO] Epoch:[0/2](51400/4588595) 
loss:2.547 lr:0.0000941 epoch_Time:28500.0min: [2024-01-02 23:02:39,297][model8_pretrain.py][INFO] Epoch:[0/2](51400/4588595) loss:2.881 lr:0.0000941 epoch_Time:28500.0min: [2024-01-02 23:02:39,297][model8_pretrain.py][INFO] Epoch:[0/2](51400/4588595) loss:2.813 lr:0.0000941 epoch_Time:28500.0min: [2024-01-02 23:02:39,297][model8_pretrain.py][INFO] Epoch:[0/2](51400/4588595) loss:3.404 lr:0.0000941 epoch_Time:28500.0min: [2024-01-02 23:02:39,297][model8_pretrain.py][INFO] Epoch:[0/2](51400/4588595) loss:3.164 lr:0.0000941 epoch_Time:28500.0min: [2024-01-02 23:03:16,244][model8_pretrain.py][INFO] Epoch:[0/2](51500/4588595) loss:3.537 lr:0.0000936 epoch_Time:28497.0min: [2024-01-02 23:03:16,244][model8_pretrain.py][INFO] Epoch:[0/2](51500/4588595) loss:3.487 lr:0.0000936 epoch_Time:28497.0min: [2024-01-02 23:03:16,244][model8_pretrain.py][INFO] Epoch:[0/2](51500/4588595) loss:3.081 lr:0.0000936 epoch_Time:28497.0min: [2024-01-02 23:03:16,244][model8_pretrain.py][INFO] Epoch:[0/2](51500/4588595) loss:3.172 lr:0.0000936 epoch_Time:28497.0min: [2024-01-02 23:03:16,244][model8_pretrain.py][INFO] Epoch:[0/2](51500/4588595) loss:2.729 lr:0.0000936 epoch_Time:28497.0min: [2024-01-02 23:03:16,244][model8_pretrain.py][INFO] Epoch:[0/2](51500/4588595) loss:2.781 lr:0.0000936 epoch_Time:28497.0min: [2024-01-02 23:03:16,244][model8_pretrain.py][INFO] Epoch:[0/2](51500/4588595) loss:3.303 lr:0.0000936 epoch_Time:28497.0min: [2024-01-02 23:03:16,244][model8_pretrain.py][INFO] Epoch:[0/2](51500/4588595) loss:3.088 lr:0.0000936 epoch_Time:28497.0min: [2024-01-02 23:03:53,206][model8_pretrain.py][INFO] Epoch:[0/2](51600/4588595) loss:3.030 lr:0.0000931 epoch_Time:28495.0min: [2024-01-02 23:03:53,206][model8_pretrain.py][INFO] Epoch:[0/2](51600/4588595) loss:3.234 lr:0.0000931 epoch_Time:28495.0min: [2024-01-02 23:03:53,206][model8_pretrain.py][INFO] Epoch:[0/2](51600/4588595) loss:3.235 lr:0.0000931 epoch_Time:28495.0min: [2024-01-02 23:03:53,206][model8_pretrain.py][INFO] 
Epoch:[0/2](51600/4588595) loss:3.163 lr:0.0000931 epoch_Time:28495.0min: [2024-01-02 23:03:53,207][model8_pretrain.py][INFO] Epoch:[0/2](51600/4588595) loss:3.399 lr:0.0000931 epoch_Time:28495.0min: [2024-01-02 23:03:53,207][model8_pretrain.py][INFO] Epoch:[0/2](51600/4588595) loss:3.281 lr:0.0000931 epoch_Time:28495.0min: [2024-01-02 23:03:53,207][model8_pretrain.py][INFO] Epoch:[0/2](51600/4588595) loss:3.185 lr:0.0000931 epoch_Time:28495.0min: [2024-01-02 23:03:53,207][model8_pretrain.py][INFO] Epoch:[0/2](51600/4588595) loss:3.233 lr:0.0000931 epoch_Time:28495.0min: [2024-01-02 23:04:30,151][model8_pretrain.py][INFO] Epoch:[0/2](51700/4588595) loss:3.057 lr:0.0000925 epoch_Time:28494.0min: [2024-01-02 23:04:30,151][model8_pretrain.py][INFO] Epoch:[0/2](51700/4588595) loss:2.943 lr:0.0000925 epoch_Time:28494.0min: [2024-01-02 23:04:30,151][model8_pretrain.py][INFO] Epoch:[0/2](51700/4588595) loss:3.166 lr:0.0000925 epoch_Time:28494.0min: [2024-01-02 23:04:30,151][model8_pretrain.py][INFO] Epoch:[0/2](51700/4588595) loss:3.159 lr:0.0000925 epoch_Time:28494.0min: [2024-01-02 23:04:30,151][model8_pretrain.py][INFO] Epoch:[0/2](51700/4588595) loss:2.615 lr:0.0000925 epoch_Time:28494.0min: [2024-01-02 23:04:30,151][model8_pretrain.py][INFO] Epoch:[0/2](51700/4588595) loss:3.454 lr:0.0000925 epoch_Time:28494.0min: [2024-01-02 23:04:30,152][model8_pretrain.py][INFO] Epoch:[0/2](51700/4588595) loss:3.342 lr:0.0000925 epoch_Time:28494.0min: [2024-01-02 23:04:30,151][model8_pretrain.py][INFO] Epoch:[0/2](51700/4588595) loss:2.766 lr:0.0000925 epoch_Time:28494.0min: [2024-01-02 23:05:07,094][model8_pretrain.py][INFO] Epoch:[0/2](51800/4588595) loss:3.013 lr:0.0000920 epoch_Time:28492.0min: [2024-01-02 23:05:07,094][model8_pretrain.py][INFO] Epoch:[0/2](51800/4588595) loss:3.374 lr:0.0000920 epoch_Time:28492.0min: [2024-01-02 23:05:07,094][model8_pretrain.py][INFO] Epoch:[0/2](51800/4588595) loss:3.571 lr:0.0000920 epoch_Time:28492.0min: [2024-01-02 
23:05:07,094][model8_pretrain.py][INFO] Epoch:[0/2](51800/4588595) loss:3.615 lr:0.0000920 epoch_Time:28492.0min: [2024-01-02 23:05:07,094][model8_pretrain.py][INFO] Epoch:[0/2](51800/4588595) loss:3.622 lr:0.0000920 epoch_Time:28492.0min: [2024-01-02 23:05:07,094][model8_pretrain.py][INFO] Epoch:[0/2](51800/4588595) loss:2.936 lr:0.0000920 epoch_Time:28492.0min: [2024-01-02 23:05:07,094][model8_pretrain.py][INFO] Epoch:[0/2](51800/4588595) loss:2.909 lr:0.0000920 epoch_Time:28492.0min: [2024-01-02 23:05:07,094][model8_pretrain.py][INFO] Epoch:[0/2](51800/4588595) loss:3.120 lr:0.0000920 epoch_Time:28492.0min: [2024-01-02 23:05:51,028][model8_pretrain.py][INFO] Epoch:[0/2](51900/4588595) loss:3.346 lr:0.0000915 epoch_Time:28500.0min: [2024-01-02 23:05:51,028][model8_pretrain.py][INFO] Epoch:[0/2](51900/4588595) loss:3.259 lr:0.0000915 epoch_Time:28500.0min: [2024-01-02 23:05:51,028][model8_pretrain.py][INFO] Epoch:[0/2](51900/4588595) loss:3.338 lr:0.0000915 epoch_Time:28500.0min: [2024-01-02 23:05:51,029][model8_pretrain.py][INFO] Epoch:[0/2](51900/4588595) loss:3.010 lr:0.0000915 epoch_Time:28500.0min: [2024-01-02 23:05:51,028][model8_pretrain.py][INFO] Epoch:[0/2](51900/4588595) loss:3.143 lr:0.0000915 epoch_Time:28500.0min: [2024-01-02 23:05:51,029][model8_pretrain.py][INFO] Epoch:[0/2](51900/4588595) loss:3.126 lr:0.0000915 epoch_Time:28500.0min: [2024-01-02 23:05:51,029][model8_pretrain.py][INFO] Epoch:[0/2](51900/4588595) loss:2.800 lr:0.0000915 epoch_Time:28500.0min: [2024-01-02 23:05:51,029][model8_pretrain.py][INFO] Epoch:[0/2](51900/4588595) loss:3.303 lr:0.0000915 epoch_Time:28500.0min: [2024-01-02 23:06:27,973][model8_pretrain.py][INFO] Epoch:[0/2](52000/4588595) loss:3.444 lr:0.0000910 epoch_Time:28499.0min: [2024-01-02 23:06:27,973][model8_pretrain.py][INFO] Epoch:[0/2](52000/4588595) loss:3.391 lr:0.0000910 epoch_Time:28499.0min: [2024-01-02 23:06:27,973][model8_pretrain.py][INFO] Epoch:[0/2](52000/4588595) loss:3.652 lr:0.0000910 
epoch_Time:28499.0min: [2024-01-02 23:06:27,973][model8_pretrain.py][INFO] Epoch:[0/2](52000/4588595) loss:3.104 lr:0.0000910 epoch_Time:28499.0min: [2024-01-02 23:06:27,973][model8_pretrain.py][INFO] Epoch:[0/2](52000/4588595) loss:3.166 lr:0.0000910 epoch_Time:28499.0min: [2024-01-02 23:06:27,973][model8_pretrain.py][INFO] Epoch:[0/2](52000/4588595) loss:2.462 lr:0.0000910 epoch_Time:28499.0min: [2024-01-02 23:06:27,973][model8_pretrain.py][INFO] Epoch:[0/2](52000/4588595) loss:3.084 lr:0.0000910 epoch_Time:28499.0min: [2024-01-02 23:06:27,974][model8_pretrain.py][INFO] Epoch:[0/2](52000/4588595) loss:3.474 lr:0.0000910 epoch_Time:28499.0min: [2024-01-02 23:07:04,906][model8_pretrain.py][INFO] Epoch:[0/2](52100/4588595) loss:3.359 lr:0.0000905 epoch_Time:28497.0min: [2024-01-02 23:07:04,906][model8_pretrain.py][INFO] Epoch:[0/2](52100/4588595) loss:2.602 lr:0.0000905 epoch_Time:28497.0min: [2024-01-02 23:07:04,907][model8_pretrain.py][INFO] Epoch:[0/2](52100/4588595) loss:3.556 lr:0.0000905 epoch_Time:28497.0min: [2024-01-02 23:07:04,907][model8_pretrain.py][INFO] Epoch:[0/2](52100/4588595) loss:3.476 lr:0.0000905 epoch_Time:28497.0min: [2024-01-02 23:07:04,907][model8_pretrain.py][INFO] Epoch:[0/2](52100/4588595) loss:3.295 lr:0.0000905 epoch_Time:28497.0min: [2024-01-02 23:07:04,907][model8_pretrain.py][INFO] Epoch:[0/2](52100/4588595) loss:3.133 lr:0.0000905 epoch_Time:28497.0min: [2024-01-02 23:07:04,907][model8_pretrain.py][INFO] Epoch:[0/2](52100/4588595) loss:2.871 lr:0.0000905 epoch_Time:28497.0min: [2024-01-02 23:07:04,907][model8_pretrain.py][INFO] Epoch:[0/2](52100/4588595) loss:3.424 lr:0.0000905 epoch_Time:28497.0min: [2024-01-02 23:07:41,841][model8_pretrain.py][INFO] Epoch:[0/2](52200/4588595) loss:2.990 lr:0.0000899 epoch_Time:28496.0min: [2024-01-02 23:07:41,841][model8_pretrain.py][INFO] Epoch:[0/2](52200/4588595) loss:3.311 lr:0.0000899 epoch_Time:28496.0min: [2024-01-02 23:07:41,841][model8_pretrain.py][INFO] Epoch:[0/2](52200/4588595) 
loss:3.037 lr:0.0000899 epoch_Time:28496.0min: [2024-01-02 23:07:41,841][model8_pretrain.py][INFO] Epoch:[0/2](52200/4588595) loss:3.129 lr:0.0000899 epoch_Time:28496.0min: [2024-01-02 23:07:41,841][model8_pretrain.py][INFO] Epoch:[0/2](52200/4588595) loss:3.344 lr:0.0000899 epoch_Time:28496.0min: [2024-01-02 23:07:41,841][model8_pretrain.py][INFO] Epoch:[0/2](52200/4588595) loss:3.331 lr:0.0000899 epoch_Time:28496.0min: [2024-01-02 23:07:41,841][model8_pretrain.py][INFO] Epoch:[0/2](52200/4588595) loss:3.480 lr:0.0000899 epoch_Time:28496.0min: [2024-01-02 23:07:41,842][model8_pretrain.py][INFO] Epoch:[0/2](52200/4588595) loss:3.339 lr:0.0000899 epoch_Time:28496.0min: [2024-01-02 23:08:18,790][model8_pretrain.py][INFO] Epoch:[0/2](52300/4588595) loss:2.852 lr:0.0000894 epoch_Time:28494.0min: [2024-01-02 23:08:18,790][model8_pretrain.py][INFO] Epoch:[0/2](52300/4588595) loss:3.356 lr:0.0000894 epoch_Time:28494.0min: [2024-01-02 23:08:18,790][model8_pretrain.py][INFO] Epoch:[0/2](52300/4588595) loss:3.302 lr:0.0000894 epoch_Time:28494.0min: [2024-01-02 23:08:18,790][model8_pretrain.py][INFO] Epoch:[0/2](52300/4588595) loss:3.462 lr:0.0000894 epoch_Time:28494.0min: [2024-01-02 23:08:18,790][model8_pretrain.py][INFO] Epoch:[0/2](52300/4588595) loss:3.275 lr:0.0000894 epoch_Time:28494.0min: [2024-01-02 23:08:18,790][model8_pretrain.py][INFO] Epoch:[0/2](52300/4588595) loss:3.139 lr:0.0000894 epoch_Time:28494.0min: [2024-01-02 23:08:18,790][model8_pretrain.py][INFO] Epoch:[0/2](52300/4588595) loss:3.530 lr:0.0000894 epoch_Time:28494.0min: [2024-01-02 23:08:18,790][model8_pretrain.py][INFO] Epoch:[0/2](52300/4588595) loss:3.497 lr:0.0000894 epoch_Time:28494.0min: [2024-01-02 23:08:55,729][model8_pretrain.py][INFO] Epoch:[0/2](52400/4588595) loss:3.416 lr:0.0000889 epoch_Time:28492.0min: [2024-01-02 23:08:55,729][model8_pretrain.py][INFO] Epoch:[0/2](52400/4588595) loss:3.272 lr:0.0000889 epoch_Time:28492.0min: [2024-01-02 23:08:55,729][model8_pretrain.py][INFO] 
Epoch:[0/2](52400/4588595) loss:3.374 lr:0.0000889 epoch_Time:28492.0min: [2024-01-02 23:08:55,729][model8_pretrain.py][INFO] Epoch:[0/2](52400/4588595) loss:2.894 lr:0.0000889 epoch_Time:28492.0min: [2024-01-02 23:08:55,729][model8_pretrain.py][INFO] Epoch:[0/2](52400/4588595) loss:3.257 lr:0.0000889 epoch_Time:28492.0min: [2024-01-02 23:08:55,729][model8_pretrain.py][INFO] Epoch:[0/2](52400/4588595) loss:2.898 lr:0.0000889 epoch_Time:28492.0min: [2024-01-02 23:08:55,729][model8_pretrain.py][INFO] Epoch:[0/2](52400/4588595) loss:2.690 lr:0.0000889 epoch_Time:28492.0min: [2024-01-02 23:08:55,729][model8_pretrain.py][INFO] Epoch:[0/2](52400/4588595) loss:2.837 lr:0.0000889 epoch_Time:28492.0min: [2024-01-02 23:09:32,662][model8_pretrain.py][INFO] Epoch:[0/2](52500/4588595) loss:3.151 lr:0.0000884 epoch_Time:28491.0min: [2024-01-02 23:09:32,662][model8_pretrain.py][INFO] Epoch:[0/2](52500/4588595) loss:2.721 lr:0.0000884 epoch_Time:28491.0min: [2024-01-02 23:09:32,662][model8_pretrain.py][INFO] Epoch:[0/2](52500/4588595) loss:3.381 lr:0.0000884 epoch_Time:28491.0min: [2024-01-02 23:09:32,662][model8_pretrain.py][INFO] Epoch:[0/2](52500/4588595) loss:3.203 lr:0.0000884 epoch_Time:28491.0min: [2024-01-02 23:09:32,662][model8_pretrain.py][INFO] Epoch:[0/2](52500/4588595) loss:2.651 lr:0.0000884 epoch_Time:28491.0min: [2024-01-02 23:09:32,662][model8_pretrain.py][INFO] Epoch:[0/2](52500/4588595) loss:3.005 lr:0.0000884 epoch_Time:28491.0min: [2024-01-02 23:09:32,662][model8_pretrain.py][INFO] Epoch:[0/2](52500/4588595) loss:3.295 lr:0.0000884 epoch_Time:28491.0min: [2024-01-02 23:09:32,663][model8_pretrain.py][INFO] Epoch:[0/2](52500/4588595) loss:2.864 lr:0.0000884 epoch_Time:28491.0min: [2024-01-02 23:10:09,608][model8_pretrain.py][INFO] Epoch:[0/2](52600/4588595) loss:3.157 lr:0.0000879 epoch_Time:28489.0min: [2024-01-02 23:10:09,608][model8_pretrain.py][INFO] Epoch:[0/2](52600/4588595) loss:3.233 lr:0.0000879 epoch_Time:28489.0min: [2024-01-02 
23:10:09,608][model8_pretrain.py][INFO] Epoch:[0/2](52600/4588595) loss:3.298 lr:0.0000879 epoch_Time:28489.0min: [2024-01-02 23:10:09,608][model8_pretrain.py][INFO] Epoch:[0/2](52600/4588595) loss:3.177 lr:0.0000879 epoch_Time:28489.0min: [2024-01-02 23:10:09,608][model8_pretrain.py][INFO] Epoch:[0/2](52600/4588595) loss:2.837 lr:0.0000879 epoch_Time:28489.0min: [2024-01-02 23:10:09,608][model8_pretrain.py][INFO] Epoch:[0/2](52600/4588595) loss:3.346 lr:0.0000879 epoch_Time:28489.0min: [2024-01-02 23:10:09,608][model8_pretrain.py][INFO] Epoch:[0/2](52600/4588595) loss:2.497 lr:0.0000879 epoch_Time:28489.0min: [2024-01-02 23:10:09,608][model8_pretrain.py][INFO] Epoch:[0/2](52600/4588595) loss:3.388 lr:0.0000879 epoch_Time:28489.0min: [2024-01-02 23:10:53,628][model8_pretrain.py][INFO] Epoch:[0/2](52700/4588595) loss:2.979 lr:0.0000874 epoch_Time:28497.0min: [2024-01-02 23:10:53,628][model8_pretrain.py][INFO] Epoch:[0/2](52700/4588595) loss:2.231 lr:0.0000874 epoch_Time:28497.0min: [2024-01-02 23:10:53,628][model8_pretrain.py][INFO] Epoch:[0/2](52700/4588595) loss:3.379 lr:0.0000874 epoch_Time:28497.0min: [2024-01-02 23:10:53,628][model8_pretrain.py][INFO] Epoch:[0/2](52700/4588595) loss:2.342 lr:0.0000874 epoch_Time:28497.0min: [2024-01-02 23:10:53,628][model8_pretrain.py][INFO] Epoch:[0/2](52700/4588595) loss:3.298 lr:0.0000874 epoch_Time:28497.0min: [2024-01-02 23:10:53,628][model8_pretrain.py][INFO] Epoch:[0/2](52700/4588595) loss:3.332 lr:0.0000874 epoch_Time:28497.0min: [2024-01-02 23:10:53,628][model8_pretrain.py][INFO] Epoch:[0/2](52700/4588595) loss:3.013 lr:0.0000874 epoch_Time:28497.0min: [2024-01-02 23:10:53,628][model8_pretrain.py][INFO] Epoch:[0/2](52700/4588595) loss:3.295 lr:0.0000874 epoch_Time:28497.0min: [2024-01-02 23:11:30,566][model8_pretrain.py][INFO] Epoch:[0/2](52800/4588595) loss:3.564 lr:0.0000869 epoch_Time:28496.0min: [2024-01-02 23:11:30,566][model8_pretrain.py][INFO] Epoch:[0/2](52800/4588595) loss:3.059 lr:0.0000869 
epoch_Time:28496.0min: [2024-01-02 23:11:30,566][model8_pretrain.py][INFO] Epoch:[0/2](52800/4588595) loss:3.429 lr:0.0000869 epoch_Time:28496.0min: [2024-01-02 23:11:30,566][model8_pretrain.py][INFO] Epoch:[0/2](52800/4588595) loss:3.192 lr:0.0000869 epoch_Time:28496.0min: [2024-01-02 23:11:30,566][model8_pretrain.py][INFO] Epoch:[0/2](52800/4588595) loss:3.204 lr:0.0000869 epoch_Time:28496.0min: [2024-01-02 23:11:30,566][model8_pretrain.py][INFO] Epoch:[0/2](52800/4588595) loss:3.543 lr:0.0000869 epoch_Time:28496.0min: [2024-01-02 23:11:30,566][model8_pretrain.py][INFO] Epoch:[0/2](52800/4588595) loss:3.000 lr:0.0000869 epoch_Time:28496.0min: [2024-01-02 23:11:30,566][model8_pretrain.py][INFO] Epoch:[0/2](52800/4588595) loss:3.111 lr:0.0000869 epoch_Time:28496.0min: [2024-01-02 23:12:07,522][model8_pretrain.py][INFO] Epoch:[0/2](52900/4588595) loss:2.969 lr:0.0000864 epoch_Time:28494.0min: [2024-01-02 23:12:07,522][model8_pretrain.py][INFO] Epoch:[0/2](52900/4588595) loss:3.565 lr:0.0000864 epoch_Time:28494.0min: [2024-01-02 23:12:07,522][model8_pretrain.py][INFO] Epoch:[0/2](52900/4588595) loss:3.179 lr:0.0000864 epoch_Time:28494.0min: [2024-01-02 23:12:07,522][model8_pretrain.py][INFO] Epoch:[0/2](52900/4588595) loss:3.502 lr:0.0000864 epoch_Time:28494.0min: [2024-01-02 23:12:07,522][model8_pretrain.py][INFO] Epoch:[0/2](52900/4588595) loss:3.301 lr:0.0000864 epoch_Time:28494.0min: [2024-01-02 23:12:07,522][model8_pretrain.py][INFO] Epoch:[0/2](52900/4588595) loss:3.510 lr:0.0000864 epoch_Time:28494.0min: [2024-01-02 23:12:07,522][model8_pretrain.py][INFO] Epoch:[0/2](52900/4588595) loss:3.149 lr:0.0000864 epoch_Time:28494.0min: [2024-01-02 23:12:07,522][model8_pretrain.py][INFO] Epoch:[0/2](52900/4588595) loss:2.942 lr:0.0000864 epoch_Time:28494.0min: [2024-01-02 23:12:44,465][model8_pretrain.py][INFO] Epoch:[0/2](53000/4588595) loss:3.234 lr:0.0000859 epoch_Time:28493.0min: [2024-01-02 23:12:44,465][model8_pretrain.py][INFO] Epoch:[0/2](53000/4588595) 
loss:3.140 lr:0.0000859 epoch_Time:28493.0min: [2024-01-02 23:12:44,465][model8_pretrain.py][INFO] Epoch:[0/2](53000/4588595) loss:3.447 lr:0.0000859 epoch_Time:28493.0min: [2024-01-02 23:12:44,465][model8_pretrain.py][INFO] Epoch:[0/2](53000/4588595) loss:3.253 lr:0.0000859 epoch_Time:28493.0min: [2024-01-02 23:12:44,465][model8_pretrain.py][INFO] Epoch:[0/2](53000/4588595) loss:3.751 lr:0.0000859 epoch_Time:28493.0min: [2024-01-02 23:12:44,465][model8_pretrain.py][INFO] Epoch:[0/2](53000/4588595) loss:3.344 lr:0.0000859 epoch_Time:28493.0min: [2024-01-02 23:12:44,466][model8_pretrain.py][INFO] Epoch:[0/2](53000/4588595) loss:3.077 lr:0.0000859 epoch_Time:28493.0min: [2024-01-02 23:12:44,466][model8_pretrain.py][INFO] Epoch:[0/2](53000/4588595) loss:3.629 lr:0.0000859 epoch_Time:28493.0min: [2024-01-02 23:13:21,414][model8_pretrain.py][INFO] Epoch:[0/2](53100/4588595) loss:2.924 lr:0.0000853 epoch_Time:28491.0min: [2024-01-02 23:13:21,414][model8_pretrain.py][INFO] Epoch:[0/2](53100/4588595) loss:2.354 lr:0.0000853 epoch_Time:28491.0min: [2024-01-02 23:13:21,414][model8_pretrain.py][INFO] Epoch:[0/2](53100/4588595) loss:3.202 lr:0.0000853 epoch_Time:28491.0min: [2024-01-02 23:13:21,414][model8_pretrain.py][INFO] Epoch:[0/2](53100/4588595) loss:3.363 lr:0.0000853 epoch_Time:28491.0min: [2024-01-02 23:13:21,414][model8_pretrain.py][INFO] Epoch:[0/2](53100/4588595) loss:3.589 lr:0.0000853 epoch_Time:28491.0min: [2024-01-02 23:13:21,414][model8_pretrain.py][INFO] Epoch:[0/2](53100/4588595) loss:3.008 lr:0.0000853 epoch_Time:28491.0min: [2024-01-02 23:13:21,414][model8_pretrain.py][INFO] Epoch:[0/2](53100/4588595) loss:3.011 lr:0.0000853 epoch_Time:28491.0min: [2024-01-02 23:13:21,414][model8_pretrain.py][INFO] Epoch:[0/2](53100/4588595) loss:3.448 lr:0.0000853 epoch_Time:28491.0min: [2024-01-02 23:13:58,355][model8_pretrain.py][INFO] Epoch:[0/2](53200/4588595) loss:3.337 lr:0.0000848 epoch_Time:28488.0min: [2024-01-02 23:13:58,355][model8_pretrain.py][INFO] 
Epoch:[0/2](53200/4588595) loss:3.161 lr:0.0000848 epoch_Time:28489.0min: [2024-01-02 23:13:58,355][model8_pretrain.py][INFO] Epoch:[0/2](53200/4588595) loss:2.790 lr:0.0000848 epoch_Time:28488.0min: [2024-01-02 23:13:58,355][model8_pretrain.py][INFO] Epoch:[0/2](53200/4588595) loss:3.476 lr:0.0000848 epoch_Time:28489.0min: [2024-01-02 23:13:58,355][model8_pretrain.py][INFO] Epoch:[0/2](53200/4588595) loss:3.233 lr:0.0000848 epoch_Time:28489.0min: [2024-01-02 23:13:58,355][model8_pretrain.py][INFO] Epoch:[0/2](53200/4588595) loss:3.250 lr:0.0000848 epoch_Time:28489.0min: [2024-01-02 23:13:58,355][model8_pretrain.py][INFO] Epoch:[0/2](53200/4588595) loss:3.297 lr:0.0000848 epoch_Time:28489.0min: [2024-01-02 23:13:58,355][model8_pretrain.py][INFO] Epoch:[0/2](53200/4588595) loss:3.235 lr:0.0000848 epoch_Time:28489.0min: [2024-01-02 23:14:35,293][model8_pretrain.py][INFO] Epoch:[0/2](53300/4588595) loss:3.244 lr:0.0000843 epoch_Time:28487.0min: [2024-01-02 23:14:35,293][model8_pretrain.py][INFO] Epoch:[0/2](53300/4588595) loss:3.539 lr:0.0000843 epoch_Time:28487.0min: [2024-01-02 23:14:35,293][model8_pretrain.py][INFO] Epoch:[0/2](53300/4588595) loss:3.051 lr:0.0000843 epoch_Time:28487.0min: [2024-01-02 23:14:35,293][model8_pretrain.py][INFO] Epoch:[0/2](53300/4588595) loss:3.274 lr:0.0000843 epoch_Time:28487.0min: [2024-01-02 23:14:35,293][model8_pretrain.py][INFO] Epoch:[0/2](53300/4588595) loss:3.024 lr:0.0000843 epoch_Time:28487.0min: [2024-01-02 23:14:35,293][model8_pretrain.py][INFO] Epoch:[0/2](53300/4588595) loss:3.265 lr:0.0000843 epoch_Time:28487.0min: [2024-01-02 23:14:35,293][model8_pretrain.py][INFO] Epoch:[0/2](53300/4588595) loss:2.980 lr:0.0000843 epoch_Time:28487.0min: [2024-01-02 23:14:35,293][model8_pretrain.py][INFO] Epoch:[0/2](53300/4588595) loss:3.138 lr:0.0000843 epoch_Time:28487.0min: [2024-01-02 23:15:12,228][model8_pretrain.py][INFO] Epoch:[0/2](53400/4588595) loss:3.015 lr:0.0000838 epoch_Time:28485.0min: [2024-01-02 
23:15:12,228][model8_pretrain.py][INFO] Epoch:[0/2](53400/4588595) loss:3.261 lr:0.0000838 epoch_Time:28485.0min: [2024-01-02 23:15:12,228][model8_pretrain.py][INFO] Epoch:[0/2](53400/4588595) loss:3.221 lr:0.0000838 epoch_Time:28485.0min: [2024-01-02 23:15:12,228][model8_pretrain.py][INFO] Epoch:[0/2](53400/4588595) loss:2.531 lr:0.0000838 epoch_Time:28485.0min: [2024-01-02 23:15:12,228][model8_pretrain.py][INFO] Epoch:[0/2](53400/4588595) loss:3.351 lr:0.0000838 epoch_Time:28485.0min: [2024-01-02 23:15:12,229][model8_pretrain.py][INFO] Epoch:[0/2](53400/4588595) loss:3.170 lr:0.0000838 epoch_Time:28485.0min: [2024-01-02 23:15:12,230][model8_pretrain.py][INFO] Epoch:[0/2](53400/4588595) loss:3.103 lr:0.0000838 epoch_Time:28485.0min: [2024-01-02 23:15:12,230][model8_pretrain.py][INFO] Epoch:[0/2](53400/4588595) loss:3.137 lr:0.0000838 epoch_Time:28485.0min: [2024-01-02 23:15:56,700][model8_pretrain.py][INFO] Epoch:[0/2](53500/4588595) loss:3.070 lr:0.0000833 epoch_Time:28494.0min: [2024-01-02 23:15:56,700][model8_pretrain.py][INFO] Epoch:[0/2](53500/4588595) loss:2.869 lr:0.0000833 epoch_Time:28494.0min: [2024-01-02 23:15:56,700][model8_pretrain.py][INFO] Epoch:[0/2](53500/4588595) loss:3.369 lr:0.0000833 epoch_Time:28494.0min: [2024-01-02 23:15:56,700][model8_pretrain.py][INFO] Epoch:[0/2](53500/4588595) loss:3.508 lr:0.0000833 epoch_Time:28494.0min: [2024-01-02 23:15:56,700][model8_pretrain.py][INFO] Epoch:[0/2](53500/4588595) loss:3.007 lr:0.0000833 epoch_Time:28494.0min: [2024-01-02 23:15:56,700][model8_pretrain.py][INFO] Epoch:[0/2](53500/4588595) loss:2.986 lr:0.0000833 epoch_Time:28494.0min: [2024-01-02 23:15:56,700][model8_pretrain.py][INFO] Epoch:[0/2](53500/4588595) loss:3.554 lr:0.0000833 epoch_Time:28494.0min: [2024-01-02 23:15:56,701][model8_pretrain.py][INFO] Epoch:[0/2](53500/4588595) loss:3.521 lr:0.0000833 epoch_Time:28494.0min: [2024-01-02 23:16:33,640][model8_pretrain.py][INFO] Epoch:[0/2](53600/4588595) loss:3.305 lr:0.0000828 
epoch_Time:28493.0min: [2024-01-02 23:16:33,640][model8_pretrain.py][INFO] Epoch:[0/2](53600/4588595) loss:3.190 lr:0.0000828 epoch_Time:28493.0min: [2024-01-02 23:16:33,640][model8_pretrain.py][INFO] Epoch:[0/2](53600/4588595) loss:2.954 lr:0.0000828 epoch_Time:28493.0min: [2024-01-02 23:16:33,640][model8_pretrain.py][INFO] Epoch:[0/2](53600/4588595) loss:2.723 lr:0.0000828 epoch_Time:28493.0min: [2024-01-02 23:16:33,640][model8_pretrain.py][INFO] Epoch:[0/2](53600/4588595) loss:3.393 lr:0.0000828 epoch_Time:28493.0min: [2024-01-02 23:16:33,640][model8_pretrain.py][INFO] Epoch:[0/2](53600/4588595) loss:3.201 lr:0.0000828 epoch_Time:28493.0min: [2024-01-02 23:16:33,640][model8_pretrain.py][INFO] Epoch:[0/2](53600/4588595) loss:3.060 lr:0.0000828 epoch_Time:28493.0min: [2024-01-02 23:16:33,640][model8_pretrain.py][INFO] Epoch:[0/2](53600/4588595) loss:2.852 lr:0.0000828 epoch_Time:28493.0min: [2024-01-02 23:17:10,579][model8_pretrain.py][INFO] Epoch:[0/2](53700/4588595) loss:3.190 lr:0.0000823 epoch_Time:28491.0min: [2024-01-02 23:17:10,579][model8_pretrain.py][INFO] Epoch:[0/2](53700/4588595) loss:3.565 lr:0.0000823 epoch_Time:28491.0min: [2024-01-02 23:17:10,579][model8_pretrain.py][INFO] Epoch:[0/2](53700/4588595) loss:2.727 lr:0.0000823 epoch_Time:28491.0min: [2024-01-02 23:17:10,579][model8_pretrain.py][INFO] Epoch:[0/2](53700/4588595) loss:3.489 lr:0.0000823 epoch_Time:28491.0min: [2024-01-02 23:17:10,579][model8_pretrain.py][INFO] Epoch:[0/2](53700/4588595) loss:3.536 lr:0.0000823 epoch_Time:28491.0min: [2024-01-02 23:17:10,579][model8_pretrain.py][INFO] Epoch:[0/2](53700/4588595) loss:3.219 lr:0.0000823 epoch_Time:28491.0min: [2024-01-02 23:17:10,579][model8_pretrain.py][INFO] Epoch:[0/2](53700/4588595) loss:3.202 lr:0.0000823 epoch_Time:28491.0min: [2024-01-02 23:17:10,579][model8_pretrain.py][INFO] Epoch:[0/2](53700/4588595) loss:2.640 lr:0.0000823 epoch_Time:28491.0min: [2024-01-02 23:17:47,534][model8_pretrain.py][INFO] Epoch:[0/2](53800/4588595) 
loss:3.469 lr:0.0000818 epoch_Time:28490.0min: [2024-01-02 23:17:47,534][model8_pretrain.py][INFO] Epoch:[0/2](53800/4588595) loss:2.984 lr:0.0000818 epoch_Time:28490.0min: [2024-01-02 23:17:47,534][model8_pretrain.py][INFO] Epoch:[0/2](53800/4588595) loss:2.905 lr:0.0000818 epoch_Time:28490.0min: [2024-01-02 23:17:47,534][model8_pretrain.py][INFO] Epoch:[0/2](53800/4588595) loss:2.932 lr:0.0000818 epoch_Time:28490.0min: [2024-01-02 23:17:47,534][model8_pretrain.py][INFO] Epoch:[0/2](53800/4588595) loss:3.213 lr:0.0000818 epoch_Time:28490.0min: [2024-01-02 23:17:47,534][model8_pretrain.py][INFO] Epoch:[0/2](53800/4588595) loss:3.368 lr:0.0000818 epoch_Time:28490.0min: [2024-01-02 23:17:47,534][model8_pretrain.py][INFO] Epoch:[0/2](53800/4588595) loss:3.290 lr:0.0000818 epoch_Time:28490.0min: [2024-01-02 23:17:47,534][model8_pretrain.py][INFO] Epoch:[0/2](53800/4588595) loss:3.284 lr:0.0000818 epoch_Time:28490.0min: [2024-01-02 23:18:24,487][model8_pretrain.py][INFO] Epoch:[0/2](53900/4588595) loss:3.342 lr:0.0000813 epoch_Time:28488.0min: [2024-01-02 23:18:24,487][model8_pretrain.py][INFO] Epoch:[0/2](53900/4588595) loss:2.889 lr:0.0000813 epoch_Time:28488.0min: [2024-01-02 23:18:24,487][model8_pretrain.py][INFO] Epoch:[0/2](53900/4588595) loss:3.153 lr:0.0000813 epoch_Time:28488.0min: [2024-01-02 23:18:24,487][model8_pretrain.py][INFO] Epoch:[0/2](53900/4588595) loss:3.327 lr:0.0000813 epoch_Time:28488.0min: [2024-01-02 23:18:24,487][model8_pretrain.py][INFO] Epoch:[0/2](53900/4588595) loss:2.907 lr:0.0000813 epoch_Time:28488.0min: [2024-01-02 23:18:24,487][model8_pretrain.py][INFO] Epoch:[0/2](53900/4588595) loss:2.986 lr:0.0000813 epoch_Time:28488.0min: [2024-01-02 23:18:24,487][model8_pretrain.py][INFO] Epoch:[0/2](53900/4588595) loss:3.175 lr:0.0000813 epoch_Time:28488.0min: [2024-01-02 23:18:24,487][model8_pretrain.py][INFO] Epoch:[0/2](53900/4588595) loss:3.440 lr:0.0000813 epoch_Time:28488.0min: [2024-01-02 23:19:01,413][model8_pretrain.py][INFO] 
Epoch:[0/2](54000/4588595) loss:2.854 lr:0.0000808 epoch_Time:28486.0min: [2024-01-02 23:19:01,413][model8_pretrain.py][INFO] Epoch:[0/2](54000/4588595) loss:3.019 lr:0.0000808 epoch_Time:28486.0min: [2024-01-02 23:19:01,413][model8_pretrain.py][INFO] Epoch:[0/2](54000/4588595) loss:3.381 lr:0.0000808 epoch_Time:28486.0min: [2024-01-02 23:19:01,413][model8_pretrain.py][INFO] Epoch:[0/2](54000/4588595) loss:3.235 lr:0.0000808 epoch_Time:28486.0min: [2024-01-02 23:19:01,413][model8_pretrain.py][INFO] Epoch:[0/2](54000/4588595) loss:2.951 lr:0.0000808 epoch_Time:28486.0min: [2024-01-02 23:19:01,413][model8_pretrain.py][INFO] Epoch:[0/2](54000/4588595) loss:3.496 lr:0.0000808 epoch_Time:28486.0min: [2024-01-02 23:19:01,413][model8_pretrain.py][INFO] Epoch:[0/2](54000/4588595) loss:3.444 lr:0.0000808 epoch_Time:28486.0min: [2024-01-02 23:19:01,413][model8_pretrain.py][INFO] Epoch:[0/2](54000/4588595) loss:3.401 lr:0.0000808 epoch_Time:28486.0min: [2024-01-02 23:19:38,372][model8_pretrain.py][INFO] Epoch:[0/2](54100/4588595) loss:3.406 lr:0.0000803 epoch_Time:28485.0min: [2024-01-02 23:19:38,372][model8_pretrain.py][INFO] Epoch:[0/2](54100/4588595) loss:2.811 lr:0.0000803 epoch_Time:28485.0min: [2024-01-02 23:19:38,372][model8_pretrain.py][INFO] Epoch:[0/2](54100/4588595) loss:3.072 lr:0.0000803 epoch_Time:28485.0min: [2024-01-02 23:19:38,372][model8_pretrain.py][INFO] Epoch:[0/2](54100/4588595) loss:3.075 lr:0.0000803 epoch_Time:28485.0min: [2024-01-02 23:19:38,372][model8_pretrain.py][INFO] Epoch:[0/2](54100/4588595) loss:3.443 lr:0.0000803 epoch_Time:28485.0min: [2024-01-02 23:19:38,372][model8_pretrain.py][INFO] Epoch:[0/2](54100/4588595) loss:3.011 lr:0.0000803 epoch_Time:28485.0min: [2024-01-02 23:19:38,373][model8_pretrain.py][INFO] Epoch:[0/2](54100/4588595) loss:3.266 lr:0.0000803 epoch_Time:28485.0min: [2024-01-02 23:19:38,374][model8_pretrain.py][INFO] Epoch:[0/2](54100/4588595) loss:2.910 lr:0.0000803 epoch_Time:28485.0min: [2024-01-02 
23:20:15,305][model8_pretrain.py][INFO] Epoch:[0/2](54200/4588595) loss:3.222 lr:0.0000799 epoch_Time:28483.0min: [2024-01-02 23:20:15,305][model8_pretrain.py][INFO] Epoch:[0/2](54200/4588595) loss:2.707 lr:0.0000799 epoch_Time:28483.0min: [2024-01-02 23:20:15,305][model8_pretrain.py][INFO] Epoch:[0/2](54200/4588595) loss:2.766 lr:0.0000799 epoch_Time:28483.0min: [2024-01-02 23:20:15,305][model8_pretrain.py][INFO] Epoch:[0/2](54200/4588595) loss:3.477 lr:0.0000799 epoch_Time:28483.0min: [2024-01-02 23:20:15,305][model8_pretrain.py][INFO] Epoch:[0/2](54200/4588595) loss:2.913 lr:0.0000799 epoch_Time:28483.0min: [2024-01-02 23:20:15,305][model8_pretrain.py][INFO] Epoch:[0/2](54200/4588595) loss:3.720 lr:0.0000799 epoch_Time:28483.0min: [2024-01-02 23:20:15,305][model8_pretrain.py][INFO] Epoch:[0/2](54200/4588595) loss:3.034 lr:0.0000799 epoch_Time:28483.0min: [2024-01-02 23:20:15,305][model8_pretrain.py][INFO] Epoch:[0/2](54200/4588595) loss:3.429 lr:0.0000799 epoch_Time:28483.0min: [2024-01-02 23:20:59,351][model8_pretrain.py][INFO] Epoch:[0/2](54300/4588595) loss:3.174 lr:0.0000794 epoch_Time:28491.0min: [2024-01-02 23:20:59,351][model8_pretrain.py][INFO] Epoch:[0/2](54300/4588595) loss:3.201 lr:0.0000794 epoch_Time:28491.0min: [2024-01-02 23:20:59,351][model8_pretrain.py][INFO] Epoch:[0/2](54300/4588595) loss:2.908 lr:0.0000794 epoch_Time:28491.0min: [2024-01-02 23:20:59,351][model8_pretrain.py][INFO] Epoch:[0/2](54300/4588595) loss:3.116 lr:0.0000794 epoch_Time:28491.0min: [2024-01-02 23:20:59,351][model8_pretrain.py][INFO] Epoch:[0/2](54300/4588595) loss:3.186 lr:0.0000794 epoch_Time:28491.0min: [2024-01-02 23:20:59,351][model8_pretrain.py][INFO] Epoch:[0/2](54300/4588595) loss:3.360 lr:0.0000794 epoch_Time:28491.0min: [2024-01-02 23:20:59,351][model8_pretrain.py][INFO] Epoch:[0/2](54300/4588595) loss:2.881 lr:0.0000794 epoch_Time:28491.0min: [2024-01-02 23:20:59,351][model8_pretrain.py][INFO] Epoch:[0/2](54300/4588595) loss:2.867 lr:0.0000794 
epoch_Time:28491.0min: [2024-01-02 23:21:36,287][model8_pretrain.py][INFO] Epoch:[0/2](54400/4588595) loss:2.872 lr:0.0000789 epoch_Time:28489.0min: [2024-01-02 23:21:36,287][model8_pretrain.py][INFO] Epoch:[0/2](54400/4588595) loss:2.277 lr:0.0000789 epoch_Time:28490.0min: [2024-01-02 23:21:36,287][model8_pretrain.py][INFO] Epoch:[0/2](54400/4588595) loss:3.272 lr:0.0000789 epoch_Time:28489.0min: [2024-01-02 23:21:36,287][model8_pretrain.py][INFO] Epoch:[0/2](54400/4588595) loss:3.033 lr:0.0000789 epoch_Time:28490.0min: [2024-01-02 23:21:36,287][model8_pretrain.py][INFO] Epoch:[0/2](54400/4588595) loss:3.223 lr:0.0000789 epoch_Time:28489.0min: [2024-01-02 23:21:36,287][model8_pretrain.py][INFO] Epoch:[0/2](54400/4588595) loss:3.003 lr:0.0000789 epoch_Time:28489.0min: [2024-01-02 23:21:36,287][model8_pretrain.py][INFO] Epoch:[0/2](54400/4588595) loss:2.648 lr:0.0000789 epoch_Time:28489.0min: [2024-01-02 23:21:36,287][model8_pretrain.py][INFO] Epoch:[0/2](54400/4588595) loss:2.861 lr:0.0000789 epoch_Time:28489.0min: [2024-01-02 23:22:13,235][model8_pretrain.py][INFO] Epoch:[0/2](54500/4588595) loss:3.237 lr:0.0000784 epoch_Time:28487.0min: [2024-01-02 23:22:13,235][model8_pretrain.py][INFO] Epoch:[0/2](54500/4588595) loss:3.434 lr:0.0000784 epoch_Time:28487.0min: [2024-01-02 23:22:13,235][model8_pretrain.py][INFO] Epoch:[0/2](54500/4588595) loss:2.820 lr:0.0000784 epoch_Time:28487.0min: [2024-01-02 23:22:13,235][model8_pretrain.py][INFO] Epoch:[0/2](54500/4588595) loss:3.637 lr:0.0000784 epoch_Time:28487.0min: [2024-01-02 23:22:13,235][model8_pretrain.py][INFO] Epoch:[0/2](54500/4588595) loss:3.603 lr:0.0000784 epoch_Time:28487.0min: [2024-01-02 23:22:13,235][model8_pretrain.py][INFO] Epoch:[0/2](54500/4588595) loss:3.316 lr:0.0000784 epoch_Time:28487.0min: [2024-01-02 23:22:13,235][model8_pretrain.py][INFO] Epoch:[0/2](54500/4588595) loss:3.578 lr:0.0000784 epoch_Time:28487.0min: [2024-01-02 23:22:13,235][model8_pretrain.py][INFO] Epoch:[0/2](54500/4588595) 
loss:3.473 lr:0.0000784 epoch_Time:28487.0min: [2024-01-02 23:22:50,178][model8_pretrain.py][INFO] Epoch:[0/2](54600/4588595) loss:2.969 lr:0.0000779 epoch_Time:28485.0min: [2024-01-02 23:22:50,178][model8_pretrain.py][INFO] Epoch:[0/2](54600/4588595) loss:3.314 lr:0.0000779 epoch_Time:28485.0min: [2024-01-02 23:22:50,178][model8_pretrain.py][INFO] Epoch:[0/2](54600/4588595) loss:3.243 lr:0.0000779 epoch_Time:28485.0min: [2024-01-02 23:22:50,178][model8_pretrain.py][INFO] Epoch:[0/2](54600/4588595) loss:2.883 lr:0.0000779 epoch_Time:28485.0min: [2024-01-02 23:22:50,178][model8_pretrain.py][INFO] Epoch:[0/2](54600/4588595) loss:3.090 lr:0.0000779 epoch_Time:28485.0min: [2024-01-02 23:22:50,178][model8_pretrain.py][INFO] Epoch:[0/2](54600/4588595) loss:2.877 lr:0.0000779 epoch_Time:28485.0min: [2024-01-02 23:22:50,178][model8_pretrain.py][INFO] Epoch:[0/2](54600/4588595) loss:3.333 lr:0.0000779 epoch_Time:28485.0min: [2024-01-02 23:22:50,178][model8_pretrain.py][INFO] Epoch:[0/2](54600/4588595) loss:2.741 lr:0.0000779 epoch_Time:28485.0min: [2024-01-02 23:23:27,113][model8_pretrain.py][INFO] Epoch:[0/2](54700/4588595) loss:2.682 lr:0.0000774 epoch_Time:28484.0min: [2024-01-02 23:23:27,113][model8_pretrain.py][INFO] Epoch:[0/2](54700/4588595) loss:3.122 lr:0.0000774 epoch_Time:28484.0min: [2024-01-02 23:23:27,113][model8_pretrain.py][INFO] Epoch:[0/2](54700/4588595) loss:3.533 lr:0.0000774 epoch_Time:28484.0min: [2024-01-02 23:23:27,113][model8_pretrain.py][INFO] Epoch:[0/2](54700/4588595) loss:3.723 lr:0.0000774 epoch_Time:28484.0min: [2024-01-02 23:23:27,113][model8_pretrain.py][INFO] Epoch:[0/2](54700/4588595) loss:3.145 lr:0.0000774 epoch_Time:28484.0min: [2024-01-02 23:23:27,113][model8_pretrain.py][INFO] Epoch:[0/2](54700/4588595) loss:3.422 lr:0.0000774 epoch_Time:28484.0min: [2024-01-02 23:23:27,113][model8_pretrain.py][INFO] Epoch:[0/2](54700/4588595) loss:3.039 lr:0.0000774 epoch_Time:28484.0min: [2024-01-02 23:23:27,114][model8_pretrain.py][INFO] 
Epoch:[0/2](54700/4588595) loss:3.074 lr:0.0000774 epoch_Time:28484.0min: [2024-01-02 23:24:04,059][model8_pretrain.py][INFO] Epoch:[0/2](54800/4588595) loss:2.481 lr:0.0000769 epoch_Time:28482.0min: [2024-01-02 23:24:04,059][model8_pretrain.py][INFO] Epoch:[0/2](54800/4588595) loss:2.628 lr:0.0000769 epoch_Time:28482.0min: [2024-01-02 23:24:04,059][model8_pretrain.py][INFO] Epoch:[0/2](54800/4588595) loss:3.372 lr:0.0000769 epoch_Time:28482.0min: [2024-01-02 23:24:04,059][model8_pretrain.py][INFO] Epoch:[0/2](54800/4588595) loss:3.510 lr:0.0000769 epoch_Time:28482.0min: [2024-01-02 23:24:04,059][model8_pretrain.py][INFO] Epoch:[0/2](54800/4588595) loss:3.210 lr:0.0000769 epoch_Time:28482.0min: [2024-01-02 23:24:04,059][model8_pretrain.py][INFO] Epoch:[0/2](54800/4588595) loss:3.459 lr:0.0000769 epoch_Time:28482.0min: [2024-01-02 23:24:04,059][model8_pretrain.py][INFO] Epoch:[0/2](54800/4588595) loss:3.420 lr:0.0000769 epoch_Time:28482.0min: [2024-01-02 23:24:04,059][model8_pretrain.py][INFO] Epoch:[0/2](54800/4588595) loss:2.452 lr:0.0000769 epoch_Time:28482.0min: [2024-01-02 23:24:41,005][model8_pretrain.py][INFO] Epoch:[0/2](54900/4588595) loss:3.534 lr:0.0000764 epoch_Time:28481.0min: [2024-01-02 23:24:41,005][model8_pretrain.py][INFO] Epoch:[0/2](54900/4588595) loss:3.105 lr:0.0000764 epoch_Time:28481.0min: [2024-01-02 23:24:41,005][model8_pretrain.py][INFO] Epoch:[0/2](54900/4588595) loss:2.571 lr:0.0000764 epoch_Time:28481.0min: [2024-01-02 23:24:41,005][model8_pretrain.py][INFO] Epoch:[0/2](54900/4588595) loss:3.085 lr:0.0000764 epoch_Time:28481.0min: [2024-01-02 23:24:41,005][model8_pretrain.py][INFO] Epoch:[0/2](54900/4588595) loss:3.010 lr:0.0000764 epoch_Time:28481.0min: [2024-01-02 23:24:41,005][model8_pretrain.py][INFO] Epoch:[0/2](54900/4588595) loss:2.627 lr:0.0000764 epoch_Time:28481.0min: [2024-01-02 23:24:41,006][model8_pretrain.py][INFO] Epoch:[0/2](54900/4588595) loss:2.987 lr:0.0000764 epoch_Time:28481.0min: [2024-01-02 
23:24:41,006][model8_pretrain.py][INFO] Epoch:[0/2](54900/4588595) loss:3.134 lr:0.0000764 epoch_Time:28481.0min: [2024-01-02 23:25:17,959][model8_pretrain.py][INFO] Epoch:[0/2](55000/4588595) loss:2.813 lr:0.0000759 epoch_Time:28479.0min: [2024-01-02 23:25:17,959][model8_pretrain.py][INFO] Epoch:[0/2](55000/4588595) loss:3.435 lr:0.0000759 epoch_Time:28479.0min: [2024-01-02 23:25:17,959][model8_pretrain.py][INFO] Epoch:[0/2](55000/4588595) loss:3.179 lr:0.0000759 epoch_Time:28479.0min: [2024-01-02 23:25:17,959][model8_pretrain.py][INFO] Epoch:[0/2](55000/4588595) loss:2.903 lr:0.0000759 epoch_Time:28479.0min: [2024-01-02 23:25:17,959][model8_pretrain.py][INFO] Epoch:[0/2](55000/4588595) loss:3.439 lr:0.0000759 epoch_Time:28479.0min: [2024-01-02 23:25:17,959][model8_pretrain.py][INFO] Epoch:[0/2](55000/4588595) loss:3.142 lr:0.0000759 epoch_Time:28479.0min: [2024-01-02 23:25:17,959][model8_pretrain.py][INFO] Epoch:[0/2](55000/4588595) loss:3.136 lr:0.0000759 epoch_Time:28479.0min: [2024-01-02 23:25:17,960][model8_pretrain.py][INFO] Epoch:[0/2](55000/4588595) loss:3.317 lr:0.0000759 epoch_Time:28479.0min: [2024-01-02 23:26:01,980][model8_pretrain.py][INFO] Epoch:[0/2](55100/4588595) loss:3.032 lr:0.0000755 epoch_Time:28487.0min: [2024-01-02 23:26:01,980][model8_pretrain.py][INFO] Epoch:[0/2](55100/4588595) loss:3.348 lr:0.0000755 epoch_Time:28487.0min: [2024-01-02 23:26:01,980][model8_pretrain.py][INFO] Epoch:[0/2](55100/4588595) loss:2.650 lr:0.0000755 epoch_Time:28487.0min: [2024-01-02 23:26:01,980][model8_pretrain.py][INFO] Epoch:[0/2](55100/4588595) loss:3.140 lr:0.0000755 epoch_Time:28487.0min: [2024-01-02 23:26:01,980][model8_pretrain.py][INFO] Epoch:[0/2](55100/4588595) loss:2.803 lr:0.0000755 epoch_Time:28487.0min: [2024-01-02 23:26:01,980][model8_pretrain.py][INFO] Epoch:[0/2](55100/4588595) loss:2.845 lr:0.0000755 epoch_Time:28487.0min: [2024-01-02 23:26:01,980][model8_pretrain.py][INFO] Epoch:[0/2](55100/4588595) loss:2.738 lr:0.0000755 
epoch_Time:28487.0min: [2024-01-02 23:26:01,980][model8_pretrain.py][INFO] Epoch:[0/2](55100/4588595) loss:3.503 lr:0.0000755 epoch_Time:28487.0min: [2024-01-02 23:26:38,917][model8_pretrain.py][INFO] Epoch:[0/2](55200/4588595) loss:3.294 lr:0.0000750 epoch_Time:28486.0min: [2024-01-02 23:26:38,917][model8_pretrain.py][INFO] Epoch:[0/2](55200/4588595) loss:3.430 lr:0.0000750 epoch_Time:28486.0min: [2024-01-02 23:26:38,917][model8_pretrain.py][INFO] Epoch:[0/2](55200/4588595) loss:3.047 lr:0.0000750 epoch_Time:28486.0min: [2024-01-02 23:26:38,917][model8_pretrain.py][INFO] Epoch:[0/2](55200/4588595) loss:3.209 lr:0.0000750 epoch_Time:28486.0min: [2024-01-02 23:26:38,918][model8_pretrain.py][INFO] Epoch:[0/2](55200/4588595) loss:2.936 lr:0.0000750 epoch_Time:28486.0min: [2024-01-02 23:26:38,918][model8_pretrain.py][INFO] Epoch:[0/2](55200/4588595) loss:2.924 lr:0.0000750 epoch_Time:28486.0min: [2024-01-02 23:26:38,918][model8_pretrain.py][INFO] Epoch:[0/2](55200/4588595) loss:2.619 lr:0.0000750 epoch_Time:28486.0min: [2024-01-02 23:26:38,918][model8_pretrain.py][INFO] Epoch:[0/2](55200/4588595) loss:3.249 lr:0.0000750 epoch_Time:28486.0min: [2024-01-02 23:27:15,862][model8_pretrain.py][INFO] Epoch:[0/2](55300/4588595) loss:3.241 lr:0.0000745 epoch_Time:28484.0min: [2024-01-02 23:27:15,862][model8_pretrain.py][INFO] Epoch:[0/2](55300/4588595) loss:3.392 lr:0.0000745 epoch_Time:28484.0min: [2024-01-02 23:27:15,862][model8_pretrain.py][INFO] Epoch:[0/2](55300/4588595) loss:2.695 lr:0.0000745 epoch_Time:28484.0min: [2024-01-02 23:27:15,862][model8_pretrain.py][INFO] Epoch:[0/2](55300/4588595) loss:2.877 lr:0.0000745 epoch_Time:28484.0min: [2024-01-02 23:27:15,862][model8_pretrain.py][INFO] Epoch:[0/2](55300/4588595) loss:3.578 lr:0.0000745 epoch_Time:28484.0min: [2024-01-02 23:27:15,862][model8_pretrain.py][INFO] Epoch:[0/2](55300/4588595) loss:3.351 lr:0.0000745 epoch_Time:28484.0min: [2024-01-02 23:27:15,862][model8_pretrain.py][INFO] Epoch:[0/2](55300/4588595) 
loss:3.388 lr:0.0000745 epoch_Time:28484.0min: [2024-01-02 23:27:15,862][model8_pretrain.py][INFO] Epoch:[0/2](55300/4588595) loss:2.170 lr:0.0000745 epoch_Time:28484.0min: [2024-01-02 23:27:52,799][model8_pretrain.py][INFO] Epoch:[0/2](55400/4588595) loss:1.884 lr:0.0000740 epoch_Time:28482.0min: [2024-01-02 23:27:52,799][model8_pretrain.py][INFO] Epoch:[0/2](55400/4588595) loss:3.048 lr:0.0000740 epoch_Time:28482.0min: [2024-01-02 23:27:52,799][model8_pretrain.py][INFO] Epoch:[0/2](55400/4588595) loss:3.176 lr:0.0000740 epoch_Time:28482.0min: [2024-01-02 23:27:52,799][model8_pretrain.py][INFO] Epoch:[0/2](55400/4588595) loss:3.010 lr:0.0000740 epoch_Time:28482.0min: [2024-01-02 23:27:52,799][model8_pretrain.py][INFO] Epoch:[0/2](55400/4588595) loss:2.837 lr:0.0000740 epoch_Time:28482.0min: [2024-01-02 23:27:52,799][model8_pretrain.py][INFO] Epoch:[0/2](55400/4588595) loss:3.168 lr:0.0000740 epoch_Time:28482.0min: [2024-01-02 23:27:52,799][model8_pretrain.py][INFO] Epoch:[0/2](55400/4588595) loss:3.101 lr:0.0000740 epoch_Time:28482.0min: [2024-01-02 23:27:52,800][model8_pretrain.py][INFO] Epoch:[0/2](55400/4588595) loss:3.084 lr:0.0000740 epoch_Time:28482.0min: [2024-01-02 23:28:29,737][model8_pretrain.py][INFO] Epoch:[0/2](55500/4588595) loss:3.105 lr:0.0000735 epoch_Time:28481.0min: [2024-01-02 23:28:29,737][model8_pretrain.py][INFO] Epoch:[0/2](55500/4588595) loss:3.276 lr:0.0000735 epoch_Time:28481.0min: [2024-01-02 23:28:29,737][model8_pretrain.py][INFO] Epoch:[0/2](55500/4588595) loss:3.534 lr:0.0000735 epoch_Time:28481.0min: [2024-01-02 23:28:29,737][model8_pretrain.py][INFO] Epoch:[0/2](55500/4588595) loss:2.842 lr:0.0000735 epoch_Time:28481.0min: [2024-01-02 23:28:29,737][model8_pretrain.py][INFO] Epoch:[0/2](55500/4588595) loss:3.362 lr:0.0000735 epoch_Time:28481.0min: [2024-01-02 23:28:29,737][model8_pretrain.py][INFO] Epoch:[0/2](55500/4588595) loss:3.886 lr:0.0000735 epoch_Time:28481.0min: [2024-01-02 23:28:29,737][model8_pretrain.py][INFO] 
Epoch:[0/2](55500/4588595) loss:3.559 lr:0.0000735 epoch_Time:28481.0min: [2024-01-02 23:28:29,737][model8_pretrain.py][INFO] Epoch:[0/2](55500/4588595) loss:3.368 lr:0.0000735 epoch_Time:28481.0min: [2024-01-02 23:29:06,683][model8_pretrain.py][INFO] Epoch:[0/2](55600/4588595) loss:2.996 lr:0.0000731 epoch_Time:28479.0min: [2024-01-02 23:29:06,683][model8_pretrain.py][INFO] Epoch:[0/2](55600/4588595) loss:3.334 lr:0.0000731 epoch_Time:28479.0min: [2024-01-02 23:29:06,683][model8_pretrain.py][INFO] Epoch:[0/2](55600/4588595) loss:2.868 lr:0.0000731 epoch_Time:28479.0min: [2024-01-02 23:29:06,683][model8_pretrain.py][INFO] Epoch:[0/2](55600/4588595) loss:3.167 lr:0.0000731 epoch_Time:28479.0min: [2024-01-02 23:29:06,683][model8_pretrain.py][INFO] Epoch:[0/2](55600/4588595) loss:3.255 lr:0.0000731 epoch_Time:28479.0min: [2024-01-02 23:29:06,683][model8_pretrain.py][INFO] Epoch:[0/2](55600/4588595) loss:3.216 lr:0.0000731 epoch_Time:28479.0min: [2024-01-02 23:29:06,683][model8_pretrain.py][INFO] Epoch:[0/2](55600/4588595) loss:3.407 lr:0.0000731 epoch_Time:28479.0min: [2024-01-02 23:29:06,683][model8_pretrain.py][INFO] Epoch:[0/2](55600/4588595) loss:3.681 lr:0.0000731 epoch_Time:28479.0min: [2024-01-02 23:29:43,646][model8_pretrain.py][INFO] Epoch:[0/2](55700/4588595) loss:3.165 lr:0.0000726 epoch_Time:28478.0min: [2024-01-02 23:29:43,646][model8_pretrain.py][INFO] Epoch:[0/2](55700/4588595) loss:2.765 lr:0.0000726 epoch_Time:28478.0min: [2024-01-02 23:29:43,646][model8_pretrain.py][INFO] Epoch:[0/2](55700/4588595) loss:3.413 lr:0.0000726 epoch_Time:28478.0min: [2024-01-02 23:29:43,646][model8_pretrain.py][INFO] Epoch:[0/2](55700/4588595) loss:3.152 lr:0.0000726 epoch_Time:28478.0min: [2024-01-02 23:29:43,646][model8_pretrain.py][INFO] Epoch:[0/2](55700/4588595) loss:2.956 lr:0.0000726 epoch_Time:28478.0min: [2024-01-02 23:29:43,646][model8_pretrain.py][INFO] Epoch:[0/2](55700/4588595) loss:2.944 lr:0.0000726 epoch_Time:28478.0min: [2024-01-02 
23:29:43,646][model8_pretrain.py][INFO] Epoch:[0/2](55700/4588595) loss:3.480 lr:0.0000726 epoch_Time:28478.0min: [2024-01-02 23:29:43,647][model8_pretrain.py][INFO] Epoch:[0/2](55700/4588595) loss:3.273 lr:0.0000726 epoch_Time:28478.0min: [2024-01-02 23:30:20,600][model8_pretrain.py][INFO] Epoch:[0/2](55800/4588595) loss:3.174 lr:0.0000721 epoch_Time:28476.0min: [2024-01-02 23:30:20,600][model8_pretrain.py][INFO] Epoch:[0/2](55800/4588595) loss:2.469 lr:0.0000721 epoch_Time:28476.0min: [2024-01-02 23:30:20,600][model8_pretrain.py][INFO] Epoch:[0/2](55800/4588595) loss:3.547 lr:0.0000721 epoch_Time:28476.0min: [2024-01-02 23:30:20,600][model8_pretrain.py][INFO] Epoch:[0/2](55800/4588595) loss:2.927 lr:0.0000721 epoch_Time:28476.0min: [2024-01-02 23:30:20,600][model8_pretrain.py][INFO] Epoch:[0/2](55800/4588595) loss:3.433 lr:0.0000721 epoch_Time:28476.0min: [2024-01-02 23:30:20,600][model8_pretrain.py][INFO] Epoch:[0/2](55800/4588595) loss:3.283 lr:0.0000721 epoch_Time:28476.0min: [2024-01-02 23:30:20,600][model8_pretrain.py][INFO] Epoch:[0/2](55800/4588595) loss:3.364 lr:0.0000721 epoch_Time:28476.0min: [2024-01-02 23:30:20,600][model8_pretrain.py][INFO] Epoch:[0/2](55800/4588595) loss:3.253 lr:0.0000721 epoch_Time:28476.0min: [2024-01-02 23:31:04,532][model8_pretrain.py][INFO] Epoch:[0/2](55900/4588595) loss:3.312 lr:0.0000716 epoch_Time:28483.0min: [2024-01-02 23:31:04,532][model8_pretrain.py][INFO] Epoch:[0/2](55900/4588595) loss:3.408 lr:0.0000716 epoch_Time:28483.0min: [2024-01-02 23:31:04,532][model8_pretrain.py][INFO] Epoch:[0/2](55900/4588595) loss:3.203 lr:0.0000716 epoch_Time:28483.0min: [2024-01-02 23:31:04,532][model8_pretrain.py][INFO] Epoch:[0/2](55900/4588595) loss:3.141 lr:0.0000716 epoch_Time:28483.0min: [2024-01-02 23:31:04,532][model8_pretrain.py][INFO] Epoch:[0/2](55900/4588595) loss:2.706 lr:0.0000716 epoch_Time:28483.0min: [2024-01-02 23:31:04,532][model8_pretrain.py][INFO] Epoch:[0/2](55900/4588595) loss:3.036 lr:0.0000716 
epoch_Time:28483.0min: [2024-01-02 23:31:04,532][model8_pretrain.py][INFO] Epoch:[0/2](55900/4588595) loss:3.126 lr:0.0000716 epoch_Time:28483.0min: [2024-01-02 23:31:04,532][model8_pretrain.py][INFO] Epoch:[0/2](55900/4588595) loss:3.531 lr:0.0000716 epoch_Time:28483.0min: [2024-01-02 23:31:41,464][model8_pretrain.py][INFO] Epoch:[0/2](56000/4588595) loss:3.219 lr:0.0000712 epoch_Time:28482.0min: [2024-01-02 23:31:41,464][model8_pretrain.py][INFO] Epoch:[0/2](56000/4588595) loss:3.278 lr:0.0000712 epoch_Time:28482.0min: [2024-01-02 23:31:41,464][model8_pretrain.py][INFO] Epoch:[0/2](56000/4588595) loss:3.184 lr:0.0000712 epoch_Time:28482.0min: [2024-01-02 23:31:41,464][model8_pretrain.py][INFO] Epoch:[0/2](56000/4588595) loss:3.139 lr:0.0000712 epoch_Time:28482.0min: [2024-01-02 23:31:41,464][model8_pretrain.py][INFO] Epoch:[0/2](56000/4588595) loss:3.475 lr:0.0000712 epoch_Time:28482.0min: [2024-01-02 23:31:41,464][model8_pretrain.py][INFO] Epoch:[0/2](56000/4588595) loss:3.148 lr:0.0000712 epoch_Time:28482.0min: [2024-01-02 23:31:41,464][model8_pretrain.py][INFO] Epoch:[0/2](56000/4588595) loss:2.749 lr:0.0000712 epoch_Time:28482.0min: [2024-01-02 23:31:41,464][model8_pretrain.py][INFO] Epoch:[0/2](56000/4588595) loss:3.258 lr:0.0000712 epoch_Time:28482.0min: [2024-01-02 23:32:18,403][model8_pretrain.py][INFO] Epoch:[0/2](56100/4588595) loss:3.392 lr:0.0000707 epoch_Time:28480.0min: [2024-01-02 23:32:18,403][model8_pretrain.py][INFO] Epoch:[0/2](56100/4588595) loss:3.146 lr:0.0000707 epoch_Time:28480.0min: [2024-01-02 23:32:18,403][model8_pretrain.py][INFO] Epoch:[0/2](56100/4588595) loss:3.483 lr:0.0000707 epoch_Time:28480.0min: [2024-01-02 23:32:18,404][model8_pretrain.py][INFO] Epoch:[0/2](56100/4588595) loss:2.997 lr:0.0000707 epoch_Time:28480.0min: [2024-01-02 23:32:18,404][model8_pretrain.py][INFO] Epoch:[0/2](56100/4588595) loss:3.334 lr:0.0000707 epoch_Time:28480.0min: [2024-01-02 23:32:18,404][model8_pretrain.py][INFO] Epoch:[0/2](56100/4588595) 
loss:3.558 lr:0.0000707 epoch_Time:28480.0min: [2024-01-02 23:32:18,404][model8_pretrain.py][INFO] Epoch:[0/2](56100/4588595) loss:3.593 lr:0.0000707 epoch_Time:28480.0min: [2024-01-02 23:32:18,404][model8_pretrain.py][INFO] Epoch:[0/2](56100/4588595) loss:3.145 lr:0.0000707 epoch_Time:28480.0min: [2024-01-02 23:32:55,335][model8_pretrain.py][INFO] Epoch:[0/2](56200/4588595) loss:3.045 lr:0.0000702 epoch_Time:28478.0min: [2024-01-02 23:32:55,335][model8_pretrain.py][INFO] Epoch:[0/2](56200/4588595) loss:2.828 lr:0.0000702 epoch_Time:28478.0min: [2024-01-02 23:32:55,335][model8_pretrain.py][INFO] Epoch:[0/2](56200/4588595) loss:3.369 lr:0.0000702 epoch_Time:28478.0min: [2024-01-02 23:32:55,335][model8_pretrain.py][INFO] Epoch:[0/2](56200/4588595) loss:3.243 lr:0.0000702 epoch_Time:28478.0min: [2024-01-02 23:32:55,335][model8_pretrain.py][INFO] Epoch:[0/2](56200/4588595) loss:3.233 lr:0.0000702 epoch_Time:28478.0min: [2024-01-02 23:32:55,335][model8_pretrain.py][INFO] Epoch:[0/2](56200/4588595) loss:2.906 lr:0.0000702 epoch_Time:28478.0min: [2024-01-02 23:32:55,335][model8_pretrain.py][INFO] Epoch:[0/2](56200/4588595) loss:3.399 lr:0.0000702 epoch_Time:28478.0min: [2024-01-02 23:32:55,335][model8_pretrain.py][INFO] Epoch:[0/2](56200/4588595) loss:2.899 lr:0.0000702 epoch_Time:28478.0min: [2024-01-02 23:33:32,279][model8_pretrain.py][INFO] Epoch:[0/2](56300/4588595) loss:3.361 lr:0.0000698 epoch_Time:28477.0min: [2024-01-02 23:33:32,279][model8_pretrain.py][INFO] Epoch:[0/2](56300/4588595) loss:2.785 lr:0.0000698 epoch_Time:28477.0min: [2024-01-02 23:33:32,279][model8_pretrain.py][INFO] Epoch:[0/2](56300/4588595) loss:2.940 lr:0.0000698 epoch_Time:28477.0min: [2024-01-02 23:33:32,279][model8_pretrain.py][INFO] Epoch:[0/2](56300/4588595) loss:3.528 lr:0.0000698 epoch_Time:28477.0min: [2024-01-02 23:33:32,279][model8_pretrain.py][INFO] Epoch:[0/2](56300/4588595) loss:2.953 lr:0.0000698 epoch_Time:28477.0min: [2024-01-02 23:33:32,279][model8_pretrain.py][INFO] 
Epoch:[0/2](56300/4588595) loss:3.261 lr:0.0000698 epoch_Time:28477.0min: [2024-01-02 23:33:32,279][model8_pretrain.py][INFO] Epoch:[0/2](56300/4588595) loss:3.429 lr:0.0000698 epoch_Time:28477.0min: [2024-01-02 23:33:32,279][model8_pretrain.py][INFO] Epoch:[0/2](56300/4588595) loss:3.069 lr:0.0000698 epoch_Time:28477.0min: [2024-01-02 23:34:09,217][model8_pretrain.py][INFO] Epoch:[0/2](56400/4588595) loss:2.872 lr:0.0000693 epoch_Time:28475.0min: [2024-01-02 23:34:09,217][model8_pretrain.py][INFO] Epoch:[0/2](56400/4588595) loss:3.351 lr:0.0000693 epoch_Time:28475.0min: [2024-01-02 23:34:09,217][model8_pretrain.py][INFO] Epoch:[0/2](56400/4588595) loss:3.130 lr:0.0000693 epoch_Time:28475.0min: [2024-01-02 23:34:09,217][model8_pretrain.py][INFO] Epoch:[0/2](56400/4588595) loss:3.187 lr:0.0000693 epoch_Time:28475.0min: [2024-01-02 23:34:09,217][model8_pretrain.py][INFO] Epoch:[0/2](56400/4588595) loss:3.091 lr:0.0000693 epoch_Time:28475.0min: [2024-01-02 23:34:09,217][model8_pretrain.py][INFO] Epoch:[0/2](56400/4588595) loss:3.082 lr:0.0000693 epoch_Time:28475.0min: [2024-01-02 23:34:09,217][model8_pretrain.py][INFO] Epoch:[0/2](56400/4588595) loss:3.670 lr:0.0000693 epoch_Time:28475.0min: [2024-01-02 23:34:09,217][model8_pretrain.py][INFO] Epoch:[0/2](56400/4588595) loss:3.208 lr:0.0000693 epoch_Time:28475.0min: [2024-01-02 23:34:46,154][model8_pretrain.py][INFO] Epoch:[0/2](56500/4588595) loss:3.236 lr:0.0000688 epoch_Time:28474.0min: [2024-01-02 23:34:46,154][model8_pretrain.py][INFO] Epoch:[0/2](56500/4588595) loss:3.350 lr:0.0000688 epoch_Time:28474.0min: [2024-01-02 23:34:46,154][model8_pretrain.py][INFO] Epoch:[0/2](56500/4588595) loss:3.612 lr:0.0000688 epoch_Time:28474.0min: [2024-01-02 23:34:46,154][model8_pretrain.py][INFO] Epoch:[0/2](56500/4588595) loss:3.229 lr:0.0000688 epoch_Time:28474.0min: [2024-01-02 23:34:46,154][model8_pretrain.py][INFO] Epoch:[0/2](56500/4588595) loss:3.498 lr:0.0000688 epoch_Time:28474.0min: [2024-01-02 
23:34:46,154][model8_pretrain.py][INFO] Epoch:[0/2](56500/4588595) loss:3.330 lr:0.0000688 epoch_Time:28474.0min: [2024-01-02 23:34:46,154][model8_pretrain.py][INFO] Epoch:[0/2](56500/4588595) loss:3.212 lr:0.0000688 epoch_Time:28474.0min: [2024-01-02 23:34:46,155][model8_pretrain.py][INFO] Epoch:[0/2](56500/4588595) loss:3.225 lr:0.0000688 epoch_Time:28474.0min: [2024-01-02 23:35:23,105][model8_pretrain.py][INFO] Epoch:[0/2](56600/4588595) loss:3.175 lr:0.0000684 epoch_Time:28472.0min: [2024-01-02 23:35:23,105][model8_pretrain.py][INFO] Epoch:[0/2](56600/4588595) loss:3.289 lr:0.0000684 epoch_Time:28472.0min: [2024-01-02 23:35:23,105][model8_pretrain.py][INFO] Epoch:[0/2](56600/4588595) loss:3.554 lr:0.0000684 epoch_Time:28472.0min: [2024-01-02 23:35:23,105][model8_pretrain.py][INFO] Epoch:[0/2](56600/4588595) loss:2.710 lr:0.0000684 epoch_Time:28472.0min: [2024-01-02 23:35:23,105][model8_pretrain.py][INFO] Epoch:[0/2](56600/4588595) loss:3.499 lr:0.0000684 epoch_Time:28472.0min: [2024-01-02 23:35:23,105][model8_pretrain.py][INFO] Epoch:[0/2](56600/4588595) loss:3.377 lr:0.0000684 epoch_Time:28472.0min: [2024-01-02 23:35:23,105][model8_pretrain.py][INFO] Epoch:[0/2](56600/4588595) loss:3.412 lr:0.0000684 epoch_Time:28472.0min: [2024-01-02 23:35:23,106][model8_pretrain.py][INFO] Epoch:[0/2](56600/4588595) loss:3.295 lr:0.0000684 epoch_Time:28472.0min: [2024-01-02 23:36:07,195][model8_pretrain.py][INFO] Epoch:[0/2](56700/4588595) loss:3.348 lr:0.0000679 epoch_Time:28480.0min: [2024-01-02 23:36:07,195][model8_pretrain.py][INFO] Epoch:[0/2](56700/4588595) loss:2.999 lr:0.0000679 epoch_Time:28480.0min: [2024-01-02 23:36:07,195][model8_pretrain.py][INFO] Epoch:[0/2](56700/4588595) loss:3.337 lr:0.0000679 epoch_Time:28480.0min: [2024-01-02 23:36:07,195][model8_pretrain.py][INFO] Epoch:[0/2](56700/4588595) loss:2.844 lr:0.0000679 epoch_Time:28480.0min: [2024-01-02 23:36:07,195][model8_pretrain.py][INFO] Epoch:[0/2](56700/4588595) loss:2.926 lr:0.0000679 
epoch_Time:28480.0min: [2024-01-02 23:36:07,195][model8_pretrain.py][INFO] Epoch:[0/2](56700/4588595) loss:3.376 lr:0.0000679 epoch_Time:28480.0min: [2024-01-02 23:36:07,195][model8_pretrain.py][INFO] Epoch:[0/2](56700/4588595) loss:3.075 lr:0.0000679 epoch_Time:28480.0min: [2024-01-02 23:36:07,195][model8_pretrain.py][INFO] Epoch:[0/2](56700/4588595) loss:3.427 lr:0.0000679 epoch_Time:28480.0min: [2024-01-02 23:36:44,125][model8_pretrain.py][INFO] Epoch:[0/2](56800/4588595) loss:3.554 lr:0.0000675 epoch_Time:28479.0min: [2024-01-02 23:36:44,125][model8_pretrain.py][INFO] Epoch:[0/2](56800/4588595) loss:2.847 lr:0.0000675 epoch_Time:28479.0min: [2024-01-02 23:36:44,125][model8_pretrain.py][INFO] Epoch:[0/2](56800/4588595) loss:3.045 lr:0.0000675 epoch_Time:28479.0min: [2024-01-02 23:36:44,125][model8_pretrain.py][INFO] Epoch:[0/2](56800/4588595) loss:2.918 lr:0.0000675 epoch_Time:28479.0min: [2024-01-02 23:36:44,125][model8_pretrain.py][INFO] Epoch:[0/2](56800/4588595) loss:3.695 lr:0.0000675 epoch_Time:28479.0min: [2024-01-02 23:36:44,125][model8_pretrain.py][INFO] Epoch:[0/2](56800/4588595) loss:3.005 lr:0.0000675 epoch_Time:28479.0min: [2024-01-02 23:36:44,125][model8_pretrain.py][INFO] Epoch:[0/2](56800/4588595) loss:2.922 lr:0.0000675 epoch_Time:28479.0min: [2024-01-02 23:36:44,125][model8_pretrain.py][INFO] Epoch:[0/2](56800/4588595) loss:3.027 lr:0.0000675 epoch_Time:28479.0min: [2024-01-02 23:37:21,065][model8_pretrain.py][INFO] Epoch:[0/2](56900/4588595) loss:3.441 lr:0.0000670 epoch_Time:28477.0min: [2024-01-02 23:37:21,065][model8_pretrain.py][INFO] Epoch:[0/2](56900/4588595) loss:3.542 lr:0.0000670 epoch_Time:28477.0min: [2024-01-02 23:37:21,065][model8_pretrain.py][INFO] Epoch:[0/2](56900/4588595) loss:2.810 lr:0.0000670 epoch_Time:28477.0min: [2024-01-02 23:37:21,065][model8_pretrain.py][INFO] Epoch:[0/2](56900/4588595) loss:3.161 lr:0.0000670 epoch_Time:28477.0min: [2024-01-02 23:37:21,066][model8_pretrain.py][INFO] Epoch:[0/2](56900/4588595) 
loss:3.172 lr:0.0000670 epoch_Time:28477.0min: [2024-01-02 23:37:21,066][model8_pretrain.py][INFO] Epoch:[0/2](56900/4588595) loss:3.102 lr:0.0000670 epoch_Time:28477.0min: [2024-01-02 23:37:21,066][model8_pretrain.py][INFO] Epoch:[0/2](56900/4588595) loss:3.298 lr:0.0000670 epoch_Time:28477.0min: [2024-01-02 23:37:21,066][model8_pretrain.py][INFO] Epoch:[0/2](56900/4588595) loss:3.462 lr:0.0000670 epoch_Time:28477.0min: [2024-01-02 23:37:58,014][model8_pretrain.py][INFO] Epoch:[0/2](57000/4588595) loss:3.082 lr:0.0000665 epoch_Time:28475.0min: [2024-01-02 23:37:58,014][model8_pretrain.py][INFO] Epoch:[0/2](57000/4588595) loss:2.562 lr:0.0000665 epoch_Time:28475.0min: [2024-01-02 23:37:58,014][model8_pretrain.py][INFO] Epoch:[0/2](57000/4588595) loss:3.336 lr:0.0000665 epoch_Time:28475.0min: [2024-01-02 23:37:58,014][model8_pretrain.py][INFO] Epoch:[0/2](57000/4588595) loss:2.979 lr:0.0000665 epoch_Time:28475.0min: [2024-01-02 23:37:58,014][model8_pretrain.py][INFO] Epoch:[0/2](57000/4588595) loss:3.455 lr:0.0000665 epoch_Time:28475.0min: [2024-01-02 23:37:58,014][model8_pretrain.py][INFO] Epoch:[0/2](57000/4588595) loss:3.472 lr:0.0000665 epoch_Time:28475.0min: [2024-01-02 23:37:58,014][model8_pretrain.py][INFO] Epoch:[0/2](57000/4588595) loss:3.371 lr:0.0000665 epoch_Time:28475.0min: [2024-01-02 23:37:58,014][model8_pretrain.py][INFO] Epoch:[0/2](57000/4588595) loss:3.125 lr:0.0000665 epoch_Time:28475.0min: [2024-01-02 23:38:34,964][model8_pretrain.py][INFO] Epoch:[0/2](57100/4588595) loss:2.835 lr:0.0000661 epoch_Time:28474.0min: [2024-01-02 23:38:34,964][model8_pretrain.py][INFO] Epoch:[0/2](57100/4588595) loss:2.750 lr:0.0000661 epoch_Time:28474.0min: [2024-01-02 23:38:34,964][model8_pretrain.py][INFO] Epoch:[0/2](57100/4588595) loss:2.804 lr:0.0000661 epoch_Time:28474.0min: [2024-01-02 23:38:34,964][model8_pretrain.py][INFO] Epoch:[0/2](57100/4588595) loss:2.962 lr:0.0000661 epoch_Time:28474.0min: [2024-01-02 23:38:34,964][model8_pretrain.py][INFO] 
Epoch:[0/2](57100/4588595) loss:3.497 lr:0.0000661 epoch_Time:28474.0min: [2024-01-02 23:38:34,964][model8_pretrain.py][INFO] Epoch:[0/2](57100/4588595) loss:3.113 lr:0.0000661 epoch_Time:28474.0min: [2024-01-02 23:38:34,964][model8_pretrain.py][INFO] Epoch:[0/2](57100/4588595) loss:2.937 lr:0.0000661 epoch_Time:28474.0min: [2024-01-02 23:38:34,964][model8_pretrain.py][INFO] Epoch:[0/2](57100/4588595) loss:3.209 lr:0.0000661 epoch_Time:28474.0min: [2024-01-02 23:39:11,933][model8_pretrain.py][INFO] Epoch:[0/2](57200/4588595) loss:3.388 lr:0.0000656 epoch_Time:28472.0min: [2024-01-02 23:39:11,933][model8_pretrain.py][INFO] Epoch:[0/2](57200/4588595) loss:3.078 lr:0.0000656 epoch_Time:28472.0min: [2024-01-02 23:39:11,933][model8_pretrain.py][INFO] Epoch:[0/2](57200/4588595) loss:2.667 lr:0.0000656 epoch_Time:28472.0min: [2024-01-02 23:39:11,933][model8_pretrain.py][INFO] Epoch:[0/2](57200/4588595) loss:3.207 lr:0.0000656 epoch_Time:28472.0min: [2024-01-02 23:39:11,934][model8_pretrain.py][INFO] Epoch:[0/2](57200/4588595) loss:2.678 lr:0.0000656 epoch_Time:28472.0min: [2024-01-02 23:39:11,934][model8_pretrain.py][INFO] Epoch:[0/2](57200/4588595) loss:3.280 lr:0.0000656 epoch_Time:28472.0min: [2024-01-02 23:39:11,934][model8_pretrain.py][INFO] Epoch:[0/2](57200/4588595) loss:2.949 lr:0.0000656 epoch_Time:28472.0min: [2024-01-02 23:39:11,934][model8_pretrain.py][INFO] Epoch:[0/2](57200/4588595) loss:3.049 lr:0.0000656 epoch_Time:28472.0min: [2024-01-02 23:39:48,912][model8_pretrain.py][INFO] Epoch:[0/2](57300/4588595) loss:3.323 lr:0.0000652 epoch_Time:28470.0min: [2024-01-02 23:39:48,912][model8_pretrain.py][INFO] Epoch:[0/2](57300/4588595) loss:2.977 lr:0.0000652 epoch_Time:28470.0min: [2024-01-02 23:39:48,912][model8_pretrain.py][INFO] Epoch:[0/2](57300/4588595) loss:2.906 lr:0.0000652 epoch_Time:28470.0min: [2024-01-02 23:39:48,912][model8_pretrain.py][INFO] Epoch:[0/2](57300/4588595) loss:3.762 lr:0.0000652 epoch_Time:28470.0min: [2024-01-02 
23:39:48,912][model8_pretrain.py][INFO] Epoch:[0/2](57300/4588595) loss:2.901 lr:0.0000652 epoch_Time:28470.0min: [2024-01-02 23:39:48,912][model8_pretrain.py][INFO] Epoch:[0/2](57300/4588595) loss:3.050 lr:0.0000652 epoch_Time:28470.0min: [2024-01-02 23:39:48,912][model8_pretrain.py][INFO] Epoch:[0/2](57300/4588595) loss:3.434 lr:0.0000652 epoch_Time:28470.0min: [2024-01-02 23:39:48,912][model8_pretrain.py][INFO] Epoch:[0/2](57300/4588595) loss:3.404 lr:0.0000652 epoch_Time:28470.0min: [2024-01-02 23:40:25,858][model8_pretrain.py][INFO] Epoch:[0/2](57400/4588595) loss:3.140 lr:0.0000647 epoch_Time:28469.0min: [2024-01-02 23:40:25,858][model8_pretrain.py][INFO] Epoch:[0/2](57400/4588595) loss:3.340 lr:0.0000647 epoch_Time:28469.0min: [2024-01-02 23:40:25,858][model8_pretrain.py][INFO] Epoch:[0/2](57400/4588595) loss:3.241 lr:0.0000647 epoch_Time:28469.0min: [2024-01-02 23:40:25,858][model8_pretrain.py][INFO] Epoch:[0/2](57400/4588595) loss:2.936 lr:0.0000647 epoch_Time:28469.0min: [2024-01-02 23:40:25,858][model8_pretrain.py][INFO] Epoch:[0/2](57400/4588595) loss:3.160 lr:0.0000647 epoch_Time:28469.0min: [2024-01-02 23:40:25,858][model8_pretrain.py][INFO] Epoch:[0/2](57400/4588595) loss:3.198 lr:0.0000647 epoch_Time:28469.0min: [2024-01-02 23:40:25,859][model8_pretrain.py][INFO] Epoch:[0/2](57400/4588595) loss:3.364 lr:0.0000647 epoch_Time:28469.0min: [2024-01-02 23:40:25,859][model8_pretrain.py][INFO] Epoch:[0/2](57400/4588595) loss:3.681 lr:0.0000647 epoch_Time:28469.0min: [2024-01-02 23:41:09,927][model8_pretrain.py][INFO] Epoch:[0/2](57500/4588595) loss:3.341 lr:0.0000643 epoch_Time:28476.0min: [2024-01-02 23:41:09,927][model8_pretrain.py][INFO] Epoch:[0/2](57500/4588595) loss:3.138 lr:0.0000643 epoch_Time:28476.0min: [2024-01-02 23:41:09,927][model8_pretrain.py][INFO] Epoch:[0/2](57500/4588595) loss:2.835 lr:0.0000643 epoch_Time:28476.0min: [2024-01-02 23:41:09,927][model8_pretrain.py][INFO] Epoch:[0/2](57500/4588595) loss:3.147 lr:0.0000643 
epoch_Time:28476.0min: [2024-01-02 23:41:09,927][model8_pretrain.py][INFO] Epoch:[0/2](57500/4588595) loss:2.888 lr:0.0000643 epoch_Time:28476.0min: [2024-01-02 23:41:09,927][model8_pretrain.py][INFO] Epoch:[0/2](57500/4588595) loss:3.156 lr:0.0000643 epoch_Time:28476.0min: [2024-01-02 23:41:09,927][model8_pretrain.py][INFO] Epoch:[0/2](57500/4588595) loss:3.057 lr:0.0000643 epoch_Time:28476.0min: [2024-01-02 23:41:09,927][model8_pretrain.py][INFO] Epoch:[0/2](57500/4588595) loss:3.081 lr:0.0000643 epoch_Time:28476.0min: [2024-01-02 23:41:46,861][model8_pretrain.py][INFO] Epoch:[0/2](57600/4588595) loss:3.183 lr:0.0000638 epoch_Time:28475.0min: [2024-01-02 23:41:46,861][model8_pretrain.py][INFO] Epoch:[0/2](57600/4588595) loss:3.001 lr:0.0000638 epoch_Time:28475.0min: [2024-01-02 23:41:46,861][model8_pretrain.py][INFO] Epoch:[0/2](57600/4588595) loss:2.807 lr:0.0000638 epoch_Time:28475.0min: [2024-01-02 23:41:46,861][model8_pretrain.py][INFO] Epoch:[0/2](57600/4588595) loss:3.143 lr:0.0000638 epoch_Time:28475.0min: [2024-01-02 23:41:46,861][model8_pretrain.py][INFO] Epoch:[0/2](57600/4588595) loss:2.429 lr:0.0000638 epoch_Time:28475.0min: [2024-01-02 23:41:46,861][model8_pretrain.py][INFO] Epoch:[0/2](57600/4588595) loss:3.435 lr:0.0000638 epoch_Time:28475.0min: [2024-01-02 23:41:46,861][model8_pretrain.py][INFO] Epoch:[0/2](57600/4588595) loss:2.115 lr:0.0000638 epoch_Time:28475.0min: [2024-01-02 23:41:46,861][model8_pretrain.py][INFO] Epoch:[0/2](57600/4588595) loss:3.537 lr:0.0000638 epoch_Time:28475.0min: [2024-01-02 23:42:23,789][model8_pretrain.py][INFO] Epoch:[0/2](57700/4588595) loss:3.220 lr:0.0000634 epoch_Time:28473.0min: [2024-01-02 23:42:23,789][model8_pretrain.py][INFO] Epoch:[0/2](57700/4588595) loss:2.748 lr:0.0000634 epoch_Time:28473.0min: [2024-01-02 23:42:23,789][model8_pretrain.py][INFO] Epoch:[0/2](57700/4588595) loss:3.447 lr:0.0000634 epoch_Time:28473.0min: [2024-01-02 23:42:23,789][model8_pretrain.py][INFO] Epoch:[0/2](57700/4588595) 
loss:2.818 lr:0.0000634 epoch_Time:28473.0min: [2024-01-02 23:42:23,789][model8_pretrain.py][INFO] Epoch:[0/2](57700/4588595) loss:3.387 lr:0.0000634 epoch_Time:28473.0min: [2024-01-02 23:42:23,789][model8_pretrain.py][INFO] Epoch:[0/2](57700/4588595) loss:3.195 lr:0.0000634 epoch_Time:28473.0min: [2024-01-02 23:42:23,789][model8_pretrain.py][INFO] Epoch:[0/2](57700/4588595) loss:2.850 lr:0.0000634 epoch_Time:28473.0min: [2024-01-02 23:42:23,789][model8_pretrain.py][INFO] Epoch:[0/2](57700/4588595) loss:2.796 lr:0.0000634 epoch_Time:28473.0min: [2024-01-02 23:43:00,731][model8_pretrain.py][INFO] Epoch:[0/2](57800/4588595) loss:3.569 lr:0.0000629 epoch_Time:28471.0min: [2024-01-02 23:43:00,731][model8_pretrain.py][INFO] Epoch:[0/2](57800/4588595) loss:2.822 lr:0.0000629 epoch_Time:28471.0min: [2024-01-02 23:43:00,731][model8_pretrain.py][INFO] Epoch:[0/2](57800/4588595) loss:3.046 lr:0.0000629 epoch_Time:28471.0min: [2024-01-02 23:43:00,731][model8_pretrain.py][INFO] Epoch:[0/2](57800/4588595) loss:2.874 lr:0.0000629 epoch_Time:28471.0min: [2024-01-02 23:43:00,731][model8_pretrain.py][INFO] Epoch:[0/2](57800/4588595) loss:3.062 lr:0.0000629 epoch_Time:28471.0min: [2024-01-02 23:43:00,731][model8_pretrain.py][INFO] Epoch:[0/2](57800/4588595) loss:3.135 lr:0.0000629 epoch_Time:28471.0min: [2024-01-02 23:43:00,731][model8_pretrain.py][INFO] Epoch:[0/2](57800/4588595) loss:3.147 lr:0.0000629 epoch_Time:28471.0min: [2024-01-02 23:43:00,732][model8_pretrain.py][INFO] Epoch:[0/2](57800/4588595) loss:2.798 lr:0.0000629 epoch_Time:28471.0min: [2024-01-02 23:43:37,689][model8_pretrain.py][INFO] Epoch:[0/2](57900/4588595) loss:2.931 lr:0.0000625 epoch_Time:28470.0min: [2024-01-02 23:43:37,689][model8_pretrain.py][INFO] Epoch:[0/2](57900/4588595) loss:2.715 lr:0.0000625 epoch_Time:28470.0min: [2024-01-02 23:43:37,689][model8_pretrain.py][INFO] Epoch:[0/2](57900/4588595) loss:3.479 lr:0.0000625 epoch_Time:28470.0min: [2024-01-02 23:43:37,689][model8_pretrain.py][INFO] 
Epoch:[0/2](57900/4588595) loss:3.115 lr:0.0000625 epoch_Time:28470.0min: [2024-01-02 23:43:37,689][model8_pretrain.py][INFO] Epoch:[0/2](57900/4588595) loss:3.063 lr:0.0000625 epoch_Time:28470.0min: [2024-01-02 23:43:37,689][model8_pretrain.py][INFO] Epoch:[0/2](57900/4588595) loss:3.150 lr:0.0000625 epoch_Time:28470.0min: [2024-01-02 23:43:37,689][model8_pretrain.py][INFO] Epoch:[0/2](57900/4588595) loss:2.805 lr:0.0000625 epoch_Time:28470.0min: [2024-01-02 23:43:37,690][model8_pretrain.py][INFO] Epoch:[0/2](57900/4588595) loss:3.040 lr:0.0000625 epoch_Time:28470.0min: [2024-01-02 23:44:14,717][model8_pretrain.py][INFO] Epoch:[0/2](58000/4588595) loss:2.524 lr:0.0000620 epoch_Time:28468.0min: [2024-01-02 23:44:14,717][model8_pretrain.py][INFO] Epoch:[0/2](58000/4588595) loss:3.088 lr:0.0000620 epoch_Time:28468.0min: [2024-01-02 23:44:14,717][model8_pretrain.py][INFO] Epoch:[0/2](58000/4588595) loss:3.350 lr:0.0000620 epoch_Time:28468.0min: [2024-01-02 23:44:14,717][model8_pretrain.py][INFO] Epoch:[0/2](58000/4588595) loss:2.516 lr:0.0000620 epoch_Time:28468.0min: [2024-01-02 23:44:14,717][model8_pretrain.py][INFO] Epoch:[0/2](58000/4588595) loss:2.958 lr:0.0000620 epoch_Time:28468.0min: [2024-01-02 23:44:14,717][model8_pretrain.py][INFO] Epoch:[0/2](58000/4588595) loss:2.480 lr:0.0000620 epoch_Time:28468.0min: [2024-01-02 23:44:14,717][model8_pretrain.py][INFO] Epoch:[0/2](58000/4588595) loss:3.040 lr:0.0000620 epoch_Time:28468.0min: [2024-01-02 23:44:14,717][model8_pretrain.py][INFO] Epoch:[0/2](58000/4588595) loss:2.870 lr:0.0000620 epoch_Time:28468.0min: [2024-01-02 23:44:51,732][model8_pretrain.py][INFO] Epoch:[0/2](58100/4588595) loss:3.383 lr:0.0000616 epoch_Time:28466.0min: [2024-01-02 23:44:51,733][model8_pretrain.py][INFO] Epoch:[0/2](58100/4588595) loss:3.296 lr:0.0000616 epoch_Time:28466.0min: [2024-01-02 23:44:51,733][model8_pretrain.py][INFO] Epoch:[0/2](58100/4588595) loss:2.877 lr:0.0000616 epoch_Time:28466.0min: [2024-01-02 
23:44:51,733][model8_pretrain.py][INFO] Epoch:[0/2](58100/4588595) loss:3.134 lr:0.0000616 epoch_Time:28466.0min: [2024-01-02 23:44:51,733][model8_pretrain.py][INFO] Epoch:[0/2](58100/4588595) loss:2.813 lr:0.0000616 epoch_Time:28466.0min: [2024-01-02 23:44:51,733][model8_pretrain.py][INFO] Epoch:[0/2](58100/4588595) loss:2.959 lr:0.0000616 epoch_Time:28466.0min: [2024-01-02 23:44:51,733][model8_pretrain.py][INFO] Epoch:[0/2](58100/4588595) loss:2.756 lr:0.0000616 epoch_Time:28466.0min: [2024-01-02 23:44:51,733][model8_pretrain.py][INFO] Epoch:[0/2](58100/4588595) loss:3.132 lr:0.0000616 epoch_Time:28466.0min: [2024-01-02 23:45:28,692][model8_pretrain.py][INFO] Epoch:[0/2](58200/4588595) loss:3.348 lr:0.0000612 epoch_Time:28465.0min: [2024-01-02 23:45:28,692][model8_pretrain.py][INFO] Epoch:[0/2](58200/4588595) loss:2.676 lr:0.0000612 epoch_Time:28465.0min: [2024-01-02 23:45:28,692][model8_pretrain.py][INFO] Epoch:[0/2](58200/4588595) loss:2.840 lr:0.0000612 epoch_Time:28465.0min: [2024-01-02 23:45:28,692][model8_pretrain.py][INFO] Epoch:[0/2](58200/4588595) loss:2.773 lr:0.0000612 epoch_Time:28465.0min: [2024-01-02 23:45:28,692][model8_pretrain.py][INFO] Epoch:[0/2](58200/4588595) loss:3.353 lr:0.0000612 epoch_Time:28465.0min: [2024-01-02 23:45:28,692][model8_pretrain.py][INFO] Epoch:[0/2](58200/4588595) loss:2.787 lr:0.0000612 epoch_Time:28465.0min: [2024-01-02 23:45:28,692][model8_pretrain.py][INFO] Epoch:[0/2](58200/4588595) loss:3.156 lr:0.0000612 epoch_Time:28465.0min: [2024-01-02 23:45:28,693][model8_pretrain.py][INFO] Epoch:[0/2](58200/4588595) loss:2.922 lr:0.0000612 epoch_Time:28465.0min: [2024-01-02 23:46:12,701][model8_pretrain.py][INFO] Epoch:[0/2](58300/4588595) loss:2.795 lr:0.0000607 epoch_Time:28473.0min: [2024-01-02 23:46:12,701][model8_pretrain.py][INFO] Epoch:[0/2](58300/4588595) loss:3.116 lr:0.0000607 epoch_Time:28473.0min: [2024-01-02 23:46:12,701][model8_pretrain.py][INFO] Epoch:[0/2](58300/4588595) loss:3.675 lr:0.0000607 
epoch_Time:28473.0min: [2024-01-02 23:46:12,701][model8_pretrain.py][INFO] Epoch:[0/2](58300/4588595) loss:2.953 lr:0.0000607 epoch_Time:28473.0min: [2024-01-02 23:46:12,701][model8_pretrain.py][INFO] Epoch:[0/2](58300/4588595) loss:3.059 lr:0.0000607 epoch_Time:28473.0min: [2024-01-02 23:46:12,701][model8_pretrain.py][INFO] Epoch:[0/2](58300/4588595) loss:3.440 lr:0.0000607 epoch_Time:28473.0min: [2024-01-02 23:46:12,702][model8_pretrain.py][INFO] Epoch:[0/2](58300/4588595) loss:2.669 lr:0.0000607 epoch_Time:28473.0min: [2024-01-02 23:46:12,702][model8_pretrain.py][INFO] Epoch:[0/2](58300/4588595) loss:2.660 lr:0.0000607 epoch_Time:28473.0min: [2024-01-02 23:46:49,658][model8_pretrain.py][INFO] Epoch:[0/2](58400/4588595) loss:3.047 lr:0.0000603 epoch_Time:28471.0min: [2024-01-02 23:46:49,659][model8_pretrain.py][INFO] Epoch:[0/2](58400/4588595) loss:2.338 lr:0.0000603 epoch_Time:28471.0min: [2024-01-02 23:46:49,659][model8_pretrain.py][INFO] Epoch:[0/2](58400/4588595) loss:2.887 lr:0.0000603 epoch_Time:28471.0min: [2024-01-02 23:46:49,659][model8_pretrain.py][INFO] Epoch:[0/2](58400/4588595) loss:3.198 lr:0.0000603 epoch_Time:28471.0min: [2024-01-02 23:46:49,659][model8_pretrain.py][INFO] Epoch:[0/2](58400/4588595) loss:2.869 lr:0.0000603 epoch_Time:28471.0min: [2024-01-02 23:46:49,659][model8_pretrain.py][INFO] Epoch:[0/2](58400/4588595) loss:3.231 lr:0.0000603 epoch_Time:28471.0min: [2024-01-02 23:46:49,659][model8_pretrain.py][INFO] Epoch:[0/2](58400/4588595) loss:3.165 lr:0.0000603 epoch_Time:28471.0min: [2024-01-02 23:46:49,659][model8_pretrain.py][INFO] Epoch:[0/2](58400/4588595) loss:3.366 lr:0.0000603 epoch_Time:28471.0min: [2024-01-02 23:47:26,589][model8_pretrain.py][INFO] Epoch:[0/2](58500/4588595) loss:2.815 lr:0.0000598 epoch_Time:28470.0min: [2024-01-02 23:47:26,589][model8_pretrain.py][INFO] Epoch:[0/2](58500/4588595) loss:3.007 lr:0.0000598 epoch_Time:28470.0min: [2024-01-02 23:47:26,590][model8_pretrain.py][INFO] Epoch:[0/2](58500/4588595) 
loss:3.032 lr:0.0000598 epoch_Time:28470.0min: [2024-01-02 23:47:26,589][model8_pretrain.py][INFO] Epoch:[0/2](58500/4588595) loss:2.976 lr:0.0000598 epoch_Time:28470.0min: [2024-01-02 23:47:26,589][model8_pretrain.py][INFO] Epoch:[0/2](58500/4588595) loss:3.161 lr:0.0000598 epoch_Time:28470.0min: [2024-01-02 23:47:26,590][model8_pretrain.py][INFO] Epoch:[0/2](58500/4588595) loss:3.208 lr:0.0000598 epoch_Time:28470.0min: [2024-01-02 23:47:26,589][model8_pretrain.py][INFO] Epoch:[0/2](58500/4588595) loss:3.484 lr:0.0000598 epoch_Time:28470.0min: [2024-01-02 23:47:26,590][model8_pretrain.py][INFO] Epoch:[0/2](58500/4588595) loss:3.076 lr:0.0000598 epoch_Time:28470.0min: [2024-01-02 23:48:03,533][model8_pretrain.py][INFO] Epoch:[0/2](58600/4588595) loss:3.178 lr:0.0000594 epoch_Time:28468.0min: [2024-01-02 23:48:03,533][model8_pretrain.py][INFO] Epoch:[0/2](58600/4588595) loss:3.220 lr:0.0000594 epoch_Time:28468.0min: [2024-01-02 23:48:03,533][model8_pretrain.py][INFO] Epoch:[0/2](58600/4588595) loss:3.205 lr:0.0000594 epoch_Time:28468.0min: [2024-01-02 23:48:03,533][model8_pretrain.py][INFO] Epoch:[0/2](58600/4588595) loss:3.185 lr:0.0000594 epoch_Time:28468.0min: [2024-01-02 23:48:03,533][model8_pretrain.py][INFO] Epoch:[0/2](58600/4588595) loss:2.798 lr:0.0000594 epoch_Time:28468.0min: [2024-01-02 23:48:03,533][model8_pretrain.py][INFO] Epoch:[0/2](58600/4588595) loss:3.145 lr:0.0000594 epoch_Time:28468.0min: [2024-01-02 23:48:03,533][model8_pretrain.py][INFO] Epoch:[0/2](58600/4588595) loss:2.801 lr:0.0000594 epoch_Time:28468.0min: [2024-01-02 23:48:03,533][model8_pretrain.py][INFO] Epoch:[0/2](58600/4588595) loss:3.061 lr:0.0000594 epoch_Time:28468.0min: [2024-01-02 23:48:40,466][model8_pretrain.py][INFO] Epoch:[0/2](58700/4588595) loss:3.505 lr:0.0000590 epoch_Time:28467.0min: [2024-01-02 23:48:40,466][model8_pretrain.py][INFO] Epoch:[0/2](58700/4588595) loss:3.103 lr:0.0000590 epoch_Time:28467.0min: [2024-01-02 23:48:40,466][model8_pretrain.py][INFO] 
Epoch:[0/2](58700/4588595) loss:2.809 lr:0.0000590 epoch_Time:28467.0min: [2024-01-02 23:48:40,466][model8_pretrain.py][INFO] Epoch:[0/2](58700/4588595) loss:3.334 lr:0.0000590 epoch_Time:28467.0min: [2024-01-02 23:48:40,466][model8_pretrain.py][INFO] Epoch:[0/2](58700/4588595) loss:3.327 lr:0.0000590 epoch_Time:28467.0min: [2024-01-02 23:48:40,466][model8_pretrain.py][INFO] Epoch:[0/2](58700/4588595) loss:2.715 lr:0.0000590 epoch_Time:28467.0min: [2024-01-02 23:48:40,467][model8_pretrain.py][INFO] Epoch:[0/2](58700/4588595) loss:3.072 lr:0.0000590 epoch_Time:28467.0min: [2024-01-02 23:48:40,467][model8_pretrain.py][INFO] Epoch:[0/2](58700/4588595) loss:2.695 lr:0.0000590 epoch_Time:28467.0min: [2024-01-02 23:49:17,399][model8_pretrain.py][INFO] Epoch:[0/2](58800/4588595) loss:2.838 lr:0.0000585 epoch_Time:28465.0min: [2024-01-02 23:49:17,399][model8_pretrain.py][INFO] Epoch:[0/2](58800/4588595) loss:3.277 lr:0.0000585 epoch_Time:28465.0min: [2024-01-02 23:49:17,399][model8_pretrain.py][INFO] Epoch:[0/2](58800/4588595) loss:3.404 lr:0.0000585 epoch_Time:28465.0min: [2024-01-02 23:49:17,399][model8_pretrain.py][INFO] Epoch:[0/2](58800/4588595) loss:2.972 lr:0.0000585 epoch_Time:28465.0min: [2024-01-02 23:49:17,399][model8_pretrain.py][INFO] Epoch:[0/2](58800/4588595) loss:2.759 lr:0.0000585 epoch_Time:28465.0min: [2024-01-02 23:49:17,399][model8_pretrain.py][INFO] Epoch:[0/2](58800/4588595) loss:3.208 lr:0.0000585 epoch_Time:28465.0min: [2024-01-02 23:49:17,399][model8_pretrain.py][INFO] Epoch:[0/2](58800/4588595) loss:3.289 lr:0.0000585 epoch_Time:28465.0min: [2024-01-02 23:49:17,399][model8_pretrain.py][INFO] Epoch:[0/2](58800/4588595) loss:3.252 lr:0.0000585 epoch_Time:28465.0min: [2024-01-02 23:49:54,339][model8_pretrain.py][INFO] Epoch:[0/2](58900/4588595) loss:3.395 lr:0.0000581 epoch_Time:28463.0min: [2024-01-02 23:49:54,339][model8_pretrain.py][INFO] Epoch:[0/2](58900/4588595) loss:3.079 lr:0.0000581 epoch_Time:28463.0min: [2024-01-02 
23:49:54,339][model8_pretrain.py][INFO] Epoch:[0/2](58900/4588595) loss:3.175 lr:0.0000581 epoch_Time:28463.0min: [2024-01-02 23:49:54,339][model8_pretrain.py][INFO] Epoch:[0/2](58900/4588595) loss:3.300 lr:0.0000581 epoch_Time:28463.0min: [2024-01-02 23:49:54,339][model8_pretrain.py][INFO] Epoch:[0/2](58900/4588595) loss:3.506 lr:0.0000581 epoch_Time:28463.0min: [2024-01-02 23:49:54,339][model8_pretrain.py][INFO] Epoch:[0/2](58900/4588595) loss:2.807 lr:0.0000581 epoch_Time:28463.0min: [2024-01-02 23:49:54,339][model8_pretrain.py][INFO] Epoch:[0/2](58900/4588595) loss:3.433 lr:0.0000581 epoch_Time:28463.0min: [2024-01-02 23:49:54,339][model8_pretrain.py][INFO] Epoch:[0/2](58900/4588595) loss:3.409 lr:0.0000581 epoch_Time:28463.0min: [2024-01-02 23:50:31,302][model8_pretrain.py][INFO] Epoch:[0/2](59000/4588595) loss:2.763 lr:0.0000577 epoch_Time:28462.0min: [2024-01-02 23:50:31,302][model8_pretrain.py][INFO] Epoch:[0/2](59000/4588595) loss:3.379 lr:0.0000577 epoch_Time:28462.0min: [2024-01-02 23:50:31,302][model8_pretrain.py][INFO] Epoch:[0/2](59000/4588595) loss:2.950 lr:0.0000577 epoch_Time:28462.0min: [2024-01-02 23:50:31,302][model8_pretrain.py][INFO] Epoch:[0/2](59000/4588595) loss:3.207 lr:0.0000577 epoch_Time:28462.0min: [2024-01-02 23:50:31,302][model8_pretrain.py][INFO] Epoch:[0/2](59000/4588595) loss:3.464 lr:0.0000577 epoch_Time:28462.0min: [2024-01-02 23:50:31,302][model8_pretrain.py][INFO] Epoch:[0/2](59000/4588595) loss:2.974 lr:0.0000577 epoch_Time:28462.0min: [2024-01-02 23:50:31,302][model8_pretrain.py][INFO] Epoch:[0/2](59000/4588595) loss:3.633 lr:0.0000577 epoch_Time:28462.0min: [2024-01-02 23:50:31,302][model8_pretrain.py][INFO] Epoch:[0/2](59000/4588595) loss:3.080 lr:0.0000577 epoch_Time:28462.0min: [2024-01-02 23:51:15,708][model8_pretrain.py][INFO] Epoch:[0/2](59100/4588595) loss:3.484 lr:0.0000573 epoch_Time:28469.0min: [2024-01-02 23:51:15,708][model8_pretrain.py][INFO] Epoch:[0/2](59100/4588595) loss:3.434 lr:0.0000573 
epoch_Time:28469.0min: [2024-01-02 23:51:15,708][model8_pretrain.py][INFO] Epoch:[0/2](59100/4588595) loss:3.018 lr:0.0000573 epoch_Time:28469.0min: [2024-01-02 23:51:15,708][model8_pretrain.py][INFO] Epoch:[0/2](59100/4588595) loss:2.855 lr:0.0000573 epoch_Time:28469.0min: [2024-01-02 23:51:15,708][model8_pretrain.py][INFO] Epoch:[0/2](59100/4588595) loss:2.704 lr:0.0000573 epoch_Time:28469.0min: [2024-01-02 23:51:15,708][model8_pretrain.py][INFO] Epoch:[0/2](59100/4588595) loss:2.726 lr:0.0000573 epoch_Time:28469.0min: [2024-01-02 23:51:15,708][model8_pretrain.py][INFO] Epoch:[0/2](59100/4588595) loss:3.473 lr:0.0000573 epoch_Time:28469.0min: [2024-01-02 23:51:15,708][model8_pretrain.py][INFO] Epoch:[0/2](59100/4588595) loss:3.231 lr:0.0000573 epoch_Time:28469.0min: [2024-01-02 23:51:52,640][model8_pretrain.py][INFO] Epoch:[0/2](59200/4588595) loss:2.937 lr:0.0000568 epoch_Time:28467.0min: [2024-01-02 23:51:52,640][model8_pretrain.py][INFO] Epoch:[0/2](59200/4588595) loss:3.488 lr:0.0000568 epoch_Time:28467.0min: [2024-01-02 23:51:52,640][model8_pretrain.py][INFO] Epoch:[0/2](59200/4588595) loss:3.221 lr:0.0000568 epoch_Time:28467.0min: [2024-01-02 23:51:52,640][model8_pretrain.py][INFO] Epoch:[0/2](59200/4588595) loss:3.041 lr:0.0000568 epoch_Time:28467.0min: [2024-01-02 23:51:52,640][model8_pretrain.py][INFO] Epoch:[0/2](59200/4588595) loss:3.124 lr:0.0000568 epoch_Time:28467.0min: [2024-01-02 23:51:52,640][model8_pretrain.py][INFO] Epoch:[0/2](59200/4588595) loss:3.134 lr:0.0000568 epoch_Time:28467.0min: [2024-01-02 23:51:52,640][model8_pretrain.py][INFO] Epoch:[0/2](59200/4588595) loss:3.343 lr:0.0000568 epoch_Time:28467.0min: [2024-01-02 23:51:52,640][model8_pretrain.py][INFO] Epoch:[0/2](59200/4588595) loss:3.391 lr:0.0000568 epoch_Time:28467.0min: [2024-01-02 23:52:29,580][model8_pretrain.py][INFO] Epoch:[0/2](59300/4588595) loss:3.063 lr:0.0000564 epoch_Time:28466.0min: [2024-01-02 23:52:29,580][model8_pretrain.py][INFO] Epoch:[0/2](59300/4588595) 
loss:2.227 lr:0.0000564 epoch_Time:28466.0min: [2024-01-02 23:52:29,580][model8_pretrain.py][INFO] Epoch:[0/2](59300/4588595) loss:3.040 lr:0.0000564 epoch_Time:28466.0min: [2024-01-02 23:52:29,580][model8_pretrain.py][INFO] Epoch:[0/2](59300/4588595) loss:2.956 lr:0.0000564 epoch_Time:28466.0min: [2024-01-02 23:52:29,580][model8_pretrain.py][INFO] Epoch:[0/2](59300/4588595) loss:2.747 lr:0.0000564 epoch_Time:28466.0min: [2024-01-02 23:52:29,580][model8_pretrain.py][INFO] Epoch:[0/2](59300/4588595) loss:2.944 lr:0.0000564 epoch_Time:28466.0min: [2024-01-02 23:52:29,580][model8_pretrain.py][INFO] Epoch:[0/2](59300/4588595) loss:3.233 lr:0.0000564 epoch_Time:28466.0min: [2024-01-02 23:52:29,580][model8_pretrain.py][INFO] Epoch:[0/2](59300/4588595) loss:3.010 lr:0.0000564 epoch_Time:28466.0min: [2024-01-02 23:53:06,538][model8_pretrain.py][INFO] Epoch:[0/2](59400/4588595) loss:3.073 lr:0.0000560 epoch_Time:28464.0min: [2024-01-02 23:53:06,538][model8_pretrain.py][INFO] Epoch:[0/2](59400/4588595) loss:3.463 lr:0.0000560 epoch_Time:28464.0min: [2024-01-02 23:53:06,538][model8_pretrain.py][INFO] Epoch:[0/2](59400/4588595) loss:2.954 lr:0.0000560 epoch_Time:28464.0min: [2024-01-02 23:53:06,538][model8_pretrain.py][INFO] Epoch:[0/2](59400/4588595) loss:3.284 lr:0.0000560 epoch_Time:28464.0min: [2024-01-02 23:53:06,538][model8_pretrain.py][INFO] Epoch:[0/2](59400/4588595) loss:3.354 lr:0.0000560 epoch_Time:28464.0min: [2024-01-02 23:53:06,538][model8_pretrain.py][INFO] Epoch:[0/2](59400/4588595) loss:3.637 lr:0.0000560 epoch_Time:28464.0min: [2024-01-02 23:53:06,538][model8_pretrain.py][INFO] Epoch:[0/2](59400/4588595) loss:3.099 lr:0.0000560 epoch_Time:28464.0min: [2024-01-02 23:53:06,538][model8_pretrain.py][INFO] Epoch:[0/2](59400/4588595) loss:2.825 lr:0.0000560 epoch_Time:28464.0min: [2024-01-02 23:53:43,456][model8_pretrain.py][INFO] Epoch:[0/2](59500/4588595) loss:3.275 lr:0.0000556 epoch_Time:28463.0min: [2024-01-02 23:53:43,456][model8_pretrain.py][INFO] 
Epoch:[0/2](59500/4588595) loss:3.230 lr:0.0000556 epoch_Time:28463.0min: [2024-01-02 23:53:43,456][model8_pretrain.py][INFO] Epoch:[0/2](59500/4588595) loss:3.353 lr:0.0000556 epoch_Time:28463.0min: [2024-01-02 23:53:43,456][model8_pretrain.py][INFO] Epoch:[0/2](59500/4588595) loss:2.951 lr:0.0000556 epoch_Time:28463.0min: [2024-01-02 23:53:43,456][model8_pretrain.py][INFO] Epoch:[0/2](59500/4588595) loss:3.246 lr:0.0000556 epoch_Time:28463.0min: [2024-01-02 23:53:43,456][model8_pretrain.py][INFO] Epoch:[0/2](59500/4588595) loss:3.065 lr:0.0000556 epoch_Time:28463.0min: [2024-01-02 23:53:43,456][model8_pretrain.py][INFO] Epoch:[0/2](59500/4588595) loss:3.024 lr:0.0000556 epoch_Time:28463.0min: [2024-01-02 23:53:43,456][model8_pretrain.py][INFO] Epoch:[0/2](59500/4588595) loss:3.290 lr:0.0000556 epoch_Time:28463.0min: [2024-01-02 23:54:20,398][model8_pretrain.py][INFO] Epoch:[0/2](59600/4588595) loss:3.152 lr:0.0000552 epoch_Time:28461.0min: [2024-01-02 23:54:20,398][model8_pretrain.py][INFO] Epoch:[0/2](59600/4588595) loss:3.208 lr:0.0000552 epoch_Time:28461.0min: [2024-01-02 23:54:20,398][model8_pretrain.py][INFO] Epoch:[0/2](59600/4588595) loss:2.794 lr:0.0000552 epoch_Time:28461.0min: [2024-01-02 23:54:20,398][model8_pretrain.py][INFO] Epoch:[0/2](59600/4588595) loss:3.134 lr:0.0000552 epoch_Time:28461.0min: [2024-01-02 23:54:20,398][model8_pretrain.py][INFO] Epoch:[0/2](59600/4588595) loss:3.179 lr:0.0000552 epoch_Time:28461.0min: [2024-01-02 23:54:20,398][model8_pretrain.py][INFO] Epoch:[0/2](59600/4588595) loss:3.616 lr:0.0000552 epoch_Time:28461.0min: [2024-01-02 23:54:20,399][model8_pretrain.py][INFO] Epoch:[0/2](59600/4588595) loss:3.421 lr:0.0000552 epoch_Time:28461.0min: [2024-01-02 23:54:20,399][model8_pretrain.py][INFO] Epoch:[0/2](59600/4588595) loss:2.398 lr:0.0000552 epoch_Time:28461.0min: [2024-01-02 23:54:57,365][model8_pretrain.py][INFO] Epoch:[0/2](59700/4588595) loss:3.373 lr:0.0000547 epoch_Time:28459.0min: [2024-01-02 
23:54:57,365][model8_pretrain.py][INFO] Epoch:[0/2](59700/4588595) loss:2.831 lr:0.0000547 epoch_Time:28459.0min: [2024-01-02 23:54:57,365][model8_pretrain.py][INFO] Epoch:[0/2](59700/4588595) loss:3.431 lr:0.0000547 epoch_Time:28459.0min: [2024-01-02 23:54:57,365][model8_pretrain.py][INFO] Epoch:[0/2](59700/4588595) loss:3.749 lr:0.0000547 epoch_Time:28459.0min: [2024-01-02 23:54:57,365][model8_pretrain.py][INFO] Epoch:[0/2](59700/4588595) loss:3.384 lr:0.0000547 epoch_Time:28459.0min: [2024-01-02 23:54:57,366][model8_pretrain.py][INFO] Epoch:[0/2](59700/4588595) loss:3.198 lr:0.0000547 epoch_Time:28459.0min: [2024-01-02 23:54:57,366][model8_pretrain.py][INFO] Epoch:[0/2](59700/4588595) loss:3.052 lr:0.0000547 epoch_Time:28459.0min: [2024-01-02 23:54:57,366][model8_pretrain.py][INFO] Epoch:[0/2](59700/4588595) loss:3.152 lr:0.0000547 epoch_Time:28459.0min: [2024-01-02 23:55:34,300][model8_pretrain.py][INFO] Epoch:[0/2](59800/4588595) loss:2.870 lr:0.0000543 epoch_Time:28458.0min: [2024-01-02 23:55:34,300][model8_pretrain.py][INFO] Epoch:[0/2](59800/4588595) loss:3.135 lr:0.0000543 epoch_Time:28458.0min: [2024-01-02 23:55:34,300][model8_pretrain.py][INFO] Epoch:[0/2](59800/4588595) loss:3.063 lr:0.0000543 epoch_Time:28458.0min: [2024-01-02 23:55:34,301][model8_pretrain.py][INFO] Epoch:[0/2](59800/4588595) loss:2.715 lr:0.0000543 epoch_Time:28458.0min: [2024-01-02 23:55:34,300][model8_pretrain.py][INFO] Epoch:[0/2](59800/4588595) loss:3.263 lr:0.0000543 epoch_Time:28458.0min: [2024-01-02 23:55:34,300][model8_pretrain.py][INFO] Epoch:[0/2](59800/4588595) loss:3.408 lr:0.0000543 epoch_Time:28458.0min: [2024-01-02 23:55:34,301][model8_pretrain.py][INFO] Epoch:[0/2](59800/4588595) loss:3.208 lr:0.0000543 epoch_Time:28458.0min: [2024-01-02 23:55:34,301][model8_pretrain.py][INFO] Epoch:[0/2](59800/4588595) loss:3.088 lr:0.0000543 epoch_Time:28458.0min: [2024-01-02 23:56:17,967][model8_pretrain.py][INFO] Epoch:[0/2](59900/4588595) loss:3.287 lr:0.0000539 
epoch_Time:28465.0min: [2024-01-02 23:56:17,967][model8_pretrain.py][INFO] Epoch:[0/2](59900/4588595) loss:3.371 lr:0.0000539 epoch_Time:28465.0min: [2024-01-02 23:56:17,967][model8_pretrain.py][INFO] Epoch:[0/2](59900/4588595) loss:2.369 lr:0.0000539 epoch_Time:28465.0min: [2024-01-02 23:56:17,967][model8_pretrain.py][INFO] Epoch:[0/2](59900/4588595) loss:3.286 lr:0.0000539 epoch_Time:28465.0min: [2024-01-02 23:56:17,967][model8_pretrain.py][INFO] Epoch:[0/2](59900/4588595) loss:3.011 lr:0.0000539 epoch_Time:28465.0min: [2024-01-02 23:56:17,967][model8_pretrain.py][INFO] Epoch:[0/2](59900/4588595) loss:3.561 lr:0.0000539 epoch_Time:28465.0min: [2024-01-02 23:56:17,967][model8_pretrain.py][INFO] Epoch:[0/2](59900/4588595) loss:3.284 lr:0.0000539 epoch_Time:28465.0min: [2024-01-02 23:56:17,967][model8_pretrain.py][INFO] Epoch:[0/2](59900/4588595) loss:3.182 lr:0.0000539 epoch_Time:28465.0min: [2024-01-02 23:56:54,897][model8_pretrain.py][INFO] Epoch:[0/2](60000/4588595) loss:3.305 lr:0.0000535 epoch_Time:28463.0min: [2024-01-02 23:56:54,897][model8_pretrain.py][INFO] Epoch:[0/2](60000/4588595) loss:3.106 lr:0.0000535 epoch_Time:28463.0min: [2024-01-02 23:56:54,897][model8_pretrain.py][INFO] Epoch:[0/2](60000/4588595) loss:3.163 lr:0.0000535 epoch_Time:28463.0min: [2024-01-02 23:56:54,897][model8_pretrain.py][INFO] Epoch:[0/2](60000/4588595) loss:2.851 lr:0.0000535 epoch_Time:28463.0min: [2024-01-02 23:56:54,897][model8_pretrain.py][INFO] Epoch:[0/2](60000/4588595) loss:3.054 lr:0.0000535 epoch_Time:28463.0min: [2024-01-02 23:56:54,897][model8_pretrain.py][INFO] Epoch:[0/2](60000/4588595) loss:3.255 lr:0.0000535 epoch_Time:28463.0min: [2024-01-02 23:56:54,897][model8_pretrain.py][INFO] Epoch:[0/2](60000/4588595) loss:3.157 lr:0.0000535 epoch_Time:28463.0min: [2024-01-02 23:56:54,898][model8_pretrain.py][INFO] Epoch:[0/2](60000/4588595) loss:3.197 lr:0.0000535 epoch_Time:28463.0min: [2024-01-02 23:57:31,841][model8_pretrain.py][INFO] Epoch:[0/2](60100/4588595) 
loss:2.489 lr:0.0000531 epoch_Time:28462.0min: [2024-01-02 23:57:31,841][model8_pretrain.py][INFO] Epoch:[0/2](60100/4588595) loss:2.664 lr:0.0000531 epoch_Time:28462.0min: [2024-01-02 23:57:31,841][model8_pretrain.py][INFO] Epoch:[0/2](60100/4588595) loss:3.142 lr:0.0000531 epoch_Time:28462.0min: [2024-01-02 23:57:31,841][model8_pretrain.py][INFO] Epoch:[0/2](60100/4588595) loss:2.982 lr:0.0000531 epoch_Time:28462.0min: [2024-01-02 23:57:31,841][model8_pretrain.py][INFO] Epoch:[0/2](60100/4588595) loss:3.347 lr:0.0000531 epoch_Time:28462.0min: [2024-01-02 23:57:31,841][model8_pretrain.py][INFO] Epoch:[0/2](60100/4588595) loss:3.248 lr:0.0000531 epoch_Time:28462.0min: [2024-01-02 23:57:31,841][model8_pretrain.py][INFO] Epoch:[0/2](60100/4588595) loss:3.216 lr:0.0000531 epoch_Time:28462.0min: [2024-01-02 23:57:31,841][model8_pretrain.py][INFO] Epoch:[0/2](60100/4588595) loss:3.582 lr:0.0000531 epoch_Time:28462.0min: [2024-01-02 23:58:08,793][model8_pretrain.py][INFO] Epoch:[0/2](60200/4588595) loss:3.405 lr:0.0000527 epoch_Time:28460.0min: [2024-01-02 23:58:08,793][model8_pretrain.py][INFO] Epoch:[0/2](60200/4588595) loss:2.895 lr:0.0000527 epoch_Time:28460.0min: [2024-01-02 23:58:08,793][model8_pretrain.py][INFO] Epoch:[0/2](60200/4588595) loss:2.671 lr:0.0000527 epoch_Time:28460.0min: [2024-01-02 23:58:08,793][model8_pretrain.py][INFO] Epoch:[0/2](60200/4588595) loss:3.008 lr:0.0000527 epoch_Time:28460.0min: [2024-01-02 23:58:08,793][model8_pretrain.py][INFO] Epoch:[0/2](60200/4588595) loss:3.418 lr:0.0000527 epoch_Time:28460.0min: [2024-01-02 23:58:08,793][model8_pretrain.py][INFO] Epoch:[0/2](60200/4588595) loss:2.844 lr:0.0000527 epoch_Time:28460.0min: [2024-01-02 23:58:08,793][model8_pretrain.py][INFO] Epoch:[0/2](60200/4588595) loss:3.175 lr:0.0000527 epoch_Time:28460.0min: [2024-01-02 23:58:08,793][model8_pretrain.py][INFO] Epoch:[0/2](60200/4588595) loss:3.144 lr:0.0000527 epoch_Time:28460.0min: [2024-01-02 23:58:45,731][model8_pretrain.py][INFO] 
Epoch:[0/2](60300/4588595) loss:2.931 lr:0.0000523 epoch_Time:28459.0min: [2024-01-02 23:58:45,731][model8_pretrain.py][INFO] Epoch:[0/2](60300/4588595) loss:3.007 lr:0.0000523 epoch_Time:28459.0min: [2024-01-02 23:58:45,731][model8_pretrain.py][INFO] Epoch:[0/2](60300/4588595) loss:3.764 lr:0.0000523 epoch_Time:28459.0min: [2024-01-02 23:58:45,731][model8_pretrain.py][INFO] Epoch:[0/2](60300/4588595) loss:3.405 lr:0.0000523 epoch_Time:28459.0min: [2024-01-02 23:58:45,731][model8_pretrain.py][INFO] Epoch:[0/2](60300/4588595) loss:2.875 lr:0.0000523 epoch_Time:28459.0min: [2024-01-02 23:58:45,731][model8_pretrain.py][INFO] Epoch:[0/2](60300/4588595) loss:2.563 lr:0.0000523 epoch_Time:28459.0min: [2024-01-02 23:58:45,731][model8_pretrain.py][INFO] Epoch:[0/2](60300/4588595) loss:2.735 lr:0.0000523 epoch_Time:28459.0min: [2024-01-02 23:58:45,732][model8_pretrain.py][INFO] Epoch:[0/2](60300/4588595) loss:3.099 lr:0.0000523 epoch_Time:28459.0min: [2024-01-02 23:59:22,669][model8_pretrain.py][INFO] Epoch:[0/2](60400/4588595) loss:3.235 lr:0.0000519 epoch_Time:28457.0min: [2024-01-02 23:59:22,670][model8_pretrain.py][INFO] Epoch:[0/2](60400/4588595) loss:3.328 lr:0.0000519 epoch_Time:28457.0min: [2024-01-02 23:59:22,670][model8_pretrain.py][INFO] Epoch:[0/2](60400/4588595) loss:3.408 lr:0.0000519 epoch_Time:28457.0min: [2024-01-02 23:59:22,670][model8_pretrain.py][INFO] Epoch:[0/2](60400/4588595) loss:3.432 lr:0.0000519 epoch_Time:28457.0min: [2024-01-02 23:59:22,670][model8_pretrain.py][INFO] Epoch:[0/2](60400/4588595) loss:3.127 lr:0.0000519 epoch_Time:28457.0min: [2024-01-02 23:59:22,670][model8_pretrain.py][INFO] Epoch:[0/2](60400/4588595) loss:2.977 lr:0.0000519 epoch_Time:28457.0min: [2024-01-02 23:59:22,670][model8_pretrain.py][INFO] Epoch:[0/2](60400/4588595) loss:3.393 lr:0.0000519 epoch_Time:28457.0min: [2024-01-02 23:59:22,670][model8_pretrain.py][INFO] Epoch:[0/2](60400/4588595) loss:3.297 lr:0.0000519 epoch_Time:28457.0min: [2024-01-02 
23:59:59,609][model8_pretrain.py][INFO] Epoch:[0/2](60500/4588595) loss:3.545 lr:0.0000515 epoch_Time:28455.0min: [2024-01-02 23:59:59,609][model8_pretrain.py][INFO] Epoch:[0/2](60500/4588595) loss:2.812 lr:0.0000515 epoch_Time:28455.0min: [2024-01-02 23:59:59,609][model8_pretrain.py][INFO] Epoch:[0/2](60500/4588595) loss:2.992 lr:0.0000515 epoch_Time:28455.0min: [2024-01-02 23:59:59,609][model8_pretrain.py][INFO] Epoch:[0/2](60500/4588595) loss:2.899 lr:0.0000515 epoch_Time:28455.0min: [2024-01-02 23:59:59,609][model8_pretrain.py][INFO] Epoch:[0/2](60500/4588595) loss:3.029 lr:0.0000515 epoch_Time:28455.0min: [2024-01-02 23:59:59,609][model8_pretrain.py][INFO] Epoch:[0/2](60500/4588595) loss:3.123 lr:0.0000515 epoch_Time:28455.0min: [2024-01-02 23:59:59,609][model8_pretrain.py][INFO] Epoch:[0/2](60500/4588595) loss:2.730 lr:0.0000515 epoch_Time:28455.0min: [2024-01-02 23:59:59,609][model8_pretrain.py][INFO] Epoch:[0/2](60500/4588595) loss:2.845 lr:0.0000515 epoch_Time:28455.0min: [2024-01-03 00:00:36,552][model8_pretrain.py][INFO] Epoch:[0/2](60600/4588595) loss:3.123 lr:0.0000511 epoch_Time:28454.0min: [2024-01-03 00:00:36,552][model8_pretrain.py][INFO] Epoch:[0/2](60600/4588595) loss:2.660 lr:0.0000511 epoch_Time:28454.0min: [2024-01-03 00:00:36,552][model8_pretrain.py][INFO] Epoch:[0/2](60600/4588595) loss:3.044 lr:0.0000511 epoch_Time:28454.0min: [2024-01-03 00:00:36,554][model8_pretrain.py][INFO] Epoch:[0/2](60600/4588595) loss:3.049 lr:0.0000511 epoch_Time:28454.0min: [2024-01-03 00:00:36,554][model8_pretrain.py][INFO] Epoch:[0/2](60600/4588595) loss:3.477 lr:0.0000511 epoch_Time:28454.0min: [2024-01-03 00:00:36,554][model8_pretrain.py][INFO] Epoch:[0/2](60600/4588595) loss:3.347 lr:0.0000511 epoch_Time:28454.0min: [2024-01-03 00:00:36,554][model8_pretrain.py][INFO] Epoch:[0/2](60600/4588595) loss:2.864 lr:0.0000511 epoch_Time:28454.0min: [2024-01-03 00:00:36,554][model8_pretrain.py][INFO] Epoch:[0/2](60600/4588595) loss:3.049 lr:0.0000511 
epoch_Time:28454.0min: [2024-01-03 00:01:20,708][model8_pretrain.py][INFO] Epoch:[0/2](60700/4588595) loss:3.140 lr:0.0000507 epoch_Time:28461.0min: [2024-01-03 00:01:20,708][model8_pretrain.py][INFO] Epoch:[0/2](60700/4588595) loss:2.779 lr:0.0000507 epoch_Time:28461.0min: [2024-01-03 00:01:20,708][model8_pretrain.py][INFO] Epoch:[0/2](60700/4588595) loss:3.048 lr:0.0000507 epoch_Time:28461.0min: [2024-01-03 00:01:20,708][model8_pretrain.py][INFO] Epoch:[0/2](60700/4588595) loss:2.776 lr:0.0000507 epoch_Time:28461.0min: [2024-01-03 00:01:20,708][model8_pretrain.py][INFO] Epoch:[0/2](60700/4588595) loss:3.313 lr:0.0000507 epoch_Time:28461.0min: [2024-01-03 00:01:20,708][model8_pretrain.py][INFO] Epoch:[0/2](60700/4588595) loss:3.007 lr:0.0000507 epoch_Time:28461.0min: [2024-01-03 00:01:20,708][model8_pretrain.py][INFO] Epoch:[0/2](60700/4588595) loss:3.372 lr:0.0000507 epoch_Time:28461.0min: [2024-01-03 00:01:20,708][model8_pretrain.py][INFO] Epoch:[0/2](60700/4588595) loss:3.170 lr:0.0000507 epoch_Time:28461.0min: [2024-01-03 00:01:57,651][model8_pretrain.py][INFO] Epoch:[0/2](60800/4588595) loss:3.249 lr:0.0000503 epoch_Time:28459.0min: [2024-01-03 00:01:57,651][model8_pretrain.py][INFO] Epoch:[0/2](60800/4588595) loss:3.247 lr:0.0000503 epoch_Time:28459.0min: [2024-01-03 00:01:57,651][model8_pretrain.py][INFO] Epoch:[0/2](60800/4588595) loss:2.547 lr:0.0000503 epoch_Time:28459.0min: [2024-01-03 00:01:57,651][model8_pretrain.py][INFO] Epoch:[0/2](60800/4588595) loss:3.711 lr:0.0000503 epoch_Time:28459.0min: [2024-01-03 00:01:57,651][model8_pretrain.py][INFO] Epoch:[0/2](60800/4588595) loss:3.096 lr:0.0000503 epoch_Time:28459.0min: [2024-01-03 00:01:57,651][model8_pretrain.py][INFO] Epoch:[0/2](60800/4588595) loss:3.844 lr:0.0000503 epoch_Time:28459.0min: [2024-01-03 00:01:57,651][model8_pretrain.py][INFO] Epoch:[0/2](60800/4588595) loss:3.220 lr:0.0000503 epoch_Time:28459.0min: [2024-01-03 00:01:57,651][model8_pretrain.py][INFO] Epoch:[0/2](60800/4588595) 
loss:3.411 lr:0.0000503 epoch_Time:28459.0min: [2024-01-03 00:02:34,592][model8_pretrain.py][INFO] Epoch:[0/2](60900/4588595) loss:3.271 lr:0.0000499 epoch_Time:28458.0min: [2024-01-03 00:02:34,592][model8_pretrain.py][INFO] Epoch:[0/2](60900/4588595) loss:3.043 lr:0.0000499 epoch_Time:28458.0min: [2024-01-03 00:02:34,592][model8_pretrain.py][INFO] Epoch:[0/2](60900/4588595) loss:3.227 lr:0.0000499 epoch_Time:28458.0min: [2024-01-03 00:02:34,592][model8_pretrain.py][INFO] Epoch:[0/2](60900/4588595) loss:2.573 lr:0.0000499 epoch_Time:28458.0min: [2024-01-03 00:02:34,592][model8_pretrain.py][INFO] Epoch:[0/2](60900/4588595) loss:3.157 lr:0.0000499 epoch_Time:28458.0min: [2024-01-03 00:02:34,592][model8_pretrain.py][INFO] Epoch:[0/2](60900/4588595) loss:3.311 lr:0.0000499 epoch_Time:28458.0min: [2024-01-03 00:02:34,592][model8_pretrain.py][INFO] Epoch:[0/2](60900/4588595) loss:2.857 lr:0.0000499 epoch_Time:28458.0min: [2024-01-03 00:02:34,593][model8_pretrain.py][INFO] Epoch:[0/2](60900/4588595) loss:3.071 lr:0.0000499 epoch_Time:28458.0min: [2024-01-03 00:03:11,531][model8_pretrain.py][INFO] Epoch:[0/2](61000/4588595) loss:2.898 lr:0.0000495 epoch_Time:28456.0min: [2024-01-03 00:03:11,531][model8_pretrain.py][INFO] Epoch:[0/2](61000/4588595) loss:3.731 lr:0.0000495 epoch_Time:28456.0min: [2024-01-03 00:03:11,531][model8_pretrain.py][INFO] Epoch:[0/2](61000/4588595) loss:2.898 lr:0.0000495 epoch_Time:28456.0min: [2024-01-03 00:03:11,531][model8_pretrain.py][INFO] Epoch:[0/2](61000/4588595) loss:3.539 lr:0.0000495 epoch_Time:28456.0min: [2024-01-03 00:03:11,531][model8_pretrain.py][INFO] Epoch:[0/2](61000/4588595) loss:2.819 lr:0.0000495 epoch_Time:28456.0min: [2024-01-03 00:03:11,531][model8_pretrain.py][INFO] Epoch:[0/2](61000/4588595) loss:3.177 lr:0.0000495 epoch_Time:28456.0min: [2024-01-03 00:03:11,531][model8_pretrain.py][INFO] Epoch:[0/2](61000/4588595) loss:3.152 lr:0.0000495 epoch_Time:28456.0min: [2024-01-03 00:03:11,531][model8_pretrain.py][INFO] 
Epoch:[0/2](61000/4588595) loss:3.297 lr:0.0000495 epoch_Time:28456.0min: [2024-01-03 00:03:48,482][model8_pretrain.py][INFO] Epoch:[0/2](61100/4588595) loss:2.983 lr:0.0000491 epoch_Time:28454.0min: [2024-01-03 00:03:48,482][model8_pretrain.py][INFO] Epoch:[0/2](61100/4588595) loss:3.356 lr:0.0000491 epoch_Time:28454.0min: [2024-01-03 00:03:48,482][model8_pretrain.py][INFO] Epoch:[0/2](61100/4588595) loss:3.116 lr:0.0000491 epoch_Time:28454.0min: [2024-01-03 00:03:48,482][model8_pretrain.py][INFO] Epoch:[0/2](61100/4588595) loss:2.960 lr:0.0000491 epoch_Time:28454.0min: [2024-01-03 00:03:48,482][model8_pretrain.py][INFO] Epoch:[0/2](61100/4588595) loss:3.308 lr:0.0000491 epoch_Time:28454.0min: [2024-01-03 00:03:48,482][model8_pretrain.py][INFO] Epoch:[0/2](61100/4588595) loss:3.339 lr:0.0000491 epoch_Time:28454.0min: [2024-01-03 00:03:48,482][model8_pretrain.py][INFO] Epoch:[0/2](61100/4588595) loss:2.433 lr:0.0000491 epoch_Time:28454.0min: [2024-01-03 00:03:48,482][model8_pretrain.py][INFO] Epoch:[0/2](61100/4588595) loss:3.320 lr:0.0000491 epoch_Time:28454.0min: [2024-01-03 00:04:25,450][model8_pretrain.py][INFO] Epoch:[0/2](61200/4588595) loss:3.098 lr:0.0000487 epoch_Time:28453.0min: [2024-01-03 00:04:25,450][model8_pretrain.py][INFO] Epoch:[0/2](61200/4588595) loss:3.129 lr:0.0000487 epoch_Time:28454.0min: [2024-01-03 00:04:25,450][model8_pretrain.py][INFO] Epoch:[0/2](61200/4588595) loss:3.255 lr:0.0000487 epoch_Time:28454.0min: [2024-01-03 00:04:25,450][model8_pretrain.py][INFO] Epoch:[0/2](61200/4588595) loss:3.329 lr:0.0000487 epoch_Time:28454.0min: [2024-01-03 00:04:25,450][model8_pretrain.py][INFO] Epoch:[0/2](61200/4588595) loss:3.387 lr:0.0000487 epoch_Time:28454.0min: [2024-01-03 00:04:25,450][model8_pretrain.py][INFO] Epoch:[0/2](61200/4588595) loss:3.393 lr:0.0000487 epoch_Time:28453.0min: [2024-01-03 00:04:25,450][model8_pretrain.py][INFO] Epoch:[0/2](61200/4588595) loss:3.185 lr:0.0000487 epoch_Time:28454.0min: [2024-01-03 
00:04:25,450][model8_pretrain.py][INFO] Epoch:[0/2](61200/4588595) loss:3.223 lr:0.0000487 epoch_Time:28454.0min: [2024-01-03 00:05:02,349][model8_pretrain.py][INFO] Epoch:[0/2](61300/4588595) loss:3.290 lr:0.0000483 epoch_Time:28451.0min: [2024-01-03 00:05:02,350][model8_pretrain.py][INFO] Epoch:[0/2](61300/4588595) loss:3.307 lr:0.0000483 epoch_Time:28452.0min: [2024-01-03 00:05:02,350][model8_pretrain.py][INFO] Epoch:[0/2](61300/4588595) loss:3.076 lr:0.0000483 epoch_Time:28452.0min: [2024-01-03 00:05:02,350][model8_pretrain.py][INFO] Epoch:[0/2](61300/4588595) loss:3.341 lr:0.0000483 epoch_Time:28451.0min: [2024-01-03 00:05:02,350][model8_pretrain.py][INFO] Epoch:[0/2](61300/4588595) loss:3.268 lr:0.0000483 epoch_Time:28452.0min: [2024-01-03 00:05:02,350][model8_pretrain.py][INFO] Epoch:[0/2](61300/4588595) loss:3.494 lr:0.0000483 epoch_Time:28452.0min: [2024-01-03 00:05:02,350][model8_pretrain.py][INFO] Epoch:[0/2](61300/4588595) loss:3.071 lr:0.0000483 epoch_Time:28452.0min: [2024-01-03 00:05:02,350][model8_pretrain.py][INFO] Epoch:[0/2](61300/4588595) loss:2.877 lr:0.0000483 epoch_Time:28452.0min: [2024-01-03 00:05:39,284][model8_pretrain.py][INFO] Epoch:[0/2](61400/4588595) loss:3.358 lr:0.0000479 epoch_Time:28451.0min: [2024-01-03 00:05:39,284][model8_pretrain.py][INFO] Epoch:[0/2](61400/4588595) loss:3.352 lr:0.0000479 epoch_Time:28451.0min: [2024-01-03 00:05:39,285][model8_pretrain.py][INFO] Epoch:[0/2](61400/4588595) loss:2.785 lr:0.0000479 epoch_Time:28451.0min: [2024-01-03 00:05:39,284][model8_pretrain.py][INFO] Epoch:[0/2](61400/4588595) loss:2.838 lr:0.0000479 epoch_Time:28451.0min: [2024-01-03 00:05:39,285][model8_pretrain.py][INFO] Epoch:[0/2](61400/4588595) loss:3.209 lr:0.0000479 epoch_Time:28451.0min: [2024-01-03 00:05:39,285][model8_pretrain.py][INFO] Epoch:[0/2](61400/4588595) loss:2.525 lr:0.0000479 epoch_Time:28451.0min: [2024-01-03 00:05:39,285][model8_pretrain.py][INFO] Epoch:[0/2](61400/4588595) loss:3.236 lr:0.0000479 
epoch_Time:28451.0min: [2024-01-03 00:05:39,285][model8_pretrain.py][INFO] Epoch:[0/2](61400/4588595) loss:3.190 lr:0.0000479 epoch_Time:28451.0min: [2024-01-03 00:06:23,219][model8_pretrain.py][INFO] Epoch:[0/2](61500/4588595) loss:3.505 lr:0.0000475 epoch_Time:28457.0min: [2024-01-03 00:06:23,219][model8_pretrain.py][INFO] Epoch:[0/2](61500/4588595) loss:2.995 lr:0.0000475 epoch_Time:28457.0min: [2024-01-03 00:06:23,219][model8_pretrain.py][INFO] Epoch:[0/2](61500/4588595) loss:2.780 lr:0.0000475 epoch_Time:28457.0min: [2024-01-03 00:06:23,220][model8_pretrain.py][INFO] Epoch:[0/2](61500/4588595) loss:3.112 lr:0.0000475 epoch_Time:28457.0min: [2024-01-03 00:06:23,220][model8_pretrain.py][INFO] Epoch:[0/2](61500/4588595) loss:3.102 lr:0.0000475 epoch_Time:28457.0min: [2024-01-03 00:06:23,220][model8_pretrain.py][INFO] Epoch:[0/2](61500/4588595) loss:3.185 lr:0.0000475 epoch_Time:28457.0min: [2024-01-03 00:06:23,220][model8_pretrain.py][INFO] Epoch:[0/2](61500/4588595) loss:3.300 lr:0.0000475 epoch_Time:28457.0min: [2024-01-03 00:06:23,221][model8_pretrain.py][INFO] Epoch:[0/2](61500/4588595) loss:3.373 lr:0.0000475 epoch_Time:28457.0min: [2024-01-03 00:07:00,157][model8_pretrain.py][INFO] Epoch:[0/2](61600/4588595) loss:3.223 lr:0.0000471 epoch_Time:28455.0min: [2024-01-03 00:07:00,157][model8_pretrain.py][INFO] Epoch:[0/2](61600/4588595) loss:2.925 lr:0.0000471 epoch_Time:28455.0min: [2024-01-03 00:07:00,157][model8_pretrain.py][INFO] Epoch:[0/2](61600/4588595) loss:3.310 lr:0.0000471 epoch_Time:28455.0min: [2024-01-03 00:07:00,157][model8_pretrain.py][INFO] Epoch:[0/2](61600/4588595) loss:2.915 lr:0.0000471 epoch_Time:28455.0min: [2024-01-03 00:07:00,157][model8_pretrain.py][INFO] Epoch:[0/2](61600/4588595) loss:3.258 lr:0.0000471 epoch_Time:28455.0min: [2024-01-03 00:07:00,157][model8_pretrain.py][INFO] Epoch:[0/2](61600/4588595) loss:3.784 lr:0.0000471 epoch_Time:28455.0min: [2024-01-03 00:07:00,157][model8_pretrain.py][INFO] Epoch:[0/2](61600/4588595) 
loss:2.436 lr:0.0000471 epoch_Time:28455.0min: [2024-01-03 00:07:00,157][model8_pretrain.py][INFO] Epoch:[0/2](61600/4588595) loss:2.858 lr:0.0000471 epoch_Time:28455.0min: [2024-01-03 00:07:37,091][model8_pretrain.py][INFO] Epoch:[0/2](61700/4588595) loss:2.794 lr:0.0000467 epoch_Time:28454.0min: [2024-01-03 00:07:37,091][model8_pretrain.py][INFO] Epoch:[0/2](61700/4588595) loss:3.132 lr:0.0000467 epoch_Time:28454.0min: [2024-01-03 00:07:37,091][model8_pretrain.py][INFO] Epoch:[0/2](61700/4588595) loss:3.042 lr:0.0000467 epoch_Time:28454.0min: [2024-01-03 00:07:37,091][model8_pretrain.py][INFO] Epoch:[0/2](61700/4588595) loss:3.024 lr:0.0000467 epoch_Time:28454.0min: [2024-01-03 00:07:37,091][model8_pretrain.py][INFO] Epoch:[0/2](61700/4588595) loss:3.434 lr:0.0000467 epoch_Time:28454.0min: [2024-01-03 00:07:37,091][model8_pretrain.py][INFO] Epoch:[0/2](61700/4588595) loss:3.846 lr:0.0000467 epoch_Time:28454.0min: [2024-01-03 00:07:37,091][model8_pretrain.py][INFO] Epoch:[0/2](61700/4588595) loss:3.335 lr:0.0000467 epoch_Time:28454.0min: [2024-01-03 00:07:37,091][model8_pretrain.py][INFO] Epoch:[0/2](61700/4588595) loss:3.087 lr:0.0000467 epoch_Time:28454.0min: [2024-01-03 00:08:14,029][model8_pretrain.py][INFO] Epoch:[0/2](61800/4588595) loss:2.967 lr:0.0000463 epoch_Time:28452.0min: [2024-01-03 00:08:14,029][model8_pretrain.py][INFO] Epoch:[0/2](61800/4588595) loss:3.089 lr:0.0000463 epoch_Time:28452.0min: [2024-01-03 00:08:14,029][model8_pretrain.py][INFO] Epoch:[0/2](61800/4588595) loss:2.251 lr:0.0000463 epoch_Time:28452.0min: [2024-01-03 00:08:14,029][model8_pretrain.py][INFO] Epoch:[0/2](61800/4588595) loss:3.229 lr:0.0000463 epoch_Time:28452.0min: [2024-01-03 00:08:14,029][model8_pretrain.py][INFO] Epoch:[0/2](61800/4588595) loss:3.239 lr:0.0000463 epoch_Time:28452.0min: [2024-01-03 00:08:14,029][model8_pretrain.py][INFO] Epoch:[0/2](61800/4588595) loss:3.124 lr:0.0000463 epoch_Time:28452.0min: [2024-01-03 00:08:14,030][model8_pretrain.py][INFO] 
Epoch:[0/2](61800/4588595) loss:3.280 lr:0.0000463 epoch_Time:28452.0min: [2024-01-03 00:08:14,030][model8_pretrain.py][INFO] Epoch:[0/2](61800/4588595) loss:2.721 lr:0.0000463 epoch_Time:28452.0min: [2024-01-03 00:08:50,975][model8_pretrain.py][INFO] Epoch:[0/2](61900/4588595) loss:3.429 lr:0.0000460 epoch_Time:28450.0min: [2024-01-03 00:08:50,975][model8_pretrain.py][INFO] Epoch:[0/2](61900/4588595) loss:3.456 lr:0.0000460 epoch_Time:28450.0min: [2024-01-03 00:08:50,975][model8_pretrain.py][INFO] Epoch:[0/2](61900/4588595) loss:3.016 lr:0.0000460 epoch_Time:28450.0min: [2024-01-03 00:08:50,975][model8_pretrain.py][INFO] Epoch:[0/2](61900/4588595) loss:3.197 lr:0.0000460 epoch_Time:28450.0min: [2024-01-03 00:08:50,975][model8_pretrain.py][INFO] Epoch:[0/2](61900/4588595) loss:3.402 lr:0.0000460 epoch_Time:28450.0min: [2024-01-03 00:08:50,975][model8_pretrain.py][INFO] Epoch:[0/2](61900/4588595) loss:2.814 lr:0.0000460 epoch_Time:28450.0min: [2024-01-03 00:08:50,975][model8_pretrain.py][INFO] Epoch:[0/2](61900/4588595) loss:3.028 lr:0.0000460 epoch_Time:28450.0min: [2024-01-03 00:08:50,975][model8_pretrain.py][INFO] Epoch:[0/2](61900/4588595) loss:3.216 lr:0.0000460 epoch_Time:28450.0min: [2024-01-03 00:09:27,939][model8_pretrain.py][INFO] Epoch:[0/2](62000/4588595) loss:3.362 lr:0.0000456 epoch_Time:28450.0min: [2024-01-03 00:09:27,939][model8_pretrain.py][INFO] Epoch:[0/2](62000/4588595) loss:2.891 lr:0.0000456 epoch_Time:28450.0min: [2024-01-03 00:09:27,939][model8_pretrain.py][INFO] Epoch:[0/2](62000/4588595) loss:2.848 lr:0.0000456 epoch_Time:28450.0min: [2024-01-03 00:09:27,939][model8_pretrain.py][INFO] Epoch:[0/2](62000/4588595) loss:3.176 lr:0.0000456 epoch_Time:28450.0min: [2024-01-03 00:09:27,939][model8_pretrain.py][INFO] Epoch:[0/2](62000/4588595) loss:2.879 lr:0.0000456 epoch_Time:28450.0min: [2024-01-03 00:09:27,939][model8_pretrain.py][INFO] Epoch:[0/2](62000/4588595) loss:2.952 lr:0.0000456 epoch_Time:28450.0min: [2024-01-03 
00:09:27,939][model8_pretrain.py][INFO] Epoch:[0/2](62000/4588595) loss:2.964 lr:0.0000456 epoch_Time:28450.0min: [2024-01-03 00:09:27,939][model8_pretrain.py][INFO] Epoch:[0/2](62000/4588595) loss:3.436 lr:0.0000456 epoch_Time:28450.0min: [2024-01-03 00:10:04,879][model8_pretrain.py][INFO] Epoch:[0/2](62100/4588595) loss:3.341 lr:0.0000452 epoch_Time:28448.0min: [2024-01-03 00:10:04,879][model8_pretrain.py][INFO] Epoch:[0/2](62100/4588595) loss:3.250 lr:0.0000452 epoch_Time:28448.0min: [2024-01-03 00:10:04,879][model8_pretrain.py][INFO] Epoch:[0/2](62100/4588595) loss:3.185 lr:0.0000452 epoch_Time:28448.0min: [2024-01-03 00:10:04,879][model8_pretrain.py][INFO] Epoch:[0/2](62100/4588595) loss:2.779 lr:0.0000452 epoch_Time:28448.0min: [2024-01-03 00:10:04,879][model8_pretrain.py][INFO] Epoch:[0/2](62100/4588595) loss:2.932 lr:0.0000452 epoch_Time:28448.0min: [2024-01-03 00:10:04,879][model8_pretrain.py][INFO] Epoch:[0/2](62100/4588595) loss:3.050 lr:0.0000452 epoch_Time:28448.0min: [2024-01-03 00:10:04,879][model8_pretrain.py][INFO] Epoch:[0/2](62100/4588595) loss:3.236 lr:0.0000452 epoch_Time:28448.0min: [2024-01-03 00:10:04,879][model8_pretrain.py][INFO] Epoch:[0/2](62100/4588595) loss:3.270 lr:0.0000452 epoch_Time:28448.0min: [2024-01-03 00:10:41,818][model8_pretrain.py][INFO] Epoch:[0/2](62200/4588595) loss:3.250 lr:0.0000448 epoch_Time:28447.0min: [2024-01-03 00:10:41,818][model8_pretrain.py][INFO] Epoch:[0/2](62200/4588595) loss:3.470 lr:0.0000448 epoch_Time:28447.0min: [2024-01-03 00:10:41,818][model8_pretrain.py][INFO] Epoch:[0/2](62200/4588595) loss:3.024 lr:0.0000448 epoch_Time:28447.0min: [2024-01-03 00:10:41,818][model8_pretrain.py][INFO] Epoch:[0/2](62200/4588595) loss:3.251 lr:0.0000448 epoch_Time:28447.0min: [2024-01-03 00:10:41,818][model8_pretrain.py][INFO] Epoch:[0/2](62200/4588595) loss:3.584 lr:0.0000448 epoch_Time:28447.0min: [2024-01-03 00:10:41,818][model8_pretrain.py][INFO] Epoch:[0/2](62200/4588595) loss:3.273 lr:0.0000448 
epoch_Time:28447.0min: [2024-01-03 00:10:41,818][model8_pretrain.py][INFO] Epoch:[0/2](62200/4588595) loss:2.940 lr:0.0000448 epoch_Time:28447.0min: [2024-01-03 00:10:41,818][model8_pretrain.py][INFO] Epoch:[0/2](62200/4588595) loss:3.094 lr:0.0000448 epoch_Time:28447.0min: [2024-01-03 00:11:25,838][model8_pretrain.py][INFO] Epoch:[0/2](62300/4588595) loss:3.104 lr:0.0000445 epoch_Time:28453.0min: [2024-01-03 00:11:25,838][model8_pretrain.py][INFO] Epoch:[0/2](62300/4588595) loss:2.607 lr:0.0000445 epoch_Time:28453.0min: [2024-01-03 00:11:25,838][model8_pretrain.py][INFO] Epoch:[0/2](62300/4588595) loss:3.105 lr:0.0000445 epoch_Time:28453.0min: [2024-01-03 00:11:25,838][model8_pretrain.py][INFO] Epoch:[0/2](62300/4588595) loss:2.915 lr:0.0000445 epoch_Time:28453.0min: [2024-01-03 00:11:25,839][model8_pretrain.py][INFO] Epoch:[0/2](62300/4588595) loss:3.037 lr:0.0000445 epoch_Time:28453.0min: [2024-01-03 00:11:25,839][model8_pretrain.py][INFO] Epoch:[0/2](62300/4588595) loss:3.215 lr:0.0000445 epoch_Time:28453.0min: [2024-01-03 00:11:25,839][model8_pretrain.py][INFO] Epoch:[0/2](62300/4588595) loss:3.313 lr:0.0000445 epoch_Time:28453.0min: [2024-01-03 00:11:25,839][model8_pretrain.py][INFO] Epoch:[0/2](62300/4588595) loss:2.996 lr:0.0000445 epoch_Time:28453.0min: [2024-01-03 00:12:02,776][model8_pretrain.py][INFO] Epoch:[0/2](62400/4588595) loss:3.410 lr:0.0000441 epoch_Time:28451.0min: [2024-01-03 00:12:02,777][model8_pretrain.py][INFO] Epoch:[0/2](62400/4588595) loss:3.292 lr:0.0000441 epoch_Time:28451.0min: [2024-01-03 00:12:02,776][model8_pretrain.py][INFO] Epoch:[0/2](62400/4588595) loss:3.288 lr:0.0000441 epoch_Time:28451.0min: [2024-01-03 00:12:02,777][model8_pretrain.py][INFO] Epoch:[0/2](62400/4588595) loss:3.248 lr:0.0000441 epoch_Time:28451.0min: [2024-01-03 00:12:02,777][model8_pretrain.py][INFO] Epoch:[0/2](62400/4588595) loss:3.523 lr:0.0000441 epoch_Time:28451.0min: [2024-01-03 00:12:02,777][model8_pretrain.py][INFO] Epoch:[0/2](62400/4588595) 
loss:2.776 lr:0.0000441 epoch_Time:28451.0min: [2024-01-03 00:12:02,777][model8_pretrain.py][INFO] Epoch:[0/2](62400/4588595) loss:3.546 lr:0.0000441 epoch_Time:28451.0min: [2024-01-03 00:12:02,777][model8_pretrain.py][INFO] Epoch:[0/2](62400/4588595) loss:2.461 lr:0.0000441 epoch_Time:28451.0min: [2024-01-03 00:12:39,713][model8_pretrain.py][INFO] Epoch:[0/2](62500/4588595) loss:2.954 lr:0.0000437 epoch_Time:28450.0min: [2024-01-03 00:12:39,713][model8_pretrain.py][INFO] Epoch:[0/2](62500/4588595) loss:3.650 lr:0.0000437 epoch_Time:28450.0min: [2024-01-03 00:12:39,713][model8_pretrain.py][INFO] Epoch:[0/2](62500/4588595) loss:2.961 lr:0.0000437 epoch_Time:28450.0min: [2024-01-03 00:12:39,713][model8_pretrain.py][INFO] Epoch:[0/2](62500/4588595) loss:3.439 lr:0.0000437 epoch_Time:28451.0min: [2024-01-03 00:12:39,713][model8_pretrain.py][INFO] Epoch:[0/2](62500/4588595) loss:3.070 lr:0.0000437 epoch_Time:28450.0min: [2024-01-03 00:12:39,713][model8_pretrain.py][INFO] Epoch:[0/2](62500/4588595) loss:3.641 lr:0.0000437 epoch_Time:28450.0min: [2024-01-03 00:12:39,713][model8_pretrain.py][INFO] Epoch:[0/2](62500/4588595) loss:2.465 lr:0.0000437 epoch_Time:28450.0min: [2024-01-03 00:12:39,714][model8_pretrain.py][INFO] Epoch:[0/2](62500/4588595) loss:2.995 lr:0.0000437 epoch_Time:28450.0min: [2024-01-03 00:13:16,656][model8_pretrain.py][INFO] Epoch:[0/2](62600/4588595) loss:3.428 lr:0.0000433 epoch_Time:28449.0min: [2024-01-03 00:13:16,656][model8_pretrain.py][INFO] Epoch:[0/2](62600/4588595) loss:3.172 lr:0.0000433 epoch_Time:28449.0min: [2024-01-03 00:13:16,656][model8_pretrain.py][INFO] Epoch:[0/2](62600/4588595) loss:2.847 lr:0.0000433 epoch_Time:28449.0min: [2024-01-03 00:13:16,656][model8_pretrain.py][INFO] Epoch:[0/2](62600/4588595) loss:2.688 lr:0.0000433 epoch_Time:28449.0min: [2024-01-03 00:13:16,656][model8_pretrain.py][INFO] Epoch:[0/2](62600/4588595) loss:3.204 lr:0.0000433 epoch_Time:28449.0min: [2024-01-03 00:13:16,656][model8_pretrain.py][INFO] 
Epoch:[0/2](62600/4588595) loss:2.741 lr:0.0000433 epoch_Time:28449.0min: [2024-01-03 00:13:16,656][model8_pretrain.py][INFO] Epoch:[0/2](62600/4588595) loss:3.182 lr:0.0000433 epoch_Time:28449.0min: [2024-01-03 00:13:16,656][model8_pretrain.py][INFO] Epoch:[0/2](62600/4588595) loss:3.358 lr:0.0000433 epoch_Time:28449.0min: [2024-01-03 00:13:53,584][model8_pretrain.py][INFO] Epoch:[0/2](62700/4588595) loss:2.791 lr:0.0000430 epoch_Time:28447.0min: [2024-01-03 00:13:53,584][model8_pretrain.py][INFO] Epoch:[0/2](62700/4588595) loss:2.997 lr:0.0000430 epoch_Time:28447.0min: [2024-01-03 00:13:53,584][model8_pretrain.py][INFO] Epoch:[0/2](62700/4588595) loss:2.885 lr:0.0000430 epoch_Time:28447.0min: [2024-01-03 00:13:53,584][model8_pretrain.py][INFO] Epoch:[0/2](62700/4588595) loss:2.783 lr:0.0000430 epoch_Time:28447.0min: [2024-01-03 00:13:53,584][model8_pretrain.py][INFO] Epoch:[0/2](62700/4588595) loss:2.399 lr:0.0000430 epoch_Time:28447.0min: [2024-01-03 00:13:53,584][model8_pretrain.py][INFO] Epoch:[0/2](62700/4588595) loss:3.006 lr:0.0000430 epoch_Time:28447.0min: [2024-01-03 00:13:53,584][model8_pretrain.py][INFO] Epoch:[0/2](62700/4588595) loss:3.509 lr:0.0000430 epoch_Time:28447.0min: [2024-01-03 00:13:53,584][model8_pretrain.py][INFO] Epoch:[0/2](62700/4588595) loss:3.333 lr:0.0000430 epoch_Time:28447.0min: [2024-01-03 00:14:30,528][model8_pretrain.py][INFO] Epoch:[0/2](62800/4588595) loss:2.950 lr:0.0000426 epoch_Time:28446.0min: [2024-01-03 00:14:30,528][model8_pretrain.py][INFO] Epoch:[0/2](62800/4588595) loss:3.317 lr:0.0000426 epoch_Time:28446.0min: [2024-01-03 00:14:30,528][model8_pretrain.py][INFO] Epoch:[0/2](62800/4588595) loss:3.115 lr:0.0000426 epoch_Time:28446.0min: [2024-01-03 00:14:30,528][model8_pretrain.py][INFO] Epoch:[0/2](62800/4588595) loss:3.295 lr:0.0000426 epoch_Time:28446.0min: [2024-01-03 00:14:30,528][model8_pretrain.py][INFO] Epoch:[0/2](62800/4588595) loss:3.281 lr:0.0000426 epoch_Time:28446.0min: [2024-01-03 
00:14:30,528][model8_pretrain.py][INFO] Epoch:[0/2](62800/4588595) loss:3.154 lr:0.0000426 epoch_Time:28446.0min: [2024-01-03 00:14:30,528][model8_pretrain.py][INFO] Epoch:[0/2](62800/4588595) loss:3.669 lr:0.0000426 epoch_Time:28446.0min: [2024-01-03 00:14:30,528][model8_pretrain.py][INFO] Epoch:[0/2](62800/4588595) loss:3.014 lr:0.0000426 epoch_Time:28446.0min: [2024-01-03 00:15:07,461][model8_pretrain.py][INFO] Epoch:[0/2](62900/4588595) loss:3.034 lr:0.0000423 epoch_Time:28444.0min: [2024-01-03 00:15:07,461][model8_pretrain.py][INFO] Epoch:[0/2](62900/4588595) loss:2.925 lr:0.0000423 epoch_Time:28444.0min: [2024-01-03 00:15:07,461][model8_pretrain.py][INFO] Epoch:[0/2](62900/4588595) loss:2.364 lr:0.0000423 epoch_Time:28444.0min: [2024-01-03 00:15:07,461][model8_pretrain.py][INFO] Epoch:[0/2](62900/4588595) loss:3.309 lr:0.0000423 epoch_Time:28444.0min: [2024-01-03 00:15:07,461][model8_pretrain.py][INFO] Epoch:[0/2](62900/4588595) loss:3.112 lr:0.0000423 epoch_Time:28444.0min: [2024-01-03 00:15:07,461][model8_pretrain.py][INFO] Epoch:[0/2](62900/4588595) loss:3.066 lr:0.0000423 epoch_Time:28444.0min: [2024-01-03 00:15:07,461][model8_pretrain.py][INFO] Epoch:[0/2](62900/4588595) loss:3.530 lr:0.0000423 epoch_Time:28444.0min: [2024-01-03 00:15:07,461][model8_pretrain.py][INFO] Epoch:[0/2](62900/4588595) loss:3.400 lr:0.0000423 epoch_Time:28444.0min: [2024-01-03 00:15:44,401][model8_pretrain.py][INFO] Epoch:[0/2](63000/4588595) loss:3.017 lr:0.0000419 epoch_Time:28443.0min: [2024-01-03 00:15:44,401][model8_pretrain.py][INFO] Epoch:[0/2](63000/4588595) loss:2.957 lr:0.0000419 epoch_Time:28443.0min: [2024-01-03 00:15:44,401][model8_pretrain.py][INFO] Epoch:[0/2](63000/4588595) loss:2.685 lr:0.0000419 epoch_Time:28443.0min: [2024-01-03 00:15:44,401][model8_pretrain.py][INFO] Epoch:[0/2](63000/4588595) loss:3.039 lr:0.0000419 epoch_Time:28443.0min: [2024-01-03 00:15:44,401][model8_pretrain.py][INFO] Epoch:[0/2](63000/4588595) loss:2.960 lr:0.0000419 
epoch_Time:28443.0min: [2024-01-03 00:15:44,401][model8_pretrain.py][INFO] Epoch:[0/2](63000/4588595) loss:2.804 lr:0.0000419 epoch_Time:28443.0min: [2024-01-03 00:15:44,401][model8_pretrain.py][INFO] Epoch:[0/2](63000/4588595) loss:3.295 lr:0.0000419 epoch_Time:28443.0min: [2024-01-03 00:15:44,401][model8_pretrain.py][INFO] Epoch:[0/2](63000/4588595) loss:3.362 lr:0.0000419 epoch_Time:28443.0min: [2024-01-03 00:16:28,451][model8_pretrain.py][INFO] Epoch:[0/2](63100/4588595) loss:2.434 lr:0.0000415 epoch_Time:28449.0min: [2024-01-03 00:16:28,451][model8_pretrain.py][INFO] Epoch:[0/2](63100/4588595) loss:3.037 lr:0.0000415 epoch_Time:28449.0min: [2024-01-03 00:16:28,451][model8_pretrain.py][INFO] Epoch:[0/2](63100/4588595) loss:3.304 lr:0.0000415 epoch_Time:28449.0min: [2024-01-03 00:16:28,451][model8_pretrain.py][INFO] Epoch:[0/2](63100/4588595) loss:3.369 lr:0.0000415 epoch_Time:28449.0min: [2024-01-03 00:16:28,451][model8_pretrain.py][INFO] Epoch:[0/2](63100/4588595) loss:3.601 lr:0.0000415 epoch_Time:28449.0min: [2024-01-03 00:16:28,451][model8_pretrain.py][INFO] Epoch:[0/2](63100/4588595) loss:2.796 lr:0.0000415 epoch_Time:28449.0min: [2024-01-03 00:16:28,451][model8_pretrain.py][INFO] Epoch:[0/2](63100/4588595) loss:2.994 lr:0.0000415 epoch_Time:28449.0min: [2024-01-03 00:16:28,451][model8_pretrain.py][INFO] Epoch:[0/2](63100/4588595) loss:3.212 lr:0.0000415 epoch_Time:28449.0min: [2024-01-03 00:17:05,418][model8_pretrain.py][INFO] Epoch:[0/2](63200/4588595) loss:3.269 lr:0.0000412 epoch_Time:28448.0min: [2024-01-03 00:17:05,418][model8_pretrain.py][INFO] Epoch:[0/2](63200/4588595) loss:3.293 lr:0.0000412 epoch_Time:28448.0min: [2024-01-03 00:17:05,418][model8_pretrain.py][INFO] Epoch:[0/2](63200/4588595) loss:3.474 lr:0.0000412 epoch_Time:28448.0min: [2024-01-03 00:17:05,418][model8_pretrain.py][INFO] Epoch:[0/2](63200/4588595) loss:3.143 lr:0.0000412 epoch_Time:28448.0min: [2024-01-03 00:17:05,418][model8_pretrain.py][INFO] Epoch:[0/2](63200/4588595) 
loss:3.114 lr:0.0000412 epoch_Time:28448.0min: [2024-01-03 00:17:05,418][model8_pretrain.py][INFO] Epoch:[0/2](63200/4588595) loss:3.637 lr:0.0000412 epoch_Time:28448.0min: [2024-01-03 00:17:05,418][model8_pretrain.py][INFO] Epoch:[0/2](63200/4588595) loss:2.985 lr:0.0000412 epoch_Time:28448.0min: [2024-01-03 00:17:05,418][model8_pretrain.py][INFO] Epoch:[0/2](63200/4588595) loss:2.903 lr:0.0000412 epoch_Time:28448.0min: [2024-01-03 00:17:42,350][model8_pretrain.py][INFO] Epoch:[0/2](63300/4588595) loss:2.577 lr:0.0000408 epoch_Time:28447.0min: [2024-01-03 00:17:42,350][model8_pretrain.py][INFO] Epoch:[0/2](63300/4588595) loss:3.053 lr:0.0000408 epoch_Time:28447.0min: [2024-01-03 00:17:42,350][model8_pretrain.py][INFO] Epoch:[0/2](63300/4588595) loss:3.371 lr:0.0000408 epoch_Time:28447.0min: [2024-01-03 00:17:42,350][model8_pretrain.py][INFO] Epoch:[0/2](63300/4588595) loss:3.087 lr:0.0000408 epoch_Time:28447.0min: [2024-01-03 00:17:42,350][model8_pretrain.py][INFO] Epoch:[0/2](63300/4588595) loss:2.283 lr:0.0000408 epoch_Time:28447.0min: [2024-01-03 00:17:42,350][model8_pretrain.py][INFO] Epoch:[0/2](63300/4588595) loss:2.906 lr:0.0000408 epoch_Time:28447.0min: [2024-01-03 00:17:42,350][model8_pretrain.py][INFO] Epoch:[0/2](63300/4588595) loss:2.889 lr:0.0000408 epoch_Time:28447.0min: [2024-01-03 00:17:42,350][model8_pretrain.py][INFO] Epoch:[0/2](63300/4588595) loss:3.043 lr:0.0000408 epoch_Time:28447.0min: [2024-01-03 00:18:19,296][model8_pretrain.py][INFO] Epoch:[0/2](63400/4588595) loss:3.052 lr:0.0000405 epoch_Time:28445.0min: [2024-01-03 00:18:19,296][model8_pretrain.py][INFO] Epoch:[0/2](63400/4588595) loss:3.069 lr:0.0000405 epoch_Time:28445.0min: [2024-01-03 00:18:19,296][model8_pretrain.py][INFO] Epoch:[0/2](63400/4588595) loss:2.934 lr:0.0000405 epoch_Time:28445.0min: [2024-01-03 00:18:19,296][model8_pretrain.py][INFO] Epoch:[0/2](63400/4588595) loss:3.236 lr:0.0000405 epoch_Time:28445.0min: [2024-01-03 00:18:19,296][model8_pretrain.py][INFO] 
Epoch:[0/2](63400/4588595) loss:2.952 lr:0.0000405 epoch_Time:28445.0min: [2024-01-03 00:18:19,296][model8_pretrain.py][INFO] Epoch:[0/2](63400/4588595) loss:3.046 lr:0.0000405 epoch_Time:28445.0min: [2024-01-03 00:18:19,296][model8_pretrain.py][INFO] Epoch:[0/2](63400/4588595) loss:2.956 lr:0.0000405 epoch_Time:28445.0min: [2024-01-03 00:18:19,296][model8_pretrain.py][INFO] Epoch:[0/2](63400/4588595) loss:3.564 lr:0.0000405 epoch_Time:28445.0min: [2024-01-03 00:18:56,231][model8_pretrain.py][INFO] Epoch:[0/2](63500/4588595) loss:3.069 lr:0.0000401 epoch_Time:28443.0min: [2024-01-03 00:18:56,231][model8_pretrain.py][INFO] Epoch:[0/2](63500/4588595) loss:3.321 lr:0.0000401 epoch_Time:28443.0min: [2024-01-03 00:18:56,231][model8_pretrain.py][INFO] Epoch:[0/2](63500/4588595) loss:3.377 lr:0.0000401 epoch_Time:28443.0min: [2024-01-03 00:18:56,231][model8_pretrain.py][INFO] Epoch:[0/2](63500/4588595) loss:2.960 lr:0.0000401 epoch_Time:28443.0min: [2024-01-03 00:18:56,231][model8_pretrain.py][INFO] Epoch:[0/2](63500/4588595) loss:3.495 lr:0.0000401 epoch_Time:28443.0min: [2024-01-03 00:18:56,231][model8_pretrain.py][INFO] Epoch:[0/2](63500/4588595) loss:3.025 lr:0.0000401 epoch_Time:28443.0min: [2024-01-03 00:18:56,231][model8_pretrain.py][INFO] Epoch:[0/2](63500/4588595) loss:3.442 lr:0.0000401 epoch_Time:28443.0min: [2024-01-03 00:18:56,231][model8_pretrain.py][INFO] Epoch:[0/2](63500/4588595) loss:2.829 lr:0.0000401 epoch_Time:28443.0min: [2024-01-03 00:19:33,203][model8_pretrain.py][INFO] Epoch:[0/2](63600/4588595) loss:2.975 lr:0.0000398 epoch_Time:28442.0min: [2024-01-03 00:19:33,203][model8_pretrain.py][INFO] Epoch:[0/2](63600/4588595) loss:3.051 lr:0.0000398 epoch_Time:28442.0min: [2024-01-03 00:19:33,203][model8_pretrain.py][INFO] Epoch:[0/2](63600/4588595) loss:3.579 lr:0.0000398 epoch_Time:28442.0min: [2024-01-03 00:19:33,203][model8_pretrain.py][INFO] Epoch:[0/2](63600/4588595) loss:3.021 lr:0.0000398 epoch_Time:28442.0min: [2024-01-03 
00:19:33,203][model8_pretrain.py][INFO] Epoch:[0/2](63600/4588595) loss:2.899 lr:0.0000398 epoch_Time:28442.0min: [2024-01-03 00:19:33,203][model8_pretrain.py][INFO] Epoch:[0/2](63600/4588595) loss:3.284 lr:0.0000398 epoch_Time:28442.0min: [2024-01-03 00:19:33,204][model8_pretrain.py][INFO] Epoch:[0/2](63600/4588595) loss:3.073 lr:0.0000398 epoch_Time:28442.0min: [2024-01-03 00:19:33,204][model8_pretrain.py][INFO] Epoch:[0/2](63600/4588595) loss:3.216 lr:0.0000398 epoch_Time:28442.0min: [2024-01-03 00:20:10,148][model8_pretrain.py][INFO] Epoch:[0/2](63700/4588595) loss:3.025 lr:0.0000394 epoch_Time:28440.0min: [2024-01-03 00:20:10,148][model8_pretrain.py][INFO] Epoch:[0/2](63700/4588595) loss:3.217 lr:0.0000394 epoch_Time:28440.0min: [2024-01-03 00:20:10,148][model8_pretrain.py][INFO] Epoch:[0/2](63700/4588595) loss:3.395 lr:0.0000394 epoch_Time:28440.0min: [2024-01-03 00:20:10,148][model8_pretrain.py][INFO] Epoch:[0/2](63700/4588595) loss:2.764 lr:0.0000394 epoch_Time:28440.0min: [2024-01-03 00:20:10,148][model8_pretrain.py][INFO] Epoch:[0/2](63700/4588595) loss:3.083 lr:0.0000394 epoch_Time:28440.0min: [2024-01-03 00:20:10,148][model8_pretrain.py][INFO] Epoch:[0/2](63700/4588595) loss:3.100 lr:0.0000394 epoch_Time:28440.0min: [2024-01-03 00:20:10,148][model8_pretrain.py][INFO] Epoch:[0/2](63700/4588595) loss:3.066 lr:0.0000394 epoch_Time:28440.0min: [2024-01-03 00:20:10,148][model8_pretrain.py][INFO] Epoch:[0/2](63700/4588595) loss:3.205 lr:0.0000394 epoch_Time:28440.0min: [2024-01-03 00:20:47,045][model8_pretrain.py][INFO] Epoch:[0/2](63800/4588595) loss:2.976 lr:0.0000391 epoch_Time:28439.0min: [2024-01-03 00:20:47,045][model8_pretrain.py][INFO] Epoch:[0/2](63800/4588595) loss:3.116 lr:0.0000391 epoch_Time:28439.0min: [2024-01-03 00:20:47,045][model8_pretrain.py][INFO] Epoch:[0/2](63800/4588595) loss:3.217 lr:0.0000391 epoch_Time:28439.0min: [2024-01-03 00:20:47,045][model8_pretrain.py][INFO] Epoch:[0/2](63800/4588595) loss:2.897 lr:0.0000391 
epoch_Time:28439.0min: [2024-01-03 00:20:47,045][model8_pretrain.py][INFO] Epoch:[0/2](63800/4588595) loss:3.352 lr:0.0000391 epoch_Time:28439.0min: [2024-01-03 00:20:47,045][model8_pretrain.py][INFO] Epoch:[0/2](63800/4588595) loss:3.385 lr:0.0000391 epoch_Time:28439.0min: [2024-01-03 00:20:47,045][model8_pretrain.py][INFO] Epoch:[0/2](63800/4588595) loss:2.972 lr:0.0000391 epoch_Time:28439.0min: [2024-01-03 00:20:47,046][model8_pretrain.py][INFO] Epoch:[0/2](63800/4588595) loss:2.999 lr:0.0000391 epoch_Time:28439.0min: [2024-01-03 00:21:31,822][model8_pretrain.py][INFO] Epoch:[0/2](63900/4588595) loss:3.347 lr:0.0000387 epoch_Time:28446.0min: [2024-01-03 00:21:31,822][model8_pretrain.py][INFO] Epoch:[0/2](63900/4588595) loss:3.395 lr:0.0000387 epoch_Time:28446.0min: [2024-01-03 00:21:31,822][model8_pretrain.py][INFO] Epoch:[0/2](63900/4588595) loss:3.383 lr:0.0000387 epoch_Time:28446.0min: [2024-01-03 00:21:31,822][model8_pretrain.py][INFO] Epoch:[0/2](63900/4588595) loss:3.420 lr:0.0000387 epoch_Time:28446.0min: [2024-01-03 00:21:31,822][model8_pretrain.py][INFO] Epoch:[0/2](63900/4588595) loss:3.236 lr:0.0000387 epoch_Time:28446.0min: [2024-01-03 00:21:31,822][model8_pretrain.py][INFO] Epoch:[0/2](63900/4588595) loss:3.064 lr:0.0000387 epoch_Time:28446.0min: [2024-01-03 00:21:31,822][model8_pretrain.py][INFO] Epoch:[0/2](63900/4588595) loss:3.323 lr:0.0000387 epoch_Time:28446.0min: [2024-01-03 00:21:31,822][model8_pretrain.py][INFO] Epoch:[0/2](63900/4588595) loss:3.428 lr:0.0000387 epoch_Time:28446.0min: [2024-01-03 00:22:08,768][model8_pretrain.py][INFO] Epoch:[0/2](64000/4588595) loss:3.155 lr:0.0000384 epoch_Time:28444.0min: [2024-01-03 00:22:08,768][model8_pretrain.py][INFO] Epoch:[0/2](64000/4588595) loss:3.050 lr:0.0000384 epoch_Time:28444.0min: [2024-01-03 00:22:08,768][model8_pretrain.py][INFO] Epoch:[0/2](64000/4588595) loss:3.328 lr:0.0000384 epoch_Time:28444.0min: [2024-01-03 00:22:08,768][model8_pretrain.py][INFO] Epoch:[0/2](64000/4588595) 
loss:3.611 lr:0.0000384 epoch_Time:28444.0min: [2024-01-03 00:22:08,768][model8_pretrain.py][INFO] Epoch:[0/2](64000/4588595) loss:3.318 lr:0.0000384 epoch_Time:28444.0min: [2024-01-03 00:22:08,768][model8_pretrain.py][INFO] Epoch:[0/2](64000/4588595) loss:3.090 lr:0.0000384 epoch_Time:28444.0min: [2024-01-03 00:22:08,768][model8_pretrain.py][INFO] Epoch:[0/2](64000/4588595) loss:3.393 lr:0.0000384 epoch_Time:28444.0min: [2024-01-03 00:22:08,768][model8_pretrain.py][INFO] Epoch:[0/2](64000/4588595) loss:3.171 lr:0.0000384 epoch_Time:28444.0min: [2024-01-03 00:22:45,701][model8_pretrain.py][INFO] Epoch:[0/2](64100/4588595) loss:2.723 lr:0.0000380 epoch_Time:28444.0min: [2024-01-03 00:22:45,701][model8_pretrain.py][INFO] Epoch:[0/2](64100/4588595) loss:3.630 lr:0.0000380 epoch_Time:28444.0min: [2024-01-03 00:22:45,701][model8_pretrain.py][INFO] Epoch:[0/2](64100/4588595) loss:3.166 lr:0.0000380 epoch_Time:28444.0min: [2024-01-03 00:22:45,701][model8_pretrain.py][INFO] Epoch:[0/2](64100/4588595) loss:2.947 lr:0.0000380 epoch_Time:28444.0min: [2024-01-03 00:22:45,701][model8_pretrain.py][INFO] Epoch:[0/2](64100/4588595) loss:3.160 lr:0.0000380 epoch_Time:28444.0min: [2024-01-03 00:22:45,701][model8_pretrain.py][INFO] Epoch:[0/2](64100/4588595) loss:3.187 lr:0.0000380 epoch_Time:28444.0min: [2024-01-03 00:22:45,701][model8_pretrain.py][INFO] Epoch:[0/2](64100/4588595) loss:2.862 lr:0.0000380 epoch_Time:28443.0min: [2024-01-03 00:22:45,701][model8_pretrain.py][INFO] Epoch:[0/2](64100/4588595) loss:3.387 lr:0.0000380 epoch_Time:28444.0min: [2024-01-03 00:23:22,644][model8_pretrain.py][INFO] Epoch:[0/2](64200/4588595) loss:2.782 lr:0.0000377 epoch_Time:28442.0min: [2024-01-03 00:23:22,644][model8_pretrain.py][INFO] Epoch:[0/2](64200/4588595) loss:3.264 lr:0.0000377 epoch_Time:28442.0min: [2024-01-03 00:23:22,644][model8_pretrain.py][INFO] Epoch:[0/2](64200/4588595) loss:3.158 lr:0.0000377 epoch_Time:28442.0min: [2024-01-03 00:23:22,644][model8_pretrain.py][INFO] 
Epoch:[0/2](64200/4588595) loss:2.850 lr:0.0000377 epoch_Time:28442.0min: [2024-01-03 00:23:22,644][model8_pretrain.py][INFO] Epoch:[0/2](64200/4588595) loss:3.484 lr:0.0000377 epoch_Time:28442.0min: [2024-01-03 00:23:22,644][model8_pretrain.py][INFO] Epoch:[0/2](64200/4588595) loss:3.095 lr:0.0000377 epoch_Time:28442.0min: [2024-01-03 00:23:22,644][model8_pretrain.py][INFO] Epoch:[0/2](64200/4588595) loss:3.301 lr:0.0000377 epoch_Time:28442.0min: [2024-01-03 00:23:22,644][model8_pretrain.py][INFO] Epoch:[0/2](64200/4588595) loss:2.780 lr:0.0000377 epoch_Time:28442.0min: [2024-01-03 00:23:59,598][model8_pretrain.py][INFO] Epoch:[0/2](64300/4588595) loss:3.073 lr:0.0000374 epoch_Time:28440.0min: [2024-01-03 00:23:59,598][model8_pretrain.py][INFO] Epoch:[0/2](64300/4588595) loss:3.294 lr:0.0000374 epoch_Time:28440.0min: [2024-01-03 00:23:59,598][model8_pretrain.py][INFO] Epoch:[0/2](64300/4588595) loss:2.686 lr:0.0000374 epoch_Time:28440.0min: [2024-01-03 00:23:59,598][model8_pretrain.py][INFO] Epoch:[0/2](64300/4588595) loss:2.752 lr:0.0000374 epoch_Time:28440.0min: [2024-01-03 00:23:59,598][model8_pretrain.py][INFO] Epoch:[0/2](64300/4588595) loss:3.041 lr:0.0000374 epoch_Time:28440.0min: [2024-01-03 00:23:59,598][model8_pretrain.py][INFO] Epoch:[0/2](64300/4588595) loss:2.998 lr:0.0000374 epoch_Time:28440.0min: [2024-01-03 00:23:59,598][model8_pretrain.py][INFO] Epoch:[0/2](64300/4588595) loss:2.653 lr:0.0000374 epoch_Time:28440.0min: [2024-01-03 00:23:59,598][model8_pretrain.py][INFO] Epoch:[0/2](64300/4588595) loss:3.028 lr:0.0000374 epoch_Time:28440.0min: [2024-01-03 00:24:36,545][model8_pretrain.py][INFO] Epoch:[0/2](64400/4588595) loss:2.738 lr:0.0000370 epoch_Time:28439.0min: [2024-01-03 00:24:36,545][model8_pretrain.py][INFO] Epoch:[0/2](64400/4588595) loss:3.075 lr:0.0000370 epoch_Time:28439.0min: [2024-01-03 00:24:36,545][model8_pretrain.py][INFO] Epoch:[0/2](64400/4588595) loss:3.020 lr:0.0000370 epoch_Time:28439.0min: [2024-01-03 
00:24:36,545][model8_pretrain.py][INFO] Epoch:[0/2](64400/4588595) loss:3.037 lr:0.0000370 epoch_Time:28439.0min: [2024-01-03 00:24:36,545][model8_pretrain.py][INFO] Epoch:[0/2](64400/4588595) loss:3.143 lr:0.0000370 epoch_Time:28439.0min: [2024-01-03 00:24:36,545][model8_pretrain.py][INFO] Epoch:[0/2](64400/4588595) loss:2.981 lr:0.0000370 epoch_Time:28439.0min: [2024-01-03 00:24:36,545][model8_pretrain.py][INFO] Epoch:[0/2](64400/4588595) loss:3.124 lr:0.0000370 epoch_Time:28439.0min: [2024-01-03 00:24:36,548][model8_pretrain.py][INFO] Epoch:[0/2](64400/4588595) loss:3.000 lr:0.0000370 epoch_Time:28439.0min: [2024-01-03 00:25:13,489][model8_pretrain.py][INFO] Epoch:[0/2](64500/4588595) loss:3.264 lr:0.0000367 epoch_Time:28437.0min: [2024-01-03 00:25:13,489][model8_pretrain.py][INFO] Epoch:[0/2](64500/4588595) loss:2.526 lr:0.0000367 epoch_Time:28437.0min: [2024-01-03 00:25:13,490][model8_pretrain.py][INFO] Epoch:[0/2](64500/4588595) loss:3.159 lr:0.0000367 epoch_Time:28437.0min: [2024-01-03 00:25:13,490][model8_pretrain.py][INFO] Epoch:[0/2](64500/4588595) loss:2.642 lr:0.0000367 epoch_Time:28437.0min: [2024-01-03 00:25:13,490][model8_pretrain.py][INFO] Epoch:[0/2](64500/4588595) loss:3.282 lr:0.0000367 epoch_Time:28437.0min: [2024-01-03 00:25:13,490][model8_pretrain.py][INFO] Epoch:[0/2](64500/4588595) loss:3.332 lr:0.0000367 epoch_Time:28437.0min: [2024-01-03 00:25:13,490][model8_pretrain.py][INFO] Epoch:[0/2](64500/4588595) loss:3.348 lr:0.0000367 epoch_Time:28437.0min: [2024-01-03 00:25:13,490][model8_pretrain.py][INFO] Epoch:[0/2](64500/4588595) loss:3.269 lr:0.0000367 epoch_Time:28437.0min: [2024-01-03 00:25:50,431][model8_pretrain.py][INFO] Epoch:[0/2](64600/4588595) loss:2.780 lr:0.0000364 epoch_Time:28435.0min: [2024-01-03 00:25:50,431][model8_pretrain.py][INFO] Epoch:[0/2](64600/4588595) loss:3.073 lr:0.0000364 epoch_Time:28435.0min: [2024-01-03 00:25:50,431][model8_pretrain.py][INFO] Epoch:[0/2](64600/4588595) loss:3.265 lr:0.0000364 
epoch_Time:28435.0min: [2024-01-03 00:25:50,431][model8_pretrain.py][INFO] Epoch:[0/2](64600/4588595) loss:3.163 lr:0.0000364 epoch_Time:28435.0min: [2024-01-03 00:25:50,431][model8_pretrain.py][INFO] Epoch:[0/2](64600/4588595) loss:2.601 lr:0.0000364 epoch_Time:28435.0min: [2024-01-03 00:25:50,431][model8_pretrain.py][INFO] Epoch:[0/2](64600/4588595) loss:3.358 lr:0.0000364 epoch_Time:28435.0min: [2024-01-03 00:25:50,432][model8_pretrain.py][INFO] Epoch:[0/2](64600/4588595) loss:3.190 lr:0.0000364 epoch_Time:28435.0min: [2024-01-03 00:25:50,432][model8_pretrain.py][INFO] Epoch:[0/2](64600/4588595) loss:2.813 lr:0.0000364 epoch_Time:28435.0min: [2024-01-03 00:26:34,523][model8_pretrain.py][INFO] Epoch:[0/2](64700/4588595) loss:2.566 lr:0.0000360 epoch_Time:28442.0min: [2024-01-03 00:26:34,523][model8_pretrain.py][INFO] Epoch:[0/2](64700/4588595) loss:3.066 lr:0.0000360 epoch_Time:28443.0min: [2024-01-03 00:26:34,523][model8_pretrain.py][INFO] Epoch:[0/2](64700/4588595) loss:2.999 lr:0.0000360 epoch_Time:28442.0min: [2024-01-03 00:26:34,523][model8_pretrain.py][INFO] Epoch:[0/2](64700/4588595) loss:3.372 lr:0.0000360 epoch_Time:28442.0min: [2024-01-03 00:26:34,523][model8_pretrain.py][INFO] Epoch:[0/2](64700/4588595) loss:3.287 lr:0.0000360 epoch_Time:28443.0min: [2024-01-03 00:26:34,523][model8_pretrain.py][INFO] Epoch:[0/2](64700/4588595) loss:2.974 lr:0.0000360 epoch_Time:28442.0min: [2024-01-03 00:26:34,523][model8_pretrain.py][INFO] Epoch:[0/2](64700/4588595) loss:3.090 lr:0.0000360 epoch_Time:28442.0min: [2024-01-03 00:26:34,523][model8_pretrain.py][INFO] Epoch:[0/2](64700/4588595) loss:2.920 lr:0.0000360 epoch_Time:28442.0min: [2024-01-03 00:27:11,467][model8_pretrain.py][INFO] Epoch:[0/2](64800/4588595) loss:2.715 lr:0.0000357 epoch_Time:28441.0min: [2024-01-03 00:27:11,467][model8_pretrain.py][INFO] Epoch:[0/2](64800/4588595) loss:3.327 lr:0.0000357 epoch_Time:28441.0min: [2024-01-03 00:27:11,467][model8_pretrain.py][INFO] Epoch:[0/2](64800/4588595) 
loss:3.132 lr:0.0000357 epoch_Time:28441.0min: [2024-01-03 00:27:11,467][model8_pretrain.py][INFO] Epoch:[0/2](64800/4588595) loss:2.798 lr:0.0000357 epoch_Time:28441.0min: [2024-01-03 00:27:11,467][model8_pretrain.py][INFO] Epoch:[0/2](64800/4588595) loss:3.289 lr:0.0000357 epoch_Time:28441.0min: [2024-01-03 00:27:11,467][model8_pretrain.py][INFO] Epoch:[0/2](64800/4588595) loss:3.399 lr:0.0000357 epoch_Time:28441.0min: [2024-01-03 00:27:11,467][model8_pretrain.py][INFO] Epoch:[0/2](64800/4588595) loss:2.938 lr:0.0000357 epoch_Time:28441.0min: [2024-01-03 00:27:11,467][model8_pretrain.py][INFO] Epoch:[0/2](64800/4588595) loss:3.369 lr:0.0000357 epoch_Time:28441.0min: [2024-01-03 00:27:48,411][model8_pretrain.py][INFO] Epoch:[0/2](64900/4588595) loss:3.072 lr:0.0000354 epoch_Time:28439.0min: [2024-01-03 00:27:48,412][model8_pretrain.py][INFO] Epoch:[0/2](64900/4588595) loss:2.994 lr:0.0000354 epoch_Time:28439.0min: [2024-01-03 00:27:48,412][model8_pretrain.py][INFO] Epoch:[0/2](64900/4588595) loss:3.445 lr:0.0000354 epoch_Time:28439.0min: [2024-01-03 00:27:48,412][model8_pretrain.py][INFO] Epoch:[0/2](64900/4588595) loss:3.305 lr:0.0000354 epoch_Time:28439.0min: [2024-01-03 00:27:48,412][model8_pretrain.py][INFO] Epoch:[0/2](64900/4588595) loss:3.087 lr:0.0000354 epoch_Time:28439.0min: [2024-01-03 00:27:48,412][model8_pretrain.py][INFO] Epoch:[0/2](64900/4588595) loss:3.248 lr:0.0000354 epoch_Time:28439.0min: [2024-01-03 00:27:48,412][model8_pretrain.py][INFO] Epoch:[0/2](64900/4588595) loss:3.271 lr:0.0000354 epoch_Time:28439.0min: [2024-01-03 00:27:48,412][model8_pretrain.py][INFO] Epoch:[0/2](64900/4588595) loss:3.400 lr:0.0000354 epoch_Time:28439.0min: [2024-01-03 00:28:25,372][model8_pretrain.py][INFO] Epoch:[0/2](65000/4588595) loss:3.306 lr:0.0000350 epoch_Time:28438.0min: [2024-01-03 00:28:25,372][model8_pretrain.py][INFO] Epoch:[0/2](65000/4588595) loss:2.945 lr:0.0000350 epoch_Time:28438.0min: [2024-01-03 00:28:25,372][model8_pretrain.py][INFO] 
Epoch:[0/2](65000/4588595) loss:2.429 lr:0.0000350 epoch_Time:28438.0min: [2024-01-03 00:28:25,372][model8_pretrain.py][INFO] Epoch:[0/2](65000/4588595) loss:2.814 lr:0.0000350 epoch_Time:28438.0min: [2024-01-03 00:28:25,372][model8_pretrain.py][INFO] Epoch:[0/2](65000/4588595) loss:2.922 lr:0.0000350 epoch_Time:28438.0min: [2024-01-03 00:28:25,372][model8_pretrain.py][INFO] Epoch:[0/2](65000/4588595) loss:3.286 lr:0.0000350 epoch_Time:28438.0min: [2024-01-03 00:28:25,373][model8_pretrain.py][INFO] Epoch:[0/2](65000/4588595) loss:3.159 lr:0.0000350 epoch_Time:28438.0min: [2024-01-03 00:28:25,373][model8_pretrain.py][INFO] Epoch:[0/2](65000/4588595) loss:3.503 lr:0.0000350 epoch_Time:28438.0min: [2024-01-03 00:29:02,315][model8_pretrain.py][INFO] Epoch:[0/2](65100/4588595) loss:3.349 lr:0.0000347 epoch_Time:28436.0min: [2024-01-03 00:29:02,315][model8_pretrain.py][INFO] Epoch:[0/2](65100/4588595) loss:3.126 lr:0.0000347 epoch_Time:28436.0min: [2024-01-03 00:29:02,315][model8_pretrain.py][INFO] Epoch:[0/2](65100/4588595) loss:3.301 lr:0.0000347 epoch_Time:28436.0min: [2024-01-03 00:29:02,315][model8_pretrain.py][INFO] Epoch:[0/2](65100/4588595) loss:3.074 lr:0.0000347 epoch_Time:28436.0min: [2024-01-03 00:29:02,315][model8_pretrain.py][INFO] Epoch:[0/2](65100/4588595) loss:3.055 lr:0.0000347 epoch_Time:28436.0min: [2024-01-03 00:29:02,315][model8_pretrain.py][INFO] Epoch:[0/2](65100/4588595) loss:2.956 lr:0.0000347 epoch_Time:28436.0min: [2024-01-03 00:29:02,316][model8_pretrain.py][INFO] Epoch:[0/2](65100/4588595) loss:2.692 lr:0.0000347 epoch_Time:28436.0min: [2024-01-03 00:29:02,316][model8_pretrain.py][INFO] Epoch:[0/2](65100/4588595) loss:3.125 lr:0.0000347 epoch_Time:28436.0min: [2024-01-03 00:29:39,263][model8_pretrain.py][INFO] Epoch:[0/2](65200/4588595) loss:3.107 lr:0.0000344 epoch_Time:28435.0min: [2024-01-03 00:29:39,263][model8_pretrain.py][INFO] Epoch:[0/2](65200/4588595) loss:3.235 lr:0.0000344 epoch_Time:28435.0min: [2024-01-03 
00:29:39,263][model8_pretrain.py][INFO] Epoch:[0/2](65200/4588595) loss:3.067 lr:0.0000344 epoch_Time:28435.0min: [2024-01-03 00:29:39,263][model8_pretrain.py][INFO] Epoch:[0/2](65200/4588595) loss:2.819 lr:0.0000344 epoch_Time:28435.0min: [2024-01-03 00:29:39,263][model8_pretrain.py][INFO] Epoch:[0/2](65200/4588595) loss:3.191 lr:0.0000344 epoch_Time:28435.0min: [2024-01-03 00:29:39,263][model8_pretrain.py][INFO] Epoch:[0/2](65200/4588595) loss:3.189 lr:0.0000344 epoch_Time:28435.0min: [2024-01-03 00:29:39,263][model8_pretrain.py][INFO] Epoch:[0/2](65200/4588595) loss:3.113 lr:0.0000344 epoch_Time:28435.0min: [2024-01-03 00:29:39,264][model8_pretrain.py][INFO] Epoch:[0/2](65200/4588595) loss:3.027 lr:0.0000344 epoch_Time:28435.0min: [2024-01-03 00:30:16,202][model8_pretrain.py][INFO] Epoch:[0/2](65300/4588595) loss:3.226 lr:0.0000341 epoch_Time:28433.0min: [2024-01-03 00:30:16,202][model8_pretrain.py][INFO] Epoch:[0/2](65300/4588595) loss:3.444 lr:0.0000341 epoch_Time:28433.0min: [2024-01-03 00:30:16,202][model8_pretrain.py][INFO] Epoch:[0/2](65300/4588595) loss:3.273 lr:0.0000341 epoch_Time:28433.0min: [2024-01-03 00:30:16,202][model8_pretrain.py][INFO] Epoch:[0/2](65300/4588595) loss:2.797 lr:0.0000341 epoch_Time:28433.0min: [2024-01-03 00:30:16,202][model8_pretrain.py][INFO] Epoch:[0/2](65300/4588595) loss:2.814 lr:0.0000341 epoch_Time:28433.0min: [2024-01-03 00:30:16,202][model8_pretrain.py][INFO] Epoch:[0/2](65300/4588595) loss:2.843 lr:0.0000341 epoch_Time:28433.0min: [2024-01-03 00:30:16,202][model8_pretrain.py][INFO] Epoch:[0/2](65300/4588595) loss:3.061 lr:0.0000341 epoch_Time:28433.0min: [2024-01-03 00:30:16,202][model8_pretrain.py][INFO] Epoch:[0/2](65300/4588595) loss:2.966 lr:0.0000341 epoch_Time:28433.0min: [2024-01-03 00:30:53,143][model8_pretrain.py][INFO] Epoch:[0/2](65400/4588595) loss:3.220 lr:0.0000338 epoch_Time:28431.0min: [2024-01-03 00:30:53,143][model8_pretrain.py][INFO] Epoch:[0/2](65400/4588595) loss:3.121 lr:0.0000338 
epoch_Time:28431.0min: [2024-01-03 00:30:53,143][model8_pretrain.py][INFO] Epoch:[0/2](65400/4588595) loss:2.993 lr:0.0000338 epoch_Time:28431.0min: [2024-01-03 00:30:53,143][model8_pretrain.py][INFO] Epoch:[0/2](65400/4588595) loss:3.135 lr:0.0000338 epoch_Time:28431.0min: [2024-01-03 00:30:53,143][model8_pretrain.py][INFO] Epoch:[0/2](65400/4588595) loss:3.347 lr:0.0000338 epoch_Time:28431.0min: [2024-01-03 00:30:53,143][model8_pretrain.py][INFO] Epoch:[0/2](65400/4588595) loss:3.297 lr:0.0000338 epoch_Time:28431.0min: [2024-01-03 00:30:53,143][model8_pretrain.py][INFO] Epoch:[0/2](65400/4588595) loss:3.489 lr:0.0000338 epoch_Time:28431.0min: [2024-01-03 00:30:53,143][model8_pretrain.py][INFO] Epoch:[0/2](65400/4588595) loss:2.692 lr:0.0000338 epoch_Time:28431.0min: [2024-01-03 00:31:37,200][model8_pretrain.py][INFO] Epoch:[0/2](65500/4588595) loss:3.386 lr:0.0000334 epoch_Time:28439.0min: [2024-01-03 00:31:37,200][model8_pretrain.py][INFO] Epoch:[0/2](65500/4588595) loss:3.015 lr:0.0000334 epoch_Time:28439.0min: [2024-01-03 00:31:37,200][model8_pretrain.py][INFO] Epoch:[0/2](65500/4588595) loss:3.078 lr:0.0000334 epoch_Time:28439.0min: [2024-01-03 00:31:37,200][model8_pretrain.py][INFO] Epoch:[0/2](65500/4588595) loss:3.244 lr:0.0000334 epoch_Time:28439.0min: [2024-01-03 00:31:37,201][model8_pretrain.py][INFO] Epoch:[0/2](65500/4588595) loss:2.873 lr:0.0000334 epoch_Time:28439.0min: [2024-01-03 00:31:37,201][model8_pretrain.py][INFO] Epoch:[0/2](65500/4588595) loss:2.993 lr:0.0000334 epoch_Time:28439.0min: [2024-01-03 00:31:37,201][model8_pretrain.py][INFO] Epoch:[0/2](65500/4588595) loss:2.983 lr:0.0000334 epoch_Time:28439.0min: [2024-01-03 00:31:37,201][model8_pretrain.py][INFO] Epoch:[0/2](65500/4588595) loss:3.227 lr:0.0000334 epoch_Time:28439.0min: [2024-01-03 00:32:14,122][model8_pretrain.py][INFO] Epoch:[0/2](65600/4588595) loss:2.778 lr:0.0000331 epoch_Time:28437.0min: [2024-01-03 00:32:14,122][model8_pretrain.py][INFO] Epoch:[0/2](65600/4588595) 
loss:3.326 lr:0.0000331 epoch_Time:28437.0min: [2024-01-03 00:32:14,122][model8_pretrain.py][INFO] Epoch:[0/2](65600/4588595) loss:3.166 lr:0.0000331 epoch_Time:28437.0min: [2024-01-03 00:32:14,122][model8_pretrain.py][INFO] Epoch:[0/2](65600/4588595) loss:2.909 lr:0.0000331 epoch_Time:28437.0min: [2024-01-03 00:32:14,122][model8_pretrain.py][INFO] Epoch:[0/2](65600/4588595) loss:3.074 lr:0.0000331 epoch_Time:28437.0min: [2024-01-03 00:32:14,122][model8_pretrain.py][INFO] Epoch:[0/2](65600/4588595) loss:2.893 lr:0.0000331 epoch_Time:28437.0min: [2024-01-03 00:32:14,123][model8_pretrain.py][INFO] Epoch:[0/2](65600/4588595) loss:3.001 lr:0.0000331 epoch_Time:28437.0min: [2024-01-03 00:32:14,123][model8_pretrain.py][INFO] Epoch:[0/2](65600/4588595) loss:3.043 lr:0.0000331 epoch_Time:28437.0min: [2024-01-03 00:32:51,058][model8_pretrain.py][INFO] Epoch:[0/2](65700/4588595) loss:2.353 lr:0.0000328 epoch_Time:28435.0min: [2024-01-03 00:32:51,058][model8_pretrain.py][INFO] Epoch:[0/2](65700/4588595) loss:2.915 lr:0.0000328 epoch_Time:28435.0min: [2024-01-03 00:32:51,058][model8_pretrain.py][INFO] Epoch:[0/2](65700/4588595) loss:3.248 lr:0.0000328 epoch_Time:28435.0min: [2024-01-03 00:32:51,058][model8_pretrain.py][INFO] Epoch:[0/2](65700/4588595) loss:2.905 lr:0.0000328 epoch_Time:28435.0min: [2024-01-03 00:32:51,058][model8_pretrain.py][INFO] Epoch:[0/2](65700/4588595) loss:3.234 lr:0.0000328 epoch_Time:28435.0min: [2024-01-03 00:32:51,058][model8_pretrain.py][INFO] Epoch:[0/2](65700/4588595) loss:3.233 lr:0.0000328 epoch_Time:28435.0min: [2024-01-03 00:32:51,058][model8_pretrain.py][INFO] Epoch:[0/2](65700/4588595) loss:3.430 lr:0.0000328 epoch_Time:28435.0min: [2024-01-03 00:32:51,058][model8_pretrain.py][INFO] Epoch:[0/2](65700/4588595) loss:3.446 lr:0.0000328 epoch_Time:28435.0min: [2024-01-03 00:33:28,001][model8_pretrain.py][INFO] Epoch:[0/2](65800/4588595) loss:2.904 lr:0.0000325 epoch_Time:28434.0min: [2024-01-03 00:33:28,001][model8_pretrain.py][INFO] 
Epoch:[0/2](65800/4588595) loss:2.830 lr:0.0000325 epoch_Time:28434.0min: [2024-01-03 00:33:28,001][model8_pretrain.py][INFO] Epoch:[0/2](65800/4588595) loss:3.063 lr:0.0000325 epoch_Time:28434.0min: [2024-01-03 00:33:28,001][model8_pretrain.py][INFO] Epoch:[0/2](65800/4588595) loss:2.984 lr:0.0000325 epoch_Time:28434.0min: [2024-01-03 00:33:28,001][model8_pretrain.py][INFO] Epoch:[0/2](65800/4588595) loss:3.275 lr:0.0000325 epoch_Time:28434.0min: [2024-01-03 00:33:28,001][model8_pretrain.py][INFO] Epoch:[0/2](65800/4588595) loss:3.084 lr:0.0000325 epoch_Time:28434.0min: [2024-01-03 00:33:28,001][model8_pretrain.py][INFO] Epoch:[0/2](65800/4588595) loss:2.927 lr:0.0000325 epoch_Time:28434.0min: [2024-01-03 00:33:28,001][model8_pretrain.py][INFO] Epoch:[0/2](65800/4588595) loss:2.344 lr:0.0000325 epoch_Time:28434.0min: [2024-01-03 00:34:04,955][model8_pretrain.py][INFO] Epoch:[0/2](65900/4588595) loss:3.616 lr:0.0000322 epoch_Time:28432.0min: [2024-01-03 00:34:04,955][model8_pretrain.py][INFO] Epoch:[0/2](65900/4588595) loss:3.017 lr:0.0000322 epoch_Time:28432.0min: [2024-01-03 00:34:04,955][model8_pretrain.py][INFO] Epoch:[0/2](65900/4588595) loss:3.428 lr:0.0000322 epoch_Time:28432.0min: [2024-01-03 00:34:04,955][model8_pretrain.py][INFO] Epoch:[0/2](65900/4588595) loss:2.966 lr:0.0000322 epoch_Time:28432.0min: [2024-01-03 00:34:04,955][model8_pretrain.py][INFO] Epoch:[0/2](65900/4588595) loss:3.274 lr:0.0000322 epoch_Time:28432.0min: [2024-01-03 00:34:04,955][model8_pretrain.py][INFO] Epoch:[0/2](65900/4588595) loss:2.457 lr:0.0000322 epoch_Time:28432.0min: [2024-01-03 00:34:04,955][model8_pretrain.py][INFO] Epoch:[0/2](65900/4588595) loss:2.753 lr:0.0000322 epoch_Time:28432.0min: [2024-01-03 00:34:04,955][model8_pretrain.py][INFO] Epoch:[0/2](65900/4588595) loss:3.455 lr:0.0000322 epoch_Time:28432.0min: [2024-01-03 00:34:41,904][model8_pretrain.py][INFO] Epoch:[0/2](66000/4588595) loss:2.848 lr:0.0000319 epoch_Time:28431.0min: [2024-01-03 
00:34:41,904][model8_pretrain.py][INFO] Epoch:[0/2](66000/4588595) loss:3.078 lr:0.0000319 epoch_Time:28431.0min: [2024-01-03 00:34:41,904][model8_pretrain.py][INFO] Epoch:[0/2](66000/4588595) loss:3.110 lr:0.0000319 epoch_Time:28431.0min: [2024-01-03 00:34:41,904][model8_pretrain.py][INFO] Epoch:[0/2](66000/4588595) loss:2.958 lr:0.0000319 epoch_Time:28431.0min: [2024-01-03 00:34:41,904][model8_pretrain.py][INFO] Epoch:[0/2](66000/4588595) loss:3.434 lr:0.0000319 epoch_Time:28431.0min: [2024-01-03 00:34:41,905][model8_pretrain.py][INFO] Epoch:[0/2](66000/4588595) loss:3.480 lr:0.0000319 epoch_Time:28431.0min: [2024-01-03 00:34:41,905][model8_pretrain.py][INFO] Epoch:[0/2](66000/4588595) loss:3.355 lr:0.0000319 epoch_Time:28431.0min: [2024-01-03 00:34:41,905][model8_pretrain.py][INFO] Epoch:[0/2](66000/4588595) loss:3.194 lr:0.0000319 epoch_Time:28431.0min: [2024-01-03 00:35:18,847][model8_pretrain.py][INFO] Epoch:[0/2](66100/4588595) loss:3.161 lr:0.0000316 epoch_Time:28429.0min: [2024-01-03 00:35:18,847][model8_pretrain.py][INFO] Epoch:[0/2](66100/4588595) loss:3.385 lr:0.0000316 epoch_Time:28429.0min: [2024-01-03 00:35:18,847][model8_pretrain.py][INFO] Epoch:[0/2](66100/4588595) loss:2.948 lr:0.0000316 epoch_Time:28429.0min: [2024-01-03 00:35:18,847][model8_pretrain.py][INFO] Epoch:[0/2](66100/4588595) loss:2.662 lr:0.0000316 epoch_Time:28429.0min: [2024-01-03 00:35:18,847][model8_pretrain.py][INFO] Epoch:[0/2](66100/4588595) loss:3.151 lr:0.0000316 epoch_Time:28429.0min: [2024-01-03 00:35:18,847][model8_pretrain.py][INFO] Epoch:[0/2](66100/4588595) loss:2.915 lr:0.0000316 epoch_Time:28429.0min: [2024-01-03 00:35:18,847][model8_pretrain.py][INFO] Epoch:[0/2](66100/4588595) loss:3.387 lr:0.0000316 epoch_Time:28429.0min: [2024-01-03 00:35:18,847][model8_pretrain.py][INFO] Epoch:[0/2](66100/4588595) loss:3.329 lr:0.0000316 epoch_Time:28429.0min: [2024-01-03 00:35:55,788][model8_pretrain.py][INFO] Epoch:[0/2](66200/4588595) loss:3.310 lr:0.0000313 
epoch_Time:28427.0min: [2024-01-03 00:35:55,788][model8_pretrain.py][INFO] Epoch:[0/2](66200/4588595) loss:3.129 lr:0.0000313 epoch_Time:28427.0min: [2024-01-03 00:35:55,788][model8_pretrain.py][INFO] Epoch:[0/2](66200/4588595) loss:3.175 lr:0.0000313 epoch_Time:28427.0min: [2024-01-03 00:35:55,788][model8_pretrain.py][INFO] Epoch:[0/2](66200/4588595) loss:2.979 lr:0.0000313 epoch_Time:28427.0min: [2024-01-03 00:35:55,788][model8_pretrain.py][INFO] Epoch:[0/2](66200/4588595) loss:3.043 lr:0.0000313 epoch_Time:28427.0min: [2024-01-03 00:35:55,788][model8_pretrain.py][INFO] Epoch:[0/2](66200/4588595) loss:3.406 lr:0.0000313 epoch_Time:28427.0min: [2024-01-03 00:35:55,788][model8_pretrain.py][INFO] Epoch:[0/2](66200/4588595) loss:2.997 lr:0.0000313 epoch_Time:28427.0min: [2024-01-03 00:35:55,788][model8_pretrain.py][INFO] Epoch:[0/2](66200/4588595) loss:2.926 lr:0.0000313 epoch_Time:28427.0min: [2024-01-03 00:36:38,139][model8_pretrain.py][INFO] Epoch:[0/2](66300/4588595) loss:3.272 lr:0.0000310 epoch_Time:28433.0min: [2024-01-03 00:36:38,139][model8_pretrain.py][INFO] Epoch:[0/2](66300/4588595) loss:3.075 lr:0.0000310 epoch_Time:28433.0min: [2024-01-03 00:36:38,139][model8_pretrain.py][INFO] Epoch:[0/2](66300/4588595) loss:3.253 lr:0.0000310 epoch_Time:28433.0min: [2024-01-03 00:36:38,139][model8_pretrain.py][INFO] Epoch:[0/2](66300/4588595) loss:3.345 lr:0.0000310 epoch_Time:28433.0min: [2024-01-03 00:36:38,142][model8_pretrain.py][INFO] Epoch:[0/2](66300/4588595) loss:3.455 lr:0.0000310 epoch_Time:28433.0min: [2024-01-03 00:36:38,143][model8_pretrain.py][INFO] Epoch:[0/2](66300/4588595) loss:3.089 lr:0.0000310 epoch_Time:28433.0min: [2024-01-03 00:36:38,143][model8_pretrain.py][INFO] Epoch:[0/2](66300/4588595) loss:2.998 lr:0.0000310 epoch_Time:28433.0min: [2024-01-03 00:36:38,144][model8_pretrain.py][INFO] Epoch:[0/2](66300/4588595) loss:2.992 lr:0.0000310 epoch_Time:28433.0min: [2024-01-03 00:37:16,765][model8_pretrain.py][INFO] Epoch:[0/2](66400/4588595) 
loss:3.218 lr:0.0000307 epoch_Time:28433.0min: [2024-01-03 00:37:16,765][model8_pretrain.py][INFO] Epoch:[0/2](66400/4588595) loss:2.226 lr:0.0000307 epoch_Time:28433.0min: [2024-01-03 00:37:16,765][model8_pretrain.py][INFO] Epoch:[0/2](66400/4588595) loss:2.831 lr:0.0000307 epoch_Time:28433.0min: [2024-01-03 00:37:16,765][model8_pretrain.py][INFO] Epoch:[0/2](66400/4588595) loss:3.233 lr:0.0000307 epoch_Time:28433.0min: [2024-01-03 00:37:16,765][model8_pretrain.py][INFO] Epoch:[0/2](66400/4588595) loss:3.500 lr:0.0000307 epoch_Time:28433.0min: [2024-01-03 00:37:16,765][model8_pretrain.py][INFO] Epoch:[0/2](66400/4588595) loss:2.973 lr:0.0000307 epoch_Time:28433.0min: [2024-01-03 00:37:16,765][model8_pretrain.py][INFO] Epoch:[0/2](66400/4588595) loss:3.302 lr:0.0000307 epoch_Time:28433.0min: [2024-01-03 00:37:16,765][model8_pretrain.py][INFO] Epoch:[0/2](66400/4588595) loss:2.965 lr:0.0000307 epoch_Time:28433.0min: [2024-01-03 00:37:53,692][model8_pretrain.py][INFO] Epoch:[0/2](66500/4588595) loss:3.059 lr:0.0000304 epoch_Time:28431.0min: [2024-01-03 00:37:53,692][model8_pretrain.py][INFO] Epoch:[0/2](66500/4588595) loss:3.126 lr:0.0000304 epoch_Time:28431.0min: [2024-01-03 00:37:53,692][model8_pretrain.py][INFO] Epoch:[0/2](66500/4588595) loss:3.148 lr:0.0000304 epoch_Time:28431.0min: [2024-01-03 00:37:53,692][model8_pretrain.py][INFO] Epoch:[0/2](66500/4588595) loss:2.572 lr:0.0000304 epoch_Time:28431.0min: [2024-01-03 00:37:53,692][model8_pretrain.py][INFO] Epoch:[0/2](66500/4588595) loss:2.796 lr:0.0000304 epoch_Time:28431.0min: [2024-01-03 00:37:53,692][model8_pretrain.py][INFO] Epoch:[0/2](66500/4588595) loss:3.242 lr:0.0000304 epoch_Time:28431.0min: [2024-01-03 00:37:53,692][model8_pretrain.py][INFO] Epoch:[0/2](66500/4588595) loss:3.348 lr:0.0000304 epoch_Time:28431.0min: [2024-01-03 00:37:53,692][model8_pretrain.py][INFO] Epoch:[0/2](66500/4588595) loss:3.178 lr:0.0000304 epoch_Time:28431.0min: [2024-01-03 00:38:30,641][model8_pretrain.py][INFO] 
Epoch:[0/2](66600/4588595) loss:3.502 lr:0.0000301 epoch_Time:28430.0min: [2024-01-03 00:38:30,641][model8_pretrain.py][INFO] Epoch:[0/2](66600/4588595) loss:3.283 lr:0.0000301 epoch_Time:28430.0min: [2024-01-03 00:38:30,641][model8_pretrain.py][INFO] Epoch:[0/2](66600/4588595) loss:2.586 lr:0.0000301 epoch_Time:28430.0min: [2024-01-03 00:38:30,641][model8_pretrain.py][INFO] Epoch:[0/2](66600/4588595) loss:2.704 lr:0.0000301 epoch_Time:28430.0min: [2024-01-03 00:38:30,641][model8_pretrain.py][INFO] Epoch:[0/2](66600/4588595) loss:2.660 lr:0.0000301 epoch_Time:28430.0min: [2024-01-03 00:38:30,641][model8_pretrain.py][INFO] Epoch:[0/2](66600/4588595) loss:3.295 lr:0.0000301 epoch_Time:28430.0min: [2024-01-03 00:38:30,641][model8_pretrain.py][INFO] Epoch:[0/2](66600/4588595) loss:2.995 lr:0.0000301 epoch_Time:28430.0min: [2024-01-03 00:38:30,641][model8_pretrain.py][INFO] Epoch:[0/2](66600/4588595) loss:2.602 lr:0.0000301 epoch_Time:28430.0min: [2024-01-03 00:39:07,576][model8_pretrain.py][INFO] Epoch:[0/2](66700/4588595) loss:3.623 lr:0.0000298 epoch_Time:28428.0min: [2024-01-03 00:39:07,576][model8_pretrain.py][INFO] Epoch:[0/2](66700/4588595) loss:3.161 lr:0.0000298 epoch_Time:28428.0min: [2024-01-03 00:39:07,576][model8_pretrain.py][INFO] Epoch:[0/2](66700/4588595) loss:3.216 lr:0.0000298 epoch_Time:28428.0min: [2024-01-03 00:39:07,576][model8_pretrain.py][INFO] Epoch:[0/2](66700/4588595) loss:3.324 lr:0.0000298 epoch_Time:28428.0min: [2024-01-03 00:39:07,576][model8_pretrain.py][INFO] Epoch:[0/2](66700/4588595) loss:3.304 lr:0.0000298 epoch_Time:28428.0min: [2024-01-03 00:39:07,576][model8_pretrain.py][INFO] Epoch:[0/2](66700/4588595) loss:2.640 lr:0.0000298 epoch_Time:28428.0min: [2024-01-03 00:39:07,577][model8_pretrain.py][INFO] Epoch:[0/2](66700/4588595) loss:2.857 lr:0.0000298 epoch_Time:28428.0min: [2024-01-03 00:39:07,577][model8_pretrain.py][INFO] Epoch:[0/2](66700/4588595) loss:2.552 lr:0.0000298 epoch_Time:28428.0min: [2024-01-03 
00:39:44,506][model8_pretrain.py][INFO] Epoch:[0/2](66800/4588595) loss:3.336 lr:0.0000295 epoch_Time:28427.0min: [2024-01-03 00:39:44,506][model8_pretrain.py][INFO] Epoch:[0/2](66800/4588595) loss:2.700 lr:0.0000295 epoch_Time:28427.0min: [2024-01-03 00:39:44,506][model8_pretrain.py][INFO] Epoch:[0/2](66800/4588595) loss:2.563 lr:0.0000295 epoch_Time:28427.0min: [2024-01-03 00:39:44,506][model8_pretrain.py][INFO] Epoch:[0/2](66800/4588595) loss:2.939 lr:0.0000295 epoch_Time:28427.0min: [2024-01-03 00:39:44,506][model8_pretrain.py][INFO] Epoch:[0/2](66800/4588595) loss:3.280 lr:0.0000295 epoch_Time:28427.0min: [2024-01-03 00:39:44,506][model8_pretrain.py][INFO] Epoch:[0/2](66800/4588595) loss:2.549 lr:0.0000295 epoch_Time:28427.0min: [2024-01-03 00:39:44,506][model8_pretrain.py][INFO] Epoch:[0/2](66800/4588595) loss:3.041 lr:0.0000295 epoch_Time:28427.0min: [2024-01-03 00:39:44,506][model8_pretrain.py][INFO] Epoch:[0/2](66800/4588595) loss:3.025 lr:0.0000295 epoch_Time:28427.0min: [2024-01-03 00:40:21,430][model8_pretrain.py][INFO] Epoch:[0/2](66900/4588595) loss:3.323 lr:0.0000292 epoch_Time:28425.0min: [2024-01-03 00:40:21,430][model8_pretrain.py][INFO] Epoch:[0/2](66900/4588595) loss:3.595 lr:0.0000292 epoch_Time:28425.0min: [2024-01-03 00:40:21,430][model8_pretrain.py][INFO] Epoch:[0/2](66900/4588595) loss:3.048 lr:0.0000292 epoch_Time:28425.0min: [2024-01-03 00:40:21,430][model8_pretrain.py][INFO] Epoch:[0/2](66900/4588595) loss:2.976 lr:0.0000292 epoch_Time:28425.0min: [2024-01-03 00:40:21,430][model8_pretrain.py][INFO] Epoch:[0/2](66900/4588595) loss:2.680 lr:0.0000292 epoch_Time:28425.0min: [2024-01-03 00:40:21,430][model8_pretrain.py][INFO] Epoch:[0/2](66900/4588595) loss:2.861 lr:0.0000292 epoch_Time:28425.0min: [2024-01-03 00:40:21,430][model8_pretrain.py][INFO] Epoch:[0/2](66900/4588595) loss:3.124 lr:0.0000292 epoch_Time:28425.0min: [2024-01-03 00:40:21,430][model8_pretrain.py][INFO] Epoch:[0/2](66900/4588595) loss:3.056 lr:0.0000292 
epoch_Time:28425.0min: [2024-01-03 00:40:58,371][model8_pretrain.py][INFO] Epoch:[0/2](67000/4588595) loss:3.332 lr:0.0000289 epoch_Time:28423.0min: [2024-01-03 00:40:58,371][model8_pretrain.py][INFO] Epoch:[0/2](67000/4588595) loss:2.999 lr:0.0000289 epoch_Time:28423.0min: [2024-01-03 00:40:58,371][model8_pretrain.py][INFO] Epoch:[0/2](67000/4588595) loss:3.141 lr:0.0000289 epoch_Time:28423.0min: [2024-01-03 00:40:58,371][model8_pretrain.py][INFO] Epoch:[0/2](67000/4588595) loss:2.865 lr:0.0000289 epoch_Time:28423.0min: [2024-01-03 00:40:58,371][model8_pretrain.py][INFO] Epoch:[0/2](67000/4588595) loss:3.063 lr:0.0000289 epoch_Time:28423.0min: [2024-01-03 00:40:58,371][model8_pretrain.py][INFO] Epoch:[0/2](67000/4588595) loss:3.514 lr:0.0000289 epoch_Time:28423.0min: [2024-01-03 00:40:58,371][model8_pretrain.py][INFO] Epoch:[0/2](67000/4588595) loss:2.808 lr:0.0000289 epoch_Time:28423.0min: [2024-01-03 00:40:58,371][model8_pretrain.py][INFO] Epoch:[0/2](67000/4588595) loss:3.149 lr:0.0000289 epoch_Time:28423.0min: [2024-01-03 00:41:37,116][model8_pretrain.py][INFO] Epoch:[0/2](67100/4588595) loss:3.028 lr:0.0000287 epoch_Time:28424.0min: [2024-01-03 00:41:37,116][model8_pretrain.py][INFO] Epoch:[0/2](67100/4588595) loss:3.020 lr:0.0000287 epoch_Time:28424.0min: [2024-01-03 00:41:37,116][model8_pretrain.py][INFO] Epoch:[0/2](67100/4588595) loss:3.410 lr:0.0000287 epoch_Time:28424.0min: [2024-01-03 00:41:37,116][model8_pretrain.py][INFO] Epoch:[0/2](67100/4588595) loss:3.254 lr:0.0000287 epoch_Time:28424.0min: [2024-01-03 00:41:37,116][model8_pretrain.py][INFO] Epoch:[0/2](67100/4588595) loss:2.980 lr:0.0000287 epoch_Time:28424.0min: [2024-01-03 00:41:37,116][model8_pretrain.py][INFO] Epoch:[0/2](67100/4588595) loss:3.115 lr:0.0000287 epoch_Time:28424.0min: [2024-01-03 00:41:37,116][model8_pretrain.py][INFO] Epoch:[0/2](67100/4588595) loss:2.988 lr:0.0000287 epoch_Time:28424.0min: [2024-01-03 00:41:37,117][model8_pretrain.py][INFO] Epoch:[0/2](67100/4588595) 
loss:3.223 lr:0.0000287 epoch_Time:28424.0min: [2024-01-03 00:42:19,279][model8_pretrain.py][INFO] Epoch:[0/2](67200/4588595) loss:2.775 lr:0.0000284 epoch_Time:28428.0min: [2024-01-03 00:42:19,279][model8_pretrain.py][INFO] Epoch:[0/2](67200/4588595) loss:2.773 lr:0.0000284 epoch_Time:28428.0min: [2024-01-03 00:42:19,279][model8_pretrain.py][INFO] Epoch:[0/2](67200/4588595) loss:3.038 lr:0.0000284 epoch_Time:28428.0min: [2024-01-03 00:42:19,279][model8_pretrain.py][INFO] Epoch:[0/2](67200/4588595) loss:2.716 lr:0.0000284 epoch_Time:28428.0min: [2024-01-03 00:42:19,279][model8_pretrain.py][INFO] Epoch:[0/2](67200/4588595) loss:2.990 lr:0.0000284 epoch_Time:28428.0min: [2024-01-03 00:42:19,279][model8_pretrain.py][INFO] Epoch:[0/2](67200/4588595) loss:2.797 lr:0.0000284 epoch_Time:28428.0min: [2024-01-03 00:42:19,279][model8_pretrain.py][INFO] Epoch:[0/2](67200/4588595) loss:3.132 lr:0.0000284 epoch_Time:28428.0min: [2024-01-03 00:42:19,279][model8_pretrain.py][INFO] Epoch:[0/2](67200/4588595) loss:2.987 lr:0.0000284 epoch_Time:28428.0min: [2024-01-03 00:42:56,218][model8_pretrain.py][INFO] Epoch:[0/2](67300/4588595) loss:2.864 lr:0.0000281 epoch_Time:28427.0min: [2024-01-03 00:42:56,218][model8_pretrain.py][INFO] Epoch:[0/2](67300/4588595) loss:2.933 lr:0.0000281 epoch_Time:28426.0min: [2024-01-03 00:42:56,218][model8_pretrain.py][INFO] Epoch:[0/2](67300/4588595) loss:2.891 lr:0.0000281 epoch_Time:28427.0min: [2024-01-03 00:42:56,218][model8_pretrain.py][INFO] Epoch:[0/2](67300/4588595) loss:2.766 lr:0.0000281 epoch_Time:28426.0min: [2024-01-03 00:42:56,218][model8_pretrain.py][INFO] Epoch:[0/2](67300/4588595) loss:3.334 lr:0.0000281 epoch_Time:28427.0min: [2024-01-03 00:42:56,218][model8_pretrain.py][INFO] Epoch:[0/2](67300/4588595) loss:3.170 lr:0.0000281 epoch_Time:28427.0min: [2024-01-03 00:42:56,219][model8_pretrain.py][INFO] Epoch:[0/2](67300/4588595) loss:2.579 lr:0.0000281 epoch_Time:28427.0min: [2024-01-03 00:42:56,219][model8_pretrain.py][INFO] 
Epoch:[0/2](67300/4588595) loss:3.637 lr:0.0000281 epoch_Time:28427.0min: [2024-01-03 00:43:33,157][model8_pretrain.py][INFO] Epoch:[0/2](67400/4588595) loss:3.204 lr:0.0000278 epoch_Time:28426.0min: [2024-01-03 00:43:33,157][model8_pretrain.py][INFO] Epoch:[0/2](67400/4588595) loss:3.286 lr:0.0000278 epoch_Time:28426.0min: [2024-01-03 00:43:33,157][model8_pretrain.py][INFO] Epoch:[0/2](67400/4588595) loss:3.475 lr:0.0000278 epoch_Time:28426.0min: [2024-01-03 00:43:33,157][model8_pretrain.py][INFO] Epoch:[0/2](67400/4588595) loss:2.888 lr:0.0000278 epoch_Time:28426.0min: [2024-01-03 00:43:33,157][model8_pretrain.py][INFO] Epoch:[0/2](67400/4588595) loss:3.251 lr:0.0000278 epoch_Time:28426.0min: [2024-01-03 00:43:33,157][model8_pretrain.py][INFO] Epoch:[0/2](67400/4588595) loss:2.605 lr:0.0000278 epoch_Time:28426.0min: [2024-01-03 00:43:33,157][model8_pretrain.py][INFO] Epoch:[0/2](67400/4588595) loss:3.303 lr:0.0000278 epoch_Time:28426.0min: [2024-01-03 00:43:33,157][model8_pretrain.py][INFO] Epoch:[0/2](67400/4588595) loss:2.813 lr:0.0000278 epoch_Time:28426.0min: [2024-01-03 00:44:10,110][model8_pretrain.py][INFO] Epoch:[0/2](67500/4588595) loss:3.028 lr:0.0000275 epoch_Time:28424.0min: [2024-01-03 00:44:10,110][model8_pretrain.py][INFO] Epoch:[0/2](67500/4588595) loss:3.185 lr:0.0000275 epoch_Time:28424.0min: [2024-01-03 00:44:10,110][model8_pretrain.py][INFO] Epoch:[0/2](67500/4588595) loss:3.200 lr:0.0000275 epoch_Time:28424.0min: [2024-01-03 00:44:10,110][model8_pretrain.py][INFO] Epoch:[0/2](67500/4588595) loss:2.829 lr:0.0000275 epoch_Time:28424.0min: [2024-01-03 00:44:10,110][model8_pretrain.py][INFO] Epoch:[0/2](67500/4588595) loss:2.967 lr:0.0000275 epoch_Time:28424.0min: [2024-01-03 00:44:10,110][model8_pretrain.py][INFO] Epoch:[0/2](67500/4588595) loss:3.239 lr:0.0000275 epoch_Time:28424.0min: [2024-01-03 00:44:10,110][model8_pretrain.py][INFO] Epoch:[0/2](67500/4588595) loss:2.618 lr:0.0000275 epoch_Time:28424.0min: [2024-01-03 
00:44:10,110][model8_pretrain.py][INFO] Epoch:[0/2](67500/4588595) loss:3.176 lr:0.0000275 epoch_Time:28424.0min: [2024-01-03 00:44:47,044][model8_pretrain.py][INFO] Epoch:[0/2](67600/4588595) loss:3.088 lr:0.0000273 epoch_Time:28423.0min: [2024-01-03 00:44:47,044][model8_pretrain.py][INFO] Epoch:[0/2](67600/4588595) loss:2.902 lr:0.0000273 epoch_Time:28423.0min: [2024-01-03 00:44:47,044][model8_pretrain.py][INFO] Epoch:[0/2](67600/4588595) loss:3.085 lr:0.0000273 epoch_Time:28423.0min: [2024-01-03 00:44:47,044][model8_pretrain.py][INFO] Epoch:[0/2](67600/4588595) loss:3.567 lr:0.0000273 epoch_Time:28423.0min: [2024-01-03 00:44:47,044][model8_pretrain.py][INFO] Epoch:[0/2](67600/4588595) loss:3.131 lr:0.0000273 epoch_Time:28423.0min: [2024-01-03 00:44:47,044][model8_pretrain.py][INFO] Epoch:[0/2](67600/4588595) loss:2.209 lr:0.0000273 epoch_Time:28423.0min: [2024-01-03 00:44:47,045][model8_pretrain.py][INFO] Epoch:[0/2](67600/4588595) loss:3.250 lr:0.0000273 epoch_Time:28423.0min: [2024-01-03 00:44:47,045][model8_pretrain.py][INFO] Epoch:[0/2](67600/4588595) loss:3.142 lr:0.0000273 epoch_Time:28423.0min: [2024-01-03 00:45:23,973][model8_pretrain.py][INFO] Epoch:[0/2](67700/4588595) loss:3.066 lr:0.0000270 epoch_Time:28421.0min: [2024-01-03 00:45:23,973][model8_pretrain.py][INFO] Epoch:[0/2](67700/4588595) loss:3.389 lr:0.0000270 epoch_Time:28421.0min: [2024-01-03 00:45:23,973][model8_pretrain.py][INFO] Epoch:[0/2](67700/4588595) loss:3.578 lr:0.0000270 epoch_Time:28421.0min: [2024-01-03 00:45:23,973][model8_pretrain.py][INFO] Epoch:[0/2](67700/4588595) loss:2.478 lr:0.0000270 epoch_Time:28421.0min: [2024-01-03 00:45:23,973][model8_pretrain.py][INFO] Epoch:[0/2](67700/4588595) loss:3.347 lr:0.0000270 epoch_Time:28421.0min: [2024-01-03 00:45:23,973][model8_pretrain.py][INFO] Epoch:[0/2](67700/4588595) loss:3.405 lr:0.0000270 epoch_Time:28421.0min: [2024-01-03 00:45:23,973][model8_pretrain.py][INFO] Epoch:[0/2](67700/4588595) loss:3.127 lr:0.0000270 
epoch_Time:28421.0min: [2024-01-03 00:45:23,973][model8_pretrain.py][INFO] Epoch:[0/2](67700/4588595) loss:3.434 lr:0.0000270 epoch_Time:28421.0min: [2024-01-03 00:46:00,932][model8_pretrain.py][INFO] Epoch:[0/2](67800/4588595) loss:3.502 lr:0.0000267 epoch_Time:28419.0min: [2024-01-03 00:46:00,932][model8_pretrain.py][INFO] Epoch:[0/2](67800/4588595) loss:2.872 lr:0.0000267 epoch_Time:28419.0min: [2024-01-03 00:46:00,932][model8_pretrain.py][INFO] Epoch:[0/2](67800/4588595) loss:3.328 lr:0.0000267 epoch_Time:28419.0min: [2024-01-03 00:46:00,932][model8_pretrain.py][INFO] Epoch:[0/2](67800/4588595) loss:3.171 lr:0.0000267 epoch_Time:28419.0min: [2024-01-03 00:46:00,932][model8_pretrain.py][INFO] Epoch:[0/2](67800/4588595) loss:3.014 lr:0.0000267 epoch_Time:28419.0min: [2024-01-03 00:46:00,932][model8_pretrain.py][INFO] Epoch:[0/2](67800/4588595) loss:2.959 lr:0.0000267 epoch_Time:28419.0min: [2024-01-03 00:46:00,932][model8_pretrain.py][INFO] Epoch:[0/2](67800/4588595) loss:2.947 lr:0.0000267 epoch_Time:28419.0min: [2024-01-03 00:46:00,932][model8_pretrain.py][INFO] Epoch:[0/2](67800/4588595) loss:3.589 lr:0.0000267 epoch_Time:28419.0min: [2024-01-03 00:46:39,695][model8_pretrain.py][INFO] Epoch:[0/2](67900/4588595) loss:2.814 lr:0.0000265 epoch_Time:28420.0min: [2024-01-03 00:46:39,695][model8_pretrain.py][INFO] Epoch:[0/2](67900/4588595) loss:3.171 lr:0.0000265 epoch_Time:28420.0min: [2024-01-03 00:46:39,695][model8_pretrain.py][INFO] Epoch:[0/2](67900/4588595) loss:3.473 lr:0.0000265 epoch_Time:28420.0min: [2024-01-03 00:46:39,695][model8_pretrain.py][INFO] Epoch:[0/2](67900/4588595) loss:3.053 lr:0.0000265 epoch_Time:28420.0min: [2024-01-03 00:46:39,695][model8_pretrain.py][INFO] Epoch:[0/2](67900/4588595) loss:2.817 lr:0.0000265 epoch_Time:28420.0min: [2024-01-03 00:46:39,695][model8_pretrain.py][INFO] Epoch:[0/2](67900/4588595) loss:2.679 lr:0.0000265 epoch_Time:28420.0min: [2024-01-03 00:46:39,695][model8_pretrain.py][INFO] Epoch:[0/2](67900/4588595) 
loss:2.372 lr:0.0000265 epoch_Time:28420.0min: [2024-01-03 00:46:39,696][model8_pretrain.py][INFO] Epoch:[0/2](67900/4588595) loss:2.958 lr:0.0000265 epoch_Time:28420.0min: [2024-01-03 00:47:21,885][model8_pretrain.py][INFO] Epoch:[0/2](68000/4588595) loss:2.760 lr:0.0000262 epoch_Time:28424.0min: [2024-01-03 00:47:21,885][model8_pretrain.py][INFO] Epoch:[0/2](68000/4588595) loss:3.311 lr:0.0000262 epoch_Time:28424.0min: [2024-01-03 00:47:21,885][model8_pretrain.py][INFO] Epoch:[0/2](68000/4588595) loss:3.073 lr:0.0000262 epoch_Time:28424.0min: [2024-01-03 00:47:21,885][model8_pretrain.py][INFO] Epoch:[0/2](68000/4588595) loss:2.847 lr:0.0000262 epoch_Time:28424.0min: [2024-01-03 00:47:21,885][model8_pretrain.py][INFO] Epoch:[0/2](68000/4588595) loss:2.717 lr:0.0000262 epoch_Time:28424.0min: [2024-01-03 00:47:21,885][model8_pretrain.py][INFO] Epoch:[0/2](68000/4588595) loss:2.971 lr:0.0000262 epoch_Time:28424.0min: [2024-01-03 00:47:21,885][model8_pretrain.py][INFO] Epoch:[0/2](68000/4588595) loss:3.203 lr:0.0000262 epoch_Time:28424.0min: [2024-01-03 00:47:21,885][model8_pretrain.py][INFO] Epoch:[0/2](68000/4588595) loss:3.008 lr:0.0000262 epoch_Time:28424.0min: [2024-01-03 00:47:58,829][model8_pretrain.py][INFO] Epoch:[0/2](68100/4588595) loss:3.290 lr:0.0000259 epoch_Time:28422.0min: [2024-01-03 00:47:58,829][model8_pretrain.py][INFO] Epoch:[0/2](68100/4588595) loss:3.081 lr:0.0000259 epoch_Time:28422.0min: [2024-01-03 00:47:58,829][model8_pretrain.py][INFO] Epoch:[0/2](68100/4588595) loss:3.502 lr:0.0000259 epoch_Time:28422.0min: [2024-01-03 00:47:58,829][model8_pretrain.py][INFO] Epoch:[0/2](68100/4588595) loss:2.743 lr:0.0000259 epoch_Time:28422.0min: [2024-01-03 00:47:58,829][model8_pretrain.py][INFO] Epoch:[0/2](68100/4588595) loss:3.086 lr:0.0000259 epoch_Time:28422.0min: [2024-01-03 00:47:58,829][model8_pretrain.py][INFO] Epoch:[0/2](68100/4588595) loss:3.327 lr:0.0000259 epoch_Time:28422.0min: [2024-01-03 00:47:58,829][model8_pretrain.py][INFO] 
Epoch:[0/2](68100/4588595) loss:2.391 lr:0.0000259 epoch_Time:28422.0min: [2024-01-03 00:47:58,830][model8_pretrain.py][INFO] Epoch:[0/2](68100/4588595) loss:3.137 lr:0.0000259 epoch_Time:28422.0min: [2024-01-03 00:48:35,761][model8_pretrain.py][INFO] Epoch:[0/2](68200/4588595) loss:3.287 lr:0.0000257 epoch_Time:28422.0min: [2024-01-03 00:48:35,761][model8_pretrain.py][INFO] Epoch:[0/2](68200/4588595) loss:2.886 lr:0.0000257 epoch_Time:28422.0min: [2024-01-03 00:48:35,761][model8_pretrain.py][INFO] Epoch:[0/2](68200/4588595) loss:3.118 lr:0.0000257 epoch_Time:28422.0min: [2024-01-03 00:48:35,761][model8_pretrain.py][INFO] Epoch:[0/2](68200/4588595) loss:3.158 lr:0.0000257 epoch_Time:28422.0min: [2024-01-03 00:48:35,761][model8_pretrain.py][INFO] Epoch:[0/2](68200/4588595) loss:2.532 lr:0.0000257 epoch_Time:28422.0min: [2024-01-03 00:48:35,761][model8_pretrain.py][INFO] Epoch:[0/2](68200/4588595) loss:3.108 lr:0.0000257 epoch_Time:28422.0min: [2024-01-03 00:48:35,761][model8_pretrain.py][INFO] Epoch:[0/2](68200/4588595) loss:2.800 lr:0.0000257 epoch_Time:28422.0min: [2024-01-03 00:48:35,761][model8_pretrain.py][INFO] Epoch:[0/2](68200/4588595) loss:2.835 lr:0.0000257 epoch_Time:28422.0min: [2024-01-03 00:49:12,693][model8_pretrain.py][INFO] Epoch:[0/2](68300/4588595) loss:2.828 lr:0.0000254 epoch_Time:28420.0min: [2024-01-03 00:49:12,693][model8_pretrain.py][INFO] Epoch:[0/2](68300/4588595) loss:3.172 lr:0.0000254 epoch_Time:28420.0min: [2024-01-03 00:49:12,693][model8_pretrain.py][INFO] Epoch:[0/2](68300/4588595) loss:3.016 lr:0.0000254 epoch_Time:28420.0min: [2024-01-03 00:49:12,693][model8_pretrain.py][INFO] Epoch:[0/2](68300/4588595) loss:3.334 lr:0.0000254 epoch_Time:28420.0min: [2024-01-03 00:49:12,693][model8_pretrain.py][INFO] Epoch:[0/2](68300/4588595) loss:3.441 lr:0.0000254 epoch_Time:28420.0min: [2024-01-03 00:49:12,693][model8_pretrain.py][INFO] Epoch:[0/2](68300/4588595) loss:3.307 lr:0.0000254 epoch_Time:28420.0min: [2024-01-03 
00:49:12,694][model8_pretrain.py][INFO] Epoch:[0/2](68300/4588595) loss:3.676 lr:0.0000254 epoch_Time:28420.0min: [2024-01-03 00:49:12,694][model8_pretrain.py][INFO] Epoch:[0/2](68300/4588595) loss:2.685 lr:0.0000254 epoch_Time:28420.0min: [2024-01-03 00:49:49,626][model8_pretrain.py][INFO] Epoch:[0/2](68400/4588595) loss:2.913 lr:0.0000252 epoch_Time:28418.0min: [2024-01-03 00:49:49,626][model8_pretrain.py][INFO] Epoch:[0/2](68400/4588595) loss:3.200 lr:0.0000252 epoch_Time:28418.0min: [2024-01-03 00:49:49,626][model8_pretrain.py][INFO] Epoch:[0/2](68400/4588595) loss:3.451 lr:0.0000252 epoch_Time:28418.0min: [2024-01-03 00:49:49,626][model8_pretrain.py][INFO] Epoch:[0/2](68400/4588595) loss:2.582 lr:0.0000252 epoch_Time:28418.0min: [2024-01-03 00:49:49,626][model8_pretrain.py][INFO] Epoch:[0/2](68400/4588595) loss:3.279 lr:0.0000252 epoch_Time:28418.0min: [2024-01-03 00:49:49,626][model8_pretrain.py][INFO] Epoch:[0/2](68400/4588595) loss:3.236 lr:0.0000252 epoch_Time:28418.0min: [2024-01-03 00:49:49,626][model8_pretrain.py][INFO] Epoch:[0/2](68400/4588595) loss:3.351 lr:0.0000252 epoch_Time:28418.0min: [2024-01-03 00:49:49,626][model8_pretrain.py][INFO] Epoch:[0/2](68400/4588595) loss:3.379 lr:0.0000252 epoch_Time:28418.0min: [2024-01-03 00:50:26,566][model8_pretrain.py][INFO] Epoch:[0/2](68500/4588595) loss:3.184 lr:0.0000249 epoch_Time:28417.0min: [2024-01-03 00:50:26,566][model8_pretrain.py][INFO] Epoch:[0/2](68500/4588595) loss:3.418 lr:0.0000249 epoch_Time:28417.0min: [2024-01-03 00:50:26,566][model8_pretrain.py][INFO] Epoch:[0/2](68500/4588595) loss:3.152 lr:0.0000249 epoch_Time:28417.0min: [2024-01-03 00:50:26,566][model8_pretrain.py][INFO] Epoch:[0/2](68500/4588595) loss:3.266 lr:0.0000249 epoch_Time:28417.0min: [2024-01-03 00:50:26,567][model8_pretrain.py][INFO] Epoch:[0/2](68500/4588595) loss:3.136 lr:0.0000249 epoch_Time:28417.0min: [2024-01-03 00:50:26,567][model8_pretrain.py][INFO] Epoch:[0/2](68500/4588595) loss:3.250 lr:0.0000249 
epoch_Time:28417.0min: [2024-01-03 00:50:26,567][model8_pretrain.py][INFO] Epoch:[0/2](68500/4588595) loss:3.389 lr:0.0000249 epoch_Time:28417.0min: [2024-01-03 00:50:26,567][model8_pretrain.py][INFO] Epoch:[0/2](68500/4588595) loss:3.039 lr:0.0000249 epoch_Time:28417.0min: [2024-01-03 00:51:03,498][model8_pretrain.py][INFO] Epoch:[0/2](68600/4588595) loss:3.148 lr:0.0000246 epoch_Time:28415.0min: [2024-01-03 00:51:03,498][model8_pretrain.py][INFO] Epoch:[0/2](68600/4588595) loss:2.965 lr:0.0000246 epoch_Time:28415.0min: [2024-01-03 00:51:03,498][model8_pretrain.py][INFO] Epoch:[0/2](68600/4588595) loss:2.773 lr:0.0000246 epoch_Time:28415.0min: [2024-01-03 00:51:03,498][model8_pretrain.py][INFO] Epoch:[0/2](68600/4588595) loss:2.919 lr:0.0000246 epoch_Time:28415.0min: [2024-01-03 00:51:03,498][model8_pretrain.py][INFO] Epoch:[0/2](68600/4588595) loss:3.079 lr:0.0000246 epoch_Time:28415.0min: [2024-01-03 00:51:03,498][model8_pretrain.py][INFO] Epoch:[0/2](68600/4588595) loss:3.542 lr:0.0000246 epoch_Time:28415.0min: [2024-01-03 00:51:03,498][model8_pretrain.py][INFO] Epoch:[0/2](68600/4588595) loss:2.886 lr:0.0000246 epoch_Time:28415.0min: [2024-01-03 00:51:03,498][model8_pretrain.py][INFO] Epoch:[0/2](68600/4588595) loss:2.914 lr:0.0000246 epoch_Time:28415.0min: [2024-01-03 00:51:42,182][model8_pretrain.py][INFO] Epoch:[0/2](68700/4588595) loss:2.628 lr:0.0000244 epoch_Time:28416.0min: [2024-01-03 00:51:42,182][model8_pretrain.py][INFO] Epoch:[0/2](68700/4588595) loss:3.346 lr:0.0000244 epoch_Time:28416.0min: [2024-01-03 00:51:42,182][model8_pretrain.py][INFO] Epoch:[0/2](68700/4588595) loss:3.077 lr:0.0000244 epoch_Time:28416.0min: [2024-01-03 00:51:42,182][model8_pretrain.py][INFO] Epoch:[0/2](68700/4588595) loss:2.916 lr:0.0000244 epoch_Time:28416.0min: [2024-01-03 00:51:42,182][model8_pretrain.py][INFO] Epoch:[0/2](68700/4588595) loss:2.907 lr:0.0000244 epoch_Time:28416.0min: [2024-01-03 00:51:42,182][model8_pretrain.py][INFO] Epoch:[0/2](68700/4588595) 
loss:3.359 lr:0.0000244 epoch_Time:28416.0min: [2024-01-03 00:51:42,182][model8_pretrain.py][INFO] Epoch:[0/2](68700/4588595) loss:3.648 lr:0.0000244 epoch_Time:28416.0min: [2024-01-03 00:51:42,182][model8_pretrain.py][INFO] Epoch:[0/2](68700/4588595) loss:3.373 lr:0.0000244 epoch_Time:28416.0min: [2024-01-03 00:52:24,399][model8_pretrain.py][INFO] Epoch:[0/2](68800/4588595) loss:3.162 lr:0.0000241 epoch_Time:28420.0min: [2024-01-03 00:52:24,399][model8_pretrain.py][INFO] Epoch:[0/2](68800/4588595) loss:2.628 lr:0.0000241 epoch_Time:28420.0min: [2024-01-03 00:52:24,399][model8_pretrain.py][INFO] Epoch:[0/2](68800/4588595) loss:2.502 lr:0.0000241 epoch_Time:28420.0min: [2024-01-03 00:52:24,399][model8_pretrain.py][INFO] Epoch:[0/2](68800/4588595) loss:3.104 lr:0.0000241 epoch_Time:28420.0min: [2024-01-03 00:52:24,399][model8_pretrain.py][INFO] Epoch:[0/2](68800/4588595) loss:2.928 lr:0.0000241 epoch_Time:28420.0min: [2024-01-03 00:52:24,399][model8_pretrain.py][INFO] Epoch:[0/2](68800/4588595) loss:3.044 lr:0.0000241 epoch_Time:28420.0min: [2024-01-03 00:52:24,399][model8_pretrain.py][INFO] Epoch:[0/2](68800/4588595) loss:2.853 lr:0.0000241 epoch_Time:28420.0min: [2024-01-03 00:52:24,399][model8_pretrain.py][INFO] Epoch:[0/2](68800/4588595) loss:2.724 lr:0.0000241 epoch_Time:28420.0min: [2024-01-03 00:53:01,344][model8_pretrain.py][INFO] Epoch:[0/2](68900/4588595) loss:2.598 lr:0.0000239 epoch_Time:28418.0min: [2024-01-03 00:53:01,344][model8_pretrain.py][INFO] Epoch:[0/2](68900/4588595) loss:2.750 lr:0.0000239 epoch_Time:28418.0min: [2024-01-03 00:53:01,344][model8_pretrain.py][INFO] Epoch:[0/2](68900/4588595) loss:3.109 lr:0.0000239 epoch_Time:28418.0min: [2024-01-03 00:53:01,344][model8_pretrain.py][INFO] Epoch:[0/2](68900/4588595) loss:2.397 lr:0.0000239 epoch_Time:28418.0min: [2024-01-03 00:53:01,344][model8_pretrain.py][INFO] Epoch:[0/2](68900/4588595) loss:3.530 lr:0.0000239 epoch_Time:28418.0min: [2024-01-03 00:53:01,344][model8_pretrain.py][INFO] 
Epoch:[0/2](68900/4588595) loss:2.903 lr:0.0000239 epoch_Time:28418.0min: [2024-01-03 00:53:01,344][model8_pretrain.py][INFO] Epoch:[0/2](68900/4588595) loss:3.141 lr:0.0000239 epoch_Time:28418.0min: [2024-01-03 00:53:01,344][model8_pretrain.py][INFO] Epoch:[0/2](68900/4588595) loss:3.270 lr:0.0000239 epoch_Time:28418.0min: [2024-01-03 00:53:38,284][model8_pretrain.py][INFO] Epoch:[0/2](69000/4588595) loss:2.622 lr:0.0000237 epoch_Time:28417.0min: [2024-01-03 00:53:38,284][model8_pretrain.py][INFO] Epoch:[0/2](69000/4588595) loss:3.323 lr:0.0000237 epoch_Time:28417.0min: [2024-01-03 00:53:38,284][model8_pretrain.py][INFO] Epoch:[0/2](69000/4588595) loss:2.847 lr:0.0000237 epoch_Time:28417.0min: [2024-01-03 00:53:38,284][model8_pretrain.py][INFO] Epoch:[0/2](69000/4588595) loss:3.088 lr:0.0000237 epoch_Time:28417.0min: [2024-01-03 00:53:38,284][model8_pretrain.py][INFO] Epoch:[0/2](69000/4588595) loss:2.934 lr:0.0000237 epoch_Time:28417.0min: [2024-01-03 00:53:38,284][model8_pretrain.py][INFO] Epoch:[0/2](69000/4588595) loss:3.244 lr:0.0000237 epoch_Time:28417.0min: [2024-01-03 00:53:38,284][model8_pretrain.py][INFO] Epoch:[0/2](69000/4588595) loss:3.184 lr:0.0000237 epoch_Time:28417.0min: [2024-01-03 00:53:38,284][model8_pretrain.py][INFO] Epoch:[0/2](69000/4588595) loss:3.462 lr:0.0000237 epoch_Time:28417.0min: [2024-01-03 00:54:15,213][model8_pretrain.py][INFO] Epoch:[0/2](69100/4588595) loss:3.392 lr:0.0000234 epoch_Time:28415.0min: [2024-01-03 00:54:15,213][model8_pretrain.py][INFO] Epoch:[0/2](69100/4588595) loss:3.218 lr:0.0000234 epoch_Time:28415.0min: [2024-01-03 00:54:15,213][model8_pretrain.py][INFO] Epoch:[0/2](69100/4588595) loss:3.324 lr:0.0000234 epoch_Time:28415.0min: [2024-01-03 00:54:15,213][model8_pretrain.py][INFO] Epoch:[0/2](69100/4588595) loss:3.013 lr:0.0000234 epoch_Time:28415.0min: [2024-01-03 00:54:15,213][model8_pretrain.py][INFO] Epoch:[0/2](69100/4588595) loss:3.443 lr:0.0000234 epoch_Time:28415.0min: [2024-01-03 
00:54:15,213][model8_pretrain.py][INFO] Epoch:[0/2](69100/4588595) loss:2.866 lr:0.0000234 epoch_Time:28415.0min: [2024-01-03 00:54:15,213][model8_pretrain.py][INFO] Epoch:[0/2](69100/4588595) loss:2.928 lr:0.0000234 epoch_Time:28416.0min: [2024-01-03 00:54:15,213][model8_pretrain.py][INFO] Epoch:[0/2](69100/4588595) loss:2.976 lr:0.0000234 epoch_Time:28415.0min: [2024-01-03 00:54:52,149][model8_pretrain.py][INFO] Epoch:[0/2](69200/4588595) loss:2.940 lr:0.0000232 epoch_Time:28414.0min: [2024-01-03 00:54:52,149][model8_pretrain.py][INFO] Epoch:[0/2](69200/4588595) loss:2.570 lr:0.0000232 epoch_Time:28414.0min: [2024-01-03 00:54:52,149][model8_pretrain.py][INFO] Epoch:[0/2](69200/4588595) loss:3.087 lr:0.0000232 epoch_Time:28414.0min: [2024-01-03 00:54:52,149][model8_pretrain.py][INFO] Epoch:[0/2](69200/4588595) loss:3.168 lr:0.0000232 epoch_Time:28414.0min: [2024-01-03 00:54:52,149][model8_pretrain.py][INFO] Epoch:[0/2](69200/4588595) loss:3.013 lr:0.0000232 epoch_Time:28414.0min: [2024-01-03 00:54:52,149][model8_pretrain.py][INFO] Epoch:[0/2](69200/4588595) loss:3.009 lr:0.0000232 epoch_Time:28414.0min: [2024-01-03 00:54:52,149][model8_pretrain.py][INFO] Epoch:[0/2](69200/4588595) loss:2.717 lr:0.0000232 epoch_Time:28414.0min: [2024-01-03 00:54:52,149][model8_pretrain.py][INFO] Epoch:[0/2](69200/4588595) loss:3.698 lr:0.0000232 epoch_Time:28414.0min: [2024-01-03 00:55:29,085][model8_pretrain.py][INFO] Epoch:[0/2](69300/4588595) loss:3.357 lr:0.0000229 epoch_Time:28413.0min: [2024-01-03 00:55:29,085][model8_pretrain.py][INFO] Epoch:[0/2](69300/4588595) loss:2.939 lr:0.0000229 epoch_Time:28413.0min: [2024-01-03 00:55:29,085][model8_pretrain.py][INFO] Epoch:[0/2](69300/4588595) loss:3.038 lr:0.0000229 epoch_Time:28413.0min: [2024-01-03 00:55:29,085][model8_pretrain.py][INFO] Epoch:[0/2](69300/4588595) loss:2.647 lr:0.0000229 epoch_Time:28413.0min: [2024-01-03 00:55:29,085][model8_pretrain.py][INFO] Epoch:[0/2](69300/4588595) loss:3.126 lr:0.0000229 
epoch_Time:28413.0min: [2024-01-03 00:55:29,085][model8_pretrain.py][INFO] Epoch:[0/2](69300/4588595) loss:2.928 lr:0.0000229 epoch_Time:28413.0min: [2024-01-03 00:55:29,085][model8_pretrain.py][INFO] Epoch:[0/2](69300/4588595) loss:2.383 lr:0.0000229 epoch_Time:28413.0min: [2024-01-03 00:55:29,085][model8_pretrain.py][INFO] Epoch:[0/2](69300/4588595) loss:2.786 lr:0.0000229 epoch_Time:28413.0min: [2024-01-03 00:56:05,983][model8_pretrain.py][INFO] Epoch:[0/2](69400/4588595) loss:3.281 lr:0.0000227 epoch_Time:28411.0min: [2024-01-03 00:56:05,983][model8_pretrain.py][INFO] Epoch:[0/2](69400/4588595) loss:2.731 lr:0.0000227 epoch_Time:28411.0min: [2024-01-03 00:56:05,983][model8_pretrain.py][INFO] Epoch:[0/2](69400/4588595) loss:2.614 lr:0.0000227 epoch_Time:28411.0min: [2024-01-03 00:56:05,983][model8_pretrain.py][INFO] Epoch:[0/2](69400/4588595) loss:3.020 lr:0.0000227 epoch_Time:28411.0min: [2024-01-03 00:56:05,983][model8_pretrain.py][INFO] Epoch:[0/2](69400/4588595) loss:2.673 lr:0.0000227 epoch_Time:28411.0min: [2024-01-03 00:56:05,983][model8_pretrain.py][INFO] Epoch:[0/2](69400/4588595) loss:3.242 lr:0.0000227 epoch_Time:28411.0min: [2024-01-03 00:56:05,983][model8_pretrain.py][INFO] Epoch:[0/2](69400/4588595) loss:3.508 lr:0.0000227 epoch_Time:28411.0min: [2024-01-03 00:56:05,983][model8_pretrain.py][INFO] Epoch:[0/2](69400/4588595) loss:3.221 lr:0.0000227 epoch_Time:28411.0min: [2024-01-03 00:56:42,915][model8_pretrain.py][INFO] Epoch:[0/2](69500/4588595) loss:3.371 lr:0.0000225 epoch_Time:28410.0min: [2024-01-03 00:56:42,915][model8_pretrain.py][INFO] Epoch:[0/2](69500/4588595) loss:3.432 lr:0.0000225 epoch_Time:28410.0min: [2024-01-03 00:56:42,915][model8_pretrain.py][INFO] Epoch:[0/2](69500/4588595) loss:3.137 lr:0.0000225 epoch_Time:28410.0min: [2024-01-03 00:56:42,915][model8_pretrain.py][INFO] Epoch:[0/2](69500/4588595) loss:3.261 lr:0.0000225 epoch_Time:28410.0min: [2024-01-03 00:56:42,915][model8_pretrain.py][INFO] Epoch:[0/2](69500/4588595) 
loss:2.624 lr:0.0000225 epoch_Time:28410.0min: [2024-01-03 00:56:42,915][model8_pretrain.py][INFO] Epoch:[0/2](69500/4588595) loss:3.060 lr:0.0000225 epoch_Time:28410.0min: [2024-01-03 00:56:42,915][model8_pretrain.py][INFO] Epoch:[0/2](69500/4588595) loss:2.637 lr:0.0000225 epoch_Time:28410.0min: [2024-01-03 00:56:42,916][model8_pretrain.py][INFO] Epoch:[0/2](69500/4588595) loss:3.254 lr:0.0000225 epoch_Time:28410.0min: [2024-01-03 00:57:26,818][model8_pretrain.py][INFO] Epoch:[0/2](69600/4588595) loss:3.294 lr:0.0000222 epoch_Time:28416.0min: [2024-01-03 00:57:26,818][model8_pretrain.py][INFO] Epoch:[0/2](69600/4588595) loss:3.628 lr:0.0000222 epoch_Time:28416.0min: [2024-01-03 00:57:26,819][model8_pretrain.py][INFO] Epoch:[0/2](69600/4588595) loss:2.862 lr:0.0000222 epoch_Time:28416.0min: [2024-01-03 00:57:26,819][model8_pretrain.py][INFO] Epoch:[0/2](69600/4588595) loss:3.200 lr:0.0000222 epoch_Time:28416.0min: [2024-01-03 00:57:26,818][model8_pretrain.py][INFO] Epoch:[0/2](69600/4588595) loss:3.090 lr:0.0000222 epoch_Time:28416.0min: [2024-01-03 00:57:26,819][model8_pretrain.py][INFO] Epoch:[0/2](69600/4588595) loss:2.751 lr:0.0000222 epoch_Time:28416.0min: [2024-01-03 00:57:26,819][model8_pretrain.py][INFO] Epoch:[0/2](69600/4588595) loss:3.267 lr:0.0000222 epoch_Time:28416.0min: [2024-01-03 00:57:26,819][model8_pretrain.py][INFO] Epoch:[0/2](69600/4588595) loss:2.997 lr:0.0000222 epoch_Time:28416.0min: [2024-01-03 00:58:03,751][model8_pretrain.py][INFO] Epoch:[0/2](69700/4588595) loss:3.115 lr:0.0000220 epoch_Time:28414.0min: [2024-01-03 00:58:03,751][model8_pretrain.py][INFO] Epoch:[0/2](69700/4588595) loss:3.084 lr:0.0000220 epoch_Time:28414.0min: [2024-01-03 00:58:03,751][model8_pretrain.py][INFO] Epoch:[0/2](69700/4588595) loss:2.790 lr:0.0000220 epoch_Time:28414.0min: [2024-01-03 00:58:03,751][model8_pretrain.py][INFO] Epoch:[0/2](69700/4588595) loss:3.031 lr:0.0000220 epoch_Time:28414.0min: [2024-01-03 00:58:03,751][model8_pretrain.py][INFO] 
Epoch:[0/2](69700/4588595) loss:3.229 lr:0.0000220 epoch_Time:28414.0min: [2024-01-03 00:58:03,752][model8_pretrain.py][INFO] Epoch:[0/2](69700/4588595) loss:3.059 lr:0.0000220 epoch_Time:28414.0min: [2024-01-03 00:58:03,752][model8_pretrain.py][INFO] Epoch:[0/2](69700/4588595) loss:3.577 lr:0.0000220 epoch_Time:28414.0min: [2024-01-03 00:58:03,752][model8_pretrain.py][INFO] Epoch:[0/2](69700/4588595) loss:3.132 lr:0.0000220 epoch_Time:28414.0min: [2024-01-03 00:58:40,697][model8_pretrain.py][INFO] Epoch:[0/2](69800/4588595) loss:2.796 lr:0.0000218 epoch_Time:28413.0min: [2024-01-03 00:58:40,697][model8_pretrain.py][INFO] Epoch:[0/2](69800/4588595) loss:3.064 lr:0.0000218 epoch_Time:28413.0min: [2024-01-03 00:58:40,697][model8_pretrain.py][INFO] Epoch:[0/2](69800/4588595) loss:3.206 lr:0.0000218 epoch_Time:28413.0min: [2024-01-03 00:58:40,697][model8_pretrain.py][INFO] Epoch:[0/2](69800/4588595) loss:2.727 lr:0.0000218 epoch_Time:28413.0min: [2024-01-03 00:58:40,697][model8_pretrain.py][INFO] Epoch:[0/2](69800/4588595) loss:3.145 lr:0.0000218 epoch_Time:28413.0min: [2024-01-03 00:58:40,697][model8_pretrain.py][INFO] Epoch:[0/2](69800/4588595) loss:3.240 lr:0.0000218 epoch_Time:28413.0min: [2024-01-03 00:58:40,697][model8_pretrain.py][INFO] Epoch:[0/2](69800/4588595) loss:3.033 lr:0.0000218 epoch_Time:28413.0min: [2024-01-03 00:58:40,698][model8_pretrain.py][INFO] Epoch:[0/2](69800/4588595) loss:3.066 lr:0.0000218 epoch_Time:28413.0min: [2024-01-03 00:59:17,638][model8_pretrain.py][INFO] Epoch:[0/2](69900/4588595) loss:2.195 lr:0.0000215 epoch_Time:28411.0min: [2024-01-03 00:59:17,639][model8_pretrain.py][INFO] Epoch:[0/2](69900/4588595) loss:2.822 lr:0.0000215 epoch_Time:28411.0min: [2024-01-03 00:59:17,639][model8_pretrain.py][INFO] Epoch:[0/2](69900/4588595) loss:2.724 lr:0.0000215 epoch_Time:28411.0min: [2024-01-03 00:59:17,639][model8_pretrain.py][INFO] Epoch:[0/2](69900/4588595) loss:3.062 lr:0.0000215 epoch_Time:28411.0min: [2024-01-03 
00:59:17,639][model8_pretrain.py][INFO] Epoch:[0/2](69900/4588595) loss:2.986 lr:0.0000215 epoch_Time:28411.0min: [2024-01-03 00:59:17,639][model8_pretrain.py][INFO] Epoch:[0/2](69900/4588595) loss:3.468 lr:0.0000215 epoch_Time:28411.0min: [2024-01-03 00:59:17,639][model8_pretrain.py][INFO] Epoch:[0/2](69900/4588595) loss:2.842 lr:0.0000215 epoch_Time:28411.0min: [2024-01-03 00:59:17,639][model8_pretrain.py][INFO] Epoch:[0/2](69900/4588595) loss:3.376 lr:0.0000215 epoch_Time:28411.0min: [2024-01-03 00:59:54,578][model8_pretrain.py][INFO] Epoch:[0/2](70000/4588595) loss:2.661 lr:0.0000213 epoch_Time:28409.0min: [2024-01-03 00:59:54,578][model8_pretrain.py][INFO] Epoch:[0/2](70000/4588595) loss:2.747 lr:0.0000213 epoch_Time:28409.0min: [2024-01-03 00:59:54,578][model8_pretrain.py][INFO] Epoch:[0/2](70000/4588595) loss:2.675 lr:0.0000213 epoch_Time:28409.0min: [2024-01-03 00:59:54,578][model8_pretrain.py][INFO] Epoch:[0/2](70000/4588595) loss:2.857 lr:0.0000213 epoch_Time:28409.0min: [2024-01-03 00:59:54,578][model8_pretrain.py][INFO] Epoch:[0/2](70000/4588595) loss:3.263 lr:0.0000213 epoch_Time:28409.0min: [2024-01-03 00:59:54,578][model8_pretrain.py][INFO] Epoch:[0/2](70000/4588595) loss:3.085 lr:0.0000213 epoch_Time:28409.0min: [2024-01-03 00:59:54,578][model8_pretrain.py][INFO] Epoch:[0/2](70000/4588595) loss:2.491 lr:0.0000213 epoch_Time:28409.0min: [2024-01-03 00:59:54,578][model8_pretrain.py][INFO] Epoch:[0/2](70000/4588595) loss:3.029 lr:0.0000213 epoch_Time:28409.0min: [2024-01-03 01:00:31,526][model8_pretrain.py][INFO] Epoch:[0/2](70100/4588595) loss:3.096 lr:0.0000211 epoch_Time:28408.0min: [2024-01-03 01:00:31,526][model8_pretrain.py][INFO] Epoch:[0/2](70100/4588595) loss:3.356 lr:0.0000211 epoch_Time:28408.0min: [2024-01-03 01:00:31,526][model8_pretrain.py][INFO] Epoch:[0/2](70100/4588595) loss:3.003 lr:0.0000211 epoch_Time:28408.0min: [2024-01-03 01:00:31,526][model8_pretrain.py][INFO] Epoch:[0/2](70100/4588595) loss:3.485 lr:0.0000211 
epoch_Time:28408.0min: [2024-01-03 01:00:31,526][model8_pretrain.py][INFO] Epoch:[0/2](70100/4588595) loss:3.139 lr:0.0000211 epoch_Time:28408.0min: [2024-01-03 01:00:31,526][model8_pretrain.py][INFO] Epoch:[0/2](70100/4588595) loss:2.161 lr:0.0000211 epoch_Time:28408.0min: [2024-01-03 01:00:31,526][model8_pretrain.py][INFO] Epoch:[0/2](70100/4588595) loss:3.180 lr:0.0000211 epoch_Time:28408.0min: [2024-01-03 01:00:31,526][model8_pretrain.py][INFO] Epoch:[0/2](70100/4588595) loss:2.965 lr:0.0000211 epoch_Time:28408.0min: [2024-01-03 01:01:08,456][model8_pretrain.py][INFO] Epoch:[0/2](70200/4588595) loss:3.099 lr:0.0000209 epoch_Time:28407.0min: [2024-01-03 01:01:08,456][model8_pretrain.py][INFO] Epoch:[0/2](70200/4588595) loss:2.865 lr:0.0000209 epoch_Time:28407.0min: [2024-01-03 01:01:08,456][model8_pretrain.py][INFO] Epoch:[0/2](70200/4588595) loss:3.114 lr:0.0000209 epoch_Time:28407.0min: [2024-01-03 01:01:08,456][model8_pretrain.py][INFO] Epoch:[0/2](70200/4588595) loss:3.342 lr:0.0000209 epoch_Time:28407.0min: [2024-01-03 01:01:08,456][model8_pretrain.py][INFO] Epoch:[0/2](70200/4588595) loss:2.793 lr:0.0000209 epoch_Time:28407.0min: [2024-01-03 01:01:08,456][model8_pretrain.py][INFO] Epoch:[0/2](70200/4588595) loss:3.258 lr:0.0000209 epoch_Time:28407.0min: [2024-01-03 01:01:08,456][model8_pretrain.py][INFO] Epoch:[0/2](70200/4588595) loss:2.669 lr:0.0000209 epoch_Time:28407.0min: [2024-01-03 01:01:08,456][model8_pretrain.py][INFO] Epoch:[0/2](70200/4588595) loss:3.527 lr:0.0000209 epoch_Time:28407.0min: [2024-01-03 01:01:45,389][model8_pretrain.py][INFO] Epoch:[0/2](70300/4588595) loss:3.284 lr:0.0000207 epoch_Time:28406.0min: [2024-01-03 01:01:45,389][model8_pretrain.py][INFO] Epoch:[0/2](70300/4588595) loss:3.067 lr:0.0000207 epoch_Time:28406.0min: [2024-01-03 01:01:45,389][model8_pretrain.py][INFO] Epoch:[0/2](70300/4588595) loss:3.271 lr:0.0000207 epoch_Time:28406.0min: [2024-01-03 01:01:45,389][model8_pretrain.py][INFO] Epoch:[0/2](70300/4588595) 
loss:3.028 lr:0.0000207 epoch_Time:28406.0min: [2024-01-03 01:01:45,389][model8_pretrain.py][INFO] Epoch:[0/2](70300/4588595) loss:3.133 lr:0.0000207 epoch_Time:28406.0min: [2024-01-03 01:01:45,389][model8_pretrain.py][INFO] Epoch:[0/2](70300/4588595) loss:2.556 lr:0.0000207 epoch_Time:28406.0min: [2024-01-03 01:01:45,389][model8_pretrain.py][INFO] Epoch:[0/2](70300/4588595) loss:3.119 lr:0.0000207 epoch_Time:28406.0min: [2024-01-03 01:01:45,389][model8_pretrain.py][INFO] Epoch:[0/2](70300/4588595) loss:2.969 lr:0.0000207 epoch_Time:28406.0min: [2024-01-03 01:02:29,467][model8_pretrain.py][INFO] Epoch:[0/2](70400/4588595) loss:3.119 lr:0.0000204 epoch_Time:28412.0min: [2024-01-03 01:02:29,467][model8_pretrain.py][INFO] Epoch:[0/2](70400/4588595) loss:2.940 lr:0.0000204 epoch_Time:28412.0min: [2024-01-03 01:02:29,467][model8_pretrain.py][INFO] Epoch:[0/2](70400/4588595) loss:3.006 lr:0.0000204 epoch_Time:28412.0min: [2024-01-03 01:02:29,467][model8_pretrain.py][INFO] Epoch:[0/2](70400/4588595) loss:2.239 lr:0.0000204 epoch_Time:28412.0min: [2024-01-03 01:02:29,468][model8_pretrain.py][INFO] Epoch:[0/2](70400/4588595) loss:2.820 lr:0.0000204 epoch_Time:28412.0min: [2024-01-03 01:02:29,467][model8_pretrain.py][INFO] Epoch:[0/2](70400/4588595) loss:3.430 lr:0.0000204 epoch_Time:28412.0min: [2024-01-03 01:02:29,468][model8_pretrain.py][INFO] Epoch:[0/2](70400/4588595) loss:3.180 lr:0.0000204 epoch_Time:28412.0min: [2024-01-03 01:02:29,468][model8_pretrain.py][INFO] Epoch:[0/2](70400/4588595) loss:3.089 lr:0.0000204 epoch_Time:28412.0min: [2024-01-03 01:03:06,398][model8_pretrain.py][INFO] Epoch:[0/2](70500/4588595) loss:3.012 lr:0.0000202 epoch_Time:28410.0min: [2024-01-03 01:03:06,398][model8_pretrain.py][INFO] Epoch:[0/2](70500/4588595) loss:3.374 lr:0.0000202 epoch_Time:28410.0min: [2024-01-03 01:03:06,398][model8_pretrain.py][INFO] Epoch:[0/2](70500/4588595) loss:3.615 lr:0.0000202 epoch_Time:28410.0min: [2024-01-03 01:03:06,398][model8_pretrain.py][INFO] 
Epoch:[0/2](70500/4588595) loss:3.058 lr:0.0000202 epoch_Time:28410.0min: [2024-01-03 01:03:06,398][model8_pretrain.py][INFO] Epoch:[0/2](70500/4588595) loss:2.835 lr:0.0000202 epoch_Time:28410.0min: [2024-01-03 01:03:06,398][model8_pretrain.py][INFO] Epoch:[0/2](70500/4588595) loss:3.548 lr:0.0000202 epoch_Time:28410.0min: [2024-01-03 01:03:06,398][model8_pretrain.py][INFO] Epoch:[0/2](70500/4588595) loss:3.089 lr:0.0000202 epoch_Time:28410.0min: [2024-01-03 01:03:06,398][model8_pretrain.py][INFO] Epoch:[0/2](70500/4588595) loss:3.590 lr:0.0000202 epoch_Time:28410.0min: [2024-01-03 01:03:43,325][model8_pretrain.py][INFO] Epoch:[0/2](70600/4588595) loss:2.856 lr:0.0000200 epoch_Time:28409.0min: [2024-01-03 01:03:43,325][model8_pretrain.py][INFO] Epoch:[0/2](70600/4588595) loss:2.901 lr:0.0000200 epoch_Time:28409.0min: [2024-01-03 01:03:43,325][model8_pretrain.py][INFO] Epoch:[0/2](70600/4588595) loss:2.807 lr:0.0000200 epoch_Time:28409.0min: [2024-01-03 01:03:43,325][model8_pretrain.py][INFO] Epoch:[0/2](70600/4588595) loss:2.965 lr:0.0000200 epoch_Time:28409.0min: [2024-01-03 01:03:43,325][model8_pretrain.py][INFO] Epoch:[0/2](70600/4588595) loss:2.903 lr:0.0000200 epoch_Time:28409.0min: [2024-01-03 01:03:43,325][model8_pretrain.py][INFO] Epoch:[0/2](70600/4588595) loss:2.841 lr:0.0000200 epoch_Time:28409.0min: [2024-01-03 01:03:43,325][model8_pretrain.py][INFO] Epoch:[0/2](70600/4588595) loss:2.915 lr:0.0000200 epoch_Time:28409.0min: [2024-01-03 01:03:43,325][model8_pretrain.py][INFO] Epoch:[0/2](70600/4588595) loss:3.274 lr:0.0000200 epoch_Time:28409.0min: [2024-01-03 01:04:20,258][model8_pretrain.py][INFO] Epoch:[0/2](70700/4588595) loss:3.709 lr:0.0000198 epoch_Time:28407.0min: [2024-01-03 01:04:20,258][model8_pretrain.py][INFO] Epoch:[0/2](70700/4588595) loss:3.070 lr:0.0000198 epoch_Time:28407.0min: [2024-01-03 01:04:20,258][model8_pretrain.py][INFO] Epoch:[0/2](70700/4588595) loss:2.853 lr:0.0000198 epoch_Time:28407.0min: [2024-01-03 
01:04:20,258][model8_pretrain.py][INFO] Epoch:[0/2](70700/4588595) loss:2.746 lr:0.0000198 epoch_Time:28407.0min: [2024-01-03 01:04:20,259][model8_pretrain.py][INFO] Epoch:[0/2](70700/4588595) loss:3.370 lr:0.0000198 epoch_Time:28407.0min: [2024-01-03 01:04:20,258][model8_pretrain.py][INFO] Epoch:[0/2](70700/4588595) loss:2.800 lr:0.0000198 epoch_Time:28407.0min: [2024-01-03 01:04:20,259][model8_pretrain.py][INFO] Epoch:[0/2](70700/4588595) loss:2.855 lr:0.0000198 epoch_Time:28407.0min: [2024-01-03 01:04:20,259][model8_pretrain.py][INFO] Epoch:[0/2](70700/4588595) loss:2.766 lr:0.0000198 epoch_Time:28407.0min: [2024-01-03 01:04:57,205][model8_pretrain.py][INFO] Epoch:[0/2](70800/4588595) loss:2.331 lr:0.0000196 epoch_Time:28405.0min: [2024-01-03 01:04:57,205][model8_pretrain.py][INFO] Epoch:[0/2](70800/4588595) loss:3.176 lr:0.0000196 epoch_Time:28405.0min: [2024-01-03 01:04:57,205][model8_pretrain.py][INFO] Epoch:[0/2](70800/4588595) loss:2.926 lr:0.0000196 epoch_Time:28405.0min: [2024-01-03 01:04:57,206][model8_pretrain.py][INFO] Epoch:[0/2](70800/4588595) loss:2.514 lr:0.0000196 epoch_Time:28405.0min: [2024-01-03 01:04:57,206][model8_pretrain.py][INFO] Epoch:[0/2](70800/4588595) loss:3.351 lr:0.0000196 epoch_Time:28405.0min: [2024-01-03 01:04:57,206][model8_pretrain.py][INFO] Epoch:[0/2](70800/4588595) loss:2.996 lr:0.0000196 epoch_Time:28405.0min: [2024-01-03 01:04:57,206][model8_pretrain.py][INFO] Epoch:[0/2](70800/4588595) loss:2.825 lr:0.0000196 epoch_Time:28405.0min: [2024-01-03 01:04:57,206][model8_pretrain.py][INFO] Epoch:[0/2](70800/4588595) loss:2.941 lr:0.0000196 epoch_Time:28405.0min: [2024-01-03 01:05:34,139][model8_pretrain.py][INFO] Epoch:[0/2](70900/4588595) loss:3.226 lr:0.0000194 epoch_Time:28404.0min: [2024-01-03 01:05:34,139][model8_pretrain.py][INFO] Epoch:[0/2](70900/4588595) loss:2.982 lr:0.0000194 epoch_Time:28404.0min: [2024-01-03 01:05:34,139][model8_pretrain.py][INFO] Epoch:[0/2](70900/4588595) loss:3.549 lr:0.0000194 
epoch_Time:28404.0min: [2024-01-03 01:05:34,139][model8_pretrain.py][INFO] Epoch:[0/2](70900/4588595) loss:3.041 lr:0.0000194 epoch_Time:28404.0min: [2024-01-03 01:05:34,139][model8_pretrain.py][INFO] Epoch:[0/2](70900/4588595) loss:2.402 lr:0.0000194 epoch_Time:28404.0min: [2024-01-03 01:05:34,139][model8_pretrain.py][INFO] Epoch:[0/2](70900/4588595) loss:3.058 lr:0.0000194 epoch_Time:28404.0min: [2024-01-03 01:05:34,139][model8_pretrain.py][INFO] Epoch:[0/2](70900/4588595) loss:3.143 lr:0.0000194 epoch_Time:28404.0min: [2024-01-03 01:05:34,139][model8_pretrain.py][INFO] Epoch:[0/2](70900/4588595) loss:3.104 lr:0.0000194 epoch_Time:28404.0min: [2024-01-03 01:06:11,081][model8_pretrain.py][INFO] Epoch:[0/2](71000/4588595) loss:2.922 lr:0.0000192 epoch_Time:28403.0min: [2024-01-03 01:06:11,081][model8_pretrain.py][INFO] Epoch:[0/2](71000/4588595) loss:2.852 lr:0.0000192 epoch_Time:28403.0min: [2024-01-03 01:06:11,081][model8_pretrain.py][INFO] Epoch:[0/2](71000/4588595) loss:2.993 lr:0.0000192 epoch_Time:28403.0min: [2024-01-03 01:06:11,081][model8_pretrain.py][INFO] Epoch:[0/2](71000/4588595) loss:2.827 lr:0.0000192 epoch_Time:28403.0min: [2024-01-03 01:06:11,081][model8_pretrain.py][INFO] Epoch:[0/2](71000/4588595) loss:2.656 lr:0.0000192 epoch_Time:28403.0min: [2024-01-03 01:06:11,081][model8_pretrain.py][INFO] Epoch:[0/2](71000/4588595) loss:3.082 lr:0.0000192 epoch_Time:28403.0min: [2024-01-03 01:06:11,081][model8_pretrain.py][INFO] Epoch:[0/2](71000/4588595) loss:2.949 lr:0.0000192 epoch_Time:28403.0min: [2024-01-03 01:06:11,081][model8_pretrain.py][INFO] Epoch:[0/2](71000/4588595) loss:3.113 lr:0.0000192 epoch_Time:28403.0min: [2024-01-03 01:06:48,024][model8_pretrain.py][INFO] Epoch:[0/2](71100/4588595) loss:3.394 lr:0.0000190 epoch_Time:28401.0min: [2024-01-03 01:06:48,024][model8_pretrain.py][INFO] Epoch:[0/2](71100/4588595) loss:2.635 lr:0.0000190 epoch_Time:28401.0min: [2024-01-03 01:06:48,024][model8_pretrain.py][INFO] Epoch:[0/2](71100/4588595) 
loss:2.080 lr:0.0000190 epoch_Time:28401.0min: [2024-01-03 01:06:48,024][model8_pretrain.py][INFO] Epoch:[0/2](71100/4588595) loss:3.066 lr:0.0000190 epoch_Time:28401.0min: [2024-01-03 01:06:48,024][model8_pretrain.py][INFO] Epoch:[0/2](71100/4588595) loss:3.128 lr:0.0000190 epoch_Time:28401.0min: [2024-01-03 01:06:48,024][model8_pretrain.py][INFO] Epoch:[0/2](71100/4588595) loss:2.660 lr:0.0000190 epoch_Time:28401.0min: [2024-01-03 01:06:48,024][model8_pretrain.py][INFO] Epoch:[0/2](71100/4588595) loss:3.039 lr:0.0000190 epoch_Time:28401.0min: [2024-01-03 01:06:48,024][model8_pretrain.py][INFO] Epoch:[0/2](71100/4588595) loss:3.305 lr:0.0000190 epoch_Time:28401.0min: [2024-01-03 01:07:32,007][model8_pretrain.py][INFO] Epoch:[0/2](71200/4588595) loss:3.003 lr:0.0000188 epoch_Time:28407.0min: [2024-01-03 01:07:32,007][model8_pretrain.py][INFO] Epoch:[0/2](71200/4588595) loss:3.285 lr:0.0000188 epoch_Time:28407.0min: [2024-01-03 01:07:32,007][model8_pretrain.py][INFO] Epoch:[0/2](71200/4588595) loss:3.262 lr:0.0000188 epoch_Time:28407.0min: [2024-01-03 01:07:32,007][model8_pretrain.py][INFO] Epoch:[0/2](71200/4588595) loss:2.983 lr:0.0000188 epoch_Time:28407.0min: [2024-01-03 01:07:32,007][model8_pretrain.py][INFO] Epoch:[0/2](71200/4588595) loss:2.897 lr:0.0000188 epoch_Time:28407.0min: [2024-01-03 01:07:32,007][model8_pretrain.py][INFO] Epoch:[0/2](71200/4588595) loss:3.571 lr:0.0000188 epoch_Time:28407.0min: [2024-01-03 01:07:32,007][model8_pretrain.py][INFO] Epoch:[0/2](71200/4588595) loss:2.709 lr:0.0000188 epoch_Time:28407.0min: [2024-01-03 01:07:32,007][model8_pretrain.py][INFO] Epoch:[0/2](71200/4588595) loss:3.591 lr:0.0000188 epoch_Time:28407.0min: [2024-01-03 01:08:08,930][model8_pretrain.py][INFO] Epoch:[0/2](71300/4588595) loss:2.855 lr:0.0000186 epoch_Time:28406.0min: [2024-01-03 01:08:08,930][model8_pretrain.py][INFO] Epoch:[0/2](71300/4588595) loss:2.611 lr:0.0000186 epoch_Time:28406.0min: [2024-01-03 01:08:08,930][model8_pretrain.py][INFO] 
Epoch:[0/2](71300/4588595) loss:3.105 lr:0.0000186 epoch_Time:28406.0min: [2024-01-03 01:08:08,930][model8_pretrain.py][INFO] Epoch:[0/2](71300/4588595) loss:2.542 lr:0.0000186 epoch_Time:28406.0min: [2024-01-03 01:08:08,930][model8_pretrain.py][INFO] Epoch:[0/2](71300/4588595) loss:2.859 lr:0.0000186 epoch_Time:28406.0min: [2024-01-03 01:08:08,930][model8_pretrain.py][INFO] Epoch:[0/2](71300/4588595) loss:2.648 lr:0.0000186 epoch_Time:28406.0min: [2024-01-03 01:08:08,930][model8_pretrain.py][INFO] Epoch:[0/2](71300/4588595) loss:2.965 lr:0.0000186 epoch_Time:28406.0min: [2024-01-03 01:08:08,930][model8_pretrain.py][INFO] Epoch:[0/2](71300/4588595) loss:3.198 lr:0.0000186 epoch_Time:28406.0min: [2024-01-03 01:08:45,877][model8_pretrain.py][INFO] Epoch:[0/2](71400/4588595) loss:2.770 lr:0.0000184 epoch_Time:28405.0min: [2024-01-03 01:08:45,877][model8_pretrain.py][INFO] Epoch:[0/2](71400/4588595) loss:2.993 lr:0.0000184 epoch_Time:28405.0min: [2024-01-03 01:08:45,877][model8_pretrain.py][INFO] Epoch:[0/2](71400/4588595) loss:2.501 lr:0.0000184 epoch_Time:28405.0min: [2024-01-03 01:08:45,877][model8_pretrain.py][INFO] Epoch:[0/2](71400/4588595) loss:3.138 lr:0.0000184 epoch_Time:28405.0min: [2024-01-03 01:08:45,877][model8_pretrain.py][INFO] Epoch:[0/2](71400/4588595) loss:3.408 lr:0.0000184 epoch_Time:28405.0min: [2024-01-03 01:08:45,877][model8_pretrain.py][INFO] Epoch:[0/2](71400/4588595) loss:2.836 lr:0.0000184 epoch_Time:28405.0min: [2024-01-03 01:08:45,877][model8_pretrain.py][INFO] Epoch:[0/2](71400/4588595) loss:3.165 lr:0.0000184 epoch_Time:28405.0min: [2024-01-03 01:08:45,877][model8_pretrain.py][INFO] Epoch:[0/2](71400/4588595) loss:2.560 lr:0.0000184 epoch_Time:28405.0min: [2024-01-03 01:09:22,834][model8_pretrain.py][INFO] Epoch:[0/2](71500/4588595) loss:2.792 lr:0.0000182 epoch_Time:28403.0min: [2024-01-03 01:09:22,834][model8_pretrain.py][INFO] Epoch:[0/2](71500/4588595) loss:3.137 lr:0.0000182 epoch_Time:28403.0min: [2024-01-03 
01:09:22,834][model8_pretrain.py][INFO] Epoch:[0/2](71500/4588595) loss:3.042 lr:0.0000182 epoch_Time:28403.0min: [2024-01-03 01:09:22,834][model8_pretrain.py][INFO] Epoch:[0/2](71500/4588595) loss:3.536 lr:0.0000182 epoch_Time:28403.0min: [2024-01-03 01:09:22,834][model8_pretrain.py][INFO] Epoch:[0/2](71500/4588595) loss:2.751 lr:0.0000182 epoch_Time:28403.0min: [2024-01-03 01:09:22,834][model8_pretrain.py][INFO] Epoch:[0/2](71500/4588595) loss:2.678 lr:0.0000182 epoch_Time:28403.0min: [2024-01-03 01:09:22,834][model8_pretrain.py][INFO] Epoch:[0/2](71500/4588595) loss:3.139 lr:0.0000182 epoch_Time:28403.0min: [2024-01-03 01:09:22,834][model8_pretrain.py][INFO] Epoch:[0/2](71500/4588595) loss:3.086 lr:0.0000182 epoch_Time:28403.0min: [2024-01-03 01:09:59,795][model8_pretrain.py][INFO] Epoch:[0/2](71600/4588595) loss:2.989 lr:0.0000180 epoch_Time:28401.0min: [2024-01-03 01:09:59,795][model8_pretrain.py][INFO] Epoch:[0/2](71600/4588595) loss:2.498 lr:0.0000180 epoch_Time:28401.0min: [2024-01-03 01:09:59,795][model8_pretrain.py][INFO] Epoch:[0/2](71600/4588595) loss:3.144 lr:0.0000180 epoch_Time:28401.0min: [2024-01-03 01:09:59,795][model8_pretrain.py][INFO] Epoch:[0/2](71600/4588595) loss:3.866 lr:0.0000180 epoch_Time:28401.0min: [2024-01-03 01:09:59,795][model8_pretrain.py][INFO] Epoch:[0/2](71600/4588595) loss:3.477 lr:0.0000180 epoch_Time:28401.0min: [2024-01-03 01:09:59,795][model8_pretrain.py][INFO] Epoch:[0/2](71600/4588595) loss:2.761 lr:0.0000180 epoch_Time:28401.0min: [2024-01-03 01:09:59,795][model8_pretrain.py][INFO] Epoch:[0/2](71600/4588595) loss:3.420 lr:0.0000180 epoch_Time:28401.0min: [2024-01-03 01:09:59,795][model8_pretrain.py][INFO] Epoch:[0/2](71600/4588595) loss:2.546 lr:0.0000180 epoch_Time:28401.0min: [2024-01-03 01:10:36,741][model8_pretrain.py][INFO] Epoch:[0/2](71700/4588595) loss:3.352 lr:0.0000178 epoch_Time:28400.0min: [2024-01-03 01:10:36,741][model8_pretrain.py][INFO] Epoch:[0/2](71700/4588595) loss:3.546 lr:0.0000178 
epoch_Time:28400.0min: [2024-01-03 01:10:36,741][model8_pretrain.py][INFO] Epoch:[0/2](71700/4588595) loss:3.766 lr:0.0000178 epoch_Time:28400.0min: [2024-01-03 01:10:36,741][model8_pretrain.py][INFO] Epoch:[0/2](71700/4588595) loss:3.650 lr:0.0000178 epoch_Time:28400.0min: [2024-01-03 01:10:36,741][model8_pretrain.py][INFO] Epoch:[0/2](71700/4588595) loss:3.161 lr:0.0000178 epoch_Time:28400.0min: [2024-01-03 01:10:36,741][model8_pretrain.py][INFO] Epoch:[0/2](71700/4588595) loss:2.665 lr:0.0000178 epoch_Time:28400.0min: [2024-01-03 01:10:36,741][model8_pretrain.py][INFO] Epoch:[0/2](71700/4588595) loss:2.867 lr:0.0000178 epoch_Time:28400.0min: [2024-01-03 01:10:36,742][model8_pretrain.py][INFO] Epoch:[0/2](71700/4588595) loss:3.629 lr:0.0000178 epoch_Time:28400.0min: [2024-01-03 01:11:13,684][model8_pretrain.py][INFO] Epoch:[0/2](71800/4588595) loss:2.547 lr:0.0000176 epoch_Time:28398.0min: [2024-01-03 01:11:13,684][model8_pretrain.py][INFO] Epoch:[0/2](71800/4588595) loss:3.153 lr:0.0000176 epoch_Time:28398.0min: [2024-01-03 01:11:13,684][model8_pretrain.py][INFO] Epoch:[0/2](71800/4588595) loss:3.209 lr:0.0000176 epoch_Time:28398.0min: [2024-01-03 01:11:13,684][model8_pretrain.py][INFO] Epoch:[0/2](71800/4588595) loss:3.116 lr:0.0000176 epoch_Time:28398.0min: [2024-01-03 01:11:13,684][model8_pretrain.py][INFO] Epoch:[0/2](71800/4588595) loss:3.469 lr:0.0000176 epoch_Time:28398.0min: [2024-01-03 01:11:13,684][model8_pretrain.py][INFO] Epoch:[0/2](71800/4588595) loss:3.577 lr:0.0000176 epoch_Time:28398.0min: [2024-01-03 01:11:13,684][model8_pretrain.py][INFO] Epoch:[0/2](71800/4588595) loss:3.134 lr:0.0000176 epoch_Time:28398.0min: [2024-01-03 01:11:13,685][model8_pretrain.py][INFO] Epoch:[0/2](71800/4588595) loss:3.188 lr:0.0000176 epoch_Time:28398.0min: [2024-01-03 01:11:50,633][model8_pretrain.py][INFO] Epoch:[0/2](71900/4588595) loss:2.959 lr:0.0000175 epoch_Time:28397.0min: [2024-01-03 01:11:50,633][model8_pretrain.py][INFO] Epoch:[0/2](71900/4588595) 
loss:3.425 lr:0.0000175 epoch_Time:28397.0min: [2024-01-03 01:11:50,633][model8_pretrain.py][INFO] Epoch:[0/2](71900/4588595) loss:3.014 lr:0.0000175 epoch_Time:28397.0min: [2024-01-03 01:11:50,633][model8_pretrain.py][INFO] Epoch:[0/2](71900/4588595) loss:3.178 lr:0.0000175 epoch_Time:28397.0min: [2024-01-03 01:11:50,633][model8_pretrain.py][INFO] Epoch:[0/2](71900/4588595) loss:2.958 lr:0.0000175 epoch_Time:28397.0min: [2024-01-03 01:11:50,633][model8_pretrain.py][INFO] Epoch:[0/2](71900/4588595) loss:2.694 lr:0.0000175 epoch_Time:28397.0min: [2024-01-03 01:11:50,633][model8_pretrain.py][INFO] Epoch:[0/2](71900/4588595) loss:3.019 lr:0.0000175 epoch_Time:28397.0min: [2024-01-03 01:11:50,633][model8_pretrain.py][INFO] Epoch:[0/2](71900/4588595) loss:3.180 lr:0.0000175 epoch_Time:28397.0min: [2024-01-03 01:12:34,649][model8_pretrain.py][INFO] Epoch:[0/2](72000/4588595) loss:3.375 lr:0.0000173 epoch_Time:28403.0min: [2024-01-03 01:12:34,649][model8_pretrain.py][INFO] Epoch:[0/2](72000/4588595) loss:2.765 lr:0.0000173 epoch_Time:28403.0min: [2024-01-03 01:12:34,649][model8_pretrain.py][INFO] Epoch:[0/2](72000/4588595) loss:3.459 lr:0.0000173 epoch_Time:28403.0min: [2024-01-03 01:12:34,649][model8_pretrain.py][INFO] Epoch:[0/2](72000/4588595) loss:3.249 lr:0.0000173 epoch_Time:28403.0min: [2024-01-03 01:12:34,649][model8_pretrain.py][INFO] Epoch:[0/2](72000/4588595) loss:2.855 lr:0.0000173 epoch_Time:28403.0min: [2024-01-03 01:12:34,649][model8_pretrain.py][INFO] Epoch:[0/2](72000/4588595) loss:3.247 lr:0.0000173 epoch_Time:28403.0min: [2024-01-03 01:12:34,649][model8_pretrain.py][INFO] Epoch:[0/2](72000/4588595) loss:2.752 lr:0.0000173 epoch_Time:28403.0min: [2024-01-03 01:12:34,649][model8_pretrain.py][INFO] Epoch:[0/2](72000/4588595) loss:2.797 lr:0.0000173 epoch_Time:28403.0min: [2024-01-03 01:13:11,587][model8_pretrain.py][INFO] Epoch:[0/2](72100/4588595) loss:3.322 lr:0.0000171 epoch_Time:28401.0min: [2024-01-03 01:13:11,587][model8_pretrain.py][INFO] 
Epoch:[0/2](72100/4588595) loss:3.684 lr:0.0000171 epoch_Time:28401.0min: [2024-01-03 01:13:11,587][model8_pretrain.py][INFO] Epoch:[0/2](72100/4588595) loss:3.066 lr:0.0000171 epoch_Time:28401.0min: [2024-01-03 01:13:11,587][model8_pretrain.py][INFO] Epoch:[0/2](72100/4588595) loss:2.678 lr:0.0000171 epoch_Time:28401.0min: [2024-01-03 01:13:11,587][model8_pretrain.py][INFO] Epoch:[0/2](72100/4588595) loss:2.961 lr:0.0000171 epoch_Time:28401.0min: [2024-01-03 01:13:11,587][model8_pretrain.py][INFO] Epoch:[0/2](72100/4588595) loss:2.479 lr:0.0000171 epoch_Time:28401.0min: [2024-01-03 01:13:11,587][model8_pretrain.py][INFO] Epoch:[0/2](72100/4588595) loss:3.125 lr:0.0000171 epoch_Time:28401.0min: [2024-01-03 01:13:11,587][model8_pretrain.py][INFO] Epoch:[0/2](72100/4588595) loss:3.085 lr:0.0000171 epoch_Time:28401.0min: [2024-01-03 01:13:48,530][model8_pretrain.py][INFO] Epoch:[0/2](72200/4588595) loss:3.192 lr:0.0000169 epoch_Time:28400.0min: [2024-01-03 01:13:48,530][model8_pretrain.py][INFO] Epoch:[0/2](72200/4588595) loss:3.081 lr:0.0000169 epoch_Time:28400.0min: [2024-01-03 01:13:48,530][model8_pretrain.py][INFO] Epoch:[0/2](72200/4588595) loss:3.201 lr:0.0000169 epoch_Time:28400.0min: [2024-01-03 01:13:48,530][model8_pretrain.py][INFO] Epoch:[0/2](72200/4588595) loss:2.766 lr:0.0000169 epoch_Time:28400.0min: [2024-01-03 01:13:48,530][model8_pretrain.py][INFO] Epoch:[0/2](72200/4588595) loss:3.262 lr:0.0000169 epoch_Time:28400.0min: [2024-01-03 01:13:48,530][model8_pretrain.py][INFO] Epoch:[0/2](72200/4588595) loss:3.113 lr:0.0000169 epoch_Time:28400.0min: [2024-01-03 01:13:48,530][model8_pretrain.py][INFO] Epoch:[0/2](72200/4588595) loss:3.242 lr:0.0000169 epoch_Time:28400.0min: [2024-01-03 01:13:48,530][model8_pretrain.py][INFO] Epoch:[0/2](72200/4588595) loss:3.062 lr:0.0000169 epoch_Time:28400.0min: [2024-01-03 01:14:25,465][model8_pretrain.py][INFO] Epoch:[0/2](72300/4588595) loss:2.869 lr:0.0000167 epoch_Time:28399.0min: [2024-01-03 
01:14:25,465][model8_pretrain.py][INFO] Epoch:[0/2](72300/4588595) loss:3.007 lr:0.0000167 epoch_Time:28399.0min: [2024-01-03 01:14:25,465][model8_pretrain.py][INFO] Epoch:[0/2](72300/4588595) loss:3.238 lr:0.0000167 epoch_Time:28399.0min: [2024-01-03 01:14:25,465][model8_pretrain.py][INFO] Epoch:[0/2](72300/4588595) loss:3.058 lr:0.0000167 epoch_Time:28399.0min: [2024-01-03 01:14:25,465][model8_pretrain.py][INFO] Epoch:[0/2](72300/4588595) loss:3.142 lr:0.0000167 epoch_Time:28399.0min: [2024-01-03 01:14:25,465][model8_pretrain.py][INFO] Epoch:[0/2](72300/4588595) loss:2.948 lr:0.0000167 epoch_Time:28399.0min: [2024-01-03 01:14:25,465][model8_pretrain.py][INFO] Epoch:[0/2](72300/4588595) loss:2.624 lr:0.0000167 epoch_Time:28399.0min: [2024-01-03 01:14:25,465][model8_pretrain.py][INFO] Epoch:[0/2](72300/4588595) loss:3.107 lr:0.0000167 epoch_Time:28399.0min: [2024-01-03 01:15:02,417][model8_pretrain.py][INFO] Epoch:[0/2](72400/4588595) loss:2.823 lr:0.0000166 epoch_Time:28397.0min: [2024-01-03 01:15:02,417][model8_pretrain.py][INFO] Epoch:[0/2](72400/4588595) loss:2.988 lr:0.0000166 epoch_Time:28397.0min: [2024-01-03 01:15:02,417][model8_pretrain.py][INFO] Epoch:[0/2](72400/4588595) loss:3.450 lr:0.0000166 epoch_Time:28397.0min: [2024-01-03 01:15:02,417][model8_pretrain.py][INFO] Epoch:[0/2](72400/4588595) loss:3.267 lr:0.0000166 epoch_Time:28397.0min: [2024-01-03 01:15:02,417][model8_pretrain.py][INFO] Epoch:[0/2](72400/4588595) loss:2.890 lr:0.0000166 epoch_Time:28397.0min: [2024-01-03 01:15:02,417][model8_pretrain.py][INFO] Epoch:[0/2](72400/4588595) loss:3.302 lr:0.0000166 epoch_Time:28397.0min: [2024-01-03 01:15:02,417][model8_pretrain.py][INFO] Epoch:[0/2](72400/4588595) loss:2.699 lr:0.0000166 epoch_Time:28397.0min: [2024-01-03 01:15:02,418][model8_pretrain.py][INFO] Epoch:[0/2](72400/4588595) loss:3.219 lr:0.0000166 epoch_Time:28397.0min: [2024-01-03 01:15:39,371][model8_pretrain.py][INFO] Epoch:[0/2](72500/4588595) loss:3.066 lr:0.0000164 
epoch_Time:28396.0min: [2024-01-03 01:15:39,371][model8_pretrain.py][INFO] Epoch:[0/2](72500/4588595) loss:3.220 lr:0.0000164 epoch_Time:28396.0min: [2024-01-03 01:15:39,371][model8_pretrain.py][INFO] Epoch:[0/2](72500/4588595) loss:3.291 lr:0.0000164 epoch_Time:28396.0min: [2024-01-03 01:15:39,371][model8_pretrain.py][INFO] Epoch:[0/2](72500/4588595) loss:3.289 lr:0.0000164 epoch_Time:28396.0min: [2024-01-03 01:15:39,371][model8_pretrain.py][INFO] Epoch:[0/2](72500/4588595) loss:3.002 lr:0.0000164 epoch_Time:28396.0min: [2024-01-03 01:15:39,371][model8_pretrain.py][INFO] Epoch:[0/2](72500/4588595) loss:2.981 lr:0.0000164 epoch_Time:28396.0min: [2024-01-03 01:15:39,371][model8_pretrain.py][INFO] Epoch:[0/2](72500/4588595) loss:3.230 lr:0.0000164 epoch_Time:28396.0min: [2024-01-03 01:15:39,371][model8_pretrain.py][INFO] Epoch:[0/2](72500/4588595) loss:3.280 lr:0.0000164 epoch_Time:28396.0min: [2024-01-03 01:16:16,281][model8_pretrain.py][INFO] Epoch:[0/2](72600/4588595) loss:2.495 lr:0.0000162 epoch_Time:28394.0min: [2024-01-03 01:16:16,281][model8_pretrain.py][INFO] Epoch:[0/2](72600/4588595) loss:2.604 lr:0.0000162 epoch_Time:28394.0min: [2024-01-03 01:16:16,281][model8_pretrain.py][INFO] Epoch:[0/2](72600/4588595) loss:3.465 lr:0.0000162 epoch_Time:28394.0min: [2024-01-03 01:16:16,281][model8_pretrain.py][INFO] Epoch:[0/2](72600/4588595) loss:3.458 lr:0.0000162 epoch_Time:28394.0min: [2024-01-03 01:16:16,281][model8_pretrain.py][INFO] Epoch:[0/2](72600/4588595) loss:2.948 lr:0.0000162 epoch_Time:28394.0min: [2024-01-03 01:16:16,281][model8_pretrain.py][INFO] Epoch:[0/2](72600/4588595) loss:3.297 lr:0.0000162 epoch_Time:28394.0min: [2024-01-03 01:16:16,281][model8_pretrain.py][INFO] Epoch:[0/2](72600/4588595) loss:3.230 lr:0.0000162 epoch_Time:28394.0min: [2024-01-03 01:16:16,281][model8_pretrain.py][INFO] Epoch:[0/2](72600/4588595) loss:3.225 lr:0.0000162 epoch_Time:28394.0min: [2024-01-03 01:16:53,229][model8_pretrain.py][INFO] Epoch:[0/2](72700/4588595) 
loss:2.764 lr:0.0000161 epoch_Time:28392.0min: [2024-01-03 01:16:53,229][model8_pretrain.py][INFO] Epoch:[0/2](72700/4588595) loss:3.346 lr:0.0000161 epoch_Time:28392.0min: [2024-01-03 01:16:53,229][model8_pretrain.py][INFO] Epoch:[0/2](72700/4588595) loss:3.203 lr:0.0000161 epoch_Time:28392.0min: [2024-01-03 01:16:53,229][model8_pretrain.py][INFO] Epoch:[0/2](72700/4588595) loss:3.171 lr:0.0000161 epoch_Time:28392.0min: [2024-01-03 01:16:53,229][model8_pretrain.py][INFO] Epoch:[0/2](72700/4588595) loss:3.019 lr:0.0000161 epoch_Time:28392.0min: [2024-01-03 01:16:53,229][model8_pretrain.py][INFO] Epoch:[0/2](72700/4588595) loss:3.493 lr:0.0000161 epoch_Time:28392.0min: [2024-01-03 01:16:53,229][model8_pretrain.py][INFO] Epoch:[0/2](72700/4588595) loss:3.066 lr:0.0000161 epoch_Time:28392.0min: [2024-01-03 01:16:53,229][model8_pretrain.py][INFO] Epoch:[0/2](72700/4588595) loss:3.012 lr:0.0000161 epoch_Time:28392.0min: [2024-01-03 01:17:37,204][model8_pretrain.py][INFO] Epoch:[0/2](72800/4588595) loss:3.335 lr:0.0000159 epoch_Time:28399.0min: [2024-01-03 01:17:37,204][model8_pretrain.py][INFO] Epoch:[0/2](72800/4588595) loss:3.047 lr:0.0000159 epoch_Time:28399.0min: [2024-01-03 01:17:37,204][model8_pretrain.py][INFO] Epoch:[0/2](72800/4588595) loss:3.111 lr:0.0000159 epoch_Time:28399.0min: [2024-01-03 01:17:37,204][model8_pretrain.py][INFO] Epoch:[0/2](72800/4588595) loss:2.723 lr:0.0000159 epoch_Time:28399.0min: [2024-01-03 01:17:37,204][model8_pretrain.py][INFO] Epoch:[0/2](72800/4588595) loss:3.222 lr:0.0000159 epoch_Time:28399.0min: [2024-01-03 01:17:37,204][model8_pretrain.py][INFO] Epoch:[0/2](72800/4588595) loss:2.713 lr:0.0000159 epoch_Time:28399.0min: [2024-01-03 01:17:37,204][model8_pretrain.py][INFO] Epoch:[0/2](72800/4588595) loss:3.145 lr:0.0000159 epoch_Time:28399.0min: [2024-01-03 01:17:37,205][model8_pretrain.py][INFO] Epoch:[0/2](72800/4588595) loss:3.205 lr:0.0000159 epoch_Time:28399.0min: [2024-01-03 01:18:14,131][model8_pretrain.py][INFO] 
Epoch:[0/2](72900/4588595) loss:3.484 lr:0.0000157 epoch_Time:28397.0min: [2024-01-03 01:18:14,131][model8_pretrain.py][INFO] Epoch:[0/2](72900/4588595) loss:2.454 lr:0.0000157 epoch_Time:28397.0min: [2024-01-03 01:18:14,131][model8_pretrain.py][INFO] Epoch:[0/2](72900/4588595) loss:2.919 lr:0.0000157 epoch_Time:28397.0min: [2024-01-03 01:18:14,131][model8_pretrain.py][INFO] Epoch:[0/2](72900/4588595) loss:3.036 lr:0.0000157 epoch_Time:28397.0min: [2024-01-03 01:18:14,131][model8_pretrain.py][INFO] Epoch:[0/2](72900/4588595) loss:3.035 lr:0.0000157 epoch_Time:28397.0min: [2024-01-03 01:18:14,131][model8_pretrain.py][INFO] Epoch:[0/2](72900/4588595) loss:2.929 lr:0.0000157 epoch_Time:28397.0min: [2024-01-03 01:18:14,131][model8_pretrain.py][INFO] Epoch:[0/2](72900/4588595) loss:2.904 lr:0.0000157 epoch_Time:28397.0min: [2024-01-03 01:18:14,131][model8_pretrain.py][INFO] Epoch:[0/2](72900/4588595) loss:3.544 lr:0.0000157 epoch_Time:28397.0min: [2024-01-03 01:18:51,068][model8_pretrain.py][INFO] Epoch:[0/2](73000/4588595) loss:2.935 lr:0.0000156 epoch_Time:28395.0min: [2024-01-03 01:18:51,068][model8_pretrain.py][INFO] Epoch:[0/2](73000/4588595) loss:2.964 lr:0.0000156 epoch_Time:28395.0min: [2024-01-03 01:18:51,068][model8_pretrain.py][INFO] Epoch:[0/2](73000/4588595) loss:3.316 lr:0.0000156 epoch_Time:28395.0min: [2024-01-03 01:18:51,068][model8_pretrain.py][INFO] Epoch:[0/2](73000/4588595) loss:3.143 lr:0.0000156 epoch_Time:28395.0min: [2024-01-03 01:18:51,068][model8_pretrain.py][INFO] Epoch:[0/2](73000/4588595) loss:3.311 lr:0.0000156 epoch_Time:28395.0min: [2024-01-03 01:18:51,068][model8_pretrain.py][INFO] Epoch:[0/2](73000/4588595) loss:3.528 lr:0.0000156 epoch_Time:28395.0min: [2024-01-03 01:18:51,068][model8_pretrain.py][INFO] Epoch:[0/2](73000/4588595) loss:2.659 lr:0.0000156 epoch_Time:28395.0min: [2024-01-03 01:18:51,068][model8_pretrain.py][INFO] Epoch:[0/2](73000/4588595) loss:3.168 lr:0.0000156 epoch_Time:28395.0min: [2024-01-03 
01:19:28,008][model8_pretrain.py][INFO] Epoch:[0/2](73100/4588595) loss:2.756 lr:0.0000154 epoch_Time:28394.0min: [2024-01-03 01:19:28,008][model8_pretrain.py][INFO] Epoch:[0/2](73100/4588595) loss:2.529 lr:0.0000154 epoch_Time:28394.0min: [2024-01-03 01:19:28,008][model8_pretrain.py][INFO] Epoch:[0/2](73100/4588595) loss:2.906 lr:0.0000154 epoch_Time:28394.0min: [2024-01-03 01:19:28,008][model8_pretrain.py][INFO] Epoch:[0/2](73100/4588595) loss:3.406 lr:0.0000154 epoch_Time:28394.0min: [2024-01-03 01:19:28,009][model8_pretrain.py][INFO] Epoch:[0/2](73100/4588595) loss:2.663 lr:0.0000154 epoch_Time:28395.0min: [2024-01-03 01:19:28,009][model8_pretrain.py][INFO] Epoch:[0/2](73100/4588595) loss:3.014 lr:0.0000154 epoch_Time:28394.0min: [2024-01-03 01:19:28,009][model8_pretrain.py][INFO] Epoch:[0/2](73100/4588595) loss:3.258 lr:0.0000154 epoch_Time:28394.0min: [2024-01-03 01:19:28,010][model8_pretrain.py][INFO] Epoch:[0/2](73100/4588595) loss:1.983 lr:0.0000154 epoch_Time:28395.0min: [2024-01-03 01:20:04,947][model8_pretrain.py][INFO] Epoch:[0/2](73200/4588595) loss:3.388 lr:0.0000153 epoch_Time:28393.0min: [2024-01-03 01:20:04,947][model8_pretrain.py][INFO] Epoch:[0/2](73200/4588595) loss:3.120 lr:0.0000153 epoch_Time:28393.0min: [2024-01-03 01:20:04,947][model8_pretrain.py][INFO] Epoch:[0/2](73200/4588595) loss:3.024 lr:0.0000153 epoch_Time:28393.0min: [2024-01-03 01:20:04,947][model8_pretrain.py][INFO] Epoch:[0/2](73200/4588595) loss:3.187 lr:0.0000153 epoch_Time:28393.0min: [2024-01-03 01:20:04,947][model8_pretrain.py][INFO] Epoch:[0/2](73200/4588595) loss:3.013 lr:0.0000153 epoch_Time:28393.0min: [2024-01-03 01:20:04,947][model8_pretrain.py][INFO] Epoch:[0/2](73200/4588595) loss:3.361 lr:0.0000153 epoch_Time:28393.0min: [2024-01-03 01:20:04,948][model8_pretrain.py][INFO] Epoch:[0/2](73200/4588595) loss:3.203 lr:0.0000153 epoch_Time:28393.0min: [2024-01-03 01:20:04,948][model8_pretrain.py][INFO] Epoch:[0/2](73200/4588595) loss:2.810 lr:0.0000153 
epoch_Time:28393.0min: [2024-01-03 01:20:41,876][model8_pretrain.py][INFO] Epoch:[0/2](73300/4588595) loss:3.298 lr:0.0000151 epoch_Time:28392.0min: [2024-01-03 01:20:41,876][model8_pretrain.py][INFO] Epoch:[0/2](73300/4588595) loss:2.872 lr:0.0000151 epoch_Time:28392.0min: [2024-01-03 01:20:41,876][model8_pretrain.py][INFO] Epoch:[0/2](73300/4588595) loss:3.390 lr:0.0000151 epoch_Time:28392.0min: [2024-01-03 01:20:41,876][model8_pretrain.py][INFO] Epoch:[0/2](73300/4588595) loss:2.920 lr:0.0000151 epoch_Time:28392.0min: [2024-01-03 01:20:41,876][model8_pretrain.py][INFO] Epoch:[0/2](73300/4588595) loss:3.187 lr:0.0000151 epoch_Time:28392.0min: [2024-01-03 01:20:41,876][model8_pretrain.py][INFO] Epoch:[0/2](73300/4588595) loss:2.696 lr:0.0000151 epoch_Time:28392.0min: [2024-01-03 01:20:41,876][model8_pretrain.py][INFO] Epoch:[0/2](73300/4588595) loss:3.250 lr:0.0000151 epoch_Time:28392.0min: [2024-01-03 01:20:41,876][model8_pretrain.py][INFO] Epoch:[0/2](73300/4588595) loss:3.197 lr:0.0000151 epoch_Time:28392.0min: [2024-01-03 01:21:18,810][model8_pretrain.py][INFO] Epoch:[0/2](73400/4588595) loss:2.559 lr:0.0000150 epoch_Time:28390.0min: [2024-01-03 01:21:18,810][model8_pretrain.py][INFO] Epoch:[0/2](73400/4588595) loss:3.143 lr:0.0000150 epoch_Time:28390.0min: [2024-01-03 01:21:18,810][model8_pretrain.py][INFO] Epoch:[0/2](73400/4588595) loss:2.928 lr:0.0000150 epoch_Time:28390.0min: [2024-01-03 01:21:18,810][model8_pretrain.py][INFO] Epoch:[0/2](73400/4588595) loss:3.102 lr:0.0000150 epoch_Time:28390.0min: [2024-01-03 01:21:18,810][model8_pretrain.py][INFO] Epoch:[0/2](73400/4588595) loss:2.996 lr:0.0000150 epoch_Time:28390.0min: [2024-01-03 01:21:18,810][model8_pretrain.py][INFO] Epoch:[0/2](73400/4588595) loss:3.071 lr:0.0000150 epoch_Time:28390.0min: [2024-01-03 01:21:18,811][model8_pretrain.py][INFO] Epoch:[0/2](73400/4588595) loss:3.420 lr:0.0000150 epoch_Time:28390.0min: [2024-01-03 01:21:18,811][model8_pretrain.py][INFO] Epoch:[0/2](73400/4588595) 
loss:3.635 lr:0.0000150 epoch_Time:28390.0min: [2024-01-03 01:21:55,745][model8_pretrain.py][INFO] Epoch:[0/2](73500/4588595) loss:3.203 lr:0.0000148 epoch_Time:28388.0min: [2024-01-03 01:21:55,745][model8_pretrain.py][INFO] Epoch:[0/2](73500/4588595) loss:2.981 lr:0.0000148 epoch_Time:28388.0min: [2024-01-03 01:21:55,745][model8_pretrain.py][INFO] Epoch:[0/2](73500/4588595) loss:2.881 lr:0.0000148 epoch_Time:28388.0min: [2024-01-03 01:21:55,745][model8_pretrain.py][INFO] Epoch:[0/2](73500/4588595) loss:2.800 lr:0.0000148 epoch_Time:28388.0min: [2024-01-03 01:21:55,745][model8_pretrain.py][INFO] Epoch:[0/2](73500/4588595) loss:2.232 lr:0.0000148 epoch_Time:28388.0min: [2024-01-03 01:21:55,746][model8_pretrain.py][INFO] Epoch:[0/2](73500/4588595) loss:2.571 lr:0.0000148 epoch_Time:28388.0min: [2024-01-03 01:21:55,746][model8_pretrain.py][INFO] Epoch:[0/2](73500/4588595) loss:2.721 lr:0.0000148 epoch_Time:28388.0min: [2024-01-03 01:21:55,746][model8_pretrain.py][INFO] Epoch:[0/2](73500/4588595) loss:2.704 lr:0.0000148 epoch_Time:28388.0min: [2024-01-03 01:22:39,648][model8_pretrain.py][INFO] Epoch:[0/2](73600/4588595) loss:2.357 lr:0.0000147 epoch_Time:28395.0min: [2024-01-03 01:22:39,648][model8_pretrain.py][INFO] Epoch:[0/2](73600/4588595) loss:3.092 lr:0.0000147 epoch_Time:28395.0min: [2024-01-03 01:22:39,648][model8_pretrain.py][INFO] Epoch:[0/2](73600/4588595) loss:3.184 lr:0.0000147 epoch_Time:28395.0min: [2024-01-03 01:22:39,648][model8_pretrain.py][INFO] Epoch:[0/2](73600/4588595) loss:3.330 lr:0.0000147 epoch_Time:28395.0min: [2024-01-03 01:22:39,648][model8_pretrain.py][INFO] Epoch:[0/2](73600/4588595) loss:2.945 lr:0.0000147 epoch_Time:28395.0min: [2024-01-03 01:22:39,648][model8_pretrain.py][INFO] Epoch:[0/2](73600/4588595) loss:3.039 lr:0.0000147 epoch_Time:28395.0min: [2024-01-03 01:22:39,649][model8_pretrain.py][INFO] Epoch:[0/2](73600/4588595) loss:3.115 lr:0.0000147 epoch_Time:28395.0min: [2024-01-03 01:22:39,649][model8_pretrain.py][INFO] 
Epoch:[0/2](73600/4588595) loss:3.007 lr:0.0000147 epoch_Time:28395.0min: [2024-01-03 01:23:16,588][model8_pretrain.py][INFO] Epoch:[0/2](73700/4588595) loss:2.642 lr:0.0000145 epoch_Time:28393.0min: [2024-01-03 01:23:16,588][model8_pretrain.py][INFO] Epoch:[0/2](73700/4588595) loss:3.304 lr:0.0000145 epoch_Time:28393.0min: [2024-01-03 01:23:16,588][model8_pretrain.py][INFO] Epoch:[0/2](73700/4588595) loss:3.097 lr:0.0000145 epoch_Time:28393.0min: [2024-01-03 01:23:16,588][model8_pretrain.py][INFO] Epoch:[0/2](73700/4588595) loss:2.580 lr:0.0000145 epoch_Time:28393.0min: [2024-01-03 01:23:16,588][model8_pretrain.py][INFO] Epoch:[0/2](73700/4588595) loss:2.723 lr:0.0000145 epoch_Time:28393.0min: [2024-01-03 01:23:16,589][model8_pretrain.py][INFO] Epoch:[0/2](73700/4588595) loss:2.910 lr:0.0000145 epoch_Time:28393.0min: [2024-01-03 01:23:16,589][model8_pretrain.py][INFO] Epoch:[0/2](73700/4588595) loss:3.317 lr:0.0000145 epoch_Time:28393.0min: [2024-01-03 01:23:16,589][model8_pretrain.py][INFO] Epoch:[0/2](73700/4588595) loss:3.225 lr:0.0000145 epoch_Time:28393.0min: [2024-01-03 01:23:53,528][model8_pretrain.py][INFO] Epoch:[0/2](73800/4588595) loss:2.856 lr:0.0000144 epoch_Time:28391.0min: [2024-01-03 01:23:53,528][model8_pretrain.py][INFO] Epoch:[0/2](73800/4588595) loss:3.154 lr:0.0000144 epoch_Time:28391.0min: [2024-01-03 01:23:53,528][model8_pretrain.py][INFO] Epoch:[0/2](73800/4588595) loss:3.139 lr:0.0000144 epoch_Time:28391.0min: [2024-01-03 01:23:53,528][model8_pretrain.py][INFO] Epoch:[0/2](73800/4588595) loss:3.227 lr:0.0000144 epoch_Time:28391.0min: [2024-01-03 01:23:53,528][model8_pretrain.py][INFO] Epoch:[0/2](73800/4588595) loss:3.284 lr:0.0000144 epoch_Time:28391.0min: [2024-01-03 01:23:53,528][model8_pretrain.py][INFO] Epoch:[0/2](73800/4588595) loss:2.704 lr:0.0000144 epoch_Time:28391.0min: [2024-01-03 01:23:53,528][model8_pretrain.py][INFO] Epoch:[0/2](73800/4588595) loss:2.928 lr:0.0000144 epoch_Time:28391.0min: [2024-01-03 
01:23:53,529][model8_pretrain.py][INFO] Epoch:[0/2](73800/4588595) loss:2.802 lr:0.0000144 epoch_Time:28391.0min: [2024-01-03 01:24:30,461][model8_pretrain.py][INFO] Epoch:[0/2](73900/4588595) loss:3.181 lr:0.0000142 epoch_Time:28390.0min: [2024-01-03 01:24:30,461][model8_pretrain.py][INFO] Epoch:[0/2](73900/4588595) loss:2.589 lr:0.0000142 epoch_Time:28390.0min: [2024-01-03 01:24:30,461][model8_pretrain.py][INFO] Epoch:[0/2](73900/4588595) loss:3.518 lr:0.0000142 epoch_Time:28390.0min: [2024-01-03 01:24:30,461][model8_pretrain.py][INFO] Epoch:[0/2](73900/4588595) loss:2.727 lr:0.0000142 epoch_Time:28390.0min: [2024-01-03 01:24:30,461][model8_pretrain.py][INFO] Epoch:[0/2](73900/4588595) loss:3.167 lr:0.0000142 epoch_Time:28390.0min: [2024-01-03 01:24:30,461][model8_pretrain.py][INFO] Epoch:[0/2](73900/4588595) loss:3.666 lr:0.0000142 epoch_Time:28390.0min: [2024-01-03 01:24:30,461][model8_pretrain.py][INFO] Epoch:[0/2](73900/4588595) loss:2.719 lr:0.0000142 epoch_Time:28390.0min: [2024-01-03 01:24:30,461][model8_pretrain.py][INFO] Epoch:[0/2](73900/4588595) loss:3.344 lr:0.0000142 epoch_Time:28390.0min: [2024-01-03 01:25:07,402][model8_pretrain.py][INFO] Epoch:[0/2](74000/4588595) loss:3.042 lr:0.0000141 epoch_Time:28388.0min: [2024-01-03 01:25:07,402][model8_pretrain.py][INFO] Epoch:[0/2](74000/4588595) loss:3.137 lr:0.0000141 epoch_Time:28388.0min: [2024-01-03 01:25:07,402][model8_pretrain.py][INFO] Epoch:[0/2](74000/4588595) loss:2.978 lr:0.0000141 epoch_Time:28388.0min: [2024-01-03 01:25:07,402][model8_pretrain.py][INFO] Epoch:[0/2](74000/4588595) loss:2.368 lr:0.0000141 epoch_Time:28388.0min: [2024-01-03 01:25:07,402][model8_pretrain.py][INFO] Epoch:[0/2](74000/4588595) loss:3.518 lr:0.0000141 epoch_Time:28388.0min: [2024-01-03 01:25:07,402][model8_pretrain.py][INFO] Epoch:[0/2](74000/4588595) loss:3.030 lr:0.0000141 epoch_Time:28388.0min: [2024-01-03 01:25:07,402][model8_pretrain.py][INFO] Epoch:[0/2](74000/4588595) loss:2.900 lr:0.0000141 
epoch_Time:28388.0min: [2024-01-03 01:25:07,402][model8_pretrain.py][INFO] Epoch:[0/2](74000/4588595) loss:3.227 lr:0.0000141 epoch_Time:28388.0min: [2024-01-03 01:25:44,331][model8_pretrain.py][INFO] Epoch:[0/2](74100/4588595) loss:3.437 lr:0.0000140 epoch_Time:28387.0min: [2024-01-03 01:25:44,331][model8_pretrain.py][INFO] Epoch:[0/2](74100/4588595) loss:2.874 lr:0.0000140 epoch_Time:28388.0min: [2024-01-03 01:25:44,331][model8_pretrain.py][INFO] Epoch:[0/2](74100/4588595) loss:3.228 lr:0.0000140 epoch_Time:28387.0min: [2024-01-03 01:25:44,331][model8_pretrain.py][INFO] Epoch:[0/2](74100/4588595) loss:2.622 lr:0.0000140 epoch_Time:28387.0min: [2024-01-03 01:25:44,331][model8_pretrain.py][INFO] Epoch:[0/2](74100/4588595) loss:3.332 lr:0.0000140 epoch_Time:28388.0min: [2024-01-03 01:25:44,331][model8_pretrain.py][INFO] Epoch:[0/2](74100/4588595) loss:3.001 lr:0.0000140 epoch_Time:28387.0min: [2024-01-03 01:25:44,331][model8_pretrain.py][INFO] Epoch:[0/2](74100/4588595) loss:3.193 lr:0.0000140 epoch_Time:28387.0min: [2024-01-03 01:25:44,331][model8_pretrain.py][INFO] Epoch:[0/2](74100/4588595) loss:2.917 lr:0.0000140 epoch_Time:28387.0min: [2024-01-03 01:26:21,270][model8_pretrain.py][INFO] Epoch:[0/2](74200/4588595) loss:2.647 lr:0.0000138 epoch_Time:28386.0min: [2024-01-03 01:26:21,270][model8_pretrain.py][INFO] Epoch:[0/2](74200/4588595) loss:2.965 lr:0.0000138 epoch_Time:28386.0min: [2024-01-03 01:26:21,270][model8_pretrain.py][INFO] Epoch:[0/2](74200/4588595) loss:2.946 lr:0.0000138 epoch_Time:28386.0min: [2024-01-03 01:26:21,270][model8_pretrain.py][INFO] Epoch:[0/2](74200/4588595) loss:2.540 lr:0.0000138 epoch_Time:28386.0min: [2024-01-03 01:26:21,270][model8_pretrain.py][INFO] Epoch:[0/2](74200/4588595) loss:3.362 lr:0.0000138 epoch_Time:28386.0min: [2024-01-03 01:26:21,270][model8_pretrain.py][INFO] Epoch:[0/2](74200/4588595) loss:2.848 lr:0.0000138 epoch_Time:28386.0min: [2024-01-03 01:26:21,270][model8_pretrain.py][INFO] Epoch:[0/2](74200/4588595) 
loss:3.191 lr:0.0000138 epoch_Time:28386.0min: [2024-01-03 01:26:21,270][model8_pretrain.py][INFO] Epoch:[0/2](74200/4588595) loss:3.425 lr:0.0000138 epoch_Time:28386.0min: [2024-01-03 01:26:58,209][model8_pretrain.py][INFO] Epoch:[0/2](74300/4588595) loss:2.918 lr:0.0000137 epoch_Time:28384.0min: [2024-01-03 01:26:58,209][model8_pretrain.py][INFO] Epoch:[0/2](74300/4588595) loss:2.258 lr:0.0000137 epoch_Time:28384.0min: [2024-01-03 01:26:58,209][model8_pretrain.py][INFO] Epoch:[0/2](74300/4588595) loss:3.290 lr:0.0000137 epoch_Time:28384.0min: [2024-01-03 01:26:58,209][model8_pretrain.py][INFO] Epoch:[0/2](74300/4588595) loss:3.052 lr:0.0000137 epoch_Time:28384.0min: [2024-01-03 01:26:58,209][model8_pretrain.py][INFO] Epoch:[0/2](74300/4588595) loss:2.887 lr:0.0000137 epoch_Time:28384.0min: [2024-01-03 01:26:58,209][model8_pretrain.py][INFO] Epoch:[0/2](74300/4588595) loss:3.206 lr:0.0000137 epoch_Time:28384.0min: [2024-01-03 01:26:58,209][model8_pretrain.py][INFO] Epoch:[0/2](74300/4588595) loss:3.046 lr:0.0000137 epoch_Time:28384.0min: [2024-01-03 01:26:58,209][model8_pretrain.py][INFO] Epoch:[0/2](74300/4588595) loss:3.025 lr:0.0000137 epoch_Time:28384.0min: [2024-01-03 01:27:42,870][model8_pretrain.py][INFO] Epoch:[0/2](74400/4588595) loss:3.500 lr:0.0000136 epoch_Time:28391.0min: [2024-01-03 01:27:42,870][model8_pretrain.py][INFO] Epoch:[0/2](74400/4588595) loss:3.241 lr:0.0000136 epoch_Time:28391.0min: [2024-01-03 01:27:42,870][model8_pretrain.py][INFO] Epoch:[0/2](74400/4588595) loss:3.292 lr:0.0000136 epoch_Time:28391.0min: [2024-01-03 01:27:42,870][model8_pretrain.py][INFO] Epoch:[0/2](74400/4588595) loss:3.051 lr:0.0000136 epoch_Time:28391.0min: [2024-01-03 01:27:42,870][model8_pretrain.py][INFO] Epoch:[0/2](74400/4588595) loss:2.810 lr:0.0000136 epoch_Time:28391.0min: [2024-01-03 01:27:42,870][model8_pretrain.py][INFO] Epoch:[0/2](74400/4588595) loss:3.192 lr:0.0000136 epoch_Time:28391.0min: [2024-01-03 01:27:42,870][model8_pretrain.py][INFO] 
Epoch:[0/2](74400/4588595) loss:2.428 lr:0.0000136 epoch_Time:28391.0min: [2024-01-03 01:27:42,871][model8_pretrain.py][INFO] Epoch:[0/2](74400/4588595) loss:2.925 lr:0.0000136 epoch_Time:28391.0min: [2024-01-03 01:28:19,802][model8_pretrain.py][INFO] Epoch:[0/2](74500/4588595) loss:3.259 lr:0.0000135 epoch_Time:28389.0min: [2024-01-03 01:28:19,802][model8_pretrain.py][INFO] Epoch:[0/2](74500/4588595) loss:3.118 lr:0.0000135 epoch_Time:28389.0min: [2024-01-03 01:28:19,802][model8_pretrain.py][INFO] Epoch:[0/2](74500/4588595) loss:3.091 lr:0.0000135 epoch_Time:28389.0min: [2024-01-03 01:28:19,802][model8_pretrain.py][INFO] Epoch:[0/2](74500/4588595) loss:3.198 lr:0.0000135 epoch_Time:28389.0min: [2024-01-03 01:28:19,802][model8_pretrain.py][INFO] Epoch:[0/2](74500/4588595) loss:2.744 lr:0.0000135 epoch_Time:28389.0min: [2024-01-03 01:28:19,802][model8_pretrain.py][INFO] Epoch:[0/2](74500/4588595) loss:2.982 lr:0.0000135 epoch_Time:28389.0min: [2024-01-03 01:28:19,802][model8_pretrain.py][INFO] Epoch:[0/2](74500/4588595) loss:2.288 lr:0.0000135 epoch_Time:28389.0min: [2024-01-03 01:28:19,802][model8_pretrain.py][INFO] Epoch:[0/2](74500/4588595) loss:3.408 lr:0.0000135 epoch_Time:28389.0min: [2024-01-03 01:28:56,741][model8_pretrain.py][INFO] Epoch:[0/2](74600/4588595) loss:2.974 lr:0.0000133 epoch_Time:28387.0min: [2024-01-03 01:28:56,741][model8_pretrain.py][INFO] Epoch:[0/2](74600/4588595) loss:3.200 lr:0.0000133 epoch_Time:28387.0min: [2024-01-03 01:28:56,741][model8_pretrain.py][INFO] Epoch:[0/2](74600/4588595) loss:3.041 lr:0.0000133 epoch_Time:28387.0min: [2024-01-03 01:28:56,741][model8_pretrain.py][INFO] Epoch:[0/2](74600/4588595) loss:2.514 lr:0.0000133 epoch_Time:28387.0min: [2024-01-03 01:28:56,741][model8_pretrain.py][INFO] Epoch:[0/2](74600/4588595) loss:3.306 lr:0.0000133 epoch_Time:28387.0min: [2024-01-03 01:28:56,741][model8_pretrain.py][INFO] Epoch:[0/2](74600/4588595) loss:2.988 lr:0.0000133 epoch_Time:28387.0min: [2024-01-03 
01:28:56,741][model8_pretrain.py][INFO] Epoch:[0/2](74600/4588595) loss:3.126 lr:0.0000133 epoch_Time:28387.0min: [2024-01-03 01:28:56,741][model8_pretrain.py][INFO] Epoch:[0/2](74600/4588595) loss:2.574 lr:0.0000133 epoch_Time:28387.0min: [2024-01-03 01:29:33,669][model8_pretrain.py][INFO] Epoch:[0/2](74700/4588595) loss:2.329 lr:0.0000132 epoch_Time:28387.0min: [2024-01-03 01:29:33,669][model8_pretrain.py][INFO] Epoch:[0/2](74700/4588595) loss:3.282 lr:0.0000132 epoch_Time:28387.0min: [2024-01-03 01:29:33,669][model8_pretrain.py][INFO] Epoch:[0/2](74700/4588595) loss:2.783 lr:0.0000132 epoch_Time:28387.0min: [2024-01-03 01:29:33,669][model8_pretrain.py][INFO] Epoch:[0/2](74700/4588595) loss:2.913 lr:0.0000132 epoch_Time:28387.0min: [2024-01-03 01:29:33,669][model8_pretrain.py][INFO] Epoch:[0/2](74700/4588595) loss:2.361 lr:0.0000132 epoch_Time:28387.0min: [2024-01-03 01:29:33,669][model8_pretrain.py][INFO] Epoch:[0/2](74700/4588595) loss:2.519 lr:0.0000132 epoch_Time:28387.0min: [2024-01-03 01:29:33,669][model8_pretrain.py][INFO] Epoch:[0/2](74700/4588595) loss:2.828 lr:0.0000132 epoch_Time:28387.0min: [2024-01-03 01:29:33,669][model8_pretrain.py][INFO] Epoch:[0/2](74700/4588595) loss:2.973 lr:0.0000132 epoch_Time:28387.0min: [2024-01-03 01:30:10,600][model8_pretrain.py][INFO] Epoch:[0/2](74800/4588595) loss:2.817 lr:0.0000131 epoch_Time:28385.0min: [2024-01-03 01:30:10,600][model8_pretrain.py][INFO] Epoch:[0/2](74800/4588595) loss:2.593 lr:0.0000131 epoch_Time:28385.0min: [2024-01-03 01:30:10,600][model8_pretrain.py][INFO] Epoch:[0/2](74800/4588595) loss:2.783 lr:0.0000131 epoch_Time:28385.0min: [2024-01-03 01:30:10,600][model8_pretrain.py][INFO] Epoch:[0/2](74800/4588595) loss:3.194 lr:0.0000131 epoch_Time:28385.0min: [2024-01-03 01:30:10,600][model8_pretrain.py][INFO] Epoch:[0/2](74800/4588595) loss:2.590 lr:0.0000131 epoch_Time:28385.0min: [2024-01-03 01:30:10,600][model8_pretrain.py][INFO] Epoch:[0/2](74800/4588595) loss:3.063 lr:0.0000131 
epoch_Time:28385.0min: [2024-01-03 01:30:10,600][model8_pretrain.py][INFO] Epoch:[0/2](74800/4588595) loss:2.955 lr:0.0000131 epoch_Time:28385.0min: [2024-01-03 01:30:10,601][model8_pretrain.py][INFO] Epoch:[0/2](74800/4588595) loss:2.853 lr:0.0000131 epoch_Time:28385.0min: [2024-01-03 01:30:47,536][model8_pretrain.py][INFO] Epoch:[0/2](74900/4588595) loss:2.659 lr:0.0000130 epoch_Time:28384.0min: [2024-01-03 01:30:47,536][model8_pretrain.py][INFO] Epoch:[0/2](74900/4588595) loss:3.211 lr:0.0000130 epoch_Time:28384.0min: [2024-01-03 01:30:47,536][model8_pretrain.py][INFO] Epoch:[0/2](74900/4588595) loss:3.174 lr:0.0000130 epoch_Time:28384.0min: [2024-01-03 01:30:47,536][model8_pretrain.py][INFO] Epoch:[0/2](74900/4588595) loss:2.699 lr:0.0000130 epoch_Time:28384.0min: [2024-01-03 01:30:47,536][model8_pretrain.py][INFO] Epoch:[0/2](74900/4588595) loss:3.421 lr:0.0000130 epoch_Time:28384.0min: [2024-01-03 01:30:47,536][model8_pretrain.py][INFO] Epoch:[0/2](74900/4588595) loss:3.183 lr:0.0000130 epoch_Time:28384.0min: [2024-01-03 01:30:47,536][model8_pretrain.py][INFO] Epoch:[0/2](74900/4588595) loss:3.321 lr:0.0000130 epoch_Time:28384.0min: [2024-01-03 01:30:47,536][model8_pretrain.py][INFO] Epoch:[0/2](74900/4588595) loss:3.347 lr:0.0000130 epoch_Time:28384.0min: [2024-01-03 01:31:24,479][model8_pretrain.py][INFO] Epoch:[0/2](75000/4588595) loss:3.014 lr:0.0000129 epoch_Time:28382.0min: [2024-01-03 01:31:24,479][model8_pretrain.py][INFO] Epoch:[0/2](75000/4588595) loss:2.999 lr:0.0000129 epoch_Time:28382.0min: [2024-01-03 01:31:24,479][model8_pretrain.py][INFO] Epoch:[0/2](75000/4588595) loss:3.446 lr:0.0000129 epoch_Time:28382.0min: [2024-01-03 01:31:24,479][model8_pretrain.py][INFO] Epoch:[0/2](75000/4588595) loss:2.507 lr:0.0000129 epoch_Time:28382.0min: [2024-01-03 01:31:24,479][model8_pretrain.py][INFO] Epoch:[0/2](75000/4588595) loss:3.484 lr:0.0000129 epoch_Time:28382.0min: [2024-01-03 01:31:24,479][model8_pretrain.py][INFO] Epoch:[0/2](75000/4588595) 
loss:3.084 lr:0.0000129 epoch_Time:28382.0min: [2024-01-03 01:31:24,479][model8_pretrain.py][INFO] Epoch:[0/2](75000/4588595) loss:2.587 lr:0.0000129 epoch_Time:28382.0min: [2024-01-03 01:31:24,479][model8_pretrain.py][INFO] Epoch:[0/2](75000/4588595) loss:2.931 lr:0.0000129 epoch_Time:28382.0min: [2024-01-03 01:32:01,456][model8_pretrain.py][INFO] Epoch:[0/2](75100/4588595) loss:3.065 lr:0.0000127 epoch_Time:28380.0min: [2024-01-03 01:32:01,456][model8_pretrain.py][INFO] Epoch:[0/2](75100/4588595) loss:3.311 lr:0.0000127 epoch_Time:28380.0min: [2024-01-03 01:32:01,456][model8_pretrain.py][INFO] Epoch:[0/2](75100/4588595) loss:3.190 lr:0.0000127 epoch_Time:28380.0min: [2024-01-03 01:32:01,456][model8_pretrain.py][INFO] Epoch:[0/2](75100/4588595) loss:3.050 lr:0.0000127 epoch_Time:28380.0min: [2024-01-03 01:32:01,456][model8_pretrain.py][INFO] Epoch:[0/2](75100/4588595) loss:2.990 lr:0.0000127 epoch_Time:28380.0min: [2024-01-03 01:32:01,456][model8_pretrain.py][INFO] Epoch:[0/2](75100/4588595) loss:3.091 lr:0.0000127 epoch_Time:28380.0min: [2024-01-03 01:32:01,456][model8_pretrain.py][INFO] Epoch:[0/2](75100/4588595) loss:2.845 lr:0.0000127 epoch_Time:28380.0min: [2024-01-03 01:32:01,456][model8_pretrain.py][INFO] Epoch:[0/2](75100/4588595) loss:2.492 lr:0.0000127 epoch_Time:28380.0min: [2024-01-03 01:32:45,377][model8_pretrain.py][INFO] Epoch:[0/2](75200/4588595) loss:2.988 lr:0.0000126 epoch_Time:28387.0min: [2024-01-03 01:32:45,377][model8_pretrain.py][INFO] Epoch:[0/2](75200/4588595) loss:3.215 lr:0.0000126 epoch_Time:28387.0min: [2024-01-03 01:32:45,377][model8_pretrain.py][INFO] Epoch:[0/2](75200/4588595) loss:3.350 lr:0.0000126 epoch_Time:28387.0min: [2024-01-03 01:32:45,377][model8_pretrain.py][INFO] Epoch:[0/2](75200/4588595) loss:3.003 lr:0.0000126 epoch_Time:28387.0min: [2024-01-03 01:32:45,377][model8_pretrain.py][INFO] Epoch:[0/2](75200/4588595) loss:2.669 lr:0.0000126 epoch_Time:28387.0min: [2024-01-03 01:32:45,377][model8_pretrain.py][INFO] 
Epoch:[0/2](75200/4588595) loss:2.412 lr:0.0000126 epoch_Time:28387.0min: [2024-01-03 01:32:45,377][model8_pretrain.py][INFO] Epoch:[0/2](75200/4588595) loss:3.191 lr:0.0000126 epoch_Time:28387.0min: [2024-01-03 01:32:45,378][model8_pretrain.py][INFO] Epoch:[0/2](75200/4588595) loss:2.953 lr:0.0000126 epoch_Time:28387.0min: [2024-01-03 01:33:22,315][model8_pretrain.py][INFO] Epoch:[0/2](75300/4588595) loss:2.574 lr:0.0000125 epoch_Time:28385.0min: [2024-01-03 01:33:22,315][model8_pretrain.py][INFO] Epoch:[0/2](75300/4588595) loss:3.104 lr:0.0000125 epoch_Time:28385.0min: [2024-01-03 01:33:22,315][model8_pretrain.py][INFO] Epoch:[0/2](75300/4588595) loss:3.204 lr:0.0000125 epoch_Time:28385.0min: [2024-01-03 01:33:22,315][model8_pretrain.py][INFO] Epoch:[0/2](75300/4588595) loss:3.248 lr:0.0000125 epoch_Time:28385.0min: [2024-01-03 01:33:22,315][model8_pretrain.py][INFO] Epoch:[0/2](75300/4588595) loss:2.923 lr:0.0000125 epoch_Time:28385.0min: [2024-01-03 01:33:22,315][model8_pretrain.py][INFO] Epoch:[0/2](75300/4588595) loss:3.027 lr:0.0000125 epoch_Time:28385.0min: [2024-01-03 01:33:22,315][model8_pretrain.py][INFO] Epoch:[0/2](75300/4588595) loss:3.343 lr:0.0000125 epoch_Time:28385.0min: [2024-01-03 01:33:22,315][model8_pretrain.py][INFO] Epoch:[0/2](75300/4588595) loss:2.832 lr:0.0000125 epoch_Time:28385.0min: [2024-01-03 01:33:59,252][model8_pretrain.py][INFO] Epoch:[0/2](75400/4588595) loss:2.884 lr:0.0000124 epoch_Time:28383.0min: [2024-01-03 01:33:59,252][model8_pretrain.py][INFO] Epoch:[0/2](75400/4588595) loss:2.817 lr:0.0000124 epoch_Time:28383.0min: [2024-01-03 01:33:59,252][model8_pretrain.py][INFO] Epoch:[0/2](75400/4588595) loss:3.388 lr:0.0000124 epoch_Time:28383.0min: [2024-01-03 01:33:59,252][model8_pretrain.py][INFO] Epoch:[0/2](75400/4588595) loss:3.281 lr:0.0000124 epoch_Time:28383.0min: [2024-01-03 01:33:59,252][model8_pretrain.py][INFO] Epoch:[0/2](75400/4588595) loss:3.082 lr:0.0000124 epoch_Time:28383.0min: [2024-01-03 
01:33:59,252][model8_pretrain.py][INFO] Epoch:[0/2](75400/4588595) loss:2.876 lr:0.0000124 epoch_Time:28383.0min: [2024-01-03 01:33:59,252][model8_pretrain.py][INFO] Epoch:[0/2](75400/4588595) loss:3.029 lr:0.0000124 epoch_Time:28383.0min: [2024-01-03 01:33:59,252][model8_pretrain.py][INFO] Epoch:[0/2](75400/4588595) loss:2.725 lr:0.0000124 epoch_Time:28383.0min: [2024-01-03 01:34:36,205][model8_pretrain.py][INFO] Epoch:[0/2](75500/4588595) loss:2.574 lr:0.0000123 epoch_Time:28382.0min: [2024-01-03 01:34:36,205][model8_pretrain.py][INFO] Epoch:[0/2](75500/4588595) loss:2.964 lr:0.0000123 epoch_Time:28382.0min: [2024-01-03 01:34:36,205][model8_pretrain.py][INFO] Epoch:[0/2](75500/4588595) loss:2.886 lr:0.0000123 epoch_Time:28382.0min: [2024-01-03 01:34:36,205][model8_pretrain.py][INFO] Epoch:[0/2](75500/4588595) loss:2.984 lr:0.0000123 epoch_Time:28382.0min: [2024-01-03 01:34:36,205][model8_pretrain.py][INFO] Epoch:[0/2](75500/4588595) loss:2.901 lr:0.0000123 epoch_Time:28382.0min: [2024-01-03 01:34:36,205][model8_pretrain.py][INFO] Epoch:[0/2](75500/4588595) loss:3.320 lr:0.0000123 epoch_Time:28382.0min: [2024-01-03 01:34:36,205][model8_pretrain.py][INFO] Epoch:[0/2](75500/4588595) loss:2.965 lr:0.0000123 epoch_Time:28382.0min: [2024-01-03 01:34:36,205][model8_pretrain.py][INFO] Epoch:[0/2](75500/4588595) loss:3.003 lr:0.0000123 epoch_Time:28382.0min: [2024-01-03 01:35:13,148][model8_pretrain.py][INFO] Epoch:[0/2](75600/4588595) loss:3.785 lr:0.0000122 epoch_Time:28380.0min: [2024-01-03 01:35:13,148][model8_pretrain.py][INFO] Epoch:[0/2](75600/4588595) loss:2.927 lr:0.0000122 epoch_Time:28380.0min: [2024-01-03 01:35:13,148][model8_pretrain.py][INFO] Epoch:[0/2](75600/4588595) loss:2.855 lr:0.0000122 epoch_Time:28380.0min: [2024-01-03 01:35:13,148][model8_pretrain.py][INFO] Epoch:[0/2](75600/4588595) loss:3.242 lr:0.0000122 epoch_Time:28380.0min: [2024-01-03 01:35:13,148][model8_pretrain.py][INFO] Epoch:[0/2](75600/4588595) loss:3.085 lr:0.0000122 
epoch_Time:28380.0min: [2024-01-03 01:35:13,148][model8_pretrain.py][INFO] Epoch:[0/2](75600/4588595) loss:3.138 lr:0.0000122 epoch_Time:28380.0min: [2024-01-03 01:35:13,148][model8_pretrain.py][INFO] Epoch:[0/2](75600/4588595) loss:2.094 lr:0.0000122 epoch_Time:28380.0min: [2024-01-03 01:35:13,148][model8_pretrain.py][INFO] Epoch:[0/2](75600/4588595) loss:3.166 lr:0.0000122 epoch_Time:28380.0min: [2024-01-03 01:35:50,098][model8_pretrain.py][INFO] Epoch:[0/2](75700/4588595) loss:3.048 lr:0.0000121 epoch_Time:28379.0min: [2024-01-03 01:35:50,098][model8_pretrain.py][INFO] Epoch:[0/2](75700/4588595) loss:3.253 lr:0.0000121 epoch_Time:28379.0min: [2024-01-03 01:35:50,098][model8_pretrain.py][INFO] Epoch:[0/2](75700/4588595) loss:2.802 lr:0.0000121 epoch_Time:28379.0min: [2024-01-03 01:35:50,098][model8_pretrain.py][INFO] Epoch:[0/2](75700/4588595) loss:2.639 lr:0.0000121 epoch_Time:28379.0min: [2024-01-03 01:35:50,098][model8_pretrain.py][INFO] Epoch:[0/2](75700/4588595) loss:3.276 lr:0.0000121 epoch_Time:28379.0min: [2024-01-03 01:35:50,098][model8_pretrain.py][INFO] Epoch:[0/2](75700/4588595) loss:3.033 lr:0.0000121 epoch_Time:28379.0min: [2024-01-03 01:35:50,098][model8_pretrain.py][INFO] Epoch:[0/2](75700/4588595) loss:3.309 lr:0.0000121 epoch_Time:28379.0min: [2024-01-03 01:35:50,098][model8_pretrain.py][INFO] Epoch:[0/2](75700/4588595) loss:2.772 lr:0.0000121 epoch_Time:28379.0min: [2024-01-03 01:36:27,036][model8_pretrain.py][INFO] Epoch:[0/2](75800/4588595) loss:2.787 lr:0.0000120 epoch_Time:28378.0min: [2024-01-03 01:36:27,037][model8_pretrain.py][INFO] Epoch:[0/2](75800/4588595) loss:3.328 lr:0.0000120 epoch_Time:28378.0min: [2024-01-03 01:36:27,037][model8_pretrain.py][INFO] Epoch:[0/2](75800/4588595) loss:3.014 lr:0.0000120 epoch_Time:28378.0min: [2024-01-03 01:36:27,036][model8_pretrain.py][INFO] Epoch:[0/2](75800/4588595) loss:2.840 lr:0.0000120 epoch_Time:28378.0min: [2024-01-03 01:36:27,037][model8_pretrain.py][INFO] Epoch:[0/2](75800/4588595) 
loss:2.694 lr:0.0000120 epoch_Time:28378.0min: [2024-01-03 01:36:27,037][model8_pretrain.py][INFO] Epoch:[0/2](75800/4588595) loss:3.656 lr:0.0000120 epoch_Time:28378.0min: [2024-01-03 01:36:27,037][model8_pretrain.py][INFO] Epoch:[0/2](75800/4588595) loss:3.203 lr:0.0000120 epoch_Time:28378.0min: [2024-01-03 01:36:27,037][model8_pretrain.py][INFO] Epoch:[0/2](75800/4588595) loss:2.861 lr:0.0000120 epoch_Time:28378.0min: [2024-01-03 01:37:03,978][model8_pretrain.py][INFO] Epoch:[0/2](75900/4588595) loss:2.793 lr:0.0000119 epoch_Time:28376.0min: [2024-01-03 01:37:03,978][model8_pretrain.py][INFO] Epoch:[0/2](75900/4588595) loss:3.395 lr:0.0000119 epoch_Time:28376.0min: [2024-01-03 01:37:03,978][model8_pretrain.py][INFO] Epoch:[0/2](75900/4588595) loss:3.173 lr:0.0000119 epoch_Time:28376.0min: [2024-01-03 01:37:03,978][model8_pretrain.py][INFO] Epoch:[0/2](75900/4588595) loss:2.923 lr:0.0000119 epoch_Time:28376.0min: [2024-01-03 01:37:03,978][model8_pretrain.py][INFO] Epoch:[0/2](75900/4588595) loss:3.502 lr:0.0000119 epoch_Time:28376.0min: [2024-01-03 01:37:03,978][model8_pretrain.py][INFO] Epoch:[0/2](75900/4588595) loss:3.236 lr:0.0000119 epoch_Time:28376.0min: [2024-01-03 01:37:03,978][model8_pretrain.py][INFO] Epoch:[0/2](75900/4588595) loss:2.784 lr:0.0000119 epoch_Time:28376.0min: [2024-01-03 01:37:03,978][model8_pretrain.py][INFO] Epoch:[0/2](75900/4588595) loss:2.642 lr:0.0000119 epoch_Time:28376.0min: [2024-01-03 01:37:48,046][model8_pretrain.py][INFO] Epoch:[0/2](76000/4588595) loss:2.881 lr:0.0000118 epoch_Time:28381.0min: [2024-01-03 01:37:48,046][model8_pretrain.py][INFO] Epoch:[0/2](76000/4588595) loss:2.969 lr:0.0000118 epoch_Time:28381.0min: [2024-01-03 01:37:48,046][model8_pretrain.py][INFO] Epoch:[0/2](76000/4588595) loss:3.582 lr:0.0000118 epoch_Time:28381.0min: [2024-01-03 01:37:48,047][model8_pretrain.py][INFO] Epoch:[0/2](76000/4588595) loss:3.362 lr:0.0000118 epoch_Time:28381.0min: [2024-01-03 01:37:48,046][model8_pretrain.py][INFO] 
Epoch:[0/2](76000/4588595) loss:2.452 lr:0.0000118 epoch_Time:28381.0min: [2024-01-03 01:37:48,047][model8_pretrain.py][INFO] Epoch:[0/2](76000/4588595) loss:2.903 lr:0.0000118 epoch_Time:28381.0min: [2024-01-03 01:37:48,046][model8_pretrain.py][INFO] Epoch:[0/2](76000/4588595) loss:2.914 lr:0.0000118 epoch_Time:28381.0min: [2024-01-03 01:37:48,047][model8_pretrain.py][INFO] Epoch:[0/2](76000/4588595) loss:3.135 lr:0.0000118 epoch_Time:28381.0min: [2024-01-03 01:38:24,964][model8_pretrain.py][INFO] Epoch:[0/2](76100/4588595) loss:3.530 lr:0.0000117 epoch_Time:28381.0min: [2024-01-03 01:38:24,965][model8_pretrain.py][INFO] Epoch:[0/2](76100/4588595) loss:2.816 lr:0.0000117 epoch_Time:28381.0min: [2024-01-03 01:38:24,965][model8_pretrain.py][INFO] Epoch:[0/2](76100/4588595) loss:3.256 lr:0.0000117 epoch_Time:28381.0min: [2024-01-03 01:38:24,965][model8_pretrain.py][INFO] Epoch:[0/2](76100/4588595) loss:2.580 lr:0.0000117 epoch_Time:28381.0min: [2024-01-03 01:38:24,965][model8_pretrain.py][INFO] Epoch:[0/2](76100/4588595) loss:3.300 lr:0.0000117 epoch_Time:28381.0min: [2024-01-03 01:38:24,965][model8_pretrain.py][INFO] Epoch:[0/2](76100/4588595) loss:3.349 lr:0.0000117 epoch_Time:28381.0min: [2024-01-03 01:38:24,965][model8_pretrain.py][INFO] Epoch:[0/2](76100/4588595) loss:3.133 lr:0.0000117 epoch_Time:28381.0min: [2024-01-03 01:38:24,965][model8_pretrain.py][INFO] Epoch:[0/2](76100/4588595) loss:3.067 lr:0.0000117 epoch_Time:28381.0min: [2024-01-03 01:39:01,902][model8_pretrain.py][INFO] Epoch:[0/2](76200/4588595) loss:2.840 lr:0.0000117 epoch_Time:28379.0min: [2024-01-03 01:39:01,902][model8_pretrain.py][INFO] Epoch:[0/2](76200/4588595) loss:2.674 lr:0.0000117 epoch_Time:28379.0min: [2024-01-03 01:39:01,902][model8_pretrain.py][INFO] Epoch:[0/2](76200/4588595) loss:2.726 lr:0.0000117 epoch_Time:28379.0min: [2024-01-03 01:39:01,903][model8_pretrain.py][INFO] Epoch:[0/2](76200/4588595) loss:2.868 lr:0.0000117 epoch_Time:28379.0min: [2024-01-03 
01:39:01,903][model8_pretrain.py][INFO] Epoch:[0/2](76200/4588595) loss:3.084 lr:0.0000117 epoch_Time:28379.0min: [2024-01-03 01:39:01,903][model8_pretrain.py][INFO] Epoch:[0/2](76200/4588595) loss:3.071 lr:0.0000117 epoch_Time:28379.0min: [2024-01-03 01:39:01,903][model8_pretrain.py][INFO] Epoch:[0/2](76200/4588595) loss:3.221 lr:0.0000117 epoch_Time:28379.0min: [2024-01-03 01:39:01,903][model8_pretrain.py][INFO] Epoch:[0/2](76200/4588595) loss:3.175 lr:0.0000117 epoch_Time:28379.0min: [2024-01-03 01:39:38,844][model8_pretrain.py][INFO] Epoch:[0/2](76300/4588595) loss:3.073 lr:0.0000116 epoch_Time:28378.0min: [2024-01-03 01:39:38,844][model8_pretrain.py][INFO] Epoch:[0/2](76300/4588595) loss:3.062 lr:0.0000116 epoch_Time:28378.0min: [2024-01-03 01:39:38,844][model8_pretrain.py][INFO] Epoch:[0/2](76300/4588595) loss:2.913 lr:0.0000116 epoch_Time:28378.0min: [2024-01-03 01:39:38,844][model8_pretrain.py][INFO] Epoch:[0/2](76300/4588595) loss:2.864 lr:0.0000116 epoch_Time:28378.0min: [2024-01-03 01:39:38,844][model8_pretrain.py][INFO] Epoch:[0/2](76300/4588595) loss:3.400 lr:0.0000116 epoch_Time:28378.0min: [2024-01-03 01:39:38,844][model8_pretrain.py][INFO] Epoch:[0/2](76300/4588595) loss:3.198 lr:0.0000116 epoch_Time:28378.0min: [2024-01-03 01:39:38,844][model8_pretrain.py][INFO] Epoch:[0/2](76300/4588595) loss:3.042 lr:0.0000116 epoch_Time:28378.0min: [2024-01-03 01:39:38,844][model8_pretrain.py][INFO] Epoch:[0/2](76300/4588595) loss:2.548 lr:0.0000116 epoch_Time:28378.0min: [2024-01-03 01:40:15,798][model8_pretrain.py][INFO] Epoch:[0/2](76400/4588595) loss:2.895 lr:0.0000115 epoch_Time:28376.0min: [2024-01-03 01:40:15,798][model8_pretrain.py][INFO] Epoch:[0/2](76400/4588595) loss:3.253 lr:0.0000115 epoch_Time:28376.0min: [2024-01-03 01:40:15,798][model8_pretrain.py][INFO] Epoch:[0/2](76400/4588595) loss:3.048 lr:0.0000115 epoch_Time:28376.0min: [2024-01-03 01:40:15,798][model8_pretrain.py][INFO] Epoch:[0/2](76400/4588595) loss:3.102 lr:0.0000115 
epoch_Time:28376.0min: [2024-01-03 01:40:15,798][model8_pretrain.py][INFO] Epoch:[0/2](76400/4588595) loss:2.812 lr:0.0000115 epoch_Time:28376.0min: [2024-01-03 01:40:15,798][model8_pretrain.py][INFO] Epoch:[0/2](76400/4588595) loss:2.920 lr:0.0000115 epoch_Time:28376.0min: [2024-01-03 01:40:15,798][model8_pretrain.py][INFO] Epoch:[0/2](76400/4588595) loss:2.962 lr:0.0000115 epoch_Time:28376.0min: [2024-01-03 01:40:15,798][model8_pretrain.py][INFO] Epoch:[0/2](76400/4588595) loss:3.311 lr:0.0000115 epoch_Time:28376.0min: [2024-01-03 01:40:52,737][model8_pretrain.py][INFO] Epoch:[0/2](76500/4588595) loss:3.260 lr:0.0000114 epoch_Time:28374.0min: [2024-01-03 01:40:52,737][model8_pretrain.py][INFO] Epoch:[0/2](76500/4588595) loss:2.611 lr:0.0000114 epoch_Time:28374.0min: [2024-01-03 01:40:52,737][model8_pretrain.py][INFO] Epoch:[0/2](76500/4588595) loss:2.767 lr:0.0000114 epoch_Time:28374.0min: [2024-01-03 01:40:52,737][model8_pretrain.py][INFO] Epoch:[0/2](76500/4588595) loss:3.086 lr:0.0000114 epoch_Time:28374.0min: [2024-01-03 01:40:52,737][model8_pretrain.py][INFO] Epoch:[0/2](76500/4588595) loss:3.297 lr:0.0000114 epoch_Time:28374.0min: [2024-01-03 01:40:52,737][model8_pretrain.py][INFO] Epoch:[0/2](76500/4588595) loss:3.296 lr:0.0000114 epoch_Time:28374.0min: [2024-01-03 01:40:52,737][model8_pretrain.py][INFO] Epoch:[0/2](76500/4588595) loss:2.821 lr:0.0000114 epoch_Time:28374.0min: [2024-01-03 01:40:52,737][model8_pretrain.py][INFO] Epoch:[0/2](76500/4588595) loss:3.014 lr:0.0000114 epoch_Time:28374.0min: [2024-01-03 01:41:29,675][model8_pretrain.py][INFO] Epoch:[0/2](76600/4588595) loss:3.229 lr:0.0000113 epoch_Time:28374.0min: [2024-01-03 01:41:29,675][model8_pretrain.py][INFO] Epoch:[0/2](76600/4588595) loss:3.109 lr:0.0000113 epoch_Time:28374.0min: [2024-01-03 01:41:29,675][model8_pretrain.py][INFO] Epoch:[0/2](76600/4588595) loss:2.868 lr:0.0000113 epoch_Time:28374.0min: [2024-01-03 01:41:29,675][model8_pretrain.py][INFO] Epoch:[0/2](76600/4588595) 
loss:3.244 lr:0.0000113 epoch_Time:28374.0min: [2024-01-03 01:41:29,675][model8_pretrain.py][INFO] Epoch:[0/2](76600/4588595) loss:2.955 lr:0.0000113 epoch_Time:28374.0min: [2024-01-03 01:41:29,675][model8_pretrain.py][INFO] Epoch:[0/2](76600/4588595) loss:3.020 lr:0.0000113 epoch_Time:28374.0min: [2024-01-03 01:41:29,675][model8_pretrain.py][INFO] Epoch:[0/2](76600/4588595) loss:2.412 lr:0.0000113 epoch_Time:28374.0min: [2024-01-03 01:41:29,675][model8_pretrain.py][INFO] Epoch:[0/2](76600/4588595) loss:2.750 lr:0.0000113 epoch_Time:28374.0min: [2024-01-03 01:42:06,627][model8_pretrain.py][INFO] Epoch:[0/2](76700/4588595) loss:3.072 lr:0.0000112 epoch_Time:28372.0min: [2024-01-03 01:42:06,627][model8_pretrain.py][INFO] Epoch:[0/2](76700/4588595) loss:2.725 lr:0.0000112 epoch_Time:28372.0min: [2024-01-03 01:42:06,627][model8_pretrain.py][INFO] Epoch:[0/2](76700/4588595) loss:3.041 lr:0.0000112 epoch_Time:28372.0min: [2024-01-03 01:42:06,628][model8_pretrain.py][INFO] Epoch:[0/2](76700/4588595) loss:3.044 lr:0.0000112 epoch_Time:28372.0min: [2024-01-03 01:42:06,628][model8_pretrain.py][INFO] Epoch:[0/2](76700/4588595) loss:3.115 lr:0.0000112 epoch_Time:28372.0min: [2024-01-03 01:42:06,628][model8_pretrain.py][INFO] Epoch:[0/2](76700/4588595) loss:2.979 lr:0.0000112 epoch_Time:28372.0min: [2024-01-03 01:42:06,628][model8_pretrain.py][INFO] Epoch:[0/2](76700/4588595) loss:3.287 lr:0.0000112 epoch_Time:28372.0min: [2024-01-03 01:42:06,628][model8_pretrain.py][INFO] Epoch:[0/2](76700/4588595) loss:3.289 lr:0.0000112 epoch_Time:28372.0min: [2024-01-03 01:42:50,697][model8_pretrain.py][INFO] Epoch:[0/2](76800/4588595) loss:3.286 lr:0.0000112 epoch_Time:28377.0min: [2024-01-03 01:42:50,697][model8_pretrain.py][INFO] Epoch:[0/2](76800/4588595) loss:2.863 lr:0.0000112 epoch_Time:28377.0min: [2024-01-03 01:42:50,697][model8_pretrain.py][INFO] Epoch:[0/2](76800/4588595) loss:3.113 lr:0.0000112 epoch_Time:28377.0min: [2024-01-03 01:42:50,697][model8_pretrain.py][INFO] 
Epoch:[0/2](76800/4588595) loss:2.819 lr:0.0000112 epoch_Time:28377.0min: [2024-01-03 01:42:50,697][model8_pretrain.py][INFO] Epoch:[0/2](76800/4588595) loss:2.938 lr:0.0000112 epoch_Time:28377.0min: [2024-01-03 01:42:50,697][model8_pretrain.py][INFO] Epoch:[0/2](76800/4588595) loss:3.044 lr:0.0000112 epoch_Time:28377.0min: [2024-01-03 01:42:50,697][model8_pretrain.py][INFO] Epoch:[0/2](76800/4588595) loss:3.060 lr:0.0000112 epoch_Time:28377.0min: [2024-01-03 01:42:50,697][model8_pretrain.py][INFO] Epoch:[0/2](76800/4588595) loss:3.222 lr:0.0000112 epoch_Time:28377.0min: [2024-01-03 01:43:27,629][model8_pretrain.py][INFO] Epoch:[0/2](76900/4588595) loss:2.976 lr:0.0000111 epoch_Time:28376.0min: [2024-01-03 01:43:27,629][model8_pretrain.py][INFO] Epoch:[0/2](76900/4588595) loss:3.244 lr:0.0000111 epoch_Time:28376.0min: [2024-01-03 01:43:27,629][model8_pretrain.py][INFO] Epoch:[0/2](76900/4588595) loss:3.285 lr:0.0000111 epoch_Time:28376.0min: [2024-01-03 01:43:27,629][model8_pretrain.py][INFO] Epoch:[0/2](76900/4588595) loss:2.723 lr:0.0000111 epoch_Time:28376.0min: [2024-01-03 01:43:27,630][model8_pretrain.py][INFO] Epoch:[0/2](76900/4588595) loss:2.653 lr:0.0000111 epoch_Time:28376.0min: [2024-01-03 01:43:27,630][model8_pretrain.py][INFO] Epoch:[0/2](76900/4588595) loss:2.894 lr:0.0000111 epoch_Time:28376.0min: [2024-01-03 01:43:27,630][model8_pretrain.py][INFO] Epoch:[0/2](76900/4588595) loss:3.257 lr:0.0000111 epoch_Time:28376.0min: [2024-01-03 01:43:27,630][model8_pretrain.py][INFO] Epoch:[0/2](76900/4588595) loss:3.047 lr:0.0000111 epoch_Time:28376.0min: [2024-01-03 01:44:04,554][model8_pretrain.py][INFO] Epoch:[0/2](77000/4588595) loss:3.259 lr:0.0000110 epoch_Time:28375.0min: [2024-01-03 01:44:04,554][model8_pretrain.py][INFO] Epoch:[0/2](77000/4588595) loss:2.660 lr:0.0000110 epoch_Time:28375.0min: [2024-01-03 01:44:04,554][model8_pretrain.py][INFO] Epoch:[0/2](77000/4588595) loss:2.652 lr:0.0000110 epoch_Time:28375.0min: [2024-01-03 
01:44:04,554][model8_pretrain.py][INFO] Epoch:[0/2](77000/4588595) loss:2.699 lr:0.0000110 epoch_Time:28375.0min: [2024-01-03 01:44:04,554][model8_pretrain.py][INFO] Epoch:[0/2](77000/4588595) loss:2.798 lr:0.0000110 epoch_Time:28375.0min: [2024-01-03 01:44:04,554][model8_pretrain.py][INFO] Epoch:[0/2](77000/4588595) loss:3.334 lr:0.0000110 epoch_Time:28375.0min: [2024-01-03 01:44:04,554][model8_pretrain.py][INFO] Epoch:[0/2](77000/4588595) loss:2.268 lr:0.0000110 epoch_Time:28375.0min: [2024-01-03 01:44:04,554][model8_pretrain.py][INFO] Epoch:[0/2](77000/4588595) loss:3.226 lr:0.0000110 epoch_Time:28375.0min: [2024-01-03 01:44:41,497][model8_pretrain.py][INFO] Epoch:[0/2](77100/4588595) loss:3.276 lr:0.0000110 epoch_Time:28374.0min: [2024-01-03 01:44:41,497][model8_pretrain.py][INFO] Epoch:[0/2](77100/4588595) loss:3.321 lr:0.0000110 epoch_Time:28374.0min: [2024-01-03 01:44:41,497][model8_pretrain.py][INFO] Epoch:[0/2](77100/4588595) loss:2.817 lr:0.0000110 epoch_Time:28374.0min: [2024-01-03 01:44:41,497][model8_pretrain.py][INFO] Epoch:[0/2](77100/4588595) loss:2.821 lr:0.0000110 epoch_Time:28374.0min: [2024-01-03 01:44:41,497][model8_pretrain.py][INFO] Epoch:[0/2](77100/4588595) loss:2.801 lr:0.0000110 epoch_Time:28374.0min: [2024-01-03 01:44:41,497][model8_pretrain.py][INFO] Epoch:[0/2](77100/4588595) loss:3.294 lr:0.0000110 epoch_Time:28374.0min: [2024-01-03 01:44:41,497][model8_pretrain.py][INFO] Epoch:[0/2](77100/4588595) loss:3.383 lr:0.0000110 epoch_Time:28374.0min: [2024-01-03 01:44:41,497][model8_pretrain.py][INFO] Epoch:[0/2](77100/4588595) loss:3.348 lr:0.0000110 epoch_Time:28374.0min: [2024-01-03 01:45:18,436][model8_pretrain.py][INFO] Epoch:[0/2](77200/4588595) loss:2.694 lr:0.0000109 epoch_Time:28372.0min: [2024-01-03 01:45:18,436][model8_pretrain.py][INFO] Epoch:[0/2](77200/4588595) loss:3.265 lr:0.0000109 epoch_Time:28372.0min: [2024-01-03 01:45:18,436][model8_pretrain.py][INFO] Epoch:[0/2](77200/4588595) loss:2.792 lr:0.0000109 
epoch_Time:28372.0min: [2024-01-03 01:45:18,436][model8_pretrain.py][INFO] Epoch:[0/2](77200/4588595) loss:3.445 lr:0.0000109 epoch_Time:28372.0min: [2024-01-03 01:45:18,436][model8_pretrain.py][INFO] Epoch:[0/2](77200/4588595) loss:3.364 lr:0.0000109 epoch_Time:28372.0min: [2024-01-03 01:45:18,436][model8_pretrain.py][INFO] Epoch:[0/2](77200/4588595) loss:3.285 lr:0.0000109 epoch_Time:28372.0min: [2024-01-03 01:45:18,436][model8_pretrain.py][INFO] Epoch:[0/2](77200/4588595) loss:3.136 lr:0.0000109 epoch_Time:28372.0min: [2024-01-03 01:45:18,437][model8_pretrain.py][INFO] Epoch:[0/2](77200/4588595) loss:3.084 lr:0.0000109 epoch_Time:28372.0min: [2024-01-03 01:45:55,365][model8_pretrain.py][INFO] Epoch:[0/2](77300/4588595) loss:3.460 lr:0.0000108 epoch_Time:28370.0min: [2024-01-03 01:45:55,365][model8_pretrain.py][INFO] Epoch:[0/2](77300/4588595) loss:2.830 lr:0.0000108 epoch_Time:28370.0min: [2024-01-03 01:45:55,365][model8_pretrain.py][INFO] Epoch:[0/2](77300/4588595) loss:3.045 lr:0.0000108 epoch_Time:28370.0min: [2024-01-03 01:45:55,365][model8_pretrain.py][INFO] Epoch:[0/2](77300/4588595) loss:3.200 lr:0.0000108 epoch_Time:28370.0min: [2024-01-03 01:45:55,365][model8_pretrain.py][INFO] Epoch:[0/2](77300/4588595) loss:2.951 lr:0.0000108 epoch_Time:28370.0min: [2024-01-03 01:45:55,365][model8_pretrain.py][INFO] Epoch:[0/2](77300/4588595) loss:2.694 lr:0.0000108 epoch_Time:28370.0min: [2024-01-03 01:45:55,365][model8_pretrain.py][INFO] Epoch:[0/2](77300/4588595) loss:3.038 lr:0.0000108 epoch_Time:28370.0min: [2024-01-03 01:45:55,366][model8_pretrain.py][INFO] Epoch:[0/2](77300/4588595) loss:2.792 lr:0.0000108 epoch_Time:28370.0min: [2024-01-03 01:46:32,300][model8_pretrain.py][INFO] Epoch:[0/2](77400/4588595) loss:2.874 lr:0.0000108 epoch_Time:28369.0min: [2024-01-03 01:46:32,300][model8_pretrain.py][INFO] Epoch:[0/2](77400/4588595) loss:3.077 lr:0.0000108 epoch_Time:28369.0min: [2024-01-03 01:46:32,300][model8_pretrain.py][INFO] Epoch:[0/2](77400/4588595) 
loss:3.444 lr:0.0000108 epoch_Time:28369.0min: [2024-01-03 01:46:32,300][model8_pretrain.py][INFO] Epoch:[0/2](77400/4588595) loss:2.928 lr:0.0000108 epoch_Time:28369.0min: [2024-01-03 01:46:32,300][model8_pretrain.py][INFO] Epoch:[0/2](77400/4588595) loss:3.123 lr:0.0000108 epoch_Time:28369.0min: [2024-01-03 01:46:32,300][model8_pretrain.py][INFO] Epoch:[0/2](77400/4588595) loss:3.174 lr:0.0000108 epoch_Time:28369.0min: [2024-01-03 01:46:32,300][model8_pretrain.py][INFO] Epoch:[0/2](77400/4588595) loss:3.456 lr:0.0000108 epoch_Time:28369.0min: [2024-01-03 01:46:32,300][model8_pretrain.py][INFO] Epoch:[0/2](77400/4588595) loss:2.302 lr:0.0000108 epoch_Time:28369.0min: [2024-01-03 01:47:09,254][model8_pretrain.py][INFO] Epoch:[0/2](77500/4588595) loss:3.241 lr:0.0000107 epoch_Time:28368.0min: [2024-01-03 01:47:09,254][model8_pretrain.py][INFO] Epoch:[0/2](77500/4588595) loss:3.151 lr:0.0000107 epoch_Time:28368.0min: [2024-01-03 01:47:09,254][model8_pretrain.py][INFO] Epoch:[0/2](77500/4588595) loss:2.866 lr:0.0000107 epoch_Time:28368.0min: [2024-01-03 01:47:09,254][model8_pretrain.py][INFO] Epoch:[0/2](77500/4588595) loss:3.104 lr:0.0000107 epoch_Time:28368.0min: [2024-01-03 01:47:09,254][model8_pretrain.py][INFO] Epoch:[0/2](77500/4588595) loss:2.940 lr:0.0000107 epoch_Time:28368.0min: [2024-01-03 01:47:09,254][model8_pretrain.py][INFO] Epoch:[0/2](77500/4588595) loss:3.166 lr:0.0000107 epoch_Time:28368.0min: [2024-01-03 01:47:09,254][model8_pretrain.py][INFO] Epoch:[0/2](77500/4588595) loss:2.886 lr:0.0000107 epoch_Time:28368.0min: [2024-01-03 01:47:09,254][model8_pretrain.py][INFO] Epoch:[0/2](77500/4588595) loss:3.108 lr:0.0000107 epoch_Time:28368.0min: [2024-01-03 01:47:53,313][model8_pretrain.py][INFO] Epoch:[0/2](77600/4588595) loss:2.790 lr:0.0000107 epoch_Time:28373.0min: [2024-01-03 01:47:53,313][model8_pretrain.py][INFO] Epoch:[0/2](77600/4588595) loss:2.875 lr:0.0000107 epoch_Time:28373.0min: [2024-01-03 01:47:53,313][model8_pretrain.py][INFO] 
Epoch:[0/2](77600/4588595) loss:3.372 lr:0.0000107 epoch_Time:28373.0min: [2024-01-03 01:47:53,313][model8_pretrain.py][INFO] Epoch:[0/2](77600/4588595) loss:3.199 lr:0.0000107 epoch_Time:28373.0min: [2024-01-03 01:47:53,313][model8_pretrain.py][INFO] Epoch:[0/2](77600/4588595) loss:3.119 lr:0.0000107 epoch_Time:28373.0min: [2024-01-03 01:47:53,313][model8_pretrain.py][INFO] Epoch:[0/2](77600/4588595) loss:3.050 lr:0.0000107 epoch_Time:28373.0min: [2024-01-03 01:47:53,313][model8_pretrain.py][INFO] Epoch:[0/2](77600/4588595) loss:2.966 lr:0.0000107 epoch_Time:28373.0min: [2024-01-03 01:47:53,314][model8_pretrain.py][INFO] Epoch:[0/2](77600/4588595) loss:3.622 lr:0.0000107 epoch_Time:28373.0min: [2024-01-03 01:48:30,253][model8_pretrain.py][INFO] Epoch:[0/2](77700/4588595) loss:2.655 lr:0.0000106 epoch_Time:28372.0min: [2024-01-03 01:48:30,253][model8_pretrain.py][INFO] Epoch:[0/2](77700/4588595) loss:3.214 lr:0.0000106 epoch_Time:28372.0min: [2024-01-03 01:48:30,253][model8_pretrain.py][INFO] Epoch:[0/2](77700/4588595) loss:2.687 lr:0.0000106 epoch_Time:28372.0min: [2024-01-03 01:48:30,253][model8_pretrain.py][INFO] Epoch:[0/2](77700/4588595) loss:3.260 lr:0.0000106 epoch_Time:28372.0min: [2024-01-03 01:48:30,253][model8_pretrain.py][INFO] Epoch:[0/2](77700/4588595) loss:2.891 lr:0.0000106 epoch_Time:28372.0min: [2024-01-03 01:48:30,253][model8_pretrain.py][INFO] Epoch:[0/2](77700/4588595) loss:3.308 lr:0.0000106 epoch_Time:28372.0min: [2024-01-03 01:48:30,253][model8_pretrain.py][INFO] Epoch:[0/2](77700/4588595) loss:3.233 lr:0.0000106 epoch_Time:28372.0min: [2024-01-03 01:48:30,253][model8_pretrain.py][INFO] Epoch:[0/2](77700/4588595) loss:3.014 lr:0.0000106 epoch_Time:28372.0min: [2024-01-03 01:49:07,190][model8_pretrain.py][INFO] Epoch:[0/2](77800/4588595) loss:2.841 lr:0.0000106 epoch_Time:28370.0min: [2024-01-03 01:49:07,190][model8_pretrain.py][INFO] Epoch:[0/2](77800/4588595) loss:3.386 lr:0.0000106 epoch_Time:28370.0min: [2024-01-03 
01:49:07,190][model8_pretrain.py][INFO] Epoch:[0/2](77800/4588595) loss:2.578 lr:0.0000106 epoch_Time:28370.0min: [2024-01-03 01:49:07,190][model8_pretrain.py][INFO] Epoch:[0/2](77800/4588595) loss:3.163 lr:0.0000106 epoch_Time:28370.0min: [2024-01-03 01:49:07,190][model8_pretrain.py][INFO] Epoch:[0/2](77800/4588595) loss:2.830 lr:0.0000106 epoch_Time:28370.0min: [2024-01-03 01:49:07,190][model8_pretrain.py][INFO] Epoch:[0/2](77800/4588595) loss:2.954 lr:0.0000106 epoch_Time:28370.0min: [2024-01-03 01:49:07,190][model8_pretrain.py][INFO] Epoch:[0/2](77800/4588595) loss:3.106 lr:0.0000106 epoch_Time:28370.0min: [2024-01-03 01:49:07,191][model8_pretrain.py][INFO] Epoch:[0/2](77800/4588595) loss:3.474 lr:0.0000106 epoch_Time:28370.0min: [2024-01-03 01:49:44,122][model8_pretrain.py][INFO] Epoch:[0/2](77900/4588595) loss:3.397 lr:0.0000105 epoch_Time:28369.0min: [2024-01-03 01:49:44,123][model8_pretrain.py][INFO] Epoch:[0/2](77900/4588595) loss:3.296 lr:0.0000105 epoch_Time:28370.0min: [2024-01-03 01:49:44,123][model8_pretrain.py][INFO] Epoch:[0/2](77900/4588595) loss:2.968 lr:0.0000105 epoch_Time:28370.0min: [2024-01-03 01:49:44,123][model8_pretrain.py][INFO] Epoch:[0/2](77900/4588595) loss:2.850 lr:0.0000105 epoch_Time:28369.0min: [2024-01-03 01:49:44,123][model8_pretrain.py][INFO] Epoch:[0/2](77900/4588595) loss:3.085 lr:0.0000105 epoch_Time:28370.0min: [2024-01-03 01:49:44,123][model8_pretrain.py][INFO] Epoch:[0/2](77900/4588595) loss:3.243 lr:0.0000105 epoch_Time:28370.0min: [2024-01-03 01:49:44,123][model8_pretrain.py][INFO] Epoch:[0/2](77900/4588595) loss:3.171 lr:0.0000105 epoch_Time:28370.0min: [2024-01-03 01:49:44,123][model8_pretrain.py][INFO] Epoch:[0/2](77900/4588595) loss:3.159 lr:0.0000105 epoch_Time:28370.0min: [2024-01-03 01:50:21,087][model8_pretrain.py][INFO] Epoch:[0/2](78000/4588595) loss:3.169 lr:0.0000105 epoch_Time:28368.0min: [2024-01-03 01:50:21,087][model8_pretrain.py][INFO] Epoch:[0/2](78000/4588595) loss:3.099 lr:0.0000105 
epoch_Time:28368.0min: [2024-01-03 01:50:21,087][model8_pretrain.py][INFO] Epoch:[0/2](78000/4588595) loss:2.973 lr:0.0000105 epoch_Time:28368.0min: [2024-01-03 01:50:21,087][model8_pretrain.py][INFO] Epoch:[0/2](78000/4588595) loss:2.702 lr:0.0000105 epoch_Time:28368.0min: [2024-01-03 01:50:21,087][model8_pretrain.py][INFO] Epoch:[0/2](78000/4588595) loss:2.850 lr:0.0000105 epoch_Time:28368.0min: [2024-01-03 01:50:21,087][model8_pretrain.py][INFO] Epoch:[0/2](78000/4588595) loss:3.198 lr:0.0000105 epoch_Time:28368.0min: [2024-01-03 01:50:21,087][model8_pretrain.py][INFO] Epoch:[0/2](78000/4588595) loss:3.169 lr:0.0000105 epoch_Time:28368.0min: [2024-01-03 01:50:21,087][model8_pretrain.py][INFO] Epoch:[0/2](78000/4588595) loss:3.169 lr:0.0000105 epoch_Time:28368.0min: [2024-01-03 01:50:58,032][model8_pretrain.py][INFO] Epoch:[0/2](78100/4588595) loss:3.056 lr:0.0000104 epoch_Time:28366.0min: [2024-01-03 01:50:58,032][model8_pretrain.py][INFO] Epoch:[0/2](78100/4588595) loss:3.185 lr:0.0000104 epoch_Time:28366.0min: [2024-01-03 01:50:58,032][model8_pretrain.py][INFO] Epoch:[0/2](78100/4588595) loss:2.792 lr:0.0000104 epoch_Time:28366.0min: [2024-01-03 01:50:58,032][model8_pretrain.py][INFO] Epoch:[0/2](78100/4588595) loss:2.938 lr:0.0000104 epoch_Time:28366.0min: [2024-01-03 01:50:58,032][model8_pretrain.py][INFO] Epoch:[0/2](78100/4588595) loss:3.299 lr:0.0000104 epoch_Time:28366.0min: [2024-01-03 01:50:58,032][model8_pretrain.py][INFO] Epoch:[0/2](78100/4588595) loss:3.086 lr:0.0000104 epoch_Time:28366.0min: [2024-01-03 01:50:58,032][model8_pretrain.py][INFO] Epoch:[0/2](78100/4588595) loss:3.224 lr:0.0000104 epoch_Time:28366.0min: [2024-01-03 01:50:58,032][model8_pretrain.py][INFO] Epoch:[0/2](78100/4588595) loss:2.667 lr:0.0000104 epoch_Time:28366.0min: [2024-01-03 01:51:35,005][model8_pretrain.py][INFO] Epoch:[0/2](78200/4588595) loss:2.586 lr:0.0000104 epoch_Time:28365.0min: [2024-01-03 01:51:35,005][model8_pretrain.py][INFO] Epoch:[0/2](78200/4588595) 
loss:3.081 lr:0.0000104 epoch_Time:28365.0min: [2024-01-03 01:51:35,005][model8_pretrain.py][INFO] Epoch:[0/2](78200/4588595) loss:3.093 lr:0.0000104 epoch_Time:28365.0min: [2024-01-03 01:51:35,005][model8_pretrain.py][INFO] Epoch:[0/2](78200/4588595) loss:3.647 lr:0.0000104 epoch_Time:28365.0min: [2024-01-03 01:51:35,005][model8_pretrain.py][INFO] Epoch:[0/2](78200/4588595) loss:2.861 lr:0.0000104 epoch_Time:28365.0min: [2024-01-03 01:51:35,005][model8_pretrain.py][INFO] Epoch:[0/2](78200/4588595) loss:2.858 lr:0.0000104 epoch_Time:28365.0min: [2024-01-03 01:51:35,005][model8_pretrain.py][INFO] Epoch:[0/2](78200/4588595) loss:2.974 lr:0.0000104 epoch_Time:28365.0min: [2024-01-03 01:51:35,006][model8_pretrain.py][INFO] Epoch:[0/2](78200/4588595) loss:3.358 lr:0.0000104 epoch_Time:28365.0min: [2024-01-03 01:52:11,983][model8_pretrain.py][INFO] Epoch:[0/2](78300/4588595) loss:3.105 lr:0.0000103 epoch_Time:28363.0min: [2024-01-03 01:52:11,983][model8_pretrain.py][INFO] Epoch:[0/2](78300/4588595) loss:2.973 lr:0.0000103 epoch_Time:28364.0min: [2024-01-03 01:52:11,983][model8_pretrain.py][INFO] Epoch:[0/2](78300/4588595) loss:3.171 lr:0.0000103 epoch_Time:28363.0min: [2024-01-03 01:52:11,983][model8_pretrain.py][INFO] Epoch:[0/2](78300/4588595) loss:3.221 lr:0.0000103 epoch_Time:28363.0min: [2024-01-03 01:52:11,983][model8_pretrain.py][INFO] Epoch:[0/2](78300/4588595) loss:3.274 lr:0.0000103 epoch_Time:28364.0min: [2024-01-03 01:52:11,983][model8_pretrain.py][INFO] Epoch:[0/2](78300/4588595) loss:2.385 lr:0.0000103 epoch_Time:28363.0min: [2024-01-03 01:52:11,983][model8_pretrain.py][INFO] Epoch:[0/2](78300/4588595) loss:3.393 lr:0.0000103 epoch_Time:28363.0min: [2024-01-03 01:52:11,983][model8_pretrain.py][INFO] Epoch:[0/2](78300/4588595) loss:3.090 lr:0.0000103 epoch_Time:28363.0min: [2024-01-03 01:52:56,076][model8_pretrain.py][INFO] Epoch:[0/2](78400/4588595) loss:3.171 lr:0.0000103 epoch_Time:28369.0min: [2024-01-03 01:52:56,076][model8_pretrain.py][INFO] 
Epoch:[0/2](78400/4588595) loss:3.258 lr:0.0000103 epoch_Time:28369.0min: [2024-01-03 01:52:56,076][model8_pretrain.py][INFO] Epoch:[0/2](78400/4588595) loss:3.008 lr:0.0000103 epoch_Time:28369.0min: [2024-01-03 01:52:56,076][model8_pretrain.py][INFO] Epoch:[0/2](78400/4588595) loss:2.837 lr:0.0000103 epoch_Time:28369.0min: [2024-01-03 01:52:56,076][model8_pretrain.py][INFO] Epoch:[0/2](78400/4588595) loss:2.252 lr:0.0000103 epoch_Time:28369.0min: [2024-01-03 01:52:56,076][model8_pretrain.py][INFO] Epoch:[0/2](78400/4588595) loss:2.527 lr:0.0000103 epoch_Time:28369.0min: [2024-01-03 01:52:56,076][model8_pretrain.py][INFO] Epoch:[0/2](78400/4588595) loss:2.874 lr:0.0000103 epoch_Time:28369.0min: [2024-01-03 01:52:56,076][model8_pretrain.py][INFO] Epoch:[0/2](78400/4588595) loss:2.778 lr:0.0000103 epoch_Time:28369.0min: [2024-01-03 01:53:33,000][model8_pretrain.py][INFO] Epoch:[0/2](78500/4588595) loss:3.173 lr:0.0000103 epoch_Time:28368.0min: [2024-01-03 01:53:33,000][model8_pretrain.py][INFO] Epoch:[0/2](78500/4588595) loss:2.445 lr:0.0000103 epoch_Time:28368.0min: [2024-01-03 01:53:33,000][model8_pretrain.py][INFO] Epoch:[0/2](78500/4588595) loss:2.747 lr:0.0000103 epoch_Time:28368.0min: [2024-01-03 01:53:33,000][model8_pretrain.py][INFO] Epoch:[0/2](78500/4588595) loss:3.145 lr:0.0000103 epoch_Time:28368.0min: [2024-01-03 01:53:33,000][model8_pretrain.py][INFO] Epoch:[0/2](78500/4588595) loss:3.032 lr:0.0000103 epoch_Time:28368.0min: [2024-01-03 01:53:33,000][model8_pretrain.py][INFO] Epoch:[0/2](78500/4588595) loss:2.793 lr:0.0000103 epoch_Time:28368.0min: [2024-01-03 01:53:33,000][model8_pretrain.py][INFO] Epoch:[0/2](78500/4588595) loss:3.150 lr:0.0000103 epoch_Time:28368.0min: [2024-01-03 01:53:33,000][model8_pretrain.py][INFO] Epoch:[0/2](78500/4588595) loss:3.084 lr:0.0000103 epoch_Time:28368.0min: [2024-01-03 01:54:09,931][model8_pretrain.py][INFO] Epoch:[0/2](78600/4588595) loss:2.714 lr:0.0000102 epoch_Time:28366.0min: [2024-01-03 
01:54:09,931][model8_pretrain.py][INFO] Epoch:[0/2](78600/4588595) loss:2.950 lr:0.0000102 epoch_Time:28366.0min: [2024-01-03 01:54:09,931][model8_pretrain.py][INFO] Epoch:[0/2](78600/4588595) loss:2.861 lr:0.0000102 epoch_Time:28366.0min: [2024-01-03 01:54:09,931][model8_pretrain.py][INFO] Epoch:[0/2](78600/4588595) loss:3.314 lr:0.0000102 epoch_Time:28366.0min: [2024-01-03 01:54:09,931][model8_pretrain.py][INFO] Epoch:[0/2](78600/4588595) loss:2.936 lr:0.0000102 epoch_Time:28366.0min: [2024-01-03 01:54:09,931][model8_pretrain.py][INFO] Epoch:[0/2](78600/4588595) loss:2.832 lr:0.0000102 epoch_Time:28366.0min: [2024-01-03 01:54:09,931][model8_pretrain.py][INFO] Epoch:[0/2](78600/4588595) loss:2.788 lr:0.0000102 epoch_Time:28366.0min: [2024-01-03 01:54:09,931][model8_pretrain.py][INFO] Epoch:[0/2](78600/4588595) loss:2.757 lr:0.0000102 epoch_Time:28366.0min: [2024-01-03 01:54:46,873][model8_pretrain.py][INFO] Epoch:[0/2](78700/4588595) loss:2.507 lr:0.0000102 epoch_Time:28365.0min: [2024-01-03 01:54:46,873][model8_pretrain.py][INFO] Epoch:[0/2](78700/4588595) loss:3.379 lr:0.0000102 epoch_Time:28365.0min: [2024-01-03 01:54:46,873][model8_pretrain.py][INFO] Epoch:[0/2](78700/4588595) loss:3.068 lr:0.0000102 epoch_Time:28365.0min: [2024-01-03 01:54:46,873][model8_pretrain.py][INFO] Epoch:[0/2](78700/4588595) loss:3.013 lr:0.0000102 epoch_Time:28365.0min: [2024-01-03 01:54:46,873][model8_pretrain.py][INFO] Epoch:[0/2](78700/4588595) loss:3.032 lr:0.0000102 epoch_Time:28365.0min: [2024-01-03 01:54:46,873][model8_pretrain.py][INFO] Epoch:[0/2](78700/4588595) loss:3.088 lr:0.0000102 epoch_Time:28365.0min: [2024-01-03 01:54:46,873][model8_pretrain.py][INFO] Epoch:[0/2](78700/4588595) loss:2.754 lr:0.0000102 epoch_Time:28365.0min: [2024-01-03 01:54:46,873][model8_pretrain.py][INFO] Epoch:[0/2](78700/4588595) loss:2.622 lr:0.0000102 epoch_Time:28365.0min: [2024-01-03 01:55:23,818][model8_pretrain.py][INFO] Epoch:[0/2](78800/4588595) loss:3.195 lr:0.0000102 
epoch_Time:28364.0min: [2024-01-03 01:55:23,818][model8_pretrain.py][INFO] Epoch:[0/2](78800/4588595) loss:3.089 lr:0.0000102 epoch_Time:28364.0min: [2024-01-03 01:55:23,818][model8_pretrain.py][INFO] Epoch:[0/2](78800/4588595) loss:3.069 lr:0.0000102 epoch_Time:28364.0min: [2024-01-03 01:55:23,818][model8_pretrain.py][INFO] Epoch:[0/2](78800/4588595) loss:2.963 lr:0.0000102 epoch_Time:28364.0min: [2024-01-03 01:55:23,818][model8_pretrain.py][INFO] Epoch:[0/2](78800/4588595) loss:3.005 lr:0.0000102 epoch_Time:28364.0min: [2024-01-03 01:55:23,818][model8_pretrain.py][INFO] Epoch:[0/2](78800/4588595) loss:3.261 lr:0.0000102 epoch_Time:28364.0min: [2024-01-03 01:55:23,818][model8_pretrain.py][INFO] Epoch:[0/2](78800/4588595) loss:3.024 lr:0.0000102 epoch_Time:28364.0min: [2024-01-03 01:55:23,818][model8_pretrain.py][INFO] Epoch:[0/2](78800/4588595) loss:3.098 lr:0.0000102 epoch_Time:28364.0min: [2024-01-03 01:56:00,753][model8_pretrain.py][INFO] Epoch:[0/2](78900/4588595) loss:3.291 lr:0.0000101 epoch_Time:28362.0min: [2024-01-03 01:56:00,753][model8_pretrain.py][INFO] Epoch:[0/2](78900/4588595) loss:2.306 lr:0.0000101 epoch_Time:28362.0min: [2024-01-03 01:56:00,753][model8_pretrain.py][INFO] Epoch:[0/2](78900/4588595) loss:3.256 lr:0.0000101 epoch_Time:28362.0min: [2024-01-03 01:56:00,753][model8_pretrain.py][INFO] Epoch:[0/2](78900/4588595) loss:3.003 lr:0.0000101 epoch_Time:28362.0min: [2024-01-03 01:56:00,753][model8_pretrain.py][INFO] Epoch:[0/2](78900/4588595) loss:2.875 lr:0.0000101 epoch_Time:28362.0min: [2024-01-03 01:56:00,753][model8_pretrain.py][INFO] Epoch:[0/2](78900/4588595) loss:2.783 lr:0.0000101 epoch_Time:28362.0min: [2024-01-03 01:56:00,753][model8_pretrain.py][INFO] Epoch:[0/2](78900/4588595) loss:2.985 lr:0.0000101 epoch_Time:28362.0min: [2024-01-03 01:56:00,753][model8_pretrain.py][INFO] Epoch:[0/2](78900/4588595) loss:2.721 lr:0.0000101 epoch_Time:28362.0min: [2024-01-03 01:56:37,735][model8_pretrain.py][INFO] Epoch:[0/2](79000/4588595) 
loss:3.300 lr:0.0000101 epoch_Time:28361.0min: [2024-01-03 01:56:37,735][model8_pretrain.py][INFO] Epoch:[0/2](79000/4588595) loss:2.824 lr:0.0000101 epoch_Time:28361.0min: [2024-01-03 01:56:37,735][model8_pretrain.py][INFO] Epoch:[0/2](79000/4588595) loss:2.839 lr:0.0000101 epoch_Time:28361.0min: [2024-01-03 01:56:37,735][model8_pretrain.py][INFO] Epoch:[0/2](79000/4588595) loss:2.454 lr:0.0000101 epoch_Time:28361.0min: [2024-01-03 01:56:37,736][model8_pretrain.py][INFO] Epoch:[0/2](79000/4588595) loss:3.197 lr:0.0000101 epoch_Time:28361.0min: [2024-01-03 01:56:37,736][model8_pretrain.py][INFO] Epoch:[0/2](79000/4588595) loss:3.185 lr:0.0000101 epoch_Time:28361.0min: [2024-01-03 01:56:37,736][model8_pretrain.py][INFO] Epoch:[0/2](79000/4588595) loss:3.018 lr:0.0000101 epoch_Time:28361.0min: [2024-01-03 01:56:37,736][model8_pretrain.py][INFO] Epoch:[0/2](79000/4588595) loss:2.708 lr:0.0000101 epoch_Time:28361.0min: [2024-01-03 01:57:14,675][model8_pretrain.py][INFO] Epoch:[0/2](79100/4588595) loss:3.041 lr:0.0000101 epoch_Time:28359.0min: [2024-01-03 01:57:14,675][model8_pretrain.py][INFO] Epoch:[0/2](79100/4588595) loss:2.854 lr:0.0000101 epoch_Time:28359.0min: [2024-01-03 01:57:14,675][model8_pretrain.py][INFO] Epoch:[0/2](79100/4588595) loss:3.513 lr:0.0000101 epoch_Time:28359.0min: [2024-01-03 01:57:14,675][model8_pretrain.py][INFO] Epoch:[0/2](79100/4588595) loss:2.904 lr:0.0000101 epoch_Time:28359.0min: [2024-01-03 01:57:14,675][model8_pretrain.py][INFO] Epoch:[0/2](79100/4588595) loss:3.130 lr:0.0000101 epoch_Time:28359.0min: [2024-01-03 01:57:14,675][model8_pretrain.py][INFO] Epoch:[0/2](79100/4588595) loss:3.205 lr:0.0000101 epoch_Time:28359.0min: [2024-01-03 01:57:14,676][model8_pretrain.py][INFO] Epoch:[0/2](79100/4588595) loss:3.453 lr:0.0000101 epoch_Time:28359.0min: [2024-01-03 01:57:14,676][model8_pretrain.py][INFO] Epoch:[0/2](79100/4588595) loss:3.013 lr:0.0000101 epoch_Time:28359.0min: [2024-01-03 01:57:58,857][model8_pretrain.py][INFO] 
Epoch:[0/2](79200/4588595) loss:3.085 lr:0.0000101 epoch_Time:28365.0min: [2024-01-03 01:57:58,857][model8_pretrain.py][INFO] Epoch:[0/2](79200/4588595) loss:2.877 lr:0.0000101 epoch_Time:28365.0min: [2024-01-03 01:57:58,857][model8_pretrain.py][INFO] Epoch:[0/2](79200/4588595) loss:3.096 lr:0.0000101 epoch_Time:28365.0min: [2024-01-03 01:57:58,857][model8_pretrain.py][INFO] Epoch:[0/2](79200/4588595) loss:2.986 lr:0.0000101 epoch_Time:28365.0min: [2024-01-03 01:57:58,857][model8_pretrain.py][INFO] Epoch:[0/2](79200/4588595) loss:3.168 lr:0.0000101 epoch_Time:28365.0min: [2024-01-03 01:57:58,857][model8_pretrain.py][INFO] Epoch:[0/2](79200/4588595) loss:3.089 lr:0.0000101 epoch_Time:28365.0min: [2024-01-03 01:57:58,857][model8_pretrain.py][INFO] Epoch:[0/2](79200/4588595) loss:2.725 lr:0.0000101 epoch_Time:28365.0min: [2024-01-03 01:57:58,858][model8_pretrain.py][INFO] Epoch:[0/2](79200/4588595) loss:3.234 lr:0.0000101 epoch_Time:28365.0min: [2024-01-03 01:58:35,787][model8_pretrain.py][INFO] Epoch:[0/2](79300/4588595) loss:2.900 lr:0.0000101 epoch_Time:28364.0min: [2024-01-03 01:58:35,787][model8_pretrain.py][INFO] Epoch:[0/2](79300/4588595) loss:2.437 lr:0.0000101 epoch_Time:28364.0min: [2024-01-03 01:58:35,787][model8_pretrain.py][INFO] Epoch:[0/2](79300/4588595) loss:3.012 lr:0.0000101 epoch_Time:28364.0min: [2024-01-03 01:58:35,787][model8_pretrain.py][INFO] Epoch:[0/2](79300/4588595) loss:3.300 lr:0.0000101 epoch_Time:28364.0min: [2024-01-03 01:58:35,787][model8_pretrain.py][INFO] Epoch:[0/2](79300/4588595) loss:2.958 lr:0.0000101 epoch_Time:28364.0min: [2024-01-03 01:58:35,787][model8_pretrain.py][INFO] Epoch:[0/2](79300/4588595) loss:2.806 lr:0.0000101 epoch_Time:28364.0min: [2024-01-03 01:58:35,787][model8_pretrain.py][INFO] Epoch:[0/2](79300/4588595) loss:3.136 lr:0.0000101 epoch_Time:28364.0min: [2024-01-03 01:58:35,787][model8_pretrain.py][INFO] Epoch:[0/2](79300/4588595) loss:3.177 lr:0.0000101 epoch_Time:28364.0min: [2024-01-03 
01:59:12,724][model8_pretrain.py][INFO] Epoch:[0/2](79400/4588595) loss:3.181 lr:0.0000100 epoch_Time:28362.0min: [2024-01-03 01:59:12,724][model8_pretrain.py][INFO] Epoch:[0/2](79400/4588595) loss:2.926 lr:0.0000100 epoch_Time:28362.0min: [2024-01-03 01:59:12,725][model8_pretrain.py][INFO] Epoch:[0/2](79400/4588595) loss:2.630 lr:0.0000100 epoch_Time:28362.0min: [2024-01-03 01:59:12,725][model8_pretrain.py][INFO] Epoch:[0/2](79400/4588595) loss:3.512 lr:0.0000100 epoch_Time:28362.0min: [2024-01-03 01:59:12,725][model8_pretrain.py][INFO] Epoch:[0/2](79400/4588595) loss:2.873 lr:0.0000100 epoch_Time:28362.0min: [2024-01-03 01:59:12,725][model8_pretrain.py][INFO] Epoch:[0/2](79400/4588595) loss:2.897 lr:0.0000100 epoch_Time:28362.0min: [2024-01-03 01:59:12,725][model8_pretrain.py][INFO] Epoch:[0/2](79400/4588595) loss:2.548 lr:0.0000100 epoch_Time:28362.0min: [2024-01-03 01:59:12,725][model8_pretrain.py][INFO] Epoch:[0/2](79400/4588595) loss:2.400 lr:0.0000100 epoch_Time:28362.0min: [2024-01-03 01:59:49,669][model8_pretrain.py][INFO] Epoch:[0/2](79500/4588595) loss:2.691 lr:0.0000100 epoch_Time:28360.0min: [2024-01-03 01:59:49,669][model8_pretrain.py][INFO] Epoch:[0/2](79500/4588595) loss:2.736 lr:0.0000100 epoch_Time:28360.0min: [2024-01-03 01:59:49,669][model8_pretrain.py][INFO] Epoch:[0/2](79500/4588595) loss:2.530 lr:0.0000100 epoch_Time:28360.0min: [2024-01-03 01:59:49,669][model8_pretrain.py][INFO] Epoch:[0/2](79500/4588595) loss:3.194 lr:0.0000100 epoch_Time:28360.0min: [2024-01-03 01:59:49,669][model8_pretrain.py][INFO] Epoch:[0/2](79500/4588595) loss:3.090 lr:0.0000100 epoch_Time:28360.0min: [2024-01-03 01:59:49,669][model8_pretrain.py][INFO] Epoch:[0/2](79500/4588595) loss:3.405 lr:0.0000100 epoch_Time:28360.0min: [2024-01-03 01:59:49,669][model8_pretrain.py][INFO] Epoch:[0/2](79500/4588595) loss:2.624 lr:0.0000100 epoch_Time:28360.0min: [2024-01-03 01:59:49,669][model8_pretrain.py][INFO] Epoch:[0/2](79500/4588595) loss:3.069 lr:0.0000100 
epoch_Time:28360.0min: [2024-01-03 02:00:26,617][model8_pretrain.py][INFO] Epoch:[0/2](79600/4588595) loss:2.897 lr:0.0000100 epoch_Time:28359.0min: [2024-01-03 02:00:26,617][model8_pretrain.py][INFO] Epoch:[0/2](79600/4588595) loss:3.026 lr:0.0000100 epoch_Time:28359.0min: [2024-01-03 02:00:26,617][model8_pretrain.py][INFO] Epoch:[0/2](79600/4588595) loss:3.402 lr:0.0000100 epoch_Time:28359.0min: [2024-01-03 02:00:26,617][model8_pretrain.py][INFO] Epoch:[0/2](79600/4588595) loss:2.857 lr:0.0000100 epoch_Time:28359.0min: [2024-01-03 02:00:26,617][model8_pretrain.py][INFO] Epoch:[0/2](79600/4588595) loss:3.211 lr:0.0000100 epoch_Time:28359.0min: [2024-01-03 02:00:26,617][model8_pretrain.py][INFO] Epoch:[0/2](79600/4588595) loss:3.012 lr:0.0000100 epoch_Time:28359.0min: [2024-01-03 02:00:26,618][model8_pretrain.py][INFO] Epoch:[0/2](79600/4588595) loss:2.767 lr:0.0000100 epoch_Time:28359.0min: [2024-01-03 02:00:26,618][model8_pretrain.py][INFO] Epoch:[0/2](79600/4588595) loss:2.927 lr:0.0000100 epoch_Time:28359.0min: [2024-01-03 02:01:03,561][model8_pretrain.py][INFO] Epoch:[0/2](79700/4588595) loss:2.102 lr:0.0000100 epoch_Time:28358.0min: [2024-01-03 02:01:03,561][model8_pretrain.py][INFO] Epoch:[0/2](79700/4588595) loss:2.901 lr:0.0000100 epoch_Time:28358.0min: [2024-01-03 02:01:03,561][model8_pretrain.py][INFO] Epoch:[0/2](79700/4588595) loss:3.485 lr:0.0000100 epoch_Time:28358.0min: [2024-01-03 02:01:03,561][model8_pretrain.py][INFO] Epoch:[0/2](79700/4588595) loss:3.053 lr:0.0000100 epoch_Time:28358.0min: [2024-01-03 02:01:03,561][model8_pretrain.py][INFO] Epoch:[0/2](79700/4588595) loss:3.010 lr:0.0000100 epoch_Time:28358.0min: [2024-01-03 02:01:03,561][model8_pretrain.py][INFO] Epoch:[0/2](79700/4588595) loss:2.712 lr:0.0000100 epoch_Time:28358.0min: [2024-01-03 02:01:03,561][model8_pretrain.py][INFO] Epoch:[0/2](79700/4588595) loss:2.844 lr:0.0000100 epoch_Time:28358.0min: [2024-01-03 02:01:03,561][model8_pretrain.py][INFO] Epoch:[0/2](79700/4588595) 
loss:3.106 lr:0.0000100 epoch_Time:28358.0min: [2024-01-03 02:01:40,496][model8_pretrain.py][INFO] Epoch:[0/2](79800/4588595) loss:3.271 lr:0.0000100 epoch_Time:28357.0min: [2024-01-03 02:01:40,496][model8_pretrain.py][INFO] Epoch:[0/2](79800/4588595) loss:3.392 lr:0.0000100 epoch_Time:28357.0min: [2024-01-03 02:01:40,496][model8_pretrain.py][INFO] Epoch:[0/2](79800/4588595) loss:3.526 lr:0.0000100 epoch_Time:28357.0min: [2024-01-03 02:01:40,496][model8_pretrain.py][INFO] Epoch:[0/2](79800/4588595) loss:3.176 lr:0.0000100 epoch_Time:28357.0min: [2024-01-03 02:01:40,496][model8_pretrain.py][INFO] Epoch:[0/2](79800/4588595) loss:3.185 lr:0.0000100 epoch_Time:28357.0min: [2024-01-03 02:01:40,496][model8_pretrain.py][INFO] Epoch:[0/2](79800/4588595) loss:3.286 lr:0.0000100 epoch_Time:28357.0min: [2024-01-03 02:01:40,496][model8_pretrain.py][INFO] Epoch:[0/2](79800/4588595) loss:3.264 lr:0.0000100 epoch_Time:28357.0min: [2024-01-03 02:01:40,496][model8_pretrain.py][INFO] Epoch:[0/2](79800/4588595) loss:2.586 lr:0.0000100 epoch_Time:28357.0min: [2024-01-03 02:02:17,430][model8_pretrain.py][INFO] Epoch:[0/2](79900/4588595) loss:3.586 lr:0.0000100 epoch_Time:28355.0min: [2024-01-03 02:02:17,430][model8_pretrain.py][INFO] Epoch:[0/2](79900/4588595) loss:3.101 lr:0.0000100 epoch_Time:28355.0min: [2024-01-03 02:02:17,430][model8_pretrain.py][INFO] Epoch:[0/2](79900/4588595) loss:2.800 lr:0.0000100 epoch_Time:28355.0min: [2024-01-03 02:02:17,430][model8_pretrain.py][INFO] Epoch:[0/2](79900/4588595) loss:3.295 lr:0.0000100 epoch_Time:28355.0min: [2024-01-03 02:02:17,430][model8_pretrain.py][INFO] Epoch:[0/2](79900/4588595) loss:2.786 lr:0.0000100 epoch_Time:28355.0min: [2024-01-03 02:02:17,430][model8_pretrain.py][INFO] Epoch:[0/2](79900/4588595) loss:3.201 lr:0.0000100 epoch_Time:28355.0min: [2024-01-03 02:02:17,430][model8_pretrain.py][INFO] Epoch:[0/2](79900/4588595) loss:2.493 lr:0.0000100 epoch_Time:28355.0min: [2024-01-03 02:02:17,431][model8_pretrain.py][INFO] 
Epoch:[0/2](79900/4588595) loss:3.376 lr:0.0000100 epoch_Time:28355.0min: [2024-01-03 02:03:01,476][model8_pretrain.py][INFO] Epoch:[0/2](80000/4588595) loss:2.900 lr:0.0000100 epoch_Time:28360.0min: [2024-01-03 02:03:01,476][model8_pretrain.py][INFO] Epoch:[0/2](80000/4588595) loss:2.999 lr:0.0000100 epoch_Time:28360.0min: [2024-01-03 02:03:01,476][model8_pretrain.py][INFO] Epoch:[0/2](80000/4588595) loss:2.668 lr:0.0000100 epoch_Time:28360.0min: [2024-01-03 02:03:01,476][model8_pretrain.py][INFO] Epoch:[0/2](80000/4588595) loss:2.949 lr:0.0000100 epoch_Time:28360.0min: [2024-01-03 02:03:01,476][model8_pretrain.py][INFO] Epoch:[0/2](80000/4588595) loss:2.884 lr:0.0000100 epoch_Time:28360.0min: [2024-01-03 02:03:01,476][model8_pretrain.py][INFO] Epoch:[0/2](80000/4588595) loss:3.118 lr:0.0000100 epoch_Time:28360.0min: [2024-01-03 02:03:01,476][model8_pretrain.py][INFO] Epoch:[0/2](80000/4588595) loss:3.340 lr:0.0000100 epoch_Time:28360.0min: [2024-01-03 02:03:01,476][model8_pretrain.py][INFO] Epoch:[0/2](80000/4588595) loss:3.242 lr:0.0000100 epoch_Time:28360.0min: [2024-01-03 02:03:38,455][model8_pretrain.py][INFO] Epoch:[0/2](80100/4588595) loss:3.252 lr:0.0000100 epoch_Time:28359.0min: [2024-01-03 02:03:38,455][model8_pretrain.py][INFO] Epoch:[0/2](80100/4588595) loss:2.791 lr:0.0000100 epoch_Time:28359.0min: [2024-01-03 02:03:38,455][model8_pretrain.py][INFO] Epoch:[0/2](80100/4588595) loss:2.789 lr:0.0000100 epoch_Time:28359.0min: [2024-01-03 02:03:38,455][model8_pretrain.py][INFO] Epoch:[0/2](80100/4588595) loss:3.246 lr:0.0000100 epoch_Time:28359.0min: [2024-01-03 02:03:38,455][model8_pretrain.py][INFO] Epoch:[0/2](80100/4588595) loss:2.555 lr:0.0000100 epoch_Time:28359.0min: [2024-01-03 02:03:38,455][model8_pretrain.py][INFO] Epoch:[0/2](80100/4588595) loss:2.814 lr:0.0000100 epoch_Time:28359.0min: [2024-01-03 02:03:38,456][model8_pretrain.py][INFO] Epoch:[0/2](80100/4588595) loss:2.789 lr:0.0000100 epoch_Time:28359.0min: [2024-01-03 
02:03:38,456][model8_pretrain.py][INFO] Epoch:[0/2](80100/4588595) loss:3.321 lr:0.0000100 epoch_Time:28359.0min: [2024-01-03 02:04:15,391][model8_pretrain.py][INFO] Epoch:[0/2](80200/4588595) loss:3.223 lr:0.0000100 epoch_Time:28358.0min: [2024-01-03 02:04:15,391][model8_pretrain.py][INFO] Epoch:[0/2](80200/4588595) loss:2.911 lr:0.0000100 epoch_Time:28358.0min: [2024-01-03 02:04:15,391][model8_pretrain.py][INFO] Epoch:[0/2](80200/4588595) loss:2.585 lr:0.0000100 epoch_Time:28358.0min: [2024-01-03 02:04:15,391][model8_pretrain.py][INFO] Epoch:[0/2](80200/4588595) loss:3.305 lr:0.0000100 epoch_Time:28358.0min: [2024-01-03 02:04:15,391][model8_pretrain.py][INFO] Epoch:[0/2](80200/4588595) loss:3.215 lr:0.0000100 epoch_Time:28358.0min: [2024-01-03 02:04:15,391][model8_pretrain.py][INFO] Epoch:[0/2](80200/4588595) loss:2.979 lr:0.0000100 epoch_Time:28358.0min: [2024-01-03 02:04:15,391][model8_pretrain.py][INFO] Epoch:[0/2](80200/4588595) loss:3.081 lr:0.0000100 epoch_Time:28358.0min: [2024-01-03 02:04:15,392][model8_pretrain.py][INFO] Epoch:[0/2](80200/4588595) loss:3.049 lr:0.0000100 epoch_Time:28358.0min: [2024-01-03 02:04:52,381][model8_pretrain.py][INFO] Epoch:[0/2](80300/4588595) loss:2.652 lr:0.0000100 epoch_Time:28356.0min: [2024-01-03 02:04:52,381][model8_pretrain.py][INFO] Epoch:[0/2](80300/4588595) loss:2.707 lr:0.0000100 epoch_Time:28356.0min: [2024-01-03 02:04:52,381][model8_pretrain.py][INFO] Epoch:[0/2](80300/4588595) loss:3.333 lr:0.0000100 epoch_Time:28356.0min: [2024-01-03 02:04:52,381][model8_pretrain.py][INFO] Epoch:[0/2](80300/4588595) loss:2.729 lr:0.0000100 epoch_Time:28356.0min: [2024-01-03 02:04:52,381][model8_pretrain.py][INFO] Epoch:[0/2](80300/4588595) loss:3.116 lr:0.0000100 epoch_Time:28356.0min: [2024-01-03 02:04:52,381][model8_pretrain.py][INFO] Epoch:[0/2](80300/4588595) loss:3.391 lr:0.0000100 epoch_Time:28356.0min: [2024-01-03 02:04:52,381][model8_pretrain.py][INFO] Epoch:[0/2](80300/4588595) loss:2.627 lr:0.0000100 
epoch_Time:28356.0min: [2024-01-03 02:04:52,381][model8_pretrain.py][INFO] Epoch:[0/2](80300/4588595) loss:3.023 lr:0.0000100 epoch_Time:28356.0min: [2024-01-03 02:05:29,343][model8_pretrain.py][INFO] Epoch:[0/2](80400/4588595) loss:3.494 lr:0.0000100 epoch_Time:28355.0min: [2024-01-03 02:05:29,343][model8_pretrain.py][INFO] Epoch:[0/2](80400/4588595) loss:2.931 lr:0.0000100 epoch_Time:28355.0min: [2024-01-03 02:05:29,343][model8_pretrain.py][INFO] Epoch:[0/2](80400/4588595) loss:3.433 lr:0.0000100 epoch_Time:28355.0min: [2024-01-03 02:05:29,343][model8_pretrain.py][INFO] Epoch:[0/2](80400/4588595) loss:3.010 lr:0.0000100 epoch_Time:28355.0min: [2024-01-03 02:05:29,343][model8_pretrain.py][INFO] Epoch:[0/2](80400/4588595) loss:3.191 lr:0.0000100 epoch_Time:28355.0min: [2024-01-03 02:05:29,343][model8_pretrain.py][INFO] Epoch:[0/2](80400/4588595) loss:3.281 lr:0.0000100 epoch_Time:28355.0min: [2024-01-03 02:05:29,343][model8_pretrain.py][INFO] Epoch:[0/2](80400/4588595) loss:3.061 lr:0.0000100 epoch_Time:28355.0min: [2024-01-03 02:05:29,343][model8_pretrain.py][INFO] Epoch:[0/2](80400/4588595) loss:3.139 lr:0.0000100 epoch_Time:28355.0min: [2024-01-03 02:06:06,356][model8_pretrain.py][INFO] Epoch:[0/2](80500/4588595) loss:3.289 lr:0.0000100 epoch_Time:28354.0min: [2024-01-03 02:06:06,356][model8_pretrain.py][INFO] Epoch:[0/2](80500/4588595) loss:3.156 lr:0.0000100 epoch_Time:28354.0min: [2024-01-03 02:06:06,356][model8_pretrain.py][INFO] Epoch:[0/2](80500/4588595) loss:3.116 lr:0.0000100 epoch_Time:28354.0min: [2024-01-03 02:06:06,356][model8_pretrain.py][INFO] Epoch:[0/2](80500/4588595) loss:2.574 lr:0.0000100 epoch_Time:28354.0min: [2024-01-03 02:06:06,356][model8_pretrain.py][INFO] Epoch:[0/2](80500/4588595) loss:2.705 lr:0.0000100 epoch_Time:28354.0min: [2024-01-03 02:06:06,356][model8_pretrain.py][INFO] Epoch:[0/2](80500/4588595) loss:2.869 lr:0.0000100 epoch_Time:28354.0min: [2024-01-03 02:06:06,356][model8_pretrain.py][INFO] Epoch:[0/2](80500/4588595) 
loss:3.226 lr:0.0000100 epoch_Time:28354.0min: [2024-01-03 02:06:06,356][model8_pretrain.py][INFO] Epoch:[0/2](80500/4588595) loss:2.716 lr:0.0000100 epoch_Time:28354.0min: [2024-01-03 02:06:43,271][model8_pretrain.py][INFO] Epoch:[0/2](80600/4588595) loss:2.927 lr:0.0000100 epoch_Time:28353.0min: [2024-01-03 02:06:43,271][model8_pretrain.py][INFO] Epoch:[0/2](80600/4588595) loss:2.854 lr:0.0000100 epoch_Time:28353.0min: [2024-01-03 02:06:43,271][model8_pretrain.py][INFO] Epoch:[0/2](80600/4588595) loss:2.885 lr:0.0000100 epoch_Time:28353.0min: [2024-01-03 02:06:43,271][model8_pretrain.py][INFO] Epoch:[0/2](80600/4588595) loss:3.077 lr:0.0000100 epoch_Time:28353.0min: [2024-01-03 02:06:43,271][model8_pretrain.py][INFO] Epoch:[0/2](80600/4588595) loss:3.521 lr:0.0000100 epoch_Time:28353.0min: [2024-01-03 02:06:43,271][model8_pretrain.py][INFO] Epoch:[0/2](80600/4588595) loss:2.583 lr:0.0000100 epoch_Time:28353.0min: [2024-01-03 02:06:43,272][model8_pretrain.py][INFO] Epoch:[0/2](80600/4588595) loss:2.735 lr:0.0000100 epoch_Time:28353.0min: [2024-01-03 02:06:43,272][model8_pretrain.py][INFO] Epoch:[0/2](80600/4588595) loss:2.977 lr:0.0000100 epoch_Time:28353.0min: [2024-01-03 02:07:20,211][model8_pretrain.py][INFO] Epoch:[0/2](80700/4588595) loss:2.908 lr:0.0000100 epoch_Time:28351.0min: [2024-01-03 02:07:20,211][model8_pretrain.py][INFO] Epoch:[0/2](80700/4588595) loss:2.813 lr:0.0000100 epoch_Time:28351.0min: [2024-01-03 02:07:20,211][model8_pretrain.py][INFO] Epoch:[0/2](80700/4588595) loss:3.251 lr:0.0000100 epoch_Time:28351.0min: [2024-01-03 02:07:20,211][model8_pretrain.py][INFO] Epoch:[0/2](80700/4588595) loss:3.137 lr:0.0000100 epoch_Time:28351.0min: [2024-01-03 02:07:20,211][model8_pretrain.py][INFO] Epoch:[0/2](80700/4588595) loss:3.059 lr:0.0000100 epoch_Time:28351.0min: [2024-01-03 02:07:20,211][model8_pretrain.py][INFO] Epoch:[0/2](80700/4588595) loss:3.202 lr:0.0000100 epoch_Time:28351.0min: [2024-01-03 02:07:20,211][model8_pretrain.py][INFO] 
Epoch:[0/2](80700/4588595) loss:2.754 lr:0.0000100 epoch_Time:28351.0min: [2024-01-03 02:07:20,212][model8_pretrain.py][INFO] Epoch:[0/2](80700/4588595) loss:2.912 lr:0.0000100 epoch_Time:28351.0min: [2024-01-03 02:08:04,160][model8_pretrain.py][INFO] Epoch:[0/2](80800/4588595) loss:2.749 lr:0.0000100 epoch_Time:28356.0min: [2024-01-03 02:08:04,160][model8_pretrain.py][INFO] Epoch:[0/2](80800/4588595) loss:2.751 lr:0.0000100 epoch_Time:28356.0min: [2024-01-03 02:08:04,160][model8_pretrain.py][INFO] Epoch:[0/2](80800/4588595) loss:3.044 lr:0.0000100 epoch_Time:28356.0min: [2024-01-03 02:08:04,160][model8_pretrain.py][INFO] Epoch:[0/2](80800/4588595) loss:2.675 lr:0.0000100 epoch_Time:28356.0min: [2024-01-03 02:08:04,160][model8_pretrain.py][INFO] Epoch:[0/2](80800/4588595) loss:3.252 lr:0.0000100 epoch_Time:28356.0min: [2024-01-03 02:08:04,160][model8_pretrain.py][INFO] Epoch:[0/2](80800/4588595) loss:3.174 lr:0.0000100 epoch_Time:28356.0min: [2024-01-03 02:08:04,160][model8_pretrain.py][INFO] Epoch:[0/2](80800/4588595) loss:2.729 lr:0.0000100 epoch_Time:28356.0min: [2024-01-03 02:08:04,160][model8_pretrain.py][INFO] Epoch:[0/2](80800/4588595) loss:2.701 lr:0.0000100 epoch_Time:28356.0min: [2024-01-03 02:08:41,086][model8_pretrain.py][INFO] Epoch:[0/2](80900/4588595) loss:3.337 lr:0.0000100 epoch_Time:28355.0min: [2024-01-03 02:08:41,086][model8_pretrain.py][INFO] Epoch:[0/2](80900/4588595) loss:2.662 lr:0.0000100 epoch_Time:28355.0min: [2024-01-03 02:08:41,086][model8_pretrain.py][INFO] Epoch:[0/2](80900/4588595) loss:2.905 lr:0.0000100 epoch_Time:28355.0min: [2024-01-03 02:08:41,086][model8_pretrain.py][INFO] Epoch:[0/2](80900/4588595) loss:3.493 lr:0.0000100 epoch_Time:28355.0min: [2024-01-03 02:08:41,086][model8_pretrain.py][INFO] Epoch:[0/2](80900/4588595) loss:2.531 lr:0.0000100 epoch_Time:28355.0min: [2024-01-03 02:08:41,086][model8_pretrain.py][INFO] Epoch:[0/2](80900/4588595) loss:3.143 lr:0.0000100 epoch_Time:28355.0min: [2024-01-03 
02:08:41,086][model8_pretrain.py][INFO] Epoch:[0/2](80900/4588595) loss:2.552 lr:0.0000100 epoch_Time:28355.0min: [2024-01-03 02:08:41,087][model8_pretrain.py][INFO] Epoch:[0/2](80900/4588595) loss:3.492 lr:0.0000100 epoch_Time:28355.0min: [2024-01-03 02:09:18,022][model8_pretrain.py][INFO] Epoch:[0/2](81000/4588595) loss:3.154 lr:0.0000100 epoch_Time:28353.0min: [2024-01-03 02:09:18,022][model8_pretrain.py][INFO] Epoch:[0/2](81000/4588595) loss:2.479 lr:0.0000100 epoch_Time:28353.0min: [2024-01-03 02:09:18,022][model8_pretrain.py][INFO] Epoch:[0/2](81000/4588595) loss:2.831 lr:0.0000100 epoch_Time:28353.0min: [2024-01-03 02:09:18,022][model8_pretrain.py][INFO] Epoch:[0/2](81000/4588595) loss:3.283 lr:0.0000100 epoch_Time:28353.0min: [2024-01-03 02:09:18,022][model8_pretrain.py][INFO] Epoch:[0/2](81000/4588595) loss:3.248 lr:0.0000100 epoch_Time:28353.0min: [2024-01-03 02:09:18,022][model8_pretrain.py][INFO] Epoch:[0/2](81000/4588595) loss:3.372 lr:0.0000100 epoch_Time:28353.0min: [2024-01-03 02:09:18,022][model8_pretrain.py][INFO] Epoch:[0/2](81000/4588595) loss:2.463 lr:0.0000100 epoch_Time:28353.0min: [2024-01-03 02:09:18,023][model8_pretrain.py][INFO] Epoch:[0/2](81000/4588595) loss:3.155 lr:0.0000100 epoch_Time:28353.0min: [2024-01-03 02:09:54,955][model8_pretrain.py][INFO] Epoch:[0/2](81100/4588595) loss:2.384 lr:0.0000100 epoch_Time:28352.0min: [2024-01-03 02:09:54,955][model8_pretrain.py][INFO] Epoch:[0/2](81100/4588595) loss:3.190 lr:0.0000100 epoch_Time:28352.0min: [2024-01-03 02:09:54,955][model8_pretrain.py][INFO] Epoch:[0/2](81100/4588595) loss:3.246 lr:0.0000100 epoch_Time:28352.0min: [2024-01-03 02:09:54,955][model8_pretrain.py][INFO] Epoch:[0/2](81100/4588595) loss:2.853 lr:0.0000100 epoch_Time:28352.0min: [2024-01-03 02:09:54,955][model8_pretrain.py][INFO] Epoch:[0/2](81100/4588595) loss:3.106 lr:0.0000100 epoch_Time:28352.0min: [2024-01-03 02:09:54,956][model8_pretrain.py][INFO] Epoch:[0/2](81100/4588595) loss:3.240 lr:0.0000100 
epoch_Time:28352.0min: [2024-01-03 02:09:54,956][model8_pretrain.py][INFO] Epoch:[0/2](81100/4588595) loss:2.650 lr:0.0000100 epoch_Time:28352.0min: [2024-01-03 02:09:54,957][model8_pretrain.py][INFO] Epoch:[0/2](81100/4588595) loss:3.313 lr:0.0000100 epoch_Time:28352.0min: [2024-01-03 02:10:31,893][model8_pretrain.py][INFO] Epoch:[0/2](81200/4588595) loss:2.799 lr:0.0000100 epoch_Time:28351.0min: [2024-01-03 02:10:31,893][model8_pretrain.py][INFO] Epoch:[0/2](81200/4588595) loss:2.791 lr:0.0000100 epoch_Time:28351.0min: [2024-01-03 02:10:31,893][model8_pretrain.py][INFO] Epoch:[0/2](81200/4588595) loss:3.491 lr:0.0000100 epoch_Time:28351.0min: [2024-01-03 02:10:31,893][model8_pretrain.py][INFO] Epoch:[0/2](81200/4588595) loss:3.279 lr:0.0000100 epoch_Time:28351.0min: [2024-01-03 02:10:31,893][model8_pretrain.py][INFO] Epoch:[0/2](81200/4588595) loss:3.396 lr:0.0000100 epoch_Time:28351.0min: [2024-01-03 02:10:31,893][model8_pretrain.py][INFO] Epoch:[0/2](81200/4588595) loss:3.083 lr:0.0000100 epoch_Time:28351.0min: [2024-01-03 02:10:31,893][model8_pretrain.py][INFO] Epoch:[0/2](81200/4588595) loss:3.283 lr:0.0000100 epoch_Time:28351.0min: [2024-01-03 02:10:31,893][model8_pretrain.py][INFO] Epoch:[0/2](81200/4588595) loss:3.149 lr:0.0000100 epoch_Time:28351.0min: [2024-01-03 02:11:08,826][model8_pretrain.py][INFO] Epoch:[0/2](81300/4588595) loss:2.943 lr:0.0000100 epoch_Time:28349.0min: [2024-01-03 02:11:08,826][model8_pretrain.py][INFO] Epoch:[0/2](81300/4588595) loss:3.199 lr:0.0000100 epoch_Time:28349.0min: [2024-01-03 02:11:08,826][model8_pretrain.py][INFO] Epoch:[0/2](81300/4588595) loss:2.949 lr:0.0000100 epoch_Time:28349.0min: [2024-01-03 02:11:08,826][model8_pretrain.py][INFO] Epoch:[0/2](81300/4588595) loss:3.365 lr:0.0000100 epoch_Time:28349.0min: [2024-01-03 02:11:08,826][model8_pretrain.py][INFO] Epoch:[0/2](81300/4588595) loss:2.950 lr:0.0000100 epoch_Time:28349.0min: [2024-01-03 02:11:08,826][model8_pretrain.py][INFO] Epoch:[0/2](81300/4588595) 
loss:3.359 lr:0.0000100 epoch_Time:28349.0min: [2024-01-03 02:11:08,827][model8_pretrain.py][INFO] Epoch:[0/2](81300/4588595) loss:3.339 lr:0.0000100 epoch_Time:28349.0min: [2024-01-03 02:11:08,827][model8_pretrain.py][INFO] Epoch:[0/2](81300/4588595) loss:2.873 lr:0.0000100 epoch_Time:28349.0min: [2024-01-03 02:11:45,772][model8_pretrain.py][INFO] Epoch:[0/2](81400/4588595) loss:3.059 lr:0.0000100 epoch_Time:28348.0min: [2024-01-03 02:11:45,772][model8_pretrain.py][INFO] Epoch:[0/2](81400/4588595) loss:3.309 lr:0.0000100 epoch_Time:28348.0min: [2024-01-03 02:11:45,772][model8_pretrain.py][INFO] Epoch:[0/2](81400/4588595) loss:3.089 lr:0.0000100 epoch_Time:28348.0min: [2024-01-03 02:11:45,772][model8_pretrain.py][INFO] Epoch:[0/2](81400/4588595) loss:2.231 lr:0.0000100 epoch_Time:28348.0min: [2024-01-03 02:11:45,772][model8_pretrain.py][INFO] Epoch:[0/2](81400/4588595) loss:3.109 lr:0.0000100 epoch_Time:28348.0min: [2024-01-03 02:11:45,772][model8_pretrain.py][INFO] Epoch:[0/2](81400/4588595) loss:2.255 lr:0.0000100 epoch_Time:28348.0min: [2024-01-03 02:11:45,772][model8_pretrain.py][INFO] Epoch:[0/2](81400/4588595) loss:3.172 lr:0.0000100 epoch_Time:28348.0min: [2024-01-03 02:11:45,772][model8_pretrain.py][INFO] Epoch:[0/2](81400/4588595) loss:3.094 lr:0.0000100 epoch_Time:28348.0min: [2024-01-03 02:12:22,760][model8_pretrain.py][INFO] Epoch:[0/2](81500/4588595) loss:2.505 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:12:22,760][model8_pretrain.py][INFO] Epoch:[0/2](81500/4588595) loss:3.438 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:12:22,760][model8_pretrain.py][INFO] Epoch:[0/2](81500/4588595) loss:2.965 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:12:22,760][model8_pretrain.py][INFO] Epoch:[0/2](81500/4588595) loss:2.835 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:12:22,760][model8_pretrain.py][INFO] Epoch:[0/2](81500/4588595) loss:2.874 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:12:22,760][model8_pretrain.py][INFO] 
Epoch:[0/2](81500/4588595) loss:3.080 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:12:22,760][model8_pretrain.py][INFO] Epoch:[0/2](81500/4588595) loss:3.423 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:12:22,760][model8_pretrain.py][INFO] Epoch:[0/2](81500/4588595) loss:3.607 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:13:06,806][model8_pretrain.py][INFO] Epoch:[0/2](81600/4588595) loss:3.310 lr:0.0000100 epoch_Time:28352.0min: [2024-01-03 02:13:06,806][model8_pretrain.py][INFO] Epoch:[0/2](81600/4588595) loss:2.710 lr:0.0000100 epoch_Time:28352.0min: [2024-01-03 02:13:06,806][model8_pretrain.py][INFO] Epoch:[0/2](81600/4588595) loss:2.686 lr:0.0000100 epoch_Time:28352.0min: [2024-01-03 02:13:06,806][model8_pretrain.py][INFO] Epoch:[0/2](81600/4588595) loss:3.466 lr:0.0000100 epoch_Time:28352.0min: [2024-01-03 02:13:06,806][model8_pretrain.py][INFO] Epoch:[0/2](81600/4588595) loss:2.425 lr:0.0000100 epoch_Time:28352.0min: [2024-01-03 02:13:06,806][model8_pretrain.py][INFO] Epoch:[0/2](81600/4588595) loss:3.337 lr:0.0000100 epoch_Time:28352.0min: [2024-01-03 02:13:06,806][model8_pretrain.py][INFO] Epoch:[0/2](81600/4588595) loss:3.473 lr:0.0000100 epoch_Time:28352.0min: [2024-01-03 02:13:06,806][model8_pretrain.py][INFO] Epoch:[0/2](81600/4588595) loss:3.262 lr:0.0000100 epoch_Time:28352.0min: [2024-01-03 02:13:43,729][model8_pretrain.py][INFO] Epoch:[0/2](81700/4588595) loss:3.406 lr:0.0000100 epoch_Time:28351.0min: [2024-01-03 02:13:43,729][model8_pretrain.py][INFO] Epoch:[0/2](81700/4588595) loss:3.371 lr:0.0000100 epoch_Time:28351.0min: [2024-01-03 02:13:43,729][model8_pretrain.py][INFO] Epoch:[0/2](81700/4588595) loss:3.211 lr:0.0000100 epoch_Time:28351.0min: [2024-01-03 02:13:43,729][model8_pretrain.py][INFO] Epoch:[0/2](81700/4588595) loss:3.276 lr:0.0000100 epoch_Time:28351.0min: [2024-01-03 02:13:43,729][model8_pretrain.py][INFO] Epoch:[0/2](81700/4588595) loss:3.134 lr:0.0000100 epoch_Time:28351.0min: [2024-01-03 
02:13:43,729][model8_pretrain.py][INFO] Epoch:[0/2](81700/4588595) loss:3.107 lr:0.0000100 epoch_Time:28351.0min: [2024-01-03 02:13:43,729][model8_pretrain.py][INFO] Epoch:[0/2](81700/4588595) loss:3.120 lr:0.0000100 epoch_Time:28351.0min: [2024-01-03 02:13:43,730][model8_pretrain.py][INFO] Epoch:[0/2](81700/4588595) loss:3.122 lr:0.0000100 epoch_Time:28351.0min: [2024-01-03 02:14:20,669][model8_pretrain.py][INFO] Epoch:[0/2](81800/4588595) loss:3.288 lr:0.0000100 epoch_Time:28349.0min: [2024-01-03 02:14:20,669][model8_pretrain.py][INFO] Epoch:[0/2](81800/4588595) loss:3.235 lr:0.0000100 epoch_Time:28349.0min: [2024-01-03 02:14:20,669][model8_pretrain.py][INFO] Epoch:[0/2](81800/4588595) loss:3.486 lr:0.0000100 epoch_Time:28349.0min: [2024-01-03 02:14:20,669][model8_pretrain.py][INFO] Epoch:[0/2](81800/4588595) loss:2.968 lr:0.0000100 epoch_Time:28349.0min: [2024-01-03 02:14:20,669][model8_pretrain.py][INFO] Epoch:[0/2](81800/4588595) loss:3.475 lr:0.0000100 epoch_Time:28349.0min: [2024-01-03 02:14:20,669][model8_pretrain.py][INFO] Epoch:[0/2](81800/4588595) loss:3.121 lr:0.0000100 epoch_Time:28349.0min: [2024-01-03 02:14:20,669][model8_pretrain.py][INFO] Epoch:[0/2](81800/4588595) loss:2.237 lr:0.0000100 epoch_Time:28349.0min: [2024-01-03 02:14:20,669][model8_pretrain.py][INFO] Epoch:[0/2](81800/4588595) loss:3.425 lr:0.0000100 epoch_Time:28349.0min: [2024-01-03 02:14:57,612][model8_pretrain.py][INFO] Epoch:[0/2](81900/4588595) loss:3.346 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:14:57,612][model8_pretrain.py][INFO] Epoch:[0/2](81900/4588595) loss:3.124 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:14:57,612][model8_pretrain.py][INFO] Epoch:[0/2](81900/4588595) loss:2.941 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:14:57,612][model8_pretrain.py][INFO] Epoch:[0/2](81900/4588595) loss:3.300 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:14:57,612][model8_pretrain.py][INFO] Epoch:[0/2](81900/4588595) loss:3.605 lr:0.0000100 
epoch_Time:28347.0min: [2024-01-03 02:14:57,612][model8_pretrain.py][INFO] Epoch:[0/2](81900/4588595) loss:2.734 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:14:57,612][model8_pretrain.py][INFO] Epoch:[0/2](81900/4588595) loss:2.918 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:14:57,612][model8_pretrain.py][INFO] Epoch:[0/2](81900/4588595) loss:3.150 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:15:34,544][model8_pretrain.py][INFO] Epoch:[0/2](82000/4588595) loss:3.076 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:15:34,544][model8_pretrain.py][INFO] Epoch:[0/2](82000/4588595) loss:2.626 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:15:34,544][model8_pretrain.py][INFO] Epoch:[0/2](82000/4588595) loss:3.136 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:15:34,544][model8_pretrain.py][INFO] Epoch:[0/2](82000/4588595) loss:3.157 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:15:34,544][model8_pretrain.py][INFO] Epoch:[0/2](82000/4588595) loss:3.176 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:15:34,544][model8_pretrain.py][INFO] Epoch:[0/2](82000/4588595) loss:2.600 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:15:34,544][model8_pretrain.py][INFO] Epoch:[0/2](82000/4588595) loss:2.980 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:15:34,545][model8_pretrain.py][INFO] Epoch:[0/2](82000/4588595) loss:3.183 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:16:11,479][model8_pretrain.py][INFO] Epoch:[0/2](82100/4588595) loss:2.941 lr:0.0000100 epoch_Time:28345.0min: [2024-01-03 02:16:11,479][model8_pretrain.py][INFO] Epoch:[0/2](82100/4588595) loss:3.021 lr:0.0000100 epoch_Time:28345.0min: [2024-01-03 02:16:11,479][model8_pretrain.py][INFO] Epoch:[0/2](82100/4588595) loss:2.502 lr:0.0000100 epoch_Time:28345.0min: [2024-01-03 02:16:11,479][model8_pretrain.py][INFO] Epoch:[0/2](82100/4588595) loss:3.637 lr:0.0000100 epoch_Time:28345.0min: [2024-01-03 02:16:11,479][model8_pretrain.py][INFO] Epoch:[0/2](82100/4588595) 
loss:3.051 lr:0.0000100 epoch_Time:28345.0min: [2024-01-03 02:16:11,479][model8_pretrain.py][INFO] Epoch:[0/2](82100/4588595) loss:3.511 lr:0.0000100 epoch_Time:28345.0min: [2024-01-03 02:16:11,479][model8_pretrain.py][INFO] Epoch:[0/2](82100/4588595) loss:2.907 lr:0.0000100 epoch_Time:28345.0min: [2024-01-03 02:16:11,479][model8_pretrain.py][INFO] Epoch:[0/2](82100/4588595) loss:3.219 lr:0.0000100 epoch_Time:28345.0min: [2024-01-03 02:16:48,423][model8_pretrain.py][INFO] Epoch:[0/2](82200/4588595) loss:3.176 lr:0.0000100 epoch_Time:28343.0min: [2024-01-03 02:16:48,423][model8_pretrain.py][INFO] Epoch:[0/2](82200/4588595) loss:3.085 lr:0.0000100 epoch_Time:28343.0min: [2024-01-03 02:16:48,423][model8_pretrain.py][INFO] Epoch:[0/2](82200/4588595) loss:2.810 lr:0.0000100 epoch_Time:28343.0min: [2024-01-03 02:16:48,423][model8_pretrain.py][INFO] Epoch:[0/2](82200/4588595) loss:2.487 lr:0.0000100 epoch_Time:28343.0min: [2024-01-03 02:16:48,423][model8_pretrain.py][INFO] Epoch:[0/2](82200/4588595) loss:3.004 lr:0.0000100 epoch_Time:28343.0min: [2024-01-03 02:16:48,423][model8_pretrain.py][INFO] Epoch:[0/2](82200/4588595) loss:3.153 lr:0.0000100 epoch_Time:28343.0min: [2024-01-03 02:16:48,423][model8_pretrain.py][INFO] Epoch:[0/2](82200/4588595) loss:3.218 lr:0.0000100 epoch_Time:28343.0min: [2024-01-03 02:16:48,423][model8_pretrain.py][INFO] Epoch:[0/2](82200/4588595) loss:3.279 lr:0.0000100 epoch_Time:28343.0min: [2024-01-03 02:17:25,391][model8_pretrain.py][INFO] Epoch:[0/2](82300/4588595) loss:2.561 lr:0.0000100 epoch_Time:28342.0min: [2024-01-03 02:17:25,391][model8_pretrain.py][INFO] Epoch:[0/2](82300/4588595) loss:3.303 lr:0.0000100 epoch_Time:28342.0min: [2024-01-03 02:17:25,391][model8_pretrain.py][INFO] Epoch:[0/2](82300/4588595) loss:2.937 lr:0.0000100 epoch_Time:28342.0min: [2024-01-03 02:17:25,391][model8_pretrain.py][INFO] Epoch:[0/2](82300/4588595) loss:2.542 lr:0.0000100 epoch_Time:28342.0min: [2024-01-03 02:17:25,391][model8_pretrain.py][INFO] 
Epoch:[0/2](82300/4588595) loss:3.142 lr:0.0000100 epoch_Time:28342.0min: [2024-01-03 02:17:25,391][model8_pretrain.py][INFO] Epoch:[0/2](82300/4588595) loss:3.290 lr:0.0000100 epoch_Time:28342.0min: [2024-01-03 02:17:25,391][model8_pretrain.py][INFO] Epoch:[0/2](82300/4588595) loss:3.468 lr:0.0000100 epoch_Time:28342.0min: [2024-01-03 02:17:25,391][model8_pretrain.py][INFO] Epoch:[0/2](82300/4588595) loss:3.581 lr:0.0000100 epoch_Time:28342.0min: [2024-01-03 02:18:09,449][model8_pretrain.py][INFO] Epoch:[0/2](82400/4588595) loss:3.404 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:18:09,449][model8_pretrain.py][INFO] Epoch:[0/2](82400/4588595) loss:3.857 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:18:09,449][model8_pretrain.py][INFO] Epoch:[0/2](82400/4588595) loss:3.056 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:18:09,449][model8_pretrain.py][INFO] Epoch:[0/2](82400/4588595) loss:3.175 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:18:09,449][model8_pretrain.py][INFO] Epoch:[0/2](82400/4588595) loss:2.846 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:18:09,449][model8_pretrain.py][INFO] Epoch:[0/2](82400/4588595) loss:3.103 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:18:09,449][model8_pretrain.py][INFO] Epoch:[0/2](82400/4588595) loss:2.679 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:18:09,449][model8_pretrain.py][INFO] Epoch:[0/2](82400/4588595) loss:3.330 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:18:46,416][model8_pretrain.py][INFO] Epoch:[0/2](82500/4588595) loss:2.357 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 02:18:46,416][model8_pretrain.py][INFO] Epoch:[0/2](82500/4588595) loss:3.144 lr:0.0000100 epoch_Time:28346.0min: [2024-01-03 02:18:46,416][model8_pretrain.py][INFO] Epoch:[0/2](82500/4588595) loss:2.900 lr:0.0000100 epoch_Time:28346.0min: [2024-01-03 02:18:46,416][model8_pretrain.py][INFO] Epoch:[0/2](82500/4588595) loss:2.969 lr:0.0000100 epoch_Time:28347.0min: [2024-01-03 
02:18:46,416][model8_pretrain.py][INFO] Epoch:[0/2](82500/4588595) loss:3.277 lr:0.0000100 epoch_Time:28346.0min: [2024-01-03 02:18:46,416][model8_pretrain.py][INFO] Epoch:[0/2](82500/4588595) loss:3.219 lr:0.0000100 epoch_Time:28346.0min: [2024-01-03 02:18:46,416][model8_pretrain.py][INFO] Epoch:[0/2](82500/4588595) loss:2.766 lr:0.0000100 epoch_Time:28346.0min: [2024-01-03 02:18:46,423][model8_pretrain.py][INFO] Epoch:[0/2](82500/4588595) loss:3.026 lr:0.0000100 epoch_Time:28346.0min: [2024-01-03 02:19:23,390][model8_pretrain.py][INFO] Epoch:[0/2](82600/4588595) loss:3.098 lr:0.0000100 epoch_Time:28345.0min: [2024-01-03 02:19:23,390][model8_pretrain.py][INFO] Epoch:[0/2](82600/4588595) loss:2.924 lr:0.0000100 epoch_Time:28345.0min: [2024-01-03 02:19:23,390][model8_pretrain.py][INFO] Epoch:[0/2](82600/4588595) loss:2.496 lr:0.0000100 epoch_Time:28345.0min: [2024-01-03 02:19:23,390][model8_pretrain.py][INFO] Epoch:[0/2](82600/4588595) loss:3.209 lr:0.0000100 epoch_Time:28345.0min: [2024-01-03 02:19:23,390][model8_pretrain.py][INFO] Epoch:[0/2](82600/4588595) loss:3.345 lr:0.0000100 epoch_Time:28345.0min: [2024-01-03 02:19:23,390][model8_pretrain.py][INFO] Epoch:[0/2](82600/4588595) loss:3.034 lr:0.0000100 epoch_Time:28345.0min: [2024-01-03 02:19:23,390][model8_pretrain.py][INFO] Epoch:[0/2](82600/4588595) loss:3.265 lr:0.0000100 epoch_Time:28345.0min: [2024-01-03 02:19:23,391][model8_pretrain.py][INFO] Epoch:[0/2](82600/4588595) loss:3.087 lr:0.0000100 epoch_Time:28345.0min: [2024-01-03 02:20:00,342][model8_pretrain.py][INFO] Epoch:[0/2](82700/4588595) loss:3.380 lr:0.0000100 epoch_Time:28343.0min: [2024-01-03 02:20:00,342][model8_pretrain.py][INFO] Epoch:[0/2](82700/4588595) loss:2.687 lr:0.0000100 epoch_Time:28343.0min: [2024-01-03 02:20:00,342][model8_pretrain.py][INFO] Epoch:[0/2](82700/4588595) loss:3.007 lr:0.0000100 epoch_Time:28343.0min: [2024-01-03 02:20:00,342][model8_pretrain.py][INFO] Epoch:[0/2](82700/4588595) loss:2.930 lr:0.0000100 
epoch_Time:28343.0min: [2024-01-03 02:20:00,342][model8_pretrain.py][INFO] Epoch:[0/2](82700/4588595) loss:2.710 lr:0.0000100 epoch_Time:28343.0min: [2024-01-03 02:20:00,342][model8_pretrain.py][INFO] Epoch:[0/2](82700/4588595) loss:3.307 lr:0.0000100 epoch_Time:28343.0min: [2024-01-03 02:20:00,342][model8_pretrain.py][INFO] Epoch:[0/2](82700/4588595) loss:3.252 lr:0.0000100 epoch_Time:28343.0min: [2024-01-03 02:20:00,343][model8_pretrain.py][INFO] Epoch:[0/2](82700/4588595) loss:3.002 lr:0.0000100 epoch_Time:28343.0min: [2024-01-03 02:20:37,296][model8_pretrain.py][INFO] Epoch:[0/2](82800/4588595) loss:2.822 lr:0.0000100 epoch_Time:28342.0min: [2024-01-03 02:20:37,296][model8_pretrain.py][INFO] Epoch:[0/2](82800/4588595) loss:2.766 lr:0.0000100 epoch_Time:28342.0min: [2024-01-03 02:20:37,296][model8_pretrain.py][INFO] Epoch:[0/2](82800/4588595) loss:3.044 lr:0.0000100 epoch_Time:28342.0min: [2024-01-03 02:20:37,296][model8_pretrain.py][INFO] Epoch:[0/2](82800/4588595) loss:2.876 lr:0.0000100 epoch_Time:28342.0min: [2024-01-03 02:20:37,296][model8_pretrain.py][INFO] Epoch:[0/2](82800/4588595) loss:3.223 lr:0.0000100 epoch_Time:28342.0min: [2024-01-03 02:20:37,296][model8_pretrain.py][INFO] Epoch:[0/2](82800/4588595) loss:2.906 lr:0.0000100 epoch_Time:28342.0min: [2024-01-03 02:20:37,296][model8_pretrain.py][INFO] Epoch:[0/2](82800/4588595) loss:2.797 lr:0.0000100 epoch_Time:28342.0min: [2024-01-03 02:20:37,296][model8_pretrain.py][INFO] Epoch:[0/2](82800/4588595) loss:2.634 lr:0.0000100 epoch_Time:28342.0min: [2024-01-03 02:21:14,242][model8_pretrain.py][INFO] Epoch:[0/2](82900/4588595) loss:3.251 lr:0.0000100 epoch_Time:28341.0min: [2024-01-03 02:21:14,242][model8_pretrain.py][INFO] Epoch:[0/2](82900/4588595) loss:3.289 lr:0.0000100 epoch_Time:28341.0min: [2024-01-03 02:21:14,242][model8_pretrain.py][INFO] Epoch:[0/2](82900/4588595) loss:2.891 lr:0.0000100 epoch_Time:28341.0min: [2024-01-03 02:21:14,242][model8_pretrain.py][INFO] Epoch:[0/2](82900/4588595) 
loss:2.998 lr:0.0000100 epoch_Time:28341.0min: [2024-01-03 02:21:14,242][model8_pretrain.py][INFO] Epoch:[0/2](82900/4588595) loss:3.200 lr:0.0000100 epoch_Time:28341.0min: [2024-01-03 02:21:14,242][model8_pretrain.py][INFO] Epoch:[0/2](82900/4588595) loss:3.198 lr:0.0000100 epoch_Time:28341.0min: [2024-01-03 02:21:14,242][model8_pretrain.py][INFO] Epoch:[0/2](82900/4588595) loss:3.113 lr:0.0000100 epoch_Time:28341.0min: [2024-01-03 02:21:14,242][model8_pretrain.py][INFO] Epoch:[0/2](82900/4588595) loss:2.837 lr:0.0000100 epoch_Time:28341.0min: [2024-01-03 02:21:51,181][model8_pretrain.py][INFO] Epoch:[0/2](83000/4588595) loss:2.440 lr:0.0000100 epoch_Time:28339.0min: [2024-01-03 02:21:51,181][model8_pretrain.py][INFO] Epoch:[0/2](83000/4588595) loss:3.082 lr:0.0000100 epoch_Time:28339.0min: [2024-01-03 02:21:51,181][model8_pretrain.py][INFO] Epoch:[0/2](83000/4588595) loss:2.940 lr:0.0000100 epoch_Time:28339.0min: [2024-01-03 02:21:51,181][model8_pretrain.py][INFO] Epoch:[0/2](83000/4588595) loss:2.483 lr:0.0000100 epoch_Time:28339.0min: [2024-01-03 02:21:51,181][model8_pretrain.py][INFO] Epoch:[0/2](83000/4588595) loss:3.277 lr:0.0000100 epoch_Time:28339.0min: [2024-01-03 02:21:51,181][model8_pretrain.py][INFO] Epoch:[0/2](83000/4588595) loss:2.332 lr:0.0000100 epoch_Time:28339.0min: [2024-01-03 02:21:51,181][model8_pretrain.py][INFO] Epoch:[0/2](83000/4588595) loss:3.276 lr:0.0000100 epoch_Time:28339.0min: [2024-01-03 02:21:51,181][model8_pretrain.py][INFO] Epoch:[0/2](83000/4588595) loss:2.934 lr:0.0000100 epoch_Time:28339.0min: [2024-01-03 02:22:28,131][model8_pretrain.py][INFO] Epoch:[0/2](83100/4588595) loss:2.060 lr:0.0000100 epoch_Time:28338.0min: [2024-01-03 02:22:28,131][model8_pretrain.py][INFO] Epoch:[0/2](83100/4588595) loss:3.156 lr:0.0000100 epoch_Time:28338.0min: [2024-01-03 02:22:28,131][model8_pretrain.py][INFO] Epoch:[0/2](83100/4588595) loss:2.610 lr:0.0000100 epoch_Time:28338.0min: [2024-01-03 02:22:28,131][model8_pretrain.py][INFO] 
Epoch:[0/2](83100/4588595) loss:3.512 lr:0.0000100 epoch_Time:28338.0min: [2024-01-03 02:22:28,131][model8_pretrain.py][INFO] Epoch:[0/2](83100/4588595) loss:3.211 lr:0.0000100 epoch_Time:28338.0min: [2024-01-03 02:22:28,131][model8_pretrain.py][INFO] Epoch:[0/2](83100/4588595) loss:3.364 lr:0.0000100 epoch_Time:28338.0min: [2024-01-03 02:22:28,131][model8_pretrain.py][INFO] Epoch:[0/2](83100/4588595) loss:3.120 lr:0.0000100 epoch_Time:28338.0min: [2024-01-03 02:22:28,131][model8_pretrain.py][INFO] Epoch:[0/2](83100/4588595) loss:2.807 lr:0.0000100 epoch_Time:28338.0min: [2024-01-03 02:23:12,061][model8_pretrain.py][INFO] Epoch:[0/2](83200/4588595) loss:3.493 lr:0.0000100 epoch_Time:28343.0min: [2024-01-03 02:23:12,061][model8_pretrain.py][INFO] Epoch:[0/2](83200/4588595) loss:3.440 lr:0.0000100 epoch_Time:28343.0min: [2024-01-03 02:23:12,061][model8_pretrain.py][INFO] Epoch:[0/2](83200/4588595) loss:3.169 lr:0.0000100 epoch_Time:28343.0min: [2024-01-03 02:23:12,061][model8_pretrain.py][INFO] Epoch:[0/2](83200/4588595) loss:3.158 lr:0.0000100 epoch_Time:28343.0min: [2024-01-03 02:23:12,061][model8_pretrain.py][INFO] Epoch:[0/2](83200/4588595) loss:3.056 lr:0.0000100 epoch_Time:28343.0min: [2024-01-03 02:23:12,061][model8_pretrain.py][INFO] Epoch:[0/2](83200/4588595) loss:2.905 lr:0.0000100 epoch_Time:28343.0min: [2024-01-03 02:23:12,061][model8_pretrain.py][INFO] Epoch:[0/2](83200/4588595) loss:2.903 lr:0.0000100 epoch_Time:28343.0min: [2024-01-03 02:23:12,061][model8_pretrain.py][INFO] Epoch:[0/2](83200/4588595) loss:3.015 lr:0.0000100 epoch_Time:28343.0min: [2024-01-03 02:23:48,986][model8_pretrain.py][INFO] Epoch:[0/2](83300/4588595) loss:2.832 lr:0.0000100 epoch_Time:28341.0min: [2024-01-03 02:23:48,986][model8_pretrain.py][INFO] Epoch:[0/2](83300/4588595) loss:3.017 lr:0.0000100 epoch_Time:28341.0min: [2024-01-03 02:23:48,986][model8_pretrain.py][INFO] Epoch:[0/2](83300/4588595) loss:3.363 lr:0.0000100 epoch_Time:28341.0min: [2024-01-03 
02:23:48,986][model8_pretrain.py][INFO] Epoch:[0/2](83300/4588595) loss:2.926 lr:0.0000100 epoch_Time:28341.0min: [2024-01-03 02:23:48,986][model8_pretrain.py][INFO] Epoch:[0/2](83300/4588595) loss:2.937 lr:0.0000100 epoch_Time:28341.0min: [2024-01-03 02:23:48,986][model8_pretrain.py][INFO] Epoch:[0/2](83300/4588595) loss:3.091 lr:0.0000100 epoch_Time:28341.0min: [2024-01-03 02:23:48,987][model8_pretrain.py][INFO] Epoch:[0/2](83300/4588595) loss:2.746 lr:0.0000100 epoch_Time:28341.0min: [2024-01-03 02:23:48,987][model8_pretrain.py][INFO] Epoch:[0/2](83300/4588595) loss:3.342 lr:0.0000100 epoch_Time:28341.0min: [2024-01-03 02:24:25,918][model8_pretrain.py][INFO] Epoch:[0/2](83400/4588595) loss:2.892 lr:0.0000100 epoch_Time:28340.0min: [2024-01-03 02:24:25,918][model8_pretrain.py][INFO] Epoch:[0/2](83400/4588595) loss:2.603 lr:0.0000100 epoch_Time:28340.0min: [2024-01-03 02:24:25,919][model8_pretrain.py][INFO] Epoch:[0/2](83400/4588595) loss:3.553 lr:0.0000100 epoch_Time:28340.0min: [2024-01-03 02:24:25,919][model8_pretrain.py][INFO] Epoch:[0/2](83400/4588595) loss:3.122 lr:0.0000100 epoch_Time:28340.0min: [2024-01-03 02:24:25,918][model8_pretrain.py][INFO] Epoch:[0/2](83400/4588595) loss:3.482 lr:0.0000100 epoch_Time:28340.0min: [2024-01-03 02:24:25,919][model8_pretrain.py][INFO] Epoch:[0/2](83400/4588595) loss:2.668 lr:0.0000100 epoch_Time:28340.0min: [2024-01-03 02:24:25,919][model8_pretrain.py][INFO] Epoch:[0/2](83400/4588595) loss:2.480 lr:0.0000100 epoch_Time:28340.0min: [2024-01-03 02:24:25,919][model8_pretrain.py][INFO] Epoch:[0/2](83400/4588595) loss:3.283 lr:0.0000100 epoch_Time:28340.0min: [2024-01-03 02:25:02,848][model8_pretrain.py][INFO] Epoch:[0/2](83500/4588595) loss:2.991 lr:0.0000100 epoch_Time:28339.0min: [2024-01-03 02:25:02,848][model8_pretrain.py][INFO] Epoch:[0/2](83500/4588595) loss:2.989 lr:0.0000100 epoch_Time:28339.0min: [2024-01-03 02:25:02,848][model8_pretrain.py][INFO] Epoch:[0/2](83500/4588595) loss:3.129 lr:0.0000100 
epoch_Time:28339.0min: [2024-01-03 02:25:02,848][model8_pretrain.py][INFO] Epoch:[0/2](83500/4588595) loss:3.032 lr:0.0000100 epoch_Time:28339.0min: [2024-01-03 02:25:02,848][model8_pretrain.py][INFO] Epoch:[0/2](83500/4588595) loss:3.214 lr:0.0000100 epoch_Time:28339.0min: [2024-01-03 02:25:02,848][model8_pretrain.py][INFO] Epoch:[0/2](83500/4588595) loss:3.065 lr:0.0000100 epoch_Time:28339.0min: [2024-01-03 02:25:02,848][model8_pretrain.py][INFO] Epoch:[0/2](83500/4588595) loss:3.341 lr:0.0000100 epoch_Time:28339.0min: [2024-01-03 02:25:02,848][model8_pretrain.py][INFO] Epoch:[0/2](83500/4588595) loss:3.252 lr:0.0000100 epoch_Time:28339.0min: [2024-01-03 02:25:39,783][model8_pretrain.py][INFO] Epoch:[0/2](83600/4588595) loss:2.789 lr:0.0000100 epoch_Time:28338.0min: [2024-01-03 02:25:39,783][model8_pretrain.py][INFO] Epoch:[0/2](83600/4588595) loss:2.853 lr:0.0000100 epoch_Time:28338.0min: [2024-01-03 02:25:39,783][model8_pretrain.py][INFO] Epoch:[0/2](83600/4588595) loss:3.313 lr:0.0000100 epoch_Time:28338.0min: [2024-01-03 02:25:39,783][model8_pretrain.py][INFO] Epoch:[0/2](83600/4588595) loss:3.147 lr:0.0000100 epoch_Time:28338.0min: [2024-01-03 02:25:39,783][model8_pretrain.py][INFO] Epoch:[0/2](83600/4588595) loss:3.384 lr:0.0000100 epoch_Time:28338.0min: [2024-01-03 02:25:39,783][model8_pretrain.py][INFO] Epoch:[0/2](83600/4588595) loss:3.172 lr:0.0000100 epoch_Time:28338.0min: [2024-01-03 02:25:39,783][model8_pretrain.py][INFO] Epoch:[0/2](83600/4588595) loss:3.091 lr:0.0000100 epoch_Time:28338.0min: [2024-01-03 02:25:39,783][model8_pretrain.py][INFO] Epoch:[0/2](83600/4588595) loss:2.906 lr:0.0000100 epoch_Time:28338.0min: [2024-01-03 02:26:16,719][model8_pretrain.py][INFO] Epoch:[0/2](83700/4588595) loss:2.163 lr:0.0000100 epoch_Time:28336.0min: [2024-01-03 02:26:16,719][model8_pretrain.py][INFO] Epoch:[0/2](83700/4588595) loss:2.952 lr:0.0000100 epoch_Time:28336.0min: [2024-01-03 02:26:16,719][model8_pretrain.py][INFO] Epoch:[0/2](83700/4588595) 
loss:3.447 lr:0.0000100 epoch_Time:28336.0min: [2024-01-03 02:26:16,719][model8_pretrain.py][INFO] Epoch:[0/2](83700/4588595) loss:3.198 lr:0.0000100 epoch_Time:28336.0min: [2024-01-03 02:26:16,720][model8_pretrain.py][INFO] Epoch:[0/2](83700/4588595) loss:2.997 lr:0.0000100 epoch_Time:28336.0min: [2024-01-03 02:26:16,720][model8_pretrain.py][INFO] Epoch:[0/2](83700/4588595) loss:3.313 lr:0.0000100 epoch_Time:28336.0min: [2024-01-03 02:26:16,720][model8_pretrain.py][INFO] Epoch:[0/2](83700/4588595) loss:2.849 lr:0.0000100 epoch_Time:28336.0min: [2024-01-03 02:26:16,720][model8_pretrain.py][INFO] Epoch:[0/2](83700/4588595) loss:2.325 lr:0.0000100 epoch_Time:28336.0min: [2024-01-03 02:26:53,660][model8_pretrain.py][INFO] Epoch:[0/2](83800/4588595) loss:2.879 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:26:53,660][model8_pretrain.py][INFO] Epoch:[0/2](83800/4588595) loss:3.054 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:26:53,660][model8_pretrain.py][INFO] Epoch:[0/2](83800/4588595) loss:3.075 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:26:53,660][model8_pretrain.py][INFO] Epoch:[0/2](83800/4588595) loss:3.027 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:26:53,660][model8_pretrain.py][INFO] Epoch:[0/2](83800/4588595) loss:2.687 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:26:53,660][model8_pretrain.py][INFO] Epoch:[0/2](83800/4588595) loss:2.726 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:26:53,660][model8_pretrain.py][INFO] Epoch:[0/2](83800/4588595) loss:3.261 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:26:53,665][model8_pretrain.py][INFO] Epoch:[0/2](83800/4588595) loss:3.355 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:27:30,607][model8_pretrain.py][INFO] Epoch:[0/2](83900/4588595) loss:2.945 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:27:30,607][model8_pretrain.py][INFO] Epoch:[0/2](83900/4588595) loss:3.321 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:27:30,607][model8_pretrain.py][INFO] 
Epoch:[0/2](83900/4588595) loss:3.167 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:27:30,607][model8_pretrain.py][INFO] Epoch:[0/2](83900/4588595) loss:3.119 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:27:30,607][model8_pretrain.py][INFO] Epoch:[0/2](83900/4588595) loss:3.478 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:27:30,607][model8_pretrain.py][INFO] Epoch:[0/2](83900/4588595) loss:2.920 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:27:30,607][model8_pretrain.py][INFO] Epoch:[0/2](83900/4588595) loss:2.980 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:27:30,607][model8_pretrain.py][INFO] Epoch:[0/2](83900/4588595) loss:2.777 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:28:14,572][model8_pretrain.py][INFO] Epoch:[0/2](84000/4588595) loss:3.517 lr:0.0000100 epoch_Time:28338.0min: [2024-01-03 02:28:14,572][model8_pretrain.py][INFO] Epoch:[0/2](84000/4588595) loss:2.833 lr:0.0000100 epoch_Time:28338.0min: [2024-01-03 02:28:14,572][model8_pretrain.py][INFO] Epoch:[0/2](84000/4588595) loss:3.423 lr:0.0000100 epoch_Time:28338.0min: [2024-01-03 02:28:14,572][model8_pretrain.py][INFO] Epoch:[0/2](84000/4588595) loss:2.941 lr:0.0000100 epoch_Time:28338.0min: [2024-01-03 02:28:14,572][model8_pretrain.py][INFO] Epoch:[0/2](84000/4588595) loss:2.886 lr:0.0000100 epoch_Time:28338.0min: [2024-01-03 02:28:14,572][model8_pretrain.py][INFO] Epoch:[0/2](84000/4588595) loss:3.061 lr:0.0000100 epoch_Time:28338.0min: [2024-01-03 02:28:14,573][model8_pretrain.py][INFO] Epoch:[0/2](84000/4588595) loss:2.835 lr:0.0000100 epoch_Time:28338.0min: [2024-01-03 02:28:14,573][model8_pretrain.py][INFO] Epoch:[0/2](84000/4588595) loss:2.806 lr:0.0000100 epoch_Time:28338.0min: [2024-01-03 02:28:51,504][model8_pretrain.py][INFO] Epoch:[0/2](84100/4588595) loss:2.723 lr:0.0000100 epoch_Time:28337.0min: [2024-01-03 02:28:51,504][model8_pretrain.py][INFO] Epoch:[0/2](84100/4588595) loss:3.143 lr:0.0000100 epoch_Time:28337.0min: [2024-01-03 
02:28:51,504][model8_pretrain.py][INFO] Epoch:[0/2](84100/4588595) loss:2.805 lr:0.0000100 epoch_Time:28337.0min: [2024-01-03 02:28:51,504][model8_pretrain.py][INFO] Epoch:[0/2](84100/4588595) loss:3.042 lr:0.0000100 epoch_Time:28337.0min: [2024-01-03 02:28:51,504][model8_pretrain.py][INFO] Epoch:[0/2](84100/4588595) loss:3.241 lr:0.0000100 epoch_Time:28337.0min: [2024-01-03 02:28:51,504][model8_pretrain.py][INFO] Epoch:[0/2](84100/4588595) loss:3.325 lr:0.0000100 epoch_Time:28337.0min: [2024-01-03 02:28:51,505][model8_pretrain.py][INFO] Epoch:[0/2](84100/4588595) loss:3.208 lr:0.0000100 epoch_Time:28337.0min: [2024-01-03 02:28:51,505][model8_pretrain.py][INFO] Epoch:[0/2](84100/4588595) loss:3.012 lr:0.0000100 epoch_Time:28337.0min: [2024-01-03 02:29:28,448][model8_pretrain.py][INFO] Epoch:[0/2](84200/4588595) loss:2.878 lr:0.0000100 epoch_Time:28336.0min: [2024-01-03 02:29:28,448][model8_pretrain.py][INFO] Epoch:[0/2](84200/4588595) loss:2.923 lr:0.0000100 epoch_Time:28336.0min: [2024-01-03 02:29:28,448][model8_pretrain.py][INFO] Epoch:[0/2](84200/4588595) loss:2.967 lr:0.0000100 epoch_Time:28336.0min: [2024-01-03 02:29:28,448][model8_pretrain.py][INFO] Epoch:[0/2](84200/4588595) loss:3.060 lr:0.0000100 epoch_Time:28336.0min: [2024-01-03 02:29:28,448][model8_pretrain.py][INFO] Epoch:[0/2](84200/4588595) loss:2.705 lr:0.0000100 epoch_Time:28336.0min: [2024-01-03 02:29:28,448][model8_pretrain.py][INFO] Epoch:[0/2](84200/4588595) loss:3.224 lr:0.0000100 epoch_Time:28336.0min: [2024-01-03 02:29:28,448][model8_pretrain.py][INFO] Epoch:[0/2](84200/4588595) loss:3.138 lr:0.0000100 epoch_Time:28336.0min: [2024-01-03 02:29:28,448][model8_pretrain.py][INFO] Epoch:[0/2](84200/4588595) loss:2.675 lr:0.0000100 epoch_Time:28336.0min: [2024-01-03 02:30:05,389][model8_pretrain.py][INFO] Epoch:[0/2](84300/4588595) loss:3.046 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:30:05,389][model8_pretrain.py][INFO] Epoch:[0/2](84300/4588595) loss:2.460 lr:0.0000100 
epoch_Time:28334.0min: [2024-01-03 02:30:05,389][model8_pretrain.py][INFO] Epoch:[0/2](84300/4588595) loss:3.424 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:30:05,389][model8_pretrain.py][INFO] Epoch:[0/2](84300/4588595) loss:3.055 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:30:05,390][model8_pretrain.py][INFO] Epoch:[0/2](84300/4588595) loss:2.621 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:30:05,389][model8_pretrain.py][INFO] Epoch:[0/2](84300/4588595) loss:2.954 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:30:05,390][model8_pretrain.py][INFO] Epoch:[0/2](84300/4588595) loss:3.320 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:30:05,390][model8_pretrain.py][INFO] Epoch:[0/2](84300/4588595) loss:3.201 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:30:42,382][model8_pretrain.py][INFO] Epoch:[0/2](84400/4588595) loss:3.247 lr:0.0000100 epoch_Time:28333.0min: [2024-01-03 02:30:42,382][model8_pretrain.py][INFO] Epoch:[0/2](84400/4588595) loss:2.003 lr:0.0000100 epoch_Time:28333.0min: [2024-01-03 02:30:42,382][model8_pretrain.py][INFO] Epoch:[0/2](84400/4588595) loss:2.811 lr:0.0000100 epoch_Time:28333.0min: [2024-01-03 02:30:42,382][model8_pretrain.py][INFO] Epoch:[0/2](84400/4588595) loss:2.731 lr:0.0000100 epoch_Time:28333.0min: [2024-01-03 02:30:42,382][model8_pretrain.py][INFO] Epoch:[0/2](84400/4588595) loss:2.936 lr:0.0000100 epoch_Time:28333.0min: [2024-01-03 02:30:42,382][model8_pretrain.py][INFO] Epoch:[0/2](84400/4588595) loss:3.232 lr:0.0000100 epoch_Time:28333.0min: [2024-01-03 02:30:42,383][model8_pretrain.py][INFO] Epoch:[0/2](84400/4588595) loss:3.000 lr:0.0000100 epoch_Time:28333.0min: [2024-01-03 02:30:42,391][model8_pretrain.py][INFO] Epoch:[0/2](84400/4588595) loss:2.768 lr:0.0000100 epoch_Time:28333.0min: [2024-01-03 02:31:19,374][model8_pretrain.py][INFO] Epoch:[0/2](84500/4588595) loss:3.158 lr:0.0000100 epoch_Time:28332.0min: [2024-01-03 02:31:19,375][model8_pretrain.py][INFO] Epoch:[0/2](84500/4588595) 
loss:3.116 lr:0.0000100 epoch_Time:28332.0min: [2024-01-03 02:31:19,375][model8_pretrain.py][INFO] Epoch:[0/2](84500/4588595) loss:3.009 lr:0.0000100 epoch_Time:28332.0min: [2024-01-03 02:31:19,375][model8_pretrain.py][INFO] Epoch:[0/2](84500/4588595) loss:3.230 lr:0.0000100 epoch_Time:28332.0min: [2024-01-03 02:31:19,375][model8_pretrain.py][INFO] Epoch:[0/2](84500/4588595) loss:3.380 lr:0.0000100 epoch_Time:28332.0min: [2024-01-03 02:31:19,375][model8_pretrain.py][INFO] Epoch:[0/2](84500/4588595) loss:3.253 lr:0.0000100 epoch_Time:28332.0min: [2024-01-03 02:31:19,375][model8_pretrain.py][INFO] Epoch:[0/2](84500/4588595) loss:2.954 lr:0.0000100 epoch_Time:28332.0min: [2024-01-03 02:31:19,383][model8_pretrain.py][INFO] Epoch:[0/2](84500/4588595) loss:2.945 lr:0.0000100 epoch_Time:28332.0min: [2024-01-03 02:31:56,377][model8_pretrain.py][INFO] Epoch:[0/2](84600/4588595) loss:2.894 lr:0.0000100 epoch_Time:28330.0min: [2024-01-03 02:31:56,377][model8_pretrain.py][INFO] Epoch:[0/2](84600/4588595) loss:3.066 lr:0.0000100 epoch_Time:28330.0min: [2024-01-03 02:31:56,377][model8_pretrain.py][INFO] Epoch:[0/2](84600/4588595) loss:2.934 lr:0.0000100 epoch_Time:28330.0min: [2024-01-03 02:31:56,378][model8_pretrain.py][INFO] Epoch:[0/2](84600/4588595) loss:3.113 lr:0.0000100 epoch_Time:28330.0min: [2024-01-03 02:31:56,378][model8_pretrain.py][INFO] Epoch:[0/2](84600/4588595) loss:3.166 lr:0.0000100 epoch_Time:28330.0min: [2024-01-03 02:31:56,378][model8_pretrain.py][INFO] Epoch:[0/2](84600/4588595) loss:2.945 lr:0.0000100 epoch_Time:28330.0min: [2024-01-03 02:31:56,378][model8_pretrain.py][INFO] Epoch:[0/2](84600/4588595) loss:3.130 lr:0.0000100 epoch_Time:28330.0min: [2024-01-03 02:31:56,378][model8_pretrain.py][INFO] Epoch:[0/2](84600/4588595) loss:3.018 lr:0.0000100 epoch_Time:28330.0min: [2024-01-03 02:32:33,343][model8_pretrain.py][INFO] Epoch:[0/2](84700/4588595) loss:2.879 lr:0.0000100 epoch_Time:28329.0min: [2024-01-03 02:32:33,343][model8_pretrain.py][INFO] 
Epoch:[0/2](84700/4588595) loss:2.934 lr:0.0000100 epoch_Time:28329.0min: [2024-01-03 02:32:33,343][model8_pretrain.py][INFO] Epoch:[0/2](84700/4588595) loss:3.048 lr:0.0000100 epoch_Time:28329.0min: [2024-01-03 02:32:33,343][model8_pretrain.py][INFO] Epoch:[0/2](84700/4588595) loss:3.065 lr:0.0000100 epoch_Time:28329.0min: [2024-01-03 02:32:33,343][model8_pretrain.py][INFO] Epoch:[0/2](84700/4588595) loss:3.072 lr:0.0000100 epoch_Time:28329.0min: [2024-01-03 02:32:33,343][model8_pretrain.py][INFO] Epoch:[0/2](84700/4588595) loss:2.623 lr:0.0000100 epoch_Time:28329.0min: [2024-01-03 02:32:33,343][model8_pretrain.py][INFO] Epoch:[0/2](84700/4588595) loss:3.175 lr:0.0000100 epoch_Time:28329.0min: [2024-01-03 02:32:33,343][model8_pretrain.py][INFO] Epoch:[0/2](84700/4588595) loss:2.601 lr:0.0000100 epoch_Time:28329.0min: [2024-01-03 02:33:17,248][model8_pretrain.py][INFO] Epoch:[0/2](84800/4588595) loss:2.865 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:33:17,248][model8_pretrain.py][INFO] Epoch:[0/2](84800/4588595) loss:2.883 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:33:17,248][model8_pretrain.py][INFO] Epoch:[0/2](84800/4588595) loss:3.276 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:33:17,248][model8_pretrain.py][INFO] Epoch:[0/2](84800/4588595) loss:3.062 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:33:17,248][model8_pretrain.py][INFO] Epoch:[0/2](84800/4588595) loss:2.835 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:33:17,248][model8_pretrain.py][INFO] Epoch:[0/2](84800/4588595) loss:3.400 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:33:17,248][model8_pretrain.py][INFO] Epoch:[0/2](84800/4588595) loss:3.201 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:33:17,248][model8_pretrain.py][INFO] Epoch:[0/2](84800/4588595) loss:2.995 lr:0.0000100 epoch_Time:28334.0min: [2024-01-03 02:33:54,172][model8_pretrain.py][INFO] Epoch:[0/2](84900/4588595) loss:2.708 lr:0.0000100 epoch_Time:28332.0min: [2024-01-03 
02:33:54,172][model8_pretrain.py][INFO] Epoch:[0/2](84900/4588595) loss:3.011 lr:0.0000100 epoch_Time:28332.0min: [2024-01-03 02:33:54,172][model8_pretrain.py][INFO] Epoch:[0/2](84900/4588595) loss:2.034 lr:0.0000100 epoch_Time:28332.0min: [2024-01-03 02:33:54,172][model8_pretrain.py][INFO] Epoch:[0/2](84900/4588595) loss:3.584 lr:0.0000100 epoch_Time:28332.0min: [2024-01-03 02:33:54,172][model8_pretrain.py][INFO] Epoch:[0/2](84900/4588595) loss:2.974 lr:0.0000100 epoch_Time:28332.0min: [2024-01-03 02:33:54,172][model8_pretrain.py][INFO] Epoch:[0/2](84900/4588595) loss:3.043 lr:0.0000100 epoch_Time:28332.0min: [2024-01-03 02:33:54,172][model8_pretrain.py][INFO] Epoch:[0/2](84900/4588595) loss:3.212 lr:0.0000100 epoch_Time:28332.0min: [2024-01-03 02:33:54,172][model8_pretrain.py][INFO] Epoch:[0/2](84900/4588595) loss:3.140 lr:0.0000100 epoch_Time:28332.0min: [2024-01-03 02:34:31,110][model8_pretrain.py][INFO] Epoch:[0/2](85000/4588595) loss:3.295 lr:0.0000100 epoch_Time:28331.0min: [2024-01-03 02:34:31,110][model8_pretrain.py][INFO] Epoch:[0/2](85000/4588595) loss:2.540 lr:0.0000100 epoch_Time:28331.0min: [2024-01-03 02:34:31,110][model8_pretrain.py][INFO] Epoch:[0/2](85000/4588595) loss:3.407 lr:0.0000100 epoch_Time:28331.0min: [2024-01-03 02:34:31,110][model8_pretrain.py][INFO] Epoch:[0/2](85000/4588595) loss:2.606 lr:0.0000100 epoch_Time:28332.0min: [2024-01-03 02:34:31,110][model8_pretrain.py][INFO] Epoch:[0/2](85000/4588595) loss:3.041 lr:0.0000100 epoch_Time:28331.0min: [2024-01-03 02:34:31,110][model8_pretrain.py][INFO] Epoch:[0/2](85000/4588595) loss:2.829 lr:0.0000100 epoch_Time:28331.0min: [2024-01-03 02:34:31,110][model8_pretrain.py][INFO] Epoch:[0/2](85000/4588595) loss:3.209 lr:0.0000100 epoch_Time:28331.0min: [2024-01-03 02:34:31,110][model8_pretrain.py][INFO] Epoch:[0/2](85000/4588595) loss:2.692 lr:0.0000100 epoch_Time:28331.0min: [2024-01-03 02:35:08,044][model8_pretrain.py][INFO] Epoch:[0/2](85100/4588595) loss:3.107 lr:0.0000100 
epoch_Time:28330.0min: [2024-01-03 02:35:08,044][model8_pretrain.py][INFO] Epoch:[0/2](85100/4588595) loss:2.633 lr:0.0000100 epoch_Time:28330.0min: [2024-01-03 02:35:08,044][model8_pretrain.py][INFO] Epoch:[0/2](85100/4588595) loss:3.045 lr:0.0000100 epoch_Time:28330.0min: [2024-01-03 02:35:08,044][model8_pretrain.py][INFO] Epoch:[0/2](85100/4588595) loss:2.915 lr:0.0000100 epoch_Time:28330.0min: [2024-01-03 02:35:08,044][model8_pretrain.py][INFO] Epoch:[0/2](85100/4588595) loss:2.942 lr:0.0000100 epoch_Time:28330.0min: [2024-01-03 02:35:08,044][model8_pretrain.py][INFO] Epoch:[0/2](85100/4588595) loss:3.001 lr:0.0000100 epoch_Time:28330.0min: [2024-01-03 02:35:08,045][model8_pretrain.py][INFO] Epoch:[0/2](85100/4588595) loss:3.295 lr:0.0000100 epoch_Time:28330.0min: [2024-01-03 02:35:08,045][model8_pretrain.py][INFO] Epoch:[0/2](85100/4588595) loss:3.114 lr:0.0000100 epoch_Time:28330.0min: [2024-01-03 02:35:44,988][model8_pretrain.py][INFO] Epoch:[0/2](85200/4588595) loss:3.144 lr:0.0000100 epoch_Time:28329.0min: [2024-01-03 02:35:44,988][model8_pretrain.py][INFO] Epoch:[0/2](85200/4588595) loss:3.389 lr:0.0000100 epoch_Time:28329.0min: [2024-01-03 02:35:44,988][model8_pretrain.py][INFO] Epoch:[0/2](85200/4588595) loss:3.208 lr:0.0000100 epoch_Time:28329.0min: [2024-01-03 02:35:44,988][model8_pretrain.py][INFO] Epoch:[0/2](85200/4588595) loss:3.240 lr:0.0000100 epoch_Time:28329.0min: [2024-01-03 02:35:44,988][model8_pretrain.py][INFO] Epoch:[0/2](85200/4588595) loss:2.840 lr:0.0000100 epoch_Time:28329.0min: [2024-01-03 02:35:44,988][model8_pretrain.py][INFO] Epoch:[0/2](85200/4588595) loss:2.233 lr:0.0000100 epoch_Time:28329.0min: [2024-01-03 02:35:44,988][model8_pretrain.py][INFO] Epoch:[0/2](85200/4588595) loss:3.060 lr:0.0000100 epoch_Time:28329.0min: [2024-01-03 02:35:44,988][model8_pretrain.py][INFO] Epoch:[0/2](85200/4588595) loss:2.924 lr:0.0000100 epoch_Time:28329.0min: [2024-01-03 02:36:21,937][model8_pretrain.py][INFO] Epoch:[0/2](85300/4588595) 
loss:3.044 lr:0.0000100 epoch_Time:28327.0min: [2024-01-03 02:36:21,937][model8_pretrain.py][INFO] Epoch:[0/2](85300/4588595) loss:3.355 lr:0.0000100 epoch_Time:28327.0min: [2024-01-03 02:36:21,937][model8_pretrain.py][INFO] Epoch:[0/2](85300/4588595) loss:2.878 lr:0.0000100 epoch_Time:28327.0min: [2024-01-03 02:36:21,937][model8_pretrain.py][INFO] Epoch:[0/2](85300/4588595) loss:3.282 lr:0.0000100 epoch_Time:28327.0min: [2024-01-03 02:36:21,937][model8_pretrain.py][INFO] Epoch:[0/2](85300/4588595) loss:2.752 lr:0.0000100 epoch_Time:28327.0min: [2024-01-03 02:36:21,937][model8_pretrain.py][INFO] Epoch:[0/2](85300/4588595) loss:3.113 lr:0.0000100 epoch_Time:28327.0min: [2024-01-03 02:36:21,937][model8_pretrain.py][INFO] Epoch:[0/2](85300/4588595) loss:2.517 lr:0.0000100 epoch_Time:28327.0min: [2024-01-03 02:36:21,937][model8_pretrain.py][INFO] Epoch:[0/2](85300/4588595) loss:3.464 lr:0.0000100 epoch_Time:28327.0min: [2024-01-03 02:36:58,901][model8_pretrain.py][INFO] Epoch:[0/2](85400/4588595) loss:3.296 lr:0.0000100 epoch_Time:28326.0min: [2024-01-03 02:36:58,901][model8_pretrain.py][INFO] Epoch:[0/2](85400/4588595) loss:2.684 lr:0.0000100 epoch_Time:28326.0min: [2024-01-03 02:36:58,901][model8_pretrain.py][INFO] Epoch:[0/2](85400/4588595) loss:2.727 lr:0.0000100 epoch_Time:28326.0min: [2024-01-03 02:36:58,901][model8_pretrain.py][INFO] Epoch:[0/2](85400/4588595) loss:2.839 lr:0.0000100 epoch_Time:28326.0min: [2024-01-03 02:36:58,901][model8_pretrain.py][INFO] Epoch:[0/2](85400/4588595) loss:3.072 lr:0.0000100 epoch_Time:28326.0min: [2024-01-03 02:36:58,901][model8_pretrain.py][INFO] Epoch:[0/2](85400/4588595) loss:3.112 lr:0.0000100 epoch_Time:28326.0min: [2024-01-03 02:36:58,901][model8_pretrain.py][INFO] Epoch:[0/2](85400/4588595) loss:3.007 lr:0.0000100 epoch_Time:28326.0min: [2024-01-03 02:36:58,901][model8_pretrain.py][INFO] Epoch:[0/2](85400/4588595) loss:2.723 lr:0.0000100 epoch_Time:28326.0min: [2024-01-03 02:37:35,851][model8_pretrain.py][INFO] 
Epoch:[0/2](85500/4588595) loss:2.712 lr:0.0000100 epoch_Time:28325.0min: [2024-01-03 02:37:35,851][model8_pretrain.py][INFO] Epoch:[0/2](85500/4588595) loss:3.139 lr:0.0000100 epoch_Time:28325.0min: [2024-01-03 02:37:35,851][model8_pretrain.py][INFO] Epoch:[0/2](85500/4588595) loss:2.792 lr:0.0000100 epoch_Time:28325.0min: [2024-01-03 02:37:35,851][model8_pretrain.py][INFO] Epoch:[0/2](85500/4588595) loss:3.304 lr:0.0000100 epoch_Time:28325.0min: [2024-01-03 02:37:35,851][model8_pretrain.py][INFO] Epoch:[0/2](85500/4588595) loss:2.629 lr:0.0000100 epoch_Time:28325.0min: [2024-01-03 02:37:35,851][model8_pretrain.py][INFO] Epoch:[0/2](85500/4588595) loss:3.193 lr:0.0000100 epoch_Time:28325.0min: [2024-01-03 02:37:35,851][model8_pretrain.py][INFO] Epoch:[0/2](85500/4588595) loss:3.335 lr:0.0000100 epoch_Time:28325.0min: [2024-01-03 02:37:35,851][model8_pretrain.py][INFO] Epoch:[0/2](85500/4588595) loss:3.003 lr:0.0000100 epoch_Time:28325.0min: [2024-01-03 02:38:20,165][model8_pretrain.py][INFO] Epoch:[0/2](85600/4588595) loss:3.294 lr:0.0000100 epoch_Time:28330.0min: [2024-01-03 02:38:20,165][model8_pretrain.py][INFO] Epoch:[0/2](85600/4588595) loss:3.132 lr:0.0000100 epoch_Time:28330.0min: [2024-01-03 02:38:20,165][model8_pretrain.py][INFO] Epoch:[0/2](85600/4588595) loss:3.158 lr:0.0000100 epoch_Time:28330.0min: [2024-01-03 02:38:20,165][model8_pretrain.py][INFO] Epoch:[0/2](85600/4588595) loss:3.257 lr:0.0000100 epoch_Time:28330.0min: [2024-01-03 02:38:20,165][model8_pretrain.py][INFO] Epoch:[0/2](85600/4588595) loss:2.915 lr:0.0000100 epoch_Time:28330.0min: [2024-01-03 02:38:20,165][model8_pretrain.py][INFO] Epoch:[0/2](85600/4588595) loss:2.931 lr:0.0000100 epoch_Time:28330.0min: [2024-01-03 02:38:20,165][model8_pretrain.py][INFO] Epoch:[0/2](85600/4588595) loss:3.421 lr:0.0000100 epoch_Time:28330.0min: [2024-01-03 02:38:20,165][model8_pretrain.py][INFO] Epoch:[0/2](85600/4588595) loss:2.948 lr:0.0000100 epoch_Time:28330.0min: [2024-01-03 
02:38:57,095][model8_pretrain.py][INFO] Epoch:[0/2](85700/4588595) loss:2.901 lr:0.0000100 epoch_Time:28328.0min: [2024-01-03 02:38:57,095][model8_pretrain.py][INFO] Epoch:[0/2](85700/4588595) loss:2.866 lr:0.0000100 epoch_Time:28328.0min: [2024-01-03 02:38:57,095][model8_pretrain.py][INFO] Epoch:[0/2](85700/4588595) loss:3.322 lr:0.0000100 epoch_Time:28328.0min: [2024-01-03 02:38:57,095][model8_pretrain.py][INFO] Epoch:[0/2](85700/4588595) loss:2.887 lr:0.0000100 epoch_Time:28328.0min: [2024-01-03 02:38:57,095][model8_pretrain.py][INFO] Epoch:[0/2](85700/4588595) loss:3.156 lr:0.0000100 epoch_Time:28328.0min: [2024-01-03 02:38:57,095][model8_pretrain.py][INFO] Epoch:[0/2](85700/4588595) loss:2.988 lr:0.0000100 epoch_Time:28328.0min: [2024-01-03 02:38:57,095][model8_pretrain.py][INFO] Epoch:[0/2](85700/4588595) loss:3.059 lr:0.0000100 epoch_Time:28328.0min: [2024-01-03 02:38:57,095][model8_pretrain.py][INFO] Epoch:[0/2](85700/4588595) loss:2.927 lr:0.0000100 epoch_Time:28328.0min: [2024-01-03 02:39:34,040][model8_pretrain.py][INFO] Epoch:[0/2](85800/4588595) loss:2.901 lr:0.0000100 epoch_Time:28327.0min: [2024-01-03 02:39:34,040][model8_pretrain.py][INFO] Epoch:[0/2](85800/4588595) loss:2.818 lr:0.0000100 epoch_Time:28327.0min: [2024-01-03 02:39:34,040][model8_pretrain.py][INFO] Epoch:[0/2](85800/4588595) loss:2.895 lr:0.0000100 epoch_Time:28327.0min: [2024-01-03 02:39:34,040][model8_pretrain.py][INFO] Epoch:[0/2](85800/4588595) loss:2.816 lr:0.0000100 epoch_Time:28327.0min: [2024-01-03 02:39:34,040][model8_pretrain.py][INFO] Epoch:[0/2](85800/4588595) loss:3.208 lr:0.0000100 epoch_Time:28327.0min: [2024-01-03 02:39:34,040][model8_pretrain.py][INFO] Epoch:[0/2](85800/4588595) loss:2.846 lr:0.0000100 epoch_Time:28327.0min: [2024-01-03 02:39:34,040][model8_pretrain.py][INFO] Epoch:[0/2](85800/4588595) loss:2.992 lr:0.0000100 epoch_Time:28327.0min: [2024-01-03 02:39:34,041][model8_pretrain.py][INFO] Epoch:[0/2](85800/4588595) loss:3.305 lr:0.0000100 
epoch_Time:28327.0min: [2024-01-03 02:40:10,999][model8_pretrain.py][INFO] Epoch:[0/2](85900/4588595) loss:3.002 lr:0.0000100 epoch_Time:28326.0min: [2024-01-03 02:40:10,999][model8_pretrain.py][INFO] Epoch:[0/2](85900/4588595) loss:3.026 lr:0.0000100 epoch_Time:28326.0min: [2024-01-03 02:40:10,999][model8_pretrain.py][INFO] Epoch:[0/2](85900/4588595) loss:2.855 lr:0.0000100 epoch_Time:28326.0min: [2024-01-03 02:40:11,000][model8_pretrain.py][INFO] Epoch:[0/2](85900/4588595) loss:2.872 lr:0.0000100 epoch_Time:28326.0min: [2024-01-03 02:40:11,000][model8_pretrain.py][INFO] Epoch:[0/2](85900/4588595) loss:3.207 lr:0.0000100 epoch_Time:28326.0min: [2024-01-03 02:40:11,000][model8_pretrain.py][INFO] Epoch:[0/2](85900/4588595) loss:3.438 lr:0.0000100 epoch_Time:28326.0min: [2024-01-03 02:40:11,000][model8_pretrain.py][INFO] Epoch:[0/2](85900/4588595) loss:3.223 lr:0.0000100 epoch_Time:28326.0min: [2024-01-03 02:40:11,000][model8_pretrain.py][INFO] Epoch:[0/2](85900/4588595) loss:3.101 lr:0.0000100 epoch_Time:28326.0min: [2024-01-03 02:40:47,975][model8_pretrain.py][INFO] Epoch:[0/2](86000/4588595) loss:3.214 lr:0.0000100 epoch_Time:28324.0min: [2024-01-03 02:40:47,975][model8_pretrain.py][INFO] Epoch:[0/2](86000/4588595) loss:2.769 lr:0.0000100 epoch_Time:28324.0min: [2024-01-03 02:40:47,975][model8_pretrain.py][INFO] Epoch:[0/2](86000/4588595) loss:2.987 lr:0.0000100 epoch_Time:28324.0min: [2024-01-03 02:40:47,975][model8_pretrain.py][INFO] Epoch:[0/2](86000/4588595) loss:3.266 lr:0.0000100 epoch_Time:28324.0min: [2024-01-03 02:40:47,975][model8_pretrain.py][INFO] Epoch:[0/2](86000/4588595) loss:3.305 lr:0.0000100 epoch_Time:28324.0min: [2024-01-03 02:40:47,975][model8_pretrain.py][INFO] Epoch:[0/2](86000/4588595) loss:3.189 lr:0.0000100 epoch_Time:28324.0min: [2024-01-03 02:40:47,975][model8_pretrain.py][INFO] Epoch:[0/2](86000/4588595) loss:2.833 lr:0.0000100 epoch_Time:28324.0min: [2024-01-03 02:40:47,975][model8_pretrain.py][INFO] Epoch:[0/2](86000/4588595) 
loss:3.437 lr:0.0000100 epoch_Time:28324.0min: [2024-01-03 02:41:24,928][model8_pretrain.py][INFO] Epoch:[0/2](86100/4588595) loss:2.789 lr:0.0000100 epoch_Time:28323.0min: [2024-01-03 02:41:24,928][model8_pretrain.py][INFO] Epoch:[0/2](86100/4588595) loss:3.203 lr:0.0000100 epoch_Time:28323.0min: [2024-01-03 02:41:24,928][model8_pretrain.py][INFO] Epoch:[0/2](86100/4588595) loss:2.870 lr:0.0000100 epoch_Time:28323.0min: [2024-01-03 02:41:24,928][model8_pretrain.py][INFO] Epoch:[0/2](86100/4588595) loss:2.905 lr:0.0000100 epoch_Time:28323.0min: [2024-01-03 02:41:24,928][model8_pretrain.py][INFO] Epoch:[0/2](86100/4588595) loss:2.891 lr:0.0000100 epoch_Time:28323.0min: [2024-01-03 02:41:24,928][model8_pretrain.py][INFO] Epoch:[0/2](86100/4588595) loss:3.277 lr:0.0000100 epoch_Time:28323.0min: [2024-01-03 02:41:24,928][model8_pretrain.py][INFO] Epoch:[0/2](86100/4588595) loss:3.343 lr:0.0000100 epoch_Time:28323.0min: [2024-01-03 02:41:24,928][model8_pretrain.py][INFO] Epoch:[0/2](86100/4588595) loss:3.230 lr:0.0000100 epoch_Time:28323.0min: [2024-01-03 02:42:01,865][model8_pretrain.py][INFO] Epoch:[0/2](86200/4588595) loss:2.642 lr:0.0000100 epoch_Time:28322.0min: [2024-01-03 02:42:01,865][model8_pretrain.py][INFO] Epoch:[0/2](86200/4588595) loss:3.510 lr:0.0000100 epoch_Time:28322.0min: [2024-01-03 02:42:01,865][model8_pretrain.py][INFO] Epoch:[0/2](86200/4588595) loss:3.117 lr:0.0000100 epoch_Time:28322.0min: [2024-01-03 02:42:01,865][model8_pretrain.py][INFO] Epoch:[0/2](86200/4588595) loss:3.062 lr:0.0000100 epoch_Time:28322.0min: [2024-01-03 02:42:01,865][model8_pretrain.py][INFO] Epoch:[0/2](86200/4588595) loss:3.219 lr:0.0000100 epoch_Time:28322.0min: [2024-01-03 02:42:01,865][model8_pretrain.py][INFO] Epoch:[0/2](86200/4588595) loss:3.534 lr:0.0000100 epoch_Time:28322.0min: [2024-01-03 02:42:01,865][model8_pretrain.py][INFO] Epoch:[0/2](86200/4588595) loss:2.181 lr:0.0000100 epoch_Time:28322.0min: [2024-01-03 02:42:01,865][model8_pretrain.py][INFO] 
Epoch:[0/2](86200/4588595) loss:2.960 lr:0.0000100 epoch_Time:28322.0min: [2024-01-03 02:42:38,824][model8_pretrain.py][INFO] Epoch:[0/2](86300/4588595) loss:2.772 lr:0.0000100 epoch_Time:28321.0min: [2024-01-03 02:42:38,824][model8_pretrain.py][INFO] Epoch:[0/2](86300/4588595) loss:2.884 lr:0.0000100 epoch_Time:28321.0min: [2024-01-03 02:42:38,824][model8_pretrain.py][INFO] Epoch:[0/2](86300/4588595) loss:3.344 lr:0.0000100 epoch_Time:28321.0min: [2024-01-03 02:42:38,824][model8_pretrain.py][INFO] Epoch:[0/2](86300/4588595) loss:3.051 lr:0.0000100 epoch_Time:28321.0min: [2024-01-03 02:42:38,824][model8_pretrain.py][INFO] Epoch:[0/2](86300/4588595) loss:3.242 lr:0.0000100 epoch_Time:28321.0min: [2024-01-03 02:42:38,824][model8_pretrain.py][INFO] Epoch:[0/2](86300/4588595) loss:2.948 lr:0.0000100 epoch_Time:28321.0min: [2024-01-03 02:42:38,824][model8_pretrain.py][INFO] Epoch:[0/2](86300/4588595) loss:3.344 lr:0.0000100 epoch_Time:28321.0min: [2024-01-03 02:42:38,825][model8_pretrain.py][INFO] Epoch:[0/2](86300/4588595) loss:2.966 lr:0.0000100 epoch_Time:28321.0min: [2024-01-03 02:43:21,080][model8_pretrain.py][INFO] Epoch:[0/2](86400/4588595) loss:3.125 lr:0.0000100 epoch_Time:28324.0min: [2024-01-03 02:43:21,080][model8_pretrain.py][INFO] Epoch:[0/2](86400/4588595) loss:2.855 lr:0.0000100 epoch_Time:28324.0min: [2024-01-03 02:43:21,080][model8_pretrain.py][INFO] Epoch:[0/2](86400/4588595) loss:3.028 lr:0.0000100 epoch_Time:28324.0min: [2024-01-03 02:43:21,080][model8_pretrain.py][INFO] Epoch:[0/2](86400/4588595) loss:2.888 lr:0.0000100 epoch_Time:28324.0min: [2024-01-03 02:43:21,084][model8_pretrain.py][INFO] Epoch:[0/2](86400/4588595) loss:3.154 lr:0.0000100 epoch_Time:28324.0min: [2024-01-03 02:43:21,085][model8_pretrain.py][INFO] Epoch:[0/2](86400/4588595) loss:3.367 lr:0.0000100 epoch_Time:28324.0min: [2024-01-03 02:43:21,085][model8_pretrain.py][INFO] Epoch:[0/2](86400/4588595) loss:2.961 lr:0.0000100 epoch_Time:28324.0min: [2024-01-03 
02:43:21,085][model8_pretrain.py][INFO] Epoch:[0/2](86400/4588595) loss:2.964 lr:0.0000100 epoch_Time:28324.0min: [2024-01-03 02:43:59,696][model8_pretrain.py][INFO] Epoch:[0/2](86500/4588595) loss:2.770 lr:0.0000100 epoch_Time:28324.0min: [2024-01-03 02:43:59,696][model8_pretrain.py][INFO] Epoch:[0/2](86500/4588595) loss:2.735 lr:0.0000100 epoch_Time:28324.0min: [2024-01-03 02:43:59,696][model8_pretrain.py][INFO] Epoch:[0/2](86500/4588595) loss:2.713 lr:0.0000100 epoch_Time:28324.0min: [2024-01-03 02:43:59,696][model8_pretrain.py][INFO] Epoch:[0/2](86500/4588595) loss:2.963 lr:0.0000100 epoch_Time:28324.0min: [2024-01-03 02:43:59,696][model8_pretrain.py][INFO] Epoch:[0/2](86500/4588595) loss:3.077 lr:0.0000100 epoch_Time:28324.0min: [2024-01-03 02:43:59,696][model8_pretrain.py][INFO] Epoch:[0/2](86500/4588595) loss:3.130 lr:0.0000100 epoch_Time:28324.0min: [2024-01-03 02:43:59,696][model8_pretrain.py][INFO] Epoch:[0/2](86500/4588595) loss:3.570 lr:0.0000100 epoch_Time:28324.0min: [2024-01-03 02:43:59,696][model8_pretrain.py][INFO] Epoch:[0/2](86500/4588595) loss:2.922 lr:0.0000100 epoch_Time:28324.0min: [2024-01-03 02:44:36,601][model8_pretrain.py][INFO] Epoch:[0/2](86600/4588595) loss:3.261 lr:0.0000100 epoch_Time:28323.0min: [2024-01-03 02:44:36,601][model8_pretrain.py][INFO] Epoch:[0/2](86600/4588595) loss:2.913 lr:0.0000100 epoch_Time:28323.0min: [2024-01-03 02:44:36,601][model8_pretrain.py][INFO] Epoch:[0/2](86600/4588595) loss:2.441 lr:0.0000100 epoch_Time:28323.0min: [2024-01-03 02:44:36,602][model8_pretrain.py][INFO] Epoch:[0/2](86600/4588595) loss:2.702 lr:0.0000100 epoch_Time:28323.0min: [2024-01-03 02:44:36,602][model8_pretrain.py][INFO] Epoch:[0/2](86600/4588595) loss:3.054 lr:0.0000100 epoch_Time:28323.0min: [2024-01-03 02:44:36,603][model8_pretrain.py][INFO] Epoch:[0/2](86600/4588595) loss:3.336 lr:0.0000100 epoch_Time:28323.0min: [2024-01-03 02:44:36,603][model8_pretrain.py][INFO] Epoch:[0/2](86600/4588595) loss:3.393 lr:0.0000100 
epoch_Time:28323.0min: [2024-01-03 02:44:36,605][model8_pretrain.py][INFO] Epoch:[0/2](86600/4588595) loss:3.079 lr:0.0000100 epoch_Time:28323.0min: [2024-01-03 02:45:13,540][model8_pretrain.py][INFO] Epoch:[0/2](86700/4588595) loss:3.094 lr:0.0000100 epoch_Time:28321.0min: [2024-01-03 02:45:13,540][model8_pretrain.py][INFO] Epoch:[0/2](86700/4588595) loss:3.281 lr:0.0000100 epoch_Time:28321.0min: [2024-01-03 02:45:13,540][model8_pretrain.py][INFO] Epoch:[0/2](86700/4588595) loss:3.032 lr:0.0000100 epoch_Time:28321.0min: [2024-01-03 02:45:13,540][model8_pretrain.py][INFO] Epoch:[0/2](86700/4588595) loss:2.716 lr:0.0000100 epoch_Time:28321.0min: [2024-01-03 02:45:13,540][model8_pretrain.py][INFO] Epoch:[0/2](86700/4588595) loss:3.030 lr:0.0000100 epoch_Time:28321.0min: [2024-01-03 02:45:13,540][model8_pretrain.py][INFO] Epoch:[0/2](86700/4588595) loss:2.901 lr:0.0000100 epoch_Time:28321.0min: [2024-01-03 02:45:13,540][model8_pretrain.py][INFO] Epoch:[0/2](86700/4588595) loss:2.614 lr:0.0000100 epoch_Time:28321.0min: [2024-01-03 02:45:13,540][model8_pretrain.py][INFO] Epoch:[0/2](86700/4588595) loss:3.006 lr:0.0000100 epoch_Time:28321.0min: [2024-01-03 02:45:50,498][model8_pretrain.py][INFO] Epoch:[0/2](86800/4588595) loss:2.946 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:45:50,498][model8_pretrain.py][INFO] Epoch:[0/2](86800/4588595) loss:2.999 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:45:50,498][model8_pretrain.py][INFO] Epoch:[0/2](86800/4588595) loss:3.121 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:45:50,498][model8_pretrain.py][INFO] Epoch:[0/2](86800/4588595) loss:2.820 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:45:50,498][model8_pretrain.py][INFO] Epoch:[0/2](86800/4588595) loss:2.656 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:45:50,498][model8_pretrain.py][INFO] Epoch:[0/2](86800/4588595) loss:2.995 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:45:50,498][model8_pretrain.py][INFO] Epoch:[0/2](86800/4588595) 
loss:2.741 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:45:50,498][model8_pretrain.py][INFO] Epoch:[0/2](86800/4588595) loss:3.359 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:46:27,469][model8_pretrain.py][INFO] Epoch:[0/2](86900/4588595) loss:3.002 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:46:27,469][model8_pretrain.py][INFO] Epoch:[0/2](86900/4588595) loss:2.886 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:46:27,469][model8_pretrain.py][INFO] Epoch:[0/2](86900/4588595) loss:3.266 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:46:27,469][model8_pretrain.py][INFO] Epoch:[0/2](86900/4588595) loss:2.701 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:46:27,469][model8_pretrain.py][INFO] Epoch:[0/2](86900/4588595) loss:2.411 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:46:27,469][model8_pretrain.py][INFO] Epoch:[0/2](86900/4588595) loss:3.039 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:46:27,470][model8_pretrain.py][INFO] Epoch:[0/2](86900/4588595) loss:3.047 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:46:27,470][model8_pretrain.py][INFO] Epoch:[0/2](86900/4588595) loss:2.811 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:47:04,406][model8_pretrain.py][INFO] Epoch:[0/2](87000/4588595) loss:3.138 lr:0.0000100 epoch_Time:28317.0min: [2024-01-03 02:47:04,406][model8_pretrain.py][INFO] Epoch:[0/2](87000/4588595) loss:3.355 lr:0.0000100 epoch_Time:28317.0min: [2024-01-03 02:47:04,406][model8_pretrain.py][INFO] Epoch:[0/2](87000/4588595) loss:3.577 lr:0.0000100 epoch_Time:28317.0min: [2024-01-03 02:47:04,406][model8_pretrain.py][INFO] Epoch:[0/2](87000/4588595) loss:2.720 lr:0.0000100 epoch_Time:28317.0min: [2024-01-03 02:47:04,406][model8_pretrain.py][INFO] Epoch:[0/2](87000/4588595) loss:3.396 lr:0.0000100 epoch_Time:28317.0min: [2024-01-03 02:47:04,406][model8_pretrain.py][INFO] Epoch:[0/2](87000/4588595) loss:3.057 lr:0.0000100 epoch_Time:28317.0min: [2024-01-03 02:47:04,406][model8_pretrain.py][INFO] 
Epoch:[0/2](87000/4588595) loss:2.945 lr:0.0000100 epoch_Time:28317.0min: [2024-01-03 02:47:04,407][model8_pretrain.py][INFO] Epoch:[0/2](87000/4588595) loss:2.727 lr:0.0000100 epoch_Time:28317.0min: [2024-01-03 02:47:41,346][model8_pretrain.py][INFO] Epoch:[0/2](87100/4588595) loss:3.308 lr:0.0000100 epoch_Time:28316.0min: [2024-01-03 02:47:41,346][model8_pretrain.py][INFO] Epoch:[0/2](87100/4588595) loss:3.357 lr:0.0000100 epoch_Time:28316.0min: [2024-01-03 02:47:41,346][model8_pretrain.py][INFO] Epoch:[0/2](87100/4588595) loss:3.028 lr:0.0000100 epoch_Time:28316.0min: [2024-01-03 02:47:41,346][model8_pretrain.py][INFO] Epoch:[0/2](87100/4588595) loss:3.284 lr:0.0000100 epoch_Time:28316.0min: [2024-01-03 02:47:41,346][model8_pretrain.py][INFO] Epoch:[0/2](87100/4588595) loss:3.177 lr:0.0000100 epoch_Time:28316.0min: [2024-01-03 02:47:41,346][model8_pretrain.py][INFO] Epoch:[0/2](87100/4588595) loss:2.435 lr:0.0000100 epoch_Time:28316.0min: [2024-01-03 02:47:41,346][model8_pretrain.py][INFO] Epoch:[0/2](87100/4588595) loss:3.024 lr:0.0000100 epoch_Time:28316.0min: [2024-01-03 02:47:41,346][model8_pretrain.py][INFO] Epoch:[0/2](87100/4588595) loss:2.637 lr:0.0000100 epoch_Time:28316.0min: [2024-01-03 02:48:20,094][model8_pretrain.py][INFO] Epoch:[0/2](87200/4588595) loss:2.839 lr:0.0000100 epoch_Time:28316.0min: [2024-01-03 02:48:20,094][model8_pretrain.py][INFO] Epoch:[0/2](87200/4588595) loss:2.741 lr:0.0000100 epoch_Time:28316.0min: [2024-01-03 02:48:20,094][model8_pretrain.py][INFO] Epoch:[0/2](87200/4588595) loss:2.963 lr:0.0000100 epoch_Time:28316.0min: [2024-01-03 02:48:20,094][model8_pretrain.py][INFO] Epoch:[0/2](87200/4588595) loss:3.384 lr:0.0000100 epoch_Time:28316.0min: [2024-01-03 02:48:20,095][model8_pretrain.py][INFO] Epoch:[0/2](87200/4588595) loss:3.067 lr:0.0000100 epoch_Time:28316.0min: [2024-01-03 02:48:20,095][model8_pretrain.py][INFO] Epoch:[0/2](87200/4588595) loss:3.267 lr:0.0000100 epoch_Time:28316.0min: [2024-01-03 
02:48:20,095][model8_pretrain.py][INFO] Epoch:[0/2](87200/4588595) loss:2.831 lr:0.0000100 epoch_Time:28316.0min: [2024-01-03 02:48:20,095][model8_pretrain.py][INFO] Epoch:[0/2](87200/4588595) loss:2.910 lr:0.0000100 epoch_Time:28316.0min: [2024-01-03 02:49:02,395][model8_pretrain.py][INFO] Epoch:[0/2](87300/4588595) loss:3.086 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:49:02,395][model8_pretrain.py][INFO] Epoch:[0/2](87300/4588595) loss:2.865 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:49:02,395][model8_pretrain.py][INFO] Epoch:[0/2](87300/4588595) loss:2.741 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:49:02,395][model8_pretrain.py][INFO] Epoch:[0/2](87300/4588595) loss:3.485 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:49:02,395][model8_pretrain.py][INFO] Epoch:[0/2](87300/4588595) loss:2.928 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:49:02,395][model8_pretrain.py][INFO] Epoch:[0/2](87300/4588595) loss:3.210 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:49:02,395][model8_pretrain.py][INFO] Epoch:[0/2](87300/4588595) loss:2.479 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:49:02,395][model8_pretrain.py][INFO] Epoch:[0/2](87300/4588595) loss:2.884 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:49:39,323][model8_pretrain.py][INFO] Epoch:[0/2](87400/4588595) loss:2.937 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:49:39,323][model8_pretrain.py][INFO] Epoch:[0/2](87400/4588595) loss:2.823 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:49:39,323][model8_pretrain.py][INFO] Epoch:[0/2](87400/4588595) loss:2.773 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:49:39,323][model8_pretrain.py][INFO] Epoch:[0/2](87400/4588595) loss:2.803 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:49:39,323][model8_pretrain.py][INFO] Epoch:[0/2](87400/4588595) loss:2.586 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:49:39,323][model8_pretrain.py][INFO] Epoch:[0/2](87400/4588595) loss:3.198 lr:0.0000100 
epoch_Time:28319.0min: [2024-01-03 02:49:39,323][model8_pretrain.py][INFO] Epoch:[0/2](87400/4588595) loss:3.338 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:49:39,323][model8_pretrain.py][INFO] Epoch:[0/2](87400/4588595) loss:3.284 lr:0.0000100 epoch_Time:28319.0min: [2024-01-03 02:50:16,260][model8_pretrain.py][INFO] Epoch:[0/2](87500/4588595) loss:3.214 lr:0.0000100 epoch_Time:28317.0min: [2024-01-03 02:50:16,260][model8_pretrain.py][INFO] Epoch:[0/2](87500/4588595) loss:2.741 lr:0.0000100 epoch_Time:28317.0min: [2024-01-03 02:50:16,260][model8_pretrain.py][INFO] Epoch:[0/2](87500/4588595) loss:2.612 lr:0.0000100 epoch_Time:28317.0min: [2024-01-03 02:50:16,260][model8_pretrain.py][INFO] Epoch:[0/2](87500/4588595) loss:2.723 lr:0.0000100 epoch_Time:28317.0min: [2024-01-03 02:50:16,260][model8_pretrain.py][INFO] Epoch:[0/2](87500/4588595) loss:3.017 lr:0.0000100 epoch_Time:28317.0min: [2024-01-03 02:50:16,260][model8_pretrain.py][INFO] Epoch:[0/2](87500/4588595) loss:3.312 lr:0.0000100 epoch_Time:28317.0min: [2024-01-03 02:50:16,260][model8_pretrain.py][INFO] Epoch:[0/2](87500/4588595) loss:2.433 lr:0.0000100 epoch_Time:28317.0min: [2024-01-03 02:50:16,260][model8_pretrain.py][INFO] Epoch:[0/2](87500/4588595) loss:3.398 lr:0.0000100 epoch_Time:28317.0min: [2024-01-03 02:50:53,210][model8_pretrain.py][INFO] Epoch:[0/2](87600/4588595) loss:3.202 lr:0.0000100 epoch_Time:28315.0min: [2024-01-03 02:50:53,210][model8_pretrain.py][INFO] Epoch:[0/2](87600/4588595) loss:2.439 lr:0.0000100 epoch_Time:28315.0min: [2024-01-03 02:50:53,210][model8_pretrain.py][INFO] Epoch:[0/2](87600/4588595) loss:2.924 lr:0.0000100 epoch_Time:28315.0min: [2024-01-03 02:50:53,210][model8_pretrain.py][INFO] Epoch:[0/2](87600/4588595) loss:3.046 lr:0.0000100 epoch_Time:28315.0min: [2024-01-03 02:50:53,210][model8_pretrain.py][INFO] Epoch:[0/2](87600/4588595) loss:2.805 lr:0.0000100 epoch_Time:28315.0min: [2024-01-03 02:50:53,210][model8_pretrain.py][INFO] Epoch:[0/2](87600/4588595) 
loss:3.185 lr:0.0000100 epoch_Time:28315.0min: [2024-01-03 02:50:53,210][model8_pretrain.py][INFO] Epoch:[0/2](87600/4588595) loss:3.516 lr:0.0000100 epoch_Time:28315.0min: [2024-01-03 02:50:53,210][model8_pretrain.py][INFO] Epoch:[0/2](87600/4588595) loss:2.825 lr:0.0000100 epoch_Time:28315.0min: [2024-01-03 02:51:30,157][model8_pretrain.py][INFO] Epoch:[0/2](87700/4588595) loss:3.069 lr:0.0000100 epoch_Time:28314.0min: [2024-01-03 02:51:30,157][model8_pretrain.py][INFO] Epoch:[0/2](87700/4588595) loss:2.983 lr:0.0000100 epoch_Time:28314.0min: [2024-01-03 02:51:30,157][model8_pretrain.py][INFO] Epoch:[0/2](87700/4588595) loss:2.674 lr:0.0000100 epoch_Time:28314.0min: [2024-01-03 02:51:30,157][model8_pretrain.py][INFO] Epoch:[0/2](87700/4588595) loss:3.017 lr:0.0000100 epoch_Time:28314.0min: [2024-01-03 02:51:30,157][model8_pretrain.py][INFO] Epoch:[0/2](87700/4588595) loss:2.942 lr:0.0000100 epoch_Time:28314.0min: [2024-01-03 02:51:30,157][model8_pretrain.py][INFO] Epoch:[0/2](87700/4588595) loss:2.977 lr:0.0000100 epoch_Time:28314.0min: [2024-01-03 02:51:30,157][model8_pretrain.py][INFO] Epoch:[0/2](87700/4588595) loss:3.447 lr:0.0000100 epoch_Time:28314.0min: [2024-01-03 02:51:30,157][model8_pretrain.py][INFO] Epoch:[0/2](87700/4588595) loss:2.400 lr:0.0000100 epoch_Time:28314.0min: [2024-01-03 02:52:07,112][model8_pretrain.py][INFO] Epoch:[0/2](87800/4588595) loss:2.811 lr:0.0000100 epoch_Time:28313.0min: [2024-01-03 02:52:07,112][model8_pretrain.py][INFO] Epoch:[0/2](87800/4588595) loss:2.920 lr:0.0000100 epoch_Time:28313.0min: [2024-01-03 02:52:07,112][model8_pretrain.py][INFO] Epoch:[0/2](87800/4588595) loss:2.721 lr:0.0000100 epoch_Time:28313.0min: [2024-01-03 02:52:07,112][model8_pretrain.py][INFO] Epoch:[0/2](87800/4588595) loss:3.091 lr:0.0000100 epoch_Time:28313.0min: [2024-01-03 02:52:07,112][model8_pretrain.py][INFO] Epoch:[0/2](87800/4588595) loss:2.804 lr:0.0000100 epoch_Time:28313.0min: [2024-01-03 02:52:07,112][model8_pretrain.py][INFO] 
Epoch:[0/2](87800/4588595) loss:3.228 lr:0.0000100 epoch_Time:28313.0min: [2024-01-03 02:52:07,112][model8_pretrain.py][INFO] Epoch:[0/2](87800/4588595) loss:3.220 lr:0.0000100 epoch_Time:28313.0min: [2024-01-03 02:52:07,112][model8_pretrain.py][INFO] Epoch:[0/2](87800/4588595) loss:2.767 lr:0.0000100 epoch_Time:28313.0min: [2024-01-03 02:52:44,055][model8_pretrain.py][INFO] Epoch:[0/2](87900/4588595) loss:3.196 lr:0.0000100 epoch_Time:28312.0min: [2024-01-03 02:52:44,055][model8_pretrain.py][INFO] Epoch:[0/2](87900/4588595) loss:2.641 lr:0.0000100 epoch_Time:28312.0min: [2024-01-03 02:52:44,055][model8_pretrain.py][INFO] Epoch:[0/2](87900/4588595) loss:2.520 lr:0.0000100 epoch_Time:28312.0min: [2024-01-03 02:52:44,055][model8_pretrain.py][INFO] Epoch:[0/2](87900/4588595) loss:3.233 lr:0.0000100 epoch_Time:28312.0min: [2024-01-03 02:52:44,055][model8_pretrain.py][INFO] Epoch:[0/2](87900/4588595) loss:3.514 lr:0.0000100 epoch_Time:28312.0min: [2024-01-03 02:52:44,055][model8_pretrain.py][INFO] Epoch:[0/2](87900/4588595) loss:2.416 lr:0.0000100 epoch_Time:28312.0min: [2024-01-03 02:52:44,055][model8_pretrain.py][INFO] Epoch:[0/2](87900/4588595) loss:2.942 lr:0.0000100 epoch_Time:28312.0min: [2024-01-03 02:52:44,055][model8_pretrain.py][INFO] Epoch:[0/2](87900/4588595) loss:2.810 lr:0.0000100 epoch_Time:28312.0min: [2024-01-03 02:53:22,768][model8_pretrain.py][INFO] Epoch:[0/2](88000/4588595) loss:3.422 lr:0.0000100 epoch_Time:28312.0min: [2024-01-03 02:53:22,768][model8_pretrain.py][INFO] Epoch:[0/2](88000/4588595) loss:2.808 lr:0.0000100 epoch_Time:28312.0min: [2024-01-03 02:53:22,768][model8_pretrain.py][INFO] Epoch:[0/2](88000/4588595) loss:3.419 lr:0.0000100 epoch_Time:28312.0min: [2024-01-03 02:53:22,768][model8_pretrain.py][INFO] Epoch:[0/2](88000/4588595) loss:3.349 lr:0.0000100 epoch_Time:28312.0min: [2024-01-03 02:53:22,768][model8_pretrain.py][INFO] Epoch:[0/2](88000/4588595) loss:2.749 lr:0.0000100 epoch_Time:28312.0min: [2024-01-03 
02:53:22,768][model8_pretrain.py][INFO] Epoch:[0/2](88000/4588595) loss:3.047 lr:0.0000100 epoch_Time:28312.0min: [2024-01-03 02:53:22,768][model8_pretrain.py][INFO] Epoch:[0/2](88000/4588595) loss:3.249 lr:0.0000100 epoch_Time:28312.0min: [2024-01-03 02:53:22,769][model8_pretrain.py][INFO] Epoch:[0/2](88000/4588595) loss:2.836 lr:0.0000100 epoch_Time:28312.0min: [2024-01-03 02:54:05,064][model8_pretrain.py][INFO] Epoch:[0/2](88100/4588595) loss:2.701 lr:0.0000100 epoch_Time:28315.0min: [2024-01-03 02:54:05,064][model8_pretrain.py][INFO] Epoch:[0/2](88100/4588595) loss:3.046 lr:0.0000100 epoch_Time:28315.0min: [2024-01-03 02:54:05,064][model8_pretrain.py][INFO] Epoch:[0/2](88100/4588595) loss:2.913 lr:0.0000100 epoch_Time:28315.0min: [2024-01-03 02:54:05,064][model8_pretrain.py][INFO] Epoch:[0/2](88100/4588595) loss:2.920 lr:0.0000100 epoch_Time:28315.0min: [2024-01-03 02:54:05,064][model8_pretrain.py][INFO] Epoch:[0/2](88100/4588595) loss:3.175 lr:0.0000100 epoch_Time:28315.0min: [2024-01-03 02:54:05,064][model8_pretrain.py][INFO] Epoch:[0/2](88100/4588595) loss:2.767 lr:0.0000100 epoch_Time:28315.0min: [2024-01-03 02:54:05,064][model8_pretrain.py][INFO] Epoch:[0/2](88100/4588595) loss:3.077 lr:0.0000100 epoch_Time:28315.0min: [2024-01-03 02:54:05,064][model8_pretrain.py][INFO] Epoch:[0/2](88100/4588595) loss:2.922 lr:0.0000100 epoch_Time:28315.0min: [2024-01-03 02:54:42,009][model8_pretrain.py][INFO] Epoch:[0/2](88200/4588595) loss:3.076 lr:0.0000100 epoch_Time:28314.0min: [2024-01-03 02:54:42,009][model8_pretrain.py][INFO] Epoch:[0/2](88200/4588595) loss:2.780 lr:0.0000100 epoch_Time:28314.0min: [2024-01-03 02:54:42,009][model8_pretrain.py][INFO] Epoch:[0/2](88200/4588595) loss:3.538 lr:0.0000100 epoch_Time:28314.0min: [2024-01-03 02:54:42,009][model8_pretrain.py][INFO] Epoch:[0/2](88200/4588595) loss:2.894 lr:0.0000100 epoch_Time:28314.0min: [2024-01-03 02:54:42,009][model8_pretrain.py][INFO] Epoch:[0/2](88200/4588595) loss:3.118 lr:0.0000100 
epoch_Time:28314.0min: [2024-01-03 02:54:42,009][model8_pretrain.py][INFO] Epoch:[0/2](88200/4588595) loss:2.759 lr:0.0000100 epoch_Time:28314.0min: [2024-01-03 02:54:42,009][model8_pretrain.py][INFO] Epoch:[0/2](88200/4588595) loss:2.489 lr:0.0000100 epoch_Time:28314.0min: [2024-01-03 02:54:42,009][model8_pretrain.py][INFO] Epoch:[0/2](88200/4588595) loss:3.437 lr:0.0000100 epoch_Time:28314.0min: [2024-01-03 02:55:18,948][model8_pretrain.py][INFO] Epoch:[0/2](88300/4588595) loss:3.246 lr:0.0000100 epoch_Time:28312.0min: [2024-01-03 02:55:18,948][model8_pretrain.py][INFO] Epoch:[0/2](88300/4588595) loss:2.403 lr:0.0000100 epoch_Time:28312.0min: [2024-01-03 02:55:18,948][model8_pretrain.py][INFO] Epoch:[0/2](88300/4588595) loss:3.249 lr:0.0000100 epoch_Time:28312.0min: [2024-01-03 02:55:18,948][model8_pretrain.py][INFO] Epoch:[0/2](88300/4588595) loss:2.449 lr:0.0000100 epoch_Time:28312.0min: [2024-01-03 02:55:18,948][model8_pretrain.py][INFO] Epoch:[0/2](88300/4588595) loss:3.251 lr:0.0000100 epoch_Time:28312.0min: [2024-01-03 02:55:18,948][model8_pretrain.py][INFO] Epoch:[0/2](88300/4588595) loss:3.025 lr:0.0000100 epoch_Time:28312.0min: [2024-01-03 02:55:18,949][model8_pretrain.py][INFO] Epoch:[0/2](88300/4588595) loss:3.205 lr:0.0000100 epoch_Time:28312.0min: [2024-01-03 02:55:18,949][model8_pretrain.py][INFO] Epoch:[0/2](88300/4588595) loss:2.984 lr:0.0000100 epoch_Time:28312.0min: [2024-01-03 02:55:55,902][model8_pretrain.py][INFO] Epoch:[0/2](88400/4588595) loss:2.467 lr:0.0000100 epoch_Time:28311.0min: [2024-01-03 02:55:55,902][model8_pretrain.py][INFO] Epoch:[0/2](88400/4588595) loss:2.941 lr:0.0000100 epoch_Time:28311.0min: [2024-01-03 02:55:55,902][model8_pretrain.py][INFO] Epoch:[0/2](88400/4588595) loss:2.780 lr:0.0000100 epoch_Time:28311.0min: [2024-01-03 02:55:55,902][model8_pretrain.py][INFO] Epoch:[0/2](88400/4588595) loss:3.008 lr:0.0000100 epoch_Time:28311.0min: [2024-01-03 02:55:55,902][model8_pretrain.py][INFO] Epoch:[0/2](88400/4588595) 
loss:3.153 lr:0.0000100 epoch_Time:28311.0min: [2024-01-03 02:55:55,902][model8_pretrain.py][INFO] Epoch:[0/2](88400/4588595) loss:3.364 lr:0.0000100 epoch_Time:28311.0min: [2024-01-03 02:55:55,902][model8_pretrain.py][INFO] Epoch:[0/2](88400/4588595) loss:2.614 lr:0.0000100 epoch_Time:28311.0min: [2024-01-03 02:55:55,902][model8_pretrain.py][INFO] Epoch:[0/2](88400/4588595) loss:2.758 lr:0.0000100 epoch_Time:28311.0min: [2024-01-03 02:56:32,839][model8_pretrain.py][INFO] Epoch:[0/2](88500/4588595) loss:2.845 lr:0.0000100 epoch_Time:28310.0min: [2024-01-03 02:56:32,839][model8_pretrain.py][INFO] Epoch:[0/2](88500/4588595) loss:3.258 lr:0.0000100 epoch_Time:28310.0min: [2024-01-03 02:56:32,839][model8_pretrain.py][INFO] Epoch:[0/2](88500/4588595) loss:2.730 lr:0.0000100 epoch_Time:28310.0min: [2024-01-03 02:56:32,839][model8_pretrain.py][INFO] Epoch:[0/2](88500/4588595) loss:3.045 lr:0.0000100 epoch_Time:28310.0min: [2024-01-03 02:56:32,839][model8_pretrain.py][INFO] Epoch:[0/2](88500/4588595) loss:3.339 lr:0.0000100 epoch_Time:28310.0min: [2024-01-03 02:56:32,839][model8_pretrain.py][INFO] Epoch:[0/2](88500/4588595) loss:2.792 lr:0.0000100 epoch_Time:28310.0min: [2024-01-03 02:56:32,840][model8_pretrain.py][INFO] Epoch:[0/2](88500/4588595) loss:3.168 lr:0.0000100 epoch_Time:28310.0min: [2024-01-03 02:56:32,840][model8_pretrain.py][INFO] Epoch:[0/2](88500/4588595) loss:2.716 lr:0.0000100 epoch_Time:28310.0min: [2024-01-03 02:57:09,795][model8_pretrain.py][INFO] Epoch:[0/2](88600/4588595) loss:3.276 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 02:57:09,795][model8_pretrain.py][INFO] Epoch:[0/2](88600/4588595) loss:3.091 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 02:57:09,795][model8_pretrain.py][INFO] Epoch:[0/2](88600/4588595) loss:3.587 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 02:57:09,795][model8_pretrain.py][INFO] Epoch:[0/2](88600/4588595) loss:2.545 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 02:57:09,795][model8_pretrain.py][INFO] 
Epoch:[0/2](88600/4588595) loss:3.169 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 02:57:09,795][model8_pretrain.py][INFO] Epoch:[0/2](88600/4588595) loss:3.222 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 02:57:09,795][model8_pretrain.py][INFO] Epoch:[0/2](88600/4588595) loss:3.111 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 02:57:09,795][model8_pretrain.py][INFO] Epoch:[0/2](88600/4588595) loss:3.096 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 02:57:46,738][model8_pretrain.py][INFO] Epoch:[0/2](88700/4588595) loss:2.799 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 02:57:46,738][model8_pretrain.py][INFO] Epoch:[0/2](88700/4588595) loss:2.836 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 02:57:46,738][model8_pretrain.py][INFO] Epoch:[0/2](88700/4588595) loss:2.916 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 02:57:46,738][model8_pretrain.py][INFO] Epoch:[0/2](88700/4588595) loss:2.804 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 02:57:46,738][model8_pretrain.py][INFO] Epoch:[0/2](88700/4588595) loss:3.587 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 02:57:46,738][model8_pretrain.py][INFO] Epoch:[0/2](88700/4588595) loss:2.864 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 02:57:46,738][model8_pretrain.py][INFO] Epoch:[0/2](88700/4588595) loss:3.226 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 02:57:46,738][model8_pretrain.py][INFO] Epoch:[0/2](88700/4588595) loss:3.068 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 02:58:25,828][model8_pretrain.py][INFO] Epoch:[0/2](88800/4588595) loss:3.030 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 02:58:25,828][model8_pretrain.py][INFO] Epoch:[0/2](88800/4588595) loss:3.625 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 02:58:25,828][model8_pretrain.py][INFO] Epoch:[0/2](88800/4588595) loss:2.973 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 02:58:25,828][model8_pretrain.py][INFO] Epoch:[0/2](88800/4588595) loss:2.805 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 
02:58:25,828][model8_pretrain.py][INFO] Epoch:[0/2](88800/4588595) loss:3.273 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 02:58:25,828][model8_pretrain.py][INFO] Epoch:[0/2](88800/4588595) loss:3.061 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 02:58:25,828][model8_pretrain.py][INFO] Epoch:[0/2](88800/4588595) loss:3.112 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 02:58:25,828][model8_pretrain.py][INFO] Epoch:[0/2](88800/4588595) loss:2.995 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 02:59:08,048][model8_pretrain.py][INFO] Epoch:[0/2](88900/4588595) loss:2.973 lr:0.0000100 epoch_Time:28311.0min: [2024-01-03 02:59:08,048][model8_pretrain.py][INFO] Epoch:[0/2](88900/4588595) loss:2.901 lr:0.0000100 epoch_Time:28311.0min: [2024-01-03 02:59:08,048][model8_pretrain.py][INFO] Epoch:[0/2](88900/4588595) loss:2.498 lr:0.0000100 epoch_Time:28311.0min: [2024-01-03 02:59:08,048][model8_pretrain.py][INFO] Epoch:[0/2](88900/4588595) loss:3.058 lr:0.0000100 epoch_Time:28311.0min: [2024-01-03 02:59:08,048][model8_pretrain.py][INFO] Epoch:[0/2](88900/4588595) loss:3.211 lr:0.0000100 epoch_Time:28311.0min: [2024-01-03 02:59:08,048][model8_pretrain.py][INFO] Epoch:[0/2](88900/4588595) loss:3.114 lr:0.0000100 epoch_Time:28311.0min: [2024-01-03 02:59:08,048][model8_pretrain.py][INFO] Epoch:[0/2](88900/4588595) loss:2.699 lr:0.0000100 epoch_Time:28311.0min: [2024-01-03 02:59:08,049][model8_pretrain.py][INFO] Epoch:[0/2](88900/4588595) loss:3.259 lr:0.0000100 epoch_Time:28311.0min: [2024-01-03 02:59:44,983][model8_pretrain.py][INFO] Epoch:[0/2](89000/4588595) loss:3.070 lr:0.0000100 epoch_Time:28310.0min: [2024-01-03 02:59:44,983][model8_pretrain.py][INFO] Epoch:[0/2](89000/4588595) loss:2.957 lr:0.0000100 epoch_Time:28310.0min: [2024-01-03 02:59:44,983][model8_pretrain.py][INFO] Epoch:[0/2](89000/4588595) loss:3.210 lr:0.0000100 epoch_Time:28310.0min: [2024-01-03 02:59:44,983][model8_pretrain.py][INFO] Epoch:[0/2](89000/4588595) loss:3.241 lr:0.0000100 
epoch_Time:28310.0min: [2024-01-03 02:59:44,983][model8_pretrain.py][INFO] Epoch:[0/2](89000/4588595) loss:2.724 lr:0.0000100 epoch_Time:28310.0min: [2024-01-03 02:59:44,983][model8_pretrain.py][INFO] Epoch:[0/2](89000/4588595) loss:2.902 lr:0.0000100 epoch_Time:28310.0min: [2024-01-03 02:59:44,983][model8_pretrain.py][INFO] Epoch:[0/2](89000/4588595) loss:3.091 lr:0.0000100 epoch_Time:28310.0min: [2024-01-03 02:59:44,983][model8_pretrain.py][INFO] Epoch:[0/2](89000/4588595) loss:2.765 lr:0.0000100 epoch_Time:28310.0min: [2024-01-03 03:00:21,918][model8_pretrain.py][INFO] Epoch:[0/2](89100/4588595) loss:2.762 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 03:00:21,918][model8_pretrain.py][INFO] Epoch:[0/2](89100/4588595) loss:2.739 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 03:00:21,918][model8_pretrain.py][INFO] Epoch:[0/2](89100/4588595) loss:3.089 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 03:00:21,918][model8_pretrain.py][INFO] Epoch:[0/2](89100/4588595) loss:3.292 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 03:00:21,918][model8_pretrain.py][INFO] Epoch:[0/2](89100/4588595) loss:3.257 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 03:00:21,918][model8_pretrain.py][INFO] Epoch:[0/2](89100/4588595) loss:2.978 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 03:00:21,918][model8_pretrain.py][INFO] Epoch:[0/2](89100/4588595) loss:2.733 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 03:00:21,918][model8_pretrain.py][INFO] Epoch:[0/2](89100/4588595) loss:3.260 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 03:00:58,851][model8_pretrain.py][INFO] Epoch:[0/2](89200/4588595) loss:3.415 lr:0.0000100 epoch_Time:28307.0min: [2024-01-03 03:00:58,851][model8_pretrain.py][INFO] Epoch:[0/2](89200/4588595) loss:2.537 lr:0.0000100 epoch_Time:28307.0min: [2024-01-03 03:00:58,851][model8_pretrain.py][INFO] Epoch:[0/2](89200/4588595) loss:2.195 lr:0.0000100 epoch_Time:28307.0min: [2024-01-03 03:00:58,851][model8_pretrain.py][INFO] Epoch:[0/2](89200/4588595) 
loss:2.806 lr:0.0000100 epoch_Time:28307.0min: [2024-01-03 03:00:58,851][model8_pretrain.py][INFO] Epoch:[0/2](89200/4588595) loss:2.748 lr:0.0000100 epoch_Time:28307.0min: [2024-01-03 03:00:58,851][model8_pretrain.py][INFO] Epoch:[0/2](89200/4588595) loss:3.366 lr:0.0000100 epoch_Time:28307.0min: [2024-01-03 03:00:58,851][model8_pretrain.py][INFO] Epoch:[0/2](89200/4588595) loss:2.674 lr:0.0000100 epoch_Time:28307.0min: [2024-01-03 03:00:58,851][model8_pretrain.py][INFO] Epoch:[0/2](89200/4588595) loss:3.588 lr:0.0000100 epoch_Time:28307.0min: [2024-01-03 03:01:35,800][model8_pretrain.py][INFO] Epoch:[0/2](89300/4588595) loss:2.934 lr:0.0000100 epoch_Time:28306.0min: [2024-01-03 03:01:35,800][model8_pretrain.py][INFO] Epoch:[0/2](89300/4588595) loss:2.838 lr:0.0000100 epoch_Time:28306.0min: [2024-01-03 03:01:35,800][model8_pretrain.py][INFO] Epoch:[0/2](89300/4588595) loss:3.324 lr:0.0000100 epoch_Time:28306.0min: [2024-01-03 03:01:35,800][model8_pretrain.py][INFO] Epoch:[0/2](89300/4588595) loss:3.207 lr:0.0000100 epoch_Time:28306.0min: [2024-01-03 03:01:35,800][model8_pretrain.py][INFO] Epoch:[0/2](89300/4588595) loss:3.278 lr:0.0000100 epoch_Time:28306.0min: [2024-01-03 03:01:35,800][model8_pretrain.py][INFO] Epoch:[0/2](89300/4588595) loss:3.035 lr:0.0000100 epoch_Time:28306.0min: [2024-01-03 03:01:35,800][model8_pretrain.py][INFO] Epoch:[0/2](89300/4588595) loss:2.580 lr:0.0000100 epoch_Time:28306.0min: [2024-01-03 03:01:35,801][model8_pretrain.py][INFO] Epoch:[0/2](89300/4588595) loss:2.695 lr:0.0000100 epoch_Time:28306.0min: [2024-01-03 03:02:12,763][model8_pretrain.py][INFO] Epoch:[0/2](89400/4588595) loss:2.649 lr:0.0000100 epoch_Time:28304.0min: [2024-01-03 03:02:12,763][model8_pretrain.py][INFO] Epoch:[0/2](89400/4588595) loss:3.017 lr:0.0000100 epoch_Time:28304.0min: [2024-01-03 03:02:12,763][model8_pretrain.py][INFO] Epoch:[0/2](89400/4588595) loss:2.904 lr:0.0000100 epoch_Time:28304.0min: [2024-01-03 03:02:12,763][model8_pretrain.py][INFO] 
Epoch:[0/2](89400/4588595) loss:2.894 lr:0.0000100 epoch_Time:28304.0min: [2024-01-03 03:02:12,763][model8_pretrain.py][INFO] Epoch:[0/2](89400/4588595) loss:2.966 lr:0.0000100 epoch_Time:28304.0min: [2024-01-03 03:02:12,763][model8_pretrain.py][INFO] Epoch:[0/2](89400/4588595) loss:2.186 lr:0.0000100 epoch_Time:28304.0min: [2024-01-03 03:02:12,763][model8_pretrain.py][INFO] Epoch:[0/2](89400/4588595) loss:2.914 lr:0.0000100 epoch_Time:28304.0min: [2024-01-03 03:02:12,763][model8_pretrain.py][INFO] Epoch:[0/2](89400/4588595) loss:3.074 lr:0.0000100 epoch_Time:28304.0min: [2024-01-03 03:02:49,704][model8_pretrain.py][INFO] Epoch:[0/2](89500/4588595) loss:3.532 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:02:49,704][model8_pretrain.py][INFO] Epoch:[0/2](89500/4588595) loss:2.668 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:02:49,704][model8_pretrain.py][INFO] Epoch:[0/2](89500/4588595) loss:3.068 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:02:49,704][model8_pretrain.py][INFO] Epoch:[0/2](89500/4588595) loss:2.949 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:02:49,704][model8_pretrain.py][INFO] Epoch:[0/2](89500/4588595) loss:2.824 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:02:49,704][model8_pretrain.py][INFO] Epoch:[0/2](89500/4588595) loss:3.619 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:02:49,704][model8_pretrain.py][INFO] Epoch:[0/2](89500/4588595) loss:3.158 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:02:49,704][model8_pretrain.py][INFO] Epoch:[0/2](89500/4588595) loss:3.240 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:03:26,647][model8_pretrain.py][INFO] Epoch:[0/2](89600/4588595) loss:3.408 lr:0.0000100 epoch_Time:28302.0min: [2024-01-03 03:03:26,647][model8_pretrain.py][INFO] Epoch:[0/2](89600/4588595) loss:2.765 lr:0.0000100 epoch_Time:28302.0min: [2024-01-03 03:03:26,647][model8_pretrain.py][INFO] Epoch:[0/2](89600/4588595) loss:2.853 lr:0.0000100 epoch_Time:28302.0min: [2024-01-03 
03:03:26,647][model8_pretrain.py][INFO] Epoch:[0/2](89600/4588595) loss:2.737 lr:0.0000100 epoch_Time:28302.0min: [2024-01-03 03:03:26,647][model8_pretrain.py][INFO] Epoch:[0/2](89600/4588595) loss:2.811 lr:0.0000100 epoch_Time:28302.0min: [2024-01-03 03:03:26,647][model8_pretrain.py][INFO] Epoch:[0/2](89600/4588595) loss:3.174 lr:0.0000100 epoch_Time:28302.0min: [2024-01-03 03:03:26,647][model8_pretrain.py][INFO] Epoch:[0/2](89600/4588595) loss:2.808 lr:0.0000100 epoch_Time:28302.0min: [2024-01-03 03:03:26,647][model8_pretrain.py][INFO] Epoch:[0/2](89600/4588595) loss:2.903 lr:0.0000100 epoch_Time:28302.0min: [2024-01-03 03:04:12,232][model8_pretrain.py][INFO] Epoch:[0/2](89700/4588595) loss:2.800 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 03:04:12,232][model8_pretrain.py][INFO] Epoch:[0/2](89700/4588595) loss:2.976 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 03:04:12,233][model8_pretrain.py][INFO] Epoch:[0/2](89700/4588595) loss:3.059 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 03:04:12,233][model8_pretrain.py][INFO] Epoch:[0/2](89700/4588595) loss:2.895 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 03:04:12,233][model8_pretrain.py][INFO] Epoch:[0/2](89700/4588595) loss:3.208 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 03:04:12,233][model8_pretrain.py][INFO] Epoch:[0/2](89700/4588595) loss:2.938 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 03:04:12,233][model8_pretrain.py][INFO] Epoch:[0/2](89700/4588595) loss:3.136 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 03:04:12,233][model8_pretrain.py][INFO] Epoch:[0/2](89700/4588595) loss:3.291 lr:0.0000100 epoch_Time:28308.0min: [2024-01-03 03:04:49,165][model8_pretrain.py][INFO] Epoch:[0/2](89800/4588595) loss:3.128 lr:0.0000100 epoch_Time:28306.0min: [2024-01-03 03:04:49,165][model8_pretrain.py][INFO] Epoch:[0/2](89800/4588595) loss:3.300 lr:0.0000100 epoch_Time:28306.0min: [2024-01-03 03:04:49,165][model8_pretrain.py][INFO] Epoch:[0/2](89800/4588595) loss:3.501 lr:0.0000100 
epoch_Time:28306.0min: [2024-01-03 03:04:49,166][model8_pretrain.py][INFO] Epoch:[0/2](89800/4588595) loss:2.466 lr:0.0000100 epoch_Time:28306.0min: [2024-01-03 03:04:49,166][model8_pretrain.py][INFO] Epoch:[0/2](89800/4588595) loss:3.032 lr:0.0000100 epoch_Time:28306.0min: [2024-01-03 03:04:49,166][model8_pretrain.py][INFO] Epoch:[0/2](89800/4588595) loss:3.139 lr:0.0000100 epoch_Time:28306.0min: [2024-01-03 03:04:49,166][model8_pretrain.py][INFO] Epoch:[0/2](89800/4588595) loss:3.342 lr:0.0000100 epoch_Time:28306.0min: [2024-01-03 03:04:49,167][model8_pretrain.py][INFO] Epoch:[0/2](89800/4588595) loss:3.589 lr:0.0000100 epoch_Time:28306.0min: [2024-01-03 03:05:26,108][model8_pretrain.py][INFO] Epoch:[0/2](89900/4588595) loss:2.814 lr:0.0000100 epoch_Time:28305.0min: [2024-01-03 03:05:26,108][model8_pretrain.py][INFO] Epoch:[0/2](89900/4588595) loss:3.147 lr:0.0000100 epoch_Time:28305.0min: [2024-01-03 03:05:26,108][model8_pretrain.py][INFO] Epoch:[0/2](89900/4588595) loss:2.975 lr:0.0000100 epoch_Time:28305.0min: [2024-01-03 03:05:26,108][model8_pretrain.py][INFO] Epoch:[0/2](89900/4588595) loss:2.531 lr:0.0000100 epoch_Time:28305.0min: [2024-01-03 03:05:26,108][model8_pretrain.py][INFO] Epoch:[0/2](89900/4588595) loss:3.362 lr:0.0000100 epoch_Time:28305.0min: [2024-01-03 03:05:26,108][model8_pretrain.py][INFO] Epoch:[0/2](89900/4588595) loss:2.632 lr:0.0000100 epoch_Time:28305.0min: [2024-01-03 03:05:26,108][model8_pretrain.py][INFO] Epoch:[0/2](89900/4588595) loss:3.126 lr:0.0000100 epoch_Time:28305.0min: [2024-01-03 03:05:26,109][model8_pretrain.py][INFO] Epoch:[0/2](89900/4588595) loss:3.174 lr:0.0000100 epoch_Time:28305.0min: [2024-01-03 03:06:03,049][model8_pretrain.py][INFO] Epoch:[0/2](90000/4588595) loss:3.106 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:06:03,049][model8_pretrain.py][INFO] Epoch:[0/2](90000/4588595) loss:3.084 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:06:03,049][model8_pretrain.py][INFO] Epoch:[0/2](90000/4588595) 
loss:2.985 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:06:03,049][model8_pretrain.py][INFO] Epoch:[0/2](90000/4588595) loss:3.168 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:06:03,049][model8_pretrain.py][INFO] Epoch:[0/2](90000/4588595) loss:3.254 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:06:03,050][model8_pretrain.py][INFO] Epoch:[0/2](90000/4588595) loss:2.765 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:06:03,050][model8_pretrain.py][INFO] Epoch:[0/2](90000/4588595) loss:2.692 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:06:03,050][model8_pretrain.py][INFO] Epoch:[0/2](90000/4588595) loss:2.531 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:06:39,991][model8_pretrain.py][INFO] Epoch:[0/2](90100/4588595) loss:3.329 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:06:39,991][model8_pretrain.py][INFO] Epoch:[0/2](90100/4588595) loss:2.657 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:06:39,991][model8_pretrain.py][INFO] Epoch:[0/2](90100/4588595) loss:2.900 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:06:39,991][model8_pretrain.py][INFO] Epoch:[0/2](90100/4588595) loss:2.884 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:06:39,991][model8_pretrain.py][INFO] Epoch:[0/2](90100/4588595) loss:2.493 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:06:39,991][model8_pretrain.py][INFO] Epoch:[0/2](90100/4588595) loss:2.982 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:06:39,991][model8_pretrain.py][INFO] Epoch:[0/2](90100/4588595) loss:3.591 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:06:39,991][model8_pretrain.py][INFO] Epoch:[0/2](90100/4588595) loss:3.281 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:07:16,932][model8_pretrain.py][INFO] Epoch:[0/2](90200/4588595) loss:2.886 lr:0.0000100 epoch_Time:28301.0min: [2024-01-03 03:07:16,932][model8_pretrain.py][INFO] Epoch:[0/2](90200/4588595) loss:3.128 lr:0.0000100 epoch_Time:28301.0min: [2024-01-03 03:07:16,932][model8_pretrain.py][INFO] 
Epoch:[0/2](90200/4588595) loss:2.859 lr:0.0000100 epoch_Time:28301.0min: [2024-01-03 03:07:16,932][model8_pretrain.py][INFO] Epoch:[0/2](90200/4588595) loss:2.838 lr:0.0000100 epoch_Time:28301.0min: [2024-01-03 03:07:16,932][model8_pretrain.py][INFO] Epoch:[0/2](90200/4588595) loss:2.697 lr:0.0000100 epoch_Time:28301.0min: [2024-01-03 03:07:16,932][model8_pretrain.py][INFO] Epoch:[0/2](90200/4588595) loss:3.222 lr:0.0000100 epoch_Time:28301.0min: [2024-01-03 03:07:16,932][model8_pretrain.py][INFO] Epoch:[0/2](90200/4588595) loss:3.180 lr:0.0000100 epoch_Time:28301.0min: [2024-01-03 03:07:16,933][model8_pretrain.py][INFO] Epoch:[0/2](90200/4588595) loss:2.668 lr:0.0000100 epoch_Time:28301.0min: [2024-01-03 03:07:53,860][model8_pretrain.py][INFO] Epoch:[0/2](90300/4588595) loss:3.087 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:07:53,860][model8_pretrain.py][INFO] Epoch:[0/2](90300/4588595) loss:2.303 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:07:53,860][model8_pretrain.py][INFO] Epoch:[0/2](90300/4588595) loss:3.174 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:07:53,860][model8_pretrain.py][INFO] Epoch:[0/2](90300/4588595) loss:3.057 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:07:53,860][model8_pretrain.py][INFO] Epoch:[0/2](90300/4588595) loss:3.086 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:07:53,860][model8_pretrain.py][INFO] Epoch:[0/2](90300/4588595) loss:2.661 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:07:53,860][model8_pretrain.py][INFO] Epoch:[0/2](90300/4588595) loss:2.613 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:07:53,861][model8_pretrain.py][INFO] Epoch:[0/2](90300/4588595) loss:3.255 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:08:30,813][model8_pretrain.py][INFO] Epoch:[0/2](90400/4588595) loss:2.869 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:08:30,813][model8_pretrain.py][INFO] Epoch:[0/2](90400/4588595) loss:3.095 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 
03:08:30,813][model8_pretrain.py][INFO] Epoch:[0/2](90400/4588595) loss:3.061 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:08:30,813][model8_pretrain.py][INFO] Epoch:[0/2](90400/4588595) loss:2.839 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:08:30,813][model8_pretrain.py][INFO] Epoch:[0/2](90400/4588595) loss:2.638 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:08:30,813][model8_pretrain.py][INFO] Epoch:[0/2](90400/4588595) loss:3.373 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:08:30,813][model8_pretrain.py][INFO] Epoch:[0/2](90400/4588595) loss:2.338 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:08:30,814][model8_pretrain.py][INFO] Epoch:[0/2](90400/4588595) loss:3.321 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:09:16,287][model8_pretrain.py][INFO] Epoch:[0/2](90500/4588595) loss:3.323 lr:0.0000100 epoch_Time:28304.0min: [2024-01-03 03:09:16,287][model8_pretrain.py][INFO] Epoch:[0/2](90500/4588595) loss:3.220 lr:0.0000100 epoch_Time:28304.0min: [2024-01-03 03:09:16,287][model8_pretrain.py][INFO] Epoch:[0/2](90500/4588595) loss:2.803 lr:0.0000100 epoch_Time:28304.0min: [2024-01-03 03:09:16,287][model8_pretrain.py][INFO] Epoch:[0/2](90500/4588595) loss:2.789 lr:0.0000100 epoch_Time:28304.0min: [2024-01-03 03:09:16,287][model8_pretrain.py][INFO] Epoch:[0/2](90500/4588595) loss:2.806 lr:0.0000100 epoch_Time:28304.0min: [2024-01-03 03:09:16,287][model8_pretrain.py][INFO] Epoch:[0/2](90500/4588595) loss:3.011 lr:0.0000100 epoch_Time:28304.0min: [2024-01-03 03:09:16,287][model8_pretrain.py][INFO] Epoch:[0/2](90500/4588595) loss:3.327 lr:0.0000100 epoch_Time:28304.0min: [2024-01-03 03:09:16,287][model8_pretrain.py][INFO] Epoch:[0/2](90500/4588595) loss:2.597 lr:0.0000100 epoch_Time:28304.0min: [2024-01-03 03:09:53,199][model8_pretrain.py][INFO] Epoch:[0/2](90600/4588595) loss:2.868 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:09:53,199][model8_pretrain.py][INFO] Epoch:[0/2](90600/4588595) loss:2.987 lr:0.0000100 
epoch_Time:28303.0min: [2024-01-03 03:09:53,199][model8_pretrain.py][INFO] Epoch:[0/2](90600/4588595) loss:3.055 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:09:53,199][model8_pretrain.py][INFO] Epoch:[0/2](90600/4588595) loss:3.269 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:09:53,199][model8_pretrain.py][INFO] Epoch:[0/2](90600/4588595) loss:3.184 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:09:53,199][model8_pretrain.py][INFO] Epoch:[0/2](90600/4588595) loss:2.947 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:09:53,199][model8_pretrain.py][INFO] Epoch:[0/2](90600/4588595) loss:2.646 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:09:53,199][model8_pretrain.py][INFO] Epoch:[0/2](90600/4588595) loss:3.448 lr:0.0000100 epoch_Time:28303.0min: [2024-01-03 03:10:30,132][model8_pretrain.py][INFO] Epoch:[0/2](90700/4588595) loss:3.385 lr:0.0000100 epoch_Time:28302.0min: [2024-01-03 03:10:30,132][model8_pretrain.py][INFO] Epoch:[0/2](90700/4588595) loss:2.823 lr:0.0000100 epoch_Time:28302.0min: [2024-01-03 03:10:30,132][model8_pretrain.py][INFO] Epoch:[0/2](90700/4588595) loss:2.867 lr:0.0000100 epoch_Time:28302.0min: [2024-01-03 03:10:30,132][model8_pretrain.py][INFO] Epoch:[0/2](90700/4588595) loss:2.957 lr:0.0000100 epoch_Time:28302.0min: [2024-01-03 03:10:30,132][model8_pretrain.py][INFO] Epoch:[0/2](90700/4588595) loss:2.829 lr:0.0000100 epoch_Time:28302.0min: [2024-01-03 03:10:30,132][model8_pretrain.py][INFO] Epoch:[0/2](90700/4588595) loss:3.146 lr:0.0000100 epoch_Time:28302.0min: [2024-01-03 03:10:30,132][model8_pretrain.py][INFO] Epoch:[0/2](90700/4588595) loss:2.935 lr:0.0000100 epoch_Time:28302.0min: [2024-01-03 03:10:30,132][model8_pretrain.py][INFO] Epoch:[0/2](90700/4588595) loss:3.298 lr:0.0000100 epoch_Time:28302.0min: [2024-01-03 03:11:07,072][model8_pretrain.py][INFO] Epoch:[0/2](90800/4588595) loss:2.295 lr:0.0000100 epoch_Time:28300.0min: [2024-01-03 03:11:07,072][model8_pretrain.py][INFO] Epoch:[0/2](90800/4588595) 
loss:2.765 lr:0.0000100 epoch_Time:28300.0min: [2024-01-03 03:11:07,072][model8_pretrain.py][INFO] Epoch:[0/2](90800/4588595) loss:3.036 lr:0.0000100 epoch_Time:28300.0min: [2024-01-03 03:11:07,072][model8_pretrain.py][INFO] Epoch:[0/2](90800/4588595) loss:3.126 lr:0.0000100 epoch_Time:28300.0min: [2024-01-03 03:11:07,072][model8_pretrain.py][INFO] Epoch:[0/2](90800/4588595) loss:2.701 lr:0.0000100 epoch_Time:28300.0min: [2024-01-03 03:11:07,072][model8_pretrain.py][INFO] Epoch:[0/2](90800/4588595) loss:3.288 lr:0.0000100 epoch_Time:28300.0min: [2024-01-03 03:11:07,072][model8_pretrain.py][INFO] Epoch:[0/2](90800/4588595) loss:3.155 lr:0.0000100 epoch_Time:28300.0min: [2024-01-03 03:11:07,072][model8_pretrain.py][INFO] Epoch:[0/2](90800/4588595) loss:2.781 lr:0.0000100 epoch_Time:28300.0min: [2024-01-03 03:11:44,007][model8_pretrain.py][INFO] Epoch:[0/2](90900/4588595) loss:2.655 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:11:44,007][model8_pretrain.py][INFO] Epoch:[0/2](90900/4588595) loss:3.009 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:11:44,007][model8_pretrain.py][INFO] Epoch:[0/2](90900/4588595) loss:2.641 lr:0.0000100 epoch_Time:28300.0min: [2024-01-03 03:11:44,007][model8_pretrain.py][INFO] Epoch:[0/2](90900/4588595) loss:2.946 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:11:44,007][model8_pretrain.py][INFO] Epoch:[0/2](90900/4588595) loss:2.583 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:11:44,007][model8_pretrain.py][INFO] Epoch:[0/2](90900/4588595) loss:3.184 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:11:44,007][model8_pretrain.py][INFO] Epoch:[0/2](90900/4588595) loss:2.620 lr:0.0000100 epoch_Time:28300.0min: [2024-01-03 03:11:44,007][model8_pretrain.py][INFO] Epoch:[0/2](90900/4588595) loss:2.816 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:12:20,934][model8_pretrain.py][INFO] Epoch:[0/2](91000/4588595) loss:3.051 lr:0.0000100 epoch_Time:28298.0min: [2024-01-03 03:12:20,934][model8_pretrain.py][INFO] 
Epoch:[0/2](91000/4588595) loss:3.127 lr:0.0000100 epoch_Time:28298.0min: [2024-01-03 03:12:20,934][model8_pretrain.py][INFO] Epoch:[0/2](91000/4588595) loss:3.268 lr:0.0000100 epoch_Time:28298.0min: [2024-01-03 03:12:20,934][model8_pretrain.py][INFO] Epoch:[0/2](91000/4588595) loss:3.158 lr:0.0000100 epoch_Time:28298.0min: [2024-01-03 03:12:20,934][model8_pretrain.py][INFO] Epoch:[0/2](91000/4588595) loss:3.159 lr:0.0000100 epoch_Time:28298.0min: [2024-01-03 03:12:20,934][model8_pretrain.py][INFO] Epoch:[0/2](91000/4588595) loss:3.279 lr:0.0000100 epoch_Time:28298.0min: [2024-01-03 03:12:20,934][model8_pretrain.py][INFO] Epoch:[0/2](91000/4588595) loss:3.316 lr:0.0000100 epoch_Time:28298.0min: [2024-01-03 03:12:20,934][model8_pretrain.py][INFO] Epoch:[0/2](91000/4588595) loss:2.573 lr:0.0000100 epoch_Time:28298.0min: [2024-01-03 03:12:57,861][model8_pretrain.py][INFO] Epoch:[0/2](91100/4588595) loss:3.194 lr:0.0000100 epoch_Time:28296.0min: [2024-01-03 03:12:57,861][model8_pretrain.py][INFO] Epoch:[0/2](91100/4588595) loss:3.441 lr:0.0000100 epoch_Time:28296.0min: [2024-01-03 03:12:57,861][model8_pretrain.py][INFO] Epoch:[0/2](91100/4588595) loss:3.041 lr:0.0000100 epoch_Time:28296.0min: [2024-01-03 03:12:57,861][model8_pretrain.py][INFO] Epoch:[0/2](91100/4588595) loss:2.511 lr:0.0000100 epoch_Time:28296.0min: [2024-01-03 03:12:57,861][model8_pretrain.py][INFO] Epoch:[0/2](91100/4588595) loss:2.929 lr:0.0000100 epoch_Time:28296.0min: [2024-01-03 03:12:57,861][model8_pretrain.py][INFO] Epoch:[0/2](91100/4588595) loss:2.690 lr:0.0000100 epoch_Time:28296.0min: [2024-01-03 03:12:57,861][model8_pretrain.py][INFO] Epoch:[0/2](91100/4588595) loss:3.156 lr:0.0000100 epoch_Time:28296.0min: [2024-01-03 03:12:57,861][model8_pretrain.py][INFO] Epoch:[0/2](91100/4588595) loss:3.138 lr:0.0000100 epoch_Time:28296.0min: [2024-01-03 03:13:34,787][model8_pretrain.py][INFO] Epoch:[0/2](91200/4588595) loss:2.868 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 
03:13:34,787][model8_pretrain.py][INFO] Epoch:[0/2](91200/4588595) loss:3.228 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:13:34,787][model8_pretrain.py][INFO] Epoch:[0/2](91200/4588595) loss:3.134 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:13:34,787][model8_pretrain.py][INFO] Epoch:[0/2](91200/4588595) loss:3.372 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:13:34,787][model8_pretrain.py][INFO] Epoch:[0/2](91200/4588595) loss:3.072 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:13:34,787][model8_pretrain.py][INFO] Epoch:[0/2](91200/4588595) loss:3.363 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:13:34,787][model8_pretrain.py][INFO] Epoch:[0/2](91200/4588595) loss:3.105 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:13:34,788][model8_pretrain.py][INFO] Epoch:[0/2](91200/4588595) loss:2.770 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:14:20,153][model8_pretrain.py][INFO] Epoch:[0/2](91300/4588595) loss:3.349 lr:0.0000100 epoch_Time:28301.0min: [2024-01-03 03:14:20,153][model8_pretrain.py][INFO] Epoch:[0/2](91300/4588595) loss:2.581 lr:0.0000100 epoch_Time:28301.0min: [2024-01-03 03:14:20,153][model8_pretrain.py][INFO] Epoch:[0/2](91300/4588595) loss:3.094 lr:0.0000100 epoch_Time:28301.0min: [2024-01-03 03:14:20,153][model8_pretrain.py][INFO] Epoch:[0/2](91300/4588595) loss:3.144 lr:0.0000100 epoch_Time:28301.0min: [2024-01-03 03:14:20,154][model8_pretrain.py][INFO] Epoch:[0/2](91300/4588595) loss:3.201 lr:0.0000100 epoch_Time:28301.0min: [2024-01-03 03:14:20,154][model8_pretrain.py][INFO] Epoch:[0/2](91300/4588595) loss:3.440 lr:0.0000100 epoch_Time:28301.0min: [2024-01-03 03:14:20,154][model8_pretrain.py][INFO] Epoch:[0/2](91300/4588595) loss:3.005 lr:0.0000100 epoch_Time:28301.0min: [2024-01-03 03:14:20,154][model8_pretrain.py][INFO] Epoch:[0/2](91300/4588595) loss:2.770 lr:0.0000100 epoch_Time:28301.0min: [2024-01-03 03:14:57,093][model8_pretrain.py][INFO] Epoch:[0/2](91400/4588595) loss:2.789 lr:0.0000100 
epoch_Time:28299.0min: [2024-01-03 03:14:57,093][model8_pretrain.py][INFO] Epoch:[0/2](91400/4588595) loss:2.679 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:14:57,093][model8_pretrain.py][INFO] Epoch:[0/2](91400/4588595) loss:2.617 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:14:57,093][model8_pretrain.py][INFO] Epoch:[0/2](91400/4588595) loss:2.946 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:14:57,093][model8_pretrain.py][INFO] Epoch:[0/2](91400/4588595) loss:2.829 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:14:57,093][model8_pretrain.py][INFO] Epoch:[0/2](91400/4588595) loss:2.949 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:14:57,093][model8_pretrain.py][INFO] Epoch:[0/2](91400/4588595) loss:3.519 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:14:57,093][model8_pretrain.py][INFO] Epoch:[0/2](91400/4588595) loss:2.892 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:15:34,102][model8_pretrain.py][INFO] Epoch:[0/2](91500/4588595) loss:2.849 lr:0.0000100 epoch_Time:28298.0min: [2024-01-03 03:15:34,102][model8_pretrain.py][INFO] Epoch:[0/2](91500/4588595) loss:3.041 lr:0.0000100 epoch_Time:28298.0min: [2024-01-03 03:15:34,102][model8_pretrain.py][INFO] Epoch:[0/2](91500/4588595) loss:3.034 lr:0.0000100 epoch_Time:28298.0min: [2024-01-03 03:15:34,102][model8_pretrain.py][INFO] Epoch:[0/2](91500/4588595) loss:3.112 lr:0.0000100 epoch_Time:28298.0min: [2024-01-03 03:15:34,102][model8_pretrain.py][INFO] Epoch:[0/2](91500/4588595) loss:3.034 lr:0.0000100 epoch_Time:28298.0min: [2024-01-03 03:15:34,102][model8_pretrain.py][INFO] Epoch:[0/2](91500/4588595) loss:3.077 lr:0.0000100 epoch_Time:28298.0min: [2024-01-03 03:15:34,102][model8_pretrain.py][INFO] Epoch:[0/2](91500/4588595) loss:3.165 lr:0.0000100 epoch_Time:28299.0min: [2024-01-03 03:15:34,103][model8_pretrain.py][INFO] Epoch:[0/2](91500/4588595) loss:3.164 lr:0.0000100 epoch_Time:28298.0min: [2024-01-03 03:16:11,096][model8_pretrain.py][INFO] Epoch:[0/2](91600/4588595) 
loss:2.849 lr:0.0000100 epoch_Time:28297.0min: [2024-01-03 03:16:11,096][model8_pretrain.py][INFO] Epoch:[0/2](91600/4588595) loss:3.073 lr:0.0000100 epoch_Time:28297.0min: [2024-01-03 03:16:11,096][model8_pretrain.py][INFO] Epoch:[0/2](91600/4588595) loss:3.431 lr:0.0000100 epoch_Time:28297.0min: [2024-01-03 03:16:11,096][model8_pretrain.py][INFO] Epoch:[0/2](91600/4588595) loss:3.222 lr:0.0000100 epoch_Time:28297.0min: [2024-01-03 03:16:11,096][model8_pretrain.py][INFO] Epoch:[0/2](91600/4588595) loss:2.688 lr:0.0000100 epoch_Time:28297.0min: [2024-01-03 03:16:11,096][model8_pretrain.py][INFO] Epoch:[0/2](91600/4588595) loss:2.973 lr:0.0000100 epoch_Time:28297.0min: [2024-01-03 03:16:11,096][model8_pretrain.py][INFO] Epoch:[0/2](91600/4588595) loss:2.910 lr:0.0000100 epoch_Time:28297.0min: [2024-01-03 03:16:11,097][model8_pretrain.py][INFO] Epoch:[0/2](91600/4588595) loss:3.153 lr:0.0000100 epoch_Time:28297.0min: [2024-01-03 03:16:48,047][model8_pretrain.py][INFO] Epoch:[0/2](91700/4588595) loss:2.771 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:16:48,047][model8_pretrain.py][INFO] Epoch:[0/2](91700/4588595) loss:2.720 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:16:48,047][model8_pretrain.py][INFO] Epoch:[0/2](91700/4588595) loss:2.781 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:16:48,047][model8_pretrain.py][INFO] Epoch:[0/2](91700/4588595) loss:2.790 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:16:48,047][model8_pretrain.py][INFO] Epoch:[0/2](91700/4588595) loss:3.228 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:16:48,047][model8_pretrain.py][INFO] Epoch:[0/2](91700/4588595) loss:2.945 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:16:48,047][model8_pretrain.py][INFO] Epoch:[0/2](91700/4588595) loss:3.036 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:16:48,047][model8_pretrain.py][INFO] Epoch:[0/2](91700/4588595) loss:3.407 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:17:24,994][model8_pretrain.py][INFO] 
Epoch:[0/2](91800/4588595) loss:3.444 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:17:24,994][model8_pretrain.py][INFO] Epoch:[0/2](91800/4588595) loss:3.118 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:17:24,994][model8_pretrain.py][INFO] Epoch:[0/2](91800/4588595) loss:2.992 lr:0.0000100 epoch_Time:28294.0min: [2024-01-03 03:17:24,995][model8_pretrain.py][INFO] Epoch:[0/2](91800/4588595) loss:3.153 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:17:24,995][model8_pretrain.py][INFO] Epoch:[0/2](91800/4588595) loss:2.772 lr:0.0000100 epoch_Time:28294.0min: [2024-01-03 03:17:24,995][model8_pretrain.py][INFO] Epoch:[0/2](91800/4588595) loss:3.448 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:17:24,995][model8_pretrain.py][INFO] Epoch:[0/2](91800/4588595) loss:3.112 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:17:24,995][model8_pretrain.py][INFO] Epoch:[0/2](91800/4588595) loss:3.124 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:18:01,934][model8_pretrain.py][INFO] Epoch:[0/2](91900/4588595) loss:2.722 lr:0.0000100 epoch_Time:28293.0min: [2024-01-03 03:18:01,934][model8_pretrain.py][INFO] Epoch:[0/2](91900/4588595) loss:3.217 lr:0.0000100 epoch_Time:28293.0min: [2024-01-03 03:18:01,934][model8_pretrain.py][INFO] Epoch:[0/2](91900/4588595) loss:2.645 lr:0.0000100 epoch_Time:28293.0min: [2024-01-03 03:18:01,934][model8_pretrain.py][INFO] Epoch:[0/2](91900/4588595) loss:2.998 lr:0.0000100 epoch_Time:28293.0min: [2024-01-03 03:18:01,934][model8_pretrain.py][INFO] Epoch:[0/2](91900/4588595) loss:2.719 lr:0.0000100 epoch_Time:28293.0min: [2024-01-03 03:18:01,934][model8_pretrain.py][INFO] Epoch:[0/2](91900/4588595) loss:3.347 lr:0.0000100 epoch_Time:28293.0min: [2024-01-03 03:18:01,934][model8_pretrain.py][INFO] Epoch:[0/2](91900/4588595) loss:3.324 lr:0.0000100 epoch_Time:28293.0min: [2024-01-03 03:18:01,934][model8_pretrain.py][INFO] Epoch:[0/2](91900/4588595) loss:3.348 lr:0.0000100 epoch_Time:28293.0min: [2024-01-03 
03:18:38,869][model8_pretrain.py][INFO] Epoch:[0/2](92000/4588595) loss:3.027 lr:0.0000100 epoch_Time:28292.0min: [2024-01-03 03:18:38,869][model8_pretrain.py][INFO] Epoch:[0/2](92000/4588595) loss:3.224 lr:0.0000100 epoch_Time:28292.0min: [2024-01-03 03:18:38,869][model8_pretrain.py][INFO] Epoch:[0/2](92000/4588595) loss:3.337 lr:0.0000100 epoch_Time:28292.0min: [2024-01-03 03:18:38,869][model8_pretrain.py][INFO] Epoch:[0/2](92000/4588595) loss:3.176 lr:0.0000100 epoch_Time:28292.0min: [2024-01-03 03:18:38,869][model8_pretrain.py][INFO] Epoch:[0/2](92000/4588595) loss:3.222 lr:0.0000100 epoch_Time:28292.0min: [2024-01-03 03:18:38,870][model8_pretrain.py][INFO] Epoch:[0/2](92000/4588595) loss:3.095 lr:0.0000100 epoch_Time:28292.0min: [2024-01-03 03:18:38,870][model8_pretrain.py][INFO] Epoch:[0/2](92000/4588595) loss:2.857 lr:0.0000100 epoch_Time:28292.0min: [2024-01-03 03:18:38,870][model8_pretrain.py][INFO] Epoch:[0/2](92000/4588595) loss:3.446 lr:0.0000100 epoch_Time:28292.0min: [2024-01-03 03:19:24,287][model8_pretrain.py][INFO] Epoch:[0/2](92100/4588595) loss:2.992 lr:0.0000100 epoch_Time:28298.0min: [2024-01-03 03:19:24,287][model8_pretrain.py][INFO] Epoch:[0/2](92100/4588595) loss:2.620 lr:0.0000100 epoch_Time:28298.0min: [2024-01-03 03:19:24,287][model8_pretrain.py][INFO] Epoch:[0/2](92100/4588595) loss:3.202 lr:0.0000100 epoch_Time:28298.0min: [2024-01-03 03:19:24,287][model8_pretrain.py][INFO] Epoch:[0/2](92100/4588595) loss:3.056 lr:0.0000100 epoch_Time:28298.0min: [2024-01-03 03:19:24,287][model8_pretrain.py][INFO] Epoch:[0/2](92100/4588595) loss:3.160 lr:0.0000100 epoch_Time:28298.0min: [2024-01-03 03:19:24,287][model8_pretrain.py][INFO] Epoch:[0/2](92100/4588595) loss:2.547 lr:0.0000100 epoch_Time:28298.0min: [2024-01-03 03:19:24,287][model8_pretrain.py][INFO] Epoch:[0/2](92100/4588595) loss:2.987 lr:0.0000100 epoch_Time:28298.0min: [2024-01-03 03:19:24,287][model8_pretrain.py][INFO] Epoch:[0/2](92100/4588595) loss:3.502 lr:0.0000100 
epoch_Time:28298.0min: [2024-01-03 03:20:01,233][model8_pretrain.py][INFO] Epoch:[0/2](92200/4588595) loss:3.350 lr:0.0000100 epoch_Time:28296.0min: [2024-01-03 03:20:01,233][model8_pretrain.py][INFO] Epoch:[0/2](92200/4588595) loss:2.307 lr:0.0000100 epoch_Time:28296.0min: [2024-01-03 03:20:01,233][model8_pretrain.py][INFO] Epoch:[0/2](92200/4588595) loss:2.380 lr:0.0000100 epoch_Time:28296.0min: [2024-01-03 03:20:01,233][model8_pretrain.py][INFO] Epoch:[0/2](92200/4588595) loss:2.867 lr:0.0000100 epoch_Time:28296.0min: [2024-01-03 03:20:01,233][model8_pretrain.py][INFO] Epoch:[0/2](92200/4588595) loss:3.453 lr:0.0000100 epoch_Time:28296.0min: [2024-01-03 03:20:01,233][model8_pretrain.py][INFO] Epoch:[0/2](92200/4588595) loss:2.678 lr:0.0000100 epoch_Time:28296.0min: [2024-01-03 03:20:01,233][model8_pretrain.py][INFO] Epoch:[0/2](92200/4588595) loss:3.057 lr:0.0000100 epoch_Time:28296.0min: [2024-01-03 03:20:01,233][model8_pretrain.py][INFO] Epoch:[0/2](92200/4588595) loss:3.378 lr:0.0000100 epoch_Time:28296.0min: [2024-01-03 03:20:38,182][model8_pretrain.py][INFO] Epoch:[0/2](92300/4588595) loss:2.798 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:20:38,182][model8_pretrain.py][INFO] Epoch:[0/2](92300/4588595) loss:3.399 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:20:38,182][model8_pretrain.py][INFO] Epoch:[0/2](92300/4588595) loss:2.866 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:20:38,183][model8_pretrain.py][INFO] Epoch:[0/2](92300/4588595) loss:3.023 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:20:38,182][model8_pretrain.py][INFO] Epoch:[0/2](92300/4588595) loss:3.139 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:20:38,183][model8_pretrain.py][INFO] Epoch:[0/2](92300/4588595) loss:2.970 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:20:38,183][model8_pretrain.py][INFO] Epoch:[0/2](92300/4588595) loss:3.179 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:20:38,183][model8_pretrain.py][INFO] Epoch:[0/2](92300/4588595) 
loss:3.340 lr:0.0000100 epoch_Time:28295.0min: [2024-01-03 03:21:15,135][model8_pretrain.py][INFO] Epoch:[0/2](92400/4588595) loss:3.302 lr:0.0000100 epoch_Time:28294.0min: [2024-01-03 03:21:15,135][model8_pretrain.py][INFO] Epoch:[0/2](92400/4588595) loss:3.029 lr:0.0000100 epoch_Time:28294.0min: [2024-01-03 03:21:15,135][model8_pretrain.py][INFO] Epoch:[0/2](92400/4588595) loss:3.132 lr:0.0000100 epoch_Time:28294.0min: [2024-01-03 03:21:15,135][model8_pretrain.py][INFO] Epoch:[0/2](92400/4588595) loss:2.846 lr:0.0000100 epoch_Time:28293.0min: [2024-01-03 03:21:15,135][model8_pretrain.py][INFO] Epoch:[0/2](92400/4588595) loss:3.197 lr:0.0000100 epoch_Time:28293.0min: [2024-01-03 03:21:15,135][model8_pretrain.py][INFO] Epoch:[0/2](92400/4588595) loss:2.836 lr:0.0000100 epoch_Time:28294.0min: [2024-01-03 03:21:15,135][model8_pretrain.py][INFO] Epoch:[0/2](92400/4588595) loss:3.416 lr:0.0000100 epoch_Time:28294.0min: [2024-01-03 03:21:15,135][model8_pretrain.py][INFO] Epoch:[0/2](92400/4588595) loss:3.183 lr:0.0000100 epoch_Time:28294.0min: [2024-01-03 03:21:52,089][model8_pretrain.py][INFO] Epoch:[0/2](92500/4588595) loss:2.895 lr:0.0000100 epoch_Time:28292.0min: [2024-01-03 03:21:52,089][model8_pretrain.py][INFO] Epoch:[0/2](92500/4588595) loss:3.151 lr:0.0000100 epoch_Time:28292.0min: [2024-01-03 03:21:52,089][model8_pretrain.py][INFO] Epoch:[0/2](92500/4588595) loss:2.798 lr:0.0000100 epoch_Time:28292.0min: [2024-01-03 03:21:52,089][model8_pretrain.py][INFO] Epoch:[0/2](92500/4588595) loss:3.121 lr:0.0000100 epoch_Time:28292.0min: [2024-01-03 03:21:52,089][model8_pretrain.py][INFO] Epoch:[0/2](92500/4588595) loss:2.857 lr:0.0000100 epoch_Time:28292.0min: [2024-01-03 03:21:52,089][model8_pretrain.py][INFO] Epoch:[0/2](92500/4588595) loss:3.022 lr:0.0000100 epoch_Time:28292.0min: [2024-01-03 03:21:52,089][model8_pretrain.py][INFO] Epoch:[0/2](92500/4588595) loss:3.232 lr:0.0000100 epoch_Time:28292.0min: [2024-01-03 03:21:52,089][model8_pretrain.py][INFO] 
Epoch:[0/2](92500/4588595) loss:3.077 lr:0.0000100 epoch_Time:28292.0min: [2024-01-03 03:22:29,046][model8_pretrain.py][INFO] Epoch:[0/2](92600/4588595) loss:3.378 lr:0.0000100 epoch_Time:28291.0min: [2024-01-03 03:22:29,046][model8_pretrain.py][INFO] Epoch:[0/2](92600/4588595) loss:3.348 lr:0.0000100 epoch_Time:28291.0min: [2024-01-03 03:22:29,046][model8_pretrain.py][INFO] Epoch:[0/2](92600/4588595) loss:2.917 lr:0.0000100 epoch_Time:28291.0min: [2024-01-03 03:22:29,046][model8_pretrain.py][INFO] Epoch:[0/2](92600/4588595) loss:3.062 lr:0.0000100 epoch_Time:28291.0min: [2024-01-03 03:22:29,046][model8_pretrain.py][INFO] Epoch:[0/2](92600/4588595) loss:3.352 lr:0.0000100 epoch_Time:28291.0min: [2024-01-03 03:22:29,046][model8_pretrain.py][INFO] Epoch:[0/2](92600/4588595) loss:3.382 lr:0.0000100 epoch_Time:28291.0min: [2024-01-03 03:22:29,046][model8_pretrain.py][INFO] Epoch:[0/2](92600/4588595) loss:2.373 lr:0.0000100 epoch_Time:28291.0min: [2024-01-03 03:22:29,046][model8_pretrain.py][INFO] Epoch:[0/2](92600/4588595) loss:2.851 lr:0.0000100 epoch_Time:28291.0min: [2024-01-03 03:23:05,990][model8_pretrain.py][INFO] Epoch:[0/2](92700/4588595) loss:3.250 lr:0.0000100 epoch_Time:28290.0min: [2024-01-03 03:23:05,990][model8_pretrain.py][INFO] Epoch:[0/2](92700/4588595) loss:3.344 lr:0.0000100 epoch_Time:28290.0min: [2024-01-03 03:23:05,991][model8_pretrain.py][INFO] Epoch:[0/2](92700/4588595) loss:2.949 lr:0.0000100 epoch_Time:28290.0min: [2024-01-03 03:23:05,991][model8_pretrain.py][INFO] Epoch:[0/2](92700/4588595) loss:2.856 lr:0.0000100 epoch_Time:28290.0min: [2024-01-03 03:23:05,991][model8_pretrain.py][INFO] Epoch:[0/2](92700/4588595) loss:2.692 lr:0.0000100 epoch_Time:28290.0min: [2024-01-03 03:23:05,991][model8_pretrain.py][INFO] Epoch:[0/2](92700/4588595) loss:2.950 lr:0.0000100 epoch_Time:28290.0min: [2024-01-03 03:23:05,991][model8_pretrain.py][INFO] Epoch:[0/2](92700/4588595) loss:2.522 lr:0.0000100 epoch_Time:28290.0min: [2024-01-03 
03:23:05,991][model8_pretrain.py][INFO] Epoch:[0/2](92700/4588595) loss:2.538 lr:0.0000100 epoch_Time:28290.0min: [2024-01-03 03:23:42,934][model8_pretrain.py][INFO] Epoch:[0/2](92800/4588595) loss:3.068 lr:0.0000100 epoch_Time:28289.0min: [2024-01-03 03:23:42,934][model8_pretrain.py][INFO] Epoch:[0/2](92800/4588595) loss:3.130 lr:0.0000100 epoch_Time:28289.0min: [2024-01-03 03:23:42,934][model8_pretrain.py][INFO] Epoch:[0/2](92800/4588595) loss:2.607 lr:0.0000100 epoch_Time:28289.0min: [2024-01-03 03:23:42,934][model8_pretrain.py][INFO] Epoch:[0/2](92800/4588595) loss:3.408 lr:0.0000100 epoch_Time:28289.0min: [2024-01-03 03:23:42,934][model8_pretrain.py][INFO] Epoch:[0/2](92800/4588595) loss:3.355 lr:0.0000100 epoch_Time:28289.0min: [2024-01-03 03:23:42,934][model8_pretrain.py][INFO] Epoch:[0/2](92800/4588595) loss:3.376 lr:0.0000100 epoch_Time:28289.0min: [2024-01-03 03:23:42,934][model8_pretrain.py][INFO] Epoch:[0/2](92800/4588595) loss:3.031 lr:0.0000100 epoch_Time:28289.0min: [2024-01-03 03:23:42,934][model8_pretrain.py][INFO] Epoch:[0/2](92800/4588595) loss:2.682 lr:0.0000100 epoch_Time:28289.0min: [2024-01-03 03:24:28,631][model8_pretrain.py][INFO] Epoch:[0/2](92900/4588595) loss:2.671 lr:0.0000100 epoch_Time:28294.0min: [2024-01-03 03:24:28,631][model8_pretrain.py][INFO] Epoch:[0/2](92900/4588595) loss:3.101 lr:0.0000100 epoch_Time:28294.0min: [2024-01-03 03:24:28,631][model8_pretrain.py][INFO] Epoch:[0/2](92900/4588595) loss:2.783 lr:0.0000100 epoch_Time:28294.0min: [2024-01-03 03:24:28,631][model8_pretrain.py][INFO] Epoch:[0/2](92900/4588595) loss:3.088 lr:0.0000100 epoch_Time:28294.0min: [2024-01-03 03:24:28,631][model8_pretrain.py][INFO] Epoch:[0/2](92900/4588595) loss:3.319 lr:0.0000100 epoch_Time:28294.0min: [2024-01-03 03:24:28,631][model8_pretrain.py][INFO] Epoch:[0/2](92900/4588595) loss:2.174 lr:0.0000100 epoch_Time:28294.0min: [2024-01-03 03:24:28,632][model8_pretrain.py][INFO] Epoch:[0/2](92900/4588595) loss:3.560 lr:0.0000100 
epoch_Time:28294.0min: [2024-01-03 03:24:28,632][model8_pretrain.py][INFO] Epoch:[0/2](92900/4588595) loss:3.151 lr:0.0000100 epoch_Time:28294.0min: [2024-01-03 03:25:05,574][model8_pretrain.py][INFO] Epoch:[0/2](93000/4588595) loss:2.734 lr:0.0000100 epoch_Time:28293.0min: [2024-01-03 03:25:05,574][model8_pretrain.py][INFO] Epoch:[0/2](93000/4588595) loss:2.904 lr:0.0000100 epoch_Time:28293.0min: [2024-01-03 03:25:05,574][model8_pretrain.py][INFO] Epoch:[0/2](93000/4588595) loss:2.635 lr:0.0000100 epoch_Time:28293.0min: [2024-01-03 03:25:05,574][model8_pretrain.py][INFO] Epoch:[0/2](93000/4588595) loss:3.127 lr:0.0000100 epoch_Time:28293.0min: [2024-01-03 03:25:05,574][model8_pretrain.py][INFO] Epoch:[0/2](93000/4588595) loss:3.085 lr:0.0000100 epoch_Time:28293.0min: [2024-01-03 03:25:05,574][model8_pretrain.py][INFO] Epoch:[0/2](93000/4588595) loss:2.393 lr:0.0000100 epoch_Time:28293.0min: [2024-01-03 03:25:05,574][model8_pretrain.py][INFO] Epoch:[0/2](93000/4588595) loss:3.228 lr:0.0000100 epoch_Time:28293.0min: [2024-01-03 03:25:05,575][model8_pretrain.py][INFO] Epoch:[0/2](93000/4588595) loss:3.088 lr:0.0000100 epoch_Time:28293.0min: [2024-01-03 03:25:42,533][model8_pretrain.py][INFO] Epoch:[0/2](93100/4588595) loss:2.433 lr:0.0000100 epoch_Time:28292.0min: [2024-01-03 03:25:42,533][model8_pretrain.py][INFO] Epoch:[0/2](93100/4588595) loss:2.774 lr:0.0000100 epoch_Time:28292.0min: [2024-01-03 03:25:42,533][model8_pretrain.py][INFO] Epoch:[0/2](93100/4588595) loss:3.020 lr:0.0000100 epoch_Time:28292.0min: [2024-01-03 03:25:42,533][model8_pretrain.py][INFO] Epoch:[0/2](93100/4588595) loss:3.209 lr:0.0000100 epoch_Time:28292.0min: [2024-01-03 03:25:42,533][model8_pretrain.py][INFO] Epoch:[0/2](93100/4588595) loss:2.874 lr:0.0000100 epoch_Time:28292.0min: [2024-01-03 03:25:42,533][model8_pretrain.py][INFO] Epoch:[0/2](93100/4588595) loss:2.825 lr:0.0000100 epoch_Time:28292.0min: [2024-01-03 03:25:42,534][model8_pretrain.py][INFO] Epoch:[0/2](93100/4588595) 
loss:2.955 lr:0.0000100 epoch_Time:28292.0min: [2024-01-03 03:25:42,535][model8_pretrain.py][INFO] Epoch:[0/2](93100/4588595) loss:2.812 lr:0.0000100 epoch_Time:28292.0min: [2024-01-03 03:26:19,466][model8_pretrain.py][INFO] Epoch:[0/2](93200/4588595) loss:2.942 lr:0.0000100 epoch_Time:28290.0min: [2024-01-03 03:26:19,466][model8_pretrain.py][INFO] Epoch:[0/2](93200/4588595) loss:3.449 lr:0.0000100 epoch_Time:28290.0min: [2024-01-03 03:26:19,466][model8_pretrain.py][INFO] Epoch:[0/2](93200/4588595) loss:3.157 lr:0.0000100 epoch_Time:28290.0min: [2024-01-03 03:26:19,466][model8_pretrain.py][INFO] Epoch:[0/2](93200/4588595) loss:3.404 lr:0.0000100 epoch_Time:28290.0min: [2024-01-03 03:26:19,466][model8_pretrain.py][INFO] Epoch:[0/2](93200/4588595) loss:3.151 lr:0.0000100 epoch_Time:28290.0min: [2024-01-03 03:26:19,466][model8_pretrain.py][INFO] Epoch:[0/2](93200/4588595) loss:3.122 lr:0.0000100 epoch_Time:28290.0min: [2024-01-03 03:26:19,466][model8_pretrain.py][INFO] Epoch:[0/2](93200/4588595) loss:3.077 lr:0.0000100 epoch_Time:28290.0min: [2024-01-03 03:26:19,466][model8_pretrain.py][INFO] Epoch:[0/2](93200/4588595) loss:2.595 lr:0.0000100 epoch_Time:28290.0min: [2024-01-03 03:26:56,402][model8_pretrain.py][INFO] Epoch:[0/2](93300/4588595) loss:2.884 lr:0.0000100 epoch_Time:28289.0min: [2024-01-03 03:26:56,402][model8_pretrain.py][INFO] Epoch:[0/2](93300/4588595) loss:3.014 lr:0.0000100 epoch_Time:28289.0min: [2024-01-03 03:26:56,402][model8_pretrain.py][INFO] Epoch:[0/2](93300/4588595) loss:3.446 lr:0.0000100 epoch_Time:28289.0min: [2024-01-03 03:26:56,402][model8_pretrain.py][INFO] Epoch:[0/2](93300/4588595) loss:3.602 lr:0.0000100 epoch_Time:28289.0min: [2024-01-03 03:26:56,402][model8_pretrain.py][INFO] Epoch:[0/2](93300/4588595) loss:2.899 lr:0.0000100 epoch_Time:28289.0min: [2024-01-03 03:26:56,402][model8_pretrain.py][INFO] Epoch:[0/2](93300/4588595) loss:3.092 lr:0.0000100 epoch_Time:28289.0min: [2024-01-03 03:26:56,402][model8_pretrain.py][INFO] 
Epoch:[0/2](93300/4588595) loss:2.785 lr:0.0000100 epoch_Time:28289.0min: [2024-01-03 03:26:56,403][model8_pretrain.py][INFO] Epoch:[0/2](93300/4588595) loss:3.224 lr:0.0000100 epoch_Time:28289.0min: [2024-01-03 03:27:33,354][model8_pretrain.py][INFO] Epoch:[0/2](93400/4588595) loss:2.838 lr:0.0000100 epoch_Time:28288.0min: [2024-01-03 03:27:33,354][model8_pretrain.py][INFO] Epoch:[0/2](93400/4588595) loss:3.192 lr:0.0000100 epoch_Time:28288.0min: [2024-01-03 03:27:33,354][model8_pretrain.py][INFO] Epoch:[0/2](93400/4588595) loss:2.734 lr:0.0000100 epoch_Time:28288.0min: [2024-01-03 03:27:33,354][model8_pretrain.py][INFO] Epoch:[0/2](93400/4588595) loss:2.937 lr:0.0000100 epoch_Time:28288.0min: [2024-01-03 03:27:33,354][model8_pretrain.py][INFO] Epoch:[0/2](93400/4588595) loss:3.397 lr:0.0000100 epoch_Time:28288.0min: [2024-01-03 03:27:33,354][model8_pretrain.py][INFO] Epoch:[0/2](93400/4588595) loss:3.200 lr:0.0000100 epoch_Time:28288.0min: [2024-01-03 03:27:33,354][model8_pretrain.py][INFO] Epoch:[0/2](93400/4588595) loss:3.048 lr:0.0000100 epoch_Time:28288.0min: [2024-01-03 03:27:33,354][model8_pretrain.py][INFO] Epoch:[0/2](93400/4588595) loss:3.269 lr:0.0000100 epoch_Time:28288.0min: [2024-01-03 03:28:10,302][model8_pretrain.py][INFO] Epoch:[0/2](93500/4588595) loss:3.134 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:28:10,302][model8_pretrain.py][INFO] Epoch:[0/2](93500/4588595) loss:3.241 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:28:10,302][model8_pretrain.py][INFO] Epoch:[0/2](93500/4588595) loss:3.295 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:28:10,302][model8_pretrain.py][INFO] Epoch:[0/2](93500/4588595) loss:2.994 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:28:10,302][model8_pretrain.py][INFO] Epoch:[0/2](93500/4588595) loss:2.943 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:28:10,302][model8_pretrain.py][INFO] Epoch:[0/2](93500/4588595) loss:3.122 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 
03:28:10,302][model8_pretrain.py][INFO] Epoch:[0/2](93500/4588595) loss:2.989 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:28:10,302][model8_pretrain.py][INFO] Epoch:[0/2](93500/4588595) loss:3.144 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:28:47,259][model8_pretrain.py][INFO] Epoch:[0/2](93600/4588595) loss:3.095 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:28:47,259][model8_pretrain.py][INFO] Epoch:[0/2](93600/4588595) loss:3.327 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:28:47,259][model8_pretrain.py][INFO] Epoch:[0/2](93600/4588595) loss:3.390 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:28:47,259][model8_pretrain.py][INFO] Epoch:[0/2](93600/4588595) loss:2.567 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:28:47,259][model8_pretrain.py][INFO] Epoch:[0/2](93600/4588595) loss:3.163 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:28:47,259][model8_pretrain.py][INFO] Epoch:[0/2](93600/4588595) loss:2.947 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:28:47,259][model8_pretrain.py][INFO] Epoch:[0/2](93600/4588595) loss:2.515 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:28:47,259][model8_pretrain.py][INFO] Epoch:[0/2](93600/4588595) loss:2.902 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:29:33,055][model8_pretrain.py][INFO] Epoch:[0/2](93700/4588595) loss:3.030 lr:0.0000100 epoch_Time:28291.0min: [2024-01-03 03:29:33,055][model8_pretrain.py][INFO] Epoch:[0/2](93700/4588595) loss:3.497 lr:0.0000100 epoch_Time:28291.0min: [2024-01-03 03:29:33,055][model8_pretrain.py][INFO] Epoch:[0/2](93700/4588595) loss:3.649 lr:0.0000100 epoch_Time:28291.0min: [2024-01-03 03:29:33,055][model8_pretrain.py][INFO] Epoch:[0/2](93700/4588595) loss:3.175 lr:0.0000100 epoch_Time:28291.0min: [2024-01-03 03:29:33,055][model8_pretrain.py][INFO] Epoch:[0/2](93700/4588595) loss:2.969 lr:0.0000100 epoch_Time:28291.0min: [2024-01-03 03:29:33,055][model8_pretrain.py][INFO] Epoch:[0/2](93700/4588595) loss:3.347 lr:0.0000100 
epoch_Time:28291.0min: [2024-01-03 03:29:33,055][model8_pretrain.py][INFO] Epoch:[0/2](93700/4588595) loss:3.335 lr:0.0000100 epoch_Time:28291.0min: [2024-01-03 03:29:33,055][model8_pretrain.py][INFO] Epoch:[0/2](93700/4588595) loss:2.875 lr:0.0000100 epoch_Time:28291.0min: [2024-01-03 03:30:10,010][model8_pretrain.py][INFO] Epoch:[0/2](93800/4588595) loss:2.540 lr:0.0000100 epoch_Time:28290.0min: [2024-01-03 03:30:10,010][model8_pretrain.py][INFO] Epoch:[0/2](93800/4588595) loss:2.444 lr:0.0000100 epoch_Time:28290.0min: [2024-01-03 03:30:10,010][model8_pretrain.py][INFO] Epoch:[0/2](93800/4588595) loss:2.729 lr:0.0000100 epoch_Time:28290.0min: [2024-01-03 03:30:10,010][model8_pretrain.py][INFO] Epoch:[0/2](93800/4588595) loss:2.983 lr:0.0000100 epoch_Time:28290.0min: [2024-01-03 03:30:10,010][model8_pretrain.py][INFO] Epoch:[0/2](93800/4588595) loss:2.386 lr:0.0000100 epoch_Time:28290.0min: [2024-01-03 03:30:10,010][model8_pretrain.py][INFO] Epoch:[0/2](93800/4588595) loss:3.239 lr:0.0000100 epoch_Time:28290.0min: [2024-01-03 03:30:10,010][model8_pretrain.py][INFO] Epoch:[0/2](93800/4588595) loss:3.569 lr:0.0000100 epoch_Time:28290.0min: [2024-01-03 03:30:10,010][model8_pretrain.py][INFO] Epoch:[0/2](93800/4588595) loss:3.002 lr:0.0000100 epoch_Time:28290.0min: [2024-01-03 03:30:46,973][model8_pretrain.py][INFO] Epoch:[0/2](93900/4588595) loss:2.868 lr:0.0000100 epoch_Time:28289.0min: [2024-01-03 03:30:46,973][model8_pretrain.py][INFO] Epoch:[0/2](93900/4588595) loss:2.820 lr:0.0000100 epoch_Time:28289.0min: [2024-01-03 03:30:46,973][model8_pretrain.py][INFO] Epoch:[0/2](93900/4588595) loss:3.133 lr:0.0000100 epoch_Time:28289.0min: [2024-01-03 03:30:46,974][model8_pretrain.py][INFO] Epoch:[0/2](93900/4588595) loss:3.209 lr:0.0000100 epoch_Time:28289.0min: [2024-01-03 03:30:46,974][model8_pretrain.py][INFO] Epoch:[0/2](93900/4588595) loss:3.127 lr:0.0000100 epoch_Time:28289.0min: [2024-01-03 03:30:46,974][model8_pretrain.py][INFO] Epoch:[0/2](93900/4588595) 
loss:3.294 lr:0.0000100 epoch_Time:28289.0min: [2024-01-03 03:30:46,974][model8_pretrain.py][INFO] Epoch:[0/2](93900/4588595) loss:3.339 lr:0.0000100 epoch_Time:28289.0min: [2024-01-03 03:30:46,974][model8_pretrain.py][INFO] Epoch:[0/2](93900/4588595) loss:2.713 lr:0.0000100 epoch_Time:28289.0min: [2024-01-03 03:31:23,929][model8_pretrain.py][INFO] Epoch:[0/2](94000/4588595) loss:2.651 lr:0.0000100 epoch_Time:28287.0min: [2024-01-03 03:31:23,929][model8_pretrain.py][INFO] Epoch:[0/2](94000/4588595) loss:3.603 lr:0.0000100 epoch_Time:28287.0min: [2024-01-03 03:31:23,929][model8_pretrain.py][INFO] Epoch:[0/2](94000/4588595) loss:3.346 lr:0.0000100 epoch_Time:28287.0min: [2024-01-03 03:31:23,929][model8_pretrain.py][INFO] Epoch:[0/2](94000/4588595) loss:2.821 lr:0.0000100 epoch_Time:28287.0min: [2024-01-03 03:31:23,929][model8_pretrain.py][INFO] Epoch:[0/2](94000/4588595) loss:2.901 lr:0.0000100 epoch_Time:28287.0min: [2024-01-03 03:31:23,929][model8_pretrain.py][INFO] Epoch:[0/2](94000/4588595) loss:3.236 lr:0.0000100 epoch_Time:28287.0min: [2024-01-03 03:31:23,929][model8_pretrain.py][INFO] Epoch:[0/2](94000/4588595) loss:3.038 lr:0.0000100 epoch_Time:28287.0min: [2024-01-03 03:31:23,930][model8_pretrain.py][INFO] Epoch:[0/2](94000/4588595) loss:2.813 lr:0.0000100 epoch_Time:28287.0min: [2024-01-03 03:32:00,898][model8_pretrain.py][INFO] Epoch:[0/2](94100/4588595) loss:3.071 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:32:00,898][model8_pretrain.py][INFO] Epoch:[0/2](94100/4588595) loss:3.171 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:32:00,898][model8_pretrain.py][INFO] Epoch:[0/2](94100/4588595) loss:3.324 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:32:00,898][model8_pretrain.py][INFO] Epoch:[0/2](94100/4588595) loss:2.530 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:32:00,898][model8_pretrain.py][INFO] Epoch:[0/2](94100/4588595) loss:3.027 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:32:00,898][model8_pretrain.py][INFO] 
Epoch:[0/2](94100/4588595) loss:3.175 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:32:00,898][model8_pretrain.py][INFO] Epoch:[0/2](94100/4588595) loss:3.004 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:32:00,898][model8_pretrain.py][INFO] Epoch:[0/2](94100/4588595) loss:2.874 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:32:37,851][model8_pretrain.py][INFO] Epoch:[0/2](94200/4588595) loss:3.033 lr:0.0000100 epoch_Time:28285.0min: [2024-01-03 03:32:37,851][model8_pretrain.py][INFO] Epoch:[0/2](94200/4588595) loss:2.739 lr:0.0000100 epoch_Time:28285.0min: [2024-01-03 03:32:37,851][model8_pretrain.py][INFO] Epoch:[0/2](94200/4588595) loss:3.104 lr:0.0000100 epoch_Time:28285.0min: [2024-01-03 03:32:37,851][model8_pretrain.py][INFO] Epoch:[0/2](94200/4588595) loss:2.838 lr:0.0000100 epoch_Time:28285.0min: [2024-01-03 03:32:37,851][model8_pretrain.py][INFO] Epoch:[0/2](94200/4588595) loss:3.231 lr:0.0000100 epoch_Time:28285.0min: [2024-01-03 03:32:37,851][model8_pretrain.py][INFO] Epoch:[0/2](94200/4588595) loss:3.038 lr:0.0000100 epoch_Time:28285.0min: [2024-01-03 03:32:37,851][model8_pretrain.py][INFO] Epoch:[0/2](94200/4588595) loss:3.615 lr:0.0000100 epoch_Time:28285.0min: [2024-01-03 03:32:37,851][model8_pretrain.py][INFO] Epoch:[0/2](94200/4588595) loss:2.900 lr:0.0000100 epoch_Time:28285.0min: [2024-01-03 03:33:14,773][model8_pretrain.py][INFO] Epoch:[0/2](94300/4588595) loss:2.733 lr:0.0000100 epoch_Time:28283.0min: [2024-01-03 03:33:14,773][model8_pretrain.py][INFO] Epoch:[0/2](94300/4588595) loss:2.811 lr:0.0000100 epoch_Time:28283.0min: [2024-01-03 03:33:14,773][model8_pretrain.py][INFO] Epoch:[0/2](94300/4588595) loss:2.883 lr:0.0000100 epoch_Time:28283.0min: [2024-01-03 03:33:14,773][model8_pretrain.py][INFO] Epoch:[0/2](94300/4588595) loss:3.183 lr:0.0000100 epoch_Time:28283.0min: [2024-01-03 03:33:14,773][model8_pretrain.py][INFO] Epoch:[0/2](94300/4588595) loss:2.768 lr:0.0000100 epoch_Time:28283.0min: [2024-01-03 
03:33:14,773][model8_pretrain.py][INFO] Epoch:[0/2](94300/4588595) loss:3.076 lr:0.0000100 epoch_Time:28283.0min: [2024-01-03 03:33:14,773][model8_pretrain.py][INFO] Epoch:[0/2](94300/4588595) loss:3.184 lr:0.0000100 epoch_Time:28283.0min: [2024-01-03 03:33:14,773][model8_pretrain.py][INFO] Epoch:[0/2](94300/4588595) loss:3.129 lr:0.0000100 epoch_Time:28283.0min: [2024-01-03 03:33:51,732][model8_pretrain.py][INFO] Epoch:[0/2](94400/4588595) loss:2.974 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:33:51,732][model8_pretrain.py][INFO] Epoch:[0/2](94400/4588595) loss:2.840 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:33:51,732][model8_pretrain.py][INFO] Epoch:[0/2](94400/4588595) loss:3.030 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:33:51,732][model8_pretrain.py][INFO] Epoch:[0/2](94400/4588595) loss:2.839 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:33:51,732][model8_pretrain.py][INFO] Epoch:[0/2](94400/4588595) loss:2.944 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:33:51,732][model8_pretrain.py][INFO] Epoch:[0/2](94400/4588595) loss:2.550 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:33:51,732][model8_pretrain.py][INFO] Epoch:[0/2](94400/4588595) loss:3.177 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:33:51,732][model8_pretrain.py][INFO] Epoch:[0/2](94400/4588595) loss:3.080 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:34:37,403][model8_pretrain.py][INFO] Epoch:[0/2](94500/4588595) loss:2.876 lr:0.0000100 epoch_Time:28288.0min: [2024-01-03 03:34:37,403][model8_pretrain.py][INFO] Epoch:[0/2](94500/4588595) loss:2.966 lr:0.0000100 epoch_Time:28288.0min: [2024-01-03 03:34:37,403][model8_pretrain.py][INFO] Epoch:[0/2](94500/4588595) loss:3.181 lr:0.0000100 epoch_Time:28288.0min: [2024-01-03 03:34:37,403][model8_pretrain.py][INFO] Epoch:[0/2](94500/4588595) loss:2.992 lr:0.0000100 epoch_Time:28288.0min: [2024-01-03 03:34:37,403][model8_pretrain.py][INFO] Epoch:[0/2](94500/4588595) loss:2.908 lr:0.0000100 
epoch_Time:28288.0min: [2024-01-03 03:34:37,403][model8_pretrain.py][INFO] Epoch:[0/2](94500/4588595) loss:2.898 lr:0.0000100 epoch_Time:28288.0min: [2024-01-03 03:34:37,403][model8_pretrain.py][INFO] Epoch:[0/2](94500/4588595) loss:2.677 lr:0.0000100 epoch_Time:28288.0min: [2024-01-03 03:34:37,403][model8_pretrain.py][INFO] Epoch:[0/2](94500/4588595) loss:3.034 lr:0.0000100 epoch_Time:28288.0min: [2024-01-03 03:35:14,341][model8_pretrain.py][INFO] Epoch:[0/2](94600/4588595) loss:3.309 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:35:14,341][model8_pretrain.py][INFO] Epoch:[0/2](94600/4588595) loss:2.388 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:35:14,341][model8_pretrain.py][INFO] Epoch:[0/2](94600/4588595) loss:2.690 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:35:14,341][model8_pretrain.py][INFO] Epoch:[0/2](94600/4588595) loss:3.018 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:35:14,341][model8_pretrain.py][INFO] Epoch:[0/2](94600/4588595) loss:2.925 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:35:14,341][model8_pretrain.py][INFO] Epoch:[0/2](94600/4588595) loss:3.335 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:35:14,341][model8_pretrain.py][INFO] Epoch:[0/2](94600/4588595) loss:3.068 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:35:14,341][model8_pretrain.py][INFO] Epoch:[0/2](94600/4588595) loss:2.920 lr:0.0000100 epoch_Time:28286.0min: [2024-01-03 03:35:51,281][model8_pretrain.py][INFO] Epoch:[0/2](94700/4588595) loss:3.098 lr:0.0000100 epoch_Time:28285.0min: [2024-01-03 03:35:51,281][model8_pretrain.py][INFO] Epoch:[0/2](94700/4588595) loss:2.909 lr:0.0000100 epoch_Time:28285.0min: [2024-01-03 03:35:51,281][model8_pretrain.py][INFO] Epoch:[0/2](94700/4588595) loss:3.422 lr:0.0000100 epoch_Time:28285.0min: [2024-01-03 03:35:51,281][model8_pretrain.py][INFO] Epoch:[0/2](94700/4588595) loss:2.059 lr:0.0000100 epoch_Time:28285.0min: [2024-01-03 03:35:51,281][model8_pretrain.py][INFO] Epoch:[0/2](94700/4588595) 
loss:2.850 lr:0.0000100 epoch_Time:28285.0min: [2024-01-03 03:35:51,282][model8_pretrain.py][INFO] Epoch:[0/2](94700/4588595) loss:3.142 lr:0.0000100 epoch_Time:28285.0min: [2024-01-03 03:35:51,282][model8_pretrain.py][INFO] Epoch:[0/2](94700/4588595) loss:3.405 lr:0.0000100 epoch_Time:28285.0min: [2024-01-03 03:35:51,282][model8_pretrain.py][INFO] Epoch:[0/2](94700/4588595) loss:3.117 lr:0.0000100 epoch_Time:28285.0min: [2024-01-03 03:36:28,216][model8_pretrain.py][INFO] Epoch:[0/2](94800/4588595) loss:3.137 lr:0.0000100 epoch_Time:28284.0min: [2024-01-03 03:36:28,216][model8_pretrain.py][INFO] Epoch:[0/2](94800/4588595) loss:2.798 lr:0.0000100 epoch_Time:28284.0min: [2024-01-03 03:36:28,216][model8_pretrain.py][INFO] Epoch:[0/2](94800/4588595) loss:2.726 lr:0.0000100 epoch_Time:28284.0min: [2024-01-03 03:36:28,216][model8_pretrain.py][INFO] Epoch:[0/2](94800/4588595) loss:2.600 lr:0.0000100 epoch_Time:28284.0min: [2024-01-03 03:36:28,216][model8_pretrain.py][INFO] Epoch:[0/2](94800/4588595) loss:3.172 lr:0.0000100 epoch_Time:28284.0min: [2024-01-03 03:36:28,216][model8_pretrain.py][INFO] Epoch:[0/2](94800/4588595) loss:3.385 lr:0.0000100 epoch_Time:28284.0min: [2024-01-03 03:36:28,216][model8_pretrain.py][INFO] Epoch:[0/2](94800/4588595) loss:2.203 lr:0.0000100 epoch_Time:28284.0min: [2024-01-03 03:36:28,216][model8_pretrain.py][INFO] Epoch:[0/2](94800/4588595) loss:2.803 lr:0.0000100 epoch_Time:28284.0min: [2024-01-03 03:37:05,151][model8_pretrain.py][INFO] Epoch:[0/2](94900/4588595) loss:3.321 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:37:05,151][model8_pretrain.py][INFO] Epoch:[0/2](94900/4588595) loss:3.030 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:37:05,151][model8_pretrain.py][INFO] Epoch:[0/2](94900/4588595) loss:3.097 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:37:05,151][model8_pretrain.py][INFO] Epoch:[0/2](94900/4588595) loss:2.789 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:37:05,151][model8_pretrain.py][INFO] 
Epoch:[0/2](94900/4588595) loss:3.177 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:37:05,151][model8_pretrain.py][INFO] Epoch:[0/2](94900/4588595) loss:2.785 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:37:05,151][model8_pretrain.py][INFO] Epoch:[0/2](94900/4588595) loss:2.631 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:37:05,151][model8_pretrain.py][INFO] Epoch:[0/2](94900/4588595) loss:2.924 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:37:42,111][model8_pretrain.py][INFO] Epoch:[0/2](95000/4588595) loss:3.219 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:37:42,111][model8_pretrain.py][INFO] Epoch:[0/2](95000/4588595) loss:3.042 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:37:42,111][model8_pretrain.py][INFO] Epoch:[0/2](95000/4588595) loss:2.796 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:37:42,111][model8_pretrain.py][INFO] Epoch:[0/2](95000/4588595) loss:3.227 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:37:42,111][model8_pretrain.py][INFO] Epoch:[0/2](95000/4588595) loss:3.234 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:37:42,111][model8_pretrain.py][INFO] Epoch:[0/2](95000/4588595) loss:3.305 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:37:42,111][model8_pretrain.py][INFO] Epoch:[0/2](95000/4588595) loss:3.062 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:37:42,111][model8_pretrain.py][INFO] Epoch:[0/2](95000/4588595) loss:3.044 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:38:19,091][model8_pretrain.py][INFO] Epoch:[0/2](95100/4588595) loss:2.499 lr:0.0000100 epoch_Time:28280.0min: [2024-01-03 03:38:19,091][model8_pretrain.py][INFO] Epoch:[0/2](95100/4588595) loss:3.036 lr:0.0000100 epoch_Time:28280.0min: [2024-01-03 03:38:19,091][model8_pretrain.py][INFO] Epoch:[0/2](95100/4588595) loss:2.895 lr:0.0000100 epoch_Time:28280.0min: [2024-01-03 03:38:19,091][model8_pretrain.py][INFO] Epoch:[0/2](95100/4588595) loss:3.423 lr:0.0000100 epoch_Time:28280.0min: [2024-01-03 
03:38:19,091][model8_pretrain.py][INFO] Epoch:[0/2](95100/4588595) loss:2.752 lr:0.0000100 epoch_Time:28280.0min: [2024-01-03 03:38:19,091][model8_pretrain.py][INFO] Epoch:[0/2](95100/4588595) loss:2.702 lr:0.0000100 epoch_Time:28280.0min: [2024-01-03 03:38:19,091][model8_pretrain.py][INFO] Epoch:[0/2](95100/4588595) loss:2.995 lr:0.0000100 epoch_Time:28280.0min: [2024-01-03 03:38:19,091][model8_pretrain.py][INFO] Epoch:[0/2](95100/4588595) loss:2.670 lr:0.0000100 epoch_Time:28280.0min: [2024-01-03 03:38:56,034][model8_pretrain.py][INFO] Epoch:[0/2](95200/4588595) loss:3.024 lr:0.0000100 epoch_Time:28279.0min: [2024-01-03 03:38:56,034][model8_pretrain.py][INFO] Epoch:[0/2](95200/4588595) loss:2.703 lr:0.0000100 epoch_Time:28278.0min: [2024-01-03 03:38:56,034][model8_pretrain.py][INFO] Epoch:[0/2](95200/4588595) loss:2.465 lr:0.0000100 epoch_Time:28278.0min: [2024-01-03 03:38:56,034][model8_pretrain.py][INFO] Epoch:[0/2](95200/4588595) loss:2.760 lr:0.0000100 epoch_Time:28278.0min: [2024-01-03 03:38:56,034][model8_pretrain.py][INFO] Epoch:[0/2](95200/4588595) loss:3.168 lr:0.0000100 epoch_Time:28278.0min: [2024-01-03 03:38:56,034][model8_pretrain.py][INFO] Epoch:[0/2](95200/4588595) loss:2.294 lr:0.0000100 epoch_Time:28279.0min: [2024-01-03 03:38:56,034][model8_pretrain.py][INFO] Epoch:[0/2](95200/4588595) loss:2.873 lr:0.0000100 epoch_Time:28278.0min: [2024-01-03 03:38:56,034][model8_pretrain.py][INFO] Epoch:[0/2](95200/4588595) loss:3.159 lr:0.0000100 epoch_Time:28278.0min: [2024-01-03 03:39:41,752][model8_pretrain.py][INFO] Epoch:[0/2](95300/4588595) loss:2.262 lr:0.0000100 epoch_Time:28285.0min: [2024-01-03 03:39:41,752][model8_pretrain.py][INFO] Epoch:[0/2](95300/4588595) loss:3.004 lr:0.0000100 epoch_Time:28285.0min: [2024-01-03 03:39:41,752][model8_pretrain.py][INFO] Epoch:[0/2](95300/4588595) loss:2.823 lr:0.0000100 epoch_Time:28285.0min: [2024-01-03 03:39:41,752][model8_pretrain.py][INFO] Epoch:[0/2](95300/4588595) loss:3.023 lr:0.0000100 
epoch_Time:28285.0min: [2024-01-03 03:39:41,752][model8_pretrain.py][INFO] Epoch:[0/2](95300/4588595) loss:3.009 lr:0.0000100 epoch_Time:28285.0min: [2024-01-03 03:39:41,752][model8_pretrain.py][INFO] Epoch:[0/2](95300/4588595) loss:3.353 lr:0.0000100 epoch_Time:28285.0min: [2024-01-03 03:39:41,752][model8_pretrain.py][INFO] Epoch:[0/2](95300/4588595) loss:3.095 lr:0.0000100 epoch_Time:28285.0min: [2024-01-03 03:39:41,752][model8_pretrain.py][INFO] Epoch:[0/2](95300/4588595) loss:3.469 lr:0.0000100 epoch_Time:28285.0min: [2024-01-03 03:40:18,724][model8_pretrain.py][INFO] Epoch:[0/2](95400/4588595) loss:3.421 lr:0.0000100 epoch_Time:28283.0min: [2024-01-03 03:40:18,724][model8_pretrain.py][INFO] Epoch:[0/2](95400/4588595) loss:3.536 lr:0.0000100 epoch_Time:28283.0min: [2024-01-03 03:40:18,724][model8_pretrain.py][INFO] Epoch:[0/2](95400/4588595) loss:2.635 lr:0.0000100 epoch_Time:28283.0min: [2024-01-03 03:40:18,725][model8_pretrain.py][INFO] Epoch:[0/2](95400/4588595) loss:3.171 lr:0.0000100 epoch_Time:28283.0min: [2024-01-03 03:40:18,725][model8_pretrain.py][INFO] Epoch:[0/2](95400/4588595) loss:2.982 lr:0.0000100 epoch_Time:28283.0min: [2024-01-03 03:40:18,725][model8_pretrain.py][INFO] Epoch:[0/2](95400/4588595) loss:2.924 lr:0.0000100 epoch_Time:28283.0min: [2024-01-03 03:40:18,725][model8_pretrain.py][INFO] Epoch:[0/2](95400/4588595) loss:3.009 lr:0.0000100 epoch_Time:28283.0min: [2024-01-03 03:40:18,725][model8_pretrain.py][INFO] Epoch:[0/2](95400/4588595) loss:2.960 lr:0.0000100 epoch_Time:28283.0min: [2024-01-03 03:40:55,684][model8_pretrain.py][INFO] Epoch:[0/2](95500/4588595) loss:3.334 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:40:55,684][model8_pretrain.py][INFO] Epoch:[0/2](95500/4588595) loss:3.419 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:40:55,684][model8_pretrain.py][INFO] Epoch:[0/2](95500/4588595) loss:2.950 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:40:55,684][model8_pretrain.py][INFO] Epoch:[0/2](95500/4588595) 
loss:2.944 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:40:55,684][model8_pretrain.py][INFO] Epoch:[0/2](95500/4588595) loss:3.312 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:40:55,684][model8_pretrain.py][INFO] Epoch:[0/2](95500/4588595) loss:3.395 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:40:55,684][model8_pretrain.py][INFO] Epoch:[0/2](95500/4588595) loss:2.929 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:40:55,685][model8_pretrain.py][INFO] Epoch:[0/2](95500/4588595) loss:3.429 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:41:32,639][model8_pretrain.py][INFO] Epoch:[0/2](95600/4588595) loss:3.314 lr:0.0000100 epoch_Time:28281.0min: [2024-01-03 03:41:32,640][model8_pretrain.py][INFO] Epoch:[0/2](95600/4588595) loss:3.292 lr:0.0000100 epoch_Time:28281.0min: [2024-01-03 03:41:32,640][model8_pretrain.py][INFO] Epoch:[0/2](95600/4588595) loss:2.515 lr:0.0000100 epoch_Time:28281.0min: [2024-01-03 03:41:32,640][model8_pretrain.py][INFO] Epoch:[0/2](95600/4588595) loss:2.885 lr:0.0000100 epoch_Time:28281.0min: [2024-01-03 03:41:32,640][model8_pretrain.py][INFO] Epoch:[0/2](95600/4588595) loss:2.850 lr:0.0000100 epoch_Time:28281.0min: [2024-01-03 03:41:32,640][model8_pretrain.py][INFO] Epoch:[0/2](95600/4588595) loss:3.118 lr:0.0000100 epoch_Time:28281.0min: [2024-01-03 03:41:32,640][model8_pretrain.py][INFO] Epoch:[0/2](95600/4588595) loss:3.267 lr:0.0000100 epoch_Time:28281.0min: [2024-01-03 03:41:32,640][model8_pretrain.py][INFO] Epoch:[0/2](95600/4588595) loss:2.885 lr:0.0000100 epoch_Time:28281.0min: [2024-01-03 03:42:09,609][model8_pretrain.py][INFO] Epoch:[0/2](95700/4588595) loss:3.147 lr:0.0000100 epoch_Time:28279.0min: [2024-01-03 03:42:09,609][model8_pretrain.py][INFO] Epoch:[0/2](95700/4588595) loss:3.273 lr:0.0000100 epoch_Time:28279.0min: [2024-01-03 03:42:09,609][model8_pretrain.py][INFO] Epoch:[0/2](95700/4588595) loss:2.910 lr:0.0000100 epoch_Time:28279.0min: [2024-01-03 03:42:09,609][model8_pretrain.py][INFO] 
Epoch:[0/2](95700/4588595) loss:3.109 lr:0.0000100 epoch_Time:28279.0min: [2024-01-03 03:42:09,609][model8_pretrain.py][INFO] Epoch:[0/2](95700/4588595) loss:3.112 lr:0.0000100 epoch_Time:28279.0min: [2024-01-03 03:42:09,609][model8_pretrain.py][INFO] Epoch:[0/2](95700/4588595) loss:3.030 lr:0.0000100 epoch_Time:28279.0min: [2024-01-03 03:42:09,609][model8_pretrain.py][INFO] Epoch:[0/2](95700/4588595) loss:2.832 lr:0.0000100 epoch_Time:28279.0min: [2024-01-03 03:42:09,610][model8_pretrain.py][INFO] Epoch:[0/2](95700/4588595) loss:3.296 lr:0.0000100 epoch_Time:28279.0min: [2024-01-03 03:42:46,566][model8_pretrain.py][INFO] Epoch:[0/2](95800/4588595) loss:2.595 lr:0.0000100 epoch_Time:28279.0min: [2024-01-03 03:42:46,566][model8_pretrain.py][INFO] Epoch:[0/2](95800/4588595) loss:2.826 lr:0.0000100 epoch_Time:28279.0min: [2024-01-03 03:42:46,566][model8_pretrain.py][INFO] Epoch:[0/2](95800/4588595) loss:3.030 lr:0.0000100 epoch_Time:28279.0min: [2024-01-03 03:42:46,567][model8_pretrain.py][INFO] Epoch:[0/2](95800/4588595) loss:3.200 lr:0.0000100 epoch_Time:28279.0min: [2024-01-03 03:42:46,567][model8_pretrain.py][INFO] Epoch:[0/2](95800/4588595) loss:2.804 lr:0.0000100 epoch_Time:28279.0min: [2024-01-03 03:42:46,567][model8_pretrain.py][INFO] Epoch:[0/2](95800/4588595) loss:3.008 lr:0.0000100 epoch_Time:28279.0min: [2024-01-03 03:42:46,567][model8_pretrain.py][INFO] Epoch:[0/2](95800/4588595) loss:3.402 lr:0.0000100 epoch_Time:28279.0min: [2024-01-03 03:42:46,567][model8_pretrain.py][INFO] Epoch:[0/2](95800/4588595) loss:3.362 lr:0.0000100 epoch_Time:28279.0min: [2024-01-03 03:43:23,500][model8_pretrain.py][INFO] Epoch:[0/2](95900/4588595) loss:3.039 lr:0.0000100 epoch_Time:28277.0min: [2024-01-03 03:43:23,500][model8_pretrain.py][INFO] Epoch:[0/2](95900/4588595) loss:3.038 lr:0.0000100 epoch_Time:28277.0min: [2024-01-03 03:43:23,500][model8_pretrain.py][INFO] Epoch:[0/2](95900/4588595) loss:2.957 lr:0.0000100 epoch_Time:28277.0min: [2024-01-03 
03:43:23,500][model8_pretrain.py][INFO] Epoch:[0/2](95900/4588595) loss:2.888 lr:0.0000100 epoch_Time:28277.0min: [2024-01-03 03:43:23,500][model8_pretrain.py][INFO] Epoch:[0/2](95900/4588595) loss:2.948 lr:0.0000100 epoch_Time:28277.0min: [2024-01-03 03:43:23,500][model8_pretrain.py][INFO] Epoch:[0/2](95900/4588595) loss:2.649 lr:0.0000100 epoch_Time:28277.0min: [2024-01-03 03:43:23,500][model8_pretrain.py][INFO] Epoch:[0/2](95900/4588595) loss:2.841 lr:0.0000100 epoch_Time:28277.0min: [2024-01-03 03:43:23,500][model8_pretrain.py][INFO] Epoch:[0/2](95900/4588595) loss:3.099 lr:0.0000100 epoch_Time:28277.0min: [2024-01-03 03:44:00,428][model8_pretrain.py][INFO] Epoch:[0/2](96000/4588595) loss:3.456 lr:0.0000100 epoch_Time:28275.0min: [2024-01-03 03:44:00,429][model8_pretrain.py][INFO] Epoch:[0/2](96000/4588595) loss:3.443 lr:0.0000100 epoch_Time:28275.0min: [2024-01-03 03:44:00,429][model8_pretrain.py][INFO] Epoch:[0/2](96000/4588595) loss:2.454 lr:0.0000100 epoch_Time:28275.0min: [2024-01-03 03:44:00,429][model8_pretrain.py][INFO] Epoch:[0/2](96000/4588595) loss:2.831 lr:0.0000100 epoch_Time:28275.0min: [2024-01-03 03:44:00,429][model8_pretrain.py][INFO] Epoch:[0/2](96000/4588595) loss:2.966 lr:0.0000100 epoch_Time:28275.0min: [2024-01-03 03:44:00,429][model8_pretrain.py][INFO] Epoch:[0/2](96000/4588595) loss:3.058 lr:0.0000100 epoch_Time:28275.0min: [2024-01-03 03:44:00,429][model8_pretrain.py][INFO] Epoch:[0/2](96000/4588595) loss:2.850 lr:0.0000100 epoch_Time:28275.0min: [2024-01-03 03:44:00,429][model8_pretrain.py][INFO] Epoch:[0/2](96000/4588595) loss:3.111 lr:0.0000100 epoch_Time:28275.0min: [2024-01-03 03:44:46,152][model8_pretrain.py][INFO] Epoch:[0/2](96100/4588595) loss:3.208 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:44:46,152][model8_pretrain.py][INFO] Epoch:[0/2](96100/4588595) loss:2.896 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:44:46,152][model8_pretrain.py][INFO] Epoch:[0/2](96100/4588595) loss:3.238 lr:0.0000100 
epoch_Time:28282.0min: [2024-01-03 03:44:46,152][model8_pretrain.py][INFO] Epoch:[0/2](96100/4588595) loss:3.197 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:44:46,152][model8_pretrain.py][INFO] Epoch:[0/2](96100/4588595) loss:2.637 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:44:46,152][model8_pretrain.py][INFO] Epoch:[0/2](96100/4588595) loss:3.139 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:44:46,152][model8_pretrain.py][INFO] Epoch:[0/2](96100/4588595) loss:3.352 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:44:46,152][model8_pretrain.py][INFO] Epoch:[0/2](96100/4588595) loss:3.238 lr:0.0000100 epoch_Time:28282.0min: [2024-01-03 03:45:23,081][model8_pretrain.py][INFO] Epoch:[0/2](96200/4588595) loss:2.632 lr:0.0000100 epoch_Time:28280.0min: [2024-01-03 03:45:23,081][model8_pretrain.py][INFO] Epoch:[0/2](96200/4588595) loss:3.084 lr:0.0000100 epoch_Time:28280.0min: [2024-01-03 03:45:23,081][model8_pretrain.py][INFO] Epoch:[0/2](96200/4588595) loss:3.011 lr:0.0000100 epoch_Time:28280.0min: [2024-01-03 03:45:23,081][model8_pretrain.py][INFO] Epoch:[0/2](96200/4588595) loss:3.397 lr:0.0000100 epoch_Time:28280.0min: [2024-01-03 03:45:23,081][model8_pretrain.py][INFO] Epoch:[0/2](96200/4588595) loss:2.756 lr:0.0000100 epoch_Time:28280.0min: [2024-01-03 03:45:23,081][model8_pretrain.py][INFO] Epoch:[0/2](96200/4588595) loss:3.370 lr:0.0000100 epoch_Time:28280.0min: [2024-01-03 03:45:23,082][model8_pretrain.py][INFO] Epoch:[0/2](96200/4588595) loss:3.340 lr:0.0000100 epoch_Time:28280.0min: [2024-01-03 03:45:23,082][model8_pretrain.py][INFO] Epoch:[0/2](96200/4588595) loss:2.856 lr:0.0000100 epoch_Time:28280.0min: [2024-01-03 03:46:00,016][model8_pretrain.py][INFO] Epoch:[0/2](96300/4588595) loss:2.903 lr:0.0000100 epoch_Time:28278.0min: [2024-01-03 03:46:00,016][model8_pretrain.py][INFO] Epoch:[0/2](96300/4588595) loss:2.203 lr:0.0000100 epoch_Time:28278.0min: [2024-01-03 03:46:00,016][model8_pretrain.py][INFO] Epoch:[0/2](96300/4588595) 
loss:2.748 lr:0.0000100 epoch_Time:28278.0min: [2024-01-03 03:46:00,016][model8_pretrain.py][INFO] Epoch:[0/2](96300/4588595) loss:3.077 lr:0.0000100 epoch_Time:28278.0min: [2024-01-03 03:46:00,016][model8_pretrain.py][INFO] Epoch:[0/2](96300/4588595) loss:3.259 lr:0.0000100 epoch_Time:28278.0min: [2024-01-03 03:46:00,016][model8_pretrain.py][INFO] Epoch:[0/2](96300/4588595) loss:2.726 lr:0.0000100 epoch_Time:28278.0min: [2024-01-03 03:46:00,016][model8_pretrain.py][INFO] Epoch:[0/2](96300/4588595) loss:2.751 lr:0.0000100 epoch_Time:28278.0min: [2024-01-03 03:46:00,017][model8_pretrain.py][INFO] Epoch:[0/2](96300/4588595) loss:2.887 lr:0.0000100 epoch_Time:28278.0min: [2024-01-03 03:46:36,964][model8_pretrain.py][INFO] Epoch:[0/2](96400/4588595) loss:2.957 lr:0.0000100 epoch_Time:28278.0min: [2024-01-03 03:46:36,964][model8_pretrain.py][INFO] Epoch:[0/2](96400/4588595) loss:2.705 lr:0.0000100 epoch_Time:28278.0min: [2024-01-03 03:46:36,964][model8_pretrain.py][INFO] Epoch:[0/2](96400/4588595) loss:2.531 lr:0.0000100 epoch_Time:28278.0min: [2024-01-03 03:46:36,964][model8_pretrain.py][INFO] Epoch:[0/2](96400/4588595) loss:2.853 lr:0.0000100 epoch_Time:28278.0min: [2024-01-03 03:46:36,964][model8_pretrain.py][INFO] Epoch:[0/2](96400/4588595) loss:3.217 lr:0.0000100 epoch_Time:28278.0min: [2024-01-03 03:46:36,964][model8_pretrain.py][INFO] Epoch:[0/2](96400/4588595) loss:2.614 lr:0.0000100 epoch_Time:28278.0min: [2024-01-03 03:46:36,964][model8_pretrain.py][INFO] Epoch:[0/2](96400/4588595) loss:2.378 lr:0.0000100 epoch_Time:28278.0min: [2024-01-03 03:46:36,964][model8_pretrain.py][INFO] Epoch:[0/2](96400/4588595) loss:3.548 lr:0.0000100 epoch_Time:28278.0min: [2024-01-03 03:47:13,902][model8_pretrain.py][INFO] Epoch:[0/2](96500/4588595) loss:2.811 lr:0.0000100 epoch_Time:28276.0min: [2024-01-03 03:47:13,902][model8_pretrain.py][INFO] Epoch:[0/2](96500/4588595) loss:2.980 lr:0.0000100 epoch_Time:28276.0min: [2024-01-03 03:47:13,902][model8_pretrain.py][INFO] 
Epoch:[0/2](96500/4588595) loss:2.722 lr:0.0000100 epoch_Time:28276.0min: [2024-01-03 03:47:13,902][model8_pretrain.py][INFO] Epoch:[0/2](96500/4588595) loss:3.178 lr:0.0000100 epoch_Time:28276.0min: [2024-01-03 03:47:13,902][model8_pretrain.py][INFO] Epoch:[0/2](96500/4588595) loss:3.436 lr:0.0000100 epoch_Time:28276.0min: [2024-01-03 03:47:13,902][model8_pretrain.py][INFO] Epoch:[0/2](96500/4588595) loss:2.547 lr:0.0000100 epoch_Time:28276.0min: [2024-01-03 03:47:13,902][model8_pretrain.py][INFO] Epoch:[0/2](96500/4588595) loss:2.858 lr:0.0000100 epoch_Time:28276.0min: [2024-01-03 03:47:13,902][model8_pretrain.py][INFO] Epoch:[0/2](96500/4588595) loss:2.880 lr:0.0000100 epoch_Time:28276.0min: [2024-01-03 03:47:50,840][model8_pretrain.py][INFO] Epoch:[0/2](96600/4588595) loss:3.153 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:47:50,840][model8_pretrain.py][INFO] Epoch:[0/2](96600/4588595) loss:3.218 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:47:50,840][model8_pretrain.py][INFO] Epoch:[0/2](96600/4588595) loss:2.689 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:47:50,840][model8_pretrain.py][INFO] Epoch:[0/2](96600/4588595) loss:3.445 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:47:50,840][model8_pretrain.py][INFO] Epoch:[0/2](96600/4588595) loss:2.416 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:47:50,840][model8_pretrain.py][INFO] Epoch:[0/2](96600/4588595) loss:2.708 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:47:50,840][model8_pretrain.py][INFO] Epoch:[0/2](96600/4588595) loss:2.180 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:47:50,840][model8_pretrain.py][INFO] Epoch:[0/2](96600/4588595) loss:3.242 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:48:27,789][model8_pretrain.py][INFO] Epoch:[0/2](96700/4588595) loss:2.955 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:48:27,789][model8_pretrain.py][INFO] Epoch:[0/2](96700/4588595) loss:3.011 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 
03:48:27,789][model8_pretrain.py][INFO] Epoch:[0/2](96700/4588595) loss:2.864 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:48:27,789][model8_pretrain.py][INFO] Epoch:[0/2](96700/4588595) loss:2.248 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:48:27,789][model8_pretrain.py][INFO] Epoch:[0/2](96700/4588595) loss:2.367 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:48:27,789][model8_pretrain.py][INFO] Epoch:[0/2](96700/4588595) loss:2.060 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:48:27,789][model8_pretrain.py][INFO] Epoch:[0/2](96700/4588595) loss:2.668 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:48:27,789][model8_pretrain.py][INFO] Epoch:[0/2](96700/4588595) loss:3.552 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:49:04,732][model8_pretrain.py][INFO] Epoch:[0/2](96800/4588595) loss:2.946 lr:0.0000100 epoch_Time:28272.0min: [2024-01-03 03:49:04,732][model8_pretrain.py][INFO] Epoch:[0/2](96800/4588595) loss:2.792 lr:0.0000100 epoch_Time:28272.0min: [2024-01-03 03:49:04,732][model8_pretrain.py][INFO] Epoch:[0/2](96800/4588595) loss:2.959 lr:0.0000100 epoch_Time:28272.0min: [2024-01-03 03:49:04,732][model8_pretrain.py][INFO] Epoch:[0/2](96800/4588595) loss:2.823 lr:0.0000100 epoch_Time:28272.0min: [2024-01-03 03:49:04,732][model8_pretrain.py][INFO] Epoch:[0/2](96800/4588595) loss:2.867 lr:0.0000100 epoch_Time:28272.0min: [2024-01-03 03:49:04,732][model8_pretrain.py][INFO] Epoch:[0/2](96800/4588595) loss:2.647 lr:0.0000100 epoch_Time:28272.0min: [2024-01-03 03:49:04,732][model8_pretrain.py][INFO] Epoch:[0/2](96800/4588595) loss:3.232 lr:0.0000100 epoch_Time:28272.0min: [2024-01-03 03:49:04,732][model8_pretrain.py][INFO] Epoch:[0/2](96800/4588595) loss:3.137 lr:0.0000100 epoch_Time:28272.0min: [2024-01-03 03:49:50,335][model8_pretrain.py][INFO] Epoch:[0/2](96900/4588595) loss:3.251 lr:0.0000100 epoch_Time:28277.0min: [2024-01-03 03:49:50,335][model8_pretrain.py][INFO] Epoch:[0/2](96900/4588595) loss:3.311 lr:0.0000100 
epoch_Time:28277.0min: [2024-01-03 03:49:50,335][model8_pretrain.py][INFO] Epoch:[0/2](96900/4588595) loss:3.185 lr:0.0000100 epoch_Time:28277.0min: [2024-01-03 03:49:50,335][model8_pretrain.py][INFO] Epoch:[0/2](96900/4588595) loss:3.437 lr:0.0000100 epoch_Time:28277.0min: [2024-01-03 03:49:50,335][model8_pretrain.py][INFO] Epoch:[0/2](96900/4588595) loss:3.466 lr:0.0000100 epoch_Time:28277.0min: [2024-01-03 03:49:50,336][model8_pretrain.py][INFO] Epoch:[0/2](96900/4588595) loss:3.160 lr:0.0000100 epoch_Time:28277.0min: [2024-01-03 03:49:50,336][model8_pretrain.py][INFO] Epoch:[0/2](96900/4588595) loss:2.596 lr:0.0000100 epoch_Time:28277.0min: [2024-01-03 03:49:50,336][model8_pretrain.py][INFO] Epoch:[0/2](96900/4588595) loss:3.349 lr:0.0000100 epoch_Time:28277.0min: [2024-01-03 03:50:27,270][model8_pretrain.py][INFO] Epoch:[0/2](97000/4588595) loss:2.868 lr:0.0000100 epoch_Time:28277.0min: [2024-01-03 03:50:27,270][model8_pretrain.py][INFO] Epoch:[0/2](97000/4588595) loss:2.993 lr:0.0000100 epoch_Time:28277.0min: [2024-01-03 03:50:27,270][model8_pretrain.py][INFO] Epoch:[0/2](97000/4588595) loss:3.320 lr:0.0000100 epoch_Time:28277.0min: [2024-01-03 03:50:27,270][model8_pretrain.py][INFO] Epoch:[0/2](97000/4588595) loss:3.090 lr:0.0000100 epoch_Time:28277.0min: [2024-01-03 03:50:27,270][model8_pretrain.py][INFO] Epoch:[0/2](97000/4588595) loss:3.196 lr:0.0000100 epoch_Time:28277.0min: [2024-01-03 03:50:27,270][model8_pretrain.py][INFO] Epoch:[0/2](97000/4588595) loss:3.178 lr:0.0000100 epoch_Time:28277.0min: [2024-01-03 03:50:27,270][model8_pretrain.py][INFO] Epoch:[0/2](97000/4588595) loss:3.211 lr:0.0000100 epoch_Time:28277.0min: [2024-01-03 03:50:27,270][model8_pretrain.py][INFO] Epoch:[0/2](97000/4588595) loss:3.050 lr:0.0000100 epoch_Time:28277.0min: [2024-01-03 03:51:04,211][model8_pretrain.py][INFO] Epoch:[0/2](97100/4588595) loss:2.763 lr:0.0000100 epoch_Time:28275.0min: [2024-01-03 03:51:04,211][model8_pretrain.py][INFO] Epoch:[0/2](97100/4588595) 
loss:3.123 lr:0.0000100 epoch_Time:28275.0min: [2024-01-03 03:51:04,211][model8_pretrain.py][INFO] Epoch:[0/2](97100/4588595) loss:2.982 lr:0.0000100 epoch_Time:28275.0min: [2024-01-03 03:51:04,211][model8_pretrain.py][INFO] Epoch:[0/2](97100/4588595) loss:3.206 lr:0.0000100 epoch_Time:28275.0min: [2024-01-03 03:51:04,211][model8_pretrain.py][INFO] Epoch:[0/2](97100/4588595) loss:2.989 lr:0.0000100 epoch_Time:28275.0min: [2024-01-03 03:51:04,211][model8_pretrain.py][INFO] Epoch:[0/2](97100/4588595) loss:3.155 lr:0.0000100 epoch_Time:28275.0min: [2024-01-03 03:51:04,211][model8_pretrain.py][INFO] Epoch:[0/2](97100/4588595) loss:2.892 lr:0.0000100 epoch_Time:28275.0min: [2024-01-03 03:51:04,212][model8_pretrain.py][INFO] Epoch:[0/2](97100/4588595) loss:2.230 lr:0.0000100 epoch_Time:28275.0min: [2024-01-03 03:51:41,156][model8_pretrain.py][INFO] Epoch:[0/2](97200/4588595) loss:3.214 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:51:41,156][model8_pretrain.py][INFO] Epoch:[0/2](97200/4588595) loss:2.803 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:51:41,156][model8_pretrain.py][INFO] Epoch:[0/2](97200/4588595) loss:2.706 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:51:41,156][model8_pretrain.py][INFO] Epoch:[0/2](97200/4588595) loss:2.880 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:51:41,156][model8_pretrain.py][INFO] Epoch:[0/2](97200/4588595) loss:2.764 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:51:41,156][model8_pretrain.py][INFO] Epoch:[0/2](97200/4588595) loss:3.000 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:51:41,156][model8_pretrain.py][INFO] Epoch:[0/2](97200/4588595) loss:2.902 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:51:41,156][model8_pretrain.py][INFO] Epoch:[0/2](97200/4588595) loss:2.518 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:52:18,102][model8_pretrain.py][INFO] Epoch:[0/2](97300/4588595) loss:3.161 lr:0.0000100 epoch_Time:28273.0min: [2024-01-03 03:52:18,102][model8_pretrain.py][INFO] 
Epoch:[0/2](97300/4588595) loss:2.372 lr:0.0000100 epoch_Time:28273.0min: [2024-01-03 03:52:18,102][model8_pretrain.py][INFO] Epoch:[0/2](97300/4588595) loss:2.745 lr:0.0000100 epoch_Time:28273.0min: [2024-01-03 03:52:18,102][model8_pretrain.py][INFO] Epoch:[0/2](97300/4588595) loss:2.712 lr:0.0000100 epoch_Time:28273.0min: [2024-01-03 03:52:18,102][model8_pretrain.py][INFO] Epoch:[0/2](97300/4588595) loss:2.742 lr:0.0000100 epoch_Time:28273.0min: [2024-01-03 03:52:18,102][model8_pretrain.py][INFO] Epoch:[0/2](97300/4588595) loss:3.554 lr:0.0000100 epoch_Time:28273.0min: [2024-01-03 03:52:18,102][model8_pretrain.py][INFO] Epoch:[0/2](97300/4588595) loss:2.865 lr:0.0000100 epoch_Time:28273.0min: [2024-01-03 03:52:18,102][model8_pretrain.py][INFO] Epoch:[0/2](97300/4588595) loss:3.222 lr:0.0000100 epoch_Time:28273.0min: [2024-01-03 03:52:55,031][model8_pretrain.py][INFO] Epoch:[0/2](97400/4588595) loss:3.371 lr:0.0000100 epoch_Time:28271.0min: [2024-01-03 03:52:55,031][model8_pretrain.py][INFO] Epoch:[0/2](97400/4588595) loss:2.753 lr:0.0000100 epoch_Time:28271.0min: [2024-01-03 03:52:55,031][model8_pretrain.py][INFO] Epoch:[0/2](97400/4588595) loss:3.197 lr:0.0000100 epoch_Time:28271.0min: [2024-01-03 03:52:55,031][model8_pretrain.py][INFO] Epoch:[0/2](97400/4588595) loss:3.154 lr:0.0000100 epoch_Time:28271.0min: [2024-01-03 03:52:55,031][model8_pretrain.py][INFO] Epoch:[0/2](97400/4588595) loss:3.397 lr:0.0000100 epoch_Time:28271.0min: [2024-01-03 03:52:55,031][model8_pretrain.py][INFO] Epoch:[0/2](97400/4588595) loss:2.906 lr:0.0000100 epoch_Time:28271.0min: [2024-01-03 03:52:55,031][model8_pretrain.py][INFO] Epoch:[0/2](97400/4588595) loss:2.833 lr:0.0000100 epoch_Time:28271.0min: [2024-01-03 03:52:55,031][model8_pretrain.py][INFO] Epoch:[0/2](97400/4588595) loss:3.334 lr:0.0000100 epoch_Time:28271.0min: [2024-01-03 03:53:31,942][model8_pretrain.py][INFO] Epoch:[0/2](97500/4588595) loss:2.892 lr:0.0000100 epoch_Time:28270.0min: [2024-01-03 
03:53:31,942][model8_pretrain.py][INFO] Epoch:[0/2](97500/4588595) loss:3.140 lr:0.0000100 epoch_Time:28270.0min: [2024-01-03 03:53:31,942][model8_pretrain.py][INFO] Epoch:[0/2](97500/4588595) loss:3.340 lr:0.0000100 epoch_Time:28270.0min: [2024-01-03 03:53:31,942][model8_pretrain.py][INFO] Epoch:[0/2](97500/4588595) loss:2.940 lr:0.0000100 epoch_Time:28270.0min: [2024-01-03 03:53:31,942][model8_pretrain.py][INFO] Epoch:[0/2](97500/4588595) loss:2.880 lr:0.0000100 epoch_Time:28270.0min: [2024-01-03 03:53:31,942][model8_pretrain.py][INFO] Epoch:[0/2](97500/4588595) loss:2.981 lr:0.0000100 epoch_Time:28270.0min: [2024-01-03 03:53:31,942][model8_pretrain.py][INFO] Epoch:[0/2](97500/4588595) loss:3.376 lr:0.0000100 epoch_Time:28270.0min: [2024-01-03 03:53:31,942][model8_pretrain.py][INFO] Epoch:[0/2](97500/4588595) loss:2.302 lr:0.0000100 epoch_Time:28270.0min: [2024-01-03 03:54:08,929][model8_pretrain.py][INFO] Epoch:[0/2](97600/4588595) loss:2.498 lr:0.0000100 epoch_Time:28269.0min: [2024-01-03 03:54:08,929][model8_pretrain.py][INFO] Epoch:[0/2](97600/4588595) loss:3.001 lr:0.0000100 epoch_Time:28269.0min: [2024-01-03 03:54:08,929][model8_pretrain.py][INFO] Epoch:[0/2](97600/4588595) loss:3.235 lr:0.0000100 epoch_Time:28269.0min: [2024-01-03 03:54:08,929][model8_pretrain.py][INFO] Epoch:[0/2](97600/4588595) loss:2.959 lr:0.0000100 epoch_Time:28269.0min: [2024-01-03 03:54:08,929][model8_pretrain.py][INFO] Epoch:[0/2](97600/4588595) loss:3.037 lr:0.0000100 epoch_Time:28269.0min: [2024-01-03 03:54:08,929][model8_pretrain.py][INFO] Epoch:[0/2](97600/4588595) loss:3.230 lr:0.0000100 epoch_Time:28269.0min: [2024-01-03 03:54:08,929][model8_pretrain.py][INFO] Epoch:[0/2](97600/4588595) loss:2.923 lr:0.0000100 epoch_Time:28269.0min: [2024-01-03 03:54:08,929][model8_pretrain.py][INFO] Epoch:[0/2](97600/4588595) loss:3.266 lr:0.0000100 epoch_Time:28269.0min: [2024-01-03 03:54:54,559][model8_pretrain.py][INFO] Epoch:[0/2](97700/4588595) loss:3.256 lr:0.0000100 
epoch_Time:28274.0min: [2024-01-03 03:54:54,559][model8_pretrain.py][INFO] Epoch:[0/2](97700/4588595) loss:3.032 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:54:54,559][model8_pretrain.py][INFO] Epoch:[0/2](97700/4588595) loss:2.845 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:54:54,559][model8_pretrain.py][INFO] Epoch:[0/2](97700/4588595) loss:2.976 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:54:54,559][model8_pretrain.py][INFO] Epoch:[0/2](97700/4588595) loss:3.200 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:54:54,559][model8_pretrain.py][INFO] Epoch:[0/2](97700/4588595) loss:2.787 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:54:54,559][model8_pretrain.py][INFO] Epoch:[0/2](97700/4588595) loss:2.835 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:54:54,559][model8_pretrain.py][INFO] Epoch:[0/2](97700/4588595) loss:2.958 lr:0.0000100 epoch_Time:28274.0min: [2024-01-03 03:55:31,514][model8_pretrain.py][INFO] Epoch:[0/2](97800/4588595) loss:2.688 lr:0.0000100 epoch_Time:28273.0min: [2024-01-03 03:55:31,514][model8_pretrain.py][INFO] Epoch:[0/2](97800/4588595) loss:2.542 lr:0.0000100 epoch_Time:28273.0min: [2024-01-03 03:55:31,514][model8_pretrain.py][INFO] Epoch:[0/2](97800/4588595) loss:2.764 lr:0.0000100 epoch_Time:28273.0min: [2024-01-03 03:55:31,514][model8_pretrain.py][INFO] Epoch:[0/2](97800/4588595) loss:2.970 lr:0.0000100 epoch_Time:28273.0min: [2024-01-03 03:55:31,514][model8_pretrain.py][INFO] Epoch:[0/2](97800/4588595) loss:2.627 lr:0.0000100 epoch_Time:28273.0min: [2024-01-03 03:55:31,514][model8_pretrain.py][INFO] Epoch:[0/2](97800/4588595) loss:2.405 lr:0.0000100 epoch_Time:28273.0min: [2024-01-03 03:55:31,514][model8_pretrain.py][INFO] Epoch:[0/2](97800/4588595) loss:2.660 lr:0.0000100 epoch_Time:28273.0min: [2024-01-03 03:55:31,514][model8_pretrain.py][INFO] Epoch:[0/2](97800/4588595) loss:3.420 lr:0.0000100 epoch_Time:28273.0min: [2024-01-03 03:56:08,446][model8_pretrain.py][INFO] Epoch:[0/2](97900/4588595) 
loss:2.690 lr:0.0000100 epoch_Time:28272.0min: [2024-01-03 03:56:08,446][model8_pretrain.py][INFO] Epoch:[0/2](97900/4588595) loss:3.327 lr:0.0000100 epoch_Time:28272.0min: [2024-01-03 03:56:08,446][model8_pretrain.py][INFO] Epoch:[0/2](97900/4588595) loss:3.140 lr:0.0000100 epoch_Time:28272.0min: [2024-01-03 03:56:08,446][model8_pretrain.py][INFO] Epoch:[0/2](97900/4588595) loss:3.027 lr:0.0000100 epoch_Time:28272.0min: [2024-01-03 03:56:08,446][model8_pretrain.py][INFO] Epoch:[0/2](97900/4588595) loss:3.386 lr:0.0000100 epoch_Time:28272.0min: [2024-01-03 03:56:08,446][model8_pretrain.py][INFO] Epoch:[0/2](97900/4588595) loss:3.255 lr:0.0000100 epoch_Time:28272.0min: [2024-01-03 03:56:08,446][model8_pretrain.py][INFO] Epoch:[0/2](97900/4588595) loss:2.841 lr:0.0000100 epoch_Time:28272.0min: [2024-01-03 03:56:08,446][model8_pretrain.py][INFO] Epoch:[0/2](97900/4588595) loss:3.431 lr:0.0000100 epoch_Time:28272.0min: [2024-01-03 03:56:45,388][model8_pretrain.py][INFO] Epoch:[0/2](98000/4588595) loss:3.311 lr:0.0000100 epoch_Time:28271.0min: [2024-01-03 03:56:45,388][model8_pretrain.py][INFO] Epoch:[0/2](98000/4588595) loss:2.305 lr:0.0000100 epoch_Time:28271.0min: [2024-01-03 03:56:45,388][model8_pretrain.py][INFO] Epoch:[0/2](98000/4588595) loss:3.147 lr:0.0000100 epoch_Time:28271.0min: [2024-01-03 03:56:45,388][model8_pretrain.py][INFO] Epoch:[0/2](98000/4588595) loss:3.087 lr:0.0000100 epoch_Time:28271.0min: [2024-01-03 03:56:45,388][model8_pretrain.py][INFO] Epoch:[0/2](98000/4588595) loss:3.051 lr:0.0000100 epoch_Time:28271.0min: [2024-01-03 03:56:45,388][model8_pretrain.py][INFO] Epoch:[0/2](98000/4588595) loss:3.132 lr:0.0000100 epoch_Time:28271.0min: [2024-01-03 03:56:45,388][model8_pretrain.py][INFO] Epoch:[0/2](98000/4588595) loss:3.076 lr:0.0000100 epoch_Time:28271.0min: [2024-01-03 03:56:45,388][model8_pretrain.py][INFO] Epoch:[0/2](98000/4588595) loss:2.834 lr:0.0000100 epoch_Time:28271.0min: [2024-01-03 03:57:22,333][model8_pretrain.py][INFO] 
Epoch:[0/2](98100/4588595) loss:3.264 lr:0.0000100 epoch_Time:28269.0min: [2024-01-03 03:57:22,333][model8_pretrain.py][INFO] Epoch:[0/2](98100/4588595) loss:3.034 lr:0.0000100 epoch_Time:28269.0min: [2024-01-03 03:57:22,333][model8_pretrain.py][INFO] Epoch:[0/2](98100/4588595) loss:2.647 lr:0.0000100 epoch_Time:28269.0min: [2024-01-03 03:57:22,333][model8_pretrain.py][INFO] Epoch:[0/2](98100/4588595) loss:3.269 lr:0.0000100 epoch_Time:28269.0min: [2024-01-03 03:57:22,333][model8_pretrain.py][INFO] Epoch:[0/2](98100/4588595) loss:2.924 lr:0.0000100 epoch_Time:28269.0min: [2024-01-03 03:57:22,333][model8_pretrain.py][INFO] Epoch:[0/2](98100/4588595) loss:2.570 lr:0.0000100 epoch_Time:28269.0min: [2024-01-03 03:57:22,333][model8_pretrain.py][INFO] Epoch:[0/2](98100/4588595) loss:2.936 lr:0.0000100 epoch_Time:28269.0min: [2024-01-03 03:57:22,333][model8_pretrain.py][INFO] Epoch:[0/2](98100/4588595) loss:2.617 lr:0.0000100 epoch_Time:28269.0min: [2024-01-03 03:57:59,280][model8_pretrain.py][INFO] Epoch:[0/2](98200/4588595) loss:2.745 lr:0.0000100 epoch_Time:28268.0min: [2024-01-03 03:57:59,280][model8_pretrain.py][INFO] Epoch:[0/2](98200/4588595) loss:2.844 lr:0.0000100 epoch_Time:28268.0min: [2024-01-03 03:57:59,280][model8_pretrain.py][INFO] Epoch:[0/2](98200/4588595) loss:2.607 lr:0.0000100 epoch_Time:28268.0min: [2024-01-03 03:57:59,280][model8_pretrain.py][INFO] Epoch:[0/2](98200/4588595) loss:3.218 lr:0.0000100 epoch_Time:28268.0min: [2024-01-03 03:57:59,280][model8_pretrain.py][INFO] Epoch:[0/2](98200/4588595) loss:2.891 lr:0.0000100 epoch_Time:28268.0min: [2024-01-03 03:57:59,280][model8_pretrain.py][INFO] Epoch:[0/2](98200/4588595) loss:3.425 lr:0.0000100 epoch_Time:28268.0min: [2024-01-03 03:57:59,280][model8_pretrain.py][INFO] Epoch:[0/2](98200/4588595) loss:2.616 lr:0.0000100 epoch_Time:28268.0min: [2024-01-03 03:57:59,280][model8_pretrain.py][INFO] Epoch:[0/2](98200/4588595) loss:3.220 lr:0.0000100 epoch_Time:28268.0min: [2024-01-03 
03:58:36,252][model8_pretrain.py][INFO] Epoch:[0/2](98300/4588595) loss:2.794 lr:0.0000100 epoch_Time:28267.0min: [2024-01-03 03:58:36,252][model8_pretrain.py][INFO] Epoch:[0/2](98300/4588595) loss:2.861 lr:0.0000100 epoch_Time:28267.0min: [2024-01-03 03:58:36,252][model8_pretrain.py][INFO] Epoch:[0/2](98300/4588595) loss:2.886 lr:0.0000100 epoch_Time:28267.0min: [2024-01-03 03:58:36,252][model8_pretrain.py][INFO] Epoch:[0/2](98300/4588595) loss:3.202 lr:0.0000100 epoch_Time:28267.0min: [2024-01-03 03:58:36,252][model8_pretrain.py][INFO] Epoch:[0/2](98300/4588595) loss:2.975 lr:0.0000100 epoch_Time:28267.0min: [2024-01-03 03:58:36,252][model8_pretrain.py][INFO] Epoch:[0/2](98300/4588595) loss:3.056 lr:0.0000100 epoch_Time:28267.0min: [2024-01-03 03:58:36,252][model8_pretrain.py][INFO] Epoch:[0/2](98300/4588595) loss:3.287 lr:0.0000100 epoch_Time:28267.0min: [2024-01-03 03:58:36,252][model8_pretrain.py][INFO] Epoch:[0/2](98300/4588595) loss:3.033 lr:0.0000100 epoch_Time:28267.0min: [2024-01-03 03:59:13,205][model8_pretrain.py][INFO] Epoch:[0/2](98400/4588595) loss:2.652 lr:0.0000100 epoch_Time:28265.0min: [2024-01-03 03:59:13,205][model8_pretrain.py][INFO] Epoch:[0/2](98400/4588595) loss:3.091 lr:0.0000100 epoch_Time:28265.0min: [2024-01-03 03:59:13,205][model8_pretrain.py][INFO] Epoch:[0/2](98400/4588595) loss:2.683 lr:0.0000100 epoch_Time:28265.0min: [2024-01-03 03:59:13,205][model8_pretrain.py][INFO] Epoch:[0/2](98400/4588595) loss:3.260 lr:0.0000100 epoch_Time:28265.0min: [2024-01-03 03:59:13,205][model8_pretrain.py][INFO] Epoch:[0/2](98400/4588595) loss:2.967 lr:0.0000100 epoch_Time:28265.0min: [2024-01-03 03:59:13,205][model8_pretrain.py][INFO] Epoch:[0/2](98400/4588595) loss:2.887 lr:0.0000100 epoch_Time:28265.0min: [2024-01-03 03:59:13,205][model8_pretrain.py][INFO] Epoch:[0/2](98400/4588595) loss:2.865 lr:0.0000100 epoch_Time:28265.0min: [2024-01-03 03:59:13,205][model8_pretrain.py][INFO] Epoch:[0/2](98400/4588595) loss:3.285 lr:0.0000100 
epoch_Time:28265.0min: [2024-01-03 03:59:59,009][model8_pretrain.py][INFO] Epoch:[0/2](98500/4588595) loss:3.471 lr:0.0000100 epoch_Time:28271.0min: [2024-01-03 03:59:59,009][model8_pretrain.py][INFO] Epoch:[0/2](98500/4588595) loss:2.634 lr:0.0000100 epoch_Time:28271.0min: [2024-01-03 03:59:59,009][model8_pretrain.py][INFO] Epoch:[0/2](98500/4588595) loss:3.205 lr:0.0000100 epoch_Time:28271.0min: [2024-01-03 03:59:59,009][model8_pretrain.py][INFO] Epoch:[0/2](98500/4588595) loss:3.082 lr:0.0000100 epoch_Time:28271.0min: [2024-01-03 03:59:59,010][model8_pretrain.py][INFO] Epoch:[0/2](98500/4588595) loss:2.417 lr:0.0000100 epoch_Time:28271.0min: [2024-01-03 03:59:59,010][model8_pretrain.py][INFO] Epoch:[0/2](98500/4588595) loss:3.141 lr:0.0000100 epoch_Time:28271.0min: [2024-01-03 03:59:59,010][model8_pretrain.py][INFO] Epoch:[0/2](98500/4588595) loss:2.747 lr:0.0000100 epoch_Time:28271.0min: [2024-01-03 03:59:59,010][model8_pretrain.py][INFO] Epoch:[0/2](98500/4588595) loss:2.653 lr:0.0000100 epoch_Time:28271.0min: [2024-01-03 04:00:35,946][model8_pretrain.py][INFO] Epoch:[0/2](98600/4588595) loss:2.330 lr:0.0000100 epoch_Time:28270.0min: [2024-01-03 04:00:35,946][model8_pretrain.py][INFO] Epoch:[0/2](98600/4588595) loss:2.880 lr:0.0000100 epoch_Time:28270.0min: [2024-01-03 04:00:35,946][model8_pretrain.py][INFO] Epoch:[0/2](98600/4588595) loss:3.112 lr:0.0000100 epoch_Time:28270.0min: [2024-01-03 04:00:35,946][model8_pretrain.py][INFO] Epoch:[0/2](98600/4588595) loss:3.250 lr:0.0000100 epoch_Time:28270.0min: [2024-01-03 04:00:35,946][model8_pretrain.py][INFO] Epoch:[0/2](98600/4588595) loss:3.190 lr:0.0000100 epoch_Time:28270.0min: [2024-01-03 04:00:35,946][model8_pretrain.py][INFO] Epoch:[0/2](98600/4588595) loss:3.081 lr:0.0000100 epoch_Time:28270.0min: [2024-01-03 04:00:35,946][model8_pretrain.py][INFO] Epoch:[0/2](98600/4588595) loss:2.371 lr:0.0000100 epoch_Time:28270.0min: [2024-01-03 04:00:35,946][model8_pretrain.py][INFO] Epoch:[0/2](98600/4588595) 
loss:2.434 lr:0.0000100 epoch_Time:28270.0min: [2024-01-03 04:01:12,888][model8_pretrain.py][INFO] Epoch:[0/2](98700/4588595) loss:3.103 lr:0.0000100 epoch_Time:28268.0min: [2024-01-03 04:01:12,888][model8_pretrain.py][INFO] Epoch:[0/2](98700/4588595) loss:3.259 lr:0.0000100 epoch_Time:28268.0min: [2024-01-03 04:01:12,888][model8_pretrain.py][INFO] Epoch:[0/2](98700/4588595) loss:3.317 lr:0.0000100 epoch_Time:28268.0min: [2024-01-03 04:01:12,888][model8_pretrain.py][INFO] Epoch:[0/2](98700/4588595) loss:2.829 lr:0.0000100 epoch_Time:28268.0min: [2024-01-03 04:01:12,888][model8_pretrain.py][INFO] Epoch:[0/2](98700/4588595) loss:2.998 lr:0.0000100 epoch_Time:28268.0min: [2024-01-03 04:01:12,888][model8_pretrain.py][INFO] Epoch:[0/2](98700/4588595) loss:3.209 lr:0.0000100 epoch_Time:28268.0min: [2024-01-03 04:01:12,888][model8_pretrain.py][INFO] Epoch:[0/2](98700/4588595) loss:2.910 lr:0.0000100 epoch_Time:28268.0min: [2024-01-03 04:01:12,888][model8_pretrain.py][INFO] Epoch:[0/2](98700/4588595) loss:2.758 lr:0.0000100 epoch_Time:28268.0min: [2024-01-03 04:01:49,836][model8_pretrain.py][INFO] Epoch:[0/2](98800/4588595) loss:3.056 lr:0.0000100 epoch_Time:28267.0min: [2024-01-03 04:01:49,836][model8_pretrain.py][INFO] Epoch:[0/2](98800/4588595) loss:3.079 lr:0.0000100 epoch_Time:28267.0min: [2024-01-03 04:01:49,836][model8_pretrain.py][INFO] Epoch:[0/2](98800/4588595) loss:2.681 lr:0.0000100 epoch_Time:28267.0min: [2024-01-03 04:01:49,836][model8_pretrain.py][INFO] Epoch:[0/2](98800/4588595) loss:3.084 lr:0.0000100 epoch_Time:28267.0min: [2024-01-03 04:01:49,836][model8_pretrain.py][INFO] Epoch:[0/2](98800/4588595) loss:2.878 lr:0.0000100 epoch_Time:28267.0min: [2024-01-03 04:01:49,836][model8_pretrain.py][INFO] Epoch:[0/2](98800/4588595) loss:3.206 lr:0.0000100 epoch_Time:28267.0min: [2024-01-03 04:01:49,836][model8_pretrain.py][INFO] Epoch:[0/2](98800/4588595) loss:3.071 lr:0.0000100 epoch_Time:28267.0min: [2024-01-03 04:01:49,836][model8_pretrain.py][INFO] 
Epoch:[0/2](98800/4588595) loss:2.601 lr:0.0000100 epoch_Time:28267.0min: [2024-01-03 04:02:26,779][model8_pretrain.py][INFO] Epoch:[0/2](98900/4588595) loss:2.955 lr:0.0000100 epoch_Time:28266.0min: [2024-01-03 04:02:26,779][model8_pretrain.py][INFO] Epoch:[0/2](98900/4588595) loss:2.898 lr:0.0000100 epoch_Time:28266.0min: [2024-01-03 04:02:26,779][model8_pretrain.py][INFO] Epoch:[0/2](98900/4588595) loss:2.761 lr:0.0000100 epoch_Time:28266.0min: [2024-01-03 04:02:26,779][model8_pretrain.py][INFO] Epoch:[0/2](98900/4588595) loss:2.481 lr:0.0000100 epoch_Time:28266.0min: [2024-01-03 04:02:26,779][model8_pretrain.py][INFO] Epoch:[0/2](98900/4588595) loss:3.013 lr:0.0000100 epoch_Time:28266.0min: [2024-01-03 04:02:26,779][model8_pretrain.py][INFO] Epoch:[0/2](98900/4588595) loss:2.830 lr:0.0000100 epoch_Time:28266.0min: [2024-01-03 04:02:26,779][model8_pretrain.py][INFO] Epoch:[0/2](98900/4588595) loss:2.469 lr:0.0000100 epoch_Time:28266.0min: [2024-01-03 04:02:26,779][model8_pretrain.py][INFO] Epoch:[0/2](98900/4588595) loss:3.111 lr:0.0000100 epoch_Time:28266.0min: [2024-01-03 04:03:03,724][model8_pretrain.py][INFO] Epoch:[0/2](99000/4588595) loss:2.694 lr:0.0000100 epoch_Time:28264.0min: [2024-01-03 04:03:03,724][model8_pretrain.py][INFO] Epoch:[0/2](99000/4588595) loss:3.461 lr:0.0000100 epoch_Time:28264.0min: [2024-01-03 04:03:03,724][model8_pretrain.py][INFO] Epoch:[0/2](99000/4588595) loss:2.456 lr:0.0000100 epoch_Time:28264.0min: [2024-01-03 04:03:03,724][model8_pretrain.py][INFO] Epoch:[0/2](99000/4588595) loss:2.416 lr:0.0000100 epoch_Time:28264.0min: [2024-01-03 04:03:03,724][model8_pretrain.py][INFO] Epoch:[0/2](99000/4588595) loss:3.413 lr:0.0000100 epoch_Time:28264.0min: [2024-01-03 04:03:03,724][model8_pretrain.py][INFO] Epoch:[0/2](99000/4588595) loss:2.837 lr:0.0000100 epoch_Time:28264.0min: [2024-01-03 04:03:03,724][model8_pretrain.py][INFO] Epoch:[0/2](99000/4588595) loss:2.971 lr:0.0000100 epoch_Time:28264.0min: [2024-01-03 
04:03:03,724][model8_pretrain.py][INFO] Epoch:[0/2](99000/4588595) loss:2.413 lr:0.0000100 epoch_Time:28264.0min: [2024-01-03 04:03:40,659][model8_pretrain.py][INFO] Epoch:[0/2](99100/4588595) loss:2.816 lr:0.0000100 epoch_Time:28264.0min: [2024-01-03 04:03:40,659][model8_pretrain.py][INFO] Epoch:[0/2](99100/4588595) loss:3.080 lr:0.0000100 epoch_Time:28264.0min: [2024-01-03 04:03:40,659][model8_pretrain.py][INFO] Epoch:[0/2](99100/4588595) loss:3.397 lr:0.0000100 epoch_Time:28264.0min: [2024-01-03 04:03:40,659][model8_pretrain.py][INFO] Epoch:[0/2](99100/4588595) loss:3.024 lr:0.0000100 epoch_Time:28264.0min: [2024-01-03 04:03:40,660][model8_pretrain.py][INFO] Epoch:[0/2](99100/4588595) loss:3.103 lr:0.0000100 epoch_Time:28264.0min: [2024-01-03 04:03:40,660][model8_pretrain.py][INFO] Epoch:[0/2](99100/4588595) loss:2.512 lr:0.0000100 epoch_Time:28264.0min: [2024-01-03 04:03:40,660][model8_pretrain.py][INFO] Epoch:[0/2](99100/4588595) loss:2.913 lr:0.0000100 epoch_Time:28264.0min: [2024-01-03 04:03:40,660][model8_pretrain.py][INFO] Epoch:[0/2](99100/4588595) loss:3.388 lr:0.0000100 epoch_Time:28264.0min: [2024-01-03 04:04:17,600][model8_pretrain.py][INFO] Epoch:[0/2](99200/4588595) loss:3.249 lr:0.0000100 epoch_Time:28262.0min: [2024-01-03 04:04:17,600][model8_pretrain.py][INFO] Epoch:[0/2](99200/4588595) loss:2.896 lr:0.0000100 epoch_Time:28262.0min: [2024-01-03 04:04:17,600][model8_pretrain.py][INFO] Epoch:[0/2](99200/4588595) loss:3.004 lr:0.0000100 epoch_Time:28262.0min: [2024-01-03 04:04:17,600][model8_pretrain.py][INFO] Epoch:[0/2](99200/4588595) loss:3.230 lr:0.0000100 epoch_Time:28262.0min: [2024-01-03 04:04:17,600][model8_pretrain.py][INFO] Epoch:[0/2](99200/4588595) loss:2.823 lr:0.0000100 epoch_Time:28262.0min: [2024-01-03 04:04:17,600][model8_pretrain.py][INFO] Epoch:[0/2](99200/4588595) loss:3.130 lr:0.0000100 epoch_Time:28262.0min: [2024-01-03 04:04:17,600][model8_pretrain.py][INFO] Epoch:[0/2](99200/4588595) loss:2.975 lr:0.0000100 
epoch_Time:28262.0min: [2024-01-03 04:04:17,600][model8_pretrain.py][INFO] Epoch:[0/2](99200/4588595) loss:3.146 lr:0.0000100 epoch_Time:28262.0min: [2024-01-03 04:05:03,251][model8_pretrain.py][INFO] Epoch:[0/2](99300/4588595) loss:2.398 lr:0.0000100 epoch_Time:28267.0min: [2024-01-03 04:05:03,251][model8_pretrain.py][INFO] Epoch:[0/2](99300/4588595) loss:2.666 lr:0.0000100 epoch_Time:28267.0min: [2024-01-03 04:05:03,251][model8_pretrain.py][INFO] Epoch:[0/2](99300/4588595) loss:3.592 lr:0.0000100 epoch_Time:28267.0min: [2024-01-03 04:05:03,251][model8_pretrain.py][INFO] Epoch:[0/2](99300/4588595) loss:2.916 lr:0.0000100 epoch_Time:28267.0min: [2024-01-03 04:05:03,251][model8_pretrain.py][INFO] Epoch:[0/2](99300/4588595) loss:2.723 lr:0.0000100 epoch_Time:28267.0min: [2024-01-03 04:05:03,251][model8_pretrain.py][INFO] Epoch:[0/2](99300/4588595) loss:2.569 lr:0.0000100 epoch_Time:28267.0min: [2024-01-03 04:05:03,251][model8_pretrain.py][INFO] Epoch:[0/2](99300/4588595) loss:3.066 lr:0.0000100 epoch_Time:28267.0min: [2024-01-03 04:05:03,251][model8_pretrain.py][INFO] Epoch:[0/2](99300/4588595) loss:2.876 lr:0.0000100 epoch_Time:28267.0min: [2024-01-03 04:05:40,184][model8_pretrain.py][INFO] Epoch:[0/2](99400/4588595) loss:3.453 lr:0.0000100 epoch_Time:28267.0min: [2024-01-03 04:05:40,184][model8_pretrain.py][INFO] Epoch:[0/2](99400/4588595) loss:3.101 lr:0.0000100 epoch_Time:28266.0min: [2024-01-03 04:05:40,184][model8_pretrain.py][INFO] Epoch:[0/2](99400/4588595) loss:2.936 lr:0.0000100 epoch_Time:28266.0min: [2024-01-03 04:05:40,184][model8_pretrain.py][INFO] Epoch:[0/2](99400/4588595) loss:3.046 lr:0.0000100 epoch_Time:28266.0min: [2024-01-03 04:05:40,184][model8_pretrain.py][INFO] Epoch:[0/2](99400/4588595) loss:2.718 lr:0.0000100 epoch_Time:28266.0min: [2024-01-03 04:05:40,184][model8_pretrain.py][INFO] Epoch:[0/2](99400/4588595) loss:2.655 lr:0.0000100 epoch_Time:28267.0min: [2024-01-03 04:05:40,185][model8_pretrain.py][INFO] Epoch:[0/2](99400/4588595) 
loss:3.286 lr:0.0000100 epoch_Time:28266.0min: [2024-01-03 04:05:40,185][model8_pretrain.py][INFO] Epoch:[0/2](99400/4588595) loss:2.942 lr:0.0000100 epoch_Time:28266.0min: [2024-01-03 04:06:17,128][model8_pretrain.py][INFO] Epoch:[0/2](99500/4588595) loss:3.498 lr:0.0000100 epoch_Time:28265.0min: [2024-01-03 04:06:17,128][model8_pretrain.py][INFO] Epoch:[0/2](99500/4588595) loss:3.128 lr:0.0000100 epoch_Time:28265.0min: [2024-01-03 04:06:17,128][model8_pretrain.py][INFO] Epoch:[0/2](99500/4588595) loss:2.803 lr:0.0000100 epoch_Time:28265.0min: [2024-01-03 04:06:17,128][model8_pretrain.py][INFO] Epoch:[0/2](99500/4588595) loss:3.190 lr:0.0000100 epoch_Time:28265.0min: [2024-01-03 04:06:17,128][model8_pretrain.py][INFO] Epoch:[0/2](99500/4588595) loss:3.515 lr:0.0000100 epoch_Time:28265.0min: [2024-01-03 04:06:17,128][model8_pretrain.py][INFO] Epoch:[0/2](99500/4588595) loss:3.535 lr:0.0000100 epoch_Time:28265.0min: [2024-01-03 04:06:17,128][model8_pretrain.py][INFO] Epoch:[0/2](99500/4588595) loss:2.996 lr:0.0000100 epoch_Time:28265.0min: [2024-01-03 04:06:17,128][model8_pretrain.py][INFO] Epoch:[0/2](99500/4588595) loss:3.356 lr:0.0000100 epoch_Time:28265.0min: [2024-01-03 04:06:54,068][model8_pretrain.py][INFO] Epoch:[0/2](99600/4588595) loss:3.103 lr:0.0000100 epoch_Time:28263.0min: [2024-01-03 04:06:54,068][model8_pretrain.py][INFO] Epoch:[0/2](99600/4588595) loss:3.181 lr:0.0000100 epoch_Time:28263.0min: [2024-01-03 04:06:54,068][model8_pretrain.py][INFO] Epoch:[0/2](99600/4588595) loss:3.350 lr:0.0000100 epoch_Time:28263.0min: [2024-01-03 04:06:54,068][model8_pretrain.py][INFO] Epoch:[0/2](99600/4588595) loss:2.591 lr:0.0000100 epoch_Time:28263.0min: [2024-01-03 04:06:54,068][model8_pretrain.py][INFO] Epoch:[0/2](99600/4588595) loss:3.200 lr:0.0000100 epoch_Time:28263.0min: [2024-01-03 04:06:54,068][model8_pretrain.py][INFO] Epoch:[0/2](99600/4588595) loss:3.318 lr:0.0000100 epoch_Time:28263.0min: [2024-01-03 04:06:54,068][model8_pretrain.py][INFO] 
Epoch:[0/2](99600/4588595) loss:3.133 lr:0.0000100 epoch_Time:28263.0min: [2024-01-03 04:06:54,068][model8_pretrain.py][INFO] Epoch:[0/2](99600/4588595) loss:3.192 lr:0.0000100 epoch_Time:28263.0min: [2024-01-03 04:07:31,013][model8_pretrain.py][INFO] Epoch:[0/2](99700/4588595) loss:3.080 lr:0.0000100 epoch_Time:28263.0min: [2024-01-03 04:07:31,013][model8_pretrain.py][INFO] Epoch:[0/2](99700/4588595) loss:2.508 lr:0.0000100 epoch_Time:28263.0min: [2024-01-03 04:07:31,013][model8_pretrain.py][INFO] Epoch:[0/2](99700/4588595) loss:2.944 lr:0.0000100 epoch_Time:28263.0min: [2024-01-03 04:07:31,013][model8_pretrain.py][INFO] Epoch:[0/2](99700/4588595) loss:2.748 lr:0.0000100 epoch_Time:28263.0min: [2024-01-03 04:07:31,013][model8_pretrain.py][INFO] Epoch:[0/2](99700/4588595) loss:2.960 lr:0.0000100 epoch_Time:28263.0min: [2024-01-03 04:07:31,013][model8_pretrain.py][INFO] Epoch:[0/2](99700/4588595) loss:3.133 lr:0.0000100 epoch_Time:28263.0min: [2024-01-03 04:07:31,013][model8_pretrain.py][INFO] Epoch:[0/2](99700/4588595) loss:2.276 lr:0.0000100 epoch_Time:28263.0min: [2024-01-03 04:07:31,013][model8_pretrain.py][INFO] Epoch:[0/2](99700/4588595) loss:2.999 lr:0.0000100 epoch_Time:28263.0min: [2024-01-03 04:08:07,942][model8_pretrain.py][INFO] Epoch:[0/2](99800/4588595) loss:3.429 lr:0.0000100 epoch_Time:28261.0min: [2024-01-03 04:08:07,942][model8_pretrain.py][INFO] Epoch:[0/2](99800/4588595) loss:2.934 lr:0.0000100 epoch_Time:28261.0min: [2024-01-03 04:08:07,942][model8_pretrain.py][INFO] Epoch:[0/2](99800/4588595) loss:3.197 lr:0.0000100 epoch_Time:28261.0min: [2024-01-03 04:08:07,942][model8_pretrain.py][INFO] Epoch:[0/2](99800/4588595) loss:3.470 lr:0.0000100 epoch_Time:28261.0min: [2024-01-03 04:08:07,942][model8_pretrain.py][INFO] Epoch:[0/2](99800/4588595) loss:2.993 lr:0.0000100 epoch_Time:28261.0min: [2024-01-03 04:08:07,942][model8_pretrain.py][INFO] Epoch:[0/2](99800/4588595) loss:2.117 lr:0.0000100 epoch_Time:28261.0min: [2024-01-03 
04:08:07,942][model8_pretrain.py][INFO] Epoch:[0/2](99800/4588595) loss:3.164 lr:0.0000100 epoch_Time:28261.0min: [2024-01-03 04:08:07,942][model8_pretrain.py][INFO] Epoch:[0/2](99800/4588595) loss:2.706 lr:0.0000100 epoch_Time:28261.0min: [2024-01-03 04:08:44,872][model8_pretrain.py][INFO] Epoch:[0/2](99900/4588595) loss:2.876 lr:0.0000100 epoch_Time:28260.0min: [2024-01-03 04:08:44,872][model8_pretrain.py][INFO] Epoch:[0/2](99900/4588595) loss:2.748 lr:0.0000100 epoch_Time:28260.0min: [2024-01-03 04:08:44,872][model8_pretrain.py][INFO] Epoch:[0/2](99900/4588595) loss:2.600 lr:0.0000100 epoch_Time:28260.0min: [2024-01-03 04:08:44,872][model8_pretrain.py][INFO] Epoch:[0/2](99900/4588595) loss:3.346 lr:0.0000100 epoch_Time:28260.0min: [2024-01-03 04:08:44,872][model8_pretrain.py][INFO] Epoch:[0/2](99900/4588595) loss:3.516 lr:0.0000100 epoch_Time:28260.0min: [2024-01-03 04:08:44,872][model8_pretrain.py][INFO] Epoch:[0/2](99900/4588595) loss:2.901 lr:0.0000100 epoch_Time:28260.0min: [2024-01-03 04:08:44,872][model8_pretrain.py][INFO] Epoch:[0/2](99900/4588595) loss:2.944 lr:0.0000100 epoch_Time:28260.0min: [2024-01-03 04:08:44,872][model8_pretrain.py][INFO] Epoch:[0/2](99900/4588595) loss:2.880 lr:0.0000100 epoch_Time:28260.0min: [2024-01-03 04:09:21,808][model8_pretrain.py][INFO] Epoch:[0/2](100000/4588595) loss:2.888 lr:0.0000100 epoch_Time:28259.0min: [2024-01-03 04:09:21,808][model8_pretrain.py][INFO] Epoch:[0/2](100000/4588595) loss:2.700 lr:0.0000100 epoch_Time:28259.0min: [2024-01-03 04:09:21,808][model8_pretrain.py][INFO] Epoch:[0/2](100000/4588595) loss:2.606 lr:0.0000100 epoch_Time:28259.0min: [2024-01-03 04:09:21,808][model8_pretrain.py][INFO] Epoch:[0/2](100000/4588595) loss:3.234 lr:0.0000100 epoch_Time:28259.0min: [2024-01-03 04:09:21,808][model8_pretrain.py][INFO] Epoch:[0/2](100000/4588595) loss:3.258 lr:0.0000100 epoch_Time:28259.0min: [2024-01-03 04:09:21,808][model8_pretrain.py][INFO] Epoch:[0/2](100000/4588595) loss:2.931 lr:0.0000100 
epoch_Time:28259.0min: [2024-01-03 04:09:21,808][model8_pretrain.py][INFO] Epoch:[0/2](100000/4588595) loss:2.655 lr:0.0000100 epoch_Time:28259.0min: [2024-01-03 04:09:21,808][model8_pretrain.py][INFO] Epoch:[0/2](100000/4588595) loss:3.060 lr:0.0000100 epoch_Time:28259.0min: [2024-01-03 04:10:07,348][model8_pretrain.py][INFO] Epoch:[0/2](100100/4588595) loss:3.202 lr:0.0000100 epoch_Time:28264.0min: [2024-01-03 04:10:07,348][model8_pretrain.py][INFO] Epoch:[0/2](100100/4588595) loss:2.322 lr:0.0000100 epoch_Time:28264.0min: [2024-01-03 04:10:07,348][model8_pretrain.py][INFO] Epoch:[0/2](100100/4588595) loss:3.147 lr:0.0000100 epoch_Time:28264.0min: [2024-01-03 04:10:07,348][model8_pretrain.py][INFO] Epoch:[0/2](100100/4588595) loss:3.409 lr:0.0000100 epoch_Time:28264.0min: [2024-01-03 04:10:07,348][model8_pretrain.py][INFO] Epoch:[0/2](100100/4588595) loss:2.518 lr:0.0000100 epoch_Time:28264.0min: [2024-01-03 04:10:07,348][model8_pretrain.py][INFO] Epoch:[0/2](100100/4588595) loss:2.958 lr:0.0000100 epoch_Time:28264.0min: [2024-01-03 04:10:07,348][model8_pretrain.py][INFO] Epoch:[0/2](100100/4588595) loss:3.019 lr:0.0000100 epoch_Time:28264.0min: [2024-01-03 04:10:07,348][model8_pretrain.py][INFO] Epoch:[0/2](100100/4588595) loss:2.629 lr:0.0000100 epoch_Time:28264.0min: [2024-01-03 04:10:44,272][model8_pretrain.py][INFO] Epoch:[0/2](100200/4588595) loss:3.158 lr:0.0000100 epoch_Time:28263.0min: [2024-01-03 04:10:44,272][model8_pretrain.py][INFO] Epoch:[0/2](100200/4588595) loss:3.098 lr:0.0000100 epoch_Time:28263.0min: [2024-01-03 04:10:44,272][model8_pretrain.py][INFO] Epoch:[0/2](100200/4588595) loss:3.163 lr:0.0000100 epoch_Time:28263.0min: [2024-01-03 04:10:44,272][model8_pretrain.py][INFO] Epoch:[0/2](100200/4588595) loss:3.285 lr:0.0000100 epoch_Time:28263.0min: [2024-01-03 04:10:44,272][model8_pretrain.py][INFO] Epoch:[0/2](100200/4588595) loss:3.011 lr:0.0000100 epoch_Time:28263.0min: [2024-01-03 04:10:44,272][model8_pretrain.py][INFO] 
Epoch:[0/2](100200/4588595) loss:2.890 lr:0.0000100 epoch_Time:28263.0min: [2024-01-03 04:10:44,272][model8_pretrain.py][INFO] Epoch:[0/2](100200/4588595) loss:2.654 lr:0.0000100 epoch_Time:28263.0min: [2024-01-03 04:10:44,272][model8_pretrain.py][INFO] Epoch:[0/2](100200/4588595) loss:2.858 lr:0.0000100 epoch_Time:28263.0min: [2024-01-03 04:11:21,205][model8_pretrain.py][INFO] Epoch:[0/2](100300/4588595) loss:3.428 lr:0.0000100 epoch_Time:28261.0min: [2024-01-03 04:11:21,205][model8_pretrain.py][INFO] Epoch:[0/2](100300/4588595) loss:2.980 lr:0.0000100 epoch_Time:28261.0min: [2024-01-03 04:11:21,205][model8_pretrain.py][INFO] Epoch:[0/2](100300/4588595) loss:3.109 lr:0.0000100 epoch_Time:28261.0min: [2024-01-03 04:11:21,205][model8_pretrain.py][INFO] Epoch:[0/2](100300/4588595) loss:2.571 lr:0.0000100 epoch_Time:28261.0min: [2024-01-03 04:11:21,206][model8_pretrain.py][INFO] Epoch:[0/2](100300/4588595) loss:2.912 lr:0.0000100 epoch_Time:28261.0min: [2024-01-03 04:11:21,205][model8_pretrain.py][INFO] Epoch:[0/2](100300/4588595) loss:3.439 lr:0.0000100 epoch_Time:28261.0min: [2024-01-03 04:11:21,206][model8_pretrain.py][INFO] Epoch:[0/2](100300/4588595) loss:3.040 lr:0.0000100 epoch_Time:28261.0min: [2024-01-03 04:11:21,206][model8_pretrain.py][INFO] Epoch:[0/2](100300/4588595) loss:3.284 lr:0.0000100 epoch_Time:28261.0min: [2024-01-03 04:11:58,142][model8_pretrain.py][INFO] Epoch:[0/2](100400/4588595) loss:2.759 lr:0.0000100 epoch_Time:28260.0min: [2024-01-03 04:11:58,142][model8_pretrain.py][INFO] Epoch:[0/2](100400/4588595) loss:3.438 lr:0.0000100 epoch_Time:28260.0min: [2024-01-03 04:11:58,142][model8_pretrain.py][INFO] Epoch:[0/2](100400/4588595) loss:2.805 lr:0.0000100 epoch_Time:28260.0min: [2024-01-03 04:11:58,142][model8_pretrain.py][INFO] Epoch:[0/2](100400/4588595) loss:3.544 lr:0.0000100 epoch_Time:28260.0min: [2024-01-03 04:11:58,142][model8_pretrain.py][INFO] Epoch:[0/2](100400/4588595) loss:3.010 lr:0.0000100 epoch_Time:28260.0min: [2024-01-03 
04:11:58,142][model8_pretrain.py][INFO] Epoch:[0/2](100400/4588595) loss:2.914 lr:0.0000100 epoch_Time:28260.0min: [2024-01-03 04:11:58,142][model8_pretrain.py][INFO] Epoch:[0/2](100400/4588595) loss:3.152 lr:0.0000100 epoch_Time:28260.0min: [2024-01-03 04:11:58,143][model8_pretrain.py][INFO] Epoch:[0/2](100400/4588595) loss:3.512 lr:0.0000100 epoch_Time:28260.0min: [2024-01-03 04:12:35,071][model8_pretrain.py][INFO] Epoch:[0/2](100500/4588595) loss:3.008 lr:0.0000100 epoch_Time:28259.0min: [2024-01-03 04:12:35,071][model8_pretrain.py][INFO] Epoch:[0/2](100500/4588595) loss:2.744 lr:0.0000100 epoch_Time:28259.0min: [2024-01-03 04:12:35,071][model8_pretrain.py][INFO] Epoch:[0/2](100500/4588595) loss:2.752 lr:0.0000100 epoch_Time:28259.0min: [2024-01-03 04:12:35,071][model8_pretrain.py][INFO] Epoch:[0/2](100500/4588595) loss:3.203 lr:0.0000100 epoch_Time:28259.0min: [2024-01-03 04:12:35,071][model8_pretrain.py][INFO] Epoch:[0/2](100500/4588595) loss:2.579 lr:0.0000100 epoch_Time:28259.0min: [2024-01-03 04:12:35,071][model8_pretrain.py][INFO] Epoch:[0/2](100500/4588595) loss:2.936 lr:0.0000100 epoch_Time:28259.0min: [2024-01-03 04:12:35,071][model8_pretrain.py][INFO] Epoch:[0/2](100500/4588595) loss:2.779 lr:0.0000100 epoch_Time:28259.0min: [2024-01-03 04:12:35,071][model8_pretrain.py][INFO] Epoch:[0/2](100500/4588595) loss:3.042 lr:0.0000100 epoch_Time:28259.0min: [2024-01-03 04:13:12,003][model8_pretrain.py][INFO] Epoch:[0/2](100600/4588595) loss:2.835 lr:0.0000100 epoch_Time:28257.0min: [2024-01-03 04:13:12,003][model8_pretrain.py][INFO] Epoch:[0/2](100600/4588595) loss:2.990 lr:0.0000100 epoch_Time:28257.0min: [2024-01-03 04:13:12,003][model8_pretrain.py][INFO] Epoch:[0/2](100600/4588595) loss:2.995 lr:0.0000100 epoch_Time:28257.0min: [2024-01-03 04:13:12,003][model8_pretrain.py][INFO] Epoch:[0/2](100600/4588595) loss:2.651 lr:0.0000100 epoch_Time:28257.0min: [2024-01-03 04:13:12,003][model8_pretrain.py][INFO] Epoch:[0/2](100600/4588595) loss:2.850 lr:0.0000100 
epoch_Time:28257.0min: [2024-01-03 04:13:12,003][model8_pretrain.py][INFO] Epoch:[0/2](100600/4588595) loss:2.289 lr:0.0000100 epoch_Time:28257.0min: [2024-01-03 04:13:12,003][model8_pretrain.py][INFO] Epoch:[0/2](100600/4588595) loss:2.285 lr:0.0000100 epoch_Time:28257.0min: [2024-01-03 04:13:12,003][model8_pretrain.py][INFO] Epoch:[0/2](100600/4588595) loss:2.798 lr:0.0000100 epoch_Time:28257.0min: [2024-01-03 04:13:48,926][model8_pretrain.py][INFO] Epoch:[0/2](100700/4588595) loss:3.096 lr:0.0000100 epoch_Time:28256.0min: [2024-01-03 04:13:48,926][model8_pretrain.py][INFO] Epoch:[0/2](100700/4588595) loss:3.196 lr:0.0000100 epoch_Time:28256.0min: [2024-01-03 04:13:48,926][model8_pretrain.py][INFO] Epoch:[0/2](100700/4588595) loss:2.921 lr:0.0000100 epoch_Time:28256.0min: [2024-01-03 04:13:48,926][model8_pretrain.py][INFO] Epoch:[0/2](100700/4588595) loss:3.079 lr:0.0000100 epoch_Time:28256.0min: [2024-01-03 04:13:48,926][model8_pretrain.py][INFO] Epoch:[0/2](100700/4588595) loss:2.895 lr:0.0000100 epoch_Time:28256.0min: [2024-01-03 04:13:48,926][model8_pretrain.py][INFO] Epoch:[0/2](100700/4588595) loss:3.150 lr:0.0000100 epoch_Time:28256.0min: [2024-01-03 04:13:48,926][model8_pretrain.py][INFO] Epoch:[0/2](100700/4588595) loss:3.363 lr:0.0000100 epoch_Time:28256.0min: [2024-01-03 04:13:48,926][model8_pretrain.py][INFO] Epoch:[0/2](100700/4588595) loss:3.038 lr:0.0000100 epoch_Time:28256.0min: [2024-01-03 04:14:25,853][model8_pretrain.py][INFO] Epoch:[0/2](100800/4588595) loss:3.262 lr:0.0000100 epoch_Time:28255.0min: [2024-01-03 04:14:25,853][model8_pretrain.py][INFO] Epoch:[0/2](100800/4588595) loss:3.189 lr:0.0000100 epoch_Time:28255.0min: [2024-01-03 04:14:25,853][model8_pretrain.py][INFO] Epoch:[0/2](100800/4588595) loss:3.132 lr:0.0000100 epoch_Time:28255.0min: [2024-01-03 04:14:25,853][model8_pretrain.py][INFO] Epoch:[0/2](100800/4588595) loss:3.124 lr:0.0000100 epoch_Time:28255.0min: [2024-01-03 04:14:25,853][model8_pretrain.py][INFO] 
Epoch:[0/2](100800/4588595) loss:2.720 lr:0.0000100 epoch_Time:28255.0min: [2024-01-03 04:14:25,853][model8_pretrain.py][INFO] Epoch:[0/2](100800/4588595) loss:2.992 lr:0.0000100 epoch_Time:28255.0min: [2024-01-03 04:14:25,853][model8_pretrain.py][INFO] Epoch:[0/2](100800/4588595) loss:2.245 lr:0.0000100 epoch_Time:28255.0min: [2024-01-03 04:14:25,853][model8_pretrain.py][INFO] Epoch:[0/2](100800/4588595) loss:3.079 lr:0.0000100 epoch_Time:28255.0min: [2024-01-03 04:15:11,480][model8_pretrain.py][INFO] Epoch:[0/2](100900/4588595) loss:3.057 lr:0.0000100 epoch_Time:28260.0min: [2024-01-03 04:15:11,480][model8_pretrain.py][INFO] Epoch:[0/2](100900/4588595) loss:3.025 lr:0.0000100 epoch_Time:28260.0min: [2024-01-03 04:15:11,480][model8_pretrain.py][INFO] Epoch:[0/2](100900/4588595) loss:3.071 lr:0.0000100 epoch_Time:28260.0min: [2024-01-03 04:15:11,480][model8_pretrain.py][INFO] Epoch:[0/2](100900/4588595) loss:3.106 lr:0.0000100 epoch_Time:28260.0min: [2024-01-03 04:15:11,480][model8_pretrain.py][INFO] Epoch:[0/2](100900/4588595) loss:2.863 lr:0.0000100 epoch_Time:28260.0min: [2024-01-03 04:15:11,480][model8_pretrain.py][INFO] Epoch:[0/2](100900/4588595) loss:2.970 lr:0.0000100 epoch_Time:28260.0min: [2024-01-03 04:15:11,480][model8_pretrain.py][INFO] Epoch:[0/2](100900/4588595) loss:3.088 lr:0.0000100 epoch_Time:28260.0min: [2024-01-03 04:15:11,480][model8_pretrain.py][INFO] Epoch:[0/2](100900/4588595) loss:3.398 lr:0.0000100 epoch_Time:28260.0min: [2024-01-03 04:15:48,410][model8_pretrain.py][INFO] Epoch:[0/2](101000/4588595) loss:3.149 lr:0.0000100 epoch_Time:28258.0min: [2024-01-03 04:15:48,410][model8_pretrain.py][INFO] Epoch:[0/2](101000/4588595) loss:2.947 lr:0.0000100 epoch_Time:28258.0min: [2024-01-03 04:15:48,410][model8_pretrain.py][INFO] Epoch:[0/2](101000/4588595) loss:2.980 lr:0.0000100 epoch_Time:28258.0min: [2024-01-03 04:15:48,410][model8_pretrain.py][INFO] Epoch:[0/2](101000/4588595) loss:2.659 lr:0.0000100 epoch_Time:28258.0min: [2024-01-03 
04:15:48,410][model8_pretrain.py][INFO] Epoch:[0/2](101000/4588595) loss:3.288 lr:0.0000100 epoch_Time:28258.0min: [2024-01-03 04:15:48,410][model8_pretrain.py][INFO] Epoch:[0/2](101000/4588595) loss:3.237 lr:0.0000100 epoch_Time:28258.0min: [2024-01-03 04:15:48,410][model8_pretrain.py][INFO] Epoch:[0/2](101000/4588595) loss:3.653 lr:0.0000100 epoch_Time:28258.0min: [2024-01-03 04:15:48,410][model8_pretrain.py][INFO] Epoch:[0/2](101000/4588595) loss:2.999 lr:0.0000100 epoch_Time:28258.0min: [2024-01-03 04:16:25,339][model8_pretrain.py][INFO] Epoch:[0/2](101100/4588595) loss:2.872 lr:0.0000100 epoch_Time:28258.0min: [2024-01-03 04:16:25,339][model8_pretrain.py][INFO] Epoch:[0/2](101100/4588595) loss:3.213 lr:0.0000100 epoch_Time:28258.0min: [2024-01-03 04:16:25,339][model8_pretrain.py][INFO] Epoch:[0/2](101100/4588595) loss:3.207 lr:0.0000100 epoch_Time:28258.0min: [2024-01-03 04:16:25,339][model8_pretrain.py][INFO] Epoch:[0/2](101100/4588595) loss:3.013 lr:0.0000100 epoch_Time:28258.0min: [2024-01-03 04:16:25,339][model8_pretrain.py][INFO] Epoch:[0/2](101100/4588595) loss:3.355 lr:0.0000100 epoch_Time:28258.0min: [2024-01-03 04:16:25,339][model8_pretrain.py][INFO] Epoch:[0/2](101100/4588595) loss:2.960 lr:0.0000100 epoch_Time:28258.0min: [2024-01-03 04:16:25,339][model8_pretrain.py][INFO] Epoch:[0/2](101100/4588595) loss:2.712 lr:0.0000100 epoch_Time:28258.0min: [2024-01-03 04:16:25,339][model8_pretrain.py][INFO] Epoch:[0/2](101100/4588595) loss:3.244 lr:0.0000100 epoch_Time:28258.0min: [2024-01-03 04:17:02,275][model8_pretrain.py][INFO] Epoch:[0/2](101200/4588595) loss:3.352 lr:0.0000100 epoch_Time:28256.0min: [2024-01-03 04:17:02,275][model8_pretrain.py][INFO] Epoch:[0/2](101200/4588595) loss:2.360 lr:0.0000100 epoch_Time:28256.0min: [2024-01-03 04:17:02,275][model8_pretrain.py][INFO] Epoch:[0/2](101200/4588595) loss:2.646 lr:0.0000100 epoch_Time:28256.0min: [2024-01-03 04:17:02,275][model8_pretrain.py][INFO] Epoch:[0/2](101200/4588595) loss:3.379 lr:0.0000100 
epoch_Time:28256.0min: [2024-01-03 04:17:02,275][model8_pretrain.py][INFO] Epoch:[0/2](101200/4588595) loss:3.164 lr:0.0000100 epoch_Time:28256.0min: [2024-01-03 04:17:02,275][model8_pretrain.py][INFO] Epoch:[0/2](101200/4588595) loss:2.877 lr:0.0000100 epoch_Time:28256.0min: [2024-01-03 04:17:02,275][model8_pretrain.py][INFO] Epoch:[0/2](101200/4588595) loss:3.224 lr:0.0000100 epoch_Time:28256.0min: [2024-01-03 04:17:02,276][model8_pretrain.py][INFO] Epoch:[0/2](101200/4588595) loss:3.049 lr:0.0000100 epoch_Time:28256.0min: [2024-01-03 04:17:39,215][model8_pretrain.py][INFO] Epoch:[0/2](101300/4588595) loss:2.992 lr:0.0000100 epoch_Time:28255.0min: [2024-01-03 04:17:39,215][model8_pretrain.py][INFO] Epoch:[0/2](101300/4588595) loss:3.324 lr:0.0000100 epoch_Time:28255.0min: [2024-01-03 04:17:39,215][model8_pretrain.py][INFO] Epoch:[0/2](101300/4588595) loss:3.403 lr:0.0000100 epoch_Time:28255.0min: [2024-01-03 04:17:39,215][model8_pretrain.py][INFO] Epoch:[0/2](101300/4588595) loss:2.879 lr:0.0000100 epoch_Time:28255.0min: [2024-01-03 04:17:39,215][model8_pretrain.py][INFO] Epoch:[0/2](101300/4588595) loss:2.822 lr:0.0000100 epoch_Time:28255.0min: [2024-01-03 04:17:39,215][model8_pretrain.py][INFO] Epoch:[0/2](101300/4588595) loss:3.230 lr:0.0000100 epoch_Time:28255.0min: [2024-01-03 04:17:39,216][model8_pretrain.py][INFO] Epoch:[0/2](101300/4588595) loss:3.119 lr:0.0000100 epoch_Time:28255.0min: [2024-01-03 04:17:39,216][model8_pretrain.py][INFO] Epoch:[0/2](101300/4588595) loss:2.636 lr:0.0000100 epoch_Time:28255.0min: [2024-01-03 04:18:16,149][model8_pretrain.py][INFO] Epoch:[0/2](101400/4588595) loss:3.181 lr:0.0000100 epoch_Time:28254.0min: [2024-01-03 04:18:16,149][model8_pretrain.py][INFO] Epoch:[0/2](101400/4588595) loss:3.141 lr:0.0000100 epoch_Time:28254.0min: [2024-01-03 04:18:16,149][model8_pretrain.py][INFO] Epoch:[0/2](101400/4588595) loss:2.892 lr:0.0000100 epoch_Time:28254.0min: [2024-01-03 04:18:16,149][model8_pretrain.py][INFO] 
Epoch:[0/2](101400/4588595) loss:3.083 lr:0.0000100 epoch_Time:28254.0min: [2024-01-03 04:18:16,149][model8_pretrain.py][INFO] Epoch:[0/2](101400/4588595) loss:2.582 lr:0.0000100 epoch_Time:28254.0min: [2024-01-03 04:18:16,149][model8_pretrain.py][INFO] Epoch:[0/2](101400/4588595) loss:2.424 lr:0.0000100 epoch_Time:28254.0min: [2024-01-03 04:18:16,149][model8_pretrain.py][INFO] Epoch:[0/2](101400/4588595) loss:2.832 lr:0.0000100 epoch_Time:28254.0min: [2024-01-03 04:18:16,149][model8_pretrain.py][INFO] Epoch:[0/2](101400/4588595) loss:2.778 lr:0.0000100 epoch_Time:28254.0min: [2024-01-03 04:18:53,067][model8_pretrain.py][INFO] Epoch:[0/2](101500/4588595) loss:2.485 lr:0.0000100 epoch_Time:28252.0min: [2024-01-03 04:18:53,067][model8_pretrain.py][INFO] Epoch:[0/2](101500/4588595) loss:2.871 lr:0.0000100 epoch_Time:28252.0min: [2024-01-03 04:18:53,068][model8_pretrain.py][INFO] Epoch:[0/2](101500/4588595) loss:3.351 lr:0.0000100 epoch_Time:28252.0min: [2024-01-03 04:18:53,068][model8_pretrain.py][INFO] Epoch:[0/2](101500/4588595) loss:3.243 lr:0.0000100 epoch_Time:28252.0min: [2024-01-03 04:18:53,068][model8_pretrain.py][INFO] Epoch:[0/2](101500/4588595) loss:3.015 lr:0.0000100 epoch_Time:28252.0min: [2024-01-03 04:18:53,068][model8_pretrain.py][INFO] Epoch:[0/2](101500/4588595) loss:2.966 lr:0.0000100 epoch_Time:28252.0min: [2024-01-03 04:18:53,068][model8_pretrain.py][INFO] Epoch:[0/2](101500/4588595) loss:2.805 lr:0.0000100 epoch_Time:28252.0min: [2024-01-03 04:18:53,068][model8_pretrain.py][INFO] Epoch:[0/2](101500/4588595) loss:3.325 lr:0.0000100 epoch_Time:28252.0min: [2024-01-03 04:19:30,009][model8_pretrain.py][INFO] Epoch:[0/2](101600/4588595) loss:3.090 lr:0.0000100 epoch_Time:28252.0min: [2024-01-03 04:19:30,009][model8_pretrain.py][INFO] Epoch:[0/2](101600/4588595) loss:3.458 lr:0.0000100 epoch_Time:28251.0min: [2024-01-03 04:19:30,009][model8_pretrain.py][INFO] Epoch:[0/2](101600/4588595) loss:2.967 lr:0.0000100 epoch_Time:28251.0min: [2024-01-03 
04:19:30,009][model8_pretrain.py][INFO] Epoch:[0/2](101600/4588595) loss:3.369 lr:0.0000100 epoch_Time:28252.0min: [2024-01-03 04:19:30,009][model8_pretrain.py][INFO] Epoch:[0/2](101600/4588595) loss:3.107 lr:0.0000100 epoch_Time:28252.0min: [2024-01-03 04:19:30,009][model8_pretrain.py][INFO] Epoch:[0/2](101600/4588595) loss:2.868 lr:0.0000100 epoch_Time:28252.0min: [2024-01-03 04:19:30,009][model8_pretrain.py][INFO] Epoch:[0/2](101600/4588595) loss:3.416 lr:0.0000100 epoch_Time:28252.0min: [2024-01-03 04:19:30,009][model8_pretrain.py][INFO] Epoch:[0/2](101600/4588595) loss:2.891 lr:0.0000100 epoch_Time:28252.0min: [2024-01-03 04:20:15,702][model8_pretrain.py][INFO] Epoch:[0/2](101700/4588595) loss:3.360 lr:0.0000100 epoch_Time:28256.0min: [2024-01-03 04:20:15,702][model8_pretrain.py][INFO] Epoch:[0/2](101700/4588595) loss:3.008 lr:0.0000100 epoch_Time:28256.0min: [2024-01-03 04:20:15,702][model8_pretrain.py][INFO] Epoch:[0/2](101700/4588595) loss:3.120 lr:0.0000100 epoch_Time:28256.0min: [2024-01-03 04:20:15,702][model8_pretrain.py][INFO] Epoch:[0/2](101700/4588595) loss:2.825 lr:0.0000100 epoch_Time:28256.0min: [2024-01-03 04:20:15,702][model8_pretrain.py][INFO] Epoch:[0/2](101700/4588595) loss:3.353 lr:0.0000100 epoch_Time:28256.0min: [2024-01-03 04:20:15,702][model8_pretrain.py][INFO] Epoch:[0/2](101700/4588595) loss:2.583 lr:0.0000100 epoch_Time:28256.0min: [2024-01-03 04:20:15,702][model8_pretrain.py][INFO] Epoch:[0/2](101700/4588595) loss:2.422 lr:0.0000100 epoch_Time:28256.0min: [2024-01-03 04:20:15,702][model8_pretrain.py][INFO] Epoch:[0/2](101700/4588595) loss:3.146 lr:0.0000100 epoch_Time:28256.0min: [2024-01-03 04:20:52,631][model8_pretrain.py][INFO] Epoch:[0/2](101800/4588595) loss:2.534 lr:0.0000100 epoch_Time:28255.0min: [2024-01-03 04:20:52,631][model8_pretrain.py][INFO] Epoch:[0/2](101800/4588595) loss:2.865 lr:0.0000100 epoch_Time:28255.0min: [2024-01-03 04:20:52,631][model8_pretrain.py][INFO] Epoch:[0/2](101800/4588595) loss:3.138 lr:0.0000100 
epoch_Time:28255.0min: [2024-01-03 04:20:52,631][model8_pretrain.py][INFO] Epoch:[0/2](101800/4588595) loss:2.990 lr:0.0000100 epoch_Time:28255.0min: [2024-01-03 04:20:52,631][model8_pretrain.py][INFO] Epoch:[0/2](101800/4588595) loss:3.235 lr:0.0000100 epoch_Time:28255.0min: [2024-01-03 04:20:52,631][model8_pretrain.py][INFO] Epoch:[0/2](101800/4588595) loss:3.266 lr:0.0000100 epoch_Time:28255.0min: [2024-01-03 04:20:52,631][model8_pretrain.py][INFO] Epoch:[0/2](101800/4588595) loss:2.870 lr:0.0000100 epoch_Time:28255.0min: [2024-01-03 04:20:52,631][model8_pretrain.py][INFO] Epoch:[0/2](101800/4588595) loss:3.048 lr:0.0000100 epoch_Time:28255.0min: [2024-01-03 04:21:29,571][model8_pretrain.py][INFO] Epoch:[0/2](101900/4588595) loss:2.972 lr:0.0000100 epoch_Time:28254.0min: [2024-01-03 04:21:29,571][model8_pretrain.py][INFO] Epoch:[0/2](101900/4588595) loss:2.771 lr:0.0000100 epoch_Time:28254.0min: [2024-01-03 04:21:29,571][model8_pretrain.py][INFO] Epoch:[0/2](101900/4588595) loss:2.975 lr:0.0000100 epoch_Time:28254.0min: [2024-01-03 04:21:29,571][model8_pretrain.py][INFO] Epoch:[0/2](101900/4588595) loss:2.976 lr:0.0000100 epoch_Time:28254.0min: [2024-01-03 04:21:29,571][model8_pretrain.py][INFO] Epoch:[0/2](101900/4588595) loss:2.413 lr:0.0000100 epoch_Time:28254.0min: [2024-01-03 04:21:29,571][model8_pretrain.py][INFO] Epoch:[0/2](101900/4588595) loss:3.356 lr:0.0000100 epoch_Time:28254.0min: [2024-01-03 04:21:29,571][model8_pretrain.py][INFO] Epoch:[0/2](101900/4588595) loss:2.665 lr:0.0000100 epoch_Time:28254.0min: [2024-01-03 04:21:29,571][model8_pretrain.py][INFO] Epoch:[0/2](101900/4588595) loss:3.044 lr:0.0000100 epoch_Time:28254.0min: [2024-01-03 04:22:06,528][model8_pretrain.py][INFO] Epoch:[0/2](102000/4588595) loss:3.058 lr:0.0000100 epoch_Time:28253.0min: [2024-01-03 04:22:06,528][model8_pretrain.py][INFO] Epoch:[0/2](102000/4588595) loss:2.494 lr:0.0000100 epoch_Time:28253.0min: [2024-01-03 04:22:06,528][model8_pretrain.py][INFO] 
Epoch:[0/2](102000/4588595) loss:3.100 lr:0.0000100 epoch_Time:28253.0min: [2024-01-03 04:22:06,528][model8_pretrain.py][INFO] Epoch:[0/2](102000/4588595) loss:3.133 lr:0.0000100 epoch_Time:28253.0min: [2024-01-03 04:22:06,528][model8_pretrain.py][INFO] Epoch:[0/2](102000/4588595) loss:2.915 lr:0.0000100 epoch_Time:28253.0min: [2024-01-03 04:22:06,528][model8_pretrain.py][INFO] Epoch:[0/2](102000/4588595) loss:2.818 lr:0.0000100 epoch_Time:28253.0min: [2024-01-03 04:22:06,528][model8_pretrain.py][INFO] Epoch:[0/2](102000/4588595) loss:3.310 lr:0.0000100 epoch_Time:28253.0min: [2024-01-03 04:22:06,528][model8_pretrain.py][INFO] Epoch:[0/2](102000/4588595) loss:2.858 lr:0.0000100 epoch_Time:28253.0min: [2024-01-03 04:22:43,459][model8_pretrain.py][INFO] Epoch:[0/2](102100/4588595) loss:2.600 lr:0.0000100 epoch_Time:28252.0min: [2024-01-03 04:22:43,459][model8_pretrain.py][INFO] Epoch:[0/2](102100/4588595) loss:2.745 lr:0.0000100 epoch_Time:28252.0min: [2024-01-03 04:22:43,459][model8_pretrain.py][INFO] Epoch:[0/2](102100/4588595) loss:2.719 lr:0.0000100 epoch_Time:28252.0min: [2024-01-03 04:22:43,459][model8_pretrain.py][INFO] Epoch:[0/2](102100/4588595) loss:3.442 lr:0.0000100 epoch_Time:28252.0min: [2024-01-03 04:22:43,459][model8_pretrain.py][INFO] Epoch:[0/2](102100/4588595) loss:3.323 lr:0.0000100 epoch_Time:28252.0min: [2024-01-03 04:22:43,459][model8_pretrain.py][INFO] Epoch:[0/2](102100/4588595) loss:3.119 lr:0.0000100 epoch_Time:28252.0min: [2024-01-03 04:22:43,459][model8_pretrain.py][INFO] Epoch:[0/2](102100/4588595) loss:2.857 lr:0.0000100 epoch_Time:28252.0min: [2024-01-03 04:22:43,460][model8_pretrain.py][INFO] Epoch:[0/2](102100/4588595) loss:3.355 lr:0.0000100 epoch_Time:28252.0min: [2024-01-03 04:23:20,390][model8_pretrain.py][INFO] Epoch:[0/2](102200/4588595) loss:2.589 lr:0.0000100 epoch_Time:28250.0min: [2024-01-03 04:23:20,390][model8_pretrain.py][INFO] Epoch:[0/2](102200/4588595) loss:3.174 lr:0.0000100 epoch_Time:28250.0min: [2024-01-03 
04:23:20,390][model8_pretrain.py][INFO] Epoch:[0/2](102200/4588595) loss:3.309 lr:0.0000100 epoch_Time:28250.0min: [2024-01-03 04:23:20,390][model8_pretrain.py][INFO] Epoch:[0/2](102200/4588595) loss:2.447 lr:0.0000100 epoch_Time:28250.0min: [2024-01-03 04:23:20,390][model8_pretrain.py][INFO] Epoch:[0/2](102200/4588595) loss:3.325 lr:0.0000100 epoch_Time:28250.0min: [2024-01-03 04:23:20,390][model8_pretrain.py][INFO] Epoch:[0/2](102200/4588595) loss:2.757 lr:0.0000100 epoch_Time:28250.0min: [2024-01-03 04:23:20,390][model8_pretrain.py][INFO] Epoch:[0/2](102200/4588595) loss:3.374 lr:0.0000100 epoch_Time:28250.0min: [2024-01-03 04:23:20,390][model8_pretrain.py][INFO] Epoch:[0/2](102200/4588595) loss:2.776 lr:0.0000100 epoch_Time:28250.0min: [2024-01-03 04:23:57,327][model8_pretrain.py][INFO] Epoch:[0/2](102300/4588595) loss:3.509 lr:0.0000100 epoch_Time:28249.0min: [2024-01-03 04:23:57,327][model8_pretrain.py][INFO] Epoch:[0/2](102300/4588595) loss:3.350 lr:0.0000100 epoch_Time:28249.0min: [2024-01-03 04:23:57,327][model8_pretrain.py][INFO] Epoch:[0/2](102300/4588595) loss:3.336 lr:0.0000100 epoch_Time:28249.0min: [2024-01-03 04:23:57,327][model8_pretrain.py][INFO] Epoch:[0/2](102300/4588595) loss:3.251 lr:0.0000100 epoch_Time:28249.0min: [2024-01-03 04:23:57,327][model8_pretrain.py][INFO] Epoch:[0/2](102300/4588595) loss:3.034 lr:0.0000100 epoch_Time:28249.0min: [2024-01-03 04:23:57,327][model8_pretrain.py][INFO] Epoch:[0/2](102300/4588595) loss:2.914 lr:0.0000100 epoch_Time:28249.0min: [2024-01-03 04:23:57,327][model8_pretrain.py][INFO] Epoch:[0/2](102300/4588595) loss:3.079 lr:0.0000100 epoch_Time:28249.0min: [2024-01-03 04:23:57,327][model8_pretrain.py][INFO] Epoch:[0/2](102300/4588595) loss:2.678 lr:0.0000100 epoch_Time:28249.0min: [2024-01-03 04:24:34,265][model8_pretrain.py][INFO] Epoch:[0/2](102400/4588595) loss:2.968 lr:0.0000100 epoch_Time:28248.0min: [2024-01-03 04:24:34,265][model8_pretrain.py][INFO] Epoch:[0/2](102400/4588595) loss:2.614 lr:0.0000100 
epoch_Time:28248.0min: [2024-01-03 04:24:34,265][model8_pretrain.py][INFO] Epoch:[0/2](102400/4588595) loss:2.984 lr:0.0000100 epoch_Time:28248.0min: [2024-01-03 04:24:34,265][model8_pretrain.py][INFO] Epoch:[0/2](102400/4588595) loss:2.567 lr:0.0000100 epoch_Time:28248.0min: [2024-01-03 04:24:34,265][model8_pretrain.py][INFO] Epoch:[0/2](102400/4588595) loss:2.741 lr:0.0000100 epoch_Time:28248.0min: [2024-01-03 04:24:34,265][model8_pretrain.py][INFO] Epoch:[0/2](102400/4588595) loss:3.171 lr:0.0000100 epoch_Time:28248.0min: [2024-01-03 04:24:34,265][model8_pretrain.py][INFO] Epoch:[0/2](102400/4588595) loss:2.540 lr:0.0000100 epoch_Time:28248.0min: [2024-01-03 04:24:34,266][model8_pretrain.py][INFO] Epoch:[0/2](102400/4588595) loss:2.481 lr:0.0000100 epoch_Time:28248.0min: [2024-01-03 04:25:19,919][model8_pretrain.py][INFO] Epoch:[0/2](102500/4588595) loss:2.822 lr:0.0000100 epoch_Time:28253.0min: [2024-01-03 04:25:19,919][model8_pretrain.py][INFO] Epoch:[0/2](102500/4588595) loss:3.233 lr:0.0000100 epoch_Time:28253.0min: [2024-01-03 04:25:19,919][model8_pretrain.py][INFO] Epoch:[0/2](102500/4588595) loss:2.997 lr:0.0000100 epoch_Time:28253.0min: [2024-01-03 04:25:19,919][model8_pretrain.py][INFO] Epoch:[0/2](102500/4588595) loss:2.861 lr:0.0000100 epoch_Time:28253.0min: [2024-01-03 04:25:19,919][model8_pretrain.py][INFO] Epoch:[0/2](102500/4588595) loss:2.881 lr:0.0000100 epoch_Time:28253.0min: [2024-01-03 04:25:19,919][model8_pretrain.py][INFO] Epoch:[0/2](102500/4588595) loss:2.986 lr:0.0000100 epoch_Time:28253.0min: [2024-01-03 04:25:19,919][model8_pretrain.py][INFO] Epoch:[0/2](102500/4588595) loss:2.780 lr:0.0000100 epoch_Time:28253.0min: [2024-01-03 04:25:19,919][model8_pretrain.py][INFO] Epoch:[0/2](102500/4588595) loss:2.908 lr:0.0000100 epoch_Time:28253.0min: [2024-01-03 04:25:56,860][model8_pretrain.py][INFO] Epoch:[0/2](102600/4588595) loss:3.081 lr:0.0000100 epoch_Time:28251.0min: [2024-01-03 04:25:56,860][model8_pretrain.py][INFO] 
Epoch:[0/2](102600/4588595) loss:2.919 lr:0.0000100 epoch_Time:28251.0min: [2024-01-03 04:25:56,861][model8_pretrain.py][INFO] Epoch:[0/2](102600/4588595) loss:2.163 lr:0.0000100 epoch_Time:28251.0min: [2024-01-03 04:25:56,860][model8_pretrain.py][INFO] Epoch:[0/2](102600/4588595) loss:2.570 lr:0.0000100 epoch_Time:28251.0min: [2024-01-03 04:25:56,861][model8_pretrain.py][INFO] Epoch:[0/2](102600/4588595) loss:3.034 lr:0.0000100 epoch_Time:28251.0min: [2024-01-03 04:25:56,861][model8_pretrain.py][INFO] Epoch:[0/2](102600/4588595) loss:3.223 lr:0.0000100 epoch_Time:28251.0min: [2024-01-03 04:25:56,861][model8_pretrain.py][INFO] Epoch:[0/2](102600/4588595) loss:3.100 lr:0.0000100 epoch_Time:28251.0min: [2024-01-03 04:25:56,861][model8_pretrain.py][INFO] Epoch:[0/2](102600/4588595) loss:3.206 lr:0.0000100 epoch_Time:28251.0min: [2024-01-03 04:26:33,804][model8_pretrain.py][INFO] Epoch:[0/2](102700/4588595) loss:3.442 lr:0.0000100 epoch_Time:28251.0min: [2024-01-03 04:26:33,804][model8_pretrain.py][INFO] Epoch:[0/2](102700/4588595) loss:3.158 lr:0.0000100 epoch_Time:28251.0min: [2024-01-03 04:26:33,804][model8_pretrain.py][INFO] Epoch:[0/2](102700/4588595) loss:3.163 lr:0.0000100 epoch_Time:28251.0min: [2024-01-03 04:26:33,804][model8_pretrain.py][INFO] Epoch:[0/2](102700/4588595) loss:2.805 lr:0.0000100 epoch_Time:28251.0min: [2024-01-03 04:26:33,804][model8_pretrain.py][INFO] Epoch:[0/2](102700/4588595) loss:3.257 lr:0.0000100 epoch_Time:28251.0min: [2024-01-03 04:26:33,804][model8_pretrain.py][INFO] Epoch:[0/2](102700/4588595) loss:2.922 lr:0.0000100 epoch_Time:28251.0min: [2024-01-03 04:26:33,804][model8_pretrain.py][INFO] Epoch:[0/2](102700/4588595) loss:2.967 lr:0.0000100 epoch_Time:28251.0min: [2024-01-03 04:26:33,804][model8_pretrain.py][INFO] Epoch:[0/2](102700/4588595) loss:3.275 lr:0.0000100 epoch_Time:28251.0min: [2024-01-03 04:27:10,737][model8_pretrain.py][INFO] Epoch:[0/2](102800/4588595) loss:2.710 lr:0.0000100 epoch_Time:28249.0min: [2024-01-03 
04:27:10,737][model8_pretrain.py][INFO] Epoch:[0/2](102800/4588595) loss:3.202 lr:0.0000100 epoch_Time:28249.0min: [2024-01-03 04:27:10,737][model8_pretrain.py][INFO] Epoch:[0/2](102800/4588595) loss:2.852 lr:0.0000100 epoch_Time:28249.0min: [2024-01-03 04:27:10,737][model8_pretrain.py][INFO] Epoch:[0/2](102800/4588595) loss:2.801 lr:0.0000100 epoch_Time:28249.0min: [2024-01-03 04:27:10,737][model8_pretrain.py][INFO] Epoch:[0/2](102800/4588595) loss:2.951 lr:0.0000100 epoch_Time:28249.0min: [2024-01-03 04:27:10,737][model8_pretrain.py][INFO] Epoch:[0/2](102800/4588595) loss:2.969 lr:0.0000100 epoch_Time:28249.0min: [2024-01-03 04:27:10,737][model8_pretrain.py][INFO] Epoch:[0/2](102800/4588595) loss:2.882 lr:0.0000100 epoch_Time:28249.0min: [2024-01-03 04:27:10,737][model8_pretrain.py][INFO] Epoch:[0/2](102800/4588595) loss:2.852 lr:0.0000100 epoch_Time:28249.0min: [2024-01-03 04:27:47,674][model8_pretrain.py][INFO] Epoch:[0/2](102900/4588595) loss:3.145 lr:0.0000100 epoch_Time:28247.0min: [2024-01-03 04:27:47,674][model8_pretrain.py][INFO] Epoch:[0/2](102900/4588595) loss:2.836 lr:0.0000100 epoch_Time:28247.0min: [2024-01-03 04:27:47,674][model8_pretrain.py][INFO] Epoch:[0/2](102900/4588595) loss:2.812 lr:0.0000100 epoch_Time:28247.0min: [2024-01-03 04:27:47,674][model8_pretrain.py][INFO] Epoch:[0/2](102900/4588595) loss:2.917 lr:0.0000100 epoch_Time:28247.0min: [2024-01-03 04:27:47,674][model8_pretrain.py][INFO] Epoch:[0/2](102900/4588595) loss:2.664 lr:0.0000100 epoch_Time:28247.0min: [2024-01-03 04:27:47,674][model8_pretrain.py][INFO] Epoch:[0/2](102900/4588595) loss:2.996 lr:0.0000100 epoch_Time:28247.0min: [2024-01-03 04:27:47,674][model8_pretrain.py][INFO] Epoch:[0/2](102900/4588595) loss:3.194 lr:0.0000100 epoch_Time:28247.0min: [2024-01-03 04:27:47,674][model8_pretrain.py][INFO] Epoch:[0/2](102900/4588595) loss:2.711 lr:0.0000100 epoch_Time:28247.0min: [2024-01-03 04:28:24,616][model8_pretrain.py][INFO] Epoch:[0/2](103000/4588595) loss:3.205 lr:0.0000100 
epoch_Time:28247.0min: [2024-01-03 04:28:24,616][model8_pretrain.py][INFO] Epoch:[0/2](103000/4588595) loss:2.705 lr:0.0000100 epoch_Time:28247.0min: [2024-01-03 04:28:24,616][model8_pretrain.py][INFO] Epoch:[0/2](103000/4588595) loss:2.783 lr:0.0000100 epoch_Time:28247.0min: [2024-01-03 04:28:24,616][model8_pretrain.py][INFO] Epoch:[0/2](103000/4588595) loss:3.285 lr:0.0000100 epoch_Time:28247.0min: [2024-01-03 04:28:24,616][model8_pretrain.py][INFO] Epoch:[0/2](103000/4588595) loss:3.157 lr:0.0000100 epoch_Time:28247.0min: [2024-01-03 04:28:24,616][model8_pretrain.py][INFO] Epoch:[0/2](103000/4588595) loss:3.240 lr:0.0000100 epoch_Time:28247.0min: [2024-01-03 04:28:24,616][model8_pretrain.py][INFO] Epoch:[0/2](103000/4588595) loss:2.848 lr:0.0000100 epoch_Time:28247.0min: [2024-01-03 04:28:24,617][model8_pretrain.py][INFO] Epoch:[0/2](103000/4588595) loss:3.187 lr:0.0000100 epoch_Time:28247.0min: [2024-01-03 04:29:01,551][model8_pretrain.py][INFO] Epoch:[0/2](103100/4588595) loss:3.234 lr:0.0000100 epoch_Time:28245.0min: [2024-01-03 04:29:01,551][model8_pretrain.py][INFO] Epoch:[0/2](103100/4588595) loss:2.851 lr:0.0000100 epoch_Time:28245.0min: [2024-01-03 04:29:01,551][model8_pretrain.py][INFO] Epoch:[0/2](103100/4588595) loss:3.118 lr:0.0000100 epoch_Time:28245.0min: [2024-01-03 04:29:01,551][model8_pretrain.py][INFO] Epoch:[0/2](103100/4588595) loss:2.901 lr:0.0000100 epoch_Time:28245.0min: [2024-01-03 04:29:01,551][model8_pretrain.py][INFO] Epoch:[0/2](103100/4588595) loss:2.761 lr:0.0000100 epoch_Time:28245.0min: [2024-01-03 04:29:01,551][model8_pretrain.py][INFO] Epoch:[0/2](103100/4588595) loss:2.936 lr:0.0000100 epoch_Time:28245.0min: [2024-01-03 04:29:01,551][model8_pretrain.py][INFO] Epoch:[0/2](103100/4588595) loss:2.400 lr:0.0000100 epoch_Time:28245.0min: [2024-01-03 04:29:01,551][model8_pretrain.py][INFO] Epoch:[0/2](103100/4588595) loss:3.234 lr:0.0000100 epoch_Time:28245.0min: [2024-01-03 04:29:38,488][model8_pretrain.py][INFO] 
Epoch:[0/2](103200/4588595) loss:3.239 lr:0.0000100 epoch_Time:28244.0min: [2024-01-03 04:29:38,488][model8_pretrain.py][INFO] Epoch:[0/2](103200/4588595) loss:2.956 lr:0.0000100 epoch_Time:28245.0min: [2024-01-03 04:29:38,488][model8_pretrain.py][INFO] Epoch:[0/2](103200/4588595) loss:2.525 lr:0.0000100 epoch_Time:28245.0min: [2024-01-03 04:29:38,488][model8_pretrain.py][INFO] Epoch:[0/2](103200/4588595) loss:3.156 lr:0.0000100 epoch_Time:28244.0min: [2024-01-03 04:29:38,488][model8_pretrain.py][INFO] Epoch:[0/2](103200/4588595) loss:3.209 lr:0.0000100 epoch_Time:28245.0min: [2024-01-03 04:29:38,488][model8_pretrain.py][INFO] Epoch:[0/2](103200/4588595) loss:3.135 lr:0.0000100 epoch_Time:28245.0min: [2024-01-03 04:29:38,488][model8_pretrain.py][INFO] Epoch:[0/2](103200/4588595) loss:3.372 lr:0.0000100 epoch_Time:28245.0min: [2024-01-03 04:29:38,488][model8_pretrain.py][INFO] Epoch:[0/2](103200/4588595) loss:3.145 lr:0.0000100 epoch_Time:28245.0min: [2024-01-03 04:30:24,115][model8_pretrain.py][INFO] Epoch:[0/2](103300/4588595) loss:2.450 lr:0.0000100 epoch_Time:28249.0min: [2024-01-03 04:30:24,115][model8_pretrain.py][INFO] Epoch:[0/2](103300/4588595) loss:2.986 lr:0.0000100 epoch_Time:28249.0min: [2024-01-03 04:30:24,115][model8_pretrain.py][INFO] Epoch:[0/2](103300/4588595) loss:2.618 lr:0.0000100 epoch_Time:28249.0min: [2024-01-03 04:30:24,115][model8_pretrain.py][INFO] Epoch:[0/2](103300/4588595) loss:2.625 lr:0.0000100 epoch_Time:28249.0min: [2024-01-03 04:30:24,115][model8_pretrain.py][INFO] Epoch:[0/2](103300/4588595) loss:2.997 lr:0.0000100 epoch_Time:28249.0min: [2024-01-03 04:30:24,115][model8_pretrain.py][INFO] Epoch:[0/2](103300/4588595) loss:3.272 lr:0.0000100 epoch_Time:28249.0min: [2024-01-03 04:30:24,115][model8_pretrain.py][INFO] Epoch:[0/2](103300/4588595) loss:3.020 lr:0.0000100 epoch_Time:28249.0min: [2024-01-03 04:30:24,115][model8_pretrain.py][INFO] Epoch:[0/2](103300/4588595) loss:2.956 lr:0.0000100 epoch_Time:28249.0min: [2024-01-03 
04:31:01,061][model8_pretrain.py][INFO] Epoch:[0/2](103400/4588595) loss:2.458 lr:0.0000100 epoch_Time:28248.0min: [2024-01-03 04:31:01,062][model8_pretrain.py][INFO] Epoch:[0/2](103400/4588595) loss:2.946 lr:0.0000100 epoch_Time:28248.0min: [2024-01-03 04:31:01,062][model8_pretrain.py][INFO] Epoch:[0/2](103400/4588595) loss:3.285 lr:0.0000100 epoch_Time:28248.0min: [2024-01-03 04:31:01,062][model8_pretrain.py][INFO] Epoch:[0/2](103400/4588595) loss:2.983 lr:0.0000100 epoch_Time:28248.0min: [2024-01-03 04:31:01,062][model8_pretrain.py][INFO] Epoch:[0/2](103400/4588595) loss:2.889 lr:0.0000100 epoch_Time:28248.0min: [2024-01-03 04:31:01,062][model8_pretrain.py][INFO] Epoch:[0/2](103400/4588595) loss:2.880 lr:0.0000100 epoch_Time:28248.0min: [2024-01-03 04:31:01,062][model8_pretrain.py][INFO] Epoch:[0/2](103400/4588595) loss:2.553 lr:0.0000100 epoch_Time:28248.0min: [2024-01-03 04:31:01,062][model8_pretrain.py][INFO] Epoch:[0/2](103400/4588595) loss:3.229 lr:0.0000100 epoch_Time:28248.0min: [2024-01-03 04:31:38,011][model8_pretrain.py][INFO] Epoch:[0/2](103500/4588595) loss:2.431 lr:0.0000100 epoch_Time:28247.0min: [2024-01-03 04:31:38,011][model8_pretrain.py][INFO] Epoch:[0/2](103500/4588595) loss:2.879 lr:0.0000100 epoch_Time:28247.0min: [2024-01-03 04:31:38,011][model8_pretrain.py][INFO] Epoch:[0/2](103500/4588595) loss:3.200 lr:0.0000100 epoch_Time:28247.0min: [2024-01-03 04:31:38,011][model8_pretrain.py][INFO] Epoch:[0/2](103500/4588595) loss:2.559 lr:0.0000100 epoch_Time:28247.0min: [2024-01-03 04:31:38,011][model8_pretrain.py][INFO] Epoch:[0/2](103500/4588595) loss:2.414 lr:0.0000100 epoch_Time:28247.0min: [2024-01-03 04:31:38,011][model8_pretrain.py][INFO] Epoch:[0/2](103500/4588595) loss:3.180 lr:0.0000100 epoch_Time:28247.0min: [2024-01-03 04:31:38,011][model8_pretrain.py][INFO] Epoch:[0/2](103500/4588595) loss:3.016 lr:0.0000100 epoch_Time:28247.0min: [2024-01-03 04:31:38,012][model8_pretrain.py][INFO] Epoch:[0/2](103500/4588595) loss:3.127 lr:0.0000100 
epoch_Time:28247.0min: [2024-01-03 04:32:14,942][model8_pretrain.py][INFO] Epoch:[0/2](103600/4588595) loss:3.001 lr:0.0000100 epoch_Time:28245.0min: [2024-01-03 04:32:14,942][model8_pretrain.py][INFO] Epoch:[0/2](103600/4588595) loss:2.461 lr:0.0000100 epoch_Time:28245.0min: [2024-01-03 04:32:14,942][model8_pretrain.py][INFO] Epoch:[0/2](103600/4588595) loss:2.976 lr:0.0000100 epoch_Time:28245.0min: [2024-01-03 04:32:14,942][model8_pretrain.py][INFO] Epoch:[0/2](103600/4588595) loss:2.314 lr:0.0000100 epoch_Time:28245.0min: [2024-01-03 04:32:14,942][model8_pretrain.py][INFO] Epoch:[0/2](103600/4588595) loss:3.065 lr:0.0000100 epoch_Time:28245.0min: [2024-01-03 04:32:14,942][model8_pretrain.py][INFO] Epoch:[0/2](103600/4588595) loss:3.006 lr:0.0000100 epoch_Time:28245.0min: [2024-01-03 04:32:14,942][model8_pretrain.py][INFO] Epoch:[0/2](103600/4588595) loss:2.777 lr:0.0000100 epoch_Time:28245.0min: [2024-01-03 04:32:14,942][model8_pretrain.py][INFO] Epoch:[0/2](103600/4588595) loss:2.852 lr:0.0000100 epoch_Time:28245.0min: [2024-01-03 04:32:51,893][model8_pretrain.py][INFO] Epoch:[0/2](103700/4588595) loss:2.868 lr:0.0000100 epoch_Time:28244.0min: [2024-01-03 04:32:51,893][model8_pretrain.py][INFO] Epoch:[0/2](103700/4588595) loss:3.144 lr:0.0000100 epoch_Time:28244.0min: [2024-01-03 04:32:51,893][model8_pretrain.py][INFO] Epoch:[0/2](103700/4588595) loss:3.312 lr:0.0000100 epoch_Time:28244.0min: [2024-01-03 04:32:51,893][model8_pretrain.py][INFO] Epoch:[0/2](103700/4588595) loss:2.993 lr:0.0000100 epoch_Time:28244.0min: [2024-01-03 04:32:51,893][model8_pretrain.py][INFO] Epoch:[0/2](103700/4588595) loss:2.895 lr:0.0000100 epoch_Time:28244.0min: [2024-01-03 04:32:51,893][model8_pretrain.py][INFO] Epoch:[0/2](103700/4588595) loss:3.222 lr:0.0000100 epoch_Time:28244.0min: [2024-01-03 04:32:51,894][model8_pretrain.py][INFO] Epoch:[0/2](103700/4588595) loss:3.224 lr:0.0000100 epoch_Time:28244.0min: [2024-01-03 04:32:51,894][model8_pretrain.py][INFO] 
Epoch:[0/2](103700/4588595) loss:3.177 lr:0.0000100 epoch_Time:28244.0min: [2024-01-03 04:33:28,829][model8_pretrain.py][INFO] Epoch:[0/2](103800/4588595) loss:2.603 lr:0.0000100 epoch_Time:28243.0min: [2024-01-03 04:33:28,829][model8_pretrain.py][INFO] Epoch:[0/2](103800/4588595) loss:2.823 lr:0.0000100 epoch_Time:28243.0min: [2024-01-03 04:33:28,829][model8_pretrain.py][INFO] Epoch:[0/2](103800/4588595) loss:3.102 lr:0.0000100 epoch_Time:28243.0min: [2024-01-03 04:33:28,829][model8_pretrain.py][INFO] Epoch:[0/2](103800/4588595) loss:3.368 lr:0.0000100 epoch_Time:28243.0min: [2024-01-03 04:33:28,829][model8_pretrain.py][INFO] Epoch:[0/2](103800/4588595) loss:2.451 lr:0.0000100 epoch_Time:28243.0min: [2024-01-03 04:33:28,829][model8_pretrain.py][INFO] Epoch:[0/2](103800/4588595) loss:2.920 lr:0.0000100 epoch_Time:28243.0min: [2024-01-03 04:33:28,829][model8_pretrain.py][INFO] Epoch:[0/2](103800/4588595) loss:3.233 lr:0.0000100 epoch_Time:28243.0min: [2024-01-03 04:33:28,829][model8_pretrain.py][INFO] Epoch:[0/2](103800/4588595) loss:2.938 lr:0.0000100 epoch_Time:28243.0min: [2024-01-03 04:34:05,771][model8_pretrain.py][INFO] Epoch:[0/2](103900/4588595) loss:2.959 lr:0.0000100 epoch_Time:28242.0min: [2024-01-03 04:34:05,771][model8_pretrain.py][INFO] Epoch:[0/2](103900/4588595) loss:2.813 lr:0.0000100 epoch_Time:28242.0min: [2024-01-03 04:34:05,771][model8_pretrain.py][INFO] Epoch:[0/2](103900/4588595) loss:3.344 lr:0.0000100 epoch_Time:28242.0min: [2024-01-03 04:34:05,771][model8_pretrain.py][INFO] Epoch:[0/2](103900/4588595) loss:2.801 lr:0.0000100 epoch_Time:28242.0min: [2024-01-03 04:34:05,771][model8_pretrain.py][INFO] Epoch:[0/2](103900/4588595) loss:2.839 lr:0.0000100 epoch_Time:28242.0min: [2024-01-03 04:34:05,771][model8_pretrain.py][INFO] Epoch:[0/2](103900/4588595) loss:3.006 lr:0.0000100 epoch_Time:28242.0min: [2024-01-03 04:34:05,771][model8_pretrain.py][INFO] Epoch:[0/2](103900/4588595) loss:2.919 lr:0.0000100 epoch_Time:28242.0min: [2024-01-03 
04:34:05,771][model8_pretrain.py][INFO] Epoch:[0/2](103900/4588595) loss:2.811 lr:0.0000100 epoch_Time:28242.0min: [2024-01-03 04:34:42,714][model8_pretrain.py][INFO] Epoch:[0/2](104000/4588595) loss:3.166 lr:0.0000100 epoch_Time:28241.0min: [2024-01-03 04:34:42,714][model8_pretrain.py][INFO] Epoch:[0/2](104000/4588595) loss:2.843 lr:0.0000100 epoch_Time:28241.0min: [2024-01-03 04:34:42,714][model8_pretrain.py][INFO] Epoch:[0/2](104000/4588595) loss:2.703 lr:0.0000100 epoch_Time:28241.0min: [2024-01-03 04:34:42,714][model8_pretrain.py][INFO] Epoch:[0/2](104000/4588595) loss:2.838 lr:0.0000100 epoch_Time:28241.0min: [2024-01-03 04:34:42,714][model8_pretrain.py][INFO] Epoch:[0/2](104000/4588595) loss:3.143 lr:0.0000100 epoch_Time:28241.0min: [2024-01-03 04:34:42,714][model8_pretrain.py][INFO] Epoch:[0/2](104000/4588595) loss:2.925 lr:0.0000100 epoch_Time:28241.0min: [2024-01-03 04:34:42,715][model8_pretrain.py][INFO] Epoch:[0/2](104000/4588595) loss:3.074 lr:0.0000100 epoch_Time:28241.0min: [2024-01-03 04:34:42,715][model8_pretrain.py][INFO] Epoch:[0/2](104000/4588595) loss:2.397 lr:0.0000100 epoch_Time:28241.0min: [2024-01-03 04:35:28,441][model8_pretrain.py][INFO] Epoch:[0/2](104100/4588595) loss:3.006 lr:0.0000100 epoch_Time:28246.0min: [2024-01-03 04:35:28,441][model8_pretrain.py][INFO] Epoch:[0/2](104100/4588595) loss:3.309 lr:0.0000100 epoch_Time:28246.0min: [2024-01-03 04:35:28,441][model8_pretrain.py][INFO] Epoch:[0/2](104100/4588595) loss:3.263 lr:0.0000100 epoch_Time:28246.0min: [2024-01-03 04:35:28,441][model8_pretrain.py][INFO] Epoch:[0/2](104100/4588595) loss:2.887 lr:0.0000100 epoch_Time:28246.0min: [2024-01-03 04:35:28,441][model8_pretrain.py][INFO] Epoch:[0/2](104100/4588595) loss:3.086 lr:0.0000100 epoch_Time:28246.0min: [2024-01-03 04:35:28,441][model8_pretrain.py][INFO] Epoch:[0/2](104100/4588595) loss:2.399 lr:0.0000100 epoch_Time:28246.0min: [2024-01-03 04:35:28,441][model8_pretrain.py][INFO] Epoch:[0/2](104100/4588595) loss:3.409 lr:0.0000100 
epoch_Time:28246.0min: [2024-01-03 04:35:28,441][model8_pretrain.py][INFO] Epoch:[0/2](104100/4588595) loss:2.993 lr:0.0000100 epoch_Time:28246.0min: [2024-01-03 04:36:05,432][model8_pretrain.py][INFO] Epoch:[0/2](104200/4588595) loss:2.977 lr:0.0000100 epoch_Time:28244.0min: [2024-01-03 04:36:05,432][model8_pretrain.py][INFO] Epoch:[0/2](104200/4588595) loss:2.771 lr:0.0000100 epoch_Time:28244.0min: [2024-01-03 04:36:05,432][model8_pretrain.py][INFO] Epoch:[0/2](104200/4588595) loss:2.688 lr:0.0000100 epoch_Time:28244.0min: [2024-01-03 04:36:05,432][model8_pretrain.py][INFO] Epoch:[0/2](104200/4588595) loss:2.772 lr:0.0000100 epoch_Time:28244.0min: [2024-01-03 04:36:05,432][model8_pretrain.py][INFO] Epoch:[0/2](104200/4588595) loss:2.918 lr:0.0000100 epoch_Time:28244.0min: [2024-01-03 04:36:05,432][model8_pretrain.py][INFO] Epoch:[0/2](104200/4588595) loss:2.907 lr:0.0000100 epoch_Time:28244.0min: [2024-01-03 04:36:05,432][model8_pretrain.py][INFO] Epoch:[0/2](104200/4588595) loss:2.658 lr:0.0000100 epoch_Time:28244.0min: [2024-01-03 04:36:05,432][model8_pretrain.py][INFO] Epoch:[0/2](104200/4588595) loss:2.583 lr:0.0000100 epoch_Time:28244.0min: [2024-01-03 04:36:42,427][model8_pretrain.py][INFO] Epoch:[0/2](104300/4588595) loss:2.618 lr:0.0000100 epoch_Time:28244.0min: [2024-01-03 04:36:42,427][model8_pretrain.py][INFO] Epoch:[0/2](104300/4588595) loss:3.003 lr:0.0000100 epoch_Time:28244.0min: [2024-01-03 04:36:42,427][model8_pretrain.py][INFO] Epoch:[0/2](104300/4588595) loss:3.129 lr:0.0000100 epoch_Time:28244.0min: [2024-01-03 04:36:42,427][model8_pretrain.py][INFO] Epoch:[0/2](104300/4588595) loss:2.993 lr:0.0000100 epoch_Time:28244.0min: [2024-01-03 04:36:42,427][model8_pretrain.py][INFO] Epoch:[0/2](104300/4588595) loss:2.694 lr:0.0000100 epoch_Time:28244.0min: [2024-01-03 04:36:42,427][model8_pretrain.py][INFO] Epoch:[0/2](104300/4588595) loss:2.813 lr:0.0000100 epoch_Time:28244.0min: [2024-01-03 04:36:42,427][model8_pretrain.py][INFO] 
Epoch:[0/2](104300/4588595) loss:2.805 lr:0.0000100 epoch_Time:28244.0min: [2024-01-03 04:36:42,427][model8_pretrain.py][INFO] Epoch:[0/2](104300/4588595) loss:3.312 lr:0.0000100 epoch_Time:28244.0min: [2024-01-03 04:37:19,413][model8_pretrain.py][INFO] Epoch:[0/2](104400/4588595) loss:2.957 lr:0.0000100 epoch_Time:28242.0min: [2024-01-03 04:37:19,413][model8_pretrain.py][INFO] Epoch:[0/2](104400/4588595) loss:2.729 lr:0.0000100 epoch_Time:28242.0min: [2024-01-03 04:37:19,413][model8_pretrain.py][INFO] Epoch:[0/2](104400/4588595) loss:2.249 lr:0.0000100 epoch_Time:28242.0min: [2024-01-03 04:37:19,414][model8_pretrain.py][INFO] Epoch:[0/2](104400/4588595) loss:2.687 lr:0.0000100 epoch_Time:28242.0min: [2024-01-03 04:37:19,414][model8_pretrain.py][INFO] Epoch:[0/2](104400/4588595) loss:2.579 lr:0.0000100 epoch_Time:28242.0min: [2024-01-03 04:37:19,413][model8_pretrain.py][INFO] Epoch:[0/2](104400/4588595) loss:3.239 lr:0.0000100 epoch_Time:28242.0min: [2024-01-03 04:37:19,413][model8_pretrain.py][INFO] Epoch:[0/2](104400/4588595) loss:3.011 lr:0.0000100 epoch_Time:28242.0min: [2024-01-03 04:37:19,413][model8_pretrain.py][INFO] Epoch:[0/2](104400/4588595) loss:2.944 lr:0.0000100 epoch_Time:28242.0min: [2024-01-03 04:37:56,352][model8_pretrain.py][INFO] Epoch:[0/2](104500/4588595) loss:2.898 lr:0.0000100 epoch_Time:28240.0min: [2024-01-03 04:37:56,352][model8_pretrain.py][INFO] Epoch:[0/2](104500/4588595) loss:3.242 lr:0.0000100 epoch_Time:28240.0min: [2024-01-03 04:37:56,352][model8_pretrain.py][INFO] Epoch:[0/2](104500/4588595) loss:2.722 lr:0.0000100 epoch_Time:28240.0min: [2024-01-03 04:37:56,352][model8_pretrain.py][INFO] Epoch:[0/2](104500/4588595) loss:2.651 lr:0.0000100 epoch_Time:28240.0min: [2024-01-03 04:37:56,352][model8_pretrain.py][INFO] Epoch:[0/2](104500/4588595) loss:3.523 lr:0.0000100 epoch_Time:28240.0min: [2024-01-03 04:37:56,352][model8_pretrain.py][INFO] Epoch:[0/2](104500/4588595) loss:2.451 lr:0.0000100 epoch_Time:28240.0min: [2024-01-03 
04:37:56,352][model8_pretrain.py][INFO] Epoch:[0/2](104500/4588595) loss:2.667 lr:0.0000100 epoch_Time:28240.0min: [2024-01-03 04:37:56,352][model8_pretrain.py][INFO] Epoch:[0/2](104500/4588595) loss:3.493 lr:0.0000100 epoch_Time:28240.0min: [2024-01-03 04:38:33,283][model8_pretrain.py][INFO] Epoch:[0/2](104600/4588595) loss:3.411 lr:0.0000100 epoch_Time:28240.0min: [2024-01-03 04:38:33,283][model8_pretrain.py][INFO] Epoch:[0/2](104600/4588595) loss:2.786 lr:0.0000100 epoch_Time:28240.0min: [2024-01-03 04:38:33,283][model8_pretrain.py][INFO] Epoch:[0/2](104600/4588595) loss:2.716 lr:0.0000100 epoch_Time:28240.0min: [2024-01-03 04:38:33,283][model8_pretrain.py][INFO] Epoch:[0/2](104600/4588595) loss:3.326 lr:0.0000100 epoch_Time:28240.0min: [2024-01-03 04:38:33,283][model8_pretrain.py][INFO] Epoch:[0/2](104600/4588595) loss:2.911 lr:0.0000100 epoch_Time:28240.0min: [2024-01-03 04:38:33,284][model8_pretrain.py][INFO] Epoch:[0/2](104600/4588595) loss:3.017 lr:0.0000100 epoch_Time:28240.0min: [2024-01-03 04:38:33,284][model8_pretrain.py][INFO] Epoch:[0/2](104600/4588595) loss:3.097 lr:0.0000100 epoch_Time:28240.0min: [2024-01-03 04:38:33,284][model8_pretrain.py][INFO] Epoch:[0/2](104600/4588595) loss:2.677 lr:0.0000100 epoch_Time:28240.0min: [2024-01-03 04:39:10,227][model8_pretrain.py][INFO] Epoch:[0/2](104700/4588595) loss:2.890 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:39:10,227][model8_pretrain.py][INFO] Epoch:[0/2](104700/4588595) loss:3.389 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:39:10,227][model8_pretrain.py][INFO] Epoch:[0/2](104700/4588595) loss:3.063 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:39:10,227][model8_pretrain.py][INFO] Epoch:[0/2](104700/4588595) loss:2.976 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:39:10,227][model8_pretrain.py][INFO] Epoch:[0/2](104700/4588595) loss:3.190 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:39:10,227][model8_pretrain.py][INFO] Epoch:[0/2](104700/4588595) loss:2.997 lr:0.0000100 
epoch_Time:28238.0min: [2024-01-03 04:39:10,227][model8_pretrain.py][INFO] Epoch:[0/2](104700/4588595) loss:3.236 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:39:10,227][model8_pretrain.py][INFO] Epoch:[0/2](104700/4588595) loss:2.592 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:39:47,166][model8_pretrain.py][INFO] Epoch:[0/2](104800/4588595) loss:2.693 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:39:47,166][model8_pretrain.py][INFO] Epoch:[0/2](104800/4588595) loss:3.138 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:39:47,166][model8_pretrain.py][INFO] Epoch:[0/2](104800/4588595) loss:2.962 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:39:47,166][model8_pretrain.py][INFO] Epoch:[0/2](104800/4588595) loss:3.013 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:39:47,166][model8_pretrain.py][INFO] Epoch:[0/2](104800/4588595) loss:2.959 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:39:47,166][model8_pretrain.py][INFO] Epoch:[0/2](104800/4588595) loss:2.948 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:39:47,166][model8_pretrain.py][INFO] Epoch:[0/2](104800/4588595) loss:3.052 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:39:47,166][model8_pretrain.py][INFO] Epoch:[0/2](104800/4588595) loss:2.807 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:40:32,772][model8_pretrain.py][INFO] Epoch:[0/2](104900/4588595) loss:3.261 lr:0.0000100 epoch_Time:28242.0min: [2024-01-03 04:40:32,772][model8_pretrain.py][INFO] Epoch:[0/2](104900/4588595) loss:3.610 lr:0.0000100 epoch_Time:28242.0min: [2024-01-03 04:40:32,772][model8_pretrain.py][INFO] Epoch:[0/2](104900/4588595) loss:2.997 lr:0.0000100 epoch_Time:28242.0min: [2024-01-03 04:40:32,772][model8_pretrain.py][INFO] Epoch:[0/2](104900/4588595) loss:2.590 lr:0.0000100 epoch_Time:28242.0min: [2024-01-03 04:40:32,772][model8_pretrain.py][INFO] Epoch:[0/2](104900/4588595) loss:3.248 lr:0.0000100 epoch_Time:28242.0min: [2024-01-03 04:40:32,772][model8_pretrain.py][INFO] 
Epoch:[0/2](104900/4588595) loss:3.077 lr:0.0000100 epoch_Time:28242.0min: [2024-01-03 04:40:32,772][model8_pretrain.py][INFO] Epoch:[0/2](104900/4588595) loss:2.573 lr:0.0000100 epoch_Time:28242.0min: [2024-01-03 04:40:32,772][model8_pretrain.py][INFO] Epoch:[0/2](104900/4588595) loss:3.146 lr:0.0000100 epoch_Time:28242.0min: [2024-01-03 04:41:09,712][model8_pretrain.py][INFO] Epoch:[0/2](105000/4588595) loss:3.039 lr:0.0000100 epoch_Time:28241.0min: [2024-01-03 04:41:09,712][model8_pretrain.py][INFO] Epoch:[0/2](105000/4588595) loss:3.202 lr:0.0000100 epoch_Time:28241.0min: [2024-01-03 04:41:09,712][model8_pretrain.py][INFO] Epoch:[0/2](105000/4588595) loss:3.502 lr:0.0000100 epoch_Time:28241.0min: [2024-01-03 04:41:09,712][model8_pretrain.py][INFO] Epoch:[0/2](105000/4588595) loss:2.817 lr:0.0000100 epoch_Time:28241.0min: [2024-01-03 04:41:09,712][model8_pretrain.py][INFO] Epoch:[0/2](105000/4588595) loss:2.907 lr:0.0000100 epoch_Time:28241.0min: [2024-01-03 04:41:09,712][model8_pretrain.py][INFO] Epoch:[0/2](105000/4588595) loss:2.972 lr:0.0000100 epoch_Time:28241.0min: [2024-01-03 04:41:09,712][model8_pretrain.py][INFO] Epoch:[0/2](105000/4588595) loss:2.460 lr:0.0000100 epoch_Time:28241.0min: [2024-01-03 04:41:09,712][model8_pretrain.py][INFO] Epoch:[0/2](105000/4588595) loss:2.963 lr:0.0000100 epoch_Time:28241.0min: [2024-01-03 04:41:46,658][model8_pretrain.py][INFO] Epoch:[0/2](105100/4588595) loss:3.004 lr:0.0000100 epoch_Time:28240.0min: [2024-01-03 04:41:46,658][model8_pretrain.py][INFO] Epoch:[0/2](105100/4588595) loss:2.950 lr:0.0000100 epoch_Time:28240.0min: [2024-01-03 04:41:46,658][model8_pretrain.py][INFO] Epoch:[0/2](105100/4588595) loss:3.452 lr:0.0000100 epoch_Time:28240.0min: [2024-01-03 04:41:46,658][model8_pretrain.py][INFO] Epoch:[0/2](105100/4588595) loss:3.452 lr:0.0000100 epoch_Time:28240.0min: [2024-01-03 04:41:46,658][model8_pretrain.py][INFO] Epoch:[0/2](105100/4588595) loss:3.292 lr:0.0000100 epoch_Time:28240.0min: [2024-01-03 
04:41:46,658][model8_pretrain.py][INFO] Epoch:[0/2](105100/4588595) loss:3.269 lr:0.0000100 epoch_Time:28240.0min: [2024-01-03 04:41:46,658][model8_pretrain.py][INFO] Epoch:[0/2](105100/4588595) loss:3.001 lr:0.0000100 epoch_Time:28240.0min: [2024-01-03 04:41:46,658][model8_pretrain.py][INFO] Epoch:[0/2](105100/4588595) loss:2.854 lr:0.0000100 epoch_Time:28240.0min: [2024-01-03 04:42:23,602][model8_pretrain.py][INFO] Epoch:[0/2](105200/4588595) loss:2.998 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:42:23,602][model8_pretrain.py][INFO] Epoch:[0/2](105200/4588595) loss:3.339 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:42:23,603][model8_pretrain.py][INFO] Epoch:[0/2](105200/4588595) loss:2.138 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:42:23,603][model8_pretrain.py][INFO] Epoch:[0/2](105200/4588595) loss:3.030 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:42:23,602][model8_pretrain.py][INFO] Epoch:[0/2](105200/4588595) loss:2.921 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:42:23,603][model8_pretrain.py][INFO] Epoch:[0/2](105200/4588595) loss:2.957 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:42:23,603][model8_pretrain.py][INFO] Epoch:[0/2](105200/4588595) loss:3.167 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:42:23,603][model8_pretrain.py][INFO] Epoch:[0/2](105200/4588595) loss:2.487 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:43:00,552][model8_pretrain.py][INFO] Epoch:[0/2](105300/4588595) loss:2.838 lr:0.0000100 epoch_Time:28237.0min: [2024-01-03 04:43:00,552][model8_pretrain.py][INFO] Epoch:[0/2](105300/4588595) loss:2.487 lr:0.0000100 epoch_Time:28237.0min: [2024-01-03 04:43:00,552][model8_pretrain.py][INFO] Epoch:[0/2](105300/4588595) loss:3.207 lr:0.0000100 epoch_Time:28237.0min: [2024-01-03 04:43:00,552][model8_pretrain.py][INFO] Epoch:[0/2](105300/4588595) loss:2.805 lr:0.0000100 epoch_Time:28237.0min: [2024-01-03 04:43:00,552][model8_pretrain.py][INFO] Epoch:[0/2](105300/4588595) loss:2.551 lr:0.0000100 
epoch_Time:28237.0min: [2024-01-03 04:43:00,552][model8_pretrain.py][INFO] Epoch:[0/2](105300/4588595) loss:2.513 lr:0.0000100 epoch_Time:28237.0min: [2024-01-03 04:43:00,552][model8_pretrain.py][INFO] Epoch:[0/2](105300/4588595) loss:2.990 lr:0.0000100 epoch_Time:28237.0min: [2024-01-03 04:43:00,553][model8_pretrain.py][INFO] Epoch:[0/2](105300/4588595) loss:2.903 lr:0.0000100 epoch_Time:28237.0min: [2024-01-03 04:43:37,509][model8_pretrain.py][INFO] Epoch:[0/2](105400/4588595) loss:3.125 lr:0.0000100 epoch_Time:28236.0min: [2024-01-03 04:43:37,509][model8_pretrain.py][INFO] Epoch:[0/2](105400/4588595) loss:3.014 lr:0.0000100 epoch_Time:28236.0min: [2024-01-03 04:43:37,509][model8_pretrain.py][INFO] Epoch:[0/2](105400/4588595) loss:3.019 lr:0.0000100 epoch_Time:28236.0min: [2024-01-03 04:43:37,509][model8_pretrain.py][INFO] Epoch:[0/2](105400/4588595) loss:2.725 lr:0.0000100 epoch_Time:28236.0min: [2024-01-03 04:43:37,509][model8_pretrain.py][INFO] Epoch:[0/2](105400/4588595) loss:2.371 lr:0.0000100 epoch_Time:28236.0min: [2024-01-03 04:43:37,510][model8_pretrain.py][INFO] Epoch:[0/2](105400/4588595) loss:3.191 lr:0.0000100 epoch_Time:28236.0min: [2024-01-03 04:43:37,510][model8_pretrain.py][INFO] Epoch:[0/2](105400/4588595) loss:2.523 lr:0.0000100 epoch_Time:28236.0min: [2024-01-03 04:43:37,510][model8_pretrain.py][INFO] Epoch:[0/2](105400/4588595) loss:2.802 lr:0.0000100 epoch_Time:28236.0min: [2024-01-03 04:44:14,446][model8_pretrain.py][INFO] Epoch:[0/2](105500/4588595) loss:3.219 lr:0.0000100 epoch_Time:28235.0min: [2024-01-03 04:44:14,446][model8_pretrain.py][INFO] Epoch:[0/2](105500/4588595) loss:2.768 lr:0.0000100 epoch_Time:28235.0min: [2024-01-03 04:44:14,446][model8_pretrain.py][INFO] Epoch:[0/2](105500/4588595) loss:3.040 lr:0.0000100 epoch_Time:28235.0min: [2024-01-03 04:44:14,446][model8_pretrain.py][INFO] Epoch:[0/2](105500/4588595) loss:2.201 lr:0.0000100 epoch_Time:28235.0min: [2024-01-03 04:44:14,446][model8_pretrain.py][INFO] 
Epoch:[0/2](105500/4588595) loss:3.419 lr:0.0000100 epoch_Time:28235.0min: [2024-01-03 04:44:14,446][model8_pretrain.py][INFO] Epoch:[0/2](105500/4588595) loss:2.566 lr:0.0000100 epoch_Time:28235.0min: [2024-01-03 04:44:14,446][model8_pretrain.py][INFO] Epoch:[0/2](105500/4588595) loss:2.750 lr:0.0000100 epoch_Time:28235.0min: [2024-01-03 04:44:14,446][model8_pretrain.py][INFO] Epoch:[0/2](105500/4588595) loss:2.860 lr:0.0000100 epoch_Time:28235.0min: [2024-01-03 04:44:51,381][model8_pretrain.py][INFO] Epoch:[0/2](105600/4588595) loss:3.161 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:44:51,381][model8_pretrain.py][INFO] Epoch:[0/2](105600/4588595) loss:3.000 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:44:51,381][model8_pretrain.py][INFO] Epoch:[0/2](105600/4588595) loss:2.937 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:44:51,381][model8_pretrain.py][INFO] Epoch:[0/2](105600/4588595) loss:2.614 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:44:51,381][model8_pretrain.py][INFO] Epoch:[0/2](105600/4588595) loss:2.744 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:44:51,381][model8_pretrain.py][INFO] Epoch:[0/2](105600/4588595) loss:2.922 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:44:51,381][model8_pretrain.py][INFO] Epoch:[0/2](105600/4588595) loss:2.859 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:44:51,381][model8_pretrain.py][INFO] Epoch:[0/2](105600/4588595) loss:3.057 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:45:36,780][model8_pretrain.py][INFO] Epoch:[0/2](105700/4588595) loss:3.182 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:45:36,780][model8_pretrain.py][INFO] Epoch:[0/2](105700/4588595) loss:2.560 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:45:36,780][model8_pretrain.py][INFO] Epoch:[0/2](105700/4588595) loss:3.084 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:45:36,780][model8_pretrain.py][INFO] Epoch:[0/2](105700/4588595) loss:2.825 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 
04:45:36,780][model8_pretrain.py][INFO] Epoch:[0/2](105700/4588595) loss:2.945 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:45:36,780][model8_pretrain.py][INFO] Epoch:[0/2](105700/4588595) loss:2.856 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:45:36,780][model8_pretrain.py][INFO] Epoch:[0/2](105700/4588595) loss:2.621 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:45:36,781][model8_pretrain.py][INFO] Epoch:[0/2](105700/4588595) loss:3.299 lr:0.0000100 epoch_Time:28238.0min: [2024-01-03 04:46:13,703][model8_pretrain.py][INFO] Epoch:[0/2](105800/4588595) loss:3.161 lr:0.0000100 epoch_Time:28237.0min: [2024-01-03 04:46:13,704][model8_pretrain.py][INFO] Epoch:[0/2](105800/4588595) loss:3.082 lr:0.0000100 epoch_Time:28237.0min: [2024-01-03 04:46:13,704][model8_pretrain.py][INFO] Epoch:[0/2](105800/4588595) loss:2.727 lr:0.0000100 epoch_Time:28237.0min: [2024-01-03 04:46:13,704][model8_pretrain.py][INFO] Epoch:[0/2](105800/4588595) loss:3.645 lr:0.0000100 epoch_Time:28237.0min: [2024-01-03 04:46:13,704][model8_pretrain.py][INFO] Epoch:[0/2](105800/4588595) loss:2.716 lr:0.0000100 epoch_Time:28237.0min: [2024-01-03 04:46:13,704][model8_pretrain.py][INFO] Epoch:[0/2](105800/4588595) loss:2.471 lr:0.0000100 epoch_Time:28237.0min: [2024-01-03 04:46:13,704][model8_pretrain.py][INFO] Epoch:[0/2](105800/4588595) loss:3.213 lr:0.0000100 epoch_Time:28237.0min: [2024-01-03 04:46:13,704][model8_pretrain.py][INFO] Epoch:[0/2](105800/4588595) loss:3.063 lr:0.0000100 epoch_Time:28237.0min: [2024-01-03 04:46:50,651][model8_pretrain.py][INFO] Epoch:[0/2](105900/4588595) loss:2.613 lr:0.0000100 epoch_Time:28235.0min: [2024-01-03 04:46:50,651][model8_pretrain.py][INFO] Epoch:[0/2](105900/4588595) loss:2.725 lr:0.0000100 epoch_Time:28235.0min: [2024-01-03 04:46:50,651][model8_pretrain.py][INFO] Epoch:[0/2](105900/4588595) loss:2.718 lr:0.0000100 epoch_Time:28235.0min: [2024-01-03 04:46:50,651][model8_pretrain.py][INFO] Epoch:[0/2](105900/4588595) loss:3.269 lr:0.0000100 
epoch_Time:28235.0min: [2024-01-03 04:46:50,651][model8_pretrain.py][INFO] Epoch:[0/2](105900/4588595) loss:3.016 lr:0.0000100 epoch_Time:28235.0min: [2024-01-03 04:46:50,651][model8_pretrain.py][INFO] Epoch:[0/2](105900/4588595) loss:2.994 lr:0.0000100 epoch_Time:28235.0min: [2024-01-03 04:46:50,651][model8_pretrain.py][INFO] Epoch:[0/2](105900/4588595) loss:3.272 lr:0.0000100 epoch_Time:28235.0min: [2024-01-03 04:46:50,651][model8_pretrain.py][INFO] Epoch:[0/2](105900/4588595) loss:2.803 lr:0.0000100 epoch_Time:28235.0min: [2024-01-03 04:47:27,618][model8_pretrain.py][INFO] Epoch:[0/2](106000/4588595) loss:2.706 lr:0.0000100 epoch_Time:28235.0min: [2024-01-03 04:47:27,618][model8_pretrain.py][INFO] Epoch:[0/2](106000/4588595) loss:2.544 lr:0.0000100 epoch_Time:28235.0min: [2024-01-03 04:47:27,618][model8_pretrain.py][INFO] Epoch:[0/2](106000/4588595) loss:3.145 lr:0.0000100 epoch_Time:28235.0min: [2024-01-03 04:47:27,618][model8_pretrain.py][INFO] Epoch:[0/2](106000/4588595) loss:3.103 lr:0.0000100 epoch_Time:28235.0min: [2024-01-03 04:47:27,618][model8_pretrain.py][INFO] Epoch:[0/2](106000/4588595) loss:3.085 lr:0.0000100 epoch_Time:28235.0min: [2024-01-03 04:47:27,618][model8_pretrain.py][INFO] Epoch:[0/2](106000/4588595) loss:2.948 lr:0.0000100 epoch_Time:28235.0min: [2024-01-03 04:47:27,618][model8_pretrain.py][INFO] Epoch:[0/2](106000/4588595) loss:2.826 lr:0.0000100 epoch_Time:28235.0min: [2024-01-03 04:47:27,618][model8_pretrain.py][INFO] Epoch:[0/2](106000/4588595) loss:2.854 lr:0.0000100 epoch_Time:28235.0min: [2024-01-03 04:48:04,555][model8_pretrain.py][INFO] Epoch:[0/2](106100/4588595) loss:2.254 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:48:04,555][model8_pretrain.py][INFO] Epoch:[0/2](106100/4588595) loss:3.196 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:48:04,555][model8_pretrain.py][INFO] Epoch:[0/2](106100/4588595) loss:2.982 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:48:04,555][model8_pretrain.py][INFO] 
Epoch:[0/2](106100/4588595) loss:2.949 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:48:04,555][model8_pretrain.py][INFO] Epoch:[0/2](106100/4588595) loss:3.018 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:48:04,555][model8_pretrain.py][INFO] Epoch:[0/2](106100/4588595) loss:3.084 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:48:04,555][model8_pretrain.py][INFO] Epoch:[0/2](106100/4588595) loss:2.608 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:48:04,555][model8_pretrain.py][INFO] Epoch:[0/2](106100/4588595) loss:2.944 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:48:41,483][model8_pretrain.py][INFO] Epoch:[0/2](106200/4588595) loss:3.159 lr:0.0000100 epoch_Time:28232.0min: [2024-01-03 04:48:41,483][model8_pretrain.py][INFO] Epoch:[0/2](106200/4588595) loss:3.137 lr:0.0000100 epoch_Time:28232.0min: [2024-01-03 04:48:41,483][model8_pretrain.py][INFO] Epoch:[0/2](106200/4588595) loss:2.979 lr:0.0000100 epoch_Time:28232.0min: [2024-01-03 04:48:41,483][model8_pretrain.py][INFO] Epoch:[0/2](106200/4588595) loss:2.896 lr:0.0000100 epoch_Time:28232.0min: [2024-01-03 04:48:41,483][model8_pretrain.py][INFO] Epoch:[0/2](106200/4588595) loss:2.329 lr:0.0000100 epoch_Time:28232.0min: [2024-01-03 04:48:41,483][model8_pretrain.py][INFO] Epoch:[0/2](106200/4588595) loss:2.653 lr:0.0000100 epoch_Time:28232.0min: [2024-01-03 04:48:41,483][model8_pretrain.py][INFO] Epoch:[0/2](106200/4588595) loss:2.625 lr:0.0000100 epoch_Time:28232.0min: [2024-01-03 04:48:41,483][model8_pretrain.py][INFO] Epoch:[0/2](106200/4588595) loss:3.130 lr:0.0000100 epoch_Time:28232.0min: [2024-01-03 04:49:18,423][model8_pretrain.py][INFO] Epoch:[0/2](106300/4588595) loss:2.651 lr:0.0000100 epoch_Time:28231.0min: [2024-01-03 04:49:18,423][model8_pretrain.py][INFO] Epoch:[0/2](106300/4588595) loss:2.986 lr:0.0000100 epoch_Time:28231.0min: [2024-01-03 04:49:18,423][model8_pretrain.py][INFO] Epoch:[0/2](106300/4588595) loss:2.500 lr:0.0000100 epoch_Time:28231.0min: [2024-01-03 
04:49:18,423][model8_pretrain.py][INFO] Epoch:[0/2](106300/4588595) loss:2.787 lr:0.0000100 epoch_Time:28231.0min: [2024-01-03 04:49:18,423][model8_pretrain.py][INFO] Epoch:[0/2](106300/4588595) loss:3.362 lr:0.0000100 epoch_Time:28231.0min: [2024-01-03 04:49:18,423][model8_pretrain.py][INFO] Epoch:[0/2](106300/4588595) loss:2.572 lr:0.0000100 epoch_Time:28231.0min: [2024-01-03 04:49:18,423][model8_pretrain.py][INFO] Epoch:[0/2](106300/4588595) loss:2.908 lr:0.0000100 epoch_Time:28231.0min: [2024-01-03 04:49:18,423][model8_pretrain.py][INFO] Epoch:[0/2](106300/4588595) loss:2.618 lr:0.0000100 epoch_Time:28231.0min: [2024-01-03 04:49:55,359][model8_pretrain.py][INFO] Epoch:[0/2](106400/4588595) loss:3.024 lr:0.0000100 epoch_Time:28229.0min: [2024-01-03 04:49:55,359][model8_pretrain.py][INFO] Epoch:[0/2](106400/4588595) loss:2.607 lr:0.0000100 epoch_Time:28229.0min: [2024-01-03 04:49:55,359][model8_pretrain.py][INFO] Epoch:[0/2](106400/4588595) loss:3.301 lr:0.0000100 epoch_Time:28229.0min: [2024-01-03 04:49:55,359][model8_pretrain.py][INFO] Epoch:[0/2](106400/4588595) loss:3.261 lr:0.0000100 epoch_Time:28229.0min: [2024-01-03 04:49:55,359][model8_pretrain.py][INFO] Epoch:[0/2](106400/4588595) loss:3.386 lr:0.0000100 epoch_Time:28229.0min: [2024-01-03 04:49:55,359][model8_pretrain.py][INFO] Epoch:[0/2](106400/4588595) loss:2.485 lr:0.0000100 epoch_Time:28229.0min: [2024-01-03 04:49:55,359][model8_pretrain.py][INFO] Epoch:[0/2](106400/4588595) loss:2.574 lr:0.0000100 epoch_Time:28229.0min: [2024-01-03 04:49:55,359][model8_pretrain.py][INFO] Epoch:[0/2](106400/4588595) loss:3.141 lr:0.0000100 epoch_Time:28229.0min: [2024-01-03 04:50:38,956][model8_pretrain.py][INFO] Epoch:[0/2](106500/4588595) loss:3.006 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:50:38,956][model8_pretrain.py][INFO] Epoch:[0/2](106500/4588595) loss:2.504 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:50:38,956][model8_pretrain.py][INFO] Epoch:[0/2](106500/4588595) loss:3.351 lr:0.0000100 
epoch_Time:28233.0min: [2024-01-03 04:50:38,956][model8_pretrain.py][INFO] Epoch:[0/2](106500/4588595) loss:3.153 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:50:38,960][model8_pretrain.py][INFO] Epoch:[0/2](106500/4588595) loss:2.959 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:50:38,961][model8_pretrain.py][INFO] Epoch:[0/2](106500/4588595) loss:2.775 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:50:38,961][model8_pretrain.py][INFO] Epoch:[0/2](106500/4588595) loss:2.664 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:50:38,961][model8_pretrain.py][INFO] Epoch:[0/2](106500/4588595) loss:3.108 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:51:17,553][model8_pretrain.py][INFO] Epoch:[0/2](106600/4588595) loss:3.175 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:51:17,553][model8_pretrain.py][INFO] Epoch:[0/2](106600/4588595) loss:2.772 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:51:17,553][model8_pretrain.py][INFO] Epoch:[0/2](106600/4588595) loss:2.705 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:51:17,553][model8_pretrain.py][INFO] Epoch:[0/2](106600/4588595) loss:3.261 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:51:17,553][model8_pretrain.py][INFO] Epoch:[0/2](106600/4588595) loss:3.325 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:51:17,553][model8_pretrain.py][INFO] Epoch:[0/2](106600/4588595) loss:2.387 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:51:17,553][model8_pretrain.py][INFO] Epoch:[0/2](106600/4588595) loss:2.859 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:51:17,553][model8_pretrain.py][INFO] Epoch:[0/2](106600/4588595) loss:2.890 lr:0.0000100 epoch_Time:28233.0min: [2024-01-03 04:51:54,470][model8_pretrain.py][INFO] Epoch:[0/2](106700/4588595) loss:3.471 lr:0.0000100 epoch_Time:28231.0min: [2024-01-03 04:51:54,470][model8_pretrain.py][INFO] Epoch:[0/2](106700/4588595) loss:3.290 lr:0.0000100 epoch_Time:28231.0min: [2024-01-03 04:51:54,470][model8_pretrain.py][INFO] 
Epoch:[0/2](106700/4588595) loss:3.442 lr:0.0000100 epoch_Time:28231.0min: [2024-01-03 04:51:54,470][model8_pretrain.py][INFO] Epoch:[0/2](106700/4588595) loss:3.088 lr:0.0000100 epoch_Time:28231.0min: [2024-01-03 04:51:54,470][model8_pretrain.py][INFO] Epoch:[0/2](106700/4588595) loss:3.454 lr:0.0000100 epoch_Time:28231.0min: [2024-01-03 04:51:54,470][model8_pretrain.py][INFO] Epoch:[0/2](106700/4588595) loss:3.148 lr:0.0000100 epoch_Time:28231.0min: [2024-01-03 04:51:54,470][model8_pretrain.py][INFO] Epoch:[0/2](106700/4588595) loss:2.744 lr:0.0000100 epoch_Time:28231.0min: [2024-01-03 04:51:54,470][model8_pretrain.py][INFO] Epoch:[0/2](106700/4588595) loss:2.742 lr:0.0000100 epoch_Time:28231.0min: [2024-01-03 04:52:31,425][model8_pretrain.py][INFO] Epoch:[0/2](106800/4588595) loss:2.722 lr:0.0000100 epoch_Time:28231.0min: [2024-01-03 04:52:31,425][model8_pretrain.py][INFO] Epoch:[0/2](106800/4588595) loss:3.273 lr:0.0000100 epoch_Time:28231.0min: [2024-01-03 04:52:31,425][model8_pretrain.py][INFO] Epoch:[0/2](106800/4588595) loss:3.113 lr:0.0000100 epoch_Time:28231.0min: [2024-01-03 04:52:31,425][model8_pretrain.py][INFO] Epoch:[0/2](106800/4588595) loss:2.977 lr:0.0000100 epoch_Time:28231.0min: [2024-01-03 04:52:31,425][model8_pretrain.py][INFO] Epoch:[0/2](106800/4588595) loss:3.434 lr:0.0000100 epoch_Time:28231.0min: [2024-01-03 04:52:31,425][model8_pretrain.py][INFO] Epoch:[0/2](106800/4588595) loss:2.876 lr:0.0000100 epoch_Time:28231.0min: [2024-01-03 04:52:31,425][model8_pretrain.py][INFO] Epoch:[0/2](106800/4588595) loss:2.715 lr:0.0000100 epoch_Time:28231.0min: [2024-01-03 04:52:31,425][model8_pretrain.py][INFO] Epoch:[0/2](106800/4588595) loss:2.890 lr:0.0000100 epoch_Time:28231.0min: [2024-01-03 04:53:08,374][model8_pretrain.py][INFO] Epoch:[0/2](106900/4588595) loss:3.152 lr:0.0000100 epoch_Time:28229.0min: [2024-01-03 04:53:08,374][model8_pretrain.py][INFO] Epoch:[0/2](106900/4588595) loss:2.723 lr:0.0000100 epoch_Time:28229.0min: [2024-01-03 
04:53:08,374][model8_pretrain.py][INFO] Epoch:[0/2](106900/4588595) loss:3.324 lr:0.0000100 epoch_Time:28229.0min: [2024-01-03 04:53:08,374][model8_pretrain.py][INFO] Epoch:[0/2](106900/4588595) loss:2.630 lr:0.0000100 epoch_Time:28229.0min: [2024-01-03 04:53:08,374][model8_pretrain.py][INFO] Epoch:[0/2](106900/4588595) loss:3.314 lr:0.0000100 epoch_Time:28229.0min: [2024-01-03 04:53:08,374][model8_pretrain.py][INFO] Epoch:[0/2](106900/4588595) loss:2.968 lr:0.0000100 epoch_Time:28229.0min: [2024-01-03 04:53:08,375][model8_pretrain.py][INFO] Epoch:[0/2](106900/4588595) loss:2.987 lr:0.0000100 epoch_Time:28229.0min: [2024-01-03 04:53:08,375][model8_pretrain.py][INFO] Epoch:[0/2](106900/4588595) loss:3.221 lr:0.0000100 epoch_Time:28229.0min: [2024-01-03 04:53:45,321][model8_pretrain.py][INFO] Epoch:[0/2](107000/4588595) loss:2.559 lr:0.0000100 epoch_Time:28228.0min: [2024-01-03 04:53:45,321][model8_pretrain.py][INFO] Epoch:[0/2](107000/4588595) loss:2.707 lr:0.0000100 epoch_Time:28228.0min: [2024-01-03 04:53:45,321][model8_pretrain.py][INFO] Epoch:[0/2](107000/4588595) loss:3.100 lr:0.0000100 epoch_Time:28228.0min: [2024-01-03 04:53:45,321][model8_pretrain.py][INFO] Epoch:[0/2](107000/4588595) loss:2.751 lr:0.0000100 epoch_Time:28228.0min: [2024-01-03 04:53:45,321][model8_pretrain.py][INFO] Epoch:[0/2](107000/4588595) loss:3.394 lr:0.0000100 epoch_Time:28228.0min: [2024-01-03 04:53:45,321][model8_pretrain.py][INFO] Epoch:[0/2](107000/4588595) loss:2.654 lr:0.0000100 epoch_Time:28228.0min: [2024-01-03 04:53:45,321][model8_pretrain.py][INFO] Epoch:[0/2](107000/4588595) loss:3.668 lr:0.0000100 epoch_Time:28228.0min: [2024-01-03 04:53:45,321][model8_pretrain.py][INFO] Epoch:[0/2](107000/4588595) loss:3.437 lr:0.0000100 epoch_Time:28228.0min: [2024-01-03 04:54:22,269][model8_pretrain.py][INFO] Epoch:[0/2](107100/4588595) loss:3.244 lr:0.0000100 epoch_Time:28227.0min: [2024-01-03 04:54:22,269][model8_pretrain.py][INFO] Epoch:[0/2](107100/4588595) loss:2.989 lr:0.0000100 
epoch_Time:28227.0min: [2024-01-03 04:54:22,269][model8_pretrain.py][INFO] Epoch:[0/2](107100/4588595) loss:3.331 lr:0.0000100 epoch_Time:28227.0min: [2024-01-03 04:54:22,269][model8_pretrain.py][INFO] Epoch:[0/2](107100/4588595) loss:2.664 lr:0.0000100 epoch_Time:28227.0min: [2024-01-03 04:54:22,269][model8_pretrain.py][INFO] Epoch:[0/2](107100/4588595) loss:2.921 lr:0.0000100 epoch_Time:28227.0min: [2024-01-03 04:54:22,269][model8_pretrain.py][INFO] Epoch:[0/2](107100/4588595) loss:3.196 lr:0.0000100 epoch_Time:28227.0min: [2024-01-03 04:54:22,269][model8_pretrain.py][INFO] Epoch:[0/2](107100/4588595) loss:3.366 lr:0.0000100 epoch_Time:28227.0min: [2024-01-03 04:54:22,269][model8_pretrain.py][INFO] Epoch:[0/2](107100/4588595) loss:3.064 lr:0.0000100 epoch_Time:28227.0min: [2024-01-03 04:54:59,216][model8_pretrain.py][INFO] Epoch:[0/2](107200/4588595) loss:3.130 lr:0.0000100 epoch_Time:28225.0min: [2024-01-03 04:54:59,216][model8_pretrain.py][INFO] Epoch:[0/2](107200/4588595) loss:2.970 lr:0.0000100 epoch_Time:28225.0min: [2024-01-03 04:54:59,216][model8_pretrain.py][INFO] Epoch:[0/2](107200/4588595) loss:2.918 lr:0.0000100 epoch_Time:28225.0min: [2024-01-03 04:54:59,216][model8_pretrain.py][INFO] Epoch:[0/2](107200/4588595) loss:3.509 lr:0.0000100 epoch_Time:28225.0min: [2024-01-03 04:54:59,216][model8_pretrain.py][INFO] Epoch:[0/2](107200/4588595) loss:2.619 lr:0.0000100 epoch_Time:28225.0min: [2024-01-03 04:54:59,217][model8_pretrain.py][INFO] Epoch:[0/2](107200/4588595) loss:2.657 lr:0.0000100 epoch_Time:28225.0min: [2024-01-03 04:54:59,216][model8_pretrain.py][INFO] Epoch:[0/2](107200/4588595) loss:2.895 lr:0.0000100 epoch_Time:28225.0min: [2024-01-03 04:54:59,217][model8_pretrain.py][INFO] Epoch:[0/2](107200/4588595) loss:2.392 lr:0.0000100 epoch_Time:28225.0min: [2024-01-03 04:55:39,337][model8_pretrain.py][INFO] Epoch:[0/2](107300/4588595) loss:3.336 lr:0.0000100 epoch_Time:28227.0min: [2024-01-03 04:55:39,337][model8_pretrain.py][INFO] 
Epoch:[0/2](107300/4588595) loss:2.636 lr:0.0000100 epoch_Time:28227.0min: [2024-01-03 04:55:39,337][model8_pretrain.py][INFO] Epoch:[0/2](107300/4588595) loss:2.939 lr:0.0000100 epoch_Time:28227.0min: [2024-01-03 04:55:39,337][model8_pretrain.py][INFO] Epoch:[0/2](107300/4588595) loss:2.540 lr:0.0000100 epoch_Time:28227.0min: [2024-01-03 04:55:39,337][model8_pretrain.py][INFO] Epoch:[0/2](107300/4588595) loss:3.107 lr:0.0000100 epoch_Time:28227.0min: [2024-01-03 04:55:39,337][model8_pretrain.py][INFO] Epoch:[0/2](107300/4588595) loss:2.722 lr:0.0000100 epoch_Time:28227.0min: [2024-01-03 04:55:39,337][model8_pretrain.py][INFO] Epoch:[0/2](107300/4588595) loss:3.172 lr:0.0000100 epoch_Time:28227.0min: [2024-01-03 04:55:39,337][model8_pretrain.py][INFO] Epoch:[0/2](107300/4588595) loss:3.246 lr:0.0000100 epoch_Time:28227.0min: [2024-01-03 04:56:21,630][model8_pretrain.py][INFO] Epoch:[0/2](107400/4588595) loss:3.269 lr:0.0000100 epoch_Time:28229.0min: [2024-01-03 04:56:21,630][model8_pretrain.py][INFO] Epoch:[0/2](107400/4588595) loss:2.649 lr:0.0000100 epoch_Time:28229.0min: [2024-01-03 04:56:21,630][model8_pretrain.py][INFO] Epoch:[0/2](107400/4588595) loss:3.034 lr:0.0000100 epoch_Time:28229.0min: [2024-01-03 04:56:21,630][model8_pretrain.py][INFO] Epoch:[0/2](107400/4588595) loss:2.986 lr:0.0000100 epoch_Time:28229.0min: [2024-01-03 04:56:21,630][model8_pretrain.py][INFO] Epoch:[0/2](107400/4588595) loss:3.391 lr:0.0000100 epoch_Time:28229.0min: [2024-01-03 04:56:21,630][model8_pretrain.py][INFO] Epoch:[0/2](107400/4588595) loss:3.047 lr:0.0000100 epoch_Time:28229.0min: [2024-01-03 04:56:21,630][model8_pretrain.py][INFO] Epoch:[0/2](107400/4588595) loss:3.341 lr:0.0000100 epoch_Time:28229.0min: [2024-01-03 04:56:21,630][model8_pretrain.py][INFO] Epoch:[0/2](107400/4588595) loss:3.497 lr:0.0000100 epoch_Time:28229.0min: [2024-01-03 04:56:58,573][model8_pretrain.py][INFO] Epoch:[0/2](107500/4588595) loss:2.888 lr:0.0000100 epoch_Time:28228.0min: [2024-01-03 
04:56:58,573][model8_pretrain.py][INFO] Epoch:[0/2](107500/4588595) loss:2.772 lr:0.0000100 epoch_Time:28228.0min: [2024-01-03 04:56:58,573][model8_pretrain.py][INFO] Epoch:[0/2](107500/4588595) loss:2.920 lr:0.0000100 epoch_Time:28228.0min: [2024-01-03 04:56:58,573][model8_pretrain.py][INFO] Epoch:[0/2](107500/4588595) loss:3.107 lr:0.0000100 epoch_Time:28228.0min: [2024-01-03 04:56:58,573][model8_pretrain.py][INFO] Epoch:[0/2](107500/4588595) loss:3.106 lr:0.0000100 epoch_Time:28228.0min: [2024-01-03 04:56:58,573][model8_pretrain.py][INFO] Epoch:[0/2](107500/4588595) loss:2.857 lr:0.0000100 epoch_Time:28228.0min: [2024-01-03 04:56:58,573][model8_pretrain.py][INFO] Epoch:[0/2](107500/4588595) loss:3.318 lr:0.0000100 epoch_Time:28228.0min: [2024-01-03 04:56:58,573][model8_pretrain.py][INFO] Epoch:[0/2](107500/4588595) loss:3.148 lr:0.0000100 epoch_Time:28228.0min: [2024-01-03 04:57:35,505][model8_pretrain.py][INFO] Epoch:[0/2](107600/4588595) loss:3.341 lr:0.0000100 epoch_Time:28227.0min: [2024-01-03 04:57:35,505][model8_pretrain.py][INFO] Epoch:[0/2](107600/4588595) loss:3.127 lr:0.0000100 epoch_Time:28227.0min: [2024-01-03 04:57:35,505][model8_pretrain.py][INFO] Epoch:[0/2](107600/4588595) loss:2.727 lr:0.0000100 epoch_Time:28227.0min: [2024-01-03 04:57:35,505][model8_pretrain.py][INFO] Epoch:[0/2](107600/4588595) loss:2.894 lr:0.0000100 epoch_Time:28227.0min: [2024-01-03 04:57:35,505][model8_pretrain.py][INFO] Epoch:[0/2](107600/4588595) loss:2.830 lr:0.0000100 epoch_Time:28227.0min: [2024-01-03 04:57:35,505][model8_pretrain.py][INFO] Epoch:[0/2](107600/4588595) loss:2.655 lr:0.0000100 epoch_Time:28227.0min: [2024-01-03 04:57:35,505][model8_pretrain.py][INFO] Epoch:[0/2](107600/4588595) loss:2.761 lr:0.0000100 epoch_Time:28227.0min: [2024-01-03 04:57:35,505][model8_pretrain.py][INFO] Epoch:[0/2](107600/4588595) loss:2.884 lr:0.0000100 epoch_Time:28227.0min: [2024-01-03 04:58:12,452][model8_pretrain.py][INFO] Epoch:[0/2](107700/4588595) loss:2.743 lr:0.0000100 
epoch_Time:28225.0min: [2024-01-03 04:58:12,452][model8_pretrain.py][INFO] Epoch:[0/2](107700/4588595) loss:2.396 lr:0.0000100 epoch_Time:28225.0min: [2024-01-03 04:58:12,452][model8_pretrain.py][INFO] Epoch:[0/2](107700/4588595) loss:3.593 lr:0.0000100 epoch_Time:28225.0min: [2024-01-03 04:58:12,452][model8_pretrain.py][INFO] Epoch:[0/2](107700/4588595) loss:2.888 lr:0.0000100 epoch_Time:28225.0min: [2024-01-03 04:58:12,452][model8_pretrain.py][INFO] Epoch:[0/2](107700/4588595) loss:3.001 lr:0.0000100 epoch_Time:28225.0min: [2024-01-03 04:58:12,452][model8_pretrain.py][INFO] Epoch:[0/2](107700/4588595) loss:2.898 lr:0.0000100 epoch_Time:28225.0min: [2024-01-03 04:58:12,452][model8_pretrain.py][INFO] Epoch:[0/2](107700/4588595) loss:2.641 lr:0.0000100 epoch_Time:28225.0min: [2024-01-03 04:58:12,453][model8_pretrain.py][INFO] Epoch:[0/2](107700/4588595) loss:2.784 lr:0.0000100 epoch_Time:28225.0min: [2024-01-03 04:58:49,400][model8_pretrain.py][INFO] Epoch:[0/2](107800/4588595) loss:3.095 lr:0.0000100 epoch_Time:28224.0min: [2024-01-03 04:58:49,400][model8_pretrain.py][INFO] Epoch:[0/2](107800/4588595) loss:3.173 lr:0.0000100 epoch_Time:28224.0min: [2024-01-03 04:58:49,400][model8_pretrain.py][INFO] Epoch:[0/2](107800/4588595) loss:3.223 lr:0.0000100 epoch_Time:28224.0min: [2024-01-03 04:58:49,400][model8_pretrain.py][INFO] Epoch:[0/2](107800/4588595) loss:2.386 lr:0.0000100 epoch_Time:28224.0min: [2024-01-03 04:58:49,400][model8_pretrain.py][INFO] Epoch:[0/2](107800/4588595) loss:2.923 lr:0.0000100 epoch_Time:28224.0min: [2024-01-03 04:58:49,400][model8_pretrain.py][INFO] Epoch:[0/2](107800/4588595) loss:2.822 lr:0.0000100 epoch_Time:28224.0min: [2024-01-03 04:58:49,400][model8_pretrain.py][INFO] Epoch:[0/2](107800/4588595) loss:2.898 lr:0.0000100 epoch_Time:28224.0min: [2024-01-03 04:58:49,400][model8_pretrain.py][INFO] Epoch:[0/2](107800/4588595) loss:3.163 lr:0.0000100 epoch_Time:28224.0min: [2024-01-03 04:59:26,348][model8_pretrain.py][INFO] 
Epoch:[0/2](107900/4588595) loss:3.113 lr:0.0000100 epoch_Time:28223.0min: [2024-01-03 04:59:26,348][model8_pretrain.py][INFO] Epoch:[0/2](107900/4588595) loss:3.352 lr:0.0000100 epoch_Time:28223.0min: [2024-01-03 04:59:26,348][model8_pretrain.py][INFO] Epoch:[0/2](107900/4588595) loss:2.980 lr:0.0000100 epoch_Time:28223.0min: [2024-01-03 04:59:26,348][model8_pretrain.py][INFO] Epoch:[0/2](107900/4588595) loss:2.975 lr:0.0000100 epoch_Time:28223.0min: [2024-01-03 04:59:26,348][model8_pretrain.py][INFO] Epoch:[0/2](107900/4588595) loss:2.902 lr:0.0000100 epoch_Time:28223.0min: [2024-01-03 04:59:26,348][model8_pretrain.py][INFO] Epoch:[0/2](107900/4588595) loss:3.362 lr:0.0000100 epoch_Time:28223.0min: [2024-01-03 04:59:26,348][model8_pretrain.py][INFO] Epoch:[0/2](107900/4588595) loss:2.484 lr:0.0000100 epoch_Time:28223.0min: [2024-01-03 04:59:26,348][model8_pretrain.py][INFO] Epoch:[0/2](107900/4588595) loss:3.156 lr:0.0000100 epoch_Time:28223.0min: [2024-01-03 05:00:03,314][model8_pretrain.py][INFO] Epoch:[0/2](108000/4588595) loss:2.892 lr:0.0000100 epoch_Time:28222.0min: [2024-01-03 05:00:03,314][model8_pretrain.py][INFO] Epoch:[0/2](108000/4588595) loss:2.967 lr:0.0000100 epoch_Time:28222.0min: [2024-01-03 05:00:03,314][model8_pretrain.py][INFO] Epoch:[0/2](108000/4588595) loss:2.939 lr:0.0000100 epoch_Time:28222.0min: [2024-01-03 05:00:03,314][model8_pretrain.py][INFO] Epoch:[0/2](108000/4588595) loss:2.864 lr:0.0000100 epoch_Time:28222.0min: [2024-01-03 05:00:03,314][model8_pretrain.py][INFO] Epoch:[0/2](108000/4588595) loss:2.467 lr:0.0000100 epoch_Time:28222.0min: [2024-01-03 05:00:03,314][model8_pretrain.py][INFO] Epoch:[0/2](108000/4588595) loss:2.445 lr:0.0000100 epoch_Time:28222.0min: [2024-01-03 05:00:03,314][model8_pretrain.py][INFO] Epoch:[0/2](108000/4588595) loss:2.977 lr:0.0000100 epoch_Time:28222.0min: [2024-01-03 05:00:03,314][model8_pretrain.py][INFO] Epoch:[0/2](108000/4588595) loss:2.497 lr:0.0000100 epoch_Time:28222.0min: [2024-01-03 
05:00:43,589][model8_pretrain.py][INFO] Epoch:[0/2](108100/4588595) loss:2.231 lr:0.0000100 epoch_Time:28223.0min: [2024-01-03 05:00:43,590][model8_pretrain.py][INFO] Epoch:[0/2](108100/4588595) loss:3.101 lr:0.0000100 epoch_Time:28223.0min: [2024-01-03 05:00:43,589][model8_pretrain.py][INFO] Epoch:[0/2](108100/4588595) loss:2.994 lr:0.0000100 epoch_Time:28223.0min: [2024-01-03 05:00:43,590][model8_pretrain.py][INFO] Epoch:[0/2](108100/4588595) loss:2.652 lr:0.0000100 epoch_Time:28223.0min: [2024-01-03 05:00:43,589][model8_pretrain.py][INFO] Epoch:[0/2](108100/4588595) loss:3.571 lr:0.0000100 epoch_Time:28223.0min: [2024-01-03 05:00:43,590][model8_pretrain.py][INFO] Epoch:[0/2](108100/4588595) loss:3.463 lr:0.0000100 epoch_Time:28223.0min: [2024-01-03 05:00:43,590][model8_pretrain.py][INFO] Epoch:[0/2](108100/4588595) loss:2.544 lr:0.0000100 epoch_Time:28223.0min: [2024-01-03 05:00:43,590][model8_pretrain.py][INFO] Epoch:[0/2](108100/4588595) loss:3.248 lr:0.0000100 epoch_Time:28223.0min: [2024-01-03 05:01:25,813][model8_pretrain.py][INFO] Epoch:[0/2](108200/4588595) loss:3.148 lr:0.0000100 epoch_Time:28225.0min: [2024-01-03 05:01:25,813][model8_pretrain.py][INFO] Epoch:[0/2](108200/4588595) loss:2.711 lr:0.0000100 epoch_Time:28225.0min: [2024-01-03 05:01:25,813][model8_pretrain.py][INFO] Epoch:[0/2](108200/4588595) loss:3.189 lr:0.0000100 epoch_Time:28225.0min: [2024-01-03 05:01:25,813][model8_pretrain.py][INFO] Epoch:[0/2](108200/4588595) loss:3.194 lr:0.0000100 epoch_Time:28225.0min: [2024-01-03 05:01:25,813][model8_pretrain.py][INFO] Epoch:[0/2](108200/4588595) loss:2.901 lr:0.0000100 epoch_Time:28225.0min: [2024-01-03 05:01:25,813][model8_pretrain.py][INFO] Epoch:[0/2](108200/4588595) loss:2.687 lr:0.0000100 epoch_Time:28225.0min: [2024-01-03 05:01:25,813][model8_pretrain.py][INFO] Epoch:[0/2](108200/4588595) loss:2.805 lr:0.0000100 epoch_Time:28225.0min: [2024-01-03 05:01:25,813][model8_pretrain.py][INFO] Epoch:[0/2](108200/4588595) loss:3.323 lr:0.0000100 
epoch_Time:28225.0min: [2024-01-03 05:02:02,747][model8_pretrain.py][INFO] Epoch:[0/2](108300/4588595) loss:3.004 lr:0.0000100 epoch_Time:28224.0min: [2024-01-03 05:02:02,747][model8_pretrain.py][INFO] Epoch:[0/2](108300/4588595) loss:2.681 lr:0.0000100 epoch_Time:28224.0min: [2024-01-03 05:02:02,747][model8_pretrain.py][INFO] Epoch:[0/2](108300/4588595) loss:3.286 lr:0.0000100 epoch_Time:28224.0min: [2024-01-03 05:02:02,747][model8_pretrain.py][INFO] Epoch:[0/2](108300/4588595) loss:3.526 lr:0.0000100 epoch_Time:28224.0min: [2024-01-03 05:02:02,747][model8_pretrain.py][INFO] Epoch:[0/2](108300/4588595) loss:3.190 lr:0.0000100 epoch_Time:28224.0min: [2024-01-03 05:02:02,747][model8_pretrain.py][INFO] Epoch:[0/2](108300/4588595) loss:2.989 lr:0.0000100 epoch_Time:28224.0min: [2024-01-03 05:02:02,747][model8_pretrain.py][INFO] Epoch:[0/2](108300/4588595) loss:3.272 lr:0.0000100 epoch_Time:28224.0min: [2024-01-03 05:02:02,747][model8_pretrain.py][INFO] Epoch:[0/2](108300/4588595) loss:3.004 lr:0.0000100 epoch_Time:28224.0min: [2024-01-03 05:02:39,685][model8_pretrain.py][INFO] Epoch:[0/2](108400/4588595) loss:2.914 lr:0.0000100 epoch_Time:28223.0min: [2024-01-03 05:02:39,685][model8_pretrain.py][INFO] Epoch:[0/2](108400/4588595) loss:3.418 lr:0.0000100 epoch_Time:28223.0min: [2024-01-03 05:02:39,685][model8_pretrain.py][INFO] Epoch:[0/2](108400/4588595) loss:3.135 lr:0.0000100 epoch_Time:28223.0min: [2024-01-03 05:02:39,685][model8_pretrain.py][INFO] Epoch:[0/2](108400/4588595) loss:3.485 lr:0.0000100 epoch_Time:28223.0min: [2024-01-03 05:02:39,685][model8_pretrain.py][INFO] Epoch:[0/2](108400/4588595) loss:3.021 lr:0.0000100 epoch_Time:28223.0min: [2024-01-03 05:02:39,685][model8_pretrain.py][INFO] Epoch:[0/2](108400/4588595) loss:3.218 lr:0.0000100 epoch_Time:28223.0min: [2024-01-03 05:02:39,686][model8_pretrain.py][INFO] Epoch:[0/2](108400/4588595) loss:2.980 lr:0.0000100 epoch_Time:28223.0min: [2024-01-03 05:02:39,685][model8_pretrain.py][INFO] 
Epoch:[0/2](108400/4588595) loss:2.816 lr:0.0000100 epoch_Time:28223.0min: [2024-01-03 05:03:16,619][model8_pretrain.py][INFO] Epoch:[0/2](108500/4588595) loss:3.065 lr:0.0000100 epoch_Time:28222.0min: [2024-01-03 05:03:16,619][model8_pretrain.py][INFO] Epoch:[0/2](108500/4588595) loss:3.259 lr:0.0000100 epoch_Time:28222.0min: [2024-01-03 05:03:16,619][model8_pretrain.py][INFO] Epoch:[0/2](108500/4588595) loss:3.374 lr:0.0000100 epoch_Time:28222.0min: [2024-01-03 05:03:16,619][model8_pretrain.py][INFO] Epoch:[0/2](108500/4588595) loss:3.088 lr:0.0000100 epoch_Time:28222.0min: [2024-01-03 05:03:16,619][model8_pretrain.py][INFO] Epoch:[0/2](108500/4588595) loss:2.872 lr:0.0000100 epoch_Time:28222.0min: [2024-01-03 05:03:16,620][model8_pretrain.py][INFO] Epoch:[0/2](108500/4588595) loss:3.368 lr:0.0000100 epoch_Time:28222.0min: [2024-01-03 05:03:16,620][model8_pretrain.py][INFO] Epoch:[0/2](108500/4588595) loss:3.089 lr:0.0000100 epoch_Time:28222.0min: [2024-01-03 05:03:16,620][model8_pretrain.py][INFO] Epoch:[0/2](108500/4588595) loss:3.324 lr:0.0000100 epoch_Time:28222.0min: [2024-01-03 05:03:53,564][model8_pretrain.py][INFO] Epoch:[0/2](108600/4588595) loss:3.380 lr:0.0000100 epoch_Time:28220.0min: [2024-01-03 05:03:53,564][model8_pretrain.py][INFO] Epoch:[0/2](108600/4588595) loss:2.670 lr:0.0000100 epoch_Time:28220.0min: [2024-01-03 05:03:53,564][model8_pretrain.py][INFO] Epoch:[0/2](108600/4588595) loss:3.387 lr:0.0000100 epoch_Time:28220.0min: [2024-01-03 05:03:53,564][model8_pretrain.py][INFO] Epoch:[0/2](108600/4588595) loss:2.957 lr:0.0000100 epoch_Time:28220.0min: [2024-01-03 05:03:53,564][model8_pretrain.py][INFO] Epoch:[0/2](108600/4588595) loss:3.344 lr:0.0000100 epoch_Time:28220.0min: [2024-01-03 05:03:53,564][model8_pretrain.py][INFO] Epoch:[0/2](108600/4588595) loss:3.044 lr:0.0000100 epoch_Time:28220.0min: [2024-01-03 05:03:53,565][model8_pretrain.py][INFO] Epoch:[0/2](108600/4588595) loss:3.292 lr:0.0000100 epoch_Time:28220.0min: [2024-01-03 
05:03:53,565][model8_pretrain.py][INFO] Epoch:[0/2](108600/4588595) loss:3.120 lr:0.0000100 epoch_Time:28220.0min: [2024-01-03 05:04:30,509][model8_pretrain.py][INFO] Epoch:[0/2](108700/4588595) loss:2.836 lr:0.0000100 epoch_Time:28219.0min: [2024-01-03 05:04:30,509][model8_pretrain.py][INFO] Epoch:[0/2](108700/4588595) loss:3.533 lr:0.0000100 epoch_Time:28219.0min: [2024-01-03 05:04:30,509][model8_pretrain.py][INFO] Epoch:[0/2](108700/4588595) loss:3.313 lr:0.0000100 epoch_Time:28219.0min: [2024-01-03 05:04:30,509][model8_pretrain.py][INFO] Epoch:[0/2](108700/4588595) loss:2.616 lr:0.0000100 epoch_Time:28219.0min: [2024-01-03 05:04:30,509][model8_pretrain.py][INFO] Epoch:[0/2](108700/4588595) loss:2.886 lr:0.0000100 epoch_Time:28219.0min: [2024-01-03 05:04:30,509][model8_pretrain.py][INFO] Epoch:[0/2](108700/4588595) loss:2.889 lr:0.0000100 epoch_Time:28219.0min: [2024-01-03 05:04:30,509][model8_pretrain.py][INFO] Epoch:[0/2](108700/4588595) loss:3.483 lr:0.0000100 epoch_Time:28219.0min: [2024-01-03 05:04:30,509][model8_pretrain.py][INFO] Epoch:[0/2](108700/4588595) loss:3.271 lr:0.0000100 epoch_Time:28219.0min: [2024-01-03 05:05:07,454][model8_pretrain.py][INFO] Epoch:[0/2](108800/4588595) loss:2.769 lr:0.0000100 epoch_Time:28218.0min: [2024-01-03 05:05:07,454][model8_pretrain.py][INFO] Epoch:[0/2](108800/4588595) loss:3.213 lr:0.0000100 epoch_Time:28218.0min: [2024-01-03 05:05:07,454][model8_pretrain.py][INFO] Epoch:[0/2](108800/4588595) loss:3.624 lr:0.0000100 epoch_Time:28218.0min: [2024-01-03 05:05:07,454][model8_pretrain.py][INFO] Epoch:[0/2](108800/4588595) loss:2.834 lr:0.0000100 epoch_Time:28218.0min: [2024-01-03 05:05:07,454][model8_pretrain.py][INFO] Epoch:[0/2](108800/4588595) loss:2.474 lr:0.0000100 epoch_Time:28218.0min: [2024-01-03 05:05:07,454][model8_pretrain.py][INFO] Epoch:[0/2](108800/4588595) loss:3.483 lr:0.0000100 epoch_Time:28218.0min: [2024-01-03 05:05:07,454][model8_pretrain.py][INFO] Epoch:[0/2](108800/4588595) loss:3.469 lr:0.0000100 
epoch_Time:28218.0min: [2024-01-03 05:05:07,454][model8_pretrain.py][INFO] Epoch:[0/2](108800/4588595) loss:3.151 lr:0.0000100 epoch_Time:28218.0min: [2024-01-03 05:05:47,868][model8_pretrain.py][INFO] Epoch:[0/2](108900/4588595) loss:3.151 lr:0.0000100 epoch_Time:28219.0min: [2024-01-03 05:05:47,868][model8_pretrain.py][INFO] Epoch:[0/2](108900/4588595) loss:3.199 lr:0.0000100 epoch_Time:28219.0min: [2024-01-03 05:05:47,868][model8_pretrain.py][INFO] Epoch:[0/2](108900/4588595) loss:2.960 lr:0.0000100 epoch_Time:28219.0min: [2024-01-03 05:05:47,868][model8_pretrain.py][INFO] Epoch:[0/2](108900/4588595) loss:3.493 lr:0.0000100 epoch_Time:28219.0min: [2024-01-03 05:05:47,868][model8_pretrain.py][INFO] Epoch:[0/2](108900/4588595) loss:3.272 lr:0.0000100 epoch_Time:28219.0min: [2024-01-03 05:05:47,868][model8_pretrain.py][INFO] Epoch:[0/2](108900/4588595) loss:3.176 lr:0.0000100 epoch_Time:28219.0min: [2024-01-03 05:05:47,868][model8_pretrain.py][INFO] Epoch:[0/2](108900/4588595) loss:3.022 lr:0.0000100 epoch_Time:28219.0min: [2024-01-03 05:05:47,873][model8_pretrain.py][INFO] Epoch:[0/2](108900/4588595) loss:3.495 lr:0.0000100 epoch_Time:28219.0min: [2024-01-03 05:06:30,044][model8_pretrain.py][INFO] Epoch:[0/2](109000/4588595) loss:3.276 lr:0.0000100 epoch_Time:28222.0min: [2024-01-03 05:06:30,044][model8_pretrain.py][INFO] Epoch:[0/2](109000/4588595) loss:2.679 lr:0.0000100 epoch_Time:28222.0min: [2024-01-03 05:06:30,044][model8_pretrain.py][INFO] Epoch:[0/2](109000/4588595) loss:3.081 lr:0.0000100 epoch_Time:28222.0min: [2024-01-03 05:06:30,044][model8_pretrain.py][INFO] Epoch:[0/2](109000/4588595) loss:3.272 lr:0.0000100 epoch_Time:28222.0min: [2024-01-03 05:06:30,044][model8_pretrain.py][INFO] Epoch:[0/2](109000/4588595) loss:3.168 lr:0.0000100 epoch_Time:28222.0min: [2024-01-03 05:06:30,044][model8_pretrain.py][INFO] Epoch:[0/2](109000/4588595) loss:3.185 lr:0.0000100 epoch_Time:28222.0min: [2024-01-03 05:06:30,044][model8_pretrain.py][INFO] 
Epoch:[0/2](109000/4588595) loss:2.840 lr:0.0000100 epoch_Time:28222.0min: [2024-01-03 05:06:30,044][model8_pretrain.py][INFO] Epoch:[0/2](109000/4588595) loss:3.017 lr:0.0000100 epoch_Time:28222.0min: [2024-01-03 05:07:06,977][model8_pretrain.py][INFO] Epoch:[0/2](109100/4588595) loss:3.054 lr:0.0000100 epoch_Time:28220.0min: [2024-01-03 05:07:06,977][model8_pretrain.py][INFO] Epoch:[0/2](109100/4588595) loss:3.094 lr:0.0000100 epoch_Time:28220.0min: [2024-01-03 05:07:06,977][model8_pretrain.py][INFO] Epoch:[0/2](109100/4588595) loss:2.927 lr:0.0000100 epoch_Time:28220.0min: [2024-01-03 05:07:06,977][model8_pretrain.py][INFO] Epoch:[0/2](109100/4588595) loss:2.411 lr:0.0000100 epoch_Time:28220.0min: [2024-01-03 05:07:06,977][model8_pretrain.py][INFO] Epoch:[0/2](109100/4588595) loss:2.898 lr:0.0000100 epoch_Time:28220.0min: [2024-01-03 05:07:06,977][model8_pretrain.py][INFO] Epoch:[0/2](109100/4588595) loss:2.354 lr:0.0000100 epoch_Time:28220.0min: [2024-01-03 05:07:06,977][model8_pretrain.py][INFO] Epoch:[0/2](109100/4588595) loss:2.982 lr:0.0000100 epoch_Time:28220.0min: [2024-01-03 05:07:06,977][model8_pretrain.py][INFO] Epoch:[0/2](109100/4588595) loss:3.279 lr:0.0000100 epoch_Time:28220.0min: [2024-01-03 05:07:43,934][model8_pretrain.py][INFO] Epoch:[0/2](109200/4588595) loss:2.648 lr:0.0000100 epoch_Time:28220.0min: [2024-01-03 05:07:43,934][model8_pretrain.py][INFO] Epoch:[0/2](109200/4588595) loss:3.294 lr:0.0000100 epoch_Time:28219.0min: [2024-01-03 05:07:43,934][model8_pretrain.py][INFO] Epoch:[0/2](109200/4588595) loss:3.035 lr:0.0000100 epoch_Time:28219.0min: [2024-01-03 05:07:43,934][model8_pretrain.py][INFO] Epoch:[0/2](109200/4588595) loss:2.843 lr:0.0000100 epoch_Time:28219.0min: [2024-01-03 05:07:43,934][model8_pretrain.py][INFO] Epoch:[0/2](109200/4588595) loss:3.135 lr:0.0000100 epoch_Time:28220.0min: [2024-01-03 05:07:43,934][model8_pretrain.py][INFO] Epoch:[0/2](109200/4588595) loss:3.010 lr:0.0000100 epoch_Time:28219.0min: [2024-01-03 
05:07:43,934][model8_pretrain.py][INFO] Epoch:[0/2](109200/4588595) loss:2.963 lr:0.0000100 epoch_Time:28219.0min: [2024-01-03 05:07:43,934][model8_pretrain.py][INFO] Epoch:[0/2](109200/4588595) loss:3.074 lr:0.0000100 epoch_Time:28219.0min: [2024-01-03 05:08:20,897][model8_pretrain.py][INFO] Epoch:[0/2](109300/4588595) loss:2.862 lr:0.0000100 epoch_Time:28218.0min: [2024-01-03 05:08:20,897][model8_pretrain.py][INFO] Epoch:[0/2](109300/4588595) loss:3.197 lr:0.0000100 epoch_Time:28218.0min: [2024-01-03 05:08:20,897][model8_pretrain.py][INFO] Epoch:[0/2](109300/4588595) loss:3.140 lr:0.0000100 epoch_Time:28218.0min: [2024-01-03 05:08:20,897][model8_pretrain.py][INFO] Epoch:[0/2](109300/4588595) loss:3.452 lr:0.0000100 epoch_Time:28218.0min: [2024-01-03 05:08:20,897][model8_pretrain.py][INFO] Epoch:[0/2](109300/4588595) loss:2.761 lr:0.0000100 epoch_Time:28218.0min: [2024-01-03 05:08:20,897][model8_pretrain.py][INFO] Epoch:[0/2](109300/4588595) loss:2.483 lr:0.0000100 epoch_Time:28218.0min: [2024-01-03 05:08:20,897][model8_pretrain.py][INFO] Epoch:[0/2](109300/4588595) loss:3.127 lr:0.0000100 epoch_Time:28218.0min: [2024-01-03 05:08:20,897][model8_pretrain.py][INFO] Epoch:[0/2](109300/4588595) loss:3.200 lr:0.0000100 epoch_Time:28218.0min: [2024-01-03 05:08:57,830][model8_pretrain.py][INFO] Epoch:[0/2](109400/4588595) loss:3.111 lr:0.0000100 epoch_Time:28216.0min: [2024-01-03 05:08:57,830][model8_pretrain.py][INFO] Epoch:[0/2](109400/4588595) loss:2.690 lr:0.0000100 epoch_Time:28216.0min: [2024-01-03 05:08:57,830][model8_pretrain.py][INFO] Epoch:[0/2](109400/4588595) loss:3.186 lr:0.0000100 epoch_Time:28216.0min: [2024-01-03 05:08:57,830][model8_pretrain.py][INFO] Epoch:[0/2](109400/4588595) loss:3.533 lr:0.0000100 epoch_Time:28216.0min: [2024-01-03 05:08:57,830][model8_pretrain.py][INFO] Epoch:[0/2](109400/4588595) loss:2.997 lr:0.0000100 epoch_Time:28216.0min: [2024-01-03 05:08:57,830][model8_pretrain.py][INFO] Epoch:[0/2](109400/4588595) loss:3.254 lr:0.0000100 
epoch_Time:28216.0min: [2024-01-03 05:08:57,830][model8_pretrain.py][INFO] Epoch:[0/2](109400/4588595) loss:3.544 lr:0.0000100 epoch_Time:28216.0min: [2024-01-03 05:08:57,830][model8_pretrain.py][INFO] Epoch:[0/2](109400/4588595) loss:2.576 lr:0.0000100 epoch_Time:28216.0min: [2024-01-03 05:09:34,774][model8_pretrain.py][INFO] Epoch:[0/2](109500/4588595) loss:3.175 lr:0.0000100 epoch_Time:28216.0min: [2024-01-03 05:09:34,774][model8_pretrain.py][INFO] Epoch:[0/2](109500/4588595) loss:3.001 lr:0.0000100 epoch_Time:28216.0min: [2024-01-03 05:09:34,774][model8_pretrain.py][INFO] Epoch:[0/2](109500/4588595) loss:3.020 lr:0.0000100 epoch_Time:28216.0min: [2024-01-03 05:09:34,774][model8_pretrain.py][INFO] Epoch:[0/2](109500/4588595) loss:3.464 lr:0.0000100 epoch_Time:28216.0min: [2024-01-03 05:09:34,774][model8_pretrain.py][INFO] Epoch:[0/2](109500/4588595) loss:3.068 lr:0.0000100 epoch_Time:28216.0min: [2024-01-03 05:09:34,774][model8_pretrain.py][INFO] Epoch:[0/2](109500/4588595) loss:2.955 lr:0.0000100 epoch_Time:28216.0min: [2024-01-03 05:09:34,774][model8_pretrain.py][INFO] Epoch:[0/2](109500/4588595) loss:3.121 lr:0.0000100 epoch_Time:28216.0min: [2024-01-03 05:09:34,774][model8_pretrain.py][INFO] Epoch:[0/2](109500/4588595) loss:2.941 lr:0.0000100 epoch_Time:28216.0min: [2024-01-03 05:10:11,726][model8_pretrain.py][INFO] Epoch:[0/2](109600/4588595) loss:3.558 lr:0.0000100 epoch_Time:28214.0min: [2024-01-03 05:10:11,726][model8_pretrain.py][INFO] Epoch:[0/2](109600/4588595) loss:3.366 lr:0.0000100 epoch_Time:28214.0min: [2024-01-03 05:10:11,726][model8_pretrain.py][INFO] Epoch:[0/2](109600/4588595) loss:2.993 lr:0.0000100 epoch_Time:28214.0min: [2024-01-03 05:10:11,726][model8_pretrain.py][INFO] Epoch:[0/2](109600/4588595) loss:2.942 lr:0.0000100 epoch_Time:28214.0min: [2024-01-03 05:10:11,726][model8_pretrain.py][INFO] Epoch:[0/2](109600/4588595) loss:3.088 lr:0.0000100 epoch_Time:28214.0min: [2024-01-03 05:10:11,726][model8_pretrain.py][INFO] 
Epoch:[0/2](109600/4588595) loss:2.762 lr:0.0000100 epoch_Time:28214.0min: [2024-01-03 05:10:11,726][model8_pretrain.py][INFO] Epoch:[0/2](109600/4588595) loss:2.876 lr:0.0000100 epoch_Time:28214.0min: [2024-01-03 05:10:11,726][model8_pretrain.py][INFO] Epoch:[0/2](109600/4588595) loss:3.286 lr:0.0000100 epoch_Time:28214.0min: [2024-01-03 05:10:48,665][model8_pretrain.py][INFO] Epoch:[0/2](109700/4588595) loss:3.160 lr:0.0000100 epoch_Time:28213.0min: [2024-01-03 05:10:48,666][model8_pretrain.py][INFO] Epoch:[0/2](109700/4588595) loss:2.881 lr:0.0000100 epoch_Time:28213.0min: [2024-01-03 05:10:48,665][model8_pretrain.py][INFO] Epoch:[0/2](109700/4588595) loss:2.962 lr:0.0000100 epoch_Time:28213.0min: [2024-01-03 05:10:48,666][model8_pretrain.py][INFO] Epoch:[0/2](109700/4588595) loss:2.810 lr:0.0000100 epoch_Time:28213.0min: [2024-01-03 05:10:48,666][model8_pretrain.py][INFO] Epoch:[0/2](109700/4588595) loss:2.552 lr:0.0000100 epoch_Time:28213.0min: [2024-01-03 05:10:48,666][model8_pretrain.py][INFO] Epoch:[0/2](109700/4588595) loss:3.027 lr:0.0000100 epoch_Time:28213.0min: [2024-01-03 05:10:48,666][model8_pretrain.py][INFO] Epoch:[0/2](109700/4588595) loss:2.926 lr:0.0000100 epoch_Time:28213.0min: [2024-01-03 05:10:48,666][model8_pretrain.py][INFO] Epoch:[0/2](109700/4588595) loss:2.833 lr:0.0000100 epoch_Time:28213.0min: [2024-01-03 05:11:34,305][model8_pretrain.py][INFO] Epoch:[0/2](109800/4588595) loss:3.004 lr:0.0000100 epoch_Time:28218.0min: [2024-01-03 05:11:34,305][model8_pretrain.py][INFO] Epoch:[0/2](109800/4588595) loss:3.120 lr:0.0000100 epoch_Time:28218.0min: [2024-01-03 05:11:34,305][model8_pretrain.py][INFO] Epoch:[0/2](109800/4588595) loss:2.422 lr:0.0000100 epoch_Time:28218.0min: [2024-01-03 05:11:34,305][model8_pretrain.py][INFO] Epoch:[0/2](109800/4588595) loss:3.161 lr:0.0000100 epoch_Time:28218.0min: [2024-01-03 05:11:34,305][model8_pretrain.py][INFO] Epoch:[0/2](109800/4588595) loss:2.730 lr:0.0000100 epoch_Time:28218.0min: [2024-01-03 
05:11:34,305][model8_pretrain.py][INFO] Epoch:[0/2](109800/4588595) loss:2.623 lr:0.0000100 epoch_Time:28218.0min: [2024-01-03 05:11:34,305][model8_pretrain.py][INFO] Epoch:[0/2](109800/4588595) loss:3.092 lr:0.0000100 epoch_Time:28218.0min: [2024-01-03 05:11:34,306][model8_pretrain.py][INFO] Epoch:[0/2](109800/4588595) loss:2.877 lr:0.0000100 epoch_Time:28218.0min: [2024-01-03 05:12:11,262][model8_pretrain.py][INFO] Epoch:[0/2](109900/4588595) loss:2.859 lr:0.0000100 epoch_Time:28216.0min: [2024-01-03 05:12:11,262][model8_pretrain.py][INFO] Epoch:[0/2](109900/4588595) loss:2.430 lr:0.0000100 epoch_Time:28216.0min: [2024-01-03 05:12:11,262][model8_pretrain.py][INFO] Epoch:[0/2](109900/4588595) loss:3.207 lr:0.0000100 epoch_Time:28216.0min: [2024-01-03 05:12:11,262][model8_pretrain.py][INFO] Epoch:[0/2](109900/4588595) loss:3.140 lr:0.0000100 epoch_Time:28216.0min: [2024-01-03 05:12:11,262][model8_pretrain.py][INFO] Epoch:[0/2](109900/4588595) loss:3.040 lr:0.0000100 epoch_Time:28216.0min: [2024-01-03 05:12:11,262][model8_pretrain.py][INFO] Epoch:[0/2](109900/4588595) loss:2.635 lr:0.0000100 epoch_Time:28216.0min: [2024-01-03 05:12:11,262][model8_pretrain.py][INFO] Epoch:[0/2](109900/4588595) loss:3.447 lr:0.0000100 epoch_Time:28216.0min: [2024-01-03 05:12:11,262][model8_pretrain.py][INFO] Epoch:[0/2](109900/4588595) loss:3.363 lr:0.0000100 epoch_Time:28216.0min: [2024-01-03 05:12:48,200][model8_pretrain.py][INFO] Epoch:[0/2](110000/4588595) loss:2.713 lr:0.0000100 epoch_Time:28215.0min: [2024-01-03 05:12:48,200][model8_pretrain.py][INFO] Epoch:[0/2](110000/4588595) loss:2.566 lr:0.0000100 epoch_Time:28215.0min: [2024-01-03 05:12:48,200][model8_pretrain.py][INFO] Epoch:[0/2](110000/4588595) loss:2.936 lr:0.0000100 epoch_Time:28215.0min: [2024-01-03 05:12:48,200][model8_pretrain.py][INFO] Epoch:[0/2](110000/4588595) loss:2.843 lr:0.0000100 epoch_Time:28215.0min: [2024-01-03 05:12:48,201][model8_pretrain.py][INFO] Epoch:[0/2](110000/4588595) loss:3.010 lr:0.0000100 
epoch_Time:28215.0min: [2024-01-03 05:12:48,200][model8_pretrain.py][INFO] Epoch:[0/2](110000/4588595) loss:2.988 lr:0.0000100 epoch_Time:28215.0min: [2024-01-03 05:12:48,201][model8_pretrain.py][INFO] Epoch:[0/2](110000/4588595) loss:2.677 lr:0.0000100 epoch_Time:28215.0min: [2024-01-03 05:12:48,201][model8_pretrain.py][INFO] Epoch:[0/2](110000/4588595) loss:3.070 lr:0.0000100 epoch_Time:28215.0min: [2024-01-03 05:13:25,139][model8_pretrain.py][INFO] Epoch:[0/2](110100/4588595) loss:2.753 lr:0.0000100 epoch_Time:28214.0min: [2024-01-03 05:13:25,139][model8_pretrain.py][INFO] Epoch:[0/2](110100/4588595) loss:3.060 lr:0.0000100 epoch_Time:28214.0min: [2024-01-03 05:13:25,139][model8_pretrain.py][INFO] Epoch:[0/2](110100/4588595) loss:3.117 lr:0.0000100 epoch_Time:28214.0min: [2024-01-03 05:13:25,139][model8_pretrain.py][INFO] Epoch:[0/2](110100/4588595) loss:3.368 lr:0.0000100 epoch_Time:28214.0min: [2024-01-03 05:13:25,139][model8_pretrain.py][INFO] Epoch:[0/2](110100/4588595) loss:2.410 lr:0.0000100 epoch_Time:28214.0min: [2024-01-03 05:13:25,139][model8_pretrain.py][INFO] Epoch:[0/2](110100/4588595) loss:2.636 lr:0.0000100 epoch_Time:28214.0min: [2024-01-03 05:13:25,139][model8_pretrain.py][INFO] Epoch:[0/2](110100/4588595) loss:3.321 lr:0.0000100 epoch_Time:28214.0min: [2024-01-03 05:13:25,139][model8_pretrain.py][INFO] Epoch:[0/2](110100/4588595) loss:2.552 lr:0.0000100 epoch_Time:28214.0min: [2024-01-03 05:14:02,087][model8_pretrain.py][INFO] Epoch:[0/2](110200/4588595) loss:2.581 lr:0.0000100 epoch_Time:28213.0min: [2024-01-03 05:14:02,087][model8_pretrain.py][INFO] Epoch:[0/2](110200/4588595) loss:3.337 lr:0.0000100 epoch_Time:28213.0min: [2024-01-03 05:14:02,087][model8_pretrain.py][INFO] Epoch:[0/2](110200/4588595) loss:3.060 lr:0.0000100 epoch_Time:28213.0min: [2024-01-03 05:14:02,087][model8_pretrain.py][INFO] Epoch:[0/2](110200/4588595) loss:2.890 lr:0.0000100 epoch_Time:28213.0min: [2024-01-03 05:14:02,087][model8_pretrain.py][INFO] 
Epoch:[0/2](110200/4588595) loss:3.120 lr:0.0000100 epoch_Time:28213.0min: [2024-01-03 05:14:02,087][model8_pretrain.py][INFO] Epoch:[0/2](110200/4588595) loss:3.053 lr:0.0000100 epoch_Time:28213.0min: [2024-01-03 05:14:02,087][model8_pretrain.py][INFO] Epoch:[0/2](110200/4588595) loss:3.195 lr:0.0000100 epoch_Time:28213.0min: [2024-01-03 05:14:02,087][model8_pretrain.py][INFO] Epoch:[0/2](110200/4588595) loss:3.335 lr:0.0000100 epoch_Time:28213.0min: [2024-01-03 05:14:39,029][model8_pretrain.py][INFO] Epoch:[0/2](110300/4588595) loss:3.197 lr:0.0000100 epoch_Time:28212.0min: [2024-01-03 05:14:39,029][model8_pretrain.py][INFO] Epoch:[0/2](110300/4588595) loss:2.858 lr:0.0000100 epoch_Time:28212.0min: [2024-01-03 05:14:39,029][model8_pretrain.py][INFO] Epoch:[0/2](110300/4588595) loss:3.307 lr:0.0000100 epoch_Time:28212.0min: [2024-01-03 05:14:39,029][model8_pretrain.py][INFO] Epoch:[0/2](110300/4588595) loss:3.045 lr:0.0000100 epoch_Time:28212.0min: [2024-01-03 05:14:39,029][model8_pretrain.py][INFO] Epoch:[0/2](110300/4588595) loss:2.405 lr:0.0000100 epoch_Time:28212.0min: [2024-01-03 05:14:39,029][model8_pretrain.py][INFO] Epoch:[0/2](110300/4588595) loss:2.475 lr:0.0000100 epoch_Time:28212.0min: [2024-01-03 05:14:39,029][model8_pretrain.py][INFO] Epoch:[0/2](110300/4588595) loss:3.028 lr:0.0000100 epoch_Time:28212.0min: [2024-01-03 05:14:39,029][model8_pretrain.py][INFO] Epoch:[0/2](110300/4588595) loss:2.469 lr:0.0000100 epoch_Time:28212.0min: [2024-01-03 05:15:15,974][model8_pretrain.py][INFO] Epoch:[0/2](110400/4588595) loss:3.211 lr:0.0000100 epoch_Time:28210.0min: [2024-01-03 05:15:15,974][model8_pretrain.py][INFO] Epoch:[0/2](110400/4588595) loss:2.844 lr:0.0000100 epoch_Time:28210.0min: [2024-01-03 05:15:15,974][model8_pretrain.py][INFO] Epoch:[0/2](110400/4588595) loss:3.300 lr:0.0000100 epoch_Time:28210.0min: [2024-01-03 05:15:15,974][model8_pretrain.py][INFO] Epoch:[0/2](110400/4588595) loss:2.968 lr:0.0000100 epoch_Time:28210.0min: [2024-01-03 
05:15:15,974][model8_pretrain.py][INFO] Epoch:[0/2](110400/4588595) loss:2.818 lr:0.0000100 epoch_Time:28210.0min: [2024-01-03 05:15:15,974][model8_pretrain.py][INFO] Epoch:[0/2](110400/4588595) loss:2.561 lr:0.0000100 epoch_Time:28210.0min: [2024-01-03 05:15:15,974][model8_pretrain.py][INFO] Epoch:[0/2](110400/4588595) loss:2.999 lr:0.0000100 epoch_Time:28210.0min: [2024-01-03 05:15:15,974][model8_pretrain.py][INFO] Epoch:[0/2](110400/4588595) loss:3.275 lr:0.0000100 epoch_Time:28210.0min: [2024-01-03 05:15:52,926][model8_pretrain.py][INFO] Epoch:[0/2](110500/4588595) loss:2.736 lr:0.0000100 epoch_Time:28209.0min: [2024-01-03 05:15:52,926][model8_pretrain.py][INFO] Epoch:[0/2](110500/4588595) loss:3.045 lr:0.0000100 epoch_Time:28209.0min: [2024-01-03 05:15:52,926][model8_pretrain.py][INFO] Epoch:[0/2](110500/4588595) loss:3.149 lr:0.0000100 epoch_Time:28209.0min: [2024-01-03 05:15:52,926][model8_pretrain.py][INFO] Epoch:[0/2](110500/4588595) loss:3.213 lr:0.0000100 epoch_Time:28209.0min: [2024-01-03 05:15:52,926][model8_pretrain.py][INFO] Epoch:[0/2](110500/4588595) loss:3.162 lr:0.0000100 epoch_Time:28209.0min: [2024-01-03 05:15:52,927][model8_pretrain.py][INFO] Epoch:[0/2](110500/4588595) loss:2.830 lr:0.0000100 epoch_Time:28209.0min: [2024-01-03 05:15:52,927][model8_pretrain.py][INFO] Epoch:[0/2](110500/4588595) loss:2.868 lr:0.0000100 epoch_Time:28209.0min: [2024-01-03 05:15:52,927][model8_pretrain.py][INFO] Epoch:[0/2](110500/4588595) loss:3.202 lr:0.0000100 epoch_Time:28209.0min: [2024-01-03 05:16:38,650][model8_pretrain.py][INFO] Epoch:[0/2](110600/4588595) loss:3.492 lr:0.0000100 epoch_Time:28214.0min: [2024-01-03 05:16:38,651][model8_pretrain.py][INFO] Epoch:[0/2](110600/4588595) loss:2.771 lr:0.0000100 epoch_Time:28214.0min: [2024-01-03 05:16:38,651][model8_pretrain.py][INFO] Epoch:[0/2](110600/4588595) loss:2.958 lr:0.0000100 epoch_Time:28214.0min: [2024-01-03 05:16:38,651][model8_pretrain.py][INFO] Epoch:[0/2](110600/4588595) loss:2.611 lr:0.0000100 
epoch_Time:28214.0min: [2024-01-03 05:16:38,651][model8_pretrain.py][INFO] Epoch:[0/2](110600/4588595) loss:2.795 lr:0.0000100 epoch_Time:28214.0min: [2024-01-03 05:16:38,651][model8_pretrain.py][INFO] Epoch:[0/2](110600/4588595) loss:2.354 lr:0.0000100 epoch_Time:28214.0min: [2024-01-03 05:16:38,651][model8_pretrain.py][INFO] Epoch:[0/2](110600/4588595) loss:3.664 lr:0.0000100 epoch_Time:28214.0min: [2024-01-03 05:16:38,651][model8_pretrain.py][INFO] Epoch:[0/2](110600/4588595) loss:2.499 lr:0.0000100 epoch_Time:28214.0min: [2024-01-03 05:17:15,624][model8_pretrain.py][INFO] Epoch:[0/2](110700/4588595) loss:2.669 lr:0.0000100 epoch_Time:28213.0min: [2024-01-03 05:17:15,624][model8_pretrain.py][INFO] Epoch:[0/2](110700/4588595) loss:2.783 lr:0.0000100 epoch_Time:28213.0min: [2024-01-03 05:17:15,624][model8_pretrain.py][INFO] Epoch:[0/2](110700/4588595) loss:2.998 lr:0.0000100 epoch_Time:28213.0min: [2024-01-03 05:17:15,624][model8_pretrain.py][INFO] Epoch:[0/2](110700/4588595) loss:3.316 lr:0.0000100 epoch_Time:28213.0min: [2024-01-03 05:17:15,624][model8_pretrain.py][INFO] Epoch:[0/2](110700/4588595) loss:2.487 lr:0.0000100 epoch_Time:28213.0min: [2024-01-03 05:17:15,624][model8_pretrain.py][INFO] Epoch:[0/2](110700/4588595) loss:2.783 lr:0.0000100 epoch_Time:28213.0min: [2024-01-03 05:17:15,624][model8_pretrain.py][INFO] Epoch:[0/2](110700/4588595) loss:2.789 lr:0.0000100 epoch_Time:28213.0min: [2024-01-03 05:17:15,624][model8_pretrain.py][INFO] Epoch:[0/2](110700/4588595) loss:2.826 lr:0.0000100 epoch_Time:28213.0min: [2024-01-03 05:17:52,575][model8_pretrain.py][INFO] Epoch:[0/2](110800/4588595) loss:3.219 lr:0.0000100 epoch_Time:28211.0min: [2024-01-03 05:17:52,575][model8_pretrain.py][INFO] Epoch:[0/2](110800/4588595) loss:3.511 lr:0.0000100 epoch_Time:28211.0min: [2024-01-03 05:17:52,575][model8_pretrain.py][INFO] Epoch:[0/2](110800/4588595) loss:3.171 lr:0.0000100 epoch_Time:28211.0min: [2024-01-03 05:17:52,575][model8_pretrain.py][INFO] 
Epoch:[0/2](110800/4588595) loss:3.232 lr:0.0000100 epoch_Time:28211.0min: [2024-01-03 05:17:52,575][model8_pretrain.py][INFO] Epoch:[0/2](110800/4588595) loss:3.099 lr:0.0000100 epoch_Time:28211.0min: [2024-01-03 05:17:52,575][model8_pretrain.py][INFO] Epoch:[0/2](110800/4588595) loss:2.932 lr:0.0000100 epoch_Time:28211.0min: [2024-01-03 05:17:52,575][model8_pretrain.py][INFO] Epoch:[0/2](110800/4588595) loss:2.728 lr:0.0000100 epoch_Time:28211.0min: [2024-01-03 05:17:52,575][model8_pretrain.py][INFO] Epoch:[0/2](110800/4588595) loss:3.129 lr:0.0000100 epoch_Time:28211.0min: [2024-01-03 05:18:29,505][model8_pretrain.py][INFO] Epoch:[0/2](110900/4588595) loss:2.843 lr:0.0000100 epoch_Time:28211.0min: [2024-01-03 05:18:29,505][model8_pretrain.py][INFO] Epoch:[0/2](110900/4588595) loss:3.111 lr:0.0000100 epoch_Time:28211.0min: [2024-01-03 05:18:29,505][model8_pretrain.py][INFO] Epoch:[0/2](110900/4588595) loss:2.938 lr:0.0000100 epoch_Time:28211.0min: [2024-01-03 05:18:29,505][model8_pretrain.py][INFO] Epoch:[0/2](110900/4588595) loss:3.566 lr:0.0000100 epoch_Time:28211.0min: [2024-01-03 05:18:29,505][model8_pretrain.py][INFO] Epoch:[0/2](110900/4588595) loss:2.941 lr:0.0000100 epoch_Time:28211.0min: [2024-01-03 05:18:29,505][model8_pretrain.py][INFO] Epoch:[0/2](110900/4588595) loss:2.118 lr:0.0000100 epoch_Time:28211.0min: [2024-01-03 05:18:29,505][model8_pretrain.py][INFO] Epoch:[0/2](110900/4588595) loss:3.242 lr:0.0000100 epoch_Time:28211.0min: [2024-01-03 05:18:29,505][model8_pretrain.py][INFO] Epoch:[0/2](110900/4588595) loss:3.069 lr:0.0000100 epoch_Time:28211.0min: [2024-01-03 05:19:06,456][model8_pretrain.py][INFO] Epoch:[0/2](111000/4588595) loss:3.156 lr:0.0000100 epoch_Time:28209.0min: [2024-01-03 05:19:06,457][model8_pretrain.py][INFO] Epoch:[0/2](111000/4588595) loss:3.185 lr:0.0000100 epoch_Time:28209.0min: [2024-01-03 05:19:06,457][model8_pretrain.py][INFO] Epoch:[0/2](111000/4588595) loss:3.132 lr:0.0000100 epoch_Time:28209.0min: [2024-01-03 
05:19:06,457][model8_pretrain.py][INFO] Epoch:[0/2](111000/4588595) loss:2.915 lr:0.0000100 epoch_Time:28209.0min: [2024-01-03 05:19:06,457][model8_pretrain.py][INFO] Epoch:[0/2](111000/4588595) loss:2.658 lr:0.0000100 epoch_Time:28209.0min: [2024-01-03 05:19:06,457][model8_pretrain.py][INFO] Epoch:[0/2](111000/4588595) loss:2.997 lr:0.0000100 epoch_Time:28209.0min: [2024-01-03 05:19:06,457][model8_pretrain.py][INFO] Epoch:[0/2](111000/4588595) loss:2.815 lr:0.0000100 epoch_Time:28209.0min: [2024-01-03 05:19:06,457][model8_pretrain.py][INFO] Epoch:[0/2](111000/4588595) loss:3.285 lr:0.0000100 epoch_Time:28209.0min: [2024-01-03 05:19:43,410][model8_pretrain.py][INFO] Epoch:[0/2](111100/4588595) loss:2.906 lr:0.0000100 epoch_Time:28208.0min: [2024-01-03 05:19:43,410][model8_pretrain.py][INFO] Epoch:[0/2](111100/4588595) loss:2.888 lr:0.0000100 epoch_Time:28208.0min: [2024-01-03 05:19:43,410][model8_pretrain.py][INFO] Epoch:[0/2](111100/4588595) loss:2.945 lr:0.0000100 epoch_Time:28208.0min: [2024-01-03 05:19:43,410][model8_pretrain.py][INFO] Epoch:[0/2](111100/4588595) loss:3.088 lr:0.0000100 epoch_Time:28208.0min: [2024-01-03 05:19:43,410][model8_pretrain.py][INFO] Epoch:[0/2](111100/4588595) loss:3.114 lr:0.0000100 epoch_Time:28208.0min: [2024-01-03 05:19:43,410][model8_pretrain.py][INFO] Epoch:[0/2](111100/4588595) loss:2.564 lr:0.0000100 epoch_Time:28208.0min: [2024-01-03 05:19:43,410][model8_pretrain.py][INFO] Epoch:[0/2](111100/4588595) loss:2.503 lr:0.0000100 epoch_Time:28208.0min: [2024-01-03 05:19:43,410][model8_pretrain.py][INFO] Epoch:[0/2](111100/4588595) loss:3.389 lr:0.0000100 epoch_Time:28208.0min: [2024-01-03 05:20:20,378][model8_pretrain.py][INFO] Epoch:[0/2](111200/4588595) loss:2.524 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:20:20,378][model8_pretrain.py][INFO] Epoch:[0/2](111200/4588595) loss:2.853 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:20:20,378][model8_pretrain.py][INFO] Epoch:[0/2](111200/4588595) loss:3.069 lr:0.0000100 
epoch_Time:28207.0min: [2024-01-03 05:20:20,378][model8_pretrain.py][INFO] Epoch:[0/2](111200/4588595) loss:2.989 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:20:20,378][model8_pretrain.py][INFO] Epoch:[0/2](111200/4588595) loss:2.855 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:20:20,378][model8_pretrain.py][INFO] Epoch:[0/2](111200/4588595) loss:2.372 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:20:20,378][model8_pretrain.py][INFO] Epoch:[0/2](111200/4588595) loss:2.838 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:20:20,378][model8_pretrain.py][INFO] Epoch:[0/2](111200/4588595) loss:2.997 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:20:57,322][model8_pretrain.py][INFO] Epoch:[0/2](111300/4588595) loss:2.962 lr:0.0000100 epoch_Time:28205.0min: [2024-01-03 05:20:57,322][model8_pretrain.py][INFO] Epoch:[0/2](111300/4588595) loss:3.562 lr:0.0000100 epoch_Time:28205.0min: [2024-01-03 05:20:57,322][model8_pretrain.py][INFO] Epoch:[0/2](111300/4588595) loss:3.201 lr:0.0000100 epoch_Time:28205.0min: [2024-01-03 05:20:57,322][model8_pretrain.py][INFO] Epoch:[0/2](111300/4588595) loss:2.790 lr:0.0000100 epoch_Time:28205.0min: [2024-01-03 05:20:57,322][model8_pretrain.py][INFO] Epoch:[0/2](111300/4588595) loss:3.363 lr:0.0000100 epoch_Time:28205.0min: [2024-01-03 05:20:57,322][model8_pretrain.py][INFO] Epoch:[0/2](111300/4588595) loss:2.994 lr:0.0000100 epoch_Time:28205.0min: [2024-01-03 05:20:57,322][model8_pretrain.py][INFO] Epoch:[0/2](111300/4588595) loss:3.013 lr:0.0000100 epoch_Time:28205.0min: [2024-01-03 05:20:57,322][model8_pretrain.py][INFO] Epoch:[0/2](111300/4588595) loss:2.726 lr:0.0000100 epoch_Time:28205.0min: [2024-01-03 05:21:42,992][model8_pretrain.py][INFO] Epoch:[0/2](111400/4588595) loss:3.289 lr:0.0000100 epoch_Time:28211.0min: [2024-01-03 05:21:42,992][model8_pretrain.py][INFO] Epoch:[0/2](111400/4588595) loss:2.910 lr:0.0000100 epoch_Time:28211.0min: [2024-01-03 05:21:42,992][model8_pretrain.py][INFO] 
Epoch:[0/2](111400/4588595) loss:2.796 lr:0.0000100 epoch_Time:28211.0min: [2024-01-03 05:21:42,992][model8_pretrain.py][INFO] Epoch:[0/2](111400/4588595) loss:3.091 lr:0.0000100 epoch_Time:28211.0min: [2024-01-03 05:21:42,992][model8_pretrain.py][INFO] Epoch:[0/2](111400/4588595) loss:2.927 lr:0.0000100 epoch_Time:28211.0min: [2024-01-03 05:21:42,992][model8_pretrain.py][INFO] Epoch:[0/2](111400/4588595) loss:2.924 lr:0.0000100 epoch_Time:28211.0min: [2024-01-03 05:21:42,992][model8_pretrain.py][INFO] Epoch:[0/2](111400/4588595) loss:2.894 lr:0.0000100 epoch_Time:28211.0min: [2024-01-03 05:21:42,992][model8_pretrain.py][INFO] Epoch:[0/2](111400/4588595) loss:2.829 lr:0.0000100 epoch_Time:28211.0min: [2024-01-03 05:22:19,943][model8_pretrain.py][INFO] Epoch:[0/2](111500/4588595) loss:3.145 lr:0.0000100 epoch_Time:28209.0min: [2024-01-03 05:22:19,943][model8_pretrain.py][INFO] Epoch:[0/2](111500/4588595) loss:3.205 lr:0.0000100 epoch_Time:28209.0min: [2024-01-03 05:22:19,943][model8_pretrain.py][INFO] Epoch:[0/2](111500/4588595) loss:3.219 lr:0.0000100 epoch_Time:28209.0min: [2024-01-03 05:22:19,943][model8_pretrain.py][INFO] Epoch:[0/2](111500/4588595) loss:3.120 lr:0.0000100 epoch_Time:28209.0min: [2024-01-03 05:22:19,943][model8_pretrain.py][INFO] Epoch:[0/2](111500/4588595) loss:3.051 lr:0.0000100 epoch_Time:28209.0min: [2024-01-03 05:22:19,943][model8_pretrain.py][INFO] Epoch:[0/2](111500/4588595) loss:2.879 lr:0.0000100 epoch_Time:28209.0min: [2024-01-03 05:22:19,943][model8_pretrain.py][INFO] Epoch:[0/2](111500/4588595) loss:2.941 lr:0.0000100 epoch_Time:28209.0min: [2024-01-03 05:22:19,943][model8_pretrain.py][INFO] Epoch:[0/2](111500/4588595) loss:3.026 lr:0.0000100 epoch_Time:28209.0min: [2024-01-03 05:22:56,886][model8_pretrain.py][INFO] Epoch:[0/2](111600/4588595) loss:2.878 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:22:56,886][model8_pretrain.py][INFO] Epoch:[0/2](111600/4588595) loss:1.914 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 
05:22:56,886][model8_pretrain.py][INFO] Epoch:[0/2](111600/4588595) loss:2.967 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:22:56,886][model8_pretrain.py][INFO] Epoch:[0/2](111600/4588595) loss:2.960 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:22:56,886][model8_pretrain.py][INFO] Epoch:[0/2](111600/4588595) loss:3.259 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:22:56,886][model8_pretrain.py][INFO] Epoch:[0/2](111600/4588595) loss:3.124 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:22:56,887][model8_pretrain.py][INFO] Epoch:[0/2](111600/4588595) loss:3.010 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:22:56,887][model8_pretrain.py][INFO] Epoch:[0/2](111600/4588595) loss:2.693 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:23:33,825][model8_pretrain.py][INFO] Epoch:[0/2](111700/4588595) loss:3.141 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:23:33,826][model8_pretrain.py][INFO] Epoch:[0/2](111700/4588595) loss:3.151 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:23:33,826][model8_pretrain.py][INFO] Epoch:[0/2](111700/4588595) loss:2.944 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:23:33,826][model8_pretrain.py][INFO] Epoch:[0/2](111700/4588595) loss:2.850 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:23:33,826][model8_pretrain.py][INFO] Epoch:[0/2](111700/4588595) loss:3.096 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:23:33,826][model8_pretrain.py][INFO] Epoch:[0/2](111700/4588595) loss:2.852 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:23:33,826][model8_pretrain.py][INFO] Epoch:[0/2](111700/4588595) loss:2.506 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:23:33,826][model8_pretrain.py][INFO] Epoch:[0/2](111700/4588595) loss:2.726 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:24:10,771][model8_pretrain.py][INFO] Epoch:[0/2](111800/4588595) loss:2.689 lr:0.0000100 epoch_Time:28205.0min: [2024-01-03 05:24:10,771][model8_pretrain.py][INFO] Epoch:[0/2](111800/4588595) loss:2.961 lr:0.0000100 
epoch_Time:28205.0min: [2024-01-03 05:24:10,771][model8_pretrain.py][INFO] Epoch:[0/2](111800/4588595) loss:2.915 lr:0.0000100 epoch_Time:28205.0min: [2024-01-03 05:24:10,771][model8_pretrain.py][INFO] Epoch:[0/2](111800/4588595) loss:2.955 lr:0.0000100 epoch_Time:28205.0min: [2024-01-03 05:24:10,771][model8_pretrain.py][INFO] Epoch:[0/2](111800/4588595) loss:2.997 lr:0.0000100 epoch_Time:28205.0min: [2024-01-03 05:24:10,771][model8_pretrain.py][INFO] Epoch:[0/2](111800/4588595) loss:2.938 lr:0.0000100 epoch_Time:28205.0min: [2024-01-03 05:24:10,771][model8_pretrain.py][INFO] Epoch:[0/2](111800/4588595) loss:2.994 lr:0.0000100 epoch_Time:28205.0min: [2024-01-03 05:24:10,771][model8_pretrain.py][INFO] Epoch:[0/2](111800/4588595) loss:2.917 lr:0.0000100 epoch_Time:28205.0min: [2024-01-03 05:24:47,701][model8_pretrain.py][INFO] Epoch:[0/2](111900/4588595) loss:3.014 lr:0.0000100 epoch_Time:28204.0min: [2024-01-03 05:24:47,701][model8_pretrain.py][INFO] Epoch:[0/2](111900/4588595) loss:3.264 lr:0.0000100 epoch_Time:28204.0min: [2024-01-03 05:24:47,701][model8_pretrain.py][INFO] Epoch:[0/2](111900/4588595) loss:3.679 lr:0.0000100 epoch_Time:28204.0min: [2024-01-03 05:24:47,701][model8_pretrain.py][INFO] Epoch:[0/2](111900/4588595) loss:3.273 lr:0.0000100 epoch_Time:28204.0min: [2024-01-03 05:24:47,701][model8_pretrain.py][INFO] Epoch:[0/2](111900/4588595) loss:2.961 lr:0.0000100 epoch_Time:28204.0min: [2024-01-03 05:24:47,701][model8_pretrain.py][INFO] Epoch:[0/2](111900/4588595) loss:2.933 lr:0.0000100 epoch_Time:28204.0min: [2024-01-03 05:24:47,701][model8_pretrain.py][INFO] Epoch:[0/2](111900/4588595) loss:2.867 lr:0.0000100 epoch_Time:28204.0min: [2024-01-03 05:24:47,702][model8_pretrain.py][INFO] Epoch:[0/2](111900/4588595) loss:3.246 lr:0.0000100 epoch_Time:28204.0min: [2024-01-03 05:25:24,640][model8_pretrain.py][INFO] Epoch:[0/2](112000/4588595) loss:3.030 lr:0.0000100 epoch_Time:28203.0min: [2024-01-03 05:25:24,640][model8_pretrain.py][INFO] 
Epoch:[0/2](112000/4588595) loss:3.248 lr:0.0000100 epoch_Time:28203.0min: [2024-01-03 05:25:24,640][model8_pretrain.py][INFO] Epoch:[0/2](112000/4588595) loss:2.930 lr:0.0000100 epoch_Time:28203.0min: [2024-01-03 05:25:24,640][model8_pretrain.py][INFO] Epoch:[0/2](112000/4588595) loss:3.427 lr:0.0000100 epoch_Time:28203.0min: [2024-01-03 05:25:24,640][model8_pretrain.py][INFO] Epoch:[0/2](112000/4588595) loss:3.553 lr:0.0000100 epoch_Time:28203.0min: [2024-01-03 05:25:24,640][model8_pretrain.py][INFO] Epoch:[0/2](112000/4588595) loss:2.880 lr:0.0000100 epoch_Time:28203.0min: [2024-01-03 05:25:24,640][model8_pretrain.py][INFO] Epoch:[0/2](112000/4588595) loss:2.977 lr:0.0000100 epoch_Time:28203.0min: [2024-01-03 05:25:24,640][model8_pretrain.py][INFO] Epoch:[0/2](112000/4588595) loss:3.120 lr:0.0000100 epoch_Time:28203.0min: [2024-01-03 05:26:01,577][model8_pretrain.py][INFO] Epoch:[0/2](112100/4588595) loss:2.901 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:26:01,577][model8_pretrain.py][INFO] Epoch:[0/2](112100/4588595) loss:3.270 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:26:01,577][model8_pretrain.py][INFO] Epoch:[0/2](112100/4588595) loss:3.009 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:26:01,577][model8_pretrain.py][INFO] Epoch:[0/2](112100/4588595) loss:2.525 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:26:01,577][model8_pretrain.py][INFO] Epoch:[0/2](112100/4588595) loss:3.134 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:26:01,578][model8_pretrain.py][INFO] Epoch:[0/2](112100/4588595) loss:3.153 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:26:01,578][model8_pretrain.py][INFO] Epoch:[0/2](112100/4588595) loss:3.066 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:26:01,578][model8_pretrain.py][INFO] Epoch:[0/2](112100/4588595) loss:2.997 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:26:47,179][model8_pretrain.py][INFO] Epoch:[0/2](112200/4588595) loss:2.838 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 
05:26:47,179][model8_pretrain.py][INFO] Epoch:[0/2](112200/4588595) loss:3.409 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:26:47,179][model8_pretrain.py][INFO] Epoch:[0/2](112200/4588595) loss:2.636 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:26:47,179][model8_pretrain.py][INFO] Epoch:[0/2](112200/4588595) loss:2.933 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:26:47,179][model8_pretrain.py][INFO] Epoch:[0/2](112200/4588595) loss:3.218 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:26:47,179][model8_pretrain.py][INFO] Epoch:[0/2](112200/4588595) loss:2.702 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:26:47,179][model8_pretrain.py][INFO] Epoch:[0/2](112200/4588595) loss:3.377 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:26:47,179][model8_pretrain.py][INFO] Epoch:[0/2](112200/4588595) loss:2.965 lr:0.0000100 epoch_Time:28207.0min: [2024-01-03 05:27:24,113][model8_pretrain.py][INFO] Epoch:[0/2](112300/4588595) loss:3.139 lr:0.0000100 epoch_Time:28205.0min: [2024-01-03 05:27:24,113][model8_pretrain.py][INFO] Epoch:[0/2](112300/4588595) loss:3.217 lr:0.0000100 epoch_Time:28205.0min: [2024-01-03 05:27:24,113][model8_pretrain.py][INFO] Epoch:[0/2](112300/4588595) loss:3.188 lr:0.0000100 epoch_Time:28205.0min: [2024-01-03 05:27:24,113][model8_pretrain.py][INFO] Epoch:[0/2](112300/4588595) loss:3.035 lr:0.0000100 epoch_Time:28205.0min: [2024-01-03 05:27:24,113][model8_pretrain.py][INFO] Epoch:[0/2](112300/4588595) loss:3.224 lr:0.0000100 epoch_Time:28205.0min: [2024-01-03 05:27:24,113][model8_pretrain.py][INFO] Epoch:[0/2](112300/4588595) loss:3.115 lr:0.0000100 epoch_Time:28205.0min: [2024-01-03 05:27:24,113][model8_pretrain.py][INFO] Epoch:[0/2](112300/4588595) loss:2.423 lr:0.0000100 epoch_Time:28205.0min: [2024-01-03 05:27:24,113][model8_pretrain.py][INFO] Epoch:[0/2](112300/4588595) loss:3.061 lr:0.0000100 epoch_Time:28205.0min: [2024-01-03 05:28:01,054][model8_pretrain.py][INFO] Epoch:[0/2](112400/4588595) loss:2.884 lr:0.0000100 
epoch_Time:28204.0min: [2024-01-03 05:28:01,054][model8_pretrain.py][INFO] Epoch:[0/2](112400/4588595) loss:3.208 lr:0.0000100 epoch_Time:28204.0min: [2024-01-03 05:28:01,054][model8_pretrain.py][INFO] Epoch:[0/2](112400/4588595) loss:3.204 lr:0.0000100 epoch_Time:28204.0min: [2024-01-03 05:28:01,054][model8_pretrain.py][INFO] Epoch:[0/2](112400/4588595) loss:3.161 lr:0.0000100 epoch_Time:28204.0min: [2024-01-03 05:28:01,054][model8_pretrain.py][INFO] Epoch:[0/2](112400/4588595) loss:3.090 lr:0.0000100 epoch_Time:28204.0min: [2024-01-03 05:28:01,054][model8_pretrain.py][INFO] Epoch:[0/2](112400/4588595) loss:2.989 lr:0.0000100 epoch_Time:28204.0min: [2024-01-03 05:28:01,054][model8_pretrain.py][INFO] Epoch:[0/2](112400/4588595) loss:2.774 lr:0.0000100 epoch_Time:28204.0min: [2024-01-03 05:28:01,054][model8_pretrain.py][INFO] Epoch:[0/2](112400/4588595) loss:2.796 lr:0.0000100 epoch_Time:28204.0min: [2024-01-03 05:28:38,038][model8_pretrain.py][INFO] Epoch:[0/2](112500/4588595) loss:3.275 lr:0.0000100 epoch_Time:28203.0min: [2024-01-03 05:28:38,038][model8_pretrain.py][INFO] Epoch:[0/2](112500/4588595) loss:2.488 lr:0.0000100 epoch_Time:28203.0min: [2024-01-03 05:28:38,038][model8_pretrain.py][INFO] Epoch:[0/2](112500/4588595) loss:2.667 lr:0.0000100 epoch_Time:28203.0min: [2024-01-03 05:28:38,038][model8_pretrain.py][INFO] Epoch:[0/2](112500/4588595) loss:2.690 lr:0.0000100 epoch_Time:28203.0min: [2024-01-03 05:28:38,038][model8_pretrain.py][INFO] Epoch:[0/2](112500/4588595) loss:3.056 lr:0.0000100 epoch_Time:28203.0min: [2024-01-03 05:28:38,038][model8_pretrain.py][INFO] Epoch:[0/2](112500/4588595) loss:3.102 lr:0.0000100 epoch_Time:28203.0min: [2024-01-03 05:28:38,038][model8_pretrain.py][INFO] Epoch:[0/2](112500/4588595) loss:2.797 lr:0.0000100 epoch_Time:28203.0min: [2024-01-03 05:28:38,038][model8_pretrain.py][INFO] Epoch:[0/2](112500/4588595) loss:2.765 lr:0.0000100 epoch_Time:28203.0min: [2024-01-03 05:29:14,998][model8_pretrain.py][INFO] 
Epoch:[0/2](112600/4588595) loss:2.802 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:29:14,998][model8_pretrain.py][INFO] Epoch:[0/2](112600/4588595) loss:2.804 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:29:14,998][model8_pretrain.py][INFO] Epoch:[0/2](112600/4588595) loss:2.295 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:29:14,998][model8_pretrain.py][INFO] Epoch:[0/2](112600/4588595) loss:3.396 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:29:14,998][model8_pretrain.py][INFO] Epoch:[0/2](112600/4588595) loss:3.265 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:29:14,998][model8_pretrain.py][INFO] Epoch:[0/2](112600/4588595) loss:2.687 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:29:14,998][model8_pretrain.py][INFO] Epoch:[0/2](112600/4588595) loss:3.295 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:29:14,998][model8_pretrain.py][INFO] Epoch:[0/2](112600/4588595) loss:3.273 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:29:52,011][model8_pretrain.py][INFO] Epoch:[0/2](112700/4588595) loss:2.625 lr:0.0000100 epoch_Time:28200.0min: [2024-01-03 05:29:52,011][model8_pretrain.py][INFO] Epoch:[0/2](112700/4588595) loss:3.410 lr:0.0000100 epoch_Time:28200.0min: [2024-01-03 05:29:52,011][model8_pretrain.py][INFO] Epoch:[0/2](112700/4588595) loss:2.596 lr:0.0000100 epoch_Time:28200.0min: [2024-01-03 05:29:52,011][model8_pretrain.py][INFO] Epoch:[0/2](112700/4588595) loss:3.303 lr:0.0000100 epoch_Time:28200.0min: [2024-01-03 05:29:52,011][model8_pretrain.py][INFO] Epoch:[0/2](112700/4588595) loss:3.126 lr:0.0000100 epoch_Time:28200.0min: [2024-01-03 05:29:52,011][model8_pretrain.py][INFO] Epoch:[0/2](112700/4588595) loss:2.599 lr:0.0000100 epoch_Time:28200.0min: [2024-01-03 05:29:52,011][model8_pretrain.py][INFO] Epoch:[0/2](112700/4588595) loss:2.880 lr:0.0000100 epoch_Time:28200.0min: [2024-01-03 05:29:52,012][model8_pretrain.py][INFO] Epoch:[0/2](112700/4588595) loss:2.856 lr:0.0000100 epoch_Time:28200.0min: [2024-01-03 
05:30:28,964][model8_pretrain.py][INFO] Epoch:[0/2](112800/4588595) loss:2.534 lr:0.0000100 epoch_Time:28199.0min: [2024-01-03 05:30:28,964][model8_pretrain.py][INFO] Epoch:[0/2](112800/4588595) loss:3.215 lr:0.0000100 epoch_Time:28199.0min: [2024-01-03 05:30:28,964][model8_pretrain.py][INFO] Epoch:[0/2](112800/4588595) loss:2.913 lr:0.0000100 epoch_Time:28199.0min: [2024-01-03 05:30:28,964][model8_pretrain.py][INFO] Epoch:[0/2](112800/4588595) loss:2.937 lr:0.0000100 epoch_Time:28199.0min: [2024-01-03 05:30:28,964][model8_pretrain.py][INFO] Epoch:[0/2](112800/4588595) loss:3.383 lr:0.0000100 epoch_Time:28199.0min: [2024-01-03 05:30:28,964][model8_pretrain.py][INFO] Epoch:[0/2](112800/4588595) loss:3.004 lr:0.0000100 epoch_Time:28199.0min: [2024-01-03 05:30:28,965][model8_pretrain.py][INFO] Epoch:[0/2](112800/4588595) loss:3.110 lr:0.0000100 epoch_Time:28199.0min: [2024-01-03 05:30:28,965][model8_pretrain.py][INFO] Epoch:[0/2](112800/4588595) loss:3.190 lr:0.0000100 epoch_Time:28199.0min: [2024-01-03 05:31:05,921][model8_pretrain.py][INFO] Epoch:[0/2](112900/4588595) loss:2.957 lr:0.0000100 epoch_Time:28198.0min: [2024-01-03 05:31:05,921][model8_pretrain.py][INFO] Epoch:[0/2](112900/4588595) loss:3.217 lr:0.0000100 epoch_Time:28198.0min: [2024-01-03 05:31:05,921][model8_pretrain.py][INFO] Epoch:[0/2](112900/4588595) loss:3.638 lr:0.0000100 epoch_Time:28198.0min: [2024-01-03 05:31:05,921][model8_pretrain.py][INFO] Epoch:[0/2](112900/4588595) loss:2.867 lr:0.0000100 epoch_Time:28198.0min: [2024-01-03 05:31:05,921][model8_pretrain.py][INFO] Epoch:[0/2](112900/4588595) loss:3.108 lr:0.0000100 epoch_Time:28198.0min: [2024-01-03 05:31:05,921][model8_pretrain.py][INFO] Epoch:[0/2](112900/4588595) loss:2.921 lr:0.0000100 epoch_Time:28198.0min: [2024-01-03 05:31:05,921][model8_pretrain.py][INFO] Epoch:[0/2](112900/4588595) loss:3.021 lr:0.0000100 epoch_Time:28198.0min: [2024-01-03 05:31:05,921][model8_pretrain.py][INFO] Epoch:[0/2](112900/4588595) loss:2.978 lr:0.0000100 
epoch_Time:28198.0min: [2024-01-03 05:31:51,596][model8_pretrain.py][INFO] Epoch:[0/2](113000/4588595) loss:2.739 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:31:51,596][model8_pretrain.py][INFO] Epoch:[0/2](113000/4588595) loss:2.890 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:31:51,596][model8_pretrain.py][INFO] Epoch:[0/2](113000/4588595) loss:2.893 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:31:51,596][model8_pretrain.py][INFO] Epoch:[0/2](113000/4588595) loss:3.279 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:31:51,596][model8_pretrain.py][INFO] Epoch:[0/2](113000/4588595) loss:2.978 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:31:51,596][model8_pretrain.py][INFO] Epoch:[0/2](113000/4588595) loss:3.246 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:31:51,596][model8_pretrain.py][INFO] Epoch:[0/2](113000/4588595) loss:3.313 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:31:51,596][model8_pretrain.py][INFO] Epoch:[0/2](113000/4588595) loss:3.013 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:32:28,541][model8_pretrain.py][INFO] Epoch:[0/2](113100/4588595) loss:2.593 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:32:28,541][model8_pretrain.py][INFO] Epoch:[0/2](113100/4588595) loss:2.699 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:32:28,541][model8_pretrain.py][INFO] Epoch:[0/2](113100/4588595) loss:2.587 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:32:28,541][model8_pretrain.py][INFO] Epoch:[0/2](113100/4588595) loss:3.050 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:32:28,541][model8_pretrain.py][INFO] Epoch:[0/2](113100/4588595) loss:2.832 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:32:28,541][model8_pretrain.py][INFO] Epoch:[0/2](113100/4588595) loss:3.775 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:32:28,541][model8_pretrain.py][INFO] Epoch:[0/2](113100/4588595) loss:3.214 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:32:28,541][model8_pretrain.py][INFO] 
Epoch:[0/2](113100/4588595) loss:2.139 lr:0.0000100 epoch_Time:28202.0min: [2024-01-03 05:33:05,486][model8_pretrain.py][INFO] Epoch:[0/2](113200/4588595) loss:3.105 lr:0.0000100 epoch_Time:28200.0min: [2024-01-03 05:33:05,486][model8_pretrain.py][INFO] Epoch:[0/2](113200/4588595) loss:3.297 lr:0.0000100 epoch_Time:28200.0min: [2024-01-03 05:33:05,486][model8_pretrain.py][INFO] Epoch:[0/2](113200/4588595) loss:3.083 lr:0.0000100 epoch_Time:28200.0min: [2024-01-03 05:33:05,486][model8_pretrain.py][INFO] Epoch:[0/2](113200/4588595) loss:2.976 lr:0.0000100 epoch_Time:28200.0min: [2024-01-03 05:33:05,486][model8_pretrain.py][INFO] Epoch:[0/2](113200/4588595) loss:2.639 lr:0.0000100 epoch_Time:28200.0min: [2024-01-03 05:33:05,486][model8_pretrain.py][INFO] Epoch:[0/2](113200/4588595) loss:3.208 lr:0.0000100 epoch_Time:28200.0min: [2024-01-03 05:33:05,486][model8_pretrain.py][INFO] Epoch:[0/2](113200/4588595) loss:2.710 lr:0.0000100 epoch_Time:28200.0min: [2024-01-03 05:33:05,486][model8_pretrain.py][INFO] Epoch:[0/2](113200/4588595) loss:3.236 lr:0.0000100 epoch_Time:28200.0min: [2024-01-03 05:33:42,434][model8_pretrain.py][INFO] Epoch:[0/2](113300/4588595) loss:3.333 lr:0.0000100 epoch_Time:28199.0min: [2024-01-03 05:33:42,434][model8_pretrain.py][INFO] Epoch:[0/2](113300/4588595) loss:2.885 lr:0.0000100 epoch_Time:28199.0min: [2024-01-03 05:33:42,434][model8_pretrain.py][INFO] Epoch:[0/2](113300/4588595) loss:2.546 lr:0.0000100 epoch_Time:28199.0min: [2024-01-03 05:33:42,434][model8_pretrain.py][INFO] Epoch:[0/2](113300/4588595) loss:2.750 lr:0.0000100 epoch_Time:28199.0min: [2024-01-03 05:33:42,434][model8_pretrain.py][INFO] Epoch:[0/2](113300/4588595) loss:3.128 lr:0.0000100 epoch_Time:28199.0min: [2024-01-03 05:33:42,434][model8_pretrain.py][INFO] Epoch:[0/2](113300/4588595) loss:2.661 lr:0.0000100 epoch_Time:28199.0min: [2024-01-03 05:33:42,434][model8_pretrain.py][INFO] Epoch:[0/2](113300/4588595) loss:3.026 lr:0.0000100 epoch_Time:28199.0min: [2024-01-03 
05:33:42,434][model8_pretrain.py][INFO] Epoch:[0/2](113300/4588595) loss:3.005 lr:0.0000100 epoch_Time:28199.0min: [2024-01-03 05:34:19,379][model8_pretrain.py][INFO] Epoch:[0/2](113400/4588595) loss:3.105 lr:0.0000100 epoch_Time:28198.0min: [2024-01-03 05:34:19,379][model8_pretrain.py][INFO] Epoch:[0/2](113400/4588595) loss:2.831 lr:0.0000100 epoch_Time:28198.0min: [2024-01-03 05:34:19,379][model8_pretrain.py][INFO] Epoch:[0/2](113400/4588595) loss:3.452 lr:0.0000100 epoch_Time:28198.0min: [2024-01-03 05:34:19,379][model8_pretrain.py][INFO] Epoch:[0/2](113400/4588595) loss:3.224 lr:0.0000100 epoch_Time:28198.0min: [2024-01-03 05:34:19,379][model8_pretrain.py][INFO] Epoch:[0/2](113400/4588595) loss:3.196 lr:0.0000100 epoch_Time:28198.0min: [2024-01-03 05:34:19,380][model8_pretrain.py][INFO] Epoch:[0/2](113400/4588595) loss:3.309 lr:0.0000100 epoch_Time:28198.0min: [2024-01-03 05:34:19,380][model8_pretrain.py][INFO] Epoch:[0/2](113400/4588595) loss:2.916 lr:0.0000100 epoch_Time:28198.0min: [2024-01-03 05:34:19,380][model8_pretrain.py][INFO] Epoch:[0/2](113400/4588595) loss:3.278 lr:0.0000100 epoch_Time:28198.0min: [2024-01-03 05:34:56,349][model8_pretrain.py][INFO] Epoch:[0/2](113500/4588595) loss:3.218 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:34:56,350][model8_pretrain.py][INFO] Epoch:[0/2](113500/4588595) loss:3.203 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:34:56,350][model8_pretrain.py][INFO] Epoch:[0/2](113500/4588595) loss:3.330 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:34:56,350][model8_pretrain.py][INFO] Epoch:[0/2](113500/4588595) loss:3.110 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:34:56,350][model8_pretrain.py][INFO] Epoch:[0/2](113500/4588595) loss:2.596 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:34:56,350][model8_pretrain.py][INFO] Epoch:[0/2](113500/4588595) loss:2.968 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:34:56,350][model8_pretrain.py][INFO] Epoch:[0/2](113500/4588595) loss:2.772 lr:0.0000100 
epoch_Time:28196.0min: [2024-01-03 05:34:56,350][model8_pretrain.py][INFO] Epoch:[0/2](113500/4588595) loss:2.818 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:35:33,314][model8_pretrain.py][INFO] Epoch:[0/2](113600/4588595) loss:3.321 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:35:33,314][model8_pretrain.py][INFO] Epoch:[0/2](113600/4588595) loss:3.159 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:35:33,314][model8_pretrain.py][INFO] Epoch:[0/2](113600/4588595) loss:3.270 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:35:33,314][model8_pretrain.py][INFO] Epoch:[0/2](113600/4588595) loss:3.221 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:35:33,314][model8_pretrain.py][INFO] Epoch:[0/2](113600/4588595) loss:2.928 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:35:33,314][model8_pretrain.py][INFO] Epoch:[0/2](113600/4588595) loss:2.797 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:35:33,314][model8_pretrain.py][INFO] Epoch:[0/2](113600/4588595) loss:2.886 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:35:33,314][model8_pretrain.py][INFO] Epoch:[0/2](113600/4588595) loss:2.698 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:36:10,277][model8_pretrain.py][INFO] Epoch:[0/2](113700/4588595) loss:2.908 lr:0.0000100 epoch_Time:28194.0min: [2024-01-03 05:36:10,277][model8_pretrain.py][INFO] Epoch:[0/2](113700/4588595) loss:2.991 lr:0.0000100 epoch_Time:28194.0min: [2024-01-03 05:36:10,277][model8_pretrain.py][INFO] Epoch:[0/2](113700/4588595) loss:3.407 lr:0.0000100 epoch_Time:28194.0min: [2024-01-03 05:36:10,277][model8_pretrain.py][INFO] Epoch:[0/2](113700/4588595) loss:2.633 lr:0.0000100 epoch_Time:28194.0min: [2024-01-03 05:36:10,277][model8_pretrain.py][INFO] Epoch:[0/2](113700/4588595) loss:2.683 lr:0.0000100 epoch_Time:28194.0min: [2024-01-03 05:36:10,277][model8_pretrain.py][INFO] Epoch:[0/2](113700/4588595) loss:3.150 lr:0.0000100 epoch_Time:28194.0min: [2024-01-03 05:36:10,277][model8_pretrain.py][INFO] 
Epoch:[0/2](113700/4588595) loss:2.828 lr:0.0000100 epoch_Time:28194.0min: [2024-01-03 05:36:10,277][model8_pretrain.py][INFO] Epoch:[0/2](113700/4588595) loss:3.310 lr:0.0000100 epoch_Time:28194.0min: [2024-01-03 05:36:56,145][model8_pretrain.py][INFO] Epoch:[0/2](113800/4588595) loss:3.015 lr:0.0000100 epoch_Time:28199.0min: [2024-01-03 05:36:56,145][model8_pretrain.py][INFO] Epoch:[0/2](113800/4588595) loss:3.252 lr:0.0000100 epoch_Time:28199.0min: [2024-01-03 05:36:56,145][model8_pretrain.py][INFO] Epoch:[0/2](113800/4588595) loss:2.763 lr:0.0000100 epoch_Time:28199.0min: [2024-01-03 05:36:56,145][model8_pretrain.py][INFO] Epoch:[0/2](113800/4588595) loss:2.675 lr:0.0000100 epoch_Time:28199.0min: [2024-01-03 05:36:56,146][model8_pretrain.py][INFO] Epoch:[0/2](113800/4588595) loss:3.387 lr:0.0000100 epoch_Time:28199.0min: [2024-01-03 05:36:56,146][model8_pretrain.py][INFO] Epoch:[0/2](113800/4588595) loss:3.109 lr:0.0000100 epoch_Time:28199.0min: [2024-01-03 05:36:56,146][model8_pretrain.py][INFO] Epoch:[0/2](113800/4588595) loss:3.032 lr:0.0000100 epoch_Time:28199.0min: [2024-01-03 05:36:56,146][model8_pretrain.py][INFO] Epoch:[0/2](113800/4588595) loss:2.756 lr:0.0000100 epoch_Time:28199.0min: [2024-01-03 05:37:33,085][model8_pretrain.py][INFO] Epoch:[0/2](113900/4588595) loss:2.870 lr:0.0000100 epoch_Time:28198.0min: [2024-01-03 05:37:33,085][model8_pretrain.py][INFO] Epoch:[0/2](113900/4588595) loss:2.467 lr:0.0000100 epoch_Time:28198.0min: [2024-01-03 05:37:33,085][model8_pretrain.py][INFO] Epoch:[0/2](113900/4588595) loss:3.056 lr:0.0000100 epoch_Time:28198.0min: [2024-01-03 05:37:33,086][model8_pretrain.py][INFO] Epoch:[0/2](113900/4588595) loss:3.030 lr:0.0000100 epoch_Time:28198.0min: [2024-01-03 05:37:33,085][model8_pretrain.py][INFO] Epoch:[0/2](113900/4588595) loss:3.048 lr:0.0000100 epoch_Time:28198.0min: [2024-01-03 05:37:33,086][model8_pretrain.py][INFO] Epoch:[0/2](113900/4588595) loss:2.425 lr:0.0000100 epoch_Time:28198.0min: [2024-01-03 
05:37:33,086][model8_pretrain.py][INFO] Epoch:[0/2](113900/4588595) loss:2.952 lr:0.0000100 epoch_Time:28198.0min: [2024-01-03 05:37:33,086][model8_pretrain.py][INFO] Epoch:[0/2](113900/4588595) loss:2.516 lr:0.0000100 epoch_Time:28198.0min: [2024-01-03 05:38:10,038][model8_pretrain.py][INFO] Epoch:[0/2](114000/4588595) loss:2.811 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:38:10,038][model8_pretrain.py][INFO] Epoch:[0/2](114000/4588595) loss:2.889 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:38:10,038][model8_pretrain.py][INFO] Epoch:[0/2](114000/4588595) loss:2.898 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:38:10,038][model8_pretrain.py][INFO] Epoch:[0/2](114000/4588595) loss:2.681 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:38:10,038][model8_pretrain.py][INFO] Epoch:[0/2](114000/4588595) loss:3.043 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:38:10,038][model8_pretrain.py][INFO] Epoch:[0/2](114000/4588595) loss:3.008 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:38:10,038][model8_pretrain.py][INFO] Epoch:[0/2](114000/4588595) loss:2.946 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:38:10,039][model8_pretrain.py][INFO] Epoch:[0/2](114000/4588595) loss:2.768 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:38:46,985][model8_pretrain.py][INFO] Epoch:[0/2](114100/4588595) loss:3.231 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:38:46,985][model8_pretrain.py][INFO] Epoch:[0/2](114100/4588595) loss:2.923 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:38:46,985][model8_pretrain.py][INFO] Epoch:[0/2](114100/4588595) loss:3.059 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:38:46,985][model8_pretrain.py][INFO] Epoch:[0/2](114100/4588595) loss:2.293 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:38:46,985][model8_pretrain.py][INFO] Epoch:[0/2](114100/4588595) loss:3.245 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:38:46,985][model8_pretrain.py][INFO] Epoch:[0/2](114100/4588595) loss:2.868 lr:0.0000100 
epoch_Time:28196.0min: [2024-01-03 05:38:46,985][model8_pretrain.py][INFO] Epoch:[0/2](114100/4588595) loss:2.829 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:38:46,985][model8_pretrain.py][INFO] Epoch:[0/2](114100/4588595) loss:2.644 lr:0.0000100 epoch_Time:28196.0min: [2024-01-03 05:39:23,935][model8_pretrain.py][INFO] Epoch:[0/2](114200/4588595) loss:3.201 lr:0.0000100 epoch_Time:28194.0min: [2024-01-03 05:39:23,935][model8_pretrain.py][INFO] Epoch:[0/2](114200/4588595) loss:3.245 lr:0.0000100 epoch_Time:28194.0min: [2024-01-03 05:39:23,935][model8_pretrain.py][INFO] Epoch:[0/2](114200/4588595) loss:3.371 lr:0.0000100 epoch_Time:28194.0min: [2024-01-03 05:39:23,935][model8_pretrain.py][INFO] Epoch:[0/2](114200/4588595) loss:2.854 lr:0.0000100 epoch_Time:28194.0min: [2024-01-03 05:39:23,935][model8_pretrain.py][INFO] Epoch:[0/2](114200/4588595) loss:3.190 lr:0.0000100 epoch_Time:28194.0min: [2024-01-03 05:39:23,935][model8_pretrain.py][INFO] Epoch:[0/2](114200/4588595) loss:3.314 lr:0.0000100 epoch_Time:28194.0min: [2024-01-03 05:39:23,935][model8_pretrain.py][INFO] Epoch:[0/2](114200/4588595) loss:3.142 lr:0.0000100 epoch_Time:28194.0min: [2024-01-03 05:39:23,935][model8_pretrain.py][INFO] Epoch:[0/2](114200/4588595) loss:2.536 lr:0.0000100 epoch_Time:28194.0min: [2024-01-03 05:40:00,885][model8_pretrain.py][INFO] Epoch:[0/2](114300/4588595) loss:3.119 lr:0.0000100 epoch_Time:28193.0min: [2024-01-03 05:40:00,885][model8_pretrain.py][INFO] Epoch:[0/2](114300/4588595) loss:3.178 lr:0.0000100 epoch_Time:28193.0min: [2024-01-03 05:40:00,885][model8_pretrain.py][INFO] Epoch:[0/2](114300/4588595) loss:3.040 lr:0.0000100 epoch_Time:28193.0min: [2024-01-03 05:40:00,885][model8_pretrain.py][INFO] Epoch:[0/2](114300/4588595) loss:3.423 lr:0.0000100 epoch_Time:28193.0min: [2024-01-03 05:40:00,885][model8_pretrain.py][INFO] Epoch:[0/2](114300/4588595) loss:3.685 lr:0.0000100 epoch_Time:28193.0min: [2024-01-03 05:40:00,885][model8_pretrain.py][INFO] 
Epoch:[0/2](114300/4588595) loss:2.947 lr:0.0000100 epoch_Time:28193.0min: [2024-01-03 05:40:00,885][model8_pretrain.py][INFO] Epoch:[0/2](114300/4588595) loss:2.644 lr:0.0000100 epoch_Time:28193.0min: [2024-01-03 05:40:00,885][model8_pretrain.py][INFO] Epoch:[0/2](114300/4588595) loss:3.028 lr:0.0000100 epoch_Time:28193.0min: [2024-01-03 05:40:37,831][model8_pretrain.py][INFO] Epoch:[0/2](114400/4588595) loss:3.332 lr:0.0000100 epoch_Time:28192.0min: [2024-01-03 05:40:37,831][model8_pretrain.py][INFO] Epoch:[0/2](114400/4588595) loss:3.289 lr:0.0000100 epoch_Time:28192.0min: [2024-01-03 05:40:37,831][model8_pretrain.py][INFO] Epoch:[0/2](114400/4588595) loss:2.120 lr:0.0000100 epoch_Time:28192.0min: [2024-01-03 05:40:37,831][model8_pretrain.py][INFO] Epoch:[0/2](114400/4588595) loss:3.128 lr:0.0000100 epoch_Time:28192.0min: [2024-01-03 05:40:37,831][model8_pretrain.py][INFO] Epoch:[0/2](114400/4588595) loss:2.985 lr:0.0000100 epoch_Time:28192.0min: [2024-01-03 05:40:37,831][model8_pretrain.py][INFO] Epoch:[0/2](114400/4588595) loss:2.949 lr:0.0000100 epoch_Time:28192.0min: [2024-01-03 05:40:37,831][model8_pretrain.py][INFO] Epoch:[0/2](114400/4588595) loss:3.247 lr:0.0000100 epoch_Time:28192.0min: [2024-01-03 05:40:37,831][model8_pretrain.py][INFO] Epoch:[0/2](114400/4588595) loss:3.408 lr:0.0000100 epoch_Time:28192.0min: [2024-01-03 05:41:14,767][model8_pretrain.py][INFO] Epoch:[0/2](114500/4588595) loss:3.600 lr:0.0000100 epoch_Time:28191.0min: [2024-01-03 05:41:14,767][model8_pretrain.py][INFO] Epoch:[0/2](114500/4588595) loss:3.005 lr:0.0000100 epoch_Time:28191.0min: [2024-01-03 05:41:14,767][model8_pretrain.py][INFO] Epoch:[0/2](114500/4588595) loss:3.071 lr:0.0000100 epoch_Time:28191.0min: [2024-01-03 05:41:14,767][model8_pretrain.py][INFO] Epoch:[0/2](114500/4588595) loss:2.873 lr:0.0000100 epoch_Time:28191.0min: [2024-01-03 05:41:14,767][model8_pretrain.py][INFO] Epoch:[0/2](114500/4588595) loss:2.940 lr:0.0000100 epoch_Time:28191.0min: [2024-01-03 
05:41:14,767][model8_pretrain.py][INFO] Epoch:[0/2](114500/4588595) loss:2.724 lr:0.0000100 epoch_Time:28191.0min: [2024-01-03 05:41:14,767][model8_pretrain.py][INFO] Epoch:[0/2](114500/4588595) loss:2.972 lr:0.0000100 epoch_Time:28191.0min: [2024-01-03 05:41:14,767][model8_pretrain.py][INFO] Epoch:[0/2](114500/4588595) loss:3.342 lr:0.0000100 epoch_Time:28191.0min: [2024-01-03 05:42:00,485][model8_pretrain.py][INFO] Epoch:[0/2](114600/4588595) loss:3.024 lr:0.0000100 epoch_Time:28195.0min: [2024-01-03 05:42:00,485][model8_pretrain.py][INFO] Epoch:[0/2](114600/4588595) loss:3.168 lr:0.0000100 epoch_Time:28195.0min: [2024-01-03 05:42:00,485][model8_pretrain.py][INFO] Epoch:[0/2](114600/4588595) loss:3.062 lr:0.0000100 epoch_Time:28195.0min: [2024-01-03 05:42:00,485][model8_pretrain.py][INFO] Epoch:[0/2](114600/4588595) loss:3.081 lr:0.0000100 epoch_Time:28195.0min: [2024-01-03 05:42:00,485][model8_pretrain.py][INFO] Epoch:[0/2](114600/4588595) loss:2.790 lr:0.0000100 epoch_Time:28195.0min: [2024-01-03 05:42:00,485][model8_pretrain.py][INFO] Epoch:[0/2](114600/4588595) loss:3.372 lr:0.0000100 epoch_Time:28195.0min: [2024-01-03 05:42:00,485][model8_pretrain.py][INFO] Epoch:[0/2](114600/4588595) loss:3.032 lr:0.0000100 epoch_Time:28195.0min: [2024-01-03 05:42:00,485][model8_pretrain.py][INFO] Epoch:[0/2](114600/4588595) loss:3.163 lr:0.0000100 epoch_Time:28195.0min: [2024-01-03 05:42:37,427][model8_pretrain.py][INFO] Epoch:[0/2](114700/4588595) loss:3.336 lr:0.0000100 epoch_Time:28194.0min: [2024-01-03 05:42:37,427][model8_pretrain.py][INFO] Epoch:[0/2](114700/4588595) loss:3.050 lr:0.0000100 epoch_Time:28194.0min: [2024-01-03 05:42:37,427][model8_pretrain.py][INFO] Epoch:[0/2](114700/4588595) loss:2.607 lr:0.0000100 epoch_Time:28194.0min: [2024-01-03 05:42:37,427][model8_pretrain.py][INFO] Epoch:[0/2](114700/4588595) loss:2.794 lr:0.0000100 epoch_Time:28194.0min: [2024-01-03 05:42:37,427][model8_pretrain.py][INFO] Epoch:[0/2](114700/4588595) loss:2.943 lr:0.0000100 
epoch_Time:28194.0min: [2024-01-03 05:42:37,428][model8_pretrain.py][INFO] Epoch:[0/2](114700/4588595) loss:2.898 lr:0.0000100 epoch_Time:28194.0min: [2024-01-03 05:42:37,428][model8_pretrain.py][INFO] Epoch:[0/2](114700/4588595) loss:2.854 lr:0.0000100 epoch_Time:28194.0min: [2024-01-03 05:42:37,428][model8_pretrain.py][INFO] Epoch:[0/2](114700/4588595) loss:3.232 lr:0.0000100 epoch_Time:28194.0min: [2024-01-03 05:43:14,387][model8_pretrain.py][INFO] Epoch:[0/2](114800/4588595) loss:2.446 lr:0.0000100 epoch_Time:28193.0min: [2024-01-03 05:43:14,388][model8_pretrain.py][INFO] Epoch:[0/2](114800/4588595) loss:3.382 lr:0.0000100 epoch_Time:28193.0min: [2024-01-03 05:43:14,388][model8_pretrain.py][INFO] Epoch:[0/2](114800/4588595) loss:2.955 lr:0.0000100 epoch_Time:28193.0min: [2024-01-03 05:43:14,388][model8_pretrain.py][INFO] Epoch:[0/2](114800/4588595) loss:2.833 lr:0.0000100 epoch_Time:28193.0min: [2024-01-03 05:43:14,388][model8_pretrain.py][INFO] Epoch:[0/2](114800/4588595) loss:3.196 lr:0.0000100 epoch_Time:28193.0min: [2024-01-03 05:43:14,388][model8_pretrain.py][INFO] Epoch:[0/2](114800/4588595) loss:2.819 lr:0.0000100 epoch_Time:28193.0min: [2024-01-03 05:43:14,388][model8_pretrain.py][INFO] Epoch:[0/2](114800/4588595) loss:3.253 lr:0.0000100 epoch_Time:28193.0min: [2024-01-03 05:43:14,388][model8_pretrain.py][INFO] Epoch:[0/2](114800/4588595) loss:3.212 lr:0.0000100 epoch_Time:28193.0min: [2024-01-03 05:43:51,335][model8_pretrain.py][INFO] Epoch:[0/2](114900/4588595) loss:3.358 lr:0.0000100 epoch_Time:28191.0min: [2024-01-03 05:43:51,335][model8_pretrain.py][INFO] Epoch:[0/2](114900/4588595) loss:3.153 lr:0.0000100 epoch_Time:28191.0min: [2024-01-03 05:43:51,335][model8_pretrain.py][INFO] Epoch:[0/2](114900/4588595) loss:3.546 lr:0.0000100 epoch_Time:28191.0min: [2024-01-03 05:43:51,335][model8_pretrain.py][INFO] Epoch:[0/2](114900/4588595) loss:2.565 lr:0.0000100 epoch_Time:28191.0min: [2024-01-03 05:43:51,335][model8_pretrain.py][INFO] 
Epoch:[0/2](114900/4588595) loss:3.567 lr:0.0000100 epoch_Time:28191.0min: [2024-01-03 05:43:51,335][model8_pretrain.py][INFO] Epoch:[0/2](114900/4588595) loss:2.686 lr:0.0000100 epoch_Time:28191.0min: [2024-01-03 05:43:51,335][model8_pretrain.py][INFO] Epoch:[0/2](114900/4588595) loss:2.743 lr:0.0000100 epoch_Time:28191.0min: [2024-01-03 05:43:51,335][model8_pretrain.py][INFO] Epoch:[0/2](114900/4588595) loss:3.003 lr:0.0000100 epoch_Time:28191.0min: [2024-01-03 05:44:28,273][model8_pretrain.py][INFO] Epoch:[0/2](115000/4588595) loss:2.790 lr:0.0000100 epoch_Time:28191.0min: [2024-01-03 05:44:28,273][model8_pretrain.py][INFO] Epoch:[0/2](115000/4588595) loss:3.385 lr:0.0000100 epoch_Time:28190.0min: [2024-01-03 05:44:28,273][model8_pretrain.py][INFO] Epoch:[0/2](115000/4588595) loss:3.161 lr:0.0000100 epoch_Time:28191.0min: [2024-01-03 05:44:28,273][model8_pretrain.py][INFO] Epoch:[0/2](115000/4588595) loss:2.898 lr:0.0000100 epoch_Time:28190.0min: [2024-01-03 05:44:28,273][model8_pretrain.py][INFO] Epoch:[0/2](115000/4588595) loss:3.125 lr:0.0000100 epoch_Time:28190.0min: [2024-01-03 05:44:28,273][model8_pretrain.py][INFO] Epoch:[0/2](115000/4588595) loss:3.047 lr:0.0000100 epoch_Time:28190.0min: [2024-01-03 05:44:28,273][model8_pretrain.py][INFO] Epoch:[0/2](115000/4588595) loss:3.109 lr:0.0000100 epoch_Time:28190.0min: [2024-01-03 05:44:28,274][model8_pretrain.py][INFO] Epoch:[0/2](115000/4588595) loss:2.528 lr:0.0000100 epoch_Time:28190.0min: [2024-01-03 05:45:05,192][model8_pretrain.py][INFO] Epoch:[0/2](115100/4588595) loss:2.934 lr:0.0000100 epoch_Time:28189.0min: [2024-01-03 05:45:05,192][model8_pretrain.py][INFO] Epoch:[0/2](115100/4588595) loss:3.053 lr:0.0000100 epoch_Time:28189.0min: [2024-01-03 05:45:05,192][model8_pretrain.py][INFO] Epoch:[0/2](115100/4588595) loss:3.120 lr:0.0000100 epoch_Time:28189.0min: [2024-01-03 05:45:05,192][model8_pretrain.py][INFO] Epoch:[0/2](115100/4588595) loss:3.185 lr:0.0000100 epoch_Time:28189.0min: [2024-01-03 
05:45:05,192][model8_pretrain.py][INFO] Epoch:[0/2](115100/4588595) loss:2.659 lr:0.0000100 epoch_Time:28189.0min: [2024-01-03 05:45:05,192][model8_pretrain.py][INFO] Epoch:[0/2](115100/4588595) loss:3.079 lr:0.0000100 epoch_Time:28189.0min: [2024-01-03 05:45:05,192][model8_pretrain.py][INFO] Epoch:[0/2](115100/4588595) loss:2.762 lr:0.0000100 epoch_Time:28189.0min: [2024-01-03 05:45:05,192][model8_pretrain.py][INFO] Epoch:[0/2](115100/4588595) loss:2.705 lr:0.0000100 epoch_Time:28189.0min: [2024-01-03 05:45:42,142][model8_pretrain.py][INFO] Epoch:[0/2](115200/4588595) loss:2.389 lr:0.0000100 epoch_Time:28188.0min: [2024-01-03 05:45:42,142][model8_pretrain.py][INFO] Epoch:[0/2](115200/4588595) loss:3.223 lr:0.0000100 epoch_Time:28188.0min: [2024-01-03 05:45:42,142][model8_pretrain.py][INFO] Epoch:[0/2](115200/4588595) loss:3.019 lr:0.0000100 epoch_Time:28188.0min: [2024-01-03 05:45:42,142][model8_pretrain.py][INFO] Epoch:[0/2](115200/4588595) loss:3.243 lr:0.0000100 epoch_Time:28188.0min: [2024-01-03 05:45:42,142][model8_pretrain.py][INFO] Epoch:[0/2](115200/4588595) loss:3.310 lr:0.0000100 epoch_Time:28188.0min: [2024-01-03 05:45:42,142][model8_pretrain.py][INFO] Epoch:[0/2](115200/4588595) loss:3.172 lr:0.0000100 epoch_Time:28188.0min: [2024-01-03 05:45:42,142][model8_pretrain.py][INFO] Epoch:[0/2](115200/4588595) loss:3.368 lr:0.0000100 epoch_Time:28188.0min: [2024-01-03 05:45:42,142][model8_pretrain.py][INFO] Epoch:[0/2](115200/4588595) loss:3.225 lr:0.0000100 epoch_Time:28188.0min: [2024-01-03 05:46:19,118][model8_pretrain.py][INFO] Epoch:[0/2](115300/4588595) loss:2.926 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:46:19,118][model8_pretrain.py][INFO] Epoch:[0/2](115300/4588595) loss:2.634 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:46:19,118][model8_pretrain.py][INFO] Epoch:[0/2](115300/4588595) loss:2.778 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:46:19,118][model8_pretrain.py][INFO] Epoch:[0/2](115300/4588595) loss:3.413 lr:0.0000100 
epoch_Time:28187.0min: [2024-01-03 05:46:19,118][model8_pretrain.py][INFO] Epoch:[0/2](115300/4588595) loss:2.854 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:46:19,118][model8_pretrain.py][INFO] Epoch:[0/2](115300/4588595) loss:2.967 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:46:19,118][model8_pretrain.py][INFO] Epoch:[0/2](115300/4588595) loss:2.763 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:46:19,118][model8_pretrain.py][INFO] Epoch:[0/2](115300/4588595) loss:2.845 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:47:04,743][model8_pretrain.py][INFO] Epoch:[0/2](115400/4588595) loss:3.039 lr:0.0000100 epoch_Time:28191.0min: [2024-01-03 05:47:04,743][model8_pretrain.py][INFO] Epoch:[0/2](115400/4588595) loss:2.892 lr:0.0000100 epoch_Time:28191.0min: [2024-01-03 05:47:04,743][model8_pretrain.py][INFO] Epoch:[0/2](115400/4588595) loss:3.151 lr:0.0000100 epoch_Time:28191.0min: [2024-01-03 05:47:04,743][model8_pretrain.py][INFO] Epoch:[0/2](115400/4588595) loss:3.026 lr:0.0000100 epoch_Time:28191.0min: [2024-01-03 05:47:04,743][model8_pretrain.py][INFO] Epoch:[0/2](115400/4588595) loss:3.137 lr:0.0000100 epoch_Time:28191.0min: [2024-01-03 05:47:04,743][model8_pretrain.py][INFO] Epoch:[0/2](115400/4588595) loss:3.202 lr:0.0000100 epoch_Time:28191.0min: [2024-01-03 05:47:04,743][model8_pretrain.py][INFO] Epoch:[0/2](115400/4588595) loss:3.267 lr:0.0000100 epoch_Time:28191.0min: [2024-01-03 05:47:04,743][model8_pretrain.py][INFO] Epoch:[0/2](115400/4588595) loss:3.077 lr:0.0000100 epoch_Time:28191.0min: [2024-01-03 05:47:41,686][model8_pretrain.py][INFO] Epoch:[0/2](115500/4588595) loss:3.441 lr:0.0000100 epoch_Time:28190.0min: [2024-01-03 05:47:41,686][model8_pretrain.py][INFO] Epoch:[0/2](115500/4588595) loss:3.596 lr:0.0000100 epoch_Time:28190.0min: [2024-01-03 05:47:41,686][model8_pretrain.py][INFO] Epoch:[0/2](115500/4588595) loss:3.000 lr:0.0000100 epoch_Time:28190.0min: [2024-01-03 05:47:41,686][model8_pretrain.py][INFO] 
Epoch:[0/2](115500/4588595) loss:3.008 lr:0.0000100 epoch_Time:28190.0min: [2024-01-03 05:47:41,686][model8_pretrain.py][INFO] Epoch:[0/2](115500/4588595) loss:2.961 lr:0.0000100 epoch_Time:28190.0min: [2024-01-03 05:47:41,686][model8_pretrain.py][INFO] Epoch:[0/2](115500/4588595) loss:3.360 lr:0.0000100 epoch_Time:28190.0min: [2024-01-03 05:47:41,687][model8_pretrain.py][INFO] Epoch:[0/2](115500/4588595) loss:2.545 lr:0.0000100 epoch_Time:28190.0min: [2024-01-03 05:47:41,687][model8_pretrain.py][INFO] Epoch:[0/2](115500/4588595) loss:3.025 lr:0.0000100 epoch_Time:28190.0min: [2024-01-03 05:48:18,674][model8_pretrain.py][INFO] Epoch:[0/2](115600/4588595) loss:2.661 lr:0.0000100 epoch_Time:28189.0min: [2024-01-03 05:48:18,675][model8_pretrain.py][INFO] Epoch:[0/2](115600/4588595) loss:3.304 lr:0.0000100 epoch_Time:28189.0min: [2024-01-03 05:48:18,675][model8_pretrain.py][INFO] Epoch:[0/2](115600/4588595) loss:2.789 lr:0.0000100 epoch_Time:28189.0min: [2024-01-03 05:48:18,675][model8_pretrain.py][INFO] Epoch:[0/2](115600/4588595) loss:2.789 lr:0.0000100 epoch_Time:28189.0min: [2024-01-03 05:48:18,675][model8_pretrain.py][INFO] Epoch:[0/2](115600/4588595) loss:2.812 lr:0.0000100 epoch_Time:28189.0min: [2024-01-03 05:48:18,675][model8_pretrain.py][INFO] Epoch:[0/2](115600/4588595) loss:2.751 lr:0.0000100 epoch_Time:28189.0min: [2024-01-03 05:48:18,675][model8_pretrain.py][INFO] Epoch:[0/2](115600/4588595) loss:3.184 lr:0.0000100 epoch_Time:28189.0min: [2024-01-03 05:48:18,675][model8_pretrain.py][INFO] Epoch:[0/2](115600/4588595) loss:2.877 lr:0.0000100 epoch_Time:28189.0min: [2024-01-03 05:48:55,621][model8_pretrain.py][INFO] Epoch:[0/2](115700/4588595) loss:2.815 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:48:55,621][model8_pretrain.py][INFO] Epoch:[0/2](115700/4588595) loss:2.871 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:48:55,621][model8_pretrain.py][INFO] Epoch:[0/2](115700/4588595) loss:3.016 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 
05:48:55,621][model8_pretrain.py][INFO] Epoch:[0/2](115700/4588595) loss:3.372 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:48:55,621][model8_pretrain.py][INFO] Epoch:[0/2](115700/4588595) loss:3.075 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:48:55,621][model8_pretrain.py][INFO] Epoch:[0/2](115700/4588595) loss:2.858 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:48:55,621][model8_pretrain.py][INFO] Epoch:[0/2](115700/4588595) loss:2.677 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:48:55,621][model8_pretrain.py][INFO] Epoch:[0/2](115700/4588595) loss:3.079 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:49:32,563][model8_pretrain.py][INFO] Epoch:[0/2](115800/4588595) loss:3.189 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:49:32,563][model8_pretrain.py][INFO] Epoch:[0/2](115800/4588595) loss:3.202 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:49:32,563][model8_pretrain.py][INFO] Epoch:[0/2](115800/4588595) loss:2.660 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:49:32,563][model8_pretrain.py][INFO] Epoch:[0/2](115800/4588595) loss:3.191 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:49:32,563][model8_pretrain.py][INFO] Epoch:[0/2](115800/4588595) loss:2.967 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:49:32,563][model8_pretrain.py][INFO] Epoch:[0/2](115800/4588595) loss:3.099 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:49:32,563][model8_pretrain.py][INFO] Epoch:[0/2](115800/4588595) loss:3.055 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:49:32,563][model8_pretrain.py][INFO] Epoch:[0/2](115800/4588595) loss:3.094 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:50:09,531][model8_pretrain.py][INFO] Epoch:[0/2](115900/4588595) loss:2.835 lr:0.0000100 epoch_Time:28185.0min: [2024-01-03 05:50:09,531][model8_pretrain.py][INFO] Epoch:[0/2](115900/4588595) loss:3.222 lr:0.0000100 epoch_Time:28185.0min: [2024-01-03 05:50:09,531][model8_pretrain.py][INFO] Epoch:[0/2](115900/4588595) loss:2.776 lr:0.0000100 
epoch_Time:28185.0min: [2024-01-03 05:50:09,531][model8_pretrain.py][INFO] Epoch:[0/2](115900/4588595) loss:3.172 lr:0.0000100 epoch_Time:28185.0min: [2024-01-03 05:50:09,531][model8_pretrain.py][INFO] Epoch:[0/2](115900/4588595) loss:2.564 lr:0.0000100 epoch_Time:28185.0min: [2024-01-03 05:50:09,531][model8_pretrain.py][INFO] Epoch:[0/2](115900/4588595) loss:3.269 lr:0.0000100 epoch_Time:28185.0min: [2024-01-03 05:50:09,531][model8_pretrain.py][INFO] Epoch:[0/2](115900/4588595) loss:3.077 lr:0.0000100 epoch_Time:28185.0min: [2024-01-03 05:50:09,531][model8_pretrain.py][INFO] Epoch:[0/2](115900/4588595) loss:2.998 lr:0.0000100 epoch_Time:28185.0min: [2024-01-03 05:50:46,474][model8_pretrain.py][INFO] Epoch:[0/2](116000/4588595) loss:2.826 lr:0.0000100 epoch_Time:28185.0min: [2024-01-03 05:50:46,474][model8_pretrain.py][INFO] Epoch:[0/2](116000/4588595) loss:2.604 lr:0.0000100 epoch_Time:28185.0min: [2024-01-03 05:50:46,474][model8_pretrain.py][INFO] Epoch:[0/2](116000/4588595) loss:3.317 lr:0.0000100 epoch_Time:28185.0min: [2024-01-03 05:50:46,474][model8_pretrain.py][INFO] Epoch:[0/2](116000/4588595) loss:3.041 lr:0.0000100 epoch_Time:28185.0min: [2024-01-03 05:50:46,474][model8_pretrain.py][INFO] Epoch:[0/2](116000/4588595) loss:3.150 lr:0.0000100 epoch_Time:28185.0min: [2024-01-03 05:50:46,474][model8_pretrain.py][INFO] Epoch:[0/2](116000/4588595) loss:2.854 lr:0.0000100 epoch_Time:28185.0min: [2024-01-03 05:50:46,474][model8_pretrain.py][INFO] Epoch:[0/2](116000/4588595) loss:2.797 lr:0.0000100 epoch_Time:28185.0min: [2024-01-03 05:50:46,474][model8_pretrain.py][INFO] Epoch:[0/2](116000/4588595) loss:3.159 lr:0.0000100 epoch_Time:28185.0min: [2024-01-03 05:51:23,403][model8_pretrain.py][INFO] Epoch:[0/2](116100/4588595) loss:3.387 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:51:23,403][model8_pretrain.py][INFO] Epoch:[0/2](116100/4588595) loss:3.083 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:51:23,403][model8_pretrain.py][INFO] 
Epoch:[0/2](116100/4588595) loss:3.110 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:51:23,403][model8_pretrain.py][INFO] Epoch:[0/2](116100/4588595) loss:3.174 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:51:23,403][model8_pretrain.py][INFO] Epoch:[0/2](116100/4588595) loss:3.314 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:51:23,403][model8_pretrain.py][INFO] Epoch:[0/2](116100/4588595) loss:3.365 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:51:23,404][model8_pretrain.py][INFO] Epoch:[0/2](116100/4588595) loss:2.824 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:51:23,405][model8_pretrain.py][INFO] Epoch:[0/2](116100/4588595) loss:3.245 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:52:09,059][model8_pretrain.py][INFO] Epoch:[0/2](116200/4588595) loss:3.218 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:52:09,059][model8_pretrain.py][INFO] Epoch:[0/2](116200/4588595) loss:2.792 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:52:09,059][model8_pretrain.py][INFO] Epoch:[0/2](116200/4588595) loss:2.522 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:52:09,059][model8_pretrain.py][INFO] Epoch:[0/2](116200/4588595) loss:3.124 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:52:09,059][model8_pretrain.py][INFO] Epoch:[0/2](116200/4588595) loss:3.139 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:52:09,059][model8_pretrain.py][INFO] Epoch:[0/2](116200/4588595) loss:2.342 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:52:09,060][model8_pretrain.py][INFO] Epoch:[0/2](116200/4588595) loss:2.846 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:52:09,059][model8_pretrain.py][INFO] Epoch:[0/2](116200/4588595) loss:3.398 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:52:45,995][model8_pretrain.py][INFO] Epoch:[0/2](116300/4588595) loss:3.408 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:52:45,995][model8_pretrain.py][INFO] Epoch:[0/2](116300/4588595) loss:2.192 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 
05:52:45,995][model8_pretrain.py][INFO] Epoch:[0/2](116300/4588595) loss:2.741 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:52:45,995][model8_pretrain.py][INFO] Epoch:[0/2](116300/4588595) loss:2.717 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:52:45,995][model8_pretrain.py][INFO] Epoch:[0/2](116300/4588595) loss:2.888 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:52:45,995][model8_pretrain.py][INFO] Epoch:[0/2](116300/4588595) loss:2.993 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:52:45,995][model8_pretrain.py][INFO] Epoch:[0/2](116300/4588595) loss:2.903 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:52:45,995][model8_pretrain.py][INFO] Epoch:[0/2](116300/4588595) loss:2.556 lr:0.0000100 epoch_Time:28187.0min: [2024-01-03 05:53:22,927][model8_pretrain.py][INFO] Epoch:[0/2](116400/4588595) loss:2.735 lr:0.0000100 epoch_Time:28185.0min: [2024-01-03 05:53:22,927][model8_pretrain.py][INFO] Epoch:[0/2](116400/4588595) loss:3.200 lr:0.0000100 epoch_Time:28185.0min: [2024-01-03 05:53:22,927][model8_pretrain.py][INFO] Epoch:[0/2](116400/4588595) loss:2.680 lr:0.0000100 epoch_Time:28185.0min: [2024-01-03 05:53:22,927][model8_pretrain.py][INFO] Epoch:[0/2](116400/4588595) loss:3.420 lr:0.0000100 epoch_Time:28185.0min: [2024-01-03 05:53:22,927][model8_pretrain.py][INFO] Epoch:[0/2](116400/4588595) loss:3.032 lr:0.0000100 epoch_Time:28185.0min: [2024-01-03 05:53:22,927][model8_pretrain.py][INFO] Epoch:[0/2](116400/4588595) loss:2.898 lr:0.0000100 epoch_Time:28185.0min: [2024-01-03 05:53:22,927][model8_pretrain.py][INFO] Epoch:[0/2](116400/4588595) loss:2.892 lr:0.0000100 epoch_Time:28185.0min: [2024-01-03 05:53:22,927][model8_pretrain.py][INFO] Epoch:[0/2](116400/4588595) loss:3.137 lr:0.0000100 epoch_Time:28185.0min: [2024-01-03 05:53:59,871][model8_pretrain.py][INFO] Epoch:[0/2](116500/4588595) loss:2.647 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:53:59,871][model8_pretrain.py][INFO] Epoch:[0/2](116500/4588595) loss:3.278 lr:0.0000100 
epoch_Time:28183.0min: [2024-01-03 05:53:59,871][model8_pretrain.py][INFO] Epoch:[0/2](116500/4588595) loss:3.132 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:53:59,871][model8_pretrain.py][INFO] Epoch:[0/2](116500/4588595) loss:2.380 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:53:59,871][model8_pretrain.py][INFO] Epoch:[0/2](116500/4588595) loss:3.289 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:53:59,871][model8_pretrain.py][INFO] Epoch:[0/2](116500/4588595) loss:3.064 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:53:59,871][model8_pretrain.py][INFO] Epoch:[0/2](116500/4588595) loss:3.208 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:53:59,871][model8_pretrain.py][INFO] Epoch:[0/2](116500/4588595) loss:2.892 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:54:36,812][model8_pretrain.py][INFO] Epoch:[0/2](116600/4588595) loss:3.052 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:54:36,812][model8_pretrain.py][INFO] Epoch:[0/2](116600/4588595) loss:3.065 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:54:36,812][model8_pretrain.py][INFO] Epoch:[0/2](116600/4588595) loss:3.127 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:54:36,812][model8_pretrain.py][INFO] Epoch:[0/2](116600/4588595) loss:2.363 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:54:36,812][model8_pretrain.py][INFO] Epoch:[0/2](116600/4588595) loss:2.896 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:54:36,812][model8_pretrain.py][INFO] Epoch:[0/2](116600/4588595) loss:2.876 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:54:36,812][model8_pretrain.py][INFO] Epoch:[0/2](116600/4588595) loss:2.551 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:54:36,812][model8_pretrain.py][INFO] Epoch:[0/2](116600/4588595) loss:2.702 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:55:13,750][model8_pretrain.py][INFO] Epoch:[0/2](116700/4588595) loss:3.154 lr:0.0000100 epoch_Time:28181.0min: [2024-01-03 05:55:13,750][model8_pretrain.py][INFO] 
Epoch:[0/2](116700/4588595) loss:2.763 lr:0.0000100 epoch_Time:28181.0min: [2024-01-03 05:55:13,750][model8_pretrain.py][INFO] Epoch:[0/2](116700/4588595) loss:2.506 lr:0.0000100 epoch_Time:28181.0min: [2024-01-03 05:55:13,750][model8_pretrain.py][INFO] Epoch:[0/2](116700/4588595) loss:3.036 lr:0.0000100 epoch_Time:28181.0min: [2024-01-03 05:55:13,750][model8_pretrain.py][INFO] Epoch:[0/2](116700/4588595) loss:3.021 lr:0.0000100 epoch_Time:28181.0min: [2024-01-03 05:55:13,750][model8_pretrain.py][INFO] Epoch:[0/2](116700/4588595) loss:2.816 lr:0.0000100 epoch_Time:28181.0min: [2024-01-03 05:55:13,750][model8_pretrain.py][INFO] Epoch:[0/2](116700/4588595) loss:3.253 lr:0.0000100 epoch_Time:28181.0min: [2024-01-03 05:55:13,750][model8_pretrain.py][INFO] Epoch:[0/2](116700/4588595) loss:2.719 lr:0.0000100 epoch_Time:28181.0min: [2024-01-03 05:55:50,701][model8_pretrain.py][INFO] Epoch:[0/2](116800/4588595) loss:3.009 lr:0.0000100 epoch_Time:28180.0min: [2024-01-03 05:55:50,701][model8_pretrain.py][INFO] Epoch:[0/2](116800/4588595) loss:2.972 lr:0.0000100 epoch_Time:28180.0min: [2024-01-03 05:55:50,701][model8_pretrain.py][INFO] Epoch:[0/2](116800/4588595) loss:2.961 lr:0.0000100 epoch_Time:28180.0min: [2024-01-03 05:55:50,701][model8_pretrain.py][INFO] Epoch:[0/2](116800/4588595) loss:2.429 lr:0.0000100 epoch_Time:28180.0min: [2024-01-03 05:55:50,701][model8_pretrain.py][INFO] Epoch:[0/2](116800/4588595) loss:3.320 lr:0.0000100 epoch_Time:28180.0min: [2024-01-03 05:55:50,701][model8_pretrain.py][INFO] Epoch:[0/2](116800/4588595) loss:3.158 lr:0.0000100 epoch_Time:28180.0min: [2024-01-03 05:55:50,701][model8_pretrain.py][INFO] Epoch:[0/2](116800/4588595) loss:2.814 lr:0.0000100 epoch_Time:28180.0min: [2024-01-03 05:55:50,701][model8_pretrain.py][INFO] Epoch:[0/2](116800/4588595) loss:3.375 lr:0.0000100 epoch_Time:28180.0min: [2024-01-03 05:56:27,651][model8_pretrain.py][INFO] Epoch:[0/2](116900/4588595) loss:2.633 lr:0.0000100 epoch_Time:28179.0min: [2024-01-03 
05:56:27,651][model8_pretrain.py][INFO] Epoch:[0/2](116900/4588595) loss:2.780 lr:0.0000100 epoch_Time:28179.0min: [2024-01-03 05:56:27,651][model8_pretrain.py][INFO] Epoch:[0/2](116900/4588595) loss:3.003 lr:0.0000100 epoch_Time:28179.0min: [2024-01-03 05:56:27,651][model8_pretrain.py][INFO] Epoch:[0/2](116900/4588595) loss:3.226 lr:0.0000100 epoch_Time:28179.0min: [2024-01-03 05:56:27,651][model8_pretrain.py][INFO] Epoch:[0/2](116900/4588595) loss:3.028 lr:0.0000100 epoch_Time:28179.0min: [2024-01-03 05:56:27,651][model8_pretrain.py][INFO] Epoch:[0/2](116900/4588595) loss:3.228 lr:0.0000100 epoch_Time:28179.0min: [2024-01-03 05:56:27,651][model8_pretrain.py][INFO] Epoch:[0/2](116900/4588595) loss:3.009 lr:0.0000100 epoch_Time:28179.0min: [2024-01-03 05:56:27,651][model8_pretrain.py][INFO] Epoch:[0/2](116900/4588595) loss:2.914 lr:0.0000100 epoch_Time:28179.0min: [2024-01-03 05:57:13,353][model8_pretrain.py][INFO] Epoch:[0/2](117000/4588595) loss:3.122 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:57:13,353][model8_pretrain.py][INFO] Epoch:[0/2](117000/4588595) loss:2.496 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:57:13,353][model8_pretrain.py][INFO] Epoch:[0/2](117000/4588595) loss:2.921 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:57:13,353][model8_pretrain.py][INFO] Epoch:[0/2](117000/4588595) loss:2.457 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:57:13,353][model8_pretrain.py][INFO] Epoch:[0/2](117000/4588595) loss:3.295 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:57:13,353][model8_pretrain.py][INFO] Epoch:[0/2](117000/4588595) loss:2.535 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:57:13,353][model8_pretrain.py][INFO] Epoch:[0/2](117000/4588595) loss:2.727 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:57:13,353][model8_pretrain.py][INFO] Epoch:[0/2](117000/4588595) loss:2.878 lr:0.0000100 epoch_Time:28183.0min: [2024-01-03 05:57:50,297][model8_pretrain.py][INFO] Epoch:[0/2](117100/4588595) loss:3.390 lr:0.0000100 
epoch_Time:28182.0min: [2024-01-03 05:57:50,297][model8_pretrain.py][INFO] Epoch:[0/2](117100/4588595) loss:2.695 lr:0.0000100 epoch_Time:28182.0min: [2024-01-03 05:57:50,297][model8_pretrain.py][INFO] Epoch:[0/2](117100/4588595) loss:3.307 lr:0.0000100 epoch_Time:28182.0min: [2024-01-03 05:57:50,297][model8_pretrain.py][INFO] Epoch:[0/2](117100/4588595) loss:3.220 lr:0.0000100 epoch_Time:28182.0min: [2024-01-03 05:57:50,297][model8_pretrain.py][INFO] Epoch:[0/2](117100/4588595) loss:3.397 lr:0.0000100 epoch_Time:28182.0min: [2024-01-03 05:57:50,297][model8_pretrain.py][INFO] Epoch:[0/2](117100/4588595) loss:3.183 lr:0.0000100 epoch_Time:28182.0min: [2024-01-03 05:57:50,297][model8_pretrain.py][INFO] Epoch:[0/2](117100/4588595) loss:2.799 lr:0.0000100 epoch_Time:28182.0min: [2024-01-03 05:57:50,297][model8_pretrain.py][INFO] Epoch:[0/2](117100/4588595) loss:2.942 lr:0.0000100 epoch_Time:28182.0min: [2024-01-03 05:58:27,287][model8_pretrain.py][INFO] Epoch:[0/2](117200/4588595) loss:3.164 lr:0.0000100 epoch_Time:28181.0min: [2024-01-03 05:58:27,287][model8_pretrain.py][INFO] Epoch:[0/2](117200/4588595) loss:2.994 lr:0.0000100 epoch_Time:28181.0min: [2024-01-03 05:58:27,287][model8_pretrain.py][INFO] Epoch:[0/2](117200/4588595) loss:2.701 lr:0.0000100 epoch_Time:28181.0min: [2024-01-03 05:58:27,287][model8_pretrain.py][INFO] Epoch:[0/2](117200/4588595) loss:2.787 lr:0.0000100 epoch_Time:28181.0min: [2024-01-03 05:58:27,287][model8_pretrain.py][INFO] Epoch:[0/2](117200/4588595) loss:1.860 lr:0.0000100 epoch_Time:28181.0min: [2024-01-03 05:58:27,287][model8_pretrain.py][INFO] Epoch:[0/2](117200/4588595) loss:2.848 lr:0.0000100 epoch_Time:28181.0min: [2024-01-03 05:58:27,287][model8_pretrain.py][INFO] Epoch:[0/2](117200/4588595) loss:3.025 lr:0.0000100 epoch_Time:28181.0min: [2024-01-03 05:58:27,287][model8_pretrain.py][INFO] Epoch:[0/2](117200/4588595) loss:2.719 lr:0.0000100 epoch_Time:28181.0min: [2024-01-03 05:59:04,244][model8_pretrain.py][INFO] 
Epoch:[0/2](117300/4588595) loss:2.980 lr:0.0000100 epoch_Time:28180.0min: [2024-01-03 05:59:04,244][model8_pretrain.py][INFO] Epoch:[0/2](117300/4588595) loss:3.000 lr:0.0000100 epoch_Time:28180.0min: [2024-01-03 05:59:04,244][model8_pretrain.py][INFO] Epoch:[0/2](117300/4588595) loss:2.530 lr:0.0000100 epoch_Time:28180.0min: [2024-01-03 05:59:04,244][model8_pretrain.py][INFO] Epoch:[0/2](117300/4588595) loss:3.207 lr:0.0000100 epoch_Time:28180.0min: [2024-01-03 05:59:04,244][model8_pretrain.py][INFO] Epoch:[0/2](117300/4588595) loss:2.849 lr:0.0000100 epoch_Time:28180.0min: [2024-01-03 05:59:04,244][model8_pretrain.py][INFO] Epoch:[0/2](117300/4588595) loss:2.667 lr:0.0000100 epoch_Time:28180.0min: [2024-01-03 05:59:04,244][model8_pretrain.py][INFO] Epoch:[0/2](117300/4588595) loss:3.063 lr:0.0000100 epoch_Time:28180.0min: [2024-01-03 05:59:04,244][model8_pretrain.py][INFO] Epoch:[0/2](117300/4588595) loss:2.939 lr:0.0000100 epoch_Time:28180.0min: [2024-01-03 05:59:41,223][model8_pretrain.py][INFO] Epoch:[0/2](117400/4588595) loss:2.645 lr:0.0000100 epoch_Time:28179.0min: [2024-01-03 05:59:41,223][model8_pretrain.py][INFO] Epoch:[0/2](117400/4588595) loss:3.305 lr:0.0000100 epoch_Time:28179.0min: [2024-01-03 05:59:41,223][model8_pretrain.py][INFO] Epoch:[0/2](117400/4588595) loss:3.236 lr:0.0000100 epoch_Time:28179.0min: [2024-01-03 05:59:41,223][model8_pretrain.py][INFO] Epoch:[0/2](117400/4588595) loss:2.612 lr:0.0000100 epoch_Time:28179.0min: [2024-01-03 05:59:41,223][model8_pretrain.py][INFO] Epoch:[0/2](117400/4588595) loss:2.776 lr:0.0000100 epoch_Time:28179.0min: [2024-01-03 05:59:41,223][model8_pretrain.py][INFO] Epoch:[0/2](117400/4588595) loss:2.938 lr:0.0000100 epoch_Time:28179.0min: [2024-01-03 05:59:41,223][model8_pretrain.py][INFO] Epoch:[0/2](117400/4588595) loss:3.253 lr:0.0000100 epoch_Time:28179.0min: [2024-01-03 05:59:41,223][model8_pretrain.py][INFO] Epoch:[0/2](117400/4588595) loss:3.249 lr:0.0000100 epoch_Time:28179.0min: [2024-01-03 
06:00:18,248][model8_pretrain.py][INFO] Epoch:[0/2](117500/4588595) loss:3.083 lr:0.0000100 epoch_Time:28178.0min: [2024-01-03 06:00:18,248][model8_pretrain.py][INFO] Epoch:[0/2](117500/4588595) loss:2.848 lr:0.0000100 epoch_Time:28178.0min: [2024-01-03 06:00:18,248][model8_pretrain.py][INFO] Epoch:[0/2](117500/4588595) loss:3.306 lr:0.0000100 epoch_Time:28178.0min: [2024-01-03 06:00:18,248][model8_pretrain.py][INFO] Epoch:[0/2](117500/4588595) loss:3.418 lr:0.0000100 epoch_Time:28178.0min: [2024-01-03 06:00:18,248][model8_pretrain.py][INFO] Epoch:[0/2](117500/4588595) loss:3.442 lr:0.0000100 epoch_Time:28178.0min: [2024-01-03 06:00:18,248][model8_pretrain.py][INFO] Epoch:[0/2](117500/4588595) loss:3.384 lr:0.0000100 epoch_Time:28178.0min: [2024-01-03 06:00:18,248][model8_pretrain.py][INFO] Epoch:[0/2](117500/4588595) loss:3.050 lr:0.0000100 epoch_Time:28178.0min: [2024-01-03 06:00:18,248][model8_pretrain.py][INFO] Epoch:[0/2](117500/4588595) loss:2.652 lr:0.0000100 epoch_Time:28178.0min: [2024-01-03 06:00:55,190][model8_pretrain.py][INFO] Epoch:[0/2](117600/4588595) loss:3.039 lr:0.0000100 epoch_Time:28176.0min: [2024-01-03 06:00:55,190][model8_pretrain.py][INFO] Epoch:[0/2](117600/4588595) loss:2.945 lr:0.0000100 epoch_Time:28176.0min: [2024-01-03 06:00:55,190][model8_pretrain.py][INFO] Epoch:[0/2](117600/4588595) loss:2.815 lr:0.0000100 epoch_Time:28176.0min: [2024-01-03 06:00:55,190][model8_pretrain.py][INFO] Epoch:[0/2](117600/4588595) loss:3.408 lr:0.0000100 epoch_Time:28176.0min: [2024-01-03 06:00:55,190][model8_pretrain.py][INFO] Epoch:[0/2](117600/4588595) loss:2.781 lr:0.0000100 epoch_Time:28176.0min: [2024-01-03 06:00:55,190][model8_pretrain.py][INFO] Epoch:[0/2](117600/4588595) loss:3.245 lr:0.0000100 epoch_Time:28176.0min: [2024-01-03 06:00:55,190][model8_pretrain.py][INFO] Epoch:[0/2](117600/4588595) loss:3.311 lr:0.0000100 epoch_Time:28176.0min: [2024-01-03 06:00:55,190][model8_pretrain.py][INFO] Epoch:[0/2](117600/4588595) loss:3.087 lr:0.0000100 
epoch_Time:28176.0min: [2024-01-03 06:01:32,138][model8_pretrain.py][INFO] Epoch:[0/2](117700/4588595) loss:2.493 lr:0.0000100 epoch_Time:28175.0min: [2024-01-03 06:01:32,138][model8_pretrain.py][INFO] Epoch:[0/2](117700/4588595) loss:3.320 lr:0.0000100 epoch_Time:28175.0min: [2024-01-03 06:01:32,138][model8_pretrain.py][INFO] Epoch:[0/2](117700/4588595) loss:2.110 lr:0.0000100 epoch_Time:28175.0min: [2024-01-03 06:01:32,138][model8_pretrain.py][INFO] Epoch:[0/2](117700/4588595) loss:3.051 lr:0.0000100 epoch_Time:28175.0min: [2024-01-03 06:01:32,138][model8_pretrain.py][INFO] Epoch:[0/2](117700/4588595) loss:2.477 lr:0.0000100 epoch_Time:28175.0min: [2024-01-03 06:01:32,138][model8_pretrain.py][INFO] Epoch:[0/2](117700/4588595) loss:2.836 lr:0.0000100 epoch_Time:28175.0min: [2024-01-03 06:01:32,138][model8_pretrain.py][INFO] Epoch:[0/2](117700/4588595) loss:3.082 lr:0.0000100 epoch_Time:28175.0min: [2024-01-03 06:01:32,138][model8_pretrain.py][INFO] Epoch:[0/2](117700/4588595) loss:3.134 lr:0.0000100 epoch_Time:28175.0min: [2024-01-03 06:02:17,790][model8_pretrain.py][INFO] Epoch:[0/2](117800/4588595) loss:2.750 lr:0.0000100 epoch_Time:28180.0min: [2024-01-03 06:02:17,790][model8_pretrain.py][INFO] Epoch:[0/2](117800/4588595) loss:3.003 lr:0.0000100 epoch_Time:28180.0min: [2024-01-03 06:02:17,791][model8_pretrain.py][INFO] Epoch:[0/2](117800/4588595) loss:3.594 lr:0.0000100 epoch_Time:28180.0min: [2024-01-03 06:02:17,791][model8_pretrain.py][INFO] Epoch:[0/2](117800/4588595) loss:3.674 lr:0.0000100 epoch_Time:28180.0min: [2024-01-03 06:02:17,791][model8_pretrain.py][INFO] Epoch:[0/2](117800/4588595) loss:3.015 lr:0.0000100 epoch_Time:28180.0min: [2024-01-03 06:02:17,791][model8_pretrain.py][INFO] Epoch:[0/2](117800/4588595) loss:2.901 lr:0.0000100 epoch_Time:28180.0min: [2024-01-03 06:02:17,791][model8_pretrain.py][INFO] Epoch:[0/2](117800/4588595) loss:3.294 lr:0.0000100 epoch_Time:28180.0min: [2024-01-03 06:02:17,791][model8_pretrain.py][INFO] 
Epoch:[0/2](117800/4588595) loss:2.691 lr:0.0000100 epoch_Time:28180.0min: [2024-01-03 06:02:54,725][model8_pretrain.py][INFO] Epoch:[0/2](117900/4588595) loss:3.017 lr:0.0000100 epoch_Time:28178.0min: [2024-01-03 06:02:54,725][model8_pretrain.py][INFO] Epoch:[0/2](117900/4588595) loss:3.268 lr:0.0000100 epoch_Time:28178.0min: [2024-01-03 06:02:54,725][model8_pretrain.py][INFO] Epoch:[0/2](117900/4588595) loss:3.283 lr:0.0000100 epoch_Time:28178.0min: [2024-01-03 06:02:54,725][model8_pretrain.py][INFO] Epoch:[0/2](117900/4588595) loss:2.921 lr:0.0000100 epoch_Time:28178.0min: [2024-01-03 06:02:54,725][model8_pretrain.py][INFO] Epoch:[0/2](117900/4588595) loss:2.650 lr:0.0000100 epoch_Time:28178.0min: [2024-01-03 06:02:54,725][model8_pretrain.py][INFO] Epoch:[0/2](117900/4588595) loss:2.761 lr:0.0000100 epoch_Time:28178.0min: [2024-01-03 06:02:54,725][model8_pretrain.py][INFO] Epoch:[0/2](117900/4588595) loss:2.884 lr:0.0000100 epoch_Time:28178.0min: [2024-01-03 06:02:54,725][model8_pretrain.py][INFO] Epoch:[0/2](117900/4588595) loss:2.856 lr:0.0000100 epoch_Time:28178.0min: [2024-01-03 06:03:31,663][model8_pretrain.py][INFO] Epoch:[0/2](118000/4588595) loss:2.906 lr:0.0000100 epoch_Time:28177.0min: [2024-01-03 06:03:31,663][model8_pretrain.py][INFO] Epoch:[0/2](118000/4588595) loss:2.359 lr:0.0000100 epoch_Time:28177.0min: [2024-01-03 06:03:31,663][model8_pretrain.py][INFO] Epoch:[0/2](118000/4588595) loss:3.501 lr:0.0000100 epoch_Time:28177.0min: [2024-01-03 06:03:31,663][model8_pretrain.py][INFO] Epoch:[0/2](118000/4588595) loss:3.190 lr:0.0000100 epoch_Time:28177.0min: [2024-01-03 06:03:31,663][model8_pretrain.py][INFO] Epoch:[0/2](118000/4588595) loss:2.657 lr:0.0000100 epoch_Time:28177.0min: [2024-01-03 06:03:31,663][model8_pretrain.py][INFO] Epoch:[0/2](118000/4588595) loss:2.695 lr:0.0000100 epoch_Time:28177.0min: [2024-01-03 06:03:31,664][model8_pretrain.py][INFO] Epoch:[0/2](118000/4588595) loss:3.220 lr:0.0000100 epoch_Time:28177.0min: [2024-01-03 
06:03:31,664][model8_pretrain.py][INFO] Epoch:[0/2](118000/4588595) loss:3.110 lr:0.0000100 epoch_Time:28177.0min: [2024-01-03 06:04:08,602][model8_pretrain.py][INFO] Epoch:[0/2](118100/4588595) loss:2.979 lr:0.0000100 epoch_Time:28176.0min: [2024-01-03 06:04:08,602][model8_pretrain.py][INFO] Epoch:[0/2](118100/4588595) loss:3.000 lr:0.0000100 epoch_Time:28176.0min: [2024-01-03 06:04:08,602][model8_pretrain.py][INFO] Epoch:[0/2](118100/4588595) loss:2.470 lr:0.0000100 epoch_Time:28176.0min: [2024-01-03 06:04:08,602][model8_pretrain.py][INFO] Epoch:[0/2](118100/4588595) loss:3.094 lr:0.0000100 epoch_Time:28176.0min: [2024-01-03 06:04:08,602][model8_pretrain.py][INFO] Epoch:[0/2](118100/4588595) loss:2.803 lr:0.0000100 epoch_Time:28176.0min: [2024-01-03 06:04:08,602][model8_pretrain.py][INFO] Epoch:[0/2](118100/4588595) loss:3.094 lr:0.0000100 epoch_Time:28176.0min: [2024-01-03 06:04:08,602][model8_pretrain.py][INFO] Epoch:[0/2](118100/4588595) loss:2.906 lr:0.0000100 epoch_Time:28176.0min: [2024-01-03 06:04:08,602][model8_pretrain.py][INFO] Epoch:[0/2](118100/4588595) loss:2.905 lr:0.0000100 epoch_Time:28176.0min: [2024-01-03 06:04:45,545][model8_pretrain.py][INFO] Epoch:[0/2](118200/4588595) loss:3.074 lr:0.0000100 epoch_Time:28175.0min: [2024-01-03 06:04:45,545][model8_pretrain.py][INFO] Epoch:[0/2](118200/4588595) loss:3.038 lr:0.0000100 epoch_Time:28175.0min: [2024-01-03 06:04:45,545][model8_pretrain.py][INFO] Epoch:[0/2](118200/4588595) loss:3.092 lr:0.0000100 epoch_Time:28175.0min: [2024-01-03 06:04:45,545][model8_pretrain.py][INFO] Epoch:[0/2](118200/4588595) loss:3.150 lr:0.0000100 epoch_Time:28175.0min: [2024-01-03 06:04:45,546][model8_pretrain.py][INFO] Epoch:[0/2](118200/4588595) loss:2.702 lr:0.0000100 epoch_Time:28175.0min: [2024-01-03 06:04:45,546][model8_pretrain.py][INFO] Epoch:[0/2](118200/4588595) loss:2.732 lr:0.0000100 epoch_Time:28175.0min: [2024-01-03 06:04:45,546][model8_pretrain.py][INFO] Epoch:[0/2](118200/4588595) loss:3.152 lr:0.0000100 
epoch_Time:28175.0min: [2024-01-03 06:04:45,546][model8_pretrain.py][INFO] Epoch:[0/2](118200/4588595) loss:2.458 lr:0.0000100 epoch_Time:28175.0min: [2024-01-03 06:05:22,485][model8_pretrain.py][INFO] Epoch:[0/2](118300/4588595) loss:3.388 lr:0.0000100 epoch_Time:28174.0min: [2024-01-03 06:05:22,485][model8_pretrain.py][INFO] Epoch:[0/2](118300/4588595) loss:2.854 lr:0.0000100 epoch_Time:28174.0min: [2024-01-03 06:05:22,485][model8_pretrain.py][INFO] Epoch:[0/2](118300/4588595) loss:3.055 lr:0.0000100 epoch_Time:28174.0min: [2024-01-03 06:05:22,485][model8_pretrain.py][INFO] Epoch:[0/2](118300/4588595) loss:2.880 lr:0.0000100 epoch_Time:28174.0min: [2024-01-03 06:05:22,485][model8_pretrain.py][INFO] Epoch:[0/2](118300/4588595) loss:3.292 lr:0.0000100 epoch_Time:28174.0min: [2024-01-03 06:05:22,485][model8_pretrain.py][INFO] Epoch:[0/2](118300/4588595) loss:2.600 lr:0.0000100 epoch_Time:28174.0min: [2024-01-03 06:05:22,485][model8_pretrain.py][INFO] Epoch:[0/2](118300/4588595) loss:2.809 lr:0.0000100 epoch_Time:28174.0min: [2024-01-03 06:05:22,485][model8_pretrain.py][INFO] Epoch:[0/2](118300/4588595) loss:3.015 lr:0.0000100 epoch_Time:28174.0min: [2024-01-03 06:05:59,441][model8_pretrain.py][INFO] Epoch:[0/2](118400/4588595) loss:2.870 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:05:59,441][model8_pretrain.py][INFO] Epoch:[0/2](118400/4588595) loss:3.334 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:05:59,441][model8_pretrain.py][INFO] Epoch:[0/2](118400/4588595) loss:3.172 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:05:59,441][model8_pretrain.py][INFO] Epoch:[0/2](118400/4588595) loss:2.932 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:05:59,441][model8_pretrain.py][INFO] Epoch:[0/2](118400/4588595) loss:3.037 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:05:59,441][model8_pretrain.py][INFO] Epoch:[0/2](118400/4588595) loss:2.880 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:05:59,441][model8_pretrain.py][INFO] 
Epoch:[0/2](118400/4588595) loss:2.938 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:05:59,441][model8_pretrain.py][INFO] Epoch:[0/2](118400/4588595) loss:2.873 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:06:36,388][model8_pretrain.py][INFO] Epoch:[0/2](118500/4588595) loss:2.947 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:06:36,388][model8_pretrain.py][INFO] Epoch:[0/2](118500/4588595) loss:3.088 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:06:36,388][model8_pretrain.py][INFO] Epoch:[0/2](118500/4588595) loss:3.383 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:06:36,389][model8_pretrain.py][INFO] Epoch:[0/2](118500/4588595) loss:2.605 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:06:36,389][model8_pretrain.py][INFO] Epoch:[0/2](118500/4588595) loss:3.163 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:06:36,389][model8_pretrain.py][INFO] Epoch:[0/2](118500/4588595) loss:3.202 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:06:36,389][model8_pretrain.py][INFO] Epoch:[0/2](118500/4588595) loss:3.032 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:06:36,389][model8_pretrain.py][INFO] Epoch:[0/2](118500/4588595) loss:3.171 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:07:22,047][model8_pretrain.py][INFO] Epoch:[0/2](118600/4588595) loss:3.283 lr:0.0000100 epoch_Time:28176.0min: [2024-01-03 06:07:22,047][model8_pretrain.py][INFO] Epoch:[0/2](118600/4588595) loss:2.542 lr:0.0000100 epoch_Time:28176.0min: [2024-01-03 06:07:22,047][model8_pretrain.py][INFO] Epoch:[0/2](118600/4588595) loss:2.143 lr:0.0000100 epoch_Time:28176.0min: [2024-01-03 06:07:22,047][model8_pretrain.py][INFO] Epoch:[0/2](118600/4588595) loss:3.191 lr:0.0000100 epoch_Time:28176.0min: [2024-01-03 06:07:22,047][model8_pretrain.py][INFO] Epoch:[0/2](118600/4588595) loss:2.629 lr:0.0000100 epoch_Time:28176.0min: [2024-01-03 06:07:22,047][model8_pretrain.py][INFO] Epoch:[0/2](118600/4588595) loss:3.180 lr:0.0000100 epoch_Time:28176.0min: [2024-01-03 
06:07:22,047][model8_pretrain.py][INFO] Epoch:[0/2](118600/4588595) loss:2.925 lr:0.0000100 epoch_Time:28176.0min: [2024-01-03 06:07:22,047][model8_pretrain.py][INFO] Epoch:[0/2](118600/4588595) loss:2.812 lr:0.0000100 epoch_Time:28176.0min: [2024-01-03 06:07:59,005][model8_pretrain.py][INFO] Epoch:[0/2](118700/4588595) loss:3.009 lr:0.0000100 epoch_Time:28174.0min: [2024-01-03 06:07:59,005][model8_pretrain.py][INFO] Epoch:[0/2](118700/4588595) loss:2.866 lr:0.0000100 epoch_Time:28174.0min: [2024-01-03 06:07:59,005][model8_pretrain.py][INFO] Epoch:[0/2](118700/4588595) loss:2.728 lr:0.0000100 epoch_Time:28174.0min: [2024-01-03 06:07:59,005][model8_pretrain.py][INFO] Epoch:[0/2](118700/4588595) loss:2.752 lr:0.0000100 epoch_Time:28174.0min: [2024-01-03 06:07:59,005][model8_pretrain.py][INFO] Epoch:[0/2](118700/4588595) loss:2.453 lr:0.0000100 epoch_Time:28174.0min: [2024-01-03 06:07:59,005][model8_pretrain.py][INFO] Epoch:[0/2](118700/4588595) loss:3.274 lr:0.0000100 epoch_Time:28174.0min: [2024-01-03 06:07:59,005][model8_pretrain.py][INFO] Epoch:[0/2](118700/4588595) loss:3.335 lr:0.0000100 epoch_Time:28174.0min: [2024-01-03 06:07:59,005][model8_pretrain.py][INFO] Epoch:[0/2](118700/4588595) loss:2.886 lr:0.0000100 epoch_Time:28174.0min: [2024-01-03 06:08:36,010][model8_pretrain.py][INFO] Epoch:[0/2](118800/4588595) loss:3.193 lr:0.0000100 epoch_Time:28174.0min: [2024-01-03 06:08:36,010][model8_pretrain.py][INFO] Epoch:[0/2](118800/4588595) loss:3.011 lr:0.0000100 epoch_Time:28174.0min: [2024-01-03 06:08:36,010][model8_pretrain.py][INFO] Epoch:[0/2](118800/4588595) loss:3.018 lr:0.0000100 epoch_Time:28174.0min: [2024-01-03 06:08:36,010][model8_pretrain.py][INFO] Epoch:[0/2](118800/4588595) loss:3.052 lr:0.0000100 epoch_Time:28174.0min: [2024-01-03 06:08:36,010][model8_pretrain.py][INFO] Epoch:[0/2](118800/4588595) loss:3.141 lr:0.0000100 epoch_Time:28174.0min: [2024-01-03 06:08:36,010][model8_pretrain.py][INFO] Epoch:[0/2](118800/4588595) loss:2.920 lr:0.0000100 
epoch_Time:28174.0min: [2024-01-03 06:08:36,010][model8_pretrain.py][INFO] Epoch:[0/2](118800/4588595) loss:2.943 lr:0.0000100 epoch_Time:28174.0min: [2024-01-03 06:08:36,010][model8_pretrain.py][INFO] Epoch:[0/2](118800/4588595) loss:2.490 lr:0.0000100 epoch_Time:28174.0min: [2024-01-03 06:09:12,968][model8_pretrain.py][INFO] Epoch:[0/2](118900/4588595) loss:3.133 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:09:12,968][model8_pretrain.py][INFO] Epoch:[0/2](118900/4588595) loss:2.771 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:09:12,968][model8_pretrain.py][INFO] Epoch:[0/2](118900/4588595) loss:2.881 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:09:12,968][model8_pretrain.py][INFO] Epoch:[0/2](118900/4588595) loss:3.225 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:09:12,968][model8_pretrain.py][INFO] Epoch:[0/2](118900/4588595) loss:3.214 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:09:12,968][model8_pretrain.py][INFO] Epoch:[0/2](118900/4588595) loss:2.160 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:09:12,968][model8_pretrain.py][INFO] Epoch:[0/2](118900/4588595) loss:2.264 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:09:12,968][model8_pretrain.py][INFO] Epoch:[0/2](118900/4588595) loss:3.260 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:09:49,916][model8_pretrain.py][INFO] Epoch:[0/2](119000/4588595) loss:3.028 lr:0.0000100 epoch_Time:28170.0min: [2024-01-03 06:09:49,916][model8_pretrain.py][INFO] Epoch:[0/2](119000/4588595) loss:2.823 lr:0.0000100 epoch_Time:28170.0min: [2024-01-03 06:09:49,916][model8_pretrain.py][INFO] Epoch:[0/2](119000/4588595) loss:3.201 lr:0.0000100 epoch_Time:28170.0min: [2024-01-03 06:09:49,916][model8_pretrain.py][INFO] Epoch:[0/2](119000/4588595) loss:2.795 lr:0.0000100 epoch_Time:28170.0min: [2024-01-03 06:09:49,916][model8_pretrain.py][INFO] Epoch:[0/2](119000/4588595) loss:2.794 lr:0.0000100 epoch_Time:28170.0min: [2024-01-03 06:09:49,916][model8_pretrain.py][INFO] 
Epoch:[0/2](119000/4588595) loss:3.211 lr:0.0000100 epoch_Time:28170.0min: [2024-01-03 06:09:49,916][model8_pretrain.py][INFO] Epoch:[0/2](119000/4588595) loss:2.911 lr:0.0000100 epoch_Time:28170.0min: [2024-01-03 06:09:49,916][model8_pretrain.py][INFO] Epoch:[0/2](119000/4588595) loss:3.012 lr:0.0000100 epoch_Time:28170.0min: [2024-01-03 06:10:26,869][model8_pretrain.py][INFO] Epoch:[0/2](119100/4588595) loss:3.075 lr:0.0000100 epoch_Time:28170.0min: [2024-01-03 06:10:26,869][model8_pretrain.py][INFO] Epoch:[0/2](119100/4588595) loss:2.664 lr:0.0000100 epoch_Time:28170.0min: [2024-01-03 06:10:26,869][model8_pretrain.py][INFO] Epoch:[0/2](119100/4588595) loss:2.690 lr:0.0000100 epoch_Time:28170.0min: [2024-01-03 06:10:26,869][model8_pretrain.py][INFO] Epoch:[0/2](119100/4588595) loss:3.039 lr:0.0000100 epoch_Time:28170.0min: [2024-01-03 06:10:26,869][model8_pretrain.py][INFO] Epoch:[0/2](119100/4588595) loss:2.958 lr:0.0000100 epoch_Time:28170.0min: [2024-01-03 06:10:26,869][model8_pretrain.py][INFO] Epoch:[0/2](119100/4588595) loss:2.684 lr:0.0000100 epoch_Time:28170.0min: [2024-01-03 06:10:26,869][model8_pretrain.py][INFO] Epoch:[0/2](119100/4588595) loss:3.425 lr:0.0000100 epoch_Time:28170.0min: [2024-01-03 06:10:26,869][model8_pretrain.py][INFO] Epoch:[0/2](119100/4588595) loss:3.001 lr:0.0000100 epoch_Time:28170.0min: [2024-01-03 06:11:03,817][model8_pretrain.py][INFO] Epoch:[0/2](119200/4588595) loss:3.446 lr:0.0000100 epoch_Time:28168.0min: [2024-01-03 06:11:03,817][model8_pretrain.py][INFO] Epoch:[0/2](119200/4588595) loss:3.150 lr:0.0000100 epoch_Time:28168.0min: [2024-01-03 06:11:03,817][model8_pretrain.py][INFO] Epoch:[0/2](119200/4588595) loss:2.481 lr:0.0000100 epoch_Time:28168.0min: [2024-01-03 06:11:03,817][model8_pretrain.py][INFO] Epoch:[0/2](119200/4588595) loss:2.717 lr:0.0000100 epoch_Time:28168.0min: [2024-01-03 06:11:03,817][model8_pretrain.py][INFO] Epoch:[0/2](119200/4588595) loss:2.829 lr:0.0000100 epoch_Time:28168.0min: [2024-01-03 
06:11:03,817][model8_pretrain.py][INFO] Epoch:[0/2](119200/4588595) loss:3.312 lr:0.0000100 epoch_Time:28168.0min: [2024-01-03 06:11:03,817][model8_pretrain.py][INFO] Epoch:[0/2](119200/4588595) loss:3.157 lr:0.0000100 epoch_Time:28168.0min: [2024-01-03 06:11:03,817][model8_pretrain.py][INFO] Epoch:[0/2](119200/4588595) loss:3.578 lr:0.0000100 epoch_Time:28168.0min: [2024-01-03 06:11:40,755][model8_pretrain.py][INFO] Epoch:[0/2](119300/4588595) loss:3.045 lr:0.0000100 epoch_Time:28168.0min: [2024-01-03 06:11:40,755][model8_pretrain.py][INFO] Epoch:[0/2](119300/4588595) loss:2.912 lr:0.0000100 epoch_Time:28168.0min: [2024-01-03 06:11:40,755][model8_pretrain.py][INFO] Epoch:[0/2](119300/4588595) loss:3.286 lr:0.0000100 epoch_Time:28168.0min: [2024-01-03 06:11:40,755][model8_pretrain.py][INFO] Epoch:[0/2](119300/4588595) loss:3.113 lr:0.0000100 epoch_Time:28168.0min: [2024-01-03 06:11:40,755][model8_pretrain.py][INFO] Epoch:[0/2](119300/4588595) loss:2.873 lr:0.0000100 epoch_Time:28168.0min: [2024-01-03 06:11:40,755][model8_pretrain.py][INFO] Epoch:[0/2](119300/4588595) loss:2.414 lr:0.0000100 epoch_Time:28168.0min: [2024-01-03 06:11:40,755][model8_pretrain.py][INFO] Epoch:[0/2](119300/4588595) loss:3.109 lr:0.0000100 epoch_Time:28168.0min: [2024-01-03 06:11:40,756][model8_pretrain.py][INFO] Epoch:[0/2](119300/4588595) loss:2.844 lr:0.0000100 epoch_Time:28168.0min: [2024-01-03 06:12:27,426][model8_pretrain.py][INFO] Epoch:[0/2](119400/4588595) loss:2.683 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:12:27,426][model8_pretrain.py][INFO] Epoch:[0/2](119400/4588595) loss:3.294 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:12:27,426][model8_pretrain.py][INFO] Epoch:[0/2](119400/4588595) loss:2.969 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:12:27,427][model8_pretrain.py][INFO] Epoch:[0/2](119400/4588595) loss:3.138 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:12:27,427][model8_pretrain.py][INFO] Epoch:[0/2](119400/4588595) loss:2.883 lr:0.0000100 
epoch_Time:28172.0min: [2024-01-03 06:12:27,427][model8_pretrain.py][INFO] Epoch:[0/2](119400/4588595) loss:2.702 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:12:27,427][model8_pretrain.py][INFO] Epoch:[0/2](119400/4588595) loss:2.818 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:12:27,427][model8_pretrain.py][INFO] Epoch:[0/2](119400/4588595) loss:2.876 lr:0.0000100 epoch_Time:28172.0min: [2024-01-03 06:13:04,401][model8_pretrain.py][INFO] Epoch:[0/2](119500/4588595) loss:3.058 lr:0.0000100 epoch_Time:28171.0min: [2024-01-03 06:13:04,401][model8_pretrain.py][INFO] Epoch:[0/2](119500/4588595) loss:2.144 lr:0.0000100 epoch_Time:28171.0min: [2024-01-03 06:13:04,401][model8_pretrain.py][INFO] Epoch:[0/2](119500/4588595) loss:3.061 lr:0.0000100 epoch_Time:28171.0min: [2024-01-03 06:13:04,401][model8_pretrain.py][INFO] Epoch:[0/2](119500/4588595) loss:2.601 lr:0.0000100 epoch_Time:28171.0min: [2024-01-03 06:13:04,401][model8_pretrain.py][INFO] Epoch:[0/2](119500/4588595) loss:2.601 lr:0.0000100 epoch_Time:28171.0min: [2024-01-03 06:13:04,401][model8_pretrain.py][INFO] Epoch:[0/2](119500/4588595) loss:2.991 lr:0.0000100 epoch_Time:28171.0min: [2024-01-03 06:13:04,401][model8_pretrain.py][INFO] Epoch:[0/2](119500/4588595) loss:3.031 lr:0.0000100 epoch_Time:28171.0min: [2024-01-03 06:13:04,401][model8_pretrain.py][INFO] Epoch:[0/2](119500/4588595) loss:2.991 lr:0.0000100 epoch_Time:28171.0min: [2024-01-03 06:13:41,334][model8_pretrain.py][INFO] Epoch:[0/2](119600/4588595) loss:2.971 lr:0.0000100 epoch_Time:28170.0min: [2024-01-03 06:13:41,334][model8_pretrain.py][INFO] Epoch:[0/2](119600/4588595) loss:2.398 lr:0.0000100 epoch_Time:28170.0min: [2024-01-03 06:13:41,334][model8_pretrain.py][INFO] Epoch:[0/2](119600/4588595) loss:2.862 lr:0.0000100 epoch_Time:28170.0min: [2024-01-03 06:13:41,334][model8_pretrain.py][INFO] Epoch:[0/2](119600/4588595) loss:2.946 lr:0.0000100 epoch_Time:28170.0min: [2024-01-03 06:13:41,334][model8_pretrain.py][INFO] 
Epoch:[0/2](119600/4588595) loss:3.491 lr:0.0000100 epoch_Time:28170.0min: [2024-01-03 06:13:41,334][model8_pretrain.py][INFO] Epoch:[0/2](119600/4588595) loss:3.206 lr:0.0000100 epoch_Time:28170.0min: [2024-01-03 06:13:41,334][model8_pretrain.py][INFO] Epoch:[0/2](119600/4588595) loss:2.899 lr:0.0000100 epoch_Time:28170.0min: [2024-01-03 06:13:41,334][model8_pretrain.py][INFO] Epoch:[0/2](119600/4588595) loss:2.835 lr:0.0000100 epoch_Time:28170.0min: [2024-01-03 06:14:18,282][model8_pretrain.py][INFO] Epoch:[0/2](119700/4588595) loss:3.361 lr:0.0000100 epoch_Time:28169.0min: [2024-01-03 06:14:18,282][model8_pretrain.py][INFO] Epoch:[0/2](119700/4588595) loss:3.305 lr:0.0000100 epoch_Time:28169.0min: [2024-01-03 06:14:18,282][model8_pretrain.py][INFO] Epoch:[0/2](119700/4588595) loss:2.409 lr:0.0000100 epoch_Time:28169.0min: [2024-01-03 06:14:18,282][model8_pretrain.py][INFO] Epoch:[0/2](119700/4588595) loss:2.728 lr:0.0000100 epoch_Time:28169.0min: [2024-01-03 06:14:18,282][model8_pretrain.py][INFO] Epoch:[0/2](119700/4588595) loss:2.853 lr:0.0000100 epoch_Time:28169.0min: [2024-01-03 06:14:18,282][model8_pretrain.py][INFO] Epoch:[0/2](119700/4588595) loss:3.225 lr:0.0000100 epoch_Time:28169.0min: [2024-01-03 06:14:18,282][model8_pretrain.py][INFO] Epoch:[0/2](119700/4588595) loss:2.712 lr:0.0000100 epoch_Time:28169.0min: [2024-01-03 06:14:18,283][model8_pretrain.py][INFO] Epoch:[0/2](119700/4588595) loss:2.770 lr:0.0000100 epoch_Time:28169.0min: [2024-01-03 06:14:55,220][model8_pretrain.py][INFO] Epoch:[0/2](119800/4588595) loss:3.504 lr:0.0000100 epoch_Time:28167.0min: [2024-01-03 06:14:55,220][model8_pretrain.py][INFO] Epoch:[0/2](119800/4588595) loss:3.067 lr:0.0000100 epoch_Time:28167.0min: [2024-01-03 06:14:55,220][model8_pretrain.py][INFO] Epoch:[0/2](119800/4588595) loss:3.419 lr:0.0000100 epoch_Time:28167.0min: [2024-01-03 06:14:55,220][model8_pretrain.py][INFO] Epoch:[0/2](119800/4588595) loss:2.717 lr:0.0000100 epoch_Time:28167.0min: [2024-01-03 
06:14:55,220][model8_pretrain.py][INFO] Epoch:[0/2](119800/4588595) loss:2.850 lr:0.0000100 epoch_Time:28167.0min: [2024-01-03 06:14:55,220][model8_pretrain.py][INFO] Epoch:[0/2](119800/4588595) loss:3.157 lr:0.0000100 epoch_Time:28167.0min: [2024-01-03 06:14:55,220][model8_pretrain.py][INFO] Epoch:[0/2](119800/4588595) loss:3.231 lr:0.0000100 epoch_Time:28167.0min: [2024-01-03 06:14:55,220][model8_pretrain.py][INFO] Epoch:[0/2](119800/4588595) loss:2.821 lr:0.0000100 epoch_Time:28167.0min: [2024-01-03 06:15:32,179][model8_pretrain.py][INFO] Epoch:[0/2](119900/4588595) loss:2.772 lr:0.0000100 epoch_Time:28167.0min: [2024-01-03 06:15:32,179][model8_pretrain.py][INFO] Epoch:[0/2](119900/4588595) loss:3.371 lr:0.0000100 epoch_Time:28167.0min: [2024-01-03 06:15:32,179][model8_pretrain.py][INFO] Epoch:[0/2](119900/4588595) loss:2.622 lr:0.0000100 epoch_Time:28167.0min: [2024-01-03 06:15:32,179][model8_pretrain.py][INFO] Epoch:[0/2](119900/4588595) loss:3.047 lr:0.0000100 epoch_Time:28167.0min: [2024-01-03 06:15:32,179][model8_pretrain.py][INFO] Epoch:[0/2](119900/4588595) loss:2.807 lr:0.0000100 epoch_Time:28167.0min: [2024-01-03 06:15:32,179][model8_pretrain.py][INFO] Epoch:[0/2](119900/4588595) loss:3.089 lr:0.0000100 epoch_Time:28167.0min: [2024-01-03 06:15:32,179][model8_pretrain.py][INFO] Epoch:[0/2](119900/4588595) loss:3.073 lr:0.0000100 epoch_Time:28167.0min: [2024-01-03 06:15:32,179][model8_pretrain.py][INFO] Epoch:[0/2](119900/4588595) loss:3.019 lr:0.0000100 epoch_Time:28167.0min: [2024-01-03 06:16:09,136][model8_pretrain.py][INFO] Epoch:[0/2](120000/4588595) loss:3.270 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:16:09,136][model8_pretrain.py][INFO] Epoch:[0/2](120000/4588595) loss:3.093 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:16:09,136][model8_pretrain.py][INFO] Epoch:[0/2](120000/4588595) loss:3.458 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:16:09,136][model8_pretrain.py][INFO] Epoch:[0/2](120000/4588595) loss:3.539 lr:0.0000100 
epoch_Time:28165.0min: [2024-01-03 06:16:09,136][model8_pretrain.py][INFO] Epoch:[0/2](120000/4588595) loss:2.920 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:16:09,136][model8_pretrain.py][INFO] Epoch:[0/2](120000/4588595) loss:2.969 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:16:09,136][model8_pretrain.py][INFO] Epoch:[0/2](120000/4588595) loss:2.946 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:16:09,136][model8_pretrain.py][INFO] Epoch:[0/2](120000/4588595) loss:3.392 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:16:46,102][model8_pretrain.py][INFO] Epoch:[0/2](120100/4588595) loss:2.543 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:16:46,102][model8_pretrain.py][INFO] Epoch:[0/2](120100/4588595) loss:3.387 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:16:46,102][model8_pretrain.py][INFO] Epoch:[0/2](120100/4588595) loss:3.103 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:16:46,102][model8_pretrain.py][INFO] Epoch:[0/2](120100/4588595) loss:3.083 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:16:46,102][model8_pretrain.py][INFO] Epoch:[0/2](120100/4588595) loss:3.152 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:16:46,102][model8_pretrain.py][INFO] Epoch:[0/2](120100/4588595) loss:2.848 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:16:46,102][model8_pretrain.py][INFO] Epoch:[0/2](120100/4588595) loss:2.769 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:16:46,102][model8_pretrain.py][INFO] Epoch:[0/2](120100/4588595) loss:3.165 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:17:31,826][model8_pretrain.py][INFO] Epoch:[0/2](120200/4588595) loss:3.234 lr:0.0000100 epoch_Time:28169.0min: [2024-01-03 06:17:31,826][model8_pretrain.py][INFO] Epoch:[0/2](120200/4588595) loss:2.563 lr:0.0000100 epoch_Time:28169.0min: [2024-01-03 06:17:31,826][model8_pretrain.py][INFO] Epoch:[0/2](120200/4588595) loss:2.744 lr:0.0000100 epoch_Time:28169.0min: [2024-01-03 06:17:31,826][model8_pretrain.py][INFO] 
Epoch:[0/2](120200/4588595) loss:3.255 lr:0.0000100 epoch_Time:28169.0min: [2024-01-03 06:17:31,826][model8_pretrain.py][INFO] Epoch:[0/2](120200/4588595) loss:3.095 lr:0.0000100 epoch_Time:28169.0min: [2024-01-03 06:17:31,826][model8_pretrain.py][INFO] Epoch:[0/2](120200/4588595) loss:2.818 lr:0.0000100 epoch_Time:28169.0min: [2024-01-03 06:17:31,826][model8_pretrain.py][INFO] Epoch:[0/2](120200/4588595) loss:2.860 lr:0.0000100 epoch_Time:28169.0min: [2024-01-03 06:17:31,826][model8_pretrain.py][INFO] Epoch:[0/2](120200/4588595) loss:2.963 lr:0.0000100 epoch_Time:28169.0min: [2024-01-03 06:18:08,762][model8_pretrain.py][INFO] Epoch:[0/2](120300/4588595) loss:3.086 lr:0.0000100 epoch_Time:28167.0min: [2024-01-03 06:18:08,762][model8_pretrain.py][INFO] Epoch:[0/2](120300/4588595) loss:2.847 lr:0.0000100 epoch_Time:28167.0min: [2024-01-03 06:18:08,762][model8_pretrain.py][INFO] Epoch:[0/2](120300/4588595) loss:2.153 lr:0.0000100 epoch_Time:28167.0min: [2024-01-03 06:18:08,762][model8_pretrain.py][INFO] Epoch:[0/2](120300/4588595) loss:2.695 lr:0.0000100 epoch_Time:28167.0min: [2024-01-03 06:18:08,762][model8_pretrain.py][INFO] Epoch:[0/2](120300/4588595) loss:3.150 lr:0.0000100 epoch_Time:28167.0min: [2024-01-03 06:18:08,762][model8_pretrain.py][INFO] Epoch:[0/2](120300/4588595) loss:2.785 lr:0.0000100 epoch_Time:28167.0min: [2024-01-03 06:18:08,762][model8_pretrain.py][INFO] Epoch:[0/2](120300/4588595) loss:2.856 lr:0.0000100 epoch_Time:28167.0min: [2024-01-03 06:18:08,763][model8_pretrain.py][INFO] Epoch:[0/2](120300/4588595) loss:3.034 lr:0.0000100 epoch_Time:28167.0min: [2024-01-03 06:18:45,692][model8_pretrain.py][INFO] Epoch:[0/2](120400/4588595) loss:2.659 lr:0.0000100 epoch_Time:28166.0min: [2024-01-03 06:18:45,692][model8_pretrain.py][INFO] Epoch:[0/2](120400/4588595) loss:2.367 lr:0.0000100 epoch_Time:28166.0min: [2024-01-03 06:18:45,692][model8_pretrain.py][INFO] Epoch:[0/2](120400/4588595) loss:2.882 lr:0.0000100 epoch_Time:28167.0min: [2024-01-03 
06:18:45,692][model8_pretrain.py][INFO] Epoch:[0/2](120400/4588595) loss:3.017 lr:0.0000100 epoch_Time:28166.0min: [2024-01-03 06:18:45,692][model8_pretrain.py][INFO] Epoch:[0/2](120400/4588595) loss:2.625 lr:0.0000100 epoch_Time:28166.0min: [2024-01-03 06:18:45,692][model8_pretrain.py][INFO] Epoch:[0/2](120400/4588595) loss:3.478 lr:0.0000100 epoch_Time:28166.0min: [2024-01-03 06:18:45,692][model8_pretrain.py][INFO] Epoch:[0/2](120400/4588595) loss:2.644 lr:0.0000100 epoch_Time:28167.0min: [2024-01-03 06:18:45,693][model8_pretrain.py][INFO] Epoch:[0/2](120400/4588595) loss:3.566 lr:0.0000100 epoch_Time:28166.0min: [2024-01-03 06:19:22,639][model8_pretrain.py][INFO] Epoch:[0/2](120500/4588595) loss:2.663 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:19:22,639][model8_pretrain.py][INFO] Epoch:[0/2](120500/4588595) loss:3.044 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:19:22,639][model8_pretrain.py][INFO] Epoch:[0/2](120500/4588595) loss:3.265 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:19:22,639][model8_pretrain.py][INFO] Epoch:[0/2](120500/4588595) loss:2.563 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:19:22,639][model8_pretrain.py][INFO] Epoch:[0/2](120500/4588595) loss:3.158 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:19:22,639][model8_pretrain.py][INFO] Epoch:[0/2](120500/4588595) loss:3.198 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:19:22,639][model8_pretrain.py][INFO] Epoch:[0/2](120500/4588595) loss:3.113 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:19:22,639][model8_pretrain.py][INFO] Epoch:[0/2](120500/4588595) loss:3.318 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:19:59,584][model8_pretrain.py][INFO] Epoch:[0/2](120600/4588595) loss:3.069 lr:0.0000100 epoch_Time:28163.0min: [2024-01-03 06:19:59,584][model8_pretrain.py][INFO] Epoch:[0/2](120600/4588595) loss:2.950 lr:0.0000100 epoch_Time:28163.0min: [2024-01-03 06:19:59,584][model8_pretrain.py][INFO] Epoch:[0/2](120600/4588595) loss:3.238 lr:0.0000100 
epoch_Time:28163.0min: [2024-01-03 06:19:59,583][model8_pretrain.py][INFO] Epoch:[0/2](120600/4588595) loss:2.656 lr:0.0000100 epoch_Time:28163.0min: [2024-01-03 06:19:59,584][model8_pretrain.py][INFO] Epoch:[0/2](120600/4588595) loss:2.701 lr:0.0000100 epoch_Time:28163.0min: [2024-01-03 06:19:59,584][model8_pretrain.py][INFO] Epoch:[0/2](120600/4588595) loss:3.314 lr:0.0000100 epoch_Time:28163.0min: [2024-01-03 06:19:59,584][model8_pretrain.py][INFO] Epoch:[0/2](120600/4588595) loss:2.999 lr:0.0000100 epoch_Time:28163.0min: [2024-01-03 06:19:59,584][model8_pretrain.py][INFO] Epoch:[0/2](120600/4588595) loss:2.475 lr:0.0000100 epoch_Time:28163.0min: [2024-01-03 06:20:36,527][model8_pretrain.py][INFO] Epoch:[0/2](120700/4588595) loss:2.993 lr:0.0000100 epoch_Time:28163.0min: [2024-01-03 06:20:36,527][model8_pretrain.py][INFO] Epoch:[0/2](120700/4588595) loss:2.655 lr:0.0000100 epoch_Time:28163.0min: [2024-01-03 06:20:36,527][model8_pretrain.py][INFO] Epoch:[0/2](120700/4588595) loss:2.813 lr:0.0000100 epoch_Time:28163.0min: [2024-01-03 06:20:36,527][model8_pretrain.py][INFO] Epoch:[0/2](120700/4588595) loss:2.902 lr:0.0000100 epoch_Time:28163.0min: [2024-01-03 06:20:36,527][model8_pretrain.py][INFO] Epoch:[0/2](120700/4588595) loss:2.980 lr:0.0000100 epoch_Time:28163.0min: [2024-01-03 06:20:36,527][model8_pretrain.py][INFO] Epoch:[0/2](120700/4588595) loss:2.780 lr:0.0000100 epoch_Time:28163.0min: [2024-01-03 06:20:36,527][model8_pretrain.py][INFO] Epoch:[0/2](120700/4588595) loss:3.196 lr:0.0000100 epoch_Time:28163.0min: [2024-01-03 06:20:36,527][model8_pretrain.py][INFO] Epoch:[0/2](120700/4588595) loss:2.964 lr:0.0000100 epoch_Time:28163.0min: [2024-01-03 06:21:13,486][model8_pretrain.py][INFO] Epoch:[0/2](120800/4588595) loss:3.251 lr:0.0000100 epoch_Time:28161.0min: [2024-01-03 06:21:13,486][model8_pretrain.py][INFO] Epoch:[0/2](120800/4588595) loss:2.915 lr:0.0000100 epoch_Time:28161.0min: [2024-01-03 06:21:13,486][model8_pretrain.py][INFO] 
Epoch:[0/2](120800/4588595) loss:2.990 lr:0.0000100 epoch_Time:28161.0min: [2024-01-03 06:21:13,486][model8_pretrain.py][INFO] Epoch:[0/2](120800/4588595) loss:2.917 lr:0.0000100 epoch_Time:28161.0min: [2024-01-03 06:21:13,486][model8_pretrain.py][INFO] Epoch:[0/2](120800/4588595) loss:2.822 lr:0.0000100 epoch_Time:28161.0min: [2024-01-03 06:21:13,486][model8_pretrain.py][INFO] Epoch:[0/2](120800/4588595) loss:2.737 lr:0.0000100 epoch_Time:28161.0min: [2024-01-03 06:21:13,486][model8_pretrain.py][INFO] Epoch:[0/2](120800/4588595) loss:2.915 lr:0.0000100 epoch_Time:28161.0min: [2024-01-03 06:21:13,486][model8_pretrain.py][INFO] Epoch:[0/2](120800/4588595) loss:2.845 lr:0.0000100 epoch_Time:28161.0min: [2024-01-03 06:21:50,422][model8_pretrain.py][INFO] Epoch:[0/2](120900/4588595) loss:2.942 lr:0.0000100 epoch_Time:28160.0min: [2024-01-03 06:21:50,423][model8_pretrain.py][INFO] Epoch:[0/2](120900/4588595) loss:2.734 lr:0.0000100 epoch_Time:28160.0min: [2024-01-03 06:21:50,423][model8_pretrain.py][INFO] Epoch:[0/2](120900/4588595) loss:2.462 lr:0.0000100 epoch_Time:28160.0min: [2024-01-03 06:21:50,423][model8_pretrain.py][INFO] Epoch:[0/2](120900/4588595) loss:2.643 lr:0.0000100 epoch_Time:28160.0min: [2024-01-03 06:21:50,423][model8_pretrain.py][INFO] Epoch:[0/2](120900/4588595) loss:2.800 lr:0.0000100 epoch_Time:28160.0min: [2024-01-03 06:21:50,423][model8_pretrain.py][INFO] Epoch:[0/2](120900/4588595) loss:2.862 lr:0.0000100 epoch_Time:28160.0min: [2024-01-03 06:21:50,423][model8_pretrain.py][INFO] Epoch:[0/2](120900/4588595) loss:2.641 lr:0.0000100 epoch_Time:28160.0min: [2024-01-03 06:21:50,423][model8_pretrain.py][INFO] Epoch:[0/2](120900/4588595) loss:2.873 lr:0.0000100 epoch_Time:28160.0min: [2024-01-03 06:22:36,024][model8_pretrain.py][INFO] Epoch:[0/2](121000/4588595) loss:3.144 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:22:36,024][model8_pretrain.py][INFO] Epoch:[0/2](121000/4588595) loss:2.900 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 
06:22:36,024][model8_pretrain.py][INFO] Epoch:[0/2](121000/4588595) loss:2.704 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:22:36,024][model8_pretrain.py][INFO] Epoch:[0/2](121000/4588595) loss:2.986 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:22:36,024][model8_pretrain.py][INFO] Epoch:[0/2](121000/4588595) loss:2.982 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:22:36,024][model8_pretrain.py][INFO] Epoch:[0/2](121000/4588595) loss:2.968 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:22:36,024][model8_pretrain.py][INFO] Epoch:[0/2](121000/4588595) loss:3.035 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:22:36,024][model8_pretrain.py][INFO] Epoch:[0/2](121000/4588595) loss:3.139 lr:0.0000100 epoch_Time:28165.0min: [2024-01-03 06:23:12,948][model8_pretrain.py][INFO] Epoch:[0/2](121100/4588595) loss:2.851 lr:0.0000100 epoch_Time:28163.0min: [2024-01-03 06:23:12,948][model8_pretrain.py][INFO] Epoch:[0/2](121100/4588595) loss:3.154 lr:0.0000100 epoch_Time:28163.0min: [2024-01-03 06:23:12,948][model8_pretrain.py][INFO] Epoch:[0/2](121100/4588595) loss:2.178 lr:0.0000100 epoch_Time:28163.0min: [2024-01-03 06:23:12,948][model8_pretrain.py][INFO] Epoch:[0/2](121100/4588595) loss:2.916 lr:0.0000100 epoch_Time:28163.0min: [2024-01-03 06:23:12,948][model8_pretrain.py][INFO] Epoch:[0/2](121100/4588595) loss:2.517 lr:0.0000100 epoch_Time:28163.0min: [2024-01-03 06:23:12,948][model8_pretrain.py][INFO] Epoch:[0/2](121100/4588595) loss:3.122 lr:0.0000100 epoch_Time:28163.0min: [2024-01-03 06:23:12,948][model8_pretrain.py][INFO] Epoch:[0/2](121100/4588595) loss:3.203 lr:0.0000100 epoch_Time:28163.0min: [2024-01-03 06:23:12,948][model8_pretrain.py][INFO] Epoch:[0/2](121100/4588595) loss:2.725 lr:0.0000100 epoch_Time:28163.0min: [2024-01-03 06:23:49,897][model8_pretrain.py][INFO] Epoch:[0/2](121200/4588595) loss:2.679 lr:0.0000100 epoch_Time:28162.0min: [2024-01-03 06:23:49,897][model8_pretrain.py][INFO] Epoch:[0/2](121200/4588595) loss:2.480 lr:0.0000100 
epoch_Time:28162.0min: [2024-01-03 06:23:49,897][model8_pretrain.py][INFO] Epoch:[0/2](121200/4588595) loss:2.863 lr:0.0000100 epoch_Time:28162.0min: [2024-01-03 06:23:49,897][model8_pretrain.py][INFO] Epoch:[0/2](121200/4588595) loss:3.369 lr:0.0000100 epoch_Time:28162.0min: [2024-01-03 06:23:49,897][model8_pretrain.py][INFO] Epoch:[0/2](121200/4588595) loss:2.584 lr:0.0000100 epoch_Time:28162.0min: [2024-01-03 06:23:49,897][model8_pretrain.py][INFO] Epoch:[0/2](121200/4588595) loss:2.900 lr:0.0000100 epoch_Time:28162.0min: [2024-01-03 06:23:49,897][model8_pretrain.py][INFO] Epoch:[0/2](121200/4588595) loss:2.924 lr:0.0000100 epoch_Time:28162.0min: [2024-01-03 06:23:49,897][model8_pretrain.py][INFO] Epoch:[0/2](121200/4588595) loss:2.611 lr:0.0000100 epoch_Time:28162.0min: [2024-01-03 06:24:26,862][model8_pretrain.py][INFO] Epoch:[0/2](121300/4588595) loss:2.750 lr:0.0000100 epoch_Time:28161.0min: [2024-01-03 06:24:26,862][model8_pretrain.py][INFO] Epoch:[0/2](121300/4588595) loss:3.364 lr:0.0000100 epoch_Time:28161.0min: [2024-01-03 06:24:26,862][model8_pretrain.py][INFO] Epoch:[0/2](121300/4588595) loss:3.094 lr:0.0000100 epoch_Time:28161.0min: [2024-01-03 06:24:26,862][model8_pretrain.py][INFO] Epoch:[0/2](121300/4588595) loss:2.783 lr:0.0000100 epoch_Time:28161.0min: [2024-01-03 06:24:26,862][model8_pretrain.py][INFO] Epoch:[0/2](121300/4588595) loss:3.150 lr:0.0000100 epoch_Time:28161.0min: [2024-01-03 06:24:26,862][model8_pretrain.py][INFO] Epoch:[0/2](121300/4588595) loss:2.885 lr:0.0000100 epoch_Time:28161.0min: [2024-01-03 06:24:26,862][model8_pretrain.py][INFO] Epoch:[0/2](121300/4588595) loss:3.127 lr:0.0000100 epoch_Time:28161.0min: [2024-01-03 06:24:26,862][model8_pretrain.py][INFO] Epoch:[0/2](121300/4588595) loss:3.261 lr:0.0000100 epoch_Time:28161.0min: [2024-01-03 06:25:03,799][model8_pretrain.py][INFO] Epoch:[0/2](121400/4588595) loss:2.671 lr:0.0000100 epoch_Time:28159.0min: [2024-01-03 06:25:03,799][model8_pretrain.py][INFO] 
Epoch:[0/2](121400/4588595) loss:2.556 lr:0.0000100 epoch_Time:28159.0min: [2024-01-03 06:25:03,799][model8_pretrain.py][INFO] Epoch:[0/2](121400/4588595) loss:2.802 lr:0.0000100 epoch_Time:28159.0min: [2024-01-03 06:25:03,799][model8_pretrain.py][INFO] Epoch:[0/2](121400/4588595) loss:3.530 lr:0.0000100 epoch_Time:28159.0min: [2024-01-03 06:25:03,799][model8_pretrain.py][INFO] Epoch:[0/2](121400/4588595) loss:2.341 lr:0.0000100 epoch_Time:28159.0min: [2024-01-03 06:25:03,799][model8_pretrain.py][INFO] Epoch:[0/2](121400/4588595) loss:3.302 lr:0.0000100 epoch_Time:28159.0min: [2024-01-03 06:25:03,800][model8_pretrain.py][INFO] Epoch:[0/2](121400/4588595) loss:3.311 lr:0.0000100 epoch_Time:28159.0min: [2024-01-03 06:25:03,800][model8_pretrain.py][INFO] Epoch:[0/2](121400/4588595) loss:3.089 lr:0.0000100 epoch_Time:28159.0min: [2024-01-03 06:25:40,732][model8_pretrain.py][INFO] Epoch:[0/2](121500/4588595) loss:3.404 lr:0.0000100 epoch_Time:28159.0min: [2024-01-03 06:25:40,732][model8_pretrain.py][INFO] Epoch:[0/2](121500/4588595) loss:3.250 lr:0.0000100 epoch_Time:28159.0min: [2024-01-03 06:25:40,732][model8_pretrain.py][INFO] Epoch:[0/2](121500/4588595) loss:3.034 lr:0.0000100 epoch_Time:28159.0min: [2024-01-03 06:25:40,732][model8_pretrain.py][INFO] Epoch:[0/2](121500/4588595) loss:2.988 lr:0.0000100 epoch_Time:28159.0min: [2024-01-03 06:25:40,732][model8_pretrain.py][INFO] Epoch:[0/2](121500/4588595) loss:3.166 lr:0.0000100 epoch_Time:28159.0min: [2024-01-03 06:25:40,732][model8_pretrain.py][INFO] Epoch:[0/2](121500/4588595) loss:2.639 lr:0.0000100 epoch_Time:28159.0min: [2024-01-03 06:25:40,732][model8_pretrain.py][INFO] Epoch:[0/2](121500/4588595) loss:3.382 lr:0.0000100 epoch_Time:28159.0min: [2024-01-03 06:25:40,732][model8_pretrain.py][INFO] Epoch:[0/2](121500/4588595) loss:2.834 lr:0.0000100 epoch_Time:28159.0min: [2024-01-03 06:26:17,683][model8_pretrain.py][INFO] Epoch:[0/2](121600/4588595) loss:2.750 lr:0.0000100 epoch_Time:28157.0min: [2024-01-03 
06:26:17,683][model8_pretrain.py][INFO] Epoch:[0/2](121600/4588595) loss:2.582 lr:0.0000100 epoch_Time:28157.0min: [2024-01-03 06:26:17,683][model8_pretrain.py][INFO] Epoch:[0/2](121600/4588595) loss:2.588 lr:0.0000100 epoch_Time:28157.0min: [2024-01-03 06:26:17,683][model8_pretrain.py][INFO] Epoch:[0/2](121600/4588595) loss:3.269 lr:0.0000100 epoch_Time:28157.0min: [2024-01-03 06:26:17,683][model8_pretrain.py][INFO] Epoch:[0/2](121600/4588595) loss:3.170 lr:0.0000100 epoch_Time:28157.0min: [2024-01-03 06:26:17,683][model8_pretrain.py][INFO] Epoch:[0/2](121600/4588595) loss:3.524 lr:0.0000100 epoch_Time:28157.0min: [2024-01-03 06:26:17,683][model8_pretrain.py][INFO] Epoch:[0/2](121600/4588595) loss:3.071 lr:0.0000100 epoch_Time:28157.0min: [2024-01-03 06:26:17,684][model8_pretrain.py][INFO] Epoch:[0/2](121600/4588595) loss:2.855 lr:0.0000100 epoch_Time:28157.0min: [2024-01-03 06:26:54,644][model8_pretrain.py][INFO] Epoch:[0/2](121700/4588595) loss:3.139 lr:0.0000100 epoch_Time:28156.0min: [2024-01-03 06:26:54,644][model8_pretrain.py][INFO] Epoch:[0/2](121700/4588595) loss:3.045 lr:0.0000100 epoch_Time:28156.0min: [2024-01-03 06:26:54,644][model8_pretrain.py][INFO] Epoch:[0/2](121700/4588595) loss:2.481 lr:0.0000100 epoch_Time:28156.0min: [2024-01-03 06:26:54,644][model8_pretrain.py][INFO] Epoch:[0/2](121700/4588595) loss:3.011 lr:0.0000100 epoch_Time:28156.0min: [2024-01-03 06:26:54,644][model8_pretrain.py][INFO] Epoch:[0/2](121700/4588595) loss:2.959 lr:0.0000100 epoch_Time:28156.0min: [2024-01-03 06:26:54,644][model8_pretrain.py][INFO] Epoch:[0/2](121700/4588595) loss:3.147 lr:0.0000100 epoch_Time:28156.0min: [2024-01-03 06:26:54,644][model8_pretrain.py][INFO] Epoch:[0/2](121700/4588595) loss:2.699 lr:0.0000100 epoch_Time:28156.0min: [2024-01-03 06:26:54,644][model8_pretrain.py][INFO] Epoch:[0/2](121700/4588595) loss:3.397 lr:0.0000100 epoch_Time:28156.0min: [2024-01-03 06:27:40,402][model8_pretrain.py][INFO] Epoch:[0/2](121800/4588595) loss:3.350 lr:0.0000100 
epoch_Time:28161.0min: [2024-01-03 06:27:40,402][model8_pretrain.py][INFO] Epoch:[0/2](121800/4588595) loss:3.038 lr:0.0000100 epoch_Time:28161.0min: [2024-01-03 06:27:40,402][model8_pretrain.py][INFO] Epoch:[0/2](121800/4588595) loss:2.821 lr:0.0000100 epoch_Time:28161.0min: [2024-01-03 06:27:40,402][model8_pretrain.py][INFO] Epoch:[0/2](121800/4588595) loss:2.878 lr:0.0000100 epoch_Time:28161.0min: [2024-01-03 06:27:40,402][model8_pretrain.py][INFO] Epoch:[0/2](121800/4588595) loss:2.794 lr:0.0000100 epoch_Time:28161.0min: [2024-01-03 06:27:40,402][model8_pretrain.py][INFO] Epoch:[0/2](121800/4588595) loss:3.401 lr:0.0000100 epoch_Time:28161.0min: [2024-01-03 06:27:40,402][model8_pretrain.py][INFO] Epoch:[0/2](121800/4588595) loss:2.626 lr:0.0000100 epoch_Time:28161.0min: [2024-01-03 06:27:40,402][model8_pretrain.py][INFO] Epoch:[0/2](121800/4588595) loss:2.678 lr:0.0000100 epoch_Time:28161.0min: [2024-01-03 06:28:17,320][model8_pretrain.py][INFO] Epoch:[0/2](121900/4588595) loss:2.877 lr:0.0000100 epoch_Time:28159.0min: [2024-01-03 06:28:17,320][model8_pretrain.py][INFO] Epoch:[0/2](121900/4588595) loss:2.967 lr:0.0000100 epoch_Time:28159.0min: [2024-01-03 06:28:17,320][model8_pretrain.py][INFO] Epoch:[0/2](121900/4588595) loss:2.504 lr:0.0000100 epoch_Time:28159.0min: [2024-01-03 06:28:17,320][model8_pretrain.py][INFO] Epoch:[0/2](121900/4588595) loss:2.699 lr:0.0000100 epoch_Time:28159.0min: [2024-01-03 06:28:17,320][model8_pretrain.py][INFO] Epoch:[0/2](121900/4588595) loss:2.374 lr:0.0000100 epoch_Time:28159.0min: [2024-01-03 06:28:17,320][model8_pretrain.py][INFO] Epoch:[0/2](121900/4588595) loss:3.067 lr:0.0000100 epoch_Time:28159.0min: [2024-01-03 06:28:17,320][model8_pretrain.py][INFO] Epoch:[0/2](121900/4588595) loss:3.154 lr:0.0000100 epoch_Time:28159.0min: [2024-01-03 06:28:17,320][model8_pretrain.py][INFO] Epoch:[0/2](121900/4588595) loss:2.908 lr:0.0000100 epoch_Time:28159.0min: [2024-01-03 06:28:54,238][model8_pretrain.py][INFO] 
Epoch:[0/2](122000/4588595) loss:2.623 lr:0.0000100 epoch_Time:28158.0min: [2024-01-03 06:28:54,238][model8_pretrain.py][INFO] Epoch:[0/2](122000/4588595) loss:3.067 lr:0.0000100 epoch_Time:28158.0min: [2024-01-03 06:28:54,238][model8_pretrain.py][INFO] Epoch:[0/2](122000/4588595) loss:3.368 lr:0.0000100 epoch_Time:28158.0min: [2024-01-03 06:28:54,238][model8_pretrain.py][INFO] Epoch:[0/2](122000/4588595) loss:3.197 lr:0.0000100 epoch_Time:28158.0min: [2024-01-03 06:28:54,238][model8_pretrain.py][INFO] Epoch:[0/2](122000/4588595) loss:3.528 lr:0.0000100 epoch_Time:28158.0min: [2024-01-03 06:28:54,238][model8_pretrain.py][INFO] Epoch:[0/2](122000/4588595) loss:3.336 lr:0.0000100 epoch_Time:28158.0min: [2024-01-03 06:28:54,238][model8_pretrain.py][INFO] Epoch:[0/2](122000/4588595) loss:2.607 lr:0.0000100 epoch_Time:28158.0min: [2024-01-03 06:28:54,238][model8_pretrain.py][INFO] Epoch:[0/2](122000/4588595) loss:2.546 lr:0.0000100 epoch_Time:28158.0min: [2024-01-03 06:29:31,177][model8_pretrain.py][INFO] Epoch:[0/2](122100/4588595) loss:2.913 lr:0.0000100 epoch_Time:28157.0min: [2024-01-03 06:29:31,177][model8_pretrain.py][INFO] Epoch:[0/2](122100/4588595) loss:2.795 lr:0.0000100 epoch_Time:28157.0min: [2024-01-03 06:29:31,177][model8_pretrain.py][INFO] Epoch:[0/2](122100/4588595) loss:3.239 lr:0.0000100 epoch_Time:28157.0min: [2024-01-03 06:29:31,178][model8_pretrain.py][INFO] Epoch:[0/2](122100/4588595) loss:3.302 lr:0.0000100 epoch_Time:28157.0min: [2024-01-03 06:29:31,178][model8_pretrain.py][INFO] Epoch:[0/2](122100/4588595) loss:3.420 lr:0.0000100 epoch_Time:28157.0min: [2024-01-03 06:29:31,177][model8_pretrain.py][INFO] Epoch:[0/2](122100/4588595) loss:3.624 lr:0.0000100 epoch_Time:28157.0min: [2024-01-03 06:29:31,178][model8_pretrain.py][INFO] Epoch:[0/2](122100/4588595) loss:2.731 lr:0.0000100 epoch_Time:28157.0min: [2024-01-03 06:29:31,178][model8_pretrain.py][INFO] Epoch:[0/2](122100/4588595) loss:3.034 lr:0.0000100 epoch_Time:28157.0min: [2024-01-03 
06:30:08,117][model8_pretrain.py][INFO] Epoch:[0/2](122200/4588595) loss:2.222 lr:0.0000100 epoch_Time:28156.0min: [2024-01-03 06:30:08,117][model8_pretrain.py][INFO] Epoch:[0/2](122200/4588595) loss:3.467 lr:0.0000100 epoch_Time:28156.0min: [2024-01-03 06:30:08,117][model8_pretrain.py][INFO] Epoch:[0/2](122200/4588595) loss:3.448 lr:0.0000100 epoch_Time:28156.0min: [2024-01-03 06:30:08,117][model8_pretrain.py][INFO] Epoch:[0/2](122200/4588595) loss:2.644 lr:0.0000100 epoch_Time:28156.0min: [2024-01-03 06:30:08,117][model8_pretrain.py][INFO] Epoch:[0/2](122200/4588595) loss:3.284 lr:0.0000100 epoch_Time:28156.0min: [2024-01-03 06:30:08,117][model8_pretrain.py][INFO] Epoch:[0/2](122200/4588595) loss:2.971 lr:0.0000100 epoch_Time:28156.0min: [2024-01-03 06:30:08,117][model8_pretrain.py][INFO] Epoch:[0/2](122200/4588595) loss:2.281 lr:0.0000100 epoch_Time:28155.0min: [2024-01-03 06:30:08,117][model8_pretrain.py][INFO] Epoch:[0/2](122200/4588595) loss:2.680 lr:0.0000100 epoch_Time:28156.0min: [2024-01-03 06:30:45,068][model8_pretrain.py][INFO] Epoch:[0/2](122300/4588595) loss:2.806 lr:0.0000100 epoch_Time:28155.0min: [2024-01-03 06:30:45,068][model8_pretrain.py][INFO] Epoch:[0/2](122300/4588595) loss:2.424 lr:0.0000100 epoch_Time:28155.0min: [2024-01-03 06:30:45,068][model8_pretrain.py][INFO] Epoch:[0/2](122300/4588595) loss:3.169 lr:0.0000100 epoch_Time:28155.0min: [2024-01-03 06:30:45,068][model8_pretrain.py][INFO] Epoch:[0/2](122300/4588595) loss:2.914 lr:0.0000100 epoch_Time:28155.0min: [2024-01-03 06:30:45,068][model8_pretrain.py][INFO] Epoch:[0/2](122300/4588595) loss:2.747 lr:0.0000100 epoch_Time:28155.0min: [2024-01-03 06:30:45,068][model8_pretrain.py][INFO] Epoch:[0/2](122300/4588595) loss:2.975 lr:0.0000100 epoch_Time:28155.0min: [2024-01-03 06:30:45,068][model8_pretrain.py][INFO] Epoch:[0/2](122300/4588595) loss:2.772 lr:0.0000100 epoch_Time:28155.0min: [2024-01-03 06:30:45,068][model8_pretrain.py][INFO] Epoch:[0/2](122300/4588595) loss:3.096 lr:0.0000100 
epoch_Time:28155.0min: [2024-01-03 06:31:22,007][model8_pretrain.py][INFO] Epoch:[0/2](122400/4588595) loss:2.630 lr:0.0000100 epoch_Time:28153.0min: [2024-01-03 06:31:22,007][model8_pretrain.py][INFO] Epoch:[0/2](122400/4588595) loss:2.997 lr:0.0000100 epoch_Time:28153.0min: [2024-01-03 06:31:22,007][model8_pretrain.py][INFO] Epoch:[0/2](122400/4588595) loss:3.327 lr:0.0000100 epoch_Time:28153.0min: [2024-01-03 06:31:22,007][model8_pretrain.py][INFO] Epoch:[0/2](122400/4588595) loss:3.194 lr:0.0000100 epoch_Time:28153.0min: [2024-01-03 06:31:22,007][model8_pretrain.py][INFO] Epoch:[0/2](122400/4588595) loss:3.099 lr:0.0000100 epoch_Time:28153.0min: [2024-01-03 06:31:22,007][model8_pretrain.py][INFO] Epoch:[0/2](122400/4588595) loss:3.048 lr:0.0000100 epoch_Time:28153.0min: [2024-01-03 06:31:22,007][model8_pretrain.py][INFO] Epoch:[0/2](122400/4588595) loss:3.137 lr:0.0000100 epoch_Time:28153.0min: [2024-01-03 06:31:22,008][model8_pretrain.py][INFO] Epoch:[0/2](122400/4588595) loss:3.087 lr:0.0000100 epoch_Time:28153.0min: [2024-01-03 06:31:58,973][model8_pretrain.py][INFO] Epoch:[0/2](122500/4588595) loss:3.266 lr:0.0000100 epoch_Time:28152.0min: [2024-01-03 06:31:58,973][model8_pretrain.py][INFO] Epoch:[0/2](122500/4588595) loss:2.810 lr:0.0000100 epoch_Time:28152.0min: [2024-01-03 06:31:58,973][model8_pretrain.py][INFO] Epoch:[0/2](122500/4588595) loss:3.611 lr:0.0000100 epoch_Time:28152.0min: [2024-01-03 06:31:58,973][model8_pretrain.py][INFO] Epoch:[0/2](122500/4588595) loss:3.132 lr:0.0000100 epoch_Time:28152.0min: [2024-01-03 06:31:58,973][model8_pretrain.py][INFO] Epoch:[0/2](122500/4588595) loss:3.455 lr:0.0000100 epoch_Time:28152.0min: [2024-01-03 06:31:58,973][model8_pretrain.py][INFO] Epoch:[0/2](122500/4588595) loss:3.063 lr:0.0000100 epoch_Time:28152.0min: [2024-01-03 06:31:58,973][model8_pretrain.py][INFO] Epoch:[0/2](122500/4588595) loss:2.791 lr:0.0000100 epoch_Time:28152.0min: [2024-01-03 06:31:58,973][model8_pretrain.py][INFO] 
Epoch:[0/2](122500/4588595) loss:3.347 lr:0.0000100 epoch_Time:28152.0min: [2024-01-03 06:32:44,445][model8_pretrain.py][INFO] Epoch:[0/2](122600/4588595) loss:3.140 lr:0.0000100 epoch_Time:28157.0min: [2024-01-03 06:32:44,445][model8_pretrain.py][INFO] Epoch:[0/2](122600/4588595) loss:3.215 lr:0.0000100 epoch_Time:28157.0min: [2024-01-03 06:32:44,445][model8_pretrain.py][INFO] Epoch:[0/2](122600/4588595) loss:3.088 lr:0.0000100 epoch_Time:28157.0min: [2024-01-03 06:32:44,445][model8_pretrain.py][INFO] Epoch:[0/2](122600/4588595) loss:3.415 lr:0.0000100 epoch_Time:28157.0min: [2024-01-03 06:32:44,445][model8_pretrain.py][INFO] Epoch:[0/2](122600/4588595) loss:3.053 lr:0.0000100 epoch_Time:28157.0min: [2024-01-03 06:32:44,445][model8_pretrain.py][INFO] Epoch:[0/2](122600/4588595) loss:3.677 lr:0.0000100 epoch_Time:28157.0min: [2024-01-03 06:32:44,445][model8_pretrain.py][INFO] Epoch:[0/2](122600/4588595) loss:3.014 lr:0.0000100 epoch_Time:28157.0min: [2024-01-03 06:32:44,445][model8_pretrain.py][INFO] Epoch:[0/2](122600/4588595) loss:3.132 lr:0.0000100 epoch_Time:28157.0min: [2024-01-03 06:33:21,346][model8_pretrain.py][INFO] Epoch:[0/2](122700/4588595) loss:2.887 lr:0.0000100 epoch_Time:28155.0min: [2024-01-03 06:33:21,346][model8_pretrain.py][INFO] Epoch:[0/2](122700/4588595) loss:2.709 lr:0.0000100 epoch_Time:28155.0min: [2024-01-03 06:33:21,346][model8_pretrain.py][INFO] Epoch:[0/2](122700/4588595) loss:3.118 lr:0.0000100 epoch_Time:28155.0min: [2024-01-03 06:33:21,346][model8_pretrain.py][INFO] Epoch:[0/2](122700/4588595) loss:3.090 lr:0.0000100 epoch_Time:28155.0min: [2024-01-03 06:33:21,346][model8_pretrain.py][INFO] Epoch:[0/2](122700/4588595) loss:3.108 lr:0.0000100 epoch_Time:28155.0min: [2024-01-03 06:33:21,346][model8_pretrain.py][INFO] Epoch:[0/2](122700/4588595) loss:2.956 lr:0.0000100 epoch_Time:28155.0min: [2024-01-03 06:33:21,346][model8_pretrain.py][INFO] Epoch:[0/2](122700/4588595) loss:2.828 lr:0.0000100 epoch_Time:28155.0min: [2024-01-03 
06:33:21,346][model8_pretrain.py][INFO] Epoch:[0/2](122700/4588595) loss:3.051 lr:0.0000100 epoch_Time:28155.0min: [2024-01-03 06:33:58,291][model8_pretrain.py][INFO] Epoch:[0/2](122800/4588595) loss:3.108 lr:0.0000100 epoch_Time:28154.0min: [2024-01-03 06:33:58,291][model8_pretrain.py][INFO] Epoch:[0/2](122800/4588595) loss:3.146 lr:0.0000100 epoch_Time:28154.0min: [2024-01-03 06:33:58,291][model8_pretrain.py][INFO] Epoch:[0/2](122800/4588595) loss:3.441 lr:0.0000100 epoch_Time:28154.0min: [2024-01-03 06:33:58,291][model8_pretrain.py][INFO] Epoch:[0/2](122800/4588595) loss:2.728 lr:0.0000100 epoch_Time:28154.0min: [2024-01-03 06:33:58,291][model8_pretrain.py][INFO] Epoch:[0/2](122800/4588595) loss:2.672 lr:0.0000100 epoch_Time:28154.0min: [2024-01-03 06:33:58,291][model8_pretrain.py][INFO] Epoch:[0/2](122800/4588595) loss:2.934 lr:0.0000100 epoch_Time:28154.0min: [2024-01-03 06:33:58,291][model8_pretrain.py][INFO] Epoch:[0/2](122800/4588595) loss:3.145 lr:0.0000100 epoch_Time:28154.0min: [2024-01-03 06:33:58,292][model8_pretrain.py][INFO] Epoch:[0/2](122800/4588595) loss:3.335 lr:0.0000100 epoch_Time:28154.0min: [2024-01-03 06:34:35,248][model8_pretrain.py][INFO] Epoch:[0/2](122900/4588595) loss:2.965 lr:0.0000100 epoch_Time:28153.0min: [2024-01-03 06:34:35,248][model8_pretrain.py][INFO] Epoch:[0/2](122900/4588595) loss:2.731 lr:0.0000100 epoch_Time:28153.0min: [2024-01-03 06:34:35,248][model8_pretrain.py][INFO] Epoch:[0/2](122900/4588595) loss:3.101 lr:0.0000100 epoch_Time:28153.0min: [2024-01-03 06:34:35,248][model8_pretrain.py][INFO] Epoch:[0/2](122900/4588595) loss:3.029 lr:0.0000100 epoch_Time:28153.0min: [2024-01-03 06:34:35,248][model8_pretrain.py][INFO] Epoch:[0/2](122900/4588595) loss:3.185 lr:0.0000100 epoch_Time:28153.0min: [2024-01-03 06:34:35,248][model8_pretrain.py][INFO] Epoch:[0/2](122900/4588595) loss:2.837 lr:0.0000100 epoch_Time:28153.0min: [2024-01-03 06:34:35,248][model8_pretrain.py][INFO] Epoch:[0/2](122900/4588595) loss:3.011 lr:0.0000100 
epoch_Time:28153.0min: [2024-01-03 06:34:35,248][model8_pretrain.py][INFO] Epoch:[0/2](122900/4588595) loss:3.251 lr:0.0000100 epoch_Time:28153.0min: [2024-01-03 06:35:12,196][model8_pretrain.py][INFO] Epoch:[0/2](123000/4588595) loss:3.251 lr:0.0000100 epoch_Time:28151.0min: [2024-01-03 06:35:12,196][model8_pretrain.py][INFO] Epoch:[0/2](123000/4588595) loss:2.401 lr:0.0000100 epoch_Time:28151.0min: [2024-01-03 06:35:12,196][model8_pretrain.py][INFO] Epoch:[0/2](123000/4588595) loss:3.162 lr:0.0000100 epoch_Time:28151.0min: [2024-01-03 06:35:12,196][model8_pretrain.py][INFO] Epoch:[0/2](123000/4588595) loss:2.535 lr:0.0000100 epoch_Time:28151.0min: [2024-01-03 06:35:12,196][model8_pretrain.py][INFO] Epoch:[0/2](123000/4588595) loss:2.630 lr:0.0000100 epoch_Time:28151.0min: [2024-01-03 06:35:12,196][model8_pretrain.py][INFO] Epoch:[0/2](123000/4588595) loss:2.639 lr:0.0000100 epoch_Time:28151.0min: [2024-01-03 06:35:12,196][model8_pretrain.py][INFO] Epoch:[0/2](123000/4588595) loss:2.734 lr:0.0000100 epoch_Time:28151.0min: [2024-01-03 06:35:12,196][model8_pretrain.py][INFO] Epoch:[0/2](123000/4588595) loss:3.011 lr:0.0000100 epoch_Time:28151.0min: [2024-01-03 06:35:49,128][model8_pretrain.py][INFO] Epoch:[0/2](123100/4588595) loss:3.092 lr:0.0000100 epoch_Time:28150.0min: [2024-01-03 06:35:49,128][model8_pretrain.py][INFO] Epoch:[0/2](123100/4588595) loss:3.192 lr:0.0000100 epoch_Time:28150.0min: [2024-01-03 06:35:49,128][model8_pretrain.py][INFO] Epoch:[0/2](123100/4588595) loss:2.592 lr:0.0000100 epoch_Time:28150.0min: [2024-01-03 06:35:49,128][model8_pretrain.py][INFO] Epoch:[0/2](123100/4588595) loss:2.393 lr:0.0000100 epoch_Time:28150.0min: [2024-01-03 06:35:49,128][model8_pretrain.py][INFO] Epoch:[0/2](123100/4588595) loss:3.123 lr:0.0000100 epoch_Time:28150.0min: [2024-01-03 06:35:49,128][model8_pretrain.py][INFO] Epoch:[0/2](123100/4588595) loss:2.702 lr:0.0000100 epoch_Time:28150.0min: [2024-01-03 06:35:49,128][model8_pretrain.py][INFO] 
Epoch:[0/2](123100/4588595) loss:2.516 lr:0.0000100 epoch_Time:28150.0min: [2024-01-03 06:35:49,129][model8_pretrain.py][INFO] Epoch:[0/2](123100/4588595) loss:2.980 lr:0.0000100 epoch_Time:28150.0min: [2024-01-03 06:36:26,079][model8_pretrain.py][INFO] Epoch:[0/2](123200/4588595) loss:3.016 lr:0.0000100 epoch_Time:28149.0min: [2024-01-03 06:36:26,079][model8_pretrain.py][INFO] Epoch:[0/2](123200/4588595) loss:2.690 lr:0.0000100 epoch_Time:28149.0min: [2024-01-03 06:36:26,079][model8_pretrain.py][INFO] Epoch:[0/2](123200/4588595) loss:3.214 lr:0.0000100 epoch_Time:28149.0min: [2024-01-03 06:36:26,080][model8_pretrain.py][INFO] Epoch:[0/2](123200/4588595) loss:3.010 lr:0.0000100 epoch_Time:28149.0min: [2024-01-03 06:36:26,080][model8_pretrain.py][INFO] Epoch:[0/2](123200/4588595) loss:2.498 lr:0.0000100 epoch_Time:28149.0min: [2024-01-03 06:36:26,080][model8_pretrain.py][INFO] Epoch:[0/2](123200/4588595) loss:2.758 lr:0.0000100 epoch_Time:28149.0min: [2024-01-03 06:36:26,080][model8_pretrain.py][INFO] Epoch:[0/2](123200/4588595) loss:3.253 lr:0.0000100 epoch_Time:28149.0min: [2024-01-03 06:36:26,080][model8_pretrain.py][INFO] Epoch:[0/2](123200/4588595) loss:2.811 lr:0.0000100 epoch_Time:28149.0min: [2024-01-03 06:37:03,009][model8_pretrain.py][INFO] Epoch:[0/2](123300/4588595) loss:2.920 lr:0.0000100 epoch_Time:28148.0min: [2024-01-03 06:37:03,009][model8_pretrain.py][INFO] Epoch:[0/2](123300/4588595) loss:3.405 lr:0.0000100 epoch_Time:28148.0min: [2024-01-03 06:37:03,009][model8_pretrain.py][INFO] Epoch:[0/2](123300/4588595) loss:3.151 lr:0.0000100 epoch_Time:28148.0min: [2024-01-03 06:37:03,009][model8_pretrain.py][INFO] Epoch:[0/2](123300/4588595) loss:2.365 lr:0.0000100 epoch_Time:28148.0min: [2024-01-03 06:37:03,009][model8_pretrain.py][INFO] Epoch:[0/2](123300/4588595) loss:2.682 lr:0.0000100 epoch_Time:28148.0min: [2024-01-03 06:37:03,009][model8_pretrain.py][INFO] Epoch:[0/2](123300/4588595) loss:2.168 lr:0.0000100 epoch_Time:28148.0min: [2024-01-03 
06:37:03,009][model8_pretrain.py][INFO] Epoch:[0/2](123300/4588595) loss:3.006 lr:0.0000100 epoch_Time:28148.0min: [2024-01-03 06:37:03,009][model8_pretrain.py][INFO] Epoch:[0/2](123300/4588595) loss:3.035 lr:0.0000100 epoch_Time:28148.0min: [2024-01-03 06:37:48,370][model8_pretrain.py][INFO] Epoch:[0/2](123400/4588595) loss:2.659 lr:0.0000100 epoch_Time:28151.0min: [2024-01-03 06:37:48,370][model8_pretrain.py][INFO] Epoch:[0/2](123400/4588595) loss:2.856 lr:0.0000100 epoch_Time:28151.0min: [2024-01-03 06:37:48,371][model8_pretrain.py][INFO] Epoch:[0/2](123400/4588595) loss:2.619 lr:0.0000100 epoch_Time:28151.0min: [2024-01-03 06:37:48,370][model8_pretrain.py][INFO] Epoch:[0/2](123400/4588595) loss:2.725 lr:0.0000100 epoch_Time:28151.0min: [2024-01-03 06:37:48,371][model8_pretrain.py][INFO] Epoch:[0/2](123400/4588595) loss:2.737 lr:0.0000100 epoch_Time:28151.0min: [2024-01-03 06:37:48,371][model8_pretrain.py][INFO] Epoch:[0/2](123400/4588595) loss:2.788 lr:0.0000100 epoch_Time:28151.0min: [2024-01-03 06:37:48,371][model8_pretrain.py][INFO] Epoch:[0/2](123400/4588595) loss:3.296 lr:0.0000100 epoch_Time:28151.0min: [2024-01-03 06:37:48,371][model8_pretrain.py][INFO] Epoch:[0/2](123400/4588595) loss:2.978 lr:0.0000100 epoch_Time:28151.0min: [2024-01-03 06:38:25,295][model8_pretrain.py][INFO] Epoch:[0/2](123500/4588595) loss:2.771 lr:0.0000100 epoch_Time:28151.0min: [2024-01-03 06:38:25,295][model8_pretrain.py][INFO] Epoch:[0/2](123500/4588595) loss:3.224 lr:0.0000100 epoch_Time:28151.0min: [2024-01-03 06:38:25,295][model8_pretrain.py][INFO] Epoch:[0/2](123500/4588595) loss:3.115 lr:0.0000100 epoch_Time:28151.0min: [2024-01-03 06:38:25,295][model8_pretrain.py][INFO] Epoch:[0/2](123500/4588595) loss:2.957 lr:0.0000100 epoch_Time:28151.0min: [2024-01-03 06:38:25,295][model8_pretrain.py][INFO] Epoch:[0/2](123500/4588595) loss:3.031 lr:0.0000100 epoch_Time:28151.0min: [2024-01-03 06:38:25,295][model8_pretrain.py][INFO] Epoch:[0/2](123500/4588595) loss:3.114 lr:0.0000100 
epoch_Time:28151.0min: [2024-01-03 06:38:25,295][model8_pretrain.py][INFO] Epoch:[0/2](123500/4588595) loss:3.207 lr:0.0000100 epoch_Time:28151.0min: [2024-01-03 06:38:25,295][model8_pretrain.py][INFO] Epoch:[0/2](123500/4588595) loss:3.037 lr:0.0000100 epoch_Time:28151.0min: [2024-01-03 06:39:02,221][model8_pretrain.py][INFO] Epoch:[0/2](123600/4588595) loss:3.070 lr:0.0000100 epoch_Time:28149.0min: [2024-01-03 06:39:02,221][model8_pretrain.py][INFO] Epoch:[0/2](123600/4588595) loss:2.740 lr:0.0000100 epoch_Time:28149.0min: [2024-01-03 06:39:02,221][model8_pretrain.py][INFO] Epoch:[0/2](123600/4588595) loss:2.855 lr:0.0000100 epoch_Time:28149.0min: [2024-01-03 06:39:02,221][model8_pretrain.py][INFO] Epoch:[0/2](123600/4588595) loss:3.262 lr:0.0000100 epoch_Time:28149.0min: [2024-01-03 06:39:02,221][model8_pretrain.py][INFO] Epoch:[0/2](123600/4588595) loss:2.777 lr:0.0000100 epoch_Time:28149.0min: [2024-01-03 06:39:02,221][model8_pretrain.py][INFO] Epoch:[0/2](123600/4588595) loss:3.014 lr:0.0000100 epoch_Time:28149.0min: [2024-01-03 06:39:02,221][model8_pretrain.py][INFO] Epoch:[0/2](123600/4588595) loss:2.932 lr:0.0000100 epoch_Time:28149.0min: [2024-01-03 06:39:02,221][model8_pretrain.py][INFO] Epoch:[0/2](123600/4588595) loss:3.459 lr:0.0000100 epoch_Time:28149.0min: [2024-01-03 06:39:39,153][model8_pretrain.py][INFO] Epoch:[0/2](123700/4588595) loss:3.396 lr:0.0000100 epoch_Time:28149.0min: [2024-01-03 06:39:39,153][model8_pretrain.py][INFO] Epoch:[0/2](123700/4588595) loss:2.847 lr:0.0000100 epoch_Time:28149.0min: [2024-01-03 06:39:39,153][model8_pretrain.py][INFO] Epoch:[0/2](123700/4588595) loss:2.895 lr:0.0000100 epoch_Time:28149.0min: [2024-01-03 06:39:39,153][model8_pretrain.py][INFO] Epoch:[0/2](123700/4588595) loss:3.251 lr:0.0000100 epoch_Time:28149.0min: [2024-01-03 06:39:39,153][model8_pretrain.py][INFO] Epoch:[0/2](123700/4588595) loss:3.098 lr:0.0000100 epoch_Time:28149.0min: [2024-01-03 06:39:39,153][model8_pretrain.py][INFO] 
Epoch:[0/2](123700/4588595) loss:3.263 lr:0.0000100 epoch_Time:28149.0min: [2024-01-03 06:39:39,153][model8_pretrain.py][INFO] Epoch:[0/2](123700/4588595) loss:2.549 lr:0.0000100 epoch_Time:28149.0min: [2024-01-03 06:39:39,153][model8_pretrain.py][INFO] Epoch:[0/2](123700/4588595) loss:3.088 lr:0.0000100 epoch_Time:28149.0min: [2024-01-03 06:40:16,109][model8_pretrain.py][INFO] Epoch:[0/2](123800/4588595) loss:2.968 lr:0.0000100 epoch_Time:28147.0min: [2024-01-03 06:40:16,109][model8_pretrain.py][INFO] Epoch:[0/2](123800/4588595) loss:3.154 lr:0.0000100 epoch_Time:28147.0min: [2024-01-03 06:40:16,109][model8_pretrain.py][INFO] Epoch:[0/2](123800/4588595) loss:2.748 lr:0.0000100 epoch_Time:28147.0min: [2024-01-03 06:40:16,109][model8_pretrain.py][INFO] Epoch:[0/2](123800/4588595) loss:3.008 lr:0.0000100 epoch_Time:28147.0min: [2024-01-03 06:40:16,109][model8_pretrain.py][INFO] Epoch:[0/2](123800/4588595) loss:3.242 lr:0.0000100 epoch_Time:28147.0min: [2024-01-03 06:40:16,109][model8_pretrain.py][INFO] Epoch:[0/2](123800/4588595) loss:2.949 lr:0.0000100 epoch_Time:28147.0min: [2024-01-03 06:40:16,109][model8_pretrain.py][INFO] Epoch:[0/2](123800/4588595) loss:2.748 lr:0.0000100 epoch_Time:28147.0min: [2024-01-03 06:40:16,109][model8_pretrain.py][INFO] Epoch:[0/2](123800/4588595) loss:2.637 lr:0.0000100 epoch_Time:28147.0min: [2024-01-03 06:40:53,093][model8_pretrain.py][INFO] Epoch:[0/2](123900/4588595) loss:2.949 lr:0.0000100 epoch_Time:28146.0min: [2024-01-03 06:40:53,093][model8_pretrain.py][INFO] Epoch:[0/2](123900/4588595) loss:2.695 lr:0.0000100 epoch_Time:28146.0min: [2024-01-03 06:40:53,093][model8_pretrain.py][INFO] Epoch:[0/2](123900/4588595) loss:3.049 lr:0.0000100 epoch_Time:28146.0min: [2024-01-03 06:40:53,093][model8_pretrain.py][INFO] Epoch:[0/2](123900/4588595) loss:2.676 lr:0.0000100 epoch_Time:28146.0min: [2024-01-03 06:40:53,093][model8_pretrain.py][INFO] Epoch:[0/2](123900/4588595) loss:2.908 lr:0.0000100 epoch_Time:28146.0min: [2024-01-03 
06:40:53,093][model8_pretrain.py][INFO] Epoch:[0/2](123900/4588595) loss:2.989 lr:0.0000100 epoch_Time:28146.0min: [2024-01-03 06:40:53,093][model8_pretrain.py][INFO] Epoch:[0/2](123900/4588595) loss:2.711 lr:0.0000100 epoch_Time:28146.0min: [2024-01-03 06:40:53,093][model8_pretrain.py][INFO] Epoch:[0/2](123900/4588595) loss:3.094 lr:0.0000100 epoch_Time:28146.0min: [2024-01-03 06:41:30,105][model8_pretrain.py][INFO] Epoch:[0/2](124000/4588595) loss:3.502 lr:0.0000100 epoch_Time:28145.0min: [2024-01-03 06:41:30,105][model8_pretrain.py][INFO] Epoch:[0/2](124000/4588595) loss:2.852 lr:0.0000100 epoch_Time:28145.0min: [2024-01-03 06:41:30,105][model8_pretrain.py][INFO] Epoch:[0/2](124000/4588595) loss:3.131 lr:0.0000100 epoch_Time:28145.0min: [2024-01-03 06:41:30,105][model8_pretrain.py][INFO] Epoch:[0/2](124000/4588595) loss:3.119 lr:0.0000100 epoch_Time:28145.0min: [2024-01-03 06:41:30,105][model8_pretrain.py][INFO] Epoch:[0/2](124000/4588595) loss:2.909 lr:0.0000100 epoch_Time:28145.0min: [2024-01-03 06:41:30,105][model8_pretrain.py][INFO] Epoch:[0/2](124000/4588595) loss:3.082 lr:0.0000100 epoch_Time:28145.0min: [2024-01-03 06:41:30,105][model8_pretrain.py][INFO] Epoch:[0/2](124000/4588595) loss:2.323 lr:0.0000100 epoch_Time:28145.0min: [2024-01-03 06:41:30,105][model8_pretrain.py][INFO] Epoch:[0/2](124000/4588595) loss:3.087 lr:0.0000100 epoch_Time:28145.0min: [2024-01-03 06:42:07,081][model8_pretrain.py][INFO] Epoch:[0/2](124100/4588595) loss:2.806 lr:0.0000100 epoch_Time:28144.0min: [2024-01-03 06:42:07,081][model8_pretrain.py][INFO] Epoch:[0/2](124100/4588595) loss:3.148 lr:0.0000100 epoch_Time:28144.0min: [2024-01-03 06:42:07,081][model8_pretrain.py][INFO] Epoch:[0/2](124100/4588595) loss:2.844 lr:0.0000100 epoch_Time:28144.0min: [2024-01-03 06:42:07,081][model8_pretrain.py][INFO] Epoch:[0/2](124100/4588595) loss:2.732 lr:0.0000100 epoch_Time:28144.0min: [2024-01-03 06:42:07,081][model8_pretrain.py][INFO] Epoch:[0/2](124100/4588595) loss:2.834 lr:0.0000100 
epoch_Time:28144.0min: [2024-01-03 06:42:07,081][model8_pretrain.py][INFO] Epoch:[0/2](124100/4588595) loss:2.948 lr:0.0000100 epoch_Time:28144.0min: [2024-01-03 06:42:07,081][model8_pretrain.py][INFO] Epoch:[0/2](124100/4588595) loss:3.387 lr:0.0000100 epoch_Time:28144.0min: [2024-01-03 06:42:07,081][model8_pretrain.py][INFO] Epoch:[0/2](124100/4588595) loss:3.023 lr:0.0000100 epoch_Time:28144.0min: [2024-01-03 06:42:52,397][model8_pretrain.py][INFO] Epoch:[0/2](124200/4588595) loss:3.163 lr:0.0000100 epoch_Time:28147.0min: [2024-01-03 06:42:52,398][model8_pretrain.py][INFO] Epoch:[0/2](124200/4588595) loss:2.134 lr:0.0000100 epoch_Time:28147.0min: [2024-01-03 06:42:52,398][model8_pretrain.py][INFO] Epoch:[0/2](124200/4588595) loss:3.331 lr:0.0000100 epoch_Time:28147.0min: [2024-01-03 06:42:52,398][model8_pretrain.py][INFO] Epoch:[0/2](124200/4588595) loss:3.082 lr:0.0000100 epoch_Time:28147.0min: [2024-01-03 06:42:52,398][model8_pretrain.py][INFO] Epoch:[0/2](124200/4588595) loss:2.991 lr:0.0000100 epoch_Time:28147.0min: [2024-01-03 06:42:52,398][model8_pretrain.py][INFO] Epoch:[0/2](124200/4588595) loss:2.355 lr:0.0000100 epoch_Time:28147.0min: [2024-01-03 06:42:52,398][model8_pretrain.py][INFO] Epoch:[0/2](124200/4588595) loss:3.045 lr:0.0000100 epoch_Time:28147.0min: [2024-01-03 06:42:52,398][model8_pretrain.py][INFO] Epoch:[0/2](124200/4588595) loss:3.086 lr:0.0000100 epoch_Time:28147.0min: [2024-01-03 06:43:29,342][model8_pretrain.py][INFO] Epoch:[0/2](124300/4588595) loss:3.216 lr:0.0000100 epoch_Time:28147.0min: [2024-01-03 06:43:29,342][model8_pretrain.py][INFO] Epoch:[0/2](124300/4588595) loss:2.572 lr:0.0000100 epoch_Time:28147.0min: [2024-01-03 06:43:29,342][model8_pretrain.py][INFO] Epoch:[0/2](124300/4588595) loss:2.995 lr:0.0000100 epoch_Time:28147.0min: [2024-01-03 06:43:29,342][model8_pretrain.py][INFO] Epoch:[0/2](124300/4588595) loss:3.344 lr:0.0000100 epoch_Time:28147.0min: [2024-01-03 06:43:29,342][model8_pretrain.py][INFO] 
Epoch:[0/2](124300/4588595) loss:2.734 lr:0.0000100 epoch_Time:28147.0min: [2024-01-03 06:43:29,342][model8_pretrain.py][INFO] Epoch:[0/2](124300/4588595) loss:3.575 lr:0.0000100 epoch_Time:28147.0min: [2024-01-03 06:43:29,343][model8_pretrain.py][INFO] Epoch:[0/2](124300/4588595) loss:3.176 lr:0.0000100 epoch_Time:28147.0min: [2024-01-03 06:43:29,343][model8_pretrain.py][INFO] Epoch:[0/2](124300/4588595) loss:3.035 lr:0.0000100 epoch_Time:28147.0min: [2024-01-03 06:44:06,279][model8_pretrain.py][INFO] Epoch:[0/2](124400/4588595) loss:2.762 lr:0.0000100 epoch_Time:28145.0min: [2024-01-03 06:44:06,279][model8_pretrain.py][INFO] Epoch:[0/2](124400/4588595) loss:2.742 lr:0.0000100 epoch_Time:28145.0min: [2024-01-03 06:44:06,279][model8_pretrain.py][INFO] Epoch:[0/2](124400/4588595) loss:3.010 lr:0.0000100 epoch_Time:28145.0min: [2024-01-03 06:44:06,279][model8_pretrain.py][INFO] Epoch:[0/2](124400/4588595) loss:3.283 lr:0.0000100 epoch_Time:28145.0min: [2024-01-03 06:44:06,279][model8_pretrain.py][INFO] Epoch:[0/2](124400/4588595) loss:3.189 lr:0.0000100 epoch_Time:28145.0min: [2024-01-03 06:44:06,279][model8_pretrain.py][INFO] Epoch:[0/2](124400/4588595) loss:2.656 lr:0.0000100 epoch_Time:28145.0min: [2024-01-03 06:44:06,280][model8_pretrain.py][INFO] Epoch:[0/2](124400/4588595) loss:3.073 lr:0.0000100 epoch_Time:28145.0min: [2024-01-03 06:44:06,280][model8_pretrain.py][INFO] Epoch:[0/2](124400/4588595) loss:3.478 lr:0.0000100 epoch_Time:28145.0min: [2024-01-03 06:44:43,241][model8_pretrain.py][INFO] Epoch:[0/2](124500/4588595) loss:2.839 lr:0.0000100 epoch_Time:28145.0min: [2024-01-03 06:44:43,241][model8_pretrain.py][INFO] Epoch:[0/2](124500/4588595) loss:3.201 lr:0.0000100 epoch_Time:28145.0min: [2024-01-03 06:44:43,241][model8_pretrain.py][INFO] Epoch:[0/2](124500/4588595) loss:2.604 lr:0.0000100 epoch_Time:28145.0min: [2024-01-03 06:44:43,241][model8_pretrain.py][INFO] Epoch:[0/2](124500/4588595) loss:2.635 lr:0.0000100 epoch_Time:28145.0min: [2024-01-03 
06:44:43,241][model8_pretrain.py][INFO] Epoch:[0/2](124500/4588595) loss:2.847 lr:0.0000100 epoch_Time:28145.0min: [2024-01-03 06:44:43,242][model8_pretrain.py][INFO] Epoch:[0/2](124500/4588595) loss:2.673 lr:0.0000100 epoch_Time:28145.0min: [2024-01-03 06:44:43,242][model8_pretrain.py][INFO] Epoch:[0/2](124500/4588595) loss:2.487 lr:0.0000100 epoch_Time:28145.0min: [2024-01-03 06:44:43,242][model8_pretrain.py][INFO] Epoch:[0/2](124500/4588595) loss:2.882 lr:0.0000100 epoch_Time:28145.0min: [2024-01-03 06:45:20,182][model8_pretrain.py][INFO] Epoch:[0/2](124600/4588595) loss:2.944 lr:0.0000100 epoch_Time:28143.0min: [2024-01-03 06:45:20,182][model8_pretrain.py][INFO] Epoch:[0/2](124600/4588595) loss:2.807 lr:0.0000100 epoch_Time:28143.0min: [2024-01-03 06:45:20,182][model8_pretrain.py][INFO] Epoch:[0/2](124600/4588595) loss:2.931 lr:0.0000100 epoch_Time:28143.0min: [2024-01-03 06:45:20,182][model8_pretrain.py][INFO] Epoch:[0/2](124600/4588595) loss:3.196 lr:0.0000100 epoch_Time:28143.0min: [2024-01-03 06:45:20,182][model8_pretrain.py][INFO] Epoch:[0/2](124600/4588595) loss:3.326 lr:0.0000100 epoch_Time:28143.0min: [2024-01-03 06:45:20,182][model8_pretrain.py][INFO] Epoch:[0/2](124600/4588595) loss:3.098 lr:0.0000100 epoch_Time:28143.0min: [2024-01-03 06:45:20,182][model8_pretrain.py][INFO] Epoch:[0/2](124600/4588595) loss:3.033 lr:0.0000100 epoch_Time:28143.0min: [2024-01-03 06:45:20,182][model8_pretrain.py][INFO] Epoch:[0/2](124600/4588595) loss:3.033 lr:0.0000100 epoch_Time:28143.0min: [2024-01-03 06:45:57,133][model8_pretrain.py][INFO] Epoch:[0/2](124700/4588595) loss:3.160 lr:0.0000100 epoch_Time:28142.0min: [2024-01-03 06:45:57,133][model8_pretrain.py][INFO] Epoch:[0/2](124700/4588595) loss:2.499 lr:0.0000100 epoch_Time:28142.0min: [2024-01-03 06:45:57,133][model8_pretrain.py][INFO] Epoch:[0/2](124700/4588595) loss:2.795 lr:0.0000100 epoch_Time:28142.0min: [2024-01-03 06:45:57,133][model8_pretrain.py][INFO] Epoch:[0/2](124700/4588595) loss:2.517 lr:0.0000100 
epoch_Time:28142.0min: [2024-01-03 06:45:57,133][model8_pretrain.py][INFO] Epoch:[0/2](124700/4588595) loss:3.214 lr:0.0000100 epoch_Time:28142.0min: [2024-01-03 06:45:57,133][model8_pretrain.py][INFO] Epoch:[0/2](124700/4588595) loss:3.157 lr:0.0000100 epoch_Time:28142.0min: [2024-01-03 06:45:57,133][model8_pretrain.py][INFO] Epoch:[0/2](124700/4588595) loss:2.765 lr:0.0000100 epoch_Time:28142.0min: [2024-01-03 06:45:57,133][model8_pretrain.py][INFO] Epoch:[0/2](124700/4588595) loss:2.859 lr:0.0000100 epoch_Time:28142.0min: [2024-01-03 06:46:34,069][model8_pretrain.py][INFO] Epoch:[0/2](124800/4588595) loss:2.773 lr:0.0000100 epoch_Time:28141.0min: [2024-01-03 06:46:34,069][model8_pretrain.py][INFO] Epoch:[0/2](124800/4588595) loss:2.606 lr:0.0000100 epoch_Time:28141.0min: [2024-01-03 06:46:34,069][model8_pretrain.py][INFO] Epoch:[0/2](124800/4588595) loss:3.111 lr:0.0000100 epoch_Time:28141.0min: [2024-01-03 06:46:34,069][model8_pretrain.py][INFO] Epoch:[0/2](124800/4588595) loss:2.954 lr:0.0000100 epoch_Time:28141.0min: [2024-01-03 06:46:34,069][model8_pretrain.py][INFO] Epoch:[0/2](124800/4588595) loss:3.167 lr:0.0000100 epoch_Time:28141.0min: [2024-01-03 06:46:34,069][model8_pretrain.py][INFO] Epoch:[0/2](124800/4588595) loss:3.008 lr:0.0000100 epoch_Time:28141.0min: [2024-01-03 06:46:34,069][model8_pretrain.py][INFO] Epoch:[0/2](124800/4588595) loss:2.802 lr:0.0000100 epoch_Time:28141.0min: [2024-01-03 06:46:34,070][model8_pretrain.py][INFO] Epoch:[0/2](124800/4588595) loss:3.595 lr:0.0000100 epoch_Time:28141.0min: [2024-01-03 06:47:10,996][model8_pretrain.py][INFO] Epoch:[0/2](124900/4588595) loss:3.211 lr:0.0000100 epoch_Time:28140.0min: [2024-01-03 06:47:10,996][model8_pretrain.py][INFO] Epoch:[0/2](124900/4588595) loss:3.207 lr:0.0000100 epoch_Time:28140.0min: [2024-01-03 06:47:10,996][model8_pretrain.py][INFO] Epoch:[0/2](124900/4588595) loss:2.644 lr:0.0000100 epoch_Time:28140.0min: [2024-01-03 06:47:10,996][model8_pretrain.py][INFO] 
Epoch:[0/2](124900/4588595) loss:3.363 lr:0.0000100 epoch_Time:28140.0min: [2024-01-03 06:47:10,996][model8_pretrain.py][INFO] Epoch:[0/2](124900/4588595) loss:3.024 lr:0.0000100 epoch_Time:28140.0min: [2024-01-03 06:47:10,996][model8_pretrain.py][INFO] Epoch:[0/2](124900/4588595) loss:3.115 lr:0.0000100 epoch_Time:28140.0min: [2024-01-03 06:47:10,996][model8_pretrain.py][INFO] Epoch:[0/2](124900/4588595) loss:2.958 lr:0.0000100 epoch_Time:28140.0min: [2024-01-03 06:47:10,997][model8_pretrain.py][INFO] Epoch:[0/2](124900/4588595) loss:2.955 lr:0.0000100 epoch_Time:28140.0min: [2024-01-03 06:47:56,360][model8_pretrain.py][INFO] Epoch:[0/2](125000/4588595) loss:2.817 lr:0.0000100 epoch_Time:28143.0min: [2024-01-03 06:47:56,360][model8_pretrain.py][INFO] Epoch:[0/2](125000/4588595) loss:2.957 lr:0.0000100 epoch_Time:28143.0min: [2024-01-03 06:47:56,360][model8_pretrain.py][INFO] Epoch:[0/2](125000/4588595) loss:3.742 lr:0.0000100 epoch_Time:28143.0min: [2024-01-03 06:47:56,360][model8_pretrain.py][INFO] Epoch:[0/2](125000/4588595) loss:2.993 lr:0.0000100 epoch_Time:28143.0min: [2024-01-03 06:47:56,360][model8_pretrain.py][INFO] Epoch:[0/2](125000/4588595) loss:2.328 lr:0.0000100 epoch_Time:28143.0min: [2024-01-03 06:47:56,360][model8_pretrain.py][INFO] Epoch:[0/2](125000/4588595) loss:3.472 lr:0.0000100 epoch_Time:28143.0min: [2024-01-03 06:47:56,360][model8_pretrain.py][INFO] Epoch:[0/2](125000/4588595) loss:2.873 lr:0.0000100 epoch_Time:28143.0min: [2024-01-03 06:47:56,360][model8_pretrain.py][INFO] Epoch:[0/2](125000/4588595) loss:2.701 lr:0.0000100 epoch_Time:28143.0min: [2024-01-03 06:48:33,282][model8_pretrain.py][INFO] Epoch:[0/2](125100/4588595) loss:2.784 lr:0.0000100 epoch_Time:28143.0min: [2024-01-03 06:48:33,282][model8_pretrain.py][INFO] Epoch:[0/2](125100/4588595) loss:3.300 lr:0.0000100 epoch_Time:28143.0min: [2024-01-03 06:48:33,282][model8_pretrain.py][INFO] Epoch:[0/2](125100/4588595) loss:3.232 lr:0.0000100 epoch_Time:28143.0min: [2024-01-03 
06:48:33,282][model8_pretrain.py][INFO] Epoch:[0/2](125100/4588595) loss:2.569 lr:0.0000100 epoch_Time:28143.0min: [2024-01-03 06:48:33,282][model8_pretrain.py][INFO] Epoch:[0/2](125100/4588595) loss:3.043 lr:0.0000100 epoch_Time:28143.0min: [2024-01-03 06:48:33,282][model8_pretrain.py][INFO] Epoch:[0/2](125100/4588595) loss:2.873 lr:0.0000100 epoch_Time:28143.0min: [2024-01-03 06:48:33,282][model8_pretrain.py][INFO] Epoch:[0/2](125100/4588595) loss:2.779 lr:0.0000100 epoch_Time:28143.0min: [2024-01-03 06:48:33,282][model8_pretrain.py][INFO] Epoch:[0/2](125100/4588595) loss:3.137 lr:0.0000100 epoch_Time:28143.0min: [2024-01-03 06:49:10,225][model8_pretrain.py][INFO] Epoch:[0/2](125200/4588595) loss:3.197 lr:0.0000100 epoch_Time:28141.0min: [2024-01-03 06:49:10,225][model8_pretrain.py][INFO] Epoch:[0/2](125200/4588595) loss:3.208 lr:0.0000100 epoch_Time:28141.0min: [2024-01-03 06:49:10,225][model8_pretrain.py][INFO] Epoch:[0/2](125200/4588595) loss:2.732 lr:0.0000100 epoch_Time:28141.0min: [2024-01-03 06:49:10,225][model8_pretrain.py][INFO] Epoch:[0/2](125200/4588595) loss:3.282 lr:0.0000100 epoch_Time:28141.0min: [2024-01-03 06:49:10,225][model8_pretrain.py][INFO] Epoch:[0/2](125200/4588595) loss:2.442 lr:0.0000100 epoch_Time:28141.0min: [2024-01-03 06:49:10,225][model8_pretrain.py][INFO] Epoch:[0/2](125200/4588595) loss:3.054 lr:0.0000100 epoch_Time:28141.0min: [2024-01-03 06:49:10,225][model8_pretrain.py][INFO] Epoch:[0/2](125200/4588595) loss:3.387 lr:0.0000100 epoch_Time:28141.0min: [2024-01-03 06:49:10,225][model8_pretrain.py][INFO] Epoch:[0/2](125200/4588595) loss:3.230 lr:0.0000100 epoch_Time:28141.0min: [2024-01-03 06:49:47,155][model8_pretrain.py][INFO] Epoch:[0/2](125300/4588595) loss:3.353 lr:0.0000100 epoch_Time:28140.0min: [2024-01-03 06:49:47,155][model8_pretrain.py][INFO] Epoch:[0/2](125300/4588595) loss:3.285 lr:0.0000100 epoch_Time:28140.0min: [2024-01-03 06:49:47,155][model8_pretrain.py][INFO] Epoch:[0/2](125300/4588595) loss:3.035 lr:0.0000100 
epoch_Time:28140.0min: [2024-01-03 06:49:47,155][model8_pretrain.py][INFO] Epoch:[0/2](125300/4588595) loss:3.360 lr:0.0000100 epoch_Time:28140.0min: [2024-01-03 06:49:47,155][model8_pretrain.py][INFO] Epoch:[0/2](125300/4588595) loss:3.211 lr:0.0000100 epoch_Time:28140.0min: [2024-01-03 06:49:47,155][model8_pretrain.py][INFO] Epoch:[0/2](125300/4588595) loss:3.197 lr:0.0000100 epoch_Time:28140.0min: [2024-01-03 06:49:47,155][model8_pretrain.py][INFO] Epoch:[0/2](125300/4588595) loss:3.124 lr:0.0000100 epoch_Time:28141.0min: [2024-01-03 06:49:47,155][model8_pretrain.py][INFO] Epoch:[0/2](125300/4588595) loss:2.545 lr:0.0000100 epoch_Time:28140.0min: [2024-01-03 06:50:24,091][model8_pretrain.py][INFO] Epoch:[0/2](125400/4588595) loss:3.259 lr:0.0000100 epoch_Time:28139.0min: [2024-01-03 06:50:24,091][model8_pretrain.py][INFO] Epoch:[0/2](125400/4588595) loss:2.426 lr:0.0000100 epoch_Time:28139.0min: [2024-01-03 06:50:24,091][model8_pretrain.py][INFO] Epoch:[0/2](125400/4588595) loss:3.342 lr:0.0000100 epoch_Time:28139.0min: [2024-01-03 06:50:24,091][model8_pretrain.py][INFO] Epoch:[0/2](125400/4588595) loss:2.785 lr:0.0000100 epoch_Time:28139.0min: [2024-01-03 06:50:24,091][model8_pretrain.py][INFO] Epoch:[0/2](125400/4588595) loss:2.934 lr:0.0000100 epoch_Time:28139.0min: [2024-01-03 06:50:24,091][model8_pretrain.py][INFO] Epoch:[0/2](125400/4588595) loss:3.052 lr:0.0000100 epoch_Time:28139.0min: [2024-01-03 06:50:24,091][model8_pretrain.py][INFO] Epoch:[0/2](125400/4588595) loss:3.201 lr:0.0000100 epoch_Time:28139.0min: [2024-01-03 06:50:24,091][model8_pretrain.py][INFO] Epoch:[0/2](125400/4588595) loss:3.031 lr:0.0000100 epoch_Time:28139.0min: [2024-01-03 06:51:01,038][model8_pretrain.py][INFO] Epoch:[0/2](125500/4588595) loss:2.525 lr:0.0000100 epoch_Time:28137.0min: [2024-01-03 06:51:01,038][model8_pretrain.py][INFO] Epoch:[0/2](125500/4588595) loss:3.161 lr:0.0000100 epoch_Time:28137.0min: [2024-01-03 06:51:01,039][model8_pretrain.py][INFO] 
Epoch:[0/2](125500/4588595) loss:2.937 lr:0.0000100 epoch_Time:28137.0min: [2024-01-03 06:51:01,038][model8_pretrain.py][INFO] Epoch:[0/2](125500/4588595) loss:2.732 lr:0.0000100 epoch_Time:28137.0min: [2024-01-03 06:51:01,039][model8_pretrain.py][INFO] Epoch:[0/2](125500/4588595) loss:3.136 lr:0.0000100 epoch_Time:28137.0min: [2024-01-03 06:51:01,039][model8_pretrain.py][INFO] Epoch:[0/2](125500/4588595) loss:3.145 lr:0.0000100 epoch_Time:28137.0min: [2024-01-03 06:51:01,039][model8_pretrain.py][INFO] Epoch:[0/2](125500/4588595) loss:3.023 lr:0.0000100 epoch_Time:28137.0min: [2024-01-03 06:51:01,039][model8_pretrain.py][INFO] Epoch:[0/2](125500/4588595) loss:3.369 lr:0.0000100 epoch_Time:28137.0min: [2024-01-03 06:51:37,980][model8_pretrain.py][INFO] Epoch:[0/2](125600/4588595) loss:2.709 lr:0.0000100 epoch_Time:28137.0min: [2024-01-03 06:51:37,980][model8_pretrain.py][INFO] Epoch:[0/2](125600/4588595) loss:3.315 lr:0.0000100 epoch_Time:28137.0min: [2024-01-03 06:51:37,980][model8_pretrain.py][INFO] Epoch:[0/2](125600/4588595) loss:2.667 lr:0.0000100 epoch_Time:28137.0min: [2024-01-03 06:51:37,980][model8_pretrain.py][INFO] Epoch:[0/2](125600/4588595) loss:2.973 lr:0.0000100 epoch_Time:28137.0min: [2024-01-03 06:51:37,980][model8_pretrain.py][INFO] Epoch:[0/2](125600/4588595) loss:3.039 lr:0.0000100 epoch_Time:28137.0min: [2024-01-03 06:51:37,980][model8_pretrain.py][INFO] Epoch:[0/2](125600/4588595) loss:2.864 lr:0.0000100 epoch_Time:28137.0min: [2024-01-03 06:51:37,980][model8_pretrain.py][INFO] Epoch:[0/2](125600/4588595) loss:3.236 lr:0.0000100 epoch_Time:28137.0min: [2024-01-03 06:51:37,980][model8_pretrain.py][INFO] Epoch:[0/2](125600/4588595) loss:2.792 lr:0.0000100 epoch_Time:28137.0min: [2024-01-03 06:52:14,937][model8_pretrain.py][INFO] Epoch:[0/2](125700/4588595) loss:3.168 lr:0.0000100 epoch_Time:28135.0min: [2024-01-03 06:52:14,937][model8_pretrain.py][INFO] Epoch:[0/2](125700/4588595) loss:2.754 lr:0.0000100 epoch_Time:28135.0min: [2024-01-03 
06:52:14,937][model8_pretrain.py][INFO] Epoch:[0/2](125700/4588595) loss:2.375 lr:0.0000100 epoch_Time:28135.0min: [2024-01-03 06:52:14,937][model8_pretrain.py][INFO] Epoch:[0/2](125700/4588595) loss:2.221 lr:0.0000100 epoch_Time:28135.0min: [2024-01-03 06:52:14,937][model8_pretrain.py][INFO] Epoch:[0/2](125700/4588595) loss:2.958 lr:0.0000100 epoch_Time:28135.0min: [2024-01-03 06:52:14,937][model8_pretrain.py][INFO] Epoch:[0/2](125700/4588595) loss:3.393 lr:0.0000100 epoch_Time:28135.0min: [2024-01-03 06:52:14,937][model8_pretrain.py][INFO] Epoch:[0/2](125700/4588595) loss:3.423 lr:0.0000100 epoch_Time:28135.0min: [2024-01-03 06:52:14,937][model8_pretrain.py][INFO] Epoch:[0/2](125700/4588595) loss:3.153 lr:0.0000100 epoch_Time:28135.0min: [2024-01-03 06:53:01,121][model8_pretrain.py][INFO] Epoch:[0/2](125800/4588595) loss:3.254 lr:0.0000100 epoch_Time:28139.0min: [2024-01-03 06:53:01,121][model8_pretrain.py][INFO] Epoch:[0/2](125800/4588595) loss:3.387 lr:0.0000100 epoch_Time:28139.0min: [2024-01-03 06:53:01,121][model8_pretrain.py][INFO] Epoch:[0/2](125800/4588595) loss:3.313 lr:0.0000100 epoch_Time:28139.0min: [2024-01-03 06:53:01,121][model8_pretrain.py][INFO] Epoch:[0/2](125800/4588595) loss:2.549 lr:0.0000100 epoch_Time:28139.0min: [2024-01-03 06:53:01,121][model8_pretrain.py][INFO] Epoch:[0/2](125800/4588595) loss:2.759 lr:0.0000100 epoch_Time:28139.0min: [2024-01-03 06:53:01,121][model8_pretrain.py][INFO] Epoch:[0/2](125800/4588595) loss:2.445 lr:0.0000100 epoch_Time:28139.0min: [2024-01-03 06:53:01,121][model8_pretrain.py][INFO] Epoch:[0/2](125800/4588595) loss:2.932 lr:0.0000100 epoch_Time:28139.0min: [2024-01-03 06:53:01,121][model8_pretrain.py][INFO] Epoch:[0/2](125800/4588595) loss:3.043 lr:0.0000100 epoch_Time:28139.0min: [2024-01-03 06:53:38,053][model8_pretrain.py][INFO] Epoch:[0/2](125900/4588595) loss:3.159 lr:0.0000100 epoch_Time:28139.0min: [2024-01-03 06:53:38,054][model8_pretrain.py][INFO] Epoch:[0/2](125900/4588595) loss:3.160 lr:0.0000100 
epoch_Time:28139.0min: [2024-01-03 06:53:38,054][model8_pretrain.py][INFO] Epoch:[0/2](125900/4588595) loss:2.793 lr:0.0000100 epoch_Time:28139.0min: [2024-01-03 06:53:38,054][model8_pretrain.py][INFO] Epoch:[0/2](125900/4588595) loss:2.533 lr:0.0000100 epoch_Time:28139.0min: [2024-01-03 06:53:38,054][model8_pretrain.py][INFO] Epoch:[0/2](125900/4588595) loss:2.683 lr:0.0000100 epoch_Time:28139.0min: [2024-01-03 06:53:38,054][model8_pretrain.py][INFO] Epoch:[0/2](125900/4588595) loss:2.985 lr:0.0000100 epoch_Time:28139.0min: [2024-01-03 06:53:38,054][model8_pretrain.py][INFO] Epoch:[0/2](125900/4588595) loss:3.299 lr:0.0000100 epoch_Time:28139.0min: [2024-01-03 06:53:38,054][model8_pretrain.py][INFO] Epoch:[0/2](125900/4588595) loss:2.939 lr:0.0000100 epoch_Time:28139.0min: [2024-01-03 06:54:14,996][model8_pretrain.py][INFO] Epoch:[0/2](126000/4588595) loss:3.096 lr:0.0000100 epoch_Time:28137.0min: [2024-01-03 06:54:14,996][model8_pretrain.py][INFO] Epoch:[0/2](126000/4588595) loss:3.238 lr:0.0000100 epoch_Time:28137.0min: [2024-01-03 06:54:14,996][model8_pretrain.py][INFO] Epoch:[0/2](126000/4588595) loss:3.123 lr:0.0000100 epoch_Time:28137.0min: [2024-01-03 06:54:14,996][model8_pretrain.py][INFO] Epoch:[0/2](126000/4588595) loss:3.715 lr:0.0000100 epoch_Time:28137.0min: [2024-01-03 06:54:14,996][model8_pretrain.py][INFO] Epoch:[0/2](126000/4588595) loss:3.109 lr:0.0000100 epoch_Time:28137.0min: [2024-01-03 06:54:14,996][model8_pretrain.py][INFO] Epoch:[0/2](126000/4588595) loss:3.080 lr:0.0000100 epoch_Time:28137.0min: [2024-01-03 06:54:14,996][model8_pretrain.py][INFO] Epoch:[0/2](126000/4588595) loss:3.147 lr:0.0000100 epoch_Time:28137.0min: [2024-01-03 06:54:14,997][model8_pretrain.py][INFO] Epoch:[0/2](126000/4588595) loss:2.875 lr:0.0000100 epoch_Time:28137.0min: [2024-01-03 06:54:51,942][model8_pretrain.py][INFO] Epoch:[0/2](126100/4588595) loss:3.228 lr:0.0000100 epoch_Time:28136.0min: [2024-01-03 06:54:51,942][model8_pretrain.py][INFO] 
Epoch:[0/2](126100/4588595) loss:3.024 lr:0.0000100 epoch_Time:28136.0min: [2024-01-03 06:54:51,942][model8_pretrain.py][INFO] Epoch:[0/2](126100/4588595) loss:2.496 lr:0.0000100 epoch_Time:28136.0min: [2024-01-03 06:54:51,943][model8_pretrain.py][INFO] Epoch:[0/2](126100/4588595) loss:2.988 lr:0.0000100 epoch_Time:28136.0min: [2024-01-03 06:54:51,943][model8_pretrain.py][INFO] Epoch:[0/2](126100/4588595) loss:3.086 lr:0.0000100 epoch_Time:28136.0min: [2024-01-03 06:54:51,943][model8_pretrain.py][INFO] Epoch:[0/2](126100/4588595) loss:2.884 lr:0.0000100 epoch_Time:28136.0min: [2024-01-03 06:54:51,943][model8_pretrain.py][INFO] Epoch:[0/2](126100/4588595) loss:2.999 lr:0.0000100 epoch_Time:28136.0min: [2024-01-03 06:54:51,943][model8_pretrain.py][INFO] Epoch:[0/2](126100/4588595) loss:3.126 lr:0.0000100 epoch_Time:28136.0min: [2024-01-03 06:55:28,891][model8_pretrain.py][INFO] Epoch:[0/2](126200/4588595) loss:3.160 lr:0.0000100 epoch_Time:28135.0min: [2024-01-03 06:55:28,891][model8_pretrain.py][INFO] Epoch:[0/2](126200/4588595) loss:2.588 lr:0.0000100 epoch_Time:28135.0min: [2024-01-03 06:55:28,891][model8_pretrain.py][INFO] Epoch:[0/2](126200/4588595) loss:3.165 lr:0.0000100 epoch_Time:28135.0min: [2024-01-03 06:55:28,891][model8_pretrain.py][INFO] Epoch:[0/2](126200/4588595) loss:2.298 lr:0.0000100 epoch_Time:28135.0min: [2024-01-03 06:55:28,891][model8_pretrain.py][INFO] Epoch:[0/2](126200/4588595) loss:2.667 lr:0.0000100 epoch_Time:28135.0min: [2024-01-03 06:55:28,891][model8_pretrain.py][INFO] Epoch:[0/2](126200/4588595) loss:2.985 lr:0.0000100 epoch_Time:28135.0min: [2024-01-03 06:55:28,891][model8_pretrain.py][INFO] Epoch:[0/2](126200/4588595) loss:2.787 lr:0.0000100 epoch_Time:28135.0min: [2024-01-03 06:55:28,892][model8_pretrain.py][INFO] Epoch:[0/2](126200/4588595) loss:2.669 lr:0.0000100 epoch_Time:28135.0min: [2024-01-03 06:56:05,844][model8_pretrain.py][INFO] Epoch:[0/2](126300/4588595) loss:3.055 lr:0.0000100 epoch_Time:28134.0min: [2024-01-03 
06:56:05,844][model8_pretrain.py][INFO] Epoch:[0/2](126300/4588595) loss:3.030 lr:0.0000100 epoch_Time:28134.0min: [2024-01-03 06:56:05,844][model8_pretrain.py][INFO] Epoch:[0/2](126300/4588595) loss:2.850 lr:0.0000100 epoch_Time:28134.0min: [2024-01-03 06:56:05,844][model8_pretrain.py][INFO] Epoch:[0/2](126300/4588595) loss:2.702 lr:0.0000100 epoch_Time:28134.0min: [2024-01-03 06:56:05,844][model8_pretrain.py][INFO] Epoch:[0/2](126300/4588595) loss:3.132 lr:0.0000100 epoch_Time:28134.0min: [2024-01-03 06:56:05,845][model8_pretrain.py][INFO] Epoch:[0/2](126300/4588595) loss:3.072 lr:0.0000100 epoch_Time:28134.0min: [2024-01-03 06:56:05,845][model8_pretrain.py][INFO] Epoch:[0/2](126300/4588595) loss:3.193 lr:0.0000100 epoch_Time:28134.0min: [2024-01-03 06:56:05,845][model8_pretrain.py][INFO] Epoch:[0/2](126300/4588595) loss:2.480 lr:0.0000100 epoch_Time:28134.0min: [2024-01-03 06:56:42,806][model8_pretrain.py][INFO] Epoch:[0/2](126400/4588595) loss:2.841 lr:0.0000100 epoch_Time:28133.0min: [2024-01-03 06:56:42,806][model8_pretrain.py][INFO] Epoch:[0/2](126400/4588595) loss:2.308 lr:0.0000100 epoch_Time:28133.0min: [2024-01-03 06:56:42,806][model8_pretrain.py][INFO] Epoch:[0/2](126400/4588595) loss:2.411 lr:0.0000100 epoch_Time:28133.0min: [2024-01-03 06:56:42,806][model8_pretrain.py][INFO] Epoch:[0/2](126400/4588595) loss:3.127 lr:0.0000100 epoch_Time:28133.0min: [2024-01-03 06:56:42,806][model8_pretrain.py][INFO] Epoch:[0/2](126400/4588595) loss:3.193 lr:0.0000100 epoch_Time:28133.0min: [2024-01-03 06:56:42,806][model8_pretrain.py][INFO] Epoch:[0/2](126400/4588595) loss:3.122 lr:0.0000100 epoch_Time:28133.0min: [2024-01-03 06:56:42,807][model8_pretrain.py][INFO] Epoch:[0/2](126400/4588595) loss:3.154 lr:0.0000100 epoch_Time:28133.0min: [2024-01-03 06:56:42,807][model8_pretrain.py][INFO] Epoch:[0/2](126400/4588595) loss:3.541 lr:0.0000100 epoch_Time:28133.0min: [2024-01-03 06:57:19,765][model8_pretrain.py][INFO] Epoch:[0/2](126500/4588595) loss:3.096 lr:0.0000100 
epoch_Time:28132.0min: [2024-01-03 06:57:19,765][model8_pretrain.py][INFO] Epoch:[0/2](126500/4588595) loss:2.916 lr:0.0000100 epoch_Time:28132.0min: [2024-01-03 06:57:19,765][model8_pretrain.py][INFO] Epoch:[0/2](126500/4588595) loss:3.282 lr:0.0000100 epoch_Time:28132.0min: [2024-01-03 06:57:19,765][model8_pretrain.py][INFO] Epoch:[0/2](126500/4588595) loss:3.096 lr:0.0000100 epoch_Time:28132.0min: [2024-01-03 06:57:19,765][model8_pretrain.py][INFO] Epoch:[0/2](126500/4588595) loss:2.322 lr:0.0000100 epoch_Time:28132.0min: [2024-01-03 06:57:19,765][model8_pretrain.py][INFO] Epoch:[0/2](126500/4588595) loss:2.725 lr:0.0000100 epoch_Time:28132.0min: [2024-01-03 06:57:19,765][model8_pretrain.py][INFO] Epoch:[0/2](126500/4588595) loss:2.641 lr:0.0000100 epoch_Time:28132.0min: [2024-01-03 06:57:19,766][model8_pretrain.py][INFO] Epoch:[0/2](126500/4588595) loss:2.506 lr:0.0000100 epoch_Time:28132.0min: [2024-01-03 06:58:03,794][model8_pretrain.py][INFO] Epoch:[0/2](126600/4588595) loss:2.899 lr:0.0000100 epoch_Time:28134.0min: [2024-01-03 06:58:03,794][model8_pretrain.py][INFO] Epoch:[0/2](126600/4588595) loss:2.936 lr:0.0000100 epoch_Time:28134.0min: [2024-01-03 06:58:03,794][model8_pretrain.py][INFO] Epoch:[0/2](126600/4588595) loss:3.567 lr:0.0000100 epoch_Time:28134.0min: [2024-01-03 06:58:03,794][model8_pretrain.py][INFO] Epoch:[0/2](126600/4588595) loss:2.804 lr:0.0000100 epoch_Time:28134.0min: [2024-01-03 06:58:03,794][model8_pretrain.py][INFO] Epoch:[0/2](126600/4588595) loss:2.505 lr:0.0000100 epoch_Time:28134.0min: [2024-01-03 06:58:03,795][model8_pretrain.py][INFO] Epoch:[0/2](126600/4588595) loss:3.233 lr:0.0000100 epoch_Time:28134.0min: [2024-01-03 06:58:03,799][model8_pretrain.py][INFO] Epoch:[0/2](126600/4588595) loss:2.653 lr:0.0000100 epoch_Time:28134.0min: [2024-01-03 06:58:03,799][model8_pretrain.py][INFO] Epoch:[0/2](126600/4588595) loss:3.303 lr:0.0000100 epoch_Time:28134.0min: [2024-01-03 06:58:42,468][model8_pretrain.py][INFO] 
Epoch:[0/2](126700/4588595) loss:3.502 lr:0.0000100 epoch_Time:28135.0min: [2024-01-03 06:58:42,468][model8_pretrain.py][INFO] Epoch:[0/2](126700/4588595) loss:2.819 lr:0.0000100 epoch_Time:28135.0min: [2024-01-03 06:58:42,468][model8_pretrain.py][INFO] Epoch:[0/2](126700/4588595) loss:3.031 lr:0.0000100 epoch_Time:28135.0min: [2024-01-03 06:58:42,468][model8_pretrain.py][INFO] Epoch:[0/2](126700/4588595) loss:2.863 lr:0.0000100 epoch_Time:28135.0min: [2024-01-03 06:58:42,468][model8_pretrain.py][INFO] Epoch:[0/2](126700/4588595) loss:2.956 lr:0.0000100 epoch_Time:28135.0min: [2024-01-03 06:58:42,468][model8_pretrain.py][INFO] Epoch:[0/2](126700/4588595) loss:2.953 lr:0.0000100 epoch_Time:28135.0min: [2024-01-03 06:58:42,468][model8_pretrain.py][INFO] Epoch:[0/2](126700/4588595) loss:3.268 lr:0.0000100 epoch_Time:28135.0min: [2024-01-03 06:58:42,468][model8_pretrain.py][INFO] Epoch:[0/2](126700/4588595) loss:3.024 lr:0.0000100 epoch_Time:28135.0min: [2024-01-03 06:59:19,414][model8_pretrain.py][INFO] Epoch:[0/2](126800/4588595) loss:3.184 lr:0.0000100 epoch_Time:28133.0min: [2024-01-03 06:59:19,414][model8_pretrain.py][INFO] Epoch:[0/2](126800/4588595) loss:2.237 lr:0.0000100 epoch_Time:28133.0min: [2024-01-03 06:59:19,414][model8_pretrain.py][INFO] Epoch:[0/2](126800/4588595) loss:2.920 lr:0.0000100 epoch_Time:28133.0min: [2024-01-03 06:59:19,414][model8_pretrain.py][INFO] Epoch:[0/2](126800/4588595) loss:2.734 lr:0.0000100 epoch_Time:28133.0min: [2024-01-03 06:59:19,414][model8_pretrain.py][INFO] Epoch:[0/2](126800/4588595) loss:3.046 lr:0.0000100 epoch_Time:28133.0min: [2024-01-03 06:59:19,414][model8_pretrain.py][INFO] Epoch:[0/2](126800/4588595) loss:2.772 lr:0.0000100 epoch_Time:28133.0min: [2024-01-03 06:59:19,414][model8_pretrain.py][INFO] Epoch:[0/2](126800/4588595) loss:3.614 lr:0.0000100 epoch_Time:28133.0min: [2024-01-03 06:59:19,414][model8_pretrain.py][INFO] Epoch:[0/2](126800/4588595) loss:2.896 lr:0.0000100 epoch_Time:28133.0min: [2024-01-03 
06:59:56,351][model8_pretrain.py][INFO] Epoch:[0/2](126900/4588595) loss:3.325 lr:0.0000100 epoch_Time:28132.0min: [2024-01-03 06:59:56,351][model8_pretrain.py][INFO] Epoch:[0/2](126900/4588595) loss:2.690 lr:0.0000100 epoch_Time:28132.0min: [2024-01-03 06:59:56,351][model8_pretrain.py][INFO] Epoch:[0/2](126900/4588595) loss:3.192 lr:0.0000100 epoch_Time:28132.0min: [2024-01-03 06:59:56,351][model8_pretrain.py][INFO] Epoch:[0/2](126900/4588595) loss:3.444 lr:0.0000100 epoch_Time:28132.0min: [2024-01-03 06:59:56,351][model8_pretrain.py][INFO] Epoch:[0/2](126900/4588595) loss:2.879 lr:0.0000100 epoch_Time:28132.0min: [2024-01-03 06:59:56,351][model8_pretrain.py][INFO] Epoch:[0/2](126900/4588595) loss:2.876 lr:0.0000100 epoch_Time:28132.0min: [2024-01-03 06:59:56,352][model8_pretrain.py][INFO] Epoch:[0/2](126900/4588595) loss:3.237 lr:0.0000100 epoch_Time:28132.0min: [2024-01-03 06:59:56,352][model8_pretrain.py][INFO] Epoch:[0/2](126900/4588595) loss:3.029 lr:0.0000100 epoch_Time:28132.0min: [2024-01-03 07:00:33,286][model8_pretrain.py][INFO] Epoch:[0/2](127000/4588595) loss:2.798 lr:0.0000100 epoch_Time:28131.0min: [2024-01-03 07:00:33,286][model8_pretrain.py][INFO] Epoch:[0/2](127000/4588595) loss:3.091 lr:0.0000100 epoch_Time:28131.0min: [2024-01-03 07:00:33,286][model8_pretrain.py][INFO] Epoch:[0/2](127000/4588595) loss:3.597 lr:0.0000100 epoch_Time:28131.0min: [2024-01-03 07:00:33,286][model8_pretrain.py][INFO] Epoch:[0/2](127000/4588595) loss:2.807 lr:0.0000100 epoch_Time:28131.0min: [2024-01-03 07:00:33,286][model8_pretrain.py][INFO] Epoch:[0/2](127000/4588595) loss:3.120 lr:0.0000100 epoch_Time:28131.0min: [2024-01-03 07:00:33,286][model8_pretrain.py][INFO] Epoch:[0/2](127000/4588595) loss:2.657 lr:0.0000100 epoch_Time:28131.0min: [2024-01-03 07:00:33,286][model8_pretrain.py][INFO] Epoch:[0/2](127000/4588595) loss:3.153 lr:0.0000100 epoch_Time:28131.0min: [2024-01-03 07:00:33,286][model8_pretrain.py][INFO] Epoch:[0/2](127000/4588595) loss:3.359 lr:0.0000100 
epoch_Time:28131.0min: [2024-01-03 07:01:10,228][model8_pretrain.py][INFO] Epoch:[0/2](127100/4588595) loss:3.072 lr:0.0000100 epoch_Time:28130.0min: [2024-01-03 07:01:10,228][model8_pretrain.py][INFO] Epoch:[0/2](127100/4588595) loss:2.824 lr:0.0000100 epoch_Time:28130.0min: [2024-01-03 07:01:10,228][model8_pretrain.py][INFO] Epoch:[0/2](127100/4588595) loss:2.789 lr:0.0000100 epoch_Time:28130.0min: [2024-01-03 07:01:10,228][model8_pretrain.py][INFO] Epoch:[0/2](127100/4588595) loss:3.195 lr:0.0000100 epoch_Time:28130.0min: [2024-01-03 07:01:10,228][model8_pretrain.py][INFO] Epoch:[0/2](127100/4588595) loss:3.001 lr:0.0000100 epoch_Time:28130.0min: [2024-01-03 07:01:10,228][model8_pretrain.py][INFO] Epoch:[0/2](127100/4588595) loss:3.083 lr:0.0000100 epoch_Time:28130.0min: [2024-01-03 07:01:10,228][model8_pretrain.py][INFO] Epoch:[0/2](127100/4588595) loss:2.490 lr:0.0000100 epoch_Time:28130.0min: [2024-01-03 07:01:10,228][model8_pretrain.py][INFO] Epoch:[0/2](127100/4588595) loss:2.589 lr:0.0000100 epoch_Time:28130.0min: [2024-01-03 07:01:47,180][model8_pretrain.py][INFO] Epoch:[0/2](127200/4588595) loss:3.391 lr:0.0000100 epoch_Time:28129.0min: [2024-01-03 07:01:47,180][model8_pretrain.py][INFO] Epoch:[0/2](127200/4588595) loss:3.195 lr:0.0000100 epoch_Time:28129.0min: [2024-01-03 07:01:47,180][model8_pretrain.py][INFO] Epoch:[0/2](127200/4588595) loss:3.480 lr:0.0000100 epoch_Time:28129.0min: [2024-01-03 07:01:47,180][model8_pretrain.py][INFO] Epoch:[0/2](127200/4588595) loss:2.558 lr:0.0000100 epoch_Time:28129.0min: [2024-01-03 07:01:47,180][model8_pretrain.py][INFO] Epoch:[0/2](127200/4588595) loss:2.675 lr:0.0000100 epoch_Time:28129.0min: [2024-01-03 07:01:47,180][model8_pretrain.py][INFO] Epoch:[0/2](127200/4588595) loss:2.701 lr:0.0000100 epoch_Time:28129.0min: [2024-01-03 07:01:47,180][model8_pretrain.py][INFO] Epoch:[0/2](127200/4588595) loss:3.367 lr:0.0000100 epoch_Time:28129.0min: [2024-01-03 07:01:47,180][model8_pretrain.py][INFO] 
Epoch:[0/2](127200/4588595) loss:2.942 lr:0.0000100 epoch_Time:28129.0min: [2024-01-03 07:02:24,117][model8_pretrain.py][INFO] Epoch:[0/2](127300/4588595) loss:2.901 lr:0.0000100 epoch_Time:28128.0min: [2024-01-03 07:02:24,117][model8_pretrain.py][INFO] Epoch:[0/2](127300/4588595) loss:3.199 lr:0.0000100 epoch_Time:28128.0min: [2024-01-03 07:02:24,117][model8_pretrain.py][INFO] Epoch:[0/2](127300/4588595) loss:2.967 lr:0.0000100 epoch_Time:28128.0min: [2024-01-03 07:02:24,117][model8_pretrain.py][INFO] Epoch:[0/2](127300/4588595) loss:2.934 lr:0.0000100 epoch_Time:28128.0min: [2024-01-03 07:02:24,117][model8_pretrain.py][INFO] Epoch:[0/2](127300/4588595) loss:2.572 lr:0.0000100 epoch_Time:28128.0min: [2024-01-03 07:02:24,117][model8_pretrain.py][INFO] Epoch:[0/2](127300/4588595) loss:2.821 lr:0.0000100 epoch_Time:28128.0min: [2024-01-03 07:02:24,117][model8_pretrain.py][INFO] Epoch:[0/2](127300/4588595) loss:3.047 lr:0.0000100 epoch_Time:28128.0min: [2024-01-03 07:02:24,117][model8_pretrain.py][INFO] Epoch:[0/2](127300/4588595) loss:3.412 lr:0.0000100 epoch_Time:28128.0min: [2024-01-03 07:03:04,473][model8_pretrain.py][INFO] Epoch:[0/2](127400/4588595) loss:3.048 lr:0.0000100 epoch_Time:28128.0min: [2024-01-03 07:03:04,473][model8_pretrain.py][INFO] Epoch:[0/2](127400/4588595) loss:3.062 lr:0.0000100 epoch_Time:28128.0min: [2024-01-03 07:03:04,473][model8_pretrain.py][INFO] Epoch:[0/2](127400/4588595) loss:3.363 lr:0.0000100 epoch_Time:28128.0min: [2024-01-03 07:03:04,473][model8_pretrain.py][INFO] Epoch:[0/2](127400/4588595) loss:2.946 lr:0.0000100 epoch_Time:28128.0min: [2024-01-03 07:03:04,473][model8_pretrain.py][INFO] Epoch:[0/2](127400/4588595) loss:3.161 lr:0.0000100 epoch_Time:28128.0min: [2024-01-03 07:03:04,473][model8_pretrain.py][INFO] Epoch:[0/2](127400/4588595) loss:3.116 lr:0.0000100 epoch_Time:28128.0min: [2024-01-03 07:03:04,473][model8_pretrain.py][INFO] Epoch:[0/2](127400/4588595) loss:3.274 lr:0.0000100 epoch_Time:28128.0min: [2024-01-03 
07:03:04,473][model8_pretrain.py][INFO] Epoch:[0/2](127400/4588595) loss:2.534 lr:0.0000100 epoch_Time:28128.0min: [2024-01-03 07:03:46,693][model8_pretrain.py][INFO] Epoch:[0/2](127500/4588595) loss:3.055 lr:0.0000100 epoch_Time:28131.0min: [2024-01-03 07:03:46,693][model8_pretrain.py][INFO] Epoch:[0/2](127500/4588595) loss:3.211 lr:0.0000100 epoch_Time:28131.0min: [2024-01-03 07:03:46,693][model8_pretrain.py][INFO] Epoch:[0/2](127500/4588595) loss:2.047 lr:0.0000100 epoch_Time:28131.0min: [2024-01-03 07:03:46,693][model8_pretrain.py][INFO] Epoch:[0/2](127500/4588595) loss:3.158 lr:0.0000100 epoch_Time:28131.0min: [2024-01-03 07:03:46,693][model8_pretrain.py][INFO] Epoch:[0/2](127500/4588595) loss:3.172 lr:0.0000100 epoch_Time:28131.0min: [2024-01-03 07:03:46,693][model8_pretrain.py][INFO] Epoch:[0/2](127500/4588595) loss:3.081 lr:0.0000100 epoch_Time:28131.0min: [2024-01-03 07:03:46,694][model8_pretrain.py][INFO] Epoch:[0/2](127500/4588595) loss:3.295 lr:0.0000100 epoch_Time:28131.0min: [2024-01-03 07:03:46,694][model8_pretrain.py][INFO] Epoch:[0/2](127500/4588595) loss:2.514 lr:0.0000100 epoch_Time:28131.0min: [2024-01-03 07:04:23,635][model8_pretrain.py][INFO] Epoch:[0/2](127600/4588595) loss:3.497 lr:0.0000100 epoch_Time:28129.0min: [2024-01-03 07:04:23,635][model8_pretrain.py][INFO] Epoch:[0/2](127600/4588595) loss:2.933 lr:0.0000100 epoch_Time:28129.0min: [2024-01-03 07:04:23,635][model8_pretrain.py][INFO] Epoch:[0/2](127600/4588595) loss:2.969 lr:0.0000100 epoch_Time:28129.0min: [2024-01-03 07:04:23,635][model8_pretrain.py][INFO] Epoch:[0/2](127600/4588595) loss:3.444 lr:0.0000100 epoch_Time:28129.0min: [2024-01-03 07:04:23,635][model8_pretrain.py][INFO] Epoch:[0/2](127600/4588595) loss:3.445 lr:0.0000100 epoch_Time:28129.0min: [2024-01-03 07:04:23,635][model8_pretrain.py][INFO] Epoch:[0/2](127600/4588595) loss:3.596 lr:0.0000100 epoch_Time:28129.0min: [2024-01-03 07:04:23,635][model8_pretrain.py][INFO] Epoch:[0/2](127600/4588595) loss:3.122 lr:0.0000100 
epoch_Time:28129.0min: [2024-01-03 07:04:23,638][model8_pretrain.py][INFO] Epoch:[0/2](127600/4588595) loss:2.838 lr:0.0000100 epoch_Time:28129.0min: [2024-01-03 07:05:00,571][model8_pretrain.py][INFO] Epoch:[0/2](127700/4588595) loss:3.393 lr:0.0000100 epoch_Time:28128.0min: [2024-01-03 07:05:00,571][model8_pretrain.py][INFO] Epoch:[0/2](127700/4588595) loss:3.254 lr:0.0000100 epoch_Time:28128.0min: [2024-01-03 07:05:00,571][model8_pretrain.py][INFO] Epoch:[0/2](127700/4588595) loss:3.280 lr:0.0000100 epoch_Time:28128.0min: [2024-01-03 07:05:00,571][model8_pretrain.py][INFO] Epoch:[0/2](127700/4588595) loss:2.999 lr:0.0000100 epoch_Time:28128.0min: [2024-01-03 07:05:00,571][model8_pretrain.py][INFO] Epoch:[0/2](127700/4588595) loss:3.227 lr:0.0000100 epoch_Time:28128.0min: [2024-01-03 07:05:00,571][model8_pretrain.py][INFO] Epoch:[0/2](127700/4588595) loss:3.232 lr:0.0000100 epoch_Time:28128.0min: [2024-01-03 07:05:00,571][model8_pretrain.py][INFO] Epoch:[0/2](127700/4588595) loss:3.049 lr:0.0000100 epoch_Time:28128.0min: [2024-01-03 07:05:00,571][model8_pretrain.py][INFO] Epoch:[0/2](127700/4588595) loss:2.745 lr:0.0000100 epoch_Time:28128.0min: [2024-01-03 07:05:37,516][model8_pretrain.py][INFO] Epoch:[0/2](127800/4588595) loss:2.820 lr:0.0000100 epoch_Time:28127.0min: [2024-01-03 07:05:37,516][model8_pretrain.py][INFO] Epoch:[0/2](127800/4588595) loss:2.855 lr:0.0000100 epoch_Time:28127.0min: [2024-01-03 07:05:37,516][model8_pretrain.py][INFO] Epoch:[0/2](127800/4588595) loss:3.339 lr:0.0000100 epoch_Time:28127.0min: [2024-01-03 07:05:37,516][model8_pretrain.py][INFO] Epoch:[0/2](127800/4588595) loss:3.078 lr:0.0000100 epoch_Time:28127.0min: [2024-01-03 07:05:37,517][model8_pretrain.py][INFO] Epoch:[0/2](127800/4588595) loss:2.581 lr:0.0000100 epoch_Time:28127.0min: [2024-01-03 07:05:37,517][model8_pretrain.py][INFO] Epoch:[0/2](127800/4588595) loss:3.125 lr:0.0000100 epoch_Time:28127.0min: [2024-01-03 07:05:37,517][model8_pretrain.py][INFO] 
Epoch:[0/2](127800/4588595) loss:2.950 lr:0.0000100 epoch_Time:28127.0min: [2024-01-03 07:05:37,517][model8_pretrain.py][INFO] Epoch:[0/2](127800/4588595) loss:2.922 lr:0.0000100 epoch_Time:28127.0min: [2024-01-03 07:06:14,453][model8_pretrain.py][INFO] Epoch:[0/2](127900/4588595) loss:3.207 lr:0.0000100 epoch_Time:28126.0min: [2024-01-03 07:06:14,453][model8_pretrain.py][INFO] Epoch:[0/2](127900/4588595) loss:3.100 lr:0.0000100 epoch_Time:28126.0min: [2024-01-03 07:06:14,453][model8_pretrain.py][INFO] Epoch:[0/2](127900/4588595) loss:2.772 lr:0.0000100 epoch_Time:28126.0min: [2024-01-03 07:06:14,453][model8_pretrain.py][INFO] Epoch:[0/2](127900/4588595) loss:2.730 lr:0.0000100 epoch_Time:28126.0min: [2024-01-03 07:06:14,453][model8_pretrain.py][INFO] Epoch:[0/2](127900/4588595) loss:3.021 lr:0.0000100 epoch_Time:28126.0min: [2024-01-03 07:06:14,453][model8_pretrain.py][INFO] Epoch:[0/2](127900/4588595) loss:2.978 lr:0.0000100 epoch_Time:28126.0min: [2024-01-03 07:06:14,453][model8_pretrain.py][INFO] Epoch:[0/2](127900/4588595) loss:3.037 lr:0.0000100 epoch_Time:28126.0min: [2024-01-03 07:06:14,453][model8_pretrain.py][INFO] Epoch:[0/2](127900/4588595) loss:3.514 lr:0.0000100 epoch_Time:28126.0min: [2024-01-03 07:06:51,410][model8_pretrain.py][INFO] Epoch:[0/2](128000/4588595) loss:2.907 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:06:51,410][model8_pretrain.py][INFO] Epoch:[0/2](128000/4588595) loss:3.068 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:06:51,410][model8_pretrain.py][INFO] Epoch:[0/2](128000/4588595) loss:2.915 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:06:51,410][model8_pretrain.py][INFO] Epoch:[0/2](128000/4588595) loss:3.040 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:06:51,410][model8_pretrain.py][INFO] Epoch:[0/2](128000/4588595) loss:2.796 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:06:51,410][model8_pretrain.py][INFO] Epoch:[0/2](128000/4588595) loss:3.498 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 
07:06:51,410][model8_pretrain.py][INFO] Epoch:[0/2](128000/4588595) loss:3.418 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:06:51,410][model8_pretrain.py][INFO] Epoch:[0/2](128000/4588595) loss:2.923 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:07:28,364][model8_pretrain.py][INFO] Epoch:[0/2](128100/4588595) loss:2.964 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:07:28,364][model8_pretrain.py][INFO] Epoch:[0/2](128100/4588595) loss:2.824 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:07:28,364][model8_pretrain.py][INFO] Epoch:[0/2](128100/4588595) loss:3.243 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:07:28,364][model8_pretrain.py][INFO] Epoch:[0/2](128100/4588595) loss:2.576 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:07:28,364][model8_pretrain.py][INFO] Epoch:[0/2](128100/4588595) loss:3.440 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:07:28,364][model8_pretrain.py][INFO] Epoch:[0/2](128100/4588595) loss:3.091 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:07:28,364][model8_pretrain.py][INFO] Epoch:[0/2](128100/4588595) loss:2.485 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:07:28,364][model8_pretrain.py][INFO] Epoch:[0/2](128100/4588595) loss:3.163 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:08:08,839][model8_pretrain.py][INFO] Epoch:[0/2](128200/4588595) loss:3.140 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:08:08,839][model8_pretrain.py][INFO] Epoch:[0/2](128200/4588595) loss:2.359 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:08:08,839][model8_pretrain.py][INFO] Epoch:[0/2](128200/4588595) loss:3.186 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:08:08,839][model8_pretrain.py][INFO] Epoch:[0/2](128200/4588595) loss:3.069 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:08:08,839][model8_pretrain.py][INFO] Epoch:[0/2](128200/4588595) loss:3.043 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:08:08,839][model8_pretrain.py][INFO] Epoch:[0/2](128200/4588595) loss:2.889 lr:0.0000100 
epoch_Time:28124.0min: [2024-01-03 07:08:08,839][model8_pretrain.py][INFO] Epoch:[0/2](128200/4588595) loss:2.908 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:08:08,839][model8_pretrain.py][INFO] Epoch:[0/2](128200/4588595) loss:2.764 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:08:51,136][model8_pretrain.py][INFO] Epoch:[0/2](128300/4588595) loss:3.219 lr:0.0000100 epoch_Time:28126.0min: [2024-01-03 07:08:51,136][model8_pretrain.py][INFO] Epoch:[0/2](128300/4588595) loss:2.714 lr:0.0000100 epoch_Time:28126.0min: [2024-01-03 07:08:51,137][model8_pretrain.py][INFO] Epoch:[0/2](128300/4588595) loss:3.054 lr:0.0000100 epoch_Time:28126.0min: [2024-01-03 07:08:51,137][model8_pretrain.py][INFO] Epoch:[0/2](128300/4588595) loss:3.391 lr:0.0000100 epoch_Time:28126.0min: [2024-01-03 07:08:51,136][model8_pretrain.py][INFO] Epoch:[0/2](128300/4588595) loss:3.228 lr:0.0000100 epoch_Time:28126.0min: [2024-01-03 07:08:51,137][model8_pretrain.py][INFO] Epoch:[0/2](128300/4588595) loss:3.105 lr:0.0000100 epoch_Time:28126.0min: [2024-01-03 07:08:51,137][model8_pretrain.py][INFO] Epoch:[0/2](128300/4588595) loss:2.563 lr:0.0000100 epoch_Time:28126.0min: [2024-01-03 07:08:51,137][model8_pretrain.py][INFO] Epoch:[0/2](128300/4588595) loss:3.284 lr:0.0000100 epoch_Time:28126.0min: [2024-01-03 07:09:28,059][model8_pretrain.py][INFO] Epoch:[0/2](128400/4588595) loss:2.989 lr:0.0000100 epoch_Time:28125.0min: [2024-01-03 07:09:28,059][model8_pretrain.py][INFO] Epoch:[0/2](128400/4588595) loss:2.685 lr:0.0000100 epoch_Time:28125.0min: [2024-01-03 07:09:28,059][model8_pretrain.py][INFO] Epoch:[0/2](128400/4588595) loss:2.938 lr:0.0000100 epoch_Time:28125.0min: [2024-01-03 07:09:28,059][model8_pretrain.py][INFO] Epoch:[0/2](128400/4588595) loss:2.568 lr:0.0000100 epoch_Time:28125.0min: [2024-01-03 07:09:28,060][model8_pretrain.py][INFO] Epoch:[0/2](128400/4588595) loss:2.819 lr:0.0000100 epoch_Time:28125.0min: [2024-01-03 07:09:28,060][model8_pretrain.py][INFO] 
Epoch:[0/2](128400/4588595) loss:2.955 lr:0.0000100 epoch_Time:28125.0min: [2024-01-03 07:09:28,060][model8_pretrain.py][INFO] Epoch:[0/2](128400/4588595) loss:3.287 lr:0.0000100 epoch_Time:28125.0min: [2024-01-03 07:09:28,061][model8_pretrain.py][INFO] Epoch:[0/2](128400/4588595) loss:3.049 lr:0.0000100 epoch_Time:28125.0min: [2024-01-03 07:10:04,987][model8_pretrain.py][INFO] Epoch:[0/2](128500/4588595) loss:3.070 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:10:04,987][model8_pretrain.py][INFO] Epoch:[0/2](128500/4588595) loss:3.230 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:10:04,987][model8_pretrain.py][INFO] Epoch:[0/2](128500/4588595) loss:3.200 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:10:04,987][model8_pretrain.py][INFO] Epoch:[0/2](128500/4588595) loss:3.544 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:10:04,987][model8_pretrain.py][INFO] Epoch:[0/2](128500/4588595) loss:2.537 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:10:04,987][model8_pretrain.py][INFO] Epoch:[0/2](128500/4588595) loss:2.847 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:10:04,987][model8_pretrain.py][INFO] Epoch:[0/2](128500/4588595) loss:2.469 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:10:04,987][model8_pretrain.py][INFO] Epoch:[0/2](128500/4588595) loss:3.351 lr:0.0000100 epoch_Time:28124.0min: [2024-01-03 07:10:41,922][model8_pretrain.py][INFO] Epoch:[0/2](128600/4588595) loss:2.530 lr:0.0000100 epoch_Time:28123.0min: [2024-01-03 07:10:41,922][model8_pretrain.py][INFO] Epoch:[0/2](128600/4588595) loss:2.868 lr:0.0000100 epoch_Time:28123.0min: [2024-01-03 07:10:41,922][model8_pretrain.py][INFO] Epoch:[0/2](128600/4588595) loss:2.854 lr:0.0000100 epoch_Time:28123.0min: [2024-01-03 07:10:41,922][model8_pretrain.py][INFO] Epoch:[0/2](128600/4588595) loss:2.970 lr:0.0000100 epoch_Time:28123.0min: [2024-01-03 07:10:41,922][model8_pretrain.py][INFO] Epoch:[0/2](128600/4588595) loss:3.188 lr:0.0000100 epoch_Time:28123.0min: [2024-01-03 
07:10:41,922][model8_pretrain.py][INFO] Epoch:[0/2](128600/4588595) loss:3.476 lr:0.0000100 epoch_Time:28123.0min: [2024-01-03 07:10:41,922][model8_pretrain.py][INFO] Epoch:[0/2](128600/4588595) loss:2.972 lr:0.0000100 epoch_Time:28123.0min: [2024-01-03 07:10:41,922][model8_pretrain.py][INFO] Epoch:[0/2](128600/4588595) loss:3.174 lr:0.0000100 epoch_Time:28123.0min: [2024-01-03 07:11:18,854][model8_pretrain.py][INFO] Epoch:[0/2](128700/4588595) loss:3.193 lr:0.0000100 epoch_Time:28122.0min: [2024-01-03 07:11:18,854][model8_pretrain.py][INFO] Epoch:[0/2](128700/4588595) loss:3.115 lr:0.0000100 epoch_Time:28122.0min: [2024-01-03 07:11:18,854][model8_pretrain.py][INFO] Epoch:[0/2](128700/4588595) loss:2.515 lr:0.0000100 epoch_Time:28122.0min: [2024-01-03 07:11:18,854][model8_pretrain.py][INFO] Epoch:[0/2](128700/4588595) loss:3.088 lr:0.0000100 epoch_Time:28122.0min: [2024-01-03 07:11:18,854][model8_pretrain.py][INFO] Epoch:[0/2](128700/4588595) loss:2.914 lr:0.0000100 epoch_Time:28122.0min: [2024-01-03 07:11:18,854][model8_pretrain.py][INFO] Epoch:[0/2](128700/4588595) loss:2.732 lr:0.0000100 epoch_Time:28122.0min: [2024-01-03 07:11:18,854][model8_pretrain.py][INFO] Epoch:[0/2](128700/4588595) loss:3.245 lr:0.0000100 epoch_Time:28122.0min: [2024-01-03 07:11:18,854][model8_pretrain.py][INFO] Epoch:[0/2](128700/4588595) loss:2.812 lr:0.0000100 epoch_Time:28122.0min: [2024-01-03 07:11:55,793][model8_pretrain.py][INFO] Epoch:[0/2](128800/4588595) loss:2.985 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:11:55,793][model8_pretrain.py][INFO] Epoch:[0/2](128800/4588595) loss:3.155 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:11:55,793][model8_pretrain.py][INFO] Epoch:[0/2](128800/4588595) loss:2.728 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:11:55,793][model8_pretrain.py][INFO] Epoch:[0/2](128800/4588595) loss:3.124 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:11:55,793][model8_pretrain.py][INFO] Epoch:[0/2](128800/4588595) loss:3.286 lr:0.0000100 
epoch_Time:28120.0min: [2024-01-03 07:11:55,793][model8_pretrain.py][INFO] Epoch:[0/2](128800/4588595) loss:2.578 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:11:55,793][model8_pretrain.py][INFO] Epoch:[0/2](128800/4588595) loss:3.366 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:11:55,793][model8_pretrain.py][INFO] Epoch:[0/2](128800/4588595) loss:3.179 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:12:32,743][model8_pretrain.py][INFO] Epoch:[0/2](128900/4588595) loss:2.676 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:12:32,743][model8_pretrain.py][INFO] Epoch:[0/2](128900/4588595) loss:2.947 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:12:32,743][model8_pretrain.py][INFO] Epoch:[0/2](128900/4588595) loss:2.737 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:12:32,743][model8_pretrain.py][INFO] Epoch:[0/2](128900/4588595) loss:2.560 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:12:32,743][model8_pretrain.py][INFO] Epoch:[0/2](128900/4588595) loss:2.931 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:12:32,743][model8_pretrain.py][INFO] Epoch:[0/2](128900/4588595) loss:2.598 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:12:32,743][model8_pretrain.py][INFO] Epoch:[0/2](128900/4588595) loss:2.800 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:12:32,743][model8_pretrain.py][INFO] Epoch:[0/2](128900/4588595) loss:2.983 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:13:13,109][model8_pretrain.py][INFO] Epoch:[0/2](129000/4588595) loss:2.575 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:13:13,109][model8_pretrain.py][INFO] Epoch:[0/2](129000/4588595) loss:2.579 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:13:13,109][model8_pretrain.py][INFO] Epoch:[0/2](129000/4588595) loss:2.459 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:13:13,109][model8_pretrain.py][INFO] Epoch:[0/2](129000/4588595) loss:3.241 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:13:13,109][model8_pretrain.py][INFO] 
Epoch:[0/2](129000/4588595) loss:3.322 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:13:13,109][model8_pretrain.py][INFO] Epoch:[0/2](129000/4588595) loss:2.883 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:13:13,109][model8_pretrain.py][INFO] Epoch:[0/2](129000/4588595) loss:2.893 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:13:13,114][model8_pretrain.py][INFO] Epoch:[0/2](129000/4588595) loss:2.937 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:13:55,343][model8_pretrain.py][INFO] Epoch:[0/2](129100/4588595) loss:3.430 lr:0.0000100 epoch_Time:28122.0min: [2024-01-03 07:13:55,343][model8_pretrain.py][INFO] Epoch:[0/2](129100/4588595) loss:3.484 lr:0.0000100 epoch_Time:28122.0min: [2024-01-03 07:13:55,343][model8_pretrain.py][INFO] Epoch:[0/2](129100/4588595) loss:3.444 lr:0.0000100 epoch_Time:28122.0min: [2024-01-03 07:13:55,343][model8_pretrain.py][INFO] Epoch:[0/2](129100/4588595) loss:3.472 lr:0.0000100 epoch_Time:28122.0min: [2024-01-03 07:13:55,343][model8_pretrain.py][INFO] Epoch:[0/2](129100/4588595) loss:2.429 lr:0.0000100 epoch_Time:28122.0min: [2024-01-03 07:13:55,343][model8_pretrain.py][INFO] Epoch:[0/2](129100/4588595) loss:2.979 lr:0.0000100 epoch_Time:28122.0min: [2024-01-03 07:13:55,343][model8_pretrain.py][INFO] Epoch:[0/2](129100/4588595) loss:3.049 lr:0.0000100 epoch_Time:28122.0min: [2024-01-03 07:13:55,343][model8_pretrain.py][INFO] Epoch:[0/2](129100/4588595) loss:2.990 lr:0.0000100 epoch_Time:28122.0min: [2024-01-03 07:14:32,275][model8_pretrain.py][INFO] Epoch:[0/2](129200/4588595) loss:3.544 lr:0.0000100 epoch_Time:28121.0min: [2024-01-03 07:14:32,275][model8_pretrain.py][INFO] Epoch:[0/2](129200/4588595) loss:3.218 lr:0.0000100 epoch_Time:28121.0min: [2024-01-03 07:14:32,275][model8_pretrain.py][INFO] Epoch:[0/2](129200/4588595) loss:2.185 lr:0.0000100 epoch_Time:28121.0min: [2024-01-03 07:14:32,275][model8_pretrain.py][INFO] Epoch:[0/2](129200/4588595) loss:3.000 lr:0.0000100 epoch_Time:28121.0min: [2024-01-03 
07:14:32,275][model8_pretrain.py][INFO] Epoch:[0/2](129200/4588595) loss:3.004 lr:0.0000100 epoch_Time:28121.0min: [2024-01-03 07:14:32,275][model8_pretrain.py][INFO] Epoch:[0/2](129200/4588595) loss:2.739 lr:0.0000100 epoch_Time:28121.0min: [2024-01-03 07:14:32,275][model8_pretrain.py][INFO] Epoch:[0/2](129200/4588595) loss:2.581 lr:0.0000100 epoch_Time:28121.0min: [2024-01-03 07:14:32,275][model8_pretrain.py][INFO] Epoch:[0/2](129200/4588595) loss:2.840 lr:0.0000100 epoch_Time:28121.0min: [2024-01-03 07:15:09,214][model8_pretrain.py][INFO] Epoch:[0/2](129300/4588595) loss:2.482 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:15:09,214][model8_pretrain.py][INFO] Epoch:[0/2](129300/4588595) loss:3.174 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:15:09,214][model8_pretrain.py][INFO] Epoch:[0/2](129300/4588595) loss:3.034 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:15:09,214][model8_pretrain.py][INFO] Epoch:[0/2](129300/4588595) loss:3.009 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:15:09,214][model8_pretrain.py][INFO] Epoch:[0/2](129300/4588595) loss:2.657 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:15:09,214][model8_pretrain.py][INFO] Epoch:[0/2](129300/4588595) loss:2.908 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:15:09,214][model8_pretrain.py][INFO] Epoch:[0/2](129300/4588595) loss:3.058 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:15:09,215][model8_pretrain.py][INFO] Epoch:[0/2](129300/4588595) loss:2.772 lr:0.0000100 epoch_Time:28120.0min: [2024-01-03 07:15:46,165][model8_pretrain.py][INFO] Epoch:[0/2](129400/4588595) loss:3.453 lr:0.0000100 epoch_Time:28119.0min: [2024-01-03 07:15:46,165][model8_pretrain.py][INFO] Epoch:[0/2](129400/4588595) loss:2.551 lr:0.0000100 epoch_Time:28119.0min: [2024-01-03 07:15:46,165][model8_pretrain.py][INFO] Epoch:[0/2](129400/4588595) loss:3.118 lr:0.0000100 epoch_Time:28119.0min: [2024-01-03 07:15:46,165][model8_pretrain.py][INFO] Epoch:[0/2](129400/4588595) loss:3.041 lr:0.0000100 
epoch_Time:28119.0min: [2024-01-03 07:15:46,165][model8_pretrain.py][INFO] Epoch:[0/2](129400/4588595) loss:2.814 lr:0.0000100 epoch_Time:28119.0min: [2024-01-03 07:15:46,165][model8_pretrain.py][INFO] Epoch:[0/2](129400/4588595) loss:3.144 lr:0.0000100 epoch_Time:28119.0min: [2024-01-03 07:15:46,165][model8_pretrain.py][INFO] Epoch:[0/2](129400/4588595) loss:2.876 lr:0.0000100 epoch_Time:28119.0min: [2024-01-03 07:15:46,165][model8_pretrain.py][INFO] Epoch:[0/2](129400/4588595) loss:2.832 lr:0.0000100 epoch_Time:28119.0min: [2024-01-03 07:16:23,117][model8_pretrain.py][INFO] Epoch:[0/2](129500/4588595) loss:3.083 lr:0.0000100 epoch_Time:28118.0min: [2024-01-03 07:16:23,117][model8_pretrain.py][INFO] Epoch:[0/2](129500/4588595) loss:2.651 lr:0.0000100 epoch_Time:28118.0min: [2024-01-03 07:16:23,117][model8_pretrain.py][INFO] Epoch:[0/2](129500/4588595) loss:2.940 lr:0.0000100 epoch_Time:28118.0min: [2024-01-03 07:16:23,117][model8_pretrain.py][INFO] Epoch:[0/2](129500/4588595) loss:3.074 lr:0.0000100 epoch_Time:28118.0min: [2024-01-03 07:16:23,117][model8_pretrain.py][INFO] Epoch:[0/2](129500/4588595) loss:2.991 lr:0.0000100 epoch_Time:28118.0min: [2024-01-03 07:16:23,117][model8_pretrain.py][INFO] Epoch:[0/2](129500/4588595) loss:3.277 lr:0.0000100 epoch_Time:28118.0min: [2024-01-03 07:16:23,117][model8_pretrain.py][INFO] Epoch:[0/2](129500/4588595) loss:3.086 lr:0.0000100 epoch_Time:28118.0min: [2024-01-03 07:16:23,117][model8_pretrain.py][INFO] Epoch:[0/2](129500/4588595) loss:2.831 lr:0.0000100 epoch_Time:28118.0min: [2024-01-03 07:17:00,049][model8_pretrain.py][INFO] Epoch:[0/2](129600/4588595) loss:3.257 lr:0.0000100 epoch_Time:28116.0min: [2024-01-03 07:17:00,049][model8_pretrain.py][INFO] Epoch:[0/2](129600/4588595) loss:3.264 lr:0.0000100 epoch_Time:28116.0min: [2024-01-03 07:17:00,049][model8_pretrain.py][INFO] Epoch:[0/2](129600/4588595) loss:2.931 lr:0.0000100 epoch_Time:28116.0min: [2024-01-03 07:17:00,049][model8_pretrain.py][INFO] 
Epoch:[0/2](129600/4588595) loss:2.996 lr:0.0000100 epoch_Time:28116.0min: [2024-01-03 07:17:00,049][model8_pretrain.py][INFO] Epoch:[0/2](129600/4588595) loss:2.876 lr:0.0000100 epoch_Time:28116.0min: [2024-01-03 07:17:00,049][model8_pretrain.py][INFO] Epoch:[0/2](129600/4588595) loss:2.191 lr:0.0000100 epoch_Time:28116.0min: [2024-01-03 07:17:00,049][model8_pretrain.py][INFO] Epoch:[0/2](129600/4588595) loss:2.623 lr:0.0000100 epoch_Time:28116.0min: [2024-01-03 07:17:00,050][model8_pretrain.py][INFO] Epoch:[0/2](129600/4588595) loss:3.154 lr:0.0000100 epoch_Time:28116.0min: [2024-01-03 07:17:36,988][model8_pretrain.py][INFO] Epoch:[0/2](129700/4588595) loss:2.921 lr:0.0000100 epoch_Time:28116.0min: [2024-01-03 07:17:36,988][model8_pretrain.py][INFO] Epoch:[0/2](129700/4588595) loss:3.268 lr:0.0000100 epoch_Time:28116.0min: [2024-01-03 07:17:36,988][model8_pretrain.py][INFO] Epoch:[0/2](129700/4588595) loss:3.260 lr:0.0000100 epoch_Time:28116.0min: [2024-01-03 07:17:36,988][model8_pretrain.py][INFO] Epoch:[0/2](129700/4588595) loss:2.777 lr:0.0000100 epoch_Time:28116.0min: [2024-01-03 07:17:36,988][model8_pretrain.py][INFO] Epoch:[0/2](129700/4588595) loss:3.020 lr:0.0000100 epoch_Time:28116.0min: [2024-01-03 07:17:36,988][model8_pretrain.py][INFO] Epoch:[0/2](129700/4588595) loss:3.034 lr:0.0000100 epoch_Time:28116.0min: [2024-01-03 07:17:36,988][model8_pretrain.py][INFO] Epoch:[0/2](129700/4588595) loss:2.374 lr:0.0000100 epoch_Time:28116.0min: [2024-01-03 07:17:36,988][model8_pretrain.py][INFO] Epoch:[0/2](129700/4588595) loss:3.289 lr:0.0000100 epoch_Time:28116.0min: [2024-01-03 07:18:13,906][model8_pretrain.py][INFO] Epoch:[0/2](129800/4588595) loss:3.126 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:18:13,906][model8_pretrain.py][INFO] Epoch:[0/2](129800/4588595) loss:3.138 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:18:13,906][model8_pretrain.py][INFO] Epoch:[0/2](129800/4588595) loss:3.260 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 
07:18:13,906][model8_pretrain.py][INFO] Epoch:[0/2](129800/4588595) loss:3.070 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:18:13,906][model8_pretrain.py][INFO] Epoch:[0/2](129800/4588595) loss:2.905 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:18:13,906][model8_pretrain.py][INFO] Epoch:[0/2](129800/4588595) loss:3.359 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:18:13,907][model8_pretrain.py][INFO] Epoch:[0/2](129800/4588595) loss:2.773 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:18:13,907][model8_pretrain.py][INFO] Epoch:[0/2](129800/4588595) loss:3.265 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:18:59,720][model8_pretrain.py][INFO] Epoch:[0/2](129900/4588595) loss:3.007 lr:0.0000100 epoch_Time:28118.0min: [2024-01-03 07:18:59,720][model8_pretrain.py][INFO] Epoch:[0/2](129900/4588595) loss:3.039 lr:0.0000100 epoch_Time:28118.0min: [2024-01-03 07:18:59,720][model8_pretrain.py][INFO] Epoch:[0/2](129900/4588595) loss:2.628 lr:0.0000100 epoch_Time:28118.0min: [2024-01-03 07:18:59,720][model8_pretrain.py][INFO] Epoch:[0/2](129900/4588595) loss:2.828 lr:0.0000100 epoch_Time:28118.0min: [2024-01-03 07:18:59,720][model8_pretrain.py][INFO] Epoch:[0/2](129900/4588595) loss:3.271 lr:0.0000100 epoch_Time:28118.0min: [2024-01-03 07:18:59,720][model8_pretrain.py][INFO] Epoch:[0/2](129900/4588595) loss:2.124 lr:0.0000100 epoch_Time:28118.0min: [2024-01-03 07:18:59,720][model8_pretrain.py][INFO] Epoch:[0/2](129900/4588595) loss:2.924 lr:0.0000100 epoch_Time:28118.0min: [2024-01-03 07:18:59,720][model8_pretrain.py][INFO] Epoch:[0/2](129900/4588595) loss:2.765 lr:0.0000100 epoch_Time:28118.0min: [2024-01-03 07:19:36,639][model8_pretrain.py][INFO] Epoch:[0/2](130000/4588595) loss:3.052 lr:0.0000100 epoch_Time:28117.0min: [2024-01-03 07:19:36,639][model8_pretrain.py][INFO] Epoch:[0/2](130000/4588595) loss:2.840 lr:0.0000100 epoch_Time:28117.0min: [2024-01-03 07:19:36,639][model8_pretrain.py][INFO] Epoch:[0/2](130000/4588595) loss:3.232 lr:0.0000100 
epoch_Time:28117.0min: [2024-01-03 07:19:36,639][model8_pretrain.py][INFO] Epoch:[0/2](130000/4588595) loss:2.978 lr:0.0000100 epoch_Time:28117.0min: [2024-01-03 07:19:36,639][model8_pretrain.py][INFO] Epoch:[0/2](130000/4588595) loss:3.009 lr:0.0000100 epoch_Time:28117.0min: [2024-01-03 07:19:36,639][model8_pretrain.py][INFO] Epoch:[0/2](130000/4588595) loss:3.157 lr:0.0000100 epoch_Time:28117.0min: [2024-01-03 07:19:36,640][model8_pretrain.py][INFO] Epoch:[0/2](130000/4588595) loss:2.797 lr:0.0000100 epoch_Time:28117.0min: [2024-01-03 07:19:36,640][model8_pretrain.py][INFO] Epoch:[0/2](130000/4588595) loss:2.417 lr:0.0000100 epoch_Time:28117.0min: [2024-01-03 07:20:13,561][model8_pretrain.py][INFO] Epoch:[0/2](130100/4588595) loss:2.751 lr:0.0000100 epoch_Time:28116.0min: [2024-01-03 07:20:13,562][model8_pretrain.py][INFO] Epoch:[0/2](130100/4588595) loss:2.617 lr:0.0000100 epoch_Time:28116.0min: [2024-01-03 07:20:13,562][model8_pretrain.py][INFO] Epoch:[0/2](130100/4588595) loss:3.204 lr:0.0000100 epoch_Time:28116.0min: [2024-01-03 07:20:13,562][model8_pretrain.py][INFO] Epoch:[0/2](130100/4588595) loss:3.077 lr:0.0000100 epoch_Time:28116.0min: [2024-01-03 07:20:13,562][model8_pretrain.py][INFO] Epoch:[0/2](130100/4588595) loss:3.260 lr:0.0000100 epoch_Time:28116.0min: [2024-01-03 07:20:13,562][model8_pretrain.py][INFO] Epoch:[0/2](130100/4588595) loss:3.239 lr:0.0000100 epoch_Time:28116.0min: [2024-01-03 07:20:13,562][model8_pretrain.py][INFO] Epoch:[0/2](130100/4588595) loss:2.928 lr:0.0000100 epoch_Time:28116.0min: [2024-01-03 07:20:13,562][model8_pretrain.py][INFO] Epoch:[0/2](130100/4588595) loss:2.802 lr:0.0000100 epoch_Time:28116.0min: [2024-01-03 07:20:50,490][model8_pretrain.py][INFO] Epoch:[0/2](130200/4588595) loss:3.008 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:20:50,490][model8_pretrain.py][INFO] Epoch:[0/2](130200/4588595) loss:3.371 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:20:50,490][model8_pretrain.py][INFO] 
Epoch:[0/2](130200/4588595) loss:3.200 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:20:50,490][model8_pretrain.py][INFO] Epoch:[0/2](130200/4588595) loss:3.093 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:20:50,490][model8_pretrain.py][INFO] Epoch:[0/2](130200/4588595) loss:3.590 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:20:50,490][model8_pretrain.py][INFO] Epoch:[0/2](130200/4588595) loss:3.018 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:20:50,490][model8_pretrain.py][INFO] Epoch:[0/2](130200/4588595) loss:3.500 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:20:50,490][model8_pretrain.py][INFO] Epoch:[0/2](130200/4588595) loss:2.625 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:21:27,420][model8_pretrain.py][INFO] Epoch:[0/2](130300/4588595) loss:2.840 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:21:27,420][model8_pretrain.py][INFO] Epoch:[0/2](130300/4588595) loss:3.194 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:21:27,420][model8_pretrain.py][INFO] Epoch:[0/2](130300/4588595) loss:3.115 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:21:27,420][model8_pretrain.py][INFO] Epoch:[0/2](130300/4588595) loss:2.864 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:21:27,420][model8_pretrain.py][INFO] Epoch:[0/2](130300/4588595) loss:2.678 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:21:27,420][model8_pretrain.py][INFO] Epoch:[0/2](130300/4588595) loss:2.715 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:21:27,421][model8_pretrain.py][INFO] Epoch:[0/2](130300/4588595) loss:2.812 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:21:27,421][model8_pretrain.py][INFO] Epoch:[0/2](130300/4588595) loss:3.296 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:22:04,345][model8_pretrain.py][INFO] Epoch:[0/2](130400/4588595) loss:2.919 lr:0.0000100 epoch_Time:28112.0min: [2024-01-03 07:22:04,345][model8_pretrain.py][INFO] Epoch:[0/2](130400/4588595) loss:2.989 lr:0.0000100 epoch_Time:28112.0min: [2024-01-03 
07:22:04,345][model8_pretrain.py][INFO] Epoch:[0/2](130400/4588595) loss:3.120 lr:0.0000100 epoch_Time:28112.0min: [2024-01-03 07:22:04,345][model8_pretrain.py][INFO] Epoch:[0/2](130400/4588595) loss:3.140 lr:0.0000100 epoch_Time:28112.0min: [2024-01-03 07:22:04,345][model8_pretrain.py][INFO] Epoch:[0/2](130400/4588595) loss:2.783 lr:0.0000100 epoch_Time:28112.0min: [2024-01-03 07:22:04,345][model8_pretrain.py][INFO] Epoch:[0/2](130400/4588595) loss:2.891 lr:0.0000100 epoch_Time:28112.0min: [2024-01-03 07:22:04,345][model8_pretrain.py][INFO] Epoch:[0/2](130400/4588595) loss:3.068 lr:0.0000100 epoch_Time:28112.0min: [2024-01-03 07:22:04,345][model8_pretrain.py][INFO] Epoch:[0/2](130400/4588595) loss:2.790 lr:0.0000100 epoch_Time:28112.0min: [2024-01-03 07:22:41,278][model8_pretrain.py][INFO] Epoch:[0/2](130500/4588595) loss:3.251 lr:0.0000100 epoch_Time:28112.0min: [2024-01-03 07:22:41,278][model8_pretrain.py][INFO] Epoch:[0/2](130500/4588595) loss:2.526 lr:0.0000100 epoch_Time:28112.0min: [2024-01-03 07:22:41,278][model8_pretrain.py][INFO] Epoch:[0/2](130500/4588595) loss:2.807 lr:0.0000100 epoch_Time:28112.0min: [2024-01-03 07:22:41,278][model8_pretrain.py][INFO] Epoch:[0/2](130500/4588595) loss:2.657 lr:0.0000100 epoch_Time:28112.0min: [2024-01-03 07:22:41,278][model8_pretrain.py][INFO] Epoch:[0/2](130500/4588595) loss:3.075 lr:0.0000100 epoch_Time:28112.0min: [2024-01-03 07:22:41,278][model8_pretrain.py][INFO] Epoch:[0/2](130500/4588595) loss:3.250 lr:0.0000100 epoch_Time:28112.0min: [2024-01-03 07:22:41,278][model8_pretrain.py][INFO] Epoch:[0/2](130500/4588595) loss:3.084 lr:0.0000100 epoch_Time:28112.0min: [2024-01-03 07:22:41,278][model8_pretrain.py][INFO] Epoch:[0/2](130500/4588595) loss:3.364 lr:0.0000100 epoch_Time:28112.0min: [2024-01-03 07:23:18,223][model8_pretrain.py][INFO] Epoch:[0/2](130600/4588595) loss:3.445 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:23:18,223][model8_pretrain.py][INFO] Epoch:[0/2](130600/4588595) loss:3.419 lr:0.0000100 
epoch_Time:28110.0min: [2024-01-03 07:23:18,223][model8_pretrain.py][INFO] Epoch:[0/2](130600/4588595) loss:2.957 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:23:18,223][model8_pretrain.py][INFO] Epoch:[0/2](130600/4588595) loss:3.104 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:23:18,223][model8_pretrain.py][INFO] Epoch:[0/2](130600/4588595) loss:2.493 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:23:18,223][model8_pretrain.py][INFO] Epoch:[0/2](130600/4588595) loss:2.984 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:23:18,224][model8_pretrain.py][INFO] Epoch:[0/2](130600/4588595) loss:2.941 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:23:18,224][model8_pretrain.py][INFO] Epoch:[0/2](130600/4588595) loss:3.134 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:24:03,890][model8_pretrain.py][INFO] Epoch:[0/2](130700/4588595) loss:2.937 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:24:03,890][model8_pretrain.py][INFO] Epoch:[0/2](130700/4588595) loss:3.311 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:24:03,890][model8_pretrain.py][INFO] Epoch:[0/2](130700/4588595) loss:3.321 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:24:03,890][model8_pretrain.py][INFO] Epoch:[0/2](130700/4588595) loss:3.378 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:24:03,890][model8_pretrain.py][INFO] Epoch:[0/2](130700/4588595) loss:2.876 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:24:03,890][model8_pretrain.py][INFO] Epoch:[0/2](130700/4588595) loss:2.820 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:24:03,890][model8_pretrain.py][INFO] Epoch:[0/2](130700/4588595) loss:2.670 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:24:03,891][model8_pretrain.py][INFO] Epoch:[0/2](130700/4588595) loss:2.844 lr:0.0000100 epoch_Time:28114.0min: [2024-01-03 07:24:40,813][model8_pretrain.py][INFO] Epoch:[0/2](130800/4588595) loss:2.942 lr:0.0000100 epoch_Time:28113.0min: [2024-01-03 07:24:40,814][model8_pretrain.py][INFO] 
Epoch:[0/2](130800/4588595) loss:3.257 lr:0.0000100 epoch_Time:28113.0min: [2024-01-03 07:24:40,814][model8_pretrain.py][INFO] Epoch:[0/2](130800/4588595) loss:3.265 lr:0.0000100 epoch_Time:28113.0min: [2024-01-03 07:24:40,813][model8_pretrain.py][INFO] Epoch:[0/2](130800/4588595) loss:3.005 lr:0.0000100 epoch_Time:28113.0min: [2024-01-03 07:24:40,814][model8_pretrain.py][INFO] Epoch:[0/2](130800/4588595) loss:2.381 lr:0.0000100 epoch_Time:28113.0min: [2024-01-03 07:24:40,814][model8_pretrain.py][INFO] Epoch:[0/2](130800/4588595) loss:3.007 lr:0.0000100 epoch_Time:28113.0min: [2024-01-03 07:24:40,814][model8_pretrain.py][INFO] Epoch:[0/2](130800/4588595) loss:2.759 lr:0.0000100 epoch_Time:28113.0min: [2024-01-03 07:24:40,814][model8_pretrain.py][INFO] Epoch:[0/2](130800/4588595) loss:2.202 lr:0.0000100 epoch_Time:28113.0min: [2024-01-03 07:25:17,749][model8_pretrain.py][INFO] Epoch:[0/2](130900/4588595) loss:2.612 lr:0.0000100 epoch_Time:28112.0min: [2024-01-03 07:25:17,749][model8_pretrain.py][INFO] Epoch:[0/2](130900/4588595) loss:3.264 lr:0.0000100 epoch_Time:28112.0min: [2024-01-03 07:25:17,749][model8_pretrain.py][INFO] Epoch:[0/2](130900/4588595) loss:3.381 lr:0.0000100 epoch_Time:28112.0min: [2024-01-03 07:25:17,750][model8_pretrain.py][INFO] Epoch:[0/2](130900/4588595) loss:2.720 lr:0.0000100 epoch_Time:28112.0min: [2024-01-03 07:25:17,750][model8_pretrain.py][INFO] Epoch:[0/2](130900/4588595) loss:2.862 lr:0.0000100 epoch_Time:28112.0min: [2024-01-03 07:25:17,750][model8_pretrain.py][INFO] Epoch:[0/2](130900/4588595) loss:3.286 lr:0.0000100 epoch_Time:28112.0min: [2024-01-03 07:25:17,750][model8_pretrain.py][INFO] Epoch:[0/2](130900/4588595) loss:3.214 lr:0.0000100 epoch_Time:28112.0min: [2024-01-03 07:25:17,750][model8_pretrain.py][INFO] Epoch:[0/2](130900/4588595) loss:3.030 lr:0.0000100 epoch_Time:28112.0min: [2024-01-03 07:25:54,683][model8_pretrain.py][INFO] Epoch:[0/2](131000/4588595) loss:3.153 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 
07:25:54,683][model8_pretrain.py][INFO] Epoch:[0/2](131000/4588595) loss:3.433 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:25:54,683][model8_pretrain.py][INFO] Epoch:[0/2](131000/4588595) loss:2.520 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:25:54,683][model8_pretrain.py][INFO] Epoch:[0/2](131000/4588595) loss:3.037 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:25:54,683][model8_pretrain.py][INFO] Epoch:[0/2](131000/4588595) loss:2.395 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:25:54,683][model8_pretrain.py][INFO] Epoch:[0/2](131000/4588595) loss:3.353 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:25:54,683][model8_pretrain.py][INFO] Epoch:[0/2](131000/4588595) loss:3.091 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:25:54,684][model8_pretrain.py][INFO] Epoch:[0/2](131000/4588595) loss:2.784 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:26:31,624][model8_pretrain.py][INFO] Epoch:[0/2](131100/4588595) loss:3.034 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:26:31,624][model8_pretrain.py][INFO] Epoch:[0/2](131100/4588595) loss:2.896 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:26:31,624][model8_pretrain.py][INFO] Epoch:[0/2](131100/4588595) loss:2.718 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:26:31,624][model8_pretrain.py][INFO] Epoch:[0/2](131100/4588595) loss:2.843 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:26:31,624][model8_pretrain.py][INFO] Epoch:[0/2](131100/4588595) loss:2.619 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:26:31,624][model8_pretrain.py][INFO] Epoch:[0/2](131100/4588595) loss:3.049 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:26:31,624][model8_pretrain.py][INFO] Epoch:[0/2](131100/4588595) loss:3.016 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:26:31,624][model8_pretrain.py][INFO] Epoch:[0/2](131100/4588595) loss:2.986 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:27:08,596][model8_pretrain.py][INFO] Epoch:[0/2](131200/4588595) loss:3.045 lr:0.0000100 
epoch_Time:28108.0min: [2024-01-03 07:27:08,596][model8_pretrain.py][INFO] Epoch:[0/2](131200/4588595) loss:2.986 lr:0.0000100 epoch_Time:28108.0min: [2024-01-03 07:27:08,596][model8_pretrain.py][INFO] Epoch:[0/2](131200/4588595) loss:3.041 lr:0.0000100 epoch_Time:28108.0min: [2024-01-03 07:27:08,596][model8_pretrain.py][INFO] Epoch:[0/2](131200/4588595) loss:2.623 lr:0.0000100 epoch_Time:28108.0min: [2024-01-03 07:27:08,596][model8_pretrain.py][INFO] Epoch:[0/2](131200/4588595) loss:2.407 lr:0.0000100 epoch_Time:28108.0min: [2024-01-03 07:27:08,596][model8_pretrain.py][INFO] Epoch:[0/2](131200/4588595) loss:3.390 lr:0.0000100 epoch_Time:28108.0min: [2024-01-03 07:27:08,596][model8_pretrain.py][INFO] Epoch:[0/2](131200/4588595) loss:2.235 lr:0.0000100 epoch_Time:28108.0min: [2024-01-03 07:27:08,596][model8_pretrain.py][INFO] Epoch:[0/2](131200/4588595) loss:2.371 lr:0.0000100 epoch_Time:28108.0min: [2024-01-03 07:27:45,563][model8_pretrain.py][INFO] Epoch:[0/2](131300/4588595) loss:2.634 lr:0.0000100 epoch_Time:28108.0min: [2024-01-03 07:27:45,563][model8_pretrain.py][INFO] Epoch:[0/2](131300/4588595) loss:3.338 lr:0.0000100 epoch_Time:28108.0min: [2024-01-03 07:27:45,563][model8_pretrain.py][INFO] Epoch:[0/2](131300/4588595) loss:3.241 lr:0.0000100 epoch_Time:28108.0min: [2024-01-03 07:27:45,563][model8_pretrain.py][INFO] Epoch:[0/2](131300/4588595) loss:3.470 lr:0.0000100 epoch_Time:28108.0min: [2024-01-03 07:27:45,563][model8_pretrain.py][INFO] Epoch:[0/2](131300/4588595) loss:2.767 lr:0.0000100 epoch_Time:28108.0min: [2024-01-03 07:27:45,563][model8_pretrain.py][INFO] Epoch:[0/2](131300/4588595) loss:3.312 lr:0.0000100 epoch_Time:28108.0min: [2024-01-03 07:27:45,563][model8_pretrain.py][INFO] Epoch:[0/2](131300/4588595) loss:2.727 lr:0.0000100 epoch_Time:28108.0min: [2024-01-03 07:27:45,563][model8_pretrain.py][INFO] Epoch:[0/2](131300/4588595) loss:2.479 lr:0.0000100 epoch_Time:28108.0min: [2024-01-03 07:28:22,512][model8_pretrain.py][INFO] 
Epoch:[0/2](131400/4588595) loss:2.401 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:28:22,512][model8_pretrain.py][INFO] Epoch:[0/2](131400/4588595) loss:3.030 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:28:22,512][model8_pretrain.py][INFO] Epoch:[0/2](131400/4588595) loss:2.492 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:28:22,512][model8_pretrain.py][INFO] Epoch:[0/2](131400/4588595) loss:2.495 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:28:22,512][model8_pretrain.py][INFO] Epoch:[0/2](131400/4588595) loss:3.470 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:28:22,512][model8_pretrain.py][INFO] Epoch:[0/2](131400/4588595) loss:2.856 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:28:22,512][model8_pretrain.py][INFO] Epoch:[0/2](131400/4588595) loss:2.560 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:28:22,512][model8_pretrain.py][INFO] Epoch:[0/2](131400/4588595) loss:3.019 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:29:08,302][model8_pretrain.py][INFO] Epoch:[0/2](131500/4588595) loss:3.270 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:29:08,302][model8_pretrain.py][INFO] Epoch:[0/2](131500/4588595) loss:2.941 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:29:08,302][model8_pretrain.py][INFO] Epoch:[0/2](131500/4588595) loss:2.653 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:29:08,302][model8_pretrain.py][INFO] Epoch:[0/2](131500/4588595) loss:2.651 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:29:08,302][model8_pretrain.py][INFO] Epoch:[0/2](131500/4588595) loss:3.069 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:29:08,302][model8_pretrain.py][INFO] Epoch:[0/2](131500/4588595) loss:3.354 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:29:08,302][model8_pretrain.py][INFO] Epoch:[0/2](131500/4588595) loss:3.081 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 07:29:08,302][model8_pretrain.py][INFO] Epoch:[0/2](131500/4588595) loss:3.103 lr:0.0000100 epoch_Time:28110.0min: [2024-01-03 
07:29:45,231][model8_pretrain.py][INFO] Epoch:[0/2](131600/4588595) loss:3.123 lr:0.0000100 epoch_Time:28109.0min: [2024-01-03 07:29:45,231][model8_pretrain.py][INFO] Epoch:[0/2](131600/4588595) loss:3.314 lr:0.0000100 epoch_Time:28109.0min: [2024-01-03 07:29:45,231][model8_pretrain.py][INFO] Epoch:[0/2](131600/4588595) loss:2.796 lr:0.0000100 epoch_Time:28109.0min: [2024-01-03 07:29:45,231][model8_pretrain.py][INFO] Epoch:[0/2](131600/4588595) loss:2.715 lr:0.0000100 epoch_Time:28109.0min: [2024-01-03 07:29:45,231][model8_pretrain.py][INFO] Epoch:[0/2](131600/4588595) loss:3.151 lr:0.0000100 epoch_Time:28109.0min: [2024-01-03 07:29:45,231][model8_pretrain.py][INFO] Epoch:[0/2](131600/4588595) loss:2.944 lr:0.0000100 epoch_Time:28109.0min: [2024-01-03 07:29:45,231][model8_pretrain.py][INFO] Epoch:[0/2](131600/4588595) loss:2.912 lr:0.0000100 epoch_Time:28109.0min: [2024-01-03 07:29:45,232][model8_pretrain.py][INFO] Epoch:[0/2](131600/4588595) loss:3.020 lr:0.0000100 epoch_Time:28109.0min: [2024-01-03 07:30:22,166][model8_pretrain.py][INFO] Epoch:[0/2](131700/4588595) loss:3.666 lr:0.0000100 epoch_Time:28108.0min: [2024-01-03 07:30:22,166][model8_pretrain.py][INFO] Epoch:[0/2](131700/4588595) loss:2.978 lr:0.0000100 epoch_Time:28108.0min: [2024-01-03 07:30:22,166][model8_pretrain.py][INFO] Epoch:[0/2](131700/4588595) loss:3.058 lr:0.0000100 epoch_Time:28108.0min: [2024-01-03 07:30:22,166][model8_pretrain.py][INFO] Epoch:[0/2](131700/4588595) loss:3.272 lr:0.0000100 epoch_Time:28108.0min: [2024-01-03 07:30:22,166][model8_pretrain.py][INFO] Epoch:[0/2](131700/4588595) loss:3.114 lr:0.0000100 epoch_Time:28108.0min: [2024-01-03 07:30:22,166][model8_pretrain.py][INFO] Epoch:[0/2](131700/4588595) loss:3.119 lr:0.0000100 epoch_Time:28108.0min: [2024-01-03 07:30:22,166][model8_pretrain.py][INFO] Epoch:[0/2](131700/4588595) loss:2.387 lr:0.0000100 epoch_Time:28108.0min: [2024-01-03 07:30:22,167][model8_pretrain.py][INFO] Epoch:[0/2](131700/4588595) loss:3.473 lr:0.0000100 
epoch_Time:28108.0min: [2024-01-03 07:30:59,117][model8_pretrain.py][INFO] Epoch:[0/2](131800/4588595) loss:2.978 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:30:59,117][model8_pretrain.py][INFO] Epoch:[0/2](131800/4588595) loss:2.948 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:30:59,117][model8_pretrain.py][INFO] Epoch:[0/2](131800/4588595) loss:2.914 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:30:59,117][model8_pretrain.py][INFO] Epoch:[0/2](131800/4588595) loss:2.965 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:30:59,117][model8_pretrain.py][INFO] Epoch:[0/2](131800/4588595) loss:2.761 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:30:59,117][model8_pretrain.py][INFO] Epoch:[0/2](131800/4588595) loss:2.676 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:30:59,117][model8_pretrain.py][INFO] Epoch:[0/2](131800/4588595) loss:2.814 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:30:59,117][model8_pretrain.py][INFO] Epoch:[0/2](131800/4588595) loss:2.240 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:31:36,053][model8_pretrain.py][INFO] Epoch:[0/2](131900/4588595) loss:3.019 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:31:36,053][model8_pretrain.py][INFO] Epoch:[0/2](131900/4588595) loss:2.872 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:31:36,053][model8_pretrain.py][INFO] Epoch:[0/2](131900/4588595) loss:3.200 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:31:36,053][model8_pretrain.py][INFO] Epoch:[0/2](131900/4588595) loss:3.252 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:31:36,053][model8_pretrain.py][INFO] Epoch:[0/2](131900/4588595) loss:3.262 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:31:36,054][model8_pretrain.py][INFO] Epoch:[0/2](131900/4588595) loss:3.032 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:31:36,054][model8_pretrain.py][INFO] Epoch:[0/2](131900/4588595) loss:2.546 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:31:36,054][model8_pretrain.py][INFO] 
Epoch:[0/2](131900/4588595) loss:3.048 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:32:12,994][model8_pretrain.py][INFO] Epoch:[0/2](132000/4588595) loss:3.155 lr:0.0000100 epoch_Time:28104.0min: [2024-01-03 07:32:12,994][model8_pretrain.py][INFO] Epoch:[0/2](132000/4588595) loss:3.169 lr:0.0000100 epoch_Time:28104.0min: [2024-01-03 07:32:12,994][model8_pretrain.py][INFO] Epoch:[0/2](132000/4588595) loss:3.339 lr:0.0000100 epoch_Time:28104.0min: [2024-01-03 07:32:12,994][model8_pretrain.py][INFO] Epoch:[0/2](132000/4588595) loss:3.466 lr:0.0000100 epoch_Time:28104.0min: [2024-01-03 07:32:12,994][model8_pretrain.py][INFO] Epoch:[0/2](132000/4588595) loss:2.928 lr:0.0000100 epoch_Time:28104.0min: [2024-01-03 07:32:12,994][model8_pretrain.py][INFO] Epoch:[0/2](132000/4588595) loss:3.164 lr:0.0000100 epoch_Time:28104.0min: [2024-01-03 07:32:12,994][model8_pretrain.py][INFO] Epoch:[0/2](132000/4588595) loss:3.074 lr:0.0000100 epoch_Time:28104.0min: [2024-01-03 07:32:12,994][model8_pretrain.py][INFO] Epoch:[0/2](132000/4588595) loss:2.905 lr:0.0000100 epoch_Time:28104.0min: [2024-01-03 07:32:49,939][model8_pretrain.py][INFO] Epoch:[0/2](132100/4588595) loss:2.320 lr:0.0000100 epoch_Time:28103.0min: [2024-01-03 07:32:49,939][model8_pretrain.py][INFO] Epoch:[0/2](132100/4588595) loss:2.796 lr:0.0000100 epoch_Time:28103.0min: [2024-01-03 07:32:49,939][model8_pretrain.py][INFO] Epoch:[0/2](132100/4588595) loss:2.784 lr:0.0000100 epoch_Time:28103.0min: [2024-01-03 07:32:49,939][model8_pretrain.py][INFO] Epoch:[0/2](132100/4588595) loss:2.759 lr:0.0000100 epoch_Time:28103.0min: [2024-01-03 07:32:49,939][model8_pretrain.py][INFO] Epoch:[0/2](132100/4588595) loss:2.929 lr:0.0000100 epoch_Time:28103.0min: [2024-01-03 07:32:49,939][model8_pretrain.py][INFO] Epoch:[0/2](132100/4588595) loss:3.002 lr:0.0000100 epoch_Time:28103.0min: [2024-01-03 07:32:49,939][model8_pretrain.py][INFO] Epoch:[0/2](132100/4588595) loss:3.069 lr:0.0000100 epoch_Time:28103.0min: [2024-01-03 
07:32:49,940][model8_pretrain.py][INFO] Epoch:[0/2](132100/4588595) loss:3.131 lr:0.0000100 epoch_Time:28103.0min: [2024-01-03 07:33:26,903][model8_pretrain.py][INFO] Epoch:[0/2](132200/4588595) loss:2.754 lr:0.0000100 epoch_Time:28102.0min: [2024-01-03 07:33:26,903][model8_pretrain.py][INFO] Epoch:[0/2](132200/4588595) loss:2.474 lr:0.0000100 epoch_Time:28102.0min: [2024-01-03 07:33:26,903][model8_pretrain.py][INFO] Epoch:[0/2](132200/4588595) loss:3.247 lr:0.0000100 epoch_Time:28102.0min: [2024-01-03 07:33:26,903][model8_pretrain.py][INFO] Epoch:[0/2](132200/4588595) loss:3.243 lr:0.0000100 epoch_Time:28102.0min: [2024-01-03 07:33:26,903][model8_pretrain.py][INFO] Epoch:[0/2](132200/4588595) loss:2.774 lr:0.0000100 epoch_Time:28102.0min: [2024-01-03 07:33:26,903][model8_pretrain.py][INFO] Epoch:[0/2](132200/4588595) loss:2.657 lr:0.0000100 epoch_Time:28102.0min: [2024-01-03 07:33:26,903][model8_pretrain.py][INFO] Epoch:[0/2](132200/4588595) loss:3.089 lr:0.0000100 epoch_Time:28102.0min: [2024-01-03 07:33:26,903][model8_pretrain.py][INFO] Epoch:[0/2](132200/4588595) loss:3.212 lr:0.0000100 epoch_Time:28102.0min: [2024-01-03 07:34:12,578][model8_pretrain.py][INFO] Epoch:[0/2](132300/4588595) loss:3.073 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:34:12,578][model8_pretrain.py][INFO] Epoch:[0/2](132300/4588595) loss:3.001 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:34:12,578][model8_pretrain.py][INFO] Epoch:[0/2](132300/4588595) loss:2.615 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:34:12,578][model8_pretrain.py][INFO] Epoch:[0/2](132300/4588595) loss:2.793 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:34:12,578][model8_pretrain.py][INFO] Epoch:[0/2](132300/4588595) loss:2.364 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:34:12,578][model8_pretrain.py][INFO] Epoch:[0/2](132300/4588595) loss:3.031 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:34:12,578][model8_pretrain.py][INFO] Epoch:[0/2](132300/4588595) loss:2.863 lr:0.0000100 
epoch_Time:28106.0min: [2024-01-03 07:34:12,578][model8_pretrain.py][INFO] Epoch:[0/2](132300/4588595) loss:2.489 lr:0.0000100 epoch_Time:28106.0min: [2024-01-03 07:34:49,526][model8_pretrain.py][INFO] Epoch:[0/2](132400/4588595) loss:2.866 lr:0.0000100 epoch_Time:28104.0min: [2024-01-03 07:34:49,526][model8_pretrain.py][INFO] Epoch:[0/2](132400/4588595) loss:2.578 lr:0.0000100 epoch_Time:28104.0min: [2024-01-03 07:34:49,526][model8_pretrain.py][INFO] Epoch:[0/2](132400/4588595) loss:2.801 lr:0.0000100 epoch_Time:28104.0min: [2024-01-03 07:34:49,526][model8_pretrain.py][INFO] Epoch:[0/2](132400/4588595) loss:2.969 lr:0.0000100 epoch_Time:28104.0min: [2024-01-03 07:34:49,526][model8_pretrain.py][INFO] Epoch:[0/2](132400/4588595) loss:2.440 lr:0.0000100 epoch_Time:28104.0min: [2024-01-03 07:34:49,526][model8_pretrain.py][INFO] Epoch:[0/2](132400/4588595) loss:2.931 lr:0.0000100 epoch_Time:28104.0min: [2024-01-03 07:34:49,526][model8_pretrain.py][INFO] Epoch:[0/2](132400/4588595) loss:3.348 lr:0.0000100 epoch_Time:28104.0min: [2024-01-03 07:34:49,526][model8_pretrain.py][INFO] Epoch:[0/2](132400/4588595) loss:3.528 lr:0.0000100 epoch_Time:28104.0min: [2024-01-03 07:35:26,478][model8_pretrain.py][INFO] Epoch:[0/2](132500/4588595) loss:3.290 lr:0.0000100 epoch_Time:28104.0min: [2024-01-03 07:35:26,478][model8_pretrain.py][INFO] Epoch:[0/2](132500/4588595) loss:3.353 lr:0.0000100 epoch_Time:28104.0min: [2024-01-03 07:35:26,478][model8_pretrain.py][INFO] Epoch:[0/2](132500/4588595) loss:2.951 lr:0.0000100 epoch_Time:28104.0min: [2024-01-03 07:35:26,478][model8_pretrain.py][INFO] Epoch:[0/2](132500/4588595) loss:2.709 lr:0.0000100 epoch_Time:28104.0min: [2024-01-03 07:35:26,478][model8_pretrain.py][INFO] Epoch:[0/2](132500/4588595) loss:2.936 lr:0.0000100 epoch_Time:28104.0min: [2024-01-03 07:35:26,478][model8_pretrain.py][INFO] Epoch:[0/2](132500/4588595) loss:3.310 lr:0.0000100 epoch_Time:28104.0min: [2024-01-03 07:35:26,478][model8_pretrain.py][INFO] 
Epoch:[0/2](132500/4588595) loss:3.049 lr:0.0000100 epoch_Time:28104.0min: [2024-01-03 07:35:26,478][model8_pretrain.py][INFO] Epoch:[0/2](132500/4588595) loss:3.022 lr:0.0000100 epoch_Time:28104.0min: [2024-01-03 07:36:03,423][model8_pretrain.py][INFO] Epoch:[0/2](132600/4588595) loss:3.103 lr:0.0000100 epoch_Time:28102.0min: [2024-01-03 07:36:03,423][model8_pretrain.py][INFO] Epoch:[0/2](132600/4588595) loss:2.555 lr:0.0000100 epoch_Time:28102.0min: [2024-01-03 07:36:03,423][model8_pretrain.py][INFO] Epoch:[0/2](132600/4588595) loss:2.871 lr:0.0000100 epoch_Time:28102.0min: [2024-01-03 07:36:03,423][model8_pretrain.py][INFO] Epoch:[0/2](132600/4588595) loss:2.729 lr:0.0000100 epoch_Time:28102.0min: [2024-01-03 07:36:03,423][model8_pretrain.py][INFO] Epoch:[0/2](132600/4588595) loss:3.174 lr:0.0000100 epoch_Time:28102.0min: [2024-01-03 07:36:03,423][model8_pretrain.py][INFO] Epoch:[0/2](132600/4588595) loss:3.261 lr:0.0000100 epoch_Time:28102.0min: [2024-01-03 07:36:03,423][model8_pretrain.py][INFO] Epoch:[0/2](132600/4588595) loss:3.130 lr:0.0000100 epoch_Time:28102.0min: [2024-01-03 07:36:03,423][model8_pretrain.py][INFO] Epoch:[0/2](132600/4588595) loss:2.847 lr:0.0000100 epoch_Time:28102.0min: [2024-01-03 07:36:40,363][model8_pretrain.py][INFO] Epoch:[0/2](132700/4588595) loss:3.747 lr:0.0000100 epoch_Time:28102.0min: [2024-01-03 07:36:40,363][model8_pretrain.py][INFO] Epoch:[0/2](132700/4588595) loss:3.211 lr:0.0000100 epoch_Time:28101.0min: [2024-01-03 07:36:40,363][model8_pretrain.py][INFO] Epoch:[0/2](132700/4588595) loss:3.157 lr:0.0000100 epoch_Time:28101.0min: [2024-01-03 07:36:40,363][model8_pretrain.py][INFO] Epoch:[0/2](132700/4588595) loss:3.367 lr:0.0000100 epoch_Time:28101.0min: [2024-01-03 07:36:40,363][model8_pretrain.py][INFO] Epoch:[0/2](132700/4588595) loss:3.001 lr:0.0000100 epoch_Time:28102.0min: [2024-01-03 07:36:40,363][model8_pretrain.py][INFO] Epoch:[0/2](132700/4588595) loss:2.777 lr:0.0000100 epoch_Time:28101.0min: [2024-01-03 
07:36:40,363][model8_pretrain.py][INFO] Epoch:[0/2](132700/4588595) loss:2.840 lr:0.0000100 epoch_Time:28101.0min: [2024-01-03 07:36:40,363][model8_pretrain.py][INFO] Epoch:[0/2](132700/4588595) loss:3.167 lr:0.0000100 epoch_Time:28101.0min: [2024-01-03 07:37:17,307][model8_pretrain.py][INFO] Epoch:[0/2](132800/4588595) loss:3.166 lr:0.0000100 epoch_Time:28100.0min: [2024-01-03 07:37:17,307][model8_pretrain.py][INFO] Epoch:[0/2](132800/4588595) loss:3.179 lr:0.0000100 epoch_Time:28100.0min: [2024-01-03 07:37:17,307][model8_pretrain.py][INFO] Epoch:[0/2](132800/4588595) loss:3.273 lr:0.0000100 epoch_Time:28100.0min: [2024-01-03 07:37:17,307][model8_pretrain.py][INFO] Epoch:[0/2](132800/4588595) loss:3.438 lr:0.0000100 epoch_Time:28100.0min: [2024-01-03 07:37:17,307][model8_pretrain.py][INFO] Epoch:[0/2](132800/4588595) loss:2.380 lr:0.0000100 epoch_Time:28100.0min: [2024-01-03 07:37:17,307][model8_pretrain.py][INFO] Epoch:[0/2](132800/4588595) loss:3.092 lr:0.0000100 epoch_Time:28100.0min: [2024-01-03 07:37:17,307][model8_pretrain.py][INFO] Epoch:[0/2](132800/4588595) loss:2.744 lr:0.0000100 epoch_Time:28100.0min: [2024-01-03 07:37:17,307][model8_pretrain.py][INFO] Epoch:[0/2](132800/4588595) loss:3.054 lr:0.0000100 epoch_Time:28100.0min: [2024-01-03 07:37:54,249][model8_pretrain.py][INFO] Epoch:[0/2](132900/4588595) loss:3.094 lr:0.0000100 epoch_Time:28098.0min: [2024-01-03 07:37:54,249][model8_pretrain.py][INFO] Epoch:[0/2](132900/4588595) loss:2.711 lr:0.0000100 epoch_Time:28098.0min: [2024-01-03 07:37:54,249][model8_pretrain.py][INFO] Epoch:[0/2](132900/4588595) loss:3.033 lr:0.0000100 epoch_Time:28098.0min: [2024-01-03 07:37:54,249][model8_pretrain.py][INFO] Epoch:[0/2](132900/4588595) loss:2.822 lr:0.0000100 epoch_Time:28098.0min: [2024-01-03 07:37:54,249][model8_pretrain.py][INFO] Epoch:[0/2](132900/4588595) loss:2.731 lr:0.0000100 epoch_Time:28098.0min: [2024-01-03 07:37:54,249][model8_pretrain.py][INFO] Epoch:[0/2](132900/4588595) loss:2.827 lr:0.0000100 
epoch_Time:28098.0min: [2024-01-03 07:37:54,249][model8_pretrain.py][INFO] Epoch:[0/2](132900/4588595) loss:3.232 lr:0.0000100 epoch_Time:28098.0min: [2024-01-03 07:37:54,249][model8_pretrain.py][INFO] Epoch:[0/2](132900/4588595) loss:2.331 lr:0.0000100 epoch_Time:28098.0min: [2024-01-03 07:38:31,187][model8_pretrain.py][INFO] Epoch:[0/2](133000/4588595) loss:3.059 lr:0.0000100 epoch_Time:28098.0min: [2024-01-03 07:38:31,187][model8_pretrain.py][INFO] Epoch:[0/2](133000/4588595) loss:2.992 lr:0.0000100 epoch_Time:28098.0min: [2024-01-03 07:38:31,187][model8_pretrain.py][INFO] Epoch:[0/2](133000/4588595) loss:3.254 lr:0.0000100 epoch_Time:28098.0min: [2024-01-03 07:38:31,187][model8_pretrain.py][INFO] Epoch:[0/2](133000/4588595) loss:3.345 lr:0.0000100 epoch_Time:28098.0min: [2024-01-03 07:38:31,187][model8_pretrain.py][INFO] Epoch:[0/2](133000/4588595) loss:2.883 lr:0.0000100 epoch_Time:28098.0min: [2024-01-03 07:38:31,187][model8_pretrain.py][INFO] Epoch:[0/2](133000/4588595) loss:2.727 lr:0.0000100 epoch_Time:28098.0min: [2024-01-03 07:38:31,187][model8_pretrain.py][INFO] Epoch:[0/2](133000/4588595) loss:3.268 lr:0.0000100 epoch_Time:28098.0min: [2024-01-03 07:38:31,187][model8_pretrain.py][INFO] Epoch:[0/2](133000/4588595) loss:2.765 lr:0.0000100 epoch_Time:28098.0min: [2024-01-03 07:39:16,938][model8_pretrain.py][INFO] Epoch:[0/2](133100/4588595) loss:2.829 lr:0.0000100 epoch_Time:28102.0min: [2024-01-03 07:39:16,938][model8_pretrain.py][INFO] Epoch:[0/2](133100/4588595) loss:3.225 lr:0.0000100 epoch_Time:28101.0min: [2024-01-03 07:39:16,938][model8_pretrain.py][INFO] Epoch:[0/2](133100/4588595) loss:2.987 lr:0.0000100 epoch_Time:28101.0min: [2024-01-03 07:39:16,938][model8_pretrain.py][INFO] Epoch:[0/2](133100/4588595) loss:2.417 lr:0.0000100 epoch_Time:28101.0min: [2024-01-03 07:39:16,938][model8_pretrain.py][INFO] Epoch:[0/2](133100/4588595) loss:2.990 lr:0.0000100 epoch_Time:28101.0min: [2024-01-03 07:39:16,938][model8_pretrain.py][INFO] 
Epoch:[0/2](133100/4588595) loss:2.627 lr:0.0000100 epoch_Time:28101.0min: [2024-01-03 07:39:16,938][model8_pretrain.py][INFO] Epoch:[0/2](133100/4588595) loss:3.091 lr:0.0000100 epoch_Time:28102.0min: [2024-01-03 07:39:16,938][model8_pretrain.py][INFO] Epoch:[0/2](133100/4588595) loss:3.034 lr:0.0000100 epoch_Time:28101.0min: [2024-01-03 07:39:53,874][model8_pretrain.py][INFO] Epoch:[0/2](133200/4588595) loss:2.869 lr:0.0000100 epoch_Time:28100.0min: [2024-01-03 07:39:53,873][model8_pretrain.py][INFO] Epoch:[0/2](133200/4588595) loss:3.427 lr:0.0000100 epoch_Time:28100.0min: [2024-01-03 07:39:53,873][model8_pretrain.py][INFO] Epoch:[0/2](133200/4588595) loss:3.184 lr:0.0000100 epoch_Time:28100.0min: [2024-01-03 07:39:53,873][model8_pretrain.py][INFO] Epoch:[0/2](133200/4588595) loss:2.866 lr:0.0000100 epoch_Time:28100.0min: [2024-01-03 07:39:53,874][model8_pretrain.py][INFO] Epoch:[0/2](133200/4588595) loss:2.895 lr:0.0000100 epoch_Time:28100.0min: [2024-01-03 07:39:53,874][model8_pretrain.py][INFO] Epoch:[0/2](133200/4588595) loss:3.086 lr:0.0000100 epoch_Time:28100.0min: [2024-01-03 07:39:53,874][model8_pretrain.py][INFO] Epoch:[0/2](133200/4588595) loss:3.278 lr:0.0000100 epoch_Time:28100.0min: [2024-01-03 07:39:53,874][model8_pretrain.py][INFO] Epoch:[0/2](133200/4588595) loss:3.356 lr:0.0000100 epoch_Time:28100.0min: [2024-01-03 07:40:30,810][model8_pretrain.py][INFO] Epoch:[0/2](133300/4588595) loss:2.383 lr:0.0000100 epoch_Time:28099.0min: [2024-01-03 07:40:30,810][model8_pretrain.py][INFO] Epoch:[0/2](133300/4588595) loss:2.942 lr:0.0000100 epoch_Time:28099.0min: [2024-01-03 07:40:30,810][model8_pretrain.py][INFO] Epoch:[0/2](133300/4588595) loss:3.014 lr:0.0000100 epoch_Time:28099.0min: [2024-01-03 07:40:30,810][model8_pretrain.py][INFO] Epoch:[0/2](133300/4588595) loss:3.167 lr:0.0000100 epoch_Time:28099.0min: [2024-01-03 07:40:30,810][model8_pretrain.py][INFO] Epoch:[0/2](133300/4588595) loss:3.275 lr:0.0000100 epoch_Time:28099.0min: [2024-01-03 
07:40:30,810][model8_pretrain.py][INFO] Epoch:[0/2](133300/4588595) loss:2.794 lr:0.0000100 epoch_Time:28099.0min: [2024-01-03 07:40:30,811][model8_pretrain.py][INFO] Epoch:[0/2](133300/4588595) loss:3.056 lr:0.0000100 epoch_Time:28099.0min: [2024-01-03 07:40:30,812][model8_pretrain.py][INFO] Epoch:[0/2](133300/4588595) loss:2.567 lr:0.0000100 epoch_Time:28099.0min: [2024-01-03 07:41:07,742][model8_pretrain.py][INFO] Epoch:[0/2](133400/4588595) loss:3.062 lr:0.0000100 epoch_Time:28098.0min: [2024-01-03 07:41:07,742][model8_pretrain.py][INFO] Epoch:[0/2](133400/4588595) loss:2.933 lr:0.0000100 epoch_Time:28098.0min: [2024-01-03 07:41:07,742][model8_pretrain.py][INFO] Epoch:[0/2](133400/4588595) loss:3.052 lr:0.0000100 epoch_Time:28098.0min: [2024-01-03 07:41:07,742][model8_pretrain.py][INFO] Epoch:[0/2](133400/4588595) loss:2.832 lr:0.0000100 epoch_Time:28098.0min: [2024-01-03 07:41:07,742][model8_pretrain.py][INFO] Epoch:[0/2](133400/4588595) loss:3.120 lr:0.0000100 epoch_Time:28098.0min: [2024-01-03 07:41:07,742][model8_pretrain.py][INFO] Epoch:[0/2](133400/4588595) loss:3.041 lr:0.0000100 epoch_Time:28098.0min: [2024-01-03 07:41:07,742][model8_pretrain.py][INFO] Epoch:[0/2](133400/4588595) loss:3.280 lr:0.0000100 epoch_Time:28098.0min: [2024-01-03 07:41:07,742][model8_pretrain.py][INFO] Epoch:[0/2](133400/4588595) loss:2.648 lr:0.0000100 epoch_Time:28098.0min: [2024-01-03 07:41:44,688][model8_pretrain.py][INFO] Epoch:[0/2](133500/4588595) loss:3.250 lr:0.0000100 epoch_Time:28097.0min: [2024-01-03 07:41:44,688][model8_pretrain.py][INFO] Epoch:[0/2](133500/4588595) loss:2.494 lr:0.0000100 epoch_Time:28097.0min: [2024-01-03 07:41:44,688][model8_pretrain.py][INFO] Epoch:[0/2](133500/4588595) loss:2.587 lr:0.0000100 epoch_Time:28097.0min: [2024-01-03 07:41:44,688][model8_pretrain.py][INFO] Epoch:[0/2](133500/4588595) loss:3.143 lr:0.0000100 epoch_Time:28097.0min: [2024-01-03 07:41:44,688][model8_pretrain.py][INFO] Epoch:[0/2](133500/4588595) loss:3.108 lr:0.0000100 
epoch_Time:28097.0min: [2024-01-03 07:41:44,688][model8_pretrain.py][INFO] Epoch:[0/2](133500/4588595) loss:3.165 lr:0.0000100 epoch_Time:28097.0min: [2024-01-03 07:41:44,688][model8_pretrain.py][INFO] Epoch:[0/2](133500/4588595) loss:2.411 lr:0.0000100 epoch_Time:28097.0min: [2024-01-03 07:41:44,688][model8_pretrain.py][INFO] Epoch:[0/2](133500/4588595) loss:3.195 lr:0.0000100 epoch_Time:28097.0min: [2024-01-03 07:42:21,651][model8_pretrain.py][INFO] Epoch:[0/2](133600/4588595) loss:2.962 lr:0.0000100 epoch_Time:28096.0min: [2024-01-03 07:42:21,651][model8_pretrain.py][INFO] Epoch:[0/2](133600/4588595) loss:2.809 lr:0.0000100 epoch_Time:28096.0min: [2024-01-03 07:42:21,651][model8_pretrain.py][INFO] Epoch:[0/2](133600/4588595) loss:3.135 lr:0.0000100 epoch_Time:28096.0min: [2024-01-03 07:42:21,651][model8_pretrain.py][INFO] Epoch:[0/2](133600/4588595) loss:2.765 lr:0.0000100 epoch_Time:28096.0min: [2024-01-03 07:42:21,651][model8_pretrain.py][INFO] Epoch:[0/2](133600/4588595) loss:3.136 lr:0.0000100 epoch_Time:28096.0min: [2024-01-03 07:42:21,651][model8_pretrain.py][INFO] Epoch:[0/2](133600/4588595) loss:3.193 lr:0.0000100 epoch_Time:28096.0min: [2024-01-03 07:42:21,651][model8_pretrain.py][INFO] Epoch:[0/2](133600/4588595) loss:2.908 lr:0.0000100 epoch_Time:28096.0min: [2024-01-03 07:42:21,651][model8_pretrain.py][INFO] Epoch:[0/2](133600/4588595) loss:3.381 lr:0.0000100 epoch_Time:28096.0min: [2024-01-03 07:42:58,593][model8_pretrain.py][INFO] Epoch:[0/2](133700/4588595) loss:3.426 lr:0.0000100 epoch_Time:28094.0min: [2024-01-03 07:42:58,593][model8_pretrain.py][INFO] Epoch:[0/2](133700/4588595) loss:3.368 lr:0.0000100 epoch_Time:28094.0min: [2024-01-03 07:42:58,593][model8_pretrain.py][INFO] Epoch:[0/2](133700/4588595) loss:2.486 lr:0.0000100 epoch_Time:28094.0min: [2024-01-03 07:42:58,593][model8_pretrain.py][INFO] Epoch:[0/2](133700/4588595) loss:2.926 lr:0.0000100 epoch_Time:28094.0min: [2024-01-03 07:42:58,593][model8_pretrain.py][INFO] 
Epoch:[0/2](133700/4588595) loss:3.275 lr:0.0000100 epoch_Time:28094.0min: [2024-01-03 07:42:58,593][model8_pretrain.py][INFO] Epoch:[0/2](133700/4588595) loss:3.084 lr:0.0000100 epoch_Time:28094.0min: [2024-01-03 07:42:58,593][model8_pretrain.py][INFO] Epoch:[0/2](133700/4588595) loss:2.907 lr:0.0000100 epoch_Time:28094.0min: [2024-01-03 07:42:58,593][model8_pretrain.py][INFO] Epoch:[0/2](133700/4588595) loss:2.566 lr:0.0000100 epoch_Time:28094.0min: [2024-01-03 07:43:35,546][model8_pretrain.py][INFO] Epoch:[0/2](133800/4588595) loss:2.946 lr:0.0000100 epoch_Time:28094.0min: [2024-01-03 07:43:35,546][model8_pretrain.py][INFO] Epoch:[0/2](133800/4588595) loss:2.779 lr:0.0000100 epoch_Time:28094.0min: [2024-01-03 07:43:35,546][model8_pretrain.py][INFO] Epoch:[0/2](133800/4588595) loss:2.920 lr:0.0000100 epoch_Time:28094.0min: [2024-01-03 07:43:35,546][model8_pretrain.py][INFO] Epoch:[0/2](133800/4588595) loss:2.713 lr:0.0000100 epoch_Time:28094.0min: [2024-01-03 07:43:35,546][model8_pretrain.py][INFO] Epoch:[0/2](133800/4588595) loss:3.057 lr:0.0000100 epoch_Time:28094.0min: [2024-01-03 07:43:35,546][model8_pretrain.py][INFO] Epoch:[0/2](133800/4588595) loss:3.095 lr:0.0000100 epoch_Time:28094.0min: [2024-01-03 07:43:35,546][model8_pretrain.py][INFO] Epoch:[0/2](133800/4588595) loss:3.167 lr:0.0000100 epoch_Time:28094.0min: [2024-01-03 07:43:35,546][model8_pretrain.py][INFO] Epoch:[0/2](133800/4588595) loss:3.131 lr:0.0000100 epoch_Time:28094.0min: [2024-01-03 07:44:21,296][model8_pretrain.py][INFO] Epoch:[0/2](133900/4588595) loss:3.051 lr:0.0000100 epoch_Time:28097.0min: [2024-01-03 07:44:21,296][model8_pretrain.py][INFO] Epoch:[0/2](133900/4588595) loss:3.169 lr:0.0000100 epoch_Time:28097.0min: [2024-01-03 07:44:21,296][model8_pretrain.py][INFO] Epoch:[0/2](133900/4588595) loss:2.948 lr:0.0000100 epoch_Time:28097.0min: [2024-01-03 07:44:21,296][model8_pretrain.py][INFO] Epoch:[0/2](133900/4588595) loss:2.610 lr:0.0000100 epoch_Time:28097.0min: [2024-01-03 
07:44:21,296][model8_pretrain.py][INFO] Epoch:[0/2](133900/4588595) loss:2.889 lr:0.0000100 epoch_Time:28097.0min: [2024-01-03 07:44:21,296][model8_pretrain.py][INFO] Epoch:[0/2](133900/4588595) loss:2.643 lr:0.0000100 epoch_Time:28097.0min: [2024-01-03 07:44:21,296][model8_pretrain.py][INFO] Epoch:[0/2](133900/4588595) loss:2.651 lr:0.0000100 epoch_Time:28097.0min: [2024-01-03 07:44:21,296][model8_pretrain.py][INFO] Epoch:[0/2](133900/4588595) loss:2.715 lr:0.0000100 epoch_Time:28097.0min: [2024-01-03 07:44:58,244][model8_pretrain.py][INFO] Epoch:[0/2](134000/4588595) loss:3.266 lr:0.0000100 epoch_Time:28096.0min: [2024-01-03 07:44:58,244][model8_pretrain.py][INFO] Epoch:[0/2](134000/4588595) loss:2.760 lr:0.0000100 epoch_Time:28096.0min: [2024-01-03 07:44:58,245][model8_pretrain.py][INFO] Epoch:[0/2](134000/4588595) loss:3.373 lr:0.0000100 epoch_Time:28096.0min: [2024-01-03 07:44:58,245][model8_pretrain.py][INFO] Epoch:[0/2](134000/4588595) loss:2.689 lr:0.0000100 epoch_Time:28096.0min: [2024-01-03 07:44:58,245][model8_pretrain.py][INFO] Epoch:[0/2](134000/4588595) loss:2.603 lr:0.0000100 epoch_Time:28096.0min: [2024-01-03 07:44:58,245][model8_pretrain.py][INFO] Epoch:[0/2](134000/4588595) loss:3.470 lr:0.0000100 epoch_Time:28096.0min: [2024-01-03 07:44:58,245][model8_pretrain.py][INFO] Epoch:[0/2](134000/4588595) loss:3.038 lr:0.0000100 epoch_Time:28096.0min: [2024-01-03 07:44:58,245][model8_pretrain.py][INFO] Epoch:[0/2](134000/4588595) loss:3.428 lr:0.0000100 epoch_Time:28096.0min: [2024-01-03 07:45:35,185][model8_pretrain.py][INFO] Epoch:[0/2](134100/4588595) loss:2.583 lr:0.0000100 epoch_Time:28095.0min: [2024-01-03 07:45:35,185][model8_pretrain.py][INFO] Epoch:[0/2](134100/4588595) loss:2.721 lr:0.0000100 epoch_Time:28095.0min: [2024-01-03 07:45:35,185][model8_pretrain.py][INFO] Epoch:[0/2](134100/4588595) loss:3.240 lr:0.0000100 epoch_Time:28095.0min: [2024-01-03 07:45:35,185][model8_pretrain.py][INFO] Epoch:[0/2](134100/4588595) loss:3.098 lr:0.0000100 
epoch_Time:28095.0min: [2024-01-03 07:45:35,185][model8_pretrain.py][INFO] Epoch:[0/2](134100/4588595) loss:3.081 lr:0.0000100 epoch_Time:28095.0min: [2024-01-03 07:45:35,185][model8_pretrain.py][INFO] Epoch:[0/2](134100/4588595) loss:2.894 lr:0.0000100 epoch_Time:28095.0min: [2024-01-03 07:45:35,185][model8_pretrain.py][INFO] Epoch:[0/2](134100/4588595) loss:3.220 lr:0.0000100 epoch_Time:28095.0min: [2024-01-03 07:45:35,185][model8_pretrain.py][INFO] Epoch:[0/2](134100/4588595) loss:2.728 lr:0.0000100 epoch_Time:28095.0min: [2024-01-03 07:46:12,103][model8_pretrain.py][INFO] Epoch:[0/2](134200/4588595) loss:3.147 lr:0.0000100 epoch_Time:28094.0min: [2024-01-03 07:46:12,103][model8_pretrain.py][INFO] Epoch:[0/2](134200/4588595) loss:2.860 lr:0.0000100 epoch_Time:28094.0min: [2024-01-03 07:46:12,103][model8_pretrain.py][INFO] Epoch:[0/2](134200/4588595) loss:2.918 lr:0.0000100 epoch_Time:28094.0min: [2024-01-03 07:46:12,103][model8_pretrain.py][INFO] Epoch:[0/2](134200/4588595) loss:3.531 lr:0.0000100 epoch_Time:28094.0min: [2024-01-03 07:46:12,103][model8_pretrain.py][INFO] Epoch:[0/2](134200/4588595) loss:2.820 lr:0.0000100 epoch_Time:28094.0min: [2024-01-03 07:46:12,103][model8_pretrain.py][INFO] Epoch:[0/2](134200/4588595) loss:3.103 lr:0.0000100 epoch_Time:28094.0min: [2024-01-03 07:46:12,103][model8_pretrain.py][INFO] Epoch:[0/2](134200/4588595) loss:2.927 lr:0.0000100 epoch_Time:28094.0min: [2024-01-03 07:46:12,103][model8_pretrain.py][INFO] Epoch:[0/2](134200/4588595) loss:2.362 lr:0.0000100 epoch_Time:28094.0min: [2024-01-03 07:46:49,006][model8_pretrain.py][INFO] Epoch:[0/2](134300/4588595) loss:2.771 lr:0.0000100 epoch_Time:28092.0min: [2024-01-03 07:46:49,006][model8_pretrain.py][INFO] Epoch:[0/2](134300/4588595) loss:3.439 lr:0.0000100 epoch_Time:28092.0min: [2024-01-03 07:46:49,006][model8_pretrain.py][INFO] Epoch:[0/2](134300/4588595) loss:2.445 lr:0.0000100 epoch_Time:28092.0min: [2024-01-03 07:46:49,007][model8_pretrain.py][INFO] 
Epoch:[0/2](134300/4588595) loss:3.129 lr:0.0000100 epoch_Time:28092.0min: [2024-01-03 07:46:49,007][model8_pretrain.py][INFO] Epoch:[0/2](134300/4588595) loss:2.683 lr:0.0000100 epoch_Time:28092.0min: [2024-01-03 07:46:49,007][model8_pretrain.py][INFO] Epoch:[0/2](134300/4588595) loss:2.725 lr:0.0000100 epoch_Time:28092.0min: [2024-01-03 07:46:49,007][model8_pretrain.py][INFO] Epoch:[0/2](134300/4588595) loss:3.356 lr:0.0000100 epoch_Time:28092.0min: [2024-01-03 07:46:49,007][model8_pretrain.py][INFO] Epoch:[0/2](134300/4588595) loss:3.056 lr:0.0000100 epoch_Time:28092.0min: [2024-01-03 07:47:25,941][model8_pretrain.py][INFO] Epoch:[0/2](134400/4588595) loss:3.022 lr:0.0000100 epoch_Time:28092.0min: [2024-01-03 07:47:25,941][model8_pretrain.py][INFO] Epoch:[0/2](134400/4588595) loss:2.904 lr:0.0000100 epoch_Time:28092.0min: [2024-01-03 07:47:25,941][model8_pretrain.py][INFO] Epoch:[0/2](134400/4588595) loss:3.576 lr:0.0000100 epoch_Time:28092.0min: [2024-01-03 07:47:25,941][model8_pretrain.py][INFO] Epoch:[0/2](134400/4588595) loss:3.406 lr:0.0000100 epoch_Time:28092.0min: [2024-01-03 07:47:25,941][model8_pretrain.py][INFO] Epoch:[0/2](134400/4588595) loss:3.135 lr:0.0000100 epoch_Time:28092.0min: [2024-01-03 07:47:25,941][model8_pretrain.py][INFO] Epoch:[0/2](134400/4588595) loss:3.014 lr:0.0000100 epoch_Time:28092.0min: [2024-01-03 07:47:25,941][model8_pretrain.py][INFO] Epoch:[0/2](134400/4588595) loss:2.680 lr:0.0000100 epoch_Time:28092.0min: [2024-01-03 07:47:25,941][model8_pretrain.py][INFO] Epoch:[0/2](134400/4588595) loss:3.635 lr:0.0000100 epoch_Time:28092.0min: [2024-01-03 07:48:02,869][model8_pretrain.py][INFO] Epoch:[0/2](134500/4588595) loss:3.674 lr:0.0000100 epoch_Time:28090.0min: [2024-01-03 07:48:02,870][model8_pretrain.py][INFO] Epoch:[0/2](134500/4588595) loss:2.565 lr:0.0000100 epoch_Time:28090.0min: [2024-01-03 07:48:02,870][model8_pretrain.py][INFO] Epoch:[0/2](134500/4588595) loss:2.645 lr:0.0000100 epoch_Time:28090.0min: [2024-01-03 
07:48:02,870][model8_pretrain.py][INFO] Epoch:[0/2](134500/4588595) loss:3.486 lr:0.0000100 epoch_Time:28090.0min: [2024-01-03 07:48:02,870][model8_pretrain.py][INFO] Epoch:[0/2](134500/4588595) loss:3.462 lr:0.0000100 epoch_Time:28090.0min: [2024-01-03 07:48:02,870][model8_pretrain.py][INFO] Epoch:[0/2](134500/4588595) loss:2.980 lr:0.0000100 epoch_Time:28090.0min: [2024-01-03 07:48:02,870][model8_pretrain.py][INFO] Epoch:[0/2](134500/4588595) loss:3.042 lr:0.0000100 epoch_Time:28090.0min: [2024-01-03 07:48:02,870][model8_pretrain.py][INFO] Epoch:[0/2](134500/4588595) loss:2.474 lr:0.0000100 epoch_Time:28090.0min: [2024-01-03 07:48:39,816][model8_pretrain.py][INFO] Epoch:[0/2](134600/4588595) loss:2.904 lr:0.0000100 epoch_Time:28090.0min: [2024-01-03 07:48:39,816][model8_pretrain.py][INFO] Epoch:[0/2](134600/4588595) loss:3.108 lr:0.0000100 epoch_Time:28090.0min: [2024-01-03 07:48:39,816][model8_pretrain.py][INFO] Epoch:[0/2](134600/4588595) loss:2.613 lr:0.0000100 epoch_Time:28090.0min: [2024-01-03 07:48:39,816][model8_pretrain.py][INFO] Epoch:[0/2](134600/4588595) loss:3.458 lr:0.0000100 epoch_Time:28090.0min: [2024-01-03 07:48:39,816][model8_pretrain.py][INFO] Epoch:[0/2](134600/4588595) loss:3.228 lr:0.0000100 epoch_Time:28090.0min: [2024-01-03 07:48:39,816][model8_pretrain.py][INFO] Epoch:[0/2](134600/4588595) loss:3.026 lr:0.0000100 epoch_Time:28090.0min: [2024-01-03 07:48:39,816][model8_pretrain.py][INFO] Epoch:[0/2](134600/4588595) loss:3.222 lr:0.0000100 epoch_Time:28090.0min: [2024-01-03 07:48:39,816][model8_pretrain.py][INFO] Epoch:[0/2](134600/4588595) loss:3.398 lr:0.0000100 epoch_Time:28090.0min: [2024-01-03 07:49:25,444][model8_pretrain.py][INFO] Epoch:[0/2](134700/4588595) loss:2.848 lr:0.0000100 epoch_Time:28093.0min: [2024-01-03 07:49:25,444][model8_pretrain.py][INFO] Epoch:[0/2](134700/4588595) loss:2.967 lr:0.0000100 epoch_Time:28093.0min: [2024-01-03 07:49:25,444][model8_pretrain.py][INFO] Epoch:[0/2](134700/4588595) loss:2.942 lr:0.0000100 
epoch_Time:28093.0min: [2024-01-03 07:49:25,445][model8_pretrain.py][INFO] Epoch:[0/2](134700/4588595) loss:3.551 lr:0.0000100 epoch_Time:28093.0min: [2024-01-03 07:49:25,445][model8_pretrain.py][INFO] Epoch:[0/2](134700/4588595) loss:3.053 lr:0.0000100 epoch_Time:28093.0min: [2024-01-03 07:49:25,445][model8_pretrain.py][INFO] Epoch:[0/2](134700/4588595) loss:2.409 lr:0.0000100 epoch_Time:28093.0min: [2024-01-03 07:49:25,445][model8_pretrain.py][INFO] Epoch:[0/2](134700/4588595) loss:3.493 lr:0.0000100 epoch_Time:28093.0min: [2024-01-03 07:49:25,445][model8_pretrain.py][INFO] Epoch:[0/2](134700/4588595) loss:3.156 lr:0.0000100 epoch_Time:28093.0min: [2024-01-03 07:50:02,387][model8_pretrain.py][INFO] Epoch:[0/2](134800/4588595) loss:3.235 lr:0.0000100 epoch_Time:28092.0min: [2024-01-03 07:50:02,387][model8_pretrain.py][INFO] Epoch:[0/2](134800/4588595) loss:2.950 lr:0.0000100 epoch_Time:28092.0min: [2024-01-03 07:50:02,387][model8_pretrain.py][INFO] Epoch:[0/2](134800/4588595) loss:3.180 lr:0.0000100 epoch_Time:28092.0min: [2024-01-03 07:50:02,387][model8_pretrain.py][INFO] Epoch:[0/2](134800/4588595) loss:3.335 lr:0.0000100 epoch_Time:28092.0min: [2024-01-03 07:50:02,387][model8_pretrain.py][INFO] Epoch:[0/2](134800/4588595) loss:3.080 lr:0.0000100 epoch_Time:28092.0min: [2024-01-03 07:50:02,387][model8_pretrain.py][INFO] Epoch:[0/2](134800/4588595) loss:2.967 lr:0.0000100 epoch_Time:28092.0min: [2024-01-03 07:50:02,387][model8_pretrain.py][INFO] Epoch:[0/2](134800/4588595) loss:3.563 lr:0.0000100 epoch_Time:28092.0min: [2024-01-03 07:50:02,387][model8_pretrain.py][INFO] Epoch:[0/2](134800/4588595) loss:3.093 lr:0.0000100 epoch_Time:28092.0min: [2024-01-03 07:50:39,322][model8_pretrain.py][INFO] Epoch:[0/2](134900/4588595) loss:2.696 lr:0.0000100 epoch_Time:28091.0min: [2024-01-03 07:50:39,322][model8_pretrain.py][INFO] Epoch:[0/2](134900/4588595) loss:2.848 lr:0.0000100 epoch_Time:28091.0min: [2024-01-03 07:50:39,322][model8_pretrain.py][INFO] 
Epoch:[0/2](134900/4588595) loss:3.055 lr:0.0000100 epoch_Time:28091.0min: [2024-01-03 07:50:39,322][model8_pretrain.py][INFO] Epoch:[0/2](134900/4588595) loss:2.929 lr:0.0000100 epoch_Time:28091.0min: [2024-01-03 07:50:39,322][model8_pretrain.py][INFO] Epoch:[0/2](134900/4588595) loss:2.750 lr:0.0000100 epoch_Time:28091.0min: [2024-01-03 07:50:39,322][model8_pretrain.py][INFO] Epoch:[0/2](134900/4588595) loss:2.693 lr:0.0000100 epoch_Time:28091.0min: [2024-01-03 07:50:39,322][model8_pretrain.py][INFO] Epoch:[0/2](134900/4588595) loss:2.591 lr:0.0000100 epoch_Time:28091.0min: [2024-01-03 07:50:39,322][model8_pretrain.py][INFO] Epoch:[0/2](134900/4588595) loss:3.163 lr:0.0000100 epoch_Time:28091.0min: [2024-01-03 07:51:16,265][model8_pretrain.py][INFO] Epoch:[0/2](135000/4588595) loss:2.205 lr:0.0000100 epoch_Time:28090.0min: [2024-01-03 07:51:16,265][model8_pretrain.py][INFO] Epoch:[0/2](135000/4588595) loss:2.832 lr:0.0000100 epoch_Time:28090.0min: [2024-01-03 07:51:16,265][model8_pretrain.py][INFO] Epoch:[0/2](135000/4588595) loss:2.440 lr:0.0000100 epoch_Time:28090.0min: [2024-01-03 07:51:16,265][model8_pretrain.py][INFO] Epoch:[0/2](135000/4588595) loss:3.171 lr:0.0000100 epoch_Time:28090.0min: [2024-01-03 07:51:16,265][model8_pretrain.py][INFO] Epoch:[0/2](135000/4588595) loss:2.672 lr:0.0000100 epoch_Time:28090.0min: [2024-01-03 07:51:16,265][model8_pretrain.py][INFO] Epoch:[0/2](135000/4588595) loss:2.934 lr:0.0000100 epoch_Time:28090.0min: [2024-01-03 07:51:16,265][model8_pretrain.py][INFO] Epoch:[0/2](135000/4588595) loss:3.241 lr:0.0000100 epoch_Time:28090.0min: [2024-01-03 07:51:16,265][model8_pretrain.py][INFO] Epoch:[0/2](135000/4588595) loss:2.826 lr:0.0000100 epoch_Time:28090.0min: [2024-01-03 07:51:53,202][model8_pretrain.py][INFO] Epoch:[0/2](135100/4588595) loss:2.463 lr:0.0000100 epoch_Time:28088.0min: [2024-01-03 07:51:53,202][model8_pretrain.py][INFO] Epoch:[0/2](135100/4588595) loss:3.351 lr:0.0000100 epoch_Time:28088.0min: [2024-01-03 
07:51:53,202][model8_pretrain.py][INFO] Epoch:[0/2](135100/4588595) loss:2.761 lr:0.0000100 epoch_Time:28088.0min: [2024-01-03 07:51:53,202][model8_pretrain.py][INFO] Epoch:[0/2](135100/4588595) loss:3.021 lr:0.0000100 epoch_Time:28088.0min: [2024-01-03 07:51:53,202][model8_pretrain.py][INFO] Epoch:[0/2](135100/4588595) loss:2.997 lr:0.0000100 epoch_Time:28088.0min: [2024-01-03 07:51:53,202][model8_pretrain.py][INFO] Epoch:[0/2](135100/4588595) loss:3.241 lr:0.0000100 epoch_Time:28088.0min: [2024-01-03 07:51:53,202][model8_pretrain.py][INFO] Epoch:[0/2](135100/4588595) loss:2.721 lr:0.0000100 epoch_Time:28088.0min: [2024-01-03 07:51:53,202][model8_pretrain.py][INFO] Epoch:[0/2](135100/4588595) loss:2.557 lr:0.0000100 epoch_Time:28088.0min: [2024-01-03 07:52:30,157][model8_pretrain.py][INFO] Epoch:[0/2](135200/4588595) loss:3.168 lr:0.0000100 epoch_Time:28088.0min: [2024-01-03 07:52:30,157][model8_pretrain.py][INFO] Epoch:[0/2](135200/4588595) loss:3.013 lr:0.0000100 epoch_Time:28088.0min: [2024-01-03 07:52:30,157][model8_pretrain.py][INFO] Epoch:[0/2](135200/4588595) loss:2.795 lr:0.0000100 epoch_Time:28088.0min: [2024-01-03 07:52:30,157][model8_pretrain.py][INFO] Epoch:[0/2](135200/4588595) loss:2.917 lr:0.0000100 epoch_Time:28088.0min: [2024-01-03 07:52:30,157][model8_pretrain.py][INFO] Epoch:[0/2](135200/4588595) loss:3.396 lr:0.0000100 epoch_Time:28088.0min: [2024-01-03 07:52:30,158][model8_pretrain.py][INFO] Epoch:[0/2](135200/4588595) loss:2.631 lr:0.0000100 epoch_Time:28088.0min: [2024-01-03 07:52:30,158][model8_pretrain.py][INFO] Epoch:[0/2](135200/4588595) loss:3.183 lr:0.0000100 epoch_Time:28088.0min: [2024-01-03 07:52:30,158][model8_pretrain.py][INFO] Epoch:[0/2](135200/4588595) loss:3.400 lr:0.0000100 epoch_Time:28088.0min: [2024-01-03 07:53:07,105][model8_pretrain.py][INFO] Epoch:[0/2](135300/4588595) loss:2.885 lr:0.0000100 epoch_Time:28086.0min: [2024-01-03 07:53:07,105][model8_pretrain.py][INFO] Epoch:[0/2](135300/4588595) loss:2.801 lr:0.0000100 
epoch_Time:28086.0min: [2024-01-03 07:53:07,106][model8_pretrain.py][INFO] Epoch:[0/2](135300/4588595) loss:2.816 lr:0.0000100 epoch_Time:28086.0min: [2024-01-03 07:53:07,106][model8_pretrain.py][INFO] Epoch:[0/2](135300/4588595) loss:2.734 lr:0.0000100 epoch_Time:28086.0min: [2024-01-03 07:53:07,106][model8_pretrain.py][INFO] Epoch:[0/2](135300/4588595) loss:3.254 lr:0.0000100 epoch_Time:28086.0min: [2024-01-03 07:53:07,106][model8_pretrain.py][INFO] Epoch:[0/2](135300/4588595) loss:2.892 lr:0.0000100 epoch_Time:28086.0min: [2024-01-03 07:53:07,106][model8_pretrain.py][INFO] Epoch:[0/2](135300/4588595) loss:3.154 lr:0.0000100 epoch_Time:28086.0min: [2024-01-03 07:53:07,106][model8_pretrain.py][INFO] Epoch:[0/2](135300/4588595) loss:2.906 lr:0.0000100 epoch_Time:28086.0min: [2024-01-03 07:53:44,033][model8_pretrain.py][INFO] Epoch:[0/2](135400/4588595) loss:2.742 lr:0.0000100 epoch_Time:28086.0min: [2024-01-03 07:53:44,033][model8_pretrain.py][INFO] Epoch:[0/2](135400/4588595) loss:2.733 lr:0.0000100 epoch_Time:28086.0min: [2024-01-03 07:53:44,033][model8_pretrain.py][INFO] Epoch:[0/2](135400/4588595) loss:2.445 lr:0.0000100 epoch_Time:28086.0min: [2024-01-03 07:53:44,033][model8_pretrain.py][INFO] Epoch:[0/2](135400/4588595) loss:2.685 lr:0.0000100 epoch_Time:28086.0min: [2024-01-03 07:53:44,033][model8_pretrain.py][INFO] Epoch:[0/2](135400/4588595) loss:2.933 lr:0.0000100 epoch_Time:28086.0min: [2024-01-03 07:53:44,033][model8_pretrain.py][INFO] Epoch:[0/2](135400/4588595) loss:3.193 lr:0.0000100 epoch_Time:28086.0min: [2024-01-03 07:53:44,033][model8_pretrain.py][INFO] Epoch:[0/2](135400/4588595) loss:2.656 lr:0.0000100 epoch_Time:28086.0min: [2024-01-03 07:53:44,033][model8_pretrain.py][INFO] Epoch:[0/2](135400/4588595) loss:2.190 lr:0.0000100 epoch_Time:28086.0min: [2024-01-03 07:54:29,897][model8_pretrain.py][INFO] Epoch:[0/2](135500/4588595) loss:2.776 lr:0.0000100 epoch_Time:28089.0min: [2024-01-03 07:54:29,897][model8_pretrain.py][INFO] 
Epoch:[0/2](135500/4588595) loss:2.692 lr:0.0000100 epoch_Time:28089.0min: [2024-01-03 07:54:29,897][model8_pretrain.py][INFO] Epoch:[0/2](135500/4588595) loss:3.055 lr:0.0000100 epoch_Time:28089.0min: [2024-01-03 07:54:29,897][model8_pretrain.py][INFO] Epoch:[0/2](135500/4588595) loss:2.535 lr:0.0000100 epoch_Time:28089.0min: [2024-01-03 07:54:29,897][model8_pretrain.py][INFO] Epoch:[0/2](135500/4588595) loss:3.408 lr:0.0000100 epoch_Time:28089.0min: [2024-01-03 07:54:29,897][model8_pretrain.py][INFO] Epoch:[0/2](135500/4588595) loss:2.898 lr:0.0000100 epoch_Time:28089.0min: [2024-01-03 07:54:29,897][model8_pretrain.py][INFO] Epoch:[0/2](135500/4588595) loss:3.228 lr:0.0000100 epoch_Time:28089.0min: [2024-01-03 07:54:29,898][model8_pretrain.py][INFO] Epoch:[0/2](135500/4588595) loss:3.106 lr:0.0000100 epoch_Time:28089.0min: [2024-01-03 07:55:06,830][model8_pretrain.py][INFO] Epoch:[0/2](135600/4588595) loss:3.184 lr:0.0000100 epoch_Time:28088.0min: [2024-01-03 07:55:06,830][model8_pretrain.py][INFO] Epoch:[0/2](135600/4588595) loss:2.933 lr:0.0000100 epoch_Time:28088.0min: [2024-01-03 07:55:06,830][model8_pretrain.py][INFO] Epoch:[0/2](135600/4588595) loss:2.919 lr:0.0000100 epoch_Time:28088.0min: [2024-01-03 07:55:06,830][model8_pretrain.py][INFO] Epoch:[0/2](135600/4588595) loss:2.587 lr:0.0000100 epoch_Time:28088.0min: [2024-01-03 07:55:06,830][model8_pretrain.py][INFO] Epoch:[0/2](135600/4588595) loss:2.455 lr:0.0000100 epoch_Time:28088.0min: [2024-01-03 07:55:06,831][model8_pretrain.py][INFO] Epoch:[0/2](135600/4588595) loss:2.726 lr:0.0000100 epoch_Time:28088.0min: [2024-01-03 07:55:06,831][model8_pretrain.py][INFO] Epoch:[0/2](135600/4588595) loss:2.451 lr:0.0000100 epoch_Time:28088.0min: [2024-01-03 07:55:06,831][model8_pretrain.py][INFO] Epoch:[0/2](135600/4588595) loss:3.063 lr:0.0000100 epoch_Time:28088.0min: [2024-01-03 07:55:43,754][model8_pretrain.py][INFO] Epoch:[0/2](135700/4588595) loss:2.956 lr:0.0000100 epoch_Time:28087.0min: [2024-01-03 
07:55:43,754][model8_pretrain.py][INFO] Epoch:[0/2](135700/4588595) loss:2.443 lr:0.0000100 epoch_Time:28087.0min: [2024-01-03 07:55:43,754][model8_pretrain.py][INFO] Epoch:[0/2](135700/4588595) loss:3.058 lr:0.0000100 epoch_Time:28087.0min: [2024-01-03 07:55:43,754][model8_pretrain.py][INFO] Epoch:[0/2](135700/4588595) loss:3.024 lr:0.0000100 epoch_Time:28087.0min: [2024-01-03 07:55:43,754][model8_pretrain.py][INFO] Epoch:[0/2](135700/4588595) loss:3.059 lr:0.0000100 epoch_Time:28087.0min: [2024-01-03 07:55:43,754][model8_pretrain.py][INFO] Epoch:[0/2](135700/4588595) loss:2.891 lr:0.0000100 epoch_Time:28087.0min: [2024-01-03 07:55:43,754][model8_pretrain.py][INFO] Epoch:[0/2](135700/4588595) loss:2.977 lr:0.0000100 epoch_Time:28087.0min: [2024-01-03 07:55:43,754][model8_pretrain.py][INFO] Epoch:[0/2](135700/4588595) loss:3.397 lr:0.0000100 epoch_Time:28087.0min: [2024-01-03 07:56:20,685][model8_pretrain.py][INFO] Epoch:[0/2](135800/4588595) loss:3.162 lr:0.0000100 epoch_Time:28086.0min: [2024-01-03 07:56:20,685][model8_pretrain.py][INFO] Epoch:[0/2](135800/4588595) loss:3.076 lr:0.0000100 epoch_Time:28086.0min: [2024-01-03 07:56:20,685][model8_pretrain.py][INFO] Epoch:[0/2](135800/4588595) loss:3.050 lr:0.0000100 epoch_Time:28086.0min: [2024-01-03 07:56:20,685][model8_pretrain.py][INFO] Epoch:[0/2](135800/4588595) loss:3.266 lr:0.0000100 epoch_Time:28086.0min: [2024-01-03 07:56:20,685][model8_pretrain.py][INFO] Epoch:[0/2](135800/4588595) loss:3.036 lr:0.0000100 epoch_Time:28086.0min: [2024-01-03 07:56:20,685][model8_pretrain.py][INFO] Epoch:[0/2](135800/4588595) loss:2.470 lr:0.0000100 epoch_Time:28086.0min: [2024-01-03 07:56:20,685][model8_pretrain.py][INFO] Epoch:[0/2](135800/4588595) loss:3.108 lr:0.0000100 epoch_Time:28086.0min: [2024-01-03 07:56:20,685][model8_pretrain.py][INFO] Epoch:[0/2](135800/4588595) loss:3.080 lr:0.0000100 epoch_Time:28086.0min: [2024-01-03 07:56:57,647][model8_pretrain.py][INFO] Epoch:[0/2](135900/4588595) loss:2.748 lr:0.0000100 
epoch_Time:28084.0min: [2024-01-03 07:56:57,647][model8_pretrain.py][INFO] Epoch:[0/2](135900/4588595) loss:2.834 lr:0.0000100 epoch_Time:28084.0min: [2024-01-03 07:56:57,647][model8_pretrain.py][INFO] Epoch:[0/2](135900/4588595) loss:2.805 lr:0.0000100 epoch_Time:28084.0min: [2024-01-03 07:56:57,647][model8_pretrain.py][INFO] Epoch:[0/2](135900/4588595) loss:2.929 lr:0.0000100 epoch_Time:28084.0min: [2024-01-03 07:56:57,647][model8_pretrain.py][INFO] Epoch:[0/2](135900/4588595) loss:3.075 lr:0.0000100 epoch_Time:28084.0min: [2024-01-03 07:56:57,647][model8_pretrain.py][INFO] Epoch:[0/2](135900/4588595) loss:2.762 lr:0.0000100 epoch_Time:28084.0min: [2024-01-03 07:56:57,647][model8_pretrain.py][INFO] Epoch:[0/2](135900/4588595) loss:3.051 lr:0.0000100 epoch_Time:28084.0min: [2024-01-03 07:56:57,648][model8_pretrain.py][INFO] Epoch:[0/2](135900/4588595) loss:2.756 lr:0.0000100 epoch_Time:28084.0min: [2024-01-03 07:57:34,591][model8_pretrain.py][INFO] Epoch:[0/2](136000/4588595) loss:3.119 lr:0.0000100 epoch_Time:28084.0min: [2024-01-03 07:57:34,591][model8_pretrain.py][INFO] Epoch:[0/2](136000/4588595) loss:3.089 lr:0.0000100 epoch_Time:28084.0min: [2024-01-03 07:57:34,591][model8_pretrain.py][INFO] Epoch:[0/2](136000/4588595) loss:3.454 lr:0.0000100 epoch_Time:28084.0min: [2024-01-03 07:57:34,591][model8_pretrain.py][INFO] Epoch:[0/2](136000/4588595) loss:2.952 lr:0.0000100 epoch_Time:28084.0min: [2024-01-03 07:57:34,591][model8_pretrain.py][INFO] Epoch:[0/2](136000/4588595) loss:3.389 lr:0.0000100 epoch_Time:28084.0min: [2024-01-03 07:57:34,591][model8_pretrain.py][INFO] Epoch:[0/2](136000/4588595) loss:2.431 lr:0.0000100 epoch_Time:28084.0min: [2024-01-03 07:57:34,591][model8_pretrain.py][INFO] Epoch:[0/2](136000/4588595) loss:3.319 lr:0.0000100 epoch_Time:28084.0min: [2024-01-03 07:57:34,591][model8_pretrain.py][INFO] Epoch:[0/2](136000/4588595) loss:3.033 lr:0.0000100 epoch_Time:28084.0min: [2024-01-03 07:58:11,534][model8_pretrain.py][INFO] 
Epoch:[0/2](136100/4588595) loss:3.203 lr:0.0000100 epoch_Time:28082.0min: [2024-01-03 07:58:11,534][model8_pretrain.py][INFO] Epoch:[0/2](136100/4588595) loss:2.931 lr:0.0000100 epoch_Time:28082.0min: [2024-01-03 07:58:11,534][model8_pretrain.py][INFO] Epoch:[0/2](136100/4588595) loss:3.225 lr:0.0000100 epoch_Time:28082.0min: [2024-01-03 07:58:11,534][model8_pretrain.py][INFO] Epoch:[0/2](136100/4588595) loss:3.086 lr:0.0000100 epoch_Time:28082.0min: [2024-01-03 07:58:11,534][model8_pretrain.py][INFO] Epoch:[0/2](136100/4588595) loss:2.858 lr:0.0000100 epoch_Time:28082.0min: [2024-01-03 07:58:11,534][model8_pretrain.py][INFO] Epoch:[0/2](136100/4588595) loss:3.125 lr:0.0000100 epoch_Time:28082.0min: [2024-01-03 07:58:11,534][model8_pretrain.py][INFO] Epoch:[0/2](136100/4588595) loss:2.718 lr:0.0000100 epoch_Time:28082.0min: [2024-01-03 07:58:11,534][model8_pretrain.py][INFO] Epoch:[0/2](136100/4588595) loss:2.547 lr:0.0000100 epoch_Time:28082.0min: [2024-01-03 07:58:48,480][model8_pretrain.py][INFO] Epoch:[0/2](136200/4588595) loss:2.660 lr:0.0000100 epoch_Time:28081.0min: [2024-01-03 07:58:48,480][model8_pretrain.py][INFO] Epoch:[0/2](136200/4588595) loss:2.405 lr:0.0000100 epoch_Time:28081.0min: [2024-01-03 07:58:48,480][model8_pretrain.py][INFO] Epoch:[0/2](136200/4588595) loss:3.140 lr:0.0000100 epoch_Time:28081.0min: [2024-01-03 07:58:48,480][model8_pretrain.py][INFO] Epoch:[0/2](136200/4588595) loss:3.551 lr:0.0000100 epoch_Time:28081.0min: [2024-01-03 07:58:48,480][model8_pretrain.py][INFO] Epoch:[0/2](136200/4588595) loss:2.963 lr:0.0000100 epoch_Time:28081.0min: [2024-01-03 07:58:48,480][model8_pretrain.py][INFO] Epoch:[0/2](136200/4588595) loss:3.177 lr:0.0000100 epoch_Time:28081.0min: [2024-01-03 07:58:48,480][model8_pretrain.py][INFO] Epoch:[0/2](136200/4588595) loss:2.375 lr:0.0000100 epoch_Time:28081.0min: [2024-01-03 07:58:48,481][model8_pretrain.py][INFO] Epoch:[0/2](136200/4588595) loss:3.174 lr:0.0000100 epoch_Time:28081.0min: [2024-01-03 
07:59:34,087][model8_pretrain.py][INFO] Epoch:[0/2](136300/4588595) loss:3.127 lr:0.0000100 epoch_Time:28085.0min: [2024-01-03 07:59:34,087][model8_pretrain.py][INFO] Epoch:[0/2](136300/4588595) loss:2.733 lr:0.0000100 epoch_Time:28085.0min: [2024-01-03 07:59:34,087][model8_pretrain.py][INFO] Epoch:[0/2](136300/4588595) loss:2.677 lr:0.0000100 epoch_Time:28085.0min: [2024-01-03 07:59:34,087][model8_pretrain.py][INFO] Epoch:[0/2](136300/4588595) loss:3.073 lr:0.0000100 epoch_Time:28085.0min: [2024-01-03 07:59:34,087][model8_pretrain.py][INFO] Epoch:[0/2](136300/4588595) loss:2.788 lr:0.0000100 epoch_Time:28085.0min: [2024-01-03 07:59:34,087][model8_pretrain.py][INFO] Epoch:[0/2](136300/4588595) loss:3.111 lr:0.0000100 epoch_Time:28085.0min: [2024-01-03 07:59:34,088][model8_pretrain.py][INFO] Epoch:[0/2](136300/4588595) loss:3.060 lr:0.0000100 epoch_Time:28085.0min: [2024-01-03 07:59:34,088][model8_pretrain.py][INFO] Epoch:[0/2](136300/4588595) loss:2.760 lr:0.0000100 epoch_Time:28085.0min: [2024-01-03 08:00:11,019][model8_pretrain.py][INFO] Epoch:[0/2](136400/4588595) loss:3.267 lr:0.0000100 epoch_Time:28083.0min: [2024-01-03 08:00:11,019][model8_pretrain.py][INFO] Epoch:[0/2](136400/4588595) loss:2.590 lr:0.0000100 epoch_Time:28083.0min: [2024-01-03 08:00:11,019][model8_pretrain.py][INFO] Epoch:[0/2](136400/4588595) loss:3.182 lr:0.0000100 epoch_Time:28083.0min: [2024-01-03 08:00:11,020][model8_pretrain.py][INFO] Epoch:[0/2](136400/4588595) loss:3.027 lr:0.0000100 epoch_Time:28083.0min: [2024-01-03 08:00:11,020][model8_pretrain.py][INFO] Epoch:[0/2](136400/4588595) loss:2.929 lr:0.0000100 epoch_Time:28083.0min: [2024-01-03 08:00:11,020][model8_pretrain.py][INFO] Epoch:[0/2](136400/4588595) loss:3.007 lr:0.0000100 epoch_Time:28083.0min: [2024-01-03 08:00:11,020][model8_pretrain.py][INFO] Epoch:[0/2](136400/4588595) loss:3.151 lr:0.0000100 epoch_Time:28083.0min: [2024-01-03 08:00:11,020][model8_pretrain.py][INFO] Epoch:[0/2](136400/4588595) loss:2.694 lr:0.0000100 
epoch_Time:28083.0min: [2024-01-03 08:00:47,921][model8_pretrain.py][INFO] Epoch:[0/2](136500/4588595) loss:2.948 lr:0.0000100 epoch_Time:28082.0min: [2024-01-03 08:00:47,921][model8_pretrain.py][INFO] Epoch:[0/2](136500/4588595) loss:3.218 lr:0.0000100 epoch_Time:28082.0min: [2024-01-03 08:00:47,921][model8_pretrain.py][INFO] Epoch:[0/2](136500/4588595) loss:3.080 lr:0.0000100 epoch_Time:28082.0min: [2024-01-03 08:00:47,921][model8_pretrain.py][INFO] Epoch:[0/2](136500/4588595) loss:2.858 lr:0.0000100 epoch_Time:28082.0min: [2024-01-03 08:00:47,921][model8_pretrain.py][INFO] Epoch:[0/2](136500/4588595) loss:2.508 lr:0.0000100 epoch_Time:28082.0min: [2024-01-03 08:00:47,922][model8_pretrain.py][INFO] Epoch:[0/2](136500/4588595) loss:3.183 lr:0.0000100 epoch_Time:28082.0min: [2024-01-03 08:00:47,922][model8_pretrain.py][INFO] Epoch:[0/2](136500/4588595) loss:2.674 lr:0.0000100 epoch_Time:28082.0min: [2024-01-03 08:00:47,922][model8_pretrain.py][INFO] Epoch:[0/2](136500/4588595) loss:3.046 lr:0.0000100 epoch_Time:28082.0min: [2024-01-03 08:01:24,851][model8_pretrain.py][INFO] Epoch:[0/2](136600/4588595) loss:3.365 lr:0.0000100 epoch_Time:28081.0min: [2024-01-03 08:01:24,851][model8_pretrain.py][INFO] Epoch:[0/2](136600/4588595) loss:3.094 lr:0.0000100 epoch_Time:28081.0min: [2024-01-03 08:01:24,851][model8_pretrain.py][INFO] Epoch:[0/2](136600/4588595) loss:2.639 lr:0.0000100 epoch_Time:28081.0min: [2024-01-03 08:01:24,851][model8_pretrain.py][INFO] Epoch:[0/2](136600/4588595) loss:3.136 lr:0.0000100 epoch_Time:28081.0min: [2024-01-03 08:01:24,851][model8_pretrain.py][INFO] Epoch:[0/2](136600/4588595) loss:3.116 lr:0.0000100 epoch_Time:28081.0min: [2024-01-03 08:01:24,851][model8_pretrain.py][INFO] Epoch:[0/2](136600/4588595) loss:3.368 lr:0.0000100 epoch_Time:28081.0min: [2024-01-03 08:01:24,851][model8_pretrain.py][INFO] Epoch:[0/2](136600/4588595) loss:3.365 lr:0.0000100 epoch_Time:28081.0min: [2024-01-03 08:01:24,851][model8_pretrain.py][INFO] 
Epoch:[0/2](136600/4588595) loss:3.127 lr:0.0000100 epoch_Time:28081.0min: [2024-01-03 08:02:01,793][model8_pretrain.py][INFO] Epoch:[0/2](136700/4588595) loss:2.880 lr:0.0000100 epoch_Time:28080.0min: [2024-01-03 08:02:01,793][model8_pretrain.py][INFO] Epoch:[0/2](136700/4588595) loss:3.186 lr:0.0000100 epoch_Time:28080.0min: [2024-01-03 08:02:01,793][model8_pretrain.py][INFO] Epoch:[0/2](136700/4588595) loss:2.937 lr:0.0000100 epoch_Time:28080.0min: [2024-01-03 08:02:01,793][model8_pretrain.py][INFO] Epoch:[0/2](136700/4588595) loss:2.541 lr:0.0000100 epoch_Time:28080.0min: [2024-01-03 08:02:01,793][model8_pretrain.py][INFO] Epoch:[0/2](136700/4588595) loss:3.421 lr:0.0000100 epoch_Time:28080.0min: [2024-01-03 08:02:01,793][model8_pretrain.py][INFO] Epoch:[0/2](136700/4588595) loss:3.014 lr:0.0000100 epoch_Time:28080.0min: [2024-01-03 08:02:01,793][model8_pretrain.py][INFO] Epoch:[0/2](136700/4588595) loss:3.160 lr:0.0000100 epoch_Time:28080.0min: [2024-01-03 08:02:01,793][model8_pretrain.py][INFO] Epoch:[0/2](136700/4588595) loss:2.411 lr:0.0000100 epoch_Time:28080.0min: [2024-01-03 08:02:38,729][model8_pretrain.py][INFO] Epoch:[0/2](136800/4588595) loss:2.793 lr:0.0000100 epoch_Time:28079.0min: [2024-01-03 08:02:38,729][model8_pretrain.py][INFO] Epoch:[0/2](136800/4588595) loss:3.355 lr:0.0000100 epoch_Time:28079.0min: [2024-01-03 08:02:38,729][model8_pretrain.py][INFO] Epoch:[0/2](136800/4588595) loss:2.267 lr:0.0000100 epoch_Time:28079.0min: [2024-01-03 08:02:38,729][model8_pretrain.py][INFO] Epoch:[0/2](136800/4588595) loss:2.885 lr:0.0000100 epoch_Time:28079.0min: [2024-01-03 08:02:38,729][model8_pretrain.py][INFO] Epoch:[0/2](136800/4588595) loss:2.716 lr:0.0000100 epoch_Time:28079.0min: [2024-01-03 08:02:38,729][model8_pretrain.py][INFO] Epoch:[0/2](136800/4588595) loss:3.339 lr:0.0000100 epoch_Time:28079.0min: [2024-01-03 08:02:38,729][model8_pretrain.py][INFO] Epoch:[0/2](136800/4588595) loss:3.328 lr:0.0000100 epoch_Time:28079.0min: [2024-01-03 
08:02:38,729][model8_pretrain.py][INFO] Epoch:[0/2](136800/4588595) loss:2.896 lr:0.0000100 epoch_Time:28079.0min: [2024-01-03 08:03:15,672][model8_pretrain.py][INFO] Epoch:[0/2](136900/4588595) loss:2.996 lr:0.0000100 epoch_Time:28078.0min: [2024-01-03 08:03:15,672][model8_pretrain.py][INFO] Epoch:[0/2](136900/4588595) loss:3.011 lr:0.0000100 epoch_Time:28078.0min: [2024-01-03 08:03:15,672][model8_pretrain.py][INFO] Epoch:[0/2](136900/4588595) loss:3.022 lr:0.0000100 epoch_Time:28078.0min: [2024-01-03 08:03:15,672][model8_pretrain.py][INFO] Epoch:[0/2](136900/4588595) loss:2.901 lr:0.0000100 epoch_Time:28078.0min: [2024-01-03 08:03:15,672][model8_pretrain.py][INFO] Epoch:[0/2](136900/4588595) loss:2.953 lr:0.0000100 epoch_Time:28078.0min: [2024-01-03 08:03:15,672][model8_pretrain.py][INFO] Epoch:[0/2](136900/4588595) loss:3.267 lr:0.0000100 epoch_Time:28078.0min: [2024-01-03 08:03:15,672][model8_pretrain.py][INFO] Epoch:[0/2](136900/4588595) loss:3.132 lr:0.0000100 epoch_Time:28078.0min: [2024-01-03 08:03:15,673][model8_pretrain.py][INFO] Epoch:[0/2](136900/4588595) loss:3.073 lr:0.0000100 epoch_Time:28078.0min: [2024-01-03 08:03:52,615][model8_pretrain.py][INFO] Epoch:[0/2](137000/4588595) loss:3.011 lr:0.0000100 epoch_Time:28076.0min: [2024-01-03 08:03:52,615][model8_pretrain.py][INFO] Epoch:[0/2](137000/4588595) loss:3.023 lr:0.0000100 epoch_Time:28076.0min: [2024-01-03 08:03:52,615][model8_pretrain.py][INFO] Epoch:[0/2](137000/4588595) loss:3.050 lr:0.0000100 epoch_Time:28076.0min: [2024-01-03 08:03:52,615][model8_pretrain.py][INFO] Epoch:[0/2](137000/4588595) loss:3.260 lr:0.0000100 epoch_Time:28076.0min: [2024-01-03 08:03:52,615][model8_pretrain.py][INFO] Epoch:[0/2](137000/4588595) loss:3.288 lr:0.0000100 epoch_Time:28076.0min: [2024-01-03 08:03:52,615][model8_pretrain.py][INFO] Epoch:[0/2](137000/4588595) loss:2.743 lr:0.0000100 epoch_Time:28076.0min: [2024-01-03 08:03:52,615][model8_pretrain.py][INFO] Epoch:[0/2](137000/4588595) loss:3.050 lr:0.0000100 
epoch_Time:28076.0min: [2024-01-03 08:03:52,616][model8_pretrain.py][INFO] Epoch:[0/2](137000/4588595) loss:2.771 lr:0.0000100 epoch_Time:28076.0min: [2024-01-03 08:04:38,275][model8_pretrain.py][INFO] Epoch:[0/2](137100/4588595) loss:2.954 lr:0.0000100 epoch_Time:28081.0min: [2024-01-03 08:04:38,275][model8_pretrain.py][INFO] Epoch:[0/2](137100/4588595) loss:3.000 lr:0.0000100 epoch_Time:28081.0min: [2024-01-03 08:04:38,275][model8_pretrain.py][INFO] Epoch:[0/2](137100/4588595) loss:2.873 lr:0.0000100 epoch_Time:28081.0min: [2024-01-03 08:04:38,275][model8_pretrain.py][INFO] Epoch:[0/2](137100/4588595) loss:2.513 lr:0.0000100 epoch_Time:28081.0min: [2024-01-03 08:04:38,275][model8_pretrain.py][INFO] Epoch:[0/2](137100/4588595) loss:3.096 lr:0.0000100 epoch_Time:28081.0min: [2024-01-03 08:04:38,275][model8_pretrain.py][INFO] Epoch:[0/2](137100/4588595) loss:3.081 lr:0.0000100 epoch_Time:28081.0min: [2024-01-03 08:04:38,275][model8_pretrain.py][INFO] Epoch:[0/2](137100/4588595) loss:3.379 lr:0.0000100 epoch_Time:28081.0min: [2024-01-03 08:04:38,275][model8_pretrain.py][INFO] Epoch:[0/2](137100/4588595) loss:3.505 lr:0.0000100 epoch_Time:28081.0min: [2024-01-03 08:05:15,214][model8_pretrain.py][INFO] Epoch:[0/2](137200/4588595) loss:2.856 lr:0.0000100 epoch_Time:28079.0min: [2024-01-03 08:05:15,214][model8_pretrain.py][INFO] Epoch:[0/2](137200/4588595) loss:3.063 lr:0.0000100 epoch_Time:28079.0min: [2024-01-03 08:05:15,214][model8_pretrain.py][INFO] Epoch:[0/2](137200/4588595) loss:3.243 lr:0.0000100 epoch_Time:28079.0min: [2024-01-03 08:05:15,214][model8_pretrain.py][INFO] Epoch:[0/2](137200/4588595) loss:3.346 lr:0.0000100 epoch_Time:28079.0min: [2024-01-03 08:05:15,214][model8_pretrain.py][INFO] Epoch:[0/2](137200/4588595) loss:3.208 lr:0.0000100 epoch_Time:28079.0min: [2024-01-03 08:05:15,214][model8_pretrain.py][INFO] Epoch:[0/2](137200/4588595) loss:2.400 lr:0.0000100 epoch_Time:28079.0min: [2024-01-03 08:05:15,214][model8_pretrain.py][INFO] 
Epoch:[0/2](137200/4588595) loss:2.701 lr:0.0000100 epoch_Time:28079.0min: [2024-01-03 08:05:15,214][model8_pretrain.py][INFO] Epoch:[0/2](137200/4588595) loss:2.997 lr:0.0000100 epoch_Time:28079.0min: [2024-01-03 08:05:52,145][model8_pretrain.py][INFO] Epoch:[0/2](137300/4588595) loss:2.800 lr:0.0000100 epoch_Time:28078.0min: [2024-01-03 08:05:52,145][model8_pretrain.py][INFO] Epoch:[0/2](137300/4588595) loss:3.092 lr:0.0000100 epoch_Time:28078.0min: [2024-01-03 08:05:52,145][model8_pretrain.py][INFO] Epoch:[0/2](137300/4588595) loss:3.043 lr:0.0000100 epoch_Time:28078.0min: [2024-01-03 08:05:52,145][model8_pretrain.py][INFO] Epoch:[0/2](137300/4588595) loss:3.418 lr:0.0000100 epoch_Time:28078.0min: [2024-01-03 08:05:52,145][model8_pretrain.py][INFO] Epoch:[0/2](137300/4588595) loss:2.736 lr:0.0000100 epoch_Time:28078.0min: [2024-01-03 08:05:52,145][model8_pretrain.py][INFO] Epoch:[0/2](137300/4588595) loss:3.223 lr:0.0000100 epoch_Time:28078.0min: [2024-01-03 08:05:52,145][model8_pretrain.py][INFO] Epoch:[0/2](137300/4588595) loss:3.053 lr:0.0000100 epoch_Time:28078.0min: [2024-01-03 08:05:52,145][model8_pretrain.py][INFO] Epoch:[0/2](137300/4588595) loss:2.112 lr:0.0000100 epoch_Time:28078.0min: [2024-01-03 08:06:29,097][model8_pretrain.py][INFO] Epoch:[0/2](137400/4588595) loss:3.132 lr:0.0000100 epoch_Time:28077.0min: [2024-01-03 08:06:29,097][model8_pretrain.py][INFO] Epoch:[0/2](137400/4588595) loss:3.029 lr:0.0000100 epoch_Time:28077.0min: [2024-01-03 08:06:29,098][model8_pretrain.py][INFO] Epoch:[0/2](137400/4588595) loss:3.453 lr:0.0000100 epoch_Time:28077.0min: [2024-01-03 08:06:29,098][model8_pretrain.py][INFO] Epoch:[0/2](137400/4588595) loss:3.591 lr:0.0000100 epoch_Time:28077.0min: [2024-01-03 08:06:29,098][model8_pretrain.py][INFO] Epoch:[0/2](137400/4588595) loss:3.204 lr:0.0000100 epoch_Time:28077.0min: [2024-01-03 08:06:29,098][model8_pretrain.py][INFO] Epoch:[0/2](137400/4588595) loss:2.854 lr:0.0000100 epoch_Time:28077.0min: [2024-01-03 
08:06:29,098][model8_pretrain.py][INFO] Epoch:[0/2](137400/4588595) loss:3.198 lr:0.0000100 epoch_Time:28077.0min: [2024-01-03 08:06:29,098][model8_pretrain.py][INFO] Epoch:[0/2](137400/4588595) loss:2.154 lr:0.0000100 epoch_Time:28077.0min: [2024-01-03 08:07:06,034][model8_pretrain.py][INFO] Epoch:[0/2](137500/4588595) loss:2.954 lr:0.0000100 epoch_Time:28076.0min: [2024-01-03 08:07:06,034][model8_pretrain.py][INFO] Epoch:[0/2](137500/4588595) loss:3.070 lr:0.0000100 epoch_Time:28076.0min: [2024-01-03 08:07:06,034][model8_pretrain.py][INFO] Epoch:[0/2](137500/4588595) loss:2.504 lr:0.0000100 epoch_Time:28076.0min: [2024-01-03 08:07:06,034][model8_pretrain.py][INFO] Epoch:[0/2](137500/4588595) loss:3.121 lr:0.0000100 epoch_Time:28076.0min: [2024-01-03 08:07:06,034][model8_pretrain.py][INFO] Epoch:[0/2](137500/4588595) loss:3.002 lr:0.0000100 epoch_Time:28076.0min: [2024-01-03 08:07:06,034][model8_pretrain.py][INFO] Epoch:[0/2](137500/4588595) loss:2.596 lr:0.0000100 epoch_Time:28076.0min: [2024-01-03 08:07:06,034][model8_pretrain.py][INFO] Epoch:[0/2](137500/4588595) loss:3.097 lr:0.0000100 epoch_Time:28076.0min: [2024-01-03 08:07:06,034][model8_pretrain.py][INFO] Epoch:[0/2](137500/4588595) loss:2.911 lr:0.0000100 epoch_Time:28076.0min: [2024-01-03 08:07:42,980][model8_pretrain.py][INFO] Epoch:[0/2](137600/4588595) loss:2.906 lr:0.0000100 epoch_Time:28075.0min: [2024-01-03 08:07:42,980][model8_pretrain.py][INFO] Epoch:[0/2](137600/4588595) loss:3.045 lr:0.0000100 epoch_Time:28075.0min: [2024-01-03 08:07:42,980][model8_pretrain.py][INFO] Epoch:[0/2](137600/4588595) loss:2.981 lr:0.0000100 epoch_Time:28075.0min: [2024-01-03 08:07:42,980][model8_pretrain.py][INFO] Epoch:[0/2](137600/4588595) loss:3.206 lr:0.0000100 epoch_Time:28075.0min: [2024-01-03 08:07:42,980][model8_pretrain.py][INFO] Epoch:[0/2](137600/4588595) loss:2.834 lr:0.0000100 epoch_Time:28075.0min: [2024-01-03 08:07:42,980][model8_pretrain.py][INFO] Epoch:[0/2](137600/4588595) loss:2.794 lr:0.0000100 
epoch_Time:28075.0min: [2024-01-03 08:07:42,980][model8_pretrain.py][INFO] Epoch:[0/2](137600/4588595) loss:2.705 lr:0.0000100 epoch_Time:28075.0min: [2024-01-03 08:07:42,981][model8_pretrain.py][INFO] Epoch:[0/2](137600/4588595) loss:3.018 lr:0.0000100 epoch_Time:28075.0min: [2024-01-03 08:08:19,931][model8_pretrain.py][INFO] Epoch:[0/2](137700/4588595) loss:2.676 lr:0.0000100 epoch_Time:28074.0min: [2024-01-03 08:08:19,931][model8_pretrain.py][INFO] Epoch:[0/2](137700/4588595) loss:2.775 lr:0.0000100 epoch_Time:28074.0min: [2024-01-03 08:08:19,931][model8_pretrain.py][INFO] Epoch:[0/2](137700/4588595) loss:2.931 lr:0.0000100 epoch_Time:28074.0min: [2024-01-03 08:08:19,931][model8_pretrain.py][INFO] Epoch:[0/2](137700/4588595) loss:3.303 lr:0.0000100 epoch_Time:28074.0min: [2024-01-03 08:08:19,931][model8_pretrain.py][INFO] Epoch:[0/2](137700/4588595) loss:2.864 lr:0.0000100 epoch_Time:28074.0min: [2024-01-03 08:08:19,931][model8_pretrain.py][INFO] Epoch:[0/2](137700/4588595) loss:2.950 lr:0.0000100 epoch_Time:28074.0min: [2024-01-03 08:08:19,931][model8_pretrain.py][INFO] Epoch:[0/2](137700/4588595) loss:3.331 lr:0.0000100 epoch_Time:28074.0min: [2024-01-03 08:08:19,931][model8_pretrain.py][INFO] Epoch:[0/2](137700/4588595) loss:2.697 lr:0.0000100 epoch_Time:28074.0min: [2024-01-03 08:08:56,872][model8_pretrain.py][INFO] Epoch:[0/2](137800/4588595) loss:2.802 lr:0.0000100 epoch_Time:28072.0min: [2024-01-03 08:08:56,872][model8_pretrain.py][INFO] Epoch:[0/2](137800/4588595) loss:2.801 lr:0.0000100 epoch_Time:28072.0min: [2024-01-03 08:08:56,872][model8_pretrain.py][INFO] Epoch:[0/2](137800/4588595) loss:2.978 lr:0.0000100 epoch_Time:28072.0min: [2024-01-03 08:08:56,872][model8_pretrain.py][INFO] Epoch:[0/2](137800/4588595) loss:3.523 lr:0.0000100 epoch_Time:28072.0min: [2024-01-03 08:08:56,872][model8_pretrain.py][INFO] Epoch:[0/2](137800/4588595) loss:3.147 lr:0.0000100 epoch_Time:28072.0min: [2024-01-03 08:08:56,872][model8_pretrain.py][INFO] 
Epoch:[0/2](137800/4588595) loss:3.148 lr:0.0000100 epoch_Time:28072.0min: [2024-01-03 08:08:56,872][model8_pretrain.py][INFO] Epoch:[0/2](137800/4588595) loss:2.860 lr:0.0000100 epoch_Time:28072.0min: [2024-01-03 08:08:56,872][model8_pretrain.py][INFO] Epoch:[0/2](137800/4588595) loss:3.081 lr:0.0000100 epoch_Time:28072.0min: [2024-01-03 08:09:42,598][model8_pretrain.py][INFO] Epoch:[0/2](137900/4588595) loss:3.268 lr:0.0000100 epoch_Time:28077.0min: [2024-01-03 08:09:42,598][model8_pretrain.py][INFO] Epoch:[0/2](137900/4588595) loss:2.842 lr:0.0000100 epoch_Time:28077.0min: [2024-01-03 08:09:42,599][model8_pretrain.py][INFO] Epoch:[0/2](137900/4588595) loss:2.262 lr:0.0000100 epoch_Time:28077.0min: [2024-01-03 08:09:42,599][model8_pretrain.py][INFO] Epoch:[0/2](137900/4588595) loss:3.158 lr:0.0000100 epoch_Time:28077.0min: [2024-01-03 08:09:42,599][model8_pretrain.py][INFO] Epoch:[0/2](137900/4588595) loss:2.861 lr:0.0000100 epoch_Time:28077.0min: [2024-01-03 08:09:42,599][model8_pretrain.py][INFO] Epoch:[0/2](137900/4588595) loss:2.929 lr:0.0000100 epoch_Time:28077.0min: [2024-01-03 08:09:42,599][model8_pretrain.py][INFO] Epoch:[0/2](137900/4588595) loss:2.581 lr:0.0000100 epoch_Time:28077.0min: [2024-01-03 08:09:42,599][model8_pretrain.py][INFO] Epoch:[0/2](137900/4588595) loss:2.653 lr:0.0000100 epoch_Time:28077.0min: [2024-01-03 08:10:19,555][model8_pretrain.py][INFO] Epoch:[0/2](138000/4588595) loss:3.246 lr:0.0000100 epoch_Time:28075.0min: [2024-01-03 08:10:19,555][model8_pretrain.py][INFO] Epoch:[0/2](138000/4588595) loss:3.079 lr:0.0000100 epoch_Time:28075.0min: [2024-01-03 08:10:19,556][model8_pretrain.py][INFO] Epoch:[0/2](138000/4588595) loss:3.116 lr:0.0000100 epoch_Time:28075.0min: [2024-01-03 08:10:19,556][model8_pretrain.py][INFO] Epoch:[0/2](138000/4588595) loss:2.688 lr:0.0000100 epoch_Time:28075.0min: [2024-01-03 08:10:19,555][model8_pretrain.py][INFO] Epoch:[0/2](138000/4588595) loss:2.835 lr:0.0000100 epoch_Time:28075.0min: [2024-01-03 
08:10:19,556][model8_pretrain.py][INFO] Epoch:[0/2](138000/4588595) loss:2.843 lr:0.0000100 epoch_Time:28075.0min: [2024-01-03 08:10:19,556][model8_pretrain.py][INFO] Epoch:[0/2](138000/4588595) loss:2.857 lr:0.0000100 epoch_Time:28075.0min: [2024-01-03 08:10:19,556][model8_pretrain.py][INFO] Epoch:[0/2](138000/4588595) loss:2.724 lr:0.0000100 epoch_Time:28075.0min: [2024-01-03 08:10:56,502][model8_pretrain.py][INFO] Epoch:[0/2](138100/4588595) loss:2.895 lr:0.0000100 epoch_Time:28074.0min: [2024-01-03 08:10:56,502][model8_pretrain.py][INFO] Epoch:[0/2](138100/4588595) loss:3.096 lr:0.0000100 epoch_Time:28074.0min: [2024-01-03 08:10:56,502][model8_pretrain.py][INFO] Epoch:[0/2](138100/4588595) loss:2.861 lr:0.0000100 epoch_Time:28074.0min: [2024-01-03 08:10:56,502][model8_pretrain.py][INFO] Epoch:[0/2](138100/4588595) loss:3.046 lr:0.0000100 epoch_Time:28074.0min: [2024-01-03 08:10:56,502][model8_pretrain.py][INFO] Epoch:[0/2](138100/4588595) loss:3.098 lr:0.0000100 epoch_Time:28074.0min: [2024-01-03 08:10:56,502][model8_pretrain.py][INFO] Epoch:[0/2](138100/4588595) loss:2.739 lr:0.0000100 epoch_Time:28074.0min: [2024-01-03 08:10:56,502][model8_pretrain.py][INFO] Epoch:[0/2](138100/4588595) loss:3.138 lr:0.0000100 epoch_Time:28074.0min: [2024-01-03 08:10:56,503][model8_pretrain.py][INFO] Epoch:[0/2](138100/4588595) loss:2.828 lr:0.0000100 epoch_Time:28074.0min: [2024-01-03 08:11:33,441][model8_pretrain.py][INFO] Epoch:[0/2](138200/4588595) loss:2.713 lr:0.0000100 epoch_Time:28073.0min: [2024-01-03 08:11:33,441][model8_pretrain.py][INFO] Epoch:[0/2](138200/4588595) loss:3.075 lr:0.0000100 epoch_Time:28073.0min: [2024-01-03 08:11:33,441][model8_pretrain.py][INFO] Epoch:[0/2](138200/4588595) loss:2.746 lr:0.0000100 epoch_Time:28073.0min: [2024-01-03 08:11:33,441][model8_pretrain.py][INFO] Epoch:[0/2](138200/4588595) loss:3.054 lr:0.0000100 epoch_Time:28073.0min: [2024-01-03 08:11:33,441][model8_pretrain.py][INFO] Epoch:[0/2](138200/4588595) loss:3.375 lr:0.0000100 
epoch_Time:28073.0min: [2024-01-03 08:11:33,441][model8_pretrain.py][INFO] Epoch:[0/2](138200/4588595) loss:2.893 lr:0.0000100 epoch_Time:28073.0min: [2024-01-03 08:11:33,441][model8_pretrain.py][INFO] Epoch:[0/2](138200/4588595) loss:3.045 lr:0.0000100 epoch_Time:28073.0min: [2024-01-03 08:11:33,441][model8_pretrain.py][INFO] Epoch:[0/2](138200/4588595) loss:3.060 lr:0.0000100 epoch_Time:28073.0min: [2024-01-03 08:12:10,381][model8_pretrain.py][INFO] Epoch:[0/2](138300/4588595) loss:3.210 lr:0.0000100 epoch_Time:28072.0min: [2024-01-03 08:12:10,381][model8_pretrain.py][INFO] Epoch:[0/2](138300/4588595) loss:3.101 lr:0.0000100 epoch_Time:28072.0min: [2024-01-03 08:12:10,381][model8_pretrain.py][INFO] Epoch:[0/2](138300/4588595) loss:2.872 lr:0.0000100 epoch_Time:28072.0min: [2024-01-03 08:12:10,381][model8_pretrain.py][INFO] Epoch:[0/2](138300/4588595) loss:3.488 lr:0.0000100 epoch_Time:28072.0min: [2024-01-03 08:12:10,381][model8_pretrain.py][INFO] Epoch:[0/2](138300/4588595) loss:2.976 lr:0.0000100 epoch_Time:28072.0min: [2024-01-03 08:12:10,381][model8_pretrain.py][INFO] Epoch:[0/2](138300/4588595) loss:3.379 lr:0.0000100 epoch_Time:28072.0min: [2024-01-03 08:12:10,382][model8_pretrain.py][INFO] Epoch:[0/2](138300/4588595) loss:3.174 lr:0.0000100 epoch_Time:28072.0min: [2024-01-03 08:12:10,382][model8_pretrain.py][INFO] Epoch:[0/2](138300/4588595) loss:2.589 lr:0.0000100 epoch_Time:28072.0min: [2024-01-03 08:12:47,328][model8_pretrain.py][INFO] Epoch:[0/2](138400/4588595) loss:2.950 lr:0.0000100 epoch_Time:28071.0min: [2024-01-03 08:12:47,328][model8_pretrain.py][INFO] Epoch:[0/2](138400/4588595) loss:2.720 lr:0.0000100 epoch_Time:28071.0min: [2024-01-03 08:12:47,328][model8_pretrain.py][INFO] Epoch:[0/2](138400/4588595) loss:2.916 lr:0.0000100 epoch_Time:28071.0min: [2024-01-03 08:12:47,328][model8_pretrain.py][INFO] Epoch:[0/2](138400/4588595) loss:2.638 lr:0.0000100 epoch_Time:28071.0min: [2024-01-03 08:12:47,328][model8_pretrain.py][INFO] 
Epoch:[0/2](138400/4588595) loss:2.704 lr:0.0000100 epoch_Time:28071.0min: [2024-01-03 08:12:47,328][model8_pretrain.py][INFO] Epoch:[0/2](138400/4588595) loss:3.216 lr:0.0000100 epoch_Time:28071.0min: [2024-01-03 08:12:47,328][model8_pretrain.py][INFO] Epoch:[0/2](138400/4588595) loss:3.055 lr:0.0000100 epoch_Time:28071.0min: [2024-01-03 08:12:47,328][model8_pretrain.py][INFO] Epoch:[0/2](138400/4588595) loss:3.378 lr:0.0000100 epoch_Time:28071.0min: [2024-01-03 08:13:24,288][model8_pretrain.py][INFO] Epoch:[0/2](138500/4588595) loss:2.535 lr:0.0000100 epoch_Time:28070.0min: [2024-01-03 08:13:24,288][model8_pretrain.py][INFO] Epoch:[0/2](138500/4588595) loss:2.996 lr:0.0000100 epoch_Time:28070.0min: [2024-01-03 08:13:24,288][model8_pretrain.py][INFO] Epoch:[0/2](138500/4588595) loss:2.656 lr:0.0000100 epoch_Time:28070.0min: [2024-01-03 08:13:24,288][model8_pretrain.py][INFO] Epoch:[0/2](138500/4588595) loss:2.933 lr:0.0000100 epoch_Time:28070.0min: [2024-01-03 08:13:24,288][model8_pretrain.py][INFO] Epoch:[0/2](138500/4588595) loss:3.058 lr:0.0000100 epoch_Time:28070.0min: [2024-01-03 08:13:24,288][model8_pretrain.py][INFO] Epoch:[0/2](138500/4588595) loss:2.899 lr:0.0000100 epoch_Time:28070.0min: [2024-01-03 08:13:24,288][model8_pretrain.py][INFO] Epoch:[0/2](138500/4588595) loss:2.999 lr:0.0000100 epoch_Time:28070.0min: [2024-01-03 08:13:24,288][model8_pretrain.py][INFO] Epoch:[0/2](138500/4588595) loss:2.924 lr:0.0000100 epoch_Time:28070.0min: [2024-01-03 08:14:01,254][model8_pretrain.py][INFO] Epoch:[0/2](138600/4588595) loss:2.125 lr:0.0000100 epoch_Time:28068.0min: [2024-01-03 08:14:01,254][model8_pretrain.py][INFO] Epoch:[0/2](138600/4588595) loss:2.683 lr:0.0000100 epoch_Time:28068.0min: [2024-01-03 08:14:01,254][model8_pretrain.py][INFO] Epoch:[0/2](138600/4588595) loss:2.749 lr:0.0000100 epoch_Time:28068.0min: [2024-01-03 08:14:01,254][model8_pretrain.py][INFO] Epoch:[0/2](138600/4588595) loss:3.194 lr:0.0000100 epoch_Time:28068.0min: [2024-01-03 
08:14:01,254][model8_pretrain.py][INFO] Epoch:[0/2](138600/4588595) loss:2.835 lr:0.0000100 epoch_Time:28068.0min: [2024-01-03 08:14:01,254][model8_pretrain.py][INFO] Epoch:[0/2](138600/4588595) loss:3.004 lr:0.0000100 epoch_Time:28068.0min: [2024-01-03 08:14:01,254][model8_pretrain.py][INFO] Epoch:[0/2](138600/4588595) loss:2.932 lr:0.0000100 epoch_Time:28068.0min: [2024-01-03 08:14:01,254][model8_pretrain.py][INFO] Epoch:[0/2](138600/4588595) loss:3.271 lr:0.0000100 epoch_Time:28068.0min: [2024-01-03 08:14:46,798][model8_pretrain.py][INFO] Epoch:[0/2](138700/4588595) loss:2.887 lr:0.0000100 epoch_Time:28072.0min: [2024-01-03 08:14:46,798][model8_pretrain.py][INFO] Epoch:[0/2](138700/4588595) loss:3.171 lr:0.0000100 epoch_Time:28072.0min: [2024-01-03 08:14:46,798][model8_pretrain.py][INFO] Epoch:[0/2](138700/4588595) loss:3.415 lr:0.0000100 epoch_Time:28072.0min: [2024-01-03 08:14:46,798][model8_pretrain.py][INFO] Epoch:[0/2](138700/4588595) loss:2.725 lr:0.0000100 epoch_Time:28072.0min: [2024-01-03 08:14:46,798][model8_pretrain.py][INFO] Epoch:[0/2](138700/4588595) loss:2.845 lr:0.0000100 epoch_Time:28072.0min: [2024-01-03 08:14:46,798][model8_pretrain.py][INFO] Epoch:[0/2](138700/4588595) loss:3.293 lr:0.0000100 epoch_Time:28072.0min: [2024-01-03 08:14:46,798][model8_pretrain.py][INFO] Epoch:[0/2](138700/4588595) loss:3.185 lr:0.0000100 epoch_Time:28072.0min: [2024-01-03 08:14:46,798][model8_pretrain.py][INFO] Epoch:[0/2](138700/4588595) loss:2.833 lr:0.0000100 epoch_Time:28072.0min: [2024-01-03 08:15:23,722][model8_pretrain.py][INFO] Epoch:[0/2](138800/4588595) loss:2.976 lr:0.0000100 epoch_Time:28071.0min: [2024-01-03 08:15:23,722][model8_pretrain.py][INFO] Epoch:[0/2](138800/4588595) loss:2.762 lr:0.0000100 epoch_Time:28071.0min: [2024-01-03 08:15:23,722][model8_pretrain.py][INFO] Epoch:[0/2](138800/4588595) loss:3.004 lr:0.0000100 epoch_Time:28071.0min: [2024-01-03 08:15:23,722][model8_pretrain.py][INFO] Epoch:[0/2](138800/4588595) loss:3.298 lr:0.0000100 
epoch_Time:28071.0min: [2024-01-03 08:15:23,722][model8_pretrain.py][INFO] Epoch:[0/2](138800/4588595) loss:2.914 lr:0.0000100 epoch_Time:28071.0min: [2024-01-03 08:15:23,722][model8_pretrain.py][INFO] Epoch:[0/2](138800/4588595) loss:2.651 lr:0.0000100 epoch_Time:28071.0min: [2024-01-03 08:15:23,722][model8_pretrain.py][INFO] Epoch:[0/2](138800/4588595) loss:2.851 lr:0.0000100 epoch_Time:28071.0min: [2024-01-03 08:15:23,722][model8_pretrain.py][INFO] Epoch:[0/2](138800/4588595) loss:3.592 lr:0.0000100 epoch_Time:28071.0min: [2024-01-03 08:16:00,661][model8_pretrain.py][INFO] Epoch:[0/2](138900/4588595) loss:2.511 lr:0.0000100 epoch_Time:28069.0min: [2024-01-03 08:16:00,661][model8_pretrain.py][INFO] Epoch:[0/2](138900/4588595) loss:3.401 lr:0.0000100 epoch_Time:28069.0min: [2024-01-03 08:16:00,661][model8_pretrain.py][INFO] Epoch:[0/2](138900/4588595) loss:2.998 lr:0.0000100 epoch_Time:28069.0min: [2024-01-03 08:16:00,661][model8_pretrain.py][INFO] Epoch:[0/2](138900/4588595) loss:2.582 lr:0.0000100 epoch_Time:28069.0min: [2024-01-03 08:16:00,661][model8_pretrain.py][INFO] Epoch:[0/2](138900/4588595) loss:3.320 lr:0.0000100 epoch_Time:28069.0min: [2024-01-03 08:16:00,661][model8_pretrain.py][INFO] Epoch:[0/2](138900/4588595) loss:2.492 lr:0.0000100 epoch_Time:28069.0min: [2024-01-03 08:16:00,661][model8_pretrain.py][INFO] Epoch:[0/2](138900/4588595) loss:3.036 lr:0.0000100 epoch_Time:28069.0min: [2024-01-03 08:16:00,662][model8_pretrain.py][INFO] Epoch:[0/2](138900/4588595) loss:2.766 lr:0.0000100 epoch_Time:28069.0min: [2024-01-03 08:16:37,595][model8_pretrain.py][INFO] Epoch:[0/2](139000/4588595) loss:3.156 lr:0.0000100 epoch_Time:28069.0min: [2024-01-03 08:16:37,595][model8_pretrain.py][INFO] Epoch:[0/2](139000/4588595) loss:2.936 lr:0.0000100 epoch_Time:28069.0min: [2024-01-03 08:16:37,595][model8_pretrain.py][INFO] Epoch:[0/2](139000/4588595) loss:2.988 lr:0.0000100 epoch_Time:28069.0min: [2024-01-03 08:16:37,595][model8_pretrain.py][INFO] 
Epoch:[0/2](139000/4588595) loss:2.759 lr:0.0000100 epoch_Time:28069.0min: [2024-01-03 08:16:37,595][model8_pretrain.py][INFO] Epoch:[0/2](139000/4588595) loss:2.497 lr:0.0000100 epoch_Time:28069.0min: [2024-01-03 08:16:37,595][model8_pretrain.py][INFO] Epoch:[0/2](139000/4588595) loss:2.525 lr:0.0000100 epoch_Time:28069.0min: [2024-01-03 08:16:37,595][model8_pretrain.py][INFO] Epoch:[0/2](139000/4588595) loss:2.741 lr:0.0000100 epoch_Time:28069.0min: [2024-01-03 08:16:37,596][model8_pretrain.py][INFO] Epoch:[0/2](139000/4588595) loss:3.475 lr:0.0000100 epoch_Time:28069.0min: [2024-01-03 08:17:14,517][model8_pretrain.py][INFO] Epoch:[0/2](139100/4588595) loss:2.392 lr:0.0000100 epoch_Time:28067.0min: [2024-01-03 08:17:14,517][model8_pretrain.py][INFO] Epoch:[0/2](139100/4588595) loss:3.260 lr:0.0000100 epoch_Time:28067.0min: [2024-01-03 08:17:14,517][model8_pretrain.py][INFO] Epoch:[0/2](139100/4588595) loss:2.958 lr:0.0000100 epoch_Time:28067.0min: [2024-01-03 08:17:14,517][model8_pretrain.py][INFO] Epoch:[0/2](139100/4588595) loss:2.973 lr:0.0000100 epoch_Time:28067.0min: [2024-01-03 08:17:14,517][model8_pretrain.py][INFO] Epoch:[0/2](139100/4588595) loss:2.784 lr:0.0000100 epoch_Time:28067.0min: [2024-01-03 08:17:14,517][model8_pretrain.py][INFO] Epoch:[0/2](139100/4588595) loss:2.925 lr:0.0000100 epoch_Time:28067.0min: [2024-01-03 08:17:14,517][model8_pretrain.py][INFO] Epoch:[0/2](139100/4588595) loss:2.616 lr:0.0000100 epoch_Time:28067.0min: [2024-01-03 08:17:14,517][model8_pretrain.py][INFO] Epoch:[0/2](139100/4588595) loss:3.173 lr:0.0000100 epoch_Time:28067.0min: [2024-01-03 08:17:51,459][model8_pretrain.py][INFO] Epoch:[0/2](139200/4588595) loss:2.520 lr:0.0000100 epoch_Time:28066.0min: [2024-01-03 08:17:51,459][model8_pretrain.py][INFO] Epoch:[0/2](139200/4588595) loss:2.865 lr:0.0000100 epoch_Time:28066.0min: [2024-01-03 08:17:51,459][model8_pretrain.py][INFO] Epoch:[0/2](139200/4588595) loss:2.605 lr:0.0000100 epoch_Time:28066.0min: [2024-01-03 
08:17:51,459][model8_pretrain.py][INFO] Epoch:[0/2](139200/4588595) loss:3.147 lr:0.0000100 epoch_Time:28066.0min: [2024-01-03 08:17:51,459][model8_pretrain.py][INFO] Epoch:[0/2](139200/4588595) loss:2.807 lr:0.0000100 epoch_Time:28066.0min: [2024-01-03 08:17:51,459][model8_pretrain.py][INFO] Epoch:[0/2](139200/4588595) loss:3.106 lr:0.0000100 epoch_Time:28066.0min: [2024-01-03 08:17:51,459][model8_pretrain.py][INFO] Epoch:[0/2](139200/4588595) loss:2.925 lr:0.0000100 epoch_Time:28066.0min: [2024-01-03 08:17:51,459][model8_pretrain.py][INFO] Epoch:[0/2](139200/4588595) loss:3.197 lr:0.0000100 epoch_Time:28066.0min: [2024-01-03 08:18:28,404][model8_pretrain.py][INFO] Epoch:[0/2](139300/4588595) loss:3.255 lr:0.0000100 epoch_Time:28065.0min: [2024-01-03 08:18:28,404][model8_pretrain.py][INFO] Epoch:[0/2](139300/4588595) loss:3.446 lr:0.0000100 epoch_Time:28065.0min: [2024-01-03 08:18:28,404][model8_pretrain.py][INFO] Epoch:[0/2](139300/4588595) loss:2.800 lr:0.0000100 epoch_Time:28065.0min: [2024-01-03 08:18:28,404][model8_pretrain.py][INFO] Epoch:[0/2](139300/4588595) loss:2.899 lr:0.0000100 epoch_Time:28065.0min: [2024-01-03 08:18:28,404][model8_pretrain.py][INFO] Epoch:[0/2](139300/4588595) loss:2.792 lr:0.0000100 epoch_Time:28065.0min: [2024-01-03 08:18:28,404][model8_pretrain.py][INFO] Epoch:[0/2](139300/4588595) loss:3.194 lr:0.0000100 epoch_Time:28065.0min: [2024-01-03 08:18:28,404][model8_pretrain.py][INFO] Epoch:[0/2](139300/4588595) loss:3.088 lr:0.0000100 epoch_Time:28065.0min: [2024-01-03 08:18:28,404][model8_pretrain.py][INFO] Epoch:[0/2](139300/4588595) loss:2.307 lr:0.0000100 epoch_Time:28065.0min: [2024-01-03 08:19:05,335][model8_pretrain.py][INFO] Epoch:[0/2](139400/4588595) loss:3.449 lr:0.0000100 epoch_Time:28064.0min: [2024-01-03 08:19:05,335][model8_pretrain.py][INFO] Epoch:[0/2](139400/4588595) loss:2.469 lr:0.0000100 epoch_Time:28064.0min: [2024-01-03 08:19:05,335][model8_pretrain.py][INFO] Epoch:[0/2](139400/4588595) loss:2.791 lr:0.0000100 
epoch_Time:28064.0min: [2024-01-03 08:19:05,335][model8_pretrain.py][INFO] Epoch:[0/2](139400/4588595) loss:3.016 lr:0.0000100 epoch_Time:28064.0min: [2024-01-03 08:19:05,335][model8_pretrain.py][INFO] Epoch:[0/2](139400/4588595) loss:2.984 lr:0.0000100 epoch_Time:28064.0min: [2024-01-03 08:19:05,335][model8_pretrain.py][INFO] Epoch:[0/2](139400/4588595) loss:3.136 lr:0.0000100 epoch_Time:28064.0min: [2024-01-03 08:19:05,335][model8_pretrain.py][INFO] Epoch:[0/2](139400/4588595) loss:2.761 lr:0.0000100 epoch_Time:28064.0min: [2024-01-03 08:19:05,335][model8_pretrain.py][INFO] Epoch:[0/2](139400/4588595) loss:2.676 lr:0.0000100 epoch_Time:28064.0min: [2024-01-03 08:19:51,116][model8_pretrain.py][INFO] Epoch:[0/2](139500/4588595) loss:2.805 lr:0.0000100 epoch_Time:28067.0min: [2024-01-03 08:19:51,116][model8_pretrain.py][INFO] Epoch:[0/2](139500/4588595) loss:2.832 lr:0.0000100 epoch_Time:28067.0min: [2024-01-03 08:19:51,117][model8_pretrain.py][INFO] Epoch:[0/2](139500/4588595) loss:3.221 lr:0.0000100 epoch_Time:28067.0min: [2024-01-03 08:19:51,117][model8_pretrain.py][INFO] Epoch:[0/2](139500/4588595) loss:3.040 lr:0.0000100 epoch_Time:28067.0min: [2024-01-03 08:19:51,117][model8_pretrain.py][INFO] Epoch:[0/2](139500/4588595) loss:2.901 lr:0.0000100 epoch_Time:28067.0min: [2024-01-03 08:19:51,116][model8_pretrain.py][INFO] Epoch:[0/2](139500/4588595) loss:2.844 lr:0.0000100 epoch_Time:28067.0min: [2024-01-03 08:19:51,117][model8_pretrain.py][INFO] Epoch:[0/2](139500/4588595) loss:3.154 lr:0.0000100 epoch_Time:28067.0min: [2024-01-03 08:19:51,117][model8_pretrain.py][INFO] Epoch:[0/2](139500/4588595) loss:3.223 lr:0.0000100 epoch_Time:28067.0min: [2024-01-03 08:20:28,042][model8_pretrain.py][INFO] Epoch:[0/2](139600/4588595) loss:3.379 lr:0.0000100 epoch_Time:28067.0min: [2024-01-03 08:20:28,042][model8_pretrain.py][INFO] Epoch:[0/2](139600/4588595) loss:3.094 lr:0.0000100 epoch_Time:28067.0min: [2024-01-03 08:20:28,042][model8_pretrain.py][INFO] 
Epoch:[0/2](139600/4588595) loss:2.976 lr:0.0000100 epoch_Time:28067.0min: [2024-01-03 08:20:28,042][model8_pretrain.py][INFO] Epoch:[0/2](139600/4588595) loss:3.020 lr:0.0000100 epoch_Time:28067.0min: [2024-01-03 08:20:28,042][model8_pretrain.py][INFO] Epoch:[0/2](139600/4588595) loss:3.086 lr:0.0000100 epoch_Time:28067.0min: [2024-01-03 08:20:28,042][model8_pretrain.py][INFO] Epoch:[0/2](139600/4588595) loss:2.978 lr:0.0000100 epoch_Time:28067.0min: [2024-01-03 08:20:28,042][model8_pretrain.py][INFO] Epoch:[0/2](139600/4588595) loss:3.344 lr:0.0000100 epoch_Time:28067.0min: [2024-01-03 08:20:28,042][model8_pretrain.py][INFO] Epoch:[0/2](139600/4588595) loss:2.870 lr:0.0000100 epoch_Time:28067.0min: [2024-01-03 08:21:04,965][model8_pretrain.py][INFO] Epoch:[0/2](139700/4588595) loss:2.862 lr:0.0000100 epoch_Time:28065.0min: [2024-01-03 08:21:04,965][model8_pretrain.py][INFO] Epoch:[0/2](139700/4588595) loss:3.035 lr:0.0000100 epoch_Time:28065.0min: [2024-01-03 08:21:04,965][model8_pretrain.py][INFO] Epoch:[0/2](139700/4588595) loss:2.624 lr:0.0000100 epoch_Time:28065.0min: [2024-01-03 08:21:04,965][model8_pretrain.py][INFO] Epoch:[0/2](139700/4588595) loss:3.158 lr:0.0000100 epoch_Time:28065.0min: [2024-01-03 08:21:04,965][model8_pretrain.py][INFO] Epoch:[0/2](139700/4588595) loss:2.880 lr:0.0000100 epoch_Time:28065.0min: [2024-01-03 08:21:04,965][model8_pretrain.py][INFO] Epoch:[0/2](139700/4588595) loss:3.077 lr:0.0000100 epoch_Time:28065.0min: [2024-01-03 08:21:04,965][model8_pretrain.py][INFO] Epoch:[0/2](139700/4588595) loss:2.816 lr:0.0000100 epoch_Time:28065.0min: [2024-01-03 08:21:04,966][model8_pretrain.py][INFO] Epoch:[0/2](139700/4588595) loss:3.407 lr:0.0000100 epoch_Time:28065.0min: [2024-01-03 08:21:41,896][model8_pretrain.py][INFO] Epoch:[0/2](139800/4588595) loss:3.208 lr:0.0000100 epoch_Time:28065.0min: [2024-01-03 08:21:41,896][model8_pretrain.py][INFO] Epoch:[0/2](139800/4588595) loss:3.058 lr:0.0000100 epoch_Time:28065.0min: [2024-01-03 
08:21:41,896][model8_pretrain.py][INFO] Epoch:[0/2](139800/4588595) loss:3.324 lr:0.0000100 epoch_Time:28065.0min: [2024-01-03 08:21:41,896][model8_pretrain.py][INFO] Epoch:[0/2](139800/4588595) loss:3.607 lr:0.0000100 epoch_Time:28065.0min: [2024-01-03 08:21:41,896][model8_pretrain.py][INFO] Epoch:[0/2](139800/4588595) loss:3.321 lr:0.0000100 epoch_Time:28065.0min: [2024-01-03 08:21:41,896][model8_pretrain.py][INFO] Epoch:[0/2](139800/4588595) loss:2.239 lr:0.0000100 epoch_Time:28065.0min: [2024-01-03 08:21:41,896][model8_pretrain.py][INFO] Epoch:[0/2](139800/4588595) loss:3.172 lr:0.0000100 epoch_Time:28065.0min: [2024-01-03 08:21:41,896][model8_pretrain.py][INFO] Epoch:[0/2](139800/4588595) loss:2.467 lr:0.0000100 epoch_Time:28065.0min: [2024-01-03 08:22:18,828][model8_pretrain.py][INFO] Epoch:[0/2](139900/4588595) loss:2.712 lr:0.0000100 epoch_Time:28063.0min: [2024-01-03 08:22:18,828][model8_pretrain.py][INFO] Epoch:[0/2](139900/4588595) loss:2.853 lr:0.0000100 epoch_Time:28063.0min: [2024-01-03 08:22:18,828][model8_pretrain.py][INFO] Epoch:[0/2](139900/4588595) loss:3.183 lr:0.0000100 epoch_Time:28063.0min: [2024-01-03 08:22:18,828][model8_pretrain.py][INFO] Epoch:[0/2](139900/4588595) loss:2.397 lr:0.0000100 epoch_Time:28063.0min: [2024-01-03 08:22:18,828][model8_pretrain.py][INFO] Epoch:[0/2](139900/4588595) loss:3.377 lr:0.0000100 epoch_Time:28063.0min: [2024-01-03 08:22:18,828][model8_pretrain.py][INFO] Epoch:[0/2](139900/4588595) loss:2.708 lr:0.0000100 epoch_Time:28063.0min: [2024-01-03 08:22:18,828][model8_pretrain.py][INFO] Epoch:[0/2](139900/4588595) loss:3.113 lr:0.0000100 epoch_Time:28063.0min: [2024-01-03 08:22:18,828][model8_pretrain.py][INFO] Epoch:[0/2](139900/4588595) loss:3.142 lr:0.0000100 epoch_Time:28063.0min: [2024-01-03 08:22:55,764][model8_pretrain.py][INFO] Epoch:[0/2](140000/4588595) loss:3.286 lr:0.0000100 epoch_Time:28062.0min: [2024-01-03 08:22:55,764][model8_pretrain.py][INFO] Epoch:[0/2](140000/4588595) loss:2.977 lr:0.0000100 
epoch_Time:28062.0min: [2024-01-03 08:22:55,764][model8_pretrain.py][INFO] Epoch:[0/2](140000/4588595) loss:3.005 lr:0.0000100 epoch_Time:28062.0min: [2024-01-03 08:22:55,764][model8_pretrain.py][INFO] Epoch:[0/2](140000/4588595) loss:2.835 lr:0.0000100 epoch_Time:28062.0min: [2024-01-03 08:22:55,764][model8_pretrain.py][INFO] Epoch:[0/2](140000/4588595) loss:2.743 lr:0.0000100 epoch_Time:28062.0min: [2024-01-03 08:22:55,764][model8_pretrain.py][INFO] Epoch:[0/2](140000/4588595) loss:3.077 lr:0.0000100 epoch_Time:28062.0min: [2024-01-03 08:22:55,764][model8_pretrain.py][INFO] Epoch:[0/2](140000/4588595) loss:2.909 lr:0.0000100 epoch_Time:28062.0min: [2024-01-03 08:22:55,764][model8_pretrain.py][INFO] Epoch:[0/2](140000/4588595) loss:2.843 lr:0.0000100 epoch_Time:28062.0min: [2024-01-03 08:23:32,703][model8_pretrain.py][INFO] Epoch:[0/2](140100/4588595) loss:3.046 lr:0.0000100 epoch_Time:28061.0min: [2024-01-03 08:23:32,703][model8_pretrain.py][INFO] Epoch:[0/2](140100/4588595) loss:3.464 lr:0.0000100 epoch_Time:28061.0min: [2024-01-03 08:23:32,703][model8_pretrain.py][INFO] Epoch:[0/2](140100/4588595) loss:2.965 lr:0.0000100 epoch_Time:28061.0min: [2024-01-03 08:23:32,703][model8_pretrain.py][INFO] Epoch:[0/2](140100/4588595) loss:2.585 lr:0.0000100 epoch_Time:28061.0min: [2024-01-03 08:23:32,703][model8_pretrain.py][INFO] Epoch:[0/2](140100/4588595) loss:2.952 lr:0.0000100 epoch_Time:28061.0min: [2024-01-03 08:23:32,703][model8_pretrain.py][INFO] Epoch:[0/2](140100/4588595) loss:2.749 lr:0.0000100 epoch_Time:28061.0min: [2024-01-03 08:23:32,703][model8_pretrain.py][INFO] Epoch:[0/2](140100/4588595) loss:2.560 lr:0.0000100 epoch_Time:28061.0min: [2024-01-03 08:23:32,703][model8_pretrain.py][INFO] Epoch:[0/2](140100/4588595) loss:3.094 lr:0.0000100 epoch_Time:28061.0min: [2024-01-03 08:24:09,641][model8_pretrain.py][INFO] Epoch:[0/2](140200/4588595) loss:2.513 lr:0.0000100 epoch_Time:28060.0min: [2024-01-03 08:24:09,642][model8_pretrain.py][INFO] 
Epoch:[0/2](140200/4588595) loss:3.145 lr:0.0000100 epoch_Time:28060.0min: [2024-01-03 08:24:09,642][model8_pretrain.py][INFO] Epoch:[0/2](140200/4588595) loss:2.804 lr:0.0000100 epoch_Time:28060.0min: [2024-01-03 08:24:09,642][model8_pretrain.py][INFO] Epoch:[0/2](140200/4588595) loss:2.799 lr:0.0000100 epoch_Time:28060.0min: [2024-01-03 08:24:09,642][model8_pretrain.py][INFO] Epoch:[0/2](140200/4588595) loss:2.970 lr:0.0000100 epoch_Time:28060.0min: [2024-01-03 08:24:09,642][model8_pretrain.py][INFO] Epoch:[0/2](140200/4588595) loss:3.561 lr:0.0000100 epoch_Time:28060.0min: [2024-01-03 08:24:09,642][model8_pretrain.py][INFO] Epoch:[0/2](140200/4588595) loss:2.129 lr:0.0000100 epoch_Time:28060.0min: [2024-01-03 08:24:09,642][model8_pretrain.py][INFO] Epoch:[0/2](140200/4588595) loss:3.151 lr:0.0000100 epoch_Time:28060.0min: [2024-01-03 08:24:55,035][model8_pretrain.py][INFO] Epoch:[0/2](140300/4588595) loss:3.046 lr:0.0000100 epoch_Time:28063.0min: [2024-01-03 08:24:55,035][model8_pretrain.py][INFO] Epoch:[0/2](140300/4588595) loss:2.755 lr:0.0000100 epoch_Time:28063.0min: [2024-01-03 08:24:55,035][model8_pretrain.py][INFO] Epoch:[0/2](140300/4588595) loss:3.383 lr:0.0000100 epoch_Time:28063.0min: [2024-01-03 08:24:55,035][model8_pretrain.py][INFO] Epoch:[0/2](140300/4588595) loss:2.696 lr:0.0000100 epoch_Time:28063.0min: [2024-01-03 08:24:55,035][model8_pretrain.py][INFO] Epoch:[0/2](140300/4588595) loss:3.005 lr:0.0000100 epoch_Time:28063.0min: [2024-01-03 08:24:55,035][model8_pretrain.py][INFO] Epoch:[0/2](140300/4588595) loss:3.135 lr:0.0000100 epoch_Time:28063.0min: [2024-01-03 08:24:55,035][model8_pretrain.py][INFO] Epoch:[0/2](140300/4588595) loss:3.027 lr:0.0000100 epoch_Time:28063.0min: [2024-01-03 08:24:55,035][model8_pretrain.py][INFO] Epoch:[0/2](140300/4588595) loss:3.172 lr:0.0000100 epoch_Time:28063.0min: [2024-01-03 08:25:31,961][model8_pretrain.py][INFO] Epoch:[0/2](140400/4588595) loss:2.191 lr:0.0000100 epoch_Time:28062.0min: [2024-01-03 
08:25:31,961][model8_pretrain.py][INFO] Epoch:[0/2](140400/4588595) loss:3.195 lr:0.0000100 epoch_Time:28062.0min: [2024-01-03 08:25:31,961][model8_pretrain.py][INFO] Epoch:[0/2](140400/4588595) loss:3.248 lr:0.0000100 epoch_Time:28062.0min: [2024-01-03 08:25:31,961][model8_pretrain.py][INFO] Epoch:[0/2](140400/4588595) loss:3.104 lr:0.0000100 epoch_Time:28062.0min: [2024-01-03 08:25:31,961][model8_pretrain.py][INFO] Epoch:[0/2](140400/4588595) loss:3.272 lr:0.0000100 epoch_Time:28062.0min: [2024-01-03 08:25:31,961][model8_pretrain.py][INFO] Epoch:[0/2](140400/4588595) loss:2.623 lr:0.0000100 epoch_Time:28062.0min: [2024-01-03 08:25:31,962][model8_pretrain.py][INFO] Epoch:[0/2](140400/4588595) loss:2.493 lr:0.0000100 epoch_Time:28062.0min: [2024-01-03 08:25:31,962][model8_pretrain.py][INFO] Epoch:[0/2](140400/4588595) loss:3.304 lr:0.0000100 epoch_Time:28062.0min: [2024-01-03 08:26:08,893][model8_pretrain.py][INFO] Epoch:[0/2](140500/4588595) loss:3.173 lr:0.0000100 epoch_Time:28061.0min: [2024-01-03 08:26:08,893][model8_pretrain.py][INFO] Epoch:[0/2](140500/4588595) loss:3.329 lr:0.0000100 epoch_Time:28061.0min: [2024-01-03 08:26:08,893][model8_pretrain.py][INFO] Epoch:[0/2](140500/4588595) loss:2.825 lr:0.0000100 epoch_Time:28061.0min: [2024-01-03 08:26:08,894][model8_pretrain.py][INFO] Epoch:[0/2](140500/4588595) loss:3.151 lr:0.0000100 epoch_Time:28061.0min: [2024-01-03 08:26:08,894][model8_pretrain.py][INFO] Epoch:[0/2](140500/4588595) loss:2.928 lr:0.0000100 epoch_Time:28061.0min: [2024-01-03 08:26:08,894][model8_pretrain.py][INFO] Epoch:[0/2](140500/4588595) loss:2.443 lr:0.0000100 epoch_Time:28061.0min: [2024-01-03 08:26:08,893][model8_pretrain.py][INFO] Epoch:[0/2](140500/4588595) loss:2.709 lr:0.0000100 epoch_Time:28061.0min: [2024-01-03 08:26:08,894][model8_pretrain.py][INFO] Epoch:[0/2](140500/4588595) loss:3.166 lr:0.0000100 epoch_Time:28061.0min: [2024-01-03 08:26:45,838][model8_pretrain.py][INFO] Epoch:[0/2](140600/4588595) loss:2.753 lr:0.0000100 
epoch_Time:28060.0min: [2024-01-03 08:26:45,838][model8_pretrain.py][INFO] Epoch:[0/2](140600/4588595) loss:3.213 lr:0.0000100 epoch_Time:28060.0min: [2024-01-03 08:26:45,838][model8_pretrain.py][INFO] Epoch:[0/2](140600/4588595) loss:2.818 lr:0.0000100 epoch_Time:28060.0min: [2024-01-03 08:26:45,838][model8_pretrain.py][INFO] Epoch:[0/2](140600/4588595) loss:3.092 lr:0.0000100 epoch_Time:28060.0min: [2024-01-03 08:26:45,838][model8_pretrain.py][INFO] Epoch:[0/2](140600/4588595) loss:3.017 lr:0.0000100 epoch_Time:28060.0min: [2024-01-03 08:26:45,839][model8_pretrain.py][INFO] Epoch:[0/2](140600/4588595) loss:2.994 lr:0.0000100 epoch_Time:28060.0min: [2024-01-03 08:26:45,839][model8_pretrain.py][INFO] Epoch:[0/2](140600/4588595) loss:2.735 lr:0.0000100 epoch_Time:28060.0min: [2024-01-03 08:26:45,840][model8_pretrain.py][INFO] Epoch:[0/2](140600/4588595) loss:3.248 lr:0.0000100 epoch_Time:28060.0min: [2024-01-03 08:27:22,798][model8_pretrain.py][INFO] Epoch:[0/2](140700/4588595) loss:2.634 lr:0.0000100 epoch_Time:28059.0min: [2024-01-03 08:27:22,798][model8_pretrain.py][INFO] Epoch:[0/2](140700/4588595) loss:2.571 lr:0.0000100 epoch_Time:28059.0min: [2024-01-03 08:27:22,798][model8_pretrain.py][INFO] Epoch:[0/2](140700/4588595) loss:3.156 lr:0.0000100 epoch_Time:28059.0min: [2024-01-03 08:27:22,798][model8_pretrain.py][INFO] Epoch:[0/2](140700/4588595) loss:3.115 lr:0.0000100 epoch_Time:28059.0min: [2024-01-03 08:27:22,798][model8_pretrain.py][INFO] Epoch:[0/2](140700/4588595) loss:3.161 lr:0.0000100 epoch_Time:28059.0min: [2024-01-03 08:27:22,798][model8_pretrain.py][INFO] Epoch:[0/2](140700/4588595) loss:3.147 lr:0.0000100 epoch_Time:28059.0min: [2024-01-03 08:27:22,798][model8_pretrain.py][INFO] Epoch:[0/2](140700/4588595) loss:2.825 lr:0.0000100 epoch_Time:28059.0min: [2024-01-03 08:27:22,799][model8_pretrain.py][INFO] Epoch:[0/2](140700/4588595) loss:2.588 lr:0.0000100 epoch_Time:28059.0min: [2024-01-03 08:27:59,740][model8_pretrain.py][INFO] 
Epoch:[0/2](140800/4588595) loss:3.411 lr:0.0000100 epoch_Time:28057.0min: [2024-01-03 08:27:59,740][model8_pretrain.py][INFO] Epoch:[0/2](140800/4588595) loss:2.686 lr:0.0000100 epoch_Time:28057.0min: [2024-01-03 08:27:59,740][model8_pretrain.py][INFO] Epoch:[0/2](140800/4588595) loss:3.241 lr:0.0000100 epoch_Time:28057.0min: [2024-01-03 08:27:59,740][model8_pretrain.py][INFO] Epoch:[0/2](140800/4588595) loss:3.095 lr:0.0000100 epoch_Time:28057.0min: [2024-01-03 08:27:59,740][model8_pretrain.py][INFO] Epoch:[0/2](140800/4588595) loss:3.374 lr:0.0000100 epoch_Time:28057.0min: [2024-01-03 08:27:59,740][model8_pretrain.py][INFO] Epoch:[0/2](140800/4588595) loss:3.098 lr:0.0000100 epoch_Time:28057.0min: [2024-01-03 08:27:59,740][model8_pretrain.py][INFO] Epoch:[0/2](140800/4588595) loss:3.474 lr:0.0000100 epoch_Time:28057.0min: [2024-01-03 08:27:59,741][model8_pretrain.py][INFO] Epoch:[0/2](140800/4588595) loss:2.775 lr:0.0000100 epoch_Time:28057.0min: [2024-01-03 08:28:36,673][model8_pretrain.py][INFO] Epoch:[0/2](140900/4588595) loss:3.363 lr:0.0000100 epoch_Time:28057.0min: [2024-01-03 08:28:36,673][model8_pretrain.py][INFO] Epoch:[0/2](140900/4588595) loss:2.980 lr:0.0000100 epoch_Time:28057.0min: [2024-01-03 08:28:36,673][model8_pretrain.py][INFO] Epoch:[0/2](140900/4588595) loss:2.861 lr:0.0000100 epoch_Time:28057.0min: [2024-01-03 08:28:36,673][model8_pretrain.py][INFO] Epoch:[0/2](140900/4588595) loss:2.717 lr:0.0000100 epoch_Time:28057.0min: [2024-01-03 08:28:36,673][model8_pretrain.py][INFO] Epoch:[0/2](140900/4588595) loss:2.570 lr:0.0000100 epoch_Time:28057.0min: [2024-01-03 08:28:36,673][model8_pretrain.py][INFO] Epoch:[0/2](140900/4588595) loss:3.198 lr:0.0000100 epoch_Time:28057.0min: [2024-01-03 08:28:36,673][model8_pretrain.py][INFO] Epoch:[0/2](140900/4588595) loss:3.156 lr:0.0000100 epoch_Time:28057.0min: [2024-01-03 08:28:36,673][model8_pretrain.py][INFO] Epoch:[0/2](140900/4588595) loss:2.993 lr:0.0000100 epoch_Time:28057.0min: [2024-01-03 
08:29:13,614][model8_pretrain.py][INFO] Epoch:[0/2](141000/4588595) loss:2.605 lr:0.0000100 epoch_Time:28055.0min: [2024-01-03 08:29:13,614][model8_pretrain.py][INFO] Epoch:[0/2](141000/4588595) loss:2.653 lr:0.0000100 epoch_Time:28055.0min: [2024-01-03 08:29:13,614][model8_pretrain.py][INFO] Epoch:[0/2](141000/4588595) loss:2.784 lr:0.0000100 epoch_Time:28055.0min: [2024-01-03 08:29:13,614][model8_pretrain.py][INFO] Epoch:[0/2](141000/4588595) loss:2.826 lr:0.0000100 epoch_Time:28055.0min: [2024-01-03 08:29:13,614][model8_pretrain.py][INFO] Epoch:[0/2](141000/4588595) loss:3.019 lr:0.0000100 epoch_Time:28055.0min: [2024-01-03 08:29:13,614][model8_pretrain.py][INFO] Epoch:[0/2](141000/4588595) loss:3.376 lr:0.0000100 epoch_Time:28055.0min: [2024-01-03 08:29:13,614][model8_pretrain.py][INFO] Epoch:[0/2](141000/4588595) loss:3.010 lr:0.0000100 epoch_Time:28055.0min: [2024-01-03 08:29:13,614][model8_pretrain.py][INFO] Epoch:[0/2](141000/4588595) loss:2.959 lr:0.0000100 epoch_Time:28055.0min: [2024-01-03 08:29:59,321][model8_pretrain.py][INFO] Epoch:[0/2](141100/4588595) loss:2.933 lr:0.0000100 epoch_Time:28059.0min: [2024-01-03 08:29:59,321][model8_pretrain.py][INFO] Epoch:[0/2](141100/4588595) loss:3.084 lr:0.0000100 epoch_Time:28059.0min: [2024-01-03 08:29:59,321][model8_pretrain.py][INFO] Epoch:[0/2](141100/4588595) loss:2.989 lr:0.0000100 epoch_Time:28059.0min: [2024-01-03 08:29:59,321][model8_pretrain.py][INFO] Epoch:[0/2](141100/4588595) loss:3.218 lr:0.0000100 epoch_Time:28059.0min: [2024-01-03 08:29:59,321][model8_pretrain.py][INFO] Epoch:[0/2](141100/4588595) loss:3.269 lr:0.0000100 epoch_Time:28059.0min: [2024-01-03 08:29:59,321][model8_pretrain.py][INFO] Epoch:[0/2](141100/4588595) loss:2.943 lr:0.0000100 epoch_Time:28059.0min: [2024-01-03 08:29:59,321][model8_pretrain.py][INFO] Epoch:[0/2](141100/4588595) loss:3.545 lr:0.0000100 epoch_Time:28059.0min: [2024-01-03 08:29:59,321][model8_pretrain.py][INFO] Epoch:[0/2](141100/4588595) loss:3.029 lr:0.0000100 
epoch_Time:28059.0min: [2024-01-03 08:30:36,255][model8_pretrain.py][INFO] Epoch:[0/2](141200/4588595) loss:3.576 lr:0.0000100 epoch_Time:28058.0min: [2024-01-03 08:30:36,255][model8_pretrain.py][INFO] Epoch:[0/2](141200/4588595) loss:3.152 lr:0.0000100 epoch_Time:28058.0min: [2024-01-03 08:30:36,255][model8_pretrain.py][INFO] Epoch:[0/2](141200/4588595) loss:3.233 lr:0.0000100 epoch_Time:28058.0min: [2024-01-03 08:30:36,255][model8_pretrain.py][INFO] Epoch:[0/2](141200/4588595) loss:3.348 lr:0.0000100 epoch_Time:28058.0min: [2024-01-03 08:30:36,255][model8_pretrain.py][INFO] Epoch:[0/2](141200/4588595) loss:3.251 lr:0.0000100 epoch_Time:28058.0min: [2024-01-03 08:30:36,255][model8_pretrain.py][INFO] Epoch:[0/2](141200/4588595) loss:2.779 lr:0.0000100 epoch_Time:28058.0min: [2024-01-03 08:30:36,255][model8_pretrain.py][INFO] Epoch:[0/2](141200/4588595) loss:3.458 lr:0.0000100 epoch_Time:28058.0min: [2024-01-03 08:30:36,255][model8_pretrain.py][INFO] Epoch:[0/2](141200/4588595) loss:2.929 lr:0.0000100 epoch_Time:28058.0min: [2024-01-03 08:31:13,194][model8_pretrain.py][INFO] Epoch:[0/2](141300/4588595) loss:2.707 lr:0.0000100 epoch_Time:28057.0min: [2024-01-03 08:31:13,194][model8_pretrain.py][INFO] Epoch:[0/2](141300/4588595) loss:3.131 lr:0.0000100 epoch_Time:28057.0min: [2024-01-03 08:31:13,194][model8_pretrain.py][INFO] Epoch:[0/2](141300/4588595) loss:2.988 lr:0.0000100 epoch_Time:28057.0min: [2024-01-03 08:31:13,194][model8_pretrain.py][INFO] Epoch:[0/2](141300/4588595) loss:3.306 lr:0.0000100 epoch_Time:28057.0min: [2024-01-03 08:31:13,194][model8_pretrain.py][INFO] Epoch:[0/2](141300/4588595) loss:2.823 lr:0.0000100 epoch_Time:28057.0min: [2024-01-03 08:31:13,194][model8_pretrain.py][INFO] Epoch:[0/2](141300/4588595) loss:3.055 lr:0.0000100 epoch_Time:28057.0min: [2024-01-03 08:31:13,194][model8_pretrain.py][INFO] Epoch:[0/2](141300/4588595) loss:2.880 lr:0.0000100 epoch_Time:28057.0min: [2024-01-03 08:31:13,194][model8_pretrain.py][INFO] 
Epoch:[0/2](141300/4588595) loss:3.119 lr:0.0000100 epoch_Time:28057.0min: [2024-01-03 08:31:50,135][model8_pretrain.py][INFO] Epoch:[0/2](141400/4588595) loss:3.225 lr:0.0000100 epoch_Time:28055.0min: [2024-01-03 08:31:50,135][model8_pretrain.py][INFO] Epoch:[0/2](141400/4588595) loss:3.043 lr:0.0000100 epoch_Time:28055.0min: [2024-01-03 08:31:50,135][model8_pretrain.py][INFO] Epoch:[0/2](141400/4588595) loss:3.152 lr:0.0000100 epoch_Time:28055.0min: [2024-01-03 08:31:50,135][model8_pretrain.py][INFO] Epoch:[0/2](141400/4588595) loss:2.998 lr:0.0000100 epoch_Time:28055.0min: [2024-01-03 08:31:50,135][model8_pretrain.py][INFO] Epoch:[0/2](141400/4588595) loss:3.084 lr:0.0000100 epoch_Time:28055.0min: [2024-01-03 08:31:50,135][model8_pretrain.py][INFO] Epoch:[0/2](141400/4588595) loss:3.131 lr:0.0000100 epoch_Time:28055.0min: [2024-01-03 08:31:50,135][model8_pretrain.py][INFO] Epoch:[0/2](141400/4588595) loss:2.956 lr:0.0000100 epoch_Time:28055.0min: [2024-01-03 08:31:50,135][model8_pretrain.py][INFO] Epoch:[0/2](141400/4588595) loss:3.246 lr:0.0000100 epoch_Time:28055.0min: [2024-01-03 08:32:27,061][model8_pretrain.py][INFO] Epoch:[0/2](141500/4588595) loss:2.384 lr:0.0000100 epoch_Time:28055.0min: [2024-01-03 08:32:27,061][model8_pretrain.py][INFO] Epoch:[0/2](141500/4588595) loss:2.042 lr:0.0000100 epoch_Time:28055.0min: [2024-01-03 08:32:27,061][model8_pretrain.py][INFO] Epoch:[0/2](141500/4588595) loss:2.434 lr:0.0000100 epoch_Time:28055.0min: [2024-01-03 08:32:27,061][model8_pretrain.py][INFO] Epoch:[0/2](141500/4588595) loss:3.362 lr:0.0000100 epoch_Time:28055.0min: [2024-01-03 08:32:27,061][model8_pretrain.py][INFO] Epoch:[0/2](141500/4588595) loss:3.181 lr:0.0000100 epoch_Time:28055.0min: [2024-01-03 08:32:27,061][model8_pretrain.py][INFO] Epoch:[0/2](141500/4588595) loss:2.757 lr:0.0000100 epoch_Time:28055.0min: [2024-01-03 08:32:27,061][model8_pretrain.py][INFO] Epoch:[0/2](141500/4588595) loss:2.858 lr:0.0000100 epoch_Time:28055.0min: [2024-01-03 
08:32:27,061][model8_pretrain.py][INFO] Epoch:[0/2](141500/4588595) loss:3.229 lr:0.0000100 epoch_Time:28055.0min: [2024-01-03 08:33:03,994][model8_pretrain.py][INFO] Epoch:[0/2](141600/4588595) loss:3.193 lr:0.0000100 epoch_Time:28053.0min: [2024-01-03 08:33:03,994][model8_pretrain.py][INFO] Epoch:[0/2](141600/4588595) loss:3.152 lr:0.0000100 epoch_Time:28053.0min: [2024-01-03 08:33:03,994][model8_pretrain.py][INFO] Epoch:[0/2](141600/4588595) loss:2.839 lr:0.0000100 epoch_Time:28053.0min: [2024-01-03 08:33:03,994][model8_pretrain.py][INFO] Epoch:[0/2](141600/4588595) loss:2.895 lr:0.0000100 epoch_Time:28053.0min: [2024-01-03 08:33:03,994][model8_pretrain.py][INFO] Epoch:[0/2](141600/4588595) loss:2.899 lr:0.0000100 epoch_Time:28053.0min: [2024-01-03 08:33:03,994][model8_pretrain.py][INFO] Epoch:[0/2](141600/4588595) loss:2.894 lr:0.0000100 epoch_Time:28053.0min: [2024-01-03 08:33:03,994][model8_pretrain.py][INFO] Epoch:[0/2](141600/4588595) loss:2.796 lr:0.0000100 epoch_Time:28053.0min: [2024-01-03 08:33:03,994][model8_pretrain.py][INFO] Epoch:[0/2](141600/4588595) loss:3.492 lr:0.0000100 epoch_Time:28053.0min: [2024-01-03 08:33:40,937][model8_pretrain.py][INFO] Epoch:[0/2](141700/4588595) loss:3.261 lr:0.0000100 epoch_Time:28053.0min: [2024-01-03 08:33:40,937][model8_pretrain.py][INFO] Epoch:[0/2](141700/4588595) loss:3.237 lr:0.0000100 epoch_Time:28053.0min: [2024-01-03 08:33:40,938][model8_pretrain.py][INFO] Epoch:[0/2](141700/4588595) loss:3.277 lr:0.0000100 epoch_Time:28053.0min: [2024-01-03 08:33:40,938][model8_pretrain.py][INFO] Epoch:[0/2](141700/4588595) loss:3.081 lr:0.0000100 epoch_Time:28053.0min: [2024-01-03 08:33:40,938][model8_pretrain.py][INFO] Epoch:[0/2](141700/4588595) loss:2.854 lr:0.0000100 epoch_Time:28053.0min: [2024-01-03 08:33:40,938][model8_pretrain.py][INFO] Epoch:[0/2](141700/4588595) loss:2.418 lr:0.0000100 epoch_Time:28053.0min: [2024-01-03 08:33:40,938][model8_pretrain.py][INFO] Epoch:[0/2](141700/4588595) loss:2.761 lr:0.0000100 
epoch_Time:28053.0min: [2024-01-03 08:33:40,938][model8_pretrain.py][INFO] Epoch:[0/2](141700/4588595) loss:3.429 lr:0.0000100 epoch_Time:28053.0min: [2024-01-03 08:34:17,879][model8_pretrain.py][INFO] Epoch:[0/2](141800/4588595) loss:3.479 lr:0.0000100 epoch_Time:28051.0min: [2024-01-03 08:34:17,879][model8_pretrain.py][INFO] Epoch:[0/2](141800/4588595) loss:3.133 lr:0.0000100 epoch_Time:28051.0min: [2024-01-03 08:34:17,879][model8_pretrain.py][INFO] Epoch:[0/2](141800/4588595) loss:3.174 lr:0.0000100 epoch_Time:28051.0min: [2024-01-03 08:34:17,880][model8_pretrain.py][INFO] Epoch:[0/2](141800/4588595) loss:3.142 lr:0.0000100 epoch_Time:28051.0min: [2024-01-03 08:34:17,880][model8_pretrain.py][INFO] Epoch:[0/2](141800/4588595) loss:2.779 lr:0.0000100 epoch_Time:28051.0min: [2024-01-03 08:34:17,880][model8_pretrain.py][INFO] Epoch:[0/2](141800/4588595) loss:2.851 lr:0.0000100 epoch_Time:28051.0min: [2024-01-03 08:34:17,880][model8_pretrain.py][INFO] Epoch:[0/2](141800/4588595) loss:2.916 lr:0.0000100 epoch_Time:28051.0min: [2024-01-03 08:34:17,880][model8_pretrain.py][INFO] Epoch:[0/2](141800/4588595) loss:2.681 lr:0.0000100 epoch_Time:28051.0min: [2024-01-03 08:35:03,190][model8_pretrain.py][INFO] Epoch:[0/2](141900/4588595) loss:2.962 lr:0.0000100 epoch_Time:28054.0min: [2024-01-03 08:35:03,190][model8_pretrain.py][INFO] Epoch:[0/2](141900/4588595) loss:2.880 lr:0.0000100 epoch_Time:28054.0min: [2024-01-03 08:35:03,190][model8_pretrain.py][INFO] Epoch:[0/2](141900/4588595) loss:3.058 lr:0.0000100 epoch_Time:28054.0min: [2024-01-03 08:35:03,190][model8_pretrain.py][INFO] Epoch:[0/2](141900/4588595) loss:3.383 lr:0.0000100 epoch_Time:28054.0min: [2024-01-03 08:35:03,190][model8_pretrain.py][INFO] Epoch:[0/2](141900/4588595) loss:3.052 lr:0.0000100 epoch_Time:28054.0min: [2024-01-03 08:35:03,190][model8_pretrain.py][INFO] Epoch:[0/2](141900/4588595) loss:2.606 lr:0.0000100 epoch_Time:28054.0min: [2024-01-03 08:35:03,190][model8_pretrain.py][INFO] 
Epoch:[0/2](141900/4588595) loss:3.369 lr:0.0000100 epoch_Time:28054.0min: [2024-01-03 08:35:03,190][model8_pretrain.py][INFO] Epoch:[0/2](141900/4588595) loss:3.271 lr:0.0000100 epoch_Time:28054.0min: [2024-01-03 08:35:40,120][model8_pretrain.py][INFO] Epoch:[0/2](142000/4588595) loss:2.999 lr:0.0000100 epoch_Time:28054.0min: [2024-01-03 08:35:40,120][model8_pretrain.py][INFO] Epoch:[0/2](142000/4588595) loss:3.036 lr:0.0000100 epoch_Time:28054.0min: [2024-01-03 08:35:40,120][model8_pretrain.py][INFO] Epoch:[0/2](142000/4588595) loss:3.329 lr:0.0000100 epoch_Time:28054.0min: [2024-01-03 08:35:40,120][model8_pretrain.py][INFO] Epoch:[0/2](142000/4588595) loss:2.906 lr:0.0000100 epoch_Time:28054.0min: [2024-01-03 08:35:40,120][model8_pretrain.py][INFO] Epoch:[0/2](142000/4588595) loss:2.889 lr:0.0000100 epoch_Time:28054.0min: [2024-01-03 08:35:40,120][model8_pretrain.py][INFO] Epoch:[0/2](142000/4588595) loss:3.379 lr:0.0000100 epoch_Time:28054.0min: [2024-01-03 08:35:40,120][model8_pretrain.py][INFO] Epoch:[0/2](142000/4588595) loss:3.029 lr:0.0000100 epoch_Time:28054.0min: [2024-01-03 08:35:40,120][model8_pretrain.py][INFO] Epoch:[0/2](142000/4588595) loss:2.919 lr:0.0000100 epoch_Time:28054.0min: [2024-01-03 08:36:17,077][model8_pretrain.py][INFO] Epoch:[0/2](142100/4588595) loss:3.003 lr:0.0000100 epoch_Time:28052.0min: [2024-01-03 08:36:17,077][model8_pretrain.py][INFO] Epoch:[0/2](142100/4588595) loss:2.776 lr:0.0000100 epoch_Time:28052.0min: [2024-01-03 08:36:17,077][model8_pretrain.py][INFO] Epoch:[0/2](142100/4588595) loss:2.803 lr:0.0000100 epoch_Time:28052.0min: [2024-01-03 08:36:17,077][model8_pretrain.py][INFO] Epoch:[0/2](142100/4588595) loss:3.171 lr:0.0000100 epoch_Time:28052.0min: [2024-01-03 08:36:17,077][model8_pretrain.py][INFO] Epoch:[0/2](142100/4588595) loss:2.902 lr:0.0000100 epoch_Time:28052.0min: [2024-01-03 08:36:17,077][model8_pretrain.py][INFO] Epoch:[0/2](142100/4588595) loss:3.212 lr:0.0000100 epoch_Time:28052.0min: [2024-01-03 
08:36:17,077][model8_pretrain.py][INFO] Epoch:[0/2](142100/4588595) loss:2.850 lr:0.0000100 epoch_Time:28052.0min: [2024-01-03 08:36:17,077][model8_pretrain.py][INFO] Epoch:[0/2](142100/4588595) loss:3.073 lr:0.0000100 epoch_Time:28052.0min: [2024-01-03 08:36:54,034][model8_pretrain.py][INFO] Epoch:[0/2](142200/4588595) loss:3.144 lr:0.0000100 epoch_Time:28051.0min: [2024-01-03 08:36:54,034][model8_pretrain.py][INFO] Epoch:[0/2](142200/4588595) loss:3.232 lr:0.0000100 epoch_Time:28051.0min: [2024-01-03 08:36:54,034][model8_pretrain.py][INFO] Epoch:[0/2](142200/4588595) loss:3.624 lr:0.0000100 epoch_Time:28051.0min: [2024-01-03 08:36:54,034][model8_pretrain.py][INFO] Epoch:[0/2](142200/4588595) loss:2.683 lr:0.0000100 epoch_Time:28051.0min: [2024-01-03 08:36:54,034][model8_pretrain.py][INFO] Epoch:[0/2](142200/4588595) loss:2.752 lr:0.0000100 epoch_Time:28051.0min: [2024-01-03 08:36:54,034][model8_pretrain.py][INFO] Epoch:[0/2](142200/4588595) loss:2.700 lr:0.0000100 epoch_Time:28051.0min: [2024-01-03 08:36:54,034][model8_pretrain.py][INFO] Epoch:[0/2](142200/4588595) loss:2.734 lr:0.0000100 epoch_Time:28051.0min: [2024-01-03 08:36:54,034][model8_pretrain.py][INFO] Epoch:[0/2](142200/4588595) loss:2.961 lr:0.0000100 epoch_Time:28051.0min: [2024-01-03 08:37:30,992][model8_pretrain.py][INFO] Epoch:[0/2](142300/4588595) loss:3.319 lr:0.0000100 epoch_Time:28050.0min: [2024-01-03 08:37:30,992][model8_pretrain.py][INFO] Epoch:[0/2](142300/4588595) loss:3.205 lr:0.0000100 epoch_Time:28050.0min: [2024-01-03 08:37:30,992][model8_pretrain.py][INFO] Epoch:[0/2](142300/4588595) loss:2.419 lr:0.0000100 epoch_Time:28050.0min: [2024-01-03 08:37:30,992][model8_pretrain.py][INFO] Epoch:[0/2](142300/4588595) loss:2.548 lr:0.0000100 epoch_Time:28050.0min: [2024-01-03 08:37:30,992][model8_pretrain.py][INFO] Epoch:[0/2](142300/4588595) loss:2.446 lr:0.0000100 epoch_Time:28050.0min: [2024-01-03 08:37:30,993][model8_pretrain.py][INFO] Epoch:[0/2](142300/4588595) loss:3.034 lr:0.0000100 
epoch_Time:28050.0min: [2024-01-03 08:37:30,993][model8_pretrain.py][INFO] Epoch:[0/2](142300/4588595) loss:2.930 lr:0.0000100 epoch_Time:28050.0min: [2024-01-03 08:37:30,993][model8_pretrain.py][INFO] Epoch:[0/2](142300/4588595) loss:3.177 lr:0.0000100 epoch_Time:28050.0min: [2024-01-03 08:38:07,935][model8_pretrain.py][INFO] Epoch:[0/2](142400/4588595) loss:2.761 lr:0.0000100 epoch_Time:28049.0min: [2024-01-03 08:38:07,935][model8_pretrain.py][INFO] Epoch:[0/2](142400/4588595) loss:2.837 lr:0.0000100 epoch_Time:28049.0min: [2024-01-03 08:38:07,935][model8_pretrain.py][INFO] Epoch:[0/2](142400/4588595) loss:3.322 lr:0.0000100 epoch_Time:28049.0min: [2024-01-03 08:38:07,935][model8_pretrain.py][INFO] Epoch:[0/2](142400/4588595) loss:2.451 lr:0.0000100 epoch_Time:28049.0min: [2024-01-03 08:38:07,936][model8_pretrain.py][INFO] Epoch:[0/2](142400/4588595) loss:2.519 lr:0.0000100 epoch_Time:28049.0min: [2024-01-03 08:38:07,936][model8_pretrain.py][INFO] Epoch:[0/2](142400/4588595) loss:2.889 lr:0.0000100 epoch_Time:28049.0min: [2024-01-03 08:38:07,936][model8_pretrain.py][INFO] Epoch:[0/2](142400/4588595) loss:3.230 lr:0.0000100 epoch_Time:28049.0min: [2024-01-03 08:38:07,936][model8_pretrain.py][INFO] Epoch:[0/2](142400/4588595) loss:3.138 lr:0.0000100 epoch_Time:28049.0min: [2024-01-03 08:38:44,880][model8_pretrain.py][INFO] Epoch:[0/2](142500/4588595) loss:2.885 lr:0.0000100 epoch_Time:28048.0min: [2024-01-03 08:38:44,880][model8_pretrain.py][INFO] Epoch:[0/2](142500/4588595) loss:2.136 lr:0.0000100 epoch_Time:28048.0min: [2024-01-03 08:38:44,880][model8_pretrain.py][INFO] Epoch:[0/2](142500/4588595) loss:3.131 lr:0.0000100 epoch_Time:28048.0min: [2024-01-03 08:38:44,880][model8_pretrain.py][INFO] Epoch:[0/2](142500/4588595) loss:3.062 lr:0.0000100 epoch_Time:28048.0min: [2024-01-03 08:38:44,880][model8_pretrain.py][INFO] Epoch:[0/2](142500/4588595) loss:2.764 lr:0.0000100 epoch_Time:28048.0min: [2024-01-03 08:38:44,880][model8_pretrain.py][INFO] 
Epoch:[0/2](142500/4588595) loss:3.179 lr:0.0000100 epoch_Time:28048.0min: [2024-01-03 08:38:44,880][model8_pretrain.py][INFO] Epoch:[0/2](142500/4588595) loss:3.248 lr:0.0000100 epoch_Time:28048.0min: [2024-01-03 08:38:44,880][model8_pretrain.py][INFO] Epoch:[0/2](142500/4588595) loss:2.849 lr:0.0000100 epoch_Time:28048.0min: [2024-01-03 08:39:21,815][model8_pretrain.py][INFO] Epoch:[0/2](142600/4588595) loss:2.876 lr:0.0000100 epoch_Time:28047.0min: [2024-01-03 08:39:21,815][model8_pretrain.py][INFO] Epoch:[0/2](142600/4588595) loss:3.011 lr:0.0000100 epoch_Time:28047.0min: [2024-01-03 08:39:21,815][model8_pretrain.py][INFO] Epoch:[0/2](142600/4588595) loss:3.133 lr:0.0000100 epoch_Time:28047.0min: [2024-01-03 08:39:21,815][model8_pretrain.py][INFO] Epoch:[0/2](142600/4588595) loss:3.126 lr:0.0000100 epoch_Time:28047.0min: [2024-01-03 08:39:21,815][model8_pretrain.py][INFO] Epoch:[0/2](142600/4588595) loss:2.547 lr:0.0000100 epoch_Time:28047.0min: [2024-01-03 08:39:21,815][model8_pretrain.py][INFO] Epoch:[0/2](142600/4588595) loss:2.662 lr:0.0000100 epoch_Time:28047.0min: [2024-01-03 08:39:21,815][model8_pretrain.py][INFO] Epoch:[0/2](142600/4588595) loss:2.927 lr:0.0000100 epoch_Time:28047.0min: [2024-01-03 08:39:21,815][model8_pretrain.py][INFO] Epoch:[0/2](142600/4588595) loss:3.030 lr:0.0000100 epoch_Time:28047.0min: [2024-01-03 08:40:07,371][model8_pretrain.py][INFO] Epoch:[0/2](142700/4588595) loss:2.741 lr:0.0000100 epoch_Time:28050.0min: [2024-01-03 08:40:07,371][model8_pretrain.py][INFO] Epoch:[0/2](142700/4588595) loss:2.966 lr:0.0000100 epoch_Time:28050.0min: [2024-01-03 08:40:07,371][model8_pretrain.py][INFO] Epoch:[0/2](142700/4588595) loss:3.368 lr:0.0000100 epoch_Time:28050.0min: [2024-01-03 08:40:07,371][model8_pretrain.py][INFO] Epoch:[0/2](142700/4588595) loss:3.024 lr:0.0000100 epoch_Time:28050.0min: [2024-01-03 08:40:07,371][model8_pretrain.py][INFO] Epoch:[0/2](142700/4588595) loss:3.276 lr:0.0000100 epoch_Time:28050.0min: [2024-01-03 
08:40:07,371][model8_pretrain.py][INFO] Epoch:[0/2](142700/4588595) loss:3.329 lr:0.0000100 epoch_Time:28050.0min: [2024-01-03 08:40:07,371][model8_pretrain.py][INFO] Epoch:[0/2](142700/4588595) loss:2.737 lr:0.0000100 epoch_Time:28050.0min: [2024-01-03 08:40:07,371][model8_pretrain.py][INFO] Epoch:[0/2](142700/4588595) loss:3.019 lr:0.0000100 epoch_Time:28050.0min: [2024-01-03 08:40:44,281][model8_pretrain.py][INFO] Epoch:[0/2](142800/4588595) loss:3.242 lr:0.0000100 epoch_Time:28049.0min: [2024-01-03 08:40:44,281][model8_pretrain.py][INFO] Epoch:[0/2](142800/4588595) loss:2.342 lr:0.0000100 epoch_Time:28049.0min: [2024-01-03 08:40:44,281][model8_pretrain.py][INFO] Epoch:[0/2](142800/4588595) loss:2.779 lr:0.0000100 epoch_Time:28049.0min: [2024-01-03 08:40:44,281][model8_pretrain.py][INFO] Epoch:[0/2](142800/4588595) loss:2.733 lr:0.0000100 epoch_Time:28049.0min: [2024-01-03 08:40:44,281][model8_pretrain.py][INFO] Epoch:[0/2](142800/4588595) loss:2.619 lr:0.0000100 epoch_Time:28049.0min: [2024-01-03 08:40:44,281][model8_pretrain.py][INFO] Epoch:[0/2](142800/4588595) loss:2.887 lr:0.0000100 epoch_Time:28049.0min: [2024-01-03 08:40:44,281][model8_pretrain.py][INFO] Epoch:[0/2](142800/4588595) loss:2.975 lr:0.0000100 epoch_Time:28049.0min: [2024-01-03 08:40:44,281][model8_pretrain.py][INFO] Epoch:[0/2](142800/4588595) loss:2.805 lr:0.0000100 epoch_Time:28049.0min: [2024-01-03 08:41:21,207][model8_pretrain.py][INFO] Epoch:[0/2](142900/4588595) loss:3.234 lr:0.0000100 epoch_Time:28048.0min: [2024-01-03 08:41:21,208][model8_pretrain.py][INFO] Epoch:[0/2](142900/4588595) loss:2.391 lr:0.0000100 epoch_Time:28048.0min: [2024-01-03 08:41:21,208][model8_pretrain.py][INFO] Epoch:[0/2](142900/4588595) loss:2.903 lr:0.0000100 epoch_Time:28048.0min: [2024-01-03 08:41:21,207][model8_pretrain.py][INFO] Epoch:[0/2](142900/4588595) loss:3.192 lr:0.0000100 epoch_Time:28048.0min: [2024-01-03 08:41:21,208][model8_pretrain.py][INFO] Epoch:[0/2](142900/4588595) loss:3.103 lr:0.0000100 
epoch_Time:28048.0min: [2024-01-03 08:41:21,208][model8_pretrain.py][INFO] Epoch:[0/2](142900/4588595) loss:2.729 lr:0.0000100 epoch_Time:28048.0min: [2024-01-03 08:41:21,208][model8_pretrain.py][INFO] Epoch:[0/2](142900/4588595) loss:2.699 lr:0.0000100 epoch_Time:28048.0min: [2024-01-03 08:41:21,208][model8_pretrain.py][INFO] Epoch:[0/2](142900/4588595) loss:3.457 lr:0.0000100 epoch_Time:28048.0min: [2024-01-03 08:41:58,139][model8_pretrain.py][INFO] Epoch:[0/2](143000/4588595) loss:3.136 lr:0.0000100 epoch_Time:28046.0min: [2024-01-03 08:41:58,139][model8_pretrain.py][INFO] Epoch:[0/2](143000/4588595) loss:3.203 lr:0.0000100 epoch_Time:28046.0min: [2024-01-03 08:41:58,139][model8_pretrain.py][INFO] Epoch:[0/2](143000/4588595) loss:2.886 lr:0.0000100 epoch_Time:28046.0min: [2024-01-03 08:41:58,139][model8_pretrain.py][INFO] Epoch:[0/2](143000/4588595) loss:3.248 lr:0.0000100 epoch_Time:28046.0min: [2024-01-03 08:41:58,139][model8_pretrain.py][INFO] Epoch:[0/2](143000/4588595) loss:2.850 lr:0.0000100 epoch_Time:28046.0min: [2024-01-03 08:41:58,139][model8_pretrain.py][INFO] Epoch:[0/2](143000/4588595) loss:2.702 lr:0.0000100 epoch_Time:28046.0min: [2024-01-03 08:41:58,139][model8_pretrain.py][INFO] Epoch:[0/2](143000/4588595) loss:2.441 lr:0.0000100 epoch_Time:28046.0min: [2024-01-03 08:41:58,139][model8_pretrain.py][INFO] Epoch:[0/2](143000/4588595) loss:2.991 lr:0.0000100 epoch_Time:28046.0min: [2024-01-03 08:42:35,072][model8_pretrain.py][INFO] Epoch:[0/2](143100/4588595) loss:3.120 lr:0.0000100 epoch_Time:28046.0min: [2024-01-03 08:42:35,072][model8_pretrain.py][INFO] Epoch:[0/2](143100/4588595) loss:2.864 lr:0.0000100 epoch_Time:28046.0min: [2024-01-03 08:42:35,072][model8_pretrain.py][INFO] Epoch:[0/2](143100/4588595) loss:3.145 lr:0.0000100 epoch_Time:28046.0min: [2024-01-03 08:42:35,072][model8_pretrain.py][INFO] Epoch:[0/2](143100/4588595) loss:2.667 lr:0.0000100 epoch_Time:28046.0min: [2024-01-03 08:42:35,072][model8_pretrain.py][INFO] 
Epoch:[0/2](143100/4588595) loss:3.519 lr:0.0000100 epoch_Time:28046.0min: [2024-01-03 08:42:35,072][model8_pretrain.py][INFO] Epoch:[0/2](143100/4588595) loss:3.189 lr:0.0000100 epoch_Time:28046.0min: [2024-01-03 08:42:35,072][model8_pretrain.py][INFO] Epoch:[0/2](143100/4588595) loss:2.776 lr:0.0000100 epoch_Time:28046.0min: [2024-01-03 08:42:35,072][model8_pretrain.py][INFO] Epoch:[0/2](143100/4588595) loss:3.277 lr:0.0000100 epoch_Time:28046.0min: [2024-01-03 08:43:12,009][model8_pretrain.py][INFO] Epoch:[0/2](143200/4588595) loss:3.157 lr:0.0000100 epoch_Time:28044.0min: [2024-01-03 08:43:12,009][model8_pretrain.py][INFO] Epoch:[0/2](143200/4588595) loss:2.672 lr:0.0000100 epoch_Time:28044.0min: [2024-01-03 08:43:12,009][model8_pretrain.py][INFO] Epoch:[0/2](143200/4588595) loss:2.792 lr:0.0000100 epoch_Time:28044.0min: [2024-01-03 08:43:12,009][model8_pretrain.py][INFO] Epoch:[0/2](143200/4588595) loss:3.226 lr:0.0000100 epoch_Time:28044.0min: [2024-01-03 08:43:12,009][model8_pretrain.py][INFO] Epoch:[0/2](143200/4588595) loss:3.090 lr:0.0000100 epoch_Time:28044.0min: [2024-01-03 08:43:12,009][model8_pretrain.py][INFO] Epoch:[0/2](143200/4588595) loss:2.802 lr:0.0000100 epoch_Time:28044.0min: [2024-01-03 08:43:12,009][model8_pretrain.py][INFO] Epoch:[0/2](143200/4588595) loss:3.044 lr:0.0000100 epoch_Time:28044.0min: [2024-01-03 08:43:12,010][model8_pretrain.py][INFO] Epoch:[0/2](143200/4588595) loss:2.742 lr:0.0000100 epoch_Time:28044.0min: [2024-01-03 08:43:48,936][model8_pretrain.py][INFO] Epoch:[0/2](143300/4588595) loss:3.204 lr:0.0000100 epoch_Time:28043.0min: [2024-01-03 08:43:48,936][model8_pretrain.py][INFO] Epoch:[0/2](143300/4588595) loss:3.242 lr:0.0000100 epoch_Time:28043.0min: [2024-01-03 08:43:48,936][model8_pretrain.py][INFO] Epoch:[0/2](143300/4588595) loss:3.058 lr:0.0000100 epoch_Time:28043.0min: [2024-01-03 08:43:48,936][model8_pretrain.py][INFO] Epoch:[0/2](143300/4588595) loss:3.098 lr:0.0000100 epoch_Time:28043.0min: [2024-01-03 
08:43:48,937][model8_pretrain.py][INFO] Epoch:[0/2](143300/4588595) loss:2.328 lr:0.0000100 epoch_Time:28043.0min: [2024-01-03 08:43:48,937][model8_pretrain.py][INFO] Epoch:[0/2](143300/4588595) loss:2.621 lr:0.0000100 epoch_Time:28043.0min: [2024-01-03 08:43:48,937][model8_pretrain.py][INFO] Epoch:[0/2](143300/4588595) loss:2.753 lr:0.0000100 epoch_Time:28043.0min: [2024-01-03 08:43:48,937][model8_pretrain.py][INFO] Epoch:[0/2](143300/4588595) loss:3.323 lr:0.0000100 epoch_Time:28043.0min: [2024-01-03 08:44:25,865][model8_pretrain.py][INFO] Epoch:[0/2](143400/4588595) loss:3.320 lr:0.0000100 epoch_Time:28042.0min: [2024-01-03 08:44:25,865][model8_pretrain.py][INFO] Epoch:[0/2](143400/4588595) loss:2.546 lr:0.0000100 epoch_Time:28042.0min: [2024-01-03 08:44:25,865][model8_pretrain.py][INFO] Epoch:[0/2](143400/4588595) loss:3.575 lr:0.0000100 epoch_Time:28042.0min: [2024-01-03 08:44:25,865][model8_pretrain.py][INFO] Epoch:[0/2](143400/4588595) loss:2.645 lr:0.0000100 epoch_Time:28042.0min: [2024-01-03 08:44:25,865][model8_pretrain.py][INFO] Epoch:[0/2](143400/4588595) loss:3.115 lr:0.0000100 epoch_Time:28042.0min: [2024-01-03 08:44:25,865][model8_pretrain.py][INFO] Epoch:[0/2](143400/4588595) loss:3.428 lr:0.0000100 epoch_Time:28042.0min: [2024-01-03 08:44:25,865][model8_pretrain.py][INFO] Epoch:[0/2](143400/4588595) loss:3.048 lr:0.0000100 epoch_Time:28042.0min: [2024-01-03 08:44:25,865][model8_pretrain.py][INFO] Epoch:[0/2](143400/4588595) loss:3.018 lr:0.0000100 epoch_Time:28042.0min: [2024-01-03 08:45:11,545][model8_pretrain.py][INFO] Epoch:[0/2](143500/4588595) loss:2.685 lr:0.0000100 epoch_Time:28046.0min: [2024-01-03 08:45:11,545][model8_pretrain.py][INFO] Epoch:[0/2](143500/4588595) loss:3.639 lr:0.0000100 epoch_Time:28046.0min: [2024-01-03 08:45:11,545][model8_pretrain.py][INFO] Epoch:[0/2](143500/4588595) loss:2.857 lr:0.0000100 epoch_Time:28046.0min: [2024-01-03 08:45:11,545][model8_pretrain.py][INFO] Epoch:[0/2](143500/4588595) loss:3.171 lr:0.0000100 
epoch_Time:28046.0min: [2024-01-03 08:45:11,545][model8_pretrain.py][INFO] Epoch:[0/2](143500/4588595) loss:3.151 lr:0.0000100 epoch_Time:28046.0min: [2024-01-03 08:45:11,545][model8_pretrain.py][INFO] Epoch:[0/2](143500/4588595) loss:3.007 lr:0.0000100 epoch_Time:28046.0min: [2024-01-03 08:45:11,546][model8_pretrain.py][INFO] Epoch:[0/2](143500/4588595) loss:3.120 lr:0.0000100 epoch_Time:28046.0min: [2024-01-03 08:45:11,546][model8_pretrain.py][INFO] Epoch:[0/2](143500/4588595) loss:2.676 lr:0.0000100 epoch_Time:28046.0min: [2024-01-03 08:45:48,476][model8_pretrain.py][INFO] Epoch:[0/2](143600/4588595) loss:3.102 lr:0.0000100 epoch_Time:28044.0min: [2024-01-03 08:45:48,476][model8_pretrain.py][INFO] Epoch:[0/2](143600/4588595) loss:2.568 lr:0.0000100 epoch_Time:28044.0min: [2024-01-03 08:45:48,476][model8_pretrain.py][INFO] Epoch:[0/2](143600/4588595) loss:2.792 lr:0.0000100 epoch_Time:28044.0min: [2024-01-03 08:45:48,476][model8_pretrain.py][INFO] Epoch:[0/2](143600/4588595) loss:2.878 lr:0.0000100 epoch_Time:28044.0min: [2024-01-03 08:45:48,476][model8_pretrain.py][INFO] Epoch:[0/2](143600/4588595) loss:3.098 lr:0.0000100 epoch_Time:28044.0min: [2024-01-03 08:45:48,476][model8_pretrain.py][INFO] Epoch:[0/2](143600/4588595) loss:2.803 lr:0.0000100 epoch_Time:28044.0min: [2024-01-03 08:45:48,476][model8_pretrain.py][INFO] Epoch:[0/2](143600/4588595) loss:3.118 lr:0.0000100 epoch_Time:28044.0min: [2024-01-03 08:45:48,476][model8_pretrain.py][INFO] Epoch:[0/2](143600/4588595) loss:2.769 lr:0.0000100 epoch_Time:28044.0min: [2024-01-03 08:46:25,412][model8_pretrain.py][INFO] Epoch:[0/2](143700/4588595) loss:3.063 lr:0.0000100 epoch_Time:28044.0min: [2024-01-03 08:46:25,412][model8_pretrain.py][INFO] Epoch:[0/2](143700/4588595) loss:2.294 lr:0.0000100 epoch_Time:28044.0min: [2024-01-03 08:46:25,412][model8_pretrain.py][INFO] Epoch:[0/2](143700/4588595) loss:3.253 lr:0.0000100 epoch_Time:28044.0min: [2024-01-03 08:46:25,412][model8_pretrain.py][INFO] 
Epoch:[0/2](143700/4588595) loss:2.900 lr:0.0000100 epoch_Time:28044.0min: [2024-01-03 08:46:25,412][model8_pretrain.py][INFO] Epoch:[0/2](143700/4588595) loss:2.564 lr:0.0000100 epoch_Time:28044.0min: [2024-01-03 08:46:25,412][model8_pretrain.py][INFO] Epoch:[0/2](143700/4588595) loss:3.294 lr:0.0000100 epoch_Time:28044.0min: [2024-01-03 08:46:25,412][model8_pretrain.py][INFO] Epoch:[0/2](143700/4588595) loss:2.675 lr:0.0000100 epoch_Time:28044.0min: [2024-01-03 08:46:25,412][model8_pretrain.py][INFO] Epoch:[0/2](143700/4588595) loss:2.935 lr:0.0000100 epoch_Time:28044.0min: [2024-01-03 08:47:02,348][model8_pretrain.py][INFO] Epoch:[0/2](143800/4588595) loss:2.478 lr:0.0000100 epoch_Time:28042.0min: [2024-01-03 08:47:02,348][model8_pretrain.py][INFO] Epoch:[0/2](143800/4588595) loss:3.273 lr:0.0000100 epoch_Time:28042.0min: [2024-01-03 08:47:02,348][model8_pretrain.py][INFO] Epoch:[0/2](143800/4588595) loss:2.757 lr:0.0000100 epoch_Time:28042.0min: [2024-01-03 08:47:02,348][model8_pretrain.py][INFO] Epoch:[0/2](143800/4588595) loss:3.197 lr:0.0000100 epoch_Time:28042.0min: [2024-01-03 08:47:02,348][model8_pretrain.py][INFO] Epoch:[0/2](143800/4588595) loss:3.252 lr:0.0000100 epoch_Time:28042.0min: [2024-01-03 08:47:02,348][model8_pretrain.py][INFO] Epoch:[0/2](143800/4588595) loss:3.058 lr:0.0000100 epoch_Time:28042.0min: [2024-01-03 08:47:02,348][model8_pretrain.py][INFO] Epoch:[0/2](143800/4588595) loss:3.048 lr:0.0000100 epoch_Time:28042.0min: [2024-01-03 08:47:02,348][model8_pretrain.py][INFO] Epoch:[0/2](143800/4588595) loss:2.682 lr:0.0000100 epoch_Time:28042.0min: [2024-01-03 08:47:39,276][model8_pretrain.py][INFO] Epoch:[0/2](143900/4588595) loss:2.632 lr:0.0000100 epoch_Time:28042.0min: [2024-01-03 08:47:39,276][model8_pretrain.py][INFO] Epoch:[0/2](143900/4588595) loss:2.406 lr:0.0000100 epoch_Time:28042.0min: [2024-01-03 08:47:39,276][model8_pretrain.py][INFO] Epoch:[0/2](143900/4588595) loss:2.802 lr:0.0000100 epoch_Time:28042.0min: [2024-01-03 
08:47:39,276][model8_pretrain.py][INFO] Epoch:[0/2](143900/4588595) loss:2.847 lr:0.0000100 epoch_Time:28042.0min: [2024-01-03 08:47:39,276][model8_pretrain.py][INFO] Epoch:[0/2](143900/4588595) loss:3.349 lr:0.0000100 epoch_Time:28042.0min: [2024-01-03 08:47:39,276][model8_pretrain.py][INFO] Epoch:[0/2](143900/4588595) loss:2.576 lr:0.0000100 epoch_Time:28042.0min: [2024-01-03 08:47:39,276][model8_pretrain.py][INFO] Epoch:[0/2](143900/4588595) loss:2.916 lr:0.0000100 epoch_Time:28042.0min: [2024-01-03 08:47:39,276][model8_pretrain.py][INFO] Epoch:[0/2](143900/4588595) loss:3.257 lr:0.0000100 epoch_Time:28042.0min: [2024-01-03 08:48:16,205][model8_pretrain.py][INFO] Epoch:[0/2](144000/4588595) loss:3.175 lr:0.0000100 epoch_Time:28040.0min: [2024-01-03 08:48:16,205][model8_pretrain.py][INFO] Epoch:[0/2](144000/4588595) loss:3.336 lr:0.0000100 epoch_Time:28040.0min: [2024-01-03 08:48:16,205][model8_pretrain.py][INFO] Epoch:[0/2](144000/4588595) loss:3.470 lr:0.0000100 epoch_Time:28040.0min: [2024-01-03 08:48:16,205][model8_pretrain.py][INFO] Epoch:[0/2](144000/4588595) loss:3.283 lr:0.0000100 epoch_Time:28040.0min: [2024-01-03 08:48:16,205][model8_pretrain.py][INFO] Epoch:[0/2](144000/4588595) loss:2.684 lr:0.0000100 epoch_Time:28040.0min: [2024-01-03 08:48:16,205][model8_pretrain.py][INFO] Epoch:[0/2](144000/4588595) loss:3.203 lr:0.0000100 epoch_Time:28040.0min: [2024-01-03 08:48:16,205][model8_pretrain.py][INFO] Epoch:[0/2](144000/4588595) loss:2.892 lr:0.0000100 epoch_Time:28040.0min: [2024-01-03 08:48:16,205][model8_pretrain.py][INFO] Epoch:[0/2](144000/4588595) loss:3.110 lr:0.0000100 epoch_Time:28040.0min: [2024-01-03 08:48:53,144][model8_pretrain.py][INFO] Epoch:[0/2](144100/4588595) loss:3.187 lr:0.0000100 epoch_Time:28039.0min: [2024-01-03 08:48:53,144][model8_pretrain.py][INFO] Epoch:[0/2](144100/4588595) loss:3.258 lr:0.0000100 epoch_Time:28039.0min: [2024-01-03 08:48:53,144][model8_pretrain.py][INFO] Epoch:[0/2](144100/4588595) loss:2.539 lr:0.0000100 
epoch_Time:28039.0min: [2024-01-03 08:48:53,144][model8_pretrain.py][INFO] Epoch:[0/2](144100/4588595) loss:2.960 lr:0.0000100 epoch_Time:28039.0min: [2024-01-03 08:48:53,144][model8_pretrain.py][INFO] Epoch:[0/2](144100/4588595) loss:3.143 lr:0.0000100 epoch_Time:28039.0min: [2024-01-03 08:48:53,144][model8_pretrain.py][INFO] Epoch:[0/2](144100/4588595) loss:3.031 lr:0.0000100 epoch_Time:28039.0min: [2024-01-03 08:48:53,144][model8_pretrain.py][INFO] Epoch:[0/2](144100/4588595) loss:2.847 lr:0.0000100 epoch_Time:28039.0min: [2024-01-03 08:48:53,144][model8_pretrain.py][INFO] Epoch:[0/2](144100/4588595) loss:2.792 lr:0.0000100 epoch_Time:28039.0min: [2024-01-03 08:49:30,058][model8_pretrain.py][INFO] Epoch:[0/2](144200/4588595) loss:3.021 lr:0.0000100 epoch_Time:28038.0min: [2024-01-03 08:49:30,058][model8_pretrain.py][INFO] Epoch:[0/2](144200/4588595) loss:2.805 lr:0.0000100 epoch_Time:28038.0min: [2024-01-03 08:49:30,058][model8_pretrain.py][INFO] Epoch:[0/2](144200/4588595) loss:2.807 lr:0.0000100 epoch_Time:28038.0min: [2024-01-03 08:49:30,058][model8_pretrain.py][INFO] Epoch:[0/2](144200/4588595) loss:3.207 lr:0.0000100 epoch_Time:28038.0min: [2024-01-03 08:49:30,058][model8_pretrain.py][INFO] Epoch:[0/2](144200/4588595) loss:2.973 lr:0.0000100 epoch_Time:28038.0min: [2024-01-03 08:49:30,058][model8_pretrain.py][INFO] Epoch:[0/2](144200/4588595) loss:3.126 lr:0.0000100 epoch_Time:28038.0min: [2024-01-03 08:49:30,058][model8_pretrain.py][INFO] Epoch:[0/2](144200/4588595) loss:3.306 lr:0.0000100 epoch_Time:28038.0min: [2024-01-03 08:49:30,059][model8_pretrain.py][INFO] Epoch:[0/2](144200/4588595) loss:2.914 lr:0.0000100 epoch_Time:28038.0min: [2024-01-03 08:50:15,723][model8_pretrain.py][INFO] Epoch:[0/2](144300/4588595) loss:3.355 lr:0.0000100 epoch_Time:28041.0min: [2024-01-03 08:50:15,723][model8_pretrain.py][INFO] Epoch:[0/2](144300/4588595) loss:2.928 lr:0.0000100 epoch_Time:28041.0min: [2024-01-03 08:50:15,723][model8_pretrain.py][INFO] 
Epoch:[0/2](144300/4588595) loss:3.175 lr:0.0000100 epoch_Time:28041.0min: [2024-01-03 08:50:15,723][model8_pretrain.py][INFO] Epoch:[0/2](144300/4588595) loss:3.102 lr:0.0000100 epoch_Time:28041.0min: [2024-01-03 08:50:15,723][model8_pretrain.py][INFO] Epoch:[0/2](144300/4588595) loss:2.885 lr:0.0000100 epoch_Time:28041.0min: [2024-01-03 08:50:15,723][model8_pretrain.py][INFO] Epoch:[0/2](144300/4588595) loss:3.076 lr:0.0000100 epoch_Time:28041.0min: [2024-01-03 08:50:15,723][model8_pretrain.py][INFO] Epoch:[0/2](144300/4588595) loss:2.713 lr:0.0000100 epoch_Time:28041.0min: [2024-01-03 08:50:15,723][model8_pretrain.py][INFO] Epoch:[0/2](144300/4588595) loss:2.611 lr:0.0000100 epoch_Time:28041.0min: [2024-01-03 08:50:52,642][model8_pretrain.py][INFO] Epoch:[0/2](144400/4588595) loss:2.922 lr:0.0000100 epoch_Time:28040.0min: [2024-01-03 08:50:52,642][model8_pretrain.py][INFO] Epoch:[0/2](144400/4588595) loss:2.752 lr:0.0000100 epoch_Time:28040.0min: [2024-01-03 08:50:52,642][model8_pretrain.py][INFO] Epoch:[0/2](144400/4588595) loss:2.932 lr:0.0000100 epoch_Time:28040.0min: [2024-01-03 08:50:52,642][model8_pretrain.py][INFO] Epoch:[0/2](144400/4588595) loss:3.142 lr:0.0000100 epoch_Time:28040.0min: [2024-01-03 08:50:52,642][model8_pretrain.py][INFO] Epoch:[0/2](144400/4588595) loss:3.402 lr:0.0000100 epoch_Time:28040.0min: [2024-01-03 08:50:52,642][model8_pretrain.py][INFO] Epoch:[0/2](144400/4588595) loss:2.636 lr:0.0000100 epoch_Time:28040.0min: [2024-01-03 08:50:52,642][model8_pretrain.py][INFO] Epoch:[0/2](144400/4588595) loss:2.621 lr:0.0000100 epoch_Time:28040.0min: [2024-01-03 08:50:52,642][model8_pretrain.py][INFO] Epoch:[0/2](144400/4588595) loss:2.975 lr:0.0000100 epoch_Time:28040.0min: [2024-01-03 08:51:29,586][model8_pretrain.py][INFO] Epoch:[0/2](144500/4588595) loss:2.535 lr:0.0000100 epoch_Time:28039.0min: [2024-01-03 08:51:29,586][model8_pretrain.py][INFO] Epoch:[0/2](144500/4588595) loss:2.295 lr:0.0000100 epoch_Time:28039.0min: [2024-01-03 
08:51:29,586][model8_pretrain.py][INFO] Epoch:[0/2](144500/4588595) loss:3.050 lr:0.0000100 epoch_Time:28039.0min: [2024-01-03 08:51:29,586][model8_pretrain.py][INFO] Epoch:[0/2](144500/4588595) loss:2.669 lr:0.0000100 epoch_Time:28039.0min: [2024-01-03 08:51:29,586][model8_pretrain.py][INFO] Epoch:[0/2](144500/4588595) loss:2.576 lr:0.0000100 epoch_Time:28039.0min: [2024-01-03 08:51:29,586][model8_pretrain.py][INFO] Epoch:[0/2](144500/4588595) loss:2.487 lr:0.0000100 epoch_Time:28039.0min: [2024-01-03 08:51:29,586][model8_pretrain.py][INFO] Epoch:[0/2](144500/4588595) loss:3.132 lr:0.0000100 epoch_Time:28039.0min: [2024-01-03 08:51:29,588][model8_pretrain.py][INFO] Epoch:[0/2](144500/4588595) loss:2.773 lr:0.0000100 epoch_Time:28039.0min: [2024-01-03 08:52:06,544][model8_pretrain.py][INFO] Epoch:[0/2](144600/4588595) loss:2.452 lr:0.0000100 epoch_Time:28038.0min: [2024-01-03 08:52:06,544][model8_pretrain.py][INFO] Epoch:[0/2](144600/4588595) loss:2.758 lr:0.0000100 epoch_Time:28038.0min: [2024-01-03 08:52:06,544][model8_pretrain.py][INFO] Epoch:[0/2](144600/4588595) loss:2.701 lr:0.0000100 epoch_Time:28038.0min: [2024-01-03 08:52:06,544][model8_pretrain.py][INFO] Epoch:[0/2](144600/4588595) loss:3.080 lr:0.0000100 epoch_Time:28038.0min: [2024-01-03 08:52:06,545][model8_pretrain.py][INFO] Epoch:[0/2](144600/4588595) loss:2.405 lr:0.0000100 epoch_Time:28038.0min: [2024-01-03 08:52:06,545][model8_pretrain.py][INFO] Epoch:[0/2](144600/4588595) loss:2.822 lr:0.0000100 epoch_Time:28038.0min: [2024-01-03 08:52:06,545][model8_pretrain.py][INFO] Epoch:[0/2](144600/4588595) loss:3.106 lr:0.0000100 epoch_Time:28038.0min: [2024-01-03 08:52:06,545][model8_pretrain.py][INFO] Epoch:[0/2](144600/4588595) loss:2.891 lr:0.0000100 epoch_Time:28038.0min: [2024-01-03 08:52:43,491][model8_pretrain.py][INFO] Epoch:[0/2](144700/4588595) loss:2.993 lr:0.0000100 epoch_Time:28037.0min: [2024-01-03 08:52:43,491][model8_pretrain.py][INFO] Epoch:[0/2](144700/4588595) loss:3.356 lr:0.0000100 
epoch_Time:28037.0min: [2024-01-03 08:52:43,491][model8_pretrain.py][INFO] Epoch:[0/2](144700/4588595) loss:2.977 lr:0.0000100 epoch_Time:28037.0min: [2024-01-03 08:52:43,491][model8_pretrain.py][INFO] Epoch:[0/2](144700/4588595) loss:3.053 lr:0.0000100 epoch_Time:28037.0min: [2024-01-03 08:52:43,491][model8_pretrain.py][INFO] Epoch:[0/2](144700/4588595) loss:3.162 lr:0.0000100 epoch_Time:28037.0min: [2024-01-03 08:52:43,491][model8_pretrain.py][INFO] Epoch:[0/2](144700/4588595) loss:2.814 lr:0.0000100 epoch_Time:28037.0min: [2024-01-03 08:52:43,491][model8_pretrain.py][INFO] Epoch:[0/2](144700/4588595) loss:3.101 lr:0.0000100 epoch_Time:28037.0min: [2024-01-03 08:52:43,491][model8_pretrain.py][INFO] Epoch:[0/2](144700/4588595) loss:2.948 lr:0.0000100 epoch_Time:28037.0min: [2024-01-03 08:53:20,445][model8_pretrain.py][INFO] Epoch:[0/2](144800/4588595) loss:2.935 lr:0.0000100 epoch_Time:28036.0min: [2024-01-03 08:53:20,445][model8_pretrain.py][INFO] Epoch:[0/2](144800/4588595) loss:2.851 lr:0.0000100 epoch_Time:28036.0min: [2024-01-03 08:53:20,445][model8_pretrain.py][INFO] Epoch:[0/2](144800/4588595) loss:2.579 lr:0.0000100 epoch_Time:28036.0min: [2024-01-03 08:53:20,445][model8_pretrain.py][INFO] Epoch:[0/2](144800/4588595) loss:2.805 lr:0.0000100 epoch_Time:28036.0min: [2024-01-03 08:53:20,445][model8_pretrain.py][INFO] Epoch:[0/2](144800/4588595) loss:3.037 lr:0.0000100 epoch_Time:28036.0min: [2024-01-03 08:53:20,445][model8_pretrain.py][INFO] Epoch:[0/2](144800/4588595) loss:3.348 lr:0.0000100 epoch_Time:28036.0min: [2024-01-03 08:53:20,445][model8_pretrain.py][INFO] Epoch:[0/2](144800/4588595) loss:2.941 lr:0.0000100 epoch_Time:28036.0min: [2024-01-03 08:53:20,445][model8_pretrain.py][INFO] Epoch:[0/2](144800/4588595) loss:3.118 lr:0.0000100 epoch_Time:28036.0min: [2024-01-03 08:53:57,399][model8_pretrain.py][INFO] Epoch:[0/2](144900/4588595) loss:1.958 lr:0.0000100 epoch_Time:28034.0min: [2024-01-03 08:53:57,399][model8_pretrain.py][INFO] 
Epoch:[0/2](144900/4588595) loss:3.250 lr:0.0000100 epoch_Time:28034.0min: [2024-01-03 08:53:57,399][model8_pretrain.py][INFO] Epoch:[0/2](144900/4588595) loss:3.123 lr:0.0000100 epoch_Time:28034.0min: [2024-01-03 08:53:57,399][model8_pretrain.py][INFO] Epoch:[0/2](144900/4588595) loss:3.289 lr:0.0000100 epoch_Time:28034.0min: [2024-01-03 08:53:57,399][model8_pretrain.py][INFO] Epoch:[0/2](144900/4588595) loss:3.123 lr:0.0000100 epoch_Time:28034.0min: [2024-01-03 08:53:57,399][model8_pretrain.py][INFO] Epoch:[0/2](144900/4588595) loss:2.900 lr:0.0000100 epoch_Time:28034.0min: [2024-01-03 08:53:57,399][model8_pretrain.py][INFO] Epoch:[0/2](144900/4588595) loss:2.565 lr:0.0000100 epoch_Time:28034.0min: [2024-01-03 08:53:57,399][model8_pretrain.py][INFO] Epoch:[0/2](144900/4588595) loss:2.994 lr:0.0000100 epoch_Time:28034.0min: [2024-01-03 08:54:34,347][model8_pretrain.py][INFO] Epoch:[0/2](145000/4588595) loss:2.910 lr:0.0000100 epoch_Time:28034.0min: [2024-01-03 08:54:34,347][model8_pretrain.py][INFO] Epoch:[0/2](145000/4588595) loss:2.816 lr:0.0000100 epoch_Time:28034.0min: [2024-01-03 08:54:34,347][model8_pretrain.py][INFO] Epoch:[0/2](145000/4588595) loss:3.210 lr:0.0000100 epoch_Time:28034.0min: [2024-01-03 08:54:34,347][model8_pretrain.py][INFO] Epoch:[0/2](145000/4588595) loss:2.985 lr:0.0000100 epoch_Time:28034.0min: [2024-01-03 08:54:34,347][model8_pretrain.py][INFO] Epoch:[0/2](145000/4588595) loss:2.823 lr:0.0000100 epoch_Time:28034.0min: [2024-01-03 08:54:34,347][model8_pretrain.py][INFO] Epoch:[0/2](145000/4588595) loss:2.839 lr:0.0000100 epoch_Time:28034.0min: [2024-01-03 08:54:34,347][model8_pretrain.py][INFO] Epoch:[0/2](145000/4588595) loss:2.703 lr:0.0000100 epoch_Time:28034.0min: [2024-01-03 08:54:34,348][model8_pretrain.py][INFO] Epoch:[0/2](145000/4588595) loss:2.918 lr:0.0000100 epoch_Time:28034.0min: [2024-01-03 08:55:20,125][model8_pretrain.py][INFO] Epoch:[0/2](145100/4588595) loss:3.438 lr:0.0000100 epoch_Time:28037.0min: [2024-01-03 
08:55:20,125][model8_pretrain.py][INFO] Epoch:[0/2](145100/4588595) loss:2.464 lr:0.0000100 epoch_Time:28037.0min: [2024-01-03 08:55:20,125][model8_pretrain.py][INFO] Epoch:[0/2](145100/4588595) loss:2.961 lr:0.0000100 epoch_Time:28037.0min: [2024-01-03 08:55:20,125][model8_pretrain.py][INFO] Epoch:[0/2](145100/4588595) loss:2.994 lr:0.0000100 epoch_Time:28037.0min: [2024-01-03 08:55:20,125][model8_pretrain.py][INFO] Epoch:[0/2](145100/4588595) loss:3.112 lr:0.0000100 epoch_Time:28037.0min: [2024-01-03 08:55:20,125][model8_pretrain.py][INFO] Epoch:[0/2](145100/4588595) loss:2.944 lr:0.0000100 epoch_Time:28037.0min: [2024-01-03 08:55:20,125][model8_pretrain.py][INFO] Epoch:[0/2](145100/4588595) loss:2.651 lr:0.0000100 epoch_Time:28037.0min: [2024-01-03 08:55:20,126][model8_pretrain.py][INFO] Epoch:[0/2](145100/4588595) loss:3.205 lr:0.0000100 epoch_Time:28037.0min: [2024-01-03 08:55:57,045][model8_pretrain.py][INFO] Epoch:[0/2](145200/4588595) loss:2.685 lr:0.0000100 epoch_Time:28036.0min: [2024-01-03 08:55:57,045][model8_pretrain.py][INFO] Epoch:[0/2](145200/4588595) loss:2.964 lr:0.0000100 epoch_Time:28036.0min: [2024-01-03 08:55:57,045][model8_pretrain.py][INFO] Epoch:[0/2](145200/4588595) loss:3.501 lr:0.0000100 epoch_Time:28036.0min: [2024-01-03 08:55:57,045][model8_pretrain.py][INFO] Epoch:[0/2](145200/4588595) loss:3.196 lr:0.0000100 epoch_Time:28036.0min: [2024-01-03 08:55:57,045][model8_pretrain.py][INFO] Epoch:[0/2](145200/4588595) loss:2.542 lr:0.0000100 epoch_Time:28036.0min: [2024-01-03 08:55:57,045][model8_pretrain.py][INFO] Epoch:[0/2](145200/4588595) loss:2.914 lr:0.0000100 epoch_Time:28036.0min: [2024-01-03 08:55:57,045][model8_pretrain.py][INFO] Epoch:[0/2](145200/4588595) loss:2.901 lr:0.0000100 epoch_Time:28036.0min: [2024-01-03 08:55:57,045][model8_pretrain.py][INFO] Epoch:[0/2](145200/4588595) loss:2.887 lr:0.0000100 epoch_Time:28036.0min: [2024-01-03 08:56:33,976][model8_pretrain.py][INFO] Epoch:[0/2](145300/4588595) loss:2.939 lr:0.0000100 
epoch_Time:28035.0min: [2024-01-03 08:56:33,976][model8_pretrain.py][INFO] Epoch:[0/2](145300/4588595) loss:3.059 lr:0.0000100 epoch_Time:28035.0min: [2024-01-03 08:56:33,976][model8_pretrain.py][INFO] Epoch:[0/2](145300/4588595) loss:2.561 lr:0.0000100 epoch_Time:28035.0min: [2024-01-03 08:56:33,976][model8_pretrain.py][INFO] Epoch:[0/2](145300/4588595) loss:3.507 lr:0.0000100 epoch_Time:28035.0min: [2024-01-03 08:56:33,976][model8_pretrain.py][INFO] Epoch:[0/2](145300/4588595) loss:3.329 lr:0.0000100 epoch_Time:28035.0min: [2024-01-03 08:56:33,976][model8_pretrain.py][INFO] Epoch:[0/2](145300/4588595) loss:2.806 lr:0.0000100 epoch_Time:28035.0min: [2024-01-03 08:56:33,976][model8_pretrain.py][INFO] Epoch:[0/2](145300/4588595) loss:3.052 lr:0.0000100 epoch_Time:28035.0min: [2024-01-03 08:56:33,977][model8_pretrain.py][INFO] Epoch:[0/2](145300/4588595) loss:2.834 lr:0.0000100 epoch_Time:28035.0min: [2024-01-03 08:57:10,911][model8_pretrain.py][INFO] Epoch:[0/2](145400/4588595) loss:2.976 lr:0.0000100 epoch_Time:28034.0min: [2024-01-03 08:57:10,911][model8_pretrain.py][INFO] Epoch:[0/2](145400/4588595) loss:2.690 lr:0.0000100 epoch_Time:28034.0min: [2024-01-03 08:57:10,911][model8_pretrain.py][INFO] Epoch:[0/2](145400/4588595) loss:3.070 lr:0.0000100 epoch_Time:28034.0min: [2024-01-03 08:57:10,911][model8_pretrain.py][INFO] Epoch:[0/2](145400/4588595) loss:2.965 lr:0.0000100 epoch_Time:28034.0min: [2024-01-03 08:57:10,911][model8_pretrain.py][INFO] Epoch:[0/2](145400/4588595) loss:3.235 lr:0.0000100 epoch_Time:28034.0min: [2024-01-03 08:57:10,911][model8_pretrain.py][INFO] Epoch:[0/2](145400/4588595) loss:2.582 lr:0.0000100 epoch_Time:28034.0min: [2024-01-03 08:57:10,911][model8_pretrain.py][INFO] Epoch:[0/2](145400/4588595) loss:2.623 lr:0.0000100 epoch_Time:28034.0min: [2024-01-03 08:57:10,911][model8_pretrain.py][INFO] Epoch:[0/2](145400/4588595) loss:2.528 lr:0.0000100 epoch_Time:28034.0min: [2024-01-03 08:57:47,839][model8_pretrain.py][INFO] 
Epoch:[0/2](145500/4588595) loss:2.899 lr:0.0000100 epoch_Time:28032.0min: [2024-01-03 08:57:47,839][model8_pretrain.py][INFO] Epoch:[0/2](145500/4588595) loss:2.825 lr:0.0000100 epoch_Time:28032.0min: [2024-01-03 08:57:47,839][model8_pretrain.py][INFO] Epoch:[0/2](145500/4588595) loss:2.952 lr:0.0000100 epoch_Time:28032.0min: [2024-01-03 08:57:47,839][model8_pretrain.py][INFO] Epoch:[0/2](145500/4588595) loss:3.290 lr:0.0000100 epoch_Time:28032.0min: [2024-01-03 08:57:47,839][model8_pretrain.py][INFO] Epoch:[0/2](145500/4588595) loss:3.339 lr:0.0000100 epoch_Time:28032.0min: [2024-01-03 08:57:47,839][model8_pretrain.py][INFO] Epoch:[0/2](145500/4588595) loss:2.932 lr:0.0000100 epoch_Time:28032.0min: [2024-01-03 08:57:47,839][model8_pretrain.py][INFO] Epoch:[0/2](145500/4588595) loss:2.651 lr:0.0000100 epoch_Time:28032.0min: [2024-01-03 08:57:47,839][model8_pretrain.py][INFO] Epoch:[0/2](145500/4588595) loss:2.595 lr:0.0000100 epoch_Time:28032.0min: [2024-01-03 08:58:24,773][model8_pretrain.py][INFO] Epoch:[0/2](145600/4588595) loss:2.929 lr:0.0000100 epoch_Time:28032.0min: [2024-01-03 08:58:24,773][model8_pretrain.py][INFO] Epoch:[0/2](145600/4588595) loss:2.664 lr:0.0000100 epoch_Time:28032.0min: [2024-01-03 08:58:24,773][model8_pretrain.py][INFO] Epoch:[0/2](145600/4588595) loss:2.657 lr:0.0000100 epoch_Time:28032.0min: [2024-01-03 08:58:24,773][model8_pretrain.py][INFO] Epoch:[0/2](145600/4588595) loss:3.479 lr:0.0000100 epoch_Time:28032.0min: [2024-01-03 08:58:24,773][model8_pretrain.py][INFO] Epoch:[0/2](145600/4588595) loss:3.264 lr:0.0000100 epoch_Time:28032.0min: [2024-01-03 08:58:24,773][model8_pretrain.py][INFO] Epoch:[0/2](145600/4588595) loss:2.263 lr:0.0000100 epoch_Time:28032.0min: [2024-01-03 08:58:24,773][model8_pretrain.py][INFO] Epoch:[0/2](145600/4588595) loss:2.843 lr:0.0000100 epoch_Time:28032.0min: [2024-01-03 08:58:24,773][model8_pretrain.py][INFO] Epoch:[0/2](145600/4588595) loss:3.107 lr:0.0000100 epoch_Time:28032.0min: [2024-01-03 
08:59:01,698][model8_pretrain.py][INFO] Epoch:[0/2](145700/4588595) loss:2.816 lr:0.0000100 epoch_Time:28030.0min: [2024-01-03 08:59:01,698][model8_pretrain.py][INFO] Epoch:[0/2](145700/4588595) loss:2.738 lr:0.0000100 epoch_Time:28030.0min: [2024-01-03 08:59:01,698][model8_pretrain.py][INFO] Epoch:[0/2](145700/4588595) loss:2.991 lr:0.0000100 epoch_Time:28030.0min: [2024-01-03 08:59:01,698][model8_pretrain.py][INFO] Epoch:[0/2](145700/4588595) loss:2.961 lr:0.0000100 epoch_Time:28030.0min: [2024-01-03 08:59:01,698][model8_pretrain.py][INFO] Epoch:[0/2](145700/4588595) loss:2.849 lr:0.0000100 epoch_Time:28030.0min: [2024-01-03 08:59:01,698][model8_pretrain.py][INFO] Epoch:[0/2](145700/4588595) loss:3.066 lr:0.0000100 epoch_Time:28030.0min: [2024-01-03 08:59:01,698][model8_pretrain.py][INFO] Epoch:[0/2](145700/4588595) loss:2.601 lr:0.0000100 epoch_Time:28030.0min: [2024-01-03 08:59:01,698][model8_pretrain.py][INFO] Epoch:[0/2](145700/4588595) loss:3.457 lr:0.0000100 epoch_Time:28030.0min: [2024-01-03 08:59:38,652][model8_pretrain.py][INFO] Epoch:[0/2](145800/4588595) loss:3.151 lr:0.0000100 epoch_Time:28030.0min: [2024-01-03 08:59:38,652][model8_pretrain.py][INFO] Epoch:[0/2](145800/4588595) loss:2.704 lr:0.0000100 epoch_Time:28030.0min: [2024-01-03 08:59:38,652][model8_pretrain.py][INFO] Epoch:[0/2](145800/4588595) loss:3.021 lr:0.0000100 epoch_Time:28030.0min: [2024-01-03 08:59:38,652][model8_pretrain.py][INFO] Epoch:[0/2](145800/4588595) loss:2.981 lr:0.0000100 epoch_Time:28030.0min: [2024-01-03 08:59:38,652][model8_pretrain.py][INFO] Epoch:[0/2](145800/4588595) loss:2.955 lr:0.0000100 epoch_Time:28030.0min: [2024-01-03 08:59:38,652][model8_pretrain.py][INFO] Epoch:[0/2](145800/4588595) loss:3.213 lr:0.0000100 epoch_Time:28030.0min: [2024-01-03 08:59:38,652][model8_pretrain.py][INFO] Epoch:[0/2](145800/4588595) loss:2.987 lr:0.0000100 epoch_Time:28030.0min: [2024-01-03 08:59:38,652][model8_pretrain.py][INFO] Epoch:[0/2](145800/4588595) loss:2.960 lr:0.0000100 
epoch_Time:28030.0min: [2024-01-03 09:00:24,373][model8_pretrain.py][INFO] Epoch:[0/2](145900/4588595) loss:2.877 lr:0.0000100 epoch_Time:28033.0min: [2024-01-03 09:00:24,373][model8_pretrain.py][INFO] Epoch:[0/2](145900/4588595) loss:3.074 lr:0.0000100 epoch_Time:28033.0min: [2024-01-03 09:00:24,373][model8_pretrain.py][INFO] Epoch:[0/2](145900/4588595) loss:2.748 lr:0.0000100 epoch_Time:28033.0min: [2024-01-03 09:00:24,373][model8_pretrain.py][INFO] Epoch:[0/2](145900/4588595) loss:2.732 lr:0.0000100 epoch_Time:28033.0min: [2024-01-03 09:00:24,373][model8_pretrain.py][INFO] Epoch:[0/2](145900/4588595) loss:3.358 lr:0.0000100 epoch_Time:28033.0min: [2024-01-03 09:00:24,373][model8_pretrain.py][INFO] Epoch:[0/2](145900/4588595) loss:2.922 lr:0.0000100 epoch_Time:28033.0min: [2024-01-03 09:00:24,373][model8_pretrain.py][INFO] Epoch:[0/2](145900/4588595) loss:2.714 lr:0.0000100 epoch_Time:28033.0min: [2024-01-03 09:00:24,373][model8_pretrain.py][INFO] Epoch:[0/2](145900/4588595) loss:2.760 lr:0.0000100 epoch_Time:28033.0min: [2024-01-03 09:01:01,293][model8_pretrain.py][INFO] Epoch:[0/2](146000/4588595) loss:2.454 lr:0.0000100 epoch_Time:28031.0min: [2024-01-03 09:01:01,293][model8_pretrain.py][INFO] Epoch:[0/2](146000/4588595) loss:3.012 lr:0.0000100 epoch_Time:28031.0min: [2024-01-03 09:01:01,293][model8_pretrain.py][INFO] Epoch:[0/2](146000/4588595) loss:2.427 lr:0.0000100 epoch_Time:28031.0min: [2024-01-03 09:01:01,293][model8_pretrain.py][INFO] Epoch:[0/2](146000/4588595) loss:3.524 lr:0.0000100 epoch_Time:28031.0min: [2024-01-03 09:01:01,293][model8_pretrain.py][INFO] Epoch:[0/2](146000/4588595) loss:2.730 lr:0.0000100 epoch_Time:28031.0min: [2024-01-03 09:01:01,293][model8_pretrain.py][INFO] Epoch:[0/2](146000/4588595) loss:2.722 lr:0.0000100 epoch_Time:28031.0min: [2024-01-03 09:01:01,293][model8_pretrain.py][INFO] Epoch:[0/2](146000/4588595) loss:3.357 lr:0.0000100 epoch_Time:28031.0min: [2024-01-03 09:01:01,293][model8_pretrain.py][INFO] 
Epoch:[0/2](146000/4588595) loss:2.896 lr:0.0000100 epoch_Time:28031.0min: [2024-01-03 09:01:38,219][model8_pretrain.py][INFO] Epoch:[0/2](146100/4588595) loss:3.470 lr:0.0000100 epoch_Time:28031.0min: [2024-01-03 09:01:38,219][model8_pretrain.py][INFO] Epoch:[0/2](146100/4588595) loss:3.086 lr:0.0000100 epoch_Time:28031.0min: [2024-01-03 09:01:38,219][model8_pretrain.py][INFO] Epoch:[0/2](146100/4588595) loss:3.357 lr:0.0000100 epoch_Time:28031.0min: [2024-01-03 09:01:38,219][model8_pretrain.py][INFO] Epoch:[0/2](146100/4588595) loss:3.450 lr:0.0000100 epoch_Time:28031.0min: [2024-01-03 09:01:38,219][model8_pretrain.py][INFO] Epoch:[0/2](146100/4588595) loss:3.013 lr:0.0000100 epoch_Time:28031.0min: [2024-01-03 09:01:38,219][model8_pretrain.py][INFO] Epoch:[0/2](146100/4588595) loss:2.828 lr:0.0000100 epoch_Time:28031.0min: [2024-01-03 09:01:38,219][model8_pretrain.py][INFO] Epoch:[0/2](146100/4588595) loss:2.955 lr:0.0000100 epoch_Time:28031.0min: [2024-01-03 09:01:38,219][model8_pretrain.py][INFO] Epoch:[0/2](146100/4588595) loss:3.208 lr:0.0000100 epoch_Time:28031.0min: [2024-01-03 09:02:15,157][model8_pretrain.py][INFO] Epoch:[0/2](146200/4588595) loss:3.454 lr:0.0000100 epoch_Time:28029.0min: [2024-01-03 09:02:15,157][model8_pretrain.py][INFO] Epoch:[0/2](146200/4588595) loss:3.010 lr:0.0000100 epoch_Time:28029.0min: [2024-01-03 09:02:15,157][model8_pretrain.py][INFO] Epoch:[0/2](146200/4588595) loss:3.353 lr:0.0000100 epoch_Time:28029.0min: [2024-01-03 09:02:15,157][model8_pretrain.py][INFO] Epoch:[0/2](146200/4588595) loss:3.221 lr:0.0000100 epoch_Time:28029.0min: [2024-01-03 09:02:15,158][model8_pretrain.py][INFO] Epoch:[0/2](146200/4588595) loss:2.430 lr:0.0000100 epoch_Time:28029.0min: [2024-01-03 09:02:15,157][model8_pretrain.py][INFO] Epoch:[0/2](146200/4588595) loss:2.862 lr:0.0000100 epoch_Time:28029.0min: [2024-01-03 09:02:15,158][model8_pretrain.py][INFO] Epoch:[0/2](146200/4588595) loss:2.237 lr:0.0000100 epoch_Time:28029.0min: [2024-01-03 
09:02:15,158][model8_pretrain.py][INFO] Epoch:[0/2](146200/4588595) loss:2.953 lr:0.0000100 epoch_Time:28029.0min: [2024-01-03 09:02:52,079][model8_pretrain.py][INFO] Epoch:[0/2](146300/4588595) loss:2.656 lr:0.0000100 epoch_Time:28028.0min: [2024-01-03 09:02:52,079][model8_pretrain.py][INFO] Epoch:[0/2](146300/4588595) loss:2.445 lr:0.0000100 epoch_Time:28028.0min: [2024-01-03 09:02:52,079][model8_pretrain.py][INFO] Epoch:[0/2](146300/4588595) loss:2.895 lr:0.0000100 epoch_Time:28028.0min: [2024-01-03 09:02:52,079][model8_pretrain.py][INFO] Epoch:[0/2](146300/4588595) loss:3.127 lr:0.0000100 epoch_Time:28028.0min: [2024-01-03 09:02:52,079][model8_pretrain.py][INFO] Epoch:[0/2](146300/4588595) loss:2.790 lr:0.0000100 epoch_Time:28028.0min: [2024-01-03 09:02:52,079][model8_pretrain.py][INFO] Epoch:[0/2](146300/4588595) loss:2.961 lr:0.0000100 epoch_Time:28028.0min: [2024-01-03 09:02:52,079][model8_pretrain.py][INFO] Epoch:[0/2](146300/4588595) loss:3.059 lr:0.0000100 epoch_Time:28028.0min: [2024-01-03 09:02:52,079][model8_pretrain.py][INFO] Epoch:[0/2](146300/4588595) loss:2.932 lr:0.0000100 epoch_Time:28028.0min: [2024-01-03 09:03:29,016][model8_pretrain.py][INFO] Epoch:[0/2](146400/4588595) loss:2.926 lr:0.0000100 epoch_Time:28027.0min: [2024-01-03 09:03:29,016][model8_pretrain.py][INFO] Epoch:[0/2](146400/4588595) loss:3.224 lr:0.0000100 epoch_Time:28027.0min: [2024-01-03 09:03:29,016][model8_pretrain.py][INFO] Epoch:[0/2](146400/4588595) loss:2.810 lr:0.0000100 epoch_Time:28027.0min: [2024-01-03 09:03:29,016][model8_pretrain.py][INFO] Epoch:[0/2](146400/4588595) loss:3.228 lr:0.0000100 epoch_Time:28027.0min: [2024-01-03 09:03:29,016][model8_pretrain.py][INFO] Epoch:[0/2](146400/4588595) loss:2.775 lr:0.0000100 epoch_Time:28027.0min: [2024-01-03 09:03:29,016][model8_pretrain.py][INFO] Epoch:[0/2](146400/4588595) loss:3.199 lr:0.0000100 epoch_Time:28027.0min: [2024-01-03 09:03:29,016][model8_pretrain.py][INFO] Epoch:[0/2](146400/4588595) loss:3.036 lr:0.0000100 
epoch_Time:28027.0min: [2024-01-03 09:03:29,023][model8_pretrain.py][INFO] Epoch:[0/2](146400/4588595) loss:3.249 lr:0.0000100 epoch_Time:28027.0min: [2024-01-03 09:04:05,960][model8_pretrain.py][INFO] Epoch:[0/2](146500/4588595) loss:3.055 lr:0.0000100 epoch_Time:28026.0min: [2024-01-03 09:04:05,960][model8_pretrain.py][INFO] Epoch:[0/2](146500/4588595) loss:3.227 lr:0.0000100 epoch_Time:28026.0min: [2024-01-03 09:04:05,960][model8_pretrain.py][INFO] Epoch:[0/2](146500/4588595) loss:2.791 lr:0.0000100 epoch_Time:28026.0min: [2024-01-03 09:04:05,960][model8_pretrain.py][INFO] Epoch:[0/2](146500/4588595) loss:2.965 lr:0.0000100 epoch_Time:28026.0min: [2024-01-03 09:04:05,960][model8_pretrain.py][INFO] Epoch:[0/2](146500/4588595) loss:3.537 lr:0.0000100 epoch_Time:28026.0min: [2024-01-03 09:04:05,960][model8_pretrain.py][INFO] Epoch:[0/2](146500/4588595) loss:2.961 lr:0.0000100 epoch_Time:28026.0min: [2024-01-03 09:04:05,960][model8_pretrain.py][INFO] Epoch:[0/2](146500/4588595) loss:2.935 lr:0.0000100 epoch_Time:28026.0min: [2024-01-03 09:04:05,960][model8_pretrain.py][INFO] Epoch:[0/2](146500/4588595) loss:2.926 lr:0.0000100 epoch_Time:28026.0min: [2024-01-03 09:04:42,885][model8_pretrain.py][INFO] Epoch:[0/2](146600/4588595) loss:2.760 lr:0.0000100 epoch_Time:28025.0min: [2024-01-03 09:04:42,885][model8_pretrain.py][INFO] Epoch:[0/2](146600/4588595) loss:3.456 lr:0.0000100 epoch_Time:28025.0min: [2024-01-03 09:04:42,885][model8_pretrain.py][INFO] Epoch:[0/2](146600/4588595) loss:3.049 lr:0.0000100 epoch_Time:28025.0min: [2024-01-03 09:04:42,885][model8_pretrain.py][INFO] Epoch:[0/2](146600/4588595) loss:2.559 lr:0.0000100 epoch_Time:28025.0min: [2024-01-03 09:04:42,885][model8_pretrain.py][INFO] Epoch:[0/2](146600/4588595) loss:3.127 lr:0.0000100 epoch_Time:28025.0min: [2024-01-03 09:04:42,885][model8_pretrain.py][INFO] Epoch:[0/2](146600/4588595) loss:2.947 lr:0.0000100 epoch_Time:28025.0min: [2024-01-03 09:04:42,885][model8_pretrain.py][INFO] 
Epoch:[0/2](146600/4588595) loss:2.508 lr:0.0000100 epoch_Time:28025.0min: [2024-01-03 09:04:42,885][model8_pretrain.py][INFO] Epoch:[0/2](146600/4588595) loss:2.706 lr:0.0000100 epoch_Time:28025.0min: [2024-01-03 09:05:26,834][model8_pretrain.py][INFO] Epoch:[0/2](146700/4588595) loss:2.653 lr:0.0000100 epoch_Time:28028.0min: [2024-01-03 09:05:26,834][model8_pretrain.py][INFO] Epoch:[0/2](146700/4588595) loss:2.856 lr:0.0000100 epoch_Time:28028.0min: [2024-01-03 09:05:26,834][model8_pretrain.py][INFO] Epoch:[0/2](146700/4588595) loss:3.195 lr:0.0000100 epoch_Time:28028.0min: [2024-01-03 09:05:26,834][model8_pretrain.py][INFO] Epoch:[0/2](146700/4588595) loss:3.224 lr:0.0000100 epoch_Time:28028.0min: [2024-01-03 09:05:26,835][model8_pretrain.py][INFO] Epoch:[0/2](146700/4588595) loss:2.493 lr:0.0000100 epoch_Time:28028.0min: [2024-01-03 09:05:26,837][model8_pretrain.py][INFO] Epoch:[0/2](146700/4588595) loss:3.428 lr:0.0000100 epoch_Time:28028.0min: [2024-01-03 09:05:26,838][model8_pretrain.py][INFO] Epoch:[0/2](146700/4588595) loss:3.238 lr:0.0000100 epoch_Time:28028.0min: [2024-01-03 09:05:26,839][model8_pretrain.py][INFO] Epoch:[0/2](146700/4588595) loss:2.778 lr:0.0000100 epoch_Time:28028.0min: [2024-01-03 09:06:05,442][model8_pretrain.py][INFO] Epoch:[0/2](146800/4588595) loss:2.970 lr:0.0000100 epoch_Time:28027.0min: [2024-01-03 09:06:05,442][model8_pretrain.py][INFO] Epoch:[0/2](146800/4588595) loss:2.703 lr:0.0000100 epoch_Time:28027.0min: [2024-01-03 09:06:05,442][model8_pretrain.py][INFO] Epoch:[0/2](146800/4588595) loss:2.699 lr:0.0000100 epoch_Time:28027.0min: [2024-01-03 09:06:05,442][model8_pretrain.py][INFO] Epoch:[0/2](146800/4588595) loss:2.702 lr:0.0000100 epoch_Time:28027.0min: [2024-01-03 09:06:05,442][model8_pretrain.py][INFO] Epoch:[0/2](146800/4588595) loss:3.009 lr:0.0000100 epoch_Time:28027.0min: [2024-01-03 09:06:05,442][model8_pretrain.py][INFO] Epoch:[0/2](146800/4588595) loss:3.425 lr:0.0000100 epoch_Time:28027.0min: [2024-01-03 
09:06:05,442][model8_pretrain.py][INFO] Epoch:[0/2](146800/4588595) loss:2.633 lr:0.0000100 epoch_Time:28027.0min: [2024-01-03 09:06:05,442][model8_pretrain.py][INFO] Epoch:[0/2](146800/4588595) loss:1.960 lr:0.0000100 epoch_Time:28027.0min: [2024-01-03 09:06:42,371][model8_pretrain.py][INFO] Epoch:[0/2](146900/4588595) loss:2.967 lr:0.0000100 epoch_Time:28027.0min: [2024-01-03 09:06:42,371][model8_pretrain.py][INFO] Epoch:[0/2](146900/4588595) loss:2.850 lr:0.0000100 epoch_Time:28027.0min: [2024-01-03 09:06:42,371][model8_pretrain.py][INFO] Epoch:[0/2](146900/4588595) loss:2.850 lr:0.0000100 epoch_Time:28027.0min: [2024-01-03 09:06:42,372][model8_pretrain.py][INFO] Epoch:[0/2](146900/4588595) loss:3.000 lr:0.0000100 epoch_Time:28027.0min: [2024-01-03 09:06:42,372][model8_pretrain.py][INFO] Epoch:[0/2](146900/4588595) loss:3.040 lr:0.0000100 epoch_Time:28027.0min: [2024-01-03 09:06:42,372][model8_pretrain.py][INFO] Epoch:[0/2](146900/4588595) loss:2.525 lr:0.0000100 epoch_Time:28027.0min: [2024-01-03 09:06:42,372][model8_pretrain.py][INFO] Epoch:[0/2](146900/4588595) loss:2.577 lr:0.0000100 epoch_Time:28027.0min: [2024-01-03 09:06:42,372][model8_pretrain.py][INFO] Epoch:[0/2](146900/4588595) loss:3.525 lr:0.0000100 epoch_Time:28027.0min: [2024-01-03 09:07:19,308][model8_pretrain.py][INFO] Epoch:[0/2](147000/4588595) loss:2.846 lr:0.0000100 epoch_Time:28025.0min: [2024-01-03 09:07:19,308][model8_pretrain.py][INFO] Epoch:[0/2](147000/4588595) loss:2.898 lr:0.0000100 epoch_Time:28025.0min: [2024-01-03 09:07:19,308][model8_pretrain.py][INFO] Epoch:[0/2](147000/4588595) loss:2.212 lr:0.0000100 epoch_Time:28025.0min: [2024-01-03 09:07:19,308][model8_pretrain.py][INFO] Epoch:[0/2](147000/4588595) loss:2.894 lr:0.0000100 epoch_Time:28025.0min: [2024-01-03 09:07:19,308][model8_pretrain.py][INFO] Epoch:[0/2](147000/4588595) loss:3.322 lr:0.0000100 epoch_Time:28025.0min: [2024-01-03 09:07:19,308][model8_pretrain.py][INFO] Epoch:[0/2](147000/4588595) loss:2.888 lr:0.0000100 
epoch_Time:28025.0min: [2024-01-03 09:07:19,308][model8_pretrain.py][INFO] Epoch:[0/2](147000/4588595) loss:3.420 lr:0.0000100 epoch_Time:28025.0min: [2024-01-03 09:07:19,308][model8_pretrain.py][INFO] Epoch:[0/2](147000/4588595) loss:2.687 lr:0.0000100 epoch_Time:28025.0min: [2024-01-03 09:07:56,242][model8_pretrain.py][INFO] Epoch:[0/2](147100/4588595) loss:2.827 lr:0.0000100 epoch_Time:28024.0min: [2024-01-03 09:07:56,242][model8_pretrain.py][INFO] Epoch:[0/2](147100/4588595) loss:2.859 lr:0.0000100 epoch_Time:28024.0min: [2024-01-03 09:07:56,242][model8_pretrain.py][INFO] Epoch:[0/2](147100/4588595) loss:3.161 lr:0.0000100 epoch_Time:28024.0min: [2024-01-03 09:07:56,242][model8_pretrain.py][INFO] Epoch:[0/2](147100/4588595) loss:2.220 lr:0.0000100 epoch_Time:28024.0min: [2024-01-03 09:07:56,242][model8_pretrain.py][INFO] Epoch:[0/2](147100/4588595) loss:3.329 lr:0.0000100 epoch_Time:28024.0min: [2024-01-03 09:07:56,242][model8_pretrain.py][INFO] Epoch:[0/2](147100/4588595) loss:2.875 lr:0.0000100 epoch_Time:28024.0min: [2024-01-03 09:07:56,242][model8_pretrain.py][INFO] Epoch:[0/2](147100/4588595) loss:3.048 lr:0.0000100 epoch_Time:28024.0min: [2024-01-03 09:07:56,242][model8_pretrain.py][INFO] Epoch:[0/2](147100/4588595) loss:2.463 lr:0.0000100 epoch_Time:28024.0min: [2024-01-03 09:08:33,176][model8_pretrain.py][INFO] Epoch:[0/2](147200/4588595) loss:3.189 lr:0.0000100 epoch_Time:28023.0min: [2024-01-03 09:08:33,176][model8_pretrain.py][INFO] Epoch:[0/2](147200/4588595) loss:2.661 lr:0.0000100 epoch_Time:28023.0min: [2024-01-03 09:08:33,176][model8_pretrain.py][INFO] Epoch:[0/2](147200/4588595) loss:2.978 lr:0.0000100 epoch_Time:28023.0min: [2024-01-03 09:08:33,176][model8_pretrain.py][INFO] Epoch:[0/2](147200/4588595) loss:2.822 lr:0.0000100 epoch_Time:28023.0min: [2024-01-03 09:08:33,176][model8_pretrain.py][INFO] Epoch:[0/2](147200/4588595) loss:2.952 lr:0.0000100 epoch_Time:28023.0min: [2024-01-03 09:08:33,176][model8_pretrain.py][INFO] 
Epoch:[0/2](147200/4588595) loss:3.249 lr:0.0000100 epoch_Time:28023.0min: [2024-01-03 09:08:33,176][model8_pretrain.py][INFO] Epoch:[0/2](147200/4588595) loss:3.193 lr:0.0000100 epoch_Time:28023.0min: [2024-01-03 09:08:33,176][model8_pretrain.py][INFO] Epoch:[0/2](147200/4588595) loss:2.987 lr:0.0000100 epoch_Time:28023.0min: [2024-01-03 09:09:10,101][model8_pretrain.py][INFO] Epoch:[0/2](147300/4588595) loss:3.106 lr:0.0000100 epoch_Time:28022.0min: [2024-01-03 09:09:10,101][model8_pretrain.py][INFO] Epoch:[0/2](147300/4588595) loss:3.310 lr:0.0000100 epoch_Time:28022.0min: [2024-01-03 09:09:10,101][model8_pretrain.py][INFO] Epoch:[0/2](147300/4588595) loss:3.001 lr:0.0000100 epoch_Time:28022.0min: [2024-01-03 09:09:10,101][model8_pretrain.py][INFO] Epoch:[0/2](147300/4588595) loss:2.686 lr:0.0000100 epoch_Time:28022.0min: [2024-01-03 09:09:10,101][model8_pretrain.py][INFO] Epoch:[0/2](147300/4588595) loss:3.236 lr:0.0000100 epoch_Time:28022.0min: [2024-01-03 09:09:10,101][model8_pretrain.py][INFO] Epoch:[0/2](147300/4588595) loss:2.805 lr:0.0000100 epoch_Time:28022.0min: [2024-01-03 09:09:10,101][model8_pretrain.py][INFO] Epoch:[0/2](147300/4588595) loss:2.986 lr:0.0000100 epoch_Time:28022.0min: [2024-01-03 09:09:10,101][model8_pretrain.py][INFO] Epoch:[0/2](147300/4588595) loss:2.663 lr:0.0000100 epoch_Time:28022.0min: [2024-01-03 09:09:47,023][model8_pretrain.py][INFO] Epoch:[0/2](147400/4588595) loss:2.477 lr:0.0000100 epoch_Time:28021.0min: [2024-01-03 09:09:47,024][model8_pretrain.py][INFO] Epoch:[0/2](147400/4588595) loss:2.571 lr:0.0000100 epoch_Time:28021.0min: [2024-01-03 09:09:47,024][model8_pretrain.py][INFO] Epoch:[0/2](147400/4588595) loss:3.075 lr:0.0000100 epoch_Time:28021.0min: [2024-01-03 09:09:47,024][model8_pretrain.py][INFO] Epoch:[0/2](147400/4588595) loss:2.618 lr:0.0000100 epoch_Time:28021.0min: [2024-01-03 09:09:47,024][model8_pretrain.py][INFO] Epoch:[0/2](147400/4588595) loss:2.982 lr:0.0000100 epoch_Time:28021.0min: [2024-01-03 
09:09:47,024][model8_pretrain.py][INFO] Epoch:[0/2](147400/4588595) loss:3.173 lr:0.0000100 epoch_Time:28021.0min: [2024-01-03 09:09:47,024][model8_pretrain.py][INFO] Epoch:[0/2](147400/4588595) loss:2.764 lr:0.0000100 epoch_Time:28021.0min: [2024-01-03 09:09:47,024][model8_pretrain.py][INFO] Epoch:[0/2](147400/4588595) loss:2.544 lr:0.0000100 epoch_Time:28021.0min: [2024-01-03 09:10:27,457][model8_pretrain.py][INFO] Epoch:[0/2](147500/4588595) loss:2.870 lr:0.0000100 epoch_Time:28021.0min: [2024-01-03 09:10:27,457][model8_pretrain.py][INFO] Epoch:[0/2](147500/4588595) loss:2.493 lr:0.0000100 epoch_Time:28021.0min: [2024-01-03 09:10:27,457][model8_pretrain.py][INFO] Epoch:[0/2](147500/4588595) loss:3.193 lr:0.0000100 epoch_Time:28021.0min: [2024-01-03 09:10:27,457][model8_pretrain.py][INFO] Epoch:[0/2](147500/4588595) loss:3.177 lr:0.0000100 epoch_Time:28021.0min: [2024-01-03 09:10:27,457][model8_pretrain.py][INFO] Epoch:[0/2](147500/4588595) loss:3.483 lr:0.0000100 epoch_Time:28021.0min: [2024-01-03 09:10:27,457][model8_pretrain.py][INFO] Epoch:[0/2](147500/4588595) loss:3.128 lr:0.0000100 epoch_Time:28021.0min: [2024-01-03 09:10:27,457][model8_pretrain.py][INFO] Epoch:[0/2](147500/4588595) loss:2.488 lr:0.0000100 epoch_Time:28021.0min: [2024-01-03 09:10:27,457][model8_pretrain.py][INFO] Epoch:[0/2](147500/4588595) loss:3.191 lr:0.0000100 epoch_Time:28021.0min: [2024-01-03 09:11:09,621][model8_pretrain.py][INFO] Epoch:[0/2](147600/4588595) loss:2.786 lr:0.0000100 epoch_Time:28023.0min: [2024-01-03 09:11:09,621][model8_pretrain.py][INFO] Epoch:[0/2](147600/4588595) loss:2.682 lr:0.0000100 epoch_Time:28023.0min: [2024-01-03 09:11:09,621][model8_pretrain.py][INFO] Epoch:[0/2](147600/4588595) loss:2.664 lr:0.0000100 epoch_Time:28023.0min: [2024-01-03 09:11:09,621][model8_pretrain.py][INFO] Epoch:[0/2](147600/4588595) loss:3.122 lr:0.0000100 epoch_Time:28023.0min: [2024-01-03 09:11:09,621][model8_pretrain.py][INFO] Epoch:[0/2](147600/4588595) loss:2.690 lr:0.0000100 
epoch_Time:28023.0min: [2024-01-03 09:11:09,621][model8_pretrain.py][INFO] Epoch:[0/2](147600/4588595) loss:3.138 lr:0.0000100 epoch_Time:28023.0min: [2024-01-03 09:11:09,621][model8_pretrain.py][INFO] Epoch:[0/2](147600/4588595) loss:3.110 lr:0.0000100 epoch_Time:28023.0min: [2024-01-03 09:11:09,621][model8_pretrain.py][INFO] Epoch:[0/2](147600/4588595) loss:2.567 lr:0.0000100 epoch_Time:28023.0min: [2024-01-03 09:11:46,547][model8_pretrain.py][INFO] Epoch:[0/2](147700/4588595) loss:2.769 lr:0.0000100 epoch_Time:28022.0min: [2024-01-03 09:11:46,547][model8_pretrain.py][INFO] Epoch:[0/2](147700/4588595) loss:3.277 lr:0.0000100 epoch_Time:28022.0min: [2024-01-03 09:11:46,547][model8_pretrain.py][INFO] Epoch:[0/2](147700/4588595) loss:3.430 lr:0.0000100 epoch_Time:28022.0min: [2024-01-03 09:11:46,547][model8_pretrain.py][INFO] Epoch:[0/2](147700/4588595) loss:3.085 lr:0.0000100 epoch_Time:28022.0min: [2024-01-03 09:11:46,547][model8_pretrain.py][INFO] Epoch:[0/2](147700/4588595) loss:2.547 lr:0.0000100 epoch_Time:28022.0min: [2024-01-03 09:11:46,547][model8_pretrain.py][INFO] Epoch:[0/2](147700/4588595) loss:2.531 lr:0.0000100 epoch_Time:28022.0min: [2024-01-03 09:11:46,547][model8_pretrain.py][INFO] Epoch:[0/2](147700/4588595) loss:2.918 lr:0.0000100 epoch_Time:28022.0min: [2024-01-03 09:11:46,548][model8_pretrain.py][INFO] Epoch:[0/2](147700/4588595) loss:3.524 lr:0.0000100 epoch_Time:28022.0min: [2024-01-03 09:12:23,497][model8_pretrain.py][INFO] Epoch:[0/2](147800/4588595) loss:3.202 lr:0.0000100 epoch_Time:28021.0min: [2024-01-03 09:12:23,497][model8_pretrain.py][INFO] Epoch:[0/2](147800/4588595) loss:3.080 lr:0.0000100 epoch_Time:28021.0min: [2024-01-03 09:12:23,497][model8_pretrain.py][INFO] Epoch:[0/2](147800/4588595) loss:3.080 lr:0.0000100 epoch_Time:28021.0min: [2024-01-03 09:12:23,497][model8_pretrain.py][INFO] Epoch:[0/2](147800/4588595) loss:3.024 lr:0.0000100 epoch_Time:28021.0min: [2024-01-03 09:12:23,497][model8_pretrain.py][INFO] 
Epoch:[0/2](147800/4588595) loss:3.164 lr:0.0000100 epoch_Time:28021.0min: [2024-01-03 09:12:23,497][model8_pretrain.py][INFO] Epoch:[0/2](147800/4588595) loss:2.864 lr:0.0000100 epoch_Time:28021.0min: [2024-01-03 09:12:23,497][model8_pretrain.py][INFO] Epoch:[0/2](147800/4588595) loss:3.093 lr:0.0000100 epoch_Time:28021.0min: [2024-01-03 09:12:23,497][model8_pretrain.py][INFO] Epoch:[0/2](147800/4588595) loss:2.978 lr:0.0000100 epoch_Time:28021.0min: [2024-01-03 09:13:00,430][model8_pretrain.py][INFO] Epoch:[0/2](147900/4588595) loss:3.067 lr:0.0000100 epoch_Time:28019.0min: [2024-01-03 09:13:00,430][model8_pretrain.py][INFO] Epoch:[0/2](147900/4588595) loss:3.085 lr:0.0000100 epoch_Time:28019.0min: [2024-01-03 09:13:00,430][model8_pretrain.py][INFO] Epoch:[0/2](147900/4588595) loss:2.479 lr:0.0000100 epoch_Time:28019.0min: [2024-01-03 09:13:00,430][model8_pretrain.py][INFO] Epoch:[0/2](147900/4588595) loss:2.716 lr:0.0000100 epoch_Time:28019.0min: [2024-01-03 09:13:00,430][model8_pretrain.py][INFO] Epoch:[0/2](147900/4588595) loss:3.230 lr:0.0000100 epoch_Time:28019.0min: [2024-01-03 09:13:00,430][model8_pretrain.py][INFO] Epoch:[0/2](147900/4588595) loss:3.209 lr:0.0000100 epoch_Time:28019.0min: [2024-01-03 09:13:00,430][model8_pretrain.py][INFO] Epoch:[0/2](147900/4588595) loss:3.213 lr:0.0000100 epoch_Time:28019.0min: [2024-01-03 09:13:00,430][model8_pretrain.py][INFO] Epoch:[0/2](147900/4588595) loss:2.172 lr:0.0000100 epoch_Time:28019.0min: [2024-01-03 09:13:37,361][model8_pretrain.py][INFO] Epoch:[0/2](148000/4588595) loss:2.257 lr:0.0000100 epoch_Time:28019.0min: [2024-01-03 09:13:37,361][model8_pretrain.py][INFO] Epoch:[0/2](148000/4588595) loss:3.255 lr:0.0000100 epoch_Time:28019.0min: [2024-01-03 09:13:37,361][model8_pretrain.py][INFO] Epoch:[0/2](148000/4588595) loss:2.556 lr:0.0000100 epoch_Time:28019.0min: [2024-01-03 09:13:37,361][model8_pretrain.py][INFO] Epoch:[0/2](148000/4588595) loss:3.057 lr:0.0000100 epoch_Time:28019.0min: [2024-01-03 
09:13:37,361][model8_pretrain.py][INFO] Epoch:[0/2](148000/4588595) loss:3.063 lr:0.0000100 epoch_Time:28019.0min: [2024-01-03 09:13:37,362][model8_pretrain.py][INFO] Epoch:[0/2](148000/4588595) loss:2.934 lr:0.0000100 epoch_Time:28019.0min: [2024-01-03 09:13:37,362][model8_pretrain.py][INFO] Epoch:[0/2](148000/4588595) loss:3.033 lr:0.0000100 epoch_Time:28019.0min: [2024-01-03 09:13:37,362][model8_pretrain.py][INFO] Epoch:[0/2](148000/4588595) loss:3.352 lr:0.0000100 epoch_Time:28019.0min: [2024-01-03 09:14:14,303][model8_pretrain.py][INFO] Epoch:[0/2](148100/4588595) loss:2.659 lr:0.0000100 epoch_Time:28017.0min: [2024-01-03 09:14:14,303][model8_pretrain.py][INFO] Epoch:[0/2](148100/4588595) loss:2.470 lr:0.0000100 epoch_Time:28017.0min: [2024-01-03 09:14:14,303][model8_pretrain.py][INFO] Epoch:[0/2](148100/4588595) loss:3.358 lr:0.0000100 epoch_Time:28017.0min: [2024-01-03 09:14:14,303][model8_pretrain.py][INFO] Epoch:[0/2](148100/4588595) loss:3.228 lr:0.0000100 epoch_Time:28017.0min: [2024-01-03 09:14:14,303][model8_pretrain.py][INFO] Epoch:[0/2](148100/4588595) loss:2.664 lr:0.0000100 epoch_Time:28017.0min: [2024-01-03 09:14:14,303][model8_pretrain.py][INFO] Epoch:[0/2](148100/4588595) loss:2.393 lr:0.0000100 epoch_Time:28017.0min: [2024-01-03 09:14:14,303][model8_pretrain.py][INFO] Epoch:[0/2](148100/4588595) loss:2.868 lr:0.0000100 epoch_Time:28017.0min: [2024-01-03 09:14:14,304][model8_pretrain.py][INFO] Epoch:[0/2](148100/4588595) loss:3.514 lr:0.0000100 epoch_Time:28017.0min: [2024-01-03 09:14:51,254][model8_pretrain.py][INFO] Epoch:[0/2](148200/4588595) loss:2.814 lr:0.0000100 epoch_Time:28016.0min: [2024-01-03 09:14:51,254][model8_pretrain.py][INFO] Epoch:[0/2](148200/4588595) loss:3.174 lr:0.0000100 epoch_Time:28016.0min: [2024-01-03 09:14:51,254][model8_pretrain.py][INFO] Epoch:[0/2](148200/4588595) loss:2.795 lr:0.0000100 epoch_Time:28016.0min: [2024-01-03 09:14:51,254][model8_pretrain.py][INFO] Epoch:[0/2](148200/4588595) loss:2.924 lr:0.0000100 
epoch_Time:28016.0min: [2024-01-03 09:14:51,254][model8_pretrain.py][INFO] Epoch:[0/2](148200/4588595) loss:3.266 lr:0.0000100 epoch_Time:28016.0min: [2024-01-03 09:14:51,254][model8_pretrain.py][INFO] Epoch:[0/2](148200/4588595) loss:3.210 lr:0.0000100 epoch_Time:28016.0min: [2024-01-03 09:14:51,254][model8_pretrain.py][INFO] Epoch:[0/2](148200/4588595) loss:2.761 lr:0.0000100 epoch_Time:28016.0min: [2024-01-03 09:14:51,254][model8_pretrain.py][INFO] Epoch:[0/2](148200/4588595) loss:3.372 lr:0.0000100 epoch_Time:28016.0min: [2024-01-03 09:15:31,668][model8_pretrain.py][INFO] Epoch:[0/2](148300/4588595) loss:2.461 lr:0.0000100 epoch_Time:28017.0min: [2024-01-03 09:15:31,668][model8_pretrain.py][INFO] Epoch:[0/2](148300/4588595) loss:2.586 lr:0.0000100 epoch_Time:28017.0min: [2024-01-03 09:15:31,668][model8_pretrain.py][INFO] Epoch:[0/2](148300/4588595) loss:3.140 lr:0.0000100 epoch_Time:28017.0min: [2024-01-03 09:15:31,668][model8_pretrain.py][INFO] Epoch:[0/2](148300/4588595) loss:3.243 lr:0.0000100 epoch_Time:28017.0min: [2024-01-03 09:15:31,668][model8_pretrain.py][INFO] Epoch:[0/2](148300/4588595) loss:2.986 lr:0.0000100 epoch_Time:28017.0min: [2024-01-03 09:15:31,668][model8_pretrain.py][INFO] Epoch:[0/2](148300/4588595) loss:3.105 lr:0.0000100 epoch_Time:28017.0min: [2024-01-03 09:15:31,668][model8_pretrain.py][INFO] Epoch:[0/2](148300/4588595) loss:2.792 lr:0.0000100 epoch_Time:28017.0min: [2024-01-03 09:15:31,668][model8_pretrain.py][INFO] Epoch:[0/2](148300/4588595) loss:2.837 lr:0.0000100 epoch_Time:28017.0min: [2024-01-03 09:16:13,869][model8_pretrain.py][INFO] Epoch:[0/2](148400/4588595) loss:3.107 lr:0.0000100 epoch_Time:28018.0min: [2024-01-03 09:16:13,869][model8_pretrain.py][INFO] Epoch:[0/2](148400/4588595) loss:2.908 lr:0.0000100 epoch_Time:28018.0min: [2024-01-03 09:16:13,869][model8_pretrain.py][INFO] Epoch:[0/2](148400/4588595) loss:3.241 lr:0.0000100 epoch_Time:28018.0min: [2024-01-03 09:16:13,869][model8_pretrain.py][INFO] 
Epoch:[0/2](148400/4588595) loss:2.460 lr:0.0000100 epoch_Time:28018.0min: [2024-01-03 09:16:13,869][model8_pretrain.py][INFO] Epoch:[0/2](148400/4588595) loss:2.780 lr:0.0000100 epoch_Time:28018.0min: [2024-01-03 09:16:13,869][model8_pretrain.py][INFO] Epoch:[0/2](148400/4588595) loss:2.672 lr:0.0000100 epoch_Time:28018.0min: [2024-01-03 09:16:13,870][model8_pretrain.py][INFO] Epoch:[0/2](148400/4588595) loss:2.720 lr:0.0000100 epoch_Time:28018.0min: [2024-01-03 09:16:13,870][model8_pretrain.py][INFO] Epoch:[0/2](148400/4588595) loss:2.957 lr:0.0000100 epoch_Time:28018.0min: [2024-01-03 09:16:50,792][model8_pretrain.py][INFO] Epoch:[0/2](148500/4588595) loss:2.880 lr:0.0000100 epoch_Time:28017.0min: [2024-01-03 09:16:50,792][model8_pretrain.py][INFO] Epoch:[0/2](148500/4588595) loss:2.888 lr:0.0000100 epoch_Time:28017.0min: [2024-01-03 09:16:50,792][model8_pretrain.py][INFO] Epoch:[0/2](148500/4588595) loss:2.840 lr:0.0000100 epoch_Time:28017.0min: [2024-01-03 09:16:50,792][model8_pretrain.py][INFO] Epoch:[0/2](148500/4588595) loss:2.922 lr:0.0000100 epoch_Time:28017.0min: [2024-01-03 09:16:50,792][model8_pretrain.py][INFO] Epoch:[0/2](148500/4588595) loss:3.326 lr:0.0000100 epoch_Time:28017.0min: [2024-01-03 09:16:50,792][model8_pretrain.py][INFO] Epoch:[0/2](148500/4588595) loss:2.752 lr:0.0000100 epoch_Time:28017.0min: [2024-01-03 09:16:50,792][model8_pretrain.py][INFO] Epoch:[0/2](148500/4588595) loss:3.366 lr:0.0000100 epoch_Time:28017.0min: [2024-01-03 09:16:50,792][model8_pretrain.py][INFO] Epoch:[0/2](148500/4588595) loss:3.585 lr:0.0000100 epoch_Time:28017.0min: [2024-01-03 09:17:27,718][model8_pretrain.py][INFO] Epoch:[0/2](148600/4588595) loss:2.895 lr:0.0000100 epoch_Time:28016.0min: [2024-01-03 09:17:27,718][model8_pretrain.py][INFO] Epoch:[0/2](148600/4588595) loss:3.009 lr:0.0000100 epoch_Time:28016.0min: [2024-01-03 09:17:27,718][model8_pretrain.py][INFO] Epoch:[0/2](148600/4588595) loss:3.164 lr:0.0000100 epoch_Time:28016.0min: [2024-01-03 
09:17:27,718][model8_pretrain.py][INFO] Epoch:[0/2](148600/4588595) loss:2.832 lr:0.0000100 epoch_Time:28016.0min: [2024-01-03 09:17:27,718][model8_pretrain.py][INFO] Epoch:[0/2](148600/4588595) loss:3.242 lr:0.0000100 epoch_Time:28016.0min: [2024-01-03 09:17:27,718][model8_pretrain.py][INFO] Epoch:[0/2](148600/4588595) loss:2.872 lr:0.0000100 epoch_Time:28016.0min: [2024-01-03 09:17:27,719][model8_pretrain.py][INFO] Epoch:[0/2](148600/4588595) loss:2.909 lr:0.0000100 epoch_Time:28016.0min: [2024-01-03 09:17:27,719][model8_pretrain.py][INFO] Epoch:[0/2](148600/4588595) loss:2.903 lr:0.0000100 epoch_Time:28016.0min: [2024-01-03 09:18:04,658][model8_pretrain.py][INFO] Epoch:[0/2](148700/4588595) loss:3.347 lr:0.0000100 epoch_Time:28015.0min: [2024-01-03 09:18:04,658][model8_pretrain.py][INFO] Epoch:[0/2](148700/4588595) loss:1.963 lr:0.0000100 epoch_Time:28015.0min: [2024-01-03 09:18:04,658][model8_pretrain.py][INFO] Epoch:[0/2](148700/4588595) loss:3.008 lr:0.0000100 epoch_Time:28015.0min: [2024-01-03 09:18:04,658][model8_pretrain.py][INFO] Epoch:[0/2](148700/4588595) loss:3.132 lr:0.0000100 epoch_Time:28015.0min: [2024-01-03 09:18:04,658][model8_pretrain.py][INFO] Epoch:[0/2](148700/4588595) loss:2.901 lr:0.0000100 epoch_Time:28015.0min: [2024-01-03 09:18:04,658][model8_pretrain.py][INFO] Epoch:[0/2](148700/4588595) loss:2.217 lr:0.0000100 epoch_Time:28015.0min: [2024-01-03 09:18:04,658][model8_pretrain.py][INFO] Epoch:[0/2](148700/4588595) loss:2.915 lr:0.0000100 epoch_Time:28015.0min: [2024-01-03 09:18:04,658][model8_pretrain.py][INFO] Epoch:[0/2](148700/4588595) loss:2.930 lr:0.0000100 epoch_Time:28015.0min: [2024-01-03 09:18:41,606][model8_pretrain.py][INFO] Epoch:[0/2](148800/4588595) loss:3.349 lr:0.0000100 epoch_Time:28014.0min: [2024-01-03 09:18:41,606][model8_pretrain.py][INFO] Epoch:[0/2](148800/4588595) loss:3.479 lr:0.0000100 epoch_Time:28014.0min: [2024-01-03 09:18:41,606][model8_pretrain.py][INFO] Epoch:[0/2](148800/4588595) loss:3.071 lr:0.0000100 
epoch_Time:28014.0min: [2024-01-03 09:18:41,606][model8_pretrain.py][INFO] Epoch:[0/2](148800/4588595) loss:2.734 lr:0.0000100 epoch_Time:28014.0min: [2024-01-03 09:18:41,606][model8_pretrain.py][INFO] Epoch:[0/2](148800/4588595) loss:2.927 lr:0.0000100 epoch_Time:28014.0min: [2024-01-03 09:18:41,606][model8_pretrain.py][INFO] Epoch:[0/2](148800/4588595) loss:2.710 lr:0.0000100 epoch_Time:28014.0min: [2024-01-03 09:18:41,606][model8_pretrain.py][INFO] Epoch:[0/2](148800/4588595) loss:2.877 lr:0.0000100 epoch_Time:28014.0min: [2024-01-03 09:18:41,606][model8_pretrain.py][INFO] Epoch:[0/2](148800/4588595) loss:2.905 lr:0.0000100 epoch_Time:28014.0min: [2024-01-03 09:19:18,539][model8_pretrain.py][INFO] Epoch:[0/2](148900/4588595) loss:2.719 lr:0.0000100 epoch_Time:28013.0min: [2024-01-03 09:19:18,539][model8_pretrain.py][INFO] Epoch:[0/2](148900/4588595) loss:2.826 lr:0.0000100 epoch_Time:28013.0min: [2024-01-03 09:19:18,539][model8_pretrain.py][INFO] Epoch:[0/2](148900/4588595) loss:2.877 lr:0.0000100 epoch_Time:28013.0min: [2024-01-03 09:19:18,539][model8_pretrain.py][INFO] Epoch:[0/2](148900/4588595) loss:2.727 lr:0.0000100 epoch_Time:28013.0min: [2024-01-03 09:19:18,539][model8_pretrain.py][INFO] Epoch:[0/2](148900/4588595) loss:3.092 lr:0.0000100 epoch_Time:28013.0min: [2024-01-03 09:19:18,539][model8_pretrain.py][INFO] Epoch:[0/2](148900/4588595) loss:2.297 lr:0.0000100 epoch_Time:28013.0min: [2024-01-03 09:19:18,539][model8_pretrain.py][INFO] Epoch:[0/2](148900/4588595) loss:3.050 lr:0.0000100 epoch_Time:28013.0min: [2024-01-03 09:19:18,539][model8_pretrain.py][INFO] Epoch:[0/2](148900/4588595) loss:2.702 lr:0.0000100 epoch_Time:28013.0min: [2024-01-03 09:19:55,504][model8_pretrain.py][INFO] Epoch:[0/2](149000/4588595) loss:2.987 lr:0.0000100 epoch_Time:28012.0min: [2024-01-03 09:19:55,504][model8_pretrain.py][INFO] Epoch:[0/2](149000/4588595) loss:2.911 lr:0.0000100 epoch_Time:28012.0min: [2024-01-03 09:19:55,504][model8_pretrain.py][INFO] 
Epoch:[0/2](149000/4588595) loss:3.167 lr:0.0000100 epoch_Time:28012.0min: [2024-01-03 09:19:55,504][model8_pretrain.py][INFO] Epoch:[0/2](149000/4588595) loss:2.855 lr:0.0000100 epoch_Time:28012.0min: [2024-01-03 09:19:55,504][model8_pretrain.py][INFO] Epoch:[0/2](149000/4588595) loss:3.339 lr:0.0000100 epoch_Time:28012.0min: [2024-01-03 09:19:55,504][model8_pretrain.py][INFO] Epoch:[0/2](149000/4588595) loss:2.610 lr:0.0000100 epoch_Time:28012.0min: [2024-01-03 09:19:55,504][model8_pretrain.py][INFO] Epoch:[0/2](149000/4588595) loss:2.903 lr:0.0000100 epoch_Time:28012.0min: [2024-01-03 09:19:55,504][model8_pretrain.py][INFO] Epoch:[0/2](149000/4588595) loss:2.777 lr:0.0000100 epoch_Time:28012.0min: [2024-01-03 09:20:35,873][model8_pretrain.py][INFO] Epoch:[0/2](149100/4588595) loss:3.001 lr:0.0000100 epoch_Time:28013.0min: [2024-01-03 09:20:35,873][model8_pretrain.py][INFO] Epoch:[0/2](149100/4588595) loss:3.067 lr:0.0000100 epoch_Time:28013.0min: [2024-01-03 09:20:35,873][model8_pretrain.py][INFO] Epoch:[0/2](149100/4588595) loss:3.248 lr:0.0000100 epoch_Time:28013.0min: [2024-01-03 09:20:35,873][model8_pretrain.py][INFO] Epoch:[0/2](149100/4588595) loss:3.221 lr:0.0000100 epoch_Time:28013.0min: [2024-01-03 09:20:35,873][model8_pretrain.py][INFO] Epoch:[0/2](149100/4588595) loss:3.250 lr:0.0000100 epoch_Time:28013.0min: [2024-01-03 09:20:35,873][model8_pretrain.py][INFO] Epoch:[0/2](149100/4588595) loss:2.114 lr:0.0000100 epoch_Time:28013.0min: [2024-01-03 09:20:35,873][model8_pretrain.py][INFO] Epoch:[0/2](149100/4588595) loss:3.121 lr:0.0000100 epoch_Time:28013.0min: [2024-01-03 09:20:35,878][model8_pretrain.py][INFO] Epoch:[0/2](149100/4588595) loss:2.893 lr:0.0000100 epoch_Time:28013.0min: [2024-01-03 09:21:18,144][model8_pretrain.py][INFO] Epoch:[0/2](149200/4588595) loss:2.611 lr:0.0000100 epoch_Time:28014.0min: [2024-01-03 09:21:18,144][model8_pretrain.py][INFO] Epoch:[0/2](149200/4588595) loss:2.859 lr:0.0000100 epoch_Time:28014.0min: [2024-01-03 
09:21:18,144][model8_pretrain.py][INFO] Epoch:[0/2](149200/4588595) loss:3.203 lr:0.0000100 epoch_Time:28014.0min: [2024-01-03 09:21:18,144][model8_pretrain.py][INFO] Epoch:[0/2](149200/4588595) loss:2.671 lr:0.0000100 epoch_Time:28014.0min: [2024-01-03 09:21:18,144][model8_pretrain.py][INFO] Epoch:[0/2](149200/4588595) loss:3.051 lr:0.0000100 epoch_Time:28014.0min: [2024-01-03 09:21:18,144][model8_pretrain.py][INFO] Epoch:[0/2](149200/4588595) loss:3.038 lr:0.0000100 epoch_Time:28014.0min: [2024-01-03 09:21:18,144][model8_pretrain.py][INFO] Epoch:[0/2](149200/4588595) loss:3.279 lr:0.0000100 epoch_Time:28014.0min: [2024-01-03 09:21:18,144][model8_pretrain.py][INFO] Epoch:[0/2](149200/4588595) loss:3.222 lr:0.0000100 epoch_Time:28014.0min: [2024-01-03 09:21:55,080][model8_pretrain.py][INFO] Epoch:[0/2](149300/4588595) loss:2.823 lr:0.0000100 epoch_Time:28013.0min: [2024-01-03 09:21:55,080][model8_pretrain.py][INFO] Epoch:[0/2](149300/4588595) loss:2.759 lr:0.0000100 epoch_Time:28013.0min: [2024-01-03 09:21:55,080][model8_pretrain.py][INFO] Epoch:[0/2](149300/4588595) loss:2.729 lr:0.0000100 epoch_Time:28013.0min: [2024-01-03 09:21:55,080][model8_pretrain.py][INFO] Epoch:[0/2](149300/4588595) loss:3.396 lr:0.0000100 epoch_Time:28013.0min: [2024-01-03 09:21:55,080][model8_pretrain.py][INFO] Epoch:[0/2](149300/4588595) loss:2.588 lr:0.0000100 epoch_Time:28013.0min: [2024-01-03 09:21:55,080][model8_pretrain.py][INFO] Epoch:[0/2](149300/4588595) loss:2.848 lr:0.0000100 epoch_Time:28013.0min: [2024-01-03 09:21:55,080][model8_pretrain.py][INFO] Epoch:[0/2](149300/4588595) loss:2.387 lr:0.0000100 epoch_Time:28013.0min: [2024-01-03 09:21:55,080][model8_pretrain.py][INFO] Epoch:[0/2](149300/4588595) loss:2.645 lr:0.0000100 epoch_Time:28013.0min: [2024-01-03 09:22:31,999][model8_pretrain.py][INFO] Epoch:[0/2](149400/4588595) loss:2.017 lr:0.0000100 epoch_Time:28012.0min: [2024-01-03 09:22:31,999][model8_pretrain.py][INFO] Epoch:[0/2](149400/4588595) loss:3.008 lr:0.0000100 
epoch_Time:28012.0min: [2024-01-03 09:22:31,999][model8_pretrain.py][INFO] Epoch:[0/2](149400/4588595) loss:3.094 lr:0.0000100 epoch_Time:28012.0min: [2024-01-03 09:22:31,999][model8_pretrain.py][INFO] Epoch:[0/2](149400/4588595) loss:2.885 lr:0.0000100 epoch_Time:28012.0min: [2024-01-03 09:22:31,999][model8_pretrain.py][INFO] Epoch:[0/2](149400/4588595) loss:3.069 lr:0.0000100 epoch_Time:28012.0min: [2024-01-03 09:22:31,999][model8_pretrain.py][INFO] Epoch:[0/2](149400/4588595) loss:2.830 lr:0.0000100 epoch_Time:28012.0min: [2024-01-03 09:22:31,999][model8_pretrain.py][INFO] Epoch:[0/2](149400/4588595) loss:3.315 lr:0.0000100 epoch_Time:28012.0min: [2024-01-03 09:22:32,000][model8_pretrain.py][INFO] Epoch:[0/2](149400/4588595) loss:3.479 lr:0.0000100 epoch_Time:28012.0min: [2024-01-03 09:23:08,928][model8_pretrain.py][INFO] Epoch:[0/2](149500/4588595) loss:2.863 lr:0.0000100 epoch_Time:28011.0min: [2024-01-03 09:23:08,928][model8_pretrain.py][INFO] Epoch:[0/2](149500/4588595) loss:3.095 lr:0.0000100 epoch_Time:28011.0min: [2024-01-03 09:23:08,928][model8_pretrain.py][INFO] Epoch:[0/2](149500/4588595) loss:3.317 lr:0.0000100 epoch_Time:28011.0min: [2024-01-03 09:23:08,928][model8_pretrain.py][INFO] Epoch:[0/2](149500/4588595) loss:2.964 lr:0.0000100 epoch_Time:28011.0min: [2024-01-03 09:23:08,928][model8_pretrain.py][INFO] Epoch:[0/2](149500/4588595) loss:2.912 lr:0.0000100 epoch_Time:28011.0min: [2024-01-03 09:23:08,928][model8_pretrain.py][INFO] Epoch:[0/2](149500/4588595) loss:3.168 lr:0.0000100 epoch_Time:28011.0min: [2024-01-03 09:23:08,928][model8_pretrain.py][INFO] Epoch:[0/2](149500/4588595) loss:2.779 lr:0.0000100 epoch_Time:28011.0min: [2024-01-03 09:23:08,929][model8_pretrain.py][INFO] Epoch:[0/2](149500/4588595) loss:3.111 lr:0.0000100 epoch_Time:28011.0min: [2024-01-03 09:23:45,867][model8_pretrain.py][INFO] Epoch:[0/2](149600/4588595) loss:3.262 lr:0.0000100 epoch_Time:28010.0min: [2024-01-03 09:23:45,867][model8_pretrain.py][INFO] 
Epoch:[0/2](149600/4588595) loss:3.157 lr:0.0000100 epoch_Time:28010.0min: [2024-01-03 09:23:45,867][model8_pretrain.py][INFO] Epoch:[0/2](149600/4588595) loss:3.432 lr:0.0000100 epoch_Time:28010.0min: [2024-01-03 09:23:45,867][model8_pretrain.py][INFO] Epoch:[0/2](149600/4588595) loss:2.992 lr:0.0000100 epoch_Time:28010.0min: [2024-01-03 09:23:45,867][model8_pretrain.py][INFO] Epoch:[0/2](149600/4588595) loss:2.940 lr:0.0000100 epoch_Time:28010.0min: [2024-01-03 09:23:45,867][model8_pretrain.py][INFO] Epoch:[0/2](149600/4588595) loss:2.580 lr:0.0000100 epoch_Time:28010.0min: [2024-01-03 09:23:45,867][model8_pretrain.py][INFO] Epoch:[0/2](149600/4588595) loss:3.085 lr:0.0000100 epoch_Time:28010.0min: [2024-01-03 09:23:45,867][model8_pretrain.py][INFO] Epoch:[0/2](149600/4588595) loss:2.686 lr:0.0000100 epoch_Time:28010.0min: [2024-01-03 09:24:22,815][model8_pretrain.py][INFO] Epoch:[0/2](149700/4588595) loss:3.052 lr:0.0000100 epoch_Time:28009.0min: [2024-01-03 09:24:22,815][model8_pretrain.py][INFO] Epoch:[0/2](149700/4588595) loss:2.679 lr:0.0000100 epoch_Time:28009.0min: [2024-01-03 09:24:22,815][model8_pretrain.py][INFO] Epoch:[0/2](149700/4588595) loss:3.044 lr:0.0000100 epoch_Time:28009.0min: [2024-01-03 09:24:22,815][model8_pretrain.py][INFO] Epoch:[0/2](149700/4588595) loss:2.940 lr:0.0000100 epoch_Time:28009.0min: [2024-01-03 09:24:22,815][model8_pretrain.py][INFO] Epoch:[0/2](149700/4588595) loss:2.943 lr:0.0000100 epoch_Time:28009.0min: [2024-01-03 09:24:22,815][model8_pretrain.py][INFO] Epoch:[0/2](149700/4588595) loss:2.905 lr:0.0000100 epoch_Time:28009.0min: [2024-01-03 09:24:22,815][model8_pretrain.py][INFO] Epoch:[0/2](149700/4588595) loss:2.531 lr:0.0000100 epoch_Time:28009.0min: [2024-01-03 09:24:22,815][model8_pretrain.py][INFO] Epoch:[0/2](149700/4588595) loss:3.057 lr:0.0000100 epoch_Time:28009.0min: [2024-01-03 09:24:59,752][model8_pretrain.py][INFO] Epoch:[0/2](149800/4588595) loss:2.436 lr:0.0000100 epoch_Time:28007.0min: [2024-01-03 
09:24:59,752][model8_pretrain.py][INFO] Epoch:[0/2](149800/4588595) loss:3.120 lr:0.0000100 epoch_Time:28007.0min: [2024-01-03 09:24:59,752][model8_pretrain.py][INFO] Epoch:[0/2](149800/4588595) loss:3.473 lr:0.0000100 epoch_Time:28007.0min: [2024-01-03 09:24:59,752][model8_pretrain.py][INFO] Epoch:[0/2](149800/4588595) loss:3.184 lr:0.0000100 epoch_Time:28007.0min: [2024-01-03 09:24:59,752][model8_pretrain.py][INFO] Epoch:[0/2](149800/4588595) loss:3.113 lr:0.0000100 epoch_Time:28007.0min: [2024-01-03 09:24:59,752][model8_pretrain.py][INFO] Epoch:[0/2](149800/4588595) loss:2.956 lr:0.0000100 epoch_Time:28007.0min: [2024-01-03 09:24:59,752][model8_pretrain.py][INFO] Epoch:[0/2](149800/4588595) loss:3.046 lr:0.0000100 epoch_Time:28007.0min: [2024-01-03 09:24:59,752][model8_pretrain.py][INFO] Epoch:[0/2](149800/4588595) loss:2.927 lr:0.0000100 epoch_Time:28007.0min: [2024-01-03 09:25:36,700][model8_pretrain.py][INFO] Epoch:[0/2](149900/4588595) loss:2.903 lr:0.0000100 epoch_Time:28007.0min: [2024-01-03 09:25:36,700][model8_pretrain.py][INFO] Epoch:[0/2](149900/4588595) loss:2.738 lr:0.0000100 epoch_Time:28007.0min: [2024-01-03 09:25:36,700][model8_pretrain.py][INFO] Epoch:[0/2](149900/4588595) loss:3.062 lr:0.0000100 epoch_Time:28007.0min: [2024-01-03 09:25:36,700][model8_pretrain.py][INFO] Epoch:[0/2](149900/4588595) loss:3.423 lr:0.0000100 epoch_Time:28007.0min: [2024-01-03 09:25:36,700][model8_pretrain.py][INFO] Epoch:[0/2](149900/4588595) loss:3.416 lr:0.0000100 epoch_Time:28007.0min: [2024-01-03 09:25:36,700][model8_pretrain.py][INFO] Epoch:[0/2](149900/4588595) loss:3.230 lr:0.0000100 epoch_Time:28007.0min: [2024-01-03 09:25:36,700][model8_pretrain.py][INFO] Epoch:[0/2](149900/4588595) loss:3.415 lr:0.0000100 epoch_Time:28007.0min: [2024-01-03 09:25:36,700][model8_pretrain.py][INFO] Epoch:[0/2](149900/4588595) loss:2.974 lr:0.0000100 epoch_Time:28007.0min: [2024-01-03 09:26:22,357][model8_pretrain.py][INFO] Epoch:[0/2](150000/4588595) loss:2.945 lr:0.0000100 
epoch_Time:28010.0min: [2024-01-03 09:26:22,357][model8_pretrain.py][INFO] Epoch:[0/2](150000/4588595) loss:3.334 lr:0.0000100 epoch_Time:28010.0min: [2024-01-03 09:26:22,357][model8_pretrain.py][INFO] Epoch:[0/2](150000/4588595) loss:3.224 lr:0.0000100 epoch_Time:28010.0min: [2024-01-03 09:26:22,357][model8_pretrain.py][INFO] Epoch:[0/2](150000/4588595) loss:2.069 lr:0.0000100 epoch_Time:28010.0min: [2024-01-03 09:26:22,357][model8_pretrain.py][INFO] Epoch:[0/2](150000/4588595) loss:3.168 lr:0.0000100 epoch_Time:28010.0min: [2024-01-03 09:26:22,357][model8_pretrain.py][INFO] Epoch:[0/2](150000/4588595) loss:3.122 lr:0.0000100 epoch_Time:28010.0min: [2024-01-03 09:26:22,357][model8_pretrain.py][INFO] Epoch:[0/2](150000/4588595) loss:3.177 lr:0.0000100 epoch_Time:28010.0min: [2024-01-03 09:26:22,357][model8_pretrain.py][INFO] Epoch:[0/2](150000/4588595) loss:3.068 lr:0.0000100 epoch_Time:28010.0min: [2024-01-03 09:26:59,284][model8_pretrain.py][INFO] Epoch:[0/2](150100/4588595) loss:2.903 lr:0.0000100 epoch_Time:28008.0min: [2024-01-03 09:26:59,284][model8_pretrain.py][INFO] Epoch:[0/2](150100/4588595) loss:2.852 lr:0.0000100 epoch_Time:28008.0min: [2024-01-03 09:26:59,284][model8_pretrain.py][INFO] Epoch:[0/2](150100/4588595) loss:3.184 lr:0.0000100 epoch_Time:28008.0min: [2024-01-03 09:26:59,284][model8_pretrain.py][INFO] Epoch:[0/2](150100/4588595) loss:2.881 lr:0.0000100 epoch_Time:28008.0min: [2024-01-03 09:26:59,284][model8_pretrain.py][INFO] Epoch:[0/2](150100/4588595) loss:2.977 lr:0.0000100 epoch_Time:28008.0min: [2024-01-03 09:26:59,284][model8_pretrain.py][INFO] Epoch:[0/2](150100/4588595) loss:3.037 lr:0.0000100 epoch_Time:28008.0min: [2024-01-03 09:26:59,284][model8_pretrain.py][INFO] Epoch:[0/2](150100/4588595) loss:2.751 lr:0.0000100 epoch_Time:28008.0min: [2024-01-03 09:26:59,284][model8_pretrain.py][INFO] Epoch:[0/2](150100/4588595) loss:3.123 lr:0.0000100 epoch_Time:28008.0min: [2024-01-03 09:27:36,205][model8_pretrain.py][INFO] 
Epoch:[0/2](150200/4588595) loss:2.648 lr:0.0000100 epoch_Time:28008.0min: [2024-01-03 09:27:36,205][model8_pretrain.py][INFO] Epoch:[0/2](150200/4588595) loss:2.509 lr:0.0000100 epoch_Time:28008.0min: [2024-01-03 09:27:36,205][model8_pretrain.py][INFO] Epoch:[0/2](150200/4588595) loss:3.105 lr:0.0000100 epoch_Time:28008.0min: [2024-01-03 09:27:36,205][model8_pretrain.py][INFO] Epoch:[0/2](150200/4588595) loss:2.555 lr:0.0000100 epoch_Time:28008.0min: [2024-01-03 09:27:36,205][model8_pretrain.py][INFO] Epoch:[0/2](150200/4588595) loss:2.928 lr:0.0000100 epoch_Time:28008.0min: [2024-01-03 09:27:36,205][model8_pretrain.py][INFO] Epoch:[0/2](150200/4588595) loss:3.329 lr:0.0000100 epoch_Time:28008.0min: [2024-01-03 09:27:36,205][model8_pretrain.py][INFO] Epoch:[0/2](150200/4588595) loss:2.398 lr:0.0000100 epoch_Time:28008.0min: [2024-01-03 09:27:36,205][model8_pretrain.py][INFO] Epoch:[0/2](150200/4588595) loss:3.023 lr:0.0000100 epoch_Time:28008.0min: [2024-01-03 09:28:13,133][model8_pretrain.py][INFO] Epoch:[0/2](150300/4588595) loss:3.046 lr:0.0000100 epoch_Time:28006.0min: [2024-01-03 09:28:13,133][model8_pretrain.py][INFO] Epoch:[0/2](150300/4588595) loss:2.618 lr:0.0000100 epoch_Time:28006.0min: [2024-01-03 09:28:13,133][model8_pretrain.py][INFO] Epoch:[0/2](150300/4588595) loss:2.920 lr:0.0000100 epoch_Time:28006.0min: [2024-01-03 09:28:13,133][model8_pretrain.py][INFO] Epoch:[0/2](150300/4588595) loss:2.894 lr:0.0000100 epoch_Time:28006.0min: [2024-01-03 09:28:13,133][model8_pretrain.py][INFO] Epoch:[0/2](150300/4588595) loss:2.091 lr:0.0000100 epoch_Time:28006.0min: [2024-01-03 09:28:13,133][model8_pretrain.py][INFO] Epoch:[0/2](150300/4588595) loss:3.223 lr:0.0000100 epoch_Time:28006.0min: [2024-01-03 09:28:13,133][model8_pretrain.py][INFO] Epoch:[0/2](150300/4588595) loss:2.724 lr:0.0000100 epoch_Time:28006.0min: [2024-01-03 09:28:13,133][model8_pretrain.py][INFO] Epoch:[0/2](150300/4588595) loss:2.706 lr:0.0000100 epoch_Time:28006.0min: [2024-01-03 
09:28:50,070][model8_pretrain.py][INFO] Epoch:[0/2](150400/4588595) loss:2.956 lr:0.0000100 epoch_Time:28005.0min: [2024-01-03 09:28:50,070][model8_pretrain.py][INFO] Epoch:[0/2](150400/4588595) loss:3.117 lr:0.0000100 epoch_Time:28005.0min: [2024-01-03 09:28:50,070][model8_pretrain.py][INFO] Epoch:[0/2](150400/4588595) loss:3.161 lr:0.0000100 epoch_Time:28005.0min: [2024-01-03 09:28:50,070][model8_pretrain.py][INFO] Epoch:[0/2](150400/4588595) loss:3.538 lr:0.0000100 epoch_Time:28005.0min: [2024-01-03 09:28:50,071][model8_pretrain.py][INFO] Epoch:[0/2](150400/4588595) loss:2.724 lr:0.0000100 epoch_Time:28005.0min: [2024-01-03 09:28:50,070][model8_pretrain.py][INFO] Epoch:[0/2](150400/4588595) loss:2.592 lr:0.0000100 epoch_Time:28005.0min: [2024-01-03 09:28:50,071][model8_pretrain.py][INFO] Epoch:[0/2](150400/4588595) loss:2.991 lr:0.0000100 epoch_Time:28005.0min: [2024-01-03 09:28:50,071][model8_pretrain.py][INFO] Epoch:[0/2](150400/4588595) loss:2.992 lr:0.0000100 epoch_Time:28005.0min: [2024-01-03 09:29:27,012][model8_pretrain.py][INFO] Epoch:[0/2](150500/4588595) loss:2.809 lr:0.0000100 epoch_Time:28004.0min: [2024-01-03 09:29:27,012][model8_pretrain.py][INFO] Epoch:[0/2](150500/4588595) loss:2.704 lr:0.0000100 epoch_Time:28004.0min: [2024-01-03 09:29:27,012][model8_pretrain.py][INFO] Epoch:[0/2](150500/4588595) loss:3.061 lr:0.0000100 epoch_Time:28004.0min: [2024-01-03 09:29:27,012][model8_pretrain.py][INFO] Epoch:[0/2](150500/4588595) loss:2.869 lr:0.0000100 epoch_Time:28004.0min: [2024-01-03 09:29:27,012][model8_pretrain.py][INFO] Epoch:[0/2](150500/4588595) loss:2.887 lr:0.0000100 epoch_Time:28004.0min: [2024-01-03 09:29:27,012][model8_pretrain.py][INFO] Epoch:[0/2](150500/4588595) loss:2.547 lr:0.0000100 epoch_Time:28004.0min: [2024-01-03 09:29:27,012][model8_pretrain.py][INFO] Epoch:[0/2](150500/4588595) loss:3.410 lr:0.0000100 epoch_Time:28004.0min: [2024-01-03 09:29:27,012][model8_pretrain.py][INFO] Epoch:[0/2](150500/4588595) loss:2.548 lr:0.0000100 
epoch_Time:28004.0min: [2024-01-03 09:30:03,938][model8_pretrain.py][INFO] Epoch:[0/2](150600/4588595) loss:2.642 lr:0.0000100 epoch_Time:28003.0min: [2024-01-03 09:30:03,938][model8_pretrain.py][INFO] Epoch:[0/2](150600/4588595) loss:3.014 lr:0.0000100 epoch_Time:28003.0min: [2024-01-03 09:30:03,938][model8_pretrain.py][INFO] Epoch:[0/2](150600/4588595) loss:2.695 lr:0.0000100 epoch_Time:28003.0min: [2024-01-03 09:30:03,938][model8_pretrain.py][INFO] Epoch:[0/2](150600/4588595) loss:3.069 lr:0.0000100 epoch_Time:28003.0min: [2024-01-03 09:30:03,938][model8_pretrain.py][INFO] Epoch:[0/2](150600/4588595) loss:2.793 lr:0.0000100 epoch_Time:28003.0min: [2024-01-03 09:30:03,938][model8_pretrain.py][INFO] Epoch:[0/2](150600/4588595) loss:2.951 lr:0.0000100 epoch_Time:28003.0min: [2024-01-03 09:30:03,938][model8_pretrain.py][INFO] Epoch:[0/2](150600/4588595) loss:3.033 lr:0.0000100 epoch_Time:28003.0min: [2024-01-03 09:30:03,938][model8_pretrain.py][INFO] Epoch:[0/2](150600/4588595) loss:2.496 lr:0.0000100 epoch_Time:28003.0min: [2024-01-03 09:30:40,875][model8_pretrain.py][INFO] Epoch:[0/2](150700/4588595) loss:2.758 lr:0.0000100 epoch_Time:28002.0min: [2024-01-03 09:30:40,875][model8_pretrain.py][INFO] Epoch:[0/2](150700/4588595) loss:3.130 lr:0.0000100 epoch_Time:28002.0min: [2024-01-03 09:30:40,875][model8_pretrain.py][INFO] Epoch:[0/2](150700/4588595) loss:3.234 lr:0.0000100 epoch_Time:28002.0min: [2024-01-03 09:30:40,875][model8_pretrain.py][INFO] Epoch:[0/2](150700/4588595) loss:3.133 lr:0.0000100 epoch_Time:28002.0min: [2024-01-03 09:30:40,875][model8_pretrain.py][INFO] Epoch:[0/2](150700/4588595) loss:2.988 lr:0.0000100 epoch_Time:28002.0min: [2024-01-03 09:30:40,875][model8_pretrain.py][INFO] Epoch:[0/2](150700/4588595) loss:3.116 lr:0.0000100 epoch_Time:28002.0min: [2024-01-03 09:30:40,875][model8_pretrain.py][INFO] Epoch:[0/2](150700/4588595) loss:2.151 lr:0.0000100 epoch_Time:28002.0min: [2024-01-03 09:30:40,875][model8_pretrain.py][INFO] 
Epoch:[0/2](150700/4588595) loss:3.386 lr:0.0000100 epoch_Time:28002.0min: [2024-01-03 09:31:26,562][model8_pretrain.py][INFO] Epoch:[0/2](150800/4588595) loss:2.468 lr:0.0000100 epoch_Time:28005.0min: [2024-01-03 09:31:26,562][model8_pretrain.py][INFO] Epoch:[0/2](150800/4588595) loss:2.788 lr:0.0000100 epoch_Time:28005.0min: [2024-01-03 09:31:26,562][model8_pretrain.py][INFO] Epoch:[0/2](150800/4588595) loss:3.125 lr:0.0000100 epoch_Time:28005.0min: [2024-01-03 09:31:26,562][model8_pretrain.py][INFO] Epoch:[0/2](150800/4588595) loss:2.777 lr:0.0000100 epoch_Time:28005.0min: [2024-01-03 09:31:26,562][model8_pretrain.py][INFO] Epoch:[0/2](150800/4588595) loss:3.062 lr:0.0000100 epoch_Time:28005.0min: [2024-01-03 09:31:26,562][model8_pretrain.py][INFO] Epoch:[0/2](150800/4588595) loss:3.387 lr:0.0000100 epoch_Time:28005.0min: [2024-01-03 09:31:26,563][model8_pretrain.py][INFO] Epoch:[0/2](150800/4588595) loss:3.007 lr:0.0000100 epoch_Time:28005.0min: [2024-01-03 09:31:26,563][model8_pretrain.py][INFO] Epoch:[0/2](150800/4588595) loss:2.984 lr:0.0000100 epoch_Time:28005.0min: [2024-01-03 09:32:03,494][model8_pretrain.py][INFO] Epoch:[0/2](150900/4588595) loss:2.815 lr:0.0000100 epoch_Time:28004.0min: [2024-01-03 09:32:03,494][model8_pretrain.py][INFO] Epoch:[0/2](150900/4588595) loss:3.391 lr:0.0000100 epoch_Time:28004.0min: [2024-01-03 09:32:03,494][model8_pretrain.py][INFO] Epoch:[0/2](150900/4588595) loss:2.483 lr:0.0000100 epoch_Time:28004.0min: [2024-01-03 09:32:03,494][model8_pretrain.py][INFO] Epoch:[0/2](150900/4588595) loss:3.256 lr:0.0000100 epoch_Time:28004.0min: [2024-01-03 09:32:03,494][model8_pretrain.py][INFO] Epoch:[0/2](150900/4588595) loss:2.942 lr:0.0000100 epoch_Time:28004.0min: [2024-01-03 09:32:03,494][model8_pretrain.py][INFO] Epoch:[0/2](150900/4588595) loss:2.905 lr:0.0000100 epoch_Time:28004.0min: [2024-01-03 09:32:03,494][model8_pretrain.py][INFO] Epoch:[0/2](150900/4588595) loss:2.823 lr:0.0000100 epoch_Time:28004.0min: [2024-01-03 
09:32:03,494][model8_pretrain.py][INFO] Epoch:[0/2](150900/4588595) loss:3.051 lr:0.0000100 epoch_Time:28004.0min: [2024-01-03 09:32:40,434][model8_pretrain.py][INFO] Epoch:[0/2](151000/4588595) loss:2.629 lr:0.0000100 epoch_Time:28003.0min: [2024-01-03 09:32:40,434][model8_pretrain.py][INFO] Epoch:[0/2](151000/4588595) loss:3.288 lr:0.0000100 epoch_Time:28003.0min: [2024-01-03 09:32:40,434][model8_pretrain.py][INFO] Epoch:[0/2](151000/4588595) loss:3.056 lr:0.0000100 epoch_Time:28003.0min: [2024-01-03 09:32:40,434][model8_pretrain.py][INFO] Epoch:[0/2](151000/4588595) loss:2.914 lr:0.0000100 epoch_Time:28003.0min: [2024-01-03 09:32:40,434][model8_pretrain.py][INFO] Epoch:[0/2](151000/4588595) loss:3.096 lr:0.0000100 epoch_Time:28003.0min: [2024-01-03 09:32:40,434][model8_pretrain.py][INFO] Epoch:[0/2](151000/4588595) loss:2.942 lr:0.0000100 epoch_Time:28003.0min: [2024-01-03 09:32:40,434][model8_pretrain.py][INFO] Epoch:[0/2](151000/4588595) loss:2.844 lr:0.0000100 epoch_Time:28004.0min: [2024-01-03 09:32:40,434][model8_pretrain.py][INFO] Epoch:[0/2](151000/4588595) loss:3.155 lr:0.0000100 epoch_Time:28003.0min: [2024-01-03 09:33:17,372][model8_pretrain.py][INFO] Epoch:[0/2](151100/4588595) loss:2.885 lr:0.0000100 epoch_Time:28002.0min: [2024-01-03 09:33:17,372][model8_pretrain.py][INFO] Epoch:[0/2](151100/4588595) loss:2.955 lr:0.0000100 epoch_Time:28002.0min: [2024-01-03 09:33:17,372][model8_pretrain.py][INFO] Epoch:[0/2](151100/4588595) loss:3.387 lr:0.0000100 epoch_Time:28002.0min: [2024-01-03 09:33:17,372][model8_pretrain.py][INFO] Epoch:[0/2](151100/4588595) loss:3.073 lr:0.0000100 epoch_Time:28002.0min: [2024-01-03 09:33:17,372][model8_pretrain.py][INFO] Epoch:[0/2](151100/4588595) loss:3.257 lr:0.0000100 epoch_Time:28002.0min: [2024-01-03 09:33:17,372][model8_pretrain.py][INFO] Epoch:[0/2](151100/4588595) loss:3.137 lr:0.0000100 epoch_Time:28002.0min: [2024-01-03 09:33:17,372][model8_pretrain.py][INFO] Epoch:[0/2](151100/4588595) loss:2.795 lr:0.0000100 
epoch_Time:28002.0min: [2024-01-03 09:33:17,373][model8_pretrain.py][INFO] Epoch:[0/2](151100/4588595) loss:3.308 lr:0.0000100 epoch_Time:28002.0min: [2024-01-03 09:33:54,321][model8_pretrain.py][INFO] Epoch:[0/2](151200/4588595) loss:3.033 lr:0.0000100 epoch_Time:28001.0min: [2024-01-03 09:33:54,321][model8_pretrain.py][INFO] Epoch:[0/2](151200/4588595) loss:2.696 lr:0.0000100 epoch_Time:28001.0min: [2024-01-03 09:33:54,321][model8_pretrain.py][INFO] Epoch:[0/2](151200/4588595) loss:3.105 lr:0.0000100 epoch_Time:28001.0min: [2024-01-03 09:33:54,321][model8_pretrain.py][INFO] Epoch:[0/2](151200/4588595) loss:3.346 lr:0.0000100 epoch_Time:28001.0min: [2024-01-03 09:33:54,321][model8_pretrain.py][INFO] Epoch:[0/2](151200/4588595) loss:3.271 lr:0.0000100 epoch_Time:28001.0min: [2024-01-03 09:33:54,321][model8_pretrain.py][INFO] Epoch:[0/2](151200/4588595) loss:2.902 lr:0.0000100 epoch_Time:28001.0min: [2024-01-03 09:33:54,321][model8_pretrain.py][INFO] Epoch:[0/2](151200/4588595) loss:3.148 lr:0.0000100 epoch_Time:28001.0min: [2024-01-03 09:33:54,322][model8_pretrain.py][INFO] Epoch:[0/2](151200/4588595) loss:3.377 lr:0.0000100 epoch_Time:28001.0min: [2024-01-03 09:34:31,269][model8_pretrain.py][INFO] Epoch:[0/2](151300/4588595) loss:2.120 lr:0.0000100 epoch_Time:28000.0min: [2024-01-03 09:34:31,269][model8_pretrain.py][INFO] Epoch:[0/2](151300/4588595) loss:2.987 lr:0.0000100 epoch_Time:28000.0min: [2024-01-03 09:34:31,269][model8_pretrain.py][INFO] Epoch:[0/2](151300/4588595) loss:2.735 lr:0.0000100 epoch_Time:28000.0min: [2024-01-03 09:34:31,269][model8_pretrain.py][INFO] Epoch:[0/2](151300/4588595) loss:2.750 lr:0.0000100 epoch_Time:28000.0min: [2024-01-03 09:34:31,269][model8_pretrain.py][INFO] Epoch:[0/2](151300/4588595) loss:2.680 lr:0.0000100 epoch_Time:28000.0min: [2024-01-03 09:34:31,269][model8_pretrain.py][INFO] Epoch:[0/2](151300/4588595) loss:3.127 lr:0.0000100 epoch_Time:28000.0min: [2024-01-03 09:34:31,270][model8_pretrain.py][INFO] 
Epoch:[0/2](151300/4588595) loss:2.939 lr:0.0000100 epoch_Time:28000.0min: [2024-01-03 09:34:31,270][model8_pretrain.py][INFO] Epoch:[0/2](151300/4588595) loss:3.332 lr:0.0000100 epoch_Time:28000.0min: [2024-01-03 09:35:08,206][model8_pretrain.py][INFO] Epoch:[0/2](151400/4588595) loss:2.672 lr:0.0000100 epoch_Time:27999.0min: [2024-01-03 09:35:08,206][model8_pretrain.py][INFO] Epoch:[0/2](151400/4588595) loss:2.506 lr:0.0000100 epoch_Time:27999.0min: [2024-01-03 09:35:08,206][model8_pretrain.py][INFO] Epoch:[0/2](151400/4588595) loss:3.233 lr:0.0000100 epoch_Time:27999.0min: [2024-01-03 09:35:08,206][model8_pretrain.py][INFO] Epoch:[0/2](151400/4588595) loss:2.652 lr:0.0000100 epoch_Time:27999.0min: [2024-01-03 09:35:08,206][model8_pretrain.py][INFO] Epoch:[0/2](151400/4588595) loss:2.493 lr:0.0000100 epoch_Time:27999.0min: [2024-01-03 09:35:08,206][model8_pretrain.py][INFO] Epoch:[0/2](151400/4588595) loss:2.889 lr:0.0000100 epoch_Time:27999.0min: [2024-01-03 09:35:08,206][model8_pretrain.py][INFO] Epoch:[0/2](151400/4588595) loss:2.932 lr:0.0000100 epoch_Time:27999.0min: [2024-01-03 09:35:08,206][model8_pretrain.py][INFO] Epoch:[0/2](151400/4588595) loss:2.770 lr:0.0000100 epoch_Time:27999.0min: [2024-01-03 09:35:45,140][model8_pretrain.py][INFO] Epoch:[0/2](151500/4588595) loss:2.830 lr:0.0000100 epoch_Time:27998.0min: [2024-01-03 09:35:45,140][model8_pretrain.py][INFO] Epoch:[0/2](151500/4588595) loss:2.695 lr:0.0000100 epoch_Time:27998.0min: [2024-01-03 09:35:45,140][model8_pretrain.py][INFO] Epoch:[0/2](151500/4588595) loss:2.849 lr:0.0000100 epoch_Time:27998.0min: [2024-01-03 09:35:45,140][model8_pretrain.py][INFO] Epoch:[0/2](151500/4588595) loss:3.024 lr:0.0000100 epoch_Time:27998.0min: [2024-01-03 09:35:45,140][model8_pretrain.py][INFO] Epoch:[0/2](151500/4588595) loss:3.186 lr:0.0000100 epoch_Time:27998.0min: [2024-01-03 09:35:45,140][model8_pretrain.py][INFO] Epoch:[0/2](151500/4588595) loss:2.985 lr:0.0000100 epoch_Time:27998.0min: [2024-01-03 
09:35:45,140][model8_pretrain.py][INFO] Epoch:[0/2](151500/4588595) loss:3.051 lr:0.0000100 epoch_Time:27998.0min: [2024-01-03 09:35:45,140][model8_pretrain.py][INFO] Epoch:[0/2](151500/4588595) loss:3.303 lr:0.0000100 epoch_Time:27998.0min: [2024-01-03 09:36:30,781][model8_pretrain.py][INFO] Epoch:[0/2](151600/4588595) loss:3.122 lr:0.0000100 epoch_Time:28001.0min: [2024-01-03 09:36:30,781][model8_pretrain.py][INFO] Epoch:[0/2](151600/4588595) loss:2.744 lr:0.0000100 epoch_Time:28001.0min: [2024-01-03 09:36:30,781][model8_pretrain.py][INFO] Epoch:[0/2](151600/4588595) loss:3.265 lr:0.0000100 epoch_Time:28001.0min: [2024-01-03 09:36:30,781][model8_pretrain.py][INFO] Epoch:[0/2](151600/4588595) loss:2.690 lr:0.0000100 epoch_Time:28001.0min: [2024-01-03 09:36:30,781][model8_pretrain.py][INFO] Epoch:[0/2](151600/4588595) loss:2.509 lr:0.0000100 epoch_Time:28001.0min: [2024-01-03 09:36:30,781][model8_pretrain.py][INFO] Epoch:[0/2](151600/4588595) loss:3.653 lr:0.0000100 epoch_Time:28001.0min: [2024-01-03 09:36:30,781][model8_pretrain.py][INFO] Epoch:[0/2](151600/4588595) loss:3.219 lr:0.0000100 epoch_Time:28001.0min: [2024-01-03 09:36:30,781][model8_pretrain.py][INFO] Epoch:[0/2](151600/4588595) loss:2.934 lr:0.0000100 epoch_Time:28001.0min: [2024-01-03 09:37:07,724][model8_pretrain.py][INFO] Epoch:[0/2](151700/4588595) loss:3.560 lr:0.0000100 epoch_Time:28000.0min: [2024-01-03 09:37:07,724][model8_pretrain.py][INFO] Epoch:[0/2](151700/4588595) loss:2.589 lr:0.0000100 epoch_Time:28000.0min: [2024-01-03 09:37:07,724][model8_pretrain.py][INFO] Epoch:[0/2](151700/4588595) loss:2.816 lr:0.0000100 epoch_Time:28000.0min: [2024-01-03 09:37:07,724][model8_pretrain.py][INFO] Epoch:[0/2](151700/4588595) loss:3.120 lr:0.0000100 epoch_Time:28000.0min: [2024-01-03 09:37:07,724][model8_pretrain.py][INFO] Epoch:[0/2](151700/4588595) loss:3.469 lr:0.0000100 epoch_Time:28000.0min: [2024-01-03 09:37:07,724][model8_pretrain.py][INFO] Epoch:[0/2](151700/4588595) loss:3.297 lr:0.0000100 
epoch_Time:28000.0min: [2024-01-03 09:37:07,724][model8_pretrain.py][INFO] Epoch:[0/2](151700/4588595) loss:2.971 lr:0.0000100 epoch_Time:28000.0min: [2024-01-03 09:37:07,724][model8_pretrain.py][INFO] Epoch:[0/2](151700/4588595) loss:3.518 lr:0.0000100 epoch_Time:28000.0min: [2024-01-03 09:37:44,650][model8_pretrain.py][INFO] Epoch:[0/2](151800/4588595) loss:2.650 lr:0.0000100 epoch_Time:27999.0min: [2024-01-03 09:37:44,650][model8_pretrain.py][INFO] Epoch:[0/2](151800/4588595) loss:3.048 lr:0.0000100 epoch_Time:27999.0min: [2024-01-03 09:37:44,650][model8_pretrain.py][INFO] Epoch:[0/2](151800/4588595) loss:2.887 lr:0.0000100 epoch_Time:27999.0min: [2024-01-03 09:37:44,650][model8_pretrain.py][INFO] Epoch:[0/2](151800/4588595) loss:2.544 lr:0.0000100 epoch_Time:27999.0min: [2024-01-03 09:37:44,650][model8_pretrain.py][INFO] Epoch:[0/2](151800/4588595) loss:3.098 lr:0.0000100 epoch_Time:27999.0min: [2024-01-03 09:37:44,650][model8_pretrain.py][INFO] Epoch:[0/2](151800/4588595) loss:2.298 lr:0.0000100 epoch_Time:27999.0min: [2024-01-03 09:37:44,651][model8_pretrain.py][INFO] Epoch:[0/2](151800/4588595) loss:3.135 lr:0.0000100 epoch_Time:27999.0min: [2024-01-03 09:37:44,652][model8_pretrain.py][INFO] Epoch:[0/2](151800/4588595) loss:3.151 lr:0.0000100 epoch_Time:27999.0min: [2024-01-03 09:38:21,593][model8_pretrain.py][INFO] Epoch:[0/2](151900/4588595) loss:2.611 lr:0.0000100 epoch_Time:27998.0min: [2024-01-03 09:38:21,593][model8_pretrain.py][INFO] Epoch:[0/2](151900/4588595) loss:2.406 lr:0.0000100 epoch_Time:27998.0min: [2024-01-03 09:38:21,593][model8_pretrain.py][INFO] Epoch:[0/2](151900/4588595) loss:2.908 lr:0.0000100 epoch_Time:27998.0min: [2024-01-03 09:38:21,593][model8_pretrain.py][INFO] Epoch:[0/2](151900/4588595) loss:2.313 lr:0.0000100 epoch_Time:27998.0min: [2024-01-03 09:38:21,593][model8_pretrain.py][INFO] Epoch:[0/2](151900/4588595) loss:3.278 lr:0.0000100 epoch_Time:27998.0min: [2024-01-03 09:38:21,593][model8_pretrain.py][INFO] 
Epoch:[0/2](151900/4588595) loss:2.980 lr:0.0000100 epoch_Time:27998.0min: [2024-01-03 09:38:21,593][model8_pretrain.py][INFO] Epoch:[0/2](151900/4588595) loss:2.969 lr:0.0000100 epoch_Time:27998.0min: [2024-01-03 09:38:21,594][model8_pretrain.py][INFO] Epoch:[0/2](151900/4588595) loss:2.820 lr:0.0000100 epoch_Time:27998.0min: [2024-01-03 09:38:58,540][model8_pretrain.py][INFO] Epoch:[0/2](152000/4588595) loss:3.061 lr:0.0000100 epoch_Time:27996.0min: [2024-01-03 09:38:58,540][model8_pretrain.py][INFO] Epoch:[0/2](152000/4588595) loss:2.928 lr:0.0000100 epoch_Time:27996.0min: [2024-01-03 09:38:58,540][model8_pretrain.py][INFO] Epoch:[0/2](152000/4588595) loss:2.996 lr:0.0000100 epoch_Time:27996.0min: [2024-01-03 09:38:58,540][model8_pretrain.py][INFO] Epoch:[0/2](152000/4588595) loss:3.166 lr:0.0000100 epoch_Time:27996.0min: [2024-01-03 09:38:58,540][model8_pretrain.py][INFO] Epoch:[0/2](152000/4588595) loss:2.883 lr:0.0000100 epoch_Time:27996.0min: [2024-01-03 09:38:58,540][model8_pretrain.py][INFO] Epoch:[0/2](152000/4588595) loss:3.411 lr:0.0000100 epoch_Time:27996.0min: [2024-01-03 09:38:58,540][model8_pretrain.py][INFO] Epoch:[0/2](152000/4588595) loss:2.835 lr:0.0000100 epoch_Time:27996.0min: [2024-01-03 09:38:58,540][model8_pretrain.py][INFO] Epoch:[0/2](152000/4588595) loss:2.868 lr:0.0000100 epoch_Time:27996.0min: [2024-01-03 09:39:35,473][model8_pretrain.py][INFO] Epoch:[0/2](152100/4588595) loss:3.610 lr:0.0000100 epoch_Time:27996.0min: [2024-01-03 09:39:35,473][model8_pretrain.py][INFO] Epoch:[0/2](152100/4588595) loss:3.215 lr:0.0000100 epoch_Time:27996.0min: [2024-01-03 09:39:35,473][model8_pretrain.py][INFO] Epoch:[0/2](152100/4588595) loss:3.320 lr:0.0000100 epoch_Time:27996.0min: [2024-01-03 09:39:35,473][model8_pretrain.py][INFO] Epoch:[0/2](152100/4588595) loss:2.679 lr:0.0000100 epoch_Time:27996.0min: [2024-01-03 09:39:35,473][model8_pretrain.py][INFO] Epoch:[0/2](152100/4588595) loss:2.803 lr:0.0000100 epoch_Time:27996.0min: [2024-01-03 
09:39:35,473][model8_pretrain.py][INFO] Epoch:[0/2](152100/4588595) loss:2.925 lr:0.0000100 epoch_Time:27996.0min: [2024-01-03 09:39:35,473][model8_pretrain.py][INFO] Epoch:[0/2](152100/4588595) loss:2.864 lr:0.0000100 epoch_Time:27996.0min: [2024-01-03 09:39:35,473][model8_pretrain.py][INFO] Epoch:[0/2](152100/4588595) loss:2.801 lr:0.0000100 epoch_Time:27996.0min: [2024-01-03 09:40:12,426][model8_pretrain.py][INFO] Epoch:[0/2](152200/4588595) loss:2.933 lr:0.0000100 epoch_Time:27994.0min: [2024-01-03 09:40:12,426][model8_pretrain.py][INFO] Epoch:[0/2](152200/4588595) loss:3.175 lr:0.0000100 epoch_Time:27994.0min: [2024-01-03 09:40:12,426][model8_pretrain.py][INFO] Epoch:[0/2](152200/4588595) loss:3.135 lr:0.0000100 epoch_Time:27994.0min: [2024-01-03 09:40:12,426][model8_pretrain.py][INFO] Epoch:[0/2](152200/4588595) loss:2.879 lr:0.0000100 epoch_Time:27994.0min: [2024-01-03 09:40:12,426][model8_pretrain.py][INFO] Epoch:[0/2](152200/4588595) loss:3.133 lr:0.0000100 epoch_Time:27994.0min: [2024-01-03 09:40:12,426][model8_pretrain.py][INFO] Epoch:[0/2](152200/4588595) loss:3.015 lr:0.0000100 epoch_Time:27994.0min: [2024-01-03 09:40:12,426][model8_pretrain.py][INFO] Epoch:[0/2](152200/4588595) loss:3.012 lr:0.0000100 epoch_Time:27994.0min: [2024-01-03 09:40:12,427][model8_pretrain.py][INFO] Epoch:[0/2](152200/4588595) loss:3.109 lr:0.0000100 epoch_Time:27994.0min: [2024-01-03 09:40:49,369][model8_pretrain.py][INFO] Epoch:[0/2](152300/4588595) loss:2.684 lr:0.0000100 epoch_Time:27993.0min: [2024-01-03 09:40:49,369][model8_pretrain.py][INFO] Epoch:[0/2](152300/4588595) loss:3.108 lr:0.0000100 epoch_Time:27993.0min: [2024-01-03 09:40:49,369][model8_pretrain.py][INFO] Epoch:[0/2](152300/4588595) loss:3.059 lr:0.0000100 epoch_Time:27993.0min: [2024-01-03 09:40:49,369][model8_pretrain.py][INFO] Epoch:[0/2](152300/4588595) loss:2.935 lr:0.0000100 epoch_Time:27993.0min: [2024-01-03 09:40:49,369][model8_pretrain.py][INFO] Epoch:[0/2](152300/4588595) loss:2.841 lr:0.0000100 
epoch_Time:27993.0min: [2024-01-03 09:40:49,369][model8_pretrain.py][INFO] Epoch:[0/2](152300/4588595) loss:3.306 lr:0.0000100 epoch_Time:27993.0min: [2024-01-03 09:40:49,369][model8_pretrain.py][INFO] Epoch:[0/2](152300/4588595) loss:3.434 lr:0.0000100 epoch_Time:27993.0min: [2024-01-03 09:40:49,370][model8_pretrain.py][INFO] Epoch:[0/2](152300/4588595) loss:3.121 lr:0.0000100 epoch_Time:27993.0min: [2024-01-03 09:41:34,948][model8_pretrain.py][INFO] Epoch:[0/2](152400/4588595) loss:2.878 lr:0.0000100 epoch_Time:27997.0min: [2024-01-03 09:41:34,948][model8_pretrain.py][INFO] Epoch:[0/2](152400/4588595) loss:3.380 lr:0.0000100 epoch_Time:27997.0min: [2024-01-03 09:41:34,948][model8_pretrain.py][INFO] Epoch:[0/2](152400/4588595) loss:2.849 lr:0.0000100 epoch_Time:27997.0min: [2024-01-03 09:41:34,948][model8_pretrain.py][INFO] Epoch:[0/2](152400/4588595) loss:3.094 lr:0.0000100 epoch_Time:27997.0min: [2024-01-03 09:41:34,948][model8_pretrain.py][INFO] Epoch:[0/2](152400/4588595) loss:2.787 lr:0.0000100 epoch_Time:27997.0min: [2024-01-03 09:41:34,948][model8_pretrain.py][INFO] Epoch:[0/2](152400/4588595) loss:2.322 lr:0.0000100 epoch_Time:27997.0min: [2024-01-03 09:41:34,948][model8_pretrain.py][INFO] Epoch:[0/2](152400/4588595) loss:3.026 lr:0.0000100 epoch_Time:27997.0min: [2024-01-03 09:41:34,948][model8_pretrain.py][INFO] Epoch:[0/2](152400/4588595) loss:3.236 lr:0.0000100 epoch_Time:27997.0min: [2024-01-03 09:42:11,873][model8_pretrain.py][INFO] Epoch:[0/2](152500/4588595) loss:2.497 lr:0.0000100 epoch_Time:27995.0min: [2024-01-03 09:42:11,873][model8_pretrain.py][INFO] Epoch:[0/2](152500/4588595) loss:2.608 lr:0.0000100 epoch_Time:27995.0min: [2024-01-03 09:42:11,873][model8_pretrain.py][INFO] Epoch:[0/2](152500/4588595) loss:2.992 lr:0.0000100 epoch_Time:27995.0min: [2024-01-03 09:42:11,873][model8_pretrain.py][INFO] Epoch:[0/2](152500/4588595) loss:3.001 lr:0.0000100 epoch_Time:27995.0min: [2024-01-03 09:42:11,873][model8_pretrain.py][INFO] 
Epoch:[0/2](152500/4588595) loss:3.290 lr:0.0000100 epoch_Time:27995.0min: [2024-01-03 09:42:11,873][model8_pretrain.py][INFO] Epoch:[0/2](152500/4588595) loss:3.444 lr:0.0000100 epoch_Time:27995.0min: [2024-01-03 09:42:11,873][model8_pretrain.py][INFO] Epoch:[0/2](152500/4588595) loss:2.995 lr:0.0000100 epoch_Time:27995.0min: [2024-01-03 09:42:11,873][model8_pretrain.py][INFO] Epoch:[0/2](152500/4588595) loss:2.981 lr:0.0000100 epoch_Time:27995.0min: [2024-01-03 09:42:48,810][model8_pretrain.py][INFO] Epoch:[0/2](152600/4588595) loss:2.737 lr:0.0000100 epoch_Time:27994.0min: [2024-01-03 09:42:48,810][model8_pretrain.py][INFO] Epoch:[0/2](152600/4588595) loss:3.301 lr:0.0000100 epoch_Time:27994.0min: [2024-01-03 09:42:48,810][model8_pretrain.py][INFO] Epoch:[0/2](152600/4588595) loss:3.110 lr:0.0000100 epoch_Time:27994.0min: [2024-01-03 09:42:48,810][model8_pretrain.py][INFO] Epoch:[0/2](152600/4588595) loss:2.558 lr:0.0000100 epoch_Time:27994.0min: [2024-01-03 09:42:48,810][model8_pretrain.py][INFO] Epoch:[0/2](152600/4588595) loss:2.479 lr:0.0000100 epoch_Time:27994.0min: [2024-01-03 09:42:48,810][model8_pretrain.py][INFO] Epoch:[0/2](152600/4588595) loss:2.339 lr:0.0000100 epoch_Time:27994.0min: [2024-01-03 09:42:48,811][model8_pretrain.py][INFO] Epoch:[0/2](152600/4588595) loss:3.149 lr:0.0000100 epoch_Time:27994.0min: [2024-01-03 09:42:48,812][model8_pretrain.py][INFO] Epoch:[0/2](152600/4588595) loss:2.770 lr:0.0000100 epoch_Time:27994.0min: [2024-01-03 09:43:25,757][model8_pretrain.py][INFO] Epoch:[0/2](152700/4588595) loss:3.032 lr:0.0000100 epoch_Time:27993.0min: [2024-01-03 09:43:25,757][model8_pretrain.py][INFO] Epoch:[0/2](152700/4588595) loss:3.304 lr:0.0000100 epoch_Time:27993.0min: [2024-01-03 09:43:25,757][model8_pretrain.py][INFO] Epoch:[0/2](152700/4588595) loss:3.057 lr:0.0000100 epoch_Time:27993.0min: [2024-01-03 09:43:25,757][model8_pretrain.py][INFO] Epoch:[0/2](152700/4588595) loss:3.121 lr:0.0000100 epoch_Time:27993.0min: [2024-01-03 
09:43:25,757][model8_pretrain.py][INFO] Epoch:[0/2](152700/4588595) loss:2.683 lr:0.0000100 epoch_Time:27993.0min: [2024-01-03 09:43:25,757][model8_pretrain.py][INFO] Epoch:[0/2](152700/4588595) loss:3.044 lr:0.0000100 epoch_Time:27993.0min: [2024-01-03 09:43:25,757][model8_pretrain.py][INFO] Epoch:[0/2](152700/4588595) loss:3.281 lr:0.0000100 epoch_Time:27993.0min: [2024-01-03 09:43:25,757][model8_pretrain.py][INFO] Epoch:[0/2](152700/4588595) loss:3.155 lr:0.0000100 epoch_Time:27993.0min: [2024-01-03 09:44:02,665][model8_pretrain.py][INFO] Epoch:[0/2](152800/4588595) loss:2.672 lr:0.0000100 epoch_Time:27992.0min: [2024-01-03 09:44:02,665][model8_pretrain.py][INFO] Epoch:[0/2](152800/4588595) loss:2.363 lr:0.0000100 epoch_Time:27992.0min: [2024-01-03 09:44:02,665][model8_pretrain.py][INFO] Epoch:[0/2](152800/4588595) loss:2.810 lr:0.0000100 epoch_Time:27992.0min: [2024-01-03 09:44:02,665][model8_pretrain.py][INFO] Epoch:[0/2](152800/4588595) loss:3.125 lr:0.0000100 epoch_Time:27992.0min: [2024-01-03 09:44:02,665][model8_pretrain.py][INFO] Epoch:[0/2](152800/4588595) loss:3.366 lr:0.0000100 epoch_Time:27992.0min: [2024-01-03 09:44:02,665][model8_pretrain.py][INFO] Epoch:[0/2](152800/4588595) loss:3.160 lr:0.0000100 epoch_Time:27992.0min: [2024-01-03 09:44:02,665][model8_pretrain.py][INFO] Epoch:[0/2](152800/4588595) loss:2.842 lr:0.0000100 epoch_Time:27992.0min: [2024-01-03 09:44:02,665][model8_pretrain.py][INFO] Epoch:[0/2](152800/4588595) loss:2.784 lr:0.0000100 epoch_Time:27992.0min: [2024-01-03 09:44:39,601][model8_pretrain.py][INFO] Epoch:[0/2](152900/4588595) loss:2.892 lr:0.0000100 epoch_Time:27991.0min: [2024-01-03 09:44:39,601][model8_pretrain.py][INFO] Epoch:[0/2](152900/4588595) loss:2.831 lr:0.0000100 epoch_Time:27991.0min: [2024-01-03 09:44:39,601][model8_pretrain.py][INFO] Epoch:[0/2](152900/4588595) loss:2.852 lr:0.0000100 epoch_Time:27991.0min: [2024-01-03 09:44:39,601][model8_pretrain.py][INFO] Epoch:[0/2](152900/4588595) loss:3.106 lr:0.0000100 
epoch_Time:27991.0min: [2024-01-03 09:44:39,601][model8_pretrain.py][INFO] Epoch:[0/2](152900/4588595) loss:3.175 lr:0.0000100 epoch_Time:27991.0min: [2024-01-03 09:44:39,601][model8_pretrain.py][INFO] Epoch:[0/2](152900/4588595) loss:2.789 lr:0.0000100 epoch_Time:27991.0min: [2024-01-03 09:44:39,601][model8_pretrain.py][INFO] Epoch:[0/2](152900/4588595) loss:3.001 lr:0.0000100 epoch_Time:27991.0min: [2024-01-03 09:44:39,601][model8_pretrain.py][INFO] Epoch:[0/2](152900/4588595) loss:3.049 lr:0.0000100 epoch_Time:27991.0min: [2024-01-03 09:45:16,533][model8_pretrain.py][INFO] Epoch:[0/2](153000/4588595) loss:3.181 lr:0.0000100 epoch_Time:27990.0min: [2024-01-03 09:45:16,533][model8_pretrain.py][INFO] Epoch:[0/2](153000/4588595) loss:3.275 lr:0.0000100 epoch_Time:27990.0min: [2024-01-03 09:45:16,533][model8_pretrain.py][INFO] Epoch:[0/2](153000/4588595) loss:2.824 lr:0.0000100 epoch_Time:27990.0min: [2024-01-03 09:45:16,533][model8_pretrain.py][INFO] Epoch:[0/2](153000/4588595) loss:3.266 lr:0.0000100 epoch_Time:27990.0min: [2024-01-03 09:45:16,534][model8_pretrain.py][INFO] Epoch:[0/2](153000/4588595) loss:3.165 lr:0.0000100 epoch_Time:27990.0min: [2024-01-03 09:45:16,534][model8_pretrain.py][INFO] Epoch:[0/2](153000/4588595) loss:2.917 lr:0.0000100 epoch_Time:27990.0min: [2024-01-03 09:45:16,534][model8_pretrain.py][INFO] Epoch:[0/2](153000/4588595) loss:3.295 lr:0.0000100 epoch_Time:27990.0min: [2024-01-03 09:45:16,534][model8_pretrain.py][INFO] Epoch:[0/2](153000/4588595) loss:3.547 lr:0.0000100 epoch_Time:27990.0min: [2024-01-03 09:45:53,474][model8_pretrain.py][INFO] Epoch:[0/2](153100/4588595) loss:2.665 lr:0.0000100 epoch_Time:27988.0min: [2024-01-03 09:45:53,474][model8_pretrain.py][INFO] Epoch:[0/2](153100/4588595) loss:2.694 lr:0.0000100 epoch_Time:27988.0min: [2024-01-03 09:45:53,474][model8_pretrain.py][INFO] Epoch:[0/2](153100/4588595) loss:3.071 lr:0.0000100 epoch_Time:27988.0min: [2024-01-03 09:45:53,474][model8_pretrain.py][INFO] 
Epoch:[0/2](153100/4588595) loss:3.227 lr:0.0000100 epoch_Time:27988.0min: [2024-01-03 09:45:53,474][model8_pretrain.py][INFO] Epoch:[0/2](153100/4588595) loss:2.862 lr:0.0000100 epoch_Time:27988.0min: [2024-01-03 09:45:53,474][model8_pretrain.py][INFO] Epoch:[0/2](153100/4588595) loss:2.881 lr:0.0000100 epoch_Time:27988.0min: [2024-01-03 09:45:53,474][model8_pretrain.py][INFO] Epoch:[0/2](153100/4588595) loss:2.699 lr:0.0000100 epoch_Time:27988.0min: [2024-01-03 09:45:53,474][model8_pretrain.py][INFO] Epoch:[0/2](153100/4588595) loss:2.865 lr:0.0000100 epoch_Time:27988.0min: [2024-01-03 09:46:39,185][model8_pretrain.py][INFO] Epoch:[0/2](153200/4588595) loss:2.970 lr:0.0000100 epoch_Time:27992.0min: [2024-01-03 09:46:39,185][model8_pretrain.py][INFO] Epoch:[0/2](153200/4588595) loss:3.119 lr:0.0000100 epoch_Time:27992.0min: [2024-01-03 09:46:39,185][model8_pretrain.py][INFO] Epoch:[0/2](153200/4588595) loss:2.450 lr:0.0000100 epoch_Time:27992.0min: [2024-01-03 09:46:39,185][model8_pretrain.py][INFO] Epoch:[0/2](153200/4588595) loss:3.131 lr:0.0000100 epoch_Time:27992.0min: [2024-01-03 09:46:39,185][model8_pretrain.py][INFO] Epoch:[0/2](153200/4588595) loss:3.257 lr:0.0000100 epoch_Time:27992.0min: [2024-01-03 09:46:39,185][model8_pretrain.py][INFO] Epoch:[0/2](153200/4588595) loss:3.040 lr:0.0000100 epoch_Time:27992.0min: [2024-01-03 09:46:39,185][model8_pretrain.py][INFO] Epoch:[0/2](153200/4588595) loss:2.733 lr:0.0000100 epoch_Time:27992.0min: [2024-01-03 09:46:39,185][model8_pretrain.py][INFO] Epoch:[0/2](153200/4588595) loss:2.932 lr:0.0000100 epoch_Time:27992.0min: [2024-01-03 09:47:16,106][model8_pretrain.py][INFO] Epoch:[0/2](153300/4588595) loss:2.938 lr:0.0000100 epoch_Time:27991.0min: [2024-01-03 09:47:16,106][model8_pretrain.py][INFO] Epoch:[0/2](153300/4588595) loss:2.706 lr:0.0000100 epoch_Time:27991.0min: [2024-01-03 09:47:16,106][model8_pretrain.py][INFO] Epoch:[0/2](153300/4588595) loss:3.155 lr:0.0000100 epoch_Time:27991.0min: [2024-01-03 
09:47:16,106][model8_pretrain.py][INFO] Epoch:[0/2](153300/4588595) loss:2.681 lr:0.0000100 epoch_Time:27991.0min: [2024-01-03 09:47:16,106][model8_pretrain.py][INFO] Epoch:[0/2](153300/4588595) loss:3.019 lr:0.0000100 epoch_Time:27991.0min: [2024-01-03 09:47:16,106][model8_pretrain.py][INFO] Epoch:[0/2](153300/4588595) loss:2.607 lr:0.0000100 epoch_Time:27991.0min: [2024-01-03 09:47:16,106][model8_pretrain.py][INFO] Epoch:[0/2](153300/4588595) loss:3.158 lr:0.0000100 epoch_Time:27991.0min: [2024-01-03 09:47:16,106][model8_pretrain.py][INFO] Epoch:[0/2](153300/4588595) loss:2.780 lr:0.0000100 epoch_Time:27991.0min: [2024-01-03 09:47:53,020][model8_pretrain.py][INFO] Epoch:[0/2](153400/4588595) loss:2.737 lr:0.0000100 epoch_Time:27989.0min: [2024-01-03 09:47:53,021][model8_pretrain.py][INFO] Epoch:[0/2](153400/4588595) loss:2.947 lr:0.0000100 epoch_Time:27989.0min: [2024-01-03 09:47:53,021][model8_pretrain.py][INFO] Epoch:[0/2](153400/4588595) loss:3.040 lr:0.0000100 epoch_Time:27989.0min: [2024-01-03 09:47:53,021][model8_pretrain.py][INFO] Epoch:[0/2](153400/4588595) loss:3.340 lr:0.0000100 epoch_Time:27989.0min: [2024-01-03 09:47:53,021][model8_pretrain.py][INFO] Epoch:[0/2](153400/4588595) loss:2.815 lr:0.0000100 epoch_Time:27989.0min: [2024-01-03 09:47:53,021][model8_pretrain.py][INFO] Epoch:[0/2](153400/4588595) loss:3.434 lr:0.0000100 epoch_Time:27989.0min: [2024-01-03 09:47:53,021][model8_pretrain.py][INFO] Epoch:[0/2](153400/4588595) loss:3.108 lr:0.0000100 epoch_Time:27989.0min: [2024-01-03 09:47:53,021][model8_pretrain.py][INFO] Epoch:[0/2](153400/4588595) loss:2.865 lr:0.0000100 epoch_Time:27989.0min: [2024-01-03 09:48:29,945][model8_pretrain.py][INFO] Epoch:[0/2](153500/4588595) loss:2.778 lr:0.0000100 epoch_Time:27989.0min: [2024-01-03 09:48:29,945][model8_pretrain.py][INFO] Epoch:[0/2](153500/4588595) loss:3.071 lr:0.0000100 epoch_Time:27989.0min: [2024-01-03 09:48:29,945][model8_pretrain.py][INFO] Epoch:[0/2](153500/4588595) loss:3.481 lr:0.0000100 
epoch_Time:27989.0min: [2024-01-03 09:48:29,945][model8_pretrain.py][INFO] Epoch:[0/2](153500/4588595) loss:2.995 lr:0.0000100 epoch_Time:27989.0min: [2024-01-03 09:48:29,945][model8_pretrain.py][INFO] Epoch:[0/2](153500/4588595) loss:2.629 lr:0.0000100 epoch_Time:27989.0min: [2024-01-03 09:48:29,945][model8_pretrain.py][INFO] Epoch:[0/2](153500/4588595) loss:3.106 lr:0.0000100 epoch_Time:27989.0min: [2024-01-03 09:48:29,945][model8_pretrain.py][INFO] Epoch:[0/2](153500/4588595) loss:2.737 lr:0.0000100 epoch_Time:27989.0min: [2024-01-03 09:48:29,945][model8_pretrain.py][INFO] Epoch:[0/2](153500/4588595) loss:2.527 lr:0.0000100 epoch_Time:27989.0min: [2024-01-03 09:49:06,875][model8_pretrain.py][INFO] Epoch:[0/2](153600/4588595) loss:3.107 lr:0.0000100 epoch_Time:27987.0min: [2024-01-03 09:49:06,875][model8_pretrain.py][INFO] Epoch:[0/2](153600/4588595) loss:3.416 lr:0.0000100 epoch_Time:27987.0min: [2024-01-03 09:49:06,875][model8_pretrain.py][INFO] Epoch:[0/2](153600/4588595) loss:3.106 lr:0.0000100 epoch_Time:27987.0min: [2024-01-03 09:49:06,875][model8_pretrain.py][INFO] Epoch:[0/2](153600/4588595) loss:3.214 lr:0.0000100 epoch_Time:27987.0min: [2024-01-03 09:49:06,875][model8_pretrain.py][INFO] Epoch:[0/2](153600/4588595) loss:3.027 lr:0.0000100 epoch_Time:27987.0min: [2024-01-03 09:49:06,875][model8_pretrain.py][INFO] Epoch:[0/2](153600/4588595) loss:2.835 lr:0.0000100 epoch_Time:27988.0min: [2024-01-03 09:49:06,875][model8_pretrain.py][INFO] Epoch:[0/2](153600/4588595) loss:2.748 lr:0.0000100 epoch_Time:27987.0min: [2024-01-03 09:49:06,875][model8_pretrain.py][INFO] Epoch:[0/2](153600/4588595) loss:2.997 lr:0.0000100 epoch_Time:27987.0min: [2024-01-03 09:49:43,794][model8_pretrain.py][INFO] Epoch:[0/2](153700/4588595) loss:3.272 lr:0.0000100 epoch_Time:27987.0min: [2024-01-03 09:49:43,794][model8_pretrain.py][INFO] Epoch:[0/2](153700/4588595) loss:2.884 lr:0.0000100 epoch_Time:27987.0min: [2024-01-03 09:49:43,794][model8_pretrain.py][INFO] 
Epoch:[0/2](153700/4588595) loss:2.798 lr:0.0000100 epoch_Time:27987.0min: [2024-01-03 09:49:43,794][model8_pretrain.py][INFO] Epoch:[0/2](153700/4588595) loss:3.121 lr:0.0000100 epoch_Time:27987.0min: [2024-01-03 09:49:43,794][model8_pretrain.py][INFO] Epoch:[0/2](153700/4588595) loss:2.652 lr:0.0000100 epoch_Time:27987.0min: [2024-01-03 09:49:43,794][model8_pretrain.py][INFO] Epoch:[0/2](153700/4588595) loss:2.989 lr:0.0000100 epoch_Time:27987.0min: [2024-01-03 09:49:43,794][model8_pretrain.py][INFO] Epoch:[0/2](153700/4588595) loss:2.785 lr:0.0000100 epoch_Time:27987.0min: [2024-01-03 09:49:43,794][model8_pretrain.py][INFO] Epoch:[0/2](153700/4588595) loss:2.941 lr:0.0000100 epoch_Time:27987.0min: [2024-01-03 09:50:20,754][model8_pretrain.py][INFO] Epoch:[0/2](153800/4588595) loss:3.449 lr:0.0000100 epoch_Time:27986.0min: [2024-01-03 09:50:20,754][model8_pretrain.py][INFO] Epoch:[0/2](153800/4588595) loss:3.015 lr:0.0000100 epoch_Time:27986.0min: [2024-01-03 09:50:20,754][model8_pretrain.py][INFO] Epoch:[0/2](153800/4588595) loss:3.112 lr:0.0000100 epoch_Time:27986.0min: [2024-01-03 09:50:20,754][model8_pretrain.py][INFO] Epoch:[0/2](153800/4588595) loss:2.789 lr:0.0000100 epoch_Time:27986.0min: [2024-01-03 09:50:20,754][model8_pretrain.py][INFO] Epoch:[0/2](153800/4588595) loss:2.630 lr:0.0000100 epoch_Time:27986.0min: [2024-01-03 09:50:20,754][model8_pretrain.py][INFO] Epoch:[0/2](153800/4588595) loss:3.301 lr:0.0000100 epoch_Time:27986.0min: [2024-01-03 09:50:20,754][model8_pretrain.py][INFO] Epoch:[0/2](153800/4588595) loss:3.182 lr:0.0000100 epoch_Time:27986.0min: [2024-01-03 09:50:20,754][model8_pretrain.py][INFO] Epoch:[0/2](153800/4588595) loss:3.381 lr:0.0000100 epoch_Time:27986.0min: [2024-01-03 09:50:57,689][model8_pretrain.py][INFO] Epoch:[0/2](153900/4588595) loss:3.262 lr:0.0000100 epoch_Time:27984.0min: [2024-01-03 09:50:57,689][model8_pretrain.py][INFO] Epoch:[0/2](153900/4588595) loss:3.365 lr:0.0000100 epoch_Time:27984.0min: [2024-01-03 
09:50:57,689][model8_pretrain.py][INFO] Epoch:[0/2](153900/4588595) loss:2.496 lr:0.0000100 epoch_Time:27984.0min: [2024-01-03 09:50:57,689][model8_pretrain.py][INFO] Epoch:[0/2](153900/4588595) loss:2.310 lr:0.0000100 epoch_Time:27984.0min: [2024-01-03 09:50:57,689][model8_pretrain.py][INFO] Epoch:[0/2](153900/4588595) loss:3.341 lr:0.0000100 epoch_Time:27984.0min: [2024-01-03 09:50:57,689][model8_pretrain.py][INFO] Epoch:[0/2](153900/4588595) loss:3.009 lr:0.0000100 epoch_Time:27984.0min: [2024-01-03 09:50:57,689][model8_pretrain.py][INFO] Epoch:[0/2](153900/4588595) loss:2.810 lr:0.0000100 epoch_Time:27984.0min: [2024-01-03 09:50:57,689][model8_pretrain.py][INFO] Epoch:[0/2](153900/4588595) loss:2.957 lr:0.0000100 epoch_Time:27984.0min: [2024-01-03 09:51:43,436][model8_pretrain.py][INFO] Epoch:[0/2](154000/4588595) loss:2.445 lr:0.0000100 epoch_Time:27988.0min: [2024-01-03 09:51:43,436][model8_pretrain.py][INFO] Epoch:[0/2](154000/4588595) loss:3.090 lr:0.0000100 epoch_Time:27988.0min: [2024-01-03 09:51:43,436][model8_pretrain.py][INFO] Epoch:[0/2](154000/4588595) loss:3.417 lr:0.0000100 epoch_Time:27988.0min: [2024-01-03 09:51:43,436][model8_pretrain.py][INFO] Epoch:[0/2](154000/4588595) loss:2.877 lr:0.0000100 epoch_Time:27988.0min: [2024-01-03 09:51:43,436][model8_pretrain.py][INFO] Epoch:[0/2](154000/4588595) loss:3.258 lr:0.0000100 epoch_Time:27988.0min: [2024-01-03 09:51:43,436][model8_pretrain.py][INFO] Epoch:[0/2](154000/4588595) loss:3.009 lr:0.0000100 epoch_Time:27988.0min: [2024-01-03 09:51:43,436][model8_pretrain.py][INFO] Epoch:[0/2](154000/4588595) loss:2.964 lr:0.0000100 epoch_Time:27988.0min: [2024-01-03 09:51:43,437][model8_pretrain.py][INFO] Epoch:[0/2](154000/4588595) loss:3.352 lr:0.0000100 epoch_Time:27988.0min: [2024-01-03 09:52:20,362][model8_pretrain.py][INFO] Epoch:[0/2](154100/4588595) loss:2.982 lr:0.0000100 epoch_Time:27987.0min: [2024-01-03 09:52:20,362][model8_pretrain.py][INFO] Epoch:[0/2](154100/4588595) loss:3.114 lr:0.0000100 
epoch_Time:27987.0min: [2024-01-03 09:52:20,362][model8_pretrain.py][INFO] Epoch:[0/2](154100/4588595) loss:3.688 lr:0.0000100 epoch_Time:27987.0min: [2024-01-03 09:52:20,362][model8_pretrain.py][INFO] Epoch:[0/2](154100/4588595) loss:3.256 lr:0.0000100 epoch_Time:27987.0min: [2024-01-03 09:52:20,362][model8_pretrain.py][INFO] Epoch:[0/2](154100/4588595) loss:2.819 lr:0.0000100 epoch_Time:27987.0min: [2024-01-03 09:52:20,362][model8_pretrain.py][INFO] Epoch:[0/2](154100/4588595) loss:2.542 lr:0.0000100 epoch_Time:27987.0min: [2024-01-03 09:52:20,363][model8_pretrain.py][INFO] Epoch:[0/2](154100/4588595) loss:2.752 lr:0.0000100 epoch_Time:27987.0min: [2024-01-03 09:52:20,363][model8_pretrain.py][INFO] Epoch:[0/2](154100/4588595) loss:2.790 lr:0.0000100 epoch_Time:27987.0min: [2024-01-03 09:52:57,305][model8_pretrain.py][INFO] Epoch:[0/2](154200/4588595) loss:2.704 lr:0.0000100 epoch_Time:27985.0min: [2024-01-03 09:52:57,305][model8_pretrain.py][INFO] Epoch:[0/2](154200/4588595) loss:3.178 lr:0.0000100 epoch_Time:27985.0min: [2024-01-03 09:52:57,305][model8_pretrain.py][INFO] Epoch:[0/2](154200/4588595) loss:2.893 lr:0.0000100 epoch_Time:27985.0min: [2024-01-03 09:52:57,305][model8_pretrain.py][INFO] Epoch:[0/2](154200/4588595) loss:3.115 lr:0.0000100 epoch_Time:27985.0min: [2024-01-03 09:52:57,305][model8_pretrain.py][INFO] Epoch:[0/2](154200/4588595) loss:2.852 lr:0.0000100 epoch_Time:27985.0min: [2024-01-03 09:52:57,305][model8_pretrain.py][INFO] Epoch:[0/2](154200/4588595) loss:3.091 lr:0.0000100 epoch_Time:27985.0min: [2024-01-03 09:52:57,305][model8_pretrain.py][INFO] Epoch:[0/2](154200/4588595) loss:2.857 lr:0.0000100 epoch_Time:27985.0min: [2024-01-03 09:52:57,305][model8_pretrain.py][INFO] Epoch:[0/2](154200/4588595) loss:2.686 lr:0.0000100 epoch_Time:27985.0min: [2024-01-03 09:53:34,238][model8_pretrain.py][INFO] Epoch:[0/2](154300/4588595) loss:2.978 lr:0.0000100 epoch_Time:27985.0min: [2024-01-03 09:53:34,238][model8_pretrain.py][INFO] 
Epoch:[0/2](154300/4588595) loss:2.774 lr:0.0000100 epoch_Time:27985.0min: [2024-01-03 09:53:34,238][model8_pretrain.py][INFO] Epoch:[0/2](154300/4588595) loss:3.489 lr:0.0000100 epoch_Time:27985.0min: [2024-01-03 09:53:34,238][model8_pretrain.py][INFO] Epoch:[0/2](154300/4588595) loss:2.716 lr:0.0000100 epoch_Time:27985.0min: [2024-01-03 09:53:34,238][model8_pretrain.py][INFO] Epoch:[0/2](154300/4588595) loss:2.675 lr:0.0000100 epoch_Time:27985.0min: [2024-01-03 09:53:34,238][model8_pretrain.py][INFO] Epoch:[0/2](154300/4588595) loss:2.749 lr:0.0000100 epoch_Time:27985.0min: [2024-01-03 09:53:34,239][model8_pretrain.py][INFO] Epoch:[0/2](154300/4588595) loss:2.708 lr:0.0000100 epoch_Time:27985.0min: [2024-01-03 09:53:34,239][model8_pretrain.py][INFO] Epoch:[0/2](154300/4588595) loss:3.354 lr:0.0000100 epoch_Time:27985.0min: [2024-01-03 09:54:11,174][model8_pretrain.py][INFO] Epoch:[0/2](154400/4588595) loss:3.362 lr:0.0000100 epoch_Time:27983.0min: [2024-01-03 09:54:11,174][model8_pretrain.py][INFO] Epoch:[0/2](154400/4588595) loss:3.008 lr:0.0000100 epoch_Time:27983.0min: [2024-01-03 09:54:11,174][model8_pretrain.py][INFO] Epoch:[0/2](154400/4588595) loss:2.843 lr:0.0000100 epoch_Time:27983.0min: [2024-01-03 09:54:11,174][model8_pretrain.py][INFO] Epoch:[0/2](154400/4588595) loss:2.682 lr:0.0000100 epoch_Time:27983.0min: [2024-01-03 09:54:11,174][model8_pretrain.py][INFO] Epoch:[0/2](154400/4588595) loss:2.767 lr:0.0000100 epoch_Time:27983.0min: [2024-01-03 09:54:11,174][model8_pretrain.py][INFO] Epoch:[0/2](154400/4588595) loss:3.236 lr:0.0000100 epoch_Time:27983.0min: [2024-01-03 09:54:11,175][model8_pretrain.py][INFO] Epoch:[0/2](154400/4588595) loss:3.211 lr:0.0000100 epoch_Time:27983.0min: [2024-01-03 09:54:11,175][model8_pretrain.py][INFO] Epoch:[0/2](154400/4588595) loss:2.860 lr:0.0000100 epoch_Time:27983.0min: [2024-01-03 09:54:48,106][model8_pretrain.py][INFO] Epoch:[0/2](154500/4588595) loss:2.697 lr:0.0000100 epoch_Time:27982.0min: [2024-01-03 
09:54:48,106][model8_pretrain.py][INFO] Epoch:[0/2](154500/4588595) loss:2.887 lr:0.0000100 epoch_Time:27982.0min: [2024-01-03 09:54:48,107][model8_pretrain.py][INFO] Epoch:[0/2](154500/4588595) loss:2.902 lr:0.0000100 epoch_Time:27982.0min: [2024-01-03 09:54:48,107][model8_pretrain.py][INFO] Epoch:[0/2](154500/4588595) loss:3.075 lr:0.0000100 epoch_Time:27982.0min: [2024-01-03 09:54:48,107][model8_pretrain.py][INFO] Epoch:[0/2](154500/4588595) loss:3.261 lr:0.0000100 epoch_Time:27982.0min: [2024-01-03 09:54:48,107][model8_pretrain.py][INFO] Epoch:[0/2](154500/4588595) loss:3.181 lr:0.0000100 epoch_Time:27982.0min: [2024-01-03 09:54:48,107][model8_pretrain.py][INFO] Epoch:[0/2](154500/4588595) loss:3.313 lr:0.0000100 epoch_Time:27982.0min: [2024-01-03 09:54:48,107][model8_pretrain.py][INFO] Epoch:[0/2](154500/4588595) loss:2.743 lr:0.0000100 epoch_Time:27982.0min: [2024-01-03 09:55:25,036][model8_pretrain.py][INFO] Epoch:[0/2](154600/4588595) loss:2.814 lr:0.0000100 epoch_Time:27981.0min: [2024-01-03 09:55:25,036][model8_pretrain.py][INFO] Epoch:[0/2](154600/4588595) loss:3.246 lr:0.0000100 epoch_Time:27981.0min: [2024-01-03 09:55:25,036][model8_pretrain.py][INFO] Epoch:[0/2](154600/4588595) loss:3.056 lr:0.0000100 epoch_Time:27981.0min: [2024-01-03 09:55:25,036][model8_pretrain.py][INFO] Epoch:[0/2](154600/4588595) loss:2.587 lr:0.0000100 epoch_Time:27981.0min: [2024-01-03 09:55:25,036][model8_pretrain.py][INFO] Epoch:[0/2](154600/4588595) loss:2.749 lr:0.0000100 epoch_Time:27981.0min: [2024-01-03 09:55:25,036][model8_pretrain.py][INFO] Epoch:[0/2](154600/4588595) loss:3.175 lr:0.0000100 epoch_Time:27981.0min: [2024-01-03 09:55:25,036][model8_pretrain.py][INFO] Epoch:[0/2](154600/4588595) loss:2.715 lr:0.0000100 epoch_Time:27981.0min: [2024-01-03 09:55:25,036][model8_pretrain.py][INFO] Epoch:[0/2](154600/4588595) loss:2.676 lr:0.0000100 epoch_Time:27981.0min: [2024-01-03 09:56:01,971][model8_pretrain.py][INFO] Epoch:[0/2](154700/4588595) loss:3.106 lr:0.0000100 
epoch_Time:27980.0min: [2024-01-03 09:56:01,971][model8_pretrain.py][INFO] Epoch:[0/2](154700/4588595) loss:2.974 lr:0.0000100 epoch_Time:27980.0min: [2024-01-03 09:56:01,971][model8_pretrain.py][INFO] Epoch:[0/2](154700/4588595) loss:2.345 lr:0.0000100 epoch_Time:27980.0min: [2024-01-03 09:56:01,971][model8_pretrain.py][INFO] Epoch:[0/2](154700/4588595) loss:3.140 lr:0.0000100 epoch_Time:27980.0min: [2024-01-03 09:56:01,971][model8_pretrain.py][INFO] Epoch:[0/2](154700/4588595) loss:2.897 lr:0.0000100 epoch_Time:27980.0min: [2024-01-03 09:56:01,971][model8_pretrain.py][INFO] Epoch:[0/2](154700/4588595) loss:3.407 lr:0.0000100 epoch_Time:27980.0min: [2024-01-03 09:56:01,971][model8_pretrain.py][INFO] Epoch:[0/2](154700/4588595) loss:2.525 lr:0.0000100 epoch_Time:27980.0min: [2024-01-03 09:56:01,971][model8_pretrain.py][INFO] Epoch:[0/2](154700/4588595) loss:2.813 lr:0.0000100 epoch_Time:27980.0min: [2024-01-03 09:56:47,521][model8_pretrain.py][INFO] Epoch:[0/2](154800/4588595) loss:3.282 lr:0.0000100 epoch_Time:27984.0min: [2024-01-03 09:56:47,521][model8_pretrain.py][INFO] Epoch:[0/2](154800/4588595) loss:3.214 lr:0.0000100 epoch_Time:27984.0min: [2024-01-03 09:56:47,521][model8_pretrain.py][INFO] Epoch:[0/2](154800/4588595) loss:2.435 lr:0.0000100 epoch_Time:27984.0min: [2024-01-03 09:56:47,521][model8_pretrain.py][INFO] Epoch:[0/2](154800/4588595) loss:2.641 lr:0.0000100 epoch_Time:27984.0min: [2024-01-03 09:56:47,521][model8_pretrain.py][INFO] Epoch:[0/2](154800/4588595) loss:3.074 lr:0.0000100 epoch_Time:27984.0min: [2024-01-03 09:56:47,521][model8_pretrain.py][INFO] Epoch:[0/2](154800/4588595) loss:2.503 lr:0.0000100 epoch_Time:27984.0min: [2024-01-03 09:56:47,521][model8_pretrain.py][INFO] Epoch:[0/2](154800/4588595) loss:2.309 lr:0.0000100 epoch_Time:27984.0min: [2024-01-03 09:56:47,521][model8_pretrain.py][INFO] Epoch:[0/2](154800/4588595) loss:3.505 lr:0.0000100 epoch_Time:27984.0min: [2024-01-03 09:57:24,445][model8_pretrain.py][INFO] 
Epoch:[0/2](154900/4588595) loss:2.873 lr:0.0000100 epoch_Time:27982.0min: [2024-01-03 09:57:24,445][model8_pretrain.py][INFO] Epoch:[0/2](154900/4588595) loss:3.195 lr:0.0000100 epoch_Time:27982.0min: [2024-01-03 09:57:24,445][model8_pretrain.py][INFO] Epoch:[0/2](154900/4588595) loss:2.992 lr:0.0000100 epoch_Time:27982.0min: [2024-01-03 09:57:24,445][model8_pretrain.py][INFO] Epoch:[0/2](154900/4588595) loss:2.846 lr:0.0000100 epoch_Time:27982.0min: [2024-01-03 09:57:24,445][model8_pretrain.py][INFO] Epoch:[0/2](154900/4588595) loss:3.188 lr:0.0000100 epoch_Time:27982.0min: [2024-01-03 09:57:24,445][model8_pretrain.py][INFO] Epoch:[0/2](154900/4588595) loss:2.980 lr:0.0000100 epoch_Time:27982.0min: [2024-01-03 09:57:24,446][model8_pretrain.py][INFO] Epoch:[0/2](154900/4588595) loss:2.830 lr:0.0000100 epoch_Time:27982.0min: [2024-01-03 09:57:24,446][model8_pretrain.py][INFO] Epoch:[0/2](154900/4588595) loss:3.054 lr:0.0000100 epoch_Time:27982.0min: [2024-01-03 09:58:01,373][model8_pretrain.py][INFO] Epoch:[0/2](155000/4588595) loss:2.973 lr:0.0000100 epoch_Time:27981.0min: [2024-01-03 09:58:01,373][model8_pretrain.py][INFO] Epoch:[0/2](155000/4588595) loss:2.763 lr:0.0000100 epoch_Time:27981.0min: [2024-01-03 09:58:01,373][model8_pretrain.py][INFO] Epoch:[0/2](155000/4588595) loss:3.239 lr:0.0000100 epoch_Time:27981.0min: [2024-01-03 09:58:01,373][model8_pretrain.py][INFO] Epoch:[0/2](155000/4588595) loss:2.642 lr:0.0000100 epoch_Time:27981.0min: [2024-01-03 09:58:01,373][model8_pretrain.py][INFO] Epoch:[0/2](155000/4588595) loss:2.848 lr:0.0000100 epoch_Time:27981.0min: [2024-01-03 09:58:01,373][model8_pretrain.py][INFO] Epoch:[0/2](155000/4588595) loss:3.056 lr:0.0000100 epoch_Time:27981.0min: [2024-01-03 09:58:01,373][model8_pretrain.py][INFO] Epoch:[0/2](155000/4588595) loss:2.990 lr:0.0000100 epoch_Time:27981.0min: [2024-01-03 09:58:01,374][model8_pretrain.py][INFO] Epoch:[0/2](155000/4588595) loss:2.968 lr:0.0000100 epoch_Time:27981.0min: [2024-01-03 
09:58:38,314][model8_pretrain.py][INFO] Epoch:[0/2](155100/4588595) loss:3.170 lr:0.0000100 epoch_Time:27980.0min: [2024-01-03 09:58:38,314][model8_pretrain.py][INFO] Epoch:[0/2](155100/4588595) loss:2.767 lr:0.0000100 epoch_Time:27980.0min: [2024-01-03 09:58:38,314][model8_pretrain.py][INFO] Epoch:[0/2](155100/4588595) loss:3.504 lr:0.0000100 epoch_Time:27980.0min: [2024-01-03 09:58:38,314][model8_pretrain.py][INFO] Epoch:[0/2](155100/4588595) loss:2.929 lr:0.0000100 epoch_Time:27980.0min: [2024-01-03 09:58:38,314][model8_pretrain.py][INFO] Epoch:[0/2](155100/4588595) loss:3.264 lr:0.0000100 epoch_Time:27980.0min: [2024-01-03 09:58:38,314][model8_pretrain.py][INFO] Epoch:[0/2](155100/4588595) loss:2.689 lr:0.0000100 epoch_Time:27980.0min: [2024-01-03 09:58:38,314][model8_pretrain.py][INFO] Epoch:[0/2](155100/4588595) loss:2.666 lr:0.0000100 epoch_Time:27980.0min: [2024-01-03 09:58:38,314][model8_pretrain.py][INFO] Epoch:[0/2](155100/4588595) loss:3.100 lr:0.0000100 epoch_Time:27980.0min: [2024-01-03 09:59:15,237][model8_pretrain.py][INFO] Epoch:[0/2](155200/4588595) loss:2.873 lr:0.0000100 epoch_Time:27979.0min: [2024-01-03 09:59:15,237][model8_pretrain.py][INFO] Epoch:[0/2](155200/4588595) loss:2.783 lr:0.0000100 epoch_Time:27979.0min: [2024-01-03 09:59:15,237][model8_pretrain.py][INFO] Epoch:[0/2](155200/4588595) loss:2.868 lr:0.0000100 epoch_Time:27979.0min: [2024-01-03 09:59:15,237][model8_pretrain.py][INFO] Epoch:[0/2](155200/4588595) loss:2.936 lr:0.0000100 epoch_Time:27979.0min: [2024-01-03 09:59:15,237][model8_pretrain.py][INFO] Epoch:[0/2](155200/4588595) loss:3.475 lr:0.0000100 epoch_Time:27979.0min: [2024-01-03 09:59:15,237][model8_pretrain.py][INFO] Epoch:[0/2](155200/4588595) loss:3.197 lr:0.0000100 epoch_Time:27979.0min: [2024-01-03 09:59:15,237][model8_pretrain.py][INFO] Epoch:[0/2](155200/4588595) loss:2.823 lr:0.0000100 epoch_Time:27979.0min: [2024-01-03 09:59:15,237][model8_pretrain.py][INFO] Epoch:[0/2](155200/4588595) loss:2.859 lr:0.0000100 
epoch_Time:27979.0min: [2024-01-03 09:59:52,168][model8_pretrain.py][INFO] Epoch:[0/2](155300/4588595) loss:2.870 lr:0.0000100 epoch_Time:27977.0min: [2024-01-03 09:59:52,168][model8_pretrain.py][INFO] Epoch:[0/2](155300/4588595) loss:3.155 lr:0.0000100 epoch_Time:27977.0min: [2024-01-03 09:59:52,168][model8_pretrain.py][INFO] Epoch:[0/2](155300/4588595) loss:3.019 lr:0.0000100 epoch_Time:27977.0min: [2024-01-03 09:59:52,168][model8_pretrain.py][INFO] Epoch:[0/2](155300/4588595) loss:2.659 lr:0.0000100 epoch_Time:27977.0min: [2024-01-03 09:59:52,168][model8_pretrain.py][INFO] Epoch:[0/2](155300/4588595) loss:2.585 lr:0.0000100 epoch_Time:27977.0min: [2024-01-03 09:59:52,168][model8_pretrain.py][INFO] Epoch:[0/2](155300/4588595) loss:3.427 lr:0.0000100 epoch_Time:27977.0min: [2024-01-03 09:59:52,168][model8_pretrain.py][INFO] Epoch:[0/2](155300/4588595) loss:3.172 lr:0.0000100 epoch_Time:27977.0min: [2024-01-03 09:59:52,168][model8_pretrain.py][INFO] Epoch:[0/2](155300/4588595) loss:3.164 lr:0.0000100 epoch_Time:27977.0min: [2024-01-03 10:00:29,097][model8_pretrain.py][INFO] Epoch:[0/2](155400/4588595) loss:3.173 lr:0.0000100 epoch_Time:27977.0min: [2024-01-03 10:00:29,097][model8_pretrain.py][INFO] Epoch:[0/2](155400/4588595) loss:3.206 lr:0.0000100 epoch_Time:27977.0min: [2024-01-03 10:00:29,097][model8_pretrain.py][INFO] Epoch:[0/2](155400/4588595) loss:2.873 lr:0.0000100 epoch_Time:27977.0min: [2024-01-03 10:00:29,097][model8_pretrain.py][INFO] Epoch:[0/2](155400/4588595) loss:3.208 lr:0.0000100 epoch_Time:27977.0min: [2024-01-03 10:00:29,097][model8_pretrain.py][INFO] Epoch:[0/2](155400/4588595) loss:3.054 lr:0.0000100 epoch_Time:27977.0min: [2024-01-03 10:00:29,097][model8_pretrain.py][INFO] Epoch:[0/2](155400/4588595) loss:3.004 lr:0.0000100 epoch_Time:27977.0min: [2024-01-03 10:00:29,097][model8_pretrain.py][INFO] Epoch:[0/2](155400/4588595) loss:2.580 lr:0.0000100 epoch_Time:27977.0min: [2024-01-03 10:00:29,097][model8_pretrain.py][INFO] 
Epoch:[0/2](155400/4588595) loss:3.210 lr:0.0000100 epoch_Time:27977.0min: [2024-01-03 10:01:06,027][model8_pretrain.py][INFO] Epoch:[0/2](155500/4588595) loss:3.390 lr:0.0000100 epoch_Time:27975.0min: [2024-01-03 10:01:06,027][model8_pretrain.py][INFO] Epoch:[0/2](155500/4588595) loss:3.385 lr:0.0000100 epoch_Time:27975.0min: [2024-01-03 10:01:06,027][model8_pretrain.py][INFO] Epoch:[0/2](155500/4588595) loss:3.261 lr:0.0000100 epoch_Time:27975.0min: [2024-01-03 10:01:06,027][model8_pretrain.py][INFO] Epoch:[0/2](155500/4588595) loss:3.252 lr:0.0000100 epoch_Time:27975.0min: [2024-01-03 10:01:06,027][model8_pretrain.py][INFO] Epoch:[0/2](155500/4588595) loss:3.051 lr:0.0000100 epoch_Time:27975.0min: [2024-01-03 10:01:06,027][model8_pretrain.py][INFO] Epoch:[0/2](155500/4588595) loss:3.215 lr:0.0000100 epoch_Time:27975.0min: [2024-01-03 10:01:06,027][model8_pretrain.py][INFO] Epoch:[0/2](155500/4588595) loss:2.916 lr:0.0000100 epoch_Time:27975.0min: [2024-01-03 10:01:06,027][model8_pretrain.py][INFO] Epoch:[0/2](155500/4588595) loss:2.982 lr:0.0000100 epoch_Time:27975.0min: [2024-01-03 10:01:51,566][model8_pretrain.py][INFO] Epoch:[0/2](155600/4588595) loss:2.885 lr:0.0000100 epoch_Time:27978.0min: [2024-01-03 10:01:51,566][model8_pretrain.py][INFO] Epoch:[0/2](155600/4588595) loss:3.286 lr:0.0000100 epoch_Time:27978.0min: [2024-01-03 10:01:51,566][model8_pretrain.py][INFO] Epoch:[0/2](155600/4588595) loss:2.788 lr:0.0000100 epoch_Time:27978.0min: [2024-01-03 10:01:51,566][model8_pretrain.py][INFO] Epoch:[0/2](155600/4588595) loss:2.473 lr:0.0000100 epoch_Time:27978.0min: [2024-01-03 10:01:51,566][model8_pretrain.py][INFO] Epoch:[0/2](155600/4588595) loss:3.155 lr:0.0000100 epoch_Time:27978.0min: [2024-01-03 10:01:51,566][model8_pretrain.py][INFO] Epoch:[0/2](155600/4588595) loss:2.997 lr:0.0000100 epoch_Time:27978.0min: [2024-01-03 10:01:51,566][model8_pretrain.py][INFO] Epoch:[0/2](155600/4588595) loss:2.523 lr:0.0000100 epoch_Time:27978.0min: [2024-01-03 
10:01:51,566][model8_pretrain.py][INFO] Epoch:[0/2](155600/4588595) loss:3.639 lr:0.0000100 epoch_Time:27978.0min: [2024-01-03 10:02:28,513][model8_pretrain.py][INFO] Epoch:[0/2](155700/4588595) loss:2.869 lr:0.0000100 epoch_Time:27978.0min: [2024-01-03 10:02:28,513][model8_pretrain.py][INFO] Epoch:[0/2](155700/4588595) loss:2.778 lr:0.0000100 epoch_Time:27978.0min: [2024-01-03 10:02:28,513][model8_pretrain.py][INFO] Epoch:[0/2](155700/4588595) loss:2.973 lr:0.0000100 epoch_Time:27978.0min: [2024-01-03 10:02:28,513][model8_pretrain.py][INFO] Epoch:[0/2](155700/4588595) loss:3.011 lr:0.0000100 epoch_Time:27978.0min: [2024-01-03 10:02:28,513][model8_pretrain.py][INFO] Epoch:[0/2](155700/4588595) loss:3.414 lr:0.0000100 epoch_Time:27978.0min: [2024-01-03 10:02:28,513][model8_pretrain.py][INFO] Epoch:[0/2](155700/4588595) loss:3.343 lr:0.0000100 epoch_Time:27978.0min: [2024-01-03 10:02:28,513][model8_pretrain.py][INFO] Epoch:[0/2](155700/4588595) loss:3.081 lr:0.0000100 epoch_Time:27978.0min: [2024-01-03 10:02:28,513][model8_pretrain.py][INFO] Epoch:[0/2](155700/4588595) loss:3.368 lr:0.0000100 epoch_Time:27978.0min: [2024-01-03 10:03:05,483][model8_pretrain.py][INFO] Epoch:[0/2](155800/4588595) loss:3.032 lr:0.0000100 epoch_Time:27976.0min: [2024-01-03 10:03:05,483][model8_pretrain.py][INFO] Epoch:[0/2](155800/4588595) loss:2.625 lr:0.0000100 epoch_Time:27976.0min: [2024-01-03 10:03:05,483][model8_pretrain.py][INFO] Epoch:[0/2](155800/4588595) loss:2.966 lr:0.0000100 epoch_Time:27976.0min: [2024-01-03 10:03:05,484][model8_pretrain.py][INFO] Epoch:[0/2](155800/4588595) loss:2.953 lr:0.0000100 epoch_Time:27976.0min: [2024-01-03 10:03:05,483][model8_pretrain.py][INFO] Epoch:[0/2](155800/4588595) loss:2.592 lr:0.0000100 epoch_Time:27976.0min: [2024-01-03 10:03:05,483][model8_pretrain.py][INFO] Epoch:[0/2](155800/4588595) loss:3.163 lr:0.0000100 epoch_Time:27976.0min: [2024-01-03 10:03:05,484][model8_pretrain.py][INFO] Epoch:[0/2](155800/4588595) loss:3.127 lr:0.0000100 
epoch_Time:27976.0min: [2024-01-03 10:03:05,484][model8_pretrain.py][INFO] Epoch:[0/2](155800/4588595) loss:3.311 lr:0.0000100 epoch_Time:27976.0min: [2024-01-03 10:03:42,430][model8_pretrain.py][INFO] Epoch:[0/2](155900/4588595) loss:3.142 lr:0.0000100 epoch_Time:27976.0min: [2024-01-03 10:03:42,430][model8_pretrain.py][INFO] Epoch:[0/2](155900/4588595) loss:2.785 lr:0.0000100 epoch_Time:27976.0min: [2024-01-03 10:03:42,430][model8_pretrain.py][INFO] Epoch:[0/2](155900/4588595) loss:3.034 lr:0.0000100 epoch_Time:27976.0min: [2024-01-03 10:03:42,430][model8_pretrain.py][INFO] Epoch:[0/2](155900/4588595) loss:2.792 lr:0.0000100 epoch_Time:27976.0min: [2024-01-03 10:03:42,430][model8_pretrain.py][INFO] Epoch:[0/2](155900/4588595) loss:3.593 lr:0.0000100 epoch_Time:27976.0min: [2024-01-03 10:03:42,430][model8_pretrain.py][INFO] Epoch:[0/2](155900/4588595) loss:2.705 lr:0.0000100 epoch_Time:27976.0min: [2024-01-03 10:03:42,431][model8_pretrain.py][INFO] Epoch:[0/2](155900/4588595) loss:2.809 lr:0.0000100 epoch_Time:27976.0min: [2024-01-03 10:03:42,431][model8_pretrain.py][INFO] Epoch:[0/2](155900/4588595) loss:3.151 lr:0.0000100 epoch_Time:27976.0min: [2024-01-03 10:04:19,381][model8_pretrain.py][INFO] Epoch:[0/2](156000/4588595) loss:2.708 lr:0.0000100 epoch_Time:27974.0min: [2024-01-03 10:04:19,381][model8_pretrain.py][INFO] Epoch:[0/2](156000/4588595) loss:2.924 lr:0.0000100 epoch_Time:27974.0min: [2024-01-03 10:04:19,381][model8_pretrain.py][INFO] Epoch:[0/2](156000/4588595) loss:2.849 lr:0.0000100 epoch_Time:27974.0min: [2024-01-03 10:04:19,381][model8_pretrain.py][INFO] Epoch:[0/2](156000/4588595) loss:2.543 lr:0.0000100 epoch_Time:27974.0min: [2024-01-03 10:04:19,381][model8_pretrain.py][INFO] Epoch:[0/2](156000/4588595) loss:3.104 lr:0.0000100 epoch_Time:27974.0min: [2024-01-03 10:04:19,381][model8_pretrain.py][INFO] Epoch:[0/2](156000/4588595) loss:2.637 lr:0.0000100 epoch_Time:27974.0min: [2024-01-03 10:04:19,381][model8_pretrain.py][INFO] 
Epoch:[0/2](156000/4588595) loss:2.716 lr:0.0000100 epoch_Time:27974.0min: [2024-01-03 10:04:19,382][model8_pretrain.py][INFO] Epoch:[0/2](156000/4588595) loss:2.393 lr:0.0000100 epoch_Time:27974.0min: [2024-01-03 10:04:56,295][model8_pretrain.py][INFO] Epoch:[0/2](156100/4588595) loss:2.472 lr:0.0000100 epoch_Time:27973.0min: [2024-01-03 10:04:56,295][model8_pretrain.py][INFO] Epoch:[0/2](156100/4588595) loss:2.751 lr:0.0000100 epoch_Time:27973.0min: [2024-01-03 10:04:56,295][model8_pretrain.py][INFO] Epoch:[0/2](156100/4588595) loss:2.442 lr:0.0000100 epoch_Time:27973.0min: [2024-01-03 10:04:56,295][model8_pretrain.py][INFO] Epoch:[0/2](156100/4588595) loss:2.992 lr:0.0000100 epoch_Time:27973.0min: [2024-01-03 10:04:56,295][model8_pretrain.py][INFO] Epoch:[0/2](156100/4588595) loss:2.432 lr:0.0000100 epoch_Time:27973.0min: [2024-01-03 10:04:56,295][model8_pretrain.py][INFO] Epoch:[0/2](156100/4588595) loss:3.006 lr:0.0000100 epoch_Time:27973.0min: [2024-01-03 10:04:56,295][model8_pretrain.py][INFO] Epoch:[0/2](156100/4588595) loss:3.089 lr:0.0000100 epoch_Time:27973.0min: [2024-01-03 10:04:56,295][model8_pretrain.py][INFO] Epoch:[0/2](156100/4588595) loss:3.333 lr:0.0000100 epoch_Time:27973.0min: [2024-01-03 10:05:33,241][model8_pretrain.py][INFO] Epoch:[0/2](156200/4588595) loss:2.428 lr:0.0000100 epoch_Time:27972.0min: [2024-01-03 10:05:33,241][model8_pretrain.py][INFO] Epoch:[0/2](156200/4588595) loss:3.200 lr:0.0000100 epoch_Time:27972.0min: [2024-01-03 10:05:33,241][model8_pretrain.py][INFO] Epoch:[0/2](156200/4588595) loss:2.637 lr:0.0000100 epoch_Time:27972.0min: [2024-01-03 10:05:33,241][model8_pretrain.py][INFO] Epoch:[0/2](156200/4588595) loss:2.774 lr:0.0000100 epoch_Time:27972.0min: [2024-01-03 10:05:33,241][model8_pretrain.py][INFO] Epoch:[0/2](156200/4588595) loss:3.261 lr:0.0000100 epoch_Time:27972.0min: [2024-01-03 10:05:33,241][model8_pretrain.py][INFO] Epoch:[0/2](156200/4588595) loss:2.891 lr:0.0000100 epoch_Time:27972.0min: [2024-01-03 
10:05:33,241][model8_pretrain.py][INFO] Epoch:[0/2](156200/4588595) loss:2.996 lr:0.0000100 epoch_Time:27972.0min: [2024-01-03 10:05:33,241][model8_pretrain.py][INFO] Epoch:[0/2](156200/4588595) loss:3.561 lr:0.0000100 epoch_Time:27972.0min: [2024-01-03 10:06:10,204][model8_pretrain.py][INFO] Epoch:[0/2](156300/4588595) loss:2.745 lr:0.0000100 epoch_Time:27971.0min: [2024-01-03 10:06:10,204][model8_pretrain.py][INFO] Epoch:[0/2](156300/4588595) loss:3.051 lr:0.0000100 epoch_Time:27971.0min: [2024-01-03 10:06:10,204][model8_pretrain.py][INFO] Epoch:[0/2](156300/4588595) loss:2.534 lr:0.0000100 epoch_Time:27971.0min: [2024-01-03 10:06:10,204][model8_pretrain.py][INFO] Epoch:[0/2](156300/4588595) loss:3.289 lr:0.0000100 epoch_Time:27971.0min: [2024-01-03 10:06:10,204][model8_pretrain.py][INFO] Epoch:[0/2](156300/4588595) loss:2.935 lr:0.0000100 epoch_Time:27971.0min: [2024-01-03 10:06:10,204][model8_pretrain.py][INFO] Epoch:[0/2](156300/4588595) loss:2.437 lr:0.0000100 epoch_Time:27971.0min: [2024-01-03 10:06:10,204][model8_pretrain.py][INFO] Epoch:[0/2](156300/4588595) loss:3.177 lr:0.0000100 epoch_Time:27971.0min: [2024-01-03 10:06:10,205][model8_pretrain.py][INFO] Epoch:[0/2](156300/4588595) loss:3.333 lr:0.0000100 epoch_Time:27971.0min: [2024-01-03 10:06:56,121][model8_pretrain.py][INFO] Epoch:[0/2](156400/4588595) loss:2.967 lr:0.0000100 epoch_Time:27974.0min: [2024-01-03 10:06:56,121][model8_pretrain.py][INFO] Epoch:[0/2](156400/4588595) loss:3.013 lr:0.0000100 epoch_Time:27974.0min: [2024-01-03 10:06:56,121][model8_pretrain.py][INFO] Epoch:[0/2](156400/4588595) loss:2.829 lr:0.0000100 epoch_Time:27974.0min: [2024-01-03 10:06:56,121][model8_pretrain.py][INFO] Epoch:[0/2](156400/4588595) loss:3.008 lr:0.0000100 epoch_Time:27974.0min: [2024-01-03 10:06:56,121][model8_pretrain.py][INFO] Epoch:[0/2](156400/4588595) loss:2.561 lr:0.0000100 epoch_Time:27974.0min: [2024-01-03 10:06:56,121][model8_pretrain.py][INFO] Epoch:[0/2](156400/4588595) loss:2.850 lr:0.0000100 
epoch_Time:27974.0min: [2024-01-03 10:06:56,121][model8_pretrain.py][INFO] Epoch:[0/2](156400/4588595) loss:3.080 lr:0.0000100 epoch_Time:27974.0min: [2024-01-03 10:06:56,121][model8_pretrain.py][INFO] Epoch:[0/2](156400/4588595) loss:3.280 lr:0.0000100 epoch_Time:27974.0min: [2024-01-03 10:07:33,092][model8_pretrain.py][INFO] Epoch:[0/2](156500/4588595) loss:3.516 lr:0.0000100 epoch_Time:27973.0min: [2024-01-03 10:07:33,092][model8_pretrain.py][INFO] Epoch:[0/2](156500/4588595) loss:3.399 lr:0.0000100 epoch_Time:27973.0min: [2024-01-03 10:07:33,092][model8_pretrain.py][INFO] Epoch:[0/2](156500/4588595) loss:2.598 lr:0.0000100 epoch_Time:27973.0min: [2024-01-03 10:07:33,092][model8_pretrain.py][INFO] Epoch:[0/2](156500/4588595) loss:2.875 lr:0.0000100 epoch_Time:27973.0min: [2024-01-03 10:07:33,092][model8_pretrain.py][INFO] Epoch:[0/2](156500/4588595) loss:2.807 lr:0.0000100 epoch_Time:27973.0min: [2024-01-03 10:07:33,092][model8_pretrain.py][INFO] Epoch:[0/2](156500/4588595) loss:2.888 lr:0.0000100 epoch_Time:27973.0min: [2024-01-03 10:07:33,092][model8_pretrain.py][INFO] Epoch:[0/2](156500/4588595) loss:2.591 lr:0.0000100 epoch_Time:27973.0min: [2024-01-03 10:07:33,092][model8_pretrain.py][INFO] Epoch:[0/2](156500/4588595) loss:3.033 lr:0.0000100 epoch_Time:27973.0min: [2024-01-03 10:08:10,029][model8_pretrain.py][INFO] Epoch:[0/2](156600/4588595) loss:2.792 lr:0.0000100 epoch_Time:27972.0min: [2024-01-03 10:08:10,029][model8_pretrain.py][INFO] Epoch:[0/2](156600/4588595) loss:3.067 lr:0.0000100 epoch_Time:27972.0min: [2024-01-03 10:08:10,029][model8_pretrain.py][INFO] Epoch:[0/2](156600/4588595) loss:2.867 lr:0.0000100 epoch_Time:27972.0min: [2024-01-03 10:08:10,029][model8_pretrain.py][INFO] Epoch:[0/2](156600/4588595) loss:3.312 lr:0.0000100 epoch_Time:27972.0min: [2024-01-03 10:08:10,029][model8_pretrain.py][INFO] Epoch:[0/2](156600/4588595) loss:2.754 lr:0.0000100 epoch_Time:27972.0min: [2024-01-03 10:08:10,029][model8_pretrain.py][INFO] 
Epoch:[0/2](156600/4588595) loss:3.101 lr:0.0000100 epoch_Time:27972.0min: [2024-01-03 10:08:10,029][model8_pretrain.py][INFO] Epoch:[0/2](156600/4588595) loss:3.058 lr:0.0000100 epoch_Time:27972.0min: [2024-01-03 10:08:10,030][model8_pretrain.py][INFO] Epoch:[0/2](156600/4588595) loss:2.905 lr:0.0000100 epoch_Time:27972.0min: [2024-01-03 10:08:46,959][model8_pretrain.py][INFO] Epoch:[0/2](156700/4588595) loss:3.210 lr:0.0000100 epoch_Time:27972.0min: [2024-01-03 10:08:46,959][model8_pretrain.py][INFO] Epoch:[0/2](156700/4588595) loss:2.856 lr:0.0000100 epoch_Time:27972.0min: [2024-01-03 10:08:46,959][model8_pretrain.py][INFO] Epoch:[0/2](156700/4588595) loss:3.026 lr:0.0000100 epoch_Time:27972.0min: [2024-01-03 10:08:46,959][model8_pretrain.py][INFO] Epoch:[0/2](156700/4588595) loss:3.055 lr:0.0000100 epoch_Time:27972.0min: [2024-01-03 10:08:46,959][model8_pretrain.py][INFO] Epoch:[0/2](156700/4588595) loss:3.076 lr:0.0000100 epoch_Time:27972.0min: [2024-01-03 10:08:46,959][model8_pretrain.py][INFO] Epoch:[0/2](156700/4588595) loss:2.895 lr:0.0000100 epoch_Time:27972.0min: [2024-01-03 10:08:46,959][model8_pretrain.py][INFO] Epoch:[0/2](156700/4588595) loss:2.118 lr:0.0000100 epoch_Time:27972.0min: [2024-01-03 10:08:46,959][model8_pretrain.py][INFO] Epoch:[0/2](156700/4588595) loss:2.518 lr:0.0000100 epoch_Time:27972.0min: [2024-01-03 10:09:23,883][model8_pretrain.py][INFO] Epoch:[0/2](156800/4588595) loss:2.935 lr:0.0000100 epoch_Time:27970.0min: [2024-01-03 10:09:23,883][model8_pretrain.py][INFO] Epoch:[0/2](156800/4588595) loss:2.511 lr:0.0000100 epoch_Time:27970.0min: [2024-01-03 10:09:23,883][model8_pretrain.py][INFO] Epoch:[0/2](156800/4588595) loss:2.476 lr:0.0000100 epoch_Time:27970.0min: [2024-01-03 10:09:23,883][model8_pretrain.py][INFO] Epoch:[0/2](156800/4588595) loss:3.052 lr:0.0000100 epoch_Time:27970.0min: [2024-01-03 10:09:23,883][model8_pretrain.py][INFO] Epoch:[0/2](156800/4588595) loss:3.243 lr:0.0000100 epoch_Time:27970.0min: [2024-01-03 
10:09:23,883][model8_pretrain.py][INFO] Epoch:[0/2](156800/4588595) loss:2.714 lr:0.0000100 epoch_Time:27970.0min: [2024-01-03 10:09:23,883][model8_pretrain.py][INFO] Epoch:[0/2](156800/4588595) loss:2.552 lr:0.0000100 epoch_Time:27970.0min: [2024-01-03 10:09:23,884][model8_pretrain.py][INFO] Epoch:[0/2](156800/4588595) loss:2.111 lr:0.0000100 epoch_Time:27970.0min: [2024-01-03 10:10:00,824][model8_pretrain.py][INFO] Epoch:[0/2](156900/4588595) loss:3.018 lr:0.0000100 epoch_Time:27969.0min: [2024-01-03 10:10:00,824][model8_pretrain.py][INFO] Epoch:[0/2](156900/4588595) loss:2.894 lr:0.0000100 epoch_Time:27969.0min: [2024-01-03 10:10:00,824][model8_pretrain.py][INFO] Epoch:[0/2](156900/4588595) loss:2.868 lr:0.0000100 epoch_Time:27969.0min: [2024-01-03 10:10:00,824][model8_pretrain.py][INFO] Epoch:[0/2](156900/4588595) loss:2.910 lr:0.0000100 epoch_Time:27969.0min: [2024-01-03 10:10:00,824][model8_pretrain.py][INFO] Epoch:[0/2](156900/4588595) loss:3.332 lr:0.0000100 epoch_Time:27969.0min: [2024-01-03 10:10:00,824][model8_pretrain.py][INFO] Epoch:[0/2](156900/4588595) loss:2.870 lr:0.0000100 epoch_Time:27969.0min: [2024-01-03 10:10:00,824][model8_pretrain.py][INFO] Epoch:[0/2](156900/4588595) loss:2.480 lr:0.0000100 epoch_Time:27969.0min: [2024-01-03 10:10:00,824][model8_pretrain.py][INFO] Epoch:[0/2](156900/4588595) loss:3.248 lr:0.0000100 epoch_Time:27969.0min: [2024-01-03 10:10:37,767][model8_pretrain.py][INFO] Epoch:[0/2](157000/4588595) loss:2.710 lr:0.0000100 epoch_Time:27968.0min: [2024-01-03 10:10:37,767][model8_pretrain.py][INFO] Epoch:[0/2](157000/4588595) loss:2.460 lr:0.0000100 epoch_Time:27968.0min: [2024-01-03 10:10:37,767][model8_pretrain.py][INFO] Epoch:[0/2](157000/4588595) loss:2.597 lr:0.0000100 epoch_Time:27968.0min: [2024-01-03 10:10:37,767][model8_pretrain.py][INFO] Epoch:[0/2](157000/4588595) loss:3.341 lr:0.0000100 epoch_Time:27968.0min: [2024-01-03 10:10:37,767][model8_pretrain.py][INFO] Epoch:[0/2](157000/4588595) loss:2.584 lr:0.0000100 
epoch_Time:27968.0min: [2024-01-03 10:10:37,767][model8_pretrain.py][INFO] Epoch:[0/2](157000/4588595) loss:2.397 lr:0.0000100 epoch_Time:27968.0min: [2024-01-03 10:10:37,767][model8_pretrain.py][INFO] Epoch:[0/2](157000/4588595) loss:2.542 lr:0.0000100 epoch_Time:27968.0min: [2024-01-03 10:10:37,767][model8_pretrain.py][INFO] Epoch:[0/2](157000/4588595) loss:3.245 lr:0.0000100 epoch_Time:27968.0min: [2024-01-03 10:11:14,700][model8_pretrain.py][INFO] Epoch:[0/2](157100/4588595) loss:2.374 lr:0.0000100 epoch_Time:27967.0min: [2024-01-03 10:11:14,700][model8_pretrain.py][INFO] Epoch:[0/2](157100/4588595) loss:2.692 lr:0.0000100 epoch_Time:27967.0min: [2024-01-03 10:11:14,700][model8_pretrain.py][INFO] Epoch:[0/2](157100/4588595) loss:2.946 lr:0.0000100 epoch_Time:27967.0min: [2024-01-03 10:11:14,700][model8_pretrain.py][INFO] Epoch:[0/2](157100/4588595) loss:2.974 lr:0.0000100 epoch_Time:27967.0min: [2024-01-03 10:11:14,700][model8_pretrain.py][INFO] Epoch:[0/2](157100/4588595) loss:3.084 lr:0.0000100 epoch_Time:27967.0min: [2024-01-03 10:11:14,700][model8_pretrain.py][INFO] Epoch:[0/2](157100/4588595) loss:2.998 lr:0.0000100 epoch_Time:27967.0min: [2024-01-03 10:11:14,700][model8_pretrain.py][INFO] Epoch:[0/2](157100/4588595) loss:2.938 lr:0.0000100 epoch_Time:27967.0min: [2024-01-03 10:11:14,700][model8_pretrain.py][INFO] Epoch:[0/2](157100/4588595) loss:3.193 lr:0.0000100 epoch_Time:27967.0min: [2024-01-03 10:12:00,364][model8_pretrain.py][INFO] Epoch:[0/2](157200/4588595) loss:2.920 lr:0.0000100 epoch_Time:27970.0min: [2024-01-03 10:12:00,364][model8_pretrain.py][INFO] Epoch:[0/2](157200/4588595) loss:2.987 lr:0.0000100 epoch_Time:27970.0min: [2024-01-03 10:12:00,364][model8_pretrain.py][INFO] Epoch:[0/2](157200/4588595) loss:3.057 lr:0.0000100 epoch_Time:27970.0min: [2024-01-03 10:12:00,365][model8_pretrain.py][INFO] Epoch:[0/2](157200/4588595) loss:3.089 lr:0.0000100 epoch_Time:27970.0min: [2024-01-03 10:12:00,364][model8_pretrain.py][INFO] 
Epoch:[0/2](157200/4588595) loss:3.211 lr:0.0000100 epoch_Time:27970.0min: [2024-01-03 10:12:00,365][model8_pretrain.py][INFO] Epoch:[0/2](157200/4588595) loss:2.331 lr:0.0000100 epoch_Time:27970.0min: [2024-01-03 10:12:00,365][model8_pretrain.py][INFO] Epoch:[0/2](157200/4588595) loss:2.730 lr:0.0000100 epoch_Time:27970.0min: [2024-01-03 10:12:00,365][model8_pretrain.py][INFO] Epoch:[0/2](157200/4588595) loss:2.953 lr:0.0000100 epoch_Time:27970.0min: [2024-01-03 10:12:37,331][model8_pretrain.py][INFO] Epoch:[0/2](157300/4588595) loss:2.930 lr:0.0000100 epoch_Time:27969.0min: [2024-01-03 10:12:37,331][model8_pretrain.py][INFO] Epoch:[0/2](157300/4588595) loss:3.515 lr:0.0000100 epoch_Time:27969.0min: [2024-01-03 10:12:37,331][model8_pretrain.py][INFO] Epoch:[0/2](157300/4588595) loss:2.430 lr:0.0000100 epoch_Time:27969.0min: [2024-01-03 10:12:37,331][model8_pretrain.py][INFO] Epoch:[0/2](157300/4588595) loss:3.063 lr:0.0000100 epoch_Time:27969.0min: [2024-01-03 10:12:37,331][model8_pretrain.py][INFO] Epoch:[0/2](157300/4588595) loss:3.061 lr:0.0000100 epoch_Time:27969.0min: [2024-01-03 10:12:37,331][model8_pretrain.py][INFO] Epoch:[0/2](157300/4588595) loss:3.151 lr:0.0000100 epoch_Time:27969.0min: [2024-01-03 10:12:37,331][model8_pretrain.py][INFO] Epoch:[0/2](157300/4588595) loss:3.344 lr:0.0000100 epoch_Time:27969.0min: [2024-01-03 10:12:37,331][model8_pretrain.py][INFO] Epoch:[0/2](157300/4588595) loss:3.012 lr:0.0000100 epoch_Time:27969.0min: [2024-01-03 10:13:14,265][model8_pretrain.py][INFO] Epoch:[0/2](157400/4588595) loss:3.277 lr:0.0000100 epoch_Time:27968.0min: [2024-01-03 10:13:14,265][model8_pretrain.py][INFO] Epoch:[0/2](157400/4588595) loss:3.066 lr:0.0000100 epoch_Time:27968.0min: [2024-01-03 10:13:14,265][model8_pretrain.py][INFO] Epoch:[0/2](157400/4588595) loss:2.738 lr:0.0000100 epoch_Time:27968.0min: [2024-01-03 10:13:14,265][model8_pretrain.py][INFO] Epoch:[0/2](157400/4588595) loss:3.282 lr:0.0000100 epoch_Time:27968.0min: [2024-01-03 
10:13:14,265][model8_pretrain.py][INFO] Epoch:[0/2](157400/4588595) loss:3.416 lr:0.0000100 epoch_Time:27968.0min: [2024-01-03 10:13:14,265][model8_pretrain.py][INFO] Epoch:[0/2](157400/4588595) loss:2.749 lr:0.0000100 epoch_Time:27968.0min: [2024-01-03 10:13:14,265][model8_pretrain.py][INFO] Epoch:[0/2](157400/4588595) loss:2.761 lr:0.0000100 epoch_Time:27968.0min: [2024-01-03 10:13:14,265][model8_pretrain.py][INFO] Epoch:[0/2](157400/4588595) loss:2.832 lr:0.0000100 epoch_Time:27968.0min: [2024-01-03 10:13:51,199][model8_pretrain.py][INFO] Epoch:[0/2](157500/4588595) loss:3.414 lr:0.0000100 epoch_Time:27966.0min: [2024-01-03 10:13:51,199][model8_pretrain.py][INFO] Epoch:[0/2](157500/4588595) loss:2.998 lr:0.0000100 epoch_Time:27966.0min: [2024-01-03 10:13:51,199][model8_pretrain.py][INFO] Epoch:[0/2](157500/4588595) loss:2.868 lr:0.0000100 epoch_Time:27966.0min: [2024-01-03 10:13:51,199][model8_pretrain.py][INFO] Epoch:[0/2](157500/4588595) loss:2.783 lr:0.0000100 epoch_Time:27966.0min: [2024-01-03 10:13:51,199][model8_pretrain.py][INFO] Epoch:[0/2](157500/4588595) loss:3.078 lr:0.0000100 epoch_Time:27966.0min: [2024-01-03 10:13:51,199][model8_pretrain.py][INFO] Epoch:[0/2](157500/4588595) loss:2.543 lr:0.0000100 epoch_Time:27966.0min: [2024-01-03 10:13:51,199][model8_pretrain.py][INFO] Epoch:[0/2](157500/4588595) loss:2.944 lr:0.0000100 epoch_Time:27966.0min: [2024-01-03 10:13:51,199][model8_pretrain.py][INFO] Epoch:[0/2](157500/4588595) loss:3.498 lr:0.0000100 epoch_Time:27966.0min: [2024-01-03 10:14:28,137][model8_pretrain.py][INFO] Epoch:[0/2](157600/4588595) loss:2.858 lr:0.0000100 epoch_Time:27966.0min: [2024-01-03 10:14:28,137][model8_pretrain.py][INFO] Epoch:[0/2](157600/4588595) loss:3.536 lr:0.0000100 epoch_Time:27966.0min: [2024-01-03 10:14:28,137][model8_pretrain.py][INFO] Epoch:[0/2](157600/4588595) loss:2.665 lr:0.0000100 epoch_Time:27966.0min: [2024-01-03 10:14:28,137][model8_pretrain.py][INFO] Epoch:[0/2](157600/4588595) loss:3.264 lr:0.0000100 
epoch_Time:27966.0min: [2024-01-03 10:14:28,137][model8_pretrain.py][INFO] Epoch:[0/2](157600/4588595) loss:3.453 lr:0.0000100 epoch_Time:27966.0min: [2024-01-03 10:14:28,137][model8_pretrain.py][INFO] Epoch:[0/2](157600/4588595) loss:2.682 lr:0.0000100 epoch_Time:27966.0min: [2024-01-03 10:14:28,137][model8_pretrain.py][INFO] Epoch:[0/2](157600/4588595) loss:3.127 lr:0.0000100 epoch_Time:27966.0min: [2024-01-03 10:14:28,137][model8_pretrain.py][INFO] Epoch:[0/2](157600/4588595) loss:3.165 lr:0.0000100 epoch_Time:27966.0min: [2024-01-03 10:15:05,074][model8_pretrain.py][INFO] Epoch:[0/2](157700/4588595) loss:2.628 lr:0.0000100 epoch_Time:27964.0min: [2024-01-03 10:15:05,074][model8_pretrain.py][INFO] Epoch:[0/2](157700/4588595) loss:2.859 lr:0.0000100 epoch_Time:27964.0min: [2024-01-03 10:15:05,074][model8_pretrain.py][INFO] Epoch:[0/2](157700/4588595) loss:2.556 lr:0.0000100 epoch_Time:27964.0min: [2024-01-03 10:15:05,074][model8_pretrain.py][INFO] Epoch:[0/2](157700/4588595) loss:3.112 lr:0.0000100 epoch_Time:27964.0min: [2024-01-03 10:15:05,074][model8_pretrain.py][INFO] Epoch:[0/2](157700/4588595) loss:2.923 lr:0.0000100 epoch_Time:27964.0min: [2024-01-03 10:15:05,074][model8_pretrain.py][INFO] Epoch:[0/2](157700/4588595) loss:3.212 lr:0.0000100 epoch_Time:27964.0min: [2024-01-03 10:15:05,074][model8_pretrain.py][INFO] Epoch:[0/2](157700/4588595) loss:3.015 lr:0.0000100 epoch_Time:27964.0min: [2024-01-03 10:15:05,074][model8_pretrain.py][INFO] Epoch:[0/2](157700/4588595) loss:2.887 lr:0.0000100 epoch_Time:27964.0min: [2024-01-03 10:15:42,019][model8_pretrain.py][INFO] Epoch:[0/2](157800/4588595) loss:3.080 lr:0.0000100 epoch_Time:27964.0min: [2024-01-03 10:15:42,019][model8_pretrain.py][INFO] Epoch:[0/2](157800/4588595) loss:3.413 lr:0.0000100 epoch_Time:27964.0min: [2024-01-03 10:15:42,019][model8_pretrain.py][INFO] Epoch:[0/2](157800/4588595) loss:2.879 lr:0.0000100 epoch_Time:27964.0min: [2024-01-03 10:15:42,019][model8_pretrain.py][INFO] 
Epoch:[0/2](157800/4588595) loss:2.746 lr:0.0000100 epoch_Time:27964.0min: [2024-01-03 10:15:42,019][model8_pretrain.py][INFO] Epoch:[0/2](157800/4588595) loss:2.680 lr:0.0000100 epoch_Time:27964.0min: [2024-01-03 10:15:42,019][model8_pretrain.py][INFO] Epoch:[0/2](157800/4588595) loss:2.722 lr:0.0000100 epoch_Time:27964.0min: [2024-01-03 10:15:42,019][model8_pretrain.py][INFO] Epoch:[0/2](157800/4588595) loss:2.658 lr:0.0000100 epoch_Time:27964.0min: [2024-01-03 10:15:42,019][model8_pretrain.py][INFO] Epoch:[0/2](157800/4588595) loss:2.464 lr:0.0000100 epoch_Time:27964.0min: [2024-01-03 10:16:18,952][model8_pretrain.py][INFO] Epoch:[0/2](157900/4588595) loss:3.053 lr:0.0000100 epoch_Time:27962.0min: [2024-01-03 10:16:18,952][model8_pretrain.py][INFO] Epoch:[0/2](157900/4588595) loss:2.737 lr:0.0000100 epoch_Time:27962.0min: [2024-01-03 10:16:18,952][model8_pretrain.py][INFO] Epoch:[0/2](157900/4588595) loss:3.265 lr:0.0000100 epoch_Time:27962.0min: [2024-01-03 10:16:18,952][model8_pretrain.py][INFO] Epoch:[0/2](157900/4588595) loss:2.757 lr:0.0000100 epoch_Time:27962.0min: [2024-01-03 10:16:18,952][model8_pretrain.py][INFO] Epoch:[0/2](157900/4588595) loss:3.073 lr:0.0000100 epoch_Time:27962.0min: [2024-01-03 10:16:18,952][model8_pretrain.py][INFO] Epoch:[0/2](157900/4588595) loss:2.716 lr:0.0000100 epoch_Time:27962.0min: [2024-01-03 10:16:18,952][model8_pretrain.py][INFO] Epoch:[0/2](157900/4588595) loss:3.215 lr:0.0000100 epoch_Time:27962.0min: [2024-01-03 10:16:18,952][model8_pretrain.py][INFO] Epoch:[0/2](157900/4588595) loss:2.867 lr:0.0000100 epoch_Time:27962.0min: [2024-01-03 10:17:04,549][model8_pretrain.py][INFO] Epoch:[0/2](158000/4588595) loss:2.989 lr:0.0000100 epoch_Time:27965.0min: [2024-01-03 10:17:04,549][model8_pretrain.py][INFO] Epoch:[0/2](158000/4588595) loss:3.081 lr:0.0000100 epoch_Time:27965.0min: [2024-01-03 10:17:04,549][model8_pretrain.py][INFO] Epoch:[0/2](158000/4588595) loss:3.189 lr:0.0000100 epoch_Time:27965.0min: [2024-01-03 
10:17:04,549][model8_pretrain.py][INFO] Epoch:[0/2](158000/4588595) loss:3.351 lr:0.0000100 epoch_Time:27965.0min: [2024-01-03 10:17:04,549][model8_pretrain.py][INFO] Epoch:[0/2](158000/4588595) loss:3.319 lr:0.0000100 epoch_Time:27965.0min: [2024-01-03 10:17:04,549][model8_pretrain.py][INFO] Epoch:[0/2](158000/4588595) loss:3.058 lr:0.0000100 epoch_Time:27965.0min: [2024-01-03 10:17:04,549][model8_pretrain.py][INFO] Epoch:[0/2](158000/4588595) loss:2.943 lr:0.0000100 epoch_Time:27965.0min: [2024-01-03 10:17:04,549][model8_pretrain.py][INFO] Epoch:[0/2](158000/4588595) loss:2.519 lr:0.0000100 epoch_Time:27965.0min: [2024-01-03 10:17:41,471][model8_pretrain.py][INFO] Epoch:[0/2](158100/4588595) loss:3.244 lr:0.0000100 epoch_Time:27965.0min: [2024-01-03 10:17:41,471][model8_pretrain.py][INFO] Epoch:[0/2](158100/4588595) loss:3.222 lr:0.0000100 epoch_Time:27965.0min: [2024-01-03 10:17:41,471][model8_pretrain.py][INFO] Epoch:[0/2](158100/4588595) loss:3.590 lr:0.0000100 epoch_Time:27965.0min: [2024-01-03 10:17:41,471][model8_pretrain.py][INFO] Epoch:[0/2](158100/4588595) loss:2.581 lr:0.0000100 epoch_Time:27965.0min: [2024-01-03 10:17:41,471][model8_pretrain.py][INFO] Epoch:[0/2](158100/4588595) loss:2.971 lr:0.0000100 epoch_Time:27965.0min: [2024-01-03 10:17:41,471][model8_pretrain.py][INFO] Epoch:[0/2](158100/4588595) loss:2.489 lr:0.0000100 epoch_Time:27965.0min: [2024-01-03 10:17:41,471][model8_pretrain.py][INFO] Epoch:[0/2](158100/4588595) loss:2.322 lr:0.0000100 epoch_Time:27965.0min: [2024-01-03 10:17:41,471][model8_pretrain.py][INFO] Epoch:[0/2](158100/4588595) loss:3.014 lr:0.0000100 epoch_Time:27965.0min: [2024-01-03 10:18:18,407][model8_pretrain.py][INFO] Epoch:[0/2](158200/4588595) loss:2.908 lr:0.0000100 epoch_Time:27963.0min: [2024-01-03 10:18:18,407][model8_pretrain.py][INFO] Epoch:[0/2](158200/4588595) loss:2.876 lr:0.0000100 epoch_Time:27963.0min: [2024-01-03 10:18:18,407][model8_pretrain.py][INFO] Epoch:[0/2](158200/4588595) loss:3.061 lr:0.0000100 
epoch_Time:27963.0min: [2024-01-03 10:18:18,407][model8_pretrain.py][INFO] Epoch:[0/2](158200/4588595) loss:3.443 lr:0.0000100 epoch_Time:27963.0min: [2024-01-03 10:18:18,407][model8_pretrain.py][INFO] Epoch:[0/2](158200/4588595) loss:3.188 lr:0.0000100 epoch_Time:27963.0min: [2024-01-03 10:18:18,407][model8_pretrain.py][INFO] Epoch:[0/2](158200/4588595) loss:2.936 lr:0.0000100 epoch_Time:27963.0min: [2024-01-03 10:18:18,407][model8_pretrain.py][INFO] Epoch:[0/2](158200/4588595) loss:2.803 lr:0.0000100 epoch_Time:27963.0min: [2024-01-03 10:18:18,407][model8_pretrain.py][INFO] Epoch:[0/2](158200/4588595) loss:2.596 lr:0.0000100 epoch_Time:27963.0min: [2024-01-03 10:18:55,332][model8_pretrain.py][INFO] Epoch:[0/2](158300/4588595) loss:2.851 lr:0.0000100 epoch_Time:27962.0min: [2024-01-03 10:18:55,332][model8_pretrain.py][INFO] Epoch:[0/2](158300/4588595) loss:3.023 lr:0.0000100 epoch_Time:27962.0min: [2024-01-03 10:18:55,332][model8_pretrain.py][INFO] Epoch:[0/2](158300/4588595) loss:3.089 lr:0.0000100 epoch_Time:27962.0min: [2024-01-03 10:18:55,332][model8_pretrain.py][INFO] Epoch:[0/2](158300/4588595) loss:2.552 lr:0.0000100 epoch_Time:27962.0min: [2024-01-03 10:18:55,332][model8_pretrain.py][INFO] Epoch:[0/2](158300/4588595) loss:3.397 lr:0.0000100 epoch_Time:27962.0min: [2024-01-03 10:18:55,332][model8_pretrain.py][INFO] Epoch:[0/2](158300/4588595) loss:2.911 lr:0.0000100 epoch_Time:27962.0min: [2024-01-03 10:18:55,332][model8_pretrain.py][INFO] Epoch:[0/2](158300/4588595) loss:3.013 lr:0.0000100 epoch_Time:27962.0min: [2024-01-03 10:18:55,333][model8_pretrain.py][INFO] Epoch:[0/2](158300/4588595) loss:3.239 lr:0.0000100 epoch_Time:27962.0min: [2024-01-03 10:19:32,266][model8_pretrain.py][INFO] Epoch:[0/2](158400/4588595) loss:2.529 lr:0.0000100 epoch_Time:27961.0min: [2024-01-03 10:19:32,266][model8_pretrain.py][INFO] Epoch:[0/2](158400/4588595) loss:3.180 lr:0.0000100 epoch_Time:27961.0min: [2024-01-03 10:19:32,266][model8_pretrain.py][INFO] 
Epoch:[0/2](158400/4588595) loss:3.061 lr:0.0000100 epoch_Time:27961.0min: [2024-01-03 10:19:32,266][model8_pretrain.py][INFO] Epoch:[0/2](158400/4588595) loss:2.754 lr:0.0000100 epoch_Time:27961.0min: [2024-01-03 10:19:32,266][model8_pretrain.py][INFO] Epoch:[0/2](158400/4588595) loss:2.898 lr:0.0000100 epoch_Time:27961.0min: [2024-01-03 10:19:32,266][model8_pretrain.py][INFO] Epoch:[0/2](158400/4588595) loss:3.502 lr:0.0000100 epoch_Time:27961.0min: [2024-01-03 10:19:32,266][model8_pretrain.py][INFO] Epoch:[0/2](158400/4588595) loss:2.520 lr:0.0000100 epoch_Time:27961.0min: [2024-01-03 10:19:32,266][model8_pretrain.py][INFO] Epoch:[0/2](158400/4588595) loss:3.203 lr:0.0000100 epoch_Time:27961.0min: [2024-01-03 10:20:09,195][model8_pretrain.py][INFO] Epoch:[0/2](158500/4588595) loss:2.751 lr:0.0000100 epoch_Time:27960.0min: [2024-01-03 10:20:09,195][model8_pretrain.py][INFO] Epoch:[0/2](158500/4588595) loss:2.832 lr:0.0000100 epoch_Time:27960.0min: [2024-01-03 10:20:09,195][model8_pretrain.py][INFO] Epoch:[0/2](158500/4588595) loss:3.043 lr:0.0000100 epoch_Time:27960.0min: [2024-01-03 10:20:09,195][model8_pretrain.py][INFO] Epoch:[0/2](158500/4588595) loss:3.420 lr:0.0000100 epoch_Time:27960.0min: [2024-01-03 10:20:09,195][model8_pretrain.py][INFO] Epoch:[0/2](158500/4588595) loss:3.234 lr:0.0000100 epoch_Time:27960.0min: [2024-01-03 10:20:09,195][model8_pretrain.py][INFO] Epoch:[0/2](158500/4588595) loss:3.382 lr:0.0000100 epoch_Time:27960.0min: [2024-01-03 10:20:09,195][model8_pretrain.py][INFO] Epoch:[0/2](158500/4588595) loss:2.697 lr:0.0000100 epoch_Time:27960.0min: [2024-01-03 10:20:09,195][model8_pretrain.py][INFO] Epoch:[0/2](158500/4588595) loss:2.690 lr:0.0000100 epoch_Time:27960.0min: [2024-01-03 10:20:46,121][model8_pretrain.py][INFO] Epoch:[0/2](158600/4588595) loss:2.712 lr:0.0000100 epoch_Time:27959.0min: [2024-01-03 10:20:46,121][model8_pretrain.py][INFO] Epoch:[0/2](158600/4588595) loss:3.213 lr:0.0000100 epoch_Time:27959.0min: [2024-01-03 
10:20:46,121][model8_pretrain.py][INFO] Epoch:[0/2](158600/4588595) loss:3.220 lr:0.0000100 epoch_Time:27959.0min: [2024-01-03 10:20:46,121][model8_pretrain.py][INFO] Epoch:[0/2](158600/4588595) loss:3.275 lr:0.0000100 epoch_Time:27959.0min: [2024-01-03 10:20:46,121][model8_pretrain.py][INFO] Epoch:[0/2](158600/4588595) loss:2.722 lr:0.0000100 epoch_Time:27959.0min: [2024-01-03 10:20:46,121][model8_pretrain.py][INFO] Epoch:[0/2](158600/4588595) loss:2.955 lr:0.0000100 epoch_Time:27959.0min: [2024-01-03 10:20:46,121][model8_pretrain.py][INFO] Epoch:[0/2](158600/4588595) loss:3.137 lr:0.0000100 epoch_Time:27959.0min: [2024-01-03 10:20:46,122][model8_pretrain.py][INFO] Epoch:[0/2](158600/4588595) loss:2.649 lr:0.0000100 epoch_Time:27959.0min: [2024-01-03 10:21:23,054][model8_pretrain.py][INFO] Epoch:[0/2](158700/4588595) loss:2.750 lr:0.0000100 epoch_Time:27958.0min: [2024-01-03 10:21:23,054][model8_pretrain.py][INFO] Epoch:[0/2](158700/4588595) loss:3.160 lr:0.0000100 epoch_Time:27958.0min: [2024-01-03 10:21:23,054][model8_pretrain.py][INFO] Epoch:[0/2](158700/4588595) loss:3.224 lr:0.0000100 epoch_Time:27958.0min: [2024-01-03 10:21:23,054][model8_pretrain.py][INFO] Epoch:[0/2](158700/4588595) loss:2.843 lr:0.0000100 epoch_Time:27958.0min: [2024-01-03 10:21:23,054][model8_pretrain.py][INFO] Epoch:[0/2](158700/4588595) loss:3.074 lr:0.0000100 epoch_Time:27958.0min: [2024-01-03 10:21:23,054][model8_pretrain.py][INFO] Epoch:[0/2](158700/4588595) loss:2.799 lr:0.0000100 epoch_Time:27958.0min: [2024-01-03 10:21:23,054][model8_pretrain.py][INFO] Epoch:[0/2](158700/4588595) loss:2.878 lr:0.0000100 epoch_Time:27958.0min: [2024-01-03 10:21:23,055][model8_pretrain.py][INFO] Epoch:[0/2](158700/4588595) loss:2.848 lr:0.0000100 epoch_Time:27958.0min: [2024-01-03 10:22:08,479][model8_pretrain.py][INFO] Epoch:[0/2](158800/4588595) loss:2.505 lr:0.0000100 epoch_Time:27961.0min: [2024-01-03 10:22:08,479][model8_pretrain.py][INFO] Epoch:[0/2](158800/4588595) loss:2.833 lr:0.0000100 
epoch_Time:27961.0min: [2024-01-03 10:22:08,479][model8_pretrain.py][INFO] Epoch:[0/2](158800/4588595) loss:2.977 lr:0.0000100 epoch_Time:27961.0min: [2024-01-03 10:22:08,479][model8_pretrain.py][INFO] Epoch:[0/2](158800/4588595) loss:3.419 lr:0.0000100 epoch_Time:27961.0min: [2024-01-03 10:22:08,479][model8_pretrain.py][INFO] Epoch:[0/2](158800/4588595) loss:2.371 lr:0.0000100 epoch_Time:27961.0min: [2024-01-03 10:22:08,479][model8_pretrain.py][INFO] Epoch:[0/2](158800/4588595) loss:2.960 lr:0.0000100 epoch_Time:27961.0min: [2024-01-03 10:22:08,479][model8_pretrain.py][INFO] Epoch:[0/2](158800/4588595) loss:3.415 lr:0.0000100 epoch_Time:27961.0min: [2024-01-03 10:22:08,479][model8_pretrain.py][INFO] Epoch:[0/2](158800/4588595) loss:3.000 lr:0.0000100 epoch_Time:27961.0min: [2024-01-03 10:22:45,404][model8_pretrain.py][INFO] Epoch:[0/2](158900/4588595) loss:3.519 lr:0.0000100 epoch_Time:27960.0min: [2024-01-03 10:22:45,404][model8_pretrain.py][INFO] Epoch:[0/2](158900/4588595) loss:2.954 lr:0.0000100 epoch_Time:27960.0min: [2024-01-03 10:22:45,404][model8_pretrain.py][INFO] Epoch:[0/2](158900/4588595) loss:3.048 lr:0.0000100 epoch_Time:27960.0min: [2024-01-03 10:22:45,404][model8_pretrain.py][INFO] Epoch:[0/2](158900/4588595) loss:2.356 lr:0.0000100 epoch_Time:27960.0min: [2024-01-03 10:22:45,404][model8_pretrain.py][INFO] Epoch:[0/2](158900/4588595) loss:2.771 lr:0.0000100 epoch_Time:27960.0min: [2024-01-03 10:22:45,404][model8_pretrain.py][INFO] Epoch:[0/2](158900/4588595) loss:2.601 lr:0.0000100 epoch_Time:27960.0min: [2024-01-03 10:22:45,404][model8_pretrain.py][INFO] Epoch:[0/2](158900/4588595) loss:2.845 lr:0.0000100 epoch_Time:27960.0min: [2024-01-03 10:22:45,404][model8_pretrain.py][INFO] Epoch:[0/2](158900/4588595) loss:2.905 lr:0.0000100 epoch_Time:27960.0min: [2024-01-03 10:23:22,347][model8_pretrain.py][INFO] Epoch:[0/2](159000/4588595) loss:2.697 lr:0.0000100 epoch_Time:27959.0min: [2024-01-03 10:23:22,347][model8_pretrain.py][INFO] 
Epoch:[0/2](159000/4588595) loss:3.240 lr:0.0000100 epoch_Time:27959.0min: [2024-01-03 10:23:22,347][model8_pretrain.py][INFO] Epoch:[0/2](159000/4588595) loss:3.001 lr:0.0000100 epoch_Time:27959.0min: [2024-01-03 10:23:22,347][model8_pretrain.py][INFO] Epoch:[0/2](159000/4588595) loss:2.851 lr:0.0000100 epoch_Time:27959.0min: [2024-01-03 10:23:22,347][model8_pretrain.py][INFO] Epoch:[0/2](159000/4588595) loss:2.831 lr:0.0000100 epoch_Time:27959.0min: [2024-01-03 10:23:22,348][model8_pretrain.py][INFO] Epoch:[0/2](159000/4588595) loss:3.376 lr:0.0000100 epoch_Time:27959.0min: [2024-01-03 10:23:22,348][model8_pretrain.py][INFO] Epoch:[0/2](159000/4588595) loss:2.902 lr:0.0000100 epoch_Time:27959.0min: [2024-01-03 10:23:22,348][model8_pretrain.py][INFO] Epoch:[0/2](159000/4588595) loss:2.845 lr:0.0000100 epoch_Time:27959.0min: [2024-01-03 10:23:59,287][model8_pretrain.py][INFO] Epoch:[0/2](159100/4588595) loss:2.873 lr:0.0000100 epoch_Time:27957.0min: [2024-01-03 10:23:59,287][model8_pretrain.py][INFO] Epoch:[0/2](159100/4588595) loss:2.723 lr:0.0000100 epoch_Time:27957.0min: [2024-01-03 10:23:59,287][model8_pretrain.py][INFO] Epoch:[0/2](159100/4588595) loss:3.197 lr:0.0000100 epoch_Time:27957.0min: [2024-01-03 10:23:59,287][model8_pretrain.py][INFO] Epoch:[0/2](159100/4588595) loss:3.217 lr:0.0000100 epoch_Time:27957.0min: [2024-01-03 10:23:59,287][model8_pretrain.py][INFO] Epoch:[0/2](159100/4588595) loss:3.032 lr:0.0000100 epoch_Time:27957.0min: [2024-01-03 10:23:59,287][model8_pretrain.py][INFO] Epoch:[0/2](159100/4588595) loss:2.927 lr:0.0000100 epoch_Time:27957.0min: [2024-01-03 10:23:59,287][model8_pretrain.py][INFO] Epoch:[0/2](159100/4588595) loss:3.400 lr:0.0000100 epoch_Time:27957.0min: [2024-01-03 10:23:59,287][model8_pretrain.py][INFO] Epoch:[0/2](159100/4588595) loss:3.292 lr:0.0000100 epoch_Time:27957.0min: [2024-01-03 10:24:36,222][model8_pretrain.py][INFO] Epoch:[0/2](159200/4588595) loss:2.807 lr:0.0000100 epoch_Time:27957.0min: [2024-01-03 
10:24:36,222][model8_pretrain.py][INFO] Epoch:[0/2](159200/4588595) loss:3.328 lr:0.0000100 epoch_Time:27957.0min: [2024-01-03 10:24:36,222][model8_pretrain.py][INFO] Epoch:[0/2](159200/4588595) loss:3.176 lr:0.0000100 epoch_Time:27957.0min: [2024-01-03 10:24:36,222][model8_pretrain.py][INFO] Epoch:[0/2](159200/4588595) loss:3.196 lr:0.0000100 epoch_Time:27957.0min: [2024-01-03 10:24:36,222][model8_pretrain.py][INFO] Epoch:[0/2](159200/4588595) loss:2.993 lr:0.0000100 epoch_Time:27957.0min: [2024-01-03 10:24:36,223][model8_pretrain.py][INFO] Epoch:[0/2](159200/4588595) loss:3.116 lr:0.0000100 epoch_Time:27957.0min: [2024-01-03 10:24:36,223][model8_pretrain.py][INFO] Epoch:[0/2](159200/4588595) loss:2.760 lr:0.0000100 epoch_Time:27957.0min: [2024-01-03 10:24:36,223][model8_pretrain.py][INFO] Epoch:[0/2](159200/4588595) loss:3.031 lr:0.0000100 epoch_Time:27957.0min: [2024-01-03 10:25:13,167][model8_pretrain.py][INFO] Epoch:[0/2](159300/4588595) loss:3.209 lr:0.0000100 epoch_Time:27955.0min: [2024-01-03 10:25:13,167][model8_pretrain.py][INFO] Epoch:[0/2](159300/4588595) loss:2.761 lr:0.0000100 epoch_Time:27955.0min: [2024-01-03 10:25:13,167][model8_pretrain.py][INFO] Epoch:[0/2](159300/4588595) loss:3.076 lr:0.0000100 epoch_Time:27955.0min: [2024-01-03 10:25:13,167][model8_pretrain.py][INFO] Epoch:[0/2](159300/4588595) loss:3.238 lr:0.0000100 epoch_Time:27955.0min: [2024-01-03 10:25:13,167][model8_pretrain.py][INFO] Epoch:[0/2](159300/4588595) loss:2.600 lr:0.0000100 epoch_Time:27955.0min: [2024-01-03 10:25:13,167][model8_pretrain.py][INFO] Epoch:[0/2](159300/4588595) loss:2.751 lr:0.0000100 epoch_Time:27955.0min: [2024-01-03 10:25:13,167][model8_pretrain.py][INFO] Epoch:[0/2](159300/4588595) loss:3.199 lr:0.0000100 epoch_Time:27955.0min: [2024-01-03 10:25:13,167][model8_pretrain.py][INFO] Epoch:[0/2](159300/4588595) loss:2.834 lr:0.0000100 epoch_Time:27955.0min: [2024-01-03 10:25:50,109][model8_pretrain.py][INFO] Epoch:[0/2](159400/4588595) loss:2.875 lr:0.0000100 
epoch_Time:27954.0min: [2024-01-03 10:25:50,109][model8_pretrain.py][INFO] Epoch:[0/2](159400/4588595) loss:2.455 lr:0.0000100 epoch_Time:27954.0min: [2024-01-03 10:25:50,109][model8_pretrain.py][INFO] Epoch:[0/2](159400/4588595) loss:3.453 lr:0.0000100 epoch_Time:27954.0min: [2024-01-03 10:25:50,109][model8_pretrain.py][INFO] Epoch:[0/2](159400/4588595) loss:2.812 lr:0.0000100 epoch_Time:27954.0min: [2024-01-03 10:25:50,109][model8_pretrain.py][INFO] Epoch:[0/2](159400/4588595) loss:2.826 lr:0.0000100 epoch_Time:27954.0min: [2024-01-03 10:25:50,109][model8_pretrain.py][INFO] Epoch:[0/2](159400/4588595) loss:2.799 lr:0.0000100 epoch_Time:27954.0min: [2024-01-03 10:25:50,109][model8_pretrain.py][INFO] Epoch:[0/2](159400/4588595) loss:2.003 lr:0.0000100 epoch_Time:27954.0min: [2024-01-03 10:25:50,109][model8_pretrain.py][INFO] Epoch:[0/2](159400/4588595) loss:3.147 lr:0.0000100 epoch_Time:27954.0min: [2024-01-03 10:26:27,067][model8_pretrain.py][INFO] Epoch:[0/2](159500/4588595) loss:3.158 lr:0.0000100 epoch_Time:27953.0min: [2024-01-03 10:26:27,067][model8_pretrain.py][INFO] Epoch:[0/2](159500/4588595) loss:2.768 lr:0.0000100 epoch_Time:27953.0min: [2024-01-03 10:26:27,067][model8_pretrain.py][INFO] Epoch:[0/2](159500/4588595) loss:3.109 lr:0.0000100 epoch_Time:27953.0min: [2024-01-03 10:26:27,067][model8_pretrain.py][INFO] Epoch:[0/2](159500/4588595) loss:2.826 lr:0.0000100 epoch_Time:27953.0min: [2024-01-03 10:26:27,067][model8_pretrain.py][INFO] Epoch:[0/2](159500/4588595) loss:2.861 lr:0.0000100 epoch_Time:27953.0min: [2024-01-03 10:26:27,067][model8_pretrain.py][INFO] Epoch:[0/2](159500/4588595) loss:3.178 lr:0.0000100 epoch_Time:27953.0min: [2024-01-03 10:26:27,067][model8_pretrain.py][INFO] Epoch:[0/2](159500/4588595) loss:3.686 lr:0.0000100 epoch_Time:27953.0min: [2024-01-03 10:26:27,067][model8_pretrain.py][INFO] Epoch:[0/2](159500/4588595) loss:3.178 lr:0.0000100 epoch_Time:27953.0min: [2024-01-03 10:27:12,569][model8_pretrain.py][INFO] 
Epoch:[0/2](159600/4588595) loss:2.773 lr:0.0000100 epoch_Time:27956.0min: [2024-01-03 10:27:12,570][model8_pretrain.py][INFO] Epoch:[0/2](159600/4588595) loss:3.073 lr:0.0000100 epoch_Time:27956.0min: [2024-01-03 10:27:12,569][model8_pretrain.py][INFO] Epoch:[0/2](159600/4588595) loss:2.520 lr:0.0000100 epoch_Time:27956.0min: [2024-01-03 10:27:12,569][model8_pretrain.py][INFO] Epoch:[0/2](159600/4588595) loss:2.941 lr:0.0000100 epoch_Time:27956.0min: [2024-01-03 10:27:12,570][model8_pretrain.py][INFO] Epoch:[0/2](159600/4588595) loss:3.431 lr:0.0000100 epoch_Time:27956.0min: [2024-01-03 10:27:12,570][model8_pretrain.py][INFO] Epoch:[0/2](159600/4588595) loss:2.239 lr:0.0000100 epoch_Time:27956.0min: [2024-01-03 10:27:12,570][model8_pretrain.py][INFO] Epoch:[0/2](159600/4588595) loss:3.261 lr:0.0000100 epoch_Time:27956.0min: [2024-01-03 10:27:12,570][model8_pretrain.py][INFO] Epoch:[0/2](159600/4588595) loss:2.742 lr:0.0000100 epoch_Time:27956.0min: [2024-01-03 10:27:49,521][model8_pretrain.py][INFO] Epoch:[0/2](159700/4588595) loss:2.568 lr:0.0000100 epoch_Time:27955.0min: [2024-01-03 10:27:49,521][model8_pretrain.py][INFO] Epoch:[0/2](159700/4588595) loss:2.874 lr:0.0000100 epoch_Time:27955.0min: [2024-01-03 10:27:49,521][model8_pretrain.py][INFO] Epoch:[0/2](159700/4588595) loss:2.415 lr:0.0000100 epoch_Time:27955.0min: [2024-01-03 10:27:49,521][model8_pretrain.py][INFO] Epoch:[0/2](159700/4588595) loss:2.887 lr:0.0000100 epoch_Time:27955.0min: [2024-01-03 10:27:49,521][model8_pretrain.py][INFO] Epoch:[0/2](159700/4588595) loss:2.841 lr:0.0000100 epoch_Time:27955.0min: [2024-01-03 10:27:49,521][model8_pretrain.py][INFO] Epoch:[0/2](159700/4588595) loss:3.068 lr:0.0000100 epoch_Time:27955.0min: [2024-01-03 10:27:49,521][model8_pretrain.py][INFO] Epoch:[0/2](159700/4588595) loss:3.144 lr:0.0000100 epoch_Time:27955.0min: [2024-01-03 10:27:49,521][model8_pretrain.py][INFO] Epoch:[0/2](159700/4588595) loss:3.425 lr:0.0000100 epoch_Time:27955.0min: [2024-01-03 
10:28:26,456][model8_pretrain.py][INFO] Epoch:[0/2](159800/4588595) loss:1.992 lr:0.0000100 epoch_Time:27954.0min: [2024-01-03 10:28:26,456][model8_pretrain.py][INFO] Epoch:[0/2](159800/4588595) loss:2.051 lr:0.0000100 epoch_Time:27954.0min: [2024-01-03 10:28:26,456][model8_pretrain.py][INFO] Epoch:[0/2](159800/4588595) loss:2.670 lr:0.0000100 epoch_Time:27954.0min: [2024-01-03 10:28:26,457][model8_pretrain.py][INFO] Epoch:[0/2](159800/4588595) loss:2.774 lr:0.0000100 epoch_Time:27954.0min: [2024-01-03 10:28:26,456][model8_pretrain.py][INFO] Epoch:[0/2](159800/4588595) loss:3.309 lr:0.0000100 epoch_Time:27954.0min: [2024-01-03 10:28:26,456][model8_pretrain.py][INFO] Epoch:[0/2](159800/4588595) loss:2.963 lr:0.0000100 epoch_Time:27954.0min: [2024-01-03 10:28:26,457][model8_pretrain.py][INFO] Epoch:[0/2](159800/4588595) loss:3.195 lr:0.0000100 epoch_Time:27954.0min: [2024-01-03 10:28:26,457][model8_pretrain.py][INFO] Epoch:[0/2](159800/4588595) loss:2.644 lr:0.0000100 epoch_Time:27954.0min: [2024-01-03 10:29:03,390][model8_pretrain.py][INFO] Epoch:[0/2](159900/4588595) loss:2.488 lr:0.0000100 epoch_Time:27953.0min: [2024-01-03 10:29:03,390][model8_pretrain.py][INFO] Epoch:[0/2](159900/4588595) loss:3.077 lr:0.0000100 epoch_Time:27953.0min: [2024-01-03 10:29:03,390][model8_pretrain.py][INFO] Epoch:[0/2](159900/4588595) loss:2.798 lr:0.0000100 epoch_Time:27953.0min: [2024-01-03 10:29:03,390][model8_pretrain.py][INFO] Epoch:[0/2](159900/4588595) loss:2.902 lr:0.0000100 epoch_Time:27953.0min: [2024-01-03 10:29:03,390][model8_pretrain.py][INFO] Epoch:[0/2](159900/4588595) loss:2.569 lr:0.0000100 epoch_Time:27953.0min: [2024-01-03 10:29:03,390][model8_pretrain.py][INFO] Epoch:[0/2](159900/4588595) loss:3.376 lr:0.0000100 epoch_Time:27953.0min: [2024-01-03 10:29:03,390][model8_pretrain.py][INFO] Epoch:[0/2](159900/4588595) loss:2.511 lr:0.0000100 epoch_Time:27953.0min: [2024-01-03 10:29:03,390][model8_pretrain.py][INFO] Epoch:[0/2](159900/4588595) loss:2.856 lr:0.0000100 
epoch_Time:27953.0min: [2024-01-03 10:29:40,322][model8_pretrain.py][INFO] Epoch:[0/2](160000/4588595) loss:2.559 lr:0.0000100 epoch_Time:27952.0min: [2024-01-03 10:29:40,322][model8_pretrain.py][INFO] Epoch:[0/2](160000/4588595) loss:3.206 lr:0.0000100 epoch_Time:27952.0min: [2024-01-03 10:29:40,322][model8_pretrain.py][INFO] Epoch:[0/2](160000/4588595) loss:2.837 lr:0.0000100 epoch_Time:27952.0min: [2024-01-03 10:29:40,322][model8_pretrain.py][INFO] Epoch:[0/2](160000/4588595) loss:3.423 lr:0.0000100 epoch_Time:27952.0min: [2024-01-03 10:29:40,322][model8_pretrain.py][INFO] Epoch:[0/2](160000/4588595) loss:3.117 lr:0.0000100 epoch_Time:27952.0min: [2024-01-03 10:29:40,322][model8_pretrain.py][INFO] Epoch:[0/2](160000/4588595) loss:3.276 lr:0.0000100 epoch_Time:27952.0min: [2024-01-03 10:29:40,322][model8_pretrain.py][INFO] Epoch:[0/2](160000/4588595) loss:2.599 lr:0.0000100 epoch_Time:27952.0min: [2024-01-03 10:29:40,323][model8_pretrain.py][INFO] Epoch:[0/2](160000/4588595) loss:2.730 lr:0.0000100 epoch_Time:27952.0min: [2024-01-03 10:30:17,276][model8_pretrain.py][INFO] Epoch:[0/2](160100/4588595) loss:3.013 lr:0.0000100 epoch_Time:27951.0min: [2024-01-03 10:30:17,276][model8_pretrain.py][INFO] Epoch:[0/2](160100/4588595) loss:3.228 lr:0.0000100 epoch_Time:27951.0min: [2024-01-03 10:30:17,276][model8_pretrain.py][INFO] Epoch:[0/2](160100/4588595) loss:2.975 lr:0.0000100 epoch_Time:27951.0min: [2024-01-03 10:30:17,276][model8_pretrain.py][INFO] Epoch:[0/2](160100/4588595) loss:2.864 lr:0.0000100 epoch_Time:27951.0min: [2024-01-03 10:30:17,276][model8_pretrain.py][INFO] Epoch:[0/2](160100/4588595) loss:2.888 lr:0.0000100 epoch_Time:27951.0min: [2024-01-03 10:30:17,276][model8_pretrain.py][INFO] Epoch:[0/2](160100/4588595) loss:3.047 lr:0.0000100 epoch_Time:27951.0min: [2024-01-03 10:30:17,276][model8_pretrain.py][INFO] Epoch:[0/2](160100/4588595) loss:2.988 lr:0.0000100 epoch_Time:27951.0min: [2024-01-03 10:30:17,276][model8_pretrain.py][INFO] 
Epoch:[0/2](160100/4588595) loss:2.775 lr:0.0000100 epoch_Time:27951.0min: [2024-01-03 10:30:54,216][model8_pretrain.py][INFO] Epoch:[0/2](160200/4588595) loss:2.857 lr:0.0000100 epoch_Time:27949.0min: [2024-01-03 10:30:54,216][model8_pretrain.py][INFO] Epoch:[0/2](160200/4588595) loss:2.830 lr:0.0000100 epoch_Time:27949.0min: [2024-01-03 10:30:54,216][model8_pretrain.py][INFO] Epoch:[0/2](160200/4588595) loss:2.547 lr:0.0000100 epoch_Time:27949.0min: [2024-01-03 10:30:54,216][model8_pretrain.py][INFO] Epoch:[0/2](160200/4588595) loss:3.102 lr:0.0000100 epoch_Time:27949.0min: [2024-01-03 10:30:54,216][model8_pretrain.py][INFO] Epoch:[0/2](160200/4588595) loss:2.943 lr:0.0000100 epoch_Time:27949.0min: [2024-01-03 10:30:54,216][model8_pretrain.py][INFO] Epoch:[0/2](160200/4588595) loss:3.269 lr:0.0000100 epoch_Time:27949.0min: [2024-01-03 10:30:54,216][model8_pretrain.py][INFO] Epoch:[0/2](160200/4588595) loss:2.990 lr:0.0000100 epoch_Time:27949.0min: [2024-01-03 10:30:54,216][model8_pretrain.py][INFO] Epoch:[0/2](160200/4588595) loss:2.966 lr:0.0000100 epoch_Time:27949.0min: [2024-01-03 10:31:31,159][model8_pretrain.py][INFO] Epoch:[0/2](160300/4588595) loss:3.356 lr:0.0000100 epoch_Time:27949.0min: [2024-01-03 10:31:31,159][model8_pretrain.py][INFO] Epoch:[0/2](160300/4588595) loss:2.877 lr:0.0000100 epoch_Time:27949.0min: [2024-01-03 10:31:31,159][model8_pretrain.py][INFO] Epoch:[0/2](160300/4588595) loss:2.619 lr:0.0000100 epoch_Time:27949.0min: [2024-01-03 10:31:31,159][model8_pretrain.py][INFO] Epoch:[0/2](160300/4588595) loss:3.164 lr:0.0000100 epoch_Time:27949.0min: [2024-01-03 10:31:31,159][model8_pretrain.py][INFO] Epoch:[0/2](160300/4588595) loss:2.357 lr:0.0000100 epoch_Time:27949.0min: [2024-01-03 10:31:31,159][model8_pretrain.py][INFO] Epoch:[0/2](160300/4588595) loss:2.833 lr:0.0000100 epoch_Time:27949.0min: [2024-01-03 10:31:31,159][model8_pretrain.py][INFO] Epoch:[0/2](160300/4588595) loss:2.550 lr:0.0000100 epoch_Time:27949.0min: [2024-01-03 
10:31:31,159][model8_pretrain.py][INFO] Epoch:[0/2](160300/4588595) loss:2.525 lr:0.0000100 epoch_Time:27949.0min: [2024-01-03 10:32:16,549][model8_pretrain.py][INFO] Epoch:[0/2](160400/4588595) loss:2.396 lr:0.0000100 epoch_Time:27952.0min: [2024-01-03 10:32:16,549][model8_pretrain.py][INFO] Epoch:[0/2](160400/4588595) loss:3.380 lr:0.0000100 epoch_Time:27952.0min: [2024-01-03 10:32:16,549][model8_pretrain.py][INFO] Epoch:[0/2](160400/4588595) loss:3.171 lr:0.0000100 epoch_Time:27952.0min: [2024-01-03 10:32:16,549][model8_pretrain.py][INFO] Epoch:[0/2](160400/4588595) loss:3.255 lr:0.0000100 epoch_Time:27952.0min: [2024-01-03 10:32:16,549][model8_pretrain.py][INFO] Epoch:[0/2](160400/4588595) loss:2.487 lr:0.0000100 epoch_Time:27952.0min: [2024-01-03 10:32:16,549][model8_pretrain.py][INFO] Epoch:[0/2](160400/4588595) loss:3.107 lr:0.0000100 epoch_Time:27952.0min: [2024-01-03 10:32:16,550][model8_pretrain.py][INFO] Epoch:[0/2](160400/4588595) loss:3.096 lr:0.0000100 epoch_Time:27952.0min: [2024-01-03 10:32:16,550][model8_pretrain.py][INFO] Epoch:[0/2](160400/4588595) loss:2.914 lr:0.0000100 epoch_Time:27952.0min: [2024-01-03 10:32:53,478][model8_pretrain.py][INFO] Epoch:[0/2](160500/4588595) loss:2.774 lr:0.0000100 epoch_Time:27950.0min: [2024-01-03 10:32:53,478][model8_pretrain.py][INFO] Epoch:[0/2](160500/4588595) loss:3.332 lr:0.0000100 epoch_Time:27950.0min: [2024-01-03 10:32:53,478][model8_pretrain.py][INFO] Epoch:[0/2](160500/4588595) loss:3.046 lr:0.0000100 epoch_Time:27950.0min: [2024-01-03 10:32:53,478][model8_pretrain.py][INFO] Epoch:[0/2](160500/4588595) loss:2.863 lr:0.0000100 epoch_Time:27950.0min: [2024-01-03 10:32:53,479][model8_pretrain.py][INFO] Epoch:[0/2](160500/4588595) loss:2.496 lr:0.0000100 epoch_Time:27950.0min: [2024-01-03 10:32:53,478][model8_pretrain.py][INFO] Epoch:[0/2](160500/4588595) loss:3.078 lr:0.0000100 epoch_Time:27950.0min: [2024-01-03 10:32:53,479][model8_pretrain.py][INFO] Epoch:[0/2](160500/4588595) loss:2.880 lr:0.0000100 
epoch_Time:27950.0min: [2024-01-03 10:32:53,478][model8_pretrain.py][INFO] Epoch:[0/2](160500/4588595) loss:3.415 lr:0.0000100 epoch_Time:27950.0min: [2024-01-03 10:33:30,419][model8_pretrain.py][INFO] Epoch:[0/2](160600/4588595) loss:3.123 lr:0.0000100 epoch_Time:27950.0min: [2024-01-03 10:33:30,419][model8_pretrain.py][INFO] Epoch:[0/2](160600/4588595) loss:2.890 lr:0.0000100 epoch_Time:27950.0min: [2024-01-03 10:33:30,419][model8_pretrain.py][INFO] Epoch:[0/2](160600/4588595) loss:3.155 lr:0.0000100 epoch_Time:27950.0min: [2024-01-03 10:33:30,419][model8_pretrain.py][INFO] Epoch:[0/2](160600/4588595) loss:2.489 lr:0.0000100 epoch_Time:27950.0min: [2024-01-03 10:33:30,419][model8_pretrain.py][INFO] Epoch:[0/2](160600/4588595) loss:3.022 lr:0.0000100 epoch_Time:27950.0min: [2024-01-03 10:33:30,419][model8_pretrain.py][INFO] Epoch:[0/2](160600/4588595) loss:2.678 lr:0.0000100 epoch_Time:27950.0min: [2024-01-03 10:33:30,419][model8_pretrain.py][INFO] Epoch:[0/2](160600/4588595) loss:3.145 lr:0.0000100 epoch_Time:27950.0min: [2024-01-03 10:33:30,420][model8_pretrain.py][INFO] Epoch:[0/2](160600/4588595) loss:3.415 lr:0.0000100 epoch_Time:27950.0min: [2024-01-03 10:34:07,394][model8_pretrain.py][INFO] Epoch:[0/2](160700/4588595) loss:2.939 lr:0.0000100 epoch_Time:27948.0min: [2024-01-03 10:34:07,394][model8_pretrain.py][INFO] Epoch:[0/2](160700/4588595) loss:3.303 lr:0.0000100 epoch_Time:27948.0min: [2024-01-03 10:34:07,394][model8_pretrain.py][INFO] Epoch:[0/2](160700/4588595) loss:3.021 lr:0.0000100 epoch_Time:27948.0min: [2024-01-03 10:34:07,394][model8_pretrain.py][INFO] Epoch:[0/2](160700/4588595) loss:2.574 lr:0.0000100 epoch_Time:27948.0min: [2024-01-03 10:34:07,394][model8_pretrain.py][INFO] Epoch:[0/2](160700/4588595) loss:3.139 lr:0.0000100 epoch_Time:27948.0min: [2024-01-03 10:34:07,394][model8_pretrain.py][INFO] Epoch:[0/2](160700/4588595) loss:3.186 lr:0.0000100 epoch_Time:27948.0min: [2024-01-03 10:34:07,394][model8_pretrain.py][INFO] 
Epoch:[0/2](160700/4588595) loss:2.849 lr:0.0000100 epoch_Time:27948.0min: [2024-01-03 10:34:07,394][model8_pretrain.py][INFO] Epoch:[0/2](160700/4588595) loss:3.223 lr:0.0000100 epoch_Time:27948.0min: [2024-01-03 10:34:44,363][model8_pretrain.py][INFO] Epoch:[0/2](160800/4588595) loss:3.557 lr:0.0000100 epoch_Time:27948.0min: [2024-01-03 10:34:44,363][model8_pretrain.py][INFO] Epoch:[0/2](160800/4588595) loss:2.612 lr:0.0000100 epoch_Time:27948.0min: [2024-01-03 10:34:44,363][model8_pretrain.py][INFO] Epoch:[0/2](160800/4588595) loss:2.836 lr:0.0000100 epoch_Time:27948.0min: [2024-01-03 10:34:44,363][model8_pretrain.py][INFO] Epoch:[0/2](160800/4588595) loss:2.898 lr:0.0000100 epoch_Time:27948.0min: [2024-01-03 10:34:44,363][model8_pretrain.py][INFO] Epoch:[0/2](160800/4588595) loss:3.123 lr:0.0000100 epoch_Time:27948.0min: [2024-01-03 10:34:44,363][model8_pretrain.py][INFO] Epoch:[0/2](160800/4588595) loss:2.719 lr:0.0000100 epoch_Time:27948.0min: [2024-01-03 10:34:44,363][model8_pretrain.py][INFO] Epoch:[0/2](160800/4588595) loss:3.355 lr:0.0000100 epoch_Time:27948.0min: [2024-01-03 10:34:44,363][model8_pretrain.py][INFO] Epoch:[0/2](160800/4588595) loss:2.808 lr:0.0000100 epoch_Time:27948.0min: [2024-01-03 10:35:21,364][model8_pretrain.py][INFO] Epoch:[0/2](160900/4588595) loss:2.726 lr:0.0000100 epoch_Time:27946.0min: [2024-01-03 10:35:21,364][model8_pretrain.py][INFO] Epoch:[0/2](160900/4588595) loss:2.377 lr:0.0000100 epoch_Time:27946.0min: [2024-01-03 10:35:21,364][model8_pretrain.py][INFO] Epoch:[0/2](160900/4588595) loss:3.285 lr:0.0000100 epoch_Time:27946.0min: [2024-01-03 10:35:21,364][model8_pretrain.py][INFO] Epoch:[0/2](160900/4588595) loss:2.686 lr:0.0000100 epoch_Time:27946.0min: [2024-01-03 10:35:21,364][model8_pretrain.py][INFO] Epoch:[0/2](160900/4588595) loss:2.423 lr:0.0000100 epoch_Time:27946.0min: [2024-01-03 10:35:21,364][model8_pretrain.py][INFO] Epoch:[0/2](160900/4588595) loss:3.170 lr:0.0000100 epoch_Time:27946.0min: [2024-01-03 
10:35:21,364][model8_pretrain.py][INFO] Epoch:[0/2](160900/4588595) loss:3.339 lr:0.0000100 epoch_Time:27946.0min: [2024-01-03 10:35:21,364][model8_pretrain.py][INFO] Epoch:[0/2](160900/4588595) loss:2.702 lr:0.0000100 epoch_Time:27946.0min: [2024-01-03 10:35:58,331][model8_pretrain.py][INFO] Epoch:[0/2](161000/4588595) loss:2.881 lr:0.0000100 epoch_Time:27945.0min: [2024-01-03 10:35:58,331][model8_pretrain.py][INFO] Epoch:[0/2](161000/4588595) loss:2.702 lr:0.0000100 epoch_Time:27945.0min: [2024-01-03 10:35:58,331][model8_pretrain.py][INFO] Epoch:[0/2](161000/4588595) loss:2.881 lr:0.0000100 epoch_Time:27945.0min: [2024-01-03 10:35:58,331][model8_pretrain.py][INFO] Epoch:[0/2](161000/4588595) loss:2.747 lr:0.0000100 epoch_Time:27945.0min: [2024-01-03 10:35:58,331][model8_pretrain.py][INFO] Epoch:[0/2](161000/4588595) loss:2.998 lr:0.0000100 epoch_Time:27945.0min: [2024-01-03 10:35:58,331][model8_pretrain.py][INFO] Epoch:[0/2](161000/4588595) loss:2.821 lr:0.0000100 epoch_Time:27945.0min: [2024-01-03 10:35:58,331][model8_pretrain.py][INFO] Epoch:[0/2](161000/4588595) loss:2.966 lr:0.0000100 epoch_Time:27945.0min: [2024-01-03 10:35:58,331][model8_pretrain.py][INFO] Epoch:[0/2](161000/4588595) loss:3.008 lr:0.0000100 epoch_Time:27945.0min: [2024-01-03 10:36:35,285][model8_pretrain.py][INFO] Epoch:[0/2](161100/4588595) loss:3.010 lr:0.0000100 epoch_Time:27945.0min: [2024-01-03 10:36:35,285][model8_pretrain.py][INFO] Epoch:[0/2](161100/4588595) loss:3.326 lr:0.0000100 epoch_Time:27945.0min: [2024-01-03 10:36:35,285][model8_pretrain.py][INFO] Epoch:[0/2](161100/4588595) loss:2.926 lr:0.0000100 epoch_Time:27945.0min: [2024-01-03 10:36:35,285][model8_pretrain.py][INFO] Epoch:[0/2](161100/4588595) loss:2.741 lr:0.0000100 epoch_Time:27945.0min: [2024-01-03 10:36:35,285][model8_pretrain.py][INFO] Epoch:[0/2](161100/4588595) loss:3.281 lr:0.0000100 epoch_Time:27945.0min: [2024-01-03 10:36:35,285][model8_pretrain.py][INFO] Epoch:[0/2](161100/4588595) loss:2.817 lr:0.0000100 
epoch_Time:27945.0min: [2024-01-03 10:36:35,285][model8_pretrain.py][INFO] Epoch:[0/2](161100/4588595) loss:3.510 lr:0.0000100 epoch_Time:27945.0min: [2024-01-03 10:36:35,285][model8_pretrain.py][INFO] Epoch:[0/2](161100/4588595) loss:2.627 lr:0.0000100 epoch_Time:27945.0min: [2024-01-03 10:37:20,838][model8_pretrain.py][INFO] Epoch:[0/2](161200/4588595) loss:3.135 lr:0.0000100 epoch_Time:27947.0min: [2024-01-03 10:37:20,838][model8_pretrain.py][INFO] Epoch:[0/2](161200/4588595) loss:3.128 lr:0.0000100 epoch_Time:27947.0min: [2024-01-03 10:37:20,838][model8_pretrain.py][INFO] Epoch:[0/2](161200/4588595) loss:3.183 lr:0.0000100 epoch_Time:27947.0min: [2024-01-03 10:37:20,838][model8_pretrain.py][INFO] Epoch:[0/2](161200/4588595) loss:2.303 lr:0.0000100 epoch_Time:27947.0min: [2024-01-03 10:37:20,838][model8_pretrain.py][INFO] Epoch:[0/2](161200/4588595) loss:2.499 lr:0.0000100 epoch_Time:27947.0min: [2024-01-03 10:37:20,838][model8_pretrain.py][INFO] Epoch:[0/2](161200/4588595) loss:3.117 lr:0.0000100 epoch_Time:27947.0min: [2024-01-03 10:37:20,838][model8_pretrain.py][INFO] Epoch:[0/2](161200/4588595) loss:3.026 lr:0.0000100 epoch_Time:27947.0min: [2024-01-03 10:37:20,838][model8_pretrain.py][INFO] Epoch:[0/2](161200/4588595) loss:2.992 lr:0.0000100 epoch_Time:27947.0min: [2024-01-03 10:37:57,766][model8_pretrain.py][INFO] Epoch:[0/2](161300/4588595) loss:3.490 lr:0.0000100 epoch_Time:27946.0min: [2024-01-03 10:37:57,766][model8_pretrain.py][INFO] Epoch:[0/2](161300/4588595) loss:2.951 lr:0.0000100 epoch_Time:27946.0min: [2024-01-03 10:37:57,766][model8_pretrain.py][INFO] Epoch:[0/2](161300/4588595) loss:2.954 lr:0.0000100 epoch_Time:27946.0min: [2024-01-03 10:37:57,766][model8_pretrain.py][INFO] Epoch:[0/2](161300/4588595) loss:3.223 lr:0.0000100 epoch_Time:27946.0min: [2024-01-03 10:37:57,766][model8_pretrain.py][INFO] Epoch:[0/2](161300/4588595) loss:3.011 lr:0.0000100 epoch_Time:27946.0min: [2024-01-03 10:37:57,766][model8_pretrain.py][INFO] 
Epoch:[0/2](161300/4588595) loss:2.810 lr:0.0000100 epoch_Time:27946.0min: [2024-01-03 10:37:57,766][model8_pretrain.py][INFO] Epoch:[0/2](161300/4588595) loss:3.257 lr:0.0000100 epoch_Time:27946.0min: [2024-01-03 10:37:57,766][model8_pretrain.py][INFO] Epoch:[0/2](161300/4588595) loss:2.812 lr:0.0000100 epoch_Time:27946.0min: [2024-01-03 10:38:34,707][model8_pretrain.py][INFO] Epoch:[0/2](161400/4588595) loss:3.366 lr:0.0000100 epoch_Time:27945.0min: [2024-01-03 10:38:34,707][model8_pretrain.py][INFO] Epoch:[0/2](161400/4588595) loss:2.714 lr:0.0000100 epoch_Time:27945.0min: [2024-01-03 10:38:34,707][model8_pretrain.py][INFO] Epoch:[0/2](161400/4588595) loss:3.055 lr:0.0000100 epoch_Time:27945.0min: [2024-01-03 10:38:34,707][model8_pretrain.py][INFO] Epoch:[0/2](161400/4588595) loss:3.452 lr:0.0000100 epoch_Time:27945.0min: [2024-01-03 10:38:34,707][model8_pretrain.py][INFO] Epoch:[0/2](161400/4588595) loss:3.451 lr:0.0000100 epoch_Time:27945.0min: [2024-01-03 10:38:34,707][model8_pretrain.py][INFO] Epoch:[0/2](161400/4588595) loss:3.244 lr:0.0000100 epoch_Time:27945.0min: [2024-01-03 10:38:34,707][model8_pretrain.py][INFO] Epoch:[0/2](161400/4588595) loss:3.113 lr:0.0000100 epoch_Time:27945.0min: [2024-01-03 10:38:34,707][model8_pretrain.py][INFO] Epoch:[0/2](161400/4588595) loss:3.150 lr:0.0000100 epoch_Time:27945.0min: [2024-01-03 10:39:11,647][model8_pretrain.py][INFO] Epoch:[0/2](161500/4588595) loss:3.062 lr:0.0000100 epoch_Time:27944.0min: [2024-01-03 10:39:11,647][model8_pretrain.py][INFO] Epoch:[0/2](161500/4588595) loss:3.119 lr:0.0000100 epoch_Time:27944.0min: [2024-01-03 10:39:11,647][model8_pretrain.py][INFO] Epoch:[0/2](161500/4588595) loss:3.132 lr:0.0000100 epoch_Time:27944.0min: [2024-01-03 10:39:11,647][model8_pretrain.py][INFO] Epoch:[0/2](161500/4588595) loss:2.863 lr:0.0000100 epoch_Time:27944.0min: [2024-01-03 10:39:11,647][model8_pretrain.py][INFO] Epoch:[0/2](161500/4588595) loss:2.846 lr:0.0000100 epoch_Time:27944.0min: [2024-01-03 
10:39:11,648][model8_pretrain.py][INFO] Epoch:[0/2](161500/4588595) loss:3.149 lr:0.0000100 epoch_Time:27944.0min: [2024-01-03 10:39:11,648][model8_pretrain.py][INFO] Epoch:[0/2](161500/4588595) loss:2.644 lr:0.0000100 epoch_Time:27944.0min: [2024-01-03 10:39:11,648][model8_pretrain.py][INFO] Epoch:[0/2](161500/4588595) loss:3.243 lr:0.0000100 epoch_Time:27944.0min: [2024-01-03 10:39:48,602][model8_pretrain.py][INFO] Epoch:[0/2](161600/4588595) loss:2.949 lr:0.0000100 epoch_Time:27942.0min: [2024-01-03 10:39:48,602][model8_pretrain.py][INFO] Epoch:[0/2](161600/4588595) loss:2.531 lr:0.0000100 epoch_Time:27942.0min: [2024-01-03 10:39:48,602][model8_pretrain.py][INFO] Epoch:[0/2](161600/4588595) loss:3.616 lr:0.0000100 epoch_Time:27942.0min: [2024-01-03 10:39:48,602][model8_pretrain.py][INFO] Epoch:[0/2](161600/4588595) loss:3.016 lr:0.0000100 epoch_Time:27942.0min: [2024-01-03 10:39:48,602][model8_pretrain.py][INFO] Epoch:[0/2](161600/4588595) loss:2.834 lr:0.0000100 epoch_Time:27942.0min: [2024-01-03 10:39:48,602][model8_pretrain.py][INFO] Epoch:[0/2](161600/4588595) loss:2.614 lr:0.0000100 epoch_Time:27942.0min: [2024-01-03 10:39:48,602][model8_pretrain.py][INFO] Epoch:[0/2](161600/4588595) loss:3.166 lr:0.0000100 epoch_Time:27942.0min: [2024-01-03 10:39:48,602][model8_pretrain.py][INFO] Epoch:[0/2](161600/4588595) loss:2.646 lr:0.0000100 epoch_Time:27942.0min: [2024-01-03 10:40:25,546][model8_pretrain.py][INFO] Epoch:[0/2](161700/4588595) loss:2.126 lr:0.0000100 epoch_Time:27942.0min: [2024-01-03 10:40:25,546][model8_pretrain.py][INFO] Epoch:[0/2](161700/4588595) loss:2.295 lr:0.0000100 epoch_Time:27942.0min: [2024-01-03 10:40:25,546][model8_pretrain.py][INFO] Epoch:[0/2](161700/4588595) loss:2.983 lr:0.0000100 epoch_Time:27942.0min: [2024-01-03 10:40:25,546][model8_pretrain.py][INFO] Epoch:[0/2](161700/4588595) loss:2.911 lr:0.0000100 epoch_Time:27942.0min: [2024-01-03 10:40:25,546][model8_pretrain.py][INFO] Epoch:[0/2](161700/4588595) loss:2.653 lr:0.0000100 
epoch_Time:27942.0min: [2024-01-03 10:40:25,546][model8_pretrain.py][INFO] Epoch:[0/2](161700/4588595) loss:2.795 lr:0.0000100 epoch_Time:27942.0min: [2024-01-03 10:40:25,546][model8_pretrain.py][INFO] Epoch:[0/2](161700/4588595) loss:2.985 lr:0.0000100 epoch_Time:27942.0min: [2024-01-03 10:40:25,546][model8_pretrain.py][INFO] Epoch:[0/2](161700/4588595) loss:3.158 lr:0.0000100 epoch_Time:27942.0min: [2024-01-03 10:41:02,482][model8_pretrain.py][INFO] Epoch:[0/2](161800/4588595) loss:3.067 lr:0.0000100 epoch_Time:27941.0min: [2024-01-03 10:41:02,482][model8_pretrain.py][INFO] Epoch:[0/2](161800/4588595) loss:2.637 lr:0.0000100 epoch_Time:27941.0min: [2024-01-03 10:41:02,482][model8_pretrain.py][INFO] Epoch:[0/2](161800/4588595) loss:3.369 lr:0.0000100 epoch_Time:27941.0min: [2024-01-03 10:41:02,482][model8_pretrain.py][INFO] Epoch:[0/2](161800/4588595) loss:3.236 lr:0.0000100 epoch_Time:27941.0min: [2024-01-03 10:41:02,482][model8_pretrain.py][INFO] Epoch:[0/2](161800/4588595) loss:3.123 lr:0.0000100 epoch_Time:27941.0min: [2024-01-03 10:41:02,482][model8_pretrain.py][INFO] Epoch:[0/2](161800/4588595) loss:2.467 lr:0.0000100 epoch_Time:27941.0min: [2024-01-03 10:41:02,482][model8_pretrain.py][INFO] Epoch:[0/2](161800/4588595) loss:2.400 lr:0.0000100 epoch_Time:27941.0min: [2024-01-03 10:41:02,482][model8_pretrain.py][INFO] Epoch:[0/2](161800/4588595) loss:2.742 lr:0.0000100 epoch_Time:27941.0min: [2024-01-03 10:41:39,447][model8_pretrain.py][INFO] Epoch:[0/2](161900/4588595) loss:3.232 lr:0.0000100 epoch_Time:27940.0min: [2024-01-03 10:41:39,447][model8_pretrain.py][INFO] Epoch:[0/2](161900/4588595) loss:3.440 lr:0.0000100 epoch_Time:27940.0min: [2024-01-03 10:41:39,447][model8_pretrain.py][INFO] Epoch:[0/2](161900/4588595) loss:2.738 lr:0.0000100 epoch_Time:27940.0min: [2024-01-03 10:41:39,447][model8_pretrain.py][INFO] Epoch:[0/2](161900/4588595) loss:3.029 lr:0.0000100 epoch_Time:27940.0min: [2024-01-03 10:41:39,447][model8_pretrain.py][INFO] 
Epoch:[0/2](161900/4588595) loss:3.202 lr:0.0000100 epoch_Time:27940.0min: [2024-01-03 10:41:39,447][model8_pretrain.py][INFO] Epoch:[0/2](161900/4588595) loss:3.334 lr:0.0000100 epoch_Time:27940.0min: [2024-01-03 10:41:39,447][model8_pretrain.py][INFO] Epoch:[0/2](161900/4588595) loss:3.166 lr:0.0000100 epoch_Time:27940.0min: [2024-01-03 10:41:39,447][model8_pretrain.py][INFO] Epoch:[0/2](161900/4588595) loss:3.046 lr:0.0000100 epoch_Time:27940.0min: [2024-01-03 10:42:25,222][model8_pretrain.py][INFO] Epoch:[0/2](162000/4588595) loss:3.204 lr:0.0000100 epoch_Time:27943.0min: [2024-01-03 10:42:25,222][model8_pretrain.py][INFO] Epoch:[0/2](162000/4588595) loss:2.704 lr:0.0000100 epoch_Time:27943.0min: [2024-01-03 10:42:25,222][model8_pretrain.py][INFO] Epoch:[0/2](162000/4588595) loss:2.716 lr:0.0000100 epoch_Time:27943.0min: [2024-01-03 10:42:25,222][model8_pretrain.py][INFO] Epoch:[0/2](162000/4588595) loss:2.753 lr:0.0000100 epoch_Time:27943.0min: [2024-01-03 10:42:25,222][model8_pretrain.py][INFO] Epoch:[0/2](162000/4588595) loss:2.882 lr:0.0000100 epoch_Time:27943.0min: [2024-01-03 10:42:25,222][model8_pretrain.py][INFO] Epoch:[0/2](162000/4588595) loss:2.982 lr:0.0000100 epoch_Time:27943.0min: [2024-01-03 10:42:25,222][model8_pretrain.py][INFO] Epoch:[0/2](162000/4588595) loss:3.087 lr:0.0000100 epoch_Time:27943.0min: [2024-01-03 10:42:25,222][model8_pretrain.py][INFO] Epoch:[0/2](162000/4588595) loss:2.914 lr:0.0000100 epoch_Time:27943.0min: [2024-01-03 10:43:02,153][model8_pretrain.py][INFO] Epoch:[0/2](162100/4588595) loss:2.583 lr:0.0000100 epoch_Time:27941.0min: [2024-01-03 10:43:02,153][model8_pretrain.py][INFO] Epoch:[0/2](162100/4588595) loss:2.786 lr:0.0000100 epoch_Time:27941.0min: [2024-01-03 10:43:02,153][model8_pretrain.py][INFO] Epoch:[0/2](162100/4588595) loss:2.862 lr:0.0000100 epoch_Time:27941.0min: [2024-01-03 10:43:02,153][model8_pretrain.py][INFO] Epoch:[0/2](162100/4588595) loss:3.292 lr:0.0000100 epoch_Time:27941.0min: [2024-01-03 
10:43:02,153][model8_pretrain.py][INFO] Epoch:[0/2](162100/4588595) loss:3.385 lr:0.0000100 epoch_Time:27941.0min: [2024-01-03 10:43:02,154][model8_pretrain.py][INFO] Epoch:[0/2](162100/4588595) loss:2.760 lr:0.0000100 epoch_Time:27941.0min: [2024-01-03 10:43:02,154][model8_pretrain.py][INFO] Epoch:[0/2](162100/4588595) loss:3.129 lr:0.0000100 epoch_Time:27941.0min: [2024-01-03 10:43:02,154][model8_pretrain.py][INFO] Epoch:[0/2](162100/4588595) loss:2.964 lr:0.0000100 epoch_Time:27941.0min: [2024-01-03 10:43:39,098][model8_pretrain.py][INFO] Epoch:[0/2](162200/4588595) loss:2.760 lr:0.0000100 epoch_Time:27941.0min: [2024-01-03 10:43:39,098][model8_pretrain.py][INFO] Epoch:[0/2](162200/4588595) loss:2.794 lr:0.0000100 epoch_Time:27941.0min: [2024-01-03 10:43:39,098][model8_pretrain.py][INFO] Epoch:[0/2](162200/4588595) loss:3.312 lr:0.0000100 epoch_Time:27941.0min: [2024-01-03 10:43:39,098][model8_pretrain.py][INFO] Epoch:[0/2](162200/4588595) loss:2.795 lr:0.0000100 epoch_Time:27941.0min: [2024-01-03 10:43:39,098][model8_pretrain.py][INFO] Epoch:[0/2](162200/4588595) loss:2.798 lr:0.0000100 epoch_Time:27941.0min: [2024-01-03 10:43:39,098][model8_pretrain.py][INFO] Epoch:[0/2](162200/4588595) loss:2.822 lr:0.0000100 epoch_Time:27941.0min: [2024-01-03 10:43:39,099][model8_pretrain.py][INFO] Epoch:[0/2](162200/4588595) loss:3.359 lr:0.0000100 epoch_Time:27941.0min: [2024-01-03 10:43:39,099][model8_pretrain.py][INFO] Epoch:[0/2](162200/4588595) loss:2.325 lr:0.0000100 epoch_Time:27941.0min: [2024-01-03 10:44:16,041][model8_pretrain.py][INFO] Epoch:[0/2](162300/4588595) loss:2.931 lr:0.0000100 epoch_Time:27940.0min: [2024-01-03 10:44:16,041][model8_pretrain.py][INFO] Epoch:[0/2](162300/4588595) loss:2.725 lr:0.0000100 epoch_Time:27940.0min: [2024-01-03 10:44:16,041][model8_pretrain.py][INFO] Epoch:[0/2](162300/4588595) loss:3.053 lr:0.0000100 epoch_Time:27940.0min: [2024-01-03 10:44:16,041][model8_pretrain.py][INFO] Epoch:[0/2](162300/4588595) loss:3.211 lr:0.0000100 
epoch_Time:27940.0min: [2024-01-03 10:44:16,041][model8_pretrain.py][INFO] Epoch:[0/2](162300/4588595) loss:2.877 lr:0.0000100 epoch_Time:27940.0min: [2024-01-03 10:44:16,041][model8_pretrain.py][INFO] Epoch:[0/2](162300/4588595) loss:3.185 lr:0.0000100 epoch_Time:27940.0min: [2024-01-03 10:44:16,041][model8_pretrain.py][INFO] Epoch:[0/2](162300/4588595) loss:3.065 lr:0.0000100 epoch_Time:27940.0min: [2024-01-03 10:44:16,041][model8_pretrain.py][INFO] Epoch:[0/2](162300/4588595) loss:3.317 lr:0.0000100 epoch_Time:27940.0min: [2024-01-03 10:44:52,973][model8_pretrain.py][INFO] Epoch:[0/2](162400/4588595) loss:3.282 lr:0.0000100 epoch_Time:27938.0min: [2024-01-03 10:44:52,973][model8_pretrain.py][INFO] Epoch:[0/2](162400/4588595) loss:2.810 lr:0.0000100 epoch_Time:27938.0min: [2024-01-03 10:44:52,973][model8_pretrain.py][INFO] Epoch:[0/2](162400/4588595) loss:3.095 lr:0.0000100 epoch_Time:27938.0min: [2024-01-03 10:44:52,973][model8_pretrain.py][INFO] Epoch:[0/2](162400/4588595) loss:2.775 lr:0.0000100 epoch_Time:27938.0min: [2024-01-03 10:44:52,973][model8_pretrain.py][INFO] Epoch:[0/2](162400/4588595) loss:2.850 lr:0.0000100 epoch_Time:27938.0min: [2024-01-03 10:44:52,973][model8_pretrain.py][INFO] Epoch:[0/2](162400/4588595) loss:3.352 lr:0.0000100 epoch_Time:27938.0min: [2024-01-03 10:44:52,973][model8_pretrain.py][INFO] Epoch:[0/2](162400/4588595) loss:2.600 lr:0.0000100 epoch_Time:27938.0min: [2024-01-03 10:44:52,973][model8_pretrain.py][INFO] Epoch:[0/2](162400/4588595) loss:2.915 lr:0.0000100 epoch_Time:27938.0min: [2024-01-03 10:45:29,924][model8_pretrain.py][INFO] Epoch:[0/2](162500/4588595) loss:3.416 lr:0.0000100 epoch_Time:27938.0min: [2024-01-03 10:45:29,924][model8_pretrain.py][INFO] Epoch:[0/2](162500/4588595) loss:3.344 lr:0.0000100 epoch_Time:27938.0min: [2024-01-03 10:45:29,924][model8_pretrain.py][INFO] Epoch:[0/2](162500/4588595) loss:3.198 lr:0.0000100 epoch_Time:27938.0min: [2024-01-03 10:45:29,924][model8_pretrain.py][INFO] 
Epoch:[0/2](162500/4588595) loss:2.540 lr:0.0000100 epoch_Time:27938.0min: [2024-01-03 10:45:29,924][model8_pretrain.py][INFO] Epoch:[0/2](162500/4588595) loss:3.170 lr:0.0000100 epoch_Time:27938.0min: [2024-01-03 10:45:29,924][model8_pretrain.py][INFO] Epoch:[0/2](162500/4588595) loss:2.914 lr:0.0000100 epoch_Time:27938.0min: [2024-01-03 10:45:29,924][model8_pretrain.py][INFO] Epoch:[0/2](162500/4588595) loss:2.954 lr:0.0000100 epoch_Time:27938.0min: [2024-01-03 10:45:29,924][model8_pretrain.py][INFO] Epoch:[0/2](162500/4588595) loss:2.782 lr:0.0000100 epoch_Time:27938.0min: [2024-01-03 10:46:06,868][model8_pretrain.py][INFO] Epoch:[0/2](162600/4588595) loss:2.654 lr:0.0000100 epoch_Time:27936.0min: [2024-01-03 10:46:06,868][model8_pretrain.py][INFO] Epoch:[0/2](162600/4588595) loss:2.541 lr:0.0000100 epoch_Time:27936.0min: [2024-01-03 10:46:06,868][model8_pretrain.py][INFO] Epoch:[0/2](162600/4588595) loss:2.496 lr:0.0000100 epoch_Time:27936.0min: [2024-01-03 10:46:06,868][model8_pretrain.py][INFO] Epoch:[0/2](162600/4588595) loss:2.778 lr:0.0000100 epoch_Time:27936.0min: [2024-01-03 10:46:06,868][model8_pretrain.py][INFO] Epoch:[0/2](162600/4588595) loss:3.071 lr:0.0000100 epoch_Time:27936.0min: [2024-01-03 10:46:06,868][model8_pretrain.py][INFO] Epoch:[0/2](162600/4588595) loss:2.798 lr:0.0000100 epoch_Time:27936.0min: [2024-01-03 10:46:06,868][model8_pretrain.py][INFO] Epoch:[0/2](162600/4588595) loss:2.940 lr:0.0000100 epoch_Time:27936.0min: [2024-01-03 10:46:06,868][model8_pretrain.py][INFO] Epoch:[0/2](162600/4588595) loss:2.443 lr:0.0000100 epoch_Time:27936.0min: [2024-01-03 10:46:43,776][model8_pretrain.py][INFO] Epoch:[0/2](162700/4588595) loss:2.432 lr:0.0000100 epoch_Time:27936.0min: [2024-01-03 10:46:43,775][model8_pretrain.py][INFO] Epoch:[0/2](162700/4588595) loss:2.891 lr:0.0000100 epoch_Time:27936.0min: [2024-01-03 10:46:43,776][model8_pretrain.py][INFO] Epoch:[0/2](162700/4588595) loss:2.792 lr:0.0000100 epoch_Time:27936.0min: [2024-01-03 
10:46:43,776][model8_pretrain.py][INFO] Epoch:[0/2](162700/4588595) loss:3.315 lr:0.0000100 epoch_Time:27936.0min: [2024-01-03 10:46:43,776][model8_pretrain.py][INFO] Epoch:[0/2](162700/4588595) loss:3.087 lr:0.0000100 epoch_Time:27936.0min: [2024-01-03 10:46:43,776][model8_pretrain.py][INFO] Epoch:[0/2](162700/4588595) loss:3.325 lr:0.0000100 epoch_Time:27936.0min: [2024-01-03 10:46:43,776][model8_pretrain.py][INFO] Epoch:[0/2](162700/4588595) loss:3.090 lr:0.0000100 epoch_Time:27936.0min: [2024-01-03 10:46:43,776][model8_pretrain.py][INFO] Epoch:[0/2](162700/4588595) loss:3.537 lr:0.0000100 epoch_Time:27936.0min: [2024-01-03 10:47:29,409][model8_pretrain.py][INFO] Epoch:[0/2](162800/4588595) loss:2.647 lr:0.0000100 epoch_Time:27938.0min: [2024-01-03 10:47:29,409][model8_pretrain.py][INFO] Epoch:[0/2](162800/4588595) loss:2.954 lr:0.0000100 epoch_Time:27938.0min: [2024-01-03 10:47:29,409][model8_pretrain.py][INFO] Epoch:[0/2](162800/4588595) loss:3.167 lr:0.0000100 epoch_Time:27938.0min: [2024-01-03 10:47:29,409][model8_pretrain.py][INFO] Epoch:[0/2](162800/4588595) loss:2.786 lr:0.0000100 epoch_Time:27938.0min: [2024-01-03 10:47:29,409][model8_pretrain.py][INFO] Epoch:[0/2](162800/4588595) loss:3.399 lr:0.0000100 epoch_Time:27938.0min: [2024-01-03 10:47:29,409][model8_pretrain.py][INFO] Epoch:[0/2](162800/4588595) loss:2.816 lr:0.0000100 epoch_Time:27938.0min: [2024-01-03 10:47:29,409][model8_pretrain.py][INFO] Epoch:[0/2](162800/4588595) loss:3.235 lr:0.0000100 epoch_Time:27938.0min: [2024-01-03 10:47:29,410][model8_pretrain.py][INFO] Epoch:[0/2](162800/4588595) loss:3.154 lr:0.0000100 epoch_Time:27938.0min: [2024-01-03 10:48:06,332][model8_pretrain.py][INFO] Epoch:[0/2](162900/4588595) loss:2.688 lr:0.0000100 epoch_Time:27937.0min: [2024-01-03 10:48:06,332][model8_pretrain.py][INFO] Epoch:[0/2](162900/4588595) loss:3.011 lr:0.0000100 epoch_Time:27937.0min: [2024-01-03 10:48:06,332][model8_pretrain.py][INFO] Epoch:[0/2](162900/4588595) loss:3.531 lr:0.0000100 
epoch_Time:27937.0min: [2024-01-03 10:48:06,332][model8_pretrain.py][INFO] Epoch:[0/2](162900/4588595) loss:2.786 lr:0.0000100 epoch_Time:27937.0min: [2024-01-03 10:48:06,332][model8_pretrain.py][INFO] Epoch:[0/2](162900/4588595) loss:3.146 lr:0.0000100 epoch_Time:27937.0min: [2024-01-03 10:48:06,332][model8_pretrain.py][INFO] Epoch:[0/2](162900/4588595) loss:2.849 lr:0.0000100 epoch_Time:27937.0min: [2024-01-03 10:48:06,332][model8_pretrain.py][INFO] Epoch:[0/2](162900/4588595) loss:3.668 lr:0.0000100 epoch_Time:27937.0min: [2024-01-03 10:48:06,333][model8_pretrain.py][INFO] Epoch:[0/2](162900/4588595) loss:2.841 lr:0.0000100 epoch_Time:27937.0min: [2024-01-03 10:48:43,271][model8_pretrain.py][INFO] Epoch:[0/2](163000/4588595) loss:3.278 lr:0.0000100 epoch_Time:27937.0min: [2024-01-03 10:48:43,271][model8_pretrain.py][INFO] Epoch:[0/2](163000/4588595) loss:3.199 lr:0.0000100 epoch_Time:27937.0min: [2024-01-03 10:48:43,271][model8_pretrain.py][INFO] Epoch:[0/2](163000/4588595) loss:3.023 lr:0.0000100 epoch_Time:27937.0min: [2024-01-03 10:48:43,271][model8_pretrain.py][INFO] Epoch:[0/2](163000/4588595) loss:2.901 lr:0.0000100 epoch_Time:27937.0min: [2024-01-03 10:48:43,271][model8_pretrain.py][INFO] Epoch:[0/2](163000/4588595) loss:2.801 lr:0.0000100 epoch_Time:27937.0min: [2024-01-03 10:48:43,271][model8_pretrain.py][INFO] Epoch:[0/2](163000/4588595) loss:2.873 lr:0.0000100 epoch_Time:27937.0min: [2024-01-03 10:48:43,271][model8_pretrain.py][INFO] Epoch:[0/2](163000/4588595) loss:3.128 lr:0.0000100 epoch_Time:27937.0min: [2024-01-03 10:48:43,271][model8_pretrain.py][INFO] Epoch:[0/2](163000/4588595) loss:2.503 lr:0.0000100 epoch_Time:27937.0min: [2024-01-03 10:49:20,220][model8_pretrain.py][INFO] Epoch:[0/2](163100/4588595) loss:3.339 lr:0.0000100 epoch_Time:27935.0min: [2024-01-03 10:49:20,220][model8_pretrain.py][INFO] Epoch:[0/2](163100/4588595) loss:2.991 lr:0.0000100 epoch_Time:27935.0min: [2024-01-03 10:49:20,220][model8_pretrain.py][INFO] 
Epoch:[0/2](163100/4588595) loss:3.173 lr:0.0000100 epoch_Time:27935.0min: [2024-01-03 10:49:20,220][model8_pretrain.py][INFO] Epoch:[0/2](163100/4588595) loss:3.319 lr:0.0000100 epoch_Time:27935.0min: [2024-01-03 10:49:20,220][model8_pretrain.py][INFO] Epoch:[0/2](163100/4588595) loss:3.066 lr:0.0000100 epoch_Time:27935.0min: [2024-01-03 10:49:20,221][model8_pretrain.py][INFO] Epoch:[0/2](163100/4588595) loss:2.985 lr:0.0000100 epoch_Time:27935.0min: [2024-01-03 10:49:20,221][model8_pretrain.py][INFO] Epoch:[0/2](163100/4588595) loss:2.827 lr:0.0000100 epoch_Time:27935.0min: [2024-01-03 10:49:20,221][model8_pretrain.py][INFO] Epoch:[0/2](163100/4588595) loss:2.946 lr:0.0000100 epoch_Time:27935.0min: [2024-01-03 10:49:57,152][model8_pretrain.py][INFO] Epoch:[0/2](163200/4588595) loss:3.335 lr:0.0000100 epoch_Time:27934.0min: [2024-01-03 10:49:57,152][model8_pretrain.py][INFO] Epoch:[0/2](163200/4588595) loss:2.441 lr:0.0000100 epoch_Time:27934.0min: [2024-01-03 10:49:57,152][model8_pretrain.py][INFO] Epoch:[0/2](163200/4588595) loss:2.429 lr:0.0000100 epoch_Time:27934.0min: [2024-01-03 10:49:57,152][model8_pretrain.py][INFO] Epoch:[0/2](163200/4588595) loss:2.420 lr:0.0000100 epoch_Time:27934.0min: [2024-01-03 10:49:57,152][model8_pretrain.py][INFO] Epoch:[0/2](163200/4588595) loss:3.148 lr:0.0000100 epoch_Time:27934.0min: [2024-01-03 10:49:57,152][model8_pretrain.py][INFO] Epoch:[0/2](163200/4588595) loss:2.518 lr:0.0000100 epoch_Time:27934.0min: [2024-01-03 10:49:57,153][model8_pretrain.py][INFO] Epoch:[0/2](163200/4588595) loss:3.361 lr:0.0000100 epoch_Time:27934.0min: [2024-01-03 10:49:57,153][model8_pretrain.py][INFO] Epoch:[0/2](163200/4588595) loss:3.081 lr:0.0000100 epoch_Time:27934.0min: [2024-01-03 10:50:34,092][model8_pretrain.py][INFO] Epoch:[0/2](163300/4588595) loss:3.349 lr:0.0000100 epoch_Time:27933.0min: [2024-01-03 10:50:34,092][model8_pretrain.py][INFO] Epoch:[0/2](163300/4588595) loss:2.921 lr:0.0000100 epoch_Time:27933.0min: [2024-01-03 
10:50:34,092][model8_pretrain.py][INFO] Epoch:[0/2](163300/4588595) loss:3.244 lr:0.0000100 epoch_Time:27933.0min: [2024-01-03 10:50:34,092][model8_pretrain.py][INFO] Epoch:[0/2](163300/4588595) loss:3.305 lr:0.0000100 epoch_Time:27933.0min: [2024-01-03 10:50:34,092][model8_pretrain.py][INFO] Epoch:[0/2](163300/4588595) loss:2.897 lr:0.0000100 epoch_Time:27933.0min: [2024-01-03 10:50:34,092][model8_pretrain.py][INFO] Epoch:[0/2](163300/4588595) loss:2.578 lr:0.0000100 epoch_Time:27933.0min: [2024-01-03 10:50:34,092][model8_pretrain.py][INFO] Epoch:[0/2](163300/4588595) loss:2.845 lr:0.0000100 epoch_Time:27933.0min: [2024-01-03 10:50:34,092][model8_pretrain.py][INFO] Epoch:[0/2](163300/4588595) loss:2.608 lr:0.0000100 epoch_Time:27933.0min: [2024-01-03 10:51:11,039][model8_pretrain.py][INFO] Epoch:[0/2](163400/4588595) loss:2.811 lr:0.0000100 epoch_Time:27932.0min: [2024-01-03 10:51:11,039][model8_pretrain.py][INFO] Epoch:[0/2](163400/4588595) loss:2.411 lr:0.0000100 epoch_Time:27932.0min: [2024-01-03 10:51:11,039][model8_pretrain.py][INFO] Epoch:[0/2](163400/4588595) loss:3.092 lr:0.0000100 epoch_Time:27932.0min: [2024-01-03 10:51:11,039][model8_pretrain.py][INFO] Epoch:[0/2](163400/4588595) loss:2.146 lr:0.0000100 epoch_Time:27932.0min: [2024-01-03 10:51:11,039][model8_pretrain.py][INFO] Epoch:[0/2](163400/4588595) loss:2.765 lr:0.0000100 epoch_Time:27932.0min: [2024-01-03 10:51:11,039][model8_pretrain.py][INFO] Epoch:[0/2](163400/4588595) loss:2.634 lr:0.0000100 epoch_Time:27932.0min: [2024-01-03 10:51:11,039][model8_pretrain.py][INFO] Epoch:[0/2](163400/4588595) loss:3.065 lr:0.0000100 epoch_Time:27932.0min: [2024-01-03 10:51:11,039][model8_pretrain.py][INFO] Epoch:[0/2](163400/4588595) loss:2.929 lr:0.0000100 epoch_Time:27932.0min: [2024-01-03 10:51:48,002][model8_pretrain.py][INFO] Epoch:[0/2](163500/4588595) loss:3.069 lr:0.0000100 epoch_Time:27930.0min: [2024-01-03 10:51:48,002][model8_pretrain.py][INFO] Epoch:[0/2](163500/4588595) loss:2.792 lr:0.0000100 
epoch_Time:27930.0min: [2024-01-03 10:51:48,002][model8_pretrain.py][INFO] Epoch:[0/2](163500/4588595) loss:3.007 lr:0.0000100 epoch_Time:27930.0min: [2024-01-03 10:51:48,002][model8_pretrain.py][INFO] Epoch:[0/2](163500/4588595) loss:2.545 lr:0.0000100 epoch_Time:27930.0min: [2024-01-03 10:51:48,002][model8_pretrain.py][INFO] Epoch:[0/2](163500/4588595) loss:2.774 lr:0.0000100 epoch_Time:27930.0min: [2024-01-03 10:51:48,002][model8_pretrain.py][INFO] Epoch:[0/2](163500/4588595) loss:2.879 lr:0.0000100 epoch_Time:27930.0min: [2024-01-03 10:51:48,002][model8_pretrain.py][INFO] Epoch:[0/2](163500/4588595) loss:2.711 lr:0.0000100 epoch_Time:27930.0min: [2024-01-03 10:51:48,003][model8_pretrain.py][INFO] Epoch:[0/2](163500/4588595) loss:3.284 lr:0.0000100 epoch_Time:27930.0min: [2024-01-03 10:52:33,691][model8_pretrain.py][INFO] Epoch:[0/2](163600/4588595) loss:3.516 lr:0.0000100 epoch_Time:27934.0min: [2024-01-03 10:52:33,691][model8_pretrain.py][INFO] Epoch:[0/2](163600/4588595) loss:3.181 lr:0.0000100 epoch_Time:27934.0min: [2024-01-03 10:52:33,691][model8_pretrain.py][INFO] Epoch:[0/2](163600/4588595) loss:3.016 lr:0.0000100 epoch_Time:27934.0min: [2024-01-03 10:52:33,691][model8_pretrain.py][INFO] Epoch:[0/2](163600/4588595) loss:2.401 lr:0.0000100 epoch_Time:27934.0min: [2024-01-03 10:52:33,691][model8_pretrain.py][INFO] Epoch:[0/2](163600/4588595) loss:3.058 lr:0.0000100 epoch_Time:27934.0min: [2024-01-03 10:52:33,691][model8_pretrain.py][INFO] Epoch:[0/2](163600/4588595) loss:3.016 lr:0.0000100 epoch_Time:27934.0min: [2024-01-03 10:52:33,691][model8_pretrain.py][INFO] Epoch:[0/2](163600/4588595) loss:3.299 lr:0.0000100 epoch_Time:27934.0min: [2024-01-03 10:52:33,691][model8_pretrain.py][INFO] Epoch:[0/2](163600/4588595) loss:2.383 lr:0.0000100 epoch_Time:27934.0min: [2024-01-03 10:53:10,615][model8_pretrain.py][INFO] Epoch:[0/2](163700/4588595) loss:2.798 lr:0.0000100 epoch_Time:27933.0min: [2024-01-03 10:53:10,615][model8_pretrain.py][INFO] 
Epoch:[0/2](163700/4588595) loss:2.633 lr:0.0000100 epoch_Time:27933.0min: [2024-01-03 10:53:10,615][model8_pretrain.py][INFO] Epoch:[0/2](163700/4588595) loss:3.615 lr:0.0000100 epoch_Time:27933.0min: [2024-01-03 10:53:10,615][model8_pretrain.py][INFO] Epoch:[0/2](163700/4588595) loss:2.986 lr:0.0000100 epoch_Time:27933.0min: [2024-01-03 10:53:10,615][model8_pretrain.py][INFO] Epoch:[0/2](163700/4588595) loss:3.408 lr:0.0000100 epoch_Time:27933.0min: [2024-01-03 10:53:10,615][model8_pretrain.py][INFO] Epoch:[0/2](163700/4588595) loss:3.040 lr:0.0000100 epoch_Time:27933.0min: [2024-01-03 10:53:10,615][model8_pretrain.py][INFO] Epoch:[0/2](163700/4588595) loss:3.455 lr:0.0000100 epoch_Time:27933.0min: [2024-01-03 10:53:10,616][model8_pretrain.py][INFO] Epoch:[0/2](163700/4588595) loss:2.399 lr:0.0000100 epoch_Time:27933.0min: [2024-01-03 10:53:47,537][model8_pretrain.py][INFO] Epoch:[0/2](163800/4588595) loss:3.476 lr:0.0000100 epoch_Time:27932.0min: [2024-01-03 10:53:47,537][model8_pretrain.py][INFO] Epoch:[0/2](163800/4588595) loss:3.132 lr:0.0000100 epoch_Time:27932.0min: [2024-01-03 10:53:47,537][model8_pretrain.py][INFO] Epoch:[0/2](163800/4588595) loss:2.978 lr:0.0000100 epoch_Time:27932.0min: [2024-01-03 10:53:47,537][model8_pretrain.py][INFO] Epoch:[0/2](163800/4588595) loss:2.734 lr:0.0000100 epoch_Time:27932.0min: [2024-01-03 10:53:47,537][model8_pretrain.py][INFO] Epoch:[0/2](163800/4588595) loss:2.869 lr:0.0000100 epoch_Time:27932.0min: [2024-01-03 10:53:47,537][model8_pretrain.py][INFO] Epoch:[0/2](163800/4588595) loss:2.349 lr:0.0000100 epoch_Time:27932.0min: [2024-01-03 10:53:47,537][model8_pretrain.py][INFO] Epoch:[0/2](163800/4588595) loss:3.193 lr:0.0000100 epoch_Time:27932.0min: [2024-01-03 10:53:47,537][model8_pretrain.py][INFO] Epoch:[0/2](163800/4588595) loss:3.478 lr:0.0000100 epoch_Time:27932.0min: [2024-01-03 10:54:24,467][model8_pretrain.py][INFO] Epoch:[0/2](163900/4588595) loss:2.417 lr:0.0000100 epoch_Time:27931.0min: [2024-01-03 
10:54:24,467][model8_pretrain.py][INFO] Epoch:[0/2](163900/4588595) loss:3.232 lr:0.0000100 epoch_Time:27931.0min: [2024-01-03 10:54:24,467][model8_pretrain.py][INFO] Epoch:[0/2](163900/4588595) loss:3.026 lr:0.0000100 epoch_Time:27931.0min: [2024-01-03 10:54:24,467][model8_pretrain.py][INFO] Epoch:[0/2](163900/4588595) loss:2.646 lr:0.0000100 epoch_Time:27931.0min: [2024-01-03 10:54:24,467][model8_pretrain.py][INFO] Epoch:[0/2](163900/4588595) loss:3.353 lr:0.0000100 epoch_Time:27931.0min: [2024-01-03 10:54:24,467][model8_pretrain.py][INFO] Epoch:[0/2](163900/4588595) loss:3.554 lr:0.0000100 epoch_Time:27931.0min: [2024-01-03 10:54:24,468][model8_pretrain.py][INFO] Epoch:[0/2](163900/4588595) loss:3.028 lr:0.0000100 epoch_Time:27931.0min: [2024-01-03 10:54:24,468][model8_pretrain.py][INFO] Epoch:[0/2](163900/4588595) loss:3.408 lr:0.0000100 epoch_Time:27931.0min: [2024-01-03 10:55:01,406][model8_pretrain.py][INFO] Epoch:[0/2](164000/4588595) loss:3.280 lr:0.0000100 epoch_Time:27929.0min: [2024-01-03 10:55:01,406][model8_pretrain.py][INFO] Epoch:[0/2](164000/4588595) loss:3.192 lr:0.0000100 epoch_Time:27929.0min: [2024-01-03 10:55:01,406][model8_pretrain.py][INFO] Epoch:[0/2](164000/4588595) loss:2.652 lr:0.0000100 epoch_Time:27929.0min: [2024-01-03 10:55:01,406][model8_pretrain.py][INFO] Epoch:[0/2](164000/4588595) loss:3.141 lr:0.0000100 epoch_Time:27929.0min: [2024-01-03 10:55:01,406][model8_pretrain.py][INFO] Epoch:[0/2](164000/4588595) loss:3.276 lr:0.0000100 epoch_Time:27929.0min: [2024-01-03 10:55:01,406][model8_pretrain.py][INFO] Epoch:[0/2](164000/4588595) loss:3.237 lr:0.0000100 epoch_Time:27929.0min: [2024-01-03 10:55:01,406][model8_pretrain.py][INFO] Epoch:[0/2](164000/4588595) loss:2.542 lr:0.0000100 epoch_Time:27929.0min: [2024-01-03 10:55:01,406][model8_pretrain.py][INFO] Epoch:[0/2](164000/4588595) loss:2.851 lr:0.0000100 epoch_Time:27929.0min: [2024-01-03 10:55:38,349][model8_pretrain.py][INFO] Epoch:[0/2](164100/4588595) loss:3.265 lr:0.0000100 
epoch_Time:27929.0min: [2024-01-03 10:55:38,349][model8_pretrain.py][INFO] Epoch:[0/2](164100/4588595) loss:1.923 lr:0.0000100 epoch_Time:27929.0min: [2024-01-03 10:55:38,349][model8_pretrain.py][INFO] Epoch:[0/2](164100/4588595) loss:2.662 lr:0.0000100 epoch_Time:27929.0min: [2024-01-03 10:55:38,349][model8_pretrain.py][INFO] Epoch:[0/2](164100/4588595) loss:2.783 lr:0.0000100 epoch_Time:27929.0min: [2024-01-03 10:55:38,349][model8_pretrain.py][INFO] Epoch:[0/2](164100/4588595) loss:3.140 lr:0.0000100 epoch_Time:27929.0min: [2024-01-03 10:55:38,349][model8_pretrain.py][INFO] Epoch:[0/2](164100/4588595) loss:2.876 lr:0.0000100 epoch_Time:27929.0min: [2024-01-03 10:55:38,349][model8_pretrain.py][INFO] Epoch:[0/2](164100/4588595) loss:2.836 lr:0.0000100 epoch_Time:27929.0min: [2024-01-03 10:55:38,349][model8_pretrain.py][INFO] Epoch:[0/2](164100/4588595) loss:3.229 lr:0.0000100 epoch_Time:27929.0min: [2024-01-03 10:56:15,302][model8_pretrain.py][INFO] Epoch:[0/2](164200/4588595) loss:3.058 lr:0.0000100 epoch_Time:27927.0min: [2024-01-03 10:56:15,302][model8_pretrain.py][INFO] Epoch:[0/2](164200/4588595) loss:2.971 lr:0.0000100 epoch_Time:27927.0min: [2024-01-03 10:56:15,302][model8_pretrain.py][INFO] Epoch:[0/2](164200/4588595) loss:2.819 lr:0.0000100 epoch_Time:27927.0min: [2024-01-03 10:56:15,302][model8_pretrain.py][INFO] Epoch:[0/2](164200/4588595) loss:3.164 lr:0.0000100 epoch_Time:27927.0min: [2024-01-03 10:56:15,302][model8_pretrain.py][INFO] Epoch:[0/2](164200/4588595) loss:2.757 lr:0.0000100 epoch_Time:27927.0min: [2024-01-03 10:56:15,302][model8_pretrain.py][INFO] Epoch:[0/2](164200/4588595) loss:3.378 lr:0.0000100 epoch_Time:27927.0min: [2024-01-03 10:56:15,302][model8_pretrain.py][INFO] Epoch:[0/2](164200/4588595) loss:3.236 lr:0.0000100 epoch_Time:27927.0min: [2024-01-03 10:56:15,302][model8_pretrain.py][INFO] Epoch:[0/2](164200/4588595) loss:2.723 lr:0.0000100 epoch_Time:27927.0min: [2024-01-03 10:56:52,250][model8_pretrain.py][INFO] 
Epoch:[0/2](164300/4588595) loss:2.588 lr:0.0000100 epoch_Time:27926.0min: [2024-01-03 10:56:52,250][model8_pretrain.py][INFO] Epoch:[0/2](164300/4588595) loss:3.035 lr:0.0000100 epoch_Time:27926.0min: [2024-01-03 10:56:52,250][model8_pretrain.py][INFO] Epoch:[0/2](164300/4588595) loss:3.018 lr:0.0000100 epoch_Time:27926.0min: [2024-01-03 10:56:52,250][model8_pretrain.py][INFO] Epoch:[0/2](164300/4588595) loss:2.681 lr:0.0000100 epoch_Time:27926.0min: [2024-01-03 10:56:52,250][model8_pretrain.py][INFO] Epoch:[0/2](164300/4588595) loss:2.917 lr:0.0000100 epoch_Time:27926.0min: [2024-01-03 10:56:52,250][model8_pretrain.py][INFO] Epoch:[0/2](164300/4588595) loss:2.667 lr:0.0000100 epoch_Time:27926.0min: [2024-01-03 10:56:52,250][model8_pretrain.py][INFO] Epoch:[0/2](164300/4588595) loss:2.755 lr:0.0000100 epoch_Time:27926.0min: [2024-01-03 10:56:52,250][model8_pretrain.py][INFO] Epoch:[0/2](164300/4588595) loss:2.776 lr:0.0000100 epoch_Time:27926.0min: [2024-01-03 10:57:38,126][model8_pretrain.py][INFO] Epoch:[0/2](164400/4588595) loss:3.094 lr:0.0000100 epoch_Time:27930.0min: [2024-01-03 10:57:38,126][model8_pretrain.py][INFO] Epoch:[0/2](164400/4588595) loss:2.861 lr:0.0000100 epoch_Time:27930.0min: [2024-01-03 10:57:38,126][model8_pretrain.py][INFO] Epoch:[0/2](164400/4588595) loss:3.194 lr:0.0000100 epoch_Time:27930.0min: [2024-01-03 10:57:38,126][model8_pretrain.py][INFO] Epoch:[0/2](164400/4588595) loss:2.911 lr:0.0000100 epoch_Time:27930.0min: [2024-01-03 10:57:38,126][model8_pretrain.py][INFO] Epoch:[0/2](164400/4588595) loss:2.980 lr:0.0000100 epoch_Time:27930.0min: [2024-01-03 10:57:38,126][model8_pretrain.py][INFO] Epoch:[0/2](164400/4588595) loss:3.133 lr:0.0000100 epoch_Time:27930.0min: [2024-01-03 10:57:38,126][model8_pretrain.py][INFO] Epoch:[0/2](164400/4588595) loss:2.949 lr:0.0000100 epoch_Time:27930.0min: [2024-01-03 10:57:38,126][model8_pretrain.py][INFO] Epoch:[0/2](164400/4588595) loss:2.852 lr:0.0000100 epoch_Time:27930.0min: [2024-01-03 
10:58:15,074][model8_pretrain.py][INFO] Epoch:[0/2](164500/4588595) loss:3.283 lr:0.0000100 epoch_Time:27928.0min: [2024-01-03 10:58:15,074][model8_pretrain.py][INFO] Epoch:[0/2](164500/4588595) loss:2.916 lr:0.0000100 epoch_Time:27928.0min: [2024-01-03 10:58:15,074][model8_pretrain.py][INFO] Epoch:[0/2](164500/4588595) loss:3.066 lr:0.0000100 epoch_Time:27928.0min: [2024-01-03 10:58:15,074][model8_pretrain.py][INFO] Epoch:[0/2](164500/4588595) loss:2.830 lr:0.0000100 epoch_Time:27928.0min: [2024-01-03 10:58:15,074][model8_pretrain.py][INFO] Epoch:[0/2](164500/4588595) loss:2.999 lr:0.0000100 epoch_Time:27928.0min: [2024-01-03 10:58:15,074][model8_pretrain.py][INFO] Epoch:[0/2](164500/4588595) loss:2.391 lr:0.0000100 epoch_Time:27928.0min: [2024-01-03 10:58:15,074][model8_pretrain.py][INFO] Epoch:[0/2](164500/4588595) loss:2.990 lr:0.0000100 epoch_Time:27928.0min: [2024-01-03 10:58:15,074][model8_pretrain.py][INFO] Epoch:[0/2](164500/4588595) loss:2.422 lr:0.0000100 epoch_Time:27928.0min: [2024-01-03 10:58:52,019][model8_pretrain.py][INFO] Epoch:[0/2](164600/4588595) loss:2.868 lr:0.0000100 epoch_Time:27927.0min: [2024-01-03 10:58:52,019][model8_pretrain.py][INFO] Epoch:[0/2](164600/4588595) loss:3.127 lr:0.0000100 epoch_Time:27927.0min: [2024-01-03 10:58:52,019][model8_pretrain.py][INFO] Epoch:[0/2](164600/4588595) loss:2.813 lr:0.0000100 epoch_Time:27927.0min: [2024-01-03 10:58:52,019][model8_pretrain.py][INFO] Epoch:[0/2](164600/4588595) loss:2.564 lr:0.0000100 epoch_Time:27927.0min: [2024-01-03 10:58:52,019][model8_pretrain.py][INFO] Epoch:[0/2](164600/4588595) loss:3.068 lr:0.0000100 epoch_Time:27927.0min: [2024-01-03 10:58:52,019][model8_pretrain.py][INFO] Epoch:[0/2](164600/4588595) loss:3.459 lr:0.0000100 epoch_Time:27927.0min: [2024-01-03 10:58:52,019][model8_pretrain.py][INFO] Epoch:[0/2](164600/4588595) loss:2.604 lr:0.0000100 epoch_Time:27927.0min: [2024-01-03 10:58:52,020][model8_pretrain.py][INFO] Epoch:[0/2](164600/4588595) loss:3.039 lr:0.0000100 
epoch_Time:27927.0min: [2024-01-03 10:59:28,956][model8_pretrain.py][INFO] Epoch:[0/2](164700/4588595) loss:3.156 lr:0.0000100 epoch_Time:27926.0min: [2024-01-03 10:59:28,956][model8_pretrain.py][INFO] Epoch:[0/2](164700/4588595) loss:3.112 lr:0.0000100 epoch_Time:27926.0min: [2024-01-03 10:59:28,956][model8_pretrain.py][INFO] Epoch:[0/2](164700/4588595) loss:3.186 lr:0.0000100 epoch_Time:27926.0min: [2024-01-03 10:59:28,956][model8_pretrain.py][INFO] Epoch:[0/2](164700/4588595) loss:3.053 lr:0.0000100 epoch_Time:27926.0min: [2024-01-03 10:59:28,956][model8_pretrain.py][INFO] Epoch:[0/2](164700/4588595) loss:3.283 lr:0.0000100 epoch_Time:27926.0min: [2024-01-03 10:59:28,956][model8_pretrain.py][INFO] Epoch:[0/2](164700/4588595) loss:2.788 lr:0.0000100 epoch_Time:27926.0min: [2024-01-03 10:59:28,956][model8_pretrain.py][INFO] Epoch:[0/2](164700/4588595) loss:3.076 lr:0.0000100 epoch_Time:27926.0min: [2024-01-03 10:59:28,956][model8_pretrain.py][INFO] Epoch:[0/2](164700/4588595) loss:2.345 lr:0.0000100 epoch_Time:27926.0min: [2024-01-03 11:00:05,939][model8_pretrain.py][INFO] Epoch:[0/2](164800/4588595) loss:2.977 lr:0.0000100 epoch_Time:27925.0min: [2024-01-03 11:00:05,939][model8_pretrain.py][INFO] Epoch:[0/2](164800/4588595) loss:2.359 lr:0.0000100 epoch_Time:27925.0min: [2024-01-03 11:00:05,939][model8_pretrain.py][INFO] Epoch:[0/2](164800/4588595) loss:2.676 lr:0.0000100 epoch_Time:27925.0min: [2024-01-03 11:00:05,939][model8_pretrain.py][INFO] Epoch:[0/2](164800/4588595) loss:2.976 lr:0.0000100 epoch_Time:27925.0min: [2024-01-03 11:00:05,939][model8_pretrain.py][INFO] Epoch:[0/2](164800/4588595) loss:3.102 lr:0.0000100 epoch_Time:27925.0min: [2024-01-03 11:00:05,940][model8_pretrain.py][INFO] Epoch:[0/2](164800/4588595) loss:3.194 lr:0.0000100 epoch_Time:27925.0min: [2024-01-03 11:00:05,940][model8_pretrain.py][INFO] Epoch:[0/2](164800/4588595) loss:3.105 lr:0.0000100 epoch_Time:27925.0min: [2024-01-03 11:00:05,940][model8_pretrain.py][INFO] 
Epoch:[0/2](164800/4588595) loss:3.168 lr:0.0000100 epoch_Time:27925.0min: [2024-01-03 11:00:42,931][model8_pretrain.py][INFO] Epoch:[0/2](164900/4588595) loss:2.856 lr:0.0000100 epoch_Time:27925.0min: [2024-01-03 11:00:42,931][model8_pretrain.py][INFO] Epoch:[0/2](164900/4588595) loss:2.998 lr:0.0000100 epoch_Time:27925.0min: [2024-01-03 11:00:42,931][model8_pretrain.py][INFO] Epoch:[0/2](164900/4588595) loss:3.011 lr:0.0000100 epoch_Time:27925.0min: [2024-01-03 11:00:42,931][model8_pretrain.py][INFO] Epoch:[0/2](164900/4588595) loss:3.128 lr:0.0000100 epoch_Time:27925.0min: [2024-01-03 11:00:42,931][model8_pretrain.py][INFO] Epoch:[0/2](164900/4588595) loss:3.059 lr:0.0000100 epoch_Time:27925.0min: [2024-01-03 11:00:42,931][model8_pretrain.py][INFO] Epoch:[0/2](164900/4588595) loss:3.144 lr:0.0000100 epoch_Time:27925.0min: [2024-01-03 11:00:42,931][model8_pretrain.py][INFO] Epoch:[0/2](164900/4588595) loss:2.925 lr:0.0000100 epoch_Time:27925.0min: [2024-01-03 11:00:42,931][model8_pretrain.py][INFO] Epoch:[0/2](164900/4588595) loss:2.947 lr:0.0000100 epoch_Time:27925.0min: [2024-01-03 11:01:19,888][model8_pretrain.py][INFO] Epoch:[0/2](165000/4588595) loss:2.558 lr:0.0000100 epoch_Time:27923.0min: [2024-01-03 11:01:19,888][model8_pretrain.py][INFO] Epoch:[0/2](165000/4588595) loss:2.439 lr:0.0000100 epoch_Time:27923.0min: [2024-01-03 11:01:19,888][model8_pretrain.py][INFO] Epoch:[0/2](165000/4588595) loss:3.127 lr:0.0000100 epoch_Time:27923.0min: [2024-01-03 11:01:19,888][model8_pretrain.py][INFO] Epoch:[0/2](165000/4588595) loss:3.062 lr:0.0000100 epoch_Time:27923.0min: [2024-01-03 11:01:19,888][model8_pretrain.py][INFO] Epoch:[0/2](165000/4588595) loss:2.445 lr:0.0000100 epoch_Time:27923.0min: [2024-01-03 11:01:19,888][model8_pretrain.py][INFO] Epoch:[0/2](165000/4588595) loss:3.089 lr:0.0000100 epoch_Time:27923.0min: [2024-01-03 11:01:19,888][model8_pretrain.py][INFO] Epoch:[0/2](165000/4588595) loss:2.966 lr:0.0000100 epoch_Time:27923.0min: [2024-01-03 
11:01:19,888][model8_pretrain.py][INFO] Epoch:[0/2](165000/4588595) loss:2.606 lr:0.0000100 epoch_Time:27923.0min: [2024-01-03 11:01:56,833][model8_pretrain.py][INFO] Epoch:[0/2](165100/4588595) loss:3.486 lr:0.0000100 epoch_Time:27922.0min: [2024-01-03 11:01:56,833][model8_pretrain.py][INFO] Epoch:[0/2](165100/4588595) loss:2.728 lr:0.0000100 epoch_Time:27922.0min: [2024-01-03 11:01:56,833][model8_pretrain.py][INFO] Epoch:[0/2](165100/4588595) loss:2.814 lr:0.0000100 epoch_Time:27922.0min: [2024-01-03 11:01:56,833][model8_pretrain.py][INFO] Epoch:[0/2](165100/4588595) loss:2.930 lr:0.0000100 epoch_Time:27922.0min: [2024-01-03 11:01:56,833][model8_pretrain.py][INFO] Epoch:[0/2](165100/4588595) loss:2.670 lr:0.0000100 epoch_Time:27922.0min: [2024-01-03 11:01:56,833][model8_pretrain.py][INFO] Epoch:[0/2](165100/4588595) loss:3.290 lr:0.0000100 epoch_Time:27922.0min: [2024-01-03 11:01:56,833][model8_pretrain.py][INFO] Epoch:[0/2](165100/4588595) loss:3.267 lr:0.0000100 epoch_Time:27922.0min: [2024-01-03 11:01:56,833][model8_pretrain.py][INFO] Epoch:[0/2](165100/4588595) loss:3.268 lr:0.0000100 epoch_Time:27922.0min: [2024-01-03 11:02:43,428][model8_pretrain.py][INFO] Epoch:[0/2](165200/4588595) loss:2.431 lr:0.0000100 epoch_Time:27926.0min: [2024-01-03 11:02:43,428][model8_pretrain.py][INFO] Epoch:[0/2](165200/4588595) loss:2.712 lr:0.0000100 epoch_Time:27926.0min: [2024-01-03 11:02:43,428][model8_pretrain.py][INFO] Epoch:[0/2](165200/4588595) loss:3.155 lr:0.0000100 epoch_Time:27926.0min: [2024-01-03 11:02:43,428][model8_pretrain.py][INFO] Epoch:[0/2](165200/4588595) loss:3.049 lr:0.0000100 epoch_Time:27926.0min: [2024-01-03 11:02:43,428][model8_pretrain.py][INFO] Epoch:[0/2](165200/4588595) loss:3.256 lr:0.0000100 epoch_Time:27926.0min: [2024-01-03 11:02:43,428][model8_pretrain.py][INFO] Epoch:[0/2](165200/4588595) loss:2.348 lr:0.0000100 epoch_Time:27926.0min: [2024-01-03 11:02:43,428][model8_pretrain.py][INFO] Epoch:[0/2](165200/4588595) loss:3.222 lr:0.0000100 
epoch_Time:27926.0min: [2024-01-03 11:02:43,428][model8_pretrain.py][INFO] Epoch:[0/2](165200/4588595) loss:3.122 lr:0.0000100 epoch_Time:27926.0min: [2024-01-03 11:03:20,361][model8_pretrain.py][INFO] Epoch:[0/2](165300/4588595) loss:2.961 lr:0.0000100 epoch_Time:27924.0min: [2024-01-03 11:03:20,361][model8_pretrain.py][INFO] Epoch:[0/2](165300/4588595) loss:2.983 lr:0.0000100 epoch_Time:27924.0min: [2024-01-03 11:03:20,361][model8_pretrain.py][INFO] Epoch:[0/2](165300/4588595) loss:3.286 lr:0.0000100 epoch_Time:27924.0min: [2024-01-03 11:03:20,361][model8_pretrain.py][INFO] Epoch:[0/2](165300/4588595) loss:3.028 lr:0.0000100 epoch_Time:27924.0min: [2024-01-03 11:03:20,361][model8_pretrain.py][INFO] Epoch:[0/2](165300/4588595) loss:3.495 lr:0.0000100 epoch_Time:27924.0min: [2024-01-03 11:03:20,361][model8_pretrain.py][INFO] Epoch:[0/2](165300/4588595) loss:3.394 lr:0.0000100 epoch_Time:27924.0min: [2024-01-03 11:03:20,361][model8_pretrain.py][INFO] Epoch:[0/2](165300/4588595) loss:2.686 lr:0.0000100 epoch_Time:27924.0min: [2024-01-03 11:03:20,362][model8_pretrain.py][INFO] Epoch:[0/2](165300/4588595) loss:3.310 lr:0.0000100 epoch_Time:27924.0min: [2024-01-03 11:03:57,311][model8_pretrain.py][INFO] Epoch:[0/2](165400/4588595) loss:3.157 lr:0.0000100 epoch_Time:27923.0min: [2024-01-03 11:03:57,311][model8_pretrain.py][INFO] Epoch:[0/2](165400/4588595) loss:3.024 lr:0.0000100 epoch_Time:27923.0min: [2024-01-03 11:03:57,311][model8_pretrain.py][INFO] Epoch:[0/2](165400/4588595) loss:2.915 lr:0.0000100 epoch_Time:27923.0min: [2024-01-03 11:03:57,311][model8_pretrain.py][INFO] Epoch:[0/2](165400/4588595) loss:2.434 lr:0.0000100 epoch_Time:27923.0min: [2024-01-03 11:03:57,311][model8_pretrain.py][INFO] Epoch:[0/2](165400/4588595) loss:3.449 lr:0.0000100 epoch_Time:27923.0min: [2024-01-03 11:03:57,311][model8_pretrain.py][INFO] Epoch:[0/2](165400/4588595) loss:2.894 lr:0.0000100 epoch_Time:27923.0min: [2024-01-03 11:03:57,311][model8_pretrain.py][INFO] 
Epoch:[0/2](165400/4588595) loss:3.590 lr:0.0000100 epoch_Time:27923.0min: [2024-01-03 11:03:57,312][model8_pretrain.py][INFO] Epoch:[0/2](165400/4588595) loss:2.773 lr:0.0000100 epoch_Time:27923.0min: [2024-01-03 11:04:34,254][model8_pretrain.py][INFO] Epoch:[0/2](165500/4588595) loss:2.867 lr:0.0000100 epoch_Time:27922.0min: [2024-01-03 11:04:34,254][model8_pretrain.py][INFO] Epoch:[0/2](165500/4588595) loss:3.139 lr:0.0000100 epoch_Time:27922.0min: [2024-01-03 11:04:34,254][model8_pretrain.py][INFO] Epoch:[0/2](165500/4588595) loss:2.514 lr:0.0000100 epoch_Time:27922.0min: [2024-01-03 11:04:34,254][model8_pretrain.py][INFO] Epoch:[0/2](165500/4588595) loss:3.249 lr:0.0000100 epoch_Time:27922.0min: [2024-01-03 11:04:34,254][model8_pretrain.py][INFO] Epoch:[0/2](165500/4588595) loss:3.002 lr:0.0000100 epoch_Time:27922.0min: [2024-01-03 11:04:34,254][model8_pretrain.py][INFO] Epoch:[0/2](165500/4588595) loss:2.779 lr:0.0000100 epoch_Time:27922.0min: [2024-01-03 11:04:34,254][model8_pretrain.py][INFO] Epoch:[0/2](165500/4588595) loss:2.932 lr:0.0000100 epoch_Time:27922.0min: [2024-01-03 11:04:34,255][model8_pretrain.py][INFO] Epoch:[0/2](165500/4588595) loss:3.033 lr:0.0000100 epoch_Time:27922.0min: [2024-01-03 11:05:11,214][model8_pretrain.py][INFO] Epoch:[0/2](165600/4588595) loss:3.134 lr:0.0000100 epoch_Time:27921.0min: [2024-01-03 11:05:11,214][model8_pretrain.py][INFO] Epoch:[0/2](165600/4588595) loss:2.642 lr:0.0000100 epoch_Time:27921.0min: [2024-01-03 11:05:11,214][model8_pretrain.py][INFO] Epoch:[0/2](165600/4588595) loss:2.868 lr:0.0000100 epoch_Time:27921.0min: [2024-01-03 11:05:11,214][model8_pretrain.py][INFO] Epoch:[0/2](165600/4588595) loss:2.863 lr:0.0000100 epoch_Time:27921.0min: [2024-01-03 11:05:11,214][model8_pretrain.py][INFO] Epoch:[0/2](165600/4588595) loss:2.920 lr:0.0000100 epoch_Time:27921.0min: [2024-01-03 11:05:11,214][model8_pretrain.py][INFO] Epoch:[0/2](165600/4588595) loss:3.605 lr:0.0000100 epoch_Time:27921.0min: [2024-01-03 
11:05:11,214][model8_pretrain.py][INFO] Epoch:[0/2](165600/4588595) loss:2.979 lr:0.0000100 epoch_Time:27921.0min: [2024-01-03 11:05:11,214][model8_pretrain.py][INFO] Epoch:[0/2](165600/4588595) loss:2.862 lr:0.0000100 epoch_Time:27921.0min: [2024-01-03 11:05:48,164][model8_pretrain.py][INFO] Epoch:[0/2](165700/4588595) loss:3.093 lr:0.0000100 epoch_Time:27920.0min: [2024-01-03 11:05:48,164][model8_pretrain.py][INFO] Epoch:[0/2](165700/4588595) loss:3.152 lr:0.0000100 epoch_Time:27920.0min: [2024-01-03 11:05:48,164][model8_pretrain.py][INFO] Epoch:[0/2](165700/4588595) loss:2.884 lr:0.0000100 epoch_Time:27920.0min: [2024-01-03 11:05:48,164][model8_pretrain.py][INFO] Epoch:[0/2](165700/4588595) loss:3.526 lr:0.0000100 epoch_Time:27920.0min: [2024-01-03 11:05:48,164][model8_pretrain.py][INFO] Epoch:[0/2](165700/4588595) loss:3.508 lr:0.0000100 epoch_Time:27920.0min: [2024-01-03 11:05:48,164][model8_pretrain.py][INFO] Epoch:[0/2](165700/4588595) loss:3.068 lr:0.0000100 epoch_Time:27920.0min: [2024-01-03 11:05:48,164][model8_pretrain.py][INFO] Epoch:[0/2](165700/4588595) loss:2.883 lr:0.0000100 epoch_Time:27920.0min: [2024-01-03 11:05:48,164][model8_pretrain.py][INFO] Epoch:[0/2](165700/4588595) loss:3.058 lr:0.0000100 epoch_Time:27920.0min: [2024-01-03 11:06:25,111][model8_pretrain.py][INFO] Epoch:[0/2](165800/4588595) loss:3.199 lr:0.0000100 epoch_Time:27919.0min: [2024-01-03 11:06:25,111][model8_pretrain.py][INFO] Epoch:[0/2](165800/4588595) loss:3.066 lr:0.0000100 epoch_Time:27919.0min: [2024-01-03 11:06:25,111][model8_pretrain.py][INFO] Epoch:[0/2](165800/4588595) loss:2.939 lr:0.0000100 epoch_Time:27919.0min: [2024-01-03 11:06:25,111][model8_pretrain.py][INFO] Epoch:[0/2](165800/4588595) loss:3.054 lr:0.0000100 epoch_Time:27919.0min: [2024-01-03 11:06:25,111][model8_pretrain.py][INFO] Epoch:[0/2](165800/4588595) loss:3.183 lr:0.0000100 epoch_Time:27919.0min: [2024-01-03 11:06:25,111][model8_pretrain.py][INFO] Epoch:[0/2](165800/4588595) loss:2.872 lr:0.0000100 
epoch_Time:27919.0min: [2024-01-03 11:06:25,111][model8_pretrain.py][INFO] Epoch:[0/2](165800/4588595) loss:2.342 lr:0.0000100 epoch_Time:27919.0min: [2024-01-03 11:06:25,111][model8_pretrain.py][INFO] Epoch:[0/2](165800/4588595) loss:3.013 lr:0.0000100 epoch_Time:27919.0min: [2024-01-03 11:07:02,053][model8_pretrain.py][INFO] Epoch:[0/2](165900/4588595) loss:3.141 lr:0.0000100 epoch_Time:27918.0min: [2024-01-03 11:07:02,053][model8_pretrain.py][INFO] Epoch:[0/2](165900/4588595) loss:3.009 lr:0.0000100 epoch_Time:27918.0min: [2024-01-03 11:07:02,053][model8_pretrain.py][INFO] Epoch:[0/2](165900/4588595) loss:3.002 lr:0.0000100 epoch_Time:27918.0min: [2024-01-03 11:07:02,053][model8_pretrain.py][INFO] Epoch:[0/2](165900/4588595) loss:3.372 lr:0.0000100 epoch_Time:27918.0min: [2024-01-03 11:07:02,053][model8_pretrain.py][INFO] Epoch:[0/2](165900/4588595) loss:2.797 lr:0.0000100 epoch_Time:27918.0min: [2024-01-03 11:07:02,053][model8_pretrain.py][INFO] Epoch:[0/2](165900/4588595) loss:2.881 lr:0.0000100 epoch_Time:27918.0min: [2024-01-03 11:07:02,054][model8_pretrain.py][INFO] Epoch:[0/2](165900/4588595) loss:2.965 lr:0.0000100 epoch_Time:27918.0min: [2024-01-03 11:07:02,054][model8_pretrain.py][INFO] Epoch:[0/2](165900/4588595) loss:3.399 lr:0.0000100 epoch_Time:27918.0min: [2024-01-03 11:07:47,732][model8_pretrain.py][INFO] Epoch:[0/2](166000/4588595) loss:3.045 lr:0.0000100 epoch_Time:27920.0min: [2024-01-03 11:07:47,732][model8_pretrain.py][INFO] Epoch:[0/2](166000/4588595) loss:2.675 lr:0.0000100 epoch_Time:27920.0min: [2024-01-03 11:07:47,732][model8_pretrain.py][INFO] Epoch:[0/2](166000/4588595) loss:2.345 lr:0.0000100 epoch_Time:27920.0min: [2024-01-03 11:07:47,732][model8_pretrain.py][INFO] Epoch:[0/2](166000/4588595) loss:2.907 lr:0.0000100 epoch_Time:27920.0min: [2024-01-03 11:07:47,732][model8_pretrain.py][INFO] Epoch:[0/2](166000/4588595) loss:1.974 lr:0.0000100 epoch_Time:27920.0min: [2024-01-03 11:07:47,732][model8_pretrain.py][INFO] 
Epoch:[0/2](166000/4588595) loss:2.767 lr:0.0000100 epoch_Time:27920.0min: [2024-01-03 11:07:47,732][model8_pretrain.py][INFO] Epoch:[0/2](166000/4588595) loss:2.885 lr:0.0000100 epoch_Time:27920.0min: [2024-01-03 11:07:47,732][model8_pretrain.py][INFO] Epoch:[0/2](166000/4588595) loss:3.215 lr:0.0000100 epoch_Time:27920.0min: [2024-01-03 11:08:24,651][model8_pretrain.py][INFO] Epoch:[0/2](166100/4588595) loss:3.295 lr:0.0000100 epoch_Time:27920.0min: [2024-01-03 11:08:24,651][model8_pretrain.py][INFO] Epoch:[0/2](166100/4588595) loss:2.511 lr:0.0000100 epoch_Time:27920.0min: [2024-01-03 11:08:24,651][model8_pretrain.py][INFO] Epoch:[0/2](166100/4588595) loss:2.785 lr:0.0000100 epoch_Time:27920.0min: [2024-01-03 11:08:24,651][model8_pretrain.py][INFO] Epoch:[0/2](166100/4588595) loss:3.042 lr:0.0000100 epoch_Time:27920.0min: [2024-01-03 11:08:24,651][model8_pretrain.py][INFO] Epoch:[0/2](166100/4588595) loss:2.983 lr:0.0000100 epoch_Time:27920.0min: [2024-01-03 11:08:24,651][model8_pretrain.py][INFO] Epoch:[0/2](166100/4588595) loss:3.551 lr:0.0000100 epoch_Time:27920.0min: [2024-01-03 11:08:24,651][model8_pretrain.py][INFO] Epoch:[0/2](166100/4588595) loss:2.414 lr:0.0000100 epoch_Time:27920.0min: [2024-01-03 11:08:24,652][model8_pretrain.py][INFO] Epoch:[0/2](166100/4588595) loss:2.661 lr:0.0000100 epoch_Time:27920.0min: [2024-01-03 11:09:01,590][model8_pretrain.py][INFO] Epoch:[0/2](166200/4588595) loss:2.822 lr:0.0000100 epoch_Time:27918.0min: [2024-01-03 11:09:01,590][model8_pretrain.py][INFO] Epoch:[0/2](166200/4588595) loss:2.934 lr:0.0000100 epoch_Time:27918.0min: [2024-01-03 11:09:01,590][model8_pretrain.py][INFO] Epoch:[0/2](166200/4588595) loss:2.801 lr:0.0000100 epoch_Time:27918.0min: [2024-01-03 11:09:01,590][model8_pretrain.py][INFO] Epoch:[0/2](166200/4588595) loss:3.068 lr:0.0000100 epoch_Time:27918.0min: [2024-01-03 11:09:01,590][model8_pretrain.py][INFO] Epoch:[0/2](166200/4588595) loss:3.108 lr:0.0000100 epoch_Time:27918.0min: [2024-01-03 
11:09:01,590][model8_pretrain.py][INFO] Epoch:[0/2](166200/4588595) loss:3.313 lr:0.0000100 epoch_Time:27918.0min: [2024-01-03 11:09:01,590][model8_pretrain.py][INFO] Epoch:[0/2](166200/4588595) loss:2.850 lr:0.0000100 epoch_Time:27918.0min: [2024-01-03 11:09:01,591][model8_pretrain.py][INFO] Epoch:[0/2](166200/4588595) loss:2.671 lr:0.0000100 epoch_Time:27918.0min: [2024-01-03 11:09:38,545][model8_pretrain.py][INFO] Epoch:[0/2](166300/4588595) loss:3.066 lr:0.0000100 epoch_Time:27918.0min: [2024-01-03 11:09:38,545][model8_pretrain.py][INFO] Epoch:[0/2](166300/4588595) loss:3.043 lr:0.0000100 epoch_Time:27918.0min: [2024-01-03 11:09:38,545][model8_pretrain.py][INFO] Epoch:[0/2](166300/4588595) loss:3.292 lr:0.0000100 epoch_Time:27918.0min: [2024-01-03 11:09:38,546][model8_pretrain.py][INFO] Epoch:[0/2](166300/4588595) loss:2.842 lr:0.0000100 epoch_Time:27918.0min: [2024-01-03 11:09:38,546][model8_pretrain.py][INFO] Epoch:[0/2](166300/4588595) loss:2.732 lr:0.0000100 epoch_Time:27918.0min: [2024-01-03 11:09:38,546][model8_pretrain.py][INFO] Epoch:[0/2](166300/4588595) loss:2.708 lr:0.0000100 epoch_Time:27918.0min: [2024-01-03 11:09:38,546][model8_pretrain.py][INFO] Epoch:[0/2](166300/4588595) loss:3.152 lr:0.0000100 epoch_Time:27918.0min: [2024-01-03 11:09:38,546][model8_pretrain.py][INFO] Epoch:[0/2](166300/4588595) loss:3.354 lr:0.0000100 epoch_Time:27918.0min: [2024-01-03 11:10:15,521][model8_pretrain.py][INFO] Epoch:[0/2](166400/4588595) loss:2.804 lr:0.0000100 epoch_Time:27917.0min: [2024-01-03 11:10:15,521][model8_pretrain.py][INFO] Epoch:[0/2](166400/4588595) loss:3.153 lr:0.0000100 epoch_Time:27917.0min: [2024-01-03 11:10:15,522][model8_pretrain.py][INFO] Epoch:[0/2](166400/4588595) loss:2.820 lr:0.0000100 epoch_Time:27917.0min: [2024-01-03 11:10:15,522][model8_pretrain.py][INFO] Epoch:[0/2](166400/4588595) loss:3.562 lr:0.0000100 epoch_Time:27917.0min: [2024-01-03 11:10:15,522][model8_pretrain.py][INFO] Epoch:[0/2](166400/4588595) loss:3.297 lr:0.0000100 
epoch_Time:27917.0min: [2024-01-03 11:10:15,522][model8_pretrain.py][INFO] Epoch:[0/2](166400/4588595) loss:3.280 lr:0.0000100 epoch_Time:27917.0min: [2024-01-03 11:10:15,522][model8_pretrain.py][INFO] Epoch:[0/2](166400/4588595) loss:2.541 lr:0.0000100 epoch_Time:27917.0min: [2024-01-03 11:10:15,522][model8_pretrain.py][INFO] Epoch:[0/2](166400/4588595) loss:3.373 lr:0.0000100 epoch_Time:27917.0min: [2024-01-03 11:10:52,468][model8_pretrain.py][INFO] Epoch:[0/2](166500/4588595) loss:3.036 lr:0.0000100 epoch_Time:27915.0min: [2024-01-03 11:10:52,468][model8_pretrain.py][INFO] Epoch:[0/2](166500/4588595) loss:2.895 lr:0.0000100 epoch_Time:27915.0min: [2024-01-03 11:10:52,469][model8_pretrain.py][INFO] Epoch:[0/2](166500/4588595) loss:2.641 lr:0.0000100 epoch_Time:27915.0min: [2024-01-03 11:10:52,469][model8_pretrain.py][INFO] Epoch:[0/2](166500/4588595) loss:3.238 lr:0.0000100 epoch_Time:27915.0min: [2024-01-03 11:10:52,469][model8_pretrain.py][INFO] Epoch:[0/2](166500/4588595) loss:2.803 lr:0.0000100 epoch_Time:27915.0min: [2024-01-03 11:10:52,469][model8_pretrain.py][INFO] Epoch:[0/2](166500/4588595) loss:2.827 lr:0.0000100 epoch_Time:27915.0min: [2024-01-03 11:10:52,469][model8_pretrain.py][INFO] Epoch:[0/2](166500/4588595) loss:3.336 lr:0.0000100 epoch_Time:27915.0min: [2024-01-03 11:10:52,469][model8_pretrain.py][INFO] Epoch:[0/2](166500/4588595) loss:2.568 lr:0.0000100 epoch_Time:27915.0min: [2024-01-03 11:11:29,413][model8_pretrain.py][INFO] Epoch:[0/2](166600/4588595) loss:2.606 lr:0.0000100 epoch_Time:27915.0min: [2024-01-03 11:11:29,413][model8_pretrain.py][INFO] Epoch:[0/2](166600/4588595) loss:2.492 lr:0.0000100 epoch_Time:27915.0min: [2024-01-03 11:11:29,413][model8_pretrain.py][INFO] Epoch:[0/2](166600/4588595) loss:2.817 lr:0.0000100 epoch_Time:27915.0min: [2024-01-03 11:11:29,413][model8_pretrain.py][INFO] Epoch:[0/2](166600/4588595) loss:2.988 lr:0.0000100 epoch_Time:27915.0min: [2024-01-03 11:11:29,414][model8_pretrain.py][INFO] 
Epoch:[0/2](166600/4588595) loss:2.634 lr:0.0000100 epoch_Time:27915.0min: [2024-01-03 11:11:29,414][model8_pretrain.py][INFO] Epoch:[0/2](166600/4588595) loss:2.505 lr:0.0000100 epoch_Time:27915.0min: [2024-01-03 11:11:29,414][model8_pretrain.py][INFO] Epoch:[0/2](166600/4588595) loss:3.297 lr:0.0000100 epoch_Time:27915.0min: [2024-01-03 11:11:29,414][model8_pretrain.py][INFO] Epoch:[0/2](166600/4588595) loss:3.483 lr:0.0000100 epoch_Time:27915.0min: [2024-01-03 11:12:06,345][model8_pretrain.py][INFO] Epoch:[0/2](166700/4588595) loss:3.338 lr:0.0000100 epoch_Time:27913.0min: [2024-01-03 11:12:06,345][model8_pretrain.py][INFO] Epoch:[0/2](166700/4588595) loss:3.623 lr:0.0000100 epoch_Time:27913.0min: [2024-01-03 11:12:06,345][model8_pretrain.py][INFO] Epoch:[0/2](166700/4588595) loss:3.337 lr:0.0000100 epoch_Time:27913.0min: [2024-01-03 11:12:06,345][model8_pretrain.py][INFO] Epoch:[0/2](166700/4588595) loss:3.588 lr:0.0000100 epoch_Time:27913.0min: [2024-01-03 11:12:06,345][model8_pretrain.py][INFO] Epoch:[0/2](166700/4588595) loss:2.959 lr:0.0000100 epoch_Time:27913.0min: [2024-01-03 11:12:06,345][model8_pretrain.py][INFO] Epoch:[0/2](166700/4588595) loss:3.141 lr:0.0000100 epoch_Time:27913.0min: [2024-01-03 11:12:06,346][model8_pretrain.py][INFO] Epoch:[0/2](166700/4588595) loss:3.702 lr:0.0000100 epoch_Time:27913.0min: [2024-01-03 11:12:06,347][model8_pretrain.py][INFO] Epoch:[0/2](166700/4588595) loss:3.370 lr:0.0000100 epoch_Time:27913.0min: [2024-01-03 11:12:50,326][model8_pretrain.py][INFO] Epoch:[0/2](166800/4588595) loss:3.325 lr:0.0000100 epoch_Time:27915.0min: [2024-01-03 11:12:50,326][model8_pretrain.py][INFO] Epoch:[0/2](166800/4588595) loss:2.696 lr:0.0000100 epoch_Time:27915.0min: [2024-01-03 11:12:50,326][model8_pretrain.py][INFO] Epoch:[0/2](166800/4588595) loss:2.467 lr:0.0000100 epoch_Time:27915.0min: [2024-01-03 11:12:50,326][model8_pretrain.py][INFO] Epoch:[0/2](166800/4588595) loss:3.303 lr:0.0000100 epoch_Time:27915.0min: [2024-01-03 
11:12:50,326][model8_pretrain.py][INFO] Epoch:[0/2](166800/4588595) loss:2.761 lr:0.0000100 epoch_Time:27915.0min: [2024-01-03 11:12:50,330][model8_pretrain.py][INFO] Epoch:[0/2](166800/4588595) loss:3.161 lr:0.0000100 epoch_Time:27915.0min: [2024-01-03 11:12:50,331][model8_pretrain.py][INFO] Epoch:[0/2](166800/4588595) loss:3.019 lr:0.0000100 epoch_Time:27915.0min: [2024-01-03 11:12:50,331][model8_pretrain.py][INFO] Epoch:[0/2](166800/4588595) loss:2.372 lr:0.0000100 epoch_Time:27915.0min: [2024-01-03 11:13:29,197][model8_pretrain.py][INFO] Epoch:[0/2](166900/4588595) loss:3.054 lr:0.0000100 epoch_Time:27916.0min: [2024-01-03 11:13:29,197][model8_pretrain.py][INFO] Epoch:[0/2](166900/4588595) loss:3.028 lr:0.0000100 epoch_Time:27916.0min: [2024-01-03 11:13:29,197][model8_pretrain.py][INFO] Epoch:[0/2](166900/4588595) loss:3.415 lr:0.0000100 epoch_Time:27916.0min: [2024-01-03 11:13:29,197][model8_pretrain.py][INFO] Epoch:[0/2](166900/4588595) loss:2.905 lr:0.0000100 epoch_Time:27916.0min: [2024-01-03 11:13:29,197][model8_pretrain.py][INFO] Epoch:[0/2](166900/4588595) loss:2.925 lr:0.0000100 epoch_Time:27916.0min: [2024-01-03 11:13:29,197][model8_pretrain.py][INFO] Epoch:[0/2](166900/4588595) loss:2.913 lr:0.0000100 epoch_Time:27916.0min: [2024-01-03 11:13:29,197][model8_pretrain.py][INFO] Epoch:[0/2](166900/4588595) loss:3.339 lr:0.0000100 epoch_Time:27916.0min: [2024-01-03 11:13:29,197][model8_pretrain.py][INFO] Epoch:[0/2](166900/4588595) loss:3.240 lr:0.0000100 epoch_Time:27916.0min: [2024-01-03 11:14:06,152][model8_pretrain.py][INFO] Epoch:[0/2](167000/4588595) loss:3.112 lr:0.0000100 epoch_Time:27914.0min: [2024-01-03 11:14:06,152][model8_pretrain.py][INFO] Epoch:[0/2](167000/4588595) loss:3.164 lr:0.0000100 epoch_Time:27914.0min: [2024-01-03 11:14:06,152][model8_pretrain.py][INFO] Epoch:[0/2](167000/4588595) loss:2.492 lr:0.0000100 epoch_Time:27914.0min: [2024-01-03 11:14:06,152][model8_pretrain.py][INFO] Epoch:[0/2](167000/4588595) loss:2.958 lr:0.0000100 
epoch_Time:27914.0min: [2024-01-03 11:14:06,152][model8_pretrain.py][INFO] Epoch:[0/2](167000/4588595) loss:3.129 lr:0.0000100 epoch_Time:27914.0min: [2024-01-03 11:14:06,152][model8_pretrain.py][INFO] Epoch:[0/2](167000/4588595) loss:3.113 lr:0.0000100 epoch_Time:27914.0min: [2024-01-03 11:14:06,153][model8_pretrain.py][INFO] Epoch:[0/2](167000/4588595) loss:3.074 lr:0.0000100 epoch_Time:27914.0min: [2024-01-03 11:14:06,153][model8_pretrain.py][INFO] Epoch:[0/2](167000/4588595) loss:3.156 lr:0.0000100 epoch_Time:27914.0min: [2024-01-03 11:14:43,089][model8_pretrain.py][INFO] Epoch:[0/2](167100/4588595) loss:2.908 lr:0.0000100 epoch_Time:27914.0min: [2024-01-03 11:14:43,089][model8_pretrain.py][INFO] Epoch:[0/2](167100/4588595) loss:3.238 lr:0.0000100 epoch_Time:27914.0min: [2024-01-03 11:14:43,089][model8_pretrain.py][INFO] Epoch:[0/2](167100/4588595) loss:3.069 lr:0.0000100 epoch_Time:27914.0min: [2024-01-03 11:14:43,089][model8_pretrain.py][INFO] Epoch:[0/2](167100/4588595) loss:2.834 lr:0.0000100 epoch_Time:27914.0min: [2024-01-03 11:14:43,089][model8_pretrain.py][INFO] Epoch:[0/2](167100/4588595) loss:3.237 lr:0.0000100 epoch_Time:27914.0min: [2024-01-03 11:14:43,089][model8_pretrain.py][INFO] Epoch:[0/2](167100/4588595) loss:2.555 lr:0.0000100 epoch_Time:27914.0min: [2024-01-03 11:14:43,089][model8_pretrain.py][INFO] Epoch:[0/2](167100/4588595) loss:2.848 lr:0.0000100 epoch_Time:27914.0min: [2024-01-03 11:14:43,089][model8_pretrain.py][INFO] Epoch:[0/2](167100/4588595) loss:2.949 lr:0.0000100 epoch_Time:27914.0min: [2024-01-03 11:15:20,032][model8_pretrain.py][INFO] Epoch:[0/2](167200/4588595) loss:2.883 lr:0.0000100 epoch_Time:27912.0min: [2024-01-03 11:15:20,032][model8_pretrain.py][INFO] Epoch:[0/2](167200/4588595) loss:3.277 lr:0.0000100 epoch_Time:27912.0min: [2024-01-03 11:15:20,032][model8_pretrain.py][INFO] Epoch:[0/2](167200/4588595) loss:2.766 lr:0.0000100 epoch_Time:27912.0min: [2024-01-03 11:15:20,032][model8_pretrain.py][INFO] 
Epoch:[0/2](167200/4588595) loss:3.136 lr:0.0000100 epoch_Time:27912.0min: [2024-01-03 11:15:20,032][model8_pretrain.py][INFO] Epoch:[0/2](167200/4588595) loss:2.717 lr:0.0000100 epoch_Time:27912.0min: [2024-01-03 11:15:20,032][model8_pretrain.py][INFO] Epoch:[0/2](167200/4588595) loss:2.626 lr:0.0000100 epoch_Time:27912.0min: [2024-01-03 11:15:20,032][model8_pretrain.py][INFO] Epoch:[0/2](167200/4588595) loss:3.268 lr:0.0000100 epoch_Time:27912.0min: [2024-01-03 11:15:20,032][model8_pretrain.py][INFO] Epoch:[0/2](167200/4588595) loss:2.963 lr:0.0000100 epoch_Time:27912.0min: [2024-01-03 11:15:56,969][model8_pretrain.py][INFO] Epoch:[0/2](167300/4588595) loss:3.014 lr:0.0000100 epoch_Time:27911.0min: [2024-01-03 11:15:56,969][model8_pretrain.py][INFO] Epoch:[0/2](167300/4588595) loss:3.262 lr:0.0000100 epoch_Time:27911.0min: [2024-01-03 11:15:56,969][model8_pretrain.py][INFO] Epoch:[0/2](167300/4588595) loss:2.998 lr:0.0000100 epoch_Time:27911.0min: [2024-01-03 11:15:56,969][model8_pretrain.py][INFO] Epoch:[0/2](167300/4588595) loss:3.154 lr:0.0000100 epoch_Time:27911.0min: [2024-01-03 11:15:56,969][model8_pretrain.py][INFO] Epoch:[0/2](167300/4588595) loss:2.917 lr:0.0000100 epoch_Time:27911.0min: [2024-01-03 11:15:56,969][model8_pretrain.py][INFO] Epoch:[0/2](167300/4588595) loss:2.807 lr:0.0000100 epoch_Time:27911.0min: [2024-01-03 11:15:56,969][model8_pretrain.py][INFO] Epoch:[0/2](167300/4588595) loss:2.775 lr:0.0000100 epoch_Time:27911.0min: [2024-01-03 11:15:56,969][model8_pretrain.py][INFO] Epoch:[0/2](167300/4588595) loss:3.489 lr:0.0000100 epoch_Time:27911.0min: [2024-01-03 11:16:33,909][model8_pretrain.py][INFO] Epoch:[0/2](167400/4588595) loss:1.998 lr:0.0000100 epoch_Time:27910.0min: [2024-01-03 11:16:33,909][model8_pretrain.py][INFO] Epoch:[0/2](167400/4588595) loss:2.851 lr:0.0000100 epoch_Time:27910.0min: [2024-01-03 11:16:33,909][model8_pretrain.py][INFO] Epoch:[0/2](167400/4588595) loss:2.748 lr:0.0000100 epoch_Time:27910.0min: [2024-01-03 
11:16:33,909][model8_pretrain.py][INFO] Epoch:[0/2](167400/4588595) loss:2.685 lr:0.0000100 epoch_Time:27910.0min: [2024-01-03 11:16:33,909][model8_pretrain.py][INFO] Epoch:[0/2](167400/4588595) loss:2.917 lr:0.0000100 epoch_Time:27910.0min: [2024-01-03 11:16:33,909][model8_pretrain.py][INFO] Epoch:[0/2](167400/4588595) loss:2.861 lr:0.0000100 epoch_Time:27910.0min: [2024-01-03 11:16:33,909][model8_pretrain.py][INFO] Epoch:[0/2](167400/4588595) loss:2.993 lr:0.0000100 epoch_Time:27910.0min: [2024-01-03 11:16:33,909][model8_pretrain.py][INFO] Epoch:[0/2](167400/4588595) loss:2.920 lr:0.0000100 epoch_Time:27910.0min: [2024-01-03 11:17:10,843][model8_pretrain.py][INFO] Epoch:[0/2](167500/4588595) loss:3.343 lr:0.0000100 epoch_Time:27909.0min: [2024-01-03 11:17:10,843][model8_pretrain.py][INFO] Epoch:[0/2](167500/4588595) loss:2.867 lr:0.0000100 epoch_Time:27909.0min: [2024-01-03 11:17:10,843][model8_pretrain.py][INFO] Epoch:[0/2](167500/4588595) loss:2.979 lr:0.0000100 epoch_Time:27909.0min: [2024-01-03 11:17:10,843][model8_pretrain.py][INFO] Epoch:[0/2](167500/4588595) loss:3.213 lr:0.0000100 epoch_Time:27909.0min: [2024-01-03 11:17:10,843][model8_pretrain.py][INFO] Epoch:[0/2](167500/4588595) loss:3.001 lr:0.0000100 epoch_Time:27909.0min: [2024-01-03 11:17:10,843][model8_pretrain.py][INFO] Epoch:[0/2](167500/4588595) loss:3.406 lr:0.0000100 epoch_Time:27909.0min: [2024-01-03 11:17:10,844][model8_pretrain.py][INFO] Epoch:[0/2](167500/4588595) loss:2.938 lr:0.0000100 epoch_Time:27909.0min: [2024-01-03 11:17:10,844][model8_pretrain.py][INFO] Epoch:[0/2](167500/4588595) loss:3.093 lr:0.0000100 epoch_Time:27909.0min: [2024-01-03 11:17:51,221][model8_pretrain.py][INFO] Epoch:[0/2](167600/4588595) loss:2.908 lr:0.0000100 epoch_Time:27909.0min: [2024-01-03 11:17:51,221][model8_pretrain.py][INFO] Epoch:[0/2](167600/4588595) loss:2.952 lr:0.0000100 epoch_Time:27909.0min: [2024-01-03 11:17:51,222][model8_pretrain.py][INFO] Epoch:[0/2](167600/4588595) loss:3.007 lr:0.0000100 
epoch_Time:27909.0min: [2024-01-03 11:17:51,222][model8_pretrain.py][INFO] Epoch:[0/2](167600/4588595) loss:3.100 lr:0.0000100 epoch_Time:27909.0min: [2024-01-03 11:17:51,222][model8_pretrain.py][INFO] Epoch:[0/2](167600/4588595) loss:3.086 lr:0.0000100 epoch_Time:27909.0min: [2024-01-03 11:17:51,222][model8_pretrain.py][INFO] Epoch:[0/2](167600/4588595) loss:2.935 lr:0.0000100 epoch_Time:27909.0min: [2024-01-03 11:17:51,222][model8_pretrain.py][INFO] Epoch:[0/2](167600/4588595) loss:3.303 lr:0.0000100 epoch_Time:27909.0min: [2024-01-03 11:17:51,222][model8_pretrain.py][INFO] Epoch:[0/2](167600/4588595) loss:2.903 lr:0.0000100 epoch_Time:27909.0min: [2024-01-03 11:18:33,369][model8_pretrain.py][INFO] Epoch:[0/2](167700/4588595) loss:2.926 lr:0.0000100 epoch_Time:27911.0min: [2024-01-03 11:18:33,369][model8_pretrain.py][INFO] Epoch:[0/2](167700/4588595) loss:3.348 lr:0.0000100 epoch_Time:27911.0min: [2024-01-03 11:18:33,369][model8_pretrain.py][INFO] Epoch:[0/2](167700/4588595) loss:3.207 lr:0.0000100 epoch_Time:27911.0min: [2024-01-03 11:18:33,369][model8_pretrain.py][INFO] Epoch:[0/2](167700/4588595) loss:2.362 lr:0.0000100 epoch_Time:27911.0min: [2024-01-03 11:18:33,369][model8_pretrain.py][INFO] Epoch:[0/2](167700/4588595) loss:3.146 lr:0.0000100 epoch_Time:27911.0min: [2024-01-03 11:18:33,369][model8_pretrain.py][INFO] Epoch:[0/2](167700/4588595) loss:2.818 lr:0.0000100 epoch_Time:27911.0min: [2024-01-03 11:18:33,369][model8_pretrain.py][INFO] Epoch:[0/2](167700/4588595) loss:3.247 lr:0.0000100 epoch_Time:27911.0min: [2024-01-03 11:18:33,370][model8_pretrain.py][INFO] Epoch:[0/2](167700/4588595) loss:3.233 lr:0.0000100 epoch_Time:27911.0min: [2024-01-03 11:19:10,308][model8_pretrain.py][INFO] Epoch:[0/2](167800/4588595) loss:2.810 lr:0.0000100 epoch_Time:27910.0min: [2024-01-03 11:19:10,308][model8_pretrain.py][INFO] Epoch:[0/2](167800/4588595) loss:3.108 lr:0.0000100 epoch_Time:27910.0min: [2024-01-03 11:19:10,308][model8_pretrain.py][INFO] 
Epoch:[0/2](167800/4588595) loss:2.762 lr:0.0000100 epoch_Time:27910.0min: [2024-01-03 11:19:10,308][model8_pretrain.py][INFO] Epoch:[0/2](167800/4588595) loss:2.394 lr:0.0000100 epoch_Time:27910.0min: [2024-01-03 11:19:10,308][model8_pretrain.py][INFO] Epoch:[0/2](167800/4588595) loss:2.817 lr:0.0000100 epoch_Time:27910.0min: [2024-01-03 11:19:10,308][model8_pretrain.py][INFO] Epoch:[0/2](167800/4588595) loss:3.207 lr:0.0000100 epoch_Time:27910.0min: [2024-01-03 11:19:10,308][model8_pretrain.py][INFO] Epoch:[0/2](167800/4588595) loss:3.315 lr:0.0000100 epoch_Time:27910.0min: [2024-01-03 11:19:10,308][model8_pretrain.py][INFO] Epoch:[0/2](167800/4588595) loss:3.215 lr:0.0000100 epoch_Time:27910.0min: [2024-01-03 11:19:47,248][model8_pretrain.py][INFO] Epoch:[0/2](167900/4588595) loss:2.834 lr:0.0000100 epoch_Time:27909.0min: [2024-01-03 11:19:47,248][model8_pretrain.py][INFO] Epoch:[0/2](167900/4588595) loss:2.841 lr:0.0000100 epoch_Time:27909.0min: [2024-01-03 11:19:47,248][model8_pretrain.py][INFO] Epoch:[0/2](167900/4588595) loss:3.027 lr:0.0000100 epoch_Time:27909.0min: [2024-01-03 11:19:47,248][model8_pretrain.py][INFO] Epoch:[0/2](167900/4588595) loss:2.496 lr:0.0000100 epoch_Time:27909.0min: [2024-01-03 11:19:47,248][model8_pretrain.py][INFO] Epoch:[0/2](167900/4588595) loss:2.537 lr:0.0000100 epoch_Time:27909.0min: [2024-01-03 11:19:47,248][model8_pretrain.py][INFO] Epoch:[0/2](167900/4588595) loss:2.561 lr:0.0000100 epoch_Time:27909.0min: [2024-01-03 11:19:47,248][model8_pretrain.py][INFO] Epoch:[0/2](167900/4588595) loss:2.508 lr:0.0000100 epoch_Time:27909.0min: [2024-01-03 11:19:47,248][model8_pretrain.py][INFO] Epoch:[0/2](167900/4588595) loss:3.335 lr:0.0000100 epoch_Time:27909.0min: [2024-01-03 11:20:24,187][model8_pretrain.py][INFO] Epoch:[0/2](168000/4588595) loss:2.925 lr:0.0000100 epoch_Time:27908.0min: [2024-01-03 11:20:24,187][model8_pretrain.py][INFO] Epoch:[0/2](168000/4588595) loss:2.899 lr:0.0000100 epoch_Time:27908.0min: [2024-01-03 
11:20:24,187][model8_pretrain.py][INFO] Epoch:[0/2](168000/4588595) loss:2.345 lr:0.0000100 epoch_Time:27908.0min: [2024-01-03 11:20:24,187][model8_pretrain.py][INFO] Epoch:[0/2](168000/4588595) loss:2.750 lr:0.0000100 epoch_Time:27908.0min: [2024-01-03 11:20:24,187][model8_pretrain.py][INFO] Epoch:[0/2](168000/4588595) loss:2.617 lr:0.0000100 epoch_Time:27908.0min: [2024-01-03 11:20:24,188][model8_pretrain.py][INFO] Epoch:[0/2](168000/4588595) loss:2.568 lr:0.0000100 epoch_Time:27908.0min: [2024-01-03 11:20:24,188][model8_pretrain.py][INFO] Epoch:[0/2](168000/4588595) loss:3.158 lr:0.0000100 epoch_Time:27908.0min: [2024-01-03 11:20:24,188][model8_pretrain.py][INFO] Epoch:[0/2](168000/4588595) loss:2.561 lr:0.0000100 epoch_Time:27908.0min: [2024-01-03 11:21:01,131][model8_pretrain.py][INFO] Epoch:[0/2](168100/4588595) loss:3.043 lr:0.0000100 epoch_Time:27906.0min: [2024-01-03 11:21:01,131][model8_pretrain.py][INFO] Epoch:[0/2](168100/4588595) loss:3.532 lr:0.0000100 epoch_Time:27906.0min: [2024-01-03 11:21:01,131][model8_pretrain.py][INFO] Epoch:[0/2](168100/4588595) loss:2.838 lr:0.0000100 epoch_Time:27906.0min: [2024-01-03 11:21:01,131][model8_pretrain.py][INFO] Epoch:[0/2](168100/4588595) loss:3.064 lr:0.0000100 epoch_Time:27906.0min: [2024-01-03 11:21:01,131][model8_pretrain.py][INFO] Epoch:[0/2](168100/4588595) loss:3.459 lr:0.0000100 epoch_Time:27906.0min: [2024-01-03 11:21:01,131][model8_pretrain.py][INFO] Epoch:[0/2](168100/4588595) loss:2.970 lr:0.0000100 epoch_Time:27906.0min: [2024-01-03 11:21:01,131][model8_pretrain.py][INFO] Epoch:[0/2](168100/4588595) loss:3.230 lr:0.0000100 epoch_Time:27906.0min: [2024-01-03 11:21:01,131][model8_pretrain.py][INFO] Epoch:[0/2](168100/4588595) loss:3.016 lr:0.0000100 epoch_Time:27906.0min: [2024-01-03 11:21:38,081][model8_pretrain.py][INFO] Epoch:[0/2](168200/4588595) loss:3.433 lr:0.0000100 epoch_Time:27906.0min: [2024-01-03 11:21:38,081][model8_pretrain.py][INFO] Epoch:[0/2](168200/4588595) loss:3.485 lr:0.0000100 
epoch_Time:27906.0min: [2024-01-03 11:21:38,081][model8_pretrain.py][INFO] Epoch:[0/2](168200/4588595) loss:2.878 lr:0.0000100 epoch_Time:27906.0min: [2024-01-03 11:21:38,081][model8_pretrain.py][INFO] Epoch:[0/2](168200/4588595) loss:3.011 lr:0.0000100 epoch_Time:27906.0min: [2024-01-03 11:21:38,081][model8_pretrain.py][INFO] Epoch:[0/2](168200/4588595) loss:2.750 lr:0.0000100 epoch_Time:27906.0min: [2024-01-03 11:21:38,081][model8_pretrain.py][INFO] Epoch:[0/2](168200/4588595) loss:3.263 lr:0.0000100 epoch_Time:27906.0min: [2024-01-03 11:21:38,081][model8_pretrain.py][INFO] Epoch:[0/2](168200/4588595) loss:2.922 lr:0.0000100 epoch_Time:27906.0min: [2024-01-03 11:21:38,082][model8_pretrain.py][INFO] Epoch:[0/2](168200/4588595) loss:2.640 lr:0.0000100 epoch_Time:27906.0min: [2024-01-03 11:22:15,024][model8_pretrain.py][INFO] Epoch:[0/2](168300/4588595) loss:2.729 lr:0.0000100 epoch_Time:27905.0min: [2024-01-03 11:22:15,024][model8_pretrain.py][INFO] Epoch:[0/2](168300/4588595) loss:2.292 lr:0.0000100 epoch_Time:27905.0min: [2024-01-03 11:22:15,024][model8_pretrain.py][INFO] Epoch:[0/2](168300/4588595) loss:2.813 lr:0.0000100 epoch_Time:27905.0min: [2024-01-03 11:22:15,024][model8_pretrain.py][INFO] Epoch:[0/2](168300/4588595) loss:2.777 lr:0.0000100 epoch_Time:27905.0min: [2024-01-03 11:22:15,024][model8_pretrain.py][INFO] Epoch:[0/2](168300/4588595) loss:3.411 lr:0.0000100 epoch_Time:27905.0min: [2024-01-03 11:22:15,024][model8_pretrain.py][INFO] Epoch:[0/2](168300/4588595) loss:2.698 lr:0.0000100 epoch_Time:27905.0min: [2024-01-03 11:22:15,025][model8_pretrain.py][INFO] Epoch:[0/2](168300/4588595) loss:2.625 lr:0.0000100 epoch_Time:27905.0min: [2024-01-03 11:22:15,025][model8_pretrain.py][INFO] Epoch:[0/2](168300/4588595) loss:2.977 lr:0.0000100 epoch_Time:27905.0min: [2024-01-03 11:22:55,404][model8_pretrain.py][INFO] Epoch:[0/2](168400/4588595) loss:3.082 lr:0.0000100 epoch_Time:27905.0min: [2024-01-03 11:22:55,404][model8_pretrain.py][INFO] 
Epoch:[0/2](168400/4588595) loss:3.139 lr:0.0000100 epoch_Time:27905.0min: [2024-01-03 11:22:55,404][model8_pretrain.py][INFO] Epoch:[0/2](168400/4588595) loss:3.504 lr:0.0000100 epoch_Time:27905.0min: [2024-01-03 11:22:55,404][model8_pretrain.py][INFO] Epoch:[0/2](168400/4588595) loss:2.089 lr:0.0000100 epoch_Time:27905.0min: [2024-01-03 11:22:55,405][model8_pretrain.py][INFO] Epoch:[0/2](168400/4588595) loss:2.811 lr:0.0000100 epoch_Time:27905.0min: [2024-01-03 11:22:55,404][model8_pretrain.py][INFO] Epoch:[0/2](168400/4588595) loss:3.152 lr:0.0000100 epoch_Time:27905.0min: [2024-01-03 11:22:55,405][model8_pretrain.py][INFO] Epoch:[0/2](168400/4588595) loss:2.928 lr:0.0000100 epoch_Time:27905.0min: [2024-01-03 11:22:55,405][model8_pretrain.py][INFO] Epoch:[0/2](168400/4588595) loss:2.967 lr:0.0000100 epoch_Time:27905.0min: [2024-01-03 11:23:37,592][model8_pretrain.py][INFO] Epoch:[0/2](168500/4588595) loss:2.800 lr:0.0000100 epoch_Time:27907.0min: [2024-01-03 11:23:37,592][model8_pretrain.py][INFO] Epoch:[0/2](168500/4588595) loss:3.172 lr:0.0000100 epoch_Time:27907.0min: [2024-01-03 11:23:37,592][model8_pretrain.py][INFO] Epoch:[0/2](168500/4588595) loss:2.740 lr:0.0000100 epoch_Time:27907.0min: [2024-01-03 11:23:37,592][model8_pretrain.py][INFO] Epoch:[0/2](168500/4588595) loss:2.794 lr:0.0000100 epoch_Time:27907.0min: [2024-01-03 11:23:37,592][model8_pretrain.py][INFO] Epoch:[0/2](168500/4588595) loss:3.338 lr:0.0000100 epoch_Time:27907.0min: [2024-01-03 11:23:37,592][model8_pretrain.py][INFO] Epoch:[0/2](168500/4588595) loss:3.275 lr:0.0000100 epoch_Time:27907.0min: [2024-01-03 11:23:37,592][model8_pretrain.py][INFO] Epoch:[0/2](168500/4588595) loss:2.562 lr:0.0000100 epoch_Time:27907.0min: [2024-01-03 11:23:37,592][model8_pretrain.py][INFO] Epoch:[0/2](168500/4588595) loss:3.227 lr:0.0000100 epoch_Time:27907.0min: [2024-01-03 11:24:14,526][model8_pretrain.py][INFO] Epoch:[0/2](168600/4588595) loss:2.759 lr:0.0000100 epoch_Time:27905.0min: [2024-01-03 
11:24:14,526][model8_pretrain.py][INFO] Epoch:[0/2](168600/4588595) loss:2.974 lr:0.0000100 epoch_Time:27905.0min: [2024-01-03 11:24:14,526][model8_pretrain.py][INFO] Epoch:[0/2](168600/4588595) loss:2.923 lr:0.0000100 epoch_Time:27905.0min: [2024-01-03 11:24:14,526][model8_pretrain.py][INFO] Epoch:[0/2](168600/4588595) loss:3.093 lr:0.0000100 epoch_Time:27905.0min: [2024-01-03 11:24:14,526][model8_pretrain.py][INFO] Epoch:[0/2](168600/4588595) loss:3.305 lr:0.0000100 epoch_Time:27905.0min: [2024-01-03 11:24:14,527][model8_pretrain.py][INFO] Epoch:[0/2](168600/4588595) loss:2.835 lr:0.0000100 epoch_Time:27905.0min: [2024-01-03 11:24:14,527][model8_pretrain.py][INFO] Epoch:[0/2](168600/4588595) loss:2.783 lr:0.0000100 epoch_Time:27905.0min: [2024-01-03 11:24:14,527][model8_pretrain.py][INFO] Epoch:[0/2](168600/4588595) loss:3.183 lr:0.0000100 epoch_Time:27905.0min: [2024-01-03 11:24:51,462][model8_pretrain.py][INFO] Epoch:[0/2](168700/4588595) loss:3.196 lr:0.0000100 epoch_Time:27904.0min: [2024-01-03 11:24:51,462][model8_pretrain.py][INFO] Epoch:[0/2](168700/4588595) loss:2.987 lr:0.0000100 epoch_Time:27904.0min: [2024-01-03 11:24:51,462][model8_pretrain.py][INFO] Epoch:[0/2](168700/4588595) loss:3.266 lr:0.0000100 epoch_Time:27904.0min: [2024-01-03 11:24:51,462][model8_pretrain.py][INFO] Epoch:[0/2](168700/4588595) loss:3.034 lr:0.0000100 epoch_Time:27904.0min: [2024-01-03 11:24:51,462][model8_pretrain.py][INFO] Epoch:[0/2](168700/4588595) loss:2.726 lr:0.0000100 epoch_Time:27904.0min: [2024-01-03 11:24:51,462][model8_pretrain.py][INFO] Epoch:[0/2](168700/4588595) loss:2.781 lr:0.0000100 epoch_Time:27904.0min: [2024-01-03 11:24:51,462][model8_pretrain.py][INFO] Epoch:[0/2](168700/4588595) loss:3.072 lr:0.0000100 epoch_Time:27904.0min: [2024-01-03 11:24:51,462][model8_pretrain.py][INFO] Epoch:[0/2](168700/4588595) loss:3.107 lr:0.0000100 epoch_Time:27904.0min: [2024-01-03 11:25:28,396][model8_pretrain.py][INFO] Epoch:[0/2](168800/4588595) loss:2.912 lr:0.0000100 
epoch_Time:27903.0min: [2024-01-03 11:25:28,396][model8_pretrain.py][INFO] Epoch:[0/2](168800/4588595) loss:2.716 lr:0.0000100 epoch_Time:27903.0min: [2024-01-03 11:25:28,396][model8_pretrain.py][INFO] Epoch:[0/2](168800/4588595) loss:3.057 lr:0.0000100 epoch_Time:27903.0min: [2024-01-03 11:25:28,396][model8_pretrain.py][INFO] Epoch:[0/2](168800/4588595) loss:3.114 lr:0.0000100 epoch_Time:27903.0min: [2024-01-03 11:25:28,396][model8_pretrain.py][INFO] Epoch:[0/2](168800/4588595) loss:3.112 lr:0.0000100 epoch_Time:27903.0min: [2024-01-03 11:25:28,396][model8_pretrain.py][INFO] Epoch:[0/2](168800/4588595) loss:3.032 lr:0.0000100 epoch_Time:27903.0min: [2024-01-03 11:25:28,396][model8_pretrain.py][INFO] Epoch:[0/2](168800/4588595) loss:2.678 lr:0.0000100 epoch_Time:27903.0min: [2024-01-03 11:25:28,396][model8_pretrain.py][INFO] Epoch:[0/2](168800/4588595) loss:2.537 lr:0.0000100 epoch_Time:27903.0min: [2024-01-03 11:26:05,330][model8_pretrain.py][INFO] Epoch:[0/2](168900/4588595) loss:2.680 lr:0.0000100 epoch_Time:27902.0min: [2024-01-03 11:26:05,330][model8_pretrain.py][INFO] Epoch:[0/2](168900/4588595) loss:2.980 lr:0.0000100 epoch_Time:27902.0min: [2024-01-03 11:26:05,330][model8_pretrain.py][INFO] Epoch:[0/2](168900/4588595) loss:2.600 lr:0.0000100 epoch_Time:27902.0min: [2024-01-03 11:26:05,330][model8_pretrain.py][INFO] Epoch:[0/2](168900/4588595) loss:2.982 lr:0.0000100 epoch_Time:27902.0min: [2024-01-03 11:26:05,330][model8_pretrain.py][INFO] Epoch:[0/2](168900/4588595) loss:3.322 lr:0.0000100 epoch_Time:27902.0min: [2024-01-03 11:26:05,330][model8_pretrain.py][INFO] Epoch:[0/2](168900/4588595) loss:3.007 lr:0.0000100 epoch_Time:27902.0min: [2024-01-03 11:26:05,330][model8_pretrain.py][INFO] Epoch:[0/2](168900/4588595) loss:2.919 lr:0.0000100 epoch_Time:27902.0min: [2024-01-03 11:26:05,330][model8_pretrain.py][INFO] Epoch:[0/2](168900/4588595) loss:2.698 lr:0.0000100 epoch_Time:27902.0min: [2024-01-03 11:26:42,264][model8_pretrain.py][INFO] 
Epoch:[0/2](169000/4588595) loss:2.377 lr:0.0000100 epoch_Time:27902.0min: [2024-01-03 11:26:42,264][model8_pretrain.py][INFO] Epoch:[0/2](169000/4588595) loss:3.195 lr:0.0000100 epoch_Time:27902.0min: [2024-01-03 11:26:42,264][model8_pretrain.py][INFO] Epoch:[0/2](169000/4588595) loss:3.618 lr:0.0000100 epoch_Time:27902.0min: [2024-01-03 11:26:42,264][model8_pretrain.py][INFO] Epoch:[0/2](169000/4588595) loss:2.877 lr:0.0000100 epoch_Time:27902.0min: [2024-01-03 11:26:42,264][model8_pretrain.py][INFO] Epoch:[0/2](169000/4588595) loss:2.846 lr:0.0000100 epoch_Time:27902.0min: [2024-01-03 11:26:42,264][model8_pretrain.py][INFO] Epoch:[0/2](169000/4588595) loss:2.877 lr:0.0000100 epoch_Time:27902.0min: [2024-01-03 11:26:42,264][model8_pretrain.py][INFO] Epoch:[0/2](169000/4588595) loss:3.003 lr:0.0000100 epoch_Time:27902.0min: [2024-01-03 11:26:42,264][model8_pretrain.py][INFO] Epoch:[0/2](169000/4588595) loss:2.663 lr:0.0000100 epoch_Time:27902.0min: [2024-01-03 11:27:19,199][model8_pretrain.py][INFO] Epoch:[0/2](169100/4588595) loss:2.698 lr:0.0000100 epoch_Time:27900.0min: [2024-01-03 11:27:19,199][model8_pretrain.py][INFO] Epoch:[0/2](169100/4588595) loss:3.497 lr:0.0000100 epoch_Time:27900.0min: [2024-01-03 11:27:19,199][model8_pretrain.py][INFO] Epoch:[0/2](169100/4588595) loss:3.071 lr:0.0000100 epoch_Time:27900.0min: [2024-01-03 11:27:19,199][model8_pretrain.py][INFO] Epoch:[0/2](169100/4588595) loss:2.608 lr:0.0000100 epoch_Time:27900.0min: [2024-01-03 11:27:19,199][model8_pretrain.py][INFO] Epoch:[0/2](169100/4588595) loss:2.953 lr:0.0000100 epoch_Time:27900.0min: [2024-01-03 11:27:19,199][model8_pretrain.py][INFO] Epoch:[0/2](169100/4588595) loss:3.141 lr:0.0000100 epoch_Time:27900.0min: [2024-01-03 11:27:19,199][model8_pretrain.py][INFO] Epoch:[0/2](169100/4588595) loss:2.914 lr:0.0000100 epoch_Time:27900.0min: [2024-01-03 11:27:19,200][model8_pretrain.py][INFO] Epoch:[0/2](169100/4588595) loss:3.033 lr:0.0000100 epoch_Time:27900.0min: [2024-01-03 
11:27:59,594][model8_pretrain.py][INFO] Epoch:[0/2](169200/4588595) loss:2.470 lr:0.0000100 epoch_Time:27900.0min: [2024-01-03 11:27:59,594][model8_pretrain.py][INFO] Epoch:[0/2](169200/4588595) loss:2.937 lr:0.0000100 epoch_Time:27900.0min: [2024-01-03 11:27:59,594][model8_pretrain.py][INFO] Epoch:[0/2](169200/4588595) loss:3.015 lr:0.0000100 epoch_Time:27900.0min: [2024-01-03 11:27:59,594][model8_pretrain.py][INFO] Epoch:[0/2](169200/4588595) loss:3.116 lr:0.0000100 epoch_Time:27900.0min: [2024-01-03 11:27:59,594][model8_pretrain.py][INFO] Epoch:[0/2](169200/4588595) loss:3.018 lr:0.0000100 epoch_Time:27900.0min: [2024-01-03 11:27:59,594][model8_pretrain.py][INFO] Epoch:[0/2](169200/4588595) loss:2.883 lr:0.0000100 epoch_Time:27900.0min: [2024-01-03 11:27:59,594][model8_pretrain.py][INFO] Epoch:[0/2](169200/4588595) loss:2.935 lr:0.0000100 epoch_Time:27900.0min: [2024-01-03 11:27:59,599][model8_pretrain.py][INFO] Epoch:[0/2](169200/4588595) loss:2.984 lr:0.0000100 epoch_Time:27900.0min: [2024-01-03 11:28:41,818][model8_pretrain.py][INFO] Epoch:[0/2](169300/4588595) loss:3.230 lr:0.0000100 epoch_Time:27902.0min: [2024-01-03 11:28:41,818][model8_pretrain.py][INFO] Epoch:[0/2](169300/4588595) loss:3.100 lr:0.0000100 epoch_Time:27902.0min: [2024-01-03 11:28:41,818][model8_pretrain.py][INFO] Epoch:[0/2](169300/4588595) loss:2.823 lr:0.0000100 epoch_Time:27902.0min: [2024-01-03 11:28:41,818][model8_pretrain.py][INFO] Epoch:[0/2](169300/4588595) loss:2.882 lr:0.0000100 epoch_Time:27902.0min: [2024-01-03 11:28:41,818][model8_pretrain.py][INFO] Epoch:[0/2](169300/4588595) loss:3.675 lr:0.0000100 epoch_Time:27902.0min: [2024-01-03 11:28:41,818][model8_pretrain.py][INFO] Epoch:[0/2](169300/4588595) loss:3.060 lr:0.0000100 epoch_Time:27902.0min: [2024-01-03 11:28:41,819][model8_pretrain.py][INFO] Epoch:[0/2](169300/4588595) loss:3.159 lr:0.0000100 epoch_Time:27902.0min: [2024-01-03 11:28:41,819][model8_pretrain.py][INFO] Epoch:[0/2](169300/4588595) loss:2.638 lr:0.0000100 
epoch_Time:27902.0min: [2024-01-03 11:29:18,754][model8_pretrain.py][INFO] Epoch:[0/2](169400/4588595) loss:3.126 lr:0.0000100 epoch_Time:27901.0min: [2024-01-03 11:29:18,754][model8_pretrain.py][INFO] Epoch:[0/2](169400/4588595) loss:2.688 lr:0.0000100 epoch_Time:27901.0min: [2024-01-03 11:29:18,754][model8_pretrain.py][INFO] Epoch:[0/2](169400/4588595) loss:2.850 lr:0.0000100 epoch_Time:27901.0min: [2024-01-03 11:29:18,754][model8_pretrain.py][INFO] Epoch:[0/2](169400/4588595) loss:3.249 lr:0.0000100 epoch_Time:27901.0min: [2024-01-03 11:29:18,754][model8_pretrain.py][INFO] Epoch:[0/2](169400/4588595) loss:2.937 lr:0.0000100 epoch_Time:27901.0min: [2024-01-03 11:29:18,754][model8_pretrain.py][INFO] Epoch:[0/2](169400/4588595) loss:2.777 lr:0.0000100 epoch_Time:27901.0min: [2024-01-03 11:29:18,754][model8_pretrain.py][INFO] Epoch:[0/2](169400/4588595) loss:2.955 lr:0.0000100 epoch_Time:27901.0min: [2024-01-03 11:29:18,755][model8_pretrain.py][INFO] Epoch:[0/2](169400/4588595) loss:3.095 lr:0.0000100 epoch_Time:27901.0min: [2024-01-03 11:29:55,688][model8_pretrain.py][INFO] Epoch:[0/2](169500/4588595) loss:3.269 lr:0.0000100 epoch_Time:27899.0min: [2024-01-03 11:29:55,688][model8_pretrain.py][INFO] Epoch:[0/2](169500/4588595) loss:3.175 lr:0.0000100 epoch_Time:27899.0min: [2024-01-03 11:29:55,688][model8_pretrain.py][INFO] Epoch:[0/2](169500/4588595) loss:2.896 lr:0.0000100 epoch_Time:27899.0min: [2024-01-03 11:29:55,688][model8_pretrain.py][INFO] Epoch:[0/2](169500/4588595) loss:2.915 lr:0.0000100 epoch_Time:27899.0min: [2024-01-03 11:29:55,688][model8_pretrain.py][INFO] Epoch:[0/2](169500/4588595) loss:3.537 lr:0.0000100 epoch_Time:27899.0min: [2024-01-03 11:29:55,688][model8_pretrain.py][INFO] Epoch:[0/2](169500/4588595) loss:2.541 lr:0.0000100 epoch_Time:27899.0min: [2024-01-03 11:29:55,688][model8_pretrain.py][INFO] Epoch:[0/2](169500/4588595) loss:2.800 lr:0.0000100 epoch_Time:27899.0min: [2024-01-03 11:29:55,688][model8_pretrain.py][INFO] 
Epoch:[0/2](169500/4588595) loss:2.886 lr:0.0000100 epoch_Time:27899.0min: [2024-01-03 11:30:32,628][model8_pretrain.py][INFO] Epoch:[0/2](169600/4588595) loss:3.151 lr:0.0000100 epoch_Time:27899.0min: [2024-01-03 11:30:32,628][model8_pretrain.py][INFO] Epoch:[0/2](169600/4588595) loss:3.803 lr:0.0000100 epoch_Time:27899.0min: [2024-01-03 11:30:32,628][model8_pretrain.py][INFO] Epoch:[0/2](169600/4588595) loss:3.128 lr:0.0000100 epoch_Time:27899.0min: [2024-01-03 11:30:32,628][model8_pretrain.py][INFO] Epoch:[0/2](169600/4588595) loss:2.887 lr:0.0000100 epoch_Time:27899.0min: [2024-01-03 11:30:32,628][model8_pretrain.py][INFO] Epoch:[0/2](169600/4588595) loss:3.048 lr:0.0000100 epoch_Time:27899.0min: [2024-01-03 11:30:32,628][model8_pretrain.py][INFO] Epoch:[0/2](169600/4588595) loss:2.445 lr:0.0000100 epoch_Time:27899.0min: [2024-01-03 11:30:32,628][model8_pretrain.py][INFO] Epoch:[0/2](169600/4588595) loss:3.168 lr:0.0000100 epoch_Time:27899.0min: [2024-01-03 11:30:32,629][model8_pretrain.py][INFO] Epoch:[0/2](169600/4588595) loss:2.795 lr:0.0000100 epoch_Time:27899.0min: [2024-01-03 11:31:09,557][model8_pretrain.py][INFO] Epoch:[0/2](169700/4588595) loss:2.488 lr:0.0000100 epoch_Time:27897.0min: [2024-01-03 11:31:09,558][model8_pretrain.py][INFO] Epoch:[0/2](169700/4588595) loss:2.954 lr:0.0000100 epoch_Time:27897.0min: [2024-01-03 11:31:09,557][model8_pretrain.py][INFO] Epoch:[0/2](169700/4588595) loss:2.910 lr:0.0000100 epoch_Time:27897.0min: [2024-01-03 11:31:09,558][model8_pretrain.py][INFO] Epoch:[0/2](169700/4588595) loss:2.582 lr:0.0000100 epoch_Time:27897.0min: [2024-01-03 11:31:09,558][model8_pretrain.py][INFO] Epoch:[0/2](169700/4588595) loss:2.942 lr:0.0000100 epoch_Time:27897.0min: [2024-01-03 11:31:09,558][model8_pretrain.py][INFO] Epoch:[0/2](169700/4588595) loss:2.550 lr:0.0000100 epoch_Time:27897.0min: [2024-01-03 11:31:09,558][model8_pretrain.py][INFO] Epoch:[0/2](169700/4588595) loss:3.129 lr:0.0000100 epoch_Time:27897.0min: [2024-01-03 
11:31:09,558][model8_pretrain.py][INFO] Epoch:[0/2](169700/4588595) loss:2.986 lr:0.0000100 epoch_Time:27897.0min: [2024-01-03 11:31:46,498][model8_pretrain.py][INFO] Epoch:[0/2](169800/4588595) loss:3.507 lr:0.0000100 epoch_Time:27897.0min: [2024-01-03 11:31:46,498][model8_pretrain.py][INFO] Epoch:[0/2](169800/4588595) loss:3.029 lr:0.0000100 epoch_Time:27897.0min: [2024-01-03 11:31:46,498][model8_pretrain.py][INFO] Epoch:[0/2](169800/4588595) loss:3.202 lr:0.0000100 epoch_Time:27897.0min: [2024-01-03 11:31:46,498][model8_pretrain.py][INFO] Epoch:[0/2](169800/4588595) loss:3.012 lr:0.0000100 epoch_Time:27897.0min: [2024-01-03 11:31:46,498][model8_pretrain.py][INFO] Epoch:[0/2](169800/4588595) loss:2.526 lr:0.0000100 epoch_Time:27897.0min: [2024-01-03 11:31:46,498][model8_pretrain.py][INFO] Epoch:[0/2](169800/4588595) loss:1.843 lr:0.0000100 epoch_Time:27897.0min: [2024-01-03 11:31:46,500][model8_pretrain.py][INFO] Epoch:[0/2](169800/4588595) loss:3.004 lr:0.0000100 epoch_Time:27897.0min: [2024-01-03 11:31:46,498][model8_pretrain.py][INFO] Epoch:[0/2](169800/4588595) loss:2.971 lr:0.0000100 epoch_Time:27897.0min: [2024-01-03 11:32:23,434][model8_pretrain.py][INFO] Epoch:[0/2](169900/4588595) loss:2.973 lr:0.0000100 epoch_Time:27896.0min: [2024-01-03 11:32:23,434][model8_pretrain.py][INFO] Epoch:[0/2](169900/4588595) loss:3.032 lr:0.0000100 epoch_Time:27896.0min: [2024-01-03 11:32:23,434][model8_pretrain.py][INFO] Epoch:[0/2](169900/4588595) loss:3.322 lr:0.0000100 epoch_Time:27896.0min: [2024-01-03 11:32:23,434][model8_pretrain.py][INFO] Epoch:[0/2](169900/4588595) loss:2.363 lr:0.0000100 epoch_Time:27896.0min: [2024-01-03 11:32:23,434][model8_pretrain.py][INFO] Epoch:[0/2](169900/4588595) loss:3.067 lr:0.0000100 epoch_Time:27896.0min: [2024-01-03 11:32:23,434][model8_pretrain.py][INFO] Epoch:[0/2](169900/4588595) loss:3.068 lr:0.0000100 epoch_Time:27896.0min: [2024-01-03 11:32:23,434][model8_pretrain.py][INFO] Epoch:[0/2](169900/4588595) loss:2.661 lr:0.0000100 
epoch_Time:27896.0min: [2024-01-03 11:32:23,435][model8_pretrain.py][INFO] Epoch:[0/2](169900/4588595) loss:2.757 lr:0.0000100 epoch_Time:27896.0min: [2024-01-03 11:33:00,370][model8_pretrain.py][INFO] Epoch:[0/2](170000/4588595) loss:3.198 lr:0.0000100 epoch_Time:27894.0min: [2024-01-03 11:33:00,370][model8_pretrain.py][INFO] Epoch:[0/2](170000/4588595) loss:2.736 lr:0.0000100 epoch_Time:27894.0min: [2024-01-03 11:33:00,370][model8_pretrain.py][INFO] Epoch:[0/2](170000/4588595) loss:2.619 lr:0.0000100 epoch_Time:27894.0min: [2024-01-03 11:33:00,370][model8_pretrain.py][INFO] Epoch:[0/2](170000/4588595) loss:2.878 lr:0.0000100 epoch_Time:27894.0min: [2024-01-03 11:33:00,370][model8_pretrain.py][INFO] Epoch:[0/2](170000/4588595) loss:3.119 lr:0.0000100 epoch_Time:27894.0min: [2024-01-03 11:33:00,370][model8_pretrain.py][INFO] Epoch:[0/2](170000/4588595) loss:2.591 lr:0.0000100 epoch_Time:27894.0min: [2024-01-03 11:33:00,370][model8_pretrain.py][INFO] Epoch:[0/2](170000/4588595) loss:3.359 lr:0.0000100 epoch_Time:27894.0min: [2024-01-03 11:33:00,370][model8_pretrain.py][INFO] Epoch:[0/2](170000/4588595) loss:2.974 lr:0.0000100 epoch_Time:27894.0min: [2024-01-03 11:33:45,904][model8_pretrain.py][INFO] Epoch:[0/2](170100/4588595) loss:2.816 lr:0.0000100 epoch_Time:27898.0min: [2024-01-03 11:33:45,904][model8_pretrain.py][INFO] Epoch:[0/2](170100/4588595) loss:3.215 lr:0.0000100 epoch_Time:27898.0min: [2024-01-03 11:33:45,904][model8_pretrain.py][INFO] Epoch:[0/2](170100/4588595) loss:2.921 lr:0.0000100 epoch_Time:27898.0min: [2024-01-03 11:33:45,904][model8_pretrain.py][INFO] Epoch:[0/2](170100/4588595) loss:3.649 lr:0.0000100 epoch_Time:27898.0min: [2024-01-03 11:33:45,904][model8_pretrain.py][INFO] Epoch:[0/2](170100/4588595) loss:2.462 lr:0.0000100 epoch_Time:27898.0min: [2024-01-03 11:33:45,904][model8_pretrain.py][INFO] Epoch:[0/2](170100/4588595) loss:3.055 lr:0.0000100 epoch_Time:27898.0min: [2024-01-03 11:33:45,904][model8_pretrain.py][INFO] 
Epoch:[0/2](170100/4588595) loss:3.273 lr:0.0000100 epoch_Time:27898.0min: [2024-01-03 11:33:45,904][model8_pretrain.py][INFO] Epoch:[0/2](170100/4588595) loss:3.061 lr:0.0000100 epoch_Time:27898.0min: [2024-01-03 11:34:22,831][model8_pretrain.py][INFO] Epoch:[0/2](170200/4588595) loss:2.871 lr:0.0000100 epoch_Time:27896.0min: [2024-01-03 11:34:22,831][model8_pretrain.py][INFO] Epoch:[0/2](170200/4588595) loss:2.901 lr:0.0000100 epoch_Time:27896.0min: [2024-01-03 11:34:22,831][model8_pretrain.py][INFO] Epoch:[0/2](170200/4588595) loss:3.035 lr:0.0000100 epoch_Time:27896.0min: [2024-01-03 11:34:22,831][model8_pretrain.py][INFO] Epoch:[0/2](170200/4588595) loss:2.678 lr:0.0000100 epoch_Time:27896.0min: [2024-01-03 11:34:22,831][model8_pretrain.py][INFO] Epoch:[0/2](170200/4588595) loss:2.886 lr:0.0000100 epoch_Time:27896.0min: [2024-01-03 11:34:22,831][model8_pretrain.py][INFO] Epoch:[0/2](170200/4588595) loss:2.937 lr:0.0000100 epoch_Time:27896.0min: [2024-01-03 11:34:22,831][model8_pretrain.py][INFO] Epoch:[0/2](170200/4588595) loss:2.913 lr:0.0000100 epoch_Time:27896.0min: [2024-01-03 11:34:22,831][model8_pretrain.py][INFO] Epoch:[0/2](170200/4588595) loss:3.146 lr:0.0000100 epoch_Time:27896.0min: [2024-01-03 11:34:59,775][model8_pretrain.py][INFO] Epoch:[0/2](170300/4588595) loss:3.220 lr:0.0000100 epoch_Time:27895.0min: [2024-01-03 11:34:59,775][model8_pretrain.py][INFO] Epoch:[0/2](170300/4588595) loss:2.784 lr:0.0000100 epoch_Time:27895.0min: [2024-01-03 11:34:59,775][model8_pretrain.py][INFO] Epoch:[0/2](170300/4588595) loss:3.122 lr:0.0000100 epoch_Time:27895.0min: [2024-01-03 11:34:59,775][model8_pretrain.py][INFO] Epoch:[0/2](170300/4588595) loss:2.920 lr:0.0000100 epoch_Time:27895.0min: [2024-01-03 11:34:59,775][model8_pretrain.py][INFO] Epoch:[0/2](170300/4588595) loss:2.400 lr:0.0000100 epoch_Time:27895.0min: [2024-01-03 11:34:59,775][model8_pretrain.py][INFO] Epoch:[0/2](170300/4588595) loss:2.944 lr:0.0000100 epoch_Time:27895.0min: [2024-01-03 
11:34:59,775][model8_pretrain.py][INFO] Epoch:[0/2](170300/4588595) loss:3.361 lr:0.0000100 epoch_Time:27895.0min: [2024-01-03 11:34:59,775][model8_pretrain.py][INFO] Epoch:[0/2](170300/4588595) loss:3.188 lr:0.0000100 epoch_Time:27895.0min: [2024-01-03 11:35:36,717][model8_pretrain.py][INFO] Epoch:[0/2](170400/4588595) loss:2.893 lr:0.0000100 epoch_Time:27894.0min: [2024-01-03 11:35:36,717][model8_pretrain.py][INFO] Epoch:[0/2](170400/4588595) loss:3.071 lr:0.0000100 epoch_Time:27894.0min: [2024-01-03 11:35:36,717][model8_pretrain.py][INFO] Epoch:[0/2](170400/4588595) loss:2.621 lr:0.0000100 epoch_Time:27894.0min: [2024-01-03 11:35:36,717][model8_pretrain.py][INFO] Epoch:[0/2](170400/4588595) loss:3.049 lr:0.0000100 epoch_Time:27894.0min: [2024-01-03 11:35:36,717][model8_pretrain.py][INFO] Epoch:[0/2](170400/4588595) loss:2.961 lr:0.0000100 epoch_Time:27894.0min: [2024-01-03 11:35:36,717][model8_pretrain.py][INFO] Epoch:[0/2](170400/4588595) loss:3.077 lr:0.0000100 epoch_Time:27894.0min: [2024-01-03 11:35:36,717][model8_pretrain.py][INFO] Epoch:[0/2](170400/4588595) loss:2.292 lr:0.0000100 epoch_Time:27894.0min: [2024-01-03 11:35:36,717][model8_pretrain.py][INFO] Epoch:[0/2](170400/4588595) loss:2.955 lr:0.0000100 epoch_Time:27894.0min: [2024-01-03 11:36:13,645][model8_pretrain.py][INFO] Epoch:[0/2](170500/4588595) loss:2.365 lr:0.0000100 epoch_Time:27893.0min: [2024-01-03 11:36:13,645][model8_pretrain.py][INFO] Epoch:[0/2](170500/4588595) loss:2.843 lr:0.0000100 epoch_Time:27893.0min: [2024-01-03 11:36:13,645][model8_pretrain.py][INFO] Epoch:[0/2](170500/4588595) loss:3.164 lr:0.0000100 epoch_Time:27893.0min: [2024-01-03 11:36:13,645][model8_pretrain.py][INFO] Epoch:[0/2](170500/4588595) loss:3.136 lr:0.0000100 epoch_Time:27893.0min: [2024-01-03 11:36:13,645][model8_pretrain.py][INFO] Epoch:[0/2](170500/4588595) loss:2.899 lr:0.0000100 epoch_Time:27893.0min: [2024-01-03 11:36:13,645][model8_pretrain.py][INFO] Epoch:[0/2](170500/4588595) loss:2.652 lr:0.0000100 
epoch_Time:27893.0min: [2024-01-03 11:36:13,645][model8_pretrain.py][INFO] Epoch:[0/2](170500/4588595) loss:2.189 lr:0.0000100 epoch_Time:27893.0min: [2024-01-03 11:36:13,645][model8_pretrain.py][INFO] Epoch:[0/2](170500/4588595) loss:2.791 lr:0.0000100 epoch_Time:27893.0min: [2024-01-03 11:36:50,580][model8_pretrain.py][INFO] Epoch:[0/2](170600/4588595) loss:3.291 lr:0.0000100 epoch_Time:27892.0min: [2024-01-03 11:36:50,580][model8_pretrain.py][INFO] Epoch:[0/2](170600/4588595) loss:3.151 lr:0.0000100 epoch_Time:27892.0min: [2024-01-03 11:36:50,580][model8_pretrain.py][INFO] Epoch:[0/2](170600/4588595) loss:2.951 lr:0.0000100 epoch_Time:27892.0min: [2024-01-03 11:36:50,581][model8_pretrain.py][INFO] Epoch:[0/2](170600/4588595) loss:3.243 lr:0.0000100 epoch_Time:27892.0min: [2024-01-03 11:36:50,580][model8_pretrain.py][INFO] Epoch:[0/2](170600/4588595) loss:2.937 lr:0.0000100 epoch_Time:27892.0min: [2024-01-03 11:36:50,581][model8_pretrain.py][INFO] Epoch:[0/2](170600/4588595) loss:2.979 lr:0.0000100 epoch_Time:27892.0min: [2024-01-03 11:36:50,581][model8_pretrain.py][INFO] Epoch:[0/2](170600/4588595) loss:2.767 lr:0.0000100 epoch_Time:27892.0min: [2024-01-03 11:36:50,581][model8_pretrain.py][INFO] Epoch:[0/2](170600/4588595) loss:3.189 lr:0.0000100 epoch_Time:27892.0min: [2024-01-03 11:37:27,503][model8_pretrain.py][INFO] Epoch:[0/2](170700/4588595) loss:3.022 lr:0.0000100 epoch_Time:27891.0min: [2024-01-03 11:37:27,504][model8_pretrain.py][INFO] Epoch:[0/2](170700/4588595) loss:3.353 lr:0.0000100 epoch_Time:27891.0min: [2024-01-03 11:37:27,504][model8_pretrain.py][INFO] Epoch:[0/2](170700/4588595) loss:3.194 lr:0.0000100 epoch_Time:27891.0min: [2024-01-03 11:37:27,504][model8_pretrain.py][INFO] Epoch:[0/2](170700/4588595) loss:2.658 lr:0.0000100 epoch_Time:27891.0min: [2024-01-03 11:37:27,504][model8_pretrain.py][INFO] Epoch:[0/2](170700/4588595) loss:2.758 lr:0.0000100 epoch_Time:27891.0min: [2024-01-03 11:37:27,504][model8_pretrain.py][INFO] 
Epoch:[0/2](170700/4588595) loss:3.284 lr:0.0000100 epoch_Time:27891.0min: [2024-01-03 11:37:27,504][model8_pretrain.py][INFO] Epoch:[0/2](170700/4588595) loss:3.077 lr:0.0000100 epoch_Time:27891.0min: [2024-01-03 11:37:27,504][model8_pretrain.py][INFO] Epoch:[0/2](170700/4588595) loss:2.870 lr:0.0000100 epoch_Time:27891.0min: [2024-01-03 11:38:04,426][model8_pretrain.py][INFO] Epoch:[0/2](170800/4588595) loss:2.526 lr:0.0000100 epoch_Time:27890.0min: [2024-01-03 11:38:04,426][model8_pretrain.py][INFO] Epoch:[0/2](170800/4588595) loss:2.788 lr:0.0000100 epoch_Time:27890.0min: [2024-01-03 11:38:04,426][model8_pretrain.py][INFO] Epoch:[0/2](170800/4588595) loss:3.254 lr:0.0000100 epoch_Time:27890.0min: [2024-01-03 11:38:04,426][model8_pretrain.py][INFO] Epoch:[0/2](170800/4588595) loss:2.976 lr:0.0000100 epoch_Time:27890.0min: [2024-01-03 11:38:04,426][model8_pretrain.py][INFO] Epoch:[0/2](170800/4588595) loss:3.025 lr:0.0000100 epoch_Time:27890.0min: [2024-01-03 11:38:04,426][model8_pretrain.py][INFO] Epoch:[0/2](170800/4588595) loss:2.620 lr:0.0000100 epoch_Time:27890.0min: [2024-01-03 11:38:04,426][model8_pretrain.py][INFO] Epoch:[0/2](170800/4588595) loss:2.924 lr:0.0000100 epoch_Time:27890.0min: [2024-01-03 11:38:04,426][model8_pretrain.py][INFO] Epoch:[0/2](170800/4588595) loss:2.943 lr:0.0000100 epoch_Time:27890.0min: [2024-01-03 11:38:49,873][model8_pretrain.py][INFO] Epoch:[0/2](170900/4588595) loss:2.937 lr:0.0000100 epoch_Time:27892.0min: [2024-01-03 11:38:49,873][model8_pretrain.py][INFO] Epoch:[0/2](170900/4588595) loss:2.906 lr:0.0000100 epoch_Time:27892.0min: [2024-01-03 11:38:49,873][model8_pretrain.py][INFO] Epoch:[0/2](170900/4588595) loss:3.326 lr:0.0000100 epoch_Time:27892.0min: [2024-01-03 11:38:49,873][model8_pretrain.py][INFO] Epoch:[0/2](170900/4588595) loss:2.694 lr:0.0000100 epoch_Time:27892.0min: [2024-01-03 11:38:49,873][model8_pretrain.py][INFO] Epoch:[0/2](170900/4588595) loss:2.811 lr:0.0000100 epoch_Time:27892.0min: [2024-01-03 
11:38:49,873][model8_pretrain.py][INFO] Epoch:[0/2](170900/4588595) loss:2.738 lr:0.0000100 epoch_Time:27892.0min: [2024-01-03 11:38:49,873][model8_pretrain.py][INFO] Epoch:[0/2](170900/4588595) loss:2.868 lr:0.0000100 epoch_Time:27892.0min: [2024-01-03 11:38:49,873][model8_pretrain.py][INFO] Epoch:[0/2](170900/4588595) loss:2.753 lr:0.0000100 epoch_Time:27892.0min: [2024-01-03 11:39:26,828][model8_pretrain.py][INFO] Epoch:[0/2](171000/4588595) loss:3.194 lr:0.0000100 epoch_Time:27892.0min: [2024-01-03 11:39:26,828][model8_pretrain.py][INFO] Epoch:[0/2](171000/4588595) loss:2.802 lr:0.0000100 epoch_Time:27892.0min: [2024-01-03 11:39:26,828][model8_pretrain.py][INFO] Epoch:[0/2](171000/4588595) loss:2.923 lr:0.0000100 epoch_Time:27892.0min: [2024-01-03 11:39:26,828][model8_pretrain.py][INFO] Epoch:[0/2](171000/4588595) loss:3.131 lr:0.0000100 epoch_Time:27892.0min: [2024-01-03 11:39:26,829][model8_pretrain.py][INFO] Epoch:[0/2](171000/4588595) loss:3.059 lr:0.0000100 epoch_Time:27892.0min: [2024-01-03 11:39:26,828][model8_pretrain.py][INFO] Epoch:[0/2](171000/4588595) loss:3.164 lr:0.0000100 epoch_Time:27892.0min: [2024-01-03 11:39:26,828][model8_pretrain.py][INFO] Epoch:[0/2](171000/4588595) loss:2.869 lr:0.0000100 epoch_Time:27892.0min: [2024-01-03 11:39:26,829][model8_pretrain.py][INFO] Epoch:[0/2](171000/4588595) loss:3.112 lr:0.0000100 epoch_Time:27892.0min: [2024-01-03 11:40:03,770][model8_pretrain.py][INFO] Epoch:[0/2](171100/4588595) loss:3.218 lr:0.0000100 epoch_Time:27890.0min: [2024-01-03 11:40:03,770][model8_pretrain.py][INFO] Epoch:[0/2](171100/4588595) loss:2.949 lr:0.0000100 epoch_Time:27890.0min: [2024-01-03 11:40:03,770][model8_pretrain.py][INFO] Epoch:[0/2](171100/4588595) loss:3.330 lr:0.0000100 epoch_Time:27890.0min: [2024-01-03 11:40:03,770][model8_pretrain.py][INFO] Epoch:[0/2](171100/4588595) loss:2.997 lr:0.0000100 epoch_Time:27890.0min: [2024-01-03 11:40:03,770][model8_pretrain.py][INFO] Epoch:[0/2](171100/4588595) loss:2.587 lr:0.0000100 
epoch_Time:27890.0min: [2024-01-03 11:40:03,770][model8_pretrain.py][INFO] Epoch:[0/2](171100/4588595) loss:2.848 lr:0.0000100 epoch_Time:27890.0min: [2024-01-03 11:40:03,770][model8_pretrain.py][INFO] Epoch:[0/2](171100/4588595) loss:3.254 lr:0.0000100 epoch_Time:27890.0min: [2024-01-03 11:40:03,770][model8_pretrain.py][INFO] Epoch:[0/2](171100/4588595) loss:2.761 lr:0.0000100 epoch_Time:27890.0min: [2024-01-03 11:40:40,698][model8_pretrain.py][INFO] Epoch:[0/2](171200/4588595) loss:3.306 lr:0.0000100 epoch_Time:27890.0min: [2024-01-03 11:40:40,698][model8_pretrain.py][INFO] Epoch:[0/2](171200/4588595) loss:2.721 lr:0.0000100 epoch_Time:27890.0min: [2024-01-03 11:40:40,698][model8_pretrain.py][INFO] Epoch:[0/2](171200/4588595) loss:2.718 lr:0.0000100 epoch_Time:27890.0min: [2024-01-03 11:40:40,698][model8_pretrain.py][INFO] Epoch:[0/2](171200/4588595) loss:2.860 lr:0.0000100 epoch_Time:27890.0min: [2024-01-03 11:40:40,698][model8_pretrain.py][INFO] Epoch:[0/2](171200/4588595) loss:2.897 lr:0.0000100 epoch_Time:27890.0min: [2024-01-03 11:40:40,698][model8_pretrain.py][INFO] Epoch:[0/2](171200/4588595) loss:3.459 lr:0.0000100 epoch_Time:27890.0min: [2024-01-03 11:40:40,698][model8_pretrain.py][INFO] Epoch:[0/2](171200/4588595) loss:3.461 lr:0.0000100 epoch_Time:27890.0min: [2024-01-03 11:40:40,699][model8_pretrain.py][INFO] Epoch:[0/2](171200/4588595) loss:2.766 lr:0.0000100 epoch_Time:27890.0min: [2024-01-03 11:41:17,628][model8_pretrain.py][INFO] Epoch:[0/2](171300/4588595) loss:3.368 lr:0.0000100 epoch_Time:27888.0min: [2024-01-03 11:41:17,628][model8_pretrain.py][INFO] Epoch:[0/2](171300/4588595) loss:3.222 lr:0.0000100 epoch_Time:27888.0min: [2024-01-03 11:41:17,628][model8_pretrain.py][INFO] Epoch:[0/2](171300/4588595) loss:3.207 lr:0.0000100 epoch_Time:27888.0min: [2024-01-03 11:41:17,628][model8_pretrain.py][INFO] Epoch:[0/2](171300/4588595) loss:3.105 lr:0.0000100 epoch_Time:27888.0min: [2024-01-03 11:41:17,628][model8_pretrain.py][INFO] 
Epoch:[0/2](171300/4588595) loss:3.013 lr:0.0000100 epoch_Time:27888.0min: [2024-01-03 11:41:17,628][model8_pretrain.py][INFO] Epoch:[0/2](171300/4588595) loss:3.083 lr:0.0000100 epoch_Time:27888.0min: [2024-01-03 11:41:17,628][model8_pretrain.py][INFO] Epoch:[0/2](171300/4588595) loss:3.165 lr:0.0000100 epoch_Time:27888.0min: [2024-01-03 11:41:17,628][model8_pretrain.py][INFO] Epoch:[0/2](171300/4588595) loss:3.389 lr:0.0000100 epoch_Time:27888.0min: [2024-01-03 11:41:54,519][model8_pretrain.py][INFO] Epoch:[0/2](171400/4588595) loss:2.817 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:41:54,519][model8_pretrain.py][INFO] Epoch:[0/2](171400/4588595) loss:2.750 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:41:54,519][model8_pretrain.py][INFO] Epoch:[0/2](171400/4588595) loss:3.122 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:41:54,519][model8_pretrain.py][INFO] Epoch:[0/2](171400/4588595) loss:3.025 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:41:54,519][model8_pretrain.py][INFO] Epoch:[0/2](171400/4588595) loss:3.005 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:41:54,519][model8_pretrain.py][INFO] Epoch:[0/2](171400/4588595) loss:3.178 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:41:54,519][model8_pretrain.py][INFO] Epoch:[0/2](171400/4588595) loss:2.935 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:41:54,520][model8_pretrain.py][INFO] Epoch:[0/2](171400/4588595) loss:3.117 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:42:31,448][model8_pretrain.py][INFO] Epoch:[0/2](171500/4588595) loss:2.769 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:42:31,448][model8_pretrain.py][INFO] Epoch:[0/2](171500/4588595) loss:2.685 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:42:31,448][model8_pretrain.py][INFO] Epoch:[0/2](171500/4588595) loss:2.721 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:42:31,448][model8_pretrain.py][INFO] Epoch:[0/2](171500/4588595) loss:3.237 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 
11:42:31,448][model8_pretrain.py][INFO] Epoch:[0/2](171500/4588595) loss:3.100 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:42:31,448][model8_pretrain.py][INFO] Epoch:[0/2](171500/4588595) loss:3.279 lr:0.0000100 epoch_Time:27886.0min: [2024-01-03 11:42:31,448][model8_pretrain.py][INFO] Epoch:[0/2](171500/4588595) loss:2.443 lr:0.0000100 epoch_Time:27886.0min: [2024-01-03 11:42:31,453][model8_pretrain.py][INFO] Epoch:[0/2](171500/4588595) loss:2.573 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:43:08,371][model8_pretrain.py][INFO] Epoch:[0/2](171600/4588595) loss:2.222 lr:0.0000100 epoch_Time:27885.0min: [2024-01-03 11:43:08,371][model8_pretrain.py][INFO] Epoch:[0/2](171600/4588595) loss:2.556 lr:0.0000100 epoch_Time:27885.0min: [2024-01-03 11:43:08,371][model8_pretrain.py][INFO] Epoch:[0/2](171600/4588595) loss:2.577 lr:0.0000100 epoch_Time:27885.0min: [2024-01-03 11:43:08,371][model8_pretrain.py][INFO] Epoch:[0/2](171600/4588595) loss:2.897 lr:0.0000100 epoch_Time:27885.0min: [2024-01-03 11:43:08,371][model8_pretrain.py][INFO] Epoch:[0/2](171600/4588595) loss:3.094 lr:0.0000100 epoch_Time:27885.0min: [2024-01-03 11:43:08,371][model8_pretrain.py][INFO] Epoch:[0/2](171600/4588595) loss:2.893 lr:0.0000100 epoch_Time:27885.0min: [2024-01-03 11:43:08,371][model8_pretrain.py][INFO] Epoch:[0/2](171600/4588595) loss:3.161 lr:0.0000100 epoch_Time:27885.0min: [2024-01-03 11:43:08,371][model8_pretrain.py][INFO] Epoch:[0/2](171600/4588595) loss:3.119 lr:0.0000100 epoch_Time:27885.0min: [2024-01-03 11:43:53,686][model8_pretrain.py][INFO] Epoch:[0/2](171700/4588595) loss:3.394 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:43:53,686][model8_pretrain.py][INFO] Epoch:[0/2](171700/4588595) loss:3.169 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:43:53,686][model8_pretrain.py][INFO] Epoch:[0/2](171700/4588595) loss:2.577 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:43:53,686][model8_pretrain.py][INFO] Epoch:[0/2](171700/4588595) loss:3.243 lr:0.0000100 
epoch_Time:27887.0min: [2024-01-03 11:43:53,686][model8_pretrain.py][INFO] Epoch:[0/2](171700/4588595) loss:3.379 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:43:53,686][model8_pretrain.py][INFO] Epoch:[0/2](171700/4588595) loss:2.628 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:43:53,686][model8_pretrain.py][INFO] Epoch:[0/2](171700/4588595) loss:2.977 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:43:53,686][model8_pretrain.py][INFO] Epoch:[0/2](171700/4588595) loss:2.345 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:44:30,622][model8_pretrain.py][INFO] Epoch:[0/2](171800/4588595) loss:2.178 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:44:30,622][model8_pretrain.py][INFO] Epoch:[0/2](171800/4588595) loss:2.813 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:44:30,622][model8_pretrain.py][INFO] Epoch:[0/2](171800/4588595) loss:3.067 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:44:30,622][model8_pretrain.py][INFO] Epoch:[0/2](171800/4588595) loss:3.270 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:44:30,622][model8_pretrain.py][INFO] Epoch:[0/2](171800/4588595) loss:2.996 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:44:30,622][model8_pretrain.py][INFO] Epoch:[0/2](171800/4588595) loss:2.934 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:44:30,622][model8_pretrain.py][INFO] Epoch:[0/2](171800/4588595) loss:3.126 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:44:30,623][model8_pretrain.py][INFO] Epoch:[0/2](171800/4588595) loss:2.786 lr:0.0000100 epoch_Time:27887.0min: [2024-01-03 11:45:07,559][model8_pretrain.py][INFO] Epoch:[0/2](171900/4588595) loss:2.485 lr:0.0000100 epoch_Time:27886.0min: [2024-01-03 11:45:07,559][model8_pretrain.py][INFO] Epoch:[0/2](171900/4588595) loss:2.917 lr:0.0000100 epoch_Time:27886.0min: [2024-01-03 11:45:07,559][model8_pretrain.py][INFO] Epoch:[0/2](171900/4588595) loss:2.746 lr:0.0000100 epoch_Time:27886.0min: [2024-01-03 11:45:07,559][model8_pretrain.py][INFO] 
Epoch:[0/2](171900/4588595) loss:2.780 lr:0.0000100 epoch_Time:27886.0min: [2024-01-03 11:45:07,559][model8_pretrain.py][INFO] Epoch:[0/2](171900/4588595) loss:2.574 lr:0.0000100 epoch_Time:27886.0min: [2024-01-03 11:45:07,559][model8_pretrain.py][INFO] Epoch:[0/2](171900/4588595) loss:3.362 lr:0.0000100 epoch_Time:27886.0min: [2024-01-03 11:45:07,559][model8_pretrain.py][INFO] Epoch:[0/2](171900/4588595) loss:2.771 lr:0.0000100 epoch_Time:27886.0min: [2024-01-03 11:45:07,559][model8_pretrain.py][INFO] Epoch:[0/2](171900/4588595) loss:2.971 lr:0.0000100 epoch_Time:27886.0min: [2024-01-03 11:45:44,494][model8_pretrain.py][INFO] Epoch:[0/2](172000/4588595) loss:2.542 lr:0.0000100 epoch_Time:27885.0min: [2024-01-03 11:45:44,494][model8_pretrain.py][INFO] Epoch:[0/2](172000/4588595) loss:2.975 lr:0.0000100 epoch_Time:27885.0min: [2024-01-03 11:45:44,494][model8_pretrain.py][INFO] Epoch:[0/2](172000/4588595) loss:3.244 lr:0.0000100 epoch_Time:27885.0min: [2024-01-03 11:45:44,494][model8_pretrain.py][INFO] Epoch:[0/2](172000/4588595) loss:3.336 lr:0.0000100 epoch_Time:27885.0min: [2024-01-03 11:45:44,494][model8_pretrain.py][INFO] Epoch:[0/2](172000/4588595) loss:2.760 lr:0.0000100 epoch_Time:27885.0min: [2024-01-03 11:45:44,494][model8_pretrain.py][INFO] Epoch:[0/2](172000/4588595) loss:2.911 lr:0.0000100 epoch_Time:27885.0min: [2024-01-03 11:45:44,494][model8_pretrain.py][INFO] Epoch:[0/2](172000/4588595) loss:3.085 lr:0.0000100 epoch_Time:27885.0min: [2024-01-03 11:45:44,494][model8_pretrain.py][INFO] Epoch:[0/2](172000/4588595) loss:3.165 lr:0.0000100 epoch_Time:27885.0min: [2024-01-03 11:46:21,431][model8_pretrain.py][INFO] Epoch:[0/2](172100/4588595) loss:2.784 lr:0.0000100 epoch_Time:27884.0min: [2024-01-03 11:46:21,431][model8_pretrain.py][INFO] Epoch:[0/2](172100/4588595) loss:2.621 lr:0.0000100 epoch_Time:27884.0min: [2024-01-03 11:46:21,431][model8_pretrain.py][INFO] Epoch:[0/2](172100/4588595) loss:2.787 lr:0.0000100 epoch_Time:27884.0min: [2024-01-03 
11:46:21,431][model8_pretrain.py][INFO] Epoch:[0/2](172100/4588595) loss:2.736 lr:0.0000100 epoch_Time:27884.0min: [2024-01-03 11:46:21,431][model8_pretrain.py][INFO] Epoch:[0/2](172100/4588595) loss:3.272 lr:0.0000100 epoch_Time:27884.0min: [2024-01-03 11:46:21,431][model8_pretrain.py][INFO] Epoch:[0/2](172100/4588595) loss:2.919 lr:0.0000100 epoch_Time:27884.0min: [2024-01-03 11:46:21,431][model8_pretrain.py][INFO] Epoch:[0/2](172100/4588595) loss:3.086 lr:0.0000100 epoch_Time:27884.0min: [2024-01-03 11:46:21,431][model8_pretrain.py][INFO] Epoch:[0/2](172100/4588595) loss:2.982 lr:0.0000100 epoch_Time:27884.0min: [2024-01-03 11:46:58,359][model8_pretrain.py][INFO] Epoch:[0/2](172200/4588595) loss:3.404 lr:0.0000100 epoch_Time:27882.0min: [2024-01-03 11:46:58,359][model8_pretrain.py][INFO] Epoch:[0/2](172200/4588595) loss:3.412 lr:0.0000100 epoch_Time:27882.0min: [2024-01-03 11:46:58,359][model8_pretrain.py][INFO] Epoch:[0/2](172200/4588595) loss:3.020 lr:0.0000100 epoch_Time:27882.0min: [2024-01-03 11:46:58,360][model8_pretrain.py][INFO] Epoch:[0/2](172200/4588595) loss:3.339 lr:0.0000100 epoch_Time:27882.0min: [2024-01-03 11:46:58,360][model8_pretrain.py][INFO] Epoch:[0/2](172200/4588595) loss:2.775 lr:0.0000100 epoch_Time:27882.0min: [2024-01-03 11:46:58,359][model8_pretrain.py][INFO] Epoch:[0/2](172200/4588595) loss:2.726 lr:0.0000100 epoch_Time:27882.0min: [2024-01-03 11:46:58,360][model8_pretrain.py][INFO] Epoch:[0/2](172200/4588595) loss:3.297 lr:0.0000100 epoch_Time:27882.0min: [2024-01-03 11:46:58,360][model8_pretrain.py][INFO] Epoch:[0/2](172200/4588595) loss:2.296 lr:0.0000100 epoch_Time:27882.0min: [2024-01-03 11:47:35,285][model8_pretrain.py][INFO] Epoch:[0/2](172300/4588595) loss:2.473 lr:0.0000100 epoch_Time:27882.0min: [2024-01-03 11:47:35,285][model8_pretrain.py][INFO] Epoch:[0/2](172300/4588595) loss:3.205 lr:0.0000100 epoch_Time:27882.0min: [2024-01-03 11:47:35,285][model8_pretrain.py][INFO] Epoch:[0/2](172300/4588595) loss:2.464 lr:0.0000100 
epoch_Time:27882.0min: [2024-01-03 11:47:35,285][model8_pretrain.py][INFO] Epoch:[0/2](172300/4588595) loss:3.160 lr:0.0000100 epoch_Time:27882.0min: [2024-01-03 11:47:35,286][model8_pretrain.py][INFO] Epoch:[0/2](172300/4588595) loss:2.388 lr:0.0000100 epoch_Time:27882.0min: [2024-01-03 11:47:35,286][model8_pretrain.py][INFO] Epoch:[0/2](172300/4588595) loss:2.747 lr:0.0000100 epoch_Time:27882.0min: [2024-01-03 11:47:35,286][model8_pretrain.py][INFO] Epoch:[0/2](172300/4588595) loss:2.494 lr:0.0000100 epoch_Time:27882.0min: [2024-01-03 11:47:35,286][model8_pretrain.py][INFO] Epoch:[0/2](172300/4588595) loss:3.193 lr:0.0000100 epoch_Time:27882.0min: [2024-01-03 11:48:12,214][model8_pretrain.py][INFO] Epoch:[0/2](172400/4588595) loss:3.052 lr:0.0000100 epoch_Time:27880.0min: [2024-01-03 11:48:12,214][model8_pretrain.py][INFO] Epoch:[0/2](172400/4588595) loss:3.229 lr:0.0000100 epoch_Time:27880.0min: [2024-01-03 11:48:12,214][model8_pretrain.py][INFO] Epoch:[0/2](172400/4588595) loss:3.372 lr:0.0000100 epoch_Time:27880.0min: [2024-01-03 11:48:12,214][model8_pretrain.py][INFO] Epoch:[0/2](172400/4588595) loss:2.787 lr:0.0000100 epoch_Time:27880.0min: [2024-01-03 11:48:12,214][model8_pretrain.py][INFO] Epoch:[0/2](172400/4588595) loss:2.790 lr:0.0000100 epoch_Time:27880.0min: [2024-01-03 11:48:12,214][model8_pretrain.py][INFO] Epoch:[0/2](172400/4588595) loss:3.070 lr:0.0000100 epoch_Time:27880.0min: [2024-01-03 11:48:12,214][model8_pretrain.py][INFO] Epoch:[0/2](172400/4588595) loss:2.785 lr:0.0000100 epoch_Time:27880.0min: [2024-01-03 11:48:12,214][model8_pretrain.py][INFO] Epoch:[0/2](172400/4588595) loss:2.818 lr:0.0000100 epoch_Time:27880.0min: [2024-01-03 11:48:57,582][model8_pretrain.py][INFO] Epoch:[0/2](172500/4588595) loss:2.644 lr:0.0000100 epoch_Time:27883.0min: [2024-01-03 11:48:57,582][model8_pretrain.py][INFO] Epoch:[0/2](172500/4588595) loss:3.151 lr:0.0000100 epoch_Time:27883.0min: [2024-01-03 11:48:57,582][model8_pretrain.py][INFO] 
Epoch:[0/2](172500/4588595) loss:3.148 lr:0.0000100 epoch_Time:27883.0min: [2024-01-03 11:48:57,582][model8_pretrain.py][INFO] Epoch:[0/2](172500/4588595) loss:3.237 lr:0.0000100 epoch_Time:27883.0min: [2024-01-03 11:48:57,582][model8_pretrain.py][INFO] Epoch:[0/2](172500/4588595) loss:2.776 lr:0.0000100 epoch_Time:27883.0min: [2024-01-03 11:48:57,582][model8_pretrain.py][INFO] Epoch:[0/2](172500/4588595) loss:2.872 lr:0.0000100 epoch_Time:27883.0min: [2024-01-03 11:48:57,582][model8_pretrain.py][INFO] Epoch:[0/2](172500/4588595) loss:2.987 lr:0.0000100 epoch_Time:27883.0min: [2024-01-03 11:48:57,583][model8_pretrain.py][INFO] Epoch:[0/2](172500/4588595) loss:2.908 lr:0.0000100 epoch_Time:27883.0min: [2024-01-03 11:49:34,479][model8_pretrain.py][INFO] Epoch:[0/2](172600/4588595) loss:2.668 lr:0.0000100 epoch_Time:27882.0min: [2024-01-03 11:49:34,479][model8_pretrain.py][INFO] Epoch:[0/2](172600/4588595) loss:2.881 lr:0.0000100 epoch_Time:27882.0min: [2024-01-03 11:49:34,479][model8_pretrain.py][INFO] Epoch:[0/2](172600/4588595) loss:2.486 lr:0.0000100 epoch_Time:27882.0min: [2024-01-03 11:49:34,479][model8_pretrain.py][INFO] Epoch:[0/2](172600/4588595) loss:2.511 lr:0.0000100 epoch_Time:27882.0min: [2024-01-03 11:49:34,479][model8_pretrain.py][INFO] Epoch:[0/2](172600/4588595) loss:3.148 lr:0.0000100 epoch_Time:27882.0min: [2024-01-03 11:49:34,479][model8_pretrain.py][INFO] Epoch:[0/2](172600/4588595) loss:2.892 lr:0.0000100 epoch_Time:27882.0min: [2024-01-03 11:49:34,479][model8_pretrain.py][INFO] Epoch:[0/2](172600/4588595) loss:3.294 lr:0.0000100 epoch_Time:27882.0min: [2024-01-03 11:49:34,479][model8_pretrain.py][INFO] Epoch:[0/2](172600/4588595) loss:2.808 lr:0.0000100 epoch_Time:27882.0min: [2024-01-03 11:50:11,429][model8_pretrain.py][INFO] Epoch:[0/2](172700/4588595) loss:2.606 lr:0.0000100 epoch_Time:27881.0min: [2024-01-03 11:50:11,429][model8_pretrain.py][INFO] Epoch:[0/2](172700/4588595) loss:3.129 lr:0.0000100 epoch_Time:27881.0min: [2024-01-03 
11:50:11,429][model8_pretrain.py][INFO] Epoch:[0/2](172700/4588595) loss:2.433 lr:0.0000100 epoch_Time:27881.0min: [2024-01-03 11:50:11,429][model8_pretrain.py][INFO] Epoch:[0/2](172700/4588595) loss:2.168 lr:0.0000100 epoch_Time:27881.0min: [2024-01-03 11:50:11,429][model8_pretrain.py][INFO] Epoch:[0/2](172700/4588595) loss:2.552 lr:0.0000100 epoch_Time:27881.0min: [2024-01-03 11:50:11,429][model8_pretrain.py][INFO] Epoch:[0/2](172700/4588595) loss:2.108 lr:0.0000100 epoch_Time:27881.0min: [2024-01-03 11:50:11,429][model8_pretrain.py][INFO] Epoch:[0/2](172700/4588595) loss:2.702 lr:0.0000100 epoch_Time:27881.0min: [2024-01-03 11:50:11,429][model8_pretrain.py][INFO] Epoch:[0/2](172700/4588595) loss:2.845 lr:0.0000100 epoch_Time:27881.0min: [2024-01-03 11:50:48,375][model8_pretrain.py][INFO] Epoch:[0/2](172800/4588595) loss:3.076 lr:0.0000100 epoch_Time:27879.0min: [2024-01-03 11:50:48,376][model8_pretrain.py][INFO] Epoch:[0/2](172800/4588595) loss:3.415 lr:0.0000100 epoch_Time:27879.0min: [2024-01-03 11:50:48,376][model8_pretrain.py][INFO] Epoch:[0/2](172800/4588595) loss:2.734 lr:0.0000100 epoch_Time:27880.0min: [2024-01-03 11:50:48,376][model8_pretrain.py][INFO] Epoch:[0/2](172800/4588595) loss:3.088 lr:0.0000100 epoch_Time:27879.0min: [2024-01-03 11:50:48,376][model8_pretrain.py][INFO] Epoch:[0/2](172800/4588595) loss:2.980 lr:0.0000100 epoch_Time:27879.0min: [2024-01-03 11:50:48,376][model8_pretrain.py][INFO] Epoch:[0/2](172800/4588595) loss:2.781 lr:0.0000100 epoch_Time:27879.0min: [2024-01-03 11:50:48,376][model8_pretrain.py][INFO] Epoch:[0/2](172800/4588595) loss:2.512 lr:0.0000100 epoch_Time:27879.0min: [2024-01-03 11:50:48,376][model8_pretrain.py][INFO] Epoch:[0/2](172800/4588595) loss:3.140 lr:0.0000100 epoch_Time:27879.0min: [2024-01-03 11:51:25,305][model8_pretrain.py][INFO] Epoch:[0/2](172900/4588595) loss:3.009 lr:0.0000100 epoch_Time:27879.0min: [2024-01-03 11:51:25,305][model8_pretrain.py][INFO] Epoch:[0/2](172900/4588595) loss:3.407 lr:0.0000100 
epoch_Time:27879.0min: [2024-01-03 11:51:25,305][model8_pretrain.py][INFO] Epoch:[0/2](172900/4588595) loss:2.793 lr:0.0000100 epoch_Time:27879.0min: [2024-01-03 11:51:25,305][model8_pretrain.py][INFO] Epoch:[0/2](172900/4588595) loss:2.709 lr:0.0000100 epoch_Time:27879.0min: [2024-01-03 11:51:25,305][model8_pretrain.py][INFO] Epoch:[0/2](172900/4588595) loss:3.131 lr:0.0000100 epoch_Time:27879.0min: [2024-01-03 11:51:25,305][model8_pretrain.py][INFO] Epoch:[0/2](172900/4588595) loss:2.995 lr:0.0000100 epoch_Time:27879.0min: [2024-01-03 11:51:25,305][model8_pretrain.py][INFO] Epoch:[0/2](172900/4588595) loss:3.401 lr:0.0000100 epoch_Time:27879.0min: [2024-01-03 11:51:25,305][model8_pretrain.py][INFO] Epoch:[0/2](172900/4588595) loss:2.898 lr:0.0000100 epoch_Time:27879.0min: [2024-01-03 11:52:02,237][model8_pretrain.py][INFO] Epoch:[0/2](173000/4588595) loss:2.833 lr:0.0000100 epoch_Time:27878.0min: [2024-01-03 11:52:02,238][model8_pretrain.py][INFO] Epoch:[0/2](173000/4588595) loss:3.182 lr:0.0000100 epoch_Time:27878.0min: [2024-01-03 11:52:02,238][model8_pretrain.py][INFO] Epoch:[0/2](173000/4588595) loss:2.852 lr:0.0000100 epoch_Time:27878.0min: [2024-01-03 11:52:02,238][model8_pretrain.py][INFO] Epoch:[0/2](173000/4588595) loss:3.252 lr:0.0000100 epoch_Time:27878.0min: [2024-01-03 11:52:02,237][model8_pretrain.py][INFO] Epoch:[0/2](173000/4588595) loss:3.293 lr:0.0000100 epoch_Time:27878.0min: [2024-01-03 11:52:02,238][model8_pretrain.py][INFO] Epoch:[0/2](173000/4588595) loss:2.775 lr:0.0000100 epoch_Time:27878.0min: [2024-01-03 11:52:02,238][model8_pretrain.py][INFO] Epoch:[0/2](173000/4588595) loss:2.412 lr:0.0000100 epoch_Time:27878.0min: [2024-01-03 11:52:02,238][model8_pretrain.py][INFO] Epoch:[0/2](173000/4588595) loss:2.450 lr:0.0000100 epoch_Time:27878.0min: [2024-01-03 11:52:39,175][model8_pretrain.py][INFO] Epoch:[0/2](173100/4588595) loss:2.317 lr:0.0000100 epoch_Time:27877.0min: [2024-01-03 11:52:39,175][model8_pretrain.py][INFO] 
Epoch:[0/2](173100/4588595) loss:3.206 lr:0.0000100 epoch_Time:27877.0min: [2024-01-03 11:52:39,175][model8_pretrain.py][INFO] Epoch:[0/2](173100/4588595) loss:3.289 lr:0.0000100 epoch_Time:27877.0min: [2024-01-03 11:52:39,175][model8_pretrain.py][INFO] Epoch:[0/2](173100/4588595) loss:3.050 lr:0.0000100 epoch_Time:27877.0min: [2024-01-03 11:52:39,175][model8_pretrain.py][INFO] Epoch:[0/2](173100/4588595) loss:2.834 lr:0.0000100 epoch_Time:27877.0min: [2024-01-03 11:52:39,175][model8_pretrain.py][INFO] Epoch:[0/2](173100/4588595) loss:3.267 lr:0.0000100 epoch_Time:27877.0min: [2024-01-03 11:52:39,176][model8_pretrain.py][INFO] Epoch:[0/2](173100/4588595) loss:2.395 lr:0.0000100 epoch_Time:27877.0min: [2024-01-03 11:52:39,176][model8_pretrain.py][INFO] Epoch:[0/2](173100/4588595) loss:2.090 lr:0.0000100 epoch_Time:27877.0min: [2024-01-03 11:53:16,104][model8_pretrain.py][INFO] Epoch:[0/2](173200/4588595) loss:3.433 lr:0.0000100 epoch_Time:27876.0min: [2024-01-03 11:53:16,104][model8_pretrain.py][INFO] Epoch:[0/2](173200/4588595) loss:3.019 lr:0.0000100 epoch_Time:27876.0min: [2024-01-03 11:53:16,105][model8_pretrain.py][INFO] Epoch:[0/2](173200/4588595) loss:3.025 lr:0.0000100 epoch_Time:27876.0min: [2024-01-03 11:53:16,105][model8_pretrain.py][INFO] Epoch:[0/2](173200/4588595) loss:3.290 lr:0.0000100 epoch_Time:27876.0min: [2024-01-03 11:53:16,105][model8_pretrain.py][INFO] Epoch:[0/2](173200/4588595) loss:2.762 lr:0.0000100 epoch_Time:27876.0min: [2024-01-03 11:53:16,105][model8_pretrain.py][INFO] Epoch:[0/2](173200/4588595) loss:2.785 lr:0.0000100 epoch_Time:27876.0min: [2024-01-03 11:53:16,105][model8_pretrain.py][INFO] Epoch:[0/2](173200/4588595) loss:3.033 lr:0.0000100 epoch_Time:27876.0min: [2024-01-03 11:53:16,105][model8_pretrain.py][INFO] Epoch:[0/2](173200/4588595) loss:2.953 lr:0.0000100 epoch_Time:27876.0min: [2024-01-03 11:54:01,553][model8_pretrain.py][INFO] Epoch:[0/2](173300/4588595) loss:3.286 lr:0.0000100 epoch_Time:27878.0min: [2024-01-03 
11:54:01,553][model8_pretrain.py][INFO] Epoch:[0/2](173300/4588595) loss:2.487 lr:0.0000100 epoch_Time:27878.0min: [2024-01-03 11:54:01,553][model8_pretrain.py][INFO] Epoch:[0/2](173300/4588595) loss:2.804 lr:0.0000100 epoch_Time:27878.0min: [2024-01-03 11:54:01,553][model8_pretrain.py][INFO] Epoch:[0/2](173300/4588595) loss:2.590 lr:0.0000100 epoch_Time:27878.0min: [2024-01-03 11:54:01,553][model8_pretrain.py][INFO] Epoch:[0/2](173300/4588595) loss:2.839 lr:0.0000100 epoch_Time:27878.0min: [2024-01-03 11:54:01,553][model8_pretrain.py][INFO] Epoch:[0/2](173300/4588595) loss:2.791 lr:0.0000100 epoch_Time:27878.0min: [2024-01-03 11:54:01,553][model8_pretrain.py][INFO] Epoch:[0/2](173300/4588595) loss:3.045 lr:0.0000100 epoch_Time:27878.0min: [2024-01-03 11:54:01,554][model8_pretrain.py][INFO] Epoch:[0/2](173300/4588595) loss:3.121 lr:0.0000100 epoch_Time:27878.0min: [2024-01-03 11:54:38,503][model8_pretrain.py][INFO] Epoch:[0/2](173400/4588595) loss:2.333 lr:0.0000100 epoch_Time:27878.0min: [2024-01-03 11:54:38,504][model8_pretrain.py][INFO] Epoch:[0/2](173400/4588595) loss:3.208 lr:0.0000100 epoch_Time:27878.0min: [2024-01-03 11:54:38,504][model8_pretrain.py][INFO] Epoch:[0/2](173400/4588595) loss:2.650 lr:0.0000100 epoch_Time:27878.0min: [2024-01-03 11:54:38,504][model8_pretrain.py][INFO] Epoch:[0/2](173400/4588595) loss:2.898 lr:0.0000100 epoch_Time:27878.0min: [2024-01-03 11:54:38,504][model8_pretrain.py][INFO] Epoch:[0/2](173400/4588595) loss:3.000 lr:0.0000100 epoch_Time:27878.0min: [2024-01-03 11:54:38,504][model8_pretrain.py][INFO] Epoch:[0/2](173400/4588595) loss:2.817 lr:0.0000100 epoch_Time:27878.0min: [2024-01-03 11:54:38,504][model8_pretrain.py][INFO] Epoch:[0/2](173400/4588595) loss:2.996 lr:0.0000100 epoch_Time:27878.0min: [2024-01-03 11:54:38,504][model8_pretrain.py][INFO] Epoch:[0/2](173400/4588595) loss:3.276 lr:0.0000100 epoch_Time:27878.0min: [2024-01-03 11:55:15,456][model8_pretrain.py][INFO] Epoch:[0/2](173500/4588595) loss:2.955 lr:0.0000100 
epoch_Time:27876.0min: [2024-01-03 11:55:15,456][model8_pretrain.py][INFO] Epoch:[0/2](173500/4588595) loss:2.762 lr:0.0000100 epoch_Time:27876.0min: [2024-01-03 11:55:15,456][model8_pretrain.py][INFO] Epoch:[0/2](173500/4588595) loss:3.095 lr:0.0000100 epoch_Time:27876.0min: [2024-01-03 11:55:15,456][model8_pretrain.py][INFO] Epoch:[0/2](173500/4588595) loss:3.340 lr:0.0000100 epoch_Time:27876.0min: [2024-01-03 11:55:15,456][model8_pretrain.py][INFO] Epoch:[0/2](173500/4588595) loss:2.719 lr:0.0000100 epoch_Time:27876.0min: [2024-01-03 11:55:15,456][model8_pretrain.py][INFO] Epoch:[0/2](173500/4588595) loss:2.673 lr:0.0000100 epoch_Time:27876.0min: [2024-01-03 11:55:15,456][model8_pretrain.py][INFO] Epoch:[0/2](173500/4588595) loss:2.651 lr:0.0000100 epoch_Time:27876.0min: [2024-01-03 11:55:15,456][model8_pretrain.py][INFO] Epoch:[0/2](173500/4588595) loss:2.663 lr:0.0000100 epoch_Time:27876.0min: [2024-01-03 11:55:52,424][model8_pretrain.py][INFO] Epoch:[0/2](173600/4588595) loss:3.174 lr:0.0000100 epoch_Time:27875.0min: [2024-01-03 11:55:52,424][model8_pretrain.py][INFO] Epoch:[0/2](173600/4588595) loss:3.303 lr:0.0000100 epoch_Time:27875.0min: [2024-01-03 11:55:52,424][model8_pretrain.py][INFO] Epoch:[0/2](173600/4588595) loss:3.180 lr:0.0000100 epoch_Time:27875.0min: [2024-01-03 11:55:52,424][model8_pretrain.py][INFO] Epoch:[0/2](173600/4588595) loss:3.312 lr:0.0000100 epoch_Time:27875.0min: [2024-01-03 11:55:52,424][model8_pretrain.py][INFO] Epoch:[0/2](173600/4588595) loss:2.989 lr:0.0000100 epoch_Time:27875.0min: [2024-01-03 11:55:52,424][model8_pretrain.py][INFO] Epoch:[0/2](173600/4588595) loss:2.725 lr:0.0000100 epoch_Time:27875.0min: [2024-01-03 11:55:52,424][model8_pretrain.py][INFO] Epoch:[0/2](173600/4588595) loss:3.100 lr:0.0000100 epoch_Time:27875.0min: [2024-01-03 11:55:52,424][model8_pretrain.py][INFO] Epoch:[0/2](173600/4588595) loss:2.991 lr:0.0000100 epoch_Time:27875.0min: [2024-01-03 11:56:29,377][model8_pretrain.py][INFO] 
Epoch:[0/2](173700/4588595) loss:3.175 lr:0.0000100 epoch_Time:27875.0min: [2024-01-03 11:56:29,377][model8_pretrain.py][INFO] Epoch:[0/2](173700/4588595) loss:3.100 lr:0.0000100 epoch_Time:27875.0min: [2024-01-03 11:56:29,377][model8_pretrain.py][INFO] Epoch:[0/2](173700/4588595) loss:2.980 lr:0.0000100 epoch_Time:27875.0min: [2024-01-03 11:56:29,377][model8_pretrain.py][INFO] Epoch:[0/2](173700/4588595) loss:3.407 lr:0.0000100 epoch_Time:27875.0min: [2024-01-03 11:56:29,377][model8_pretrain.py][INFO] Epoch:[0/2](173700/4588595) loss:3.509 lr:0.0000100 epoch_Time:27875.0min: [2024-01-03 11:56:29,377][model8_pretrain.py][INFO] Epoch:[0/2](173700/4588595) loss:2.785 lr:0.0000100 epoch_Time:27875.0min: [2024-01-03 11:56:29,377][model8_pretrain.py][INFO] Epoch:[0/2](173700/4588595) loss:2.846 lr:0.0000100 epoch_Time:27875.0min: [2024-01-03 11:56:29,377][model8_pretrain.py][INFO] Epoch:[0/2](173700/4588595) loss:2.751 lr:0.0000100 epoch_Time:27875.0min: [2024-01-03 11:57:06,320][model8_pretrain.py][INFO] Epoch:[0/2](173800/4588595) loss:2.901 lr:0.0000100 epoch_Time:27873.0min: [2024-01-03 11:57:06,320][model8_pretrain.py][INFO] Epoch:[0/2](173800/4588595) loss:2.779 lr:0.0000100 epoch_Time:27873.0min: [2024-01-03 11:57:06,320][model8_pretrain.py][INFO] Epoch:[0/2](173800/4588595) loss:3.435 lr:0.0000100 epoch_Time:27873.0min: [2024-01-03 11:57:06,320][model8_pretrain.py][INFO] Epoch:[0/2](173800/4588595) loss:2.763 lr:0.0000100 epoch_Time:27873.0min: [2024-01-03 11:57:06,320][model8_pretrain.py][INFO] Epoch:[0/2](173800/4588595) loss:2.768 lr:0.0000100 epoch_Time:27873.0min: [2024-01-03 11:57:06,320][model8_pretrain.py][INFO] Epoch:[0/2](173800/4588595) loss:3.186 lr:0.0000100 epoch_Time:27873.0min: [2024-01-03 11:57:06,320][model8_pretrain.py][INFO] Epoch:[0/2](173800/4588595) loss:3.250 lr:0.0000100 epoch_Time:27873.0min: [2024-01-03 11:57:06,321][model8_pretrain.py][INFO] Epoch:[0/2](173800/4588595) loss:2.362 lr:0.0000100 epoch_Time:27873.0min: [2024-01-03 
11:57:43,274][model8_pretrain.py][INFO] Epoch:[0/2](173900/4588595) loss:2.950 lr:0.0000100 epoch_Time:27873.0min: [2024-01-03 11:57:43,274][model8_pretrain.py][INFO] Epoch:[0/2](173900/4588595) loss:3.021 lr:0.0000100 epoch_Time:27873.0min: [2024-01-03 11:57:43,274][model8_pretrain.py][INFO] Epoch:[0/2](173900/4588595) loss:3.060 lr:0.0000100 epoch_Time:27873.0min: [2024-01-03 11:57:43,274][model8_pretrain.py][INFO] Epoch:[0/2](173900/4588595) loss:2.611 lr:0.0000100 epoch_Time:27873.0min: [2024-01-03 11:57:43,274][model8_pretrain.py][INFO] Epoch:[0/2](173900/4588595) loss:3.036 lr:0.0000100 epoch_Time:27873.0min: [2024-01-03 11:57:43,274][model8_pretrain.py][INFO] Epoch:[0/2](173900/4588595) loss:2.924 lr:0.0000100 epoch_Time:27873.0min: [2024-01-03 11:57:43,274][model8_pretrain.py][INFO] Epoch:[0/2](173900/4588595) loss:3.090 lr:0.0000100 epoch_Time:27873.0min: [2024-01-03 11:57:43,274][model8_pretrain.py][INFO] Epoch:[0/2](173900/4588595) loss:2.578 lr:0.0000100 epoch_Time:27873.0min: [2024-01-03 11:58:20,224][model8_pretrain.py][INFO] Epoch:[0/2](174000/4588595) loss:2.350 lr:0.0000100 epoch_Time:27871.0min: [2024-01-03 11:58:20,224][model8_pretrain.py][INFO] Epoch:[0/2](174000/4588595) loss:3.041 lr:0.0000100 epoch_Time:27871.0min: [2024-01-03 11:58:20,224][model8_pretrain.py][INFO] Epoch:[0/2](174000/4588595) loss:3.032 lr:0.0000100 epoch_Time:27871.0min: [2024-01-03 11:58:20,224][model8_pretrain.py][INFO] Epoch:[0/2](174000/4588595) loss:2.877 lr:0.0000100 epoch_Time:27871.0min: [2024-01-03 11:58:20,224][model8_pretrain.py][INFO] Epoch:[0/2](174000/4588595) loss:2.762 lr:0.0000100 epoch_Time:27871.0min: [2024-01-03 11:58:20,224][model8_pretrain.py][INFO] Epoch:[0/2](174000/4588595) loss:2.952 lr:0.0000100 epoch_Time:27871.0min: [2024-01-03 11:58:20,224][model8_pretrain.py][INFO] Epoch:[0/2](174000/4588595) loss:3.098 lr:0.0000100 epoch_Time:27871.0min: [2024-01-03 11:58:20,224][model8_pretrain.py][INFO] Epoch:[0/2](174000/4588595) loss:2.287 lr:0.0000100 
epoch_Time:27871.0min: [2024-01-03 11:59:05,888][model8_pretrain.py][INFO] Epoch:[0/2](174100/4588595) loss:3.000 lr:0.0000100 epoch_Time:27874.0min: [2024-01-03 11:59:05,888][model8_pretrain.py][INFO] Epoch:[0/2](174100/4588595) loss:2.921 lr:0.0000100 epoch_Time:27874.0min: [2024-01-03 11:59:05,888][model8_pretrain.py][INFO] Epoch:[0/2](174100/4588595) loss:2.593 lr:0.0000100 epoch_Time:27874.0min: [2024-01-03 11:59:05,888][model8_pretrain.py][INFO] Epoch:[0/2](174100/4588595) loss:3.041 lr:0.0000100 epoch_Time:27874.0min: [2024-01-03 11:59:05,888][model8_pretrain.py][INFO] Epoch:[0/2](174100/4588595) loss:2.986 lr:0.0000100 epoch_Time:27874.0min: [2024-01-03 11:59:05,888][model8_pretrain.py][INFO] Epoch:[0/2](174100/4588595) loss:2.781 lr:0.0000100 epoch_Time:27874.0min: [2024-01-03 11:59:05,888][model8_pretrain.py][INFO] Epoch:[0/2](174100/4588595) loss:3.039 lr:0.0000100 epoch_Time:27874.0min: [2024-01-03 11:59:05,888][model8_pretrain.py][INFO] Epoch:[0/2](174100/4588595) loss:3.171 lr:0.0000100 epoch_Time:27874.0min: [2024-01-03 11:59:42,825][model8_pretrain.py][INFO] Epoch:[0/2](174200/4588595) loss:2.698 lr:0.0000100 epoch_Time:27873.0min: [2024-01-03 11:59:42,825][model8_pretrain.py][INFO] Epoch:[0/2](174200/4588595) loss:3.038 lr:0.0000100 epoch_Time:27873.0min: [2024-01-03 11:59:42,825][model8_pretrain.py][INFO] Epoch:[0/2](174200/4588595) loss:2.650 lr:0.0000100 epoch_Time:27873.0min: [2024-01-03 11:59:42,825][model8_pretrain.py][INFO] Epoch:[0/2](174200/4588595) loss:3.189 lr:0.0000100 epoch_Time:27873.0min: [2024-01-03 11:59:42,826][model8_pretrain.py][INFO] Epoch:[0/2](174200/4588595) loss:2.945 lr:0.0000100 epoch_Time:27873.0min: [2024-01-03 11:59:42,826][model8_pretrain.py][INFO] Epoch:[0/2](174200/4588595) loss:2.711 lr:0.0000100 epoch_Time:27873.0min: [2024-01-03 11:59:42,825][model8_pretrain.py][INFO] Epoch:[0/2](174200/4588595) loss:3.371 lr:0.0000100 epoch_Time:27873.0min: [2024-01-03 11:59:42,825][model8_pretrain.py][INFO] 
Epoch:[0/2](174200/4588595) loss:2.983 lr:0.0000100 epoch_Time:27873.0min: [2024-01-03 12:00:19,768][model8_pretrain.py][INFO] Epoch:[0/2](174300/4588595) loss:3.442 lr:0.0000100 epoch_Time:27872.0min: [2024-01-03 12:00:19,768][model8_pretrain.py][INFO] Epoch:[0/2](174300/4588595) loss:3.087 lr:0.0000100 epoch_Time:27872.0min: [2024-01-03 12:00:19,768][model8_pretrain.py][INFO] Epoch:[0/2](174300/4588595) loss:3.320 lr:0.0000100 epoch_Time:27872.0min: [2024-01-03 12:00:19,768][model8_pretrain.py][INFO] Epoch:[0/2](174300/4588595) loss:3.156 lr:0.0000100 epoch_Time:27872.0min: [2024-01-03 12:00:19,768][model8_pretrain.py][INFO] Epoch:[0/2](174300/4588595) loss:2.387 lr:0.0000100 epoch_Time:27872.0min: [2024-01-03 12:00:19,768][model8_pretrain.py][INFO] Epoch:[0/2](174300/4588595) loss:2.792 lr:0.0000100 epoch_Time:27872.0min: [2024-01-03 12:00:19,768][model8_pretrain.py][INFO] Epoch:[0/2](174300/4588595) loss:2.617 lr:0.0000100 epoch_Time:27872.0min: [2024-01-03 12:00:19,768][model8_pretrain.py][INFO] Epoch:[0/2](174300/4588595) loss:3.030 lr:0.0000100 epoch_Time:27872.0min: [2024-01-03 12:00:56,702][model8_pretrain.py][INFO] Epoch:[0/2](174400/4588595) loss:2.871 lr:0.0000100 epoch_Time:27870.0min: [2024-01-03 12:00:56,702][model8_pretrain.py][INFO] Epoch:[0/2](174400/4588595) loss:2.890 lr:0.0000100 epoch_Time:27870.0min: [2024-01-03 12:00:56,702][model8_pretrain.py][INFO] Epoch:[0/2](174400/4588595) loss:3.226 lr:0.0000100 epoch_Time:27870.0min: [2024-01-03 12:00:56,702][model8_pretrain.py][INFO] Epoch:[0/2](174400/4588595) loss:2.750 lr:0.0000100 epoch_Time:27870.0min: [2024-01-03 12:00:56,702][model8_pretrain.py][INFO] Epoch:[0/2](174400/4588595) loss:3.031 lr:0.0000100 epoch_Time:27870.0min: [2024-01-03 12:00:56,702][model8_pretrain.py][INFO] Epoch:[0/2](174400/4588595) loss:3.139 lr:0.0000100 epoch_Time:27870.0min: [2024-01-03 12:00:56,702][model8_pretrain.py][INFO] Epoch:[0/2](174400/4588595) loss:2.397 lr:0.0000100 epoch_Time:27870.0min: [2024-01-03 
12:00:56,702][model8_pretrain.py][INFO] Epoch:[0/2](174400/4588595) loss:2.981 lr:0.0000100 epoch_Time:27870.0min: [2024-01-03 12:01:33,632][model8_pretrain.py][INFO] Epoch:[0/2](174500/4588595) loss:2.666 lr:0.0000100 epoch_Time:27870.0min: [2024-01-03 12:01:33,632][model8_pretrain.py][INFO] Epoch:[0/2](174500/4588595) loss:2.871 lr:0.0000100 epoch_Time:27870.0min: [2024-01-03 12:01:33,632][model8_pretrain.py][INFO] Epoch:[0/2](174500/4588595) loss:3.350 lr:0.0000100 epoch_Time:27870.0min: [2024-01-03 12:01:33,632][model8_pretrain.py][INFO] Epoch:[0/2](174500/4588595) loss:2.686 lr:0.0000100 epoch_Time:27870.0min: [2024-01-03 12:01:33,632][model8_pretrain.py][INFO] Epoch:[0/2](174500/4588595) loss:2.915 lr:0.0000100 epoch_Time:27870.0min: [2024-01-03 12:01:33,632][model8_pretrain.py][INFO] Epoch:[0/2](174500/4588595) loss:2.656 lr:0.0000100 epoch_Time:27870.0min: [2024-01-03 12:01:33,632][model8_pretrain.py][INFO] Epoch:[0/2](174500/4588595) loss:2.890 lr:0.0000100 epoch_Time:27870.0min: [2024-01-03 12:01:33,632][model8_pretrain.py][INFO] Epoch:[0/2](174500/4588595) loss:2.959 lr:0.0000100 epoch_Time:27870.0min: [2024-01-03 12:02:10,569][model8_pretrain.py][INFO] Epoch:[0/2](174600/4588595) loss:2.941 lr:0.0000100 epoch_Time:27869.0min: [2024-01-03 12:02:10,569][model8_pretrain.py][INFO] Epoch:[0/2](174600/4588595) loss:3.331 lr:0.0000100 epoch_Time:27869.0min: [2024-01-03 12:02:10,569][model8_pretrain.py][INFO] Epoch:[0/2](174600/4588595) loss:3.020 lr:0.0000100 epoch_Time:27869.0min: [2024-01-03 12:02:10,569][model8_pretrain.py][INFO] Epoch:[0/2](174600/4588595) loss:3.191 lr:0.0000100 epoch_Time:27869.0min: [2024-01-03 12:02:10,569][model8_pretrain.py][INFO] Epoch:[0/2](174600/4588595) loss:3.132 lr:0.0000100 epoch_Time:27869.0min: [2024-01-03 12:02:10,569][model8_pretrain.py][INFO] Epoch:[0/2](174600/4588595) loss:2.868 lr:0.0000100 epoch_Time:27869.0min: [2024-01-03 12:02:10,570][model8_pretrain.py][INFO] Epoch:[0/2](174600/4588595) loss:2.637 lr:0.0000100 
epoch_Time:27869.0min: [2024-01-03 12:02:10,570][model8_pretrain.py][INFO] Epoch:[0/2](174600/4588595) loss:3.181 lr:0.0000100 epoch_Time:27869.0min: [2024-01-03 12:02:47,498][model8_pretrain.py][INFO] Epoch:[0/2](174700/4588595) loss:2.767 lr:0.0000100 epoch_Time:27868.0min: [2024-01-03 12:02:47,498][model8_pretrain.py][INFO] Epoch:[0/2](174700/4588595) loss:2.920 lr:0.0000100 epoch_Time:27868.0min: [2024-01-03 12:02:47,498][model8_pretrain.py][INFO] Epoch:[0/2](174700/4588595) loss:2.960 lr:0.0000100 epoch_Time:27868.0min: [2024-01-03 12:02:47,499][model8_pretrain.py][INFO] Epoch:[0/2](174700/4588595) loss:2.968 lr:0.0000100 epoch_Time:27868.0min: [2024-01-03 12:02:47,499][model8_pretrain.py][INFO] Epoch:[0/2](174700/4588595) loss:3.162 lr:0.0000100 epoch_Time:27868.0min: [2024-01-03 12:02:47,499][model8_pretrain.py][INFO] Epoch:[0/2](174700/4588595) loss:2.881 lr:0.0000100 epoch_Time:27868.0min: [2024-01-03 12:02:47,499][model8_pretrain.py][INFO] Epoch:[0/2](174700/4588595) loss:3.251 lr:0.0000100 epoch_Time:27868.0min: [2024-01-03 12:02:47,499][model8_pretrain.py][INFO] Epoch:[0/2](174700/4588595) loss:2.758 lr:0.0000100 epoch_Time:27868.0min: [2024-01-03 12:03:24,423][model8_pretrain.py][INFO] Epoch:[0/2](174800/4588595) loss:3.107 lr:0.0000100 epoch_Time:27867.0min: [2024-01-03 12:03:24,423][model8_pretrain.py][INFO] Epoch:[0/2](174800/4588595) loss:2.992 lr:0.0000100 epoch_Time:27867.0min: [2024-01-03 12:03:24,423][model8_pretrain.py][INFO] Epoch:[0/2](174800/4588595) loss:3.331 lr:0.0000100 epoch_Time:27867.0min: [2024-01-03 12:03:24,423][model8_pretrain.py][INFO] Epoch:[0/2](174800/4588595) loss:2.847 lr:0.0000100 epoch_Time:27867.0min: [2024-01-03 12:03:24,423][model8_pretrain.py][INFO] Epoch:[0/2](174800/4588595) loss:3.058 lr:0.0000100 epoch_Time:27867.0min: [2024-01-03 12:03:24,423][model8_pretrain.py][INFO] Epoch:[0/2](174800/4588595) loss:3.328 lr:0.0000100 epoch_Time:27867.0min: [2024-01-03 12:03:24,424][model8_pretrain.py][INFO] 
Epoch:[0/2](174800/4588595) loss:3.269 lr:0.0000100 epoch_Time:27867.0min: [2024-01-03 12:03:24,425][model8_pretrain.py][INFO] Epoch:[0/2](174800/4588595) loss:2.563 lr:0.0000100 epoch_Time:27867.0min: [2024-01-03 12:04:10,061][model8_pretrain.py][INFO] Epoch:[0/2](174900/4588595) loss:2.928 lr:0.0000100 epoch_Time:27869.0min: [2024-01-03 12:04:10,061][model8_pretrain.py][INFO] Epoch:[0/2](174900/4588595) loss:2.823 lr:0.0000100 epoch_Time:27869.0min: [2024-01-03 12:04:10,061][model8_pretrain.py][INFO] Epoch:[0/2](174900/4588595) loss:3.161 lr:0.0000100 epoch_Time:27869.0min: [2024-01-03 12:04:10,061][model8_pretrain.py][INFO] Epoch:[0/2](174900/4588595) loss:3.197 lr:0.0000100 epoch_Time:27869.0min: [2024-01-03 12:04:10,061][model8_pretrain.py][INFO] Epoch:[0/2](174900/4588595) loss:2.943 lr:0.0000100 epoch_Time:27869.0min: [2024-01-03 12:04:10,061][model8_pretrain.py][INFO] Epoch:[0/2](174900/4588595) loss:3.149 lr:0.0000100 epoch_Time:27869.0min: [2024-01-03 12:04:10,061][model8_pretrain.py][INFO] Epoch:[0/2](174900/4588595) loss:2.895 lr:0.0000100 epoch_Time:27869.0min: [2024-01-03 12:04:10,062][model8_pretrain.py][INFO] Epoch:[0/2](174900/4588595) loss:3.320 lr:0.0000100 epoch_Time:27869.0min: [2024-01-03 12:04:46,992][model8_pretrain.py][INFO] Epoch:[0/2](175000/4588595) loss:3.077 lr:0.0000100 epoch_Time:27869.0min: [2024-01-03 12:04:46,992][model8_pretrain.py][INFO] Epoch:[0/2](175000/4588595) loss:3.064 lr:0.0000100 epoch_Time:27869.0min: [2024-01-03 12:04:46,992][model8_pretrain.py][INFO] Epoch:[0/2](175000/4588595) loss:3.371 lr:0.0000100 epoch_Time:27869.0min: [2024-01-03 12:04:46,992][model8_pretrain.py][INFO] Epoch:[0/2](175000/4588595) loss:2.977 lr:0.0000100 epoch_Time:27869.0min: [2024-01-03 12:04:46,992][model8_pretrain.py][INFO] Epoch:[0/2](175000/4588595) loss:2.716 lr:0.0000100 epoch_Time:27869.0min: [2024-01-03 12:04:46,992][model8_pretrain.py][INFO] Epoch:[0/2](175000/4588595) loss:2.296 lr:0.0000100 epoch_Time:27869.0min: [2024-01-03 
12:04:46,992][model8_pretrain.py][INFO] Epoch:[0/2](175000/4588595) loss:2.944 lr:0.0000100 epoch_Time:27869.0min: [2024-01-03 12:04:46,992][model8_pretrain.py][INFO] Epoch:[0/2](175000/4588595) loss:3.225 lr:0.0000100 epoch_Time:27869.0min: [2024-01-03 12:05:23,921][model8_pretrain.py][INFO] Epoch:[0/2](175100/4588595) loss:3.113 lr:0.0000100 epoch_Time:27867.0min: [2024-01-03 12:05:23,921][model8_pretrain.py][INFO] Epoch:[0/2](175100/4588595) loss:3.209 lr:0.0000100 epoch_Time:27867.0min: [2024-01-03 12:05:23,921][model8_pretrain.py][INFO] Epoch:[0/2](175100/4588595) loss:3.045 lr:0.0000100 epoch_Time:27867.0min: [2024-01-03 12:05:23,921][model8_pretrain.py][INFO] Epoch:[0/2](175100/4588595) loss:2.537 lr:0.0000100 epoch_Time:27867.0min: [2024-01-03 12:05:23,921][model8_pretrain.py][INFO] Epoch:[0/2](175100/4588595) loss:2.709 lr:0.0000100 epoch_Time:27867.0min: [2024-01-03 12:05:23,921][model8_pretrain.py][INFO] Epoch:[0/2](175100/4588595) loss:3.268 lr:0.0000100 epoch_Time:27867.0min: [2024-01-03 12:05:23,921][model8_pretrain.py][INFO] Epoch:[0/2](175100/4588595) loss:2.701 lr:0.0000100 epoch_Time:27867.0min: [2024-01-03 12:05:23,921][model8_pretrain.py][INFO] Epoch:[0/2](175100/4588595) loss:3.442 lr:0.0000100 epoch_Time:27867.0min: [2024-01-03 12:06:00,856][model8_pretrain.py][INFO] Epoch:[0/2](175200/4588595) loss:2.936 lr:0.0000100 epoch_Time:27866.0min: [2024-01-03 12:06:00,856][model8_pretrain.py][INFO] Epoch:[0/2](175200/4588595) loss:2.639 lr:0.0000100 epoch_Time:27866.0min: [2024-01-03 12:06:00,856][model8_pretrain.py][INFO] Epoch:[0/2](175200/4588595) loss:2.841 lr:0.0000100 epoch_Time:27866.0min: [2024-01-03 12:06:00,856][model8_pretrain.py][INFO] Epoch:[0/2](175200/4588595) loss:2.359 lr:0.0000100 epoch_Time:27866.0min: [2024-01-03 12:06:00,856][model8_pretrain.py][INFO] Epoch:[0/2](175200/4588595) loss:3.341 lr:0.0000100 epoch_Time:27866.0min: [2024-01-03 12:06:00,856][model8_pretrain.py][INFO] Epoch:[0/2](175200/4588595) loss:2.906 lr:0.0000100 
epoch_Time:27866.0min: [2024-01-03 12:06:00,856][model8_pretrain.py][INFO] Epoch:[0/2](175200/4588595) loss:3.034 lr:0.0000100 epoch_Time:27866.0min: [2024-01-03 12:06:00,856][model8_pretrain.py][INFO] Epoch:[0/2](175200/4588595) loss:3.119 lr:0.0000100 epoch_Time:27866.0min: [2024-01-03 12:06:37,798][model8_pretrain.py][INFO] Epoch:[0/2](175300/4588595) loss:2.789 lr:0.0000100 epoch_Time:27866.0min: [2024-01-03 12:06:37,798][model8_pretrain.py][INFO] Epoch:[0/2](175300/4588595) loss:3.364 lr:0.0000100 epoch_Time:27866.0min: [2024-01-03 12:06:37,798][model8_pretrain.py][INFO] Epoch:[0/2](175300/4588595) loss:3.082 lr:0.0000100 epoch_Time:27866.0min: [2024-01-03 12:06:37,798][model8_pretrain.py][INFO] Epoch:[0/2](175300/4588595) loss:3.465 lr:0.0000100 epoch_Time:27866.0min: [2024-01-03 12:06:37,798][model8_pretrain.py][INFO] Epoch:[0/2](175300/4588595) loss:3.158 lr:0.0000100 epoch_Time:27866.0min: [2024-01-03 12:06:37,798][model8_pretrain.py][INFO] Epoch:[0/2](175300/4588595) loss:2.866 lr:0.0000100 epoch_Time:27866.0min: [2024-01-03 12:06:37,798][model8_pretrain.py][INFO] Epoch:[0/2](175300/4588595) loss:2.978 lr:0.0000100 epoch_Time:27866.0min: [2024-01-03 12:06:37,798][model8_pretrain.py][INFO] Epoch:[0/2](175300/4588595) loss:3.053 lr:0.0000100 epoch_Time:27866.0min: [2024-01-03 12:07:14,734][model8_pretrain.py][INFO] Epoch:[0/2](175400/4588595) loss:3.023 lr:0.0000100 epoch_Time:27864.0min: [2024-01-03 12:07:14,734][model8_pretrain.py][INFO] Epoch:[0/2](175400/4588595) loss:3.009 lr:0.0000100 epoch_Time:27864.0min: [2024-01-03 12:07:14,734][model8_pretrain.py][INFO] Epoch:[0/2](175400/4588595) loss:3.092 lr:0.0000100 epoch_Time:27864.0min: [2024-01-03 12:07:14,734][model8_pretrain.py][INFO] Epoch:[0/2](175400/4588595) loss:3.055 lr:0.0000100 epoch_Time:27864.0min: [2024-01-03 12:07:14,734][model8_pretrain.py][INFO] Epoch:[0/2](175400/4588595) loss:2.674 lr:0.0000100 epoch_Time:27864.0min: [2024-01-03 12:07:14,734][model8_pretrain.py][INFO] 
Epoch:[0/2](175400/4588595) loss:2.844 lr:0.0000100 epoch_Time:27864.0min: [2024-01-03 12:07:14,734][model8_pretrain.py][INFO] Epoch:[0/2](175400/4588595) loss:2.941 lr:0.0000100 epoch_Time:27864.0min: [2024-01-03 12:07:14,734][model8_pretrain.py][INFO] Epoch:[0/2](175400/4588595) loss:3.056 lr:0.0000100 epoch_Time:27864.0min: [2024-01-03 12:07:51,682][model8_pretrain.py][INFO] Epoch:[0/2](175500/4588595) loss:3.530 lr:0.0000100 epoch_Time:27863.0min: [2024-01-03 12:07:51,682][model8_pretrain.py][INFO] Epoch:[0/2](175500/4588595) loss:2.896 lr:0.0000100 epoch_Time:27863.0min: [2024-01-03 12:07:51,682][model8_pretrain.py][INFO] Epoch:[0/2](175500/4588595) loss:2.367 lr:0.0000100 epoch_Time:27863.0min: [2024-01-03 12:07:51,682][model8_pretrain.py][INFO] Epoch:[0/2](175500/4588595) loss:3.353 lr:0.0000100 epoch_Time:27863.0min: [2024-01-03 12:07:51,682][model8_pretrain.py][INFO] Epoch:[0/2](175500/4588595) loss:3.276 lr:0.0000100 epoch_Time:27863.0min: [2024-01-03 12:07:51,682][model8_pretrain.py][INFO] Epoch:[0/2](175500/4588595) loss:3.252 lr:0.0000100 epoch_Time:27863.0min: [2024-01-03 12:07:51,682][model8_pretrain.py][INFO] Epoch:[0/2](175500/4588595) loss:3.240 lr:0.0000100 epoch_Time:27863.0min: [2024-01-03 12:07:51,682][model8_pretrain.py][INFO] Epoch:[0/2](175500/4588595) loss:2.844 lr:0.0000100 epoch_Time:27863.0min: [2024-01-03 12:08:28,626][model8_pretrain.py][INFO] Epoch:[0/2](175600/4588595) loss:2.379 lr:0.0000100 epoch_Time:27862.0min: [2024-01-03 12:08:28,626][model8_pretrain.py][INFO] Epoch:[0/2](175600/4588595) loss:3.150 lr:0.0000100 epoch_Time:27862.0min: [2024-01-03 12:08:28,626][model8_pretrain.py][INFO] Epoch:[0/2](175600/4588595) loss:2.456 lr:0.0000100 epoch_Time:27862.0min: [2024-01-03 12:08:28,626][model8_pretrain.py][INFO] Epoch:[0/2](175600/4588595) loss:3.105 lr:0.0000100 epoch_Time:27862.0min: [2024-01-03 12:08:28,626][model8_pretrain.py][INFO] Epoch:[0/2](175600/4588595) loss:2.787 lr:0.0000100 epoch_Time:27862.0min: [2024-01-03 
12:08:28,626][model8_pretrain.py][INFO] Epoch:[0/2](175600/4588595) loss:2.209 lr:0.0000100 epoch_Time:27862.0min: [2024-01-03 12:08:28,627][model8_pretrain.py][INFO] Epoch:[0/2](175600/4588595) loss:2.732 lr:0.0000100 epoch_Time:27862.0min: [2024-01-03 12:08:28,626][model8_pretrain.py][INFO] Epoch:[0/2](175600/4588595) loss:3.025 lr:0.0000100 epoch_Time:27862.0min: [2024-01-03 12:09:14,095][model8_pretrain.py][INFO] Epoch:[0/2](175700/4588595) loss:2.756 lr:0.0000100 epoch_Time:27865.0min: [2024-01-03 12:09:14,095][model8_pretrain.py][INFO] Epoch:[0/2](175700/4588595) loss:3.323 lr:0.0000100 epoch_Time:27865.0min: [2024-01-03 12:09:14,095][model8_pretrain.py][INFO] Epoch:[0/2](175700/4588595) loss:3.004 lr:0.0000100 epoch_Time:27865.0min: [2024-01-03 12:09:14,095][model8_pretrain.py][INFO] Epoch:[0/2](175700/4588595) loss:2.588 lr:0.0000100 epoch_Time:27865.0min: [2024-01-03 12:09:14,095][model8_pretrain.py][INFO] Epoch:[0/2](175700/4588595) loss:3.094 lr:0.0000100 epoch_Time:27865.0min: [2024-01-03 12:09:14,095][model8_pretrain.py][INFO] Epoch:[0/2](175700/4588595) loss:2.685 lr:0.0000100 epoch_Time:27865.0min: [2024-01-03 12:09:14,095][model8_pretrain.py][INFO] Epoch:[0/2](175700/4588595) loss:2.394 lr:0.0000100 epoch_Time:27865.0min: [2024-01-03 12:09:14,095][model8_pretrain.py][INFO] Epoch:[0/2](175700/4588595) loss:2.352 lr:0.0000100 epoch_Time:27865.0min: [2024-01-03 12:09:51,081][model8_pretrain.py][INFO] Epoch:[0/2](175800/4588595) loss:2.867 lr:0.0000100 epoch_Time:27863.0min: [2024-01-03 12:09:51,081][model8_pretrain.py][INFO] Epoch:[0/2](175800/4588595) loss:2.524 lr:0.0000100 epoch_Time:27863.0min: [2024-01-03 12:09:51,081][model8_pretrain.py][INFO] Epoch:[0/2](175800/4588595) loss:2.890 lr:0.0000100 epoch_Time:27863.0min: [2024-01-03 12:09:51,081][model8_pretrain.py][INFO] Epoch:[0/2](175800/4588595) loss:3.174 lr:0.0000100 epoch_Time:27863.0min: [2024-01-03 12:09:51,081][model8_pretrain.py][INFO] Epoch:[0/2](175800/4588595) loss:3.292 lr:0.0000100 
epoch_Time:27863.0min: [2024-01-03 12:09:51,081][model8_pretrain.py][INFO] Epoch:[0/2](175800/4588595) loss:3.235 lr:0.0000100 epoch_Time:27863.0min: [2024-01-03 12:09:51,081][model8_pretrain.py][INFO] Epoch:[0/2](175800/4588595) loss:2.700 lr:0.0000100 epoch_Time:27863.0min: [2024-01-03 12:09:51,081][model8_pretrain.py][INFO] Epoch:[0/2](175800/4588595) loss:2.971 lr:0.0000100 epoch_Time:27863.0min: [2024-01-03 12:10:28,044][model8_pretrain.py][INFO] Epoch:[0/2](175900/4588595) loss:3.283 lr:0.0000100 epoch_Time:27863.0min: [2024-01-03 12:10:28,044][model8_pretrain.py][INFO] Epoch:[0/2](175900/4588595) loss:3.234 lr:0.0000100 epoch_Time:27863.0min: [2024-01-03 12:10:28,044][model8_pretrain.py][INFO] Epoch:[0/2](175900/4588595) loss:3.250 lr:0.0000100 epoch_Time:27863.0min: [2024-01-03 12:10:28,044][model8_pretrain.py][INFO] Epoch:[0/2](175900/4588595) loss:2.798 lr:0.0000100 epoch_Time:27863.0min: [2024-01-03 12:10:28,045][model8_pretrain.py][INFO] Epoch:[0/2](175900/4588595) loss:2.468 lr:0.0000100 epoch_Time:27863.0min: [2024-01-03 12:10:28,045][model8_pretrain.py][INFO] Epoch:[0/2](175900/4588595) loss:3.294 lr:0.0000100 epoch_Time:27863.0min: [2024-01-03 12:10:28,045][model8_pretrain.py][INFO] Epoch:[0/2](175900/4588595) loss:3.538 lr:0.0000100 epoch_Time:27863.0min: [2024-01-03 12:10:28,045][model8_pretrain.py][INFO] Epoch:[0/2](175900/4588595) loss:2.717 lr:0.0000100 epoch_Time:27863.0min: [2024-01-03 12:11:04,972][model8_pretrain.py][INFO] Epoch:[0/2](176000/4588595) loss:3.116 lr:0.0000100 epoch_Time:27861.0min: [2024-01-03 12:11:04,972][model8_pretrain.py][INFO] Epoch:[0/2](176000/4588595) loss:3.508 lr:0.0000100 epoch_Time:27861.0min: [2024-01-03 12:11:04,972][model8_pretrain.py][INFO] Epoch:[0/2](176000/4588595) loss:3.173 lr:0.0000100 epoch_Time:27861.0min: [2024-01-03 12:11:04,972][model8_pretrain.py][INFO] Epoch:[0/2](176000/4588595) loss:3.431 lr:0.0000100 epoch_Time:27861.0min: [2024-01-03 12:11:04,972][model8_pretrain.py][INFO] 
Epoch:[0/2](176000/4588595) loss:3.209 lr:0.0000100 epoch_Time:27861.0min: [2024-01-03 12:11:04,972][model8_pretrain.py][INFO] Epoch:[0/2](176000/4588595) loss:3.400 lr:0.0000100 epoch_Time:27861.0min: [2024-01-03 12:11:04,972][model8_pretrain.py][INFO] Epoch:[0/2](176000/4588595) loss:2.622 lr:0.0000100 epoch_Time:27861.0min: [2024-01-03 12:11:04,972][model8_pretrain.py][INFO] Epoch:[0/2](176000/4588595) loss:3.095 lr:0.0000100 epoch_Time:27861.0min: [2024-01-03 12:11:41,914][model8_pretrain.py][INFO] Epoch:[0/2](176100/4588595) loss:3.141 lr:0.0000100 epoch_Time:27861.0min: [2024-01-03 12:11:41,914][model8_pretrain.py][INFO] Epoch:[0/2](176100/4588595) loss:2.969 lr:0.0000100 epoch_Time:27861.0min: [2024-01-03 12:11:41,914][model8_pretrain.py][INFO] Epoch:[0/2](176100/4588595) loss:3.093 lr:0.0000100 epoch_Time:27861.0min: [2024-01-03 12:11:41,914][model8_pretrain.py][INFO] Epoch:[0/2](176100/4588595) loss:2.724 lr:0.0000100 epoch_Time:27861.0min: [2024-01-03 12:11:41,914][model8_pretrain.py][INFO] Epoch:[0/2](176100/4588595) loss:2.608 lr:0.0000100 epoch_Time:27861.0min: [2024-01-03 12:11:41,914][model8_pretrain.py][INFO] Epoch:[0/2](176100/4588595) loss:2.754 lr:0.0000100 epoch_Time:27861.0min: [2024-01-03 12:11:41,914][model8_pretrain.py][INFO] Epoch:[0/2](176100/4588595) loss:3.109 lr:0.0000100 epoch_Time:27861.0min: [2024-01-03 12:11:41,914][model8_pretrain.py][INFO] Epoch:[0/2](176100/4588595) loss:3.214 lr:0.0000100 epoch_Time:27861.0min: [2024-01-03 12:12:18,861][model8_pretrain.py][INFO] Epoch:[0/2](176200/4588595) loss:2.943 lr:0.0000100 epoch_Time:27860.0min: [2024-01-03 12:12:18,861][model8_pretrain.py][INFO] Epoch:[0/2](176200/4588595) loss:2.714 lr:0.0000100 epoch_Time:27860.0min: [2024-01-03 12:12:18,861][model8_pretrain.py][INFO] Epoch:[0/2](176200/4588595) loss:3.068 lr:0.0000100 epoch_Time:27860.0min: [2024-01-03 12:12:18,861][model8_pretrain.py][INFO] Epoch:[0/2](176200/4588595) loss:2.793 lr:0.0000100 epoch_Time:27860.0min: [2024-01-03 
12:12:18,861][model8_pretrain.py][INFO] Epoch:[0/2](176200/4588595) loss:3.585 lr:0.0000100 epoch_Time:27860.0min: [2024-01-03 12:12:18,861][model8_pretrain.py][INFO] Epoch:[0/2](176200/4588595) loss:2.851 lr:0.0000100 epoch_Time:27860.0min: [2024-01-03 12:12:18,862][model8_pretrain.py][INFO] Epoch:[0/2](176200/4588595) loss:2.592 lr:0.0000100 epoch_Time:27860.0min: [2024-01-03 12:12:18,862][model8_pretrain.py][INFO] Epoch:[0/2](176200/4588595) loss:3.044 lr:0.0000100 epoch_Time:27860.0min: [2024-01-03 12:12:55,783][model8_pretrain.py][INFO] Epoch:[0/2](176300/4588595) loss:2.756 lr:0.0000100 epoch_Time:27858.0min: [2024-01-03 12:12:55,783][model8_pretrain.py][INFO] Epoch:[0/2](176300/4588595) loss:2.516 lr:0.0000100 epoch_Time:27858.0min: [2024-01-03 12:12:55,783][model8_pretrain.py][INFO] Epoch:[0/2](176300/4588595) loss:2.793 lr:0.0000100 epoch_Time:27858.0min: [2024-01-03 12:12:55,783][model8_pretrain.py][INFO] Epoch:[0/2](176300/4588595) loss:2.902 lr:0.0000100 epoch_Time:27858.0min: [2024-01-03 12:12:55,783][model8_pretrain.py][INFO] Epoch:[0/2](176300/4588595) loss:3.459 lr:0.0000100 epoch_Time:27858.0min: [2024-01-03 12:12:55,783][model8_pretrain.py][INFO] Epoch:[0/2](176300/4588595) loss:2.504 lr:0.0000100 epoch_Time:27858.0min: [2024-01-03 12:12:55,783][model8_pretrain.py][INFO] Epoch:[0/2](176300/4588595) loss:3.246 lr:0.0000100 epoch_Time:27858.0min: [2024-01-03 12:12:55,783][model8_pretrain.py][INFO] Epoch:[0/2](176300/4588595) loss:2.758 lr:0.0000100 epoch_Time:27858.0min: [2024-01-03 12:13:32,735][model8_pretrain.py][INFO] Epoch:[0/2](176400/4588595) loss:2.337 lr:0.0000100 epoch_Time:27858.0min: [2024-01-03 12:13:32,735][model8_pretrain.py][INFO] Epoch:[0/2](176400/4588595) loss:3.358 lr:0.0000100 epoch_Time:27858.0min: [2024-01-03 12:13:32,735][model8_pretrain.py][INFO] Epoch:[0/2](176400/4588595) loss:2.729 lr:0.0000100 epoch_Time:27858.0min: [2024-01-03 12:13:32,735][model8_pretrain.py][INFO] Epoch:[0/2](176400/4588595) loss:3.232 lr:0.0000100 
epoch_Time:27858.0min: [2024-01-03 12:13:32,735][model8_pretrain.py][INFO] Epoch:[0/2](176400/4588595) loss:3.000 lr:0.0000100 epoch_Time:27858.0min: [2024-01-03 12:13:32,736][model8_pretrain.py][INFO] Epoch:[0/2](176400/4588595) loss:2.951 lr:0.0000100 epoch_Time:27858.0min: [2024-01-03 12:13:32,736][model8_pretrain.py][INFO] Epoch:[0/2](176400/4588595) loss:2.798 lr:0.0000100 epoch_Time:27858.0min: [2024-01-03 12:13:32,737][model8_pretrain.py][INFO] Epoch:[0/2](176400/4588595) loss:3.041 lr:0.0000100 epoch_Time:27858.0min: [2024-01-03 12:14:18,234][model8_pretrain.py][INFO] Epoch:[0/2](176500/4588595) loss:2.098 lr:0.0000100 epoch_Time:27860.0min: [2024-01-03 12:14:18,234][model8_pretrain.py][INFO] Epoch:[0/2](176500/4588595) loss:3.111 lr:0.0000100 epoch_Time:27860.0min: [2024-01-03 12:14:18,234][model8_pretrain.py][INFO] Epoch:[0/2](176500/4588595) loss:3.210 lr:0.0000100 epoch_Time:27860.0min: [2024-01-03 12:14:18,234][model8_pretrain.py][INFO] Epoch:[0/2](176500/4588595) loss:2.875 lr:0.0000100 epoch_Time:27860.0min: [2024-01-03 12:14:18,234][model8_pretrain.py][INFO] Epoch:[0/2](176500/4588595) loss:2.704 lr:0.0000100 epoch_Time:27860.0min: [2024-01-03 12:14:18,234][model8_pretrain.py][INFO] Epoch:[0/2](176500/4588595) loss:3.036 lr:0.0000100 epoch_Time:27860.0min: [2024-01-03 12:14:18,234][model8_pretrain.py][INFO] Epoch:[0/2](176500/4588595) loss:3.002 lr:0.0000100 epoch_Time:27860.0min: [2024-01-03 12:14:18,235][model8_pretrain.py][INFO] Epoch:[0/2](176500/4588595) loss:3.000 lr:0.0000100 epoch_Time:27860.0min: [2024-01-03 12:14:55,170][model8_pretrain.py][INFO] Epoch:[0/2](176600/4588595) loss:2.653 lr:0.0000100 epoch_Time:27859.0min: [2024-01-03 12:14:55,170][model8_pretrain.py][INFO] Epoch:[0/2](176600/4588595) loss:2.796 lr:0.0000100 epoch_Time:27859.0min: [2024-01-03 12:14:55,170][model8_pretrain.py][INFO] Epoch:[0/2](176600/4588595) loss:2.767 lr:0.0000100 epoch_Time:27859.0min: [2024-01-03 12:14:55,170][model8_pretrain.py][INFO] 
Epoch:[0/2](176600/4588595) loss:3.214 lr:0.0000100 epoch_Time:27859.0min: [2024-01-03 12:14:55,170][model8_pretrain.py][INFO] Epoch:[0/2](176600/4588595) loss:2.834 lr:0.0000100 epoch_Time:27859.0min: [2024-01-03 12:14:55,170][model8_pretrain.py][INFO] Epoch:[0/2](176600/4588595) loss:2.852 lr:0.0000100 epoch_Time:27859.0min: [2024-01-03 12:14:55,170][model8_pretrain.py][INFO] Epoch:[0/2](176600/4588595) loss:2.753 lr:0.0000100 epoch_Time:27859.0min: [2024-01-03 12:14:55,170][model8_pretrain.py][INFO] Epoch:[0/2](176600/4588595) loss:2.677 lr:0.0000100 epoch_Time:27859.0min: [2024-01-03 12:15:32,087][model8_pretrain.py][INFO] Epoch:[0/2](176700/4588595) loss:2.739 lr:0.0000100 epoch_Time:27858.0min: [2024-01-03 12:15:32,087][model8_pretrain.py][INFO] Epoch:[0/2](176700/4588595) loss:3.230 lr:0.0000100 epoch_Time:27858.0min: [2024-01-03 12:15:32,088][model8_pretrain.py][INFO] Epoch:[0/2](176700/4588595) loss:3.194 lr:0.0000100 epoch_Time:27858.0min: [2024-01-03 12:15:32,088][model8_pretrain.py][INFO] Epoch:[0/2](176700/4588595) loss:3.257 lr:0.0000100 epoch_Time:27858.0min: [2024-01-03 12:15:32,088][model8_pretrain.py][INFO] Epoch:[0/2](176700/4588595) loss:2.773 lr:0.0000100 epoch_Time:27858.0min: [2024-01-03 12:15:32,088][model8_pretrain.py][INFO] Epoch:[0/2](176700/4588595) loss:3.288 lr:0.0000100 epoch_Time:27858.0min: [2024-01-03 12:15:32,088][model8_pretrain.py][INFO] Epoch:[0/2](176700/4588595) loss:3.165 lr:0.0000100 epoch_Time:27858.0min: [2024-01-03 12:15:32,088][model8_pretrain.py][INFO] Epoch:[0/2](176700/4588595) loss:2.627 lr:0.0000100 epoch_Time:27858.0min: [2024-01-03 12:16:09,016][model8_pretrain.py][INFO] Epoch:[0/2](176800/4588595) loss:3.379 lr:0.0000100 epoch_Time:27857.0min: [2024-01-03 12:16:09,016][model8_pretrain.py][INFO] Epoch:[0/2](176800/4588595) loss:2.242 lr:0.0000100 epoch_Time:27857.0min: [2024-01-03 12:16:09,016][model8_pretrain.py][INFO] Epoch:[0/2](176800/4588595) loss:2.914 lr:0.0000100 epoch_Time:27857.0min: [2024-01-03 
12:16:09,016][model8_pretrain.py][INFO] Epoch:[0/2](176800/4588595) loss:2.960 lr:0.0000100 epoch_Time:27857.0min: [2024-01-03 12:16:09,016][model8_pretrain.py][INFO] Epoch:[0/2](176800/4588595) loss:3.005 lr:0.0000100 epoch_Time:27857.0min: [2024-01-03 12:16:09,016][model8_pretrain.py][INFO] Epoch:[0/2](176800/4588595) loss:2.489 lr:0.0000100 epoch_Time:27857.0min: [2024-01-03 12:16:09,016][model8_pretrain.py][INFO] Epoch:[0/2](176800/4588595) loss:2.912 lr:0.0000100 epoch_Time:27857.0min: [2024-01-03 12:16:09,016][model8_pretrain.py][INFO] Epoch:[0/2](176800/4588595) loss:3.235 lr:0.0000100 epoch_Time:27857.0min: [2024-01-03 12:16:45,935][model8_pretrain.py][INFO] Epoch:[0/2](176900/4588595) loss:3.044 lr:0.0000100 epoch_Time:27856.0min: [2024-01-03 12:16:45,935][model8_pretrain.py][INFO] Epoch:[0/2](176900/4588595) loss:2.706 lr:0.0000100 epoch_Time:27856.0min: [2024-01-03 12:16:45,935][model8_pretrain.py][INFO] Epoch:[0/2](176900/4588595) loss:3.422 lr:0.0000100 epoch_Time:27856.0min: [2024-01-03 12:16:45,935][model8_pretrain.py][INFO] Epoch:[0/2](176900/4588595) loss:2.312 lr:0.0000100 epoch_Time:27856.0min: [2024-01-03 12:16:45,935][model8_pretrain.py][INFO] Epoch:[0/2](176900/4588595) loss:3.149 lr:0.0000100 epoch_Time:27856.0min: [2024-01-03 12:16:45,935][model8_pretrain.py][INFO] Epoch:[0/2](176900/4588595) loss:3.095 lr:0.0000100 epoch_Time:27856.0min: [2024-01-03 12:16:45,935][model8_pretrain.py][INFO] Epoch:[0/2](176900/4588595) loss:2.613 lr:0.0000100 epoch_Time:27856.0min: [2024-01-03 12:16:45,935][model8_pretrain.py][INFO] Epoch:[0/2](176900/4588595) loss:3.088 lr:0.0000100 epoch_Time:27856.0min: [2024-01-03 12:17:22,869][model8_pretrain.py][INFO] Epoch:[0/2](177000/4588595) loss:1.926 lr:0.0000100 epoch_Time:27855.0min: [2024-01-03 12:17:22,869][model8_pretrain.py][INFO] Epoch:[0/2](177000/4588595) loss:2.678 lr:0.0000100 epoch_Time:27855.0min: [2024-01-03 12:17:22,869][model8_pretrain.py][INFO] Epoch:[0/2](177000/4588595) loss:3.079 lr:0.0000100 
epoch_Time:27855.0min: [2024-01-03 12:17:22,869][model8_pretrain.py][INFO] Epoch:[0/2](177000/4588595) loss:2.469 lr:0.0000100 epoch_Time:27855.0min: [2024-01-03 12:17:22,869][model8_pretrain.py][INFO] Epoch:[0/2](177000/4588595) loss:2.227 lr:0.0000100 epoch_Time:27855.0min: [2024-01-03 12:17:22,869][model8_pretrain.py][INFO] Epoch:[0/2](177000/4588595) loss:2.696 lr:0.0000100 epoch_Time:27855.0min: [2024-01-03 12:17:22,869][model8_pretrain.py][INFO] Epoch:[0/2](177000/4588595) loss:3.152 lr:0.0000100 epoch_Time:27855.0min: [2024-01-03 12:17:22,869][model8_pretrain.py][INFO] Epoch:[0/2](177000/4588595) loss:2.618 lr:0.0000100 epoch_Time:27855.0min: [2024-01-03 12:17:59,821][model8_pretrain.py][INFO] Epoch:[0/2](177100/4588595) loss:3.543 lr:0.0000100 epoch_Time:27854.0min: [2024-01-03 12:17:59,821][model8_pretrain.py][INFO] Epoch:[0/2](177100/4588595) loss:3.172 lr:0.0000100 epoch_Time:27854.0min: [2024-01-03 12:17:59,821][model8_pretrain.py][INFO] Epoch:[0/2](177100/4588595) loss:2.875 lr:0.0000100 epoch_Time:27854.0min: [2024-01-03 12:17:59,821][model8_pretrain.py][INFO] Epoch:[0/2](177100/4588595) loss:3.018 lr:0.0000100 epoch_Time:27854.0min: [2024-01-03 12:17:59,821][model8_pretrain.py][INFO] Epoch:[0/2](177100/4588595) loss:2.579 lr:0.0000100 epoch_Time:27854.0min: [2024-01-03 12:17:59,821][model8_pretrain.py][INFO] Epoch:[0/2](177100/4588595) loss:2.427 lr:0.0000100 epoch_Time:27854.0min: [2024-01-03 12:17:59,821][model8_pretrain.py][INFO] Epoch:[0/2](177100/4588595) loss:2.885 lr:0.0000100 epoch_Time:27854.0min: [2024-01-03 12:17:59,821][model8_pretrain.py][INFO] Epoch:[0/2](177100/4588595) loss:2.841 lr:0.0000100 epoch_Time:27854.0min: [2024-01-03 12:18:36,740][model8_pretrain.py][INFO] Epoch:[0/2](177200/4588595) loss:2.583 lr:0.0000100 epoch_Time:27853.0min: [2024-01-03 12:18:36,740][model8_pretrain.py][INFO] Epoch:[0/2](177200/4588595) loss:2.558 lr:0.0000100 epoch_Time:27853.0min: [2024-01-03 12:18:36,740][model8_pretrain.py][INFO] 
Epoch:[0/2](177200/4588595) loss:3.025 lr:0.0000100 epoch_Time:27853.0min: [2024-01-03 12:18:36,740][model8_pretrain.py][INFO] Epoch:[0/2](177200/4588595) loss:3.275 lr:0.0000100 epoch_Time:27853.0min: [2024-01-03 12:18:36,740][model8_pretrain.py][INFO] Epoch:[0/2](177200/4588595) loss:3.142 lr:0.0000100 epoch_Time:27853.0min: [2024-01-03 12:18:36,741][model8_pretrain.py][INFO] Epoch:[0/2](177200/4588595) loss:3.047 lr:0.0000100 epoch_Time:27853.0min: [2024-01-03 12:18:36,741][model8_pretrain.py][INFO] Epoch:[0/2](177200/4588595) loss:3.115 lr:0.0000100 epoch_Time:27853.0min: [2024-01-03 12:18:36,741][model8_pretrain.py][INFO] Epoch:[0/2](177200/4588595) loss:3.385 lr:0.0000100 epoch_Time:27853.0min: [2024-01-03 12:19:22,160][model8_pretrain.py][INFO] Epoch:[0/2](177300/4588595) loss:3.087 lr:0.0000100 epoch_Time:27855.0min: [2024-01-03 12:19:22,160][model8_pretrain.py][INFO] Epoch:[0/2](177300/4588595) loss:2.745 lr:0.0000100 epoch_Time:27855.0min: [2024-01-03 12:19:22,160][model8_pretrain.py][INFO] Epoch:[0/2](177300/4588595) loss:3.072 lr:0.0000100 epoch_Time:27855.0min: [2024-01-03 12:19:22,160][model8_pretrain.py][INFO] Epoch:[0/2](177300/4588595) loss:3.378 lr:0.0000100 epoch_Time:27855.0min: [2024-01-03 12:19:22,160][model8_pretrain.py][INFO] Epoch:[0/2](177300/4588595) loss:2.518 lr:0.0000100 epoch_Time:27855.0min: [2024-01-03 12:19:22,160][model8_pretrain.py][INFO] Epoch:[0/2](177300/4588595) loss:3.250 lr:0.0000100 epoch_Time:27855.0min: [2024-01-03 12:19:22,160][model8_pretrain.py][INFO] Epoch:[0/2](177300/4588595) loss:3.078 lr:0.0000100 epoch_Time:27855.0min: [2024-01-03 12:19:22,160][model8_pretrain.py][INFO] Epoch:[0/2](177300/4588595) loss:3.055 lr:0.0000100 epoch_Time:27855.0min: [2024-01-03 12:19:59,097][model8_pretrain.py][INFO] Epoch:[0/2](177400/4588595) loss:2.669 lr:0.0000100 epoch_Time:27854.0min: [2024-01-03 12:19:59,097][model8_pretrain.py][INFO] Epoch:[0/2](177400/4588595) loss:2.869 lr:0.0000100 epoch_Time:27854.0min: [2024-01-03 
12:19:59,097][model8_pretrain.py][INFO] Epoch:[0/2](177400/4588595) loss:3.256 lr:0.0000100 epoch_Time:27854.0min: [2024-01-03 12:19:59,097][model8_pretrain.py][INFO] Epoch:[0/2](177400/4588595) loss:2.935 lr:0.0000100 epoch_Time:27854.0min: [2024-01-03 12:19:59,097][model8_pretrain.py][INFO] Epoch:[0/2](177400/4588595) loss:2.760 lr:0.0000100 epoch_Time:27854.0min: [2024-01-03 12:19:59,097][model8_pretrain.py][INFO] Epoch:[0/2](177400/4588595) loss:3.036 lr:0.0000100 epoch_Time:27854.0min: [2024-01-03 12:19:59,097][model8_pretrain.py][INFO] Epoch:[0/2](177400/4588595) loss:3.554 lr:0.0000100 epoch_Time:27854.0min: [2024-01-03 12:19:59,097][model8_pretrain.py][INFO] Epoch:[0/2](177400/4588595) loss:2.735 lr:0.0000100 epoch_Time:27854.0min: [2024-01-03 12:20:36,040][model8_pretrain.py][INFO] Epoch:[0/2](177500/4588595) loss:2.022 lr:0.0000100 epoch_Time:27854.0min: [2024-01-03 12:20:36,040][model8_pretrain.py][INFO] Epoch:[0/2](177500/4588595) loss:3.311 lr:0.0000100 epoch_Time:27854.0min: [2024-01-03 12:20:36,040][model8_pretrain.py][INFO] Epoch:[0/2](177500/4588595) loss:3.226 lr:0.0000100 epoch_Time:27854.0min: [2024-01-03 12:20:36,040][model8_pretrain.py][INFO] Epoch:[0/2](177500/4588595) loss:2.779 lr:0.0000100 epoch_Time:27854.0min: [2024-01-03 12:20:36,040][model8_pretrain.py][INFO] Epoch:[0/2](177500/4588595) loss:3.160 lr:0.0000100 epoch_Time:27854.0min: [2024-01-03 12:20:36,040][model8_pretrain.py][INFO] Epoch:[0/2](177500/4588595) loss:2.862 lr:0.0000100 epoch_Time:27854.0min: [2024-01-03 12:20:36,040][model8_pretrain.py][INFO] Epoch:[0/2](177500/4588595) loss:3.116 lr:0.0000100 epoch_Time:27854.0min: [2024-01-03 12:20:36,040][model8_pretrain.py][INFO] Epoch:[0/2](177500/4588595) loss:2.952 lr:0.0000100 epoch_Time:27854.0min: [2024-01-03 12:21:12,980][model8_pretrain.py][INFO] Epoch:[0/2](177600/4588595) loss:2.657 lr:0.0000100 epoch_Time:27852.0min: [2024-01-03 12:21:12,980][model8_pretrain.py][INFO] Epoch:[0/2](177600/4588595) loss:3.037 lr:0.0000100 
epoch_Time:27852.0min: [2024-01-03 12:21:12,980][model8_pretrain.py][INFO] Epoch:[0/2](177600/4588595) loss:2.930 lr:0.0000100 epoch_Time:27852.0min: [2024-01-03 12:21:12,980][model8_pretrain.py][INFO] Epoch:[0/2](177600/4588595) loss:2.637 lr:0.0000100 epoch_Time:27852.0min: [2024-01-03 12:21:12,980][model8_pretrain.py][INFO] Epoch:[0/2](177600/4588595) loss:2.876 lr:0.0000100 epoch_Time:27852.0min: [2024-01-03 12:21:12,980][model8_pretrain.py][INFO] Epoch:[0/2](177600/4588595) loss:2.937 lr:0.0000100 epoch_Time:27852.0min: [2024-01-03 12:21:12,980][model8_pretrain.py][INFO] Epoch:[0/2](177600/4588595) loss:3.163 lr:0.0000100 epoch_Time:27852.0min: [2024-01-03 12:21:12,980][model8_pretrain.py][INFO] Epoch:[0/2](177600/4588595) loss:2.717 lr:0.0000100 epoch_Time:27852.0min: [2024-01-03 12:21:49,934][model8_pretrain.py][INFO] Epoch:[0/2](177700/4588595) loss:2.726 lr:0.0000100 epoch_Time:27851.0min: [2024-01-03 12:21:49,934][model8_pretrain.py][INFO] Epoch:[0/2](177700/4588595) loss:3.205 lr:0.0000100 epoch_Time:27851.0min: [2024-01-03 12:21:49,934][model8_pretrain.py][INFO] Epoch:[0/2](177700/4588595) loss:3.404 lr:0.0000100 epoch_Time:27851.0min: [2024-01-03 12:21:49,934][model8_pretrain.py][INFO] Epoch:[0/2](177700/4588595) loss:3.108 lr:0.0000100 epoch_Time:27851.0min: [2024-01-03 12:21:49,934][model8_pretrain.py][INFO] Epoch:[0/2](177700/4588595) loss:3.205 lr:0.0000100 epoch_Time:27851.0min: [2024-01-03 12:21:49,934][model8_pretrain.py][INFO] Epoch:[0/2](177700/4588595) loss:2.841 lr:0.0000100 epoch_Time:27851.0min: [2024-01-03 12:21:49,934][model8_pretrain.py][INFO] Epoch:[0/2](177700/4588595) loss:2.488 lr:0.0000100 epoch_Time:27851.0min: [2024-01-03 12:21:49,934][model8_pretrain.py][INFO] Epoch:[0/2](177700/4588595) loss:3.030 lr:0.0000100 epoch_Time:27851.0min: [2024-01-03 12:22:26,934][model8_pretrain.py][INFO] Epoch:[0/2](177800/4588595) loss:2.745 lr:0.0000100 epoch_Time:27850.0min: [2024-01-03 12:22:26,934][model8_pretrain.py][INFO] 
Epoch:[0/2](177800/4588595) loss:2.957 lr:0.0000100 epoch_Time:27850.0min: [2024-01-03 12:22:26,935][model8_pretrain.py][INFO] Epoch:[0/2](177800/4588595) loss:1.988 lr:0.0000100 epoch_Time:27850.0min: [2024-01-03 12:22:26,935][model8_pretrain.py][INFO] Epoch:[0/2](177800/4588595) loss:3.119 lr:0.0000100 epoch_Time:27850.0min: [2024-01-03 12:22:26,935][model8_pretrain.py][INFO] Epoch:[0/2](177800/4588595) loss:2.856 lr:0.0000100 epoch_Time:27850.0min: [2024-01-03 12:22:26,935][model8_pretrain.py][INFO] Epoch:[0/2](177800/4588595) loss:2.854 lr:0.0000100 epoch_Time:27850.0min: [2024-01-03 12:22:26,935][model8_pretrain.py][INFO] Epoch:[0/2](177800/4588595) loss:2.550 lr:0.0000100 epoch_Time:27850.0min: [2024-01-03 12:22:26,935][model8_pretrain.py][INFO] Epoch:[0/2](177800/4588595) loss:3.113 lr:0.0000100 epoch_Time:27850.0min: [2024-01-03 12:23:03,925][model8_pretrain.py][INFO] Epoch:[0/2](177900/4588595) loss:2.830 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:23:03,925][model8_pretrain.py][INFO] Epoch:[0/2](177900/4588595) loss:2.569 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:23:03,925][model8_pretrain.py][INFO] Epoch:[0/2](177900/4588595) loss:3.112 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:23:03,925][model8_pretrain.py][INFO] Epoch:[0/2](177900/4588595) loss:2.716 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:23:03,926][model8_pretrain.py][INFO] Epoch:[0/2](177900/4588595) loss:2.851 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:23:03,926][model8_pretrain.py][INFO] Epoch:[0/2](177900/4588595) loss:3.534 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:23:03,926][model8_pretrain.py][INFO] Epoch:[0/2](177900/4588595) loss:2.862 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:23:03,926][model8_pretrain.py][INFO] Epoch:[0/2](177900/4588595) loss:2.967 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:23:40,873][model8_pretrain.py][INFO] Epoch:[0/2](178000/4588595) loss:2.591 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 
12:23:40,873][model8_pretrain.py][INFO] Epoch:[0/2](178000/4588595) loss:3.200 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:23:40,873][model8_pretrain.py][INFO] Epoch:[0/2](178000/4588595) loss:3.002 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:23:40,873][model8_pretrain.py][INFO] Epoch:[0/2](178000/4588595) loss:2.095 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:23:40,873][model8_pretrain.py][INFO] Epoch:[0/2](178000/4588595) loss:2.830 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:23:40,873][model8_pretrain.py][INFO] Epoch:[0/2](178000/4588595) loss:3.013 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:23:40,873][model8_pretrain.py][INFO] Epoch:[0/2](178000/4588595) loss:2.679 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:23:40,873][model8_pretrain.py][INFO] Epoch:[0/2](178000/4588595) loss:2.988 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:24:26,386][model8_pretrain.py][INFO] Epoch:[0/2](178100/4588595) loss:3.183 lr:0.0000100 epoch_Time:27851.0min: [2024-01-03 12:24:26,386][model8_pretrain.py][INFO] Epoch:[0/2](178100/4588595) loss:2.562 lr:0.0000100 epoch_Time:27851.0min: [2024-01-03 12:24:26,386][model8_pretrain.py][INFO] Epoch:[0/2](178100/4588595) loss:3.229 lr:0.0000100 epoch_Time:27851.0min: [2024-01-03 12:24:26,386][model8_pretrain.py][INFO] Epoch:[0/2](178100/4588595) loss:3.463 lr:0.0000100 epoch_Time:27851.0min: [2024-01-03 12:24:26,386][model8_pretrain.py][INFO] Epoch:[0/2](178100/4588595) loss:3.153 lr:0.0000100 epoch_Time:27851.0min: [2024-01-03 12:24:26,386][model8_pretrain.py][INFO] Epoch:[0/2](178100/4588595) loss:2.950 lr:0.0000100 epoch_Time:27851.0min: [2024-01-03 12:24:26,386][model8_pretrain.py][INFO] Epoch:[0/2](178100/4588595) loss:3.195 lr:0.0000100 epoch_Time:27851.0min: [2024-01-03 12:24:26,386][model8_pretrain.py][INFO] Epoch:[0/2](178100/4588595) loss:3.030 lr:0.0000100 epoch_Time:27851.0min: [2024-01-03 12:25:03,315][model8_pretrain.py][INFO] Epoch:[0/2](178200/4588595) loss:3.278 lr:0.0000100 
epoch_Time:27849.0min: [2024-01-03 12:25:03,315][model8_pretrain.py][INFO] Epoch:[0/2](178200/4588595) loss:3.014 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:25:03,315][model8_pretrain.py][INFO] Epoch:[0/2](178200/4588595) loss:3.146 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:25:03,315][model8_pretrain.py][INFO] Epoch:[0/2](178200/4588595) loss:3.063 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:25:03,315][model8_pretrain.py][INFO] Epoch:[0/2](178200/4588595) loss:3.216 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:25:03,315][model8_pretrain.py][INFO] Epoch:[0/2](178200/4588595) loss:2.472 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:25:03,315][model8_pretrain.py][INFO] Epoch:[0/2](178200/4588595) loss:2.814 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:25:03,315][model8_pretrain.py][INFO] Epoch:[0/2](178200/4588595) loss:3.158 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:25:40,256][model8_pretrain.py][INFO] Epoch:[0/2](178300/4588595) loss:2.893 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:25:40,256][model8_pretrain.py][INFO] Epoch:[0/2](178300/4588595) loss:2.370 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:25:40,256][model8_pretrain.py][INFO] Epoch:[0/2](178300/4588595) loss:3.422 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:25:40,256][model8_pretrain.py][INFO] Epoch:[0/2](178300/4588595) loss:2.774 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:25:40,256][model8_pretrain.py][INFO] Epoch:[0/2](178300/4588595) loss:2.637 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:25:40,256][model8_pretrain.py][INFO] Epoch:[0/2](178300/4588595) loss:3.252 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:25:40,256][model8_pretrain.py][INFO] Epoch:[0/2](178300/4588595) loss:3.100 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:25:40,256][model8_pretrain.py][INFO] Epoch:[0/2](178300/4588595) loss:2.753 lr:0.0000100 epoch_Time:27849.0min: [2024-01-03 12:26:17,199][model8_pretrain.py][INFO] 
Epoch:[0/2](178400/4588595) loss:2.846 lr:0.0000100 epoch_Time:27848.0min: [2024-01-03 12:26:17,199][model8_pretrain.py][INFO] Epoch:[0/2](178400/4588595) loss:3.076 lr:0.0000100 epoch_Time:27848.0min: [2024-01-03 12:26:17,199][model8_pretrain.py][INFO] Epoch:[0/2](178400/4588595) loss:2.565 lr:0.0000100 epoch_Time:27848.0min: [2024-01-03 12:26:17,199][model8_pretrain.py][INFO] Epoch:[0/2](178400/4588595) loss:2.796 lr:0.0000100 epoch_Time:27848.0min: [2024-01-03 12:26:17,199][model8_pretrain.py][INFO] Epoch:[0/2](178400/4588595) loss:2.991 lr:0.0000100 epoch_Time:27848.0min: [2024-01-03 12:26:17,199][model8_pretrain.py][INFO] Epoch:[0/2](178400/4588595) loss:3.218 lr:0.0000100 epoch_Time:27848.0min: [2024-01-03 12:26:17,199][model8_pretrain.py][INFO] Epoch:[0/2](178400/4588595) loss:2.912 lr:0.0000100 epoch_Time:27848.0min: [2024-01-03 12:26:17,199][model8_pretrain.py][INFO] Epoch:[0/2](178400/4588595) loss:2.550 lr:0.0000100 epoch_Time:27848.0min: [2024-01-03 12:26:54,137][model8_pretrain.py][INFO] Epoch:[0/2](178500/4588595) loss:2.987 lr:0.0000100 epoch_Time:27846.0min: [2024-01-03 12:26:54,137][model8_pretrain.py][INFO] Epoch:[0/2](178500/4588595) loss:3.746 lr:0.0000100 epoch_Time:27846.0min: [2024-01-03 12:26:54,137][model8_pretrain.py][INFO] Epoch:[0/2](178500/4588595) loss:2.870 lr:0.0000100 epoch_Time:27846.0min: [2024-01-03 12:26:54,138][model8_pretrain.py][INFO] Epoch:[0/2](178500/4588595) loss:2.861 lr:0.0000100 epoch_Time:27846.0min: [2024-01-03 12:26:54,138][model8_pretrain.py][INFO] Epoch:[0/2](178500/4588595) loss:2.816 lr:0.0000100 epoch_Time:27846.0min: [2024-01-03 12:26:54,138][model8_pretrain.py][INFO] Epoch:[0/2](178500/4588595) loss:2.937 lr:0.0000100 epoch_Time:27846.0min: [2024-01-03 12:26:54,138][model8_pretrain.py][INFO] Epoch:[0/2](178500/4588595) loss:3.104 lr:0.0000100 epoch_Time:27846.0min: [2024-01-03 12:26:54,138][model8_pretrain.py][INFO] Epoch:[0/2](178500/4588595) loss:3.058 lr:0.0000100 epoch_Time:27846.0min: [2024-01-03 
12:27:31,073][model8_pretrain.py][INFO] Epoch:[0/2](178600/4588595) loss:2.812 lr:0.0000100 epoch_Time:27846.0min: [2024-01-03 12:27:31,073][model8_pretrain.py][INFO] Epoch:[0/2](178600/4588595) loss:3.387 lr:0.0000100 epoch_Time:27846.0min: [2024-01-03 12:27:31,073][model8_pretrain.py][INFO] Epoch:[0/2](178600/4588595) loss:3.473 lr:0.0000100 epoch_Time:27846.0min: [2024-01-03 12:27:31,073][model8_pretrain.py][INFO] Epoch:[0/2](178600/4588595) loss:3.263 lr:0.0000100 epoch_Time:27846.0min: [2024-01-03 12:27:31,073][model8_pretrain.py][INFO] Epoch:[0/2](178600/4588595) loss:3.001 lr:0.0000100 epoch_Time:27846.0min: [2024-01-03 12:27:31,073][model8_pretrain.py][INFO] Epoch:[0/2](178600/4588595) loss:2.826 lr:0.0000100 epoch_Time:27846.0min: [2024-01-03 12:27:31,073][model8_pretrain.py][INFO] Epoch:[0/2](178600/4588595) loss:3.297 lr:0.0000100 epoch_Time:27846.0min: [2024-01-03 12:27:31,073][model8_pretrain.py][INFO] Epoch:[0/2](178600/4588595) loss:3.052 lr:0.0000100 epoch_Time:27846.0min: [2024-01-03 12:28:08,016][model8_pretrain.py][INFO] Epoch:[0/2](178700/4588595) loss:2.596 lr:0.0000100 epoch_Time:27844.0min: [2024-01-03 12:28:08,016][model8_pretrain.py][INFO] Epoch:[0/2](178700/4588595) loss:3.355 lr:0.0000100 epoch_Time:27844.0min: [2024-01-03 12:28:08,016][model8_pretrain.py][INFO] Epoch:[0/2](178700/4588595) loss:2.921 lr:0.0000100 epoch_Time:27844.0min: [2024-01-03 12:28:08,016][model8_pretrain.py][INFO] Epoch:[0/2](178700/4588595) loss:2.952 lr:0.0000100 epoch_Time:27844.0min: [2024-01-03 12:28:08,016][model8_pretrain.py][INFO] Epoch:[0/2](178700/4588595) loss:3.156 lr:0.0000100 epoch_Time:27844.0min: [2024-01-03 12:28:08,016][model8_pretrain.py][INFO] Epoch:[0/2](178700/4588595) loss:2.908 lr:0.0000100 epoch_Time:27844.0min: [2024-01-03 12:28:08,016][model8_pretrain.py][INFO] Epoch:[0/2](178700/4588595) loss:3.300 lr:0.0000100 epoch_Time:27844.0min: [2024-01-03 12:28:08,016][model8_pretrain.py][INFO] Epoch:[0/2](178700/4588595) loss:3.166 lr:0.0000100 
epoch_Time:27844.0min: [2024-01-03 12:28:44,939][model8_pretrain.py][INFO] Epoch:[0/2](178800/4588595) loss:3.072 lr:0.0000100 epoch_Time:27844.0min: [2024-01-03 12:28:44,939][model8_pretrain.py][INFO] Epoch:[0/2](178800/4588595) loss:2.586 lr:0.0000100 epoch_Time:27844.0min: [2024-01-03 12:28:44,939][model8_pretrain.py][INFO] Epoch:[0/2](178800/4588595) loss:3.262 lr:0.0000100 epoch_Time:27844.0min: [2024-01-03 12:28:44,939][model8_pretrain.py][INFO] Epoch:[0/2](178800/4588595) loss:2.622 lr:0.0000100 epoch_Time:27844.0min: [2024-01-03 12:28:44,939][model8_pretrain.py][INFO] Epoch:[0/2](178800/4588595) loss:3.055 lr:0.0000100 epoch_Time:27844.0min: [2024-01-03 12:28:44,939][model8_pretrain.py][INFO] Epoch:[0/2](178800/4588595) loss:2.404 lr:0.0000100 epoch_Time:27844.0min: [2024-01-03 12:28:44,940][model8_pretrain.py][INFO] Epoch:[0/2](178800/4588595) loss:2.749 lr:0.0000100 epoch_Time:27844.0min: [2024-01-03 12:28:44,940][model8_pretrain.py][INFO] Epoch:[0/2](178800/4588595) loss:3.327 lr:0.0000100 epoch_Time:27844.0min: [2024-01-03 12:29:30,549][model8_pretrain.py][INFO] Epoch:[0/2](178900/4588595) loss:2.961 lr:0.0000100 epoch_Time:27846.0min: [2024-01-03 12:29:30,549][model8_pretrain.py][INFO] Epoch:[0/2](178900/4588595) loss:3.167 lr:0.0000100 epoch_Time:27846.0min: [2024-01-03 12:29:30,549][model8_pretrain.py][INFO] Epoch:[0/2](178900/4588595) loss:3.045 lr:0.0000100 epoch_Time:27846.0min: [2024-01-03 12:29:30,549][model8_pretrain.py][INFO] Epoch:[0/2](178900/4588595) loss:2.611 lr:0.0000100 epoch_Time:27846.0min: [2024-01-03 12:29:30,549][model8_pretrain.py][INFO] Epoch:[0/2](178900/4588595) loss:2.805 lr:0.0000100 epoch_Time:27846.0min: [2024-01-03 12:29:30,549][model8_pretrain.py][INFO] Epoch:[0/2](178900/4588595) loss:2.943 lr:0.0000100 epoch_Time:27846.0min: [2024-01-03 12:29:30,549][model8_pretrain.py][INFO] Epoch:[0/2](178900/4588595) loss:2.856 lr:0.0000100 epoch_Time:27846.0min: [2024-01-03 12:29:30,549][model8_pretrain.py][INFO] 
Epoch:[0/2](178900/4588595) loss:2.994 lr:0.0000100 epoch_Time:27846.0min: [2024-01-03 12:30:07,482][model8_pretrain.py][INFO] Epoch:[0/2](179000/4588595) loss:3.337 lr:0.0000100 epoch_Time:27845.0min: [2024-01-03 12:30:07,482][model8_pretrain.py][INFO] Epoch:[0/2](179000/4588595) loss:3.265 lr:0.0000100 epoch_Time:27845.0min: [2024-01-03 12:30:07,482][model8_pretrain.py][INFO] Epoch:[0/2](179000/4588595) loss:3.054 lr:0.0000100 epoch_Time:27845.0min: [2024-01-03 12:30:07,482][model8_pretrain.py][INFO] Epoch:[0/2](179000/4588595) loss:3.480 lr:0.0000100 epoch_Time:27845.0min: [2024-01-03 12:30:07,482][model8_pretrain.py][INFO] Epoch:[0/2](179000/4588595) loss:2.607 lr:0.0000100 epoch_Time:27845.0min: [2024-01-03 12:30:07,482][model8_pretrain.py][INFO] Epoch:[0/2](179000/4588595) loss:2.814 lr:0.0000100 epoch_Time:27845.0min: [2024-01-03 12:30:07,482][model8_pretrain.py][INFO] Epoch:[0/2](179000/4588595) loss:2.869 lr:0.0000100 epoch_Time:27845.0min: [2024-01-03 12:30:07,482][model8_pretrain.py][INFO] Epoch:[0/2](179000/4588595) loss:2.500 lr:0.0000100 epoch_Time:27845.0min: [2024-01-03 12:30:44,425][model8_pretrain.py][INFO] Epoch:[0/2](179100/4588595) loss:3.138 lr:0.0000100 epoch_Time:27845.0min: [2024-01-03 12:30:44,425][model8_pretrain.py][INFO] Epoch:[0/2](179100/4588595) loss:3.038 lr:0.0000100 epoch_Time:27845.0min: [2024-01-03 12:30:44,425][model8_pretrain.py][INFO] Epoch:[0/2](179100/4588595) loss:3.019 lr:0.0000100 epoch_Time:27845.0min: [2024-01-03 12:30:44,425][model8_pretrain.py][INFO] Epoch:[0/2](179100/4588595) loss:3.121 lr:0.0000100 epoch_Time:27845.0min: [2024-01-03 12:30:44,425][model8_pretrain.py][INFO] Epoch:[0/2](179100/4588595) loss:2.265 lr:0.0000100 epoch_Time:27845.0min: [2024-01-03 12:30:44,425][model8_pretrain.py][INFO] Epoch:[0/2](179100/4588595) loss:2.890 lr:0.0000100 epoch_Time:27845.0min: [2024-01-03 12:30:44,425][model8_pretrain.py][INFO] Epoch:[0/2](179100/4588595) loss:2.869 lr:0.0000100 epoch_Time:27845.0min: [2024-01-03 
12:30:44,425][model8_pretrain.py][INFO] Epoch:[0/2](179100/4588595) loss:3.168 lr:0.0000100 epoch_Time:27845.0min: [2024-01-03 12:31:21,380][model8_pretrain.py][INFO] Epoch:[0/2](179200/4588595) loss:2.965 lr:0.0000100 epoch_Time:27843.0min: [2024-01-03 12:31:21,380][model8_pretrain.py][INFO] Epoch:[0/2](179200/4588595) loss:2.796 lr:0.0000100 epoch_Time:27843.0min: [2024-01-03 12:31:21,380][model8_pretrain.py][INFO] Epoch:[0/2](179200/4588595) loss:3.257 lr:0.0000100 epoch_Time:27843.0min: [2024-01-03 12:31:21,380][model8_pretrain.py][INFO] Epoch:[0/2](179200/4588595) loss:2.716 lr:0.0000100 epoch_Time:27843.0min: [2024-01-03 12:31:21,380][model8_pretrain.py][INFO] Epoch:[0/2](179200/4588595) loss:2.983 lr:0.0000100 epoch_Time:27843.0min: [2024-01-03 12:31:21,380][model8_pretrain.py][INFO] Epoch:[0/2](179200/4588595) loss:2.847 lr:0.0000100 epoch_Time:27843.0min: [2024-01-03 12:31:21,381][model8_pretrain.py][INFO] Epoch:[0/2](179200/4588595) loss:2.808 lr:0.0000100 epoch_Time:27843.0min: [2024-01-03 12:31:21,381][model8_pretrain.py][INFO] Epoch:[0/2](179200/4588595) loss:2.728 lr:0.0000100 epoch_Time:27843.0min: [2024-01-03 12:31:58,319][model8_pretrain.py][INFO] Epoch:[0/2](179300/4588595) loss:2.983 lr:0.0000100 epoch_Time:27842.0min: [2024-01-03 12:31:58,319][model8_pretrain.py][INFO] Epoch:[0/2](179300/4588595) loss:2.936 lr:0.0000100 epoch_Time:27842.0min: [2024-01-03 12:31:58,320][model8_pretrain.py][INFO] Epoch:[0/2](179300/4588595) loss:2.842 lr:0.0000100 epoch_Time:27842.0min: [2024-01-03 12:31:58,320][model8_pretrain.py][INFO] Epoch:[0/2](179300/4588595) loss:3.558 lr:0.0000100 epoch_Time:27842.0min: [2024-01-03 12:31:58,320][model8_pretrain.py][INFO] Epoch:[0/2](179300/4588595) loss:3.454 lr:0.0000100 epoch_Time:27842.0min: [2024-01-03 12:31:58,320][model8_pretrain.py][INFO] Epoch:[0/2](179300/4588595) loss:2.416 lr:0.0000100 epoch_Time:27842.0min: [2024-01-03 12:31:58,320][model8_pretrain.py][INFO] Epoch:[0/2](179300/4588595) loss:3.166 lr:0.0000100 
epoch_Time:27842.0min: [2024-01-03 12:31:58,320][model8_pretrain.py][INFO] Epoch:[0/2](179300/4588595) loss:2.810 lr:0.0000100 epoch_Time:27842.0min: [2024-01-03 12:32:35,256][model8_pretrain.py][INFO] Epoch:[0/2](179400/4588595) loss:3.123 lr:0.0000100 epoch_Time:27841.0min: [2024-01-03 12:32:35,256][model8_pretrain.py][INFO] Epoch:[0/2](179400/4588595) loss:3.180 lr:0.0000100 epoch_Time:27841.0min: [2024-01-03 12:32:35,256][model8_pretrain.py][INFO] Epoch:[0/2](179400/4588595) loss:2.534 lr:0.0000100 epoch_Time:27841.0min: [2024-01-03 12:32:35,256][model8_pretrain.py][INFO] Epoch:[0/2](179400/4588595) loss:3.223 lr:0.0000100 epoch_Time:27841.0min: [2024-01-03 12:32:35,256][model8_pretrain.py][INFO] Epoch:[0/2](179400/4588595) loss:3.169 lr:0.0000100 epoch_Time:27841.0min: [2024-01-03 12:32:35,256][model8_pretrain.py][INFO] Epoch:[0/2](179400/4588595) loss:3.279 lr:0.0000100 epoch_Time:27841.0min: [2024-01-03 12:32:35,256][model8_pretrain.py][INFO] Epoch:[0/2](179400/4588595) loss:2.496 lr:0.0000100 epoch_Time:27841.0min: [2024-01-03 12:32:35,257][model8_pretrain.py][INFO] Epoch:[0/2](179400/4588595) loss:2.574 lr:0.0000100 epoch_Time:27841.0min: [2024-01-03 12:33:12,189][model8_pretrain.py][INFO] Epoch:[0/2](179500/4588595) loss:2.350 lr:0.0000100 epoch_Time:27840.0min: [2024-01-03 12:33:12,189][model8_pretrain.py][INFO] Epoch:[0/2](179500/4588595) loss:2.578 lr:0.0000100 epoch_Time:27840.0min: [2024-01-03 12:33:12,189][model8_pretrain.py][INFO] Epoch:[0/2](179500/4588595) loss:2.910 lr:0.0000100 epoch_Time:27840.0min: [2024-01-03 12:33:12,189][model8_pretrain.py][INFO] Epoch:[0/2](179500/4588595) loss:2.337 lr:0.0000100 epoch_Time:27840.0min: [2024-01-03 12:33:12,189][model8_pretrain.py][INFO] Epoch:[0/2](179500/4588595) loss:3.075 lr:0.0000100 epoch_Time:27840.0min: [2024-01-03 12:33:12,189][model8_pretrain.py][INFO] Epoch:[0/2](179500/4588595) loss:2.193 lr:0.0000100 epoch_Time:27840.0min: [2024-01-03 12:33:12,189][model8_pretrain.py][INFO] 
Epoch:[0/2](179500/4588595) loss:2.890 lr:0.0000100 epoch_Time:27840.0min: [2024-01-03 12:33:12,189][model8_pretrain.py][INFO] Epoch:[0/2](179500/4588595) loss:2.678 lr:0.0000100 epoch_Time:27840.0min: [2024-01-03 12:33:49,117][model8_pretrain.py][INFO] Epoch:[0/2](179600/4588595) loss:3.092 lr:0.0000100 epoch_Time:27839.0min: [2024-01-03 12:33:49,117][model8_pretrain.py][INFO] Epoch:[0/2](179600/4588595) loss:2.575 lr:0.0000100 epoch_Time:27839.0min: [2024-01-03 12:33:49,117][model8_pretrain.py][INFO] Epoch:[0/2](179600/4588595) loss:2.505 lr:0.0000100 epoch_Time:27839.0min: [2024-01-03 12:33:49,117][model8_pretrain.py][INFO] Epoch:[0/2](179600/4588595) loss:2.766 lr:0.0000100 epoch_Time:27839.0min: [2024-01-03 12:33:49,117][model8_pretrain.py][INFO] Epoch:[0/2](179600/4588595) loss:3.267 lr:0.0000100 epoch_Time:27839.0min: [2024-01-03 12:33:49,117][model8_pretrain.py][INFO] Epoch:[0/2](179600/4588595) loss:3.025 lr:0.0000100 epoch_Time:27839.0min: [2024-01-03 12:33:49,118][model8_pretrain.py][INFO] Epoch:[0/2](179600/4588595) loss:2.957 lr:0.0000100 epoch_Time:27839.0min: [2024-01-03 12:33:49,118][model8_pretrain.py][INFO] Epoch:[0/2](179600/4588595) loss:2.763 lr:0.0000100 epoch_Time:27839.0min: [2024-01-03 12:34:34,835][model8_pretrain.py][INFO] Epoch:[0/2](179700/4588595) loss:2.829 lr:0.0000100 epoch_Time:27842.0min: [2024-01-03 12:34:34,835][model8_pretrain.py][INFO] Epoch:[0/2](179700/4588595) loss:2.875 lr:0.0000100 epoch_Time:27842.0min: [2024-01-03 12:34:34,835][model8_pretrain.py][INFO] Epoch:[0/2](179700/4588595) loss:2.958 lr:0.0000100 epoch_Time:27842.0min: [2024-01-03 12:34:34,835][model8_pretrain.py][INFO] Epoch:[0/2](179700/4588595) loss:2.964 lr:0.0000100 epoch_Time:27842.0min: [2024-01-03 12:34:34,835][model8_pretrain.py][INFO] Epoch:[0/2](179700/4588595) loss:3.257 lr:0.0000100 epoch_Time:27842.0min: [2024-01-03 12:34:34,835][model8_pretrain.py][INFO] Epoch:[0/2](179700/4588595) loss:2.914 lr:0.0000100 epoch_Time:27842.0min: [2024-01-03 
12:34:34,835][model8_pretrain.py][INFO] Epoch:[0/2](179700/4588595) loss:3.095 lr:0.0000100 epoch_Time:27842.0min: [2024-01-03 12:34:34,835][model8_pretrain.py][INFO] Epoch:[0/2](179700/4588595) loss:3.276 lr:0.0000100 epoch_Time:27842.0min: [2024-01-03 12:35:11,763][model8_pretrain.py][INFO] Epoch:[0/2](179800/4588595) loss:2.948 lr:0.0000100 epoch_Time:27840.0min: [2024-01-03 12:35:11,763][model8_pretrain.py][INFO] Epoch:[0/2](179800/4588595) loss:3.020 lr:0.0000100 epoch_Time:27840.0min: [2024-01-03 12:35:11,763][model8_pretrain.py][INFO] Epoch:[0/2](179800/4588595) loss:3.193 lr:0.0000100 epoch_Time:27840.0min: [2024-01-03 12:35:11,763][model8_pretrain.py][INFO] Epoch:[0/2](179800/4588595) loss:2.631 lr:0.0000100 epoch_Time:27840.0min: [2024-01-03 12:35:11,763][model8_pretrain.py][INFO] Epoch:[0/2](179800/4588595) loss:2.842 lr:0.0000100 epoch_Time:27840.0min: [2024-01-03 12:35:11,763][model8_pretrain.py][INFO] Epoch:[0/2](179800/4588595) loss:2.872 lr:0.0000100 epoch_Time:27840.0min: [2024-01-03 12:35:11,763][model8_pretrain.py][INFO] Epoch:[0/2](179800/4588595) loss:3.014 lr:0.0000100 epoch_Time:27840.0min: [2024-01-03 12:35:11,763][model8_pretrain.py][INFO] Epoch:[0/2](179800/4588595) loss:2.354 lr:0.0000100 epoch_Time:27840.0min: [2024-01-03 12:35:48,706][model8_pretrain.py][INFO] Epoch:[0/2](179900/4588595) loss:3.114 lr:0.0000100 epoch_Time:27839.0min: [2024-01-03 12:35:48,706][model8_pretrain.py][INFO] Epoch:[0/2](179900/4588595) loss:2.830 lr:0.0000100 epoch_Time:27839.0min: [2024-01-03 12:35:48,706][model8_pretrain.py][INFO] Epoch:[0/2](179900/4588595) loss:1.993 lr:0.0000100 epoch_Time:27839.0min: [2024-01-03 12:35:48,706][model8_pretrain.py][INFO] Epoch:[0/2](179900/4588595) loss:2.841 lr:0.0000100 epoch_Time:27839.0min: [2024-01-03 12:35:48,706][model8_pretrain.py][INFO] Epoch:[0/2](179900/4588595) loss:3.298 lr:0.0000100 epoch_Time:27839.0min: [2024-01-03 12:35:48,706][model8_pretrain.py][INFO] Epoch:[0/2](179900/4588595) loss:2.317 lr:0.0000100 
epoch_Time:27839.0min: [2024-01-03 12:35:48,707][model8_pretrain.py][INFO] Epoch:[0/2](179900/4588595) loss:3.463 lr:0.0000100 epoch_Time:27839.0min: [2024-01-03 12:35:48,707][model8_pretrain.py][INFO] Epoch:[0/2](179900/4588595) loss:2.592 lr:0.0000100 epoch_Time:27839.0min: [2024-01-03 12:36:25,635][model8_pretrain.py][INFO] Epoch:[0/2](180000/4588595) loss:3.293 lr:0.0000100 epoch_Time:27839.0min: [2024-01-03 12:36:25,635][model8_pretrain.py][INFO] Epoch:[0/2](180000/4588595) loss:2.538 lr:0.0000100 epoch_Time:27839.0min: [2024-01-03 12:36:25,635][model8_pretrain.py][INFO] Epoch:[0/2](180000/4588595) loss:2.887 lr:0.0000100 epoch_Time:27839.0min: [2024-01-03 12:36:25,635][model8_pretrain.py][INFO] Epoch:[0/2](180000/4588595) loss:3.171 lr:0.0000100 epoch_Time:27839.0min: [2024-01-03 12:36:25,635][model8_pretrain.py][INFO] Epoch:[0/2](180000/4588595) loss:3.205 lr:0.0000100 epoch_Time:27839.0min: [2024-01-03 12:36:25,635][model8_pretrain.py][INFO] Epoch:[0/2](180000/4588595) loss:2.789 lr:0.0000100 epoch_Time:27839.0min: [2024-01-03 12:36:25,635][model8_pretrain.py][INFO] Epoch:[0/2](180000/4588595) loss:3.004 lr:0.0000100 epoch_Time:27839.0min: [2024-01-03 12:36:25,635][model8_pretrain.py][INFO] Epoch:[0/2](180000/4588595) loss:3.074 lr:0.0000100 epoch_Time:27839.0min: [2024-01-03 12:37:02,567][model8_pretrain.py][INFO] Epoch:[0/2](180100/4588595) loss:3.113 lr:0.0000100 epoch_Time:27837.0min: [2024-01-03 12:37:02,567][model8_pretrain.py][INFO] Epoch:[0/2](180100/4588595) loss:3.168 lr:0.0000100 epoch_Time:27837.0min: [2024-01-03 12:37:02,567][model8_pretrain.py][INFO] Epoch:[0/2](180100/4588595) loss:3.085 lr:0.0000100 epoch_Time:27837.0min: [2024-01-03 12:37:02,567][model8_pretrain.py][INFO] Epoch:[0/2](180100/4588595) loss:3.021 lr:0.0000100 epoch_Time:27837.0min: [2024-01-03 12:37:02,567][model8_pretrain.py][INFO] Epoch:[0/2](180100/4588595) loss:2.092 lr:0.0000100 epoch_Time:27837.0min: [2024-01-03 12:37:02,567][model8_pretrain.py][INFO] 
Epoch:[0/2](180100/4588595) loss:2.789 lr:0.0000100 epoch_Time:27837.0min: [2024-01-03 12:37:02,567][model8_pretrain.py][INFO] Epoch:[0/2](180100/4588595) loss:2.899 lr:0.0000100 epoch_Time:27837.0min: [2024-01-03 12:37:02,567][model8_pretrain.py][INFO] Epoch:[0/2](180100/4588595) loss:2.084 lr:0.0000100 epoch_Time:27837.0min: [2024-01-03 12:37:39,503][model8_pretrain.py][INFO] Epoch:[0/2](180200/4588595) loss:2.959 lr:0.0000100 epoch_Time:27837.0min: [2024-01-03 12:37:39,503][model8_pretrain.py][INFO] Epoch:[0/2](180200/4588595) loss:3.066 lr:0.0000100 epoch_Time:27837.0min: [2024-01-03 12:37:39,503][model8_pretrain.py][INFO] Epoch:[0/2](180200/4588595) loss:3.048 lr:0.0000100 epoch_Time:27837.0min: [2024-01-03 12:37:39,503][model8_pretrain.py][INFO] Epoch:[0/2](180200/4588595) loss:2.552 lr:0.0000100 epoch_Time:27837.0min: [2024-01-03 12:37:39,503][model8_pretrain.py][INFO] Epoch:[0/2](180200/4588595) loss:3.221 lr:0.0000100 epoch_Time:27837.0min: [2024-01-03 12:37:39,503][model8_pretrain.py][INFO] Epoch:[0/2](180200/4588595) loss:2.387 lr:0.0000100 epoch_Time:27837.0min: [2024-01-03 12:37:39,503][model8_pretrain.py][INFO] Epoch:[0/2](180200/4588595) loss:2.946 lr:0.0000100 epoch_Time:27837.0min: [2024-01-03 12:37:39,504][model8_pretrain.py][INFO] Epoch:[0/2](180200/4588595) loss:3.140 lr:0.0000100 epoch_Time:27837.0min: [2024-01-03 12:38:16,436][model8_pretrain.py][INFO] Epoch:[0/2](180300/4588595) loss:3.057 lr:0.0000100 epoch_Time:27835.0min: [2024-01-03 12:38:16,436][model8_pretrain.py][INFO] Epoch:[0/2](180300/4588595) loss:3.159 lr:0.0000100 epoch_Time:27835.0min: [2024-01-03 12:38:16,436][model8_pretrain.py][INFO] Epoch:[0/2](180300/4588595) loss:3.165 lr:0.0000100 epoch_Time:27835.0min: [2024-01-03 12:38:16,436][model8_pretrain.py][INFO] Epoch:[0/2](180300/4588595) loss:2.635 lr:0.0000100 epoch_Time:27835.0min: [2024-01-03 12:38:16,436][model8_pretrain.py][INFO] Epoch:[0/2](180300/4588595) loss:2.886 lr:0.0000100 epoch_Time:27835.0min: [2024-01-03 
12:38:16,436][model8_pretrain.py][INFO] Epoch:[0/2](180300/4588595) loss:2.972 lr:0.0000100 epoch_Time:27835.0min: [2024-01-03 12:38:16,436][model8_pretrain.py][INFO] Epoch:[0/2](180300/4588595) loss:2.788 lr:0.0000100 epoch_Time:27835.0min: [2024-01-03 12:38:16,437][model8_pretrain.py][INFO] Epoch:[0/2](180300/4588595) loss:2.902 lr:0.0000100 epoch_Time:27835.0min: [2024-01-03 12:38:53,370][model8_pretrain.py][INFO] Epoch:[0/2](180400/4588595) loss:3.302 lr:0.0000100 epoch_Time:27834.0min: [2024-01-03 12:38:53,370][model8_pretrain.py][INFO] Epoch:[0/2](180400/4588595) loss:3.187 lr:0.0000100 epoch_Time:27834.0min: [2024-01-03 12:38:53,370][model8_pretrain.py][INFO] Epoch:[0/2](180400/4588595) loss:3.193 lr:0.0000100 epoch_Time:27834.0min: [2024-01-03 12:38:53,370][model8_pretrain.py][INFO] Epoch:[0/2](180400/4588595) loss:2.513 lr:0.0000100 epoch_Time:27834.0min: [2024-01-03 12:38:53,370][model8_pretrain.py][INFO] Epoch:[0/2](180400/4588595) loss:3.284 lr:0.0000100 epoch_Time:27834.0min: [2024-01-03 12:38:53,370][model8_pretrain.py][INFO] Epoch:[0/2](180400/4588595) loss:3.014 lr:0.0000100 epoch_Time:27834.0min: [2024-01-03 12:38:53,370][model8_pretrain.py][INFO] Epoch:[0/2](180400/4588595) loss:2.697 lr:0.0000100 epoch_Time:27834.0min: [2024-01-03 12:38:53,371][model8_pretrain.py][INFO] Epoch:[0/2](180400/4588595) loss:2.088 lr:0.0000100 epoch_Time:27834.0min: [2024-01-03 12:39:39,066][model8_pretrain.py][INFO] Epoch:[0/2](180500/4588595) loss:3.023 lr:0.0000100 epoch_Time:27837.0min: [2024-01-03 12:39:39,066][model8_pretrain.py][INFO] Epoch:[0/2](180500/4588595) loss:3.533 lr:0.0000100 epoch_Time:27837.0min: [2024-01-03 12:39:39,066][model8_pretrain.py][INFO] Epoch:[0/2](180500/4588595) loss:3.435 lr:0.0000100 epoch_Time:27837.0min: [2024-01-03 12:39:39,066][model8_pretrain.py][INFO] Epoch:[0/2](180500/4588595) loss:2.798 lr:0.0000100 epoch_Time:27837.0min: [2024-01-03 12:39:39,066][model8_pretrain.py][INFO] Epoch:[0/2](180500/4588595) loss:3.436 lr:0.0000100 
epoch_Time:27837.0min: [2024-01-03 12:39:39,066][model8_pretrain.py][INFO] Epoch:[0/2](180500/4588595) loss:3.052 lr:0.0000100 epoch_Time:27837.0min: [2024-01-03 12:39:39,066][model8_pretrain.py][INFO] Epoch:[0/2](180500/4588595) loss:3.209 lr:0.0000100 epoch_Time:27837.0min: [2024-01-03 12:39:39,066][model8_pretrain.py][INFO] Epoch:[0/2](180500/4588595) loss:2.857 lr:0.0000100 epoch_Time:27837.0min: [2024-01-03 12:40:15,991][model8_pretrain.py][INFO] Epoch:[0/2](180600/4588595) loss:2.895 lr:0.0000100 epoch_Time:27836.0min: [2024-01-03 12:40:15,991][model8_pretrain.py][INFO] Epoch:[0/2](180600/4588595) loss:3.316 lr:0.0000100 epoch_Time:27836.0min: [2024-01-03 12:40:15,991][model8_pretrain.py][INFO] Epoch:[0/2](180600/4588595) loss:3.187 lr:0.0000100 epoch_Time:27836.0min: [2024-01-03 12:40:15,991][model8_pretrain.py][INFO] Epoch:[0/2](180600/4588595) loss:2.955 lr:0.0000100 epoch_Time:27836.0min: [2024-01-03 12:40:15,991][model8_pretrain.py][INFO] Epoch:[0/2](180600/4588595) loss:2.828 lr:0.0000100 epoch_Time:27836.0min: [2024-01-03 12:40:15,991][model8_pretrain.py][INFO] Epoch:[0/2](180600/4588595) loss:3.222 lr:0.0000100 epoch_Time:27836.0min: [2024-01-03 12:40:15,991][model8_pretrain.py][INFO] Epoch:[0/2](180600/4588595) loss:3.125 lr:0.0000100 epoch_Time:27836.0min: [2024-01-03 12:40:15,991][model8_pretrain.py][INFO] Epoch:[0/2](180600/4588595) loss:2.933 lr:0.0000100 epoch_Time:27836.0min: [2024-01-03 12:40:52,924][model8_pretrain.py][INFO] Epoch:[0/2](180700/4588595) loss:2.813 lr:0.0000100 epoch_Time:27835.0min: [2024-01-03 12:40:52,924][model8_pretrain.py][INFO] Epoch:[0/2](180700/4588595) loss:3.263 lr:0.0000100 epoch_Time:27835.0min: [2024-01-03 12:40:52,924][model8_pretrain.py][INFO] Epoch:[0/2](180700/4588595) loss:2.869 lr:0.0000100 epoch_Time:27835.0min: [2024-01-03 12:40:52,924][model8_pretrain.py][INFO] Epoch:[0/2](180700/4588595) loss:2.993 lr:0.0000100 epoch_Time:27835.0min: [2024-01-03 12:40:52,924][model8_pretrain.py][INFO] 
Epoch:[0/2](180700/4588595) loss:3.338 lr:0.0000100 epoch_Time:27835.0min: [2024-01-03 12:40:52,924][model8_pretrain.py][INFO] Epoch:[0/2](180700/4588595) loss:2.719 lr:0.0000100 epoch_Time:27835.0min: [2024-01-03 12:40:52,925][model8_pretrain.py][INFO] Epoch:[0/2](180700/4588595) loss:2.966 lr:0.0000100 epoch_Time:27835.0min: [2024-01-03 12:40:52,925][model8_pretrain.py][INFO] Epoch:[0/2](180700/4588595) loss:3.028 lr:0.0000100 epoch_Time:27835.0min: [2024-01-03 12:41:29,866][model8_pretrain.py][INFO] Epoch:[0/2](180800/4588595) loss:3.371 lr:0.0000100 epoch_Time:27834.0min: [2024-01-03 12:41:29,866][model8_pretrain.py][INFO] Epoch:[0/2](180800/4588595) loss:2.962 lr:0.0000100 epoch_Time:27834.0min: [2024-01-03 12:41:29,866][model8_pretrain.py][INFO] Epoch:[0/2](180800/4588595) loss:2.794 lr:0.0000100 epoch_Time:27834.0min: [2024-01-03 12:41:29,866][model8_pretrain.py][INFO] Epoch:[0/2](180800/4588595) loss:2.693 lr:0.0000100 epoch_Time:27834.0min: [2024-01-03 12:41:29,866][model8_pretrain.py][INFO] Epoch:[0/2](180800/4588595) loss:3.457 lr:0.0000100 epoch_Time:27834.0min: [2024-01-03 12:41:29,866][model8_pretrain.py][INFO] Epoch:[0/2](180800/4588595) loss:2.661 lr:0.0000100 epoch_Time:27834.0min: [2024-01-03 12:41:29,866][model8_pretrain.py][INFO] Epoch:[0/2](180800/4588595) loss:2.812 lr:0.0000100 epoch_Time:27834.0min: [2024-01-03 12:41:29,866][model8_pretrain.py][INFO] Epoch:[0/2](180800/4588595) loss:3.394 lr:0.0000100 epoch_Time:27834.0min: [2024-01-03 12:42:06,797][model8_pretrain.py][INFO] Epoch:[0/2](180900/4588595) loss:3.314 lr:0.0000100 epoch_Time:27833.0min: [2024-01-03 12:42:06,797][model8_pretrain.py][INFO] Epoch:[0/2](180900/4588595) loss:2.860 lr:0.0000100 epoch_Time:27833.0min: [2024-01-03 12:42:06,797][model8_pretrain.py][INFO] Epoch:[0/2](180900/4588595) loss:2.833 lr:0.0000100 epoch_Time:27833.0min: [2024-01-03 12:42:06,797][model8_pretrain.py][INFO] Epoch:[0/2](180900/4588595) loss:3.429 lr:0.0000100 epoch_Time:27833.0min: [2024-01-03 
12:42:06,797][model8_pretrain.py][INFO] Epoch:[0/2](180900/4588595) loss:2.976 lr:0.0000100 epoch_Time:27833.0min: [2024-01-03 12:42:06,797][model8_pretrain.py][INFO] Epoch:[0/2](180900/4588595) loss:2.825 lr:0.0000100 epoch_Time:27833.0min: [2024-01-03 12:42:06,797][model8_pretrain.py][INFO] Epoch:[0/2](180900/4588595) loss:2.971 lr:0.0000100 epoch_Time:27833.0min: [2024-01-03 12:42:06,797][model8_pretrain.py][INFO] Epoch:[0/2](180900/4588595) loss:2.608 lr:0.0000100 epoch_Time:27833.0min: [2024-01-03 12:42:43,715][model8_pretrain.py][INFO] Epoch:[0/2](181000/4588595) loss:2.756 lr:0.0000100 epoch_Time:27832.0min: [2024-01-03 12:42:43,715][model8_pretrain.py][INFO] Epoch:[0/2](181000/4588595) loss:3.268 lr:0.0000100 epoch_Time:27832.0min: [2024-01-03 12:42:43,715][model8_pretrain.py][INFO] Epoch:[0/2](181000/4588595) loss:2.500 lr:0.0000100 epoch_Time:27832.0min: [2024-01-03 12:42:43,715][model8_pretrain.py][INFO] Epoch:[0/2](181000/4588595) loss:2.876 lr:0.0000100 epoch_Time:27832.0min: [2024-01-03 12:42:43,715][model8_pretrain.py][INFO] Epoch:[0/2](181000/4588595) loss:3.495 lr:0.0000100 epoch_Time:27832.0min: [2024-01-03 12:42:43,715][model8_pretrain.py][INFO] Epoch:[0/2](181000/4588595) loss:3.193 lr:0.0000100 epoch_Time:27832.0min: [2024-01-03 12:42:43,715][model8_pretrain.py][INFO] Epoch:[0/2](181000/4588595) loss:2.899 lr:0.0000100 epoch_Time:27832.0min: [2024-01-03 12:42:43,715][model8_pretrain.py][INFO] Epoch:[0/2](181000/4588595) loss:2.933 lr:0.0000100 epoch_Time:27832.0min: [2024-01-03 12:43:20,651][model8_pretrain.py][INFO] Epoch:[0/2](181100/4588595) loss:2.891 lr:0.0000100 epoch_Time:27831.0min: [2024-01-03 12:43:20,651][model8_pretrain.py][INFO] Epoch:[0/2](181100/4588595) loss:2.901 lr:0.0000100 epoch_Time:27831.0min: [2024-01-03 12:43:20,651][model8_pretrain.py][INFO] Epoch:[0/2](181100/4588595) loss:2.988 lr:0.0000100 epoch_Time:27831.0min: [2024-01-03 12:43:20,651][model8_pretrain.py][INFO] Epoch:[0/2](181100/4588595) loss:3.376 lr:0.0000100 
epoch_Time:27831.0min: [2024-01-03 12:43:20,651][model8_pretrain.py][INFO] Epoch:[0/2](181100/4588595) loss:3.451 lr:0.0000100 epoch_Time:27831.0min: [2024-01-03 12:43:20,651][model8_pretrain.py][INFO] Epoch:[0/2](181100/4588595) loss:3.013 lr:0.0000100 epoch_Time:27831.0min: [2024-01-03 12:43:20,651][model8_pretrain.py][INFO] Epoch:[0/2](181100/4588595) loss:3.178 lr:0.0000100 epoch_Time:27831.0min: [2024-01-03 12:43:20,651][model8_pretrain.py][INFO] Epoch:[0/2](181100/4588595) loss:3.120 lr:0.0000100 epoch_Time:27831.0min: [2024-01-03 12:43:57,581][model8_pretrain.py][INFO] Epoch:[0/2](181200/4588595) loss:2.874 lr:0.0000100 epoch_Time:27829.0min: [2024-01-03 12:43:57,581][model8_pretrain.py][INFO] Epoch:[0/2](181200/4588595) loss:2.687 lr:0.0000100 epoch_Time:27829.0min: [2024-01-03 12:43:57,581][model8_pretrain.py][INFO] Epoch:[0/2](181200/4588595) loss:3.245 lr:0.0000100 epoch_Time:27829.0min: [2024-01-03 12:43:57,581][model8_pretrain.py][INFO] Epoch:[0/2](181200/4588595) loss:2.332 lr:0.0000100 epoch_Time:27829.0min: [2024-01-03 12:43:57,581][model8_pretrain.py][INFO] Epoch:[0/2](181200/4588595) loss:3.013 lr:0.0000100 epoch_Time:27829.0min: [2024-01-03 12:43:57,581][model8_pretrain.py][INFO] Epoch:[0/2](181200/4588595) loss:2.945 lr:0.0000100 epoch_Time:27829.0min: [2024-01-03 12:43:57,582][model8_pretrain.py][INFO] Epoch:[0/2](181200/4588595) loss:3.307 lr:0.0000100 epoch_Time:27830.0min: [2024-01-03 12:43:57,582][model8_pretrain.py][INFO] Epoch:[0/2](181200/4588595) loss:2.893 lr:0.0000100 epoch_Time:27829.0min: [2024-01-03 12:44:43,179][model8_pretrain.py][INFO] Epoch:[0/2](181300/4588595) loss:2.518 lr:0.0000100 epoch_Time:27833.0min: [2024-01-03 12:44:43,179][model8_pretrain.py][INFO] Epoch:[0/2](181300/4588595) loss:3.072 lr:0.0000100 epoch_Time:27833.0min: [2024-01-03 12:44:43,179][model8_pretrain.py][INFO] Epoch:[0/2](181300/4588595) loss:2.474 lr:0.0000100 epoch_Time:27833.0min: [2024-01-03 12:44:43,179][model8_pretrain.py][INFO] 
Epoch:[0/2](181300/4588595) loss:3.366 lr:0.0000100 epoch_Time:27833.0min: [2024-01-03 12:44:43,179][model8_pretrain.py][INFO] Epoch:[0/2](181300/4588595) loss:3.270 lr:0.0000100 epoch_Time:27833.0min: [2024-01-03 12:44:43,179][model8_pretrain.py][INFO] Epoch:[0/2](181300/4588595) loss:3.118 lr:0.0000100 epoch_Time:27833.0min: [2024-01-03 12:44:43,179][model8_pretrain.py][INFO] Epoch:[0/2](181300/4588595) loss:3.652 lr:0.0000100 epoch_Time:27833.0min: [2024-01-03 12:44:43,179][model8_pretrain.py][INFO] Epoch:[0/2](181300/4588595) loss:3.006 lr:0.0000100 epoch_Time:27833.0min: [2024-01-03 12:45:20,101][model8_pretrain.py][INFO] Epoch:[0/2](181400/4588595) loss:3.157 lr:0.0000100 epoch_Time:27831.0min: [2024-01-03 12:45:20,101][model8_pretrain.py][INFO] Epoch:[0/2](181400/4588595) loss:3.284 lr:0.0000100 epoch_Time:27831.0min: [2024-01-03 12:45:20,101][model8_pretrain.py][INFO] Epoch:[0/2](181400/4588595) loss:3.008 lr:0.0000100 epoch_Time:27831.0min: [2024-01-03 12:45:20,101][model8_pretrain.py][INFO] Epoch:[0/2](181400/4588595) loss:3.048 lr:0.0000100 epoch_Time:27831.0min: [2024-01-03 12:45:20,101][model8_pretrain.py][INFO] Epoch:[0/2](181400/4588595) loss:3.216 lr:0.0000100 epoch_Time:27831.0min: [2024-01-03 12:45:20,101][model8_pretrain.py][INFO] Epoch:[0/2](181400/4588595) loss:3.146 lr:0.0000100 epoch_Time:27831.0min: [2024-01-03 12:45:20,101][model8_pretrain.py][INFO] Epoch:[0/2](181400/4588595) loss:2.945 lr:0.0000100 epoch_Time:27831.0min: [2024-01-03 12:45:20,102][model8_pretrain.py][INFO] Epoch:[0/2](181400/4588595) loss:3.052 lr:0.0000100 epoch_Time:27831.0min: [2024-01-03 12:45:57,029][model8_pretrain.py][INFO] Epoch:[0/2](181500/4588595) loss:3.215 lr:0.0000100 epoch_Time:27830.0min: [2024-01-03 12:45:57,029][model8_pretrain.py][INFO] Epoch:[0/2](181500/4588595) loss:2.940 lr:0.0000100 epoch_Time:27830.0min: [2024-01-03 12:45:57,029][model8_pretrain.py][INFO] Epoch:[0/2](181500/4588595) loss:3.167 lr:0.0000100 epoch_Time:27830.0min: [2024-01-03 
12:45:57,029][model8_pretrain.py][INFO] Epoch:[0/2](181500/4588595) loss:2.984 lr:0.0000100 epoch_Time:27830.0min: [2024-01-03 12:45:57,029][model8_pretrain.py][INFO] Epoch:[0/2](181500/4588595) loss:2.994 lr:0.0000100 epoch_Time:27830.0min: [2024-01-03 12:45:57,029][model8_pretrain.py][INFO] Epoch:[0/2](181500/4588595) loss:2.789 lr:0.0000100 epoch_Time:27830.0min: [2024-01-03 12:45:57,029][model8_pretrain.py][INFO] Epoch:[0/2](181500/4588595) loss:3.265 lr:0.0000100 epoch_Time:27830.0min: [2024-01-03 12:45:57,030][model8_pretrain.py][INFO] Epoch:[0/2](181500/4588595) loss:2.616 lr:0.0000100 epoch_Time:27830.0min: [2024-01-03 12:46:33,965][model8_pretrain.py][INFO] Epoch:[0/2](181600/4588595) loss:3.092 lr:0.0000100 epoch_Time:27830.0min: [2024-01-03 12:46:33,965][model8_pretrain.py][INFO] Epoch:[0/2](181600/4588595) loss:3.372 lr:0.0000100 epoch_Time:27830.0min: [2024-01-03 12:46:33,965][model8_pretrain.py][INFO] Epoch:[0/2](181600/4588595) loss:2.935 lr:0.0000100 epoch_Time:27830.0min: [2024-01-03 12:46:33,965][model8_pretrain.py][INFO] Epoch:[0/2](181600/4588595) loss:2.663 lr:0.0000100 epoch_Time:27830.0min: [2024-01-03 12:46:33,965][model8_pretrain.py][INFO] Epoch:[0/2](181600/4588595) loss:2.428 lr:0.0000100 epoch_Time:27830.0min: [2024-01-03 12:46:33,965][model8_pretrain.py][INFO] Epoch:[0/2](181600/4588595) loss:3.247 lr:0.0000100 epoch_Time:27830.0min: [2024-01-03 12:46:33,965][model8_pretrain.py][INFO] Epoch:[0/2](181600/4588595) loss:2.977 lr:0.0000100 epoch_Time:27830.0min: [2024-01-03 12:46:33,965][model8_pretrain.py][INFO] Epoch:[0/2](181600/4588595) loss:2.759 lr:0.0000100 epoch_Time:27830.0min: [2024-01-03 12:47:10,894][model8_pretrain.py][INFO] Epoch:[0/2](181700/4588595) loss:3.054 lr:0.0000100 epoch_Time:27828.0min: [2024-01-03 12:47:10,894][model8_pretrain.py][INFO] Epoch:[0/2](181700/4588595) loss:3.295 lr:0.0000100 epoch_Time:27828.0min: [2024-01-03 12:47:10,894][model8_pretrain.py][INFO] Epoch:[0/2](181700/4588595) loss:2.931 lr:0.0000100 
epoch_Time:27828.0min: [2024-01-03 12:47:10,894][model8_pretrain.py][INFO] Epoch:[0/2](181700/4588595) loss:3.038 lr:0.0000100 epoch_Time:27828.0min: [2024-01-03 12:47:10,894][model8_pretrain.py][INFO] Epoch:[0/2](181700/4588595) loss:2.786 lr:0.0000100 epoch_Time:27828.0min: [2024-01-03 12:47:10,895][model8_pretrain.py][INFO] Epoch:[0/2](181700/4588595) loss:3.351 lr:0.0000100 epoch_Time:27828.0min: [2024-01-03 12:47:10,895][model8_pretrain.py][INFO] Epoch:[0/2](181700/4588595) loss:2.959 lr:0.0000100 epoch_Time:27828.0min: [2024-01-03 12:47:10,895][model8_pretrain.py][INFO] Epoch:[0/2](181700/4588595) loss:3.034 lr:0.0000100 epoch_Time:27828.0min: [2024-01-03 12:47:47,840][model8_pretrain.py][INFO] Epoch:[0/2](181800/4588595) loss:3.256 lr:0.0000100 epoch_Time:27827.0min: [2024-01-03 12:47:47,840][model8_pretrain.py][INFO] Epoch:[0/2](181800/4588595) loss:3.002 lr:0.0000100 epoch_Time:27827.0min: [2024-01-03 12:47:47,840][model8_pretrain.py][INFO] Epoch:[0/2](181800/4588595) loss:3.348 lr:0.0000100 epoch_Time:27827.0min: [2024-01-03 12:47:47,840][model8_pretrain.py][INFO] Epoch:[0/2](181800/4588595) loss:3.315 lr:0.0000100 epoch_Time:27827.0min: [2024-01-03 12:47:47,840][model8_pretrain.py][INFO] Epoch:[0/2](181800/4588595) loss:3.027 lr:0.0000100 epoch_Time:27827.0min: [2024-01-03 12:47:47,840][model8_pretrain.py][INFO] Epoch:[0/2](181800/4588595) loss:3.032 lr:0.0000100 epoch_Time:27827.0min: [2024-01-03 12:47:47,840][model8_pretrain.py][INFO] Epoch:[0/2](181800/4588595) loss:2.993 lr:0.0000100 epoch_Time:27827.0min: [2024-01-03 12:47:47,840][model8_pretrain.py][INFO] Epoch:[0/2](181800/4588595) loss:3.240 lr:0.0000100 epoch_Time:27827.0min: [2024-01-03 12:48:24,773][model8_pretrain.py][INFO] Epoch:[0/2](181900/4588595) loss:3.127 lr:0.0000100 epoch_Time:27826.0min: [2024-01-03 12:48:24,773][model8_pretrain.py][INFO] Epoch:[0/2](181900/4588595) loss:3.052 lr:0.0000100 epoch_Time:27826.0min: [2024-01-03 12:48:24,773][model8_pretrain.py][INFO] 
Epoch:[0/2](181900/4588595) loss:2.095 lr:0.0000100 epoch_Time:27826.0min: [2024-01-03 12:48:24,773][model8_pretrain.py][INFO] Epoch:[0/2](181900/4588595) loss:2.911 lr:0.0000100 epoch_Time:27826.0min: [2024-01-03 12:48:24,773][model8_pretrain.py][INFO] Epoch:[0/2](181900/4588595) loss:2.471 lr:0.0000100 epoch_Time:27826.0min: [2024-01-03 12:48:24,773][model8_pretrain.py][INFO] Epoch:[0/2](181900/4588595) loss:2.779 lr:0.0000100 epoch_Time:27826.0min: [2024-01-03 12:48:24,773][model8_pretrain.py][INFO] Epoch:[0/2](181900/4588595) loss:2.887 lr:0.0000100 epoch_Time:27826.0min: [2024-01-03 12:48:24,773][model8_pretrain.py][INFO] Epoch:[0/2](181900/4588595) loss:2.782 lr:0.0000100 epoch_Time:27826.0min: [2024-01-03 12:49:01,698][model8_pretrain.py][INFO] Epoch:[0/2](182000/4588595) loss:2.560 lr:0.0000100 epoch_Time:27825.0min: [2024-01-03 12:49:01,698][model8_pretrain.py][INFO] Epoch:[0/2](182000/4588595) loss:3.030 lr:0.0000100 epoch_Time:27825.0min: [2024-01-03 12:49:01,698][model8_pretrain.py][INFO] Epoch:[0/2](182000/4588595) loss:2.662 lr:0.0000100 epoch_Time:27825.0min: [2024-01-03 12:49:01,698][model8_pretrain.py][INFO] Epoch:[0/2](182000/4588595) loss:2.829 lr:0.0000100 epoch_Time:27825.0min: [2024-01-03 12:49:01,698][model8_pretrain.py][INFO] Epoch:[0/2](182000/4588595) loss:2.944 lr:0.0000100 epoch_Time:27825.0min: [2024-01-03 12:49:01,698][model8_pretrain.py][INFO] Epoch:[0/2](182000/4588595) loss:2.806 lr:0.0000100 epoch_Time:27825.0min: [2024-01-03 12:49:01,698][model8_pretrain.py][INFO] Epoch:[0/2](182000/4588595) loss:3.011 lr:0.0000100 epoch_Time:27825.0min: [2024-01-03 12:49:01,698][model8_pretrain.py][INFO] Epoch:[0/2](182000/4588595) loss:3.313 lr:0.0000100 epoch_Time:27825.0min: [2024-01-03 12:49:47,333][model8_pretrain.py][INFO] Epoch:[0/2](182100/4588595) loss:3.345 lr:0.0000100 epoch_Time:27828.0min: [2024-01-03 12:49:47,333][model8_pretrain.py][INFO] Epoch:[0/2](182100/4588595) loss:3.122 lr:0.0000100 epoch_Time:27828.0min: [2024-01-03 
12:49:47,333][model8_pretrain.py][INFO] Epoch:[0/2](182100/4588595) loss:3.203 lr:0.0000100 epoch_Time:27828.0min: [2024-01-03 12:49:47,333][model8_pretrain.py][INFO] Epoch:[0/2](182100/4588595) loss:2.978 lr:0.0000100 epoch_Time:27828.0min: [2024-01-03 12:49:47,333][model8_pretrain.py][INFO] Epoch:[0/2](182100/4588595) loss:3.193 lr:0.0000100 epoch_Time:27828.0min: [2024-01-03 12:49:47,333][model8_pretrain.py][INFO] Epoch:[0/2](182100/4588595) loss:2.655 lr:0.0000100 epoch_Time:27828.0min: [2024-01-03 12:49:47,333][model8_pretrain.py][INFO] Epoch:[0/2](182100/4588595) loss:2.757 lr:0.0000100 epoch_Time:27828.0min: [2024-01-03 12:49:47,333][model8_pretrain.py][INFO] Epoch:[0/2](182100/4588595) loss:2.875 lr:0.0000100 epoch_Time:27828.0min: [2024-01-03 12:50:24,274][model8_pretrain.py][INFO] Epoch:[0/2](182200/4588595) loss:3.244 lr:0.0000100 epoch_Time:27827.0min: [2024-01-03 12:50:24,274][model8_pretrain.py][INFO] Epoch:[0/2](182200/4588595) loss:2.865 lr:0.0000100 epoch_Time:27827.0min: [2024-01-03 12:50:24,274][model8_pretrain.py][INFO] Epoch:[0/2](182200/4588595) loss:2.933 lr:0.0000100 epoch_Time:27827.0min: [2024-01-03 12:50:24,274][model8_pretrain.py][INFO] Epoch:[0/2](182200/4588595) loss:3.086 lr:0.0000100 epoch_Time:27827.0min: [2024-01-03 12:50:24,274][model8_pretrain.py][INFO] Epoch:[0/2](182200/4588595) loss:3.082 lr:0.0000100 epoch_Time:27827.0min: [2024-01-03 12:50:24,274][model8_pretrain.py][INFO] Epoch:[0/2](182200/4588595) loss:3.098 lr:0.0000100 epoch_Time:27827.0min: [2024-01-03 12:50:24,274][model8_pretrain.py][INFO] Epoch:[0/2](182200/4588595) loss:3.107 lr:0.0000100 epoch_Time:27827.0min: [2024-01-03 12:50:24,274][model8_pretrain.py][INFO] Epoch:[0/2](182200/4588595) loss:3.027 lr:0.0000100 epoch_Time:27827.0min: [2024-01-03 12:51:01,214][model8_pretrain.py][INFO] Epoch:[0/2](182300/4588595) loss:3.004 lr:0.0000100 epoch_Time:27825.0min: [2024-01-03 12:51:01,214][model8_pretrain.py][INFO] Epoch:[0/2](182300/4588595) loss:3.101 lr:0.0000100 
epoch_Time:27825.0min: [2024-01-03 12:51:01,214][model8_pretrain.py][INFO] Epoch:[0/2](182300/4588595) loss:2.638 lr:0.0000100 epoch_Time:27825.0min: [2024-01-03 12:51:01,214][model8_pretrain.py][INFO] Epoch:[0/2](182300/4588595) loss:2.683 lr:0.0000100 epoch_Time:27825.0min: [2024-01-03 12:51:01,214][model8_pretrain.py][INFO] Epoch:[0/2](182300/4588595) loss:2.927 lr:0.0000100 epoch_Time:27825.0min: [2024-01-03 12:51:01,214][model8_pretrain.py][INFO] Epoch:[0/2](182300/4588595) loss:2.899 lr:0.0000100 epoch_Time:27825.0min: [2024-01-03 12:51:01,214][model8_pretrain.py][INFO] Epoch:[0/2](182300/4588595) loss:3.031 lr:0.0000100 epoch_Time:27825.0min: [2024-01-03 12:51:01,214][model8_pretrain.py][INFO] Epoch:[0/2](182300/4588595) loss:2.220 lr:0.0000100 epoch_Time:27825.0min: [2024-01-03 12:51:38,167][model8_pretrain.py][INFO] Epoch:[0/2](182400/4588595) loss:2.809 lr:0.0000100 epoch_Time:27825.0min: [2024-01-03 12:51:38,167][model8_pretrain.py][INFO] Epoch:[0/2](182400/4588595) loss:3.138 lr:0.0000100 epoch_Time:27825.0min: [2024-01-03 12:51:38,167][model8_pretrain.py][INFO] Epoch:[0/2](182400/4588595) loss:3.147 lr:0.0000100 epoch_Time:27825.0min: [2024-01-03 12:51:38,167][model8_pretrain.py][INFO] Epoch:[0/2](182400/4588595) loss:2.807 lr:0.0000100 epoch_Time:27825.0min: [2024-01-03 12:51:38,167][model8_pretrain.py][INFO] Epoch:[0/2](182400/4588595) loss:3.132 lr:0.0000100 epoch_Time:27825.0min: [2024-01-03 12:51:38,167][model8_pretrain.py][INFO] Epoch:[0/2](182400/4588595) loss:3.309 lr:0.0000100 epoch_Time:27825.0min: [2024-01-03 12:51:38,167][model8_pretrain.py][INFO] Epoch:[0/2](182400/4588595) loss:3.184 lr:0.0000100 epoch_Time:27825.0min: [2024-01-03 12:51:38,167][model8_pretrain.py][INFO] Epoch:[0/2](182400/4588595) loss:2.530 lr:0.0000100 epoch_Time:27825.0min: [2024-01-03 12:52:15,111][model8_pretrain.py][INFO] Epoch:[0/2](182500/4588595) loss:2.976 lr:0.0000100 epoch_Time:27824.0min: [2024-01-03 12:52:15,111][model8_pretrain.py][INFO] 
Epoch:[0/2](182500/4588595) loss:2.973 lr:0.0000100 epoch_Time:27824.0min: [2024-01-03 12:52:15,111][model8_pretrain.py][INFO] Epoch:[0/2](182500/4588595) loss:3.271 lr:0.0000100 epoch_Time:27824.0min: [2024-01-03 12:52:15,111][model8_pretrain.py][INFO] Epoch:[0/2](182500/4588595) loss:2.656 lr:0.0000100 epoch_Time:27824.0min: [2024-01-03 12:52:15,111][model8_pretrain.py][INFO] Epoch:[0/2](182500/4588595) loss:3.286 lr:0.0000100 epoch_Time:27824.0min: [2024-01-03 12:52:15,111][model8_pretrain.py][INFO] Epoch:[0/2](182500/4588595) loss:3.024 lr:0.0000100 epoch_Time:27824.0min: [2024-01-03 12:52:15,111][model8_pretrain.py][INFO] Epoch:[0/2](182500/4588595) loss:3.258 lr:0.0000100 epoch_Time:27824.0min: [2024-01-03 12:52:15,111][model8_pretrain.py][INFO] Epoch:[0/2](182500/4588595) loss:2.930 lr:0.0000100 epoch_Time:27824.0min: [2024-01-03 12:52:52,056][model8_pretrain.py][INFO] Epoch:[0/2](182600/4588595) loss:3.421 lr:0.0000100 epoch_Time:27822.0min: [2024-01-03 12:52:52,057][model8_pretrain.py][INFO] Epoch:[0/2](182600/4588595) loss:2.864 lr:0.0000100 epoch_Time:27822.0min: [2024-01-03 12:52:52,057][model8_pretrain.py][INFO] Epoch:[0/2](182600/4588595) loss:3.159 lr:0.0000100 epoch_Time:27822.0min: [2024-01-03 12:52:52,057][model8_pretrain.py][INFO] Epoch:[0/2](182600/4588595) loss:2.593 lr:0.0000100 epoch_Time:27822.0min: [2024-01-03 12:52:52,057][model8_pretrain.py][INFO] Epoch:[0/2](182600/4588595) loss:3.455 lr:0.0000100 epoch_Time:27822.0min: [2024-01-03 12:52:52,057][model8_pretrain.py][INFO] Epoch:[0/2](182600/4588595) loss:3.290 lr:0.0000100 epoch_Time:27822.0min: [2024-01-03 12:52:52,057][model8_pretrain.py][INFO] Epoch:[0/2](182600/4588595) loss:3.176 lr:0.0000100 epoch_Time:27822.0min: [2024-01-03 12:52:52,057][model8_pretrain.py][INFO] Epoch:[0/2](182600/4588595) loss:2.656 lr:0.0000100 epoch_Time:27822.0min: [2024-01-03 12:53:28,970][model8_pretrain.py][INFO] Epoch:[0/2](182700/4588595) loss:3.318 lr:0.0000100 epoch_Time:27822.0min: [2024-01-03 
12:53:28,970][model8_pretrain.py][INFO] Epoch:[0/2](182700/4588595) loss:2.556 lr:0.0000100 epoch_Time:27822.0min: [2024-01-03 12:53:28,970][model8_pretrain.py][INFO] Epoch:[0/2](182700/4588595) loss:3.078 lr:0.0000100 epoch_Time:27822.0min: [2024-01-03 12:53:28,970][model8_pretrain.py][INFO] Epoch:[0/2](182700/4588595) loss:2.845 lr:0.0000100 epoch_Time:27822.0min: [2024-01-03 12:53:28,970][model8_pretrain.py][INFO] Epoch:[0/2](182700/4588595) loss:2.632 lr:0.0000100 epoch_Time:27822.0min: [2024-01-03 12:53:28,970][model8_pretrain.py][INFO] Epoch:[0/2](182700/4588595) loss:2.403 lr:0.0000100 epoch_Time:27822.0min: [2024-01-03 12:53:28,971][model8_pretrain.py][INFO] Epoch:[0/2](182700/4588595) loss:2.739 lr:0.0000100 epoch_Time:27822.0min: [2024-01-03 12:53:28,972][model8_pretrain.py][INFO] Epoch:[0/2](182700/4588595) loss:2.996 lr:0.0000100 epoch_Time:27822.0min: [2024-01-03 12:54:05,904][model8_pretrain.py][INFO] Epoch:[0/2](182800/4588595) loss:3.442 lr:0.0000100 epoch_Time:27820.0min: [2024-01-03 12:54:05,904][model8_pretrain.py][INFO] Epoch:[0/2](182800/4588595) loss:2.741 lr:0.0000100 epoch_Time:27820.0min: [2024-01-03 12:54:05,904][model8_pretrain.py][INFO] Epoch:[0/2](182800/4588595) loss:2.458 lr:0.0000100 epoch_Time:27820.0min: [2024-01-03 12:54:05,904][model8_pretrain.py][INFO] Epoch:[0/2](182800/4588595) loss:3.393 lr:0.0000100 epoch_Time:27820.0min: [2024-01-03 12:54:05,904][model8_pretrain.py][INFO] Epoch:[0/2](182800/4588595) loss:2.418 lr:0.0000100 epoch_Time:27820.0min: [2024-01-03 12:54:05,904][model8_pretrain.py][INFO] Epoch:[0/2](182800/4588595) loss:3.176 lr:0.0000100 epoch_Time:27820.0min: [2024-01-03 12:54:05,904][model8_pretrain.py][INFO] Epoch:[0/2](182800/4588595) loss:3.032 lr:0.0000100 epoch_Time:27820.0min: [2024-01-03 12:54:05,904][model8_pretrain.py][INFO] Epoch:[0/2](182800/4588595) loss:3.267 lr:0.0000100 epoch_Time:27820.0min: [2024-01-03 12:54:51,500][model8_pretrain.py][INFO] Epoch:[0/2](182900/4588595) loss:2.957 lr:0.0000100 
epoch_Time:27823.0min: [2024-01-03 12:54:51,500][model8_pretrain.py][INFO] Epoch:[0/2](182900/4588595) loss:2.529 lr:0.0000100 epoch_Time:27823.0min: [2024-01-03 12:54:51,500][model8_pretrain.py][INFO] Epoch:[0/2](182900/4588595) loss:3.456 lr:0.0000100 epoch_Time:27823.0min: [2024-01-03 12:54:51,500][model8_pretrain.py][INFO] Epoch:[0/2](182900/4588595) loss:2.999 lr:0.0000100 epoch_Time:27823.0min: [2024-01-03 12:54:51,500][model8_pretrain.py][INFO] Epoch:[0/2](182900/4588595) loss:3.167 lr:0.0000100 epoch_Time:27823.0min: [2024-01-03 12:54:51,500][model8_pretrain.py][INFO] Epoch:[0/2](182900/4588595) loss:2.931 lr:0.0000100 epoch_Time:27823.0min: [2024-01-03 12:54:51,500][model8_pretrain.py][INFO] Epoch:[0/2](182900/4588595) loss:3.199 lr:0.0000100 epoch_Time:27823.0min: [2024-01-03 12:54:51,500][model8_pretrain.py][INFO] Epoch:[0/2](182900/4588595) loss:3.451 lr:0.0000100 epoch_Time:27823.0min: [2024-01-03 12:55:28,423][model8_pretrain.py][INFO] Epoch:[0/2](183000/4588595) loss:2.944 lr:0.0000100 epoch_Time:27822.0min: [2024-01-03 12:55:28,423][model8_pretrain.py][INFO] Epoch:[0/2](183000/4588595) loss:3.218 lr:0.0000100 epoch_Time:27822.0min: [2024-01-03 12:55:28,423][model8_pretrain.py][INFO] Epoch:[0/2](183000/4588595) loss:2.660 lr:0.0000100 epoch_Time:27822.0min: [2024-01-03 12:55:28,423][model8_pretrain.py][INFO] Epoch:[0/2](183000/4588595) loss:2.934 lr:0.0000100 epoch_Time:27822.0min: [2024-01-03 12:55:28,423][model8_pretrain.py][INFO] Epoch:[0/2](183000/4588595) loss:2.939 lr:0.0000100 epoch_Time:27822.0min: [2024-01-03 12:55:28,423][model8_pretrain.py][INFO] Epoch:[0/2](183000/4588595) loss:2.726 lr:0.0000100 epoch_Time:27822.0min: [2024-01-03 12:55:28,423][model8_pretrain.py][INFO] Epoch:[0/2](183000/4588595) loss:3.313 lr:0.0000100 epoch_Time:27822.0min: [2024-01-03 12:55:28,423][model8_pretrain.py][INFO] Epoch:[0/2](183000/4588595) loss:3.386 lr:0.0000100 epoch_Time:27822.0min: [2024-01-03 12:56:05,360][model8_pretrain.py][INFO] 
Epoch:[0/2](183100/4588595) loss:3.504 lr:0.0000100 epoch_Time:27821.0min: [2024-01-03 12:56:05,360][model8_pretrain.py][INFO] Epoch:[0/2](183100/4588595) loss:3.128 lr:0.0000100 epoch_Time:27821.0min: [2024-01-03 12:56:05,360][model8_pretrain.py][INFO] Epoch:[0/2](183100/4588595) loss:2.876 lr:0.0000100 epoch_Time:27821.0min: [2024-01-03 12:56:05,360][model8_pretrain.py][INFO] Epoch:[0/2](183100/4588595) loss:3.403 lr:0.0000100 epoch_Time:27821.0min: [2024-01-03 12:56:05,361][model8_pretrain.py][INFO] Epoch:[0/2](183100/4588595) loss:3.010 lr:0.0000100 epoch_Time:27821.0min: [2024-01-03 12:56:05,361][model8_pretrain.py][INFO] Epoch:[0/2](183100/4588595) loss:3.060 lr:0.0000100 epoch_Time:27821.0min: [2024-01-03 12:56:05,361][model8_pretrain.py][INFO] Epoch:[0/2](183100/4588595) loss:2.540 lr:0.0000100 epoch_Time:27821.0min: [2024-01-03 12:56:05,361][model8_pretrain.py][INFO] Epoch:[0/2](183100/4588595) loss:3.177 lr:0.0000100 epoch_Time:27821.0min: [2024-01-03 12:56:42,329][model8_pretrain.py][INFO] Epoch:[0/2](183200/4588595) loss:2.601 lr:0.0000100 epoch_Time:27820.0min: [2024-01-03 12:56:42,329][model8_pretrain.py][INFO] Epoch:[0/2](183200/4588595) loss:2.787 lr:0.0000100 epoch_Time:27820.0min: [2024-01-03 12:56:42,329][model8_pretrain.py][INFO] Epoch:[0/2](183200/4588595) loss:2.395 lr:0.0000100 epoch_Time:27820.0min: [2024-01-03 12:56:42,329][model8_pretrain.py][INFO] Epoch:[0/2](183200/4588595) loss:2.662 lr:0.0000100 epoch_Time:27820.0min: [2024-01-03 12:56:42,329][model8_pretrain.py][INFO] Epoch:[0/2](183200/4588595) loss:3.273 lr:0.0000100 epoch_Time:27820.0min: [2024-01-03 12:56:42,329][model8_pretrain.py][INFO] Epoch:[0/2](183200/4588595) loss:3.416 lr:0.0000100 epoch_Time:27820.0min: [2024-01-03 12:56:42,329][model8_pretrain.py][INFO] Epoch:[0/2](183200/4588595) loss:2.994 lr:0.0000100 epoch_Time:27820.0min: [2024-01-03 12:56:42,330][model8_pretrain.py][INFO] Epoch:[0/2](183200/4588595) loss:3.344 lr:0.0000100 epoch_Time:27820.0min: [2024-01-03 
12:57:19,292][model8_pretrain.py][INFO] Epoch:[0/2](183300/4588595) loss:3.287 lr:0.0000100 epoch_Time:27819.0min: [2024-01-03 12:57:19,292][model8_pretrain.py][INFO] Epoch:[0/2](183300/4588595) loss:2.571 lr:0.0000100 epoch_Time:27819.0min: [2024-01-03 12:57:19,292][model8_pretrain.py][INFO] Epoch:[0/2](183300/4588595) loss:3.012 lr:0.0000100 epoch_Time:27819.0min: [2024-01-03 12:57:19,292][model8_pretrain.py][INFO] Epoch:[0/2](183300/4588595) loss:3.176 lr:0.0000100 epoch_Time:27819.0min: [2024-01-03 12:57:19,292][model8_pretrain.py][INFO] Epoch:[0/2](183300/4588595) loss:2.833 lr:0.0000100 epoch_Time:27819.0min: [2024-01-03 12:57:19,292][model8_pretrain.py][INFO] Epoch:[0/2](183300/4588595) loss:3.159 lr:0.0000100 epoch_Time:27819.0min: [2024-01-03 12:57:19,292][model8_pretrain.py][INFO] Epoch:[0/2](183300/4588595) loss:3.419 lr:0.0000100 epoch_Time:27819.0min: [2024-01-03 12:57:19,293][model8_pretrain.py][INFO] Epoch:[0/2](183300/4588595) loss:3.175 lr:0.0000100 epoch_Time:27819.0min: [2024-01-03 12:57:56,237][model8_pretrain.py][INFO] Epoch:[0/2](183400/4588595) loss:2.722 lr:0.0000100 epoch_Time:27818.0min: [2024-01-03 12:57:56,237][model8_pretrain.py][INFO] Epoch:[0/2](183400/4588595) loss:2.859 lr:0.0000100 epoch_Time:27818.0min: [2024-01-03 12:57:56,237][model8_pretrain.py][INFO] Epoch:[0/2](183400/4588595) loss:2.958 lr:0.0000100 epoch_Time:27818.0min: [2024-01-03 12:57:56,237][model8_pretrain.py][INFO] Epoch:[0/2](183400/4588595) loss:2.983 lr:0.0000100 epoch_Time:27818.0min: [2024-01-03 12:57:56,237][model8_pretrain.py][INFO] Epoch:[0/2](183400/4588595) loss:2.805 lr:0.0000100 epoch_Time:27818.0min: [2024-01-03 12:57:56,237][model8_pretrain.py][INFO] Epoch:[0/2](183400/4588595) loss:3.137 lr:0.0000100 epoch_Time:27818.0min: [2024-01-03 12:57:56,237][model8_pretrain.py][INFO] Epoch:[0/2](183400/4588595) loss:3.019 lr:0.0000100 epoch_Time:27818.0min: [2024-01-03 12:57:56,238][model8_pretrain.py][INFO] Epoch:[0/2](183400/4588595) loss:1.896 lr:0.0000100 
epoch_Time:27818.0min: [2024-01-03 12:58:33,197][model8_pretrain.py][INFO] Epoch:[0/2](183500/4588595) loss:2.989 lr:0.0000100 epoch_Time:27817.0min: [2024-01-03 12:58:33,197][model8_pretrain.py][INFO] Epoch:[0/2](183500/4588595) loss:2.622 lr:0.0000100 epoch_Time:27817.0min: [2024-01-03 12:58:33,197][model8_pretrain.py][INFO] Epoch:[0/2](183500/4588595) loss:3.084 lr:0.0000100 epoch_Time:27817.0min: [2024-01-03 12:58:33,197][model8_pretrain.py][INFO] Epoch:[0/2](183500/4588595) loss:2.932 lr:0.0000100 epoch_Time:27817.0min: [2024-01-03 12:58:33,197][model8_pretrain.py][INFO] Epoch:[0/2](183500/4588595) loss:2.322 lr:0.0000100 epoch_Time:27817.0min: [2024-01-03 12:58:33,197][model8_pretrain.py][INFO] Epoch:[0/2](183500/4588595) loss:2.935 lr:0.0000100 epoch_Time:27817.0min: [2024-01-03 12:58:33,197][model8_pretrain.py][INFO] Epoch:[0/2](183500/4588595) loss:2.928 lr:0.0000100 epoch_Time:27817.0min: [2024-01-03 12:58:33,197][model8_pretrain.py][INFO] Epoch:[0/2](183500/4588595) loss:3.134 lr:0.0000100 epoch_Time:27817.0min: [2024-01-03 12:59:10,143][model8_pretrain.py][INFO] Epoch:[0/2](183600/4588595) loss:2.981 lr:0.0000100 epoch_Time:27816.0min: [2024-01-03 12:59:10,143][model8_pretrain.py][INFO] Epoch:[0/2](183600/4588595) loss:3.084 lr:0.0000100 epoch_Time:27816.0min: [2024-01-03 12:59:10,143][model8_pretrain.py][INFO] Epoch:[0/2](183600/4588595) loss:3.002 lr:0.0000100 epoch_Time:27816.0min: [2024-01-03 12:59:10,143][model8_pretrain.py][INFO] Epoch:[0/2](183600/4588595) loss:2.734 lr:0.0000100 epoch_Time:27816.0min: [2024-01-03 12:59:10,143][model8_pretrain.py][INFO] Epoch:[0/2](183600/4588595) loss:3.289 lr:0.0000100 epoch_Time:27816.0min: [2024-01-03 12:59:10,144][model8_pretrain.py][INFO] Epoch:[0/2](183600/4588595) loss:3.206 lr:0.0000100 epoch_Time:27816.0min: [2024-01-03 12:59:10,144][model8_pretrain.py][INFO] Epoch:[0/2](183600/4588595) loss:2.291 lr:0.0000100 epoch_Time:27816.0min: [2024-01-03 12:59:10,144][model8_pretrain.py][INFO] 
Epoch:[0/2](183600/4588595) loss:2.669 lr:0.0000100 epoch_Time:27816.0min: [2024-01-03 12:59:55,875][model8_pretrain.py][INFO] Epoch:[0/2](183700/4588595) loss:3.017 lr:0.0000100 epoch_Time:27818.0min: [2024-01-03 12:59:55,875][model8_pretrain.py][INFO] Epoch:[0/2](183700/4588595) loss:3.220 lr:0.0000100 epoch_Time:27818.0min: [2024-01-03 12:59:55,875][model8_pretrain.py][INFO] Epoch:[0/2](183700/4588595) loss:3.680 lr:0.0000100 epoch_Time:27818.0min: [2024-01-03 12:59:55,875][model8_pretrain.py][INFO] Epoch:[0/2](183700/4588595) loss:2.719 lr:0.0000100 epoch_Time:27818.0min: [2024-01-03 12:59:55,875][model8_pretrain.py][INFO] Epoch:[0/2](183700/4588595) loss:2.833 lr:0.0000100 epoch_Time:27818.0min: [2024-01-03 12:59:55,875][model8_pretrain.py][INFO] Epoch:[0/2](183700/4588595) loss:2.814 lr:0.0000100 epoch_Time:27818.0min: [2024-01-03 12:59:55,875][model8_pretrain.py][INFO] Epoch:[0/2](183700/4588595) loss:2.857 lr:0.0000100 epoch_Time:27818.0min: [2024-01-03 12:59:55,875][model8_pretrain.py][INFO] Epoch:[0/2](183700/4588595) loss:3.191 lr:0.0000100 epoch_Time:27818.0min: [2024-01-03 13:00:32,809][model8_pretrain.py][INFO] Epoch:[0/2](183800/4588595) loss:2.767 lr:0.0000100 epoch_Time:27818.0min: [2024-01-03 13:00:32,809][model8_pretrain.py][INFO] Epoch:[0/2](183800/4588595) loss:3.171 lr:0.0000100 epoch_Time:27818.0min: [2024-01-03 13:00:32,809][model8_pretrain.py][INFO] Epoch:[0/2](183800/4588595) loss:3.233 lr:0.0000100 epoch_Time:27818.0min: [2024-01-03 13:00:32,809][model8_pretrain.py][INFO] Epoch:[0/2](183800/4588595) loss:2.934 lr:0.0000100 epoch_Time:27818.0min: [2024-01-03 13:00:32,809][model8_pretrain.py][INFO] Epoch:[0/2](183800/4588595) loss:2.787 lr:0.0000100 epoch_Time:27818.0min: [2024-01-03 13:00:32,809][model8_pretrain.py][INFO] Epoch:[0/2](183800/4588595) loss:2.622 lr:0.0000100 epoch_Time:27818.0min: [2024-01-03 13:00:32,809][model8_pretrain.py][INFO] Epoch:[0/2](183800/4588595) loss:2.657 lr:0.0000100 epoch_Time:27818.0min: [2024-01-03 
13:00:32,809][model8_pretrain.py][INFO] Epoch:[0/2](183800/4588595) loss:2.756 lr:0.0000100 epoch_Time:27818.0min: [2024-01-03 13:01:09,746][model8_pretrain.py][INFO] Epoch:[0/2](183900/4588595) loss:3.145 lr:0.0000100 epoch_Time:27816.0min: [2024-01-03 13:01:09,746][model8_pretrain.py][INFO] Epoch:[0/2](183900/4588595) loss:2.533 lr:0.0000100 epoch_Time:27816.0min: [2024-01-03 13:01:09,746][model8_pretrain.py][INFO] Epoch:[0/2](183900/4588595) loss:3.162 lr:0.0000100 epoch_Time:27816.0min: [2024-01-03 13:01:09,746][model8_pretrain.py][INFO] Epoch:[0/2](183900/4588595) loss:3.401 lr:0.0000100 epoch_Time:27816.0min: [2024-01-03 13:01:09,746][model8_pretrain.py][INFO] Epoch:[0/2](183900/4588595) loss:3.202 lr:0.0000100 epoch_Time:27816.0min: [2024-01-03 13:01:09,746][model8_pretrain.py][INFO] Epoch:[0/2](183900/4588595) loss:2.887 lr:0.0000100 epoch_Time:27816.0min: [2024-01-03 13:01:09,746][model8_pretrain.py][INFO] Epoch:[0/2](183900/4588595) loss:2.266 lr:0.0000100 epoch_Time:27816.0min: [2024-01-03 13:01:09,747][model8_pretrain.py][INFO] Epoch:[0/2](183900/4588595) loss:2.692 lr:0.0000100 epoch_Time:27816.0min: [2024-01-03 13:01:46,685][model8_pretrain.py][INFO] Epoch:[0/2](184000/4588595) loss:3.023 lr:0.0000100 epoch_Time:27816.0min: [2024-01-03 13:01:46,685][model8_pretrain.py][INFO] Epoch:[0/2](184000/4588595) loss:3.288 lr:0.0000100 epoch_Time:27816.0min: [2024-01-03 13:01:46,685][model8_pretrain.py][INFO] Epoch:[0/2](184000/4588595) loss:3.227 lr:0.0000100 epoch_Time:27816.0min: [2024-01-03 13:01:46,685][model8_pretrain.py][INFO] Epoch:[0/2](184000/4588595) loss:2.782 lr:0.0000100 epoch_Time:27816.0min: [2024-01-03 13:01:46,685][model8_pretrain.py][INFO] Epoch:[0/2](184000/4588595) loss:2.962 lr:0.0000100 epoch_Time:27816.0min: [2024-01-03 13:01:46,685][model8_pretrain.py][INFO] Epoch:[0/2](184000/4588595) loss:3.075 lr:0.0000100 epoch_Time:27816.0min: [2024-01-03 13:01:46,685][model8_pretrain.py][INFO] Epoch:[0/2](184000/4588595) loss:3.053 lr:0.0000100 
epoch_Time:27816.0min: [2024-01-03 13:01:46,686][model8_pretrain.py][INFO] Epoch:[0/2](184000/4588595) loss:2.890 lr:0.0000100 epoch_Time:27816.0min: [2024-01-03 13:02:23,628][model8_pretrain.py][INFO] Epoch:[0/2](184100/4588595) loss:2.902 lr:0.0000100 epoch_Time:27815.0min: [2024-01-03 13:02:23,628][model8_pretrain.py][INFO] Epoch:[0/2](184100/4588595) loss:2.502 lr:0.0000100 epoch_Time:27815.0min: [2024-01-03 13:02:23,628][model8_pretrain.py][INFO] Epoch:[0/2](184100/4588595) loss:2.563 lr:0.0000100 epoch_Time:27815.0min: [2024-01-03 13:02:23,628][model8_pretrain.py][INFO] Epoch:[0/2](184100/4588595) loss:2.897 lr:0.0000100 epoch_Time:27815.0min: [2024-01-03 13:02:23,628][model8_pretrain.py][INFO] Epoch:[0/2](184100/4588595) loss:2.883 lr:0.0000100 epoch_Time:27815.0min: [2024-01-03 13:02:23,628][model8_pretrain.py][INFO] Epoch:[0/2](184100/4588595) loss:2.842 lr:0.0000100 epoch_Time:27815.0min: [2024-01-03 13:02:23,628][model8_pretrain.py][INFO] Epoch:[0/2](184100/4588595) loss:2.856 lr:0.0000100 epoch_Time:27815.0min: [2024-01-03 13:02:23,628][model8_pretrain.py][INFO] Epoch:[0/2](184100/4588595) loss:2.764 lr:0.0000100 epoch_Time:27815.0min: [2024-01-03 13:03:00,572][model8_pretrain.py][INFO] Epoch:[0/2](184200/4588595) loss:3.121 lr:0.0000100 epoch_Time:27813.0min: [2024-01-03 13:03:00,572][model8_pretrain.py][INFO] Epoch:[0/2](184200/4588595) loss:3.290 lr:0.0000100 epoch_Time:27813.0min: [2024-01-03 13:03:00,572][model8_pretrain.py][INFO] Epoch:[0/2](184200/4588595) loss:2.845 lr:0.0000100 epoch_Time:27813.0min: [2024-01-03 13:03:00,572][model8_pretrain.py][INFO] Epoch:[0/2](184200/4588595) loss:3.090 lr:0.0000100 epoch_Time:27813.0min: [2024-01-03 13:03:00,572][model8_pretrain.py][INFO] Epoch:[0/2](184200/4588595) loss:2.601 lr:0.0000100 epoch_Time:27813.0min: [2024-01-03 13:03:00,572][model8_pretrain.py][INFO] Epoch:[0/2](184200/4588595) loss:3.133 lr:0.0000100 epoch_Time:27813.0min: [2024-01-03 13:03:00,572][model8_pretrain.py][INFO] 
Epoch:[0/2](184200/4588595) loss:3.209 lr:0.0000100 epoch_Time:27813.0min: [2024-01-03 13:03:00,572][model8_pretrain.py][INFO] Epoch:[0/2](184200/4588595) loss:3.149 lr:0.0000100 epoch_Time:27813.0min: [2024-01-03 13:03:37,531][model8_pretrain.py][INFO] Epoch:[0/2](184300/4588595) loss:3.034 lr:0.0000100 epoch_Time:27813.0min: [2024-01-03 13:03:37,531][model8_pretrain.py][INFO] Epoch:[0/2](184300/4588595) loss:2.657 lr:0.0000100 epoch_Time:27813.0min: [2024-01-03 13:03:37,531][model8_pretrain.py][INFO] Epoch:[0/2](184300/4588595) loss:3.266 lr:0.0000100 epoch_Time:27813.0min: [2024-01-03 13:03:37,531][model8_pretrain.py][INFO] Epoch:[0/2](184300/4588595) loss:2.864 lr:0.0000100 epoch_Time:27813.0min: [2024-01-03 13:03:37,531][model8_pretrain.py][INFO] Epoch:[0/2](184300/4588595) loss:3.240 lr:0.0000100 epoch_Time:27813.0min: [2024-01-03 13:03:37,531][model8_pretrain.py][INFO] Epoch:[0/2](184300/4588595) loss:3.280 lr:0.0000100 epoch_Time:27813.0min: [2024-01-03 13:03:37,531][model8_pretrain.py][INFO] Epoch:[0/2](184300/4588595) loss:2.807 lr:0.0000100 epoch_Time:27813.0min: [2024-01-03 13:03:37,531][model8_pretrain.py][INFO] Epoch:[0/2](184300/4588595) loss:3.073 lr:0.0000100 epoch_Time:27813.0min: [2024-01-03 13:04:14,473][model8_pretrain.py][INFO] Epoch:[0/2](184400/4588595) loss:3.194 lr:0.0000100 epoch_Time:27811.0min: [2024-01-03 13:04:14,473][model8_pretrain.py][INFO] Epoch:[0/2](184400/4588595) loss:2.632 lr:0.0000100 epoch_Time:27811.0min: [2024-01-03 13:04:14,473][model8_pretrain.py][INFO] Epoch:[0/2](184400/4588595) loss:3.103 lr:0.0000100 epoch_Time:27811.0min: [2024-01-03 13:04:14,473][model8_pretrain.py][INFO] Epoch:[0/2](184400/4588595) loss:2.728 lr:0.0000100 epoch_Time:27811.0min: [2024-01-03 13:04:14,473][model8_pretrain.py][INFO] Epoch:[0/2](184400/4588595) loss:3.292 lr:0.0000100 epoch_Time:27811.0min: [2024-01-03 13:04:14,473][model8_pretrain.py][INFO] Epoch:[0/2](184400/4588595) loss:3.082 lr:0.0000100 epoch_Time:27811.0min: [2024-01-03 
13:04:14,473][model8_pretrain.py][INFO] Epoch:[0/2](184400/4588595) loss:3.163 lr:0.0000100 epoch_Time:27811.0min: [2024-01-03 13:04:14,473][model8_pretrain.py][INFO] Epoch:[0/2](184400/4588595) loss:3.090 lr:0.0000100 epoch_Time:27811.0min: [2024-01-03 13:05:00,207][model8_pretrain.py][INFO] Epoch:[0/2](184500/4588595) loss:2.991 lr:0.0000100 epoch_Time:27814.0min: [2024-01-03 13:05:00,207][model8_pretrain.py][INFO] Epoch:[0/2](184500/4588595) loss:3.286 lr:0.0000100 epoch_Time:27814.0min: [2024-01-03 13:05:00,207][model8_pretrain.py][INFO] Epoch:[0/2](184500/4588595) loss:3.028 lr:0.0000100 epoch_Time:27814.0min: [2024-01-03 13:05:00,207][model8_pretrain.py][INFO] Epoch:[0/2](184500/4588595) loss:3.376 lr:0.0000100 epoch_Time:27814.0min: [2024-01-03 13:05:00,207][model8_pretrain.py][INFO] Epoch:[0/2](184500/4588595) loss:2.870 lr:0.0000100 epoch_Time:27814.0min: [2024-01-03 13:05:00,207][model8_pretrain.py][INFO] Epoch:[0/2](184500/4588595) loss:2.697 lr:0.0000100 epoch_Time:27814.0min: [2024-01-03 13:05:00,207][model8_pretrain.py][INFO] Epoch:[0/2](184500/4588595) loss:2.751 lr:0.0000100 epoch_Time:27814.0min: [2024-01-03 13:05:00,207][model8_pretrain.py][INFO] Epoch:[0/2](184500/4588595) loss:2.459 lr:0.0000100 epoch_Time:27814.0min: [2024-01-03 13:05:37,119][model8_pretrain.py][INFO] Epoch:[0/2](184600/4588595) loss:2.709 lr:0.0000100 epoch_Time:27813.0min: [2024-01-03 13:05:37,119][model8_pretrain.py][INFO] Epoch:[0/2](184600/4588595) loss:3.447 lr:0.0000100 epoch_Time:27813.0min: [2024-01-03 13:05:37,119][model8_pretrain.py][INFO] Epoch:[0/2](184600/4588595) loss:2.732 lr:0.0000100 epoch_Time:27813.0min: [2024-01-03 13:05:37,119][model8_pretrain.py][INFO] Epoch:[0/2](184600/4588595) loss:3.331 lr:0.0000100 epoch_Time:27813.0min: [2024-01-03 13:05:37,119][model8_pretrain.py][INFO] Epoch:[0/2](184600/4588595) loss:3.217 lr:0.0000100 epoch_Time:27813.0min: [2024-01-03 13:05:37,119][model8_pretrain.py][INFO] Epoch:[0/2](184600/4588595) loss:3.103 lr:0.0000100 
epoch_Time:27813.0min: [2024-01-03 13:05:37,119][model8_pretrain.py][INFO] Epoch:[0/2](184600/4588595) loss:3.163 lr:0.0000100 epoch_Time:27813.0min: [2024-01-03 13:05:37,121][model8_pretrain.py][INFO] Epoch:[0/2](184600/4588595) loss:3.098 lr:0.0000100 epoch_Time:27813.0min: [2024-01-03 13:06:14,051][model8_pretrain.py][INFO] Epoch:[0/2](184700/4588595) loss:2.544 lr:0.0000100 epoch_Time:27812.0min: [2024-01-03 13:06:14,051][model8_pretrain.py][INFO] Epoch:[0/2](184700/4588595) loss:3.383 lr:0.0000100 epoch_Time:27812.0min: [2024-01-03 13:06:14,051][model8_pretrain.py][INFO] Epoch:[0/2](184700/4588595) loss:2.920 lr:0.0000100 epoch_Time:27812.0min: [2024-01-03 13:06:14,051][model8_pretrain.py][INFO] Epoch:[0/2](184700/4588595) loss:2.634 lr:0.0000100 epoch_Time:27812.0min: [2024-01-03 13:06:14,051][model8_pretrain.py][INFO] Epoch:[0/2](184700/4588595) loss:2.962 lr:0.0000100 epoch_Time:27812.0min: [2024-01-03 13:06:14,051][model8_pretrain.py][INFO] Epoch:[0/2](184700/4588595) loss:2.377 lr:0.0000100 epoch_Time:27812.0min: [2024-01-03 13:06:14,051][model8_pretrain.py][INFO] Epoch:[0/2](184700/4588595) loss:2.988 lr:0.0000100 epoch_Time:27812.0min: [2024-01-03 13:06:14,051][model8_pretrain.py][INFO] Epoch:[0/2](184700/4588595) loss:1.928 lr:0.0000100 epoch_Time:27812.0min: [2024-01-03 13:06:50,987][model8_pretrain.py][INFO] Epoch:[0/2](184800/4588595) loss:2.843 lr:0.0000100 epoch_Time:27810.0min: [2024-01-03 13:06:50,987][model8_pretrain.py][INFO] Epoch:[0/2](184800/4588595) loss:2.871 lr:0.0000100 epoch_Time:27810.0min: [2024-01-03 13:06:50,987][model8_pretrain.py][INFO] Epoch:[0/2](184800/4588595) loss:2.980 lr:0.0000100 epoch_Time:27810.0min: [2024-01-03 13:06:50,987][model8_pretrain.py][INFO] Epoch:[0/2](184800/4588595) loss:2.993 lr:0.0000100 epoch_Time:27810.0min: [2024-01-03 13:06:50,987][model8_pretrain.py][INFO] Epoch:[0/2](184800/4588595) loss:3.283 lr:0.0000100 epoch_Time:27810.0min: [2024-01-03 13:06:50,987][model8_pretrain.py][INFO] 
Epoch:[0/2](184800/4588595) loss:2.982 lr:0.0000100 epoch_Time:27810.0min: [2024-01-03 13:06:50,987][model8_pretrain.py][INFO] Epoch:[0/2](184800/4588595) loss:2.592 lr:0.0000100 epoch_Time:27810.0min: [2024-01-03 13:06:50,987][model8_pretrain.py][INFO] Epoch:[0/2](184800/4588595) loss:2.965 lr:0.0000100 epoch_Time:27810.0min: [2024-01-03 13:07:27,928][model8_pretrain.py][INFO] Epoch:[0/2](184900/4588595) loss:2.318 lr:0.0000100 epoch_Time:27810.0min: [2024-01-03 13:07:27,928][model8_pretrain.py][INFO] Epoch:[0/2](184900/4588595) loss:3.421 lr:0.0000100 epoch_Time:27810.0min: [2024-01-03 13:07:27,928][model8_pretrain.py][INFO] Epoch:[0/2](184900/4588595) loss:2.958 lr:0.0000100 epoch_Time:27810.0min: [2024-01-03 13:07:27,928][model8_pretrain.py][INFO] Epoch:[0/2](184900/4588595) loss:3.395 lr:0.0000100 epoch_Time:27810.0min: [2024-01-03 13:07:27,928][model8_pretrain.py][INFO] Epoch:[0/2](184900/4588595) loss:3.054 lr:0.0000100 epoch_Time:27810.0min: [2024-01-03 13:07:27,928][model8_pretrain.py][INFO] Epoch:[0/2](184900/4588595) loss:2.288 lr:0.0000100 epoch_Time:27810.0min: [2024-01-03 13:07:27,928][model8_pretrain.py][INFO] Epoch:[0/2](184900/4588595) loss:3.042 lr:0.0000100 epoch_Time:27810.0min: [2024-01-03 13:07:27,928][model8_pretrain.py][INFO] Epoch:[0/2](184900/4588595) loss:2.367 lr:0.0000100 epoch_Time:27810.0min: [2024-01-03 13:08:04,868][model8_pretrain.py][INFO] Epoch:[0/2](185000/4588595) loss:2.481 lr:0.0000100 epoch_Time:27809.0min: [2024-01-03 13:08:04,868][model8_pretrain.py][INFO] Epoch:[0/2](185000/4588595) loss:2.737 lr:0.0000100 epoch_Time:27809.0min: [2024-01-03 13:08:04,869][model8_pretrain.py][INFO] Epoch:[0/2](185000/4588595) loss:2.884 lr:0.0000100 epoch_Time:27809.0min: [2024-01-03 13:08:04,869][model8_pretrain.py][INFO] Epoch:[0/2](185000/4588595) loss:2.792 lr:0.0000100 epoch_Time:27809.0min: [2024-01-03 13:08:04,868][model8_pretrain.py][INFO] Epoch:[0/2](185000/4588595) loss:2.968 lr:0.0000100 epoch_Time:27809.0min: [2024-01-03 
13:08:04,869][model8_pretrain.py][INFO] Epoch:[0/2](185000/4588595) loss:3.058 lr:0.0000100 epoch_Time:27809.0min: [2024-01-03 13:08:04,869][model8_pretrain.py][INFO] Epoch:[0/2](185000/4588595) loss:3.036 lr:0.0000100 epoch_Time:27809.0min: [2024-01-03 13:08:04,869][model8_pretrain.py][INFO] Epoch:[0/2](185000/4588595) loss:2.808 lr:0.0000100 epoch_Time:27809.0min: [2024-01-03 13:08:41,826][model8_pretrain.py][INFO] Epoch:[0/2](185100/4588595) loss:3.660 lr:0.0000100 epoch_Time:27808.0min: [2024-01-03 13:08:41,826][model8_pretrain.py][INFO] Epoch:[0/2](185100/4588595) loss:3.330 lr:0.0000100 epoch_Time:27808.0min: [2024-01-03 13:08:41,826][model8_pretrain.py][INFO] Epoch:[0/2](185100/4588595) loss:2.965 lr:0.0000100 epoch_Time:27808.0min: [2024-01-03 13:08:41,826][model8_pretrain.py][INFO] Epoch:[0/2](185100/4588595) loss:3.194 lr:0.0000100 epoch_Time:27808.0min: [2024-01-03 13:08:41,826][model8_pretrain.py][INFO] Epoch:[0/2](185100/4588595) loss:2.676 lr:0.0000100 epoch_Time:27808.0min: [2024-01-03 13:08:41,826][model8_pretrain.py][INFO] Epoch:[0/2](185100/4588595) loss:2.987 lr:0.0000100 epoch_Time:27808.0min: [2024-01-03 13:08:41,826][model8_pretrain.py][INFO] Epoch:[0/2](185100/4588595) loss:3.049 lr:0.0000100 epoch_Time:27808.0min: [2024-01-03 13:08:41,826][model8_pretrain.py][INFO] Epoch:[0/2](185100/4588595) loss:2.433 lr:0.0000100 epoch_Time:27808.0min: [2024-01-03 13:09:18,780][model8_pretrain.py][INFO] Epoch:[0/2](185200/4588595) loss:3.257 lr:0.0000100 epoch_Time:27807.0min: [2024-01-03 13:09:18,780][model8_pretrain.py][INFO] Epoch:[0/2](185200/4588595) loss:2.746 lr:0.0000100 epoch_Time:27807.0min: [2024-01-03 13:09:18,780][model8_pretrain.py][INFO] Epoch:[0/2](185200/4588595) loss:2.448 lr:0.0000100 epoch_Time:27807.0min: [2024-01-03 13:09:18,780][model8_pretrain.py][INFO] Epoch:[0/2](185200/4588595) loss:2.949 lr:0.0000100 epoch_Time:27807.0min: [2024-01-03 13:09:18,780][model8_pretrain.py][INFO] Epoch:[0/2](185200/4588595) loss:3.062 lr:0.0000100 
epoch_Time:27807.0min: [2024-01-03 13:09:18,780][model8_pretrain.py][INFO] Epoch:[0/2](185200/4588595) loss:3.050 lr:0.0000100 epoch_Time:27807.0min: [2024-01-03 13:09:18,780][model8_pretrain.py][INFO] Epoch:[0/2](185200/4588595) loss:2.895 lr:0.0000100 epoch_Time:27807.0min: [2024-01-03 13:09:18,780][model8_pretrain.py][INFO] Epoch:[0/2](185200/4588595) loss:3.095 lr:0.0000100 epoch_Time:27807.0min: [2024-01-03 13:10:04,437][model8_pretrain.py][INFO] Epoch:[0/2](185300/4588595) loss:2.532 lr:0.0000100 epoch_Time:27809.0min: [2024-01-03 13:10:04,437][model8_pretrain.py][INFO] Epoch:[0/2](185300/4588595) loss:2.577 lr:0.0000100 epoch_Time:27809.0min: [2024-01-03 13:10:04,437][model8_pretrain.py][INFO] Epoch:[0/2](185300/4588595) loss:3.237 lr:0.0000100 epoch_Time:27809.0min: [2024-01-03 13:10:04,437][model8_pretrain.py][INFO] Epoch:[0/2](185300/4588595) loss:3.262 lr:0.0000100 epoch_Time:27809.0min: [2024-01-03 13:10:04,437][model8_pretrain.py][INFO] Epoch:[0/2](185300/4588595) loss:3.076 lr:0.0000100 epoch_Time:27809.0min: [2024-01-03 13:10:04,437][model8_pretrain.py][INFO] Epoch:[0/2](185300/4588595) loss:2.395 lr:0.0000100 epoch_Time:27809.0min: [2024-01-03 13:10:04,437][model8_pretrain.py][INFO] Epoch:[0/2](185300/4588595) loss:3.511 lr:0.0000100 epoch_Time:27809.0min: [2024-01-03 13:10:04,437][model8_pretrain.py][INFO] Epoch:[0/2](185300/4588595) loss:2.959 lr:0.0000100 epoch_Time:27809.0min: [2024-01-03 13:10:41,364][model8_pretrain.py][INFO] Epoch:[0/2](185400/4588595) loss:2.762 lr:0.0000100 epoch_Time:27809.0min: [2024-01-03 13:10:41,364][model8_pretrain.py][INFO] Epoch:[0/2](185400/4588595) loss:2.910 lr:0.0000100 epoch_Time:27809.0min: [2024-01-03 13:10:41,364][model8_pretrain.py][INFO] Epoch:[0/2](185400/4588595) loss:2.996 lr:0.0000100 epoch_Time:27809.0min: [2024-01-03 13:10:41,364][model8_pretrain.py][INFO] Epoch:[0/2](185400/4588595) loss:3.453 lr:0.0000100 epoch_Time:27809.0min: [2024-01-03 13:10:41,365][model8_pretrain.py][INFO] 
Epoch:[0/2](185400/4588595) loss:3.421 lr:0.0000100 epoch_Time:27809.0min: [2024-01-03 13:10:41,365][model8_pretrain.py][INFO] Epoch:[0/2](185400/4588595) loss:3.272 lr:0.0000100 epoch_Time:27809.0min: [2024-01-03 13:10:41,365][model8_pretrain.py][INFO] Epoch:[0/2](185400/4588595) loss:3.102 lr:0.0000100 epoch_Time:27809.0min: [2024-01-03 13:10:41,365][model8_pretrain.py][INFO] Epoch:[0/2](185400/4588595) loss:2.598 lr:0.0000100 epoch_Time:27809.0min: [2024-01-03 13:11:18,286][model8_pretrain.py][INFO] Epoch:[0/2](185500/4588595) loss:3.248 lr:0.0000100 epoch_Time:27807.0min: [2024-01-03 13:11:18,286][model8_pretrain.py][INFO] Epoch:[0/2](185500/4588595) loss:2.973 lr:0.0000100 epoch_Time:27807.0min: [2024-01-03 13:11:18,286][model8_pretrain.py][INFO] Epoch:[0/2](185500/4588595) loss:3.359 lr:0.0000100 epoch_Time:27807.0min: [2024-01-03 13:11:18,286][model8_pretrain.py][INFO] Epoch:[0/2](185500/4588595) loss:3.029 lr:0.0000100 epoch_Time:27807.0min: [2024-01-03 13:11:18,286][model8_pretrain.py][INFO] Epoch:[0/2](185500/4588595) loss:3.081 lr:0.0000100 epoch_Time:27807.0min: [2024-01-03 13:11:18,286][model8_pretrain.py][INFO] Epoch:[0/2](185500/4588595) loss:3.257 lr:0.0000100 epoch_Time:27807.0min: [2024-01-03 13:11:18,286][model8_pretrain.py][INFO] Epoch:[0/2](185500/4588595) loss:2.780 lr:0.0000100 epoch_Time:27807.0min: [2024-01-03 13:11:18,286][model8_pretrain.py][INFO] Epoch:[0/2](185500/4588595) loss:2.529 lr:0.0000100 epoch_Time:27807.0min: [2024-01-03 13:11:55,231][model8_pretrain.py][INFO] Epoch:[0/2](185600/4588595) loss:3.126 lr:0.0000100 epoch_Time:27806.0min: [2024-01-03 13:11:55,231][model8_pretrain.py][INFO] Epoch:[0/2](185600/4588595) loss:2.183 lr:0.0000100 epoch_Time:27806.0min: [2024-01-03 13:11:55,231][model8_pretrain.py][INFO] Epoch:[0/2](185600/4588595) loss:2.988 lr:0.0000100 epoch_Time:27806.0min: [2024-01-03 13:11:55,231][model8_pretrain.py][INFO] Epoch:[0/2](185600/4588595) loss:2.880 lr:0.0000100 epoch_Time:27806.0min: [2024-01-03 
13:11:55,231][model8_pretrain.py][INFO] Epoch:[0/2](185600/4588595) loss:2.924 lr:0.0000100 epoch_Time:27806.0min: [2024-01-03 13:11:55,232][model8_pretrain.py][INFO] Epoch:[0/2](185600/4588595) loss:2.000 lr:0.0000100 epoch_Time:27806.0min: [2024-01-03 13:11:55,232][model8_pretrain.py][INFO] Epoch:[0/2](185600/4588595) loss:2.799 lr:0.0000100 epoch_Time:27806.0min: [2024-01-03 13:11:55,232][model8_pretrain.py][INFO] Epoch:[0/2](185600/4588595) loss:2.734 lr:0.0000100 epoch_Time:27806.0min: [2024-01-03 13:12:32,189][model8_pretrain.py][INFO] Epoch:[0/2](185700/4588595) loss:2.833 lr:0.0000100 epoch_Time:27805.0min: [2024-01-03 13:12:32,189][model8_pretrain.py][INFO] Epoch:[0/2](185700/4588595) loss:2.730 lr:0.0000100 epoch_Time:27805.0min: [2024-01-03 13:12:32,189][model8_pretrain.py][INFO] Epoch:[0/2](185700/4588595) loss:2.722 lr:0.0000100 epoch_Time:27806.0min: [2024-01-03 13:12:32,189][model8_pretrain.py][INFO] Epoch:[0/2](185700/4588595) loss:2.960 lr:0.0000100 epoch_Time:27806.0min: [2024-01-03 13:12:32,189][model8_pretrain.py][INFO] Epoch:[0/2](185700/4588595) loss:2.927 lr:0.0000100 epoch_Time:27805.0min: [2024-01-03 13:12:32,189][model8_pretrain.py][INFO] Epoch:[0/2](185700/4588595) loss:2.575 lr:0.0000100 epoch_Time:27805.0min: [2024-01-03 13:12:32,189][model8_pretrain.py][INFO] Epoch:[0/2](185700/4588595) loss:2.896 lr:0.0000100 epoch_Time:27805.0min: [2024-01-03 13:12:32,190][model8_pretrain.py][INFO] Epoch:[0/2](185700/4588595) loss:3.530 lr:0.0000100 epoch_Time:27805.0min: [2024-01-03 13:13:09,143][model8_pretrain.py][INFO] Epoch:[0/2](185800/4588595) loss:3.036 lr:0.0000100 epoch_Time:27804.0min: [2024-01-03 13:13:09,143][model8_pretrain.py][INFO] Epoch:[0/2](185800/4588595) loss:3.097 lr:0.0000100 epoch_Time:27804.0min: [2024-01-03 13:13:09,143][model8_pretrain.py][INFO] Epoch:[0/2](185800/4588595) loss:2.922 lr:0.0000100 epoch_Time:27804.0min: [2024-01-03 13:13:09,143][model8_pretrain.py][INFO] Epoch:[0/2](185800/4588595) loss:3.156 lr:0.0000100 
epoch_Time:27804.0min: [2024-01-03 13:13:09,143][model8_pretrain.py][INFO] Epoch:[0/2](185800/4588595) loss:3.332 lr:0.0000100 epoch_Time:27804.0min: [2024-01-03 13:13:09,143][model8_pretrain.py][INFO] Epoch:[0/2](185800/4588595) loss:2.929 lr:0.0000100 epoch_Time:27804.0min: [2024-01-03 13:13:09,143][model8_pretrain.py][INFO] Epoch:[0/2](185800/4588595) loss:2.878 lr:0.0000100 epoch_Time:27804.0min: [2024-01-03 13:13:09,143][model8_pretrain.py][INFO] Epoch:[0/2](185800/4588595) loss:3.050 lr:0.0000100 epoch_Time:27804.0min: [2024-01-03 13:13:46,100][model8_pretrain.py][INFO] Epoch:[0/2](185900/4588595) loss:2.795 lr:0.0000100 epoch_Time:27804.0min: [2024-01-03 13:13:46,100][model8_pretrain.py][INFO] Epoch:[0/2](185900/4588595) loss:2.952 lr:0.0000100 epoch_Time:27804.0min: [2024-01-03 13:13:46,100][model8_pretrain.py][INFO] Epoch:[0/2](185900/4588595) loss:3.045 lr:0.0000100 epoch_Time:27804.0min: [2024-01-03 13:13:46,100][model8_pretrain.py][INFO] Epoch:[0/2](185900/4588595) loss:2.596 lr:0.0000100 epoch_Time:27804.0min: [2024-01-03 13:13:46,100][model8_pretrain.py][INFO] Epoch:[0/2](185900/4588595) loss:2.968 lr:0.0000100 epoch_Time:27804.0min: [2024-01-03 13:13:46,100][model8_pretrain.py][INFO] Epoch:[0/2](185900/4588595) loss:2.727 lr:0.0000100 epoch_Time:27804.0min: [2024-01-03 13:13:46,100][model8_pretrain.py][INFO] Epoch:[0/2](185900/4588595) loss:2.944 lr:0.0000100 epoch_Time:27804.0min: [2024-01-03 13:13:46,100][model8_pretrain.py][INFO] Epoch:[0/2](185900/4588595) loss:3.547 lr:0.0000100 epoch_Time:27804.0min: [2024-01-03 13:14:23,052][model8_pretrain.py][INFO] Epoch:[0/2](186000/4588595) loss:2.834 lr:0.0000100 epoch_Time:27802.0min: [2024-01-03 13:14:23,052][model8_pretrain.py][INFO] Epoch:[0/2](186000/4588595) loss:2.466 lr:0.0000100 epoch_Time:27802.0min: [2024-01-03 13:14:23,052][model8_pretrain.py][INFO] Epoch:[0/2](186000/4588595) loss:2.759 lr:0.0000100 epoch_Time:27802.0min: [2024-01-03 13:14:23,052][model8_pretrain.py][INFO] 
Epoch:[0/2](186000/4588595) loss:3.012 lr:0.0000100 epoch_Time:27802.0min: [2024-01-03 13:14:23,052][model8_pretrain.py][INFO] Epoch:[0/2](186000/4588595) loss:2.963 lr:0.0000100 epoch_Time:27802.0min: [2024-01-03 13:14:23,052][model8_pretrain.py][INFO] Epoch:[0/2](186000/4588595) loss:3.122 lr:0.0000100 epoch_Time:27802.0min: [2024-01-03 13:14:23,053][model8_pretrain.py][INFO] Epoch:[0/2](186000/4588595) loss:3.187 lr:0.0000100 epoch_Time:27802.0min: [2024-01-03 13:14:23,053][model8_pretrain.py][INFO] Epoch:[0/2](186000/4588595) loss:3.562 lr:0.0000100 epoch_Time:27802.0min: [2024-01-03 13:15:08,893][model8_pretrain.py][INFO] Epoch:[0/2](186100/4588595) loss:2.700 lr:0.0000100 epoch_Time:27805.0min: [2024-01-03 13:15:08,897][model8_pretrain.py][INFO] Epoch:[0/2](186100/4588595) loss:3.137 lr:0.0000100 epoch_Time:27805.0min: [2024-01-03 13:15:08,897][model8_pretrain.py][INFO] Epoch:[0/2](186100/4588595) loss:2.832 lr:0.0000100 epoch_Time:27805.0min: [2024-01-03 13:15:08,898][model8_pretrain.py][INFO] Epoch:[0/2](186100/4588595) loss:2.994 lr:0.0000100 epoch_Time:27805.0min: [2024-01-03 13:15:08,898][model8_pretrain.py][INFO] Epoch:[0/2](186100/4588595) loss:3.057 lr:0.0000100 epoch_Time:27805.0min: [2024-01-03 13:15:08,898][model8_pretrain.py][INFO] Epoch:[0/2](186100/4588595) loss:2.571 lr:0.0000100 epoch_Time:27805.0min: [2024-01-03 13:15:08,898][model8_pretrain.py][INFO] Epoch:[0/2](186100/4588595) loss:3.059 lr:0.0000100 epoch_Time:27805.0min: [2024-01-03 13:15:08,898][model8_pretrain.py][INFO] Epoch:[0/2](186100/4588595) loss:2.478 lr:0.0000100 epoch_Time:27805.0min: [2024-01-03 13:15:45,815][model8_pretrain.py][INFO] Epoch:[0/2](186200/4588595) loss:2.930 lr:0.0000100 epoch_Time:27804.0min: [2024-01-03 13:15:45,815][model8_pretrain.py][INFO] Epoch:[0/2](186200/4588595) loss:2.513 lr:0.0000100 epoch_Time:27804.0min: [2024-01-03 13:15:45,815][model8_pretrain.py][INFO] Epoch:[0/2](186200/4588595) loss:3.255 lr:0.0000100 epoch_Time:27804.0min: [2024-01-03 
13:15:45,815][model8_pretrain.py][INFO] Epoch:[0/2](186200/4588595) loss:3.314 lr:0.0000100 epoch_Time:27804.0min: [2024-01-03 13:15:45,815][model8_pretrain.py][INFO] Epoch:[0/2](186200/4588595) loss:2.588 lr:0.0000100 epoch_Time:27804.0min: [2024-01-03 13:15:45,815][model8_pretrain.py][INFO] Epoch:[0/2](186200/4588595) loss:3.224 lr:0.0000100 epoch_Time:27804.0min: [2024-01-03 13:15:45,815][model8_pretrain.py][INFO] Epoch:[0/2](186200/4588595) loss:2.521 lr:0.0000100 epoch_Time:27804.0min: [2024-01-03 13:15:45,815][model8_pretrain.py][INFO] Epoch:[0/2](186200/4588595) loss:2.396 lr:0.0000100 epoch_Time:27804.0min: [2024-01-03 13:16:22,753][model8_pretrain.py][INFO] Epoch:[0/2](186300/4588595) loss:3.089 lr:0.0000100 epoch_Time:27803.0min: [2024-01-03 13:16:22,753][model8_pretrain.py][INFO] Epoch:[0/2](186300/4588595) loss:2.913 lr:0.0000100 epoch_Time:27803.0min: [2024-01-03 13:16:22,753][model8_pretrain.py][INFO] Epoch:[0/2](186300/4588595) loss:3.203 lr:0.0000100 epoch_Time:27803.0min: [2024-01-03 13:16:22,753][model8_pretrain.py][INFO] Epoch:[0/2](186300/4588595) loss:2.950 lr:0.0000100 epoch_Time:27803.0min: [2024-01-03 13:16:22,753][model8_pretrain.py][INFO] Epoch:[0/2](186300/4588595) loss:2.835 lr:0.0000100 epoch_Time:27803.0min: [2024-01-03 13:16:22,753][model8_pretrain.py][INFO] Epoch:[0/2](186300/4588595) loss:2.891 lr:0.0000100 epoch_Time:27803.0min: [2024-01-03 13:16:22,753][model8_pretrain.py][INFO] Epoch:[0/2](186300/4588595) loss:3.160 lr:0.0000100 epoch_Time:27803.0min: [2024-01-03 13:16:22,753][model8_pretrain.py][INFO] Epoch:[0/2](186300/4588595) loss:3.091 lr:0.0000100 epoch_Time:27803.0min: [2024-01-03 13:16:59,690][model8_pretrain.py][INFO] Epoch:[0/2](186400/4588595) loss:3.380 lr:0.0000100 epoch_Time:27801.0min: [2024-01-03 13:16:59,690][model8_pretrain.py][INFO] Epoch:[0/2](186400/4588595) loss:2.606 lr:0.0000100 epoch_Time:27801.0min: [2024-01-03 13:16:59,690][model8_pretrain.py][INFO] Epoch:[0/2](186400/4588595) loss:3.291 lr:0.0000100 
epoch_Time:27801.0min: [2024-01-03 13:16:59,690][model8_pretrain.py][INFO] Epoch:[0/2](186400/4588595) loss:2.819 lr:0.0000100 epoch_Time:27801.0min: [2024-01-03 13:16:59,690][model8_pretrain.py][INFO] Epoch:[0/2](186400/4588595) loss:2.605 lr:0.0000100 epoch_Time:27801.0min: [2024-01-03 13:16:59,690][model8_pretrain.py][INFO] Epoch:[0/2](186400/4588595) loss:3.322 lr:0.0000100 epoch_Time:27801.0min: [2024-01-03 13:16:59,690][model8_pretrain.py][INFO] Epoch:[0/2](186400/4588595) loss:2.885 lr:0.0000100 epoch_Time:27801.0min: [2024-01-03 13:16:59,690][model8_pretrain.py][INFO] Epoch:[0/2](186400/4588595) loss:3.283 lr:0.0000100 epoch_Time:27801.0min: [2024-01-03 13:17:36,636][model8_pretrain.py][INFO] Epoch:[0/2](186500/4588595) loss:3.011 lr:0.0000100 epoch_Time:27801.0min: [2024-01-03 13:17:36,636][model8_pretrain.py][INFO] Epoch:[0/2](186500/4588595) loss:2.288 lr:0.0000100 epoch_Time:27801.0min: [2024-01-03 13:17:36,636][model8_pretrain.py][INFO] Epoch:[0/2](186500/4588595) loss:2.785 lr:0.0000100 epoch_Time:27801.0min: [2024-01-03 13:17:36,636][model8_pretrain.py][INFO] Epoch:[0/2](186500/4588595) loss:3.059 lr:0.0000100 epoch_Time:27801.0min: [2024-01-03 13:17:36,636][model8_pretrain.py][INFO] Epoch:[0/2](186500/4588595) loss:2.914 lr:0.0000100 epoch_Time:27801.0min: [2024-01-03 13:17:36,636][model8_pretrain.py][INFO] Epoch:[0/2](186500/4588595) loss:2.762 lr:0.0000100 epoch_Time:27801.0min: [2024-01-03 13:17:36,637][model8_pretrain.py][INFO] Epoch:[0/2](186500/4588595) loss:3.187 lr:0.0000100 epoch_Time:27801.0min: [2024-01-03 13:17:36,637][model8_pretrain.py][INFO] Epoch:[0/2](186500/4588595) loss:2.814 lr:0.0000100 epoch_Time:27801.0min: [2024-01-03 13:18:13,589][model8_pretrain.py][INFO] Epoch:[0/2](186600/4588595) loss:2.587 lr:0.0000100 epoch_Time:27800.0min: [2024-01-03 13:18:13,589][model8_pretrain.py][INFO] Epoch:[0/2](186600/4588595) loss:3.508 lr:0.0000100 epoch_Time:27800.0min: [2024-01-03 13:18:13,589][model8_pretrain.py][INFO] 
Epoch:[0/2](186600/4588595) loss:2.808 lr:0.0000100 epoch_Time:27800.0min: [2024-01-03 13:18:13,589][model8_pretrain.py][INFO] Epoch:[0/2](186600/4588595) loss:2.149 lr:0.0000100 epoch_Time:27800.0min: [2024-01-03 13:18:13,589][model8_pretrain.py][INFO] Epoch:[0/2](186600/4588595) loss:2.835 lr:0.0000100 epoch_Time:27800.0min: [2024-01-03 13:18:13,589][model8_pretrain.py][INFO] Epoch:[0/2](186600/4588595) loss:2.893 lr:0.0000100 epoch_Time:27800.0min: [2024-01-03 13:18:13,589][model8_pretrain.py][INFO] Epoch:[0/2](186600/4588595) loss:3.144 lr:0.0000100 epoch_Time:27800.0min: [2024-01-03 13:18:13,590][model8_pretrain.py][INFO] Epoch:[0/2](186600/4588595) loss:2.850 lr:0.0000100 epoch_Time:27800.0min: [2024-01-03 13:18:50,537][model8_pretrain.py][INFO] Epoch:[0/2](186700/4588595) loss:3.204 lr:0.0000100 epoch_Time:27798.0min: [2024-01-03 13:18:50,537][model8_pretrain.py][INFO] Epoch:[0/2](186700/4588595) loss:2.916 lr:0.0000100 epoch_Time:27798.0min: [2024-01-03 13:18:50,537][model8_pretrain.py][INFO] Epoch:[0/2](186700/4588595) loss:2.983 lr:0.0000100 epoch_Time:27798.0min: [2024-01-03 13:18:50,537][model8_pretrain.py][INFO] Epoch:[0/2](186700/4588595) loss:3.245 lr:0.0000100 epoch_Time:27798.0min: [2024-01-03 13:18:50,537][model8_pretrain.py][INFO] Epoch:[0/2](186700/4588595) loss:3.134 lr:0.0000100 epoch_Time:27798.0min: [2024-01-03 13:18:50,537][model8_pretrain.py][INFO] Epoch:[0/2](186700/4588595) loss:2.779 lr:0.0000100 epoch_Time:27798.0min: [2024-01-03 13:18:50,537][model8_pretrain.py][INFO] Epoch:[0/2](186700/4588595) loss:2.801 lr:0.0000100 epoch_Time:27798.0min: [2024-01-03 13:18:50,537][model8_pretrain.py][INFO] Epoch:[0/2](186700/4588595) loss:2.509 lr:0.0000100 epoch_Time:27798.0min: [2024-01-03 13:19:27,482][model8_pretrain.py][INFO] Epoch:[0/2](186800/4588595) loss:2.991 lr:0.0000100 epoch_Time:27798.0min: [2024-01-03 13:19:27,482][model8_pretrain.py][INFO] Epoch:[0/2](186800/4588595) loss:2.895 lr:0.0000100 epoch_Time:27798.0min: [2024-01-03 
13:19:27,482][model8_pretrain.py][INFO] Epoch:[0/2](186800/4588595) loss:2.951 lr:0.0000100 epoch_Time:27798.0min: [2024-01-03 13:19:27,482][model8_pretrain.py][INFO] Epoch:[0/2](186800/4588595) loss:3.366 lr:0.0000100 epoch_Time:27798.0min: [2024-01-03 13:19:27,482][model8_pretrain.py][INFO] Epoch:[0/2](186800/4588595) loss:3.167 lr:0.0000100 epoch_Time:27798.0min: [2024-01-03 13:19:27,482][model8_pretrain.py][INFO] Epoch:[0/2](186800/4588595) loss:2.807 lr:0.0000100 epoch_Time:27798.0min: [2024-01-03 13:19:27,482][model8_pretrain.py][INFO] Epoch:[0/2](186800/4588595) loss:2.915 lr:0.0000100 epoch_Time:27798.0min: [2024-01-03 13:19:27,482][model8_pretrain.py][INFO] Epoch:[0/2](186800/4588595) loss:2.882 lr:0.0000100 epoch_Time:27798.0min: [2024-01-03 13:20:11,532][model8_pretrain.py][INFO] Epoch:[0/2](186900/4588595) loss:3.138 lr:0.0000100 epoch_Time:27799.0min: [2024-01-03 13:20:11,532][model8_pretrain.py][INFO] Epoch:[0/2](186900/4588595) loss:2.944 lr:0.0000100 epoch_Time:27799.0min: [2024-01-03 13:20:11,532][model8_pretrain.py][INFO] Epoch:[0/2](186900/4588595) loss:2.910 lr:0.0000100 epoch_Time:27799.0min: [2024-01-03 13:20:11,532][model8_pretrain.py][INFO] Epoch:[0/2](186900/4588595) loss:2.800 lr:0.0000100 epoch_Time:27799.0min: [2024-01-03 13:20:11,536][model8_pretrain.py][INFO] Epoch:[0/2](186900/4588595) loss:2.778 lr:0.0000100 epoch_Time:27799.0min: [2024-01-03 13:20:11,537][model8_pretrain.py][INFO] Epoch:[0/2](186900/4588595) loss:2.779 lr:0.0000100 epoch_Time:27799.0min: [2024-01-03 13:20:11,537][model8_pretrain.py][INFO] Epoch:[0/2](186900/4588595) loss:3.042 lr:0.0000100 epoch_Time:27799.0min: [2024-01-03 13:20:11,537][model8_pretrain.py][INFO] Epoch:[0/2](186900/4588595) loss:2.846 lr:0.0000100 epoch_Time:27799.0min: [2024-01-03 13:20:50,130][model8_pretrain.py][INFO] Epoch:[0/2](187000/4588595) loss:3.374 lr:0.0000100 epoch_Time:27799.0min: [2024-01-03 13:20:50,130][model8_pretrain.py][INFO] Epoch:[0/2](187000/4588595) loss:3.518 lr:0.0000100 
epoch_Time:27799.0min: [2024-01-03 13:20:50,130][model8_pretrain.py][INFO] Epoch:[0/2](187000/4588595) loss:3.043 lr:0.0000100 epoch_Time:27799.0min: [2024-01-03 13:20:50,130][model8_pretrain.py][INFO] Epoch:[0/2](187000/4588595) loss:3.109 lr:0.0000100 epoch_Time:27799.0min: [2024-01-03 13:20:50,130][model8_pretrain.py][INFO] Epoch:[0/2](187000/4588595) loss:3.135 lr:0.0000100 epoch_Time:27799.0min: [2024-01-03 13:20:50,130][model8_pretrain.py][INFO] Epoch:[0/2](187000/4588595) loss:3.294 lr:0.0000100 epoch_Time:27799.0min: [2024-01-03 13:20:50,130][model8_pretrain.py][INFO] Epoch:[0/2](187000/4588595) loss:3.306 lr:0.0000100 epoch_Time:27799.0min: [2024-01-03 13:20:50,130][model8_pretrain.py][INFO] Epoch:[0/2](187000/4588595) loss:2.516 lr:0.0000100 epoch_Time:27799.0min: [2024-01-03 13:21:27,061][model8_pretrain.py][INFO] Epoch:[0/2](187100/4588595) loss:2.978 lr:0.0000100 epoch_Time:27798.0min: [2024-01-03 13:21:27,061][model8_pretrain.py][INFO] Epoch:[0/2](187100/4588595) loss:3.048 lr:0.0000100 epoch_Time:27798.0min: [2024-01-03 13:21:27,061][model8_pretrain.py][INFO] Epoch:[0/2](187100/4588595) loss:3.049 lr:0.0000100 epoch_Time:27798.0min: [2024-01-03 13:21:27,061][model8_pretrain.py][INFO] Epoch:[0/2](187100/4588595) loss:3.543 lr:0.0000100 epoch_Time:27798.0min: [2024-01-03 13:21:27,061][model8_pretrain.py][INFO] Epoch:[0/2](187100/4588595) loss:3.059 lr:0.0000100 epoch_Time:27798.0min: [2024-01-03 13:21:27,061][model8_pretrain.py][INFO] Epoch:[0/2](187100/4588595) loss:2.600 lr:0.0000100 epoch_Time:27798.0min: [2024-01-03 13:21:27,061][model8_pretrain.py][INFO] Epoch:[0/2](187100/4588595) loss:3.309 lr:0.0000100 epoch_Time:27798.0min: [2024-01-03 13:21:27,061][model8_pretrain.py][INFO] Epoch:[0/2](187100/4588595) loss:2.694 lr:0.0000100 epoch_Time:27798.0min: [2024-01-03 13:22:04,003][model8_pretrain.py][INFO] Epoch:[0/2](187200/4588595) loss:2.660 lr:0.0000100 epoch_Time:27797.0min: [2024-01-03 13:22:04,003][model8_pretrain.py][INFO] 
Epoch:[0/2](187200/4588595) loss:3.308 lr:0.0000100 epoch_Time:27797.0min: [2024-01-03 13:22:04,003][model8_pretrain.py][INFO] Epoch:[0/2](187200/4588595) loss:3.412 lr:0.0000100 epoch_Time:27797.0min: [2024-01-03 13:22:04,003][model8_pretrain.py][INFO] Epoch:[0/2](187200/4588595) loss:3.025 lr:0.0000100 epoch_Time:27797.0min: [2024-01-03 13:22:04,003][model8_pretrain.py][INFO] Epoch:[0/2](187200/4588595) loss:3.260 lr:0.0000100 epoch_Time:27797.0min: [2024-01-03 13:22:04,003][model8_pretrain.py][INFO] Epoch:[0/2](187200/4588595) loss:3.022 lr:0.0000100 epoch_Time:27797.0min: [2024-01-03 13:22:04,003][model8_pretrain.py][INFO] Epoch:[0/2](187200/4588595) loss:3.202 lr:0.0000100 epoch_Time:27797.0min: [2024-01-03 13:22:04,004][model8_pretrain.py][INFO] Epoch:[0/2](187200/4588595) loss:2.987 lr:0.0000100 epoch_Time:27797.0min: [2024-01-03 13:22:40,949][model8_pretrain.py][INFO] Epoch:[0/2](187300/4588595) loss:2.909 lr:0.0000100 epoch_Time:27797.0min: [2024-01-03 13:22:40,949][model8_pretrain.py][INFO] Epoch:[0/2](187300/4588595) loss:3.142 lr:0.0000100 epoch_Time:27797.0min: [2024-01-03 13:22:40,949][model8_pretrain.py][INFO] Epoch:[0/2](187300/4588595) loss:3.319 lr:0.0000100 epoch_Time:27797.0min: [2024-01-03 13:22:40,949][model8_pretrain.py][INFO] Epoch:[0/2](187300/4588595) loss:3.673 lr:0.0000100 epoch_Time:27797.0min: [2024-01-03 13:22:40,949][model8_pretrain.py][INFO] Epoch:[0/2](187300/4588595) loss:3.202 lr:0.0000100 epoch_Time:27797.0min: [2024-01-03 13:22:40,949][model8_pretrain.py][INFO] Epoch:[0/2](187300/4588595) loss:2.794 lr:0.0000100 epoch_Time:27797.0min: [2024-01-03 13:22:40,949][model8_pretrain.py][INFO] Epoch:[0/2](187300/4588595) loss:2.877 lr:0.0000100 epoch_Time:27797.0min: [2024-01-03 13:22:40,950][model8_pretrain.py][INFO] Epoch:[0/2](187300/4588595) loss:2.987 lr:0.0000100 epoch_Time:27797.0min: [2024-01-03 13:23:17,891][model8_pretrain.py][INFO] Epoch:[0/2](187400/4588595) loss:2.908 lr:0.0000100 epoch_Time:27795.0min: [2024-01-03 
13:23:17,891][model8_pretrain.py][INFO] Epoch:[0/2](187400/4588595) loss:2.914 lr:0.0000100 epoch_Time:27795.0min: [2024-01-03 13:23:17,891][model8_pretrain.py][INFO] Epoch:[0/2](187400/4588595) loss:2.171 lr:0.0000100 epoch_Time:27795.0min: [2024-01-03 13:23:17,891][model8_pretrain.py][INFO] Epoch:[0/2](187400/4588595) loss:3.202 lr:0.0000100 epoch_Time:27795.0min: [2024-01-03 13:23:17,891][model8_pretrain.py][INFO] Epoch:[0/2](187400/4588595) loss:2.984 lr:0.0000100 epoch_Time:27795.0min: [2024-01-03 13:23:17,891][model8_pretrain.py][INFO] Epoch:[0/2](187400/4588595) loss:3.275 lr:0.0000100 epoch_Time:27795.0min: [2024-01-03 13:23:17,891][model8_pretrain.py][INFO] Epoch:[0/2](187400/4588595) loss:2.920 lr:0.0000100 epoch_Time:27795.0min: [2024-01-03 13:23:17,891][model8_pretrain.py][INFO] Epoch:[0/2](187400/4588595) loss:3.065 lr:0.0000100 epoch_Time:27795.0min: [2024-01-03 13:23:54,830][model8_pretrain.py][INFO] Epoch:[0/2](187500/4588595) loss:3.222 lr:0.0000100 epoch_Time:27794.0min: [2024-01-03 13:23:54,830][model8_pretrain.py][INFO] Epoch:[0/2](187500/4588595) loss:2.321 lr:0.0000100 epoch_Time:27794.0min: [2024-01-03 13:23:54,830][model8_pretrain.py][INFO] Epoch:[0/2](187500/4588595) loss:3.325 lr:0.0000100 epoch_Time:27794.0min: [2024-01-03 13:23:54,830][model8_pretrain.py][INFO] Epoch:[0/2](187500/4588595) loss:3.161 lr:0.0000100 epoch_Time:27794.0min: [2024-01-03 13:23:54,830][model8_pretrain.py][INFO] Epoch:[0/2](187500/4588595) loss:3.311 lr:0.0000100 epoch_Time:27794.0min: [2024-01-03 13:23:54,830][model8_pretrain.py][INFO] Epoch:[0/2](187500/4588595) loss:3.071 lr:0.0000100 epoch_Time:27794.0min: [2024-01-03 13:23:54,830][model8_pretrain.py][INFO] Epoch:[0/2](187500/4588595) loss:2.442 lr:0.0000100 epoch_Time:27794.0min: [2024-01-03 13:23:54,830][model8_pretrain.py][INFO] Epoch:[0/2](187500/4588595) loss:2.947 lr:0.0000100 epoch_Time:27794.0min: [2024-01-03 13:24:31,748][model8_pretrain.py][INFO] Epoch:[0/2](187600/4588595) loss:2.374 lr:0.0000100 
epoch_Time:27793.0min: [2024-01-03 13:24:31,748][model8_pretrain.py][INFO] Epoch:[0/2](187600/4588595) loss:2.707 lr:0.0000100 epoch_Time:27793.0min: [2024-01-03 13:24:31,748][model8_pretrain.py][INFO] Epoch:[0/2](187600/4588595) loss:3.136 lr:0.0000100 epoch_Time:27793.0min: [2024-01-03 13:24:31,748][model8_pretrain.py][INFO] Epoch:[0/2](187600/4588595) loss:3.017 lr:0.0000100 epoch_Time:27793.0min: [2024-01-03 13:24:31,748][model8_pretrain.py][INFO] Epoch:[0/2](187600/4588595) loss:2.470 lr:0.0000100 epoch_Time:27793.0min: [2024-01-03 13:24:31,748][model8_pretrain.py][INFO] Epoch:[0/2](187600/4588595) loss:3.287 lr:0.0000100 epoch_Time:27793.0min: [2024-01-03 13:24:31,748][model8_pretrain.py][INFO] Epoch:[0/2](187600/4588595) loss:2.726 lr:0.0000100 epoch_Time:27793.0min: [2024-01-03 13:24:31,748][model8_pretrain.py][INFO] Epoch:[0/2](187600/4588595) loss:3.034 lr:0.0000100 epoch_Time:27793.0min: [2024-01-03 13:25:12,199][model8_pretrain.py][INFO] Epoch:[0/2](187700/4588595) loss:2.992 lr:0.0000100 epoch_Time:27793.0min: [2024-01-03 13:25:12,199][model8_pretrain.py][INFO] Epoch:[0/2](187700/4588595) loss:2.552 lr:0.0000100 epoch_Time:27793.0min: [2024-01-03 13:25:12,199][model8_pretrain.py][INFO] Epoch:[0/2](187700/4588595) loss:2.613 lr:0.0000100 epoch_Time:27793.0min: [2024-01-03 13:25:12,199][model8_pretrain.py][INFO] Epoch:[0/2](187700/4588595) loss:3.621 lr:0.0000100 epoch_Time:27793.0min: [2024-01-03 13:25:12,199][model8_pretrain.py][INFO] Epoch:[0/2](187700/4588595) loss:3.378 lr:0.0000100 epoch_Time:27793.0min: [2024-01-03 13:25:12,199][model8_pretrain.py][INFO] Epoch:[0/2](187700/4588595) loss:2.696 lr:0.0000100 epoch_Time:27793.0min: [2024-01-03 13:25:12,199][model8_pretrain.py][INFO] Epoch:[0/2](187700/4588595) loss:3.320 lr:0.0000100 epoch_Time:27793.0min: [2024-01-03 13:25:12,200][model8_pretrain.py][INFO] Epoch:[0/2](187700/4588595) loss:2.549 lr:0.0000100 epoch_Time:27793.0min: [2024-01-03 13:25:54,612][model8_pretrain.py][INFO] 
Epoch:[0/2](187800/4588595) loss:2.989 lr:0.0000100 epoch_Time:27794.0min: [2024-01-03 13:25:54,612][model8_pretrain.py][INFO] Epoch:[0/2](187800/4588595) loss:3.129 lr:0.0000100 epoch_Time:27794.0min: [2024-01-03 13:25:54,612][model8_pretrain.py][INFO] Epoch:[0/2](187800/4588595) loss:2.858 lr:0.0000100 epoch_Time:27794.0min: [2024-01-03 13:25:54,613][model8_pretrain.py][INFO] Epoch:[0/2](187800/4588595) loss:2.621 lr:0.0000100 epoch_Time:27794.0min: [2024-01-03 13:25:54,612][model8_pretrain.py][INFO] Epoch:[0/2](187800/4588595) loss:3.340 lr:0.0000100 epoch_Time:27794.0min: [2024-01-03 13:25:54,613][model8_pretrain.py][INFO] Epoch:[0/2](187800/4588595) loss:2.967 lr:0.0000100 epoch_Time:27794.0min: [2024-01-03 13:25:54,612][model8_pretrain.py][INFO] Epoch:[0/2](187800/4588595) loss:3.167 lr:0.0000100 epoch_Time:27794.0min: [2024-01-03 13:25:54,613][model8_pretrain.py][INFO] Epoch:[0/2](187800/4588595) loss:2.130 lr:0.0000100 epoch_Time:27794.0min: [2024-01-03 13:26:31,544][model8_pretrain.py][INFO] Epoch:[0/2](187900/4588595) loss:2.370 lr:0.0000100 epoch_Time:27794.0min: [2024-01-03 13:26:31,544][model8_pretrain.py][INFO] Epoch:[0/2](187900/4588595) loss:3.587 lr:0.0000100 epoch_Time:27794.0min: [2024-01-03 13:26:31,544][model8_pretrain.py][INFO] Epoch:[0/2](187900/4588595) loss:2.647 lr:0.0000100 epoch_Time:27794.0min: [2024-01-03 13:26:31,544][model8_pretrain.py][INFO] Epoch:[0/2](187900/4588595) loss:2.984 lr:0.0000100 epoch_Time:27794.0min: [2024-01-03 13:26:31,544][model8_pretrain.py][INFO] Epoch:[0/2](187900/4588595) loss:2.647 lr:0.0000100 epoch_Time:27794.0min: [2024-01-03 13:26:31,544][model8_pretrain.py][INFO] Epoch:[0/2](187900/4588595) loss:3.224 lr:0.0000100 epoch_Time:27794.0min: [2024-01-03 13:26:31,544][model8_pretrain.py][INFO] Epoch:[0/2](187900/4588595) loss:3.067 lr:0.0000100 epoch_Time:27794.0min: [2024-01-03 13:26:31,544][model8_pretrain.py][INFO] Epoch:[0/2](187900/4588595) loss:3.090 lr:0.0000100 epoch_Time:27794.0min: [2024-01-03 
13:27:08,472][model8_pretrain.py][INFO] Epoch:[0/2](188000/4588595) loss:3.314 lr:0.0000100 epoch_Time:27792.0min: [2024-01-03 13:27:08,472][model8_pretrain.py][INFO] Epoch:[0/2](188000/4588595) loss:2.949 lr:0.0000100 epoch_Time:27792.0min: [2024-01-03 13:27:08,472][model8_pretrain.py][INFO] Epoch:[0/2](188000/4588595) loss:2.736 lr:0.0000100 epoch_Time:27792.0min: [2024-01-03 13:27:08,472][model8_pretrain.py][INFO] Epoch:[0/2](188000/4588595) loss:3.317 lr:0.0000100 epoch_Time:27792.0min: [2024-01-03 13:27:08,472][model8_pretrain.py][INFO] Epoch:[0/2](188000/4588595) loss:3.222 lr:0.0000100 epoch_Time:27792.0min: [2024-01-03 13:27:08,472][model8_pretrain.py][INFO] Epoch:[0/2](188000/4588595) loss:2.806 lr:0.0000100 epoch_Time:27792.0min: [2024-01-03 13:27:08,472][model8_pretrain.py][INFO] Epoch:[0/2](188000/4588595) loss:3.318 lr:0.0000100 epoch_Time:27792.0min: [2024-01-03 13:27:08,472][model8_pretrain.py][INFO] Epoch:[0/2](188000/4588595) loss:3.028 lr:0.0000100 epoch_Time:27792.0min: [2024-01-03 13:27:45,402][model8_pretrain.py][INFO] Epoch:[0/2](188100/4588595) loss:3.150 lr:0.0000100 epoch_Time:27792.0min: [2024-01-03 13:27:45,402][model8_pretrain.py][INFO] Epoch:[0/2](188100/4588595) loss:3.024 lr:0.0000100 epoch_Time:27792.0min: [2024-01-03 13:27:45,402][model8_pretrain.py][INFO] Epoch:[0/2](188100/4588595) loss:3.164 lr:0.0000100 epoch_Time:27792.0min: [2024-01-03 13:27:45,402][model8_pretrain.py][INFO] Epoch:[0/2](188100/4588595) loss:2.780 lr:0.0000100 epoch_Time:27792.0min: [2024-01-03 13:27:45,402][model8_pretrain.py][INFO] Epoch:[0/2](188100/4588595) loss:3.386 lr:0.0000100 epoch_Time:27792.0min: [2024-01-03 13:27:45,402][model8_pretrain.py][INFO] Epoch:[0/2](188100/4588595) loss:3.031 lr:0.0000100 epoch_Time:27792.0min: [2024-01-03 13:27:45,402][model8_pretrain.py][INFO] Epoch:[0/2](188100/4588595) loss:2.551 lr:0.0000100 epoch_Time:27792.0min: [2024-01-03 13:27:45,402][model8_pretrain.py][INFO] Epoch:[0/2](188100/4588595) loss:2.749 lr:0.0000100 
epoch_Time:27792.0min: [2024-01-03 13:28:22,337][model8_pretrain.py][INFO] Epoch:[0/2](188200/4588595) loss:3.052 lr:0.0000100 epoch_Time:27791.0min: [2024-01-03 13:28:22,337][model8_pretrain.py][INFO] Epoch:[0/2](188200/4588595) loss:2.918 lr:0.0000100 epoch_Time:27791.0min: [2024-01-03 13:28:22,337][model8_pretrain.py][INFO] Epoch:[0/2](188200/4588595) loss:2.965 lr:0.0000100 epoch_Time:27791.0min: [2024-01-03 13:28:22,337][model8_pretrain.py][INFO] Epoch:[0/2](188200/4588595) loss:2.781 lr:0.0000100 epoch_Time:27791.0min: [2024-01-03 13:28:22,337][model8_pretrain.py][INFO] Epoch:[0/2](188200/4588595) loss:2.927 lr:0.0000100 epoch_Time:27791.0min: [2024-01-03 13:28:22,337][model8_pretrain.py][INFO] Epoch:[0/2](188200/4588595) loss:2.645 lr:0.0000100 epoch_Time:27791.0min: [2024-01-03 13:28:22,337][model8_pretrain.py][INFO] Epoch:[0/2](188200/4588595) loss:2.678 lr:0.0000100 epoch_Time:27791.0min: [2024-01-03 13:28:22,337][model8_pretrain.py][INFO] Epoch:[0/2](188200/4588595) loss:2.622 lr:0.0000100 epoch_Time:27791.0min: [2024-01-03 13:28:59,257][model8_pretrain.py][INFO] Epoch:[0/2](188300/4588595) loss:3.128 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:28:59,257][model8_pretrain.py][INFO] Epoch:[0/2](188300/4588595) loss:3.267 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:28:59,257][model8_pretrain.py][INFO] Epoch:[0/2](188300/4588595) loss:3.399 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:28:59,257][model8_pretrain.py][INFO] Epoch:[0/2](188300/4588595) loss:3.470 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:28:59,257][model8_pretrain.py][INFO] Epoch:[0/2](188300/4588595) loss:3.426 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:28:59,257][model8_pretrain.py][INFO] Epoch:[0/2](188300/4588595) loss:2.729 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:28:59,257][model8_pretrain.py][INFO] Epoch:[0/2](188300/4588595) loss:3.036 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:28:59,257][model8_pretrain.py][INFO] 
Epoch:[0/2](188300/4588595) loss:2.784 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:29:36,172][model8_pretrain.py][INFO] Epoch:[0/2](188400/4588595) loss:2.819 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:29:36,172][model8_pretrain.py][INFO] Epoch:[0/2](188400/4588595) loss:3.466 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:29:36,172][model8_pretrain.py][INFO] Epoch:[0/2](188400/4588595) loss:2.929 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:29:36,172][model8_pretrain.py][INFO] Epoch:[0/2](188400/4588595) loss:3.142 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:29:36,172][model8_pretrain.py][INFO] Epoch:[0/2](188400/4588595) loss:2.311 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:29:36,172][model8_pretrain.py][INFO] Epoch:[0/2](188400/4588595) loss:3.187 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:29:36,172][model8_pretrain.py][INFO] Epoch:[0/2](188400/4588595) loss:2.834 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:29:36,172][model8_pretrain.py][INFO] Epoch:[0/2](188400/4588595) loss:2.850 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:30:16,557][model8_pretrain.py][INFO] Epoch:[0/2](188500/4588595) loss:3.150 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:30:16,557][model8_pretrain.py][INFO] Epoch:[0/2](188500/4588595) loss:3.165 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:30:16,557][model8_pretrain.py][INFO] Epoch:[0/2](188500/4588595) loss:2.689 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:30:16,557][model8_pretrain.py][INFO] Epoch:[0/2](188500/4588595) loss:3.242 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:30:16,557][model8_pretrain.py][INFO] Epoch:[0/2](188500/4588595) loss:2.998 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:30:16,557][model8_pretrain.py][INFO] Epoch:[0/2](188500/4588595) loss:2.723 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:30:16,557][model8_pretrain.py][INFO] Epoch:[0/2](188500/4588595) loss:2.786 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 
13:30:16,558][model8_pretrain.py][INFO] Epoch:[0/2](188500/4588595) loss:3.043 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:30:58,750][model8_pretrain.py][INFO] Epoch:[0/2](188600/4588595) loss:2.794 lr:0.0000100 epoch_Time:27790.0min: [2024-01-03 13:30:58,750][model8_pretrain.py][INFO] Epoch:[0/2](188600/4588595) loss:3.261 lr:0.0000100 epoch_Time:27790.0min: [2024-01-03 13:30:58,750][model8_pretrain.py][INFO] Epoch:[0/2](188600/4588595) loss:3.082 lr:0.0000100 epoch_Time:27790.0min: [2024-01-03 13:30:58,750][model8_pretrain.py][INFO] Epoch:[0/2](188600/4588595) loss:2.427 lr:0.0000100 epoch_Time:27790.0min: [2024-01-03 13:30:58,750][model8_pretrain.py][INFO] Epoch:[0/2](188600/4588595) loss:2.638 lr:0.0000100 epoch_Time:27790.0min: [2024-01-03 13:30:58,750][model8_pretrain.py][INFO] Epoch:[0/2](188600/4588595) loss:2.887 lr:0.0000100 epoch_Time:27790.0min: [2024-01-03 13:30:58,750][model8_pretrain.py][INFO] Epoch:[0/2](188600/4588595) loss:2.373 lr:0.0000100 epoch_Time:27790.0min: [2024-01-03 13:30:58,750][model8_pretrain.py][INFO] Epoch:[0/2](188600/4588595) loss:2.837 lr:0.0000100 epoch_Time:27790.0min: [2024-01-03 13:31:35,687][model8_pretrain.py][INFO] Epoch:[0/2](188700/4588595) loss:2.671 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:31:35,687][model8_pretrain.py][INFO] Epoch:[0/2](188700/4588595) loss:2.672 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:31:35,687][model8_pretrain.py][INFO] Epoch:[0/2](188700/4588595) loss:2.870 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:31:35,687][model8_pretrain.py][INFO] Epoch:[0/2](188700/4588595) loss:2.955 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:31:35,687][model8_pretrain.py][INFO] Epoch:[0/2](188700/4588595) loss:3.039 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:31:35,688][model8_pretrain.py][INFO] Epoch:[0/2](188700/4588595) loss:3.104 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:31:35,688][model8_pretrain.py][INFO] Epoch:[0/2](188700/4588595) loss:2.935 lr:0.0000100 
epoch_Time:27789.0min: [2024-01-03 13:31:35,688][model8_pretrain.py][INFO] Epoch:[0/2](188700/4588595) loss:2.811 lr:0.0000100 epoch_Time:27789.0min: [2024-01-03 13:32:12,617][model8_pretrain.py][INFO] Epoch:[0/2](188800/4588595) loss:2.653 lr:0.0000100 epoch_Time:27788.0min: [2024-01-03 13:32:12,617][model8_pretrain.py][INFO] Epoch:[0/2](188800/4588595) loss:2.641 lr:0.0000100 epoch_Time:27788.0min: [2024-01-03 13:32:12,617][model8_pretrain.py][INFO] Epoch:[0/2](188800/4588595) loss:3.128 lr:0.0000100 epoch_Time:27788.0min: [2024-01-03 13:32:12,617][model8_pretrain.py][INFO] Epoch:[0/2](188800/4588595) loss:3.274 lr:0.0000100 epoch_Time:27788.0min: [2024-01-03 13:32:12,617][model8_pretrain.py][INFO] Epoch:[0/2](188800/4588595) loss:2.500 lr:0.0000100 epoch_Time:27788.0min: [2024-01-03 13:32:12,617][model8_pretrain.py][INFO] Epoch:[0/2](188800/4588595) loss:2.476 lr:0.0000100 epoch_Time:27788.0min: [2024-01-03 13:32:12,617][model8_pretrain.py][INFO] Epoch:[0/2](188800/4588595) loss:3.074 lr:0.0000100 epoch_Time:27788.0min: [2024-01-03 13:32:12,617][model8_pretrain.py][INFO] Epoch:[0/2](188800/4588595) loss:2.841 lr:0.0000100 epoch_Time:27788.0min: [2024-01-03 13:32:49,561][model8_pretrain.py][INFO] Epoch:[0/2](188900/4588595) loss:3.086 lr:0.0000100 epoch_Time:27786.0min: [2024-01-03 13:32:49,562][model8_pretrain.py][INFO] Epoch:[0/2](188900/4588595) loss:2.506 lr:0.0000100 epoch_Time:27786.0min: [2024-01-03 13:32:49,562][model8_pretrain.py][INFO] Epoch:[0/2](188900/4588595) loss:3.024 lr:0.0000100 epoch_Time:27786.0min: [2024-01-03 13:32:49,562][model8_pretrain.py][INFO] Epoch:[0/2](188900/4588595) loss:2.940 lr:0.0000100 epoch_Time:27786.0min: [2024-01-03 13:32:49,561][model8_pretrain.py][INFO] Epoch:[0/2](188900/4588595) loss:3.127 lr:0.0000100 epoch_Time:27786.0min: [2024-01-03 13:32:49,562][model8_pretrain.py][INFO] Epoch:[0/2](188900/4588595) loss:2.850 lr:0.0000100 epoch_Time:27786.0min: [2024-01-03 13:32:49,562][model8_pretrain.py][INFO] 
Epoch:[0/2](188900/4588595) loss:2.911 lr:0.0000100 epoch_Time:27786.0min: [2024-01-03 13:32:49,562][model8_pretrain.py][INFO] Epoch:[0/2](188900/4588595) loss:2.605 lr:0.0000100 epoch_Time:27786.0min: [2024-01-03 13:33:26,505][model8_pretrain.py][INFO] Epoch:[0/2](189000/4588595) loss:2.833 lr:0.0000100 epoch_Time:27786.0min: [2024-01-03 13:33:26,505][model8_pretrain.py][INFO] Epoch:[0/2](189000/4588595) loss:2.251 lr:0.0000100 epoch_Time:27786.0min: [2024-01-03 13:33:26,505][model8_pretrain.py][INFO] Epoch:[0/2](189000/4588595) loss:2.885 lr:0.0000100 epoch_Time:27786.0min: [2024-01-03 13:33:26,505][model8_pretrain.py][INFO] Epoch:[0/2](189000/4588595) loss:2.764 lr:0.0000100 epoch_Time:27786.0min: [2024-01-03 13:33:26,505][model8_pretrain.py][INFO] Epoch:[0/2](189000/4588595) loss:2.854 lr:0.0000100 epoch_Time:27786.0min: [2024-01-03 13:33:26,505][model8_pretrain.py][INFO] Epoch:[0/2](189000/4588595) loss:3.096 lr:0.0000100 epoch_Time:27786.0min: [2024-01-03 13:33:26,505][model8_pretrain.py][INFO] Epoch:[0/2](189000/4588595) loss:3.021 lr:0.0000100 epoch_Time:27786.0min: [2024-01-03 13:33:26,505][model8_pretrain.py][INFO] Epoch:[0/2](189000/4588595) loss:3.158 lr:0.0000100 epoch_Time:27786.0min: [2024-01-03 13:34:03,440][model8_pretrain.py][INFO] Epoch:[0/2](189100/4588595) loss:2.610 lr:0.0000100 epoch_Time:27785.0min: [2024-01-03 13:34:03,440][model8_pretrain.py][INFO] Epoch:[0/2](189100/4588595) loss:2.771 lr:0.0000100 epoch_Time:27785.0min: [2024-01-03 13:34:03,440][model8_pretrain.py][INFO] Epoch:[0/2](189100/4588595) loss:2.729 lr:0.0000100 epoch_Time:27785.0min: [2024-01-03 13:34:03,440][model8_pretrain.py][INFO] Epoch:[0/2](189100/4588595) loss:3.076 lr:0.0000100 epoch_Time:27785.0min: [2024-01-03 13:34:03,440][model8_pretrain.py][INFO] Epoch:[0/2](189100/4588595) loss:2.891 lr:0.0000100 epoch_Time:27785.0min: [2024-01-03 13:34:03,440][model8_pretrain.py][INFO] Epoch:[0/2](189100/4588595) loss:2.764 lr:0.0000100 epoch_Time:27785.0min: [2024-01-03 
13:34:03,440][model8_pretrain.py][INFO] Epoch:[0/2](189100/4588595) loss:3.368 lr:0.0000100 epoch_Time:27785.0min: [2024-01-03 13:34:03,441][model8_pretrain.py][INFO] Epoch:[0/2](189100/4588595) loss:2.935 lr:0.0000100 epoch_Time:27785.0min: [2024-01-03 13:34:40,383][model8_pretrain.py][INFO] Epoch:[0/2](189200/4588595) loss:3.018 lr:0.0000100 epoch_Time:27784.0min: [2024-01-03 13:34:40,383][model8_pretrain.py][INFO] Epoch:[0/2](189200/4588595) loss:2.785 lr:0.0000100 epoch_Time:27784.0min: [2024-01-03 13:34:40,383][model8_pretrain.py][INFO] Epoch:[0/2](189200/4588595) loss:2.820 lr:0.0000100 epoch_Time:27784.0min: [2024-01-03 13:34:40,383][model8_pretrain.py][INFO] Epoch:[0/2](189200/4588595) loss:3.297 lr:0.0000100 epoch_Time:27784.0min: [2024-01-03 13:34:40,383][model8_pretrain.py][INFO] Epoch:[0/2](189200/4588595) loss:3.188 lr:0.0000100 epoch_Time:27784.0min: [2024-01-03 13:34:40,384][model8_pretrain.py][INFO] Epoch:[0/2](189200/4588595) loss:3.155 lr:0.0000100 epoch_Time:27784.0min: [2024-01-03 13:34:40,384][model8_pretrain.py][INFO] Epoch:[0/2](189200/4588595) loss:3.224 lr:0.0000100 epoch_Time:27784.0min: [2024-01-03 13:34:40,384][model8_pretrain.py][INFO] Epoch:[0/2](189200/4588595) loss:2.824 lr:0.0000100 epoch_Time:27784.0min: [2024-01-03 13:35:20,777][model8_pretrain.py][INFO] Epoch:[0/2](189300/4588595) loss:3.434 lr:0.0000100 epoch_Time:27784.0min: [2024-01-03 13:35:20,777][model8_pretrain.py][INFO] Epoch:[0/2](189300/4588595) loss:2.605 lr:0.0000100 epoch_Time:27784.0min: [2024-01-03 13:35:20,777][model8_pretrain.py][INFO] Epoch:[0/2](189300/4588595) loss:2.711 lr:0.0000100 epoch_Time:27784.0min: [2024-01-03 13:35:20,777][model8_pretrain.py][INFO] Epoch:[0/2](189300/4588595) loss:2.935 lr:0.0000100 epoch_Time:27784.0min: [2024-01-03 13:35:20,777][model8_pretrain.py][INFO] Epoch:[0/2](189300/4588595) loss:3.089 lr:0.0000100 epoch_Time:27784.0min: [2024-01-03 13:35:20,777][model8_pretrain.py][INFO] Epoch:[0/2](189300/4588595) loss:2.840 lr:0.0000100 
epoch_Time:27784.0min: [2024-01-03 13:35:20,778][model8_pretrain.py][INFO] Epoch:[0/2](189300/4588595) loss:2.934 lr:0.0000100 epoch_Time:27784.0min: [2024-01-03 13:35:20,781][model8_pretrain.py][INFO] Epoch:[0/2](189300/4588595) loss:3.193 lr:0.0000100 epoch_Time:27784.0min: [2024-01-03 13:36:02,938][model8_pretrain.py][INFO] Epoch:[0/2](189400/4588595) loss:2.941 lr:0.0000100 epoch_Time:27785.0min: [2024-01-03 13:36:02,938][model8_pretrain.py][INFO] Epoch:[0/2](189400/4588595) loss:2.673 lr:0.0000100 epoch_Time:27785.0min: [2024-01-03 13:36:02,938][model8_pretrain.py][INFO] Epoch:[0/2](189400/4588595) loss:2.833 lr:0.0000100 epoch_Time:27785.0min: [2024-01-03 13:36:02,938][model8_pretrain.py][INFO] Epoch:[0/2](189400/4588595) loss:3.497 lr:0.0000100 epoch_Time:27785.0min: [2024-01-03 13:36:02,938][model8_pretrain.py][INFO] Epoch:[0/2](189400/4588595) loss:2.602 lr:0.0000100 epoch_Time:27785.0min: [2024-01-03 13:36:02,938][model8_pretrain.py][INFO] Epoch:[0/2](189400/4588595) loss:2.403 lr:0.0000100 epoch_Time:27785.0min: [2024-01-03 13:36:02,938][model8_pretrain.py][INFO] Epoch:[0/2](189400/4588595) loss:2.491 lr:0.0000100 epoch_Time:27785.0min: [2024-01-03 13:36:02,938][model8_pretrain.py][INFO] Epoch:[0/2](189400/4588595) loss:2.565 lr:0.0000100 epoch_Time:27785.0min: [2024-01-03 13:36:39,872][model8_pretrain.py][INFO] Epoch:[0/2](189500/4588595) loss:2.719 lr:0.0000100 epoch_Time:27785.0min: [2024-01-03 13:36:39,872][model8_pretrain.py][INFO] Epoch:[0/2](189500/4588595) loss:2.704 lr:0.0000100 epoch_Time:27785.0min: [2024-01-03 13:36:39,872][model8_pretrain.py][INFO] Epoch:[0/2](189500/4588595) loss:2.969 lr:0.0000100 epoch_Time:27785.0min: [2024-01-03 13:36:39,872][model8_pretrain.py][INFO] Epoch:[0/2](189500/4588595) loss:2.553 lr:0.0000100 epoch_Time:27785.0min: [2024-01-03 13:36:39,872][model8_pretrain.py][INFO] Epoch:[0/2](189500/4588595) loss:2.410 lr:0.0000100 epoch_Time:27785.0min: [2024-01-03 13:36:39,872][model8_pretrain.py][INFO] 
Epoch:[0/2](189500/4588595) loss:3.267 lr:0.0000100 epoch_Time:27785.0min: [2024-01-03 13:36:39,872][model8_pretrain.py][INFO] Epoch:[0/2](189500/4588595) loss:2.871 lr:0.0000100 epoch_Time:27785.0min: [2024-01-03 13:36:39,872][model8_pretrain.py][INFO] Epoch:[0/2](189500/4588595) loss:2.135 lr:0.0000100 epoch_Time:27785.0min: [2024-01-03 13:37:16,826][model8_pretrain.py][INFO] Epoch:[0/2](189600/4588595) loss:2.882 lr:0.0000100 epoch_Time:27783.0min: [2024-01-03 13:37:16,826][model8_pretrain.py][INFO] Epoch:[0/2](189600/4588595) loss:2.811 lr:0.0000100 epoch_Time:27783.0min: [2024-01-03 13:37:16,826][model8_pretrain.py][INFO] Epoch:[0/2](189600/4588595) loss:3.281 lr:0.0000100 epoch_Time:27783.0min: [2024-01-03 13:37:16,826][model8_pretrain.py][INFO] Epoch:[0/2](189600/4588595) loss:2.829 lr:0.0000100 epoch_Time:27783.0min: [2024-01-03 13:37:16,826][model8_pretrain.py][INFO] Epoch:[0/2](189600/4588595) loss:2.928 lr:0.0000100 epoch_Time:27783.0min: [2024-01-03 13:37:16,826][model8_pretrain.py][INFO] Epoch:[0/2](189600/4588595) loss:2.871 lr:0.0000100 epoch_Time:27783.0min: [2024-01-03 13:37:16,826][model8_pretrain.py][INFO] Epoch:[0/2](189600/4588595) loss:3.151 lr:0.0000100 epoch_Time:27783.0min: [2024-01-03 13:37:16,826][model8_pretrain.py][INFO] Epoch:[0/2](189600/4588595) loss:3.118 lr:0.0000100 epoch_Time:27783.0min: [2024-01-03 13:37:53,758][model8_pretrain.py][INFO] Epoch:[0/2](189700/4588595) loss:2.368 lr:0.0000100 epoch_Time:27782.0min: [2024-01-03 13:37:53,759][model8_pretrain.py][INFO] Epoch:[0/2](189700/4588595) loss:3.235 lr:0.0000100 epoch_Time:27782.0min: [2024-01-03 13:37:53,759][model8_pretrain.py][INFO] Epoch:[0/2](189700/4588595) loss:2.645 lr:0.0000100 epoch_Time:27782.0min: [2024-01-03 13:37:53,759][model8_pretrain.py][INFO] Epoch:[0/2](189700/4588595) loss:3.017 lr:0.0000100 epoch_Time:27782.0min: [2024-01-03 13:37:53,759][model8_pretrain.py][INFO] Epoch:[0/2](189700/4588595) loss:2.840 lr:0.0000100 epoch_Time:27782.0min: [2024-01-03 
13:37:53,759][model8_pretrain.py][INFO] Epoch:[0/2](189700/4588595) loss:3.207 lr:0.0000100 epoch_Time:27782.0min: [2024-01-03 13:37:53,759][model8_pretrain.py][INFO] Epoch:[0/2](189700/4588595) loss:3.329 lr:0.0000100 epoch_Time:27782.0min: [2024-01-03 13:37:53,759][model8_pretrain.py][INFO] Epoch:[0/2](189700/4588595) loss:3.297 lr:0.0000100 epoch_Time:27782.0min: [2024-01-03 13:38:30,701][model8_pretrain.py][INFO] Epoch:[0/2](189800/4588595) loss:2.461 lr:0.0000100 epoch_Time:27782.0min: [2024-01-03 13:38:30,701][model8_pretrain.py][INFO] Epoch:[0/2](189800/4588595) loss:2.871 lr:0.0000100 epoch_Time:27781.0min: [2024-01-03 13:38:30,701][model8_pretrain.py][INFO] Epoch:[0/2](189800/4588595) loss:2.466 lr:0.0000100 epoch_Time:27782.0min: [2024-01-03 13:38:30,701][model8_pretrain.py][INFO] Epoch:[0/2](189800/4588595) loss:2.445 lr:0.0000100 epoch_Time:27782.0min: [2024-01-03 13:38:30,701][model8_pretrain.py][INFO] Epoch:[0/2](189800/4588595) loss:3.017 lr:0.0000100 epoch_Time:27782.0min: [2024-01-03 13:38:30,701][model8_pretrain.py][INFO] Epoch:[0/2](189800/4588595) loss:3.114 lr:0.0000100 epoch_Time:27781.0min: [2024-01-03 13:38:30,701][model8_pretrain.py][INFO] Epoch:[0/2](189800/4588595) loss:3.000 lr:0.0000100 epoch_Time:27781.0min: [2024-01-03 13:38:30,702][model8_pretrain.py][INFO] Epoch:[0/2](189800/4588595) loss:2.937 lr:0.0000100 epoch_Time:27782.0min: [2024-01-03 13:39:07,642][model8_pretrain.py][INFO] Epoch:[0/2](189900/4588595) loss:3.027 lr:0.0000100 epoch_Time:27780.0min: [2024-01-03 13:39:07,642][model8_pretrain.py][INFO] Epoch:[0/2](189900/4588595) loss:3.013 lr:0.0000100 epoch_Time:27780.0min: [2024-01-03 13:39:07,642][model8_pretrain.py][INFO] Epoch:[0/2](189900/4588595) loss:3.567 lr:0.0000100 epoch_Time:27780.0min: [2024-01-03 13:39:07,642][model8_pretrain.py][INFO] Epoch:[0/2](189900/4588595) loss:3.051 lr:0.0000100 epoch_Time:27780.0min: [2024-01-03 13:39:07,642][model8_pretrain.py][INFO] Epoch:[0/2](189900/4588595) loss:2.973 lr:0.0000100 
epoch_Time:27780.0min: [2024-01-03 13:39:07,642][model8_pretrain.py][INFO] Epoch:[0/2](189900/4588595) loss:2.819 lr:0.0000100 epoch_Time:27780.0min: [2024-01-03 13:39:07,643][model8_pretrain.py][INFO] Epoch:[0/2](189900/4588595) loss:2.974 lr:0.0000100 epoch_Time:27780.0min: [2024-01-03 13:39:07,643][model8_pretrain.py][INFO] Epoch:[0/2](189900/4588595) loss:2.857 lr:0.0000100 epoch_Time:27780.0min: [2024-01-03 13:39:44,568][model8_pretrain.py][INFO] Epoch:[0/2](190000/4588595) loss:3.225 lr:0.0000100 epoch_Time:27780.0min: [2024-01-03 13:39:44,568][model8_pretrain.py][INFO] Epoch:[0/2](190000/4588595) loss:2.035 lr:0.0000100 epoch_Time:27780.0min: [2024-01-03 13:39:44,568][model8_pretrain.py][INFO] Epoch:[0/2](190000/4588595) loss:3.070 lr:0.0000100 epoch_Time:27780.0min: [2024-01-03 13:39:44,568][model8_pretrain.py][INFO] Epoch:[0/2](190000/4588595) loss:3.108 lr:0.0000100 epoch_Time:27780.0min: [2024-01-03 13:39:44,568][model8_pretrain.py][INFO] Epoch:[0/2](190000/4588595) loss:2.661 lr:0.0000100 epoch_Time:27780.0min: [2024-01-03 13:39:44,568][model8_pretrain.py][INFO] Epoch:[0/2](190000/4588595) loss:2.839 lr:0.0000100 epoch_Time:27780.0min: [2024-01-03 13:39:44,568][model8_pretrain.py][INFO] Epoch:[0/2](190000/4588595) loss:3.051 lr:0.0000100 epoch_Time:27780.0min: [2024-01-03 13:39:44,568][model8_pretrain.py][INFO] Epoch:[0/2](190000/4588595) loss:2.815 lr:0.0000100 epoch_Time:27780.0min: [2024-01-03 13:40:21,509][model8_pretrain.py][INFO] Epoch:[0/2](190100/4588595) loss:3.077 lr:0.0000100 epoch_Time:27778.0min: [2024-01-03 13:40:21,509][model8_pretrain.py][INFO] Epoch:[0/2](190100/4588595) loss:3.280 lr:0.0000100 epoch_Time:27778.0min: [2024-01-03 13:40:21,509][model8_pretrain.py][INFO] Epoch:[0/2](190100/4588595) loss:3.577 lr:0.0000100 epoch_Time:27778.0min: [2024-01-03 13:40:21,509][model8_pretrain.py][INFO] Epoch:[0/2](190100/4588595) loss:3.205 lr:0.0000100 epoch_Time:27778.0min: [2024-01-03 13:40:21,509][model8_pretrain.py][INFO] 
Epoch:[0/2](190100/4588595) loss:3.027 lr:0.0000100 epoch_Time:27778.0min: [2024-01-03 13:40:21,509][model8_pretrain.py][INFO] Epoch:[0/2](190100/4588595) loss:2.914 lr:0.0000100 epoch_Time:27778.0min: [2024-01-03 13:40:21,509][model8_pretrain.py][INFO] Epoch:[0/2](190100/4588595) loss:2.659 lr:0.0000100 epoch_Time:27778.0min: [2024-01-03 13:40:21,510][model8_pretrain.py][INFO] Epoch:[0/2](190100/4588595) loss:3.119 lr:0.0000100 epoch_Time:27778.0min: [2024-01-03 13:41:05,490][model8_pretrain.py][INFO] Epoch:[0/2](190200/4588595) loss:2.277 lr:0.0000100 epoch_Time:27780.0min: [2024-01-03 13:41:05,490][model8_pretrain.py][INFO] Epoch:[0/2](190200/4588595) loss:3.127 lr:0.0000100 epoch_Time:27780.0min: [2024-01-03 13:41:05,490][model8_pretrain.py][INFO] Epoch:[0/2](190200/4588595) loss:3.005 lr:0.0000100 epoch_Time:27780.0min: [2024-01-03 13:41:05,490][model8_pretrain.py][INFO] Epoch:[0/2](190200/4588595) loss:3.196 lr:0.0000100 epoch_Time:27780.0min: [2024-01-03 13:41:05,490][model8_pretrain.py][INFO] Epoch:[0/2](190200/4588595) loss:3.046 lr:0.0000100 epoch_Time:27780.0min: [2024-01-03 13:41:05,490][model8_pretrain.py][INFO] Epoch:[0/2](190200/4588595) loss:2.956 lr:0.0000100 epoch_Time:27780.0min: [2024-01-03 13:41:05,490][model8_pretrain.py][INFO] Epoch:[0/2](190200/4588595) loss:2.974 lr:0.0000100 epoch_Time:27780.0min: [2024-01-03 13:41:05,490][model8_pretrain.py][INFO] Epoch:[0/2](190200/4588595) loss:2.837 lr:0.0000100 epoch_Time:27780.0min: [2024-01-03 13:41:42,418][model8_pretrain.py][INFO] Epoch:[0/2](190300/4588595) loss:3.502 lr:0.0000100 epoch_Time:27779.0min: [2024-01-03 13:41:42,418][model8_pretrain.py][INFO] Epoch:[0/2](190300/4588595) loss:3.029 lr:0.0000100 epoch_Time:27779.0min: [2024-01-03 13:41:42,418][model8_pretrain.py][INFO] Epoch:[0/2](190300/4588595) loss:2.589 lr:0.0000100 epoch_Time:27779.0min: [2024-01-03 13:41:42,418][model8_pretrain.py][INFO] Epoch:[0/2](190300/4588595) loss:2.720 lr:0.0000100 epoch_Time:27779.0min: [2024-01-03 
13:41:42,418][model8_pretrain.py][INFO] Epoch:[0/2](190300/4588595) loss:3.312 lr:0.0000100 epoch_Time:27779.0min: [2024-01-03 13:41:42,418][model8_pretrain.py][INFO] Epoch:[0/2](190300/4588595) loss:2.923 lr:0.0000100 epoch_Time:27779.0min: [2024-01-03 13:41:42,418][model8_pretrain.py][INFO] Epoch:[0/2](190300/4588595) loss:3.265 lr:0.0000100 epoch_Time:27779.0min: [2024-01-03 13:41:42,419][model8_pretrain.py][INFO] Epoch:[0/2](190300/4588595) loss:2.737 lr:0.0000100 epoch_Time:27779.0min: [2024-01-03 13:42:19,362][model8_pretrain.py][INFO] Epoch:[0/2](190400/4588595) loss:2.746 lr:0.0000100 epoch_Time:27778.0min: [2024-01-03 13:42:19,362][model8_pretrain.py][INFO] Epoch:[0/2](190400/4588595) loss:2.737 lr:0.0000100 epoch_Time:27778.0min: [2024-01-03 13:42:19,362][model8_pretrain.py][INFO] Epoch:[0/2](190400/4588595) loss:2.570 lr:0.0000100 epoch_Time:27778.0min: [2024-01-03 13:42:19,362][model8_pretrain.py][INFO] Epoch:[0/2](190400/4588595) loss:2.850 lr:0.0000100 epoch_Time:27778.0min: [2024-01-03 13:42:19,362][model8_pretrain.py][INFO] Epoch:[0/2](190400/4588595) loss:2.686 lr:0.0000100 epoch_Time:27778.0min: [2024-01-03 13:42:19,362][model8_pretrain.py][INFO] Epoch:[0/2](190400/4588595) loss:2.787 lr:0.0000100 epoch_Time:27778.0min: [2024-01-03 13:42:19,362][model8_pretrain.py][INFO] Epoch:[0/2](190400/4588595) loss:2.647 lr:0.0000100 epoch_Time:27778.0min: [2024-01-03 13:42:19,362][model8_pretrain.py][INFO] Epoch:[0/2](190400/4588595) loss:3.027 lr:0.0000100 epoch_Time:27778.0min: [2024-01-03 13:42:56,300][model8_pretrain.py][INFO] Epoch:[0/2](190500/4588595) loss:3.308 lr:0.0000100 epoch_Time:27777.0min: [2024-01-03 13:42:56,300][model8_pretrain.py][INFO] Epoch:[0/2](190500/4588595) loss:2.560 lr:0.0000100 epoch_Time:27777.0min: [2024-01-03 13:42:56,300][model8_pretrain.py][INFO] Epoch:[0/2](190500/4588595) loss:2.774 lr:0.0000100 epoch_Time:27777.0min: [2024-01-03 13:42:56,300][model8_pretrain.py][INFO] Epoch:[0/2](190500/4588595) loss:2.978 lr:0.0000100 
epoch_Time:27777.0min: [2024-01-03 13:42:56,300][model8_pretrain.py][INFO] Epoch:[0/2](190500/4588595) loss:2.854 lr:0.0000100 epoch_Time:27777.0min: [2024-01-03 13:42:56,300][model8_pretrain.py][INFO] Epoch:[0/2](190500/4588595) loss:2.903 lr:0.0000100 epoch_Time:27777.0min: [2024-01-03 13:42:56,300][model8_pretrain.py][INFO] Epoch:[0/2](190500/4588595) loss:3.064 lr:0.0000100 epoch_Time:27777.0min: [2024-01-03 13:42:56,300][model8_pretrain.py][INFO] Epoch:[0/2](190500/4588595) loss:3.159 lr:0.0000100 epoch_Time:27777.0min: [2024-01-03 13:43:33,245][model8_pretrain.py][INFO] Epoch:[0/2](190600/4588595) loss:3.079 lr:0.0000100 epoch_Time:27776.0min: [2024-01-03 13:43:33,245][model8_pretrain.py][INFO] Epoch:[0/2](190600/4588595) loss:2.683 lr:0.0000100 epoch_Time:27776.0min: [2024-01-03 13:43:33,245][model8_pretrain.py][INFO] Epoch:[0/2](190600/4588595) loss:2.452 lr:0.0000100 epoch_Time:27776.0min: [2024-01-03 13:43:33,245][model8_pretrain.py][INFO] Epoch:[0/2](190600/4588595) loss:3.099 lr:0.0000100 epoch_Time:27776.0min: [2024-01-03 13:43:33,245][model8_pretrain.py][INFO] Epoch:[0/2](190600/4588595) loss:2.616 lr:0.0000100 epoch_Time:27776.0min: [2024-01-03 13:43:33,245][model8_pretrain.py][INFO] Epoch:[0/2](190600/4588595) loss:2.531 lr:0.0000100 epoch_Time:27776.0min: [2024-01-03 13:43:33,245][model8_pretrain.py][INFO] Epoch:[0/2](190600/4588595) loss:2.306 lr:0.0000100 epoch_Time:27776.0min: [2024-01-03 13:43:33,245][model8_pretrain.py][INFO] Epoch:[0/2](190600/4588595) loss:3.036 lr:0.0000100 epoch_Time:27776.0min: [2024-01-03 13:44:10,192][model8_pretrain.py][INFO] Epoch:[0/2](190700/4588595) loss:3.273 lr:0.0000100 epoch_Time:27775.0min: [2024-01-03 13:44:10,192][model8_pretrain.py][INFO] Epoch:[0/2](190700/4588595) loss:2.629 lr:0.0000100 epoch_Time:27775.0min: [2024-01-03 13:44:10,192][model8_pretrain.py][INFO] Epoch:[0/2](190700/4588595) loss:1.619 lr:0.0000100 epoch_Time:27775.0min: [2024-01-03 13:44:10,192][model8_pretrain.py][INFO] 
Epoch:[0/2](190700/4588595) loss:2.998 lr:0.0000100 epoch_Time:27775.0min: [2024-01-03 13:44:10,192][model8_pretrain.py][INFO] Epoch:[0/2](190700/4588595) loss:3.120 lr:0.0000100 epoch_Time:27775.0min: [2024-01-03 13:44:10,192][model8_pretrain.py][INFO] Epoch:[0/2](190700/4588595) loss:2.567 lr:0.0000100 epoch_Time:27775.0min: [2024-01-03 13:44:10,192][model8_pretrain.py][INFO] Epoch:[0/2](190700/4588595) loss:2.998 lr:0.0000100 epoch_Time:27775.0min: [2024-01-03 13:44:10,192][model8_pretrain.py][INFO] Epoch:[0/2](190700/4588595) loss:2.498 lr:0.0000100 epoch_Time:27775.0min: [2024-01-03 13:44:47,129][model8_pretrain.py][INFO] Epoch:[0/2](190800/4588595) loss:2.948 lr:0.0000100 epoch_Time:27774.0min: [2024-01-03 13:44:47,130][model8_pretrain.py][INFO] Epoch:[0/2](190800/4588595) loss:3.090 lr:0.0000100 epoch_Time:27775.0min: [2024-01-03 13:44:47,130][model8_pretrain.py][INFO] Epoch:[0/2](190800/4588595) loss:3.215 lr:0.0000100 epoch_Time:27774.0min: [2024-01-03 13:44:47,130][model8_pretrain.py][INFO] Epoch:[0/2](190800/4588595) loss:2.942 lr:0.0000100 epoch_Time:27774.0min: [2024-01-03 13:44:47,130][model8_pretrain.py][INFO] Epoch:[0/2](190800/4588595) loss:3.032 lr:0.0000100 epoch_Time:27775.0min: [2024-01-03 13:44:47,130][model8_pretrain.py][INFO] Epoch:[0/2](190800/4588595) loss:2.926 lr:0.0000100 epoch_Time:27774.0min: [2024-01-03 13:44:47,130][model8_pretrain.py][INFO] Epoch:[0/2](190800/4588595) loss:3.023 lr:0.0000100 epoch_Time:27774.0min: [2024-01-03 13:44:47,130][model8_pretrain.py][INFO] Epoch:[0/2](190800/4588595) loss:3.434 lr:0.0000100 epoch_Time:27774.0min: [2024-01-03 13:45:24,075][model8_pretrain.py][INFO] Epoch:[0/2](190900/4588595) loss:3.457 lr:0.0000100 epoch_Time:27773.0min: [2024-01-03 13:45:24,075][model8_pretrain.py][INFO] Epoch:[0/2](190900/4588595) loss:2.686 lr:0.0000100 epoch_Time:27773.0min: [2024-01-03 13:45:24,075][model8_pretrain.py][INFO] Epoch:[0/2](190900/4588595) loss:2.955 lr:0.0000100 epoch_Time:27773.0min: [2024-01-03 
13:45:24,075][model8_pretrain.py][INFO] Epoch:[0/2](190900/4588595) loss:2.969 lr:0.0000100 epoch_Time:27773.0min: [2024-01-03 13:45:24,075][model8_pretrain.py][INFO] Epoch:[0/2](190900/4588595) loss:2.853 lr:0.0000100 epoch_Time:27773.0min: [2024-01-03 13:45:24,075][model8_pretrain.py][INFO] Epoch:[0/2](190900/4588595) loss:2.649 lr:0.0000100 epoch_Time:27773.0min: [2024-01-03 13:45:24,075][model8_pretrain.py][INFO] Epoch:[0/2](190900/4588595) loss:3.108 lr:0.0000100 epoch_Time:27773.0min: [2024-01-03 13:45:24,075][model8_pretrain.py][INFO] Epoch:[0/2](190900/4588595) loss:2.342 lr:0.0000100 epoch_Time:27773.0min: [2024-01-03 13:46:08,049][model8_pretrain.py][INFO] Epoch:[0/2](191000/4588595) loss:3.001 lr:0.0000100 epoch_Time:27775.0min: [2024-01-03 13:46:08,049][model8_pretrain.py][INFO] Epoch:[0/2](191000/4588595) loss:2.961 lr:0.0000100 epoch_Time:27775.0min: [2024-01-03 13:46:08,049][model8_pretrain.py][INFO] Epoch:[0/2](191000/4588595) loss:3.092 lr:0.0000100 epoch_Time:27775.0min: [2024-01-03 13:46:08,049][model8_pretrain.py][INFO] Epoch:[0/2](191000/4588595) loss:2.731 lr:0.0000100 epoch_Time:27775.0min: [2024-01-03 13:46:08,049][model8_pretrain.py][INFO] Epoch:[0/2](191000/4588595) loss:2.785 lr:0.0000100 epoch_Time:27775.0min: [2024-01-03 13:46:08,050][model8_pretrain.py][INFO] Epoch:[0/2](191000/4588595) loss:2.609 lr:0.0000100 epoch_Time:27775.0min: [2024-01-03 13:46:08,050][model8_pretrain.py][INFO] Epoch:[0/2](191000/4588595) loss:2.852 lr:0.0000100 epoch_Time:27775.0min: [2024-01-03 13:46:08,050][model8_pretrain.py][INFO] Epoch:[0/2](191000/4588595) loss:2.744 lr:0.0000100 epoch_Time:27775.0min: [2024-01-03 13:46:44,995][model8_pretrain.py][INFO] Epoch:[0/2](191100/4588595) loss:2.841 lr:0.0000100 epoch_Time:27774.0min: [2024-01-03 13:46:44,995][model8_pretrain.py][INFO] Epoch:[0/2](191100/4588595) loss:2.203 lr:0.0000100 epoch_Time:27774.0min: [2024-01-03 13:46:44,995][model8_pretrain.py][INFO] Epoch:[0/2](191100/4588595) loss:2.537 lr:0.0000100 
epoch_Time:27774.0min: [2024-01-03 13:46:44,996][model8_pretrain.py][INFO] Epoch:[0/2](191100/4588595) loss:3.152 lr:0.0000100 epoch_Time:27774.0min: [2024-01-03 13:46:44,996][model8_pretrain.py][INFO] Epoch:[0/2](191100/4588595) loss:2.541 lr:0.0000100 epoch_Time:27774.0min: [2024-01-03 13:46:44,996][model8_pretrain.py][INFO] Epoch:[0/2](191100/4588595) loss:3.201 lr:0.0000100 epoch_Time:27774.0min: [2024-01-03 13:46:44,996][model8_pretrain.py][INFO] Epoch:[0/2](191100/4588595) loss:3.254 lr:0.0000100 epoch_Time:27774.0min: [2024-01-03 13:46:44,996][model8_pretrain.py][INFO] Epoch:[0/2](191100/4588595) loss:3.044 lr:0.0000100 epoch_Time:27774.0min: [2024-01-03 13:47:21,948][model8_pretrain.py][INFO] Epoch:[0/2](191200/4588595) loss:3.473 lr:0.0000100 epoch_Time:27773.0min: [2024-01-03 13:47:21,948][model8_pretrain.py][INFO] Epoch:[0/2](191200/4588595) loss:3.202 lr:0.0000100 epoch_Time:27773.0min: [2024-01-03 13:47:21,948][model8_pretrain.py][INFO] Epoch:[0/2](191200/4588595) loss:2.692 lr:0.0000100 epoch_Time:27773.0min: [2024-01-03 13:47:21,948][model8_pretrain.py][INFO] Epoch:[0/2](191200/4588595) loss:3.443 lr:0.0000100 epoch_Time:27773.0min: [2024-01-03 13:47:21,948][model8_pretrain.py][INFO] Epoch:[0/2](191200/4588595) loss:2.956 lr:0.0000100 epoch_Time:27773.0min: [2024-01-03 13:47:21,948][model8_pretrain.py][INFO] Epoch:[0/2](191200/4588595) loss:2.874 lr:0.0000100 epoch_Time:27773.0min: [2024-01-03 13:47:21,948][model8_pretrain.py][INFO] Epoch:[0/2](191200/4588595) loss:2.950 lr:0.0000100 epoch_Time:27773.0min: [2024-01-03 13:47:21,948][model8_pretrain.py][INFO] Epoch:[0/2](191200/4588595) loss:3.001 lr:0.0000100 epoch_Time:27773.0min: [2024-01-03 13:47:58,890][model8_pretrain.py][INFO] Epoch:[0/2](191300/4588595) loss:2.875 lr:0.0000100 epoch_Time:27771.0min: [2024-01-03 13:47:58,890][model8_pretrain.py][INFO] Epoch:[0/2](191300/4588595) loss:2.777 lr:0.0000100 epoch_Time:27771.0min: [2024-01-03 13:47:58,890][model8_pretrain.py][INFO] 
Epoch:[0/2](191300/4588595) loss:3.108 lr:0.0000100 epoch_Time:27771.0min: [2024-01-03 13:47:58,890][model8_pretrain.py][INFO] Epoch:[0/2](191300/4588595) loss:3.205 lr:0.0000100 epoch_Time:27771.0min: [2024-01-03 13:47:58,890][model8_pretrain.py][INFO] Epoch:[0/2](191300/4588595) loss:2.940 lr:0.0000100 epoch_Time:27771.0min: [2024-01-03 13:47:58,890][model8_pretrain.py][INFO] Epoch:[0/2](191300/4588595) loss:3.454 lr:0.0000100 epoch_Time:27771.0min: [2024-01-03 13:47:58,890][model8_pretrain.py][INFO] Epoch:[0/2](191300/4588595) loss:2.398 lr:0.0000100 epoch_Time:27771.0min: [2024-01-03 13:47:58,890][model8_pretrain.py][INFO] Epoch:[0/2](191300/4588595) loss:2.530 lr:0.0000100 epoch_Time:27771.0min: [2024-01-03 13:48:35,839][model8_pretrain.py][INFO] Epoch:[0/2](191400/4588595) loss:2.805 lr:0.0000100 epoch_Time:27771.0min: [2024-01-03 13:48:35,839][model8_pretrain.py][INFO] Epoch:[0/2](191400/4588595) loss:3.010 lr:0.0000100 epoch_Time:27771.0min: [2024-01-03 13:48:35,839][model8_pretrain.py][INFO] Epoch:[0/2](191400/4588595) loss:3.165 lr:0.0000100 epoch_Time:27771.0min: [2024-01-03 13:48:35,839][model8_pretrain.py][INFO] Epoch:[0/2](191400/4588595) loss:2.426 lr:0.0000100 epoch_Time:27771.0min: [2024-01-03 13:48:35,839][model8_pretrain.py][INFO] Epoch:[0/2](191400/4588595) loss:3.127 lr:0.0000100 epoch_Time:27771.0min: [2024-01-03 13:48:35,839][model8_pretrain.py][INFO] Epoch:[0/2](191400/4588595) loss:2.822 lr:0.0000100 epoch_Time:27771.0min: [2024-01-03 13:48:35,839][model8_pretrain.py][INFO] Epoch:[0/2](191400/4588595) loss:3.061 lr:0.0000100 epoch_Time:27771.0min: [2024-01-03 13:48:35,839][model8_pretrain.py][INFO] Epoch:[0/2](191400/4588595) loss:3.613 lr:0.0000100 epoch_Time:27771.0min: [2024-01-03 13:49:12,797][model8_pretrain.py][INFO] Epoch:[0/2](191500/4588595) loss:3.104 lr:0.0000100 epoch_Time:27770.0min: [2024-01-03 13:49:12,797][model8_pretrain.py][INFO] Epoch:[0/2](191500/4588595) loss:3.084 lr:0.0000100 epoch_Time:27770.0min: [2024-01-03 
13:49:12,797][model8_pretrain.py][INFO] Epoch:[0/2](191500/4588595) loss:2.953 lr:0.0000100 epoch_Time:27770.0min: [2024-01-03 13:49:12,797][model8_pretrain.py][INFO] Epoch:[0/2](191500/4588595) loss:3.099 lr:0.0000100 epoch_Time:27770.0min: [2024-01-03 13:49:12,797][model8_pretrain.py][INFO] Epoch:[0/2](191500/4588595) loss:3.247 lr:0.0000100 epoch_Time:27770.0min: [2024-01-03 13:49:12,797][model8_pretrain.py][INFO] Epoch:[0/2](191500/4588595) loss:3.130 lr:0.0000100 epoch_Time:27770.0min: [2024-01-03 13:49:12,798][model8_pretrain.py][INFO] Epoch:[0/2](191500/4588595) loss:2.767 lr:0.0000100 epoch_Time:27770.0min: [2024-01-03 13:49:12,798][model8_pretrain.py][INFO] Epoch:[0/2](191500/4588595) loss:2.809 lr:0.0000100 epoch_Time:27770.0min: [2024-01-03 13:49:49,748][model8_pretrain.py][INFO] Epoch:[0/2](191600/4588595) loss:3.215 lr:0.0000100 epoch_Time:27768.0min: [2024-01-03 13:49:49,748][model8_pretrain.py][INFO] Epoch:[0/2](191600/4588595) loss:2.958 lr:0.0000100 epoch_Time:27768.0min: [2024-01-03 13:49:49,748][model8_pretrain.py][INFO] Epoch:[0/2](191600/4588595) loss:2.624 lr:0.0000100 epoch_Time:27768.0min: [2024-01-03 13:49:49,748][model8_pretrain.py][INFO] Epoch:[0/2](191600/4588595) loss:2.721 lr:0.0000100 epoch_Time:27768.0min: [2024-01-03 13:49:49,748][model8_pretrain.py][INFO] Epoch:[0/2](191600/4588595) loss:3.247 lr:0.0000100 epoch_Time:27768.0min: [2024-01-03 13:49:49,748][model8_pretrain.py][INFO] Epoch:[0/2](191600/4588595) loss:3.059 lr:0.0000100 epoch_Time:27768.0min: [2024-01-03 13:49:49,748][model8_pretrain.py][INFO] Epoch:[0/2](191600/4588595) loss:2.555 lr:0.0000100 epoch_Time:27768.0min: [2024-01-03 13:49:49,748][model8_pretrain.py][INFO] Epoch:[0/2](191600/4588595) loss:3.059 lr:0.0000100 epoch_Time:27768.0min: [2024-01-03 13:50:26,671][model8_pretrain.py][INFO] Epoch:[0/2](191700/4588595) loss:2.903 lr:0.0000100 epoch_Time:27768.0min: [2024-01-03 13:50:26,671][model8_pretrain.py][INFO] Epoch:[0/2](191700/4588595) loss:3.036 lr:0.0000100 
epoch_Time:27768.0min: [2024-01-03 13:50:26,671][model8_pretrain.py][INFO] Epoch:[0/2](191700/4588595) loss:2.793 lr:0.0000100 epoch_Time:27768.0min: [2024-01-03 13:50:26,671][model8_pretrain.py][INFO] Epoch:[0/2](191700/4588595) loss:3.080 lr:0.0000100 epoch_Time:27768.0min: [2024-01-03 13:50:26,671][model8_pretrain.py][INFO] Epoch:[0/2](191700/4588595) loss:3.797 lr:0.0000100 epoch_Time:27768.0min: [2024-01-03 13:50:26,671][model8_pretrain.py][INFO] Epoch:[0/2](191700/4588595) loss:3.286 lr:0.0000100 epoch_Time:27768.0min: [2024-01-03 13:50:26,672][model8_pretrain.py][INFO] Epoch:[0/2](191700/4588595) loss:3.010 lr:0.0000100 epoch_Time:27768.0min: [2024-01-03 13:50:26,672][model8_pretrain.py][INFO] Epoch:[0/2](191700/4588595) loss:2.548 lr:0.0000100 epoch_Time:27768.0min: [2024-01-03 13:51:10,633][model8_pretrain.py][INFO] Epoch:[0/2](191800/4588595) loss:2.729 lr:0.0000100 epoch_Time:27769.0min: [2024-01-03 13:51:10,633][model8_pretrain.py][INFO] Epoch:[0/2](191800/4588595) loss:2.655 lr:0.0000100 epoch_Time:27769.0min: [2024-01-03 13:51:10,633][model8_pretrain.py][INFO] Epoch:[0/2](191800/4588595) loss:2.881 lr:0.0000100 epoch_Time:27769.0min: [2024-01-03 13:51:10,633][model8_pretrain.py][INFO] Epoch:[0/2](191800/4588595) loss:2.997 lr:0.0000100 epoch_Time:27769.0min: [2024-01-03 13:51:10,633][model8_pretrain.py][INFO] Epoch:[0/2](191800/4588595) loss:2.731 lr:0.0000100 epoch_Time:27769.0min: [2024-01-03 13:51:10,633][model8_pretrain.py][INFO] Epoch:[0/2](191800/4588595) loss:2.920 lr:0.0000100 epoch_Time:27769.0min: [2024-01-03 13:51:10,633][model8_pretrain.py][INFO] Epoch:[0/2](191800/4588595) loss:2.772 lr:0.0000100 epoch_Time:27769.0min: [2024-01-03 13:51:10,633][model8_pretrain.py][INFO] Epoch:[0/2](191800/4588595) loss:3.196 lr:0.0000100 epoch_Time:27769.0min: [2024-01-03 13:51:47,567][model8_pretrain.py][INFO] Epoch:[0/2](191900/4588595) loss:2.687 lr:0.0000100 epoch_Time:27769.0min: [2024-01-03 13:51:47,567][model8_pretrain.py][INFO] 
Epoch:[0/2](191900/4588595) loss:3.353 lr:0.0000100 epoch_Time:27769.0min: [2024-01-03 13:51:47,567][model8_pretrain.py][INFO] Epoch:[0/2](191900/4588595) loss:3.475 lr:0.0000100 epoch_Time:27769.0min: [2024-01-03 13:51:47,568][model8_pretrain.py][INFO] Epoch:[0/2](191900/4588595) loss:2.577 lr:0.0000100 epoch_Time:27768.0min: [2024-01-03 13:51:47,568][model8_pretrain.py][INFO] Epoch:[0/2](191900/4588595) loss:3.244 lr:0.0000100 epoch_Time:27769.0min: [2024-01-03 13:51:47,568][model8_pretrain.py][INFO] Epoch:[0/2](191900/4588595) loss:3.360 lr:0.0000100 epoch_Time:27769.0min: [2024-01-03 13:51:47,568][model8_pretrain.py][INFO] Epoch:[0/2](191900/4588595) loss:2.697 lr:0.0000100 epoch_Time:27769.0min: [2024-01-03 13:51:47,568][model8_pretrain.py][INFO] Epoch:[0/2](191900/4588595) loss:3.062 lr:0.0000100 epoch_Time:27769.0min: [2024-01-03 13:52:24,509][model8_pretrain.py][INFO] Epoch:[0/2](192000/4588595) loss:2.727 lr:0.0000100 epoch_Time:27768.0min: [2024-01-03 13:52:24,509][model8_pretrain.py][INFO] Epoch:[0/2](192000/4588595) loss:2.897 lr:0.0000100 epoch_Time:27768.0min: [2024-01-03 13:52:24,509][model8_pretrain.py][INFO] Epoch:[0/2](192000/4588595) loss:3.512 lr:0.0000100 epoch_Time:27768.0min: [2024-01-03 13:52:24,509][model8_pretrain.py][INFO] Epoch:[0/2](192000/4588595) loss:3.059 lr:0.0000100 epoch_Time:27768.0min: [2024-01-03 13:52:24,509][model8_pretrain.py][INFO] Epoch:[0/2](192000/4588595) loss:3.074 lr:0.0000100 epoch_Time:27768.0min: [2024-01-03 13:52:24,509][model8_pretrain.py][INFO] Epoch:[0/2](192000/4588595) loss:2.885 lr:0.0000100 epoch_Time:27768.0min: [2024-01-03 13:52:24,509][model8_pretrain.py][INFO] Epoch:[0/2](192000/4588595) loss:3.433 lr:0.0000100 epoch_Time:27768.0min: [2024-01-03 13:52:24,509][model8_pretrain.py][INFO] Epoch:[0/2](192000/4588595) loss:2.682 lr:0.0000100 epoch_Time:27768.0min: [2024-01-03 13:53:01,448][model8_pretrain.py][INFO] Epoch:[0/2](192100/4588595) loss:2.996 lr:0.0000100 epoch_Time:27766.0min: [2024-01-03 
13:53:01,448][model8_pretrain.py][INFO] Epoch:[0/2](192100/4588595) loss:3.039 lr:0.0000100 epoch_Time:27766.0min: [2024-01-03 13:53:01,448][model8_pretrain.py][INFO] Epoch:[0/2](192100/4588595) loss:2.930 lr:0.0000100 epoch_Time:27766.0min: [2024-01-03 13:53:01,448][model8_pretrain.py][INFO] Epoch:[0/2](192100/4588595) loss:3.183 lr:0.0000100 epoch_Time:27766.0min: [2024-01-03 13:53:01,448][model8_pretrain.py][INFO] Epoch:[0/2](192100/4588595) loss:2.461 lr:0.0000100 epoch_Time:27766.0min: [2024-01-03 13:53:01,448][model8_pretrain.py][INFO] Epoch:[0/2](192100/4588595) loss:2.489 lr:0.0000100 epoch_Time:27766.0min: [2024-01-03 13:53:01,448][model8_pretrain.py][INFO] Epoch:[0/2](192100/4588595) loss:3.006 lr:0.0000100 epoch_Time:27766.0min: [2024-01-03 13:53:01,448][model8_pretrain.py][INFO] Epoch:[0/2](192100/4588595) loss:3.078 lr:0.0000100 epoch_Time:27766.0min: [2024-01-03 13:53:38,397][model8_pretrain.py][INFO] Epoch:[0/2](192200/4588595) loss:3.228 lr:0.0000100 epoch_Time:27766.0min: [2024-01-03 13:53:38,397][model8_pretrain.py][INFO] Epoch:[0/2](192200/4588595) loss:2.746 lr:0.0000100 epoch_Time:27766.0min: [2024-01-03 13:53:38,397][model8_pretrain.py][INFO] Epoch:[0/2](192200/4588595) loss:2.820 lr:0.0000100 epoch_Time:27766.0min: [2024-01-03 13:53:38,397][model8_pretrain.py][INFO] Epoch:[0/2](192200/4588595) loss:2.839 lr:0.0000100 epoch_Time:27766.0min: [2024-01-03 13:53:38,397][model8_pretrain.py][INFO] Epoch:[0/2](192200/4588595) loss:2.674 lr:0.0000100 epoch_Time:27766.0min: [2024-01-03 13:53:38,397][model8_pretrain.py][INFO] Epoch:[0/2](192200/4588595) loss:2.260 lr:0.0000100 epoch_Time:27766.0min: [2024-01-03 13:53:38,397][model8_pretrain.py][INFO] Epoch:[0/2](192200/4588595) loss:2.987 lr:0.0000100 epoch_Time:27766.0min: [2024-01-03 13:53:38,397][model8_pretrain.py][INFO] Epoch:[0/2](192200/4588595) loss:3.019 lr:0.0000100 epoch_Time:27766.0min: [2024-01-03 13:54:15,337][model8_pretrain.py][INFO] Epoch:[0/2](192300/4588595) loss:3.350 lr:0.0000100 
epoch_Time:27764.0min: [2024-01-03 13:54:15,337][model8_pretrain.py][INFO] Epoch:[0/2](192300/4588595) loss:2.704 lr:0.0000100 epoch_Time:27764.0min: [2024-01-03 13:54:15,337][model8_pretrain.py][INFO] Epoch:[0/2](192300/4588595) loss:3.175 lr:0.0000100 epoch_Time:27764.0min: [2024-01-03 13:54:15,337][model8_pretrain.py][INFO] Epoch:[0/2](192300/4588595) loss:3.346 lr:0.0000100 epoch_Time:27764.0min: [2024-01-03 13:54:15,337][model8_pretrain.py][INFO] Epoch:[0/2](192300/4588595) loss:2.140 lr:0.0000100 epoch_Time:27764.0min: [2024-01-03 13:54:15,337][model8_pretrain.py][INFO] Epoch:[0/2](192300/4588595) loss:2.500 lr:0.0000100 epoch_Time:27764.0min: [2024-01-03 13:54:15,337][model8_pretrain.py][INFO] Epoch:[0/2](192300/4588595) loss:2.677 lr:0.0000100 epoch_Time:27764.0min: [2024-01-03 13:54:15,338][model8_pretrain.py][INFO] Epoch:[0/2](192300/4588595) loss:3.095 lr:0.0000100 epoch_Time:27764.0min: [2024-01-03 13:54:52,276][model8_pretrain.py][INFO] Epoch:[0/2](192400/4588595) loss:3.287 lr:0.0000100 epoch_Time:27763.0min: [2024-01-03 13:54:52,276][model8_pretrain.py][INFO] Epoch:[0/2](192400/4588595) loss:3.390 lr:0.0000100 epoch_Time:27763.0min: [2024-01-03 13:54:52,276][model8_pretrain.py][INFO] Epoch:[0/2](192400/4588595) loss:2.716 lr:0.0000100 epoch_Time:27763.0min: [2024-01-03 13:54:52,277][model8_pretrain.py][INFO] Epoch:[0/2](192400/4588595) loss:3.083 lr:0.0000100 epoch_Time:27763.0min: [2024-01-03 13:54:52,277][model8_pretrain.py][INFO] Epoch:[0/2](192400/4588595) loss:2.187 lr:0.0000100 epoch_Time:27763.0min: [2024-01-03 13:54:52,277][model8_pretrain.py][INFO] Epoch:[0/2](192400/4588595) loss:2.896 lr:0.0000100 epoch_Time:27763.0min: [2024-01-03 13:54:52,277][model8_pretrain.py][INFO] Epoch:[0/2](192400/4588595) loss:2.936 lr:0.0000100 epoch_Time:27763.0min: [2024-01-03 13:54:52,277][model8_pretrain.py][INFO] Epoch:[0/2](192400/4588595) loss:2.850 lr:0.0000100 epoch_Time:27763.0min: [2024-01-03 13:55:29,211][model8_pretrain.py][INFO] 
Epoch:[0/2](192500/4588595) loss:2.174 lr:0.0000100 epoch_Time:27763.0min: [2024-01-03 13:55:29,211][model8_pretrain.py][INFO] Epoch:[0/2](192500/4588595) loss:3.138 lr:0.0000100 epoch_Time:27763.0min: [2024-01-03 13:55:29,211][model8_pretrain.py][INFO] Epoch:[0/2](192500/4588595) loss:2.754 lr:0.0000100 epoch_Time:27763.0min: [2024-01-03 13:55:29,211][model8_pretrain.py][INFO] Epoch:[0/2](192500/4588595) loss:2.736 lr:0.0000100 epoch_Time:27763.0min: [2024-01-03 13:55:29,211][model8_pretrain.py][INFO] Epoch:[0/2](192500/4588595) loss:2.905 lr:0.0000100 epoch_Time:27763.0min: [2024-01-03 13:55:29,211][model8_pretrain.py][INFO] Epoch:[0/2](192500/4588595) loss:2.958 lr:0.0000100 epoch_Time:27763.0min: [2024-01-03 13:55:29,211][model8_pretrain.py][INFO] Epoch:[0/2](192500/4588595) loss:2.723 lr:0.0000100 epoch_Time:27763.0min: [2024-01-03 13:55:29,211][model8_pretrain.py][INFO] Epoch:[0/2](192500/4588595) loss:3.085 lr:0.0000100 epoch_Time:27763.0min: [2024-01-03 13:56:13,151][model8_pretrain.py][INFO] Epoch:[0/2](192600/4588595) loss:2.807 lr:0.0000100 epoch_Time:27764.0min: [2024-01-03 13:56:13,151][model8_pretrain.py][INFO] Epoch:[0/2](192600/4588595) loss:2.917 lr:0.0000100 epoch_Time:27764.0min: [2024-01-03 13:56:13,151][model8_pretrain.py][INFO] Epoch:[0/2](192600/4588595) loss:3.232 lr:0.0000100 epoch_Time:27764.0min: [2024-01-03 13:56:13,151][model8_pretrain.py][INFO] Epoch:[0/2](192600/4588595) loss:3.043 lr:0.0000100 epoch_Time:27764.0min: [2024-01-03 13:56:13,151][model8_pretrain.py][INFO] Epoch:[0/2](192600/4588595) loss:3.118 lr:0.0000100 epoch_Time:27764.0min: [2024-01-03 13:56:13,152][model8_pretrain.py][INFO] Epoch:[0/2](192600/4588595) loss:3.408 lr:0.0000100 epoch_Time:27764.0min: [2024-01-03 13:56:13,152][model8_pretrain.py][INFO] Epoch:[0/2](192600/4588595) loss:3.097 lr:0.0000100 epoch_Time:27764.0min: [2024-01-03 13:56:13,152][model8_pretrain.py][INFO] Epoch:[0/2](192600/4588595) loss:2.783 lr:0.0000100 epoch_Time:27764.0min: [2024-01-03 
13:56:50,077][model8_pretrain.py][INFO] Epoch:[0/2](192700/4588595) loss:2.979 lr:0.0000100 epoch_Time:27763.0min: [2024-01-03 13:56:50,077][model8_pretrain.py][INFO] Epoch:[0/2](192700/4588595) loss:3.369 lr:0.0000100 epoch_Time:27763.0min: [2024-01-03 13:56:50,077][model8_pretrain.py][INFO] Epoch:[0/2](192700/4588595) loss:2.663 lr:0.0000100 epoch_Time:27763.0min: [2024-01-03 13:56:50,077][model8_pretrain.py][INFO] Epoch:[0/2](192700/4588595) loss:3.449 lr:0.0000100 epoch_Time:27763.0min: [2024-01-03 13:56:50,077][model8_pretrain.py][INFO] Epoch:[0/2](192700/4588595) loss:2.903 lr:0.0000100 epoch_Time:27763.0min: [2024-01-03 13:56:50,077][model8_pretrain.py][INFO] Epoch:[0/2](192700/4588595) loss:2.808 lr:0.0000100 epoch_Time:27763.0min: [2024-01-03 13:56:50,077][model8_pretrain.py][INFO] Epoch:[0/2](192700/4588595) loss:3.412 lr:0.0000100 epoch_Time:27763.0min: [2024-01-03 13:56:50,077][model8_pretrain.py][INFO] Epoch:[0/2](192700/4588595) loss:2.706 lr:0.0000100 epoch_Time:27763.0min: [2024-01-03 13:57:26,997][model8_pretrain.py][INFO] Epoch:[0/2](192800/4588595) loss:2.937 lr:0.0000100 epoch_Time:27762.0min: [2024-01-03 13:57:26,997][model8_pretrain.py][INFO] Epoch:[0/2](192800/4588595) loss:2.752 lr:0.0000100 epoch_Time:27762.0min: [2024-01-03 13:57:26,997][model8_pretrain.py][INFO] Epoch:[0/2](192800/4588595) loss:3.225 lr:0.0000100 epoch_Time:27762.0min: [2024-01-03 13:57:26,997][model8_pretrain.py][INFO] Epoch:[0/2](192800/4588595) loss:2.955 lr:0.0000100 epoch_Time:27762.0min: [2024-01-03 13:57:26,997][model8_pretrain.py][INFO] Epoch:[0/2](192800/4588595) loss:2.787 lr:0.0000100 epoch_Time:27762.0min: [2024-01-03 13:57:26,997][model8_pretrain.py][INFO] Epoch:[0/2](192800/4588595) loss:2.595 lr:0.0000100 epoch_Time:27762.0min: [2024-01-03 13:57:26,997][model8_pretrain.py][INFO] Epoch:[0/2](192800/4588595) loss:3.202 lr:0.0000100 epoch_Time:27762.0min: [2024-01-03 13:57:26,997][model8_pretrain.py][INFO] Epoch:[0/2](192800/4588595) loss:2.543 lr:0.0000100 
epoch_Time:27762.0min: [2024-01-03 13:58:03,931][model8_pretrain.py][INFO] Epoch:[0/2](192900/4588595) loss:2.657 lr:0.0000100 epoch_Time:27761.0min: [2024-01-03 13:58:03,931][model8_pretrain.py][INFO] Epoch:[0/2](192900/4588595) loss:2.858 lr:0.0000100 epoch_Time:27761.0min: [2024-01-03 13:58:03,931][model8_pretrain.py][INFO] Epoch:[0/2](192900/4588595) loss:3.099 lr:0.0000100 epoch_Time:27761.0min: [2024-01-03 13:58:03,931][model8_pretrain.py][INFO] Epoch:[0/2](192900/4588595) loss:3.133 lr:0.0000100 epoch_Time:27761.0min: [2024-01-03 13:58:03,931][model8_pretrain.py][INFO] Epoch:[0/2](192900/4588595) loss:3.226 lr:0.0000100 epoch_Time:27761.0min: [2024-01-03 13:58:03,931][model8_pretrain.py][INFO] Epoch:[0/2](192900/4588595) loss:3.233 lr:0.0000100 epoch_Time:27761.0min: [2024-01-03 13:58:03,931][model8_pretrain.py][INFO] Epoch:[0/2](192900/4588595) loss:2.820 lr:0.0000100 epoch_Time:27761.0min: [2024-01-03 13:58:03,931][model8_pretrain.py][INFO] Epoch:[0/2](192900/4588595) loss:2.364 lr:0.0000100 epoch_Time:27761.0min: [2024-01-03 13:58:40,866][model8_pretrain.py][INFO] Epoch:[0/2](193000/4588595) loss:2.945 lr:0.0000100 epoch_Time:27761.0min: [2024-01-03 13:58:40,866][model8_pretrain.py][INFO] Epoch:[0/2](193000/4588595) loss:2.581 lr:0.0000100 epoch_Time:27761.0min: [2024-01-03 13:58:40,866][model8_pretrain.py][INFO] Epoch:[0/2](193000/4588595) loss:3.131 lr:0.0000100 epoch_Time:27761.0min: [2024-01-03 13:58:40,866][model8_pretrain.py][INFO] Epoch:[0/2](193000/4588595) loss:2.491 lr:0.0000100 epoch_Time:27761.0min: [2024-01-03 13:58:40,866][model8_pretrain.py][INFO] Epoch:[0/2](193000/4588595) loss:2.758 lr:0.0000100 epoch_Time:27761.0min: [2024-01-03 13:58:40,866][model8_pretrain.py][INFO] Epoch:[0/2](193000/4588595) loss:2.483 lr:0.0000100 epoch_Time:27761.0min: [2024-01-03 13:58:40,866][model8_pretrain.py][INFO] Epoch:[0/2](193000/4588595) loss:3.372 lr:0.0000100 epoch_Time:27761.0min: [2024-01-03 13:58:40,866][model8_pretrain.py][INFO] 
Epoch:[0/2](193000/4588595) loss:2.999 lr:0.0000100 epoch_Time:27761.0min: [2024-01-03 13:59:17,798][model8_pretrain.py][INFO] Epoch:[0/2](193100/4588595) loss:2.649 lr:0.0000100 epoch_Time:27759.0min: [2024-01-03 13:59:17,798][model8_pretrain.py][INFO] Epoch:[0/2](193100/4588595) loss:2.994 lr:0.0000100 epoch_Time:27759.0min: [2024-01-03 13:59:17,798][model8_pretrain.py][INFO] Epoch:[0/2](193100/4588595) loss:3.206 lr:0.0000100 epoch_Time:27759.0min: [2024-01-03 13:59:17,798][model8_pretrain.py][INFO] Epoch:[0/2](193100/4588595) loss:2.758 lr:0.0000100 epoch_Time:27759.0min: [2024-01-03 13:59:17,798][model8_pretrain.py][INFO] Epoch:[0/2](193100/4588595) loss:3.013 lr:0.0000100 epoch_Time:27759.0min: [2024-01-03 13:59:17,798][model8_pretrain.py][INFO] Epoch:[0/2](193100/4588595) loss:2.355 lr:0.0000100 epoch_Time:27759.0min: [2024-01-03 13:59:17,798][model8_pretrain.py][INFO] Epoch:[0/2](193100/4588595) loss:2.949 lr:0.0000100 epoch_Time:27759.0min: [2024-01-03 13:59:17,798][model8_pretrain.py][INFO] Epoch:[0/2](193100/4588595) loss:3.337 lr:0.0000100 epoch_Time:27759.0min: [2024-01-03 13:59:54,730][model8_pretrain.py][INFO] Epoch:[0/2](193200/4588595) loss:2.799 lr:0.0000100 epoch_Time:27758.0min: [2024-01-03 13:59:54,730][model8_pretrain.py][INFO] Epoch:[0/2](193200/4588595) loss:3.035 lr:0.0000100 epoch_Time:27758.0min: [2024-01-03 13:59:54,730][model8_pretrain.py][INFO] Epoch:[0/2](193200/4588595) loss:2.726 lr:0.0000100 epoch_Time:27758.0min: [2024-01-03 13:59:54,730][model8_pretrain.py][INFO] Epoch:[0/2](193200/4588595) loss:3.539 lr:0.0000100 epoch_Time:27758.0min: [2024-01-03 13:59:54,730][model8_pretrain.py][INFO] Epoch:[0/2](193200/4588595) loss:2.743 lr:0.0000100 epoch_Time:27758.0min: [2024-01-03 13:59:54,730][model8_pretrain.py][INFO] Epoch:[0/2](193200/4588595) loss:3.192 lr:0.0000100 epoch_Time:27758.0min: [2024-01-03 13:59:54,730][model8_pretrain.py][INFO] Epoch:[0/2](193200/4588595) loss:3.125 lr:0.0000100 epoch_Time:27758.0min: [2024-01-03 
13:59:54,730][model8_pretrain.py][INFO] Epoch:[0/2](193200/4588595) loss:3.191 lr:0.0000100 epoch_Time:27758.0min: [2024-01-03 14:00:31,649][model8_pretrain.py][INFO] Epoch:[0/2](193300/4588595) loss:2.971 lr:0.0000100 epoch_Time:27757.0min: [2024-01-03 14:00:31,649][model8_pretrain.py][INFO] Epoch:[0/2](193300/4588595) loss:3.325 lr:0.0000100 epoch_Time:27757.0min: [2024-01-03 14:00:31,650][model8_pretrain.py][INFO] Epoch:[0/2](193300/4588595) loss:2.873 lr:0.0000100 epoch_Time:27757.0min: [2024-01-03 14:00:31,650][model8_pretrain.py][INFO] Epoch:[0/2](193300/4588595) loss:3.102 lr:0.0000100 epoch_Time:27757.0min: [2024-01-03 14:00:31,650][model8_pretrain.py][INFO] Epoch:[0/2](193300/4588595) loss:3.078 lr:0.0000100 epoch_Time:27757.0min: [2024-01-03 14:00:31,650][model8_pretrain.py][INFO] Epoch:[0/2](193300/4588595) loss:3.203 lr:0.0000100 epoch_Time:27757.0min: [2024-01-03 14:00:31,650][model8_pretrain.py][INFO] Epoch:[0/2](193300/4588595) loss:3.159 lr:0.0000100 epoch_Time:27757.0min: [2024-01-03 14:00:31,650][model8_pretrain.py][INFO] Epoch:[0/2](193300/4588595) loss:3.374 lr:0.0000100 epoch_Time:27757.0min: [2024-01-03 14:01:15,597][model8_pretrain.py][INFO] Epoch:[0/2](193400/4588595) loss:3.071 lr:0.0000100 epoch_Time:27759.0min: [2024-01-03 14:01:15,597][model8_pretrain.py][INFO] Epoch:[0/2](193400/4588595) loss:2.865 lr:0.0000100 epoch_Time:27759.0min: [2024-01-03 14:01:15,597][model8_pretrain.py][INFO] Epoch:[0/2](193400/4588595) loss:2.826 lr:0.0000100 epoch_Time:27759.0min: [2024-01-03 14:01:15,597][model8_pretrain.py][INFO] Epoch:[0/2](193400/4588595) loss:3.234 lr:0.0000100 epoch_Time:27759.0min: [2024-01-03 14:01:15,597][model8_pretrain.py][INFO] Epoch:[0/2](193400/4588595) loss:2.879 lr:0.0000100 epoch_Time:27759.0min: [2024-01-03 14:01:15,597][model8_pretrain.py][INFO] Epoch:[0/2](193400/4588595) loss:2.686 lr:0.0000100 epoch_Time:27759.0min: [2024-01-03 14:01:15,597][model8_pretrain.py][INFO] Epoch:[0/2](193400/4588595) loss:3.052 lr:0.0000100 
epoch_Time:27759.0min: [2024-01-03 14:01:15,597][model8_pretrain.py][INFO] Epoch:[0/2](193400/4588595) loss:2.873 lr:0.0000100 epoch_Time:27759.0min: [2024-01-03 14:01:52,530][model8_pretrain.py][INFO] Epoch:[0/2](193500/4588595) loss:3.141 lr:0.0000100 epoch_Time:27757.0min: [2024-01-03 14:01:52,530][model8_pretrain.py][INFO] Epoch:[0/2](193500/4588595) loss:3.061 lr:0.0000100 epoch_Time:27757.0min: [2024-01-03 14:01:52,530][model8_pretrain.py][INFO] Epoch:[0/2](193500/4588595) loss:3.280 lr:0.0000100 epoch_Time:27757.0min: [2024-01-03 14:01:52,530][model8_pretrain.py][INFO] Epoch:[0/2](193500/4588595) loss:2.629 lr:0.0000100 epoch_Time:27757.0min: [2024-01-03 14:01:52,530][model8_pretrain.py][INFO] Epoch:[0/2](193500/4588595) loss:2.610 lr:0.0000100 epoch_Time:27757.0min: [2024-01-03 14:01:52,530][model8_pretrain.py][INFO] Epoch:[0/2](193500/4588595) loss:3.290 lr:0.0000100 epoch_Time:27757.0min: [2024-01-03 14:01:52,530][model8_pretrain.py][INFO] Epoch:[0/2](193500/4588595) loss:3.189 lr:0.0000100 epoch_Time:27757.0min: [2024-01-03 14:01:52,530][model8_pretrain.py][INFO] Epoch:[0/2](193500/4588595) loss:3.063 lr:0.0000100 epoch_Time:27757.0min: [2024-01-03 14:02:29,500][model8_pretrain.py][INFO] Epoch:[0/2](193600/4588595) loss:3.111 lr:0.0000100 epoch_Time:27757.0min: [2024-01-03 14:02:29,500][model8_pretrain.py][INFO] Epoch:[0/2](193600/4588595) loss:2.968 lr:0.0000100 epoch_Time:27757.0min: [2024-01-03 14:02:29,500][model8_pretrain.py][INFO] Epoch:[0/2](193600/4588595) loss:2.469 lr:0.0000100 epoch_Time:27757.0min: [2024-01-03 14:02:29,500][model8_pretrain.py][INFO] Epoch:[0/2](193600/4588595) loss:3.027 lr:0.0000100 epoch_Time:27757.0min: [2024-01-03 14:02:29,500][model8_pretrain.py][INFO] Epoch:[0/2](193600/4588595) loss:3.340 lr:0.0000100 epoch_Time:27757.0min: [2024-01-03 14:02:29,500][model8_pretrain.py][INFO] Epoch:[0/2](193600/4588595) loss:2.902 lr:0.0000100 epoch_Time:27757.0min: [2024-01-03 14:02:29,500][model8_pretrain.py][INFO] 
Epoch:[0/2](193600/4588595) loss:3.246 lr:0.0000100 epoch_Time:27757.0min: [2024-01-03 14:02:29,500][model8_pretrain.py][INFO] Epoch:[0/2](193600/4588595) loss:2.674 lr:0.0000100 epoch_Time:27757.0min: [2024-01-03 14:03:06,495][model8_pretrain.py][INFO] Epoch:[0/2](193700/4588595) loss:3.086 lr:0.0000100 epoch_Time:27756.0min: [2024-01-03 14:03:06,495][model8_pretrain.py][INFO] Epoch:[0/2](193700/4588595) loss:3.206 lr:0.0000100 epoch_Time:27756.0min: [2024-01-03 14:03:06,495][model8_pretrain.py][INFO] Epoch:[0/2](193700/4588595) loss:3.125 lr:0.0000100 epoch_Time:27756.0min: [2024-01-03 14:03:06,495][model8_pretrain.py][INFO] Epoch:[0/2](193700/4588595) loss:3.231 lr:0.0000100 epoch_Time:27756.0min: [2024-01-03 14:03:06,495][model8_pretrain.py][INFO] Epoch:[0/2](193700/4588595) loss:2.924 lr:0.0000100 epoch_Time:27756.0min: [2024-01-03 14:03:06,495][model8_pretrain.py][INFO] Epoch:[0/2](193700/4588595) loss:2.766 lr:0.0000100 epoch_Time:27756.0min: [2024-01-03 14:03:06,495][model8_pretrain.py][INFO] Epoch:[0/2](193700/4588595) loss:2.951 lr:0.0000100 epoch_Time:27756.0min: [2024-01-03 14:03:06,495][model8_pretrain.py][INFO] Epoch:[0/2](193700/4588595) loss:3.418 lr:0.0000100 epoch_Time:27756.0min: [2024-01-03 14:03:43,464][model8_pretrain.py][INFO] Epoch:[0/2](193800/4588595) loss:3.335 lr:0.0000100 epoch_Time:27755.0min: [2024-01-03 14:03:43,464][model8_pretrain.py][INFO] Epoch:[0/2](193800/4588595) loss:2.910 lr:0.0000100 epoch_Time:27755.0min: [2024-01-03 14:03:43,464][model8_pretrain.py][INFO] Epoch:[0/2](193800/4588595) loss:3.079 lr:0.0000100 epoch_Time:27755.0min: [2024-01-03 14:03:43,464][model8_pretrain.py][INFO] Epoch:[0/2](193800/4588595) loss:3.128 lr:0.0000100 epoch_Time:27755.0min: [2024-01-03 14:03:43,464][model8_pretrain.py][INFO] Epoch:[0/2](193800/4588595) loss:2.632 lr:0.0000100 epoch_Time:27755.0min: [2024-01-03 14:03:43,464][model8_pretrain.py][INFO] Epoch:[0/2](193800/4588595) loss:2.821 lr:0.0000100 epoch_Time:27755.0min: [2024-01-03 
14:03:43,465][model8_pretrain.py][INFO] Epoch:[0/2](193800/4588595) loss:2.861 lr:0.0000100 epoch_Time:27755.0min: [2024-01-03 14:03:43,465][model8_pretrain.py][INFO] Epoch:[0/2](193800/4588595) loss:2.998 lr:0.0000100 epoch_Time:27755.0min: [2024-01-03 14:04:20,398][model8_pretrain.py][INFO] Epoch:[0/2](193900/4588595) loss:3.204 lr:0.0000100 epoch_Time:27754.0min: [2024-01-03 14:04:20,398][model8_pretrain.py][INFO] Epoch:[0/2](193900/4588595) loss:3.043 lr:0.0000100 epoch_Time:27754.0min: [2024-01-03 14:04:20,398][model8_pretrain.py][INFO] Epoch:[0/2](193900/4588595) loss:2.988 lr:0.0000100 epoch_Time:27754.0min: [2024-01-03 14:04:20,398][model8_pretrain.py][INFO] Epoch:[0/2](193900/4588595) loss:2.970 lr:0.0000100 epoch_Time:27754.0min: [2024-01-03 14:04:20,398][model8_pretrain.py][INFO] Epoch:[0/2](193900/4588595) loss:3.334 lr:0.0000100 epoch_Time:27754.0min: [2024-01-03 14:04:20,398][model8_pretrain.py][INFO] Epoch:[0/2](193900/4588595) loss:3.101 lr:0.0000100 epoch_Time:27754.0min: [2024-01-03 14:04:20,398][model8_pretrain.py][INFO] Epoch:[0/2](193900/4588595) loss:3.107 lr:0.0000100 epoch_Time:27754.0min: [2024-01-03 14:04:20,398][model8_pretrain.py][INFO] Epoch:[0/2](193900/4588595) loss:3.283 lr:0.0000100 epoch_Time:27754.0min: [2024-01-03 14:04:57,334][model8_pretrain.py][INFO] Epoch:[0/2](194000/4588595) loss:2.758 lr:0.0000100 epoch_Time:27753.0min: [2024-01-03 14:04:57,334][model8_pretrain.py][INFO] Epoch:[0/2](194000/4588595) loss:3.273 lr:0.0000100 epoch_Time:27753.0min: [2024-01-03 14:04:57,334][model8_pretrain.py][INFO] Epoch:[0/2](194000/4588595) loss:3.117 lr:0.0000100 epoch_Time:27753.0min: [2024-01-03 14:04:57,334][model8_pretrain.py][INFO] Epoch:[0/2](194000/4588595) loss:3.286 lr:0.0000100 epoch_Time:27753.0min: [2024-01-03 14:04:57,334][model8_pretrain.py][INFO] Epoch:[0/2](194000/4588595) loss:3.175 lr:0.0000100 epoch_Time:27753.0min: [2024-01-03 14:04:57,334][model8_pretrain.py][INFO] Epoch:[0/2](194000/4588595) loss:2.797 lr:0.0000100 
epoch_Time:27753.0min: [2024-01-03 14:04:57,334][model8_pretrain.py][INFO] Epoch:[0/2](194000/4588595) loss:3.185 lr:0.0000100 epoch_Time:27753.0min: [2024-01-03 14:04:57,334][model8_pretrain.py][INFO] Epoch:[0/2](194000/4588595) loss:3.196 lr:0.0000100 epoch_Time:27753.0min: [2024-01-03 14:05:34,312][model8_pretrain.py][INFO] Epoch:[0/2](194100/4588595) loss:2.904 lr:0.0000100 epoch_Time:27752.0min: [2024-01-03 14:05:34,312][model8_pretrain.py][INFO] Epoch:[0/2](194100/4588595) loss:2.514 lr:0.0000100 epoch_Time:27752.0min: [2024-01-03 14:05:34,312][model8_pretrain.py][INFO] Epoch:[0/2](194100/4588595) loss:3.059 lr:0.0000100 epoch_Time:27752.0min: [2024-01-03 14:05:34,312][model8_pretrain.py][INFO] Epoch:[0/2](194100/4588595) loss:3.413 lr:0.0000100 epoch_Time:27752.0min: [2024-01-03 14:05:34,312][model8_pretrain.py][INFO] Epoch:[0/2](194100/4588595) loss:3.261 lr:0.0000100 epoch_Time:27752.0min: [2024-01-03 14:05:34,312][model8_pretrain.py][INFO] Epoch:[0/2](194100/4588595) loss:3.177 lr:0.0000100 epoch_Time:27752.0min: [2024-01-03 14:05:34,312][model8_pretrain.py][INFO] Epoch:[0/2](194100/4588595) loss:2.435 lr:0.0000100 epoch_Time:27752.0min: [2024-01-03 14:05:34,312][model8_pretrain.py][INFO] Epoch:[0/2](194100/4588595) loss:3.127 lr:0.0000100 epoch_Time:27752.0min: [2024-01-03 14:06:18,264][model8_pretrain.py][INFO] Epoch:[0/2](194200/4588595) loss:2.883 lr:0.0000100 epoch_Time:27754.0min: [2024-01-03 14:06:18,264][model8_pretrain.py][INFO] Epoch:[0/2](194200/4588595) loss:2.478 lr:0.0000100 epoch_Time:27754.0min: [2024-01-03 14:06:18,264][model8_pretrain.py][INFO] Epoch:[0/2](194200/4588595) loss:2.778 lr:0.0000100 epoch_Time:27754.0min: [2024-01-03 14:06:18,264][model8_pretrain.py][INFO] Epoch:[0/2](194200/4588595) loss:3.197 lr:0.0000100 epoch_Time:27754.0min: [2024-01-03 14:06:18,264][model8_pretrain.py][INFO] Epoch:[0/2](194200/4588595) loss:2.865 lr:0.0000100 epoch_Time:27754.0min: [2024-01-03 14:06:18,264][model8_pretrain.py][INFO] 
Epoch:[0/2](194200/4588595) loss:3.073 lr:0.0000100 epoch_Time:27754.0min: [2024-01-03 14:06:18,264][model8_pretrain.py][INFO] Epoch:[0/2](194200/4588595) loss:2.859 lr:0.0000100 epoch_Time:27754.0min: [2024-01-03 14:06:18,264][model8_pretrain.py][INFO] Epoch:[0/2](194200/4588595) loss:3.292 lr:0.0000100 epoch_Time:27754.0min: [2024-01-03 14:06:55,187][model8_pretrain.py][INFO] Epoch:[0/2](194300/4588595) loss:2.681 lr:0.0000100 epoch_Time:27752.0min: [2024-01-03 14:06:55,187][model8_pretrain.py][INFO] Epoch:[0/2](194300/4588595) loss:2.671 lr:0.0000100 epoch_Time:27752.0min: [2024-01-03 14:06:55,187][model8_pretrain.py][INFO] Epoch:[0/2](194300/4588595) loss:3.217 lr:0.0000100 epoch_Time:27752.0min: [2024-01-03 14:06:55,187][model8_pretrain.py][INFO] Epoch:[0/2](194300/4588595) loss:2.614 lr:0.0000100 epoch_Time:27752.0min: [2024-01-03 14:06:55,187][model8_pretrain.py][INFO] Epoch:[0/2](194300/4588595) loss:2.859 lr:0.0000100 epoch_Time:27752.0min: [2024-01-03 14:06:55,187][model8_pretrain.py][INFO] Epoch:[0/2](194300/4588595) loss:2.203 lr:0.0000100 epoch_Time:27752.0min: [2024-01-03 14:06:55,187][model8_pretrain.py][INFO] Epoch:[0/2](194300/4588595) loss:2.963 lr:0.0000100 epoch_Time:27752.0min: [2024-01-03 14:06:55,187][model8_pretrain.py][INFO] Epoch:[0/2](194300/4588595) loss:2.682 lr:0.0000100 epoch_Time:27752.0min: [2024-01-03 14:07:32,120][model8_pretrain.py][INFO] Epoch:[0/2](194400/4588595) loss:2.854 lr:0.0000100 epoch_Time:27752.0min: [2024-01-03 14:07:32,120][model8_pretrain.py][INFO] Epoch:[0/2](194400/4588595) loss:2.921 lr:0.0000100 epoch_Time:27752.0min: [2024-01-03 14:07:32,120][model8_pretrain.py][INFO] Epoch:[0/2](194400/4588595) loss:2.874 lr:0.0000100 epoch_Time:27752.0min: [2024-01-03 14:07:32,121][model8_pretrain.py][INFO] Epoch:[0/2](194400/4588595) loss:3.049 lr:0.0000100 epoch_Time:27752.0min: [2024-01-03 14:07:32,121][model8_pretrain.py][INFO] Epoch:[0/2](194400/4588595) loss:2.924 lr:0.0000100 epoch_Time:27752.0min: [2024-01-03 
14:07:32,121][model8_pretrain.py][INFO] Epoch:[0/2](194400/4588595) loss:2.841 lr:0.0000100 epoch_Time:27752.0min: [2024-01-03 14:07:32,121][model8_pretrain.py][INFO] Epoch:[0/2](194400/4588595) loss:2.857 lr:0.0000100 epoch_Time:27752.0min: [2024-01-03 14:07:32,121][model8_pretrain.py][INFO] Epoch:[0/2](194400/4588595) loss:2.915 lr:0.0000100 epoch_Time:27752.0min: [2024-01-03 14:08:09,057][model8_pretrain.py][INFO] Epoch:[0/2](194500/4588595) loss:2.976 lr:0.0000100 epoch_Time:27750.0min: [2024-01-03 14:08:09,057][model8_pretrain.py][INFO] Epoch:[0/2](194500/4588595) loss:3.139 lr:0.0000100 epoch_Time:27750.0min: [2024-01-03 14:08:09,057][model8_pretrain.py][INFO] Epoch:[0/2](194500/4588595) loss:3.196 lr:0.0000100 epoch_Time:27750.0min: [2024-01-03 14:08:09,058][model8_pretrain.py][INFO] Epoch:[0/2](194500/4588595) loss:2.461 lr:0.0000100 epoch_Time:27750.0min: [2024-01-03 14:08:09,058][model8_pretrain.py][INFO] Epoch:[0/2](194500/4588595) loss:3.359 lr:0.0000100 epoch_Time:27750.0min: [2024-01-03 14:08:09,058][model8_pretrain.py][INFO] Epoch:[0/2](194500/4588595) loss:3.067 lr:0.0000100 epoch_Time:27750.0min: [2024-01-03 14:08:09,058][model8_pretrain.py][INFO] Epoch:[0/2](194500/4588595) loss:3.003 lr:0.0000100 epoch_Time:27750.0min: [2024-01-03 14:08:09,058][model8_pretrain.py][INFO] Epoch:[0/2](194500/4588595) loss:2.404 lr:0.0000100 epoch_Time:27750.0min: [2024-01-03 14:08:45,989][model8_pretrain.py][INFO] Epoch:[0/2](194600/4588595) loss:2.686 lr:0.0000100 epoch_Time:27750.0min: [2024-01-03 14:08:45,989][model8_pretrain.py][INFO] Epoch:[0/2](194600/4588595) loss:3.130 lr:0.0000100 epoch_Time:27750.0min: [2024-01-03 14:08:45,989][model8_pretrain.py][INFO] Epoch:[0/2](194600/4588595) loss:2.613 lr:0.0000100 epoch_Time:27750.0min: [2024-01-03 14:08:45,989][model8_pretrain.py][INFO] Epoch:[0/2](194600/4588595) loss:2.792 lr:0.0000100 epoch_Time:27750.0min: [2024-01-03 14:08:45,989][model8_pretrain.py][INFO] Epoch:[0/2](194600/4588595) loss:2.892 lr:0.0000100 
epoch_Time:27750.0min: [2024-01-03 14:08:45,989][model8_pretrain.py][INFO] Epoch:[0/2](194600/4588595) loss:2.716 lr:0.0000100 epoch_Time:27750.0min: [2024-01-03 14:08:45,989][model8_pretrain.py][INFO] Epoch:[0/2](194600/4588595) loss:2.688 lr:0.0000100 epoch_Time:27750.0min: [2024-01-03 14:08:45,989][model8_pretrain.py][INFO] Epoch:[0/2](194600/4588595) loss:3.022 lr:0.0000100 epoch_Time:27750.0min: [2024-01-03 14:09:22,921][model8_pretrain.py][INFO] Epoch:[0/2](194700/4588595) loss:2.800 lr:0.0000100 epoch_Time:27749.0min: [2024-01-03 14:09:22,921][model8_pretrain.py][INFO] Epoch:[0/2](194700/4588595) loss:3.043 lr:0.0000100 epoch_Time:27749.0min: [2024-01-03 14:09:22,921][model8_pretrain.py][INFO] Epoch:[0/2](194700/4588595) loss:3.092 lr:0.0000100 epoch_Time:27749.0min: [2024-01-03 14:09:22,921][model8_pretrain.py][INFO] Epoch:[0/2](194700/4588595) loss:2.839 lr:0.0000100 epoch_Time:27749.0min: [2024-01-03 14:09:22,921][model8_pretrain.py][INFO] Epoch:[0/2](194700/4588595) loss:2.965 lr:0.0000100 epoch_Time:27749.0min: [2024-01-03 14:09:22,922][model8_pretrain.py][INFO] Epoch:[0/2](194700/4588595) loss:2.661 lr:0.0000100 epoch_Time:27749.0min: [2024-01-03 14:09:22,922][model8_pretrain.py][INFO] Epoch:[0/2](194700/4588595) loss:2.760 lr:0.0000100 epoch_Time:27749.0min: [2024-01-03 14:09:22,922][model8_pretrain.py][INFO] Epoch:[0/2](194700/4588595) loss:2.799 lr:0.0000100 epoch_Time:27749.0min: [2024-01-03 14:09:59,856][model8_pretrain.py][INFO] Epoch:[0/2](194800/4588595) loss:2.934 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:09:59,856][model8_pretrain.py][INFO] Epoch:[0/2](194800/4588595) loss:2.567 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:09:59,856][model8_pretrain.py][INFO] Epoch:[0/2](194800/4588595) loss:2.884 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:09:59,856][model8_pretrain.py][INFO] Epoch:[0/2](194800/4588595) loss:2.493 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:09:59,856][model8_pretrain.py][INFO] 
Epoch:[0/2](194800/4588595) loss:2.202 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:09:59,856][model8_pretrain.py][INFO] Epoch:[0/2](194800/4588595) loss:3.017 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:09:59,856][model8_pretrain.py][INFO] Epoch:[0/2](194800/4588595) loss:3.362 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:09:59,856][model8_pretrain.py][INFO] Epoch:[0/2](194800/4588595) loss:2.915 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:10:36,780][model8_pretrain.py][INFO] Epoch:[0/2](194900/4588595) loss:2.971 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:10:36,780][model8_pretrain.py][INFO] Epoch:[0/2](194900/4588595) loss:2.625 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:10:36,780][model8_pretrain.py][INFO] Epoch:[0/2](194900/4588595) loss:2.837 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:10:36,780][model8_pretrain.py][INFO] Epoch:[0/2](194900/4588595) loss:3.331 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:10:36,780][model8_pretrain.py][INFO] Epoch:[0/2](194900/4588595) loss:3.160 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:10:36,780][model8_pretrain.py][INFO] Epoch:[0/2](194900/4588595) loss:2.536 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:10:36,780][model8_pretrain.py][INFO] Epoch:[0/2](194900/4588595) loss:3.074 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:10:36,780][model8_pretrain.py][INFO] Epoch:[0/2](194900/4588595) loss:2.914 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:11:20,838][model8_pretrain.py][INFO] Epoch:[0/2](195000/4588595) loss:2.646 lr:0.0000100 epoch_Time:27748.0min: [2024-01-03 14:11:20,838][model8_pretrain.py][INFO] Epoch:[0/2](195000/4588595) loss:2.748 lr:0.0000100 epoch_Time:27748.0min: [2024-01-03 14:11:20,838][model8_pretrain.py][INFO] Epoch:[0/2](195000/4588595) loss:2.730 lr:0.0000100 epoch_Time:27748.0min: [2024-01-03 14:11:20,838][model8_pretrain.py][INFO] Epoch:[0/2](195000/4588595) loss:2.966 lr:0.0000100 epoch_Time:27748.0min: [2024-01-03 
14:11:20,838][model8_pretrain.py][INFO] Epoch:[0/2](195000/4588595) loss:3.066 lr:0.0000100 epoch_Time:27748.0min: [2024-01-03 14:11:20,838][model8_pretrain.py][INFO] Epoch:[0/2](195000/4588595) loss:2.870 lr:0.0000100 epoch_Time:27748.0min: [2024-01-03 14:11:20,838][model8_pretrain.py][INFO] Epoch:[0/2](195000/4588595) loss:2.674 lr:0.0000100 epoch_Time:27748.0min: [2024-01-03 14:11:20,838][model8_pretrain.py][INFO] Epoch:[0/2](195000/4588595) loss:2.879 lr:0.0000100 epoch_Time:27748.0min: [2024-01-03 14:11:57,764][model8_pretrain.py][INFO] Epoch:[0/2](195100/4588595) loss:2.658 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:11:57,764][model8_pretrain.py][INFO] Epoch:[0/2](195100/4588595) loss:2.600 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:11:57,764][model8_pretrain.py][INFO] Epoch:[0/2](195100/4588595) loss:2.968 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:11:57,764][model8_pretrain.py][INFO] Epoch:[0/2](195100/4588595) loss:3.337 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:11:57,764][model8_pretrain.py][INFO] Epoch:[0/2](195100/4588595) loss:3.083 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:11:57,764][model8_pretrain.py][INFO] Epoch:[0/2](195100/4588595) loss:3.050 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:11:57,764][model8_pretrain.py][INFO] Epoch:[0/2](195100/4588595) loss:2.984 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:11:57,764][model8_pretrain.py][INFO] Epoch:[0/2](195100/4588595) loss:2.794 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:12:34,689][model8_pretrain.py][INFO] Epoch:[0/2](195200/4588595) loss:3.229 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:12:34,689][model8_pretrain.py][INFO] Epoch:[0/2](195200/4588595) loss:2.953 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:12:34,689][model8_pretrain.py][INFO] Epoch:[0/2](195200/4588595) loss:2.916 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:12:34,689][model8_pretrain.py][INFO] Epoch:[0/2](195200/4588595) loss:3.099 lr:0.0000100 
epoch_Time:27747.0min: [2024-01-03 14:12:34,689][model8_pretrain.py][INFO] Epoch:[0/2](195200/4588595) loss:2.221 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:12:34,689][model8_pretrain.py][INFO] Epoch:[0/2](195200/4588595) loss:2.661 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:12:34,689][model8_pretrain.py][INFO] Epoch:[0/2](195200/4588595) loss:2.849 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:12:34,689][model8_pretrain.py][INFO] Epoch:[0/2](195200/4588595) loss:2.059 lr:0.0000100 epoch_Time:27747.0min: [2024-01-03 14:13:11,617][model8_pretrain.py][INFO] Epoch:[0/2](195300/4588595) loss:2.852 lr:0.0000100 epoch_Time:27745.0min: [2024-01-03 14:13:11,617][model8_pretrain.py][INFO] Epoch:[0/2](195300/4588595) loss:2.336 lr:0.0000100 epoch_Time:27745.0min: [2024-01-03 14:13:11,617][model8_pretrain.py][INFO] Epoch:[0/2](195300/4588595) loss:3.156 lr:0.0000100 epoch_Time:27745.0min: [2024-01-03 14:13:11,617][model8_pretrain.py][INFO] Epoch:[0/2](195300/4588595) loss:3.153 lr:0.0000100 epoch_Time:27745.0min: [2024-01-03 14:13:11,617][model8_pretrain.py][INFO] Epoch:[0/2](195300/4588595) loss:2.860 lr:0.0000100 epoch_Time:27745.0min: [2024-01-03 14:13:11,617][model8_pretrain.py][INFO] Epoch:[0/2](195300/4588595) loss:3.063 lr:0.0000100 epoch_Time:27745.0min: [2024-01-03 14:13:11,617][model8_pretrain.py][INFO] Epoch:[0/2](195300/4588595) loss:3.112 lr:0.0000100 epoch_Time:27745.0min: [2024-01-03 14:13:11,617][model8_pretrain.py][INFO] Epoch:[0/2](195300/4588595) loss:2.786 lr:0.0000100 epoch_Time:27745.0min: [2024-01-03 14:13:48,541][model8_pretrain.py][INFO] Epoch:[0/2](195400/4588595) loss:2.954 lr:0.0000100 epoch_Time:27744.0min: [2024-01-03 14:13:48,541][model8_pretrain.py][INFO] Epoch:[0/2](195400/4588595) loss:3.444 lr:0.0000100 epoch_Time:27744.0min: [2024-01-03 14:13:48,541][model8_pretrain.py][INFO] Epoch:[0/2](195400/4588595) loss:2.413 lr:0.0000100 epoch_Time:27744.0min: [2024-01-03 14:13:48,541][model8_pretrain.py][INFO] 
Epoch:[0/2](195400/4588595) loss:2.779 lr:0.0000100 epoch_Time:27744.0min: [2024-01-03 14:13:48,541][model8_pretrain.py][INFO] Epoch:[0/2](195400/4588595) loss:2.965 lr:0.0000100 epoch_Time:27744.0min: [2024-01-03 14:13:48,541][model8_pretrain.py][INFO] Epoch:[0/2](195400/4588595) loss:2.976 lr:0.0000100 epoch_Time:27744.0min: [2024-01-03 14:13:48,541][model8_pretrain.py][INFO] Epoch:[0/2](195400/4588595) loss:2.903 lr:0.0000100 epoch_Time:27744.0min: [2024-01-03 14:13:48,541][model8_pretrain.py][INFO] Epoch:[0/2](195400/4588595) loss:2.692 lr:0.0000100 epoch_Time:27744.0min: [2024-01-03 14:14:25,476][model8_pretrain.py][INFO] Epoch:[0/2](195500/4588595) loss:3.143 lr:0.0000100 epoch_Time:27743.0min: [2024-01-03 14:14:25,476][model8_pretrain.py][INFO] Epoch:[0/2](195500/4588595) loss:2.885 lr:0.0000100 epoch_Time:27743.0min: [2024-01-03 14:14:25,476][model8_pretrain.py][INFO] Epoch:[0/2](195500/4588595) loss:2.979 lr:0.0000100 epoch_Time:27743.0min: [2024-01-03 14:14:25,476][model8_pretrain.py][INFO] Epoch:[0/2](195500/4588595) loss:3.204 lr:0.0000100 epoch_Time:27743.0min: [2024-01-03 14:14:25,476][model8_pretrain.py][INFO] Epoch:[0/2](195500/4588595) loss:2.892 lr:0.0000100 epoch_Time:27743.0min: [2024-01-03 14:14:25,476][model8_pretrain.py][INFO] Epoch:[0/2](195500/4588595) loss:2.846 lr:0.0000100 epoch_Time:27743.0min: [2024-01-03 14:14:25,476][model8_pretrain.py][INFO] Epoch:[0/2](195500/4588595) loss:3.231 lr:0.0000100 epoch_Time:27743.0min: [2024-01-03 14:14:25,476][model8_pretrain.py][INFO] Epoch:[0/2](195500/4588595) loss:3.166 lr:0.0000100 epoch_Time:27743.0min: [2024-01-03 14:15:02,402][model8_pretrain.py][INFO] Epoch:[0/2](195600/4588595) loss:3.252 lr:0.0000100 epoch_Time:27742.0min: [2024-01-03 14:15:02,402][model8_pretrain.py][INFO] Epoch:[0/2](195600/4588595) loss:3.121 lr:0.0000100 epoch_Time:27742.0min: [2024-01-03 14:15:02,402][model8_pretrain.py][INFO] Epoch:[0/2](195600/4588595) loss:3.030 lr:0.0000100 epoch_Time:27742.0min: [2024-01-03 
14:15:02,402][model8_pretrain.py][INFO] Epoch:[0/2](195600/4588595) loss:2.876 lr:0.0000100 epoch_Time:27742.0min: [2024-01-03 14:15:02,402][model8_pretrain.py][INFO] Epoch:[0/2](195600/4588595) loss:2.868 lr:0.0000100 epoch_Time:27742.0min: [2024-01-03 14:15:02,402][model8_pretrain.py][INFO] Epoch:[0/2](195600/4588595) loss:2.795 lr:0.0000100 epoch_Time:27742.0min: [2024-01-03 14:15:02,402][model8_pretrain.py][INFO] Epoch:[0/2](195600/4588595) loss:2.445 lr:0.0000100 epoch_Time:27742.0min: [2024-01-03 14:15:02,402][model8_pretrain.py][INFO] Epoch:[0/2](195600/4588595) loss:2.956 lr:0.0000100 epoch_Time:27742.0min: [2024-01-03 14:15:39,329][model8_pretrain.py][INFO] Epoch:[0/2](195700/4588595) loss:2.619 lr:0.0000100 epoch_Time:27742.0min: [2024-01-03 14:15:39,329][model8_pretrain.py][INFO] Epoch:[0/2](195700/4588595) loss:3.318 lr:0.0000100 epoch_Time:27742.0min: [2024-01-03 14:15:39,329][model8_pretrain.py][INFO] Epoch:[0/2](195700/4588595) loss:2.486 lr:0.0000100 epoch_Time:27742.0min: [2024-01-03 14:15:39,329][model8_pretrain.py][INFO] Epoch:[0/2](195700/4588595) loss:3.343 lr:0.0000100 epoch_Time:27742.0min: [2024-01-03 14:15:39,329][model8_pretrain.py][INFO] Epoch:[0/2](195700/4588595) loss:3.010 lr:0.0000100 epoch_Time:27742.0min: [2024-01-03 14:15:39,329][model8_pretrain.py][INFO] Epoch:[0/2](195700/4588595) loss:3.311 lr:0.0000100 epoch_Time:27742.0min: [2024-01-03 14:15:39,329][model8_pretrain.py][INFO] Epoch:[0/2](195700/4588595) loss:2.719 lr:0.0000100 epoch_Time:27742.0min: [2024-01-03 14:15:39,329][model8_pretrain.py][INFO] Epoch:[0/2](195700/4588595) loss:2.990 lr:0.0000100 epoch_Time:27742.0min: [2024-01-03 14:16:23,296][model8_pretrain.py][INFO] Epoch:[0/2](195800/4588595) loss:2.866 lr:0.0000100 epoch_Time:27743.0min: [2024-01-03 14:16:23,296][model8_pretrain.py][INFO] Epoch:[0/2](195800/4588595) loss:2.555 lr:0.0000100 epoch_Time:27743.0min: [2024-01-03 14:16:23,296][model8_pretrain.py][INFO] Epoch:[0/2](195800/4588595) loss:3.154 lr:0.0000100 
epoch_Time:27743.0min: [2024-01-03 14:16:23,296][model8_pretrain.py][INFO] Epoch:[0/2](195800/4588595) loss:3.140 lr:0.0000100 epoch_Time:27743.0min: [2024-01-03 14:16:23,296][model8_pretrain.py][INFO] Epoch:[0/2](195800/4588595) loss:3.113 lr:0.0000100 epoch_Time:27743.0min: [2024-01-03 14:16:23,296][model8_pretrain.py][INFO] Epoch:[0/2](195800/4588595) loss:2.505 lr:0.0000100 epoch_Time:27743.0min: [2024-01-03 14:16:23,296][model8_pretrain.py][INFO] Epoch:[0/2](195800/4588595) loss:2.519 lr:0.0000100 epoch_Time:27743.0min: [2024-01-03 14:16:23,296][model8_pretrain.py][INFO] Epoch:[0/2](195800/4588595) loss:2.908 lr:0.0000100 epoch_Time:27743.0min: [2024-01-03 14:17:00,237][model8_pretrain.py][INFO] Epoch:[0/2](195900/4588595) loss:2.671 lr:0.0000100 epoch_Time:27742.0min: [2024-01-03 14:17:00,237][model8_pretrain.py][INFO] Epoch:[0/2](195900/4588595) loss:2.764 lr:0.0000100 epoch_Time:27742.0min: [2024-01-03 14:17:00,237][model8_pretrain.py][INFO] Epoch:[0/2](195900/4588595) loss:3.036 lr:0.0000100 epoch_Time:27742.0min: [2024-01-03 14:17:00,237][model8_pretrain.py][INFO] Epoch:[0/2](195900/4588595) loss:2.534 lr:0.0000100 epoch_Time:27742.0min: [2024-01-03 14:17:00,237][model8_pretrain.py][INFO] Epoch:[0/2](195900/4588595) loss:2.917 lr:0.0000100 epoch_Time:27742.0min: [2024-01-03 14:17:00,237][model8_pretrain.py][INFO] Epoch:[0/2](195900/4588595) loss:3.144 lr:0.0000100 epoch_Time:27742.0min: [2024-01-03 14:17:00,237][model8_pretrain.py][INFO] Epoch:[0/2](195900/4588595) loss:3.493 lr:0.0000100 epoch_Time:27742.0min: [2024-01-03 14:17:00,237][model8_pretrain.py][INFO] Epoch:[0/2](195900/4588595) loss:2.806 lr:0.0000100 epoch_Time:27742.0min: [2024-01-03 14:17:37,166][model8_pretrain.py][INFO] Epoch:[0/2](196000/4588595) loss:2.441 lr:0.0000100 epoch_Time:27741.0min: [2024-01-03 14:17:37,166][model8_pretrain.py][INFO] Epoch:[0/2](196000/4588595) loss:2.775 lr:0.0000100 epoch_Time:27741.0min: [2024-01-03 14:17:37,166][model8_pretrain.py][INFO] 
Epoch:[0/2](196000/4588595) loss:2.874 lr:0.0000100 epoch_Time:27741.0min: [2024-01-03 14:17:37,166][model8_pretrain.py][INFO] Epoch:[0/2](196000/4588595) loss:2.767 lr:0.0000100 epoch_Time:27741.0min: [2024-01-03 14:17:37,166][model8_pretrain.py][INFO] Epoch:[0/2](196000/4588595) loss:3.400 lr:0.0000100 epoch_Time:27741.0min: [2024-01-03 14:17:37,166][model8_pretrain.py][INFO] Epoch:[0/2](196000/4588595) loss:2.795 lr:0.0000100 epoch_Time:27741.0min: [2024-01-03 14:17:37,166][model8_pretrain.py][INFO] Epoch:[0/2](196000/4588595) loss:2.994 lr:0.0000100 epoch_Time:27741.0min: [2024-01-03 14:17:37,166][model8_pretrain.py][INFO] Epoch:[0/2](196000/4588595) loss:2.721 lr:0.0000100 epoch_Time:27741.0min: [2024-01-03 14:18:14,104][model8_pretrain.py][INFO] Epoch:[0/2](196100/4588595) loss:3.255 lr:0.0000100 epoch_Time:27740.0min: [2024-01-03 14:18:14,104][model8_pretrain.py][INFO] Epoch:[0/2](196100/4588595) loss:3.021 lr:0.0000100 epoch_Time:27740.0min: [2024-01-03 14:18:14,104][model8_pretrain.py][INFO] Epoch:[0/2](196100/4588595) loss:3.341 lr:0.0000100 epoch_Time:27740.0min: [2024-01-03 14:18:14,104][model8_pretrain.py][INFO] Epoch:[0/2](196100/4588595) loss:2.833 lr:0.0000100 epoch_Time:27740.0min: [2024-01-03 14:18:14,104][model8_pretrain.py][INFO] Epoch:[0/2](196100/4588595) loss:2.971 lr:0.0000100 epoch_Time:27740.0min: [2024-01-03 14:18:14,104][model8_pretrain.py][INFO] Epoch:[0/2](196100/4588595) loss:2.205 lr:0.0000100 epoch_Time:27740.0min: [2024-01-03 14:18:14,104][model8_pretrain.py][INFO] Epoch:[0/2](196100/4588595) loss:2.915 lr:0.0000100 epoch_Time:27740.0min: [2024-01-03 14:18:14,104][model8_pretrain.py][INFO] Epoch:[0/2](196100/4588595) loss:2.473 lr:0.0000100 epoch_Time:27740.0min: [2024-01-03 14:18:51,041][model8_pretrain.py][INFO] Epoch:[0/2](196200/4588595) loss:2.954 lr:0.0000100 epoch_Time:27739.0min: [2024-01-03 14:18:51,041][model8_pretrain.py][INFO] Epoch:[0/2](196200/4588595) loss:2.244 lr:0.0000100 epoch_Time:27739.0min: [2024-01-03 
14:18:51,041][model8_pretrain.py][INFO] Epoch:[0/2](196200/4588595) loss:3.350 lr:0.0000100 epoch_Time:27739.0min: [2024-01-03 14:18:51,041][model8_pretrain.py][INFO] Epoch:[0/2](196200/4588595) loss:2.810 lr:0.0000100 epoch_Time:27739.0min: [2024-01-03 14:18:51,041][model8_pretrain.py][INFO] Epoch:[0/2](196200/4588595) loss:3.034 lr:0.0000100 epoch_Time:27739.0min: [2024-01-03 14:18:51,041][model8_pretrain.py][INFO] Epoch:[0/2](196200/4588595) loss:3.351 lr:0.0000100 epoch_Time:27739.0min: [2024-01-03 14:18:51,041][model8_pretrain.py][INFO] Epoch:[0/2](196200/4588595) loss:2.941 lr:0.0000100 epoch_Time:27739.0min: [2024-01-03 14:18:51,041][model8_pretrain.py][INFO] Epoch:[0/2](196200/4588595) loss:3.080 lr:0.0000100 epoch_Time:27739.0min: [2024-01-03 14:19:27,984][model8_pretrain.py][INFO] Epoch:[0/2](196300/4588595) loss:2.615 lr:0.0000100 epoch_Time:27738.0min: [2024-01-03 14:19:27,984][model8_pretrain.py][INFO] Epoch:[0/2](196300/4588595) loss:3.162 lr:0.0000100 epoch_Time:27738.0min: [2024-01-03 14:19:27,984][model8_pretrain.py][INFO] Epoch:[0/2](196300/4588595) loss:3.171 lr:0.0000100 epoch_Time:27738.0min: [2024-01-03 14:19:27,984][model8_pretrain.py][INFO] Epoch:[0/2](196300/4588595) loss:2.217 lr:0.0000100 epoch_Time:27738.0min: [2024-01-03 14:19:27,984][model8_pretrain.py][INFO] Epoch:[0/2](196300/4588595) loss:2.934 lr:0.0000100 epoch_Time:27738.0min: [2024-01-03 14:19:27,985][model8_pretrain.py][INFO] Epoch:[0/2](196300/4588595) loss:2.596 lr:0.0000100 epoch_Time:27738.0min: [2024-01-03 14:19:27,985][model8_pretrain.py][INFO] Epoch:[0/2](196300/4588595) loss:2.895 lr:0.0000100 epoch_Time:27738.0min: [2024-01-03 14:19:27,985][model8_pretrain.py][INFO] Epoch:[0/2](196300/4588595) loss:3.201 lr:0.0000100 epoch_Time:27738.0min: [2024-01-03 14:20:04,931][model8_pretrain.py][INFO] Epoch:[0/2](196400/4588595) loss:3.151 lr:0.0000100 epoch_Time:27737.0min: [2024-01-03 14:20:04,931][model8_pretrain.py][INFO] Epoch:[0/2](196400/4588595) loss:3.408 lr:0.0000100 
epoch_Time:27737.0min: [2024-01-03 14:20:04,931][model8_pretrain.py][INFO] Epoch:[0/2](196400/4588595) loss:3.395 lr:0.0000100 epoch_Time:27737.0min: [2024-01-03 14:20:04,931][model8_pretrain.py][INFO] Epoch:[0/2](196400/4588595) loss:3.010 lr:0.0000100 epoch_Time:27737.0min: [2024-01-03 14:20:04,931][model8_pretrain.py][INFO] Epoch:[0/2](196400/4588595) loss:2.932 lr:0.0000100 epoch_Time:27737.0min: [2024-01-03 14:20:04,931][model8_pretrain.py][INFO] Epoch:[0/2](196400/4588595) loss:3.129 lr:0.0000100 epoch_Time:27737.0min: [2024-01-03 14:20:04,931][model8_pretrain.py][INFO] Epoch:[0/2](196400/4588595) loss:3.102 lr:0.0000100 epoch_Time:27737.0min: [2024-01-03 14:20:04,931][model8_pretrain.py][INFO] Epoch:[0/2](196400/4588595) loss:3.000 lr:0.0000100 epoch_Time:27737.0min: [2024-01-03 14:20:41,843][model8_pretrain.py][INFO] Epoch:[0/2](196500/4588595) loss:3.495 lr:0.0000100 epoch_Time:27736.0min: [2024-01-03 14:20:41,843][model8_pretrain.py][INFO] Epoch:[0/2](196500/4588595) loss:2.480 lr:0.0000100 epoch_Time:27736.0min: [2024-01-03 14:20:41,843][model8_pretrain.py][INFO] Epoch:[0/2](196500/4588595) loss:3.410 lr:0.0000100 epoch_Time:27736.0min: [2024-01-03 14:20:41,843][model8_pretrain.py][INFO] Epoch:[0/2](196500/4588595) loss:2.879 lr:0.0000100 epoch_Time:27736.0min: [2024-01-03 14:20:41,843][model8_pretrain.py][INFO] Epoch:[0/2](196500/4588595) loss:2.892 lr:0.0000100 epoch_Time:27736.0min: [2024-01-03 14:20:41,843][model8_pretrain.py][INFO] Epoch:[0/2](196500/4588595) loss:3.487 lr:0.0000100 epoch_Time:27736.0min: [2024-01-03 14:20:41,843][model8_pretrain.py][INFO] Epoch:[0/2](196500/4588595) loss:3.088 lr:0.0000100 epoch_Time:27736.0min: [2024-01-03 14:20:41,843][model8_pretrain.py][INFO] Epoch:[0/2](196500/4588595) loss:3.122 lr:0.0000100 epoch_Time:27736.0min: [2024-01-03 14:21:25,843][model8_pretrain.py][INFO] Epoch:[0/2](196600/4588595) loss:2.895 lr:0.0000100 epoch_Time:27738.0min: [2024-01-03 14:21:25,843][model8_pretrain.py][INFO] 
Epoch:[0/2](196600/4588595) loss:3.358 lr:0.0000100 epoch_Time:27738.0min: [2024-01-03 14:21:25,843][model8_pretrain.py][INFO] Epoch:[0/2](196600/4588595) loss:2.672 lr:0.0000100 epoch_Time:27738.0min: [2024-01-03 14:21:25,843][model8_pretrain.py][INFO] Epoch:[0/2](196600/4588595) loss:3.322 lr:0.0000100 epoch_Time:27738.0min: [2024-01-03 14:21:25,843][model8_pretrain.py][INFO] Epoch:[0/2](196600/4588595) loss:2.574 lr:0.0000100 epoch_Time:27738.0min: [2024-01-03 14:21:25,843][model8_pretrain.py][INFO] Epoch:[0/2](196600/4588595) loss:3.070 lr:0.0000100 epoch_Time:27738.0min: [2024-01-03 14:21:25,843][model8_pretrain.py][INFO] Epoch:[0/2](196600/4588595) loss:3.004 lr:0.0000100 epoch_Time:27738.0min: [2024-01-03 14:21:25,843][model8_pretrain.py][INFO] Epoch:[0/2](196600/4588595) loss:2.741 lr:0.0000100 epoch_Time:27738.0min: [2024-01-03 14:22:02,780][model8_pretrain.py][INFO] Epoch:[0/2](196700/4588595) loss:3.010 lr:0.0000100 epoch_Time:27736.0min: [2024-01-03 14:22:02,780][model8_pretrain.py][INFO] Epoch:[0/2](196700/4588595) loss:2.581 lr:0.0000100 epoch_Time:27736.0min: [2024-01-03 14:22:02,780][model8_pretrain.py][INFO] Epoch:[0/2](196700/4588595) loss:3.139 lr:0.0000100 epoch_Time:27736.0min: [2024-01-03 14:22:02,780][model8_pretrain.py][INFO] Epoch:[0/2](196700/4588595) loss:2.762 lr:0.0000100 epoch_Time:27736.0min: [2024-01-03 14:22:02,780][model8_pretrain.py][INFO] Epoch:[0/2](196700/4588595) loss:2.959 lr:0.0000100 epoch_Time:27736.0min: [2024-01-03 14:22:02,780][model8_pretrain.py][INFO] Epoch:[0/2](196700/4588595) loss:3.198 lr:0.0000100 epoch_Time:27736.0min: [2024-01-03 14:22:02,780][model8_pretrain.py][INFO] Epoch:[0/2](196700/4588595) loss:3.305 lr:0.0000100 epoch_Time:27737.0min: [2024-01-03 14:22:02,781][model8_pretrain.py][INFO] Epoch:[0/2](196700/4588595) loss:2.553 lr:0.0000100 epoch_Time:27736.0min: [2024-01-03 14:22:39,716][model8_pretrain.py][INFO] Epoch:[0/2](196800/4588595) loss:2.994 lr:0.0000100 epoch_Time:27736.0min: [2024-01-03 
14:22:39,716][model8_pretrain.py][INFO] Epoch:[0/2](196800/4588595) loss:3.322 lr:0.0000100 epoch_Time:27736.0min: [2024-01-03 14:22:39,716][model8_pretrain.py][INFO] Epoch:[0/2](196800/4588595) loss:3.297 lr:0.0000100 epoch_Time:27736.0min: [2024-01-03 14:22:39,716][model8_pretrain.py][INFO] Epoch:[0/2](196800/4588595) loss:3.186 lr:0.0000100 epoch_Time:27736.0min: [2024-01-03 14:22:39,716][model8_pretrain.py][INFO] Epoch:[0/2](196800/4588595) loss:2.404 lr:0.0000100 epoch_Time:27736.0min: [2024-01-03 14:22:39,716][model8_pretrain.py][INFO] Epoch:[0/2](196800/4588595) loss:2.829 lr:0.0000100 epoch_Time:27736.0min: [2024-01-03 14:22:39,716][model8_pretrain.py][INFO] Epoch:[0/2](196800/4588595) loss:2.238 lr:0.0000100 epoch_Time:27736.0min: [2024-01-03 14:22:39,716][model8_pretrain.py][INFO] Epoch:[0/2](196800/4588595) loss:2.790 lr:0.0000100 epoch_Time:27736.0min: [2024-01-03 14:23:16,663][model8_pretrain.py][INFO] Epoch:[0/2](196900/4588595) loss:2.491 lr:0.0000100 epoch_Time:27735.0min: [2024-01-03 14:23:16,663][model8_pretrain.py][INFO] Epoch:[0/2](196900/4588595) loss:2.740 lr:0.0000100 epoch_Time:27735.0min: [2024-01-03 14:23:16,663][model8_pretrain.py][INFO] Epoch:[0/2](196900/4588595) loss:3.361 lr:0.0000100 epoch_Time:27735.0min: [2024-01-03 14:23:16,663][model8_pretrain.py][INFO] Epoch:[0/2](196900/4588595) loss:3.152 lr:0.0000100 epoch_Time:27735.0min: [2024-01-03 14:23:16,663][model8_pretrain.py][INFO] Epoch:[0/2](196900/4588595) loss:2.473 lr:0.0000100 epoch_Time:27735.0min: [2024-01-03 14:23:16,663][model8_pretrain.py][INFO] Epoch:[0/2](196900/4588595) loss:3.030 lr:0.0000100 epoch_Time:27735.0min: [2024-01-03 14:23:16,663][model8_pretrain.py][INFO] Epoch:[0/2](196900/4588595) loss:3.451 lr:0.0000100 epoch_Time:27735.0min: [2024-01-03 14:23:16,664][model8_pretrain.py][INFO] Epoch:[0/2](196900/4588595) loss:3.118 lr:0.0000100 epoch_Time:27735.0min: [2024-01-03 14:23:53,601][model8_pretrain.py][INFO] Epoch:[0/2](197000/4588595) loss:2.935 lr:0.0000100 
epoch_Time:27733.0min: [2024-01-03 14:23:53,602][model8_pretrain.py][INFO] Epoch:[0/2](197000/4588595) loss:2.966 lr:0.0000100 epoch_Time:27733.0min: [2024-01-03 14:23:53,602][model8_pretrain.py][INFO] Epoch:[0/2](197000/4588595) loss:3.458 lr:0.0000100 epoch_Time:27733.0min: [2024-01-03 14:23:53,602][model8_pretrain.py][INFO] Epoch:[0/2](197000/4588595) loss:2.832 lr:0.0000100 epoch_Time:27733.0min: [2024-01-03 14:23:53,602][model8_pretrain.py][INFO] Epoch:[0/2](197000/4588595) loss:2.336 lr:0.0000100 epoch_Time:27733.0min: [2024-01-03 14:23:53,602][model8_pretrain.py][INFO] Epoch:[0/2](197000/4588595) loss:2.893 lr:0.0000100 epoch_Time:27733.0min: [2024-01-03 14:23:53,602][model8_pretrain.py][INFO] Epoch:[0/2](197000/4588595) loss:2.852 lr:0.0000100 epoch_Time:27733.0min: [2024-01-03 14:23:53,602][model8_pretrain.py][INFO] Epoch:[0/2](197000/4588595) loss:2.670 lr:0.0000100 epoch_Time:27733.0min: [2024-01-03 14:24:30,542][model8_pretrain.py][INFO] Epoch:[0/2](197100/4588595) loss:2.869 lr:0.0000100 epoch_Time:27733.0min: [2024-01-03 14:24:30,542][model8_pretrain.py][INFO] Epoch:[0/2](197100/4588595) loss:2.848 lr:0.0000100 epoch_Time:27733.0min: [2024-01-03 14:24:30,542][model8_pretrain.py][INFO] Epoch:[0/2](197100/4588595) loss:2.823 lr:0.0000100 epoch_Time:27733.0min: [2024-01-03 14:24:30,542][model8_pretrain.py][INFO] Epoch:[0/2](197100/4588595) loss:2.967 lr:0.0000100 epoch_Time:27733.0min: [2024-01-03 14:24:30,542][model8_pretrain.py][INFO] Epoch:[0/2](197100/4588595) loss:2.941 lr:0.0000100 epoch_Time:27733.0min: [2024-01-03 14:24:30,542][model8_pretrain.py][INFO] Epoch:[0/2](197100/4588595) loss:3.197 lr:0.0000100 epoch_Time:27733.0min: [2024-01-03 14:24:30,542][model8_pretrain.py][INFO] Epoch:[0/2](197100/4588595) loss:2.406 lr:0.0000100 epoch_Time:27733.0min: [2024-01-03 14:24:30,542][model8_pretrain.py][INFO] Epoch:[0/2](197100/4588595) loss:3.012 lr:0.0000100 epoch_Time:27733.0min: [2024-01-03 14:25:07,480][model8_pretrain.py][INFO] 
Epoch:[0/2](197200/4588595) loss:2.939 lr:0.0000100 epoch_Time:27732.0min: [2024-01-03 14:25:07,480][model8_pretrain.py][INFO] Epoch:[0/2](197200/4588595) loss:3.121 lr:0.0000100 epoch_Time:27732.0min: [2024-01-03 14:25:07,480][model8_pretrain.py][INFO] Epoch:[0/2](197200/4588595) loss:3.001 lr:0.0000100 epoch_Time:27732.0min: [2024-01-03 14:25:07,480][model8_pretrain.py][INFO] Epoch:[0/2](197200/4588595) loss:2.941 lr:0.0000100 epoch_Time:27732.0min: [2024-01-03 14:25:07,480][model8_pretrain.py][INFO] Epoch:[0/2](197200/4588595) loss:2.951 lr:0.0000100 epoch_Time:27732.0min: [2024-01-03 14:25:07,480][model8_pretrain.py][INFO] Epoch:[0/2](197200/4588595) loss:2.415 lr:0.0000100 epoch_Time:27732.0min: [2024-01-03 14:25:07,480][model8_pretrain.py][INFO] Epoch:[0/2](197200/4588595) loss:3.150 lr:0.0000100 epoch_Time:27732.0min: [2024-01-03 14:25:07,480][model8_pretrain.py][INFO] Epoch:[0/2](197200/4588595) loss:3.134 lr:0.0000100 epoch_Time:27732.0min: [2024-01-03 14:25:44,413][model8_pretrain.py][INFO] Epoch:[0/2](197300/4588595) loss:3.158 lr:0.0000100 epoch_Time:27731.0min: [2024-01-03 14:25:44,413][model8_pretrain.py][INFO] Epoch:[0/2](197300/4588595) loss:2.991 lr:0.0000100 epoch_Time:27731.0min: [2024-01-03 14:25:44,413][model8_pretrain.py][INFO] Epoch:[0/2](197300/4588595) loss:2.660 lr:0.0000100 epoch_Time:27731.0min: [2024-01-03 14:25:44,413][model8_pretrain.py][INFO] Epoch:[0/2](197300/4588595) loss:3.444 lr:0.0000100 epoch_Time:27731.0min: [2024-01-03 14:25:44,413][model8_pretrain.py][INFO] Epoch:[0/2](197300/4588595) loss:3.178 lr:0.0000100 epoch_Time:27731.0min: [2024-01-03 14:25:44,413][model8_pretrain.py][INFO] Epoch:[0/2](197300/4588595) loss:3.106 lr:0.0000100 epoch_Time:27731.0min: [2024-01-03 14:25:44,413][model8_pretrain.py][INFO] Epoch:[0/2](197300/4588595) loss:3.116 lr:0.0000100 epoch_Time:27731.0min: [2024-01-03 14:25:44,413][model8_pretrain.py][INFO] Epoch:[0/2](197300/4588595) loss:3.204 lr:0.0000100 epoch_Time:27731.0min: [2024-01-03 
14:26:28,392][model8_pretrain.py][INFO] Epoch:[0/2](197400/4588595) loss:2.981 lr:0.0000100 epoch_Time:27733.0min: [2024-01-03 14:26:28,392][model8_pretrain.py][INFO] Epoch:[0/2](197400/4588595) loss:2.997 lr:0.0000100 epoch_Time:27733.0min: [2024-01-03 14:26:28,392][model8_pretrain.py][INFO] Epoch:[0/2](197400/4588595) loss:2.958 lr:0.0000100 epoch_Time:27733.0min: [2024-01-03 14:26:28,392][model8_pretrain.py][INFO] Epoch:[0/2](197400/4588595) loss:2.280 lr:0.0000100 epoch_Time:27733.0min: [2024-01-03 14:26:28,392][model8_pretrain.py][INFO] Epoch:[0/2](197400/4588595) loss:2.657 lr:0.0000100 epoch_Time:27733.0min: [2024-01-03 14:26:28,392][model8_pretrain.py][INFO] Epoch:[0/2](197400/4588595) loss:2.768 lr:0.0000100 epoch_Time:27733.0min: [2024-01-03 14:26:28,392][model8_pretrain.py][INFO] Epoch:[0/2](197400/4588595) loss:3.064 lr:0.0000100 epoch_Time:27733.0min: [2024-01-03 14:26:28,392][model8_pretrain.py][INFO] Epoch:[0/2](197400/4588595) loss:3.388 lr:0.0000100 epoch_Time:27733.0min: [2024-01-03 14:27:05,306][model8_pretrain.py][INFO] Epoch:[0/2](197500/4588595) loss:2.514 lr:0.0000100 epoch_Time:27731.0min: [2024-01-03 14:27:05,306][model8_pretrain.py][INFO] Epoch:[0/2](197500/4588595) loss:2.490 lr:0.0000100 epoch_Time:27731.0min: [2024-01-03 14:27:05,306][model8_pretrain.py][INFO] Epoch:[0/2](197500/4588595) loss:2.850 lr:0.0000100 epoch_Time:27731.0min: [2024-01-03 14:27:05,306][model8_pretrain.py][INFO] Epoch:[0/2](197500/4588595) loss:2.938 lr:0.0000100 epoch_Time:27731.0min: [2024-01-03 14:27:05,306][model8_pretrain.py][INFO] Epoch:[0/2](197500/4588595) loss:2.757 lr:0.0000100 epoch_Time:27731.0min: [2024-01-03 14:27:05,306][model8_pretrain.py][INFO] Epoch:[0/2](197500/4588595) loss:3.026 lr:0.0000100 epoch_Time:27731.0min: [2024-01-03 14:27:05,306][model8_pretrain.py][INFO] Epoch:[0/2](197500/4588595) loss:2.879 lr:0.0000100 epoch_Time:27731.0min: [2024-01-03 14:27:05,306][model8_pretrain.py][INFO] Epoch:[0/2](197500/4588595) loss:3.321 lr:0.0000100 
epoch_Time:27731.0min: [2024-01-03 14:27:42,247][model8_pretrain.py][INFO] Epoch:[0/2](197600/4588595) loss:3.252 lr:0.0000100 epoch_Time:27731.0min: [2024-01-03 14:27:42,247][model8_pretrain.py][INFO] Epoch:[0/2](197600/4588595) loss:3.243 lr:0.0000100 epoch_Time:27731.0min: [2024-01-03 14:27:42,247][model8_pretrain.py][INFO] Epoch:[0/2](197600/4588595) loss:2.804 lr:0.0000100 epoch_Time:27731.0min: [2024-01-03 14:27:42,247][model8_pretrain.py][INFO] Epoch:[0/2](197600/4588595) loss:3.580 lr:0.0000100 epoch_Time:27731.0min: [2024-01-03 14:27:42,247][model8_pretrain.py][INFO] Epoch:[0/2](197600/4588595) loss:2.714 lr:0.0000100 epoch_Time:27731.0min: [2024-01-03 14:27:42,247][model8_pretrain.py][INFO] Epoch:[0/2](197600/4588595) loss:2.828 lr:0.0000100 epoch_Time:27731.0min: [2024-01-03 14:27:42,247][model8_pretrain.py][INFO] Epoch:[0/2](197600/4588595) loss:3.351 lr:0.0000100 epoch_Time:27731.0min: [2024-01-03 14:27:42,248][model8_pretrain.py][INFO] Epoch:[0/2](197600/4588595) loss:3.264 lr:0.0000100 epoch_Time:27731.0min: [2024-01-03 14:28:19,199][model8_pretrain.py][INFO] Epoch:[0/2](197700/4588595) loss:3.133 lr:0.0000100 epoch_Time:27730.0min: [2024-01-03 14:28:19,199][model8_pretrain.py][INFO] Epoch:[0/2](197700/4588595) loss:2.872 lr:0.0000100 epoch_Time:27730.0min: [2024-01-03 14:28:19,199][model8_pretrain.py][INFO] Epoch:[0/2](197700/4588595) loss:2.638 lr:0.0000100 epoch_Time:27730.0min: [2024-01-03 14:28:19,199][model8_pretrain.py][INFO] Epoch:[0/2](197700/4588595) loss:3.129 lr:0.0000100 epoch_Time:27730.0min: [2024-01-03 14:28:19,199][model8_pretrain.py][INFO] Epoch:[0/2](197700/4588595) loss:3.006 lr:0.0000100 epoch_Time:27730.0min: [2024-01-03 14:28:19,199][model8_pretrain.py][INFO] Epoch:[0/2](197700/4588595) loss:3.325 lr:0.0000100 epoch_Time:27730.0min: [2024-01-03 14:28:19,199][model8_pretrain.py][INFO] Epoch:[0/2](197700/4588595) loss:3.270 lr:0.0000100 epoch_Time:27730.0min: [2024-01-03 14:28:19,199][model8_pretrain.py][INFO] 
Epoch:[0/2](197700/4588595) loss:3.608 lr:0.0000100 epoch_Time:27730.0min: [2024-01-03 14:28:56,136][model8_pretrain.py][INFO] Epoch:[0/2](197800/4588595) loss:3.059 lr:0.0000100 epoch_Time:27728.0min: [2024-01-03 14:28:56,136][model8_pretrain.py][INFO] Epoch:[0/2](197800/4588595) loss:3.039 lr:0.0000100 epoch_Time:27728.0min: [2024-01-03 14:28:56,136][model8_pretrain.py][INFO] Epoch:[0/2](197800/4588595) loss:2.432 lr:0.0000100 epoch_Time:27728.0min: [2024-01-03 14:28:56,136][model8_pretrain.py][INFO] Epoch:[0/2](197800/4588595) loss:3.223 lr:0.0000100 epoch_Time:27728.0min: [2024-01-03 14:28:56,136][model8_pretrain.py][INFO] Epoch:[0/2](197800/4588595) loss:2.858 lr:0.0000100 epoch_Time:27728.0min: [2024-01-03 14:28:56,136][model8_pretrain.py][INFO] Epoch:[0/2](197800/4588595) loss:2.905 lr:0.0000100 epoch_Time:27728.0min: [2024-01-03 14:28:56,136][model8_pretrain.py][INFO] Epoch:[0/2](197800/4588595) loss:2.944 lr:0.0000100 epoch_Time:27728.0min: [2024-01-03 14:28:56,137][model8_pretrain.py][INFO] Epoch:[0/2](197800/4588595) loss:2.216 lr:0.0000100 epoch_Time:27728.0min: [2024-01-03 14:29:33,078][model8_pretrain.py][INFO] Epoch:[0/2](197900/4588595) loss:3.016 lr:0.0000100 epoch_Time:27728.0min: [2024-01-03 14:29:33,078][model8_pretrain.py][INFO] Epoch:[0/2](197900/4588595) loss:3.689 lr:0.0000100 epoch_Time:27728.0min: [2024-01-03 14:29:33,078][model8_pretrain.py][INFO] Epoch:[0/2](197900/4588595) loss:3.018 lr:0.0000100 epoch_Time:27728.0min: [2024-01-03 14:29:33,078][model8_pretrain.py][INFO] Epoch:[0/2](197900/4588595) loss:3.046 lr:0.0000100 epoch_Time:27728.0min: [2024-01-03 14:29:33,078][model8_pretrain.py][INFO] Epoch:[0/2](197900/4588595) loss:2.585 lr:0.0000100 epoch_Time:27728.0min: [2024-01-03 14:29:33,078][model8_pretrain.py][INFO] Epoch:[0/2](197900/4588595) loss:3.047 lr:0.0000100 epoch_Time:27728.0min: [2024-01-03 14:29:33,078][model8_pretrain.py][INFO] Epoch:[0/2](197900/4588595) loss:3.138 lr:0.0000100 epoch_Time:27728.0min: [2024-01-03 
14:29:33,079][model8_pretrain.py][INFO] Epoch:[0/2](197900/4588595) loss:2.661 lr:0.0000100 epoch_Time:27728.0min: [2024-01-03 14:30:10,023][model8_pretrain.py][INFO] Epoch:[0/2](198000/4588595) loss:3.252 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:30:10,023][model8_pretrain.py][INFO] Epoch:[0/2](198000/4588595) loss:2.863 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:30:10,023][model8_pretrain.py][INFO] Epoch:[0/2](198000/4588595) loss:3.166 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:30:10,023][model8_pretrain.py][INFO] Epoch:[0/2](198000/4588595) loss:2.994 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:30:10,023][model8_pretrain.py][INFO] Epoch:[0/2](198000/4588595) loss:2.557 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:30:10,023][model8_pretrain.py][INFO] Epoch:[0/2](198000/4588595) loss:2.975 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:30:10,023][model8_pretrain.py][INFO] Epoch:[0/2](198000/4588595) loss:3.235 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:30:10,023][model8_pretrain.py][INFO] Epoch:[0/2](198000/4588595) loss:3.262 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:30:46,959][model8_pretrain.py][INFO] Epoch:[0/2](198100/4588595) loss:3.102 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:30:46,959][model8_pretrain.py][INFO] Epoch:[0/2](198100/4588595) loss:3.038 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:30:46,959][model8_pretrain.py][INFO] Epoch:[0/2](198100/4588595) loss:3.203 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:30:46,959][model8_pretrain.py][INFO] Epoch:[0/2](198100/4588595) loss:2.887 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:30:46,959][model8_pretrain.py][INFO] Epoch:[0/2](198100/4588595) loss:3.246 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:30:46,959][model8_pretrain.py][INFO] Epoch:[0/2](198100/4588595) loss:3.077 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:30:46,959][model8_pretrain.py][INFO] Epoch:[0/2](198100/4588595) loss:2.973 lr:0.0000100 
epoch_Time:27726.0min: [2024-01-03 14:30:46,959][model8_pretrain.py][INFO] Epoch:[0/2](198100/4588595) loss:3.100 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:31:30,991][model8_pretrain.py][INFO] Epoch:[0/2](198200/4588595) loss:2.588 lr:0.0000100 epoch_Time:27727.0min: [2024-01-03 14:31:30,991][model8_pretrain.py][INFO] Epoch:[0/2](198200/4588595) loss:2.939 lr:0.0000100 epoch_Time:27727.0min: [2024-01-03 14:31:30,991][model8_pretrain.py][INFO] Epoch:[0/2](198200/4588595) loss:3.008 lr:0.0000100 epoch_Time:27727.0min: [2024-01-03 14:31:30,991][model8_pretrain.py][INFO] Epoch:[0/2](198200/4588595) loss:2.811 lr:0.0000100 epoch_Time:27727.0min: [2024-01-03 14:31:30,991][model8_pretrain.py][INFO] Epoch:[0/2](198200/4588595) loss:3.088 lr:0.0000100 epoch_Time:27727.0min: [2024-01-03 14:31:30,991][model8_pretrain.py][INFO] Epoch:[0/2](198200/4588595) loss:3.583 lr:0.0000100 epoch_Time:27727.0min: [2024-01-03 14:31:30,991][model8_pretrain.py][INFO] Epoch:[0/2](198200/4588595) loss:3.131 lr:0.0000100 epoch_Time:27727.0min: [2024-01-03 14:31:30,991][model8_pretrain.py][INFO] Epoch:[0/2](198200/4588595) loss:3.031 lr:0.0000100 epoch_Time:27727.0min: [2024-01-03 14:32:07,926][model8_pretrain.py][INFO] Epoch:[0/2](198300/4588595) loss:3.171 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:32:07,926][model8_pretrain.py][INFO] Epoch:[0/2](198300/4588595) loss:2.600 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:32:07,926][model8_pretrain.py][INFO] Epoch:[0/2](198300/4588595) loss:3.082 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:32:07,926][model8_pretrain.py][INFO] Epoch:[0/2](198300/4588595) loss:2.828 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:32:07,926][model8_pretrain.py][INFO] Epoch:[0/2](198300/4588595) loss:2.703 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:32:07,926][model8_pretrain.py][INFO] Epoch:[0/2](198300/4588595) loss:3.087 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:32:07,926][model8_pretrain.py][INFO] 
Epoch:[0/2](198300/4588595) loss:2.897 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:32:07,926][model8_pretrain.py][INFO] Epoch:[0/2](198300/4588595) loss:2.678 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:32:44,851][model8_pretrain.py][INFO] Epoch:[0/2](198400/4588595) loss:3.238 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:32:44,851][model8_pretrain.py][INFO] Epoch:[0/2](198400/4588595) loss:3.066 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:32:44,851][model8_pretrain.py][INFO] Epoch:[0/2](198400/4588595) loss:3.000 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:32:44,851][model8_pretrain.py][INFO] Epoch:[0/2](198400/4588595) loss:2.697 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:32:44,851][model8_pretrain.py][INFO] Epoch:[0/2](198400/4588595) loss:3.176 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:32:44,851][model8_pretrain.py][INFO] Epoch:[0/2](198400/4588595) loss:2.905 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:32:44,851][model8_pretrain.py][INFO] Epoch:[0/2](198400/4588595) loss:3.135 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:32:44,852][model8_pretrain.py][INFO] Epoch:[0/2](198400/4588595) loss:2.614 lr:0.0000100 epoch_Time:27726.0min: [2024-01-03 14:33:21,789][model8_pretrain.py][INFO] Epoch:[0/2](198500/4588595) loss:3.108 lr:0.0000100 epoch_Time:27724.0min: [2024-01-03 14:33:21,789][model8_pretrain.py][INFO] Epoch:[0/2](198500/4588595) loss:2.762 lr:0.0000100 epoch_Time:27724.0min: [2024-01-03 14:33:21,789][model8_pretrain.py][INFO] Epoch:[0/2](198500/4588595) loss:2.824 lr:0.0000100 epoch_Time:27724.0min: [2024-01-03 14:33:21,789][model8_pretrain.py][INFO] Epoch:[0/2](198500/4588595) loss:3.035 lr:0.0000100 epoch_Time:27724.0min: [2024-01-03 14:33:21,789][model8_pretrain.py][INFO] Epoch:[0/2](198500/4588595) loss:3.009 lr:0.0000100 epoch_Time:27724.0min: [2024-01-03 14:33:21,789][model8_pretrain.py][INFO] Epoch:[0/2](198500/4588595) loss:3.260 lr:0.0000100 epoch_Time:27724.0min: [2024-01-03 
14:33:21,789][model8_pretrain.py][INFO] Epoch:[0/2](198500/4588595) loss:3.467 lr:0.0000100 epoch_Time:27724.0min: [2024-01-03 14:33:21,789][model8_pretrain.py][INFO] Epoch:[0/2](198500/4588595) loss:2.910 lr:0.0000100 epoch_Time:27724.0min: [2024-01-03 14:33:58,740][model8_pretrain.py][INFO] Epoch:[0/2](198600/4588595) loss:2.963 lr:0.0000100 epoch_Time:27723.0min: [2024-01-03 14:33:58,740][model8_pretrain.py][INFO] Epoch:[0/2](198600/4588595) loss:3.641 lr:0.0000100 epoch_Time:27723.0min: [2024-01-03 14:33:58,740][model8_pretrain.py][INFO] Epoch:[0/2](198600/4588595) loss:2.971 lr:0.0000100 epoch_Time:27723.0min: [2024-01-03 14:33:58,740][model8_pretrain.py][INFO] Epoch:[0/2](198600/4588595) loss:3.203 lr:0.0000100 epoch_Time:27723.0min: [2024-01-03 14:33:58,740][model8_pretrain.py][INFO] Epoch:[0/2](198600/4588595) loss:2.891 lr:0.0000100 epoch_Time:27723.0min: [2024-01-03 14:33:58,740][model8_pretrain.py][INFO] Epoch:[0/2](198600/4588595) loss:3.075 lr:0.0000100 epoch_Time:27723.0min: [2024-01-03 14:33:58,740][model8_pretrain.py][INFO] Epoch:[0/2](198600/4588595) loss:3.047 lr:0.0000100 epoch_Time:27723.0min: [2024-01-03 14:33:58,752][model8_pretrain.py][INFO] Epoch:[0/2](198600/4588595) loss:3.174 lr:0.0000100 epoch_Time:27723.0min: [2024-01-03 14:34:35,728][model8_pretrain.py][INFO] Epoch:[0/2](198700/4588595) loss:3.239 lr:0.0000100 epoch_Time:27723.0min: [2024-01-03 14:34:35,728][model8_pretrain.py][INFO] Epoch:[0/2](198700/4588595) loss:2.406 lr:0.0000100 epoch_Time:27723.0min: [2024-01-03 14:34:35,728][model8_pretrain.py][INFO] Epoch:[0/2](198700/4588595) loss:2.616 lr:0.0000100 epoch_Time:27723.0min: [2024-01-03 14:34:35,728][model8_pretrain.py][INFO] Epoch:[0/2](198700/4588595) loss:3.193 lr:0.0000100 epoch_Time:27723.0min: [2024-01-03 14:34:35,728][model8_pretrain.py][INFO] Epoch:[0/2](198700/4588595) loss:2.900 lr:0.0000100 epoch_Time:27723.0min: [2024-01-03 14:34:35,728][model8_pretrain.py][INFO] Epoch:[0/2](198700/4588595) loss:3.248 lr:0.0000100 
epoch_Time:27723.0min: [2024-01-03 14:34:35,728][model8_pretrain.py][INFO] Epoch:[0/2](198700/4588595) loss:2.822 lr:0.0000100 epoch_Time:27723.0min: [2024-01-03 14:34:35,729][model8_pretrain.py][INFO] Epoch:[0/2](198700/4588595) loss:3.242 lr:0.0000100 epoch_Time:27723.0min: [2024-01-03 14:35:12,725][model8_pretrain.py][INFO] Epoch:[0/2](198800/4588595) loss:2.996 lr:0.0000100 epoch_Time:27721.0min: [2024-01-03 14:35:12,725][model8_pretrain.py][INFO] Epoch:[0/2](198800/4588595) loss:2.811 lr:0.0000100 epoch_Time:27721.0min: [2024-01-03 14:35:12,725][model8_pretrain.py][INFO] Epoch:[0/2](198800/4588595) loss:2.812 lr:0.0000100 epoch_Time:27721.0min: [2024-01-03 14:35:12,725][model8_pretrain.py][INFO] Epoch:[0/2](198800/4588595) loss:2.850 lr:0.0000100 epoch_Time:27721.0min: [2024-01-03 14:35:12,725][model8_pretrain.py][INFO] Epoch:[0/2](198800/4588595) loss:3.125 lr:0.0000100 epoch_Time:27721.0min: [2024-01-03 14:35:12,725][model8_pretrain.py][INFO] Epoch:[0/2](198800/4588595) loss:3.321 lr:0.0000100 epoch_Time:27721.0min: [2024-01-03 14:35:12,725][model8_pretrain.py][INFO] Epoch:[0/2](198800/4588595) loss:3.106 lr:0.0000100 epoch_Time:27721.0min: [2024-01-03 14:35:12,725][model8_pretrain.py][INFO] Epoch:[0/2](198800/4588595) loss:2.331 lr:0.0000100 epoch_Time:27721.0min: [2024-01-03 14:35:49,685][model8_pretrain.py][INFO] Epoch:[0/2](198900/4588595) loss:2.923 lr:0.0000100 epoch_Time:27720.0min: [2024-01-03 14:35:49,685][model8_pretrain.py][INFO] Epoch:[0/2](198900/4588595) loss:3.355 lr:0.0000100 epoch_Time:27720.0min: [2024-01-03 14:35:49,685][model8_pretrain.py][INFO] Epoch:[0/2](198900/4588595) loss:3.144 lr:0.0000100 epoch_Time:27720.0min: [2024-01-03 14:35:49,685][model8_pretrain.py][INFO] Epoch:[0/2](198900/4588595) loss:2.666 lr:0.0000100 epoch_Time:27720.0min: [2024-01-03 14:35:49,686][model8_pretrain.py][INFO] Epoch:[0/2](198900/4588595) loss:2.933 lr:0.0000100 epoch_Time:27720.0min: [2024-01-03 14:35:49,686][model8_pretrain.py][INFO] 
Epoch:[0/2](198900/4588595) loss:2.372 lr:0.0000100 epoch_Time:27720.0min: [2024-01-03 14:35:49,686][model8_pretrain.py][INFO] Epoch:[0/2](198900/4588595) loss:3.113 lr:0.0000100 epoch_Time:27720.0min: [2024-01-03 14:35:49,686][model8_pretrain.py][INFO] Epoch:[0/2](198900/4588595) loss:2.926 lr:0.0000100 epoch_Time:27720.0min: [2024-01-03 14:36:33,458][model8_pretrain.py][INFO] Epoch:[0/2](199000/4588595) loss:2.371 lr:0.0000100 epoch_Time:27722.0min: [2024-01-03 14:36:33,458][model8_pretrain.py][INFO] Epoch:[0/2](199000/4588595) loss:2.524 lr:0.0000100 epoch_Time:27722.0min: [2024-01-03 14:36:33,458][model8_pretrain.py][INFO] Epoch:[0/2](199000/4588595) loss:2.816 lr:0.0000100 epoch_Time:27722.0min: [2024-01-03 14:36:33,458][model8_pretrain.py][INFO] Epoch:[0/2](199000/4588595) loss:2.852 lr:0.0000100 epoch_Time:27722.0min: [2024-01-03 14:36:33,458][model8_pretrain.py][INFO] Epoch:[0/2](199000/4588595) loss:2.938 lr:0.0000100 epoch_Time:27722.0min: [2024-01-03 14:36:33,458][model8_pretrain.py][INFO] Epoch:[0/2](199000/4588595) loss:3.453 lr:0.0000100 epoch_Time:27722.0min: [2024-01-03 14:36:33,458][model8_pretrain.py][INFO] Epoch:[0/2](199000/4588595) loss:2.404 lr:0.0000100 epoch_Time:27722.0min: [2024-01-03 14:36:33,458][model8_pretrain.py][INFO] Epoch:[0/2](199000/4588595) loss:2.646 lr:0.0000100 epoch_Time:27722.0min: [2024-01-03 14:37:10,391][model8_pretrain.py][INFO] Epoch:[0/2](199100/4588595) loss:3.036 lr:0.0000100 epoch_Time:27721.0min: [2024-01-03 14:37:10,391][model8_pretrain.py][INFO] Epoch:[0/2](199100/4588595) loss:2.811 lr:0.0000100 epoch_Time:27721.0min: [2024-01-03 14:37:10,391][model8_pretrain.py][INFO] Epoch:[0/2](199100/4588595) loss:3.278 lr:0.0000100 epoch_Time:27721.0min: [2024-01-03 14:37:10,391][model8_pretrain.py][INFO] Epoch:[0/2](199100/4588595) loss:2.654 lr:0.0000100 epoch_Time:27721.0min: [2024-01-03 14:37:10,391][model8_pretrain.py][INFO] Epoch:[0/2](199100/4588595) loss:2.287 lr:0.0000100 epoch_Time:27721.0min: [2024-01-03 
14:37:10,391][model8_pretrain.py][INFO] Epoch:[0/2](199100/4588595) loss:2.656 lr:0.0000100 epoch_Time:27721.0min: [2024-01-03 14:37:10,392][model8_pretrain.py][INFO] Epoch:[0/2](199100/4588595) loss:3.383 lr:0.0000100 epoch_Time:27721.0min: [2024-01-03 14:37:10,392][model8_pretrain.py][INFO] Epoch:[0/2](199100/4588595) loss:3.089 lr:0.0000100 epoch_Time:27721.0min: [2024-01-03 14:37:47,328][model8_pretrain.py][INFO] Epoch:[0/2](199200/4588595) loss:3.110 lr:0.0000100 epoch_Time:27720.0min: [2024-01-03 14:37:47,328][model8_pretrain.py][INFO] Epoch:[0/2](199200/4588595) loss:2.755 lr:0.0000100 epoch_Time:27720.0min: [2024-01-03 14:37:47,328][model8_pretrain.py][INFO] Epoch:[0/2](199200/4588595) loss:2.354 lr:0.0000100 epoch_Time:27720.0min: [2024-01-03 14:37:47,328][model8_pretrain.py][INFO] Epoch:[0/2](199200/4588595) loss:3.526 lr:0.0000100 epoch_Time:27720.0min: [2024-01-03 14:37:47,328][model8_pretrain.py][INFO] Epoch:[0/2](199200/4588595) loss:3.340 lr:0.0000100 epoch_Time:27720.0min: [2024-01-03 14:37:47,328][model8_pretrain.py][INFO] Epoch:[0/2](199200/4588595) loss:2.379 lr:0.0000100 epoch_Time:27720.0min: [2024-01-03 14:37:47,328][model8_pretrain.py][INFO] Epoch:[0/2](199200/4588595) loss:3.184 lr:0.0000100 epoch_Time:27720.0min: [2024-01-03 14:37:47,328][model8_pretrain.py][INFO] Epoch:[0/2](199200/4588595) loss:2.702 lr:0.0000100 epoch_Time:27720.0min: [2024-01-03 14:38:24,266][model8_pretrain.py][INFO] Epoch:[0/2](199300/4588595) loss:3.225 lr:0.0000100 epoch_Time:27719.0min: [2024-01-03 14:38:24,266][model8_pretrain.py][INFO] Epoch:[0/2](199300/4588595) loss:2.996 lr:0.0000100 epoch_Time:27719.0min: [2024-01-03 14:38:24,266][model8_pretrain.py][INFO] Epoch:[0/2](199300/4588595) loss:3.008 lr:0.0000100 epoch_Time:27719.0min: [2024-01-03 14:38:24,266][model8_pretrain.py][INFO] Epoch:[0/2](199300/4588595) loss:2.578 lr:0.0000100 epoch_Time:27719.0min: [2024-01-03 14:38:24,266][model8_pretrain.py][INFO] Epoch:[0/2](199300/4588595) loss:2.843 lr:0.0000100 
epoch_Time:27719.0min: [2024-01-03 14:38:24,266][model8_pretrain.py][INFO] Epoch:[0/2](199300/4588595) loss:3.002 lr:0.0000100 epoch_Time:27719.0min: [2024-01-03 14:38:24,266][model8_pretrain.py][INFO] Epoch:[0/2](199300/4588595) loss:3.153 lr:0.0000100 epoch_Time:27719.0min: [2024-01-03 14:38:24,267][model8_pretrain.py][INFO] Epoch:[0/2](199300/4588595) loss:2.943 lr:0.0000100 epoch_Time:27719.0min: [2024-01-03 14:39:01,197][model8_pretrain.py][INFO] Epoch:[0/2](199400/4588595) loss:3.077 lr:0.0000100 epoch_Time:27718.0min: [2024-01-03 14:39:01,197][model8_pretrain.py][INFO] Epoch:[0/2](199400/4588595) loss:3.256 lr:0.0000100 epoch_Time:27718.0min: [2024-01-03 14:39:01,197][model8_pretrain.py][INFO] Epoch:[0/2](199400/4588595) loss:2.610 lr:0.0000100 epoch_Time:27718.0min: [2024-01-03 14:39:01,197][model8_pretrain.py][INFO] Epoch:[0/2](199400/4588595) loss:3.037 lr:0.0000100 epoch_Time:27718.0min: [2024-01-03 14:39:01,197][model8_pretrain.py][INFO] Epoch:[0/2](199400/4588595) loss:3.000 lr:0.0000100 epoch_Time:27718.0min: [2024-01-03 14:39:01,197][model8_pretrain.py][INFO] Epoch:[0/2](199400/4588595) loss:3.024 lr:0.0000100 epoch_Time:27718.0min: [2024-01-03 14:39:01,197][model8_pretrain.py][INFO] Epoch:[0/2](199400/4588595) loss:2.813 lr:0.0000100 epoch_Time:27718.0min: [2024-01-03 14:39:01,198][model8_pretrain.py][INFO] Epoch:[0/2](199400/4588595) loss:3.041 lr:0.0000100 epoch_Time:27718.0min: [2024-01-03 14:39:38,155][model8_pretrain.py][INFO] Epoch:[0/2](199500/4588595) loss:2.309 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:39:38,155][model8_pretrain.py][INFO] Epoch:[0/2](199500/4588595) loss:2.360 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:39:38,155][model8_pretrain.py][INFO] Epoch:[0/2](199500/4588595) loss:3.186 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:39:38,155][model8_pretrain.py][INFO] Epoch:[0/2](199500/4588595) loss:2.560 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:39:38,155][model8_pretrain.py][INFO] 
Epoch:[0/2](199500/4588595) loss:2.982 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:39:38,155][model8_pretrain.py][INFO] Epoch:[0/2](199500/4588595) loss:2.804 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:39:38,155][model8_pretrain.py][INFO] Epoch:[0/2](199500/4588595) loss:2.947 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:39:38,155][model8_pretrain.py][INFO] Epoch:[0/2](199500/4588595) loss:2.705 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:40:15,085][model8_pretrain.py][INFO] Epoch:[0/2](199600/4588595) loss:2.601 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:40:15,085][model8_pretrain.py][INFO] Epoch:[0/2](199600/4588595) loss:3.256 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:40:15,085][model8_pretrain.py][INFO] Epoch:[0/2](199600/4588595) loss:3.379 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:40:15,085][model8_pretrain.py][INFO] Epoch:[0/2](199600/4588595) loss:2.928 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:40:15,085][model8_pretrain.py][INFO] Epoch:[0/2](199600/4588595) loss:3.256 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:40:15,085][model8_pretrain.py][INFO] Epoch:[0/2](199600/4588595) loss:3.359 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:40:15,085][model8_pretrain.py][INFO] Epoch:[0/2](199600/4588595) loss:2.561 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:40:15,085][model8_pretrain.py][INFO] Epoch:[0/2](199600/4588595) loss:3.264 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:40:52,062][model8_pretrain.py][INFO] Epoch:[0/2](199700/4588595) loss:2.604 lr:0.0000100 epoch_Time:27715.0min: [2024-01-03 14:40:52,062][model8_pretrain.py][INFO] Epoch:[0/2](199700/4588595) loss:2.474 lr:0.0000100 epoch_Time:27715.0min: [2024-01-03 14:40:52,062][model8_pretrain.py][INFO] Epoch:[0/2](199700/4588595) loss:2.280 lr:0.0000100 epoch_Time:27715.0min: [2024-01-03 14:40:52,062][model8_pretrain.py][INFO] Epoch:[0/2](199700/4588595) loss:3.021 lr:0.0000100 epoch_Time:27715.0min: [2024-01-03 
14:40:52,062][model8_pretrain.py][INFO] Epoch:[0/2](199700/4588595) loss:2.879 lr:0.0000100 epoch_Time:27715.0min: [2024-01-03 14:40:52,062][model8_pretrain.py][INFO] Epoch:[0/2](199700/4588595) loss:3.418 lr:0.0000100 epoch_Time:27715.0min: [2024-01-03 14:40:52,062][model8_pretrain.py][INFO] Epoch:[0/2](199700/4588595) loss:2.718 lr:0.0000100 epoch_Time:27715.0min: [2024-01-03 14:40:52,062][model8_pretrain.py][INFO] Epoch:[0/2](199700/4588595) loss:2.992 lr:0.0000100 epoch_Time:27715.0min: [2024-01-03 14:41:35,934][model8_pretrain.py][INFO] Epoch:[0/2](199800/4588595) loss:2.749 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:41:35,934][model8_pretrain.py][INFO] Epoch:[0/2](199800/4588595) loss:2.324 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:41:35,934][model8_pretrain.py][INFO] Epoch:[0/2](199800/4588595) loss:2.885 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:41:35,934][model8_pretrain.py][INFO] Epoch:[0/2](199800/4588595) loss:3.326 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:41:35,934][model8_pretrain.py][INFO] Epoch:[0/2](199800/4588595) loss:2.828 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:41:35,934][model8_pretrain.py][INFO] Epoch:[0/2](199800/4588595) loss:3.317 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:41:35,934][model8_pretrain.py][INFO] Epoch:[0/2](199800/4588595) loss:2.895 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:41:35,934][model8_pretrain.py][INFO] Epoch:[0/2](199800/4588595) loss:3.110 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:42:12,870][model8_pretrain.py][INFO] Epoch:[0/2](199900/4588595) loss:2.885 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:42:12,870][model8_pretrain.py][INFO] Epoch:[0/2](199900/4588595) loss:2.670 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:42:12,870][model8_pretrain.py][INFO] Epoch:[0/2](199900/4588595) loss:2.877 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:42:12,870][model8_pretrain.py][INFO] Epoch:[0/2](199900/4588595) loss:3.162 lr:0.0000100 
epoch_Time:27716.0min: [2024-01-03 14:42:12,870][model8_pretrain.py][INFO] Epoch:[0/2](199900/4588595) loss:2.752 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:42:12,870][model8_pretrain.py][INFO] Epoch:[0/2](199900/4588595) loss:3.244 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:42:12,870][model8_pretrain.py][INFO] Epoch:[0/2](199900/4588595) loss:2.975 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:42:12,870][model8_pretrain.py][INFO] Epoch:[0/2](199900/4588595) loss:3.358 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:42:49,812][model8_pretrain.py][INFO] Epoch:[0/2](200000/4588595) loss:3.042 lr:0.0000100 epoch_Time:27714.0min: [2024-01-03 14:42:49,813][model8_pretrain.py][INFO] Epoch:[0/2](200000/4588595) loss:2.620 lr:0.0000100 epoch_Time:27714.0min: [2024-01-03 14:42:49,813][model8_pretrain.py][INFO] Epoch:[0/2](200000/4588595) loss:2.506 lr:0.0000100 epoch_Time:27714.0min: [2024-01-03 14:42:49,813][model8_pretrain.py][INFO] Epoch:[0/2](200000/4588595) loss:3.261 lr:0.0000100 epoch_Time:27714.0min: [2024-01-03 14:42:49,813][model8_pretrain.py][INFO] Epoch:[0/2](200000/4588595) loss:3.155 lr:0.0000100 epoch_Time:27714.0min: [2024-01-03 14:42:49,813][model8_pretrain.py][INFO] Epoch:[0/2](200000/4588595) loss:3.134 lr:0.0000100 epoch_Time:27714.0min: [2024-01-03 14:42:49,813][model8_pretrain.py][INFO] Epoch:[0/2](200000/4588595) loss:2.941 lr:0.0000100 epoch_Time:27714.0min: [2024-01-03 14:42:49,813][model8_pretrain.py][INFO] Epoch:[0/2](200000/4588595) loss:3.428 lr:0.0000100 epoch_Time:27714.0min: [2024-01-03 14:43:06,553][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_200000.pth [2024-01-03 14:43:06,554][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_200000.pth [2024-01-03 14:43:06,557][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_200000.pth [2024-01-03 14:43:06,558][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_200000.pth [2024-01-03 
14:43:06,572][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_200000.pth [2024-01-03 14:43:06,598][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_200000.pth [2024-01-03 14:43:06,611][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_200000.pth [2024-01-03 14:43:06,615][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_200000.pth [2024-01-03 14:43:43,564][model8_pretrain.py][INFO] Epoch:[0/2](200100/4588595) loss:2.665 lr:0.0000100 epoch_Time:27720.0min: [2024-01-03 14:43:43,564][model8_pretrain.py][INFO] Epoch:[0/2](200100/4588595) loss:2.735 lr:0.0000100 epoch_Time:27720.0min: [2024-01-03 14:43:43,565][model8_pretrain.py][INFO] Epoch:[0/2](200100/4588595) loss:2.940 lr:0.0000100 epoch_Time:27720.0min: [2024-01-03 14:43:43,565][model8_pretrain.py][INFO] Epoch:[0/2](200100/4588595) loss:2.830 lr:0.0000100 epoch_Time:27720.0min: [2024-01-03 14:43:43,565][model8_pretrain.py][INFO] Epoch:[0/2](200100/4588595) loss:3.131 lr:0.0000100 epoch_Time:27720.0min: [2024-01-03 14:43:43,565][model8_pretrain.py][INFO] Epoch:[0/2](200100/4588595) loss:3.429 lr:0.0000100 epoch_Time:27720.0min: [2024-01-03 14:43:43,565][model8_pretrain.py][INFO] Epoch:[0/2](200100/4588595) loss:3.218 lr:0.0000100 epoch_Time:27720.0min: [2024-01-03 14:43:43,565][model8_pretrain.py][INFO] Epoch:[0/2](200100/4588595) loss:2.792 lr:0.0000100 epoch_Time:27720.0min: [2024-01-03 14:44:20,498][model8_pretrain.py][INFO] Epoch:[0/2](200200/4588595) loss:3.164 lr:0.0000100 epoch_Time:27719.0min: [2024-01-03 14:44:20,498][model8_pretrain.py][INFO] Epoch:[0/2](200200/4588595) loss:2.982 lr:0.0000100 epoch_Time:27719.0min: [2024-01-03 14:44:20,499][model8_pretrain.py][INFO] Epoch:[0/2](200200/4588595) loss:2.432 lr:0.0000100 epoch_Time:27719.0min: [2024-01-03 14:44:20,499][model8_pretrain.py][INFO] Epoch:[0/2](200200/4588595) loss:3.345 lr:0.0000100 epoch_Time:27719.0min: [2024-01-03 
14:44:20,499][model8_pretrain.py][INFO] Epoch:[0/2](200200/4588595) loss:2.923 lr:0.0000100 epoch_Time:27719.0min: [2024-01-03 14:44:20,499][model8_pretrain.py][INFO] Epoch:[0/2](200200/4588595) loss:2.811 lr:0.0000100 epoch_Time:27719.0min: [2024-01-03 14:44:20,499][model8_pretrain.py][INFO] Epoch:[0/2](200200/4588595) loss:3.435 lr:0.0000100 epoch_Time:27719.0min: [2024-01-03 14:44:20,499][model8_pretrain.py][INFO] Epoch:[0/2](200200/4588595) loss:2.952 lr:0.0000100 epoch_Time:27719.0min: [2024-01-03 14:44:57,432][model8_pretrain.py][INFO] Epoch:[0/2](200300/4588595) loss:3.051 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:44:57,432][model8_pretrain.py][INFO] Epoch:[0/2](200300/4588595) loss:2.441 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:44:57,432][model8_pretrain.py][INFO] Epoch:[0/2](200300/4588595) loss:2.914 lr:0.0000100 epoch_Time:27718.0min: [2024-01-03 14:44:57,432][model8_pretrain.py][INFO] Epoch:[0/2](200300/4588595) loss:3.512 lr:0.0000100 epoch_Time:27718.0min: [2024-01-03 14:44:57,432][model8_pretrain.py][INFO] Epoch:[0/2](200300/4588595) loss:2.512 lr:0.0000100 epoch_Time:27718.0min: [2024-01-03 14:44:57,432][model8_pretrain.py][INFO] Epoch:[0/2](200300/4588595) loss:2.615 lr:0.0000100 epoch_Time:27718.0min: [2024-01-03 14:44:57,432][model8_pretrain.py][INFO] Epoch:[0/2](200300/4588595) loss:2.938 lr:0.0000100 epoch_Time:27718.0min: [2024-01-03 14:44:57,432][model8_pretrain.py][INFO] Epoch:[0/2](200300/4588595) loss:3.295 lr:0.0000100 epoch_Time:27718.0min: [2024-01-03 14:45:34,364][model8_pretrain.py][INFO] Epoch:[0/2](200400/4588595) loss:3.227 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:45:34,364][model8_pretrain.py][INFO] Epoch:[0/2](200400/4588595) loss:3.075 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:45:34,364][model8_pretrain.py][INFO] Epoch:[0/2](200400/4588595) loss:2.739 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:45:34,364][model8_pretrain.py][INFO] Epoch:[0/2](200400/4588595) loss:3.327 lr:0.0000100 
epoch_Time:27717.0min: [2024-01-03 14:45:34,364][model8_pretrain.py][INFO] Epoch:[0/2](200400/4588595) loss:3.193 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:45:34,364][model8_pretrain.py][INFO] Epoch:[0/2](200400/4588595) loss:3.148 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:45:34,364][model8_pretrain.py][INFO] Epoch:[0/2](200400/4588595) loss:2.419 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:45:34,364][model8_pretrain.py][INFO] Epoch:[0/2](200400/4588595) loss:2.766 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:46:11,317][model8_pretrain.py][INFO] Epoch:[0/2](200500/4588595) loss:2.376 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:46:11,317][model8_pretrain.py][INFO] Epoch:[0/2](200500/4588595) loss:3.157 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:46:11,317][model8_pretrain.py][INFO] Epoch:[0/2](200500/4588595) loss:2.871 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:46:11,317][model8_pretrain.py][INFO] Epoch:[0/2](200500/4588595) loss:3.077 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:46:11,317][model8_pretrain.py][INFO] Epoch:[0/2](200500/4588595) loss:2.825 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:46:11,317][model8_pretrain.py][INFO] Epoch:[0/2](200500/4588595) loss:2.892 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:46:11,317][model8_pretrain.py][INFO] Epoch:[0/2](200500/4588595) loss:3.074 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:46:11,318][model8_pretrain.py][INFO] Epoch:[0/2](200500/4588595) loss:3.417 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:46:56,869][model8_pretrain.py][INFO] Epoch:[0/2](200600/4588595) loss:2.433 lr:0.0000100 epoch_Time:27718.0min: [2024-01-03 14:46:56,869][model8_pretrain.py][INFO] Epoch:[0/2](200600/4588595) loss:3.186 lr:0.0000100 epoch_Time:27718.0min: [2024-01-03 14:46:56,869][model8_pretrain.py][INFO] Epoch:[0/2](200600/4588595) loss:3.298 lr:0.0000100 epoch_Time:27718.0min: [2024-01-03 14:46:56,869][model8_pretrain.py][INFO] 
Epoch:[0/2](200600/4588595) loss:3.358 lr:0.0000100 epoch_Time:27718.0min: [2024-01-03 14:46:56,869][model8_pretrain.py][INFO] Epoch:[0/2](200600/4588595) loss:3.009 lr:0.0000100 epoch_Time:27718.0min: [2024-01-03 14:46:56,869][model8_pretrain.py][INFO] Epoch:[0/2](200600/4588595) loss:3.097 lr:0.0000100 epoch_Time:27718.0min: [2024-01-03 14:46:56,869][model8_pretrain.py][INFO] Epoch:[0/2](200600/4588595) loss:3.035 lr:0.0000100 epoch_Time:27718.0min: [2024-01-03 14:46:56,869][model8_pretrain.py][INFO] Epoch:[0/2](200600/4588595) loss:3.222 lr:0.0000100 epoch_Time:27718.0min: [2024-01-03 14:47:33,804][model8_pretrain.py][INFO] Epoch:[0/2](200700/4588595) loss:2.599 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:47:33,805][model8_pretrain.py][INFO] Epoch:[0/2](200700/4588595) loss:2.257 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:47:33,805][model8_pretrain.py][INFO] Epoch:[0/2](200700/4588595) loss:2.980 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:47:33,805][model8_pretrain.py][INFO] Epoch:[0/2](200700/4588595) loss:2.815 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:47:33,805][model8_pretrain.py][INFO] Epoch:[0/2](200700/4588595) loss:2.598 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:47:33,805][model8_pretrain.py][INFO] Epoch:[0/2](200700/4588595) loss:2.933 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:47:33,805][model8_pretrain.py][INFO] Epoch:[0/2](200700/4588595) loss:2.436 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:47:33,805][model8_pretrain.py][INFO] Epoch:[0/2](200700/4588595) loss:3.325 lr:0.0000100 epoch_Time:27717.0min: [2024-01-03 14:48:10,737][model8_pretrain.py][INFO] Epoch:[0/2](200800/4588595) loss:2.861 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:48:10,738][model8_pretrain.py][INFO] Epoch:[0/2](200800/4588595) loss:3.210 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:48:10,738][model8_pretrain.py][INFO] Epoch:[0/2](200800/4588595) loss:2.939 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 
14:48:10,738][model8_pretrain.py][INFO] Epoch:[0/2](200800/4588595) loss:3.084 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:48:10,738][model8_pretrain.py][INFO] Epoch:[0/2](200800/4588595) loss:3.126 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:48:10,738][model8_pretrain.py][INFO] Epoch:[0/2](200800/4588595) loss:2.889 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:48:10,738][model8_pretrain.py][INFO] Epoch:[0/2](200800/4588595) loss:3.054 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:48:10,738][model8_pretrain.py][INFO] Epoch:[0/2](200800/4588595) loss:2.757 lr:0.0000100 epoch_Time:27716.0min: [2024-01-03 14:48:47,668][model8_pretrain.py][INFO] Epoch:[0/2](200900/4588595) loss:2.763 lr:0.0000100 epoch_Time:27715.0min: [2024-01-03 14:48:47,668][model8_pretrain.py][INFO] Epoch:[0/2](200900/4588595) loss:2.884 lr:0.0000100 epoch_Time:27715.0min: [2024-01-03 14:48:47,668][model8_pretrain.py][INFO] Epoch:[0/2](200900/4588595) loss:3.196 lr:0.0000100 epoch_Time:27715.0min: [2024-01-03 14:48:47,668][model8_pretrain.py][INFO] Epoch:[0/2](200900/4588595) loss:2.849 lr:0.0000100 epoch_Time:27715.0min: [2024-01-03 14:48:47,668][model8_pretrain.py][INFO] Epoch:[0/2](200900/4588595) loss:2.806 lr:0.0000100 epoch_Time:27715.0min: [2024-01-03 14:48:47,668][model8_pretrain.py][INFO] Epoch:[0/2](200900/4588595) loss:3.319 lr:0.0000100 epoch_Time:27715.0min: [2024-01-03 14:48:47,668][model8_pretrain.py][INFO] Epoch:[0/2](200900/4588595) loss:3.006 lr:0.0000100 epoch_Time:27715.0min: [2024-01-03 14:48:47,668][model8_pretrain.py][INFO] Epoch:[0/2](200900/4588595) loss:3.219 lr:0.0000100 epoch_Time:27715.0min: [2024-01-03 14:49:24,605][model8_pretrain.py][INFO] Epoch:[0/2](201000/4588595) loss:2.545 lr:0.0000100 epoch_Time:27714.0min: [2024-01-03 14:49:24,605][model8_pretrain.py][INFO] Epoch:[0/2](201000/4588595) loss:2.494 lr:0.0000100 epoch_Time:27714.0min: [2024-01-03 14:49:24,605][model8_pretrain.py][INFO] Epoch:[0/2](201000/4588595) loss:3.014 lr:0.0000100 
epoch_Time:27714.0min: [2024-01-03 14:49:24,605][model8_pretrain.py][INFO] Epoch:[0/2](201000/4588595) loss:2.405 lr:0.0000100 epoch_Time:27714.0min: [2024-01-03 14:49:24,605][model8_pretrain.py][INFO] Epoch:[0/2](201000/4588595) loss:2.908 lr:0.0000100 epoch_Time:27714.0min: [2024-01-03 14:49:24,605][model8_pretrain.py][INFO] Epoch:[0/2](201000/4588595) loss:2.905 lr:0.0000100 epoch_Time:27714.0min: [2024-01-03 14:49:24,605][model8_pretrain.py][INFO] Epoch:[0/2](201000/4588595) loss:2.873 lr:0.0000100 epoch_Time:27714.0min: [2024-01-03 14:49:24,606][model8_pretrain.py][INFO] Epoch:[0/2](201000/4588595) loss:3.129 lr:0.0000100 epoch_Time:27714.0min: [2024-01-03 14:50:01,550][model8_pretrain.py][INFO] Epoch:[0/2](201100/4588595) loss:3.251 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:50:01,550][model8_pretrain.py][INFO] Epoch:[0/2](201100/4588595) loss:2.605 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:50:01,550][model8_pretrain.py][INFO] Epoch:[0/2](201100/4588595) loss:3.238 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:50:01,550][model8_pretrain.py][INFO] Epoch:[0/2](201100/4588595) loss:2.695 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:50:01,550][model8_pretrain.py][INFO] Epoch:[0/2](201100/4588595) loss:3.302 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:50:01,550][model8_pretrain.py][INFO] Epoch:[0/2](201100/4588595) loss:2.755 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:50:01,550][model8_pretrain.py][INFO] Epoch:[0/2](201100/4588595) loss:2.494 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:50:01,550][model8_pretrain.py][INFO] Epoch:[0/2](201100/4588595) loss:2.891 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:50:38,510][model8_pretrain.py][INFO] Epoch:[0/2](201200/4588595) loss:2.733 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:50:38,510][model8_pretrain.py][INFO] Epoch:[0/2](201200/4588595) loss:3.292 lr:0.0000100 epoch_Time:27712.0min: [2024-01-03 14:50:38,510][model8_pretrain.py][INFO] 
Epoch:[0/2](201200/4588595) loss:3.059 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:50:38,510][model8_pretrain.py][INFO] Epoch:[0/2](201200/4588595) loss:3.140 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:50:38,510][model8_pretrain.py][INFO] Epoch:[0/2](201200/4588595) loss:3.371 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:50:38,510][model8_pretrain.py][INFO] Epoch:[0/2](201200/4588595) loss:3.417 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:50:38,510][model8_pretrain.py][INFO] Epoch:[0/2](201200/4588595) loss:3.036 lr:0.0000100 epoch_Time:27712.0min: [2024-01-03 14:50:38,510][model8_pretrain.py][INFO] Epoch:[0/2](201200/4588595) loss:2.959 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:51:15,511][model8_pretrain.py][INFO] Epoch:[0/2](201300/4588595) loss:2.950 lr:0.0000100 epoch_Time:27711.0min: [2024-01-03 14:51:15,511][model8_pretrain.py][INFO] Epoch:[0/2](201300/4588595) loss:2.540 lr:0.0000100 epoch_Time:27711.0min: [2024-01-03 14:51:15,511][model8_pretrain.py][INFO] Epoch:[0/2](201300/4588595) loss:2.905 lr:0.0000100 epoch_Time:27711.0min: [2024-01-03 14:51:15,511][model8_pretrain.py][INFO] Epoch:[0/2](201300/4588595) loss:2.997 lr:0.0000100 epoch_Time:27711.0min: [2024-01-03 14:51:15,511][model8_pretrain.py][INFO] Epoch:[0/2](201300/4588595) loss:3.239 lr:0.0000100 epoch_Time:27711.0min: [2024-01-03 14:51:15,511][model8_pretrain.py][INFO] Epoch:[0/2](201300/4588595) loss:2.734 lr:0.0000100 epoch_Time:27711.0min: [2024-01-03 14:51:15,511][model8_pretrain.py][INFO] Epoch:[0/2](201300/4588595) loss:2.581 lr:0.0000100 epoch_Time:27711.0min: [2024-01-03 14:51:15,511][model8_pretrain.py][INFO] Epoch:[0/2](201300/4588595) loss:2.971 lr:0.0000100 epoch_Time:27711.0min: [2024-01-03 14:52:01,204][model8_pretrain.py][INFO] Epoch:[0/2](201400/4588595) loss:3.132 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:52:01,204][model8_pretrain.py][INFO] Epoch:[0/2](201400/4588595) loss:3.049 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 
14:52:01,204][model8_pretrain.py][INFO] Epoch:[0/2](201400/4588595) loss:2.879 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:52:01,204][model8_pretrain.py][INFO] Epoch:[0/2](201400/4588595) loss:2.815 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:52:01,204][model8_pretrain.py][INFO] Epoch:[0/2](201400/4588595) loss:3.073 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:52:01,204][model8_pretrain.py][INFO] Epoch:[0/2](201400/4588595) loss:2.912 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:52:01,204][model8_pretrain.py][INFO] Epoch:[0/2](201400/4588595) loss:3.246 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:52:01,204][model8_pretrain.py][INFO] Epoch:[0/2](201400/4588595) loss:3.279 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:52:38,137][model8_pretrain.py][INFO] Epoch:[0/2](201500/4588595) loss:3.308 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:52:38,137][model8_pretrain.py][INFO] Epoch:[0/2](201500/4588595) loss:2.419 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:52:38,137][model8_pretrain.py][INFO] Epoch:[0/2](201500/4588595) loss:3.081 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:52:38,137][model8_pretrain.py][INFO] Epoch:[0/2](201500/4588595) loss:3.247 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:52:38,137][model8_pretrain.py][INFO] Epoch:[0/2](201500/4588595) loss:2.928 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:52:38,137][model8_pretrain.py][INFO] Epoch:[0/2](201500/4588595) loss:3.063 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:52:38,137][model8_pretrain.py][INFO] Epoch:[0/2](201500/4588595) loss:3.111 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:52:38,137][model8_pretrain.py][INFO] Epoch:[0/2](201500/4588595) loss:2.899 lr:0.0000100 epoch_Time:27713.0min: [2024-01-03 14:53:15,093][model8_pretrain.py][INFO] Epoch:[0/2](201600/4588595) loss:3.208 lr:0.0000100 epoch_Time:27711.0min: [2024-01-03 14:53:15,093][model8_pretrain.py][INFO] Epoch:[0/2](201600/4588595) loss:2.604 lr:0.0000100 
epoch_Time:27711.0min: [2024-01-03 14:53:15,094][model8_pretrain.py][INFO] Epoch:[0/2](201600/4588595) loss:2.829 lr:0.0000100 epoch_Time:27711.0min: [2024-01-03 14:53:15,094][model8_pretrain.py][INFO] Epoch:[0/2](201600/4588595) loss:3.084 lr:0.0000100 epoch_Time:27711.0min: [2024-01-03 14:53:15,094][model8_pretrain.py][INFO] Epoch:[0/2](201600/4588595) loss:3.313 lr:0.0000100 epoch_Time:27711.0min: [2024-01-03 14:53:15,094][model8_pretrain.py][INFO] Epoch:[0/2](201600/4588595) loss:3.046 lr:0.0000100 epoch_Time:27711.0min: [2024-01-03 14:53:15,094][model8_pretrain.py][INFO] Epoch:[0/2](201600/4588595) loss:3.102 lr:0.0000100 epoch_Time:27711.0min: [2024-01-03 14:53:15,094][model8_pretrain.py][INFO] Epoch:[0/2](201600/4588595) loss:3.155 lr:0.0000100 epoch_Time:27711.0min: [2024-01-03 14:53:52,008][model8_pretrain.py][INFO] Epoch:[0/2](201700/4588595) loss:2.615 lr:0.0000100 epoch_Time:27710.0min: [2024-01-03 14:53:52,008][model8_pretrain.py][INFO] Epoch:[0/2](201700/4588595) loss:2.803 lr:0.0000100 epoch_Time:27710.0min: [2024-01-03 14:53:52,008][model8_pretrain.py][INFO] Epoch:[0/2](201700/4588595) loss:3.124 lr:0.0000100 epoch_Time:27710.0min: [2024-01-03 14:53:52,008][model8_pretrain.py][INFO] Epoch:[0/2](201700/4588595) loss:3.508 lr:0.0000100 epoch_Time:27710.0min: [2024-01-03 14:53:52,008][model8_pretrain.py][INFO] Epoch:[0/2](201700/4588595) loss:3.081 lr:0.0000100 epoch_Time:27710.0min: [2024-01-03 14:53:52,009][model8_pretrain.py][INFO] Epoch:[0/2](201700/4588595) loss:2.777 lr:0.0000100 epoch_Time:27710.0min: [2024-01-03 14:53:52,009][model8_pretrain.py][INFO] Epoch:[0/2](201700/4588595) loss:2.480 lr:0.0000100 epoch_Time:27710.0min: [2024-01-03 14:53:52,009][model8_pretrain.py][INFO] Epoch:[0/2](201700/4588595) loss:3.374 lr:0.0000100 epoch_Time:27710.0min: [2024-01-03 14:54:28,946][model8_pretrain.py][INFO] Epoch:[0/2](201800/4588595) loss:3.059 lr:0.0000100 epoch_Time:27710.0min: [2024-01-03 14:54:28,946][model8_pretrain.py][INFO] 
Epoch:[0/2](201800/4588595) loss:2.747 lr:0.0000100 epoch_Time:27710.0min: [2024-01-03 14:54:28,946][model8_pretrain.py][INFO] Epoch:[0/2](201800/4588595) loss:3.127 lr:0.0000100 epoch_Time:27710.0min: [2024-01-03 14:54:28,946][model8_pretrain.py][INFO] Epoch:[0/2](201800/4588595) loss:2.853 lr:0.0000100 epoch_Time:27710.0min: [2024-01-03 14:54:28,946][model8_pretrain.py][INFO] Epoch:[0/2](201800/4588595) loss:3.006 lr:0.0000100 epoch_Time:27710.0min: [2024-01-03 14:54:28,946][model8_pretrain.py][INFO] Epoch:[0/2](201800/4588595) loss:3.029 lr:0.0000100 epoch_Time:27710.0min: [2024-01-03 14:54:28,946][model8_pretrain.py][INFO] Epoch:[0/2](201800/4588595) loss:3.185 lr:0.0000100 epoch_Time:27710.0min: [2024-01-03 14:54:28,946][model8_pretrain.py][INFO] Epoch:[0/2](201800/4588595) loss:2.608 lr:0.0000100 epoch_Time:27710.0min: [2024-01-03 14:55:05,881][model8_pretrain.py][INFO] Epoch:[0/2](201900/4588595) loss:2.707 lr:0.0000100 epoch_Time:27708.0min: [2024-01-03 14:55:05,881][model8_pretrain.py][INFO] Epoch:[0/2](201900/4588595) loss:3.275 lr:0.0000100 epoch_Time:27708.0min: [2024-01-03 14:55:05,881][model8_pretrain.py][INFO] Epoch:[0/2](201900/4588595) loss:3.034 lr:0.0000100 epoch_Time:27708.0min: [2024-01-03 14:55:05,881][model8_pretrain.py][INFO] Epoch:[0/2](201900/4588595) loss:2.938 lr:0.0000100 epoch_Time:27708.0min: [2024-01-03 14:55:05,881][model8_pretrain.py][INFO] Epoch:[0/2](201900/4588595) loss:2.553 lr:0.0000100 epoch_Time:27708.0min: [2024-01-03 14:55:05,881][model8_pretrain.py][INFO] Epoch:[0/2](201900/4588595) loss:3.243 lr:0.0000100 epoch_Time:27708.0min: [2024-01-03 14:55:05,881][model8_pretrain.py][INFO] Epoch:[0/2](201900/4588595) loss:3.186 lr:0.0000100 epoch_Time:27708.0min: [2024-01-03 14:55:05,881][model8_pretrain.py][INFO] Epoch:[0/2](201900/4588595) loss:3.172 lr:0.0000100 epoch_Time:27708.0min: [2024-01-03 14:55:42,813][model8_pretrain.py][INFO] Epoch:[0/2](202000/4588595) loss:2.760 lr:0.0000100 epoch_Time:27708.0min: [2024-01-03 
14:55:42,813][model8_pretrain.py][INFO] Epoch:[0/2](202000/4588595) loss:2.919 lr:0.0000100 epoch_Time:27708.0min: [2024-01-03 14:55:42,813][model8_pretrain.py][INFO] Epoch:[0/2](202000/4588595) loss:2.936 lr:0.0000100 epoch_Time:27708.0min: [2024-01-03 14:55:42,813][model8_pretrain.py][INFO] Epoch:[0/2](202000/4588595) loss:2.692 lr:0.0000100 epoch_Time:27708.0min: [2024-01-03 14:55:42,813][model8_pretrain.py][INFO] Epoch:[0/2](202000/4588595) loss:3.165 lr:0.0000100 epoch_Time:27708.0min: [2024-01-03 14:55:42,813][model8_pretrain.py][INFO] Epoch:[0/2](202000/4588595) loss:3.231 lr:0.0000100 epoch_Time:27708.0min: [2024-01-03 14:55:42,813][model8_pretrain.py][INFO] Epoch:[0/2](202000/4588595) loss:3.176 lr:0.0000100 epoch_Time:27708.0min: [2024-01-03 14:55:42,813][model8_pretrain.py][INFO] Epoch:[0/2](202000/4588595) loss:2.919 lr:0.0000100 epoch_Time:27708.0min: [2024-01-03 14:56:19,766][model8_pretrain.py][INFO] Epoch:[0/2](202100/4588595) loss:2.711 lr:0.0000100 epoch_Time:27707.0min: [2024-01-03 14:56:19,766][model8_pretrain.py][INFO] Epoch:[0/2](202100/4588595) loss:2.933 lr:0.0000100 epoch_Time:27707.0min: [2024-01-03 14:56:19,766][model8_pretrain.py][INFO] Epoch:[0/2](202100/4588595) loss:2.864 lr:0.0000100 epoch_Time:27707.0min: [2024-01-03 14:56:19,766][model8_pretrain.py][INFO] Epoch:[0/2](202100/4588595) loss:3.202 lr:0.0000100 epoch_Time:27707.0min: [2024-01-03 14:56:19,766][model8_pretrain.py][INFO] Epoch:[0/2](202100/4588595) loss:2.789 lr:0.0000100 epoch_Time:27707.0min: [2024-01-03 14:56:19,766][model8_pretrain.py][INFO] Epoch:[0/2](202100/4588595) loss:2.942 lr:0.0000100 epoch_Time:27707.0min: [2024-01-03 14:56:19,766][model8_pretrain.py][INFO] Epoch:[0/2](202100/4588595) loss:3.068 lr:0.0000100 epoch_Time:27707.0min: [2024-01-03 14:56:19,767][model8_pretrain.py][INFO] Epoch:[0/2](202100/4588595) loss:3.230 lr:0.0000100 epoch_Time:27707.0min: [2024-01-03 14:57:05,421][model8_pretrain.py][INFO] Epoch:[0/2](202200/4588595) loss:2.725 lr:0.0000100 
epoch_Time:27709.0min: [2024-01-03 14:57:05,421][model8_pretrain.py][INFO] Epoch:[0/2](202200/4588595) loss:2.876 lr:0.0000100 epoch_Time:27708.0min: [2024-01-03 14:57:05,421][model8_pretrain.py][INFO] Epoch:[0/2](202200/4588595) loss:2.762 lr:0.0000100 epoch_Time:27708.0min: [2024-01-03 14:57:05,421][model8_pretrain.py][INFO] Epoch:[0/2](202200/4588595) loss:2.928 lr:0.0000100 epoch_Time:27709.0min: [2024-01-03 14:57:05,421][model8_pretrain.py][INFO] Epoch:[0/2](202200/4588595) loss:3.218 lr:0.0000100 epoch_Time:27709.0min: [2024-01-03 14:57:05,421][model8_pretrain.py][INFO] Epoch:[0/2](202200/4588595) loss:2.650 lr:0.0000100 epoch_Time:27709.0min: [2024-01-03 14:57:05,421][model8_pretrain.py][INFO] Epoch:[0/2](202200/4588595) loss:3.106 lr:0.0000100 epoch_Time:27709.0min: [2024-01-03 14:57:05,421][model8_pretrain.py][INFO] Epoch:[0/2](202200/4588595) loss:3.008 lr:0.0000100 epoch_Time:27709.0min: [2024-01-03 14:57:42,351][model8_pretrain.py][INFO] Epoch:[0/2](202300/4588595) loss:3.038 lr:0.0000100 epoch_Time:27708.0min: [2024-01-03 14:57:42,351][model8_pretrain.py][INFO] Epoch:[0/2](202300/4588595) loss:3.360 lr:0.0000100 epoch_Time:27708.0min: [2024-01-03 14:57:42,351][model8_pretrain.py][INFO] Epoch:[0/2](202300/4588595) loss:2.321 lr:0.0000100 epoch_Time:27708.0min: [2024-01-03 14:57:42,351][model8_pretrain.py][INFO] Epoch:[0/2](202300/4588595) loss:2.533 lr:0.0000100 epoch_Time:27708.0min: [2024-01-03 14:57:42,351][model8_pretrain.py][INFO] Epoch:[0/2](202300/4588595) loss:2.785 lr:0.0000100 epoch_Time:27708.0min: [2024-01-03 14:57:42,351][model8_pretrain.py][INFO] Epoch:[0/2](202300/4588595) loss:3.314 lr:0.0000100 epoch_Time:27708.0min: [2024-01-03 14:57:42,351][model8_pretrain.py][INFO] Epoch:[0/2](202300/4588595) loss:3.017 lr:0.0000100 epoch_Time:27708.0min: [2024-01-03 14:57:42,351][model8_pretrain.py][INFO] Epoch:[0/2](202300/4588595) loss:2.876 lr:0.0000100 epoch_Time:27708.0min: [2024-01-03 14:58:19,290][model8_pretrain.py][INFO] 
Epoch:[0/2](202400/4588595) loss:3.417 lr:0.0000100 epoch_Time:27707.0min: [2024-01-03 14:58:19,290][model8_pretrain.py][INFO] Epoch:[0/2](202400/4588595) loss:2.736 lr:0.0000100 epoch_Time:27707.0min: [2024-01-03 14:58:19,290][model8_pretrain.py][INFO] Epoch:[0/2](202400/4588595) loss:3.082 lr:0.0000100 epoch_Time:27707.0min: [2024-01-03 14:58:19,290][model8_pretrain.py][INFO] Epoch:[0/2](202400/4588595) loss:2.512 lr:0.0000100 epoch_Time:27707.0min: [2024-01-03 14:58:19,290][model8_pretrain.py][INFO] Epoch:[0/2](202400/4588595) loss:2.896 lr:0.0000100 epoch_Time:27707.0min: [2024-01-03 14:58:19,290][model8_pretrain.py][INFO] Epoch:[0/2](202400/4588595) loss:2.955 lr:0.0000100 epoch_Time:27707.0min: [2024-01-03 14:58:19,290][model8_pretrain.py][INFO] Epoch:[0/2](202400/4588595) loss:3.301 lr:0.0000100 epoch_Time:27707.0min: [2024-01-03 14:58:19,290][model8_pretrain.py][INFO] Epoch:[0/2](202400/4588595) loss:3.094 lr:0.0000100 epoch_Time:27707.0min: [2024-01-03 14:58:56,230][model8_pretrain.py][INFO] Epoch:[0/2](202500/4588595) loss:2.743 lr:0.0000100 epoch_Time:27705.0min: [2024-01-03 14:58:56,230][model8_pretrain.py][INFO] Epoch:[0/2](202500/4588595) loss:2.729 lr:0.0000100 epoch_Time:27705.0min: [2024-01-03 14:58:56,230][model8_pretrain.py][INFO] Epoch:[0/2](202500/4588595) loss:3.106 lr:0.0000100 epoch_Time:27705.0min: [2024-01-03 14:58:56,230][model8_pretrain.py][INFO] Epoch:[0/2](202500/4588595) loss:3.131 lr:0.0000100 epoch_Time:27705.0min: [2024-01-03 14:58:56,230][model8_pretrain.py][INFO] Epoch:[0/2](202500/4588595) loss:2.890 lr:0.0000100 epoch_Time:27705.0min: [2024-01-03 14:58:56,230][model8_pretrain.py][INFO] Epoch:[0/2](202500/4588595) loss:2.804 lr:0.0000100 epoch_Time:27705.0min: [2024-01-03 14:58:56,230][model8_pretrain.py][INFO] Epoch:[0/2](202500/4588595) loss:2.907 lr:0.0000100 epoch_Time:27705.0min: [2024-01-03 14:58:56,230][model8_pretrain.py][INFO] Epoch:[0/2](202500/4588595) loss:3.194 lr:0.0000100 epoch_Time:27705.0min: [2024-01-03 
14:59:33,168][model8_pretrain.py][INFO] Epoch:[0/2](202600/4588595) loss:3.003 lr:0.0000100 epoch_Time:27705.0min: [2024-01-03 14:59:33,168][model8_pretrain.py][INFO] Epoch:[0/2](202600/4588595) loss:2.980 lr:0.0000100 epoch_Time:27705.0min: [2024-01-03 14:59:33,168][model8_pretrain.py][INFO] Epoch:[0/2](202600/4588595) loss:3.058 lr:0.0000100 epoch_Time:27705.0min: [2024-01-03 14:59:33,168][model8_pretrain.py][INFO] Epoch:[0/2](202600/4588595) loss:2.902 lr:0.0000100 epoch_Time:27705.0min: [2024-01-03 14:59:33,168][model8_pretrain.py][INFO] Epoch:[0/2](202600/4588595) loss:3.136 lr:0.0000100 epoch_Time:27705.0min: [2024-01-03 14:59:33,168][model8_pretrain.py][INFO] Epoch:[0/2](202600/4588595) loss:3.341 lr:0.0000100 epoch_Time:27705.0min: [2024-01-03 14:59:33,168][model8_pretrain.py][INFO] Epoch:[0/2](202600/4588595) loss:2.880 lr:0.0000100 epoch_Time:27705.0min: [2024-01-03 14:59:33,169][model8_pretrain.py][INFO] Epoch:[0/2](202600/4588595) loss:2.796 lr:0.0000100 epoch_Time:27705.0min: [2024-01-03 15:00:10,122][model8_pretrain.py][INFO] Epoch:[0/2](202700/4588595) loss:2.980 lr:0.0000100 epoch_Time:27704.0min: [2024-01-03 15:00:10,122][model8_pretrain.py][INFO] Epoch:[0/2](202700/4588595) loss:2.974 lr:0.0000100 epoch_Time:27704.0min: [2024-01-03 15:00:10,122][model8_pretrain.py][INFO] Epoch:[0/2](202700/4588595) loss:2.617 lr:0.0000100 epoch_Time:27704.0min: [2024-01-03 15:00:10,122][model8_pretrain.py][INFO] Epoch:[0/2](202700/4588595) loss:2.439 lr:0.0000100 epoch_Time:27704.0min: [2024-01-03 15:00:10,122][model8_pretrain.py][INFO] Epoch:[0/2](202700/4588595) loss:3.214 lr:0.0000100 epoch_Time:27704.0min: [2024-01-03 15:00:10,122][model8_pretrain.py][INFO] Epoch:[0/2](202700/4588595) loss:2.635 lr:0.0000100 epoch_Time:27704.0min: [2024-01-03 15:00:10,122][model8_pretrain.py][INFO] Epoch:[0/2](202700/4588595) loss:2.615 lr:0.0000100 epoch_Time:27704.0min: [2024-01-03 15:00:10,122][model8_pretrain.py][INFO] Epoch:[0/2](202700/4588595) loss:3.113 lr:0.0000100 
epoch_Time:27704.0min: [2024-01-03 15:00:47,065][model8_pretrain.py][INFO] Epoch:[0/2](202800/4588595) loss:2.788 lr:0.0000100 epoch_Time:27703.0min: [2024-01-03 15:00:47,065][model8_pretrain.py][INFO] Epoch:[0/2](202800/4588595) loss:3.188 lr:0.0000100 epoch_Time:27703.0min: [2024-01-03 15:00:47,065][model8_pretrain.py][INFO] Epoch:[0/2](202800/4588595) loss:2.838 lr:0.0000100 epoch_Time:27703.0min: [2024-01-03 15:00:47,065][model8_pretrain.py][INFO] Epoch:[0/2](202800/4588595) loss:3.389 lr:0.0000100 epoch_Time:27703.0min: [2024-01-03 15:00:47,065][model8_pretrain.py][INFO] Epoch:[0/2](202800/4588595) loss:3.340 lr:0.0000100 epoch_Time:27703.0min: [2024-01-03 15:00:47,065][model8_pretrain.py][INFO] Epoch:[0/2](202800/4588595) loss:3.056 lr:0.0000100 epoch_Time:27703.0min: [2024-01-03 15:00:47,066][model8_pretrain.py][INFO] Epoch:[0/2](202800/4588595) loss:3.220 lr:0.0000100 epoch_Time:27703.0min: [2024-01-03 15:00:47,066][model8_pretrain.py][INFO] Epoch:[0/2](202800/4588595) loss:3.272 lr:0.0000100 epoch_Time:27703.0min: [2024-01-03 15:01:24,012][model8_pretrain.py][INFO] Epoch:[0/2](202900/4588595) loss:3.289 lr:0.0000100 epoch_Time:27702.0min: [2024-01-03 15:01:24,012][model8_pretrain.py][INFO] Epoch:[0/2](202900/4588595) loss:3.279 lr:0.0000100 epoch_Time:27702.0min: [2024-01-03 15:01:24,012][model8_pretrain.py][INFO] Epoch:[0/2](202900/4588595) loss:2.382 lr:0.0000100 epoch_Time:27702.0min: [2024-01-03 15:01:24,012][model8_pretrain.py][INFO] Epoch:[0/2](202900/4588595) loss:2.856 lr:0.0000100 epoch_Time:27702.0min: [2024-01-03 15:01:24,012][model8_pretrain.py][INFO] Epoch:[0/2](202900/4588595) loss:2.791 lr:0.0000100 epoch_Time:27702.0min: [2024-01-03 15:01:24,012][model8_pretrain.py][INFO] Epoch:[0/2](202900/4588595) loss:2.878 lr:0.0000100 epoch_Time:27702.0min: [2024-01-03 15:01:24,012][model8_pretrain.py][INFO] Epoch:[0/2](202900/4588595) loss:2.711 lr:0.0000100 epoch_Time:27702.0min: [2024-01-03 15:01:24,012][model8_pretrain.py][INFO] 
Epoch:[0/2](202900/4588595) loss:3.198 lr:0.0000100 epoch_Time:27702.0min: [2024-01-03 15:02:09,663][model8_pretrain.py][INFO] Epoch:[0/2](203000/4588595) loss:3.416 lr:0.0000100 epoch_Time:27704.0min: [2024-01-03 15:02:09,663][model8_pretrain.py][INFO] Epoch:[0/2](203000/4588595) loss:3.282 lr:0.0000100 epoch_Time:27704.0min: [2024-01-03 15:02:09,663][model8_pretrain.py][INFO] Epoch:[0/2](203000/4588595) loss:2.892 lr:0.0000100 epoch_Time:27704.0min: [2024-01-03 15:02:09,663][model8_pretrain.py][INFO] Epoch:[0/2](203000/4588595) loss:3.153 lr:0.0000100 epoch_Time:27704.0min: [2024-01-03 15:02:09,663][model8_pretrain.py][INFO] Epoch:[0/2](203000/4588595) loss:2.980 lr:0.0000100 epoch_Time:27704.0min: [2024-01-03 15:02:09,663][model8_pretrain.py][INFO] Epoch:[0/2](203000/4588595) loss:2.709 lr:0.0000100 epoch_Time:27704.0min: [2024-01-03 15:02:09,663][model8_pretrain.py][INFO] Epoch:[0/2](203000/4588595) loss:2.904 lr:0.0000100 epoch_Time:27704.0min: [2024-01-03 15:02:09,663][model8_pretrain.py][INFO] Epoch:[0/2](203000/4588595) loss:2.561 lr:0.0000100 epoch_Time:27704.0min: [2024-01-03 15:02:46,640][model8_pretrain.py][INFO] Epoch:[0/2](203100/4588595) loss:3.163 lr:0.0000100 epoch_Time:27704.0min: [2024-01-03 15:02:46,640][model8_pretrain.py][INFO] Epoch:[0/2](203100/4588595) loss:3.019 lr:0.0000100 epoch_Time:27704.0min: [2024-01-03 15:02:46,641][model8_pretrain.py][INFO] Epoch:[0/2](203100/4588595) loss:2.436 lr:0.0000100 epoch_Time:27704.0min: [2024-01-03 15:02:46,641][model8_pretrain.py][INFO] Epoch:[0/2](203100/4588595) loss:2.831 lr:0.0000100 epoch_Time:27704.0min: [2024-01-03 15:02:46,641][model8_pretrain.py][INFO] Epoch:[0/2](203100/4588595) loss:2.561 lr:0.0000100 epoch_Time:27704.0min: [2024-01-03 15:02:46,641][model8_pretrain.py][INFO] Epoch:[0/2](203100/4588595) loss:2.858 lr:0.0000100 epoch_Time:27704.0min: [2024-01-03 15:02:46,641][model8_pretrain.py][INFO] Epoch:[0/2](203100/4588595) loss:2.944 lr:0.0000100 epoch_Time:27704.0min: [2024-01-03 
15:02:46,641][model8_pretrain.py][INFO] Epoch:[0/2](203100/4588595) loss:2.972 lr:0.0000100 epoch_Time:27704.0min: [2024-01-03 15:03:23,586][model8_pretrain.py][INFO] Epoch:[0/2](203200/4588595) loss:2.675 lr:0.0000100 epoch_Time:27702.0min: [2024-01-03 15:03:23,586][model8_pretrain.py][INFO] Epoch:[0/2](203200/4588595) loss:3.010 lr:0.0000100 epoch_Time:27702.0min: [2024-01-03 15:03:23,586][model8_pretrain.py][INFO] Epoch:[0/2](203200/4588595) loss:3.570 lr:0.0000100 epoch_Time:27702.0min: [2024-01-03 15:03:23,586][model8_pretrain.py][INFO] Epoch:[0/2](203200/4588595) loss:3.056 lr:0.0000100 epoch_Time:27702.0min: [2024-01-03 15:03:23,586][model8_pretrain.py][INFO] Epoch:[0/2](203200/4588595) loss:2.725 lr:0.0000100 epoch_Time:27702.0min: [2024-01-03 15:03:23,587][model8_pretrain.py][INFO] Epoch:[0/2](203200/4588595) loss:3.051 lr:0.0000100 epoch_Time:27702.0min: [2024-01-03 15:03:23,587][model8_pretrain.py][INFO] Epoch:[0/2](203200/4588595) loss:3.088 lr:0.0000100 epoch_Time:27702.0min: [2024-01-03 15:03:23,588][model8_pretrain.py][INFO] Epoch:[0/2](203200/4588595) loss:3.055 lr:0.0000100 epoch_Time:27702.0min: [2024-01-03 15:04:00,541][model8_pretrain.py][INFO] Epoch:[0/2](203300/4588595) loss:2.989 lr:0.0000100 epoch_Time:27701.0min: [2024-01-03 15:04:00,541][model8_pretrain.py][INFO] Epoch:[0/2](203300/4588595) loss:2.780 lr:0.0000100 epoch_Time:27701.0min: [2024-01-03 15:04:00,541][model8_pretrain.py][INFO] Epoch:[0/2](203300/4588595) loss:2.929 lr:0.0000100 epoch_Time:27701.0min: [2024-01-03 15:04:00,541][model8_pretrain.py][INFO] Epoch:[0/2](203300/4588595) loss:3.122 lr:0.0000100 epoch_Time:27701.0min: [2024-01-03 15:04:00,541][model8_pretrain.py][INFO] Epoch:[0/2](203300/4588595) loss:3.023 lr:0.0000100 epoch_Time:27701.0min: [2024-01-03 15:04:00,541][model8_pretrain.py][INFO] Epoch:[0/2](203300/4588595) loss:3.251 lr:0.0000100 epoch_Time:27701.0min: [2024-01-03 15:04:00,541][model8_pretrain.py][INFO] Epoch:[0/2](203300/4588595) loss:3.162 lr:0.0000100 
epoch_Time:27701.0min: [2024-01-03 15:04:00,541][model8_pretrain.py][INFO] Epoch:[0/2](203300/4588595) loss:2.717 lr:0.0000100 epoch_Time:27701.0min: [2024-01-03 15:04:37,488][model8_pretrain.py][INFO] Epoch:[0/2](203400/4588595) loss:3.084 lr:0.0000100 epoch_Time:27700.0min: [2024-01-03 15:04:37,488][model8_pretrain.py][INFO] Epoch:[0/2](203400/4588595) loss:2.726 lr:0.0000100 epoch_Time:27700.0min: [2024-01-03 15:04:37,488][model8_pretrain.py][INFO] Epoch:[0/2](203400/4588595) loss:3.327 lr:0.0000100 epoch_Time:27700.0min: [2024-01-03 15:04:37,488][model8_pretrain.py][INFO] Epoch:[0/2](203400/4588595) loss:3.158 lr:0.0000100 epoch_Time:27700.0min: [2024-01-03 15:04:37,488][model8_pretrain.py][INFO] Epoch:[0/2](203400/4588595) loss:2.602 lr:0.0000100 epoch_Time:27700.0min: [2024-01-03 15:04:37,488][model8_pretrain.py][INFO] Epoch:[0/2](203400/4588595) loss:2.959 lr:0.0000100 epoch_Time:27700.0min: [2024-01-03 15:04:37,488][model8_pretrain.py][INFO] Epoch:[0/2](203400/4588595) loss:3.164 lr:0.0000100 epoch_Time:27700.0min: [2024-01-03 15:04:37,488][model8_pretrain.py][INFO] Epoch:[0/2](203400/4588595) loss:3.147 lr:0.0000100 epoch_Time:27700.0min: [2024-01-03 15:05:14,423][model8_pretrain.py][INFO] Epoch:[0/2](203500/4588595) loss:2.791 lr:0.0000100 epoch_Time:27699.0min: [2024-01-03 15:05:14,423][model8_pretrain.py][INFO] Epoch:[0/2](203500/4588595) loss:3.194 lr:0.0000100 epoch_Time:27699.0min: [2024-01-03 15:05:14,423][model8_pretrain.py][INFO] Epoch:[0/2](203500/4588595) loss:3.182 lr:0.0000100 epoch_Time:27699.0min: [2024-01-03 15:05:14,423][model8_pretrain.py][INFO] Epoch:[0/2](203500/4588595) loss:2.928 lr:0.0000100 epoch_Time:27699.0min: [2024-01-03 15:05:14,423][model8_pretrain.py][INFO] Epoch:[0/2](203500/4588595) loss:2.726 lr:0.0000100 epoch_Time:27699.0min: [2024-01-03 15:05:14,423][model8_pretrain.py][INFO] Epoch:[0/2](203500/4588595) loss:2.949 lr:0.0000100 epoch_Time:27699.0min: [2024-01-03 15:05:14,423][model8_pretrain.py][INFO] 
Epoch:[0/2](203500/4588595) loss:3.063 lr:0.0000100 epoch_Time:27699.0min: [2024-01-03 15:05:14,423][model8_pretrain.py][INFO] Epoch:[0/2](203500/4588595) loss:2.931 lr:0.0000100 epoch_Time:27699.0min: [2024-01-03 15:05:51,372][model8_pretrain.py][INFO] Epoch:[0/2](203600/4588595) loss:3.201 lr:0.0000100 epoch_Time:27698.0min: [2024-01-03 15:05:51,372][model8_pretrain.py][INFO] Epoch:[0/2](203600/4588595) loss:2.593 lr:0.0000100 epoch_Time:27698.0min: [2024-01-03 15:05:51,372][model8_pretrain.py][INFO] Epoch:[0/2](203600/4588595) loss:2.300 lr:0.0000100 epoch_Time:27698.0min: [2024-01-03 15:05:51,372][model8_pretrain.py][INFO] Epoch:[0/2](203600/4588595) loss:2.910 lr:0.0000100 epoch_Time:27698.0min: [2024-01-03 15:05:51,372][model8_pretrain.py][INFO] Epoch:[0/2](203600/4588595) loss:2.967 lr:0.0000100 epoch_Time:27698.0min: [2024-01-03 15:05:51,372][model8_pretrain.py][INFO] Epoch:[0/2](203600/4588595) loss:3.405 lr:0.0000100 epoch_Time:27698.0min: [2024-01-03 15:05:51,372][model8_pretrain.py][INFO] Epoch:[0/2](203600/4588595) loss:3.133 lr:0.0000100 epoch_Time:27698.0min: [2024-01-03 15:05:51,372][model8_pretrain.py][INFO] Epoch:[0/2](203600/4588595) loss:3.225 lr:0.0000100 epoch_Time:27698.0min: [2024-01-03 15:06:28,333][model8_pretrain.py][INFO] Epoch:[0/2](203700/4588595) loss:2.283 lr:0.0000100 epoch_Time:27697.0min: [2024-01-03 15:06:28,333][model8_pretrain.py][INFO] Epoch:[0/2](203700/4588595) loss:3.084 lr:0.0000100 epoch_Time:27697.0min: [2024-01-03 15:06:28,333][model8_pretrain.py][INFO] Epoch:[0/2](203700/4588595) loss:2.891 lr:0.0000100 epoch_Time:27697.0min: [2024-01-03 15:06:28,333][model8_pretrain.py][INFO] Epoch:[0/2](203700/4588595) loss:2.574 lr:0.0000100 epoch_Time:27697.0min: [2024-01-03 15:06:28,333][model8_pretrain.py][INFO] Epoch:[0/2](203700/4588595) loss:2.814 lr:0.0000100 epoch_Time:27697.0min: [2024-01-03 15:06:28,333][model8_pretrain.py][INFO] Epoch:[0/2](203700/4588595) loss:3.031 lr:0.0000100 epoch_Time:27697.0min: [2024-01-03 
15:06:28,334][model8_pretrain.py][INFO] Epoch:[0/2](203700/4588595) loss:3.415 lr:0.0000100 epoch_Time:27697.0min: [2024-01-03 15:06:28,334][model8_pretrain.py][INFO] Epoch:[0/2](203700/4588595) loss:3.196 lr:0.0000100 epoch_Time:27697.0min: [2024-01-03 15:07:14,046][model8_pretrain.py][INFO] Epoch:[0/2](203800/4588595) loss:2.904 lr:0.0000100 epoch_Time:27699.0min: [2024-01-03 15:07:14,046][model8_pretrain.py][INFO] Epoch:[0/2](203800/4588595) loss:2.814 lr:0.0000100 epoch_Time:27699.0min: [2024-01-03 15:07:14,046][model8_pretrain.py][INFO] Epoch:[0/2](203800/4588595) loss:2.905 lr:0.0000100 epoch_Time:27699.0min: [2024-01-03 15:07:14,046][model8_pretrain.py][INFO] Epoch:[0/2](203800/4588595) loss:2.985 lr:0.0000100 epoch_Time:27699.0min: [2024-01-03 15:07:14,046][model8_pretrain.py][INFO] Epoch:[0/2](203800/4588595) loss:2.322 lr:0.0000100 epoch_Time:27699.0min: [2024-01-03 15:07:14,046][model8_pretrain.py][INFO] Epoch:[0/2](203800/4588595) loss:3.216 lr:0.0000100 epoch_Time:27699.0min: [2024-01-03 15:07:14,046][model8_pretrain.py][INFO] Epoch:[0/2](203800/4588595) loss:2.641 lr:0.0000100 epoch_Time:27699.0min: [2024-01-03 15:07:14,047][model8_pretrain.py][INFO] Epoch:[0/2](203800/4588595) loss:2.778 lr:0.0000100 epoch_Time:27699.0min: [2024-01-03 15:07:50,973][model8_pretrain.py][INFO] Epoch:[0/2](203900/4588595) loss:2.792 lr:0.0000100 epoch_Time:27698.0min: [2024-01-03 15:07:50,973][model8_pretrain.py][INFO] Epoch:[0/2](203900/4588595) loss:3.073 lr:0.0000100 epoch_Time:27698.0min: [2024-01-03 15:07:50,973][model8_pretrain.py][INFO] Epoch:[0/2](203900/4588595) loss:3.109 lr:0.0000100 epoch_Time:27698.0min: [2024-01-03 15:07:50,973][model8_pretrain.py][INFO] Epoch:[0/2](203900/4588595) loss:2.993 lr:0.0000100 epoch_Time:27698.0min: [2024-01-03 15:07:50,973][model8_pretrain.py][INFO] Epoch:[0/2](203900/4588595) loss:2.633 lr:0.0000100 epoch_Time:27698.0min: [2024-01-03 15:07:50,973][model8_pretrain.py][INFO] Epoch:[0/2](203900/4588595) loss:2.366 lr:0.0000100 
epoch_Time:27698.0min: [2024-01-03 15:07:50,973][model8_pretrain.py][INFO] Epoch:[0/2](203900/4588595) loss:2.770 lr:0.0000100 epoch_Time:27698.0min: [2024-01-03 15:07:50,973][model8_pretrain.py][INFO] Epoch:[0/2](203900/4588595) loss:2.686 lr:0.0000100 epoch_Time:27698.0min: [2024-01-03 15:08:27,902][model8_pretrain.py][INFO] Epoch:[0/2](204000/4588595) loss:2.831 lr:0.0000100 epoch_Time:27698.0min: [2024-01-03 15:08:27,902][model8_pretrain.py][INFO] Epoch:[0/2](204000/4588595) loss:2.849 lr:0.0000100 epoch_Time:27698.0min: [2024-01-03 15:08:27,902][model8_pretrain.py][INFO] Epoch:[0/2](204000/4588595) loss:2.417 lr:0.0000100 epoch_Time:27698.0min: [2024-01-03 15:08:27,902][model8_pretrain.py][INFO] Epoch:[0/2](204000/4588595) loss:3.285 lr:0.0000100 epoch_Time:27698.0min: [2024-01-03 15:08:27,902][model8_pretrain.py][INFO] Epoch:[0/2](204000/4588595) loss:3.462 lr:0.0000100 epoch_Time:27698.0min: [2024-01-03 15:08:27,902][model8_pretrain.py][INFO] Epoch:[0/2](204000/4588595) loss:3.006 lr:0.0000100 epoch_Time:27698.0min: [2024-01-03 15:08:27,902][model8_pretrain.py][INFO] Epoch:[0/2](204000/4588595) loss:2.912 lr:0.0000100 epoch_Time:27698.0min: [2024-01-03 15:08:27,902][model8_pretrain.py][INFO] Epoch:[0/2](204000/4588595) loss:3.187 lr:0.0000100 epoch_Time:27698.0min: [2024-01-03 15:09:04,839][model8_pretrain.py][INFO] Epoch:[0/2](204100/4588595) loss:2.433 lr:0.0000100 epoch_Time:27696.0min: [2024-01-03 15:09:04,839][model8_pretrain.py][INFO] Epoch:[0/2](204100/4588595) loss:2.359 lr:0.0000100 epoch_Time:27696.0min: [2024-01-03 15:09:04,839][model8_pretrain.py][INFO] Epoch:[0/2](204100/4588595) loss:3.086 lr:0.0000100 epoch_Time:27696.0min: [2024-01-03 15:09:04,839][model8_pretrain.py][INFO] Epoch:[0/2](204100/4588595) loss:2.595 lr:0.0000100 epoch_Time:27696.0min: [2024-01-03 15:09:04,839][model8_pretrain.py][INFO] Epoch:[0/2](204100/4588595) loss:3.080 lr:0.0000100 epoch_Time:27696.0min: [2024-01-03 15:09:04,839][model8_pretrain.py][INFO] 
Epoch:[0/2](204100/4588595) loss:2.895 lr:0.0000100 epoch_Time:27696.0min: [2024-01-03 15:09:04,839][model8_pretrain.py][INFO] Epoch:[0/2](204100/4588595) loss:2.968 lr:0.0000100 epoch_Time:27696.0min: [2024-01-03 15:09:04,839][model8_pretrain.py][INFO] Epoch:[0/2](204100/4588595) loss:3.343 lr:0.0000100 epoch_Time:27696.0min: [2024-01-03 15:09:41,768][model8_pretrain.py][INFO] Epoch:[0/2](204200/4588595) loss:2.539 lr:0.0000100 epoch_Time:27696.0min: [2024-01-03 15:09:41,768][model8_pretrain.py][INFO] Epoch:[0/2](204200/4588595) loss:2.849 lr:0.0000100 epoch_Time:27696.0min: [2024-01-03 15:09:41,768][model8_pretrain.py][INFO] Epoch:[0/2](204200/4588595) loss:2.432 lr:0.0000100 epoch_Time:27696.0min: [2024-01-03 15:09:41,768][model8_pretrain.py][INFO] Epoch:[0/2](204200/4588595) loss:2.983 lr:0.0000100 epoch_Time:27696.0min: [2024-01-03 15:09:41,768][model8_pretrain.py][INFO] Epoch:[0/2](204200/4588595) loss:3.097 lr:0.0000100 epoch_Time:27696.0min: [2024-01-03 15:09:41,768][model8_pretrain.py][INFO] Epoch:[0/2](204200/4588595) loss:2.552 lr:0.0000100 epoch_Time:27696.0min: [2024-01-03 15:09:41,768][model8_pretrain.py][INFO] Epoch:[0/2](204200/4588595) loss:2.613 lr:0.0000100 epoch_Time:27696.0min: [2024-01-03 15:09:41,768][model8_pretrain.py][INFO] Epoch:[0/2](204200/4588595) loss:3.048 lr:0.0000100 epoch_Time:27696.0min: [2024-01-03 15:10:18,701][model8_pretrain.py][INFO] Epoch:[0/2](204300/4588595) loss:2.514 lr:0.0000100 epoch_Time:27695.0min: [2024-01-03 15:10:18,701][model8_pretrain.py][INFO] Epoch:[0/2](204300/4588595) loss:3.051 lr:0.0000100 epoch_Time:27695.0min: [2024-01-03 15:10:18,701][model8_pretrain.py][INFO] Epoch:[0/2](204300/4588595) loss:2.603 lr:0.0000100 epoch_Time:27695.0min: [2024-01-03 15:10:18,701][model8_pretrain.py][INFO] Epoch:[0/2](204300/4588595) loss:2.914 lr:0.0000100 epoch_Time:27695.0min: [2024-01-03 15:10:18,701][model8_pretrain.py][INFO] Epoch:[0/2](204300/4588595) loss:3.094 lr:0.0000100 epoch_Time:27695.0min: [2024-01-03 
15:10:18,701][model8_pretrain.py][INFO] Epoch:[0/2](204300/4588595) loss:2.542 lr:0.0000100 epoch_Time:27695.0min: [2024-01-03 15:10:18,701][model8_pretrain.py][INFO] Epoch:[0/2](204300/4588595) loss:3.007 lr:0.0000100 epoch_Time:27695.0min: [2024-01-03 15:10:18,701][model8_pretrain.py][INFO] Epoch:[0/2](204300/4588595) loss:3.140 lr:0.0000100 epoch_Time:27695.0min: [2024-01-03 15:10:55,624][model8_pretrain.py][INFO] Epoch:[0/2](204400/4588595) loss:2.779 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:10:55,624][model8_pretrain.py][INFO] Epoch:[0/2](204400/4588595) loss:2.757 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:10:55,624][model8_pretrain.py][INFO] Epoch:[0/2](204400/4588595) loss:3.042 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:10:55,624][model8_pretrain.py][INFO] Epoch:[0/2](204400/4588595) loss:2.922 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:10:55,624][model8_pretrain.py][INFO] Epoch:[0/2](204400/4588595) loss:2.881 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:10:55,624][model8_pretrain.py][INFO] Epoch:[0/2](204400/4588595) loss:2.733 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:10:55,624][model8_pretrain.py][INFO] Epoch:[0/2](204400/4588595) loss:3.043 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:10:55,624][model8_pretrain.py][INFO] Epoch:[0/2](204400/4588595) loss:3.478 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:11:32,551][model8_pretrain.py][INFO] Epoch:[0/2](204500/4588595) loss:2.749 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:11:32,551][model8_pretrain.py][INFO] Epoch:[0/2](204500/4588595) loss:2.937 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:11:32,551][model8_pretrain.py][INFO] Epoch:[0/2](204500/4588595) loss:2.930 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:11:32,551][model8_pretrain.py][INFO] Epoch:[0/2](204500/4588595) loss:2.650 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:11:32,551][model8_pretrain.py][INFO] Epoch:[0/2](204500/4588595) loss:3.111 lr:0.0000100 
epoch_Time:27693.0min: [2024-01-03 15:11:32,551][model8_pretrain.py][INFO] Epoch:[0/2](204500/4588595) loss:3.043 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:11:32,551][model8_pretrain.py][INFO] Epoch:[0/2](204500/4588595) loss:3.172 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:11:32,551][model8_pretrain.py][INFO] Epoch:[0/2](204500/4588595) loss:2.596 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:12:18,235][model8_pretrain.py][INFO] Epoch:[0/2](204600/4588595) loss:3.102 lr:0.0000100 epoch_Time:27695.0min: [2024-01-03 15:12:18,235][model8_pretrain.py][INFO] Epoch:[0/2](204600/4588595) loss:3.390 lr:0.0000100 epoch_Time:27695.0min: [2024-01-03 15:12:18,235][model8_pretrain.py][INFO] Epoch:[0/2](204600/4588595) loss:2.921 lr:0.0000100 epoch_Time:27695.0min: [2024-01-03 15:12:18,235][model8_pretrain.py][INFO] Epoch:[0/2](204600/4588595) loss:2.650 lr:0.0000100 epoch_Time:27695.0min: [2024-01-03 15:12:18,235][model8_pretrain.py][INFO] Epoch:[0/2](204600/4588595) loss:2.839 lr:0.0000100 epoch_Time:27695.0min: [2024-01-03 15:12:18,235][model8_pretrain.py][INFO] Epoch:[0/2](204600/4588595) loss:3.136 lr:0.0000100 epoch_Time:27695.0min: [2024-01-03 15:12:18,235][model8_pretrain.py][INFO] Epoch:[0/2](204600/4588595) loss:2.841 lr:0.0000100 epoch_Time:27695.0min: [2024-01-03 15:12:18,235][model8_pretrain.py][INFO] Epoch:[0/2](204600/4588595) loss:2.798 lr:0.0000100 epoch_Time:27695.0min: [2024-01-03 15:12:55,157][model8_pretrain.py][INFO] Epoch:[0/2](204700/4588595) loss:2.732 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:12:55,157][model8_pretrain.py][INFO] Epoch:[0/2](204700/4588595) loss:3.180 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:12:55,157][model8_pretrain.py][INFO] Epoch:[0/2](204700/4588595) loss:3.244 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:12:55,157][model8_pretrain.py][INFO] Epoch:[0/2](204700/4588595) loss:2.524 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:12:55,157][model8_pretrain.py][INFO] 
Epoch:[0/2](204700/4588595) loss:3.037 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:12:55,157][model8_pretrain.py][INFO] Epoch:[0/2](204700/4588595) loss:2.455 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:12:55,157][model8_pretrain.py][INFO] Epoch:[0/2](204700/4588595) loss:2.130 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:12:55,158][model8_pretrain.py][INFO] Epoch:[0/2](204700/4588595) loss:3.156 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:13:32,103][model8_pretrain.py][INFO] Epoch:[0/2](204800/4588595) loss:3.261 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:13:32,104][model8_pretrain.py][INFO] Epoch:[0/2](204800/4588595) loss:3.325 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:13:32,104][model8_pretrain.py][INFO] Epoch:[0/2](204800/4588595) loss:2.622 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:13:32,104][model8_pretrain.py][INFO] Epoch:[0/2](204800/4588595) loss:2.619 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:13:32,104][model8_pretrain.py][INFO] Epoch:[0/2](204800/4588595) loss:2.838 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:13:32,104][model8_pretrain.py][INFO] Epoch:[0/2](204800/4588595) loss:2.851 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:13:32,104][model8_pretrain.py][INFO] Epoch:[0/2](204800/4588595) loss:2.473 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:13:32,104][model8_pretrain.py][INFO] Epoch:[0/2](204800/4588595) loss:3.054 lr:0.0000100 epoch_Time:27693.0min: [2024-01-03 15:14:09,058][model8_pretrain.py][INFO] Epoch:[0/2](204900/4588595) loss:2.736 lr:0.0000100 epoch_Time:27692.0min: [2024-01-03 15:14:09,058][model8_pretrain.py][INFO] Epoch:[0/2](204900/4588595) loss:3.212 lr:0.0000100 epoch_Time:27692.0min: [2024-01-03 15:14:09,058][model8_pretrain.py][INFO] Epoch:[0/2](204900/4588595) loss:3.492 lr:0.0000100 epoch_Time:27692.0min: [2024-01-03 15:14:09,058][model8_pretrain.py][INFO] Epoch:[0/2](204900/4588595) loss:2.947 lr:0.0000100 epoch_Time:27692.0min: [2024-01-03 
15:14:09,058][model8_pretrain.py][INFO] Epoch:[0/2](204900/4588595) loss:2.756 lr:0.0000100 epoch_Time:27692.0min: [2024-01-03 15:14:09,058][model8_pretrain.py][INFO] Epoch:[0/2](204900/4588595) loss:2.814 lr:0.0000100 epoch_Time:27692.0min: [2024-01-03 15:14:09,058][model8_pretrain.py][INFO] Epoch:[0/2](204900/4588595) loss:2.437 lr:0.0000100 epoch_Time:27692.0min: [2024-01-03 15:14:09,058][model8_pretrain.py][INFO] Epoch:[0/2](204900/4588595) loss:3.051 lr:0.0000100 epoch_Time:27692.0min: [2024-01-03 15:14:46,000][model8_pretrain.py][INFO] Epoch:[0/2](205000/4588595) loss:3.126 lr:0.0000100 epoch_Time:27691.0min: [2024-01-03 15:14:46,000][model8_pretrain.py][INFO] Epoch:[0/2](205000/4588595) loss:3.220 lr:0.0000100 epoch_Time:27691.0min: [2024-01-03 15:14:46,000][model8_pretrain.py][INFO] Epoch:[0/2](205000/4588595) loss:2.833 lr:0.0000100 epoch_Time:27691.0min: [2024-01-03 15:14:46,000][model8_pretrain.py][INFO] Epoch:[0/2](205000/4588595) loss:2.610 lr:0.0000100 epoch_Time:27691.0min: [2024-01-03 15:14:46,000][model8_pretrain.py][INFO] Epoch:[0/2](205000/4588595) loss:3.281 lr:0.0000100 epoch_Time:27691.0min: [2024-01-03 15:14:46,000][model8_pretrain.py][INFO] Epoch:[0/2](205000/4588595) loss:3.258 lr:0.0000100 epoch_Time:27691.0min: [2024-01-03 15:14:46,000][model8_pretrain.py][INFO] Epoch:[0/2](205000/4588595) loss:2.944 lr:0.0000100 epoch_Time:27691.0min: [2024-01-03 15:14:46,000][model8_pretrain.py][INFO] Epoch:[0/2](205000/4588595) loss:2.738 lr:0.0000100 epoch_Time:27691.0min: [2024-01-03 15:15:22,971][model8_pretrain.py][INFO] Epoch:[0/2](205100/4588595) loss:3.191 lr:0.0000100 epoch_Time:27690.0min: [2024-01-03 15:15:22,971][model8_pretrain.py][INFO] Epoch:[0/2](205100/4588595) loss:2.639 lr:0.0000100 epoch_Time:27690.0min: [2024-01-03 15:15:22,971][model8_pretrain.py][INFO] Epoch:[0/2](205100/4588595) loss:2.973 lr:0.0000100 epoch_Time:27690.0min: [2024-01-03 15:15:22,971][model8_pretrain.py][INFO] Epoch:[0/2](205100/4588595) loss:3.375 lr:0.0000100 
epoch_Time:27690.0min: [2024-01-03 15:15:22,971][model8_pretrain.py][INFO] Epoch:[0/2](205100/4588595) loss:3.359 lr:0.0000100 epoch_Time:27690.0min: [2024-01-03 15:15:22,971][model8_pretrain.py][INFO] Epoch:[0/2](205100/4588595) loss:2.602 lr:0.0000100 epoch_Time:27690.0min: [2024-01-03 15:15:22,971][model8_pretrain.py][INFO] Epoch:[0/2](205100/4588595) loss:3.078 lr:0.0000100 epoch_Time:27690.0min: [2024-01-03 15:15:22,971][model8_pretrain.py][INFO] Epoch:[0/2](205100/4588595) loss:2.828 lr:0.0000100 epoch_Time:27690.0min: [2024-01-03 15:15:59,929][model8_pretrain.py][INFO] Epoch:[0/2](205200/4588595) loss:3.344 lr:0.0000100 epoch_Time:27689.0min: [2024-01-03 15:15:59,929][model8_pretrain.py][INFO] Epoch:[0/2](205200/4588595) loss:2.903 lr:0.0000100 epoch_Time:27689.0min: [2024-01-03 15:15:59,929][model8_pretrain.py][INFO] Epoch:[0/2](205200/4588595) loss:2.680 lr:0.0000100 epoch_Time:27689.0min: [2024-01-03 15:15:59,929][model8_pretrain.py][INFO] Epoch:[0/2](205200/4588595) loss:3.319 lr:0.0000100 epoch_Time:27689.0min: [2024-01-03 15:15:59,930][model8_pretrain.py][INFO] Epoch:[0/2](205200/4588595) loss:2.648 lr:0.0000100 epoch_Time:27689.0min: [2024-01-03 15:15:59,930][model8_pretrain.py][INFO] Epoch:[0/2](205200/4588595) loss:2.657 lr:0.0000100 epoch_Time:27689.0min: [2024-01-03 15:15:59,930][model8_pretrain.py][INFO] Epoch:[0/2](205200/4588595) loss:3.153 lr:0.0000100 epoch_Time:27689.0min: [2024-01-03 15:15:59,930][model8_pretrain.py][INFO] Epoch:[0/2](205200/4588595) loss:2.978 lr:0.0000100 epoch_Time:27689.0min: [2024-01-03 15:16:36,887][model8_pretrain.py][INFO] Epoch:[0/2](205300/4588595) loss:3.048 lr:0.0000100 epoch_Time:27688.0min: [2024-01-03 15:16:36,887][model8_pretrain.py][INFO] Epoch:[0/2](205300/4588595) loss:2.910 lr:0.0000100 epoch_Time:27688.0min: [2024-01-03 15:16:36,887][model8_pretrain.py][INFO] Epoch:[0/2](205300/4588595) loss:3.413 lr:0.0000100 epoch_Time:27688.0min: [2024-01-03 15:16:36,887][model8_pretrain.py][INFO] 
Epoch:[0/2](205300/4588595) loss:3.122 lr:0.0000100 epoch_Time:27688.0min: [2024-01-03 15:16:36,887][model8_pretrain.py][INFO] Epoch:[0/2](205300/4588595) loss:3.270 lr:0.0000100 epoch_Time:27688.0min: [2024-01-03 15:16:36,887][model8_pretrain.py][INFO] Epoch:[0/2](205300/4588595) loss:2.736 lr:0.0000100 epoch_Time:27688.0min: [2024-01-03 15:16:36,887][model8_pretrain.py][INFO] Epoch:[0/2](205300/4588595) loss:3.227 lr:0.0000100 epoch_Time:27688.0min: [2024-01-03 15:16:36,887][model8_pretrain.py][INFO] Epoch:[0/2](205300/4588595) loss:3.289 lr:0.0000100 epoch_Time:27688.0min: [2024-01-03 15:17:22,447][model8_pretrain.py][INFO] Epoch:[0/2](205400/4588595) loss:3.045 lr:0.0000100 epoch_Time:27690.0min: [2024-01-03 15:17:22,447][model8_pretrain.py][INFO] Epoch:[0/2](205400/4588595) loss:2.788 lr:0.0000100 epoch_Time:27690.0min: [2024-01-03 15:17:22,447][model8_pretrain.py][INFO] Epoch:[0/2](205400/4588595) loss:2.657 lr:0.0000100 epoch_Time:27690.0min: [2024-01-03 15:17:22,447][model8_pretrain.py][INFO] Epoch:[0/2](205400/4588595) loss:3.286 lr:0.0000100 epoch_Time:27690.0min: [2024-01-03 15:17:22,447][model8_pretrain.py][INFO] Epoch:[0/2](205400/4588595) loss:3.132 lr:0.0000100 epoch_Time:27690.0min: [2024-01-03 15:17:22,447][model8_pretrain.py][INFO] Epoch:[0/2](205400/4588595) loss:3.006 lr:0.0000100 epoch_Time:27690.0min: [2024-01-03 15:17:22,447][model8_pretrain.py][INFO] Epoch:[0/2](205400/4588595) loss:3.166 lr:0.0000100 epoch_Time:27690.0min: [2024-01-03 15:17:22,447][model8_pretrain.py][INFO] Epoch:[0/2](205400/4588595) loss:2.959 lr:0.0000100 epoch_Time:27690.0min: [2024-01-03 15:17:59,378][model8_pretrain.py][INFO] Epoch:[0/2](205500/4588595) loss:3.104 lr:0.0000100 epoch_Time:27689.0min: [2024-01-03 15:17:59,378][model8_pretrain.py][INFO] Epoch:[0/2](205500/4588595) loss:3.072 lr:0.0000100 epoch_Time:27689.0min: [2024-01-03 15:17:59,378][model8_pretrain.py][INFO] Epoch:[0/2](205500/4588595) loss:3.020 lr:0.0000100 epoch_Time:27689.0min: [2024-01-03 
15:17:59,378][model8_pretrain.py][INFO] Epoch:[0/2](205500/4588595) loss:2.812 lr:0.0000100 epoch_Time:27689.0min: [2024-01-03 15:17:59,378][model8_pretrain.py][INFO] Epoch:[0/2](205500/4588595) loss:2.504 lr:0.0000100 epoch_Time:27689.0min: [2024-01-03 15:17:59,378][model8_pretrain.py][INFO] Epoch:[0/2](205500/4588595) loss:2.859 lr:0.0000100 epoch_Time:27689.0min: [2024-01-03 15:17:59,378][model8_pretrain.py][INFO] Epoch:[0/2](205500/4588595) loss:2.780 lr:0.0000100 epoch_Time:27689.0min: [2024-01-03 15:17:59,378][model8_pretrain.py][INFO] Epoch:[0/2](205500/4588595) loss:2.912 lr:0.0000100 epoch_Time:27689.0min: [2024-01-03 15:18:36,327][model8_pretrain.py][INFO] Epoch:[0/2](205600/4588595) loss:3.553 lr:0.0000100 epoch_Time:27688.0min: [2024-01-03 15:18:36,327][model8_pretrain.py][INFO] Epoch:[0/2](205600/4588595) loss:2.683 lr:0.0000100 epoch_Time:27688.0min: [2024-01-03 15:18:36,327][model8_pretrain.py][INFO] Epoch:[0/2](205600/4588595) loss:3.219 lr:0.0000100 epoch_Time:27688.0min: [2024-01-03 15:18:36,327][model8_pretrain.py][INFO] Epoch:[0/2](205600/4588595) loss:3.004 lr:0.0000100 epoch_Time:27688.0min: [2024-01-03 15:18:36,327][model8_pretrain.py][INFO] Epoch:[0/2](205600/4588595) loss:2.529 lr:0.0000100 epoch_Time:27688.0min: [2024-01-03 15:18:36,328][model8_pretrain.py][INFO] Epoch:[0/2](205600/4588595) loss:3.176 lr:0.0000100 epoch_Time:27688.0min: [2024-01-03 15:18:36,328][model8_pretrain.py][INFO] Epoch:[0/2](205600/4588595) loss:3.401 lr:0.0000100 epoch_Time:27688.0min: [2024-01-03 15:18:36,328][model8_pretrain.py][INFO] Epoch:[0/2](205600/4588595) loss:2.906 lr:0.0000100 epoch_Time:27688.0min: [2024-01-03 15:19:13,281][model8_pretrain.py][INFO] Epoch:[0/2](205700/4588595) loss:3.381 lr:0.0000100 epoch_Time:27687.0min: [2024-01-03 15:19:13,281][model8_pretrain.py][INFO] Epoch:[0/2](205700/4588595) loss:3.156 lr:0.0000100 epoch_Time:27687.0min: [2024-01-03 15:19:13,281][model8_pretrain.py][INFO] Epoch:[0/2](205700/4588595) loss:2.957 lr:0.0000100 
epoch_Time:27687.0min: [2024-01-03 15:19:13,281][model8_pretrain.py][INFO] Epoch:[0/2](205700/4588595) loss:2.986 lr:0.0000100 epoch_Time:27687.0min: [2024-01-03 15:19:13,281][model8_pretrain.py][INFO] Epoch:[0/2](205700/4588595) loss:2.022 lr:0.0000100 epoch_Time:27687.0min: [2024-01-03 15:19:13,281][model8_pretrain.py][INFO] Epoch:[0/2](205700/4588595) loss:2.561 lr:0.0000100 epoch_Time:27687.0min: [2024-01-03 15:19:13,282][model8_pretrain.py][INFO] Epoch:[0/2](205700/4588595) loss:2.780 lr:0.0000100 epoch_Time:27687.0min: [2024-01-03 15:19:13,282][model8_pretrain.py][INFO] Epoch:[0/2](205700/4588595) loss:3.318 lr:0.0000100 epoch_Time:27687.0min: [2024-01-03 15:19:50,234][model8_pretrain.py][INFO] Epoch:[0/2](205800/4588595) loss:2.581 lr:0.0000100 epoch_Time:27686.0min: [2024-01-03 15:19:50,235][model8_pretrain.py][INFO] Epoch:[0/2](205800/4588595) loss:3.082 lr:0.0000100 epoch_Time:27686.0min: [2024-01-03 15:19:50,235][model8_pretrain.py][INFO] Epoch:[0/2](205800/4588595) loss:2.785 lr:0.0000100 epoch_Time:27686.0min: [2024-01-03 15:19:50,235][model8_pretrain.py][INFO] Epoch:[0/2](205800/4588595) loss:2.978 lr:0.0000100 epoch_Time:27686.0min: [2024-01-03 15:19:50,235][model8_pretrain.py][INFO] Epoch:[0/2](205800/4588595) loss:3.144 lr:0.0000100 epoch_Time:27686.0min: [2024-01-03 15:19:50,235][model8_pretrain.py][INFO] Epoch:[0/2](205800/4588595) loss:3.104 lr:0.0000100 epoch_Time:27686.0min: [2024-01-03 15:19:50,235][model8_pretrain.py][INFO] Epoch:[0/2](205800/4588595) loss:3.288 lr:0.0000100 epoch_Time:27686.0min: [2024-01-03 15:19:50,235][model8_pretrain.py][INFO] Epoch:[0/2](205800/4588595) loss:3.255 lr:0.0000100 epoch_Time:27686.0min: [2024-01-03 15:20:27,179][model8_pretrain.py][INFO] Epoch:[0/2](205900/4588595) loss:3.005 lr:0.0000100 epoch_Time:27685.0min: [2024-01-03 15:20:27,179][model8_pretrain.py][INFO] Epoch:[0/2](205900/4588595) loss:2.907 lr:0.0000100 epoch_Time:27685.0min: [2024-01-03 15:20:27,179][model8_pretrain.py][INFO] 
Epoch:[0/2](205900/4588595) loss:2.948 lr:0.0000100 epoch_Time:27685.0min: [2024-01-03 15:20:27,179][model8_pretrain.py][INFO] Epoch:[0/2](205900/4588595) loss:2.599 lr:0.0000100 epoch_Time:27685.0min: [2024-01-03 15:20:27,179][model8_pretrain.py][INFO] Epoch:[0/2](205900/4588595) loss:3.472 lr:0.0000100 epoch_Time:27685.0min: [2024-01-03 15:20:27,179][model8_pretrain.py][INFO] Epoch:[0/2](205900/4588595) loss:2.650 lr:0.0000100 epoch_Time:27685.0min: [2024-01-03 15:20:27,179][model8_pretrain.py][INFO] Epoch:[0/2](205900/4588595) loss:3.388 lr:0.0000100 epoch_Time:27685.0min: [2024-01-03 15:20:27,179][model8_pretrain.py][INFO] Epoch:[0/2](205900/4588595) loss:3.265 lr:0.0000100 epoch_Time:27685.0min: [2024-01-03 15:21:04,130][model8_pretrain.py][INFO] Epoch:[0/2](206000/4588595) loss:2.498 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:21:04,130][model8_pretrain.py][INFO] Epoch:[0/2](206000/4588595) loss:3.350 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:21:04,130][model8_pretrain.py][INFO] Epoch:[0/2](206000/4588595) loss:2.957 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:21:04,130][model8_pretrain.py][INFO] Epoch:[0/2](206000/4588595) loss:2.491 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:21:04,130][model8_pretrain.py][INFO] Epoch:[0/2](206000/4588595) loss:3.365 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:21:04,130][model8_pretrain.py][INFO] Epoch:[0/2](206000/4588595) loss:3.288 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:21:04,130][model8_pretrain.py][INFO] Epoch:[0/2](206000/4588595) loss:2.515 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:21:04,130][model8_pretrain.py][INFO] Epoch:[0/2](206000/4588595) loss:3.090 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:21:41,085][model8_pretrain.py][INFO] Epoch:[0/2](206100/4588595) loss:3.140 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:21:41,085][model8_pretrain.py][INFO] Epoch:[0/2](206100/4588595) loss:2.682 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 
15:21:41,085][model8_pretrain.py][INFO] Epoch:[0/2](206100/4588595) loss:2.779 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:21:41,085][model8_pretrain.py][INFO] Epoch:[0/2](206100/4588595) loss:3.113 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:21:41,085][model8_pretrain.py][INFO] Epoch:[0/2](206100/4588595) loss:2.923 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:21:41,085][model8_pretrain.py][INFO] Epoch:[0/2](206100/4588595) loss:3.290 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:21:41,085][model8_pretrain.py][INFO] Epoch:[0/2](206100/4588595) loss:2.768 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:21:41,085][model8_pretrain.py][INFO] Epoch:[0/2](206100/4588595) loss:2.677 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:22:26,668][model8_pretrain.py][INFO] Epoch:[0/2](206200/4588595) loss:3.361 lr:0.0000100 epoch_Time:27685.0min: [2024-01-03 15:22:26,668][model8_pretrain.py][INFO] Epoch:[0/2](206200/4588595) loss:3.456 lr:0.0000100 epoch_Time:27685.0min: [2024-01-03 15:22:26,668][model8_pretrain.py][INFO] Epoch:[0/2](206200/4588595) loss:3.080 lr:0.0000100 epoch_Time:27685.0min: [2024-01-03 15:22:26,668][model8_pretrain.py][INFO] Epoch:[0/2](206200/4588595) loss:3.042 lr:0.0000100 epoch_Time:27685.0min: [2024-01-03 15:22:26,668][model8_pretrain.py][INFO] Epoch:[0/2](206200/4588595) loss:3.038 lr:0.0000100 epoch_Time:27685.0min: [2024-01-03 15:22:26,668][model8_pretrain.py][INFO] Epoch:[0/2](206200/4588595) loss:2.758 lr:0.0000100 epoch_Time:27685.0min: [2024-01-03 15:22:26,668][model8_pretrain.py][INFO] Epoch:[0/2](206200/4588595) loss:3.078 lr:0.0000100 epoch_Time:27685.0min: [2024-01-03 15:22:26,668][model8_pretrain.py][INFO] Epoch:[0/2](206200/4588595) loss:3.175 lr:0.0000100 epoch_Time:27685.0min: [2024-01-03 15:23:03,587][model8_pretrain.py][INFO] Epoch:[0/2](206300/4588595) loss:2.905 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:23:03,587][model8_pretrain.py][INFO] Epoch:[0/2](206300/4588595) loss:2.648 lr:0.0000100 
epoch_Time:27684.0min: [2024-01-03 15:23:03,587][model8_pretrain.py][INFO] Epoch:[0/2](206300/4588595) loss:3.217 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:23:03,587][model8_pretrain.py][INFO] Epoch:[0/2](206300/4588595) loss:2.613 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:23:03,587][model8_pretrain.py][INFO] Epoch:[0/2](206300/4588595) loss:2.422 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:23:03,587][model8_pretrain.py][INFO] Epoch:[0/2](206300/4588595) loss:3.091 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:23:03,587][model8_pretrain.py][INFO] Epoch:[0/2](206300/4588595) loss:2.682 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:23:03,587][model8_pretrain.py][INFO] Epoch:[0/2](206300/4588595) loss:2.293 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:23:40,529][model8_pretrain.py][INFO] Epoch:[0/2](206400/4588595) loss:2.901 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:23:40,529][model8_pretrain.py][INFO] Epoch:[0/2](206400/4588595) loss:2.284 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:23:40,529][model8_pretrain.py][INFO] Epoch:[0/2](206400/4588595) loss:3.340 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:23:40,529][model8_pretrain.py][INFO] Epoch:[0/2](206400/4588595) loss:3.195 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:23:40,529][model8_pretrain.py][INFO] Epoch:[0/2](206400/4588595) loss:2.986 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:23:40,529][model8_pretrain.py][INFO] Epoch:[0/2](206400/4588595) loss:2.928 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:23:40,529][model8_pretrain.py][INFO] Epoch:[0/2](206400/4588595) loss:3.341 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:23:40,529][model8_pretrain.py][INFO] Epoch:[0/2](206400/4588595) loss:3.273 lr:0.0000100 epoch_Time:27684.0min: [2024-01-03 15:24:17,469][model8_pretrain.py][INFO] Epoch:[0/2](206500/4588595) loss:2.280 lr:0.0000100 epoch_Time:27682.0min: [2024-01-03 15:24:17,469][model8_pretrain.py][INFO] 
Epoch:[0/2](206500/4588595) loss:2.470 lr:0.0000100 epoch_Time:27682.0min: [2024-01-03 15:24:17,469][model8_pretrain.py][INFO] Epoch:[0/2](206500/4588595) loss:2.881 lr:0.0000100 epoch_Time:27682.0min: [2024-01-03 15:24:17,469][model8_pretrain.py][INFO] Epoch:[0/2](206500/4588595) loss:3.070 lr:0.0000100 epoch_Time:27682.0min: [2024-01-03 15:24:17,469][model8_pretrain.py][INFO] Epoch:[0/2](206500/4588595) loss:2.751 lr:0.0000100 epoch_Time:27682.0min: [2024-01-03 15:24:17,469][model8_pretrain.py][INFO] Epoch:[0/2](206500/4588595) loss:2.996 lr:0.0000100 epoch_Time:27682.0min: [2024-01-03 15:24:17,469][model8_pretrain.py][INFO] Epoch:[0/2](206500/4588595) loss:2.641 lr:0.0000100 epoch_Time:27682.0min: [2024-01-03 15:24:17,469][model8_pretrain.py][INFO] Epoch:[0/2](206500/4588595) loss:3.228 lr:0.0000100 epoch_Time:27682.0min: [2024-01-03 15:24:54,414][model8_pretrain.py][INFO] Epoch:[0/2](206600/4588595) loss:3.365 lr:0.0000100 epoch_Time:27681.0min: [2024-01-03 15:24:54,414][model8_pretrain.py][INFO] Epoch:[0/2](206600/4588595) loss:3.206 lr:0.0000100 epoch_Time:27681.0min: [2024-01-03 15:24:54,414][model8_pretrain.py][INFO] Epoch:[0/2](206600/4588595) loss:3.137 lr:0.0000100 epoch_Time:27681.0min: [2024-01-03 15:24:54,414][model8_pretrain.py][INFO] Epoch:[0/2](206600/4588595) loss:3.264 lr:0.0000100 epoch_Time:27681.0min: [2024-01-03 15:24:54,414][model8_pretrain.py][INFO] Epoch:[0/2](206600/4588595) loss:2.612 lr:0.0000100 epoch_Time:27681.0min: [2024-01-03 15:24:54,414][model8_pretrain.py][INFO] Epoch:[0/2](206600/4588595) loss:3.246 lr:0.0000100 epoch_Time:27681.0min: [2024-01-03 15:24:54,414][model8_pretrain.py][INFO] Epoch:[0/2](206600/4588595) loss:3.604 lr:0.0000100 epoch_Time:27681.0min: [2024-01-03 15:24:54,415][model8_pretrain.py][INFO] Epoch:[0/2](206600/4588595) loss:2.781 lr:0.0000100 epoch_Time:27681.0min: [2024-01-03 15:25:31,364][model8_pretrain.py][INFO] Epoch:[0/2](206700/4588595) loss:2.895 lr:0.0000100 epoch_Time:27681.0min: [2024-01-03 
15:25:31,364][model8_pretrain.py][INFO] Epoch:[0/2](206700/4588595) loss:2.661 lr:0.0000100 epoch_Time:27681.0min: [2024-01-03 15:25:31,365][model8_pretrain.py][INFO] Epoch:[0/2](206700/4588595) loss:3.350 lr:0.0000100 epoch_Time:27681.0min: [2024-01-03 15:25:31,365][model8_pretrain.py][INFO] Epoch:[0/2](206700/4588595) loss:3.096 lr:0.0000100 epoch_Time:27681.0min: [2024-01-03 15:25:31,365][model8_pretrain.py][INFO] Epoch:[0/2](206700/4588595) loss:2.904 lr:0.0000100 epoch_Time:27681.0min: [2024-01-03 15:25:31,365][model8_pretrain.py][INFO] Epoch:[0/2](206700/4588595) loss:3.374 lr:0.0000100 epoch_Time:27681.0min: [2024-01-03 15:25:31,365][model8_pretrain.py][INFO] Epoch:[0/2](206700/4588595) loss:3.351 lr:0.0000100 epoch_Time:27681.0min: [2024-01-03 15:25:31,365][model8_pretrain.py][INFO] Epoch:[0/2](206700/4588595) loss:3.283 lr:0.0000100 epoch_Time:27681.0min: [2024-01-03 15:26:08,301][model8_pretrain.py][INFO] Epoch:[0/2](206800/4588595) loss:3.396 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:26:08,301][model8_pretrain.py][INFO] Epoch:[0/2](206800/4588595) loss:3.172 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:26:08,301][model8_pretrain.py][INFO] Epoch:[0/2](206800/4588595) loss:3.416 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:26:08,301][model8_pretrain.py][INFO] Epoch:[0/2](206800/4588595) loss:2.931 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:26:08,301][model8_pretrain.py][INFO] Epoch:[0/2](206800/4588595) loss:2.755 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:26:08,301][model8_pretrain.py][INFO] Epoch:[0/2](206800/4588595) loss:2.295 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:26:08,301][model8_pretrain.py][INFO] Epoch:[0/2](206800/4588595) loss:3.158 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:26:08,301][model8_pretrain.py][INFO] Epoch:[0/2](206800/4588595) loss:2.664 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:26:45,246][model8_pretrain.py][INFO] Epoch:[0/2](206900/4588595) loss:3.413 lr:0.0000100 
epoch_Time:27679.0min: [2024-01-03 15:26:45,246][model8_pretrain.py][INFO] Epoch:[0/2](206900/4588595) loss:3.035 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:26:45,246][model8_pretrain.py][INFO] Epoch:[0/2](206900/4588595) loss:2.658 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:26:45,246][model8_pretrain.py][INFO] Epoch:[0/2](206900/4588595) loss:2.090 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:26:45,246][model8_pretrain.py][INFO] Epoch:[0/2](206900/4588595) loss:2.726 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:26:45,246][model8_pretrain.py][INFO] Epoch:[0/2](206900/4588595) loss:2.159 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:26:45,246][model8_pretrain.py][INFO] Epoch:[0/2](206900/4588595) loss:3.507 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:26:45,246][model8_pretrain.py][INFO] Epoch:[0/2](206900/4588595) loss:2.892 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:27:29,224][model8_pretrain.py][INFO] Epoch:[0/2](207000/4588595) loss:2.283 lr:0.0000100 epoch_Time:27680.0min: [2024-01-03 15:27:29,224][model8_pretrain.py][INFO] Epoch:[0/2](207000/4588595) loss:3.156 lr:0.0000100 epoch_Time:27680.0min: [2024-01-03 15:27:29,224][model8_pretrain.py][INFO] Epoch:[0/2](207000/4588595) loss:3.315 lr:0.0000100 epoch_Time:27680.0min: [2024-01-03 15:27:29,224][model8_pretrain.py][INFO] Epoch:[0/2](207000/4588595) loss:3.029 lr:0.0000100 epoch_Time:27680.0min: [2024-01-03 15:27:29,224][model8_pretrain.py][INFO] Epoch:[0/2](207000/4588595) loss:2.952 lr:0.0000100 epoch_Time:27680.0min: [2024-01-03 15:27:29,224][model8_pretrain.py][INFO] Epoch:[0/2](207000/4588595) loss:2.803 lr:0.0000100 epoch_Time:27680.0min: [2024-01-03 15:27:29,224][model8_pretrain.py][INFO] Epoch:[0/2](207000/4588595) loss:3.278 lr:0.0000100 epoch_Time:27680.0min: [2024-01-03 15:27:30,903][model8_pretrain.py][INFO] Epoch:[0/2](207000/4588595) loss:2.874 lr:0.0000100 epoch_Time:27681.0min: [2024-01-03 15:28:07,831][model8_pretrain.py][INFO] 
Epoch:[0/2](207100/4588595) loss:3.220 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:28:07,831][model8_pretrain.py][INFO] Epoch:[0/2](207100/4588595) loss:3.117 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:28:07,831][model8_pretrain.py][INFO] Epoch:[0/2](207100/4588595) loss:2.827 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:28:07,832][model8_pretrain.py][INFO] Epoch:[0/2](207100/4588595) loss:2.904 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:28:07,832][model8_pretrain.py][INFO] Epoch:[0/2](207100/4588595) loss:2.784 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:28:07,831][model8_pretrain.py][INFO] Epoch:[0/2](207100/4588595) loss:2.964 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:28:07,831][model8_pretrain.py][INFO] Epoch:[0/2](207100/4588595) loss:3.025 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:28:07,832][model8_pretrain.py][INFO] Epoch:[0/2](207100/4588595) loss:3.101 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:28:44,775][model8_pretrain.py][INFO] Epoch:[0/2](207200/4588595) loss:2.634 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:28:44,775][model8_pretrain.py][INFO] Epoch:[0/2](207200/4588595) loss:3.314 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:28:44,775][model8_pretrain.py][INFO] Epoch:[0/2](207200/4588595) loss:2.922 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:28:44,775][model8_pretrain.py][INFO] Epoch:[0/2](207200/4588595) loss:3.127 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:28:44,775][model8_pretrain.py][INFO] Epoch:[0/2](207200/4588595) loss:3.248 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:28:44,775][model8_pretrain.py][INFO] Epoch:[0/2](207200/4588595) loss:2.775 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:28:44,775][model8_pretrain.py][INFO] Epoch:[0/2](207200/4588595) loss:2.716 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 15:28:44,775][model8_pretrain.py][INFO] Epoch:[0/2](207200/4588595) loss:2.965 lr:0.0000100 epoch_Time:27679.0min: [2024-01-03 
15:29:21,718][model8_pretrain.py][INFO] Epoch:[0/2](207300/4588595) loss:2.774 lr:0.0000100 epoch_Time:27678.0min: [2024-01-03 15:29:21,718][model8_pretrain.py][INFO] Epoch:[0/2](207300/4588595) loss:3.043 lr:0.0000100 epoch_Time:27678.0min: [2024-01-03 15:29:21,718][model8_pretrain.py][INFO] Epoch:[0/2](207300/4588595) loss:2.676 lr:0.0000100 epoch_Time:27678.0min: [2024-01-03 15:29:21,718][model8_pretrain.py][INFO] Epoch:[0/2](207300/4588595) loss:3.178 lr:0.0000100 epoch_Time:27678.0min: [2024-01-03 15:29:21,719][model8_pretrain.py][INFO] Epoch:[0/2](207300/4588595) loss:3.291 lr:0.0000100 epoch_Time:27678.0min: [2024-01-03 15:29:21,719][model8_pretrain.py][INFO] Epoch:[0/2](207300/4588595) loss:3.116 lr:0.0000100 epoch_Time:27678.0min: [2024-01-03 15:29:21,719][model8_pretrain.py][INFO] Epoch:[0/2](207300/4588595) loss:2.375 lr:0.0000100 epoch_Time:27678.0min: [2024-01-03 15:29:21,719][model8_pretrain.py][INFO] Epoch:[0/2](207300/4588595) loss:3.039 lr:0.0000100 epoch_Time:27678.0min: [2024-01-03 15:29:58,659][model8_pretrain.py][INFO] Epoch:[0/2](207400/4588595) loss:2.618 lr:0.0000100 epoch_Time:27676.0min: [2024-01-03 15:29:58,659][model8_pretrain.py][INFO] Epoch:[0/2](207400/4588595) loss:3.237 lr:0.0000100 epoch_Time:27676.0min: [2024-01-03 15:29:58,659][model8_pretrain.py][INFO] Epoch:[0/2](207400/4588595) loss:3.372 lr:0.0000100 epoch_Time:27676.0min: [2024-01-03 15:29:58,659][model8_pretrain.py][INFO] Epoch:[0/2](207400/4588595) loss:3.505 lr:0.0000100 epoch_Time:27676.0min: [2024-01-03 15:29:58,659][model8_pretrain.py][INFO] Epoch:[0/2](207400/4588595) loss:3.090 lr:0.0000100 epoch_Time:27676.0min: [2024-01-03 15:29:58,659][model8_pretrain.py][INFO] Epoch:[0/2](207400/4588595) loss:2.789 lr:0.0000100 epoch_Time:27676.0min: [2024-01-03 15:29:58,659][model8_pretrain.py][INFO] Epoch:[0/2](207400/4588595) loss:3.196 lr:0.0000100 epoch_Time:27676.0min: [2024-01-03 15:29:58,659][model8_pretrain.py][INFO] Epoch:[0/2](207400/4588595) loss:2.997 lr:0.0000100 
epoch_Time:27676.0min: [2024-01-03 15:30:35,603][model8_pretrain.py][INFO] Epoch:[0/2](207500/4588595) loss:2.994 lr:0.0000100 epoch_Time:27676.0min: [2024-01-03 15:30:35,603][model8_pretrain.py][INFO] Epoch:[0/2](207500/4588595) loss:2.914 lr:0.0000100 epoch_Time:27676.0min: [2024-01-03 15:30:35,604][model8_pretrain.py][INFO] Epoch:[0/2](207500/4588595) loss:2.115 lr:0.0000100 epoch_Time:27676.0min: [2024-01-03 15:30:35,604][model8_pretrain.py][INFO] Epoch:[0/2](207500/4588595) loss:3.211 lr:0.0000100 epoch_Time:27676.0min: [2024-01-03 15:30:35,604][model8_pretrain.py][INFO] Epoch:[0/2](207500/4588595) loss:3.005 lr:0.0000100 epoch_Time:27676.0min: [2024-01-03 15:30:35,604][model8_pretrain.py][INFO] Epoch:[0/2](207500/4588595) loss:3.154 lr:0.0000100 epoch_Time:27676.0min: [2024-01-03 15:30:35,604][model8_pretrain.py][INFO] Epoch:[0/2](207500/4588595) loss:3.010 lr:0.0000100 epoch_Time:27676.0min: [2024-01-03 15:30:35,604][model8_pretrain.py][INFO] Epoch:[0/2](207500/4588595) loss:2.832 lr:0.0000100 epoch_Time:27676.0min: [2024-01-03 15:31:12,552][model8_pretrain.py][INFO] Epoch:[0/2](207600/4588595) loss:2.715 lr:0.0000100 epoch_Time:27675.0min: [2024-01-03 15:31:12,552][model8_pretrain.py][INFO] Epoch:[0/2](207600/4588595) loss:2.957 lr:0.0000100 epoch_Time:27675.0min: [2024-01-03 15:31:12,552][model8_pretrain.py][INFO] Epoch:[0/2](207600/4588595) loss:3.045 lr:0.0000100 epoch_Time:27675.0min: [2024-01-03 15:31:12,552][model8_pretrain.py][INFO] Epoch:[0/2](207600/4588595) loss:3.166 lr:0.0000100 epoch_Time:27675.0min: [2024-01-03 15:31:12,552][model8_pretrain.py][INFO] Epoch:[0/2](207600/4588595) loss:2.871 lr:0.0000100 epoch_Time:27675.0min: [2024-01-03 15:31:12,552][model8_pretrain.py][INFO] Epoch:[0/2](207600/4588595) loss:3.244 lr:0.0000100 epoch_Time:27675.0min: [2024-01-03 15:31:12,552][model8_pretrain.py][INFO] Epoch:[0/2](207600/4588595) loss:2.752 lr:0.0000100 epoch_Time:27675.0min: [2024-01-03 15:31:12,552][model8_pretrain.py][INFO] 
Epoch:[0/2](207600/4588595) loss:2.909 lr:0.0000100 epoch_Time:27675.0min: [2024-01-03 15:31:49,465][model8_pretrain.py][INFO] Epoch:[0/2](207700/4588595) loss:3.114 lr:0.0000100 epoch_Time:27673.0min: [2024-01-03 15:31:49,465][model8_pretrain.py][INFO] Epoch:[0/2](207700/4588595) loss:2.525 lr:0.0000100 epoch_Time:27673.0min: [2024-01-03 15:31:49,465][model8_pretrain.py][INFO] Epoch:[0/2](207700/4588595) loss:2.642 lr:0.0000100 epoch_Time:27673.0min: [2024-01-03 15:31:49,465][model8_pretrain.py][INFO] Epoch:[0/2](207700/4588595) loss:3.275 lr:0.0000100 epoch_Time:27673.0min: [2024-01-03 15:31:49,465][model8_pretrain.py][INFO] Epoch:[0/2](207700/4588595) loss:3.014 lr:0.0000100 epoch_Time:27673.0min: [2024-01-03 15:31:49,465][model8_pretrain.py][INFO] Epoch:[0/2](207700/4588595) loss:2.747 lr:0.0000100 epoch_Time:27673.0min: [2024-01-03 15:31:49,465][model8_pretrain.py][INFO] Epoch:[0/2](207700/4588595) loss:3.265 lr:0.0000100 epoch_Time:27673.0min: [2024-01-03 15:31:49,465][model8_pretrain.py][INFO] Epoch:[0/2](207700/4588595) loss:2.752 lr:0.0000100 epoch_Time:27673.0min: [2024-01-03 15:32:33,400][model8_pretrain.py][INFO] Epoch:[0/2](207800/4588595) loss:2.742 lr:0.0000100 epoch_Time:27676.0min: [2024-01-03 15:32:33,400][model8_pretrain.py][INFO] Epoch:[0/2](207800/4588595) loss:3.100 lr:0.0000100 epoch_Time:27676.0min: [2024-01-03 15:32:33,400][model8_pretrain.py][INFO] Epoch:[0/2](207800/4588595) loss:2.800 lr:0.0000100 epoch_Time:27676.0min: [2024-01-03 15:32:33,400][model8_pretrain.py][INFO] Epoch:[0/2](207800/4588595) loss:2.662 lr:0.0000100 epoch_Time:27676.0min: [2024-01-03 15:32:33,400][model8_pretrain.py][INFO] Epoch:[0/2](207800/4588595) loss:3.286 lr:0.0000100 epoch_Time:27676.0min: [2024-01-03 15:32:33,400][model8_pretrain.py][INFO] Epoch:[0/2](207800/4588595) loss:3.468 lr:0.0000100 epoch_Time:27676.0min: [2024-01-03 15:32:33,400][model8_pretrain.py][INFO] Epoch:[0/2](207800/4588595) loss:2.818 lr:0.0000100 epoch_Time:27676.0min: [2024-01-03 
15:32:33,400][model8_pretrain.py][INFO] Epoch:[0/2](207800/4588595) loss:3.565 lr:0.0000100 epoch_Time:27676.0min: [2024-01-03 15:33:12,023][model8_pretrain.py][INFO] Epoch:[0/2](207900/4588595) loss:2.873 lr:0.0000100 epoch_Time:27675.0min: [2024-01-03 15:33:12,023][model8_pretrain.py][INFO] Epoch:[0/2](207900/4588595) loss:2.527 lr:0.0000100 epoch_Time:27675.0min: [2024-01-03 15:33:12,023][model8_pretrain.py][INFO] Epoch:[0/2](207900/4588595) loss:2.936 lr:0.0000100 epoch_Time:27675.0min: [2024-01-03 15:33:12,023][model8_pretrain.py][INFO] Epoch:[0/2](207900/4588595) loss:2.554 lr:0.0000100 epoch_Time:27675.0min: [2024-01-03 15:33:12,023][model8_pretrain.py][INFO] Epoch:[0/2](207900/4588595) loss:3.149 lr:0.0000100 epoch_Time:27675.0min: [2024-01-03 15:33:12,023][model8_pretrain.py][INFO] Epoch:[0/2](207900/4588595) loss:2.464 lr:0.0000100 epoch_Time:27675.0min: [2024-01-03 15:33:12,023][model8_pretrain.py][INFO] Epoch:[0/2](207900/4588595) loss:2.775 lr:0.0000100 epoch_Time:27675.0min: [2024-01-03 15:33:12,023][model8_pretrain.py][INFO] Epoch:[0/2](207900/4588595) loss:2.886 lr:0.0000100 epoch_Time:27675.0min: [2024-01-03 15:33:48,956][model8_pretrain.py][INFO] Epoch:[0/2](208000/4588595) loss:3.199 lr:0.0000100 epoch_Time:27673.0min: [2024-01-03 15:33:48,957][model8_pretrain.py][INFO] Epoch:[0/2](208000/4588595) loss:3.049 lr:0.0000100 epoch_Time:27673.0min: [2024-01-03 15:33:48,957][model8_pretrain.py][INFO] Epoch:[0/2](208000/4588595) loss:3.392 lr:0.0000100 epoch_Time:27673.0min: [2024-01-03 15:33:48,957][model8_pretrain.py][INFO] Epoch:[0/2](208000/4588595) loss:2.951 lr:0.0000100 epoch_Time:27673.0min: [2024-01-03 15:33:48,957][model8_pretrain.py][INFO] Epoch:[0/2](208000/4588595) loss:3.073 lr:0.0000100 epoch_Time:27673.0min: [2024-01-03 15:33:48,957][model8_pretrain.py][INFO] Epoch:[0/2](208000/4588595) loss:3.238 lr:0.0000100 epoch_Time:27673.0min: [2024-01-03 15:33:48,957][model8_pretrain.py][INFO] Epoch:[0/2](208000/4588595) loss:2.518 lr:0.0000100 
epoch_Time:27673.0min: [2024-01-03 15:33:48,957][model8_pretrain.py][INFO] Epoch:[0/2](208000/4588595) loss:2.780 lr:0.0000100 epoch_Time:27673.0min: [2024-01-03 15:34:25,895][model8_pretrain.py][INFO] Epoch:[0/2](208100/4588595) loss:2.971 lr:0.0000100 epoch_Time:27673.0min: [2024-01-03 15:34:25,896][model8_pretrain.py][INFO] Epoch:[0/2](208100/4588595) loss:3.111 lr:0.0000100 epoch_Time:27673.0min: [2024-01-03 15:34:25,896][model8_pretrain.py][INFO] Epoch:[0/2](208100/4588595) loss:3.091 lr:0.0000100 epoch_Time:27673.0min: [2024-01-03 15:34:25,895][model8_pretrain.py][INFO] Epoch:[0/2](208100/4588595) loss:2.779 lr:0.0000100 epoch_Time:27673.0min: [2024-01-03 15:34:25,896][model8_pretrain.py][INFO] Epoch:[0/2](208100/4588595) loss:3.118 lr:0.0000100 epoch_Time:27673.0min: [2024-01-03 15:34:25,896][model8_pretrain.py][INFO] Epoch:[0/2](208100/4588595) loss:2.739 lr:0.0000100 epoch_Time:27673.0min: [2024-01-03 15:34:25,896][model8_pretrain.py][INFO] Epoch:[0/2](208100/4588595) loss:2.944 lr:0.0000100 epoch_Time:27673.0min: [2024-01-03 15:34:25,896][model8_pretrain.py][INFO] Epoch:[0/2](208100/4588595) loss:3.428 lr:0.0000100 epoch_Time:27673.0min: [2024-01-03 15:35:02,830][model8_pretrain.py][INFO] Epoch:[0/2](208200/4588595) loss:2.861 lr:0.0000100 epoch_Time:27672.0min: [2024-01-03 15:35:02,830][model8_pretrain.py][INFO] Epoch:[0/2](208200/4588595) loss:2.843 lr:0.0000100 epoch_Time:27672.0min: [2024-01-03 15:35:02,830][model8_pretrain.py][INFO] Epoch:[0/2](208200/4588595) loss:2.937 lr:0.0000100 epoch_Time:27672.0min: [2024-01-03 15:35:02,830][model8_pretrain.py][INFO] Epoch:[0/2](208200/4588595) loss:2.652 lr:0.0000100 epoch_Time:27672.0min: [2024-01-03 15:35:02,830][model8_pretrain.py][INFO] Epoch:[0/2](208200/4588595) loss:2.549 lr:0.0000100 epoch_Time:27672.0min: [2024-01-03 15:35:02,830][model8_pretrain.py][INFO] Epoch:[0/2](208200/4588595) loss:2.905 lr:0.0000100 epoch_Time:27672.0min: [2024-01-03 15:35:02,830][model8_pretrain.py][INFO] 
Epoch:[0/2](208200/4588595) loss:2.903 lr:0.0000100 epoch_Time:27672.0min: [2024-01-03 15:35:02,830][model8_pretrain.py][INFO] Epoch:[0/2](208200/4588595) loss:2.905 lr:0.0000100 epoch_Time:27672.0min: [2024-01-03 15:35:39,780][model8_pretrain.py][INFO] Epoch:[0/2](208300/4588595) loss:3.187 lr:0.0000100 epoch_Time:27671.0min: [2024-01-03 15:35:39,780][model8_pretrain.py][INFO] Epoch:[0/2](208300/4588595) loss:2.713 lr:0.0000100 epoch_Time:27671.0min: [2024-01-03 15:35:39,780][model8_pretrain.py][INFO] Epoch:[0/2](208300/4588595) loss:2.983 lr:0.0000100 epoch_Time:27671.0min: [2024-01-03 15:35:39,780][model8_pretrain.py][INFO] Epoch:[0/2](208300/4588595) loss:2.958 lr:0.0000100 epoch_Time:27671.0min: [2024-01-03 15:35:39,780][model8_pretrain.py][INFO] Epoch:[0/2](208300/4588595) loss:3.395 lr:0.0000100 epoch_Time:27671.0min: [2024-01-03 15:35:39,780][model8_pretrain.py][INFO] Epoch:[0/2](208300/4588595) loss:3.188 lr:0.0000100 epoch_Time:27671.0min: [2024-01-03 15:35:39,780][model8_pretrain.py][INFO] Epoch:[0/2](208300/4588595) loss:3.736 lr:0.0000100 epoch_Time:27671.0min: [2024-01-03 15:35:39,780][model8_pretrain.py][INFO] Epoch:[0/2](208300/4588595) loss:3.167 lr:0.0000100 epoch_Time:27671.0min: [2024-01-03 15:36:16,724][model8_pretrain.py][INFO] Epoch:[0/2](208400/4588595) loss:2.998 lr:0.0000100 epoch_Time:27670.0min: [2024-01-03 15:36:16,724][model8_pretrain.py][INFO] Epoch:[0/2](208400/4588595) loss:3.325 lr:0.0000100 epoch_Time:27670.0min: [2024-01-03 15:36:16,724][model8_pretrain.py][INFO] Epoch:[0/2](208400/4588595) loss:3.002 lr:0.0000100 epoch_Time:27670.0min: [2024-01-03 15:36:16,724][model8_pretrain.py][INFO] Epoch:[0/2](208400/4588595) loss:3.181 lr:0.0000100 epoch_Time:27670.0min: [2024-01-03 15:36:16,724][model8_pretrain.py][INFO] Epoch:[0/2](208400/4588595) loss:2.895 lr:0.0000100 epoch_Time:27670.0min: [2024-01-03 15:36:16,724][model8_pretrain.py][INFO] Epoch:[0/2](208400/4588595) loss:3.184 lr:0.0000100 epoch_Time:27670.0min: [2024-01-03 
15:36:16,724][model8_pretrain.py][INFO] Epoch:[0/2](208400/4588595) loss:3.117 lr:0.0000100 epoch_Time:27670.0min: [2024-01-03 15:36:16,724][model8_pretrain.py][INFO] Epoch:[0/2](208400/4588595) loss:2.908 lr:0.0000100 epoch_Time:27670.0min: [2024-01-03 15:36:53,668][model8_pretrain.py][INFO] Epoch:[0/2](208500/4588595) loss:2.827 lr:0.0000100 epoch_Time:27669.0min: [2024-01-03 15:36:53,668][model8_pretrain.py][INFO] Epoch:[0/2](208500/4588595) loss:3.040 lr:0.0000100 epoch_Time:27669.0min: [2024-01-03 15:36:53,668][model8_pretrain.py][INFO] Epoch:[0/2](208500/4588595) loss:2.876 lr:0.0000100 epoch_Time:27669.0min: [2024-01-03 15:36:53,668][model8_pretrain.py][INFO] Epoch:[0/2](208500/4588595) loss:3.216 lr:0.0000100 epoch_Time:27669.0min: [2024-01-03 15:36:53,669][model8_pretrain.py][INFO] Epoch:[0/2](208500/4588595) loss:2.949 lr:0.0000100 epoch_Time:27669.0min: [2024-01-03 15:36:53,669][model8_pretrain.py][INFO] Epoch:[0/2](208500/4588595) loss:2.978 lr:0.0000100 epoch_Time:27669.0min: [2024-01-03 15:36:53,669][model8_pretrain.py][INFO] Epoch:[0/2](208500/4588595) loss:3.008 lr:0.0000100 epoch_Time:27669.0min: [2024-01-03 15:36:53,669][model8_pretrain.py][INFO] Epoch:[0/2](208500/4588595) loss:2.786 lr:0.0000100 epoch_Time:27669.0min: [2024-01-03 15:37:34,104][model8_pretrain.py][INFO] Epoch:[0/2](208600/4588595) loss:3.509 lr:0.0000100 epoch_Time:27670.0min: [2024-01-03 15:37:34,104][model8_pretrain.py][INFO] Epoch:[0/2](208600/4588595) loss:2.019 lr:0.0000100 epoch_Time:27670.0min: [2024-01-03 15:37:34,104][model8_pretrain.py][INFO] Epoch:[0/2](208600/4588595) loss:2.687 lr:0.0000100 epoch_Time:27670.0min: [2024-01-03 15:37:34,104][model8_pretrain.py][INFO] Epoch:[0/2](208600/4588595) loss:3.477 lr:0.0000100 epoch_Time:27670.0min: [2024-01-03 15:37:34,104][model8_pretrain.py][INFO] Epoch:[0/2](208600/4588595) loss:2.448 lr:0.0000100 epoch_Time:27670.0min: [2024-01-03 15:37:34,104][model8_pretrain.py][INFO] Epoch:[0/2](208600/4588595) loss:2.637 lr:0.0000100 
epoch_Time:27670.0min: [2024-01-03 15:37:34,104][model8_pretrain.py][INFO] Epoch:[0/2](208600/4588595) loss:2.942 lr:0.0000100 epoch_Time:27670.0min: [2024-01-03 15:37:34,104][model8_pretrain.py][INFO] Epoch:[0/2](208600/4588595) loss:3.225 lr:0.0000100 epoch_Time:27670.0min: [2024-01-03 15:38:16,245][model8_pretrain.py][INFO] Epoch:[0/2](208700/4588595) loss:2.443 lr:0.0000100 epoch_Time:27670.0min: [2024-01-03 15:38:16,245][model8_pretrain.py][INFO] Epoch:[0/2](208700/4588595) loss:2.204 lr:0.0000100 epoch_Time:27670.0min: [2024-01-03 15:38:16,245][model8_pretrain.py][INFO] Epoch:[0/2](208700/4588595) loss:3.285 lr:0.0000100 epoch_Time:27670.0min: [2024-01-03 15:38:16,245][model8_pretrain.py][INFO] Epoch:[0/2](208700/4588595) loss:2.879 lr:0.0000100 epoch_Time:27670.0min: [2024-01-03 15:38:16,245][model8_pretrain.py][INFO] Epoch:[0/2](208700/4588595) loss:2.787 lr:0.0000100 epoch_Time:27670.0min: [2024-01-03 15:38:16,245][model8_pretrain.py][INFO] Epoch:[0/2](208700/4588595) loss:2.688 lr:0.0000100 epoch_Time:27670.0min: [2024-01-03 15:38:16,245][model8_pretrain.py][INFO] Epoch:[0/2](208700/4588595) loss:3.224 lr:0.0000100 epoch_Time:27670.0min: [2024-01-03 15:38:16,245][model8_pretrain.py][INFO] Epoch:[0/2](208700/4588595) loss:3.537 lr:0.0000100 epoch_Time:27670.0min: [2024-01-03 15:38:53,184][model8_pretrain.py][INFO] Epoch:[0/2](208800/4588595) loss:2.873 lr:0.0000100 epoch_Time:27669.0min: [2024-01-03 15:38:53,184][model8_pretrain.py][INFO] Epoch:[0/2](208800/4588595) loss:2.745 lr:0.0000100 epoch_Time:27669.0min: [2024-01-03 15:38:53,184][model8_pretrain.py][INFO] Epoch:[0/2](208800/4588595) loss:3.092 lr:0.0000100 epoch_Time:27669.0min: [2024-01-03 15:38:53,184][model8_pretrain.py][INFO] Epoch:[0/2](208800/4588595) loss:2.829 lr:0.0000100 epoch_Time:27669.0min: [2024-01-03 15:38:53,184][model8_pretrain.py][INFO] Epoch:[0/2](208800/4588595) loss:2.847 lr:0.0000100 epoch_Time:27669.0min: [2024-01-03 15:38:53,184][model8_pretrain.py][INFO] 
Epoch:[0/2](208800/4588595) loss:3.297 lr:0.0000100 epoch_Time:27669.0min: [2024-01-03 15:38:53,184][model8_pretrain.py][INFO] Epoch:[0/2](208800/4588595) loss:3.026 lr:0.0000100 epoch_Time:27669.0min: [2024-01-03 15:38:53,184][model8_pretrain.py][INFO] Epoch:[0/2](208800/4588595) loss:2.668 lr:0.0000100 epoch_Time:27669.0min: [2024-01-03 15:39:30,144][model8_pretrain.py][INFO] Epoch:[0/2](208900/4588595) loss:2.843 lr:0.0000100 epoch_Time:27668.0min: [2024-01-03 15:39:30,144][model8_pretrain.py][INFO] Epoch:[0/2](208900/4588595) loss:3.255 lr:0.0000100 epoch_Time:27668.0min: [2024-01-03 15:39:30,144][model8_pretrain.py][INFO] Epoch:[0/2](208900/4588595) loss:2.991 lr:0.0000100 epoch_Time:27668.0min: [2024-01-03 15:39:30,144][model8_pretrain.py][INFO] Epoch:[0/2](208900/4588595) loss:3.484 lr:0.0000100 epoch_Time:27668.0min: [2024-01-03 15:39:30,144][model8_pretrain.py][INFO] Epoch:[0/2](208900/4588595) loss:3.120 lr:0.0000100 epoch_Time:27668.0min: [2024-01-03 15:39:30,144][model8_pretrain.py][INFO] Epoch:[0/2](208900/4588595) loss:2.657 lr:0.0000100 epoch_Time:27668.0min: [2024-01-03 15:39:30,144][model8_pretrain.py][INFO] Epoch:[0/2](208900/4588595) loss:2.542 lr:0.0000100 epoch_Time:27668.0min: [2024-01-03 15:39:30,144][model8_pretrain.py][INFO] Epoch:[0/2](208900/4588595) loss:3.522 lr:0.0000100 epoch_Time:27668.0min: [2024-01-03 15:40:07,090][model8_pretrain.py][INFO] Epoch:[0/2](209000/4588595) loss:3.211 lr:0.0000100 epoch_Time:27667.0min: [2024-01-03 15:40:07,090][model8_pretrain.py][INFO] Epoch:[0/2](209000/4588595) loss:3.176 lr:0.0000100 epoch_Time:27667.0min: [2024-01-03 15:40:07,090][model8_pretrain.py][INFO] Epoch:[0/2](209000/4588595) loss:3.173 lr:0.0000100 epoch_Time:27667.0min: [2024-01-03 15:40:07,090][model8_pretrain.py][INFO] Epoch:[0/2](209000/4588595) loss:2.839 lr:0.0000100 epoch_Time:27667.0min: [2024-01-03 15:40:07,090][model8_pretrain.py][INFO] Epoch:[0/2](209000/4588595) loss:2.271 lr:0.0000100 epoch_Time:27667.0min: [2024-01-03 
15:40:07,090][model8_pretrain.py][INFO] Epoch:[0/2](209000/4588595) loss:3.290 lr:0.0000100 epoch_Time:27667.0min: [2024-01-03 15:40:07,090][model8_pretrain.py][INFO] Epoch:[0/2](209000/4588595) loss:3.518 lr:0.0000100 epoch_Time:27667.0min: [2024-01-03 15:40:07,090][model8_pretrain.py][INFO] Epoch:[0/2](209000/4588595) loss:2.606 lr:0.0000100 epoch_Time:27667.0min: [2024-01-03 15:40:44,040][model8_pretrain.py][INFO] Epoch:[0/2](209100/4588595) loss:2.573 lr:0.0000100 epoch_Time:27667.0min: [2024-01-03 15:40:44,040][model8_pretrain.py][INFO] Epoch:[0/2](209100/4588595) loss:2.749 lr:0.0000100 epoch_Time:27667.0min: [2024-01-03 15:40:44,040][model8_pretrain.py][INFO] Epoch:[0/2](209100/4588595) loss:3.084 lr:0.0000100 epoch_Time:27667.0min: [2024-01-03 15:40:44,040][model8_pretrain.py][INFO] Epoch:[0/2](209100/4588595) loss:3.416 lr:0.0000100 epoch_Time:27667.0min: [2024-01-03 15:40:44,040][model8_pretrain.py][INFO] Epoch:[0/2](209100/4588595) loss:2.806 lr:0.0000100 epoch_Time:27667.0min: [2024-01-03 15:40:44,040][model8_pretrain.py][INFO] Epoch:[0/2](209100/4588595) loss:2.917 lr:0.0000100 epoch_Time:27667.0min: [2024-01-03 15:40:44,040][model8_pretrain.py][INFO] Epoch:[0/2](209100/4588595) loss:3.110 lr:0.0000100 epoch_Time:27667.0min: [2024-01-03 15:40:44,040][model8_pretrain.py][INFO] Epoch:[0/2](209100/4588595) loss:3.341 lr:0.0000100 epoch_Time:27667.0min: [2024-01-03 15:41:20,975][model8_pretrain.py][INFO] Epoch:[0/2](209200/4588595) loss:2.660 lr:0.0000100 epoch_Time:27665.0min: [2024-01-03 15:41:20,975][model8_pretrain.py][INFO] Epoch:[0/2](209200/4588595) loss:3.034 lr:0.0000100 epoch_Time:27665.0min: [2024-01-03 15:41:20,975][model8_pretrain.py][INFO] Epoch:[0/2](209200/4588595) loss:2.864 lr:0.0000100 epoch_Time:27665.0min: [2024-01-03 15:41:20,975][model8_pretrain.py][INFO] Epoch:[0/2](209200/4588595) loss:3.055 lr:0.0000100 epoch_Time:27665.0min: [2024-01-03 15:41:20,975][model8_pretrain.py][INFO] Epoch:[0/2](209200/4588595) loss:2.858 lr:0.0000100 
epoch_Time:27665.0min: [2024-01-03 15:41:20,975][model8_pretrain.py][INFO] Epoch:[0/2](209200/4588595) loss:3.174 lr:0.0000100 epoch_Time:27665.0min: [2024-01-03 15:41:20,975][model8_pretrain.py][INFO] Epoch:[0/2](209200/4588595) loss:2.340 lr:0.0000100 epoch_Time:27665.0min: [2024-01-03 15:41:20,975][model8_pretrain.py][INFO] Epoch:[0/2](209200/4588595) loss:3.077 lr:0.0000100 epoch_Time:27665.0min: [2024-01-03 15:41:57,916][model8_pretrain.py][INFO] Epoch:[0/2](209300/4588595) loss:3.164 lr:0.0000100 epoch_Time:27664.0min: [2024-01-03 15:41:57,916][model8_pretrain.py][INFO] Epoch:[0/2](209300/4588595) loss:3.036 lr:0.0000100 epoch_Time:27664.0min: [2024-01-03 15:41:57,916][model8_pretrain.py][INFO] Epoch:[0/2](209300/4588595) loss:2.722 lr:0.0000100 epoch_Time:27664.0min: [2024-01-03 15:41:57,916][model8_pretrain.py][INFO] Epoch:[0/2](209300/4588595) loss:2.851 lr:0.0000100 epoch_Time:27664.0min: [2024-01-03 15:41:57,916][model8_pretrain.py][INFO] Epoch:[0/2](209300/4588595) loss:2.969 lr:0.0000100 epoch_Time:27664.0min: [2024-01-03 15:41:57,916][model8_pretrain.py][INFO] Epoch:[0/2](209300/4588595) loss:3.020 lr:0.0000100 epoch_Time:27664.0min: [2024-01-03 15:41:57,916][model8_pretrain.py][INFO] Epoch:[0/2](209300/4588595) loss:3.169 lr:0.0000100 epoch_Time:27664.0min: [2024-01-03 15:41:57,916][model8_pretrain.py][INFO] Epoch:[0/2](209300/4588595) loss:2.518 lr:0.0000100 epoch_Time:27664.0min: [2024-01-03 15:42:38,352][model8_pretrain.py][INFO] Epoch:[0/2](209400/4588595) loss:3.118 lr:0.0000100 epoch_Time:27665.0min: [2024-01-03 15:42:38,352][model8_pretrain.py][INFO] Epoch:[0/2](209400/4588595) loss:2.112 lr:0.0000100 epoch_Time:27665.0min: [2024-01-03 15:42:38,352][model8_pretrain.py][INFO] Epoch:[0/2](209400/4588595) loss:3.196 lr:0.0000100 epoch_Time:27665.0min: [2024-01-03 15:42:38,352][model8_pretrain.py][INFO] Epoch:[0/2](209400/4588595) loss:2.901 lr:0.0000100 epoch_Time:27665.0min: [2024-01-03 15:42:38,352][model8_pretrain.py][INFO] 
Epoch:[0/2](209400/4588595) loss:2.366 lr:0.0000100 epoch_Time:27665.0min: [2024-01-03 15:42:38,352][model8_pretrain.py][INFO] Epoch:[0/2](209400/4588595) loss:3.278 lr:0.0000100 epoch_Time:27665.0min: [2024-01-03 15:42:38,352][model8_pretrain.py][INFO] Epoch:[0/2](209400/4588595) loss:2.857 lr:0.0000100 epoch_Time:27665.0min: [2024-01-03 15:42:38,352][model8_pretrain.py][INFO] Epoch:[0/2](209400/4588595) loss:2.730 lr:0.0000100 epoch_Time:27665.0min: [2024-01-03 15:43:20,522][model8_pretrain.py][INFO] Epoch:[0/2](209500/4588595) loss:3.431 lr:0.0000100 epoch_Time:27666.0min: [2024-01-03 15:43:20,522][model8_pretrain.py][INFO] Epoch:[0/2](209500/4588595) loss:2.904 lr:0.0000100 epoch_Time:27666.0min: [2024-01-03 15:43:20,522][model8_pretrain.py][INFO] Epoch:[0/2](209500/4588595) loss:3.067 lr:0.0000100 epoch_Time:27666.0min: [2024-01-03 15:43:20,522][model8_pretrain.py][INFO] Epoch:[0/2](209500/4588595) loss:3.136 lr:0.0000100 epoch_Time:27666.0min: [2024-01-03 15:43:20,522][model8_pretrain.py][INFO] Epoch:[0/2](209500/4588595) loss:2.691 lr:0.0000100 epoch_Time:27666.0min: [2024-01-03 15:43:20,522][model8_pretrain.py][INFO] Epoch:[0/2](209500/4588595) loss:2.745 lr:0.0000100 epoch_Time:27666.0min: [2024-01-03 15:43:20,522][model8_pretrain.py][INFO] Epoch:[0/2](209500/4588595) loss:2.530 lr:0.0000100 epoch_Time:27666.0min: [2024-01-03 15:43:20,522][model8_pretrain.py][INFO] Epoch:[0/2](209500/4588595) loss:3.245 lr:0.0000100 epoch_Time:27666.0min: [2024-01-03 15:43:57,454][model8_pretrain.py][INFO] Epoch:[0/2](209600/4588595) loss:2.997 lr:0.0000100 epoch_Time:27664.0min: [2024-01-03 15:43:57,454][model8_pretrain.py][INFO] Epoch:[0/2](209600/4588595) loss:3.317 lr:0.0000100 epoch_Time:27664.0min: [2024-01-03 15:43:57,454][model8_pretrain.py][INFO] Epoch:[0/2](209600/4588595) loss:2.956 lr:0.0000100 epoch_Time:27664.0min: [2024-01-03 15:43:57,454][model8_pretrain.py][INFO] Epoch:[0/2](209600/4588595) loss:2.812 lr:0.0000100 epoch_Time:27664.0min: [2024-01-03 
15:43:57,454][model8_pretrain.py][INFO] Epoch:[0/2](209600/4588595) loss:2.266 lr:0.0000100 epoch_Time:27664.0min: [2024-01-03 15:43:57,454][model8_pretrain.py][INFO] Epoch:[0/2](209600/4588595) loss:3.100 lr:0.0000100 epoch_Time:27664.0min: [2024-01-03 15:43:57,454][model8_pretrain.py][INFO] Epoch:[0/2](209600/4588595) loss:3.504 lr:0.0000100 epoch_Time:27664.0min: [2024-01-03 15:43:57,454][model8_pretrain.py][INFO] Epoch:[0/2](209600/4588595) loss:2.978 lr:0.0000100 epoch_Time:27664.0min: [2024-01-03 15:44:34,387][model8_pretrain.py][INFO] Epoch:[0/2](209700/4588595) loss:2.569 lr:0.0000100 epoch_Time:27664.0min: [2024-01-03 15:44:34,387][model8_pretrain.py][INFO] Epoch:[0/2](209700/4588595) loss:2.926 lr:0.0000100 epoch_Time:27664.0min: [2024-01-03 15:44:34,387][model8_pretrain.py][INFO] Epoch:[0/2](209700/4588595) loss:2.988 lr:0.0000100 epoch_Time:27664.0min: [2024-01-03 15:44:34,387][model8_pretrain.py][INFO] Epoch:[0/2](209700/4588595) loss:2.852 lr:0.0000100 epoch_Time:27664.0min: [2024-01-03 15:44:34,387][model8_pretrain.py][INFO] Epoch:[0/2](209700/4588595) loss:2.766 lr:0.0000100 epoch_Time:27664.0min: [2024-01-03 15:44:34,388][model8_pretrain.py][INFO] Epoch:[0/2](209700/4588595) loss:3.174 lr:0.0000100 epoch_Time:27664.0min: [2024-01-03 15:44:34,388][model8_pretrain.py][INFO] Epoch:[0/2](209700/4588595) loss:2.866 lr:0.0000100 epoch_Time:27664.0min: [2024-01-03 15:44:34,388][model8_pretrain.py][INFO] Epoch:[0/2](209700/4588595) loss:2.157 lr:0.0000100 epoch_Time:27664.0min: [2024-01-03 15:45:11,339][model8_pretrain.py][INFO] Epoch:[0/2](209800/4588595) loss:2.881 lr:0.0000100 epoch_Time:27662.0min: [2024-01-03 15:45:11,339][model8_pretrain.py][INFO] Epoch:[0/2](209800/4588595) loss:3.378 lr:0.0000100 epoch_Time:27662.0min: [2024-01-03 15:45:11,339][model8_pretrain.py][INFO] Epoch:[0/2](209800/4588595) loss:3.198 lr:0.0000100 epoch_Time:27662.0min: [2024-01-03 15:45:11,339][model8_pretrain.py][INFO] Epoch:[0/2](209800/4588595) loss:3.049 lr:0.0000100 
epoch_Time:27662.0min: [2024-01-03 15:45:11,339][model8_pretrain.py][INFO] Epoch:[0/2](209800/4588595) loss:2.875 lr:0.0000100 epoch_Time:27662.0min: [2024-01-03 15:45:11,339][model8_pretrain.py][INFO] Epoch:[0/2](209800/4588595) loss:2.714 lr:0.0000100 epoch_Time:27662.0min: [2024-01-03 15:45:11,340][model8_pretrain.py][INFO] Epoch:[0/2](209800/4588595) loss:2.300 lr:0.0000100 epoch_Time:27662.0min: [2024-01-03 15:45:11,340][model8_pretrain.py][INFO] Epoch:[0/2](209800/4588595) loss:2.379 lr:0.0000100 epoch_Time:27662.0min: [2024-01-03 15:45:48,283][model8_pretrain.py][INFO] Epoch:[0/2](209900/4588595) loss:3.053 lr:0.0000100 epoch_Time:27661.0min: [2024-01-03 15:45:48,283][model8_pretrain.py][INFO] Epoch:[0/2](209900/4588595) loss:3.057 lr:0.0000100 epoch_Time:27661.0min: [2024-01-03 15:45:48,283][model8_pretrain.py][INFO] Epoch:[0/2](209900/4588595) loss:2.869 lr:0.0000100 epoch_Time:27661.0min: [2024-01-03 15:45:48,283][model8_pretrain.py][INFO] Epoch:[0/2](209900/4588595) loss:2.901 lr:0.0000100 epoch_Time:27661.0min: [2024-01-03 15:45:48,283][model8_pretrain.py][INFO] Epoch:[0/2](209900/4588595) loss:2.434 lr:0.0000100 epoch_Time:27661.0min: [2024-01-03 15:45:48,283][model8_pretrain.py][INFO] Epoch:[0/2](209900/4588595) loss:3.229 lr:0.0000100 epoch_Time:27661.0min: [2024-01-03 15:45:48,283][model8_pretrain.py][INFO] Epoch:[0/2](209900/4588595) loss:2.663 lr:0.0000100 epoch_Time:27661.0min: [2024-01-03 15:45:48,283][model8_pretrain.py][INFO] Epoch:[0/2](209900/4588595) loss:3.459 lr:0.0000100 epoch_Time:27661.0min: [2024-01-03 15:46:25,201][model8_pretrain.py][INFO] Epoch:[0/2](210000/4588595) loss:3.036 lr:0.0000100 epoch_Time:27661.0min: [2024-01-03 15:46:25,201][model8_pretrain.py][INFO] Epoch:[0/2](210000/4588595) loss:2.812 lr:0.0000100 epoch_Time:27661.0min: [2024-01-03 15:46:25,201][model8_pretrain.py][INFO] Epoch:[0/2](210000/4588595) loss:3.080 lr:0.0000100 epoch_Time:27661.0min: [2024-01-03 15:46:25,201][model8_pretrain.py][INFO] 
Epoch:[0/2](210000/4588595) loss:3.196 lr:0.0000100 epoch_Time:27661.0min: [2024-01-03 15:46:25,201][model8_pretrain.py][INFO] Epoch:[0/2](210000/4588595) loss:2.829 lr:0.0000100 epoch_Time:27661.0min: [2024-01-03 15:46:25,201][model8_pretrain.py][INFO] Epoch:[0/2](210000/4588595) loss:3.232 lr:0.0000100 epoch_Time:27661.0min: [2024-01-03 15:46:25,201][model8_pretrain.py][INFO] Epoch:[0/2](210000/4588595) loss:3.297 lr:0.0000100 epoch_Time:27661.0min: [2024-01-03 15:46:25,201][model8_pretrain.py][INFO] Epoch:[0/2](210000/4588595) loss:2.982 lr:0.0000100 epoch_Time:27661.0min: [2024-01-03 15:47:02,142][model8_pretrain.py][INFO] Epoch:[0/2](210100/4588595) loss:2.902 lr:0.0000100 epoch_Time:27659.0min: [2024-01-03 15:47:02,142][model8_pretrain.py][INFO] Epoch:[0/2](210100/4588595) loss:2.700 lr:0.0000100 epoch_Time:27659.0min: [2024-01-03 15:47:02,142][model8_pretrain.py][INFO] Epoch:[0/2](210100/4588595) loss:2.692 lr:0.0000100 epoch_Time:27659.0min: [2024-01-03 15:47:02,142][model8_pretrain.py][INFO] Epoch:[0/2](210100/4588595) loss:3.150 lr:0.0000100 epoch_Time:27659.0min: [2024-01-03 15:47:02,142][model8_pretrain.py][INFO] Epoch:[0/2](210100/4588595) loss:2.855 lr:0.0000100 epoch_Time:27659.0min: [2024-01-03 15:47:02,142][model8_pretrain.py][INFO] Epoch:[0/2](210100/4588595) loss:2.321 lr:0.0000100 epoch_Time:27659.0min: [2024-01-03 15:47:02,143][model8_pretrain.py][INFO] Epoch:[0/2](210100/4588595) loss:2.652 lr:0.0000100 epoch_Time:27659.0min: [2024-01-03 15:47:02,142][model8_pretrain.py][INFO] Epoch:[0/2](210100/4588595) loss:3.561 lr:0.0000100 epoch_Time:27659.0min: [2024-01-03 15:47:42,543][model8_pretrain.py][INFO] Epoch:[0/2](210200/4588595) loss:2.850 lr:0.0000100 epoch_Time:27660.0min: [2024-01-03 15:47:42,543][model8_pretrain.py][INFO] Epoch:[0/2](210200/4588595) loss:3.041 lr:0.0000100 epoch_Time:27660.0min: [2024-01-03 15:47:42,543][model8_pretrain.py][INFO] Epoch:[0/2](210200/4588595) loss:2.548 lr:0.0000100 epoch_Time:27660.0min: [2024-01-03 
15:47:42,543][model8_pretrain.py][INFO] Epoch:[0/2](210200/4588595) loss:3.535 lr:0.0000100 epoch_Time:27660.0min: [2024-01-03 15:47:42,547][model8_pretrain.py][INFO] Epoch:[0/2](210200/4588595) loss:2.990 lr:0.0000100 epoch_Time:27660.0min: [2024-01-03 15:47:42,548][model8_pretrain.py][INFO] Epoch:[0/2](210200/4588595) loss:3.086 lr:0.0000100 epoch_Time:27660.0min: [2024-01-03 15:47:42,548][model8_pretrain.py][INFO] Epoch:[0/2](210200/4588595) loss:3.208 lr:0.0000100 epoch_Time:27660.0min: [2024-01-03 15:47:42,548][model8_pretrain.py][INFO] Epoch:[0/2](210200/4588595) loss:3.086 lr:0.0000100 epoch_Time:27660.0min: [2024-01-03 15:48:24,668][model8_pretrain.py][INFO] Epoch:[0/2](210300/4588595) loss:2.997 lr:0.0000100 epoch_Time:27661.0min: [2024-01-03 15:48:24,668][model8_pretrain.py][INFO] Epoch:[0/2](210300/4588595) loss:3.323 lr:0.0000100 epoch_Time:27661.0min: [2024-01-03 15:48:24,668][model8_pretrain.py][INFO] Epoch:[0/2](210300/4588595) loss:2.641 lr:0.0000100 epoch_Time:27661.0min: [2024-01-03 15:48:24,668][model8_pretrain.py][INFO] Epoch:[0/2](210300/4588595) loss:3.002 lr:0.0000100 epoch_Time:27661.0min: [2024-01-03 15:48:24,668][model8_pretrain.py][INFO] Epoch:[0/2](210300/4588595) loss:3.126 lr:0.0000100 epoch_Time:27661.0min: [2024-01-03 15:48:24,668][model8_pretrain.py][INFO] Epoch:[0/2](210300/4588595) loss:2.763 lr:0.0000100 epoch_Time:27661.0min: [2024-01-03 15:48:24,668][model8_pretrain.py][INFO] Epoch:[0/2](210300/4588595) loss:2.938 lr:0.0000100 epoch_Time:27661.0min: [2024-01-03 15:48:24,668][model8_pretrain.py][INFO] Epoch:[0/2](210300/4588595) loss:2.818 lr:0.0000100 epoch_Time:27661.0min: [2024-01-03 15:49:01,597][model8_pretrain.py][INFO] Epoch:[0/2](210400/4588595) loss:3.203 lr:0.0000100 epoch_Time:27659.0min: [2024-01-03 15:49:01,597][model8_pretrain.py][INFO] Epoch:[0/2](210400/4588595) loss:3.081 lr:0.0000100 epoch_Time:27659.0min: [2024-01-03 15:49:01,597][model8_pretrain.py][INFO] Epoch:[0/2](210400/4588595) loss:2.897 lr:0.0000100 
epoch_Time:27660.0min: [2024-01-03 15:49:01,597][model8_pretrain.py][INFO] Epoch:[0/2](210400/4588595) loss:2.814 lr:0.0000100 epoch_Time:27659.0min: [2024-01-03 15:49:01,597][model8_pretrain.py][INFO] Epoch:[0/2](210400/4588595) loss:3.085 lr:0.0000100 epoch_Time:27659.0min: [2024-01-03 15:49:01,597][model8_pretrain.py][INFO] Epoch:[0/2](210400/4588595) loss:2.519 lr:0.0000100 epoch_Time:27659.0min: [2024-01-03 15:49:01,597][model8_pretrain.py][INFO] Epoch:[0/2](210400/4588595) loss:3.045 lr:0.0000100 epoch_Time:27659.0min: [2024-01-03 15:49:01,597][model8_pretrain.py][INFO] Epoch:[0/2](210400/4588595) loss:2.559 lr:0.0000100 epoch_Time:27659.0min: [2024-01-03 15:49:38,537][model8_pretrain.py][INFO] Epoch:[0/2](210500/4588595) loss:2.981 lr:0.0000100 epoch_Time:27659.0min: [2024-01-03 15:49:38,537][model8_pretrain.py][INFO] Epoch:[0/2](210500/4588595) loss:2.780 lr:0.0000100 epoch_Time:27659.0min: [2024-01-03 15:49:38,537][model8_pretrain.py][INFO] Epoch:[0/2](210500/4588595) loss:3.266 lr:0.0000100 epoch_Time:27659.0min: [2024-01-03 15:49:38,537][model8_pretrain.py][INFO] Epoch:[0/2](210500/4588595) loss:2.513 lr:0.0000100 epoch_Time:27659.0min: [2024-01-03 15:49:38,537][model8_pretrain.py][INFO] Epoch:[0/2](210500/4588595) loss:2.960 lr:0.0000100 epoch_Time:27659.0min: [2024-01-03 15:49:38,537][model8_pretrain.py][INFO] Epoch:[0/2](210500/4588595) loss:3.237 lr:0.0000100 epoch_Time:27659.0min: [2024-01-03 15:49:38,537][model8_pretrain.py][INFO] Epoch:[0/2](210500/4588595) loss:3.591 lr:0.0000100 epoch_Time:27659.0min: [2024-01-03 15:49:38,537][model8_pretrain.py][INFO] Epoch:[0/2](210500/4588595) loss:3.178 lr:0.0000100 epoch_Time:27659.0min: [2024-01-03 15:50:15,482][model8_pretrain.py][INFO] Epoch:[0/2](210600/4588595) loss:2.854 lr:0.0000100 epoch_Time:27658.0min: [2024-01-03 15:50:15,482][model8_pretrain.py][INFO] Epoch:[0/2](210600/4588595) loss:3.344 lr:0.0000100 epoch_Time:27658.0min: [2024-01-03 15:50:15,482][model8_pretrain.py][INFO] 
Epoch:[0/2](210600/4588595) loss:2.951 lr:0.0000100 epoch_Time:27658.0min: [2024-01-03 15:50:15,482][model8_pretrain.py][INFO] Epoch:[0/2](210600/4588595) loss:3.156 lr:0.0000100 epoch_Time:27658.0min: [2024-01-03 15:50:15,482][model8_pretrain.py][INFO] Epoch:[0/2](210600/4588595) loss:2.794 lr:0.0000100 epoch_Time:27658.0min: [2024-01-03 15:50:15,482][model8_pretrain.py][INFO] Epoch:[0/2](210600/4588595) loss:2.847 lr:0.0000100 epoch_Time:27658.0min: [2024-01-03 15:50:15,482][model8_pretrain.py][INFO] Epoch:[0/2](210600/4588595) loss:2.656 lr:0.0000100 epoch_Time:27658.0min: [2024-01-03 15:50:15,483][model8_pretrain.py][INFO] Epoch:[0/2](210600/4588595) loss:2.734 lr:0.0000100 epoch_Time:27658.0min: [2024-01-03 15:50:52,430][model8_pretrain.py][INFO] Epoch:[0/2](210700/4588595) loss:3.031 lr:0.0000100 epoch_Time:27656.0min: [2024-01-03 15:50:52,430][model8_pretrain.py][INFO] Epoch:[0/2](210700/4588595) loss:2.965 lr:0.0000100 epoch_Time:27656.0min: [2024-01-03 15:50:52,430][model8_pretrain.py][INFO] Epoch:[0/2](210700/4588595) loss:2.855 lr:0.0000100 epoch_Time:27656.0min: [2024-01-03 15:50:52,430][model8_pretrain.py][INFO] Epoch:[0/2](210700/4588595) loss:2.670 lr:0.0000100 epoch_Time:27656.0min: [2024-01-03 15:50:52,430][model8_pretrain.py][INFO] Epoch:[0/2](210700/4588595) loss:2.235 lr:0.0000100 epoch_Time:27656.0min: [2024-01-03 15:50:52,430][model8_pretrain.py][INFO] Epoch:[0/2](210700/4588595) loss:2.563 lr:0.0000100 epoch_Time:27656.0min: [2024-01-03 15:50:52,430][model8_pretrain.py][INFO] Epoch:[0/2](210700/4588595) loss:2.925 lr:0.0000100 epoch_Time:27656.0min: [2024-01-03 15:50:52,430][model8_pretrain.py][INFO] Epoch:[0/2](210700/4588595) loss:2.748 lr:0.0000100 epoch_Time:27656.0min: [2024-01-03 15:51:29,368][model8_pretrain.py][INFO] Epoch:[0/2](210800/4588595) loss:2.926 lr:0.0000100 epoch_Time:27656.0min: [2024-01-03 15:51:29,368][model8_pretrain.py][INFO] Epoch:[0/2](210800/4588595) loss:2.336 lr:0.0000100 epoch_Time:27656.0min: [2024-01-03 
15:51:29,368][model8_pretrain.py][INFO] Epoch:[0/2](210800/4588595) loss:3.443 lr:0.0000100 epoch_Time:27656.0min: [2024-01-03 15:51:29,368][model8_pretrain.py][INFO] Epoch:[0/2](210800/4588595) loss:2.280 lr:0.0000100 epoch_Time:27656.0min: [2024-01-03 15:51:29,368][model8_pretrain.py][INFO] Epoch:[0/2](210800/4588595) loss:3.210 lr:0.0000100 epoch_Time:27656.0min: [2024-01-03 15:51:29,368][model8_pretrain.py][INFO] Epoch:[0/2](210800/4588595) loss:2.993 lr:0.0000100 epoch_Time:27656.0min: [2024-01-03 15:51:29,368][model8_pretrain.py][INFO] Epoch:[0/2](210800/4588595) loss:2.923 lr:0.0000100 epoch_Time:27656.0min: [2024-01-03 15:51:29,368][model8_pretrain.py][INFO] Epoch:[0/2](210800/4588595) loss:2.427 lr:0.0000100 epoch_Time:27656.0min: [2024-01-03 15:52:06,312][model8_pretrain.py][INFO] Epoch:[0/2](210900/4588595) loss:3.306 lr:0.0000100 epoch_Time:27655.0min: [2024-01-03 15:52:06,313][model8_pretrain.py][INFO] Epoch:[0/2](210900/4588595) loss:2.842 lr:0.0000100 epoch_Time:27655.0min: [2024-01-03 15:52:06,313][model8_pretrain.py][INFO] Epoch:[0/2](210900/4588595) loss:3.157 lr:0.0000100 epoch_Time:27655.0min: [2024-01-03 15:52:06,313][model8_pretrain.py][INFO] Epoch:[0/2](210900/4588595) loss:2.432 lr:0.0000100 epoch_Time:27655.0min: [2024-01-03 15:52:06,313][model8_pretrain.py][INFO] Epoch:[0/2](210900/4588595) loss:3.015 lr:0.0000100 epoch_Time:27655.0min: [2024-01-03 15:52:06,313][model8_pretrain.py][INFO] Epoch:[0/2](210900/4588595) loss:3.123 lr:0.0000100 epoch_Time:27655.0min: [2024-01-03 15:52:06,313][model8_pretrain.py][INFO] Epoch:[0/2](210900/4588595) loss:2.832 lr:0.0000100 epoch_Time:27655.0min: [2024-01-03 15:52:06,313][model8_pretrain.py][INFO] Epoch:[0/2](210900/4588595) loss:2.479 lr:0.0000100 epoch_Time:27655.0min: [2024-01-03 15:52:43,244][model8_pretrain.py][INFO] Epoch:[0/2](211000/4588595) loss:2.754 lr:0.0000100 epoch_Time:27654.0min: [2024-01-03 15:52:43,244][model8_pretrain.py][INFO] Epoch:[0/2](211000/4588595) loss:2.852 lr:0.0000100 
epoch_Time:27654.0min: [2024-01-03 15:52:43,244][model8_pretrain.py][INFO] Epoch:[0/2](211000/4588595) loss:3.121 lr:0.0000100 epoch_Time:27654.0min: [2024-01-03 15:52:43,244][model8_pretrain.py][INFO] Epoch:[0/2](211000/4588595) loss:2.733 lr:0.0000100 epoch_Time:27654.0min: [2024-01-03 15:52:43,244][model8_pretrain.py][INFO] Epoch:[0/2](211000/4588595) loss:3.004 lr:0.0000100 epoch_Time:27654.0min: [2024-01-03 15:52:43,245][model8_pretrain.py][INFO] Epoch:[0/2](211000/4588595) loss:2.815 lr:0.0000100 epoch_Time:27654.0min: [2024-01-03 15:52:43,245][model8_pretrain.py][INFO] Epoch:[0/2](211000/4588595) loss:2.734 lr:0.0000100 epoch_Time:27654.0min: [2024-01-03 15:52:43,246][model8_pretrain.py][INFO] Epoch:[0/2](211000/4588595) loss:2.311 lr:0.0000100 epoch_Time:27654.0min: [2024-01-03 15:53:28,868][model8_pretrain.py][INFO] Epoch:[0/2](211100/4588595) loss:2.543 lr:0.0000100 epoch_Time:27656.0min: [2024-01-03 15:53:28,868][model8_pretrain.py][INFO] Epoch:[0/2](211100/4588595) loss:2.607 lr:0.0000100 epoch_Time:27656.0min: [2024-01-03 15:53:28,868][model8_pretrain.py][INFO] Epoch:[0/2](211100/4588595) loss:3.155 lr:0.0000100 epoch_Time:27656.0min: [2024-01-03 15:53:28,868][model8_pretrain.py][INFO] Epoch:[0/2](211100/4588595) loss:3.145 lr:0.0000100 epoch_Time:27656.0min: [2024-01-03 15:53:28,868][model8_pretrain.py][INFO] Epoch:[0/2](211100/4588595) loss:2.216 lr:0.0000100 epoch_Time:27656.0min: [2024-01-03 15:53:28,868][model8_pretrain.py][INFO] Epoch:[0/2](211100/4588595) loss:2.935 lr:0.0000100 epoch_Time:27656.0min: [2024-01-03 15:53:28,869][model8_pretrain.py][INFO] Epoch:[0/2](211100/4588595) loss:3.132 lr:0.0000100 epoch_Time:27656.0min: [2024-01-03 15:53:28,869][model8_pretrain.py][INFO] Epoch:[0/2](211100/4588595) loss:2.784 lr:0.0000100 epoch_Time:27656.0min: [2024-01-03 15:54:05,813][model8_pretrain.py][INFO] Epoch:[0/2](211200/4588595) loss:2.978 lr:0.0000100 epoch_Time:27655.0min: [2024-01-03 15:54:05,813][model8_pretrain.py][INFO] 
Epoch:[0/2](211200/4588595) loss:2.922 lr:0.0000100 epoch_Time:27655.0min: [2024-01-03 15:54:05,813][model8_pretrain.py][INFO] Epoch:[0/2](211200/4588595) loss:2.679 lr:0.0000100 epoch_Time:27655.0min: [2024-01-03 15:54:05,813][model8_pretrain.py][INFO] Epoch:[0/2](211200/4588595) loss:2.389 lr:0.0000100 epoch_Time:27655.0min: [2024-01-03 15:54:05,813][model8_pretrain.py][INFO] Epoch:[0/2](211200/4588595) loss:3.165 lr:0.0000100 epoch_Time:27655.0min: [2024-01-03 15:54:05,813][model8_pretrain.py][INFO] Epoch:[0/2](211200/4588595) loss:3.088 lr:0.0000100 epoch_Time:27655.0min: [2024-01-03 15:54:05,814][model8_pretrain.py][INFO] Epoch:[0/2](211200/4588595) loss:2.905 lr:0.0000100 epoch_Time:27655.0min: [2024-01-03 15:54:05,814][model8_pretrain.py][INFO] Epoch:[0/2](211200/4588595) loss:2.676 lr:0.0000100 epoch_Time:27655.0min: [2024-01-03 15:54:42,761][model8_pretrain.py][INFO] Epoch:[0/2](211300/4588595) loss:2.569 lr:0.0000100 epoch_Time:27655.0min: [2024-01-03 15:54:42,761][model8_pretrain.py][INFO] Epoch:[0/2](211300/4588595) loss:2.692 lr:0.0000100 epoch_Time:27654.0min: [2024-01-03 15:54:42,761][model8_pretrain.py][INFO] Epoch:[0/2](211300/4588595) loss:2.793 lr:0.0000100 epoch_Time:27654.0min: [2024-01-03 15:54:42,761][model8_pretrain.py][INFO] Epoch:[0/2](211300/4588595) loss:2.821 lr:0.0000100 epoch_Time:27654.0min: [2024-01-03 15:54:42,761][model8_pretrain.py][INFO] Epoch:[0/2](211300/4588595) loss:3.041 lr:0.0000100 epoch_Time:27654.0min: [2024-01-03 15:54:42,761][model8_pretrain.py][INFO] Epoch:[0/2](211300/4588595) loss:3.179 lr:0.0000100 epoch_Time:27655.0min: [2024-01-03 15:54:42,761][model8_pretrain.py][INFO] Epoch:[0/2](211300/4588595) loss:3.234 lr:0.0000100 epoch_Time:27654.0min: [2024-01-03 15:54:42,761][model8_pretrain.py][INFO] Epoch:[0/2](211300/4588595) loss:3.270 lr:0.0000100 epoch_Time:27654.0min: [2024-01-03 15:55:19,708][model8_pretrain.py][INFO] Epoch:[0/2](211400/4588595) loss:3.341 lr:0.0000100 epoch_Time:27653.0min: [2024-01-03 
15:55:19,708][model8_pretrain.py][INFO] Epoch:[0/2](211400/4588595) loss:2.800 lr:0.0000100 epoch_Time:27653.0min: [2024-01-03 15:55:19,708][model8_pretrain.py][INFO] Epoch:[0/2](211400/4588595) loss:3.041 lr:0.0000100 epoch_Time:27653.0min: [2024-01-03 15:55:19,708][model8_pretrain.py][INFO] Epoch:[0/2](211400/4588595) loss:2.626 lr:0.0000100 epoch_Time:27653.0min: [2024-01-03 15:55:19,709][model8_pretrain.py][INFO] Epoch:[0/2](211400/4588595) loss:2.862 lr:0.0000100 epoch_Time:27653.0min: [2024-01-03 15:55:19,709][model8_pretrain.py][INFO] Epoch:[0/2](211400/4588595) loss:2.866 lr:0.0000100 epoch_Time:27653.0min: [2024-01-03 15:55:19,709][model8_pretrain.py][INFO] Epoch:[0/2](211400/4588595) loss:2.943 lr:0.0000100 epoch_Time:27653.0min: [2024-01-03 15:55:19,709][model8_pretrain.py][INFO] Epoch:[0/2](211400/4588595) loss:3.187 lr:0.0000100 epoch_Time:27653.0min: [2024-01-03 15:55:56,655][model8_pretrain.py][INFO] Epoch:[0/2](211500/4588595) loss:3.351 lr:0.0000100 epoch_Time:27652.0min: [2024-01-03 15:55:56,655][model8_pretrain.py][INFO] Epoch:[0/2](211500/4588595) loss:2.950 lr:0.0000100 epoch_Time:27652.0min: [2024-01-03 15:55:56,655][model8_pretrain.py][INFO] Epoch:[0/2](211500/4588595) loss:3.269 lr:0.0000100 epoch_Time:27652.0min: [2024-01-03 15:55:56,655][model8_pretrain.py][INFO] Epoch:[0/2](211500/4588595) loss:3.436 lr:0.0000100 epoch_Time:27652.0min: [2024-01-03 15:55:56,655][model8_pretrain.py][INFO] Epoch:[0/2](211500/4588595) loss:2.756 lr:0.0000100 epoch_Time:27652.0min: [2024-01-03 15:55:56,655][model8_pretrain.py][INFO] Epoch:[0/2](211500/4588595) loss:3.144 lr:0.0000100 epoch_Time:27652.0min: [2024-01-03 15:55:56,655][model8_pretrain.py][INFO] Epoch:[0/2](211500/4588595) loss:2.963 lr:0.0000100 epoch_Time:27652.0min: [2024-01-03 15:55:56,655][model8_pretrain.py][INFO] Epoch:[0/2](211500/4588595) loss:2.789 lr:0.0000100 epoch_Time:27652.0min: [2024-01-03 15:56:33,617][model8_pretrain.py][INFO] Epoch:[0/2](211600/4588595) loss:2.416 lr:0.0000100 
epoch_Time:27651.0min: [2024-01-03 15:56:33,617][model8_pretrain.py][INFO] Epoch:[0/2](211600/4588595) loss:2.588 lr:0.0000100 epoch_Time:27651.0min: [2024-01-03 15:56:33,617][model8_pretrain.py][INFO] Epoch:[0/2](211600/4588595) loss:2.836 lr:0.0000100 epoch_Time:27651.0min: [2024-01-03 15:56:33,617][model8_pretrain.py][INFO] Epoch:[0/2](211600/4588595) loss:2.420 lr:0.0000100 epoch_Time:27651.0min: [2024-01-03 15:56:33,617][model8_pretrain.py][INFO] Epoch:[0/2](211600/4588595) loss:2.764 lr:0.0000100 epoch_Time:27651.0min: [2024-01-03 15:56:33,617][model8_pretrain.py][INFO] Epoch:[0/2](211600/4588595) loss:3.042 lr:0.0000100 epoch_Time:27651.0min: [2024-01-03 15:56:33,617][model8_pretrain.py][INFO] Epoch:[0/2](211600/4588595) loss:2.934 lr:0.0000100 epoch_Time:27651.0min: [2024-01-03 15:56:33,617][model8_pretrain.py][INFO] Epoch:[0/2](211600/4588595) loss:2.792 lr:0.0000100 epoch_Time:27651.0min: [2024-01-03 15:57:10,576][model8_pretrain.py][INFO] Epoch:[0/2](211700/4588595) loss:3.203 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:57:10,576][model8_pretrain.py][INFO] Epoch:[0/2](211700/4588595) loss:2.904 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:57:10,576][model8_pretrain.py][INFO] Epoch:[0/2](211700/4588595) loss:3.033 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:57:10,577][model8_pretrain.py][INFO] Epoch:[0/2](211700/4588595) loss:3.108 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:57:10,576][model8_pretrain.py][INFO] Epoch:[0/2](211700/4588595) loss:3.066 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:57:10,577][model8_pretrain.py][INFO] Epoch:[0/2](211700/4588595) loss:2.822 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:57:10,577][model8_pretrain.py][INFO] Epoch:[0/2](211700/4588595) loss:2.831 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:57:10,577][model8_pretrain.py][INFO] Epoch:[0/2](211700/4588595) loss:3.047 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:57:47,526][model8_pretrain.py][INFO] 
Epoch:[0/2](211800/4588595) loss:3.361 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:57:47,526][model8_pretrain.py][INFO] Epoch:[0/2](211800/4588595) loss:2.695 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:57:47,526][model8_pretrain.py][INFO] Epoch:[0/2](211800/4588595) loss:2.477 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:57:47,526][model8_pretrain.py][INFO] Epoch:[0/2](211800/4588595) loss:2.734 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:57:47,526][model8_pretrain.py][INFO] Epoch:[0/2](211800/4588595) loss:2.838 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:57:47,526][model8_pretrain.py][INFO] Epoch:[0/2](211800/4588595) loss:3.184 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:57:47,527][model8_pretrain.py][INFO] Epoch:[0/2](211800/4588595) loss:3.027 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:57:47,527][model8_pretrain.py][INFO] Epoch:[0/2](211800/4588595) loss:3.033 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:58:33,077][model8_pretrain.py][INFO] Epoch:[0/2](211900/4588595) loss:3.182 lr:0.0000100 epoch_Time:27652.0min: [2024-01-03 15:58:33,077][model8_pretrain.py][INFO] Epoch:[0/2](211900/4588595) loss:2.782 lr:0.0000100 epoch_Time:27652.0min: [2024-01-03 15:58:33,077][model8_pretrain.py][INFO] Epoch:[0/2](211900/4588595) loss:3.140 lr:0.0000100 epoch_Time:27652.0min: [2024-01-03 15:58:33,077][model8_pretrain.py][INFO] Epoch:[0/2](211900/4588595) loss:2.373 lr:0.0000100 epoch_Time:27652.0min: [2024-01-03 15:58:33,077][model8_pretrain.py][INFO] Epoch:[0/2](211900/4588595) loss:2.686 lr:0.0000100 epoch_Time:27652.0min: [2024-01-03 15:58:33,077][model8_pretrain.py][INFO] Epoch:[0/2](211900/4588595) loss:3.478 lr:0.0000100 epoch_Time:27652.0min: [2024-01-03 15:58:33,077][model8_pretrain.py][INFO] Epoch:[0/2](211900/4588595) loss:3.167 lr:0.0000100 epoch_Time:27652.0min: [2024-01-03 15:58:33,078][model8_pretrain.py][INFO] Epoch:[0/2](211900/4588595) loss:3.043 lr:0.0000100 epoch_Time:27652.0min: [2024-01-03 
15:59:10,017][model8_pretrain.py][INFO] Epoch:[0/2](212000/4588595) loss:3.531 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:59:10,017][model8_pretrain.py][INFO] Epoch:[0/2](212000/4588595) loss:2.930 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:59:10,017][model8_pretrain.py][INFO] Epoch:[0/2](212000/4588595) loss:2.958 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:59:10,017][model8_pretrain.py][INFO] Epoch:[0/2](212000/4588595) loss:3.181 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:59:10,017][model8_pretrain.py][INFO] Epoch:[0/2](212000/4588595) loss:3.219 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:59:10,018][model8_pretrain.py][INFO] Epoch:[0/2](212000/4588595) loss:3.245 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:59:10,018][model8_pretrain.py][INFO] Epoch:[0/2](212000/4588595) loss:3.138 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:59:10,018][model8_pretrain.py][INFO] Epoch:[0/2](212000/4588595) loss:2.689 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:59:46,979][model8_pretrain.py][INFO] Epoch:[0/2](212100/4588595) loss:2.621 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:59:46,979][model8_pretrain.py][INFO] Epoch:[0/2](212100/4588595) loss:2.690 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:59:46,979][model8_pretrain.py][INFO] Epoch:[0/2](212100/4588595) loss:2.948 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:59:46,979][model8_pretrain.py][INFO] Epoch:[0/2](212100/4588595) loss:2.914 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:59:46,979][model8_pretrain.py][INFO] Epoch:[0/2](212100/4588595) loss:3.094 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:59:46,979][model8_pretrain.py][INFO] Epoch:[0/2](212100/4588595) loss:2.917 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:59:46,979][model8_pretrain.py][INFO] Epoch:[0/2](212100/4588595) loss:3.308 lr:0.0000100 epoch_Time:27650.0min: [2024-01-03 15:59:46,979][model8_pretrain.py][INFO] Epoch:[0/2](212100/4588595) loss:2.480 lr:0.0000100 
epoch_Time:27650.0min: [2024-01-03 16:00:23,931][model8_pretrain.py][INFO] Epoch:[0/2](212200/4588595) loss:3.393 lr:0.0000100 epoch_Time:27648.0min: [2024-01-03 16:00:23,931][model8_pretrain.py][INFO] Epoch:[0/2](212200/4588595) loss:2.726 lr:0.0000100 epoch_Time:27649.0min: [2024-01-03 16:00:23,931][model8_pretrain.py][INFO] Epoch:[0/2](212200/4588595) loss:2.800 lr:0.0000100 epoch_Time:27648.0min: [2024-01-03 16:00:23,931][model8_pretrain.py][INFO] Epoch:[0/2](212200/4588595) loss:3.429 lr:0.0000100 epoch_Time:27649.0min: [2024-01-03 16:00:23,931][model8_pretrain.py][INFO] Epoch:[0/2](212200/4588595) loss:2.762 lr:0.0000100 epoch_Time:27649.0min: [2024-01-03 16:00:23,931][model8_pretrain.py][INFO] Epoch:[0/2](212200/4588595) loss:2.916 lr:0.0000100 epoch_Time:27649.0min: [2024-01-03 16:00:23,931][model8_pretrain.py][INFO] Epoch:[0/2](212200/4588595) loss:2.385 lr:0.0000100 epoch_Time:27649.0min: [2024-01-03 16:00:23,932][model8_pretrain.py][INFO] Epoch:[0/2](212200/4588595) loss:2.829 lr:0.0000100 epoch_Time:27649.0min: [2024-01-03 16:01:00,888][model8_pretrain.py][INFO] Epoch:[0/2](212300/4588595) loss:3.443 lr:0.0000100 epoch_Time:27647.0min: [2024-01-03 16:01:00,888][model8_pretrain.py][INFO] Epoch:[0/2](212300/4588595) loss:2.748 lr:0.0000100 epoch_Time:27647.0min: [2024-01-03 16:01:00,888][model8_pretrain.py][INFO] Epoch:[0/2](212300/4588595) loss:2.840 lr:0.0000100 epoch_Time:27647.0min: [2024-01-03 16:01:00,888][model8_pretrain.py][INFO] Epoch:[0/2](212300/4588595) loss:3.345 lr:0.0000100 epoch_Time:27647.0min: [2024-01-03 16:01:00,888][model8_pretrain.py][INFO] Epoch:[0/2](212300/4588595) loss:3.164 lr:0.0000100 epoch_Time:27647.0min: [2024-01-03 16:01:00,888][model8_pretrain.py][INFO] Epoch:[0/2](212300/4588595) loss:2.680 lr:0.0000100 epoch_Time:27647.0min: [2024-01-03 16:01:00,888][model8_pretrain.py][INFO] Epoch:[0/2](212300/4588595) loss:3.287 lr:0.0000100 epoch_Time:27647.0min: [2024-01-03 16:01:00,888][model8_pretrain.py][INFO] 
Epoch:[0/2](212300/4588595) loss:3.098 lr:0.0000100 epoch_Time:27647.0min: [2024-01-03 16:01:37,848][model8_pretrain.py][INFO] Epoch:[0/2](212400/4588595) loss:2.919 lr:0.0000100 epoch_Time:27647.0min: [2024-01-03 16:01:37,848][model8_pretrain.py][INFO] Epoch:[0/2](212400/4588595) loss:3.092 lr:0.0000100 epoch_Time:27647.0min: [2024-01-03 16:01:37,848][model8_pretrain.py][INFO] Epoch:[0/2](212400/4588595) loss:2.850 lr:0.0000100 epoch_Time:27647.0min: [2024-01-03 16:01:37,848][model8_pretrain.py][INFO] Epoch:[0/2](212400/4588595) loss:2.852 lr:0.0000100 epoch_Time:27647.0min: [2024-01-03 16:01:37,848][model8_pretrain.py][INFO] Epoch:[0/2](212400/4588595) loss:2.211 lr:0.0000100 epoch_Time:27647.0min: [2024-01-03 16:01:37,848][model8_pretrain.py][INFO] Epoch:[0/2](212400/4588595) loss:2.464 lr:0.0000100 epoch_Time:27647.0min: [2024-01-03 16:01:37,848][model8_pretrain.py][INFO] Epoch:[0/2](212400/4588595) loss:3.216 lr:0.0000100 epoch_Time:27647.0min: [2024-01-03 16:01:37,848][model8_pretrain.py][INFO] Epoch:[0/2](212400/4588595) loss:3.196 lr:0.0000100 epoch_Time:27647.0min: [2024-01-03 16:02:14,799][model8_pretrain.py][INFO] Epoch:[0/2](212500/4588595) loss:2.870 lr:0.0000100 epoch_Time:27645.0min: [2024-01-03 16:02:14,799][model8_pretrain.py][INFO] Epoch:[0/2](212500/4588595) loss:2.653 lr:0.0000100 epoch_Time:27645.0min: [2024-01-03 16:02:14,799][model8_pretrain.py][INFO] Epoch:[0/2](212500/4588595) loss:2.982 lr:0.0000100 epoch_Time:27645.0min: [2024-01-03 16:02:14,799][model8_pretrain.py][INFO] Epoch:[0/2](212500/4588595) loss:2.866 lr:0.0000100 epoch_Time:27645.0min: [2024-01-03 16:02:14,799][model8_pretrain.py][INFO] Epoch:[0/2](212500/4588595) loss:2.898 lr:0.0000100 epoch_Time:27645.0min: [2024-01-03 16:02:14,799][model8_pretrain.py][INFO] Epoch:[0/2](212500/4588595) loss:3.414 lr:0.0000100 epoch_Time:27645.0min: [2024-01-03 16:02:14,800][model8_pretrain.py][INFO] Epoch:[0/2](212500/4588595) loss:2.894 lr:0.0000100 epoch_Time:27645.0min: [2024-01-03 
16:02:14,800][model8_pretrain.py][INFO] Epoch:[0/2](212500/4588595) loss:3.102 lr:0.0000100 epoch_Time:27645.0min: [2024-01-03 16:02:51,760][model8_pretrain.py][INFO] Epoch:[0/2](212600/4588595) loss:3.086 lr:0.0000100 epoch_Time:27644.0min: [2024-01-03 16:02:51,760][model8_pretrain.py][INFO] Epoch:[0/2](212600/4588595) loss:3.051 lr:0.0000100 epoch_Time:27644.0min: [2024-01-03 16:02:51,760][model8_pretrain.py][INFO] Epoch:[0/2](212600/4588595) loss:3.156 lr:0.0000100 epoch_Time:27644.0min: [2024-01-03 16:02:51,760][model8_pretrain.py][INFO] Epoch:[0/2](212600/4588595) loss:2.940 lr:0.0000100 epoch_Time:27644.0min: [2024-01-03 16:02:51,760][model8_pretrain.py][INFO] Epoch:[0/2](212600/4588595) loss:2.700 lr:0.0000100 epoch_Time:27644.0min: [2024-01-03 16:02:51,760][model8_pretrain.py][INFO] Epoch:[0/2](212600/4588595) loss:2.741 lr:0.0000100 epoch_Time:27644.0min: [2024-01-03 16:02:51,760][model8_pretrain.py][INFO] Epoch:[0/2](212600/4588595) loss:3.193 lr:0.0000100 epoch_Time:27644.0min: [2024-01-03 16:02:51,760][model8_pretrain.py][INFO] Epoch:[0/2](212600/4588595) loss:3.597 lr:0.0000100 epoch_Time:27644.0min: [2024-01-03 16:03:37,263][model8_pretrain.py][INFO] Epoch:[0/2](212700/4588595) loss:3.044 lr:0.0000100 epoch_Time:27647.0min: [2024-01-03 16:03:37,263][model8_pretrain.py][INFO] Epoch:[0/2](212700/4588595) loss:2.860 lr:0.0000100 epoch_Time:27647.0min: [2024-01-03 16:03:37,263][model8_pretrain.py][INFO] Epoch:[0/2](212700/4588595) loss:3.062 lr:0.0000100 epoch_Time:27647.0min: [2024-01-03 16:03:37,263][model8_pretrain.py][INFO] Epoch:[0/2](212700/4588595) loss:2.715 lr:0.0000100 epoch_Time:27647.0min: [2024-01-03 16:03:37,263][model8_pretrain.py][INFO] Epoch:[0/2](212700/4588595) loss:3.178 lr:0.0000100 epoch_Time:27647.0min: [2024-01-03 16:03:37,263][model8_pretrain.py][INFO] Epoch:[0/2](212700/4588595) loss:2.896 lr:0.0000100 epoch_Time:27647.0min: [2024-01-03 16:03:37,263][model8_pretrain.py][INFO] Epoch:[0/2](212700/4588595) loss:2.717 lr:0.0000100 
epoch_Time:27647.0min: [2024-01-03 16:03:37,263][model8_pretrain.py][INFO] Epoch:[0/2](212700/4588595) loss:2.968 lr:0.0000100 epoch_Time:27647.0min: [2024-01-03 16:04:14,194][model8_pretrain.py][INFO] Epoch:[0/2](212800/4588595) loss:2.753 lr:0.0000100 epoch_Time:27646.0min: [2024-01-03 16:04:14,194][model8_pretrain.py][INFO] Epoch:[0/2](212800/4588595) loss:3.093 lr:0.0000100 epoch_Time:27646.0min: [2024-01-03 16:04:14,194][model8_pretrain.py][INFO] Epoch:[0/2](212800/4588595) loss:2.668 lr:0.0000100 epoch_Time:27646.0min: [2024-01-03 16:04:14,194][model8_pretrain.py][INFO] Epoch:[0/2](212800/4588595) loss:2.681 lr:0.0000100 epoch_Time:27646.0min: [2024-01-03 16:04:14,194][model8_pretrain.py][INFO] Epoch:[0/2](212800/4588595) loss:3.080 lr:0.0000100 epoch_Time:27646.0min: [2024-01-03 16:04:14,194][model8_pretrain.py][INFO] Epoch:[0/2](212800/4588595) loss:2.930 lr:0.0000100 epoch_Time:27646.0min: [2024-01-03 16:04:14,195][model8_pretrain.py][INFO] Epoch:[0/2](212800/4588595) loss:3.150 lr:0.0000100 epoch_Time:27646.0min: [2024-01-03 16:04:14,196][model8_pretrain.py][INFO] Epoch:[0/2](212800/4588595) loss:2.797 lr:0.0000100 epoch_Time:27646.0min: [2024-01-03 16:04:51,076][model8_pretrain.py][INFO] Epoch:[0/2](212900/4588595) loss:3.037 lr:0.0000100 epoch_Time:27644.0min: [2024-01-03 16:04:51,076][model8_pretrain.py][INFO] Epoch:[0/2](212900/4588595) loss:2.706 lr:0.0000100 epoch_Time:27644.0min: [2024-01-03 16:04:51,076][model8_pretrain.py][INFO] Epoch:[0/2](212900/4588595) loss:3.097 lr:0.0000100 epoch_Time:27644.0min: [2024-01-03 16:04:51,076][model8_pretrain.py][INFO] Epoch:[0/2](212900/4588595) loss:2.583 lr:0.0000100 epoch_Time:27644.0min: [2024-01-03 16:04:51,076][model8_pretrain.py][INFO] Epoch:[0/2](212900/4588595) loss:3.127 lr:0.0000100 epoch_Time:27644.0min: [2024-01-03 16:04:51,076][model8_pretrain.py][INFO] Epoch:[0/2](212900/4588595) loss:3.100 lr:0.0000100 epoch_Time:27644.0min: [2024-01-03 16:04:51,076][model8_pretrain.py][INFO] 
Epoch:[0/2](212900/4588595) loss:2.884 lr:0.0000100 epoch_Time:27644.0min: [2024-01-03 16:04:51,076][model8_pretrain.py][INFO] Epoch:[0/2](212900/4588595) loss:2.500 lr:0.0000100 epoch_Time:27644.0min: [2024-01-03 16:05:28,018][model8_pretrain.py][INFO] Epoch:[0/2](213000/4588595) loss:3.067 lr:0.0000100 epoch_Time:27644.0min: [2024-01-03 16:05:28,018][model8_pretrain.py][INFO] Epoch:[0/2](213000/4588595) loss:3.263 lr:0.0000100 epoch_Time:27644.0min: [2024-01-03 16:05:28,018][model8_pretrain.py][INFO] Epoch:[0/2](213000/4588595) loss:2.839 lr:0.0000100 epoch_Time:27644.0min: [2024-01-03 16:05:28,018][model8_pretrain.py][INFO] Epoch:[0/2](213000/4588595) loss:3.017 lr:0.0000100 epoch_Time:27644.0min: [2024-01-03 16:05:28,018][model8_pretrain.py][INFO] Epoch:[0/2](213000/4588595) loss:2.877 lr:0.0000100 epoch_Time:27644.0min: [2024-01-03 16:05:28,018][model8_pretrain.py][INFO] Epoch:[0/2](213000/4588595) loss:3.487 lr:0.0000100 epoch_Time:27644.0min: [2024-01-03 16:05:28,018][model8_pretrain.py][INFO] Epoch:[0/2](213000/4588595) loss:3.333 lr:0.0000100 epoch_Time:27644.0min: [2024-01-03 16:05:28,018][model8_pretrain.py][INFO] Epoch:[0/2](213000/4588595) loss:3.037 lr:0.0000100 epoch_Time:27644.0min: [2024-01-03 16:06:04,961][model8_pretrain.py][INFO] Epoch:[0/2](213100/4588595) loss:3.230 lr:0.0000100 epoch_Time:27642.0min: [2024-01-03 16:06:04,961][model8_pretrain.py][INFO] Epoch:[0/2](213100/4588595) loss:3.521 lr:0.0000100 epoch_Time:27642.0min: [2024-01-03 16:06:04,961][model8_pretrain.py][INFO] Epoch:[0/2](213100/4588595) loss:3.291 lr:0.0000100 epoch_Time:27642.0min: [2024-01-03 16:06:04,962][model8_pretrain.py][INFO] Epoch:[0/2](213100/4588595) loss:2.966 lr:0.0000100 epoch_Time:27642.0min: [2024-01-03 16:06:04,962][model8_pretrain.py][INFO] Epoch:[0/2](213100/4588595) loss:3.098 lr:0.0000100 epoch_Time:27642.0min: [2024-01-03 16:06:04,962][model8_pretrain.py][INFO] Epoch:[0/2](213100/4588595) loss:3.222 lr:0.0000100 epoch_Time:27642.0min: [2024-01-03 
16:06:04,962][model8_pretrain.py][INFO] Epoch:[0/2](213100/4588595) loss:3.295 lr:0.0000100 epoch_Time:27642.0min: [2024-01-03 16:06:04,962][model8_pretrain.py][INFO] Epoch:[0/2](213100/4588595) loss:2.451 lr:0.0000100 epoch_Time:27642.0min: [2024-01-03 16:06:41,894][model8_pretrain.py][INFO] Epoch:[0/2](213200/4588595) loss:3.128 lr:0.0000100 epoch_Time:27642.0min: [2024-01-03 16:06:41,894][model8_pretrain.py][INFO] Epoch:[0/2](213200/4588595) loss:3.007 lr:0.0000100 epoch_Time:27642.0min: [2024-01-03 16:06:41,894][model8_pretrain.py][INFO] Epoch:[0/2](213200/4588595) loss:2.718 lr:0.0000100 epoch_Time:27642.0min: [2024-01-03 16:06:41,894][model8_pretrain.py][INFO] Epoch:[0/2](213200/4588595) loss:2.899 lr:0.0000100 epoch_Time:27642.0min: [2024-01-03 16:06:41,894][model8_pretrain.py][INFO] Epoch:[0/2](213200/4588595) loss:3.115 lr:0.0000100 epoch_Time:27642.0min: [2024-01-03 16:06:41,894][model8_pretrain.py][INFO] Epoch:[0/2](213200/4588595) loss:3.312 lr:0.0000100 epoch_Time:27642.0min: [2024-01-03 16:06:41,894][model8_pretrain.py][INFO] Epoch:[0/2](213200/4588595) loss:3.274 lr:0.0000100 epoch_Time:27642.0min: [2024-01-03 16:06:41,894][model8_pretrain.py][INFO] Epoch:[0/2](213200/4588595) loss:3.202 lr:0.0000100 epoch_Time:27642.0min: [2024-01-03 16:07:18,835][model8_pretrain.py][INFO] Epoch:[0/2](213300/4588595) loss:2.709 lr:0.0000100 epoch_Time:27641.0min: [2024-01-03 16:07:18,835][model8_pretrain.py][INFO] Epoch:[0/2](213300/4588595) loss:2.954 lr:0.0000100 epoch_Time:27641.0min: [2024-01-03 16:07:18,835][model8_pretrain.py][INFO] Epoch:[0/2](213300/4588595) loss:2.717 lr:0.0000100 epoch_Time:27641.0min: [2024-01-03 16:07:18,835][model8_pretrain.py][INFO] Epoch:[0/2](213300/4588595) loss:2.687 lr:0.0000100 epoch_Time:27641.0min: [2024-01-03 16:07:18,835][model8_pretrain.py][INFO] Epoch:[0/2](213300/4588595) loss:3.302 lr:0.0000100 epoch_Time:27641.0min: [2024-01-03 16:07:18,835][model8_pretrain.py][INFO] Epoch:[0/2](213300/4588595) loss:3.171 lr:0.0000100 
epoch_Time:27641.0min: [2024-01-03 16:07:18,835][model8_pretrain.py][INFO] Epoch:[0/2](213300/4588595) loss:3.055 lr:0.0000100 epoch_Time:27641.0min: [2024-01-03 16:07:18,835][model8_pretrain.py][INFO] Epoch:[0/2](213300/4588595) loss:2.647 lr:0.0000100 epoch_Time:27641.0min: [2024-01-03 16:07:55,770][model8_pretrain.py][INFO] Epoch:[0/2](213400/4588595) loss:3.234 lr:0.0000100 epoch_Time:27639.0min: [2024-01-03 16:07:55,770][model8_pretrain.py][INFO] Epoch:[0/2](213400/4588595) loss:3.361 lr:0.0000100 epoch_Time:27639.0min: [2024-01-03 16:07:55,770][model8_pretrain.py][INFO] Epoch:[0/2](213400/4588595) loss:3.133 lr:0.0000100 epoch_Time:27639.0min: [2024-01-03 16:07:55,770][model8_pretrain.py][INFO] Epoch:[0/2](213400/4588595) loss:2.957 lr:0.0000100 epoch_Time:27639.0min: [2024-01-03 16:07:55,770][model8_pretrain.py][INFO] Epoch:[0/2](213400/4588595) loss:2.802 lr:0.0000100 epoch_Time:27639.0min: [2024-01-03 16:07:55,770][model8_pretrain.py][INFO] Epoch:[0/2](213400/4588595) loss:2.620 lr:0.0000100 epoch_Time:27639.0min: [2024-01-03 16:07:55,770][model8_pretrain.py][INFO] Epoch:[0/2](213400/4588595) loss:2.526 lr:0.0000100 epoch_Time:27639.0min: [2024-01-03 16:07:55,770][model8_pretrain.py][INFO] Epoch:[0/2](213400/4588595) loss:3.140 lr:0.0000100 epoch_Time:27639.0min: [2024-01-03 16:08:41,067][model8_pretrain.py][INFO] Epoch:[0/2](213500/4588595) loss:3.000 lr:0.0000100 epoch_Time:27642.0min: [2024-01-03 16:08:41,067][model8_pretrain.py][INFO] Epoch:[0/2](213500/4588595) loss:3.203 lr:0.0000100 epoch_Time:27642.0min: [2024-01-03 16:08:41,067][model8_pretrain.py][INFO] Epoch:[0/2](213500/4588595) loss:2.554 lr:0.0000100 epoch_Time:27642.0min: [2024-01-03 16:08:41,067][model8_pretrain.py][INFO] Epoch:[0/2](213500/4588595) loss:3.493 lr:0.0000100 epoch_Time:27642.0min: [2024-01-03 16:08:41,067][model8_pretrain.py][INFO] Epoch:[0/2](213500/4588595) loss:2.629 lr:0.0000100 epoch_Time:27642.0min: [2024-01-03 16:08:41,067][model8_pretrain.py][INFO] 
Epoch:[0/2](213500/4588595) loss:2.814 lr:0.0000100 epoch_Time:27642.0min: [2024-01-03 16:08:41,067][model8_pretrain.py][INFO] Epoch:[0/2](213500/4588595) loss:2.759 lr:0.0000100 epoch_Time:27642.0min: [2024-01-03 16:08:41,067][model8_pretrain.py][INFO] Epoch:[0/2](213500/4588595) loss:2.644 lr:0.0000100 epoch_Time:27642.0min: [2024-01-03 16:09:18,001][model8_pretrain.py][INFO] Epoch:[0/2](213600/4588595) loss:3.028 lr:0.0000100 epoch_Time:27641.0min: [2024-01-03 16:09:18,001][model8_pretrain.py][INFO] Epoch:[0/2](213600/4588595) loss:2.690 lr:0.0000100 epoch_Time:27641.0min: [2024-01-03 16:09:18,001][model8_pretrain.py][INFO] Epoch:[0/2](213600/4588595) loss:3.589 lr:0.0000100 epoch_Time:27641.0min: [2024-01-03 16:09:18,001][model8_pretrain.py][INFO] Epoch:[0/2](213600/4588595) loss:3.011 lr:0.0000100 epoch_Time:27641.0min: [2024-01-03 16:09:18,001][model8_pretrain.py][INFO] Epoch:[0/2](213600/4588595) loss:2.671 lr:0.0000100 epoch_Time:27641.0min: [2024-01-03 16:09:18,001][model8_pretrain.py][INFO] Epoch:[0/2](213600/4588595) loss:3.053 lr:0.0000100 epoch_Time:27641.0min: [2024-01-03 16:09:18,001][model8_pretrain.py][INFO] Epoch:[0/2](213600/4588595) loss:2.038 lr:0.0000100 epoch_Time:27641.0min: [2024-01-03 16:09:18,001][model8_pretrain.py][INFO] Epoch:[0/2](213600/4588595) loss:3.194 lr:0.0000100 epoch_Time:27641.0min: [2024-01-03 16:09:54,929][model8_pretrain.py][INFO] Epoch:[0/2](213700/4588595) loss:2.944 lr:0.0000100 epoch_Time:27639.0min: [2024-01-03 16:09:54,929][model8_pretrain.py][INFO] Epoch:[0/2](213700/4588595) loss:2.564 lr:0.0000100 epoch_Time:27639.0min: [2024-01-03 16:09:54,929][model8_pretrain.py][INFO] Epoch:[0/2](213700/4588595) loss:3.165 lr:0.0000100 epoch_Time:27639.0min: [2024-01-03 16:09:54,929][model8_pretrain.py][INFO] Epoch:[0/2](213700/4588595) loss:3.476 lr:0.0000100 epoch_Time:27639.0min: [2024-01-03 16:09:54,929][model8_pretrain.py][INFO] Epoch:[0/2](213700/4588595) loss:2.848 lr:0.0000100 epoch_Time:27639.0min: [2024-01-03 
16:09:54,929][model8_pretrain.py][INFO] Epoch:[0/2](213700/4588595) loss:3.145 lr:0.0000100 epoch_Time:27639.0min: [2024-01-03 16:09:54,929][model8_pretrain.py][INFO] Epoch:[0/2](213700/4588595) loss:3.078 lr:0.0000100 epoch_Time:27639.0min: [2024-01-03 16:09:54,929][model8_pretrain.py][INFO] Epoch:[0/2](213700/4588595) loss:3.260 lr:0.0000100 epoch_Time:27639.0min: [2024-01-03 16:10:31,870][model8_pretrain.py][INFO] Epoch:[0/2](213800/4588595) loss:3.159 lr:0.0000100 epoch_Time:27639.0min: [2024-01-03 16:10:31,870][model8_pretrain.py][INFO] Epoch:[0/2](213800/4588595) loss:2.813 lr:0.0000100 epoch_Time:27639.0min: [2024-01-03 16:10:31,870][model8_pretrain.py][INFO] Epoch:[0/2](213800/4588595) loss:2.595 lr:0.0000100 epoch_Time:27639.0min: [2024-01-03 16:10:31,870][model8_pretrain.py][INFO] Epoch:[0/2](213800/4588595) loss:3.307 lr:0.0000100 epoch_Time:27639.0min: [2024-01-03 16:10:31,870][model8_pretrain.py][INFO] Epoch:[0/2](213800/4588595) loss:3.111 lr:0.0000100 epoch_Time:27639.0min: [2024-01-03 16:10:31,870][model8_pretrain.py][INFO] Epoch:[0/2](213800/4588595) loss:3.254 lr:0.0000100 epoch_Time:27639.0min: [2024-01-03 16:10:31,870][model8_pretrain.py][INFO] Epoch:[0/2](213800/4588595) loss:2.462 lr:0.0000100 epoch_Time:27639.0min: [2024-01-03 16:10:31,870][model8_pretrain.py][INFO] Epoch:[0/2](213800/4588595) loss:2.963 lr:0.0000100 epoch_Time:27639.0min: [2024-01-03 16:11:08,809][model8_pretrain.py][INFO] Epoch:[0/2](213900/4588595) loss:3.201 lr:0.0000100 epoch_Time:27638.0min: [2024-01-03 16:11:08,809][model8_pretrain.py][INFO] Epoch:[0/2](213900/4588595) loss:2.967 lr:0.0000100 epoch_Time:27638.0min: [2024-01-03 16:11:08,809][model8_pretrain.py][INFO] Epoch:[0/2](213900/4588595) loss:2.459 lr:0.0000100 epoch_Time:27638.0min: [2024-01-03 16:11:08,809][model8_pretrain.py][INFO] Epoch:[0/2](213900/4588595) loss:2.851 lr:0.0000100 epoch_Time:27638.0min: [2024-01-03 16:11:08,809][model8_pretrain.py][INFO] Epoch:[0/2](213900/4588595) loss:3.175 lr:0.0000100 
epoch_Time:27638.0min: [2024-01-03 16:11:08,809][model8_pretrain.py][INFO] Epoch:[0/2](213900/4588595) loss:3.116 lr:0.0000100 epoch_Time:27638.0min: [2024-01-03 16:11:08,809][model8_pretrain.py][INFO] Epoch:[0/2](213900/4588595) loss:2.949 lr:0.0000100 epoch_Time:27638.0min: [2024-01-03 16:11:08,809][model8_pretrain.py][INFO] Epoch:[0/2](213900/4588595) loss:2.640 lr:0.0000100 epoch_Time:27638.0min: [2024-01-03 16:11:45,751][model8_pretrain.py][INFO] Epoch:[0/2](214000/4588595) loss:3.383 lr:0.0000100 epoch_Time:27637.0min: [2024-01-03 16:11:45,751][model8_pretrain.py][INFO] Epoch:[0/2](214000/4588595) loss:2.270 lr:0.0000100 epoch_Time:27637.0min: [2024-01-03 16:11:45,752][model8_pretrain.py][INFO] Epoch:[0/2](214000/4588595) loss:3.131 lr:0.0000100 epoch_Time:27637.0min: [2024-01-03 16:11:45,752][model8_pretrain.py][INFO] Epoch:[0/2](214000/4588595) loss:2.663 lr:0.0000100 epoch_Time:27637.0min: [2024-01-03 16:11:45,752][model8_pretrain.py][INFO] Epoch:[0/2](214000/4588595) loss:3.143 lr:0.0000100 epoch_Time:27637.0min: [2024-01-03 16:11:45,752][model8_pretrain.py][INFO] Epoch:[0/2](214000/4588595) loss:3.213 lr:0.0000100 epoch_Time:27637.0min: [2024-01-03 16:11:45,752][model8_pretrain.py][INFO] Epoch:[0/2](214000/4588595) loss:3.258 lr:0.0000100 epoch_Time:27637.0min: [2024-01-03 16:11:45,753][model8_pretrain.py][INFO] Epoch:[0/2](214000/4588595) loss:3.464 lr:0.0000100 epoch_Time:27637.0min: [2024-01-03 16:12:22,690][model8_pretrain.py][INFO] Epoch:[0/2](214100/4588595) loss:2.636 lr:0.0000100 epoch_Time:27636.0min: [2024-01-03 16:12:22,690][model8_pretrain.py][INFO] Epoch:[0/2](214100/4588595) loss:1.535 lr:0.0000100 epoch_Time:27636.0min: [2024-01-03 16:12:22,690][model8_pretrain.py][INFO] Epoch:[0/2](214100/4588595) loss:3.421 lr:0.0000100 epoch_Time:27636.0min: [2024-01-03 16:12:22,690][model8_pretrain.py][INFO] Epoch:[0/2](214100/4588595) loss:2.529 lr:0.0000100 epoch_Time:27636.0min: [2024-01-03 16:12:22,690][model8_pretrain.py][INFO] 
Epoch:[0/2](214100/4588595) loss:2.516 lr:0.0000100 epoch_Time:27636.0min: [2024-01-03 16:12:22,690][model8_pretrain.py][INFO] Epoch:[0/2](214100/4588595) loss:3.247 lr:0.0000100 epoch_Time:27636.0min: [2024-01-03 16:12:22,690][model8_pretrain.py][INFO] Epoch:[0/2](214100/4588595) loss:2.605 lr:0.0000100 epoch_Time:27636.0min: [2024-01-03 16:12:22,690][model8_pretrain.py][INFO] Epoch:[0/2](214100/4588595) loss:2.983 lr:0.0000100 epoch_Time:27636.0min: [2024-01-03 16:12:59,627][model8_pretrain.py][INFO] Epoch:[0/2](214200/4588595) loss:3.202 lr:0.0000100 epoch_Time:27635.0min: [2024-01-03 16:12:59,627][model8_pretrain.py][INFO] Epoch:[0/2](214200/4588595) loss:3.141 lr:0.0000100 epoch_Time:27635.0min: [2024-01-03 16:12:59,627][model8_pretrain.py][INFO] Epoch:[0/2](214200/4588595) loss:2.722 lr:0.0000100 epoch_Time:27635.0min: [2024-01-03 16:12:59,627][model8_pretrain.py][INFO] Epoch:[0/2](214200/4588595) loss:2.287 lr:0.0000100 epoch_Time:27635.0min: [2024-01-03 16:12:59,627][model8_pretrain.py][INFO] Epoch:[0/2](214200/4588595) loss:2.864 lr:0.0000100 epoch_Time:27635.0min: [2024-01-03 16:12:59,627][model8_pretrain.py][INFO] Epoch:[0/2](214200/4588595) loss:3.234 lr:0.0000100 epoch_Time:27635.0min: [2024-01-03 16:12:59,627][model8_pretrain.py][INFO] Epoch:[0/2](214200/4588595) loss:2.817 lr:0.0000100 epoch_Time:27635.0min: [2024-01-03 16:12:59,628][model8_pretrain.py][INFO] Epoch:[0/2](214200/4588595) loss:3.254 lr:0.0000100 epoch_Time:27635.0min: [2024-01-03 16:13:45,010][model8_pretrain.py][INFO] Epoch:[0/2](214300/4588595) loss:2.608 lr:0.0000100 epoch_Time:27637.0min: [2024-01-03 16:13:45,010][model8_pretrain.py][INFO] Epoch:[0/2](214300/4588595) loss:3.255 lr:0.0000100 epoch_Time:27637.0min: [2024-01-03 16:13:45,010][model8_pretrain.py][INFO] Epoch:[0/2](214300/4588595) loss:2.628 lr:0.0000100 epoch_Time:27637.0min: [2024-01-03 16:13:45,010][model8_pretrain.py][INFO] Epoch:[0/2](214300/4588595) loss:2.851 lr:0.0000100 epoch_Time:27637.0min: [2024-01-03 
16:13:45,010][model8_pretrain.py][INFO] Epoch:[0/2](214300/4588595) loss:2.472 lr:0.0000100 epoch_Time:27637.0min: [2024-01-03 16:13:45,010][model8_pretrain.py][INFO] Epoch:[0/2](214300/4588595) loss:3.305 lr:0.0000100 epoch_Time:27637.0min: [2024-01-03 16:13:45,010][model8_pretrain.py][INFO] Epoch:[0/2](214300/4588595) loss:3.316 lr:0.0000100 epoch_Time:27637.0min: [2024-01-03 16:13:45,010][model8_pretrain.py][INFO] Epoch:[0/2](214300/4588595) loss:2.676 lr:0.0000100 epoch_Time:27637.0min: [2024-01-03 16:14:21,953][model8_pretrain.py][INFO] Epoch:[0/2](214400/4588595) loss:2.669 lr:0.0000100 epoch_Time:27636.0min: [2024-01-03 16:14:21,953][model8_pretrain.py][INFO] Epoch:[0/2](214400/4588595) loss:2.641 lr:0.0000100 epoch_Time:27636.0min: [2024-01-03 16:14:21,953][model8_pretrain.py][INFO] Epoch:[0/2](214400/4588595) loss:3.514 lr:0.0000100 epoch_Time:27636.0min: [2024-01-03 16:14:21,953][model8_pretrain.py][INFO] Epoch:[0/2](214400/4588595) loss:2.204 lr:0.0000100 epoch_Time:27636.0min: [2024-01-03 16:14:21,953][model8_pretrain.py][INFO] Epoch:[0/2](214400/4588595) loss:2.884 lr:0.0000100 epoch_Time:27636.0min: [2024-01-03 16:14:21,953][model8_pretrain.py][INFO] Epoch:[0/2](214400/4588595) loss:3.086 lr:0.0000100 epoch_Time:27636.0min: [2024-01-03 16:14:21,953][model8_pretrain.py][INFO] Epoch:[0/2](214400/4588595) loss:2.779 lr:0.0000100 epoch_Time:27636.0min: [2024-01-03 16:14:21,953][model8_pretrain.py][INFO] Epoch:[0/2](214400/4588595) loss:3.180 lr:0.0000100 epoch_Time:27636.0min: [2024-01-03 16:14:58,887][model8_pretrain.py][INFO] Epoch:[0/2](214500/4588595) loss:3.125 lr:0.0000100 epoch_Time:27635.0min: [2024-01-03 16:14:58,887][model8_pretrain.py][INFO] Epoch:[0/2](214500/4588595) loss:3.050 lr:0.0000100 epoch_Time:27635.0min: [2024-01-03 16:14:58,887][model8_pretrain.py][INFO] Epoch:[0/2](214500/4588595) loss:2.427 lr:0.0000100 epoch_Time:27635.0min: [2024-01-03 16:14:58,887][model8_pretrain.py][INFO] Epoch:[0/2](214500/4588595) loss:3.072 lr:0.0000100 
epoch_Time:27635.0min: [2024-01-03 16:14:58,887][model8_pretrain.py][INFO] Epoch:[0/2](214500/4588595) loss:3.177 lr:0.0000100 epoch_Time:27635.0min: [2024-01-03 16:14:58,887][model8_pretrain.py][INFO] Epoch:[0/2](214500/4588595) loss:3.040 lr:0.0000100 epoch_Time:27635.0min: [2024-01-03 16:14:58,887][model8_pretrain.py][INFO] Epoch:[0/2](214500/4588595) loss:2.570 lr:0.0000100 epoch_Time:27635.0min: [2024-01-03 16:14:58,887][model8_pretrain.py][INFO] Epoch:[0/2](214500/4588595) loss:2.613 lr:0.0000100 epoch_Time:27635.0min: [2024-01-03 16:15:35,817][model8_pretrain.py][INFO] Epoch:[0/2](214600/4588595) loss:3.114 lr:0.0000100 epoch_Time:27634.0min: [2024-01-03 16:15:35,817][model8_pretrain.py][INFO] Epoch:[0/2](214600/4588595) loss:2.587 lr:0.0000100 epoch_Time:27634.0min: [2024-01-03 16:15:35,817][model8_pretrain.py][INFO] Epoch:[0/2](214600/4588595) loss:3.128 lr:0.0000100 epoch_Time:27634.0min: [2024-01-03 16:15:35,817][model8_pretrain.py][INFO] Epoch:[0/2](214600/4588595) loss:2.755 lr:0.0000100 epoch_Time:27634.0min: [2024-01-03 16:15:35,817][model8_pretrain.py][INFO] Epoch:[0/2](214600/4588595) loss:2.813 lr:0.0000100 epoch_Time:27634.0min: [2024-01-03 16:15:35,817][model8_pretrain.py][INFO] Epoch:[0/2](214600/4588595) loss:2.894 lr:0.0000100 epoch_Time:27634.0min: [2024-01-03 16:15:35,817][model8_pretrain.py][INFO] Epoch:[0/2](214600/4588595) loss:2.535 lr:0.0000100 epoch_Time:27634.0min: [2024-01-03 16:15:35,817][model8_pretrain.py][INFO] Epoch:[0/2](214600/4588595) loss:2.917 lr:0.0000100 epoch_Time:27634.0min: [2024-01-03 16:16:12,741][model8_pretrain.py][INFO] Epoch:[0/2](214700/4588595) loss:2.813 lr:0.0000100 epoch_Time:27633.0min: [2024-01-03 16:16:12,741][model8_pretrain.py][INFO] Epoch:[0/2](214700/4588595) loss:2.993 lr:0.0000100 epoch_Time:27633.0min: [2024-01-03 16:16:12,741][model8_pretrain.py][INFO] Epoch:[0/2](214700/4588595) loss:2.828 lr:0.0000100 epoch_Time:27633.0min: [2024-01-03 16:16:12,741][model8_pretrain.py][INFO] 
Epoch:[0/2](214700/4588595) loss:3.128 lr:0.0000100 epoch_Time:27633.0min: [2024-01-03 16:16:12,741][model8_pretrain.py][INFO] Epoch:[0/2](214700/4588595) loss:2.698 lr:0.0000100 epoch_Time:27633.0min: [2024-01-03 16:16:12,741][model8_pretrain.py][INFO] Epoch:[0/2](214700/4588595) loss:3.228 lr:0.0000100 epoch_Time:27633.0min: [2024-01-03 16:16:12,742][model8_pretrain.py][INFO] Epoch:[0/2](214700/4588595) loss:2.328 lr:0.0000100 epoch_Time:27633.0min: [2024-01-03 16:16:12,742][model8_pretrain.py][INFO] Epoch:[0/2](214700/4588595) loss:2.726 lr:0.0000100 epoch_Time:27633.0min: [2024-01-03 16:16:49,667][model8_pretrain.py][INFO] Epoch:[0/2](214800/4588595) loss:2.828 lr:0.0000100 epoch_Time:27632.0min: [2024-01-03 16:16:49,667][model8_pretrain.py][INFO] Epoch:[0/2](214800/4588595) loss:2.641 lr:0.0000100 epoch_Time:27632.0min: [2024-01-03 16:16:49,667][model8_pretrain.py][INFO] Epoch:[0/2](214800/4588595) loss:2.486 lr:0.0000100 epoch_Time:27632.0min: [2024-01-03 16:16:49,667][model8_pretrain.py][INFO] Epoch:[0/2](214800/4588595) loss:2.817 lr:0.0000100 epoch_Time:27632.0min: [2024-01-03 16:16:49,667][model8_pretrain.py][INFO] Epoch:[0/2](214800/4588595) loss:2.987 lr:0.0000100 epoch_Time:27632.0min: [2024-01-03 16:16:49,667][model8_pretrain.py][INFO] Epoch:[0/2](214800/4588595) loss:2.665 lr:0.0000100 epoch_Time:27632.0min: [2024-01-03 16:16:49,667][model8_pretrain.py][INFO] Epoch:[0/2](214800/4588595) loss:2.955 lr:0.0000100 epoch_Time:27632.0min: [2024-01-03 16:16:49,667][model8_pretrain.py][INFO] Epoch:[0/2](214800/4588595) loss:2.713 lr:0.0000100 epoch_Time:27632.0min: [2024-01-03 16:17:26,606][model8_pretrain.py][INFO] Epoch:[0/2](214900/4588595) loss:3.150 lr:0.0000100 epoch_Time:27631.0min: [2024-01-03 16:17:26,606][model8_pretrain.py][INFO] Epoch:[0/2](214900/4588595) loss:3.216 lr:0.0000100 epoch_Time:27631.0min: [2024-01-03 16:17:26,606][model8_pretrain.py][INFO] Epoch:[0/2](214900/4588595) loss:2.857 lr:0.0000100 epoch_Time:27631.0min: [2024-01-03 
16:17:26,606][model8_pretrain.py][INFO] Epoch:[0/2](214900/4588595) loss:2.505 lr:0.0000100 epoch_Time:27631.0min: [2024-01-03 16:17:26,606][model8_pretrain.py][INFO] Epoch:[0/2](214900/4588595) loss:3.010 lr:0.0000100 epoch_Time:27631.0min: [2024-01-03 16:17:26,606][model8_pretrain.py][INFO] Epoch:[0/2](214900/4588595) loss:3.284 lr:0.0000100 epoch_Time:27631.0min: [2024-01-03 16:17:26,607][model8_pretrain.py][INFO] Epoch:[0/2](214900/4588595) loss:3.107 lr:0.0000100 epoch_Time:27631.0min: [2024-01-03 16:17:26,607][model8_pretrain.py][INFO] Epoch:[0/2](214900/4588595) loss:2.989 lr:0.0000100 epoch_Time:27631.0min: [2024-01-03 16:18:03,543][model8_pretrain.py][INFO] Epoch:[0/2](215000/4588595) loss:3.279 lr:0.0000100 epoch_Time:27630.0min: [2024-01-03 16:18:03,543][model8_pretrain.py][INFO] Epoch:[0/2](215000/4588595) loss:2.777 lr:0.0000100 epoch_Time:27630.0min: [2024-01-03 16:18:03,543][model8_pretrain.py][INFO] Epoch:[0/2](215000/4588595) loss:2.826 lr:0.0000100 epoch_Time:27630.0min: [2024-01-03 16:18:03,543][model8_pretrain.py][INFO] Epoch:[0/2](215000/4588595) loss:3.255 lr:0.0000100 epoch_Time:27630.0min: [2024-01-03 16:18:03,543][model8_pretrain.py][INFO] Epoch:[0/2](215000/4588595) loss:2.548 lr:0.0000100 epoch_Time:27630.0min: [2024-01-03 16:18:03,543][model8_pretrain.py][INFO] Epoch:[0/2](215000/4588595) loss:2.835 lr:0.0000100 epoch_Time:27630.0min: [2024-01-03 16:18:03,543][model8_pretrain.py][INFO] Epoch:[0/2](215000/4588595) loss:2.987 lr:0.0000100 epoch_Time:27630.0min: [2024-01-03 16:18:03,543][model8_pretrain.py][INFO] Epoch:[0/2](215000/4588595) loss:2.682 lr:0.0000100 epoch_Time:27630.0min: [2024-01-03 16:18:48,532][model8_pretrain.py][INFO] Epoch:[0/2](215100/4588595) loss:2.972 lr:0.0000100 epoch_Time:27631.0min: [2024-01-03 16:18:48,532][model8_pretrain.py][INFO] Epoch:[0/2](215100/4588595) loss:3.183 lr:0.0000100 epoch_Time:27631.0min: [2024-01-03 16:18:48,532][model8_pretrain.py][INFO] Epoch:[0/2](215100/4588595) loss:2.904 lr:0.0000100 
epoch_Time:27631.0min: [2024-01-03 16:18:48,532][model8_pretrain.py][INFO] Epoch:[0/2](215100/4588595) loss:2.558 lr:0.0000100 epoch_Time:27631.0min: [2024-01-03 16:18:48,532][model8_pretrain.py][INFO] Epoch:[0/2](215100/4588595) loss:3.154 lr:0.0000100 epoch_Time:27631.0min: [2024-01-03 16:18:48,532][model8_pretrain.py][INFO] Epoch:[0/2](215100/4588595) loss:3.043 lr:0.0000100 epoch_Time:27631.0min: [2024-01-03 16:18:48,532][model8_pretrain.py][INFO] Epoch:[0/2](215100/4588595) loss:3.142 lr:0.0000100 epoch_Time:27631.0min: [2024-01-03 16:18:48,533][model8_pretrain.py][INFO] Epoch:[0/2](215100/4588595) loss:3.188 lr:0.0000100 epoch_Time:27631.0min: [2024-01-03 16:19:25,469][model8_pretrain.py][INFO] Epoch:[0/2](215200/4588595) loss:3.200 lr:0.0000100 epoch_Time:27631.0min: [2024-01-03 16:19:25,469][model8_pretrain.py][INFO] Epoch:[0/2](215200/4588595) loss:3.029 lr:0.0000100 epoch_Time:27631.0min: [2024-01-03 16:19:25,469][model8_pretrain.py][INFO] Epoch:[0/2](215200/4588595) loss:2.719 lr:0.0000100 epoch_Time:27631.0min: [2024-01-03 16:19:25,469][model8_pretrain.py][INFO] Epoch:[0/2](215200/4588595) loss:3.012 lr:0.0000100 epoch_Time:27631.0min: [2024-01-03 16:19:25,469][model8_pretrain.py][INFO] Epoch:[0/2](215200/4588595) loss:3.227 lr:0.0000100 epoch_Time:27631.0min: [2024-01-03 16:19:25,469][model8_pretrain.py][INFO] Epoch:[0/2](215200/4588595) loss:3.123 lr:0.0000100 epoch_Time:27631.0min: [2024-01-03 16:19:25,469][model8_pretrain.py][INFO] Epoch:[0/2](215200/4588595) loss:3.009 lr:0.0000100 epoch_Time:27631.0min: [2024-01-03 16:19:25,470][model8_pretrain.py][INFO] Epoch:[0/2](215200/4588595) loss:2.818 lr:0.0000100 epoch_Time:27631.0min: [2024-01-03 16:20:02,408][model8_pretrain.py][INFO] Epoch:[0/2](215300/4588595) loss:2.997 lr:0.0000100 epoch_Time:27630.0min: [2024-01-03 16:20:02,408][model8_pretrain.py][INFO] Epoch:[0/2](215300/4588595) loss:2.883 lr:0.0000100 epoch_Time:27630.0min: [2024-01-03 16:20:02,408][model8_pretrain.py][INFO] 
Epoch:[0/2](215300/4588595) loss:2.438 lr:0.0000100 epoch_Time:27630.0min: [2024-01-03 16:20:02,408][model8_pretrain.py][INFO] Epoch:[0/2](215300/4588595) loss:3.163 lr:0.0000100 epoch_Time:27630.0min: [2024-01-03 16:20:02,408][model8_pretrain.py][INFO] Epoch:[0/2](215300/4588595) loss:2.894 lr:0.0000100 epoch_Time:27630.0min: [2024-01-03 16:20:02,408][model8_pretrain.py][INFO] Epoch:[0/2](215300/4588595) loss:3.038 lr:0.0000100 epoch_Time:27630.0min: [2024-01-03 16:20:02,409][model8_pretrain.py][INFO] Epoch:[0/2](215300/4588595) loss:3.291 lr:0.0000100 epoch_Time:27630.0min: [2024-01-03 16:20:02,409][model8_pretrain.py][INFO] Epoch:[0/2](215300/4588595) loss:3.125 lr:0.0000100 epoch_Time:27630.0min: [2024-01-03 16:20:39,358][model8_pretrain.py][INFO] Epoch:[0/2](215400/4588595) loss:3.328 lr:0.0000100 epoch_Time:27629.0min: [2024-01-03 16:20:39,358][model8_pretrain.py][INFO] Epoch:[0/2](215400/4588595) loss:3.036 lr:0.0000100 epoch_Time:27629.0min: [2024-01-03 16:20:39,358][model8_pretrain.py][INFO] Epoch:[0/2](215400/4588595) loss:3.048 lr:0.0000100 epoch_Time:27629.0min: [2024-01-03 16:20:39,358][model8_pretrain.py][INFO] Epoch:[0/2](215400/4588595) loss:2.733 lr:0.0000100 epoch_Time:27629.0min: [2024-01-03 16:20:39,358][model8_pretrain.py][INFO] Epoch:[0/2](215400/4588595) loss:2.778 lr:0.0000100 epoch_Time:27629.0min: [2024-01-03 16:20:39,358][model8_pretrain.py][INFO] Epoch:[0/2](215400/4588595) loss:3.009 lr:0.0000100 epoch_Time:27629.0min: [2024-01-03 16:20:39,358][model8_pretrain.py][INFO] Epoch:[0/2](215400/4588595) loss:3.412 lr:0.0000100 epoch_Time:27629.0min: [2024-01-03 16:20:39,358][model8_pretrain.py][INFO] Epoch:[0/2](215400/4588595) loss:2.864 lr:0.0000100 epoch_Time:27629.0min: [2024-01-03 16:21:16,302][model8_pretrain.py][INFO] Epoch:[0/2](215500/4588595) loss:3.351 lr:0.0000100 epoch_Time:27628.0min: [2024-01-03 16:21:16,302][model8_pretrain.py][INFO] Epoch:[0/2](215500/4588595) loss:2.860 lr:0.0000100 epoch_Time:27628.0min: [2024-01-03 
16:21:16,302][model8_pretrain.py][INFO] Epoch:[0/2](215500/4588595) loss:2.963 lr:0.0000100 epoch_Time:27628.0min: [2024-01-03 16:21:16,302][model8_pretrain.py][INFO] Epoch:[0/2](215500/4588595) loss:3.224 lr:0.0000100 epoch_Time:27628.0min: [2024-01-03 16:21:16,302][model8_pretrain.py][INFO] Epoch:[0/2](215500/4588595) loss:2.679 lr:0.0000100 epoch_Time:27628.0min: [2024-01-03 16:21:16,302][model8_pretrain.py][INFO] Epoch:[0/2](215500/4588595) loss:2.707 lr:0.0000100 epoch_Time:27628.0min: [2024-01-03 16:21:16,302][model8_pretrain.py][INFO] Epoch:[0/2](215500/4588595) loss:2.938 lr:0.0000100 epoch_Time:27628.0min: [2024-01-03 16:21:16,302][model8_pretrain.py][INFO] Epoch:[0/2](215500/4588595) loss:2.571 lr:0.0000100 epoch_Time:27628.0min: [2024-01-03 16:21:53,245][model8_pretrain.py][INFO] Epoch:[0/2](215600/4588595) loss:2.781 lr:0.0000100 epoch_Time:27627.0min: [2024-01-03 16:21:53,245][model8_pretrain.py][INFO] Epoch:[0/2](215600/4588595) loss:2.967 lr:0.0000100 epoch_Time:27627.0min: [2024-01-03 16:21:53,245][model8_pretrain.py][INFO] Epoch:[0/2](215600/4588595) loss:3.209 lr:0.0000100 epoch_Time:27627.0min: [2024-01-03 16:21:53,245][model8_pretrain.py][INFO] Epoch:[0/2](215600/4588595) loss:3.007 lr:0.0000100 epoch_Time:27627.0min: [2024-01-03 16:21:53,245][model8_pretrain.py][INFO] Epoch:[0/2](215600/4588595) loss:2.983 lr:0.0000100 epoch_Time:27627.0min: [2024-01-03 16:21:53,245][model8_pretrain.py][INFO] Epoch:[0/2](215600/4588595) loss:2.912 lr:0.0000100 epoch_Time:27627.0min: [2024-01-03 16:21:53,245][model8_pretrain.py][INFO] Epoch:[0/2](215600/4588595) loss:3.113 lr:0.0000100 epoch_Time:27627.0min: [2024-01-03 16:21:53,245][model8_pretrain.py][INFO] Epoch:[0/2](215600/4588595) loss:3.251 lr:0.0000100 epoch_Time:27627.0min: [2024-01-03 16:22:30,202][model8_pretrain.py][INFO] Epoch:[0/2](215700/4588595) loss:2.757 lr:0.0000100 epoch_Time:27626.0min: [2024-01-03 16:22:30,202][model8_pretrain.py][INFO] Epoch:[0/2](215700/4588595) loss:3.255 lr:0.0000100 
epoch_Time:27626.0min: [2024-01-03 16:22:30,202][model8_pretrain.py][INFO] Epoch:[0/2](215700/4588595) loss:3.206 lr:0.0000100 epoch_Time:27626.0min: [2024-01-03 16:22:30,202][model8_pretrain.py][INFO] Epoch:[0/2](215700/4588595) loss:3.215 lr:0.0000100 epoch_Time:27626.0min: [2024-01-03 16:22:30,203][model8_pretrain.py][INFO] Epoch:[0/2](215700/4588595) loss:3.037 lr:0.0000100 epoch_Time:27626.0min: [2024-01-03 16:22:30,202][model8_pretrain.py][INFO] Epoch:[0/2](215700/4588595) loss:2.906 lr:0.0000100 epoch_Time:27626.0min: [2024-01-03 16:22:30,203][model8_pretrain.py][INFO] Epoch:[0/2](215700/4588595) loss:2.241 lr:0.0000100 epoch_Time:27626.0min: [2024-01-03 16:22:30,203][model8_pretrain.py][INFO] Epoch:[0/2](215700/4588595) loss:3.163 lr:0.0000100 epoch_Time:27626.0min: [2024-01-03 16:23:07,175][model8_pretrain.py][INFO] Epoch:[0/2](215800/4588595) loss:2.886 lr:0.0000100 epoch_Time:27625.0min: [2024-01-03 16:23:07,175][model8_pretrain.py][INFO] Epoch:[0/2](215800/4588595) loss:2.076 lr:0.0000100 epoch_Time:27625.0min: [2024-01-03 16:23:07,175][model8_pretrain.py][INFO] Epoch:[0/2](215800/4588595) loss:3.444 lr:0.0000100 epoch_Time:27625.0min: [2024-01-03 16:23:07,175][model8_pretrain.py][INFO] Epoch:[0/2](215800/4588595) loss:2.683 lr:0.0000100 epoch_Time:27625.0min: [2024-01-03 16:23:07,175][model8_pretrain.py][INFO] Epoch:[0/2](215800/4588595) loss:2.116 lr:0.0000100 epoch_Time:27625.0min: [2024-01-03 16:23:07,175][model8_pretrain.py][INFO] Epoch:[0/2](215800/4588595) loss:2.979 lr:0.0000100 epoch_Time:27625.0min: [2024-01-03 16:23:07,176][model8_pretrain.py][INFO] Epoch:[0/2](215800/4588595) loss:3.159 lr:0.0000100 epoch_Time:27625.0min: [2024-01-03 16:23:07,176][model8_pretrain.py][INFO] Epoch:[0/2](215800/4588595) loss:3.216 lr:0.0000100 epoch_Time:27625.0min: [2024-01-03 16:23:52,319][model8_pretrain.py][INFO] Epoch:[0/2](215900/4588595) loss:2.790 lr:0.0000100 epoch_Time:27627.0min: [2024-01-03 16:23:52,319][model8_pretrain.py][INFO] 
Epoch:[0/2](215900/4588595) loss:3.221 lr:0.0000100 epoch_Time:27627.0min: [2024-01-03 16:23:52,319][model8_pretrain.py][INFO] Epoch:[0/2](215900/4588595) loss:3.430 lr:0.0000100 epoch_Time:27627.0min: [2024-01-03 16:23:52,319][model8_pretrain.py][INFO] Epoch:[0/2](215900/4588595) loss:3.202 lr:0.0000100 epoch_Time:27627.0min: [2024-01-03 16:23:52,319][model8_pretrain.py][INFO] Epoch:[0/2](215900/4588595) loss:3.256 lr:0.0000100 epoch_Time:27627.0min: [2024-01-03 16:23:52,319][model8_pretrain.py][INFO] Epoch:[0/2](215900/4588595) loss:3.108 lr:0.0000100 epoch_Time:27627.0min: [2024-01-03 16:23:52,319][model8_pretrain.py][INFO] Epoch:[0/2](215900/4588595) loss:2.471 lr:0.0000100 epoch_Time:27627.0min: [2024-01-03 16:23:52,320][model8_pretrain.py][INFO] Epoch:[0/2](215900/4588595) loss:3.083 lr:0.0000100 epoch_Time:27627.0min: [2024-01-03 16:24:29,251][model8_pretrain.py][INFO] Epoch:[0/2](216000/4588595) loss:2.599 lr:0.0000100 epoch_Time:27626.0min: [2024-01-03 16:24:29,251][model8_pretrain.py][INFO] Epoch:[0/2](216000/4588595) loss:2.661 lr:0.0000100 epoch_Time:27626.0min: [2024-01-03 16:24:29,251][model8_pretrain.py][INFO] Epoch:[0/2](216000/4588595) loss:2.089 lr:0.0000100 epoch_Time:27626.0min: [2024-01-03 16:24:29,251][model8_pretrain.py][INFO] Epoch:[0/2](216000/4588595) loss:2.558 lr:0.0000100 epoch_Time:27626.0min: [2024-01-03 16:24:29,251][model8_pretrain.py][INFO] Epoch:[0/2](216000/4588595) loss:2.766 lr:0.0000100 epoch_Time:27626.0min: [2024-01-03 16:24:29,251][model8_pretrain.py][INFO] Epoch:[0/2](216000/4588595) loss:2.893 lr:0.0000100 epoch_Time:27626.0min: [2024-01-03 16:24:29,251][model8_pretrain.py][INFO] Epoch:[0/2](216000/4588595) loss:3.076 lr:0.0000100 epoch_Time:27626.0min: [2024-01-03 16:24:29,251][model8_pretrain.py][INFO] Epoch:[0/2](216000/4588595) loss:2.955 lr:0.0000100 epoch_Time:27626.0min: [2024-01-03 16:25:06,195][model8_pretrain.py][INFO] Epoch:[0/2](216100/4588595) loss:3.375 lr:0.0000100 epoch_Time:27625.0min: [2024-01-03 
16:25:06,195][model8_pretrain.py][INFO] Epoch:[0/2](216100/4588595) loss:3.110 lr:0.0000100 epoch_Time:27625.0min: [2024-01-03 16:25:06,195][model8_pretrain.py][INFO] Epoch:[0/2](216100/4588595) loss:3.157 lr:0.0000100 epoch_Time:27625.0min: [2024-01-03 16:25:06,195][model8_pretrain.py][INFO] Epoch:[0/2](216100/4588595) loss:3.269 lr:0.0000100 epoch_Time:27625.0min: [2024-01-03 16:25:06,195][model8_pretrain.py][INFO] Epoch:[0/2](216100/4588595) loss:3.046 lr:0.0000100 epoch_Time:27625.0min: [2024-01-03 16:25:06,195][model8_pretrain.py][INFO] Epoch:[0/2](216100/4588595) loss:3.050 lr:0.0000100 epoch_Time:27625.0min: [2024-01-03 16:25:06,195][model8_pretrain.py][INFO] Epoch:[0/2](216100/4588595) loss:2.772 lr:0.0000100 epoch_Time:27625.0min: [2024-01-03 16:25:06,195][model8_pretrain.py][INFO] Epoch:[0/2](216100/4588595) loss:2.749 lr:0.0000100 epoch_Time:27625.0min: [2024-01-03 16:25:43,142][model8_pretrain.py][INFO] Epoch:[0/2](216200/4588595) loss:3.015 lr:0.0000100 epoch_Time:27625.0min: [2024-01-03 16:25:43,142][model8_pretrain.py][INFO] Epoch:[0/2](216200/4588595) loss:2.834 lr:0.0000100 epoch_Time:27625.0min: [2024-01-03 16:25:43,142][model8_pretrain.py][INFO] Epoch:[0/2](216200/4588595) loss:2.956 lr:0.0000100 epoch_Time:27625.0min: [2024-01-03 16:25:43,142][model8_pretrain.py][INFO] Epoch:[0/2](216200/4588595) loss:2.681 lr:0.0000100 epoch_Time:27625.0min: [2024-01-03 16:25:43,142][model8_pretrain.py][INFO] Epoch:[0/2](216200/4588595) loss:3.038 lr:0.0000100 epoch_Time:27625.0min: [2024-01-03 16:25:43,142][model8_pretrain.py][INFO] Epoch:[0/2](216200/4588595) loss:3.421 lr:0.0000100 epoch_Time:27625.0min: [2024-01-03 16:25:43,142][model8_pretrain.py][INFO] Epoch:[0/2](216200/4588595) loss:3.180 lr:0.0000100 epoch_Time:27625.0min: [2024-01-03 16:25:43,142][model8_pretrain.py][INFO] Epoch:[0/2](216200/4588595) loss:2.925 lr:0.0000100 epoch_Time:27625.0min: [2024-01-03 16:26:20,084][model8_pretrain.py][INFO] Epoch:[0/2](216300/4588595) loss:3.058 lr:0.0000100 
epoch_Time:27623.0min: [2024-01-03 16:26:20,084][model8_pretrain.py][INFO] Epoch:[0/2](216300/4588595) loss:2.244 lr:0.0000100 epoch_Time:27623.0min: [2024-01-03 16:26:20,084][model8_pretrain.py][INFO] Epoch:[0/2](216300/4588595) loss:2.877 lr:0.0000100 epoch_Time:27623.0min: [2024-01-03 16:26:20,084][model8_pretrain.py][INFO] Epoch:[0/2](216300/4588595) loss:2.989 lr:0.0000100 epoch_Time:27623.0min: [2024-01-03 16:26:20,084][model8_pretrain.py][INFO] Epoch:[0/2](216300/4588595) loss:2.409 lr:0.0000100 epoch_Time:27623.0min: [2024-01-03 16:26:20,084][model8_pretrain.py][INFO] Epoch:[0/2](216300/4588595) loss:2.932 lr:0.0000100 epoch_Time:27623.0min: [2024-01-03 16:26:20,084][model8_pretrain.py][INFO] Epoch:[0/2](216300/4588595) loss:2.665 lr:0.0000100 epoch_Time:27623.0min: [2024-01-03 16:26:20,085][model8_pretrain.py][INFO] Epoch:[0/2](216300/4588595) loss:3.133 lr:0.0000100 epoch_Time:27623.0min: [2024-01-03 16:26:57,024][model8_pretrain.py][INFO] Epoch:[0/2](216400/4588595) loss:3.098 lr:0.0000100 epoch_Time:27622.0min: [2024-01-03 16:26:57,024][model8_pretrain.py][INFO] Epoch:[0/2](216400/4588595) loss:3.380 lr:0.0000100 epoch_Time:27622.0min: [2024-01-03 16:26:57,024][model8_pretrain.py][INFO] Epoch:[0/2](216400/4588595) loss:2.744 lr:0.0000100 epoch_Time:27622.0min: [2024-01-03 16:26:57,024][model8_pretrain.py][INFO] Epoch:[0/2](216400/4588595) loss:2.620 lr:0.0000100 epoch_Time:27622.0min: [2024-01-03 16:26:57,024][model8_pretrain.py][INFO] Epoch:[0/2](216400/4588595) loss:2.542 lr:0.0000100 epoch_Time:27622.0min: [2024-01-03 16:26:57,024][model8_pretrain.py][INFO] Epoch:[0/2](216400/4588595) loss:3.132 lr:0.0000100 epoch_Time:27622.0min: [2024-01-03 16:26:57,024][model8_pretrain.py][INFO] Epoch:[0/2](216400/4588595) loss:3.213 lr:0.0000100 epoch_Time:27622.0min: [2024-01-03 16:26:57,024][model8_pretrain.py][INFO] Epoch:[0/2](216400/4588595) loss:3.289 lr:0.0000100 epoch_Time:27622.0min: [2024-01-03 16:27:33,958][model8_pretrain.py][INFO] 
Epoch:[0/2](216500/4588595) loss:3.092 lr:0.0000100 epoch_Time:27622.0min: [2024-01-03 16:27:33,958][model8_pretrain.py][INFO] Epoch:[0/2](216500/4588595) loss:2.464 lr:0.0000100 epoch_Time:27622.0min: [2024-01-03 16:27:33,958][model8_pretrain.py][INFO] Epoch:[0/2](216500/4588595) loss:2.616 lr:0.0000100 epoch_Time:27622.0min: [2024-01-03 16:27:33,958][model8_pretrain.py][INFO] Epoch:[0/2](216500/4588595) loss:3.153 lr:0.0000100 epoch_Time:27622.0min: [2024-01-03 16:27:33,958][model8_pretrain.py][INFO] Epoch:[0/2](216500/4588595) loss:2.887 lr:0.0000100 epoch_Time:27622.0min: [2024-01-03 16:27:33,958][model8_pretrain.py][INFO] Epoch:[0/2](216500/4588595) loss:2.430 lr:0.0000100 epoch_Time:27622.0min: [2024-01-03 16:27:33,958][model8_pretrain.py][INFO] Epoch:[0/2](216500/4588595) loss:3.160 lr:0.0000100 epoch_Time:27622.0min: [2024-01-03 16:27:33,958][model8_pretrain.py][INFO] Epoch:[0/2](216500/4588595) loss:2.886 lr:0.0000100 epoch_Time:27622.0min: [2024-01-03 16:28:10,890][model8_pretrain.py][INFO] Epoch:[0/2](216600/4588595) loss:3.213 lr:0.0000100 epoch_Time:27620.0min: [2024-01-03 16:28:10,890][model8_pretrain.py][INFO] Epoch:[0/2](216600/4588595) loss:2.753 lr:0.0000100 epoch_Time:27620.0min: [2024-01-03 16:28:10,890][model8_pretrain.py][INFO] Epoch:[0/2](216600/4588595) loss:2.572 lr:0.0000100 epoch_Time:27620.0min: [2024-01-03 16:28:10,890][model8_pretrain.py][INFO] Epoch:[0/2](216600/4588595) loss:2.800 lr:0.0000100 epoch_Time:27620.0min: [2024-01-03 16:28:10,890][model8_pretrain.py][INFO] Epoch:[0/2](216600/4588595) loss:2.678 lr:0.0000100 epoch_Time:27620.0min: [2024-01-03 16:28:10,890][model8_pretrain.py][INFO] Epoch:[0/2](216600/4588595) loss:2.906 lr:0.0000100 epoch_Time:27620.0min: [2024-01-03 16:28:10,890][model8_pretrain.py][INFO] Epoch:[0/2](216600/4588595) loss:3.386 lr:0.0000100 epoch_Time:27620.0min: [2024-01-03 16:28:10,890][model8_pretrain.py][INFO] Epoch:[0/2](216600/4588595) loss:2.940 lr:0.0000100 epoch_Time:27620.0min: [2024-01-03 
16:28:55,974][model8_pretrain.py][INFO] Epoch:[0/2](216700/4588595) loss:3.141 lr:0.0000100 epoch_Time:27622.0min: [2024-01-03 16:28:55,974][model8_pretrain.py][INFO] Epoch:[0/2](216700/4588595) loss:2.632 lr:0.0000100 epoch_Time:27622.0min: [2024-01-03 16:28:55,974][model8_pretrain.py][INFO] Epoch:[0/2](216700/4588595) loss:2.522 lr:0.0000100 epoch_Time:27622.0min: [2024-01-03 16:28:55,974][model8_pretrain.py][INFO] Epoch:[0/2](216700/4588595) loss:3.120 lr:0.0000100 epoch_Time:27622.0min: [2024-01-03 16:28:55,974][model8_pretrain.py][INFO] Epoch:[0/2](216700/4588595) loss:3.140 lr:0.0000100 epoch_Time:27622.0min: [2024-01-03 16:28:55,974][model8_pretrain.py][INFO] Epoch:[0/2](216700/4588595) loss:3.058 lr:0.0000100 epoch_Time:27622.0min: [2024-01-03 16:28:55,974][model8_pretrain.py][INFO] Epoch:[0/2](216700/4588595) loss:2.780 lr:0.0000100 epoch_Time:27622.0min: [2024-01-03 16:28:55,974][model8_pretrain.py][INFO] Epoch:[0/2](216700/4588595) loss:3.184 lr:0.0000100 epoch_Time:27622.0min: [2024-01-03 16:29:32,908][model8_pretrain.py][INFO] Epoch:[0/2](216800/4588595) loss:3.350 lr:0.0000100 epoch_Time:27621.0min: [2024-01-03 16:29:32,908][model8_pretrain.py][INFO] Epoch:[0/2](216800/4588595) loss:3.207 lr:0.0000100 epoch_Time:27621.0min: [2024-01-03 16:29:32,908][model8_pretrain.py][INFO] Epoch:[0/2](216800/4588595) loss:3.078 lr:0.0000100 epoch_Time:27621.0min: [2024-01-03 16:29:32,908][model8_pretrain.py][INFO] Epoch:[0/2](216800/4588595) loss:3.208 lr:0.0000100 epoch_Time:27621.0min: [2024-01-03 16:29:32,908][model8_pretrain.py][INFO] Epoch:[0/2](216800/4588595) loss:3.580 lr:0.0000100 epoch_Time:27621.0min: [2024-01-03 16:29:32,908][model8_pretrain.py][INFO] Epoch:[0/2](216800/4588595) loss:3.203 lr:0.0000100 epoch_Time:27621.0min: [2024-01-03 16:29:32,908][model8_pretrain.py][INFO] Epoch:[0/2](216800/4588595) loss:2.975 lr:0.0000100 epoch_Time:27621.0min: [2024-01-03 16:29:32,908][model8_pretrain.py][INFO] Epoch:[0/2](216800/4588595) loss:3.210 lr:0.0000100 
epoch_Time:27621.0min: [2024-01-03 16:30:09,847][model8_pretrain.py][INFO] Epoch:[0/2](216900/4588595) loss:2.799 lr:0.0000100 epoch_Time:27620.0min: [2024-01-03 16:30:09,847][model8_pretrain.py][INFO] Epoch:[0/2](216900/4588595) loss:3.015 lr:0.0000100 epoch_Time:27620.0min: [2024-01-03 16:30:09,847][model8_pretrain.py][INFO] Epoch:[0/2](216900/4588595) loss:2.768 lr:0.0000100 epoch_Time:27620.0min: [2024-01-03 16:30:09,847][model8_pretrain.py][INFO] Epoch:[0/2](216900/4588595) loss:2.655 lr:0.0000100 epoch_Time:27620.0min: [2024-01-03 16:30:09,847][model8_pretrain.py][INFO] Epoch:[0/2](216900/4588595) loss:2.914 lr:0.0000100 epoch_Time:27620.0min: [2024-01-03 16:30:09,847][model8_pretrain.py][INFO] Epoch:[0/2](216900/4588595) loss:3.043 lr:0.0000100 epoch_Time:27620.0min: [2024-01-03 16:30:09,847][model8_pretrain.py][INFO] Epoch:[0/2](216900/4588595) loss:2.364 lr:0.0000100 epoch_Time:27620.0min: [2024-01-03 16:30:09,847][model8_pretrain.py][INFO] Epoch:[0/2](216900/4588595) loss:2.565 lr:0.0000100 epoch_Time:27620.0min: [2024-01-03 16:30:46,788][model8_pretrain.py][INFO] Epoch:[0/2](217000/4588595) loss:3.180 lr:0.0000100 epoch_Time:27620.0min: [2024-01-03 16:30:46,789][model8_pretrain.py][INFO] Epoch:[0/2](217000/4588595) loss:3.170 lr:0.0000100 epoch_Time:27620.0min: [2024-01-03 16:30:46,789][model8_pretrain.py][INFO] Epoch:[0/2](217000/4588595) loss:3.601 lr:0.0000100 epoch_Time:27620.0min: [2024-01-03 16:30:46,789][model8_pretrain.py][INFO] Epoch:[0/2](217000/4588595) loss:2.780 lr:0.0000100 epoch_Time:27620.0min: [2024-01-03 16:30:46,789][model8_pretrain.py][INFO] Epoch:[0/2](217000/4588595) loss:3.295 lr:0.0000100 epoch_Time:27620.0min: [2024-01-03 16:30:46,789][model8_pretrain.py][INFO] Epoch:[0/2](217000/4588595) loss:2.816 lr:0.0000100 epoch_Time:27620.0min: [2024-01-03 16:30:46,789][model8_pretrain.py][INFO] Epoch:[0/2](217000/4588595) loss:2.978 lr:0.0000100 epoch_Time:27620.0min: [2024-01-03 16:30:46,789][model8_pretrain.py][INFO] 
Epoch:[0/2](217000/4588595) loss:2.740 lr:0.0000100 epoch_Time:27620.0min: [2024-01-03 16:31:23,730][model8_pretrain.py][INFO] Epoch:[0/2](217100/4588595) loss:2.887 lr:0.0000100 epoch_Time:27618.0min: [2024-01-03 16:31:23,730][model8_pretrain.py][INFO] Epoch:[0/2](217100/4588595) loss:3.301 lr:0.0000100 epoch_Time:27618.0min: [2024-01-03 16:31:23,730][model8_pretrain.py][INFO] Epoch:[0/2](217100/4588595) loss:2.771 lr:0.0000100 epoch_Time:27618.0min: [2024-01-03 16:31:23,730][model8_pretrain.py][INFO] Epoch:[0/2](217100/4588595) loss:3.420 lr:0.0000100 epoch_Time:27618.0min: [2024-01-03 16:31:23,730][model8_pretrain.py][INFO] Epoch:[0/2](217100/4588595) loss:2.840 lr:0.0000100 epoch_Time:27618.0min: [2024-01-03 16:31:23,730][model8_pretrain.py][INFO] Epoch:[0/2](217100/4588595) loss:2.718 lr:0.0000100 epoch_Time:27618.0min: [2024-01-03 16:31:23,730][model8_pretrain.py][INFO] Epoch:[0/2](217100/4588595) loss:2.716 lr:0.0000100 epoch_Time:27618.0min: [2024-01-03 16:31:23,730][model8_pretrain.py][INFO] Epoch:[0/2](217100/4588595) loss:2.993 lr:0.0000100 epoch_Time:27618.0min: [2024-01-03 16:32:00,671][model8_pretrain.py][INFO] Epoch:[0/2](217200/4588595) loss:2.790 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:32:00,671][model8_pretrain.py][INFO] Epoch:[0/2](217200/4588595) loss:2.498 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:32:00,671][model8_pretrain.py][INFO] Epoch:[0/2](217200/4588595) loss:3.223 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:32:00,671][model8_pretrain.py][INFO] Epoch:[0/2](217200/4588595) loss:2.273 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:32:00,671][model8_pretrain.py][INFO] Epoch:[0/2](217200/4588595) loss:3.439 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:32:00,671][model8_pretrain.py][INFO] Epoch:[0/2](217200/4588595) loss:3.031 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:32:00,671][model8_pretrain.py][INFO] Epoch:[0/2](217200/4588595) loss:2.593 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 
16:32:00,671][model8_pretrain.py][INFO] Epoch:[0/2](217200/4588595) loss:2.506 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:32:37,618][model8_pretrain.py][INFO] Epoch:[0/2](217300/4588595) loss:2.607 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:32:37,618][model8_pretrain.py][INFO] Epoch:[0/2](217300/4588595) loss:3.000 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:32:37,618][model8_pretrain.py][INFO] Epoch:[0/2](217300/4588595) loss:2.898 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:32:37,618][model8_pretrain.py][INFO] Epoch:[0/2](217300/4588595) loss:3.136 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:32:37,618][model8_pretrain.py][INFO] Epoch:[0/2](217300/4588595) loss:3.279 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:32:37,618][model8_pretrain.py][INFO] Epoch:[0/2](217300/4588595) loss:2.812 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:32:37,618][model8_pretrain.py][INFO] Epoch:[0/2](217300/4588595) loss:2.873 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:32:37,618][model8_pretrain.py][INFO] Epoch:[0/2](217300/4588595) loss:2.898 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:33:14,559][model8_pretrain.py][INFO] Epoch:[0/2](217400/4588595) loss:2.849 lr:0.0000100 epoch_Time:27615.0min: [2024-01-03 16:33:14,559][model8_pretrain.py][INFO] Epoch:[0/2](217400/4588595) loss:3.121 lr:0.0000100 epoch_Time:27615.0min: [2024-01-03 16:33:14,559][model8_pretrain.py][INFO] Epoch:[0/2](217400/4588595) loss:3.030 lr:0.0000100 epoch_Time:27615.0min: [2024-01-03 16:33:14,559][model8_pretrain.py][INFO] Epoch:[0/2](217400/4588595) loss:2.945 lr:0.0000100 epoch_Time:27615.0min: [2024-01-03 16:33:14,559][model8_pretrain.py][INFO] Epoch:[0/2](217400/4588595) loss:3.427 lr:0.0000100 epoch_Time:27615.0min: [2024-01-03 16:33:14,559][model8_pretrain.py][INFO] Epoch:[0/2](217400/4588595) loss:3.146 lr:0.0000100 epoch_Time:27615.0min: [2024-01-03 16:33:14,559][model8_pretrain.py][INFO] Epoch:[0/2](217400/4588595) loss:2.480 lr:0.0000100 
epoch_Time:27615.0min: [2024-01-03 16:33:14,559][model8_pretrain.py][INFO] Epoch:[0/2](217400/4588595) loss:3.189 lr:0.0000100 epoch_Time:27615.0min: [2024-01-03 16:33:59,728][model8_pretrain.py][INFO] Epoch:[0/2](217500/4588595) loss:2.950 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:33:59,728][model8_pretrain.py][INFO] Epoch:[0/2](217500/4588595) loss:3.163 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:33:59,728][model8_pretrain.py][INFO] Epoch:[0/2](217500/4588595) loss:3.551 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:33:59,728][model8_pretrain.py][INFO] Epoch:[0/2](217500/4588595) loss:3.009 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:33:59,728][model8_pretrain.py][INFO] Epoch:[0/2](217500/4588595) loss:3.171 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:33:59,728][model8_pretrain.py][INFO] Epoch:[0/2](217500/4588595) loss:3.137 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:33:59,728][model8_pretrain.py][INFO] Epoch:[0/2](217500/4588595) loss:3.021 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:33:59,728][model8_pretrain.py][INFO] Epoch:[0/2](217500/4588595) loss:2.912 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:34:36,671][model8_pretrain.py][INFO] Epoch:[0/2](217600/4588595) loss:3.344 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:34:36,671][model8_pretrain.py][INFO] Epoch:[0/2](217600/4588595) loss:2.828 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:34:36,671][model8_pretrain.py][INFO] Epoch:[0/2](217600/4588595) loss:2.588 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:34:36,671][model8_pretrain.py][INFO] Epoch:[0/2](217600/4588595) loss:2.855 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:34:36,671][model8_pretrain.py][INFO] Epoch:[0/2](217600/4588595) loss:3.229 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:34:36,671][model8_pretrain.py][INFO] Epoch:[0/2](217600/4588595) loss:3.023 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:34:36,672][model8_pretrain.py][INFO] 
Epoch:[0/2](217600/4588595) loss:3.136 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:34:36,672][model8_pretrain.py][INFO] Epoch:[0/2](217600/4588595) loss:2.893 lr:0.0000100 epoch_Time:27617.0min: [2024-01-03 16:35:13,611][model8_pretrain.py][INFO] Epoch:[0/2](217700/4588595) loss:3.296 lr:0.0000100 epoch_Time:27615.0min: [2024-01-03 16:35:13,611][model8_pretrain.py][INFO] Epoch:[0/2](217700/4588595) loss:2.890 lr:0.0000100 epoch_Time:27615.0min: [2024-01-03 16:35:13,611][model8_pretrain.py][INFO] Epoch:[0/2](217700/4588595) loss:2.929 lr:0.0000100 epoch_Time:27615.0min: [2024-01-03 16:35:13,611][model8_pretrain.py][INFO] Epoch:[0/2](217700/4588595) loss:3.095 lr:0.0000100 epoch_Time:27615.0min: [2024-01-03 16:35:13,611][model8_pretrain.py][INFO] Epoch:[0/2](217700/4588595) loss:3.428 lr:0.0000100 epoch_Time:27615.0min: [2024-01-03 16:35:13,611][model8_pretrain.py][INFO] Epoch:[0/2](217700/4588595) loss:3.157 lr:0.0000100 epoch_Time:27615.0min: [2024-01-03 16:35:13,611][model8_pretrain.py][INFO] Epoch:[0/2](217700/4588595) loss:2.668 lr:0.0000100 epoch_Time:27615.0min: [2024-01-03 16:35:13,611][model8_pretrain.py][INFO] Epoch:[0/2](217700/4588595) loss:3.030 lr:0.0000100 epoch_Time:27615.0min: [2024-01-03 16:35:50,557][model8_pretrain.py][INFO] Epoch:[0/2](217800/4588595) loss:3.265 lr:0.0000100 epoch_Time:27614.0min: [2024-01-03 16:35:50,557][model8_pretrain.py][INFO] Epoch:[0/2](217800/4588595) loss:3.151 lr:0.0000100 epoch_Time:27614.0min: [2024-01-03 16:35:50,557][model8_pretrain.py][INFO] Epoch:[0/2](217800/4588595) loss:2.675 lr:0.0000100 epoch_Time:27614.0min: [2024-01-03 16:35:50,557][model8_pretrain.py][INFO] Epoch:[0/2](217800/4588595) loss:2.762 lr:0.0000100 epoch_Time:27614.0min: [2024-01-03 16:35:50,558][model8_pretrain.py][INFO] Epoch:[0/2](217800/4588595) loss:2.855 lr:0.0000100 epoch_Time:27614.0min: [2024-01-03 16:35:50,558][model8_pretrain.py][INFO] Epoch:[0/2](217800/4588595) loss:2.814 lr:0.0000100 epoch_Time:27614.0min: [2024-01-03 
16:35:50,558][model8_pretrain.py][INFO] Epoch:[0/2](217800/4588595) loss:2.712 lr:0.0000100 epoch_Time:27614.0min: [2024-01-03 16:35:50,558][model8_pretrain.py][INFO] Epoch:[0/2](217800/4588595) loss:3.172 lr:0.0000100 epoch_Time:27614.0min: [2024-01-03 16:36:27,505][model8_pretrain.py][INFO] Epoch:[0/2](217900/4588595) loss:2.711 lr:0.0000100 epoch_Time:27614.0min: [2024-01-03 16:36:27,505][model8_pretrain.py][INFO] Epoch:[0/2](217900/4588595) loss:2.733 lr:0.0000100 epoch_Time:27614.0min: [2024-01-03 16:36:27,505][model8_pretrain.py][INFO] Epoch:[0/2](217900/4588595) loss:2.670 lr:0.0000100 epoch_Time:27614.0min: [2024-01-03 16:36:27,505][model8_pretrain.py][INFO] Epoch:[0/2](217900/4588595) loss:2.057 lr:0.0000100 epoch_Time:27614.0min: [2024-01-03 16:36:27,505][model8_pretrain.py][INFO] Epoch:[0/2](217900/4588595) loss:3.386 lr:0.0000100 epoch_Time:27614.0min: [2024-01-03 16:36:27,505][model8_pretrain.py][INFO] Epoch:[0/2](217900/4588595) loss:3.093 lr:0.0000100 epoch_Time:27614.0min: [2024-01-03 16:36:27,505][model8_pretrain.py][INFO] Epoch:[0/2](217900/4588595) loss:2.671 lr:0.0000100 epoch_Time:27614.0min: [2024-01-03 16:36:27,506][model8_pretrain.py][INFO] Epoch:[0/2](217900/4588595) loss:3.444 lr:0.0000100 epoch_Time:27614.0min: [2024-01-03 16:37:04,457][model8_pretrain.py][INFO] Epoch:[0/2](218000/4588595) loss:3.015 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:37:04,457][model8_pretrain.py][INFO] Epoch:[0/2](218000/4588595) loss:2.645 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:37:04,457][model8_pretrain.py][INFO] Epoch:[0/2](218000/4588595) loss:3.199 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:37:04,457][model8_pretrain.py][INFO] Epoch:[0/2](218000/4588595) loss:2.245 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:37:04,457][model8_pretrain.py][INFO] Epoch:[0/2](218000/4588595) loss:3.002 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:37:04,457][model8_pretrain.py][INFO] Epoch:[0/2](218000/4588595) loss:2.767 lr:0.0000100 
epoch_Time:27612.0min: [2024-01-03 16:37:04,457][model8_pretrain.py][INFO] Epoch:[0/2](218000/4588595) loss:3.227 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:37:04,457][model8_pretrain.py][INFO] Epoch:[0/2](218000/4588595) loss:3.177 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:37:41,414][model8_pretrain.py][INFO] Epoch:[0/2](218100/4588595) loss:3.187 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:37:41,414][model8_pretrain.py][INFO] Epoch:[0/2](218100/4588595) loss:3.057 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:37:41,414][model8_pretrain.py][INFO] Epoch:[0/2](218100/4588595) loss:2.375 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:37:41,414][model8_pretrain.py][INFO] Epoch:[0/2](218100/4588595) loss:3.224 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:37:41,415][model8_pretrain.py][INFO] Epoch:[0/2](218100/4588595) loss:2.934 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:37:41,415][model8_pretrain.py][INFO] Epoch:[0/2](218100/4588595) loss:3.094 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:37:41,415][model8_pretrain.py][INFO] Epoch:[0/2](218100/4588595) loss:2.520 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:37:41,415][model8_pretrain.py][INFO] Epoch:[0/2](218100/4588595) loss:3.273 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:38:18,374][model8_pretrain.py][INFO] Epoch:[0/2](218200/4588595) loss:3.159 lr:0.0000100 epoch_Time:27611.0min: [2024-01-03 16:38:18,374][model8_pretrain.py][INFO] Epoch:[0/2](218200/4588595) loss:2.016 lr:0.0000100 epoch_Time:27611.0min: [2024-01-03 16:38:18,374][model8_pretrain.py][INFO] Epoch:[0/2](218200/4588595) loss:2.529 lr:0.0000100 epoch_Time:27611.0min: [2024-01-03 16:38:18,374][model8_pretrain.py][INFO] Epoch:[0/2](218200/4588595) loss:2.336 lr:0.0000100 epoch_Time:27611.0min: [2024-01-03 16:38:18,374][model8_pretrain.py][INFO] Epoch:[0/2](218200/4588595) loss:3.237 lr:0.0000100 epoch_Time:27611.0min: [2024-01-03 16:38:18,374][model8_pretrain.py][INFO] 
Epoch:[0/2](218200/4588595) loss:2.743 lr:0.0000100 epoch_Time:27611.0min: [2024-01-03 16:38:18,374][model8_pretrain.py][INFO] Epoch:[0/2](218200/4588595) loss:2.841 lr:0.0000100 epoch_Time:27611.0min: [2024-01-03 16:38:18,374][model8_pretrain.py][INFO] Epoch:[0/2](218200/4588595) loss:2.634 lr:0.0000100 epoch_Time:27611.0min: [2024-01-03 16:39:03,859][model8_pretrain.py][INFO] Epoch:[0/2](218300/4588595) loss:3.265 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:39:03,859][model8_pretrain.py][INFO] Epoch:[0/2](218300/4588595) loss:2.687 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:39:03,859][model8_pretrain.py][INFO] Epoch:[0/2](218300/4588595) loss:2.745 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:39:03,859][model8_pretrain.py][INFO] Epoch:[0/2](218300/4588595) loss:2.979 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:39:03,859][model8_pretrain.py][INFO] Epoch:[0/2](218300/4588595) loss:2.738 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:39:03,859][model8_pretrain.py][INFO] Epoch:[0/2](218300/4588595) loss:3.059 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:39:03,859][model8_pretrain.py][INFO] Epoch:[0/2](218300/4588595) loss:2.923 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:39:03,860][model8_pretrain.py][INFO] Epoch:[0/2](218300/4588595) loss:2.559 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:39:40,796][model8_pretrain.py][INFO] Epoch:[0/2](218400/4588595) loss:2.899 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:39:40,796][model8_pretrain.py][INFO] Epoch:[0/2](218400/4588595) loss:2.713 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:39:40,796][model8_pretrain.py][INFO] Epoch:[0/2](218400/4588595) loss:2.837 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:39:40,796][model8_pretrain.py][INFO] Epoch:[0/2](218400/4588595) loss:2.832 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:39:40,796][model8_pretrain.py][INFO] Epoch:[0/2](218400/4588595) loss:2.904 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 
16:39:40,796][model8_pretrain.py][INFO] Epoch:[0/2](218400/4588595) loss:2.891 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:39:40,796][model8_pretrain.py][INFO] Epoch:[0/2](218400/4588595) loss:3.003 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:39:40,796][model8_pretrain.py][INFO] Epoch:[0/2](218400/4588595) loss:3.170 lr:0.0000100 epoch_Time:27612.0min: [2024-01-03 16:40:17,739][model8_pretrain.py][INFO] Epoch:[0/2](218500/4588595) loss:3.134 lr:0.0000100 epoch_Time:27611.0min: [2024-01-03 16:40:17,739][model8_pretrain.py][INFO] Epoch:[0/2](218500/4588595) loss:2.815 lr:0.0000100 epoch_Time:27611.0min: [2024-01-03 16:40:17,739][model8_pretrain.py][INFO] Epoch:[0/2](218500/4588595) loss:2.415 lr:0.0000100 epoch_Time:27611.0min: [2024-01-03 16:40:17,739][model8_pretrain.py][INFO] Epoch:[0/2](218500/4588595) loss:2.781 lr:0.0000100 epoch_Time:27611.0min: [2024-01-03 16:40:17,739][model8_pretrain.py][INFO] Epoch:[0/2](218500/4588595) loss:3.509 lr:0.0000100 epoch_Time:27611.0min: [2024-01-03 16:40:17,739][model8_pretrain.py][INFO] Epoch:[0/2](218500/4588595) loss:2.934 lr:0.0000100 epoch_Time:27611.0min: [2024-01-03 16:40:17,739][model8_pretrain.py][INFO] Epoch:[0/2](218500/4588595) loss:3.112 lr:0.0000100 epoch_Time:27611.0min: [2024-01-03 16:40:17,739][model8_pretrain.py][INFO] Epoch:[0/2](218500/4588595) loss:2.612 lr:0.0000100 epoch_Time:27611.0min: [2024-01-03 16:40:54,683][model8_pretrain.py][INFO] Epoch:[0/2](218600/4588595) loss:3.182 lr:0.0000100 epoch_Time:27609.0min: [2024-01-03 16:40:54,683][model8_pretrain.py][INFO] Epoch:[0/2](218600/4588595) loss:3.164 lr:0.0000100 epoch_Time:27609.0min: [2024-01-03 16:40:54,683][model8_pretrain.py][INFO] Epoch:[0/2](218600/4588595) loss:2.889 lr:0.0000100 epoch_Time:27609.0min: [2024-01-03 16:40:54,683][model8_pretrain.py][INFO] Epoch:[0/2](218600/4588595) loss:2.225 lr:0.0000100 epoch_Time:27609.0min: [2024-01-03 16:40:54,683][model8_pretrain.py][INFO] Epoch:[0/2](218600/4588595) loss:2.962 lr:0.0000100 
epoch_Time:27609.0min: [2024-01-03 16:40:54,683][model8_pretrain.py][INFO] Epoch:[0/2](218600/4588595) loss:2.782 lr:0.0000100 epoch_Time:27609.0min: [2024-01-03 16:40:54,683][model8_pretrain.py][INFO] Epoch:[0/2](218600/4588595) loss:2.346 lr:0.0000100 epoch_Time:27609.0min: [2024-01-03 16:40:54,683][model8_pretrain.py][INFO] Epoch:[0/2](218600/4588595) loss:2.824 lr:0.0000100 epoch_Time:27609.0min: [2024-01-03 16:41:31,630][model8_pretrain.py][INFO] Epoch:[0/2](218700/4588595) loss:2.926 lr:0.0000100 epoch_Time:27609.0min: [2024-01-03 16:41:31,630][model8_pretrain.py][INFO] Epoch:[0/2](218700/4588595) loss:2.799 lr:0.0000100 epoch_Time:27609.0min: [2024-01-03 16:41:31,630][model8_pretrain.py][INFO] Epoch:[0/2](218700/4588595) loss:3.204 lr:0.0000100 epoch_Time:27609.0min: [2024-01-03 16:41:31,630][model8_pretrain.py][INFO] Epoch:[0/2](218700/4588595) loss:3.209 lr:0.0000100 epoch_Time:27609.0min: [2024-01-03 16:41:31,630][model8_pretrain.py][INFO] Epoch:[0/2](218700/4588595) loss:2.460 lr:0.0000100 epoch_Time:27609.0min: [2024-01-03 16:41:31,630][model8_pretrain.py][INFO] Epoch:[0/2](218700/4588595) loss:2.801 lr:0.0000100 epoch_Time:27609.0min: [2024-01-03 16:41:31,630][model8_pretrain.py][INFO] Epoch:[0/2](218700/4588595) loss:3.264 lr:0.0000100 epoch_Time:27609.0min: [2024-01-03 16:41:31,631][model8_pretrain.py][INFO] Epoch:[0/2](218700/4588595) loss:2.571 lr:0.0000100 epoch_Time:27609.0min: [2024-01-03 16:42:08,579][model8_pretrain.py][INFO] Epoch:[0/2](218800/4588595) loss:3.575 lr:0.0000100 epoch_Time:27608.0min: [2024-01-03 16:42:08,579][model8_pretrain.py][INFO] Epoch:[0/2](218800/4588595) loss:2.944 lr:0.0000100 epoch_Time:27608.0min: [2024-01-03 16:42:08,579][model8_pretrain.py][INFO] Epoch:[0/2](218800/4588595) loss:2.654 lr:0.0000100 epoch_Time:27608.0min: [2024-01-03 16:42:08,579][model8_pretrain.py][INFO] Epoch:[0/2](218800/4588595) loss:2.692 lr:0.0000100 epoch_Time:27608.0min: [2024-01-03 16:42:08,579][model8_pretrain.py][INFO] 
Epoch:[0/2](218800/4588595) loss:2.599 lr:0.0000100 epoch_Time:27608.0min: [2024-01-03 16:42:08,579][model8_pretrain.py][INFO] Epoch:[0/2](218800/4588595) loss:2.824 lr:0.0000100 epoch_Time:27608.0min: [2024-01-03 16:42:08,579][model8_pretrain.py][INFO] Epoch:[0/2](218800/4588595) loss:2.149 lr:0.0000100 epoch_Time:27608.0min: [2024-01-03 16:42:08,579][model8_pretrain.py][INFO] Epoch:[0/2](218800/4588595) loss:2.956 lr:0.0000100 epoch_Time:27608.0min: [2024-01-03 16:42:45,520][model8_pretrain.py][INFO] Epoch:[0/2](218900/4588595) loss:3.171 lr:0.0000100 epoch_Time:27607.0min: [2024-01-03 16:42:45,520][model8_pretrain.py][INFO] Epoch:[0/2](218900/4588595) loss:3.182 lr:0.0000100 epoch_Time:27607.0min: [2024-01-03 16:42:45,520][model8_pretrain.py][INFO] Epoch:[0/2](218900/4588595) loss:2.666 lr:0.0000100 epoch_Time:27607.0min: [2024-01-03 16:42:45,520][model8_pretrain.py][INFO] Epoch:[0/2](218900/4588595) loss:2.794 lr:0.0000100 epoch_Time:27607.0min: [2024-01-03 16:42:45,520][model8_pretrain.py][INFO] Epoch:[0/2](218900/4588595) loss:3.283 lr:0.0000100 epoch_Time:27607.0min: [2024-01-03 16:42:45,520][model8_pretrain.py][INFO] Epoch:[0/2](218900/4588595) loss:3.369 lr:0.0000100 epoch_Time:27607.0min: [2024-01-03 16:42:45,520][model8_pretrain.py][INFO] Epoch:[0/2](218900/4588595) loss:3.445 lr:0.0000100 epoch_Time:27607.0min: [2024-01-03 16:42:45,520][model8_pretrain.py][INFO] Epoch:[0/2](218900/4588595) loss:3.287 lr:0.0000100 epoch_Time:27607.0min: [2024-01-03 16:43:22,454][model8_pretrain.py][INFO] Epoch:[0/2](219000/4588595) loss:2.850 lr:0.0000100 epoch_Time:27606.0min: [2024-01-03 16:43:22,454][model8_pretrain.py][INFO] Epoch:[0/2](219000/4588595) loss:3.162 lr:0.0000100 epoch_Time:27606.0min: [2024-01-03 16:43:22,454][model8_pretrain.py][INFO] Epoch:[0/2](219000/4588595) loss:2.853 lr:0.0000100 epoch_Time:27606.0min: [2024-01-03 16:43:22,454][model8_pretrain.py][INFO] Epoch:[0/2](219000/4588595) loss:2.977 lr:0.0000100 epoch_Time:27606.0min: [2024-01-03 
16:43:22,454][model8_pretrain.py][INFO] Epoch:[0/2](219000/4588595) loss:2.520 lr:0.0000100 epoch_Time:27606.0min: [2024-01-03 16:43:22,454][model8_pretrain.py][INFO] Epoch:[0/2](219000/4588595) loss:2.953 lr:0.0000100 epoch_Time:27606.0min: [2024-01-03 16:43:22,454][model8_pretrain.py][INFO] Epoch:[0/2](219000/4588595) loss:2.880 lr:0.0000100 epoch_Time:27606.0min: [2024-01-03 16:43:22,455][model8_pretrain.py][INFO] Epoch:[0/2](219000/4588595) loss:2.698 lr:0.0000100 epoch_Time:27606.0min: [2024-01-03 16:44:07,817][model8_pretrain.py][INFO] Epoch:[0/2](219100/4588595) loss:2.616 lr:0.0000100 epoch_Time:27607.0min: [2024-01-03 16:44:07,817][model8_pretrain.py][INFO] Epoch:[0/2](219100/4588595) loss:2.821 lr:0.0000100 epoch_Time:27607.0min: [2024-01-03 16:44:07,817][model8_pretrain.py][INFO] Epoch:[0/2](219100/4588595) loss:3.134 lr:0.0000100 epoch_Time:27607.0min: [2024-01-03 16:44:07,818][model8_pretrain.py][INFO] Epoch:[0/2](219100/4588595) loss:2.544 lr:0.0000100 epoch_Time:27607.0min: [2024-01-03 16:44:07,818][model8_pretrain.py][INFO] Epoch:[0/2](219100/4588595) loss:3.106 lr:0.0000100 epoch_Time:27607.0min: [2024-01-03 16:44:07,817][model8_pretrain.py][INFO] Epoch:[0/2](219100/4588595) loss:2.939 lr:0.0000100 epoch_Time:27607.0min: [2024-01-03 16:44:07,818][model8_pretrain.py][INFO] Epoch:[0/2](219100/4588595) loss:3.015 lr:0.0000100 epoch_Time:27607.0min: [2024-01-03 16:44:07,818][model8_pretrain.py][INFO] Epoch:[0/2](219100/4588595) loss:2.787 lr:0.0000100 epoch_Time:27607.0min: [2024-01-03 16:44:44,750][model8_pretrain.py][INFO] Epoch:[0/2](219200/4588595) loss:3.266 lr:0.0000100 epoch_Time:27607.0min: [2024-01-03 16:44:44,750][model8_pretrain.py][INFO] Epoch:[0/2](219200/4588595) loss:2.836 lr:0.0000100 epoch_Time:27607.0min: [2024-01-03 16:44:44,750][model8_pretrain.py][INFO] Epoch:[0/2](219200/4588595) loss:2.950 lr:0.0000100 epoch_Time:27607.0min: [2024-01-03 16:44:44,750][model8_pretrain.py][INFO] Epoch:[0/2](219200/4588595) loss:2.648 lr:0.0000100 
epoch_Time:27607.0min: [2024-01-03 16:44:44,750][model8_pretrain.py][INFO] Epoch:[0/2](219200/4588595) loss:3.001 lr:0.0000100 epoch_Time:27607.0min: [2024-01-03 16:44:44,750][model8_pretrain.py][INFO] Epoch:[0/2](219200/4588595) loss:2.851 lr:0.0000100 epoch_Time:27607.0min: [2024-01-03 16:44:44,751][model8_pretrain.py][INFO] Epoch:[0/2](219200/4588595) loss:3.311 lr:0.0000100 epoch_Time:27607.0min: [2024-01-03 16:44:44,751][model8_pretrain.py][INFO] Epoch:[0/2](219200/4588595) loss:3.007 lr:0.0000100 epoch_Time:27607.0min: [2024-01-03 16:45:21,700][model8_pretrain.py][INFO] Epoch:[0/2](219300/4588595) loss:2.413 lr:0.0000100 epoch_Time:27606.0min: [2024-01-03 16:45:21,700][model8_pretrain.py][INFO] Epoch:[0/2](219300/4588595) loss:2.869 lr:0.0000100 epoch_Time:27606.0min: [2024-01-03 16:45:21,700][model8_pretrain.py][INFO] Epoch:[0/2](219300/4588595) loss:2.664 lr:0.0000100 epoch_Time:27606.0min: [2024-01-03 16:45:21,700][model8_pretrain.py][INFO] Epoch:[0/2](219300/4588595) loss:3.000 lr:0.0000100 epoch_Time:27606.0min: [2024-01-03 16:45:21,700][model8_pretrain.py][INFO] Epoch:[0/2](219300/4588595) loss:2.773 lr:0.0000100 epoch_Time:27606.0min: [2024-01-03 16:45:21,700][model8_pretrain.py][INFO] Epoch:[0/2](219300/4588595) loss:2.984 lr:0.0000100 epoch_Time:27606.0min: [2024-01-03 16:45:21,700][model8_pretrain.py][INFO] Epoch:[0/2](219300/4588595) loss:3.000 lr:0.0000100 epoch_Time:27606.0min: [2024-01-03 16:45:21,701][model8_pretrain.py][INFO] Epoch:[0/2](219300/4588595) loss:2.562 lr:0.0000100 epoch_Time:27606.0min: [2024-01-03 16:45:58,651][model8_pretrain.py][INFO] Epoch:[0/2](219400/4588595) loss:3.206 lr:0.0000100 epoch_Time:27604.0min: [2024-01-03 16:45:58,651][model8_pretrain.py][INFO] Epoch:[0/2](219400/4588595) loss:2.978 lr:0.0000100 epoch_Time:27604.0min: [2024-01-03 16:45:58,651][model8_pretrain.py][INFO] Epoch:[0/2](219400/4588595) loss:3.102 lr:0.0000100 epoch_Time:27604.0min: [2024-01-03 16:45:58,651][model8_pretrain.py][INFO] 
Epoch:[0/2](219400/4588595) loss:2.948 lr:0.0000100 epoch_Time:27604.0min: [2024-01-03 16:45:58,651][model8_pretrain.py][INFO] Epoch:[0/2](219400/4588595) loss:3.209 lr:0.0000100 epoch_Time:27604.0min: [2024-01-03 16:45:58,651][model8_pretrain.py][INFO] Epoch:[0/2](219400/4588595) loss:3.117 lr:0.0000100 epoch_Time:27604.0min: [2024-01-03 16:45:58,651][model8_pretrain.py][INFO] Epoch:[0/2](219400/4588595) loss:2.896 lr:0.0000100 epoch_Time:27604.0min: [2024-01-03 16:45:58,651][model8_pretrain.py][INFO] Epoch:[0/2](219400/4588595) loss:3.038 lr:0.0000100 epoch_Time:27604.0min: [2024-01-03 16:46:35,594][model8_pretrain.py][INFO] Epoch:[0/2](219500/4588595) loss:3.340 lr:0.0000100 epoch_Time:27604.0min: [2024-01-03 16:46:35,594][model8_pretrain.py][INFO] Epoch:[0/2](219500/4588595) loss:3.198 lr:0.0000100 epoch_Time:27604.0min: [2024-01-03 16:46:35,594][model8_pretrain.py][INFO] Epoch:[0/2](219500/4588595) loss:3.324 lr:0.0000100 epoch_Time:27604.0min: [2024-01-03 16:46:35,594][model8_pretrain.py][INFO] Epoch:[0/2](219500/4588595) loss:2.443 lr:0.0000100 epoch_Time:27604.0min: [2024-01-03 16:46:35,594][model8_pretrain.py][INFO] Epoch:[0/2](219500/4588595) loss:3.050 lr:0.0000100 epoch_Time:27604.0min: [2024-01-03 16:46:35,594][model8_pretrain.py][INFO] Epoch:[0/2](219500/4588595) loss:2.756 lr:0.0000100 epoch_Time:27604.0min: [2024-01-03 16:46:35,594][model8_pretrain.py][INFO] Epoch:[0/2](219500/4588595) loss:3.113 lr:0.0000100 epoch_Time:27604.0min: [2024-01-03 16:46:35,594][model8_pretrain.py][INFO] Epoch:[0/2](219500/4588595) loss:2.755 lr:0.0000100 epoch_Time:27604.0min: [2024-01-03 16:47:12,511][model8_pretrain.py][INFO] Epoch:[0/2](219600/4588595) loss:3.350 lr:0.0000100 epoch_Time:27603.0min: [2024-01-03 16:47:12,511][model8_pretrain.py][INFO] Epoch:[0/2](219600/4588595) loss:2.459 lr:0.0000100 epoch_Time:27603.0min: [2024-01-03 16:47:12,511][model8_pretrain.py][INFO] Epoch:[0/2](219600/4588595) loss:3.135 lr:0.0000100 epoch_Time:27603.0min: [2024-01-03 
16:47:12,511][model8_pretrain.py][INFO] Epoch:[0/2](219600/4588595) loss:2.579 lr:0.0000100 epoch_Time:27603.0min: [2024-01-03 16:47:12,511][model8_pretrain.py][INFO] Epoch:[0/2](219600/4588595) loss:2.846 lr:0.0000100 epoch_Time:27603.0min: [2024-01-03 16:47:12,511][model8_pretrain.py][INFO] Epoch:[0/2](219600/4588595) loss:2.538 lr:0.0000100 epoch_Time:27603.0min: [2024-01-03 16:47:12,511][model8_pretrain.py][INFO] Epoch:[0/2](219600/4588595) loss:2.836 lr:0.0000100 epoch_Time:27603.0min: [2024-01-03 16:47:12,511][model8_pretrain.py][INFO] Epoch:[0/2](219600/4588595) loss:3.183 lr:0.0000100 epoch_Time:27603.0min: [2024-01-03 16:47:49,456][model8_pretrain.py][INFO] Epoch:[0/2](219700/4588595) loss:3.161 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:47:49,456][model8_pretrain.py][INFO] Epoch:[0/2](219700/4588595) loss:3.383 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:47:49,456][model8_pretrain.py][INFO] Epoch:[0/2](219700/4588595) loss:3.112 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:47:49,456][model8_pretrain.py][INFO] Epoch:[0/2](219700/4588595) loss:3.019 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:47:49,456][model8_pretrain.py][INFO] Epoch:[0/2](219700/4588595) loss:3.107 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:47:49,456][model8_pretrain.py][INFO] Epoch:[0/2](219700/4588595) loss:3.063 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:47:49,456][model8_pretrain.py][INFO] Epoch:[0/2](219700/4588595) loss:3.142 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:47:49,456][model8_pretrain.py][INFO] Epoch:[0/2](219700/4588595) loss:2.471 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:48:26,395][model8_pretrain.py][INFO] Epoch:[0/2](219800/4588595) loss:2.502 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:48:26,395][model8_pretrain.py][INFO] Epoch:[0/2](219800/4588595) loss:2.671 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:48:26,395][model8_pretrain.py][INFO] Epoch:[0/2](219800/4588595) loss:2.981 lr:0.0000100 
epoch_Time:27601.0min: [2024-01-03 16:48:26,395][model8_pretrain.py][INFO] Epoch:[0/2](219800/4588595) loss:2.354 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:48:26,395][model8_pretrain.py][INFO] Epoch:[0/2](219800/4588595) loss:3.178 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:48:26,395][model8_pretrain.py][INFO] Epoch:[0/2](219800/4588595) loss:2.990 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:48:26,396][model8_pretrain.py][INFO] Epoch:[0/2](219800/4588595) loss:3.061 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:48:26,396][model8_pretrain.py][INFO] Epoch:[0/2](219800/4588595) loss:3.257 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:49:11,821][model8_pretrain.py][INFO] Epoch:[0/2](219900/4588595) loss:3.080 lr:0.0000100 epoch_Time:27603.0min: [2024-01-03 16:49:11,821][model8_pretrain.py][INFO] Epoch:[0/2](219900/4588595) loss:2.672 lr:0.0000100 epoch_Time:27603.0min: [2024-01-03 16:49:11,821][model8_pretrain.py][INFO] Epoch:[0/2](219900/4588595) loss:2.876 lr:0.0000100 epoch_Time:27603.0min: [2024-01-03 16:49:11,821][model8_pretrain.py][INFO] Epoch:[0/2](219900/4588595) loss:2.734 lr:0.0000100 epoch_Time:27603.0min: [2024-01-03 16:49:11,821][model8_pretrain.py][INFO] Epoch:[0/2](219900/4588595) loss:3.208 lr:0.0000100 epoch_Time:27603.0min: [2024-01-03 16:49:11,821][model8_pretrain.py][INFO] Epoch:[0/2](219900/4588595) loss:2.983 lr:0.0000100 epoch_Time:27603.0min: [2024-01-03 16:49:11,821][model8_pretrain.py][INFO] Epoch:[0/2](219900/4588595) loss:3.088 lr:0.0000100 epoch_Time:27603.0min: [2024-01-03 16:49:11,821][model8_pretrain.py][INFO] Epoch:[0/2](219900/4588595) loss:2.974 lr:0.0000100 epoch_Time:27603.0min: [2024-01-03 16:49:48,752][model8_pretrain.py][INFO] Epoch:[0/2](220000/4588595) loss:2.858 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:49:48,752][model8_pretrain.py][INFO] Epoch:[0/2](220000/4588595) loss:3.028 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:49:48,752][model8_pretrain.py][INFO] 
Epoch:[0/2](220000/4588595) loss:2.516 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:49:48,752][model8_pretrain.py][INFO] Epoch:[0/2](220000/4588595) loss:2.800 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:49:48,752][model8_pretrain.py][INFO] Epoch:[0/2](220000/4588595) loss:3.116 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:49:48,753][model8_pretrain.py][INFO] Epoch:[0/2](220000/4588595) loss:2.399 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:49:48,753][model8_pretrain.py][INFO] Epoch:[0/2](220000/4588595) loss:3.210 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:49:48,753][model8_pretrain.py][INFO] Epoch:[0/2](220000/4588595) loss:2.717 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:50:25,700][model8_pretrain.py][INFO] Epoch:[0/2](220100/4588595) loss:2.478 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:50:25,700][model8_pretrain.py][INFO] Epoch:[0/2](220100/4588595) loss:3.025 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:50:25,701][model8_pretrain.py][INFO] Epoch:[0/2](220100/4588595) loss:3.212 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:50:25,701][model8_pretrain.py][INFO] Epoch:[0/2](220100/4588595) loss:2.918 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:50:25,701][model8_pretrain.py][INFO] Epoch:[0/2](220100/4588595) loss:2.569 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:50:25,701][model8_pretrain.py][INFO] Epoch:[0/2](220100/4588595) loss:3.299 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:50:25,701][model8_pretrain.py][INFO] Epoch:[0/2](220100/4588595) loss:3.065 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:50:25,701][model8_pretrain.py][INFO] Epoch:[0/2](220100/4588595) loss:2.987 lr:0.0000100 epoch_Time:27601.0min: [2024-01-03 16:51:02,643][model8_pretrain.py][INFO] Epoch:[0/2](220200/4588595) loss:2.521 lr:0.0000100 epoch_Time:27600.0min: [2024-01-03 16:51:02,643][model8_pretrain.py][INFO] Epoch:[0/2](220200/4588595) loss:2.806 lr:0.0000100 epoch_Time:27600.0min: [2024-01-03 
16:51:02,643][model8_pretrain.py][INFO] Epoch:[0/2](220200/4588595) loss:2.654 lr:0.0000100 epoch_Time:27600.0min: [2024-01-03 16:51:02,643][model8_pretrain.py][INFO] Epoch:[0/2](220200/4588595) loss:2.668 lr:0.0000100 epoch_Time:27600.0min: [2024-01-03 16:51:02,643][model8_pretrain.py][INFO] Epoch:[0/2](220200/4588595) loss:2.947 lr:0.0000100 epoch_Time:27600.0min: [2024-01-03 16:51:02,643][model8_pretrain.py][INFO] Epoch:[0/2](220200/4588595) loss:3.254 lr:0.0000100 epoch_Time:27600.0min: [2024-01-03 16:51:02,643][model8_pretrain.py][INFO] Epoch:[0/2](220200/4588595) loss:2.980 lr:0.0000100 epoch_Time:27600.0min: [2024-01-03 16:51:02,643][model8_pretrain.py][INFO] Epoch:[0/2](220200/4588595) loss:3.081 lr:0.0000100 epoch_Time:27600.0min: [2024-01-03 16:51:39,587][model8_pretrain.py][INFO] Epoch:[0/2](220300/4588595) loss:2.786 lr:0.0000100 epoch_Time:27599.0min: [2024-01-03 16:51:39,587][model8_pretrain.py][INFO] Epoch:[0/2](220300/4588595) loss:3.191 lr:0.0000100 epoch_Time:27599.0min: [2024-01-03 16:51:39,587][model8_pretrain.py][INFO] Epoch:[0/2](220300/4588595) loss:2.859 lr:0.0000100 epoch_Time:27599.0min: [2024-01-03 16:51:39,587][model8_pretrain.py][INFO] Epoch:[0/2](220300/4588595) loss:3.078 lr:0.0000100 epoch_Time:27599.0min: [2024-01-03 16:51:39,587][model8_pretrain.py][INFO] Epoch:[0/2](220300/4588595) loss:3.189 lr:0.0000100 epoch_Time:27599.0min: [2024-01-03 16:51:39,587][model8_pretrain.py][INFO] Epoch:[0/2](220300/4588595) loss:2.878 lr:0.0000100 epoch_Time:27599.0min: [2024-01-03 16:51:39,587][model8_pretrain.py][INFO] Epoch:[0/2](220300/4588595) loss:2.802 lr:0.0000100 epoch_Time:27599.0min: [2024-01-03 16:51:39,587][model8_pretrain.py][INFO] Epoch:[0/2](220300/4588595) loss:3.546 lr:0.0000100 epoch_Time:27599.0min: [2024-01-03 16:52:16,533][model8_pretrain.py][INFO] Epoch:[0/2](220400/4588595) loss:2.627 lr:0.0000100 epoch_Time:27598.0min: [2024-01-03 16:52:16,534][model8_pretrain.py][INFO] Epoch:[0/2](220400/4588595) loss:3.239 lr:0.0000100 
epoch_Time:27598.0min: [2024-01-03 16:52:16,533][model8_pretrain.py][INFO] Epoch:[0/2](220400/4588595) loss:3.121 lr:0.0000100 epoch_Time:27598.0min: [2024-01-03 16:52:16,534][model8_pretrain.py][INFO] Epoch:[0/2](220400/4588595) loss:2.558 lr:0.0000100 epoch_Time:27598.0min: [2024-01-03 16:52:16,534][model8_pretrain.py][INFO] Epoch:[0/2](220400/4588595) loss:2.807 lr:0.0000100 epoch_Time:27598.0min: [2024-01-03 16:52:16,534][model8_pretrain.py][INFO] Epoch:[0/2](220400/4588595) loss:2.835 lr:0.0000100 epoch_Time:27598.0min: [2024-01-03 16:52:16,533][model8_pretrain.py][INFO] Epoch:[0/2](220400/4588595) loss:2.790 lr:0.0000100 epoch_Time:27598.0min: [2024-01-03 16:52:16,534][model8_pretrain.py][INFO] Epoch:[0/2](220400/4588595) loss:3.121 lr:0.0000100 epoch_Time:27598.0min: [2024-01-03 16:52:53,468][model8_pretrain.py][INFO] Epoch:[0/2](220500/4588595) loss:2.900 lr:0.0000100 epoch_Time:27597.0min: [2024-01-03 16:52:53,468][model8_pretrain.py][INFO] Epoch:[0/2](220500/4588595) loss:2.660 lr:0.0000100 epoch_Time:27597.0min: [2024-01-03 16:52:53,468][model8_pretrain.py][INFO] Epoch:[0/2](220500/4588595) loss:2.801 lr:0.0000100 epoch_Time:27597.0min: [2024-01-03 16:52:53,468][model8_pretrain.py][INFO] Epoch:[0/2](220500/4588595) loss:2.796 lr:0.0000100 epoch_Time:27597.0min: [2024-01-03 16:52:53,468][model8_pretrain.py][INFO] Epoch:[0/2](220500/4588595) loss:2.791 lr:0.0000100 epoch_Time:27597.0min: [2024-01-03 16:52:53,468][model8_pretrain.py][INFO] Epoch:[0/2](220500/4588595) loss:2.531 lr:0.0000100 epoch_Time:27597.0min: [2024-01-03 16:52:53,468][model8_pretrain.py][INFO] Epoch:[0/2](220500/4588595) loss:3.470 lr:0.0000100 epoch_Time:27597.0min: [2024-01-03 16:52:53,469][model8_pretrain.py][INFO] Epoch:[0/2](220500/4588595) loss:3.066 lr:0.0000100 epoch_Time:27597.0min: [2024-01-03 16:53:30,411][model8_pretrain.py][INFO] Epoch:[0/2](220600/4588595) loss:3.004 lr:0.0000100 epoch_Time:27596.0min: [2024-01-03 16:53:30,411][model8_pretrain.py][INFO] 
Epoch:[0/2](220600/4588595) loss:2.911 lr:0.0000100 epoch_Time:27596.0min: [2024-01-03 16:53:30,411][model8_pretrain.py][INFO] Epoch:[0/2](220600/4588595) loss:3.458 lr:0.0000100 epoch_Time:27596.0min: [2024-01-03 16:53:30,411][model8_pretrain.py][INFO] Epoch:[0/2](220600/4588595) loss:3.163 lr:0.0000100 epoch_Time:27596.0min: [2024-01-03 16:53:30,411][model8_pretrain.py][INFO] Epoch:[0/2](220600/4588595) loss:2.665 lr:0.0000100 epoch_Time:27596.0min: [2024-01-03 16:53:30,411][model8_pretrain.py][INFO] Epoch:[0/2](220600/4588595) loss:3.041 lr:0.0000100 epoch_Time:27596.0min: [2024-01-03 16:53:30,411][model8_pretrain.py][INFO] Epoch:[0/2](220600/4588595) loss:3.142 lr:0.0000100 epoch_Time:27596.0min: [2024-01-03 16:53:30,411][model8_pretrain.py][INFO] Epoch:[0/2](220600/4588595) loss:2.989 lr:0.0000100 epoch_Time:27596.0min: [2024-01-03 16:54:15,815][model8_pretrain.py][INFO] Epoch:[0/2](220700/4588595) loss:2.920 lr:0.0000100 epoch_Time:27598.0min: [2024-01-03 16:54:15,815][model8_pretrain.py][INFO] Epoch:[0/2](220700/4588595) loss:2.831 lr:0.0000100 epoch_Time:27598.0min: [2024-01-03 16:54:15,815][model8_pretrain.py][INFO] Epoch:[0/2](220700/4588595) loss:2.879 lr:0.0000100 epoch_Time:27598.0min: [2024-01-03 16:54:15,815][model8_pretrain.py][INFO] Epoch:[0/2](220700/4588595) loss:3.001 lr:0.0000100 epoch_Time:27598.0min: [2024-01-03 16:54:15,815][model8_pretrain.py][INFO] Epoch:[0/2](220700/4588595) loss:2.338 lr:0.0000100 epoch_Time:27598.0min: [2024-01-03 16:54:15,815][model8_pretrain.py][INFO] Epoch:[0/2](220700/4588595) loss:3.234 lr:0.0000100 epoch_Time:27598.0min: [2024-01-03 16:54:15,815][model8_pretrain.py][INFO] Epoch:[0/2](220700/4588595) loss:3.059 lr:0.0000100 epoch_Time:27598.0min: [2024-01-03 16:54:15,815][model8_pretrain.py][INFO] Epoch:[0/2](220700/4588595) loss:2.733 lr:0.0000100 epoch_Time:27598.0min: [2024-01-03 16:54:52,733][model8_pretrain.py][INFO] Epoch:[0/2](220800/4588595) loss:2.834 lr:0.0000100 epoch_Time:27597.0min: [2024-01-03 
16:54:52,733][model8_pretrain.py][INFO] Epoch:[0/2](220800/4588595) loss:3.154 lr:0.0000100 epoch_Time:27597.0min: [2024-01-03 16:54:52,733][model8_pretrain.py][INFO] Epoch:[0/2](220800/4588595) loss:3.061 lr:0.0000100 epoch_Time:27597.0min: [2024-01-03 16:54:52,733][model8_pretrain.py][INFO] Epoch:[0/2](220800/4588595) loss:2.269 lr:0.0000100 epoch_Time:27597.0min: [2024-01-03 16:54:52,733][model8_pretrain.py][INFO] Epoch:[0/2](220800/4588595) loss:3.028 lr:0.0000100 epoch_Time:27597.0min: [2024-01-03 16:54:52,733][model8_pretrain.py][INFO] Epoch:[0/2](220800/4588595) loss:2.947 lr:0.0000100 epoch_Time:27597.0min: [2024-01-03 16:54:52,733][model8_pretrain.py][INFO] Epoch:[0/2](220800/4588595) loss:3.058 lr:0.0000100 epoch_Time:27597.0min: [2024-01-03 16:54:52,733][model8_pretrain.py][INFO] Epoch:[0/2](220800/4588595) loss:3.194 lr:0.0000100 epoch_Time:27597.0min: [2024-01-03 16:55:29,665][model8_pretrain.py][INFO] Epoch:[0/2](220900/4588595) loss:2.809 lr:0.0000100 epoch_Time:27596.0min: [2024-01-03 16:55:29,666][model8_pretrain.py][INFO] Epoch:[0/2](220900/4588595) loss:2.825 lr:0.0000100 epoch_Time:27596.0min: [2024-01-03 16:55:29,666][model8_pretrain.py][INFO] Epoch:[0/2](220900/4588595) loss:3.062 lr:0.0000100 epoch_Time:27596.0min: [2024-01-03 16:55:29,666][model8_pretrain.py][INFO] Epoch:[0/2](220900/4588595) loss:2.331 lr:0.0000100 epoch_Time:27596.0min: [2024-01-03 16:55:29,666][model8_pretrain.py][INFO] Epoch:[0/2](220900/4588595) loss:2.771 lr:0.0000100 epoch_Time:27596.0min: [2024-01-03 16:55:29,666][model8_pretrain.py][INFO] Epoch:[0/2](220900/4588595) loss:3.023 lr:0.0000100 epoch_Time:27596.0min: [2024-01-03 16:55:29,666][model8_pretrain.py][INFO] Epoch:[0/2](220900/4588595) loss:3.000 lr:0.0000100 epoch_Time:27596.0min: [2024-01-03 16:55:29,666][model8_pretrain.py][INFO] Epoch:[0/2](220900/4588595) loss:2.641 lr:0.0000100 epoch_Time:27596.0min: [2024-01-03 16:56:06,602][model8_pretrain.py][INFO] Epoch:[0/2](221000/4588595) loss:3.087 lr:0.0000100 
epoch_Time:27595.0min: [2024-01-03 16:56:06,602][model8_pretrain.py][INFO] Epoch:[0/2](221000/4588595) loss:2.396 lr:0.0000100 epoch_Time:27595.0min: [2024-01-03 16:56:06,602][model8_pretrain.py][INFO] Epoch:[0/2](221000/4588595) loss:3.389 lr:0.0000100 epoch_Time:27595.0min: [2024-01-03 16:56:06,602][model8_pretrain.py][INFO] Epoch:[0/2](221000/4588595) loss:3.039 lr:0.0000100 epoch_Time:27595.0min: [2024-01-03 16:56:06,602][model8_pretrain.py][INFO] Epoch:[0/2](221000/4588595) loss:3.032 lr:0.0000100 epoch_Time:27595.0min: [2024-01-03 16:56:06,602][model8_pretrain.py][INFO] Epoch:[0/2](221000/4588595) loss:2.556 lr:0.0000100 epoch_Time:27595.0min: [2024-01-03 16:56:06,603][model8_pretrain.py][INFO] Epoch:[0/2](221000/4588595) loss:2.620 lr:0.0000100 epoch_Time:27595.0min: [2024-01-03 16:56:06,603][model8_pretrain.py][INFO] Epoch:[0/2](221000/4588595) loss:3.217 lr:0.0000100 epoch_Time:27595.0min: [2024-01-03 16:56:43,536][model8_pretrain.py][INFO] Epoch:[0/2](221100/4588595) loss:3.167 lr:0.0000100 epoch_Time:27595.0min: [2024-01-03 16:56:43,536][model8_pretrain.py][INFO] Epoch:[0/2](221100/4588595) loss:3.160 lr:0.0000100 epoch_Time:27595.0min: [2024-01-03 16:56:43,536][model8_pretrain.py][INFO] Epoch:[0/2](221100/4588595) loss:2.175 lr:0.0000100 epoch_Time:27595.0min: [2024-01-03 16:56:43,536][model8_pretrain.py][INFO] Epoch:[0/2](221100/4588595) loss:2.474 lr:0.0000100 epoch_Time:27595.0min: [2024-01-03 16:56:43,536][model8_pretrain.py][INFO] Epoch:[0/2](221100/4588595) loss:3.070 lr:0.0000100 epoch_Time:27595.0min: [2024-01-03 16:56:43,536][model8_pretrain.py][INFO] Epoch:[0/2](221100/4588595) loss:3.312 lr:0.0000100 epoch_Time:27595.0min: [2024-01-03 16:56:43,536][model8_pretrain.py][INFO] Epoch:[0/2](221100/4588595) loss:2.457 lr:0.0000100 epoch_Time:27595.0min: [2024-01-03 16:56:43,536][model8_pretrain.py][INFO] Epoch:[0/2](221100/4588595) loss:2.734 lr:0.0000100 epoch_Time:27595.0min: [2024-01-03 16:57:20,467][model8_pretrain.py][INFO] 
Epoch:[0/2](221200/4588595) loss:2.861 lr:0.0000100 epoch_Time:27593.0min: [2024-01-03 16:57:20,467][model8_pretrain.py][INFO] Epoch:[0/2](221200/4588595) loss:2.585 lr:0.0000100 epoch_Time:27593.0min: [2024-01-03 16:57:20,467][model8_pretrain.py][INFO] Epoch:[0/2](221200/4588595) loss:3.368 lr:0.0000100 epoch_Time:27593.0min: [2024-01-03 16:57:20,467][model8_pretrain.py][INFO] Epoch:[0/2](221200/4588595) loss:2.514 lr:0.0000100 epoch_Time:27593.0min: [2024-01-03 16:57:20,467][model8_pretrain.py][INFO] Epoch:[0/2](221200/4588595) loss:2.889 lr:0.0000100 epoch_Time:27593.0min: [2024-01-03 16:57:20,467][model8_pretrain.py][INFO] Epoch:[0/2](221200/4588595) loss:2.985 lr:0.0000100 epoch_Time:27593.0min: [2024-01-03 16:57:20,467][model8_pretrain.py][INFO] Epoch:[0/2](221200/4588595) loss:2.780 lr:0.0000100 epoch_Time:27593.0min: [2024-01-03 16:57:20,467][model8_pretrain.py][INFO] Epoch:[0/2](221200/4588595) loss:3.029 lr:0.0000100 epoch_Time:27593.0min: [2024-01-03 16:57:57,416][model8_pretrain.py][INFO] Epoch:[0/2](221300/4588595) loss:2.745 lr:0.0000100 epoch_Time:27592.0min: [2024-01-03 16:57:57,416][model8_pretrain.py][INFO] Epoch:[0/2](221300/4588595) loss:2.901 lr:0.0000100 epoch_Time:27592.0min: [2024-01-03 16:57:57,416][model8_pretrain.py][INFO] Epoch:[0/2](221300/4588595) loss:3.440 lr:0.0000100 epoch_Time:27592.0min: [2024-01-03 16:57:57,416][model8_pretrain.py][INFO] Epoch:[0/2](221300/4588595) loss:2.747 lr:0.0000100 epoch_Time:27592.0min: [2024-01-03 16:57:57,416][model8_pretrain.py][INFO] Epoch:[0/2](221300/4588595) loss:2.646 lr:0.0000100 epoch_Time:27592.0min: [2024-01-03 16:57:57,416][model8_pretrain.py][INFO] Epoch:[0/2](221300/4588595) loss:3.330 lr:0.0000100 epoch_Time:27592.0min: [2024-01-03 16:57:57,416][model8_pretrain.py][INFO] Epoch:[0/2](221300/4588595) loss:2.744 lr:0.0000100 epoch_Time:27592.0min: [2024-01-03 16:57:57,417][model8_pretrain.py][INFO] Epoch:[0/2](221300/4588595) loss:2.590 lr:0.0000100 epoch_Time:27592.0min: [2024-01-03 
16:58:34,349][model8_pretrain.py][INFO] Epoch:[0/2](221400/4588595) loss:2.677 lr:0.0000100 epoch_Time:27592.0min: [2024-01-03 16:58:34,349][model8_pretrain.py][INFO] Epoch:[0/2](221400/4588595) loss:2.593 lr:0.0000100 epoch_Time:27592.0min: [2024-01-03 16:58:34,349][model8_pretrain.py][INFO] Epoch:[0/2](221400/4588595) loss:3.106 lr:0.0000100 epoch_Time:27592.0min: [2024-01-03 16:58:34,349][model8_pretrain.py][INFO] Epoch:[0/2](221400/4588595) loss:3.011 lr:0.0000100 epoch_Time:27592.0min: [2024-01-03 16:58:34,349][model8_pretrain.py][INFO] Epoch:[0/2](221400/4588595) loss:2.962 lr:0.0000100 epoch_Time:27592.0min: [2024-01-03 16:58:34,349][model8_pretrain.py][INFO] Epoch:[0/2](221400/4588595) loss:2.859 lr:0.0000100 epoch_Time:27592.0min: [2024-01-03 16:58:34,349][model8_pretrain.py][INFO] Epoch:[0/2](221400/4588595) loss:3.044 lr:0.0000100 epoch_Time:27592.0min: [2024-01-03 16:58:34,350][model8_pretrain.py][INFO] Epoch:[0/2](221400/4588595) loss:2.866 lr:0.0000100 epoch_Time:27592.0min: [2024-01-03 16:59:19,730][model8_pretrain.py][INFO] Epoch:[0/2](221500/4588595) loss:2.796 lr:0.0000100 epoch_Time:27593.0min: [2024-01-03 16:59:19,730][model8_pretrain.py][INFO] Epoch:[0/2](221500/4588595) loss:2.780 lr:0.0000100 epoch_Time:27593.0min: [2024-01-03 16:59:19,730][model8_pretrain.py][INFO] Epoch:[0/2](221500/4588595) loss:3.508 lr:0.0000100 epoch_Time:27593.0min: [2024-01-03 16:59:19,730][model8_pretrain.py][INFO] Epoch:[0/2](221500/4588595) loss:2.883 lr:0.0000100 epoch_Time:27593.0min: [2024-01-03 16:59:19,730][model8_pretrain.py][INFO] Epoch:[0/2](221500/4588595) loss:2.904 lr:0.0000100 epoch_Time:27593.0min: [2024-01-03 16:59:19,730][model8_pretrain.py][INFO] Epoch:[0/2](221500/4588595) loss:2.596 lr:0.0000100 epoch_Time:27593.0min: [2024-01-03 16:59:19,730][model8_pretrain.py][INFO] Epoch:[0/2](221500/4588595) loss:2.410 lr:0.0000100 epoch_Time:27593.0min: [2024-01-03 16:59:19,730][model8_pretrain.py][INFO] Epoch:[0/2](221500/4588595) loss:3.182 lr:0.0000100 
epoch_Time:27593.0min: [2024-01-03 16:59:56,668][model8_pretrain.py][INFO] Epoch:[0/2](221600/4588595) loss:3.160 lr:0.0000100 epoch_Time:27592.0min: [2024-01-03 16:59:56,668][model8_pretrain.py][INFO] Epoch:[0/2](221600/4588595) loss:3.030 lr:0.0000100 epoch_Time:27592.0min: [2024-01-03 16:59:56,668][model8_pretrain.py][INFO] Epoch:[0/2](221600/4588595) loss:3.002 lr:0.0000100 epoch_Time:27592.0min: [2024-01-03 16:59:56,668][model8_pretrain.py][INFO] Epoch:[0/2](221600/4588595) loss:3.319 lr:0.0000100 epoch_Time:27592.0min: [2024-01-03 16:59:56,668][model8_pretrain.py][INFO] Epoch:[0/2](221600/4588595) loss:3.195 lr:0.0000100 epoch_Time:27592.0min: [2024-01-03 16:59:56,668][model8_pretrain.py][INFO] Epoch:[0/2](221600/4588595) loss:3.259 lr:0.0000100 epoch_Time:27592.0min: [2024-01-03 16:59:56,668][model8_pretrain.py][INFO] Epoch:[0/2](221600/4588595) loss:2.918 lr:0.0000100 epoch_Time:27592.0min: [2024-01-03 16:59:56,668][model8_pretrain.py][INFO] Epoch:[0/2](221600/4588595) loss:2.112 lr:0.0000100 epoch_Time:27592.0min: [2024-01-03 17:00:33,614][model8_pretrain.py][INFO] Epoch:[0/2](221700/4588595) loss:2.597 lr:0.0000100 epoch_Time:27591.0min: [2024-01-03 17:00:33,614][model8_pretrain.py][INFO] Epoch:[0/2](221700/4588595) loss:2.337 lr:0.0000100 epoch_Time:27591.0min: [2024-01-03 17:00:33,614][model8_pretrain.py][INFO] Epoch:[0/2](221700/4588595) loss:2.442 lr:0.0000100 epoch_Time:27591.0min: [2024-01-03 17:00:33,614][model8_pretrain.py][INFO] Epoch:[0/2](221700/4588595) loss:3.044 lr:0.0000100 epoch_Time:27591.0min: [2024-01-03 17:00:33,614][model8_pretrain.py][INFO] Epoch:[0/2](221700/4588595) loss:2.915 lr:0.0000100 epoch_Time:27591.0min: [2024-01-03 17:00:33,614][model8_pretrain.py][INFO] Epoch:[0/2](221700/4588595) loss:2.808 lr:0.0000100 epoch_Time:27591.0min: [2024-01-03 17:00:33,614][model8_pretrain.py][INFO] Epoch:[0/2](221700/4588595) loss:2.476 lr:0.0000100 epoch_Time:27591.0min: [2024-01-03 17:00:33,615][model8_pretrain.py][INFO] 
Epoch:[0/2](221700/4588595) loss:3.413 lr:0.0000100 epoch_Time:27591.0min: [2024-01-03 17:01:10,537][model8_pretrain.py][INFO] Epoch:[0/2](221800/4588595) loss:3.129 lr:0.0000100 epoch_Time:27590.0min: [2024-01-03 17:01:10,537][model8_pretrain.py][INFO] Epoch:[0/2](221800/4588595) loss:3.426 lr:0.0000100 epoch_Time:27590.0min: [2024-01-03 17:01:10,537][model8_pretrain.py][INFO] Epoch:[0/2](221800/4588595) loss:2.960 lr:0.0000100 epoch_Time:27590.0min: [2024-01-03 17:01:10,537][model8_pretrain.py][INFO] Epoch:[0/2](221800/4588595) loss:2.797 lr:0.0000100 epoch_Time:27590.0min: [2024-01-03 17:01:10,537][model8_pretrain.py][INFO] Epoch:[0/2](221800/4588595) loss:3.185 lr:0.0000100 epoch_Time:27590.0min: [2024-01-03 17:01:10,537][model8_pretrain.py][INFO] Epoch:[0/2](221800/4588595) loss:3.339 lr:0.0000100 epoch_Time:27590.0min: [2024-01-03 17:01:10,537][model8_pretrain.py][INFO] Epoch:[0/2](221800/4588595) loss:2.675 lr:0.0000100 epoch_Time:27590.0min: [2024-01-03 17:01:10,538][model8_pretrain.py][INFO] Epoch:[0/2](221800/4588595) loss:2.511 lr:0.0000100 epoch_Time:27590.0min: [2024-01-03 17:01:47,485][model8_pretrain.py][INFO] Epoch:[0/2](221900/4588595) loss:2.944 lr:0.0000100 epoch_Time:27590.0min: [2024-01-03 17:01:47,485][model8_pretrain.py][INFO] Epoch:[0/2](221900/4588595) loss:2.798 lr:0.0000100 epoch_Time:27590.0min: [2024-01-03 17:01:47,485][model8_pretrain.py][INFO] Epoch:[0/2](221900/4588595) loss:3.245 lr:0.0000100 epoch_Time:27590.0min: [2024-01-03 17:01:47,485][model8_pretrain.py][INFO] Epoch:[0/2](221900/4588595) loss:3.134 lr:0.0000100 epoch_Time:27590.0min: [2024-01-03 17:01:47,485][model8_pretrain.py][INFO] Epoch:[0/2](221900/4588595) loss:3.109 lr:0.0000100 epoch_Time:27590.0min: [2024-01-03 17:01:47,485][model8_pretrain.py][INFO] Epoch:[0/2](221900/4588595) loss:3.024 lr:0.0000100 epoch_Time:27590.0min: [2024-01-03 17:01:47,485][model8_pretrain.py][INFO] Epoch:[0/2](221900/4588595) loss:2.556 lr:0.0000100 epoch_Time:27590.0min: [2024-01-03 
17:01:47,485][model8_pretrain.py][INFO] Epoch:[0/2](221900/4588595) loss:3.196 lr:0.0000100 epoch_Time:27590.0min: [2024-01-03 17:02:24,428][model8_pretrain.py][INFO] Epoch:[0/2](222000/4588595) loss:3.418 lr:0.0000100 epoch_Time:27588.0min: [2024-01-03 17:02:24,428][model8_pretrain.py][INFO] Epoch:[0/2](222000/4588595) loss:3.184 lr:0.0000100 epoch_Time:27588.0min: [2024-01-03 17:02:24,428][model8_pretrain.py][INFO] Epoch:[0/2](222000/4588595) loss:2.572 lr:0.0000100 epoch_Time:27588.0min: [2024-01-03 17:02:24,428][model8_pretrain.py][INFO] Epoch:[0/2](222000/4588595) loss:1.632 lr:0.0000100 epoch_Time:27588.0min: [2024-01-03 17:02:24,428][model8_pretrain.py][INFO] Epoch:[0/2](222000/4588595) loss:3.170 lr:0.0000100 epoch_Time:27588.0min: [2024-01-03 17:02:24,428][model8_pretrain.py][INFO] Epoch:[0/2](222000/4588595) loss:2.938 lr:0.0000100 epoch_Time:27588.0min: [2024-01-03 17:02:24,428][model8_pretrain.py][INFO] Epoch:[0/2](222000/4588595) loss:2.404 lr:0.0000100 epoch_Time:27588.0min: [2024-01-03 17:02:24,428][model8_pretrain.py][INFO] Epoch:[0/2](222000/4588595) loss:3.084 lr:0.0000100 epoch_Time:27588.0min: [2024-01-03 17:03:01,339][model8_pretrain.py][INFO] Epoch:[0/2](222100/4588595) loss:3.051 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:03:01,339][model8_pretrain.py][INFO] Epoch:[0/2](222100/4588595) loss:3.079 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:03:01,339][model8_pretrain.py][INFO] Epoch:[0/2](222100/4588595) loss:2.747 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:03:01,339][model8_pretrain.py][INFO] Epoch:[0/2](222100/4588595) loss:3.162 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:03:01,339][model8_pretrain.py][INFO] Epoch:[0/2](222100/4588595) loss:2.495 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:03:01,339][model8_pretrain.py][INFO] Epoch:[0/2](222100/4588595) loss:3.013 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:03:01,339][model8_pretrain.py][INFO] Epoch:[0/2](222100/4588595) loss:2.899 lr:0.0000100 
epoch_Time:27587.0min: [2024-01-03 17:03:01,339][model8_pretrain.py][INFO] Epoch:[0/2](222100/4588595) loss:2.597 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:03:38,263][model8_pretrain.py][INFO] Epoch:[0/2](222200/4588595) loss:2.950 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:03:38,263][model8_pretrain.py][INFO] Epoch:[0/2](222200/4588595) loss:2.643 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:03:38,263][model8_pretrain.py][INFO] Epoch:[0/2](222200/4588595) loss:2.576 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:03:38,263][model8_pretrain.py][INFO] Epoch:[0/2](222200/4588595) loss:2.617 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:03:38,263][model8_pretrain.py][INFO] Epoch:[0/2](222200/4588595) loss:2.537 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:03:38,264][model8_pretrain.py][INFO] Epoch:[0/2](222200/4588595) loss:3.106 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:03:38,264][model8_pretrain.py][INFO] Epoch:[0/2](222200/4588595) loss:3.119 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:03:38,264][model8_pretrain.py][INFO] Epoch:[0/2](222200/4588595) loss:3.084 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:04:23,816][model8_pretrain.py][INFO] Epoch:[0/2](222300/4588595) loss:3.224 lr:0.0000100 epoch_Time:27588.0min: [2024-01-03 17:04:23,816][model8_pretrain.py][INFO] Epoch:[0/2](222300/4588595) loss:2.290 lr:0.0000100 epoch_Time:27588.0min: [2024-01-03 17:04:23,816][model8_pretrain.py][INFO] Epoch:[0/2](222300/4588595) loss:3.381 lr:0.0000100 epoch_Time:27588.0min: [2024-01-03 17:04:23,816][model8_pretrain.py][INFO] Epoch:[0/2](222300/4588595) loss:3.128 lr:0.0000100 epoch_Time:27588.0min: [2024-01-03 17:04:23,816][model8_pretrain.py][INFO] Epoch:[0/2](222300/4588595) loss:3.000 lr:0.0000100 epoch_Time:27588.0min: [2024-01-03 17:04:23,816][model8_pretrain.py][INFO] Epoch:[0/2](222300/4588595) loss:2.813 lr:0.0000100 epoch_Time:27588.0min: [2024-01-03 17:04:23,817][model8_pretrain.py][INFO] 
Epoch:[0/2](222300/4588595) loss:2.724 lr:0.0000100 epoch_Time:27588.0min: [2024-01-03 17:04:23,818][model8_pretrain.py][INFO] Epoch:[0/2](222300/4588595) loss:2.904 lr:0.0000100 epoch_Time:27588.0min: [2024-01-03 17:05:00,754][model8_pretrain.py][INFO] Epoch:[0/2](222400/4588595) loss:3.364 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:05:00,754][model8_pretrain.py][INFO] Epoch:[0/2](222400/4588595) loss:2.914 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:05:00,754][model8_pretrain.py][INFO] Epoch:[0/2](222400/4588595) loss:3.352 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:05:00,754][model8_pretrain.py][INFO] Epoch:[0/2](222400/4588595) loss:3.037 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:05:00,754][model8_pretrain.py][INFO] Epoch:[0/2](222400/4588595) loss:3.121 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:05:00,754][model8_pretrain.py][INFO] Epoch:[0/2](222400/4588595) loss:2.553 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:05:00,754][model8_pretrain.py][INFO] Epoch:[0/2](222400/4588595) loss:2.719 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:05:00,754][model8_pretrain.py][INFO] Epoch:[0/2](222400/4588595) loss:2.874 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:05:37,698][model8_pretrain.py][INFO] Epoch:[0/2](222500/4588595) loss:2.944 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:05:37,698][model8_pretrain.py][INFO] Epoch:[0/2](222500/4588595) loss:2.475 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:05:37,698][model8_pretrain.py][INFO] Epoch:[0/2](222500/4588595) loss:2.702 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:05:37,698][model8_pretrain.py][INFO] Epoch:[0/2](222500/4588595) loss:3.051 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:05:37,698][model8_pretrain.py][INFO] Epoch:[0/2](222500/4588595) loss:2.275 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:05:37,698][model8_pretrain.py][INFO] Epoch:[0/2](222500/4588595) loss:3.021 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 
17:05:37,698][model8_pretrain.py][INFO] Epoch:[0/2](222500/4588595) loss:2.919 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:05:37,698][model8_pretrain.py][INFO] Epoch:[0/2](222500/4588595) loss:2.988 lr:0.0000100 epoch_Time:27587.0min: [2024-01-03 17:06:14,652][model8_pretrain.py][INFO] Epoch:[0/2](222600/4588595) loss:3.374 lr:0.0000100 epoch_Time:27585.0min: [2024-01-03 17:06:14,652][model8_pretrain.py][INFO] Epoch:[0/2](222600/4588595) loss:3.030 lr:0.0000100 epoch_Time:27585.0min: [2024-01-03 17:06:14,652][model8_pretrain.py][INFO] Epoch:[0/2](222600/4588595) loss:2.689 lr:0.0000100 epoch_Time:27585.0min: [2024-01-03 17:06:14,652][model8_pretrain.py][INFO] Epoch:[0/2](222600/4588595) loss:3.240 lr:0.0000100 epoch_Time:27585.0min: [2024-01-03 17:06:14,652][model8_pretrain.py][INFO] Epoch:[0/2](222600/4588595) loss:3.137 lr:0.0000100 epoch_Time:27585.0min: [2024-01-03 17:06:14,652][model8_pretrain.py][INFO] Epoch:[0/2](222600/4588595) loss:2.819 lr:0.0000100 epoch_Time:27585.0min: [2024-01-03 17:06:14,652][model8_pretrain.py][INFO] Epoch:[0/2](222600/4588595) loss:3.332 lr:0.0000100 epoch_Time:27585.0min: [2024-01-03 17:06:14,652][model8_pretrain.py][INFO] Epoch:[0/2](222600/4588595) loss:2.943 lr:0.0000100 epoch_Time:27585.0min: [2024-01-03 17:06:51,601][model8_pretrain.py][INFO] Epoch:[0/2](222700/4588595) loss:3.292 lr:0.0000100 epoch_Time:27584.0min: [2024-01-03 17:06:51,601][model8_pretrain.py][INFO] Epoch:[0/2](222700/4588595) loss:3.303 lr:0.0000100 epoch_Time:27584.0min: [2024-01-03 17:06:51,601][model8_pretrain.py][INFO] Epoch:[0/2](222700/4588595) loss:2.534 lr:0.0000100 epoch_Time:27584.0min: [2024-01-03 17:06:51,601][model8_pretrain.py][INFO] Epoch:[0/2](222700/4588595) loss:3.080 lr:0.0000100 epoch_Time:27584.0min: [2024-01-03 17:06:51,601][model8_pretrain.py][INFO] Epoch:[0/2](222700/4588595) loss:3.075 lr:0.0000100 epoch_Time:27584.0min: [2024-01-03 17:06:51,601][model8_pretrain.py][INFO] Epoch:[0/2](222700/4588595) loss:3.103 lr:0.0000100 
epoch_Time:27584.0min: [2024-01-03 17:06:51,601][model8_pretrain.py][INFO] Epoch:[0/2](222700/4588595) loss:2.854 lr:0.0000100 epoch_Time:27584.0min: [2024-01-03 17:06:51,601][model8_pretrain.py][INFO] Epoch:[0/2](222700/4588595) loss:3.312 lr:0.0000100 epoch_Time:27584.0min: [2024-01-03 17:07:28,584][model8_pretrain.py][INFO] Epoch:[0/2](222800/4588595) loss:2.424 lr:0.0000100 epoch_Time:27584.0min: [2024-01-03 17:07:28,584][model8_pretrain.py][INFO] Epoch:[0/2](222800/4588595) loss:2.781 lr:0.0000100 epoch_Time:27584.0min: [2024-01-03 17:07:28,584][model8_pretrain.py][INFO] Epoch:[0/2](222800/4588595) loss:3.358 lr:0.0000100 epoch_Time:27584.0min: [2024-01-03 17:07:28,584][model8_pretrain.py][INFO] Epoch:[0/2](222800/4588595) loss:2.718 lr:0.0000100 epoch_Time:27584.0min: [2024-01-03 17:07:28,584][model8_pretrain.py][INFO] Epoch:[0/2](222800/4588595) loss:3.027 lr:0.0000100 epoch_Time:27584.0min: [2024-01-03 17:07:28,584][model8_pretrain.py][INFO] Epoch:[0/2](222800/4588595) loss:2.600 lr:0.0000100 epoch_Time:27584.0min: [2024-01-03 17:07:28,584][model8_pretrain.py][INFO] Epoch:[0/2](222800/4588595) loss:3.124 lr:0.0000100 epoch_Time:27584.0min: [2024-01-03 17:07:28,584][model8_pretrain.py][INFO] Epoch:[0/2](222800/4588595) loss:2.793 lr:0.0000100 epoch_Time:27584.0min: [2024-01-03 17:08:05,544][model8_pretrain.py][INFO] Epoch:[0/2](222900/4588595) loss:3.147 lr:0.0000100 epoch_Time:27582.0min: [2024-01-03 17:08:05,544][model8_pretrain.py][INFO] Epoch:[0/2](222900/4588595) loss:2.951 lr:0.0000100 epoch_Time:27582.0min: [2024-01-03 17:08:05,544][model8_pretrain.py][INFO] Epoch:[0/2](222900/4588595) loss:2.573 lr:0.0000100 epoch_Time:27582.0min: [2024-01-03 17:08:05,544][model8_pretrain.py][INFO] Epoch:[0/2](222900/4588595) loss:2.805 lr:0.0000100 epoch_Time:27582.0min: [2024-01-03 17:08:05,544][model8_pretrain.py][INFO] Epoch:[0/2](222900/4588595) loss:3.047 lr:0.0000100 epoch_Time:27582.0min: [2024-01-03 17:08:05,544][model8_pretrain.py][INFO] 
Epoch:[0/2](222900/4588595) loss:2.823 lr:0.0000100 epoch_Time:27582.0min: [2024-01-03 17:08:05,545][model8_pretrain.py][INFO] Epoch:[0/2](222900/4588595) loss:3.234 lr:0.0000100 epoch_Time:27582.0min: [2024-01-03 17:08:05,545][model8_pretrain.py][INFO] Epoch:[0/2](222900/4588595) loss:2.620 lr:0.0000100 epoch_Time:27582.0min: [2024-01-03 17:08:42,482][model8_pretrain.py][INFO] Epoch:[0/2](223000/4588595) loss:2.648 lr:0.0000100 epoch_Time:27582.0min: [2024-01-03 17:08:42,482][model8_pretrain.py][INFO] Epoch:[0/2](223000/4588595) loss:3.151 lr:0.0000100 epoch_Time:27582.0min: [2024-01-03 17:08:42,482][model8_pretrain.py][INFO] Epoch:[0/2](223000/4588595) loss:2.796 lr:0.0000100 epoch_Time:27582.0min: [2024-01-03 17:08:42,482][model8_pretrain.py][INFO] Epoch:[0/2](223000/4588595) loss:2.741 lr:0.0000100 epoch_Time:27582.0min: [2024-01-03 17:08:42,482][model8_pretrain.py][INFO] Epoch:[0/2](223000/4588595) loss:2.916 lr:0.0000100 epoch_Time:27582.0min: [2024-01-03 17:08:42,482][model8_pretrain.py][INFO] Epoch:[0/2](223000/4588595) loss:3.005 lr:0.0000100 epoch_Time:27582.0min: [2024-01-03 17:08:42,482][model8_pretrain.py][INFO] Epoch:[0/2](223000/4588595) loss:3.079 lr:0.0000100 epoch_Time:27582.0min: [2024-01-03 17:08:42,482][model8_pretrain.py][INFO] Epoch:[0/2](223000/4588595) loss:3.092 lr:0.0000100 epoch_Time:27582.0min: [2024-01-03 17:09:28,404][model8_pretrain.py][INFO] Epoch:[0/2](223100/4588595) loss:3.154 lr:0.0000100 epoch_Time:27584.0min: [2024-01-03 17:09:28,404][model8_pretrain.py][INFO] Epoch:[0/2](223100/4588595) loss:2.756 lr:0.0000100 epoch_Time:27584.0min: [2024-01-03 17:09:28,404][model8_pretrain.py][INFO] Epoch:[0/2](223100/4588595) loss:3.239 lr:0.0000100 epoch_Time:27584.0min: [2024-01-03 17:09:28,404][model8_pretrain.py][INFO] Epoch:[0/2](223100/4588595) loss:2.900 lr:0.0000100 epoch_Time:27584.0min: [2024-01-03 17:09:28,404][model8_pretrain.py][INFO] Epoch:[0/2](223100/4588595) loss:3.022 lr:0.0000100 epoch_Time:27584.0min: [2024-01-03 
17:09:28,404][model8_pretrain.py][INFO] Epoch:[0/2](223100/4588595) loss:2.717 lr:0.0000100 epoch_Time:27584.0min: [2024-01-03 17:09:28,404][model8_pretrain.py][INFO] Epoch:[0/2](223100/4588595) loss:2.523 lr:0.0000100 epoch_Time:27584.0min: [2024-01-03 17:09:28,404][model8_pretrain.py][INFO] Epoch:[0/2](223100/4588595) loss:2.762 lr:0.0000100 epoch_Time:27584.0min: [2024-01-03 17:10:05,331][model8_pretrain.py][INFO] Epoch:[0/2](223200/4588595) loss:2.758 lr:0.0000100 epoch_Time:27583.0min: [2024-01-03 17:10:05,331][model8_pretrain.py][INFO] Epoch:[0/2](223200/4588595) loss:2.843 lr:0.0000100 epoch_Time:27583.0min: [2024-01-03 17:10:05,331][model8_pretrain.py][INFO] Epoch:[0/2](223200/4588595) loss:3.124 lr:0.0000100 epoch_Time:27583.0min: [2024-01-03 17:10:05,331][model8_pretrain.py][INFO] Epoch:[0/2](223200/4588595) loss:2.531 lr:0.0000100 epoch_Time:27583.0min: [2024-01-03 17:10:05,331][model8_pretrain.py][INFO] Epoch:[0/2](223200/4588595) loss:3.324 lr:0.0000100 epoch_Time:27583.0min: [2024-01-03 17:10:05,331][model8_pretrain.py][INFO] Epoch:[0/2](223200/4588595) loss:3.113 lr:0.0000100 epoch_Time:27583.0min: [2024-01-03 17:10:05,331][model8_pretrain.py][INFO] Epoch:[0/2](223200/4588595) loss:2.818 lr:0.0000100 epoch_Time:27583.0min: [2024-01-03 17:10:05,331][model8_pretrain.py][INFO] Epoch:[0/2](223200/4588595) loss:3.279 lr:0.0000100 epoch_Time:27583.0min: [2024-01-03 17:10:42,258][model8_pretrain.py][INFO] Epoch:[0/2](223300/4588595) loss:2.937 lr:0.0000100 epoch_Time:27582.0min: [2024-01-03 17:10:42,258][model8_pretrain.py][INFO] Epoch:[0/2](223300/4588595) loss:3.148 lr:0.0000100 epoch_Time:27582.0min: [2024-01-03 17:10:42,258][model8_pretrain.py][INFO] Epoch:[0/2](223300/4588595) loss:3.085 lr:0.0000100 epoch_Time:27582.0min: [2024-01-03 17:10:42,258][model8_pretrain.py][INFO] Epoch:[0/2](223300/4588595) loss:3.004 lr:0.0000100 epoch_Time:27582.0min: [2024-01-03 17:10:42,258][model8_pretrain.py][INFO] Epoch:[0/2](223300/4588595) loss:3.198 lr:0.0000100 
epoch_Time:27582.0min: [2024-01-03 17:10:42,258][model8_pretrain.py][INFO] Epoch:[0/2](223300/4588595) loss:3.208 lr:0.0000100 epoch_Time:27582.0min: [2024-01-03 17:10:42,258][model8_pretrain.py][INFO] Epoch:[0/2](223300/4588595) loss:2.476 lr:0.0000100 epoch_Time:27582.0min: [2024-01-03 17:10:42,259][model8_pretrain.py][INFO] Epoch:[0/2](223300/4588595) loss:3.321 lr:0.0000100 epoch_Time:27582.0min: [2024-01-03 17:11:19,185][model8_pretrain.py][INFO] Epoch:[0/2](223400/4588595) loss:3.055 lr:0.0000100 epoch_Time:27581.0min: [2024-01-03 17:11:19,185][model8_pretrain.py][INFO] Epoch:[0/2](223400/4588595) loss:2.684 lr:0.0000100 epoch_Time:27581.0min: [2024-01-03 17:11:19,185][model8_pretrain.py][INFO] Epoch:[0/2](223400/4588595) loss:2.695 lr:0.0000100 epoch_Time:27581.0min: [2024-01-03 17:11:19,185][model8_pretrain.py][INFO] Epoch:[0/2](223400/4588595) loss:3.076 lr:0.0000100 epoch_Time:27581.0min: [2024-01-03 17:11:19,185][model8_pretrain.py][INFO] Epoch:[0/2](223400/4588595) loss:2.611 lr:0.0000100 epoch_Time:27581.0min: [2024-01-03 17:11:19,186][model8_pretrain.py][INFO] Epoch:[0/2](223400/4588595) loss:2.720 lr:0.0000100 epoch_Time:27581.0min: [2024-01-03 17:11:19,186][model8_pretrain.py][INFO] Epoch:[0/2](223400/4588595) loss:2.898 lr:0.0000100 epoch_Time:27581.0min: [2024-01-03 17:11:19,186][model8_pretrain.py][INFO] Epoch:[0/2](223400/4588595) loss:2.266 lr:0.0000100 epoch_Time:27581.0min: [2024-01-03 17:11:56,125][model8_pretrain.py][INFO] Epoch:[0/2](223500/4588595) loss:3.151 lr:0.0000100 epoch_Time:27580.0min: [2024-01-03 17:11:56,125][model8_pretrain.py][INFO] Epoch:[0/2](223500/4588595) loss:2.404 lr:0.0000100 epoch_Time:27580.0min: [2024-01-03 17:11:56,125][model8_pretrain.py][INFO] Epoch:[0/2](223500/4588595) loss:3.121 lr:0.0000100 epoch_Time:27580.0min: [2024-01-03 17:11:56,125][model8_pretrain.py][INFO] Epoch:[0/2](223500/4588595) loss:2.951 lr:0.0000100 epoch_Time:27580.0min: [2024-01-03 17:11:56,125][model8_pretrain.py][INFO] 
Epoch:[0/2](223500/4588595) loss:3.050 lr:0.0000100 epoch_Time:27580.0min: [2024-01-03 17:11:56,125][model8_pretrain.py][INFO] Epoch:[0/2](223500/4588595) loss:3.388 lr:0.0000100 epoch_Time:27580.0min: [2024-01-03 17:11:56,125][model8_pretrain.py][INFO] Epoch:[0/2](223500/4588595) loss:2.901 lr:0.0000100 epoch_Time:27580.0min: [2024-01-03 17:11:56,126][model8_pretrain.py][INFO] Epoch:[0/2](223500/4588595) loss:2.815 lr:0.0000100 epoch_Time:27580.0min: [2024-01-03 17:12:33,059][model8_pretrain.py][INFO] Epoch:[0/2](223600/4588595) loss:2.399 lr:0.0000100 epoch_Time:27579.0min: [2024-01-03 17:12:33,059][model8_pretrain.py][INFO] Epoch:[0/2](223600/4588595) loss:2.871 lr:0.0000100 epoch_Time:27579.0min: [2024-01-03 17:12:33,059][model8_pretrain.py][INFO] Epoch:[0/2](223600/4588595) loss:2.139 lr:0.0000100 epoch_Time:27579.0min: [2024-01-03 17:12:33,059][model8_pretrain.py][INFO] Epoch:[0/2](223600/4588595) loss:3.098 lr:0.0000100 epoch_Time:27579.0min: [2024-01-03 17:12:33,059][model8_pretrain.py][INFO] Epoch:[0/2](223600/4588595) loss:2.898 lr:0.0000100 epoch_Time:27579.0min: [2024-01-03 17:12:33,059][model8_pretrain.py][INFO] Epoch:[0/2](223600/4588595) loss:2.988 lr:0.0000100 epoch_Time:27579.0min: [2024-01-03 17:12:33,060][model8_pretrain.py][INFO] Epoch:[0/2](223600/4588595) loss:2.924 lr:0.0000100 epoch_Time:27579.0min: [2024-01-03 17:12:33,060][model8_pretrain.py][INFO] Epoch:[0/2](223600/4588595) loss:2.588 lr:0.0000100 epoch_Time:27579.0min: [2024-01-03 17:13:09,993][model8_pretrain.py][INFO] Epoch:[0/2](223700/4588595) loss:3.265 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:13:09,993][model8_pretrain.py][INFO] Epoch:[0/2](223700/4588595) loss:3.239 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:13:09,993][model8_pretrain.py][INFO] Epoch:[0/2](223700/4588595) loss:2.753 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:13:09,993][model8_pretrain.py][INFO] Epoch:[0/2](223700/4588595) loss:3.064 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 
17:13:09,993][model8_pretrain.py][INFO] Epoch:[0/2](223700/4588595) loss:2.901 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:13:09,993][model8_pretrain.py][INFO] Epoch:[0/2](223700/4588595) loss:3.129 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:13:09,993][model8_pretrain.py][INFO] Epoch:[0/2](223700/4588595) loss:2.869 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:13:09,993][model8_pretrain.py][INFO] Epoch:[0/2](223700/4588595) loss:3.130 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:13:46,935][model8_pretrain.py][INFO] Epoch:[0/2](223800/4588595) loss:3.348 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:13:46,935][model8_pretrain.py][INFO] Epoch:[0/2](223800/4588595) loss:2.952 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:13:46,935][model8_pretrain.py][INFO] Epoch:[0/2](223800/4588595) loss:2.584 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:13:46,935][model8_pretrain.py][INFO] Epoch:[0/2](223800/4588595) loss:2.999 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:13:46,935][model8_pretrain.py][INFO] Epoch:[0/2](223800/4588595) loss:3.060 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:13:46,935][model8_pretrain.py][INFO] Epoch:[0/2](223800/4588595) loss:2.923 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:13:46,935][model8_pretrain.py][INFO] Epoch:[0/2](223800/4588595) loss:2.807 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:13:46,935][model8_pretrain.py][INFO] Epoch:[0/2](223800/4588595) loss:2.432 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:14:32,570][model8_pretrain.py][INFO] Epoch:[0/2](223900/4588595) loss:3.199 lr:0.0000100 epoch_Time:27579.0min: [2024-01-03 17:14:32,570][model8_pretrain.py][INFO] Epoch:[0/2](223900/4588595) loss:2.962 lr:0.0000100 epoch_Time:27579.0min: [2024-01-03 17:14:32,570][model8_pretrain.py][INFO] Epoch:[0/2](223900/4588595) loss:2.637 lr:0.0000100 epoch_Time:27579.0min: [2024-01-03 17:14:32,570][model8_pretrain.py][INFO] Epoch:[0/2](223900/4588595) loss:3.177 lr:0.0000100 
epoch_Time:27579.0min: [2024-01-03 17:14:32,570][model8_pretrain.py][INFO] Epoch:[0/2](223900/4588595) loss:2.394 lr:0.0000100 epoch_Time:27579.0min: [2024-01-03 17:14:32,570][model8_pretrain.py][INFO] Epoch:[0/2](223900/4588595) loss:2.687 lr:0.0000100 epoch_Time:27579.0min: [2024-01-03 17:14:32,570][model8_pretrain.py][INFO] Epoch:[0/2](223900/4588595) loss:3.267 lr:0.0000100 epoch_Time:27579.0min: [2024-01-03 17:14:32,570][model8_pretrain.py][INFO] Epoch:[0/2](223900/4588595) loss:2.580 lr:0.0000100 epoch_Time:27579.0min: [2024-01-03 17:15:09,499][model8_pretrain.py][INFO] Epoch:[0/2](224000/4588595) loss:3.251 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:15:09,499][model8_pretrain.py][INFO] Epoch:[0/2](224000/4588595) loss:2.880 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:15:09,499][model8_pretrain.py][INFO] Epoch:[0/2](224000/4588595) loss:3.214 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:15:09,499][model8_pretrain.py][INFO] Epoch:[0/2](224000/4588595) loss:2.761 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:15:09,499][model8_pretrain.py][INFO] Epoch:[0/2](224000/4588595) loss:2.433 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:15:09,499][model8_pretrain.py][INFO] Epoch:[0/2](224000/4588595) loss:2.712 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:15:09,500][model8_pretrain.py][INFO] Epoch:[0/2](224000/4588595) loss:2.999 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:15:09,500][model8_pretrain.py][INFO] Epoch:[0/2](224000/4588595) loss:2.900 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:15:46,431][model8_pretrain.py][INFO] Epoch:[0/2](224100/4588595) loss:2.841 lr:0.0000100 epoch_Time:27577.0min: [2024-01-03 17:15:46,431][model8_pretrain.py][INFO] Epoch:[0/2](224100/4588595) loss:2.749 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:15:46,431][model8_pretrain.py][INFO] Epoch:[0/2](224100/4588595) loss:3.321 lr:0.0000100 epoch_Time:27577.0min: [2024-01-03 17:15:46,431][model8_pretrain.py][INFO] 
Epoch:[0/2](224100/4588595) loss:3.012 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:15:46,431][model8_pretrain.py][INFO] Epoch:[0/2](224100/4588595) loss:2.919 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:15:46,431][model8_pretrain.py][INFO] Epoch:[0/2](224100/4588595) loss:3.347 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:15:46,431][model8_pretrain.py][INFO] Epoch:[0/2](224100/4588595) loss:3.178 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:15:46,431][model8_pretrain.py][INFO] Epoch:[0/2](224100/4588595) loss:2.927 lr:0.0000100 epoch_Time:27578.0min: [2024-01-03 17:16:23,361][model8_pretrain.py][INFO] Epoch:[0/2](224200/4588595) loss:3.083 lr:0.0000100 epoch_Time:27576.0min: [2024-01-03 17:16:23,361][model8_pretrain.py][INFO] Epoch:[0/2](224200/4588595) loss:2.753 lr:0.0000100 epoch_Time:27576.0min: [2024-01-03 17:16:23,361][model8_pretrain.py][INFO] Epoch:[0/2](224200/4588595) loss:2.745 lr:0.0000100 epoch_Time:27576.0min: [2024-01-03 17:16:23,361][model8_pretrain.py][INFO] Epoch:[0/2](224200/4588595) loss:3.018 lr:0.0000100 epoch_Time:27576.0min: [2024-01-03 17:16:23,361][model8_pretrain.py][INFO] Epoch:[0/2](224200/4588595) loss:3.335 lr:0.0000100 epoch_Time:27576.0min: [2024-01-03 17:16:23,361][model8_pretrain.py][INFO] Epoch:[0/2](224200/4588595) loss:3.373 lr:0.0000100 epoch_Time:27576.0min: [2024-01-03 17:16:23,361][model8_pretrain.py][INFO] Epoch:[0/2](224200/4588595) loss:2.859 lr:0.0000100 epoch_Time:27576.0min: [2024-01-03 17:16:23,361][model8_pretrain.py][INFO] Epoch:[0/2](224200/4588595) loss:3.014 lr:0.0000100 epoch_Time:27576.0min: [2024-01-03 17:17:00,302][model8_pretrain.py][INFO] Epoch:[0/2](224300/4588595) loss:3.376 lr:0.0000100 epoch_Time:27575.0min: [2024-01-03 17:17:00,302][model8_pretrain.py][INFO] Epoch:[0/2](224300/4588595) loss:2.934 lr:0.0000100 epoch_Time:27575.0min: [2024-01-03 17:17:00,302][model8_pretrain.py][INFO] Epoch:[0/2](224300/4588595) loss:2.724 lr:0.0000100 epoch_Time:27575.0min: [2024-01-03 
17:17:00,302][model8_pretrain.py][INFO] Epoch:[0/2](224300/4588595) loss:2.576 lr:0.0000100 epoch_Time:27575.0min: [2024-01-03 17:17:00,302][model8_pretrain.py][INFO] Epoch:[0/2](224300/4588595) loss:2.889 lr:0.0000100 epoch_Time:27575.0min: [2024-01-03 17:17:00,302][model8_pretrain.py][INFO] Epoch:[0/2](224300/4588595) loss:2.572 lr:0.0000100 epoch_Time:27575.0min: [2024-01-03 17:17:00,303][model8_pretrain.py][INFO] Epoch:[0/2](224300/4588595) loss:3.567 lr:0.0000100 epoch_Time:27575.0min: [2024-01-03 17:17:00,303][model8_pretrain.py][INFO] Epoch:[0/2](224300/4588595) loss:2.406 lr:0.0000100 epoch_Time:27575.0min: [2024-01-03 17:17:37,251][model8_pretrain.py][INFO] Epoch:[0/2](224400/4588595) loss:3.324 lr:0.0000100 epoch_Time:27575.0min: [2024-01-03 17:17:37,251][model8_pretrain.py][INFO] Epoch:[0/2](224400/4588595) loss:2.609 lr:0.0000100 epoch_Time:27575.0min: [2024-01-03 17:17:37,251][model8_pretrain.py][INFO] Epoch:[0/2](224400/4588595) loss:2.737 lr:0.0000100 epoch_Time:27575.0min: [2024-01-03 17:17:37,251][model8_pretrain.py][INFO] Epoch:[0/2](224400/4588595) loss:2.751 lr:0.0000100 epoch_Time:27575.0min: [2024-01-03 17:17:37,251][model8_pretrain.py][INFO] Epoch:[0/2](224400/4588595) loss:3.128 lr:0.0000100 epoch_Time:27575.0min: [2024-01-03 17:17:37,251][model8_pretrain.py][INFO] Epoch:[0/2](224400/4588595) loss:3.137 lr:0.0000100 epoch_Time:27575.0min: [2024-01-03 17:17:37,251][model8_pretrain.py][INFO] Epoch:[0/2](224400/4588595) loss:2.969 lr:0.0000100 epoch_Time:27575.0min: [2024-01-03 17:17:37,251][model8_pretrain.py][INFO] Epoch:[0/2](224400/4588595) loss:3.022 lr:0.0000100 epoch_Time:27575.0min: [2024-01-03 17:18:14,190][model8_pretrain.py][INFO] Epoch:[0/2](224500/4588595) loss:2.769 lr:0.0000100 epoch_Time:27573.0min: [2024-01-03 17:18:14,190][model8_pretrain.py][INFO] Epoch:[0/2](224500/4588595) loss:2.732 lr:0.0000100 epoch_Time:27573.0min: [2024-01-03 17:18:14,190][model8_pretrain.py][INFO] Epoch:[0/2](224500/4588595) loss:3.466 lr:0.0000100 
epoch_Time:27573.0min: [2024-01-03 17:18:14,190][model8_pretrain.py][INFO] Epoch:[0/2](224500/4588595) loss:3.448 lr:0.0000100 epoch_Time:27573.0min: [2024-01-03 17:18:14,190][model8_pretrain.py][INFO] Epoch:[0/2](224500/4588595) loss:3.158 lr:0.0000100 epoch_Time:27573.0min: [2024-01-03 17:18:14,190][model8_pretrain.py][INFO] Epoch:[0/2](224500/4588595) loss:2.890 lr:0.0000100 epoch_Time:27573.0min: [2024-01-03 17:18:14,190][model8_pretrain.py][INFO] Epoch:[0/2](224500/4588595) loss:2.643 lr:0.0000100 epoch_Time:27573.0min: [2024-01-03 17:18:14,190][model8_pretrain.py][INFO] Epoch:[0/2](224500/4588595) loss:3.601 lr:0.0000100 epoch_Time:27573.0min: [2024-01-03 17:18:51,128][model8_pretrain.py][INFO] Epoch:[0/2](224600/4588595) loss:3.022 lr:0.0000100 epoch_Time:27572.0min: [2024-01-03 17:18:51,128][model8_pretrain.py][INFO] Epoch:[0/2](224600/4588595) loss:2.996 lr:0.0000100 epoch_Time:27572.0min: [2024-01-03 17:18:51,128][model8_pretrain.py][INFO] Epoch:[0/2](224600/4588595) loss:3.335 lr:0.0000100 epoch_Time:27572.0min: [2024-01-03 17:18:51,128][model8_pretrain.py][INFO] Epoch:[0/2](224600/4588595) loss:2.911 lr:0.0000100 epoch_Time:27572.0min: [2024-01-03 17:18:51,128][model8_pretrain.py][INFO] Epoch:[0/2](224600/4588595) loss:2.942 lr:0.0000100 epoch_Time:27572.0min: [2024-01-03 17:18:51,128][model8_pretrain.py][INFO] Epoch:[0/2](224600/4588595) loss:2.561 lr:0.0000100 epoch_Time:27572.0min: [2024-01-03 17:18:51,128][model8_pretrain.py][INFO] Epoch:[0/2](224600/4588595) loss:2.981 lr:0.0000100 epoch_Time:27572.0min: [2024-01-03 17:18:51,128][model8_pretrain.py][INFO] Epoch:[0/2](224600/4588595) loss:2.933 lr:0.0000100 epoch_Time:27572.0min: [2024-01-03 17:19:36,810][model8_pretrain.py][INFO] Epoch:[0/2](224700/4588595) loss:3.250 lr:0.0000100 epoch_Time:27574.0min: [2024-01-03 17:19:36,810][model8_pretrain.py][INFO] Epoch:[0/2](224700/4588595) loss:3.142 lr:0.0000100 epoch_Time:27574.0min: [2024-01-03 17:19:36,810][model8_pretrain.py][INFO] 
Epoch:[0/2](224700/4588595) loss:3.045 lr:0.0000100 epoch_Time:27574.0min: [2024-01-03 17:19:36,810][model8_pretrain.py][INFO] Epoch:[0/2](224700/4588595) loss:2.944 lr:0.0000100 epoch_Time:27575.0min: [2024-01-03 17:19:36,810][model8_pretrain.py][INFO] Epoch:[0/2](224700/4588595) loss:2.743 lr:0.0000100 epoch_Time:27574.0min: [2024-01-03 17:19:36,810][model8_pretrain.py][INFO] Epoch:[0/2](224700/4588595) loss:2.998 lr:0.0000100 epoch_Time:27574.0min: [2024-01-03 17:19:36,810][model8_pretrain.py][INFO] Epoch:[0/2](224700/4588595) loss:3.203 lr:0.0000100 epoch_Time:27575.0min: [2024-01-03 17:19:36,811][model8_pretrain.py][INFO] Epoch:[0/2](224700/4588595) loss:3.212 lr:0.0000100 epoch_Time:27574.0min: [2024-01-03 17:20:13,735][model8_pretrain.py][INFO] Epoch:[0/2](224800/4588595) loss:2.983 lr:0.0000100 epoch_Time:27573.0min: [2024-01-03 17:20:13,735][model8_pretrain.py][INFO] Epoch:[0/2](224800/4588595) loss:2.888 lr:0.0000100 epoch_Time:27573.0min: [2024-01-03 17:20:13,735][model8_pretrain.py][INFO] Epoch:[0/2](224800/4588595) loss:3.120 lr:0.0000100 epoch_Time:27573.0min: [2024-01-03 17:20:13,735][model8_pretrain.py][INFO] Epoch:[0/2](224800/4588595) loss:2.841 lr:0.0000100 epoch_Time:27573.0min: [2024-01-03 17:20:13,735][model8_pretrain.py][INFO] Epoch:[0/2](224800/4588595) loss:3.106 lr:0.0000100 epoch_Time:27573.0min: [2024-01-03 17:20:13,735][model8_pretrain.py][INFO] Epoch:[0/2](224800/4588595) loss:2.926 lr:0.0000100 epoch_Time:27573.0min: [2024-01-03 17:20:13,735][model8_pretrain.py][INFO] Epoch:[0/2](224800/4588595) loss:2.673 lr:0.0000100 epoch_Time:27573.0min: [2024-01-03 17:20:13,735][model8_pretrain.py][INFO] Epoch:[0/2](224800/4588595) loss:2.704 lr:0.0000100 epoch_Time:27573.0min: [2024-01-03 17:20:50,689][model8_pretrain.py][INFO] Epoch:[0/2](224900/4588595) loss:2.914 lr:0.0000100 epoch_Time:27572.0min: [2024-01-03 17:20:50,689][model8_pretrain.py][INFO] Epoch:[0/2](224900/4588595) loss:2.876 lr:0.0000100 epoch_Time:27572.0min: [2024-01-03 
17:20:50,690][model8_pretrain.py][INFO] Epoch:[0/2](224900/4588595) loss:3.091 lr:0.0000100 epoch_Time:27572.0min: [2024-01-03 17:20:50,690][model8_pretrain.py][INFO] Epoch:[0/2](224900/4588595) loss:2.489 lr:0.0000100 epoch_Time:27572.0min: [2024-01-03 17:20:50,689][model8_pretrain.py][INFO] Epoch:[0/2](224900/4588595) loss:2.304 lr:0.0000100 epoch_Time:27572.0min: [2024-01-03 17:20:50,690][model8_pretrain.py][INFO] Epoch:[0/2](224900/4588595) loss:2.661 lr:0.0000100 epoch_Time:27572.0min: [2024-01-03 17:20:50,690][model8_pretrain.py][INFO] Epoch:[0/2](224900/4588595) loss:2.698 lr:0.0000100 epoch_Time:27572.0min: [2024-01-03 17:20:50,690][model8_pretrain.py][INFO] Epoch:[0/2](224900/4588595) loss:3.174 lr:0.0000100 epoch_Time:27572.0min: [2024-01-03 17:21:27,648][model8_pretrain.py][INFO] Epoch:[0/2](225000/4588595) loss:2.809 lr:0.0000100 epoch_Time:27572.0min: [2024-01-03 17:21:27,648][model8_pretrain.py][INFO] Epoch:[0/2](225000/4588595) loss:2.936 lr:0.0000100 epoch_Time:27572.0min: [2024-01-03 17:21:27,648][model8_pretrain.py][INFO] Epoch:[0/2](225000/4588595) loss:2.921 lr:0.0000100 epoch_Time:27572.0min: [2024-01-03 17:21:27,648][model8_pretrain.py][INFO] Epoch:[0/2](225000/4588595) loss:2.875 lr:0.0000100 epoch_Time:27572.0min: [2024-01-03 17:21:27,648][model8_pretrain.py][INFO] Epoch:[0/2](225000/4588595) loss:2.938 lr:0.0000100 epoch_Time:27572.0min: [2024-01-03 17:21:27,648][model8_pretrain.py][INFO] Epoch:[0/2](225000/4588595) loss:2.336 lr:0.0000100 epoch_Time:27572.0min: [2024-01-03 17:21:27,648][model8_pretrain.py][INFO] Epoch:[0/2](225000/4588595) loss:3.043 lr:0.0000100 epoch_Time:27572.0min: [2024-01-03 17:21:27,649][model8_pretrain.py][INFO] Epoch:[0/2](225000/4588595) loss:2.838 lr:0.0000100 epoch_Time:27572.0min: [2024-01-03 17:22:04,612][model8_pretrain.py][INFO] Epoch:[0/2](225100/4588595) loss:2.902 lr:0.0000100 epoch_Time:27570.0min: [2024-01-03 17:22:04,612][model8_pretrain.py][INFO] Epoch:[0/2](225100/4588595) loss:2.803 lr:0.0000100 
epoch_Time:27570.0min: [2024-01-03 17:22:04,612][model8_pretrain.py][INFO] Epoch:[0/2](225100/4588595) loss:2.570 lr:0.0000100 epoch_Time:27570.0min: [2024-01-03 17:22:04,612][model8_pretrain.py][INFO] Epoch:[0/2](225100/4588595) loss:2.885 lr:0.0000100 epoch_Time:27570.0min: [2024-01-03 17:22:04,612][model8_pretrain.py][INFO] Epoch:[0/2](225100/4588595) loss:2.972 lr:0.0000100 epoch_Time:27570.0min: [2024-01-03 17:22:04,612][model8_pretrain.py][INFO] Epoch:[0/2](225100/4588595) loss:2.568 lr:0.0000100 epoch_Time:27570.0min: [2024-01-03 17:22:04,612][model8_pretrain.py][INFO] Epoch:[0/2](225100/4588595) loss:2.594 lr:0.0000100 epoch_Time:27570.0min: [2024-01-03 17:22:04,612][model8_pretrain.py][INFO] Epoch:[0/2](225100/4588595) loss:3.250 lr:0.0000100 epoch_Time:27570.0min: [2024-01-03 17:22:41,575][model8_pretrain.py][INFO] Epoch:[0/2](225200/4588595) loss:2.363 lr:0.0000100 epoch_Time:27570.0min: [2024-01-03 17:22:41,575][model8_pretrain.py][INFO] Epoch:[0/2](225200/4588595) loss:2.897 lr:0.0000100 epoch_Time:27570.0min: [2024-01-03 17:22:41,576][model8_pretrain.py][INFO] Epoch:[0/2](225200/4588595) loss:2.789 lr:0.0000100 epoch_Time:27570.0min: [2024-01-03 17:22:41,576][model8_pretrain.py][INFO] Epoch:[0/2](225200/4588595) loss:2.929 lr:0.0000100 epoch_Time:27570.0min: [2024-01-03 17:22:41,576][model8_pretrain.py][INFO] Epoch:[0/2](225200/4588595) loss:2.839 lr:0.0000100 epoch_Time:27570.0min: [2024-01-03 17:22:41,576][model8_pretrain.py][INFO] Epoch:[0/2](225200/4588595) loss:2.540 lr:0.0000100 epoch_Time:27570.0min: [2024-01-03 17:22:41,576][model8_pretrain.py][INFO] Epoch:[0/2](225200/4588595) loss:2.781 lr:0.0000100 epoch_Time:27570.0min: [2024-01-03 17:22:41,576][model8_pretrain.py][INFO] Epoch:[0/2](225200/4588595) loss:3.043 lr:0.0000100 epoch_Time:27570.0min: [2024-01-03 17:23:18,535][model8_pretrain.py][INFO] Epoch:[0/2](225300/4588595) loss:3.263 lr:0.0000100 epoch_Time:27569.0min: [2024-01-03 17:23:18,535][model8_pretrain.py][INFO] 
Epoch:[0/2](225300/4588595) loss:2.758 lr:0.0000100 epoch_Time:27569.0min: [2024-01-03 17:23:18,535][model8_pretrain.py][INFO] Epoch:[0/2](225300/4588595) loss:2.799 lr:0.0000100 epoch_Time:27569.0min: [2024-01-03 17:23:18,535][model8_pretrain.py][INFO] Epoch:[0/2](225300/4588595) loss:2.733 lr:0.0000100 epoch_Time:27569.0min: [2024-01-03 17:23:18,535][model8_pretrain.py][INFO] Epoch:[0/2](225300/4588595) loss:2.782 lr:0.0000100 epoch_Time:27569.0min: [2024-01-03 17:23:18,535][model8_pretrain.py][INFO] Epoch:[0/2](225300/4588595) loss:2.834 lr:0.0000100 epoch_Time:27569.0min: [2024-01-03 17:23:18,535][model8_pretrain.py][INFO] Epoch:[0/2](225300/4588595) loss:2.844 lr:0.0000100 epoch_Time:27569.0min: [2024-01-03 17:23:18,535][model8_pretrain.py][INFO] Epoch:[0/2](225300/4588595) loss:3.029 lr:0.0000100 epoch_Time:27569.0min: [2024-01-03 17:23:55,497][model8_pretrain.py][INFO] Epoch:[0/2](225400/4588595) loss:3.081 lr:0.0000100 epoch_Time:27567.0min: [2024-01-03 17:23:55,497][model8_pretrain.py][INFO] Epoch:[0/2](225400/4588595) loss:3.107 lr:0.0000100 epoch_Time:27567.0min: [2024-01-03 17:23:55,497][model8_pretrain.py][INFO] Epoch:[0/2](225400/4588595) loss:2.765 lr:0.0000100 epoch_Time:27567.0min: [2024-01-03 17:23:55,497][model8_pretrain.py][INFO] Epoch:[0/2](225400/4588595) loss:2.852 lr:0.0000100 epoch_Time:27567.0min: [2024-01-03 17:23:55,497][model8_pretrain.py][INFO] Epoch:[0/2](225400/4588595) loss:2.899 lr:0.0000100 epoch_Time:27567.0min: [2024-01-03 17:23:55,497][model8_pretrain.py][INFO] Epoch:[0/2](225400/4588595) loss:2.475 lr:0.0000100 epoch_Time:27567.0min: [2024-01-03 17:23:55,497][model8_pretrain.py][INFO] Epoch:[0/2](225400/4588595) loss:2.520 lr:0.0000100 epoch_Time:27567.0min: [2024-01-03 17:23:55,497][model8_pretrain.py][INFO] Epoch:[0/2](225400/4588595) loss:2.963 lr:0.0000100 epoch_Time:27567.0min: [2024-01-03 17:24:41,215][model8_pretrain.py][INFO] Epoch:[0/2](225500/4588595) loss:2.934 lr:0.0000100 epoch_Time:27570.0min: [2024-01-03 
17:24:41,215][model8_pretrain.py][INFO] Epoch:[0/2](225500/4588595) loss:3.261 lr:0.0000100 epoch_Time:27570.0min: [2024-01-03 17:24:41,215][model8_pretrain.py][INFO] Epoch:[0/2](225500/4588595) loss:2.961 lr:0.0000100 epoch_Time:27570.0min: [2024-01-03 17:24:41,215][model8_pretrain.py][INFO] Epoch:[0/2](225500/4588595) loss:2.439 lr:0.0000100 epoch_Time:27570.0min: [2024-01-03 17:24:41,215][model8_pretrain.py][INFO] Epoch:[0/2](225500/4588595) loss:3.198 lr:0.0000100 epoch_Time:27570.0min: [2024-01-03 17:24:41,215][model8_pretrain.py][INFO] Epoch:[0/2](225500/4588595) loss:3.031 lr:0.0000100 epoch_Time:27570.0min: [2024-01-03 17:24:41,215][model8_pretrain.py][INFO] Epoch:[0/2](225500/4588595) loss:2.818 lr:0.0000100 epoch_Time:27570.0min: [2024-01-03 17:24:41,215][model8_pretrain.py][INFO] Epoch:[0/2](225500/4588595) loss:3.316 lr:0.0000100 epoch_Time:27570.0min: [2024-01-03 17:25:18,147][model8_pretrain.py][INFO] Epoch:[0/2](225600/4588595) loss:2.666 lr:0.0000100 epoch_Time:27569.0min: [2024-01-03 17:25:18,147][model8_pretrain.py][INFO] Epoch:[0/2](225600/4588595) loss:2.737 lr:0.0000100 epoch_Time:27569.0min: [2024-01-03 17:25:18,147][model8_pretrain.py][INFO] Epoch:[0/2](225600/4588595) loss:3.475 lr:0.0000100 epoch_Time:27569.0min: [2024-01-03 17:25:18,147][model8_pretrain.py][INFO] Epoch:[0/2](225600/4588595) loss:3.399 lr:0.0000100 epoch_Time:27569.0min: [2024-01-03 17:25:18,147][model8_pretrain.py][INFO] Epoch:[0/2](225600/4588595) loss:2.794 lr:0.0000100 epoch_Time:27569.0min: [2024-01-03 17:25:18,147][model8_pretrain.py][INFO] Epoch:[0/2](225600/4588595) loss:3.291 lr:0.0000100 epoch_Time:27569.0min: [2024-01-03 17:25:18,147][model8_pretrain.py][INFO] Epoch:[0/2](225600/4588595) loss:3.127 lr:0.0000100 epoch_Time:27569.0min: [2024-01-03 17:25:18,147][model8_pretrain.py][INFO] Epoch:[0/2](225600/4588595) loss:2.667 lr:0.0000100 epoch_Time:27569.0min: [2024-01-03 17:25:55,083][model8_pretrain.py][INFO] Epoch:[0/2](225700/4588595) loss:2.916 lr:0.0000100 
epoch_Time:27567.0min: [2024-01-03 17:25:55,083][model8_pretrain.py][INFO] Epoch:[0/2](225700/4588595) loss:3.367 lr:0.0000100 epoch_Time:27567.0min: [2024-01-03 17:25:55,083][model8_pretrain.py][INFO] Epoch:[0/2](225700/4588595) loss:2.613 lr:0.0000100 epoch_Time:27567.0min: [2024-01-03 17:25:55,083][model8_pretrain.py][INFO] Epoch:[0/2](225700/4588595) loss:3.093 lr:0.0000100 epoch_Time:27567.0min: [2024-01-03 17:25:55,083][model8_pretrain.py][INFO] Epoch:[0/2](225700/4588595) loss:2.563 lr:0.0000100 epoch_Time:27567.0min: [2024-01-03 17:25:55,083][model8_pretrain.py][INFO] Epoch:[0/2](225700/4588595) loss:3.179 lr:0.0000100 epoch_Time:27567.0min: [2024-01-03 17:25:55,084][model8_pretrain.py][INFO] Epoch:[0/2](225700/4588595) loss:3.314 lr:0.0000100 epoch_Time:27567.0min: [2024-01-03 17:25:55,085][model8_pretrain.py][INFO] Epoch:[0/2](225700/4588595) loss:2.811 lr:0.0000100 epoch_Time:27567.0min: [2024-01-03 17:26:32,034][model8_pretrain.py][INFO] Epoch:[0/2](225800/4588595) loss:3.234 lr:0.0000100 epoch_Time:27567.0min: [2024-01-03 17:26:32,034][model8_pretrain.py][INFO] Epoch:[0/2](225800/4588595) loss:3.191 lr:0.0000100 epoch_Time:27567.0min: [2024-01-03 17:26:32,034][model8_pretrain.py][INFO] Epoch:[0/2](225800/4588595) loss:3.095 lr:0.0000100 epoch_Time:27567.0min: [2024-01-03 17:26:32,034][model8_pretrain.py][INFO] Epoch:[0/2](225800/4588595) loss:3.031 lr:0.0000100 epoch_Time:27567.0min: [2024-01-03 17:26:32,034][model8_pretrain.py][INFO] Epoch:[0/2](225800/4588595) loss:3.028 lr:0.0000100 epoch_Time:27567.0min: [2024-01-03 17:26:32,034][model8_pretrain.py][INFO] Epoch:[0/2](225800/4588595) loss:2.495 lr:0.0000100 epoch_Time:27567.0min: [2024-01-03 17:26:32,034][model8_pretrain.py][INFO] Epoch:[0/2](225800/4588595) loss:3.212 lr:0.0000100 epoch_Time:27567.0min: [2024-01-03 17:26:32,034][model8_pretrain.py][INFO] Epoch:[0/2](225800/4588595) loss:2.780 lr:0.0000100 epoch_Time:27567.0min: [2024-01-03 17:27:08,981][model8_pretrain.py][INFO] 
Epoch:[0/2](225900/4588595) loss:3.064 lr:0.0000100 epoch_Time:27566.0min: [2024-01-03 17:27:08,981][model8_pretrain.py][INFO] Epoch:[0/2](225900/4588595) loss:3.208 lr:0.0000100 epoch_Time:27566.0min: [2024-01-03 17:27:08,981][model8_pretrain.py][INFO] Epoch:[0/2](225900/4588595) loss:3.566 lr:0.0000100 epoch_Time:27566.0min: [2024-01-03 17:27:08,981][model8_pretrain.py][INFO] Epoch:[0/2](225900/4588595) loss:3.316 lr:0.0000100 epoch_Time:27566.0min: [2024-01-03 17:27:08,981][model8_pretrain.py][INFO] Epoch:[0/2](225900/4588595) loss:3.335 lr:0.0000100 epoch_Time:27566.0min: [2024-01-03 17:27:08,981][model8_pretrain.py][INFO] Epoch:[0/2](225900/4588595) loss:2.939 lr:0.0000100 epoch_Time:27566.0min: [2024-01-03 17:27:08,981][model8_pretrain.py][INFO] Epoch:[0/2](225900/4588595) loss:3.066 lr:0.0000100 epoch_Time:27566.0min: [2024-01-03 17:27:08,981][model8_pretrain.py][INFO] Epoch:[0/2](225900/4588595) loss:3.155 lr:0.0000100 epoch_Time:27566.0min: [2024-01-03 17:27:45,923][model8_pretrain.py][INFO] Epoch:[0/2](226000/4588595) loss:3.296 lr:0.0000100 epoch_Time:27565.0min: [2024-01-03 17:27:45,923][model8_pretrain.py][INFO] Epoch:[0/2](226000/4588595) loss:2.883 lr:0.0000100 epoch_Time:27565.0min: [2024-01-03 17:27:45,923][model8_pretrain.py][INFO] Epoch:[0/2](226000/4588595) loss:2.796 lr:0.0000100 epoch_Time:27565.0min: [2024-01-03 17:27:45,923][model8_pretrain.py][INFO] Epoch:[0/2](226000/4588595) loss:3.109 lr:0.0000100 epoch_Time:27565.0min: [2024-01-03 17:27:45,923][model8_pretrain.py][INFO] Epoch:[0/2](226000/4588595) loss:2.481 lr:0.0000100 epoch_Time:27565.0min: [2024-01-03 17:27:45,923][model8_pretrain.py][INFO] Epoch:[0/2](226000/4588595) loss:2.478 lr:0.0000100 epoch_Time:27565.0min: [2024-01-03 17:27:45,923][model8_pretrain.py][INFO] Epoch:[0/2](226000/4588595) loss:2.821 lr:0.0000100 epoch_Time:27565.0min: [2024-01-03 17:27:45,923][model8_pretrain.py][INFO] Epoch:[0/2](226000/4588595) loss:2.967 lr:0.0000100 epoch_Time:27565.0min: [2024-01-03 
17:28:22,872][model8_pretrain.py][INFO] Epoch:[0/2](226100/4588595) loss:2.629 lr:0.0000100 epoch_Time:27564.0min: [2024-01-03 17:28:22,872][model8_pretrain.py][INFO] Epoch:[0/2](226100/4588595) loss:3.165 lr:0.0000100 epoch_Time:27564.0min: [2024-01-03 17:28:22,872][model8_pretrain.py][INFO] Epoch:[0/2](226100/4588595) loss:3.271 lr:0.0000100 epoch_Time:27564.0min: [2024-01-03 17:28:22,872][model8_pretrain.py][INFO] Epoch:[0/2](226100/4588595) loss:2.940 lr:0.0000100 epoch_Time:27564.0min: [2024-01-03 17:28:22,872][model8_pretrain.py][INFO] Epoch:[0/2](226100/4588595) loss:2.547 lr:0.0000100 epoch_Time:27564.0min: [2024-01-03 17:28:22,872][model8_pretrain.py][INFO] Epoch:[0/2](226100/4588595) loss:3.490 lr:0.0000100 epoch_Time:27564.0min: [2024-01-03 17:28:22,872][model8_pretrain.py][INFO] Epoch:[0/2](226100/4588595) loss:3.334 lr:0.0000100 epoch_Time:27564.0min: [2024-01-03 17:28:22,872][model8_pretrain.py][INFO] Epoch:[0/2](226100/4588595) loss:3.497 lr:0.0000100 epoch_Time:27564.0min: [2024-01-03 17:28:59,824][model8_pretrain.py][INFO] Epoch:[0/2](226200/4588595) loss:3.226 lr:0.0000100 epoch_Time:27563.0min: [2024-01-03 17:28:59,824][model8_pretrain.py][INFO] Epoch:[0/2](226200/4588595) loss:3.189 lr:0.0000100 epoch_Time:27563.0min: [2024-01-03 17:28:59,825][model8_pretrain.py][INFO] Epoch:[0/2](226200/4588595) loss:3.384 lr:0.0000100 epoch_Time:27563.0min: [2024-01-03 17:28:59,825][model8_pretrain.py][INFO] Epoch:[0/2](226200/4588595) loss:3.218 lr:0.0000100 epoch_Time:27563.0min: [2024-01-03 17:28:59,825][model8_pretrain.py][INFO] Epoch:[0/2](226200/4588595) loss:3.172 lr:0.0000100 epoch_Time:27563.0min: [2024-01-03 17:28:59,825][model8_pretrain.py][INFO] Epoch:[0/2](226200/4588595) loss:3.394 lr:0.0000100 epoch_Time:27563.0min: [2024-01-03 17:28:59,825][model8_pretrain.py][INFO] Epoch:[0/2](226200/4588595) loss:3.313 lr:0.0000100 epoch_Time:27563.0min: [2024-01-03 17:28:59,825][model8_pretrain.py][INFO] Epoch:[0/2](226200/4588595) loss:3.362 lr:0.0000100 
epoch_Time:27563.0min: [2024-01-03 17:29:45,440][model8_pretrain.py][INFO] Epoch:[0/2](226300/4588595) loss:2.726 lr:0.0000100 epoch_Time:27565.0min: [2024-01-03 17:29:45,440][model8_pretrain.py][INFO] Epoch:[0/2](226300/4588595) loss:3.488 lr:0.0000100 epoch_Time:27565.0min: [2024-01-03 17:29:45,440][model8_pretrain.py][INFO] Epoch:[0/2](226300/4588595) loss:3.012 lr:0.0000100 epoch_Time:27565.0min: [2024-01-03 17:29:45,440][model8_pretrain.py][INFO] Epoch:[0/2](226300/4588595) loss:2.515 lr:0.0000100 epoch_Time:27565.0min: [2024-01-03 17:29:45,440][model8_pretrain.py][INFO] Epoch:[0/2](226300/4588595) loss:3.206 lr:0.0000100 epoch_Time:27565.0min: [2024-01-03 17:29:45,440][model8_pretrain.py][INFO] Epoch:[0/2](226300/4588595) loss:2.956 lr:0.0000100 epoch_Time:27565.0min: [2024-01-03 17:29:45,440][model8_pretrain.py][INFO] Epoch:[0/2](226300/4588595) loss:2.845 lr:0.0000100 epoch_Time:27565.0min: [2024-01-03 17:29:45,440][model8_pretrain.py][INFO] Epoch:[0/2](226300/4588595) loss:2.858 lr:0.0000100 epoch_Time:27565.0min: [2024-01-03 17:30:22,363][model8_pretrain.py][INFO] Epoch:[0/2](226400/4588595) loss:3.081 lr:0.0000100 epoch_Time:27564.0min: [2024-01-03 17:30:22,363][model8_pretrain.py][INFO] Epoch:[0/2](226400/4588595) loss:2.586 lr:0.0000100 epoch_Time:27564.0min: [2024-01-03 17:30:22,363][model8_pretrain.py][INFO] Epoch:[0/2](226400/4588595) loss:2.834 lr:0.0000100 epoch_Time:27564.0min: [2024-01-03 17:30:22,363][model8_pretrain.py][INFO] Epoch:[0/2](226400/4588595) loss:3.178 lr:0.0000100 epoch_Time:27564.0min: [2024-01-03 17:30:22,363][model8_pretrain.py][INFO] Epoch:[0/2](226400/4588595) loss:2.298 lr:0.0000100 epoch_Time:27564.0min: [2024-01-03 17:30:22,363][model8_pretrain.py][INFO] Epoch:[0/2](226400/4588595) loss:2.947 lr:0.0000100 epoch_Time:27564.0min: [2024-01-03 17:30:22,363][model8_pretrain.py][INFO] Epoch:[0/2](226400/4588595) loss:2.931 lr:0.0000100 epoch_Time:27564.0min: [2024-01-03 17:30:22,363][model8_pretrain.py][INFO] 
Epoch:[0/2](226400/4588595) loss:3.258 lr:0.0000100 epoch_Time:27564.0min: [2024-01-03 17:30:59,302][model8_pretrain.py][INFO] Epoch:[0/2](226500/4588595) loss:3.202 lr:0.0000100 epoch_Time:27563.0min: [2024-01-03 17:30:59,302][model8_pretrain.py][INFO] Epoch:[0/2](226500/4588595) loss:3.133 lr:0.0000100 epoch_Time:27563.0min: [2024-01-03 17:30:59,302][model8_pretrain.py][INFO] Epoch:[0/2](226500/4588595) loss:3.170 lr:0.0000100 epoch_Time:27563.0min: [2024-01-03 17:30:59,302][model8_pretrain.py][INFO] Epoch:[0/2](226500/4588595) loss:2.964 lr:0.0000100 epoch_Time:27563.0min: [2024-01-03 17:30:59,302][model8_pretrain.py][INFO] Epoch:[0/2](226500/4588595) loss:2.634 lr:0.0000100 epoch_Time:27563.0min: [2024-01-03 17:30:59,302][model8_pretrain.py][INFO] Epoch:[0/2](226500/4588595) loss:2.932 lr:0.0000100 epoch_Time:27563.0min: [2024-01-03 17:30:59,302][model8_pretrain.py][INFO] Epoch:[0/2](226500/4588595) loss:3.142 lr:0.0000100 epoch_Time:27563.0min: [2024-01-03 17:30:59,303][model8_pretrain.py][INFO] Epoch:[0/2](226500/4588595) loss:2.875 lr:0.0000100 epoch_Time:27563.0min: [2024-01-03 17:31:36,239][model8_pretrain.py][INFO] Epoch:[0/2](226600/4588595) loss:2.431 lr:0.0000100 epoch_Time:27562.0min: [2024-01-03 17:31:36,239][model8_pretrain.py][INFO] Epoch:[0/2](226600/4588595) loss:2.664 lr:0.0000100 epoch_Time:27562.0min: [2024-01-03 17:31:36,239][model8_pretrain.py][INFO] Epoch:[0/2](226600/4588595) loss:2.156 lr:0.0000100 epoch_Time:27562.0min: [2024-01-03 17:31:36,239][model8_pretrain.py][INFO] Epoch:[0/2](226600/4588595) loss:2.756 lr:0.0000100 epoch_Time:27562.0min: [2024-01-03 17:31:36,239][model8_pretrain.py][INFO] Epoch:[0/2](226600/4588595) loss:3.085 lr:0.0000100 epoch_Time:27562.0min: [2024-01-03 17:31:36,239][model8_pretrain.py][INFO] Epoch:[0/2](226600/4588595) loss:3.295 lr:0.0000100 epoch_Time:27562.0min: [2024-01-03 17:31:36,239][model8_pretrain.py][INFO] Epoch:[0/2](226600/4588595) loss:2.896 lr:0.0000100 epoch_Time:27562.0min: [2024-01-03 
17:31:36,239][model8_pretrain.py][INFO] Epoch:[0/2](226600/4588595) loss:2.846 lr:0.0000100 epoch_Time:27562.0min: [2024-01-03 17:32:13,183][model8_pretrain.py][INFO] Epoch:[0/2](226700/4588595) loss:3.211 lr:0.0000100 epoch_Time:27561.0min: [2024-01-03 17:32:13,183][model8_pretrain.py][INFO] Epoch:[0/2](226700/4588595) loss:3.261 lr:0.0000100 epoch_Time:27561.0min: [2024-01-03 17:32:13,183][model8_pretrain.py][INFO] Epoch:[0/2](226700/4588595) loss:3.164 lr:0.0000100 epoch_Time:27561.0min: [2024-01-03 17:32:13,183][model8_pretrain.py][INFO] Epoch:[0/2](226700/4588595) loss:2.888 lr:0.0000100 epoch_Time:27561.0min: [2024-01-03 17:32:13,183][model8_pretrain.py][INFO] Epoch:[0/2](226700/4588595) loss:3.269 lr:0.0000100 epoch_Time:27561.0min: [2024-01-03 17:32:13,183][model8_pretrain.py][INFO] Epoch:[0/2](226700/4588595) loss:2.764 lr:0.0000100 epoch_Time:27561.0min: [2024-01-03 17:32:13,183][model8_pretrain.py][INFO] Epoch:[0/2](226700/4588595) loss:2.631 lr:0.0000100 epoch_Time:27561.0min: [2024-01-03 17:32:13,183][model8_pretrain.py][INFO] Epoch:[0/2](226700/4588595) loss:3.107 lr:0.0000100 epoch_Time:27561.0min: [2024-01-03 17:32:50,119][model8_pretrain.py][INFO] Epoch:[0/2](226800/4588595) loss:2.818 lr:0.0000100 epoch_Time:27560.0min: [2024-01-03 17:32:50,119][model8_pretrain.py][INFO] Epoch:[0/2](226800/4588595) loss:3.118 lr:0.0000100 epoch_Time:27560.0min: [2024-01-03 17:32:50,119][model8_pretrain.py][INFO] Epoch:[0/2](226800/4588595) loss:2.727 lr:0.0000100 epoch_Time:27560.0min: [2024-01-03 17:32:50,119][model8_pretrain.py][INFO] Epoch:[0/2](226800/4588595) loss:3.042 lr:0.0000100 epoch_Time:27560.0min: [2024-01-03 17:32:50,119][model8_pretrain.py][INFO] Epoch:[0/2](226800/4588595) loss:2.841 lr:0.0000100 epoch_Time:27560.0min: [2024-01-03 17:32:50,119][model8_pretrain.py][INFO] Epoch:[0/2](226800/4588595) loss:2.806 lr:0.0000100 epoch_Time:27560.0min: [2024-01-03 17:32:50,119][model8_pretrain.py][INFO] Epoch:[0/2](226800/4588595) loss:3.398 lr:0.0000100 
epoch_Time:27560.0min: [2024-01-03 17:32:50,119][model8_pretrain.py][INFO] Epoch:[0/2](226800/4588595) loss:2.553 lr:0.0000100 epoch_Time:27560.0min: [2024-01-03 17:33:27,051][model8_pretrain.py][INFO] Epoch:[0/2](226900/4588595) loss:2.967 lr:0.0000100 epoch_Time:27559.0min: [2024-01-03 17:33:27,051][model8_pretrain.py][INFO] Epoch:[0/2](226900/4588595) loss:3.359 lr:0.0000100 epoch_Time:27559.0min: [2024-01-03 17:33:27,051][model8_pretrain.py][INFO] Epoch:[0/2](226900/4588595) loss:2.535 lr:0.0000100 epoch_Time:27559.0min: [2024-01-03 17:33:27,051][model8_pretrain.py][INFO] Epoch:[0/2](226900/4588595) loss:2.909 lr:0.0000100 epoch_Time:27559.0min: [2024-01-03 17:33:27,051][model8_pretrain.py][INFO] Epoch:[0/2](226900/4588595) loss:2.940 lr:0.0000100 epoch_Time:27559.0min: [2024-01-03 17:33:27,051][model8_pretrain.py][INFO] Epoch:[0/2](226900/4588595) loss:3.017 lr:0.0000100 epoch_Time:27559.0min: [2024-01-03 17:33:27,051][model8_pretrain.py][INFO] Epoch:[0/2](226900/4588595) loss:3.457 lr:0.0000100 epoch_Time:27559.0min: [2024-01-03 17:33:27,051][model8_pretrain.py][INFO] Epoch:[0/2](226900/4588595) loss:2.858 lr:0.0000100 epoch_Time:27559.0min: [2024-01-03 17:34:03,986][model8_pretrain.py][INFO] Epoch:[0/2](227000/4588595) loss:2.484 lr:0.0000100 epoch_Time:27558.0min: [2024-01-03 17:34:03,986][model8_pretrain.py][INFO] Epoch:[0/2](227000/4588595) loss:3.006 lr:0.0000100 epoch_Time:27558.0min: [2024-01-03 17:34:03,986][model8_pretrain.py][INFO] Epoch:[0/2](227000/4588595) loss:3.035 lr:0.0000100 epoch_Time:27558.0min: [2024-01-03 17:34:03,986][model8_pretrain.py][INFO] Epoch:[0/2](227000/4588595) loss:2.946 lr:0.0000100 epoch_Time:27558.0min: [2024-01-03 17:34:03,986][model8_pretrain.py][INFO] Epoch:[0/2](227000/4588595) loss:3.119 lr:0.0000100 epoch_Time:27558.0min: [2024-01-03 17:34:03,986][model8_pretrain.py][INFO] Epoch:[0/2](227000/4588595) loss:3.332 lr:0.0000100 epoch_Time:27558.0min: [2024-01-03 17:34:03,986][model8_pretrain.py][INFO] 
Epoch:[0/2](227000/4588595) loss:2.997 lr:0.0000100 epoch_Time:27558.0min: [2024-01-03 17:34:03,986][model8_pretrain.py][INFO] Epoch:[0/2](227000/4588595) loss:2.860 lr:0.0000100 epoch_Time:27558.0min: [2024-01-03 17:34:49,700][model8_pretrain.py][INFO] Epoch:[0/2](227100/4588595) loss:3.236 lr:0.0000100 epoch_Time:27560.0min: [2024-01-03 17:34:49,700][model8_pretrain.py][INFO] Epoch:[0/2](227100/4588595) loss:2.438 lr:0.0000100 epoch_Time:27559.0min: [2024-01-03 17:34:49,700][model8_pretrain.py][INFO] Epoch:[0/2](227100/4588595) loss:2.912 lr:0.0000100 epoch_Time:27559.0min: [2024-01-03 17:34:49,700][model8_pretrain.py][INFO] Epoch:[0/2](227100/4588595) loss:2.802 lr:0.0000100 epoch_Time:27560.0min: [2024-01-03 17:34:49,700][model8_pretrain.py][INFO] Epoch:[0/2](227100/4588595) loss:2.556 lr:0.0000100 epoch_Time:27560.0min: [2024-01-03 17:34:49,700][model8_pretrain.py][INFO] Epoch:[0/2](227100/4588595) loss:2.548 lr:0.0000100 epoch_Time:27560.0min: [2024-01-03 17:34:49,700][model8_pretrain.py][INFO] Epoch:[0/2](227100/4588595) loss:2.922 lr:0.0000100 epoch_Time:27560.0min: [2024-01-03 17:34:51,382][model8_pretrain.py][INFO] Epoch:[0/2](227100/4588595) loss:2.765 lr:0.0000100 epoch_Time:27560.0min: [2024-01-03 17:35:28,296][model8_pretrain.py][INFO] Epoch:[0/2](227200/4588595) loss:3.066 lr:0.0000100 epoch_Time:27560.0min: [2024-01-03 17:35:28,296][model8_pretrain.py][INFO] Epoch:[0/2](227200/4588595) loss:3.091 lr:0.0000100 epoch_Time:27560.0min: [2024-01-03 17:35:28,296][model8_pretrain.py][INFO] Epoch:[0/2](227200/4588595) loss:2.906 lr:0.0000100 epoch_Time:27560.0min: [2024-01-03 17:35:28,296][model8_pretrain.py][INFO] Epoch:[0/2](227200/4588595) loss:3.646 lr:0.0000100 epoch_Time:27560.0min: [2024-01-03 17:35:28,296][model8_pretrain.py][INFO] Epoch:[0/2](227200/4588595) loss:2.966 lr:0.0000100 epoch_Time:27560.0min: [2024-01-03 17:35:28,296][model8_pretrain.py][INFO] Epoch:[0/2](227200/4588595) loss:2.512 lr:0.0000100 epoch_Time:27560.0min: [2024-01-03 
17:35:28,296][model8_pretrain.py][INFO] Epoch:[0/2](227200/4588595) loss:2.823 lr:0.0000100 epoch_Time:27560.0min: [2024-01-03 17:35:28,296][model8_pretrain.py][INFO] Epoch:[0/2](227200/4588595) loss:3.253 lr:0.0000100 epoch_Time:27560.0min: [2024-01-03 17:36:05,220][model8_pretrain.py][INFO] Epoch:[0/2](227300/4588595) loss:2.806 lr:0.0000100 epoch_Time:27558.0min: [2024-01-03 17:36:05,220][model8_pretrain.py][INFO] Epoch:[0/2](227300/4588595) loss:3.342 lr:0.0000100 epoch_Time:27558.0min: [2024-01-03 17:36:05,220][model8_pretrain.py][INFO] Epoch:[0/2](227300/4588595) loss:2.715 lr:0.0000100 epoch_Time:27558.0min: [2024-01-03 17:36:05,220][model8_pretrain.py][INFO] Epoch:[0/2](227300/4588595) loss:2.984 lr:0.0000100 epoch_Time:27558.0min: [2024-01-03 17:36:05,220][model8_pretrain.py][INFO] Epoch:[0/2](227300/4588595) loss:3.098 lr:0.0000100 epoch_Time:27558.0min: [2024-01-03 17:36:05,220][model8_pretrain.py][INFO] Epoch:[0/2](227300/4588595) loss:2.622 lr:0.0000100 epoch_Time:27558.0min: [2024-01-03 17:36:05,220][model8_pretrain.py][INFO] Epoch:[0/2](227300/4588595) loss:2.885 lr:0.0000100 epoch_Time:27558.0min: [2024-01-03 17:36:05,221][model8_pretrain.py][INFO] Epoch:[0/2](227300/4588595) loss:3.232 lr:0.0000100 epoch_Time:27558.0min: [2024-01-03 17:36:42,157][model8_pretrain.py][INFO] Epoch:[0/2](227400/4588595) loss:3.346 lr:0.0000100 epoch_Time:27558.0min: [2024-01-03 17:36:42,157][model8_pretrain.py][INFO] Epoch:[0/2](227400/4588595) loss:3.231 lr:0.0000100 epoch_Time:27558.0min: [2024-01-03 17:36:42,158][model8_pretrain.py][INFO] Epoch:[0/2](227400/4588595) loss:3.148 lr:0.0000100 epoch_Time:27558.0min: [2024-01-03 17:36:42,157][model8_pretrain.py][INFO] Epoch:[0/2](227400/4588595) loss:3.011 lr:0.0000100 epoch_Time:27558.0min: [2024-01-03 17:36:42,158][model8_pretrain.py][INFO] Epoch:[0/2](227400/4588595) loss:2.633 lr:0.0000100 epoch_Time:27558.0min: [2024-01-03 17:36:42,158][model8_pretrain.py][INFO] Epoch:[0/2](227400/4588595) loss:2.745 lr:0.0000100 
epoch_Time:27558.0min: [2024-01-03 17:36:42,158][model8_pretrain.py][INFO] Epoch:[0/2](227400/4588595) loss:3.065 lr:0.0000100 epoch_Time:27558.0min: [2024-01-03 17:36:42,158][model8_pretrain.py][INFO] Epoch:[0/2](227400/4588595) loss:2.838 lr:0.0000100 epoch_Time:27558.0min: [2024-01-03 17:37:19,097][model8_pretrain.py][INFO] Epoch:[0/2](227500/4588595) loss:3.310 lr:0.0000100 epoch_Time:27557.0min: [2024-01-03 17:37:19,097][model8_pretrain.py][INFO] Epoch:[0/2](227500/4588595) loss:2.723 lr:0.0000100 epoch_Time:27557.0min: [2024-01-03 17:37:19,097][model8_pretrain.py][INFO] Epoch:[0/2](227500/4588595) loss:2.309 lr:0.0000100 epoch_Time:27557.0min: [2024-01-03 17:37:19,097][model8_pretrain.py][INFO] Epoch:[0/2](227500/4588595) loss:3.177 lr:0.0000100 epoch_Time:27557.0min: [2024-01-03 17:37:19,097][model8_pretrain.py][INFO] Epoch:[0/2](227500/4588595) loss:3.339 lr:0.0000100 epoch_Time:27557.0min: [2024-01-03 17:37:19,097][model8_pretrain.py][INFO] Epoch:[0/2](227500/4588595) loss:3.285 lr:0.0000100 epoch_Time:27557.0min: [2024-01-03 17:37:19,097][model8_pretrain.py][INFO] Epoch:[0/2](227500/4588595) loss:3.355 lr:0.0000100 epoch_Time:27557.0min: [2024-01-03 17:37:19,097][model8_pretrain.py][INFO] Epoch:[0/2](227500/4588595) loss:2.872 lr:0.0000100 epoch_Time:27557.0min: [2024-01-03 17:37:56,037][model8_pretrain.py][INFO] Epoch:[0/2](227600/4588595) loss:3.064 lr:0.0000100 epoch_Time:27555.0min: [2024-01-03 17:37:56,037][model8_pretrain.py][INFO] Epoch:[0/2](227600/4588595) loss:2.385 lr:0.0000100 epoch_Time:27555.0min: [2024-01-03 17:37:56,037][model8_pretrain.py][INFO] Epoch:[0/2](227600/4588595) loss:2.562 lr:0.0000100 epoch_Time:27555.0min: [2024-01-03 17:37:56,037][model8_pretrain.py][INFO] Epoch:[0/2](227600/4588595) loss:3.370 lr:0.0000100 epoch_Time:27555.0min: [2024-01-03 17:37:56,037][model8_pretrain.py][INFO] Epoch:[0/2](227600/4588595) loss:2.967 lr:0.0000100 epoch_Time:27555.0min: [2024-01-03 17:37:56,037][model8_pretrain.py][INFO] 
Epoch:[0/2](227600/4588595) loss:3.111 lr:0.0000100 epoch_Time:27555.0min: [2024-01-03 17:37:56,037][model8_pretrain.py][INFO] Epoch:[0/2](227600/4588595) loss:2.870 lr:0.0000100 epoch_Time:27555.0min: [2024-01-03 17:37:56,037][model8_pretrain.py][INFO] Epoch:[0/2](227600/4588595) loss:2.465 lr:0.0000100 epoch_Time:27555.0min: [2024-01-03 17:38:32,975][model8_pretrain.py][INFO] Epoch:[0/2](227700/4588595) loss:3.192 lr:0.0000100 epoch_Time:27555.0min: [2024-01-03 17:38:32,975][model8_pretrain.py][INFO] Epoch:[0/2](227700/4588595) loss:3.397 lr:0.0000100 epoch_Time:27555.0min: [2024-01-03 17:38:32,975][model8_pretrain.py][INFO] Epoch:[0/2](227700/4588595) loss:2.785 lr:0.0000100 epoch_Time:27555.0min: [2024-01-03 17:38:32,975][model8_pretrain.py][INFO] Epoch:[0/2](227700/4588595) loss:2.274 lr:0.0000100 epoch_Time:27555.0min: [2024-01-03 17:38:32,975][model8_pretrain.py][INFO] Epoch:[0/2](227700/4588595) loss:2.289 lr:0.0000100 epoch_Time:27555.0min: [2024-01-03 17:38:32,975][model8_pretrain.py][INFO] Epoch:[0/2](227700/4588595) loss:2.970 lr:0.0000100 epoch_Time:27555.0min: [2024-01-03 17:38:32,975][model8_pretrain.py][INFO] Epoch:[0/2](227700/4588595) loss:2.918 lr:0.0000100 epoch_Time:27555.0min: [2024-01-03 17:38:32,975][model8_pretrain.py][INFO] Epoch:[0/2](227700/4588595) loss:2.870 lr:0.0000100 epoch_Time:27555.0min: [2024-01-03 17:39:09,916][model8_pretrain.py][INFO] Epoch:[0/2](227800/4588595) loss:3.253 lr:0.0000100 epoch_Time:27554.0min: [2024-01-03 17:39:09,916][model8_pretrain.py][INFO] Epoch:[0/2](227800/4588595) loss:3.133 lr:0.0000100 epoch_Time:27554.0min: [2024-01-03 17:39:09,916][model8_pretrain.py][INFO] Epoch:[0/2](227800/4588595) loss:2.929 lr:0.0000100 epoch_Time:27554.0min: [2024-01-03 17:39:09,916][model8_pretrain.py][INFO] Epoch:[0/2](227800/4588595) loss:3.103 lr:0.0000100 epoch_Time:27554.0min: [2024-01-03 17:39:09,916][model8_pretrain.py][INFO] Epoch:[0/2](227800/4588595) loss:2.986 lr:0.0000100 epoch_Time:27554.0min: [2024-01-03 
17:39:09,916][model8_pretrain.py][INFO] Epoch:[0/2](227800/4588595) loss:2.800 lr:0.0000100 epoch_Time:27554.0min: [2024-01-03 17:39:09,916][model8_pretrain.py][INFO] Epoch:[0/2](227800/4588595) loss:2.903 lr:0.0000100 epoch_Time:27554.0min: [2024-01-03 17:39:09,916][model8_pretrain.py][INFO] Epoch:[0/2](227800/4588595) loss:2.514 lr:0.0000100 epoch_Time:27554.0min: [2024-01-03 17:39:55,596][model8_pretrain.py][INFO] Epoch:[0/2](227900/4588595) loss:2.541 lr:0.0000100 epoch_Time:27555.0min: [2024-01-03 17:39:55,596][model8_pretrain.py][INFO] Epoch:[0/2](227900/4588595) loss:2.625 lr:0.0000100 epoch_Time:27555.0min: [2024-01-03 17:39:55,596][model8_pretrain.py][INFO] Epoch:[0/2](227900/4588595) loss:2.832 lr:0.0000100 epoch_Time:27555.0min: [2024-01-03 17:39:55,596][model8_pretrain.py][INFO] Epoch:[0/2](227900/4588595) loss:2.547 lr:0.0000100 epoch_Time:27555.0min: [2024-01-03 17:39:55,596][model8_pretrain.py][INFO] Epoch:[0/2](227900/4588595) loss:3.107 lr:0.0000100 epoch_Time:27555.0min: [2024-01-03 17:39:55,596][model8_pretrain.py][INFO] Epoch:[0/2](227900/4588595) loss:3.492 lr:0.0000100 epoch_Time:27555.0min: [2024-01-03 17:39:55,596][model8_pretrain.py][INFO] Epoch:[0/2](227900/4588595) loss:3.011 lr:0.0000100 epoch_Time:27555.0min: [2024-01-03 17:39:55,596][model8_pretrain.py][INFO] Epoch:[0/2](227900/4588595) loss:2.881 lr:0.0000100 epoch_Time:27555.0min: [2024-01-03 17:40:34,203][model8_pretrain.py][INFO] Epoch:[0/2](228000/4588595) loss:3.420 lr:0.0000100 epoch_Time:27556.0min: [2024-01-03 17:40:34,203][model8_pretrain.py][INFO] Epoch:[0/2](228000/4588595) loss:2.813 lr:0.0000100 epoch_Time:27556.0min: [2024-01-03 17:40:34,203][model8_pretrain.py][INFO] Epoch:[0/2](228000/4588595) loss:2.277 lr:0.0000100 epoch_Time:27556.0min: [2024-01-03 17:40:34,203][model8_pretrain.py][INFO] Epoch:[0/2](228000/4588595) loss:3.157 lr:0.0000100 epoch_Time:27556.0min: [2024-01-03 17:40:34,203][model8_pretrain.py][INFO] Epoch:[0/2](228000/4588595) loss:2.845 lr:0.0000100 
epoch_Time:27556.0min: [2024-01-03 17:40:34,203][model8_pretrain.py][INFO] Epoch:[0/2](228000/4588595) loss:2.643 lr:0.0000100 epoch_Time:27556.0min: [2024-01-03 17:40:34,203][model8_pretrain.py][INFO] Epoch:[0/2](228000/4588595) loss:3.202 lr:0.0000100 epoch_Time:27556.0min: [2024-01-03 17:40:34,204][model8_pretrain.py][INFO] Epoch:[0/2](228000/4588595) loss:2.598 lr:0.0000100 epoch_Time:27556.0min: [2024-01-03 17:41:11,164][model8_pretrain.py][INFO] Epoch:[0/2](228100/4588595) loss:3.290 lr:0.0000100 epoch_Time:27554.0min: [2024-01-03 17:41:11,164][model8_pretrain.py][INFO] Epoch:[0/2](228100/4588595) loss:3.225 lr:0.0000100 epoch_Time:27554.0min: [2024-01-03 17:41:11,164][model8_pretrain.py][INFO] Epoch:[0/2](228100/4588595) loss:3.428 lr:0.0000100 epoch_Time:27554.0min: [2024-01-03 17:41:11,164][model8_pretrain.py][INFO] Epoch:[0/2](228100/4588595) loss:3.274 lr:0.0000100 epoch_Time:27554.0min: [2024-01-03 17:41:11,164][model8_pretrain.py][INFO] Epoch:[0/2](228100/4588595) loss:2.096 lr:0.0000100 epoch_Time:27554.0min: [2024-01-03 17:41:11,164][model8_pretrain.py][INFO] Epoch:[0/2](228100/4588595) loss:3.161 lr:0.0000100 epoch_Time:27554.0min: [2024-01-03 17:41:11,164][model8_pretrain.py][INFO] Epoch:[0/2](228100/4588595) loss:2.791 lr:0.0000100 epoch_Time:27554.0min: [2024-01-03 17:41:11,164][model8_pretrain.py][INFO] Epoch:[0/2](228100/4588595) loss:3.144 lr:0.0000100 epoch_Time:27554.0min: [2024-01-03 17:41:48,110][model8_pretrain.py][INFO] Epoch:[0/2](228200/4588595) loss:2.306 lr:0.0000100 epoch_Time:27553.0min: [2024-01-03 17:41:48,110][model8_pretrain.py][INFO] Epoch:[0/2](228200/4588595) loss:3.383 lr:0.0000100 epoch_Time:27553.0min: [2024-01-03 17:41:48,110][model8_pretrain.py][INFO] Epoch:[0/2](228200/4588595) loss:2.989 lr:0.0000100 epoch_Time:27553.0min: [2024-01-03 17:41:48,110][model8_pretrain.py][INFO] Epoch:[0/2](228200/4588595) loss:2.702 lr:0.0000100 epoch_Time:27553.0min: [2024-01-03 17:41:48,110][model8_pretrain.py][INFO] 
Epoch:[0/2](228200/4588595) loss:2.312 lr:0.0000100 epoch_Time:27553.0min: [2024-01-03 17:41:48,110][model8_pretrain.py][INFO] Epoch:[0/2](228200/4588595) loss:2.743 lr:0.0000100 epoch_Time:27553.0min: [2024-01-03 17:41:48,110][model8_pretrain.py][INFO] Epoch:[0/2](228200/4588595) loss:3.138 lr:0.0000100 epoch_Time:27553.0min: [2024-01-03 17:41:48,110][model8_pretrain.py][INFO] Epoch:[0/2](228200/4588595) loss:2.969 lr:0.0000100 epoch_Time:27553.0min: [2024-01-03 17:42:25,071][model8_pretrain.py][INFO] Epoch:[0/2](228300/4588595) loss:2.655 lr:0.0000100 epoch_Time:27553.0min: [2024-01-03 17:42:25,071][model8_pretrain.py][INFO] Epoch:[0/2](228300/4588595) loss:3.315 lr:0.0000100 epoch_Time:27553.0min: [2024-01-03 17:42:25,071][model8_pretrain.py][INFO] Epoch:[0/2](228300/4588595) loss:2.815 lr:0.0000100 epoch_Time:27553.0min: [2024-01-03 17:42:25,071][model8_pretrain.py][INFO] Epoch:[0/2](228300/4588595) loss:3.052 lr:0.0000100 epoch_Time:27553.0min: [2024-01-03 17:42:25,071][model8_pretrain.py][INFO] Epoch:[0/2](228300/4588595) loss:3.081 lr:0.0000100 epoch_Time:27553.0min: [2024-01-03 17:42:25,071][model8_pretrain.py][INFO] Epoch:[0/2](228300/4588595) loss:3.075 lr:0.0000100 epoch_Time:27553.0min: [2024-01-03 17:42:25,071][model8_pretrain.py][INFO] Epoch:[0/2](228300/4588595) loss:2.811 lr:0.0000100 epoch_Time:27553.0min: [2024-01-03 17:42:25,071][model8_pretrain.py][INFO] Epoch:[0/2](228300/4588595) loss:3.193 lr:0.0000100 epoch_Time:27553.0min: [2024-01-03 17:43:02,082][model8_pretrain.py][INFO] Epoch:[0/2](228400/4588595) loss:3.026 lr:0.0000100 epoch_Time:27551.0min: [2024-01-03 17:43:02,082][model8_pretrain.py][INFO] Epoch:[0/2](228400/4588595) loss:2.777 lr:0.0000100 epoch_Time:27551.0min: [2024-01-03 17:43:02,082][model8_pretrain.py][INFO] Epoch:[0/2](228400/4588595) loss:3.184 lr:0.0000100 epoch_Time:27551.0min: [2024-01-03 17:43:02,082][model8_pretrain.py][INFO] Epoch:[0/2](228400/4588595) loss:3.058 lr:0.0000100 epoch_Time:27551.0min: [2024-01-03 
17:43:02,082][model8_pretrain.py][INFO] Epoch:[0/2](228400/4588595) loss:2.709 lr:0.0000100 epoch_Time:27551.0min: [2024-01-03 17:43:02,083][model8_pretrain.py][INFO] Epoch:[0/2](228400/4588595) loss:3.268 lr:0.0000100 epoch_Time:27551.0min: [2024-01-03 17:43:02,083][model8_pretrain.py][INFO] Epoch:[0/2](228400/4588595) loss:3.188 lr:0.0000100 epoch_Time:27551.0min: [2024-01-03 17:43:02,083][model8_pretrain.py][INFO] Epoch:[0/2](228400/4588595) loss:3.194 lr:0.0000100 epoch_Time:27551.0min: [2024-01-03 17:43:39,024][model8_pretrain.py][INFO] Epoch:[0/2](228500/4588595) loss:3.020 lr:0.0000100 epoch_Time:27551.0min: [2024-01-03 17:43:39,024][model8_pretrain.py][INFO] Epoch:[0/2](228500/4588595) loss:3.355 lr:0.0000100 epoch_Time:27551.0min: [2024-01-03 17:43:39,024][model8_pretrain.py][INFO] Epoch:[0/2](228500/4588595) loss:3.091 lr:0.0000100 epoch_Time:27551.0min: [2024-01-03 17:43:39,024][model8_pretrain.py][INFO] Epoch:[0/2](228500/4588595) loss:2.891 lr:0.0000100 epoch_Time:27551.0min: [2024-01-03 17:43:39,024][model8_pretrain.py][INFO] Epoch:[0/2](228500/4588595) loss:2.885 lr:0.0000100 epoch_Time:27551.0min: [2024-01-03 17:43:39,024][model8_pretrain.py][INFO] Epoch:[0/2](228500/4588595) loss:2.448 lr:0.0000100 epoch_Time:27551.0min: [2024-01-03 17:43:39,024][model8_pretrain.py][INFO] Epoch:[0/2](228500/4588595) loss:2.819 lr:0.0000100 epoch_Time:27551.0min: [2024-01-03 17:43:39,025][model8_pretrain.py][INFO] Epoch:[0/2](228500/4588595) loss:3.078 lr:0.0000100 epoch_Time:27551.0min: [2024-01-03 17:44:15,979][model8_pretrain.py][INFO] Epoch:[0/2](228600/4588595) loss:3.278 lr:0.0000100 epoch_Time:27550.0min: [2024-01-03 17:44:15,979][model8_pretrain.py][INFO] Epoch:[0/2](228600/4588595) loss:3.358 lr:0.0000100 epoch_Time:27550.0min: [2024-01-03 17:44:15,979][model8_pretrain.py][INFO] Epoch:[0/2](228600/4588595) loss:3.342 lr:0.0000100 epoch_Time:27550.0min: [2024-01-03 17:44:15,979][model8_pretrain.py][INFO] Epoch:[0/2](228600/4588595) loss:3.018 lr:0.0000100 
epoch_Time:27550.0min: [2024-01-03 17:44:15,979][model8_pretrain.py][INFO] Epoch:[0/2](228600/4588595) loss:2.732 lr:0.0000100 epoch_Time:27550.0min: [2024-01-03 17:44:15,979][model8_pretrain.py][INFO] Epoch:[0/2](228600/4588595) loss:2.835 lr:0.0000100 epoch_Time:27550.0min: [2024-01-03 17:44:15,979][model8_pretrain.py][INFO] Epoch:[0/2](228600/4588595) loss:3.422 lr:0.0000100 epoch_Time:27550.0min: [2024-01-03 17:44:15,979][model8_pretrain.py][INFO] Epoch:[0/2](228600/4588595) loss:2.638 lr:0.0000100 epoch_Time:27550.0min: [2024-01-03 17:44:58,179][model8_pretrain.py][INFO] Epoch:[0/2](228700/4588595) loss:2.982 lr:0.0000100 epoch_Time:27550.0min: [2024-01-03 17:44:58,179][model8_pretrain.py][INFO] Epoch:[0/2](228700/4588595) loss:2.922 lr:0.0000100 epoch_Time:27550.0min: [2024-01-03 17:44:58,179][model8_pretrain.py][INFO] Epoch:[0/2](228700/4588595) loss:3.098 lr:0.0000100 epoch_Time:27550.0min: [2024-01-03 17:44:58,179][model8_pretrain.py][INFO] Epoch:[0/2](228700/4588595) loss:2.973 lr:0.0000100 epoch_Time:27550.0min: [2024-01-03 17:44:58,179][model8_pretrain.py][INFO] Epoch:[0/2](228700/4588595) loss:3.715 lr:0.0000100 epoch_Time:27550.0min: [2024-01-03 17:44:58,179][model8_pretrain.py][INFO] Epoch:[0/2](228700/4588595) loss:3.193 lr:0.0000100 epoch_Time:27550.0min: [2024-01-03 17:44:58,179][model8_pretrain.py][INFO] Epoch:[0/2](228700/4588595) loss:2.812 lr:0.0000100 epoch_Time:27550.0min: [2024-01-03 17:44:58,179][model8_pretrain.py][INFO] Epoch:[0/2](228700/4588595) loss:2.693 lr:0.0000100 epoch_Time:27550.0min: [2024-01-03 17:45:40,296][model8_pretrain.py][INFO] Epoch:[0/2](228800/4588595) loss:3.257 lr:0.0000100 epoch_Time:27552.0min: [2024-01-03 17:45:40,296][model8_pretrain.py][INFO] Epoch:[0/2](228800/4588595) loss:2.497 lr:0.0000100 epoch_Time:27552.0min: [2024-01-03 17:45:40,296][model8_pretrain.py][INFO] Epoch:[0/2](228800/4588595) loss:3.199 lr:0.0000100 epoch_Time:27552.0min: [2024-01-03 17:45:40,296][model8_pretrain.py][INFO] 
Epoch:[0/2](228800/4588595) loss:3.125 lr:0.0000100 epoch_Time:27552.0min: [2024-01-03 17:45:40,296][model8_pretrain.py][INFO] Epoch:[0/2](228800/4588595) loss:2.377 lr:0.0000100 epoch_Time:27552.0min: [2024-01-03 17:45:40,296][model8_pretrain.py][INFO] Epoch:[0/2](228800/4588595) loss:3.612 lr:0.0000100 epoch_Time:27552.0min: [2024-01-03 17:45:40,296][model8_pretrain.py][INFO] Epoch:[0/2](228800/4588595) loss:1.751 lr:0.0000100 epoch_Time:27552.0min: [2024-01-03 17:45:40,296][model8_pretrain.py][INFO] Epoch:[0/2](228800/4588595) loss:2.901 lr:0.0000100 epoch_Time:27552.0min: [2024-01-03 17:46:17,249][model8_pretrain.py][INFO] Epoch:[0/2](228900/4588595) loss:3.365 lr:0.0000100 epoch_Time:27550.0min: [2024-01-03 17:46:17,249][model8_pretrain.py][INFO] Epoch:[0/2](228900/4588595) loss:2.780 lr:0.0000100 epoch_Time:27550.0min: [2024-01-03 17:46:17,249][model8_pretrain.py][INFO] Epoch:[0/2](228900/4588595) loss:2.671 lr:0.0000100 epoch_Time:27550.0min: [2024-01-03 17:46:17,249][model8_pretrain.py][INFO] Epoch:[0/2](228900/4588595) loss:3.076 lr:0.0000100 epoch_Time:27550.0min: [2024-01-03 17:46:17,249][model8_pretrain.py][INFO] Epoch:[0/2](228900/4588595) loss:2.459 lr:0.0000100 epoch_Time:27550.0min: [2024-01-03 17:46:17,249][model8_pretrain.py][INFO] Epoch:[0/2](228900/4588595) loss:2.614 lr:0.0000100 epoch_Time:27550.0min: [2024-01-03 17:46:17,249][model8_pretrain.py][INFO] Epoch:[0/2](228900/4588595) loss:2.665 lr:0.0000100 epoch_Time:27550.0min: [2024-01-03 17:46:17,249][model8_pretrain.py][INFO] Epoch:[0/2](228900/4588595) loss:3.174 lr:0.0000100 epoch_Time:27550.0min: [2024-01-03 17:46:54,193][model8_pretrain.py][INFO] Epoch:[0/2](229000/4588595) loss:2.213 lr:0.0000100 epoch_Time:27549.0min: [2024-01-03 17:46:54,193][model8_pretrain.py][INFO] Epoch:[0/2](229000/4588595) loss:2.911 lr:0.0000100 epoch_Time:27549.0min: [2024-01-03 17:46:54,193][model8_pretrain.py][INFO] Epoch:[0/2](229000/4588595) loss:3.285 lr:0.0000100 epoch_Time:27549.0min: [2024-01-03 
17:46:54,193][model8_pretrain.py][INFO] Epoch:[0/2](229000/4588595) loss:3.146 lr:0.0000100 epoch_Time:27549.0min: [2024-01-03 17:46:54,193][model8_pretrain.py][INFO] Epoch:[0/2](229000/4588595) loss:2.973 lr:0.0000100 epoch_Time:27549.0min: [2024-01-03 17:46:54,193][model8_pretrain.py][INFO] Epoch:[0/2](229000/4588595) loss:2.847 lr:0.0000100 epoch_Time:27549.0min: [2024-01-03 17:46:54,193][model8_pretrain.py][INFO] Epoch:[0/2](229000/4588595) loss:3.155 lr:0.0000100 epoch_Time:27549.0min: [2024-01-03 17:46:54,193][model8_pretrain.py][INFO] Epoch:[0/2](229000/4588595) loss:2.750 lr:0.0000100 epoch_Time:27549.0min: [2024-01-03 17:47:31,130][model8_pretrain.py][INFO] Epoch:[0/2](229100/4588595) loss:3.186 lr:0.0000100 epoch_Time:27549.0min: [2024-01-03 17:47:31,130][model8_pretrain.py][INFO] Epoch:[0/2](229100/4588595) loss:3.042 lr:0.0000100 epoch_Time:27549.0min: [2024-01-03 17:47:31,130][model8_pretrain.py][INFO] Epoch:[0/2](229100/4588595) loss:2.770 lr:0.0000100 epoch_Time:27549.0min: [2024-01-03 17:47:31,130][model8_pretrain.py][INFO] Epoch:[0/2](229100/4588595) loss:3.312 lr:0.0000100 epoch_Time:27549.0min: [2024-01-03 17:47:31,130][model8_pretrain.py][INFO] Epoch:[0/2](229100/4588595) loss:3.348 lr:0.0000100 epoch_Time:27549.0min: [2024-01-03 17:47:31,130][model8_pretrain.py][INFO] Epoch:[0/2](229100/4588595) loss:3.407 lr:0.0000100 epoch_Time:27549.0min: [2024-01-03 17:47:31,130][model8_pretrain.py][INFO] Epoch:[0/2](229100/4588595) loss:2.821 lr:0.0000100 epoch_Time:27549.0min: [2024-01-03 17:47:31,130][model8_pretrain.py][INFO] Epoch:[0/2](229100/4588595) loss:3.084 lr:0.0000100 epoch_Time:27549.0min: [2024-01-03 17:48:08,066][model8_pretrain.py][INFO] Epoch:[0/2](229200/4588595) loss:2.916 lr:0.0000100 epoch_Time:27547.0min: [2024-01-03 17:48:08,066][model8_pretrain.py][INFO] Epoch:[0/2](229200/4588595) loss:2.894 lr:0.0000100 epoch_Time:27547.0min: [2024-01-03 17:48:08,066][model8_pretrain.py][INFO] Epoch:[0/2](229200/4588595) loss:3.051 lr:0.0000100 
epoch_Time:27547.0min: [2024-01-03 17:48:08,066][model8_pretrain.py][INFO] Epoch:[0/2](229200/4588595) loss:2.985 lr:0.0000100 epoch_Time:27547.0min: [2024-01-03 17:48:08,066][model8_pretrain.py][INFO] Epoch:[0/2](229200/4588595) loss:2.877 lr:0.0000100 epoch_Time:27547.0min: [2024-01-03 17:48:08,066][model8_pretrain.py][INFO] Epoch:[0/2](229200/4588595) loss:2.924 lr:0.0000100 epoch_Time:27547.0min: [2024-01-03 17:48:08,066][model8_pretrain.py][INFO] Epoch:[0/2](229200/4588595) loss:3.142 lr:0.0000100 epoch_Time:27547.0min: [2024-01-03 17:48:08,066][model8_pretrain.py][INFO] Epoch:[0/2](229200/4588595) loss:2.899 lr:0.0000100 epoch_Time:27547.0min: [2024-01-03 17:48:45,007][model8_pretrain.py][INFO] Epoch:[0/2](229300/4588595) loss:2.768 lr:0.0000100 epoch_Time:27547.0min: [2024-01-03 17:48:45,007][model8_pretrain.py][INFO] Epoch:[0/2](229300/4588595) loss:3.217 lr:0.0000100 epoch_Time:27547.0min: [2024-01-03 17:48:45,007][model8_pretrain.py][INFO] Epoch:[0/2](229300/4588595) loss:3.110 lr:0.0000100 epoch_Time:27547.0min: [2024-01-03 17:48:45,007][model8_pretrain.py][INFO] Epoch:[0/2](229300/4588595) loss:2.895 lr:0.0000100 epoch_Time:27547.0min: [2024-01-03 17:48:45,007][model8_pretrain.py][INFO] Epoch:[0/2](229300/4588595) loss:3.093 lr:0.0000100 epoch_Time:27547.0min: [2024-01-03 17:48:45,007][model8_pretrain.py][INFO] Epoch:[0/2](229300/4588595) loss:2.585 lr:0.0000100 epoch_Time:27547.0min: [2024-01-03 17:48:45,007][model8_pretrain.py][INFO] Epoch:[0/2](229300/4588595) loss:3.080 lr:0.0000100 epoch_Time:27547.0min: [2024-01-03 17:48:45,007][model8_pretrain.py][INFO] Epoch:[0/2](229300/4588595) loss:3.229 lr:0.0000100 epoch_Time:27547.0min: [2024-01-03 17:49:21,917][model8_pretrain.py][INFO] Epoch:[0/2](229400/4588595) loss:3.079 lr:0.0000100 epoch_Time:27546.0min: [2024-01-03 17:49:21,917][model8_pretrain.py][INFO] Epoch:[0/2](229400/4588595) loss:3.316 lr:0.0000100 epoch_Time:27546.0min: [2024-01-03 17:49:21,917][model8_pretrain.py][INFO] 
Epoch:[0/2](229400/4588595) loss:3.089 lr:0.0000100 epoch_Time:27546.0min: [2024-01-03 17:49:21,917][model8_pretrain.py][INFO] Epoch:[0/2](229400/4588595) loss:2.980 lr:0.0000100 epoch_Time:27546.0min: [2024-01-03 17:49:21,917][model8_pretrain.py][INFO] Epoch:[0/2](229400/4588595) loss:2.901 lr:0.0000100 epoch_Time:27546.0min: [2024-01-03 17:49:21,917][model8_pretrain.py][INFO] Epoch:[0/2](229400/4588595) loss:2.861 lr:0.0000100 epoch_Time:27546.0min: [2024-01-03 17:49:21,917][model8_pretrain.py][INFO] Epoch:[0/2](229400/4588595) loss:2.802 lr:0.0000100 epoch_Time:27546.0min: [2024-01-03 17:49:21,917][model8_pretrain.py][INFO] Epoch:[0/2](229400/4588595) loss:2.827 lr:0.0000100 epoch_Time:27546.0min: [2024-01-03 17:50:04,122][model8_pretrain.py][INFO] Epoch:[0/2](229500/4588595) loss:2.758 lr:0.0000100 epoch_Time:27546.0min: [2024-01-03 17:50:04,122][model8_pretrain.py][INFO] Epoch:[0/2](229500/4588595) loss:3.024 lr:0.0000100 epoch_Time:27546.0min: [2024-01-03 17:50:04,122][model8_pretrain.py][INFO] Epoch:[0/2](229500/4588595) loss:2.954 lr:0.0000100 epoch_Time:27546.0min: [2024-01-03 17:50:04,122][model8_pretrain.py][INFO] Epoch:[0/2](229500/4588595) loss:2.490 lr:0.0000100 epoch_Time:27546.0min: [2024-01-03 17:50:04,122][model8_pretrain.py][INFO] Epoch:[0/2](229500/4588595) loss:2.723 lr:0.0000100 epoch_Time:27546.0min: [2024-01-03 17:50:04,122][model8_pretrain.py][INFO] Epoch:[0/2](229500/4588595) loss:2.539 lr:0.0000100 epoch_Time:27546.0min: [2024-01-03 17:50:04,122][model8_pretrain.py][INFO] Epoch:[0/2](229500/4588595) loss:2.591 lr:0.0000100 epoch_Time:27546.0min: [2024-01-03 17:50:04,122][model8_pretrain.py][INFO] Epoch:[0/2](229500/4588595) loss:3.080 lr:0.0000100 epoch_Time:27546.0min: [2024-01-03 17:50:46,256][model8_pretrain.py][INFO] Epoch:[0/2](229600/4588595) loss:2.637 lr:0.0000100 epoch_Time:27547.0min: [2024-01-03 17:50:46,256][model8_pretrain.py][INFO] Epoch:[0/2](229600/4588595) loss:2.361 lr:0.0000100 epoch_Time:27547.0min: [2024-01-03 
17:50:46,256][model8_pretrain.py][INFO] Epoch:[0/2](229600/4588595) loss:2.177 lr:0.0000100 epoch_Time:27547.0min: [2024-01-03 17:50:46,256][model8_pretrain.py][INFO] Epoch:[0/2](229600/4588595) loss:2.475 lr:0.0000100 epoch_Time:27547.0min: [2024-01-03 17:50:46,256][model8_pretrain.py][INFO] Epoch:[0/2](229600/4588595) loss:3.207 lr:0.0000100 epoch_Time:27547.0min: [2024-01-03 17:50:46,256][model8_pretrain.py][INFO] Epoch:[0/2](229600/4588595) loss:3.062 lr:0.0000100 epoch_Time:27547.0min: [2024-01-03 17:50:46,256][model8_pretrain.py][INFO] Epoch:[0/2](229600/4588595) loss:3.374 lr:0.0000100 epoch_Time:27547.0min: [2024-01-03 17:50:46,256][model8_pretrain.py][INFO] Epoch:[0/2](229600/4588595) loss:3.168 lr:0.0000100 epoch_Time:27547.0min: [2024-01-03 17:51:23,192][model8_pretrain.py][INFO] Epoch:[0/2](229700/4588595) loss:3.231 lr:0.0000100 epoch_Time:27546.0min: [2024-01-03 17:51:23,192][model8_pretrain.py][INFO] Epoch:[0/2](229700/4588595) loss:2.842 lr:0.0000100 epoch_Time:27546.0min: [2024-01-03 17:51:23,192][model8_pretrain.py][INFO] Epoch:[0/2](229700/4588595) loss:2.659 lr:0.0000100 epoch_Time:27546.0min: [2024-01-03 17:51:23,192][model8_pretrain.py][INFO] Epoch:[0/2](229700/4588595) loss:3.384 lr:0.0000100 epoch_Time:27546.0min: [2024-01-03 17:51:23,192][model8_pretrain.py][INFO] Epoch:[0/2](229700/4588595) loss:2.707 lr:0.0000100 epoch_Time:27546.0min: [2024-01-03 17:51:23,192][model8_pretrain.py][INFO] Epoch:[0/2](229700/4588595) loss:3.015 lr:0.0000100 epoch_Time:27546.0min: [2024-01-03 17:51:23,192][model8_pretrain.py][INFO] Epoch:[0/2](229700/4588595) loss:2.845 lr:0.0000100 epoch_Time:27546.0min: [2024-01-03 17:51:23,192][model8_pretrain.py][INFO] Epoch:[0/2](229700/4588595) loss:3.308 lr:0.0000100 epoch_Time:27546.0min: [2024-01-03 17:52:00,152][model8_pretrain.py][INFO] Epoch:[0/2](229800/4588595) loss:2.684 lr:0.0000100 epoch_Time:27545.0min: [2024-01-03 17:52:00,152][model8_pretrain.py][INFO] Epoch:[0/2](229800/4588595) loss:3.194 lr:0.0000100 
epoch_Time:27545.0min: [2024-01-03 17:52:00,152][model8_pretrain.py][INFO] Epoch:[0/2](229800/4588595) loss:3.311 lr:0.0000100 epoch_Time:27545.0min: [2024-01-03 17:52:00,152][model8_pretrain.py][INFO] Epoch:[0/2](229800/4588595) loss:3.079 lr:0.0000100 epoch_Time:27545.0min: [2024-01-03 17:52:00,152][model8_pretrain.py][INFO] Epoch:[0/2](229800/4588595) loss:3.065 lr:0.0000100 epoch_Time:27545.0min: [2024-01-03 17:52:00,152][model8_pretrain.py][INFO] Epoch:[0/2](229800/4588595) loss:2.711 lr:0.0000100 epoch_Time:27545.0min: [2024-01-03 17:52:00,153][model8_pretrain.py][INFO] Epoch:[0/2](229800/4588595) loss:2.898 lr:0.0000100 epoch_Time:27545.0min: [2024-01-03 17:52:00,153][model8_pretrain.py][INFO] Epoch:[0/2](229800/4588595) loss:3.154 lr:0.0000100 epoch_Time:27545.0min: [2024-01-03 17:52:37,145][model8_pretrain.py][INFO] Epoch:[0/2](229900/4588595) loss:3.015 lr:0.0000100 epoch_Time:27544.0min: [2024-01-03 17:52:37,145][model8_pretrain.py][INFO] Epoch:[0/2](229900/4588595) loss:2.588 lr:0.0000100 epoch_Time:27544.0min: [2024-01-03 17:52:37,145][model8_pretrain.py][INFO] Epoch:[0/2](229900/4588595) loss:3.097 lr:0.0000100 epoch_Time:27544.0min: [2024-01-03 17:52:37,145][model8_pretrain.py][INFO] Epoch:[0/2](229900/4588595) loss:3.039 lr:0.0000100 epoch_Time:27544.0min: [2024-01-03 17:52:37,145][model8_pretrain.py][INFO] Epoch:[0/2](229900/4588595) loss:2.716 lr:0.0000100 epoch_Time:27544.0min: [2024-01-03 17:52:37,145][model8_pretrain.py][INFO] Epoch:[0/2](229900/4588595) loss:2.614 lr:0.0000100 epoch_Time:27544.0min: [2024-01-03 17:52:37,145][model8_pretrain.py][INFO] Epoch:[0/2](229900/4588595) loss:3.198 lr:0.0000100 epoch_Time:27544.0min: [2024-01-03 17:52:37,145][model8_pretrain.py][INFO] Epoch:[0/2](229900/4588595) loss:2.685 lr:0.0000100 epoch_Time:27544.0min: [2024-01-03 17:53:14,079][model8_pretrain.py][INFO] Epoch:[0/2](230000/4588595) loss:3.343 lr:0.0000100 epoch_Time:27543.0min: [2024-01-03 17:53:14,079][model8_pretrain.py][INFO] 
Epoch:[0/2](230000/4588595) loss:3.269 lr:0.0000100 epoch_Time:27543.0min: [2024-01-03 17:53:14,079][model8_pretrain.py][INFO] Epoch:[0/2](230000/4588595) loss:3.044 lr:0.0000100 epoch_Time:27543.0min: [2024-01-03 17:53:14,079][model8_pretrain.py][INFO] Epoch:[0/2](230000/4588595) loss:2.718 lr:0.0000100 epoch_Time:27543.0min: [2024-01-03 17:53:14,079][model8_pretrain.py][INFO] Epoch:[0/2](230000/4588595) loss:3.268 lr:0.0000100 epoch_Time:27543.0min: [2024-01-03 17:53:14,079][model8_pretrain.py][INFO] Epoch:[0/2](230000/4588595) loss:3.342 lr:0.0000100 epoch_Time:27543.0min: [2024-01-03 17:53:14,079][model8_pretrain.py][INFO] Epoch:[0/2](230000/4588595) loss:2.784 lr:0.0000100 epoch_Time:27543.0min: [2024-01-03 17:53:14,079][model8_pretrain.py][INFO] Epoch:[0/2](230000/4588595) loss:3.065 lr:0.0000100 epoch_Time:27543.0min: [2024-01-03 17:53:51,001][model8_pretrain.py][INFO] Epoch:[0/2](230100/4588595) loss:3.092 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:53:51,001][model8_pretrain.py][INFO] Epoch:[0/2](230100/4588595) loss:3.157 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:53:51,001][model8_pretrain.py][INFO] Epoch:[0/2](230100/4588595) loss:3.228 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:53:51,001][model8_pretrain.py][INFO] Epoch:[0/2](230100/4588595) loss:3.088 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:53:51,001][model8_pretrain.py][INFO] Epoch:[0/2](230100/4588595) loss:2.993 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:53:51,001][model8_pretrain.py][INFO] Epoch:[0/2](230100/4588595) loss:3.377 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:53:51,001][model8_pretrain.py][INFO] Epoch:[0/2](230100/4588595) loss:2.535 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:53:51,001][model8_pretrain.py][INFO] Epoch:[0/2](230100/4588595) loss:3.379 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:54:27,932][model8_pretrain.py][INFO] Epoch:[0/2](230200/4588595) loss:2.909 lr:0.0000100 epoch_Time:27541.0min: [2024-01-03 
17:54:27,932][model8_pretrain.py][INFO] Epoch:[0/2](230200/4588595) loss:2.910 lr:0.0000100 epoch_Time:27541.0min: [2024-01-03 17:54:27,932][model8_pretrain.py][INFO] Epoch:[0/2](230200/4588595) loss:2.790 lr:0.0000100 epoch_Time:27541.0min: [2024-01-03 17:54:27,932][model8_pretrain.py][INFO] Epoch:[0/2](230200/4588595) loss:2.832 lr:0.0000100 epoch_Time:27541.0min: [2024-01-03 17:54:27,933][model8_pretrain.py][INFO] Epoch:[0/2](230200/4588595) loss:3.010 lr:0.0000100 epoch_Time:27541.0min: [2024-01-03 17:54:27,933][model8_pretrain.py][INFO] Epoch:[0/2](230200/4588595) loss:3.229 lr:0.0000100 epoch_Time:27541.0min: [2024-01-03 17:54:27,933][model8_pretrain.py][INFO] Epoch:[0/2](230200/4588595) loss:3.009 lr:0.0000100 epoch_Time:27541.0min: [2024-01-03 17:54:27,933][model8_pretrain.py][INFO] Epoch:[0/2](230200/4588595) loss:3.036 lr:0.0000100 epoch_Time:27541.0min: [2024-01-03 17:55:10,057][model8_pretrain.py][INFO] Epoch:[0/2](230300/4588595) loss:2.988 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:55:10,057][model8_pretrain.py][INFO] Epoch:[0/2](230300/4588595) loss:2.827 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:55:10,057][model8_pretrain.py][INFO] Epoch:[0/2](230300/4588595) loss:2.938 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:55:10,057][model8_pretrain.py][INFO] Epoch:[0/2](230300/4588595) loss:2.737 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:55:10,062][model8_pretrain.py][INFO] Epoch:[0/2](230300/4588595) loss:2.758 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:55:10,062][model8_pretrain.py][INFO] Epoch:[0/2](230300/4588595) loss:2.513 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:55:10,062][model8_pretrain.py][INFO] Epoch:[0/2](230300/4588595) loss:2.806 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:55:10,063][model8_pretrain.py][INFO] Epoch:[0/2](230300/4588595) loss:2.822 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:55:52,175][model8_pretrain.py][INFO] Epoch:[0/2](230400/4588595) loss:3.116 lr:0.0000100 
epoch_Time:27542.0min: [2024-01-03 17:55:52,175][model8_pretrain.py][INFO] Epoch:[0/2](230400/4588595) loss:2.523 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:55:52,175][model8_pretrain.py][INFO] Epoch:[0/2](230400/4588595) loss:2.564 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:55:52,175][model8_pretrain.py][INFO] Epoch:[0/2](230400/4588595) loss:3.209 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:55:52,175][model8_pretrain.py][INFO] Epoch:[0/2](230400/4588595) loss:3.512 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:55:52,176][model8_pretrain.py][INFO] Epoch:[0/2](230400/4588595) loss:2.657 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:55:52,176][model8_pretrain.py][INFO] Epoch:[0/2](230400/4588595) loss:3.056 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:55:52,176][model8_pretrain.py][INFO] Epoch:[0/2](230400/4588595) loss:2.305 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:56:29,106][model8_pretrain.py][INFO] Epoch:[0/2](230500/4588595) loss:2.494 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:56:29,106][model8_pretrain.py][INFO] Epoch:[0/2](230500/4588595) loss:2.808 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:56:29,106][model8_pretrain.py][INFO] Epoch:[0/2](230500/4588595) loss:3.509 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:56:29,106][model8_pretrain.py][INFO] Epoch:[0/2](230500/4588595) loss:2.899 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:56:29,106][model8_pretrain.py][INFO] Epoch:[0/2](230500/4588595) loss:3.404 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:56:29,106][model8_pretrain.py][INFO] Epoch:[0/2](230500/4588595) loss:3.112 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:56:29,106][model8_pretrain.py][INFO] Epoch:[0/2](230500/4588595) loss:2.767 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:56:29,106][model8_pretrain.py][INFO] Epoch:[0/2](230500/4588595) loss:2.709 lr:0.0000100 epoch_Time:27542.0min: [2024-01-03 17:57:06,047][model8_pretrain.py][INFO] 
Epoch:[0/2](230600/4588595) loss:3.065 lr:0.0000100 epoch_Time:27541.0min: [2024-01-03 17:57:06,047][model8_pretrain.py][INFO] Epoch:[0/2](230600/4588595) loss:2.578 lr:0.0000100 epoch_Time:27541.0min: [2024-01-03 17:57:06,047][model8_pretrain.py][INFO] Epoch:[0/2](230600/4588595) loss:3.227 lr:0.0000100 epoch_Time:27541.0min: [2024-01-03 17:57:06,047][model8_pretrain.py][INFO] Epoch:[0/2](230600/4588595) loss:2.889 lr:0.0000100 epoch_Time:27541.0min: [2024-01-03 17:57:06,047][model8_pretrain.py][INFO] Epoch:[0/2](230600/4588595) loss:3.374 lr:0.0000100 epoch_Time:27541.0min: [2024-01-03 17:57:06,047][model8_pretrain.py][INFO] Epoch:[0/2](230600/4588595) loss:3.143 lr:0.0000100 epoch_Time:27541.0min: [2024-01-03 17:57:06,047][model8_pretrain.py][INFO] Epoch:[0/2](230600/4588595) loss:3.183 lr:0.0000100 epoch_Time:27541.0min: [2024-01-03 17:57:06,047][model8_pretrain.py][INFO] Epoch:[0/2](230600/4588595) loss:2.745 lr:0.0000100 epoch_Time:27541.0min: [2024-01-03 17:57:42,982][model8_pretrain.py][INFO] Epoch:[0/2](230700/4588595) loss:2.876 lr:0.0000100 epoch_Time:27540.0min: [2024-01-03 17:57:42,982][model8_pretrain.py][INFO] Epoch:[0/2](230700/4588595) loss:2.737 lr:0.0000100 epoch_Time:27540.0min: [2024-01-03 17:57:42,982][model8_pretrain.py][INFO] Epoch:[0/2](230700/4588595) loss:3.021 lr:0.0000100 epoch_Time:27540.0min: [2024-01-03 17:57:42,982][model8_pretrain.py][INFO] Epoch:[0/2](230700/4588595) loss:2.332 lr:0.0000100 epoch_Time:27540.0min: [2024-01-03 17:57:42,982][model8_pretrain.py][INFO] Epoch:[0/2](230700/4588595) loss:2.898 lr:0.0000100 epoch_Time:27540.0min: [2024-01-03 17:57:42,982][model8_pretrain.py][INFO] Epoch:[0/2](230700/4588595) loss:3.024 lr:0.0000100 epoch_Time:27540.0min: [2024-01-03 17:57:42,982][model8_pretrain.py][INFO] Epoch:[0/2](230700/4588595) loss:3.096 lr:0.0000100 epoch_Time:27540.0min: [2024-01-03 17:57:42,982][model8_pretrain.py][INFO] Epoch:[0/2](230700/4588595) loss:2.964 lr:0.0000100 epoch_Time:27540.0min: [2024-01-03 
17:58:19,924][model8_pretrain.py][INFO] Epoch:[0/2](230800/4588595) loss:2.930 lr:0.0000100 epoch_Time:27539.0min: [2024-01-03 17:58:19,924][model8_pretrain.py][INFO] Epoch:[0/2](230800/4588595) loss:3.387 lr:0.0000100 epoch_Time:27539.0min: [2024-01-03 17:58:19,924][model8_pretrain.py][INFO] Epoch:[0/2](230800/4588595) loss:2.721 lr:0.0000100 epoch_Time:27539.0min: [2024-01-03 17:58:19,924][model8_pretrain.py][INFO] Epoch:[0/2](230800/4588595) loss:3.044 lr:0.0000100 epoch_Time:27539.0min: [2024-01-03 17:58:19,924][model8_pretrain.py][INFO] Epoch:[0/2](230800/4588595) loss:3.354 lr:0.0000100 epoch_Time:27539.0min: [2024-01-03 17:58:19,924][model8_pretrain.py][INFO] Epoch:[0/2](230800/4588595) loss:3.045 lr:0.0000100 epoch_Time:27539.0min: [2024-01-03 17:58:19,924][model8_pretrain.py][INFO] Epoch:[0/2](230800/4588595) loss:2.930 lr:0.0000100 epoch_Time:27539.0min: [2024-01-03 17:58:19,924][model8_pretrain.py][INFO] Epoch:[0/2](230800/4588595) loss:3.000 lr:0.0000100 epoch_Time:27539.0min: [2024-01-03 17:58:56,860][model8_pretrain.py][INFO] Epoch:[0/2](230900/4588595) loss:3.217 lr:0.0000100 epoch_Time:27538.0min: [2024-01-03 17:58:56,860][model8_pretrain.py][INFO] Epoch:[0/2](230900/4588595) loss:2.744 lr:0.0000100 epoch_Time:27538.0min: [2024-01-03 17:58:56,860][model8_pretrain.py][INFO] Epoch:[0/2](230900/4588595) loss:3.203 lr:0.0000100 epoch_Time:27538.0min: [2024-01-03 17:58:56,860][model8_pretrain.py][INFO] Epoch:[0/2](230900/4588595) loss:2.489 lr:0.0000100 epoch_Time:27538.0min: [2024-01-03 17:58:56,860][model8_pretrain.py][INFO] Epoch:[0/2](230900/4588595) loss:3.314 lr:0.0000100 epoch_Time:27538.0min: [2024-01-03 17:58:56,860][model8_pretrain.py][INFO] Epoch:[0/2](230900/4588595) loss:2.930 lr:0.0000100 epoch_Time:27538.0min: [2024-01-03 17:58:56,860][model8_pretrain.py][INFO] Epoch:[0/2](230900/4588595) loss:3.216 lr:0.0000100 epoch_Time:27538.0min: [2024-01-03 17:58:56,861][model8_pretrain.py][INFO] Epoch:[0/2](230900/4588595) loss:3.101 lr:0.0000100 
epoch_Time:27538.0min: [2024-01-03 17:59:33,797][model8_pretrain.py][INFO] Epoch:[0/2](231000/4588595) loss:3.015 lr:0.0000100 epoch_Time:27537.0min: [2024-01-03 17:59:33,797][model8_pretrain.py][INFO] Epoch:[0/2](231000/4588595) loss:3.019 lr:0.0000100 epoch_Time:27537.0min: [2024-01-03 17:59:33,797][model8_pretrain.py][INFO] Epoch:[0/2](231000/4588595) loss:2.952 lr:0.0000100 epoch_Time:27537.0min: [2024-01-03 17:59:33,797][model8_pretrain.py][INFO] Epoch:[0/2](231000/4588595) loss:3.206 lr:0.0000100 epoch_Time:27537.0min: [2024-01-03 17:59:33,797][model8_pretrain.py][INFO] Epoch:[0/2](231000/4588595) loss:3.204 lr:0.0000100 epoch_Time:27537.0min: [2024-01-03 17:59:33,797][model8_pretrain.py][INFO] Epoch:[0/2](231000/4588595) loss:2.665 lr:0.0000100 epoch_Time:27537.0min: [2024-01-03 17:59:33,797][model8_pretrain.py][INFO] Epoch:[0/2](231000/4588595) loss:2.905 lr:0.0000100 epoch_Time:27537.0min: [2024-01-03 17:59:33,798][model8_pretrain.py][INFO] Epoch:[0/2](231000/4588595) loss:3.292 lr:0.0000100 epoch_Time:27537.0min: [2024-01-03 18:00:12,500][model8_pretrain.py][INFO] Epoch:[0/2](231100/4588595) loss:2.715 lr:0.0000100 epoch_Time:27537.0min: [2024-01-03 18:00:12,500][model8_pretrain.py][INFO] Epoch:[0/2](231100/4588595) loss:3.626 lr:0.0000100 epoch_Time:27537.0min: [2024-01-03 18:00:12,500][model8_pretrain.py][INFO] Epoch:[0/2](231100/4588595) loss:3.085 lr:0.0000100 epoch_Time:27537.0min: [2024-01-03 18:00:12,500][model8_pretrain.py][INFO] Epoch:[0/2](231100/4588595) loss:2.115 lr:0.0000100 epoch_Time:27537.0min: [2024-01-03 18:00:12,500][model8_pretrain.py][INFO] Epoch:[0/2](231100/4588595) loss:3.202 lr:0.0000100 epoch_Time:27537.0min: [2024-01-03 18:00:12,500][model8_pretrain.py][INFO] Epoch:[0/2](231100/4588595) loss:2.974 lr:0.0000100 epoch_Time:27537.0min: [2024-01-03 18:00:12,500][model8_pretrain.py][INFO] Epoch:[0/2](231100/4588595) loss:3.021 lr:0.0000100 epoch_Time:27537.0min: [2024-01-03 18:00:12,500][model8_pretrain.py][INFO] 
Epoch:[0/2](231100/4588595) loss:2.216 lr:0.0000100 epoch_Time:27537.0min: [2024-01-03 18:00:58,056][model8_pretrain.py][INFO] Epoch:[0/2](231200/4588595) loss:3.142 lr:0.0000100 epoch_Time:27538.0min: [2024-01-03 18:00:58,056][model8_pretrain.py][INFO] Epoch:[0/2](231200/4588595) loss:3.030 lr:0.0000100 epoch_Time:27538.0min: [2024-01-03 18:00:58,056][model8_pretrain.py][INFO] Epoch:[0/2](231200/4588595) loss:3.207 lr:0.0000100 epoch_Time:27538.0min: [2024-01-03 18:00:58,056][model8_pretrain.py][INFO] Epoch:[0/2](231200/4588595) loss:3.049 lr:0.0000100 epoch_Time:27538.0min: [2024-01-03 18:00:58,056][model8_pretrain.py][INFO] Epoch:[0/2](231200/4588595) loss:2.991 lr:0.0000100 epoch_Time:27538.0min: [2024-01-03 18:00:58,057][model8_pretrain.py][INFO] Epoch:[0/2](231200/4588595) loss:2.695 lr:0.0000100 epoch_Time:27538.0min: [2024-01-03 18:00:58,057][model8_pretrain.py][INFO] Epoch:[0/2](231200/4588595) loss:2.919 lr:0.0000100 epoch_Time:27538.0min: [2024-01-03 18:00:58,057][model8_pretrain.py][INFO] Epoch:[0/2](231200/4588595) loss:2.902 lr:0.0000100 epoch_Time:27538.0min: [2024-01-03 18:01:34,986][model8_pretrain.py][INFO] Epoch:[0/2](231300/4588595) loss:2.884 lr:0.0000100 epoch_Time:27538.0min: [2024-01-03 18:01:34,986][model8_pretrain.py][INFO] Epoch:[0/2](231300/4588595) loss:3.038 lr:0.0000100 epoch_Time:27538.0min: [2024-01-03 18:01:34,986][model8_pretrain.py][INFO] Epoch:[0/2](231300/4588595) loss:3.303 lr:0.0000100 epoch_Time:27538.0min: [2024-01-03 18:01:34,986][model8_pretrain.py][INFO] Epoch:[0/2](231300/4588595) loss:3.215 lr:0.0000100 epoch_Time:27538.0min: [2024-01-03 18:01:34,986][model8_pretrain.py][INFO] Epoch:[0/2](231300/4588595) loss:3.029 lr:0.0000100 epoch_Time:27538.0min: [2024-01-03 18:01:34,986][model8_pretrain.py][INFO] Epoch:[0/2](231300/4588595) loss:2.335 lr:0.0000100 epoch_Time:27538.0min: [2024-01-03 18:01:34,986][model8_pretrain.py][INFO] Epoch:[0/2](231300/4588595) loss:2.456 lr:0.0000100 epoch_Time:27538.0min: [2024-01-03 
18:01:34,986][model8_pretrain.py][INFO] Epoch:[0/2](231300/4588595) loss:2.956 lr:0.0000100 epoch_Time:27538.0min: [2024-01-03 18:02:11,925][model8_pretrain.py][INFO] Epoch:[0/2](231400/4588595) loss:2.510 lr:0.0000100 epoch_Time:27536.0min: [2024-01-03 18:02:11,925][model8_pretrain.py][INFO] Epoch:[0/2](231400/4588595) loss:3.158 lr:0.0000100 epoch_Time:27536.0min: [2024-01-03 18:02:11,925][model8_pretrain.py][INFO] Epoch:[0/2](231400/4588595) loss:2.438 lr:0.0000100 epoch_Time:27536.0min: [2024-01-03 18:02:11,925][model8_pretrain.py][INFO] Epoch:[0/2](231400/4588595) loss:2.821 lr:0.0000100 epoch_Time:27536.0min: [2024-01-03 18:02:11,925][model8_pretrain.py][INFO] Epoch:[0/2](231400/4588595) loss:3.332 lr:0.0000100 epoch_Time:27536.0min: [2024-01-03 18:02:11,925][model8_pretrain.py][INFO] Epoch:[0/2](231400/4588595) loss:3.077 lr:0.0000100 epoch_Time:27536.0min: [2024-01-03 18:02:11,925][model8_pretrain.py][INFO] Epoch:[0/2](231400/4588595) loss:3.145 lr:0.0000100 epoch_Time:27536.0min: [2024-01-03 18:02:11,926][model8_pretrain.py][INFO] Epoch:[0/2](231400/4588595) loss:2.831 lr:0.0000100 epoch_Time:27536.0min: [2024-01-03 18:02:48,841][model8_pretrain.py][INFO] Epoch:[0/2](231500/4588595) loss:2.830 lr:0.0000100 epoch_Time:27535.0min: [2024-01-03 18:02:48,842][model8_pretrain.py][INFO] Epoch:[0/2](231500/4588595) loss:2.787 lr:0.0000100 epoch_Time:27535.0min: [2024-01-03 18:02:48,842][model8_pretrain.py][INFO] Epoch:[0/2](231500/4588595) loss:3.147 lr:0.0000100 epoch_Time:27535.0min: [2024-01-03 18:02:48,842][model8_pretrain.py][INFO] Epoch:[0/2](231500/4588595) loss:3.008 lr:0.0000100 epoch_Time:27535.0min: [2024-01-03 18:02:48,842][model8_pretrain.py][INFO] Epoch:[0/2](231500/4588595) loss:3.317 lr:0.0000100 epoch_Time:27535.0min: [2024-01-03 18:02:48,842][model8_pretrain.py][INFO] Epoch:[0/2](231500/4588595) loss:2.921 lr:0.0000100 epoch_Time:27535.0min: [2024-01-03 18:02:48,842][model8_pretrain.py][INFO] Epoch:[0/2](231500/4588595) loss:2.493 lr:0.0000100 
epoch_Time:27535.0min: [2024-01-03 18:02:48,842][model8_pretrain.py][INFO] Epoch:[0/2](231500/4588595) loss:3.332 lr:0.0000100 epoch_Time:27535.0min: [2024-01-03 18:03:25,791][model8_pretrain.py][INFO] Epoch:[0/2](231600/4588595) loss:3.142 lr:0.0000100 epoch_Time:27535.0min: [2024-01-03 18:03:25,791][model8_pretrain.py][INFO] Epoch:[0/2](231600/4588595) loss:3.049 lr:0.0000100 epoch_Time:27535.0min: [2024-01-03 18:03:25,791][model8_pretrain.py][INFO] Epoch:[0/2](231600/4588595) loss:3.267 lr:0.0000100 epoch_Time:27535.0min: [2024-01-03 18:03:25,791][model8_pretrain.py][INFO] Epoch:[0/2](231600/4588595) loss:3.038 lr:0.0000100 epoch_Time:27535.0min: [2024-01-03 18:03:25,791][model8_pretrain.py][INFO] Epoch:[0/2](231600/4588595) loss:2.852 lr:0.0000100 epoch_Time:27535.0min: [2024-01-03 18:03:25,791][model8_pretrain.py][INFO] Epoch:[0/2](231600/4588595) loss:3.116 lr:0.0000100 epoch_Time:27535.0min: [2024-01-03 18:03:25,791][model8_pretrain.py][INFO] Epoch:[0/2](231600/4588595) loss:3.319 lr:0.0000100 epoch_Time:27535.0min: [2024-01-03 18:03:25,793][model8_pretrain.py][INFO] Epoch:[0/2](231600/4588595) loss:3.053 lr:0.0000100 epoch_Time:27535.0min: [2024-01-03 18:04:02,760][model8_pretrain.py][INFO] Epoch:[0/2](231700/4588595) loss:2.702 lr:0.0000100 epoch_Time:27533.0min: [2024-01-03 18:04:02,760][model8_pretrain.py][INFO] Epoch:[0/2](231700/4588595) loss:2.983 lr:0.0000100 epoch_Time:27533.0min: [2024-01-03 18:04:02,760][model8_pretrain.py][INFO] Epoch:[0/2](231700/4588595) loss:3.000 lr:0.0000100 epoch_Time:27533.0min: [2024-01-03 18:04:02,760][model8_pretrain.py][INFO] Epoch:[0/2](231700/4588595) loss:2.989 lr:0.0000100 epoch_Time:27533.0min: [2024-01-03 18:04:02,760][model8_pretrain.py][INFO] Epoch:[0/2](231700/4588595) loss:2.664 lr:0.0000100 epoch_Time:27533.0min: [2024-01-03 18:04:02,760][model8_pretrain.py][INFO] Epoch:[0/2](231700/4588595) loss:3.190 lr:0.0000100 epoch_Time:27533.0min: [2024-01-03 18:04:02,761][model8_pretrain.py][INFO] 
Epoch:[0/2](231700/4588595) loss:3.238 lr:0.0000100 epoch_Time:27533.0min: [2024-01-03 18:04:02,761][model8_pretrain.py][INFO] Epoch:[0/2](231700/4588595) loss:2.819 lr:0.0000100 epoch_Time:27533.0min: [2024-01-03 18:04:39,700][model8_pretrain.py][INFO] Epoch:[0/2](231800/4588595) loss:2.980 lr:0.0000100 epoch_Time:27533.0min: [2024-01-03 18:04:39,700][model8_pretrain.py][INFO] Epoch:[0/2](231800/4588595) loss:3.355 lr:0.0000100 epoch_Time:27533.0min: [2024-01-03 18:04:39,700][model8_pretrain.py][INFO] Epoch:[0/2](231800/4588595) loss:3.014 lr:0.0000100 epoch_Time:27533.0min: [2024-01-03 18:04:39,700][model8_pretrain.py][INFO] Epoch:[0/2](231800/4588595) loss:2.783 lr:0.0000100 epoch_Time:27533.0min: [2024-01-03 18:04:39,700][model8_pretrain.py][INFO] Epoch:[0/2](231800/4588595) loss:2.846 lr:0.0000100 epoch_Time:27533.0min: [2024-01-03 18:04:39,700][model8_pretrain.py][INFO] Epoch:[0/2](231800/4588595) loss:2.999 lr:0.0000100 epoch_Time:27533.0min: [2024-01-03 18:04:39,700][model8_pretrain.py][INFO] Epoch:[0/2](231800/4588595) loss:2.892 lr:0.0000100 epoch_Time:27533.0min: [2024-01-03 18:04:39,700][model8_pretrain.py][INFO] Epoch:[0/2](231800/4588595) loss:2.426 lr:0.0000100 epoch_Time:27533.0min: [2024-01-03 18:05:18,373][model8_pretrain.py][INFO] Epoch:[0/2](231900/4588595) loss:2.498 lr:0.0000100 epoch_Time:27532.0min: [2024-01-03 18:05:18,373][model8_pretrain.py][INFO] Epoch:[0/2](231900/4588595) loss:2.865 lr:0.0000100 epoch_Time:27532.0min: [2024-01-03 18:05:18,373][model8_pretrain.py][INFO] Epoch:[0/2](231900/4588595) loss:3.111 lr:0.0000100 epoch_Time:27532.0min: [2024-01-03 18:05:18,373][model8_pretrain.py][INFO] Epoch:[0/2](231900/4588595) loss:3.183 lr:0.0000100 epoch_Time:27532.0min: [2024-01-03 18:05:18,373][model8_pretrain.py][INFO] Epoch:[0/2](231900/4588595) loss:2.972 lr:0.0000100 epoch_Time:27532.0min: [2024-01-03 18:05:18,373][model8_pretrain.py][INFO] Epoch:[0/2](231900/4588595) loss:2.610 lr:0.0000100 epoch_Time:27532.0min: [2024-01-03 
18:05:18,373][model8_pretrain.py][INFO] Epoch:[0/2](231900/4588595) loss:3.279 lr:0.0000100 epoch_Time:27532.0min: [2024-01-03 18:05:18,373][model8_pretrain.py][INFO] Epoch:[0/2](231900/4588595) loss:2.731 lr:0.0000100 epoch_Time:27532.0min: [2024-01-03 18:06:03,476][model8_pretrain.py][INFO] Epoch:[0/2](232000/4588595) loss:2.856 lr:0.0000100 epoch_Time:27534.0min: [2024-01-03 18:06:03,476][model8_pretrain.py][INFO] Epoch:[0/2](232000/4588595) loss:3.171 lr:0.0000100 epoch_Time:27534.0min: [2024-01-03 18:06:03,476][model8_pretrain.py][INFO] Epoch:[0/2](232000/4588595) loss:2.820 lr:0.0000100 epoch_Time:27534.0min: [2024-01-03 18:06:03,476][model8_pretrain.py][INFO] Epoch:[0/2](232000/4588595) loss:3.047 lr:0.0000100 epoch_Time:27534.0min: [2024-01-03 18:06:03,476][model8_pretrain.py][INFO] Epoch:[0/2](232000/4588595) loss:3.335 lr:0.0000100 epoch_Time:27534.0min: [2024-01-03 18:06:03,476][model8_pretrain.py][INFO] Epoch:[0/2](232000/4588595) loss:2.650 lr:0.0000100 epoch_Time:27534.0min: [2024-01-03 18:06:03,477][model8_pretrain.py][INFO] Epoch:[0/2](232000/4588595) loss:3.141 lr:0.0000100 epoch_Time:27534.0min: [2024-01-03 18:06:03,478][model8_pretrain.py][INFO] Epoch:[0/2](232000/4588595) loss:2.893 lr:0.0000100 epoch_Time:27534.0min: [2024-01-03 18:06:40,416][model8_pretrain.py][INFO] Epoch:[0/2](232100/4588595) loss:2.503 lr:0.0000100 epoch_Time:27533.0min: [2024-01-03 18:06:40,416][model8_pretrain.py][INFO] Epoch:[0/2](232100/4588595) loss:2.488 lr:0.0000100 epoch_Time:27533.0min: [2024-01-03 18:06:40,416][model8_pretrain.py][INFO] Epoch:[0/2](232100/4588595) loss:2.672 lr:0.0000100 epoch_Time:27533.0min: [2024-01-03 18:06:40,416][model8_pretrain.py][INFO] Epoch:[0/2](232100/4588595) loss:2.910 lr:0.0000100 epoch_Time:27533.0min: [2024-01-03 18:06:40,416][model8_pretrain.py][INFO] Epoch:[0/2](232100/4588595) loss:3.183 lr:0.0000100 epoch_Time:27533.0min: [2024-01-03 18:06:40,416][model8_pretrain.py][INFO] Epoch:[0/2](232100/4588595) loss:3.017 lr:0.0000100 
epoch_Time:27533.0min: [2024-01-03 18:06:40,416][model8_pretrain.py][INFO] Epoch:[0/2](232100/4588595) loss:2.864 lr:0.0000100 epoch_Time:27533.0min: [2024-01-03 18:06:40,416][model8_pretrain.py][INFO] Epoch:[0/2](232100/4588595) loss:2.986 lr:0.0000100 epoch_Time:27533.0min: [2024-01-03 18:07:17,356][model8_pretrain.py][INFO] Epoch:[0/2](232200/4588595) loss:3.177 lr:0.0000100 epoch_Time:27532.0min: [2024-01-03 18:07:17,356][model8_pretrain.py][INFO] Epoch:[0/2](232200/4588595) loss:3.110 lr:0.0000100 epoch_Time:27532.0min: [2024-01-03 18:07:17,356][model8_pretrain.py][INFO] Epoch:[0/2](232200/4588595) loss:2.926 lr:0.0000100 epoch_Time:27532.0min: [2024-01-03 18:07:17,356][model8_pretrain.py][INFO] Epoch:[0/2](232200/4588595) loss:2.869 lr:0.0000100 epoch_Time:27532.0min: [2024-01-03 18:07:17,356][model8_pretrain.py][INFO] Epoch:[0/2](232200/4588595) loss:2.941 lr:0.0000100 epoch_Time:27532.0min: [2024-01-03 18:07:17,356][model8_pretrain.py][INFO] Epoch:[0/2](232200/4588595) loss:2.795 lr:0.0000100 epoch_Time:27532.0min: [2024-01-03 18:07:17,356][model8_pretrain.py][INFO] Epoch:[0/2](232200/4588595) loss:3.280 lr:0.0000100 epoch_Time:27532.0min: [2024-01-03 18:07:17,356][model8_pretrain.py][INFO] Epoch:[0/2](232200/4588595) loss:3.089 lr:0.0000100 epoch_Time:27532.0min: [2024-01-03 18:07:54,301][model8_pretrain.py][INFO] Epoch:[0/2](232300/4588595) loss:2.702 lr:0.0000100 epoch_Time:27531.0min: [2024-01-03 18:07:54,301][model8_pretrain.py][INFO] Epoch:[0/2](232300/4588595) loss:3.183 lr:0.0000100 epoch_Time:27531.0min: [2024-01-03 18:07:54,301][model8_pretrain.py][INFO] Epoch:[0/2](232300/4588595) loss:3.023 lr:0.0000100 epoch_Time:27531.0min: [2024-01-03 18:07:54,301][model8_pretrain.py][INFO] Epoch:[0/2](232300/4588595) loss:2.625 lr:0.0000100 epoch_Time:27531.0min: [2024-01-03 18:07:54,301][model8_pretrain.py][INFO] Epoch:[0/2](232300/4588595) loss:2.999 lr:0.0000100 epoch_Time:27531.0min: [2024-01-03 18:07:54,301][model8_pretrain.py][INFO] 
Epoch:[0/2](232300/4588595) loss:3.421 lr:0.0000100 epoch_Time:27531.0min: [2024-01-03 18:07:54,301][model8_pretrain.py][INFO] Epoch:[0/2](232300/4588595) loss:3.186 lr:0.0000100 epoch_Time:27531.0min: [2024-01-03 18:07:54,301][model8_pretrain.py][INFO] Epoch:[0/2](232300/4588595) loss:2.941 lr:0.0000100 epoch_Time:27531.0min: [2024-01-03 18:08:31,219][model8_pretrain.py][INFO] Epoch:[0/2](232400/4588595) loss:2.712 lr:0.0000100 epoch_Time:27530.0min: [2024-01-03 18:08:31,219][model8_pretrain.py][INFO] Epoch:[0/2](232400/4588595) loss:3.538 lr:0.0000100 epoch_Time:27530.0min: [2024-01-03 18:08:31,219][model8_pretrain.py][INFO] Epoch:[0/2](232400/4588595) loss:2.946 lr:0.0000100 epoch_Time:27530.0min: [2024-01-03 18:08:31,219][model8_pretrain.py][INFO] Epoch:[0/2](232400/4588595) loss:3.352 lr:0.0000100 epoch_Time:27530.0min: [2024-01-03 18:08:31,219][model8_pretrain.py][INFO] Epoch:[0/2](232400/4588595) loss:2.586 lr:0.0000100 epoch_Time:27530.0min: [2024-01-03 18:08:31,219][model8_pretrain.py][INFO] Epoch:[0/2](232400/4588595) loss:2.415 lr:0.0000100 epoch_Time:27530.0min: [2024-01-03 18:08:31,219][model8_pretrain.py][INFO] Epoch:[0/2](232400/4588595) loss:3.045 lr:0.0000100 epoch_Time:27530.0min: [2024-01-03 18:08:31,219][model8_pretrain.py][INFO] Epoch:[0/2](232400/4588595) loss:2.596 lr:0.0000100 epoch_Time:27530.0min: [2024-01-03 18:09:08,162][model8_pretrain.py][INFO] Epoch:[0/2](232500/4588595) loss:2.862 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:09:08,162][model8_pretrain.py][INFO] Epoch:[0/2](232500/4588595) loss:3.111 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:09:08,162][model8_pretrain.py][INFO] Epoch:[0/2](232500/4588595) loss:2.699 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:09:08,162][model8_pretrain.py][INFO] Epoch:[0/2](232500/4588595) loss:2.832 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:09:08,162][model8_pretrain.py][INFO] Epoch:[0/2](232500/4588595) loss:2.975 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 
18:09:08,162][model8_pretrain.py][INFO] Epoch:[0/2](232500/4588595) loss:2.726 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:09:08,162][model8_pretrain.py][INFO] Epoch:[0/2](232500/4588595) loss:3.432 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:09:08,162][model8_pretrain.py][INFO] Epoch:[0/2](232500/4588595) loss:3.361 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:09:45,094][model8_pretrain.py][INFO] Epoch:[0/2](232600/4588595) loss:2.718 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:09:45,095][model8_pretrain.py][INFO] Epoch:[0/2](232600/4588595) loss:2.642 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:09:45,095][model8_pretrain.py][INFO] Epoch:[0/2](232600/4588595) loss:2.922 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:09:45,095][model8_pretrain.py][INFO] Epoch:[0/2](232600/4588595) loss:2.910 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:09:45,095][model8_pretrain.py][INFO] Epoch:[0/2](232600/4588595) loss:2.529 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:09:45,095][model8_pretrain.py][INFO] Epoch:[0/2](232600/4588595) loss:2.052 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:09:45,095][model8_pretrain.py][INFO] Epoch:[0/2](232600/4588595) loss:3.104 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:09:45,095][model8_pretrain.py][INFO] Epoch:[0/2](232600/4588595) loss:2.621 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:10:23,855][model8_pretrain.py][INFO] Epoch:[0/2](232700/4588595) loss:2.962 lr:0.0000100 epoch_Time:27528.0min: [2024-01-03 18:10:23,855][model8_pretrain.py][INFO] Epoch:[0/2](232700/4588595) loss:2.802 lr:0.0000100 epoch_Time:27528.0min: [2024-01-03 18:10:23,855][model8_pretrain.py][INFO] Epoch:[0/2](232700/4588595) loss:2.985 lr:0.0000100 epoch_Time:27528.0min: [2024-01-03 18:10:23,855][model8_pretrain.py][INFO] Epoch:[0/2](232700/4588595) loss:2.999 lr:0.0000100 epoch_Time:27528.0min: [2024-01-03 18:10:23,855][model8_pretrain.py][INFO] Epoch:[0/2](232700/4588595) loss:3.142 lr:0.0000100 
epoch_Time:27528.0min: [2024-01-03 18:10:23,855][model8_pretrain.py][INFO] Epoch:[0/2](232700/4588595) loss:2.932 lr:0.0000100 epoch_Time:27528.0min: [2024-01-03 18:10:23,855][model8_pretrain.py][INFO] Epoch:[0/2](232700/4588595) loss:2.668 lr:0.0000100 epoch_Time:27528.0min: [2024-01-03 18:10:23,855][model8_pretrain.py][INFO] Epoch:[0/2](232700/4588595) loss:3.242 lr:0.0000100 epoch_Time:27528.0min: [2024-01-03 18:11:08,756][model8_pretrain.py][INFO] Epoch:[0/2](232800/4588595) loss:3.461 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:11:08,756][model8_pretrain.py][INFO] Epoch:[0/2](232800/4588595) loss:3.321 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:11:08,756][model8_pretrain.py][INFO] Epoch:[0/2](232800/4588595) loss:2.232 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:11:08,756][model8_pretrain.py][INFO] Epoch:[0/2](232800/4588595) loss:2.448 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:11:08,756][model8_pretrain.py][INFO] Epoch:[0/2](232800/4588595) loss:2.960 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:11:08,756][model8_pretrain.py][INFO] Epoch:[0/2](232800/4588595) loss:3.308 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:11:08,756][model8_pretrain.py][INFO] Epoch:[0/2](232800/4588595) loss:2.856 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:11:08,756][model8_pretrain.py][INFO] Epoch:[0/2](232800/4588595) loss:2.939 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:11:45,683][model8_pretrain.py][INFO] Epoch:[0/2](232900/4588595) loss:2.895 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:11:45,683][model8_pretrain.py][INFO] Epoch:[0/2](232900/4588595) loss:2.802 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:11:45,683][model8_pretrain.py][INFO] Epoch:[0/2](232900/4588595) loss:3.070 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:11:45,683][model8_pretrain.py][INFO] Epoch:[0/2](232900/4588595) loss:2.573 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:11:45,683][model8_pretrain.py][INFO] 
Epoch:[0/2](232900/4588595) loss:3.066 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:11:45,683][model8_pretrain.py][INFO] Epoch:[0/2](232900/4588595) loss:3.274 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:11:45,683][model8_pretrain.py][INFO] Epoch:[0/2](232900/4588595) loss:3.126 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:11:45,683][model8_pretrain.py][INFO] Epoch:[0/2](232900/4588595) loss:3.290 lr:0.0000100 epoch_Time:27529.0min: [2024-01-03 18:12:22,620][model8_pretrain.py][INFO] Epoch:[0/2](233000/4588595) loss:2.786 lr:0.0000100 epoch_Time:27528.0min: [2024-01-03 18:12:22,620][model8_pretrain.py][INFO] Epoch:[0/2](233000/4588595) loss:3.106 lr:0.0000100 epoch_Time:27528.0min: [2024-01-03 18:12:22,620][model8_pretrain.py][INFO] Epoch:[0/2](233000/4588595) loss:2.833 lr:0.0000100 epoch_Time:27528.0min: [2024-01-03 18:12:22,620][model8_pretrain.py][INFO] Epoch:[0/2](233000/4588595) loss:2.766 lr:0.0000100 epoch_Time:27528.0min: [2024-01-03 18:12:22,620][model8_pretrain.py][INFO] Epoch:[0/2](233000/4588595) loss:2.858 lr:0.0000100 epoch_Time:27528.0min: [2024-01-03 18:12:22,620][model8_pretrain.py][INFO] Epoch:[0/2](233000/4588595) loss:2.757 lr:0.0000100 epoch_Time:27528.0min: [2024-01-03 18:12:22,620][model8_pretrain.py][INFO] Epoch:[0/2](233000/4588595) loss:3.265 lr:0.0000100 epoch_Time:27528.0min: [2024-01-03 18:12:22,620][model8_pretrain.py][INFO] Epoch:[0/2](233000/4588595) loss:3.431 lr:0.0000100 epoch_Time:27528.0min: [2024-01-03 18:12:59,566][model8_pretrain.py][INFO] Epoch:[0/2](233100/4588595) loss:2.866 lr:0.0000100 epoch_Time:27526.0min: [2024-01-03 18:12:59,566][model8_pretrain.py][INFO] Epoch:[0/2](233100/4588595) loss:3.361 lr:0.0000100 epoch_Time:27526.0min: [2024-01-03 18:12:59,566][model8_pretrain.py][INFO] Epoch:[0/2](233100/4588595) loss:3.271 lr:0.0000100 epoch_Time:27526.0min: [2024-01-03 18:12:59,566][model8_pretrain.py][INFO] Epoch:[0/2](233100/4588595) loss:3.362 lr:0.0000100 epoch_Time:27526.0min: [2024-01-03 
18:12:59,566][model8_pretrain.py][INFO] Epoch:[0/2](233100/4588595) loss:3.233 lr:0.0000100 epoch_Time:27526.0min: [2024-01-03 18:12:59,566][model8_pretrain.py][INFO] Epoch:[0/2](233100/4588595) loss:3.067 lr:0.0000100 epoch_Time:27526.0min: [2024-01-03 18:12:59,566][model8_pretrain.py][INFO] Epoch:[0/2](233100/4588595) loss:2.388 lr:0.0000100 epoch_Time:27526.0min: [2024-01-03 18:12:59,566][model8_pretrain.py][INFO] Epoch:[0/2](233100/4588595) loss:3.027 lr:0.0000100 epoch_Time:27526.0min: [2024-01-03 18:13:36,522][model8_pretrain.py][INFO] Epoch:[0/2](233200/4588595) loss:2.930 lr:0.0000100 epoch_Time:27526.0min: [2024-01-03 18:13:36,522][model8_pretrain.py][INFO] Epoch:[0/2](233200/4588595) loss:3.595 lr:0.0000100 epoch_Time:27526.0min: [2024-01-03 18:13:36,523][model8_pretrain.py][INFO] Epoch:[0/2](233200/4588595) loss:3.130 lr:0.0000100 epoch_Time:27526.0min: [2024-01-03 18:13:36,523][model8_pretrain.py][INFO] Epoch:[0/2](233200/4588595) loss:2.465 lr:0.0000100 epoch_Time:27526.0min: [2024-01-03 18:13:36,523][model8_pretrain.py][INFO] Epoch:[0/2](233200/4588595) loss:2.552 lr:0.0000100 epoch_Time:27526.0min: [2024-01-03 18:13:36,523][model8_pretrain.py][INFO] Epoch:[0/2](233200/4588595) loss:2.560 lr:0.0000100 epoch_Time:27526.0min: [2024-01-03 18:13:36,523][model8_pretrain.py][INFO] Epoch:[0/2](233200/4588595) loss:2.813 lr:0.0000100 epoch_Time:27526.0min: [2024-01-03 18:13:36,523][model8_pretrain.py][INFO] Epoch:[0/2](233200/4588595) loss:2.351 lr:0.0000100 epoch_Time:27526.0min: [2024-01-03 18:14:13,471][model8_pretrain.py][INFO] Epoch:[0/2](233300/4588595) loss:2.955 lr:0.0000100 epoch_Time:27525.0min: [2024-01-03 18:14:13,471][model8_pretrain.py][INFO] Epoch:[0/2](233300/4588595) loss:3.092 lr:0.0000100 epoch_Time:27525.0min: [2024-01-03 18:14:13,471][model8_pretrain.py][INFO] Epoch:[0/2](233300/4588595) loss:3.197 lr:0.0000100 epoch_Time:27525.0min: [2024-01-03 18:14:13,471][model8_pretrain.py][INFO] Epoch:[0/2](233300/4588595) loss:2.667 lr:0.0000100 
epoch_Time:27525.0min: [2024-01-03 18:14:13,471][model8_pretrain.py][INFO] Epoch:[0/2](233300/4588595) loss:2.934 lr:0.0000100 epoch_Time:27525.0min: [2024-01-03 18:14:13,471][model8_pretrain.py][INFO] Epoch:[0/2](233300/4588595) loss:3.275 lr:0.0000100 epoch_Time:27525.0min: [2024-01-03 18:14:13,471][model8_pretrain.py][INFO] Epoch:[0/2](233300/4588595) loss:3.351 lr:0.0000100 epoch_Time:27525.0min: [2024-01-03 18:14:13,471][model8_pretrain.py][INFO] Epoch:[0/2](233300/4588595) loss:3.143 lr:0.0000100 epoch_Time:27525.0min: [2024-01-03 18:14:50,415][model8_pretrain.py][INFO] Epoch:[0/2](233400/4588595) loss:3.126 lr:0.0000100 epoch_Time:27523.0min: [2024-01-03 18:14:50,415][model8_pretrain.py][INFO] Epoch:[0/2](233400/4588595) loss:3.023 lr:0.0000100 epoch_Time:27523.0min: [2024-01-03 18:14:50,415][model8_pretrain.py][INFO] Epoch:[0/2](233400/4588595) loss:3.479 lr:0.0000100 epoch_Time:27523.0min: [2024-01-03 18:14:50,415][model8_pretrain.py][INFO] Epoch:[0/2](233400/4588595) loss:3.161 lr:0.0000100 epoch_Time:27523.0min: [2024-01-03 18:14:50,415][model8_pretrain.py][INFO] Epoch:[0/2](233400/4588595) loss:2.851 lr:0.0000100 epoch_Time:27523.0min: [2024-01-03 18:14:50,415][model8_pretrain.py][INFO] Epoch:[0/2](233400/4588595) loss:2.789 lr:0.0000100 epoch_Time:27523.0min: [2024-01-03 18:14:50,415][model8_pretrain.py][INFO] Epoch:[0/2](233400/4588595) loss:2.777 lr:0.0000100 epoch_Time:27523.0min: [2024-01-03 18:14:50,415][model8_pretrain.py][INFO] Epoch:[0/2](233400/4588595) loss:3.391 lr:0.0000100 epoch_Time:27523.0min: [2024-01-03 18:15:27,361][model8_pretrain.py][INFO] Epoch:[0/2](233500/4588595) loss:2.909 lr:0.0000100 epoch_Time:27523.0min: [2024-01-03 18:15:27,361][model8_pretrain.py][INFO] Epoch:[0/2](233500/4588595) loss:2.670 lr:0.0000100 epoch_Time:27523.0min: [2024-01-03 18:15:27,361][model8_pretrain.py][INFO] Epoch:[0/2](233500/4588595) loss:2.619 lr:0.0000100 epoch_Time:27523.0min: [2024-01-03 18:15:27,361][model8_pretrain.py][INFO] 
Epoch:[0/2](233500/4588595) loss:3.336 lr:0.0000100 epoch_Time:27523.0min: [2024-01-03 18:15:27,361][model8_pretrain.py][INFO] Epoch:[0/2](233500/4588595) loss:2.746 lr:0.0000100 epoch_Time:27523.0min: [2024-01-03 18:15:27,361][model8_pretrain.py][INFO] Epoch:[0/2](233500/4588595) loss:3.234 lr:0.0000100 epoch_Time:27523.0min: [2024-01-03 18:15:27,361][model8_pretrain.py][INFO] Epoch:[0/2](233500/4588595) loss:2.882 lr:0.0000100 epoch_Time:27523.0min: [2024-01-03 18:15:27,361][model8_pretrain.py][INFO] Epoch:[0/2](233500/4588595) loss:2.599 lr:0.0000100 epoch_Time:27523.0min: [2024-01-03 18:16:13,694][model8_pretrain.py][INFO] Epoch:[0/2](233600/4588595) loss:2.569 lr:0.0000100 epoch_Time:27525.0min: [2024-01-03 18:16:13,694][model8_pretrain.py][INFO] Epoch:[0/2](233600/4588595) loss:3.087 lr:0.0000100 epoch_Time:27525.0min: [2024-01-03 18:16:13,694][model8_pretrain.py][INFO] Epoch:[0/2](233600/4588595) loss:2.931 lr:0.0000100 epoch_Time:27525.0min: [2024-01-03 18:16:13,694][model8_pretrain.py][INFO] Epoch:[0/2](233600/4588595) loss:3.020 lr:0.0000100 epoch_Time:27525.0min: [2024-01-03 18:16:13,694][model8_pretrain.py][INFO] Epoch:[0/2](233600/4588595) loss:2.895 lr:0.0000100 epoch_Time:27525.0min: [2024-01-03 18:16:13,694][model8_pretrain.py][INFO] Epoch:[0/2](233600/4588595) loss:2.676 lr:0.0000100 epoch_Time:27525.0min: [2024-01-03 18:16:13,694][model8_pretrain.py][INFO] Epoch:[0/2](233600/4588595) loss:2.831 lr:0.0000100 epoch_Time:27525.0min: [2024-01-03 18:16:13,695][model8_pretrain.py][INFO] Epoch:[0/2](233600/4588595) loss:3.051 lr:0.0000100 epoch_Time:27525.0min: [2024-01-03 18:16:50,639][model8_pretrain.py][INFO] Epoch:[0/2](233700/4588595) loss:2.794 lr:0.0000100 epoch_Time:27524.0min: [2024-01-03 18:16:50,639][model8_pretrain.py][INFO] Epoch:[0/2](233700/4588595) loss:2.747 lr:0.0000100 epoch_Time:27524.0min: [2024-01-03 18:16:50,639][model8_pretrain.py][INFO] Epoch:[0/2](233700/4588595) loss:2.998 lr:0.0000100 epoch_Time:27524.0min: [2024-01-03 
18:16:50,639][model8_pretrain.py][INFO] Epoch:[0/2](233700/4588595) loss:2.956 lr:0.0000100 epoch_Time:27524.0min: [2024-01-03 18:16:50,639][model8_pretrain.py][INFO] Epoch:[0/2](233700/4588595) loss:3.347 lr:0.0000100 epoch_Time:27524.0min: [2024-01-03 18:16:50,639][model8_pretrain.py][INFO] Epoch:[0/2](233700/4588595) loss:2.387 lr:0.0000100 epoch_Time:27524.0min: [2024-01-03 18:16:50,639][model8_pretrain.py][INFO] Epoch:[0/2](233700/4588595) loss:2.826 lr:0.0000100 epoch_Time:27524.0min: [2024-01-03 18:16:50,639][model8_pretrain.py][INFO] Epoch:[0/2](233700/4588595) loss:2.802 lr:0.0000100 epoch_Time:27524.0min: [2024-01-03 18:17:27,587][model8_pretrain.py][INFO] Epoch:[0/2](233800/4588595) loss:3.075 lr:0.0000100 epoch_Time:27523.0min: [2024-01-03 18:17:27,587][model8_pretrain.py][INFO] Epoch:[0/2](233800/4588595) loss:3.126 lr:0.0000100 epoch_Time:27523.0min: [2024-01-03 18:17:27,587][model8_pretrain.py][INFO] Epoch:[0/2](233800/4588595) loss:2.652 lr:0.0000100 epoch_Time:27523.0min: [2024-01-03 18:17:27,587][model8_pretrain.py][INFO] Epoch:[0/2](233800/4588595) loss:3.098 lr:0.0000100 epoch_Time:27523.0min: [2024-01-03 18:17:27,587][model8_pretrain.py][INFO] Epoch:[0/2](233800/4588595) loss:2.900 lr:0.0000100 epoch_Time:27523.0min: [2024-01-03 18:17:27,587][model8_pretrain.py][INFO] Epoch:[0/2](233800/4588595) loss:2.963 lr:0.0000100 epoch_Time:27523.0min: [2024-01-03 18:17:27,587][model8_pretrain.py][INFO] Epoch:[0/2](233800/4588595) loss:2.461 lr:0.0000100 epoch_Time:27523.0min: [2024-01-03 18:17:27,588][model8_pretrain.py][INFO] Epoch:[0/2](233800/4588595) loss:3.020 lr:0.0000100 epoch_Time:27523.0min: [2024-01-03 18:18:04,523][model8_pretrain.py][INFO] Epoch:[0/2](233900/4588595) loss:3.112 lr:0.0000100 epoch_Time:27522.0min: [2024-01-03 18:18:04,523][model8_pretrain.py][INFO] Epoch:[0/2](233900/4588595) loss:2.569 lr:0.0000100 epoch_Time:27522.0min: [2024-01-03 18:18:04,523][model8_pretrain.py][INFO] Epoch:[0/2](233900/4588595) loss:3.279 lr:0.0000100 
epoch_Time:27522.0min: [2024-01-03 18:18:04,523][model8_pretrain.py][INFO] Epoch:[0/2](233900/4588595) loss:2.777 lr:0.0000100 epoch_Time:27522.0min: [2024-01-03 18:18:04,523][model8_pretrain.py][INFO] Epoch:[0/2](233900/4588595) loss:2.758 lr:0.0000100 epoch_Time:27522.0min: [2024-01-03 18:18:04,523][model8_pretrain.py][INFO] Epoch:[0/2](233900/4588595) loss:2.407 lr:0.0000100 epoch_Time:27522.0min: [2024-01-03 18:18:04,523][model8_pretrain.py][INFO] Epoch:[0/2](233900/4588595) loss:3.072 lr:0.0000100 epoch_Time:27522.0min: [2024-01-03 18:18:04,524][model8_pretrain.py][INFO] Epoch:[0/2](233900/4588595) loss:3.085 lr:0.0000100 epoch_Time:27522.0min: [2024-01-03 18:18:41,467][model8_pretrain.py][INFO] Epoch:[0/2](234000/4588595) loss:2.815 lr:0.0000100 epoch_Time:27522.0min: [2024-01-03 18:18:41,467][model8_pretrain.py][INFO] Epoch:[0/2](234000/4588595) loss:2.546 lr:0.0000100 epoch_Time:27522.0min: [2024-01-03 18:18:41,467][model8_pretrain.py][INFO] Epoch:[0/2](234000/4588595) loss:2.256 lr:0.0000100 epoch_Time:27522.0min: [2024-01-03 18:18:41,467][model8_pretrain.py][INFO] Epoch:[0/2](234000/4588595) loss:3.198 lr:0.0000100 epoch_Time:27522.0min: [2024-01-03 18:18:41,467][model8_pretrain.py][INFO] Epoch:[0/2](234000/4588595) loss:3.027 lr:0.0000100 epoch_Time:27522.0min: [2024-01-03 18:18:41,467][model8_pretrain.py][INFO] Epoch:[0/2](234000/4588595) loss:3.679 lr:0.0000100 epoch_Time:27522.0min: [2024-01-03 18:18:41,467][model8_pretrain.py][INFO] Epoch:[0/2](234000/4588595) loss:2.854 lr:0.0000100 epoch_Time:27522.0min: [2024-01-03 18:18:41,467][model8_pretrain.py][INFO] Epoch:[0/2](234000/4588595) loss:2.596 lr:0.0000100 epoch_Time:27522.0min: [2024-01-03 18:19:18,409][model8_pretrain.py][INFO] Epoch:[0/2](234100/4588595) loss:2.843 lr:0.0000100 epoch_Time:27520.0min: [2024-01-03 18:19:18,409][model8_pretrain.py][INFO] Epoch:[0/2](234100/4588595) loss:2.383 lr:0.0000100 epoch_Time:27520.0min: [2024-01-03 18:19:18,409][model8_pretrain.py][INFO] 
Epoch:[0/2](234100/4588595) loss:3.160 lr:0.0000100 epoch_Time:27520.0min: [2024-01-03 18:19:18,409][model8_pretrain.py][INFO] Epoch:[0/2](234100/4588595) loss:2.722 lr:0.0000100 epoch_Time:27520.0min: [2024-01-03 18:19:18,409][model8_pretrain.py][INFO] Epoch:[0/2](234100/4588595) loss:3.141 lr:0.0000100 epoch_Time:27520.0min: [2024-01-03 18:19:18,409][model8_pretrain.py][INFO] Epoch:[0/2](234100/4588595) loss:3.192 lr:0.0000100 epoch_Time:27520.0min: [2024-01-03 18:19:18,409][model8_pretrain.py][INFO] Epoch:[0/2](234100/4588595) loss:2.774 lr:0.0000100 epoch_Time:27520.0min: [2024-01-03 18:19:18,409][model8_pretrain.py][INFO] Epoch:[0/2](234100/4588595) loss:2.704 lr:0.0000100 epoch_Time:27520.0min: [2024-01-03 18:19:55,347][model8_pretrain.py][INFO] Epoch:[0/2](234200/4588595) loss:2.942 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:19:55,347][model8_pretrain.py][INFO] Epoch:[0/2](234200/4588595) loss:3.159 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:19:55,347][model8_pretrain.py][INFO] Epoch:[0/2](234200/4588595) loss:2.978 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:19:55,347][model8_pretrain.py][INFO] Epoch:[0/2](234200/4588595) loss:3.366 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:19:55,347][model8_pretrain.py][INFO] Epoch:[0/2](234200/4588595) loss:3.273 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:19:55,348][model8_pretrain.py][INFO] Epoch:[0/2](234200/4588595) loss:3.090 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:19:55,347][model8_pretrain.py][INFO] Epoch:[0/2](234200/4588595) loss:3.028 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:19:55,348][model8_pretrain.py][INFO] Epoch:[0/2](234200/4588595) loss:3.110 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:20:32,284][model8_pretrain.py][INFO] Epoch:[0/2](234300/4588595) loss:2.975 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:20:32,284][model8_pretrain.py][INFO] Epoch:[0/2](234300/4588595) loss:3.045 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 
18:20:32,284][model8_pretrain.py][INFO] Epoch:[0/2](234300/4588595) loss:2.651 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:20:32,284][model8_pretrain.py][INFO] Epoch:[0/2](234300/4588595) loss:3.428 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:20:32,284][model8_pretrain.py][INFO] Epoch:[0/2](234300/4588595) loss:3.329 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:20:32,284][model8_pretrain.py][INFO] Epoch:[0/2](234300/4588595) loss:2.679 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:20:32,285][model8_pretrain.py][INFO] Epoch:[0/2](234300/4588595) loss:2.835 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:20:32,285][model8_pretrain.py][INFO] Epoch:[0/2](234300/4588595) loss:2.987 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:21:18,837][model8_pretrain.py][INFO] Epoch:[0/2](234400/4588595) loss:2.665 lr:0.0000100 epoch_Time:27520.0min: [2024-01-03 18:21:18,837][model8_pretrain.py][INFO] Epoch:[0/2](234400/4588595) loss:3.096 lr:0.0000100 epoch_Time:27521.0min: [2024-01-03 18:21:18,837][model8_pretrain.py][INFO] Epoch:[0/2](234400/4588595) loss:2.953 lr:0.0000100 epoch_Time:27521.0min: [2024-01-03 18:21:18,837][model8_pretrain.py][INFO] Epoch:[0/2](234400/4588595) loss:3.249 lr:0.0000100 epoch_Time:27521.0min: [2024-01-03 18:21:18,837][model8_pretrain.py][INFO] Epoch:[0/2](234400/4588595) loss:2.906 lr:0.0000100 epoch_Time:27521.0min: [2024-01-03 18:21:18,837][model8_pretrain.py][INFO] Epoch:[0/2](234400/4588595) loss:2.841 lr:0.0000100 epoch_Time:27521.0min: [2024-01-03 18:21:18,837][model8_pretrain.py][INFO] Epoch:[0/2](234400/4588595) loss:2.955 lr:0.0000100 epoch_Time:27521.0min: [2024-01-03 18:21:18,838][model8_pretrain.py][INFO] Epoch:[0/2](234400/4588595) loss:2.906 lr:0.0000100 epoch_Time:27520.0min: [2024-01-03 18:21:55,776][model8_pretrain.py][INFO] Epoch:[0/2](234500/4588595) loss:3.301 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:21:55,776][model8_pretrain.py][INFO] Epoch:[0/2](234500/4588595) loss:2.578 lr:0.0000100 
epoch_Time:27519.0min: [2024-01-03 18:21:55,776][model8_pretrain.py][INFO] Epoch:[0/2](234500/4588595) loss:2.799 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:21:55,776][model8_pretrain.py][INFO] Epoch:[0/2](234500/4588595) loss:2.915 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:21:55,776][model8_pretrain.py][INFO] Epoch:[0/2](234500/4588595) loss:2.271 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:21:55,776][model8_pretrain.py][INFO] Epoch:[0/2](234500/4588595) loss:2.960 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:21:55,776][model8_pretrain.py][INFO] Epoch:[0/2](234500/4588595) loss:2.839 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:21:55,776][model8_pretrain.py][INFO] Epoch:[0/2](234500/4588595) loss:3.086 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:22:32,720][model8_pretrain.py][INFO] Epoch:[0/2](234600/4588595) loss:2.398 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:22:32,720][model8_pretrain.py][INFO] Epoch:[0/2](234600/4588595) loss:3.258 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:22:32,720][model8_pretrain.py][INFO] Epoch:[0/2](234600/4588595) loss:3.052 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:22:32,720][model8_pretrain.py][INFO] Epoch:[0/2](234600/4588595) loss:3.170 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:22:32,721][model8_pretrain.py][INFO] Epoch:[0/2](234600/4588595) loss:2.765 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:22:32,721][model8_pretrain.py][INFO] Epoch:[0/2](234600/4588595) loss:3.243 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:22:32,721][model8_pretrain.py][INFO] Epoch:[0/2](234600/4588595) loss:2.968 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:22:32,721][model8_pretrain.py][INFO] Epoch:[0/2](234600/4588595) loss:3.096 lr:0.0000100 epoch_Time:27519.0min: [2024-01-03 18:23:09,663][model8_pretrain.py][INFO] Epoch:[0/2](234700/4588595) loss:2.752 lr:0.0000100 epoch_Time:27518.0min: [2024-01-03 18:23:09,663][model8_pretrain.py][INFO] 
Epoch:[0/2](234700/4588595) loss:2.872 lr:0.0000100 epoch_Time:27518.0min: [2024-01-03 18:23:09,663][model8_pretrain.py][INFO] Epoch:[0/2](234700/4588595) loss:2.965 lr:0.0000100 epoch_Time:27518.0min: [2024-01-03 18:23:09,663][model8_pretrain.py][INFO] Epoch:[0/2](234700/4588595) loss:3.149 lr:0.0000100 epoch_Time:27518.0min: [2024-01-03 18:23:09,663][model8_pretrain.py][INFO] Epoch:[0/2](234700/4588595) loss:3.125 lr:0.0000100 epoch_Time:27518.0min: [2024-01-03 18:23:09,663][model8_pretrain.py][INFO] Epoch:[0/2](234700/4588595) loss:2.989 lr:0.0000100 epoch_Time:27518.0min: [2024-01-03 18:23:09,663][model8_pretrain.py][INFO] Epoch:[0/2](234700/4588595) loss:3.059 lr:0.0000100 epoch_Time:27518.0min: [2024-01-03 18:23:09,664][model8_pretrain.py][INFO] Epoch:[0/2](234700/4588595) loss:2.828 lr:0.0000100 epoch_Time:27518.0min: [2024-01-03 18:23:46,613][model8_pretrain.py][INFO] Epoch:[0/2](234800/4588595) loss:2.912 lr:0.0000100 epoch_Time:27517.0min: [2024-01-03 18:23:46,613][model8_pretrain.py][INFO] Epoch:[0/2](234800/4588595) loss:2.546 lr:0.0000100 epoch_Time:27517.0min: [2024-01-03 18:23:46,613][model8_pretrain.py][INFO] Epoch:[0/2](234800/4588595) loss:3.462 lr:0.0000100 epoch_Time:27517.0min: [2024-01-03 18:23:46,613][model8_pretrain.py][INFO] Epoch:[0/2](234800/4588595) loss:2.708 lr:0.0000100 epoch_Time:27517.0min: [2024-01-03 18:23:46,613][model8_pretrain.py][INFO] Epoch:[0/2](234800/4588595) loss:2.364 lr:0.0000100 epoch_Time:27517.0min: [2024-01-03 18:23:46,613][model8_pretrain.py][INFO] Epoch:[0/2](234800/4588595) loss:2.945 lr:0.0000100 epoch_Time:27517.0min: [2024-01-03 18:23:46,614][model8_pretrain.py][INFO] Epoch:[0/2](234800/4588595) loss:3.289 lr:0.0000100 epoch_Time:27517.0min: [2024-01-03 18:23:46,614][model8_pretrain.py][INFO] Epoch:[0/2](234800/4588595) loss:2.895 lr:0.0000100 epoch_Time:27517.0min: [2024-01-03 18:24:23,557][model8_pretrain.py][INFO] Epoch:[0/2](234900/4588595) loss:3.126 lr:0.0000100 epoch_Time:27516.0min: [2024-01-03 
18:24:23,557][model8_pretrain.py][INFO] Epoch:[0/2](234900/4588595) loss:2.412 lr:0.0000100 epoch_Time:27516.0min: [2024-01-03 18:24:23,558][model8_pretrain.py][INFO] Epoch:[0/2](234900/4588595) loss:2.901 lr:0.0000100 epoch_Time:27516.0min: [2024-01-03 18:24:23,558][model8_pretrain.py][INFO] Epoch:[0/2](234900/4588595) loss:2.732 lr:0.0000100 epoch_Time:27516.0min: [2024-01-03 18:24:23,558][model8_pretrain.py][INFO] Epoch:[0/2](234900/4588595) loss:3.008 lr:0.0000100 epoch_Time:27516.0min: [2024-01-03 18:24:23,558][model8_pretrain.py][INFO] Epoch:[0/2](234900/4588595) loss:2.960 lr:0.0000100 epoch_Time:27516.0min: [2024-01-03 18:24:23,558][model8_pretrain.py][INFO] Epoch:[0/2](234900/4588595) loss:2.992 lr:0.0000100 epoch_Time:27516.0min: [2024-01-03 18:24:23,558][model8_pretrain.py][INFO] Epoch:[0/2](234900/4588595) loss:2.827 lr:0.0000100 epoch_Time:27516.0min: [2024-01-03 18:25:00,499][model8_pretrain.py][INFO] Epoch:[0/2](235000/4588595) loss:3.345 lr:0.0000100 epoch_Time:27515.0min: [2024-01-03 18:25:00,499][model8_pretrain.py][INFO] Epoch:[0/2](235000/4588595) loss:3.024 lr:0.0000100 epoch_Time:27515.0min: [2024-01-03 18:25:00,499][model8_pretrain.py][INFO] Epoch:[0/2](235000/4588595) loss:2.864 lr:0.0000100 epoch_Time:27515.0min: [2024-01-03 18:25:00,499][model8_pretrain.py][INFO] Epoch:[0/2](235000/4588595) loss:3.093 lr:0.0000100 epoch_Time:27515.0min: [2024-01-03 18:25:00,499][model8_pretrain.py][INFO] Epoch:[0/2](235000/4588595) loss:3.227 lr:0.0000100 epoch_Time:27515.0min: [2024-01-03 18:25:00,499][model8_pretrain.py][INFO] Epoch:[0/2](235000/4588595) loss:2.714 lr:0.0000100 epoch_Time:27515.0min: [2024-01-03 18:25:00,499][model8_pretrain.py][INFO] Epoch:[0/2](235000/4588595) loss:3.196 lr:0.0000100 epoch_Time:27515.0min: [2024-01-03 18:25:00,499][model8_pretrain.py][INFO] Epoch:[0/2](235000/4588595) loss:2.859 lr:0.0000100 epoch_Time:27515.0min: [2024-01-03 18:25:37,446][model8_pretrain.py][INFO] Epoch:[0/2](235100/4588595) loss:2.611 lr:0.0000100 
epoch_Time:27514.0min: [2024-01-03 18:25:37,446][model8_pretrain.py][INFO] Epoch:[0/2](235100/4588595) loss:3.220 lr:0.0000100 epoch_Time:27514.0min: [2024-01-03 18:25:37,446][model8_pretrain.py][INFO] Epoch:[0/2](235100/4588595) loss:3.161 lr:0.0000100 epoch_Time:27514.0min: [2024-01-03 18:25:37,446][model8_pretrain.py][INFO] Epoch:[0/2](235100/4588595) loss:3.211 lr:0.0000100 epoch_Time:27514.0min: [2024-01-03 18:25:37,446][model8_pretrain.py][INFO] Epoch:[0/2](235100/4588595) loss:2.975 lr:0.0000100 epoch_Time:27514.0min: [2024-01-03 18:25:37,446][model8_pretrain.py][INFO] Epoch:[0/2](235100/4588595) loss:2.826 lr:0.0000100 epoch_Time:27514.0min: [2024-01-03 18:25:37,446][model8_pretrain.py][INFO] Epoch:[0/2](235100/4588595) loss:2.344 lr:0.0000100 epoch_Time:27514.0min: [2024-01-03 18:25:37,446][model8_pretrain.py][INFO] Epoch:[0/2](235100/4588595) loss:2.931 lr:0.0000100 epoch_Time:27514.0min: [2024-01-03 18:26:24,628][model8_pretrain.py][INFO] Epoch:[0/2](235200/4588595) loss:2.781 lr:0.0000100 epoch_Time:27516.0min: [2024-01-03 18:26:24,628][model8_pretrain.py][INFO] Epoch:[0/2](235200/4588595) loss:3.174 lr:0.0000100 epoch_Time:27516.0min: [2024-01-03 18:26:24,628][model8_pretrain.py][INFO] Epoch:[0/2](235200/4588595) loss:2.836 lr:0.0000100 epoch_Time:27516.0min: [2024-01-03 18:26:24,628][model8_pretrain.py][INFO] Epoch:[0/2](235200/4588595) loss:3.086 lr:0.0000100 epoch_Time:27516.0min: [2024-01-03 18:26:24,628][model8_pretrain.py][INFO] Epoch:[0/2](235200/4588595) loss:2.886 lr:0.0000100 epoch_Time:27516.0min: [2024-01-03 18:26:24,628][model8_pretrain.py][INFO] Epoch:[0/2](235200/4588595) loss:3.146 lr:0.0000100 epoch_Time:27516.0min: [2024-01-03 18:26:24,628][model8_pretrain.py][INFO] Epoch:[0/2](235200/4588595) loss:3.267 lr:0.0000100 epoch_Time:27516.0min: [2024-01-03 18:26:24,629][model8_pretrain.py][INFO] Epoch:[0/2](235200/4588595) loss:2.822 lr:0.0000100 epoch_Time:27516.0min: [2024-01-03 18:27:01,561][model8_pretrain.py][INFO] 
Epoch:[0/2](235300/4588595) loss:2.956 lr:0.0000100 epoch_Time:27515.0min: [2024-01-03 18:27:01,561][model8_pretrain.py][INFO] Epoch:[0/2](235300/4588595) loss:2.586 lr:0.0000100 epoch_Time:27515.0min: [2024-01-03 18:27:01,561][model8_pretrain.py][INFO] Epoch:[0/2](235300/4588595) loss:2.845 lr:0.0000100 epoch_Time:27515.0min: [2024-01-03 18:27:01,561][model8_pretrain.py][INFO] Epoch:[0/2](235300/4588595) loss:2.447 lr:0.0000100 epoch_Time:27515.0min: [2024-01-03 18:27:01,561][model8_pretrain.py][INFO] Epoch:[0/2](235300/4588595) loss:2.884 lr:0.0000100 epoch_Time:27515.0min: [2024-01-03 18:27:01,561][model8_pretrain.py][INFO] Epoch:[0/2](235300/4588595) loss:2.506 lr:0.0000100 epoch_Time:27515.0min: [2024-01-03 18:27:01,561][model8_pretrain.py][INFO] Epoch:[0/2](235300/4588595) loss:2.676 lr:0.0000100 epoch_Time:27515.0min: [2024-01-03 18:27:01,561][model8_pretrain.py][INFO] Epoch:[0/2](235300/4588595) loss:3.478 lr:0.0000100 epoch_Time:27515.0min: [2024-01-03 18:27:38,501][model8_pretrain.py][INFO] Epoch:[0/2](235400/4588595) loss:2.734 lr:0.0000100 epoch_Time:27515.0min: [2024-01-03 18:27:38,501][model8_pretrain.py][INFO] Epoch:[0/2](235400/4588595) loss:3.235 lr:0.0000100 epoch_Time:27515.0min: [2024-01-03 18:27:38,501][model8_pretrain.py][INFO] Epoch:[0/2](235400/4588595) loss:3.053 lr:0.0000100 epoch_Time:27515.0min: [2024-01-03 18:27:38,501][model8_pretrain.py][INFO] Epoch:[0/2](235400/4588595) loss:2.740 lr:0.0000100 epoch_Time:27515.0min: [2024-01-03 18:27:38,501][model8_pretrain.py][INFO] Epoch:[0/2](235400/4588595) loss:2.984 lr:0.0000100 epoch_Time:27515.0min: [2024-01-03 18:27:38,501][model8_pretrain.py][INFO] Epoch:[0/2](235400/4588595) loss:2.097 lr:0.0000100 epoch_Time:27515.0min: [2024-01-03 18:27:38,501][model8_pretrain.py][INFO] Epoch:[0/2](235400/4588595) loss:3.524 lr:0.0000100 epoch_Time:27515.0min: [2024-01-03 18:27:38,502][model8_pretrain.py][INFO] Epoch:[0/2](235400/4588595) loss:3.714 lr:0.0000100 epoch_Time:27515.0min: [2024-01-03 
18:28:15,448][model8_pretrain.py][INFO] Epoch:[0/2](235500/4588595) loss:2.574 lr:0.0000100 epoch_Time:27513.0min: [2024-01-03 18:28:15,448][model8_pretrain.py][INFO] Epoch:[0/2](235500/4588595) loss:3.210 lr:0.0000100 epoch_Time:27513.0min: [2024-01-03 18:28:15,448][model8_pretrain.py][INFO] Epoch:[0/2](235500/4588595) loss:3.361 lr:0.0000100 epoch_Time:27513.0min: [2024-01-03 18:28:15,448][model8_pretrain.py][INFO] Epoch:[0/2](235500/4588595) loss:2.838 lr:0.0000100 epoch_Time:27513.0min: [2024-01-03 18:28:15,448][model8_pretrain.py][INFO] Epoch:[0/2](235500/4588595) loss:2.601 lr:0.0000100 epoch_Time:27513.0min: [2024-01-03 18:28:15,448][model8_pretrain.py][INFO] Epoch:[0/2](235500/4588595) loss:2.899 lr:0.0000100 epoch_Time:27513.0min: [2024-01-03 18:28:15,448][model8_pretrain.py][INFO] Epoch:[0/2](235500/4588595) loss:3.078 lr:0.0000100 epoch_Time:27513.0min: [2024-01-03 18:28:15,449][model8_pretrain.py][INFO] Epoch:[0/2](235500/4588595) loss:3.196 lr:0.0000100 epoch_Time:27513.0min: [2024-01-03 18:28:52,394][model8_pretrain.py][INFO] Epoch:[0/2](235600/4588595) loss:2.713 lr:0.0000100 epoch_Time:27512.0min: [2024-01-03 18:28:52,394][model8_pretrain.py][INFO] Epoch:[0/2](235600/4588595) loss:3.082 lr:0.0000100 epoch_Time:27512.0min: [2024-01-03 18:28:52,395][model8_pretrain.py][INFO] Epoch:[0/2](235600/4588595) loss:2.438 lr:0.0000100 epoch_Time:27512.0min: [2024-01-03 18:28:52,395][model8_pretrain.py][INFO] Epoch:[0/2](235600/4588595) loss:3.090 lr:0.0000100 epoch_Time:27512.0min: [2024-01-03 18:28:52,395][model8_pretrain.py][INFO] Epoch:[0/2](235600/4588595) loss:2.517 lr:0.0000100 epoch_Time:27512.0min: [2024-01-03 18:28:52,395][model8_pretrain.py][INFO] Epoch:[0/2](235600/4588595) loss:2.693 lr:0.0000100 epoch_Time:27512.0min: [2024-01-03 18:28:52,395][model8_pretrain.py][INFO] Epoch:[0/2](235600/4588595) loss:2.803 lr:0.0000100 epoch_Time:27512.0min: [2024-01-03 18:28:52,395][model8_pretrain.py][INFO] Epoch:[0/2](235600/4588595) loss:3.173 lr:0.0000100 
epoch_Time:27512.0min: [2024-01-03 18:29:29,340][model8_pretrain.py][INFO] Epoch:[0/2](235700/4588595) loss:2.719 lr:0.0000100 epoch_Time:27512.0min: [2024-01-03 18:29:29,340][model8_pretrain.py][INFO] Epoch:[0/2](235700/4588595) loss:2.897 lr:0.0000100 epoch_Time:27512.0min: [2024-01-03 18:29:29,340][model8_pretrain.py][INFO] Epoch:[0/2](235700/4588595) loss:3.129 lr:0.0000100 epoch_Time:27512.0min: [2024-01-03 18:29:29,340][model8_pretrain.py][INFO] Epoch:[0/2](235700/4588595) loss:3.367 lr:0.0000100 epoch_Time:27512.0min: [2024-01-03 18:29:29,340][model8_pretrain.py][INFO] Epoch:[0/2](235700/4588595) loss:3.188 lr:0.0000100 epoch_Time:27512.0min: [2024-01-03 18:29:29,340][model8_pretrain.py][INFO] Epoch:[0/2](235700/4588595) loss:3.010 lr:0.0000100 epoch_Time:27512.0min: [2024-01-03 18:29:29,340][model8_pretrain.py][INFO] Epoch:[0/2](235700/4588595) loss:3.151 lr:0.0000100 epoch_Time:27512.0min: [2024-01-03 18:29:29,341][model8_pretrain.py][INFO] Epoch:[0/2](235700/4588595) loss:3.189 lr:0.0000100 epoch_Time:27512.0min: [2024-01-03 18:30:06,282][model8_pretrain.py][INFO] Epoch:[0/2](235800/4588595) loss:3.105 lr:0.0000100 epoch_Time:27510.0min: [2024-01-03 18:30:06,282][model8_pretrain.py][INFO] Epoch:[0/2](235800/4588595) loss:3.085 lr:0.0000100 epoch_Time:27510.0min: [2024-01-03 18:30:06,282][model8_pretrain.py][INFO] Epoch:[0/2](235800/4588595) loss:2.825 lr:0.0000100 epoch_Time:27510.0min: [2024-01-03 18:30:06,282][model8_pretrain.py][INFO] Epoch:[0/2](235800/4588595) loss:3.062 lr:0.0000100 epoch_Time:27510.0min: [2024-01-03 18:30:06,282][model8_pretrain.py][INFO] Epoch:[0/2](235800/4588595) loss:2.868 lr:0.0000100 epoch_Time:27510.0min: [2024-01-03 18:30:06,282][model8_pretrain.py][INFO] Epoch:[0/2](235800/4588595) loss:2.903 lr:0.0000100 epoch_Time:27510.0min: [2024-01-03 18:30:06,282][model8_pretrain.py][INFO] Epoch:[0/2](235800/4588595) loss:2.734 lr:0.0000100 epoch_Time:27510.0min: [2024-01-03 18:30:06,282][model8_pretrain.py][INFO] 
Epoch:[0/2](235800/4588595) loss:3.160 lr:0.0000100 epoch_Time:27510.0min: [2024-01-03 18:30:43,218][model8_pretrain.py][INFO] Epoch:[0/2](235900/4588595) loss:3.149 lr:0.0000100 epoch_Time:27510.0min: [2024-01-03 18:30:43,218][model8_pretrain.py][INFO] Epoch:[0/2](235900/4588595) loss:3.106 lr:0.0000100 epoch_Time:27510.0min: [2024-01-03 18:30:43,218][model8_pretrain.py][INFO] Epoch:[0/2](235900/4588595) loss:2.779 lr:0.0000100 epoch_Time:27510.0min: [2024-01-03 18:30:43,218][model8_pretrain.py][INFO] Epoch:[0/2](235900/4588595) loss:2.139 lr:0.0000100 epoch_Time:27510.0min: [2024-01-03 18:30:43,218][model8_pretrain.py][INFO] Epoch:[0/2](235900/4588595) loss:3.012 lr:0.0000100 epoch_Time:27510.0min: [2024-01-03 18:30:43,218][model8_pretrain.py][INFO] Epoch:[0/2](235900/4588595) loss:3.660 lr:0.0000100 epoch_Time:27510.0min: [2024-01-03 18:30:43,218][model8_pretrain.py][INFO] Epoch:[0/2](235900/4588595) loss:2.751 lr:0.0000100 epoch_Time:27510.0min: [2024-01-03 18:30:43,218][model8_pretrain.py][INFO] Epoch:[0/2](235900/4588595) loss:2.555 lr:0.0000100 epoch_Time:27510.0min: [2024-01-03 18:31:30,273][model8_pretrain.py][INFO] Epoch:[0/2](236000/4588595) loss:3.148 lr:0.0000100 epoch_Time:27512.0min: [2024-01-03 18:31:30,273][model8_pretrain.py][INFO] Epoch:[0/2](236000/4588595) loss:2.934 lr:0.0000100 epoch_Time:27512.0min: [2024-01-03 18:31:30,273][model8_pretrain.py][INFO] Epoch:[0/2](236000/4588595) loss:3.092 lr:0.0000100 epoch_Time:27512.0min: [2024-01-03 18:31:30,273][model8_pretrain.py][INFO] Epoch:[0/2](236000/4588595) loss:3.097 lr:0.0000100 epoch_Time:27512.0min: [2024-01-03 18:31:30,273][model8_pretrain.py][INFO] Epoch:[0/2](236000/4588595) loss:2.666 lr:0.0000100 epoch_Time:27512.0min: [2024-01-03 18:31:30,273][model8_pretrain.py][INFO] Epoch:[0/2](236000/4588595) loss:2.995 lr:0.0000100 epoch_Time:27512.0min: [2024-01-03 18:31:30,273][model8_pretrain.py][INFO] Epoch:[0/2](236000/4588595) loss:2.875 lr:0.0000100 epoch_Time:27512.0min: [2024-01-03 
18:31:30,274][model8_pretrain.py][INFO] Epoch:[0/2](236000/4588595) loss:2.765 lr:0.0000100 epoch_Time:27512.0min: [2024-01-03 18:32:07,223][model8_pretrain.py][INFO] Epoch:[0/2](236100/4588595) loss:3.266 lr:0.0000100 epoch_Time:27511.0min: [2024-01-03 18:32:07,223][model8_pretrain.py][INFO] Epoch:[0/2](236100/4588595) loss:3.074 lr:0.0000100 epoch_Time:27511.0min: [2024-01-03 18:32:07,224][model8_pretrain.py][INFO] Epoch:[0/2](236100/4588595) loss:3.435 lr:0.0000100 epoch_Time:27511.0min: [2024-01-03 18:32:07,224][model8_pretrain.py][INFO] Epoch:[0/2](236100/4588595) loss:2.883 lr:0.0000100 epoch_Time:27511.0min: [2024-01-03 18:32:07,224][model8_pretrain.py][INFO] Epoch:[0/2](236100/4588595) loss:2.994 lr:0.0000100 epoch_Time:27511.0min: [2024-01-03 18:32:07,224][model8_pretrain.py][INFO] Epoch:[0/2](236100/4588595) loss:3.385 lr:0.0000100 epoch_Time:27511.0min: [2024-01-03 18:32:07,224][model8_pretrain.py][INFO] Epoch:[0/2](236100/4588595) loss:2.894 lr:0.0000100 epoch_Time:27511.0min: [2024-01-03 18:32:07,224][model8_pretrain.py][INFO] Epoch:[0/2](236100/4588595) loss:3.346 lr:0.0000100 epoch_Time:27511.0min: [2024-01-03 18:32:44,163][model8_pretrain.py][INFO] Epoch:[0/2](236200/4588595) loss:3.392 lr:0.0000100 epoch_Time:27510.0min: [2024-01-03 18:32:44,163][model8_pretrain.py][INFO] Epoch:[0/2](236200/4588595) loss:2.965 lr:0.0000100 epoch_Time:27510.0min: [2024-01-03 18:32:44,163][model8_pretrain.py][INFO] Epoch:[0/2](236200/4588595) loss:3.189 lr:0.0000100 epoch_Time:27510.0min: [2024-01-03 18:32:44,163][model8_pretrain.py][INFO] Epoch:[0/2](236200/4588595) loss:2.883 lr:0.0000100 epoch_Time:27510.0min: [2024-01-03 18:32:44,163][model8_pretrain.py][INFO] Epoch:[0/2](236200/4588595) loss:2.498 lr:0.0000100 epoch_Time:27510.0min: [2024-01-03 18:32:44,164][model8_pretrain.py][INFO] Epoch:[0/2](236200/4588595) loss:2.653 lr:0.0000100 epoch_Time:27510.0min: [2024-01-03 18:32:44,164][model8_pretrain.py][INFO] Epoch:[0/2](236200/4588595) loss:3.396 lr:0.0000100 
epoch_Time:27510.0min: [2024-01-03 18:32:44,164][model8_pretrain.py][INFO] Epoch:[0/2](236200/4588595) loss:3.119 lr:0.0000100 epoch_Time:27510.0min: [2024-01-03 18:33:21,104][model8_pretrain.py][INFO] Epoch:[0/2](236300/4588595) loss:3.620 lr:0.0000100 epoch_Time:27509.0min: [2024-01-03 18:33:21,104][model8_pretrain.py][INFO] Epoch:[0/2](236300/4588595) loss:2.766 lr:0.0000100 epoch_Time:27509.0min: [2024-01-03 18:33:21,104][model8_pretrain.py][INFO] Epoch:[0/2](236300/4588595) loss:3.053 lr:0.0000100 epoch_Time:27509.0min: [2024-01-03 18:33:21,104][model8_pretrain.py][INFO] Epoch:[0/2](236300/4588595) loss:2.663 lr:0.0000100 epoch_Time:27509.0min: [2024-01-03 18:33:21,104][model8_pretrain.py][INFO] Epoch:[0/2](236300/4588595) loss:2.660 lr:0.0000100 epoch_Time:27509.0min: [2024-01-03 18:33:21,104][model8_pretrain.py][INFO] Epoch:[0/2](236300/4588595) loss:3.190 lr:0.0000100 epoch_Time:27509.0min: [2024-01-03 18:33:21,104][model8_pretrain.py][INFO] Epoch:[0/2](236300/4588595) loss:2.849 lr:0.0000100 epoch_Time:27509.0min: [2024-01-03 18:33:21,105][model8_pretrain.py][INFO] Epoch:[0/2](236300/4588595) loss:3.070 lr:0.0000100 epoch_Time:27509.0min: [2024-01-03 18:33:58,048][model8_pretrain.py][INFO] Epoch:[0/2](236400/4588595) loss:3.029 lr:0.0000100 epoch_Time:27508.0min: [2024-01-03 18:33:58,048][model8_pretrain.py][INFO] Epoch:[0/2](236400/4588595) loss:3.133 lr:0.0000100 epoch_Time:27508.0min: [2024-01-03 18:33:58,048][model8_pretrain.py][INFO] Epoch:[0/2](236400/4588595) loss:2.748 lr:0.0000100 epoch_Time:27508.0min: [2024-01-03 18:33:58,048][model8_pretrain.py][INFO] Epoch:[0/2](236400/4588595) loss:2.400 lr:0.0000100 epoch_Time:27508.0min: [2024-01-03 18:33:58,048][model8_pretrain.py][INFO] Epoch:[0/2](236400/4588595) loss:3.068 lr:0.0000100 epoch_Time:27508.0min: [2024-01-03 18:33:58,048][model8_pretrain.py][INFO] Epoch:[0/2](236400/4588595) loss:3.314 lr:0.0000100 epoch_Time:27508.0min: [2024-01-03 18:33:58,049][model8_pretrain.py][INFO] 
Epoch:[0/2](236400/4588595) loss:3.238 lr:0.0000100 epoch_Time:27508.0min: [2024-01-03 18:33:58,049][model8_pretrain.py][INFO] Epoch:[0/2](236400/4588595) loss:2.999 lr:0.0000100 epoch_Time:27508.0min: [2024-01-03 18:34:34,992][model8_pretrain.py][INFO] Epoch:[0/2](236500/4588595) loss:2.626 lr:0.0000100 epoch_Time:27507.0min: [2024-01-03 18:34:34,992][model8_pretrain.py][INFO] Epoch:[0/2](236500/4588595) loss:2.469 lr:0.0000100 epoch_Time:27507.0min: [2024-01-03 18:34:34,992][model8_pretrain.py][INFO] Epoch:[0/2](236500/4588595) loss:3.008 lr:0.0000100 epoch_Time:27507.0min: [2024-01-03 18:34:34,992][model8_pretrain.py][INFO] Epoch:[0/2](236500/4588595) loss:2.948 lr:0.0000100 epoch_Time:27507.0min: [2024-01-03 18:34:34,992][model8_pretrain.py][INFO] Epoch:[0/2](236500/4588595) loss:2.969 lr:0.0000100 epoch_Time:27507.0min: [2024-01-03 18:34:34,992][model8_pretrain.py][INFO] Epoch:[0/2](236500/4588595) loss:3.052 lr:0.0000100 epoch_Time:27507.0min: [2024-01-03 18:34:34,992][model8_pretrain.py][INFO] Epoch:[0/2](236500/4588595) loss:3.492 lr:0.0000100 epoch_Time:27507.0min: [2024-01-03 18:34:34,992][model8_pretrain.py][INFO] Epoch:[0/2](236500/4588595) loss:3.009 lr:0.0000100 epoch_Time:27507.0min: [2024-01-03 18:35:11,933][model8_pretrain.py][INFO] Epoch:[0/2](236600/4588595) loss:3.429 lr:0.0000100 epoch_Time:27506.0min: [2024-01-03 18:35:11,933][model8_pretrain.py][INFO] Epoch:[0/2](236600/4588595) loss:2.684 lr:0.0000100 epoch_Time:27506.0min: [2024-01-03 18:35:11,933][model8_pretrain.py][INFO] Epoch:[0/2](236600/4588595) loss:3.128 lr:0.0000100 epoch_Time:27506.0min: [2024-01-03 18:35:11,933][model8_pretrain.py][INFO] Epoch:[0/2](236600/4588595) loss:2.618 lr:0.0000100 epoch_Time:27506.0min: [2024-01-03 18:35:11,933][model8_pretrain.py][INFO] Epoch:[0/2](236600/4588595) loss:3.124 lr:0.0000100 epoch_Time:27506.0min: [2024-01-03 18:35:11,933][model8_pretrain.py][INFO] Epoch:[0/2](236600/4588595) loss:2.885 lr:0.0000100 epoch_Time:27506.0min: [2024-01-03 
18:35:11,933][model8_pretrain.py][INFO] Epoch:[0/2](236600/4588595) loss:2.909 lr:0.0000100 epoch_Time:27506.0min: [2024-01-03 18:35:11,933][model8_pretrain.py][INFO] Epoch:[0/2](236600/4588595) loss:2.889 lr:0.0000100 epoch_Time:27506.0min: [2024-01-03 18:35:48,876][model8_pretrain.py][INFO] Epoch:[0/2](236700/4588595) loss:2.916 lr:0.0000100 epoch_Time:27505.0min: [2024-01-03 18:35:48,876][model8_pretrain.py][INFO] Epoch:[0/2](236700/4588595) loss:3.354 lr:0.0000100 epoch_Time:27505.0min: [2024-01-03 18:35:48,876][model8_pretrain.py][INFO] Epoch:[0/2](236700/4588595) loss:3.247 lr:0.0000100 epoch_Time:27505.0min: [2024-01-03 18:35:48,876][model8_pretrain.py][INFO] Epoch:[0/2](236700/4588595) loss:2.902 lr:0.0000100 epoch_Time:27505.0min: [2024-01-03 18:35:48,876][model8_pretrain.py][INFO] Epoch:[0/2](236700/4588595) loss:3.153 lr:0.0000100 epoch_Time:27505.0min: [2024-01-03 18:35:48,876][model8_pretrain.py][INFO] Epoch:[0/2](236700/4588595) loss:2.939 lr:0.0000100 epoch_Time:27505.0min: [2024-01-03 18:35:48,876][model8_pretrain.py][INFO] Epoch:[0/2](236700/4588595) loss:3.382 lr:0.0000100 epoch_Time:27505.0min: [2024-01-03 18:35:48,876][model8_pretrain.py][INFO] Epoch:[0/2](236700/4588595) loss:2.658 lr:0.0000100 epoch_Time:27505.0min: [2024-01-03 18:36:36,072][model8_pretrain.py][INFO] Epoch:[0/2](236800/4588595) loss:3.055 lr:0.0000100 epoch_Time:27508.0min: [2024-01-03 18:36:36,072][model8_pretrain.py][INFO] Epoch:[0/2](236800/4588595) loss:2.552 lr:0.0000100 epoch_Time:27508.0min: [2024-01-03 18:36:36,072][model8_pretrain.py][INFO] Epoch:[0/2](236800/4588595) loss:3.012 lr:0.0000100 epoch_Time:27508.0min: [2024-01-03 18:36:36,072][model8_pretrain.py][INFO] Epoch:[0/2](236800/4588595) loss:2.633 lr:0.0000100 epoch_Time:27508.0min: [2024-01-03 18:36:36,072][model8_pretrain.py][INFO] Epoch:[0/2](236800/4588595) loss:2.924 lr:0.0000100 epoch_Time:27508.0min: [2024-01-03 18:36:36,072][model8_pretrain.py][INFO] Epoch:[0/2](236800/4588595) loss:2.933 lr:0.0000100 
epoch_Time:27508.0min: [2024-01-03 18:36:36,072][model8_pretrain.py][INFO] Epoch:[0/2](236800/4588595) loss:3.115 lr:0.0000100 epoch_Time:27508.0min: [2024-01-03 18:36:36,072][model8_pretrain.py][INFO] Epoch:[0/2](236800/4588595) loss:2.583 lr:0.0000100 epoch_Time:27508.0min: [2024-01-03 18:37:13,010][model8_pretrain.py][INFO] Epoch:[0/2](236900/4588595) loss:2.414 lr:0.0000100 epoch_Time:27506.0min: [2024-01-03 18:37:13,010][model8_pretrain.py][INFO] Epoch:[0/2](236900/4588595) loss:3.209 lr:0.0000100 epoch_Time:27506.0min: [2024-01-03 18:37:13,010][model8_pretrain.py][INFO] Epoch:[0/2](236900/4588595) loss:3.668 lr:0.0000100 epoch_Time:27506.0min: [2024-01-03 18:37:13,010][model8_pretrain.py][INFO] Epoch:[0/2](236900/4588595) loss:2.711 lr:0.0000100 epoch_Time:27506.0min: [2024-01-03 18:37:13,010][model8_pretrain.py][INFO] Epoch:[0/2](236900/4588595) loss:2.811 lr:0.0000100 epoch_Time:27506.0min: [2024-01-03 18:37:13,010][model8_pretrain.py][INFO] Epoch:[0/2](236900/4588595) loss:2.844 lr:0.0000100 epoch_Time:27506.0min: [2024-01-03 18:37:13,010][model8_pretrain.py][INFO] Epoch:[0/2](236900/4588595) loss:2.943 lr:0.0000100 epoch_Time:27506.0min: [2024-01-03 18:37:13,010][model8_pretrain.py][INFO] Epoch:[0/2](236900/4588595) loss:3.159 lr:0.0000100 epoch_Time:27506.0min: [2024-01-03 18:37:49,949][model8_pretrain.py][INFO] Epoch:[0/2](237000/4588595) loss:2.955 lr:0.0000100 epoch_Time:27505.0min: [2024-01-03 18:37:49,949][model8_pretrain.py][INFO] Epoch:[0/2](237000/4588595) loss:3.079 lr:0.0000100 epoch_Time:27505.0min: [2024-01-03 18:37:49,949][model8_pretrain.py][INFO] Epoch:[0/2](237000/4588595) loss:2.950 lr:0.0000100 epoch_Time:27505.0min: [2024-01-03 18:37:49,949][model8_pretrain.py][INFO] Epoch:[0/2](237000/4588595) loss:3.211 lr:0.0000100 epoch_Time:27505.0min: [2024-01-03 18:37:49,949][model8_pretrain.py][INFO] Epoch:[0/2](237000/4588595) loss:2.166 lr:0.0000100 epoch_Time:27505.0min: [2024-01-03 18:37:49,949][model8_pretrain.py][INFO] 
Epoch:[0/2](237000/4588595) loss:2.339 lr:0.0000100 epoch_Time:27505.0min: [2024-01-03 18:37:49,949][model8_pretrain.py][INFO] Epoch:[0/2](237000/4588595) loss:3.059 lr:0.0000100 epoch_Time:27505.0min: [2024-01-03 18:37:49,949][model8_pretrain.py][INFO] Epoch:[0/2](237000/4588595) loss:3.028 lr:0.0000100 epoch_Time:27505.0min: [2024-01-03 18:38:26,902][model8_pretrain.py][INFO] Epoch:[0/2](237100/4588595) loss:2.990 lr:0.0000100 epoch_Time:27505.0min: [2024-01-03 18:38:26,902][model8_pretrain.py][INFO] Epoch:[0/2](237100/4588595) loss:2.184 lr:0.0000100 epoch_Time:27505.0min: [2024-01-03 18:38:26,902][model8_pretrain.py][INFO] Epoch:[0/2](237100/4588595) loss:2.245 lr:0.0000100 epoch_Time:27505.0min: [2024-01-03 18:38:26,902][model8_pretrain.py][INFO] Epoch:[0/2](237100/4588595) loss:2.362 lr:0.0000100 epoch_Time:27505.0min: [2024-01-03 18:38:26,902][model8_pretrain.py][INFO] Epoch:[0/2](237100/4588595) loss:2.846 lr:0.0000100 epoch_Time:27505.0min: [2024-01-03 18:38:26,902][model8_pretrain.py][INFO] Epoch:[0/2](237100/4588595) loss:3.300 lr:0.0000100 epoch_Time:27505.0min: [2024-01-03 18:38:26,902][model8_pretrain.py][INFO] Epoch:[0/2](237100/4588595) loss:2.868 lr:0.0000100 epoch_Time:27505.0min: [2024-01-03 18:38:26,903][model8_pretrain.py][INFO] Epoch:[0/2](237100/4588595) loss:2.800 lr:0.0000100 epoch_Time:27505.0min: [2024-01-03 18:39:03,847][model8_pretrain.py][INFO] Epoch:[0/2](237200/4588595) loss:3.124 lr:0.0000100 epoch_Time:27504.0min: [2024-01-03 18:39:03,847][model8_pretrain.py][INFO] Epoch:[0/2](237200/4588595) loss:2.955 lr:0.0000100 epoch_Time:27504.0min: [2024-01-03 18:39:03,847][model8_pretrain.py][INFO] Epoch:[0/2](237200/4588595) loss:2.933 lr:0.0000100 epoch_Time:27504.0min: [2024-01-03 18:39:03,847][model8_pretrain.py][INFO] Epoch:[0/2](237200/4588595) loss:2.467 lr:0.0000100 epoch_Time:27504.0min: [2024-01-03 18:39:03,847][model8_pretrain.py][INFO] Epoch:[0/2](237200/4588595) loss:2.393 lr:0.0000100 epoch_Time:27504.0min: [2024-01-03 
18:39:03,847][model8_pretrain.py][INFO] Epoch:[0/2](237200/4588595) loss:2.520 lr:0.0000100 epoch_Time:27504.0min: [2024-01-03 18:39:03,847][model8_pretrain.py][INFO] Epoch:[0/2](237200/4588595) loss:2.833 lr:0.0000100 epoch_Time:27504.0min: [2024-01-03 18:39:03,847][model8_pretrain.py][INFO] Epoch:[0/2](237200/4588595) loss:3.312 lr:0.0000100 epoch_Time:27504.0min: [2024-01-03 18:39:40,766][model8_pretrain.py][INFO] Epoch:[0/2](237300/4588595) loss:2.533 lr:0.0000100 epoch_Time:27503.0min: [2024-01-03 18:39:40,766][model8_pretrain.py][INFO] Epoch:[0/2](237300/4588595) loss:3.362 lr:0.0000100 epoch_Time:27503.0min: [2024-01-03 18:39:40,766][model8_pretrain.py][INFO] Epoch:[0/2](237300/4588595) loss:2.824 lr:0.0000100 epoch_Time:27503.0min: [2024-01-03 18:39:40,766][model8_pretrain.py][INFO] Epoch:[0/2](237300/4588595) loss:3.341 lr:0.0000100 epoch_Time:27503.0min: [2024-01-03 18:39:40,766][model8_pretrain.py][INFO] Epoch:[0/2](237300/4588595) loss:2.313 lr:0.0000100 epoch_Time:27503.0min: [2024-01-03 18:39:40,766][model8_pretrain.py][INFO] Epoch:[0/2](237300/4588595) loss:3.378 lr:0.0000100 epoch_Time:27503.0min: [2024-01-03 18:39:40,766][model8_pretrain.py][INFO] Epoch:[0/2](237300/4588595) loss:2.912 lr:0.0000100 epoch_Time:27503.0min: [2024-01-03 18:39:40,766][model8_pretrain.py][INFO] Epoch:[0/2](237300/4588595) loss:2.985 lr:0.0000100 epoch_Time:27503.0min: [2024-01-03 18:40:17,720][model8_pretrain.py][INFO] Epoch:[0/2](237400/4588595) loss:3.115 lr:0.0000100 epoch_Time:27502.0min: [2024-01-03 18:40:17,720][model8_pretrain.py][INFO] Epoch:[0/2](237400/4588595) loss:2.394 lr:0.0000100 epoch_Time:27502.0min: [2024-01-03 18:40:17,720][model8_pretrain.py][INFO] Epoch:[0/2](237400/4588595) loss:2.949 lr:0.0000100 epoch_Time:27502.0min: [2024-01-03 18:40:17,720][model8_pretrain.py][INFO] Epoch:[0/2](237400/4588595) loss:3.178 lr:0.0000100 epoch_Time:27502.0min: [2024-01-03 18:40:17,720][model8_pretrain.py][INFO] Epoch:[0/2](237400/4588595) loss:2.902 lr:0.0000100 
epoch_Time:27502.0min: [2024-01-03 18:40:17,720][model8_pretrain.py][INFO] Epoch:[0/2](237400/4588595) loss:2.743 lr:0.0000100 epoch_Time:27502.0min: [2024-01-03 18:40:17,720][model8_pretrain.py][INFO] Epoch:[0/2](237400/4588595) loss:2.775 lr:0.0000100 epoch_Time:27502.0min: [2024-01-03 18:40:17,721][model8_pretrain.py][INFO] Epoch:[0/2](237400/4588595) loss:2.623 lr:0.0000100 epoch_Time:27502.0min: [2024-01-03 18:40:54,673][model8_pretrain.py][INFO] Epoch:[0/2](237500/4588595) loss:2.788 lr:0.0000100 epoch_Time:27501.0min: [2024-01-03 18:40:54,673][model8_pretrain.py][INFO] Epoch:[0/2](237500/4588595) loss:2.540 lr:0.0000100 epoch_Time:27501.0min: [2024-01-03 18:40:54,673][model8_pretrain.py][INFO] Epoch:[0/2](237500/4588595) loss:2.409 lr:0.0000100 epoch_Time:27501.0min: [2024-01-03 18:40:54,673][model8_pretrain.py][INFO] Epoch:[0/2](237500/4588595) loss:2.423 lr:0.0000100 epoch_Time:27501.0min: [2024-01-03 18:40:54,673][model8_pretrain.py][INFO] Epoch:[0/2](237500/4588595) loss:2.948 lr:0.0000100 epoch_Time:27501.0min: [2024-01-03 18:40:54,673][model8_pretrain.py][INFO] Epoch:[0/2](237500/4588595) loss:3.110 lr:0.0000100 epoch_Time:27501.0min: [2024-01-03 18:40:54,673][model8_pretrain.py][INFO] Epoch:[0/2](237500/4588595) loss:3.167 lr:0.0000100 epoch_Time:27501.0min: [2024-01-03 18:40:54,673][model8_pretrain.py][INFO] Epoch:[0/2](237500/4588595) loss:3.222 lr:0.0000100 epoch_Time:27501.0min: [2024-01-03 18:41:41,773][model8_pretrain.py][INFO] Epoch:[0/2](237600/4588595) loss:3.109 lr:0.0000100 epoch_Time:27504.0min: [2024-01-03 18:41:41,773][model8_pretrain.py][INFO] Epoch:[0/2](237600/4588595) loss:2.702 lr:0.0000100 epoch_Time:27504.0min: [2024-01-03 18:41:41,773][model8_pretrain.py][INFO] Epoch:[0/2](237600/4588595) loss:2.379 lr:0.0000100 epoch_Time:27504.0min: [2024-01-03 18:41:41,773][model8_pretrain.py][INFO] Epoch:[0/2](237600/4588595) loss:2.595 lr:0.0000100 epoch_Time:27504.0min: [2024-01-03 18:41:41,773][model8_pretrain.py][INFO] 
Epoch:[0/2](237600/4588595) loss:2.829 lr:0.0000100 epoch_Time:27504.0min: [2024-01-03 18:41:41,773][model8_pretrain.py][INFO] Epoch:[0/2](237600/4588595) loss:3.671 lr:0.0000100 epoch_Time:27504.0min: [2024-01-03 18:41:41,773][model8_pretrain.py][INFO] Epoch:[0/2](237600/4588595) loss:2.787 lr:0.0000100 epoch_Time:27504.0min: [2024-01-03 18:41:41,773][model8_pretrain.py][INFO] Epoch:[0/2](237600/4588595) loss:2.499 lr:0.0000100 epoch_Time:27504.0min: [2024-01-03 18:42:18,712][model8_pretrain.py][INFO] Epoch:[0/2](237700/4588595) loss:2.705 lr:0.0000100 epoch_Time:27502.0min: [2024-01-03 18:42:18,712][model8_pretrain.py][INFO] Epoch:[0/2](237700/4588595) loss:2.323 lr:0.0000100 epoch_Time:27502.0min: [2024-01-03 18:42:18,712][model8_pretrain.py][INFO] Epoch:[0/2](237700/4588595) loss:2.646 lr:0.0000100 epoch_Time:27502.0min: [2024-01-03 18:42:18,712][model8_pretrain.py][INFO] Epoch:[0/2](237700/4588595) loss:3.160 lr:0.0000100 epoch_Time:27502.0min: [2024-01-03 18:42:18,712][model8_pretrain.py][INFO] Epoch:[0/2](237700/4588595) loss:3.327 lr:0.0000100 epoch_Time:27502.0min: [2024-01-03 18:42:18,712][model8_pretrain.py][INFO] Epoch:[0/2](237700/4588595) loss:2.662 lr:0.0000100 epoch_Time:27502.0min: [2024-01-03 18:42:18,712][model8_pretrain.py][INFO] Epoch:[0/2](237700/4588595) loss:3.247 lr:0.0000100 epoch_Time:27502.0min: [2024-01-03 18:42:18,712][model8_pretrain.py][INFO] Epoch:[0/2](237700/4588595) loss:3.088 lr:0.0000100 epoch_Time:27502.0min: [2024-01-03 18:42:55,650][model8_pretrain.py][INFO] Epoch:[0/2](237800/4588595) loss:3.225 lr:0.0000100 epoch_Time:27501.0min: [2024-01-03 18:42:55,650][model8_pretrain.py][INFO] Epoch:[0/2](237800/4588595) loss:3.034 lr:0.0000100 epoch_Time:27501.0min: [2024-01-03 18:42:55,650][model8_pretrain.py][INFO] Epoch:[0/2](237800/4588595) loss:2.824 lr:0.0000100 epoch_Time:27501.0min: [2024-01-03 18:42:55,650][model8_pretrain.py][INFO] Epoch:[0/2](237800/4588595) loss:3.033 lr:0.0000100 epoch_Time:27501.0min: [2024-01-03 
18:42:55,650][model8_pretrain.py][INFO] Epoch:[0/2](237800/4588595) loss:2.940 lr:0.0000100 epoch_Time:27501.0min: [2024-01-03 18:42:55,650][model8_pretrain.py][INFO] Epoch:[0/2](237800/4588595) loss:3.128 lr:0.0000100 epoch_Time:27501.0min: [2024-01-03 18:42:55,650][model8_pretrain.py][INFO] Epoch:[0/2](237800/4588595) loss:2.840 lr:0.0000100 epoch_Time:27501.0min: [2024-01-03 18:42:55,650][model8_pretrain.py][INFO] Epoch:[0/2](237800/4588595) loss:3.106 lr:0.0000100 epoch_Time:27501.0min: [2024-01-03 18:43:32,584][model8_pretrain.py][INFO] Epoch:[0/2](237900/4588595) loss:3.067 lr:0.0000100 epoch_Time:27501.0min: [2024-01-03 18:43:32,584][model8_pretrain.py][INFO] Epoch:[0/2](237900/4588595) loss:3.202 lr:0.0000100 epoch_Time:27501.0min: [2024-01-03 18:43:32,584][model8_pretrain.py][INFO] Epoch:[0/2](237900/4588595) loss:3.028 lr:0.0000100 epoch_Time:27501.0min: [2024-01-03 18:43:32,584][model8_pretrain.py][INFO] Epoch:[0/2](237900/4588595) loss:2.996 lr:0.0000100 epoch_Time:27501.0min: [2024-01-03 18:43:32,584][model8_pretrain.py][INFO] Epoch:[0/2](237900/4588595) loss:2.559 lr:0.0000100 epoch_Time:27501.0min: [2024-01-03 18:43:32,584][model8_pretrain.py][INFO] Epoch:[0/2](237900/4588595) loss:3.157 lr:0.0000100 epoch_Time:27501.0min: [2024-01-03 18:43:32,584][model8_pretrain.py][INFO] Epoch:[0/2](237900/4588595) loss:2.645 lr:0.0000100 epoch_Time:27501.0min: [2024-01-03 18:43:32,584][model8_pretrain.py][INFO] Epoch:[0/2](237900/4588595) loss:3.065 lr:0.0000100 epoch_Time:27501.0min: [2024-01-03 18:44:09,534][model8_pretrain.py][INFO] Epoch:[0/2](238000/4588595) loss:2.453 lr:0.0000100 epoch_Time:27499.0min: [2024-01-03 18:44:09,534][model8_pretrain.py][INFO] Epoch:[0/2](238000/4588595) loss:2.920 lr:0.0000100 epoch_Time:27499.0min: [2024-01-03 18:44:09,534][model8_pretrain.py][INFO] Epoch:[0/2](238000/4588595) loss:3.167 lr:0.0000100 epoch_Time:27499.0min: [2024-01-03 18:44:09,534][model8_pretrain.py][INFO] Epoch:[0/2](238000/4588595) loss:3.474 lr:0.0000100 
epoch_Time:27499.0min: [2024-01-03 18:44:09,534][model8_pretrain.py][INFO] Epoch:[0/2](238000/4588595) loss:2.828 lr:0.0000100 epoch_Time:27499.0min: [2024-01-03 18:44:09,534][model8_pretrain.py][INFO] Epoch:[0/2](238000/4588595) loss:3.120 lr:0.0000100 epoch_Time:27499.0min: [2024-01-03 18:44:09,534][model8_pretrain.py][INFO] Epoch:[0/2](238000/4588595) loss:3.168 lr:0.0000100 epoch_Time:27499.0min: [2024-01-03 18:44:09,534][model8_pretrain.py][INFO] Epoch:[0/2](238000/4588595) loss:2.821 lr:0.0000100 epoch_Time:27499.0min: [2024-01-03 18:44:46,474][model8_pretrain.py][INFO] Epoch:[0/2](238100/4588595) loss:2.819 lr:0.0000100 epoch_Time:27499.0min: [2024-01-03 18:44:46,474][model8_pretrain.py][INFO] Epoch:[0/2](238100/4588595) loss:2.927 lr:0.0000100 epoch_Time:27499.0min: [2024-01-03 18:44:46,474][model8_pretrain.py][INFO] Epoch:[0/2](238100/4588595) loss:2.929 lr:0.0000100 epoch_Time:27499.0min: [2024-01-03 18:44:46,474][model8_pretrain.py][INFO] Epoch:[0/2](238100/4588595) loss:2.900 lr:0.0000100 epoch_Time:27499.0min: [2024-01-03 18:44:46,474][model8_pretrain.py][INFO] Epoch:[0/2](238100/4588595) loss:3.037 lr:0.0000100 epoch_Time:27499.0min: [2024-01-03 18:44:46,474][model8_pretrain.py][INFO] Epoch:[0/2](238100/4588595) loss:3.173 lr:0.0000100 epoch_Time:27499.0min: [2024-01-03 18:44:46,474][model8_pretrain.py][INFO] Epoch:[0/2](238100/4588595) loss:2.588 lr:0.0000100 epoch_Time:27499.0min: [2024-01-03 18:44:46,474][model8_pretrain.py][INFO] Epoch:[0/2](238100/4588595) loss:2.053 lr:0.0000100 epoch_Time:27499.0min: [2024-01-03 18:45:23,419][model8_pretrain.py][INFO] Epoch:[0/2](238200/4588595) loss:3.113 lr:0.0000100 epoch_Time:27498.0min: [2024-01-03 18:45:23,419][model8_pretrain.py][INFO] Epoch:[0/2](238200/4588595) loss:2.979 lr:0.0000100 epoch_Time:27498.0min: [2024-01-03 18:45:23,419][model8_pretrain.py][INFO] Epoch:[0/2](238200/4588595) loss:3.280 lr:0.0000100 epoch_Time:27498.0min: [2024-01-03 18:45:23,419][model8_pretrain.py][INFO] 
Epoch:[0/2](238200/4588595) loss:3.251 lr:0.0000100 epoch_Time:27498.0min: [2024-01-03 18:45:23,419][model8_pretrain.py][INFO] Epoch:[0/2](238200/4588595) loss:2.559 lr:0.0000100 epoch_Time:27498.0min: [2024-01-03 18:45:23,419][model8_pretrain.py][INFO] Epoch:[0/2](238200/4588595) loss:2.959 lr:0.0000100 epoch_Time:27498.0min: [2024-01-03 18:45:23,420][model8_pretrain.py][INFO] Epoch:[0/2](238200/4588595) loss:3.182 lr:0.0000100 epoch_Time:27498.0min: [2024-01-03 18:45:23,420][model8_pretrain.py][INFO] Epoch:[0/2](238200/4588595) loss:2.874 lr:0.0000100 epoch_Time:27498.0min: [2024-01-03 18:46:00,372][model8_pretrain.py][INFO] Epoch:[0/2](238300/4588595) loss:2.890 lr:0.0000100 epoch_Time:27496.0min: [2024-01-03 18:46:00,372][model8_pretrain.py][INFO] Epoch:[0/2](238300/4588595) loss:2.998 lr:0.0000100 epoch_Time:27496.0min: [2024-01-03 18:46:00,372][model8_pretrain.py][INFO] Epoch:[0/2](238300/4588595) loss:3.687 lr:0.0000100 epoch_Time:27496.0min: [2024-01-03 18:46:00,372][model8_pretrain.py][INFO] Epoch:[0/2](238300/4588595) loss:3.543 lr:0.0000100 epoch_Time:27496.0min: [2024-01-03 18:46:00,372][model8_pretrain.py][INFO] Epoch:[0/2](238300/4588595) loss:2.965 lr:0.0000100 epoch_Time:27496.0min: [2024-01-03 18:46:00,372][model8_pretrain.py][INFO] Epoch:[0/2](238300/4588595) loss:2.315 lr:0.0000100 epoch_Time:27496.0min: [2024-01-03 18:46:00,372][model8_pretrain.py][INFO] Epoch:[0/2](238300/4588595) loss:3.323 lr:0.0000100 epoch_Time:27496.0min: [2024-01-03 18:46:00,372][model8_pretrain.py][INFO] Epoch:[0/2](238300/4588595) loss:3.049 lr:0.0000100 epoch_Time:27496.0min: [2024-01-03 18:46:47,609][model8_pretrain.py][INFO] Epoch:[0/2](238400/4588595) loss:2.577 lr:0.0000100 epoch_Time:27498.0min: [2024-01-03 18:46:47,609][model8_pretrain.py][INFO] Epoch:[0/2](238400/4588595) loss:3.055 lr:0.0000100 epoch_Time:27498.0min: [2024-01-03 18:46:47,609][model8_pretrain.py][INFO] Epoch:[0/2](238400/4588595) loss:2.467 lr:0.0000100 epoch_Time:27499.0min: [2024-01-03 
18:46:47,609][model8_pretrain.py][INFO] Epoch:[0/2](238400/4588595) loss:3.353 lr:0.0000100 epoch_Time:27498.0min: [2024-01-03 18:46:47,609][model8_pretrain.py][INFO] Epoch:[0/2](238400/4588595) loss:3.371 lr:0.0000100 epoch_Time:27499.0min: [2024-01-03 18:46:47,609][model8_pretrain.py][INFO] Epoch:[0/2](238400/4588595) loss:2.787 lr:0.0000100 epoch_Time:27498.0min: [2024-01-03 18:46:47,609][model8_pretrain.py][INFO] Epoch:[0/2](238400/4588595) loss:3.160 lr:0.0000100 epoch_Time:27498.0min: [2024-01-03 18:46:47,609][model8_pretrain.py][INFO] Epoch:[0/2](238400/4588595) loss:3.090 lr:0.0000100 epoch_Time:27498.0min: [2024-01-03 18:47:24,535][model8_pretrain.py][INFO] Epoch:[0/2](238500/4588595) loss:2.998 lr:0.0000100 epoch_Time:27498.0min: [2024-01-03 18:47:24,535][model8_pretrain.py][INFO] Epoch:[0/2](238500/4588595) loss:2.906 lr:0.0000100 epoch_Time:27498.0min: [2024-01-03 18:47:24,535][model8_pretrain.py][INFO] Epoch:[0/2](238500/4588595) loss:3.291 lr:0.0000100 epoch_Time:27498.0min: [2024-01-03 18:47:24,535][model8_pretrain.py][INFO] Epoch:[0/2](238500/4588595) loss:3.000 lr:0.0000100 epoch_Time:27498.0min: [2024-01-03 18:47:24,535][model8_pretrain.py][INFO] Epoch:[0/2](238500/4588595) loss:2.328 lr:0.0000100 epoch_Time:27498.0min: [2024-01-03 18:47:24,535][model8_pretrain.py][INFO] Epoch:[0/2](238500/4588595) loss:3.086 lr:0.0000100 epoch_Time:27498.0min: [2024-01-03 18:47:24,535][model8_pretrain.py][INFO] Epoch:[0/2](238500/4588595) loss:2.618 lr:0.0000100 epoch_Time:27498.0min: [2024-01-03 18:47:24,536][model8_pretrain.py][INFO] Epoch:[0/2](238500/4588595) loss:3.126 lr:0.0000100 epoch_Time:27498.0min: [2024-01-03 18:48:01,477][model8_pretrain.py][INFO] Epoch:[0/2](238600/4588595) loss:2.785 lr:0.0000100 epoch_Time:27497.0min: [2024-01-03 18:48:01,478][model8_pretrain.py][INFO] Epoch:[0/2](238600/4588595) loss:3.575 lr:0.0000100 epoch_Time:27497.0min: [2024-01-03 18:48:01,478][model8_pretrain.py][INFO] Epoch:[0/2](238600/4588595) loss:3.317 lr:0.0000100 
epoch_Time:27497.0min: [2024-01-03 18:48:01,478][model8_pretrain.py][INFO] Epoch:[0/2](238600/4588595) loss:2.686 lr:0.0000100 epoch_Time:27497.0min: [2024-01-03 18:48:01,478][model8_pretrain.py][INFO] Epoch:[0/2](238600/4588595) loss:3.079 lr:0.0000100 epoch_Time:27497.0min: [2024-01-03 18:48:01,478][model8_pretrain.py][INFO] Epoch:[0/2](238600/4588595) loss:2.872 lr:0.0000100 epoch_Time:27497.0min: [2024-01-03 18:48:01,478][model8_pretrain.py][INFO] Epoch:[0/2](238600/4588595) loss:2.875 lr:0.0000100 epoch_Time:27497.0min: [2024-01-03 18:48:01,478][model8_pretrain.py][INFO] Epoch:[0/2](238600/4588595) loss:3.281 lr:0.0000100 epoch_Time:27497.0min: [2024-01-03 18:48:38,424][model8_pretrain.py][INFO] Epoch:[0/2](238700/4588595) loss:2.744 lr:0.0000100 epoch_Time:27496.0min: [2024-01-03 18:48:38,424][model8_pretrain.py][INFO] Epoch:[0/2](238700/4588595) loss:3.610 lr:0.0000100 epoch_Time:27496.0min: [2024-01-03 18:48:38,424][model8_pretrain.py][INFO] Epoch:[0/2](238700/4588595) loss:2.641 lr:0.0000100 epoch_Time:27496.0min: [2024-01-03 18:48:38,424][model8_pretrain.py][INFO] Epoch:[0/2](238700/4588595) loss:2.882 lr:0.0000100 epoch_Time:27496.0min: [2024-01-03 18:48:38,424][model8_pretrain.py][INFO] Epoch:[0/2](238700/4588595) loss:2.486 lr:0.0000100 epoch_Time:27496.0min: [2024-01-03 18:48:38,424][model8_pretrain.py][INFO] Epoch:[0/2](238700/4588595) loss:2.793 lr:0.0000100 epoch_Time:27496.0min: [2024-01-03 18:48:38,424][model8_pretrain.py][INFO] Epoch:[0/2](238700/4588595) loss:2.543 lr:0.0000100 epoch_Time:27496.0min: [2024-01-03 18:48:38,424][model8_pretrain.py][INFO] Epoch:[0/2](238700/4588595) loss:2.475 lr:0.0000100 epoch_Time:27496.0min: [2024-01-03 18:49:15,371][model8_pretrain.py][INFO] Epoch:[0/2](238800/4588595) loss:2.740 lr:0.0000100 epoch_Time:27495.0min: [2024-01-03 18:49:15,371][model8_pretrain.py][INFO] Epoch:[0/2](238800/4588595) loss:3.075 lr:0.0000100 epoch_Time:27495.0min: [2024-01-03 18:49:15,371][model8_pretrain.py][INFO] 
Epoch:[0/2](238800/4588595) loss:2.682 lr:0.0000100 epoch_Time:27495.0min: [2024-01-03 18:49:15,371][model8_pretrain.py][INFO] Epoch:[0/2](238800/4588595) loss:2.870 lr:0.0000100 epoch_Time:27495.0min: [2024-01-03 18:49:15,371][model8_pretrain.py][INFO] Epoch:[0/2](238800/4588595) loss:3.132 lr:0.0000100 epoch_Time:27495.0min: [2024-01-03 18:49:15,371][model8_pretrain.py][INFO] Epoch:[0/2](238800/4588595) loss:2.706 lr:0.0000100 epoch_Time:27495.0min: [2024-01-03 18:49:15,372][model8_pretrain.py][INFO] Epoch:[0/2](238800/4588595) loss:3.021 lr:0.0000100 epoch_Time:27495.0min: [2024-01-03 18:49:15,372][model8_pretrain.py][INFO] Epoch:[0/2](238800/4588595) loss:2.720 lr:0.0000100 epoch_Time:27495.0min: [2024-01-03 18:49:52,322][model8_pretrain.py][INFO] Epoch:[0/2](238900/4588595) loss:2.905 lr:0.0000100 epoch_Time:27494.0min: [2024-01-03 18:49:52,322][model8_pretrain.py][INFO] Epoch:[0/2](238900/4588595) loss:2.984 lr:0.0000100 epoch_Time:27494.0min: [2024-01-03 18:49:52,322][model8_pretrain.py][INFO] Epoch:[0/2](238900/4588595) loss:2.846 lr:0.0000100 epoch_Time:27494.0min: [2024-01-03 18:49:52,322][model8_pretrain.py][INFO] Epoch:[0/2](238900/4588595) loss:2.940 lr:0.0000100 epoch_Time:27494.0min: [2024-01-03 18:49:52,322][model8_pretrain.py][INFO] Epoch:[0/2](238900/4588595) loss:3.465 lr:0.0000100 epoch_Time:27494.0min: [2024-01-03 18:49:52,322][model8_pretrain.py][INFO] Epoch:[0/2](238900/4588595) loss:2.750 lr:0.0000100 epoch_Time:27494.0min: [2024-01-03 18:49:52,322][model8_pretrain.py][INFO] Epoch:[0/2](238900/4588595) loss:3.441 lr:0.0000100 epoch_Time:27494.0min: [2024-01-03 18:49:52,322][model8_pretrain.py][INFO] Epoch:[0/2](238900/4588595) loss:3.096 lr:0.0000100 epoch_Time:27494.0min: [2024-01-03 18:50:29,266][model8_pretrain.py][INFO] Epoch:[0/2](239000/4588595) loss:2.792 lr:0.0000100 epoch_Time:27493.0min: [2024-01-03 18:50:29,266][model8_pretrain.py][INFO] Epoch:[0/2](239000/4588595) loss:3.209 lr:0.0000100 epoch_Time:27493.0min: [2024-01-03 
18:50:29,266][model8_pretrain.py][INFO] Epoch:[0/2](239000/4588595) loss:2.781 lr:0.0000100 epoch_Time:27493.0min: [2024-01-03 18:50:29,266][model8_pretrain.py][INFO] Epoch:[0/2](239000/4588595) loss:2.866 lr:0.0000100 epoch_Time:27493.0min: [2024-01-03 18:50:29,266][model8_pretrain.py][INFO] Epoch:[0/2](239000/4588595) loss:3.266 lr:0.0000100 epoch_Time:27493.0min: [2024-01-03 18:50:29,266][model8_pretrain.py][INFO] Epoch:[0/2](239000/4588595) loss:2.742 lr:0.0000100 epoch_Time:27493.0min: [2024-01-03 18:50:29,266][model8_pretrain.py][INFO] Epoch:[0/2](239000/4588595) loss:3.055 lr:0.0000100 epoch_Time:27493.0min: [2024-01-03 18:50:29,266][model8_pretrain.py][INFO] Epoch:[0/2](239000/4588595) loss:3.125 lr:0.0000100 epoch_Time:27493.0min: [2024-01-03 18:51:06,203][model8_pretrain.py][INFO] Epoch:[0/2](239100/4588595) loss:3.247 lr:0.0000100 epoch_Time:27492.0min: [2024-01-03 18:51:06,203][model8_pretrain.py][INFO] Epoch:[0/2](239100/4588595) loss:3.109 lr:0.0000100 epoch_Time:27492.0min: [2024-01-03 18:51:06,203][model8_pretrain.py][INFO] Epoch:[0/2](239100/4588595) loss:2.594 lr:0.0000100 epoch_Time:27492.0min: [2024-01-03 18:51:06,203][model8_pretrain.py][INFO] Epoch:[0/2](239100/4588595) loss:3.099 lr:0.0000100 epoch_Time:27492.0min: [2024-01-03 18:51:06,203][model8_pretrain.py][INFO] Epoch:[0/2](239100/4588595) loss:3.354 lr:0.0000100 epoch_Time:27492.0min: [2024-01-03 18:51:06,203][model8_pretrain.py][INFO] Epoch:[0/2](239100/4588595) loss:1.841 lr:0.0000100 epoch_Time:27492.0min: [2024-01-03 18:51:06,203][model8_pretrain.py][INFO] Epoch:[0/2](239100/4588595) loss:2.662 lr:0.0000100 epoch_Time:27492.0min: [2024-01-03 18:51:06,204][model8_pretrain.py][INFO] Epoch:[0/2](239100/4588595) loss:3.205 lr:0.0000100 epoch_Time:27492.0min: [2024-01-03 18:51:53,401][model8_pretrain.py][INFO] Epoch:[0/2](239200/4588595) loss:2.447 lr:0.0000100 epoch_Time:27494.0min: [2024-01-03 18:51:53,401][model8_pretrain.py][INFO] Epoch:[0/2](239200/4588595) loss:3.270 lr:0.0000100 
epoch_Time:27494.0min: [2024-01-03 18:51:53,401][model8_pretrain.py][INFO] Epoch:[0/2](239200/4588595) loss:2.771 lr:0.0000100 epoch_Time:27494.0min: [2024-01-03 18:51:53,401][model8_pretrain.py][INFO] Epoch:[0/2](239200/4588595) loss:2.902 lr:0.0000100 epoch_Time:27494.0min: [2024-01-03 18:51:53,401][model8_pretrain.py][INFO] Epoch:[0/2](239200/4588595) loss:2.844 lr:0.0000100 epoch_Time:27494.0min: [2024-01-03 18:51:53,401][model8_pretrain.py][INFO] Epoch:[0/2](239200/4588595) loss:3.089 lr:0.0000100 epoch_Time:27494.0min: [2024-01-03 18:51:53,401][model8_pretrain.py][INFO] Epoch:[0/2](239200/4588595) loss:2.816 lr:0.0000100 epoch_Time:27494.0min: [2024-01-03 18:51:53,401][model8_pretrain.py][INFO] Epoch:[0/2](239200/4588595) loss:2.907 lr:0.0000100 epoch_Time:27494.0min: [2024-01-03 18:52:30,329][model8_pretrain.py][INFO] Epoch:[0/2](239300/4588595) loss:2.912 lr:0.0000100 epoch_Time:27494.0min: [2024-01-03 18:52:30,329][model8_pretrain.py][INFO] Epoch:[0/2](239300/4588595) loss:3.069 lr:0.0000100 epoch_Time:27494.0min: [2024-01-03 18:52:30,329][model8_pretrain.py][INFO] Epoch:[0/2](239300/4588595) loss:2.999 lr:0.0000100 epoch_Time:27494.0min: [2024-01-03 18:52:30,329][model8_pretrain.py][INFO] Epoch:[0/2](239300/4588595) loss:3.047 lr:0.0000100 epoch_Time:27494.0min: [2024-01-03 18:52:30,329][model8_pretrain.py][INFO] Epoch:[0/2](239300/4588595) loss:2.426 lr:0.0000100 epoch_Time:27494.0min: [2024-01-03 18:52:30,330][model8_pretrain.py][INFO] Epoch:[0/2](239300/4588595) loss:3.240 lr:0.0000100 epoch_Time:27494.0min: [2024-01-03 18:52:30,330][model8_pretrain.py][INFO] Epoch:[0/2](239300/4588595) loss:2.629 lr:0.0000100 epoch_Time:27494.0min: [2024-01-03 18:52:30,330][model8_pretrain.py][INFO] Epoch:[0/2](239300/4588595) loss:2.607 lr:0.0000100 epoch_Time:27494.0min: [2024-01-03 18:53:07,277][model8_pretrain.py][INFO] Epoch:[0/2](239400/4588595) loss:2.820 lr:0.0000100 epoch_Time:27492.0min: [2024-01-03 18:53:07,277][model8_pretrain.py][INFO] 
Epoch:[0/2](239400/4588595) loss:2.330 lr:0.0000100 epoch_Time:27492.0min: [2024-01-03 18:53:07,277][model8_pretrain.py][INFO] Epoch:[0/2](239400/4588595) loss:2.819 lr:0.0000100 epoch_Time:27492.0min: [2024-01-03 18:53:07,277][model8_pretrain.py][INFO] Epoch:[0/2](239400/4588595) loss:3.015 lr:0.0000100 epoch_Time:27492.0min: [2024-01-03 18:53:07,277][model8_pretrain.py][INFO] Epoch:[0/2](239400/4588595) loss:3.597 lr:0.0000100 epoch_Time:27492.0min: [2024-01-03 18:53:07,277][model8_pretrain.py][INFO] Epoch:[0/2](239400/4588595) loss:3.363 lr:0.0000100 epoch_Time:27492.0min: [2024-01-03 18:53:07,277][model8_pretrain.py][INFO] Epoch:[0/2](239400/4588595) loss:3.093 lr:0.0000100 epoch_Time:27492.0min: [2024-01-03 18:53:07,277][model8_pretrain.py][INFO] Epoch:[0/2](239400/4588595) loss:3.128 lr:0.0000100 epoch_Time:27492.0min: [2024-01-03 18:53:44,218][model8_pretrain.py][INFO] Epoch:[0/2](239500/4588595) loss:3.489 lr:0.0000100 epoch_Time:27492.0min: [2024-01-03 18:53:44,218][model8_pretrain.py][INFO] Epoch:[0/2](239500/4588595) loss:3.303 lr:0.0000100 epoch_Time:27492.0min: [2024-01-03 18:53:44,218][model8_pretrain.py][INFO] Epoch:[0/2](239500/4588595) loss:2.771 lr:0.0000100 epoch_Time:27492.0min: [2024-01-03 18:53:44,218][model8_pretrain.py][INFO] Epoch:[0/2](239500/4588595) loss:3.319 lr:0.0000100 epoch_Time:27492.0min: [2024-01-03 18:53:44,218][model8_pretrain.py][INFO] Epoch:[0/2](239500/4588595) loss:3.016 lr:0.0000100 epoch_Time:27492.0min: [2024-01-03 18:53:44,218][model8_pretrain.py][INFO] Epoch:[0/2](239500/4588595) loss:2.589 lr:0.0000100 epoch_Time:27492.0min: [2024-01-03 18:53:44,218][model8_pretrain.py][INFO] Epoch:[0/2](239500/4588595) loss:2.655 lr:0.0000100 epoch_Time:27492.0min: [2024-01-03 18:53:44,226][model8_pretrain.py][INFO] Epoch:[0/2](239500/4588595) loss:2.663 lr:0.0000100 epoch_Time:27492.0min: [2024-01-03 18:54:21,172][model8_pretrain.py][INFO] Epoch:[0/2](239600/4588595) loss:2.156 lr:0.0000100 epoch_Time:27491.0min: [2024-01-03 
18:54:21,172][model8_pretrain.py][INFO] Epoch:[0/2](239600/4588595) loss:2.681 lr:0.0000100 epoch_Time:27491.0min: [2024-01-03 18:54:21,172][model8_pretrain.py][INFO] Epoch:[0/2](239600/4588595) loss:3.028 lr:0.0000100 epoch_Time:27491.0min: [2024-01-03 18:54:21,172][model8_pretrain.py][INFO] Epoch:[0/2](239600/4588595) loss:3.313 lr:0.0000100 epoch_Time:27491.0min: [2024-01-03 18:54:21,172][model8_pretrain.py][INFO] Epoch:[0/2](239600/4588595) loss:2.463 lr:0.0000100 epoch_Time:27491.0min: [2024-01-03 18:54:21,172][model8_pretrain.py][INFO] Epoch:[0/2](239600/4588595) loss:3.419 lr:0.0000100 epoch_Time:27491.0min: [2024-01-03 18:54:21,172][model8_pretrain.py][INFO] Epoch:[0/2](239600/4588595) loss:3.168 lr:0.0000100 epoch_Time:27491.0min: [2024-01-03 18:54:21,172][model8_pretrain.py][INFO] Epoch:[0/2](239600/4588595) loss:3.136 lr:0.0000100 epoch_Time:27491.0min: [2024-01-03 18:54:58,111][model8_pretrain.py][INFO] Epoch:[0/2](239700/4588595) loss:3.068 lr:0.0000100 epoch_Time:27489.0min: [2024-01-03 18:54:58,111][model8_pretrain.py][INFO] Epoch:[0/2](239700/4588595) loss:2.469 lr:0.0000100 epoch_Time:27489.0min: [2024-01-03 18:54:58,111][model8_pretrain.py][INFO] Epoch:[0/2](239700/4588595) loss:2.296 lr:0.0000100 epoch_Time:27489.0min: [2024-01-03 18:54:58,111][model8_pretrain.py][INFO] Epoch:[0/2](239700/4588595) loss:2.276 lr:0.0000100 epoch_Time:27489.0min: [2024-01-03 18:54:58,111][model8_pretrain.py][INFO] Epoch:[0/2](239700/4588595) loss:2.607 lr:0.0000100 epoch_Time:27490.0min: [2024-01-03 18:54:58,111][model8_pretrain.py][INFO] Epoch:[0/2](239700/4588595) loss:3.125 lr:0.0000100 epoch_Time:27489.0min: [2024-01-03 18:54:58,111][model8_pretrain.py][INFO] Epoch:[0/2](239700/4588595) loss:2.986 lr:0.0000100 epoch_Time:27489.0min: [2024-01-03 18:54:58,111][model8_pretrain.py][INFO] Epoch:[0/2](239700/4588595) loss:2.834 lr:0.0000100 epoch_Time:27489.0min: [2024-01-03 18:55:35,037][model8_pretrain.py][INFO] Epoch:[0/2](239800/4588595) loss:3.035 lr:0.0000100 
epoch_Time:27489.0min: [2024-01-03 18:55:35,037][model8_pretrain.py][INFO] Epoch:[0/2](239800/4588595) loss:3.104 lr:0.0000100 epoch_Time:27489.0min: [2024-01-03 18:55:35,037][model8_pretrain.py][INFO] Epoch:[0/2](239800/4588595) loss:3.059 lr:0.0000100 epoch_Time:27489.0min: [2024-01-03 18:55:35,037][model8_pretrain.py][INFO] Epoch:[0/2](239800/4588595) loss:2.939 lr:0.0000100 epoch_Time:27489.0min: [2024-01-03 18:55:35,037][model8_pretrain.py][INFO] Epoch:[0/2](239800/4588595) loss:3.109 lr:0.0000100 epoch_Time:27489.0min: [2024-01-03 18:55:35,037][model8_pretrain.py][INFO] Epoch:[0/2](239800/4588595) loss:3.204 lr:0.0000100 epoch_Time:27489.0min: [2024-01-03 18:55:35,037][model8_pretrain.py][INFO] Epoch:[0/2](239800/4588595) loss:2.667 lr:0.0000100 epoch_Time:27489.0min: [2024-01-03 18:55:35,037][model8_pretrain.py][INFO] Epoch:[0/2](239800/4588595) loss:2.909 lr:0.0000100 epoch_Time:27489.0min: [2024-01-03 18:56:11,971][model8_pretrain.py][INFO] Epoch:[0/2](239900/4588595) loss:3.054 lr:0.0000100 epoch_Time:27488.0min: [2024-01-03 18:56:11,971][model8_pretrain.py][INFO] Epoch:[0/2](239900/4588595) loss:2.846 lr:0.0000100 epoch_Time:27488.0min: [2024-01-03 18:56:11,971][model8_pretrain.py][INFO] Epoch:[0/2](239900/4588595) loss:2.181 lr:0.0000100 epoch_Time:27488.0min: [2024-01-03 18:56:11,971][model8_pretrain.py][INFO] Epoch:[0/2](239900/4588595) loss:3.130 lr:0.0000100 epoch_Time:27488.0min: [2024-01-03 18:56:11,971][model8_pretrain.py][INFO] Epoch:[0/2](239900/4588595) loss:3.181 lr:0.0000100 epoch_Time:27488.0min: [2024-01-03 18:56:11,971][model8_pretrain.py][INFO] Epoch:[0/2](239900/4588595) loss:2.111 lr:0.0000100 epoch_Time:27488.0min: [2024-01-03 18:56:11,971][model8_pretrain.py][INFO] Epoch:[0/2](239900/4588595) loss:3.003 lr:0.0000100 epoch_Time:27488.0min: [2024-01-03 18:56:11,971][model8_pretrain.py][INFO] Epoch:[0/2](239900/4588595) loss:2.421 lr:0.0000100 epoch_Time:27488.0min: [2024-01-03 18:56:59,243][model8_pretrain.py][INFO] 
Epoch:[0/2](240000/4588595) loss:3.128 lr:0.0000100 epoch_Time:27490.0min: [2024-01-03 18:56:59,243][model8_pretrain.py][INFO] Epoch:[0/2](240000/4588595) loss:2.854 lr:0.0000100 epoch_Time:27490.0min: [2024-01-03 18:56:59,243][model8_pretrain.py][INFO] Epoch:[0/2](240000/4588595) loss:3.062 lr:0.0000100 epoch_Time:27490.0min: [2024-01-03 18:56:59,244][model8_pretrain.py][INFO] Epoch:[0/2](240000/4588595) loss:3.031 lr:0.0000100 epoch_Time:27490.0min: [2024-01-03 18:56:59,244][model8_pretrain.py][INFO] Epoch:[0/2](240000/4588595) loss:3.198 lr:0.0000100 epoch_Time:27490.0min: [2024-01-03 18:56:59,244][model8_pretrain.py][INFO] Epoch:[0/2](240000/4588595) loss:3.298 lr:0.0000100 epoch_Time:27490.0min: [2024-01-03 18:56:59,244][model8_pretrain.py][INFO] Epoch:[0/2](240000/4588595) loss:3.024 lr:0.0000100 epoch_Time:27490.0min: [2024-01-03 18:56:59,244][model8_pretrain.py][INFO] Epoch:[0/2](240000/4588595) loss:2.322 lr:0.0000100 epoch_Time:27490.0min: [2024-01-03 18:57:36,172][model8_pretrain.py][INFO] Epoch:[0/2](240100/4588595) loss:3.544 lr:0.0000100 epoch_Time:27490.0min: [2024-01-03 18:57:36,172][model8_pretrain.py][INFO] Epoch:[0/2](240100/4588595) loss:3.131 lr:0.0000100 epoch_Time:27490.0min: [2024-01-03 18:57:36,172][model8_pretrain.py][INFO] Epoch:[0/2](240100/4588595) loss:3.686 lr:0.0000100 epoch_Time:27490.0min: [2024-01-03 18:57:36,172][model8_pretrain.py][INFO] Epoch:[0/2](240100/4588595) loss:2.854 lr:0.0000100 epoch_Time:27490.0min: [2024-01-03 18:57:36,172][model8_pretrain.py][INFO] Epoch:[0/2](240100/4588595) loss:2.600 lr:0.0000100 epoch_Time:27490.0min: [2024-01-03 18:57:36,172][model8_pretrain.py][INFO] Epoch:[0/2](240100/4588595) loss:3.153 lr:0.0000100 epoch_Time:27490.0min: [2024-01-03 18:57:36,172][model8_pretrain.py][INFO] Epoch:[0/2](240100/4588595) loss:2.991 lr:0.0000100 epoch_Time:27490.0min: [2024-01-03 18:57:36,172][model8_pretrain.py][INFO] Epoch:[0/2](240100/4588595) loss:2.664 lr:0.0000100 epoch_Time:27490.0min: [2024-01-03 
18:58:13,115][model8_pretrain.py][INFO] Epoch:[0/2](240200/4588595) loss:2.811 lr:0.0000100 epoch_Time:27488.0min: [2024-01-03 18:58:13,115][model8_pretrain.py][INFO] Epoch:[0/2](240200/4588595) loss:2.898 lr:0.0000100 epoch_Time:27488.0min: [2024-01-03 18:58:13,116][model8_pretrain.py][INFO] Epoch:[0/2](240200/4588595) loss:3.287 lr:0.0000100 epoch_Time:27488.0min: [2024-01-03 18:58:13,116][model8_pretrain.py][INFO] Epoch:[0/2](240200/4588595) loss:2.742 lr:0.0000100 epoch_Time:27488.0min: [2024-01-03 18:58:13,116][model8_pretrain.py][INFO] Epoch:[0/2](240200/4588595) loss:3.222 lr:0.0000100 epoch_Time:27488.0min: [2024-01-03 18:58:13,116][model8_pretrain.py][INFO] Epoch:[0/2](240200/4588595) loss:2.954 lr:0.0000100 epoch_Time:27488.0min: [2024-01-03 18:58:13,116][model8_pretrain.py][INFO] Epoch:[0/2](240200/4588595) loss:3.156 lr:0.0000100 epoch_Time:27488.0min: [2024-01-03 18:58:13,116][model8_pretrain.py][INFO] Epoch:[0/2](240200/4588595) loss:3.008 lr:0.0000100 epoch_Time:27488.0min: [2024-01-03 18:58:50,061][model8_pretrain.py][INFO] Epoch:[0/2](240300/4588595) loss:2.591 lr:0.0000100 epoch_Time:27487.0min: [2024-01-03 18:58:50,061][model8_pretrain.py][INFO] Epoch:[0/2](240300/4588595) loss:2.640 lr:0.0000100 epoch_Time:27487.0min: [2024-01-03 18:58:50,061][model8_pretrain.py][INFO] Epoch:[0/2](240300/4588595) loss:2.740 lr:0.0000100 epoch_Time:27487.0min: [2024-01-03 18:58:50,061][model8_pretrain.py][INFO] Epoch:[0/2](240300/4588595) loss:3.073 lr:0.0000100 epoch_Time:27487.0min: [2024-01-03 18:58:50,061][model8_pretrain.py][INFO] Epoch:[0/2](240300/4588595) loss:3.006 lr:0.0000100 epoch_Time:27487.0min: [2024-01-03 18:58:50,061][model8_pretrain.py][INFO] Epoch:[0/2](240300/4588595) loss:2.711 lr:0.0000100 epoch_Time:27487.0min: [2024-01-03 18:58:50,061][model8_pretrain.py][INFO] Epoch:[0/2](240300/4588595) loss:3.114 lr:0.0000100 epoch_Time:27487.0min: [2024-01-03 18:58:50,061][model8_pretrain.py][INFO] Epoch:[0/2](240300/4588595) loss:2.961 lr:0.0000100 
epoch_Time:27487.0min: [2024-01-03 18:59:26,995][model8_pretrain.py][INFO] Epoch:[0/2](240400/4588595) loss:3.002 lr:0.0000100 epoch_Time:27487.0min: [2024-01-03 18:59:26,995][model8_pretrain.py][INFO] Epoch:[0/2](240400/4588595) loss:3.087 lr:0.0000100 epoch_Time:27487.0min: [2024-01-03 18:59:26,995][model8_pretrain.py][INFO] Epoch:[0/2](240400/4588595) loss:2.594 lr:0.0000100 epoch_Time:27487.0min: [2024-01-03 18:59:26,995][model8_pretrain.py][INFO] Epoch:[0/2](240400/4588595) loss:2.915 lr:0.0000100 epoch_Time:27487.0min: [2024-01-03 18:59:26,995][model8_pretrain.py][INFO] Epoch:[0/2](240400/4588595) loss:2.495 lr:0.0000100 epoch_Time:27487.0min: [2024-01-03 18:59:26,995][model8_pretrain.py][INFO] Epoch:[0/2](240400/4588595) loss:3.453 lr:0.0000100 epoch_Time:27487.0min: [2024-01-03 18:59:26,995][model8_pretrain.py][INFO] Epoch:[0/2](240400/4588595) loss:3.303 lr:0.0000100 epoch_Time:27487.0min: [2024-01-03 18:59:26,995][model8_pretrain.py][INFO] Epoch:[0/2](240400/4588595) loss:2.935 lr:0.0000100 epoch_Time:27487.0min: [2024-01-03 19:00:03,908][model8_pretrain.py][INFO] Epoch:[0/2](240500/4588595) loss:2.826 lr:0.0000100 epoch_Time:27485.0min: [2024-01-03 19:00:03,908][model8_pretrain.py][INFO] Epoch:[0/2](240500/4588595) loss:3.240 lr:0.0000100 epoch_Time:27485.0min: [2024-01-03 19:00:03,908][model8_pretrain.py][INFO] Epoch:[0/2](240500/4588595) loss:3.104 lr:0.0000100 epoch_Time:27485.0min: [2024-01-03 19:00:03,908][model8_pretrain.py][INFO] Epoch:[0/2](240500/4588595) loss:3.658 lr:0.0000100 epoch_Time:27485.0min: [2024-01-03 19:00:03,908][model8_pretrain.py][INFO] Epoch:[0/2](240500/4588595) loss:3.446 lr:0.0000100 epoch_Time:27485.0min: [2024-01-03 19:00:03,908][model8_pretrain.py][INFO] Epoch:[0/2](240500/4588595) loss:2.863 lr:0.0000100 epoch_Time:27485.0min: [2024-01-03 19:00:03,908][model8_pretrain.py][INFO] Epoch:[0/2](240500/4588595) loss:3.088 lr:0.0000100 epoch_Time:27485.0min: [2024-01-03 19:00:03,908][model8_pretrain.py][INFO] 
Epoch:[0/2](240500/4588595) loss:3.463 lr:0.0000100 epoch_Time:27485.0min: [2024-01-03 19:00:40,836][model8_pretrain.py][INFO] Epoch:[0/2](240600/4588595) loss:3.174 lr:0.0000100 epoch_Time:27485.0min: [2024-01-03 19:00:40,836][model8_pretrain.py][INFO] Epoch:[0/2](240600/4588595) loss:2.609 lr:0.0000100 epoch_Time:27485.0min: [2024-01-03 19:00:40,836][model8_pretrain.py][INFO] Epoch:[0/2](240600/4588595) loss:3.221 lr:0.0000100 epoch_Time:27485.0min: [2024-01-03 19:00:40,836][model8_pretrain.py][INFO] Epoch:[0/2](240600/4588595) loss:2.574 lr:0.0000100 epoch_Time:27485.0min: [2024-01-03 19:00:40,836][model8_pretrain.py][INFO] Epoch:[0/2](240600/4588595) loss:3.049 lr:0.0000100 epoch_Time:27485.0min: [2024-01-03 19:00:40,836][model8_pretrain.py][INFO] Epoch:[0/2](240600/4588595) loss:3.291 lr:0.0000100 epoch_Time:27485.0min: [2024-01-03 19:00:40,836][model8_pretrain.py][INFO] Epoch:[0/2](240600/4588595) loss:3.409 lr:0.0000100 epoch_Time:27485.0min: [2024-01-03 19:00:40,836][model8_pretrain.py][INFO] Epoch:[0/2](240600/4588595) loss:3.129 lr:0.0000100 epoch_Time:27485.0min: [2024-01-03 19:01:17,771][model8_pretrain.py][INFO] Epoch:[0/2](240700/4588595) loss:2.752 lr:0.0000100 epoch_Time:27484.0min: [2024-01-03 19:01:17,771][model8_pretrain.py][INFO] Epoch:[0/2](240700/4588595) loss:3.447 lr:0.0000100 epoch_Time:27484.0min: [2024-01-03 19:01:17,771][model8_pretrain.py][INFO] Epoch:[0/2](240700/4588595) loss:3.213 lr:0.0000100 epoch_Time:27484.0min: [2024-01-03 19:01:17,771][model8_pretrain.py][INFO] Epoch:[0/2](240700/4588595) loss:3.185 lr:0.0000100 epoch_Time:27484.0min: [2024-01-03 19:01:17,771][model8_pretrain.py][INFO] Epoch:[0/2](240700/4588595) loss:2.784 lr:0.0000100 epoch_Time:27484.0min: [2024-01-03 19:01:17,771][model8_pretrain.py][INFO] Epoch:[0/2](240700/4588595) loss:2.992 lr:0.0000100 epoch_Time:27484.0min: [2024-01-03 19:01:17,771][model8_pretrain.py][INFO] Epoch:[0/2](240700/4588595) loss:3.256 lr:0.0000100 epoch_Time:27484.0min: [2024-01-03 
19:01:17,771][model8_pretrain.py][INFO] Epoch:[0/2](240700/4588595) loss:2.946 lr:0.0000100 epoch_Time:27484.0min: [2024-01-03 19:02:05,113][model8_pretrain.py][INFO] Epoch:[0/2](240800/4588595) loss:3.196 lr:0.0000100 epoch_Time:27486.0min: [2024-01-03 19:02:05,113][model8_pretrain.py][INFO] Epoch:[0/2](240800/4588595) loss:2.804 lr:0.0000100 epoch_Time:27486.0min: [2024-01-03 19:02:05,113][model8_pretrain.py][INFO] Epoch:[0/2](240800/4588595) loss:3.148 lr:0.0000100 epoch_Time:27486.0min: [2024-01-03 19:02:05,113][model8_pretrain.py][INFO] Epoch:[0/2](240800/4588595) loss:3.033 lr:0.0000100 epoch_Time:27486.0min: [2024-01-03 19:02:05,113][model8_pretrain.py][INFO] Epoch:[0/2](240800/4588595) loss:3.427 lr:0.0000100 epoch_Time:27486.0min: [2024-01-03 19:02:05,113][model8_pretrain.py][INFO] Epoch:[0/2](240800/4588595) loss:3.184 lr:0.0000100 epoch_Time:27486.0min: [2024-01-03 19:02:05,113][model8_pretrain.py][INFO] Epoch:[0/2](240800/4588595) loss:2.432 lr:0.0000100 epoch_Time:27486.0min: [2024-01-03 19:02:05,113][model8_pretrain.py][INFO] Epoch:[0/2](240800/4588595) loss:3.215 lr:0.0000100 epoch_Time:27486.0min: [2024-01-03 19:02:42,052][model8_pretrain.py][INFO] Epoch:[0/2](240900/4588595) loss:3.301 lr:0.0000100 epoch_Time:27485.0min: [2024-01-03 19:02:42,052][model8_pretrain.py][INFO] Epoch:[0/2](240900/4588595) loss:2.806 lr:0.0000100 epoch_Time:27485.0min: [2024-01-03 19:02:42,052][model8_pretrain.py][INFO] Epoch:[0/2](240900/4588595) loss:3.121 lr:0.0000100 epoch_Time:27485.0min: [2024-01-03 19:02:42,052][model8_pretrain.py][INFO] Epoch:[0/2](240900/4588595) loss:2.920 lr:0.0000100 epoch_Time:27485.0min: [2024-01-03 19:02:42,052][model8_pretrain.py][INFO] Epoch:[0/2](240900/4588595) loss:3.542 lr:0.0000100 epoch_Time:27485.0min: [2024-01-03 19:02:42,052][model8_pretrain.py][INFO] Epoch:[0/2](240900/4588595) loss:3.201 lr:0.0000100 epoch_Time:27485.0min: [2024-01-03 19:02:42,052][model8_pretrain.py][INFO] Epoch:[0/2](240900/4588595) loss:2.971 lr:0.0000100 
epoch_Time:27485.0min: [2024-01-03 19:02:42,052][model8_pretrain.py][INFO] Epoch:[0/2](240900/4588595) loss:2.401 lr:0.0000100 epoch_Time:27485.0min: [2024-01-03 19:03:19,011][model8_pretrain.py][INFO] Epoch:[0/2](241000/4588595) loss:3.173 lr:0.0000100 epoch_Time:27484.0min: [2024-01-03 19:03:19,011][model8_pretrain.py][INFO] Epoch:[0/2](241000/4588595) loss:3.085 lr:0.0000100 epoch_Time:27484.0min: [2024-01-03 19:03:19,011][model8_pretrain.py][INFO] Epoch:[0/2](241000/4588595) loss:2.635 lr:0.0000100 epoch_Time:27484.0min: [2024-01-03 19:03:19,011][model8_pretrain.py][INFO] Epoch:[0/2](241000/4588595) loss:3.112 lr:0.0000100 epoch_Time:27484.0min: [2024-01-03 19:03:19,011][model8_pretrain.py][INFO] Epoch:[0/2](241000/4588595) loss:3.195 lr:0.0000100 epoch_Time:27484.0min: [2024-01-03 19:03:19,011][model8_pretrain.py][INFO] Epoch:[0/2](241000/4588595) loss:3.056 lr:0.0000100 epoch_Time:27484.0min: [2024-01-03 19:03:19,011][model8_pretrain.py][INFO] Epoch:[0/2](241000/4588595) loss:3.146 lr:0.0000100 epoch_Time:27484.0min: [2024-01-03 19:03:19,011][model8_pretrain.py][INFO] Epoch:[0/2](241000/4588595) loss:2.851 lr:0.0000100 epoch_Time:27484.0min: [2024-01-03 19:03:55,968][model8_pretrain.py][INFO] Epoch:[0/2](241100/4588595) loss:3.392 lr:0.0000100 epoch_Time:27483.0min: [2024-01-03 19:03:55,968][model8_pretrain.py][INFO] Epoch:[0/2](241100/4588595) loss:2.903 lr:0.0000100 epoch_Time:27483.0min: [2024-01-03 19:03:55,968][model8_pretrain.py][INFO] Epoch:[0/2](241100/4588595) loss:2.898 lr:0.0000100 epoch_Time:27483.0min: [2024-01-03 19:03:55,968][model8_pretrain.py][INFO] Epoch:[0/2](241100/4588595) loss:2.881 lr:0.0000100 epoch_Time:27483.0min: [2024-01-03 19:03:55,968][model8_pretrain.py][INFO] Epoch:[0/2](241100/4588595) loss:3.066 lr:0.0000100 epoch_Time:27483.0min: [2024-01-03 19:03:55,968][model8_pretrain.py][INFO] Epoch:[0/2](241100/4588595) loss:3.023 lr:0.0000100 epoch_Time:27483.0min: [2024-01-03 19:03:55,968][model8_pretrain.py][INFO] 
Epoch:[0/2](241100/4588595) loss:2.686 lr:0.0000100 epoch_Time:27483.0min: [2024-01-03 19:03:55,968][model8_pretrain.py][INFO] Epoch:[0/2](241100/4588595) loss:2.763 lr:0.0000100 epoch_Time:27483.0min: [2024-01-03 19:04:32,922][model8_pretrain.py][INFO] Epoch:[0/2](241200/4588595) loss:3.173 lr:0.0000100 epoch_Time:27482.0min: [2024-01-03 19:04:32,922][model8_pretrain.py][INFO] Epoch:[0/2](241200/4588595) loss:2.561 lr:0.0000100 epoch_Time:27482.0min: [2024-01-03 19:04:32,922][model8_pretrain.py][INFO] Epoch:[0/2](241200/4588595) loss:2.340 lr:0.0000100 epoch_Time:27482.0min: [2024-01-03 19:04:32,922][model8_pretrain.py][INFO] Epoch:[0/2](241200/4588595) loss:3.538 lr:0.0000100 epoch_Time:27482.0min: [2024-01-03 19:04:32,922][model8_pretrain.py][INFO] Epoch:[0/2](241200/4588595) loss:2.856 lr:0.0000100 epoch_Time:27482.0min: [2024-01-03 19:04:32,922][model8_pretrain.py][INFO] Epoch:[0/2](241200/4588595) loss:3.189 lr:0.0000100 epoch_Time:27482.0min: [2024-01-03 19:04:32,922][model8_pretrain.py][INFO] Epoch:[0/2](241200/4588595) loss:2.913 lr:0.0000100 epoch_Time:27482.0min: [2024-01-03 19:04:32,922][model8_pretrain.py][INFO] Epoch:[0/2](241200/4588595) loss:2.600 lr:0.0000100 epoch_Time:27482.0min: [2024-01-03 19:05:09,853][model8_pretrain.py][INFO] Epoch:[0/2](241300/4588595) loss:3.036 lr:0.0000100 epoch_Time:27481.0min: [2024-01-03 19:05:09,853][model8_pretrain.py][INFO] Epoch:[0/2](241300/4588595) loss:2.268 lr:0.0000100 epoch_Time:27481.0min: [2024-01-03 19:05:09,853][model8_pretrain.py][INFO] Epoch:[0/2](241300/4588595) loss:2.836 lr:0.0000100 epoch_Time:27481.0min: [2024-01-03 19:05:09,853][model8_pretrain.py][INFO] Epoch:[0/2](241300/4588595) loss:3.502 lr:0.0000100 epoch_Time:27481.0min: [2024-01-03 19:05:09,853][model8_pretrain.py][INFO] Epoch:[0/2](241300/4588595) loss:3.002 lr:0.0000100 epoch_Time:27481.0min: [2024-01-03 19:05:09,853][model8_pretrain.py][INFO] Epoch:[0/2](241300/4588595) loss:3.414 lr:0.0000100 epoch_Time:27481.0min: [2024-01-03 
19:05:09,853][model8_pretrain.py][INFO] Epoch:[0/2](241300/4588595) loss:2.542 lr:0.0000100 epoch_Time:27481.0min: [2024-01-03 19:05:09,853][model8_pretrain.py][INFO] Epoch:[0/2](241300/4588595) loss:3.306 lr:0.0000100 epoch_Time:27481.0min: [2024-01-03 19:05:46,805][model8_pretrain.py][INFO] Epoch:[0/2](241400/4588595) loss:3.320 lr:0.0000100 epoch_Time:27481.0min: [2024-01-03 19:05:46,805][model8_pretrain.py][INFO] Epoch:[0/2](241400/4588595) loss:2.626 lr:0.0000100 epoch_Time:27481.0min: [2024-01-03 19:05:46,805][model8_pretrain.py][INFO] Epoch:[0/2](241400/4588595) loss:2.836 lr:0.0000100 epoch_Time:27481.0min: [2024-01-03 19:05:46,805][model8_pretrain.py][INFO] Epoch:[0/2](241400/4588595) loss:2.193 lr:0.0000100 epoch_Time:27481.0min: [2024-01-03 19:05:46,806][model8_pretrain.py][INFO] Epoch:[0/2](241400/4588595) loss:2.678 lr:0.0000100 epoch_Time:27481.0min: [2024-01-03 19:05:46,806][model8_pretrain.py][INFO] Epoch:[0/2](241400/4588595) loss:3.177 lr:0.0000100 epoch_Time:27481.0min: [2024-01-03 19:05:46,806][model8_pretrain.py][INFO] Epoch:[0/2](241400/4588595) loss:2.725 lr:0.0000100 epoch_Time:27481.0min: [2024-01-03 19:05:46,806][model8_pretrain.py][INFO] Epoch:[0/2](241400/4588595) loss:3.078 lr:0.0000100 epoch_Time:27481.0min: [2024-01-03 19:06:23,762][model8_pretrain.py][INFO] Epoch:[0/2](241500/4588595) loss:2.746 lr:0.0000100 epoch_Time:27479.0min: [2024-01-03 19:06:23,762][model8_pretrain.py][INFO] Epoch:[0/2](241500/4588595) loss:2.854 lr:0.0000100 epoch_Time:27479.0min: [2024-01-03 19:06:23,762][model8_pretrain.py][INFO] Epoch:[0/2](241500/4588595) loss:3.189 lr:0.0000100 epoch_Time:27479.0min: [2024-01-03 19:06:23,762][model8_pretrain.py][INFO] Epoch:[0/2](241500/4588595) loss:2.870 lr:0.0000100 epoch_Time:27479.0min: [2024-01-03 19:06:23,762][model8_pretrain.py][INFO] Epoch:[0/2](241500/4588595) loss:2.875 lr:0.0000100 epoch_Time:27479.0min: [2024-01-03 19:06:23,763][model8_pretrain.py][INFO] Epoch:[0/2](241500/4588595) loss:2.779 lr:0.0000100 
epoch_Time:27479.0min: [2024-01-03 19:06:23,763][model8_pretrain.py][INFO] Epoch:[0/2](241500/4588595) loss:2.911 lr:0.0000100 epoch_Time:27479.0min: [2024-01-03 19:06:23,763][model8_pretrain.py][INFO] Epoch:[0/2](241500/4588595) loss:3.212 lr:0.0000100 epoch_Time:27479.0min: [2024-01-03 19:07:11,150][model8_pretrain.py][INFO] Epoch:[0/2](241600/4588595) loss:3.066 lr:0.0000100 epoch_Time:27481.0min: [2024-01-03 19:07:11,150][model8_pretrain.py][INFO] Epoch:[0/2](241600/4588595) loss:3.605 lr:0.0000100 epoch_Time:27481.0min: [2024-01-03 19:07:11,150][model8_pretrain.py][INFO] Epoch:[0/2](241600/4588595) loss:2.764 lr:0.0000100 epoch_Time:27481.0min: [2024-01-03 19:07:11,150][model8_pretrain.py][INFO] Epoch:[0/2](241600/4588595) loss:2.747 lr:0.0000100 epoch_Time:27481.0min: [2024-01-03 19:07:11,150][model8_pretrain.py][INFO] Epoch:[0/2](241600/4588595) loss:3.284 lr:0.0000100 epoch_Time:27481.0min: [2024-01-03 19:07:11,150][model8_pretrain.py][INFO] Epoch:[0/2](241600/4588595) loss:3.236 lr:0.0000100 epoch_Time:27481.0min: [2024-01-03 19:07:11,150][model8_pretrain.py][INFO] Epoch:[0/2](241600/4588595) loss:2.485 lr:0.0000100 epoch_Time:27481.0min: [2024-01-03 19:07:11,150][model8_pretrain.py][INFO] Epoch:[0/2](241600/4588595) loss:3.218 lr:0.0000100 epoch_Time:27481.0min: [2024-01-03 19:07:48,075][model8_pretrain.py][INFO] Epoch:[0/2](241700/4588595) loss:2.742 lr:0.0000100 epoch_Time:27480.0min: [2024-01-03 19:07:48,075][model8_pretrain.py][INFO] Epoch:[0/2](241700/4588595) loss:3.193 lr:0.0000100 epoch_Time:27480.0min: [2024-01-03 19:07:48,075][model8_pretrain.py][INFO] Epoch:[0/2](241700/4588595) loss:3.137 lr:0.0000100 epoch_Time:27480.0min: [2024-01-03 19:07:48,075][model8_pretrain.py][INFO] Epoch:[0/2](241700/4588595) loss:3.094 lr:0.0000100 epoch_Time:27480.0min: [2024-01-03 19:07:48,075][model8_pretrain.py][INFO] Epoch:[0/2](241700/4588595) loss:2.607 lr:0.0000100 epoch_Time:27480.0min: [2024-01-03 19:07:48,075][model8_pretrain.py][INFO] 
Epoch:[0/2](241700/4588595) loss:2.673 lr:0.0000100 epoch_Time:27480.0min: [2024-01-03 19:07:48,075][model8_pretrain.py][INFO] Epoch:[0/2](241700/4588595) loss:3.204 lr:0.0000100 epoch_Time:27480.0min: [2024-01-03 19:07:48,075][model8_pretrain.py][INFO] Epoch:[0/2](241700/4588595) loss:2.369 lr:0.0000100 epoch_Time:27480.0min: [2024-01-03 19:08:25,010][model8_pretrain.py][INFO] Epoch:[0/2](241800/4588595) loss:2.748 lr:0.0000100 epoch_Time:27480.0min: [2024-01-03 19:08:25,010][model8_pretrain.py][INFO] Epoch:[0/2](241800/4588595) loss:2.820 lr:0.0000100 epoch_Time:27480.0min: [2024-01-03 19:08:25,010][model8_pretrain.py][INFO] Epoch:[0/2](241800/4588595) loss:3.215 lr:0.0000100 epoch_Time:27480.0min: [2024-01-03 19:08:25,010][model8_pretrain.py][INFO] Epoch:[0/2](241800/4588595) loss:2.475 lr:0.0000100 epoch_Time:27480.0min: [2024-01-03 19:08:25,010][model8_pretrain.py][INFO] Epoch:[0/2](241800/4588595) loss:2.459 lr:0.0000100 epoch_Time:27480.0min: [2024-01-03 19:08:25,010][model8_pretrain.py][INFO] Epoch:[0/2](241800/4588595) loss:2.290 lr:0.0000100 epoch_Time:27480.0min: [2024-01-03 19:08:25,010][model8_pretrain.py][INFO] Epoch:[0/2](241800/4588595) loss:3.092 lr:0.0000100 epoch_Time:27480.0min: [2024-01-03 19:08:25,010][model8_pretrain.py][INFO] Epoch:[0/2](241800/4588595) loss:2.930 lr:0.0000100 epoch_Time:27480.0min: [2024-01-03 19:09:01,956][model8_pretrain.py][INFO] Epoch:[0/2](241900/4588595) loss:3.025 lr:0.0000100 epoch_Time:27478.0min: [2024-01-03 19:09:01,956][model8_pretrain.py][INFO] Epoch:[0/2](241900/4588595) loss:2.972 lr:0.0000100 epoch_Time:27478.0min: [2024-01-03 19:09:01,956][model8_pretrain.py][INFO] Epoch:[0/2](241900/4588595) loss:3.040 lr:0.0000100 epoch_Time:27478.0min: [2024-01-03 19:09:01,956][model8_pretrain.py][INFO] Epoch:[0/2](241900/4588595) loss:2.768 lr:0.0000100 epoch_Time:27478.0min: [2024-01-03 19:09:01,956][model8_pretrain.py][INFO] Epoch:[0/2](241900/4588595) loss:3.212 lr:0.0000100 epoch_Time:27478.0min: [2024-01-03 
19:09:01,957][model8_pretrain.py][INFO] Epoch:[0/2](241900/4588595) loss:2.861 lr:0.0000100 epoch_Time:27478.0min: [2024-01-03 19:09:01,957][model8_pretrain.py][INFO] Epoch:[0/2](241900/4588595) loss:2.934 lr:0.0000100 epoch_Time:27478.0min: [2024-01-03 19:09:01,957][model8_pretrain.py][INFO] Epoch:[0/2](241900/4588595) loss:2.829 lr:0.0000100 epoch_Time:27478.0min: [2024-01-03 19:09:38,890][model8_pretrain.py][INFO] Epoch:[0/2](242000/4588595) loss:2.631 lr:0.0000100 epoch_Time:27478.0min: [2024-01-03 19:09:38,890][model8_pretrain.py][INFO] Epoch:[0/2](242000/4588595) loss:2.957 lr:0.0000100 epoch_Time:27478.0min: [2024-01-03 19:09:38,890][model8_pretrain.py][INFO] Epoch:[0/2](242000/4588595) loss:2.936 lr:0.0000100 epoch_Time:27478.0min: [2024-01-03 19:09:38,890][model8_pretrain.py][INFO] Epoch:[0/2](242000/4588595) loss:2.851 lr:0.0000100 epoch_Time:27478.0min: [2024-01-03 19:09:38,890][model8_pretrain.py][INFO] Epoch:[0/2](242000/4588595) loss:3.084 lr:0.0000100 epoch_Time:27478.0min: [2024-01-03 19:09:38,890][model8_pretrain.py][INFO] Epoch:[0/2](242000/4588595) loss:3.150 lr:0.0000100 epoch_Time:27478.0min: [2024-01-03 19:09:38,890][model8_pretrain.py][INFO] Epoch:[0/2](242000/4588595) loss:2.626 lr:0.0000100 epoch_Time:27478.0min: [2024-01-03 19:09:38,890][model8_pretrain.py][INFO] Epoch:[0/2](242000/4588595) loss:2.974 lr:0.0000100 epoch_Time:27478.0min: [2024-01-03 19:10:15,816][model8_pretrain.py][INFO] Epoch:[0/2](242100/4588595) loss:2.942 lr:0.0000100 epoch_Time:27477.0min: [2024-01-03 19:10:15,816][model8_pretrain.py][INFO] Epoch:[0/2](242100/4588595) loss:2.585 lr:0.0000100 epoch_Time:27477.0min: [2024-01-03 19:10:15,816][model8_pretrain.py][INFO] Epoch:[0/2](242100/4588595) loss:2.741 lr:0.0000100 epoch_Time:27477.0min: [2024-01-03 19:10:15,817][model8_pretrain.py][INFO] Epoch:[0/2](242100/4588595) loss:2.708 lr:0.0000100 epoch_Time:27477.0min: [2024-01-03 19:10:15,817][model8_pretrain.py][INFO] Epoch:[0/2](242100/4588595) loss:3.162 lr:0.0000100 
epoch_Time:27477.0min: [2024-01-03 19:10:15,817][model8_pretrain.py][INFO] Epoch:[0/2](242100/4588595) loss:2.681 lr:0.0000100 epoch_Time:27477.0min: [2024-01-03 19:10:15,817][model8_pretrain.py][INFO] Epoch:[0/2](242100/4588595) loss:2.571 lr:0.0000100 epoch_Time:27477.0min: [2024-01-03 19:10:15,817][model8_pretrain.py][INFO] Epoch:[0/2](242100/4588595) loss:3.368 lr:0.0000100 epoch_Time:27477.0min: [2024-01-03 19:10:52,744][model8_pretrain.py][INFO] Epoch:[0/2](242200/4588595) loss:2.685 lr:0.0000100 epoch_Time:27476.0min: [2024-01-03 19:10:52,744][model8_pretrain.py][INFO] Epoch:[0/2](242200/4588595) loss:3.401 lr:0.0000100 epoch_Time:27476.0min: [2024-01-03 19:10:52,744][model8_pretrain.py][INFO] Epoch:[0/2](242200/4588595) loss:3.005 lr:0.0000100 epoch_Time:27476.0min: [2024-01-03 19:10:52,744][model8_pretrain.py][INFO] Epoch:[0/2](242200/4588595) loss:3.226 lr:0.0000100 epoch_Time:27476.0min: [2024-01-03 19:10:52,744][model8_pretrain.py][INFO] Epoch:[0/2](242200/4588595) loss:2.552 lr:0.0000100 epoch_Time:27476.0min: [2024-01-03 19:10:52,744][model8_pretrain.py][INFO] Epoch:[0/2](242200/4588595) loss:2.899 lr:0.0000100 epoch_Time:27476.0min: [2024-01-03 19:10:52,744][model8_pretrain.py][INFO] Epoch:[0/2](242200/4588595) loss:2.801 lr:0.0000100 epoch_Time:27476.0min: [2024-01-03 19:10:52,744][model8_pretrain.py][INFO] Epoch:[0/2](242200/4588595) loss:2.900 lr:0.0000100 epoch_Time:27476.0min: [2024-01-03 19:11:29,662][model8_pretrain.py][INFO] Epoch:[0/2](242300/4588595) loss:2.987 lr:0.0000100 epoch_Time:27475.0min: [2024-01-03 19:11:29,662][model8_pretrain.py][INFO] Epoch:[0/2](242300/4588595) loss:3.087 lr:0.0000100 epoch_Time:27475.0min: [2024-01-03 19:11:29,662][model8_pretrain.py][INFO] Epoch:[0/2](242300/4588595) loss:2.801 lr:0.0000100 epoch_Time:27475.0min: [2024-01-03 19:11:29,662][model8_pretrain.py][INFO] Epoch:[0/2](242300/4588595) loss:3.158 lr:0.0000100 epoch_Time:27475.0min: [2024-01-03 19:11:29,662][model8_pretrain.py][INFO] 
Epoch:[0/2](242300/4588595) loss:3.018 lr:0.0000100 epoch_Time:27475.0min: [2024-01-03 19:11:29,662][model8_pretrain.py][INFO] Epoch:[0/2](242300/4588595) loss:2.438 lr:0.0000100 epoch_Time:27475.0min: [2024-01-03 19:11:29,662][model8_pretrain.py][INFO] Epoch:[0/2](242300/4588595) loss:3.063 lr:0.0000100 epoch_Time:27475.0min: [2024-01-03 19:11:29,662][model8_pretrain.py][INFO] Epoch:[0/2](242300/4588595) loss:3.088 lr:0.0000100 epoch_Time:27475.0min: [2024-01-03 19:12:17,110][model8_pretrain.py][INFO] Epoch:[0/2](242400/4588595) loss:3.268 lr:0.0000100 epoch_Time:27477.0min: [2024-01-03 19:12:17,110][model8_pretrain.py][INFO] Epoch:[0/2](242400/4588595) loss:3.162 lr:0.0000100 epoch_Time:27477.0min: [2024-01-03 19:12:17,110][model8_pretrain.py][INFO] Epoch:[0/2](242400/4588595) loss:3.415 lr:0.0000100 epoch_Time:27477.0min: [2024-01-03 19:12:17,110][model8_pretrain.py][INFO] Epoch:[0/2](242400/4588595) loss:3.040 lr:0.0000100 epoch_Time:27477.0min: [2024-01-03 19:12:17,110][model8_pretrain.py][INFO] Epoch:[0/2](242400/4588595) loss:3.210 lr:0.0000100 epoch_Time:27477.0min: [2024-01-03 19:12:17,110][model8_pretrain.py][INFO] Epoch:[0/2](242400/4588595) loss:3.260 lr:0.0000100 epoch_Time:27477.0min: [2024-01-03 19:12:17,110][model8_pretrain.py][INFO] Epoch:[0/2](242400/4588595) loss:3.132 lr:0.0000100 epoch_Time:27477.0min: [2024-01-03 19:12:17,110][model8_pretrain.py][INFO] Epoch:[0/2](242400/4588595) loss:2.686 lr:0.0000100 epoch_Time:27477.0min: [2024-01-03 19:12:54,025][model8_pretrain.py][INFO] Epoch:[0/2](242500/4588595) loss:3.425 lr:0.0000100 epoch_Time:27476.0min: [2024-01-03 19:12:54,025][model8_pretrain.py][INFO] Epoch:[0/2](242500/4588595) loss:2.845 lr:0.0000100 epoch_Time:27476.0min: [2024-01-03 19:12:54,025][model8_pretrain.py][INFO] Epoch:[0/2](242500/4588595) loss:3.239 lr:0.0000100 epoch_Time:27476.0min: [2024-01-03 19:12:54,025][model8_pretrain.py][INFO] Epoch:[0/2](242500/4588595) loss:3.258 lr:0.0000100 epoch_Time:27476.0min: [2024-01-03 
19:12:54,025][model8_pretrain.py][INFO] Epoch:[0/2](242500/4588595) loss:2.973 lr:0.0000100 epoch_Time:27476.0min: [2024-01-03 19:12:54,025][model8_pretrain.py][INFO] Epoch:[0/2](242500/4588595) loss:2.845 lr:0.0000100 epoch_Time:27476.0min: [2024-01-03 19:12:54,025][model8_pretrain.py][INFO] Epoch:[0/2](242500/4588595) loss:2.822 lr:0.0000100 epoch_Time:27476.0min: [2024-01-03 19:12:54,025][model8_pretrain.py][INFO] Epoch:[0/2](242500/4588595) loss:3.214 lr:0.0000100 epoch_Time:27476.0min: [2024-01-03 19:13:30,951][model8_pretrain.py][INFO] Epoch:[0/2](242600/4588595) loss:2.751 lr:0.0000100 epoch_Time:27476.0min: [2024-01-03 19:13:30,951][model8_pretrain.py][INFO] Epoch:[0/2](242600/4588595) loss:2.648 lr:0.0000100 epoch_Time:27476.0min: [2024-01-03 19:13:30,951][model8_pretrain.py][INFO] Epoch:[0/2](242600/4588595) loss:2.519 lr:0.0000100 epoch_Time:27476.0min: [2024-01-03 19:13:30,951][model8_pretrain.py][INFO] Epoch:[0/2](242600/4588595) loss:3.146 lr:0.0000100 epoch_Time:27476.0min: [2024-01-03 19:13:30,951][model8_pretrain.py][INFO] Epoch:[0/2](242600/4588595) loss:2.918 lr:0.0000100 epoch_Time:27476.0min: [2024-01-03 19:13:30,951][model8_pretrain.py][INFO] Epoch:[0/2](242600/4588595) loss:2.771 lr:0.0000100 epoch_Time:27476.0min: [2024-01-03 19:13:30,951][model8_pretrain.py][INFO] Epoch:[0/2](242600/4588595) loss:3.177 lr:0.0000100 epoch_Time:27476.0min: [2024-01-03 19:13:30,951][model8_pretrain.py][INFO] Epoch:[0/2](242600/4588595) loss:2.641 lr:0.0000100 epoch_Time:27476.0min: [2024-01-03 19:14:07,889][model8_pretrain.py][INFO] Epoch:[0/2](242700/4588595) loss:3.287 lr:0.0000100 epoch_Time:27474.0min: [2024-01-03 19:14:07,889][model8_pretrain.py][INFO] Epoch:[0/2](242700/4588595) loss:3.011 lr:0.0000100 epoch_Time:27474.0min: [2024-01-03 19:14:07,889][model8_pretrain.py][INFO] Epoch:[0/2](242700/4588595) loss:2.963 lr:0.0000100 epoch_Time:27474.0min: [2024-01-03 19:14:07,889][model8_pretrain.py][INFO] Epoch:[0/2](242700/4588595) loss:3.017 lr:0.0000100 
epoch_Time:27474.0min: [2024-01-03 19:14:07,889][model8_pretrain.py][INFO] Epoch:[0/2](242700/4588595) loss:3.242 lr:0.0000100 epoch_Time:27474.0min: [2024-01-03 19:14:07,889][model8_pretrain.py][INFO] Epoch:[0/2](242700/4588595) loss:2.822 lr:0.0000100 epoch_Time:27474.0min: [2024-01-03 19:14:07,889][model8_pretrain.py][INFO] Epoch:[0/2](242700/4588595) loss:3.127 lr:0.0000100 epoch_Time:27474.0min: [2024-01-03 19:14:07,889][model8_pretrain.py][INFO] Epoch:[0/2](242700/4588595) loss:2.827 lr:0.0000100 epoch_Time:27474.0min: [2024-01-03 19:14:44,817][model8_pretrain.py][INFO] Epoch:[0/2](242800/4588595) loss:2.510 lr:0.0000100 epoch_Time:27474.0min: [2024-01-03 19:14:44,817][model8_pretrain.py][INFO] Epoch:[0/2](242800/4588595) loss:2.552 lr:0.0000100 epoch_Time:27474.0min: [2024-01-03 19:14:44,817][model8_pretrain.py][INFO] Epoch:[0/2](242800/4588595) loss:2.997 lr:0.0000100 epoch_Time:27474.0min: [2024-01-03 19:14:44,817][model8_pretrain.py][INFO] Epoch:[0/2](242800/4588595) loss:2.559 lr:0.0000100 epoch_Time:27474.0min: [2024-01-03 19:14:44,817][model8_pretrain.py][INFO] Epoch:[0/2](242800/4588595) loss:3.101 lr:0.0000100 epoch_Time:27474.0min: [2024-01-03 19:14:44,817][model8_pretrain.py][INFO] Epoch:[0/2](242800/4588595) loss:3.173 lr:0.0000100 epoch_Time:27474.0min: [2024-01-03 19:14:44,817][model8_pretrain.py][INFO] Epoch:[0/2](242800/4588595) loss:3.056 lr:0.0000100 epoch_Time:27474.0min: [2024-01-03 19:14:44,817][model8_pretrain.py][INFO] Epoch:[0/2](242800/4588595) loss:2.728 lr:0.0000100 epoch_Time:27474.0min: [2024-01-03 19:15:21,795][model8_pretrain.py][INFO] Epoch:[0/2](242900/4588595) loss:3.237 lr:0.0000100 epoch_Time:27473.0min: [2024-01-03 19:15:21,795][model8_pretrain.py][INFO] Epoch:[0/2](242900/4588595) loss:3.159 lr:0.0000100 epoch_Time:27473.0min: [2024-01-03 19:15:21,795][model8_pretrain.py][INFO] Epoch:[0/2](242900/4588595) loss:2.716 lr:0.0000100 epoch_Time:27473.0min: [2024-01-03 19:15:21,795][model8_pretrain.py][INFO] 
Epoch:[0/2](242900/4588595) loss:3.243 lr:0.0000100 epoch_Time:27473.0min: [2024-01-03 19:15:21,795][model8_pretrain.py][INFO] Epoch:[0/2](242900/4588595) loss:2.703 lr:0.0000100 epoch_Time:27473.0min: [2024-01-03 19:15:21,796][model8_pretrain.py][INFO] Epoch:[0/2](242900/4588595) loss:2.602 lr:0.0000100 epoch_Time:27473.0min: [2024-01-03 19:15:21,796][model8_pretrain.py][INFO] Epoch:[0/2](242900/4588595) loss:2.834 lr:0.0000100 epoch_Time:27473.0min: [2024-01-03 19:15:21,796][model8_pretrain.py][INFO] Epoch:[0/2](242900/4588595) loss:2.805 lr:0.0000100 epoch_Time:27473.0min: [2024-01-03 19:15:58,760][model8_pretrain.py][INFO] Epoch:[0/2](243000/4588595) loss:3.469 lr:0.0000100 epoch_Time:27471.0min: [2024-01-03 19:15:58,760][model8_pretrain.py][INFO] Epoch:[0/2](243000/4588595) loss:2.807 lr:0.0000100 epoch_Time:27471.0min: [2024-01-03 19:15:58,760][model8_pretrain.py][INFO] Epoch:[0/2](243000/4588595) loss:2.688 lr:0.0000100 epoch_Time:27471.0min: [2024-01-03 19:15:58,760][model8_pretrain.py][INFO] Epoch:[0/2](243000/4588595) loss:3.184 lr:0.0000100 epoch_Time:27471.0min: [2024-01-03 19:15:58,760][model8_pretrain.py][INFO] Epoch:[0/2](243000/4588595) loss:2.902 lr:0.0000100 epoch_Time:27471.0min: [2024-01-03 19:15:58,760][model8_pretrain.py][INFO] Epoch:[0/2](243000/4588595) loss:3.074 lr:0.0000100 epoch_Time:27471.0min: [2024-01-03 19:15:58,760][model8_pretrain.py][INFO] Epoch:[0/2](243000/4588595) loss:2.644 lr:0.0000100 epoch_Time:27471.0min: [2024-01-03 19:15:58,760][model8_pretrain.py][INFO] Epoch:[0/2](243000/4588595) loss:3.152 lr:0.0000100 epoch_Time:27471.0min: [2024-01-03 19:16:35,702][model8_pretrain.py][INFO] Epoch:[0/2](243100/4588595) loss:2.538 lr:0.0000100 epoch_Time:27471.0min: [2024-01-03 19:16:35,702][model8_pretrain.py][INFO] Epoch:[0/2](243100/4588595) loss:2.613 lr:0.0000100 epoch_Time:27471.0min: [2024-01-03 19:16:35,702][model8_pretrain.py][INFO] Epoch:[0/2](243100/4588595) loss:2.534 lr:0.0000100 epoch_Time:27471.0min: [2024-01-03 
19:16:35,702][model8_pretrain.py][INFO] Epoch:[0/2](243100/4588595) loss:2.841 lr:0.0000100 epoch_Time:27471.0min: [2024-01-03 19:16:35,702][model8_pretrain.py][INFO] Epoch:[0/2](243100/4588595) loss:2.899 lr:0.0000100 epoch_Time:27471.0min: [2024-01-03 19:16:35,702][model8_pretrain.py][INFO] Epoch:[0/2](243100/4588595) loss:2.563 lr:0.0000100 epoch_Time:27471.0min: [2024-01-03 19:16:35,702][model8_pretrain.py][INFO] Epoch:[0/2](243100/4588595) loss:3.004 lr:0.0000100 epoch_Time:27471.0min: [2024-01-03 19:16:35,702][model8_pretrain.py][INFO] Epoch:[0/2](243100/4588595) loss:2.621 lr:0.0000100 epoch_Time:27471.0min: [2024-01-03 19:17:22,993][model8_pretrain.py][INFO] Epoch:[0/2](243200/4588595) loss:2.871 lr:0.0000100 epoch_Time:27473.0min: [2024-01-03 19:17:22,993][model8_pretrain.py][INFO] Epoch:[0/2](243200/4588595) loss:2.996 lr:0.0000100 epoch_Time:27473.0min: [2024-01-03 19:17:22,993][model8_pretrain.py][INFO] Epoch:[0/2](243200/4588595) loss:2.591 lr:0.0000100 epoch_Time:27473.0min: [2024-01-03 19:17:22,993][model8_pretrain.py][INFO] Epoch:[0/2](243200/4588595) loss:2.848 lr:0.0000100 epoch_Time:27473.0min: [2024-01-03 19:17:22,993][model8_pretrain.py][INFO] Epoch:[0/2](243200/4588595) loss:2.789 lr:0.0000100 epoch_Time:27473.0min: [2024-01-03 19:17:22,993][model8_pretrain.py][INFO] Epoch:[0/2](243200/4588595) loss:2.349 lr:0.0000100 epoch_Time:27473.0min: [2024-01-03 19:17:22,993][model8_pretrain.py][INFO] Epoch:[0/2](243200/4588595) loss:2.866 lr:0.0000100 epoch_Time:27473.0min: [2024-01-03 19:17:22,993][model8_pretrain.py][INFO] Epoch:[0/2](243200/4588595) loss:3.018 lr:0.0000100 epoch_Time:27473.0min: [2024-01-03 19:17:59,921][model8_pretrain.py][INFO] Epoch:[0/2](243300/4588595) loss:2.744 lr:0.0000100 epoch_Time:27472.0min: [2024-01-03 19:17:59,921][model8_pretrain.py][INFO] Epoch:[0/2](243300/4588595) loss:2.995 lr:0.0000100 epoch_Time:27472.0min: [2024-01-03 19:17:59,921][model8_pretrain.py][INFO] Epoch:[0/2](243300/4588595) loss:3.044 lr:0.0000100 
epoch_Time:27472.0min: [2024-01-03 19:17:59,921][model8_pretrain.py][INFO] Epoch:[0/2](243300/4588595) loss:3.111 lr:0.0000100 epoch_Time:27472.0min: [2024-01-03 19:17:59,921][model8_pretrain.py][INFO] Epoch:[0/2](243300/4588595) loss:2.815 lr:0.0000100 epoch_Time:27472.0min: [2024-01-03 19:17:59,921][model8_pretrain.py][INFO] Epoch:[0/2](243300/4588595) loss:3.083 lr:0.0000100 epoch_Time:27472.0min: [2024-01-03 19:17:59,921][model8_pretrain.py][INFO] Epoch:[0/2](243300/4588595) loss:2.717 lr:0.0000100 epoch_Time:27472.0min: [2024-01-03 19:17:59,921][model8_pretrain.py][INFO] Epoch:[0/2](243300/4588595) loss:2.687 lr:0.0000100 epoch_Time:27472.0min: [2024-01-03 19:18:36,859][model8_pretrain.py][INFO] Epoch:[0/2](243400/4588595) loss:3.290 lr:0.0000100 epoch_Time:27471.0min: [2024-01-03 19:18:36,859][model8_pretrain.py][INFO] Epoch:[0/2](243400/4588595) loss:2.956 lr:0.0000100 epoch_Time:27471.0min: [2024-01-03 19:18:36,859][model8_pretrain.py][INFO] Epoch:[0/2](243400/4588595) loss:3.217 lr:0.0000100 epoch_Time:27471.0min: [2024-01-03 19:18:36,859][model8_pretrain.py][INFO] Epoch:[0/2](243400/4588595) loss:2.098 lr:0.0000100 epoch_Time:27471.0min: [2024-01-03 19:18:36,859][model8_pretrain.py][INFO] Epoch:[0/2](243400/4588595) loss:2.744 lr:0.0000100 epoch_Time:27471.0min: [2024-01-03 19:18:36,859][model8_pretrain.py][INFO] Epoch:[0/2](243400/4588595) loss:2.963 lr:0.0000100 epoch_Time:27471.0min: [2024-01-03 19:18:36,859][model8_pretrain.py][INFO] Epoch:[0/2](243400/4588595) loss:2.942 lr:0.0000100 epoch_Time:27471.0min: [2024-01-03 19:18:36,860][model8_pretrain.py][INFO] Epoch:[0/2](243400/4588595) loss:2.388 lr:0.0000100 epoch_Time:27471.0min: [2024-01-03 19:19:13,792][model8_pretrain.py][INFO] Epoch:[0/2](243500/4588595) loss:3.323 lr:0.0000100 epoch_Time:27470.0min: [2024-01-03 19:19:13,792][model8_pretrain.py][INFO] Epoch:[0/2](243500/4588595) loss:2.955 lr:0.0000100 epoch_Time:27470.0min: [2024-01-03 19:19:13,792][model8_pretrain.py][INFO] 
Epoch:[0/2](243500/4588595) loss:2.213 lr:0.0000100 epoch_Time:27470.0min: [2024-01-03 19:19:13,792][model8_pretrain.py][INFO] Epoch:[0/2](243500/4588595) loss:2.611 lr:0.0000100 epoch_Time:27470.0min: [2024-01-03 19:19:13,792][model8_pretrain.py][INFO] Epoch:[0/2](243500/4588595) loss:2.401 lr:0.0000100 epoch_Time:27470.0min: [2024-01-03 19:19:13,792][model8_pretrain.py][INFO] Epoch:[0/2](243500/4588595) loss:3.012 lr:0.0000100 epoch_Time:27470.0min: [2024-01-03 19:19:13,792][model8_pretrain.py][INFO] Epoch:[0/2](243500/4588595) loss:2.836 lr:0.0000100 epoch_Time:27470.0min: [2024-01-03 19:19:13,792][model8_pretrain.py][INFO] Epoch:[0/2](243500/4588595) loss:2.958 lr:0.0000100 epoch_Time:27470.0min: [2024-01-03 19:19:50,731][model8_pretrain.py][INFO] Epoch:[0/2](243600/4588595) loss:3.054 lr:0.0000100 epoch_Time:27469.0min: [2024-01-03 19:19:50,731][model8_pretrain.py][INFO] Epoch:[0/2](243600/4588595) loss:2.797 lr:0.0000100 epoch_Time:27469.0min: [2024-01-03 19:19:50,731][model8_pretrain.py][INFO] Epoch:[0/2](243600/4588595) loss:2.663 lr:0.0000100 epoch_Time:27469.0min: [2024-01-03 19:19:50,731][model8_pretrain.py][INFO] Epoch:[0/2](243600/4588595) loss:2.389 lr:0.0000100 epoch_Time:27469.0min: [2024-01-03 19:19:50,731][model8_pretrain.py][INFO] Epoch:[0/2](243600/4588595) loss:2.723 lr:0.0000100 epoch_Time:27469.0min: [2024-01-03 19:19:50,731][model8_pretrain.py][INFO] Epoch:[0/2](243600/4588595) loss:2.894 lr:0.0000100 epoch_Time:27469.0min: [2024-01-03 19:19:50,731][model8_pretrain.py][INFO] Epoch:[0/2](243600/4588595) loss:2.684 lr:0.0000100 epoch_Time:27469.0min: [2024-01-03 19:19:50,731][model8_pretrain.py][INFO] Epoch:[0/2](243600/4588595) loss:3.102 lr:0.0000100 epoch_Time:27469.0min: [2024-01-03 19:20:27,667][model8_pretrain.py][INFO] Epoch:[0/2](243700/4588595) loss:3.216 lr:0.0000100 epoch_Time:27468.0min: [2024-01-03 19:20:27,667][model8_pretrain.py][INFO] Epoch:[0/2](243700/4588595) loss:2.936 lr:0.0000100 epoch_Time:27468.0min: [2024-01-03 
19:20:27,667][model8_pretrain.py][INFO] Epoch:[0/2](243700/4588595) loss:3.005 lr:0.0000100 epoch_Time:27468.0min: [2024-01-03 19:20:27,667][model8_pretrain.py][INFO] Epoch:[0/2](243700/4588595) loss:3.463 lr:0.0000100 epoch_Time:27468.0min: [2024-01-03 19:20:27,667][model8_pretrain.py][INFO] Epoch:[0/2](243700/4588595) loss:2.803 lr:0.0000100 epoch_Time:27468.0min: [2024-01-03 19:20:27,667][model8_pretrain.py][INFO] Epoch:[0/2](243700/4588595) loss:2.528 lr:0.0000100 epoch_Time:27468.0min: [2024-01-03 19:20:27,667][model8_pretrain.py][INFO] Epoch:[0/2](243700/4588595) loss:2.737 lr:0.0000100 epoch_Time:27468.0min: [2024-01-03 19:20:27,667][model8_pretrain.py][INFO] Epoch:[0/2](243700/4588595) loss:2.433 lr:0.0000100 epoch_Time:27468.0min: [2024-01-03 19:21:04,594][model8_pretrain.py][INFO] Epoch:[0/2](243800/4588595) loss:2.791 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:21:04,594][model8_pretrain.py][INFO] Epoch:[0/2](243800/4588595) loss:2.448 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:21:04,594][model8_pretrain.py][INFO] Epoch:[0/2](243800/4588595) loss:2.189 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:21:04,594][model8_pretrain.py][INFO] Epoch:[0/2](243800/4588595) loss:2.459 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:21:04,594][model8_pretrain.py][INFO] Epoch:[0/2](243800/4588595) loss:3.181 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:21:04,594][model8_pretrain.py][INFO] Epoch:[0/2](243800/4588595) loss:2.944 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:21:04,594][model8_pretrain.py][INFO] Epoch:[0/2](243800/4588595) loss:2.610 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:21:04,594][model8_pretrain.py][INFO] Epoch:[0/2](243800/4588595) loss:2.867 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:21:41,520][model8_pretrain.py][INFO] Epoch:[0/2](243900/4588595) loss:3.080 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:21:41,520][model8_pretrain.py][INFO] Epoch:[0/2](243900/4588595) loss:3.290 lr:0.0000100 
epoch_Time:27467.0min: [2024-01-03 19:21:41,520][model8_pretrain.py][INFO] Epoch:[0/2](243900/4588595) loss:2.804 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:21:41,520][model8_pretrain.py][INFO] Epoch:[0/2](243900/4588595) loss:2.904 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:21:41,520][model8_pretrain.py][INFO] Epoch:[0/2](243900/4588595) loss:3.016 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:21:41,520][model8_pretrain.py][INFO] Epoch:[0/2](243900/4588595) loss:2.740 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:21:41,520][model8_pretrain.py][INFO] Epoch:[0/2](243900/4588595) loss:2.935 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:21:41,520][model8_pretrain.py][INFO] Epoch:[0/2](243900/4588595) loss:2.962 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:22:28,880][model8_pretrain.py][INFO] Epoch:[0/2](244000/4588595) loss:3.431 lr:0.0000100 epoch_Time:27469.0min: [2024-01-03 19:22:28,880][model8_pretrain.py][INFO] Epoch:[0/2](244000/4588595) loss:2.673 lr:0.0000100 epoch_Time:27469.0min: [2024-01-03 19:22:28,880][model8_pretrain.py][INFO] Epoch:[0/2](244000/4588595) loss:2.924 lr:0.0000100 epoch_Time:27469.0min: [2024-01-03 19:22:28,880][model8_pretrain.py][INFO] Epoch:[0/2](244000/4588595) loss:3.031 lr:0.0000100 epoch_Time:27469.0min: [2024-01-03 19:22:28,880][model8_pretrain.py][INFO] Epoch:[0/2](244000/4588595) loss:2.644 lr:0.0000100 epoch_Time:27469.0min: [2024-01-03 19:22:28,881][model8_pretrain.py][INFO] Epoch:[0/2](244000/4588595) loss:2.898 lr:0.0000100 epoch_Time:27469.0min: [2024-01-03 19:22:28,881][model8_pretrain.py][INFO] Epoch:[0/2](244000/4588595) loss:2.736 lr:0.0000100 epoch_Time:27469.0min: [2024-01-03 19:22:28,881][model8_pretrain.py][INFO] Epoch:[0/2](244000/4588595) loss:3.267 lr:0.0000100 epoch_Time:27469.0min: [2024-01-03 19:23:05,796][model8_pretrain.py][INFO] Epoch:[0/2](244100/4588595) loss:2.830 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:23:05,796][model8_pretrain.py][INFO] 
Epoch:[0/2](244100/4588595) loss:3.030 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:23:05,796][model8_pretrain.py][INFO] Epoch:[0/2](244100/4588595) loss:3.230 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:23:05,796][model8_pretrain.py][INFO] Epoch:[0/2](244100/4588595) loss:3.115 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:23:05,796][model8_pretrain.py][INFO] Epoch:[0/2](244100/4588595) loss:2.688 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:23:05,796][model8_pretrain.py][INFO] Epoch:[0/2](244100/4588595) loss:2.431 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:23:05,796][model8_pretrain.py][INFO] Epoch:[0/2](244100/4588595) loss:3.233 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:23:05,796][model8_pretrain.py][INFO] Epoch:[0/2](244100/4588595) loss:2.912 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:23:42,725][model8_pretrain.py][INFO] Epoch:[0/2](244200/4588595) loss:2.432 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:23:42,725][model8_pretrain.py][INFO] Epoch:[0/2](244200/4588595) loss:3.077 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:23:42,725][model8_pretrain.py][INFO] Epoch:[0/2](244200/4588595) loss:2.864 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:23:42,725][model8_pretrain.py][INFO] Epoch:[0/2](244200/4588595) loss:2.843 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:23:42,725][model8_pretrain.py][INFO] Epoch:[0/2](244200/4588595) loss:3.038 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:23:42,725][model8_pretrain.py][INFO] Epoch:[0/2](244200/4588595) loss:3.227 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:23:42,725][model8_pretrain.py][INFO] Epoch:[0/2](244200/4588595) loss:2.876 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:23:42,726][model8_pretrain.py][INFO] Epoch:[0/2](244200/4588595) loss:2.727 lr:0.0000100 epoch_Time:27467.0min: [2024-01-03 19:24:19,666][model8_pretrain.py][INFO] Epoch:[0/2](244300/4588595) loss:2.728 lr:0.0000100 epoch_Time:27466.0min: [2024-01-03 
19:24:19,666][model8_pretrain.py][INFO] Epoch:[0/2](244300/4588595) loss:2.549 lr:0.0000100 epoch_Time:27466.0min: [2024-01-03 19:24:19,666][model8_pretrain.py][INFO] Epoch:[0/2](244300/4588595) loss:3.183 lr:0.0000100 epoch_Time:27466.0min: [2024-01-03 19:24:19,666][model8_pretrain.py][INFO] Epoch:[0/2](244300/4588595) loss:3.440 lr:0.0000100 epoch_Time:27466.0min: [2024-01-03 19:24:19,666][model8_pretrain.py][INFO] Epoch:[0/2](244300/4588595) loss:3.346 lr:0.0000100 epoch_Time:27466.0min: [2024-01-03 19:24:19,666][model8_pretrain.py][INFO] Epoch:[0/2](244300/4588595) loss:2.595 lr:0.0000100 epoch_Time:27466.0min: [2024-01-03 19:24:19,666][model8_pretrain.py][INFO] Epoch:[0/2](244300/4588595) loss:2.929 lr:0.0000100 epoch_Time:27466.0min: [2024-01-03 19:24:19,666][model8_pretrain.py][INFO] Epoch:[0/2](244300/4588595) loss:2.703 lr:0.0000100 epoch_Time:27466.0min: [2024-01-03 19:24:56,593][model8_pretrain.py][INFO] Epoch:[0/2](244400/4588595) loss:3.048 lr:0.0000100 epoch_Time:27464.0min: [2024-01-03 19:24:56,593][model8_pretrain.py][INFO] Epoch:[0/2](244400/4588595) loss:3.306 lr:0.0000100 epoch_Time:27464.0min: [2024-01-03 19:24:56,593][model8_pretrain.py][INFO] Epoch:[0/2](244400/4588595) loss:2.946 lr:0.0000100 epoch_Time:27464.0min: [2024-01-03 19:24:56,593][model8_pretrain.py][INFO] Epoch:[0/2](244400/4588595) loss:2.550 lr:0.0000100 epoch_Time:27464.0min: [2024-01-03 19:24:56,593][model8_pretrain.py][INFO] Epoch:[0/2](244400/4588595) loss:3.157 lr:0.0000100 epoch_Time:27464.0min: [2024-01-03 19:24:56,593][model8_pretrain.py][INFO] Epoch:[0/2](244400/4588595) loss:2.616 lr:0.0000100 epoch_Time:27464.0min: [2024-01-03 19:24:56,593][model8_pretrain.py][INFO] Epoch:[0/2](244400/4588595) loss:3.371 lr:0.0000100 epoch_Time:27464.0min: [2024-01-03 19:24:56,593][model8_pretrain.py][INFO] Epoch:[0/2](244400/4588595) loss:3.058 lr:0.0000100 epoch_Time:27464.0min: [2024-01-03 19:25:33,522][model8_pretrain.py][INFO] Epoch:[0/2](244500/4588595) loss:3.121 lr:0.0000100 
epoch_Time:27464.0min: [2024-01-03 19:25:33,522][model8_pretrain.py][INFO] Epoch:[0/2](244500/4588595) loss:2.744 lr:0.0000100 epoch_Time:27464.0min: [2024-01-03 19:25:33,522][model8_pretrain.py][INFO] Epoch:[0/2](244500/4588595) loss:2.823 lr:0.0000100 epoch_Time:27464.0min: [2024-01-03 19:25:33,522][model8_pretrain.py][INFO] Epoch:[0/2](244500/4588595) loss:2.394 lr:0.0000100 epoch_Time:27464.0min: [2024-01-03 19:25:33,522][model8_pretrain.py][INFO] Epoch:[0/2](244500/4588595) loss:3.292 lr:0.0000100 epoch_Time:27464.0min: [2024-01-03 19:25:33,522][model8_pretrain.py][INFO] Epoch:[0/2](244500/4588595) loss:3.131 lr:0.0000100 epoch_Time:27464.0min: [2024-01-03 19:25:33,522][model8_pretrain.py][INFO] Epoch:[0/2](244500/4588595) loss:2.560 lr:0.0000100 epoch_Time:27464.0min: [2024-01-03 19:25:33,522][model8_pretrain.py][INFO] Epoch:[0/2](244500/4588595) loss:2.854 lr:0.0000100 epoch_Time:27464.0min: [2024-01-03 19:26:10,449][model8_pretrain.py][INFO] Epoch:[0/2](244600/4588595) loss:3.012 lr:0.0000100 epoch_Time:27463.0min: [2024-01-03 19:26:10,449][model8_pretrain.py][INFO] Epoch:[0/2](244600/4588595) loss:2.406 lr:0.0000100 epoch_Time:27463.0min: [2024-01-03 19:26:10,449][model8_pretrain.py][INFO] Epoch:[0/2](244600/4588595) loss:3.137 lr:0.0000100 epoch_Time:27463.0min: [2024-01-03 19:26:10,449][model8_pretrain.py][INFO] Epoch:[0/2](244600/4588595) loss:2.966 lr:0.0000100 epoch_Time:27463.0min: [2024-01-03 19:26:10,449][model8_pretrain.py][INFO] Epoch:[0/2](244600/4588595) loss:3.494 lr:0.0000100 epoch_Time:27463.0min: [2024-01-03 19:26:10,449][model8_pretrain.py][INFO] Epoch:[0/2](244600/4588595) loss:3.040 lr:0.0000100 epoch_Time:27463.0min: [2024-01-03 19:26:10,449][model8_pretrain.py][INFO] Epoch:[0/2](244600/4588595) loss:3.266 lr:0.0000100 epoch_Time:27463.0min: [2024-01-03 19:26:10,449][model8_pretrain.py][INFO] Epoch:[0/2](244600/4588595) loss:2.808 lr:0.0000100 epoch_Time:27463.0min: [2024-01-03 19:26:47,374][model8_pretrain.py][INFO] 
Epoch:[0/2](244700/4588595) loss:3.299 lr:0.0000100 epoch_Time:27463.0min: [2024-01-03 19:26:47,374][model8_pretrain.py][INFO] Epoch:[0/2](244700/4588595) loss:2.901 lr:0.0000100 epoch_Time:27463.0min: [2024-01-03 19:26:47,374][model8_pretrain.py][INFO] Epoch:[0/2](244700/4588595) loss:3.108 lr:0.0000100 epoch_Time:27463.0min: [2024-01-03 19:26:47,374][model8_pretrain.py][INFO] Epoch:[0/2](244700/4588595) loss:3.310 lr:0.0000100 epoch_Time:27463.0min: [2024-01-03 19:26:47,374][model8_pretrain.py][INFO] Epoch:[0/2](244700/4588595) loss:2.723 lr:0.0000100 epoch_Time:27463.0min: [2024-01-03 19:26:47,374][model8_pretrain.py][INFO] Epoch:[0/2](244700/4588595) loss:2.695 lr:0.0000100 epoch_Time:27463.0min: [2024-01-03 19:26:47,374][model8_pretrain.py][INFO] Epoch:[0/2](244700/4588595) loss:2.849 lr:0.0000100 epoch_Time:27463.0min: [2024-01-03 19:26:47,374][model8_pretrain.py][INFO] Epoch:[0/2](244700/4588595) loss:2.746 lr:0.0000100 epoch_Time:27463.0min: [2024-01-03 19:27:34,652][model8_pretrain.py][INFO] Epoch:[0/2](244800/4588595) loss:2.803 lr:0.0000100 epoch_Time:27464.0min: [2024-01-03 19:27:34,652][model8_pretrain.py][INFO] Epoch:[0/2](244800/4588595) loss:3.325 lr:0.0000100 epoch_Time:27464.0min: [2024-01-03 19:27:34,652][model8_pretrain.py][INFO] Epoch:[0/2](244800/4588595) loss:3.063 lr:0.0000100 epoch_Time:27464.0min: [2024-01-03 19:27:34,652][model8_pretrain.py][INFO] Epoch:[0/2](244800/4588595) loss:2.568 lr:0.0000100 epoch_Time:27464.0min: [2024-01-03 19:27:34,652][model8_pretrain.py][INFO] Epoch:[0/2](244800/4588595) loss:3.138 lr:0.0000100 epoch_Time:27464.0min: [2024-01-03 19:27:34,652][model8_pretrain.py][INFO] Epoch:[0/2](244800/4588595) loss:3.082 lr:0.0000100 epoch_Time:27464.0min: [2024-01-03 19:27:34,652][model8_pretrain.py][INFO] Epoch:[0/2](244800/4588595) loss:3.230 lr:0.0000100 epoch_Time:27464.0min: [2024-01-03 19:27:34,653][model8_pretrain.py][INFO] Epoch:[0/2](244800/4588595) loss:3.230 lr:0.0000100 epoch_Time:27464.0min: [2024-01-03 
19:28:11,577][model8_pretrain.py][INFO] Epoch:[0/2](244900/4588595) loss:2.794 lr:0.0000100 epoch_Time:27463.0min: [2024-01-03 19:28:11,577][model8_pretrain.py][INFO] Epoch:[0/2](244900/4588595) loss:3.464 lr:0.0000100 epoch_Time:27463.0min: [2024-01-03 19:28:11,577][model8_pretrain.py][INFO] Epoch:[0/2](244900/4588595) loss:2.776 lr:0.0000100 epoch_Time:27463.0min: [2024-01-03 19:28:11,577][model8_pretrain.py][INFO] Epoch:[0/2](244900/4588595) loss:2.775 lr:0.0000100 epoch_Time:27463.0min: [2024-01-03 19:28:11,577][model8_pretrain.py][INFO] Epoch:[0/2](244900/4588595) loss:3.008 lr:0.0000100 epoch_Time:27463.0min: [2024-01-03 19:28:11,577][model8_pretrain.py][INFO] Epoch:[0/2](244900/4588595) loss:2.787 lr:0.0000100 epoch_Time:27463.0min: [2024-01-03 19:28:11,577][model8_pretrain.py][INFO] Epoch:[0/2](244900/4588595) loss:2.878 lr:0.0000100 epoch_Time:27463.0min: [2024-01-03 19:28:11,577][model8_pretrain.py][INFO] Epoch:[0/2](244900/4588595) loss:2.960 lr:0.0000100 epoch_Time:27463.0min: [2024-01-03 19:28:48,521][model8_pretrain.py][INFO] Epoch:[0/2](245000/4588595) loss:2.836 lr:0.0000100 epoch_Time:27462.0min: [2024-01-03 19:28:48,521][model8_pretrain.py][INFO] Epoch:[0/2](245000/4588595) loss:3.093 lr:0.0000100 epoch_Time:27462.0min: [2024-01-03 19:28:48,521][model8_pretrain.py][INFO] Epoch:[0/2](245000/4588595) loss:3.309 lr:0.0000100 epoch_Time:27462.0min: [2024-01-03 19:28:48,521][model8_pretrain.py][INFO] Epoch:[0/2](245000/4588595) loss:2.472 lr:0.0000100 epoch_Time:27462.0min: [2024-01-03 19:28:48,521][model8_pretrain.py][INFO] Epoch:[0/2](245000/4588595) loss:2.936 lr:0.0000100 epoch_Time:27462.0min: [2024-01-03 19:28:48,521][model8_pretrain.py][INFO] Epoch:[0/2](245000/4588595) loss:3.360 lr:0.0000100 epoch_Time:27462.0min: [2024-01-03 19:28:48,521][model8_pretrain.py][INFO] Epoch:[0/2](245000/4588595) loss:3.186 lr:0.0000100 epoch_Time:27462.0min: [2024-01-03 19:28:48,521][model8_pretrain.py][INFO] Epoch:[0/2](245000/4588595) loss:3.235 lr:0.0000100 
epoch_Time:27462.0min: [2024-01-03 19:29:25,461][model8_pretrain.py][INFO] Epoch:[0/2](245100/4588595) loss:3.053 lr:0.0000100 epoch_Time:27462.0min: [2024-01-03 19:29:25,461][model8_pretrain.py][INFO] Epoch:[0/2](245100/4588595) loss:2.400 lr:0.0000100 epoch_Time:27462.0min: [2024-01-03 19:29:25,461][model8_pretrain.py][INFO] Epoch:[0/2](245100/4588595) loss:2.827 lr:0.0000100 epoch_Time:27462.0min: [2024-01-03 19:29:25,461][model8_pretrain.py][INFO] Epoch:[0/2](245100/4588595) loss:2.503 lr:0.0000100 epoch_Time:27462.0min: [2024-01-03 19:29:25,461][model8_pretrain.py][INFO] Epoch:[0/2](245100/4588595) loss:2.740 lr:0.0000100 epoch_Time:27462.0min: [2024-01-03 19:29:25,462][model8_pretrain.py][INFO] Epoch:[0/2](245100/4588595) loss:3.063 lr:0.0000100 epoch_Time:27462.0min: [2024-01-03 19:29:25,462][model8_pretrain.py][INFO] Epoch:[0/2](245100/4588595) loss:2.877 lr:0.0000100 epoch_Time:27462.0min: [2024-01-03 19:29:25,462][model8_pretrain.py][INFO] Epoch:[0/2](245100/4588595) loss:2.990 lr:0.0000100 epoch_Time:27462.0min: [2024-01-03 19:30:02,410][model8_pretrain.py][INFO] Epoch:[0/2](245200/4588595) loss:2.952 lr:0.0000100 epoch_Time:27460.0min: [2024-01-03 19:30:02,410][model8_pretrain.py][INFO] Epoch:[0/2](245200/4588595) loss:2.206 lr:0.0000100 epoch_Time:27460.0min: [2024-01-03 19:30:02,410][model8_pretrain.py][INFO] Epoch:[0/2](245200/4588595) loss:2.598 lr:0.0000100 epoch_Time:27460.0min: [2024-01-03 19:30:02,410][model8_pretrain.py][INFO] Epoch:[0/2](245200/4588595) loss:2.821 lr:0.0000100 epoch_Time:27460.0min: [2024-01-03 19:30:02,410][model8_pretrain.py][INFO] Epoch:[0/2](245200/4588595) loss:2.917 lr:0.0000100 epoch_Time:27460.0min: [2024-01-03 19:30:02,410][model8_pretrain.py][INFO] Epoch:[0/2](245200/4588595) loss:2.916 lr:0.0000100 epoch_Time:27460.0min: [2024-01-03 19:30:02,410][model8_pretrain.py][INFO] Epoch:[0/2](245200/4588595) loss:2.821 lr:0.0000100 epoch_Time:27460.0min: [2024-01-03 19:30:02,411][model8_pretrain.py][INFO] 
Epoch:[0/2](245200/4588595) loss:3.207 lr:0.0000100 epoch_Time:27460.0min: [2024-01-03 19:30:39,353][model8_pretrain.py][INFO] Epoch:[0/2](245300/4588595) loss:3.034 lr:0.0000100 epoch_Time:27460.0min: [2024-01-03 19:30:39,353][model8_pretrain.py][INFO] Epoch:[0/2](245300/4588595) loss:3.193 lr:0.0000100 epoch_Time:27460.0min: [2024-01-03 19:30:39,353][model8_pretrain.py][INFO] Epoch:[0/2](245300/4588595) loss:2.741 lr:0.0000100 epoch_Time:27460.0min: [2024-01-03 19:30:39,353][model8_pretrain.py][INFO] Epoch:[0/2](245300/4588595) loss:2.368 lr:0.0000100 epoch_Time:27460.0min: [2024-01-03 19:30:39,354][model8_pretrain.py][INFO] Epoch:[0/2](245300/4588595) loss:3.634 lr:0.0000100 epoch_Time:27460.0min: [2024-01-03 19:30:39,354][model8_pretrain.py][INFO] Epoch:[0/2](245300/4588595) loss:2.929 lr:0.0000100 epoch_Time:27460.0min: [2024-01-03 19:30:39,354][model8_pretrain.py][INFO] Epoch:[0/2](245300/4588595) loss:2.624 lr:0.0000100 epoch_Time:27460.0min: [2024-01-03 19:30:39,354][model8_pretrain.py][INFO] Epoch:[0/2](245300/4588595) loss:2.921 lr:0.0000100 epoch_Time:27460.0min: [2024-01-03 19:31:16,290][model8_pretrain.py][INFO] Epoch:[0/2](245400/4588595) loss:3.323 lr:0.0000100 epoch_Time:27459.0min: [2024-01-03 19:31:16,291][model8_pretrain.py][INFO] Epoch:[0/2](245400/4588595) loss:2.920 lr:0.0000100 epoch_Time:27459.0min: [2024-01-03 19:31:16,291][model8_pretrain.py][INFO] Epoch:[0/2](245400/4588595) loss:2.941 lr:0.0000100 epoch_Time:27459.0min: [2024-01-03 19:31:16,291][model8_pretrain.py][INFO] Epoch:[0/2](245400/4588595) loss:2.679 lr:0.0000100 epoch_Time:27459.0min: [2024-01-03 19:31:16,291][model8_pretrain.py][INFO] Epoch:[0/2](245400/4588595) loss:3.143 lr:0.0000100 epoch_Time:27459.0min: [2024-01-03 19:31:16,291][model8_pretrain.py][INFO] Epoch:[0/2](245400/4588595) loss:2.653 lr:0.0000100 epoch_Time:27459.0min: [2024-01-03 19:31:16,291][model8_pretrain.py][INFO] Epoch:[0/2](245400/4588595) loss:3.549 lr:0.0000100 epoch_Time:27459.0min: [2024-01-03 
19:31:16,291][model8_pretrain.py][INFO] Epoch:[0/2](245400/4588595) loss:2.941 lr:0.0000100 epoch_Time:27459.0min: [2024-01-03 19:31:53,226][model8_pretrain.py][INFO] Epoch:[0/2](245500/4588595) loss:3.063 lr:0.0000100 epoch_Time:27457.0min: [2024-01-03 19:31:53,226][model8_pretrain.py][INFO] Epoch:[0/2](245500/4588595) loss:3.066 lr:0.0000100 epoch_Time:27457.0min: [2024-01-03 19:31:53,226][model8_pretrain.py][INFO] Epoch:[0/2](245500/4588595) loss:2.756 lr:0.0000100 epoch_Time:27457.0min: [2024-01-03 19:31:53,226][model8_pretrain.py][INFO] Epoch:[0/2](245500/4588595) loss:2.530 lr:0.0000100 epoch_Time:27457.0min: [2024-01-03 19:31:53,226][model8_pretrain.py][INFO] Epoch:[0/2](245500/4588595) loss:2.796 lr:0.0000100 epoch_Time:27457.0min: [2024-01-03 19:31:53,226][model8_pretrain.py][INFO] Epoch:[0/2](245500/4588595) loss:2.428 lr:0.0000100 epoch_Time:27457.0min: [2024-01-03 19:31:53,227][model8_pretrain.py][INFO] Epoch:[0/2](245500/4588595) loss:3.066 lr:0.0000100 epoch_Time:27457.0min: [2024-01-03 19:31:53,227][model8_pretrain.py][INFO] Epoch:[0/2](245500/4588595) loss:2.925 lr:0.0000100 epoch_Time:27457.0min: [2024-01-03 19:32:40,467][model8_pretrain.py][INFO] Epoch:[0/2](245600/4588595) loss:3.233 lr:0.0000100 epoch_Time:27460.0min: [2024-01-03 19:32:40,467][model8_pretrain.py][INFO] Epoch:[0/2](245600/4588595) loss:2.920 lr:0.0000100 epoch_Time:27460.0min: [2024-01-03 19:32:40,467][model8_pretrain.py][INFO] Epoch:[0/2](245600/4588595) loss:2.776 lr:0.0000100 epoch_Time:27460.0min: [2024-01-03 19:32:40,467][model8_pretrain.py][INFO] Epoch:[0/2](245600/4588595) loss:3.278 lr:0.0000100 epoch_Time:27460.0min: [2024-01-03 19:32:40,467][model8_pretrain.py][INFO] Epoch:[0/2](245600/4588595) loss:3.258 lr:0.0000100 epoch_Time:27460.0min: [2024-01-03 19:32:40,467][model8_pretrain.py][INFO] Epoch:[0/2](245600/4588595) loss:2.764 lr:0.0000100 epoch_Time:27460.0min: [2024-01-03 19:32:40,467][model8_pretrain.py][INFO] Epoch:[0/2](245600/4588595) loss:2.588 lr:0.0000100 
epoch_Time:27460.0min: [2024-01-03 19:32:40,467][model8_pretrain.py][INFO] Epoch:[0/2](245600/4588595) loss:2.332 lr:0.0000100 epoch_Time:27460.0min: [2024-01-03 19:33:17,391][model8_pretrain.py][INFO] Epoch:[0/2](245700/4588595) loss:2.968 lr:0.0000100 epoch_Time:27459.0min: [2024-01-03 19:33:17,391][model8_pretrain.py][INFO] Epoch:[0/2](245700/4588595) loss:2.893 lr:0.0000100 epoch_Time:27459.0min: [2024-01-03 19:33:17,391][model8_pretrain.py][INFO] Epoch:[0/2](245700/4588595) loss:2.496 lr:0.0000100 epoch_Time:27459.0min: [2024-01-03 19:33:17,391][model8_pretrain.py][INFO] Epoch:[0/2](245700/4588595) loss:3.153 lr:0.0000100 epoch_Time:27459.0min: [2024-01-03 19:33:17,391][model8_pretrain.py][INFO] Epoch:[0/2](245700/4588595) loss:2.988 lr:0.0000100 epoch_Time:27459.0min: [2024-01-03 19:33:17,391][model8_pretrain.py][INFO] Epoch:[0/2](245700/4588595) loss:2.849 lr:0.0000100 epoch_Time:27459.0min: [2024-01-03 19:33:17,391][model8_pretrain.py][INFO] Epoch:[0/2](245700/4588595) loss:2.796 lr:0.0000100 epoch_Time:27459.0min: [2024-01-03 19:33:17,391][model8_pretrain.py][INFO] Epoch:[0/2](245700/4588595) loss:2.532 lr:0.0000100 epoch_Time:27459.0min: [2024-01-03 19:33:54,334][model8_pretrain.py][INFO] Epoch:[0/2](245800/4588595) loss:3.286 lr:0.0000100 epoch_Time:27458.0min: [2024-01-03 19:33:54,334][model8_pretrain.py][INFO] Epoch:[0/2](245800/4588595) loss:2.314 lr:0.0000100 epoch_Time:27458.0min: [2024-01-03 19:33:54,334][model8_pretrain.py][INFO] Epoch:[0/2](245800/4588595) loss:2.854 lr:0.0000100 epoch_Time:27458.0min: [2024-01-03 19:33:54,334][model8_pretrain.py][INFO] Epoch:[0/2](245800/4588595) loss:2.500 lr:0.0000100 epoch_Time:27458.0min: [2024-01-03 19:33:54,334][model8_pretrain.py][INFO] Epoch:[0/2](245800/4588595) loss:3.649 lr:0.0000100 epoch_Time:27458.0min: [2024-01-03 19:33:54,334][model8_pretrain.py][INFO] Epoch:[0/2](245800/4588595) loss:2.351 lr:0.0000100 epoch_Time:27458.0min: [2024-01-03 19:33:54,334][model8_pretrain.py][INFO] 
Epoch:[0/2](245800/4588595) loss:3.159 lr:0.0000100 epoch_Time:27458.0min: [2024-01-03 19:33:54,334][model8_pretrain.py][INFO] Epoch:[0/2](245800/4588595) loss:3.005 lr:0.0000100 epoch_Time:27458.0min: [2024-01-03 19:34:31,286][model8_pretrain.py][INFO] Epoch:[0/2](245900/4588595) loss:2.974 lr:0.0000100 epoch_Time:27457.0min: [2024-01-03 19:34:31,286][model8_pretrain.py][INFO] Epoch:[0/2](245900/4588595) loss:2.843 lr:0.0000100 epoch_Time:27457.0min: [2024-01-03 19:34:31,286][model8_pretrain.py][INFO] Epoch:[0/2](245900/4588595) loss:2.912 lr:0.0000100 epoch_Time:27457.0min: [2024-01-03 19:34:31,286][model8_pretrain.py][INFO] Epoch:[0/2](245900/4588595) loss:3.181 lr:0.0000100 epoch_Time:27457.0min: [2024-01-03 19:34:31,286][model8_pretrain.py][INFO] Epoch:[0/2](245900/4588595) loss:3.445 lr:0.0000100 epoch_Time:27457.0min: [2024-01-03 19:34:31,286][model8_pretrain.py][INFO] Epoch:[0/2](245900/4588595) loss:2.991 lr:0.0000100 epoch_Time:27457.0min: [2024-01-03 19:34:31,286][model8_pretrain.py][INFO] Epoch:[0/2](245900/4588595) loss:2.810 lr:0.0000100 epoch_Time:27457.0min: [2024-01-03 19:34:31,286][model8_pretrain.py][INFO] Epoch:[0/2](245900/4588595) loss:3.251 lr:0.0000100 epoch_Time:27457.0min: [2024-01-03 19:35:08,220][model8_pretrain.py][INFO] Epoch:[0/2](246000/4588595) loss:2.726 lr:0.0000100 epoch_Time:27456.0min: [2024-01-03 19:35:08,220][model8_pretrain.py][INFO] Epoch:[0/2](246000/4588595) loss:2.996 lr:0.0000100 epoch_Time:27456.0min: [2024-01-03 19:35:08,220][model8_pretrain.py][INFO] Epoch:[0/2](246000/4588595) loss:3.408 lr:0.0000100 epoch_Time:27456.0min: [2024-01-03 19:35:08,220][model8_pretrain.py][INFO] Epoch:[0/2](246000/4588595) loss:3.043 lr:0.0000100 epoch_Time:27456.0min: [2024-01-03 19:35:08,220][model8_pretrain.py][INFO] Epoch:[0/2](246000/4588595) loss:2.865 lr:0.0000100 epoch_Time:27456.0min: [2024-01-03 19:35:08,220][model8_pretrain.py][INFO] Epoch:[0/2](246000/4588595) loss:2.475 lr:0.0000100 epoch_Time:27456.0min: [2024-01-03 
19:35:08,220][model8_pretrain.py][INFO] Epoch:[0/2](246000/4588595) loss:2.414 lr:0.0000100 epoch_Time:27456.0min: [2024-01-03 19:35:08,220][model8_pretrain.py][INFO] Epoch:[0/2](246000/4588595) loss:3.140 lr:0.0000100 epoch_Time:27456.0min: [2024-01-03 19:35:45,163][model8_pretrain.py][INFO] Epoch:[0/2](246100/4588595) loss:2.760 lr:0.0000100 epoch_Time:27456.0min: [2024-01-03 19:35:45,163][model8_pretrain.py][INFO] Epoch:[0/2](246100/4588595) loss:2.654 lr:0.0000100 epoch_Time:27456.0min: [2024-01-03 19:35:45,163][model8_pretrain.py][INFO] Epoch:[0/2](246100/4588595) loss:3.014 lr:0.0000100 epoch_Time:27456.0min: [2024-01-03 19:35:45,163][model8_pretrain.py][INFO] Epoch:[0/2](246100/4588595) loss:3.134 lr:0.0000100 epoch_Time:27456.0min: [2024-01-03 19:35:45,163][model8_pretrain.py][INFO] Epoch:[0/2](246100/4588595) loss:3.351 lr:0.0000100 epoch_Time:27456.0min: [2024-01-03 19:35:45,163][model8_pretrain.py][INFO] Epoch:[0/2](246100/4588595) loss:2.669 lr:0.0000100 epoch_Time:27456.0min: [2024-01-03 19:35:45,163][model8_pretrain.py][INFO] Epoch:[0/2](246100/4588595) loss:3.037 lr:0.0000100 epoch_Time:27456.0min: [2024-01-03 19:35:45,163][model8_pretrain.py][INFO] Epoch:[0/2](246100/4588595) loss:3.217 lr:0.0000100 epoch_Time:27456.0min: [2024-01-03 19:36:22,093][model8_pretrain.py][INFO] Epoch:[0/2](246200/4588595) loss:2.469 lr:0.0000100 epoch_Time:27454.0min: [2024-01-03 19:36:22,093][model8_pretrain.py][INFO] Epoch:[0/2](246200/4588595) loss:2.684 lr:0.0000100 epoch_Time:27454.0min: [2024-01-03 19:36:22,093][model8_pretrain.py][INFO] Epoch:[0/2](246200/4588595) loss:3.209 lr:0.0000100 epoch_Time:27454.0min: [2024-01-03 19:36:22,093][model8_pretrain.py][INFO] Epoch:[0/2](246200/4588595) loss:3.086 lr:0.0000100 epoch_Time:27454.0min: [2024-01-03 19:36:22,093][model8_pretrain.py][INFO] Epoch:[0/2](246200/4588595) loss:3.367 lr:0.0000100 epoch_Time:27454.0min: [2024-01-03 19:36:22,093][model8_pretrain.py][INFO] Epoch:[0/2](246200/4588595) loss:1.948 lr:0.0000100 
epoch_Time:27454.0min: [2024-01-03 19:36:22,093][model8_pretrain.py][INFO] Epoch:[0/2](246200/4588595) loss:2.862 lr:0.0000100 epoch_Time:27454.0min: [2024-01-03 19:36:22,093][model8_pretrain.py][INFO] Epoch:[0/2](246200/4588595) loss:3.183 lr:0.0000100 epoch_Time:27454.0min: [2024-01-03 19:36:59,028][model8_pretrain.py][INFO] Epoch:[0/2](246300/4588595) loss:2.807 lr:0.0000100 epoch_Time:27453.0min: [2024-01-03 19:36:59,028][model8_pretrain.py][INFO] Epoch:[0/2](246300/4588595) loss:2.524 lr:0.0000100 epoch_Time:27453.0min: [2024-01-03 19:36:59,028][model8_pretrain.py][INFO] Epoch:[0/2](246300/4588595) loss:3.063 lr:0.0000100 epoch_Time:27453.0min: [2024-01-03 19:36:59,028][model8_pretrain.py][INFO] Epoch:[0/2](246300/4588595) loss:2.992 lr:0.0000100 epoch_Time:27453.0min: [2024-01-03 19:36:59,028][model8_pretrain.py][INFO] Epoch:[0/2](246300/4588595) loss:3.282 lr:0.0000100 epoch_Time:27453.0min: [2024-01-03 19:36:59,028][model8_pretrain.py][INFO] Epoch:[0/2](246300/4588595) loss:2.613 lr:0.0000100 epoch_Time:27453.0min: [2024-01-03 19:36:59,028][model8_pretrain.py][INFO] Epoch:[0/2](246300/4588595) loss:2.209 lr:0.0000100 epoch_Time:27453.0min: [2024-01-03 19:36:59,029][model8_pretrain.py][INFO] Epoch:[0/2](246300/4588595) loss:3.362 lr:0.0000100 epoch_Time:27453.0min: [2024-01-03 19:37:44,358][model8_pretrain.py][INFO] Epoch:[0/2](246400/4588595) loss:2.337 lr:0.0000100 epoch_Time:27455.0min: [2024-01-03 19:37:44,358][model8_pretrain.py][INFO] Epoch:[0/2](246400/4588595) loss:2.919 lr:0.0000100 epoch_Time:27455.0min: [2024-01-03 19:37:44,358][model8_pretrain.py][INFO] Epoch:[0/2](246400/4588595) loss:3.055 lr:0.0000100 epoch_Time:27455.0min: [2024-01-03 19:37:44,358][model8_pretrain.py][INFO] Epoch:[0/2](246400/4588595) loss:2.823 lr:0.0000100 epoch_Time:27455.0min: [2024-01-03 19:37:44,358][model8_pretrain.py][INFO] Epoch:[0/2](246400/4588595) loss:2.685 lr:0.0000100 epoch_Time:27455.0min: [2024-01-03 19:37:44,358][model8_pretrain.py][INFO] 
Epoch:[0/2](246400/4588595) loss:2.959 lr:0.0000100 epoch_Time:27455.0min: [2024-01-03 19:37:44,358][model8_pretrain.py][INFO] Epoch:[0/2](246400/4588595) loss:3.050 lr:0.0000100 epoch_Time:27455.0min: [2024-01-03 19:37:44,358][model8_pretrain.py][INFO] Epoch:[0/2](246400/4588595) loss:3.081 lr:0.0000100 epoch_Time:27455.0min: [2024-01-03 19:38:21,294][model8_pretrain.py][INFO] Epoch:[0/2](246500/4588595) loss:2.496 lr:0.0000100 epoch_Time:27454.0min: [2024-01-03 19:38:21,294][model8_pretrain.py][INFO] Epoch:[0/2](246500/4588595) loss:3.052 lr:0.0000100 epoch_Time:27454.0min: [2024-01-03 19:38:21,294][model8_pretrain.py][INFO] Epoch:[0/2](246500/4588595) loss:3.173 lr:0.0000100 epoch_Time:27454.0min: [2024-01-03 19:38:21,294][model8_pretrain.py][INFO] Epoch:[0/2](246500/4588595) loss:2.927 lr:0.0000100 epoch_Time:27454.0min: [2024-01-03 19:38:21,294][model8_pretrain.py][INFO] Epoch:[0/2](246500/4588595) loss:2.625 lr:0.0000100 epoch_Time:27454.0min: [2024-01-03 19:38:21,294][model8_pretrain.py][INFO] Epoch:[0/2](246500/4588595) loss:2.192 lr:0.0000100 epoch_Time:27454.0min: [2024-01-03 19:38:21,294][model8_pretrain.py][INFO] Epoch:[0/2](246500/4588595) loss:2.871 lr:0.0000100 epoch_Time:27454.0min: [2024-01-03 19:38:21,294][model8_pretrain.py][INFO] Epoch:[0/2](246500/4588595) loss:2.438 lr:0.0000100 epoch_Time:27454.0min: [2024-01-03 19:38:58,223][model8_pretrain.py][INFO] Epoch:[0/2](246600/4588595) loss:3.443 lr:0.0000100 epoch_Time:27453.0min: [2024-01-03 19:38:58,223][model8_pretrain.py][INFO] Epoch:[0/2](246600/4588595) loss:2.736 lr:0.0000100 epoch_Time:27453.0min: [2024-01-03 19:38:58,223][model8_pretrain.py][INFO] Epoch:[0/2](246600/4588595) loss:3.055 lr:0.0000100 epoch_Time:27453.0min: [2024-01-03 19:38:58,223][model8_pretrain.py][INFO] Epoch:[0/2](246600/4588595) loss:2.847 lr:0.0000100 epoch_Time:27453.0min: [2024-01-03 19:38:58,223][model8_pretrain.py][INFO] Epoch:[0/2](246600/4588595) loss:3.082 lr:0.0000100 epoch_Time:27453.0min: [2024-01-03 
19:38:58,223][model8_pretrain.py][INFO] Epoch:[0/2](246600/4588595) loss:2.679 lr:0.0000100 epoch_Time:27453.0min: [2024-01-03 19:38:58,223][model8_pretrain.py][INFO] Epoch:[0/2](246600/4588595) loss:2.993 lr:0.0000100 epoch_Time:27453.0min: [2024-01-03 19:38:58,223][model8_pretrain.py][INFO] Epoch:[0/2](246600/4588595) loss:3.486 lr:0.0000100 epoch_Time:27453.0min: [2024-01-03 19:39:35,193][model8_pretrain.py][INFO] Epoch:[0/2](246700/4588595) loss:2.961 lr:0.0000100 epoch_Time:27452.0min: [2024-01-03 19:39:35,193][model8_pretrain.py][INFO] Epoch:[0/2](246700/4588595) loss:2.858 lr:0.0000100 epoch_Time:27452.0min: [2024-01-03 19:39:35,194][model8_pretrain.py][INFO] Epoch:[0/2](246700/4588595) loss:2.494 lr:0.0000100 epoch_Time:27452.0min: [2024-01-03 19:39:35,194][model8_pretrain.py][INFO] Epoch:[0/2](246700/4588595) loss:2.692 lr:0.0000100 epoch_Time:27452.0min: [2024-01-03 19:39:35,194][model8_pretrain.py][INFO] Epoch:[0/2](246700/4588595) loss:2.980 lr:0.0000100 epoch_Time:27452.0min: [2024-01-03 19:39:35,194][model8_pretrain.py][INFO] Epoch:[0/2](246700/4588595) loss:3.215 lr:0.0000100 epoch_Time:27452.0min: [2024-01-03 19:39:35,194][model8_pretrain.py][INFO] Epoch:[0/2](246700/4588595) loss:2.916 lr:0.0000100 epoch_Time:27452.0min: [2024-01-03 19:39:35,194][model8_pretrain.py][INFO] Epoch:[0/2](246700/4588595) loss:3.610 lr:0.0000100 epoch_Time:27452.0min: [2024-01-03 19:40:12,202][model8_pretrain.py][INFO] Epoch:[0/2](246800/4588595) loss:3.174 lr:0.0000100 epoch_Time:27451.0min: [2024-01-03 19:40:12,202][model8_pretrain.py][INFO] Epoch:[0/2](246800/4588595) loss:2.876 lr:0.0000100 epoch_Time:27451.0min: [2024-01-03 19:40:12,202][model8_pretrain.py][INFO] Epoch:[0/2](246800/4588595) loss:2.270 lr:0.0000100 epoch_Time:27451.0min: [2024-01-03 19:40:12,202][model8_pretrain.py][INFO] Epoch:[0/2](246800/4588595) loss:2.933 lr:0.0000100 epoch_Time:27451.0min: [2024-01-03 19:40:12,202][model8_pretrain.py][INFO] Epoch:[0/2](246800/4588595) loss:2.959 lr:0.0000100 
epoch_Time:27451.0min: [2024-01-03 19:40:12,202][model8_pretrain.py][INFO] Epoch:[0/2](246800/4588595) loss:2.564 lr:0.0000100 epoch_Time:27451.0min: [2024-01-03 19:40:12,202][model8_pretrain.py][INFO] Epoch:[0/2](246800/4588595) loss:2.960 lr:0.0000100 epoch_Time:27451.0min: [2024-01-03 19:40:12,202][model8_pretrain.py][INFO] Epoch:[0/2](246800/4588595) loss:3.174 lr:0.0000100 epoch_Time:27451.0min: [2024-01-03 19:40:49,150][model8_pretrain.py][INFO] Epoch:[0/2](246900/4588595) loss:3.493 lr:0.0000100 epoch_Time:27450.0min: [2024-01-03 19:40:49,150][model8_pretrain.py][INFO] Epoch:[0/2](246900/4588595) loss:2.907 lr:0.0000100 epoch_Time:27450.0min: [2024-01-03 19:40:49,150][model8_pretrain.py][INFO] Epoch:[0/2](246900/4588595) loss:3.255 lr:0.0000100 epoch_Time:27450.0min: [2024-01-03 19:40:49,150][model8_pretrain.py][INFO] Epoch:[0/2](246900/4588595) loss:3.034 lr:0.0000100 epoch_Time:27450.0min: [2024-01-03 19:40:49,150][model8_pretrain.py][INFO] Epoch:[0/2](246900/4588595) loss:2.973 lr:0.0000100 epoch_Time:27450.0min: [2024-01-03 19:40:49,150][model8_pretrain.py][INFO] Epoch:[0/2](246900/4588595) loss:3.231 lr:0.0000100 epoch_Time:27450.0min: [2024-01-03 19:40:49,150][model8_pretrain.py][INFO] Epoch:[0/2](246900/4588595) loss:3.075 lr:0.0000100 epoch_Time:27450.0min: [2024-01-03 19:40:49,150][model8_pretrain.py][INFO] Epoch:[0/2](246900/4588595) loss:3.676 lr:0.0000100 epoch_Time:27450.0min: [2024-01-03 19:41:26,093][model8_pretrain.py][INFO] Epoch:[0/2](247000/4588595) loss:3.082 lr:0.0000100 epoch_Time:27449.0min: [2024-01-03 19:41:26,093][model8_pretrain.py][INFO] Epoch:[0/2](247000/4588595) loss:3.029 lr:0.0000100 epoch_Time:27449.0min: [2024-01-03 19:41:26,093][model8_pretrain.py][INFO] Epoch:[0/2](247000/4588595) loss:2.568 lr:0.0000100 epoch_Time:27449.0min: [2024-01-03 19:41:26,093][model8_pretrain.py][INFO] Epoch:[0/2](247000/4588595) loss:3.267 lr:0.0000100 epoch_Time:27449.0min: [2024-01-03 19:41:26,093][model8_pretrain.py][INFO] 
Epoch:[0/2](247000/4588595) loss:2.605 lr:0.0000100 epoch_Time:27449.0min: [2024-01-03 19:41:26,093][model8_pretrain.py][INFO] Epoch:[0/2](247000/4588595) loss:3.274 lr:0.0000100 epoch_Time:27449.0min: [2024-01-03 19:41:26,093][model8_pretrain.py][INFO] Epoch:[0/2](247000/4588595) loss:2.799 lr:0.0000100 epoch_Time:27449.0min: [2024-01-03 19:41:26,093][model8_pretrain.py][INFO] Epoch:[0/2](247000/4588595) loss:2.969 lr:0.0000100 epoch_Time:27449.0min: [2024-01-03 19:42:03,040][model8_pretrain.py][INFO] Epoch:[0/2](247100/4588595) loss:2.573 lr:0.0000100 epoch_Time:27448.0min: [2024-01-03 19:42:03,040][model8_pretrain.py][INFO] Epoch:[0/2](247100/4588595) loss:2.868 lr:0.0000100 epoch_Time:27448.0min: [2024-01-03 19:42:03,040][model8_pretrain.py][INFO] Epoch:[0/2](247100/4588595) loss:2.577 lr:0.0000100 epoch_Time:27448.0min: [2024-01-03 19:42:03,040][model8_pretrain.py][INFO] Epoch:[0/2](247100/4588595) loss:2.675 lr:0.0000100 epoch_Time:27448.0min: [2024-01-03 19:42:03,040][model8_pretrain.py][INFO] Epoch:[0/2](247100/4588595) loss:2.949 lr:0.0000100 epoch_Time:27448.0min: [2024-01-03 19:42:03,040][model8_pretrain.py][INFO] Epoch:[0/2](247100/4588595) loss:2.864 lr:0.0000100 epoch_Time:27448.0min: [2024-01-03 19:42:03,040][model8_pretrain.py][INFO] Epoch:[0/2](247100/4588595) loss:2.764 lr:0.0000100 epoch_Time:27448.0min: [2024-01-03 19:42:03,040][model8_pretrain.py][INFO] Epoch:[0/2](247100/4588595) loss:2.849 lr:0.0000100 epoch_Time:27448.0min: [2024-01-03 19:42:46,973][model8_pretrain.py][INFO] Epoch:[0/2](247200/4588595) loss:2.875 lr:0.0000100 epoch_Time:27450.0min: [2024-01-03 19:42:46,973][model8_pretrain.py][INFO] Epoch:[0/2](247200/4588595) loss:2.990 lr:0.0000100 epoch_Time:27450.0min: [2024-01-03 19:42:46,973][model8_pretrain.py][INFO] Epoch:[0/2](247200/4588595) loss:2.431 lr:0.0000100 epoch_Time:27450.0min: [2024-01-03 19:42:46,973][model8_pretrain.py][INFO] Epoch:[0/2](247200/4588595) loss:3.193 lr:0.0000100 epoch_Time:27450.0min: [2024-01-03 
19:42:46,973][model8_pretrain.py][INFO] Epoch:[0/2](247200/4588595) loss:3.046 lr:0.0000100 epoch_Time:27450.0min: [2024-01-03 19:42:46,973][model8_pretrain.py][INFO] Epoch:[0/2](247200/4588595) loss:2.744 lr:0.0000100 epoch_Time:27450.0min: [2024-01-03 19:42:46,973][model8_pretrain.py][INFO] Epoch:[0/2](247200/4588595) loss:2.501 lr:0.0000100 epoch_Time:27450.0min: [2024-01-03 19:42:48,656][model8_pretrain.py][INFO] Epoch:[0/2](247200/4588595) loss:3.250 lr:0.0000100 epoch_Time:27450.0min: [2024-01-03 19:43:25,570][model8_pretrain.py][INFO] Epoch:[0/2](247300/4588595) loss:3.175 lr:0.0000100 epoch_Time:27449.0min: [2024-01-03 19:43:25,570][model8_pretrain.py][INFO] Epoch:[0/2](247300/4588595) loss:3.178 lr:0.0000100 epoch_Time:27449.0min: [2024-01-03 19:43:25,570][model8_pretrain.py][INFO] Epoch:[0/2](247300/4588595) loss:2.999 lr:0.0000100 epoch_Time:27449.0min: [2024-01-03 19:43:25,570][model8_pretrain.py][INFO] Epoch:[0/2](247300/4588595) loss:2.957 lr:0.0000100 epoch_Time:27449.0min: [2024-01-03 19:43:25,570][model8_pretrain.py][INFO] Epoch:[0/2](247300/4588595) loss:2.700 lr:0.0000100 epoch_Time:27449.0min: [2024-01-03 19:43:25,570][model8_pretrain.py][INFO] Epoch:[0/2](247300/4588595) loss:3.088 lr:0.0000100 epoch_Time:27449.0min: [2024-01-03 19:43:25,570][model8_pretrain.py][INFO] Epoch:[0/2](247300/4588595) loss:2.780 lr:0.0000100 epoch_Time:27449.0min: [2024-01-03 19:43:25,570][model8_pretrain.py][INFO] Epoch:[0/2](247300/4588595) loss:3.026 lr:0.0000100 epoch_Time:27449.0min: [2024-01-03 19:44:02,511][model8_pretrain.py][INFO] Epoch:[0/2](247400/4588595) loss:3.062 lr:0.0000100 epoch_Time:27448.0min: [2024-01-03 19:44:02,511][model8_pretrain.py][INFO] Epoch:[0/2](247400/4588595) loss:3.139 lr:0.0000100 epoch_Time:27448.0min: [2024-01-03 19:44:02,511][model8_pretrain.py][INFO] Epoch:[0/2](247400/4588595) loss:2.844 lr:0.0000100 epoch_Time:27448.0min: [2024-01-03 19:44:02,511][model8_pretrain.py][INFO] Epoch:[0/2](247400/4588595) loss:3.036 lr:0.0000100 
epoch_Time:27448.0min: [2024-01-03 19:44:02,511][model8_pretrain.py][INFO] Epoch:[0/2](247400/4588595) loss:3.257 lr:0.0000100 epoch_Time:27448.0min: [2024-01-03 19:44:02,511][model8_pretrain.py][INFO] Epoch:[0/2](247400/4588595) loss:2.721 lr:0.0000100 epoch_Time:27448.0min: [2024-01-03 19:44:02,511][model8_pretrain.py][INFO] Epoch:[0/2](247400/4588595) loss:2.490 lr:0.0000100 epoch_Time:27448.0min: [2024-01-03 19:44:02,511][model8_pretrain.py][INFO] Epoch:[0/2](247400/4588595) loss:3.127 lr:0.0000100 epoch_Time:27448.0min: [2024-01-03 19:44:39,456][model8_pretrain.py][INFO] Epoch:[0/2](247500/4588595) loss:2.972 lr:0.0000100 epoch_Time:27448.0min: [2024-01-03 19:44:39,456][model8_pretrain.py][INFO] Epoch:[0/2](247500/4588595) loss:2.858 lr:0.0000100 epoch_Time:27448.0min: [2024-01-03 19:44:39,456][model8_pretrain.py][INFO] Epoch:[0/2](247500/4588595) loss:3.284 lr:0.0000100 epoch_Time:27448.0min: [2024-01-03 19:44:39,456][model8_pretrain.py][INFO] Epoch:[0/2](247500/4588595) loss:3.506 lr:0.0000100 epoch_Time:27448.0min: [2024-01-03 19:44:39,456][model8_pretrain.py][INFO] Epoch:[0/2](247500/4588595) loss:3.046 lr:0.0000100 epoch_Time:27448.0min: [2024-01-03 19:44:39,456][model8_pretrain.py][INFO] Epoch:[0/2](247500/4588595) loss:3.099 lr:0.0000100 epoch_Time:27448.0min: [2024-01-03 19:44:39,456][model8_pretrain.py][INFO] Epoch:[0/2](247500/4588595) loss:3.016 lr:0.0000100 epoch_Time:27448.0min: [2024-01-03 19:44:39,456][model8_pretrain.py][INFO] Epoch:[0/2](247500/4588595) loss:2.851 lr:0.0000100 epoch_Time:27448.0min: [2024-01-03 19:45:16,396][model8_pretrain.py][INFO] Epoch:[0/2](247600/4588595) loss:3.074 lr:0.0000100 epoch_Time:27446.0min: [2024-01-03 19:45:16,396][model8_pretrain.py][INFO] Epoch:[0/2](247600/4588595) loss:2.579 lr:0.0000100 epoch_Time:27446.0min: [2024-01-03 19:45:16,396][model8_pretrain.py][INFO] Epoch:[0/2](247600/4588595) loss:2.900 lr:0.0000100 epoch_Time:27446.0min: [2024-01-03 19:45:16,396][model8_pretrain.py][INFO] 
Epoch:[0/2](247600/4588595) loss:2.253 lr:0.0000100 epoch_Time:27446.0min: [2024-01-03 19:45:16,396][model8_pretrain.py][INFO] Epoch:[0/2](247600/4588595) loss:2.756 lr:0.0000100 epoch_Time:27446.0min: [2024-01-03 19:45:16,396][model8_pretrain.py][INFO] Epoch:[0/2](247600/4588595) loss:2.937 lr:0.0000100 epoch_Time:27446.0min: [2024-01-03 19:45:16,396][model8_pretrain.py][INFO] Epoch:[0/2](247600/4588595) loss:2.844 lr:0.0000100 epoch_Time:27446.0min: [2024-01-03 19:45:16,396][model8_pretrain.py][INFO] Epoch:[0/2](247600/4588595) loss:3.157 lr:0.0000100 epoch_Time:27446.0min: [2024-01-03 19:45:53,333][model8_pretrain.py][INFO] Epoch:[0/2](247700/4588595) loss:3.396 lr:0.0000100 epoch_Time:27445.0min: [2024-01-03 19:45:53,333][model8_pretrain.py][INFO] Epoch:[0/2](247700/4588595) loss:2.980 lr:0.0000100 epoch_Time:27445.0min: [2024-01-03 19:45:53,333][model8_pretrain.py][INFO] Epoch:[0/2](247700/4588595) loss:3.002 lr:0.0000100 epoch_Time:27445.0min: [2024-01-03 19:45:53,333][model8_pretrain.py][INFO] Epoch:[0/2](247700/4588595) loss:2.569 lr:0.0000100 epoch_Time:27445.0min: [2024-01-03 19:45:53,333][model8_pretrain.py][INFO] Epoch:[0/2](247700/4588595) loss:2.855 lr:0.0000100 epoch_Time:27445.0min: [2024-01-03 19:45:53,333][model8_pretrain.py][INFO] Epoch:[0/2](247700/4588595) loss:3.373 lr:0.0000100 epoch_Time:27445.0min: [2024-01-03 19:45:53,333][model8_pretrain.py][INFO] Epoch:[0/2](247700/4588595) loss:3.349 lr:0.0000100 epoch_Time:27445.0min: [2024-01-03 19:45:53,333][model8_pretrain.py][INFO] Epoch:[0/2](247700/4588595) loss:3.088 lr:0.0000100 epoch_Time:27445.0min: [2024-01-03 19:46:30,279][model8_pretrain.py][INFO] Epoch:[0/2](247800/4588595) loss:2.437 lr:0.0000100 epoch_Time:27445.0min: [2024-01-03 19:46:30,279][model8_pretrain.py][INFO] Epoch:[0/2](247800/4588595) loss:2.624 lr:0.0000100 epoch_Time:27445.0min: [2024-01-03 19:46:30,280][model8_pretrain.py][INFO] Epoch:[0/2](247800/4588595) loss:3.032 lr:0.0000100 epoch_Time:27445.0min: [2024-01-03 
19:46:30,279][model8_pretrain.py][INFO] Epoch:[0/2](247800/4588595) loss:3.177 lr:0.0000100 epoch_Time:27445.0min: [2024-01-03 19:46:30,280][model8_pretrain.py][INFO] Epoch:[0/2](247800/4588595) loss:3.279 lr:0.0000100 epoch_Time:27445.0min: [2024-01-03 19:46:30,280][model8_pretrain.py][INFO] Epoch:[0/2](247800/4588595) loss:3.241 lr:0.0000100 epoch_Time:27445.0min: [2024-01-03 19:46:30,280][model8_pretrain.py][INFO] Epoch:[0/2](247800/4588595) loss:3.040 lr:0.0000100 epoch_Time:27445.0min: [2024-01-03 19:46:30,280][model8_pretrain.py][INFO] Epoch:[0/2](247800/4588595) loss:2.437 lr:0.0000100 epoch_Time:27445.0min: [2024-01-03 19:47:07,215][model8_pretrain.py][INFO] Epoch:[0/2](247900/4588595) loss:3.063 lr:0.0000100 epoch_Time:27443.0min: [2024-01-03 19:47:07,215][model8_pretrain.py][INFO] Epoch:[0/2](247900/4588595) loss:2.630 lr:0.0000100 epoch_Time:27443.0min: [2024-01-03 19:47:07,215][model8_pretrain.py][INFO] Epoch:[0/2](247900/4588595) loss:2.695 lr:0.0000100 epoch_Time:27443.0min: [2024-01-03 19:47:07,215][model8_pretrain.py][INFO] Epoch:[0/2](247900/4588595) loss:2.962 lr:0.0000100 epoch_Time:27443.0min: [2024-01-03 19:47:07,215][model8_pretrain.py][INFO] Epoch:[0/2](247900/4588595) loss:3.384 lr:0.0000100 epoch_Time:27443.0min: [2024-01-03 19:47:07,215][model8_pretrain.py][INFO] Epoch:[0/2](247900/4588595) loss:2.570 lr:0.0000100 epoch_Time:27443.0min: [2024-01-03 19:47:07,215][model8_pretrain.py][INFO] Epoch:[0/2](247900/4588595) loss:3.010 lr:0.0000100 epoch_Time:27443.0min: [2024-01-03 19:47:07,215][model8_pretrain.py][INFO] Epoch:[0/2](247900/4588595) loss:2.826 lr:0.0000100 epoch_Time:27443.0min: [2024-01-03 19:47:51,189][model8_pretrain.py][INFO] Epoch:[0/2](248000/4588595) loss:3.053 lr:0.0000100 epoch_Time:27444.0min: [2024-01-03 19:47:51,189][model8_pretrain.py][INFO] Epoch:[0/2](248000/4588595) loss:2.942 lr:0.0000100 epoch_Time:27444.0min: [2024-01-03 19:47:51,189][model8_pretrain.py][INFO] Epoch:[0/2](248000/4588595) loss:3.122 lr:0.0000100 
epoch_Time:27444.0min: [2024-01-03 19:47:51,189][model8_pretrain.py][INFO] Epoch:[0/2](248000/4588595) loss:2.811 lr:0.0000100 epoch_Time:27444.0min: [2024-01-03 19:47:51,189][model8_pretrain.py][INFO] Epoch:[0/2](248000/4588595) loss:3.287 lr:0.0000100 epoch_Time:27444.0min: [2024-01-03 19:47:51,189][model8_pretrain.py][INFO] Epoch:[0/2](248000/4588595) loss:2.962 lr:0.0000100 epoch_Time:27444.0min: [2024-01-03 19:47:51,189][model8_pretrain.py][INFO] Epoch:[0/2](248000/4588595) loss:2.989 lr:0.0000100 epoch_Time:27444.0min: [2024-01-03 19:47:51,189][model8_pretrain.py][INFO] Epoch:[0/2](248000/4588595) loss:3.027 lr:0.0000100 epoch_Time:27444.0min: [2024-01-03 19:48:29,805][model8_pretrain.py][INFO] Epoch:[0/2](248100/4588595) loss:2.977 lr:0.0000100 epoch_Time:27444.0min: [2024-01-03 19:48:29,805][model8_pretrain.py][INFO] Epoch:[0/2](248100/4588595) loss:2.942 lr:0.0000100 epoch_Time:27444.0min: [2024-01-03 19:48:29,805][model8_pretrain.py][INFO] Epoch:[0/2](248100/4588595) loss:2.769 lr:0.0000100 epoch_Time:27444.0min: [2024-01-03 19:48:29,805][model8_pretrain.py][INFO] Epoch:[0/2](248100/4588595) loss:3.058 lr:0.0000100 epoch_Time:27444.0min: [2024-01-03 19:48:29,805][model8_pretrain.py][INFO] Epoch:[0/2](248100/4588595) loss:3.144 lr:0.0000100 epoch_Time:27444.0min: [2024-01-03 19:48:29,805][model8_pretrain.py][INFO] Epoch:[0/2](248100/4588595) loss:2.614 lr:0.0000100 epoch_Time:27444.0min: [2024-01-03 19:48:29,805][model8_pretrain.py][INFO] Epoch:[0/2](248100/4588595) loss:2.357 lr:0.0000100 epoch_Time:27444.0min: [2024-01-03 19:48:29,805][model8_pretrain.py][INFO] Epoch:[0/2](248100/4588595) loss:2.930 lr:0.0000100 epoch_Time:27444.0min: [2024-01-03 19:49:06,741][model8_pretrain.py][INFO] Epoch:[0/2](248200/4588595) loss:3.094 lr:0.0000100 epoch_Time:27443.0min: [2024-01-03 19:49:06,741][model8_pretrain.py][INFO] Epoch:[0/2](248200/4588595) loss:3.275 lr:0.0000100 epoch_Time:27443.0min: [2024-01-03 19:49:06,741][model8_pretrain.py][INFO] 
Epoch:[0/2](248200/4588595) loss:2.935 lr:0.0000100 epoch_Time:27443.0min: [2024-01-03 19:49:06,741][model8_pretrain.py][INFO] Epoch:[0/2](248200/4588595) loss:2.662 lr:0.0000100 epoch_Time:27443.0min: [2024-01-03 19:49:06,741][model8_pretrain.py][INFO] Epoch:[0/2](248200/4588595) loss:3.528 lr:0.0000100 epoch_Time:27443.0min: [2024-01-03 19:49:06,741][model8_pretrain.py][INFO] Epoch:[0/2](248200/4588595) loss:2.871 lr:0.0000100 epoch_Time:27443.0min: [2024-01-03 19:49:06,741][model8_pretrain.py][INFO] Epoch:[0/2](248200/4588595) loss:2.880 lr:0.0000100 epoch_Time:27443.0min: [2024-01-03 19:49:06,741][model8_pretrain.py][INFO] Epoch:[0/2](248200/4588595) loss:3.083 lr:0.0000100 epoch_Time:27443.0min: [2024-01-03 19:49:43,676][model8_pretrain.py][INFO] Epoch:[0/2](248300/4588595) loss:2.739 lr:0.0000100 epoch_Time:27443.0min: [2024-01-03 19:49:43,676][model8_pretrain.py][INFO] Epoch:[0/2](248300/4588595) loss:3.187 lr:0.0000100 epoch_Time:27443.0min: [2024-01-03 19:49:43,676][model8_pretrain.py][INFO] Epoch:[0/2](248300/4588595) loss:3.296 lr:0.0000100 epoch_Time:27443.0min: [2024-01-03 19:49:43,676][model8_pretrain.py][INFO] Epoch:[0/2](248300/4588595) loss:2.696 lr:0.0000100 epoch_Time:27443.0min: [2024-01-03 19:49:43,676][model8_pretrain.py][INFO] Epoch:[0/2](248300/4588595) loss:2.743 lr:0.0000100 epoch_Time:27443.0min: [2024-01-03 19:49:43,676][model8_pretrain.py][INFO] Epoch:[0/2](248300/4588595) loss:3.007 lr:0.0000100 epoch_Time:27443.0min: [2024-01-03 19:49:43,676][model8_pretrain.py][INFO] Epoch:[0/2](248300/4588595) loss:2.383 lr:0.0000100 epoch_Time:27443.0min: [2024-01-03 19:49:43,676][model8_pretrain.py][INFO] Epoch:[0/2](248300/4588595) loss:2.710 lr:0.0000100 epoch_Time:27443.0min: [2024-01-03 19:50:20,615][model8_pretrain.py][INFO] Epoch:[0/2](248400/4588595) loss:3.177 lr:0.0000100 epoch_Time:27442.0min: [2024-01-03 19:50:20,615][model8_pretrain.py][INFO] Epoch:[0/2](248400/4588595) loss:2.906 lr:0.0000100 epoch_Time:27442.0min: [2024-01-03 
19:50:20,615][model8_pretrain.py][INFO] Epoch:[0/2](248400/4588595) loss:2.806 lr:0.0000100 epoch_Time:27442.0min: [2024-01-03 19:50:20,615][model8_pretrain.py][INFO] Epoch:[0/2](248400/4588595) loss:3.320 lr:0.0000100 epoch_Time:27442.0min: [2024-01-03 19:50:20,615][model8_pretrain.py][INFO] Epoch:[0/2](248400/4588595) loss:2.975 lr:0.0000100 epoch_Time:27442.0min: [2024-01-03 19:50:20,615][model8_pretrain.py][INFO] Epoch:[0/2](248400/4588595) loss:2.998 lr:0.0000100 epoch_Time:27442.0min: [2024-01-03 19:50:20,615][model8_pretrain.py][INFO] Epoch:[0/2](248400/4588595) loss:3.302 lr:0.0000100 epoch_Time:27442.0min: [2024-01-03 19:50:20,615][model8_pretrain.py][INFO] Epoch:[0/2](248400/4588595) loss:3.202 lr:0.0000100 epoch_Time:27442.0min: [2024-01-03 19:50:57,560][model8_pretrain.py][INFO] Epoch:[0/2](248500/4588595) loss:3.442 lr:0.0000100 epoch_Time:27440.0min: [2024-01-03 19:50:57,560][model8_pretrain.py][INFO] Epoch:[0/2](248500/4588595) loss:2.775 lr:0.0000100 epoch_Time:27440.0min: [2024-01-03 19:50:57,560][model8_pretrain.py][INFO] Epoch:[0/2](248500/4588595) loss:3.438 lr:0.0000100 epoch_Time:27440.0min: [2024-01-03 19:50:57,560][model8_pretrain.py][INFO] Epoch:[0/2](248500/4588595) loss:2.958 lr:0.0000100 epoch_Time:27440.0min: [2024-01-03 19:50:57,560][model8_pretrain.py][INFO] Epoch:[0/2](248500/4588595) loss:2.595 lr:0.0000100 epoch_Time:27440.0min: [2024-01-03 19:50:57,560][model8_pretrain.py][INFO] Epoch:[0/2](248500/4588595) loss:3.010 lr:0.0000100 epoch_Time:27440.0min: [2024-01-03 19:50:57,560][model8_pretrain.py][INFO] Epoch:[0/2](248500/4588595) loss:3.155 lr:0.0000100 epoch_Time:27440.0min: [2024-01-03 19:50:57,560][model8_pretrain.py][INFO] Epoch:[0/2](248500/4588595) loss:3.090 lr:0.0000100 epoch_Time:27440.0min: [2024-01-03 19:51:34,499][model8_pretrain.py][INFO] Epoch:[0/2](248600/4588595) loss:3.043 lr:0.0000100 epoch_Time:27440.0min: [2024-01-03 19:51:34,499][model8_pretrain.py][INFO] Epoch:[0/2](248600/4588595) loss:2.306 lr:0.0000100 
epoch_Time:27440.0min: [2024-01-03 19:51:34,499][model8_pretrain.py][INFO] Epoch:[0/2](248600/4588595) loss:3.061 lr:0.0000100 epoch_Time:27440.0min: [2024-01-03 19:51:34,499][model8_pretrain.py][INFO] Epoch:[0/2](248600/4588595) loss:3.008 lr:0.0000100 epoch_Time:27440.0min: [2024-01-03 19:51:34,499][model8_pretrain.py][INFO] Epoch:[0/2](248600/4588595) loss:3.512 lr:0.0000100 epoch_Time:27440.0min: [2024-01-03 19:51:34,499][model8_pretrain.py][INFO] Epoch:[0/2](248600/4588595) loss:3.236 lr:0.0000100 epoch_Time:27440.0min: [2024-01-03 19:51:34,499][model8_pretrain.py][INFO] Epoch:[0/2](248600/4588595) loss:2.666 lr:0.0000100 epoch_Time:27440.0min: [2024-01-03 19:51:34,499][model8_pretrain.py][INFO] Epoch:[0/2](248600/4588595) loss:2.832 lr:0.0000100 epoch_Time:27440.0min: [2024-01-03 19:52:11,432][model8_pretrain.py][INFO] Epoch:[0/2](248700/4588595) loss:2.944 lr:0.0000100 epoch_Time:27439.0min: [2024-01-03 19:52:11,432][model8_pretrain.py][INFO] Epoch:[0/2](248700/4588595) loss:2.722 lr:0.0000100 epoch_Time:27439.0min: [2024-01-03 19:52:11,432][model8_pretrain.py][INFO] Epoch:[0/2](248700/4588595) loss:3.328 lr:0.0000100 epoch_Time:27439.0min: [2024-01-03 19:52:11,431][model8_pretrain.py][INFO] Epoch:[0/2](248700/4588595) loss:2.880 lr:0.0000100 epoch_Time:27439.0min: [2024-01-03 19:52:11,432][model8_pretrain.py][INFO] Epoch:[0/2](248700/4588595) loss:3.003 lr:0.0000100 epoch_Time:27439.0min: [2024-01-03 19:52:11,432][model8_pretrain.py][INFO] Epoch:[0/2](248700/4588595) loss:2.954 lr:0.0000100 epoch_Time:27439.0min: [2024-01-03 19:52:11,432][model8_pretrain.py][INFO] Epoch:[0/2](248700/4588595) loss:2.501 lr:0.0000100 epoch_Time:27439.0min: [2024-01-03 19:52:11,432][model8_pretrain.py][INFO] Epoch:[0/2](248700/4588595) loss:2.624 lr:0.0000100 epoch_Time:27439.0min: [2024-01-03 19:52:53,324][model8_pretrain.py][INFO] Epoch:[0/2](248800/4588595) loss:2.889 lr:0.0000100 epoch_Time:27439.0min: [2024-01-03 19:52:53,324][model8_pretrain.py][INFO] 
Epoch:[0/2](248800/4588595) loss:3.198 lr:0.0000100 epoch_Time:27439.0min: [2024-01-03 19:52:53,324][model8_pretrain.py][INFO] Epoch:[0/2](248800/4588595) loss:3.217 lr:0.0000100 epoch_Time:27439.0min: [2024-01-03 19:52:53,324][model8_pretrain.py][INFO] Epoch:[0/2](248800/4588595) loss:2.378 lr:0.0000100 epoch_Time:27439.0min: [2024-01-03 19:52:53,324][model8_pretrain.py][INFO] Epoch:[0/2](248800/4588595) loss:3.187 lr:0.0000100 epoch_Time:27439.0min: [2024-01-03 19:52:53,324][model8_pretrain.py][INFO] Epoch:[0/2](248800/4588595) loss:2.763 lr:0.0000100 epoch_Time:27439.0min: [2024-01-03 19:52:53,324][model8_pretrain.py][INFO] Epoch:[0/2](248800/4588595) loss:2.985 lr:0.0000100 epoch_Time:27439.0min: [2024-01-03 19:52:53,324][model8_pretrain.py][INFO] Epoch:[0/2](248800/4588595) loss:3.073 lr:0.0000100 epoch_Time:27439.0min: [2024-01-03 19:53:33,765][model8_pretrain.py][INFO] Epoch:[0/2](248900/4588595) loss:3.214 lr:0.0000100 epoch_Time:27440.0min: [2024-01-03 19:53:33,765][model8_pretrain.py][INFO] Epoch:[0/2](248900/4588595) loss:2.995 lr:0.0000100 epoch_Time:27440.0min: [2024-01-03 19:53:33,765][model8_pretrain.py][INFO] Epoch:[0/2](248900/4588595) loss:3.750 lr:0.0000100 epoch_Time:27440.0min: [2024-01-03 19:53:33,765][model8_pretrain.py][INFO] Epoch:[0/2](248900/4588595) loss:2.832 lr:0.0000100 epoch_Time:27440.0min: [2024-01-03 19:53:33,765][model8_pretrain.py][INFO] Epoch:[0/2](248900/4588595) loss:2.924 lr:0.0000100 epoch_Time:27440.0min: [2024-01-03 19:53:33,765][model8_pretrain.py][INFO] Epoch:[0/2](248900/4588595) loss:2.249 lr:0.0000100 epoch_Time:27440.0min: [2024-01-03 19:53:33,765][model8_pretrain.py][INFO] Epoch:[0/2](248900/4588595) loss:3.424 lr:0.0000100 epoch_Time:27440.0min: [2024-01-03 19:53:33,765][model8_pretrain.py][INFO] Epoch:[0/2](248900/4588595) loss:3.051 lr:0.0000100 epoch_Time:27440.0min: [2024-01-03 19:54:10,713][model8_pretrain.py][INFO] Epoch:[0/2](249000/4588595) loss:3.045 lr:0.0000100 epoch_Time:27438.0min: [2024-01-03 
19:54:10,714][model8_pretrain.py][INFO] Epoch:[0/2](249000/4588595) loss:3.142 lr:0.0000100 epoch_Time:27438.0min: [2024-01-03 19:54:10,714][model8_pretrain.py][INFO] Epoch:[0/2](249000/4588595) loss:3.243 lr:0.0000100 epoch_Time:27438.0min: [2024-01-03 19:54:10,714][model8_pretrain.py][INFO] Epoch:[0/2](249000/4588595) loss:2.977 lr:0.0000100 epoch_Time:27438.0min: [2024-01-03 19:54:10,714][model8_pretrain.py][INFO] Epoch:[0/2](249000/4588595) loss:2.497 lr:0.0000100 epoch_Time:27438.0min: [2024-01-03 19:54:10,714][model8_pretrain.py][INFO] Epoch:[0/2](249000/4588595) loss:2.643 lr:0.0000100 epoch_Time:27438.0min: [2024-01-03 19:54:10,714][model8_pretrain.py][INFO] Epoch:[0/2](249000/4588595) loss:2.852 lr:0.0000100 epoch_Time:27438.0min: [2024-01-03 19:54:10,714][model8_pretrain.py][INFO] Epoch:[0/2](249000/4588595) loss:2.866 lr:0.0000100 epoch_Time:27438.0min: [2024-01-03 19:54:47,659][model8_pretrain.py][INFO] Epoch:[0/2](249100/4588595) loss:2.909 lr:0.0000100 epoch_Time:27437.0min: [2024-01-03 19:54:47,659][model8_pretrain.py][INFO] Epoch:[0/2](249100/4588595) loss:2.627 lr:0.0000100 epoch_Time:27437.0min: [2024-01-03 19:54:47,659][model8_pretrain.py][INFO] Epoch:[0/2](249100/4588595) loss:3.509 lr:0.0000100 epoch_Time:27437.0min: [2024-01-03 19:54:47,659][model8_pretrain.py][INFO] Epoch:[0/2](249100/4588595) loss:3.263 lr:0.0000100 epoch_Time:27437.0min: [2024-01-03 19:54:47,659][model8_pretrain.py][INFO] Epoch:[0/2](249100/4588595) loss:2.592 lr:0.0000100 epoch_Time:27437.0min: [2024-01-03 19:54:47,659][model8_pretrain.py][INFO] Epoch:[0/2](249100/4588595) loss:2.163 lr:0.0000100 epoch_Time:27437.0min: [2024-01-03 19:54:47,659][model8_pretrain.py][INFO] Epoch:[0/2](249100/4588595) loss:2.827 lr:0.0000100 epoch_Time:27437.0min: [2024-01-03 19:54:47,659][model8_pretrain.py][INFO] Epoch:[0/2](249100/4588595) loss:3.331 lr:0.0000100 epoch_Time:27437.0min: [2024-01-03 19:55:24,596][model8_pretrain.py][INFO] Epoch:[0/2](249200/4588595) loss:3.349 lr:0.0000100 
epoch_Time:27437.0min: [2024-01-03 19:55:24,596][model8_pretrain.py][INFO] Epoch:[0/2](249200/4588595) loss:2.906 lr:0.0000100 epoch_Time:27437.0min: [2024-01-03 19:55:24,596][model8_pretrain.py][INFO] Epoch:[0/2](249200/4588595) loss:2.974 lr:0.0000100 epoch_Time:27437.0min: [2024-01-03 19:55:24,596][model8_pretrain.py][INFO] Epoch:[0/2](249200/4588595) loss:2.385 lr:0.0000100 epoch_Time:27437.0min: [2024-01-03 19:55:24,596][model8_pretrain.py][INFO] Epoch:[0/2](249200/4588595) loss:2.810 lr:0.0000100 epoch_Time:27437.0min: [2024-01-03 19:55:24,596][model8_pretrain.py][INFO] Epoch:[0/2](249200/4588595) loss:2.924 lr:0.0000100 epoch_Time:27437.0min: [2024-01-03 19:55:24,596][model8_pretrain.py][INFO] Epoch:[0/2](249200/4588595) loss:3.433 lr:0.0000100 epoch_Time:27437.0min: [2024-01-03 19:55:24,596][model8_pretrain.py][INFO] Epoch:[0/2](249200/4588595) loss:2.362 lr:0.0000100 epoch_Time:27437.0min: [2024-01-03 19:56:01,536][model8_pretrain.py][INFO] Epoch:[0/2](249300/4588595) loss:3.141 lr:0.0000100 epoch_Time:27435.0min: [2024-01-03 19:56:01,536][model8_pretrain.py][INFO] Epoch:[0/2](249300/4588595) loss:3.143 lr:0.0000100 epoch_Time:27435.0min: [2024-01-03 19:56:01,536][model8_pretrain.py][INFO] Epoch:[0/2](249300/4588595) loss:2.814 lr:0.0000100 epoch_Time:27435.0min: [2024-01-03 19:56:01,536][model8_pretrain.py][INFO] Epoch:[0/2](249300/4588595) loss:3.300 lr:0.0000100 epoch_Time:27435.0min: [2024-01-03 19:56:01,536][model8_pretrain.py][INFO] Epoch:[0/2](249300/4588595) loss:1.394 lr:0.0000100 epoch_Time:27435.0min: [2024-01-03 19:56:01,536][model8_pretrain.py][INFO] Epoch:[0/2](249300/4588595) loss:2.483 lr:0.0000100 epoch_Time:27435.0min: [2024-01-03 19:56:01,536][model8_pretrain.py][INFO] Epoch:[0/2](249300/4588595) loss:2.931 lr:0.0000100 epoch_Time:27435.0min: [2024-01-03 19:56:01,536][model8_pretrain.py][INFO] Epoch:[0/2](249300/4588595) loss:2.753 lr:0.0000100 epoch_Time:27435.0min: [2024-01-03 19:56:38,481][model8_pretrain.py][INFO] 
Epoch:[0/2](249400/4588595) loss:2.756 lr:0.0000100 epoch_Time:27435.0min: [2024-01-03 19:56:38,481][model8_pretrain.py][INFO] Epoch:[0/2](249400/4588595) loss:2.723 lr:0.0000100 epoch_Time:27435.0min: [2024-01-03 19:56:38,481][model8_pretrain.py][INFO] Epoch:[0/2](249400/4588595) loss:3.099 lr:0.0000100 epoch_Time:27435.0min: [2024-01-03 19:56:38,481][model8_pretrain.py][INFO] Epoch:[0/2](249400/4588595) loss:3.007 lr:0.0000100 epoch_Time:27435.0min: [2024-01-03 19:56:38,481][model8_pretrain.py][INFO] Epoch:[0/2](249400/4588595) loss:3.282 lr:0.0000100 epoch_Time:27435.0min: [2024-01-03 19:56:38,481][model8_pretrain.py][INFO] Epoch:[0/2](249400/4588595) loss:3.076 lr:0.0000100 epoch_Time:27435.0min: [2024-01-03 19:56:38,481][model8_pretrain.py][INFO] Epoch:[0/2](249400/4588595) loss:3.057 lr:0.0000100 epoch_Time:27435.0min: [2024-01-03 19:56:38,481][model8_pretrain.py][INFO] Epoch:[0/2](249400/4588595) loss:2.718 lr:0.0000100 epoch_Time:27435.0min: [2024-01-03 19:57:15,398][model8_pretrain.py][INFO] Epoch:[0/2](249500/4588595) loss:3.156 lr:0.0000100 epoch_Time:27434.0min: [2024-01-03 19:57:15,398][model8_pretrain.py][INFO] Epoch:[0/2](249500/4588595) loss:2.785 lr:0.0000100 epoch_Time:27434.0min: [2024-01-03 19:57:15,398][model8_pretrain.py][INFO] Epoch:[0/2](249500/4588595) loss:2.885 lr:0.0000100 epoch_Time:27434.0min: [2024-01-03 19:57:15,398][model8_pretrain.py][INFO] Epoch:[0/2](249500/4588595) loss:3.219 lr:0.0000100 epoch_Time:27434.0min: [2024-01-03 19:57:15,398][model8_pretrain.py][INFO] Epoch:[0/2](249500/4588595) loss:2.972 lr:0.0000100 epoch_Time:27434.0min: [2024-01-03 19:57:15,398][model8_pretrain.py][INFO] Epoch:[0/2](249500/4588595) loss:3.160 lr:0.0000100 epoch_Time:27434.0min: [2024-01-03 19:57:15,398][model8_pretrain.py][INFO] Epoch:[0/2](249500/4588595) loss:3.215 lr:0.0000100 epoch_Time:27434.0min: [2024-01-03 19:57:15,399][model8_pretrain.py][INFO] Epoch:[0/2](249500/4588595) loss:3.306 lr:0.0000100 epoch_Time:27434.0min: [2024-01-03 
19:57:57,292][model8_pretrain.py][INFO] Epoch:[0/2](249600/4588595) loss:2.583 lr:0.0000100 epoch_Time:27434.0min: [2024-01-03 19:57:57,292][model8_pretrain.py][INFO] Epoch:[0/2](249600/4588595) loss:3.019 lr:0.0000100 epoch_Time:27434.0min: [2024-01-03 19:57:57,292][model8_pretrain.py][INFO] Epoch:[0/2](249600/4588595) loss:3.362 lr:0.0000100 epoch_Time:27434.0min: [2024-01-03 19:57:57,292][model8_pretrain.py][INFO] Epoch:[0/2](249600/4588595) loss:2.753 lr:0.0000100 epoch_Time:27434.0min: [2024-01-03 19:57:57,292][model8_pretrain.py][INFO] Epoch:[0/2](249600/4588595) loss:2.692 lr:0.0000100 epoch_Time:27434.0min: [2024-01-03 19:57:57,292][model8_pretrain.py][INFO] Epoch:[0/2](249600/4588595) loss:2.792 lr:0.0000100 epoch_Time:27434.0min: [2024-01-03 19:57:57,292][model8_pretrain.py][INFO] Epoch:[0/2](249600/4588595) loss:3.221 lr:0.0000100 epoch_Time:27434.0min: [2024-01-03 19:57:57,292][model8_pretrain.py][INFO] Epoch:[0/2](249600/4588595) loss:3.200 lr:0.0000100 epoch_Time:27434.0min: [2024-01-03 19:58:37,730][model8_pretrain.py][INFO] Epoch:[0/2](249700/4588595) loss:2.588 lr:0.0000100 epoch_Time:27435.0min: [2024-01-03 19:58:37,730][model8_pretrain.py][INFO] Epoch:[0/2](249700/4588595) loss:3.342 lr:0.0000100 epoch_Time:27435.0min: [2024-01-03 19:58:37,730][model8_pretrain.py][INFO] Epoch:[0/2](249700/4588595) loss:2.660 lr:0.0000100 epoch_Time:27435.0min: [2024-01-03 19:58:37,730][model8_pretrain.py][INFO] Epoch:[0/2](249700/4588595) loss:2.786 lr:0.0000100 epoch_Time:27435.0min: [2024-01-03 19:58:37,730][model8_pretrain.py][INFO] Epoch:[0/2](249700/4588595) loss:3.007 lr:0.0000100 epoch_Time:27435.0min: [2024-01-03 19:58:37,730][model8_pretrain.py][INFO] Epoch:[0/2](249700/4588595) loss:2.760 lr:0.0000100 epoch_Time:27435.0min: [2024-01-03 19:58:37,730][model8_pretrain.py][INFO] Epoch:[0/2](249700/4588595) loss:2.753 lr:0.0000100 epoch_Time:27435.0min: [2024-01-03 19:58:37,730][model8_pretrain.py][INFO] Epoch:[0/2](249700/4588595) loss:2.819 lr:0.0000100 
epoch_Time:27435.0min: [2024-01-03 19:59:14,674][model8_pretrain.py][INFO] Epoch:[0/2](249800/4588595) loss:2.922 lr:0.0000100 epoch_Time:27433.0min: [2024-01-03 19:59:14,674][model8_pretrain.py][INFO] Epoch:[0/2](249800/4588595) loss:3.211 lr:0.0000100 epoch_Time:27433.0min: [2024-01-03 19:59:14,674][model8_pretrain.py][INFO] Epoch:[0/2](249800/4588595) loss:2.600 lr:0.0000100 epoch_Time:27433.0min: [2024-01-03 19:59:14,674][model8_pretrain.py][INFO] Epoch:[0/2](249800/4588595) loss:2.453 lr:0.0000100 epoch_Time:27433.0min: [2024-01-03 19:59:14,675][model8_pretrain.py][INFO] Epoch:[0/2](249800/4588595) loss:2.823 lr:0.0000100 epoch_Time:27433.0min: [2024-01-03 19:59:14,675][model8_pretrain.py][INFO] Epoch:[0/2](249800/4588595) loss:2.527 lr:0.0000100 epoch_Time:27433.0min: [2024-01-03 19:59:14,675][model8_pretrain.py][INFO] Epoch:[0/2](249800/4588595) loss:2.777 lr:0.0000100 epoch_Time:27433.0min: [2024-01-03 19:59:14,675][model8_pretrain.py][INFO] Epoch:[0/2](249800/4588595) loss:2.367 lr:0.0000100 epoch_Time:27433.0min: [2024-01-03 19:59:51,616][model8_pretrain.py][INFO] Epoch:[0/2](249900/4588595) loss:2.605 lr:0.0000100 epoch_Time:27432.0min: [2024-01-03 19:59:51,616][model8_pretrain.py][INFO] Epoch:[0/2](249900/4588595) loss:2.799 lr:0.0000100 epoch_Time:27432.0min: [2024-01-03 19:59:51,616][model8_pretrain.py][INFO] Epoch:[0/2](249900/4588595) loss:3.137 lr:0.0000100 epoch_Time:27432.0min: [2024-01-03 19:59:51,616][model8_pretrain.py][INFO] Epoch:[0/2](249900/4588595) loss:3.269 lr:0.0000100 epoch_Time:27432.0min: [2024-01-03 19:59:51,616][model8_pretrain.py][INFO] Epoch:[0/2](249900/4588595) loss:2.641 lr:0.0000100 epoch_Time:27432.0min: [2024-01-03 19:59:51,616][model8_pretrain.py][INFO] Epoch:[0/2](249900/4588595) loss:3.465 lr:0.0000100 epoch_Time:27432.0min: [2024-01-03 19:59:51,616][model8_pretrain.py][INFO] Epoch:[0/2](249900/4588595) loss:3.128 lr:0.0000100 epoch_Time:27432.0min: [2024-01-03 19:59:51,616][model8_pretrain.py][INFO] 
Epoch:[0/2](249900/4588595) loss:2.938 lr:0.0000100 epoch_Time:27432.0min: [2024-01-03 20:00:28,541][model8_pretrain.py][INFO] Epoch:[0/2](250000/4588595) loss:3.180 lr:0.0000100 epoch_Time:27432.0min: [2024-01-03 20:00:28,541][model8_pretrain.py][INFO] Epoch:[0/2](250000/4588595) loss:3.503 lr:0.0000100 epoch_Time:27432.0min: [2024-01-03 20:00:28,541][model8_pretrain.py][INFO] Epoch:[0/2](250000/4588595) loss:2.937 lr:0.0000100 epoch_Time:27432.0min: [2024-01-03 20:00:28,541][model8_pretrain.py][INFO] Epoch:[0/2](250000/4588595) loss:3.087 lr:0.0000100 epoch_Time:27432.0min: [2024-01-03 20:00:28,541][model8_pretrain.py][INFO] Epoch:[0/2](250000/4588595) loss:3.314 lr:0.0000100 epoch_Time:27432.0min: [2024-01-03 20:00:28,541][model8_pretrain.py][INFO] Epoch:[0/2](250000/4588595) loss:2.470 lr:0.0000100 epoch_Time:27432.0min: [2024-01-03 20:00:28,541][model8_pretrain.py][INFO] Epoch:[0/2](250000/4588595) loss:2.652 lr:0.0000100 epoch_Time:27432.0min: [2024-01-03 20:00:28,541][model8_pretrain.py][INFO] Epoch:[0/2](250000/4588595) loss:2.801 lr:0.0000100 epoch_Time:27432.0min: [2024-01-03 20:01:05,465][model8_pretrain.py][INFO] Epoch:[0/2](250100/4588595) loss:3.061 lr:0.0000100 epoch_Time:27431.0min: [2024-01-03 20:01:05,465][model8_pretrain.py][INFO] Epoch:[0/2](250100/4588595) loss:2.920 lr:0.0000100 epoch_Time:27431.0min: [2024-01-03 20:01:05,465][model8_pretrain.py][INFO] Epoch:[0/2](250100/4588595) loss:2.815 lr:0.0000100 epoch_Time:27431.0min: [2024-01-03 20:01:05,465][model8_pretrain.py][INFO] Epoch:[0/2](250100/4588595) loss:3.289 lr:0.0000100 epoch_Time:27431.0min: [2024-01-03 20:01:05,465][model8_pretrain.py][INFO] Epoch:[0/2](250100/4588595) loss:2.954 lr:0.0000100 epoch_Time:27431.0min: [2024-01-03 20:01:05,465][model8_pretrain.py][INFO] Epoch:[0/2](250100/4588595) loss:3.207 lr:0.0000100 epoch_Time:27431.0min: [2024-01-03 20:01:05,465][model8_pretrain.py][INFO] Epoch:[0/2](250100/4588595) loss:2.796 lr:0.0000100 epoch_Time:27431.0min: [2024-01-03 
20:01:05,465][model8_pretrain.py][INFO] Epoch:[0/2](250100/4588595) loss:3.122 lr:0.0000100 epoch_Time:27431.0min: [2024-01-03 20:01:42,398][model8_pretrain.py][INFO] Epoch:[0/2](250200/4588595) loss:2.623 lr:0.0000100 epoch_Time:27430.0min: [2024-01-03 20:01:42,398][model8_pretrain.py][INFO] Epoch:[0/2](250200/4588595) loss:2.806 lr:0.0000100 epoch_Time:27430.0min: [2024-01-03 20:01:42,398][model8_pretrain.py][INFO] Epoch:[0/2](250200/4588595) loss:2.421 lr:0.0000100 epoch_Time:27430.0min: [2024-01-03 20:01:42,398][model8_pretrain.py][INFO] Epoch:[0/2](250200/4588595) loss:2.072 lr:0.0000100 epoch_Time:27430.0min: [2024-01-03 20:01:42,398][model8_pretrain.py][INFO] Epoch:[0/2](250200/4588595) loss:3.242 lr:0.0000100 epoch_Time:27430.0min: [2024-01-03 20:01:42,398][model8_pretrain.py][INFO] Epoch:[0/2](250200/4588595) loss:3.230 lr:0.0000100 epoch_Time:27430.0min: [2024-01-03 20:01:42,398][model8_pretrain.py][INFO] Epoch:[0/2](250200/4588595) loss:2.762 lr:0.0000100 epoch_Time:27430.0min: [2024-01-03 20:01:42,398][model8_pretrain.py][INFO] Epoch:[0/2](250200/4588595) loss:3.060 lr:0.0000100 epoch_Time:27430.0min: [2024-01-03 20:02:19,330][model8_pretrain.py][INFO] Epoch:[0/2](250300/4588595) loss:2.830 lr:0.0000100 epoch_Time:27429.0min: [2024-01-03 20:02:19,330][model8_pretrain.py][INFO] Epoch:[0/2](250300/4588595) loss:2.980 lr:0.0000100 epoch_Time:27429.0min: [2024-01-03 20:02:19,330][model8_pretrain.py][INFO] Epoch:[0/2](250300/4588595) loss:2.961 lr:0.0000100 epoch_Time:27429.0min: [2024-01-03 20:02:19,330][model8_pretrain.py][INFO] Epoch:[0/2](250300/4588595) loss:2.997 lr:0.0000100 epoch_Time:27429.0min: [2024-01-03 20:02:19,330][model8_pretrain.py][INFO] Epoch:[0/2](250300/4588595) loss:3.426 lr:0.0000100 epoch_Time:27429.0min: [2024-01-03 20:02:19,330][model8_pretrain.py][INFO] Epoch:[0/2](250300/4588595) loss:2.896 lr:0.0000100 epoch_Time:27429.0min: [2024-01-03 20:02:19,330][model8_pretrain.py][INFO] Epoch:[0/2](250300/4588595) loss:2.540 lr:0.0000100 
epoch_Time:27429.0min: [2024-01-03 20:02:19,330][model8_pretrain.py][INFO] Epoch:[0/2](250300/4588595) loss:2.372 lr:0.0000100 epoch_Time:27429.0min: [2024-01-03 20:03:01,159][model8_pretrain.py][INFO] Epoch:[0/2](250400/4588595) loss:2.899 lr:0.0000100 epoch_Time:27429.0min: [2024-01-03 20:03:01,159][model8_pretrain.py][INFO] Epoch:[0/2](250400/4588595) loss:3.030 lr:0.0000100 epoch_Time:27429.0min: [2024-01-03 20:03:01,159][model8_pretrain.py][INFO] Epoch:[0/2](250400/4588595) loss:2.698 lr:0.0000100 epoch_Time:27429.0min: [2024-01-03 20:03:01,159][model8_pretrain.py][INFO] Epoch:[0/2](250400/4588595) loss:3.080 lr:0.0000100 epoch_Time:27429.0min: [2024-01-03 20:03:01,164][model8_pretrain.py][INFO] Epoch:[0/2](250400/4588595) loss:3.277 lr:0.0000100 epoch_Time:27429.0min: [2024-01-03 20:03:01,164][model8_pretrain.py][INFO] Epoch:[0/2](250400/4588595) loss:3.168 lr:0.0000100 epoch_Time:27429.0min: [2024-01-03 20:03:01,164][model8_pretrain.py][INFO] Epoch:[0/2](250400/4588595) loss:3.049 lr:0.0000100 epoch_Time:27429.0min: [2024-01-03 20:03:01,164][model8_pretrain.py][INFO] Epoch:[0/2](250400/4588595) loss:3.072 lr:0.0000100 epoch_Time:27429.0min: [2024-01-03 20:03:41,618][model8_pretrain.py][INFO] Epoch:[0/2](250500/4588595) loss:2.521 lr:0.0000100 epoch_Time:27430.0min: [2024-01-03 20:03:41,618][model8_pretrain.py][INFO] Epoch:[0/2](250500/4588595) loss:2.973 lr:0.0000100 epoch_Time:27430.0min: [2024-01-03 20:03:41,618][model8_pretrain.py][INFO] Epoch:[0/2](250500/4588595) loss:3.081 lr:0.0000100 epoch_Time:27430.0min: [2024-01-03 20:03:41,618][model8_pretrain.py][INFO] Epoch:[0/2](250500/4588595) loss:3.329 lr:0.0000100 epoch_Time:27430.0min: [2024-01-03 20:03:41,618][model8_pretrain.py][INFO] Epoch:[0/2](250500/4588595) loss:2.726 lr:0.0000100 epoch_Time:27430.0min: [2024-01-03 20:03:41,618][model8_pretrain.py][INFO] Epoch:[0/2](250500/4588595) loss:2.878 lr:0.0000100 epoch_Time:27430.0min: [2024-01-03 20:03:41,618][model8_pretrain.py][INFO] 
Epoch:[0/2](250500/4588595) loss:3.003 lr:0.0000100 epoch_Time:27430.0min: [2024-01-03 20:03:41,618][model8_pretrain.py][INFO] Epoch:[0/2](250500/4588595) loss:2.319 lr:0.0000100 epoch_Time:27430.0min: [2024-01-03 20:04:18,563][model8_pretrain.py][INFO] Epoch:[0/2](250600/4588595) loss:2.506 lr:0.0000100 epoch_Time:27429.0min: [2024-01-03 20:04:18,563][model8_pretrain.py][INFO] Epoch:[0/2](250600/4588595) loss:2.673 lr:0.0000100 epoch_Time:27429.0min: [2024-01-03 20:04:18,563][model8_pretrain.py][INFO] Epoch:[0/2](250600/4588595) loss:3.155 lr:0.0000100 epoch_Time:27429.0min: [2024-01-03 20:04:18,563][model8_pretrain.py][INFO] Epoch:[0/2](250600/4588595) loss:3.188 lr:0.0000100 epoch_Time:27429.0min: [2024-01-03 20:04:18,563][model8_pretrain.py][INFO] Epoch:[0/2](250600/4588595) loss:2.809 lr:0.0000100 epoch_Time:27429.0min: [2024-01-03 20:04:18,563][model8_pretrain.py][INFO] Epoch:[0/2](250600/4588595) loss:2.744 lr:0.0000100 epoch_Time:27429.0min: [2024-01-03 20:04:18,563][model8_pretrain.py][INFO] Epoch:[0/2](250600/4588595) loss:3.220 lr:0.0000100 epoch_Time:27429.0min: [2024-01-03 20:04:18,563][model8_pretrain.py][INFO] Epoch:[0/2](250600/4588595) loss:2.670 lr:0.0000100 epoch_Time:27429.0min: [2024-01-03 20:04:55,474][model8_pretrain.py][INFO] Epoch:[0/2](250700/4588595) loss:2.853 lr:0.0000100 epoch_Time:27427.0min: [2024-01-03 20:04:55,474][model8_pretrain.py][INFO] Epoch:[0/2](250700/4588595) loss:3.244 lr:0.0000100 epoch_Time:27427.0min: [2024-01-03 20:04:55,474][model8_pretrain.py][INFO] Epoch:[0/2](250700/4588595) loss:3.215 lr:0.0000100 epoch_Time:27427.0min: [2024-01-03 20:04:55,474][model8_pretrain.py][INFO] Epoch:[0/2](250700/4588595) loss:2.697 lr:0.0000100 epoch_Time:27427.0min: [2024-01-03 20:04:55,474][model8_pretrain.py][INFO] Epoch:[0/2](250700/4588595) loss:3.048 lr:0.0000100 epoch_Time:27427.0min: [2024-01-03 20:04:55,474][model8_pretrain.py][INFO] Epoch:[0/2](250700/4588595) loss:2.812 lr:0.0000100 epoch_Time:27427.0min: [2024-01-03 
20:04:55,474][model8_pretrain.py][INFO] Epoch:[0/2](250700/4588595) loss:2.121 lr:0.0000100 epoch_Time:27427.0min: [2024-01-03 20:04:55,474][model8_pretrain.py][INFO] Epoch:[0/2](250700/4588595) loss:3.272 lr:0.0000100 epoch_Time:27427.0min: [2024-01-03 20:05:32,424][model8_pretrain.py][INFO] Epoch:[0/2](250800/4588595) loss:2.931 lr:0.0000100 epoch_Time:27427.0min: [2024-01-03 20:05:32,424][model8_pretrain.py][INFO] Epoch:[0/2](250800/4588595) loss:2.826 lr:0.0000100 epoch_Time:27427.0min: [2024-01-03 20:05:32,424][model8_pretrain.py][INFO] Epoch:[0/2](250800/4588595) loss:3.492 lr:0.0000100 epoch_Time:27427.0min: [2024-01-03 20:05:32,424][model8_pretrain.py][INFO] Epoch:[0/2](250800/4588595) loss:3.513 lr:0.0000100 epoch_Time:27427.0min: [2024-01-03 20:05:32,424][model8_pretrain.py][INFO] Epoch:[0/2](250800/4588595) loss:3.174 lr:0.0000100 epoch_Time:27427.0min: [2024-01-03 20:05:32,424][model8_pretrain.py][INFO] Epoch:[0/2](250800/4588595) loss:2.671 lr:0.0000100 epoch_Time:27427.0min: [2024-01-03 20:05:32,424][model8_pretrain.py][INFO] Epoch:[0/2](250800/4588595) loss:3.240 lr:0.0000100 epoch_Time:27427.0min: [2024-01-03 20:05:32,425][model8_pretrain.py][INFO] Epoch:[0/2](250800/4588595) loss:2.901 lr:0.0000100 epoch_Time:27427.0min: [2024-01-03 20:06:09,354][model8_pretrain.py][INFO] Epoch:[0/2](250900/4588595) loss:2.445 lr:0.0000100 epoch_Time:27426.0min: [2024-01-03 20:06:09,354][model8_pretrain.py][INFO] Epoch:[0/2](250900/4588595) loss:2.877 lr:0.0000100 epoch_Time:27426.0min: [2024-01-03 20:06:09,354][model8_pretrain.py][INFO] Epoch:[0/2](250900/4588595) loss:2.979 lr:0.0000100 epoch_Time:27426.0min: [2024-01-03 20:06:09,354][model8_pretrain.py][INFO] Epoch:[0/2](250900/4588595) loss:3.154 lr:0.0000100 epoch_Time:27426.0min: [2024-01-03 20:06:09,354][model8_pretrain.py][INFO] Epoch:[0/2](250900/4588595) loss:2.615 lr:0.0000100 epoch_Time:27426.0min: [2024-01-03 20:06:09,354][model8_pretrain.py][INFO] Epoch:[0/2](250900/4588595) loss:2.821 lr:0.0000100 
epoch_Time:27426.0min: [2024-01-03 20:06:09,354][model8_pretrain.py][INFO] Epoch:[0/2](250900/4588595) loss:2.730 lr:0.0000100 epoch_Time:27426.0min: [2024-01-03 20:06:09,355][model8_pretrain.py][INFO] Epoch:[0/2](250900/4588595) loss:2.528 lr:0.0000100 epoch_Time:27426.0min: [2024-01-03 20:06:46,288][model8_pretrain.py][INFO] Epoch:[0/2](251000/4588595) loss:3.125 lr:0.0000100 epoch_Time:27425.0min: [2024-01-03 20:06:46,288][model8_pretrain.py][INFO] Epoch:[0/2](251000/4588595) loss:3.372 lr:0.0000100 epoch_Time:27425.0min: [2024-01-03 20:06:46,288][model8_pretrain.py][INFO] Epoch:[0/2](251000/4588595) loss:2.941 lr:0.0000100 epoch_Time:27425.0min: [2024-01-03 20:06:46,289][model8_pretrain.py][INFO] Epoch:[0/2](251000/4588595) loss:2.556 lr:0.0000100 epoch_Time:27425.0min: [2024-01-03 20:06:46,289][model8_pretrain.py][INFO] Epoch:[0/2](251000/4588595) loss:3.214 lr:0.0000100 epoch_Time:27425.0min: [2024-01-03 20:06:46,289][model8_pretrain.py][INFO] Epoch:[0/2](251000/4588595) loss:2.841 lr:0.0000100 epoch_Time:27425.0min: [2024-01-03 20:06:46,289][model8_pretrain.py][INFO] Epoch:[0/2](251000/4588595) loss:2.407 lr:0.0000100 epoch_Time:27425.0min: [2024-01-03 20:06:46,290][model8_pretrain.py][INFO] Epoch:[0/2](251000/4588595) loss:3.240 lr:0.0000100 epoch_Time:27425.0min: [2024-01-03 20:07:23,224][model8_pretrain.py][INFO] Epoch:[0/2](251100/4588595) loss:2.894 lr:0.0000100 epoch_Time:27424.0min: [2024-01-03 20:07:23,224][model8_pretrain.py][INFO] Epoch:[0/2](251100/4588595) loss:2.868 lr:0.0000100 epoch_Time:27424.0min: [2024-01-03 20:07:23,223][model8_pretrain.py][INFO] Epoch:[0/2](251100/4588595) loss:2.721 lr:0.0000100 epoch_Time:27424.0min: [2024-01-03 20:07:23,224][model8_pretrain.py][INFO] Epoch:[0/2](251100/4588595) loss:2.655 lr:0.0000100 epoch_Time:27424.0min: [2024-01-03 20:07:23,224][model8_pretrain.py][INFO] Epoch:[0/2](251100/4588595) loss:3.016 lr:0.0000100 epoch_Time:27424.0min: [2024-01-03 20:07:23,224][model8_pretrain.py][INFO] 
Epoch:[0/2](251100/4588595) loss:2.750 lr:0.0000100 epoch_Time:27424.0min: [2024-01-03 20:07:23,224][model8_pretrain.py][INFO] Epoch:[0/2](251100/4588595) loss:3.078 lr:0.0000100 epoch_Time:27424.0min: [2024-01-03 20:07:23,224][model8_pretrain.py][INFO] Epoch:[0/2](251100/4588595) loss:2.196 lr:0.0000100 epoch_Time:27424.0min: [2024-01-03 20:08:01,895][model8_pretrain.py][INFO] Epoch:[0/2](251200/4588595) loss:3.343 lr:0.0000100 epoch_Time:27423.0min: [2024-01-03 20:08:01,895][model8_pretrain.py][INFO] Epoch:[0/2](251200/4588595) loss:3.166 lr:0.0000100 epoch_Time:27423.0min: [2024-01-03 20:08:01,895][model8_pretrain.py][INFO] Epoch:[0/2](251200/4588595) loss:3.153 lr:0.0000100 epoch_Time:27423.0min: [2024-01-03 20:08:01,895][model8_pretrain.py][INFO] Epoch:[0/2](251200/4588595) loss:2.488 lr:0.0000100 epoch_Time:27423.0min: [2024-01-03 20:08:01,895][model8_pretrain.py][INFO] Epoch:[0/2](251200/4588595) loss:3.133 lr:0.0000100 epoch_Time:27423.0min: [2024-01-03 20:08:01,895][model8_pretrain.py][INFO] Epoch:[0/2](251200/4588595) loss:3.130 lr:0.0000100 epoch_Time:27423.0min: [2024-01-03 20:08:01,895][model8_pretrain.py][INFO] Epoch:[0/2](251200/4588595) loss:2.960 lr:0.0000100 epoch_Time:27423.0min: [2024-01-03 20:08:01,896][model8_pretrain.py][INFO] Epoch:[0/2](251200/4588595) loss:3.054 lr:0.0000100 epoch_Time:27423.0min: [2024-01-03 20:08:45,475][model8_pretrain.py][INFO] Epoch:[0/2](251300/4588595) loss:3.219 lr:0.0000100 epoch_Time:27425.0min: [2024-01-03 20:08:45,475][model8_pretrain.py][INFO] Epoch:[0/2](251300/4588595) loss:3.521 lr:0.0000100 epoch_Time:27425.0min: [2024-01-03 20:08:45,475][model8_pretrain.py][INFO] Epoch:[0/2](251300/4588595) loss:2.439 lr:0.0000100 epoch_Time:27425.0min: [2024-01-03 20:08:45,475][model8_pretrain.py][INFO] Epoch:[0/2](251300/4588595) loss:3.101 lr:0.0000100 epoch_Time:27425.0min: [2024-01-03 20:08:45,475][model8_pretrain.py][INFO] Epoch:[0/2](251300/4588595) loss:3.115 lr:0.0000100 epoch_Time:27425.0min: [2024-01-03 
20:08:45,475][model8_pretrain.py][INFO] Epoch:[0/2](251300/4588595) loss:2.685 lr:0.0000100 epoch_Time:27425.0min: [2024-01-03 20:08:45,475][model8_pretrain.py][INFO] Epoch:[0/2](251300/4588595) loss:3.190 lr:0.0000100 epoch_Time:27425.0min: [2024-01-03 20:08:45,476][model8_pretrain.py][INFO] Epoch:[0/2](251300/4588595) loss:2.499 lr:0.0000100 epoch_Time:27425.0min: [2024-01-03 20:09:22,401][model8_pretrain.py][INFO] Epoch:[0/2](251400/4588595) loss:2.866 lr:0.0000100 epoch_Time:27424.0min: [2024-01-03 20:09:22,401][model8_pretrain.py][INFO] Epoch:[0/2](251400/4588595) loss:2.960 lr:0.0000100 epoch_Time:27424.0min: [2024-01-03 20:09:22,401][model8_pretrain.py][INFO] Epoch:[0/2](251400/4588595) loss:3.516 lr:0.0000100 epoch_Time:27424.0min: [2024-01-03 20:09:22,401][model8_pretrain.py][INFO] Epoch:[0/2](251400/4588595) loss:2.228 lr:0.0000100 epoch_Time:27424.0min: [2024-01-03 20:09:22,401][model8_pretrain.py][INFO] Epoch:[0/2](251400/4588595) loss:1.994 lr:0.0000100 epoch_Time:27424.0min: [2024-01-03 20:09:22,401][model8_pretrain.py][INFO] Epoch:[0/2](251400/4588595) loss:2.620 lr:0.0000100 epoch_Time:27424.0min: [2024-01-03 20:09:22,401][model8_pretrain.py][INFO] Epoch:[0/2](251400/4588595) loss:2.881 lr:0.0000100 epoch_Time:27424.0min: [2024-01-03 20:09:22,401][model8_pretrain.py][INFO] Epoch:[0/2](251400/4588595) loss:2.804 lr:0.0000100 epoch_Time:27424.0min: [2024-01-03 20:09:59,342][model8_pretrain.py][INFO] Epoch:[0/2](251500/4588595) loss:2.936 lr:0.0000100 epoch_Time:27422.0min: [2024-01-03 20:09:59,342][model8_pretrain.py][INFO] Epoch:[0/2](251500/4588595) loss:3.215 lr:0.0000100 epoch_Time:27422.0min: [2024-01-03 20:09:59,342][model8_pretrain.py][INFO] Epoch:[0/2](251500/4588595) loss:2.956 lr:0.0000100 epoch_Time:27422.0min: [2024-01-03 20:09:59,342][model8_pretrain.py][INFO] Epoch:[0/2](251500/4588595) loss:2.872 lr:0.0000100 epoch_Time:27422.0min: [2024-01-03 20:09:59,342][model8_pretrain.py][INFO] Epoch:[0/2](251500/4588595) loss:2.844 lr:0.0000100 
epoch_Time:27422.0min: [2024-01-03 20:09:59,342][model8_pretrain.py][INFO] Epoch:[0/2](251500/4588595) loss:3.085 lr:0.0000100 epoch_Time:27422.0min: [2024-01-03 20:09:59,342][model8_pretrain.py][INFO] Epoch:[0/2](251500/4588595) loss:3.142 lr:0.0000100 epoch_Time:27422.0min: [2024-01-03 20:09:59,342][model8_pretrain.py][INFO] Epoch:[0/2](251500/4588595) loss:2.290 lr:0.0000100 epoch_Time:27422.0min: [2024-01-03 20:10:36,274][model8_pretrain.py][INFO] Epoch:[0/2](251600/4588595) loss:3.423 lr:0.0000100 epoch_Time:27422.0min: [2024-01-03 20:10:36,274][model8_pretrain.py][INFO] Epoch:[0/2](251600/4588595) loss:2.466 lr:0.0000100 epoch_Time:27422.0min: [2024-01-03 20:10:36,274][model8_pretrain.py][INFO] Epoch:[0/2](251600/4588595) loss:2.742 lr:0.0000100 epoch_Time:27422.0min: [2024-01-03 20:10:36,274][model8_pretrain.py][INFO] Epoch:[0/2](251600/4588595) loss:3.405 lr:0.0000100 epoch_Time:27422.0min: [2024-01-03 20:10:36,274][model8_pretrain.py][INFO] Epoch:[0/2](251600/4588595) loss:2.078 lr:0.0000100 epoch_Time:27422.0min: [2024-01-03 20:10:36,274][model8_pretrain.py][INFO] Epoch:[0/2](251600/4588595) loss:2.798 lr:0.0000100 epoch_Time:27422.0min: [2024-01-03 20:10:36,275][model8_pretrain.py][INFO] Epoch:[0/2](251600/4588595) loss:2.828 lr:0.0000100 epoch_Time:27422.0min: [2024-01-03 20:10:36,275][model8_pretrain.py][INFO] Epoch:[0/2](251600/4588595) loss:3.242 lr:0.0000100 epoch_Time:27422.0min: [2024-01-03 20:11:13,217][model8_pretrain.py][INFO] Epoch:[0/2](251700/4588595) loss:3.048 lr:0.0000100 epoch_Time:27421.0min: [2024-01-03 20:11:13,217][model8_pretrain.py][INFO] Epoch:[0/2](251700/4588595) loss:3.301 lr:0.0000100 epoch_Time:27421.0min: [2024-01-03 20:11:13,217][model8_pretrain.py][INFO] Epoch:[0/2](251700/4588595) loss:2.253 lr:0.0000100 epoch_Time:27421.0min: [2024-01-03 20:11:13,217][model8_pretrain.py][INFO] Epoch:[0/2](251700/4588595) loss:1.855 lr:0.0000100 epoch_Time:27421.0min: [2024-01-03 20:11:13,217][model8_pretrain.py][INFO] 
Epoch:[0/2](251700/4588595) loss:2.416 lr:0.0000100 epoch_Time:27421.0min: [2024-01-03 20:11:13,217][model8_pretrain.py][INFO] Epoch:[0/2](251700/4588595) loss:2.729 lr:0.0000100 epoch_Time:27421.0min: [2024-01-03 20:11:13,217][model8_pretrain.py][INFO] Epoch:[0/2](251700/4588595) loss:2.350 lr:0.0000100 epoch_Time:27421.0min: [2024-01-03 20:11:13,217][model8_pretrain.py][INFO] Epoch:[0/2](251700/4588595) loss:3.281 lr:0.0000100 epoch_Time:27421.0min: [2024-01-03 20:11:50,150][model8_pretrain.py][INFO] Epoch:[0/2](251800/4588595) loss:3.200 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:11:50,150][model8_pretrain.py][INFO] Epoch:[0/2](251800/4588595) loss:3.025 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:11:50,150][model8_pretrain.py][INFO] Epoch:[0/2](251800/4588595) loss:3.089 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:11:50,150][model8_pretrain.py][INFO] Epoch:[0/2](251800/4588595) loss:3.141 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:11:50,150][model8_pretrain.py][INFO] Epoch:[0/2](251800/4588595) loss:2.919 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:11:50,150][model8_pretrain.py][INFO] Epoch:[0/2](251800/4588595) loss:3.028 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:11:50,150][model8_pretrain.py][INFO] Epoch:[0/2](251800/4588595) loss:3.599 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:11:50,150][model8_pretrain.py][INFO] Epoch:[0/2](251800/4588595) loss:2.768 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:12:27,087][model8_pretrain.py][INFO] Epoch:[0/2](251900/4588595) loss:2.974 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:12:27,087][model8_pretrain.py][INFO] Epoch:[0/2](251900/4588595) loss:3.338 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:12:27,087][model8_pretrain.py][INFO] Epoch:[0/2](251900/4588595) loss:2.934 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:12:27,087][model8_pretrain.py][INFO] Epoch:[0/2](251900/4588595) loss:3.179 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 
20:12:27,087][model8_pretrain.py][INFO] Epoch:[0/2](251900/4588595) loss:2.812 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:12:27,087][model8_pretrain.py][INFO] Epoch:[0/2](251900/4588595) loss:2.958 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:12:27,087][model8_pretrain.py][INFO] Epoch:[0/2](251900/4588595) loss:2.506 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:12:27,088][model8_pretrain.py][INFO] Epoch:[0/2](251900/4588595) loss:2.658 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:13:05,798][model8_pretrain.py][INFO] Epoch:[0/2](252000/4588595) loss:3.156 lr:0.0000100 epoch_Time:27418.0min: [2024-01-03 20:13:05,798][model8_pretrain.py][INFO] Epoch:[0/2](252000/4588595) loss:2.882 lr:0.0000100 epoch_Time:27418.0min: [2024-01-03 20:13:05,798][model8_pretrain.py][INFO] Epoch:[0/2](252000/4588595) loss:3.011 lr:0.0000100 epoch_Time:27418.0min: [2024-01-03 20:13:05,798][model8_pretrain.py][INFO] Epoch:[0/2](252000/4588595) loss:2.611 lr:0.0000100 epoch_Time:27418.0min: [2024-01-03 20:13:05,798][model8_pretrain.py][INFO] Epoch:[0/2](252000/4588595) loss:3.157 lr:0.0000100 epoch_Time:27418.0min: [2024-01-03 20:13:05,798][model8_pretrain.py][INFO] Epoch:[0/2](252000/4588595) loss:3.237 lr:0.0000100 epoch_Time:27418.0min: [2024-01-03 20:13:05,798][model8_pretrain.py][INFO] Epoch:[0/2](252000/4588595) loss:2.823 lr:0.0000100 epoch_Time:27418.0min: [2024-01-03 20:13:05,798][model8_pretrain.py][INFO] Epoch:[0/2](252000/4588595) loss:2.990 lr:0.0000100 epoch_Time:27418.0min: [2024-01-03 20:13:49,356][model8_pretrain.py][INFO] Epoch:[0/2](252100/4588595) loss:2.885 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:13:49,356][model8_pretrain.py][INFO] Epoch:[0/2](252100/4588595) loss:2.476 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:13:49,356][model8_pretrain.py][INFO] Epoch:[0/2](252100/4588595) loss:3.476 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:13:49,356][model8_pretrain.py][INFO] Epoch:[0/2](252100/4588595) loss:2.830 lr:0.0000100 
epoch_Time:27419.0min: [2024-01-03 20:13:49,356][model8_pretrain.py][INFO] Epoch:[0/2](252100/4588595) loss:2.091 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:13:49,356][model8_pretrain.py][INFO] Epoch:[0/2](252100/4588595) loss:3.031 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:13:49,356][model8_pretrain.py][INFO] Epoch:[0/2](252100/4588595) loss:2.641 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:13:49,356][model8_pretrain.py][INFO] Epoch:[0/2](252100/4588595) loss:3.252 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:14:26,288][model8_pretrain.py][INFO] Epoch:[0/2](252200/4588595) loss:3.348 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:14:26,288][model8_pretrain.py][INFO] Epoch:[0/2](252200/4588595) loss:3.033 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:14:26,288][model8_pretrain.py][INFO] Epoch:[0/2](252200/4588595) loss:2.703 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:14:26,288][model8_pretrain.py][INFO] Epoch:[0/2](252200/4588595) loss:3.027 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:14:26,288][model8_pretrain.py][INFO] Epoch:[0/2](252200/4588595) loss:2.563 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:14:26,289][model8_pretrain.py][INFO] Epoch:[0/2](252200/4588595) loss:3.211 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:14:26,289][model8_pretrain.py][INFO] Epoch:[0/2](252200/4588595) loss:2.784 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:14:26,289][model8_pretrain.py][INFO] Epoch:[0/2](252200/4588595) loss:3.304 lr:0.0000100 epoch_Time:27419.0min: [2024-01-03 20:15:03,223][model8_pretrain.py][INFO] Epoch:[0/2](252300/4588595) loss:2.813 lr:0.0000100 epoch_Time:27417.0min: [2024-01-03 20:15:03,223][model8_pretrain.py][INFO] Epoch:[0/2](252300/4588595) loss:3.236 lr:0.0000100 epoch_Time:27417.0min: [2024-01-03 20:15:03,223][model8_pretrain.py][INFO] Epoch:[0/2](252300/4588595) loss:2.620 lr:0.0000100 epoch_Time:27417.0min: [2024-01-03 20:15:03,223][model8_pretrain.py][INFO] 
Epoch:[0/2](252300/4588595) loss:2.643 lr:0.0000100 epoch_Time:27417.0min: [2024-01-03 20:15:03,223][model8_pretrain.py][INFO] Epoch:[0/2](252300/4588595) loss:2.992 lr:0.0000100 epoch_Time:27417.0min: [2024-01-03 20:15:03,223][model8_pretrain.py][INFO] Epoch:[0/2](252300/4588595) loss:2.902 lr:0.0000100 epoch_Time:27417.0min: [2024-01-03 20:15:03,223][model8_pretrain.py][INFO] Epoch:[0/2](252300/4588595) loss:2.912 lr:0.0000100 epoch_Time:27417.0min: [2024-01-03 20:15:03,223][model8_pretrain.py][INFO] Epoch:[0/2](252300/4588595) loss:2.730 lr:0.0000100 epoch_Time:27417.0min: [2024-01-03 20:15:40,155][model8_pretrain.py][INFO] Epoch:[0/2](252400/4588595) loss:3.200 lr:0.0000100 epoch_Time:27417.0min: [2024-01-03 20:15:40,155][model8_pretrain.py][INFO] Epoch:[0/2](252400/4588595) loss:2.760 lr:0.0000100 epoch_Time:27417.0min: [2024-01-03 20:15:40,155][model8_pretrain.py][INFO] Epoch:[0/2](252400/4588595) loss:2.989 lr:0.0000100 epoch_Time:27417.0min: [2024-01-03 20:15:40,156][model8_pretrain.py][INFO] Epoch:[0/2](252400/4588595) loss:2.996 lr:0.0000100 epoch_Time:27417.0min: [2024-01-03 20:15:40,156][model8_pretrain.py][INFO] Epoch:[0/2](252400/4588595) loss:3.033 lr:0.0000100 epoch_Time:27417.0min: [2024-01-03 20:15:40,156][model8_pretrain.py][INFO] Epoch:[0/2](252400/4588595) loss:2.652 lr:0.0000100 epoch_Time:27417.0min: [2024-01-03 20:15:40,156][model8_pretrain.py][INFO] Epoch:[0/2](252400/4588595) loss:3.072 lr:0.0000100 epoch_Time:27417.0min: [2024-01-03 20:15:40,156][model8_pretrain.py][INFO] Epoch:[0/2](252400/4588595) loss:3.180 lr:0.0000100 epoch_Time:27417.0min: [2024-01-03 20:16:17,092][model8_pretrain.py][INFO] Epoch:[0/2](252500/4588595) loss:2.789 lr:0.0000100 epoch_Time:27416.0min: [2024-01-03 20:16:17,092][model8_pretrain.py][INFO] Epoch:[0/2](252500/4588595) loss:3.355 lr:0.0000100 epoch_Time:27416.0min: [2024-01-03 20:16:17,092][model8_pretrain.py][INFO] Epoch:[0/2](252500/4588595) loss:3.062 lr:0.0000100 epoch_Time:27416.0min: [2024-01-03 
20:16:17,092][model8_pretrain.py][INFO] Epoch:[0/2](252500/4588595) loss:3.425 lr:0.0000100 epoch_Time:27416.0min: [2024-01-03 20:16:17,092][model8_pretrain.py][INFO] Epoch:[0/2](252500/4588595) loss:2.777 lr:0.0000100 epoch_Time:27416.0min: [2024-01-03 20:16:17,092][model8_pretrain.py][INFO] Epoch:[0/2](252500/4588595) loss:3.127 lr:0.0000100 epoch_Time:27416.0min: [2024-01-03 20:16:17,092][model8_pretrain.py][INFO] Epoch:[0/2](252500/4588595) loss:2.161 lr:0.0000100 epoch_Time:27416.0min: [2024-01-03 20:16:17,092][model8_pretrain.py][INFO] Epoch:[0/2](252500/4588595) loss:3.189 lr:0.0000100 epoch_Time:27416.0min: [2024-01-03 20:16:54,011][model8_pretrain.py][INFO] Epoch:[0/2](252600/4588595) loss:2.486 lr:0.0000100 epoch_Time:27415.0min: [2024-01-03 20:16:54,011][model8_pretrain.py][INFO] Epoch:[0/2](252600/4588595) loss:2.897 lr:0.0000100 epoch_Time:27415.0min: [2024-01-03 20:16:54,011][model8_pretrain.py][INFO] Epoch:[0/2](252600/4588595) loss:2.384 lr:0.0000100 epoch_Time:27415.0min: [2024-01-03 20:16:54,011][model8_pretrain.py][INFO] Epoch:[0/2](252600/4588595) loss:2.982 lr:0.0000100 epoch_Time:27415.0min: [2024-01-03 20:16:54,011][model8_pretrain.py][INFO] Epoch:[0/2](252600/4588595) loss:3.168 lr:0.0000100 epoch_Time:27415.0min: [2024-01-03 20:16:54,011][model8_pretrain.py][INFO] Epoch:[0/2](252600/4588595) loss:3.103 lr:0.0000100 epoch_Time:27415.0min: [2024-01-03 20:16:54,011][model8_pretrain.py][INFO] Epoch:[0/2](252600/4588595) loss:2.621 lr:0.0000100 epoch_Time:27415.0min: [2024-01-03 20:16:54,011][model8_pretrain.py][INFO] Epoch:[0/2](252600/4588595) loss:2.523 lr:0.0000100 epoch_Time:27415.0min: [2024-01-03 20:17:30,947][model8_pretrain.py][INFO] Epoch:[0/2](252700/4588595) loss:2.904 lr:0.0000100 epoch_Time:27414.0min: [2024-01-03 20:17:30,947][model8_pretrain.py][INFO] Epoch:[0/2](252700/4588595) loss:2.960 lr:0.0000100 epoch_Time:27414.0min: [2024-01-03 20:17:30,947][model8_pretrain.py][INFO] Epoch:[0/2](252700/4588595) loss:2.653 lr:0.0000100 
epoch_Time:27414.0min: [2024-01-03 20:17:30,947][model8_pretrain.py][INFO] Epoch:[0/2](252700/4588595) loss:2.980 lr:0.0000100 epoch_Time:27414.0min: [2024-01-03 20:17:30,947][model8_pretrain.py][INFO] Epoch:[0/2](252700/4588595) loss:2.998 lr:0.0000100 epoch_Time:27414.0min: [2024-01-03 20:17:30,947][model8_pretrain.py][INFO] Epoch:[0/2](252700/4588595) loss:3.094 lr:0.0000100 epoch_Time:27414.0min: [2024-01-03 20:17:30,947][model8_pretrain.py][INFO] Epoch:[0/2](252700/4588595) loss:2.821 lr:0.0000100 epoch_Time:27414.0min: [2024-01-03 20:17:30,947][model8_pretrain.py][INFO] Epoch:[0/2](252700/4588595) loss:3.308 lr:0.0000100 epoch_Time:27414.0min: [2024-01-03 20:18:09,600][model8_pretrain.py][INFO] Epoch:[0/2](252800/4588595) loss:3.207 lr:0.0000100 epoch_Time:27413.0min: [2024-01-03 20:18:09,600][model8_pretrain.py][INFO] Epoch:[0/2](252800/4588595) loss:2.927 lr:0.0000100 epoch_Time:27413.0min: [2024-01-03 20:18:09,600][model8_pretrain.py][INFO] Epoch:[0/2](252800/4588595) loss:2.977 lr:0.0000100 epoch_Time:27413.0min: [2024-01-03 20:18:09,600][model8_pretrain.py][INFO] Epoch:[0/2](252800/4588595) loss:2.491 lr:0.0000100 epoch_Time:27413.0min: [2024-01-03 20:18:09,600][model8_pretrain.py][INFO] Epoch:[0/2](252800/4588595) loss:2.999 lr:0.0000100 epoch_Time:27413.0min: [2024-01-03 20:18:09,605][model8_pretrain.py][INFO] Epoch:[0/2](252800/4588595) loss:2.912 lr:0.0000100 epoch_Time:27413.0min: [2024-01-03 20:18:09,605][model8_pretrain.py][INFO] Epoch:[0/2](252800/4588595) loss:3.292 lr:0.0000100 epoch_Time:27413.0min: [2024-01-03 20:18:09,605][model8_pretrain.py][INFO] Epoch:[0/2](252800/4588595) loss:2.438 lr:0.0000100 epoch_Time:27413.0min: [2024-01-03 20:18:53,277][model8_pretrain.py][INFO] Epoch:[0/2](252900/4588595) loss:3.265 lr:0.0000100 epoch_Time:27414.0min: [2024-01-03 20:18:53,277][model8_pretrain.py][INFO] Epoch:[0/2](252900/4588595) loss:3.463 lr:0.0000100 epoch_Time:27414.0min: [2024-01-03 20:18:53,277][model8_pretrain.py][INFO] 
Epoch:[0/2](252900/4588595) loss:2.824 lr:0.0000100 epoch_Time:27414.0min: [2024-01-03 20:18:53,277][model8_pretrain.py][INFO] Epoch:[0/2](252900/4588595) loss:3.331 lr:0.0000100 epoch_Time:27414.0min: [2024-01-03 20:18:53,277][model8_pretrain.py][INFO] Epoch:[0/2](252900/4588595) loss:3.246 lr:0.0000100 epoch_Time:27414.0min: [2024-01-03 20:18:53,277][model8_pretrain.py][INFO] Epoch:[0/2](252900/4588595) loss:2.393 lr:0.0000100 epoch_Time:27414.0min: [2024-01-03 20:18:53,277][model8_pretrain.py][INFO] Epoch:[0/2](252900/4588595) loss:3.107 lr:0.0000100 epoch_Time:27414.0min: [2024-01-03 20:18:53,277][model8_pretrain.py][INFO] Epoch:[0/2](252900/4588595) loss:2.982 lr:0.0000100 epoch_Time:27414.0min: [2024-01-03 20:19:30,217][model8_pretrain.py][INFO] Epoch:[0/2](253000/4588595) loss:3.077 lr:0.0000100 epoch_Time:27414.0min: [2024-01-03 20:19:30,217][model8_pretrain.py][INFO] Epoch:[0/2](253000/4588595) loss:2.995 lr:0.0000100 epoch_Time:27414.0min: [2024-01-03 20:19:30,217][model8_pretrain.py][INFO] Epoch:[0/2](253000/4588595) loss:2.947 lr:0.0000100 epoch_Time:27414.0min: [2024-01-03 20:19:30,217][model8_pretrain.py][INFO] Epoch:[0/2](253000/4588595) loss:2.782 lr:0.0000100 epoch_Time:27414.0min: [2024-01-03 20:19:30,217][model8_pretrain.py][INFO] Epoch:[0/2](253000/4588595) loss:2.511 lr:0.0000100 epoch_Time:27414.0min: [2024-01-03 20:19:30,217][model8_pretrain.py][INFO] Epoch:[0/2](253000/4588595) loss:2.974 lr:0.0000100 epoch_Time:27414.0min: [2024-01-03 20:19:30,217][model8_pretrain.py][INFO] Epoch:[0/2](253000/4588595) loss:2.781 lr:0.0000100 epoch_Time:27414.0min: [2024-01-03 20:19:30,217][model8_pretrain.py][INFO] Epoch:[0/2](253000/4588595) loss:2.668 lr:0.0000100 epoch_Time:27414.0min: [2024-01-03 20:20:07,145][model8_pretrain.py][INFO] Epoch:[0/2](253100/4588595) loss:2.656 lr:0.0000100 epoch_Time:27413.0min: [2024-01-03 20:20:07,145][model8_pretrain.py][INFO] Epoch:[0/2](253100/4588595) loss:2.447 lr:0.0000100 epoch_Time:27413.0min: [2024-01-03 
20:20:07,145][model8_pretrain.py][INFO] Epoch:[0/2](253100/4588595) loss:2.944 lr:0.0000100 epoch_Time:27413.0min: [2024-01-03 20:20:07,145][model8_pretrain.py][INFO] Epoch:[0/2](253100/4588595) loss:3.136 lr:0.0000100 epoch_Time:27413.0min: [2024-01-03 20:20:07,145][model8_pretrain.py][INFO] Epoch:[0/2](253100/4588595) loss:2.632 lr:0.0000100 epoch_Time:27413.0min: [2024-01-03 20:20:07,145][model8_pretrain.py][INFO] Epoch:[0/2](253100/4588595) loss:3.185 lr:0.0000100 epoch_Time:27413.0min: [2024-01-03 20:20:07,145][model8_pretrain.py][INFO] Epoch:[0/2](253100/4588595) loss:2.819 lr:0.0000100 epoch_Time:27413.0min: [2024-01-03 20:20:07,145][model8_pretrain.py][INFO] Epoch:[0/2](253100/4588595) loss:2.871 lr:0.0000100 epoch_Time:27413.0min: [2024-01-03 20:20:44,068][model8_pretrain.py][INFO] Epoch:[0/2](253200/4588595) loss:3.044 lr:0.0000100 epoch_Time:27412.0min: [2024-01-03 20:20:44,068][model8_pretrain.py][INFO] Epoch:[0/2](253200/4588595) loss:2.623 lr:0.0000100 epoch_Time:27412.0min: [2024-01-03 20:20:44,068][model8_pretrain.py][INFO] Epoch:[0/2](253200/4588595) loss:2.874 lr:0.0000100 epoch_Time:27412.0min: [2024-01-03 20:20:44,068][model8_pretrain.py][INFO] Epoch:[0/2](253200/4588595) loss:3.182 lr:0.0000100 epoch_Time:27412.0min: [2024-01-03 20:20:44,068][model8_pretrain.py][INFO] Epoch:[0/2](253200/4588595) loss:2.942 lr:0.0000100 epoch_Time:27412.0min: [2024-01-03 20:20:44,068][model8_pretrain.py][INFO] Epoch:[0/2](253200/4588595) loss:3.019 lr:0.0000100 epoch_Time:27412.0min: [2024-01-03 20:20:44,068][model8_pretrain.py][INFO] Epoch:[0/2](253200/4588595) loss:2.849 lr:0.0000100 epoch_Time:27412.0min: [2024-01-03 20:20:44,068][model8_pretrain.py][INFO] Epoch:[0/2](253200/4588595) loss:2.689 lr:0.0000100 epoch_Time:27412.0min: [2024-01-03 20:21:20,979][model8_pretrain.py][INFO] Epoch:[0/2](253300/4588595) loss:2.588 lr:0.0000100 epoch_Time:27411.0min: [2024-01-03 20:21:20,979][model8_pretrain.py][INFO] Epoch:[0/2](253300/4588595) loss:2.548 lr:0.0000100 
epoch_Time:27411.0min: [2024-01-03 20:21:20,979][model8_pretrain.py][INFO] Epoch:[0/2](253300/4588595) loss:2.662 lr:0.0000100 epoch_Time:27411.0min: [2024-01-03 20:21:20,979][model8_pretrain.py][INFO] Epoch:[0/2](253300/4588595) loss:3.074 lr:0.0000100 epoch_Time:27411.0min: [2024-01-03 20:21:20,979][model8_pretrain.py][INFO] Epoch:[0/2](253300/4588595) loss:2.281 lr:0.0000100 epoch_Time:27411.0min: [2024-01-03 20:21:20,979][model8_pretrain.py][INFO] Epoch:[0/2](253300/4588595) loss:2.518 lr:0.0000100 epoch_Time:27411.0min: [2024-01-03 20:21:20,979][model8_pretrain.py][INFO] Epoch:[0/2](253300/4588595) loss:2.899 lr:0.0000100 epoch_Time:27411.0min: [2024-01-03 20:21:20,979][model8_pretrain.py][INFO] Epoch:[0/2](253300/4588595) loss:3.330 lr:0.0000100 epoch_Time:27411.0min: [2024-01-03 20:21:57,879][model8_pretrain.py][INFO] Epoch:[0/2](253400/4588595) loss:2.794 lr:0.0000100 epoch_Time:27410.0min: [2024-01-03 20:21:57,879][model8_pretrain.py][INFO] Epoch:[0/2](253400/4588595) loss:3.110 lr:0.0000100 epoch_Time:27410.0min: [2024-01-03 20:21:57,879][model8_pretrain.py][INFO] Epoch:[0/2](253400/4588595) loss:2.934 lr:0.0000100 epoch_Time:27410.0min: [2024-01-03 20:21:57,879][model8_pretrain.py][INFO] Epoch:[0/2](253400/4588595) loss:2.986 lr:0.0000100 epoch_Time:27410.0min: [2024-01-03 20:21:57,879][model8_pretrain.py][INFO] Epoch:[0/2](253400/4588595) loss:2.894 lr:0.0000100 epoch_Time:27410.0min: [2024-01-03 20:21:57,879][model8_pretrain.py][INFO] Epoch:[0/2](253400/4588595) loss:3.120 lr:0.0000100 epoch_Time:27410.0min: [2024-01-03 20:21:57,879][model8_pretrain.py][INFO] Epoch:[0/2](253400/4588595) loss:2.874 lr:0.0000100 epoch_Time:27410.0min: [2024-01-03 20:21:57,879][model8_pretrain.py][INFO] Epoch:[0/2](253400/4588595) loss:2.819 lr:0.0000100 epoch_Time:27410.0min: [2024-01-03 20:22:34,812][model8_pretrain.py][INFO] Epoch:[0/2](253500/4588595) loss:3.045 lr:0.0000100 epoch_Time:27409.0min: [2024-01-03 20:22:34,812][model8_pretrain.py][INFO] 
Epoch:[0/2](253500/4588595) loss:2.772 lr:0.0000100 epoch_Time:27409.0min: [2024-01-03 20:22:34,812][model8_pretrain.py][INFO] Epoch:[0/2](253500/4588595) loss:2.783 lr:0.0000100 epoch_Time:27409.0min: [2024-01-03 20:22:34,812][model8_pretrain.py][INFO] Epoch:[0/2](253500/4588595) loss:3.116 lr:0.0000100 epoch_Time:27409.0min: [2024-01-03 20:22:34,812][model8_pretrain.py][INFO] Epoch:[0/2](253500/4588595) loss:2.947 lr:0.0000100 epoch_Time:27409.0min: [2024-01-03 20:22:34,812][model8_pretrain.py][INFO] Epoch:[0/2](253500/4588595) loss:2.630 lr:0.0000100 epoch_Time:27409.0min: [2024-01-03 20:22:34,812][model8_pretrain.py][INFO] Epoch:[0/2](253500/4588595) loss:3.111 lr:0.0000100 epoch_Time:27409.0min: [2024-01-03 20:22:34,812][model8_pretrain.py][INFO] Epoch:[0/2](253500/4588595) loss:3.138 lr:0.0000100 epoch_Time:27409.0min: [2024-01-03 20:23:11,739][model8_pretrain.py][INFO] Epoch:[0/2](253600/4588595) loss:2.439 lr:0.0000100 epoch_Time:27408.0min: [2024-01-03 20:23:11,739][model8_pretrain.py][INFO] Epoch:[0/2](253600/4588595) loss:2.265 lr:0.0000100 epoch_Time:27408.0min: [2024-01-03 20:23:11,739][model8_pretrain.py][INFO] Epoch:[0/2](253600/4588595) loss:3.131 lr:0.0000100 epoch_Time:27408.0min: [2024-01-03 20:23:11,739][model8_pretrain.py][INFO] Epoch:[0/2](253600/4588595) loss:2.664 lr:0.0000100 epoch_Time:27408.0min: [2024-01-03 20:23:11,739][model8_pretrain.py][INFO] Epoch:[0/2](253600/4588595) loss:2.955 lr:0.0000100 epoch_Time:27408.0min: [2024-01-03 20:23:11,739][model8_pretrain.py][INFO] Epoch:[0/2](253600/4588595) loss:2.772 lr:0.0000100 epoch_Time:27408.0min: [2024-01-03 20:23:11,740][model8_pretrain.py][INFO] Epoch:[0/2](253600/4588595) loss:3.008 lr:0.0000100 epoch_Time:27408.0min: [2024-01-03 20:23:11,740][model8_pretrain.py][INFO] Epoch:[0/2](253600/4588595) loss:2.854 lr:0.0000100 epoch_Time:27408.0min: [2024-01-03 20:23:57,099][model8_pretrain.py][INFO] Epoch:[0/2](253700/4588595) loss:3.195 lr:0.0000100 epoch_Time:27409.0min: [2024-01-03 
20:23:57,099][model8_pretrain.py][INFO] Epoch:[0/2](253700/4588595) loss:3.166 lr:0.0000100 epoch_Time:27409.0min: [2024-01-03 20:23:57,099][model8_pretrain.py][INFO] Epoch:[0/2](253700/4588595) loss:2.771 lr:0.0000100 epoch_Time:27409.0min: [2024-01-03 20:23:57,099][model8_pretrain.py][INFO] Epoch:[0/2](253700/4588595) loss:2.874 lr:0.0000100 epoch_Time:27409.0min: [2024-01-03 20:23:57,099][model8_pretrain.py][INFO] Epoch:[0/2](253700/4588595) loss:2.801 lr:0.0000100 epoch_Time:27409.0min: [2024-01-03 20:23:57,099][model8_pretrain.py][INFO] Epoch:[0/2](253700/4588595) loss:2.904 lr:0.0000100 epoch_Time:27409.0min: [2024-01-03 20:23:57,100][model8_pretrain.py][INFO] Epoch:[0/2](253700/4588595) loss:2.344 lr:0.0000100 epoch_Time:27409.0min: [2024-01-03 20:23:57,100][model8_pretrain.py][INFO] Epoch:[0/2](253700/4588595) loss:2.933 lr:0.0000100 epoch_Time:27409.0min: [2024-01-03 20:24:34,029][model8_pretrain.py][INFO] Epoch:[0/2](253800/4588595) loss:3.079 lr:0.0000100 epoch_Time:27409.0min: [2024-01-03 20:24:34,029][model8_pretrain.py][INFO] Epoch:[0/2](253800/4588595) loss:3.317 lr:0.0000100 epoch_Time:27409.0min: [2024-01-03 20:24:34,029][model8_pretrain.py][INFO] Epoch:[0/2](253800/4588595) loss:2.884 lr:0.0000100 epoch_Time:27409.0min: [2024-01-03 20:24:34,029][model8_pretrain.py][INFO] Epoch:[0/2](253800/4588595) loss:3.326 lr:0.0000100 epoch_Time:27409.0min: [2024-01-03 20:24:34,029][model8_pretrain.py][INFO] Epoch:[0/2](253800/4588595) loss:2.585 lr:0.0000100 epoch_Time:27409.0min: [2024-01-03 20:24:34,029][model8_pretrain.py][INFO] Epoch:[0/2](253800/4588595) loss:3.164 lr:0.0000100 epoch_Time:27409.0min: [2024-01-03 20:24:34,029][model8_pretrain.py][INFO] Epoch:[0/2](253800/4588595) loss:3.102 lr:0.0000100 epoch_Time:27409.0min: [2024-01-03 20:24:34,029][model8_pretrain.py][INFO] Epoch:[0/2](253800/4588595) loss:3.105 lr:0.0000100 epoch_Time:27409.0min: [2024-01-03 20:25:10,974][model8_pretrain.py][INFO] Epoch:[0/2](253900/4588595) loss:2.692 lr:0.0000100 
epoch_Time:27408.0min: [2024-01-03 20:25:10,974][model8_pretrain.py][INFO] Epoch:[0/2](253900/4588595) loss:3.458 lr:0.0000100 epoch_Time:27408.0min: [2024-01-03 20:25:10,974][model8_pretrain.py][INFO] Epoch:[0/2](253900/4588595) loss:2.873 lr:0.0000100 epoch_Time:27408.0min: [2024-01-03 20:25:10,974][model8_pretrain.py][INFO] Epoch:[0/2](253900/4588595) loss:2.798 lr:0.0000100 epoch_Time:27408.0min: [2024-01-03 20:25:10,974][model8_pretrain.py][INFO] Epoch:[0/2](253900/4588595) loss:2.781 lr:0.0000100 epoch_Time:27408.0min: [2024-01-03 20:25:10,974][model8_pretrain.py][INFO] Epoch:[0/2](253900/4588595) loss:3.043 lr:0.0000100 epoch_Time:27408.0min: [2024-01-03 20:25:10,974][model8_pretrain.py][INFO] Epoch:[0/2](253900/4588595) loss:3.275 lr:0.0000100 epoch_Time:27408.0min: [2024-01-03 20:25:10,974][model8_pretrain.py][INFO] Epoch:[0/2](253900/4588595) loss:2.517 lr:0.0000100 epoch_Time:27408.0min: [2024-01-03 20:25:47,908][model8_pretrain.py][INFO] Epoch:[0/2](254000/4588595) loss:3.275 lr:0.0000100 epoch_Time:27406.0min: [2024-01-03 20:25:47,908][model8_pretrain.py][INFO] Epoch:[0/2](254000/4588595) loss:2.326 lr:0.0000100 epoch_Time:27406.0min: [2024-01-03 20:25:47,908][model8_pretrain.py][INFO] Epoch:[0/2](254000/4588595) loss:2.982 lr:0.0000100 epoch_Time:27406.0min: [2024-01-03 20:25:47,908][model8_pretrain.py][INFO] Epoch:[0/2](254000/4588595) loss:2.758 lr:0.0000100 epoch_Time:27406.0min: [2024-01-03 20:25:47,908][model8_pretrain.py][INFO] Epoch:[0/2](254000/4588595) loss:2.981 lr:0.0000100 epoch_Time:27406.0min: [2024-01-03 20:25:47,908][model8_pretrain.py][INFO] Epoch:[0/2](254000/4588595) loss:3.060 lr:0.0000100 epoch_Time:27406.0min: [2024-01-03 20:25:47,908][model8_pretrain.py][INFO] Epoch:[0/2](254000/4588595) loss:3.030 lr:0.0000100 epoch_Time:27406.0min: [2024-01-03 20:25:47,908][model8_pretrain.py][INFO] Epoch:[0/2](254000/4588595) loss:3.141 lr:0.0000100 epoch_Time:27406.0min: [2024-01-03 20:26:24,835][model8_pretrain.py][INFO] 
Epoch:[0/2](254100/4588595) loss:3.118 lr:0.0000100 epoch_Time:27406.0min: [2024-01-03 20:26:24,835][model8_pretrain.py][INFO] Epoch:[0/2](254100/4588595) loss:3.031 lr:0.0000100 epoch_Time:27406.0min: [2024-01-03 20:26:24,836][model8_pretrain.py][INFO] Epoch:[0/2](254100/4588595) loss:2.382 lr:0.0000100 epoch_Time:27406.0min: [2024-01-03 20:26:24,836][model8_pretrain.py][INFO] Epoch:[0/2](254100/4588595) loss:2.725 lr:0.0000100 epoch_Time:27406.0min: [2024-01-03 20:26:24,836][model8_pretrain.py][INFO] Epoch:[0/2](254100/4588595) loss:3.134 lr:0.0000100 epoch_Time:27406.0min: [2024-01-03 20:26:24,836][model8_pretrain.py][INFO] Epoch:[0/2](254100/4588595) loss:2.911 lr:0.0000100 epoch_Time:27406.0min: [2024-01-03 20:26:24,836][model8_pretrain.py][INFO] Epoch:[0/2](254100/4588595) loss:2.898 lr:0.0000100 epoch_Time:27406.0min: [2024-01-03 20:26:24,836][model8_pretrain.py][INFO] Epoch:[0/2](254100/4588595) loss:2.951 lr:0.0000100 epoch_Time:27406.0min: [2024-01-03 20:27:01,770][model8_pretrain.py][INFO] Epoch:[0/2](254200/4588595) loss:2.769 lr:0.0000100 epoch_Time:27405.0min: [2024-01-03 20:27:01,770][model8_pretrain.py][INFO] Epoch:[0/2](254200/4588595) loss:2.819 lr:0.0000100 epoch_Time:27405.0min: [2024-01-03 20:27:01,770][model8_pretrain.py][INFO] Epoch:[0/2](254200/4588595) loss:3.146 lr:0.0000100 epoch_Time:27405.0min: [2024-01-03 20:27:01,770][model8_pretrain.py][INFO] Epoch:[0/2](254200/4588595) loss:2.918 lr:0.0000100 epoch_Time:27405.0min: [2024-01-03 20:27:01,770][model8_pretrain.py][INFO] Epoch:[0/2](254200/4588595) loss:2.617 lr:0.0000100 epoch_Time:27405.0min: [2024-01-03 20:27:01,770][model8_pretrain.py][INFO] Epoch:[0/2](254200/4588595) loss:2.581 lr:0.0000100 epoch_Time:27405.0min: [2024-01-03 20:27:01,770][model8_pretrain.py][INFO] Epoch:[0/2](254200/4588595) loss:2.332 lr:0.0000100 epoch_Time:27405.0min: [2024-01-03 20:27:01,770][model8_pretrain.py][INFO] Epoch:[0/2](254200/4588595) loss:2.961 lr:0.0000100 epoch_Time:27405.0min: [2024-01-03 
20:27:38,709][model8_pretrain.py][INFO] Epoch:[0/2](254300/4588595) loss:2.718 lr:0.0000100 epoch_Time:27404.0min: [2024-01-03 20:27:38,709][model8_pretrain.py][INFO] Epoch:[0/2](254300/4588595) loss:2.607 lr:0.0000100 epoch_Time:27404.0min: [2024-01-03 20:27:38,709][model8_pretrain.py][INFO] Epoch:[0/2](254300/4588595) loss:3.431 lr:0.0000100 epoch_Time:27405.0min: [2024-01-03 20:27:38,709][model8_pretrain.py][INFO] Epoch:[0/2](254300/4588595) loss:2.398 lr:0.0000100 epoch_Time:27405.0min: [2024-01-03 20:27:38,709][model8_pretrain.py][INFO] Epoch:[0/2](254300/4588595) loss:3.114 lr:0.0000100 epoch_Time:27405.0min: [2024-01-03 20:27:38,709][model8_pretrain.py][INFO] Epoch:[0/2](254300/4588595) loss:3.344 lr:0.0000100 epoch_Time:27405.0min: [2024-01-03 20:27:38,710][model8_pretrain.py][INFO] Epoch:[0/2](254300/4588595) loss:2.797 lr:0.0000100 epoch_Time:27405.0min: [2024-01-03 20:27:38,710][model8_pretrain.py][INFO] Epoch:[0/2](254300/4588595) loss:3.254 lr:0.0000100 epoch_Time:27405.0min: [2024-01-03 20:28:15,639][model8_pretrain.py][INFO] Epoch:[0/2](254400/4588595) loss:2.875 lr:0.0000100 epoch_Time:27403.0min: [2024-01-03 20:28:15,639][model8_pretrain.py][INFO] Epoch:[0/2](254400/4588595) loss:3.100 lr:0.0000100 epoch_Time:27403.0min: [2024-01-03 20:28:15,639][model8_pretrain.py][INFO] Epoch:[0/2](254400/4588595) loss:2.993 lr:0.0000100 epoch_Time:27403.0min: [2024-01-03 20:28:15,639][model8_pretrain.py][INFO] Epoch:[0/2](254400/4588595) loss:3.056 lr:0.0000100 epoch_Time:27403.0min: [2024-01-03 20:28:15,639][model8_pretrain.py][INFO] Epoch:[0/2](254400/4588595) loss:2.844 lr:0.0000100 epoch_Time:27403.0min: [2024-01-03 20:28:15,639][model8_pretrain.py][INFO] Epoch:[0/2](254400/4588595) loss:2.881 lr:0.0000100 epoch_Time:27403.0min: [2024-01-03 20:28:15,639][model8_pretrain.py][INFO] Epoch:[0/2](254400/4588595) loss:2.382 lr:0.0000100 epoch_Time:27403.0min: [2024-01-03 20:28:15,639][model8_pretrain.py][INFO] Epoch:[0/2](254400/4588595) loss:2.492 lr:0.0000100 
epoch_Time:27403.0min: [2024-01-03 20:29:01,037][model8_pretrain.py][INFO] Epoch:[0/2](254500/4588595) loss:2.908 lr:0.0000100 epoch_Time:27404.0min: [2024-01-03 20:29:01,037][model8_pretrain.py][INFO] Epoch:[0/2](254500/4588595) loss:3.451 lr:0.0000100 epoch_Time:27404.0min: [2024-01-03 20:29:01,037][model8_pretrain.py][INFO] Epoch:[0/2](254500/4588595) loss:2.813 lr:0.0000100 epoch_Time:27404.0min: [2024-01-03 20:29:01,037][model8_pretrain.py][INFO] Epoch:[0/2](254500/4588595) loss:3.047 lr:0.0000100 epoch_Time:27404.0min: [2024-01-03 20:29:01,037][model8_pretrain.py][INFO] Epoch:[0/2](254500/4588595) loss:2.493 lr:0.0000100 epoch_Time:27404.0min: [2024-01-03 20:29:01,037][model8_pretrain.py][INFO] Epoch:[0/2](254500/4588595) loss:2.940 lr:0.0000100 epoch_Time:27404.0min: [2024-01-03 20:29:01,037][model8_pretrain.py][INFO] Epoch:[0/2](254500/4588595) loss:3.034 lr:0.0000100 epoch_Time:27404.0min: [2024-01-03 20:29:01,038][model8_pretrain.py][INFO] Epoch:[0/2](254500/4588595) loss:2.669 lr:0.0000100 epoch_Time:27404.0min: [2024-01-03 20:29:37,989][model8_pretrain.py][INFO] Epoch:[0/2](254600/4588595) loss:3.010 lr:0.0000100 epoch_Time:27404.0min: [2024-01-03 20:29:37,989][model8_pretrain.py][INFO] Epoch:[0/2](254600/4588595) loss:2.893 lr:0.0000100 epoch_Time:27404.0min: [2024-01-03 20:29:37,989][model8_pretrain.py][INFO] Epoch:[0/2](254600/4588595) loss:2.968 lr:0.0000100 epoch_Time:27404.0min: [2024-01-03 20:29:37,989][model8_pretrain.py][INFO] Epoch:[0/2](254600/4588595) loss:2.668 lr:0.0000100 epoch_Time:27404.0min: [2024-01-03 20:29:37,989][model8_pretrain.py][INFO] Epoch:[0/2](254600/4588595) loss:3.390 lr:0.0000100 epoch_Time:27404.0min: [2024-01-03 20:29:37,989][model8_pretrain.py][INFO] Epoch:[0/2](254600/4588595) loss:2.779 lr:0.0000100 epoch_Time:27404.0min: [2024-01-03 20:29:37,989][model8_pretrain.py][INFO] Epoch:[0/2](254600/4588595) loss:2.832 lr:0.0000100 epoch_Time:27404.0min: [2024-01-03 20:29:37,989][model8_pretrain.py][INFO] 
Epoch:[0/2](254600/4588595) loss:3.401 lr:0.0000100 epoch_Time:27404.0min: [2024-01-03 20:30:14,935][model8_pretrain.py][INFO] Epoch:[0/2](254700/4588595) loss:2.791 lr:0.0000100 epoch_Time:27403.0min: [2024-01-03 20:30:14,935][model8_pretrain.py][INFO] Epoch:[0/2](254700/4588595) loss:2.675 lr:0.0000100 epoch_Time:27403.0min: [2024-01-03 20:30:14,935][model8_pretrain.py][INFO] Epoch:[0/2](254700/4588595) loss:3.284 lr:0.0000100 epoch_Time:27403.0min: [2024-01-03 20:30:14,935][model8_pretrain.py][INFO] Epoch:[0/2](254700/4588595) loss:2.472 lr:0.0000100 epoch_Time:27403.0min: [2024-01-03 20:30:14,935][model8_pretrain.py][INFO] Epoch:[0/2](254700/4588595) loss:3.190 lr:0.0000100 epoch_Time:27403.0min: [2024-01-03 20:30:14,935][model8_pretrain.py][INFO] Epoch:[0/2](254700/4588595) loss:2.544 lr:0.0000100 epoch_Time:27403.0min: [2024-01-03 20:30:14,935][model8_pretrain.py][INFO] Epoch:[0/2](254700/4588595) loss:2.595 lr:0.0000100 epoch_Time:27403.0min: [2024-01-03 20:30:14,935][model8_pretrain.py][INFO] Epoch:[0/2](254700/4588595) loss:2.993 lr:0.0000100 epoch_Time:27403.0min: [2024-01-03 20:30:51,879][model8_pretrain.py][INFO] Epoch:[0/2](254800/4588595) loss:3.144 lr:0.0000100 epoch_Time:27402.0min: [2024-01-03 20:30:51,879][model8_pretrain.py][INFO] Epoch:[0/2](254800/4588595) loss:2.788 lr:0.0000100 epoch_Time:27402.0min: [2024-01-03 20:30:51,879][model8_pretrain.py][INFO] Epoch:[0/2](254800/4588595) loss:2.608 lr:0.0000100 epoch_Time:27402.0min: [2024-01-03 20:30:51,879][model8_pretrain.py][INFO] Epoch:[0/2](254800/4588595) loss:3.130 lr:0.0000100 epoch_Time:27402.0min: [2024-01-03 20:30:51,879][model8_pretrain.py][INFO] Epoch:[0/2](254800/4588595) loss:2.649 lr:0.0000100 epoch_Time:27402.0min: [2024-01-03 20:30:51,879][model8_pretrain.py][INFO] Epoch:[0/2](254800/4588595) loss:3.107 lr:0.0000100 epoch_Time:27402.0min: [2024-01-03 20:30:51,879][model8_pretrain.py][INFO] Epoch:[0/2](254800/4588595) loss:3.034 lr:0.0000100 epoch_Time:27402.0min: [2024-01-03 
20:30:51,879][model8_pretrain.py][INFO] Epoch:[0/2](254800/4588595) loss:2.745 lr:0.0000100 epoch_Time:27402.0min: [2024-01-03 20:31:28,823][model8_pretrain.py][INFO] Epoch:[0/2](254900/4588595) loss:3.332 lr:0.0000100 epoch_Time:27401.0min: [2024-01-03 20:31:28,823][model8_pretrain.py][INFO] Epoch:[0/2](254900/4588595) loss:2.878 lr:0.0000100 epoch_Time:27401.0min: [2024-01-03 20:31:28,823][model8_pretrain.py][INFO] Epoch:[0/2](254900/4588595) loss:2.691 lr:0.0000100 epoch_Time:27401.0min: [2024-01-03 20:31:28,823][model8_pretrain.py][INFO] Epoch:[0/2](254900/4588595) loss:2.393 lr:0.0000100 epoch_Time:27401.0min: [2024-01-03 20:31:28,823][model8_pretrain.py][INFO] Epoch:[0/2](254900/4588595) loss:2.709 lr:0.0000100 epoch_Time:27401.0min: [2024-01-03 20:31:28,823][model8_pretrain.py][INFO] Epoch:[0/2](254900/4588595) loss:2.682 lr:0.0000100 epoch_Time:27401.0min: [2024-01-03 20:31:28,823][model8_pretrain.py][INFO] Epoch:[0/2](254900/4588595) loss:2.665 lr:0.0000100 epoch_Time:27401.0min: [2024-01-03 20:31:28,823][model8_pretrain.py][INFO] Epoch:[0/2](254900/4588595) loss:2.696 lr:0.0000100 epoch_Time:27401.0min: [2024-01-03 20:32:05,766][model8_pretrain.py][INFO] Epoch:[0/2](255000/4588595) loss:3.511 lr:0.0000100 epoch_Time:27400.0min: [2024-01-03 20:32:05,766][model8_pretrain.py][INFO] Epoch:[0/2](255000/4588595) loss:3.033 lr:0.0000100 epoch_Time:27400.0min: [2024-01-03 20:32:05,766][model8_pretrain.py][INFO] Epoch:[0/2](255000/4588595) loss:2.349 lr:0.0000100 epoch_Time:27400.0min: [2024-01-03 20:32:05,766][model8_pretrain.py][INFO] Epoch:[0/2](255000/4588595) loss:3.023 lr:0.0000100 epoch_Time:27400.0min: [2024-01-03 20:32:05,766][model8_pretrain.py][INFO] Epoch:[0/2](255000/4588595) loss:3.132 lr:0.0000100 epoch_Time:27400.0min: [2024-01-03 20:32:05,766][model8_pretrain.py][INFO] Epoch:[0/2](255000/4588595) loss:3.180 lr:0.0000100 epoch_Time:27400.0min: [2024-01-03 20:32:05,767][model8_pretrain.py][INFO] Epoch:[0/2](255000/4588595) loss:2.539 lr:0.0000100 
epoch_Time:27400.0min: [2024-01-03 20:32:05,767][model8_pretrain.py][INFO] Epoch:[0/2](255000/4588595) loss:2.400 lr:0.0000100 epoch_Time:27400.0min: [2024-01-03 20:32:42,718][model8_pretrain.py][INFO] Epoch:[0/2](255100/4588595) loss:2.846 lr:0.0000100 epoch_Time:27400.0min: [2024-01-03 20:32:42,718][model8_pretrain.py][INFO] Epoch:[0/2](255100/4588595) loss:3.219 lr:0.0000100 epoch_Time:27400.0min: [2024-01-03 20:32:42,718][model8_pretrain.py][INFO] Epoch:[0/2](255100/4588595) loss:2.857 lr:0.0000100 epoch_Time:27400.0min: [2024-01-03 20:32:42,718][model8_pretrain.py][INFO] Epoch:[0/2](255100/4588595) loss:2.631 lr:0.0000100 epoch_Time:27400.0min: [2024-01-03 20:32:42,718][model8_pretrain.py][INFO] Epoch:[0/2](255100/4588595) loss:3.300 lr:0.0000100 epoch_Time:27400.0min: [2024-01-03 20:32:42,718][model8_pretrain.py][INFO] Epoch:[0/2](255100/4588595) loss:2.878 lr:0.0000100 epoch_Time:27400.0min: [2024-01-03 20:32:42,718][model8_pretrain.py][INFO] Epoch:[0/2](255100/4588595) loss:3.322 lr:0.0000100 epoch_Time:27400.0min: [2024-01-03 20:32:42,718][model8_pretrain.py][INFO] Epoch:[0/2](255100/4588595) loss:2.818 lr:0.0000100 epoch_Time:27400.0min: [2024-01-03 20:33:19,661][model8_pretrain.py][INFO] Epoch:[0/2](255200/4588595) loss:2.880 lr:0.0000100 epoch_Time:27398.0min: [2024-01-03 20:33:19,661][model8_pretrain.py][INFO] Epoch:[0/2](255200/4588595) loss:2.960 lr:0.0000100 epoch_Time:27398.0min: [2024-01-03 20:33:19,661][model8_pretrain.py][INFO] Epoch:[0/2](255200/4588595) loss:2.598 lr:0.0000100 epoch_Time:27398.0min: [2024-01-03 20:33:19,661][model8_pretrain.py][INFO] Epoch:[0/2](255200/4588595) loss:3.002 lr:0.0000100 epoch_Time:27398.0min: [2024-01-03 20:33:19,661][model8_pretrain.py][INFO] Epoch:[0/2](255200/4588595) loss:3.486 lr:0.0000100 epoch_Time:27398.0min: [2024-01-03 20:33:19,661][model8_pretrain.py][INFO] Epoch:[0/2](255200/4588595) loss:2.664 lr:0.0000100 epoch_Time:27398.0min: [2024-01-03 20:33:19,661][model8_pretrain.py][INFO] 
Epoch:[0/2](255200/4588595) loss:3.207 lr:0.0000100 epoch_Time:27398.0min: [2024-01-03 20:33:19,661][model8_pretrain.py][INFO] Epoch:[0/2](255200/4588595) loss:3.050 lr:0.0000100 epoch_Time:27398.0min: [2024-01-03 20:34:05,157][model8_pretrain.py][INFO] Epoch:[0/2](255300/4588595) loss:3.125 lr:0.0000100 epoch_Time:27400.0min: [2024-01-03 20:34:05,158][model8_pretrain.py][INFO] Epoch:[0/2](255300/4588595) loss:3.232 lr:0.0000100 epoch_Time:27400.0min: [2024-01-03 20:34:05,158][model8_pretrain.py][INFO] Epoch:[0/2](255300/4588595) loss:2.927 lr:0.0000100 epoch_Time:27400.0min: [2024-01-03 20:34:05,158][model8_pretrain.py][INFO] Epoch:[0/2](255300/4588595) loss:3.194 lr:0.0000100 epoch_Time:27400.0min: [2024-01-03 20:34:05,158][model8_pretrain.py][INFO] Epoch:[0/2](255300/4588595) loss:3.015 lr:0.0000100 epoch_Time:27400.0min: [2024-01-03 20:34:05,158][model8_pretrain.py][INFO] Epoch:[0/2](255300/4588595) loss:3.129 lr:0.0000100 epoch_Time:27400.0min: [2024-01-03 20:34:05,158][model8_pretrain.py][INFO] Epoch:[0/2](255300/4588595) loss:3.015 lr:0.0000100 epoch_Time:27400.0min: [2024-01-03 20:34:05,158][model8_pretrain.py][INFO] Epoch:[0/2](255300/4588595) loss:3.198 lr:0.0000100 epoch_Time:27400.0min: [2024-01-03 20:34:42,100][model8_pretrain.py][INFO] Epoch:[0/2](255400/4588595) loss:3.316 lr:0.0000100 epoch_Time:27399.0min: [2024-01-03 20:34:42,100][model8_pretrain.py][INFO] Epoch:[0/2](255400/4588595) loss:2.680 lr:0.0000100 epoch_Time:27399.0min: [2024-01-03 20:34:42,100][model8_pretrain.py][INFO] Epoch:[0/2](255400/4588595) loss:3.035 lr:0.0000100 epoch_Time:27399.0min: [2024-01-03 20:34:42,100][model8_pretrain.py][INFO] Epoch:[0/2](255400/4588595) loss:2.561 lr:0.0000100 epoch_Time:27399.0min: [2024-01-03 20:34:42,100][model8_pretrain.py][INFO] Epoch:[0/2](255400/4588595) loss:2.854 lr:0.0000100 epoch_Time:27399.0min: [2024-01-03 20:34:42,100][model8_pretrain.py][INFO] Epoch:[0/2](255400/4588595) loss:2.847 lr:0.0000100 epoch_Time:27399.0min: [2024-01-03 
20:34:42,100][model8_pretrain.py][INFO] Epoch:[0/2](255400/4588595) loss:2.718 lr:0.0000100 epoch_Time:27399.0min: [2024-01-03 20:34:42,101][model8_pretrain.py][INFO] Epoch:[0/2](255400/4588595) loss:2.946 lr:0.0000100 epoch_Time:27399.0min: [2024-01-03 20:35:19,041][model8_pretrain.py][INFO] Epoch:[0/2](255500/4588595) loss:2.410 lr:0.0000100 epoch_Time:27398.0min: [2024-01-03 20:35:19,041][model8_pretrain.py][INFO] Epoch:[0/2](255500/4588595) loss:3.157 lr:0.0000100 epoch_Time:27398.0min: [2024-01-03 20:35:19,041][model8_pretrain.py][INFO] Epoch:[0/2](255500/4588595) loss:3.063 lr:0.0000100 epoch_Time:27398.0min: [2024-01-03 20:35:19,041][model8_pretrain.py][INFO] Epoch:[0/2](255500/4588595) loss:2.309 lr:0.0000100 epoch_Time:27398.0min: [2024-01-03 20:35:19,041][model8_pretrain.py][INFO] Epoch:[0/2](255500/4588595) loss:3.016 lr:0.0000100 epoch_Time:27398.0min: [2024-01-03 20:35:19,041][model8_pretrain.py][INFO] Epoch:[0/2](255500/4588595) loss:3.063 lr:0.0000100 epoch_Time:27398.0min: [2024-01-03 20:35:19,041][model8_pretrain.py][INFO] Epoch:[0/2](255500/4588595) loss:2.159 lr:0.0000100 epoch_Time:27398.0min: [2024-01-03 20:35:19,041][model8_pretrain.py][INFO] Epoch:[0/2](255500/4588595) loss:3.240 lr:0.0000100 epoch_Time:27398.0min: [2024-01-03 20:35:55,965][model8_pretrain.py][INFO] Epoch:[0/2](255600/4588595) loss:3.597 lr:0.0000100 epoch_Time:27397.0min: [2024-01-03 20:35:55,965][model8_pretrain.py][INFO] Epoch:[0/2](255600/4588595) loss:2.961 lr:0.0000100 epoch_Time:27397.0min: [2024-01-03 20:35:55,965][model8_pretrain.py][INFO] Epoch:[0/2](255600/4588595) loss:3.191 lr:0.0000100 epoch_Time:27397.0min: [2024-01-03 20:35:55,965][model8_pretrain.py][INFO] Epoch:[0/2](255600/4588595) loss:3.604 lr:0.0000100 epoch_Time:27397.0min: [2024-01-03 20:35:55,965][model8_pretrain.py][INFO] Epoch:[0/2](255600/4588595) loss:2.593 lr:0.0000100 epoch_Time:27397.0min: [2024-01-03 20:35:55,965][model8_pretrain.py][INFO] Epoch:[0/2](255600/4588595) loss:3.171 lr:0.0000100 
epoch_Time:27397.0min: [2024-01-03 20:35:55,965][model8_pretrain.py][INFO] Epoch:[0/2](255600/4588595) loss:2.934 lr:0.0000100 epoch_Time:27397.0min: [2024-01-03 20:35:55,965][model8_pretrain.py][INFO] Epoch:[0/2](255600/4588595) loss:3.197 lr:0.0000100 epoch_Time:27397.0min: [2024-01-03 20:36:32,892][model8_pretrain.py][INFO] Epoch:[0/2](255700/4588595) loss:3.250 lr:0.0000100 epoch_Time:27396.0min: [2024-01-03 20:36:32,892][model8_pretrain.py][INFO] Epoch:[0/2](255700/4588595) loss:2.616 lr:0.0000100 epoch_Time:27396.0min: [2024-01-03 20:36:32,892][model8_pretrain.py][INFO] Epoch:[0/2](255700/4588595) loss:2.727 lr:0.0000100 epoch_Time:27396.0min: [2024-01-03 20:36:32,892][model8_pretrain.py][INFO] Epoch:[0/2](255700/4588595) loss:3.110 lr:0.0000100 epoch_Time:27396.0min: [2024-01-03 20:36:32,892][model8_pretrain.py][INFO] Epoch:[0/2](255700/4588595) loss:3.008 lr:0.0000100 epoch_Time:27396.0min: [2024-01-03 20:36:32,892][model8_pretrain.py][INFO] Epoch:[0/2](255700/4588595) loss:2.669 lr:0.0000100 epoch_Time:27396.0min: [2024-01-03 20:36:32,893][model8_pretrain.py][INFO] Epoch:[0/2](255700/4588595) loss:3.180 lr:0.0000100 epoch_Time:27396.0min: [2024-01-03 20:36:32,893][model8_pretrain.py][INFO] Epoch:[0/2](255700/4588595) loss:2.434 lr:0.0000100 epoch_Time:27396.0min: [2024-01-03 20:37:09,826][model8_pretrain.py][INFO] Epoch:[0/2](255800/4588595) loss:3.264 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:37:09,826][model8_pretrain.py][INFO] Epoch:[0/2](255800/4588595) loss:2.938 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:37:09,826][model8_pretrain.py][INFO] Epoch:[0/2](255800/4588595) loss:2.931 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:37:09,826][model8_pretrain.py][INFO] Epoch:[0/2](255800/4588595) loss:2.524 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:37:09,826][model8_pretrain.py][INFO] Epoch:[0/2](255800/4588595) loss:2.612 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:37:09,826][model8_pretrain.py][INFO] 
Epoch:[0/2](255800/4588595) loss:2.883 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:37:09,826][model8_pretrain.py][INFO] Epoch:[0/2](255800/4588595) loss:3.236 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:37:09,826][model8_pretrain.py][INFO] Epoch:[0/2](255800/4588595) loss:2.848 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:37:46,759][model8_pretrain.py][INFO] Epoch:[0/2](255900/4588595) loss:3.190 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:37:46,759][model8_pretrain.py][INFO] Epoch:[0/2](255900/4588595) loss:2.701 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:37:46,759][model8_pretrain.py][INFO] Epoch:[0/2](255900/4588595) loss:2.958 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:37:46,759][model8_pretrain.py][INFO] Epoch:[0/2](255900/4588595) loss:2.355 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:37:46,759][model8_pretrain.py][INFO] Epoch:[0/2](255900/4588595) loss:2.971 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:37:46,759][model8_pretrain.py][INFO] Epoch:[0/2](255900/4588595) loss:2.369 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:37:46,760][model8_pretrain.py][INFO] Epoch:[0/2](255900/4588595) loss:2.833 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:37:46,760][model8_pretrain.py][INFO] Epoch:[0/2](255900/4588595) loss:2.755 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:38:23,688][model8_pretrain.py][INFO] Epoch:[0/2](256000/4588595) loss:2.538 lr:0.0000100 epoch_Time:27394.0min: [2024-01-03 20:38:23,688][model8_pretrain.py][INFO] Epoch:[0/2](256000/4588595) loss:2.798 lr:0.0000100 epoch_Time:27394.0min: [2024-01-03 20:38:23,688][model8_pretrain.py][INFO] Epoch:[0/2](256000/4588595) loss:3.523 lr:0.0000100 epoch_Time:27394.0min: [2024-01-03 20:38:23,688][model8_pretrain.py][INFO] Epoch:[0/2](256000/4588595) loss:3.088 lr:0.0000100 epoch_Time:27394.0min: [2024-01-03 20:38:23,688][model8_pretrain.py][INFO] Epoch:[0/2](256000/4588595) loss:3.199 lr:0.0000100 epoch_Time:27394.0min: [2024-01-03 
20:38:23,688][model8_pretrain.py][INFO] Epoch:[0/2](256000/4588595) loss:2.912 lr:0.0000100 epoch_Time:27394.0min: [2024-01-03 20:38:23,688][model8_pretrain.py][INFO] Epoch:[0/2](256000/4588595) loss:3.214 lr:0.0000100 epoch_Time:27394.0min: [2024-01-03 20:38:23,688][model8_pretrain.py][INFO] Epoch:[0/2](256000/4588595) loss:2.538 lr:0.0000100 epoch_Time:27394.0min: [2024-01-03 20:39:09,290][model8_pretrain.py][INFO] Epoch:[0/2](256100/4588595) loss:2.843 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:39:09,290][model8_pretrain.py][INFO] Epoch:[0/2](256100/4588595) loss:2.931 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:39:09,290][model8_pretrain.py][INFO] Epoch:[0/2](256100/4588595) loss:2.674 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:39:09,290][model8_pretrain.py][INFO] Epoch:[0/2](256100/4588595) loss:2.838 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:39:09,290][model8_pretrain.py][INFO] Epoch:[0/2](256100/4588595) loss:3.216 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:39:09,290][model8_pretrain.py][INFO] Epoch:[0/2](256100/4588595) loss:2.780 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:39:09,291][model8_pretrain.py][INFO] Epoch:[0/2](256100/4588595) loss:3.326 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:39:09,291][model8_pretrain.py][INFO] Epoch:[0/2](256100/4588595) loss:2.867 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:39:46,229][model8_pretrain.py][INFO] Epoch:[0/2](256200/4588595) loss:2.907 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:39:46,229][model8_pretrain.py][INFO] Epoch:[0/2](256200/4588595) loss:2.721 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:39:46,229][model8_pretrain.py][INFO] Epoch:[0/2](256200/4588595) loss:2.977 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:39:46,229][model8_pretrain.py][INFO] Epoch:[0/2](256200/4588595) loss:2.695 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:39:46,229][model8_pretrain.py][INFO] Epoch:[0/2](256200/4588595) loss:2.880 lr:0.0000100 
epoch_Time:27395.0min: [2024-01-03 20:39:46,229][model8_pretrain.py][INFO] Epoch:[0/2](256200/4588595) loss:2.434 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:39:46,229][model8_pretrain.py][INFO] Epoch:[0/2](256200/4588595) loss:2.621 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:39:46,230][model8_pretrain.py][INFO] Epoch:[0/2](256200/4588595) loss:3.024 lr:0.0000100 epoch_Time:27395.0min: [2024-01-03 20:40:23,173][model8_pretrain.py][INFO] Epoch:[0/2](256300/4588595) loss:2.656 lr:0.0000100 epoch_Time:27393.0min: [2024-01-03 20:40:23,174][model8_pretrain.py][INFO] Epoch:[0/2](256300/4588595) loss:3.083 lr:0.0000100 epoch_Time:27393.0min: [2024-01-03 20:40:23,174][model8_pretrain.py][INFO] Epoch:[0/2](256300/4588595) loss:3.306 lr:0.0000100 epoch_Time:27393.0min: [2024-01-03 20:40:23,174][model8_pretrain.py][INFO] Epoch:[0/2](256300/4588595) loss:3.438 lr:0.0000100 epoch_Time:27393.0min: [2024-01-03 20:40:23,173][model8_pretrain.py][INFO] Epoch:[0/2](256300/4588595) loss:2.463 lr:0.0000100 epoch_Time:27393.0min: [2024-01-03 20:40:23,174][model8_pretrain.py][INFO] Epoch:[0/2](256300/4588595) loss:2.907 lr:0.0000100 epoch_Time:27393.0min: [2024-01-03 20:40:23,174][model8_pretrain.py][INFO] Epoch:[0/2](256300/4588595) loss:2.913 lr:0.0000100 epoch_Time:27393.0min: [2024-01-03 20:40:23,174][model8_pretrain.py][INFO] Epoch:[0/2](256300/4588595) loss:2.777 lr:0.0000100 epoch_Time:27393.0min: [2024-01-03 20:41:00,103][model8_pretrain.py][INFO] Epoch:[0/2](256400/4588595) loss:2.998 lr:0.0000100 epoch_Time:27392.0min: [2024-01-03 20:41:00,103][model8_pretrain.py][INFO] Epoch:[0/2](256400/4588595) loss:3.025 lr:0.0000100 epoch_Time:27392.0min: [2024-01-03 20:41:00,103][model8_pretrain.py][INFO] Epoch:[0/2](256400/4588595) loss:3.277 lr:0.0000100 epoch_Time:27392.0min: [2024-01-03 20:41:00,103][model8_pretrain.py][INFO] Epoch:[0/2](256400/4588595) loss:2.896 lr:0.0000100 epoch_Time:27392.0min: [2024-01-03 20:41:00,103][model8_pretrain.py][INFO] 
Epoch:[0/2](256400/4588595) loss:3.080 lr:0.0000100 epoch_Time:27392.0min: [2024-01-03 20:41:00,103][model8_pretrain.py][INFO] Epoch:[0/2](256400/4588595) loss:2.527 lr:0.0000100 epoch_Time:27392.0min: [2024-01-03 20:41:00,103][model8_pretrain.py][INFO] Epoch:[0/2](256400/4588595) loss:3.029 lr:0.0000100 epoch_Time:27392.0min: [2024-01-03 20:41:00,103][model8_pretrain.py][INFO] Epoch:[0/2](256400/4588595) loss:3.182 lr:0.0000100 epoch_Time:27392.0min: [2024-01-03 20:41:37,035][model8_pretrain.py][INFO] Epoch:[0/2](256500/4588595) loss:2.678 lr:0.0000100 epoch_Time:27392.0min: [2024-01-03 20:41:37,035][model8_pretrain.py][INFO] Epoch:[0/2](256500/4588595) loss:2.608 lr:0.0000100 epoch_Time:27392.0min: [2024-01-03 20:41:37,035][model8_pretrain.py][INFO] Epoch:[0/2](256500/4588595) loss:3.203 lr:0.0000100 epoch_Time:27392.0min: [2024-01-03 20:41:37,035][model8_pretrain.py][INFO] Epoch:[0/2](256500/4588595) loss:2.959 lr:0.0000100 epoch_Time:27392.0min: [2024-01-03 20:41:37,035][model8_pretrain.py][INFO] Epoch:[0/2](256500/4588595) loss:2.521 lr:0.0000100 epoch_Time:27392.0min: [2024-01-03 20:41:37,035][model8_pretrain.py][INFO] Epoch:[0/2](256500/4588595) loss:3.010 lr:0.0000100 epoch_Time:27392.0min: [2024-01-03 20:41:37,035][model8_pretrain.py][INFO] Epoch:[0/2](256500/4588595) loss:2.991 lr:0.0000100 epoch_Time:27392.0min: [2024-01-03 20:41:37,035][model8_pretrain.py][INFO] Epoch:[0/2](256500/4588595) loss:2.454 lr:0.0000100 epoch_Time:27392.0min: [2024-01-03 20:42:13,965][model8_pretrain.py][INFO] Epoch:[0/2](256600/4588595) loss:3.022 lr:0.0000100 epoch_Time:27390.0min: [2024-01-03 20:42:13,965][model8_pretrain.py][INFO] Epoch:[0/2](256600/4588595) loss:2.795 lr:0.0000100 epoch_Time:27390.0min: [2024-01-03 20:42:13,965][model8_pretrain.py][INFO] Epoch:[0/2](256600/4588595) loss:2.964 lr:0.0000100 epoch_Time:27390.0min: [2024-01-03 20:42:13,965][model8_pretrain.py][INFO] Epoch:[0/2](256600/4588595) loss:2.624 lr:0.0000100 epoch_Time:27390.0min: [2024-01-03 
20:42:13,965][model8_pretrain.py][INFO] Epoch:[0/2](256600/4588595) loss:2.850 lr:0.0000100 epoch_Time:27390.0min: [2024-01-03 20:42:13,965][model8_pretrain.py][INFO] Epoch:[0/2](256600/4588595) loss:3.093 lr:0.0000100 epoch_Time:27390.0min: [2024-01-03 20:42:13,965][model8_pretrain.py][INFO] Epoch:[0/2](256600/4588595) loss:2.873 lr:0.0000100 epoch_Time:27390.0min: [2024-01-03 20:42:13,966][model8_pretrain.py][INFO] Epoch:[0/2](256600/4588595) loss:2.967 lr:0.0000100 epoch_Time:27390.0min: [2024-01-03 20:42:50,904][model8_pretrain.py][INFO] Epoch:[0/2](256700/4588595) loss:2.771 lr:0.0000100 epoch_Time:27389.0min: [2024-01-03 20:42:50,904][model8_pretrain.py][INFO] Epoch:[0/2](256700/4588595) loss:2.542 lr:0.0000100 epoch_Time:27389.0min: [2024-01-03 20:42:50,904][model8_pretrain.py][INFO] Epoch:[0/2](256700/4588595) loss:2.986 lr:0.0000100 epoch_Time:27389.0min: [2024-01-03 20:42:50,904][model8_pretrain.py][INFO] Epoch:[0/2](256700/4588595) loss:2.980 lr:0.0000100 epoch_Time:27389.0min: [2024-01-03 20:42:50,904][model8_pretrain.py][INFO] Epoch:[0/2](256700/4588595) loss:3.471 lr:0.0000100 epoch_Time:27389.0min: [2024-01-03 20:42:50,904][model8_pretrain.py][INFO] Epoch:[0/2](256700/4588595) loss:3.238 lr:0.0000100 epoch_Time:27389.0min: [2024-01-03 20:42:50,904][model8_pretrain.py][INFO] Epoch:[0/2](256700/4588595) loss:3.091 lr:0.0000100 epoch_Time:27389.0min: [2024-01-03 20:42:50,905][model8_pretrain.py][INFO] Epoch:[0/2](256700/4588595) loss:3.079 lr:0.0000100 epoch_Time:27389.0min: [2024-01-03 20:43:27,859][model8_pretrain.py][INFO] Epoch:[0/2](256800/4588595) loss:2.821 lr:0.0000100 epoch_Time:27389.0min: [2024-01-03 20:43:27,859][model8_pretrain.py][INFO] Epoch:[0/2](256800/4588595) loss:2.925 lr:0.0000100 epoch_Time:27389.0min: [2024-01-03 20:43:27,859][model8_pretrain.py][INFO] Epoch:[0/2](256800/4588595) loss:3.394 lr:0.0000100 epoch_Time:27389.0min: [2024-01-03 20:43:27,859][model8_pretrain.py][INFO] Epoch:[0/2](256800/4588595) loss:3.427 lr:0.0000100 
epoch_Time:27389.0min: [2024-01-03 20:43:27,859][model8_pretrain.py][INFO] Epoch:[0/2](256800/4588595) loss:3.077 lr:0.0000100 epoch_Time:27389.0min: [2024-01-03 20:43:27,860][model8_pretrain.py][INFO] Epoch:[0/2](256800/4588595) loss:3.110 lr:0.0000100 epoch_Time:27389.0min: [2024-01-03 20:43:27,860][model8_pretrain.py][INFO] Epoch:[0/2](256800/4588595) loss:2.668 lr:0.0000100 epoch_Time:27389.0min: [2024-01-03 20:43:27,860][model8_pretrain.py][INFO] Epoch:[0/2](256800/4588595) loss:2.613 lr:0.0000100 epoch_Time:27389.0min: [2024-01-03 20:44:13,502][model8_pretrain.py][INFO] Epoch:[0/2](256900/4588595) loss:1.962 lr:0.0000100 epoch_Time:27390.0min: [2024-01-03 20:44:13,502][model8_pretrain.py][INFO] Epoch:[0/2](256900/4588595) loss:2.997 lr:0.0000100 epoch_Time:27390.0min: [2024-01-03 20:44:13,502][model8_pretrain.py][INFO] Epoch:[0/2](256900/4588595) loss:3.136 lr:0.0000100 epoch_Time:27390.0min: [2024-01-03 20:44:13,502][model8_pretrain.py][INFO] Epoch:[0/2](256900/4588595) loss:3.322 lr:0.0000100 epoch_Time:27390.0min: [2024-01-03 20:44:13,502][model8_pretrain.py][INFO] Epoch:[0/2](256900/4588595) loss:2.526 lr:0.0000100 epoch_Time:27390.0min: [2024-01-03 20:44:13,503][model8_pretrain.py][INFO] Epoch:[0/2](256900/4588595) loss:2.862 lr:0.0000100 epoch_Time:27390.0min: [2024-01-03 20:44:13,503][model8_pretrain.py][INFO] Epoch:[0/2](256900/4588595) loss:3.313 lr:0.0000100 epoch_Time:27390.0min: [2024-01-03 20:44:13,503][model8_pretrain.py][INFO] Epoch:[0/2](256900/4588595) loss:3.001 lr:0.0000100 epoch_Time:27390.0min: [2024-01-03 20:44:50,450][model8_pretrain.py][INFO] Epoch:[0/2](257000/4588595) loss:2.949 lr:0.0000100 epoch_Time:27389.0min: [2024-01-03 20:44:50,450][model8_pretrain.py][INFO] Epoch:[0/2](257000/4588595) loss:2.858 lr:0.0000100 epoch_Time:27389.0min: [2024-01-03 20:44:50,450][model8_pretrain.py][INFO] Epoch:[0/2](257000/4588595) loss:3.227 lr:0.0000100 epoch_Time:27389.0min: [2024-01-03 20:44:50,450][model8_pretrain.py][INFO] 
Epoch:[0/2](257000/4588595) loss:2.986 lr:0.0000100 epoch_Time:27389.0min: [2024-01-03 20:44:50,450][model8_pretrain.py][INFO] Epoch:[0/2](257000/4588595) loss:2.716 lr:0.0000100 epoch_Time:27389.0min: [2024-01-03 20:44:50,450][model8_pretrain.py][INFO] Epoch:[0/2](257000/4588595) loss:2.724 lr:0.0000100 epoch_Time:27389.0min: [2024-01-03 20:44:50,450][model8_pretrain.py][INFO] Epoch:[0/2](257000/4588595) loss:3.130 lr:0.0000100 epoch_Time:27389.0min: [2024-01-03 20:44:50,451][model8_pretrain.py][INFO] Epoch:[0/2](257000/4588595) loss:2.877 lr:0.0000100 epoch_Time:27389.0min: [2024-01-03 20:45:27,391][model8_pretrain.py][INFO] Epoch:[0/2](257100/4588595) loss:2.619 lr:0.0000100 epoch_Time:27388.0min: [2024-01-03 20:45:27,391][model8_pretrain.py][INFO] Epoch:[0/2](257100/4588595) loss:2.976 lr:0.0000100 epoch_Time:27388.0min: [2024-01-03 20:45:27,391][model8_pretrain.py][INFO] Epoch:[0/2](257100/4588595) loss:2.828 lr:0.0000100 epoch_Time:27388.0min: [2024-01-03 20:45:27,391][model8_pretrain.py][INFO] Epoch:[0/2](257100/4588595) loss:2.641 lr:0.0000100 epoch_Time:27388.0min: [2024-01-03 20:45:27,391][model8_pretrain.py][INFO] Epoch:[0/2](257100/4588595) loss:2.730 lr:0.0000100 epoch_Time:27388.0min: [2024-01-03 20:45:27,391][model8_pretrain.py][INFO] Epoch:[0/2](257100/4588595) loss:3.359 lr:0.0000100 epoch_Time:27388.0min: [2024-01-03 20:45:27,391][model8_pretrain.py][INFO] Epoch:[0/2](257100/4588595) loss:3.201 lr:0.0000100 epoch_Time:27388.0min: [2024-01-03 20:45:27,391][model8_pretrain.py][INFO] Epoch:[0/2](257100/4588595) loss:3.021 lr:0.0000100 epoch_Time:27388.0min: [2024-01-03 20:46:04,330][model8_pretrain.py][INFO] Epoch:[0/2](257200/4588595) loss:2.555 lr:0.0000100 epoch_Time:27387.0min: [2024-01-03 20:46:04,330][model8_pretrain.py][INFO] Epoch:[0/2](257200/4588595) loss:2.870 lr:0.0000100 epoch_Time:27387.0min: [2024-01-03 20:46:04,330][model8_pretrain.py][INFO] Epoch:[0/2](257200/4588595) loss:3.212 lr:0.0000100 epoch_Time:27387.0min: [2024-01-03 
20:46:04,330][model8_pretrain.py][INFO] Epoch:[0/2](257200/4588595) loss:3.041 lr:0.0000100 epoch_Time:27387.0min: [2024-01-03 20:46:04,330][model8_pretrain.py][INFO] Epoch:[0/2](257200/4588595) loss:3.272 lr:0.0000100 epoch_Time:27387.0min: [2024-01-03 20:46:04,330][model8_pretrain.py][INFO] Epoch:[0/2](257200/4588595) loss:3.198 lr:0.0000100 epoch_Time:27387.0min: [2024-01-03 20:46:04,330][model8_pretrain.py][INFO] Epoch:[0/2](257200/4588595) loss:2.731 lr:0.0000100 epoch_Time:27387.0min: [2024-01-03 20:46:04,330][model8_pretrain.py][INFO] Epoch:[0/2](257200/4588595) loss:2.616 lr:0.0000100 epoch_Time:27387.0min: [2024-01-03 20:46:41,277][model8_pretrain.py][INFO] Epoch:[0/2](257300/4588595) loss:2.969 lr:0.0000100 epoch_Time:27387.0min: [2024-01-03 20:46:41,277][model8_pretrain.py][INFO] Epoch:[0/2](257300/4588595) loss:3.114 lr:0.0000100 epoch_Time:27387.0min: [2024-01-03 20:46:41,277][model8_pretrain.py][INFO] Epoch:[0/2](257300/4588595) loss:2.958 lr:0.0000100 epoch_Time:27387.0min: [2024-01-03 20:46:41,277][model8_pretrain.py][INFO] Epoch:[0/2](257300/4588595) loss:2.876 lr:0.0000100 epoch_Time:27387.0min: [2024-01-03 20:46:41,277][model8_pretrain.py][INFO] Epoch:[0/2](257300/4588595) loss:2.958 lr:0.0000100 epoch_Time:27387.0min: [2024-01-03 20:46:41,277][model8_pretrain.py][INFO] Epoch:[0/2](257300/4588595) loss:3.321 lr:0.0000100 epoch_Time:27387.0min: [2024-01-03 20:46:41,278][model8_pretrain.py][INFO] Epoch:[0/2](257300/4588595) loss:3.055 lr:0.0000100 epoch_Time:27387.0min: [2024-01-03 20:46:41,279][model8_pretrain.py][INFO] Epoch:[0/2](257300/4588595) loss:2.829 lr:0.0000100 epoch_Time:27387.0min: [2024-01-03 20:47:18,212][model8_pretrain.py][INFO] Epoch:[0/2](257400/4588595) loss:2.607 lr:0.0000100 epoch_Time:27386.0min: [2024-01-03 20:47:18,212][model8_pretrain.py][INFO] Epoch:[0/2](257400/4588595) loss:3.144 lr:0.0000100 epoch_Time:27386.0min: [2024-01-03 20:47:18,212][model8_pretrain.py][INFO] Epoch:[0/2](257400/4588595) loss:2.877 lr:0.0000100 
epoch_Time:27386.0min: [2024-01-03 20:47:18,212][model8_pretrain.py][INFO] Epoch:[0/2](257400/4588595) loss:2.487 lr:0.0000100 epoch_Time:27386.0min: [2024-01-03 20:47:18,212][model8_pretrain.py][INFO] Epoch:[0/2](257400/4588595) loss:3.471 lr:0.0000100 epoch_Time:27386.0min: [2024-01-03 20:47:18,212][model8_pretrain.py][INFO] Epoch:[0/2](257400/4588595) loss:2.680 lr:0.0000100 epoch_Time:27386.0min: [2024-01-03 20:47:18,213][model8_pretrain.py][INFO] Epoch:[0/2](257400/4588595) loss:3.399 lr:0.0000100 epoch_Time:27386.0min: [2024-01-03 20:47:18,213][model8_pretrain.py][INFO] Epoch:[0/2](257400/4588595) loss:3.342 lr:0.0000100 epoch_Time:27386.0min: [2024-01-03 20:47:55,150][model8_pretrain.py][INFO] Epoch:[0/2](257500/4588595) loss:3.415 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:47:55,150][model8_pretrain.py][INFO] Epoch:[0/2](257500/4588595) loss:3.017 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:47:55,150][model8_pretrain.py][INFO] Epoch:[0/2](257500/4588595) loss:3.012 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:47:55,150][model8_pretrain.py][INFO] Epoch:[0/2](257500/4588595) loss:2.415 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:47:55,150][model8_pretrain.py][INFO] Epoch:[0/2](257500/4588595) loss:3.147 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:47:55,150][model8_pretrain.py][INFO] Epoch:[0/2](257500/4588595) loss:2.607 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:47:55,150][model8_pretrain.py][INFO] Epoch:[0/2](257500/4588595) loss:2.700 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:47:55,151][model8_pretrain.py][INFO] Epoch:[0/2](257500/4588595) loss:2.619 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:48:32,090][model8_pretrain.py][INFO] Epoch:[0/2](257600/4588595) loss:2.793 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:48:32,090][model8_pretrain.py][INFO] Epoch:[0/2](257600/4588595) loss:3.036 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:48:32,090][model8_pretrain.py][INFO] 
Epoch:[0/2](257600/4588595) loss:2.643 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:48:32,090][model8_pretrain.py][INFO] Epoch:[0/2](257600/4588595) loss:3.209 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:48:32,090][model8_pretrain.py][INFO] Epoch:[0/2](257600/4588595) loss:3.040 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:48:32,091][model8_pretrain.py][INFO] Epoch:[0/2](257600/4588595) loss:2.848 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:48:32,091][model8_pretrain.py][INFO] Epoch:[0/2](257600/4588595) loss:2.796 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:48:32,091][model8_pretrain.py][INFO] Epoch:[0/2](257600/4588595) loss:3.120 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:49:17,737][model8_pretrain.py][INFO] Epoch:[0/2](257700/4588595) loss:3.374 lr:0.0000100 epoch_Time:27385.0min: [2024-01-03 20:49:17,737][model8_pretrain.py][INFO] Epoch:[0/2](257700/4588595) loss:3.292 lr:0.0000100 epoch_Time:27385.0min: [2024-01-03 20:49:17,737][model8_pretrain.py][INFO] Epoch:[0/2](257700/4588595) loss:2.672 lr:0.0000100 epoch_Time:27385.0min: [2024-01-03 20:49:17,737][model8_pretrain.py][INFO] Epoch:[0/2](257700/4588595) loss:2.824 lr:0.0000100 epoch_Time:27385.0min: [2024-01-03 20:49:17,737][model8_pretrain.py][INFO] Epoch:[0/2](257700/4588595) loss:2.616 lr:0.0000100 epoch_Time:27385.0min: [2024-01-03 20:49:17,737][model8_pretrain.py][INFO] Epoch:[0/2](257700/4588595) loss:2.737 lr:0.0000100 epoch_Time:27385.0min: [2024-01-03 20:49:17,737][model8_pretrain.py][INFO] Epoch:[0/2](257700/4588595) loss:3.503 lr:0.0000100 epoch_Time:27385.0min: [2024-01-03 20:49:17,737][model8_pretrain.py][INFO] Epoch:[0/2](257700/4588595) loss:2.802 lr:0.0000100 epoch_Time:27385.0min: [2024-01-03 20:49:54,670][model8_pretrain.py][INFO] Epoch:[0/2](257800/4588595) loss:2.792 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:49:54,670][model8_pretrain.py][INFO] Epoch:[0/2](257800/4588595) loss:3.044 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 
20:49:54,670][model8_pretrain.py][INFO] Epoch:[0/2](257800/4588595) loss:3.147 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:49:54,670][model8_pretrain.py][INFO] Epoch:[0/2](257800/4588595) loss:3.141 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:49:54,670][model8_pretrain.py][INFO] Epoch:[0/2](257800/4588595) loss:3.334 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:49:54,670][model8_pretrain.py][INFO] Epoch:[0/2](257800/4588595) loss:2.741 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:49:54,670][model8_pretrain.py][INFO] Epoch:[0/2](257800/4588595) loss:3.071 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:49:54,670][model8_pretrain.py][INFO] Epoch:[0/2](257800/4588595) loss:3.235 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:50:31,617][model8_pretrain.py][INFO] Epoch:[0/2](257900/4588595) loss:2.791 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:50:31,617][model8_pretrain.py][INFO] Epoch:[0/2](257900/4588595) loss:2.727 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:50:31,617][model8_pretrain.py][INFO] Epoch:[0/2](257900/4588595) loss:2.544 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:50:31,617][model8_pretrain.py][INFO] Epoch:[0/2](257900/4588595) loss:2.576 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:50:31,617][model8_pretrain.py][INFO] Epoch:[0/2](257900/4588595) loss:3.351 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:50:31,617][model8_pretrain.py][INFO] Epoch:[0/2](257900/4588595) loss:3.328 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:50:31,617][model8_pretrain.py][INFO] Epoch:[0/2](257900/4588595) loss:2.954 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:50:31,617][model8_pretrain.py][INFO] Epoch:[0/2](257900/4588595) loss:2.768 lr:0.0000100 epoch_Time:27384.0min: [2024-01-03 20:51:08,551][model8_pretrain.py][INFO] Epoch:[0/2](258000/4588595) loss:2.846 lr:0.0000100 epoch_Time:27382.0min: [2024-01-03 20:51:08,551][model8_pretrain.py][INFO] Epoch:[0/2](258000/4588595) loss:2.611 lr:0.0000100 
epoch_Time:27382.0min: [2024-01-03 20:51:08,551][model8_pretrain.py][INFO] Epoch:[0/2](258000/4588595) loss:3.024 lr:0.0000100 epoch_Time:27382.0min: [2024-01-03 20:51:08,551][model8_pretrain.py][INFO] Epoch:[0/2](258000/4588595) loss:2.380 lr:0.0000100 epoch_Time:27382.0min: [2024-01-03 20:51:08,551][model8_pretrain.py][INFO] Epoch:[0/2](258000/4588595) loss:2.623 lr:0.0000100 epoch_Time:27382.0min: [2024-01-03 20:51:08,551][model8_pretrain.py][INFO] Epoch:[0/2](258000/4588595) loss:3.134 lr:0.0000100 epoch_Time:27382.0min: [2024-01-03 20:51:08,552][model8_pretrain.py][INFO] Epoch:[0/2](258000/4588595) loss:2.838 lr:0.0000100 epoch_Time:27382.0min: [2024-01-03 20:51:08,552][model8_pretrain.py][INFO] Epoch:[0/2](258000/4588595) loss:3.104 lr:0.0000100 epoch_Time:27382.0min: [2024-01-03 20:51:45,490][model8_pretrain.py][INFO] Epoch:[0/2](258100/4588595) loss:2.990 lr:0.0000100 epoch_Time:27382.0min: [2024-01-03 20:51:45,490][model8_pretrain.py][INFO] Epoch:[0/2](258100/4588595) loss:2.930 lr:0.0000100 epoch_Time:27382.0min: [2024-01-03 20:51:45,490][model8_pretrain.py][INFO] Epoch:[0/2](258100/4588595) loss:3.004 lr:0.0000100 epoch_Time:27382.0min: [2024-01-03 20:51:45,490][model8_pretrain.py][INFO] Epoch:[0/2](258100/4588595) loss:3.366 lr:0.0000100 epoch_Time:27382.0min: [2024-01-03 20:51:45,490][model8_pretrain.py][INFO] Epoch:[0/2](258100/4588595) loss:2.899 lr:0.0000100 epoch_Time:27382.0min: [2024-01-03 20:51:45,490][model8_pretrain.py][INFO] Epoch:[0/2](258100/4588595) loss:3.132 lr:0.0000100 epoch_Time:27382.0min: [2024-01-03 20:51:45,490][model8_pretrain.py][INFO] Epoch:[0/2](258100/4588595) loss:3.417 lr:0.0000100 epoch_Time:27382.0min: [2024-01-03 20:51:45,490][model8_pretrain.py][INFO] Epoch:[0/2](258100/4588595) loss:2.745 lr:0.0000100 epoch_Time:27382.0min: [2024-01-03 20:52:22,431][model8_pretrain.py][INFO] Epoch:[0/2](258200/4588595) loss:2.658 lr:0.0000100 epoch_Time:27381.0min: [2024-01-03 20:52:22,431][model8_pretrain.py][INFO] 
Epoch:[0/2](258200/4588595) loss:2.644 lr:0.0000100 epoch_Time:27381.0min: [2024-01-03 20:52:22,431][model8_pretrain.py][INFO] Epoch:[0/2](258200/4588595) loss:2.646 lr:0.0000100 epoch_Time:27381.0min: [2024-01-03 20:52:22,431][model8_pretrain.py][INFO] Epoch:[0/2](258200/4588595) loss:2.334 lr:0.0000100 epoch_Time:27381.0min: [2024-01-03 20:52:22,431][model8_pretrain.py][INFO] Epoch:[0/2](258200/4588595) loss:2.873 lr:0.0000100 epoch_Time:27381.0min: [2024-01-03 20:52:22,431][model8_pretrain.py][INFO] Epoch:[0/2](258200/4588595) loss:3.080 lr:0.0000100 epoch_Time:27381.0min: [2024-01-03 20:52:22,432][model8_pretrain.py][INFO] Epoch:[0/2](258200/4588595) loss:2.938 lr:0.0000100 epoch_Time:27381.0min: [2024-01-03 20:52:22,432][model8_pretrain.py][INFO] Epoch:[0/2](258200/4588595) loss:2.551 lr:0.0000100 epoch_Time:27381.0min: [2024-01-03 20:52:59,371][model8_pretrain.py][INFO] Epoch:[0/2](258300/4588595) loss:3.009 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:52:59,371][model8_pretrain.py][INFO] Epoch:[0/2](258300/4588595) loss:2.383 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:52:59,371][model8_pretrain.py][INFO] Epoch:[0/2](258300/4588595) loss:3.127 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:52:59,371][model8_pretrain.py][INFO] Epoch:[0/2](258300/4588595) loss:2.074 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:52:59,371][model8_pretrain.py][INFO] Epoch:[0/2](258300/4588595) loss:2.612 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:52:59,371][model8_pretrain.py][INFO] Epoch:[0/2](258300/4588595) loss:3.180 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:52:59,371][model8_pretrain.py][INFO] Epoch:[0/2](258300/4588595) loss:3.372 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:52:59,371][model8_pretrain.py][INFO] Epoch:[0/2](258300/4588595) loss:3.127 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:53:36,302][model8_pretrain.py][INFO] Epoch:[0/2](258400/4588595) loss:3.001 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 
20:53:36,302][model8_pretrain.py][INFO] Epoch:[0/2](258400/4588595) loss:3.403 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:53:36,302][model8_pretrain.py][INFO] Epoch:[0/2](258400/4588595) loss:2.532 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:53:36,302][model8_pretrain.py][INFO] Epoch:[0/2](258400/4588595) loss:3.560 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:53:36,302][model8_pretrain.py][INFO] Epoch:[0/2](258400/4588595) loss:3.150 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:53:36,302][model8_pretrain.py][INFO] Epoch:[0/2](258400/4588595) loss:2.388 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:53:36,302][model8_pretrain.py][INFO] Epoch:[0/2](258400/4588595) loss:2.962 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:53:36,313][model8_pretrain.py][INFO] Epoch:[0/2](258400/4588595) loss:2.849 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:54:21,959][model8_pretrain.py][INFO] Epoch:[0/2](258500/4588595) loss:3.017 lr:0.0000100 epoch_Time:27380.0min: [2024-01-03 20:54:21,959][model8_pretrain.py][INFO] Epoch:[0/2](258500/4588595) loss:2.492 lr:0.0000100 epoch_Time:27380.0min: [2024-01-03 20:54:21,959][model8_pretrain.py][INFO] Epoch:[0/2](258500/4588595) loss:3.277 lr:0.0000100 epoch_Time:27380.0min: [2024-01-03 20:54:21,959][model8_pretrain.py][INFO] Epoch:[0/2](258500/4588595) loss:2.989 lr:0.0000100 epoch_Time:27380.0min: [2024-01-03 20:54:21,959][model8_pretrain.py][INFO] Epoch:[0/2](258500/4588595) loss:2.625 lr:0.0000100 epoch_Time:27380.0min: [2024-01-03 20:54:21,959][model8_pretrain.py][INFO] Epoch:[0/2](258500/4588595) loss:2.607 lr:0.0000100 epoch_Time:27380.0min: [2024-01-03 20:54:21,959][model8_pretrain.py][INFO] Epoch:[0/2](258500/4588595) loss:2.823 lr:0.0000100 epoch_Time:27380.0min: [2024-01-03 20:54:21,959][model8_pretrain.py][INFO] Epoch:[0/2](258500/4588595) loss:2.801 lr:0.0000100 epoch_Time:27380.0min: [2024-01-03 20:54:58,903][model8_pretrain.py][INFO] Epoch:[0/2](258600/4588595) loss:2.633 lr:0.0000100 
epoch_Time:27379.0min: [2024-01-03 20:54:58,903][model8_pretrain.py][INFO] Epoch:[0/2](258600/4588595) loss:3.206 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:54:58,903][model8_pretrain.py][INFO] Epoch:[0/2](258600/4588595) loss:2.859 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:54:58,903][model8_pretrain.py][INFO] Epoch:[0/2](258600/4588595) loss:3.029 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:54:58,903][model8_pretrain.py][INFO] Epoch:[0/2](258600/4588595) loss:3.083 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:54:58,903][model8_pretrain.py][INFO] Epoch:[0/2](258600/4588595) loss:3.079 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:54:58,903][model8_pretrain.py][INFO] Epoch:[0/2](258600/4588595) loss:2.377 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:54:58,903][model8_pretrain.py][INFO] Epoch:[0/2](258600/4588595) loss:3.009 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:55:35,847][model8_pretrain.py][INFO] Epoch:[0/2](258700/4588595) loss:3.249 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:55:35,847][model8_pretrain.py][INFO] Epoch:[0/2](258700/4588595) loss:2.777 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:55:35,847][model8_pretrain.py][INFO] Epoch:[0/2](258700/4588595) loss:2.949 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:55:35,847][model8_pretrain.py][INFO] Epoch:[0/2](258700/4588595) loss:3.228 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:55:35,847][model8_pretrain.py][INFO] Epoch:[0/2](258700/4588595) loss:2.755 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:55:35,847][model8_pretrain.py][INFO] Epoch:[0/2](258700/4588595) loss:2.731 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:55:35,847][model8_pretrain.py][INFO] Epoch:[0/2](258700/4588595) loss:2.996 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:55:35,848][model8_pretrain.py][INFO] Epoch:[0/2](258700/4588595) loss:2.847 lr:0.0000100 epoch_Time:27379.0min: [2024-01-03 20:56:12,806][model8_pretrain.py][INFO] 
Epoch:[0/2](258800/4588595) loss:3.157 lr:0.0000100 epoch_Time:27378.0min: [2024-01-03 20:56:12,806][model8_pretrain.py][INFO] Epoch:[0/2](258800/4588595) loss:2.996 lr:0.0000100 epoch_Time:27378.0min: [2024-01-03 20:56:12,806][model8_pretrain.py][INFO] Epoch:[0/2](258800/4588595) loss:2.915 lr:0.0000100 epoch_Time:27378.0min: [2024-01-03 20:56:12,806][model8_pretrain.py][INFO] Epoch:[0/2](258800/4588595) loss:2.815 lr:0.0000100 epoch_Time:27378.0min: [2024-01-03 20:56:12,806][model8_pretrain.py][INFO] Epoch:[0/2](258800/4588595) loss:3.079 lr:0.0000100 epoch_Time:27378.0min: [2024-01-03 20:56:12,806][model8_pretrain.py][INFO] Epoch:[0/2](258800/4588595) loss:2.896 lr:0.0000100 epoch_Time:27378.0min: [2024-01-03 20:56:12,807][model8_pretrain.py][INFO] Epoch:[0/2](258800/4588595) loss:2.985 lr:0.0000100 epoch_Time:27378.0min: [2024-01-03 20:56:12,807][model8_pretrain.py][INFO] Epoch:[0/2](258800/4588595) loss:2.906 lr:0.0000100 epoch_Time:27378.0min: [2024-01-03 20:56:49,749][model8_pretrain.py][INFO] Epoch:[0/2](258900/4588595) loss:2.593 lr:0.0000100 epoch_Time:27376.0min: [2024-01-03 20:56:49,749][model8_pretrain.py][INFO] Epoch:[0/2](258900/4588595) loss:2.252 lr:0.0000100 epoch_Time:27376.0min: [2024-01-03 20:56:49,749][model8_pretrain.py][INFO] Epoch:[0/2](258900/4588595) loss:3.007 lr:0.0000100 epoch_Time:27376.0min: [2024-01-03 20:56:49,749][model8_pretrain.py][INFO] Epoch:[0/2](258900/4588595) loss:3.033 lr:0.0000100 epoch_Time:27376.0min: [2024-01-03 20:56:49,749][model8_pretrain.py][INFO] Epoch:[0/2](258900/4588595) loss:3.292 lr:0.0000100 epoch_Time:27376.0min: [2024-01-03 20:56:49,749][model8_pretrain.py][INFO] Epoch:[0/2](258900/4588595) loss:3.101 lr:0.0000100 epoch_Time:27376.0min: [2024-01-03 20:56:49,749][model8_pretrain.py][INFO] Epoch:[0/2](258900/4588595) loss:3.049 lr:0.0000100 epoch_Time:27376.0min: [2024-01-03 20:56:49,749][model8_pretrain.py][INFO] Epoch:[0/2](258900/4588595) loss:2.522 lr:0.0000100 epoch_Time:27376.0min: [2024-01-03 
20:57:26,700][model8_pretrain.py][INFO] Epoch:[0/2](259000/4588595) loss:3.027 lr:0.0000100 epoch_Time:27376.0min: [2024-01-03 20:57:26,700][model8_pretrain.py][INFO] Epoch:[0/2](259000/4588595) loss:2.535 lr:0.0000100 epoch_Time:27376.0min: [2024-01-03 20:57:26,700][model8_pretrain.py][INFO] Epoch:[0/2](259000/4588595) loss:3.234 lr:0.0000100 epoch_Time:27376.0min: [2024-01-03 20:57:26,700][model8_pretrain.py][INFO] Epoch:[0/2](259000/4588595) loss:2.827 lr:0.0000100 epoch_Time:27376.0min: [2024-01-03 20:57:26,700][model8_pretrain.py][INFO] Epoch:[0/2](259000/4588595) loss:3.011 lr:0.0000100 epoch_Time:27376.0min: [2024-01-03 20:57:26,700][model8_pretrain.py][INFO] Epoch:[0/2](259000/4588595) loss:3.008 lr:0.0000100 epoch_Time:27376.0min: [2024-01-03 20:57:26,700][model8_pretrain.py][INFO] Epoch:[0/2](259000/4588595) loss:2.851 lr:0.0000100 epoch_Time:27376.0min: [2024-01-03 20:57:26,700][model8_pretrain.py][INFO] Epoch:[0/2](259000/4588595) loss:3.111 lr:0.0000100 epoch_Time:27376.0min: [2024-01-03 20:58:03,643][model8_pretrain.py][INFO] Epoch:[0/2](259100/4588595) loss:3.027 lr:0.0000100 epoch_Time:27375.0min: [2024-01-03 20:58:03,643][model8_pretrain.py][INFO] Epoch:[0/2](259100/4588595) loss:2.445 lr:0.0000100 epoch_Time:27375.0min: [2024-01-03 20:58:03,643][model8_pretrain.py][INFO] Epoch:[0/2](259100/4588595) loss:2.879 lr:0.0000100 epoch_Time:27375.0min: [2024-01-03 20:58:03,643][model8_pretrain.py][INFO] Epoch:[0/2](259100/4588595) loss:2.900 lr:0.0000100 epoch_Time:27375.0min: [2024-01-03 20:58:03,643][model8_pretrain.py][INFO] Epoch:[0/2](259100/4588595) loss:2.742 lr:0.0000100 epoch_Time:27375.0min: [2024-01-03 20:58:03,643][model8_pretrain.py][INFO] Epoch:[0/2](259100/4588595) loss:2.951 lr:0.0000100 epoch_Time:27375.0min: [2024-01-03 20:58:03,643][model8_pretrain.py][INFO] Epoch:[0/2](259100/4588595) loss:2.526 lr:0.0000100 epoch_Time:27375.0min: [2024-01-03 20:58:03,643][model8_pretrain.py][INFO] Epoch:[0/2](259100/4588595) loss:2.577 lr:0.0000100 
epoch_Time:27375.0min: [2024-01-03 20:58:40,584][model8_pretrain.py][INFO] Epoch:[0/2](259200/4588595) loss:2.731 lr:0.0000100 epoch_Time:27374.0min: [2024-01-03 20:58:40,584][model8_pretrain.py][INFO] Epoch:[0/2](259200/4588595) loss:2.957 lr:0.0000100 epoch_Time:27374.0min: [2024-01-03 20:58:40,584][model8_pretrain.py][INFO] Epoch:[0/2](259200/4588595) loss:2.897 lr:0.0000100 epoch_Time:27374.0min: [2024-01-03 20:58:40,584][model8_pretrain.py][INFO] Epoch:[0/2](259200/4588595) loss:2.873 lr:0.0000100 epoch_Time:27374.0min: [2024-01-03 20:58:40,584][model8_pretrain.py][INFO] Epoch:[0/2](259200/4588595) loss:2.615 lr:0.0000100 epoch_Time:27374.0min: [2024-01-03 20:58:40,584][model8_pretrain.py][INFO] Epoch:[0/2](259200/4588595) loss:2.714 lr:0.0000100 epoch_Time:27374.0min: [2024-01-03 20:58:40,584][model8_pretrain.py][INFO] Epoch:[0/2](259200/4588595) loss:2.941 lr:0.0000100 epoch_Time:27374.0min: [2024-01-03 20:58:40,584][model8_pretrain.py][INFO] Epoch:[0/2](259200/4588595) loss:2.716 lr:0.0000100 epoch_Time:27374.0min: [2024-01-03 20:59:26,221][model8_pretrain.py][INFO] Epoch:[0/2](259300/4588595) loss:2.271 lr:0.0000100 epoch_Time:27376.0min: [2024-01-03 20:59:26,221][model8_pretrain.py][INFO] Epoch:[0/2](259300/4588595) loss:3.374 lr:0.0000100 epoch_Time:27376.0min: [2024-01-03 20:59:26,221][model8_pretrain.py][INFO] Epoch:[0/2](259300/4588595) loss:2.339 lr:0.0000100 epoch_Time:27376.0min: [2024-01-03 20:59:26,221][model8_pretrain.py][INFO] Epoch:[0/2](259300/4588595) loss:2.932 lr:0.0000100 epoch_Time:27376.0min: [2024-01-03 20:59:26,221][model8_pretrain.py][INFO] Epoch:[0/2](259300/4588595) loss:2.547 lr:0.0000100 epoch_Time:27376.0min: [2024-01-03 20:59:26,221][model8_pretrain.py][INFO] Epoch:[0/2](259300/4588595) loss:3.042 lr:0.0000100 epoch_Time:27376.0min: [2024-01-03 20:59:26,221][model8_pretrain.py][INFO] Epoch:[0/2](259300/4588595) loss:2.443 lr:0.0000100 epoch_Time:27376.0min: [2024-01-03 20:59:26,221][model8_pretrain.py][INFO] 
Epoch:[0/2](259300/4588595) loss:3.042 lr:0.0000100 epoch_Time:27376.0min: [2024-01-03 21:00:03,157][model8_pretrain.py][INFO] Epoch:[0/2](259400/4588595) loss:2.621 lr:0.0000100 epoch_Time:27374.0min: [2024-01-03 21:00:03,157][model8_pretrain.py][INFO] Epoch:[0/2](259400/4588595) loss:2.702 lr:0.0000100 epoch_Time:27374.0min: [2024-01-03 21:00:03,157][model8_pretrain.py][INFO] Epoch:[0/2](259400/4588595) loss:2.855 lr:0.0000100 epoch_Time:27374.0min: [2024-01-03 21:00:03,157][model8_pretrain.py][INFO] Epoch:[0/2](259400/4588595) loss:2.365 lr:0.0000100 epoch_Time:27374.0min: [2024-01-03 21:00:03,157][model8_pretrain.py][INFO] Epoch:[0/2](259400/4588595) loss:2.833 lr:0.0000100 epoch_Time:27374.0min: [2024-01-03 21:00:03,157][model8_pretrain.py][INFO] Epoch:[0/2](259400/4588595) loss:3.099 lr:0.0000100 epoch_Time:27374.0min: [2024-01-03 21:00:03,157][model8_pretrain.py][INFO] Epoch:[0/2](259400/4588595) loss:2.964 lr:0.0000100 epoch_Time:27374.0min: [2024-01-03 21:00:03,157][model8_pretrain.py][INFO] Epoch:[0/2](259400/4588595) loss:2.641 lr:0.0000100 epoch_Time:27374.0min: [2024-01-03 21:00:40,066][model8_pretrain.py][INFO] Epoch:[0/2](259500/4588595) loss:2.513 lr:0.0000100 epoch_Time:27374.0min: [2024-01-03 21:00:40,066][model8_pretrain.py][INFO] Epoch:[0/2](259500/4588595) loss:3.073 lr:0.0000100 epoch_Time:27374.0min: [2024-01-03 21:00:40,066][model8_pretrain.py][INFO] Epoch:[0/2](259500/4588595) loss:2.987 lr:0.0000100 epoch_Time:27374.0min: [2024-01-03 21:00:40,066][model8_pretrain.py][INFO] Epoch:[0/2](259500/4588595) loss:3.118 lr:0.0000100 epoch_Time:27374.0min: [2024-01-03 21:00:40,066][model8_pretrain.py][INFO] Epoch:[0/2](259500/4588595) loss:3.208 lr:0.0000100 epoch_Time:27374.0min: [2024-01-03 21:00:40,066][model8_pretrain.py][INFO] Epoch:[0/2](259500/4588595) loss:3.250 lr:0.0000100 epoch_Time:27374.0min: [2024-01-03 21:00:40,066][model8_pretrain.py][INFO] Epoch:[0/2](259500/4588595) loss:2.681 lr:0.0000100 epoch_Time:27374.0min: [2024-01-03 
21:00:40,066][model8_pretrain.py][INFO] Epoch:[0/2](259500/4588595) loss:3.113 lr:0.0000100 epoch_Time:27374.0min: [2024-01-03 21:01:16,999][model8_pretrain.py][INFO] Epoch:[0/2](259600/4588595) loss:3.337 lr:0.0000100 epoch_Time:27373.0min: [2024-01-03 21:01:16,999][model8_pretrain.py][INFO] Epoch:[0/2](259600/4588595) loss:2.945 lr:0.0000100 epoch_Time:27373.0min: [2024-01-03 21:01:16,999][model8_pretrain.py][INFO] Epoch:[0/2](259600/4588595) loss:2.859 lr:0.0000100 epoch_Time:27373.0min: [2024-01-03 21:01:16,999][model8_pretrain.py][INFO] Epoch:[0/2](259600/4588595) loss:2.262 lr:0.0000100 epoch_Time:27373.0min: [2024-01-03 21:01:16,999][model8_pretrain.py][INFO] Epoch:[0/2](259600/4588595) loss:3.193 lr:0.0000100 epoch_Time:27373.0min: [2024-01-03 21:01:16,999][model8_pretrain.py][INFO] Epoch:[0/2](259600/4588595) loss:2.672 lr:0.0000100 epoch_Time:27373.0min: [2024-01-03 21:01:17,000][model8_pretrain.py][INFO] Epoch:[0/2](259600/4588595) loss:3.328 lr:0.0000100 epoch_Time:27373.0min: [2024-01-03 21:01:17,000][model8_pretrain.py][INFO] Epoch:[0/2](259600/4588595) loss:3.109 lr:0.0000100 epoch_Time:27373.0min: [2024-01-03 21:01:53,928][model8_pretrain.py][INFO] Epoch:[0/2](259700/4588595) loss:2.702 lr:0.0000100 epoch_Time:27371.0min: [2024-01-03 21:01:53,928][model8_pretrain.py][INFO] Epoch:[0/2](259700/4588595) loss:3.159 lr:0.0000100 epoch_Time:27371.0min: [2024-01-03 21:01:53,928][model8_pretrain.py][INFO] Epoch:[0/2](259700/4588595) loss:3.151 lr:0.0000100 epoch_Time:27371.0min: [2024-01-03 21:01:53,928][model8_pretrain.py][INFO] Epoch:[0/2](259700/4588595) loss:2.799 lr:0.0000100 epoch_Time:27371.0min: [2024-01-03 21:01:53,928][model8_pretrain.py][INFO] Epoch:[0/2](259700/4588595) loss:2.491 lr:0.0000100 epoch_Time:27371.0min: [2024-01-03 21:01:53,928][model8_pretrain.py][INFO] Epoch:[0/2](259700/4588595) loss:3.197 lr:0.0000100 epoch_Time:27371.0min: [2024-01-03 21:01:53,928][model8_pretrain.py][INFO] Epoch:[0/2](259700/4588595) loss:3.294 lr:0.0000100 
epoch_Time:27371.0min: [2024-01-03 21:01:53,928][model8_pretrain.py][INFO] Epoch:[0/2](259700/4588595) loss:3.591 lr:0.0000100 epoch_Time:27371.0min: [2024-01-03 21:02:30,862][model8_pretrain.py][INFO] Epoch:[0/2](259800/4588595) loss:3.226 lr:0.0000100 epoch_Time:27371.0min: [2024-01-03 21:02:30,862][model8_pretrain.py][INFO] Epoch:[0/2](259800/4588595) loss:2.952 lr:0.0000100 epoch_Time:27371.0min: [2024-01-03 21:02:30,862][model8_pretrain.py][INFO] Epoch:[0/2](259800/4588595) loss:3.039 lr:0.0000100 epoch_Time:27371.0min: [2024-01-03 21:02:30,862][model8_pretrain.py][INFO] Epoch:[0/2](259800/4588595) loss:3.138 lr:0.0000100 epoch_Time:27371.0min: [2024-01-03 21:02:30,862][model8_pretrain.py][INFO] Epoch:[0/2](259800/4588595) loss:2.941 lr:0.0000100 epoch_Time:27371.0min: [2024-01-03 21:02:30,862][model8_pretrain.py][INFO] Epoch:[0/2](259800/4588595) loss:2.429 lr:0.0000100 epoch_Time:27371.0min: [2024-01-03 21:02:30,862][model8_pretrain.py][INFO] Epoch:[0/2](259800/4588595) loss:2.908 lr:0.0000100 epoch_Time:27371.0min: [2024-01-03 21:02:30,862][model8_pretrain.py][INFO] Epoch:[0/2](259800/4588595) loss:2.811 lr:0.0000100 epoch_Time:27371.0min: [2024-01-03 21:03:07,789][model8_pretrain.py][INFO] Epoch:[0/2](259900/4588595) loss:2.847 lr:0.0000100 epoch_Time:27370.0min: [2024-01-03 21:03:07,789][model8_pretrain.py][INFO] Epoch:[0/2](259900/4588595) loss:2.527 lr:0.0000100 epoch_Time:27370.0min: [2024-01-03 21:03:07,789][model8_pretrain.py][INFO] Epoch:[0/2](259900/4588595) loss:2.812 lr:0.0000100 epoch_Time:27370.0min: [2024-01-03 21:03:07,789][model8_pretrain.py][INFO] Epoch:[0/2](259900/4588595) loss:3.289 lr:0.0000100 epoch_Time:27370.0min: [2024-01-03 21:03:07,789][model8_pretrain.py][INFO] Epoch:[0/2](259900/4588595) loss:2.577 lr:0.0000100 epoch_Time:27370.0min: [2024-01-03 21:03:07,789][model8_pretrain.py][INFO] Epoch:[0/2](259900/4588595) loss:2.473 lr:0.0000100 epoch_Time:27370.0min: [2024-01-03 21:03:07,789][model8_pretrain.py][INFO] 
Epoch:[0/2](259900/4588595) loss:3.073 lr:0.0000100 epoch_Time:27370.0min: [2024-01-03 21:03:07,790][model8_pretrain.py][INFO] Epoch:[0/2](259900/4588595) loss:3.301 lr:0.0000100 epoch_Time:27370.0min: [2024-01-03 21:03:44,719][model8_pretrain.py][INFO] Epoch:[0/2](260000/4588595) loss:3.364 lr:0.0000100 epoch_Time:27370.0min: [2024-01-03 21:03:44,719][model8_pretrain.py][INFO] Epoch:[0/2](260000/4588595) loss:2.645 lr:0.0000100 epoch_Time:27370.0min: [2024-01-03 21:03:44,719][model8_pretrain.py][INFO] Epoch:[0/2](260000/4588595) loss:3.062 lr:0.0000100 epoch_Time:27370.0min: [2024-01-03 21:03:44,719][model8_pretrain.py][INFO] Epoch:[0/2](260000/4588595) loss:3.255 lr:0.0000100 epoch_Time:27370.0min: [2024-01-03 21:03:44,719][model8_pretrain.py][INFO] Epoch:[0/2](260000/4588595) loss:3.258 lr:0.0000100 epoch_Time:27370.0min: [2024-01-03 21:03:44,719][model8_pretrain.py][INFO] Epoch:[0/2](260000/4588595) loss:2.970 lr:0.0000100 epoch_Time:27370.0min: [2024-01-03 21:03:44,719][model8_pretrain.py][INFO] Epoch:[0/2](260000/4588595) loss:2.762 lr:0.0000100 epoch_Time:27370.0min: [2024-01-03 21:03:44,719][model8_pretrain.py][INFO] Epoch:[0/2](260000/4588595) loss:2.497 lr:0.0000100 epoch_Time:27370.0min: [2024-01-03 21:04:30,407][model8_pretrain.py][INFO] Epoch:[0/2](260100/4588595) loss:2.659 lr:0.0000100 epoch_Time:27371.0min: [2024-01-03 21:04:30,407][model8_pretrain.py][INFO] Epoch:[0/2](260100/4588595) loss:2.814 lr:0.0000100 epoch_Time:27371.0min: [2024-01-03 21:04:30,407][model8_pretrain.py][INFO] Epoch:[0/2](260100/4588595) loss:2.883 lr:0.0000100 epoch_Time:27371.0min: [2024-01-03 21:04:30,407][model8_pretrain.py][INFO] Epoch:[0/2](260100/4588595) loss:3.035 lr:0.0000100 epoch_Time:27371.0min: [2024-01-03 21:04:30,407][model8_pretrain.py][INFO] Epoch:[0/2](260100/4588595) loss:3.294 lr:0.0000100 epoch_Time:27371.0min: [2024-01-03 21:04:30,407][model8_pretrain.py][INFO] Epoch:[0/2](260100/4588595) loss:2.167 lr:0.0000100 epoch_Time:27371.0min: [2024-01-03 
21:04:30,407][model8_pretrain.py][INFO] Epoch:[0/2](260100/4588595) loss:2.793 lr:0.0000100 epoch_Time:27371.0min: [2024-01-03 21:04:30,407][model8_pretrain.py][INFO] Epoch:[0/2](260100/4588595) loss:2.769 lr:0.0000100 epoch_Time:27371.0min: [2024-01-03 21:05:07,318][model8_pretrain.py][INFO] Epoch:[0/2](260200/4588595) loss:2.646 lr:0.0000100 epoch_Time:27370.0min: [2024-01-03 21:05:07,319][model8_pretrain.py][INFO] Epoch:[0/2](260200/4588595) loss:3.255 lr:0.0000100 epoch_Time:27370.0min: [2024-01-03 21:05:07,319][model8_pretrain.py][INFO] Epoch:[0/2](260200/4588595) loss:2.262 lr:0.0000100 epoch_Time:27370.0min: [2024-01-03 21:05:07,319][model8_pretrain.py][INFO] Epoch:[0/2](260200/4588595) loss:3.157 lr:0.0000100 epoch_Time:27370.0min: [2024-01-03 21:05:07,319][model8_pretrain.py][INFO] Epoch:[0/2](260200/4588595) loss:2.978 lr:0.0000100 epoch_Time:27370.0min: [2024-01-03 21:05:07,319][model8_pretrain.py][INFO] Epoch:[0/2](260200/4588595) loss:3.272 lr:0.0000100 epoch_Time:27370.0min: [2024-01-03 21:05:07,319][model8_pretrain.py][INFO] Epoch:[0/2](260200/4588595) loss:2.800 lr:0.0000100 epoch_Time:27370.0min: [2024-01-03 21:05:07,319][model8_pretrain.py][INFO] Epoch:[0/2](260200/4588595) loss:3.043 lr:0.0000100 epoch_Time:27370.0min: [2024-01-03 21:05:44,261][model8_pretrain.py][INFO] Epoch:[0/2](260300/4588595) loss:3.153 lr:0.0000100 epoch_Time:27369.0min: [2024-01-03 21:05:44,261][model8_pretrain.py][INFO] Epoch:[0/2](260300/4588595) loss:2.914 lr:0.0000100 epoch_Time:27369.0min: [2024-01-03 21:05:44,261][model8_pretrain.py][INFO] Epoch:[0/2](260300/4588595) loss:2.508 lr:0.0000100 epoch_Time:27369.0min: [2024-01-03 21:05:44,261][model8_pretrain.py][INFO] Epoch:[0/2](260300/4588595) loss:3.044 lr:0.0000100 epoch_Time:27369.0min: [2024-01-03 21:05:44,261][model8_pretrain.py][INFO] Epoch:[0/2](260300/4588595) loss:2.830 lr:0.0000100 epoch_Time:27369.0min: [2024-01-03 21:05:44,261][model8_pretrain.py][INFO] Epoch:[0/2](260300/4588595) loss:2.480 lr:0.0000100 
epoch_Time:27369.0min: [2024-01-03 21:05:44,261][model8_pretrain.py][INFO] Epoch:[0/2](260300/4588595) loss:3.348 lr:0.0000100 epoch_Time:27369.0min: [2024-01-03 21:05:44,261][model8_pretrain.py][INFO] Epoch:[0/2](260300/4588595) loss:3.391 lr:0.0000100 epoch_Time:27369.0min: [2024-01-03 21:06:21,208][model8_pretrain.py][INFO] Epoch:[0/2](260400/4588595) loss:2.767 lr:0.0000100 epoch_Time:27368.0min: [2024-01-03 21:06:21,208][model8_pretrain.py][INFO] Epoch:[0/2](260400/4588595) loss:2.679 lr:0.0000100 epoch_Time:27368.0min: [2024-01-03 21:06:21,208][model8_pretrain.py][INFO] Epoch:[0/2](260400/4588595) loss:3.102 lr:0.0000100 epoch_Time:27368.0min: [2024-01-03 21:06:21,208][model8_pretrain.py][INFO] Epoch:[0/2](260400/4588595) loss:2.377 lr:0.0000100 epoch_Time:27368.0min: [2024-01-03 21:06:21,208][model8_pretrain.py][INFO] Epoch:[0/2](260400/4588595) loss:2.623 lr:0.0000100 epoch_Time:27368.0min: [2024-01-03 21:06:21,208][model8_pretrain.py][INFO] Epoch:[0/2](260400/4588595) loss:2.915 lr:0.0000100 epoch_Time:27368.0min: [2024-01-03 21:06:21,208][model8_pretrain.py][INFO] Epoch:[0/2](260400/4588595) loss:2.832 lr:0.0000100 epoch_Time:27368.0min: [2024-01-03 21:06:21,208][model8_pretrain.py][INFO] Epoch:[0/2](260400/4588595) loss:3.067 lr:0.0000100 epoch_Time:27368.0min: [2024-01-03 21:06:58,137][model8_pretrain.py][INFO] Epoch:[0/2](260500/4588595) loss:2.669 lr:0.0000100 epoch_Time:27367.0min: [2024-01-03 21:06:58,137][model8_pretrain.py][INFO] Epoch:[0/2](260500/4588595) loss:2.862 lr:0.0000100 epoch_Time:27367.0min: [2024-01-03 21:06:58,137][model8_pretrain.py][INFO] Epoch:[0/2](260500/4588595) loss:2.653 lr:0.0000100 epoch_Time:27367.0min: [2024-01-03 21:06:58,137][model8_pretrain.py][INFO] Epoch:[0/2](260500/4588595) loss:2.837 lr:0.0000100 epoch_Time:27367.0min: [2024-01-03 21:06:58,137][model8_pretrain.py][INFO] Epoch:[0/2](260500/4588595) loss:2.790 lr:0.0000100 epoch_Time:27367.0min: [2024-01-03 21:06:58,137][model8_pretrain.py][INFO] 
Epoch:[0/2](260500/4588595) loss:2.616 lr:0.0000100 epoch_Time:27367.0min: [2024-01-03 21:06:58,137][model8_pretrain.py][INFO] Epoch:[0/2](260500/4588595) loss:3.244 lr:0.0000100 epoch_Time:27367.0min: [2024-01-03 21:06:58,138][model8_pretrain.py][INFO] Epoch:[0/2](260500/4588595) loss:1.810 lr:0.0000100 epoch_Time:27367.0min: [2024-01-03 21:07:35,065][model8_pretrain.py][INFO] Epoch:[0/2](260600/4588595) loss:2.970 lr:0.0000100 epoch_Time:27366.0min: [2024-01-03 21:07:35,065][model8_pretrain.py][INFO] Epoch:[0/2](260600/4588595) loss:2.564 lr:0.0000100 epoch_Time:27366.0min: [2024-01-03 21:07:35,065][model8_pretrain.py][INFO] Epoch:[0/2](260600/4588595) loss:3.012 lr:0.0000100 epoch_Time:27366.0min: [2024-01-03 21:07:35,065][model8_pretrain.py][INFO] Epoch:[0/2](260600/4588595) loss:2.772 lr:0.0000100 epoch_Time:27366.0min: [2024-01-03 21:07:35,065][model8_pretrain.py][INFO] Epoch:[0/2](260600/4588595) loss:3.058 lr:0.0000100 epoch_Time:27366.0min: [2024-01-03 21:07:35,065][model8_pretrain.py][INFO] Epoch:[0/2](260600/4588595) loss:2.875 lr:0.0000100 epoch_Time:27366.0min: [2024-01-03 21:07:35,065][model8_pretrain.py][INFO] Epoch:[0/2](260600/4588595) loss:2.871 lr:0.0000100 epoch_Time:27366.0min: [2024-01-03 21:07:35,066][model8_pretrain.py][INFO] Epoch:[0/2](260600/4588595) loss:3.041 lr:0.0000100 epoch_Time:27366.0min: [2024-01-03 21:08:12,001][model8_pretrain.py][INFO] Epoch:[0/2](260700/4588595) loss:2.939 lr:0.0000100 epoch_Time:27365.0min: [2024-01-03 21:08:12,001][model8_pretrain.py][INFO] Epoch:[0/2](260700/4588595) loss:2.699 lr:0.0000100 epoch_Time:27365.0min: [2024-01-03 21:08:12,001][model8_pretrain.py][INFO] Epoch:[0/2](260700/4588595) loss:2.823 lr:0.0000100 epoch_Time:27365.0min: [2024-01-03 21:08:12,001][model8_pretrain.py][INFO] Epoch:[0/2](260700/4588595) loss:2.533 lr:0.0000100 epoch_Time:27365.0min: [2024-01-03 21:08:12,001][model8_pretrain.py][INFO] Epoch:[0/2](260700/4588595) loss:2.686 lr:0.0000100 epoch_Time:27365.0min: [2024-01-03 
21:08:12,001][model8_pretrain.py][INFO] Epoch:[0/2](260700/4588595) loss:3.097 lr:0.0000100 epoch_Time:27365.0min: [2024-01-03 21:08:12,001][model8_pretrain.py][INFO] Epoch:[0/2](260700/4588595) loss:2.752 lr:0.0000100 epoch_Time:27365.0min: [2024-01-03 21:08:12,001][model8_pretrain.py][INFO] Epoch:[0/2](260700/4588595) loss:2.538 lr:0.0000100 epoch_Time:27365.0min: [2024-01-03 21:08:48,931][model8_pretrain.py][INFO] Epoch:[0/2](260800/4588595) loss:3.482 lr:0.0000100 epoch_Time:27364.0min: [2024-01-03 21:08:48,931][model8_pretrain.py][INFO] Epoch:[0/2](260800/4588595) loss:2.592 lr:0.0000100 epoch_Time:27364.0min: [2024-01-03 21:08:48,931][model8_pretrain.py][INFO] Epoch:[0/2](260800/4588595) loss:2.379 lr:0.0000100 epoch_Time:27364.0min: [2024-01-03 21:08:48,931][model8_pretrain.py][INFO] Epoch:[0/2](260800/4588595) loss:2.948 lr:0.0000100 epoch_Time:27364.0min: [2024-01-03 21:08:48,931][model8_pretrain.py][INFO] Epoch:[0/2](260800/4588595) loss:3.341 lr:0.0000100 epoch_Time:27364.0min: [2024-01-03 21:08:48,931][model8_pretrain.py][INFO] Epoch:[0/2](260800/4588595) loss:2.432 lr:0.0000100 epoch_Time:27364.0min: [2024-01-03 21:08:48,931][model8_pretrain.py][INFO] Epoch:[0/2](260800/4588595) loss:2.756 lr:0.0000100 epoch_Time:27364.0min: [2024-01-03 21:08:48,931][model8_pretrain.py][INFO] Epoch:[0/2](260800/4588595) loss:2.759 lr:0.0000100 epoch_Time:27364.0min: [2024-01-03 21:09:34,527][model8_pretrain.py][INFO] Epoch:[0/2](260900/4588595) loss:3.161 lr:0.0000100 epoch_Time:27366.0min: [2024-01-03 21:09:34,527][model8_pretrain.py][INFO] Epoch:[0/2](260900/4588595) loss:3.219 lr:0.0000100 epoch_Time:27366.0min: [2024-01-03 21:09:34,527][model8_pretrain.py][INFO] Epoch:[0/2](260900/4588595) loss:3.183 lr:0.0000100 epoch_Time:27366.0min: [2024-01-03 21:09:34,527][model8_pretrain.py][INFO] Epoch:[0/2](260900/4588595) loss:3.384 lr:0.0000100 epoch_Time:27366.0min: [2024-01-03 21:09:34,527][model8_pretrain.py][INFO] Epoch:[0/2](260900/4588595) loss:2.821 lr:0.0000100 
epoch_Time:27366.0min: [2024-01-03 21:09:34,527][model8_pretrain.py][INFO] Epoch:[0/2](260900/4588595) loss:3.507 lr:0.0000100 epoch_Time:27366.0min: [2024-01-03 21:09:34,528][model8_pretrain.py][INFO] Epoch:[0/2](260900/4588595) loss:3.136 lr:0.0000100 epoch_Time:27366.0min: [2024-01-03 21:09:34,529][model8_pretrain.py][INFO] Epoch:[0/2](260900/4588595) loss:2.616 lr:0.0000100 epoch_Time:27366.0min: [2024-01-03 21:10:11,459][model8_pretrain.py][INFO] Epoch:[0/2](261000/4588595) loss:3.100 lr:0.0000100 epoch_Time:27365.0min: [2024-01-03 21:10:11,459][model8_pretrain.py][INFO] Epoch:[0/2](261000/4588595) loss:2.968 lr:0.0000100 epoch_Time:27365.0min: [2024-01-03 21:10:11,459][model8_pretrain.py][INFO] Epoch:[0/2](261000/4588595) loss:2.413 lr:0.0000100 epoch_Time:27365.0min: [2024-01-03 21:10:11,459][model8_pretrain.py][INFO] Epoch:[0/2](261000/4588595) loss:2.952 lr:0.0000100 epoch_Time:27365.0min: [2024-01-03 21:10:11,459][model8_pretrain.py][INFO] Epoch:[0/2](261000/4588595) loss:2.840 lr:0.0000100 epoch_Time:27365.0min: [2024-01-03 21:10:11,459][model8_pretrain.py][INFO] Epoch:[0/2](261000/4588595) loss:3.062 lr:0.0000100 epoch_Time:27365.0min: [2024-01-03 21:10:11,460][model8_pretrain.py][INFO] Epoch:[0/2](261000/4588595) loss:2.960 lr:0.0000100 epoch_Time:27365.0min: [2024-01-03 21:10:11,459][model8_pretrain.py][INFO] Epoch:[0/2](261000/4588595) loss:2.930 lr:0.0000100 epoch_Time:27365.0min: [2024-01-03 21:10:48,397][model8_pretrain.py][INFO] Epoch:[0/2](261100/4588595) loss:3.212 lr:0.0000100 epoch_Time:27363.0min: [2024-01-03 21:10:48,397][model8_pretrain.py][INFO] Epoch:[0/2](261100/4588595) loss:3.104 lr:0.0000100 epoch_Time:27363.0min: [2024-01-03 21:10:48,397][model8_pretrain.py][INFO] Epoch:[0/2](261100/4588595) loss:2.016 lr:0.0000100 epoch_Time:27363.0min: [2024-01-03 21:10:48,397][model8_pretrain.py][INFO] Epoch:[0/2](261100/4588595) loss:2.267 lr:0.0000100 epoch_Time:27363.0min: [2024-01-03 21:10:48,397][model8_pretrain.py][INFO] 
Epoch:[0/2](261100/4588595) loss:3.049 lr:0.0000100 epoch_Time:27363.0min: [2024-01-03 21:10:48,397][model8_pretrain.py][INFO] Epoch:[0/2](261100/4588595) loss:2.482 lr:0.0000100 epoch_Time:27363.0min: [2024-01-03 21:10:48,397][model8_pretrain.py][INFO] Epoch:[0/2](261100/4588595) loss:2.941 lr:0.0000100 epoch_Time:27363.0min: [2024-01-03 21:10:48,397][model8_pretrain.py][INFO] Epoch:[0/2](261100/4588595) loss:3.211 lr:0.0000100 epoch_Time:27363.0min: [2024-01-03 21:11:25,340][model8_pretrain.py][INFO] Epoch:[0/2](261200/4588595) loss:2.855 lr:0.0000100 epoch_Time:27363.0min: [2024-01-03 21:11:25,340][model8_pretrain.py][INFO] Epoch:[0/2](261200/4588595) loss:2.601 lr:0.0000100 epoch_Time:27363.0min: [2024-01-03 21:11:25,340][model8_pretrain.py][INFO] Epoch:[0/2](261200/4588595) loss:3.276 lr:0.0000100 epoch_Time:27363.0min: [2024-01-03 21:11:25,340][model8_pretrain.py][INFO] Epoch:[0/2](261200/4588595) loss:2.594 lr:0.0000100 epoch_Time:27363.0min: [2024-01-03 21:11:25,340][model8_pretrain.py][INFO] Epoch:[0/2](261200/4588595) loss:3.103 lr:0.0000100 epoch_Time:27363.0min: [2024-01-03 21:11:25,340][model8_pretrain.py][INFO] Epoch:[0/2](261200/4588595) loss:2.411 lr:0.0000100 epoch_Time:27363.0min: [2024-01-03 21:11:25,340][model8_pretrain.py][INFO] Epoch:[0/2](261200/4588595) loss:2.882 lr:0.0000100 epoch_Time:27363.0min: [2024-01-03 21:11:25,341][model8_pretrain.py][INFO] Epoch:[0/2](261200/4588595) loss:2.941 lr:0.0000100 epoch_Time:27363.0min: [2024-01-03 21:12:02,281][model8_pretrain.py][INFO] Epoch:[0/2](261300/4588595) loss:3.274 lr:0.0000100 epoch_Time:27362.0min: [2024-01-03 21:12:02,281][model8_pretrain.py][INFO] Epoch:[0/2](261300/4588595) loss:3.208 lr:0.0000100 epoch_Time:27362.0min: [2024-01-03 21:12:02,281][model8_pretrain.py][INFO] Epoch:[0/2](261300/4588595) loss:2.645 lr:0.0000100 epoch_Time:27362.0min: [2024-01-03 21:12:02,281][model8_pretrain.py][INFO] Epoch:[0/2](261300/4588595) loss:3.089 lr:0.0000100 epoch_Time:27362.0min: [2024-01-03 
21:12:02,281][model8_pretrain.py][INFO] Epoch:[0/2](261300/4588595) loss:2.820 lr:0.0000100 epoch_Time:27362.0min: [2024-01-03 21:12:02,281][model8_pretrain.py][INFO] Epoch:[0/2](261300/4588595) loss:2.722 lr:0.0000100 epoch_Time:27362.0min: [2024-01-03 21:12:02,281][model8_pretrain.py][INFO] Epoch:[0/2](261300/4588595) loss:3.138 lr:0.0000100 epoch_Time:27362.0min: [2024-01-03 21:12:02,282][model8_pretrain.py][INFO] Epoch:[0/2](261300/4588595) loss:3.012 lr:0.0000100 epoch_Time:27362.0min: [2024-01-03 21:12:39,214][model8_pretrain.py][INFO] Epoch:[0/2](261400/4588595) loss:2.838 lr:0.0000100 epoch_Time:27362.0min: [2024-01-03 21:12:39,214][model8_pretrain.py][INFO] Epoch:[0/2](261400/4588595) loss:2.684 lr:0.0000100 epoch_Time:27362.0min: [2024-01-03 21:12:39,214][model8_pretrain.py][INFO] Epoch:[0/2](261400/4588595) loss:3.102 lr:0.0000100 epoch_Time:27362.0min: [2024-01-03 21:12:39,214][model8_pretrain.py][INFO] Epoch:[0/2](261400/4588595) loss:2.755 lr:0.0000100 epoch_Time:27362.0min: [2024-01-03 21:12:39,214][model8_pretrain.py][INFO] Epoch:[0/2](261400/4588595) loss:2.962 lr:0.0000100 epoch_Time:27362.0min: [2024-01-03 21:12:39,214][model8_pretrain.py][INFO] Epoch:[0/2](261400/4588595) loss:3.299 lr:0.0000100 epoch_Time:27362.0min: [2024-01-03 21:12:39,214][model8_pretrain.py][INFO] Epoch:[0/2](261400/4588595) loss:3.068 lr:0.0000100 epoch_Time:27362.0min: [2024-01-03 21:12:39,214][model8_pretrain.py][INFO] Epoch:[0/2](261400/4588595) loss:2.749 lr:0.0000100 epoch_Time:27362.0min: [2024-01-03 21:13:16,143][model8_pretrain.py][INFO] Epoch:[0/2](261500/4588595) loss:3.152 lr:0.0000100 epoch_Time:27360.0min: [2024-01-03 21:13:16,143][model8_pretrain.py][INFO] Epoch:[0/2](261500/4588595) loss:3.078 lr:0.0000100 epoch_Time:27360.0min: [2024-01-03 21:13:16,143][model8_pretrain.py][INFO] Epoch:[0/2](261500/4588595) loss:2.567 lr:0.0000100 epoch_Time:27360.0min: [2024-01-03 21:13:16,143][model8_pretrain.py][INFO] Epoch:[0/2](261500/4588595) loss:2.584 lr:0.0000100 
epoch_Time:27360.0min: [2024-01-03 21:13:16,143][model8_pretrain.py][INFO] Epoch:[0/2](261500/4588595) loss:2.876 lr:0.0000100 epoch_Time:27360.0min: [2024-01-03 21:13:16,143][model8_pretrain.py][INFO] Epoch:[0/2](261500/4588595) loss:2.507 lr:0.0000100 epoch_Time:27360.0min: [2024-01-03 21:13:16,143][model8_pretrain.py][INFO] Epoch:[0/2](261500/4588595) loss:2.994 lr:0.0000100 epoch_Time:27360.0min: [2024-01-03 21:13:16,144][model8_pretrain.py][INFO] Epoch:[0/2](261500/4588595) loss:3.064 lr:0.0000100 epoch_Time:27360.0min: [2024-01-03 21:13:53,072][model8_pretrain.py][INFO] Epoch:[0/2](261600/4588595) loss:2.685 lr:0.0000100 epoch_Time:27359.0min: [2024-01-03 21:13:53,072][model8_pretrain.py][INFO] Epoch:[0/2](261600/4588595) loss:3.063 lr:0.0000100 epoch_Time:27359.0min: [2024-01-03 21:13:53,072][model8_pretrain.py][INFO] Epoch:[0/2](261600/4588595) loss:2.926 lr:0.0000100 epoch_Time:27359.0min: [2024-01-03 21:13:53,072][model8_pretrain.py][INFO] Epoch:[0/2](261600/4588595) loss:2.858 lr:0.0000100 epoch_Time:27359.0min: [2024-01-03 21:13:53,072][model8_pretrain.py][INFO] Epoch:[0/2](261600/4588595) loss:3.128 lr:0.0000100 epoch_Time:27359.0min: [2024-01-03 21:13:53,072][model8_pretrain.py][INFO] Epoch:[0/2](261600/4588595) loss:2.419 lr:0.0000100 epoch_Time:27359.0min: [2024-01-03 21:13:53,072][model8_pretrain.py][INFO] Epoch:[0/2](261600/4588595) loss:3.162 lr:0.0000100 epoch_Time:27359.0min: [2024-01-03 21:13:53,072][model8_pretrain.py][INFO] Epoch:[0/2](261600/4588595) loss:3.281 lr:0.0000100 epoch_Time:27359.0min: [2024-01-03 21:14:38,699][model8_pretrain.py][INFO] Epoch:[0/2](261700/4588595) loss:3.440 lr:0.0000100 epoch_Time:27361.0min: [2024-01-03 21:14:38,699][model8_pretrain.py][INFO] Epoch:[0/2](261700/4588595) loss:3.105 lr:0.0000100 epoch_Time:27361.0min: [2024-01-03 21:14:38,699][model8_pretrain.py][INFO] Epoch:[0/2](261700/4588595) loss:3.290 lr:0.0000100 epoch_Time:27361.0min: [2024-01-03 21:14:38,699][model8_pretrain.py][INFO] 
Epoch:[0/2](261700/4588595) loss:3.300 lr:0.0000100 epoch_Time:27361.0min: [2024-01-03 21:14:38,699][model8_pretrain.py][INFO] Epoch:[0/2](261700/4588595) loss:2.630 lr:0.0000100 epoch_Time:27361.0min: [2024-01-03 21:14:38,699][model8_pretrain.py][INFO] Epoch:[0/2](261700/4588595) loss:2.938 lr:0.0000100 epoch_Time:27361.0min: [2024-01-03 21:14:38,699][model8_pretrain.py][INFO] Epoch:[0/2](261700/4588595) loss:2.941 lr:0.0000100 epoch_Time:27361.0min: [2024-01-03 21:14:38,699][model8_pretrain.py][INFO] Epoch:[0/2](261700/4588595) loss:2.896 lr:0.0000100 epoch_Time:27361.0min: [2024-01-03 21:15:15,622][model8_pretrain.py][INFO] Epoch:[0/2](261800/4588595) loss:3.222 lr:0.0000100 epoch_Time:27360.0min: [2024-01-03 21:15:15,622][model8_pretrain.py][INFO] Epoch:[0/2](261800/4588595) loss:2.025 lr:0.0000100 epoch_Time:27360.0min: [2024-01-03 21:15:15,622][model8_pretrain.py][INFO] Epoch:[0/2](261800/4588595) loss:2.137 lr:0.0000100 epoch_Time:27360.0min: [2024-01-03 21:15:15,622][model8_pretrain.py][INFO] Epoch:[0/2](261800/4588595) loss:3.023 lr:0.0000100 epoch_Time:27360.0min: [2024-01-03 21:15:15,622][model8_pretrain.py][INFO] Epoch:[0/2](261800/4588595) loss:2.812 lr:0.0000100 epoch_Time:27360.0min: [2024-01-03 21:15:15,622][model8_pretrain.py][INFO] Epoch:[0/2](261800/4588595) loss:2.700 lr:0.0000100 epoch_Time:27360.0min: [2024-01-03 21:15:15,622][model8_pretrain.py][INFO] Epoch:[0/2](261800/4588595) loss:2.659 lr:0.0000100 epoch_Time:27360.0min: [2024-01-03 21:15:15,622][model8_pretrain.py][INFO] Epoch:[0/2](261800/4588595) loss:2.734 lr:0.0000100 epoch_Time:27360.0min: [2024-01-03 21:15:52,554][model8_pretrain.py][INFO] Epoch:[0/2](261900/4588595) loss:2.820 lr:0.0000100 epoch_Time:27359.0min: [2024-01-03 21:15:52,554][model8_pretrain.py][INFO] Epoch:[0/2](261900/4588595) loss:2.433 lr:0.0000100 epoch_Time:27359.0min: [2024-01-03 21:15:52,554][model8_pretrain.py][INFO] Epoch:[0/2](261900/4588595) loss:2.397 lr:0.0000100 epoch_Time:27359.0min: [2024-01-03 
21:15:52,554][model8_pretrain.py][INFO] Epoch:[0/2](261900/4588595) loss:2.484 lr:0.0000100 epoch_Time:27359.0min: [2024-01-03 21:15:52,554][model8_pretrain.py][INFO] Epoch:[0/2](261900/4588595) loss:2.876 lr:0.0000100 epoch_Time:27359.0min: [2024-01-03 21:15:52,554][model8_pretrain.py][INFO] Epoch:[0/2](261900/4588595) loss:2.959 lr:0.0000100 epoch_Time:27359.0min: [2024-01-03 21:15:52,554][model8_pretrain.py][INFO] Epoch:[0/2](261900/4588595) loss:2.952 lr:0.0000100 epoch_Time:27359.0min: [2024-01-03 21:15:52,554][model8_pretrain.py][INFO] Epoch:[0/2](261900/4588595) loss:2.780 lr:0.0000100 epoch_Time:27359.0min: [2024-01-03 21:16:29,491][model8_pretrain.py][INFO] Epoch:[0/2](262000/4588595) loss:2.876 lr:0.0000100 epoch_Time:27358.0min: [2024-01-03 21:16:29,491][model8_pretrain.py][INFO] Epoch:[0/2](262000/4588595) loss:3.158 lr:0.0000100 epoch_Time:27358.0min: [2024-01-03 21:16:29,491][model8_pretrain.py][INFO] Epoch:[0/2](262000/4588595) loss:3.070 lr:0.0000100 epoch_Time:27358.0min: [2024-01-03 21:16:29,491][model8_pretrain.py][INFO] Epoch:[0/2](262000/4588595) loss:3.174 lr:0.0000100 epoch_Time:27358.0min: [2024-01-03 21:16:29,491][model8_pretrain.py][INFO] Epoch:[0/2](262000/4588595) loss:2.969 lr:0.0000100 epoch_Time:27358.0min: [2024-01-03 21:16:29,491][model8_pretrain.py][INFO] Epoch:[0/2](262000/4588595) loss:3.174 lr:0.0000100 epoch_Time:27358.0min: [2024-01-03 21:16:29,491][model8_pretrain.py][INFO] Epoch:[0/2](262000/4588595) loss:3.222 lr:0.0000100 epoch_Time:27358.0min: [2024-01-03 21:16:29,491][model8_pretrain.py][INFO] Epoch:[0/2](262000/4588595) loss:2.676 lr:0.0000100 epoch_Time:27358.0min: [2024-01-03 21:17:06,431][model8_pretrain.py][INFO] Epoch:[0/2](262100/4588595) loss:3.277 lr:0.0000100 epoch_Time:27357.0min: [2024-01-03 21:17:06,431][model8_pretrain.py][INFO] Epoch:[0/2](262100/4588595) loss:2.996 lr:0.0000100 epoch_Time:27357.0min: [2024-01-03 21:17:06,431][model8_pretrain.py][INFO] Epoch:[0/2](262100/4588595) loss:2.827 lr:0.0000100 
epoch_Time:27357.0min: [2024-01-03 21:17:06,431][model8_pretrain.py][INFO] Epoch:[0/2](262100/4588595) loss:2.851 lr:0.0000100 epoch_Time:27357.0min: [2024-01-03 21:17:06,431][model8_pretrain.py][INFO] Epoch:[0/2](262100/4588595) loss:2.778 lr:0.0000100 epoch_Time:27357.0min: [2024-01-03 21:17:06,431][model8_pretrain.py][INFO] Epoch:[0/2](262100/4588595) loss:3.161 lr:0.0000100 epoch_Time:27357.0min: [2024-01-03 21:17:06,431][model8_pretrain.py][INFO] Epoch:[0/2](262100/4588595) loss:2.894 lr:0.0000100 epoch_Time:27357.0min: [2024-01-03 21:17:06,432][model8_pretrain.py][INFO] Epoch:[0/2](262100/4588595) loss:2.885 lr:0.0000100 epoch_Time:27357.0min: [2024-01-03 21:17:43,364][model8_pretrain.py][INFO] Epoch:[0/2](262200/4588595) loss:3.334 lr:0.0000100 epoch_Time:27357.0min: [2024-01-03 21:17:43,364][model8_pretrain.py][INFO] Epoch:[0/2](262200/4588595) loss:2.965 lr:0.0000100 epoch_Time:27357.0min: [2024-01-03 21:17:43,364][model8_pretrain.py][INFO] Epoch:[0/2](262200/4588595) loss:2.557 lr:0.0000100 epoch_Time:27357.0min: [2024-01-03 21:17:43,364][model8_pretrain.py][INFO] Epoch:[0/2](262200/4588595) loss:2.998 lr:0.0000100 epoch_Time:27357.0min: [2024-01-03 21:17:43,364][model8_pretrain.py][INFO] Epoch:[0/2](262200/4588595) loss:3.179 lr:0.0000100 epoch_Time:27357.0min: [2024-01-03 21:17:43,364][model8_pretrain.py][INFO] Epoch:[0/2](262200/4588595) loss:2.545 lr:0.0000100 epoch_Time:27357.0min: [2024-01-03 21:17:43,364][model8_pretrain.py][INFO] Epoch:[0/2](262200/4588595) loss:3.025 lr:0.0000100 epoch_Time:27357.0min: [2024-01-03 21:17:43,365][model8_pretrain.py][INFO] Epoch:[0/2](262200/4588595) loss:3.205 lr:0.0000100 epoch_Time:27357.0min: [2024-01-03 21:18:20,301][model8_pretrain.py][INFO] Epoch:[0/2](262300/4588595) loss:3.470 lr:0.0000100 epoch_Time:27355.0min: [2024-01-03 21:18:20,301][model8_pretrain.py][INFO] Epoch:[0/2](262300/4588595) loss:3.159 lr:0.0000100 epoch_Time:27355.0min: [2024-01-03 21:18:20,302][model8_pretrain.py][INFO] 
Epoch:[0/2](262300/4588595) loss:2.826 lr:0.0000100 epoch_Time:27355.0min: [2024-01-03 21:18:20,302][model8_pretrain.py][INFO] Epoch:[0/2](262300/4588595) loss:3.460 lr:0.0000100 epoch_Time:27355.0min: [2024-01-03 21:18:20,302][model8_pretrain.py][INFO] Epoch:[0/2](262300/4588595) loss:3.064 lr:0.0000100 epoch_Time:27355.0min: [2024-01-03 21:18:20,302][model8_pretrain.py][INFO] Epoch:[0/2](262300/4588595) loss:2.604 lr:0.0000100 epoch_Time:27355.0min: [2024-01-03 21:18:20,302][model8_pretrain.py][INFO] Epoch:[0/2](262300/4588595) loss:2.864 lr:0.0000100 epoch_Time:27355.0min: [2024-01-03 21:18:20,302][model8_pretrain.py][INFO] Epoch:[0/2](262300/4588595) loss:3.151 lr:0.0000100 epoch_Time:27355.0min: [2024-01-03 21:18:57,235][model8_pretrain.py][INFO] Epoch:[0/2](262400/4588595) loss:2.359 lr:0.0000100 epoch_Time:27354.0min: [2024-01-03 21:18:57,235][model8_pretrain.py][INFO] Epoch:[0/2](262400/4588595) loss:3.250 lr:0.0000100 epoch_Time:27354.0min: [2024-01-03 21:18:57,235][model8_pretrain.py][INFO] Epoch:[0/2](262400/4588595) loss:2.731 lr:0.0000100 epoch_Time:27354.0min: [2024-01-03 21:18:57,235][model8_pretrain.py][INFO] Epoch:[0/2](262400/4588595) loss:2.976 lr:0.0000100 epoch_Time:27354.0min: [2024-01-03 21:18:57,235][model8_pretrain.py][INFO] Epoch:[0/2](262400/4588595) loss:2.941 lr:0.0000100 epoch_Time:27354.0min: [2024-01-03 21:18:57,235][model8_pretrain.py][INFO] Epoch:[0/2](262400/4588595) loss:1.940 lr:0.0000100 epoch_Time:27354.0min: [2024-01-03 21:18:57,235][model8_pretrain.py][INFO] Epoch:[0/2](262400/4588595) loss:3.008 lr:0.0000100 epoch_Time:27354.0min: [2024-01-03 21:18:57,235][model8_pretrain.py][INFO] Epoch:[0/2](262400/4588595) loss:2.686 lr:0.0000100 epoch_Time:27354.0min: [2024-01-03 21:19:42,853][model8_pretrain.py][INFO] Epoch:[0/2](262500/4588595) loss:3.138 lr:0.0000100 epoch_Time:27356.0min: [2024-01-03 21:19:42,853][model8_pretrain.py][INFO] Epoch:[0/2](262500/4588595) loss:2.472 lr:0.0000100 epoch_Time:27356.0min: [2024-01-03 
21:19:42,853][model8_pretrain.py][INFO] Epoch:[0/2](262500/4588595) loss:2.917 lr:0.0000100 epoch_Time:27356.0min: [2024-01-03 21:19:42,853][model8_pretrain.py][INFO] Epoch:[0/2](262500/4588595) loss:2.785 lr:0.0000100 epoch_Time:27356.0min: [2024-01-03 21:19:42,853][model8_pretrain.py][INFO] Epoch:[0/2](262500/4588595) loss:2.995 lr:0.0000100 epoch_Time:27356.0min: [2024-01-03 21:19:42,853][model8_pretrain.py][INFO] Epoch:[0/2](262500/4588595) loss:3.211 lr:0.0000100 epoch_Time:27356.0min: [2024-01-03 21:19:42,853][model8_pretrain.py][INFO] Epoch:[0/2](262500/4588595) loss:2.876 lr:0.0000100 epoch_Time:27356.0min: [2024-01-03 21:19:42,853][model8_pretrain.py][INFO] Epoch:[0/2](262500/4588595) loss:3.159 lr:0.0000100 epoch_Time:27356.0min: [2024-01-03 21:20:19,787][model8_pretrain.py][INFO] Epoch:[0/2](262600/4588595) loss:3.159 lr:0.0000100 epoch_Time:27355.0min: [2024-01-03 21:20:19,787][model8_pretrain.py][INFO] Epoch:[0/2](262600/4588595) loss:3.224 lr:0.0000100 epoch_Time:27355.0min: [2024-01-03 21:20:19,787][model8_pretrain.py][INFO] Epoch:[0/2](262600/4588595) loss:2.571 lr:0.0000100 epoch_Time:27355.0min: [2024-01-03 21:20:19,787][model8_pretrain.py][INFO] Epoch:[0/2](262600/4588595) loss:3.246 lr:0.0000100 epoch_Time:27355.0min: [2024-01-03 21:20:19,787][model8_pretrain.py][INFO] Epoch:[0/2](262600/4588595) loss:2.554 lr:0.0000100 epoch_Time:27355.0min: [2024-01-03 21:20:19,787][model8_pretrain.py][INFO] Epoch:[0/2](262600/4588595) loss:3.374 lr:0.0000100 epoch_Time:27355.0min: [2024-01-03 21:20:19,788][model8_pretrain.py][INFO] Epoch:[0/2](262600/4588595) loss:2.771 lr:0.0000100 epoch_Time:27355.0min: [2024-01-03 21:20:19,788][model8_pretrain.py][INFO] Epoch:[0/2](262600/4588595) loss:2.538 lr:0.0000100 epoch_Time:27355.0min: [2024-01-03 21:20:56,720][model8_pretrain.py][INFO] Epoch:[0/2](262700/4588595) loss:3.082 lr:0.0000100 epoch_Time:27354.0min: [2024-01-03 21:20:56,720][model8_pretrain.py][INFO] Epoch:[0/2](262700/4588595) loss:2.702 lr:0.0000100 
epoch_Time:27354.0min: [2024-01-03 21:20:56,720][model8_pretrain.py][INFO] Epoch:[0/2](262700/4588595) loss:2.541 lr:0.0000100 epoch_Time:27354.0min: [2024-01-03 21:20:56,720][model8_pretrain.py][INFO] Epoch:[0/2](262700/4588595) loss:2.917 lr:0.0000100 epoch_Time:27354.0min: [2024-01-03 21:20:56,720][model8_pretrain.py][INFO] Epoch:[0/2](262700/4588595) loss:2.932 lr:0.0000100 epoch_Time:27354.0min: [2024-01-03 21:20:56,720][model8_pretrain.py][INFO] Epoch:[0/2](262700/4588595) loss:3.309 lr:0.0000100 epoch_Time:27354.0min: [2024-01-03 21:20:56,720][model8_pretrain.py][INFO] Epoch:[0/2](262700/4588595) loss:2.680 lr:0.0000100 epoch_Time:27354.0min: [2024-01-03 21:20:56,720][model8_pretrain.py][INFO] Epoch:[0/2](262700/4588595) loss:2.763 lr:0.0000100 epoch_Time:27354.0min: [2024-01-03 21:21:33,656][model8_pretrain.py][INFO] Epoch:[0/2](262800/4588595) loss:2.784 lr:0.0000100 epoch_Time:27354.0min: [2024-01-03 21:21:33,656][model8_pretrain.py][INFO] Epoch:[0/2](262800/4588595) loss:3.135 lr:0.0000100 epoch_Time:27354.0min: [2024-01-03 21:21:33,656][model8_pretrain.py][INFO] Epoch:[0/2](262800/4588595) loss:2.512 lr:0.0000100 epoch_Time:27354.0min: [2024-01-03 21:21:33,657][model8_pretrain.py][INFO] Epoch:[0/2](262800/4588595) loss:2.839 lr:0.0000100 epoch_Time:27354.0min: [2024-01-03 21:21:33,657][model8_pretrain.py][INFO] Epoch:[0/2](262800/4588595) loss:3.512 lr:0.0000100 epoch_Time:27354.0min: [2024-01-03 21:21:33,657][model8_pretrain.py][INFO] Epoch:[0/2](262800/4588595) loss:2.742 lr:0.0000100 epoch_Time:27354.0min: [2024-01-03 21:21:33,657][model8_pretrain.py][INFO] Epoch:[0/2](262800/4588595) loss:2.679 lr:0.0000100 epoch_Time:27354.0min: [2024-01-03 21:21:33,657][model8_pretrain.py][INFO] Epoch:[0/2](262800/4588595) loss:3.324 lr:0.0000100 epoch_Time:27354.0min: [2024-01-03 21:22:10,606][model8_pretrain.py][INFO] Epoch:[0/2](262900/4588595) loss:2.891 lr:0.0000100 epoch_Time:27352.0min: [2024-01-03 21:22:10,606][model8_pretrain.py][INFO] 
Epoch:[0/2](262900/4588595) loss:2.276 lr:0.0000100 epoch_Time:27352.0min: [2024-01-03 21:22:10,606][model8_pretrain.py][INFO] Epoch:[0/2](262900/4588595) loss:3.153 lr:0.0000100 epoch_Time:27352.0min: [2024-01-03 21:22:10,606][model8_pretrain.py][INFO] Epoch:[0/2](262900/4588595) loss:3.447 lr:0.0000100 epoch_Time:27352.0min: [2024-01-03 21:22:10,606][model8_pretrain.py][INFO] Epoch:[0/2](262900/4588595) loss:3.148 lr:0.0000100 epoch_Time:27352.0min: [2024-01-03 21:22:10,606][model8_pretrain.py][INFO] Epoch:[0/2](262900/4588595) loss:3.105 lr:0.0000100 epoch_Time:27352.0min: [2024-01-03 21:22:10,607][model8_pretrain.py][INFO] Epoch:[0/2](262900/4588595) loss:2.282 lr:0.0000100 epoch_Time:27352.0min: [2024-01-03 21:22:10,607][model8_pretrain.py][INFO] Epoch:[0/2](262900/4588595) loss:2.919 lr:0.0000100 epoch_Time:27352.0min: [2024-01-03 21:22:47,550][model8_pretrain.py][INFO] Epoch:[0/2](263000/4588595) loss:2.759 lr:0.0000100 epoch_Time:27352.0min: [2024-01-03 21:22:47,550][model8_pretrain.py][INFO] Epoch:[0/2](263000/4588595) loss:2.612 lr:0.0000100 epoch_Time:27352.0min: [2024-01-03 21:22:47,550][model8_pretrain.py][INFO] Epoch:[0/2](263000/4588595) loss:3.209 lr:0.0000100 epoch_Time:27352.0min: [2024-01-03 21:22:47,550][model8_pretrain.py][INFO] Epoch:[0/2](263000/4588595) loss:2.602 lr:0.0000100 epoch_Time:27352.0min: [2024-01-03 21:22:47,550][model8_pretrain.py][INFO] Epoch:[0/2](263000/4588595) loss:2.992 lr:0.0000100 epoch_Time:27352.0min: [2024-01-03 21:22:47,550][model8_pretrain.py][INFO] Epoch:[0/2](263000/4588595) loss:3.443 lr:0.0000100 epoch_Time:27352.0min: [2024-01-03 21:22:47,550][model8_pretrain.py][INFO] Epoch:[0/2](263000/4588595) loss:3.126 lr:0.0000100 epoch_Time:27352.0min: [2024-01-03 21:22:47,550][model8_pretrain.py][INFO] Epoch:[0/2](263000/4588595) loss:3.324 lr:0.0000100 epoch_Time:27352.0min: [2024-01-03 21:23:24,486][model8_pretrain.py][INFO] Epoch:[0/2](263100/4588595) loss:2.795 lr:0.0000100 epoch_Time:27351.0min: [2024-01-03 
21:23:24,486][model8_pretrain.py][INFO] Epoch:[0/2](263100/4588595) loss:3.121 lr:0.0000100 epoch_Time:27351.0min: [2024-01-03 21:23:24,486][model8_pretrain.py][INFO] Epoch:[0/2](263100/4588595) loss:3.324 lr:0.0000100 epoch_Time:27351.0min: [2024-01-03 21:23:24,486][model8_pretrain.py][INFO] Epoch:[0/2](263100/4588595) loss:2.393 lr:0.0000100 epoch_Time:27351.0min: [2024-01-03 21:23:24,486][model8_pretrain.py][INFO] Epoch:[0/2](263100/4588595) loss:2.891 lr:0.0000100 epoch_Time:27351.0min: [2024-01-03 21:23:24,486][model8_pretrain.py][INFO] Epoch:[0/2](263100/4588595) loss:3.137 lr:0.0000100 epoch_Time:27351.0min: [2024-01-03 21:23:24,486][model8_pretrain.py][INFO] Epoch:[0/2](263100/4588595) loss:2.508 lr:0.0000100 epoch_Time:27351.0min: [2024-01-03 21:23:24,486][model8_pretrain.py][INFO] Epoch:[0/2](263100/4588595) loss:2.954 lr:0.0000100 epoch_Time:27351.0min: [2024-01-03 21:24:01,419][model8_pretrain.py][INFO] Epoch:[0/2](263200/4588595) loss:2.652 lr:0.0000100 epoch_Time:27349.0min: [2024-01-03 21:24:01,419][model8_pretrain.py][INFO] Epoch:[0/2](263200/4588595) loss:3.399 lr:0.0000100 epoch_Time:27349.0min: [2024-01-03 21:24:01,419][model8_pretrain.py][INFO] Epoch:[0/2](263200/4588595) loss:2.318 lr:0.0000100 epoch_Time:27349.0min: [2024-01-03 21:24:01,419][model8_pretrain.py][INFO] Epoch:[0/2](263200/4588595) loss:2.352 lr:0.0000100 epoch_Time:27349.0min: [2024-01-03 21:24:01,420][model8_pretrain.py][INFO] Epoch:[0/2](263200/4588595) loss:2.998 lr:0.0000100 epoch_Time:27349.0min: [2024-01-03 21:24:01,420][model8_pretrain.py][INFO] Epoch:[0/2](263200/4588595) loss:3.245 lr:0.0000100 epoch_Time:27349.0min: [2024-01-03 21:24:01,420][model8_pretrain.py][INFO] Epoch:[0/2](263200/4588595) loss:3.293 lr:0.0000100 epoch_Time:27349.0min: [2024-01-03 21:24:01,420][model8_pretrain.py][INFO] Epoch:[0/2](263200/4588595) loss:2.995 lr:0.0000100 epoch_Time:27349.0min: [2024-01-03 21:24:47,055][model8_pretrain.py][INFO] Epoch:[0/2](263300/4588595) loss:2.818 lr:0.0000100 
epoch_Time:27352.0min: [2024-01-03 21:24:47,055][model8_pretrain.py][INFO] Epoch:[0/2](263300/4588595) loss:3.219 lr:0.0000100 epoch_Time:27352.0min: [2024-01-03 21:24:47,055][model8_pretrain.py][INFO] Epoch:[0/2](263300/4588595) loss:3.106 lr:0.0000100 epoch_Time:27352.0min: [2024-01-03 21:24:47,055][model8_pretrain.py][INFO] Epoch:[0/2](263300/4588595) loss:2.724 lr:0.0000100 epoch_Time:27352.0min: [2024-01-03 21:24:47,055][model8_pretrain.py][INFO] Epoch:[0/2](263300/4588595) loss:3.062 lr:0.0000100 epoch_Time:27352.0min: [2024-01-03 21:24:47,055][model8_pretrain.py][INFO] Epoch:[0/2](263300/4588595) loss:2.823 lr:0.0000100 epoch_Time:27352.0min: [2024-01-03 21:24:47,055][model8_pretrain.py][INFO] Epoch:[0/2](263300/4588595) loss:2.811 lr:0.0000100 epoch_Time:27352.0min: [2024-01-03 21:24:47,055][model8_pretrain.py][INFO] Epoch:[0/2](263300/4588595) loss:2.883 lr:0.0000100 epoch_Time:27352.0min: [2024-01-03 21:25:23,990][model8_pretrain.py][INFO] Epoch:[0/2](263400/4588595) loss:3.026 lr:0.0000100 epoch_Time:27350.0min: [2024-01-03 21:25:23,990][model8_pretrain.py][INFO] Epoch:[0/2](263400/4588595) loss:2.946 lr:0.0000100 epoch_Time:27350.0min: [2024-01-03 21:25:23,990][model8_pretrain.py][INFO] Epoch:[0/2](263400/4588595) loss:3.233 lr:0.0000100 epoch_Time:27350.0min: [2024-01-03 21:25:23,990][model8_pretrain.py][INFO] Epoch:[0/2](263400/4588595) loss:2.696 lr:0.0000100 epoch_Time:27350.0min: [2024-01-03 21:25:23,990][model8_pretrain.py][INFO] Epoch:[0/2](263400/4588595) loss:3.065 lr:0.0000100 epoch_Time:27350.0min: [2024-01-03 21:25:23,990][model8_pretrain.py][INFO] Epoch:[0/2](263400/4588595) loss:2.854 lr:0.0000100 epoch_Time:27350.0min: [2024-01-03 21:25:23,990][model8_pretrain.py][INFO] Epoch:[0/2](263400/4588595) loss:3.134 lr:0.0000100 epoch_Time:27350.0min: [2024-01-03 21:25:23,990][model8_pretrain.py][INFO] Epoch:[0/2](263400/4588595) loss:3.198 lr:0.0000100 epoch_Time:27350.0min: [2024-01-03 21:26:00,949][model8_pretrain.py][INFO] 
Epoch:[0/2](263500/4588595) loss:3.214 lr:0.0000100 epoch_Time:27349.0min: [2024-01-03 21:26:00,949][model8_pretrain.py][INFO] Epoch:[0/2](263500/4588595) loss:3.017 lr:0.0000100 epoch_Time:27349.0min: [2024-01-03 21:26:00,949][model8_pretrain.py][INFO] Epoch:[0/2](263500/4588595) loss:2.966 lr:0.0000100 epoch_Time:27349.0min: [2024-01-03 21:26:00,949][model8_pretrain.py][INFO] Epoch:[0/2](263500/4588595) loss:3.006 lr:0.0000100 epoch_Time:27349.0min: [2024-01-03 21:26:00,949][model8_pretrain.py][INFO] Epoch:[0/2](263500/4588595) loss:3.238 lr:0.0000100 epoch_Time:27349.0min: [2024-01-03 21:26:00,949][model8_pretrain.py][INFO] Epoch:[0/2](263500/4588595) loss:2.937 lr:0.0000100 epoch_Time:27349.0min: [2024-01-03 21:26:00,949][model8_pretrain.py][INFO] Epoch:[0/2](263500/4588595) loss:3.011 lr:0.0000100 epoch_Time:27349.0min: [2024-01-03 21:26:00,949][model8_pretrain.py][INFO] Epoch:[0/2](263500/4588595) loss:2.852 lr:0.0000100 epoch_Time:27349.0min: [2024-01-03 21:26:37,870][model8_pretrain.py][INFO] Epoch:[0/2](263600/4588595) loss:2.578 lr:0.0000100 epoch_Time:27349.0min: [2024-01-03 21:26:37,870][model8_pretrain.py][INFO] Epoch:[0/2](263600/4588595) loss:2.841 lr:0.0000100 epoch_Time:27349.0min: [2024-01-03 21:26:37,870][model8_pretrain.py][INFO] Epoch:[0/2](263600/4588595) loss:2.971 lr:0.0000100 epoch_Time:27349.0min: [2024-01-03 21:26:37,870][model8_pretrain.py][INFO] Epoch:[0/2](263600/4588595) loss:2.643 lr:0.0000100 epoch_Time:27349.0min: [2024-01-03 21:26:37,870][model8_pretrain.py][INFO] Epoch:[0/2](263600/4588595) loss:3.000 lr:0.0000100 epoch_Time:27349.0min: [2024-01-03 21:26:37,870][model8_pretrain.py][INFO] Epoch:[0/2](263600/4588595) loss:3.102 lr:0.0000100 epoch_Time:27349.0min: [2024-01-03 21:26:37,870][model8_pretrain.py][INFO] Epoch:[0/2](263600/4588595) loss:2.683 lr:0.0000100 epoch_Time:27349.0min: [2024-01-03 21:26:37,870][model8_pretrain.py][INFO] Epoch:[0/2](263600/4588595) loss:3.204 lr:0.0000100 epoch_Time:27349.0min: [2024-01-03 
21:27:14,807][model8_pretrain.py][INFO] Epoch:[0/2](263700/4588595) loss:2.903 lr:0.0000100 epoch_Time:27347.0min: [2024-01-03 21:27:14,807][model8_pretrain.py][INFO] Epoch:[0/2](263700/4588595) loss:3.196 lr:0.0000100 epoch_Time:27347.0min: [2024-01-03 21:27:14,807][model8_pretrain.py][INFO] Epoch:[0/2](263700/4588595) loss:2.889 lr:0.0000100 epoch_Time:27347.0min: [2024-01-03 21:27:14,808][model8_pretrain.py][INFO] Epoch:[0/2](263700/4588595) loss:2.396 lr:0.0000100 epoch_Time:27347.0min: [2024-01-03 21:27:14,808][model8_pretrain.py][INFO] Epoch:[0/2](263700/4588595) loss:3.046 lr:0.0000100 epoch_Time:27347.0min: [2024-01-03 21:27:14,808][model8_pretrain.py][INFO] Epoch:[0/2](263700/4588595) loss:2.928 lr:0.0000100 epoch_Time:27347.0min: [2024-01-03 21:27:14,808][model8_pretrain.py][INFO] Epoch:[0/2](263700/4588595) loss:2.837 lr:0.0000100 epoch_Time:27347.0min: [2024-01-03 21:27:14,808][model8_pretrain.py][INFO] Epoch:[0/2](263700/4588595) loss:2.958 lr:0.0000100 epoch_Time:27347.0min: [2024-01-03 21:27:51,751][model8_pretrain.py][INFO] Epoch:[0/2](263800/4588595) loss:3.449 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:27:51,751][model8_pretrain.py][INFO] Epoch:[0/2](263800/4588595) loss:3.076 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:27:51,751][model8_pretrain.py][INFO] Epoch:[0/2](263800/4588595) loss:2.625 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:27:51,751][model8_pretrain.py][INFO] Epoch:[0/2](263800/4588595) loss:2.653 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:27:51,751][model8_pretrain.py][INFO] Epoch:[0/2](263800/4588595) loss:2.756 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:27:51,751][model8_pretrain.py][INFO] Epoch:[0/2](263800/4588595) loss:2.697 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:27:51,751][model8_pretrain.py][INFO] Epoch:[0/2](263800/4588595) loss:3.208 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:27:51,751][model8_pretrain.py][INFO] Epoch:[0/2](263800/4588595) loss:2.941 lr:0.0000100 
epoch_Time:27346.0min: [2024-01-03 21:28:28,698][model8_pretrain.py][INFO] Epoch:[0/2](263900/4588595) loss:2.958 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:28:28,698][model8_pretrain.py][INFO] Epoch:[0/2](263900/4588595) loss:3.297 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:28:28,698][model8_pretrain.py][INFO] Epoch:[0/2](263900/4588595) loss:2.848 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:28:28,698][model8_pretrain.py][INFO] Epoch:[0/2](263900/4588595) loss:3.244 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:28:28,698][model8_pretrain.py][INFO] Epoch:[0/2](263900/4588595) loss:2.967 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:28:28,698][model8_pretrain.py][INFO] Epoch:[0/2](263900/4588595) loss:2.871 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:28:28,698][model8_pretrain.py][INFO] Epoch:[0/2](263900/4588595) loss:3.036 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:28:28,698][model8_pretrain.py][INFO] Epoch:[0/2](263900/4588595) loss:2.749 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:29:05,640][model8_pretrain.py][INFO] Epoch:[0/2](264000/4588595) loss:3.338 lr:0.0000100 epoch_Time:27345.0min: [2024-01-03 21:29:05,640][model8_pretrain.py][INFO] Epoch:[0/2](264000/4588595) loss:3.116 lr:0.0000100 epoch_Time:27345.0min: [2024-01-03 21:29:05,640][model8_pretrain.py][INFO] Epoch:[0/2](264000/4588595) loss:3.187 lr:0.0000100 epoch_Time:27345.0min: [2024-01-03 21:29:05,640][model8_pretrain.py][INFO] Epoch:[0/2](264000/4588595) loss:2.424 lr:0.0000100 epoch_Time:27345.0min: [2024-01-03 21:29:05,640][model8_pretrain.py][INFO] Epoch:[0/2](264000/4588595) loss:3.275 lr:0.0000100 epoch_Time:27345.0min: [2024-01-03 21:29:05,640][model8_pretrain.py][INFO] Epoch:[0/2](264000/4588595) loss:2.831 lr:0.0000100 epoch_Time:27345.0min: [2024-01-03 21:29:05,640][model8_pretrain.py][INFO] Epoch:[0/2](264000/4588595) loss:2.917 lr:0.0000100 epoch_Time:27345.0min: [2024-01-03 21:29:05,640][model8_pretrain.py][INFO] 
Epoch:[0/2](264000/4588595) loss:3.072 lr:0.0000100 epoch_Time:27345.0min: [2024-01-03 21:29:53,042][model8_pretrain.py][INFO] Epoch:[0/2](264100/4588595) loss:2.429 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:29:53,042][model8_pretrain.py][INFO] Epoch:[0/2](264100/4588595) loss:2.090 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:29:53,042][model8_pretrain.py][INFO] Epoch:[0/2](264100/4588595) loss:3.193 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:29:53,042][model8_pretrain.py][INFO] Epoch:[0/2](264100/4588595) loss:2.968 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:29:53,042][model8_pretrain.py][INFO] Epoch:[0/2](264100/4588595) loss:2.709 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:29:53,042][model8_pretrain.py][INFO] Epoch:[0/2](264100/4588595) loss:2.912 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:29:53,042][model8_pretrain.py][INFO] Epoch:[0/2](264100/4588595) loss:2.933 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:29:53,043][model8_pretrain.py][INFO] Epoch:[0/2](264100/4588595) loss:2.881 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:30:29,973][model8_pretrain.py][INFO] Epoch:[0/2](264200/4588595) loss:3.483 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:30:29,973][model8_pretrain.py][INFO] Epoch:[0/2](264200/4588595) loss:2.621 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:30:29,973][model8_pretrain.py][INFO] Epoch:[0/2](264200/4588595) loss:2.736 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:30:29,973][model8_pretrain.py][INFO] Epoch:[0/2](264200/4588595) loss:3.222 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:30:29,973][model8_pretrain.py][INFO] Epoch:[0/2](264200/4588595) loss:2.574 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:30:29,973][model8_pretrain.py][INFO] Epoch:[0/2](264200/4588595) loss:3.050 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:30:29,973][model8_pretrain.py][INFO] Epoch:[0/2](264200/4588595) loss:3.296 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 
21:30:29,974][model8_pretrain.py][INFO] Epoch:[0/2](264200/4588595) loss:2.897 lr:0.0000100 epoch_Time:27346.0min: [2024-01-03 21:31:06,909][model8_pretrain.py][INFO] Epoch:[0/2](264300/4588595) loss:2.803 lr:0.0000100 epoch_Time:27345.0min: [2024-01-03 21:31:06,909][model8_pretrain.py][INFO] Epoch:[0/2](264300/4588595) loss:2.786 lr:0.0000100 epoch_Time:27345.0min: [2024-01-03 21:31:06,909][model8_pretrain.py][INFO] Epoch:[0/2](264300/4588595) loss:2.762 lr:0.0000100 epoch_Time:27345.0min: [2024-01-03 21:31:06,909][model8_pretrain.py][INFO] Epoch:[0/2](264300/4588595) loss:3.172 lr:0.0000100 epoch_Time:27345.0min: [2024-01-03 21:31:06,909][model8_pretrain.py][INFO] Epoch:[0/2](264300/4588595) loss:3.081 lr:0.0000100 epoch_Time:27345.0min: [2024-01-03 21:31:06,909][model8_pretrain.py][INFO] Epoch:[0/2](264300/4588595) loss:3.161 lr:0.0000100 epoch_Time:27345.0min: [2024-01-03 21:31:06,909][model8_pretrain.py][INFO] Epoch:[0/2](264300/4588595) loss:2.837 lr:0.0000100 epoch_Time:27345.0min: [2024-01-03 21:31:06,909][model8_pretrain.py][INFO] Epoch:[0/2](264300/4588595) loss:2.854 lr:0.0000100 epoch_Time:27345.0min: [2024-01-03 21:31:43,842][model8_pretrain.py][INFO] Epoch:[0/2](264400/4588595) loss:2.286 lr:0.0000100 epoch_Time:27344.0min: [2024-01-03 21:31:43,842][model8_pretrain.py][INFO] Epoch:[0/2](264400/4588595) loss:3.017 lr:0.0000100 epoch_Time:27344.0min: [2024-01-03 21:31:43,842][model8_pretrain.py][INFO] Epoch:[0/2](264400/4588595) loss:3.300 lr:0.0000100 epoch_Time:27344.0min: [2024-01-03 21:31:43,843][model8_pretrain.py][INFO] Epoch:[0/2](264400/4588595) loss:3.110 lr:0.0000100 epoch_Time:27344.0min: [2024-01-03 21:31:43,843][model8_pretrain.py][INFO] Epoch:[0/2](264400/4588595) loss:3.070 lr:0.0000100 epoch_Time:27344.0min: [2024-01-03 21:31:43,843][model8_pretrain.py][INFO] Epoch:[0/2](264400/4588595) loss:2.590 lr:0.0000100 epoch_Time:27344.0min: [2024-01-03 21:31:43,843][model8_pretrain.py][INFO] Epoch:[0/2](264400/4588595) loss:3.084 lr:0.0000100 
epoch_Time:27344.0min: [2024-01-03 21:31:43,843][model8_pretrain.py][INFO] Epoch:[0/2](264400/4588595) loss:3.200 lr:0.0000100 epoch_Time:27344.0min: [2024-01-03 21:32:20,782][model8_pretrain.py][INFO] Epoch:[0/2](264500/4588595) loss:2.561 lr:0.0000100 epoch_Time:27343.0min: [2024-01-03 21:32:20,782][model8_pretrain.py][INFO] Epoch:[0/2](264500/4588595) loss:3.118 lr:0.0000100 epoch_Time:27343.0min: [2024-01-03 21:32:20,782][model8_pretrain.py][INFO] Epoch:[0/2](264500/4588595) loss:2.861 lr:0.0000100 epoch_Time:27343.0min: [2024-01-03 21:32:20,782][model8_pretrain.py][INFO] Epoch:[0/2](264500/4588595) loss:2.890 lr:0.0000100 epoch_Time:27343.0min: [2024-01-03 21:32:20,782][model8_pretrain.py][INFO] Epoch:[0/2](264500/4588595) loss:2.832 lr:0.0000100 epoch_Time:27343.0min: [2024-01-03 21:32:20,782][model8_pretrain.py][INFO] Epoch:[0/2](264500/4588595) loss:3.132 lr:0.0000100 epoch_Time:27343.0min: [2024-01-03 21:32:20,782][model8_pretrain.py][INFO] Epoch:[0/2](264500/4588595) loss:3.341 lr:0.0000100 epoch_Time:27343.0min: [2024-01-03 21:32:20,782][model8_pretrain.py][INFO] Epoch:[0/2](264500/4588595) loss:2.957 lr:0.0000100 epoch_Time:27343.0min: [2024-01-03 21:32:57,707][model8_pretrain.py][INFO] Epoch:[0/2](264600/4588595) loss:2.668 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:32:57,707][model8_pretrain.py][INFO] Epoch:[0/2](264600/4588595) loss:3.437 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:32:57,707][model8_pretrain.py][INFO] Epoch:[0/2](264600/4588595) loss:3.321 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:32:57,707][model8_pretrain.py][INFO] Epoch:[0/2](264600/4588595) loss:3.062 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:32:57,707][model8_pretrain.py][INFO] Epoch:[0/2](264600/4588595) loss:3.147 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:32:57,707][model8_pretrain.py][INFO] Epoch:[0/2](264600/4588595) loss:3.377 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:32:57,707][model8_pretrain.py][INFO] 
Epoch:[0/2](264600/4588595) loss:3.324 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:32:57,707][model8_pretrain.py][INFO] Epoch:[0/2](264600/4588595) loss:3.168 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:33:34,637][model8_pretrain.py][INFO] Epoch:[0/2](264700/4588595) loss:3.322 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:33:34,637][model8_pretrain.py][INFO] Epoch:[0/2](264700/4588595) loss:2.166 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:33:34,637][model8_pretrain.py][INFO] Epoch:[0/2](264700/4588595) loss:2.959 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:33:34,637][model8_pretrain.py][INFO] Epoch:[0/2](264700/4588595) loss:3.274 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:33:34,637][model8_pretrain.py][INFO] Epoch:[0/2](264700/4588595) loss:3.269 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:33:34,637][model8_pretrain.py][INFO] Epoch:[0/2](264700/4588595) loss:3.070 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:33:34,637][model8_pretrain.py][INFO] Epoch:[0/2](264700/4588595) loss:2.387 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:33:34,637][model8_pretrain.py][INFO] Epoch:[0/2](264700/4588595) loss:3.070 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:34:11,570][model8_pretrain.py][INFO] Epoch:[0/2](264800/4588595) loss:3.343 lr:0.0000100 epoch_Time:27340.0min: [2024-01-03 21:34:11,570][model8_pretrain.py][INFO] Epoch:[0/2](264800/4588595) loss:2.849 lr:0.0000100 epoch_Time:27340.0min: [2024-01-03 21:34:11,570][model8_pretrain.py][INFO] Epoch:[0/2](264800/4588595) loss:2.625 lr:0.0000100 epoch_Time:27340.0min: [2024-01-03 21:34:11,570][model8_pretrain.py][INFO] Epoch:[0/2](264800/4588595) loss:3.084 lr:0.0000100 epoch_Time:27340.0min: [2024-01-03 21:34:11,570][model8_pretrain.py][INFO] Epoch:[0/2](264800/4588595) loss:2.875 lr:0.0000100 epoch_Time:27340.0min: [2024-01-03 21:34:11,570][model8_pretrain.py][INFO] Epoch:[0/2](264800/4588595) loss:3.232 lr:0.0000100 epoch_Time:27340.0min: [2024-01-03 
21:34:11,570][model8_pretrain.py][INFO] Epoch:[0/2](264800/4588595) loss:2.297 lr:0.0000100 epoch_Time:27340.0min: [2024-01-03 21:34:11,570][model8_pretrain.py][INFO] Epoch:[0/2](264800/4588595) loss:3.096 lr:0.0000100 epoch_Time:27340.0min: [2024-01-03 21:34:58,683][model8_pretrain.py][INFO] Epoch:[0/2](264900/4588595) loss:3.137 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:34:58,683][model8_pretrain.py][INFO] Epoch:[0/2](264900/4588595) loss:3.325 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:34:58,683][model8_pretrain.py][INFO] Epoch:[0/2](264900/4588595) loss:2.394 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:34:58,683][model8_pretrain.py][INFO] Epoch:[0/2](264900/4588595) loss:2.254 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:34:58,683][model8_pretrain.py][INFO] Epoch:[0/2](264900/4588595) loss:2.898 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:34:58,683][model8_pretrain.py][INFO] Epoch:[0/2](264900/4588595) loss:3.149 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:34:58,683][model8_pretrain.py][INFO] Epoch:[0/2](264900/4588595) loss:2.978 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:34:58,683][model8_pretrain.py][INFO] Epoch:[0/2](264900/4588595) loss:2.898 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:35:35,614][model8_pretrain.py][INFO] Epoch:[0/2](265000/4588595) loss:3.023 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:35:35,614][model8_pretrain.py][INFO] Epoch:[0/2](265000/4588595) loss:2.349 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:35:35,614][model8_pretrain.py][INFO] Epoch:[0/2](265000/4588595) loss:2.894 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:35:35,614][model8_pretrain.py][INFO] Epoch:[0/2](265000/4588595) loss:2.277 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:35:35,614][model8_pretrain.py][INFO] Epoch:[0/2](265000/4588595) loss:3.035 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:35:35,614][model8_pretrain.py][INFO] Epoch:[0/2](265000/4588595) loss:3.197 lr:0.0000100 
epoch_Time:27342.0min: [2024-01-03 21:35:35,614][model8_pretrain.py][INFO] Epoch:[0/2](265000/4588595) loss:3.079 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:35:35,615][model8_pretrain.py][INFO] Epoch:[0/2](265000/4588595) loss:2.895 lr:0.0000100 epoch_Time:27342.0min: [2024-01-03 21:36:12,561][model8_pretrain.py][INFO] Epoch:[0/2](265100/4588595) loss:2.655 lr:0.0000100 epoch_Time:27340.0min: [2024-01-03 21:36:12,561][model8_pretrain.py][INFO] Epoch:[0/2](265100/4588595) loss:2.620 lr:0.0000100 epoch_Time:27340.0min: [2024-01-03 21:36:12,561][model8_pretrain.py][INFO] Epoch:[0/2](265100/4588595) loss:3.345 lr:0.0000100 epoch_Time:27340.0min: [2024-01-03 21:36:12,561][model8_pretrain.py][INFO] Epoch:[0/2](265100/4588595) loss:2.494 lr:0.0000100 epoch_Time:27340.0min: [2024-01-03 21:36:12,561][model8_pretrain.py][INFO] Epoch:[0/2](265100/4588595) loss:3.026 lr:0.0000100 epoch_Time:27340.0min: [2024-01-03 21:36:12,561][model8_pretrain.py][INFO] Epoch:[0/2](265100/4588595) loss:3.270 lr:0.0000100 epoch_Time:27340.0min: [2024-01-03 21:36:12,561][model8_pretrain.py][INFO] Epoch:[0/2](265100/4588595) loss:3.167 lr:0.0000100 epoch_Time:27340.0min: [2024-01-03 21:36:12,561][model8_pretrain.py][INFO] Epoch:[0/2](265100/4588595) loss:3.081 lr:0.0000100 epoch_Time:27340.0min: [2024-01-03 21:36:49,501][model8_pretrain.py][INFO] Epoch:[0/2](265200/4588595) loss:2.203 lr:0.0000100 epoch_Time:27339.0min: [2024-01-03 21:36:49,501][model8_pretrain.py][INFO] Epoch:[0/2](265200/4588595) loss:2.702 lr:0.0000100 epoch_Time:27339.0min: [2024-01-03 21:36:49,501][model8_pretrain.py][INFO] Epoch:[0/2](265200/4588595) loss:3.427 lr:0.0000100 epoch_Time:27339.0min: [2024-01-03 21:36:49,501][model8_pretrain.py][INFO] Epoch:[0/2](265200/4588595) loss:2.630 lr:0.0000100 epoch_Time:27339.0min: [2024-01-03 21:36:49,501][model8_pretrain.py][INFO] Epoch:[0/2](265200/4588595) loss:2.564 lr:0.0000100 epoch_Time:27339.0min: [2024-01-03 21:36:49,501][model8_pretrain.py][INFO] 
Epoch:[0/2](265200/4588595) loss:2.863 lr:0.0000100 epoch_Time:27339.0min: [2024-01-03 21:36:49,501][model8_pretrain.py][INFO] Epoch:[0/2](265200/4588595) loss:2.810 lr:0.0000100 epoch_Time:27339.0min: [2024-01-03 21:36:49,501][model8_pretrain.py][INFO] Epoch:[0/2](265200/4588595) loss:2.963 lr:0.0000100 epoch_Time:27339.0min: [2024-01-03 21:37:26,439][model8_pretrain.py][INFO] Epoch:[0/2](265300/4588595) loss:2.140 lr:0.0000100 epoch_Time:27339.0min: [2024-01-03 21:37:26,439][model8_pretrain.py][INFO] Epoch:[0/2](265300/4588595) loss:3.167 lr:0.0000100 epoch_Time:27339.0min: [2024-01-03 21:37:26,439][model8_pretrain.py][INFO] Epoch:[0/2](265300/4588595) loss:3.139 lr:0.0000100 epoch_Time:27339.0min: [2024-01-03 21:37:26,439][model8_pretrain.py][INFO] Epoch:[0/2](265300/4588595) loss:2.914 lr:0.0000100 epoch_Time:27339.0min: [2024-01-03 21:37:26,440][model8_pretrain.py][INFO] Epoch:[0/2](265300/4588595) loss:2.873 lr:0.0000100 epoch_Time:27339.0min: [2024-01-03 21:37:26,440][model8_pretrain.py][INFO] Epoch:[0/2](265300/4588595) loss:2.909 lr:0.0000100 epoch_Time:27339.0min: [2024-01-03 21:37:26,440][model8_pretrain.py][INFO] Epoch:[0/2](265300/4588595) loss:2.918 lr:0.0000100 epoch_Time:27339.0min: [2024-01-03 21:37:26,440][model8_pretrain.py][INFO] Epoch:[0/2](265300/4588595) loss:3.031 lr:0.0000100 epoch_Time:27339.0min: [2024-01-03 21:38:03,383][model8_pretrain.py][INFO] Epoch:[0/2](265400/4588595) loss:3.040 lr:0.0000100 epoch_Time:27337.0min: [2024-01-03 21:38:03,383][model8_pretrain.py][INFO] Epoch:[0/2](265400/4588595) loss:3.249 lr:0.0000100 epoch_Time:27337.0min: [2024-01-03 21:38:03,383][model8_pretrain.py][INFO] Epoch:[0/2](265400/4588595) loss:2.907 lr:0.0000100 epoch_Time:27337.0min: [2024-01-03 21:38:03,383][model8_pretrain.py][INFO] Epoch:[0/2](265400/4588595) loss:3.462 lr:0.0000100 epoch_Time:27337.0min: [2024-01-03 21:38:03,383][model8_pretrain.py][INFO] Epoch:[0/2](265400/4588595) loss:2.398 lr:0.0000100 epoch_Time:27337.0min: [2024-01-03 
21:38:03,383][model8_pretrain.py][INFO] Epoch:[0/2](265400/4588595) loss:2.551 lr:0.0000100 epoch_Time:27337.0min: [2024-01-03 21:38:03,383][model8_pretrain.py][INFO] Epoch:[0/2](265400/4588595) loss:2.987 lr:0.0000100 epoch_Time:27337.0min: [2024-01-03 21:38:03,383][model8_pretrain.py][INFO] Epoch:[0/2](265400/4588595) loss:3.250 lr:0.0000100 epoch_Time:27337.0min: [2024-01-03 21:38:40,330][model8_pretrain.py][INFO] Epoch:[0/2](265500/4588595) loss:3.007 lr:0.0000100 epoch_Time:27337.0min: [2024-01-03 21:38:40,330][model8_pretrain.py][INFO] Epoch:[0/2](265500/4588595) loss:2.817 lr:0.0000100 epoch_Time:27337.0min: [2024-01-03 21:38:40,330][model8_pretrain.py][INFO] Epoch:[0/2](265500/4588595) loss:3.387 lr:0.0000100 epoch_Time:27337.0min: [2024-01-03 21:38:40,330][model8_pretrain.py][INFO] Epoch:[0/2](265500/4588595) loss:3.027 lr:0.0000100 epoch_Time:27337.0min: [2024-01-03 21:38:40,330][model8_pretrain.py][INFO] Epoch:[0/2](265500/4588595) loss:3.155 lr:0.0000100 epoch_Time:27337.0min: [2024-01-03 21:38:40,330][model8_pretrain.py][INFO] Epoch:[0/2](265500/4588595) loss:2.863 lr:0.0000100 epoch_Time:27337.0min: [2024-01-03 21:38:40,330][model8_pretrain.py][INFO] Epoch:[0/2](265500/4588595) loss:2.751 lr:0.0000100 epoch_Time:27337.0min: [2024-01-03 21:38:40,331][model8_pretrain.py][INFO] Epoch:[0/2](265500/4588595) loss:2.864 lr:0.0000100 epoch_Time:27337.0min: [2024-01-03 21:39:17,250][model8_pretrain.py][INFO] Epoch:[0/2](265600/4588595) loss:2.932 lr:0.0000100 epoch_Time:27336.0min: [2024-01-03 21:39:17,250][model8_pretrain.py][INFO] Epoch:[0/2](265600/4588595) loss:2.818 lr:0.0000100 epoch_Time:27336.0min: [2024-01-03 21:39:17,250][model8_pretrain.py][INFO] Epoch:[0/2](265600/4588595) loss:3.087 lr:0.0000100 epoch_Time:27336.0min: [2024-01-03 21:39:17,250][model8_pretrain.py][INFO] Epoch:[0/2](265600/4588595) loss:3.100 lr:0.0000100 epoch_Time:27336.0min: [2024-01-03 21:39:17,250][model8_pretrain.py][INFO] Epoch:[0/2](265600/4588595) loss:2.605 lr:0.0000100 
epoch_Time:27336.0min: [2024-01-03 21:39:17,250][model8_pretrain.py][INFO] Epoch:[0/2](265600/4588595) loss:3.184 lr:0.0000100 epoch_Time:27336.0min: [2024-01-03 21:39:17,250][model8_pretrain.py][INFO] Epoch:[0/2](265600/4588595) loss:2.686 lr:0.0000100 epoch_Time:27336.0min: [2024-01-03 21:39:17,250][model8_pretrain.py][INFO] Epoch:[0/2](265600/4588595) loss:2.908 lr:0.0000100 epoch_Time:27336.0min: [2024-01-03 21:40:04,471][model8_pretrain.py][INFO] Epoch:[0/2](265700/4588595) loss:2.463 lr:0.0000100 epoch_Time:27338.0min: [2024-01-03 21:40:04,471][model8_pretrain.py][INFO] Epoch:[0/2](265700/4588595) loss:2.776 lr:0.0000100 epoch_Time:27338.0min: [2024-01-03 21:40:04,471][model8_pretrain.py][INFO] Epoch:[0/2](265700/4588595) loss:2.715 lr:0.0000100 epoch_Time:27338.0min: [2024-01-03 21:40:04,471][model8_pretrain.py][INFO] Epoch:[0/2](265700/4588595) loss:2.924 lr:0.0000100 epoch_Time:27338.0min: [2024-01-03 21:40:04,471][model8_pretrain.py][INFO] Epoch:[0/2](265700/4588595) loss:2.552 lr:0.0000100 epoch_Time:27338.0min: [2024-01-03 21:40:04,471][model8_pretrain.py][INFO] Epoch:[0/2](265700/4588595) loss:3.323 lr:0.0000100 epoch_Time:27338.0min: [2024-01-03 21:40:04,471][model8_pretrain.py][INFO] Epoch:[0/2](265700/4588595) loss:2.902 lr:0.0000100 epoch_Time:27338.0min: [2024-01-03 21:40:04,471][model8_pretrain.py][INFO] Epoch:[0/2](265700/4588595) loss:3.237 lr:0.0000100 epoch_Time:27338.0min: [2024-01-03 21:40:41,419][model8_pretrain.py][INFO] Epoch:[0/2](265800/4588595) loss:2.680 lr:0.0000100 epoch_Time:27337.0min: [2024-01-03 21:40:41,419][model8_pretrain.py][INFO] Epoch:[0/2](265800/4588595) loss:2.697 lr:0.0000100 epoch_Time:27337.0min: [2024-01-03 21:40:41,419][model8_pretrain.py][INFO] Epoch:[0/2](265800/4588595) loss:2.841 lr:0.0000100 epoch_Time:27337.0min: [2024-01-03 21:40:41,419][model8_pretrain.py][INFO] Epoch:[0/2](265800/4588595) loss:2.756 lr:0.0000100 epoch_Time:27337.0min: [2024-01-03 21:40:41,419][model8_pretrain.py][INFO] 
Epoch:[0/2](265800/4588595) loss:2.991 lr:0.0000100 epoch_Time:27337.0min: [2024-01-03 21:40:41,419][model8_pretrain.py][INFO] Epoch:[0/2](265800/4588595) loss:3.000 lr:0.0000100 epoch_Time:27337.0min: [2024-01-03 21:40:41,419][model8_pretrain.py][INFO] Epoch:[0/2](265800/4588595) loss:2.595 lr:0.0000100 epoch_Time:27337.0min: [2024-01-03 21:40:41,419][model8_pretrain.py][INFO] Epoch:[0/2](265800/4588595) loss:2.996 lr:0.0000100 epoch_Time:27337.0min: [2024-01-03 21:41:18,359][model8_pretrain.py][INFO] Epoch:[0/2](265900/4588595) loss:2.619 lr:0.0000100 epoch_Time:27336.0min: [2024-01-03 21:41:18,359][model8_pretrain.py][INFO] Epoch:[0/2](265900/4588595) loss:3.013 lr:0.0000100 epoch_Time:27336.0min: [2024-01-03 21:41:18,359][model8_pretrain.py][INFO] Epoch:[0/2](265900/4588595) loss:2.750 lr:0.0000100 epoch_Time:27336.0min: [2024-01-03 21:41:18,359][model8_pretrain.py][INFO] Epoch:[0/2](265900/4588595) loss:2.863 lr:0.0000100 epoch_Time:27336.0min: [2024-01-03 21:41:18,359][model8_pretrain.py][INFO] Epoch:[0/2](265900/4588595) loss:3.100 lr:0.0000100 epoch_Time:27336.0min: [2024-01-03 21:41:18,359][model8_pretrain.py][INFO] Epoch:[0/2](265900/4588595) loss:2.849 lr:0.0000100 epoch_Time:27336.0min: [2024-01-03 21:41:18,359][model8_pretrain.py][INFO] Epoch:[0/2](265900/4588595) loss:3.308 lr:0.0000100 epoch_Time:27336.0min: [2024-01-03 21:41:18,360][model8_pretrain.py][INFO] Epoch:[0/2](265900/4588595) loss:2.626 lr:0.0000100 epoch_Time:27336.0min: [2024-01-03 21:41:55,297][model8_pretrain.py][INFO] Epoch:[0/2](266000/4588595) loss:2.706 lr:0.0000100 epoch_Time:27335.0min: [2024-01-03 21:41:55,297][model8_pretrain.py][INFO] Epoch:[0/2](266000/4588595) loss:2.983 lr:0.0000100 epoch_Time:27335.0min: [2024-01-03 21:41:55,297][model8_pretrain.py][INFO] Epoch:[0/2](266000/4588595) loss:3.115 lr:0.0000100 epoch_Time:27335.0min: [2024-01-03 21:41:55,297][model8_pretrain.py][INFO] Epoch:[0/2](266000/4588595) loss:2.836 lr:0.0000100 epoch_Time:27335.0min: [2024-01-03 
21:41:55,297][model8_pretrain.py][INFO] Epoch:[0/2](266000/4588595) loss:2.435 lr:0.0000100 epoch_Time:27335.0min: [2024-01-03 21:41:55,297][model8_pretrain.py][INFO] Epoch:[0/2](266000/4588595) loss:2.703 lr:0.0000100 epoch_Time:27335.0min: [2024-01-03 21:41:55,297][model8_pretrain.py][INFO] Epoch:[0/2](266000/4588595) loss:3.222 lr:0.0000100 epoch_Time:27335.0min: [2024-01-03 21:41:55,297][model8_pretrain.py][INFO] Epoch:[0/2](266000/4588595) loss:2.827 lr:0.0000100 epoch_Time:27335.0min: [2024-01-03 21:42:32,241][model8_pretrain.py][INFO] Epoch:[0/2](266100/4588595) loss:3.361 lr:0.0000100 epoch_Time:27334.0min: [2024-01-03 21:42:32,241][model8_pretrain.py][INFO] Epoch:[0/2](266100/4588595) loss:2.970 lr:0.0000100 epoch_Time:27334.0min: [2024-01-03 21:42:32,241][model8_pretrain.py][INFO] Epoch:[0/2](266100/4588595) loss:3.439 lr:0.0000100 epoch_Time:27334.0min: [2024-01-03 21:42:32,241][model8_pretrain.py][INFO] Epoch:[0/2](266100/4588595) loss:3.002 lr:0.0000100 epoch_Time:27334.0min: [2024-01-03 21:42:32,241][model8_pretrain.py][INFO] Epoch:[0/2](266100/4588595) loss:2.931 lr:0.0000100 epoch_Time:27334.0min: [2024-01-03 21:42:32,241][model8_pretrain.py][INFO] Epoch:[0/2](266100/4588595) loss:2.666 lr:0.0000100 epoch_Time:27334.0min: [2024-01-03 21:42:32,241][model8_pretrain.py][INFO] Epoch:[0/2](266100/4588595) loss:2.824 lr:0.0000100 epoch_Time:27334.0min: [2024-01-03 21:42:32,242][model8_pretrain.py][INFO] Epoch:[0/2](266100/4588595) loss:2.444 lr:0.0000100 epoch_Time:27334.0min: [2024-01-03 21:43:09,170][model8_pretrain.py][INFO] Epoch:[0/2](266200/4588595) loss:3.299 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:43:09,170][model8_pretrain.py][INFO] Epoch:[0/2](266200/4588595) loss:3.300 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:43:09,170][model8_pretrain.py][INFO] Epoch:[0/2](266200/4588595) loss:3.449 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:43:09,170][model8_pretrain.py][INFO] Epoch:[0/2](266200/4588595) loss:3.057 lr:0.0000100 
epoch_Time:27333.0min: [2024-01-03 21:43:09,170][model8_pretrain.py][INFO] Epoch:[0/2](266200/4588595) loss:2.477 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:43:09,170][model8_pretrain.py][INFO] Epoch:[0/2](266200/4588595) loss:3.475 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:43:09,170][model8_pretrain.py][INFO] Epoch:[0/2](266200/4588595) loss:3.101 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:43:09,170][model8_pretrain.py][INFO] Epoch:[0/2](266200/4588595) loss:3.000 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:43:46,100][model8_pretrain.py][INFO] Epoch:[0/2](266300/4588595) loss:2.685 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:43:46,100][model8_pretrain.py][INFO] Epoch:[0/2](266300/4588595) loss:2.629 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:43:46,100][model8_pretrain.py][INFO] Epoch:[0/2](266300/4588595) loss:3.373 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:43:46,100][model8_pretrain.py][INFO] Epoch:[0/2](266300/4588595) loss:2.827 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:43:46,100][model8_pretrain.py][INFO] Epoch:[0/2](266300/4588595) loss:3.162 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:43:46,100][model8_pretrain.py][INFO] Epoch:[0/2](266300/4588595) loss:2.620 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:43:46,100][model8_pretrain.py][INFO] Epoch:[0/2](266300/4588595) loss:3.037 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:43:46,100][model8_pretrain.py][INFO] Epoch:[0/2](266300/4588595) loss:3.177 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:44:23,026][model8_pretrain.py][INFO] Epoch:[0/2](266400/4588595) loss:3.159 lr:0.0000100 epoch_Time:27332.0min: [2024-01-03 21:44:23,026][model8_pretrain.py][INFO] Epoch:[0/2](266400/4588595) loss:3.160 lr:0.0000100 epoch_Time:27332.0min: [2024-01-03 21:44:23,026][model8_pretrain.py][INFO] Epoch:[0/2](266400/4588595) loss:2.903 lr:0.0000100 epoch_Time:27332.0min: [2024-01-03 21:44:23,026][model8_pretrain.py][INFO] 
Epoch:[0/2](266400/4588595) loss:2.741 lr:0.0000100 epoch_Time:27332.0min: [2024-01-03 21:44:23,026][model8_pretrain.py][INFO] Epoch:[0/2](266400/4588595) loss:2.660 lr:0.0000100 epoch_Time:27332.0min: [2024-01-03 21:44:23,027][model8_pretrain.py][INFO] Epoch:[0/2](266400/4588595) loss:2.400 lr:0.0000100 epoch_Time:27332.0min: [2024-01-03 21:44:23,027][model8_pretrain.py][INFO] Epoch:[0/2](266400/4588595) loss:2.856 lr:0.0000100 epoch_Time:27332.0min: [2024-01-03 21:44:23,027][model8_pretrain.py][INFO] Epoch:[0/2](266400/4588595) loss:3.045 lr:0.0000100 epoch_Time:27332.0min: [2024-01-03 21:45:09,859][model8_pretrain.py][INFO] Epoch:[0/2](266500/4588595) loss:2.591 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:45:09,859][model8_pretrain.py][INFO] Epoch:[0/2](266500/4588595) loss:2.310 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:45:09,859][model8_pretrain.py][INFO] Epoch:[0/2](266500/4588595) loss:2.985 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:45:09,859][model8_pretrain.py][INFO] Epoch:[0/2](266500/4588595) loss:2.351 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:45:09,859][model8_pretrain.py][INFO] Epoch:[0/2](266500/4588595) loss:2.968 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:45:09,859][model8_pretrain.py][INFO] Epoch:[0/2](266500/4588595) loss:2.848 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:45:09,859][model8_pretrain.py][INFO] Epoch:[0/2](266500/4588595) loss:2.911 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:45:09,859][model8_pretrain.py][INFO] Epoch:[0/2](266500/4588595) loss:2.156 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:45:46,781][model8_pretrain.py][INFO] Epoch:[0/2](266600/4588595) loss:2.955 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:45:46,781][model8_pretrain.py][INFO] Epoch:[0/2](266600/4588595) loss:3.235 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:45:46,781][model8_pretrain.py][INFO] Epoch:[0/2](266600/4588595) loss:2.523 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 
21:45:46,781][model8_pretrain.py][INFO] Epoch:[0/2](266600/4588595) loss:3.547 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:45:46,781][model8_pretrain.py][INFO] Epoch:[0/2](266600/4588595) loss:2.705 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:45:46,781][model8_pretrain.py][INFO] Epoch:[0/2](266600/4588595) loss:2.769 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:45:46,781][model8_pretrain.py][INFO] Epoch:[0/2](266600/4588595) loss:2.547 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:45:46,781][model8_pretrain.py][INFO] Epoch:[0/2](266600/4588595) loss:3.327 lr:0.0000100 epoch_Time:27333.0min: [2024-01-03 21:46:23,711][model8_pretrain.py][INFO] Epoch:[0/2](266700/4588595) loss:2.348 lr:0.0000100 epoch_Time:27332.0min: [2024-01-03 21:46:23,711][model8_pretrain.py][INFO] Epoch:[0/2](266700/4588595) loss:3.132 lr:0.0000100 epoch_Time:27332.0min: [2024-01-03 21:46:23,711][model8_pretrain.py][INFO] Epoch:[0/2](266700/4588595) loss:2.555 lr:0.0000100 epoch_Time:27332.0min: [2024-01-03 21:46:23,711][model8_pretrain.py][INFO] Epoch:[0/2](266700/4588595) loss:2.738 lr:0.0000100 epoch_Time:27332.0min: [2024-01-03 21:46:23,711][model8_pretrain.py][INFO] Epoch:[0/2](266700/4588595) loss:2.826 lr:0.0000100 epoch_Time:27332.0min: [2024-01-03 21:46:23,711][model8_pretrain.py][INFO] Epoch:[0/2](266700/4588595) loss:2.923 lr:0.0000100 epoch_Time:27332.0min: [2024-01-03 21:46:23,711][model8_pretrain.py][INFO] Epoch:[0/2](266700/4588595) loss:3.222 lr:0.0000100 epoch_Time:27332.0min: [2024-01-03 21:46:23,711][model8_pretrain.py][INFO] Epoch:[0/2](266700/4588595) loss:2.780 lr:0.0000100 epoch_Time:27332.0min: [2024-01-03 21:47:00,643][model8_pretrain.py][INFO] Epoch:[0/2](266800/4588595) loss:3.295 lr:0.0000100 epoch_Time:27330.0min: [2024-01-03 21:47:00,643][model8_pretrain.py][INFO] Epoch:[0/2](266800/4588595) loss:3.113 lr:0.0000100 epoch_Time:27330.0min: [2024-01-03 21:47:00,643][model8_pretrain.py][INFO] Epoch:[0/2](266800/4588595) loss:2.546 lr:0.0000100 
epoch_Time:27330.0min: [2024-01-03 21:47:00,643][model8_pretrain.py][INFO] Epoch:[0/2](266800/4588595) loss:3.119 lr:0.0000100 epoch_Time:27330.0min: [2024-01-03 21:47:00,643][model8_pretrain.py][INFO] Epoch:[0/2](266800/4588595) loss:2.967 lr:0.0000100 epoch_Time:27330.0min: [2024-01-03 21:47:00,643][model8_pretrain.py][INFO] Epoch:[0/2](266800/4588595) loss:2.951 lr:0.0000100 epoch_Time:27330.0min: [2024-01-03 21:47:00,643][model8_pretrain.py][INFO] Epoch:[0/2](266800/4588595) loss:3.059 lr:0.0000100 epoch_Time:27330.0min: [2024-01-03 21:47:00,643][model8_pretrain.py][INFO] Epoch:[0/2](266800/4588595) loss:3.208 lr:0.0000100 epoch_Time:27330.0min: [2024-01-03 21:47:37,580][model8_pretrain.py][INFO] Epoch:[0/2](266900/4588595) loss:2.939 lr:0.0000100 epoch_Time:27330.0min: [2024-01-03 21:47:37,580][model8_pretrain.py][INFO] Epoch:[0/2](266900/4588595) loss:3.488 lr:0.0000100 epoch_Time:27330.0min: [2024-01-03 21:47:37,580][model8_pretrain.py][INFO] Epoch:[0/2](266900/4588595) loss:3.039 lr:0.0000100 epoch_Time:27330.0min: [2024-01-03 21:47:37,580][model8_pretrain.py][INFO] Epoch:[0/2](266900/4588595) loss:2.880 lr:0.0000100 epoch_Time:27330.0min: [2024-01-03 21:47:37,580][model8_pretrain.py][INFO] Epoch:[0/2](266900/4588595) loss:3.034 lr:0.0000100 epoch_Time:27330.0min: [2024-01-03 21:47:37,580][model8_pretrain.py][INFO] Epoch:[0/2](266900/4588595) loss:2.623 lr:0.0000100 epoch_Time:27330.0min: [2024-01-03 21:47:37,580][model8_pretrain.py][INFO] Epoch:[0/2](266900/4588595) loss:3.160 lr:0.0000100 epoch_Time:27330.0min: [2024-01-03 21:47:37,580][model8_pretrain.py][INFO] Epoch:[0/2](266900/4588595) loss:2.793 lr:0.0000100 epoch_Time:27330.0min: [2024-01-03 21:48:14,516][model8_pretrain.py][INFO] Epoch:[0/2](267000/4588595) loss:2.930 lr:0.0000100 epoch_Time:27329.0min: [2024-01-03 21:48:14,516][model8_pretrain.py][INFO] Epoch:[0/2](267000/4588595) loss:3.095 lr:0.0000100 epoch_Time:27329.0min: [2024-01-03 21:48:14,516][model8_pretrain.py][INFO] 
Epoch:[0/2](267000/4588595) loss:3.090 lr:0.0000100 epoch_Time:27329.0min: [2024-01-03 21:48:14,516][model8_pretrain.py][INFO] Epoch:[0/2](267000/4588595) loss:2.597 lr:0.0000100 epoch_Time:27329.0min: [2024-01-03 21:48:14,516][model8_pretrain.py][INFO] Epoch:[0/2](267000/4588595) loss:3.508 lr:0.0000100 epoch_Time:27329.0min: [2024-01-03 21:48:14,516][model8_pretrain.py][INFO] Epoch:[0/2](267000/4588595) loss:2.965 lr:0.0000100 epoch_Time:27329.0min: [2024-01-03 21:48:14,516][model8_pretrain.py][INFO] Epoch:[0/2](267000/4588595) loss:3.189 lr:0.0000100 epoch_Time:27329.0min: [2024-01-03 21:48:14,516][model8_pretrain.py][INFO] Epoch:[0/2](267000/4588595) loss:3.085 lr:0.0000100 epoch_Time:27329.0min: [2024-01-03 21:48:51,453][model8_pretrain.py][INFO] Epoch:[0/2](267100/4588595) loss:2.786 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:48:51,453][model8_pretrain.py][INFO] Epoch:[0/2](267100/4588595) loss:3.036 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:48:51,453][model8_pretrain.py][INFO] Epoch:[0/2](267100/4588595) loss:2.640 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:48:51,453][model8_pretrain.py][INFO] Epoch:[0/2](267100/4588595) loss:2.719 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:48:51,453][model8_pretrain.py][INFO] Epoch:[0/2](267100/4588595) loss:2.771 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:48:51,453][model8_pretrain.py][INFO] Epoch:[0/2](267100/4588595) loss:2.925 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:48:51,453][model8_pretrain.py][INFO] Epoch:[0/2](267100/4588595) loss:3.059 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:48:51,453][model8_pretrain.py][INFO] Epoch:[0/2](267100/4588595) loss:2.556 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:49:28,384][model8_pretrain.py][INFO] Epoch:[0/2](267200/4588595) loss:2.968 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:49:28,384][model8_pretrain.py][INFO] Epoch:[0/2](267200/4588595) loss:3.015 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 
21:49:28,384][model8_pretrain.py][INFO] Epoch:[0/2](267200/4588595) loss:3.271 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:49:28,384][model8_pretrain.py][INFO] Epoch:[0/2](267200/4588595) loss:2.771 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:49:28,384][model8_pretrain.py][INFO] Epoch:[0/2](267200/4588595) loss:2.514 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:49:28,384][model8_pretrain.py][INFO] Epoch:[0/2](267200/4588595) loss:3.122 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:49:28,384][model8_pretrain.py][INFO] Epoch:[0/2](267200/4588595) loss:2.410 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:49:28,384][model8_pretrain.py][INFO] Epoch:[0/2](267200/4588595) loss:2.992 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:50:13,308][model8_pretrain.py][INFO] Epoch:[0/2](267300/4588595) loss:3.141 lr:0.0000100 epoch_Time:27328.0min: [2024-01-03 21:50:13,308][model8_pretrain.py][INFO] Epoch:[0/2](267300/4588595) loss:2.687 lr:0.0000100 epoch_Time:27328.0min: [2024-01-03 21:50:13,308][model8_pretrain.py][INFO] Epoch:[0/2](267300/4588595) loss:3.069 lr:0.0000100 epoch_Time:27328.0min: [2024-01-03 21:50:13,308][model8_pretrain.py][INFO] Epoch:[0/2](267300/4588595) loss:2.924 lr:0.0000100 epoch_Time:27328.0min: [2024-01-03 21:50:13,308][model8_pretrain.py][INFO] Epoch:[0/2](267300/4588595) loss:3.111 lr:0.0000100 epoch_Time:27328.0min: [2024-01-03 21:50:13,308][model8_pretrain.py][INFO] Epoch:[0/2](267300/4588595) loss:2.991 lr:0.0000100 epoch_Time:27328.0min: [2024-01-03 21:50:13,309][model8_pretrain.py][INFO] Epoch:[0/2](267300/4588595) loss:2.776 lr:0.0000100 epoch_Time:27328.0min: [2024-01-03 21:50:14,998][model8_pretrain.py][INFO] Epoch:[0/2](267300/4588595) loss:2.813 lr:0.0000100 epoch_Time:27329.0min: [2024-01-03 21:50:51,918][model8_pretrain.py][INFO] Epoch:[0/2](267400/4588595) loss:3.128 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:50:51,918][model8_pretrain.py][INFO] Epoch:[0/2](267400/4588595) loss:2.879 lr:0.0000100 
epoch_Time:27327.0min: [2024-01-03 21:50:51,918][model8_pretrain.py][INFO] Epoch:[0/2](267400/4588595) loss:3.202 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:50:51,918][model8_pretrain.py][INFO] Epoch:[0/2](267400/4588595) loss:3.350 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:50:51,918][model8_pretrain.py][INFO] Epoch:[0/2](267400/4588595) loss:2.745 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:50:51,918][model8_pretrain.py][INFO] Epoch:[0/2](267400/4588595) loss:3.049 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:50:51,918][model8_pretrain.py][INFO] Epoch:[0/2](267400/4588595) loss:2.978 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:50:51,918][model8_pretrain.py][INFO] Epoch:[0/2](267400/4588595) loss:2.866 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:51:28,854][model8_pretrain.py][INFO] Epoch:[0/2](267500/4588595) loss:3.029 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:51:28,854][model8_pretrain.py][INFO] Epoch:[0/2](267500/4588595) loss:2.555 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:51:28,854][model8_pretrain.py][INFO] Epoch:[0/2](267500/4588595) loss:2.910 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:51:28,854][model8_pretrain.py][INFO] Epoch:[0/2](267500/4588595) loss:2.999 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:51:28,854][model8_pretrain.py][INFO] Epoch:[0/2](267500/4588595) loss:2.663 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:51:28,854][model8_pretrain.py][INFO] Epoch:[0/2](267500/4588595) loss:2.532 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:51:28,854][model8_pretrain.py][INFO] Epoch:[0/2](267500/4588595) loss:2.867 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:51:28,854][model8_pretrain.py][INFO] Epoch:[0/2](267500/4588595) loss:3.279 lr:0.0000100 epoch_Time:27327.0min: [2024-01-03 21:52:05,779][model8_pretrain.py][INFO] Epoch:[0/2](267600/4588595) loss:2.856 lr:0.0000100 epoch_Time:27326.0min: [2024-01-03 21:52:05,779][model8_pretrain.py][INFO] 
Epoch:[0/2](267600/4588595) loss:3.052 lr:0.0000100 epoch_Time:27326.0min: [2024-01-03 21:52:05,779][model8_pretrain.py][INFO] Epoch:[0/2](267600/4588595) loss:2.642 lr:0.0000100 epoch_Time:27326.0min: [2024-01-03 21:52:05,779][model8_pretrain.py][INFO] Epoch:[0/2](267600/4588595) loss:2.523 lr:0.0000100 epoch_Time:27326.0min: [2024-01-03 21:52:05,779][model8_pretrain.py][INFO] Epoch:[0/2](267600/4588595) loss:2.720 lr:0.0000100 epoch_Time:27326.0min: [2024-01-03 21:52:05,779][model8_pretrain.py][INFO] Epoch:[0/2](267600/4588595) loss:3.306 lr:0.0000100 epoch_Time:27326.0min: [2024-01-03 21:52:05,779][model8_pretrain.py][INFO] Epoch:[0/2](267600/4588595) loss:3.129 lr:0.0000100 epoch_Time:27326.0min: [2024-01-03 21:52:05,779][model8_pretrain.py][INFO] Epoch:[0/2](267600/4588595) loss:2.986 lr:0.0000100 epoch_Time:27326.0min: [2024-01-03 21:52:42,721][model8_pretrain.py][INFO] Epoch:[0/2](267700/4588595) loss:2.625 lr:0.0000100 epoch_Time:27325.0min: [2024-01-03 21:52:42,721][model8_pretrain.py][INFO] Epoch:[0/2](267700/4588595) loss:3.513 lr:0.0000100 epoch_Time:27325.0min: [2024-01-03 21:52:42,721][model8_pretrain.py][INFO] Epoch:[0/2](267700/4588595) loss:3.032 lr:0.0000100 epoch_Time:27325.0min: [2024-01-03 21:52:42,721][model8_pretrain.py][INFO] Epoch:[0/2](267700/4588595) loss:2.538 lr:0.0000100 epoch_Time:27325.0min: [2024-01-03 21:52:42,721][model8_pretrain.py][INFO] Epoch:[0/2](267700/4588595) loss:2.419 lr:0.0000100 epoch_Time:27325.0min: [2024-01-03 21:52:42,721][model8_pretrain.py][INFO] Epoch:[0/2](267700/4588595) loss:3.164 lr:0.0000100 epoch_Time:27325.0min: [2024-01-03 21:52:42,721][model8_pretrain.py][INFO] Epoch:[0/2](267700/4588595) loss:3.167 lr:0.0000100 epoch_Time:27325.0min: [2024-01-03 21:52:42,721][model8_pretrain.py][INFO] Epoch:[0/2](267700/4588595) loss:3.018 lr:0.0000100 epoch_Time:27325.0min: [2024-01-03 21:53:19,646][model8_pretrain.py][INFO] Epoch:[0/2](267800/4588595) loss:2.279 lr:0.0000100 epoch_Time:27324.0min: [2024-01-03 
21:53:19,646][model8_pretrain.py][INFO] Epoch:[0/2](267800/4588595) loss:2.664 lr:0.0000100 epoch_Time:27324.0min: [2024-01-03 21:53:19,646][model8_pretrain.py][INFO] Epoch:[0/2](267800/4588595) loss:2.427 lr:0.0000100 epoch_Time:27324.0min: [2024-01-03 21:53:19,646][model8_pretrain.py][INFO] Epoch:[0/2](267800/4588595) loss:2.777 lr:0.0000100 epoch_Time:27324.0min: [2024-01-03 21:53:19,646][model8_pretrain.py][INFO] Epoch:[0/2](267800/4588595) loss:3.142 lr:0.0000100 epoch_Time:27324.0min: [2024-01-03 21:53:19,646][model8_pretrain.py][INFO] Epoch:[0/2](267800/4588595) loss:3.366 lr:0.0000100 epoch_Time:27324.0min: [2024-01-03 21:53:19,646][model8_pretrain.py][INFO] Epoch:[0/2](267800/4588595) loss:3.198 lr:0.0000100 epoch_Time:27324.0min: [2024-01-03 21:53:19,646][model8_pretrain.py][INFO] Epoch:[0/2](267800/4588595) loss:2.927 lr:0.0000100 epoch_Time:27324.0min: [2024-01-03 21:53:56,572][model8_pretrain.py][INFO] Epoch:[0/2](267900/4588595) loss:2.641 lr:0.0000100 epoch_Time:27323.0min: [2024-01-03 21:53:56,572][model8_pretrain.py][INFO] Epoch:[0/2](267900/4588595) loss:2.512 lr:0.0000100 epoch_Time:27323.0min: [2024-01-03 21:53:56,572][model8_pretrain.py][INFO] Epoch:[0/2](267900/4588595) loss:3.080 lr:0.0000100 epoch_Time:27323.0min: [2024-01-03 21:53:56,572][model8_pretrain.py][INFO] Epoch:[0/2](267900/4588595) loss:3.255 lr:0.0000100 epoch_Time:27323.0min: [2024-01-03 21:53:56,572][model8_pretrain.py][INFO] Epoch:[0/2](267900/4588595) loss:2.681 lr:0.0000100 epoch_Time:27323.0min: [2024-01-03 21:53:56,572][model8_pretrain.py][INFO] Epoch:[0/2](267900/4588595) loss:2.939 lr:0.0000100 epoch_Time:27323.0min: [2024-01-03 21:53:56,572][model8_pretrain.py][INFO] Epoch:[0/2](267900/4588595) loss:2.687 lr:0.0000100 epoch_Time:27323.0min: [2024-01-03 21:53:56,572][model8_pretrain.py][INFO] Epoch:[0/2](267900/4588595) loss:2.811 lr:0.0000100 epoch_Time:27323.0min: [2024-01-03 21:54:33,498][model8_pretrain.py][INFO] Epoch:[0/2](268000/4588595) loss:2.669 lr:0.0000100 
epoch_Time:27323.0min: [2024-01-03 21:54:33,498][model8_pretrain.py][INFO] Epoch:[0/2](268000/4588595) loss:2.649 lr:0.0000100 epoch_Time:27323.0min: [2024-01-03 21:54:33,498][model8_pretrain.py][INFO] Epoch:[0/2](268000/4588595) loss:2.594 lr:0.0000100 epoch_Time:27323.0min: [2024-01-03 21:54:33,498][model8_pretrain.py][INFO] Epoch:[0/2](268000/4588595) loss:2.848 lr:0.0000100 epoch_Time:27323.0min: [2024-01-03 21:54:33,498][model8_pretrain.py][INFO] Epoch:[0/2](268000/4588595) loss:2.580 lr:0.0000100 epoch_Time:27323.0min: [2024-01-03 21:54:33,498][model8_pretrain.py][INFO] Epoch:[0/2](268000/4588595) loss:3.199 lr:0.0000100 epoch_Time:27323.0min: [2024-01-03 21:54:33,498][model8_pretrain.py][INFO] Epoch:[0/2](268000/4588595) loss:2.918 lr:0.0000100 epoch_Time:27323.0min: [2024-01-03 21:54:33,499][model8_pretrain.py][INFO] Epoch:[0/2](268000/4588595) loss:2.782 lr:0.0000100 epoch_Time:27323.0min: [2024-01-03 21:55:18,454][model8_pretrain.py][INFO] Epoch:[0/2](268100/4588595) loss:3.162 lr:0.0000100 epoch_Time:27324.0min: [2024-01-03 21:55:18,454][model8_pretrain.py][INFO] Epoch:[0/2](268100/4588595) loss:2.554 lr:0.0000100 epoch_Time:27324.0min: [2024-01-03 21:55:18,454][model8_pretrain.py][INFO] Epoch:[0/2](268100/4588595) loss:2.715 lr:0.0000100 epoch_Time:27324.0min: [2024-01-03 21:55:18,454][model8_pretrain.py][INFO] Epoch:[0/2](268100/4588595) loss:3.276 lr:0.0000100 epoch_Time:27324.0min: [2024-01-03 21:55:18,454][model8_pretrain.py][INFO] Epoch:[0/2](268100/4588595) loss:2.721 lr:0.0000100 epoch_Time:27324.0min: [2024-01-03 21:55:18,454][model8_pretrain.py][INFO] Epoch:[0/2](268100/4588595) loss:2.480 lr:0.0000100 epoch_Time:27324.0min: [2024-01-03 21:55:18,454][model8_pretrain.py][INFO] Epoch:[0/2](268100/4588595) loss:3.236 lr:0.0000100 epoch_Time:27324.0min: [2024-01-03 21:55:18,459][model8_pretrain.py][INFO] Epoch:[0/2](268100/4588595) loss:3.350 lr:0.0000100 epoch_Time:27324.0min: [2024-01-03 21:55:57,069][model8_pretrain.py][INFO] 
Epoch:[0/2](268200/4588595) loss:2.846 lr:0.0000100 epoch_Time:27323.0min: [2024-01-03 21:55:57,069][model8_pretrain.py][INFO] Epoch:[0/2](268200/4588595) loss:2.719 lr:0.0000100 epoch_Time:27323.0min: [2024-01-03 21:55:57,069][model8_pretrain.py][INFO] Epoch:[0/2](268200/4588595) loss:2.772 lr:0.0000100 epoch_Time:27323.0min: [2024-01-03 21:55:57,069][model8_pretrain.py][INFO] Epoch:[0/2](268200/4588595) loss:2.475 lr:0.0000100 epoch_Time:27323.0min: [2024-01-03 21:55:57,069][model8_pretrain.py][INFO] Epoch:[0/2](268200/4588595) loss:2.684 lr:0.0000100 epoch_Time:27323.0min: [2024-01-03 21:55:57,069][model8_pretrain.py][INFO] Epoch:[0/2](268200/4588595) loss:2.860 lr:0.0000100 epoch_Time:27323.0min: [2024-01-03 21:55:57,069][model8_pretrain.py][INFO] Epoch:[0/2](268200/4588595) loss:2.869 lr:0.0000100 epoch_Time:27323.0min: [2024-01-03 21:55:57,069][model8_pretrain.py][INFO] Epoch:[0/2](268200/4588595) loss:2.278 lr:0.0000100 epoch_Time:27323.0min: [2024-01-03 21:56:34,000][model8_pretrain.py][INFO] Epoch:[0/2](268300/4588595) loss:3.067 lr:0.0000100 epoch_Time:27322.0min: [2024-01-03 21:56:34,000][model8_pretrain.py][INFO] Epoch:[0/2](268300/4588595) loss:2.605 lr:0.0000100 epoch_Time:27322.0min: [2024-01-03 21:56:34,000][model8_pretrain.py][INFO] Epoch:[0/2](268300/4588595) loss:2.961 lr:0.0000100 epoch_Time:27322.0min: [2024-01-03 21:56:34,001][model8_pretrain.py][INFO] Epoch:[0/2](268300/4588595) loss:2.795 lr:0.0000100 epoch_Time:27322.0min: [2024-01-03 21:56:34,001][model8_pretrain.py][INFO] Epoch:[0/2](268300/4588595) loss:3.191 lr:0.0000100 epoch_Time:27322.0min: [2024-01-03 21:56:34,000][model8_pretrain.py][INFO] Epoch:[0/2](268300/4588595) loss:3.118 lr:0.0000100 epoch_Time:27322.0min: [2024-01-03 21:56:34,001][model8_pretrain.py][INFO] Epoch:[0/2](268300/4588595) loss:3.674 lr:0.0000100 epoch_Time:27322.0min: [2024-01-03 21:56:34,001][model8_pretrain.py][INFO] Epoch:[0/2](268300/4588595) loss:2.771 lr:0.0000100 epoch_Time:27322.0min: [2024-01-03 
21:57:10,946][model8_pretrain.py][INFO] Epoch:[0/2](268400/4588595) loss:3.219 lr:0.0000100 epoch_Time:27321.0min: [2024-01-03 21:57:10,946][model8_pretrain.py][INFO] Epoch:[0/2](268400/4588595) loss:3.097 lr:0.0000100 epoch_Time:27321.0min: [2024-01-03 21:57:10,946][model8_pretrain.py][INFO] Epoch:[0/2](268400/4588595) loss:2.757 lr:0.0000100 epoch_Time:27321.0min: [2024-01-03 21:57:10,946][model8_pretrain.py][INFO] Epoch:[0/2](268400/4588595) loss:2.933 lr:0.0000100 epoch_Time:27321.0min: [2024-01-03 21:57:10,946][model8_pretrain.py][INFO] Epoch:[0/2](268400/4588595) loss:2.790 lr:0.0000100 epoch_Time:27321.0min: [2024-01-03 21:57:10,946][model8_pretrain.py][INFO] Epoch:[0/2](268400/4588595) loss:3.113 lr:0.0000100 epoch_Time:27321.0min: [2024-01-03 21:57:10,946][model8_pretrain.py][INFO] Epoch:[0/2](268400/4588595) loss:3.029 lr:0.0000100 epoch_Time:27321.0min: [2024-01-03 21:57:10,946][model8_pretrain.py][INFO] Epoch:[0/2](268400/4588595) loss:2.813 lr:0.0000100 epoch_Time:27321.0min: [2024-01-03 21:57:47,893][model8_pretrain.py][INFO] Epoch:[0/2](268500/4588595) loss:2.969 lr:0.0000100 epoch_Time:27320.0min: [2024-01-03 21:57:47,893][model8_pretrain.py][INFO] Epoch:[0/2](268500/4588595) loss:2.944 lr:0.0000100 epoch_Time:27320.0min: [2024-01-03 21:57:47,893][model8_pretrain.py][INFO] Epoch:[0/2](268500/4588595) loss:3.241 lr:0.0000100 epoch_Time:27320.0min: [2024-01-03 21:57:47,893][model8_pretrain.py][INFO] Epoch:[0/2](268500/4588595) loss:2.554 lr:0.0000100 epoch_Time:27320.0min: [2024-01-03 21:57:47,893][model8_pretrain.py][INFO] Epoch:[0/2](268500/4588595) loss:2.809 lr:0.0000100 epoch_Time:27320.0min: [2024-01-03 21:57:47,893][model8_pretrain.py][INFO] Epoch:[0/2](268500/4588595) loss:3.081 lr:0.0000100 epoch_Time:27320.0min: [2024-01-03 21:57:47,893][model8_pretrain.py][INFO] Epoch:[0/2](268500/4588595) loss:3.542 lr:0.0000100 epoch_Time:27320.0min: [2024-01-03 21:57:47,893][model8_pretrain.py][INFO] Epoch:[0/2](268500/4588595) loss:3.466 lr:0.0000100 
epoch_Time:27320.0min: [2024-01-03 21:58:24,824][model8_pretrain.py][INFO] Epoch:[0/2](268600/4588595) loss:2.987 lr:0.0000100 epoch_Time:27320.0min: [2024-01-03 21:58:24,824][model8_pretrain.py][INFO] Epoch:[0/2](268600/4588595) loss:2.723 lr:0.0000100 epoch_Time:27320.0min: [2024-01-03 21:58:24,824][model8_pretrain.py][INFO] Epoch:[0/2](268600/4588595) loss:2.812 lr:0.0000100 epoch_Time:27320.0min: [2024-01-03 21:58:24,824][model8_pretrain.py][INFO] Epoch:[0/2](268600/4588595) loss:2.747 lr:0.0000100 epoch_Time:27320.0min: [2024-01-03 21:58:24,824][model8_pretrain.py][INFO] Epoch:[0/2](268600/4588595) loss:3.296 lr:0.0000100 epoch_Time:27320.0min: [2024-01-03 21:58:24,824][model8_pretrain.py][INFO] Epoch:[0/2](268600/4588595) loss:3.199 lr:0.0000100 epoch_Time:27320.0min: [2024-01-03 21:58:24,824][model8_pretrain.py][INFO] Epoch:[0/2](268600/4588595) loss:2.447 lr:0.0000100 epoch_Time:27320.0min: [2024-01-03 21:58:24,825][model8_pretrain.py][INFO] Epoch:[0/2](268600/4588595) loss:2.306 lr:0.0000100 epoch_Time:27320.0min: [2024-01-03 21:59:01,764][model8_pretrain.py][INFO] Epoch:[0/2](268700/4588595) loss:2.739 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 21:59:01,764][model8_pretrain.py][INFO] Epoch:[0/2](268700/4588595) loss:2.751 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 21:59:01,764][model8_pretrain.py][INFO] Epoch:[0/2](268700/4588595) loss:2.971 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 21:59:01,764][model8_pretrain.py][INFO] Epoch:[0/2](268700/4588595) loss:2.967 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 21:59:01,764][model8_pretrain.py][INFO] Epoch:[0/2](268700/4588595) loss:2.881 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 21:59:01,764][model8_pretrain.py][INFO] Epoch:[0/2](268700/4588595) loss:2.937 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 21:59:01,764][model8_pretrain.py][INFO] Epoch:[0/2](268700/4588595) loss:2.825 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 21:59:01,764][model8_pretrain.py][INFO] 
Epoch:[0/2](268700/4588595) loss:3.060 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 21:59:38,697][model8_pretrain.py][INFO] Epoch:[0/2](268800/4588595) loss:3.534 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 21:59:38,697][model8_pretrain.py][INFO] Epoch:[0/2](268800/4588595) loss:2.948 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 21:59:38,697][model8_pretrain.py][INFO] Epoch:[0/2](268800/4588595) loss:2.613 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 21:59:38,697][model8_pretrain.py][INFO] Epoch:[0/2](268800/4588595) loss:2.878 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 21:59:38,697][model8_pretrain.py][INFO] Epoch:[0/2](268800/4588595) loss:3.169 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 21:59:38,697][model8_pretrain.py][INFO] Epoch:[0/2](268800/4588595) loss:2.931 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 21:59:38,698][model8_pretrain.py][INFO] Epoch:[0/2](268800/4588595) loss:2.813 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 21:59:38,698][model8_pretrain.py][INFO] Epoch:[0/2](268800/4588595) loss:2.595 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 22:00:20,401][model8_pretrain.py][INFO] Epoch:[0/2](268900/4588595) loss:3.094 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 22:00:20,401][model8_pretrain.py][INFO] Epoch:[0/2](268900/4588595) loss:2.929 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 22:00:20,401][model8_pretrain.py][INFO] Epoch:[0/2](268900/4588595) loss:3.008 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 22:00:20,401][model8_pretrain.py][INFO] Epoch:[0/2](268900/4588595) loss:3.765 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 22:00:20,401][model8_pretrain.py][INFO] Epoch:[0/2](268900/4588595) loss:3.005 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 22:00:20,401][model8_pretrain.py][INFO] Epoch:[0/2](268900/4588595) loss:2.836 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 22:00:20,401][model8_pretrain.py][INFO] Epoch:[0/2](268900/4588595) loss:2.417 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 
22:00:20,401][model8_pretrain.py][INFO] Epoch:[0/2](268900/4588595) loss:2.928 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 22:01:02,330][model8_pretrain.py][INFO] Epoch:[0/2](269000/4588595) loss:3.294 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 22:01:02,330][model8_pretrain.py][INFO] Epoch:[0/2](269000/4588595) loss:3.002 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 22:01:02,330][model8_pretrain.py][INFO] Epoch:[0/2](269000/4588595) loss:2.305 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 22:01:02,330][model8_pretrain.py][INFO] Epoch:[0/2](269000/4588595) loss:2.703 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 22:01:02,330][model8_pretrain.py][INFO] Epoch:[0/2](269000/4588595) loss:3.163 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 22:01:02,330][model8_pretrain.py][INFO] Epoch:[0/2](269000/4588595) loss:2.865 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 22:01:02,330][model8_pretrain.py][INFO] Epoch:[0/2](269000/4588595) loss:2.730 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 22:01:02,331][model8_pretrain.py][INFO] Epoch:[0/2](269000/4588595) loss:2.731 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 22:01:39,267][model8_pretrain.py][INFO] Epoch:[0/2](269100/4588595) loss:2.728 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 22:01:39,267][model8_pretrain.py][INFO] Epoch:[0/2](269100/4588595) loss:3.132 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 22:01:39,267][model8_pretrain.py][INFO] Epoch:[0/2](269100/4588595) loss:2.690 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 22:01:39,267][model8_pretrain.py][INFO] Epoch:[0/2](269100/4588595) loss:2.153 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 22:01:39,267][model8_pretrain.py][INFO] Epoch:[0/2](269100/4588595) loss:3.079 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 22:01:39,267][model8_pretrain.py][INFO] Epoch:[0/2](269100/4588595) loss:3.111 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 22:01:39,267][model8_pretrain.py][INFO] Epoch:[0/2](269100/4588595) loss:2.501 lr:0.0000100 
epoch_Time:27318.0min: [2024-01-03 22:01:39,267][model8_pretrain.py][INFO] Epoch:[0/2](269100/4588595) loss:3.074 lr:0.0000100 epoch_Time:27318.0min: [2024-01-03 22:02:16,213][model8_pretrain.py][INFO] Epoch:[0/2](269200/4588595) loss:2.379 lr:0.0000100 epoch_Time:27317.0min: [2024-01-03 22:02:16,213][model8_pretrain.py][INFO] Epoch:[0/2](269200/4588595) loss:2.903 lr:0.0000100 epoch_Time:27317.0min: [2024-01-03 22:02:16,213][model8_pretrain.py][INFO] Epoch:[0/2](269200/4588595) loss:2.975 lr:0.0000100 epoch_Time:27317.0min: [2024-01-03 22:02:16,213][model8_pretrain.py][INFO] Epoch:[0/2](269200/4588595) loss:2.280 lr:0.0000100 epoch_Time:27317.0min: [2024-01-03 22:02:16,213][model8_pretrain.py][INFO] Epoch:[0/2](269200/4588595) loss:2.728 lr:0.0000100 epoch_Time:27317.0min: [2024-01-03 22:02:16,213][model8_pretrain.py][INFO] Epoch:[0/2](269200/4588595) loss:2.927 lr:0.0000100 epoch_Time:27317.0min: [2024-01-03 22:02:16,213][model8_pretrain.py][INFO] Epoch:[0/2](269200/4588595) loss:2.705 lr:0.0000100 epoch_Time:27317.0min: [2024-01-03 22:02:16,213][model8_pretrain.py][INFO] Epoch:[0/2](269200/4588595) loss:3.061 lr:0.0000100 epoch_Time:27317.0min: [2024-01-03 22:02:53,151][model8_pretrain.py][INFO] Epoch:[0/2](269300/4588595) loss:3.168 lr:0.0000100 epoch_Time:27315.0min: [2024-01-03 22:02:53,151][model8_pretrain.py][INFO] Epoch:[0/2](269300/4588595) loss:3.056 lr:0.0000100 epoch_Time:27315.0min: [2024-01-03 22:02:53,151][model8_pretrain.py][INFO] Epoch:[0/2](269300/4588595) loss:3.372 lr:0.0000100 epoch_Time:27315.0min: [2024-01-03 22:02:53,151][model8_pretrain.py][INFO] Epoch:[0/2](269300/4588595) loss:3.236 lr:0.0000100 epoch_Time:27315.0min: [2024-01-03 22:02:53,151][model8_pretrain.py][INFO] Epoch:[0/2](269300/4588595) loss:3.634 lr:0.0000100 epoch_Time:27315.0min: [2024-01-03 22:02:53,151][model8_pretrain.py][INFO] Epoch:[0/2](269300/4588595) loss:3.011 lr:0.0000100 epoch_Time:27315.0min: [2024-01-03 22:02:53,151][model8_pretrain.py][INFO] 
Epoch:[0/2](269300/4588595) loss:3.001 lr:0.0000100 epoch_Time:27315.0min: [2024-01-03 22:02:53,151][model8_pretrain.py][INFO] Epoch:[0/2](269300/4588595) loss:2.849 lr:0.0000100 epoch_Time:27315.0min: [2024-01-03 22:03:30,090][model8_pretrain.py][INFO] Epoch:[0/2](269400/4588595) loss:3.086 lr:0.0000100 epoch_Time:27315.0min: [2024-01-03 22:03:30,090][model8_pretrain.py][INFO] Epoch:[0/2](269400/4588595) loss:3.209 lr:0.0000100 epoch_Time:27315.0min: [2024-01-03 22:03:30,090][model8_pretrain.py][INFO] Epoch:[0/2](269400/4588595) loss:2.896 lr:0.0000100 epoch_Time:27315.0min: [2024-01-03 22:03:30,090][model8_pretrain.py][INFO] Epoch:[0/2](269400/4588595) loss:2.705 lr:0.0000100 epoch_Time:27315.0min: [2024-01-03 22:03:30,090][model8_pretrain.py][INFO] Epoch:[0/2](269400/4588595) loss:2.781 lr:0.0000100 epoch_Time:27315.0min: [2024-01-03 22:03:30,090][model8_pretrain.py][INFO] Epoch:[0/2](269400/4588595) loss:2.790 lr:0.0000100 epoch_Time:27315.0min: [2024-01-03 22:03:30,090][model8_pretrain.py][INFO] Epoch:[0/2](269400/4588595) loss:2.571 lr:0.0000100 epoch_Time:27315.0min: [2024-01-03 22:03:30,090][model8_pretrain.py][INFO] Epoch:[0/2](269400/4588595) loss:2.735 lr:0.0000100 epoch_Time:27315.0min: [2024-01-03 22:04:07,027][model8_pretrain.py][INFO] Epoch:[0/2](269500/4588595) loss:3.216 lr:0.0000100 epoch_Time:27314.0min: [2024-01-03 22:04:07,027][model8_pretrain.py][INFO] Epoch:[0/2](269500/4588595) loss:2.729 lr:0.0000100 epoch_Time:27314.0min: [2024-01-03 22:04:07,027][model8_pretrain.py][INFO] Epoch:[0/2](269500/4588595) loss:3.144 lr:0.0000100 epoch_Time:27314.0min: [2024-01-03 22:04:07,027][model8_pretrain.py][INFO] Epoch:[0/2](269500/4588595) loss:3.172 lr:0.0000100 epoch_Time:27314.0min: [2024-01-03 22:04:07,027][model8_pretrain.py][INFO] Epoch:[0/2](269500/4588595) loss:2.373 lr:0.0000100 epoch_Time:27314.0min: [2024-01-03 22:04:07,027][model8_pretrain.py][INFO] Epoch:[0/2](269500/4588595) loss:2.143 lr:0.0000100 epoch_Time:27314.0min: [2024-01-03 
22:04:07,027][model8_pretrain.py][INFO] Epoch:[0/2](269500/4588595) loss:3.168 lr:0.0000100 epoch_Time:27314.0min: [2024-01-03 22:04:07,027][model8_pretrain.py][INFO] Epoch:[0/2](269500/4588595) loss:3.316 lr:0.0000100 epoch_Time:27314.0min: [2024-01-03 22:04:43,949][model8_pretrain.py][INFO] Epoch:[0/2](269600/4588595) loss:2.199 lr:0.0000100 epoch_Time:27313.0min: [2024-01-03 22:04:43,949][model8_pretrain.py][INFO] Epoch:[0/2](269600/4588595) loss:3.044 lr:0.0000100 epoch_Time:27313.0min: [2024-01-03 22:04:43,949][model8_pretrain.py][INFO] Epoch:[0/2](269600/4588595) loss:3.220 lr:0.0000100 epoch_Time:27313.0min: [2024-01-03 22:04:43,949][model8_pretrain.py][INFO] Epoch:[0/2](269600/4588595) loss:2.481 lr:0.0000100 epoch_Time:27313.0min: [2024-01-03 22:04:43,949][model8_pretrain.py][INFO] Epoch:[0/2](269600/4588595) loss:2.796 lr:0.0000100 epoch_Time:27313.0min: [2024-01-03 22:04:43,949][model8_pretrain.py][INFO] Epoch:[0/2](269600/4588595) loss:3.051 lr:0.0000100 epoch_Time:27313.0min: [2024-01-03 22:04:43,949][model8_pretrain.py][INFO] Epoch:[0/2](269600/4588595) loss:2.513 lr:0.0000100 epoch_Time:27313.0min: [2024-01-03 22:04:43,949][model8_pretrain.py][INFO] Epoch:[0/2](269600/4588595) loss:2.959 lr:0.0000100 epoch_Time:27313.0min: [2024-01-03 22:05:25,742][model8_pretrain.py][INFO] Epoch:[0/2](269700/4588595) loss:3.437 lr:0.0000100 epoch_Time:27314.0min: [2024-01-03 22:05:25,742][model8_pretrain.py][INFO] Epoch:[0/2](269700/4588595) loss:3.412 lr:0.0000100 epoch_Time:27314.0min: [2024-01-03 22:05:25,742][model8_pretrain.py][INFO] Epoch:[0/2](269700/4588595) loss:3.080 lr:0.0000100 epoch_Time:27314.0min: [2024-01-03 22:05:25,742][model8_pretrain.py][INFO] Epoch:[0/2](269700/4588595) loss:2.781 lr:0.0000100 epoch_Time:27314.0min: [2024-01-03 22:05:25,742][model8_pretrain.py][INFO] Epoch:[0/2](269700/4588595) loss:3.013 lr:0.0000100 epoch_Time:27314.0min: [2024-01-03 22:05:25,742][model8_pretrain.py][INFO] Epoch:[0/2](269700/4588595) loss:2.738 lr:0.0000100 
epoch_Time:27314.0min: [2024-01-03 22:05:25,742][model8_pretrain.py][INFO] Epoch:[0/2](269700/4588595) loss:3.098 lr:0.0000100 epoch_Time:27314.0min: [2024-01-03 22:05:25,742][model8_pretrain.py][INFO] Epoch:[0/2](269700/4588595) loss:2.950 lr:0.0000100 epoch_Time:27314.0min: [2024-01-03 22:06:07,646][model8_pretrain.py][INFO] Epoch:[0/2](269800/4588595) loss:3.163 lr:0.0000100 epoch_Time:27314.0min: [2024-01-03 22:06:07,646][model8_pretrain.py][INFO] Epoch:[0/2](269800/4588595) loss:2.724 lr:0.0000100 epoch_Time:27314.0min: [2024-01-03 22:06:07,646][model8_pretrain.py][INFO] Epoch:[0/2](269800/4588595) loss:3.114 lr:0.0000100 epoch_Time:27314.0min: [2024-01-03 22:06:07,646][model8_pretrain.py][INFO] Epoch:[0/2](269800/4588595) loss:2.995 lr:0.0000100 epoch_Time:27314.0min: [2024-01-03 22:06:07,646][model8_pretrain.py][INFO] Epoch:[0/2](269800/4588595) loss:2.199 lr:0.0000100 epoch_Time:27314.0min: [2024-01-03 22:06:07,646][model8_pretrain.py][INFO] Epoch:[0/2](269800/4588595) loss:3.009 lr:0.0000100 epoch_Time:27314.0min: [2024-01-03 22:06:07,646][model8_pretrain.py][INFO] Epoch:[0/2](269800/4588595) loss:2.804 lr:0.0000100 epoch_Time:27314.0min: [2024-01-03 22:06:07,646][model8_pretrain.py][INFO] Epoch:[0/2](269800/4588595) loss:3.642 lr:0.0000100 epoch_Time:27314.0min: [2024-01-03 22:06:44,580][model8_pretrain.py][INFO] Epoch:[0/2](269900/4588595) loss:2.727 lr:0.0000100 epoch_Time:27313.0min: [2024-01-03 22:06:44,580][model8_pretrain.py][INFO] Epoch:[0/2](269900/4588595) loss:2.901 lr:0.0000100 epoch_Time:27313.0min: [2024-01-03 22:06:44,580][model8_pretrain.py][INFO] Epoch:[0/2](269900/4588595) loss:3.065 lr:0.0000100 epoch_Time:27313.0min: [2024-01-03 22:06:44,581][model8_pretrain.py][INFO] Epoch:[0/2](269900/4588595) loss:3.112 lr:0.0000100 epoch_Time:27313.0min: [2024-01-03 22:06:44,581][model8_pretrain.py][INFO] Epoch:[0/2](269900/4588595) loss:2.759 lr:0.0000100 epoch_Time:27313.0min: [2024-01-03 22:06:44,581][model8_pretrain.py][INFO] 
Epoch:[0/2](269900/4588595) loss:3.121 lr:0.0000100 epoch_Time:27313.0min: [2024-01-03 22:06:44,581][model8_pretrain.py][INFO] Epoch:[0/2](269900/4588595) loss:3.005 lr:0.0000100 epoch_Time:27313.0min: [2024-01-03 22:06:44,581][model8_pretrain.py][INFO] Epoch:[0/2](269900/4588595) loss:3.177 lr:0.0000100 epoch_Time:27313.0min: [2024-01-03 22:07:21,514][model8_pretrain.py][INFO] Epoch:[0/2](270000/4588595) loss:2.675 lr:0.0000100 epoch_Time:27312.0min: [2024-01-03 22:07:21,514][model8_pretrain.py][INFO] Epoch:[0/2](270000/4588595) loss:3.255 lr:0.0000100 epoch_Time:27312.0min: [2024-01-03 22:07:21,514][model8_pretrain.py][INFO] Epoch:[0/2](270000/4588595) loss:2.889 lr:0.0000100 epoch_Time:27312.0min: [2024-01-03 22:07:21,514][model8_pretrain.py][INFO] Epoch:[0/2](270000/4588595) loss:2.794 lr:0.0000100 epoch_Time:27312.0min: [2024-01-03 22:07:21,514][model8_pretrain.py][INFO] Epoch:[0/2](270000/4588595) loss:2.969 lr:0.0000100 epoch_Time:27312.0min: [2024-01-03 22:07:21,514][model8_pretrain.py][INFO] Epoch:[0/2](270000/4588595) loss:2.601 lr:0.0000100 epoch_Time:27312.0min: [2024-01-03 22:07:21,514][model8_pretrain.py][INFO] Epoch:[0/2](270000/4588595) loss:2.825 lr:0.0000100 epoch_Time:27312.0min: [2024-01-03 22:07:21,514][model8_pretrain.py][INFO] Epoch:[0/2](270000/4588595) loss:2.579 lr:0.0000100 epoch_Time:27312.0min: [2024-01-03 22:07:58,487][model8_pretrain.py][INFO] Epoch:[0/2](270100/4588595) loss:2.558 lr:0.0000100 epoch_Time:27311.0min: [2024-01-03 22:07:58,487][model8_pretrain.py][INFO] Epoch:[0/2](270100/4588595) loss:3.173 lr:0.0000100 epoch_Time:27311.0min: [2024-01-03 22:07:58,487][model8_pretrain.py][INFO] Epoch:[0/2](270100/4588595) loss:3.091 lr:0.0000100 epoch_Time:27311.0min: [2024-01-03 22:07:58,487][model8_pretrain.py][INFO] Epoch:[0/2](270100/4588595) loss:2.792 lr:0.0000100 epoch_Time:27311.0min: [2024-01-03 22:07:58,487][model8_pretrain.py][INFO] Epoch:[0/2](270100/4588595) loss:2.173 lr:0.0000100 epoch_Time:27311.0min: [2024-01-03 
22:07:58,487][model8_pretrain.py][INFO] Epoch:[0/2](270100/4588595) loss:2.699 lr:0.0000100 epoch_Time:27311.0min: [2024-01-03 22:07:58,488][model8_pretrain.py][INFO] Epoch:[0/2](270100/4588595) loss:3.167 lr:0.0000100 epoch_Time:27311.0min: [2024-01-03 22:07:58,488][model8_pretrain.py][INFO] Epoch:[0/2](270100/4588595) loss:3.228 lr:0.0000100 epoch_Time:27311.0min: [2024-01-03 22:08:35,425][model8_pretrain.py][INFO] Epoch:[0/2](270200/4588595) loss:3.000 lr:0.0000100 epoch_Time:27311.0min: [2024-01-03 22:08:35,425][model8_pretrain.py][INFO] Epoch:[0/2](270200/4588595) loss:3.236 lr:0.0000100 epoch_Time:27311.0min: [2024-01-03 22:08:35,425][model8_pretrain.py][INFO] Epoch:[0/2](270200/4588595) loss:2.790 lr:0.0000100 epoch_Time:27311.0min: [2024-01-03 22:08:35,425][model8_pretrain.py][INFO] Epoch:[0/2](270200/4588595) loss:2.955 lr:0.0000100 epoch_Time:27311.0min: [2024-01-03 22:08:35,425][model8_pretrain.py][INFO] Epoch:[0/2](270200/4588595) loss:3.106 lr:0.0000100 epoch_Time:27311.0min: [2024-01-03 22:08:35,425][model8_pretrain.py][INFO] Epoch:[0/2](270200/4588595) loss:3.049 lr:0.0000100 epoch_Time:27311.0min: [2024-01-03 22:08:35,425][model8_pretrain.py][INFO] Epoch:[0/2](270200/4588595) loss:3.114 lr:0.0000100 epoch_Time:27311.0min: [2024-01-03 22:08:35,425][model8_pretrain.py][INFO] Epoch:[0/2](270200/4588595) loss:2.857 lr:0.0000100 epoch_Time:27311.0min: [2024-01-03 22:09:12,352][model8_pretrain.py][INFO] Epoch:[0/2](270300/4588595) loss:2.805 lr:0.0000100 epoch_Time:27309.0min: [2024-01-03 22:09:12,352][model8_pretrain.py][INFO] Epoch:[0/2](270300/4588595) loss:3.006 lr:0.0000100 epoch_Time:27309.0min: [2024-01-03 22:09:12,352][model8_pretrain.py][INFO] Epoch:[0/2](270300/4588595) loss:3.176 lr:0.0000100 epoch_Time:27309.0min: [2024-01-03 22:09:12,352][model8_pretrain.py][INFO] Epoch:[0/2](270300/4588595) loss:3.315 lr:0.0000100 epoch_Time:27309.0min: [2024-01-03 22:09:12,352][model8_pretrain.py][INFO] Epoch:[0/2](270300/4588595) loss:2.228 lr:0.0000100 
epoch_Time:27309.0min: [2024-01-03 22:09:12,352][model8_pretrain.py][INFO] Epoch:[0/2](270300/4588595) loss:3.319 lr:0.0000100 epoch_Time:27309.0min: [2024-01-03 22:09:12,352][model8_pretrain.py][INFO] Epoch:[0/2](270300/4588595) loss:2.670 lr:0.0000100 epoch_Time:27309.0min: [2024-01-03 22:09:12,353][model8_pretrain.py][INFO] Epoch:[0/2](270300/4588595) loss:3.057 lr:0.0000100 epoch_Time:27309.0min: [2024-01-03 22:09:49,290][model8_pretrain.py][INFO] Epoch:[0/2](270400/4588595) loss:3.079 lr:0.0000100 epoch_Time:27308.0min: [2024-01-03 22:09:49,290][model8_pretrain.py][INFO] Epoch:[0/2](270400/4588595) loss:2.441 lr:0.0000100 epoch_Time:27308.0min: [2024-01-03 22:09:49,290][model8_pretrain.py][INFO] Epoch:[0/2](270400/4588595) loss:3.032 lr:0.0000100 epoch_Time:27308.0min: [2024-01-03 22:09:49,290][model8_pretrain.py][INFO] Epoch:[0/2](270400/4588595) loss:2.874 lr:0.0000100 epoch_Time:27308.0min: [2024-01-03 22:09:49,290][model8_pretrain.py][INFO] Epoch:[0/2](270400/4588595) loss:3.357 lr:0.0000100 epoch_Time:27308.0min: [2024-01-03 22:09:49,290][model8_pretrain.py][INFO] Epoch:[0/2](270400/4588595) loss:2.194 lr:0.0000100 epoch_Time:27308.0min: [2024-01-03 22:09:49,290][model8_pretrain.py][INFO] Epoch:[0/2](270400/4588595) loss:2.080 lr:0.0000100 epoch_Time:27308.0min: [2024-01-03 22:09:49,290][model8_pretrain.py][INFO] Epoch:[0/2](270400/4588595) loss:3.232 lr:0.0000100 epoch_Time:27308.0min: [2024-01-03 22:10:31,392][model8_pretrain.py][INFO] Epoch:[0/2](270500/4588595) loss:2.721 lr:0.0000100 epoch_Time:27309.0min: [2024-01-03 22:10:31,392][model8_pretrain.py][INFO] Epoch:[0/2](270500/4588595) loss:3.025 lr:0.0000100 epoch_Time:27309.0min: [2024-01-03 22:10:31,393][model8_pretrain.py][INFO] Epoch:[0/2](270500/4588595) loss:3.135 lr:0.0000100 epoch_Time:27309.0min: [2024-01-03 22:10:31,392][model8_pretrain.py][INFO] Epoch:[0/2](270500/4588595) loss:3.047 lr:0.0000100 epoch_Time:27309.0min: [2024-01-03 22:10:31,397][model8_pretrain.py][INFO] 
Epoch:[0/2](270500/4588595) loss:3.208 lr:0.0000100 epoch_Time:27309.0min: [2024-01-03 22:10:31,397][model8_pretrain.py][INFO] Epoch:[0/2](270500/4588595) loss:2.854 lr:0.0000100 epoch_Time:27309.0min: [2024-01-03 22:10:31,397][model8_pretrain.py][INFO] Epoch:[0/2](270500/4588595) loss:2.376 lr:0.0000100 epoch_Time:27309.0min: [2024-01-03 22:10:31,397][model8_pretrain.py][INFO] Epoch:[0/2](270500/4588595) loss:2.886 lr:0.0000100 epoch_Time:27309.0min: [2024-01-03 22:11:13,338][model8_pretrain.py][INFO] Epoch:[0/2](270600/4588595) loss:3.259 lr:0.0000100 epoch_Time:27309.0min: [2024-01-03 22:11:13,338][model8_pretrain.py][INFO] Epoch:[0/2](270600/4588595) loss:3.117 lr:0.0000100 epoch_Time:27309.0min: [2024-01-03 22:11:13,338][model8_pretrain.py][INFO] Epoch:[0/2](270600/4588595) loss:3.012 lr:0.0000100 epoch_Time:27309.0min: [2024-01-03 22:11:13,338][model8_pretrain.py][INFO] Epoch:[0/2](270600/4588595) loss:3.069 lr:0.0000100 epoch_Time:27309.0min: [2024-01-03 22:11:13,338][model8_pretrain.py][INFO] Epoch:[0/2](270600/4588595) loss:3.077 lr:0.0000100 epoch_Time:27309.0min: [2024-01-03 22:11:13,338][model8_pretrain.py][INFO] Epoch:[0/2](270600/4588595) loss:2.962 lr:0.0000100 epoch_Time:27309.0min: [2024-01-03 22:11:13,338][model8_pretrain.py][INFO] Epoch:[0/2](270600/4588595) loss:2.932 lr:0.0000100 epoch_Time:27309.0min: [2024-01-03 22:11:13,338][model8_pretrain.py][INFO] Epoch:[0/2](270600/4588595) loss:2.972 lr:0.0000100 epoch_Time:27309.0min: [2024-01-03 22:11:50,286][model8_pretrain.py][INFO] Epoch:[0/2](270700/4588595) loss:2.701 lr:0.0000100 epoch_Time:27308.0min: [2024-01-03 22:11:50,286][model8_pretrain.py][INFO] Epoch:[0/2](270700/4588595) loss:2.960 lr:0.0000100 epoch_Time:27308.0min: [2024-01-03 22:11:50,286][model8_pretrain.py][INFO] Epoch:[0/2](270700/4588595) loss:2.742 lr:0.0000100 epoch_Time:27308.0min: [2024-01-03 22:11:50,286][model8_pretrain.py][INFO] Epoch:[0/2](270700/4588595) loss:3.269 lr:0.0000100 epoch_Time:27308.0min: [2024-01-03 
22:11:50,286][model8_pretrain.py][INFO] Epoch:[0/2](270700/4588595) loss:3.035 lr:0.0000100 epoch_Time:27308.0min: [2024-01-03 22:11:50,286][model8_pretrain.py][INFO] Epoch:[0/2](270700/4588595) loss:2.758 lr:0.0000100 epoch_Time:27308.0min: [2024-01-03 22:11:50,287][model8_pretrain.py][INFO] Epoch:[0/2](270700/4588595) loss:2.584 lr:0.0000100 epoch_Time:27308.0min: [2024-01-03 22:11:50,287][model8_pretrain.py][INFO] Epoch:[0/2](270700/4588595) loss:2.779 lr:0.0000100 epoch_Time:27308.0min: [2024-01-03 22:12:27,240][model8_pretrain.py][INFO] Epoch:[0/2](270800/4588595) loss:3.175 lr:0.0000100 epoch_Time:27308.0min: [2024-01-03 22:12:27,240][model8_pretrain.py][INFO] Epoch:[0/2](270800/4588595) loss:3.121 lr:0.0000100 epoch_Time:27308.0min: [2024-01-03 22:12:27,240][model8_pretrain.py][INFO] Epoch:[0/2](270800/4588595) loss:2.894 lr:0.0000100 epoch_Time:27308.0min: [2024-01-03 22:12:27,240][model8_pretrain.py][INFO] Epoch:[0/2](270800/4588595) loss:2.808 lr:0.0000100 epoch_Time:27308.0min: [2024-01-03 22:12:27,240][model8_pretrain.py][INFO] Epoch:[0/2](270800/4588595) loss:3.136 lr:0.0000100 epoch_Time:27308.0min: [2024-01-03 22:12:27,240][model8_pretrain.py][INFO] Epoch:[0/2](270800/4588595) loss:2.529 lr:0.0000100 epoch_Time:27308.0min: [2024-01-03 22:12:27,240][model8_pretrain.py][INFO] Epoch:[0/2](270800/4588595) loss:3.130 lr:0.0000100 epoch_Time:27308.0min: [2024-01-03 22:12:27,240][model8_pretrain.py][INFO] Epoch:[0/2](270800/4588595) loss:2.853 lr:0.0000100 epoch_Time:27308.0min: [2024-01-03 22:13:04,199][model8_pretrain.py][INFO] Epoch:[0/2](270900/4588595) loss:2.474 lr:0.0000100 epoch_Time:27306.0min: [2024-01-03 22:13:04,200][model8_pretrain.py][INFO] Epoch:[0/2](270900/4588595) loss:3.201 lr:0.0000100 epoch_Time:27306.0min: [2024-01-03 22:13:04,200][model8_pretrain.py][INFO] Epoch:[0/2](270900/4588595) loss:2.465 lr:0.0000100 epoch_Time:27306.0min: [2024-01-03 22:13:04,199][model8_pretrain.py][INFO] Epoch:[0/2](270900/4588595) loss:3.319 lr:0.0000100 
epoch_Time:27306.0min: [2024-01-03 22:13:04,199][model8_pretrain.py][INFO] Epoch:[0/2](270900/4588595) loss:3.163 lr:0.0000100 epoch_Time:27306.0min: [2024-01-03 22:13:04,200][model8_pretrain.py][INFO] Epoch:[0/2](270900/4588595) loss:2.895 lr:0.0000100 epoch_Time:27306.0min: [2024-01-03 22:13:04,200][model8_pretrain.py][INFO] Epoch:[0/2](270900/4588595) loss:2.537 lr:0.0000100 epoch_Time:27306.0min: [2024-01-03 22:13:04,200][model8_pretrain.py][INFO] Epoch:[0/2](270900/4588595) loss:3.083 lr:0.0000100 epoch_Time:27306.0min: [2024-01-03 22:13:41,162][model8_pretrain.py][INFO] Epoch:[0/2](271000/4588595) loss:3.079 lr:0.0000100 epoch_Time:27306.0min: [2024-01-03 22:13:41,162][model8_pretrain.py][INFO] Epoch:[0/2](271000/4588595) loss:2.805 lr:0.0000100 epoch_Time:27306.0min: [2024-01-03 22:13:41,162][model8_pretrain.py][INFO] Epoch:[0/2](271000/4588595) loss:2.516 lr:0.0000100 epoch_Time:27306.0min: [2024-01-03 22:13:41,162][model8_pretrain.py][INFO] Epoch:[0/2](271000/4588595) loss:2.799 lr:0.0000100 epoch_Time:27306.0min: [2024-01-03 22:13:41,162][model8_pretrain.py][INFO] Epoch:[0/2](271000/4588595) loss:2.693 lr:0.0000100 epoch_Time:27306.0min: [2024-01-03 22:13:41,162][model8_pretrain.py][INFO] Epoch:[0/2](271000/4588595) loss:2.529 lr:0.0000100 epoch_Time:27306.0min: [2024-01-03 22:13:41,162][model8_pretrain.py][INFO] Epoch:[0/2](271000/4588595) loss:2.304 lr:0.0000100 epoch_Time:27306.0min: [2024-01-03 22:13:41,162][model8_pretrain.py][INFO] Epoch:[0/2](271000/4588595) loss:2.688 lr:0.0000100 epoch_Time:27306.0min: [2024-01-03 22:14:18,126][model8_pretrain.py][INFO] Epoch:[0/2](271100/4588595) loss:3.374 lr:0.0000100 epoch_Time:27305.0min: [2024-01-03 22:14:18,126][model8_pretrain.py][INFO] Epoch:[0/2](271100/4588595) loss:2.895 lr:0.0000100 epoch_Time:27305.0min: [2024-01-03 22:14:18,126][model8_pretrain.py][INFO] Epoch:[0/2](271100/4588595) loss:3.210 lr:0.0000100 epoch_Time:27305.0min: [2024-01-03 22:14:18,126][model8_pretrain.py][INFO] 
Epoch:[0/2](271100/4588595) loss:3.290 lr:0.0000100 epoch_Time:27305.0min: [2024-01-03 22:14:18,126][model8_pretrain.py][INFO] Epoch:[0/2](271100/4588595) loss:2.662 lr:0.0000100 epoch_Time:27305.0min: [2024-01-03 22:14:18,126][model8_pretrain.py][INFO] Epoch:[0/2](271100/4588595) loss:2.654 lr:0.0000100 epoch_Time:27305.0min: [2024-01-03 22:14:18,126][model8_pretrain.py][INFO] Epoch:[0/2](271100/4588595) loss:2.766 lr:0.0000100 epoch_Time:27305.0min: [2024-01-03 22:14:18,126][model8_pretrain.py][INFO] Epoch:[0/2](271100/4588595) loss:2.893 lr:0.0000100 epoch_Time:27305.0min: [2024-01-03 22:14:55,079][model8_pretrain.py][INFO] Epoch:[0/2](271200/4588595) loss:2.889 lr:0.0000100 epoch_Time:27304.0min: [2024-01-03 22:14:55,079][model8_pretrain.py][INFO] Epoch:[0/2](271200/4588595) loss:3.114 lr:0.0000100 epoch_Time:27304.0min: [2024-01-03 22:14:55,079][model8_pretrain.py][INFO] Epoch:[0/2](271200/4588595) loss:3.148 lr:0.0000100 epoch_Time:27304.0min: [2024-01-03 22:14:55,079][model8_pretrain.py][INFO] Epoch:[0/2](271200/4588595) loss:2.889 lr:0.0000100 epoch_Time:27304.0min: [2024-01-03 22:14:55,079][model8_pretrain.py][INFO] Epoch:[0/2](271200/4588595) loss:3.140 lr:0.0000100 epoch_Time:27304.0min: [2024-01-03 22:14:55,079][model8_pretrain.py][INFO] Epoch:[0/2](271200/4588595) loss:3.093 lr:0.0000100 epoch_Time:27304.0min: [2024-01-03 22:14:55,079][model8_pretrain.py][INFO] Epoch:[0/2](271200/4588595) loss:2.880 lr:0.0000100 epoch_Time:27304.0min: [2024-01-03 22:14:55,079][model8_pretrain.py][INFO] Epoch:[0/2](271200/4588595) loss:2.304 lr:0.0000100 epoch_Time:27304.0min: [2024-01-03 22:15:33,780][model8_pretrain.py][INFO] Epoch:[0/2](271300/4588595) loss:3.069 lr:0.0000100 epoch_Time:27304.0min: [2024-01-03 22:15:33,780][model8_pretrain.py][INFO] Epoch:[0/2](271300/4588595) loss:2.699 lr:0.0000100 epoch_Time:27304.0min: [2024-01-03 22:15:33,780][model8_pretrain.py][INFO] Epoch:[0/2](271300/4588595) loss:3.220 lr:0.0000100 epoch_Time:27304.0min: [2024-01-03 
22:15:33,780][model8_pretrain.py][INFO] Epoch:[0/2](271300/4588595) loss:2.823 lr:0.0000100 epoch_Time:27304.0min: [2024-01-03 22:15:33,780][model8_pretrain.py][INFO] Epoch:[0/2](271300/4588595) loss:3.117 lr:0.0000100 epoch_Time:27304.0min: [2024-01-03 22:15:33,780][model8_pretrain.py][INFO] Epoch:[0/2](271300/4588595) loss:2.152 lr:0.0000100 epoch_Time:27304.0min: [2024-01-03 22:15:33,780][model8_pretrain.py][INFO] Epoch:[0/2](271300/4588595) loss:2.137 lr:0.0000100 epoch_Time:27304.0min: [2024-01-03 22:15:33,780][model8_pretrain.py][INFO] Epoch:[0/2](271300/4588595) loss:2.544 lr:0.0000100 epoch_Time:27304.0min: [2024-01-03 22:16:19,180][model8_pretrain.py][INFO] Epoch:[0/2](271400/4588595) loss:2.832 lr:0.0000100 epoch_Time:27305.0min: [2024-01-03 22:16:19,180][model8_pretrain.py][INFO] Epoch:[0/2](271400/4588595) loss:2.793 lr:0.0000100 epoch_Time:27305.0min: [2024-01-03 22:16:19,180][model8_pretrain.py][INFO] Epoch:[0/2](271400/4588595) loss:2.784 lr:0.0000100 epoch_Time:27305.0min: [2024-01-03 22:16:19,180][model8_pretrain.py][INFO] Epoch:[0/2](271400/4588595) loss:2.610 lr:0.0000100 epoch_Time:27305.0min: [2024-01-03 22:16:19,180][model8_pretrain.py][INFO] Epoch:[0/2](271400/4588595) loss:3.204 lr:0.0000100 epoch_Time:27305.0min: [2024-01-03 22:16:19,180][model8_pretrain.py][INFO] Epoch:[0/2](271400/4588595) loss:2.631 lr:0.0000100 epoch_Time:27305.0min: [2024-01-03 22:16:19,180][model8_pretrain.py][INFO] Epoch:[0/2](271400/4588595) loss:2.893 lr:0.0000100 epoch_Time:27305.0min: [2024-01-03 22:16:19,180][model8_pretrain.py][INFO] Epoch:[0/2](271400/4588595) loss:2.841 lr:0.0000100 epoch_Time:27305.0min: [2024-01-03 22:16:56,130][model8_pretrain.py][INFO] Epoch:[0/2](271500/4588595) loss:3.256 lr:0.0000100 epoch_Time:27304.0min: [2024-01-03 22:16:56,130][model8_pretrain.py][INFO] Epoch:[0/2](271500/4588595) loss:3.010 lr:0.0000100 epoch_Time:27304.0min: [2024-01-03 22:16:56,130][model8_pretrain.py][INFO] Epoch:[0/2](271500/4588595) loss:2.786 lr:0.0000100 
epoch_Time:27304.0min: [2024-01-03 22:16:56,130][model8_pretrain.py][INFO] Epoch:[0/2](271500/4588595) loss:2.925 lr:0.0000100 epoch_Time:27304.0min: [2024-01-03 22:16:56,130][model8_pretrain.py][INFO] Epoch:[0/2](271500/4588595) loss:3.021 lr:0.0000100 epoch_Time:27304.0min: [2024-01-03 22:16:56,130][model8_pretrain.py][INFO] Epoch:[0/2](271500/4588595) loss:2.765 lr:0.0000100 epoch_Time:27304.0min: [2024-01-03 22:16:56,130][model8_pretrain.py][INFO] Epoch:[0/2](271500/4588595) loss:2.939 lr:0.0000100 epoch_Time:27304.0min: [2024-01-03 22:16:56,131][model8_pretrain.py][INFO] Epoch:[0/2](271500/4588595) loss:3.034 lr:0.0000100 epoch_Time:27304.0min: [2024-01-03 22:17:33,073][model8_pretrain.py][INFO] Epoch:[0/2](271600/4588595) loss:2.791 lr:0.0000100 epoch_Time:27303.0min: [2024-01-03 22:17:33,073][model8_pretrain.py][INFO] Epoch:[0/2](271600/4588595) loss:3.178 lr:0.0000100 epoch_Time:27303.0min: [2024-01-03 22:17:33,073][model8_pretrain.py][INFO] Epoch:[0/2](271600/4588595) loss:3.079 lr:0.0000100 epoch_Time:27303.0min: [2024-01-03 22:17:33,073][model8_pretrain.py][INFO] Epoch:[0/2](271600/4588595) loss:3.022 lr:0.0000100 epoch_Time:27303.0min: [2024-01-03 22:17:33,073][model8_pretrain.py][INFO] Epoch:[0/2](271600/4588595) loss:2.677 lr:0.0000100 epoch_Time:27303.0min: [2024-01-03 22:17:33,073][model8_pretrain.py][INFO] Epoch:[0/2](271600/4588595) loss:2.599 lr:0.0000100 epoch_Time:27303.0min: [2024-01-03 22:17:33,073][model8_pretrain.py][INFO] Epoch:[0/2](271600/4588595) loss:2.546 lr:0.0000100 epoch_Time:27303.0min: [2024-01-03 22:17:33,073][model8_pretrain.py][INFO] Epoch:[0/2](271600/4588595) loss:2.846 lr:0.0000100 epoch_Time:27303.0min: [2024-01-03 22:18:10,013][model8_pretrain.py][INFO] Epoch:[0/2](271700/4588595) loss:2.206 lr:0.0000100 epoch_Time:27302.0min: [2024-01-03 22:18:10,013][model8_pretrain.py][INFO] Epoch:[0/2](271700/4588595) loss:2.908 lr:0.0000100 epoch_Time:27302.0min: [2024-01-03 22:18:10,013][model8_pretrain.py][INFO] 
Epoch:[0/2](271700/4588595) loss:3.079 lr:0.0000100 epoch_Time:27302.0min: [2024-01-03 22:18:10,013][model8_pretrain.py][INFO] Epoch:[0/2](271700/4588595) loss:2.469 lr:0.0000100 epoch_Time:27302.0min: [2024-01-03 22:18:10,013][model8_pretrain.py][INFO] Epoch:[0/2](271700/4588595) loss:2.874 lr:0.0000100 epoch_Time:27302.0min: [2024-01-03 22:18:10,013][model8_pretrain.py][INFO] Epoch:[0/2](271700/4588595) loss:2.734 lr:0.0000100 epoch_Time:27302.0min: [2024-01-03 22:18:10,013][model8_pretrain.py][INFO] Epoch:[0/2](271700/4588595) loss:3.075 lr:0.0000100 epoch_Time:27302.0min: [2024-01-03 22:18:10,013][model8_pretrain.py][INFO] Epoch:[0/2](271700/4588595) loss:3.104 lr:0.0000100 epoch_Time:27302.0min: [2024-01-03 22:18:46,957][model8_pretrain.py][INFO] Epoch:[0/2](271800/4588595) loss:2.938 lr:0.0000100 epoch_Time:27302.0min: [2024-01-03 22:18:46,957][model8_pretrain.py][INFO] Epoch:[0/2](271800/4588595) loss:3.137 lr:0.0000100 epoch_Time:27302.0min: [2024-01-03 22:18:46,957][model8_pretrain.py][INFO] Epoch:[0/2](271800/4588595) loss:3.135 lr:0.0000100 epoch_Time:27302.0min: [2024-01-03 22:18:46,957][model8_pretrain.py][INFO] Epoch:[0/2](271800/4588595) loss:2.877 lr:0.0000100 epoch_Time:27302.0min: [2024-01-03 22:18:46,957][model8_pretrain.py][INFO] Epoch:[0/2](271800/4588595) loss:3.146 lr:0.0000100 epoch_Time:27302.0min: [2024-01-03 22:18:46,957][model8_pretrain.py][INFO] Epoch:[0/2](271800/4588595) loss:2.879 lr:0.0000100 epoch_Time:27302.0min: [2024-01-03 22:18:46,957][model8_pretrain.py][INFO] Epoch:[0/2](271800/4588595) loss:2.755 lr:0.0000100 epoch_Time:27302.0min: [2024-01-03 22:18:46,957][model8_pretrain.py][INFO] Epoch:[0/2](271800/4588595) loss:2.926 lr:0.0000100 epoch_Time:27302.0min: [2024-01-03 22:19:23,906][model8_pretrain.py][INFO] Epoch:[0/2](271900/4588595) loss:3.027 lr:0.0000100 epoch_Time:27301.0min: [2024-01-03 22:19:23,906][model8_pretrain.py][INFO] Epoch:[0/2](271900/4588595) loss:3.510 lr:0.0000100 epoch_Time:27301.0min: [2024-01-03 
22:19:23,906][model8_pretrain.py][INFO] Epoch:[0/2](271900/4588595) loss:2.964 lr:0.0000100 epoch_Time:27301.0min: [2024-01-03 22:19:23,906][model8_pretrain.py][INFO] Epoch:[0/2](271900/4588595) loss:3.196 lr:0.0000100 epoch_Time:27301.0min: [2024-01-03 22:19:23,906][model8_pretrain.py][INFO] Epoch:[0/2](271900/4588595) loss:2.746 lr:0.0000100 epoch_Time:27301.0min: [2024-01-03 22:19:23,906][model8_pretrain.py][INFO] Epoch:[0/2](271900/4588595) loss:2.729 lr:0.0000100 epoch_Time:27301.0min: [2024-01-03 22:19:23,906][model8_pretrain.py][INFO] Epoch:[0/2](271900/4588595) loss:2.984 lr:0.0000100 epoch_Time:27301.0min: [2024-01-03 22:19:23,906][model8_pretrain.py][INFO] Epoch:[0/2](271900/4588595) loss:2.917 lr:0.0000100 epoch_Time:27301.0min: [2024-01-03 22:20:00,834][model8_pretrain.py][INFO] Epoch:[0/2](272000/4588595) loss:2.413 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:20:00,834][model8_pretrain.py][INFO] Epoch:[0/2](272000/4588595) loss:2.902 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:20:00,834][model8_pretrain.py][INFO] Epoch:[0/2](272000/4588595) loss:2.566 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:20:00,834][model8_pretrain.py][INFO] Epoch:[0/2](272000/4588595) loss:2.927 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:20:00,834][model8_pretrain.py][INFO] Epoch:[0/2](272000/4588595) loss:3.417 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:20:00,834][model8_pretrain.py][INFO] Epoch:[0/2](272000/4588595) loss:2.825 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:20:00,834][model8_pretrain.py][INFO] Epoch:[0/2](272000/4588595) loss:3.299 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:20:00,834][model8_pretrain.py][INFO] Epoch:[0/2](272000/4588595) loss:3.232 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:20:39,511][model8_pretrain.py][INFO] Epoch:[0/2](272100/4588595) loss:2.800 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:20:39,511][model8_pretrain.py][INFO] Epoch:[0/2](272100/4588595) loss:3.166 lr:0.0000100 
epoch_Time:27299.0min: [2024-01-03 22:20:39,511][model8_pretrain.py][INFO] Epoch:[0/2](272100/4588595) loss:3.131 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:20:39,511][model8_pretrain.py][INFO] Epoch:[0/2](272100/4588595) loss:2.930 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:20:39,511][model8_pretrain.py][INFO] Epoch:[0/2](272100/4588595) loss:2.828 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:20:39,511][model8_pretrain.py][INFO] Epoch:[0/2](272100/4588595) loss:3.334 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:20:39,511][model8_pretrain.py][INFO] Epoch:[0/2](272100/4588595) loss:3.266 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:20:39,511][model8_pretrain.py][INFO] Epoch:[0/2](272100/4588595) loss:2.853 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:21:24,988][model8_pretrain.py][INFO] Epoch:[0/2](272200/4588595) loss:2.633 lr:0.0000100 epoch_Time:27301.0min: [2024-01-03 22:21:24,988][model8_pretrain.py][INFO] Epoch:[0/2](272200/4588595) loss:2.908 lr:0.0000100 epoch_Time:27301.0min: [2024-01-03 22:21:24,988][model8_pretrain.py][INFO] Epoch:[0/2](272200/4588595) loss:2.756 lr:0.0000100 epoch_Time:27301.0min: [2024-01-03 22:21:24,988][model8_pretrain.py][INFO] Epoch:[0/2](272200/4588595) loss:2.949 lr:0.0000100 epoch_Time:27301.0min: [2024-01-03 22:21:24,988][model8_pretrain.py][INFO] Epoch:[0/2](272200/4588595) loss:2.961 lr:0.0000100 epoch_Time:27301.0min: [2024-01-03 22:21:24,988][model8_pretrain.py][INFO] Epoch:[0/2](272200/4588595) loss:3.378 lr:0.0000100 epoch_Time:27301.0min: [2024-01-03 22:21:24,988][model8_pretrain.py][INFO] Epoch:[0/2](272200/4588595) loss:2.905 lr:0.0000100 epoch_Time:27301.0min: [2024-01-03 22:21:24,988][model8_pretrain.py][INFO] Epoch:[0/2](272200/4588595) loss:2.977 lr:0.0000100 epoch_Time:27301.0min: [2024-01-03 22:22:01,915][model8_pretrain.py][INFO] Epoch:[0/2](272300/4588595) loss:2.496 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:22:01,915][model8_pretrain.py][INFO] 
Epoch:[0/2](272300/4588595) loss:2.729 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:22:01,915][model8_pretrain.py][INFO] Epoch:[0/2](272300/4588595) loss:3.003 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:22:01,915][model8_pretrain.py][INFO] Epoch:[0/2](272300/4588595) loss:2.973 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:22:01,915][model8_pretrain.py][INFO] Epoch:[0/2](272300/4588595) loss:2.971 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:22:01,915][model8_pretrain.py][INFO] Epoch:[0/2](272300/4588595) loss:2.839 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:22:01,915][model8_pretrain.py][INFO] Epoch:[0/2](272300/4588595) loss:3.193 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:22:01,915][model8_pretrain.py][INFO] Epoch:[0/2](272300/4588595) loss:3.019 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:22:38,852][model8_pretrain.py][INFO] Epoch:[0/2](272400/4588595) loss:2.976 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:22:38,852][model8_pretrain.py][INFO] Epoch:[0/2](272400/4588595) loss:2.506 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:22:38,852][model8_pretrain.py][INFO] Epoch:[0/2](272400/4588595) loss:2.871 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:22:38,852][model8_pretrain.py][INFO] Epoch:[0/2](272400/4588595) loss:2.808 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:22:38,852][model8_pretrain.py][INFO] Epoch:[0/2](272400/4588595) loss:2.927 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:22:38,852][model8_pretrain.py][INFO] Epoch:[0/2](272400/4588595) loss:3.099 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:22:38,852][model8_pretrain.py][INFO] Epoch:[0/2](272400/4588595) loss:2.797 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:22:38,852][model8_pretrain.py][INFO] Epoch:[0/2](272400/4588595) loss:2.939 lr:0.0000100 epoch_Time:27299.0min: [2024-01-03 22:23:15,790][model8_pretrain.py][INFO] Epoch:[0/2](272500/4588595) loss:2.846 lr:0.0000100 epoch_Time:27298.0min: [2024-01-03 
22:23:15,791][model8_pretrain.py][INFO] Epoch:[0/2](272500/4588595) loss:2.645 lr:0.0000100 epoch_Time:27298.0min: [2024-01-03 22:23:15,791][model8_pretrain.py][INFO] Epoch:[0/2](272500/4588595) loss:2.355 lr:0.0000100 epoch_Time:27298.0min: [2024-01-03 22:23:15,791][model8_pretrain.py][INFO] Epoch:[0/2](272500/4588595) loss:3.002 lr:0.0000100 epoch_Time:27298.0min: [2024-01-03 22:23:15,791][model8_pretrain.py][INFO] Epoch:[0/2](272500/4588595) loss:2.848 lr:0.0000100 epoch_Time:27298.0min: [2024-01-03 22:23:15,791][model8_pretrain.py][INFO] Epoch:[0/2](272500/4588595) loss:2.984 lr:0.0000100 epoch_Time:27298.0min: [2024-01-03 22:23:15,791][model8_pretrain.py][INFO] Epoch:[0/2](272500/4588595) loss:3.085 lr:0.0000100 epoch_Time:27298.0min: [2024-01-03 22:23:15,791][model8_pretrain.py][INFO] Epoch:[0/2](272500/4588595) loss:3.058 lr:0.0000100 epoch_Time:27298.0min: [2024-01-03 22:23:52,733][model8_pretrain.py][INFO] Epoch:[0/2](272600/4588595) loss:3.116 lr:0.0000100 epoch_Time:27296.0min: [2024-01-03 22:23:52,733][model8_pretrain.py][INFO] Epoch:[0/2](272600/4588595) loss:2.658 lr:0.0000100 epoch_Time:27296.0min: [2024-01-03 22:23:52,733][model8_pretrain.py][INFO] Epoch:[0/2](272600/4588595) loss:3.341 lr:0.0000100 epoch_Time:27296.0min: [2024-01-03 22:23:52,733][model8_pretrain.py][INFO] Epoch:[0/2](272600/4588595) loss:2.725 lr:0.0000100 epoch_Time:27296.0min: [2024-01-03 22:23:52,733][model8_pretrain.py][INFO] Epoch:[0/2](272600/4588595) loss:2.845 lr:0.0000100 epoch_Time:27296.0min: [2024-01-03 22:23:52,733][model8_pretrain.py][INFO] Epoch:[0/2](272600/4588595) loss:2.990 lr:0.0000100 epoch_Time:27296.0min: [2024-01-03 22:23:52,733][model8_pretrain.py][INFO] Epoch:[0/2](272600/4588595) loss:3.035 lr:0.0000100 epoch_Time:27296.0min: [2024-01-03 22:23:52,733][model8_pretrain.py][INFO] Epoch:[0/2](272600/4588595) loss:2.386 lr:0.0000100 epoch_Time:27296.0min: [2024-01-03 22:24:29,668][model8_pretrain.py][INFO] Epoch:[0/2](272700/4588595) loss:2.994 lr:0.0000100 
epoch_Time:27296.0min: [2024-01-03 22:24:29,668][model8_pretrain.py][INFO] Epoch:[0/2](272700/4588595) loss:2.736 lr:0.0000100 epoch_Time:27296.0min: [2024-01-03 22:24:29,668][model8_pretrain.py][INFO] Epoch:[0/2](272700/4588595) loss:2.727 lr:0.0000100 epoch_Time:27296.0min: [2024-01-03 22:24:29,668][model8_pretrain.py][INFO] Epoch:[0/2](272700/4588595) loss:2.657 lr:0.0000100 epoch_Time:27296.0min: [2024-01-03 22:24:29,668][model8_pretrain.py][INFO] Epoch:[0/2](272700/4588595) loss:3.197 lr:0.0000100 epoch_Time:27296.0min: [2024-01-03 22:24:29,668][model8_pretrain.py][INFO] Epoch:[0/2](272700/4588595) loss:3.480 lr:0.0000100 epoch_Time:27296.0min: [2024-01-03 22:24:29,668][model8_pretrain.py][INFO] Epoch:[0/2](272700/4588595) loss:3.050 lr:0.0000100 epoch_Time:27296.0min: [2024-01-03 22:24:29,668][model8_pretrain.py][INFO] Epoch:[0/2](272700/4588595) loss:2.631 lr:0.0000100 epoch_Time:27296.0min: [2024-01-03 22:25:06,604][model8_pretrain.py][INFO] Epoch:[0/2](272800/4588595) loss:2.789 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:25:06,604][model8_pretrain.py][INFO] Epoch:[0/2](272800/4588595) loss:3.056 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:25:06,604][model8_pretrain.py][INFO] Epoch:[0/2](272800/4588595) loss:2.593 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:25:06,604][model8_pretrain.py][INFO] Epoch:[0/2](272800/4588595) loss:2.884 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:25:06,604][model8_pretrain.py][INFO] Epoch:[0/2](272800/4588595) loss:2.661 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:25:06,604][model8_pretrain.py][INFO] Epoch:[0/2](272800/4588595) loss:3.415 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:25:06,604][model8_pretrain.py][INFO] Epoch:[0/2](272800/4588595) loss:2.445 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:25:06,604][model8_pretrain.py][INFO] Epoch:[0/2](272800/4588595) loss:2.798 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:25:45,268][model8_pretrain.py][INFO] 
Epoch:[0/2](272900/4588595) loss:2.699 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:25:45,268][model8_pretrain.py][INFO] Epoch:[0/2](272900/4588595) loss:3.024 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:25:45,268][model8_pretrain.py][INFO] Epoch:[0/2](272900/4588595) loss:2.730 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:25:45,268][model8_pretrain.py][INFO] Epoch:[0/2](272900/4588595) loss:2.799 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:25:45,268][model8_pretrain.py][INFO] Epoch:[0/2](272900/4588595) loss:3.398 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:25:45,273][model8_pretrain.py][INFO] Epoch:[0/2](272900/4588595) loss:2.868 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:25:45,273][model8_pretrain.py][INFO] Epoch:[0/2](272900/4588595) loss:3.154 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:25:45,273][model8_pretrain.py][INFO] Epoch:[0/2](272900/4588595) loss:3.515 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:26:30,850][model8_pretrain.py][INFO] Epoch:[0/2](273000/4588595) loss:3.330 lr:0.0000100 epoch_Time:27296.0min: [2024-01-03 22:26:30,850][model8_pretrain.py][INFO] Epoch:[0/2](273000/4588595) loss:2.901 lr:0.0000100 epoch_Time:27296.0min: [2024-01-03 22:26:30,850][model8_pretrain.py][INFO] Epoch:[0/2](273000/4588595) loss:3.232 lr:0.0000100 epoch_Time:27296.0min: [2024-01-03 22:26:30,850][model8_pretrain.py][INFO] Epoch:[0/2](273000/4588595) loss:3.198 lr:0.0000100 epoch_Time:27296.0min: [2024-01-03 22:26:30,850][model8_pretrain.py][INFO] Epoch:[0/2](273000/4588595) loss:2.945 lr:0.0000100 epoch_Time:27296.0min: [2024-01-03 22:26:30,850][model8_pretrain.py][INFO] Epoch:[0/2](273000/4588595) loss:2.899 lr:0.0000100 epoch_Time:27296.0min: [2024-01-03 22:26:30,850][model8_pretrain.py][INFO] Epoch:[0/2](273000/4588595) loss:2.981 lr:0.0000100 epoch_Time:27296.0min: [2024-01-03 22:26:30,850][model8_pretrain.py][INFO] Epoch:[0/2](273000/4588595) loss:2.658 lr:0.0000100 epoch_Time:27296.0min: [2024-01-03 
22:27:07,789][model8_pretrain.py][INFO] Epoch:[0/2](273100/4588595) loss:3.246 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:27:07,789][model8_pretrain.py][INFO] Epoch:[0/2](273100/4588595) loss:3.339 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:27:07,789][model8_pretrain.py][INFO] Epoch:[0/2](273100/4588595) loss:2.517 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:27:07,789][model8_pretrain.py][INFO] Epoch:[0/2](273100/4588595) loss:2.714 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:27:07,789][model8_pretrain.py][INFO] Epoch:[0/2](273100/4588595) loss:2.673 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:27:07,789][model8_pretrain.py][INFO] Epoch:[0/2](273100/4588595) loss:2.839 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:27:07,789][model8_pretrain.py][INFO] Epoch:[0/2](273100/4588595) loss:3.294 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:27:07,789][model8_pretrain.py][INFO] Epoch:[0/2](273100/4588595) loss:3.151 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:27:44,695][model8_pretrain.py][INFO] Epoch:[0/2](273200/4588595) loss:2.297 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:27:44,695][model8_pretrain.py][INFO] Epoch:[0/2](273200/4588595) loss:3.043 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:27:44,695][model8_pretrain.py][INFO] Epoch:[0/2](273200/4588595) loss:2.787 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:27:44,695][model8_pretrain.py][INFO] Epoch:[0/2](273200/4588595) loss:2.286 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:27:44,695][model8_pretrain.py][INFO] Epoch:[0/2](273200/4588595) loss:2.970 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:27:44,695][model8_pretrain.py][INFO] Epoch:[0/2](273200/4588595) loss:2.752 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:27:44,695][model8_pretrain.py][INFO] Epoch:[0/2](273200/4588595) loss:3.180 lr:0.0000100 epoch_Time:27295.0min: [2024-01-03 22:27:44,695][model8_pretrain.py][INFO] Epoch:[0/2](273200/4588595) loss:2.658 lr:0.0000100 
epoch_Time:27295.0min: [2024-01-03 22:28:21,635][model8_pretrain.py][INFO] Epoch:[0/2](273300/4588595) loss:2.636 lr:0.0000100 epoch_Time:27293.0min: [2024-01-03 22:28:21,635][model8_pretrain.py][INFO] Epoch:[0/2](273300/4588595) loss:2.883 lr:0.0000100 epoch_Time:27293.0min: [2024-01-03 22:28:21,635][model8_pretrain.py][INFO] Epoch:[0/2](273300/4588595) loss:2.737 lr:0.0000100 epoch_Time:27293.0min: [2024-01-03 22:28:21,635][model8_pretrain.py][INFO] Epoch:[0/2](273300/4588595) loss:2.828 lr:0.0000100 epoch_Time:27293.0min: [2024-01-03 22:28:21,635][model8_pretrain.py][INFO] Epoch:[0/2](273300/4588595) loss:3.155 lr:0.0000100 epoch_Time:27293.0min: [2024-01-03 22:28:21,635][model8_pretrain.py][INFO] Epoch:[0/2](273300/4588595) loss:3.050 lr:0.0000100 epoch_Time:27293.0min: [2024-01-03 22:28:21,636][model8_pretrain.py][INFO] Epoch:[0/2](273300/4588595) loss:2.914 lr:0.0000100 epoch_Time:27293.0min: [2024-01-03 22:28:21,636][model8_pretrain.py][INFO] Epoch:[0/2](273300/4588595) loss:3.132 lr:0.0000100 epoch_Time:27293.0min: [2024-01-03 22:28:58,579][model8_pretrain.py][INFO] Epoch:[0/2](273400/4588595) loss:2.908 lr:0.0000100 epoch_Time:27292.0min: [2024-01-03 22:28:58,579][model8_pretrain.py][INFO] Epoch:[0/2](273400/4588595) loss:3.053 lr:0.0000100 epoch_Time:27292.0min: [2024-01-03 22:28:58,579][model8_pretrain.py][INFO] Epoch:[0/2](273400/4588595) loss:1.956 lr:0.0000100 epoch_Time:27292.0min: [2024-01-03 22:28:58,579][model8_pretrain.py][INFO] Epoch:[0/2](273400/4588595) loss:3.021 lr:0.0000100 epoch_Time:27292.0min: [2024-01-03 22:28:58,579][model8_pretrain.py][INFO] Epoch:[0/2](273400/4588595) loss:3.088 lr:0.0000100 epoch_Time:27292.0min: [2024-01-03 22:28:58,579][model8_pretrain.py][INFO] Epoch:[0/2](273400/4588595) loss:2.925 lr:0.0000100 epoch_Time:27292.0min: [2024-01-03 22:28:58,579][model8_pretrain.py][INFO] Epoch:[0/2](273400/4588595) loss:2.662 lr:0.0000100 epoch_Time:27292.0min: [2024-01-03 22:28:58,579][model8_pretrain.py][INFO] 
Epoch:[0/2](273400/4588595) loss:2.760 lr:0.0000100 epoch_Time:27292.0min: [2024-01-03 22:29:35,522][model8_pretrain.py][INFO] Epoch:[0/2](273500/4588595) loss:2.605 lr:0.0000100 epoch_Time:27292.0min: [2024-01-03 22:29:35,522][model8_pretrain.py][INFO] Epoch:[0/2](273500/4588595) loss:2.750 lr:0.0000100 epoch_Time:27292.0min: [2024-01-03 22:29:35,522][model8_pretrain.py][INFO] Epoch:[0/2](273500/4588595) loss:2.752 lr:0.0000100 epoch_Time:27292.0min: [2024-01-03 22:29:35,522][model8_pretrain.py][INFO] Epoch:[0/2](273500/4588595) loss:2.952 lr:0.0000100 epoch_Time:27292.0min: [2024-01-03 22:29:35,522][model8_pretrain.py][INFO] Epoch:[0/2](273500/4588595) loss:3.140 lr:0.0000100 epoch_Time:27292.0min: [2024-01-03 22:29:35,522][model8_pretrain.py][INFO] Epoch:[0/2](273500/4588595) loss:2.403 lr:0.0000100 epoch_Time:27292.0min: [2024-01-03 22:29:35,522][model8_pretrain.py][INFO] Epoch:[0/2](273500/4588595) loss:2.851 lr:0.0000100 epoch_Time:27292.0min: [2024-01-03 22:29:35,522][model8_pretrain.py][INFO] Epoch:[0/2](273500/4588595) loss:3.254 lr:0.0000100 epoch_Time:27292.0min: [2024-01-03 22:30:12,468][model8_pretrain.py][INFO] Epoch:[0/2](273600/4588595) loss:3.262 lr:0.0000100 epoch_Time:27290.0min: [2024-01-03 22:30:12,468][model8_pretrain.py][INFO] Epoch:[0/2](273600/4588595) loss:2.774 lr:0.0000100 epoch_Time:27290.0min: [2024-01-03 22:30:12,468][model8_pretrain.py][INFO] Epoch:[0/2](273600/4588595) loss:3.055 lr:0.0000100 epoch_Time:27290.0min: [2024-01-03 22:30:12,468][model8_pretrain.py][INFO] Epoch:[0/2](273600/4588595) loss:3.077 lr:0.0000100 epoch_Time:27290.0min: [2024-01-03 22:30:12,468][model8_pretrain.py][INFO] Epoch:[0/2](273600/4588595) loss:3.052 lr:0.0000100 epoch_Time:27290.0min: [2024-01-03 22:30:12,468][model8_pretrain.py][INFO] Epoch:[0/2](273600/4588595) loss:2.270 lr:0.0000100 epoch_Time:27290.0min: [2024-01-03 22:30:12,468][model8_pretrain.py][INFO] Epoch:[0/2](273600/4588595) loss:3.020 lr:0.0000100 epoch_Time:27290.0min: [2024-01-03 
22:30:12,468][model8_pretrain.py][INFO] Epoch:[0/2](273600/4588595) loss:2.878 lr:0.0000100 epoch_Time:27290.0min: [2024-01-03 22:30:49,410][model8_pretrain.py][INFO] Epoch:[0/2](273700/4588595) loss:3.288 lr:0.0000100 epoch_Time:27289.0min: [2024-01-03 22:30:49,410][model8_pretrain.py][INFO] Epoch:[0/2](273700/4588595) loss:2.729 lr:0.0000100 epoch_Time:27289.0min: [2024-01-03 22:30:49,410][model8_pretrain.py][INFO] Epoch:[0/2](273700/4588595) loss:3.021 lr:0.0000100 epoch_Time:27289.0min: [2024-01-03 22:30:49,410][model8_pretrain.py][INFO] Epoch:[0/2](273700/4588595) loss:3.116 lr:0.0000100 epoch_Time:27289.0min: [2024-01-03 22:30:49,410][model8_pretrain.py][INFO] Epoch:[0/2](273700/4588595) loss:2.533 lr:0.0000100 epoch_Time:27289.0min: [2024-01-03 22:30:49,410][model8_pretrain.py][INFO] Epoch:[0/2](273700/4588595) loss:3.188 lr:0.0000100 epoch_Time:27289.0min: [2024-01-03 22:30:49,411][model8_pretrain.py][INFO] Epoch:[0/2](273700/4588595) loss:2.865 lr:0.0000100 epoch_Time:27289.0min: [2024-01-03 22:30:49,411][model8_pretrain.py][INFO] Epoch:[0/2](273700/4588595) loss:2.836 lr:0.0000100 epoch_Time:27289.0min: [2024-01-03 22:31:36,878][model8_pretrain.py][INFO] Epoch:[0/2](273800/4588595) loss:3.009 lr:0.0000100 epoch_Time:27292.0min: [2024-01-03 22:31:36,878][model8_pretrain.py][INFO] Epoch:[0/2](273800/4588595) loss:3.295 lr:0.0000100 epoch_Time:27292.0min: [2024-01-03 22:31:36,878][model8_pretrain.py][INFO] Epoch:[0/2](273800/4588595) loss:3.025 lr:0.0000100 epoch_Time:27292.0min: [2024-01-03 22:31:36,878][model8_pretrain.py][INFO] Epoch:[0/2](273800/4588595) loss:2.806 lr:0.0000100 epoch_Time:27292.0min: [2024-01-03 22:31:36,879][model8_pretrain.py][INFO] Epoch:[0/2](273800/4588595) loss:2.828 lr:0.0000100 epoch_Time:27292.0min: [2024-01-03 22:31:36,879][model8_pretrain.py][INFO] Epoch:[0/2](273800/4588595) loss:2.679 lr:0.0000100 epoch_Time:27292.0min: [2024-01-03 22:31:36,879][model8_pretrain.py][INFO] Epoch:[0/2](273800/4588595) loss:2.650 lr:0.0000100 
epoch_Time:27292.0min: [2024-01-03 22:31:36,879][model8_pretrain.py][INFO] Epoch:[0/2](273800/4588595) loss:2.902 lr:0.0000100 epoch_Time:27292.0min: [2024-01-03 22:32:13,798][model8_pretrain.py][INFO] Epoch:[0/2](273900/4588595) loss:3.439 lr:0.0000100 epoch_Time:27291.0min: [2024-01-03 22:32:13,798][model8_pretrain.py][INFO] Epoch:[0/2](273900/4588595) loss:3.047 lr:0.0000100 epoch_Time:27291.0min: [2024-01-03 22:32:13,798][model8_pretrain.py][INFO] Epoch:[0/2](273900/4588595) loss:2.627 lr:0.0000100 epoch_Time:27291.0min: [2024-01-03 22:32:13,798][model8_pretrain.py][INFO] Epoch:[0/2](273900/4588595) loss:2.960 lr:0.0000100 epoch_Time:27291.0min: [2024-01-03 22:32:13,798][model8_pretrain.py][INFO] Epoch:[0/2](273900/4588595) loss:3.076 lr:0.0000100 epoch_Time:27291.0min: [2024-01-03 22:32:13,798][model8_pretrain.py][INFO] Epoch:[0/2](273900/4588595) loss:3.144 lr:0.0000100 epoch_Time:27291.0min: [2024-01-03 22:32:13,800][model8_pretrain.py][INFO] Epoch:[0/2](273900/4588595) loss:2.184 lr:0.0000100 epoch_Time:27291.0min: [2024-01-03 22:32:13,800][model8_pretrain.py][INFO] Epoch:[0/2](273900/4588595) loss:2.967 lr:0.0000100 epoch_Time:27291.0min: [2024-01-03 22:32:50,730][model8_pretrain.py][INFO] Epoch:[0/2](274000/4588595) loss:3.008 lr:0.0000100 epoch_Time:27289.0min: [2024-01-03 22:32:50,730][model8_pretrain.py][INFO] Epoch:[0/2](274000/4588595) loss:3.206 lr:0.0000100 epoch_Time:27289.0min: [2024-01-03 22:32:50,730][model8_pretrain.py][INFO] Epoch:[0/2](274000/4588595) loss:2.900 lr:0.0000100 epoch_Time:27289.0min: [2024-01-03 22:32:50,730][model8_pretrain.py][INFO] Epoch:[0/2](274000/4588595) loss:3.201 lr:0.0000100 epoch_Time:27289.0min: [2024-01-03 22:32:50,730][model8_pretrain.py][INFO] Epoch:[0/2](274000/4588595) loss:2.999 lr:0.0000100 epoch_Time:27289.0min: [2024-01-03 22:32:50,730][model8_pretrain.py][INFO] Epoch:[0/2](274000/4588595) loss:3.003 lr:0.0000100 epoch_Time:27289.0min: [2024-01-03 22:32:50,731][model8_pretrain.py][INFO] 
Epoch:[0/2](274000/4588595) loss:3.190 lr:0.0000100 epoch_Time:27289.0min: [2024-01-03 22:32:50,731][model8_pretrain.py][INFO] Epoch:[0/2](274000/4588595) loss:3.367 lr:0.0000100 epoch_Time:27289.0min: [2024-01-03 22:33:27,665][model8_pretrain.py][INFO] Epoch:[0/2](274100/4588595) loss:3.149 lr:0.0000100 epoch_Time:27289.0min: [2024-01-03 22:33:27,665][model8_pretrain.py][INFO] Epoch:[0/2](274100/4588595) loss:3.254 lr:0.0000100 epoch_Time:27289.0min: [2024-01-03 22:33:27,665][model8_pretrain.py][INFO] Epoch:[0/2](274100/4588595) loss:3.151 lr:0.0000100 epoch_Time:27289.0min: [2024-01-03 22:33:27,665][model8_pretrain.py][INFO] Epoch:[0/2](274100/4588595) loss:2.863 lr:0.0000100 epoch_Time:27289.0min: [2024-01-03 22:33:27,665][model8_pretrain.py][INFO] Epoch:[0/2](274100/4588595) loss:2.598 lr:0.0000100 epoch_Time:27289.0min: [2024-01-03 22:33:27,665][model8_pretrain.py][INFO] Epoch:[0/2](274100/4588595) loss:3.075 lr:0.0000100 epoch_Time:27289.0min: [2024-01-03 22:33:27,665][model8_pretrain.py][INFO] Epoch:[0/2](274100/4588595) loss:3.255 lr:0.0000100 epoch_Time:27289.0min: [2024-01-03 22:33:27,665][model8_pretrain.py][INFO] Epoch:[0/2](274100/4588595) loss:3.010 lr:0.0000100 epoch_Time:27289.0min: [2024-01-03 22:34:04,593][model8_pretrain.py][INFO] Epoch:[0/2](274200/4588595) loss:3.438 lr:0.0000100 epoch_Time:27288.0min: [2024-01-03 22:34:04,593][model8_pretrain.py][INFO] Epoch:[0/2](274200/4588595) loss:1.893 lr:0.0000100 epoch_Time:27288.0min: [2024-01-03 22:34:04,593][model8_pretrain.py][INFO] Epoch:[0/2](274200/4588595) loss:2.612 lr:0.0000100 epoch_Time:27288.0min: [2024-01-03 22:34:04,593][model8_pretrain.py][INFO] Epoch:[0/2](274200/4588595) loss:2.695 lr:0.0000100 epoch_Time:27288.0min: [2024-01-03 22:34:04,593][model8_pretrain.py][INFO] Epoch:[0/2](274200/4588595) loss:2.934 lr:0.0000100 epoch_Time:27288.0min: [2024-01-03 22:34:04,594][model8_pretrain.py][INFO] Epoch:[0/2](274200/4588595) loss:2.701 lr:0.0000100 epoch_Time:27288.0min: [2024-01-03 
22:34:04,594][model8_pretrain.py][INFO] Epoch:[0/2](274200/4588595) loss:2.885 lr:0.0000100 epoch_Time:27288.0min: [2024-01-03 22:34:04,594][model8_pretrain.py][INFO] Epoch:[0/2](274200/4588595) loss:2.973 lr:0.0000100 epoch_Time:27288.0min: [2024-01-03 22:34:41,524][model8_pretrain.py][INFO] Epoch:[0/2](274300/4588595) loss:2.946 lr:0.0000100 epoch_Time:27287.0min: [2024-01-03 22:34:41,524][model8_pretrain.py][INFO] Epoch:[0/2](274300/4588595) loss:3.446 lr:0.0000100 epoch_Time:27287.0min: [2024-01-03 22:34:41,524][model8_pretrain.py][INFO] Epoch:[0/2](274300/4588595) loss:2.766 lr:0.0000100 epoch_Time:27287.0min: [2024-01-03 22:34:41,524][model8_pretrain.py][INFO] Epoch:[0/2](274300/4588595) loss:3.059 lr:0.0000100 epoch_Time:27287.0min: [2024-01-03 22:34:41,524][model8_pretrain.py][INFO] Epoch:[0/2](274300/4588595) loss:2.819 lr:0.0000100 epoch_Time:27287.0min: [2024-01-03 22:34:41,524][model8_pretrain.py][INFO] Epoch:[0/2](274300/4588595) loss:2.967 lr:0.0000100 epoch_Time:27287.0min: [2024-01-03 22:34:41,524][model8_pretrain.py][INFO] Epoch:[0/2](274300/4588595) loss:3.113 lr:0.0000100 epoch_Time:27287.0min: [2024-01-03 22:34:41,524][model8_pretrain.py][INFO] Epoch:[0/2](274300/4588595) loss:2.981 lr:0.0000100 epoch_Time:27287.0min: [2024-01-03 22:35:18,463][model8_pretrain.py][INFO] Epoch:[0/2](274400/4588595) loss:3.018 lr:0.0000100 epoch_Time:27286.0min: [2024-01-03 22:35:18,463][model8_pretrain.py][INFO] Epoch:[0/2](274400/4588595) loss:2.283 lr:0.0000100 epoch_Time:27286.0min: [2024-01-03 22:35:18,463][model8_pretrain.py][INFO] Epoch:[0/2](274400/4588595) loss:2.704 lr:0.0000100 epoch_Time:27286.0min: [2024-01-03 22:35:18,463][model8_pretrain.py][INFO] Epoch:[0/2](274400/4588595) loss:3.357 lr:0.0000100 epoch_Time:27286.0min: [2024-01-03 22:35:18,463][model8_pretrain.py][INFO] Epoch:[0/2](274400/4588595) loss:3.207 lr:0.0000100 epoch_Time:27286.0min: [2024-01-03 22:35:18,463][model8_pretrain.py][INFO] Epoch:[0/2](274400/4588595) loss:2.812 lr:0.0000100 
epoch_Time:27286.0min: [2024-01-03 22:35:18,463][model8_pretrain.py][INFO] Epoch:[0/2](274400/4588595) loss:2.882 lr:0.0000100 epoch_Time:27286.0min: [2024-01-03 22:35:18,463][model8_pretrain.py][INFO] Epoch:[0/2](274400/4588595) loss:2.505 lr:0.0000100 epoch_Time:27286.0min: [2024-01-03 22:35:55,392][model8_pretrain.py][INFO] Epoch:[0/2](274500/4588595) loss:2.811 lr:0.0000100 epoch_Time:27285.0min: [2024-01-03 22:35:55,392][model8_pretrain.py][INFO] Epoch:[0/2](274500/4588595) loss:2.549 lr:0.0000100 epoch_Time:27285.0min: [2024-01-03 22:35:55,392][model8_pretrain.py][INFO] Epoch:[0/2](274500/4588595) loss:3.266 lr:0.0000100 epoch_Time:27285.0min: [2024-01-03 22:35:55,392][model8_pretrain.py][INFO] Epoch:[0/2](274500/4588595) loss:2.693 lr:0.0000100 epoch_Time:27285.0min: [2024-01-03 22:35:55,392][model8_pretrain.py][INFO] Epoch:[0/2](274500/4588595) loss:2.456 lr:0.0000100 epoch_Time:27285.0min: [2024-01-03 22:35:55,392][model8_pretrain.py][INFO] Epoch:[0/2](274500/4588595) loss:3.332 lr:0.0000100 epoch_Time:27285.0min: [2024-01-03 22:35:55,392][model8_pretrain.py][INFO] Epoch:[0/2](274500/4588595) loss:2.896 lr:0.0000100 epoch_Time:27285.0min: [2024-01-03 22:35:55,392][model8_pretrain.py][INFO] Epoch:[0/2](274500/4588595) loss:3.218 lr:0.0000100 epoch_Time:27285.0min: [2024-01-03 22:36:42,747][model8_pretrain.py][INFO] Epoch:[0/2](274600/4588595) loss:2.248 lr:0.0000100 epoch_Time:27288.0min: [2024-01-03 22:36:42,747][model8_pretrain.py][INFO] Epoch:[0/2](274600/4588595) loss:2.993 lr:0.0000100 epoch_Time:27287.0min: [2024-01-03 22:36:42,747][model8_pretrain.py][INFO] Epoch:[0/2](274600/4588595) loss:2.800 lr:0.0000100 epoch_Time:27287.0min: [2024-01-03 22:36:42,747][model8_pretrain.py][INFO] Epoch:[0/2](274600/4588595) loss:2.931 lr:0.0000100 epoch_Time:27287.0min: [2024-01-03 22:36:42,747][model8_pretrain.py][INFO] Epoch:[0/2](274600/4588595) loss:3.015 lr:0.0000100 epoch_Time:27287.0min: [2024-01-03 22:36:42,747][model8_pretrain.py][INFO] 
Epoch:[0/2](274600/4588595) loss:2.781 lr:0.0000100 epoch_Time:27288.0min: [2024-01-03 22:36:42,747][model8_pretrain.py][INFO] Epoch:[0/2](274600/4588595) loss:3.140 lr:0.0000100 epoch_Time:27287.0min: [2024-01-03 22:36:42,748][model8_pretrain.py][INFO] Epoch:[0/2](274600/4588595) loss:2.768 lr:0.0000100 epoch_Time:27287.0min: [2024-01-03 22:37:19,676][model8_pretrain.py][INFO] Epoch:[0/2](274700/4588595) loss:2.607 lr:0.0000100 epoch_Time:27286.0min: [2024-01-03 22:37:19,676][model8_pretrain.py][INFO] Epoch:[0/2](274700/4588595) loss:3.132 lr:0.0000100 epoch_Time:27286.0min: [2024-01-03 22:37:19,676][model8_pretrain.py][INFO] Epoch:[0/2](274700/4588595) loss:2.947 lr:0.0000100 epoch_Time:27286.0min: [2024-01-03 22:37:19,676][model8_pretrain.py][INFO] Epoch:[0/2](274700/4588595) loss:2.633 lr:0.0000100 epoch_Time:27286.0min: [2024-01-03 22:37:19,676][model8_pretrain.py][INFO] Epoch:[0/2](274700/4588595) loss:2.164 lr:0.0000100 epoch_Time:27286.0min: [2024-01-03 22:37:19,676][model8_pretrain.py][INFO] Epoch:[0/2](274700/4588595) loss:2.621 lr:0.0000100 epoch_Time:27286.0min: [2024-01-03 22:37:19,676][model8_pretrain.py][INFO] Epoch:[0/2](274700/4588595) loss:3.038 lr:0.0000100 epoch_Time:27286.0min: [2024-01-03 22:37:19,676][model8_pretrain.py][INFO] Epoch:[0/2](274700/4588595) loss:2.960 lr:0.0000100 epoch_Time:27286.0min: [2024-01-03 22:37:56,601][model8_pretrain.py][INFO] Epoch:[0/2](274800/4588595) loss:2.358 lr:0.0000100 epoch_Time:27285.0min: [2024-01-03 22:37:56,602][model8_pretrain.py][INFO] Epoch:[0/2](274800/4588595) loss:2.703 lr:0.0000100 epoch_Time:27285.0min: [2024-01-03 22:37:56,602][model8_pretrain.py][INFO] Epoch:[0/2](274800/4588595) loss:3.393 lr:0.0000100 epoch_Time:27285.0min: [2024-01-03 22:37:56,602][model8_pretrain.py][INFO] Epoch:[0/2](274800/4588595) loss:2.726 lr:0.0000100 epoch_Time:27285.0min: [2024-01-03 22:37:56,603][model8_pretrain.py][INFO] Epoch:[0/2](274800/4588595) loss:2.466 lr:0.0000100 epoch_Time:27285.0min: [2024-01-03 
22:37:56,603][model8_pretrain.py][INFO] Epoch:[0/2](274800/4588595) loss:3.141 lr:0.0000100 epoch_Time:27285.0min: [2024-01-03 22:37:56,603][model8_pretrain.py][INFO] Epoch:[0/2](274800/4588595) loss:3.171 lr:0.0000100 epoch_Time:27285.0min: [2024-01-03 22:37:56,604][model8_pretrain.py][INFO] Epoch:[0/2](274800/4588595) loss:3.095 lr:0.0000100 epoch_Time:27285.0min: [2024-01-03 22:38:33,537][model8_pretrain.py][INFO] Epoch:[0/2](274900/4588595) loss:3.442 lr:0.0000100 epoch_Time:27285.0min: [2024-01-03 22:38:33,537][model8_pretrain.py][INFO] Epoch:[0/2](274900/4588595) loss:2.973 lr:0.0000100 epoch_Time:27285.0min: [2024-01-03 22:38:33,537][model8_pretrain.py][INFO] Epoch:[0/2](274900/4588595) loss:3.148 lr:0.0000100 epoch_Time:27285.0min: [2024-01-03 22:38:33,537][model8_pretrain.py][INFO] Epoch:[0/2](274900/4588595) loss:2.367 lr:0.0000100 epoch_Time:27285.0min: [2024-01-03 22:38:33,537][model8_pretrain.py][INFO] Epoch:[0/2](274900/4588595) loss:2.928 lr:0.0000100 epoch_Time:27285.0min: [2024-01-03 22:38:33,537][model8_pretrain.py][INFO] Epoch:[0/2](274900/4588595) loss:3.216 lr:0.0000100 epoch_Time:27285.0min: [2024-01-03 22:38:33,537][model8_pretrain.py][INFO] Epoch:[0/2](274900/4588595) loss:2.900 lr:0.0000100 epoch_Time:27285.0min: [2024-01-03 22:38:33,537][model8_pretrain.py][INFO] Epoch:[0/2](274900/4588595) loss:2.352 lr:0.0000100 epoch_Time:27285.0min: [2024-01-03 22:39:10,467][model8_pretrain.py][INFO] Epoch:[0/2](275000/4588595) loss:3.007 lr:0.0000100 epoch_Time:27283.0min: [2024-01-03 22:39:10,467][model8_pretrain.py][INFO] Epoch:[0/2](275000/4588595) loss:2.846 lr:0.0000100 epoch_Time:27283.0min: [2024-01-03 22:39:10,467][model8_pretrain.py][INFO] Epoch:[0/2](275000/4588595) loss:2.566 lr:0.0000100 epoch_Time:27283.0min: [2024-01-03 22:39:10,467][model8_pretrain.py][INFO] Epoch:[0/2](275000/4588595) loss:3.060 lr:0.0000100 epoch_Time:27283.0min: [2024-01-03 22:39:10,467][model8_pretrain.py][INFO] Epoch:[0/2](275000/4588595) loss:3.249 lr:0.0000100 
epoch_Time:27283.0min: [2024-01-03 22:39:10,467][model8_pretrain.py][INFO] Epoch:[0/2](275000/4588595) loss:2.764 lr:0.0000100 epoch_Time:27283.0min: [2024-01-03 22:39:10,468][model8_pretrain.py][INFO] Epoch:[0/2](275000/4588595) loss:2.705 lr:0.0000100 epoch_Time:27283.0min: [2024-01-03 22:39:10,468][model8_pretrain.py][INFO] Epoch:[0/2](275000/4588595) loss:2.992 lr:0.0000100 epoch_Time:27283.0min: [2024-01-03 22:39:47,396][model8_pretrain.py][INFO] Epoch:[0/2](275100/4588595) loss:3.139 lr:0.0000100 epoch_Time:27283.0min: [2024-01-03 22:39:47,396][model8_pretrain.py][INFO] Epoch:[0/2](275100/4588595) loss:2.324 lr:0.0000100 epoch_Time:27283.0min: [2024-01-03 22:39:47,396][model8_pretrain.py][INFO] Epoch:[0/2](275100/4588595) loss:3.019 lr:0.0000100 epoch_Time:27283.0min: [2024-01-03 22:39:47,396][model8_pretrain.py][INFO] Epoch:[0/2](275100/4588595) loss:2.445 lr:0.0000100 epoch_Time:27283.0min: [2024-01-03 22:39:47,396][model8_pretrain.py][INFO] Epoch:[0/2](275100/4588595) loss:3.134 lr:0.0000100 epoch_Time:27283.0min: [2024-01-03 22:39:47,396][model8_pretrain.py][INFO] Epoch:[0/2](275100/4588595) loss:2.988 lr:0.0000100 epoch_Time:27283.0min: [2024-01-03 22:39:47,396][model8_pretrain.py][INFO] Epoch:[0/2](275100/4588595) loss:2.834 lr:0.0000100 epoch_Time:27283.0min: [2024-01-03 22:39:47,396][model8_pretrain.py][INFO] Epoch:[0/2](275100/4588595) loss:3.197 lr:0.0000100 epoch_Time:27283.0min: [2024-01-03 22:40:24,330][model8_pretrain.py][INFO] Epoch:[0/2](275200/4588595) loss:2.454 lr:0.0000100 epoch_Time:27282.0min: [2024-01-03 22:40:24,330][model8_pretrain.py][INFO] Epoch:[0/2](275200/4588595) loss:3.286 lr:0.0000100 epoch_Time:27282.0min: [2024-01-03 22:40:24,330][model8_pretrain.py][INFO] Epoch:[0/2](275200/4588595) loss:2.769 lr:0.0000100 epoch_Time:27282.0min: [2024-01-03 22:40:24,330][model8_pretrain.py][INFO] Epoch:[0/2](275200/4588595) loss:3.080 lr:0.0000100 epoch_Time:27282.0min: [2024-01-03 22:40:24,330][model8_pretrain.py][INFO] 
Epoch:[0/2](275200/4588595) loss:2.778 lr:0.0000100 epoch_Time:27282.0min: [2024-01-03 22:40:24,330][model8_pretrain.py][INFO] Epoch:[0/2](275200/4588595) loss:2.876 lr:0.0000100 epoch_Time:27282.0min: [2024-01-03 22:40:24,330][model8_pretrain.py][INFO] Epoch:[0/2](275200/4588595) loss:3.294 lr:0.0000100 epoch_Time:27282.0min: [2024-01-03 22:40:24,330][model8_pretrain.py][INFO] Epoch:[0/2](275200/4588595) loss:3.106 lr:0.0000100 epoch_Time:27282.0min: [2024-01-03 22:41:01,267][model8_pretrain.py][INFO] Epoch:[0/2](275300/4588595) loss:2.797 lr:0.0000100 epoch_Time:27281.0min: [2024-01-03 22:41:01,267][model8_pretrain.py][INFO] Epoch:[0/2](275300/4588595) loss:3.168 lr:0.0000100 epoch_Time:27281.0min: [2024-01-03 22:41:01,267][model8_pretrain.py][INFO] Epoch:[0/2](275300/4588595) loss:2.489 lr:0.0000100 epoch_Time:27281.0min: [2024-01-03 22:41:01,267][model8_pretrain.py][INFO] Epoch:[0/2](275300/4588595) loss:3.236 lr:0.0000100 epoch_Time:27281.0min: [2024-01-03 22:41:01,267][model8_pretrain.py][INFO] Epoch:[0/2](275300/4588595) loss:3.016 lr:0.0000100 epoch_Time:27281.0min: [2024-01-03 22:41:01,267][model8_pretrain.py][INFO] Epoch:[0/2](275300/4588595) loss:2.371 lr:0.0000100 epoch_Time:27281.0min: [2024-01-03 22:41:01,267][model8_pretrain.py][INFO] Epoch:[0/2](275300/4588595) loss:2.939 lr:0.0000100 epoch_Time:27281.0min: [2024-01-03 22:41:01,268][model8_pretrain.py][INFO] Epoch:[0/2](275300/4588595) loss:2.996 lr:0.0000100 epoch_Time:27281.0min: [2024-01-03 22:41:48,633][model8_pretrain.py][INFO] Epoch:[0/2](275400/4588595) loss:3.027 lr:0.0000100 epoch_Time:27282.0min: [2024-01-03 22:41:48,633][model8_pretrain.py][INFO] Epoch:[0/2](275400/4588595) loss:3.440 lr:0.0000100 epoch_Time:27282.0min: [2024-01-03 22:41:48,633][model8_pretrain.py][INFO] Epoch:[0/2](275400/4588595) loss:3.323 lr:0.0000100 epoch_Time:27282.0min: [2024-01-03 22:41:48,633][model8_pretrain.py][INFO] Epoch:[0/2](275400/4588595) loss:2.825 lr:0.0000100 epoch_Time:27282.0min: [2024-01-03 
22:41:48,633][model8_pretrain.py][INFO] Epoch:[0/2](275400/4588595) loss:2.752 lr:0.0000100 epoch_Time:27282.0min: [2024-01-03 22:41:48,633][model8_pretrain.py][INFO] Epoch:[0/2](275400/4588595) loss:3.079 lr:0.0000100 epoch_Time:27282.0min: [2024-01-03 22:41:48,633][model8_pretrain.py][INFO] Epoch:[0/2](275400/4588595) loss:2.769 lr:0.0000100 epoch_Time:27282.0min: [2024-01-03 22:41:48,633][model8_pretrain.py][INFO] Epoch:[0/2](275400/4588595) loss:2.225 lr:0.0000100 epoch_Time:27282.0min: [2024-01-03 22:42:25,567][model8_pretrain.py][INFO] Epoch:[0/2](275500/4588595) loss:3.261 lr:0.0000100 epoch_Time:27282.0min: [2024-01-03 22:42:25,567][model8_pretrain.py][INFO] Epoch:[0/2](275500/4588595) loss:2.903 lr:0.0000100 epoch_Time:27282.0min: [2024-01-03 22:42:25,567][model8_pretrain.py][INFO] Epoch:[0/2](275500/4588595) loss:2.250 lr:0.0000100 epoch_Time:27282.0min: [2024-01-03 22:42:25,567][model8_pretrain.py][INFO] Epoch:[0/2](275500/4588595) loss:2.780 lr:0.0000100 epoch_Time:27282.0min: [2024-01-03 22:42:25,567][model8_pretrain.py][INFO] Epoch:[0/2](275500/4588595) loss:2.850 lr:0.0000100 epoch_Time:27282.0min: [2024-01-03 22:42:25,567][model8_pretrain.py][INFO] Epoch:[0/2](275500/4588595) loss:3.234 lr:0.0000100 epoch_Time:27282.0min: [2024-01-03 22:42:25,567][model8_pretrain.py][INFO] Epoch:[0/2](275500/4588595) loss:2.554 lr:0.0000100 epoch_Time:27282.0min: [2024-01-03 22:42:25,567][model8_pretrain.py][INFO] Epoch:[0/2](275500/4588595) loss:2.648 lr:0.0000100 epoch_Time:27282.0min: [2024-01-03 22:43:02,504][model8_pretrain.py][INFO] Epoch:[0/2](275600/4588595) loss:1.805 lr:0.0000100 epoch_Time:27281.0min: [2024-01-03 22:43:02,504][model8_pretrain.py][INFO] Epoch:[0/2](275600/4588595) loss:2.851 lr:0.0000100 epoch_Time:27281.0min: [2024-01-03 22:43:02,504][model8_pretrain.py][INFO] Epoch:[0/2](275600/4588595) loss:3.402 lr:0.0000100 epoch_Time:27281.0min: [2024-01-03 22:43:02,504][model8_pretrain.py][INFO] Epoch:[0/2](275600/4588595) loss:3.136 lr:0.0000100 
epoch_Time:27281.0min: [2024-01-03 22:43:02,504][model8_pretrain.py][INFO] Epoch:[0/2](275600/4588595) loss:2.846 lr:0.0000100 epoch_Time:27281.0min: [2024-01-03 22:43:02,504][model8_pretrain.py][INFO] Epoch:[0/2](275600/4588595) loss:2.780 lr:0.0000100 epoch_Time:27281.0min: [2024-01-03 22:43:02,505][model8_pretrain.py][INFO] Epoch:[0/2](275600/4588595) loss:3.155 lr:0.0000100 epoch_Time:27281.0min: [2024-01-03 22:43:02,505][model8_pretrain.py][INFO] Epoch:[0/2](275600/4588595) loss:3.165 lr:0.0000100 epoch_Time:27281.0min: [2024-01-03 22:43:39,452][model8_pretrain.py][INFO] Epoch:[0/2](275700/4588595) loss:2.611 lr:0.0000100 epoch_Time:27280.0min: [2024-01-03 22:43:39,452][model8_pretrain.py][INFO] Epoch:[0/2](275700/4588595) loss:2.672 lr:0.0000100 epoch_Time:27280.0min: [2024-01-03 22:43:39,452][model8_pretrain.py][INFO] Epoch:[0/2](275700/4588595) loss:2.816 lr:0.0000100 epoch_Time:27280.0min: [2024-01-03 22:43:39,452][model8_pretrain.py][INFO] Epoch:[0/2](275700/4588595) loss:3.086 lr:0.0000100 epoch_Time:27280.0min: [2024-01-03 22:43:39,452][model8_pretrain.py][INFO] Epoch:[0/2](275700/4588595) loss:2.674 lr:0.0000100 epoch_Time:27280.0min: [2024-01-03 22:43:39,452][model8_pretrain.py][INFO] Epoch:[0/2](275700/4588595) loss:2.660 lr:0.0000100 epoch_Time:27280.0min: [2024-01-03 22:43:39,452][model8_pretrain.py][INFO] Epoch:[0/2](275700/4588595) loss:3.148 lr:0.0000100 epoch_Time:27280.0min: [2024-01-03 22:43:39,453][model8_pretrain.py][INFO] Epoch:[0/2](275700/4588595) loss:2.892 lr:0.0000100 epoch_Time:27280.0min: [2024-01-03 22:44:16,381][model8_pretrain.py][INFO] Epoch:[0/2](275800/4588595) loss:2.924 lr:0.0000100 epoch_Time:27279.0min: [2024-01-03 22:44:16,381][model8_pretrain.py][INFO] Epoch:[0/2](275800/4588595) loss:3.506 lr:0.0000100 epoch_Time:27279.0min: [2024-01-03 22:44:16,381][model8_pretrain.py][INFO] Epoch:[0/2](275800/4588595) loss:3.013 lr:0.0000100 epoch_Time:27279.0min: [2024-01-03 22:44:16,381][model8_pretrain.py][INFO] 
Epoch:[0/2](275800/4588595) loss:2.842 lr:0.0000100 epoch_Time:27279.0min: [2024-01-03 22:44:16,381][model8_pretrain.py][INFO] Epoch:[0/2](275800/4588595) loss:2.683 lr:0.0000100 epoch_Time:27279.0min: [2024-01-03 22:44:16,381][model8_pretrain.py][INFO] Epoch:[0/2](275800/4588595) loss:3.426 lr:0.0000100 epoch_Time:27279.0min: [2024-01-03 22:44:16,381][model8_pretrain.py][INFO] Epoch:[0/2](275800/4588595) loss:3.326 lr:0.0000100 epoch_Time:27279.0min: [2024-01-03 22:44:16,381][model8_pretrain.py][INFO] Epoch:[0/2](275800/4588595) loss:2.521 lr:0.0000100 epoch_Time:27279.0min: [2024-01-03 22:44:53,317][model8_pretrain.py][INFO] Epoch:[0/2](275900/4588595) loss:3.138 lr:0.0000100 epoch_Time:27278.0min: [2024-01-03 22:44:53,317][model8_pretrain.py][INFO] Epoch:[0/2](275900/4588595) loss:2.989 lr:0.0000100 epoch_Time:27278.0min: [2024-01-03 22:44:53,317][model8_pretrain.py][INFO] Epoch:[0/2](275900/4588595) loss:3.296 lr:0.0000100 epoch_Time:27278.0min: [2024-01-03 22:44:53,317][model8_pretrain.py][INFO] Epoch:[0/2](275900/4588595) loss:3.283 lr:0.0000100 epoch_Time:27278.0min: [2024-01-03 22:44:53,317][model8_pretrain.py][INFO] Epoch:[0/2](275900/4588595) loss:2.689 lr:0.0000100 epoch_Time:27278.0min: [2024-01-03 22:44:53,317][model8_pretrain.py][INFO] Epoch:[0/2](275900/4588595) loss:2.984 lr:0.0000100 epoch_Time:27278.0min: [2024-01-03 22:44:53,317][model8_pretrain.py][INFO] Epoch:[0/2](275900/4588595) loss:3.046 lr:0.0000100 epoch_Time:27278.0min: [2024-01-03 22:44:53,318][model8_pretrain.py][INFO] Epoch:[0/2](275900/4588595) loss:2.968 lr:0.0000100 epoch_Time:27278.0min: [2024-01-03 22:45:30,265][model8_pretrain.py][INFO] Epoch:[0/2](276000/4588595) loss:2.960 lr:0.0000100 epoch_Time:27277.0min: [2024-01-03 22:45:30,265][model8_pretrain.py][INFO] Epoch:[0/2](276000/4588595) loss:3.424 lr:0.0000100 epoch_Time:27277.0min: [2024-01-03 22:45:30,265][model8_pretrain.py][INFO] Epoch:[0/2](276000/4588595) loss:3.068 lr:0.0000100 epoch_Time:27277.0min: [2024-01-03 
22:45:30,265][model8_pretrain.py][INFO] Epoch:[0/2](276000/4588595) loss:2.764 lr:0.0000100 epoch_Time:27277.0min: [2024-01-03 22:45:30,265][model8_pretrain.py][INFO] Epoch:[0/2](276000/4588595) loss:3.047 lr:0.0000100 epoch_Time:27277.0min: [2024-01-03 22:45:30,265][model8_pretrain.py][INFO] Epoch:[0/2](276000/4588595) loss:2.863 lr:0.0000100 epoch_Time:27277.0min: [2024-01-03 22:45:30,265][model8_pretrain.py][INFO] Epoch:[0/2](276000/4588595) loss:2.884 lr:0.0000100 epoch_Time:27277.0min: [2024-01-03 22:45:30,265][model8_pretrain.py][INFO] Epoch:[0/2](276000/4588595) loss:2.651 lr:0.0000100 epoch_Time:27277.0min: [2024-01-03 22:46:07,209][model8_pretrain.py][INFO] Epoch:[0/2](276100/4588595) loss:3.026 lr:0.0000100 epoch_Time:27276.0min: [2024-01-03 22:46:07,209][model8_pretrain.py][INFO] Epoch:[0/2](276100/4588595) loss:2.997 lr:0.0000100 epoch_Time:27276.0min: [2024-01-03 22:46:07,209][model8_pretrain.py][INFO] Epoch:[0/2](276100/4588595) loss:3.000 lr:0.0000100 epoch_Time:27276.0min: [2024-01-03 22:46:07,209][model8_pretrain.py][INFO] Epoch:[0/2](276100/4588595) loss:3.104 lr:0.0000100 epoch_Time:27276.0min: [2024-01-03 22:46:07,209][model8_pretrain.py][INFO] Epoch:[0/2](276100/4588595) loss:2.930 lr:0.0000100 epoch_Time:27276.0min: [2024-01-03 22:46:07,209][model8_pretrain.py][INFO] Epoch:[0/2](276100/4588595) loss:2.258 lr:0.0000100 epoch_Time:27276.0min: [2024-01-03 22:46:07,209][model8_pretrain.py][INFO] Epoch:[0/2](276100/4588595) loss:3.290 lr:0.0000100 epoch_Time:27276.0min: [2024-01-03 22:46:07,209][model8_pretrain.py][INFO] Epoch:[0/2](276100/4588595) loss:3.158 lr:0.0000100 epoch_Time:27276.0min: [2024-01-03 22:46:54,546][model8_pretrain.py][INFO] Epoch:[0/2](276200/4588595) loss:3.349 lr:0.0000100 epoch_Time:27278.0min: [2024-01-03 22:46:54,547][model8_pretrain.py][INFO] Epoch:[0/2](276200/4588595) loss:2.505 lr:0.0000100 epoch_Time:27278.0min: [2024-01-03 22:46:54,547][model8_pretrain.py][INFO] Epoch:[0/2](276200/4588595) loss:2.793 lr:0.0000100 
epoch_Time:27278.0min: [2024-01-03 22:46:54,547][model8_pretrain.py][INFO] Epoch:[0/2](276200/4588595) loss:3.467 lr:0.0000100 epoch_Time:27278.0min: [2024-01-03 22:46:54,547][model8_pretrain.py][INFO] Epoch:[0/2](276200/4588595) loss:2.423 lr:0.0000100 epoch_Time:27278.0min: [2024-01-03 22:46:54,547][model8_pretrain.py][INFO] Epoch:[0/2](276200/4588595) loss:3.015 lr:0.0000100 epoch_Time:27278.0min: [2024-01-03 22:46:54,547][model8_pretrain.py][INFO] Epoch:[0/2](276200/4588595) loss:3.051 lr:0.0000100 epoch_Time:27278.0min: [2024-01-03 22:46:54,547][model8_pretrain.py][INFO] Epoch:[0/2](276200/4588595) loss:3.054 lr:0.0000100 epoch_Time:27278.0min: [2024-01-03 22:47:31,480][model8_pretrain.py][INFO] Epoch:[0/2](276300/4588595) loss:2.696 lr:0.0000100 epoch_Time:27277.0min: [2024-01-03 22:47:31,480][model8_pretrain.py][INFO] Epoch:[0/2](276300/4588595) loss:3.357 lr:0.0000100 epoch_Time:27277.0min: [2024-01-03 22:47:31,480][model8_pretrain.py][INFO] Epoch:[0/2](276300/4588595) loss:2.008 lr:0.0000100 epoch_Time:27277.0min: [2024-01-03 22:47:31,480][model8_pretrain.py][INFO] Epoch:[0/2](276300/4588595) loss:2.963 lr:0.0000100 epoch_Time:27277.0min: [2024-01-03 22:47:31,480][model8_pretrain.py][INFO] Epoch:[0/2](276300/4588595) loss:2.795 lr:0.0000100 epoch_Time:27277.0min: [2024-01-03 22:47:31,480][model8_pretrain.py][INFO] Epoch:[0/2](276300/4588595) loss:2.982 lr:0.0000100 epoch_Time:27277.0min: [2024-01-03 22:47:31,480][model8_pretrain.py][INFO] Epoch:[0/2](276300/4588595) loss:2.426 lr:0.0000100 epoch_Time:27277.0min: [2024-01-03 22:47:31,480][model8_pretrain.py][INFO] Epoch:[0/2](276300/4588595) loss:2.854 lr:0.0000100 epoch_Time:27277.0min: [2024-01-03 22:48:08,423][model8_pretrain.py][INFO] Epoch:[0/2](276400/4588595) loss:3.198 lr:0.0000100 epoch_Time:27276.0min: [2024-01-03 22:48:08,423][model8_pretrain.py][INFO] Epoch:[0/2](276400/4588595) loss:3.030 lr:0.0000100 epoch_Time:27276.0min: [2024-01-03 22:48:08,423][model8_pretrain.py][INFO] 
Epoch:[0/2](276400/4588595) loss:3.180 lr:0.0000100 epoch_Time:27276.0min: [2024-01-03 22:48:08,423][model8_pretrain.py][INFO] Epoch:[0/2](276400/4588595) loss:2.831 lr:0.0000100 epoch_Time:27276.0min: [2024-01-03 22:48:08,423][model8_pretrain.py][INFO] Epoch:[0/2](276400/4588595) loss:2.715 lr:0.0000100 epoch_Time:27276.0min: [2024-01-03 22:48:08,423][model8_pretrain.py][INFO] Epoch:[0/2](276400/4588595) loss:2.707 lr:0.0000100 epoch_Time:27276.0min: [2024-01-03 22:48:08,423][model8_pretrain.py][INFO] Epoch:[0/2](276400/4588595) loss:2.884 lr:0.0000100 epoch_Time:27276.0min: [2024-01-03 22:48:08,423][model8_pretrain.py][INFO] Epoch:[0/2](276400/4588595) loss:2.948 lr:0.0000100 epoch_Time:27276.0min: [2024-01-03 22:48:45,397][model8_pretrain.py][INFO] Epoch:[0/2](276500/4588595) loss:3.018 lr:0.0000100 epoch_Time:27276.0min: [2024-01-03 22:48:45,397][model8_pretrain.py][INFO] Epoch:[0/2](276500/4588595) loss:3.355 lr:0.0000100 epoch_Time:27276.0min: [2024-01-03 22:48:45,397][model8_pretrain.py][INFO] Epoch:[0/2](276500/4588595) loss:3.041 lr:0.0000100 epoch_Time:27276.0min: [2024-01-03 22:48:45,397][model8_pretrain.py][INFO] Epoch:[0/2](276500/4588595) loss:2.715 lr:0.0000100 epoch_Time:27276.0min: [2024-01-03 22:48:45,397][model8_pretrain.py][INFO] Epoch:[0/2](276500/4588595) loss:3.065 lr:0.0000100 epoch_Time:27276.0min: [2024-01-03 22:48:45,397][model8_pretrain.py][INFO] Epoch:[0/2](276500/4588595) loss:2.954 lr:0.0000100 epoch_Time:27276.0min: [2024-01-03 22:48:45,397][model8_pretrain.py][INFO] Epoch:[0/2](276500/4588595) loss:2.513 lr:0.0000100 epoch_Time:27276.0min: [2024-01-03 22:48:45,397][model8_pretrain.py][INFO] Epoch:[0/2](276500/4588595) loss:2.684 lr:0.0000100 epoch_Time:27276.0min: [2024-01-03 22:49:22,390][model8_pretrain.py][INFO] Epoch:[0/2](276600/4588595) loss:2.293 lr:0.0000100 epoch_Time:27275.0min: [2024-01-03 22:49:22,390][model8_pretrain.py][INFO] Epoch:[0/2](276600/4588595) loss:3.184 lr:0.0000100 epoch_Time:27275.0min: [2024-01-03 
22:49:22,390][model8_pretrain.py][INFO] Epoch:[0/2](276600/4588595) loss:2.635 lr:0.0000100 epoch_Time:27275.0min: [2024-01-03 22:49:22,390][model8_pretrain.py][INFO] Epoch:[0/2](276600/4588595) loss:2.602 lr:0.0000100 epoch_Time:27275.0min: [2024-01-03 22:49:22,390][model8_pretrain.py][INFO] Epoch:[0/2](276600/4588595) loss:3.166 lr:0.0000100 epoch_Time:27275.0min: [2024-01-03 22:49:22,390][model8_pretrain.py][INFO] Epoch:[0/2](276600/4588595) loss:2.886 lr:0.0000100 epoch_Time:27275.0min: [2024-01-03 22:49:22,390][model8_pretrain.py][INFO] Epoch:[0/2](276600/4588595) loss:2.967 lr:0.0000100 epoch_Time:27275.0min: [2024-01-03 22:49:22,390][model8_pretrain.py][INFO] Epoch:[0/2](276600/4588595) loss:2.897 lr:0.0000100 epoch_Time:27275.0min: [2024-01-03 22:49:59,352][model8_pretrain.py][INFO] Epoch:[0/2](276700/4588595) loss:3.186 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:49:59,352][model8_pretrain.py][INFO] Epoch:[0/2](276700/4588595) loss:3.124 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:49:59,352][model8_pretrain.py][INFO] Epoch:[0/2](276700/4588595) loss:3.036 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:49:59,352][model8_pretrain.py][INFO] Epoch:[0/2](276700/4588595) loss:3.168 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:49:59,352][model8_pretrain.py][INFO] Epoch:[0/2](276700/4588595) loss:3.079 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:49:59,352][model8_pretrain.py][INFO] Epoch:[0/2](276700/4588595) loss:3.080 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:49:59,352][model8_pretrain.py][INFO] Epoch:[0/2](276700/4588595) loss:2.987 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:49:59,353][model8_pretrain.py][INFO] Epoch:[0/2](276700/4588595) loss:2.335 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:50:36,295][model8_pretrain.py][INFO] Epoch:[0/2](276800/4588595) loss:3.261 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:50:36,295][model8_pretrain.py][INFO] Epoch:[0/2](276800/4588595) loss:3.026 lr:0.0000100 
epoch_Time:27273.0min: [2024-01-03 22:50:36,295][model8_pretrain.py][INFO] Epoch:[0/2](276800/4588595) loss:2.563 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:50:36,295][model8_pretrain.py][INFO] Epoch:[0/2](276800/4588595) loss:3.356 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:50:36,295][model8_pretrain.py][INFO] Epoch:[0/2](276800/4588595) loss:3.005 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:50:36,295][model8_pretrain.py][INFO] Epoch:[0/2](276800/4588595) loss:3.231 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:50:36,295][model8_pretrain.py][INFO] Epoch:[0/2](276800/4588595) loss:3.030 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:50:36,295][model8_pretrain.py][INFO] Epoch:[0/2](276800/4588595) loss:3.362 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:51:13,232][model8_pretrain.py][INFO] Epoch:[0/2](276900/4588595) loss:2.192 lr:0.0000100 epoch_Time:27272.0min: [2024-01-03 22:51:13,233][model8_pretrain.py][INFO] Epoch:[0/2](276900/4588595) loss:3.002 lr:0.0000100 epoch_Time:27272.0min: [2024-01-03 22:51:13,233][model8_pretrain.py][INFO] Epoch:[0/2](276900/4588595) loss:2.438 lr:0.0000100 epoch_Time:27272.0min: [2024-01-03 22:51:13,233][model8_pretrain.py][INFO] Epoch:[0/2](276900/4588595) loss:3.095 lr:0.0000100 epoch_Time:27272.0min: [2024-01-03 22:51:13,233][model8_pretrain.py][INFO] Epoch:[0/2](276900/4588595) loss:3.178 lr:0.0000100 epoch_Time:27272.0min: [2024-01-03 22:51:13,233][model8_pretrain.py][INFO] Epoch:[0/2](276900/4588595) loss:2.874 lr:0.0000100 epoch_Time:27272.0min: [2024-01-03 22:51:13,233][model8_pretrain.py][INFO] Epoch:[0/2](276900/4588595) loss:3.209 lr:0.0000100 epoch_Time:27272.0min: [2024-01-03 22:51:13,233][model8_pretrain.py][INFO] Epoch:[0/2](276900/4588595) loss:2.530 lr:0.0000100 epoch_Time:27272.0min: [2024-01-03 22:52:00,593][model8_pretrain.py][INFO] Epoch:[0/2](277000/4588595) loss:2.686 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:52:00,593][model8_pretrain.py][INFO] 
Epoch:[0/2](277000/4588595) loss:3.288 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:52:00,593][model8_pretrain.py][INFO] Epoch:[0/2](277000/4588595) loss:2.676 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:52:00,593][model8_pretrain.py][INFO] Epoch:[0/2](277000/4588595) loss:2.962 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:52:00,594][model8_pretrain.py][INFO] Epoch:[0/2](277000/4588595) loss:3.116 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:52:00,594][model8_pretrain.py][INFO] Epoch:[0/2](277000/4588595) loss:2.217 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:52:00,594][model8_pretrain.py][INFO] Epoch:[0/2](277000/4588595) loss:2.366 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:52:00,594][model8_pretrain.py][INFO] Epoch:[0/2](277000/4588595) loss:3.304 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:52:37,526][model8_pretrain.py][INFO] Epoch:[0/2](277100/4588595) loss:2.697 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:52:37,526][model8_pretrain.py][INFO] Epoch:[0/2](277100/4588595) loss:2.944 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:52:37,526][model8_pretrain.py][INFO] Epoch:[0/2](277100/4588595) loss:2.911 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:52:37,526][model8_pretrain.py][INFO] Epoch:[0/2](277100/4588595) loss:2.968 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:52:37,526][model8_pretrain.py][INFO] Epoch:[0/2](277100/4588595) loss:2.691 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:52:37,526][model8_pretrain.py][INFO] Epoch:[0/2](277100/4588595) loss:3.009 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:52:37,526][model8_pretrain.py][INFO] Epoch:[0/2](277100/4588595) loss:2.812 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:52:37,526][model8_pretrain.py][INFO] Epoch:[0/2](277100/4588595) loss:2.786 lr:0.0000100 epoch_Time:27273.0min: [2024-01-03 22:53:14,494][model8_pretrain.py][INFO] Epoch:[0/2](277200/4588595) loss:2.932 lr:0.0000100 epoch_Time:27272.0min: [2024-01-03 
22:53:14,494][model8_pretrain.py][INFO] Epoch:[0/2](277200/4588595) loss:3.174 lr:0.0000100 epoch_Time:27272.0min: [2024-01-03 22:53:14,494][model8_pretrain.py][INFO] Epoch:[0/2](277200/4588595) loss:3.295 lr:0.0000100 epoch_Time:27272.0min: [2024-01-03 22:53:14,494][model8_pretrain.py][INFO] Epoch:[0/2](277200/4588595) loss:2.867 lr:0.0000100 epoch_Time:27272.0min: [2024-01-03 22:53:14,494][model8_pretrain.py][INFO] Epoch:[0/2](277200/4588595) loss:2.685 lr:0.0000100 epoch_Time:27272.0min: [2024-01-03 22:53:14,494][model8_pretrain.py][INFO] Epoch:[0/2](277200/4588595) loss:3.039 lr:0.0000100 epoch_Time:27272.0min: [2024-01-03 22:53:14,494][model8_pretrain.py][INFO] Epoch:[0/2](277200/4588595) loss:3.172 lr:0.0000100 epoch_Time:27272.0min: [2024-01-03 22:53:14,494][model8_pretrain.py][INFO] Epoch:[0/2](277200/4588595) loss:3.315 lr:0.0000100 epoch_Time:27272.0min: [2024-01-03 22:53:51,431][model8_pretrain.py][INFO] Epoch:[0/2](277300/4588595) loss:2.851 lr:0.0000100 epoch_Time:27271.0min: [2024-01-03 22:53:51,431][model8_pretrain.py][INFO] Epoch:[0/2](277300/4588595) loss:3.009 lr:0.0000100 epoch_Time:27271.0min: [2024-01-03 22:53:51,431][model8_pretrain.py][INFO] Epoch:[0/2](277300/4588595) loss:2.735 lr:0.0000100 epoch_Time:27271.0min: [2024-01-03 22:53:51,431][model8_pretrain.py][INFO] Epoch:[0/2](277300/4588595) loss:2.716 lr:0.0000100 epoch_Time:27271.0min: [2024-01-03 22:53:51,431][model8_pretrain.py][INFO] Epoch:[0/2](277300/4588595) loss:3.072 lr:0.0000100 epoch_Time:27271.0min: [2024-01-03 22:53:51,431][model8_pretrain.py][INFO] Epoch:[0/2](277300/4588595) loss:3.166 lr:0.0000100 epoch_Time:27271.0min: [2024-01-03 22:53:51,431][model8_pretrain.py][INFO] Epoch:[0/2](277300/4588595) loss:2.974 lr:0.0000100 epoch_Time:27271.0min: [2024-01-03 22:53:51,432][model8_pretrain.py][INFO] Epoch:[0/2](277300/4588595) loss:2.790 lr:0.0000100 epoch_Time:27271.0min: [2024-01-03 22:54:28,378][model8_pretrain.py][INFO] Epoch:[0/2](277400/4588595) loss:3.397 lr:0.0000100 
epoch_Time:27270.0min: [2024-01-03 22:54:28,378][model8_pretrain.py][INFO] Epoch:[0/2](277400/4588595) loss:2.944 lr:0.0000100 epoch_Time:27270.0min: [2024-01-03 22:54:28,378][model8_pretrain.py][INFO] Epoch:[0/2](277400/4588595) loss:2.440 lr:0.0000100 epoch_Time:27270.0min: [2024-01-03 22:54:28,378][model8_pretrain.py][INFO] Epoch:[0/2](277400/4588595) loss:3.307 lr:0.0000100 epoch_Time:27270.0min: [2024-01-03 22:54:28,378][model8_pretrain.py][INFO] Epoch:[0/2](277400/4588595) loss:2.919 lr:0.0000100 epoch_Time:27270.0min: [2024-01-03 22:54:28,378][model8_pretrain.py][INFO] Epoch:[0/2](277400/4588595) loss:2.850 lr:0.0000100 epoch_Time:27270.0min: [2024-01-03 22:54:28,378][model8_pretrain.py][INFO] Epoch:[0/2](277400/4588595) loss:3.669 lr:0.0000100 epoch_Time:27270.0min: [2024-01-03 22:54:28,378][model8_pretrain.py][INFO] Epoch:[0/2](277400/4588595) loss:3.353 lr:0.0000100 epoch_Time:27270.0min: [2024-01-03 22:55:05,315][model8_pretrain.py][INFO] Epoch:[0/2](277500/4588595) loss:2.383 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:55:05,315][model8_pretrain.py][INFO] Epoch:[0/2](277500/4588595) loss:2.764 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:55:05,315][model8_pretrain.py][INFO] Epoch:[0/2](277500/4588595) loss:3.029 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:55:05,315][model8_pretrain.py][INFO] Epoch:[0/2](277500/4588595) loss:2.658 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:55:05,315][model8_pretrain.py][INFO] Epoch:[0/2](277500/4588595) loss:2.825 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:55:05,315][model8_pretrain.py][INFO] Epoch:[0/2](277500/4588595) loss:2.908 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:55:05,315][model8_pretrain.py][INFO] Epoch:[0/2](277500/4588595) loss:2.900 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:55:05,316][model8_pretrain.py][INFO] Epoch:[0/2](277500/4588595) loss:3.291 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:55:42,248][model8_pretrain.py][INFO] 
Epoch:[0/2](277600/4588595) loss:3.007 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:55:42,248][model8_pretrain.py][INFO] Epoch:[0/2](277600/4588595) loss:2.819 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:55:42,248][model8_pretrain.py][INFO] Epoch:[0/2](277600/4588595) loss:2.794 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:55:42,248][model8_pretrain.py][INFO] Epoch:[0/2](277600/4588595) loss:2.931 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:55:42,248][model8_pretrain.py][INFO] Epoch:[0/2](277600/4588595) loss:3.173 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:55:42,248][model8_pretrain.py][INFO] Epoch:[0/2](277600/4588595) loss:2.910 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:55:42,248][model8_pretrain.py][INFO] Epoch:[0/2](277600/4588595) loss:2.968 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:55:42,248][model8_pretrain.py][INFO] Epoch:[0/2](277600/4588595) loss:2.785 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:56:19,192][model8_pretrain.py][INFO] Epoch:[0/2](277700/4588595) loss:3.057 lr:0.0000100 epoch_Time:27267.0min: [2024-01-03 22:56:19,192][model8_pretrain.py][INFO] Epoch:[0/2](277700/4588595) loss:2.849 lr:0.0000100 epoch_Time:27267.0min: [2024-01-03 22:56:19,192][model8_pretrain.py][INFO] Epoch:[0/2](277700/4588595) loss:2.832 lr:0.0000100 epoch_Time:27267.0min: [2024-01-03 22:56:19,192][model8_pretrain.py][INFO] Epoch:[0/2](277700/4588595) loss:3.038 lr:0.0000100 epoch_Time:27267.0min: [2024-01-03 22:56:19,193][model8_pretrain.py][INFO] Epoch:[0/2](277700/4588595) loss:2.995 lr:0.0000100 epoch_Time:27267.0min: [2024-01-03 22:56:19,193][model8_pretrain.py][INFO] Epoch:[0/2](277700/4588595) loss:2.666 lr:0.0000100 epoch_Time:27267.0min: [2024-01-03 22:56:19,193][model8_pretrain.py][INFO] Epoch:[0/2](277700/4588595) loss:2.917 lr:0.0000100 epoch_Time:27267.0min: [2024-01-03 22:56:19,193][model8_pretrain.py][INFO] Epoch:[0/2](277700/4588595) loss:2.380 lr:0.0000100 epoch_Time:27267.0min: [2024-01-03 
22:57:06,533][model8_pretrain.py][INFO] Epoch:[0/2](277800/4588595) loss:2.864 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:57:06,533][model8_pretrain.py][INFO] Epoch:[0/2](277800/4588595) loss:2.907 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:57:06,533][model8_pretrain.py][INFO] Epoch:[0/2](277800/4588595) loss:2.984 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:57:06,533][model8_pretrain.py][INFO] Epoch:[0/2](277800/4588595) loss:2.620 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:57:06,533][model8_pretrain.py][INFO] Epoch:[0/2](277800/4588595) loss:3.324 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:57:06,533][model8_pretrain.py][INFO] Epoch:[0/2](277800/4588595) loss:2.776 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:57:06,534][model8_pretrain.py][INFO] Epoch:[0/2](277800/4588595) loss:3.373 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:57:06,534][model8_pretrain.py][INFO] Epoch:[0/2](277800/4588595) loss:2.796 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:57:43,462][model8_pretrain.py][INFO] Epoch:[0/2](277900/4588595) loss:3.164 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:57:43,462][model8_pretrain.py][INFO] Epoch:[0/2](277900/4588595) loss:2.607 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:57:43,462][model8_pretrain.py][INFO] Epoch:[0/2](277900/4588595) loss:2.581 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:57:43,462][model8_pretrain.py][INFO] Epoch:[0/2](277900/4588595) loss:3.225 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:57:43,462][model8_pretrain.py][INFO] Epoch:[0/2](277900/4588595) loss:2.760 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:57:43,462][model8_pretrain.py][INFO] Epoch:[0/2](277900/4588595) loss:3.317 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:57:43,462][model8_pretrain.py][INFO] Epoch:[0/2](277900/4588595) loss:2.986 lr:0.0000100 epoch_Time:27269.0min: [2024-01-03 22:57:43,462][model8_pretrain.py][INFO] Epoch:[0/2](277900/4588595) loss:2.879 lr:0.0000100 
epoch_Time:27269.0min: [2024-01-03 22:58:20,395][model8_pretrain.py][INFO] Epoch:[0/2](278000/4588595) loss:2.695 lr:0.0000100 epoch_Time:27267.0min: [2024-01-03 22:58:20,395][model8_pretrain.py][INFO] Epoch:[0/2](278000/4588595) loss:2.729 lr:0.0000100 epoch_Time:27268.0min: [2024-01-03 22:58:20,395][model8_pretrain.py][INFO] Epoch:[0/2](278000/4588595) loss:2.968 lr:0.0000100 epoch_Time:27268.0min: [2024-01-03 22:58:20,395][model8_pretrain.py][INFO] Epoch:[0/2](278000/4588595) loss:2.600 lr:0.0000100 epoch_Time:27267.0min: [2024-01-03 22:58:20,395][model8_pretrain.py][INFO] Epoch:[0/2](278000/4588595) loss:2.911 lr:0.0000100 epoch_Time:27268.0min: [2024-01-03 22:58:20,395][model8_pretrain.py][INFO] Epoch:[0/2](278000/4588595) loss:3.001 lr:0.0000100 epoch_Time:27268.0min: [2024-01-03 22:58:20,395][model8_pretrain.py][INFO] Epoch:[0/2](278000/4588595) loss:3.416 lr:0.0000100 epoch_Time:27268.0min: [2024-01-03 22:58:20,395][model8_pretrain.py][INFO] Epoch:[0/2](278000/4588595) loss:2.782 lr:0.0000100 epoch_Time:27268.0min: [2024-01-03 22:58:57,338][model8_pretrain.py][INFO] Epoch:[0/2](278100/4588595) loss:2.667 lr:0.0000100 epoch_Time:27266.0min: [2024-01-03 22:58:57,338][model8_pretrain.py][INFO] Epoch:[0/2](278100/4588595) loss:3.050 lr:0.0000100 epoch_Time:27266.0min: [2024-01-03 22:58:57,338][model8_pretrain.py][INFO] Epoch:[0/2](278100/4588595) loss:3.009 lr:0.0000100 epoch_Time:27266.0min: [2024-01-03 22:58:57,339][model8_pretrain.py][INFO] Epoch:[0/2](278100/4588595) loss:2.656 lr:0.0000100 epoch_Time:27266.0min: [2024-01-03 22:58:57,339][model8_pretrain.py][INFO] Epoch:[0/2](278100/4588595) loss:3.247 lr:0.0000100 epoch_Time:27266.0min: [2024-01-03 22:58:57,339][model8_pretrain.py][INFO] Epoch:[0/2](278100/4588595) loss:2.974 lr:0.0000100 epoch_Time:27266.0min: [2024-01-03 22:58:57,339][model8_pretrain.py][INFO] Epoch:[0/2](278100/4588595) loss:2.940 lr:0.0000100 epoch_Time:27266.0min: [2024-01-03 22:58:57,339][model8_pretrain.py][INFO] 
Epoch:[0/2](278100/4588595) loss:3.077 lr:0.0000100 epoch_Time:27266.0min: [2024-01-03 22:59:34,277][model8_pretrain.py][INFO] Epoch:[0/2](278200/4588595) loss:2.710 lr:0.0000100 epoch_Time:27266.0min: [2024-01-03 22:59:34,277][model8_pretrain.py][INFO] Epoch:[0/2](278200/4588595) loss:2.895 lr:0.0000100 epoch_Time:27266.0min: [2024-01-03 22:59:34,277][model8_pretrain.py][INFO] Epoch:[0/2](278200/4588595) loss:3.393 lr:0.0000100 epoch_Time:27266.0min: [2024-01-03 22:59:34,277][model8_pretrain.py][INFO] Epoch:[0/2](278200/4588595) loss:2.860 lr:0.0000100 epoch_Time:27266.0min: [2024-01-03 22:59:34,277][model8_pretrain.py][INFO] Epoch:[0/2](278200/4588595) loss:2.158 lr:0.0000100 epoch_Time:27266.0min: [2024-01-03 22:59:34,277][model8_pretrain.py][INFO] Epoch:[0/2](278200/4588595) loss:2.935 lr:0.0000100 epoch_Time:27266.0min: [2024-01-03 22:59:34,277][model8_pretrain.py][INFO] Epoch:[0/2](278200/4588595) loss:3.075 lr:0.0000100 epoch_Time:27266.0min: [2024-01-03 22:59:34,278][model8_pretrain.py][INFO] Epoch:[0/2](278200/4588595) loss:2.880 lr:0.0000100 epoch_Time:27266.0min: [2024-01-03 23:00:11,231][model8_pretrain.py][INFO] Epoch:[0/2](278300/4588595) loss:2.646 lr:0.0000100 epoch_Time:27265.0min: [2024-01-03 23:00:11,231][model8_pretrain.py][INFO] Epoch:[0/2](278300/4588595) loss:3.264 lr:0.0000100 epoch_Time:27265.0min: [2024-01-03 23:00:11,231][model8_pretrain.py][INFO] Epoch:[0/2](278300/4588595) loss:3.400 lr:0.0000100 epoch_Time:27265.0min: [2024-01-03 23:00:11,231][model8_pretrain.py][INFO] Epoch:[0/2](278300/4588595) loss:3.489 lr:0.0000100 epoch_Time:27265.0min: [2024-01-03 23:00:11,231][model8_pretrain.py][INFO] Epoch:[0/2](278300/4588595) loss:3.221 lr:0.0000100 epoch_Time:27265.0min: [2024-01-03 23:00:11,231][model8_pretrain.py][INFO] Epoch:[0/2](278300/4588595) loss:3.143 lr:0.0000100 epoch_Time:27265.0min: [2024-01-03 23:00:11,231][model8_pretrain.py][INFO] Epoch:[0/2](278300/4588595) loss:2.929 lr:0.0000100 epoch_Time:27265.0min: [2024-01-03 
23:00:11,231][model8_pretrain.py][INFO] Epoch:[0/2](278300/4588595) loss:2.683 lr:0.0000100 epoch_Time:27265.0min: [2024-01-03 23:00:48,149][model8_pretrain.py][INFO] Epoch:[0/2](278400/4588595) loss:2.921 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:00:48,149][model8_pretrain.py][INFO] Epoch:[0/2](278400/4588595) loss:2.860 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:00:48,149][model8_pretrain.py][INFO] Epoch:[0/2](278400/4588595) loss:2.982 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:00:48,149][model8_pretrain.py][INFO] Epoch:[0/2](278400/4588595) loss:3.559 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:00:48,149][model8_pretrain.py][INFO] Epoch:[0/2](278400/4588595) loss:2.795 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:00:48,149][model8_pretrain.py][INFO] Epoch:[0/2](278400/4588595) loss:2.659 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:00:48,149][model8_pretrain.py][INFO] Epoch:[0/2](278400/4588595) loss:2.782 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:00:48,149][model8_pretrain.py][INFO] Epoch:[0/2](278400/4588595) loss:3.212 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:01:25,090][model8_pretrain.py][INFO] Epoch:[0/2](278500/4588595) loss:3.116 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:01:25,090][model8_pretrain.py][INFO] Epoch:[0/2](278500/4588595) loss:3.198 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:01:25,090][model8_pretrain.py][INFO] Epoch:[0/2](278500/4588595) loss:2.698 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:01:25,090][model8_pretrain.py][INFO] Epoch:[0/2](278500/4588595) loss:3.021 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:01:25,090][model8_pretrain.py][INFO] Epoch:[0/2](278500/4588595) loss:2.256 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:01:25,090][model8_pretrain.py][INFO] Epoch:[0/2](278500/4588595) loss:3.066 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:01:25,090][model8_pretrain.py][INFO] Epoch:[0/2](278500/4588595) loss:2.904 lr:0.0000100 
epoch_Time:27263.0min: [2024-01-03 23:01:25,090][model8_pretrain.py][INFO] Epoch:[0/2](278500/4588595) loss:2.872 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:02:12,474][model8_pretrain.py][INFO] Epoch:[0/2](278600/4588595) loss:2.424 lr:0.0000100 epoch_Time:27265.0min: [2024-01-03 23:02:12,474][model8_pretrain.py][INFO] Epoch:[0/2](278600/4588595) loss:2.477 lr:0.0000100 epoch_Time:27265.0min: [2024-01-03 23:02:12,474][model8_pretrain.py][INFO] Epoch:[0/2](278600/4588595) loss:2.900 lr:0.0000100 epoch_Time:27265.0min: [2024-01-03 23:02:12,474][model8_pretrain.py][INFO] Epoch:[0/2](278600/4588595) loss:2.987 lr:0.0000100 epoch_Time:27265.0min: [2024-01-03 23:02:12,474][model8_pretrain.py][INFO] Epoch:[0/2](278600/4588595) loss:2.851 lr:0.0000100 epoch_Time:27265.0min: [2024-01-03 23:02:12,474][model8_pretrain.py][INFO] Epoch:[0/2](278600/4588595) loss:3.291 lr:0.0000100 epoch_Time:27265.0min: [2024-01-03 23:02:12,474][model8_pretrain.py][INFO] Epoch:[0/2](278600/4588595) loss:3.145 lr:0.0000100 epoch_Time:27265.0min: [2024-01-03 23:02:12,474][model8_pretrain.py][INFO] Epoch:[0/2](278600/4588595) loss:2.987 lr:0.0000100 epoch_Time:27265.0min: [2024-01-03 23:02:49,399][model8_pretrain.py][INFO] Epoch:[0/2](278700/4588595) loss:2.784 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:02:49,399][model8_pretrain.py][INFO] Epoch:[0/2](278700/4588595) loss:2.950 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:02:49,399][model8_pretrain.py][INFO] Epoch:[0/2](278700/4588595) loss:2.800 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:02:49,399][model8_pretrain.py][INFO] Epoch:[0/2](278700/4588595) loss:3.214 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:02:49,399][model8_pretrain.py][INFO] Epoch:[0/2](278700/4588595) loss:3.157 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:02:49,399][model8_pretrain.py][INFO] Epoch:[0/2](278700/4588595) loss:2.846 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:02:49,399][model8_pretrain.py][INFO] 
Epoch:[0/2](278700/4588595) loss:2.645 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:02:49,399][model8_pretrain.py][INFO] Epoch:[0/2](278700/4588595) loss:2.907 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:03:26,329][model8_pretrain.py][INFO] Epoch:[0/2](278800/4588595) loss:3.167 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:03:26,329][model8_pretrain.py][INFO] Epoch:[0/2](278800/4588595) loss:3.057 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:03:26,330][model8_pretrain.py][INFO] Epoch:[0/2](278800/4588595) loss:3.664 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:03:26,329][model8_pretrain.py][INFO] Epoch:[0/2](278800/4588595) loss:2.746 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:03:26,330][model8_pretrain.py][INFO] Epoch:[0/2](278800/4588595) loss:3.076 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:03:26,330][model8_pretrain.py][INFO] Epoch:[0/2](278800/4588595) loss:3.138 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:03:26,330][model8_pretrain.py][INFO] Epoch:[0/2](278800/4588595) loss:3.203 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:03:26,330][model8_pretrain.py][INFO] Epoch:[0/2](278800/4588595) loss:2.993 lr:0.0000100 epoch_Time:27263.0min: [2024-01-03 23:04:03,267][model8_pretrain.py][INFO] Epoch:[0/2](278900/4588595) loss:2.578 lr:0.0000100 epoch_Time:27262.0min: [2024-01-03 23:04:03,267][model8_pretrain.py][INFO] Epoch:[0/2](278900/4588595) loss:2.764 lr:0.0000100 epoch_Time:27262.0min: [2024-01-03 23:04:03,267][model8_pretrain.py][INFO] Epoch:[0/2](278900/4588595) loss:3.002 lr:0.0000100 epoch_Time:27262.0min: [2024-01-03 23:04:03,267][model8_pretrain.py][INFO] Epoch:[0/2](278900/4588595) loss:3.367 lr:0.0000100 epoch_Time:27262.0min: [2024-01-03 23:04:03,267][model8_pretrain.py][INFO] Epoch:[0/2](278900/4588595) loss:3.145 lr:0.0000100 epoch_Time:27262.0min: [2024-01-03 23:04:03,267][model8_pretrain.py][INFO] Epoch:[0/2](278900/4588595) loss:2.394 lr:0.0000100 epoch_Time:27262.0min: [2024-01-03 
23:04:03,268][model8_pretrain.py][INFO] Epoch:[0/2](278900/4588595) loss:2.743 lr:0.0000100 epoch_Time:27262.0min: [2024-01-03 23:04:03,268][model8_pretrain.py][INFO] Epoch:[0/2](278900/4588595) loss:3.258 lr:0.0000100 epoch_Time:27262.0min: [2024-01-03 23:04:40,199][model8_pretrain.py][INFO] Epoch:[0/2](279000/4588595) loss:3.004 lr:0.0000100 epoch_Time:27262.0min: [2024-01-03 23:04:40,199][model8_pretrain.py][INFO] Epoch:[0/2](279000/4588595) loss:3.173 lr:0.0000100 epoch_Time:27262.0min: [2024-01-03 23:04:40,199][model8_pretrain.py][INFO] Epoch:[0/2](279000/4588595) loss:2.572 lr:0.0000100 epoch_Time:27262.0min: [2024-01-03 23:04:40,199][model8_pretrain.py][INFO] Epoch:[0/2](279000/4588595) loss:3.225 lr:0.0000100 epoch_Time:27262.0min: [2024-01-03 23:04:40,199][model8_pretrain.py][INFO] Epoch:[0/2](279000/4588595) loss:3.438 lr:0.0000100 epoch_Time:27262.0min: [2024-01-03 23:04:40,199][model8_pretrain.py][INFO] Epoch:[0/2](279000/4588595) loss:3.195 lr:0.0000100 epoch_Time:27262.0min: [2024-01-03 23:04:40,199][model8_pretrain.py][INFO] Epoch:[0/2](279000/4588595) loss:2.529 lr:0.0000100 epoch_Time:27262.0min: [2024-01-03 23:04:40,199][model8_pretrain.py][INFO] Epoch:[0/2](279000/4588595) loss:3.060 lr:0.0000100 epoch_Time:27262.0min: [2024-01-03 23:05:17,133][model8_pretrain.py][INFO] Epoch:[0/2](279100/4588595) loss:2.364 lr:0.0000100 epoch_Time:27260.0min: [2024-01-03 23:05:17,133][model8_pretrain.py][INFO] Epoch:[0/2](279100/4588595) loss:2.925 lr:0.0000100 epoch_Time:27260.0min: [2024-01-03 23:05:17,133][model8_pretrain.py][INFO] Epoch:[0/2](279100/4588595) loss:3.222 lr:0.0000100 epoch_Time:27260.0min: [2024-01-03 23:05:17,133][model8_pretrain.py][INFO] Epoch:[0/2](279100/4588595) loss:2.893 lr:0.0000100 epoch_Time:27260.0min: [2024-01-03 23:05:17,133][model8_pretrain.py][INFO] Epoch:[0/2](279100/4588595) loss:2.656 lr:0.0000100 epoch_Time:27260.0min: [2024-01-03 23:05:17,133][model8_pretrain.py][INFO] Epoch:[0/2](279100/4588595) loss:2.546 lr:0.0000100 
epoch_Time:27260.0min: [2024-01-03 23:05:17,133][model8_pretrain.py][INFO] Epoch:[0/2](279100/4588595) loss:2.910 lr:0.0000100 epoch_Time:27260.0min: [2024-01-03 23:05:17,133][model8_pretrain.py][INFO] Epoch:[0/2](279100/4588595) loss:2.930 lr:0.0000100 epoch_Time:27260.0min: [2024-01-03 23:05:54,058][model8_pretrain.py][INFO] Epoch:[0/2](279200/4588595) loss:3.400 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:05:54,058][model8_pretrain.py][INFO] Epoch:[0/2](279200/4588595) loss:2.919 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:05:54,058][model8_pretrain.py][INFO] Epoch:[0/2](279200/4588595) loss:3.071 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:05:54,059][model8_pretrain.py][INFO] Epoch:[0/2](279200/4588595) loss:3.313 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:05:54,059][model8_pretrain.py][INFO] Epoch:[0/2](279200/4588595) loss:3.166 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:05:54,059][model8_pretrain.py][INFO] Epoch:[0/2](279200/4588595) loss:3.006 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:05:54,059][model8_pretrain.py][INFO] Epoch:[0/2](279200/4588595) loss:2.693 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:05:54,059][model8_pretrain.py][INFO] Epoch:[0/2](279200/4588595) loss:2.995 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:06:30,991][model8_pretrain.py][INFO] Epoch:[0/2](279300/4588595) loss:2.810 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:06:30,991][model8_pretrain.py][INFO] Epoch:[0/2](279300/4588595) loss:2.692 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:06:30,991][model8_pretrain.py][INFO] Epoch:[0/2](279300/4588595) loss:3.184 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:06:30,991][model8_pretrain.py][INFO] Epoch:[0/2](279300/4588595) loss:3.470 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:06:30,991][model8_pretrain.py][INFO] Epoch:[0/2](279300/4588595) loss:2.364 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:06:30,991][model8_pretrain.py][INFO] 
Epoch:[0/2](279300/4588595) loss:3.227 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:06:30,991][model8_pretrain.py][INFO] Epoch:[0/2](279300/4588595) loss:3.083 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:06:30,992][model8_pretrain.py][INFO] Epoch:[0/2](279300/4588595) loss:3.256 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:07:18,363][model8_pretrain.py][INFO] Epoch:[0/2](279400/4588595) loss:3.111 lr:0.0000100 epoch_Time:27260.0min: [2024-01-03 23:07:18,363][model8_pretrain.py][INFO] Epoch:[0/2](279400/4588595) loss:2.332 lr:0.0000100 epoch_Time:27260.0min: [2024-01-03 23:07:18,363][model8_pretrain.py][INFO] Epoch:[0/2](279400/4588595) loss:3.139 lr:0.0000100 epoch_Time:27260.0min: [2024-01-03 23:07:18,363][model8_pretrain.py][INFO] Epoch:[0/2](279400/4588595) loss:2.535 lr:0.0000100 epoch_Time:27260.0min: [2024-01-03 23:07:18,363][model8_pretrain.py][INFO] Epoch:[0/2](279400/4588595) loss:3.190 lr:0.0000100 epoch_Time:27260.0min: [2024-01-03 23:07:18,363][model8_pretrain.py][INFO] Epoch:[0/2](279400/4588595) loss:3.066 lr:0.0000100 epoch_Time:27260.0min: [2024-01-03 23:07:18,363][model8_pretrain.py][INFO] Epoch:[0/2](279400/4588595) loss:2.914 lr:0.0000100 epoch_Time:27260.0min: [2024-01-03 23:07:18,363][model8_pretrain.py][INFO] Epoch:[0/2](279400/4588595) loss:2.689 lr:0.0000100 epoch_Time:27260.0min: [2024-01-03 23:07:55,282][model8_pretrain.py][INFO] Epoch:[0/2](279500/4588595) loss:2.855 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:07:55,282][model8_pretrain.py][INFO] Epoch:[0/2](279500/4588595) loss:2.694 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:07:55,282][model8_pretrain.py][INFO] Epoch:[0/2](279500/4588595) loss:3.070 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:07:55,282][model8_pretrain.py][INFO] Epoch:[0/2](279500/4588595) loss:3.090 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:07:55,282][model8_pretrain.py][INFO] Epoch:[0/2](279500/4588595) loss:2.710 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 
23:07:55,282][model8_pretrain.py][INFO] Epoch:[0/2](279500/4588595) loss:2.066 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:07:55,282][model8_pretrain.py][INFO] Epoch:[0/2](279500/4588595) loss:2.472 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:07:55,282][model8_pretrain.py][INFO] Epoch:[0/2](279500/4588595) loss:2.816 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:08:32,220][model8_pretrain.py][INFO] Epoch:[0/2](279600/4588595) loss:2.378 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:08:32,220][model8_pretrain.py][INFO] Epoch:[0/2](279600/4588595) loss:2.992 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:08:32,220][model8_pretrain.py][INFO] Epoch:[0/2](279600/4588595) loss:2.844 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:08:32,220][model8_pretrain.py][INFO] Epoch:[0/2](279600/4588595) loss:3.058 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:08:32,220][model8_pretrain.py][INFO] Epoch:[0/2](279600/4588595) loss:2.764 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:08:32,220][model8_pretrain.py][INFO] Epoch:[0/2](279600/4588595) loss:2.102 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:08:32,220][model8_pretrain.py][INFO] Epoch:[0/2](279600/4588595) loss:2.740 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:08:32,220][model8_pretrain.py][INFO] Epoch:[0/2](279600/4588595) loss:2.797 lr:0.0000100 epoch_Time:27259.0min: [2024-01-03 23:09:09,153][model8_pretrain.py][INFO] Epoch:[0/2](279700/4588595) loss:2.732 lr:0.0000100 epoch_Time:27257.0min: [2024-01-03 23:09:09,153][model8_pretrain.py][INFO] Epoch:[0/2](279700/4588595) loss:3.162 lr:0.0000100 epoch_Time:27257.0min: [2024-01-03 23:09:09,153][model8_pretrain.py][INFO] Epoch:[0/2](279700/4588595) loss:2.570 lr:0.0000100 epoch_Time:27257.0min: [2024-01-03 23:09:09,153][model8_pretrain.py][INFO] Epoch:[0/2](279700/4588595) loss:3.111 lr:0.0000100 epoch_Time:27257.0min: [2024-01-03 23:09:09,153][model8_pretrain.py][INFO] Epoch:[0/2](279700/4588595) loss:2.972 lr:0.0000100 
epoch_Time:27257.0min: [2024-01-03 23:09:09,153][model8_pretrain.py][INFO] Epoch:[0/2](279700/4588595) loss:3.269 lr:0.0000100 epoch_Time:27257.0min: [2024-01-03 23:09:09,154][model8_pretrain.py][INFO] Epoch:[0/2](279700/4588595) loss:2.929 lr:0.0000100 epoch_Time:27257.0min: [2024-01-03 23:09:09,165][model8_pretrain.py][INFO] Epoch:[0/2](279700/4588595) loss:2.774 lr:0.0000100 epoch_Time:27257.0min: [2024-01-03 23:09:46,122][model8_pretrain.py][INFO] Epoch:[0/2](279800/4588595) loss:2.606 lr:0.0000100 epoch_Time:27257.0min: [2024-01-03 23:09:46,122][model8_pretrain.py][INFO] Epoch:[0/2](279800/4588595) loss:2.573 lr:0.0000100 epoch_Time:27257.0min: [2024-01-03 23:09:46,122][model8_pretrain.py][INFO] Epoch:[0/2](279800/4588595) loss:2.742 lr:0.0000100 epoch_Time:27257.0min: [2024-01-03 23:09:46,122][model8_pretrain.py][INFO] Epoch:[0/2](279800/4588595) loss:3.237 lr:0.0000100 epoch_Time:27257.0min: [2024-01-03 23:09:46,122][model8_pretrain.py][INFO] Epoch:[0/2](279800/4588595) loss:3.120 lr:0.0000100 epoch_Time:27257.0min: [2024-01-03 23:09:46,122][model8_pretrain.py][INFO] Epoch:[0/2](279800/4588595) loss:3.020 lr:0.0000100 epoch_Time:27257.0min: [2024-01-03 23:09:46,122][model8_pretrain.py][INFO] Epoch:[0/2](279800/4588595) loss:2.822 lr:0.0000100 epoch_Time:27257.0min: [2024-01-03 23:09:46,122][model8_pretrain.py][INFO] Epoch:[0/2](279800/4588595) loss:3.215 lr:0.0000100 epoch_Time:27257.0min: [2024-01-03 23:10:23,092][model8_pretrain.py][INFO] Epoch:[0/2](279900/4588595) loss:2.611 lr:0.0000100 epoch_Time:27256.0min: [2024-01-03 23:10:23,092][model8_pretrain.py][INFO] Epoch:[0/2](279900/4588595) loss:3.118 lr:0.0000100 epoch_Time:27256.0min: [2024-01-03 23:10:23,092][model8_pretrain.py][INFO] Epoch:[0/2](279900/4588595) loss:2.778 lr:0.0000100 epoch_Time:27256.0min: [2024-01-03 23:10:23,092][model8_pretrain.py][INFO] Epoch:[0/2](279900/4588595) loss:2.516 lr:0.0000100 epoch_Time:27256.0min: [2024-01-03 23:10:23,092][model8_pretrain.py][INFO] 
Epoch:[0/2](279900/4588595) loss:3.209 lr:0.0000100 epoch_Time:27256.0min: [2024-01-03 23:10:23,092][model8_pretrain.py][INFO] Epoch:[0/2](279900/4588595) loss:3.190 lr:0.0000100 epoch_Time:27256.0min: [2024-01-03 23:10:23,093][model8_pretrain.py][INFO] Epoch:[0/2](279900/4588595) loss:2.918 lr:0.0000100 epoch_Time:27256.0min: [2024-01-03 23:10:23,093][model8_pretrain.py][INFO] Epoch:[0/2](279900/4588595) loss:2.753 lr:0.0000100 epoch_Time:27256.0min: [2024-01-03 23:11:00,060][model8_pretrain.py][INFO] Epoch:[0/2](280000/4588595) loss:2.972 lr:0.0000100 epoch_Time:27255.0min: [2024-01-03 23:11:00,060][model8_pretrain.py][INFO] Epoch:[0/2](280000/4588595) loss:2.027 lr:0.0000100 epoch_Time:27255.0min: [2024-01-03 23:11:00,060][model8_pretrain.py][INFO] Epoch:[0/2](280000/4588595) loss:3.000 lr:0.0000100 epoch_Time:27255.0min: [2024-01-03 23:11:00,060][model8_pretrain.py][INFO] Epoch:[0/2](280000/4588595) loss:2.628 lr:0.0000100 epoch_Time:27255.0min: [2024-01-03 23:11:00,060][model8_pretrain.py][INFO] Epoch:[0/2](280000/4588595) loss:3.177 lr:0.0000100 epoch_Time:27255.0min: [2024-01-03 23:11:00,060][model8_pretrain.py][INFO] Epoch:[0/2](280000/4588595) loss:2.620 lr:0.0000100 epoch_Time:27255.0min: [2024-01-03 23:11:00,060][model8_pretrain.py][INFO] Epoch:[0/2](280000/4588595) loss:2.951 lr:0.0000100 epoch_Time:27255.0min: [2024-01-03 23:11:00,060][model8_pretrain.py][INFO] Epoch:[0/2](280000/4588595) loss:2.398 lr:0.0000100 epoch_Time:27255.0min: [2024-01-03 23:11:36,994][model8_pretrain.py][INFO] Epoch:[0/2](280100/4588595) loss:3.366 lr:0.0000100 epoch_Time:27254.0min: [2024-01-03 23:11:36,994][model8_pretrain.py][INFO] Epoch:[0/2](280100/4588595) loss:3.110 lr:0.0000100 epoch_Time:27254.0min: [2024-01-03 23:11:36,994][model8_pretrain.py][INFO] Epoch:[0/2](280100/4588595) loss:2.982 lr:0.0000100 epoch_Time:27254.0min: [2024-01-03 23:11:36,994][model8_pretrain.py][INFO] Epoch:[0/2](280100/4588595) loss:2.266 lr:0.0000100 epoch_Time:27254.0min: [2024-01-03 
23:11:36,994][model8_pretrain.py][INFO] Epoch:[0/2](280100/4588595) loss:2.503 lr:0.0000100 epoch_Time:27254.0min: [2024-01-03 23:11:36,994][model8_pretrain.py][INFO] Epoch:[0/2](280100/4588595) loss:3.419 lr:0.0000100 epoch_Time:27254.0min: [2024-01-03 23:11:36,994][model8_pretrain.py][INFO] Epoch:[0/2](280100/4588595) loss:2.555 lr:0.0000100 epoch_Time:27254.0min: [2024-01-03 23:11:36,994][model8_pretrain.py][INFO] Epoch:[0/2](280100/4588595) loss:2.778 lr:0.0000100 epoch_Time:27254.0min: [2024-01-03 23:12:24,475][model8_pretrain.py][INFO] Epoch:[0/2](280200/4588595) loss:2.598 lr:0.0000100 epoch_Time:27256.0min: [2024-01-03 23:12:24,475][model8_pretrain.py][INFO] Epoch:[0/2](280200/4588595) loss:2.732 lr:0.0000100 epoch_Time:27256.0min: [2024-01-03 23:12:24,476][model8_pretrain.py][INFO] Epoch:[0/2](280200/4588595) loss:3.039 lr:0.0000100 epoch_Time:27256.0min: [2024-01-03 23:12:24,476][model8_pretrain.py][INFO] Epoch:[0/2](280200/4588595) loss:2.928 lr:0.0000100 epoch_Time:27256.0min: [2024-01-03 23:12:24,476][model8_pretrain.py][INFO] Epoch:[0/2](280200/4588595) loss:2.273 lr:0.0000100 epoch_Time:27256.0min: [2024-01-03 23:12:24,476][model8_pretrain.py][INFO] Epoch:[0/2](280200/4588595) loss:2.742 lr:0.0000100 epoch_Time:27256.0min: [2024-01-03 23:12:24,476][model8_pretrain.py][INFO] Epoch:[0/2](280200/4588595) loss:2.991 lr:0.0000100 epoch_Time:27256.0min: [2024-01-03 23:12:24,476][model8_pretrain.py][INFO] Epoch:[0/2](280200/4588595) loss:3.104 lr:0.0000100 epoch_Time:27256.0min: [2024-01-03 23:13:01,396][model8_pretrain.py][INFO] Epoch:[0/2](280300/4588595) loss:3.040 lr:0.0000100 epoch_Time:27255.0min: [2024-01-03 23:13:01,396][model8_pretrain.py][INFO] Epoch:[0/2](280300/4588595) loss:2.266 lr:0.0000100 epoch_Time:27255.0min: [2024-01-03 23:13:01,396][model8_pretrain.py][INFO] Epoch:[0/2](280300/4588595) loss:3.002 lr:0.0000100 epoch_Time:27255.0min: [2024-01-03 23:13:01,396][model8_pretrain.py][INFO] Epoch:[0/2](280300/4588595) loss:2.363 lr:0.0000100 
epoch_Time:27255.0min: [2024-01-03 23:13:01,396][model8_pretrain.py][INFO] Epoch:[0/2](280300/4588595) loss:2.821 lr:0.0000100 epoch_Time:27255.0min: [2024-01-03 23:13:01,396][model8_pretrain.py][INFO] Epoch:[0/2](280300/4588595) loss:2.834 lr:0.0000100 epoch_Time:27255.0min: [2024-01-03 23:13:01,396][model8_pretrain.py][INFO] Epoch:[0/2](280300/4588595) loss:2.964 lr:0.0000100 epoch_Time:27255.0min: [2024-01-03 23:13:01,397][model8_pretrain.py][INFO] Epoch:[0/2](280300/4588595) loss:2.603 lr:0.0000100 epoch_Time:27255.0min: [2024-01-03 23:13:38,320][model8_pretrain.py][INFO] Epoch:[0/2](280400/4588595) loss:2.926 lr:0.0000100 epoch_Time:27254.0min: [2024-01-03 23:13:38,320][model8_pretrain.py][INFO] Epoch:[0/2](280400/4588595) loss:2.873 lr:0.0000100 epoch_Time:27254.0min: [2024-01-03 23:13:38,320][model8_pretrain.py][INFO] Epoch:[0/2](280400/4588595) loss:2.661 lr:0.0000100 epoch_Time:27254.0min: [2024-01-03 23:13:38,320][model8_pretrain.py][INFO] Epoch:[0/2](280400/4588595) loss:3.152 lr:0.0000100 epoch_Time:27254.0min: [2024-01-03 23:13:38,320][model8_pretrain.py][INFO] Epoch:[0/2](280400/4588595) loss:3.006 lr:0.0000100 epoch_Time:27254.0min: [2024-01-03 23:13:38,320][model8_pretrain.py][INFO] Epoch:[0/2](280400/4588595) loss:2.800 lr:0.0000100 epoch_Time:27254.0min: [2024-01-03 23:13:38,320][model8_pretrain.py][INFO] Epoch:[0/2](280400/4588595) loss:2.929 lr:0.0000100 epoch_Time:27254.0min: [2024-01-03 23:13:38,320][model8_pretrain.py][INFO] Epoch:[0/2](280400/4588595) loss:2.850 lr:0.0000100 epoch_Time:27254.0min: [2024-01-03 23:14:15,242][model8_pretrain.py][INFO] Epoch:[0/2](280500/4588595) loss:2.614 lr:0.0000100 epoch_Time:27253.0min: [2024-01-03 23:14:15,242][model8_pretrain.py][INFO] Epoch:[0/2](280500/4588595) loss:2.699 lr:0.0000100 epoch_Time:27253.0min: [2024-01-03 23:14:15,242][model8_pretrain.py][INFO] Epoch:[0/2](280500/4588595) loss:2.727 lr:0.0000100 epoch_Time:27253.0min: [2024-01-03 23:14:15,242][model8_pretrain.py][INFO] 
Epoch:[0/2](280500/4588595) loss:3.045 lr:0.0000100 epoch_Time:27253.0min: [2024-01-03 23:14:15,242][model8_pretrain.py][INFO] Epoch:[0/2](280500/4588595) loss:3.056 lr:0.0000100 epoch_Time:27253.0min: [2024-01-03 23:14:15,242][model8_pretrain.py][INFO] Epoch:[0/2](280500/4588595) loss:3.083 lr:0.0000100 epoch_Time:27253.0min: [2024-01-03 23:14:15,242][model8_pretrain.py][INFO] Epoch:[0/2](280500/4588595) loss:2.351 lr:0.0000100 epoch_Time:27253.0min: [2024-01-03 23:14:15,242][model8_pretrain.py][INFO] Epoch:[0/2](280500/4588595) loss:3.059 lr:0.0000100 epoch_Time:27253.0min: [2024-01-03 23:14:52,159][model8_pretrain.py][INFO] Epoch:[0/2](280600/4588595) loss:3.009 lr:0.0000100 epoch_Time:27252.0min: [2024-01-03 23:14:52,159][model8_pretrain.py][INFO] Epoch:[0/2](280600/4588595) loss:3.185 lr:0.0000100 epoch_Time:27252.0min: [2024-01-03 23:14:52,159][model8_pretrain.py][INFO] Epoch:[0/2](280600/4588595) loss:2.899 lr:0.0000100 epoch_Time:27252.0min: [2024-01-03 23:14:52,159][model8_pretrain.py][INFO] Epoch:[0/2](280600/4588595) loss:3.044 lr:0.0000100 epoch_Time:27252.0min: [2024-01-03 23:14:52,159][model8_pretrain.py][INFO] Epoch:[0/2](280600/4588595) loss:2.615 lr:0.0000100 epoch_Time:27252.0min: [2024-01-03 23:14:52,159][model8_pretrain.py][INFO] Epoch:[0/2](280600/4588595) loss:2.914 lr:0.0000100 epoch_Time:27252.0min: [2024-01-03 23:14:52,159][model8_pretrain.py][INFO] Epoch:[0/2](280600/4588595) loss:2.743 lr:0.0000100 epoch_Time:27252.0min: [2024-01-03 23:14:52,159][model8_pretrain.py][INFO] Epoch:[0/2](280600/4588595) loss:3.080 lr:0.0000100 epoch_Time:27252.0min: [2024-01-03 23:15:29,089][model8_pretrain.py][INFO] Epoch:[0/2](280700/4588595) loss:2.637 lr:0.0000100 epoch_Time:27252.0min: [2024-01-03 23:15:29,089][model8_pretrain.py][INFO] Epoch:[0/2](280700/4588595) loss:3.188 lr:0.0000100 epoch_Time:27252.0min: [2024-01-03 23:15:29,089][model8_pretrain.py][INFO] Epoch:[0/2](280700/4588595) loss:2.955 lr:0.0000100 epoch_Time:27252.0min: [2024-01-03 
23:15:29,089][model8_pretrain.py][INFO] Epoch:[0/2](280700/4588595) loss:2.378 lr:0.0000100 epoch_Time:27252.0min: [2024-01-03 23:15:29,089][model8_pretrain.py][INFO] Epoch:[0/2](280700/4588595) loss:3.196 lr:0.0000100 epoch_Time:27252.0min: [2024-01-03 23:15:29,089][model8_pretrain.py][INFO] Epoch:[0/2](280700/4588595) loss:3.441 lr:0.0000100 epoch_Time:27252.0min: [2024-01-03 23:15:29,089][model8_pretrain.py][INFO] Epoch:[0/2](280700/4588595) loss:2.205 lr:0.0000100 epoch_Time:27252.0min: [2024-01-03 23:15:29,089][model8_pretrain.py][INFO] Epoch:[0/2](280700/4588595) loss:2.534 lr:0.0000100 epoch_Time:27252.0min: [2024-01-03 23:16:06,017][model8_pretrain.py][INFO] Epoch:[0/2](280800/4588595) loss:3.135 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:16:06,017][model8_pretrain.py][INFO] Epoch:[0/2](280800/4588595) loss:2.582 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:16:06,017][model8_pretrain.py][INFO] Epoch:[0/2](280800/4588595) loss:2.778 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:16:06,017][model8_pretrain.py][INFO] Epoch:[0/2](280800/4588595) loss:2.926 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:16:06,017][model8_pretrain.py][INFO] Epoch:[0/2](280800/4588595) loss:2.757 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:16:06,017][model8_pretrain.py][INFO] Epoch:[0/2](280800/4588595) loss:2.665 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:16:06,017][model8_pretrain.py][INFO] Epoch:[0/2](280800/4588595) loss:1.837 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:16:06,017][model8_pretrain.py][INFO] Epoch:[0/2](280800/4588595) loss:2.801 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:16:42,945][model8_pretrain.py][INFO] Epoch:[0/2](280900/4588595) loss:2.992 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:16:42,945][model8_pretrain.py][INFO] Epoch:[0/2](280900/4588595) loss:3.056 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:16:42,945][model8_pretrain.py][INFO] Epoch:[0/2](280900/4588595) loss:3.491 lr:0.0000100 
epoch_Time:27250.0min: [2024-01-03 23:16:42,945][model8_pretrain.py][INFO] Epoch:[0/2](280900/4588595) loss:2.719 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:16:42,945][model8_pretrain.py][INFO] Epoch:[0/2](280900/4588595) loss:2.931 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:16:42,945][model8_pretrain.py][INFO] Epoch:[0/2](280900/4588595) loss:3.105 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:16:42,945][model8_pretrain.py][INFO] Epoch:[0/2](280900/4588595) loss:3.272 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:16:42,945][model8_pretrain.py][INFO] Epoch:[0/2](280900/4588595) loss:3.312 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:17:30,336][model8_pretrain.py][INFO] Epoch:[0/2](281000/4588595) loss:3.102 lr:0.0000100 epoch_Time:27252.0min: [2024-01-03 23:17:30,336][model8_pretrain.py][INFO] Epoch:[0/2](281000/4588595) loss:3.160 lr:0.0000100 epoch_Time:27252.0min: [2024-01-03 23:17:30,336][model8_pretrain.py][INFO] Epoch:[0/2](281000/4588595) loss:2.167 lr:0.0000100 epoch_Time:27252.0min: [2024-01-03 23:17:30,336][model8_pretrain.py][INFO] Epoch:[0/2](281000/4588595) loss:3.085 lr:0.0000100 epoch_Time:27252.0min: [2024-01-03 23:17:30,336][model8_pretrain.py][INFO] Epoch:[0/2](281000/4588595) loss:2.801 lr:0.0000100 epoch_Time:27252.0min: [2024-01-03 23:17:30,336][model8_pretrain.py][INFO] Epoch:[0/2](281000/4588595) loss:3.414 lr:0.0000100 epoch_Time:27252.0min: [2024-01-03 23:17:30,336][model8_pretrain.py][INFO] Epoch:[0/2](281000/4588595) loss:2.662 lr:0.0000100 epoch_Time:27252.0min: [2024-01-03 23:17:30,336][model8_pretrain.py][INFO] Epoch:[0/2](281000/4588595) loss:2.840 lr:0.0000100 epoch_Time:27252.0min: [2024-01-03 23:18:07,271][model8_pretrain.py][INFO] Epoch:[0/2](281100/4588595) loss:2.930 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:18:07,271][model8_pretrain.py][INFO] Epoch:[0/2](281100/4588595) loss:2.980 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:18:07,271][model8_pretrain.py][INFO] 
Epoch:[0/2](281100/4588595) loss:2.329 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:18:07,271][model8_pretrain.py][INFO] Epoch:[0/2](281100/4588595) loss:3.148 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:18:07,271][model8_pretrain.py][INFO] Epoch:[0/2](281100/4588595) loss:3.042 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:18:07,271][model8_pretrain.py][INFO] Epoch:[0/2](281100/4588595) loss:3.091 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:18:07,271][model8_pretrain.py][INFO] Epoch:[0/2](281100/4588595) loss:3.025 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:18:07,271][model8_pretrain.py][INFO] Epoch:[0/2](281100/4588595) loss:3.075 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:18:44,201][model8_pretrain.py][INFO] Epoch:[0/2](281200/4588595) loss:2.125 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:18:44,201][model8_pretrain.py][INFO] Epoch:[0/2](281200/4588595) loss:2.780 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:18:44,201][model8_pretrain.py][INFO] Epoch:[0/2](281200/4588595) loss:3.224 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:18:44,201][model8_pretrain.py][INFO] Epoch:[0/2](281200/4588595) loss:3.211 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:18:44,201][model8_pretrain.py][INFO] Epoch:[0/2](281200/4588595) loss:2.553 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:18:44,201][model8_pretrain.py][INFO] Epoch:[0/2](281200/4588595) loss:3.363 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:18:44,201][model8_pretrain.py][INFO] Epoch:[0/2](281200/4588595) loss:2.901 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:18:44,202][model8_pretrain.py][INFO] Epoch:[0/2](281200/4588595) loss:3.139 lr:0.0000100 epoch_Time:27250.0min: [2024-01-03 23:19:21,127][model8_pretrain.py][INFO] Epoch:[0/2](281300/4588595) loss:2.913 lr:0.0000100 epoch_Time:27249.0min: [2024-01-03 23:19:21,127][model8_pretrain.py][INFO] Epoch:[0/2](281300/4588595) loss:2.829 lr:0.0000100 epoch_Time:27249.0min: [2024-01-03 
23:19:21,127][model8_pretrain.py][INFO] Epoch:[0/2](281300/4588595) loss:2.890 lr:0.0000100 epoch_Time:27249.0min: [2024-01-03 23:19:21,127][model8_pretrain.py][INFO] Epoch:[0/2](281300/4588595) loss:2.884 lr:0.0000100 epoch_Time:27249.0min: [2024-01-03 23:19:21,127][model8_pretrain.py][INFO] Epoch:[0/2](281300/4588595) loss:3.346 lr:0.0000100 epoch_Time:27249.0min: [2024-01-03 23:19:21,127][model8_pretrain.py][INFO] Epoch:[0/2](281300/4588595) loss:2.391 lr:0.0000100 epoch_Time:27249.0min: [2024-01-03 23:19:21,127][model8_pretrain.py][INFO] Epoch:[0/2](281300/4588595) loss:2.771 lr:0.0000100 epoch_Time:27249.0min: [2024-01-03 23:19:21,127][model8_pretrain.py][INFO] Epoch:[0/2](281300/4588595) loss:3.012 lr:0.0000100 epoch_Time:27249.0min: [2024-01-03 23:19:58,060][model8_pretrain.py][INFO] Epoch:[0/2](281400/4588595) loss:3.100 lr:0.0000100 epoch_Time:27247.0min: [2024-01-03 23:19:58,060][model8_pretrain.py][INFO] Epoch:[0/2](281400/4588595) loss:2.272 lr:0.0000100 epoch_Time:27247.0min: [2024-01-03 23:19:58,060][model8_pretrain.py][INFO] Epoch:[0/2](281400/4588595) loss:2.997 lr:0.0000100 epoch_Time:27247.0min: [2024-01-03 23:19:58,060][model8_pretrain.py][INFO] Epoch:[0/2](281400/4588595) loss:2.734 lr:0.0000100 epoch_Time:27247.0min: [2024-01-03 23:19:58,060][model8_pretrain.py][INFO] Epoch:[0/2](281400/4588595) loss:2.624 lr:0.0000100 epoch_Time:27247.0min: [2024-01-03 23:19:58,060][model8_pretrain.py][INFO] Epoch:[0/2](281400/4588595) loss:3.133 lr:0.0000100 epoch_Time:27247.0min: [2024-01-03 23:19:58,060][model8_pretrain.py][INFO] Epoch:[0/2](281400/4588595) loss:3.024 lr:0.0000100 epoch_Time:27247.0min: [2024-01-03 23:19:58,060][model8_pretrain.py][INFO] Epoch:[0/2](281400/4588595) loss:2.454 lr:0.0000100 epoch_Time:27247.0min: [2024-01-03 23:20:35,002][model8_pretrain.py][INFO] Epoch:[0/2](281500/4588595) loss:2.937 lr:0.0000100 epoch_Time:27247.0min: [2024-01-03 23:20:35,002][model8_pretrain.py][INFO] Epoch:[0/2](281500/4588595) loss:3.285 lr:0.0000100 
epoch_Time:27247.0min: [2024-01-03 23:20:35,002][model8_pretrain.py][INFO] Epoch:[0/2](281500/4588595) loss:2.467 lr:0.0000100 epoch_Time:27247.0min: [2024-01-03 23:20:35,003][model8_pretrain.py][INFO] Epoch:[0/2](281500/4588595) loss:2.597 lr:0.0000100 epoch_Time:27247.0min: [2024-01-03 23:20:35,003][model8_pretrain.py][INFO] Epoch:[0/2](281500/4588595) loss:2.592 lr:0.0000100 epoch_Time:27247.0min: [2024-01-03 23:20:35,003][model8_pretrain.py][INFO] Epoch:[0/2](281500/4588595) loss:2.448 lr:0.0000100 epoch_Time:27247.0min: [2024-01-03 23:20:35,003][model8_pretrain.py][INFO] Epoch:[0/2](281500/4588595) loss:3.076 lr:0.0000100 epoch_Time:27247.0min: [2024-01-03 23:20:35,003][model8_pretrain.py][INFO] Epoch:[0/2](281500/4588595) loss:3.057 lr:0.0000100 epoch_Time:27247.0min: [2024-01-03 23:21:11,944][model8_pretrain.py][INFO] Epoch:[0/2](281600/4588595) loss:2.748 lr:0.0000100 epoch_Time:27246.0min: [2024-01-03 23:21:11,944][model8_pretrain.py][INFO] Epoch:[0/2](281600/4588595) loss:2.720 lr:0.0000100 epoch_Time:27246.0min: [2024-01-03 23:21:11,944][model8_pretrain.py][INFO] Epoch:[0/2](281600/4588595) loss:2.355 lr:0.0000100 epoch_Time:27246.0min: [2024-01-03 23:21:11,944][model8_pretrain.py][INFO] Epoch:[0/2](281600/4588595) loss:2.866 lr:0.0000100 epoch_Time:27246.0min: [2024-01-03 23:21:11,944][model8_pretrain.py][INFO] Epoch:[0/2](281600/4588595) loss:2.965 lr:0.0000100 epoch_Time:27246.0min: [2024-01-03 23:21:11,944][model8_pretrain.py][INFO] Epoch:[0/2](281600/4588595) loss:2.359 lr:0.0000100 epoch_Time:27246.0min: [2024-01-03 23:21:11,944][model8_pretrain.py][INFO] Epoch:[0/2](281600/4588595) loss:2.874 lr:0.0000100 epoch_Time:27246.0min: [2024-01-03 23:21:11,944][model8_pretrain.py][INFO] Epoch:[0/2](281600/4588595) loss:2.142 lr:0.0000100 epoch_Time:27246.0min: [2024-01-03 23:21:48,880][model8_pretrain.py][INFO] Epoch:[0/2](281700/4588595) loss:2.646 lr:0.0000100 epoch_Time:27245.0min: [2024-01-03 23:21:48,880][model8_pretrain.py][INFO] 
Epoch:[0/2](281700/4588595) loss:2.810 lr:0.0000100 epoch_Time:27245.0min: [2024-01-03 23:21:48,880][model8_pretrain.py][INFO] Epoch:[0/2](281700/4588595) loss:3.498 lr:0.0000100 epoch_Time:27245.0min: [2024-01-03 23:21:48,880][model8_pretrain.py][INFO] Epoch:[0/2](281700/4588595) loss:2.811 lr:0.0000100 epoch_Time:27245.0min: [2024-01-03 23:21:48,880][model8_pretrain.py][INFO] Epoch:[0/2](281700/4588595) loss:3.254 lr:0.0000100 epoch_Time:27245.0min: [2024-01-03 23:21:48,880][model8_pretrain.py][INFO] Epoch:[0/2](281700/4588595) loss:3.296 lr:0.0000100 epoch_Time:27245.0min: [2024-01-03 23:21:48,880][model8_pretrain.py][INFO] Epoch:[0/2](281700/4588595) loss:2.097 lr:0.0000100 epoch_Time:27245.0min: [2024-01-03 23:21:48,881][model8_pretrain.py][INFO] Epoch:[0/2](281700/4588595) loss:2.699 lr:0.0000100 epoch_Time:27245.0min: [2024-01-03 23:22:36,149][model8_pretrain.py][INFO] Epoch:[0/2](281800/4588595) loss:1.948 lr:0.0000100 epoch_Time:27247.0min: [2024-01-03 23:22:36,149][model8_pretrain.py][INFO] Epoch:[0/2](281800/4588595) loss:2.762 lr:0.0000100 epoch_Time:27247.0min: [2024-01-03 23:22:36,149][model8_pretrain.py][INFO] Epoch:[0/2](281800/4588595) loss:2.994 lr:0.0000100 epoch_Time:27247.0min: [2024-01-03 23:22:36,149][model8_pretrain.py][INFO] Epoch:[0/2](281800/4588595) loss:2.732 lr:0.0000100 epoch_Time:27247.0min: [2024-01-03 23:22:36,149][model8_pretrain.py][INFO] Epoch:[0/2](281800/4588595) loss:2.976 lr:0.0000100 epoch_Time:27247.0min: [2024-01-03 23:22:36,149][model8_pretrain.py][INFO] Epoch:[0/2](281800/4588595) loss:3.257 lr:0.0000100 epoch_Time:27247.0min: [2024-01-03 23:22:36,149][model8_pretrain.py][INFO] Epoch:[0/2](281800/4588595) loss:3.548 lr:0.0000100 epoch_Time:27247.0min: [2024-01-03 23:22:36,149][model8_pretrain.py][INFO] Epoch:[0/2](281800/4588595) loss:3.167 lr:0.0000100 epoch_Time:27247.0min: [2024-01-03 23:23:13,082][model8_pretrain.py][INFO] Epoch:[0/2](281900/4588595) loss:2.921 lr:0.0000100 epoch_Time:27246.0min: [2024-01-03 
23:23:13,082][model8_pretrain.py][INFO] Epoch:[0/2](281900/4588595) loss:3.412 lr:0.0000100 epoch_Time:27246.0min: [2024-01-03 23:23:13,082][model8_pretrain.py][INFO] Epoch:[0/2](281900/4588595) loss:3.106 lr:0.0000100 epoch_Time:27246.0min: [2024-01-03 23:23:13,082][model8_pretrain.py][INFO] Epoch:[0/2](281900/4588595) loss:2.858 lr:0.0000100 epoch_Time:27246.0min: [2024-01-03 23:23:13,082][model8_pretrain.py][INFO] Epoch:[0/2](281900/4588595) loss:3.076 lr:0.0000100 epoch_Time:27246.0min: [2024-01-03 23:23:13,082][model8_pretrain.py][INFO] Epoch:[0/2](281900/4588595) loss:2.353 lr:0.0000100 epoch_Time:27246.0min: [2024-01-03 23:23:13,082][model8_pretrain.py][INFO] Epoch:[0/2](281900/4588595) loss:2.991 lr:0.0000100 epoch_Time:27246.0min: [2024-01-03 23:23:13,082][model8_pretrain.py][INFO] Epoch:[0/2](281900/4588595) loss:2.712 lr:0.0000100 epoch_Time:27246.0min: [2024-01-03 23:23:50,034][model8_pretrain.py][INFO] Epoch:[0/2](282000/4588595) loss:2.642 lr:0.0000100 epoch_Time:27245.0min: [2024-01-03 23:23:50,034][model8_pretrain.py][INFO] Epoch:[0/2](282000/4588595) loss:2.709 lr:0.0000100 epoch_Time:27245.0min: [2024-01-03 23:23:50,035][model8_pretrain.py][INFO] Epoch:[0/2](282000/4588595) loss:2.993 lr:0.0000100 epoch_Time:27245.0min: [2024-01-03 23:23:50,035][model8_pretrain.py][INFO] Epoch:[0/2](282000/4588595) loss:2.978 lr:0.0000100 epoch_Time:27245.0min: [2024-01-03 23:23:50,035][model8_pretrain.py][INFO] Epoch:[0/2](282000/4588595) loss:2.854 lr:0.0000100 epoch_Time:27245.0min: [2024-01-03 23:23:50,035][model8_pretrain.py][INFO] Epoch:[0/2](282000/4588595) loss:3.240 lr:0.0000100 epoch_Time:27245.0min: [2024-01-03 23:23:50,035][model8_pretrain.py][INFO] Epoch:[0/2](282000/4588595) loss:2.887 lr:0.0000100 epoch_Time:27245.0min: [2024-01-03 23:23:50,035][model8_pretrain.py][INFO] Epoch:[0/2](282000/4588595) loss:3.050 lr:0.0000100 epoch_Time:27245.0min: [2024-01-03 23:24:26,989][model8_pretrain.py][INFO] Epoch:[0/2](282100/4588595) loss:2.746 lr:0.0000100 
epoch_Time:27244.0min: [2024-01-03 23:24:26,989][model8_pretrain.py][INFO] Epoch:[0/2](282100/4588595) loss:2.735 lr:0.0000100 epoch_Time:27244.0min: [2024-01-03 23:24:26,989][model8_pretrain.py][INFO] Epoch:[0/2](282100/4588595) loss:2.939 lr:0.0000100 epoch_Time:27244.0min: [2024-01-03 23:24:26,989][model8_pretrain.py][INFO] Epoch:[0/2](282100/4588595) loss:3.106 lr:0.0000100 epoch_Time:27244.0min: [2024-01-03 23:24:26,989][model8_pretrain.py][INFO] Epoch:[0/2](282100/4588595) loss:3.425 lr:0.0000100 epoch_Time:27244.0min: [2024-01-03 23:24:26,989][model8_pretrain.py][INFO] Epoch:[0/2](282100/4588595) loss:3.046 lr:0.0000100 epoch_Time:27244.0min: [2024-01-03 23:24:26,989][model8_pretrain.py][INFO] Epoch:[0/2](282100/4588595) loss:2.757 lr:0.0000100 epoch_Time:27244.0min: [2024-01-03 23:24:26,989][model8_pretrain.py][INFO] Epoch:[0/2](282100/4588595) loss:3.076 lr:0.0000100 epoch_Time:27244.0min: [2024-01-03 23:25:03,935][model8_pretrain.py][INFO] Epoch:[0/2](282200/4588595) loss:2.990 lr:0.0000100 epoch_Time:27243.0min: [2024-01-03 23:25:03,935][model8_pretrain.py][INFO] Epoch:[0/2](282200/4588595) loss:2.739 lr:0.0000100 epoch_Time:27243.0min: [2024-01-03 23:25:03,935][model8_pretrain.py][INFO] Epoch:[0/2](282200/4588595) loss:2.964 lr:0.0000100 epoch_Time:27243.0min: [2024-01-03 23:25:03,935][model8_pretrain.py][INFO] Epoch:[0/2](282200/4588595) loss:2.708 lr:0.0000100 epoch_Time:27243.0min: [2024-01-03 23:25:03,935][model8_pretrain.py][INFO] Epoch:[0/2](282200/4588595) loss:3.087 lr:0.0000100 epoch_Time:27243.0min: [2024-01-03 23:25:03,935][model8_pretrain.py][INFO] Epoch:[0/2](282200/4588595) loss:3.049 lr:0.0000100 epoch_Time:27243.0min: [2024-01-03 23:25:03,935][model8_pretrain.py][INFO] Epoch:[0/2](282200/4588595) loss:2.730 lr:0.0000100 epoch_Time:27243.0min: [2024-01-03 23:25:03,935][model8_pretrain.py][INFO] Epoch:[0/2](282200/4588595) loss:2.966 lr:0.0000100 epoch_Time:27243.0min: [2024-01-03 23:25:40,886][model8_pretrain.py][INFO] 
Epoch:[0/2](282300/4588595) loss:3.039 lr:0.0000100 epoch_Time:27243.0min: [2024-01-03 23:25:40,886][model8_pretrain.py][INFO] Epoch:[0/2](282300/4588595) loss:3.080 lr:0.0000100 epoch_Time:27243.0min: [2024-01-03 23:25:40,886][model8_pretrain.py][INFO] Epoch:[0/2](282300/4588595) loss:3.383 lr:0.0000100 epoch_Time:27243.0min: [2024-01-03 23:25:40,886][model8_pretrain.py][INFO] Epoch:[0/2](282300/4588595) loss:2.704 lr:0.0000100 epoch_Time:27243.0min: [2024-01-03 23:25:40,886][model8_pretrain.py][INFO] Epoch:[0/2](282300/4588595) loss:2.961 lr:0.0000100 epoch_Time:27243.0min: [2024-01-03 23:25:40,886][model8_pretrain.py][INFO] Epoch:[0/2](282300/4588595) loss:2.750 lr:0.0000100 epoch_Time:27243.0min: [2024-01-03 23:25:40,887][model8_pretrain.py][INFO] Epoch:[0/2](282300/4588595) loss:3.626 lr:0.0000100 epoch_Time:27243.0min: [2024-01-03 23:25:40,887][model8_pretrain.py][INFO] Epoch:[0/2](282300/4588595) loss:3.426 lr:0.0000100 epoch_Time:27243.0min: [2024-01-03 23:26:17,843][model8_pretrain.py][INFO] Epoch:[0/2](282400/4588595) loss:2.433 lr:0.0000100 epoch_Time:27242.0min: [2024-01-03 23:26:17,843][model8_pretrain.py][INFO] Epoch:[0/2](282400/4588595) loss:3.246 lr:0.0000100 epoch_Time:27242.0min: [2024-01-03 23:26:17,843][model8_pretrain.py][INFO] Epoch:[0/2](282400/4588595) loss:2.762 lr:0.0000100 epoch_Time:27242.0min: [2024-01-03 23:26:17,843][model8_pretrain.py][INFO] Epoch:[0/2](282400/4588595) loss:3.042 lr:0.0000100 epoch_Time:27242.0min: [2024-01-03 23:26:17,843][model8_pretrain.py][INFO] Epoch:[0/2](282400/4588595) loss:2.771 lr:0.0000100 epoch_Time:27242.0min: [2024-01-03 23:26:17,843][model8_pretrain.py][INFO] Epoch:[0/2](282400/4588595) loss:2.985 lr:0.0000100 epoch_Time:27242.0min: [2024-01-03 23:26:17,844][model8_pretrain.py][INFO] Epoch:[0/2](282400/4588595) loss:2.833 lr:0.0000100 epoch_Time:27242.0min: [2024-01-03 23:26:17,844][model8_pretrain.py][INFO] Epoch:[0/2](282400/4588595) loss:2.846 lr:0.0000100 epoch_Time:27242.0min: [2024-01-03 
23:26:54,778][model8_pretrain.py][INFO] Epoch:[0/2](282500/4588595) loss:2.873 lr:0.0000100 epoch_Time:27240.0min: [2024-01-03 23:26:54,778][model8_pretrain.py][INFO] Epoch:[0/2](282500/4588595) loss:2.819 lr:0.0000100 epoch_Time:27240.0min: [2024-01-03 23:26:54,778][model8_pretrain.py][INFO] Epoch:[0/2](282500/4588595) loss:2.097 lr:0.0000100 epoch_Time:27240.0min: [2024-01-03 23:26:54,778][model8_pretrain.py][INFO] Epoch:[0/2](282500/4588595) loss:2.802 lr:0.0000100 epoch_Time:27240.0min: [2024-01-03 23:26:54,778][model8_pretrain.py][INFO] Epoch:[0/2](282500/4588595) loss:3.261 lr:0.0000100 epoch_Time:27240.0min: [2024-01-03 23:26:54,778][model8_pretrain.py][INFO] Epoch:[0/2](282500/4588595) loss:2.887 lr:0.0000100 epoch_Time:27240.0min: [2024-01-03 23:26:54,778][model8_pretrain.py][INFO] Epoch:[0/2](282500/4588595) loss:3.132 lr:0.0000100 epoch_Time:27240.0min: [2024-01-03 23:26:54,778][model8_pretrain.py][INFO] Epoch:[0/2](282500/4588595) loss:3.464 lr:0.0000100 epoch_Time:27240.0min: [2024-01-03 23:27:41,790][model8_pretrain.py][INFO] Epoch:[0/2](282600/4588595) loss:2.458 lr:0.0000100 epoch_Time:27243.0min: [2024-01-03 23:27:41,790][model8_pretrain.py][INFO] Epoch:[0/2](282600/4588595) loss:2.927 lr:0.0000100 epoch_Time:27243.0min: [2024-01-03 23:27:41,790][model8_pretrain.py][INFO] Epoch:[0/2](282600/4588595) loss:3.108 lr:0.0000100 epoch_Time:27243.0min: [2024-01-03 23:27:41,790][model8_pretrain.py][INFO] Epoch:[0/2](282600/4588595) loss:3.244 lr:0.0000100 epoch_Time:27243.0min: [2024-01-03 23:27:41,790][model8_pretrain.py][INFO] Epoch:[0/2](282600/4588595) loss:3.734 lr:0.0000100 epoch_Time:27243.0min: [2024-01-03 23:27:41,791][model8_pretrain.py][INFO] Epoch:[0/2](282600/4588595) loss:3.076 lr:0.0000100 epoch_Time:27243.0min: [2024-01-03 23:27:41,791][model8_pretrain.py][INFO] Epoch:[0/2](282600/4588595) loss:2.767 lr:0.0000100 epoch_Time:27243.0min: [2024-01-03 23:27:41,791][model8_pretrain.py][INFO] Epoch:[0/2](282600/4588595) loss:2.847 lr:0.0000100 
epoch_Time:27243.0min: [2024-01-03 23:28:18,672][model8_pretrain.py][INFO] Epoch:[0/2](282700/4588595) loss:2.507 lr:0.0000100 epoch_Time:27241.0min: [2024-01-03 23:28:18,672][model8_pretrain.py][INFO] Epoch:[0/2](282700/4588595) loss:2.807 lr:0.0000100 epoch_Time:27241.0min: [2024-01-03 23:28:18,672][model8_pretrain.py][INFO] Epoch:[0/2](282700/4588595) loss:2.943 lr:0.0000100 epoch_Time:27241.0min: [2024-01-03 23:28:18,672][model8_pretrain.py][INFO] Epoch:[0/2](282700/4588595) loss:2.732 lr:0.0000100 epoch_Time:27241.0min: [2024-01-03 23:28:18,672][model8_pretrain.py][INFO] Epoch:[0/2](282700/4588595) loss:3.046 lr:0.0000100 epoch_Time:27241.0min: [2024-01-03 23:28:18,672][model8_pretrain.py][INFO] Epoch:[0/2](282700/4588595) loss:2.817 lr:0.0000100 epoch_Time:27241.0min: [2024-01-03 23:28:18,672][model8_pretrain.py][INFO] Epoch:[0/2](282700/4588595) loss:2.599 lr:0.0000100 epoch_Time:27241.0min: [2024-01-03 23:28:18,672][model8_pretrain.py][INFO] Epoch:[0/2](282700/4588595) loss:2.382 lr:0.0000100 epoch_Time:27241.0min: [2024-01-03 23:28:55,603][model8_pretrain.py][INFO] Epoch:[0/2](282800/4588595) loss:2.755 lr:0.0000100 epoch_Time:27240.0min: [2024-01-03 23:28:55,603][model8_pretrain.py][INFO] Epoch:[0/2](282800/4588595) loss:2.978 lr:0.0000100 epoch_Time:27240.0min: [2024-01-03 23:28:55,603][model8_pretrain.py][INFO] Epoch:[0/2](282800/4588595) loss:2.350 lr:0.0000100 epoch_Time:27240.0min: [2024-01-03 23:28:55,603][model8_pretrain.py][INFO] Epoch:[0/2](282800/4588595) loss:2.248 lr:0.0000100 epoch_Time:27240.0min: [2024-01-03 23:28:55,603][model8_pretrain.py][INFO] Epoch:[0/2](282800/4588595) loss:2.560 lr:0.0000100 epoch_Time:27240.0min: [2024-01-03 23:28:55,603][model8_pretrain.py][INFO] Epoch:[0/2](282800/4588595) loss:2.308 lr:0.0000100 epoch_Time:27240.0min: [2024-01-03 23:28:55,603][model8_pretrain.py][INFO] Epoch:[0/2](282800/4588595) loss:3.012 lr:0.0000100 epoch_Time:27240.0min: [2024-01-03 23:28:55,604][model8_pretrain.py][INFO] 
Epoch:[0/2](282800/4588595) loss:3.293 lr:0.0000100 epoch_Time:27240.0min: [2024-01-03 23:29:32,544][model8_pretrain.py][INFO] Epoch:[0/2](282900/4588595) loss:2.879 lr:0.0000100 epoch_Time:27240.0min: [2024-01-03 23:29:32,544][model8_pretrain.py][INFO] Epoch:[0/2](282900/4588595) loss:2.606 lr:0.0000100 epoch_Time:27240.0min: [2024-01-03 23:29:32,544][model8_pretrain.py][INFO] Epoch:[0/2](282900/4588595) loss:3.390 lr:0.0000100 epoch_Time:27240.0min: [2024-01-03 23:29:32,544][model8_pretrain.py][INFO] Epoch:[0/2](282900/4588595) loss:2.014 lr:0.0000100 epoch_Time:27240.0min: [2024-01-03 23:29:32,545][model8_pretrain.py][INFO] Epoch:[0/2](282900/4588595) loss:3.083 lr:0.0000100 epoch_Time:27240.0min: [2024-01-03 23:29:32,545][model8_pretrain.py][INFO] Epoch:[0/2](282900/4588595) loss:2.834 lr:0.0000100 epoch_Time:27240.0min: [2024-01-03 23:29:32,545][model8_pretrain.py][INFO] Epoch:[0/2](282900/4588595) loss:3.140 lr:0.0000100 epoch_Time:27240.0min: [2024-01-03 23:29:32,545][model8_pretrain.py][INFO] Epoch:[0/2](282900/4588595) loss:3.147 lr:0.0000100 epoch_Time:27240.0min: [2024-01-03 23:30:09,491][model8_pretrain.py][INFO] Epoch:[0/2](283000/4588595) loss:2.800 lr:0.0000100 epoch_Time:27239.0min: [2024-01-03 23:30:09,491][model8_pretrain.py][INFO] Epoch:[0/2](283000/4588595) loss:2.557 lr:0.0000100 epoch_Time:27239.0min: [2024-01-03 23:30:09,491][model8_pretrain.py][INFO] Epoch:[0/2](283000/4588595) loss:2.679 lr:0.0000100 epoch_Time:27239.0min: [2024-01-03 23:30:09,491][model8_pretrain.py][INFO] Epoch:[0/2](283000/4588595) loss:3.067 lr:0.0000100 epoch_Time:27239.0min: [2024-01-03 23:30:09,491][model8_pretrain.py][INFO] Epoch:[0/2](283000/4588595) loss:3.131 lr:0.0000100 epoch_Time:27239.0min: [2024-01-03 23:30:09,491][model8_pretrain.py][INFO] Epoch:[0/2](283000/4588595) loss:3.402 lr:0.0000100 epoch_Time:27239.0min: [2024-01-03 23:30:09,491][model8_pretrain.py][INFO] Epoch:[0/2](283000/4588595) loss:3.307 lr:0.0000100 epoch_Time:27239.0min: [2024-01-03 
23:30:09,491][model8_pretrain.py][INFO] Epoch:[0/2](283000/4588595) loss:3.547 lr:0.0000100 epoch_Time:27239.0min: [2024-01-03 23:30:46,442][model8_pretrain.py][INFO] Epoch:[0/2](283100/4588595) loss:2.897 lr:0.0000100 epoch_Time:27238.0min: [2024-01-03 23:30:46,442][model8_pretrain.py][INFO] Epoch:[0/2](283100/4588595) loss:2.617 lr:0.0000100 epoch_Time:27238.0min: [2024-01-03 23:30:46,442][model8_pretrain.py][INFO] Epoch:[0/2](283100/4588595) loss:2.824 lr:0.0000100 epoch_Time:27238.0min: [2024-01-03 23:30:46,442][model8_pretrain.py][INFO] Epoch:[0/2](283100/4588595) loss:3.242 lr:0.0000100 epoch_Time:27238.0min: [2024-01-03 23:30:46,442][model8_pretrain.py][INFO] Epoch:[0/2](283100/4588595) loss:3.195 lr:0.0000100 epoch_Time:27238.0min: [2024-01-03 23:30:46,442][model8_pretrain.py][INFO] Epoch:[0/2](283100/4588595) loss:2.945 lr:0.0000100 epoch_Time:27238.0min: [2024-01-03 23:30:46,442][model8_pretrain.py][INFO] Epoch:[0/2](283100/4588595) loss:2.865 lr:0.0000100 epoch_Time:27238.0min: [2024-01-03 23:30:46,442][model8_pretrain.py][INFO] Epoch:[0/2](283100/4588595) loss:2.654 lr:0.0000100 epoch_Time:27238.0min: [2024-01-03 23:31:23,388][model8_pretrain.py][INFO] Epoch:[0/2](283200/4588595) loss:3.045 lr:0.0000100 epoch_Time:27237.0min: [2024-01-03 23:31:23,388][model8_pretrain.py][INFO] Epoch:[0/2](283200/4588595) loss:2.647 lr:0.0000100 epoch_Time:27237.0min: [2024-01-03 23:31:23,388][model8_pretrain.py][INFO] Epoch:[0/2](283200/4588595) loss:2.977 lr:0.0000100 epoch_Time:27237.0min: [2024-01-03 23:31:23,388][model8_pretrain.py][INFO] Epoch:[0/2](283200/4588595) loss:3.037 lr:0.0000100 epoch_Time:27237.0min: [2024-01-03 23:31:23,388][model8_pretrain.py][INFO] Epoch:[0/2](283200/4588595) loss:2.857 lr:0.0000100 epoch_Time:27237.0min: [2024-01-03 23:31:23,388][model8_pretrain.py][INFO] Epoch:[0/2](283200/4588595) loss:3.058 lr:0.0000100 epoch_Time:27237.0min: [2024-01-03 23:31:23,388][model8_pretrain.py][INFO] Epoch:[0/2](283200/4588595) loss:3.032 lr:0.0000100 
epoch_Time:27237.0min: [2024-01-03 23:31:23,388][model8_pretrain.py][INFO] Epoch:[0/2](283200/4588595) loss:2.934 lr:0.0000100 epoch_Time:27237.0min: [2024-01-03 23:32:00,341][model8_pretrain.py][INFO] Epoch:[0/2](283300/4588595) loss:2.553 lr:0.0000100 epoch_Time:27236.0min: [2024-01-03 23:32:00,341][model8_pretrain.py][INFO] Epoch:[0/2](283300/4588595) loss:3.432 lr:0.0000100 epoch_Time:27236.0min: [2024-01-03 23:32:00,341][model8_pretrain.py][INFO] Epoch:[0/2](283300/4588595) loss:2.257 lr:0.0000100 epoch_Time:27236.0min: [2024-01-03 23:32:00,341][model8_pretrain.py][INFO] Epoch:[0/2](283300/4588595) loss:2.586 lr:0.0000100 epoch_Time:27236.0min: [2024-01-03 23:32:00,341][model8_pretrain.py][INFO] Epoch:[0/2](283300/4588595) loss:2.942 lr:0.0000100 epoch_Time:27236.0min: [2024-01-03 23:32:00,341][model8_pretrain.py][INFO] Epoch:[0/2](283300/4588595) loss:2.956 lr:0.0000100 epoch_Time:27236.0min: [2024-01-03 23:32:00,341][model8_pretrain.py][INFO] Epoch:[0/2](283300/4588595) loss:3.095 lr:0.0000100 epoch_Time:27236.0min: [2024-01-03 23:32:00,341][model8_pretrain.py][INFO] Epoch:[0/2](283300/4588595) loss:2.703 lr:0.0000100 epoch_Time:27236.0min: [2024-01-03 23:32:47,094][model8_pretrain.py][INFO] Epoch:[0/2](283400/4588595) loss:2.533 lr:0.0000100 epoch_Time:27238.0min: [2024-01-03 23:32:47,094][model8_pretrain.py][INFO] Epoch:[0/2](283400/4588595) loss:2.860 lr:0.0000100 epoch_Time:27238.0min: [2024-01-03 23:32:47,094][model8_pretrain.py][INFO] Epoch:[0/2](283400/4588595) loss:2.617 lr:0.0000100 epoch_Time:27238.0min: [2024-01-03 23:32:47,094][model8_pretrain.py][INFO] Epoch:[0/2](283400/4588595) loss:2.746 lr:0.0000100 epoch_Time:27238.0min: [2024-01-03 23:32:47,094][model8_pretrain.py][INFO] Epoch:[0/2](283400/4588595) loss:3.093 lr:0.0000100 epoch_Time:27238.0min: [2024-01-03 23:32:47,094][model8_pretrain.py][INFO] Epoch:[0/2](283400/4588595) loss:2.925 lr:0.0000100 epoch_Time:27238.0min: [2024-01-03 23:32:47,095][model8_pretrain.py][INFO] 
Epoch:[0/2](283400/4588595) loss:2.285 lr:0.0000100 epoch_Time:27238.0min: [2024-01-03 23:32:47,095][model8_pretrain.py][INFO] Epoch:[0/2](283400/4588595) loss:2.973 lr:0.0000100 epoch_Time:27238.0min: [2024-01-03 23:33:24,023][model8_pretrain.py][INFO] Epoch:[0/2](283500/4588595) loss:2.343 lr:0.0000100 epoch_Time:27237.0min: [2024-01-03 23:33:24,023][model8_pretrain.py][INFO] Epoch:[0/2](283500/4588595) loss:2.698 lr:0.0000100 epoch_Time:27237.0min: [2024-01-03 23:33:24,023][model8_pretrain.py][INFO] Epoch:[0/2](283500/4588595) loss:3.027 lr:0.0000100 epoch_Time:27237.0min: [2024-01-03 23:33:24,023][model8_pretrain.py][INFO] Epoch:[0/2](283500/4588595) loss:3.076 lr:0.0000100 epoch_Time:27237.0min: [2024-01-03 23:33:24,023][model8_pretrain.py][INFO] Epoch:[0/2](283500/4588595) loss:3.152 lr:0.0000100 epoch_Time:27237.0min: [2024-01-03 23:33:24,023][model8_pretrain.py][INFO] Epoch:[0/2](283500/4588595) loss:2.783 lr:0.0000100 epoch_Time:27237.0min: [2024-01-03 23:33:24,023][model8_pretrain.py][INFO] Epoch:[0/2](283500/4588595) loss:3.133 lr:0.0000100 epoch_Time:27237.0min: [2024-01-03 23:33:24,023][model8_pretrain.py][INFO] Epoch:[0/2](283500/4588595) loss:2.438 lr:0.0000100 epoch_Time:27237.0min: [2024-01-03 23:34:00,960][model8_pretrain.py][INFO] Epoch:[0/2](283600/4588595) loss:2.979 lr:0.0000100 epoch_Time:27236.0min: [2024-01-03 23:34:00,960][model8_pretrain.py][INFO] Epoch:[0/2](283600/4588595) loss:2.889 lr:0.0000100 epoch_Time:27236.0min: [2024-01-03 23:34:00,960][model8_pretrain.py][INFO] Epoch:[0/2](283600/4588595) loss:2.486 lr:0.0000100 epoch_Time:27236.0min: [2024-01-03 23:34:00,960][model8_pretrain.py][INFO] Epoch:[0/2](283600/4588595) loss:3.028 lr:0.0000100 epoch_Time:27236.0min: [2024-01-03 23:34:00,960][model8_pretrain.py][INFO] Epoch:[0/2](283600/4588595) loss:2.965 lr:0.0000100 epoch_Time:27236.0min: [2024-01-03 23:34:00,960][model8_pretrain.py][INFO] Epoch:[0/2](283600/4588595) loss:3.002 lr:0.0000100 epoch_Time:27236.0min: [2024-01-03 
23:34:00,960][model8_pretrain.py][INFO] Epoch:[0/2](283600/4588595) loss:2.981 lr:0.0000100 epoch_Time:27236.0min: [2024-01-03 23:34:00,960][model8_pretrain.py][INFO] Epoch:[0/2](283600/4588595) loss:2.725 lr:0.0000100 epoch_Time:27236.0min: [2024-01-03 23:34:37,894][model8_pretrain.py][INFO] Epoch:[0/2](283700/4588595) loss:2.856 lr:0.0000100 epoch_Time:27235.0min: [2024-01-03 23:34:37,894][model8_pretrain.py][INFO] Epoch:[0/2](283700/4588595) loss:2.747 lr:0.0000100 epoch_Time:27235.0min: [2024-01-03 23:34:37,894][model8_pretrain.py][INFO] Epoch:[0/2](283700/4588595) loss:3.079 lr:0.0000100 epoch_Time:27235.0min: [2024-01-03 23:34:37,894][model8_pretrain.py][INFO] Epoch:[0/2](283700/4588595) loss:2.637 lr:0.0000100 epoch_Time:27235.0min: [2024-01-03 23:34:37,894][model8_pretrain.py][INFO] Epoch:[0/2](283700/4588595) loss:2.695 lr:0.0000100 epoch_Time:27235.0min: [2024-01-03 23:34:37,894][model8_pretrain.py][INFO] Epoch:[0/2](283700/4588595) loss:2.605 lr:0.0000100 epoch_Time:27235.0min: [2024-01-03 23:34:37,894][model8_pretrain.py][INFO] Epoch:[0/2](283700/4588595) loss:3.102 lr:0.0000100 epoch_Time:27235.0min: [2024-01-03 23:34:37,894][model8_pretrain.py][INFO] Epoch:[0/2](283700/4588595) loss:2.984 lr:0.0000100 epoch_Time:27235.0min: [2024-01-03 23:35:14,825][model8_pretrain.py][INFO] Epoch:[0/2](283800/4588595) loss:3.138 lr:0.0000100 epoch_Time:27234.0min: [2024-01-03 23:35:14,825][model8_pretrain.py][INFO] Epoch:[0/2](283800/4588595) loss:2.846 lr:0.0000100 epoch_Time:27234.0min: [2024-01-03 23:35:14,825][model8_pretrain.py][INFO] Epoch:[0/2](283800/4588595) loss:3.129 lr:0.0000100 epoch_Time:27234.0min: [2024-01-03 23:35:14,825][model8_pretrain.py][INFO] Epoch:[0/2](283800/4588595) loss:2.952 lr:0.0000100 epoch_Time:27234.0min: [2024-01-03 23:35:14,825][model8_pretrain.py][INFO] Epoch:[0/2](283800/4588595) loss:2.796 lr:0.0000100 epoch_Time:27234.0min: [2024-01-03 23:35:14,825][model8_pretrain.py][INFO] Epoch:[0/2](283800/4588595) loss:2.508 lr:0.0000100 
epoch_Time:27234.0min: [2024-01-03 23:35:14,825][model8_pretrain.py][INFO] Epoch:[0/2](283800/4588595) loss:2.818 lr:0.0000100 epoch_Time:27234.0min: [2024-01-03 23:35:14,825][model8_pretrain.py][INFO] Epoch:[0/2](283800/4588595) loss:3.023 lr:0.0000100 epoch_Time:27234.0min: [2024-01-03 23:35:51,762][model8_pretrain.py][INFO] Epoch:[0/2](283900/4588595) loss:2.894 lr:0.0000100 epoch_Time:27233.0min: [2024-01-03 23:35:51,762][model8_pretrain.py][INFO] Epoch:[0/2](283900/4588595) loss:3.327 lr:0.0000100 epoch_Time:27233.0min: [2024-01-03 23:35:51,763][model8_pretrain.py][INFO] Epoch:[0/2](283900/4588595) loss:3.287 lr:0.0000100 epoch_Time:27233.0min: [2024-01-03 23:35:51,763][model8_pretrain.py][INFO] Epoch:[0/2](283900/4588595) loss:3.281 lr:0.0000100 epoch_Time:27233.0min: [2024-01-03 23:35:51,763][model8_pretrain.py][INFO] Epoch:[0/2](283900/4588595) loss:2.445 lr:0.0000100 epoch_Time:27233.0min: [2024-01-03 23:35:51,763][model8_pretrain.py][INFO] Epoch:[0/2](283900/4588595) loss:2.812 lr:0.0000100 epoch_Time:27233.0min: [2024-01-03 23:35:51,763][model8_pretrain.py][INFO] Epoch:[0/2](283900/4588595) loss:3.016 lr:0.0000100 epoch_Time:27233.0min: [2024-01-03 23:35:51,763][model8_pretrain.py][INFO] Epoch:[0/2](283900/4588595) loss:2.864 lr:0.0000100 epoch_Time:27233.0min: [2024-01-03 23:36:28,705][model8_pretrain.py][INFO] Epoch:[0/2](284000/4588595) loss:3.060 lr:0.0000100 epoch_Time:27232.0min: [2024-01-03 23:36:28,705][model8_pretrain.py][INFO] Epoch:[0/2](284000/4588595) loss:3.082 lr:0.0000100 epoch_Time:27232.0min: [2024-01-03 23:36:28,705][model8_pretrain.py][INFO] Epoch:[0/2](284000/4588595) loss:2.809 lr:0.0000100 epoch_Time:27232.0min: [2024-01-03 23:36:28,705][model8_pretrain.py][INFO] Epoch:[0/2](284000/4588595) loss:2.948 lr:0.0000100 epoch_Time:27232.0min: [2024-01-03 23:36:28,705][model8_pretrain.py][INFO] Epoch:[0/2](284000/4588595) loss:2.748 lr:0.0000100 epoch_Time:27232.0min: [2024-01-03 23:36:28,705][model8_pretrain.py][INFO] 
Epoch:[0/2](284000/4588595) loss:3.255 lr:0.0000100 epoch_Time:27232.0min: [2024-01-03 23:36:28,705][model8_pretrain.py][INFO] Epoch:[0/2](284000/4588595) loss:3.296 lr:0.0000100 epoch_Time:27232.0min: [2024-01-03 23:36:28,705][model8_pretrain.py][INFO] Epoch:[0/2](284000/4588595) loss:3.137 lr:0.0000100 epoch_Time:27232.0min: [2024-01-03 23:37:05,643][model8_pretrain.py][INFO] Epoch:[0/2](284100/4588595) loss:2.900 lr:0.0000100 epoch_Time:27231.0min: [2024-01-03 23:37:05,643][model8_pretrain.py][INFO] Epoch:[0/2](284100/4588595) loss:3.049 lr:0.0000100 epoch_Time:27231.0min: [2024-01-03 23:37:05,643][model8_pretrain.py][INFO] Epoch:[0/2](284100/4588595) loss:2.951 lr:0.0000100 epoch_Time:27231.0min: [2024-01-03 23:37:05,643][model8_pretrain.py][INFO] Epoch:[0/2](284100/4588595) loss:2.719 lr:0.0000100 epoch_Time:27231.0min: [2024-01-03 23:37:05,643][model8_pretrain.py][INFO] Epoch:[0/2](284100/4588595) loss:3.101 lr:0.0000100 epoch_Time:27231.0min: [2024-01-03 23:37:05,643][model8_pretrain.py][INFO] Epoch:[0/2](284100/4588595) loss:2.966 lr:0.0000100 epoch_Time:27231.0min: [2024-01-03 23:37:05,643][model8_pretrain.py][INFO] Epoch:[0/2](284100/4588595) loss:3.027 lr:0.0000100 epoch_Time:27231.0min: [2024-01-03 23:37:05,643][model8_pretrain.py][INFO] Epoch:[0/2](284100/4588595) loss:2.694 lr:0.0000100 epoch_Time:27231.0min: [2024-01-03 23:37:52,305][model8_pretrain.py][INFO] Epoch:[0/2](284200/4588595) loss:2.892 lr:0.0000100 epoch_Time:27233.0min: [2024-01-03 23:37:52,305][model8_pretrain.py][INFO] Epoch:[0/2](284200/4588595) loss:3.082 lr:0.0000100 epoch_Time:27233.0min: [2024-01-03 23:37:52,305][model8_pretrain.py][INFO] Epoch:[0/2](284200/4588595) loss:3.198 lr:0.0000100 epoch_Time:27233.0min: [2024-01-03 23:37:52,305][model8_pretrain.py][INFO] Epoch:[0/2](284200/4588595) loss:2.520 lr:0.0000100 epoch_Time:27233.0min: [2024-01-03 23:37:52,305][model8_pretrain.py][INFO] Epoch:[0/2](284200/4588595) loss:3.172 lr:0.0000100 epoch_Time:27233.0min: [2024-01-03 
23:37:52,305][model8_pretrain.py][INFO] Epoch:[0/2](284200/4588595) loss:2.975 lr:0.0000100 epoch_Time:27233.0min: [2024-01-03 23:37:52,305][model8_pretrain.py][INFO] Epoch:[0/2](284200/4588595) loss:2.545 lr:0.0000100 epoch_Time:27233.0min: [2024-01-03 23:37:52,305][model8_pretrain.py][INFO] Epoch:[0/2](284200/4588595) loss:2.441 lr:0.0000100 epoch_Time:27233.0min: [2024-01-03 23:38:29,228][model8_pretrain.py][INFO] Epoch:[0/2](284300/4588595) loss:2.647 lr:0.0000100 epoch_Time:27232.0min: [2024-01-03 23:38:29,228][model8_pretrain.py][INFO] Epoch:[0/2](284300/4588595) loss:3.263 lr:0.0000100 epoch_Time:27232.0min: [2024-01-03 23:38:29,228][model8_pretrain.py][INFO] Epoch:[0/2](284300/4588595) loss:3.050 lr:0.0000100 epoch_Time:27232.0min: [2024-01-03 23:38:29,228][model8_pretrain.py][INFO] Epoch:[0/2](284300/4588595) loss:2.956 lr:0.0000100 epoch_Time:27232.0min: [2024-01-03 23:38:29,228][model8_pretrain.py][INFO] Epoch:[0/2](284300/4588595) loss:3.041 lr:0.0000100 epoch_Time:27232.0min: [2024-01-03 23:38:29,228][model8_pretrain.py][INFO] Epoch:[0/2](284300/4588595) loss:2.783 lr:0.0000100 epoch_Time:27232.0min: [2024-01-03 23:38:29,229][model8_pretrain.py][INFO] Epoch:[0/2](284300/4588595) loss:2.859 lr:0.0000100 epoch_Time:27232.0min: [2024-01-03 23:38:29,229][model8_pretrain.py][INFO] Epoch:[0/2](284300/4588595) loss:3.268 lr:0.0000100 epoch_Time:27232.0min: [2024-01-03 23:39:06,154][model8_pretrain.py][INFO] Epoch:[0/2](284400/4588595) loss:3.111 lr:0.0000100 epoch_Time:27231.0min: [2024-01-03 23:39:06,154][model8_pretrain.py][INFO] Epoch:[0/2](284400/4588595) loss:2.893 lr:0.0000100 epoch_Time:27231.0min: [2024-01-03 23:39:06,154][model8_pretrain.py][INFO] Epoch:[0/2](284400/4588595) loss:2.708 lr:0.0000100 epoch_Time:27231.0min: [2024-01-03 23:39:06,154][model8_pretrain.py][INFO] Epoch:[0/2](284400/4588595) loss:2.868 lr:0.0000100 epoch_Time:27231.0min: [2024-01-03 23:39:06,154][model8_pretrain.py][INFO] Epoch:[0/2](284400/4588595) loss:2.692 lr:0.0000100 
epoch_Time:27231.0min: [2024-01-03 23:39:06,154][model8_pretrain.py][INFO] Epoch:[0/2](284400/4588595) loss:2.869 lr:0.0000100 epoch_Time:27231.0min: [2024-01-03 23:39:06,154][model8_pretrain.py][INFO] Epoch:[0/2](284400/4588595) loss:3.198 lr:0.0000100 epoch_Time:27231.0min: [2024-01-03 23:39:06,155][model8_pretrain.py][INFO] Epoch:[0/2](284400/4588595) loss:3.045 lr:0.0000100 epoch_Time:27231.0min: [2024-01-03 23:39:43,084][model8_pretrain.py][INFO] Epoch:[0/2](284500/4588595) loss:2.836 lr:0.0000100 epoch_Time:27231.0min: [2024-01-03 23:39:43,084][model8_pretrain.py][INFO] Epoch:[0/2](284500/4588595) loss:2.738 lr:0.0000100 epoch_Time:27231.0min: [2024-01-03 23:39:43,084][model8_pretrain.py][INFO] Epoch:[0/2](284500/4588595) loss:3.393 lr:0.0000100 epoch_Time:27231.0min: [2024-01-03 23:39:43,084][model8_pretrain.py][INFO] Epoch:[0/2](284500/4588595) loss:2.120 lr:0.0000100 epoch_Time:27231.0min: [2024-01-03 23:39:43,084][model8_pretrain.py][INFO] Epoch:[0/2](284500/4588595) loss:3.030 lr:0.0000100 epoch_Time:27231.0min: [2024-01-03 23:39:43,084][model8_pretrain.py][INFO] Epoch:[0/2](284500/4588595) loss:3.055 lr:0.0000100 epoch_Time:27231.0min: [2024-01-03 23:39:43,084][model8_pretrain.py][INFO] Epoch:[0/2](284500/4588595) loss:2.997 lr:0.0000100 epoch_Time:27231.0min: [2024-01-03 23:39:43,084][model8_pretrain.py][INFO] Epoch:[0/2](284500/4588595) loss:2.234 lr:0.0000100 epoch_Time:27231.0min: [2024-01-03 23:40:20,017][model8_pretrain.py][INFO] Epoch:[0/2](284600/4588595) loss:2.896 lr:0.0000100 epoch_Time:27229.0min: [2024-01-03 23:40:20,017][model8_pretrain.py][INFO] Epoch:[0/2](284600/4588595) loss:2.847 lr:0.0000100 epoch_Time:27229.0min: [2024-01-03 23:40:20,017][model8_pretrain.py][INFO] Epoch:[0/2](284600/4588595) loss:2.568 lr:0.0000100 epoch_Time:27229.0min: [2024-01-03 23:40:20,017][model8_pretrain.py][INFO] Epoch:[0/2](284600/4588595) loss:3.153 lr:0.0000100 epoch_Time:27229.0min: [2024-01-03 23:40:20,017][model8_pretrain.py][INFO] 
Epoch:[0/2](284600/4588595) loss:3.071 lr:0.0000100 epoch_Time:27229.0min: [2024-01-03 23:40:20,017][model8_pretrain.py][INFO] Epoch:[0/2](284600/4588595) loss:2.730 lr:0.0000100 epoch_Time:27229.0min: [2024-01-03 23:40:20,017][model8_pretrain.py][INFO] Epoch:[0/2](284600/4588595) loss:3.044 lr:0.0000100 epoch_Time:27229.0min: [2024-01-03 23:40:20,017][model8_pretrain.py][INFO] Epoch:[0/2](284600/4588595) loss:3.315 lr:0.0000100 epoch_Time:27229.0min: [2024-01-03 23:40:56,963][model8_pretrain.py][INFO] Epoch:[0/2](284700/4588595) loss:2.413 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:40:56,963][model8_pretrain.py][INFO] Epoch:[0/2](284700/4588595) loss:3.137 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:40:56,963][model8_pretrain.py][INFO] Epoch:[0/2](284700/4588595) loss:3.247 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:40:56,963][model8_pretrain.py][INFO] Epoch:[0/2](284700/4588595) loss:2.673 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:40:56,963][model8_pretrain.py][INFO] Epoch:[0/2](284700/4588595) loss:2.745 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:40:56,963][model8_pretrain.py][INFO] Epoch:[0/2](284700/4588595) loss:3.200 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:40:56,963][model8_pretrain.py][INFO] Epoch:[0/2](284700/4588595) loss:2.565 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:40:56,963][model8_pretrain.py][INFO] Epoch:[0/2](284700/4588595) loss:2.610 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:41:33,873][model8_pretrain.py][INFO] Epoch:[0/2](284800/4588595) loss:2.744 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:41:33,873][model8_pretrain.py][INFO] Epoch:[0/2](284800/4588595) loss:3.198 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:41:33,873][model8_pretrain.py][INFO] Epoch:[0/2](284800/4588595) loss:2.443 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:41:33,873][model8_pretrain.py][INFO] Epoch:[0/2](284800/4588595) loss:2.912 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 
23:41:33,873][model8_pretrain.py][INFO] Epoch:[0/2](284800/4588595) loss:2.564 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:41:33,873][model8_pretrain.py][INFO] Epoch:[0/2](284800/4588595) loss:3.338 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:41:33,873][model8_pretrain.py][INFO] Epoch:[0/2](284800/4588595) loss:3.109 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:41:33,873][model8_pretrain.py][INFO] Epoch:[0/2](284800/4588595) loss:3.036 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:42:10,808][model8_pretrain.py][INFO] Epoch:[0/2](284900/4588595) loss:2.551 lr:0.0000100 epoch_Time:27227.0min: [2024-01-03 23:42:10,808][model8_pretrain.py][INFO] Epoch:[0/2](284900/4588595) loss:2.406 lr:0.0000100 epoch_Time:27227.0min: [2024-01-03 23:42:10,808][model8_pretrain.py][INFO] Epoch:[0/2](284900/4588595) loss:3.131 lr:0.0000100 epoch_Time:27227.0min: [2024-01-03 23:42:10,808][model8_pretrain.py][INFO] Epoch:[0/2](284900/4588595) loss:2.780 lr:0.0000100 epoch_Time:27227.0min: [2024-01-03 23:42:10,808][model8_pretrain.py][INFO] Epoch:[0/2](284900/4588595) loss:3.161 lr:0.0000100 epoch_Time:27227.0min: [2024-01-03 23:42:10,808][model8_pretrain.py][INFO] Epoch:[0/2](284900/4588595) loss:2.854 lr:0.0000100 epoch_Time:27227.0min: [2024-01-03 23:42:10,808][model8_pretrain.py][INFO] Epoch:[0/2](284900/4588595) loss:2.556 lr:0.0000100 epoch_Time:27227.0min: [2024-01-03 23:42:10,808][model8_pretrain.py][INFO] Epoch:[0/2](284900/4588595) loss:2.946 lr:0.0000100 epoch_Time:27227.0min: [2024-01-03 23:42:57,662][model8_pretrain.py][INFO] Epoch:[0/2](285000/4588595) loss:2.489 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:42:57,662][model8_pretrain.py][INFO] Epoch:[0/2](285000/4588595) loss:2.842 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:42:57,662][model8_pretrain.py][INFO] Epoch:[0/2](285000/4588595) loss:2.483 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:42:57,662][model8_pretrain.py][INFO] Epoch:[0/2](285000/4588595) loss:2.418 lr:0.0000100 
epoch_Time:27228.0min: [2024-01-03 23:42:57,662][model8_pretrain.py][INFO] Epoch:[0/2](285000/4588595) loss:3.125 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:42:57,662][model8_pretrain.py][INFO] Epoch:[0/2](285000/4588595) loss:2.819 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:42:57,662][model8_pretrain.py][INFO] Epoch:[0/2](285000/4588595) loss:2.522 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:42:57,662][model8_pretrain.py][INFO] Epoch:[0/2](285000/4588595) loss:2.677 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:43:34,586][model8_pretrain.py][INFO] Epoch:[0/2](285100/4588595) loss:2.657 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:43:34,586][model8_pretrain.py][INFO] Epoch:[0/2](285100/4588595) loss:2.482 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:43:34,586][model8_pretrain.py][INFO] Epoch:[0/2](285100/4588595) loss:2.578 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:43:34,586][model8_pretrain.py][INFO] Epoch:[0/2](285100/4588595) loss:2.782 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:43:34,586][model8_pretrain.py][INFO] Epoch:[0/2](285100/4588595) loss:2.824 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:43:34,586][model8_pretrain.py][INFO] Epoch:[0/2](285100/4588595) loss:3.170 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:43:34,586][model8_pretrain.py][INFO] Epoch:[0/2](285100/4588595) loss:3.198 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:43:34,586][model8_pretrain.py][INFO] Epoch:[0/2](285100/4588595) loss:3.235 lr:0.0000100 epoch_Time:27228.0min: [2024-01-03 23:44:11,514][model8_pretrain.py][INFO] Epoch:[0/2](285200/4588595) loss:2.829 lr:0.0000100 epoch_Time:27226.0min: [2024-01-03 23:44:11,514][model8_pretrain.py][INFO] Epoch:[0/2](285200/4588595) loss:3.579 lr:0.0000100 epoch_Time:27226.0min: [2024-01-03 23:44:11,514][model8_pretrain.py][INFO] Epoch:[0/2](285200/4588595) loss:2.873 lr:0.0000100 epoch_Time:27226.0min: [2024-01-03 23:44:11,514][model8_pretrain.py][INFO] 
Epoch:[0/2](285200/4588595) loss:2.866 lr:0.0000100 epoch_Time:27226.0min: [2024-01-03 23:44:11,514][model8_pretrain.py][INFO] Epoch:[0/2](285200/4588595) loss:2.692 lr:0.0000100 epoch_Time:27226.0min: [2024-01-03 23:44:11,514][model8_pretrain.py][INFO] Epoch:[0/2](285200/4588595) loss:3.271 lr:0.0000100 epoch_Time:27226.0min: [2024-01-03 23:44:11,514][model8_pretrain.py][INFO] Epoch:[0/2](285200/4588595) loss:2.801 lr:0.0000100 epoch_Time:27226.0min: [2024-01-03 23:44:11,515][model8_pretrain.py][INFO] Epoch:[0/2](285200/4588595) loss:3.028 lr:0.0000100 epoch_Time:27226.0min: [2024-01-03 23:44:48,452][model8_pretrain.py][INFO] Epoch:[0/2](285300/4588595) loss:3.085 lr:0.0000100 epoch_Time:27225.0min: [2024-01-03 23:44:48,452][model8_pretrain.py][INFO] Epoch:[0/2](285300/4588595) loss:2.673 lr:0.0000100 epoch_Time:27225.0min: [2024-01-03 23:44:48,452][model8_pretrain.py][INFO] Epoch:[0/2](285300/4588595) loss:2.473 lr:0.0000100 epoch_Time:27225.0min: [2024-01-03 23:44:48,452][model8_pretrain.py][INFO] Epoch:[0/2](285300/4588595) loss:2.933 lr:0.0000100 epoch_Time:27225.0min: [2024-01-03 23:44:48,452][model8_pretrain.py][INFO] Epoch:[0/2](285300/4588595) loss:2.703 lr:0.0000100 epoch_Time:27225.0min: [2024-01-03 23:44:48,452][model8_pretrain.py][INFO] Epoch:[0/2](285300/4588595) loss:2.798 lr:0.0000100 epoch_Time:27225.0min: [2024-01-03 23:44:48,452][model8_pretrain.py][INFO] Epoch:[0/2](285300/4588595) loss:3.098 lr:0.0000100 epoch_Time:27225.0min: [2024-01-03 23:44:48,452][model8_pretrain.py][INFO] Epoch:[0/2](285300/4588595) loss:3.231 lr:0.0000100 epoch_Time:27225.0min: [2024-01-03 23:45:25,388][model8_pretrain.py][INFO] Epoch:[0/2](285400/4588595) loss:2.969 lr:0.0000100 epoch_Time:27225.0min: [2024-01-03 23:45:25,388][model8_pretrain.py][INFO] Epoch:[0/2](285400/4588595) loss:2.706 lr:0.0000100 epoch_Time:27225.0min: [2024-01-03 23:45:25,388][model8_pretrain.py][INFO] Epoch:[0/2](285400/4588595) loss:2.910 lr:0.0000100 epoch_Time:27225.0min: [2024-01-03 
23:45:25,388][model8_pretrain.py][INFO] Epoch:[0/2](285400/4588595) loss:2.896 lr:0.0000100 epoch_Time:27225.0min: [2024-01-03 23:45:25,388][model8_pretrain.py][INFO] Epoch:[0/2](285400/4588595) loss:3.066 lr:0.0000100 epoch_Time:27225.0min: [2024-01-03 23:45:25,388][model8_pretrain.py][INFO] Epoch:[0/2](285400/4588595) loss:3.151 lr:0.0000100 epoch_Time:27225.0min: [2024-01-03 23:45:25,388][model8_pretrain.py][INFO] Epoch:[0/2](285400/4588595) loss:3.196 lr:0.0000100 epoch_Time:27225.0min: [2024-01-03 23:45:25,389][model8_pretrain.py][INFO] Epoch:[0/2](285400/4588595) loss:3.109 lr:0.0000100 epoch_Time:27225.0min: [2024-01-03 23:46:02,334][model8_pretrain.py][INFO] Epoch:[0/2](285500/4588595) loss:2.766 lr:0.0000100 epoch_Time:27224.0min: [2024-01-03 23:46:02,334][model8_pretrain.py][INFO] Epoch:[0/2](285500/4588595) loss:3.218 lr:0.0000100 epoch_Time:27224.0min: [2024-01-03 23:46:02,334][model8_pretrain.py][INFO] Epoch:[0/2](285500/4588595) loss:2.496 lr:0.0000100 epoch_Time:27224.0min: [2024-01-03 23:46:02,334][model8_pretrain.py][INFO] Epoch:[0/2](285500/4588595) loss:3.337 lr:0.0000100 epoch_Time:27224.0min: [2024-01-03 23:46:02,334][model8_pretrain.py][INFO] Epoch:[0/2](285500/4588595) loss:2.929 lr:0.0000100 epoch_Time:27224.0min: [2024-01-03 23:46:02,334][model8_pretrain.py][INFO] Epoch:[0/2](285500/4588595) loss:2.346 lr:0.0000100 epoch_Time:27224.0min: [2024-01-03 23:46:02,334][model8_pretrain.py][INFO] Epoch:[0/2](285500/4588595) loss:3.148 lr:0.0000100 epoch_Time:27224.0min: [2024-01-03 23:46:02,334][model8_pretrain.py][INFO] Epoch:[0/2](285500/4588595) loss:3.018 lr:0.0000100 epoch_Time:27224.0min: [2024-01-03 23:46:39,272][model8_pretrain.py][INFO] Epoch:[0/2](285600/4588595) loss:2.686 lr:0.0000100 epoch_Time:27223.0min: [2024-01-03 23:46:39,272][model8_pretrain.py][INFO] Epoch:[0/2](285600/4588595) loss:2.220 lr:0.0000100 epoch_Time:27223.0min: [2024-01-03 23:46:39,272][model8_pretrain.py][INFO] Epoch:[0/2](285600/4588595) loss:2.829 lr:0.0000100 
epoch_Time:27223.0min: [2024-01-03 23:46:39,272][model8_pretrain.py][INFO] Epoch:[0/2](285600/4588595) loss:3.134 lr:0.0000100 epoch_Time:27223.0min: [2024-01-03 23:46:39,272][model8_pretrain.py][INFO] Epoch:[0/2](285600/4588595) loss:2.520 lr:0.0000100 epoch_Time:27223.0min: [2024-01-03 23:46:39,272][model8_pretrain.py][INFO] Epoch:[0/2](285600/4588595) loss:2.677 lr:0.0000100 epoch_Time:27223.0min: [2024-01-03 23:46:39,272][model8_pretrain.py][INFO] Epoch:[0/2](285600/4588595) loss:3.242 lr:0.0000100 epoch_Time:27223.0min: [2024-01-03 23:46:39,272][model8_pretrain.py][INFO] Epoch:[0/2](285600/4588595) loss:2.634 lr:0.0000100 epoch_Time:27223.0min: [2024-01-03 23:47:16,210][model8_pretrain.py][INFO] Epoch:[0/2](285700/4588595) loss:3.023 lr:0.0000100 epoch_Time:27222.0min: [2024-01-03 23:47:16,210][model8_pretrain.py][INFO] Epoch:[0/2](285700/4588595) loss:3.333 lr:0.0000100 epoch_Time:27222.0min: [2024-01-03 23:47:16,210][model8_pretrain.py][INFO] Epoch:[0/2](285700/4588595) loss:3.116 lr:0.0000100 epoch_Time:27222.0min: [2024-01-03 23:47:16,210][model8_pretrain.py][INFO] Epoch:[0/2](285700/4588595) loss:3.032 lr:0.0000100 epoch_Time:27222.0min: [2024-01-03 23:47:16,210][model8_pretrain.py][INFO] Epoch:[0/2](285700/4588595) loss:2.959 lr:0.0000100 epoch_Time:27222.0min: [2024-01-03 23:47:16,210][model8_pretrain.py][INFO] Epoch:[0/2](285700/4588595) loss:2.571 lr:0.0000100 epoch_Time:27222.0min: [2024-01-03 23:47:16,210][model8_pretrain.py][INFO] Epoch:[0/2](285700/4588595) loss:2.916 lr:0.0000100 epoch_Time:27222.0min: [2024-01-03 23:47:16,210][model8_pretrain.py][INFO] Epoch:[0/2](285700/4588595) loss:2.810 lr:0.0000100 epoch_Time:27222.0min: [2024-01-03 23:48:03,227][model8_pretrain.py][INFO] Epoch:[0/2](285800/4588595) loss:3.010 lr:0.0000100 epoch_Time:27223.0min: [2024-01-03 23:48:03,227][model8_pretrain.py][INFO] Epoch:[0/2](285800/4588595) loss:2.928 lr:0.0000100 epoch_Time:27223.0min: [2024-01-03 23:48:03,227][model8_pretrain.py][INFO] 
Epoch:[0/2](285800/4588595) loss:2.446 lr:0.0000100 epoch_Time:27224.0min: [2024-01-03 23:48:03,227][model8_pretrain.py][INFO] Epoch:[0/2](285800/4588595) loss:3.303 lr:0.0000100 epoch_Time:27224.0min: [2024-01-03 23:48:03,227][model8_pretrain.py][INFO] Epoch:[0/2](285800/4588595) loss:2.857 lr:0.0000100 epoch_Time:27224.0min: [2024-01-03 23:48:03,227][model8_pretrain.py][INFO] Epoch:[0/2](285800/4588595) loss:2.623 lr:0.0000100 epoch_Time:27224.0min: [2024-01-03 23:48:03,227][model8_pretrain.py][INFO] Epoch:[0/2](285800/4588595) loss:3.091 lr:0.0000100 epoch_Time:27224.0min: [2024-01-03 23:48:03,228][model8_pretrain.py][INFO] Epoch:[0/2](285800/4588595) loss:3.015 lr:0.0000100 epoch_Time:27224.0min: [2024-01-03 23:48:40,155][model8_pretrain.py][INFO] Epoch:[0/2](285900/4588595) loss:3.206 lr:0.0000100 epoch_Time:27223.0min: [2024-01-03 23:48:40,155][model8_pretrain.py][INFO] Epoch:[0/2](285900/4588595) loss:2.517 lr:0.0000100 epoch_Time:27223.0min: [2024-01-03 23:48:40,155][model8_pretrain.py][INFO] Epoch:[0/2](285900/4588595) loss:2.940 lr:0.0000100 epoch_Time:27223.0min: [2024-01-03 23:48:40,155][model8_pretrain.py][INFO] Epoch:[0/2](285900/4588595) loss:2.826 lr:0.0000100 epoch_Time:27223.0min: [2024-01-03 23:48:40,155][model8_pretrain.py][INFO] Epoch:[0/2](285900/4588595) loss:3.221 lr:0.0000100 epoch_Time:27223.0min: [2024-01-03 23:48:40,155][model8_pretrain.py][INFO] Epoch:[0/2](285900/4588595) loss:2.854 lr:0.0000100 epoch_Time:27223.0min: [2024-01-03 23:48:40,155][model8_pretrain.py][INFO] Epoch:[0/2](285900/4588595) loss:2.788 lr:0.0000100 epoch_Time:27223.0min: [2024-01-03 23:48:40,155][model8_pretrain.py][INFO] Epoch:[0/2](285900/4588595) loss:3.061 lr:0.0000100 epoch_Time:27223.0min: [2024-01-03 23:49:17,090][model8_pretrain.py][INFO] Epoch:[0/2](286000/4588595) loss:2.624 lr:0.0000100 epoch_Time:27222.0min: [2024-01-03 23:49:17,090][model8_pretrain.py][INFO] Epoch:[0/2](286000/4588595) loss:2.848 lr:0.0000100 epoch_Time:27222.0min: [2024-01-03 
23:49:17,090][model8_pretrain.py][INFO] Epoch:[0/2](286000/4588595) loss:2.798 lr:0.0000100 epoch_Time:27222.0min: [2024-01-03 23:49:17,090][model8_pretrain.py][INFO] Epoch:[0/2](286000/4588595) loss:2.907 lr:0.0000100 epoch_Time:27222.0min: [2024-01-03 23:49:17,090][model8_pretrain.py][INFO] Epoch:[0/2](286000/4588595) loss:2.998 lr:0.0000100 epoch_Time:27222.0min: [2024-01-03 23:49:17,090][model8_pretrain.py][INFO] Epoch:[0/2](286000/4588595) loss:2.880 lr:0.0000100 epoch_Time:27222.0min: [2024-01-03 23:49:17,090][model8_pretrain.py][INFO] Epoch:[0/2](286000/4588595) loss:3.184 lr:0.0000100 epoch_Time:27222.0min: [2024-01-03 23:49:17,090][model8_pretrain.py][INFO] Epoch:[0/2](286000/4588595) loss:2.588 lr:0.0000100 epoch_Time:27222.0min: [2024-01-03 23:49:54,024][model8_pretrain.py][INFO] Epoch:[0/2](286100/4588595) loss:3.348 lr:0.0000100 epoch_Time:27221.0min: [2024-01-03 23:49:54,024][model8_pretrain.py][INFO] Epoch:[0/2](286100/4588595) loss:3.145 lr:0.0000100 epoch_Time:27221.0min: [2024-01-03 23:49:54,024][model8_pretrain.py][INFO] Epoch:[0/2](286100/4588595) loss:2.728 lr:0.0000100 epoch_Time:27221.0min: [2024-01-03 23:49:54,024][model8_pretrain.py][INFO] Epoch:[0/2](286100/4588595) loss:2.964 lr:0.0000100 epoch_Time:27221.0min: [2024-01-03 23:49:54,024][model8_pretrain.py][INFO] Epoch:[0/2](286100/4588595) loss:2.867 lr:0.0000100 epoch_Time:27221.0min: [2024-01-03 23:49:54,024][model8_pretrain.py][INFO] Epoch:[0/2](286100/4588595) loss:2.965 lr:0.0000100 epoch_Time:27221.0min: [2024-01-03 23:49:54,025][model8_pretrain.py][INFO] Epoch:[0/2](286100/4588595) loss:3.259 lr:0.0000100 epoch_Time:27221.0min: [2024-01-03 23:49:54,026][model8_pretrain.py][INFO] Epoch:[0/2](286100/4588595) loss:2.892 lr:0.0000100 epoch_Time:27221.0min: [2024-01-03 23:50:30,969][model8_pretrain.py][INFO] Epoch:[0/2](286200/4588595) loss:3.061 lr:0.0000100 epoch_Time:27220.0min: [2024-01-03 23:50:30,969][model8_pretrain.py][INFO] Epoch:[0/2](286200/4588595) loss:3.059 lr:0.0000100 
epoch_Time:27220.0min: [2024-01-03 23:50:30,969][model8_pretrain.py][INFO] Epoch:[0/2](286200/4588595) loss:2.848 lr:0.0000100 epoch_Time:27220.0min: [2024-01-03 23:50:30,969][model8_pretrain.py][INFO] Epoch:[0/2](286200/4588595) loss:2.958 lr:0.0000100 epoch_Time:27220.0min: [2024-01-03 23:50:30,969][model8_pretrain.py][INFO] Epoch:[0/2](286200/4588595) loss:3.364 lr:0.0000100 epoch_Time:27220.0min: [2024-01-03 23:50:30,969][model8_pretrain.py][INFO] Epoch:[0/2](286200/4588595) loss:2.879 lr:0.0000100 epoch_Time:27220.0min: [2024-01-03 23:50:30,969][model8_pretrain.py][INFO] Epoch:[0/2](286200/4588595) loss:2.786 lr:0.0000100 epoch_Time:27220.0min: [2024-01-03 23:50:30,969][model8_pretrain.py][INFO] Epoch:[0/2](286200/4588595) loss:2.732 lr:0.0000100 epoch_Time:27220.0min: [2024-01-03 23:51:07,908][model8_pretrain.py][INFO] Epoch:[0/2](286300/4588595) loss:3.441 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:51:07,908][model8_pretrain.py][INFO] Epoch:[0/2](286300/4588595) loss:3.029 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:51:07,908][model8_pretrain.py][INFO] Epoch:[0/2](286300/4588595) loss:2.532 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:51:07,908][model8_pretrain.py][INFO] Epoch:[0/2](286300/4588595) loss:3.096 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:51:07,908][model8_pretrain.py][INFO] Epoch:[0/2](286300/4588595) loss:3.407 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:51:07,908][model8_pretrain.py][INFO] Epoch:[0/2](286300/4588595) loss:1.747 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:51:07,908][model8_pretrain.py][INFO] Epoch:[0/2](286300/4588595) loss:2.536 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:51:07,908][model8_pretrain.py][INFO] Epoch:[0/2](286300/4588595) loss:3.037 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:51:44,843][model8_pretrain.py][INFO] Epoch:[0/2](286400/4588595) loss:2.304 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:51:44,843][model8_pretrain.py][INFO] 
Epoch:[0/2](286400/4588595) loss:2.765 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:51:44,843][model8_pretrain.py][INFO] Epoch:[0/2](286400/4588595) loss:3.113 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:51:44,843][model8_pretrain.py][INFO] Epoch:[0/2](286400/4588595) loss:2.544 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:51:44,843][model8_pretrain.py][INFO] Epoch:[0/2](286400/4588595) loss:2.688 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:51:44,843][model8_pretrain.py][INFO] Epoch:[0/2](286400/4588595) loss:2.959 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:51:44,843][model8_pretrain.py][INFO] Epoch:[0/2](286400/4588595) loss:2.949 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:51:44,844][model8_pretrain.py][INFO] Epoch:[0/2](286400/4588595) loss:2.189 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:52:21,778][model8_pretrain.py][INFO] Epoch:[0/2](286500/4588595) loss:2.575 lr:0.0000100 epoch_Time:27218.0min: [2024-01-03 23:52:21,778][model8_pretrain.py][INFO] Epoch:[0/2](286500/4588595) loss:2.924 lr:0.0000100 epoch_Time:27218.0min: [2024-01-03 23:52:21,778][model8_pretrain.py][INFO] Epoch:[0/2](286500/4588595) loss:3.168 lr:0.0000100 epoch_Time:27218.0min: [2024-01-03 23:52:21,778][model8_pretrain.py][INFO] Epoch:[0/2](286500/4588595) loss:3.003 lr:0.0000100 epoch_Time:27218.0min: [2024-01-03 23:52:21,778][model8_pretrain.py][INFO] Epoch:[0/2](286500/4588595) loss:2.957 lr:0.0000100 epoch_Time:27218.0min: [2024-01-03 23:52:21,778][model8_pretrain.py][INFO] Epoch:[0/2](286500/4588595) loss:2.654 lr:0.0000100 epoch_Time:27218.0min: [2024-01-03 23:52:21,779][model8_pretrain.py][INFO] Epoch:[0/2](286500/4588595) loss:2.941 lr:0.0000100 epoch_Time:27218.0min: [2024-01-03 23:52:21,779][model8_pretrain.py][INFO] Epoch:[0/2](286500/4588595) loss:3.187 lr:0.0000100 epoch_Time:27218.0min: [2024-01-03 23:53:08,931][model8_pretrain.py][INFO] Epoch:[0/2](286600/4588595) loss:2.956 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 
23:53:08,931][model8_pretrain.py][INFO] Epoch:[0/2](286600/4588595) loss:2.250 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:53:08,931][model8_pretrain.py][INFO] Epoch:[0/2](286600/4588595) loss:2.757 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:53:08,931][model8_pretrain.py][INFO] Epoch:[0/2](286600/4588595) loss:3.447 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:53:08,931][model8_pretrain.py][INFO] Epoch:[0/2](286600/4588595) loss:3.200 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:53:08,931][model8_pretrain.py][INFO] Epoch:[0/2](286600/4588595) loss:2.641 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:53:08,931][model8_pretrain.py][INFO] Epoch:[0/2](286600/4588595) loss:3.372 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:53:08,931][model8_pretrain.py][INFO] Epoch:[0/2](286600/4588595) loss:2.606 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:53:45,854][model8_pretrain.py][INFO] Epoch:[0/2](286700/4588595) loss:2.905 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:53:45,854][model8_pretrain.py][INFO] Epoch:[0/2](286700/4588595) loss:2.978 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:53:45,854][model8_pretrain.py][INFO] Epoch:[0/2](286700/4588595) loss:3.324 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:53:45,854][model8_pretrain.py][INFO] Epoch:[0/2](286700/4588595) loss:2.932 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:53:45,854][model8_pretrain.py][INFO] Epoch:[0/2](286700/4588595) loss:3.203 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:53:45,854][model8_pretrain.py][INFO] Epoch:[0/2](286700/4588595) loss:3.010 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:53:45,854][model8_pretrain.py][INFO] Epoch:[0/2](286700/4588595) loss:3.293 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:53:45,854][model8_pretrain.py][INFO] Epoch:[0/2](286700/4588595) loss:3.161 lr:0.0000100 epoch_Time:27219.0min: [2024-01-03 23:54:22,804][model8_pretrain.py][INFO] Epoch:[0/2](286800/4588595) loss:2.644 lr:0.0000100 
epoch_Time:27217.0min: [2024-01-03 23:54:22,805][model8_pretrain.py][INFO] Epoch:[0/2](286800/4588595) loss:2.462 lr:0.0000100 epoch_Time:27217.0min: [2024-01-03 23:54:22,805][model8_pretrain.py][INFO] Epoch:[0/2](286800/4588595) loss:2.464 lr:0.0000100 epoch_Time:27217.0min: [2024-01-03 23:54:22,805][model8_pretrain.py][INFO] Epoch:[0/2](286800/4588595) loss:2.990 lr:0.0000100 epoch_Time:27217.0min: [2024-01-03 23:54:22,805][model8_pretrain.py][INFO] Epoch:[0/2](286800/4588595) loss:3.127 lr:0.0000100 epoch_Time:27217.0min: [2024-01-03 23:54:22,805][model8_pretrain.py][INFO] Epoch:[0/2](286800/4588595) loss:3.576 lr:0.0000100 epoch_Time:27217.0min: [2024-01-03 23:54:22,805][model8_pretrain.py][INFO] Epoch:[0/2](286800/4588595) loss:2.851 lr:0.0000100 epoch_Time:27217.0min: [2024-01-03 23:54:22,805][model8_pretrain.py][INFO] Epoch:[0/2](286800/4588595) loss:3.004 lr:0.0000100 epoch_Time:27217.0min: [2024-01-03 23:54:59,745][model8_pretrain.py][INFO] Epoch:[0/2](286900/4588595) loss:2.605 lr:0.0000100 epoch_Time:27216.0min: [2024-01-03 23:54:59,745][model8_pretrain.py][INFO] Epoch:[0/2](286900/4588595) loss:3.216 lr:0.0000100 epoch_Time:27216.0min: [2024-01-03 23:54:59,745][model8_pretrain.py][INFO] Epoch:[0/2](286900/4588595) loss:2.314 lr:0.0000100 epoch_Time:27216.0min: [2024-01-03 23:54:59,745][model8_pretrain.py][INFO] Epoch:[0/2](286900/4588595) loss:2.646 lr:0.0000100 epoch_Time:27216.0min: [2024-01-03 23:54:59,745][model8_pretrain.py][INFO] Epoch:[0/2](286900/4588595) loss:2.452 lr:0.0000100 epoch_Time:27216.0min: [2024-01-03 23:54:59,745][model8_pretrain.py][INFO] Epoch:[0/2](286900/4588595) loss:2.815 lr:0.0000100 epoch_Time:27216.0min: [2024-01-03 23:54:59,745][model8_pretrain.py][INFO] Epoch:[0/2](286900/4588595) loss:3.008 lr:0.0000100 epoch_Time:27216.0min: [2024-01-03 23:54:59,745][model8_pretrain.py][INFO] Epoch:[0/2](286900/4588595) loss:2.910 lr:0.0000100 epoch_Time:27216.0min: [2024-01-03 23:55:36,679][model8_pretrain.py][INFO] 
Epoch:[0/2](287000/4588595) loss:2.411 lr:0.0000100 epoch_Time:27216.0min: [2024-01-03 23:55:36,679][model8_pretrain.py][INFO] Epoch:[0/2](287000/4588595) loss:2.970 lr:0.0000100 epoch_Time:27216.0min: [2024-01-03 23:55:36,679][model8_pretrain.py][INFO] Epoch:[0/2](287000/4588595) loss:2.988 lr:0.0000100 epoch_Time:27216.0min: [2024-01-03 23:55:36,679][model8_pretrain.py][INFO] Epoch:[0/2](287000/4588595) loss:2.415 lr:0.0000100 epoch_Time:27216.0min: [2024-01-03 23:55:36,679][model8_pretrain.py][INFO] Epoch:[0/2](287000/4588595) loss:3.192 lr:0.0000100 epoch_Time:27216.0min: [2024-01-03 23:55:36,679][model8_pretrain.py][INFO] Epoch:[0/2](287000/4588595) loss:3.037 lr:0.0000100 epoch_Time:27216.0min: [2024-01-03 23:55:36,679][model8_pretrain.py][INFO] Epoch:[0/2](287000/4588595) loss:3.115 lr:0.0000100 epoch_Time:27216.0min: [2024-01-03 23:55:36,679][model8_pretrain.py][INFO] Epoch:[0/2](287000/4588595) loss:2.689 lr:0.0000100 epoch_Time:27216.0min: [2024-01-03 23:56:13,624][model8_pretrain.py][INFO] Epoch:[0/2](287100/4588595) loss:2.723 lr:0.0000100 epoch_Time:27215.0min: [2024-01-03 23:56:13,624][model8_pretrain.py][INFO] Epoch:[0/2](287100/4588595) loss:2.851 lr:0.0000100 epoch_Time:27215.0min: [2024-01-03 23:56:13,624][model8_pretrain.py][INFO] Epoch:[0/2](287100/4588595) loss:2.780 lr:0.0000100 epoch_Time:27215.0min: [2024-01-03 23:56:13,624][model8_pretrain.py][INFO] Epoch:[0/2](287100/4588595) loss:3.457 lr:0.0000100 epoch_Time:27215.0min: [2024-01-03 23:56:13,624][model8_pretrain.py][INFO] Epoch:[0/2](287100/4588595) loss:3.164 lr:0.0000100 epoch_Time:27215.0min: [2024-01-03 23:56:13,624][model8_pretrain.py][INFO] Epoch:[0/2](287100/4588595) loss:3.573 lr:0.0000100 epoch_Time:27215.0min: [2024-01-03 23:56:13,624][model8_pretrain.py][INFO] Epoch:[0/2](287100/4588595) loss:2.965 lr:0.0000100 epoch_Time:27215.0min: [2024-01-03 23:56:13,624][model8_pretrain.py][INFO] Epoch:[0/2](287100/4588595) loss:2.756 lr:0.0000100 epoch_Time:27215.0min: [2024-01-03 
23:56:50,566][model8_pretrain.py][INFO] Epoch:[0/2](287200/4588595) loss:2.312 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:56:50,566][model8_pretrain.py][INFO] Epoch:[0/2](287200/4588595) loss:2.233 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:56:50,566][model8_pretrain.py][INFO] Epoch:[0/2](287200/4588595) loss:3.187 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:56:50,566][model8_pretrain.py][INFO] Epoch:[0/2](287200/4588595) loss:3.048 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:56:50,566][model8_pretrain.py][INFO] Epoch:[0/2](287200/4588595) loss:2.694 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:56:50,566][model8_pretrain.py][INFO] Epoch:[0/2](287200/4588595) loss:2.919 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:56:50,566][model8_pretrain.py][INFO] Epoch:[0/2](287200/4588595) loss:2.567 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:56:50,566][model8_pretrain.py][INFO] Epoch:[0/2](287200/4588595) loss:2.818 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:57:27,510][model8_pretrain.py][INFO] Epoch:[0/2](287300/4588595) loss:2.794 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:57:27,511][model8_pretrain.py][INFO] Epoch:[0/2](287300/4588595) loss:2.098 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:57:27,511][model8_pretrain.py][INFO] Epoch:[0/2](287300/4588595) loss:3.027 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:57:27,511][model8_pretrain.py][INFO] Epoch:[0/2](287300/4588595) loss:2.536 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:57:27,511][model8_pretrain.py][INFO] Epoch:[0/2](287300/4588595) loss:2.876 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:57:27,511][model8_pretrain.py][INFO] Epoch:[0/2](287300/4588595) loss:3.276 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:57:27,511][model8_pretrain.py][INFO] Epoch:[0/2](287300/4588595) loss:3.116 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:57:27,511][model8_pretrain.py][INFO] Epoch:[0/2](287300/4588595) loss:3.150 lr:0.0000100 
epoch_Time:27213.0min: [2024-01-03 23:58:13,213][model8_pretrain.py][INFO] Epoch:[0/2](287400/4588595) loss:3.102 lr:0.0000100 epoch_Time:27214.0min: [2024-01-03 23:58:13,213][model8_pretrain.py][INFO] Epoch:[0/2](287400/4588595) loss:3.042 lr:0.0000100 epoch_Time:27214.0min: [2024-01-03 23:58:13,213][model8_pretrain.py][INFO] Epoch:[0/2](287400/4588595) loss:3.452 lr:0.0000100 epoch_Time:27214.0min: [2024-01-03 23:58:13,213][model8_pretrain.py][INFO] Epoch:[0/2](287400/4588595) loss:2.774 lr:0.0000100 epoch_Time:27214.0min: [2024-01-03 23:58:13,213][model8_pretrain.py][INFO] Epoch:[0/2](287400/4588595) loss:2.796 lr:0.0000100 epoch_Time:27214.0min: [2024-01-03 23:58:13,213][model8_pretrain.py][INFO] Epoch:[0/2](287400/4588595) loss:2.176 lr:0.0000100 epoch_Time:27214.0min: [2024-01-03 23:58:13,213][model8_pretrain.py][INFO] Epoch:[0/2](287400/4588595) loss:2.278 lr:0.0000100 epoch_Time:27214.0min: [2024-01-03 23:58:14,665][model8_pretrain.py][INFO] Epoch:[0/2](287400/4588595) loss:2.676 lr:0.0000100 epoch_Time:27215.0min: [2024-01-03 23:58:51,582][model8_pretrain.py][INFO] Epoch:[0/2](287500/4588595) loss:2.804 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:58:51,582][model8_pretrain.py][INFO] Epoch:[0/2](287500/4588595) loss:3.107 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:58:51,582][model8_pretrain.py][INFO] Epoch:[0/2](287500/4588595) loss:2.930 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:58:51,582][model8_pretrain.py][INFO] Epoch:[0/2](287500/4588595) loss:2.486 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:58:51,582][model8_pretrain.py][INFO] Epoch:[0/2](287500/4588595) loss:2.413 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:58:51,582][model8_pretrain.py][INFO] Epoch:[0/2](287500/4588595) loss:3.301 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:58:51,582][model8_pretrain.py][INFO] Epoch:[0/2](287500/4588595) loss:3.352 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:58:51,582][model8_pretrain.py][INFO] 
Epoch:[0/2](287500/4588595) loss:2.965 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:59:28,518][model8_pretrain.py][INFO] Epoch:[0/2](287600/4588595) loss:2.853 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:59:28,518][model8_pretrain.py][INFO] Epoch:[0/2](287600/4588595) loss:2.600 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:59:28,518][model8_pretrain.py][INFO] Epoch:[0/2](287600/4588595) loss:3.186 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:59:28,518][model8_pretrain.py][INFO] Epoch:[0/2](287600/4588595) loss:2.328 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:59:28,518][model8_pretrain.py][INFO] Epoch:[0/2](287600/4588595) loss:2.480 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:59:28,518][model8_pretrain.py][INFO] Epoch:[0/2](287600/4588595) loss:2.531 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:59:28,518][model8_pretrain.py][INFO] Epoch:[0/2](287600/4588595) loss:2.554 lr:0.0000100 epoch_Time:27213.0min: [2024-01-03 23:59:28,518][model8_pretrain.py][INFO] Epoch:[0/2](287600/4588595) loss:2.695 lr:0.0000100 epoch_Time:27213.0min: [2024-01-04 00:00:05,454][model8_pretrain.py][INFO] Epoch:[0/2](287700/4588595) loss:2.853 lr:0.0000100 epoch_Time:27212.0min: [2024-01-04 00:00:05,454][model8_pretrain.py][INFO] Epoch:[0/2](287700/4588595) loss:2.162 lr:0.0000100 epoch_Time:27212.0min: [2024-01-04 00:00:05,454][model8_pretrain.py][INFO] Epoch:[0/2](287700/4588595) loss:2.849 lr:0.0000100 epoch_Time:27212.0min: [2024-01-04 00:00:05,454][model8_pretrain.py][INFO] Epoch:[0/2](287700/4588595) loss:2.809 lr:0.0000100 epoch_Time:27212.0min: [2024-01-04 00:00:05,454][model8_pretrain.py][INFO] Epoch:[0/2](287700/4588595) loss:3.028 lr:0.0000100 epoch_Time:27212.0min: [2024-01-04 00:00:05,454][model8_pretrain.py][INFO] Epoch:[0/2](287700/4588595) loss:2.930 lr:0.0000100 epoch_Time:27212.0min: [2024-01-04 00:00:05,454][model8_pretrain.py][INFO] Epoch:[0/2](287700/4588595) loss:2.794 lr:0.0000100 epoch_Time:27212.0min: [2024-01-04 
00:00:05,454][model8_pretrain.py][INFO] Epoch:[0/2](287700/4588595) loss:2.810 lr:0.0000100 epoch_Time:27212.0min: [2024-01-04 00:00:42,400][model8_pretrain.py][INFO] Epoch:[0/2](287800/4588595) loss:2.620 lr:0.0000100 epoch_Time:27211.0min: [2024-01-04 00:00:42,400][model8_pretrain.py][INFO] Epoch:[0/2](287800/4588595) loss:2.927 lr:0.0000100 epoch_Time:27211.0min: [2024-01-04 00:00:42,400][model8_pretrain.py][INFO] Epoch:[0/2](287800/4588595) loss:3.323 lr:0.0000100 epoch_Time:27211.0min: [2024-01-04 00:00:42,400][model8_pretrain.py][INFO] Epoch:[0/2](287800/4588595) loss:2.864 lr:0.0000100 epoch_Time:27211.0min: [2024-01-04 00:00:42,400][model8_pretrain.py][INFO] Epoch:[0/2](287800/4588595) loss:3.123 lr:0.0000100 epoch_Time:27211.0min: [2024-01-04 00:00:42,400][model8_pretrain.py][INFO] Epoch:[0/2](287800/4588595) loss:3.153 lr:0.0000100 epoch_Time:27211.0min: [2024-01-04 00:00:42,400][model8_pretrain.py][INFO] Epoch:[0/2](287800/4588595) loss:3.365 lr:0.0000100 epoch_Time:27211.0min: [2024-01-04 00:00:42,400][model8_pretrain.py][INFO] Epoch:[0/2](287800/4588595) loss:2.737 lr:0.0000100 epoch_Time:27211.0min: [2024-01-04 00:01:19,343][model8_pretrain.py][INFO] Epoch:[0/2](287900/4588595) loss:3.189 lr:0.0000100 epoch_Time:27210.0min: [2024-01-04 00:01:19,343][model8_pretrain.py][INFO] Epoch:[0/2](287900/4588595) loss:2.679 lr:0.0000100 epoch_Time:27210.0min: [2024-01-04 00:01:19,343][model8_pretrain.py][INFO] Epoch:[0/2](287900/4588595) loss:2.824 lr:0.0000100 epoch_Time:27210.0min: [2024-01-04 00:01:19,343][model8_pretrain.py][INFO] Epoch:[0/2](287900/4588595) loss:3.147 lr:0.0000100 epoch_Time:27210.0min: [2024-01-04 00:01:19,343][model8_pretrain.py][INFO] Epoch:[0/2](287900/4588595) loss:2.835 lr:0.0000100 epoch_Time:27210.0min: [2024-01-04 00:01:19,343][model8_pretrain.py][INFO] Epoch:[0/2](287900/4588595) loss:3.120 lr:0.0000100 epoch_Time:27210.0min: [2024-01-04 00:01:19,343][model8_pretrain.py][INFO] Epoch:[0/2](287900/4588595) loss:3.476 lr:0.0000100 
epoch_Time:27210.0min: [2024-01-04 00:01:19,343][model8_pretrain.py][INFO] Epoch:[0/2](287900/4588595) loss:2.779 lr:0.0000100 epoch_Time:27210.0min: [2024-01-04 00:01:56,287][model8_pretrain.py][INFO] Epoch:[0/2](288000/4588595) loss:2.847 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:01:56,287][model8_pretrain.py][INFO] Epoch:[0/2](288000/4588595) loss:3.018 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:01:56,287][model8_pretrain.py][INFO] Epoch:[0/2](288000/4588595) loss:2.898 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:01:56,287][model8_pretrain.py][INFO] Epoch:[0/2](288000/4588595) loss:2.863 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:01:56,287][model8_pretrain.py][INFO] Epoch:[0/2](288000/4588595) loss:2.913 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:01:56,287][model8_pretrain.py][INFO] Epoch:[0/2](288000/4588595) loss:3.195 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:01:56,287][model8_pretrain.py][INFO] Epoch:[0/2](288000/4588595) loss:3.258 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:01:56,287][model8_pretrain.py][INFO] Epoch:[0/2](288000/4588595) loss:3.203 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:02:33,233][model8_pretrain.py][INFO] Epoch:[0/2](288100/4588595) loss:2.900 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:02:33,233][model8_pretrain.py][INFO] Epoch:[0/2](288100/4588595) loss:2.718 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:02:33,233][model8_pretrain.py][INFO] Epoch:[0/2](288100/4588595) loss:2.661 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:02:33,233][model8_pretrain.py][INFO] Epoch:[0/2](288100/4588595) loss:3.472 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:02:33,233][model8_pretrain.py][INFO] Epoch:[0/2](288100/4588595) loss:3.061 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:02:33,233][model8_pretrain.py][INFO] Epoch:[0/2](288100/4588595) loss:2.870 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:02:33,233][model8_pretrain.py][INFO] 
Epoch:[0/2](288100/4588595) loss:3.317 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:02:33,233][model8_pretrain.py][INFO] Epoch:[0/2](288100/4588595) loss:2.920 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:03:18,983][model8_pretrain.py][INFO] Epoch:[0/2](288200/4588595) loss:2.708 lr:0.0000100 epoch_Time:27210.0min: [2024-01-04 00:03:18,983][model8_pretrain.py][INFO] Epoch:[0/2](288200/4588595) loss:2.716 lr:0.0000100 epoch_Time:27210.0min: [2024-01-04 00:03:18,983][model8_pretrain.py][INFO] Epoch:[0/2](288200/4588595) loss:2.851 lr:0.0000100 epoch_Time:27210.0min: [2024-01-04 00:03:18,983][model8_pretrain.py][INFO] Epoch:[0/2](288200/4588595) loss:2.639 lr:0.0000100 epoch_Time:27210.0min: [2024-01-04 00:03:18,983][model8_pretrain.py][INFO] Epoch:[0/2](288200/4588595) loss:3.298 lr:0.0000100 epoch_Time:27210.0min: [2024-01-04 00:03:18,984][model8_pretrain.py][INFO] Epoch:[0/2](288200/4588595) loss:2.880 lr:0.0000100 epoch_Time:27210.0min: [2024-01-04 00:03:18,988][model8_pretrain.py][INFO] Epoch:[0/2](288200/4588595) loss:3.202 lr:0.0000100 epoch_Time:27210.0min: [2024-01-04 00:03:18,988][model8_pretrain.py][INFO] Epoch:[0/2](288200/4588595) loss:2.725 lr:0.0000100 epoch_Time:27210.0min: [2024-01-04 00:03:57,390][model8_pretrain.py][INFO] Epoch:[0/2](288300/4588595) loss:2.646 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:03:57,390][model8_pretrain.py][INFO] Epoch:[0/2](288300/4588595) loss:2.914 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:03:57,390][model8_pretrain.py][INFO] Epoch:[0/2](288300/4588595) loss:2.988 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:03:57,390][model8_pretrain.py][INFO] Epoch:[0/2](288300/4588595) loss:2.673 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:03:57,391][model8_pretrain.py][INFO] Epoch:[0/2](288300/4588595) loss:2.867 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:03:57,391][model8_pretrain.py][INFO] Epoch:[0/2](288300/4588595) loss:2.952 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 
00:03:57,391][model8_pretrain.py][INFO] Epoch:[0/2](288300/4588595) loss:3.074 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:03:57,391][model8_pretrain.py][INFO] Epoch:[0/2](288300/4588595) loss:3.091 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:04:34,328][model8_pretrain.py][INFO] Epoch:[0/2](288400/4588595) loss:2.346 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:04:34,328][model8_pretrain.py][INFO] Epoch:[0/2](288400/4588595) loss:2.719 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:04:34,328][model8_pretrain.py][INFO] Epoch:[0/2](288400/4588595) loss:2.658 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:04:34,328][model8_pretrain.py][INFO] Epoch:[0/2](288400/4588595) loss:2.999 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:04:34,328][model8_pretrain.py][INFO] Epoch:[0/2](288400/4588595) loss:3.553 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:04:34,328][model8_pretrain.py][INFO] Epoch:[0/2](288400/4588595) loss:2.833 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:04:34,328][model8_pretrain.py][INFO] Epoch:[0/2](288400/4588595) loss:2.708 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:04:34,328][model8_pretrain.py][INFO] Epoch:[0/2](288400/4588595) loss:2.991 lr:0.0000100 epoch_Time:27209.0min: [2024-01-04 00:05:11,266][model8_pretrain.py][INFO] Epoch:[0/2](288500/4588595) loss:2.730 lr:0.0000100 epoch_Time:27207.0min: [2024-01-04 00:05:11,266][model8_pretrain.py][INFO] Epoch:[0/2](288500/4588595) loss:2.989 lr:0.0000100 epoch_Time:27207.0min: [2024-01-04 00:05:11,266][model8_pretrain.py][INFO] Epoch:[0/2](288500/4588595) loss:2.865 lr:0.0000100 epoch_Time:27207.0min: [2024-01-04 00:05:11,266][model8_pretrain.py][INFO] Epoch:[0/2](288500/4588595) loss:2.947 lr:0.0000100 epoch_Time:27207.0min: [2024-01-04 00:05:11,266][model8_pretrain.py][INFO] Epoch:[0/2](288500/4588595) loss:2.955 lr:0.0000100 epoch_Time:27207.0min: [2024-01-04 00:05:11,267][model8_pretrain.py][INFO] Epoch:[0/2](288500/4588595) loss:2.736 lr:0.0000100 
epoch_Time:27207.0min: [2024-01-04 00:05:11,267][model8_pretrain.py][INFO] Epoch:[0/2](288500/4588595) loss:2.803 lr:0.0000100 epoch_Time:27207.0min: [2024-01-04 00:05:11,267][model8_pretrain.py][INFO] Epoch:[0/2](288500/4588595) loss:2.780 lr:0.0000100 epoch_Time:27207.0min: [2024-01-04 00:05:48,203][model8_pretrain.py][INFO] Epoch:[0/2](288600/4588595) loss:2.529 lr:0.0000100 epoch_Time:27206.0min: [2024-01-04 00:05:48,203][model8_pretrain.py][INFO] Epoch:[0/2](288600/4588595) loss:2.807 lr:0.0000100 epoch_Time:27206.0min: [2024-01-04 00:05:48,203][model8_pretrain.py][INFO] Epoch:[0/2](288600/4588595) loss:3.027 lr:0.0000100 epoch_Time:27206.0min: [2024-01-04 00:05:48,203][model8_pretrain.py][INFO] Epoch:[0/2](288600/4588595) loss:3.271 lr:0.0000100 epoch_Time:27206.0min: [2024-01-04 00:05:48,203][model8_pretrain.py][INFO] Epoch:[0/2](288600/4588595) loss:3.142 lr:0.0000100 epoch_Time:27206.0min: [2024-01-04 00:05:48,203][model8_pretrain.py][INFO] Epoch:[0/2](288600/4588595) loss:3.026 lr:0.0000100 epoch_Time:27206.0min: [2024-01-04 00:05:48,203][model8_pretrain.py][INFO] Epoch:[0/2](288600/4588595) loss:3.001 lr:0.0000100 epoch_Time:27206.0min: [2024-01-04 00:05:48,203][model8_pretrain.py][INFO] Epoch:[0/2](288600/4588595) loss:2.906 lr:0.0000100 epoch_Time:27206.0min: [2024-01-04 00:06:25,147][model8_pretrain.py][INFO] Epoch:[0/2](288700/4588595) loss:2.997 lr:0.0000100 epoch_Time:27206.0min: [2024-01-04 00:06:25,147][model8_pretrain.py][INFO] Epoch:[0/2](288700/4588595) loss:3.008 lr:0.0000100 epoch_Time:27206.0min: [2024-01-04 00:06:25,147][model8_pretrain.py][INFO] Epoch:[0/2](288700/4588595) loss:2.885 lr:0.0000100 epoch_Time:27206.0min: [2024-01-04 00:06:25,147][model8_pretrain.py][INFO] Epoch:[0/2](288700/4588595) loss:2.993 lr:0.0000100 epoch_Time:27206.0min: [2024-01-04 00:06:25,147][model8_pretrain.py][INFO] Epoch:[0/2](288700/4588595) loss:3.196 lr:0.0000100 epoch_Time:27206.0min: [2024-01-04 00:06:25,147][model8_pretrain.py][INFO] 
Epoch:[0/2](288700/4588595) loss:2.932 lr:0.0000100 epoch_Time:27206.0min: [2024-01-04 00:06:25,147][model8_pretrain.py][INFO] Epoch:[0/2](288700/4588595) loss:3.327 lr:0.0000100 epoch_Time:27206.0min: [2024-01-04 00:06:25,148][model8_pretrain.py][INFO] Epoch:[0/2](288700/4588595) loss:3.293 lr:0.0000100 epoch_Time:27206.0min: [2024-01-04 00:07:02,086][model8_pretrain.py][INFO] Epoch:[0/2](288800/4588595) loss:3.122 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:07:02,086][model8_pretrain.py][INFO] Epoch:[0/2](288800/4588595) loss:3.175 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:07:02,086][model8_pretrain.py][INFO] Epoch:[0/2](288800/4588595) loss:2.272 lr:0.0000100 epoch_Time:27205.0min: [2024-01-04 00:07:02,086][model8_pretrain.py][INFO] Epoch:[0/2](288800/4588595) loss:3.027 lr:0.0000100 epoch_Time:27205.0min: [2024-01-04 00:07:02,086][model8_pretrain.py][INFO] Epoch:[0/2](288800/4588595) loss:2.721 lr:0.0000100 epoch_Time:27205.0min: [2024-01-04 00:07:02,086][model8_pretrain.py][INFO] Epoch:[0/2](288800/4588595) loss:3.323 lr:0.0000100 epoch_Time:27205.0min: [2024-01-04 00:07:02,086][model8_pretrain.py][INFO] Epoch:[0/2](288800/4588595) loss:2.901 lr:0.0000100 epoch_Time:27205.0min: [2024-01-04 00:07:02,086][model8_pretrain.py][INFO] Epoch:[0/2](288800/4588595) loss:3.051 lr:0.0000100 epoch_Time:27205.0min: [2024-01-04 00:07:39,031][model8_pretrain.py][INFO] Epoch:[0/2](288900/4588595) loss:2.824 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:07:39,031][model8_pretrain.py][INFO] Epoch:[0/2](288900/4588595) loss:3.184 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:07:39,031][model8_pretrain.py][INFO] Epoch:[0/2](288900/4588595) loss:2.962 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:07:39,031][model8_pretrain.py][INFO] Epoch:[0/2](288900/4588595) loss:2.528 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:07:39,031][model8_pretrain.py][INFO] Epoch:[0/2](288900/4588595) loss:2.798 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 
00:07:39,031][model8_pretrain.py][INFO] Epoch:[0/2](288900/4588595) loss:2.982 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:07:39,031][model8_pretrain.py][INFO] Epoch:[0/2](288900/4588595) loss:3.441 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:07:39,031][model8_pretrain.py][INFO] Epoch:[0/2](288900/4588595) loss:3.132 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:08:21,203][model8_pretrain.py][INFO] Epoch:[0/2](289000/4588595) loss:2.973 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:08:21,203][model8_pretrain.py][INFO] Epoch:[0/2](289000/4588595) loss:2.505 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:08:21,203][model8_pretrain.py][INFO] Epoch:[0/2](289000/4588595) loss:3.019 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:08:21,203][model8_pretrain.py][INFO] Epoch:[0/2](289000/4588595) loss:3.450 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:08:21,203][model8_pretrain.py][INFO] Epoch:[0/2](289000/4588595) loss:2.710 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:08:21,203][model8_pretrain.py][INFO] Epoch:[0/2](289000/4588595) loss:2.794 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:08:21,204][model8_pretrain.py][INFO] Epoch:[0/2](289000/4588595) loss:3.128 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:08:21,204][model8_pretrain.py][INFO] Epoch:[0/2](289000/4588595) loss:3.006 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:09:03,082][model8_pretrain.py][INFO] Epoch:[0/2](289100/4588595) loss:3.185 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:09:03,082][model8_pretrain.py][INFO] Epoch:[0/2](289100/4588595) loss:3.070 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:09:03,082][model8_pretrain.py][INFO] Epoch:[0/2](289100/4588595) loss:2.567 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:09:03,082][model8_pretrain.py][INFO] Epoch:[0/2](289100/4588595) loss:2.519 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:09:03,082][model8_pretrain.py][INFO] Epoch:[0/2](289100/4588595) loss:3.035 lr:0.0000100 
epoch_Time:27204.0min: [2024-01-04 00:09:03,082][model8_pretrain.py][INFO] Epoch:[0/2](289100/4588595) loss:2.962 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:09:03,082][model8_pretrain.py][INFO] Epoch:[0/2](289100/4588595) loss:2.930 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:09:03,082][model8_pretrain.py][INFO] Epoch:[0/2](289100/4588595) loss:3.261 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:09:40,015][model8_pretrain.py][INFO] Epoch:[0/2](289200/4588595) loss:2.877 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:09:40,015][model8_pretrain.py][INFO] Epoch:[0/2](289200/4588595) loss:3.183 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:09:40,015][model8_pretrain.py][INFO] Epoch:[0/2](289200/4588595) loss:3.248 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:09:40,015][model8_pretrain.py][INFO] Epoch:[0/2](289200/4588595) loss:2.877 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:09:40,015][model8_pretrain.py][INFO] Epoch:[0/2](289200/4588595) loss:3.277 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:09:40,015][model8_pretrain.py][INFO] Epoch:[0/2](289200/4588595) loss:2.872 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:09:40,016][model8_pretrain.py][INFO] Epoch:[0/2](289200/4588595) loss:3.032 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:09:40,016][model8_pretrain.py][INFO] Epoch:[0/2](289200/4588595) loss:2.699 lr:0.0000100 epoch_Time:27204.0min: [2024-01-04 00:10:16,961][model8_pretrain.py][INFO] Epoch:[0/2](289300/4588595) loss:2.775 lr:0.0000100 epoch_Time:27203.0min: [2024-01-04 00:10:16,961][model8_pretrain.py][INFO] Epoch:[0/2](289300/4588595) loss:2.382 lr:0.0000100 epoch_Time:27203.0min: [2024-01-04 00:10:16,961][model8_pretrain.py][INFO] Epoch:[0/2](289300/4588595) loss:2.840 lr:0.0000100 epoch_Time:27203.0min: [2024-01-04 00:10:16,961][model8_pretrain.py][INFO] Epoch:[0/2](289300/4588595) loss:2.692 lr:0.0000100 epoch_Time:27203.0min: [2024-01-04 00:10:16,961][model8_pretrain.py][INFO] 
Epoch:[0/2](289300/4588595) loss:2.458 lr:0.0000100 epoch_Time:27203.0min: [2024-01-04 00:10:16,961][model8_pretrain.py][INFO] Epoch:[0/2](289300/4588595) loss:3.006 lr:0.0000100 epoch_Time:27203.0min: [2024-01-04 00:10:16,961][model8_pretrain.py][INFO] Epoch:[0/2](289300/4588595) loss:2.724 lr:0.0000100 epoch_Time:27203.0min: [2024-01-04 00:10:16,961][model8_pretrain.py][INFO] Epoch:[0/2](289300/4588595) loss:2.981 lr:0.0000100 epoch_Time:27203.0min: [2024-01-04 00:10:53,887][model8_pretrain.py][INFO] Epoch:[0/2](289400/4588595) loss:3.139 lr:0.0000100 epoch_Time:27202.0min: [2024-01-04 00:10:53,887][model8_pretrain.py][INFO] Epoch:[0/2](289400/4588595) loss:3.043 lr:0.0000100 epoch_Time:27202.0min: [2024-01-04 00:10:53,887][model8_pretrain.py][INFO] Epoch:[0/2](289400/4588595) loss:2.807 lr:0.0000100 epoch_Time:27202.0min: [2024-01-04 00:10:53,887][model8_pretrain.py][INFO] Epoch:[0/2](289400/4588595) loss:2.690 lr:0.0000100 epoch_Time:27202.0min: [2024-01-04 00:10:53,887][model8_pretrain.py][INFO] Epoch:[0/2](289400/4588595) loss:3.217 lr:0.0000100 epoch_Time:27202.0min: [2024-01-04 00:10:53,887][model8_pretrain.py][INFO] Epoch:[0/2](289400/4588595) loss:3.102 lr:0.0000100 epoch_Time:27202.0min: [2024-01-04 00:10:53,888][model8_pretrain.py][INFO] Epoch:[0/2](289400/4588595) loss:2.730 lr:0.0000100 epoch_Time:27202.0min: [2024-01-04 00:10:53,888][model8_pretrain.py][INFO] Epoch:[0/2](289400/4588595) loss:3.277 lr:0.0000100 epoch_Time:27202.0min: [2024-01-04 00:11:30,827][model8_pretrain.py][INFO] Epoch:[0/2](289500/4588595) loss:3.152 lr:0.0000100 epoch_Time:27201.0min: [2024-01-04 00:11:30,827][model8_pretrain.py][INFO] Epoch:[0/2](289500/4588595) loss:2.730 lr:0.0000100 epoch_Time:27201.0min: [2024-01-04 00:11:30,827][model8_pretrain.py][INFO] Epoch:[0/2](289500/4588595) loss:2.900 lr:0.0000100 epoch_Time:27201.0min: [2024-01-04 00:11:30,827][model8_pretrain.py][INFO] Epoch:[0/2](289500/4588595) loss:3.166 lr:0.0000100 epoch_Time:27201.0min: [2024-01-04 
00:11:30,827][model8_pretrain.py][INFO] Epoch:[0/2](289500/4588595) loss:3.283 lr:0.0000100 epoch_Time:27201.0min: [2024-01-04 00:11:30,827][model8_pretrain.py][INFO] Epoch:[0/2](289500/4588595) loss:3.314 lr:0.0000100 epoch_Time:27201.0min: [2024-01-04 00:11:30,827][model8_pretrain.py][INFO] Epoch:[0/2](289500/4588595) loss:3.103 lr:0.0000100 epoch_Time:27201.0min: [2024-01-04 00:11:30,827][model8_pretrain.py][INFO] Epoch:[0/2](289500/4588595) loss:2.604 lr:0.0000100 epoch_Time:27201.0min: [2024-01-04 00:12:07,768][model8_pretrain.py][INFO] Epoch:[0/2](289600/4588595) loss:3.290 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:12:07,768][model8_pretrain.py][INFO] Epoch:[0/2](289600/4588595) loss:3.049 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:12:07,768][model8_pretrain.py][INFO] Epoch:[0/2](289600/4588595) loss:3.020 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:12:07,768][model8_pretrain.py][INFO] Epoch:[0/2](289600/4588595) loss:2.658 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:12:07,768][model8_pretrain.py][INFO] Epoch:[0/2](289600/4588595) loss:3.158 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:12:07,768][model8_pretrain.py][INFO] Epoch:[0/2](289600/4588595) loss:3.012 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:12:07,768][model8_pretrain.py][INFO] Epoch:[0/2](289600/4588595) loss:2.809 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:12:07,768][model8_pretrain.py][INFO] Epoch:[0/2](289600/4588595) loss:3.102 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:12:44,704][model8_pretrain.py][INFO] Epoch:[0/2](289700/4588595) loss:2.869 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:12:44,704][model8_pretrain.py][INFO] Epoch:[0/2](289700/4588595) loss:2.901 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:12:44,704][model8_pretrain.py][INFO] Epoch:[0/2](289700/4588595) loss:2.630 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:12:44,704][model8_pretrain.py][INFO] Epoch:[0/2](289700/4588595) loss:2.849 lr:0.0000100 
epoch_Time:27200.0min: [2024-01-04 00:12:44,705][model8_pretrain.py][INFO] Epoch:[0/2](289700/4588595) loss:2.428 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:12:44,705][model8_pretrain.py][INFO] Epoch:[0/2](289700/4588595) loss:2.059 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:12:44,705][model8_pretrain.py][INFO] Epoch:[0/2](289700/4588595) loss:2.966 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:12:44,705][model8_pretrain.py][INFO] Epoch:[0/2](289700/4588595) loss:2.912 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:13:26,923][model8_pretrain.py][INFO] Epoch:[0/2](289800/4588595) loss:2.551 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:13:26,923][model8_pretrain.py][INFO] Epoch:[0/2](289800/4588595) loss:2.940 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:13:26,923][model8_pretrain.py][INFO] Epoch:[0/2](289800/4588595) loss:3.115 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:13:26,923][model8_pretrain.py][INFO] Epoch:[0/2](289800/4588595) loss:3.000 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:13:26,923][model8_pretrain.py][INFO] Epoch:[0/2](289800/4588595) loss:2.430 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:13:26,923][model8_pretrain.py][INFO] Epoch:[0/2](289800/4588595) loss:2.489 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:13:26,923][model8_pretrain.py][INFO] Epoch:[0/2](289800/4588595) loss:3.300 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:13:26,923][model8_pretrain.py][INFO] Epoch:[0/2](289800/4588595) loss:3.452 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:14:08,920][model8_pretrain.py][INFO] Epoch:[0/2](289900/4588595) loss:2.683 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:14:08,920][model8_pretrain.py][INFO] Epoch:[0/2](289900/4588595) loss:2.405 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:14:08,920][model8_pretrain.py][INFO] Epoch:[0/2](289900/4588595) loss:2.693 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:14:08,920][model8_pretrain.py][INFO] 
Epoch:[0/2](289900/4588595) loss:2.907 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:14:08,920][model8_pretrain.py][INFO] Epoch:[0/2](289900/4588595) loss:2.819 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:14:08,920][model8_pretrain.py][INFO] Epoch:[0/2](289900/4588595) loss:2.949 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:14:08,920][model8_pretrain.py][INFO] Epoch:[0/2](289900/4588595) loss:3.333 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:14:08,920][model8_pretrain.py][INFO] Epoch:[0/2](289900/4588595) loss:3.141 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:14:45,852][model8_pretrain.py][INFO] Epoch:[0/2](290000/4588595) loss:3.089 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:14:45,852][model8_pretrain.py][INFO] Epoch:[0/2](290000/4588595) loss:3.062 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:14:45,852][model8_pretrain.py][INFO] Epoch:[0/2](290000/4588595) loss:3.558 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:14:45,852][model8_pretrain.py][INFO] Epoch:[0/2](290000/4588595) loss:2.918 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:14:45,852][model8_pretrain.py][INFO] Epoch:[0/2](290000/4588595) loss:3.053 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:14:45,852][model8_pretrain.py][INFO] Epoch:[0/2](290000/4588595) loss:2.713 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:14:45,852][model8_pretrain.py][INFO] Epoch:[0/2](290000/4588595) loss:3.098 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:14:45,853][model8_pretrain.py][INFO] Epoch:[0/2](290000/4588595) loss:2.656 lr:0.0000100 epoch_Time:27200.0min: [2024-01-04 00:15:22,796][model8_pretrain.py][INFO] Epoch:[0/2](290100/4588595) loss:3.273 lr:0.0000100 epoch_Time:27198.0min: [2024-01-04 00:15:22,796][model8_pretrain.py][INFO] Epoch:[0/2](290100/4588595) loss:3.178 lr:0.0000100 epoch_Time:27198.0min: [2024-01-04 00:15:22,796][model8_pretrain.py][INFO] Epoch:[0/2](290100/4588595) loss:2.887 lr:0.0000100 epoch_Time:27198.0min: [2024-01-04 
00:15:22,796][model8_pretrain.py][INFO] Epoch:[0/2](290100/4588595) loss:2.974 lr:0.0000100 epoch_Time:27198.0min: [2024-01-04 00:15:22,796][model8_pretrain.py][INFO] Epoch:[0/2](290100/4588595) loss:3.566 lr:0.0000100 epoch_Time:27198.0min: [2024-01-04 00:15:22,796][model8_pretrain.py][INFO] Epoch:[0/2](290100/4588595) loss:2.678 lr:0.0000100 epoch_Time:27198.0min: [2024-01-04 00:15:22,796][model8_pretrain.py][INFO] Epoch:[0/2](290100/4588595) loss:3.194 lr:0.0000100 epoch_Time:27198.0min: [2024-01-04 00:15:22,797][model8_pretrain.py][INFO] Epoch:[0/2](290100/4588595) loss:2.822 lr:0.0000100 epoch_Time:27198.0min: [2024-01-04 00:15:59,734][model8_pretrain.py][INFO] Epoch:[0/2](290200/4588595) loss:2.417 lr:0.0000100 epoch_Time:27197.0min: [2024-01-04 00:15:59,734][model8_pretrain.py][INFO] Epoch:[0/2](290200/4588595) loss:3.184 lr:0.0000100 epoch_Time:27197.0min: [2024-01-04 00:15:59,734][model8_pretrain.py][INFO] Epoch:[0/2](290200/4588595) loss:2.535 lr:0.0000100 epoch_Time:27197.0min: [2024-01-04 00:15:59,734][model8_pretrain.py][INFO] Epoch:[0/2](290200/4588595) loss:2.950 lr:0.0000100 epoch_Time:27197.0min: [2024-01-04 00:15:59,735][model8_pretrain.py][INFO] Epoch:[0/2](290200/4588595) loss:2.841 lr:0.0000100 epoch_Time:27197.0min: [2024-01-04 00:15:59,735][model8_pretrain.py][INFO] Epoch:[0/2](290200/4588595) loss:3.147 lr:0.0000100 epoch_Time:27197.0min: [2024-01-04 00:15:59,735][model8_pretrain.py][INFO] Epoch:[0/2](290200/4588595) loss:2.951 lr:0.0000100 epoch_Time:27197.0min: [2024-01-04 00:15:59,736][model8_pretrain.py][INFO] Epoch:[0/2](290200/4588595) loss:3.372 lr:0.0000100 epoch_Time:27197.0min: [2024-01-04 00:16:36,674][model8_pretrain.py][INFO] Epoch:[0/2](290300/4588595) loss:2.876 lr:0.0000100 epoch_Time:27197.0min: [2024-01-04 00:16:36,674][model8_pretrain.py][INFO] Epoch:[0/2](290300/4588595) loss:3.265 lr:0.0000100 epoch_Time:27197.0min: [2024-01-04 00:16:36,674][model8_pretrain.py][INFO] Epoch:[0/2](290300/4588595) loss:2.583 lr:0.0000100 
epoch_Time:27197.0min: [2024-01-04 00:16:36,674][model8_pretrain.py][INFO] Epoch:[0/2](290300/4588595) loss:2.285 lr:0.0000100 epoch_Time:27197.0min: [2024-01-04 00:16:36,674][model8_pretrain.py][INFO] Epoch:[0/2](290300/4588595) loss:2.060 lr:0.0000100 epoch_Time:27197.0min: [2024-01-04 00:16:36,674][model8_pretrain.py][INFO] Epoch:[0/2](290300/4588595) loss:3.040 lr:0.0000100 epoch_Time:27197.0min: [2024-01-04 00:16:36,674][model8_pretrain.py][INFO] Epoch:[0/2](290300/4588595) loss:2.959 lr:0.0000100 epoch_Time:27197.0min: [2024-01-04 00:16:36,674][model8_pretrain.py][INFO] Epoch:[0/2](290300/4588595) loss:2.836 lr:0.0000100 epoch_Time:27197.0min: [2024-01-04 00:17:13,614][model8_pretrain.py][INFO] Epoch:[0/2](290400/4588595) loss:3.100 lr:0.0000100 epoch_Time:27196.0min: [2024-01-04 00:17:13,614][model8_pretrain.py][INFO] Epoch:[0/2](290400/4588595) loss:2.475 lr:0.0000100 epoch_Time:27196.0min: [2024-01-04 00:17:13,614][model8_pretrain.py][INFO] Epoch:[0/2](290400/4588595) loss:2.794 lr:0.0000100 epoch_Time:27196.0min: [2024-01-04 00:17:13,614][model8_pretrain.py][INFO] Epoch:[0/2](290400/4588595) loss:2.900 lr:0.0000100 epoch_Time:27196.0min: [2024-01-04 00:17:13,614][model8_pretrain.py][INFO] Epoch:[0/2](290400/4588595) loss:3.037 lr:0.0000100 epoch_Time:27196.0min: [2024-01-04 00:17:13,614][model8_pretrain.py][INFO] Epoch:[0/2](290400/4588595) loss:2.540 lr:0.0000100 epoch_Time:27196.0min: [2024-01-04 00:17:13,614][model8_pretrain.py][INFO] Epoch:[0/2](290400/4588595) loss:3.076 lr:0.0000100 epoch_Time:27196.0min: [2024-01-04 00:17:13,614][model8_pretrain.py][INFO] Epoch:[0/2](290400/4588595) loss:3.201 lr:0.0000100 epoch_Time:27196.0min: [2024-01-04 00:17:50,558][model8_pretrain.py][INFO] Epoch:[0/2](290500/4588595) loss:2.951 lr:0.0000100 epoch_Time:27194.0min: [2024-01-04 00:17:50,558][model8_pretrain.py][INFO] Epoch:[0/2](290500/4588595) loss:2.660 lr:0.0000100 epoch_Time:27194.0min: [2024-01-04 00:17:50,558][model8_pretrain.py][INFO] 
Epoch:[0/2](290500/4588595) loss:2.992 lr:0.0000100 epoch_Time:27194.0min: [2024-01-04 00:17:50,558][model8_pretrain.py][INFO] Epoch:[0/2](290500/4588595) loss:3.023 lr:0.0000100 epoch_Time:27194.0min: [2024-01-04 00:17:50,558][model8_pretrain.py][INFO] Epoch:[0/2](290500/4588595) loss:2.879 lr:0.0000100 epoch_Time:27194.0min: [2024-01-04 00:17:50,559][model8_pretrain.py][INFO] Epoch:[0/2](290500/4588595) loss:3.570 lr:0.0000100 epoch_Time:27194.0min: [2024-01-04 00:17:50,559][model8_pretrain.py][INFO] Epoch:[0/2](290500/4588595) loss:3.236 lr:0.0000100 epoch_Time:27194.0min: [2024-01-04 00:17:50,559][model8_pretrain.py][INFO] Epoch:[0/2](290500/4588595) loss:2.960 lr:0.0000100 epoch_Time:27194.0min: [2024-01-04 00:18:32,698][model8_pretrain.py][INFO] Epoch:[0/2](290600/4588595) loss:2.892 lr:0.0000100 epoch_Time:27195.0min: [2024-01-04 00:18:32,698][model8_pretrain.py][INFO] Epoch:[0/2](290600/4588595) loss:2.449 lr:0.0000100 epoch_Time:27195.0min: [2024-01-04 00:18:32,698][model8_pretrain.py][INFO] Epoch:[0/2](290600/4588595) loss:2.342 lr:0.0000100 epoch_Time:27195.0min: [2024-01-04 00:18:32,698][model8_pretrain.py][INFO] Epoch:[0/2](290600/4588595) loss:3.171 lr:0.0000100 epoch_Time:27195.0min: [2024-01-04 00:18:32,703][model8_pretrain.py][INFO] Epoch:[0/2](290600/4588595) loss:2.940 lr:0.0000100 epoch_Time:27195.0min: [2024-01-04 00:18:32,703][model8_pretrain.py][INFO] Epoch:[0/2](290600/4588595) loss:3.095 lr:0.0000100 epoch_Time:27195.0min: [2024-01-04 00:18:32,703][model8_pretrain.py][INFO] Epoch:[0/2](290600/4588595) loss:3.016 lr:0.0000100 epoch_Time:27195.0min: [2024-01-04 00:18:32,703][model8_pretrain.py][INFO] Epoch:[0/2](290600/4588595) loss:2.649 lr:0.0000100 epoch_Time:27195.0min: [2024-01-04 00:19:14,922][model8_pretrain.py][INFO] Epoch:[0/2](290700/4588595) loss:2.574 lr:0.0000100 epoch_Time:27196.0min: [2024-01-04 00:19:14,922][model8_pretrain.py][INFO] Epoch:[0/2](290700/4588595) loss:2.873 lr:0.0000100 epoch_Time:27196.0min: [2024-01-04 
00:19:14,922][model8_pretrain.py][INFO] Epoch:[0/2](290700/4588595) loss:2.524 lr:0.0000100 epoch_Time:27196.0min: [2024-01-04 00:19:14,923][model8_pretrain.py][INFO] Epoch:[0/2](290700/4588595) loss:2.223 lr:0.0000100 epoch_Time:27196.0min: [2024-01-04 00:19:14,922][model8_pretrain.py][INFO] Epoch:[0/2](290700/4588595) loss:2.945 lr:0.0000100 epoch_Time:27196.0min: [2024-01-04 00:19:14,923][model8_pretrain.py][INFO] Epoch:[0/2](290700/4588595) loss:2.601 lr:0.0000100 epoch_Time:27196.0min: [2024-01-04 00:19:14,923][model8_pretrain.py][INFO] Epoch:[0/2](290700/4588595) loss:2.888 lr:0.0000100 epoch_Time:27196.0min: [2024-01-04 00:19:14,923][model8_pretrain.py][INFO] Epoch:[0/2](290700/4588595) loss:2.861 lr:0.0000100 epoch_Time:27196.0min: [2024-01-04 00:19:51,865][model8_pretrain.py][INFO] Epoch:[0/2](290800/4588595) loss:3.028 lr:0.0000100 epoch_Time:27194.0min: [2024-01-04 00:19:51,865][model8_pretrain.py][INFO] Epoch:[0/2](290800/4588595) loss:2.802 lr:0.0000100 epoch_Time:27194.0min: [2024-01-04 00:19:51,865][model8_pretrain.py][INFO] Epoch:[0/2](290800/4588595) loss:2.152 lr:0.0000100 epoch_Time:27194.0min: [2024-01-04 00:19:51,865][model8_pretrain.py][INFO] Epoch:[0/2](290800/4588595) loss:3.194 lr:0.0000100 epoch_Time:27194.0min: [2024-01-04 00:19:51,865][model8_pretrain.py][INFO] Epoch:[0/2](290800/4588595) loss:2.903 lr:0.0000100 epoch_Time:27194.0min: [2024-01-04 00:19:51,865][model8_pretrain.py][INFO] Epoch:[0/2](290800/4588595) loss:3.157 lr:0.0000100 epoch_Time:27194.0min: [2024-01-04 00:19:51,865][model8_pretrain.py][INFO] Epoch:[0/2](290800/4588595) loss:2.837 lr:0.0000100 epoch_Time:27194.0min: [2024-01-04 00:19:51,866][model8_pretrain.py][INFO] Epoch:[0/2](290800/4588595) loss:3.392 lr:0.0000100 epoch_Time:27194.0min: [2024-01-04 00:20:28,812][model8_pretrain.py][INFO] Epoch:[0/2](290900/4588595) loss:2.737 lr:0.0000100 epoch_Time:27194.0min: [2024-01-04 00:20:28,812][model8_pretrain.py][INFO] Epoch:[0/2](290900/4588595) loss:3.497 lr:0.0000100 
epoch_Time:27194.0min: [2024-01-04 00:20:28,812][model8_pretrain.py][INFO] Epoch:[0/2](290900/4588595) loss:2.661 lr:0.0000100 epoch_Time:27194.0min: [2024-01-04 00:20:28,812][model8_pretrain.py][INFO] Epoch:[0/2](290900/4588595) loss:3.103 lr:0.0000100 epoch_Time:27194.0min: [2024-01-04 00:20:28,812][model8_pretrain.py][INFO] Epoch:[0/2](290900/4588595) loss:2.596 lr:0.0000100 epoch_Time:27194.0min: [2024-01-04 00:20:28,812][model8_pretrain.py][INFO] Epoch:[0/2](290900/4588595) loss:2.621 lr:0.0000100 epoch_Time:27194.0min: [2024-01-04 00:20:28,812][model8_pretrain.py][INFO] Epoch:[0/2](290900/4588595) loss:2.814 lr:0.0000100 epoch_Time:27194.0min: [2024-01-04 00:20:28,812][model8_pretrain.py][INFO] Epoch:[0/2](290900/4588595) loss:3.106 lr:0.0000100 epoch_Time:27194.0min: [2024-01-04 00:21:05,760][model8_pretrain.py][INFO] Epoch:[0/2](291000/4588595) loss:2.867 lr:0.0000100 epoch_Time:27193.0min: [2024-01-04 00:21:05,760][model8_pretrain.py][INFO] Epoch:[0/2](291000/4588595) loss:3.374 lr:0.0000100 epoch_Time:27193.0min: [2024-01-04 00:21:05,760][model8_pretrain.py][INFO] Epoch:[0/2](291000/4588595) loss:2.998 lr:0.0000100 epoch_Time:27193.0min: [2024-01-04 00:21:05,760][model8_pretrain.py][INFO] Epoch:[0/2](291000/4588595) loss:3.448 lr:0.0000100 epoch_Time:27193.0min: [2024-01-04 00:21:05,760][model8_pretrain.py][INFO] Epoch:[0/2](291000/4588595) loss:2.755 lr:0.0000100 epoch_Time:27193.0min: [2024-01-04 00:21:05,761][model8_pretrain.py][INFO] Epoch:[0/2](291000/4588595) loss:2.864 lr:0.0000100 epoch_Time:27193.0min: [2024-01-04 00:21:05,761][model8_pretrain.py][INFO] Epoch:[0/2](291000/4588595) loss:2.518 lr:0.0000100 epoch_Time:27193.0min: [2024-01-04 00:21:05,761][model8_pretrain.py][INFO] Epoch:[0/2](291000/4588595) loss:3.250 lr:0.0000100 epoch_Time:27193.0min: [2024-01-04 00:21:42,707][model8_pretrain.py][INFO] Epoch:[0/2](291100/4588595) loss:3.075 lr:0.0000100 epoch_Time:27192.0min: [2024-01-04 00:21:42,707][model8_pretrain.py][INFO] 
Epoch:[0/2](291100/4588595) loss:2.882 lr:0.0000100 epoch_Time:27192.0min: [2024-01-04 00:21:42,707][model8_pretrain.py][INFO] Epoch:[0/2](291100/4588595) loss:2.673 lr:0.0000100 epoch_Time:27192.0min: [2024-01-04 00:21:42,707][model8_pretrain.py][INFO] Epoch:[0/2](291100/4588595) loss:3.235 lr:0.0000100 epoch_Time:27192.0min: [2024-01-04 00:21:42,707][model8_pretrain.py][INFO] Epoch:[0/2](291100/4588595) loss:3.118 lr:0.0000100 epoch_Time:27192.0min: [2024-01-04 00:21:42,707][model8_pretrain.py][INFO] Epoch:[0/2](291100/4588595) loss:3.302 lr:0.0000100 epoch_Time:27192.0min: [2024-01-04 00:21:42,707][model8_pretrain.py][INFO] Epoch:[0/2](291100/4588595) loss:3.140 lr:0.0000100 epoch_Time:27192.0min: [2024-01-04 00:21:42,707][model8_pretrain.py][INFO] Epoch:[0/2](291100/4588595) loss:2.882 lr:0.0000100 epoch_Time:27192.0min: [2024-01-04 00:22:19,658][model8_pretrain.py][INFO] Epoch:[0/2](291200/4588595) loss:2.475 lr:0.0000100 epoch_Time:27191.0min: [2024-01-04 00:22:19,658][model8_pretrain.py][INFO] Epoch:[0/2](291200/4588595) loss:2.732 lr:0.0000100 epoch_Time:27191.0min: [2024-01-04 00:22:19,658][model8_pretrain.py][INFO] Epoch:[0/2](291200/4588595) loss:3.174 lr:0.0000100 epoch_Time:27191.0min: [2024-01-04 00:22:19,658][model8_pretrain.py][INFO] Epoch:[0/2](291200/4588595) loss:2.863 lr:0.0000100 epoch_Time:27191.0min: [2024-01-04 00:22:19,659][model8_pretrain.py][INFO] Epoch:[0/2](291200/4588595) loss:2.920 lr:0.0000100 epoch_Time:27191.0min: [2024-01-04 00:22:19,659][model8_pretrain.py][INFO] Epoch:[0/2](291200/4588595) loss:2.762 lr:0.0000100 epoch_Time:27191.0min: [2024-01-04 00:22:19,659][model8_pretrain.py][INFO] Epoch:[0/2](291200/4588595) loss:2.995 lr:0.0000100 epoch_Time:27191.0min: [2024-01-04 00:22:19,659][model8_pretrain.py][INFO] Epoch:[0/2](291200/4588595) loss:2.824 lr:0.0000100 epoch_Time:27191.0min: [2024-01-04 00:22:56,601][model8_pretrain.py][INFO] Epoch:[0/2](291300/4588595) loss:2.828 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 
00:22:56,601][model8_pretrain.py][INFO] Epoch:[0/2](291300/4588595) loss:3.054 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:22:56,601][model8_pretrain.py][INFO] Epoch:[0/2](291300/4588595) loss:2.659 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:22:56,601][model8_pretrain.py][INFO] Epoch:[0/2](291300/4588595) loss:3.186 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:22:56,601][model8_pretrain.py][INFO] Epoch:[0/2](291300/4588595) loss:2.966 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:22:56,601][model8_pretrain.py][INFO] Epoch:[0/2](291300/4588595) loss:2.549 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:22:56,602][model8_pretrain.py][INFO] Epoch:[0/2](291300/4588595) loss:3.080 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:22:56,602][model8_pretrain.py][INFO] Epoch:[0/2](291300/4588595) loss:2.981 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:23:35,285][model8_pretrain.py][INFO] Epoch:[0/2](291400/4588595) loss:2.833 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:23:35,285][model8_pretrain.py][INFO] Epoch:[0/2](291400/4588595) loss:2.945 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:23:35,285][model8_pretrain.py][INFO] Epoch:[0/2](291400/4588595) loss:2.675 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:23:35,285][model8_pretrain.py][INFO] Epoch:[0/2](291400/4588595) loss:2.712 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:23:35,285][model8_pretrain.py][INFO] Epoch:[0/2](291400/4588595) loss:2.704 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:23:35,285][model8_pretrain.py][INFO] Epoch:[0/2](291400/4588595) loss:3.269 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:23:35,285][model8_pretrain.py][INFO] Epoch:[0/2](291400/4588595) loss:3.089 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:23:35,285][model8_pretrain.py][INFO] Epoch:[0/2](291400/4588595) loss:2.696 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:24:20,987][model8_pretrain.py][INFO] Epoch:[0/2](291500/4588595) loss:2.894 lr:0.0000100 
epoch_Time:27191.0min: [2024-01-04 00:24:20,987][model8_pretrain.py][INFO] Epoch:[0/2](291500/4588595) loss:2.970 lr:0.0000100 epoch_Time:27191.0min: [2024-01-04 00:24:20,987][model8_pretrain.py][INFO] Epoch:[0/2](291500/4588595) loss:3.136 lr:0.0000100 epoch_Time:27191.0min: [2024-01-04 00:24:20,987][model8_pretrain.py][INFO] Epoch:[0/2](291500/4588595) loss:2.938 lr:0.0000100 epoch_Time:27191.0min: [2024-01-04 00:24:20,987][model8_pretrain.py][INFO] Epoch:[0/2](291500/4588595) loss:2.923 lr:0.0000100 epoch_Time:27191.0min: [2024-01-04 00:24:20,987][model8_pretrain.py][INFO] Epoch:[0/2](291500/4588595) loss:3.113 lr:0.0000100 epoch_Time:27191.0min: [2024-01-04 00:24:20,987][model8_pretrain.py][INFO] Epoch:[0/2](291500/4588595) loss:3.350 lr:0.0000100 epoch_Time:27191.0min: [2024-01-04 00:24:20,987][model8_pretrain.py][INFO] Epoch:[0/2](291500/4588595) loss:3.232 lr:0.0000100 epoch_Time:27191.0min: [2024-01-04 00:24:57,892][model8_pretrain.py][INFO] Epoch:[0/2](291600/4588595) loss:3.145 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:24:57,892][model8_pretrain.py][INFO] Epoch:[0/2](291600/4588595) loss:3.195 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:24:57,892][model8_pretrain.py][INFO] Epoch:[0/2](291600/4588595) loss:3.427 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:24:57,892][model8_pretrain.py][INFO] Epoch:[0/2](291600/4588595) loss:2.637 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:24:57,892][model8_pretrain.py][INFO] Epoch:[0/2](291600/4588595) loss:3.520 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:24:57,892][model8_pretrain.py][INFO] Epoch:[0/2](291600/4588595) loss:2.993 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:24:57,892][model8_pretrain.py][INFO] Epoch:[0/2](291600/4588595) loss:3.194 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:24:57,892][model8_pretrain.py][INFO] Epoch:[0/2](291600/4588595) loss:2.958 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:25:34,828][model8_pretrain.py][INFO] 
Epoch:[0/2](291700/4588595) loss:3.095 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:25:34,828][model8_pretrain.py][INFO] Epoch:[0/2](291700/4588595) loss:3.210 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:25:34,828][model8_pretrain.py][INFO] Epoch:[0/2](291700/4588595) loss:2.960 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:25:34,828][model8_pretrain.py][INFO] Epoch:[0/2](291700/4588595) loss:3.420 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:25:34,828][model8_pretrain.py][INFO] Epoch:[0/2](291700/4588595) loss:3.000 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:25:34,828][model8_pretrain.py][INFO] Epoch:[0/2](291700/4588595) loss:3.100 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:25:34,828][model8_pretrain.py][INFO] Epoch:[0/2](291700/4588595) loss:2.624 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:25:34,828][model8_pretrain.py][INFO] Epoch:[0/2](291700/4588595) loss:3.185 lr:0.0000100 epoch_Time:27190.0min: [2024-01-04 00:26:11,774][model8_pretrain.py][INFO] Epoch:[0/2](291800/4588595) loss:2.735 lr:0.0000100 epoch_Time:27188.0min: [2024-01-04 00:26:11,774][model8_pretrain.py][INFO] Epoch:[0/2](291800/4588595) loss:3.103 lr:0.0000100 epoch_Time:27188.0min: [2024-01-04 00:26:11,774][model8_pretrain.py][INFO] Epoch:[0/2](291800/4588595) loss:3.024 lr:0.0000100 epoch_Time:27188.0min: [2024-01-04 00:26:11,774][model8_pretrain.py][INFO] Epoch:[0/2](291800/4588595) loss:2.747 lr:0.0000100 epoch_Time:27188.0min: [2024-01-04 00:26:11,774][model8_pretrain.py][INFO] Epoch:[0/2](291800/4588595) loss:2.960 lr:0.0000100 epoch_Time:27188.0min: [2024-01-04 00:26:11,774][model8_pretrain.py][INFO] Epoch:[0/2](291800/4588595) loss:3.098 lr:0.0000100 epoch_Time:27188.0min: [2024-01-04 00:26:11,774][model8_pretrain.py][INFO] Epoch:[0/2](291800/4588595) loss:3.253 lr:0.0000100 epoch_Time:27188.0min: [2024-01-04 00:26:11,774][model8_pretrain.py][INFO] Epoch:[0/2](291800/4588595) loss:2.961 lr:0.0000100 epoch_Time:27188.0min: [2024-01-04 
00:26:48,716][model8_pretrain.py][INFO] Epoch:[0/2](291900/4588595) loss:3.262 lr:0.0000100 epoch_Time:27187.0min: [2024-01-04 00:26:48,716][model8_pretrain.py][INFO] Epoch:[0/2](291900/4588595) loss:2.650 lr:0.0000100 epoch_Time:27187.0min: [2024-01-04 00:26:48,716][model8_pretrain.py][INFO] Epoch:[0/2](291900/4588595) loss:3.171 lr:0.0000100 epoch_Time:27187.0min: [2024-01-04 00:26:48,716][model8_pretrain.py][INFO] Epoch:[0/2](291900/4588595) loss:3.318 lr:0.0000100 epoch_Time:27187.0min: [2024-01-04 00:26:48,716][model8_pretrain.py][INFO] Epoch:[0/2](291900/4588595) loss:2.797 lr:0.0000100 epoch_Time:27187.0min: [2024-01-04 00:26:48,717][model8_pretrain.py][INFO] Epoch:[0/2](291900/4588595) loss:3.304 lr:0.0000100 epoch_Time:27187.0min: [2024-01-04 00:26:48,716][model8_pretrain.py][INFO] Epoch:[0/2](291900/4588595) loss:3.141 lr:0.0000100 epoch_Time:27187.0min: [2024-01-04 00:26:48,717][model8_pretrain.py][INFO] Epoch:[0/2](291900/4588595) loss:2.653 lr:0.0000100 epoch_Time:27187.0min: [2024-01-04 00:27:25,657][model8_pretrain.py][INFO] Epoch:[0/2](292000/4588595) loss:2.963 lr:0.0000100 epoch_Time:27187.0min: [2024-01-04 00:27:25,657][model8_pretrain.py][INFO] Epoch:[0/2](292000/4588595) loss:2.949 lr:0.0000100 epoch_Time:27187.0min: [2024-01-04 00:27:25,657][model8_pretrain.py][INFO] Epoch:[0/2](292000/4588595) loss:3.033 lr:0.0000100 epoch_Time:27187.0min: [2024-01-04 00:27:25,657][model8_pretrain.py][INFO] Epoch:[0/2](292000/4588595) loss:3.336 lr:0.0000100 epoch_Time:27187.0min: [2024-01-04 00:27:25,657][model8_pretrain.py][INFO] Epoch:[0/2](292000/4588595) loss:2.893 lr:0.0000100 epoch_Time:27187.0min: [2024-01-04 00:27:25,657][model8_pretrain.py][INFO] Epoch:[0/2](292000/4588595) loss:2.544 lr:0.0000100 epoch_Time:27187.0min: [2024-01-04 00:27:25,657][model8_pretrain.py][INFO] Epoch:[0/2](292000/4588595) loss:3.130 lr:0.0000100 epoch_Time:27187.0min: [2024-01-04 00:27:25,657][model8_pretrain.py][INFO] Epoch:[0/2](292000/4588595) loss:2.497 lr:0.0000100 
epoch_Time:27187.0min: [2024-01-04 00:28:02,588][model8_pretrain.py][INFO] Epoch:[0/2](292100/4588595) loss:2.729 lr:0.0000100 epoch_Time:27186.0min: [2024-01-04 00:28:02,588][model8_pretrain.py][INFO] Epoch:[0/2](292100/4588595) loss:2.970 lr:0.0000100 epoch_Time:27186.0min: [2024-01-04 00:28:02,588][model8_pretrain.py][INFO] Epoch:[0/2](292100/4588595) loss:2.969 lr:0.0000100 epoch_Time:27186.0min: [2024-01-04 00:28:02,588][model8_pretrain.py][INFO] Epoch:[0/2](292100/4588595) loss:2.692 lr:0.0000100 epoch_Time:27186.0min: [2024-01-04 00:28:02,588][model8_pretrain.py][INFO] Epoch:[0/2](292100/4588595) loss:3.307 lr:0.0000100 epoch_Time:27186.0min: [2024-01-04 00:28:02,588][model8_pretrain.py][INFO] Epoch:[0/2](292100/4588595) loss:2.580 lr:0.0000100 epoch_Time:27186.0min: [2024-01-04 00:28:02,588][model8_pretrain.py][INFO] Epoch:[0/2](292100/4588595) loss:3.503 lr:0.0000100 epoch_Time:27186.0min: [2024-01-04 00:28:02,588][model8_pretrain.py][INFO] Epoch:[0/2](292100/4588595) loss:2.627 lr:0.0000100 epoch_Time:27186.0min: [2024-01-04 00:28:41,265][model8_pretrain.py][INFO] Epoch:[0/2](292200/4588595) loss:2.721 lr:0.0000100 epoch_Time:27186.0min: [2024-01-04 00:28:41,265][model8_pretrain.py][INFO] Epoch:[0/2](292200/4588595) loss:3.302 lr:0.0000100 epoch_Time:27186.0min: [2024-01-04 00:28:41,265][model8_pretrain.py][INFO] Epoch:[0/2](292200/4588595) loss:2.929 lr:0.0000100 epoch_Time:27186.0min: [2024-01-04 00:28:41,265][model8_pretrain.py][INFO] Epoch:[0/2](292200/4588595) loss:3.057 lr:0.0000100 epoch_Time:27186.0min: [2024-01-04 00:28:41,265][model8_pretrain.py][INFO] Epoch:[0/2](292200/4588595) loss:2.874 lr:0.0000100 epoch_Time:27186.0min: [2024-01-04 00:28:41,265][model8_pretrain.py][INFO] Epoch:[0/2](292200/4588595) loss:2.956 lr:0.0000100 epoch_Time:27186.0min: [2024-01-04 00:28:41,265][model8_pretrain.py][INFO] Epoch:[0/2](292200/4588595) loss:2.920 lr:0.0000100 epoch_Time:27186.0min: [2024-01-04 00:28:41,265][model8_pretrain.py][INFO] 
Epoch:[0/2](292200/4588595) loss:2.717 lr:0.0000100 epoch_Time:27186.0min: [2024-01-04 00:29:26,882][model8_pretrain.py][INFO] Epoch:[0/2](292300/4588595) loss:3.084 lr:0.0000100 epoch_Time:27187.0min: [2024-01-04 00:29:26,882][model8_pretrain.py][INFO] Epoch:[0/2](292300/4588595) loss:3.225 lr:0.0000100 epoch_Time:27187.0min: [2024-01-04 00:29:26,882][model8_pretrain.py][INFO] Epoch:[0/2](292300/4588595) loss:3.394 lr:0.0000100 epoch_Time:27187.0min: [2024-01-04 00:29:26,882][model8_pretrain.py][INFO] Epoch:[0/2](292300/4588595) loss:3.322 lr:0.0000100 epoch_Time:27187.0min: [2024-01-04 00:29:26,882][model8_pretrain.py][INFO] Epoch:[0/2](292300/4588595) loss:2.974 lr:0.0000100 epoch_Time:27187.0min: [2024-01-04 00:29:26,882][model8_pretrain.py][INFO] Epoch:[0/2](292300/4588595) loss:3.157 lr:0.0000100 epoch_Time:27187.0min: [2024-01-04 00:29:26,882][model8_pretrain.py][INFO] Epoch:[0/2](292300/4588595) loss:2.960 lr:0.0000100 epoch_Time:27187.0min: [2024-01-04 00:29:26,882][model8_pretrain.py][INFO] Epoch:[0/2](292300/4588595) loss:2.564 lr:0.0000100 epoch_Time:27187.0min: [2024-01-04 00:30:03,830][model8_pretrain.py][INFO] Epoch:[0/2](292400/4588595) loss:3.390 lr:0.0000100 epoch_Time:27185.0min: [2024-01-04 00:30:03,830][model8_pretrain.py][INFO] Epoch:[0/2](292400/4588595) loss:3.206 lr:0.0000100 epoch_Time:27185.0min: [2024-01-04 00:30:03,830][model8_pretrain.py][INFO] Epoch:[0/2](292400/4588595) loss:3.127 lr:0.0000100 epoch_Time:27185.0min: [2024-01-04 00:30:03,830][model8_pretrain.py][INFO] Epoch:[0/2](292400/4588595) loss:2.775 lr:0.0000100 epoch_Time:27185.0min: [2024-01-04 00:30:03,830][model8_pretrain.py][INFO] Epoch:[0/2](292400/4588595) loss:3.265 lr:0.0000100 epoch_Time:27185.0min: [2024-01-04 00:30:03,830][model8_pretrain.py][INFO] Epoch:[0/2](292400/4588595) loss:2.952 lr:0.0000100 epoch_Time:27185.0min: [2024-01-04 00:30:03,830][model8_pretrain.py][INFO] Epoch:[0/2](292400/4588595) loss:3.006 lr:0.0000100 epoch_Time:27185.0min: [2024-01-04 
00:30:03,830][model8_pretrain.py][INFO] Epoch:[0/2](292400/4588595) loss:3.287 lr:0.0000100 epoch_Time:27185.0min: [2024-01-04 00:30:40,772][model8_pretrain.py][INFO] Epoch:[0/2](292500/4588595) loss:2.721 lr:0.0000100 epoch_Time:27185.0min: [2024-01-04 00:30:40,773][model8_pretrain.py][INFO] Epoch:[0/2](292500/4588595) loss:2.573 lr:0.0000100 epoch_Time:27185.0min: [2024-01-04 00:30:40,773][model8_pretrain.py][INFO] Epoch:[0/2](292500/4588595) loss:3.039 lr:0.0000100 epoch_Time:27185.0min: [2024-01-04 00:30:40,773][model8_pretrain.py][INFO] Epoch:[0/2](292500/4588595) loss:3.099 lr:0.0000100 epoch_Time:27185.0min: [2024-01-04 00:30:40,773][model8_pretrain.py][INFO] Epoch:[0/2](292500/4588595) loss:3.089 lr:0.0000100 epoch_Time:27185.0min: [2024-01-04 00:30:40,773][model8_pretrain.py][INFO] Epoch:[0/2](292500/4588595) loss:2.637 lr:0.0000100 epoch_Time:27185.0min: [2024-01-04 00:30:40,773][model8_pretrain.py][INFO] Epoch:[0/2](292500/4588595) loss:3.340 lr:0.0000100 epoch_Time:27185.0min: [2024-01-04 00:30:40,773][model8_pretrain.py][INFO] Epoch:[0/2](292500/4588595) loss:2.483 lr:0.0000100 epoch_Time:27185.0min: [2024-01-04 00:31:17,736][model8_pretrain.py][INFO] Epoch:[0/2](292600/4588595) loss:2.401 lr:0.0000100 epoch_Time:27184.0min: [2024-01-04 00:31:17,736][model8_pretrain.py][INFO] Epoch:[0/2](292600/4588595) loss:2.918 lr:0.0000100 epoch_Time:27184.0min: [2024-01-04 00:31:17,736][model8_pretrain.py][INFO] Epoch:[0/2](292600/4588595) loss:2.871 lr:0.0000100 epoch_Time:27184.0min: [2024-01-04 00:31:17,736][model8_pretrain.py][INFO] Epoch:[0/2](292600/4588595) loss:2.475 lr:0.0000100 epoch_Time:27184.0min: [2024-01-04 00:31:17,736][model8_pretrain.py][INFO] Epoch:[0/2](292600/4588595) loss:2.912 lr:0.0000100 epoch_Time:27184.0min: [2024-01-04 00:31:17,736][model8_pretrain.py][INFO] Epoch:[0/2](292600/4588595) loss:2.492 lr:0.0000100 epoch_Time:27184.0min: [2024-01-04 00:31:17,736][model8_pretrain.py][INFO] Epoch:[0/2](292600/4588595) loss:2.945 lr:0.0000100 
epoch_Time:27184.0min: [2024-01-04 00:31:17,736][model8_pretrain.py][INFO] Epoch:[0/2](292600/4588595) loss:3.036 lr:0.0000100 epoch_Time:27184.0min: [2024-01-04 00:31:54,685][model8_pretrain.py][INFO] Epoch:[0/2](292700/4588595) loss:2.763 lr:0.0000100 epoch_Time:27183.0min: [2024-01-04 00:31:54,685][model8_pretrain.py][INFO] Epoch:[0/2](292700/4588595) loss:2.355 lr:0.0000100 epoch_Time:27183.0min: [2024-01-04 00:31:54,685][model8_pretrain.py][INFO] Epoch:[0/2](292700/4588595) loss:2.854 lr:0.0000100 epoch_Time:27183.0min: [2024-01-04 00:31:54,686][model8_pretrain.py][INFO] Epoch:[0/2](292700/4588595) loss:2.671 lr:0.0000100 epoch_Time:27183.0min: [2024-01-04 00:31:54,686][model8_pretrain.py][INFO] Epoch:[0/2](292700/4588595) loss:2.851 lr:0.0000100 epoch_Time:27183.0min: [2024-01-04 00:31:54,686][model8_pretrain.py][INFO] Epoch:[0/2](292700/4588595) loss:2.646 lr:0.0000100 epoch_Time:27183.0min: [2024-01-04 00:31:54,686][model8_pretrain.py][INFO] Epoch:[0/2](292700/4588595) loss:2.836 lr:0.0000100 epoch_Time:27183.0min: [2024-01-04 00:31:54,686][model8_pretrain.py][INFO] Epoch:[0/2](292700/4588595) loss:3.274 lr:0.0000100 epoch_Time:27183.0min: [2024-01-04 00:32:31,642][model8_pretrain.py][INFO] Epoch:[0/2](292800/4588595) loss:2.416 lr:0.0000100 epoch_Time:27182.0min: [2024-01-04 00:32:31,642][model8_pretrain.py][INFO] Epoch:[0/2](292800/4588595) loss:2.743 lr:0.0000100 epoch_Time:27182.0min: [2024-01-04 00:32:31,642][model8_pretrain.py][INFO] Epoch:[0/2](292800/4588595) loss:2.634 lr:0.0000100 epoch_Time:27182.0min: [2024-01-04 00:32:31,642][model8_pretrain.py][INFO] Epoch:[0/2](292800/4588595) loss:2.951 lr:0.0000100 epoch_Time:27182.0min: [2024-01-04 00:32:31,642][model8_pretrain.py][INFO] Epoch:[0/2](292800/4588595) loss:3.619 lr:0.0000100 epoch_Time:27182.0min: [2024-01-04 00:32:31,642][model8_pretrain.py][INFO] Epoch:[0/2](292800/4588595) loss:2.463 lr:0.0000100 epoch_Time:27182.0min: [2024-01-04 00:32:31,643][model8_pretrain.py][INFO] 
Epoch:[0/2](292800/4588595) loss:3.076 lr:0.0000100 epoch_Time:27182.0min: [2024-01-04 00:32:31,643][model8_pretrain.py][INFO] Epoch:[0/2](292800/4588595) loss:2.626 lr:0.0000100 epoch_Time:27182.0min: [2024-01-04 00:33:08,590][model8_pretrain.py][INFO] Epoch:[0/2](292900/4588595) loss:2.726 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:33:08,590][model8_pretrain.py][INFO] Epoch:[0/2](292900/4588595) loss:2.899 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:33:08,590][model8_pretrain.py][INFO] Epoch:[0/2](292900/4588595) loss:1.910 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:33:08,590][model8_pretrain.py][INFO] Epoch:[0/2](292900/4588595) loss:3.231 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:33:08,590][model8_pretrain.py][INFO] Epoch:[0/2](292900/4588595) loss:3.235 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:33:08,591][model8_pretrain.py][INFO] Epoch:[0/2](292900/4588595) loss:2.942 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:33:08,591][model8_pretrain.py][INFO] Epoch:[0/2](292900/4588595) loss:3.076 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:33:08,591][model8_pretrain.py][INFO] Epoch:[0/2](292900/4588595) loss:2.854 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:33:47,340][model8_pretrain.py][INFO] Epoch:[0/2](293000/4588595) loss:2.718 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:33:47,340][model8_pretrain.py][INFO] Epoch:[0/2](293000/4588595) loss:3.143 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:33:47,340][model8_pretrain.py][INFO] Epoch:[0/2](293000/4588595) loss:2.534 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:33:47,340][model8_pretrain.py][INFO] Epoch:[0/2](293000/4588595) loss:2.670 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:33:47,340][model8_pretrain.py][INFO] Epoch:[0/2](293000/4588595) loss:3.006 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:33:47,344][model8_pretrain.py][INFO] Epoch:[0/2](293000/4588595) loss:3.236 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 
00:33:47,345][model8_pretrain.py][INFO] Epoch:[0/2](293000/4588595) loss:2.693 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:33:47,345][model8_pretrain.py][INFO] Epoch:[0/2](293000/4588595) loss:3.307 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:34:32,919][model8_pretrain.py][INFO] Epoch:[0/2](293100/4588595) loss:3.439 lr:0.0000100 epoch_Time:27182.0min: [2024-01-04 00:34:32,919][model8_pretrain.py][INFO] Epoch:[0/2](293100/4588595) loss:2.299 lr:0.0000100 epoch_Time:27182.0min: [2024-01-04 00:34:32,919][model8_pretrain.py][INFO] Epoch:[0/2](293100/4588595) loss:3.388 lr:0.0000100 epoch_Time:27182.0min: [2024-01-04 00:34:32,919][model8_pretrain.py][INFO] Epoch:[0/2](293100/4588595) loss:2.703 lr:0.0000100 epoch_Time:27182.0min: [2024-01-04 00:34:32,919][model8_pretrain.py][INFO] Epoch:[0/2](293100/4588595) loss:3.415 lr:0.0000100 epoch_Time:27182.0min: [2024-01-04 00:34:32,919][model8_pretrain.py][INFO] Epoch:[0/2](293100/4588595) loss:2.989 lr:0.0000100 epoch_Time:27182.0min: [2024-01-04 00:34:32,919][model8_pretrain.py][INFO] Epoch:[0/2](293100/4588595) loss:3.209 lr:0.0000100 epoch_Time:27182.0min: [2024-01-04 00:34:32,919][model8_pretrain.py][INFO] Epoch:[0/2](293100/4588595) loss:3.634 lr:0.0000100 epoch_Time:27182.0min: [2024-01-04 00:35:09,864][model8_pretrain.py][INFO] Epoch:[0/2](293200/4588595) loss:2.087 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:35:09,864][model8_pretrain.py][INFO] Epoch:[0/2](293200/4588595) loss:2.812 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:35:09,864][model8_pretrain.py][INFO] Epoch:[0/2](293200/4588595) loss:2.613 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:35:09,864][model8_pretrain.py][INFO] Epoch:[0/2](293200/4588595) loss:2.949 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:35:09,864][model8_pretrain.py][INFO] Epoch:[0/2](293200/4588595) loss:2.668 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:35:09,864][model8_pretrain.py][INFO] Epoch:[0/2](293200/4588595) loss:3.623 lr:0.0000100 
epoch_Time:27181.0min: [2024-01-04 00:35:09,864][model8_pretrain.py][INFO] Epoch:[0/2](293200/4588595) loss:3.021 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:35:09,864][model8_pretrain.py][INFO] Epoch:[0/2](293200/4588595) loss:2.593 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:35:46,822][model8_pretrain.py][INFO] Epoch:[0/2](293300/4588595) loss:3.506 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:35:46,822][model8_pretrain.py][INFO] Epoch:[0/2](293300/4588595) loss:2.467 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:35:46,822][model8_pretrain.py][INFO] Epoch:[0/2](293300/4588595) loss:2.514 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:35:46,822][model8_pretrain.py][INFO] Epoch:[0/2](293300/4588595) loss:2.822 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:35:46,822][model8_pretrain.py][INFO] Epoch:[0/2](293300/4588595) loss:2.807 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:35:46,822][model8_pretrain.py][INFO] Epoch:[0/2](293300/4588595) loss:3.031 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:35:46,822][model8_pretrain.py][INFO] Epoch:[0/2](293300/4588595) loss:2.612 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:35:46,822][model8_pretrain.py][INFO] Epoch:[0/2](293300/4588595) loss:3.167 lr:0.0000100 epoch_Time:27181.0min: [2024-01-04 00:36:23,785][model8_pretrain.py][INFO] Epoch:[0/2](293400/4588595) loss:3.029 lr:0.0000100 epoch_Time:27180.0min: [2024-01-04 00:36:23,785][model8_pretrain.py][INFO] Epoch:[0/2](293400/4588595) loss:2.945 lr:0.0000100 epoch_Time:27180.0min: [2024-01-04 00:36:23,785][model8_pretrain.py][INFO] Epoch:[0/2](293400/4588595) loss:2.715 lr:0.0000100 epoch_Time:27180.0min: [2024-01-04 00:36:23,785][model8_pretrain.py][INFO] Epoch:[0/2](293400/4588595) loss:3.189 lr:0.0000100 epoch_Time:27180.0min: [2024-01-04 00:36:23,785][model8_pretrain.py][INFO] Epoch:[0/2](293400/4588595) loss:2.987 lr:0.0000100 epoch_Time:27180.0min: [2024-01-04 00:36:23,785][model8_pretrain.py][INFO] 
Epoch:[0/2](293400/4588595) loss:2.731 lr:0.0000100 epoch_Time:27180.0min: [2024-01-04 00:36:23,786][model8_pretrain.py][INFO] Epoch:[0/2](293400/4588595) loss:2.710 lr:0.0000100 epoch_Time:27180.0min: [2024-01-04 00:36:23,786][model8_pretrain.py][INFO] Epoch:[0/2](293400/4588595) loss:2.730 lr:0.0000100 epoch_Time:27180.0min: [2024-01-04 00:37:00,737][model8_pretrain.py][INFO] Epoch:[0/2](293500/4588595) loss:2.810 lr:0.0000100 epoch_Time:27178.0min: [2024-01-04 00:37:00,737][model8_pretrain.py][INFO] Epoch:[0/2](293500/4588595) loss:2.492 lr:0.0000100 epoch_Time:27178.0min: [2024-01-04 00:37:00,737][model8_pretrain.py][INFO] Epoch:[0/2](293500/4588595) loss:2.866 lr:0.0000100 epoch_Time:27178.0min: [2024-01-04 00:37:00,737][model8_pretrain.py][INFO] Epoch:[0/2](293500/4588595) loss:3.283 lr:0.0000100 epoch_Time:27178.0min: [2024-01-04 00:37:00,737][model8_pretrain.py][INFO] Epoch:[0/2](293500/4588595) loss:2.293 lr:0.0000100 epoch_Time:27178.0min: [2024-01-04 00:37:00,737][model8_pretrain.py][INFO] Epoch:[0/2](293500/4588595) loss:3.123 lr:0.0000100 epoch_Time:27178.0min: [2024-01-04 00:37:00,737][model8_pretrain.py][INFO] Epoch:[0/2](293500/4588595) loss:3.264 lr:0.0000100 epoch_Time:27178.0min: [2024-01-04 00:37:00,737][model8_pretrain.py][INFO] Epoch:[0/2](293500/4588595) loss:3.372 lr:0.0000100 epoch_Time:27178.0min: [2024-01-04 00:37:37,694][model8_pretrain.py][INFO] Epoch:[0/2](293600/4588595) loss:2.953 lr:0.0000100 epoch_Time:27178.0min: [2024-01-04 00:37:37,694][model8_pretrain.py][INFO] Epoch:[0/2](293600/4588595) loss:2.394 lr:0.0000100 epoch_Time:27178.0min: [2024-01-04 00:37:37,694][model8_pretrain.py][INFO] Epoch:[0/2](293600/4588595) loss:3.244 lr:0.0000100 epoch_Time:27178.0min: [2024-01-04 00:37:37,694][model8_pretrain.py][INFO] Epoch:[0/2](293600/4588595) loss:2.796 lr:0.0000100 epoch_Time:27178.0min: [2024-01-04 00:37:37,694][model8_pretrain.py][INFO] Epoch:[0/2](293600/4588595) loss:2.432 lr:0.0000100 epoch_Time:27178.0min: [2024-01-04 
00:37:37,695][model8_pretrain.py][INFO] Epoch:[0/2](293600/4588595) loss:2.973 lr:0.0000100 epoch_Time:27178.0min: [2024-01-04 00:37:37,695][model8_pretrain.py][INFO] Epoch:[0/2](293600/4588595) loss:2.770 lr:0.0000100 epoch_Time:27178.0min: [2024-01-04 00:37:37,695][model8_pretrain.py][INFO] Epoch:[0/2](293600/4588595) loss:3.050 lr:0.0000100 epoch_Time:27178.0min: [2024-01-04 00:38:14,655][model8_pretrain.py][INFO] Epoch:[0/2](293700/4588595) loss:2.769 lr:0.0000100 epoch_Time:27177.0min: [2024-01-04 00:38:14,655][model8_pretrain.py][INFO] Epoch:[0/2](293700/4588595) loss:3.189 lr:0.0000100 epoch_Time:27177.0min: [2024-01-04 00:38:14,655][model8_pretrain.py][INFO] Epoch:[0/2](293700/4588595) loss:3.242 lr:0.0000100 epoch_Time:27177.0min: [2024-01-04 00:38:14,655][model8_pretrain.py][INFO] Epoch:[0/2](293700/4588595) loss:2.247 lr:0.0000100 epoch_Time:27177.0min: [2024-01-04 00:38:14,655][model8_pretrain.py][INFO] Epoch:[0/2](293700/4588595) loss:2.809 lr:0.0000100 epoch_Time:27177.0min: [2024-01-04 00:38:14,655][model8_pretrain.py][INFO] Epoch:[0/2](293700/4588595) loss:3.333 lr:0.0000100 epoch_Time:27177.0min: [2024-01-04 00:38:14,655][model8_pretrain.py][INFO] Epoch:[0/2](293700/4588595) loss:3.385 lr:0.0000100 epoch_Time:27177.0min: [2024-01-04 00:38:14,655][model8_pretrain.py][INFO] Epoch:[0/2](293700/4588595) loss:2.646 lr:0.0000100 epoch_Time:27177.0min: [2024-01-04 00:38:51,597][model8_pretrain.py][INFO] Epoch:[0/2](293800/4588595) loss:3.039 lr:0.0000100 epoch_Time:27175.0min: [2024-01-04 00:38:51,597][model8_pretrain.py][INFO] Epoch:[0/2](293800/4588595) loss:2.965 lr:0.0000100 epoch_Time:27175.0min: [2024-01-04 00:38:51,597][model8_pretrain.py][INFO] Epoch:[0/2](293800/4588595) loss:2.807 lr:0.0000100 epoch_Time:27175.0min: [2024-01-04 00:38:51,597][model8_pretrain.py][INFO] Epoch:[0/2](293800/4588595) loss:3.132 lr:0.0000100 epoch_Time:27175.0min: [2024-01-04 00:38:51,597][model8_pretrain.py][INFO] Epoch:[0/2](293800/4588595) loss:2.583 lr:0.0000100 
epoch_Time:27175.0min: [2024-01-04 00:38:51,597][model8_pretrain.py][INFO] Epoch:[0/2](293800/4588595) loss:2.366 lr:0.0000100 epoch_Time:27175.0min: [2024-01-04 00:38:51,597][model8_pretrain.py][INFO] Epoch:[0/2](293800/4588595) loss:2.697 lr:0.0000100 epoch_Time:27175.0min: [2024-01-04 00:38:51,597][model8_pretrain.py][INFO] Epoch:[0/2](293800/4588595) loss:2.747 lr:0.0000100 epoch_Time:27175.0min: [2024-01-04 00:39:38,977][model8_pretrain.py][INFO] Epoch:[0/2](293900/4588595) loss:2.964 lr:0.0000100 epoch_Time:27178.0min: [2024-01-04 00:39:38,977][model8_pretrain.py][INFO] Epoch:[0/2](293900/4588595) loss:3.265 lr:0.0000100 epoch_Time:27178.0min: [2024-01-04 00:39:38,977][model8_pretrain.py][INFO] Epoch:[0/2](293900/4588595) loss:2.834 lr:0.0000100 epoch_Time:27178.0min: [2024-01-04 00:39:38,977][model8_pretrain.py][INFO] Epoch:[0/2](293900/4588595) loss:3.043 lr:0.0000100 epoch_Time:27178.0min: [2024-01-04 00:39:38,977][model8_pretrain.py][INFO] Epoch:[0/2](293900/4588595) loss:3.189 lr:0.0000100 epoch_Time:27178.0min: [2024-01-04 00:39:38,977][model8_pretrain.py][INFO] Epoch:[0/2](293900/4588595) loss:2.552 lr:0.0000100 epoch_Time:27178.0min: [2024-01-04 00:39:38,978][model8_pretrain.py][INFO] Epoch:[0/2](293900/4588595) loss:2.482 lr:0.0000100 epoch_Time:27178.0min: [2024-01-04 00:39:38,978][model8_pretrain.py][INFO] Epoch:[0/2](293900/4588595) loss:2.591 lr:0.0000100 epoch_Time:27178.0min: [2024-01-04 00:40:15,888][model8_pretrain.py][INFO] Epoch:[0/2](294000/4588595) loss:3.008 lr:0.0000100 epoch_Time:27177.0min: [2024-01-04 00:40:15,888][model8_pretrain.py][INFO] Epoch:[0/2](294000/4588595) loss:3.294 lr:0.0000100 epoch_Time:27177.0min: [2024-01-04 00:40:15,888][model8_pretrain.py][INFO] Epoch:[0/2](294000/4588595) loss:3.151 lr:0.0000100 epoch_Time:27177.0min: [2024-01-04 00:40:15,889][model8_pretrain.py][INFO] Epoch:[0/2](294000/4588595) loss:2.840 lr:0.0000100 epoch_Time:27177.0min: [2024-01-04 00:40:15,888][model8_pretrain.py][INFO] 
Epoch:[0/2](294000/4588595) loss:2.977 lr:0.0000100 epoch_Time:27177.0min: [2024-01-04 00:40:15,888][model8_pretrain.py][INFO] Epoch:[0/2](294000/4588595) loss:2.442 lr:0.0000100 epoch_Time:27177.0min: [2024-01-04 00:40:15,889][model8_pretrain.py][INFO] Epoch:[0/2](294000/4588595) loss:3.085 lr:0.0000100 epoch_Time:27177.0min: [2024-01-04 00:40:15,889][model8_pretrain.py][INFO] Epoch:[0/2](294000/4588595) loss:3.390 lr:0.0000100 epoch_Time:27177.0min: [2024-01-04 00:40:52,823][model8_pretrain.py][INFO] Epoch:[0/2](294100/4588595) loss:3.580 lr:0.0000100 epoch_Time:27175.0min: [2024-01-04 00:40:52,823][model8_pretrain.py][INFO] Epoch:[0/2](294100/4588595) loss:2.890 lr:0.0000100 epoch_Time:27175.0min: [2024-01-04 00:40:52,823][model8_pretrain.py][INFO] Epoch:[0/2](294100/4588595) loss:2.946 lr:0.0000100 epoch_Time:27175.0min: [2024-01-04 00:40:52,823][model8_pretrain.py][INFO] Epoch:[0/2](294100/4588595) loss:2.715 lr:0.0000100 epoch_Time:27175.0min: [2024-01-04 00:40:52,824][model8_pretrain.py][INFO] Epoch:[0/2](294100/4588595) loss:3.220 lr:0.0000100 epoch_Time:27175.0min: [2024-01-04 00:40:52,824][model8_pretrain.py][INFO] Epoch:[0/2](294100/4588595) loss:2.743 lr:0.0000100 epoch_Time:27175.0min: [2024-01-04 00:40:52,824][model8_pretrain.py][INFO] Epoch:[0/2](294100/4588595) loss:3.156 lr:0.0000100 epoch_Time:27175.0min: [2024-01-04 00:40:52,824][model8_pretrain.py][INFO] Epoch:[0/2](294100/4588595) loss:2.806 lr:0.0000100 epoch_Time:27175.0min: [2024-01-04 00:41:29,752][model8_pretrain.py][INFO] Epoch:[0/2](294200/4588595) loss:2.972 lr:0.0000100 epoch_Time:27175.0min: [2024-01-04 00:41:29,752][model8_pretrain.py][INFO] Epoch:[0/2](294200/4588595) loss:3.279 lr:0.0000100 epoch_Time:27175.0min: [2024-01-04 00:41:29,752][model8_pretrain.py][INFO] Epoch:[0/2](294200/4588595) loss:2.925 lr:0.0000100 epoch_Time:27175.0min: [2024-01-04 00:41:29,752][model8_pretrain.py][INFO] Epoch:[0/2](294200/4588595) loss:2.621 lr:0.0000100 epoch_Time:27175.0min: [2024-01-04 
00:41:29,752][model8_pretrain.py][INFO] Epoch:[0/2](294200/4588595) loss:2.563 lr:0.0000100 epoch_Time:27175.0min: [2024-01-04 00:41:29,752][model8_pretrain.py][INFO] Epoch:[0/2](294200/4588595) loss:2.574 lr:0.0000100 epoch_Time:27175.0min: [2024-01-04 00:41:29,752][model8_pretrain.py][INFO] Epoch:[0/2](294200/4588595) loss:3.197 lr:0.0000100 epoch_Time:27175.0min: [2024-01-04 00:41:29,753][model8_pretrain.py][INFO] Epoch:[0/2](294200/4588595) loss:3.086 lr:0.0000100 epoch_Time:27175.0min: [2024-01-04 00:42:06,706][model8_pretrain.py][INFO] Epoch:[0/2](294300/4588595) loss:2.775 lr:0.0000100 epoch_Time:27174.0min: [2024-01-04 00:42:06,706][model8_pretrain.py][INFO] Epoch:[0/2](294300/4588595) loss:2.884 lr:0.0000100 epoch_Time:27174.0min: [2024-01-04 00:42:06,706][model8_pretrain.py][INFO] Epoch:[0/2](294300/4588595) loss:3.256 lr:0.0000100 epoch_Time:27174.0min: [2024-01-04 00:42:06,706][model8_pretrain.py][INFO] Epoch:[0/2](294300/4588595) loss:3.324 lr:0.0000100 epoch_Time:27174.0min: [2024-01-04 00:42:06,706][model8_pretrain.py][INFO] Epoch:[0/2](294300/4588595) loss:2.501 lr:0.0000100 epoch_Time:27174.0min: [2024-01-04 00:42:06,706][model8_pretrain.py][INFO] Epoch:[0/2](294300/4588595) loss:2.680 lr:0.0000100 epoch_Time:27174.0min: [2024-01-04 00:42:06,706][model8_pretrain.py][INFO] Epoch:[0/2](294300/4588595) loss:3.446 lr:0.0000100 epoch_Time:27174.0min: [2024-01-04 00:42:06,706][model8_pretrain.py][INFO] Epoch:[0/2](294300/4588595) loss:2.282 lr:0.0000100 epoch_Time:27174.0min: [2024-01-04 00:42:43,648][model8_pretrain.py][INFO] Epoch:[0/2](294400/4588595) loss:2.687 lr:0.0000100 epoch_Time:27174.0min: [2024-01-04 00:42:43,648][model8_pretrain.py][INFO] Epoch:[0/2](294400/4588595) loss:2.681 lr:0.0000100 epoch_Time:27174.0min: [2024-01-04 00:42:43,648][model8_pretrain.py][INFO] Epoch:[0/2](294400/4588595) loss:3.219 lr:0.0000100 epoch_Time:27174.0min: [2024-01-04 00:42:43,648][model8_pretrain.py][INFO] Epoch:[0/2](294400/4588595) loss:2.963 lr:0.0000100 
epoch_Time:27174.0min: [2024-01-04 00:42:43,648][model8_pretrain.py][INFO] Epoch:[0/2](294400/4588595) loss:3.069 lr:0.0000100 epoch_Time:27174.0min: [2024-01-04 00:42:43,648][model8_pretrain.py][INFO] Epoch:[0/2](294400/4588595) loss:3.091 lr:0.0000100 epoch_Time:27174.0min: [2024-01-04 00:42:43,648][model8_pretrain.py][INFO] Epoch:[0/2](294400/4588595) loss:3.120 lr:0.0000100 epoch_Time:27174.0min: [2024-01-04 00:42:43,648][model8_pretrain.py][INFO] Epoch:[0/2](294400/4588595) loss:2.738 lr:0.0000100 epoch_Time:27174.0min: [2024-01-04 00:43:20,605][model8_pretrain.py][INFO] Epoch:[0/2](294500/4588595) loss:2.388 lr:0.0000100 epoch_Time:27172.0min: [2024-01-04 00:43:20,605][model8_pretrain.py][INFO] Epoch:[0/2](294500/4588595) loss:2.507 lr:0.0000100 epoch_Time:27172.0min: [2024-01-04 00:43:20,605][model8_pretrain.py][INFO] Epoch:[0/2](294500/4588595) loss:2.852 lr:0.0000100 epoch_Time:27172.0min: [2024-01-04 00:43:20,605][model8_pretrain.py][INFO] Epoch:[0/2](294500/4588595) loss:3.163 lr:0.0000100 epoch_Time:27172.0min: [2024-01-04 00:43:20,605][model8_pretrain.py][INFO] Epoch:[0/2](294500/4588595) loss:2.571 lr:0.0000100 epoch_Time:27172.0min: [2024-01-04 00:43:20,605][model8_pretrain.py][INFO] Epoch:[0/2](294500/4588595) loss:2.957 lr:0.0000100 epoch_Time:27172.0min: [2024-01-04 00:43:20,605][model8_pretrain.py][INFO] Epoch:[0/2](294500/4588595) loss:3.052 lr:0.0000100 epoch_Time:27172.0min: [2024-01-04 00:43:20,605][model8_pretrain.py][INFO] Epoch:[0/2](294500/4588595) loss:2.978 lr:0.0000100 epoch_Time:27172.0min: [2024-01-04 00:43:57,534][model8_pretrain.py][INFO] Epoch:[0/2](294600/4588595) loss:2.591 lr:0.0000100 epoch_Time:27171.0min: [2024-01-04 00:43:57,534][model8_pretrain.py][INFO] Epoch:[0/2](294600/4588595) loss:3.177 lr:0.0000100 epoch_Time:27171.0min: [2024-01-04 00:43:57,534][model8_pretrain.py][INFO] Epoch:[0/2](294600/4588595) loss:2.984 lr:0.0000100 epoch_Time:27171.0min: [2024-01-04 00:43:57,534][model8_pretrain.py][INFO] 
Epoch:[0/2](294600/4588595) loss:2.785 lr:0.0000100 epoch_Time:27171.0min: [2024-01-04 00:43:57,534][model8_pretrain.py][INFO] Epoch:[0/2](294600/4588595) loss:3.112 lr:0.0000100 epoch_Time:27171.0min: [2024-01-04 00:43:57,534][model8_pretrain.py][INFO] Epoch:[0/2](294600/4588595) loss:2.547 lr:0.0000100 epoch_Time:27171.0min: [2024-01-04 00:43:57,534][model8_pretrain.py][INFO] Epoch:[0/2](294600/4588595) loss:2.568 lr:0.0000100 epoch_Time:27171.0min: [2024-01-04 00:43:57,535][model8_pretrain.py][INFO] Epoch:[0/2](294600/4588595) loss:3.458 lr:0.0000100 epoch_Time:27171.0min: [2024-01-04 00:44:45,000][model8_pretrain.py][INFO] Epoch:[0/2](294700/4588595) loss:3.070 lr:0.0000100 epoch_Time:27174.0min: [2024-01-04 00:44:45,000][model8_pretrain.py][INFO] Epoch:[0/2](294700/4588595) loss:2.533 lr:0.0000100 epoch_Time:27174.0min: [2024-01-04 00:44:45,000][model8_pretrain.py][INFO] Epoch:[0/2](294700/4588595) loss:2.783 lr:0.0000100 epoch_Time:27174.0min: [2024-01-04 00:44:45,000][model8_pretrain.py][INFO] Epoch:[0/2](294700/4588595) loss:3.372 lr:0.0000100 epoch_Time:27174.0min: [2024-01-04 00:44:45,000][model8_pretrain.py][INFO] Epoch:[0/2](294700/4588595) loss:2.994 lr:0.0000100 epoch_Time:27174.0min: [2024-01-04 00:44:45,000][model8_pretrain.py][INFO] Epoch:[0/2](294700/4588595) loss:3.047 lr:0.0000100 epoch_Time:27174.0min: [2024-01-04 00:44:45,000][model8_pretrain.py][INFO] Epoch:[0/2](294700/4588595) loss:3.005 lr:0.0000100 epoch_Time:27174.0min: [2024-01-04 00:44:45,000][model8_pretrain.py][INFO] Epoch:[0/2](294700/4588595) loss:3.109 lr:0.0000100 epoch_Time:27174.0min: [2024-01-04 00:45:21,934][model8_pretrain.py][INFO] Epoch:[0/2](294800/4588595) loss:2.956 lr:0.0000100 epoch_Time:27172.0min: [2024-01-04 00:45:21,934][model8_pretrain.py][INFO] Epoch:[0/2](294800/4588595) loss:2.554 lr:0.0000100 epoch_Time:27172.0min: [2024-01-04 00:45:21,934][model8_pretrain.py][INFO] Epoch:[0/2](294800/4588595) loss:2.948 lr:0.0000100 epoch_Time:27172.0min: [2024-01-04 
00:45:21,934][model8_pretrain.py][INFO] Epoch:[0/2](294800/4588595) loss:2.826 lr:0.0000100 epoch_Time:27172.0min: [2024-01-04 00:45:21,934][model8_pretrain.py][INFO] Epoch:[0/2](294800/4588595) loss:2.567 lr:0.0000100 epoch_Time:27172.0min: [2024-01-04 00:45:21,934][model8_pretrain.py][INFO] Epoch:[0/2](294800/4588595) loss:3.037 lr:0.0000100 epoch_Time:27172.0min: [2024-01-04 00:45:21,934][model8_pretrain.py][INFO] Epoch:[0/2](294800/4588595) loss:2.820 lr:0.0000100 epoch_Time:27172.0min: [2024-01-04 00:45:21,934][model8_pretrain.py][INFO] Epoch:[0/2](294800/4588595) loss:3.210 lr:0.0000100 epoch_Time:27172.0min: [2024-01-04 00:45:58,881][model8_pretrain.py][INFO] Epoch:[0/2](294900/4588595) loss:2.839 lr:0.0000100 epoch_Time:27171.0min: [2024-01-04 00:45:58,881][model8_pretrain.py][INFO] Epoch:[0/2](294900/4588595) loss:2.904 lr:0.0000100 epoch_Time:27171.0min: [2024-01-04 00:45:58,881][model8_pretrain.py][INFO] Epoch:[0/2](294900/4588595) loss:3.031 lr:0.0000100 epoch_Time:27171.0min: [2024-01-04 00:45:58,881][model8_pretrain.py][INFO] Epoch:[0/2](294900/4588595) loss:3.043 lr:0.0000100 epoch_Time:27171.0min: [2024-01-04 00:45:58,881][model8_pretrain.py][INFO] Epoch:[0/2](294900/4588595) loss:3.354 lr:0.0000100 epoch_Time:27171.0min: [2024-01-04 00:45:58,881][model8_pretrain.py][INFO] Epoch:[0/2](294900/4588595) loss:2.714 lr:0.0000100 epoch_Time:27171.0min: [2024-01-04 00:45:58,881][model8_pretrain.py][INFO] Epoch:[0/2](294900/4588595) loss:2.217 lr:0.0000100 epoch_Time:27171.0min: [2024-01-04 00:45:58,881][model8_pretrain.py][INFO] Epoch:[0/2](294900/4588595) loss:2.249 lr:0.0000100 epoch_Time:27171.0min: [2024-01-04 00:46:35,826][model8_pretrain.py][INFO] Epoch:[0/2](295000/4588595) loss:2.577 lr:0.0000100 epoch_Time:27171.0min: [2024-01-04 00:46:35,826][model8_pretrain.py][INFO] Epoch:[0/2](295000/4588595) loss:2.276 lr:0.0000100 epoch_Time:27171.0min: [2024-01-04 00:46:35,826][model8_pretrain.py][INFO] Epoch:[0/2](295000/4588595) loss:3.438 lr:0.0000100 
epoch_Time:27171.0min: [2024-01-04 00:46:35,826][model8_pretrain.py][INFO] Epoch:[0/2](295000/4588595) loss:2.897 lr:0.0000100 epoch_Time:27171.0min: [2024-01-04 00:46:35,826][model8_pretrain.py][INFO] Epoch:[0/2](295000/4588595) loss:2.726 lr:0.0000100 epoch_Time:27171.0min: [2024-01-04 00:46:35,826][model8_pretrain.py][INFO] Epoch:[0/2](295000/4588595) loss:2.596 lr:0.0000100 epoch_Time:27171.0min: [2024-01-04 00:46:35,826][model8_pretrain.py][INFO] Epoch:[0/2](295000/4588595) loss:2.928 lr:0.0000100 epoch_Time:27171.0min: [2024-01-04 00:46:35,826][model8_pretrain.py][INFO] Epoch:[0/2](295000/4588595) loss:3.070 lr:0.0000100 epoch_Time:27171.0min: [2024-01-04 00:47:12,775][model8_pretrain.py][INFO] Epoch:[0/2](295100/4588595) loss:2.958 lr:0.0000100 epoch_Time:27169.0min: [2024-01-04 00:47:12,775][model8_pretrain.py][INFO] Epoch:[0/2](295100/4588595) loss:3.169 lr:0.0000100 epoch_Time:27169.0min: [2024-01-04 00:47:12,775][model8_pretrain.py][INFO] Epoch:[0/2](295100/4588595) loss:2.434 lr:0.0000100 epoch_Time:27169.0min: [2024-01-04 00:47:12,775][model8_pretrain.py][INFO] Epoch:[0/2](295100/4588595) loss:2.922 lr:0.0000100 epoch_Time:27169.0min: [2024-01-04 00:47:12,775][model8_pretrain.py][INFO] Epoch:[0/2](295100/4588595) loss:2.872 lr:0.0000100 epoch_Time:27169.0min: [2024-01-04 00:47:12,775][model8_pretrain.py][INFO] Epoch:[0/2](295100/4588595) loss:2.644 lr:0.0000100 epoch_Time:27169.0min: [2024-01-04 00:47:12,775][model8_pretrain.py][INFO] Epoch:[0/2](295100/4588595) loss:2.847 lr:0.0000100 epoch_Time:27169.0min: [2024-01-04 00:47:12,775][model8_pretrain.py][INFO] Epoch:[0/2](295100/4588595) loss:3.160 lr:0.0000100 epoch_Time:27169.0min: [2024-01-04 00:47:49,721][model8_pretrain.py][INFO] Epoch:[0/2](295200/4588595) loss:3.244 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:47:49,721][model8_pretrain.py][INFO] Epoch:[0/2](295200/4588595) loss:3.314 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:47:49,721][model8_pretrain.py][INFO] 
Epoch:[0/2](295200/4588595) loss:3.221 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:47:49,721][model8_pretrain.py][INFO] Epoch:[0/2](295200/4588595) loss:2.863 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:47:49,721][model8_pretrain.py][INFO] Epoch:[0/2](295200/4588595) loss:3.173 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:47:49,721][model8_pretrain.py][INFO] Epoch:[0/2](295200/4588595) loss:3.187 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:47:49,721][model8_pretrain.py][INFO] Epoch:[0/2](295200/4588595) loss:2.848 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:47:49,721][model8_pretrain.py][INFO] Epoch:[0/2](295200/4588595) loss:3.247 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:48:26,659][model8_pretrain.py][INFO] Epoch:[0/2](295300/4588595) loss:3.122 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:48:26,659][model8_pretrain.py][INFO] Epoch:[0/2](295300/4588595) loss:2.510 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:48:26,659][model8_pretrain.py][INFO] Epoch:[0/2](295300/4588595) loss:2.836 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:48:26,659][model8_pretrain.py][INFO] Epoch:[0/2](295300/4588595) loss:3.109 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:48:26,659][model8_pretrain.py][INFO] Epoch:[0/2](295300/4588595) loss:2.940 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:48:26,659][model8_pretrain.py][INFO] Epoch:[0/2](295300/4588595) loss:2.798 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:48:26,659][model8_pretrain.py][INFO] Epoch:[0/2](295300/4588595) loss:3.448 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:48:26,659][model8_pretrain.py][INFO] Epoch:[0/2](295300/4588595) loss:2.861 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:49:03,651][model8_pretrain.py][INFO] Epoch:[0/2](295400/4588595) loss:3.321 lr:0.0000100 epoch_Time:27167.0min: [2024-01-04 00:49:03,651][model8_pretrain.py][INFO] Epoch:[0/2](295400/4588595) loss:2.428 lr:0.0000100 epoch_Time:27167.0min: [2024-01-04 
00:49:03,651][model8_pretrain.py][INFO] Epoch:[0/2](295400/4588595) loss:2.846 lr:0.0000100 epoch_Time:27167.0min: [2024-01-04 00:49:03,652][model8_pretrain.py][INFO] Epoch:[0/2](295400/4588595) loss:2.393 lr:0.0000100 epoch_Time:27167.0min: [2024-01-04 00:49:03,652][model8_pretrain.py][INFO] Epoch:[0/2](295400/4588595) loss:2.670 lr:0.0000100 epoch_Time:27167.0min: [2024-01-04 00:49:03,652][model8_pretrain.py][INFO] Epoch:[0/2](295400/4588595) loss:3.011 lr:0.0000100 epoch_Time:27167.0min: [2024-01-04 00:49:03,652][model8_pretrain.py][INFO] Epoch:[0/2](295400/4588595) loss:2.907 lr:0.0000100 epoch_Time:27167.0min: [2024-01-04 00:49:03,652][model8_pretrain.py][INFO] Epoch:[0/2](295400/4588595) loss:2.923 lr:0.0000100 epoch_Time:27167.0min: [2024-01-04 00:49:51,056][model8_pretrain.py][INFO] Epoch:[0/2](295500/4588595) loss:2.715 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:49:51,056][model8_pretrain.py][INFO] Epoch:[0/2](295500/4588595) loss:2.604 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:49:51,056][model8_pretrain.py][INFO] Epoch:[0/2](295500/4588595) loss:3.268 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:49:51,056][model8_pretrain.py][INFO] Epoch:[0/2](295500/4588595) loss:2.969 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:49:51,056][model8_pretrain.py][INFO] Epoch:[0/2](295500/4588595) loss:2.817 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:49:51,056][model8_pretrain.py][INFO] Epoch:[0/2](295500/4588595) loss:2.384 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:49:51,056][model8_pretrain.py][INFO] Epoch:[0/2](295500/4588595) loss:2.610 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:49:51,056][model8_pretrain.py][INFO] Epoch:[0/2](295500/4588595) loss:2.427 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:50:27,989][model8_pretrain.py][INFO] Epoch:[0/2](295600/4588595) loss:2.431 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:50:27,989][model8_pretrain.py][INFO] Epoch:[0/2](295600/4588595) loss:3.391 lr:0.0000100 
epoch_Time:27168.0min: [2024-01-04 00:50:27,990][model8_pretrain.py][INFO] Epoch:[0/2](295600/4588595) loss:2.949 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:50:27,990][model8_pretrain.py][INFO] Epoch:[0/2](295600/4588595) loss:3.213 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:50:27,990][model8_pretrain.py][INFO] Epoch:[0/2](295600/4588595) loss:2.836 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:50:27,990][model8_pretrain.py][INFO] Epoch:[0/2](295600/4588595) loss:3.096 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:50:27,990][model8_pretrain.py][INFO] Epoch:[0/2](295600/4588595) loss:2.521 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:50:27,990][model8_pretrain.py][INFO] Epoch:[0/2](295600/4588595) loss:2.211 lr:0.0000100 epoch_Time:27168.0min: [2024-01-04 00:51:04,925][model8_pretrain.py][INFO] Epoch:[0/2](295700/4588595) loss:2.775 lr:0.0000100 epoch_Time:27167.0min: [2024-01-04 00:51:04,925][model8_pretrain.py][INFO] Epoch:[0/2](295700/4588595) loss:2.712 lr:0.0000100 epoch_Time:27167.0min: [2024-01-04 00:51:04,925][model8_pretrain.py][INFO] Epoch:[0/2](295700/4588595) loss:2.669 lr:0.0000100 epoch_Time:27167.0min: [2024-01-04 00:51:04,925][model8_pretrain.py][INFO] Epoch:[0/2](295700/4588595) loss:2.941 lr:0.0000100 epoch_Time:27167.0min: [2024-01-04 00:51:04,925][model8_pretrain.py][INFO] Epoch:[0/2](295700/4588595) loss:2.858 lr:0.0000100 epoch_Time:27167.0min: [2024-01-04 00:51:04,925][model8_pretrain.py][INFO] Epoch:[0/2](295700/4588595) loss:2.609 lr:0.0000100 epoch_Time:27167.0min: [2024-01-04 00:51:04,925][model8_pretrain.py][INFO] Epoch:[0/2](295700/4588595) loss:3.079 lr:0.0000100 epoch_Time:27167.0min: [2024-01-04 00:51:04,926][model8_pretrain.py][INFO] Epoch:[0/2](295700/4588595) loss:2.865 lr:0.0000100 epoch_Time:27167.0min: [2024-01-04 00:51:41,862][model8_pretrain.py][INFO] Epoch:[0/2](295800/4588595) loss:2.958 lr:0.0000100 epoch_Time:27166.0min: [2024-01-04 00:51:41,862][model8_pretrain.py][INFO] 
Epoch:[0/2](295800/4588595) loss:2.969 lr:0.0000100 epoch_Time:27166.0min: [2024-01-04 00:51:41,862][model8_pretrain.py][INFO] Epoch:[0/2](295800/4588595) loss:2.738 lr:0.0000100 epoch_Time:27166.0min: [2024-01-04 00:51:41,862][model8_pretrain.py][INFO] Epoch:[0/2](295800/4588595) loss:2.845 lr:0.0000100 epoch_Time:27166.0min: [2024-01-04 00:51:41,862][model8_pretrain.py][INFO] Epoch:[0/2](295800/4588595) loss:3.253 lr:0.0000100 epoch_Time:27166.0min: [2024-01-04 00:51:41,862][model8_pretrain.py][INFO] Epoch:[0/2](295800/4588595) loss:3.197 lr:0.0000100 epoch_Time:27166.0min: [2024-01-04 00:51:41,862][model8_pretrain.py][INFO] Epoch:[0/2](295800/4588595) loss:3.083 lr:0.0000100 epoch_Time:27166.0min: [2024-01-04 00:51:41,862][model8_pretrain.py][INFO] Epoch:[0/2](295800/4588595) loss:2.992 lr:0.0000100 epoch_Time:27166.0min: [2024-01-04 00:52:18,809][model8_pretrain.py][INFO] Epoch:[0/2](295900/4588595) loss:1.926 lr:0.0000100 epoch_Time:27165.0min: [2024-01-04 00:52:18,809][model8_pretrain.py][INFO] Epoch:[0/2](295900/4588595) loss:2.971 lr:0.0000100 epoch_Time:27165.0min: [2024-01-04 00:52:18,809][model8_pretrain.py][INFO] Epoch:[0/2](295900/4588595) loss:2.590 lr:0.0000100 epoch_Time:27165.0min: [2024-01-04 00:52:18,809][model8_pretrain.py][INFO] Epoch:[0/2](295900/4588595) loss:2.839 lr:0.0000100 epoch_Time:27165.0min: [2024-01-04 00:52:18,809][model8_pretrain.py][INFO] Epoch:[0/2](295900/4588595) loss:2.862 lr:0.0000100 epoch_Time:27165.0min: [2024-01-04 00:52:18,809][model8_pretrain.py][INFO] Epoch:[0/2](295900/4588595) loss:2.870 lr:0.0000100 epoch_Time:27165.0min: [2024-01-04 00:52:18,810][model8_pretrain.py][INFO] Epoch:[0/2](295900/4588595) loss:2.841 lr:0.0000100 epoch_Time:27165.0min: [2024-01-04 00:52:18,810][model8_pretrain.py][INFO] Epoch:[0/2](295900/4588595) loss:2.298 lr:0.0000100 epoch_Time:27165.0min: [2024-01-04 00:52:55,750][model8_pretrain.py][INFO] Epoch:[0/2](296000/4588595) loss:2.761 lr:0.0000100 epoch_Time:27164.0min: [2024-01-04 
00:52:55,750][model8_pretrain.py][INFO] Epoch:[0/2](296000/4588595) loss:2.956 lr:0.0000100 epoch_Time:27164.0min: [2024-01-04 00:52:55,750][model8_pretrain.py][INFO] Epoch:[0/2](296000/4588595) loss:3.091 lr:0.0000100 epoch_Time:27164.0min: [2024-01-04 00:52:55,750][model8_pretrain.py][INFO] Epoch:[0/2](296000/4588595) loss:2.305 lr:0.0000100 epoch_Time:27164.0min: [2024-01-04 00:52:55,750][model8_pretrain.py][INFO] Epoch:[0/2](296000/4588595) loss:2.842 lr:0.0000100 epoch_Time:27164.0min: [2024-01-04 00:52:55,750][model8_pretrain.py][INFO] Epoch:[0/2](296000/4588595) loss:3.148 lr:0.0000100 epoch_Time:27164.0min: [2024-01-04 00:52:55,751][model8_pretrain.py][INFO] Epoch:[0/2](296000/4588595) loss:2.811 lr:0.0000100 epoch_Time:27164.0min: [2024-01-04 00:52:55,751][model8_pretrain.py][INFO] Epoch:[0/2](296000/4588595) loss:2.854 lr:0.0000100 epoch_Time:27164.0min: [2024-01-04 00:53:32,674][model8_pretrain.py][INFO] Epoch:[0/2](296100/4588595) loss:3.200 lr:0.0000100 epoch_Time:27164.0min: [2024-01-04 00:53:32,674][model8_pretrain.py][INFO] Epoch:[0/2](296100/4588595) loss:2.997 lr:0.0000100 epoch_Time:27164.0min: [2024-01-04 00:53:32,674][model8_pretrain.py][INFO] Epoch:[0/2](296100/4588595) loss:2.442 lr:0.0000100 epoch_Time:27164.0min: [2024-01-04 00:53:32,674][model8_pretrain.py][INFO] Epoch:[0/2](296100/4588595) loss:2.635 lr:0.0000100 epoch_Time:27164.0min: [2024-01-04 00:53:32,674][model8_pretrain.py][INFO] Epoch:[0/2](296100/4588595) loss:3.135 lr:0.0000100 epoch_Time:27164.0min: [2024-01-04 00:53:32,674][model8_pretrain.py][INFO] Epoch:[0/2](296100/4588595) loss:3.161 lr:0.0000100 epoch_Time:27164.0min: [2024-01-04 00:53:32,674][model8_pretrain.py][INFO] Epoch:[0/2](296100/4588595) loss:3.094 lr:0.0000100 epoch_Time:27163.0min: [2024-01-04 00:53:32,674][model8_pretrain.py][INFO] Epoch:[0/2](296100/4588595) loss:3.371 lr:0.0000100 epoch_Time:27163.0min: [2024-01-04 00:54:09,604][model8_pretrain.py][INFO] Epoch:[0/2](296200/4588595) loss:3.409 lr:0.0000100 
epoch_Time:27162.0min: [2024-01-04 00:54:09,604][model8_pretrain.py][INFO] Epoch:[0/2](296200/4588595) loss:3.478 lr:0.0000100 epoch_Time:27162.0min: [2024-01-04 00:54:09,604][model8_pretrain.py][INFO] Epoch:[0/2](296200/4588595) loss:2.651 lr:0.0000100 epoch_Time:27162.0min: [2024-01-04 00:54:09,604][model8_pretrain.py][INFO] Epoch:[0/2](296200/4588595) loss:3.247 lr:0.0000100 epoch_Time:27162.0min: [2024-01-04 00:54:09,604][model8_pretrain.py][INFO] Epoch:[0/2](296200/4588595) loss:2.590 lr:0.0000100 epoch_Time:27162.0min: [2024-01-04 00:54:09,604][model8_pretrain.py][INFO] Epoch:[0/2](296200/4588595) loss:3.053 lr:0.0000100 epoch_Time:27162.0min: [2024-01-04 00:54:09,604][model8_pretrain.py][INFO] Epoch:[0/2](296200/4588595) loss:2.935 lr:0.0000100 epoch_Time:27162.0min: [2024-01-04 00:54:09,604][model8_pretrain.py][INFO] Epoch:[0/2](296200/4588595) loss:2.865 lr:0.0000100 epoch_Time:27162.0min: [2024-01-04 00:54:56,935][model8_pretrain.py][INFO] Epoch:[0/2](296300/4588595) loss:2.981 lr:0.0000100 epoch_Time:27164.0min: [2024-01-04 00:54:56,935][model8_pretrain.py][INFO] Epoch:[0/2](296300/4588595) loss:3.112 lr:0.0000100 epoch_Time:27164.0min: [2024-01-04 00:54:56,935][model8_pretrain.py][INFO] Epoch:[0/2](296300/4588595) loss:2.915 lr:0.0000100 epoch_Time:27164.0min: [2024-01-04 00:54:56,935][model8_pretrain.py][INFO] Epoch:[0/2](296300/4588595) loss:3.162 lr:0.0000100 epoch_Time:27164.0min: [2024-01-04 00:54:56,935][model8_pretrain.py][INFO] Epoch:[0/2](296300/4588595) loss:2.201 lr:0.0000100 epoch_Time:27164.0min: [2024-01-04 00:54:56,935][model8_pretrain.py][INFO] Epoch:[0/2](296300/4588595) loss:2.769 lr:0.0000100 epoch_Time:27164.0min: [2024-01-04 00:54:56,935][model8_pretrain.py][INFO] Epoch:[0/2](296300/4588595) loss:3.176 lr:0.0000100 epoch_Time:27164.0min: [2024-01-04 00:54:56,935][model8_pretrain.py][INFO] Epoch:[0/2](296300/4588595) loss:2.781 lr:0.0000100 epoch_Time:27164.0min: [2024-01-04 00:55:33,865][model8_pretrain.py][INFO] 
Epoch:[0/2](296400/4588595) loss:3.350 lr:0.0000100 epoch_Time:27163.0min: [2024-01-04 00:55:33,865][model8_pretrain.py][INFO] Epoch:[0/2](296400/4588595) loss:2.533 lr:0.0000100 epoch_Time:27163.0min: [2024-01-04 00:55:33,865][model8_pretrain.py][INFO] Epoch:[0/2](296400/4588595) loss:3.333 lr:0.0000100 epoch_Time:27163.0min: [2024-01-04 00:55:33,865][model8_pretrain.py][INFO] Epoch:[0/2](296400/4588595) loss:2.976 lr:0.0000100 epoch_Time:27163.0min: [2024-01-04 00:55:33,865][model8_pretrain.py][INFO] Epoch:[0/2](296400/4588595) loss:3.092 lr:0.0000100 epoch_Time:27163.0min: [2024-01-04 00:55:33,865][model8_pretrain.py][INFO] Epoch:[0/2](296400/4588595) loss:3.373 lr:0.0000100 epoch_Time:27163.0min: [2024-01-04 00:55:33,865][model8_pretrain.py][INFO] Epoch:[0/2](296400/4588595) loss:2.660 lr:0.0000100 epoch_Time:27163.0min: [2024-01-04 00:55:33,865][model8_pretrain.py][INFO] Epoch:[0/2](296400/4588595) loss:3.137 lr:0.0000100 epoch_Time:27163.0min: [2024-01-04 00:56:10,812][model8_pretrain.py][INFO] Epoch:[0/2](296500/4588595) loss:3.260 lr:0.0000100 epoch_Time:27162.0min: [2024-01-04 00:56:10,812][model8_pretrain.py][INFO] Epoch:[0/2](296500/4588595) loss:2.942 lr:0.0000100 epoch_Time:27162.0min: [2024-01-04 00:56:10,812][model8_pretrain.py][INFO] Epoch:[0/2](296500/4588595) loss:3.259 lr:0.0000100 epoch_Time:27162.0min: [2024-01-04 00:56:10,812][model8_pretrain.py][INFO] Epoch:[0/2](296500/4588595) loss:2.940 lr:0.0000100 epoch_Time:27162.0min: [2024-01-04 00:56:10,812][model8_pretrain.py][INFO] Epoch:[0/2](296500/4588595) loss:3.096 lr:0.0000100 epoch_Time:27162.0min: [2024-01-04 00:56:10,812][model8_pretrain.py][INFO] Epoch:[0/2](296500/4588595) loss:3.108 lr:0.0000100 epoch_Time:27162.0min: [2024-01-04 00:56:10,812][model8_pretrain.py][INFO] Epoch:[0/2](296500/4588595) loss:3.117 lr:0.0000100 epoch_Time:27162.0min: [2024-01-04 00:56:10,812][model8_pretrain.py][INFO] Epoch:[0/2](296500/4588595) loss:2.848 lr:0.0000100 epoch_Time:27162.0min: [2024-01-04 
00:56:47,751][model8_pretrain.py][INFO] Epoch:[0/2](296600/4588595) loss:3.149 lr:0.0000100 epoch_Time:27161.0min: [2024-01-04 00:56:47,751][model8_pretrain.py][INFO] Epoch:[0/2](296600/4588595) loss:3.112 lr:0.0000100 epoch_Time:27161.0min: [2024-01-04 00:56:47,751][model8_pretrain.py][INFO] Epoch:[0/2](296600/4588595) loss:2.972 lr:0.0000100 epoch_Time:27161.0min: [2024-01-04 00:56:47,751][model8_pretrain.py][INFO] Epoch:[0/2](296600/4588595) loss:3.002 lr:0.0000100 epoch_Time:27161.0min: [2024-01-04 00:56:47,751][model8_pretrain.py][INFO] Epoch:[0/2](296600/4588595) loss:2.710 lr:0.0000100 epoch_Time:27161.0min: [2024-01-04 00:56:47,751][model8_pretrain.py][INFO] Epoch:[0/2](296600/4588595) loss:2.603 lr:0.0000100 epoch_Time:27161.0min: [2024-01-04 00:56:47,751][model8_pretrain.py][INFO] Epoch:[0/2](296600/4588595) loss:3.226 lr:0.0000100 epoch_Time:27161.0min: [2024-01-04 00:56:47,751][model8_pretrain.py][INFO] Epoch:[0/2](296600/4588595) loss:3.356 lr:0.0000100 epoch_Time:27161.0min: [2024-01-04 00:57:24,663][model8_pretrain.py][INFO] Epoch:[0/2](296700/4588595) loss:3.126 lr:0.0000100 epoch_Time:27161.0min: [2024-01-04 00:57:24,663][model8_pretrain.py][INFO] Epoch:[0/2](296700/4588595) loss:2.777 lr:0.0000100 epoch_Time:27161.0min: [2024-01-04 00:57:24,663][model8_pretrain.py][INFO] Epoch:[0/2](296700/4588595) loss:2.542 lr:0.0000100 epoch_Time:27161.0min: [2024-01-04 00:57:24,663][model8_pretrain.py][INFO] Epoch:[0/2](296700/4588595) loss:3.054 lr:0.0000100 epoch_Time:27161.0min: [2024-01-04 00:57:24,663][model8_pretrain.py][INFO] Epoch:[0/2](296700/4588595) loss:2.784 lr:0.0000100 epoch_Time:27161.0min: [2024-01-04 00:57:24,663][model8_pretrain.py][INFO] Epoch:[0/2](296700/4588595) loss:2.915 lr:0.0000100 epoch_Time:27161.0min: [2024-01-04 00:57:24,663][model8_pretrain.py][INFO] Epoch:[0/2](296700/4588595) loss:3.034 lr:0.0000100 epoch_Time:27161.0min: [2024-01-04 00:57:24,663][model8_pretrain.py][INFO] Epoch:[0/2](296700/4588595) loss:2.536 lr:0.0000100 
epoch_Time:27161.0min: [2024-01-04 00:58:01,613][model8_pretrain.py][INFO] Epoch:[0/2](296800/4588595) loss:3.199 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 00:58:01,613][model8_pretrain.py][INFO] Epoch:[0/2](296800/4588595) loss:2.529 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 00:58:01,613][model8_pretrain.py][INFO] Epoch:[0/2](296800/4588595) loss:2.803 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 00:58:01,613][model8_pretrain.py][INFO] Epoch:[0/2](296800/4588595) loss:2.550 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 00:58:01,613][model8_pretrain.py][INFO] Epoch:[0/2](296800/4588595) loss:2.505 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 00:58:01,613][model8_pretrain.py][INFO] Epoch:[0/2](296800/4588595) loss:3.095 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 00:58:01,613][model8_pretrain.py][INFO] Epoch:[0/2](296800/4588595) loss:2.706 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 00:58:01,614][model8_pretrain.py][INFO] Epoch:[0/2](296800/4588595) loss:3.489 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 00:58:38,557][model8_pretrain.py][INFO] Epoch:[0/2](296900/4588595) loss:2.964 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 00:58:38,557][model8_pretrain.py][INFO] Epoch:[0/2](296900/4588595) loss:3.326 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 00:58:38,557][model8_pretrain.py][INFO] Epoch:[0/2](296900/4588595) loss:2.811 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 00:58:38,557][model8_pretrain.py][INFO] Epoch:[0/2](296900/4588595) loss:3.016 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 00:58:38,557][model8_pretrain.py][INFO] Epoch:[0/2](296900/4588595) loss:2.923 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 00:58:38,557][model8_pretrain.py][INFO] Epoch:[0/2](296900/4588595) loss:2.910 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 00:58:38,557][model8_pretrain.py][INFO] Epoch:[0/2](296900/4588595) loss:2.586 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 00:58:38,557][model8_pretrain.py][INFO] 
Epoch:[0/2](296900/4588595) loss:3.536 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 00:59:15,494][model8_pretrain.py][INFO] Epoch:[0/2](297000/4588595) loss:2.573 lr:0.0000100 epoch_Time:27158.0min: [2024-01-04 00:59:15,494][model8_pretrain.py][INFO] Epoch:[0/2](297000/4588595) loss:2.567 lr:0.0000100 epoch_Time:27158.0min: [2024-01-04 00:59:15,494][model8_pretrain.py][INFO] Epoch:[0/2](297000/4588595) loss:3.110 lr:0.0000100 epoch_Time:27158.0min: [2024-01-04 00:59:15,494][model8_pretrain.py][INFO] Epoch:[0/2](297000/4588595) loss:3.039 lr:0.0000100 epoch_Time:27158.0min: [2024-01-04 00:59:15,494][model8_pretrain.py][INFO] Epoch:[0/2](297000/4588595) loss:2.613 lr:0.0000100 epoch_Time:27158.0min: [2024-01-04 00:59:15,494][model8_pretrain.py][INFO] Epoch:[0/2](297000/4588595) loss:3.455 lr:0.0000100 epoch_Time:27158.0min: [2024-01-04 00:59:15,494][model8_pretrain.py][INFO] Epoch:[0/2](297000/4588595) loss:2.119 lr:0.0000100 epoch_Time:27158.0min: [2024-01-04 00:59:15,494][model8_pretrain.py][INFO] Epoch:[0/2](297000/4588595) loss:2.967 lr:0.0000100 epoch_Time:27158.0min: [2024-01-04 01:00:02,882][model8_pretrain.py][INFO] Epoch:[0/2](297100/4588595) loss:3.454 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 01:00:02,882][model8_pretrain.py][INFO] Epoch:[0/2](297100/4588595) loss:2.541 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 01:00:02,882][model8_pretrain.py][INFO] Epoch:[0/2](297100/4588595) loss:2.912 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 01:00:02,882][model8_pretrain.py][INFO] Epoch:[0/2](297100/4588595) loss:2.247 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 01:00:02,882][model8_pretrain.py][INFO] Epoch:[0/2](297100/4588595) loss:2.985 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 01:00:02,882][model8_pretrain.py][INFO] Epoch:[0/2](297100/4588595) loss:3.050 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 01:00:02,882][model8_pretrain.py][INFO] Epoch:[0/2](297100/4588595) loss:3.288 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 
01:00:02,882][model8_pretrain.py][INFO] Epoch:[0/2](297100/4588595) loss:2.335 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 01:00:39,831][model8_pretrain.py][INFO] Epoch:[0/2](297200/4588595) loss:2.933 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 01:00:39,831][model8_pretrain.py][INFO] Epoch:[0/2](297200/4588595) loss:2.897 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 01:00:39,831][model8_pretrain.py][INFO] Epoch:[0/2](297200/4588595) loss:2.861 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 01:00:39,831][model8_pretrain.py][INFO] Epoch:[0/2](297200/4588595) loss:2.996 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 01:00:39,831][model8_pretrain.py][INFO] Epoch:[0/2](297200/4588595) loss:3.070 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 01:00:39,831][model8_pretrain.py][INFO] Epoch:[0/2](297200/4588595) loss:2.493 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 01:00:39,831][model8_pretrain.py][INFO] Epoch:[0/2](297200/4588595) loss:2.826 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 01:00:39,831][model8_pretrain.py][INFO] Epoch:[0/2](297200/4588595) loss:2.558 lr:0.0000100 epoch_Time:27159.0min: [2024-01-04 01:01:16,782][model8_pretrain.py][INFO] Epoch:[0/2](297300/4588595) loss:2.712 lr:0.0000100 epoch_Time:27158.0min: [2024-01-04 01:01:16,782][model8_pretrain.py][INFO] Epoch:[0/2](297300/4588595) loss:3.178 lr:0.0000100 epoch_Time:27158.0min: [2024-01-04 01:01:16,782][model8_pretrain.py][INFO] Epoch:[0/2](297300/4588595) loss:2.825 lr:0.0000100 epoch_Time:27158.0min: [2024-01-04 01:01:16,782][model8_pretrain.py][INFO] Epoch:[0/2](297300/4588595) loss:2.519 lr:0.0000100 epoch_Time:27158.0min: [2024-01-04 01:01:16,782][model8_pretrain.py][INFO] Epoch:[0/2](297300/4588595) loss:2.787 lr:0.0000100 epoch_Time:27158.0min: [2024-01-04 01:01:16,782][model8_pretrain.py][INFO] Epoch:[0/2](297300/4588595) loss:2.666 lr:0.0000100 epoch_Time:27158.0min: [2024-01-04 01:01:16,782][model8_pretrain.py][INFO] Epoch:[0/2](297300/4588595) loss:3.245 lr:0.0000100 
epoch_Time:27158.0min: [2024-01-04 01:01:16,782][model8_pretrain.py][INFO] Epoch:[0/2](297300/4588595) loss:3.061 lr:0.0000100 epoch_Time:27158.0min: [2024-01-04 01:01:53,729][model8_pretrain.py][INFO] Epoch:[0/2](297400/4588595) loss:2.781 lr:0.0000100 epoch_Time:27156.0min: [2024-01-04 01:01:53,730][model8_pretrain.py][INFO] Epoch:[0/2](297400/4588595) loss:2.500 lr:0.0000100 epoch_Time:27156.0min: [2024-01-04 01:01:53,730][model8_pretrain.py][INFO] Epoch:[0/2](297400/4588595) loss:3.086 lr:0.0000100 epoch_Time:27156.0min: [2024-01-04 01:01:53,730][model8_pretrain.py][INFO] Epoch:[0/2](297400/4588595) loss:3.202 lr:0.0000100 epoch_Time:27156.0min: [2024-01-04 01:01:53,730][model8_pretrain.py][INFO] Epoch:[0/2](297400/4588595) loss:3.304 lr:0.0000100 epoch_Time:27156.0min: [2024-01-04 01:01:53,730][model8_pretrain.py][INFO] Epoch:[0/2](297400/4588595) loss:2.621 lr:0.0000100 epoch_Time:27156.0min: [2024-01-04 01:01:53,730][model8_pretrain.py][INFO] Epoch:[0/2](297400/4588595) loss:3.166 lr:0.0000100 epoch_Time:27156.0min: [2024-01-04 01:01:53,730][model8_pretrain.py][INFO] Epoch:[0/2](297400/4588595) loss:2.502 lr:0.0000100 epoch_Time:27156.0min: [2024-01-04 01:02:30,686][model8_pretrain.py][INFO] Epoch:[0/2](297500/4588595) loss:2.939 lr:0.0000100 epoch_Time:27156.0min: [2024-01-04 01:02:30,686][model8_pretrain.py][INFO] Epoch:[0/2](297500/4588595) loss:3.148 lr:0.0000100 epoch_Time:27156.0min: [2024-01-04 01:02:30,686][model8_pretrain.py][INFO] Epoch:[0/2](297500/4588595) loss:2.916 lr:0.0000100 epoch_Time:27156.0min: [2024-01-04 01:02:30,686][model8_pretrain.py][INFO] Epoch:[0/2](297500/4588595) loss:3.494 lr:0.0000100 epoch_Time:27156.0min: [2024-01-04 01:02:30,686][model8_pretrain.py][INFO] Epoch:[0/2](297500/4588595) loss:2.651 lr:0.0000100 epoch_Time:27156.0min: [2024-01-04 01:02:30,686][model8_pretrain.py][INFO] Epoch:[0/2](297500/4588595) loss:2.397 lr:0.0000100 epoch_Time:27156.0min: [2024-01-04 01:02:30,686][model8_pretrain.py][INFO] 
Epoch:[0/2](297500/4588595) loss:3.188 lr:0.0000100 epoch_Time:27156.0min: [2024-01-04 01:02:30,686][model8_pretrain.py][INFO] Epoch:[0/2](297500/4588595) loss:3.074 lr:0.0000100 epoch_Time:27156.0min: [2024-01-04 01:03:07,640][model8_pretrain.py][INFO] Epoch:[0/2](297600/4588595) loss:2.914 lr:0.0000100 epoch_Time:27155.0min: [2024-01-04 01:03:07,640][model8_pretrain.py][INFO] Epoch:[0/2](297600/4588595) loss:2.890 lr:0.0000100 epoch_Time:27155.0min: [2024-01-04 01:03:07,640][model8_pretrain.py][INFO] Epoch:[0/2](297600/4588595) loss:3.431 lr:0.0000100 epoch_Time:27155.0min: [2024-01-04 01:03:07,640][model8_pretrain.py][INFO] Epoch:[0/2](297600/4588595) loss:2.873 lr:0.0000100 epoch_Time:27155.0min: [2024-01-04 01:03:07,640][model8_pretrain.py][INFO] Epoch:[0/2](297600/4588595) loss:3.330 lr:0.0000100 epoch_Time:27155.0min: [2024-01-04 01:03:07,640][model8_pretrain.py][INFO] Epoch:[0/2](297600/4588595) loss:2.592 lr:0.0000100 epoch_Time:27155.0min: [2024-01-04 01:03:07,640][model8_pretrain.py][INFO] Epoch:[0/2](297600/4588595) loss:2.734 lr:0.0000100 epoch_Time:27155.0min: [2024-01-04 01:03:07,640][model8_pretrain.py][INFO] Epoch:[0/2](297600/4588595) loss:2.972 lr:0.0000100 epoch_Time:27155.0min: [2024-01-04 01:03:44,599][model8_pretrain.py][INFO] Epoch:[0/2](297700/4588595) loss:3.022 lr:0.0000100 epoch_Time:27155.0min: [2024-01-04 01:03:44,599][model8_pretrain.py][INFO] Epoch:[0/2](297700/4588595) loss:2.684 lr:0.0000100 epoch_Time:27155.0min: [2024-01-04 01:03:44,599][model8_pretrain.py][INFO] Epoch:[0/2](297700/4588595) loss:2.781 lr:0.0000100 epoch_Time:27155.0min: [2024-01-04 01:03:44,599][model8_pretrain.py][INFO] Epoch:[0/2](297700/4588595) loss:3.120 lr:0.0000100 epoch_Time:27155.0min: [2024-01-04 01:03:44,599][model8_pretrain.py][INFO] Epoch:[0/2](297700/4588595) loss:2.575 lr:0.0000100 epoch_Time:27155.0min: [2024-01-04 01:03:44,599][model8_pretrain.py][INFO] Epoch:[0/2](297700/4588595) loss:3.032 lr:0.0000100 epoch_Time:27155.0min: [2024-01-04 
01:03:44,599][model8_pretrain.py][INFO] Epoch:[0/2](297700/4588595) loss:2.871 lr:0.0000100 epoch_Time:27155.0min: [2024-01-04 01:03:44,599][model8_pretrain.py][INFO] Epoch:[0/2](297700/4588595) loss:2.850 lr:0.0000100 epoch_Time:27155.0min: [2024-01-04 01:04:21,551][model8_pretrain.py][INFO] Epoch:[0/2](297800/4588595) loss:2.746 lr:0.0000100 epoch_Time:27153.0min: [2024-01-04 01:04:21,551][model8_pretrain.py][INFO] Epoch:[0/2](297800/4588595) loss:3.107 lr:0.0000100 epoch_Time:27153.0min: [2024-01-04 01:04:21,551][model8_pretrain.py][INFO] Epoch:[0/2](297800/4588595) loss:2.732 lr:0.0000100 epoch_Time:27153.0min: [2024-01-04 01:04:21,551][model8_pretrain.py][INFO] Epoch:[0/2](297800/4588595) loss:2.927 lr:0.0000100 epoch_Time:27153.0min: [2024-01-04 01:04:21,551][model8_pretrain.py][INFO] Epoch:[0/2](297800/4588595) loss:3.155 lr:0.0000100 epoch_Time:27153.0min: [2024-01-04 01:04:21,551][model8_pretrain.py][INFO] Epoch:[0/2](297800/4588595) loss:2.284 lr:0.0000100 epoch_Time:27153.0min: [2024-01-04 01:04:21,551][model8_pretrain.py][INFO] Epoch:[0/2](297800/4588595) loss:2.991 lr:0.0000100 epoch_Time:27153.0min: [2024-01-04 01:04:21,551][model8_pretrain.py][INFO] Epoch:[0/2](297800/4588595) loss:2.478 lr:0.0000100 epoch_Time:27153.0min: [2024-01-04 01:05:08,799][model8_pretrain.py][INFO] Epoch:[0/2](297900/4588595) loss:3.119 lr:0.0000100 epoch_Time:27155.0min: [2024-01-04 01:05:08,799][model8_pretrain.py][INFO] Epoch:[0/2](297900/4588595) loss:2.461 lr:0.0000100 epoch_Time:27155.0min: [2024-01-04 01:05:08,799][model8_pretrain.py][INFO] Epoch:[0/2](297900/4588595) loss:2.741 lr:0.0000100 epoch_Time:27155.0min: [2024-01-04 01:05:08,799][model8_pretrain.py][INFO] Epoch:[0/2](297900/4588595) loss:3.168 lr:0.0000100 epoch_Time:27155.0min: [2024-01-04 01:05:08,799][model8_pretrain.py][INFO] Epoch:[0/2](297900/4588595) loss:3.080 lr:0.0000100 epoch_Time:27155.0min: [2024-01-04 01:05:08,799][model8_pretrain.py][INFO] Epoch:[0/2](297900/4588595) loss:3.257 lr:0.0000100 
epoch_Time:27155.0min: [2024-01-04 01:05:08,799][model8_pretrain.py][INFO] Epoch:[0/2](297900/4588595) loss:3.277 lr:0.0000100 epoch_Time:27155.0min: [2024-01-04 01:05:08,799][model8_pretrain.py][INFO] Epoch:[0/2](297900/4588595) loss:2.664 lr:0.0000100 epoch_Time:27155.0min: [2024-01-04 01:05:45,721][model8_pretrain.py][INFO] Epoch:[0/2](298000/4588595) loss:3.130 lr:0.0000100 epoch_Time:27154.0min: [2024-01-04 01:05:45,721][model8_pretrain.py][INFO] Epoch:[0/2](298000/4588595) loss:3.387 lr:0.0000100 epoch_Time:27154.0min: [2024-01-04 01:05:45,721][model8_pretrain.py][INFO] Epoch:[0/2](298000/4588595) loss:3.108 lr:0.0000100 epoch_Time:27154.0min: [2024-01-04 01:05:45,721][model8_pretrain.py][INFO] Epoch:[0/2](298000/4588595) loss:3.191 lr:0.0000100 epoch_Time:27154.0min: [2024-01-04 01:05:45,721][model8_pretrain.py][INFO] Epoch:[0/2](298000/4588595) loss:2.952 lr:0.0000100 epoch_Time:27154.0min: [2024-01-04 01:05:45,721][model8_pretrain.py][INFO] Epoch:[0/2](298000/4588595) loss:3.208 lr:0.0000100 epoch_Time:27154.0min: [2024-01-04 01:05:45,721][model8_pretrain.py][INFO] Epoch:[0/2](298000/4588595) loss:2.827 lr:0.0000100 epoch_Time:27154.0min: [2024-01-04 01:05:45,721][model8_pretrain.py][INFO] Epoch:[0/2](298000/4588595) loss:3.029 lr:0.0000100 epoch_Time:27154.0min: [2024-01-04 01:06:22,657][model8_pretrain.py][INFO] Epoch:[0/2](298100/4588595) loss:2.183 lr:0.0000100 epoch_Time:27153.0min: [2024-01-04 01:06:22,657][model8_pretrain.py][INFO] Epoch:[0/2](298100/4588595) loss:3.337 lr:0.0000100 epoch_Time:27153.0min: [2024-01-04 01:06:22,657][model8_pretrain.py][INFO] Epoch:[0/2](298100/4588595) loss:3.012 lr:0.0000100 epoch_Time:27153.0min: [2024-01-04 01:06:22,657][model8_pretrain.py][INFO] Epoch:[0/2](298100/4588595) loss:3.274 lr:0.0000100 epoch_Time:27153.0min: [2024-01-04 01:06:22,657][model8_pretrain.py][INFO] Epoch:[0/2](298100/4588595) loss:3.015 lr:0.0000100 epoch_Time:27153.0min: [2024-01-04 01:06:22,657][model8_pretrain.py][INFO] 
Epoch:[0/2](298100/4588595) loss:3.251 lr:0.0000100 epoch_Time:27153.0min: [2024-01-04 01:06:22,657][model8_pretrain.py][INFO] Epoch:[0/2](298100/4588595) loss:3.209 lr:0.0000100 epoch_Time:27153.0min: [2024-01-04 01:06:22,657][model8_pretrain.py][INFO] Epoch:[0/2](298100/4588595) loss:2.831 lr:0.0000100 epoch_Time:27153.0min: [2024-01-04 01:06:59,601][model8_pretrain.py][INFO] Epoch:[0/2](298200/4588595) loss:2.586 lr:0.0000100 epoch_Time:27152.0min: [2024-01-04 01:06:59,601][model8_pretrain.py][INFO] Epoch:[0/2](298200/4588595) loss:2.528 lr:0.0000100 epoch_Time:27152.0min: [2024-01-04 01:06:59,601][model8_pretrain.py][INFO] Epoch:[0/2](298200/4588595) loss:3.067 lr:0.0000100 epoch_Time:27152.0min: [2024-01-04 01:06:59,601][model8_pretrain.py][INFO] Epoch:[0/2](298200/4588595) loss:2.138 lr:0.0000100 epoch_Time:27152.0min: [2024-01-04 01:06:59,601][model8_pretrain.py][INFO] Epoch:[0/2](298200/4588595) loss:3.056 lr:0.0000100 epoch_Time:27152.0min: [2024-01-04 01:06:59,601][model8_pretrain.py][INFO] Epoch:[0/2](298200/4588595) loss:2.719 lr:0.0000100 epoch_Time:27152.0min: [2024-01-04 01:06:59,601][model8_pretrain.py][INFO] Epoch:[0/2](298200/4588595) loss:2.665 lr:0.0000100 epoch_Time:27152.0min: [2024-01-04 01:06:59,601][model8_pretrain.py][INFO] Epoch:[0/2](298200/4588595) loss:2.482 lr:0.0000100 epoch_Time:27152.0min: [2024-01-04 01:07:36,534][model8_pretrain.py][INFO] Epoch:[0/2](298300/4588595) loss:2.598 lr:0.0000100 epoch_Time:27152.0min: [2024-01-04 01:07:36,534][model8_pretrain.py][INFO] Epoch:[0/2](298300/4588595) loss:2.849 lr:0.0000100 epoch_Time:27152.0min: [2024-01-04 01:07:36,534][model8_pretrain.py][INFO] Epoch:[0/2](298300/4588595) loss:2.789 lr:0.0000100 epoch_Time:27152.0min: [2024-01-04 01:07:36,534][model8_pretrain.py][INFO] Epoch:[0/2](298300/4588595) loss:2.138 lr:0.0000100 epoch_Time:27152.0min: [2024-01-04 01:07:36,534][model8_pretrain.py][INFO] Epoch:[0/2](298300/4588595) loss:2.588 lr:0.0000100 epoch_Time:27152.0min: [2024-01-04 
01:07:36,534][model8_pretrain.py][INFO] Epoch:[0/2](298300/4588595) loss:2.837 lr:0.0000100 epoch_Time:27152.0min: [2024-01-04 01:07:36,535][model8_pretrain.py][INFO] Epoch:[0/2](298300/4588595) loss:3.029 lr:0.0000100 epoch_Time:27152.0min: [2024-01-04 01:07:36,535][model8_pretrain.py][INFO] Epoch:[0/2](298300/4588595) loss:2.637 lr:0.0000100 epoch_Time:27152.0min: [2024-01-04 01:08:13,473][model8_pretrain.py][INFO] Epoch:[0/2](298400/4588595) loss:2.854 lr:0.0000100 epoch_Time:27150.0min: [2024-01-04 01:08:13,473][model8_pretrain.py][INFO] Epoch:[0/2](298400/4588595) loss:2.882 lr:0.0000100 epoch_Time:27150.0min: [2024-01-04 01:08:13,473][model8_pretrain.py][INFO] Epoch:[0/2](298400/4588595) loss:2.502 lr:0.0000100 epoch_Time:27150.0min: [2024-01-04 01:08:13,473][model8_pretrain.py][INFO] Epoch:[0/2](298400/4588595) loss:2.386 lr:0.0000100 epoch_Time:27150.0min: [2024-01-04 01:08:13,473][model8_pretrain.py][INFO] Epoch:[0/2](298400/4588595) loss:2.193 lr:0.0000100 epoch_Time:27150.0min: [2024-01-04 01:08:13,473][model8_pretrain.py][INFO] Epoch:[0/2](298400/4588595) loss:2.941 lr:0.0000100 epoch_Time:27150.0min: [2024-01-04 01:08:13,473][model8_pretrain.py][INFO] Epoch:[0/2](298400/4588595) loss:2.853 lr:0.0000100 epoch_Time:27150.0min: [2024-01-04 01:08:13,473][model8_pretrain.py][INFO] Epoch:[0/2](298400/4588595) loss:3.009 lr:0.0000100 epoch_Time:27150.0min: [2024-01-04 01:08:50,408][model8_pretrain.py][INFO] Epoch:[0/2](298500/4588595) loss:2.823 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:08:50,408][model8_pretrain.py][INFO] Epoch:[0/2](298500/4588595) loss:2.594 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:08:50,408][model8_pretrain.py][INFO] Epoch:[0/2](298500/4588595) loss:2.967 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:08:50,408][model8_pretrain.py][INFO] Epoch:[0/2](298500/4588595) loss:3.026 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:08:50,408][model8_pretrain.py][INFO] Epoch:[0/2](298500/4588595) loss:2.979 lr:0.0000100 
epoch_Time:27149.0min: [2024-01-04 01:08:50,408][model8_pretrain.py][INFO] Epoch:[0/2](298500/4588595) loss:2.186 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:08:50,408][model8_pretrain.py][INFO] Epoch:[0/2](298500/4588595) loss:2.243 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:08:50,408][model8_pretrain.py][INFO] Epoch:[0/2](298500/4588595) loss:3.505 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:09:27,346][model8_pretrain.py][INFO] Epoch:[0/2](298600/4588595) loss:3.316 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:09:27,346][model8_pretrain.py][INFO] Epoch:[0/2](298600/4588595) loss:2.729 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:09:27,346][model8_pretrain.py][INFO] Epoch:[0/2](298600/4588595) loss:2.603 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:09:27,346][model8_pretrain.py][INFO] Epoch:[0/2](298600/4588595) loss:2.851 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:09:27,346][model8_pretrain.py][INFO] Epoch:[0/2](298600/4588595) loss:3.071 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:09:27,346][model8_pretrain.py][INFO] Epoch:[0/2](298600/4588595) loss:3.138 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:09:27,347][model8_pretrain.py][INFO] Epoch:[0/2](298600/4588595) loss:3.041 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:09:27,347][model8_pretrain.py][INFO] Epoch:[0/2](298600/4588595) loss:3.122 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:10:14,453][model8_pretrain.py][INFO] Epoch:[0/2](298700/4588595) loss:3.144 lr:0.0000100 epoch_Time:27150.0min: [2024-01-04 01:10:14,453][model8_pretrain.py][INFO] Epoch:[0/2](298700/4588595) loss:3.017 lr:0.0000100 epoch_Time:27150.0min: [2024-01-04 01:10:14,453][model8_pretrain.py][INFO] Epoch:[0/2](298700/4588595) loss:3.138 lr:0.0000100 epoch_Time:27150.0min: [2024-01-04 01:10:14,453][model8_pretrain.py][INFO] Epoch:[0/2](298700/4588595) loss:3.001 lr:0.0000100 epoch_Time:27150.0min: [2024-01-04 01:10:14,453][model8_pretrain.py][INFO] 
Epoch:[0/2](298700/4588595) loss:2.290 lr:0.0000100 epoch_Time:27150.0min: [2024-01-04 01:10:14,453][model8_pretrain.py][INFO] Epoch:[0/2](298700/4588595) loss:3.251 lr:0.0000100 epoch_Time:27150.0min: [2024-01-04 01:10:14,453][model8_pretrain.py][INFO] Epoch:[0/2](298700/4588595) loss:3.163 lr:0.0000100 epoch_Time:27150.0min: [2024-01-04 01:10:14,453][model8_pretrain.py][INFO] Epoch:[0/2](298700/4588595) loss:2.713 lr:0.0000100 epoch_Time:27150.0min: [2024-01-04 01:10:51,375][model8_pretrain.py][INFO] Epoch:[0/2](298800/4588595) loss:2.992 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:10:51,375][model8_pretrain.py][INFO] Epoch:[0/2](298800/4588595) loss:2.883 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:10:51,375][model8_pretrain.py][INFO] Epoch:[0/2](298800/4588595) loss:2.979 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:10:51,375][model8_pretrain.py][INFO] Epoch:[0/2](298800/4588595) loss:3.046 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:10:51,376][model8_pretrain.py][INFO] Epoch:[0/2](298800/4588595) loss:3.304 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:10:51,375][model8_pretrain.py][INFO] Epoch:[0/2](298800/4588595) loss:3.008 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:10:51,376][model8_pretrain.py][INFO] Epoch:[0/2](298800/4588595) loss:3.063 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:10:51,376][model8_pretrain.py][INFO] Epoch:[0/2](298800/4588595) loss:3.104 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:11:28,308][model8_pretrain.py][INFO] Epoch:[0/2](298900/4588595) loss:2.984 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:11:28,308][model8_pretrain.py][INFO] Epoch:[0/2](298900/4588595) loss:3.039 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:11:28,309][model8_pretrain.py][INFO] Epoch:[0/2](298900/4588595) loss:2.681 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:11:28,309][model8_pretrain.py][INFO] Epoch:[0/2](298900/4588595) loss:2.982 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 
01:11:28,309][model8_pretrain.py][INFO] Epoch:[0/2](298900/4588595) loss:3.149 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:11:28,309][model8_pretrain.py][INFO] Epoch:[0/2](298900/4588595) loss:3.336 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:11:28,309][model8_pretrain.py][INFO] Epoch:[0/2](298900/4588595) loss:3.159 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:11:28,309][model8_pretrain.py][INFO] Epoch:[0/2](298900/4588595) loss:3.243 lr:0.0000100 epoch_Time:27149.0min: [2024-01-04 01:12:05,239][model8_pretrain.py][INFO] Epoch:[0/2](299000/4588595) loss:2.733 lr:0.0000100 epoch_Time:27147.0min: [2024-01-04 01:12:05,239][model8_pretrain.py][INFO] Epoch:[0/2](299000/4588595) loss:2.577 lr:0.0000100 epoch_Time:27147.0min: [2024-01-04 01:12:05,239][model8_pretrain.py][INFO] Epoch:[0/2](299000/4588595) loss:2.468 lr:0.0000100 epoch_Time:27147.0min: [2024-01-04 01:12:05,239][model8_pretrain.py][INFO] Epoch:[0/2](299000/4588595) loss:2.750 lr:0.0000100 epoch_Time:27147.0min: [2024-01-04 01:12:05,239][model8_pretrain.py][INFO] Epoch:[0/2](299000/4588595) loss:2.348 lr:0.0000100 epoch_Time:27147.0min: [2024-01-04 01:12:05,239][model8_pretrain.py][INFO] Epoch:[0/2](299000/4588595) loss:2.879 lr:0.0000100 epoch_Time:27147.0min: [2024-01-04 01:12:05,239][model8_pretrain.py][INFO] Epoch:[0/2](299000/4588595) loss:3.102 lr:0.0000100 epoch_Time:27147.0min: [2024-01-04 01:12:05,239][model8_pretrain.py][INFO] Epoch:[0/2](299000/4588595) loss:2.621 lr:0.0000100 epoch_Time:27147.0min: [2024-01-04 01:12:42,165][model8_pretrain.py][INFO] Epoch:[0/2](299100/4588595) loss:2.200 lr:0.0000100 epoch_Time:27147.0min: [2024-01-04 01:12:42,165][model8_pretrain.py][INFO] Epoch:[0/2](299100/4588595) loss:2.783 lr:0.0000100 epoch_Time:27147.0min: [2024-01-04 01:12:42,165][model8_pretrain.py][INFO] Epoch:[0/2](299100/4588595) loss:2.823 lr:0.0000100 epoch_Time:27147.0min: [2024-01-04 01:12:42,165][model8_pretrain.py][INFO] Epoch:[0/2](299100/4588595) loss:3.053 lr:0.0000100 
epoch_Time:27147.0min: [2024-01-04 01:12:42,165][model8_pretrain.py][INFO] Epoch:[0/2](299100/4588595) loss:3.249 lr:0.0000100 epoch_Time:27147.0min: [2024-01-04 01:12:42,165][model8_pretrain.py][INFO] Epoch:[0/2](299100/4588595) loss:2.396 lr:0.0000100 epoch_Time:27147.0min: [2024-01-04 01:12:42,166][model8_pretrain.py][INFO] Epoch:[0/2](299100/4588595) loss:3.198 lr:0.0000100 epoch_Time:27147.0min: [2024-01-04 01:12:42,166][model8_pretrain.py][INFO] Epoch:[0/2](299100/4588595) loss:2.572 lr:0.0000100 epoch_Time:27147.0min: [2024-01-04 01:13:19,101][model8_pretrain.py][INFO] Epoch:[0/2](299200/4588595) loss:2.879 lr:0.0000100 epoch_Time:27146.0min: [2024-01-04 01:13:19,101][model8_pretrain.py][INFO] Epoch:[0/2](299200/4588595) loss:3.046 lr:0.0000100 epoch_Time:27146.0min: [2024-01-04 01:13:19,101][model8_pretrain.py][INFO] Epoch:[0/2](299200/4588595) loss:2.976 lr:0.0000100 epoch_Time:27146.0min: [2024-01-04 01:13:19,101][model8_pretrain.py][INFO] Epoch:[0/2](299200/4588595) loss:2.723 lr:0.0000100 epoch_Time:27146.0min: [2024-01-04 01:13:19,101][model8_pretrain.py][INFO] Epoch:[0/2](299200/4588595) loss:2.782 lr:0.0000100 epoch_Time:27146.0min: [2024-01-04 01:13:19,101][model8_pretrain.py][INFO] Epoch:[0/2](299200/4588595) loss:2.650 lr:0.0000100 epoch_Time:27146.0min: [2024-01-04 01:13:19,101][model8_pretrain.py][INFO] Epoch:[0/2](299200/4588595) loss:2.502 lr:0.0000100 epoch_Time:27146.0min: [2024-01-04 01:13:19,102][model8_pretrain.py][INFO] Epoch:[0/2](299200/4588595) loss:3.338 lr:0.0000100 epoch_Time:27146.0min: [2024-01-04 01:13:56,034][model8_pretrain.py][INFO] Epoch:[0/2](299300/4588595) loss:3.027 lr:0.0000100 epoch_Time:27145.0min: [2024-01-04 01:13:56,034][model8_pretrain.py][INFO] Epoch:[0/2](299300/4588595) loss:2.745 lr:0.0000100 epoch_Time:27145.0min: [2024-01-04 01:13:56,034][model8_pretrain.py][INFO] Epoch:[0/2](299300/4588595) loss:2.358 lr:0.0000100 epoch_Time:27145.0min: [2024-01-04 01:13:56,034][model8_pretrain.py][INFO] 
Epoch:[0/2](299300/4588595) loss:3.095 lr:0.0000100 epoch_Time:27145.0min: [2024-01-04 01:13:56,034][model8_pretrain.py][INFO] Epoch:[0/2](299300/4588595) loss:2.841 lr:0.0000100 epoch_Time:27145.0min: [2024-01-04 01:13:56,034][model8_pretrain.py][INFO] Epoch:[0/2](299300/4588595) loss:2.797 lr:0.0000100 epoch_Time:27145.0min: [2024-01-04 01:13:56,034][model8_pretrain.py][INFO] Epoch:[0/2](299300/4588595) loss:2.655 lr:0.0000100 epoch_Time:27145.0min: [2024-01-04 01:13:56,034][model8_pretrain.py][INFO] Epoch:[0/2](299300/4588595) loss:3.046 lr:0.0000100 epoch_Time:27145.0min: [2024-01-04 01:14:32,972][model8_pretrain.py][INFO] Epoch:[0/2](299400/4588595) loss:2.740 lr:0.0000100 epoch_Time:27144.0min: [2024-01-04 01:14:32,972][model8_pretrain.py][INFO] Epoch:[0/2](299400/4588595) loss:2.887 lr:0.0000100 epoch_Time:27144.0min: [2024-01-04 01:14:32,972][model8_pretrain.py][INFO] Epoch:[0/2](299400/4588595) loss:2.339 lr:0.0000100 epoch_Time:27144.0min: [2024-01-04 01:14:32,972][model8_pretrain.py][INFO] Epoch:[0/2](299400/4588595) loss:3.003 lr:0.0000100 epoch_Time:27144.0min: [2024-01-04 01:14:32,972][model8_pretrain.py][INFO] Epoch:[0/2](299400/4588595) loss:3.003 lr:0.0000100 epoch_Time:27144.0min: [2024-01-04 01:14:32,972][model8_pretrain.py][INFO] Epoch:[0/2](299400/4588595) loss:3.345 lr:0.0000100 epoch_Time:27144.0min: [2024-01-04 01:14:32,972][model8_pretrain.py][INFO] Epoch:[0/2](299400/4588595) loss:3.347 lr:0.0000100 epoch_Time:27144.0min: [2024-01-04 01:14:32,976][model8_pretrain.py][INFO] Epoch:[0/2](299400/4588595) loss:3.276 lr:0.0000100 epoch_Time:27144.0min: [2024-01-04 01:15:20,004][model8_pretrain.py][INFO] Epoch:[0/2](299500/4588595) loss:3.029 lr:0.0000100 epoch_Time:27146.0min: [2024-01-04 01:15:20,004][model8_pretrain.py][INFO] Epoch:[0/2](299500/4588595) loss:2.926 lr:0.0000100 epoch_Time:27146.0min: [2024-01-04 01:15:20,004][model8_pretrain.py][INFO] Epoch:[0/2](299500/4588595) loss:3.087 lr:0.0000100 epoch_Time:27146.0min: [2024-01-04 
01:15:20,004][model8_pretrain.py][INFO] Epoch:[0/2](299500/4588595) loss:2.590 lr:0.0000100 epoch_Time:27146.0min: [2024-01-04 01:15:20,004][model8_pretrain.py][INFO] Epoch:[0/2](299500/4588595) loss:3.404 lr:0.0000100 epoch_Time:27146.0min: [2024-01-04 01:15:20,004][model8_pretrain.py][INFO] Epoch:[0/2](299500/4588595) loss:3.440 lr:0.0000100 epoch_Time:27146.0min: [2024-01-04 01:15:20,004][model8_pretrain.py][INFO] Epoch:[0/2](299500/4588595) loss:2.768 lr:0.0000100 epoch_Time:27146.0min: [2024-01-04 01:15:20,004][model8_pretrain.py][INFO] Epoch:[0/2](299500/4588595) loss:3.098 lr:0.0000100 epoch_Time:27146.0min: [2024-01-04 01:15:56,934][model8_pretrain.py][INFO] Epoch:[0/2](299600/4588595) loss:3.172 lr:0.0000100 epoch_Time:27144.0min: [2024-01-04 01:15:56,934][model8_pretrain.py][INFO] Epoch:[0/2](299600/4588595) loss:3.252 lr:0.0000100 epoch_Time:27144.0min: [2024-01-04 01:15:56,934][model8_pretrain.py][INFO] Epoch:[0/2](299600/4588595) loss:3.259 lr:0.0000100 epoch_Time:27144.0min: [2024-01-04 01:15:56,934][model8_pretrain.py][INFO] Epoch:[0/2](299600/4588595) loss:3.145 lr:0.0000100 epoch_Time:27144.0min: [2024-01-04 01:15:56,934][model8_pretrain.py][INFO] Epoch:[0/2](299600/4588595) loss:2.659 lr:0.0000100 epoch_Time:27144.0min: [2024-01-04 01:15:56,934][model8_pretrain.py][INFO] Epoch:[0/2](299600/4588595) loss:3.128 lr:0.0000100 epoch_Time:27144.0min: [2024-01-04 01:15:56,935][model8_pretrain.py][INFO] Epoch:[0/2](299600/4588595) loss:2.978 lr:0.0000100 epoch_Time:27144.0min: [2024-01-04 01:15:56,936][model8_pretrain.py][INFO] Epoch:[0/2](299600/4588595) loss:3.223 lr:0.0000100 epoch_Time:27144.0min: [2024-01-04 01:16:33,866][model8_pretrain.py][INFO] Epoch:[0/2](299700/4588595) loss:2.925 lr:0.0000100 epoch_Time:27144.0min: [2024-01-04 01:16:33,866][model8_pretrain.py][INFO] Epoch:[0/2](299700/4588595) loss:2.998 lr:0.0000100 epoch_Time:27144.0min: [2024-01-04 01:16:33,866][model8_pretrain.py][INFO] Epoch:[0/2](299700/4588595) loss:2.469 lr:0.0000100 
epoch_Time:27144.0min: [2024-01-04 01:16:33,866][model8_pretrain.py][INFO] Epoch:[0/2](299700/4588595) loss:3.060 lr:0.0000100 epoch_Time:27144.0min: [2024-01-04 01:16:33,866][model8_pretrain.py][INFO] Epoch:[0/2](299700/4588595) loss:3.261 lr:0.0000100 epoch_Time:27144.0min: [2024-01-04 01:16:33,866][model8_pretrain.py][INFO] Epoch:[0/2](299700/4588595) loss:2.879 lr:0.0000100 epoch_Time:27144.0min: [2024-01-04 01:16:33,866][model8_pretrain.py][INFO] Epoch:[0/2](299700/4588595) loss:2.669 lr:0.0000100 epoch_Time:27144.0min: [2024-01-04 01:16:33,867][model8_pretrain.py][INFO] Epoch:[0/2](299700/4588595) loss:2.696 lr:0.0000100 epoch_Time:27144.0min: [2024-01-04 01:17:10,810][model8_pretrain.py][INFO] Epoch:[0/2](299800/4588595) loss:2.677 lr:0.0000100 epoch_Time:27143.0min: [2024-01-04 01:17:10,810][model8_pretrain.py][INFO] Epoch:[0/2](299800/4588595) loss:2.547 lr:0.0000100 epoch_Time:27143.0min: [2024-01-04 01:17:10,810][model8_pretrain.py][INFO] Epoch:[0/2](299800/4588595) loss:3.246 lr:0.0000100 epoch_Time:27143.0min: [2024-01-04 01:17:10,810][model8_pretrain.py][INFO] Epoch:[0/2](299800/4588595) loss:3.211 lr:0.0000100 epoch_Time:27143.0min: [2024-01-04 01:17:10,810][model8_pretrain.py][INFO] Epoch:[0/2](299800/4588595) loss:2.797 lr:0.0000100 epoch_Time:27143.0min: [2024-01-04 01:17:10,810][model8_pretrain.py][INFO] Epoch:[0/2](299800/4588595) loss:2.784 lr:0.0000100 epoch_Time:27143.0min: [2024-01-04 01:17:10,810][model8_pretrain.py][INFO] Epoch:[0/2](299800/4588595) loss:2.942 lr:0.0000100 epoch_Time:27143.0min: [2024-01-04 01:17:10,810][model8_pretrain.py][INFO] Epoch:[0/2](299800/4588595) loss:2.916 lr:0.0000100 epoch_Time:27143.0min: [2024-01-04 01:17:47,758][model8_pretrain.py][INFO] Epoch:[0/2](299900/4588595) loss:2.993 lr:0.0000100 epoch_Time:27142.0min: [2024-01-04 01:17:47,758][model8_pretrain.py][INFO] Epoch:[0/2](299900/4588595) loss:2.705 lr:0.0000100 epoch_Time:27142.0min: [2024-01-04 01:17:47,758][model8_pretrain.py][INFO] 
Epoch:[0/2](299900/4588595) loss:3.390 lr:0.0000100 epoch_Time:27142.0min: [2024-01-04 01:17:47,758][model8_pretrain.py][INFO] Epoch:[0/2](299900/4588595) loss:2.862 lr:0.0000100 epoch_Time:27142.0min: [2024-01-04 01:17:47,758][model8_pretrain.py][INFO] Epoch:[0/2](299900/4588595) loss:2.699 lr:0.0000100 epoch_Time:27142.0min: [2024-01-04 01:17:47,758][model8_pretrain.py][INFO] Epoch:[0/2](299900/4588595) loss:2.518 lr:0.0000100 epoch_Time:27142.0min: [2024-01-04 01:17:47,758][model8_pretrain.py][INFO] Epoch:[0/2](299900/4588595) loss:2.961 lr:0.0000100 epoch_Time:27142.0min: [2024-01-04 01:17:47,758][model8_pretrain.py][INFO] Epoch:[0/2](299900/4588595) loss:3.363 lr:0.0000100 epoch_Time:27142.0min: [2024-01-04 01:18:24,694][model8_pretrain.py][INFO] Epoch:[0/2](300000/4588595) loss:2.797 lr:0.0000100 epoch_Time:27141.0min: [2024-01-04 01:18:24,694][model8_pretrain.py][INFO] Epoch:[0/2](300000/4588595) loss:3.002 lr:0.0000100 epoch_Time:27141.0min: [2024-01-04 01:18:24,694][model8_pretrain.py][INFO] Epoch:[0/2](300000/4588595) loss:2.887 lr:0.0000100 epoch_Time:27141.0min: [2024-01-04 01:18:24,694][model8_pretrain.py][INFO] Epoch:[0/2](300000/4588595) loss:2.723 lr:0.0000100 epoch_Time:27141.0min: [2024-01-04 01:18:24,694][model8_pretrain.py][INFO] Epoch:[0/2](300000/4588595) loss:2.630 lr:0.0000100 epoch_Time:27141.0min: [2024-01-04 01:18:24,694][model8_pretrain.py][INFO] Epoch:[0/2](300000/4588595) loss:2.981 lr:0.0000100 epoch_Time:27141.0min: [2024-01-04 01:18:24,694][model8_pretrain.py][INFO] Epoch:[0/2](300000/4588595) loss:2.959 lr:0.0000100 epoch_Time:27141.0min: [2024-01-04 01:18:24,694][model8_pretrain.py][INFO] Epoch:[0/2](300000/4588595) loss:2.667 lr:0.0000100 epoch_Time:27141.0min: [2024-01-04 01:19:01,637][model8_pretrain.py][INFO] Epoch:[0/2](300100/4588595) loss:2.242 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:19:01,637][model8_pretrain.py][INFO] Epoch:[0/2](300100/4588595) loss:2.574 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 
01:19:01,637][model8_pretrain.py][INFO] Epoch:[0/2](300100/4588595) loss:2.573 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:19:01,637][model8_pretrain.py][INFO] Epoch:[0/2](300100/4588595) loss:2.809 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:19:01,637][model8_pretrain.py][INFO] Epoch:[0/2](300100/4588595) loss:3.004 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:19:01,637][model8_pretrain.py][INFO] Epoch:[0/2](300100/4588595) loss:2.654 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:19:01,638][model8_pretrain.py][INFO] Epoch:[0/2](300100/4588595) loss:3.049 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:19:01,638][model8_pretrain.py][INFO] Epoch:[0/2](300100/4588595) loss:2.936 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:19:38,573][model8_pretrain.py][INFO] Epoch:[0/2](300200/4588595) loss:2.914 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:19:38,573][model8_pretrain.py][INFO] Epoch:[0/2](300200/4588595) loss:2.936 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:19:38,573][model8_pretrain.py][INFO] Epoch:[0/2](300200/4588595) loss:3.374 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:19:38,573][model8_pretrain.py][INFO] Epoch:[0/2](300200/4588595) loss:2.608 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:19:38,573][model8_pretrain.py][INFO] Epoch:[0/2](300200/4588595) loss:2.960 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:19:38,573][model8_pretrain.py][INFO] Epoch:[0/2](300200/4588595) loss:3.015 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:19:38,573][model8_pretrain.py][INFO] Epoch:[0/2](300200/4588595) loss:2.888 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:19:38,574][model8_pretrain.py][INFO] Epoch:[0/2](300200/4588595) loss:3.065 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:20:25,686][model8_pretrain.py][INFO] Epoch:[0/2](300300/4588595) loss:2.001 lr:0.0000100 epoch_Time:27141.0min: [2024-01-04 01:20:25,686][model8_pretrain.py][INFO] Epoch:[0/2](300300/4588595) loss:2.657 lr:0.0000100 
epoch_Time:27141.0min: [2024-01-04 01:20:25,686][model8_pretrain.py][INFO] Epoch:[0/2](300300/4588595) loss:2.519 lr:0.0000100 epoch_Time:27141.0min: [2024-01-04 01:20:25,686][model8_pretrain.py][INFO] Epoch:[0/2](300300/4588595) loss:2.660 lr:0.0000100 epoch_Time:27141.0min: [2024-01-04 01:20:25,686][model8_pretrain.py][INFO] Epoch:[0/2](300300/4588595) loss:3.123 lr:0.0000100 epoch_Time:27141.0min: [2024-01-04 01:20:25,686][model8_pretrain.py][INFO] Epoch:[0/2](300300/4588595) loss:2.793 lr:0.0000100 epoch_Time:27141.0min: [2024-01-04 01:20:25,686][model8_pretrain.py][INFO] Epoch:[0/2](300300/4588595) loss:2.708 lr:0.0000100 epoch_Time:27141.0min: [2024-01-04 01:20:25,686][model8_pretrain.py][INFO] Epoch:[0/2](300300/4588595) loss:2.118 lr:0.0000100 epoch_Time:27141.0min: [2024-01-04 01:21:02,602][model8_pretrain.py][INFO] Epoch:[0/2](300400/4588595) loss:2.580 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:21:02,602][model8_pretrain.py][INFO] Epoch:[0/2](300400/4588595) loss:3.163 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:21:02,602][model8_pretrain.py][INFO] Epoch:[0/2](300400/4588595) loss:2.494 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:21:02,602][model8_pretrain.py][INFO] Epoch:[0/2](300400/4588595) loss:3.227 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:21:02,602][model8_pretrain.py][INFO] Epoch:[0/2](300400/4588595) loss:2.762 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:21:02,603][model8_pretrain.py][INFO] Epoch:[0/2](300400/4588595) loss:2.274 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:21:02,603][model8_pretrain.py][INFO] Epoch:[0/2](300400/4588595) loss:2.615 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:21:02,603][model8_pretrain.py][INFO] Epoch:[0/2](300400/4588595) loss:2.923 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:21:39,527][model8_pretrain.py][INFO] Epoch:[0/2](300500/4588595) loss:2.753 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:21:39,527][model8_pretrain.py][INFO] 
Epoch:[0/2](300500/4588595) loss:3.439 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:21:39,527][model8_pretrain.py][INFO] Epoch:[0/2](300500/4588595) loss:2.991 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:21:39,527][model8_pretrain.py][INFO] Epoch:[0/2](300500/4588595) loss:2.368 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:21:39,527][model8_pretrain.py][INFO] Epoch:[0/2](300500/4588595) loss:3.282 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:21:39,527][model8_pretrain.py][INFO] Epoch:[0/2](300500/4588595) loss:3.133 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:21:39,527][model8_pretrain.py][INFO] Epoch:[0/2](300500/4588595) loss:2.942 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:21:39,527][model8_pretrain.py][INFO] Epoch:[0/2](300500/4588595) loss:2.850 lr:0.0000100 epoch_Time:27140.0min: [2024-01-04 01:22:16,461][model8_pretrain.py][INFO] Epoch:[0/2](300600/4588595) loss:2.856 lr:0.0000100 epoch_Time:27138.0min: [2024-01-04 01:22:16,461][model8_pretrain.py][INFO] Epoch:[0/2](300600/4588595) loss:3.019 lr:0.0000100 epoch_Time:27138.0min: [2024-01-04 01:22:16,461][model8_pretrain.py][INFO] Epoch:[0/2](300600/4588595) loss:2.702 lr:0.0000100 epoch_Time:27138.0min: [2024-01-04 01:22:16,461][model8_pretrain.py][INFO] Epoch:[0/2](300600/4588595) loss:2.892 lr:0.0000100 epoch_Time:27138.0min: [2024-01-04 01:22:16,461][model8_pretrain.py][INFO] Epoch:[0/2](300600/4588595) loss:3.140 lr:0.0000100 epoch_Time:27138.0min: [2024-01-04 01:22:16,461][model8_pretrain.py][INFO] Epoch:[0/2](300600/4588595) loss:2.842 lr:0.0000100 epoch_Time:27138.0min: [2024-01-04 01:22:16,461][model8_pretrain.py][INFO] Epoch:[0/2](300600/4588595) loss:2.627 lr:0.0000100 epoch_Time:27138.0min: [2024-01-04 01:22:16,461][model8_pretrain.py][INFO] Epoch:[0/2](300600/4588595) loss:3.000 lr:0.0000100 epoch_Time:27138.0min: [2024-01-04 01:22:53,401][model8_pretrain.py][INFO] Epoch:[0/2](300700/4588595) loss:3.055 lr:0.0000100 epoch_Time:27137.0min: [2024-01-04 
01:22:53,401][model8_pretrain.py][INFO] Epoch:[0/2](300700/4588595) loss:2.956 lr:0.0000100 epoch_Time:27137.0min: [2024-01-04 01:22:53,401][model8_pretrain.py][INFO] Epoch:[0/2](300700/4588595) loss:2.870 lr:0.0000100 epoch_Time:27137.0min: [2024-01-04 01:22:53,401][model8_pretrain.py][INFO] Epoch:[0/2](300700/4588595) loss:2.604 lr:0.0000100 epoch_Time:27137.0min: [2024-01-04 01:22:53,401][model8_pretrain.py][INFO] Epoch:[0/2](300700/4588595) loss:2.799 lr:0.0000100 epoch_Time:27137.0min: [2024-01-04 01:22:53,402][model8_pretrain.py][INFO] Epoch:[0/2](300700/4588595) loss:3.012 lr:0.0000100 epoch_Time:27137.0min: [2024-01-04 01:22:53,401][model8_pretrain.py][INFO] Epoch:[0/2](300700/4588595) loss:2.930 lr:0.0000100 epoch_Time:27137.0min: [2024-01-04 01:22:53,401][model8_pretrain.py][INFO] Epoch:[0/2](300700/4588595) loss:3.176 lr:0.0000100 epoch_Time:27137.0min: [2024-01-04 01:23:30,336][model8_pretrain.py][INFO] Epoch:[0/2](300800/4588595) loss:3.112 lr:0.0000100 epoch_Time:27137.0min: [2024-01-04 01:23:30,336][model8_pretrain.py][INFO] Epoch:[0/2](300800/4588595) loss:2.964 lr:0.0000100 epoch_Time:27137.0min: [2024-01-04 01:23:30,336][model8_pretrain.py][INFO] Epoch:[0/2](300800/4588595) loss:2.392 lr:0.0000100 epoch_Time:27137.0min: [2024-01-04 01:23:30,336][model8_pretrain.py][INFO] Epoch:[0/2](300800/4588595) loss:2.835 lr:0.0000100 epoch_Time:27137.0min: [2024-01-04 01:23:30,336][model8_pretrain.py][INFO] Epoch:[0/2](300800/4588595) loss:2.796 lr:0.0000100 epoch_Time:27137.0min: [2024-01-04 01:23:30,336][model8_pretrain.py][INFO] Epoch:[0/2](300800/4588595) loss:2.950 lr:0.0000100 epoch_Time:27137.0min: [2024-01-04 01:23:30,336][model8_pretrain.py][INFO] Epoch:[0/2](300800/4588595) loss:2.741 lr:0.0000100 epoch_Time:27137.0min: [2024-01-04 01:23:30,336][model8_pretrain.py][INFO] Epoch:[0/2](300800/4588595) loss:3.068 lr:0.0000100 epoch_Time:27137.0min: [2024-01-04 01:24:07,270][model8_pretrain.py][INFO] Epoch:[0/2](300900/4588595) loss:2.813 lr:0.0000100 
epoch_Time:27136.0min: [2024-01-04 01:24:07,270][model8_pretrain.py][INFO] Epoch:[0/2](300900/4588595) loss:3.017 lr:0.0000100 epoch_Time:27136.0min: [2024-01-04 01:24:07,270][model8_pretrain.py][INFO] Epoch:[0/2](300900/4588595) loss:3.371 lr:0.0000100 epoch_Time:27136.0min: [2024-01-04 01:24:07,270][model8_pretrain.py][INFO] Epoch:[0/2](300900/4588595) loss:3.356 lr:0.0000100 epoch_Time:27136.0min: [2024-01-04 01:24:07,270][model8_pretrain.py][INFO] Epoch:[0/2](300900/4588595) loss:2.654 lr:0.0000100 epoch_Time:27136.0min: [2024-01-04 01:24:07,270][model8_pretrain.py][INFO] Epoch:[0/2](300900/4588595) loss:2.903 lr:0.0000100 epoch_Time:27136.0min: [2024-01-04 01:24:07,271][model8_pretrain.py][INFO] Epoch:[0/2](300900/4588595) loss:3.048 lr:0.0000100 epoch_Time:27136.0min: [2024-01-04 01:24:07,271][model8_pretrain.py][INFO] Epoch:[0/2](300900/4588595) loss:3.126 lr:0.0000100 epoch_Time:27136.0min: [2024-01-04 01:24:44,211][model8_pretrain.py][INFO] Epoch:[0/2](301000/4588595) loss:2.078 lr:0.0000100 epoch_Time:27135.0min: [2024-01-04 01:24:44,211][model8_pretrain.py][INFO] Epoch:[0/2](301000/4588595) loss:3.056 lr:0.0000100 epoch_Time:27135.0min: [2024-01-04 01:24:44,211][model8_pretrain.py][INFO] Epoch:[0/2](301000/4588595) loss:3.476 lr:0.0000100 epoch_Time:27135.0min: [2024-01-04 01:24:44,211][model8_pretrain.py][INFO] Epoch:[0/2](301000/4588595) loss:3.515 lr:0.0000100 epoch_Time:27135.0min: [2024-01-04 01:24:44,211][model8_pretrain.py][INFO] Epoch:[0/2](301000/4588595) loss:3.095 lr:0.0000100 epoch_Time:27135.0min: [2024-01-04 01:24:44,211][model8_pretrain.py][INFO] Epoch:[0/2](301000/4588595) loss:3.512 lr:0.0000100 epoch_Time:27135.0min: [2024-01-04 01:24:44,211][model8_pretrain.py][INFO] Epoch:[0/2](301000/4588595) loss:3.612 lr:0.0000100 epoch_Time:27135.0min: [2024-01-04 01:24:44,211][model8_pretrain.py][INFO] Epoch:[0/2](301000/4588595) loss:2.681 lr:0.0000100 epoch_Time:27135.0min: [2024-01-04 01:25:31,339][model8_pretrain.py][INFO] 
Epoch:[0/2](301100/4588595) loss:2.875 lr:0.0000100 epoch_Time:27137.0min: [2024-01-04 01:25:31,339][model8_pretrain.py][INFO] Epoch:[0/2](301100/4588595) loss:2.518 lr:0.0000100 epoch_Time:27137.0min: [2024-01-04 01:25:31,339][model8_pretrain.py][INFO] Epoch:[0/2](301100/4588595) loss:2.781 lr:0.0000100 epoch_Time:27137.0min: [2024-01-04 01:25:31,339][model8_pretrain.py][INFO] Epoch:[0/2](301100/4588595) loss:2.916 lr:0.0000100 epoch_Time:27137.0min: [2024-01-04 01:25:31,339][model8_pretrain.py][INFO] Epoch:[0/2](301100/4588595) loss:3.105 lr:0.0000100 epoch_Time:27137.0min: [2024-01-04 01:25:31,339][model8_pretrain.py][INFO] Epoch:[0/2](301100/4588595) loss:2.363 lr:0.0000100 epoch_Time:27137.0min: [2024-01-04 01:25:31,339][model8_pretrain.py][INFO] Epoch:[0/2](301100/4588595) loss:2.762 lr:0.0000100 epoch_Time:27137.0min: [2024-01-04 01:25:31,339][model8_pretrain.py][INFO] Epoch:[0/2](301100/4588595) loss:3.019 lr:0.0000100 epoch_Time:27137.0min: [2024-01-04 01:26:08,266][model8_pretrain.py][INFO] Epoch:[0/2](301200/4588595) loss:3.197 lr:0.0000100 epoch_Time:27135.0min: [2024-01-04 01:26:08,266][model8_pretrain.py][INFO] Epoch:[0/2](301200/4588595) loss:2.883 lr:0.0000100 epoch_Time:27135.0min: [2024-01-04 01:26:08,266][model8_pretrain.py][INFO] Epoch:[0/2](301200/4588595) loss:2.856 lr:0.0000100 epoch_Time:27135.0min: [2024-01-04 01:26:08,266][model8_pretrain.py][INFO] Epoch:[0/2](301200/4588595) loss:3.207 lr:0.0000100 epoch_Time:27135.0min: [2024-01-04 01:26:08,266][model8_pretrain.py][INFO] Epoch:[0/2](301200/4588595) loss:2.736 lr:0.0000100 epoch_Time:27135.0min: [2024-01-04 01:26:08,266][model8_pretrain.py][INFO] Epoch:[0/2](301200/4588595) loss:2.823 lr:0.0000100 epoch_Time:27135.0min: [2024-01-04 01:26:08,266][model8_pretrain.py][INFO] Epoch:[0/2](301200/4588595) loss:2.906 lr:0.0000100 epoch_Time:27135.0min: [2024-01-04 01:26:08,266][model8_pretrain.py][INFO] Epoch:[0/2](301200/4588595) loss:2.934 lr:0.0000100 epoch_Time:27135.0min: [2024-01-04 
01:26:45,195][model8_pretrain.py][INFO] Epoch:[0/2](301300/4588595) loss:2.499 lr:0.0000100 epoch_Time:27135.0min: [2024-01-04 01:26:45,195][model8_pretrain.py][INFO] Epoch:[0/2](301300/4588595) loss:3.044 lr:0.0000100 epoch_Time:27135.0min: [2024-01-04 01:26:45,196][model8_pretrain.py][INFO] Epoch:[0/2](301300/4588595) loss:3.034 lr:0.0000100 epoch_Time:27135.0min: [2024-01-04 01:26:45,195][model8_pretrain.py][INFO] Epoch:[0/2](301300/4588595) loss:3.494 lr:0.0000100 epoch_Time:27135.0min: [2024-01-04 01:26:45,196][model8_pretrain.py][INFO] Epoch:[0/2](301300/4588595) loss:2.969 lr:0.0000100 epoch_Time:27135.0min: [2024-01-04 01:26:45,196][model8_pretrain.py][INFO] Epoch:[0/2](301300/4588595) loss:2.972 lr:0.0000100 epoch_Time:27135.0min: [2024-01-04 01:26:45,196][model8_pretrain.py][INFO] Epoch:[0/2](301300/4588595) loss:3.306 lr:0.0000100 epoch_Time:27135.0min: [2024-01-04 01:26:45,196][model8_pretrain.py][INFO] Epoch:[0/2](301300/4588595) loss:2.931 lr:0.0000100 epoch_Time:27135.0min: [2024-01-04 01:27:22,131][model8_pretrain.py][INFO] Epoch:[0/2](301400/4588595) loss:3.069 lr:0.0000100 epoch_Time:27134.0min: [2024-01-04 01:27:22,131][model8_pretrain.py][INFO] Epoch:[0/2](301400/4588595) loss:3.351 lr:0.0000100 epoch_Time:27134.0min: [2024-01-04 01:27:22,131][model8_pretrain.py][INFO] Epoch:[0/2](301400/4588595) loss:2.562 lr:0.0000100 epoch_Time:27134.0min: [2024-01-04 01:27:22,131][model8_pretrain.py][INFO] Epoch:[0/2](301400/4588595) loss:3.359 lr:0.0000100 epoch_Time:27134.0min: [2024-01-04 01:27:22,131][model8_pretrain.py][INFO] Epoch:[0/2](301400/4588595) loss:2.549 lr:0.0000100 epoch_Time:27134.0min: [2024-01-04 01:27:22,131][model8_pretrain.py][INFO] Epoch:[0/2](301400/4588595) loss:2.847 lr:0.0000100 epoch_Time:27134.0min: [2024-01-04 01:27:22,131][model8_pretrain.py][INFO] Epoch:[0/2](301400/4588595) loss:2.617 lr:0.0000100 epoch_Time:27134.0min: [2024-01-04 01:27:22,131][model8_pretrain.py][INFO] Epoch:[0/2](301400/4588595) loss:2.903 lr:0.0000100 
epoch_Time:27134.0min: [2024-01-04 01:27:59,065][model8_pretrain.py][INFO] Epoch:[0/2](301500/4588595) loss:3.081 lr:0.0000100 epoch_Time:27133.0min: [2024-01-04 01:27:59,065][model8_pretrain.py][INFO] Epoch:[0/2](301500/4588595) loss:3.032 lr:0.0000100 epoch_Time:27133.0min: [2024-01-04 01:27:59,065][model8_pretrain.py][INFO] Epoch:[0/2](301500/4588595) loss:3.028 lr:0.0000100 epoch_Time:27133.0min: [2024-01-04 01:27:59,065][model8_pretrain.py][INFO] Epoch:[0/2](301500/4588595) loss:3.016 lr:0.0000100 epoch_Time:27133.0min: [2024-01-04 01:27:59,065][model8_pretrain.py][INFO] Epoch:[0/2](301500/4588595) loss:2.630 lr:0.0000100 epoch_Time:27133.0min: [2024-01-04 01:27:59,065][model8_pretrain.py][INFO] Epoch:[0/2](301500/4588595) loss:3.064 lr:0.0000100 epoch_Time:27133.0min: [2024-01-04 01:27:59,066][model8_pretrain.py][INFO] Epoch:[0/2](301500/4588595) loss:2.804 lr:0.0000100 epoch_Time:27133.0min: [2024-01-04 01:27:59,066][model8_pretrain.py][INFO] Epoch:[0/2](301500/4588595) loss:2.667 lr:0.0000100 epoch_Time:27133.0min: [2024-01-04 01:28:35,999][model8_pretrain.py][INFO] Epoch:[0/2](301600/4588595) loss:2.278 lr:0.0000100 epoch_Time:27132.0min: [2024-01-04 01:28:35,999][model8_pretrain.py][INFO] Epoch:[0/2](301600/4588595) loss:3.075 lr:0.0000100 epoch_Time:27132.0min: [2024-01-04 01:28:35,999][model8_pretrain.py][INFO] Epoch:[0/2](301600/4588595) loss:3.187 lr:0.0000100 epoch_Time:27132.0min: [2024-01-04 01:28:35,999][model8_pretrain.py][INFO] Epoch:[0/2](301600/4588595) loss:2.952 lr:0.0000100 epoch_Time:27132.0min: [2024-01-04 01:28:35,999][model8_pretrain.py][INFO] Epoch:[0/2](301600/4588595) loss:2.954 lr:0.0000100 epoch_Time:27132.0min: [2024-01-04 01:28:35,999][model8_pretrain.py][INFO] Epoch:[0/2](301600/4588595) loss:2.594 lr:0.0000100 epoch_Time:27132.0min: [2024-01-04 01:28:35,999][model8_pretrain.py][INFO] Epoch:[0/2](301600/4588595) loss:2.921 lr:0.0000100 epoch_Time:27132.0min: [2024-01-04 01:28:36,000][model8_pretrain.py][INFO] 
Epoch:[0/2](301600/4588595) loss:2.866 lr:0.0000100 epoch_Time:27132.0min: [2024-01-04 01:29:12,932][model8_pretrain.py][INFO] Epoch:[0/2](301700/4588595) loss:2.863 lr:0.0000100 epoch_Time:27131.0min: [2024-01-04 01:29:12,932][model8_pretrain.py][INFO] Epoch:[0/2](301700/4588595) loss:2.782 lr:0.0000100 epoch_Time:27131.0min: [2024-01-04 01:29:12,932][model8_pretrain.py][INFO] Epoch:[0/2](301700/4588595) loss:3.171 lr:0.0000100 epoch_Time:27131.0min: [2024-01-04 01:29:12,932][model8_pretrain.py][INFO] Epoch:[0/2](301700/4588595) loss:3.054 lr:0.0000100 epoch_Time:27131.0min: [2024-01-04 01:29:12,932][model8_pretrain.py][INFO] Epoch:[0/2](301700/4588595) loss:2.919 lr:0.0000100 epoch_Time:27131.0min: [2024-01-04 01:29:12,932][model8_pretrain.py][INFO] Epoch:[0/2](301700/4588595) loss:3.392 lr:0.0000100 epoch_Time:27131.0min: [2024-01-04 01:29:12,933][model8_pretrain.py][INFO] Epoch:[0/2](301700/4588595) loss:2.670 lr:0.0000100 epoch_Time:27131.0min: [2024-01-04 01:29:12,932][model8_pretrain.py][INFO] Epoch:[0/2](301700/4588595) loss:2.710 lr:0.0000100 epoch_Time:27131.0min: [2024-01-04 01:29:49,869][model8_pretrain.py][INFO] Epoch:[0/2](301800/4588595) loss:2.550 lr:0.0000100 epoch_Time:27130.0min: [2024-01-04 01:29:49,869][model8_pretrain.py][INFO] Epoch:[0/2](301800/4588595) loss:2.867 lr:0.0000100 epoch_Time:27130.0min: [2024-01-04 01:29:49,869][model8_pretrain.py][INFO] Epoch:[0/2](301800/4588595) loss:2.819 lr:0.0000100 epoch_Time:27130.0min: [2024-01-04 01:29:49,869][model8_pretrain.py][INFO] Epoch:[0/2](301800/4588595) loss:2.952 lr:0.0000100 epoch_Time:27130.0min: [2024-01-04 01:29:49,869][model8_pretrain.py][INFO] Epoch:[0/2](301800/4588595) loss:2.352 lr:0.0000100 epoch_Time:27130.0min: [2024-01-04 01:29:49,869][model8_pretrain.py][INFO] Epoch:[0/2](301800/4588595) loss:3.017 lr:0.0000100 epoch_Time:27130.0min: [2024-01-04 01:29:49,870][model8_pretrain.py][INFO] Epoch:[0/2](301800/4588595) loss:2.147 lr:0.0000100 epoch_Time:27130.0min: [2024-01-04 
01:29:49,870][model8_pretrain.py][INFO] Epoch:[0/2](301800/4588595) loss:2.773 lr:0.0000100 epoch_Time:27130.0min: [2024-01-04 01:30:36,924][model8_pretrain.py][INFO] Epoch:[0/2](301900/4588595) loss:3.002 lr:0.0000100 epoch_Time:27132.0min: [2024-01-04 01:30:36,924][model8_pretrain.py][INFO] Epoch:[0/2](301900/4588595) loss:2.880 lr:0.0000100 epoch_Time:27132.0min: [2024-01-04 01:30:36,924][model8_pretrain.py][INFO] Epoch:[0/2](301900/4588595) loss:2.842 lr:0.0000100 epoch_Time:27132.0min: [2024-01-04 01:30:36,924][model8_pretrain.py][INFO] Epoch:[0/2](301900/4588595) loss:2.902 lr:0.0000100 epoch_Time:27132.0min: [2024-01-04 01:30:36,924][model8_pretrain.py][INFO] Epoch:[0/2](301900/4588595) loss:3.338 lr:0.0000100 epoch_Time:27132.0min: [2024-01-04 01:30:36,924][model8_pretrain.py][INFO] Epoch:[0/2](301900/4588595) loss:2.843 lr:0.0000100 epoch_Time:27132.0min: [2024-01-04 01:30:36,924][model8_pretrain.py][INFO] Epoch:[0/2](301900/4588595) loss:3.201 lr:0.0000100 epoch_Time:27132.0min: [2024-01-04 01:30:36,924][model8_pretrain.py][INFO] Epoch:[0/2](301900/4588595) loss:3.145 lr:0.0000100 epoch_Time:27132.0min: [2024-01-04 01:31:13,850][model8_pretrain.py][INFO] Epoch:[0/2](302000/4588595) loss:2.703 lr:0.0000100 epoch_Time:27131.0min: [2024-01-04 01:31:13,850][model8_pretrain.py][INFO] Epoch:[0/2](302000/4588595) loss:2.704 lr:0.0000100 epoch_Time:27131.0min: [2024-01-04 01:31:13,850][model8_pretrain.py][INFO] Epoch:[0/2](302000/4588595) loss:3.190 lr:0.0000100 epoch_Time:27131.0min: [2024-01-04 01:31:13,850][model8_pretrain.py][INFO] Epoch:[0/2](302000/4588595) loss:2.688 lr:0.0000100 epoch_Time:27131.0min: [2024-01-04 01:31:13,850][model8_pretrain.py][INFO] Epoch:[0/2](302000/4588595) loss:3.529 lr:0.0000100 epoch_Time:27131.0min: [2024-01-04 01:31:13,850][model8_pretrain.py][INFO] Epoch:[0/2](302000/4588595) loss:2.959 lr:0.0000100 epoch_Time:27131.0min: [2024-01-04 01:31:13,850][model8_pretrain.py][INFO] Epoch:[0/2](302000/4588595) loss:2.769 lr:0.0000100 
epoch_Time:27131.0min: [2024-01-04 01:31:13,850][model8_pretrain.py][INFO] Epoch:[0/2](302000/4588595) loss:2.940 lr:0.0000100 epoch_Time:27131.0min: [2024-01-04 01:31:50,783][model8_pretrain.py][INFO] Epoch:[0/2](302100/4588595) loss:2.592 lr:0.0000100 epoch_Time:27130.0min: [2024-01-04 01:31:50,783][model8_pretrain.py][INFO] Epoch:[0/2](302100/4588595) loss:2.674 lr:0.0000100 epoch_Time:27130.0min: [2024-01-04 01:31:50,783][model8_pretrain.py][INFO] Epoch:[0/2](302100/4588595) loss:2.395 lr:0.0000100 epoch_Time:27130.0min: [2024-01-04 01:31:50,783][model8_pretrain.py][INFO] Epoch:[0/2](302100/4588595) loss:3.154 lr:0.0000100 epoch_Time:27130.0min: [2024-01-04 01:31:50,783][model8_pretrain.py][INFO] Epoch:[0/2](302100/4588595) loss:2.769 lr:0.0000100 epoch_Time:27130.0min: [2024-01-04 01:31:50,784][model8_pretrain.py][INFO] Epoch:[0/2](302100/4588595) loss:2.543 lr:0.0000100 epoch_Time:27130.0min: [2024-01-04 01:31:50,784][model8_pretrain.py][INFO] Epoch:[0/2](302100/4588595) loss:3.090 lr:0.0000100 epoch_Time:27130.0min: [2024-01-04 01:31:50,784][model8_pretrain.py][INFO] Epoch:[0/2](302100/4588595) loss:3.019 lr:0.0000100 epoch_Time:27130.0min: [2024-01-04 01:32:27,722][model8_pretrain.py][INFO] Epoch:[0/2](302200/4588595) loss:2.957 lr:0.0000100 epoch_Time:27129.0min: [2024-01-04 01:32:27,722][model8_pretrain.py][INFO] Epoch:[0/2](302200/4588595) loss:3.453 lr:0.0000100 epoch_Time:27129.0min: [2024-01-04 01:32:27,722][model8_pretrain.py][INFO] Epoch:[0/2](302200/4588595) loss:3.313 lr:0.0000100 epoch_Time:27129.0min: [2024-01-04 01:32:27,722][model8_pretrain.py][INFO] Epoch:[0/2](302200/4588595) loss:2.688 lr:0.0000100 epoch_Time:27129.0min: [2024-01-04 01:32:27,722][model8_pretrain.py][INFO] Epoch:[0/2](302200/4588595) loss:3.156 lr:0.0000100 epoch_Time:27129.0min: [2024-01-04 01:32:27,723][model8_pretrain.py][INFO] Epoch:[0/2](302200/4588595) loss:2.902 lr:0.0000100 epoch_Time:27129.0min: [2024-01-04 01:32:27,723][model8_pretrain.py][INFO] 
Epoch:[0/2](302200/4588595) loss:3.232 lr:0.0000100 epoch_Time:27129.0min: [2024-01-04 01:32:27,723][model8_pretrain.py][INFO] Epoch:[0/2](302200/4588595) loss:2.587 lr:0.0000100 epoch_Time:27129.0min: [2024-01-04 01:33:04,670][model8_pretrain.py][INFO] Epoch:[0/2](302300/4588595) loss:3.715 lr:0.0000100 epoch_Time:27128.0min: [2024-01-04 01:33:04,670][model8_pretrain.py][INFO] Epoch:[0/2](302300/4588595) loss:2.987 lr:0.0000100 epoch_Time:27128.0min: [2024-01-04 01:33:04,670][model8_pretrain.py][INFO] Epoch:[0/2](302300/4588595) loss:2.723 lr:0.0000100 epoch_Time:27128.0min: [2024-01-04 01:33:04,670][model8_pretrain.py][INFO] Epoch:[0/2](302300/4588595) loss:3.092 lr:0.0000100 epoch_Time:27128.0min: [2024-01-04 01:33:04,670][model8_pretrain.py][INFO] Epoch:[0/2](302300/4588595) loss:3.005 lr:0.0000100 epoch_Time:27128.0min: [2024-01-04 01:33:04,670][model8_pretrain.py][INFO] Epoch:[0/2](302300/4588595) loss:2.516 lr:0.0000100 epoch_Time:27128.0min: [2024-01-04 01:33:04,670][model8_pretrain.py][INFO] Epoch:[0/2](302300/4588595) loss:2.664 lr:0.0000100 epoch_Time:27128.0min: [2024-01-04 01:33:04,670][model8_pretrain.py][INFO] Epoch:[0/2](302300/4588595) loss:3.335 lr:0.0000100 epoch_Time:27128.0min: [2024-01-04 01:33:41,611][model8_pretrain.py][INFO] Epoch:[0/2](302400/4588595) loss:2.496 lr:0.0000100 epoch_Time:27128.0min: [2024-01-04 01:33:41,611][model8_pretrain.py][INFO] Epoch:[0/2](302400/4588595) loss:2.649 lr:0.0000100 epoch_Time:27128.0min: [2024-01-04 01:33:41,611][model8_pretrain.py][INFO] Epoch:[0/2](302400/4588595) loss:2.790 lr:0.0000100 epoch_Time:27128.0min: [2024-01-04 01:33:41,611][model8_pretrain.py][INFO] Epoch:[0/2](302400/4588595) loss:3.245 lr:0.0000100 epoch_Time:27128.0min: [2024-01-04 01:33:41,611][model8_pretrain.py][INFO] Epoch:[0/2](302400/4588595) loss:2.599 lr:0.0000100 epoch_Time:27128.0min: [2024-01-04 01:33:41,611][model8_pretrain.py][INFO] Epoch:[0/2](302400/4588595) loss:3.286 lr:0.0000100 epoch_Time:27128.0min: [2024-01-04 
01:33:41,611][model8_pretrain.py][INFO] Epoch:[0/2](302400/4588595) loss:3.185 lr:0.0000100 epoch_Time:27128.0min: [2024-01-04 01:33:41,612][model8_pretrain.py][INFO] Epoch:[0/2](302400/4588595) loss:2.665 lr:0.0000100 epoch_Time:27128.0min: [2024-01-04 01:34:18,557][model8_pretrain.py][INFO] Epoch:[0/2](302500/4588595) loss:3.309 lr:0.0000100 epoch_Time:27127.0min: [2024-01-04 01:34:18,557][model8_pretrain.py][INFO] Epoch:[0/2](302500/4588595) loss:3.043 lr:0.0000100 epoch_Time:27127.0min: [2024-01-04 01:34:18,557][model8_pretrain.py][INFO] Epoch:[0/2](302500/4588595) loss:2.793 lr:0.0000100 epoch_Time:27127.0min: [2024-01-04 01:34:18,557][model8_pretrain.py][INFO] Epoch:[0/2](302500/4588595) loss:2.496 lr:0.0000100 epoch_Time:27127.0min: [2024-01-04 01:34:18,557][model8_pretrain.py][INFO] Epoch:[0/2](302500/4588595) loss:2.185 lr:0.0000100 epoch_Time:27127.0min: [2024-01-04 01:34:18,557][model8_pretrain.py][INFO] Epoch:[0/2](302500/4588595) loss:3.283 lr:0.0000100 epoch_Time:27127.0min: [2024-01-04 01:34:18,557][model8_pretrain.py][INFO] Epoch:[0/2](302500/4588595) loss:2.700 lr:0.0000100 epoch_Time:27127.0min: [2024-01-04 01:34:18,557][model8_pretrain.py][INFO] Epoch:[0/2](302500/4588595) loss:2.979 lr:0.0000100 epoch_Time:27127.0min: [2024-01-04 01:34:55,500][model8_pretrain.py][INFO] Epoch:[0/2](302600/4588595) loss:3.354 lr:0.0000100 epoch_Time:27125.0min: [2024-01-04 01:34:55,500][model8_pretrain.py][INFO] Epoch:[0/2](302600/4588595) loss:2.347 lr:0.0000100 epoch_Time:27125.0min: [2024-01-04 01:34:55,500][model8_pretrain.py][INFO] Epoch:[0/2](302600/4588595) loss:3.513 lr:0.0000100 epoch_Time:27125.0min: [2024-01-04 01:34:55,500][model8_pretrain.py][INFO] Epoch:[0/2](302600/4588595) loss:2.866 lr:0.0000100 epoch_Time:27125.0min: [2024-01-04 01:34:55,500][model8_pretrain.py][INFO] Epoch:[0/2](302600/4588595) loss:3.466 lr:0.0000100 epoch_Time:27125.0min: [2024-01-04 01:34:55,500][model8_pretrain.py][INFO] Epoch:[0/2](302600/4588595) loss:2.926 lr:0.0000100 
epoch_Time:27125.0min: [2024-01-04 01:34:55,500][model8_pretrain.py][INFO] Epoch:[0/2](302600/4588595) loss:2.725 lr:0.0000100 epoch_Time:27125.0min: [2024-01-04 01:34:55,500][model8_pretrain.py][INFO] Epoch:[0/2](302600/4588595) loss:3.274 lr:0.0000100 epoch_Time:27125.0min: [2024-01-04 01:35:42,444][model8_pretrain.py][INFO] Epoch:[0/2](302700/4588595) loss:3.261 lr:0.0000100 epoch_Time:27128.0min: [2024-01-04 01:35:42,444][model8_pretrain.py][INFO] Epoch:[0/2](302700/4588595) loss:3.334 lr:0.0000100 epoch_Time:27128.0min: [2024-01-04 01:35:42,444][model8_pretrain.py][INFO] Epoch:[0/2](302700/4588595) loss:3.221 lr:0.0000100 epoch_Time:27128.0min: [2024-01-04 01:35:42,444][model8_pretrain.py][INFO] Epoch:[0/2](302700/4588595) loss:2.214 lr:0.0000100 epoch_Time:27128.0min: [2024-01-04 01:35:42,444][model8_pretrain.py][INFO] Epoch:[0/2](302700/4588595) loss:2.979 lr:0.0000100 epoch_Time:27128.0min: [2024-01-04 01:35:42,444][model8_pretrain.py][INFO] Epoch:[0/2](302700/4588595) loss:2.973 lr:0.0000100 epoch_Time:27128.0min: [2024-01-04 01:35:42,444][model8_pretrain.py][INFO] Epoch:[0/2](302700/4588595) loss:3.351 lr:0.0000100 epoch_Time:27128.0min: [2024-01-04 01:35:42,444][model8_pretrain.py][INFO] Epoch:[0/2](302700/4588595) loss:2.700 lr:0.0000100 epoch_Time:27128.0min: [2024-01-04 01:36:19,378][model8_pretrain.py][INFO] Epoch:[0/2](302800/4588595) loss:2.827 lr:0.0000100 epoch_Time:27126.0min: [2024-01-04 01:36:19,378][model8_pretrain.py][INFO] Epoch:[0/2](302800/4588595) loss:3.164 lr:0.0000100 epoch_Time:27126.0min: [2024-01-04 01:36:19,378][model8_pretrain.py][INFO] Epoch:[0/2](302800/4588595) loss:2.654 lr:0.0000100 epoch_Time:27126.0min: [2024-01-04 01:36:19,378][model8_pretrain.py][INFO] Epoch:[0/2](302800/4588595) loss:2.949 lr:0.0000100 epoch_Time:27126.0min: [2024-01-04 01:36:19,378][model8_pretrain.py][INFO] Epoch:[0/2](302800/4588595) loss:2.968 lr:0.0000100 epoch_Time:27126.0min: [2024-01-04 01:36:19,378][model8_pretrain.py][INFO] 
Epoch:[0/2](302800/4588595) loss:3.148 lr:0.0000100 epoch_Time:27126.0min: [2024-01-04 01:36:19,378][model8_pretrain.py][INFO] Epoch:[0/2](302800/4588595) loss:3.164 lr:0.0000100 epoch_Time:27126.0min: [2024-01-04 01:36:19,378][model8_pretrain.py][INFO] Epoch:[0/2](302800/4588595) loss:3.058 lr:0.0000100 epoch_Time:27126.0min: [2024-01-04 01:36:56,314][model8_pretrain.py][INFO] Epoch:[0/2](302900/4588595) loss:3.418 lr:0.0000100 epoch_Time:27125.0min: [2024-01-04 01:36:56,314][model8_pretrain.py][INFO] Epoch:[0/2](302900/4588595) loss:2.993 lr:0.0000100 epoch_Time:27125.0min: [2024-01-04 01:36:56,314][model8_pretrain.py][INFO] Epoch:[0/2](302900/4588595) loss:2.535 lr:0.0000100 epoch_Time:27125.0min: [2024-01-04 01:36:56,314][model8_pretrain.py][INFO] Epoch:[0/2](302900/4588595) loss:3.515 lr:0.0000100 epoch_Time:27125.0min: [2024-01-04 01:36:56,314][model8_pretrain.py][INFO] Epoch:[0/2](302900/4588595) loss:2.517 lr:0.0000100 epoch_Time:27125.0min: [2024-01-04 01:36:56,314][model8_pretrain.py][INFO] Epoch:[0/2](302900/4588595) loss:3.210 lr:0.0000100 epoch_Time:27125.0min: [2024-01-04 01:36:56,314][model8_pretrain.py][INFO] Epoch:[0/2](302900/4588595) loss:3.097 lr:0.0000100 epoch_Time:27125.0min: [2024-01-04 01:36:56,314][model8_pretrain.py][INFO] Epoch:[0/2](302900/4588595) loss:2.781 lr:0.0000100 epoch_Time:27125.0min: [2024-01-04 01:37:33,252][model8_pretrain.py][INFO] Epoch:[0/2](303000/4588595) loss:2.779 lr:0.0000100 epoch_Time:27125.0min: [2024-01-04 01:37:33,252][model8_pretrain.py][INFO] Epoch:[0/2](303000/4588595) loss:3.140 lr:0.0000100 epoch_Time:27125.0min: [2024-01-04 01:37:33,252][model8_pretrain.py][INFO] Epoch:[0/2](303000/4588595) loss:2.866 lr:0.0000100 epoch_Time:27125.0min: [2024-01-04 01:37:33,252][model8_pretrain.py][INFO] Epoch:[0/2](303000/4588595) loss:2.397 lr:0.0000100 epoch_Time:27125.0min: [2024-01-04 01:37:33,252][model8_pretrain.py][INFO] Epoch:[0/2](303000/4588595) loss:2.606 lr:0.0000100 epoch_Time:27125.0min: [2024-01-04 
01:37:33,252][model8_pretrain.py][INFO] Epoch:[0/2](303000/4588595) loss:2.938 lr:0.0000100 epoch_Time:27125.0min: [2024-01-04 01:37:33,252][model8_pretrain.py][INFO] Epoch:[0/2](303000/4588595) loss:2.950 lr:0.0000100 epoch_Time:27125.0min: [2024-01-04 01:37:33,252][model8_pretrain.py][INFO] Epoch:[0/2](303000/4588595) loss:3.148 lr:0.0000100 epoch_Time:27125.0min: [2024-01-04 01:38:10,204][model8_pretrain.py][INFO] Epoch:[0/2](303100/4588595) loss:3.221 lr:0.0000100 epoch_Time:27123.0min: [2024-01-04 01:38:10,204][model8_pretrain.py][INFO] Epoch:[0/2](303100/4588595) loss:2.733 lr:0.0000100 epoch_Time:27123.0min: [2024-01-04 01:38:10,204][model8_pretrain.py][INFO] Epoch:[0/2](303100/4588595) loss:3.006 lr:0.0000100 epoch_Time:27123.0min: [2024-01-04 01:38:10,204][model8_pretrain.py][INFO] Epoch:[0/2](303100/4588595) loss:2.801 lr:0.0000100 epoch_Time:27123.0min: [2024-01-04 01:38:10,204][model8_pretrain.py][INFO] Epoch:[0/2](303100/4588595) loss:3.054 lr:0.0000100 epoch_Time:27123.0min: [2024-01-04 01:38:10,204][model8_pretrain.py][INFO] Epoch:[0/2](303100/4588595) loss:2.812 lr:0.0000100 epoch_Time:27123.0min: [2024-01-04 01:38:10,205][model8_pretrain.py][INFO] Epoch:[0/2](303100/4588595) loss:2.773 lr:0.0000100 epoch_Time:27123.0min: [2024-01-04 01:38:10,205][model8_pretrain.py][INFO] Epoch:[0/2](303100/4588595) loss:3.197 lr:0.0000100 epoch_Time:27123.0min: [2024-01-04 01:38:47,152][model8_pretrain.py][INFO] Epoch:[0/2](303200/4588595) loss:3.204 lr:0.0000100 epoch_Time:27123.0min: [2024-01-04 01:38:47,152][model8_pretrain.py][INFO] Epoch:[0/2](303200/4588595) loss:2.904 lr:0.0000100 epoch_Time:27123.0min: [2024-01-04 01:38:47,152][model8_pretrain.py][INFO] Epoch:[0/2](303200/4588595) loss:2.947 lr:0.0000100 epoch_Time:27123.0min: [2024-01-04 01:38:47,152][model8_pretrain.py][INFO] Epoch:[0/2](303200/4588595) loss:3.148 lr:0.0000100 epoch_Time:27123.0min: [2024-01-04 01:38:47,152][model8_pretrain.py][INFO] Epoch:[0/2](303200/4588595) loss:3.143 lr:0.0000100 
epoch_Time:27123.0min: [2024-01-04 01:38:47,152][model8_pretrain.py][INFO] Epoch:[0/2](303200/4588595) loss:3.054 lr:0.0000100 epoch_Time:27123.0min: [2024-01-04 01:38:47,152][model8_pretrain.py][INFO] Epoch:[0/2](303200/4588595) loss:2.852 lr:0.0000100 epoch_Time:27123.0min: [2024-01-04 01:38:47,152][model8_pretrain.py][INFO] Epoch:[0/2](303200/4588595) loss:2.447 lr:0.0000100 epoch_Time:27123.0min: [2024-01-04 01:39:24,090][model8_pretrain.py][INFO] Epoch:[0/2](303300/4588595) loss:2.745 lr:0.0000100 epoch_Time:27122.0min: [2024-01-04 01:39:24,090][model8_pretrain.py][INFO] Epoch:[0/2](303300/4588595) loss:2.836 lr:0.0000100 epoch_Time:27122.0min: [2024-01-04 01:39:24,090][model8_pretrain.py][INFO] Epoch:[0/2](303300/4588595) loss:3.407 lr:0.0000100 epoch_Time:27122.0min: [2024-01-04 01:39:24,090][model8_pretrain.py][INFO] Epoch:[0/2](303300/4588595) loss:3.038 lr:0.0000100 epoch_Time:27122.0min: [2024-01-04 01:39:24,090][model8_pretrain.py][INFO] Epoch:[0/2](303300/4588595) loss:2.891 lr:0.0000100 epoch_Time:27122.0min: [2024-01-04 01:39:24,090][model8_pretrain.py][INFO] Epoch:[0/2](303300/4588595) loss:2.716 lr:0.0000100 epoch_Time:27122.0min: [2024-01-04 01:39:24,090][model8_pretrain.py][INFO] Epoch:[0/2](303300/4588595) loss:2.793 lr:0.0000100 epoch_Time:27122.0min: [2024-01-04 01:39:24,090][model8_pretrain.py][INFO] Epoch:[0/2](303300/4588595) loss:3.290 lr:0.0000100 epoch_Time:27122.0min: [2024-01-04 01:40:01,038][model8_pretrain.py][INFO] Epoch:[0/2](303400/4588595) loss:2.505 lr:0.0000100 epoch_Time:27121.0min: [2024-01-04 01:40:01,038][model8_pretrain.py][INFO] Epoch:[0/2](303400/4588595) loss:3.281 lr:0.0000100 epoch_Time:27121.0min: [2024-01-04 01:40:01,038][model8_pretrain.py][INFO] Epoch:[0/2](303400/4588595) loss:2.136 lr:0.0000100 epoch_Time:27121.0min: [2024-01-04 01:40:01,038][model8_pretrain.py][INFO] Epoch:[0/2](303400/4588595) loss:2.948 lr:0.0000100 epoch_Time:27121.0min: [2024-01-04 01:40:01,038][model8_pretrain.py][INFO] 
Epoch:[0/2](303400/4588595) loss:2.131 lr:0.0000100 epoch_Time:27121.0min: [2024-01-04 01:40:01,038][model8_pretrain.py][INFO] Epoch:[0/2](303400/4588595) loss:2.743 lr:0.0000100 epoch_Time:27121.0min: [2024-01-04 01:40:01,038][model8_pretrain.py][INFO] Epoch:[0/2](303400/4588595) loss:2.530 lr:0.0000100 epoch_Time:27121.0min: [2024-01-04 01:40:01,038][model8_pretrain.py][INFO] Epoch:[0/2](303400/4588595) loss:3.277 lr:0.0000100 epoch_Time:27121.0min: [2024-01-04 01:40:47,953][model8_pretrain.py][INFO] Epoch:[0/2](303500/4588595) loss:3.354 lr:0.0000100 epoch_Time:27122.0min: [2024-01-04 01:40:47,953][model8_pretrain.py][INFO] Epoch:[0/2](303500/4588595) loss:2.908 lr:0.0000100 epoch_Time:27122.0min: [2024-01-04 01:40:47,953][model8_pretrain.py][INFO] Epoch:[0/2](303500/4588595) loss:2.913 lr:0.0000100 epoch_Time:27122.0min: [2024-01-04 01:40:47,953][model8_pretrain.py][INFO] Epoch:[0/2](303500/4588595) loss:3.248 lr:0.0000100 epoch_Time:27122.0min: [2024-01-04 01:40:47,953][model8_pretrain.py][INFO] Epoch:[0/2](303500/4588595) loss:2.506 lr:0.0000100 epoch_Time:27122.0min: [2024-01-04 01:40:47,953][model8_pretrain.py][INFO] Epoch:[0/2](303500/4588595) loss:2.832 lr:0.0000100 epoch_Time:27122.0min: [2024-01-04 01:40:47,953][model8_pretrain.py][INFO] Epoch:[0/2](303500/4588595) loss:3.080 lr:0.0000100 epoch_Time:27122.0min: [2024-01-04 01:40:47,953][model8_pretrain.py][INFO] Epoch:[0/2](303500/4588595) loss:3.284 lr:0.0000100 epoch_Time:27122.0min: [2024-01-04 01:41:24,847][model8_pretrain.py][INFO] Epoch:[0/2](303600/4588595) loss:3.023 lr:0.0000100 epoch_Time:27122.0min: [2024-01-04 01:41:24,847][model8_pretrain.py][INFO] Epoch:[0/2](303600/4588595) loss:3.257 lr:0.0000100 epoch_Time:27122.0min: [2024-01-04 01:41:24,847][model8_pretrain.py][INFO] Epoch:[0/2](303600/4588595) loss:3.087 lr:0.0000100 epoch_Time:27122.0min: [2024-01-04 01:41:24,847][model8_pretrain.py][INFO] Epoch:[0/2](303600/4588595) loss:2.959 lr:0.0000100 epoch_Time:27122.0min: [2024-01-04 
01:41:24,847][model8_pretrain.py][INFO] Epoch:[0/2](303600/4588595) loss:2.899 lr:0.0000100 epoch_Time:27122.0min: [2024-01-04 01:41:24,848][model8_pretrain.py][INFO] Epoch:[0/2](303600/4588595) loss:3.148 lr:0.0000100 epoch_Time:27122.0min: [2024-01-04 01:41:24,847][model8_pretrain.py][INFO] Epoch:[0/2](303600/4588595) loss:2.647 lr:0.0000100 epoch_Time:27122.0min: [2024-01-04 01:41:24,848][model8_pretrain.py][INFO] Epoch:[0/2](303600/4588595) loss:2.920 lr:0.0000100 epoch_Time:27122.0min: [2024-01-04 01:42:01,785][model8_pretrain.py][INFO] Epoch:[0/2](303700/4588595) loss:3.339 lr:0.0000100 epoch_Time:27120.0min: [2024-01-04 01:42:01,785][model8_pretrain.py][INFO] Epoch:[0/2](303700/4588595) loss:2.630 lr:0.0000100 epoch_Time:27120.0min: [2024-01-04 01:42:01,785][model8_pretrain.py][INFO] Epoch:[0/2](303700/4588595) loss:2.550 lr:0.0000100 epoch_Time:27120.0min: [2024-01-04 01:42:01,785][model8_pretrain.py][INFO] Epoch:[0/2](303700/4588595) loss:3.468 lr:0.0000100 epoch_Time:27120.0min: [2024-01-04 01:42:01,785][model8_pretrain.py][INFO] Epoch:[0/2](303700/4588595) loss:2.904 lr:0.0000100 epoch_Time:27120.0min: [2024-01-04 01:42:01,785][model8_pretrain.py][INFO] Epoch:[0/2](303700/4588595) loss:3.307 lr:0.0000100 epoch_Time:27120.0min: [2024-01-04 01:42:01,785][model8_pretrain.py][INFO] Epoch:[0/2](303700/4588595) loss:2.209 lr:0.0000100 epoch_Time:27120.0min: [2024-01-04 01:42:01,785][model8_pretrain.py][INFO] Epoch:[0/2](303700/4588595) loss:3.478 lr:0.0000100 epoch_Time:27120.0min: [2024-01-04 01:42:38,726][model8_pretrain.py][INFO] Epoch:[0/2](303800/4588595) loss:2.922 lr:0.0000100 epoch_Time:27120.0min: [2024-01-04 01:42:38,726][model8_pretrain.py][INFO] Epoch:[0/2](303800/4588595) loss:2.288 lr:0.0000100 epoch_Time:27120.0min: [2024-01-04 01:42:38,726][model8_pretrain.py][INFO] Epoch:[0/2](303800/4588595) loss:3.358 lr:0.0000100 epoch_Time:27120.0min: [2024-01-04 01:42:38,726][model8_pretrain.py][INFO] Epoch:[0/2](303800/4588595) loss:3.143 lr:0.0000100 
epoch_Time:27120.0min: [2024-01-04 01:42:38,726][model8_pretrain.py][INFO] Epoch:[0/2](303800/4588595) loss:2.419 lr:0.0000100 epoch_Time:27120.0min: [2024-01-04 01:42:38,726][model8_pretrain.py][INFO] Epoch:[0/2](303800/4588595) loss:2.701 lr:0.0000100 epoch_Time:27120.0min: [2024-01-04 01:42:38,726][model8_pretrain.py][INFO] Epoch:[0/2](303800/4588595) loss:3.257 lr:0.0000100 epoch_Time:27120.0min: [2024-01-04 01:42:38,727][model8_pretrain.py][INFO] Epoch:[0/2](303800/4588595) loss:2.711 lr:0.0000100 epoch_Time:27120.0min: [2024-01-04 01:43:15,662][model8_pretrain.py][INFO] Epoch:[0/2](303900/4588595) loss:3.016 lr:0.0000100 epoch_Time:27119.0min: [2024-01-04 01:43:15,662][model8_pretrain.py][INFO] Epoch:[0/2](303900/4588595) loss:2.779 lr:0.0000100 epoch_Time:27119.0min: [2024-01-04 01:43:15,662][model8_pretrain.py][INFO] Epoch:[0/2](303900/4588595) loss:2.076 lr:0.0000100 epoch_Time:27119.0min: [2024-01-04 01:43:15,662][model8_pretrain.py][INFO] Epoch:[0/2](303900/4588595) loss:3.146 lr:0.0000100 epoch_Time:27119.0min: [2024-01-04 01:43:15,662][model8_pretrain.py][INFO] Epoch:[0/2](303900/4588595) loss:2.767 lr:0.0000100 epoch_Time:27119.0min: [2024-01-04 01:43:15,662][model8_pretrain.py][INFO] Epoch:[0/2](303900/4588595) loss:3.354 lr:0.0000100 epoch_Time:27119.0min: [2024-01-04 01:43:15,662][model8_pretrain.py][INFO] Epoch:[0/2](303900/4588595) loss:3.183 lr:0.0000100 epoch_Time:27119.0min: [2024-01-04 01:43:15,662][model8_pretrain.py][INFO] Epoch:[0/2](303900/4588595) loss:2.756 lr:0.0000100 epoch_Time:27119.0min: [2024-01-04 01:43:52,596][model8_pretrain.py][INFO] Epoch:[0/2](304000/4588595) loss:3.266 lr:0.0000100 epoch_Time:27118.0min: [2024-01-04 01:43:52,596][model8_pretrain.py][INFO] Epoch:[0/2](304000/4588595) loss:2.921 lr:0.0000100 epoch_Time:27118.0min: [2024-01-04 01:43:52,596][model8_pretrain.py][INFO] Epoch:[0/2](304000/4588595) loss:2.916 lr:0.0000100 epoch_Time:27118.0min: [2024-01-04 01:43:52,596][model8_pretrain.py][INFO] 
Epoch:[0/2](304000/4588595) loss:2.770 lr:0.0000100 epoch_Time:27118.0min: [2024-01-04 01:43:52,596][model8_pretrain.py][INFO] Epoch:[0/2](304000/4588595) loss:3.293 lr:0.0000100 epoch_Time:27118.0min: [2024-01-04 01:43:52,596][model8_pretrain.py][INFO] Epoch:[0/2](304000/4588595) loss:2.940 lr:0.0000100 epoch_Time:27118.0min: [2024-01-04 01:43:52,596][model8_pretrain.py][INFO] Epoch:[0/2](304000/4588595) loss:2.846 lr:0.0000100 epoch_Time:27118.0min: [2024-01-04 01:43:52,597][model8_pretrain.py][INFO] Epoch:[0/2](304000/4588595) loss:3.273 lr:0.0000100 epoch_Time:27118.0min: [2024-01-04 01:44:29,536][model8_pretrain.py][INFO] Epoch:[0/2](304100/4588595) loss:3.201 lr:0.0000100 epoch_Time:27117.0min: [2024-01-04 01:44:29,536][model8_pretrain.py][INFO] Epoch:[0/2](304100/4588595) loss:2.823 lr:0.0000100 epoch_Time:27117.0min: [2024-01-04 01:44:29,536][model8_pretrain.py][INFO] Epoch:[0/2](304100/4588595) loss:2.478 lr:0.0000100 epoch_Time:27117.0min: [2024-01-04 01:44:29,536][model8_pretrain.py][INFO] Epoch:[0/2](304100/4588595) loss:2.868 lr:0.0000100 epoch_Time:27117.0min: [2024-01-04 01:44:29,536][model8_pretrain.py][INFO] Epoch:[0/2](304100/4588595) loss:2.913 lr:0.0000100 epoch_Time:27117.0min: [2024-01-04 01:44:29,536][model8_pretrain.py][INFO] Epoch:[0/2](304100/4588595) loss:2.641 lr:0.0000100 epoch_Time:27117.0min: [2024-01-04 01:44:29,536][model8_pretrain.py][INFO] Epoch:[0/2](304100/4588595) loss:3.118 lr:0.0000100 epoch_Time:27117.0min: [2024-01-04 01:44:29,537][model8_pretrain.py][INFO] Epoch:[0/2](304100/4588595) loss:3.011 lr:0.0000100 epoch_Time:27117.0min: [2024-01-04 01:45:06,473][model8_pretrain.py][INFO] Epoch:[0/2](304200/4588595) loss:2.957 lr:0.0000100 epoch_Time:27116.0min: [2024-01-04 01:45:06,473][model8_pretrain.py][INFO] Epoch:[0/2](304200/4588595) loss:2.923 lr:0.0000100 epoch_Time:27116.0min: [2024-01-04 01:45:06,473][model8_pretrain.py][INFO] Epoch:[0/2](304200/4588595) loss:2.547 lr:0.0000100 epoch_Time:27116.0min: [2024-01-04 
01:45:06,473][model8_pretrain.py][INFO] Epoch:[0/2](304200/4588595) loss:3.367 lr:0.0000100 epoch_Time:27116.0min: [2024-01-04 01:45:06,473][model8_pretrain.py][INFO] Epoch:[0/2](304200/4588595) loss:2.417 lr:0.0000100 epoch_Time:27116.0min: [2024-01-04 01:45:06,473][model8_pretrain.py][INFO] Epoch:[0/2](304200/4588595) loss:2.802 lr:0.0000100 epoch_Time:27116.0min: [2024-01-04 01:45:06,473][model8_pretrain.py][INFO] Epoch:[0/2](304200/4588595) loss:3.102 lr:0.0000100 epoch_Time:27116.0min: [2024-01-04 01:45:06,473][model8_pretrain.py][INFO] Epoch:[0/2](304200/4588595) loss:2.949 lr:0.0000100 epoch_Time:27116.0min: [2024-01-04 01:45:53,544][model8_pretrain.py][INFO] Epoch:[0/2](304300/4588595) loss:3.182 lr:0.0000100 epoch_Time:27117.0min: [2024-01-04 01:45:53,544][model8_pretrain.py][INFO] Epoch:[0/2](304300/4588595) loss:3.323 lr:0.0000100 epoch_Time:27117.0min: [2024-01-04 01:45:53,544][model8_pretrain.py][INFO] Epoch:[0/2](304300/4588595) loss:2.999 lr:0.0000100 epoch_Time:27117.0min: [2024-01-04 01:45:53,544][model8_pretrain.py][INFO] Epoch:[0/2](304300/4588595) loss:3.047 lr:0.0000100 epoch_Time:27117.0min: [2024-01-04 01:45:53,544][model8_pretrain.py][INFO] Epoch:[0/2](304300/4588595) loss:2.652 lr:0.0000100 epoch_Time:27117.0min: [2024-01-04 01:45:53,544][model8_pretrain.py][INFO] Epoch:[0/2](304300/4588595) loss:3.330 lr:0.0000100 epoch_Time:27117.0min: [2024-01-04 01:45:53,544][model8_pretrain.py][INFO] Epoch:[0/2](304300/4588595) loss:2.589 lr:0.0000100 epoch_Time:27117.0min: [2024-01-04 01:45:53,544][model8_pretrain.py][INFO] Epoch:[0/2](304300/4588595) loss:2.808 lr:0.0000100 epoch_Time:27117.0min: [2024-01-04 01:46:30,476][model8_pretrain.py][INFO] Epoch:[0/2](304400/4588595) loss:3.030 lr:0.0000100 epoch_Time:27117.0min: [2024-01-04 01:46:30,476][model8_pretrain.py][INFO] Epoch:[0/2](304400/4588595) loss:2.769 lr:0.0000100 epoch_Time:27117.0min: [2024-01-04 01:46:30,476][model8_pretrain.py][INFO] Epoch:[0/2](304400/4588595) loss:2.837 lr:0.0000100 
epoch_Time:27117.0min: [2024-01-04 01:46:30,476][model8_pretrain.py][INFO] Epoch:[0/2](304400/4588595) loss:2.781 lr:0.0000100 epoch_Time:27117.0min: [2024-01-04 01:46:30,476][model8_pretrain.py][INFO] Epoch:[0/2](304400/4588595) loss:3.247 lr:0.0000100 epoch_Time:27117.0min: [2024-01-04 01:46:30,476][model8_pretrain.py][INFO] Epoch:[0/2](304400/4588595) loss:2.406 lr:0.0000100 epoch_Time:27117.0min: [2024-01-04 01:46:30,476][model8_pretrain.py][INFO] Epoch:[0/2](304400/4588595) loss:2.746 lr:0.0000100 epoch_Time:27117.0min: [2024-01-04 01:46:30,476][model8_pretrain.py][INFO] Epoch:[0/2](304400/4588595) loss:2.743 lr:0.0000100 epoch_Time:27117.0min: [2024-01-04 01:47:07,421][model8_pretrain.py][INFO] Epoch:[0/2](304500/4588595) loss:2.920 lr:0.0000100 epoch_Time:27116.0min: [2024-01-04 01:47:07,421][model8_pretrain.py][INFO] Epoch:[0/2](304500/4588595) loss:2.879 lr:0.0000100 epoch_Time:27116.0min: [2024-01-04 01:47:07,421][model8_pretrain.py][INFO] Epoch:[0/2](304500/4588595) loss:2.979 lr:0.0000100 epoch_Time:27116.0min: [2024-01-04 01:47:07,421][model8_pretrain.py][INFO] Epoch:[0/2](304500/4588595) loss:2.969 lr:0.0000100 epoch_Time:27116.0min: [2024-01-04 01:47:07,421][model8_pretrain.py][INFO] Epoch:[0/2](304500/4588595) loss:3.245 lr:0.0000100 epoch_Time:27116.0min: [2024-01-04 01:47:07,421][model8_pretrain.py][INFO] Epoch:[0/2](304500/4588595) loss:3.019 lr:0.0000100 epoch_Time:27116.0min: [2024-01-04 01:47:07,421][model8_pretrain.py][INFO] Epoch:[0/2](304500/4588595) loss:3.183 lr:0.0000100 epoch_Time:27116.0min: [2024-01-04 01:47:07,421][model8_pretrain.py][INFO] Epoch:[0/2](304500/4588595) loss:2.892 lr:0.0000100 epoch_Time:27116.0min: [2024-01-04 01:47:44,354][model8_pretrain.py][INFO] Epoch:[0/2](304600/4588595) loss:2.672 lr:0.0000100 epoch_Time:27116.0min: [2024-01-04 01:47:44,354][model8_pretrain.py][INFO] Epoch:[0/2](304600/4588595) loss:2.984 lr:0.0000100 epoch_Time:27116.0min: [2024-01-04 01:47:44,354][model8_pretrain.py][INFO] 
Epoch:[0/2](304600/4588595) loss:2.865 lr:0.0000100 epoch_Time:27116.0min: [2024-01-04 01:47:44,354][model8_pretrain.py][INFO] Epoch:[0/2](304600/4588595) loss:3.139 lr:0.0000100 epoch_Time:27116.0min: [2024-01-04 01:47:44,354][model8_pretrain.py][INFO] Epoch:[0/2](304600/4588595) loss:2.689 lr:0.0000100 epoch_Time:27116.0min: [2024-01-04 01:47:44,354][model8_pretrain.py][INFO] Epoch:[0/2](304600/4588595) loss:2.281 lr:0.0000100 epoch_Time:27116.0min: [2024-01-04 01:47:44,354][model8_pretrain.py][INFO] Epoch:[0/2](304600/4588595) loss:3.224 lr:0.0000100 epoch_Time:27116.0min: [2024-01-04 01:47:44,354][model8_pretrain.py][INFO] Epoch:[0/2](304600/4588595) loss:3.168 lr:0.0000100 epoch_Time:27116.0min: [2024-01-04 01:48:21,302][model8_pretrain.py][INFO] Epoch:[0/2](304700/4588595) loss:2.889 lr:0.0000100 epoch_Time:27114.0min: [2024-01-04 01:48:21,302][model8_pretrain.py][INFO] Epoch:[0/2](304700/4588595) loss:2.600 lr:0.0000100 epoch_Time:27114.0min: [2024-01-04 01:48:21,302][model8_pretrain.py][INFO] Epoch:[0/2](304700/4588595) loss:3.005 lr:0.0000100 epoch_Time:27114.0min: [2024-01-04 01:48:21,302][model8_pretrain.py][INFO] Epoch:[0/2](304700/4588595) loss:3.544 lr:0.0000100 epoch_Time:27114.0min: [2024-01-04 01:48:21,302][model8_pretrain.py][INFO] Epoch:[0/2](304700/4588595) loss:2.881 lr:0.0000100 epoch_Time:27114.0min: [2024-01-04 01:48:21,302][model8_pretrain.py][INFO] Epoch:[0/2](304700/4588595) loss:3.085 lr:0.0000100 epoch_Time:27114.0min: [2024-01-04 01:48:21,302][model8_pretrain.py][INFO] Epoch:[0/2](304700/4588595) loss:2.836 lr:0.0000100 epoch_Time:27114.0min: [2024-01-04 01:48:21,302][model8_pretrain.py][INFO] Epoch:[0/2](304700/4588595) loss:3.120 lr:0.0000100 epoch_Time:27114.0min: [2024-01-04 01:48:58,245][model8_pretrain.py][INFO] Epoch:[0/2](304800/4588595) loss:3.346 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:48:58,246][model8_pretrain.py][INFO] Epoch:[0/2](304800/4588595) loss:3.219 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 
01:48:58,246][model8_pretrain.py][INFO] Epoch:[0/2](304800/4588595) loss:3.294 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:48:58,246][model8_pretrain.py][INFO] Epoch:[0/2](304800/4588595) loss:2.896 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:48:58,246][model8_pretrain.py][INFO] Epoch:[0/2](304800/4588595) loss:2.874 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:48:58,246][model8_pretrain.py][INFO] Epoch:[0/2](304800/4588595) loss:3.577 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:48:58,246][model8_pretrain.py][INFO] Epoch:[0/2](304800/4588595) loss:3.142 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:48:58,246][model8_pretrain.py][INFO] Epoch:[0/2](304800/4588595) loss:3.035 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:49:35,180][model8_pretrain.py][INFO] Epoch:[0/2](304900/4588595) loss:2.978 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:49:35,180][model8_pretrain.py][INFO] Epoch:[0/2](304900/4588595) loss:3.003 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:49:35,180][model8_pretrain.py][INFO] Epoch:[0/2](304900/4588595) loss:3.007 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:49:35,180][model8_pretrain.py][INFO] Epoch:[0/2](304900/4588595) loss:2.790 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:49:35,180][model8_pretrain.py][INFO] Epoch:[0/2](304900/4588595) loss:2.713 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:49:35,180][model8_pretrain.py][INFO] Epoch:[0/2](304900/4588595) loss:3.079 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:49:35,180][model8_pretrain.py][INFO] Epoch:[0/2](304900/4588595) loss:2.954 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:49:35,181][model8_pretrain.py][INFO] Epoch:[0/2](304900/4588595) loss:2.961 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:50:12,125][model8_pretrain.py][INFO] Epoch:[0/2](305000/4588595) loss:3.209 lr:0.0000100 epoch_Time:27112.0min: [2024-01-04 01:50:12,125][model8_pretrain.py][INFO] Epoch:[0/2](305000/4588595) loss:2.565 lr:0.0000100 
epoch_Time:27112.0min: [2024-01-04 01:50:12,125][model8_pretrain.py][INFO] Epoch:[0/2](305000/4588595) loss:2.766 lr:0.0000100 epoch_Time:27112.0min: [2024-01-04 01:50:12,125][model8_pretrain.py][INFO] Epoch:[0/2](305000/4588595) loss:3.274 lr:0.0000100 epoch_Time:27112.0min: [2024-01-04 01:50:12,125][model8_pretrain.py][INFO] Epoch:[0/2](305000/4588595) loss:2.844 lr:0.0000100 epoch_Time:27112.0min: [2024-01-04 01:50:12,125][model8_pretrain.py][INFO] Epoch:[0/2](305000/4588595) loss:3.101 lr:0.0000100 epoch_Time:27112.0min: [2024-01-04 01:50:12,125][model8_pretrain.py][INFO] Epoch:[0/2](305000/4588595) loss:3.029 lr:0.0000100 epoch_Time:27112.0min: [2024-01-04 01:50:12,125][model8_pretrain.py][INFO] Epoch:[0/2](305000/4588595) loss:2.893 lr:0.0000100 epoch_Time:27112.0min: [2024-01-04 01:50:59,347][model8_pretrain.py][INFO] Epoch:[0/2](305100/4588595) loss:3.183 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:50:59,347][model8_pretrain.py][INFO] Epoch:[0/2](305100/4588595) loss:3.287 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:50:59,347][model8_pretrain.py][INFO] Epoch:[0/2](305100/4588595) loss:2.524 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:50:59,347][model8_pretrain.py][INFO] Epoch:[0/2](305100/4588595) loss:2.531 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:50:59,347][model8_pretrain.py][INFO] Epoch:[0/2](305100/4588595) loss:2.550 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:50:59,347][model8_pretrain.py][INFO] Epoch:[0/2](305100/4588595) loss:3.127 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:50:59,347][model8_pretrain.py][INFO] Epoch:[0/2](305100/4588595) loss:2.968 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:50:59,347][model8_pretrain.py][INFO] Epoch:[0/2](305100/4588595) loss:3.290 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:51:36,263][model8_pretrain.py][INFO] Epoch:[0/2](305200/4588595) loss:3.121 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:51:36,263][model8_pretrain.py][INFO] 
Epoch:[0/2](305200/4588595) loss:2.959 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:51:36,263][model8_pretrain.py][INFO] Epoch:[0/2](305200/4588595) loss:2.471 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:51:36,263][model8_pretrain.py][INFO] Epoch:[0/2](305200/4588595) loss:2.563 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:51:36,263][model8_pretrain.py][INFO] Epoch:[0/2](305200/4588595) loss:2.947 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:51:36,263][model8_pretrain.py][INFO] Epoch:[0/2](305200/4588595) loss:2.501 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:51:36,263][model8_pretrain.py][INFO] Epoch:[0/2](305200/4588595) loss:3.053 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:51:36,263][model8_pretrain.py][INFO] Epoch:[0/2](305200/4588595) loss:3.205 lr:0.0000100 epoch_Time:27113.0min: [2024-01-04 01:52:13,199][model8_pretrain.py][INFO] Epoch:[0/2](305300/4588595) loss:2.906 lr:0.0000100 epoch_Time:27111.0min: [2024-01-04 01:52:13,199][model8_pretrain.py][INFO] Epoch:[0/2](305300/4588595) loss:2.207 lr:0.0000100 epoch_Time:27111.0min: [2024-01-04 01:52:13,199][model8_pretrain.py][INFO] Epoch:[0/2](305300/4588595) loss:2.985 lr:0.0000100 epoch_Time:27111.0min: [2024-01-04 01:52:13,199][model8_pretrain.py][INFO] Epoch:[0/2](305300/4588595) loss:2.716 lr:0.0000100 epoch_Time:27111.0min: [2024-01-04 01:52:13,199][model8_pretrain.py][INFO] Epoch:[0/2](305300/4588595) loss:2.837 lr:0.0000100 epoch_Time:27111.0min: [2024-01-04 01:52:13,199][model8_pretrain.py][INFO] Epoch:[0/2](305300/4588595) loss:2.993 lr:0.0000100 epoch_Time:27111.0min: [2024-01-04 01:52:13,199][model8_pretrain.py][INFO] Epoch:[0/2](305300/4588595) loss:2.127 lr:0.0000100 epoch_Time:27111.0min: [2024-01-04 01:52:13,199][model8_pretrain.py][INFO] Epoch:[0/2](305300/4588595) loss:3.144 lr:0.0000100 epoch_Time:27111.0min: [2024-01-04 01:52:50,133][model8_pretrain.py][INFO] Epoch:[0/2](305400/4588595) loss:2.949 lr:0.0000100 epoch_Time:27110.0min: [2024-01-04 
01:52:50,133][model8_pretrain.py][INFO] Epoch:[0/2](305400/4588595) loss:2.756 lr:0.0000100 epoch_Time:27110.0min: [2024-01-04 01:52:50,133][model8_pretrain.py][INFO] Epoch:[0/2](305400/4588595) loss:3.619 lr:0.0000100 epoch_Time:27110.0min: [2024-01-04 01:52:50,133][model8_pretrain.py][INFO] Epoch:[0/2](305400/4588595) loss:3.008 lr:0.0000100 epoch_Time:27110.0min: [2024-01-04 01:52:50,133][model8_pretrain.py][INFO] Epoch:[0/2](305400/4588595) loss:2.882 lr:0.0000100 epoch_Time:27110.0min: [2024-01-04 01:52:50,133][model8_pretrain.py][INFO] Epoch:[0/2](305400/4588595) loss:2.887 lr:0.0000100 epoch_Time:27110.0min: [2024-01-04 01:52:50,133][model8_pretrain.py][INFO] Epoch:[0/2](305400/4588595) loss:2.798 lr:0.0000100 epoch_Time:27110.0min: [2024-01-04 01:52:50,133][model8_pretrain.py][INFO] Epoch:[0/2](305400/4588595) loss:2.881 lr:0.0000100 epoch_Time:27110.0min: [2024-01-04 01:53:27,068][model8_pretrain.py][INFO] Epoch:[0/2](305500/4588595) loss:2.841 lr:0.0000100 epoch_Time:27110.0min: [2024-01-04 01:53:27,068][model8_pretrain.py][INFO] Epoch:[0/2](305500/4588595) loss:2.676 lr:0.0000100 epoch_Time:27110.0min: [2024-01-04 01:53:27,068][model8_pretrain.py][INFO] Epoch:[0/2](305500/4588595) loss:3.067 lr:0.0000100 epoch_Time:27110.0min: [2024-01-04 01:53:27,068][model8_pretrain.py][INFO] Epoch:[0/2](305500/4588595) loss:2.936 lr:0.0000100 epoch_Time:27110.0min: [2024-01-04 01:53:27,068][model8_pretrain.py][INFO] Epoch:[0/2](305500/4588595) loss:3.031 lr:0.0000100 epoch_Time:27110.0min: [2024-01-04 01:53:27,068][model8_pretrain.py][INFO] Epoch:[0/2](305500/4588595) loss:2.398 lr:0.0000100 epoch_Time:27110.0min: [2024-01-04 01:53:27,068][model8_pretrain.py][INFO] Epoch:[0/2](305500/4588595) loss:2.623 lr:0.0000100 epoch_Time:27110.0min: [2024-01-04 01:53:27,068][model8_pretrain.py][INFO] Epoch:[0/2](305500/4588595) loss:3.305 lr:0.0000100 epoch_Time:27110.0min: [2024-01-04 01:54:04,006][model8_pretrain.py][INFO] Epoch:[0/2](305600/4588595) loss:2.872 lr:0.0000100 
epoch_Time:27109.0min: [2024-01-04 01:54:04,006][model8_pretrain.py][INFO] Epoch:[0/2](305600/4588595) loss:2.531 lr:0.0000100 epoch_Time:27109.0min: [2024-01-04 01:54:04,006][model8_pretrain.py][INFO] Epoch:[0/2](305600/4588595) loss:2.747 lr:0.0000100 epoch_Time:27109.0min: [2024-01-04 01:54:04,006][model8_pretrain.py][INFO] Epoch:[0/2](305600/4588595) loss:2.728 lr:0.0000100 epoch_Time:27109.0min: [2024-01-04 01:54:04,006][model8_pretrain.py][INFO] Epoch:[0/2](305600/4588595) loss:2.862 lr:0.0000100 epoch_Time:27109.0min: [2024-01-04 01:54:04,006][model8_pretrain.py][INFO] Epoch:[0/2](305600/4588595) loss:3.020 lr:0.0000100 epoch_Time:27109.0min: [2024-01-04 01:54:04,006][model8_pretrain.py][INFO] Epoch:[0/2](305600/4588595) loss:3.319 lr:0.0000100 epoch_Time:27109.0min: [2024-01-04 01:54:04,006][model8_pretrain.py][INFO] Epoch:[0/2](305600/4588595) loss:3.394 lr:0.0000100 epoch_Time:27109.0min: [2024-01-04 01:54:40,939][model8_pretrain.py][INFO] Epoch:[0/2](305700/4588595) loss:3.031 lr:0.0000100 epoch_Time:27108.0min: [2024-01-04 01:54:40,939][model8_pretrain.py][INFO] Epoch:[0/2](305700/4588595) loss:2.644 lr:0.0000100 epoch_Time:27108.0min: [2024-01-04 01:54:40,939][model8_pretrain.py][INFO] Epoch:[0/2](305700/4588595) loss:2.726 lr:0.0000100 epoch_Time:27108.0min: [2024-01-04 01:54:40,939][model8_pretrain.py][INFO] Epoch:[0/2](305700/4588595) loss:3.014 lr:0.0000100 epoch_Time:27108.0min: [2024-01-04 01:54:40,939][model8_pretrain.py][INFO] Epoch:[0/2](305700/4588595) loss:2.881 lr:0.0000100 epoch_Time:27108.0min: [2024-01-04 01:54:40,940][model8_pretrain.py][INFO] Epoch:[0/2](305700/4588595) loss:2.781 lr:0.0000100 epoch_Time:27108.0min: [2024-01-04 01:54:40,940][model8_pretrain.py][INFO] Epoch:[0/2](305700/4588595) loss:3.071 lr:0.0000100 epoch_Time:27108.0min: [2024-01-04 01:54:40,940][model8_pretrain.py][INFO] Epoch:[0/2](305700/4588595) loss:2.260 lr:0.0000100 epoch_Time:27108.0min: [2024-01-04 01:55:17,876][model8_pretrain.py][INFO] 
Epoch:[0/2](305800/4588595) loss:3.117 lr:0.0000100 epoch_Time:27107.0min: [2024-01-04 01:55:17,876][model8_pretrain.py][INFO] Epoch:[0/2](305800/4588595) loss:2.380 lr:0.0000100 epoch_Time:27107.0min: [2024-01-04 01:55:17,876][model8_pretrain.py][INFO] Epoch:[0/2](305800/4588595) loss:3.351 lr:0.0000100 epoch_Time:27107.0min: [2024-01-04 01:55:17,876][model8_pretrain.py][INFO] Epoch:[0/2](305800/4588595) loss:2.735 lr:0.0000100 epoch_Time:27107.0min: [2024-01-04 01:55:17,876][model8_pretrain.py][INFO] Epoch:[0/2](305800/4588595) loss:2.971 lr:0.0000100 epoch_Time:27107.0min: [2024-01-04 01:55:17,876][model8_pretrain.py][INFO] Epoch:[0/2](305800/4588595) loss:2.696 lr:0.0000100 epoch_Time:27107.0min: [2024-01-04 01:55:17,876][model8_pretrain.py][INFO] Epoch:[0/2](305800/4588595) loss:2.935 lr:0.0000100 epoch_Time:27107.0min: [2024-01-04 01:55:17,876][model8_pretrain.py][INFO] Epoch:[0/2](305800/4588595) loss:3.305 lr:0.0000100 epoch_Time:27107.0min: [2024-01-04 01:56:05,303][model8_pretrain.py][INFO] Epoch:[0/2](305900/4588595) loss:2.565 lr:0.0000100 epoch_Time:27108.0min: [2024-01-04 01:56:05,303][model8_pretrain.py][INFO] Epoch:[0/2](305900/4588595) loss:2.726 lr:0.0000100 epoch_Time:27108.0min: [2024-01-04 01:56:05,303][model8_pretrain.py][INFO] Epoch:[0/2](305900/4588595) loss:3.038 lr:0.0000100 epoch_Time:27108.0min: [2024-01-04 01:56:05,303][model8_pretrain.py][INFO] Epoch:[0/2](305900/4588595) loss:2.498 lr:0.0000100 epoch_Time:27108.0min: [2024-01-04 01:56:05,303][model8_pretrain.py][INFO] Epoch:[0/2](305900/4588595) loss:3.176 lr:0.0000100 epoch_Time:27108.0min: [2024-01-04 01:56:05,303][model8_pretrain.py][INFO] Epoch:[0/2](305900/4588595) loss:2.257 lr:0.0000100 epoch_Time:27108.0min: [2024-01-04 01:56:05,303][model8_pretrain.py][INFO] Epoch:[0/2](305900/4588595) loss:2.913 lr:0.0000100 epoch_Time:27108.0min: [2024-01-04 01:56:05,303][model8_pretrain.py][INFO] Epoch:[0/2](305900/4588595) loss:2.964 lr:0.0000100 epoch_Time:27108.0min: [2024-01-04 
01:56:42,224][model8_pretrain.py][INFO] Epoch:[0/2](306000/4588595) loss:2.931 lr:0.0000100 epoch_Time:27108.0min: [2024-01-04 01:56:42,224][model8_pretrain.py][INFO] Epoch:[0/2](306000/4588595) loss:3.110 lr:0.0000100 epoch_Time:27108.0min: [2024-01-04 01:56:42,224][model8_pretrain.py][INFO] Epoch:[0/2](306000/4588595) loss:3.437 lr:0.0000100 epoch_Time:27108.0min: [2024-01-04 01:56:42,224][model8_pretrain.py][INFO] Epoch:[0/2](306000/4588595) loss:3.359 lr:0.0000100 epoch_Time:27108.0min: [2024-01-04 01:56:42,224][model8_pretrain.py][INFO] Epoch:[0/2](306000/4588595) loss:2.715 lr:0.0000100 epoch_Time:27108.0min: [2024-01-04 01:56:42,224][model8_pretrain.py][INFO] Epoch:[0/2](306000/4588595) loss:3.008 lr:0.0000100 epoch_Time:27108.0min: [2024-01-04 01:56:42,224][model8_pretrain.py][INFO] Epoch:[0/2](306000/4588595) loss:2.933 lr:0.0000100 epoch_Time:27108.0min: [2024-01-04 01:56:42,224][model8_pretrain.py][INFO] Epoch:[0/2](306000/4588595) loss:2.659 lr:0.0000100 epoch_Time:27108.0min: [2024-01-04 01:57:19,162][model8_pretrain.py][INFO] Epoch:[0/2](306100/4588595) loss:2.898 lr:0.0000100 epoch_Time:27107.0min: [2024-01-04 01:57:19,162][model8_pretrain.py][INFO] Epoch:[0/2](306100/4588595) loss:3.272 lr:0.0000100 epoch_Time:27107.0min: [2024-01-04 01:57:19,162][model8_pretrain.py][INFO] Epoch:[0/2](306100/4588595) loss:2.670 lr:0.0000100 epoch_Time:27107.0min: [2024-01-04 01:57:19,162][model8_pretrain.py][INFO] Epoch:[0/2](306100/4588595) loss:2.814 lr:0.0000100 epoch_Time:27107.0min: [2024-01-04 01:57:19,162][model8_pretrain.py][INFO] Epoch:[0/2](306100/4588595) loss:2.786 lr:0.0000100 epoch_Time:27107.0min: [2024-01-04 01:57:19,162][model8_pretrain.py][INFO] Epoch:[0/2](306100/4588595) loss:2.859 lr:0.0000100 epoch_Time:27107.0min: [2024-01-04 01:57:19,162][model8_pretrain.py][INFO] Epoch:[0/2](306100/4588595) loss:2.936 lr:0.0000100 epoch_Time:27107.0min: [2024-01-04 01:57:19,162][model8_pretrain.py][INFO] Epoch:[0/2](306100/4588595) loss:2.663 lr:0.0000100 
epoch_Time:27107.0min: [2024-01-04 01:57:56,096][model8_pretrain.py][INFO] Epoch:[0/2](306200/4588595) loss:2.309 lr:0.0000100 epoch_Time:27106.0min: [2024-01-04 01:57:56,096][model8_pretrain.py][INFO] Epoch:[0/2](306200/4588595) loss:3.049 lr:0.0000100 epoch_Time:27106.0min: [2024-01-04 01:57:56,096][model8_pretrain.py][INFO] Epoch:[0/2](306200/4588595) loss:2.817 lr:0.0000100 epoch_Time:27106.0min: [2024-01-04 01:57:56,096][model8_pretrain.py][INFO] Epoch:[0/2](306200/4588595) loss:2.287 lr:0.0000100 epoch_Time:27106.0min: [2024-01-04 01:57:56,096][model8_pretrain.py][INFO] Epoch:[0/2](306200/4588595) loss:3.276 lr:0.0000100 epoch_Time:27106.0min: [2024-01-04 01:57:56,097][model8_pretrain.py][INFO] Epoch:[0/2](306200/4588595) loss:1.917 lr:0.0000100 epoch_Time:27106.0min: [2024-01-04 01:57:56,097][model8_pretrain.py][INFO] Epoch:[0/2](306200/4588595) loss:3.287 lr:0.0000100 epoch_Time:27106.0min: [2024-01-04 01:57:56,097][model8_pretrain.py][INFO] Epoch:[0/2](306200/4588595) loss:3.394 lr:0.0000100 epoch_Time:27106.0min: [2024-01-04 01:58:33,041][model8_pretrain.py][INFO] Epoch:[0/2](306300/4588595) loss:3.197 lr:0.0000100 epoch_Time:27105.0min: [2024-01-04 01:58:33,041][model8_pretrain.py][INFO] Epoch:[0/2](306300/4588595) loss:2.981 lr:0.0000100 epoch_Time:27105.0min: [2024-01-04 01:58:33,041][model8_pretrain.py][INFO] Epoch:[0/2](306300/4588595) loss:2.738 lr:0.0000100 epoch_Time:27105.0min: [2024-01-04 01:58:33,041][model8_pretrain.py][INFO] Epoch:[0/2](306300/4588595) loss:2.629 lr:0.0000100 epoch_Time:27105.0min: [2024-01-04 01:58:33,041][model8_pretrain.py][INFO] Epoch:[0/2](306300/4588595) loss:2.265 lr:0.0000100 epoch_Time:27105.0min: [2024-01-04 01:58:33,041][model8_pretrain.py][INFO] Epoch:[0/2](306300/4588595) loss:2.550 lr:0.0000100 epoch_Time:27105.0min: [2024-01-04 01:58:33,042][model8_pretrain.py][INFO] Epoch:[0/2](306300/4588595) loss:3.298 lr:0.0000100 epoch_Time:27105.0min: [2024-01-04 01:58:33,042][model8_pretrain.py][INFO] 
Epoch:[0/2](306300/4588595) loss:2.602 lr:0.0000100 epoch_Time:27105.0min: [2024-01-04 01:59:09,980][model8_pretrain.py][INFO] Epoch:[0/2](306400/4588595) loss:3.100 lr:0.0000100 epoch_Time:27104.0min: [2024-01-04 01:59:09,980][model8_pretrain.py][INFO] Epoch:[0/2](306400/4588595) loss:2.750 lr:0.0000100 epoch_Time:27104.0min: [2024-01-04 01:59:09,980][model8_pretrain.py][INFO] Epoch:[0/2](306400/4588595) loss:3.109 lr:0.0000100 epoch_Time:27104.0min: [2024-01-04 01:59:09,980][model8_pretrain.py][INFO] Epoch:[0/2](306400/4588595) loss:2.440 lr:0.0000100 epoch_Time:27104.0min: [2024-01-04 01:59:09,980][model8_pretrain.py][INFO] Epoch:[0/2](306400/4588595) loss:3.216 lr:0.0000100 epoch_Time:27104.0min: [2024-01-04 01:59:09,980][model8_pretrain.py][INFO] Epoch:[0/2](306400/4588595) loss:3.438 lr:0.0000100 epoch_Time:27104.0min: [2024-01-04 01:59:09,980][model8_pretrain.py][INFO] Epoch:[0/2](306400/4588595) loss:2.495 lr:0.0000100 epoch_Time:27104.0min: [2024-01-04 01:59:09,980][model8_pretrain.py][INFO] Epoch:[0/2](306400/4588595) loss:2.596 lr:0.0000100 epoch_Time:27104.0min: [2024-01-04 01:59:46,922][model8_pretrain.py][INFO] Epoch:[0/2](306500/4588595) loss:3.242 lr:0.0000100 epoch_Time:27104.0min: [2024-01-04 01:59:46,922][model8_pretrain.py][INFO] Epoch:[0/2](306500/4588595) loss:2.953 lr:0.0000100 epoch_Time:27104.0min: [2024-01-04 01:59:46,922][model8_pretrain.py][INFO] Epoch:[0/2](306500/4588595) loss:2.983 lr:0.0000100 epoch_Time:27104.0min: [2024-01-04 01:59:46,922][model8_pretrain.py][INFO] Epoch:[0/2](306500/4588595) loss:3.073 lr:0.0000100 epoch_Time:27104.0min: [2024-01-04 01:59:46,922][model8_pretrain.py][INFO] Epoch:[0/2](306500/4588595) loss:2.629 lr:0.0000100 epoch_Time:27104.0min: [2024-01-04 01:59:46,922][model8_pretrain.py][INFO] Epoch:[0/2](306500/4588595) loss:2.857 lr:0.0000100 epoch_Time:27104.0min: [2024-01-04 01:59:46,922][model8_pretrain.py][INFO] Epoch:[0/2](306500/4588595) loss:2.999 lr:0.0000100 epoch_Time:27104.0min: [2024-01-04 
01:59:46,923][model8_pretrain.py][INFO] Epoch:[0/2](306500/4588595) loss:3.396 lr:0.0000100 epoch_Time:27104.0min: [2024-01-04 02:00:23,869][model8_pretrain.py][INFO] Epoch:[0/2](306600/4588595) loss:3.103 lr:0.0000100 epoch_Time:27103.0min: [2024-01-04 02:00:23,869][model8_pretrain.py][INFO] Epoch:[0/2](306600/4588595) loss:3.227 lr:0.0000100 epoch_Time:27103.0min: [2024-01-04 02:00:23,869][model8_pretrain.py][INFO] Epoch:[0/2](306600/4588595) loss:2.491 lr:0.0000100 epoch_Time:27103.0min: [2024-01-04 02:00:23,869][model8_pretrain.py][INFO] Epoch:[0/2](306600/4588595) loss:2.734 lr:0.0000100 epoch_Time:27103.0min: [2024-01-04 02:00:23,869][model8_pretrain.py][INFO] Epoch:[0/2](306600/4588595) loss:2.955 lr:0.0000100 epoch_Time:27103.0min: [2024-01-04 02:00:23,869][model8_pretrain.py][INFO] Epoch:[0/2](306600/4588595) loss:2.782 lr:0.0000100 epoch_Time:27103.0min: [2024-01-04 02:00:23,869][model8_pretrain.py][INFO] Epoch:[0/2](306600/4588595) loss:2.931 lr:0.0000100 epoch_Time:27103.0min: [2024-01-04 02:00:23,869][model8_pretrain.py][INFO] Epoch:[0/2](306600/4588595) loss:3.148 lr:0.0000100 epoch_Time:27103.0min: [2024-01-04 02:01:11,004][model8_pretrain.py][INFO] Epoch:[0/2](306700/4588595) loss:3.253 lr:0.0000100 epoch_Time:27104.0min: [2024-01-04 02:01:11,005][model8_pretrain.py][INFO] Epoch:[0/2](306700/4588595) loss:3.159 lr:0.0000100 epoch_Time:27104.0min: [2024-01-04 02:01:11,005][model8_pretrain.py][INFO] Epoch:[0/2](306700/4588595) loss:2.775 lr:0.0000100 epoch_Time:27104.0min: [2024-01-04 02:01:11,005][model8_pretrain.py][INFO] Epoch:[0/2](306700/4588595) loss:2.983 lr:0.0000100 epoch_Time:27104.0min: [2024-01-04 02:01:11,005][model8_pretrain.py][INFO] Epoch:[0/2](306700/4588595) loss:3.023 lr:0.0000100 epoch_Time:27104.0min: [2024-01-04 02:01:11,005][model8_pretrain.py][INFO] Epoch:[0/2](306700/4588595) loss:2.816 lr:0.0000100 epoch_Time:27104.0min: [2024-01-04 02:01:11,005][model8_pretrain.py][INFO] Epoch:[0/2](306700/4588595) loss:2.630 lr:0.0000100 
epoch_Time:27104.0min: [2024-01-04 02:01:11,005][model8_pretrain.py][INFO] Epoch:[0/2](306700/4588595) loss:2.950 lr:0.0000100 epoch_Time:27104.0min: [2024-01-04 02:01:47,933][model8_pretrain.py][INFO] Epoch:[0/2](306800/4588595) loss:3.051 lr:0.0000100 epoch_Time:27103.0min: [2024-01-04 02:01:47,933][model8_pretrain.py][INFO] Epoch:[0/2](306800/4588595) loss:2.196 lr:0.0000100 epoch_Time:27103.0min: [2024-01-04 02:01:47,933][model8_pretrain.py][INFO] Epoch:[0/2](306800/4588595) loss:2.542 lr:0.0000100 epoch_Time:27103.0min: [2024-01-04 02:01:47,933][model8_pretrain.py][INFO] Epoch:[0/2](306800/4588595) loss:2.758 lr:0.0000100 epoch_Time:27103.0min: [2024-01-04 02:01:47,933][model8_pretrain.py][INFO] Epoch:[0/2](306800/4588595) loss:2.486 lr:0.0000100 epoch_Time:27103.0min: [2024-01-04 02:01:47,933][model8_pretrain.py][INFO] Epoch:[0/2](306800/4588595) loss:3.009 lr:0.0000100 epoch_Time:27103.0min: [2024-01-04 02:01:47,933][model8_pretrain.py][INFO] Epoch:[0/2](306800/4588595) loss:3.171 lr:0.0000100 epoch_Time:27103.0min: [2024-01-04 02:01:47,933][model8_pretrain.py][INFO] Epoch:[0/2](306800/4588595) loss:2.915 lr:0.0000100 epoch_Time:27103.0min: [2024-01-04 02:02:24,874][model8_pretrain.py][INFO] Epoch:[0/2](306900/4588595) loss:2.300 lr:0.0000100 epoch_Time:27102.0min: [2024-01-04 02:02:24,874][model8_pretrain.py][INFO] Epoch:[0/2](306900/4588595) loss:3.101 lr:0.0000100 epoch_Time:27102.0min: [2024-01-04 02:02:24,874][model8_pretrain.py][INFO] Epoch:[0/2](306900/4588595) loss:2.197 lr:0.0000100 epoch_Time:27102.0min: [2024-01-04 02:02:24,874][model8_pretrain.py][INFO] Epoch:[0/2](306900/4588595) loss:2.771 lr:0.0000100 epoch_Time:27102.0min: [2024-01-04 02:02:24,874][model8_pretrain.py][INFO] Epoch:[0/2](306900/4588595) loss:3.120 lr:0.0000100 epoch_Time:27102.0min: [2024-01-04 02:02:24,874][model8_pretrain.py][INFO] Epoch:[0/2](306900/4588595) loss:3.393 lr:0.0000100 epoch_Time:27102.0min: [2024-01-04 02:02:24,874][model8_pretrain.py][INFO] 
Epoch:[0/2](306900/4588595) loss:2.778 lr:0.0000100 epoch_Time:27102.0min: [2024-01-04 02:02:24,874][model8_pretrain.py][INFO] Epoch:[0/2](306900/4588595) loss:3.100 lr:0.0000100 epoch_Time:27102.0min: [2024-01-04 02:03:01,813][model8_pretrain.py][INFO] Epoch:[0/2](307000/4588595) loss:2.822 lr:0.0000100 epoch_Time:27101.0min: [2024-01-04 02:03:01,813][model8_pretrain.py][INFO] Epoch:[0/2](307000/4588595) loss:3.362 lr:0.0000100 epoch_Time:27101.0min: [2024-01-04 02:03:01,813][model8_pretrain.py][INFO] Epoch:[0/2](307000/4588595) loss:2.766 lr:0.0000100 epoch_Time:27101.0min: [2024-01-04 02:03:01,813][model8_pretrain.py][INFO] Epoch:[0/2](307000/4588595) loss:2.672 lr:0.0000100 epoch_Time:27101.0min: [2024-01-04 02:03:01,813][model8_pretrain.py][INFO] Epoch:[0/2](307000/4588595) loss:3.365 lr:0.0000100 epoch_Time:27101.0min: [2024-01-04 02:03:01,813][model8_pretrain.py][INFO] Epoch:[0/2](307000/4588595) loss:2.742 lr:0.0000100 epoch_Time:27101.0min: [2024-01-04 02:03:01,814][model8_pretrain.py][INFO] Epoch:[0/2](307000/4588595) loss:2.812 lr:0.0000100 epoch_Time:27101.0min: [2024-01-04 02:03:01,814][model8_pretrain.py][INFO] Epoch:[0/2](307000/4588595) loss:2.825 lr:0.0000100 epoch_Time:27101.0min: [2024-01-04 02:03:38,771][model8_pretrain.py][INFO] Epoch:[0/2](307100/4588595) loss:3.603 lr:0.0000100 epoch_Time:27101.0min: [2024-01-04 02:03:38,771][model8_pretrain.py][INFO] Epoch:[0/2](307100/4588595) loss:3.494 lr:0.0000100 epoch_Time:27101.0min: [2024-01-04 02:03:38,771][model8_pretrain.py][INFO] Epoch:[0/2](307100/4588595) loss:2.897 lr:0.0000100 epoch_Time:27101.0min: [2024-01-04 02:03:38,771][model8_pretrain.py][INFO] Epoch:[0/2](307100/4588595) loss:2.686 lr:0.0000100 epoch_Time:27101.0min: [2024-01-04 02:03:38,771][model8_pretrain.py][INFO] Epoch:[0/2](307100/4588595) loss:2.780 lr:0.0000100 epoch_Time:27101.0min: [2024-01-04 02:03:38,771][model8_pretrain.py][INFO] Epoch:[0/2](307100/4588595) loss:3.209 lr:0.0000100 epoch_Time:27101.0min: [2024-01-04 
02:03:38,771][model8_pretrain.py][INFO] Epoch:[0/2](307100/4588595) loss:3.072 lr:0.0000100 epoch_Time:27101.0min: [2024-01-04 02:03:38,771][model8_pretrain.py][INFO] Epoch:[0/2](307100/4588595) loss:2.942 lr:0.0000100 epoch_Time:27101.0min: [2024-01-04 02:04:15,717][model8_pretrain.py][INFO] Epoch:[0/2](307200/4588595) loss:3.035 lr:0.0000100 epoch_Time:27100.0min: [2024-01-04 02:04:15,717][model8_pretrain.py][INFO] Epoch:[0/2](307200/4588595) loss:2.190 lr:0.0000100 epoch_Time:27100.0min: [2024-01-04 02:04:15,717][model8_pretrain.py][INFO] Epoch:[0/2](307200/4588595) loss:2.508 lr:0.0000100 epoch_Time:27100.0min: [2024-01-04 02:04:15,717][model8_pretrain.py][INFO] Epoch:[0/2](307200/4588595) loss:3.025 lr:0.0000100 epoch_Time:27100.0min: [2024-01-04 02:04:15,717][model8_pretrain.py][INFO] Epoch:[0/2](307200/4588595) loss:2.841 lr:0.0000100 epoch_Time:27100.0min: [2024-01-04 02:04:15,717][model8_pretrain.py][INFO] Epoch:[0/2](307200/4588595) loss:2.907 lr:0.0000100 epoch_Time:27100.0min: [2024-01-04 02:04:15,717][model8_pretrain.py][INFO] Epoch:[0/2](307200/4588595) loss:2.417 lr:0.0000100 epoch_Time:27100.0min: [2024-01-04 02:04:15,717][model8_pretrain.py][INFO] Epoch:[0/2](307200/4588595) loss:2.613 lr:0.0000100 epoch_Time:27100.0min: [2024-01-04 02:04:52,657][model8_pretrain.py][INFO] Epoch:[0/2](307300/4588595) loss:2.875 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:04:52,657][model8_pretrain.py][INFO] Epoch:[0/2](307300/4588595) loss:2.888 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:04:52,657][model8_pretrain.py][INFO] Epoch:[0/2](307300/4588595) loss:3.249 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:04:52,657][model8_pretrain.py][INFO] Epoch:[0/2](307300/4588595) loss:2.535 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:04:52,657][model8_pretrain.py][INFO] Epoch:[0/2](307300/4588595) loss:3.069 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:04:52,657][model8_pretrain.py][INFO] Epoch:[0/2](307300/4588595) loss:2.992 lr:0.0000100 
epoch_Time:27098.0min: [2024-01-04 02:04:52,657][model8_pretrain.py][INFO] Epoch:[0/2](307300/4588595) loss:2.311 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:04:52,657][model8_pretrain.py][INFO] Epoch:[0/2](307300/4588595) loss:2.813 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:05:29,599][model8_pretrain.py][INFO] Epoch:[0/2](307400/4588595) loss:2.598 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:05:29,599][model8_pretrain.py][INFO] Epoch:[0/2](307400/4588595) loss:2.418 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:05:29,600][model8_pretrain.py][INFO] Epoch:[0/2](307400/4588595) loss:3.166 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:05:29,600][model8_pretrain.py][INFO] Epoch:[0/2](307400/4588595) loss:2.468 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:05:29,600][model8_pretrain.py][INFO] Epoch:[0/2](307400/4588595) loss:2.522 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:05:29,600][model8_pretrain.py][INFO] Epoch:[0/2](307400/4588595) loss:3.082 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:05:29,600][model8_pretrain.py][INFO] Epoch:[0/2](307400/4588595) loss:3.345 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:05:29,600][model8_pretrain.py][INFO] Epoch:[0/2](307400/4588595) loss:2.845 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:06:15,327][model8_pretrain.py][INFO] Epoch:[0/2](307500/4588595) loss:3.106 lr:0.0000100 epoch_Time:27099.0min: [2024-01-04 02:06:15,327][model8_pretrain.py][INFO] Epoch:[0/2](307500/4588595) loss:2.728 lr:0.0000100 epoch_Time:27099.0min: [2024-01-04 02:06:15,327][model8_pretrain.py][INFO] Epoch:[0/2](307500/4588595) loss:3.033 lr:0.0000100 epoch_Time:27099.0min: [2024-01-04 02:06:15,327][model8_pretrain.py][INFO] Epoch:[0/2](307500/4588595) loss:2.811 lr:0.0000100 epoch_Time:27099.0min: [2024-01-04 02:06:15,328][model8_pretrain.py][INFO] Epoch:[0/2](307500/4588595) loss:3.180 lr:0.0000100 epoch_Time:27099.0min: [2024-01-04 02:06:15,328][model8_pretrain.py][INFO] 
Epoch:[0/2](307500/4588595) loss:2.416 lr:0.0000100 epoch_Time:27099.0min: [2024-01-04 02:06:15,328][model8_pretrain.py][INFO] Epoch:[0/2](307500/4588595) loss:2.636 lr:0.0000100 epoch_Time:27099.0min: [2024-01-04 02:06:16,832][model8_pretrain.py][INFO] Epoch:[0/2](307500/4588595) loss:2.912 lr:0.0000100 epoch_Time:27099.0min: [2024-01-04 02:06:53,757][model8_pretrain.py][INFO] Epoch:[0/2](307600/4588595) loss:3.159 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:06:53,757][model8_pretrain.py][INFO] Epoch:[0/2](307600/4588595) loss:3.030 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:06:53,757][model8_pretrain.py][INFO] Epoch:[0/2](307600/4588595) loss:3.137 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:06:53,757][model8_pretrain.py][INFO] Epoch:[0/2](307600/4588595) loss:2.518 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:06:53,757][model8_pretrain.py][INFO] Epoch:[0/2](307600/4588595) loss:2.968 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:06:53,757][model8_pretrain.py][INFO] Epoch:[0/2](307600/4588595) loss:3.416 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:06:53,757][model8_pretrain.py][INFO] Epoch:[0/2](307600/4588595) loss:2.749 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:06:53,758][model8_pretrain.py][INFO] Epoch:[0/2](307600/4588595) loss:2.250 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:07:30,708][model8_pretrain.py][INFO] Epoch:[0/2](307700/4588595) loss:2.643 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:07:30,708][model8_pretrain.py][INFO] Epoch:[0/2](307700/4588595) loss:2.732 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:07:30,708][model8_pretrain.py][INFO] Epoch:[0/2](307700/4588595) loss:3.207 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:07:30,708][model8_pretrain.py][INFO] Epoch:[0/2](307700/4588595) loss:2.826 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:07:30,708][model8_pretrain.py][INFO] Epoch:[0/2](307700/4588595) loss:2.983 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 
02:07:30,708][model8_pretrain.py][INFO] Epoch:[0/2](307700/4588595) loss:3.142 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:07:30,708][model8_pretrain.py][INFO] Epoch:[0/2](307700/4588595) loss:3.052 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:07:30,708][model8_pretrain.py][INFO] Epoch:[0/2](307700/4588595) loss:2.870 lr:0.0000100 epoch_Time:27098.0min: [2024-01-04 02:08:07,664][model8_pretrain.py][INFO] Epoch:[0/2](307800/4588595) loss:3.071 lr:0.0000100 epoch_Time:27097.0min: [2024-01-04 02:08:07,664][model8_pretrain.py][INFO] Epoch:[0/2](307800/4588595) loss:3.197 lr:0.0000100 epoch_Time:27097.0min: [2024-01-04 02:08:07,664][model8_pretrain.py][INFO] Epoch:[0/2](307800/4588595) loss:2.857 lr:0.0000100 epoch_Time:27097.0min: [2024-01-04 02:08:07,664][model8_pretrain.py][INFO] Epoch:[0/2](307800/4588595) loss:3.008 lr:0.0000100 epoch_Time:27097.0min: [2024-01-04 02:08:07,664][model8_pretrain.py][INFO] Epoch:[0/2](307800/4588595) loss:3.348 lr:0.0000100 epoch_Time:27097.0min: [2024-01-04 02:08:07,664][model8_pretrain.py][INFO] Epoch:[0/2](307800/4588595) loss:3.170 lr:0.0000100 epoch_Time:27097.0min: [2024-01-04 02:08:07,665][model8_pretrain.py][INFO] Epoch:[0/2](307800/4588595) loss:3.160 lr:0.0000100 epoch_Time:27097.0min: [2024-01-04 02:08:07,664][model8_pretrain.py][INFO] Epoch:[0/2](307800/4588595) loss:3.319 lr:0.0000100 epoch_Time:27097.0min: [2024-01-04 02:08:44,613][model8_pretrain.py][INFO] Epoch:[0/2](307900/4588595) loss:2.459 lr:0.0000100 epoch_Time:27096.0min: [2024-01-04 02:08:44,613][model8_pretrain.py][INFO] Epoch:[0/2](307900/4588595) loss:3.063 lr:0.0000100 epoch_Time:27096.0min: [2024-01-04 02:08:44,613][model8_pretrain.py][INFO] Epoch:[0/2](307900/4588595) loss:3.158 lr:0.0000100 epoch_Time:27096.0min: [2024-01-04 02:08:44,613][model8_pretrain.py][INFO] Epoch:[0/2](307900/4588595) loss:2.941 lr:0.0000100 epoch_Time:27096.0min: [2024-01-04 02:08:44,613][model8_pretrain.py][INFO] Epoch:[0/2](307900/4588595) loss:2.714 lr:0.0000100 
epoch_Time:27096.0min: [2024-01-04 02:08:44,613][model8_pretrain.py][INFO] Epoch:[0/2](307900/4588595) loss:2.928 lr:0.0000100 epoch_Time:27096.0min: [2024-01-04 02:08:44,613][model8_pretrain.py][INFO] Epoch:[0/2](307900/4588595) loss:3.146 lr:0.0000100 epoch_Time:27096.0min: [2024-01-04 02:08:44,613][model8_pretrain.py][INFO] Epoch:[0/2](307900/4588595) loss:2.817 lr:0.0000100 epoch_Time:27096.0min: [2024-01-04 02:09:21,571][model8_pretrain.py][INFO] Epoch:[0/2](308000/4588595) loss:3.070 lr:0.0000100 epoch_Time:27095.0min: [2024-01-04 02:09:21,571][model8_pretrain.py][INFO] Epoch:[0/2](308000/4588595) loss:3.069 lr:0.0000100 epoch_Time:27095.0min: [2024-01-04 02:09:21,571][model8_pretrain.py][INFO] Epoch:[0/2](308000/4588595) loss:3.196 lr:0.0000100 epoch_Time:27095.0min: [2024-01-04 02:09:21,571][model8_pretrain.py][INFO] Epoch:[0/2](308000/4588595) loss:2.592 lr:0.0000100 epoch_Time:27095.0min: [2024-01-04 02:09:21,571][model8_pretrain.py][INFO] Epoch:[0/2](308000/4588595) loss:3.022 lr:0.0000100 epoch_Time:27095.0min: [2024-01-04 02:09:21,571][model8_pretrain.py][INFO] Epoch:[0/2](308000/4588595) loss:2.350 lr:0.0000100 epoch_Time:27095.0min: [2024-01-04 02:09:21,571][model8_pretrain.py][INFO] Epoch:[0/2](308000/4588595) loss:3.258 lr:0.0000100 epoch_Time:27095.0min: [2024-01-04 02:09:21,572][model8_pretrain.py][INFO] Epoch:[0/2](308000/4588595) loss:3.054 lr:0.0000100 epoch_Time:27095.0min: [2024-01-04 02:09:58,526][model8_pretrain.py][INFO] Epoch:[0/2](308100/4588595) loss:2.897 lr:0.0000100 epoch_Time:27094.0min: [2024-01-04 02:09:58,526][model8_pretrain.py][INFO] Epoch:[0/2](308100/4588595) loss:2.284 lr:0.0000100 epoch_Time:27094.0min: [2024-01-04 02:09:58,526][model8_pretrain.py][INFO] Epoch:[0/2](308100/4588595) loss:3.292 lr:0.0000100 epoch_Time:27094.0min: [2024-01-04 02:09:58,526][model8_pretrain.py][INFO] Epoch:[0/2](308100/4588595) loss:2.906 lr:0.0000100 epoch_Time:27094.0min: [2024-01-04 02:09:58,527][model8_pretrain.py][INFO] 
Epoch:[0/2](308100/4588595) loss:3.030 lr:0.0000100 epoch_Time:27094.0min: [2024-01-04 02:09:58,527][model8_pretrain.py][INFO] Epoch:[0/2](308100/4588595) loss:2.842 lr:0.0000100 epoch_Time:27094.0min: [2024-01-04 02:09:58,527][model8_pretrain.py][INFO] Epoch:[0/2](308100/4588595) loss:3.251 lr:0.0000100 epoch_Time:27094.0min: [2024-01-04 02:09:58,527][model8_pretrain.py][INFO] Epoch:[0/2](308100/4588595) loss:3.017 lr:0.0000100 epoch_Time:27094.0min: [2024-01-04 02:10:35,485][model8_pretrain.py][INFO] Epoch:[0/2](308200/4588595) loss:3.000 lr:0.0000100 epoch_Time:27094.0min: [2024-01-04 02:10:35,485][model8_pretrain.py][INFO] Epoch:[0/2](308200/4588595) loss:3.255 lr:0.0000100 epoch_Time:27094.0min: [2024-01-04 02:10:35,485][model8_pretrain.py][INFO] Epoch:[0/2](308200/4588595) loss:2.353 lr:0.0000100 epoch_Time:27094.0min: [2024-01-04 02:10:35,485][model8_pretrain.py][INFO] Epoch:[0/2](308200/4588595) loss:3.236 lr:0.0000100 epoch_Time:27094.0min: [2024-01-04 02:10:35,485][model8_pretrain.py][INFO] Epoch:[0/2](308200/4588595) loss:2.799 lr:0.0000100 epoch_Time:27094.0min: [2024-01-04 02:10:35,485][model8_pretrain.py][INFO] Epoch:[0/2](308200/4588595) loss:3.352 lr:0.0000100 epoch_Time:27094.0min: [2024-01-04 02:10:35,485][model8_pretrain.py][INFO] Epoch:[0/2](308200/4588595) loss:2.303 lr:0.0000100 epoch_Time:27094.0min: [2024-01-04 02:10:35,485][model8_pretrain.py][INFO] Epoch:[0/2](308200/4588595) loss:2.821 lr:0.0000100 epoch_Time:27094.0min: [2024-01-04 02:11:21,175][model8_pretrain.py][INFO] Epoch:[0/2](308300/4588595) loss:3.113 lr:0.0000100 epoch_Time:27095.0min: [2024-01-04 02:11:21,175][model8_pretrain.py][INFO] Epoch:[0/2](308300/4588595) loss:3.361 lr:0.0000100 epoch_Time:27095.0min: [2024-01-04 02:11:21,175][model8_pretrain.py][INFO] Epoch:[0/2](308300/4588595) loss:3.071 lr:0.0000100 epoch_Time:27095.0min: [2024-01-04 02:11:21,175][model8_pretrain.py][INFO] Epoch:[0/2](308300/4588595) loss:2.767 lr:0.0000100 epoch_Time:27095.0min: [2024-01-04 
02:11:21,175][model8_pretrain.py][INFO] Epoch:[0/2](308300/4588595) loss:3.172 lr:0.0000100 epoch_Time:27095.0min: [2024-01-04 02:11:21,179][model8_pretrain.py][INFO] Epoch:[0/2](308300/4588595) loss:2.944 lr:0.0000100 epoch_Time:27095.0min: [2024-01-04 02:11:21,180][model8_pretrain.py][INFO] Epoch:[0/2](308300/4588595) loss:3.417 lr:0.0000100 epoch_Time:27095.0min: [2024-01-04 02:11:21,191][model8_pretrain.py][INFO] Epoch:[0/2](308300/4588595) loss:2.733 lr:0.0000100 epoch_Time:27095.0min: [2024-01-04 02:11:59,629][model8_pretrain.py][INFO] Epoch:[0/2](308400/4588595) loss:2.910 lr:0.0000100 epoch_Time:27094.0min: [2024-01-04 02:11:59,629][model8_pretrain.py][INFO] Epoch:[0/2](308400/4588595) loss:3.395 lr:0.0000100 epoch_Time:27094.0min: [2024-01-04 02:11:59,629][model8_pretrain.py][INFO] Epoch:[0/2](308400/4588595) loss:3.406 lr:0.0000100 epoch_Time:27094.0min: [2024-01-04 02:11:59,629][model8_pretrain.py][INFO] Epoch:[0/2](308400/4588595) loss:2.892 lr:0.0000100 epoch_Time:27094.0min: [2024-01-04 02:11:59,629][model8_pretrain.py][INFO] Epoch:[0/2](308400/4588595) loss:2.015 lr:0.0000100 epoch_Time:27094.0min: [2024-01-04 02:11:59,629][model8_pretrain.py][INFO] Epoch:[0/2](308400/4588595) loss:2.976 lr:0.0000100 epoch_Time:27094.0min: [2024-01-04 02:11:59,629][model8_pretrain.py][INFO] Epoch:[0/2](308400/4588595) loss:3.000 lr:0.0000100 epoch_Time:27094.0min: [2024-01-04 02:11:59,629][model8_pretrain.py][INFO] Epoch:[0/2](308400/4588595) loss:2.733 lr:0.0000100 epoch_Time:27094.0min: [2024-01-04 02:12:36,592][model8_pretrain.py][INFO] Epoch:[0/2](308500/4588595) loss:2.873 lr:0.0000100 epoch_Time:27093.0min: [2024-01-04 02:12:36,593][model8_pretrain.py][INFO] Epoch:[0/2](308500/4588595) loss:3.367 lr:0.0000100 epoch_Time:27093.0min: [2024-01-04 02:12:36,593][model8_pretrain.py][INFO] Epoch:[0/2](308500/4588595) loss:2.382 lr:0.0000100 epoch_Time:27093.0min: [2024-01-04 02:12:36,593][model8_pretrain.py][INFO] Epoch:[0/2](308500/4588595) loss:2.901 lr:0.0000100 
epoch_Time:27093.0min: [2024-01-04 02:12:36,593][model8_pretrain.py][INFO] Epoch:[0/2](308500/4588595) loss:2.665 lr:0.0000100 epoch_Time:27093.0min: [2024-01-04 02:12:36,593][model8_pretrain.py][INFO] Epoch:[0/2](308500/4588595) loss:2.676 lr:0.0000100 epoch_Time:27093.0min: [2024-01-04 02:12:36,594][model8_pretrain.py][INFO] Epoch:[0/2](308500/4588595) loss:3.236 lr:0.0000100 epoch_Time:27093.0min: [2024-01-04 02:12:36,594][model8_pretrain.py][INFO] Epoch:[0/2](308500/4588595) loss:2.672 lr:0.0000100 epoch_Time:27093.0min: [2024-01-04 02:13:13,557][model8_pretrain.py][INFO] Epoch:[0/2](308600/4588595) loss:3.353 lr:0.0000100 epoch_Time:27092.0min: [2024-01-04 02:13:13,557][model8_pretrain.py][INFO] Epoch:[0/2](308600/4588595) loss:2.803 lr:0.0000100 epoch_Time:27092.0min: [2024-01-04 02:13:13,557][model8_pretrain.py][INFO] Epoch:[0/2](308600/4588595) loss:3.372 lr:0.0000100 epoch_Time:27092.0min: [2024-01-04 02:13:13,557][model8_pretrain.py][INFO] Epoch:[0/2](308600/4588595) loss:3.049 lr:0.0000100 epoch_Time:27092.0min: [2024-01-04 02:13:13,557][model8_pretrain.py][INFO] Epoch:[0/2](308600/4588595) loss:3.198 lr:0.0000100 epoch_Time:27092.0min: [2024-01-04 02:13:13,557][model8_pretrain.py][INFO] Epoch:[0/2](308600/4588595) loss:2.893 lr:0.0000100 epoch_Time:27092.0min: [2024-01-04 02:13:13,557][model8_pretrain.py][INFO] Epoch:[0/2](308600/4588595) loss:3.002 lr:0.0000100 epoch_Time:27092.0min: [2024-01-04 02:13:13,557][model8_pretrain.py][INFO] Epoch:[0/2](308600/4588595) loss:2.826 lr:0.0000100 epoch_Time:27092.0min: [2024-01-04 02:13:50,488][model8_pretrain.py][INFO] Epoch:[0/2](308700/4588595) loss:3.234 lr:0.0000100 epoch_Time:27091.0min: [2024-01-04 02:13:50,488][model8_pretrain.py][INFO] Epoch:[0/2](308700/4588595) loss:2.571 lr:0.0000100 epoch_Time:27091.0min: [2024-01-04 02:13:50,488][model8_pretrain.py][INFO] Epoch:[0/2](308700/4588595) loss:3.148 lr:0.0000100 epoch_Time:27091.0min: [2024-01-04 02:13:50,488][model8_pretrain.py][INFO] 
Epoch:[0/2](308700/4588595) loss:2.650 lr:0.0000100 epoch_Time:27091.0min: [2024-01-04 02:13:50,488][model8_pretrain.py][INFO] Epoch:[0/2](308700/4588595) loss:2.885 lr:0.0000100 epoch_Time:27091.0min: [2024-01-04 02:13:50,488][model8_pretrain.py][INFO] Epoch:[0/2](308700/4588595) loss:2.775 lr:0.0000100 epoch_Time:27091.0min: [2024-01-04 02:13:50,489][model8_pretrain.py][INFO] Epoch:[0/2](308700/4588595) loss:2.627 lr:0.0000100 epoch_Time:27091.0min: [2024-01-04 02:13:50,489][model8_pretrain.py][INFO] Epoch:[0/2](308700/4588595) loss:3.252 lr:0.0000100 epoch_Time:27091.0min: [2024-01-04 02:14:27,440][model8_pretrain.py][INFO] Epoch:[0/2](308800/4588595) loss:3.219 lr:0.0000100 epoch_Time:27091.0min: [2024-01-04 02:14:27,440][model8_pretrain.py][INFO] Epoch:[0/2](308800/4588595) loss:3.260 lr:0.0000100 epoch_Time:27091.0min: [2024-01-04 02:14:27,440][model8_pretrain.py][INFO] Epoch:[0/2](308800/4588595) loss:3.164 lr:0.0000100 epoch_Time:27091.0min: [2024-01-04 02:14:27,440][model8_pretrain.py][INFO] Epoch:[0/2](308800/4588595) loss:3.021 lr:0.0000100 epoch_Time:27091.0min: [2024-01-04 02:14:27,440][model8_pretrain.py][INFO] Epoch:[0/2](308800/4588595) loss:2.726 lr:0.0000100 epoch_Time:27091.0min: [2024-01-04 02:14:27,441][model8_pretrain.py][INFO] Epoch:[0/2](308800/4588595) loss:2.956 lr:0.0000100 epoch_Time:27091.0min: [2024-01-04 02:14:27,441][model8_pretrain.py][INFO] Epoch:[0/2](308800/4588595) loss:2.808 lr:0.0000100 epoch_Time:27091.0min: [2024-01-04 02:14:27,441][model8_pretrain.py][INFO] Epoch:[0/2](308800/4588595) loss:2.838 lr:0.0000100 epoch_Time:27091.0min: [2024-01-04 02:15:04,375][model8_pretrain.py][INFO] Epoch:[0/2](308900/4588595) loss:2.601 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:15:04,375][model8_pretrain.py][INFO] Epoch:[0/2](308900/4588595) loss:3.225 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:15:04,375][model8_pretrain.py][INFO] Epoch:[0/2](308900/4588595) loss:3.200 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 
02:15:04,375][model8_pretrain.py][INFO] Epoch:[0/2](308900/4588595) loss:2.919 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:15:04,376][model8_pretrain.py][INFO] Epoch:[0/2](308900/4588595) loss:3.298 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:15:04,376][model8_pretrain.py][INFO] Epoch:[0/2](308900/4588595) loss:3.090 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:15:04,376][model8_pretrain.py][INFO] Epoch:[0/2](308900/4588595) loss:2.852 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:15:04,376][model8_pretrain.py][INFO] Epoch:[0/2](308900/4588595) loss:2.519 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:15:41,321][model8_pretrain.py][INFO] Epoch:[0/2](309000/4588595) loss:2.700 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:15:41,321][model8_pretrain.py][INFO] Epoch:[0/2](309000/4588595) loss:2.294 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:15:41,321][model8_pretrain.py][INFO] Epoch:[0/2](309000/4588595) loss:3.164 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:15:41,321][model8_pretrain.py][INFO] Epoch:[0/2](309000/4588595) loss:2.454 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:15:41,321][model8_pretrain.py][INFO] Epoch:[0/2](309000/4588595) loss:3.378 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:15:41,322][model8_pretrain.py][INFO] Epoch:[0/2](309000/4588595) loss:2.878 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:15:41,322][model8_pretrain.py][INFO] Epoch:[0/2](309000/4588595) loss:2.790 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:15:41,322][model8_pretrain.py][INFO] Epoch:[0/2](309000/4588595) loss:2.761 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:16:23,502][model8_pretrain.py][INFO] Epoch:[0/2](309100/4588595) loss:3.307 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:16:23,502][model8_pretrain.py][INFO] Epoch:[0/2](309100/4588595) loss:3.030 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:16:23,502][model8_pretrain.py][INFO] Epoch:[0/2](309100/4588595) loss:2.780 lr:0.0000100 
epoch_Time:27089.0min: [2024-01-04 02:16:23,502][model8_pretrain.py][INFO] Epoch:[0/2](309100/4588595) loss:2.732 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:16:23,502][model8_pretrain.py][INFO] Epoch:[0/2](309100/4588595) loss:2.830 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:16:23,502][model8_pretrain.py][INFO] Epoch:[0/2](309100/4588595) loss:2.860 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:16:23,502][model8_pretrain.py][INFO] Epoch:[0/2](309100/4588595) loss:3.107 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:16:23,502][model8_pretrain.py][INFO] Epoch:[0/2](309100/4588595) loss:3.149 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:17:05,647][model8_pretrain.py][INFO] Epoch:[0/2](309200/4588595) loss:3.492 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:17:05,647][model8_pretrain.py][INFO] Epoch:[0/2](309200/4588595) loss:2.797 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:17:05,647][model8_pretrain.py][INFO] Epoch:[0/2](309200/4588595) loss:3.173 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:17:05,647][model8_pretrain.py][INFO] Epoch:[0/2](309200/4588595) loss:2.563 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:17:05,647][model8_pretrain.py][INFO] Epoch:[0/2](309200/4588595) loss:2.587 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:17:05,647][model8_pretrain.py][INFO] Epoch:[0/2](309200/4588595) loss:2.832 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:17:05,647][model8_pretrain.py][INFO] Epoch:[0/2](309200/4588595) loss:2.685 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:17:05,647][model8_pretrain.py][INFO] Epoch:[0/2](309200/4588595) loss:2.918 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:17:42,580][model8_pretrain.py][INFO] Epoch:[0/2](309300/4588595) loss:2.316 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:17:42,580][model8_pretrain.py][INFO] Epoch:[0/2](309300/4588595) loss:2.959 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:17:42,580][model8_pretrain.py][INFO] 
Epoch:[0/2](309300/4588595) loss:2.691 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:17:42,580][model8_pretrain.py][INFO] Epoch:[0/2](309300/4588595) loss:2.154 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:17:42,580][model8_pretrain.py][INFO] Epoch:[0/2](309300/4588595) loss:2.217 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:17:42,580][model8_pretrain.py][INFO] Epoch:[0/2](309300/4588595) loss:2.946 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:17:42,580][model8_pretrain.py][INFO] Epoch:[0/2](309300/4588595) loss:3.160 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:17:42,581][model8_pretrain.py][INFO] Epoch:[0/2](309300/4588595) loss:3.098 lr:0.0000100 epoch_Time:27089.0min: [2024-01-04 02:18:19,524][model8_pretrain.py][INFO] Epoch:[0/2](309400/4588595) loss:2.568 lr:0.0000100 epoch_Time:27088.0min: [2024-01-04 02:18:19,524][model8_pretrain.py][INFO] Epoch:[0/2](309400/4588595) loss:2.481 lr:0.0000100 epoch_Time:27088.0min: [2024-01-04 02:18:19,524][model8_pretrain.py][INFO] Epoch:[0/2](309400/4588595) loss:2.874 lr:0.0000100 epoch_Time:27088.0min: [2024-01-04 02:18:19,524][model8_pretrain.py][INFO] Epoch:[0/2](309400/4588595) loss:3.008 lr:0.0000100 epoch_Time:27088.0min: [2024-01-04 02:18:19,524][model8_pretrain.py][INFO] Epoch:[0/2](309400/4588595) loss:2.901 lr:0.0000100 epoch_Time:27088.0min: [2024-01-04 02:18:19,524][model8_pretrain.py][INFO] Epoch:[0/2](309400/4588595) loss:2.806 lr:0.0000100 epoch_Time:27088.0min: [2024-01-04 02:18:19,524][model8_pretrain.py][INFO] Epoch:[0/2](309400/4588595) loss:3.221 lr:0.0000100 epoch_Time:27088.0min: [2024-01-04 02:18:19,524][model8_pretrain.py][INFO] Epoch:[0/2](309400/4588595) loss:2.527 lr:0.0000100 epoch_Time:27088.0min: [2024-01-04 02:18:56,468][model8_pretrain.py][INFO] Epoch:[0/2](309500/4588595) loss:3.158 lr:0.0000100 epoch_Time:27086.0min: [2024-01-04 02:18:56,468][model8_pretrain.py][INFO] Epoch:[0/2](309500/4588595) loss:2.990 lr:0.0000100 epoch_Time:27086.0min: [2024-01-04 
02:18:56,468][model8_pretrain.py][INFO] Epoch:[0/2](309500/4588595) loss:3.040 lr:0.0000100 epoch_Time:27086.0min: [2024-01-04 02:18:56,468][model8_pretrain.py][INFO] Epoch:[0/2](309500/4588595) loss:3.170 lr:0.0000100 epoch_Time:27086.0min: [2024-01-04 02:18:56,468][model8_pretrain.py][INFO] Epoch:[0/2](309500/4588595) loss:3.096 lr:0.0000100 epoch_Time:27086.0min: [2024-01-04 02:18:56,468][model8_pretrain.py][INFO] Epoch:[0/2](309500/4588595) loss:2.495 lr:0.0000100 epoch_Time:27086.0min: [2024-01-04 02:18:56,468][model8_pretrain.py][INFO] Epoch:[0/2](309500/4588595) loss:3.267 lr:0.0000100 epoch_Time:27086.0min: [2024-01-04 02:18:56,468][model8_pretrain.py][INFO] Epoch:[0/2](309500/4588595) loss:3.264 lr:0.0000100 epoch_Time:27086.0min: [2024-01-04 02:19:33,400][model8_pretrain.py][INFO] Epoch:[0/2](309600/4588595) loss:2.493 lr:0.0000100 epoch_Time:27086.0min: [2024-01-04 02:19:33,400][model8_pretrain.py][INFO] Epoch:[0/2](309600/4588595) loss:1.961 lr:0.0000100 epoch_Time:27086.0min: [2024-01-04 02:19:33,400][model8_pretrain.py][INFO] Epoch:[0/2](309600/4588595) loss:2.764 lr:0.0000100 epoch_Time:27086.0min: [2024-01-04 02:19:33,400][model8_pretrain.py][INFO] Epoch:[0/2](309600/4588595) loss:2.778 lr:0.0000100 epoch_Time:27086.0min: [2024-01-04 02:19:33,400][model8_pretrain.py][INFO] Epoch:[0/2](309600/4588595) loss:2.985 lr:0.0000100 epoch_Time:27086.0min: [2024-01-04 02:19:33,400][model8_pretrain.py][INFO] Epoch:[0/2](309600/4588595) loss:3.158 lr:0.0000100 epoch_Time:27086.0min: [2024-01-04 02:19:33,400][model8_pretrain.py][INFO] Epoch:[0/2](309600/4588595) loss:2.654 lr:0.0000100 epoch_Time:27086.0min: [2024-01-04 02:19:33,401][model8_pretrain.py][INFO] Epoch:[0/2](309600/4588595) loss:2.800 lr:0.0000100 epoch_Time:27086.0min: [2024-01-04 02:20:10,342][model8_pretrain.py][INFO] Epoch:[0/2](309700/4588595) loss:2.269 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:20:10,342][model8_pretrain.py][INFO] Epoch:[0/2](309700/4588595) loss:2.854 lr:0.0000100 
epoch_Time:27085.0min: [2024-01-04 02:20:10,342][model8_pretrain.py][INFO] Epoch:[0/2](309700/4588595) loss:2.807 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:20:10,342][model8_pretrain.py][INFO] Epoch:[0/2](309700/4588595) loss:2.590 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:20:10,342][model8_pretrain.py][INFO] Epoch:[0/2](309700/4588595) loss:2.896 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:20:10,342][model8_pretrain.py][INFO] Epoch:[0/2](309700/4588595) loss:2.471 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:20:10,342][model8_pretrain.py][INFO] Epoch:[0/2](309700/4588595) loss:3.548 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:20:10,342][model8_pretrain.py][INFO] Epoch:[0/2](309700/4588595) loss:2.834 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:20:47,290][model8_pretrain.py][INFO] Epoch:[0/2](309800/4588595) loss:2.906 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:20:47,290][model8_pretrain.py][INFO] Epoch:[0/2](309800/4588595) loss:2.975 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:20:47,290][model8_pretrain.py][INFO] Epoch:[0/2](309800/4588595) loss:2.351 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:20:47,290][model8_pretrain.py][INFO] Epoch:[0/2](309800/4588595) loss:2.951 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:20:47,290][model8_pretrain.py][INFO] Epoch:[0/2](309800/4588595) loss:2.743 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:20:47,290][model8_pretrain.py][INFO] Epoch:[0/2](309800/4588595) loss:3.054 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:20:47,290][model8_pretrain.py][INFO] Epoch:[0/2](309800/4588595) loss:2.803 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:20:47,290][model8_pretrain.py][INFO] Epoch:[0/2](309800/4588595) loss:3.062 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:21:29,473][model8_pretrain.py][INFO] Epoch:[0/2](309900/4588595) loss:2.695 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:21:29,473][model8_pretrain.py][INFO] 
Epoch:[0/2](309900/4588595) loss:3.166 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:21:29,473][model8_pretrain.py][INFO] Epoch:[0/2](309900/4588595) loss:3.283 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:21:29,473][model8_pretrain.py][INFO] Epoch:[0/2](309900/4588595) loss:2.903 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:21:29,473][model8_pretrain.py][INFO] Epoch:[0/2](309900/4588595) loss:2.577 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:21:29,473][model8_pretrain.py][INFO] Epoch:[0/2](309900/4588595) loss:2.865 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:21:29,473][model8_pretrain.py][INFO] Epoch:[0/2](309900/4588595) loss:3.016 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:21:29,473][model8_pretrain.py][INFO] Epoch:[0/2](309900/4588595) loss:3.003 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:22:11,577][model8_pretrain.py][INFO] Epoch:[0/2](310000/4588595) loss:2.721 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:22:11,577][model8_pretrain.py][INFO] Epoch:[0/2](310000/4588595) loss:3.131 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:22:11,577][model8_pretrain.py][INFO] Epoch:[0/2](310000/4588595) loss:3.186 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:22:11,577][model8_pretrain.py][INFO] Epoch:[0/2](310000/4588595) loss:3.058 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:22:11,577][model8_pretrain.py][INFO] Epoch:[0/2](310000/4588595) loss:2.914 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:22:11,577][model8_pretrain.py][INFO] Epoch:[0/2](310000/4588595) loss:2.888 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:22:11,577][model8_pretrain.py][INFO] Epoch:[0/2](310000/4588595) loss:3.267 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:22:11,577][model8_pretrain.py][INFO] Epoch:[0/2](310000/4588595) loss:2.826 lr:0.0000100 epoch_Time:27085.0min: [2024-01-04 02:22:48,515][model8_pretrain.py][INFO] Epoch:[0/2](310100/4588595) loss:3.081 lr:0.0000100 epoch_Time:27083.0min: [2024-01-04 
02:22:48,515][model8_pretrain.py][INFO] Epoch:[0/2](310100/4588595) loss:3.069 lr:0.0000100 epoch_Time:27083.0min: [2024-01-04 02:22:48,515][model8_pretrain.py][INFO] Epoch:[0/2](310100/4588595) loss:2.828 lr:0.0000100 epoch_Time:27083.0min: [2024-01-04 02:22:48,515][model8_pretrain.py][INFO] Epoch:[0/2](310100/4588595) loss:3.108 lr:0.0000100 epoch_Time:27083.0min: [2024-01-04 02:22:48,515][model8_pretrain.py][INFO] Epoch:[0/2](310100/4588595) loss:3.168 lr:0.0000100 epoch_Time:27083.0min: [2024-01-04 02:22:48,515][model8_pretrain.py][INFO] Epoch:[0/2](310100/4588595) loss:2.967 lr:0.0000100 epoch_Time:27083.0min: [2024-01-04 02:22:48,515][model8_pretrain.py][INFO] Epoch:[0/2](310100/4588595) loss:2.936 lr:0.0000100 epoch_Time:27083.0min: [2024-01-04 02:22:48,515][model8_pretrain.py][INFO] Epoch:[0/2](310100/4588595) loss:2.943 lr:0.0000100 epoch_Time:27083.0min: [2024-01-04 02:23:25,445][model8_pretrain.py][INFO] Epoch:[0/2](310200/4588595) loss:2.244 lr:0.0000100 epoch_Time:27083.0min: [2024-01-04 02:23:25,445][model8_pretrain.py][INFO] Epoch:[0/2](310200/4588595) loss:2.381 lr:0.0000100 epoch_Time:27083.0min: [2024-01-04 02:23:25,445][model8_pretrain.py][INFO] Epoch:[0/2](310200/4588595) loss:3.327 lr:0.0000100 epoch_Time:27083.0min: [2024-01-04 02:23:25,445][model8_pretrain.py][INFO] Epoch:[0/2](310200/4588595) loss:2.864 lr:0.0000100 epoch_Time:27083.0min: [2024-01-04 02:23:25,445][model8_pretrain.py][INFO] Epoch:[0/2](310200/4588595) loss:2.978 lr:0.0000100 epoch_Time:27083.0min: [2024-01-04 02:23:25,445][model8_pretrain.py][INFO] Epoch:[0/2](310200/4588595) loss:2.732 lr:0.0000100 epoch_Time:27083.0min: [2024-01-04 02:23:25,445][model8_pretrain.py][INFO] Epoch:[0/2](310200/4588595) loss:2.958 lr:0.0000100 epoch_Time:27083.0min: [2024-01-04 02:23:25,445][model8_pretrain.py][INFO] Epoch:[0/2](310200/4588595) loss:3.106 lr:0.0000100 epoch_Time:27083.0min: [2024-01-04 02:24:02,385][model8_pretrain.py][INFO] Epoch:[0/2](310300/4588595) loss:2.959 lr:0.0000100 
epoch_Time:27082.0min: [2024-01-04 02:24:02,385][model8_pretrain.py][INFO] Epoch:[0/2](310300/4588595) loss:2.872 lr:0.0000100 epoch_Time:27082.0min: [2024-01-04 02:24:02,385][model8_pretrain.py][INFO] Epoch:[0/2](310300/4588595) loss:3.077 lr:0.0000100 epoch_Time:27082.0min: [2024-01-04 02:24:02,385][model8_pretrain.py][INFO] Epoch:[0/2](310300/4588595) loss:2.401 lr:0.0000100 epoch_Time:27082.0min: [2024-01-04 02:24:02,385][model8_pretrain.py][INFO] Epoch:[0/2](310300/4588595) loss:2.859 lr:0.0000100 epoch_Time:27082.0min: [2024-01-04 02:24:02,386][model8_pretrain.py][INFO] Epoch:[0/2](310300/4588595) loss:3.241 lr:0.0000100 epoch_Time:27082.0min: [2024-01-04 02:24:02,386][model8_pretrain.py][INFO] Epoch:[0/2](310300/4588595) loss:2.989 lr:0.0000100 epoch_Time:27082.0min: [2024-01-04 02:24:02,386][model8_pretrain.py][INFO] Epoch:[0/2](310300/4588595) loss:3.345 lr:0.0000100 epoch_Time:27082.0min: [2024-01-04 02:24:39,327][model8_pretrain.py][INFO] Epoch:[0/2](310400/4588595) loss:2.907 lr:0.0000100 epoch_Time:27082.0min: [2024-01-04 02:24:39,327][model8_pretrain.py][INFO] Epoch:[0/2](310400/4588595) loss:3.380 lr:0.0000100 epoch_Time:27082.0min: [2024-01-04 02:24:39,327][model8_pretrain.py][INFO] Epoch:[0/2](310400/4588595) loss:3.323 lr:0.0000100 epoch_Time:27082.0min: [2024-01-04 02:24:39,327][model8_pretrain.py][INFO] Epoch:[0/2](310400/4588595) loss:2.664 lr:0.0000100 epoch_Time:27082.0min: [2024-01-04 02:24:39,327][model8_pretrain.py][INFO] Epoch:[0/2](310400/4588595) loss:2.475 lr:0.0000100 epoch_Time:27082.0min: [2024-01-04 02:24:39,327][model8_pretrain.py][INFO] Epoch:[0/2](310400/4588595) loss:2.518 lr:0.0000100 epoch_Time:27082.0min: [2024-01-04 02:24:39,327][model8_pretrain.py][INFO] Epoch:[0/2](310400/4588595) loss:2.365 lr:0.0000100 epoch_Time:27082.0min: [2024-01-04 02:24:39,327][model8_pretrain.py][INFO] Epoch:[0/2](310400/4588595) loss:2.682 lr:0.0000100 epoch_Time:27082.0min: [2024-01-04 02:25:16,269][model8_pretrain.py][INFO] 
Epoch:[0/2](310500/4588595) loss:3.080 lr:0.0000100 epoch_Time:27080.0min: [2024-01-04 02:25:16,269][model8_pretrain.py][INFO] Epoch:[0/2](310500/4588595) loss:2.770 lr:0.0000100 epoch_Time:27080.0min: [2024-01-04 02:25:16,269][model8_pretrain.py][INFO] Epoch:[0/2](310500/4588595) loss:2.796 lr:0.0000100 epoch_Time:27080.0min: [2024-01-04 02:25:16,269][model8_pretrain.py][INFO] Epoch:[0/2](310500/4588595) loss:2.468 lr:0.0000100 epoch_Time:27080.0min: [2024-01-04 02:25:16,269][model8_pretrain.py][INFO] Epoch:[0/2](310500/4588595) loss:1.940 lr:0.0000100 epoch_Time:27080.0min: [2024-01-04 02:25:16,269][model8_pretrain.py][INFO] Epoch:[0/2](310500/4588595) loss:3.520 lr:0.0000100 epoch_Time:27080.0min: [2024-01-04 02:25:16,269][model8_pretrain.py][INFO] Epoch:[0/2](310500/4588595) loss:2.594 lr:0.0000100 epoch_Time:27080.0min: [2024-01-04 02:25:16,269][model8_pretrain.py][INFO] Epoch:[0/2](310500/4588595) loss:3.168 lr:0.0000100 epoch_Time:27080.0min: [2024-01-04 02:25:53,210][model8_pretrain.py][INFO] Epoch:[0/2](310600/4588595) loss:2.816 lr:0.0000100 epoch_Time:27079.0min: [2024-01-04 02:25:53,210][model8_pretrain.py][INFO] Epoch:[0/2](310600/4588595) loss:2.957 lr:0.0000100 epoch_Time:27079.0min: [2024-01-04 02:25:53,210][model8_pretrain.py][INFO] Epoch:[0/2](310600/4588595) loss:2.922 lr:0.0000100 epoch_Time:27079.0min: [2024-01-04 02:25:53,210][model8_pretrain.py][INFO] Epoch:[0/2](310600/4588595) loss:2.843 lr:0.0000100 epoch_Time:27079.0min: [2024-01-04 02:25:53,210][model8_pretrain.py][INFO] Epoch:[0/2](310600/4588595) loss:2.587 lr:0.0000100 epoch_Time:27079.0min: [2024-01-04 02:25:53,210][model8_pretrain.py][INFO] Epoch:[0/2](310600/4588595) loss:3.539 lr:0.0000100 epoch_Time:27079.0min: [2024-01-04 02:25:53,210][model8_pretrain.py][INFO] Epoch:[0/2](310600/4588595) loss:2.608 lr:0.0000100 epoch_Time:27079.0min: [2024-01-04 02:25:53,210][model8_pretrain.py][INFO] Epoch:[0/2](310600/4588595) loss:2.925 lr:0.0000100 epoch_Time:27079.0min: [2024-01-04 
02:26:35,370][model8_pretrain.py][INFO] Epoch:[0/2](310700/4588595) loss:3.453 lr:0.0000100 epoch_Time:27080.0min: [2024-01-04 02:26:35,370][model8_pretrain.py][INFO] Epoch:[0/2](310700/4588595) loss:2.689 lr:0.0000100 epoch_Time:27080.0min: [2024-01-04 02:26:35,371][model8_pretrain.py][INFO] Epoch:[0/2](310700/4588595) loss:2.825 lr:0.0000100 epoch_Time:27080.0min: [2024-01-04 02:26:35,371][model8_pretrain.py][INFO] Epoch:[0/2](310700/4588595) loss:3.259 lr:0.0000100 epoch_Time:27080.0min: [2024-01-04 02:26:35,375][model8_pretrain.py][INFO] Epoch:[0/2](310700/4588595) loss:3.045 lr:0.0000100 epoch_Time:27080.0min: [2024-01-04 02:26:35,375][model8_pretrain.py][INFO] Epoch:[0/2](310700/4588595) loss:2.805 lr:0.0000100 epoch_Time:27080.0min: [2024-01-04 02:26:35,375][model8_pretrain.py][INFO] Epoch:[0/2](310700/4588595) loss:2.541 lr:0.0000100 epoch_Time:27080.0min: [2024-01-04 02:26:35,375][model8_pretrain.py][INFO] Epoch:[0/2](310700/4588595) loss:3.346 lr:0.0000100 epoch_Time:27080.0min: [2024-01-04 02:27:17,588][model8_pretrain.py][INFO] Epoch:[0/2](310800/4588595) loss:3.196 lr:0.0000100 epoch_Time:27080.0min: [2024-01-04 02:27:17,588][model8_pretrain.py][INFO] Epoch:[0/2](310800/4588595) loss:2.621 lr:0.0000100 epoch_Time:27080.0min: [2024-01-04 02:27:17,588][model8_pretrain.py][INFO] Epoch:[0/2](310800/4588595) loss:2.763 lr:0.0000100 epoch_Time:27080.0min: [2024-01-04 02:27:17,588][model8_pretrain.py][INFO] Epoch:[0/2](310800/4588595) loss:3.178 lr:0.0000100 epoch_Time:27080.0min: [2024-01-04 02:27:17,588][model8_pretrain.py][INFO] Epoch:[0/2](310800/4588595) loss:3.106 lr:0.0000100 epoch_Time:27080.0min: [2024-01-04 02:27:17,588][model8_pretrain.py][INFO] Epoch:[0/2](310800/4588595) loss:3.156 lr:0.0000100 epoch_Time:27080.0min: [2024-01-04 02:27:17,588][model8_pretrain.py][INFO] Epoch:[0/2](310800/4588595) loss:2.481 lr:0.0000100 epoch_Time:27080.0min: [2024-01-04 02:27:17,588][model8_pretrain.py][INFO] Epoch:[0/2](310800/4588595) loss:2.986 lr:0.0000100 
epoch_Time:27080.0min: [2024-01-04 02:27:54,513][model8_pretrain.py][INFO] Epoch:[0/2](310900/4588595) loss:3.149 lr:0.0000100 epoch_Time:27079.0min: [2024-01-04 02:27:54,513][model8_pretrain.py][INFO] Epoch:[0/2](310900/4588595) loss:3.203 lr:0.0000100 epoch_Time:27079.0min: [2024-01-04 02:27:54,513][model8_pretrain.py][INFO] Epoch:[0/2](310900/4588595) loss:3.317 lr:0.0000100 epoch_Time:27079.0min: [2024-01-04 02:27:54,513][model8_pretrain.py][INFO] Epoch:[0/2](310900/4588595) loss:2.513 lr:0.0000100 epoch_Time:27079.0min: [2024-01-04 02:27:54,513][model8_pretrain.py][INFO] Epoch:[0/2](310900/4588595) loss:2.678 lr:0.0000100 epoch_Time:27079.0min: [2024-01-04 02:27:54,513][model8_pretrain.py][INFO] Epoch:[0/2](310900/4588595) loss:2.502 lr:0.0000100 epoch_Time:27079.0min: [2024-01-04 02:27:54,513][model8_pretrain.py][INFO] Epoch:[0/2](310900/4588595) loss:2.893 lr:0.0000100 epoch_Time:27079.0min: [2024-01-04 02:27:54,513][model8_pretrain.py][INFO] Epoch:[0/2](310900/4588595) loss:2.795 lr:0.0000100 epoch_Time:27079.0min: [2024-01-04 02:28:31,449][model8_pretrain.py][INFO] Epoch:[0/2](311000/4588595) loss:3.524 lr:0.0000100 epoch_Time:27079.0min: [2024-01-04 02:28:31,449][model8_pretrain.py][INFO] Epoch:[0/2](311000/4588595) loss:3.154 lr:0.0000100 epoch_Time:27079.0min: [2024-01-04 02:28:31,449][model8_pretrain.py][INFO] Epoch:[0/2](311000/4588595) loss:2.689 lr:0.0000100 epoch_Time:27079.0min: [2024-01-04 02:28:31,449][model8_pretrain.py][INFO] Epoch:[0/2](311000/4588595) loss:2.890 lr:0.0000100 epoch_Time:27079.0min: [2024-01-04 02:28:31,449][model8_pretrain.py][INFO] Epoch:[0/2](311000/4588595) loss:3.339 lr:0.0000100 epoch_Time:27079.0min: [2024-01-04 02:28:31,449][model8_pretrain.py][INFO] Epoch:[0/2](311000/4588595) loss:2.591 lr:0.0000100 epoch_Time:27079.0min: [2024-01-04 02:28:31,450][model8_pretrain.py][INFO] Epoch:[0/2](311000/4588595) loss:3.335 lr:0.0000100 epoch_Time:27079.0min: [2024-01-04 02:28:31,450][model8_pretrain.py][INFO] 
Epoch:[0/2](311000/4588595) loss:2.834 lr:0.0000100 epoch_Time:27079.0min: [2024-01-04 02:29:08,398][model8_pretrain.py][INFO] Epoch:[0/2](311100/4588595) loss:3.022 lr:0.0000100 epoch_Time:27077.0min: [2024-01-04 02:29:08,398][model8_pretrain.py][INFO] Epoch:[0/2](311100/4588595) loss:3.197 lr:0.0000100 epoch_Time:27077.0min: [2024-01-04 02:29:08,398][model8_pretrain.py][INFO] Epoch:[0/2](311100/4588595) loss:2.514 lr:0.0000100 epoch_Time:27077.0min: [2024-01-04 02:29:08,398][model8_pretrain.py][INFO] Epoch:[0/2](311100/4588595) loss:2.775 lr:0.0000100 epoch_Time:27077.0min: [2024-01-04 02:29:08,398][model8_pretrain.py][INFO] Epoch:[0/2](311100/4588595) loss:3.073 lr:0.0000100 epoch_Time:27077.0min: [2024-01-04 02:29:08,398][model8_pretrain.py][INFO] Epoch:[0/2](311100/4588595) loss:2.638 lr:0.0000100 epoch_Time:27077.0min: [2024-01-04 02:29:08,398][model8_pretrain.py][INFO] Epoch:[0/2](311100/4588595) loss:1.781 lr:0.0000100 epoch_Time:27077.0min: [2024-01-04 02:29:08,398][model8_pretrain.py][INFO] Epoch:[0/2](311100/4588595) loss:3.031 lr:0.0000100 epoch_Time:27077.0min: [2024-01-04 02:29:45,333][model8_pretrain.py][INFO] Epoch:[0/2](311200/4588595) loss:2.939 lr:0.0000100 epoch_Time:27077.0min: [2024-01-04 02:29:45,333][model8_pretrain.py][INFO] Epoch:[0/2](311200/4588595) loss:2.894 lr:0.0000100 epoch_Time:27077.0min: [2024-01-04 02:29:45,333][model8_pretrain.py][INFO] Epoch:[0/2](311200/4588595) loss:2.942 lr:0.0000100 epoch_Time:27077.0min: [2024-01-04 02:29:45,333][model8_pretrain.py][INFO] Epoch:[0/2](311200/4588595) loss:2.638 lr:0.0000100 epoch_Time:27077.0min: [2024-01-04 02:29:45,333][model8_pretrain.py][INFO] Epoch:[0/2](311200/4588595) loss:3.039 lr:0.0000100 epoch_Time:27077.0min: [2024-01-04 02:29:45,333][model8_pretrain.py][INFO] Epoch:[0/2](311200/4588595) loss:2.225 lr:0.0000100 epoch_Time:27077.0min: [2024-01-04 02:29:45,333][model8_pretrain.py][INFO] Epoch:[0/2](311200/4588595) loss:2.792 lr:0.0000100 epoch_Time:27077.0min: [2024-01-04 
02:29:45,333][model8_pretrain.py][INFO] Epoch:[0/2](311200/4588595) loss:3.024 lr:0.0000100 epoch_Time:27077.0min: [2024-01-04 02:30:22,273][model8_pretrain.py][INFO] Epoch:[0/2](311300/4588595) loss:2.731 lr:0.0000100 epoch_Time:27076.0min: [2024-01-04 02:30:22,273][model8_pretrain.py][INFO] Epoch:[0/2](311300/4588595) loss:2.922 lr:0.0000100 epoch_Time:27076.0min: [2024-01-04 02:30:22,273][model8_pretrain.py][INFO] Epoch:[0/2](311300/4588595) loss:3.045 lr:0.0000100 epoch_Time:27076.0min: [2024-01-04 02:30:22,273][model8_pretrain.py][INFO] Epoch:[0/2](311300/4588595) loss:3.358 lr:0.0000100 epoch_Time:27076.0min: [2024-01-04 02:30:22,273][model8_pretrain.py][INFO] Epoch:[0/2](311300/4588595) loss:3.411 lr:0.0000100 epoch_Time:27076.0min: [2024-01-04 02:30:22,273][model8_pretrain.py][INFO] Epoch:[0/2](311300/4588595) loss:2.863 lr:0.0000100 epoch_Time:27076.0min: [2024-01-04 02:30:22,273][model8_pretrain.py][INFO] Epoch:[0/2](311300/4588595) loss:2.850 lr:0.0000100 epoch_Time:27076.0min: [2024-01-04 02:30:22,273][model8_pretrain.py][INFO] Epoch:[0/2](311300/4588595) loss:2.861 lr:0.0000100 epoch_Time:27076.0min: [2024-01-04 02:30:59,211][model8_pretrain.py][INFO] Epoch:[0/2](311400/4588595) loss:2.990 lr:0.0000100 epoch_Time:27075.0min: [2024-01-04 02:30:59,212][model8_pretrain.py][INFO] Epoch:[0/2](311400/4588595) loss:3.025 lr:0.0000100 epoch_Time:27075.0min: [2024-01-04 02:30:59,212][model8_pretrain.py][INFO] Epoch:[0/2](311400/4588595) loss:3.300 lr:0.0000100 epoch_Time:27075.0min: [2024-01-04 02:30:59,212][model8_pretrain.py][INFO] Epoch:[0/2](311400/4588595) loss:2.806 lr:0.0000100 epoch_Time:27075.0min: [2024-01-04 02:30:59,212][model8_pretrain.py][INFO] Epoch:[0/2](311400/4588595) loss:3.040 lr:0.0000100 epoch_Time:27075.0min: [2024-01-04 02:30:59,212][model8_pretrain.py][INFO] Epoch:[0/2](311400/4588595) loss:2.919 lr:0.0000100 epoch_Time:27075.0min: [2024-01-04 02:30:59,212][model8_pretrain.py][INFO] Epoch:[0/2](311400/4588595) loss:3.010 lr:0.0000100 
epoch_Time:27075.0min: [2024-01-04 02:30:59,212][model8_pretrain.py][INFO] Epoch:[0/2](311400/4588595) loss:3.233 lr:0.0000100 epoch_Time:27075.0min: [2024-01-04 02:31:37,889][model8_pretrain.py][INFO] Epoch:[0/2](311500/4588595) loss:3.307 lr:0.0000100 epoch_Time:27075.0min: [2024-01-04 02:31:37,889][model8_pretrain.py][INFO] Epoch:[0/2](311500/4588595) loss:3.168 lr:0.0000100 epoch_Time:27075.0min: [2024-01-04 02:31:37,889][model8_pretrain.py][INFO] Epoch:[0/2](311500/4588595) loss:3.209 lr:0.0000100 epoch_Time:27075.0min: [2024-01-04 02:31:37,889][model8_pretrain.py][INFO] Epoch:[0/2](311500/4588595) loss:2.747 lr:0.0000100 epoch_Time:27075.0min: [2024-01-04 02:31:37,889][model8_pretrain.py][INFO] Epoch:[0/2](311500/4588595) loss:3.186 lr:0.0000100 epoch_Time:27075.0min: [2024-01-04 02:31:37,889][model8_pretrain.py][INFO] Epoch:[0/2](311500/4588595) loss:3.016 lr:0.0000100 epoch_Time:27075.0min: [2024-01-04 02:31:37,889][model8_pretrain.py][INFO] Epoch:[0/2](311500/4588595) loss:3.327 lr:0.0000100 epoch_Time:27075.0min: [2024-01-04 02:31:37,889][model8_pretrain.py][INFO] Epoch:[0/2](311500/4588595) loss:2.729 lr:0.0000100 epoch_Time:27075.0min: [2024-01-04 02:32:23,489][model8_pretrain.py][INFO] Epoch:[0/2](311600/4588595) loss:2.757 lr:0.0000100 epoch_Time:27076.0min: [2024-01-04 02:32:23,489][model8_pretrain.py][INFO] Epoch:[0/2](311600/4588595) loss:2.976 lr:0.0000100 epoch_Time:27076.0min: [2024-01-04 02:32:23,489][model8_pretrain.py][INFO] Epoch:[0/2](311600/4588595) loss:3.055 lr:0.0000100 epoch_Time:27076.0min: [2024-01-04 02:32:23,489][model8_pretrain.py][INFO] Epoch:[0/2](311600/4588595) loss:3.177 lr:0.0000100 epoch_Time:27076.0min: [2024-01-04 02:32:23,489][model8_pretrain.py][INFO] Epoch:[0/2](311600/4588595) loss:2.961 lr:0.0000100 epoch_Time:27076.0min: [2024-01-04 02:32:23,489][model8_pretrain.py][INFO] Epoch:[0/2](311600/4588595) loss:2.313 lr:0.0000100 epoch_Time:27076.0min: [2024-01-04 02:32:23,489][model8_pretrain.py][INFO] 
Epoch:[0/2](311600/4588595) loss:2.514 lr:0.0000100 epoch_Time:27076.0min: [2024-01-04 02:32:23,489][model8_pretrain.py][INFO] Epoch:[0/2](311600/4588595) loss:3.010 lr:0.0000100 epoch_Time:27076.0min: [2024-01-04 02:33:00,421][model8_pretrain.py][INFO] Epoch:[0/2](311700/4588595) loss:2.620 lr:0.0000100 epoch_Time:27074.0min: [2024-01-04 02:33:00,421][model8_pretrain.py][INFO] Epoch:[0/2](311700/4588595) loss:2.901 lr:0.0000100 epoch_Time:27074.0min: [2024-01-04 02:33:00,421][model8_pretrain.py][INFO] Epoch:[0/2](311700/4588595) loss:3.334 lr:0.0000100 epoch_Time:27074.0min: [2024-01-04 02:33:00,421][model8_pretrain.py][INFO] Epoch:[0/2](311700/4588595) loss:2.885 lr:0.0000100 epoch_Time:27074.0min: [2024-01-04 02:33:00,421][model8_pretrain.py][INFO] Epoch:[0/2](311700/4588595) loss:3.188 lr:0.0000100 epoch_Time:27074.0min: [2024-01-04 02:33:00,421][model8_pretrain.py][INFO] Epoch:[0/2](311700/4588595) loss:2.950 lr:0.0000100 epoch_Time:27074.0min: [2024-01-04 02:33:00,422][model8_pretrain.py][INFO] Epoch:[0/2](311700/4588595) loss:3.175 lr:0.0000100 epoch_Time:27074.0min: [2024-01-04 02:33:00,423][model8_pretrain.py][INFO] Epoch:[0/2](311700/4588595) loss:2.922 lr:0.0000100 epoch_Time:27074.0min: [2024-01-04 02:33:37,359][model8_pretrain.py][INFO] Epoch:[0/2](311800/4588595) loss:3.316 lr:0.0000100 epoch_Time:27074.0min: [2024-01-04 02:33:37,359][model8_pretrain.py][INFO] Epoch:[0/2](311800/4588595) loss:2.657 lr:0.0000100 epoch_Time:27074.0min: [2024-01-04 02:33:37,359][model8_pretrain.py][INFO] Epoch:[0/2](311800/4588595) loss:2.921 lr:0.0000100 epoch_Time:27074.0min: [2024-01-04 02:33:37,359][model8_pretrain.py][INFO] Epoch:[0/2](311800/4588595) loss:2.718 lr:0.0000100 epoch_Time:27074.0min: [2024-01-04 02:33:37,359][model8_pretrain.py][INFO] Epoch:[0/2](311800/4588595) loss:2.626 lr:0.0000100 epoch_Time:27074.0min: [2024-01-04 02:33:37,359][model8_pretrain.py][INFO] Epoch:[0/2](311800/4588595) loss:2.941 lr:0.0000100 epoch_Time:27074.0min: [2024-01-04 
02:33:37,359][model8_pretrain.py][INFO] Epoch:[0/2](311800/4588595) loss:3.326 lr:0.0000100 epoch_Time:27074.0min: [2024-01-04 02:33:37,359][model8_pretrain.py][INFO] Epoch:[0/2](311800/4588595) loss:3.226 lr:0.0000100 epoch_Time:27074.0min: [2024-01-04 02:34:14,273][model8_pretrain.py][INFO] Epoch:[0/2](311900/4588595) loss:3.099 lr:0.0000100 epoch_Time:27073.0min: [2024-01-04 02:34:14,273][model8_pretrain.py][INFO] Epoch:[0/2](311900/4588595) loss:2.994 lr:0.0000100 epoch_Time:27073.0min: [2024-01-04 02:34:14,273][model8_pretrain.py][INFO] Epoch:[0/2](311900/4588595) loss:2.767 lr:0.0000100 epoch_Time:27073.0min: [2024-01-04 02:34:14,273][model8_pretrain.py][INFO] Epoch:[0/2](311900/4588595) loss:2.728 lr:0.0000100 epoch_Time:27073.0min: [2024-01-04 02:34:14,273][model8_pretrain.py][INFO] Epoch:[0/2](311900/4588595) loss:2.109 lr:0.0000100 epoch_Time:27073.0min: [2024-01-04 02:34:14,273][model8_pretrain.py][INFO] Epoch:[0/2](311900/4588595) loss:2.564 lr:0.0000100 epoch_Time:27073.0min: [2024-01-04 02:34:14,273][model8_pretrain.py][INFO] Epoch:[0/2](311900/4588595) loss:2.589 lr:0.0000100 epoch_Time:27073.0min: [2024-01-04 02:34:14,274][model8_pretrain.py][INFO] Epoch:[0/2](311900/4588595) loss:2.944 lr:0.0000100 epoch_Time:27073.0min: [2024-01-04 02:34:51,207][model8_pretrain.py][INFO] Epoch:[0/2](312000/4588595) loss:3.031 lr:0.0000100 epoch_Time:27072.0min: [2024-01-04 02:34:51,207][model8_pretrain.py][INFO] Epoch:[0/2](312000/4588595) loss:3.423 lr:0.0000100 epoch_Time:27072.0min: [2024-01-04 02:34:51,207][model8_pretrain.py][INFO] Epoch:[0/2](312000/4588595) loss:2.406 lr:0.0000100 epoch_Time:27072.0min: [2024-01-04 02:34:51,207][model8_pretrain.py][INFO] Epoch:[0/2](312000/4588595) loss:3.528 lr:0.0000100 epoch_Time:27072.0min: [2024-01-04 02:34:51,207][model8_pretrain.py][INFO] Epoch:[0/2](312000/4588595) loss:3.305 lr:0.0000100 epoch_Time:27072.0min: [2024-01-04 02:34:51,207][model8_pretrain.py][INFO] Epoch:[0/2](312000/4588595) loss:2.984 lr:0.0000100 
epoch_Time:27072.0min: [2024-01-04 02:34:51,207][model8_pretrain.py][INFO] Epoch:[0/2](312000/4588595) loss:2.771 lr:0.0000100 epoch_Time:27072.0min: [2024-01-04 02:34:51,207][model8_pretrain.py][INFO] Epoch:[0/2](312000/4588595) loss:2.973 lr:0.0000100 epoch_Time:27072.0min: [2024-01-04 02:35:28,150][model8_pretrain.py][INFO] Epoch:[0/2](312100/4588595) loss:2.907 lr:0.0000100 epoch_Time:27071.0min: [2024-01-04 02:35:28,150][model8_pretrain.py][INFO] Epoch:[0/2](312100/4588595) loss:3.010 lr:0.0000100 epoch_Time:27071.0min: [2024-01-04 02:35:28,150][model8_pretrain.py][INFO] Epoch:[0/2](312100/4588595) loss:3.095 lr:0.0000100 epoch_Time:27071.0min: [2024-01-04 02:35:28,150][model8_pretrain.py][INFO] Epoch:[0/2](312100/4588595) loss:2.603 lr:0.0000100 epoch_Time:27071.0min: [2024-01-04 02:35:28,150][model8_pretrain.py][INFO] Epoch:[0/2](312100/4588595) loss:3.182 lr:0.0000100 epoch_Time:27071.0min: [2024-01-04 02:35:28,150][model8_pretrain.py][INFO] Epoch:[0/2](312100/4588595) loss:3.249 lr:0.0000100 epoch_Time:27071.0min: [2024-01-04 02:35:28,150][model8_pretrain.py][INFO] Epoch:[0/2](312100/4588595) loss:3.172 lr:0.0000100 epoch_Time:27071.0min: [2024-01-04 02:35:28,150][model8_pretrain.py][INFO] Epoch:[0/2](312100/4588595) loss:2.087 lr:0.0000100 epoch_Time:27071.0min: [2024-01-04 02:36:05,084][model8_pretrain.py][INFO] Epoch:[0/2](312200/4588595) loss:2.638 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:36:05,084][model8_pretrain.py][INFO] Epoch:[0/2](312200/4588595) loss:3.079 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:36:05,084][model8_pretrain.py][INFO] Epoch:[0/2](312200/4588595) loss:2.540 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:36:05,084][model8_pretrain.py][INFO] Epoch:[0/2](312200/4588595) loss:2.711 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:36:05,084][model8_pretrain.py][INFO] Epoch:[0/2](312200/4588595) loss:2.705 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:36:05,084][model8_pretrain.py][INFO] 
Epoch:[0/2](312200/4588595) loss:2.740 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:36:05,085][model8_pretrain.py][INFO] Epoch:[0/2](312200/4588595) loss:2.920 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:36:05,085][model8_pretrain.py][INFO] Epoch:[0/2](312200/4588595) loss:3.492 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:36:43,792][model8_pretrain.py][INFO] Epoch:[0/2](312300/4588595) loss:2.688 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:36:43,793][model8_pretrain.py][INFO] Epoch:[0/2](312300/4588595) loss:2.871 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:36:43,793][model8_pretrain.py][INFO] Epoch:[0/2](312300/4588595) loss:2.535 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:36:43,793][model8_pretrain.py][INFO] Epoch:[0/2](312300/4588595) loss:2.568 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:36:43,793][model8_pretrain.py][INFO] Epoch:[0/2](312300/4588595) loss:2.996 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:36:43,793][model8_pretrain.py][INFO] Epoch:[0/2](312300/4588595) loss:2.846 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:36:43,793][model8_pretrain.py][INFO] Epoch:[0/2](312300/4588595) loss:2.423 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:36:43,793][model8_pretrain.py][INFO] Epoch:[0/2](312300/4588595) loss:2.823 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:37:29,504][model8_pretrain.py][INFO] Epoch:[0/2](312400/4588595) loss:3.360 lr:0.0000100 epoch_Time:27071.0min: [2024-01-04 02:37:29,504][model8_pretrain.py][INFO] Epoch:[0/2](312400/4588595) loss:2.807 lr:0.0000100 epoch_Time:27071.0min: [2024-01-04 02:37:29,504][model8_pretrain.py][INFO] Epoch:[0/2](312400/4588595) loss:3.006 lr:0.0000100 epoch_Time:27071.0min: [2024-01-04 02:37:29,504][model8_pretrain.py][INFO] Epoch:[0/2](312400/4588595) loss:2.894 lr:0.0000100 epoch_Time:27071.0min: [2024-01-04 02:37:29,504][model8_pretrain.py][INFO] Epoch:[0/2](312400/4588595) loss:2.769 lr:0.0000100 epoch_Time:27071.0min: [2024-01-04 
02:37:29,504][model8_pretrain.py][INFO] Epoch:[0/2](312400/4588595) loss:2.630 lr:0.0000100 epoch_Time:27071.0min: [2024-01-04 02:37:29,505][model8_pretrain.py][INFO] Epoch:[0/2](312400/4588595) loss:3.510 lr:0.0000100 epoch_Time:27071.0min: [2024-01-04 02:37:29,505][model8_pretrain.py][INFO] Epoch:[0/2](312400/4588595) loss:3.384 lr:0.0000100 epoch_Time:27071.0min: [2024-01-04 02:38:06,439][model8_pretrain.py][INFO] Epoch:[0/2](312500/4588595) loss:3.072 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:38:06,439][model8_pretrain.py][INFO] Epoch:[0/2](312500/4588595) loss:3.172 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:38:06,439][model8_pretrain.py][INFO] Epoch:[0/2](312500/4588595) loss:2.793 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:38:06,439][model8_pretrain.py][INFO] Epoch:[0/2](312500/4588595) loss:2.665 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:38:06,439][model8_pretrain.py][INFO] Epoch:[0/2](312500/4588595) loss:3.035 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:38:06,439][model8_pretrain.py][INFO] Epoch:[0/2](312500/4588595) loss:2.698 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:38:06,440][model8_pretrain.py][INFO] Epoch:[0/2](312500/4588595) loss:3.158 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:38:06,440][model8_pretrain.py][INFO] Epoch:[0/2](312500/4588595) loss:3.077 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:38:43,370][model8_pretrain.py][INFO] Epoch:[0/2](312600/4588595) loss:2.917 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:38:43,370][model8_pretrain.py][INFO] Epoch:[0/2](312600/4588595) loss:2.757 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:38:43,370][model8_pretrain.py][INFO] Epoch:[0/2](312600/4588595) loss:3.629 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:38:43,370][model8_pretrain.py][INFO] Epoch:[0/2](312600/4588595) loss:3.241 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:38:43,370][model8_pretrain.py][INFO] Epoch:[0/2](312600/4588595) loss:2.584 lr:0.0000100 
epoch_Time:27070.0min: [2024-01-04 02:38:43,370][model8_pretrain.py][INFO] Epoch:[0/2](312600/4588595) loss:2.818 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:38:43,371][model8_pretrain.py][INFO] Epoch:[0/2](312600/4588595) loss:3.400 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:38:43,371][model8_pretrain.py][INFO] Epoch:[0/2](312600/4588595) loss:2.770 lr:0.0000100 epoch_Time:27070.0min: [2024-01-04 02:39:20,308][model8_pretrain.py][INFO] Epoch:[0/2](312700/4588595) loss:2.819 lr:0.0000100 epoch_Time:27068.0min: [2024-01-04 02:39:20,308][model8_pretrain.py][INFO] Epoch:[0/2](312700/4588595) loss:3.127 lr:0.0000100 epoch_Time:27068.0min: [2024-01-04 02:39:20,308][model8_pretrain.py][INFO] Epoch:[0/2](312700/4588595) loss:3.180 lr:0.0000100 epoch_Time:27068.0min: [2024-01-04 02:39:20,308][model8_pretrain.py][INFO] Epoch:[0/2](312700/4588595) loss:2.773 lr:0.0000100 epoch_Time:27068.0min: [2024-01-04 02:39:20,308][model8_pretrain.py][INFO] Epoch:[0/2](312700/4588595) loss:2.608 lr:0.0000100 epoch_Time:27068.0min: [2024-01-04 02:39:20,308][model8_pretrain.py][INFO] Epoch:[0/2](312700/4588595) loss:2.766 lr:0.0000100 epoch_Time:27068.0min: [2024-01-04 02:39:20,308][model8_pretrain.py][INFO] Epoch:[0/2](312700/4588595) loss:3.089 lr:0.0000100 epoch_Time:27068.0min: [2024-01-04 02:39:20,308][model8_pretrain.py][INFO] Epoch:[0/2](312700/4588595) loss:2.627 lr:0.0000100 epoch_Time:27068.0min: [2024-01-04 02:39:57,226][model8_pretrain.py][INFO] Epoch:[0/2](312800/4588595) loss:3.305 lr:0.0000100 epoch_Time:27067.0min: [2024-01-04 02:39:57,226][model8_pretrain.py][INFO] Epoch:[0/2](312800/4588595) loss:3.230 lr:0.0000100 epoch_Time:27067.0min: [2024-01-04 02:39:57,226][model8_pretrain.py][INFO] Epoch:[0/2](312800/4588595) loss:2.814 lr:0.0000100 epoch_Time:27067.0min: [2024-01-04 02:39:57,226][model8_pretrain.py][INFO] Epoch:[0/2](312800/4588595) loss:3.181 lr:0.0000100 epoch_Time:27067.0min: [2024-01-04 02:39:57,226][model8_pretrain.py][INFO] 
Epoch:[0/2](312800/4588595) loss:2.948 lr:0.0000100 epoch_Time:27067.0min: [2024-01-04 02:39:57,226][model8_pretrain.py][INFO] Epoch:[0/2](312800/4588595) loss:2.918 lr:0.0000100 epoch_Time:27067.0min: [2024-01-04 02:39:57,227][model8_pretrain.py][INFO] Epoch:[0/2](312800/4588595) loss:3.113 lr:0.0000100 epoch_Time:27067.0min: [2024-01-04 02:39:57,227][model8_pretrain.py][INFO] Epoch:[0/2](312800/4588595) loss:3.354 lr:0.0000100 epoch_Time:27067.0min: [2024-01-04 02:40:34,149][model8_pretrain.py][INFO] Epoch:[0/2](312900/4588595) loss:3.466 lr:0.0000100 epoch_Time:27067.0min: [2024-01-04 02:40:34,149][model8_pretrain.py][INFO] Epoch:[0/2](312900/4588595) loss:2.454 lr:0.0000100 epoch_Time:27067.0min: [2024-01-04 02:40:34,149][model8_pretrain.py][INFO] Epoch:[0/2](312900/4588595) loss:2.364 lr:0.0000100 epoch_Time:27067.0min: [2024-01-04 02:40:34,149][model8_pretrain.py][INFO] Epoch:[0/2](312900/4588595) loss:3.041 lr:0.0000100 epoch_Time:27067.0min: [2024-01-04 02:40:34,149][model8_pretrain.py][INFO] Epoch:[0/2](312900/4588595) loss:3.113 lr:0.0000100 epoch_Time:27067.0min: [2024-01-04 02:40:34,149][model8_pretrain.py][INFO] Epoch:[0/2](312900/4588595) loss:3.126 lr:0.0000100 epoch_Time:27067.0min: [2024-01-04 02:40:34,149][model8_pretrain.py][INFO] Epoch:[0/2](312900/4588595) loss:3.098 lr:0.0000100 epoch_Time:27067.0min: [2024-01-04 02:40:34,150][model8_pretrain.py][INFO] Epoch:[0/2](312900/4588595) loss:2.648 lr:0.0000100 epoch_Time:27067.0min: [2024-01-04 02:41:11,085][model8_pretrain.py][INFO] Epoch:[0/2](313000/4588595) loss:2.641 lr:0.0000100 epoch_Time:27066.0min: [2024-01-04 02:41:11,085][model8_pretrain.py][INFO] Epoch:[0/2](313000/4588595) loss:2.834 lr:0.0000100 epoch_Time:27066.0min: [2024-01-04 02:41:11,085][model8_pretrain.py][INFO] Epoch:[0/2](313000/4588595) loss:2.767 lr:0.0000100 epoch_Time:27066.0min: [2024-01-04 02:41:11,085][model8_pretrain.py][INFO] Epoch:[0/2](313000/4588595) loss:2.943 lr:0.0000100 epoch_Time:27066.0min: [2024-01-04 
02:41:11,085][model8_pretrain.py][INFO] Epoch:[0/2](313000/4588595) loss:2.992 lr:0.0000100 epoch_Time:27066.0min: [2024-01-04 02:41:11,085][model8_pretrain.py][INFO] Epoch:[0/2](313000/4588595) loss:2.998 lr:0.0000100 epoch_Time:27066.0min: [2024-01-04 02:41:11,085][model8_pretrain.py][INFO] Epoch:[0/2](313000/4588595) loss:2.972 lr:0.0000100 epoch_Time:27066.0min: [2024-01-04 02:41:11,085][model8_pretrain.py][INFO] Epoch:[0/2](313000/4588595) loss:2.740 lr:0.0000100 epoch_Time:27066.0min: [2024-01-04 02:41:49,786][model8_pretrain.py][INFO] Epoch:[0/2](313100/4588595) loss:3.257 lr:0.0000100 epoch_Time:27065.0min: [2024-01-04 02:41:49,786][model8_pretrain.py][INFO] Epoch:[0/2](313100/4588595) loss:3.134 lr:0.0000100 epoch_Time:27065.0min: [2024-01-04 02:41:49,786][model8_pretrain.py][INFO] Epoch:[0/2](313100/4588595) loss:2.501 lr:0.0000100 epoch_Time:27065.0min: [2024-01-04 02:41:49,786][model8_pretrain.py][INFO] Epoch:[0/2](313100/4588595) loss:2.521 lr:0.0000100 epoch_Time:27065.0min: [2024-01-04 02:41:49,786][model8_pretrain.py][INFO] Epoch:[0/2](313100/4588595) loss:2.565 lr:0.0000100 epoch_Time:27065.0min: [2024-01-04 02:41:49,791][model8_pretrain.py][INFO] Epoch:[0/2](313100/4588595) loss:3.347 lr:0.0000100 epoch_Time:27065.0min: [2024-01-04 02:41:49,791][model8_pretrain.py][INFO] Epoch:[0/2](313100/4588595) loss:3.107 lr:0.0000100 epoch_Time:27065.0min: [2024-01-04 02:41:49,791][model8_pretrain.py][INFO] Epoch:[0/2](313100/4588595) loss:2.566 lr:0.0000100 epoch_Time:27065.0min: [2024-01-04 02:42:35,407][model8_pretrain.py][INFO] Epoch:[0/2](313200/4588595) loss:3.341 lr:0.0000100 epoch_Time:27067.0min: [2024-01-04 02:42:35,407][model8_pretrain.py][INFO] Epoch:[0/2](313200/4588595) loss:3.228 lr:0.0000100 epoch_Time:27067.0min: [2024-01-04 02:42:35,407][model8_pretrain.py][INFO] Epoch:[0/2](313200/4588595) loss:2.538 lr:0.0000100 epoch_Time:27067.0min: [2024-01-04 02:42:35,407][model8_pretrain.py][INFO] Epoch:[0/2](313200/4588595) loss:3.155 lr:0.0000100 
epoch_Time:27067.0min: [2024-01-04 02:42:35,407][model8_pretrain.py][INFO] Epoch:[0/2](313200/4588595) loss:2.426 lr:0.0000100 epoch_Time:27067.0min: [2024-01-04 02:42:35,407][model8_pretrain.py][INFO] Epoch:[0/2](313200/4588595) loss:2.744 lr:0.0000100 epoch_Time:27067.0min: [2024-01-04 02:42:35,407][model8_pretrain.py][INFO] Epoch:[0/2](313200/4588595) loss:2.663 lr:0.0000100 epoch_Time:27067.0min: [2024-01-04 02:42:35,407][model8_pretrain.py][INFO] Epoch:[0/2](313200/4588595) loss:2.931 lr:0.0000100 epoch_Time:27067.0min: [2024-01-04 02:43:12,339][model8_pretrain.py][INFO] Epoch:[0/2](313300/4588595) loss:2.961 lr:0.0000100 epoch_Time:27065.0min: [2024-01-04 02:43:12,339][model8_pretrain.py][INFO] Epoch:[0/2](313300/4588595) loss:3.053 lr:0.0000100 epoch_Time:27065.0min: [2024-01-04 02:43:12,339][model8_pretrain.py][INFO] Epoch:[0/2](313300/4588595) loss:2.732 lr:0.0000100 epoch_Time:27065.0min: [2024-01-04 02:43:12,339][model8_pretrain.py][INFO] Epoch:[0/2](313300/4588595) loss:3.219 lr:0.0000100 epoch_Time:27065.0min: [2024-01-04 02:43:12,339][model8_pretrain.py][INFO] Epoch:[0/2](313300/4588595) loss:2.992 lr:0.0000100 epoch_Time:27065.0min: [2024-01-04 02:43:12,339][model8_pretrain.py][INFO] Epoch:[0/2](313300/4588595) loss:2.972 lr:0.0000100 epoch_Time:27065.0min: [2024-01-04 02:43:12,339][model8_pretrain.py][INFO] Epoch:[0/2](313300/4588595) loss:2.838 lr:0.0000100 epoch_Time:27065.0min: [2024-01-04 02:43:12,339][model8_pretrain.py][INFO] Epoch:[0/2](313300/4588595) loss:3.382 lr:0.0000100 epoch_Time:27065.0min: [2024-01-04 02:43:49,288][model8_pretrain.py][INFO] Epoch:[0/2](313400/4588595) loss:3.349 lr:0.0000100 epoch_Time:27064.0min: [2024-01-04 02:43:49,288][model8_pretrain.py][INFO] Epoch:[0/2](313400/4588595) loss:2.913 lr:0.0000100 epoch_Time:27064.0min: [2024-01-04 02:43:49,288][model8_pretrain.py][INFO] Epoch:[0/2](313400/4588595) loss:2.580 lr:0.0000100 epoch_Time:27064.0min: [2024-01-04 02:43:49,288][model8_pretrain.py][INFO] 
Epoch:[0/2](313400/4588595) loss:2.631 lr:0.0000100 epoch_Time:27064.0min: [2024-01-04 02:43:49,288][model8_pretrain.py][INFO] Epoch:[0/2](313400/4588595) loss:3.031 lr:0.0000100 epoch_Time:27064.0min: [2024-01-04 02:43:49,288][model8_pretrain.py][INFO] Epoch:[0/2](313400/4588595) loss:2.172 lr:0.0000100 epoch_Time:27064.0min: [2024-01-04 02:43:49,288][model8_pretrain.py][INFO] Epoch:[0/2](313400/4588595) loss:2.415 lr:0.0000100 epoch_Time:27064.0min: [2024-01-04 02:43:49,288][model8_pretrain.py][INFO] Epoch:[0/2](313400/4588595) loss:3.071 lr:0.0000100 epoch_Time:27064.0min: [2024-01-04 02:44:26,228][model8_pretrain.py][INFO] Epoch:[0/2](313500/4588595) loss:2.560 lr:0.0000100 epoch_Time:27064.0min: [2024-01-04 02:44:26,228][model8_pretrain.py][INFO] Epoch:[0/2](313500/4588595) loss:2.855 lr:0.0000100 epoch_Time:27064.0min: [2024-01-04 02:44:26,228][model8_pretrain.py][INFO] Epoch:[0/2](313500/4588595) loss:2.633 lr:0.0000100 epoch_Time:27064.0min: [2024-01-04 02:44:26,228][model8_pretrain.py][INFO] Epoch:[0/2](313500/4588595) loss:3.042 lr:0.0000100 epoch_Time:27064.0min: [2024-01-04 02:44:26,228][model8_pretrain.py][INFO] Epoch:[0/2](313500/4588595) loss:2.951 lr:0.0000100 epoch_Time:27064.0min: [2024-01-04 02:44:26,229][model8_pretrain.py][INFO] Epoch:[0/2](313500/4588595) loss:2.420 lr:0.0000100 epoch_Time:27064.0min: [2024-01-04 02:44:26,229][model8_pretrain.py][INFO] Epoch:[0/2](313500/4588595) loss:2.332 lr:0.0000100 epoch_Time:27064.0min: [2024-01-04 02:44:26,229][model8_pretrain.py][INFO] Epoch:[0/2](313500/4588595) loss:2.594 lr:0.0000100 epoch_Time:27064.0min: [2024-01-04 02:45:03,183][model8_pretrain.py][INFO] Epoch:[0/2](313600/4588595) loss:3.318 lr:0.0000100 epoch_Time:27063.0min: [2024-01-04 02:45:03,183][model8_pretrain.py][INFO] Epoch:[0/2](313600/4588595) loss:2.350 lr:0.0000100 epoch_Time:27063.0min: [2024-01-04 02:45:03,183][model8_pretrain.py][INFO] Epoch:[0/2](313600/4588595) loss:3.176 lr:0.0000100 epoch_Time:27063.0min: [2024-01-04 
02:45:03,183][model8_pretrain.py][INFO] Epoch:[0/2](313600/4588595) loss:2.786 lr:0.0000100 epoch_Time:27063.0min: [2024-01-04 02:45:03,183][model8_pretrain.py][INFO] Epoch:[0/2](313600/4588595) loss:3.192 lr:0.0000100 epoch_Time:27063.0min: [2024-01-04 02:45:03,183][model8_pretrain.py][INFO] Epoch:[0/2](313600/4588595) loss:2.646 lr:0.0000100 epoch_Time:27063.0min: [2024-01-04 02:45:03,183][model8_pretrain.py][INFO] Epoch:[0/2](313600/4588595) loss:2.900 lr:0.0000100 epoch_Time:27063.0min: [2024-01-04 02:45:03,183][model8_pretrain.py][INFO] Epoch:[0/2](313600/4588595) loss:2.705 lr:0.0000100 epoch_Time:27063.0min: [2024-01-04 02:45:40,119][model8_pretrain.py][INFO] Epoch:[0/2](313700/4588595) loss:3.292 lr:0.0000100 epoch_Time:27062.0min: [2024-01-04 02:45:40,119][model8_pretrain.py][INFO] Epoch:[0/2](313700/4588595) loss:2.483 lr:0.0000100 epoch_Time:27062.0min: [2024-01-04 02:45:40,119][model8_pretrain.py][INFO] Epoch:[0/2](313700/4588595) loss:2.941 lr:0.0000100 epoch_Time:27062.0min: [2024-01-04 02:45:40,119][model8_pretrain.py][INFO] Epoch:[0/2](313700/4588595) loss:2.771 lr:0.0000100 epoch_Time:27062.0min: [2024-01-04 02:45:40,120][model8_pretrain.py][INFO] Epoch:[0/2](313700/4588595) loss:2.804 lr:0.0000100 epoch_Time:27062.0min: [2024-01-04 02:45:40,119][model8_pretrain.py][INFO] Epoch:[0/2](313700/4588595) loss:2.534 lr:0.0000100 epoch_Time:27062.0min: [2024-01-04 02:45:40,120][model8_pretrain.py][INFO] Epoch:[0/2](313700/4588595) loss:2.472 lr:0.0000100 epoch_Time:27062.0min: [2024-01-04 02:45:40,120][model8_pretrain.py][INFO] Epoch:[0/2](313700/4588595) loss:3.114 lr:0.0000100 epoch_Time:27062.0min: [2024-01-04 02:46:17,055][model8_pretrain.py][INFO] Epoch:[0/2](313800/4588595) loss:3.113 lr:0.0000100 epoch_Time:27061.0min: [2024-01-04 02:46:17,055][model8_pretrain.py][INFO] Epoch:[0/2](313800/4588595) loss:3.268 lr:0.0000100 epoch_Time:27061.0min: [2024-01-04 02:46:17,055][model8_pretrain.py][INFO] Epoch:[0/2](313800/4588595) loss:2.541 lr:0.0000100 
epoch_Time:27061.0min: [2024-01-04 02:46:17,055][model8_pretrain.py][INFO] Epoch:[0/2](313800/4588595) loss:3.100 lr:0.0000100 epoch_Time:27061.0min: [2024-01-04 02:46:17,055][model8_pretrain.py][INFO] Epoch:[0/2](313800/4588595) loss:2.926 lr:0.0000100 epoch_Time:27061.0min: [2024-01-04 02:46:17,055][model8_pretrain.py][INFO] Epoch:[0/2](313800/4588595) loss:2.786 lr:0.0000100 epoch_Time:27061.0min: [2024-01-04 02:46:17,056][model8_pretrain.py][INFO] Epoch:[0/2](313800/4588595) loss:3.097 lr:0.0000100 epoch_Time:27061.0min: [2024-01-04 02:46:17,057][model8_pretrain.py][INFO] Epoch:[0/2](313800/4588595) loss:2.653 lr:0.0000100 epoch_Time:27061.0min: [2024-01-04 02:46:53,979][model8_pretrain.py][INFO] Epoch:[0/2](313900/4588595) loss:2.481 lr:0.0000100 epoch_Time:27060.0min: [2024-01-04 02:46:53,979][model8_pretrain.py][INFO] Epoch:[0/2](313900/4588595) loss:2.652 lr:0.0000100 epoch_Time:27060.0min: [2024-01-04 02:46:53,979][model8_pretrain.py][INFO] Epoch:[0/2](313900/4588595) loss:2.722 lr:0.0000100 epoch_Time:27060.0min: [2024-01-04 02:46:53,979][model8_pretrain.py][INFO] Epoch:[0/2](313900/4588595) loss:2.696 lr:0.0000100 epoch_Time:27060.0min: [2024-01-04 02:46:53,979][model8_pretrain.py][INFO] Epoch:[0/2](313900/4588595) loss:3.402 lr:0.0000100 epoch_Time:27060.0min: [2024-01-04 02:46:53,979][model8_pretrain.py][INFO] Epoch:[0/2](313900/4588595) loss:2.702 lr:0.0000100 epoch_Time:27060.0min: [2024-01-04 02:46:53,979][model8_pretrain.py][INFO] Epoch:[0/2](313900/4588595) loss:3.195 lr:0.0000100 epoch_Time:27060.0min: [2024-01-04 02:46:53,979][model8_pretrain.py][INFO] Epoch:[0/2](313900/4588595) loss:2.876 lr:0.0000100 epoch_Time:27060.0min: [2024-01-04 02:47:41,358][model8_pretrain.py][INFO] Epoch:[0/2](314000/4588595) loss:3.245 lr:0.0000100 epoch_Time:27062.0min: [2024-01-04 02:47:41,358][model8_pretrain.py][INFO] Epoch:[0/2](314000/4588595) loss:3.056 lr:0.0000100 epoch_Time:27062.0min: [2024-01-04 02:47:41,358][model8_pretrain.py][INFO] 
Epoch:[0/2](314000/4588595) loss:2.843 lr:0.0000100 epoch_Time:27062.0min: [2024-01-04 02:47:41,358][model8_pretrain.py][INFO] Epoch:[0/2](314000/4588595) loss:3.264 lr:0.0000100 epoch_Time:27062.0min: [2024-01-04 02:47:41,358][model8_pretrain.py][INFO] Epoch:[0/2](314000/4588595) loss:3.073 lr:0.0000100 epoch_Time:27062.0min: [2024-01-04 02:47:41,358][model8_pretrain.py][INFO] Epoch:[0/2](314000/4588595) loss:3.234 lr:0.0000100 epoch_Time:27062.0min: [2024-01-04 02:47:41,358][model8_pretrain.py][INFO] Epoch:[0/2](314000/4588595) loss:2.680 lr:0.0000100 epoch_Time:27062.0min: [2024-01-04 02:47:41,358][model8_pretrain.py][INFO] Epoch:[0/2](314000/4588595) loss:3.361 lr:0.0000100 epoch_Time:27062.0min: [2024-01-04 02:48:18,285][model8_pretrain.py][INFO] Epoch:[0/2](314100/4588595) loss:3.086 lr:0.0000100 epoch_Time:27061.0min: [2024-01-04 02:48:18,286][model8_pretrain.py][INFO] Epoch:[0/2](314100/4588595) loss:3.099 lr:0.0000100 epoch_Time:27061.0min: [2024-01-04 02:48:18,286][model8_pretrain.py][INFO] Epoch:[0/2](314100/4588595) loss:3.092 lr:0.0000100 epoch_Time:27061.0min: [2024-01-04 02:48:18,286][model8_pretrain.py][INFO] Epoch:[0/2](314100/4588595) loss:2.402 lr:0.0000100 epoch_Time:27061.0min: [2024-01-04 02:48:18,286][model8_pretrain.py][INFO] Epoch:[0/2](314100/4588595) loss:2.668 lr:0.0000100 epoch_Time:27061.0min: [2024-01-04 02:48:18,285][model8_pretrain.py][INFO] Epoch:[0/2](314100/4588595) loss:2.836 lr:0.0000100 epoch_Time:27061.0min: [2024-01-04 02:48:18,286][model8_pretrain.py][INFO] Epoch:[0/2](314100/4588595) loss:2.892 lr:0.0000100 epoch_Time:27061.0min: [2024-01-04 02:48:18,286][model8_pretrain.py][INFO] Epoch:[0/2](314100/4588595) loss:2.974 lr:0.0000100 epoch_Time:27061.0min: [2024-01-04 02:48:55,221][model8_pretrain.py][INFO] Epoch:[0/2](314200/4588595) loss:2.918 lr:0.0000100 epoch_Time:27060.0min: [2024-01-04 02:48:55,221][model8_pretrain.py][INFO] Epoch:[0/2](314200/4588595) loss:2.943 lr:0.0000100 epoch_Time:27060.0min: [2024-01-04 
02:48:55,221][model8_pretrain.py][INFO] Epoch:[0/2](314200/4588595) loss:2.623 lr:0.0000100 epoch_Time:27060.0min: [2024-01-04 02:48:55,221][model8_pretrain.py][INFO] Epoch:[0/2](314200/4588595) loss:3.495 lr:0.0000100 epoch_Time:27060.0min: [2024-01-04 02:48:55,221][model8_pretrain.py][INFO] Epoch:[0/2](314200/4588595) loss:2.715 lr:0.0000100 epoch_Time:27060.0min: [2024-01-04 02:48:55,221][model8_pretrain.py][INFO] Epoch:[0/2](314200/4588595) loss:2.967 lr:0.0000100 epoch_Time:27060.0min: [2024-01-04 02:48:55,221][model8_pretrain.py][INFO] Epoch:[0/2](314200/4588595) loss:2.845 lr:0.0000100 epoch_Time:27060.0min: [2024-01-04 02:48:55,221][model8_pretrain.py][INFO] Epoch:[0/2](314200/4588595) loss:3.149 lr:0.0000100 epoch_Time:27060.0min: [2024-01-04 02:49:32,158][model8_pretrain.py][INFO] Epoch:[0/2](314300/4588595) loss:2.771 lr:0.0000100 epoch_Time:27059.0min: [2024-01-04 02:49:32,159][model8_pretrain.py][INFO] Epoch:[0/2](314300/4588595) loss:3.667 lr:0.0000100 epoch_Time:27059.0min: [2024-01-04 02:49:32,159][model8_pretrain.py][INFO] Epoch:[0/2](314300/4588595) loss:3.507 lr:0.0000100 epoch_Time:27059.0min: [2024-01-04 02:49:32,159][model8_pretrain.py][INFO] Epoch:[0/2](314300/4588595) loss:3.052 lr:0.0000100 epoch_Time:27059.0min: [2024-01-04 02:49:32,159][model8_pretrain.py][INFO] Epoch:[0/2](314300/4588595) loss:2.633 lr:0.0000100 epoch_Time:27059.0min: [2024-01-04 02:49:32,159][model8_pretrain.py][INFO] Epoch:[0/2](314300/4588595) loss:2.891 lr:0.0000100 epoch_Time:27059.0min: [2024-01-04 02:49:32,159][model8_pretrain.py][INFO] Epoch:[0/2](314300/4588595) loss:2.708 lr:0.0000100 epoch_Time:27059.0min: [2024-01-04 02:49:32,159][model8_pretrain.py][INFO] Epoch:[0/2](314300/4588595) loss:2.587 lr:0.0000100 epoch_Time:27059.0min: [2024-01-04 02:50:09,097][model8_pretrain.py][INFO] Epoch:[0/2](314400/4588595) loss:3.309 lr:0.0000100 epoch_Time:27058.0min: [2024-01-04 02:50:09,097][model8_pretrain.py][INFO] Epoch:[0/2](314400/4588595) loss:3.293 lr:0.0000100 
epoch_Time:27058.0min: [2024-01-04 02:50:09,097][model8_pretrain.py][INFO] Epoch:[0/2](314400/4588595) loss:3.058 lr:0.0000100 epoch_Time:27058.0min: [2024-01-04 02:50:09,097][model8_pretrain.py][INFO] Epoch:[0/2](314400/4588595) loss:2.936 lr:0.0000100 epoch_Time:27058.0min: [2024-01-04 02:50:09,097][model8_pretrain.py][INFO] Epoch:[0/2](314400/4588595) loss:3.070 lr:0.0000100 epoch_Time:27058.0min: [2024-01-04 02:50:09,097][model8_pretrain.py][INFO] Epoch:[0/2](314400/4588595) loss:2.514 lr:0.0000100 epoch_Time:27058.0min: [2024-01-04 02:50:09,097][model8_pretrain.py][INFO] Epoch:[0/2](314400/4588595) loss:3.070 lr:0.0000100 epoch_Time:27058.0min: [2024-01-04 02:50:09,097][model8_pretrain.py][INFO] Epoch:[0/2](314400/4588595) loss:2.738 lr:0.0000100 epoch_Time:27058.0min: [2024-01-04 02:50:46,053][model8_pretrain.py][INFO] Epoch:[0/2](314500/4588595) loss:2.964 lr:0.0000100 epoch_Time:27058.0min: [2024-01-04 02:50:46,053][model8_pretrain.py][INFO] Epoch:[0/2](314500/4588595) loss:2.944 lr:0.0000100 epoch_Time:27058.0min: [2024-01-04 02:50:46,053][model8_pretrain.py][INFO] Epoch:[0/2](314500/4588595) loss:3.351 lr:0.0000100 epoch_Time:27058.0min: [2024-01-04 02:50:46,053][model8_pretrain.py][INFO] Epoch:[0/2](314500/4588595) loss:2.531 lr:0.0000100 epoch_Time:27058.0min: [2024-01-04 02:50:46,053][model8_pretrain.py][INFO] Epoch:[0/2](314500/4588595) loss:3.148 lr:0.0000100 epoch_Time:27058.0min: [2024-01-04 02:50:46,053][model8_pretrain.py][INFO] Epoch:[0/2](314500/4588595) loss:2.874 lr:0.0000100 epoch_Time:27058.0min: [2024-01-04 02:50:46,053][model8_pretrain.py][INFO] Epoch:[0/2](314500/4588595) loss:2.953 lr:0.0000100 epoch_Time:27058.0min: [2024-01-04 02:50:46,053][model8_pretrain.py][INFO] Epoch:[0/2](314500/4588595) loss:2.609 lr:0.0000100 epoch_Time:27058.0min: [2024-01-04 02:51:22,985][model8_pretrain.py][INFO] Epoch:[0/2](314600/4588595) loss:2.904 lr:0.0000100 epoch_Time:27057.0min: [2024-01-04 02:51:22,986][model8_pretrain.py][INFO] 
Epoch:[0/2](314600/4588595) loss:2.944 lr:0.0000100 epoch_Time:27057.0min: [2024-01-04 02:51:22,986][model8_pretrain.py][INFO] Epoch:[0/2](314600/4588595) loss:2.767 lr:0.0000100 epoch_Time:27057.0min: [2024-01-04 02:51:22,986][model8_pretrain.py][INFO] Epoch:[0/2](314600/4588595) loss:2.943 lr:0.0000100 epoch_Time:27057.0min: [2024-01-04 02:51:22,986][model8_pretrain.py][INFO] Epoch:[0/2](314600/4588595) loss:3.206 lr:0.0000100 epoch_Time:27057.0min: [2024-01-04 02:51:22,986][model8_pretrain.py][INFO] Epoch:[0/2](314600/4588595) loss:3.011 lr:0.0000100 epoch_Time:27057.0min: [2024-01-04 02:51:22,986][model8_pretrain.py][INFO] Epoch:[0/2](314600/4588595) loss:3.040 lr:0.0000100 epoch_Time:27057.0min: [2024-01-04 02:51:22,986][model8_pretrain.py][INFO] Epoch:[0/2](314600/4588595) loss:3.064 lr:0.0000100 epoch_Time:27057.0min: [2024-01-04 02:51:59,914][model8_pretrain.py][INFO] Epoch:[0/2](314700/4588595) loss:3.267 lr:0.0000100 epoch_Time:27055.0min: [2024-01-04 02:51:59,914][model8_pretrain.py][INFO] Epoch:[0/2](314700/4588595) loss:2.435 lr:0.0000100 epoch_Time:27055.0min: [2024-01-04 02:51:59,914][model8_pretrain.py][INFO] Epoch:[0/2](314700/4588595) loss:3.349 lr:0.0000100 epoch_Time:27055.0min: [2024-01-04 02:51:59,914][model8_pretrain.py][INFO] Epoch:[0/2](314700/4588595) loss:2.749 lr:0.0000100 epoch_Time:27055.0min: [2024-01-04 02:51:59,914][model8_pretrain.py][INFO] Epoch:[0/2](314700/4588595) loss:3.051 lr:0.0000100 epoch_Time:27055.0min: [2024-01-04 02:51:59,914][model8_pretrain.py][INFO] Epoch:[0/2](314700/4588595) loss:3.254 lr:0.0000100 epoch_Time:27055.0min: [2024-01-04 02:51:59,914][model8_pretrain.py][INFO] Epoch:[0/2](314700/4588595) loss:2.760 lr:0.0000100 epoch_Time:27055.0min: [2024-01-04 02:51:59,914][model8_pretrain.py][INFO] Epoch:[0/2](314700/4588595) loss:3.267 lr:0.0000100 epoch_Time:27055.0min: [2024-01-04 02:52:46,984][model8_pretrain.py][INFO] Epoch:[0/2](314800/4588595) loss:2.572 lr:0.0000100 epoch_Time:27058.0min: [2024-01-04 
02:52:46,984][model8_pretrain.py][INFO] Epoch:[0/2](314800/4588595) loss:2.789 lr:0.0000100 epoch_Time:27058.0min: [2024-01-04 02:52:46,984][model8_pretrain.py][INFO] Epoch:[0/2](314800/4588595) loss:3.039 lr:0.0000100 epoch_Time:27058.0min: [2024-01-04 02:52:46,984][model8_pretrain.py][INFO] Epoch:[0/2](314800/4588595) loss:2.862 lr:0.0000100 epoch_Time:27058.0min: [2024-01-04 02:52:46,984][model8_pretrain.py][INFO] Epoch:[0/2](314800/4588595) loss:2.599 lr:0.0000100 epoch_Time:27058.0min: [2024-01-04 02:52:46,984][model8_pretrain.py][INFO] Epoch:[0/2](314800/4588595) loss:2.715 lr:0.0000100 epoch_Time:27058.0min: [2024-01-04 02:52:46,984][model8_pretrain.py][INFO] Epoch:[0/2](314800/4588595) loss:3.146 lr:0.0000100 epoch_Time:27058.0min: [2024-01-04 02:52:46,984][model8_pretrain.py][INFO] Epoch:[0/2](314800/4588595) loss:1.935 lr:0.0000100 epoch_Time:27058.0min: [2024-01-04 02:53:23,892][model8_pretrain.py][INFO] Epoch:[0/2](314900/4588595) loss:3.122 lr:0.0000100 epoch_Time:27056.0min: [2024-01-04 02:53:23,892][model8_pretrain.py][INFO] Epoch:[0/2](314900/4588595) loss:2.723 lr:0.0000100 epoch_Time:27056.0min: [2024-01-04 02:53:23,892][model8_pretrain.py][INFO] Epoch:[0/2](314900/4588595) loss:2.719 lr:0.0000100 epoch_Time:27056.0min: [2024-01-04 02:53:23,892][model8_pretrain.py][INFO] Epoch:[0/2](314900/4588595) loss:3.035 lr:0.0000100 epoch_Time:27056.0min: [2024-01-04 02:53:23,892][model8_pretrain.py][INFO] Epoch:[0/2](314900/4588595) loss:3.520 lr:0.0000100 epoch_Time:27056.0min: [2024-01-04 02:53:23,892][model8_pretrain.py][INFO] Epoch:[0/2](314900/4588595) loss:2.943 lr:0.0000100 epoch_Time:27056.0min: [2024-01-04 02:53:23,892][model8_pretrain.py][INFO] Epoch:[0/2](314900/4588595) loss:3.303 lr:0.0000100 epoch_Time:27056.0min: [2024-01-04 02:53:23,893][model8_pretrain.py][INFO] Epoch:[0/2](314900/4588595) loss:2.611 lr:0.0000100 epoch_Time:27056.0min: [2024-01-04 02:54:00,843][model8_pretrain.py][INFO] Epoch:[0/2](315000/4588595) loss:2.751 lr:0.0000100 
epoch_Time:27055.0min: [2024-01-04 02:54:00,843][model8_pretrain.py][INFO] Epoch:[0/2](315000/4588595) loss:3.236 lr:0.0000100 epoch_Time:27055.0min: [2024-01-04 02:54:00,843][model8_pretrain.py][INFO] Epoch:[0/2](315000/4588595) loss:3.057 lr:0.0000100 epoch_Time:27055.0min: [2024-01-04 02:54:00,843][model8_pretrain.py][INFO] Epoch:[0/2](315000/4588595) loss:3.237 lr:0.0000100 epoch_Time:27055.0min: [2024-01-04 02:54:00,843][model8_pretrain.py][INFO] Epoch:[0/2](315000/4588595) loss:3.261 lr:0.0000100 epoch_Time:27055.0min: [2024-01-04 02:54:00,843][model8_pretrain.py][INFO] Epoch:[0/2](315000/4588595) loss:2.548 lr:0.0000100 epoch_Time:27055.0min: [2024-01-04 02:54:00,843][model8_pretrain.py][INFO] Epoch:[0/2](315000/4588595) loss:2.840 lr:0.0000100 epoch_Time:27055.0min: [2024-01-04 02:54:00,843][model8_pretrain.py][INFO] Epoch:[0/2](315000/4588595) loss:2.848 lr:0.0000100 epoch_Time:27055.0min: [2024-01-04 02:54:37,791][model8_pretrain.py][INFO] Epoch:[0/2](315100/4588595) loss:2.518 lr:0.0000100 epoch_Time:27055.0min: [2024-01-04 02:54:37,791][model8_pretrain.py][INFO] Epoch:[0/2](315100/4588595) loss:3.014 lr:0.0000100 epoch_Time:27055.0min: [2024-01-04 02:54:37,791][model8_pretrain.py][INFO] Epoch:[0/2](315100/4588595) loss:3.133 lr:0.0000100 epoch_Time:27055.0min: [2024-01-04 02:54:37,791][model8_pretrain.py][INFO] Epoch:[0/2](315100/4588595) loss:2.080 lr:0.0000100 epoch_Time:27055.0min: [2024-01-04 02:54:37,791][model8_pretrain.py][INFO] Epoch:[0/2](315100/4588595) loss:3.728 lr:0.0000100 epoch_Time:27055.0min: [2024-01-04 02:54:37,792][model8_pretrain.py][INFO] Epoch:[0/2](315100/4588595) loss:2.959 lr:0.0000100 epoch_Time:27055.0min: [2024-01-04 02:54:37,792][model8_pretrain.py][INFO] Epoch:[0/2](315100/4588595) loss:2.704 lr:0.0000100 epoch_Time:27055.0min: [2024-01-04 02:54:37,792][model8_pretrain.py][INFO] Epoch:[0/2](315100/4588595) loss:2.351 lr:0.0000100 epoch_Time:27055.0min: [2024-01-04 02:55:14,730][model8_pretrain.py][INFO] 
Epoch:[0/2](315200/4588595) loss:3.194 lr:0.0000100 epoch_Time:27054.0min: [2024-01-04 02:55:14,730][model8_pretrain.py][INFO] Epoch:[0/2](315200/4588595) loss:3.318 lr:0.0000100 epoch_Time:27054.0min: [2024-01-04 02:55:14,730][model8_pretrain.py][INFO] Epoch:[0/2](315200/4588595) loss:3.357 lr:0.0000100 epoch_Time:27054.0min: [2024-01-04 02:55:14,730][model8_pretrain.py][INFO] Epoch:[0/2](315200/4588595) loss:3.234 lr:0.0000100 epoch_Time:27054.0min: [2024-01-04 02:55:14,730][model8_pretrain.py][INFO] Epoch:[0/2](315200/4588595) loss:3.066 lr:0.0000100 epoch_Time:27054.0min: [2024-01-04 02:55:14,730][model8_pretrain.py][INFO] Epoch:[0/2](315200/4588595) loss:2.880 lr:0.0000100 epoch_Time:27054.0min: [2024-01-04 02:55:14,730][model8_pretrain.py][INFO] Epoch:[0/2](315200/4588595) loss:2.848 lr:0.0000100 epoch_Time:27054.0min: [2024-01-04 02:55:14,730][model8_pretrain.py][INFO] Epoch:[0/2](315200/4588595) loss:3.179 lr:0.0000100 epoch_Time:27054.0min: [2024-01-04 02:55:51,677][model8_pretrain.py][INFO] Epoch:[0/2](315300/4588595) loss:2.455 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:55:51,677][model8_pretrain.py][INFO] Epoch:[0/2](315300/4588595) loss:2.941 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:55:51,677][model8_pretrain.py][INFO] Epoch:[0/2](315300/4588595) loss:3.096 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:55:51,677][model8_pretrain.py][INFO] Epoch:[0/2](315300/4588595) loss:2.593 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:55:51,677][model8_pretrain.py][INFO] Epoch:[0/2](315300/4588595) loss:2.149 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:55:51,677][model8_pretrain.py][INFO] Epoch:[0/2](315300/4588595) loss:2.170 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:55:51,677][model8_pretrain.py][INFO] Epoch:[0/2](315300/4588595) loss:2.767 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:55:51,677][model8_pretrain.py][INFO] Epoch:[0/2](315300/4588595) loss:2.354 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 
02:56:28,613][model8_pretrain.py][INFO] Epoch:[0/2](315400/4588595) loss:2.960 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:56:28,613][model8_pretrain.py][INFO] Epoch:[0/2](315400/4588595) loss:3.174 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:56:28,613][model8_pretrain.py][INFO] Epoch:[0/2](315400/4588595) loss:3.002 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:56:28,613][model8_pretrain.py][INFO] Epoch:[0/2](315400/4588595) loss:2.772 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:56:28,613][model8_pretrain.py][INFO] Epoch:[0/2](315400/4588595) loss:3.078 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:56:28,613][model8_pretrain.py][INFO] Epoch:[0/2](315400/4588595) loss:2.936 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:56:28,613][model8_pretrain.py][INFO] Epoch:[0/2](315400/4588595) loss:2.753 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:56:28,614][model8_pretrain.py][INFO] Epoch:[0/2](315400/4588595) loss:3.350 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:57:05,548][model8_pretrain.py][INFO] Epoch:[0/2](315500/4588595) loss:3.631 lr:0.0000100 epoch_Time:27051.0min: [2024-01-04 02:57:05,548][model8_pretrain.py][INFO] Epoch:[0/2](315500/4588595) loss:3.203 lr:0.0000100 epoch_Time:27051.0min: [2024-01-04 02:57:05,548][model8_pretrain.py][INFO] Epoch:[0/2](315500/4588595) loss:2.996 lr:0.0000100 epoch_Time:27051.0min: [2024-01-04 02:57:05,548][model8_pretrain.py][INFO] Epoch:[0/2](315500/4588595) loss:3.147 lr:0.0000100 epoch_Time:27051.0min: [2024-01-04 02:57:05,548][model8_pretrain.py][INFO] Epoch:[0/2](315500/4588595) loss:2.817 lr:0.0000100 epoch_Time:27051.0min: [2024-01-04 02:57:05,548][model8_pretrain.py][INFO] Epoch:[0/2](315500/4588595) loss:2.942 lr:0.0000100 epoch_Time:27051.0min: [2024-01-04 02:57:05,548][model8_pretrain.py][INFO] Epoch:[0/2](315500/4588595) loss:2.529 lr:0.0000100 epoch_Time:27051.0min: [2024-01-04 02:57:05,548][model8_pretrain.py][INFO] Epoch:[0/2](315500/4588595) loss:2.596 lr:0.0000100 
epoch_Time:27051.0min: [2024-01-04 02:57:52,633][model8_pretrain.py][INFO] Epoch:[0/2](315600/4588595) loss:3.204 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:57:52,633][model8_pretrain.py][INFO] Epoch:[0/2](315600/4588595) loss:2.839 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:57:52,633][model8_pretrain.py][INFO] Epoch:[0/2](315600/4588595) loss:3.001 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:57:52,633][model8_pretrain.py][INFO] Epoch:[0/2](315600/4588595) loss:2.277 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:57:52,633][model8_pretrain.py][INFO] Epoch:[0/2](315600/4588595) loss:2.534 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:57:52,633][model8_pretrain.py][INFO] Epoch:[0/2](315600/4588595) loss:2.911 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:57:52,633][model8_pretrain.py][INFO] Epoch:[0/2](315600/4588595) loss:2.736 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:57:52,633][model8_pretrain.py][INFO] Epoch:[0/2](315600/4588595) loss:3.214 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:58:29,572][model8_pretrain.py][INFO] Epoch:[0/2](315700/4588595) loss:3.053 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:58:29,572][model8_pretrain.py][INFO] Epoch:[0/2](315700/4588595) loss:3.270 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:58:29,572][model8_pretrain.py][INFO] Epoch:[0/2](315700/4588595) loss:3.201 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:58:29,572][model8_pretrain.py][INFO] Epoch:[0/2](315700/4588595) loss:3.161 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:58:29,572][model8_pretrain.py][INFO] Epoch:[0/2](315700/4588595) loss:3.375 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:58:29,572][model8_pretrain.py][INFO] Epoch:[0/2](315700/4588595) loss:2.671 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:58:29,572][model8_pretrain.py][INFO] Epoch:[0/2](315700/4588595) loss:3.072 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:58:29,572][model8_pretrain.py][INFO] 
Epoch:[0/2](315700/4588595) loss:2.953 lr:0.0000100 epoch_Time:27052.0min: [2024-01-04 02:59:06,510][model8_pretrain.py][INFO] Epoch:[0/2](315800/4588595) loss:3.018 lr:0.0000100 epoch_Time:27051.0min: [2024-01-04 02:59:06,510][model8_pretrain.py][INFO] Epoch:[0/2](315800/4588595) loss:2.662 lr:0.0000100 epoch_Time:27051.0min: [2024-01-04 02:59:06,510][model8_pretrain.py][INFO] Epoch:[0/2](315800/4588595) loss:3.269 lr:0.0000100 epoch_Time:27051.0min: [2024-01-04 02:59:06,511][model8_pretrain.py][INFO] Epoch:[0/2](315800/4588595) loss:3.311 lr:0.0000100 epoch_Time:27051.0min: [2024-01-04 02:59:06,511][model8_pretrain.py][INFO] Epoch:[0/2](315800/4588595) loss:2.858 lr:0.0000100 epoch_Time:27051.0min: [2024-01-04 02:59:06,511][model8_pretrain.py][INFO] Epoch:[0/2](315800/4588595) loss:2.941 lr:0.0000100 epoch_Time:27051.0min: [2024-01-04 02:59:06,511][model8_pretrain.py][INFO] Epoch:[0/2](315800/4588595) loss:2.057 lr:0.0000100 epoch_Time:27051.0min: [2024-01-04 02:59:06,512][model8_pretrain.py][INFO] Epoch:[0/2](315800/4588595) loss:2.578 lr:0.0000100 epoch_Time:27051.0min: [2024-01-04 02:59:43,440][model8_pretrain.py][INFO] Epoch:[0/2](315900/4588595) loss:2.493 lr:0.0000100 epoch_Time:27050.0min: [2024-01-04 02:59:43,441][model8_pretrain.py][INFO] Epoch:[0/2](315900/4588595) loss:2.734 lr:0.0000100 epoch_Time:27050.0min: [2024-01-04 02:59:43,441][model8_pretrain.py][INFO] Epoch:[0/2](315900/4588595) loss:2.398 lr:0.0000100 epoch_Time:27050.0min: [2024-01-04 02:59:43,441][model8_pretrain.py][INFO] Epoch:[0/2](315900/4588595) loss:3.261 lr:0.0000100 epoch_Time:27050.0min: [2024-01-04 02:59:43,441][model8_pretrain.py][INFO] Epoch:[0/2](315900/4588595) loss:2.365 lr:0.0000100 epoch_Time:27050.0min: [2024-01-04 02:59:43,441][model8_pretrain.py][INFO] Epoch:[0/2](315900/4588595) loss:2.506 lr:0.0000100 epoch_Time:27050.0min: [2024-01-04 02:59:43,441][model8_pretrain.py][INFO] Epoch:[0/2](315900/4588595) loss:2.764 lr:0.0000100 epoch_Time:27050.0min: [2024-01-04 
02:59:43,441][model8_pretrain.py][INFO] Epoch:[0/2](315900/4588595) loss:2.536 lr:0.0000100 epoch_Time:27050.0min: [2024-01-04 03:00:20,377][model8_pretrain.py][INFO] Epoch:[0/2](316000/4588595) loss:3.473 lr:0.0000100 epoch_Time:27049.0min: [2024-01-04 03:00:20,377][model8_pretrain.py][INFO] Epoch:[0/2](316000/4588595) loss:3.015 lr:0.0000100 epoch_Time:27049.0min: [2024-01-04 03:00:20,377][model8_pretrain.py][INFO] Epoch:[0/2](316000/4588595) loss:3.237 lr:0.0000100 epoch_Time:27049.0min: [2024-01-04 03:00:20,377][model8_pretrain.py][INFO] Epoch:[0/2](316000/4588595) loss:2.858 lr:0.0000100 epoch_Time:27049.0min: [2024-01-04 03:00:20,377][model8_pretrain.py][INFO] Epoch:[0/2](316000/4588595) loss:2.928 lr:0.0000100 epoch_Time:27049.0min: [2024-01-04 03:00:20,377][model8_pretrain.py][INFO] Epoch:[0/2](316000/4588595) loss:3.058 lr:0.0000100 epoch_Time:27049.0min: [2024-01-04 03:00:20,377][model8_pretrain.py][INFO] Epoch:[0/2](316000/4588595) loss:2.572 lr:0.0000100 epoch_Time:27049.0min: [2024-01-04 03:00:20,377][model8_pretrain.py][INFO] Epoch:[0/2](316000/4588595) loss:2.067 lr:0.0000100 epoch_Time:27049.0min: [2024-01-04 03:00:57,317][model8_pretrain.py][INFO] Epoch:[0/2](316100/4588595) loss:2.527 lr:0.0000100 epoch_Time:27048.0min: [2024-01-04 03:00:57,317][model8_pretrain.py][INFO] Epoch:[0/2](316100/4588595) loss:3.073 lr:0.0000100 epoch_Time:27048.0min: [2024-01-04 03:00:57,317][model8_pretrain.py][INFO] Epoch:[0/2](316100/4588595) loss:2.522 lr:0.0000100 epoch_Time:27048.0min: [2024-01-04 03:00:57,317][model8_pretrain.py][INFO] Epoch:[0/2](316100/4588595) loss:2.745 lr:0.0000100 epoch_Time:27048.0min: [2024-01-04 03:00:57,317][model8_pretrain.py][INFO] Epoch:[0/2](316100/4588595) loss:2.627 lr:0.0000100 epoch_Time:27048.0min: [2024-01-04 03:00:57,317][model8_pretrain.py][INFO] Epoch:[0/2](316100/4588595) loss:2.636 lr:0.0000100 epoch_Time:27048.0min: [2024-01-04 03:00:57,317][model8_pretrain.py][INFO] Epoch:[0/2](316100/4588595) loss:2.531 lr:0.0000100 
epoch_Time:27048.0min: [2024-01-04 03:00:57,317][model8_pretrain.py][INFO] Epoch:[0/2](316100/4588595) loss:2.536 lr:0.0000100 epoch_Time:27048.0min: [2024-01-04 03:01:34,243][model8_pretrain.py][INFO] Epoch:[0/2](316200/4588595) loss:3.026 lr:0.0000100 epoch_Time:27048.0min: [2024-01-04 03:01:34,243][model8_pretrain.py][INFO] Epoch:[0/2](316200/4588595) loss:2.766 lr:0.0000100 epoch_Time:27048.0min: [2024-01-04 03:01:34,243][model8_pretrain.py][INFO] Epoch:[0/2](316200/4588595) loss:2.928 lr:0.0000100 epoch_Time:27048.0min: [2024-01-04 03:01:34,243][model8_pretrain.py][INFO] Epoch:[0/2](316200/4588595) loss:3.074 lr:0.0000100 epoch_Time:27048.0min: [2024-01-04 03:01:34,243][model8_pretrain.py][INFO] Epoch:[0/2](316200/4588595) loss:2.797 lr:0.0000100 epoch_Time:27048.0min: [2024-01-04 03:01:34,243][model8_pretrain.py][INFO] Epoch:[0/2](316200/4588595) loss:3.198 lr:0.0000100 epoch_Time:27048.0min: [2024-01-04 03:01:34,243][model8_pretrain.py][INFO] Epoch:[0/2](316200/4588595) loss:2.332 lr:0.0000100 epoch_Time:27048.0min: [2024-01-04 03:01:34,244][model8_pretrain.py][INFO] Epoch:[0/2](316200/4588595) loss:3.063 lr:0.0000100 epoch_Time:27048.0min: [2024-01-04 03:02:11,172][model8_pretrain.py][INFO] Epoch:[0/2](316300/4588595) loss:3.038 lr:0.0000100 epoch_Time:27046.0min: [2024-01-04 03:02:11,172][model8_pretrain.py][INFO] Epoch:[0/2](316300/4588595) loss:3.160 lr:0.0000100 epoch_Time:27046.0min: [2024-01-04 03:02:11,172][model8_pretrain.py][INFO] Epoch:[0/2](316300/4588595) loss:3.065 lr:0.0000100 epoch_Time:27046.0min: [2024-01-04 03:02:11,172][model8_pretrain.py][INFO] Epoch:[0/2](316300/4588595) loss:2.996 lr:0.0000100 epoch_Time:27046.0min: [2024-01-04 03:02:11,172][model8_pretrain.py][INFO] Epoch:[0/2](316300/4588595) loss:2.982 lr:0.0000100 epoch_Time:27046.0min: [2024-01-04 03:02:11,172][model8_pretrain.py][INFO] Epoch:[0/2](316300/4588595) loss:2.576 lr:0.0000100 epoch_Time:27046.0min: [2024-01-04 03:02:11,172][model8_pretrain.py][INFO] 
Epoch:[0/2](316300/4588595) loss:2.869 lr:0.0000100 epoch_Time:27046.0min: [2024-01-04 03:02:11,172][model8_pretrain.py][INFO] Epoch:[0/2](316300/4588595) loss:2.875 lr:0.0000100 epoch_Time:27046.0min: [2024-01-04 03:02:58,577][model8_pretrain.py][INFO] Epoch:[0/2](316400/4588595) loss:3.032 lr:0.0000100 epoch_Time:27048.0min: [2024-01-04 03:02:58,577][model8_pretrain.py][INFO] Epoch:[0/2](316400/4588595) loss:2.583 lr:0.0000100 epoch_Time:27048.0min: [2024-01-04 03:02:58,577][model8_pretrain.py][INFO] Epoch:[0/2](316400/4588595) loss:1.868 lr:0.0000100 epoch_Time:27048.0min: [2024-01-04 03:02:58,577][model8_pretrain.py][INFO] Epoch:[0/2](316400/4588595) loss:3.065 lr:0.0000100 epoch_Time:27048.0min: [2024-01-04 03:02:58,577][model8_pretrain.py][INFO] Epoch:[0/2](316400/4588595) loss:2.703 lr:0.0000100 epoch_Time:27048.0min: [2024-01-04 03:02:58,577][model8_pretrain.py][INFO] Epoch:[0/2](316400/4588595) loss:3.290 lr:0.0000100 epoch_Time:27048.0min: [2024-01-04 03:02:58,577][model8_pretrain.py][INFO] Epoch:[0/2](316400/4588595) loss:2.793 lr:0.0000100 epoch_Time:27048.0min: [2024-01-04 03:02:58,577][model8_pretrain.py][INFO] Epoch:[0/2](316400/4588595) loss:3.307 lr:0.0000100 epoch_Time:27048.0min: [2024-01-04 03:03:35,514][model8_pretrain.py][INFO] Epoch:[0/2](316500/4588595) loss:2.959 lr:0.0000100 epoch_Time:27047.0min: [2024-01-04 03:03:35,514][model8_pretrain.py][INFO] Epoch:[0/2](316500/4588595) loss:3.142 lr:0.0000100 epoch_Time:27047.0min: [2024-01-04 03:03:35,514][model8_pretrain.py][INFO] Epoch:[0/2](316500/4588595) loss:3.166 lr:0.0000100 epoch_Time:27047.0min: [2024-01-04 03:03:35,514][model8_pretrain.py][INFO] Epoch:[0/2](316500/4588595) loss:2.739 lr:0.0000100 epoch_Time:27047.0min: [2024-01-04 03:03:35,514][model8_pretrain.py][INFO] Epoch:[0/2](316500/4588595) loss:2.902 lr:0.0000100 epoch_Time:27047.0min: [2024-01-04 03:03:35,514][model8_pretrain.py][INFO] Epoch:[0/2](316500/4588595) loss:2.893 lr:0.0000100 epoch_Time:27047.0min: [2024-01-04 
03:03:35,514][model8_pretrain.py][INFO] Epoch:[0/2](316500/4588595) loss:2.623 lr:0.0000100 epoch_Time:27047.0min: [2024-01-04 03:03:35,514][model8_pretrain.py][INFO] Epoch:[0/2](316500/4588595) loss:2.001 lr:0.0000100 epoch_Time:27047.0min: [2024-01-04 03:04:12,457][model8_pretrain.py][INFO] Epoch:[0/2](316600/4588595) loss:2.709 lr:0.0000100 epoch_Time:27046.0min: [2024-01-04 03:04:12,457][model8_pretrain.py][INFO] Epoch:[0/2](316600/4588595) loss:2.951 lr:0.0000100 epoch_Time:27046.0min: [2024-01-04 03:04:12,457][model8_pretrain.py][INFO] Epoch:[0/2](316600/4588595) loss:2.724 lr:0.0000100 epoch_Time:27046.0min: [2024-01-04 03:04:12,457][model8_pretrain.py][INFO] Epoch:[0/2](316600/4588595) loss:3.188 lr:0.0000100 epoch_Time:27046.0min: [2024-01-04 03:04:12,457][model8_pretrain.py][INFO] Epoch:[0/2](316600/4588595) loss:3.079 lr:0.0000100 epoch_Time:27046.0min: [2024-01-04 03:04:12,457][model8_pretrain.py][INFO] Epoch:[0/2](316600/4588595) loss:3.006 lr:0.0000100 epoch_Time:27046.0min: [2024-01-04 03:04:12,457][model8_pretrain.py][INFO] Epoch:[0/2](316600/4588595) loss:3.400 lr:0.0000100 epoch_Time:27046.0min: [2024-01-04 03:04:12,457][model8_pretrain.py][INFO] Epoch:[0/2](316600/4588595) loss:2.655 lr:0.0000100 epoch_Time:27046.0min: [2024-01-04 03:04:49,397][model8_pretrain.py][INFO] Epoch:[0/2](316700/4588595) loss:2.728 lr:0.0000100 epoch_Time:27045.0min: [2024-01-04 03:04:49,397][model8_pretrain.py][INFO] Epoch:[0/2](316700/4588595) loss:2.592 lr:0.0000100 epoch_Time:27045.0min: [2024-01-04 03:04:49,397][model8_pretrain.py][INFO] Epoch:[0/2](316700/4588595) loss:2.640 lr:0.0000100 epoch_Time:27045.0min: [2024-01-04 03:04:49,397][model8_pretrain.py][INFO] Epoch:[0/2](316700/4588595) loss:2.461 lr:0.0000100 epoch_Time:27045.0min: [2024-01-04 03:04:49,398][model8_pretrain.py][INFO] Epoch:[0/2](316700/4588595) loss:3.442 lr:0.0000100 epoch_Time:27045.0min: [2024-01-04 03:04:49,397][model8_pretrain.py][INFO] Epoch:[0/2](316700/4588595) loss:3.557 lr:0.0000100 
epoch_Time:27045.0min: [2024-01-04 03:04:49,398][model8_pretrain.py][INFO] Epoch:[0/2](316700/4588595) loss:2.617 lr:0.0000100 epoch_Time:27045.0min: [2024-01-04 03:04:49,398][model8_pretrain.py][INFO] Epoch:[0/2](316700/4588595) loss:2.862 lr:0.0000100 epoch_Time:27045.0min: [2024-01-04 03:05:26,326][model8_pretrain.py][INFO] Epoch:[0/2](316800/4588595) loss:3.056 lr:0.0000100 epoch_Time:27045.0min: [2024-01-04 03:05:26,326][model8_pretrain.py][INFO] Epoch:[0/2](316800/4588595) loss:3.287 lr:0.0000100 epoch_Time:27045.0min: [2024-01-04 03:05:26,326][model8_pretrain.py][INFO] Epoch:[0/2](316800/4588595) loss:3.602 lr:0.0000100 epoch_Time:27045.0min: [2024-01-04 03:05:26,326][model8_pretrain.py][INFO] Epoch:[0/2](316800/4588595) loss:3.170 lr:0.0000100 epoch_Time:27045.0min: [2024-01-04 03:05:26,326][model8_pretrain.py][INFO] Epoch:[0/2](316800/4588595) loss:2.824 lr:0.0000100 epoch_Time:27045.0min: [2024-01-04 03:05:26,326][model8_pretrain.py][INFO] Epoch:[0/2](316800/4588595) loss:3.056 lr:0.0000100 epoch_Time:27045.0min: [2024-01-04 03:05:26,326][model8_pretrain.py][INFO] Epoch:[0/2](316800/4588595) loss:2.866 lr:0.0000100 epoch_Time:27045.0min: [2024-01-04 03:05:26,326][model8_pretrain.py][INFO] Epoch:[0/2](316800/4588595) loss:2.554 lr:0.0000100 epoch_Time:27045.0min: [2024-01-04 03:06:03,254][model8_pretrain.py][INFO] Epoch:[0/2](316900/4588595) loss:2.481 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:06:03,254][model8_pretrain.py][INFO] Epoch:[0/2](316900/4588595) loss:3.016 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:06:03,254][model8_pretrain.py][INFO] Epoch:[0/2](316900/4588595) loss:3.085 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:06:03,254][model8_pretrain.py][INFO] Epoch:[0/2](316900/4588595) loss:2.897 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:06:03,254][model8_pretrain.py][INFO] Epoch:[0/2](316900/4588595) loss:2.458 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:06:03,254][model8_pretrain.py][INFO] 
Epoch:[0/2](316900/4588595) loss:2.995 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:06:03,254][model8_pretrain.py][INFO] Epoch:[0/2](316900/4588595) loss:2.792 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:06:03,255][model8_pretrain.py][INFO] Epoch:[0/2](316900/4588595) loss:2.643 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:06:40,195][model8_pretrain.py][INFO] Epoch:[0/2](317000/4588595) loss:3.326 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:06:40,195][model8_pretrain.py][INFO] Epoch:[0/2](317000/4588595) loss:2.908 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:06:40,195][model8_pretrain.py][INFO] Epoch:[0/2](317000/4588595) loss:3.217 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:06:40,195][model8_pretrain.py][INFO] Epoch:[0/2](317000/4588595) loss:2.890 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:06:40,195][model8_pretrain.py][INFO] Epoch:[0/2](317000/4588595) loss:3.003 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:06:40,195][model8_pretrain.py][INFO] Epoch:[0/2](317000/4588595) loss:2.763 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:06:40,195][model8_pretrain.py][INFO] Epoch:[0/2](317000/4588595) loss:2.402 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:06:40,195][model8_pretrain.py][INFO] Epoch:[0/2](317000/4588595) loss:3.058 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:07:17,128][model8_pretrain.py][INFO] Epoch:[0/2](317100/4588595) loss:3.062 lr:0.0000100 epoch_Time:27042.0min: [2024-01-04 03:07:17,128][model8_pretrain.py][INFO] Epoch:[0/2](317100/4588595) loss:2.673 lr:0.0000100 epoch_Time:27042.0min: [2024-01-04 03:07:17,128][model8_pretrain.py][INFO] Epoch:[0/2](317100/4588595) loss:2.879 lr:0.0000100 epoch_Time:27042.0min: [2024-01-04 03:07:17,128][model8_pretrain.py][INFO] Epoch:[0/2](317100/4588595) loss:2.521 lr:0.0000100 epoch_Time:27042.0min: [2024-01-04 03:07:17,128][model8_pretrain.py][INFO] Epoch:[0/2](317100/4588595) loss:2.949 lr:0.0000100 epoch_Time:27042.0min: [2024-01-04 
03:07:17,128][model8_pretrain.py][INFO] Epoch:[0/2](317100/4588595) loss:3.304 lr:0.0000100 epoch_Time:27042.0min: [2024-01-04 03:07:17,128][model8_pretrain.py][INFO] Epoch:[0/2](317100/4588595) loss:3.232 lr:0.0000100 epoch_Time:27042.0min: [2024-01-04 03:07:17,128][model8_pretrain.py][INFO] Epoch:[0/2](317100/4588595) loss:2.853 lr:0.0000100 epoch_Time:27042.0min: [2024-01-04 03:08:04,268][model8_pretrain.py][INFO] Epoch:[0/2](317200/4588595) loss:3.054 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:08:04,268][model8_pretrain.py][INFO] Epoch:[0/2](317200/4588595) loss:2.854 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:08:04,268][model8_pretrain.py][INFO] Epoch:[0/2](317200/4588595) loss:3.124 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:08:04,268][model8_pretrain.py][INFO] Epoch:[0/2](317200/4588595) loss:2.809 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:08:04,268][model8_pretrain.py][INFO] Epoch:[0/2](317200/4588595) loss:2.759 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:08:04,268][model8_pretrain.py][INFO] Epoch:[0/2](317200/4588595) loss:3.197 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:08:04,268][model8_pretrain.py][INFO] Epoch:[0/2](317200/4588595) loss:2.702 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:08:04,268][model8_pretrain.py][INFO] Epoch:[0/2](317200/4588595) loss:2.845 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:08:41,215][model8_pretrain.py][INFO] Epoch:[0/2](317300/4588595) loss:3.374 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:08:41,215][model8_pretrain.py][INFO] Epoch:[0/2](317300/4588595) loss:2.968 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:08:41,215][model8_pretrain.py][INFO] Epoch:[0/2](317300/4588595) loss:2.708 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:08:41,215][model8_pretrain.py][INFO] Epoch:[0/2](317300/4588595) loss:2.843 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:08:41,215][model8_pretrain.py][INFO] Epoch:[0/2](317300/4588595) loss:3.057 lr:0.0000100 
epoch_Time:27043.0min: [2024-01-04 03:08:41,215][model8_pretrain.py][INFO] Epoch:[0/2](317300/4588595) loss:2.601 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:08:41,215][model8_pretrain.py][INFO] Epoch:[0/2](317300/4588595) loss:2.818 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:08:41,215][model8_pretrain.py][INFO] Epoch:[0/2](317300/4588595) loss:3.255 lr:0.0000100 epoch_Time:27043.0min: [2024-01-04 03:09:18,167][model8_pretrain.py][INFO] Epoch:[0/2](317400/4588595) loss:3.315 lr:0.0000100 epoch_Time:27042.0min: [2024-01-04 03:09:18,167][model8_pretrain.py][INFO] Epoch:[0/2](317400/4588595) loss:3.015 lr:0.0000100 epoch_Time:27042.0min: [2024-01-04 03:09:18,167][model8_pretrain.py][INFO] Epoch:[0/2](317400/4588595) loss:3.085 lr:0.0000100 epoch_Time:27042.0min: [2024-01-04 03:09:18,167][model8_pretrain.py][INFO] Epoch:[0/2](317400/4588595) loss:3.391 lr:0.0000100 epoch_Time:27042.0min: [2024-01-04 03:09:18,167][model8_pretrain.py][INFO] Epoch:[0/2](317400/4588595) loss:2.805 lr:0.0000100 epoch_Time:27042.0min: [2024-01-04 03:09:18,167][model8_pretrain.py][INFO] Epoch:[0/2](317400/4588595) loss:2.994 lr:0.0000100 epoch_Time:27042.0min: [2024-01-04 03:09:18,167][model8_pretrain.py][INFO] Epoch:[0/2](317400/4588595) loss:2.783 lr:0.0000100 epoch_Time:27042.0min: [2024-01-04 03:09:18,167][model8_pretrain.py][INFO] Epoch:[0/2](317400/4588595) loss:2.633 lr:0.0000100 epoch_Time:27042.0min: [2024-01-04 03:09:55,123][model8_pretrain.py][INFO] Epoch:[0/2](317500/4588595) loss:3.150 lr:0.0000100 epoch_Time:27040.0min: [2024-01-04 03:09:55,123][model8_pretrain.py][INFO] Epoch:[0/2](317500/4588595) loss:3.014 lr:0.0000100 epoch_Time:27040.0min: [2024-01-04 03:09:55,123][model8_pretrain.py][INFO] Epoch:[0/2](317500/4588595) loss:2.297 lr:0.0000100 epoch_Time:27040.0min: [2024-01-04 03:09:55,123][model8_pretrain.py][INFO] Epoch:[0/2](317500/4588595) loss:2.788 lr:0.0000100 epoch_Time:27040.0min: [2024-01-04 03:09:55,123][model8_pretrain.py][INFO] 
Epoch:[0/2](317500/4588595) loss:2.797 lr:0.0000100 epoch_Time:27040.0min: [2024-01-04 03:09:55,124][model8_pretrain.py][INFO] Epoch:[0/2](317500/4588595) loss:3.425 lr:0.0000100 epoch_Time:27040.0min: [2024-01-04 03:09:55,124][model8_pretrain.py][INFO] Epoch:[0/2](317500/4588595) loss:3.092 lr:0.0000100 epoch_Time:27040.0min: [2024-01-04 03:09:55,124][model8_pretrain.py][INFO] Epoch:[0/2](317500/4588595) loss:2.774 lr:0.0000100 epoch_Time:27040.0min: [2024-01-04 03:10:32,080][model8_pretrain.py][INFO] Epoch:[0/2](317600/4588595) loss:3.585 lr:0.0000100 epoch_Time:27040.0min: [2024-01-04 03:10:32,080][model8_pretrain.py][INFO] Epoch:[0/2](317600/4588595) loss:2.849 lr:0.0000100 epoch_Time:27040.0min: [2024-01-04 03:10:32,080][model8_pretrain.py][INFO] Epoch:[0/2](317600/4588595) loss:3.344 lr:0.0000100 epoch_Time:27040.0min: [2024-01-04 03:10:32,080][model8_pretrain.py][INFO] Epoch:[0/2](317600/4588595) loss:2.503 lr:0.0000100 epoch_Time:27040.0min: [2024-01-04 03:10:32,080][model8_pretrain.py][INFO] Epoch:[0/2](317600/4588595) loss:3.255 lr:0.0000100 epoch_Time:27040.0min: [2024-01-04 03:10:32,080][model8_pretrain.py][INFO] Epoch:[0/2](317600/4588595) loss:3.102 lr:0.0000100 epoch_Time:27040.0min: [2024-01-04 03:10:32,080][model8_pretrain.py][INFO] Epoch:[0/2](317600/4588595) loss:2.960 lr:0.0000100 epoch_Time:27040.0min: [2024-01-04 03:10:32,080][model8_pretrain.py][INFO] Epoch:[0/2](317600/4588595) loss:2.542 lr:0.0000100 epoch_Time:27040.0min: [2024-01-04 03:11:09,036][model8_pretrain.py][INFO] Epoch:[0/2](317700/4588595) loss:3.250 lr:0.0000100 epoch_Time:27039.0min: [2024-01-04 03:11:09,036][model8_pretrain.py][INFO] Epoch:[0/2](317700/4588595) loss:2.736 lr:0.0000100 epoch_Time:27039.0min: [2024-01-04 03:11:09,036][model8_pretrain.py][INFO] Epoch:[0/2](317700/4588595) loss:2.940 lr:0.0000100 epoch_Time:27039.0min: [2024-01-04 03:11:09,036][model8_pretrain.py][INFO] Epoch:[0/2](317700/4588595) loss:2.765 lr:0.0000100 epoch_Time:27039.0min: [2024-01-04 
03:11:09,036][model8_pretrain.py][INFO] Epoch:[0/2](317700/4588595) loss:2.950 lr:0.0000100 epoch_Time:27039.0min: [2024-01-04 03:11:09,036][model8_pretrain.py][INFO] Epoch:[0/2](317700/4588595) loss:2.615 lr:0.0000100 epoch_Time:27039.0min: [2024-01-04 03:11:09,036][model8_pretrain.py][INFO] Epoch:[0/2](317700/4588595) loss:2.875 lr:0.0000100 epoch_Time:27039.0min: [2024-01-04 03:11:09,037][model8_pretrain.py][INFO] Epoch:[0/2](317700/4588595) loss:2.768 lr:0.0000100 epoch_Time:27039.0min: [2024-01-04 03:11:45,990][model8_pretrain.py][INFO] Epoch:[0/2](317800/4588595) loss:3.264 lr:0.0000100 epoch_Time:27039.0min: [2024-01-04 03:11:45,990][model8_pretrain.py][INFO] Epoch:[0/2](317800/4588595) loss:3.221 lr:0.0000100 epoch_Time:27039.0min: [2024-01-04 03:11:45,990][model8_pretrain.py][INFO] Epoch:[0/2](317800/4588595) loss:2.488 lr:0.0000100 epoch_Time:27039.0min: [2024-01-04 03:11:45,990][model8_pretrain.py][INFO] Epoch:[0/2](317800/4588595) loss:3.421 lr:0.0000100 epoch_Time:27039.0min: [2024-01-04 03:11:45,990][model8_pretrain.py][INFO] Epoch:[0/2](317800/4588595) loss:2.741 lr:0.0000100 epoch_Time:27039.0min: [2024-01-04 03:11:45,990][model8_pretrain.py][INFO] Epoch:[0/2](317800/4588595) loss:3.211 lr:0.0000100 epoch_Time:27039.0min: [2024-01-04 03:11:45,990][model8_pretrain.py][INFO] Epoch:[0/2](317800/4588595) loss:2.687 lr:0.0000100 epoch_Time:27039.0min: [2024-01-04 03:11:45,990][model8_pretrain.py][INFO] Epoch:[0/2](317800/4588595) loss:2.971 lr:0.0000100 epoch_Time:27039.0min: [2024-01-04 03:12:22,967][model8_pretrain.py][INFO] Epoch:[0/2](317900/4588595) loss:3.239 lr:0.0000100 epoch_Time:27037.0min: [2024-01-04 03:12:22,967][model8_pretrain.py][INFO] Epoch:[0/2](317900/4588595) loss:2.727 lr:0.0000100 epoch_Time:27037.0min: [2024-01-04 03:12:22,967][model8_pretrain.py][INFO] Epoch:[0/2](317900/4588595) loss:3.132 lr:0.0000100 epoch_Time:27037.0min: [2024-01-04 03:12:22,967][model8_pretrain.py][INFO] Epoch:[0/2](317900/4588595) loss:3.068 lr:0.0000100 
epoch_Time:27037.0min: [2024-01-04 03:12:22,967][model8_pretrain.py][INFO] Epoch:[0/2](317900/4588595) loss:2.082 lr:0.0000100 epoch_Time:27037.0min: [2024-01-04 03:12:22,967][model8_pretrain.py][INFO] Epoch:[0/2](317900/4588595) loss:3.294 lr:0.0000100 epoch_Time:27037.0min: [2024-01-04 03:12:22,967][model8_pretrain.py][INFO] Epoch:[0/2](317900/4588595) loss:2.819 lr:0.0000100 epoch_Time:27037.0min: [2024-01-04 03:12:22,968][model8_pretrain.py][INFO] Epoch:[0/2](317900/4588595) loss:2.779 lr:0.0000100 epoch_Time:27037.0min: [2024-01-04 03:13:10,301][model8_pretrain.py][INFO] Epoch:[0/2](318000/4588595) loss:3.006 lr:0.0000100 epoch_Time:27039.0min: [2024-01-04 03:13:10,301][model8_pretrain.py][INFO] Epoch:[0/2](318000/4588595) loss:2.726 lr:0.0000100 epoch_Time:27039.0min: [2024-01-04 03:13:10,301][model8_pretrain.py][INFO] Epoch:[0/2](318000/4588595) loss:2.325 lr:0.0000100 epoch_Time:27039.0min: [2024-01-04 03:13:10,301][model8_pretrain.py][INFO] Epoch:[0/2](318000/4588595) loss:2.685 lr:0.0000100 epoch_Time:27039.0min: [2024-01-04 03:13:10,301][model8_pretrain.py][INFO] Epoch:[0/2](318000/4588595) loss:2.687 lr:0.0000100 epoch_Time:27039.0min: [2024-01-04 03:13:10,301][model8_pretrain.py][INFO] Epoch:[0/2](318000/4588595) loss:2.887 lr:0.0000100 epoch_Time:27039.0min: [2024-01-04 03:13:10,301][model8_pretrain.py][INFO] Epoch:[0/2](318000/4588595) loss:2.936 lr:0.0000100 epoch_Time:27039.0min: [2024-01-04 03:13:10,301][model8_pretrain.py][INFO] Epoch:[0/2](318000/4588595) loss:2.586 lr:0.0000100 epoch_Time:27039.0min: [2024-01-04 03:13:47,237][model8_pretrain.py][INFO] Epoch:[0/2](318100/4588595) loss:3.182 lr:0.0000100 epoch_Time:27038.0min: [2024-01-04 03:13:47,237][model8_pretrain.py][INFO] Epoch:[0/2](318100/4588595) loss:2.635 lr:0.0000100 epoch_Time:27038.0min: [2024-01-04 03:13:47,237][model8_pretrain.py][INFO] Epoch:[0/2](318100/4588595) loss:2.975 lr:0.0000100 epoch_Time:27038.0min: [2024-01-04 03:13:47,237][model8_pretrain.py][INFO] 
Epoch:[0/2](318100/4588595) loss:2.689 lr:0.0000100 epoch_Time:27038.0min: [2024-01-04 03:13:47,237][model8_pretrain.py][INFO] Epoch:[0/2](318100/4588595) loss:3.168 lr:0.0000100 epoch_Time:27038.0min: [2024-01-04 03:13:47,237][model8_pretrain.py][INFO] Epoch:[0/2](318100/4588595) loss:3.294 lr:0.0000100 epoch_Time:27038.0min: [2024-01-04 03:13:47,237][model8_pretrain.py][INFO] Epoch:[0/2](318100/4588595) loss:3.091 lr:0.0000100 epoch_Time:27038.0min: [2024-01-04 03:13:47,237][model8_pretrain.py][INFO] Epoch:[0/2](318100/4588595) loss:3.079 lr:0.0000100 epoch_Time:27038.0min: [2024-01-04 03:14:24,179][model8_pretrain.py][INFO] Epoch:[0/2](318200/4588595) loss:3.418 lr:0.0000100 epoch_Time:27037.0min: [2024-01-04 03:14:24,179][model8_pretrain.py][INFO] Epoch:[0/2](318200/4588595) loss:2.742 lr:0.0000100 epoch_Time:27037.0min: [2024-01-04 03:14:24,179][model8_pretrain.py][INFO] Epoch:[0/2](318200/4588595) loss:2.895 lr:0.0000100 epoch_Time:27037.0min: [2024-01-04 03:14:24,179][model8_pretrain.py][INFO] Epoch:[0/2](318200/4588595) loss:3.070 lr:0.0000100 epoch_Time:27037.0min: [2024-01-04 03:14:24,179][model8_pretrain.py][INFO] Epoch:[0/2](318200/4588595) loss:2.863 lr:0.0000100 epoch_Time:27037.0min: [2024-01-04 03:14:24,179][model8_pretrain.py][INFO] Epoch:[0/2](318200/4588595) loss:2.710 lr:0.0000100 epoch_Time:27037.0min: [2024-01-04 03:14:24,179][model8_pretrain.py][INFO] Epoch:[0/2](318200/4588595) loss:3.069 lr:0.0000100 epoch_Time:27037.0min: [2024-01-04 03:14:24,179][model8_pretrain.py][INFO] Epoch:[0/2](318200/4588595) loss:3.326 lr:0.0000100 epoch_Time:27037.0min: [2024-01-04 03:15:01,128][model8_pretrain.py][INFO] Epoch:[0/2](318300/4588595) loss:2.863 lr:0.0000100 epoch_Time:27036.0min: [2024-01-04 03:15:01,128][model8_pretrain.py][INFO] Epoch:[0/2](318300/4588595) loss:3.383 lr:0.0000100 epoch_Time:27036.0min: [2024-01-04 03:15:01,128][model8_pretrain.py][INFO] Epoch:[0/2](318300/4588595) loss:2.981 lr:0.0000100 epoch_Time:27036.0min: [2024-01-04 
03:15:01,128][model8_pretrain.py][INFO] Epoch:[0/2](318300/4588595) loss:2.704 lr:0.0000100 epoch_Time:27036.0min: [2024-01-04 03:15:01,128][model8_pretrain.py][INFO] Epoch:[0/2](318300/4588595) loss:2.927 lr:0.0000100 epoch_Time:27036.0min: [2024-01-04 03:15:01,128][model8_pretrain.py][INFO] Epoch:[0/2](318300/4588595) loss:2.767 lr:0.0000100 epoch_Time:27036.0min: [2024-01-04 03:15:01,128][model8_pretrain.py][INFO] Epoch:[0/2](318300/4588595) loss:2.760 lr:0.0000100 epoch_Time:27036.0min: [2024-01-04 03:15:01,129][model8_pretrain.py][INFO] Epoch:[0/2](318300/4588595) loss:2.800 lr:0.0000100 epoch_Time:27036.0min: [2024-01-04 03:15:38,075][model8_pretrain.py][INFO] Epoch:[0/2](318400/4588595) loss:2.996 lr:0.0000100 epoch_Time:27036.0min: [2024-01-04 03:15:38,075][model8_pretrain.py][INFO] Epoch:[0/2](318400/4588595) loss:2.732 lr:0.0000100 epoch_Time:27036.0min: [2024-01-04 03:15:38,075][model8_pretrain.py][INFO] Epoch:[0/2](318400/4588595) loss:3.193 lr:0.0000100 epoch_Time:27036.0min: [2024-01-04 03:15:38,075][model8_pretrain.py][INFO] Epoch:[0/2](318400/4588595) loss:3.050 lr:0.0000100 epoch_Time:27036.0min: [2024-01-04 03:15:38,075][model8_pretrain.py][INFO] Epoch:[0/2](318400/4588595) loss:3.357 lr:0.0000100 epoch_Time:27036.0min: [2024-01-04 03:15:38,075][model8_pretrain.py][INFO] Epoch:[0/2](318400/4588595) loss:2.670 lr:0.0000100 epoch_Time:27036.0min: [2024-01-04 03:15:38,075][model8_pretrain.py][INFO] Epoch:[0/2](318400/4588595) loss:2.613 lr:0.0000100 epoch_Time:27036.0min: [2024-01-04 03:15:38,075][model8_pretrain.py][INFO] Epoch:[0/2](318400/4588595) loss:3.356 lr:0.0000100 epoch_Time:27036.0min: [2024-01-04 03:16:15,024][model8_pretrain.py][INFO] Epoch:[0/2](318500/4588595) loss:3.092 lr:0.0000100 epoch_Time:27034.0min: [2024-01-04 03:16:15,024][model8_pretrain.py][INFO] Epoch:[0/2](318500/4588595) loss:3.008 lr:0.0000100 epoch_Time:27034.0min: [2024-01-04 03:16:15,024][model8_pretrain.py][INFO] Epoch:[0/2](318500/4588595) loss:2.509 lr:0.0000100 
epoch_Time:27034.0min: [2024-01-04 03:16:15,024][model8_pretrain.py][INFO] Epoch:[0/2](318500/4588595) loss:2.921 lr:0.0000100 epoch_Time:27034.0min: [2024-01-04 03:16:15,024][model8_pretrain.py][INFO] Epoch:[0/2](318500/4588595) loss:3.028 lr:0.0000100 epoch_Time:27034.0min: [2024-01-04 03:16:15,024][model8_pretrain.py][INFO] Epoch:[0/2](318500/4588595) loss:2.647 lr:0.0000100 epoch_Time:27034.0min: [2024-01-04 03:16:15,024][model8_pretrain.py][INFO] Epoch:[0/2](318500/4588595) loss:2.935 lr:0.0000100 epoch_Time:27034.0min: [2024-01-04 03:16:15,024][model8_pretrain.py][INFO] Epoch:[0/2](318500/4588595) loss:2.886 lr:0.0000100 epoch_Time:27034.0min: [2024-01-04 03:16:51,974][model8_pretrain.py][INFO] Epoch:[0/2](318600/4588595) loss:2.891 lr:0.0000100 epoch_Time:27033.0min: [2024-01-04 03:16:51,974][model8_pretrain.py][INFO] Epoch:[0/2](318600/4588595) loss:2.760 lr:0.0000100 epoch_Time:27033.0min: [2024-01-04 03:16:51,974][model8_pretrain.py][INFO] Epoch:[0/2](318600/4588595) loss:2.725 lr:0.0000100 epoch_Time:27033.0min: [2024-01-04 03:16:51,974][model8_pretrain.py][INFO] Epoch:[0/2](318600/4588595) loss:3.016 lr:0.0000100 epoch_Time:27033.0min: [2024-01-04 03:16:51,974][model8_pretrain.py][INFO] Epoch:[0/2](318600/4588595) loss:2.891 lr:0.0000100 epoch_Time:27033.0min: [2024-01-04 03:16:51,974][model8_pretrain.py][INFO] Epoch:[0/2](318600/4588595) loss:3.155 lr:0.0000100 epoch_Time:27033.0min: [2024-01-04 03:16:51,974][model8_pretrain.py][INFO] Epoch:[0/2](318600/4588595) loss:2.928 lr:0.0000100 epoch_Time:27033.0min: [2024-01-04 03:16:51,974][model8_pretrain.py][INFO] Epoch:[0/2](318600/4588595) loss:3.045 lr:0.0000100 epoch_Time:27033.0min: [2024-01-04 03:17:28,926][model8_pretrain.py][INFO] Epoch:[0/2](318700/4588595) loss:2.761 lr:0.0000100 epoch_Time:27033.0min: [2024-01-04 03:17:28,926][model8_pretrain.py][INFO] Epoch:[0/2](318700/4588595) loss:2.570 lr:0.0000100 epoch_Time:27033.0min: [2024-01-04 03:17:28,926][model8_pretrain.py][INFO] 
Epoch:[0/2](318700/4588595) loss:2.826 lr:0.0000100 epoch_Time:27033.0min: [2024-01-04 03:17:28,926][model8_pretrain.py][INFO] Epoch:[0/2](318700/4588595) loss:2.635 lr:0.0000100 epoch_Time:27033.0min: [2024-01-04 03:17:28,926][model8_pretrain.py][INFO] Epoch:[0/2](318700/4588595) loss:3.367 lr:0.0000100 epoch_Time:27033.0min: [2024-01-04 03:17:28,926][model8_pretrain.py][INFO] Epoch:[0/2](318700/4588595) loss:2.772 lr:0.0000100 epoch_Time:27033.0min: [2024-01-04 03:17:28,926][model8_pretrain.py][INFO] Epoch:[0/2](318700/4588595) loss:2.990 lr:0.0000100 epoch_Time:27033.0min: [2024-01-04 03:17:28,926][model8_pretrain.py][INFO] Epoch:[0/2](318700/4588595) loss:2.533 lr:0.0000100 epoch_Time:27033.0min: [2024-01-04 03:18:16,089][model8_pretrain.py][INFO] Epoch:[0/2](318800/4588595) loss:2.845 lr:0.0000100 epoch_Time:27034.0min: [2024-01-04 03:18:16,089][model8_pretrain.py][INFO] Epoch:[0/2](318800/4588595) loss:3.152 lr:0.0000100 epoch_Time:27034.0min: [2024-01-04 03:18:16,089][model8_pretrain.py][INFO] Epoch:[0/2](318800/4588595) loss:3.210 lr:0.0000100 epoch_Time:27034.0min: [2024-01-04 03:18:16,089][model8_pretrain.py][INFO] Epoch:[0/2](318800/4588595) loss:2.882 lr:0.0000100 epoch_Time:27034.0min: [2024-01-04 03:18:16,089][model8_pretrain.py][INFO] Epoch:[0/2](318800/4588595) loss:2.679 lr:0.0000100 epoch_Time:27034.0min: [2024-01-04 03:18:16,089][model8_pretrain.py][INFO] Epoch:[0/2](318800/4588595) loss:2.953 lr:0.0000100 epoch_Time:27034.0min: [2024-01-04 03:18:16,090][model8_pretrain.py][INFO] Epoch:[0/2](318800/4588595) loss:3.021 lr:0.0000100 epoch_Time:27034.0min: [2024-01-04 03:18:16,090][model8_pretrain.py][INFO] Epoch:[0/2](318800/4588595) loss:2.235 lr:0.0000100 epoch_Time:27034.0min: [2024-01-04 03:18:53,023][model8_pretrain.py][INFO] Epoch:[0/2](318900/4588595) loss:2.885 lr:0.0000100 epoch_Time:27033.0min: [2024-01-04 03:18:53,023][model8_pretrain.py][INFO] Epoch:[0/2](318900/4588595) loss:3.120 lr:0.0000100 epoch_Time:27033.0min: [2024-01-04 
03:18:53,023][model8_pretrain.py][INFO] Epoch:[0/2](318900/4588595) loss:3.180 lr:0.0000100 epoch_Time:27033.0min: [2024-01-04 03:18:53,023][model8_pretrain.py][INFO] Epoch:[0/2](318900/4588595) loss:3.125 lr:0.0000100 epoch_Time:27033.0min: [2024-01-04 03:18:53,023][model8_pretrain.py][INFO] Epoch:[0/2](318900/4588595) loss:2.921 lr:0.0000100 epoch_Time:27033.0min: [2024-01-04 03:18:53,023][model8_pretrain.py][INFO] Epoch:[0/2](318900/4588595) loss:3.597 lr:0.0000100 epoch_Time:27033.0min: [2024-01-04 03:18:53,023][model8_pretrain.py][INFO] Epoch:[0/2](318900/4588595) loss:2.993 lr:0.0000100 epoch_Time:27033.0min: [2024-01-04 03:18:53,023][model8_pretrain.py][INFO] Epoch:[0/2](318900/4588595) loss:2.803 lr:0.0000100 epoch_Time:27033.0min: [2024-01-04 03:19:29,971][model8_pretrain.py][INFO] Epoch:[0/2](319000/4588595) loss:2.858 lr:0.0000100 epoch_Time:27032.0min: [2024-01-04 03:19:29,971][model8_pretrain.py][INFO] Epoch:[0/2](319000/4588595) loss:3.380 lr:0.0000100 epoch_Time:27032.0min: [2024-01-04 03:19:29,971][model8_pretrain.py][INFO] Epoch:[0/2](319000/4588595) loss:2.829 lr:0.0000100 epoch_Time:27032.0min: [2024-01-04 03:19:29,971][model8_pretrain.py][INFO] Epoch:[0/2](319000/4588595) loss:2.761 lr:0.0000100 epoch_Time:27032.0min: [2024-01-04 03:19:29,971][model8_pretrain.py][INFO] Epoch:[0/2](319000/4588595) loss:2.901 lr:0.0000100 epoch_Time:27032.0min: [2024-01-04 03:19:29,971][model8_pretrain.py][INFO] Epoch:[0/2](319000/4588595) loss:3.021 lr:0.0000100 epoch_Time:27032.0min: [2024-01-04 03:19:29,971][model8_pretrain.py][INFO] Epoch:[0/2](319000/4588595) loss:2.929 lr:0.0000100 epoch_Time:27032.0min: [2024-01-04 03:19:29,972][model8_pretrain.py][INFO] Epoch:[0/2](319000/4588595) loss:2.634 lr:0.0000100 epoch_Time:27032.0min: [2024-01-04 03:20:06,928][model8_pretrain.py][INFO] Epoch:[0/2](319100/4588595) loss:2.876 lr:0.0000100 epoch_Time:27031.0min: [2024-01-04 03:20:06,928][model8_pretrain.py][INFO] Epoch:[0/2](319100/4588595) loss:2.665 lr:0.0000100 
epoch_Time:27031.0min: [2024-01-04 03:20:06,928][model8_pretrain.py][INFO] Epoch:[0/2](319100/4588595) loss:3.084 lr:0.0000100 epoch_Time:27031.0min: [2024-01-04 03:20:06,928][model8_pretrain.py][INFO] Epoch:[0/2](319100/4588595) loss:2.794 lr:0.0000100 epoch_Time:27031.0min: [2024-01-04 03:20:06,928][model8_pretrain.py][INFO] Epoch:[0/2](319100/4588595) loss:3.113 lr:0.0000100 epoch_Time:27031.0min: [2024-01-04 03:20:06,928][model8_pretrain.py][INFO] Epoch:[0/2](319100/4588595) loss:2.461 lr:0.0000100 epoch_Time:27031.0min: [2024-01-04 03:20:06,928][model8_pretrain.py][INFO] Epoch:[0/2](319100/4588595) loss:2.963 lr:0.0000100 epoch_Time:27031.0min: [2024-01-04 03:20:06,928][model8_pretrain.py][INFO] Epoch:[0/2](319100/4588595) loss:2.550 lr:0.0000100 epoch_Time:27031.0min: [2024-01-04 03:20:43,867][model8_pretrain.py][INFO] Epoch:[0/2](319200/4588595) loss:3.098 lr:0.0000100 epoch_Time:27031.0min: [2024-01-04 03:20:43,867][model8_pretrain.py][INFO] Epoch:[0/2](319200/4588595) loss:2.757 lr:0.0000100 epoch_Time:27031.0min: [2024-01-04 03:20:43,867][model8_pretrain.py][INFO] Epoch:[0/2](319200/4588595) loss:3.034 lr:0.0000100 epoch_Time:27031.0min: [2024-01-04 03:20:43,867][model8_pretrain.py][INFO] Epoch:[0/2](319200/4588595) loss:2.605 lr:0.0000100 epoch_Time:27031.0min: [2024-01-04 03:20:43,867][model8_pretrain.py][INFO] Epoch:[0/2](319200/4588595) loss:2.831 lr:0.0000100 epoch_Time:27031.0min: [2024-01-04 03:20:43,867][model8_pretrain.py][INFO] Epoch:[0/2](319200/4588595) loss:2.664 lr:0.0000100 epoch_Time:27031.0min: [2024-01-04 03:20:43,867][model8_pretrain.py][INFO] Epoch:[0/2](319200/4588595) loss:3.423 lr:0.0000100 epoch_Time:27031.0min: [2024-01-04 03:20:43,867][model8_pretrain.py][INFO] Epoch:[0/2](319200/4588595) loss:2.739 lr:0.0000100 epoch_Time:27031.0min: [2024-01-04 03:21:20,821][model8_pretrain.py][INFO] Epoch:[0/2](319300/4588595) loss:2.977 lr:0.0000100 epoch_Time:27030.0min: [2024-01-04 03:21:20,821][model8_pretrain.py][INFO] 
Epoch:[0/2](319300/4588595) loss:2.721 lr:0.0000100 epoch_Time:27030.0min: [2024-01-04 03:21:20,821][model8_pretrain.py][INFO] Epoch:[0/2](319300/4588595) loss:2.868 lr:0.0000100 epoch_Time:27030.0min: [2024-01-04 03:21:20,821][model8_pretrain.py][INFO] Epoch:[0/2](319300/4588595) loss:2.844 lr:0.0000100 epoch_Time:27030.0min: [2024-01-04 03:21:20,821][model8_pretrain.py][INFO] Epoch:[0/2](319300/4588595) loss:2.951 lr:0.0000100 epoch_Time:27030.0min: [2024-01-04 03:21:20,821][model8_pretrain.py][INFO] Epoch:[0/2](319300/4588595) loss:2.320 lr:0.0000100 epoch_Time:27030.0min: [2024-01-04 03:21:20,821][model8_pretrain.py][INFO] Epoch:[0/2](319300/4588595) loss:2.558 lr:0.0000100 epoch_Time:27030.0min: [2024-01-04 03:21:20,821][model8_pretrain.py][INFO] Epoch:[0/2](319300/4588595) loss:3.126 lr:0.0000100 epoch_Time:27030.0min: [2024-01-04 03:21:57,755][model8_pretrain.py][INFO] Epoch:[0/2](319400/4588595) loss:2.770 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:21:57,755][model8_pretrain.py][INFO] Epoch:[0/2](319400/4588595) loss:2.091 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:21:57,755][model8_pretrain.py][INFO] Epoch:[0/2](319400/4588595) loss:2.680 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:21:57,755][model8_pretrain.py][INFO] Epoch:[0/2](319400/4588595) loss:2.753 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:21:57,755][model8_pretrain.py][INFO] Epoch:[0/2](319400/4588595) loss:3.382 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:21:57,755][model8_pretrain.py][INFO] Epoch:[0/2](319400/4588595) loss:3.224 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:21:57,755][model8_pretrain.py][INFO] Epoch:[0/2](319400/4588595) loss:3.322 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:21:57,755][model8_pretrain.py][INFO] Epoch:[0/2](319400/4588595) loss:3.149 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:22:34,689][model8_pretrain.py][INFO] Epoch:[0/2](319500/4588595) loss:2.536 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 
03:22:34,689][model8_pretrain.py][INFO] Epoch:[0/2](319500/4588595) loss:2.440 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:22:34,689][model8_pretrain.py][INFO] Epoch:[0/2](319500/4588595) loss:2.323 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:22:34,689][model8_pretrain.py][INFO] Epoch:[0/2](319500/4588595) loss:3.442 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:22:34,689][model8_pretrain.py][INFO] Epoch:[0/2](319500/4588595) loss:3.297 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:22:34,689][model8_pretrain.py][INFO] Epoch:[0/2](319500/4588595) loss:3.151 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:22:34,690][model8_pretrain.py][INFO] Epoch:[0/2](319500/4588595) loss:2.973 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:22:34,690][model8_pretrain.py][INFO] Epoch:[0/2](319500/4588595) loss:3.015 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:23:21,817][model8_pretrain.py][INFO] Epoch:[0/2](319600/4588595) loss:3.495 lr:0.0000100 epoch_Time:27029.0min: [2024-01-04 03:23:21,817][model8_pretrain.py][INFO] Epoch:[0/2](319600/4588595) loss:3.307 lr:0.0000100 epoch_Time:27029.0min: [2024-01-04 03:23:21,817][model8_pretrain.py][INFO] Epoch:[0/2](319600/4588595) loss:2.765 lr:0.0000100 epoch_Time:27029.0min: [2024-01-04 03:23:21,817][model8_pretrain.py][INFO] Epoch:[0/2](319600/4588595) loss:3.277 lr:0.0000100 epoch_Time:27029.0min: [2024-01-04 03:23:21,817][model8_pretrain.py][INFO] Epoch:[0/2](319600/4588595) loss:3.341 lr:0.0000100 epoch_Time:27029.0min: [2024-01-04 03:23:21,817][model8_pretrain.py][INFO] Epoch:[0/2](319600/4588595) loss:3.524 lr:0.0000100 epoch_Time:27029.0min: [2024-01-04 03:23:21,817][model8_pretrain.py][INFO] Epoch:[0/2](319600/4588595) loss:2.705 lr:0.0000100 epoch_Time:27029.0min: [2024-01-04 03:23:21,817][model8_pretrain.py][INFO] Epoch:[0/2](319600/4588595) loss:3.145 lr:0.0000100 epoch_Time:27029.0min: [2024-01-04 03:23:58,755][model8_pretrain.py][INFO] Epoch:[0/2](319700/4588595) loss:3.077 lr:0.0000100 
epoch_Time:27028.0min: [2024-01-04 03:23:58,755][model8_pretrain.py][INFO] Epoch:[0/2](319700/4588595) loss:2.819 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:23:58,755][model8_pretrain.py][INFO] Epoch:[0/2](319700/4588595) loss:2.940 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:23:58,755][model8_pretrain.py][INFO] Epoch:[0/2](319700/4588595) loss:2.814 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:23:58,755][model8_pretrain.py][INFO] Epoch:[0/2](319700/4588595) loss:2.218 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:23:58,755][model8_pretrain.py][INFO] Epoch:[0/2](319700/4588595) loss:3.429 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:23:58,755][model8_pretrain.py][INFO] Epoch:[0/2](319700/4588595) loss:3.006 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:23:58,755][model8_pretrain.py][INFO] Epoch:[0/2](319700/4588595) loss:3.465 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:24:35,698][model8_pretrain.py][INFO] Epoch:[0/2](319800/4588595) loss:2.830 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:24:35,698][model8_pretrain.py][INFO] Epoch:[0/2](319800/4588595) loss:2.704 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:24:35,698][model8_pretrain.py][INFO] Epoch:[0/2](319800/4588595) loss:2.803 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:24:35,698][model8_pretrain.py][INFO] Epoch:[0/2](319800/4588595) loss:3.320 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:24:35,698][model8_pretrain.py][INFO] Epoch:[0/2](319800/4588595) loss:3.079 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:24:35,698][model8_pretrain.py][INFO] Epoch:[0/2](319800/4588595) loss:3.353 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:24:35,698][model8_pretrain.py][INFO] Epoch:[0/2](319800/4588595) loss:2.775 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:24:35,698][model8_pretrain.py][INFO] Epoch:[0/2](319800/4588595) loss:2.765 lr:0.0000100 epoch_Time:27028.0min: [2024-01-04 03:25:12,641][model8_pretrain.py][INFO] 
Epoch:[0/2](319900/4588595) loss:3.248 lr:0.0000100 epoch_Time:27027.0min: [2024-01-04 03:25:12,641][model8_pretrain.py][INFO] Epoch:[0/2](319900/4588595) loss:2.769 lr:0.0000100 epoch_Time:27027.0min: [2024-01-04 03:25:12,641][model8_pretrain.py][INFO] Epoch:[0/2](319900/4588595) loss:3.503 lr:0.0000100 epoch_Time:27027.0min: [2024-01-04 03:25:12,642][model8_pretrain.py][INFO] Epoch:[0/2](319900/4588595) loss:2.563 lr:0.0000100 epoch_Time:27027.0min: [2024-01-04 03:25:12,642][model8_pretrain.py][INFO] Epoch:[0/2](319900/4588595) loss:2.526 lr:0.0000100 epoch_Time:27027.0min: [2024-01-04 03:25:12,641][model8_pretrain.py][INFO] Epoch:[0/2](319900/4588595) loss:3.033 lr:0.0000100 epoch_Time:27027.0min: [2024-01-04 03:25:12,642][model8_pretrain.py][INFO] Epoch:[0/2](319900/4588595) loss:2.881 lr:0.0000100 epoch_Time:27027.0min: [2024-01-04 03:25:12,642][model8_pretrain.py][INFO] Epoch:[0/2](319900/4588595) loss:3.226 lr:0.0000100 epoch_Time:27027.0min: [2024-01-04 03:25:49,583][model8_pretrain.py][INFO] Epoch:[0/2](320000/4588595) loss:3.198 lr:0.0000100 epoch_Time:27025.0min: [2024-01-04 03:25:49,583][model8_pretrain.py][INFO] Epoch:[0/2](320000/4588595) loss:3.036 lr:0.0000100 epoch_Time:27025.0min: [2024-01-04 03:25:49,583][model8_pretrain.py][INFO] Epoch:[0/2](320000/4588595) loss:3.100 lr:0.0000100 epoch_Time:27025.0min: [2024-01-04 03:25:49,583][model8_pretrain.py][INFO] Epoch:[0/2](320000/4588595) loss:2.732 lr:0.0000100 epoch_Time:27025.0min: [2024-01-04 03:25:49,583][model8_pretrain.py][INFO] Epoch:[0/2](320000/4588595) loss:2.857 lr:0.0000100 epoch_Time:27025.0min: [2024-01-04 03:25:49,583][model8_pretrain.py][INFO] Epoch:[0/2](320000/4588595) loss:2.503 lr:0.0000100 epoch_Time:27025.0min: [2024-01-04 03:25:49,583][model8_pretrain.py][INFO] Epoch:[0/2](320000/4588595) loss:2.895 lr:0.0000100 epoch_Time:27025.0min: [2024-01-04 03:25:49,583][model8_pretrain.py][INFO] Epoch:[0/2](320000/4588595) loss:3.275 lr:0.0000100 epoch_Time:27025.0min: [2024-01-04 
03:26:26,524][model8_pretrain.py][INFO] Epoch:[0/2](320100/4588595) loss:3.047 lr:0.0000100 epoch_Time:27025.0min: [2024-01-04 03:26:26,524][model8_pretrain.py][INFO] Epoch:[0/2](320100/4588595) loss:2.843 lr:0.0000100 epoch_Time:27025.0min: [2024-01-04 03:26:26,524][model8_pretrain.py][INFO] Epoch:[0/2](320100/4588595) loss:3.196 lr:0.0000100 epoch_Time:27025.0min: [2024-01-04 03:26:26,524][model8_pretrain.py][INFO] Epoch:[0/2](320100/4588595) loss:2.305 lr:0.0000100 epoch_Time:27025.0min: [2024-01-04 03:26:26,524][model8_pretrain.py][INFO] Epoch:[0/2](320100/4588595) loss:3.243 lr:0.0000100 epoch_Time:27025.0min: [2024-01-04 03:26:26,524][model8_pretrain.py][INFO] Epoch:[0/2](320100/4588595) loss:2.828 lr:0.0000100 epoch_Time:27025.0min: [2024-01-04 03:26:26,524][model8_pretrain.py][INFO] Epoch:[0/2](320100/4588595) loss:3.036 lr:0.0000100 epoch_Time:27025.0min: [2024-01-04 03:26:26,524][model8_pretrain.py][INFO] Epoch:[0/2](320100/4588595) loss:3.268 lr:0.0000100 epoch_Time:27025.0min: [2024-01-04 03:27:03,464][model8_pretrain.py][INFO] Epoch:[0/2](320200/4588595) loss:3.207 lr:0.0000100 epoch_Time:27024.0min: [2024-01-04 03:27:03,464][model8_pretrain.py][INFO] Epoch:[0/2](320200/4588595) loss:3.208 lr:0.0000100 epoch_Time:27024.0min: [2024-01-04 03:27:03,464][model8_pretrain.py][INFO] Epoch:[0/2](320200/4588595) loss:2.694 lr:0.0000100 epoch_Time:27024.0min: [2024-01-04 03:27:03,464][model8_pretrain.py][INFO] Epoch:[0/2](320200/4588595) loss:2.703 lr:0.0000100 epoch_Time:27024.0min: [2024-01-04 03:27:03,464][model8_pretrain.py][INFO] Epoch:[0/2](320200/4588595) loss:3.201 lr:0.0000100 epoch_Time:27024.0min: [2024-01-04 03:27:03,464][model8_pretrain.py][INFO] Epoch:[0/2](320200/4588595) loss:3.408 lr:0.0000100 epoch_Time:27024.0min: [2024-01-04 03:27:03,464][model8_pretrain.py][INFO] Epoch:[0/2](320200/4588595) loss:2.445 lr:0.0000100 epoch_Time:27024.0min: [2024-01-04 03:27:03,465][model8_pretrain.py][INFO] Epoch:[0/2](320200/4588595) loss:2.633 lr:0.0000100 
epoch_Time:27024.0min: [2024-01-04 03:27:40,409][model8_pretrain.py][INFO] Epoch:[0/2](320300/4588595) loss:3.020 lr:0.0000100 epoch_Time:27024.0min: [2024-01-04 03:27:40,409][model8_pretrain.py][INFO] Epoch:[0/2](320300/4588595) loss:2.828 lr:0.0000100 epoch_Time:27024.0min: [2024-01-04 03:27:40,409][model8_pretrain.py][INFO] Epoch:[0/2](320300/4588595) loss:2.275 lr:0.0000100 epoch_Time:27024.0min: [2024-01-04 03:27:40,409][model8_pretrain.py][INFO] Epoch:[0/2](320300/4588595) loss:2.528 lr:0.0000100 epoch_Time:27024.0min: [2024-01-04 03:27:40,409][model8_pretrain.py][INFO] Epoch:[0/2](320300/4588595) loss:2.989 lr:0.0000100 epoch_Time:27024.0min: [2024-01-04 03:27:40,409][model8_pretrain.py][INFO] Epoch:[0/2](320300/4588595) loss:3.169 lr:0.0000100 epoch_Time:27024.0min: [2024-01-04 03:27:40,409][model8_pretrain.py][INFO] Epoch:[0/2](320300/4588595) loss:3.261 lr:0.0000100 epoch_Time:27024.0min: [2024-01-04 03:27:40,409][model8_pretrain.py][INFO] Epoch:[0/2](320300/4588595) loss:2.561 lr:0.0000100 epoch_Time:27024.0min: [2024-01-04 03:28:27,418][model8_pretrain.py][INFO] Epoch:[0/2](320400/4588595) loss:2.956 lr:0.0000100 epoch_Time:27025.0min: [2024-01-04 03:28:27,419][model8_pretrain.py][INFO] Epoch:[0/2](320400/4588595) loss:2.522 lr:0.0000100 epoch_Time:27025.0min: [2024-01-04 03:28:27,419][model8_pretrain.py][INFO] Epoch:[0/2](320400/4588595) loss:3.031 lr:0.0000100 epoch_Time:27025.0min: [2024-01-04 03:28:27,419][model8_pretrain.py][INFO] Epoch:[0/2](320400/4588595) loss:2.326 lr:0.0000100 epoch_Time:27025.0min: [2024-01-04 03:28:27,419][model8_pretrain.py][INFO] Epoch:[0/2](320400/4588595) loss:2.713 lr:0.0000100 epoch_Time:27025.0min: [2024-01-04 03:28:27,419][model8_pretrain.py][INFO] Epoch:[0/2](320400/4588595) loss:2.525 lr:0.0000100 epoch_Time:27025.0min: [2024-01-04 03:28:27,419][model8_pretrain.py][INFO] Epoch:[0/2](320400/4588595) loss:2.942 lr:0.0000100 epoch_Time:27025.0min: [2024-01-04 03:28:27,420][model8_pretrain.py][INFO] 
Epoch:[0/2](320400/4588595) loss:3.029 lr:0.0000100 epoch_Time:27025.0min: [2024-01-04 03:29:04,359][model8_pretrain.py][INFO] Epoch:[0/2](320500/4588595) loss:2.438 lr:0.0000100 epoch_Time:27024.0min: [2024-01-04 03:29:04,359][model8_pretrain.py][INFO] Epoch:[0/2](320500/4588595) loss:2.749 lr:0.0000100 epoch_Time:27024.0min: [2024-01-04 03:29:04,359][model8_pretrain.py][INFO] Epoch:[0/2](320500/4588595) loss:2.294 lr:0.0000100 epoch_Time:27024.0min: [2024-01-04 03:29:04,359][model8_pretrain.py][INFO] Epoch:[0/2](320500/4588595) loss:3.452 lr:0.0000100 epoch_Time:27024.0min: [2024-01-04 03:29:04,359][model8_pretrain.py][INFO] Epoch:[0/2](320500/4588595) loss:3.040 lr:0.0000100 epoch_Time:27024.0min: [2024-01-04 03:29:04,359][model8_pretrain.py][INFO] Epoch:[0/2](320500/4588595) loss:2.963 lr:0.0000100 epoch_Time:27024.0min: [2024-01-04 03:29:04,359][model8_pretrain.py][INFO] Epoch:[0/2](320500/4588595) loss:2.909 lr:0.0000100 epoch_Time:27024.0min: [2024-01-04 03:29:04,360][model8_pretrain.py][INFO] Epoch:[0/2](320500/4588595) loss:2.815 lr:0.0000100 epoch_Time:27024.0min: [2024-01-04 03:29:41,305][model8_pretrain.py][INFO] Epoch:[0/2](320600/4588595) loss:3.335 lr:0.0000100 epoch_Time:27023.0min: [2024-01-04 03:29:41,305][model8_pretrain.py][INFO] Epoch:[0/2](320600/4588595) loss:2.687 lr:0.0000100 epoch_Time:27023.0min: [2024-01-04 03:29:41,305][model8_pretrain.py][INFO] Epoch:[0/2](320600/4588595) loss:2.702 lr:0.0000100 epoch_Time:27023.0min: [2024-01-04 03:29:41,305][model8_pretrain.py][INFO] Epoch:[0/2](320600/4588595) loss:2.429 lr:0.0000100 epoch_Time:27023.0min: [2024-01-04 03:29:41,305][model8_pretrain.py][INFO] Epoch:[0/2](320600/4588595) loss:2.662 lr:0.0000100 epoch_Time:27023.0min: [2024-01-04 03:29:41,305][model8_pretrain.py][INFO] Epoch:[0/2](320600/4588595) loss:2.717 lr:0.0000100 epoch_Time:27023.0min: [2024-01-04 03:29:41,305][model8_pretrain.py][INFO] Epoch:[0/2](320600/4588595) loss:2.955 lr:0.0000100 epoch_Time:27023.0min: [2024-01-04 
03:29:41,305][model8_pretrain.py][INFO] Epoch:[0/2](320600/4588595) loss:2.612 lr:0.0000100 epoch_Time:27023.0min: [2024-01-04 03:30:18,260][model8_pretrain.py][INFO] Epoch:[0/2](320700/4588595) loss:2.741 lr:0.0000100 epoch_Time:27022.0min: [2024-01-04 03:30:18,260][model8_pretrain.py][INFO] Epoch:[0/2](320700/4588595) loss:2.823 lr:0.0000100 epoch_Time:27022.0min: [2024-01-04 03:30:18,260][model8_pretrain.py][INFO] Epoch:[0/2](320700/4588595) loss:2.969 lr:0.0000100 epoch_Time:27022.0min: [2024-01-04 03:30:18,260][model8_pretrain.py][INFO] Epoch:[0/2](320700/4588595) loss:2.897 lr:0.0000100 epoch_Time:27022.0min: [2024-01-04 03:30:18,260][model8_pretrain.py][INFO] Epoch:[0/2](320700/4588595) loss:2.867 lr:0.0000100 epoch_Time:27022.0min: [2024-01-04 03:30:18,260][model8_pretrain.py][INFO] Epoch:[0/2](320700/4588595) loss:3.165 lr:0.0000100 epoch_Time:27022.0min: [2024-01-04 03:30:18,261][model8_pretrain.py][INFO] Epoch:[0/2](320700/4588595) loss:3.005 lr:0.0000100 epoch_Time:27022.0min: [2024-01-04 03:30:18,261][model8_pretrain.py][INFO] Epoch:[0/2](320700/4588595) loss:2.931 lr:0.0000100 epoch_Time:27022.0min: [2024-01-04 03:30:55,193][model8_pretrain.py][INFO] Epoch:[0/2](320800/4588595) loss:3.434 lr:0.0000100 epoch_Time:27021.0min: [2024-01-04 03:30:55,193][model8_pretrain.py][INFO] Epoch:[0/2](320800/4588595) loss:3.196 lr:0.0000100 epoch_Time:27021.0min: [2024-01-04 03:30:55,193][model8_pretrain.py][INFO] Epoch:[0/2](320800/4588595) loss:3.064 lr:0.0000100 epoch_Time:27021.0min: [2024-01-04 03:30:55,193][model8_pretrain.py][INFO] Epoch:[0/2](320800/4588595) loss:3.080 lr:0.0000100 epoch_Time:27021.0min: [2024-01-04 03:30:55,193][model8_pretrain.py][INFO] Epoch:[0/2](320800/4588595) loss:2.763 lr:0.0000100 epoch_Time:27021.0min: [2024-01-04 03:30:55,193][model8_pretrain.py][INFO] Epoch:[0/2](320800/4588595) loss:2.807 lr:0.0000100 epoch_Time:27021.0min: [2024-01-04 03:30:55,193][model8_pretrain.py][INFO] Epoch:[0/2](320800/4588595) loss:2.773 lr:0.0000100 
epoch_Time:27021.0min: [2024-01-04 03:30:55,193][model8_pretrain.py][INFO] Epoch:[0/2](320800/4588595) loss:2.633 lr:0.0000100 epoch_Time:27021.0min: [2024-01-04 03:31:32,125][model8_pretrain.py][INFO] Epoch:[0/2](320900/4588595) loss:3.169 lr:0.0000100 epoch_Time:27021.0min: [2024-01-04 03:31:32,125][model8_pretrain.py][INFO] Epoch:[0/2](320900/4588595) loss:3.360 lr:0.0000100 epoch_Time:27021.0min: [2024-01-04 03:31:32,125][model8_pretrain.py][INFO] Epoch:[0/2](320900/4588595) loss:3.177 lr:0.0000100 epoch_Time:27021.0min: [2024-01-04 03:31:32,125][model8_pretrain.py][INFO] Epoch:[0/2](320900/4588595) loss:2.648 lr:0.0000100 epoch_Time:27021.0min: [2024-01-04 03:31:32,125][model8_pretrain.py][INFO] Epoch:[0/2](320900/4588595) loss:2.342 lr:0.0000100 epoch_Time:27021.0min: [2024-01-04 03:31:32,125][model8_pretrain.py][INFO] Epoch:[0/2](320900/4588595) loss:2.421 lr:0.0000100 epoch_Time:27021.0min: [2024-01-04 03:31:32,125][model8_pretrain.py][INFO] Epoch:[0/2](320900/4588595) loss:3.060 lr:0.0000100 epoch_Time:27021.0min: [2024-01-04 03:31:32,125][model8_pretrain.py][INFO] Epoch:[0/2](320900/4588595) loss:3.107 lr:0.0000100 epoch_Time:27021.0min: [2024-01-04 03:32:09,067][model8_pretrain.py][INFO] Epoch:[0/2](321000/4588595) loss:2.621 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:32:09,067][model8_pretrain.py][INFO] Epoch:[0/2](321000/4588595) loss:3.290 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:32:09,067][model8_pretrain.py][INFO] Epoch:[0/2](321000/4588595) loss:2.720 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:32:09,067][model8_pretrain.py][INFO] Epoch:[0/2](321000/4588595) loss:2.668 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:32:09,067][model8_pretrain.py][INFO] Epoch:[0/2](321000/4588595) loss:2.968 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:32:09,067][model8_pretrain.py][INFO] Epoch:[0/2](321000/4588595) loss:2.875 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:32:09,068][model8_pretrain.py][INFO] 
Epoch:[0/2](321000/4588595) loss:2.875 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:32:09,068][model8_pretrain.py][INFO] Epoch:[0/2](321000/4588595) loss:2.970 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:32:46,007][model8_pretrain.py][INFO] Epoch:[0/2](321100/4588595) loss:3.182 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:32:46,007][model8_pretrain.py][INFO] Epoch:[0/2](321100/4588595) loss:3.052 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:32:46,007][model8_pretrain.py][INFO] Epoch:[0/2](321100/4588595) loss:3.077 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:32:46,007][model8_pretrain.py][INFO] Epoch:[0/2](321100/4588595) loss:2.895 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:32:46,007][model8_pretrain.py][INFO] Epoch:[0/2](321100/4588595) loss:2.234 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:32:46,007][model8_pretrain.py][INFO] Epoch:[0/2](321100/4588595) loss:2.491 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:32:46,007][model8_pretrain.py][INFO] Epoch:[0/2](321100/4588595) loss:2.937 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:32:46,007][model8_pretrain.py][INFO] Epoch:[0/2](321100/4588595) loss:2.362 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:33:33,106][model8_pretrain.py][INFO] Epoch:[0/2](321200/4588595) loss:2.837 lr:0.0000100 epoch_Time:27020.0min: [2024-01-04 03:33:33,106][model8_pretrain.py][INFO] Epoch:[0/2](321200/4588595) loss:3.056 lr:0.0000100 epoch_Time:27020.0min: [2024-01-04 03:33:33,106][model8_pretrain.py][INFO] Epoch:[0/2](321200/4588595) loss:2.972 lr:0.0000100 epoch_Time:27020.0min: [2024-01-04 03:33:33,106][model8_pretrain.py][INFO] Epoch:[0/2](321200/4588595) loss:2.701 lr:0.0000100 epoch_Time:27020.0min: [2024-01-04 03:33:33,106][model8_pretrain.py][INFO] Epoch:[0/2](321200/4588595) loss:2.261 lr:0.0000100 epoch_Time:27020.0min: [2024-01-04 03:33:33,106][model8_pretrain.py][INFO] Epoch:[0/2](321200/4588595) loss:2.868 lr:0.0000100 epoch_Time:27020.0min: [2024-01-04 
03:33:33,106][model8_pretrain.py][INFO] Epoch:[0/2](321200/4588595) loss:3.201 lr:0.0000100 epoch_Time:27020.0min: [2024-01-04 03:33:33,106][model8_pretrain.py][INFO] Epoch:[0/2](321200/4588595) loss:2.792 lr:0.0000100 epoch_Time:27020.0min: [2024-01-04 03:34:10,037][model8_pretrain.py][INFO] Epoch:[0/2](321300/4588595) loss:2.937 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:34:10,037][model8_pretrain.py][INFO] Epoch:[0/2](321300/4588595) loss:3.349 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:34:10,037][model8_pretrain.py][INFO] Epoch:[0/2](321300/4588595) loss:2.756 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:34:10,037][model8_pretrain.py][INFO] Epoch:[0/2](321300/4588595) loss:2.693 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:34:10,037][model8_pretrain.py][INFO] Epoch:[0/2](321300/4588595) loss:3.004 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:34:10,037][model8_pretrain.py][INFO] Epoch:[0/2](321300/4588595) loss:3.279 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:34:10,037][model8_pretrain.py][INFO] Epoch:[0/2](321300/4588595) loss:2.967 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:34:10,038][model8_pretrain.py][INFO] Epoch:[0/2](321300/4588595) loss:3.006 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:34:46,949][model8_pretrain.py][INFO] Epoch:[0/2](321400/4588595) loss:2.355 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:34:46,949][model8_pretrain.py][INFO] Epoch:[0/2](321400/4588595) loss:2.873 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:34:46,949][model8_pretrain.py][INFO] Epoch:[0/2](321400/4588595) loss:3.174 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:34:46,949][model8_pretrain.py][INFO] Epoch:[0/2](321400/4588595) loss:2.910 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:34:46,949][model8_pretrain.py][INFO] Epoch:[0/2](321400/4588595) loss:3.159 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:34:46,949][model8_pretrain.py][INFO] Epoch:[0/2](321400/4588595) loss:2.927 lr:0.0000100 
epoch_Time:27019.0min: [2024-01-04 03:34:46,949][model8_pretrain.py][INFO] Epoch:[0/2](321400/4588595) loss:3.381 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:34:46,949][model8_pretrain.py][INFO] Epoch:[0/2](321400/4588595) loss:2.630 lr:0.0000100 epoch_Time:27019.0min: [2024-01-04 03:35:23,896][model8_pretrain.py][INFO] Epoch:[0/2](321500/4588595) loss:2.660 lr:0.0000100 epoch_Time:27017.0min: [2024-01-04 03:35:23,896][model8_pretrain.py][INFO] Epoch:[0/2](321500/4588595) loss:2.913 lr:0.0000100 epoch_Time:27017.0min: [2024-01-04 03:35:23,896][model8_pretrain.py][INFO] Epoch:[0/2](321500/4588595) loss:3.274 lr:0.0000100 epoch_Time:27017.0min: [2024-01-04 03:35:23,896][model8_pretrain.py][INFO] Epoch:[0/2](321500/4588595) loss:3.325 lr:0.0000100 epoch_Time:27017.0min: [2024-01-04 03:35:23,896][model8_pretrain.py][INFO] Epoch:[0/2](321500/4588595) loss:3.233 lr:0.0000100 epoch_Time:27018.0min: [2024-01-04 03:35:23,896][model8_pretrain.py][INFO] Epoch:[0/2](321500/4588595) loss:2.752 lr:0.0000100 epoch_Time:27017.0min: [2024-01-04 03:35:23,896][model8_pretrain.py][INFO] Epoch:[0/2](321500/4588595) loss:2.792 lr:0.0000100 epoch_Time:27017.0min: [2024-01-04 03:35:23,896][model8_pretrain.py][INFO] Epoch:[0/2](321500/4588595) loss:2.927 lr:0.0000100 epoch_Time:27017.0min: [2024-01-04 03:36:00,831][model8_pretrain.py][INFO] Epoch:[0/2](321600/4588595) loss:3.083 lr:0.0000100 epoch_Time:27016.0min: [2024-01-04 03:36:00,831][model8_pretrain.py][INFO] Epoch:[0/2](321600/4588595) loss:2.446 lr:0.0000100 epoch_Time:27016.0min: [2024-01-04 03:36:00,831][model8_pretrain.py][INFO] Epoch:[0/2](321600/4588595) loss:3.355 lr:0.0000100 epoch_Time:27016.0min: [2024-01-04 03:36:00,831][model8_pretrain.py][INFO] Epoch:[0/2](321600/4588595) loss:2.885 lr:0.0000100 epoch_Time:27016.0min: [2024-01-04 03:36:00,831][model8_pretrain.py][INFO] Epoch:[0/2](321600/4588595) loss:3.113 lr:0.0000100 epoch_Time:27016.0min: [2024-01-04 03:36:00,831][model8_pretrain.py][INFO] 
Epoch:[0/2](321600/4588595) loss:2.689 lr:0.0000100 epoch_Time:27016.0min: [2024-01-04 03:36:00,831][model8_pretrain.py][INFO] Epoch:[0/2](321600/4588595) loss:3.123 lr:0.0000100 epoch_Time:27016.0min: [2024-01-04 03:36:00,832][model8_pretrain.py][INFO] Epoch:[0/2](321600/4588595) loss:2.637 lr:0.0000100 epoch_Time:27016.0min: [2024-01-04 03:36:37,766][model8_pretrain.py][INFO] Epoch:[0/2](321700/4588595) loss:2.999 lr:0.0000100 epoch_Time:27016.0min: [2024-01-04 03:36:37,766][model8_pretrain.py][INFO] Epoch:[0/2](321700/4588595) loss:3.386 lr:0.0000100 epoch_Time:27016.0min: [2024-01-04 03:36:37,766][model8_pretrain.py][INFO] Epoch:[0/2](321700/4588595) loss:3.090 lr:0.0000100 epoch_Time:27016.0min: [2024-01-04 03:36:37,766][model8_pretrain.py][INFO] Epoch:[0/2](321700/4588595) loss:2.798 lr:0.0000100 epoch_Time:27016.0min: [2024-01-04 03:36:37,766][model8_pretrain.py][INFO] Epoch:[0/2](321700/4588595) loss:2.693 lr:0.0000100 epoch_Time:27016.0min: [2024-01-04 03:36:37,766][model8_pretrain.py][INFO] Epoch:[0/2](321700/4588595) loss:2.822 lr:0.0000100 epoch_Time:27016.0min: [2024-01-04 03:36:37,767][model8_pretrain.py][INFO] Epoch:[0/2](321700/4588595) loss:2.918 lr:0.0000100 epoch_Time:27016.0min: [2024-01-04 03:36:37,768][model8_pretrain.py][INFO] Epoch:[0/2](321700/4588595) loss:3.145 lr:0.0000100 epoch_Time:27016.0min: [2024-01-04 03:37:14,693][model8_pretrain.py][INFO] Epoch:[0/2](321800/4588595) loss:2.991 lr:0.0000100 epoch_Time:27015.0min: [2024-01-04 03:37:14,693][model8_pretrain.py][INFO] Epoch:[0/2](321800/4588595) loss:2.934 lr:0.0000100 epoch_Time:27015.0min: [2024-01-04 03:37:14,693][model8_pretrain.py][INFO] Epoch:[0/2](321800/4588595) loss:2.909 lr:0.0000100 epoch_Time:27015.0min: [2024-01-04 03:37:14,693][model8_pretrain.py][INFO] Epoch:[0/2](321800/4588595) loss:2.946 lr:0.0000100 epoch_Time:27015.0min: [2024-01-04 03:37:14,693][model8_pretrain.py][INFO] Epoch:[0/2](321800/4588595) loss:3.023 lr:0.0000100 epoch_Time:27015.0min: [2024-01-04 
03:37:14,693][model8_pretrain.py][INFO] Epoch:[0/2](321800/4588595) loss:2.877 lr:0.0000100 epoch_Time:27015.0min: [2024-01-04 03:37:14,693][model8_pretrain.py][INFO] Epoch:[0/2](321800/4588595) loss:3.066 lr:0.0000100 epoch_Time:27015.0min: [2024-01-04 03:37:14,693][model8_pretrain.py][INFO] Epoch:[0/2](321800/4588595) loss:2.481 lr:0.0000100 epoch_Time:27015.0min: [2024-01-04 03:37:51,638][model8_pretrain.py][INFO] Epoch:[0/2](321900/4588595) loss:3.190 lr:0.0000100 epoch_Time:27013.0min: [2024-01-04 03:37:51,638][model8_pretrain.py][INFO] Epoch:[0/2](321900/4588595) loss:3.087 lr:0.0000100 epoch_Time:27014.0min: [2024-01-04 03:37:51,638][model8_pretrain.py][INFO] Epoch:[0/2](321900/4588595) loss:3.293 lr:0.0000100 epoch_Time:27013.0min: [2024-01-04 03:37:51,638][model8_pretrain.py][INFO] Epoch:[0/2](321900/4588595) loss:3.038 lr:0.0000100 epoch_Time:27013.0min: [2024-01-04 03:37:51,638][model8_pretrain.py][INFO] Epoch:[0/2](321900/4588595) loss:2.780 lr:0.0000100 epoch_Time:27013.0min: [2024-01-04 03:37:51,638][model8_pretrain.py][INFO] Epoch:[0/2](321900/4588595) loss:2.680 lr:0.0000100 epoch_Time:27013.0min: [2024-01-04 03:37:51,638][model8_pretrain.py][INFO] Epoch:[0/2](321900/4588595) loss:3.099 lr:0.0000100 epoch_Time:27013.0min: [2024-01-04 03:37:51,639][model8_pretrain.py][INFO] Epoch:[0/2](321900/4588595) loss:3.438 lr:0.0000100 epoch_Time:27013.0min: [2024-01-04 03:38:38,837][model8_pretrain.py][INFO] Epoch:[0/2](322000/4588595) loss:2.421 lr:0.0000100 epoch_Time:27016.0min: [2024-01-04 03:38:38,837][model8_pretrain.py][INFO] Epoch:[0/2](322000/4588595) loss:2.652 lr:0.0000100 epoch_Time:27016.0min: [2024-01-04 03:38:38,837][model8_pretrain.py][INFO] Epoch:[0/2](322000/4588595) loss:2.809 lr:0.0000100 epoch_Time:27016.0min: [2024-01-04 03:38:38,837][model8_pretrain.py][INFO] Epoch:[0/2](322000/4588595) loss:2.617 lr:0.0000100 epoch_Time:27016.0min: [2024-01-04 03:38:38,837][model8_pretrain.py][INFO] Epoch:[0/2](322000/4588595) loss:3.115 lr:0.0000100 
epoch_Time:27016.0min: [2024-01-04 03:38:38,837][model8_pretrain.py][INFO] Epoch:[0/2](322000/4588595) loss:3.153 lr:0.0000100 epoch_Time:27016.0min: [2024-01-04 03:38:38,837][model8_pretrain.py][INFO] Epoch:[0/2](322000/4588595) loss:2.823 lr:0.0000100 epoch_Time:27016.0min: [2024-01-04 03:38:38,837][model8_pretrain.py][INFO] Epoch:[0/2](322000/4588595) loss:2.798 lr:0.0000100 epoch_Time:27016.0min: [2024-01-04 03:39:15,776][model8_pretrain.py][INFO] Epoch:[0/2](322100/4588595) loss:2.125 lr:0.0000100 epoch_Time:27014.0min: [2024-01-04 03:39:15,776][model8_pretrain.py][INFO] Epoch:[0/2](322100/4588595) loss:2.556 lr:0.0000100 epoch_Time:27014.0min: [2024-01-04 03:39:15,776][model8_pretrain.py][INFO] Epoch:[0/2](322100/4588595) loss:3.087 lr:0.0000100 epoch_Time:27014.0min: [2024-01-04 03:39:15,776][model8_pretrain.py][INFO] Epoch:[0/2](322100/4588595) loss:3.125 lr:0.0000100 epoch_Time:27014.0min: [2024-01-04 03:39:15,776][model8_pretrain.py][INFO] Epoch:[0/2](322100/4588595) loss:2.599 lr:0.0000100 epoch_Time:27014.0min: [2024-01-04 03:39:15,777][model8_pretrain.py][INFO] Epoch:[0/2](322100/4588595) loss:2.667 lr:0.0000100 epoch_Time:27014.0min: [2024-01-04 03:39:15,777][model8_pretrain.py][INFO] Epoch:[0/2](322100/4588595) loss:2.930 lr:0.0000100 epoch_Time:27014.0min: [2024-01-04 03:39:15,777][model8_pretrain.py][INFO] Epoch:[0/2](322100/4588595) loss:3.280 lr:0.0000100 epoch_Time:27014.0min: [2024-01-04 03:39:52,740][model8_pretrain.py][INFO] Epoch:[0/2](322200/4588595) loss:2.357 lr:0.0000100 epoch_Time:27013.0min: [2024-01-04 03:39:52,740][model8_pretrain.py][INFO] Epoch:[0/2](322200/4588595) loss:3.024 lr:0.0000100 epoch_Time:27013.0min: [2024-01-04 03:39:52,740][model8_pretrain.py][INFO] Epoch:[0/2](322200/4588595) loss:3.065 lr:0.0000100 epoch_Time:27013.0min: [2024-01-04 03:39:52,740][model8_pretrain.py][INFO] Epoch:[0/2](322200/4588595) loss:2.772 lr:0.0000100 epoch_Time:27013.0min: [2024-01-04 03:39:52,740][model8_pretrain.py][INFO] 
Epoch:[0/2](322200/4588595) loss:2.878 lr:0.0000100 epoch_Time:27013.0min: [2024-01-04 03:39:52,740][model8_pretrain.py][INFO] Epoch:[0/2](322200/4588595) loss:2.643 lr:0.0000100 epoch_Time:27013.0min: [2024-01-04 03:39:52,740][model8_pretrain.py][INFO] Epoch:[0/2](322200/4588595) loss:2.111 lr:0.0000100 epoch_Time:27013.0min: [2024-01-04 03:39:52,740][model8_pretrain.py][INFO] Epoch:[0/2](322200/4588595) loss:2.509 lr:0.0000100 epoch_Time:27013.0min: [2024-01-04 03:40:29,693][model8_pretrain.py][INFO] Epoch:[0/2](322300/4588595) loss:3.173 lr:0.0000100 epoch_Time:27013.0min: [2024-01-04 03:40:29,693][model8_pretrain.py][INFO] Epoch:[0/2](322300/4588595) loss:2.147 lr:0.0000100 epoch_Time:27013.0min: [2024-01-04 03:40:29,693][model8_pretrain.py][INFO] Epoch:[0/2](322300/4588595) loss:2.343 lr:0.0000100 epoch_Time:27013.0min: [2024-01-04 03:40:29,693][model8_pretrain.py][INFO] Epoch:[0/2](322300/4588595) loss:3.398 lr:0.0000100 epoch_Time:27013.0min: [2024-01-04 03:40:29,693][model8_pretrain.py][INFO] Epoch:[0/2](322300/4588595) loss:2.364 lr:0.0000100 epoch_Time:27013.0min: [2024-01-04 03:40:29,693][model8_pretrain.py][INFO] Epoch:[0/2](322300/4588595) loss:2.944 lr:0.0000100 epoch_Time:27013.0min: [2024-01-04 03:40:29,694][model8_pretrain.py][INFO] Epoch:[0/2](322300/4588595) loss:3.358 lr:0.0000100 epoch_Time:27013.0min: [2024-01-04 03:40:29,694][model8_pretrain.py][INFO] Epoch:[0/2](322300/4588595) loss:3.063 lr:0.0000100 epoch_Time:27013.0min: [2024-01-04 03:41:06,640][model8_pretrain.py][INFO] Epoch:[0/2](322400/4588595) loss:3.342 lr:0.0000100 epoch_Time:27012.0min: [2024-01-04 03:41:06,640][model8_pretrain.py][INFO] Epoch:[0/2](322400/4588595) loss:3.068 lr:0.0000100 epoch_Time:27012.0min: [2024-01-04 03:41:06,640][model8_pretrain.py][INFO] Epoch:[0/2](322400/4588595) loss:3.325 lr:0.0000100 epoch_Time:27012.0min: [2024-01-04 03:41:06,640][model8_pretrain.py][INFO] Epoch:[0/2](322400/4588595) loss:2.847 lr:0.0000100 epoch_Time:27012.0min: [2024-01-04 
03:41:06,640][model8_pretrain.py][INFO] Epoch:[0/2](322400/4588595) loss:2.487 lr:0.0000100 epoch_Time:27012.0min: [2024-01-04 03:41:06,640][model8_pretrain.py][INFO] Epoch:[0/2](322400/4588595) loss:3.473 lr:0.0000100 epoch_Time:27012.0min: [2024-01-04 03:41:06,641][model8_pretrain.py][INFO] Epoch:[0/2](322400/4588595) loss:2.847 lr:0.0000100 epoch_Time:27012.0min: [2024-01-04 03:41:06,641][model8_pretrain.py][INFO] Epoch:[0/2](322400/4588595) loss:2.866 lr:0.0000100 epoch_Time:27012.0min: [2024-01-04 03:41:43,613][model8_pretrain.py][INFO] Epoch:[0/2](322500/4588595) loss:3.041 lr:0.0000100 epoch_Time:27011.0min: [2024-01-04 03:41:43,613][model8_pretrain.py][INFO] Epoch:[0/2](322500/4588595) loss:2.822 lr:0.0000100 epoch_Time:27011.0min: [2024-01-04 03:41:43,613][model8_pretrain.py][INFO] Epoch:[0/2](322500/4588595) loss:3.095 lr:0.0000100 epoch_Time:27011.0min: [2024-01-04 03:41:43,613][model8_pretrain.py][INFO] Epoch:[0/2](322500/4588595) loss:2.729 lr:0.0000100 epoch_Time:27011.0min: [2024-01-04 03:41:43,613][model8_pretrain.py][INFO] Epoch:[0/2](322500/4588595) loss:2.854 lr:0.0000100 epoch_Time:27011.0min: [2024-01-04 03:41:43,613][model8_pretrain.py][INFO] Epoch:[0/2](322500/4588595) loss:3.067 lr:0.0000100 epoch_Time:27011.0min: [2024-01-04 03:41:43,613][model8_pretrain.py][INFO] Epoch:[0/2](322500/4588595) loss:2.493 lr:0.0000100 epoch_Time:27011.0min: [2024-01-04 03:41:43,613][model8_pretrain.py][INFO] Epoch:[0/2](322500/4588595) loss:2.692 lr:0.0000100 epoch_Time:27011.0min: [2024-01-04 03:42:20,573][model8_pretrain.py][INFO] Epoch:[0/2](322600/4588595) loss:2.815 lr:0.0000100 epoch_Time:27010.0min: [2024-01-04 03:42:20,573][model8_pretrain.py][INFO] Epoch:[0/2](322600/4588595) loss:3.409 lr:0.0000100 epoch_Time:27010.0min: [2024-01-04 03:42:20,573][model8_pretrain.py][INFO] Epoch:[0/2](322600/4588595) loss:2.883 lr:0.0000100 epoch_Time:27010.0min: [2024-01-04 03:42:20,573][model8_pretrain.py][INFO] Epoch:[0/2](322600/4588595) loss:2.865 lr:0.0000100 
epoch_Time:27010.0min: [2024-01-04 03:42:20,573][model8_pretrain.py][INFO] Epoch:[0/2](322600/4588595) loss:2.424 lr:0.0000100 epoch_Time:27010.0min: [2024-01-04 03:42:20,573][model8_pretrain.py][INFO] Epoch:[0/2](322600/4588595) loss:2.912 lr:0.0000100 epoch_Time:27010.0min: [2024-01-04 03:42:20,573][model8_pretrain.py][INFO] Epoch:[0/2](322600/4588595) loss:3.012 lr:0.0000100 epoch_Time:27010.0min: [2024-01-04 03:42:20,573][model8_pretrain.py][INFO] Epoch:[0/2](322600/4588595) loss:3.022 lr:0.0000100 epoch_Time:27010.0min: [2024-01-04 03:42:57,522][model8_pretrain.py][INFO] Epoch:[0/2](322700/4588595) loss:3.433 lr:0.0000100 epoch_Time:27009.0min: [2024-01-04 03:42:57,522][model8_pretrain.py][INFO] Epoch:[0/2](322700/4588595) loss:2.400 lr:0.0000100 epoch_Time:27009.0min: [2024-01-04 03:42:57,522][model8_pretrain.py][INFO] Epoch:[0/2](322700/4588595) loss:2.813 lr:0.0000100 epoch_Time:27009.0min: [2024-01-04 03:42:57,522][model8_pretrain.py][INFO] Epoch:[0/2](322700/4588595) loss:3.038 lr:0.0000100 epoch_Time:27009.0min: [2024-01-04 03:42:57,522][model8_pretrain.py][INFO] Epoch:[0/2](322700/4588595) loss:2.546 lr:0.0000100 epoch_Time:27009.0min: [2024-01-04 03:42:57,522][model8_pretrain.py][INFO] Epoch:[0/2](322700/4588595) loss:2.891 lr:0.0000100 epoch_Time:27009.0min: [2024-01-04 03:42:57,522][model8_pretrain.py][INFO] Epoch:[0/2](322700/4588595) loss:2.867 lr:0.0000100 epoch_Time:27009.0min: [2024-01-04 03:42:57,522][model8_pretrain.py][INFO] Epoch:[0/2](322700/4588595) loss:3.314 lr:0.0000100 epoch_Time:27009.0min: [2024-01-04 03:43:44,915][model8_pretrain.py][INFO] Epoch:[0/2](322800/4588595) loss:3.285 lr:0.0000100 epoch_Time:27011.0min: [2024-01-04 03:43:44,915][model8_pretrain.py][INFO] Epoch:[0/2](322800/4588595) loss:2.981 lr:0.0000100 epoch_Time:27011.0min: [2024-01-04 03:43:44,915][model8_pretrain.py][INFO] Epoch:[0/2](322800/4588595) loss:3.065 lr:0.0000100 epoch_Time:27011.0min: [2024-01-04 03:43:44,915][model8_pretrain.py][INFO] 
Epoch:[0/2](322800/4588595) loss:2.974 lr:0.0000100 epoch_Time:27011.0min: [2024-01-04 03:43:44,915][model8_pretrain.py][INFO] Epoch:[0/2](322800/4588595) loss:2.750 lr:0.0000100 epoch_Time:27011.0min: [2024-01-04 03:43:44,915][model8_pretrain.py][INFO] Epoch:[0/2](322800/4588595) loss:3.055 lr:0.0000100 epoch_Time:27011.0min: [2024-01-04 03:43:44,915][model8_pretrain.py][INFO] Epoch:[0/2](322800/4588595) loss:3.142 lr:0.0000100 epoch_Time:27011.0min: [2024-01-04 03:43:44,915][model8_pretrain.py][INFO] Epoch:[0/2](322800/4588595) loss:3.215 lr:0.0000100 epoch_Time:27011.0min: [2024-01-04 03:44:21,836][model8_pretrain.py][INFO] Epoch:[0/2](322900/4588595) loss:3.058 lr:0.0000100 epoch_Time:27010.0min: [2024-01-04 03:44:21,836][model8_pretrain.py][INFO] Epoch:[0/2](322900/4588595) loss:2.250 lr:0.0000100 epoch_Time:27010.0min: [2024-01-04 03:44:21,836][model8_pretrain.py][INFO] Epoch:[0/2](322900/4588595) loss:3.177 lr:0.0000100 epoch_Time:27010.0min: [2024-01-04 03:44:21,836][model8_pretrain.py][INFO] Epoch:[0/2](322900/4588595) loss:2.297 lr:0.0000100 epoch_Time:27010.0min: [2024-01-04 03:44:21,836][model8_pretrain.py][INFO] Epoch:[0/2](322900/4588595) loss:2.955 lr:0.0000100 epoch_Time:27010.0min: [2024-01-04 03:44:21,837][model8_pretrain.py][INFO] Epoch:[0/2](322900/4588595) loss:3.300 lr:0.0000100 epoch_Time:27010.0min: [2024-01-04 03:44:21,837][model8_pretrain.py][INFO] Epoch:[0/2](322900/4588595) loss:2.551 lr:0.0000100 epoch_Time:27010.0min: [2024-01-04 03:44:21,837][model8_pretrain.py][INFO] Epoch:[0/2](322900/4588595) loss:2.534 lr:0.0000100 epoch_Time:27010.0min: [2024-01-04 03:44:58,750][model8_pretrain.py][INFO] Epoch:[0/2](323000/4588595) loss:3.138 lr:0.0000100 epoch_Time:27009.0min: [2024-01-04 03:44:58,750][model8_pretrain.py][INFO] Epoch:[0/2](323000/4588595) loss:3.235 lr:0.0000100 epoch_Time:27009.0min: [2024-01-04 03:44:58,750][model8_pretrain.py][INFO] Epoch:[0/2](323000/4588595) loss:2.680 lr:0.0000100 epoch_Time:27009.0min: [2024-01-04 
03:44:58,750][model8_pretrain.py][INFO] Epoch:[0/2](323000/4588595) loss:3.032 lr:0.0000100 epoch_Time:27009.0min: [2024-01-04 03:44:58,750][model8_pretrain.py][INFO] Epoch:[0/2](323000/4588595) loss:3.245 lr:0.0000100 epoch_Time:27009.0min: [2024-01-04 03:44:58,750][model8_pretrain.py][INFO] Epoch:[0/2](323000/4588595) loss:2.372 lr:0.0000100 epoch_Time:27009.0min: [2024-01-04 03:44:58,750][model8_pretrain.py][INFO] Epoch:[0/2](323000/4588595) loss:2.666 lr:0.0000100 epoch_Time:27009.0min: [2024-01-04 03:44:58,750][model8_pretrain.py][INFO] Epoch:[0/2](323000/4588595) loss:2.595 lr:0.0000100 epoch_Time:27009.0min: [2024-01-04 03:45:35,683][model8_pretrain.py][INFO] Epoch:[0/2](323100/4588595) loss:2.877 lr:0.0000100 epoch_Time:27008.0min: [2024-01-04 03:45:35,683][model8_pretrain.py][INFO] Epoch:[0/2](323100/4588595) loss:2.698 lr:0.0000100 epoch_Time:27008.0min: [2024-01-04 03:45:35,683][model8_pretrain.py][INFO] Epoch:[0/2](323100/4588595) loss:2.618 lr:0.0000100 epoch_Time:27008.0min: [2024-01-04 03:45:35,683][model8_pretrain.py][INFO] Epoch:[0/2](323100/4588595) loss:3.018 lr:0.0000100 epoch_Time:27008.0min: [2024-01-04 03:45:35,683][model8_pretrain.py][INFO] Epoch:[0/2](323100/4588595) loss:3.156 lr:0.0000100 epoch_Time:27008.0min: [2024-01-04 03:45:35,683][model8_pretrain.py][INFO] Epoch:[0/2](323100/4588595) loss:3.161 lr:0.0000100 epoch_Time:27008.0min: [2024-01-04 03:45:35,683][model8_pretrain.py][INFO] Epoch:[0/2](323100/4588595) loss:2.707 lr:0.0000100 epoch_Time:27008.0min: [2024-01-04 03:45:35,683][model8_pretrain.py][INFO] Epoch:[0/2](323100/4588595) loss:3.209 lr:0.0000100 epoch_Time:27008.0min: [2024-01-04 03:46:12,616][model8_pretrain.py][INFO] Epoch:[0/2](323200/4588595) loss:2.864 lr:0.0000100 epoch_Time:27007.0min: [2024-01-04 03:46:12,616][model8_pretrain.py][INFO] Epoch:[0/2](323200/4588595) loss:2.643 lr:0.0000100 epoch_Time:27007.0min: [2024-01-04 03:46:12,616][model8_pretrain.py][INFO] Epoch:[0/2](323200/4588595) loss:2.639 lr:0.0000100 
epoch_Time:27007.0min: [2024-01-04 03:46:12,616][model8_pretrain.py][INFO] Epoch:[0/2](323200/4588595) loss:3.034 lr:0.0000100 epoch_Time:27007.0min: [2024-01-04 03:46:12,616][model8_pretrain.py][INFO] Epoch:[0/2](323200/4588595) loss:3.163 lr:0.0000100 epoch_Time:27007.0min: [2024-01-04 03:46:12,616][model8_pretrain.py][INFO] Epoch:[0/2](323200/4588595) loss:3.419 lr:0.0000100 epoch_Time:27007.0min: [2024-01-04 03:46:12,616][model8_pretrain.py][INFO] Epoch:[0/2](323200/4588595) loss:2.863 lr:0.0000100 epoch_Time:27007.0min: [2024-01-04 03:46:12,616][model8_pretrain.py][INFO] Epoch:[0/2](323200/4588595) loss:2.863 lr:0.0000100 epoch_Time:27007.0min: [2024-01-04 03:46:49,545][model8_pretrain.py][INFO] Epoch:[0/2](323300/4588595) loss:2.878 lr:0.0000100 epoch_Time:27006.0min: [2024-01-04 03:46:49,545][model8_pretrain.py][INFO] Epoch:[0/2](323300/4588595) loss:2.689 lr:0.0000100 epoch_Time:27006.0min: [2024-01-04 03:46:49,545][model8_pretrain.py][INFO] Epoch:[0/2](323300/4588595) loss:2.811 lr:0.0000100 epoch_Time:27006.0min: [2024-01-04 03:46:49,545][model8_pretrain.py][INFO] Epoch:[0/2](323300/4588595) loss:2.693 lr:0.0000100 epoch_Time:27006.0min: [2024-01-04 03:46:49,545][model8_pretrain.py][INFO] Epoch:[0/2](323300/4588595) loss:2.798 lr:0.0000100 epoch_Time:27006.0min: [2024-01-04 03:46:49,545][model8_pretrain.py][INFO] Epoch:[0/2](323300/4588595) loss:3.366 lr:0.0000100 epoch_Time:27006.0min: [2024-01-04 03:46:49,545][model8_pretrain.py][INFO] Epoch:[0/2](323300/4588595) loss:2.803 lr:0.0000100 epoch_Time:27006.0min: [2024-01-04 03:46:49,545][model8_pretrain.py][INFO] Epoch:[0/2](323300/4588595) loss:3.011 lr:0.0000100 epoch_Time:27006.0min: [2024-01-04 03:47:26,473][model8_pretrain.py][INFO] Epoch:[0/2](323400/4588595) loss:2.744 lr:0.0000100 epoch_Time:27006.0min: [2024-01-04 03:47:26,473][model8_pretrain.py][INFO] Epoch:[0/2](323400/4588595) loss:2.848 lr:0.0000100 epoch_Time:27006.0min: [2024-01-04 03:47:26,473][model8_pretrain.py][INFO] 
Epoch:[0/2](323400/4588595) loss:3.341 lr:0.0000100 epoch_Time:27006.0min: [2024-01-04 03:47:26,473][model8_pretrain.py][INFO] Epoch:[0/2](323400/4588595) loss:2.994 lr:0.0000100 epoch_Time:27006.0min: [2024-01-04 03:47:26,473][model8_pretrain.py][INFO] Epoch:[0/2](323400/4588595) loss:3.002 lr:0.0000100 epoch_Time:27006.0min: [2024-01-04 03:47:26,473][model8_pretrain.py][INFO] Epoch:[0/2](323400/4588595) loss:3.201 lr:0.0000100 epoch_Time:27006.0min: [2024-01-04 03:47:26,473][model8_pretrain.py][INFO] Epoch:[0/2](323400/4588595) loss:2.622 lr:0.0000100 epoch_Time:27006.0min: [2024-01-04 03:47:26,473][model8_pretrain.py][INFO] Epoch:[0/2](323400/4588595) loss:3.050 lr:0.0000100 epoch_Time:27006.0min: [2024-01-04 03:48:03,402][model8_pretrain.py][INFO] Epoch:[0/2](323500/4588595) loss:2.707 lr:0.0000100 epoch_Time:27004.0min: [2024-01-04 03:48:03,402][model8_pretrain.py][INFO] Epoch:[0/2](323500/4588595) loss:2.023 lr:0.0000100 epoch_Time:27004.0min: [2024-01-04 03:48:03,402][model8_pretrain.py][INFO] Epoch:[0/2](323500/4588595) loss:2.367 lr:0.0000100 epoch_Time:27004.0min: [2024-01-04 03:48:03,402][model8_pretrain.py][INFO] Epoch:[0/2](323500/4588595) loss:2.656 lr:0.0000100 epoch_Time:27004.0min: [2024-01-04 03:48:03,402][model8_pretrain.py][INFO] Epoch:[0/2](323500/4588595) loss:3.021 lr:0.0000100 epoch_Time:27004.0min: [2024-01-04 03:48:03,402][model8_pretrain.py][INFO] Epoch:[0/2](323500/4588595) loss:3.381 lr:0.0000100 epoch_Time:27004.0min: [2024-01-04 03:48:03,402][model8_pretrain.py][INFO] Epoch:[0/2](323500/4588595) loss:3.410 lr:0.0000100 epoch_Time:27004.0min: [2024-01-04 03:48:03,402][model8_pretrain.py][INFO] Epoch:[0/2](323500/4588595) loss:3.056 lr:0.0000100 epoch_Time:27004.0min: [2024-01-04 03:48:50,767][model8_pretrain.py][INFO] Epoch:[0/2](323600/4588595) loss:3.066 lr:0.0000100 epoch_Time:27006.0min: [2024-01-04 03:48:50,767][model8_pretrain.py][INFO] Epoch:[0/2](323600/4588595) loss:2.762 lr:0.0000100 epoch_Time:27006.0min: [2024-01-04 
03:48:50,767][model8_pretrain.py][INFO] Epoch:[0/2](323600/4588595) loss:3.040 lr:0.0000100 epoch_Time:27006.0min: [2024-01-04 03:48:50,767][model8_pretrain.py][INFO] Epoch:[0/2](323600/4588595) loss:3.039 lr:0.0000100 epoch_Time:27006.0min: [2024-01-04 03:48:50,767][model8_pretrain.py][INFO] Epoch:[0/2](323600/4588595) loss:2.386 lr:0.0000100 epoch_Time:27006.0min: [2024-01-04 03:48:50,767][model8_pretrain.py][INFO] Epoch:[0/2](323600/4588595) loss:3.218 lr:0.0000100 epoch_Time:27006.0min: [2024-01-04 03:48:50,767][model8_pretrain.py][INFO] Epoch:[0/2](323600/4588595) loss:3.030 lr:0.0000100 epoch_Time:27006.0min: [2024-01-04 03:48:50,768][model8_pretrain.py][INFO] Epoch:[0/2](323600/4588595) loss:2.597 lr:0.0000100 epoch_Time:27006.0min: [2024-01-04 03:49:27,702][model8_pretrain.py][INFO] Epoch:[0/2](323700/4588595) loss:2.682 lr:0.0000100 epoch_Time:27005.0min: [2024-01-04 03:49:27,701][model8_pretrain.py][INFO] Epoch:[0/2](323700/4588595) loss:3.040 lr:0.0000100 epoch_Time:27005.0min: [2024-01-04 03:49:27,701][model8_pretrain.py][INFO] Epoch:[0/2](323700/4588595) loss:2.302 lr:0.0000100 epoch_Time:27005.0min: [2024-01-04 03:49:27,702][model8_pretrain.py][INFO] Epoch:[0/2](323700/4588595) loss:2.880 lr:0.0000100 epoch_Time:27005.0min: [2024-01-04 03:49:27,702][model8_pretrain.py][INFO] Epoch:[0/2](323700/4588595) loss:3.017 lr:0.0000100 epoch_Time:27005.0min: [2024-01-04 03:49:27,702][model8_pretrain.py][INFO] Epoch:[0/2](323700/4588595) loss:2.835 lr:0.0000100 epoch_Time:27005.0min: [2024-01-04 03:49:27,702][model8_pretrain.py][INFO] Epoch:[0/2](323700/4588595) loss:3.186 lr:0.0000100 epoch_Time:27005.0min: [2024-01-04 03:49:27,702][model8_pretrain.py][INFO] Epoch:[0/2](323700/4588595) loss:3.284 lr:0.0000100 epoch_Time:27005.0min: [2024-01-04 03:50:04,641][model8_pretrain.py][INFO] Epoch:[0/2](323800/4588595) loss:3.029 lr:0.0000100 epoch_Time:27004.0min: [2024-01-04 03:50:04,641][model8_pretrain.py][INFO] Epoch:[0/2](323800/4588595) loss:3.148 lr:0.0000100 
epoch_Time:27004.0min: [2024-01-04 03:50:04,641][model8_pretrain.py][INFO] Epoch:[0/2](323800/4588595) loss:2.818 lr:0.0000100 epoch_Time:27004.0min: [2024-01-04 03:50:04,641][model8_pretrain.py][INFO] Epoch:[0/2](323800/4588595) loss:3.013 lr:0.0000100 epoch_Time:27004.0min: [2024-01-04 03:50:04,641][model8_pretrain.py][INFO] Epoch:[0/2](323800/4588595) loss:3.015 lr:0.0000100 epoch_Time:27004.0min: [2024-01-04 03:50:04,641][model8_pretrain.py][INFO] Epoch:[0/2](323800/4588595) loss:3.208 lr:0.0000100 epoch_Time:27004.0min: [2024-01-04 03:50:04,641][model8_pretrain.py][INFO] Epoch:[0/2](323800/4588595) loss:2.960 lr:0.0000100 epoch_Time:27004.0min: [2024-01-04 03:50:04,641][model8_pretrain.py][INFO] Epoch:[0/2](323800/4588595) loss:2.765 lr:0.0000100 epoch_Time:27004.0min: [2024-01-04 03:50:41,582][model8_pretrain.py][INFO] Epoch:[0/2](323900/4588595) loss:3.471 lr:0.0000100 epoch_Time:27004.0min: [2024-01-04 03:50:41,582][model8_pretrain.py][INFO] Epoch:[0/2](323900/4588595) loss:2.939 lr:0.0000100 epoch_Time:27004.0min: [2024-01-04 03:50:41,582][model8_pretrain.py][INFO] Epoch:[0/2](323900/4588595) loss:2.779 lr:0.0000100 epoch_Time:27004.0min: [2024-01-04 03:50:41,583][model8_pretrain.py][INFO] Epoch:[0/2](323900/4588595) loss:2.776 lr:0.0000100 epoch_Time:27004.0min: [2024-01-04 03:50:41,582][model8_pretrain.py][INFO] Epoch:[0/2](323900/4588595) loss:2.639 lr:0.0000100 epoch_Time:27004.0min: [2024-01-04 03:50:41,583][model8_pretrain.py][INFO] Epoch:[0/2](323900/4588595) loss:3.058 lr:0.0000100 epoch_Time:27004.0min: [2024-01-04 03:50:41,583][model8_pretrain.py][INFO] Epoch:[0/2](323900/4588595) loss:3.157 lr:0.0000100 epoch_Time:27004.0min: [2024-01-04 03:50:41,583][model8_pretrain.py][INFO] Epoch:[0/2](323900/4588595) loss:3.045 lr:0.0000100 epoch_Time:27004.0min: [2024-01-04 03:51:18,522][model8_pretrain.py][INFO] Epoch:[0/2](324000/4588595) loss:2.854 lr:0.0000100 epoch_Time:27003.0min: [2024-01-04 03:51:18,522][model8_pretrain.py][INFO] 
Epoch:[0/2](324000/4588595) loss:2.567 lr:0.0000100 epoch_Time:27003.0min: [2024-01-04 03:51:18,523][model8_pretrain.py][INFO] Epoch:[0/2](324000/4588595) loss:3.022 lr:0.0000100 epoch_Time:27003.0min: [2024-01-04 03:51:18,523][model8_pretrain.py][INFO] Epoch:[0/2](324000/4588595) loss:2.413 lr:0.0000100 epoch_Time:27003.0min: [2024-01-04 03:51:18,523][model8_pretrain.py][INFO] Epoch:[0/2](324000/4588595) loss:3.160 lr:0.0000100 epoch_Time:27003.0min: [2024-01-04 03:51:18,523][model8_pretrain.py][INFO] Epoch:[0/2](324000/4588595) loss:3.062 lr:0.0000100 epoch_Time:27003.0min: [2024-01-04 03:51:18,523][model8_pretrain.py][INFO] Epoch:[0/2](324000/4588595) loss:3.346 lr:0.0000100 epoch_Time:27003.0min: [2024-01-04 03:51:18,523][model8_pretrain.py][INFO] Epoch:[0/2](324000/4588595) loss:3.132 lr:0.0000100 epoch_Time:27003.0min: [2024-01-04 03:51:55,455][model8_pretrain.py][INFO] Epoch:[0/2](324100/4588595) loss:3.088 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:51:55,455][model8_pretrain.py][INFO] Epoch:[0/2](324100/4588595) loss:2.817 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:51:55,455][model8_pretrain.py][INFO] Epoch:[0/2](324100/4588595) loss:2.549 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:51:55,455][model8_pretrain.py][INFO] Epoch:[0/2](324100/4588595) loss:2.959 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:51:55,455][model8_pretrain.py][INFO] Epoch:[0/2](324100/4588595) loss:3.415 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:51:55,455][model8_pretrain.py][INFO] Epoch:[0/2](324100/4588595) loss:3.283 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:51:55,455][model8_pretrain.py][INFO] Epoch:[0/2](324100/4588595) loss:2.971 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:51:55,455][model8_pretrain.py][INFO] Epoch:[0/2](324100/4588595) loss:3.244 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:52:32,399][model8_pretrain.py][INFO] Epoch:[0/2](324200/4588595) loss:3.342 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 
03:52:32,399][model8_pretrain.py][INFO] Epoch:[0/2](324200/4588595) loss:2.473 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:52:32,399][model8_pretrain.py][INFO] Epoch:[0/2](324200/4588595) loss:2.454 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:52:32,399][model8_pretrain.py][INFO] Epoch:[0/2](324200/4588595) loss:3.093 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:52:32,399][model8_pretrain.py][INFO] Epoch:[0/2](324200/4588595) loss:3.616 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:52:32,399][model8_pretrain.py][INFO] Epoch:[0/2](324200/4588595) loss:3.177 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:52:32,399][model8_pretrain.py][INFO] Epoch:[0/2](324200/4588595) loss:2.929 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:52:32,399][model8_pretrain.py][INFO] Epoch:[0/2](324200/4588595) loss:3.046 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:53:09,330][model8_pretrain.py][INFO] Epoch:[0/2](324300/4588595) loss:2.674 lr:0.0000100 epoch_Time:27000.0min: [2024-01-04 03:53:09,330][model8_pretrain.py][INFO] Epoch:[0/2](324300/4588595) loss:2.925 lr:0.0000100 epoch_Time:27000.0min: [2024-01-04 03:53:09,330][model8_pretrain.py][INFO] Epoch:[0/2](324300/4588595) loss:3.019 lr:0.0000100 epoch_Time:27000.0min: [2024-01-04 03:53:09,330][model8_pretrain.py][INFO] Epoch:[0/2](324300/4588595) loss:3.108 lr:0.0000100 epoch_Time:27000.0min: [2024-01-04 03:53:09,330][model8_pretrain.py][INFO] Epoch:[0/2](324300/4588595) loss:3.238 lr:0.0000100 epoch_Time:27000.0min: [2024-01-04 03:53:09,330][model8_pretrain.py][INFO] Epoch:[0/2](324300/4588595) loss:3.266 lr:0.0000100 epoch_Time:27000.0min: [2024-01-04 03:53:09,330][model8_pretrain.py][INFO] Epoch:[0/2](324300/4588595) loss:2.562 lr:0.0000100 epoch_Time:27000.0min: [2024-01-04 03:53:09,330][model8_pretrain.py][INFO] Epoch:[0/2](324300/4588595) loss:3.430 lr:0.0000100 epoch_Time:27000.0min: [2024-01-04 03:53:56,732][model8_pretrain.py][INFO] Epoch:[0/2](324400/4588595) loss:3.241 lr:0.0000100 
epoch_Time:27001.0min: [2024-01-04 03:53:56,732][model8_pretrain.py][INFO] Epoch:[0/2](324400/4588595) loss:3.212 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:53:56,732][model8_pretrain.py][INFO] Epoch:[0/2](324400/4588595) loss:2.886 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:53:56,732][model8_pretrain.py][INFO] Epoch:[0/2](324400/4588595) loss:2.784 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:53:56,732][model8_pretrain.py][INFO] Epoch:[0/2](324400/4588595) loss:2.701 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:53:56,732][model8_pretrain.py][INFO] Epoch:[0/2](324400/4588595) loss:2.972 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:53:56,732][model8_pretrain.py][INFO] Epoch:[0/2](324400/4588595) loss:3.284 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:53:56,732][model8_pretrain.py][INFO] Epoch:[0/2](324400/4588595) loss:2.660 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:54:33,660][model8_pretrain.py][INFO] Epoch:[0/2](324500/4588595) loss:3.040 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:54:33,660][model8_pretrain.py][INFO] Epoch:[0/2](324500/4588595) loss:2.735 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:54:33,660][model8_pretrain.py][INFO] Epoch:[0/2](324500/4588595) loss:2.604 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:54:33,660][model8_pretrain.py][INFO] Epoch:[0/2](324500/4588595) loss:3.155 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:54:33,660][model8_pretrain.py][INFO] Epoch:[0/2](324500/4588595) loss:2.996 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:54:33,660][model8_pretrain.py][INFO] Epoch:[0/2](324500/4588595) loss:3.376 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:54:33,660][model8_pretrain.py][INFO] Epoch:[0/2](324500/4588595) loss:2.666 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:54:33,660][model8_pretrain.py][INFO] Epoch:[0/2](324500/4588595) loss:3.113 lr:0.0000100 epoch_Time:27001.0min: [2024-01-04 03:55:10,605][model8_pretrain.py][INFO] 
Epoch:[0/2](324600/4588595) loss:2.569 lr:0.0000100 epoch_Time:27000.0min: [2024-01-04 03:55:10,605][model8_pretrain.py][INFO] Epoch:[0/2](324600/4588595) loss:2.535 lr:0.0000100 epoch_Time:27000.0min: [2024-01-04 03:55:10,605][model8_pretrain.py][INFO] Epoch:[0/2](324600/4588595) loss:3.155 lr:0.0000100 epoch_Time:27000.0min: [2024-01-04 03:55:10,605][model8_pretrain.py][INFO] Epoch:[0/2](324600/4588595) loss:2.935 lr:0.0000100 epoch_Time:27000.0min: [2024-01-04 03:55:10,605][model8_pretrain.py][INFO] Epoch:[0/2](324600/4588595) loss:2.972 lr:0.0000100 epoch_Time:27000.0min: [2024-01-04 03:55:10,605][model8_pretrain.py][INFO] Epoch:[0/2](324600/4588595) loss:2.298 lr:0.0000100 epoch_Time:27000.0min: [2024-01-04 03:55:10,605][model8_pretrain.py][INFO] Epoch:[0/2](324600/4588595) loss:2.931 lr:0.0000100 epoch_Time:27000.0min: [2024-01-04 03:55:10,605][model8_pretrain.py][INFO] Epoch:[0/2](324600/4588595) loss:2.418 lr:0.0000100 epoch_Time:27000.0min: [2024-01-04 03:55:47,540][model8_pretrain.py][INFO] Epoch:[0/2](324700/4588595) loss:3.365 lr:0.0000100 epoch_Time:26999.0min: [2024-01-04 03:55:47,540][model8_pretrain.py][INFO] Epoch:[0/2](324700/4588595) loss:2.352 lr:0.0000100 epoch_Time:26999.0min: [2024-01-04 03:55:47,540][model8_pretrain.py][INFO] Epoch:[0/2](324700/4588595) loss:2.903 lr:0.0000100 epoch_Time:26999.0min: [2024-01-04 03:55:47,540][model8_pretrain.py][INFO] Epoch:[0/2](324700/4588595) loss:3.087 lr:0.0000100 epoch_Time:26999.0min: [2024-01-04 03:55:47,540][model8_pretrain.py][INFO] Epoch:[0/2](324700/4588595) loss:3.238 lr:0.0000100 epoch_Time:26999.0min: [2024-01-04 03:55:47,540][model8_pretrain.py][INFO] Epoch:[0/2](324700/4588595) loss:3.358 lr:0.0000100 epoch_Time:26999.0min: [2024-01-04 03:55:47,540][model8_pretrain.py][INFO] Epoch:[0/2](324700/4588595) loss:2.592 lr:0.0000100 epoch_Time:26999.0min: [2024-01-04 03:55:47,540][model8_pretrain.py][INFO] Epoch:[0/2](324700/4588595) loss:2.984 lr:0.0000100 epoch_Time:26999.0min: [2024-01-04 
03:56:24,471][model8_pretrain.py][INFO] Epoch:[0/2](324800/4588595) loss:2.056 lr:0.0000100 epoch_Time:26998.0min: [2024-01-04 03:56:24,471][model8_pretrain.py][INFO] Epoch:[0/2](324800/4588595) loss:2.939 lr:0.0000100 epoch_Time:26998.0min: [2024-01-04 03:56:24,471][model8_pretrain.py][INFO] Epoch:[0/2](324800/4588595) loss:2.951 lr:0.0000100 epoch_Time:26998.0min: [2024-01-04 03:56:24,471][model8_pretrain.py][INFO] Epoch:[0/2](324800/4588595) loss:3.212 lr:0.0000100 epoch_Time:26998.0min: [2024-01-04 03:56:24,471][model8_pretrain.py][INFO] Epoch:[0/2](324800/4588595) loss:3.222 lr:0.0000100 epoch_Time:26998.0min: [2024-01-04 03:56:24,471][model8_pretrain.py][INFO] Epoch:[0/2](324800/4588595) loss:2.550 lr:0.0000100 epoch_Time:26998.0min: [2024-01-04 03:56:24,471][model8_pretrain.py][INFO] Epoch:[0/2](324800/4588595) loss:2.917 lr:0.0000100 epoch_Time:26998.0min: [2024-01-04 03:56:24,471][model8_pretrain.py][INFO] Epoch:[0/2](324800/4588595) loss:3.021 lr:0.0000100 epoch_Time:26998.0min: [2024-01-04 03:57:01,408][model8_pretrain.py][INFO] Epoch:[0/2](324900/4588595) loss:2.602 lr:0.0000100 epoch_Time:26997.0min: [2024-01-04 03:57:01,408][model8_pretrain.py][INFO] Epoch:[0/2](324900/4588595) loss:3.286 lr:0.0000100 epoch_Time:26997.0min: [2024-01-04 03:57:01,408][model8_pretrain.py][INFO] Epoch:[0/2](324900/4588595) loss:2.373 lr:0.0000100 epoch_Time:26997.0min: [2024-01-04 03:57:01,408][model8_pretrain.py][INFO] Epoch:[0/2](324900/4588595) loss:2.869 lr:0.0000100 epoch_Time:26997.0min: [2024-01-04 03:57:01,408][model8_pretrain.py][INFO] Epoch:[0/2](324900/4588595) loss:2.934 lr:0.0000100 epoch_Time:26997.0min: [2024-01-04 03:57:01,408][model8_pretrain.py][INFO] Epoch:[0/2](324900/4588595) loss:2.570 lr:0.0000100 epoch_Time:26997.0min: [2024-01-04 03:57:01,408][model8_pretrain.py][INFO] Epoch:[0/2](324900/4588595) loss:2.757 lr:0.0000100 epoch_Time:26997.0min: [2024-01-04 03:57:01,408][model8_pretrain.py][INFO] Epoch:[0/2](324900/4588595) loss:3.271 lr:0.0000100 
epoch_Time:26997.0min: [2024-01-04 03:57:38,345][model8_pretrain.py][INFO] Epoch:[0/2](325000/4588595) loss:2.628 lr:0.0000100 epoch_Time:26997.0min: [2024-01-04 03:57:38,345][model8_pretrain.py][INFO] Epoch:[0/2](325000/4588595) loss:2.802 lr:0.0000100 epoch_Time:26997.0min: [2024-01-04 03:57:38,345][model8_pretrain.py][INFO] Epoch:[0/2](325000/4588595) loss:2.694 lr:0.0000100 epoch_Time:26997.0min: [2024-01-04 03:57:38,345][model8_pretrain.py][INFO] Epoch:[0/2](325000/4588595) loss:2.834 lr:0.0000100 epoch_Time:26997.0min: [2024-01-04 03:57:38,345][model8_pretrain.py][INFO] Epoch:[0/2](325000/4588595) loss:2.860 lr:0.0000100 epoch_Time:26997.0min: [2024-01-04 03:57:38,345][model8_pretrain.py][INFO] Epoch:[0/2](325000/4588595) loss:2.900 lr:0.0000100 epoch_Time:26997.0min: [2024-01-04 03:57:38,345][model8_pretrain.py][INFO] Epoch:[0/2](325000/4588595) loss:2.602 lr:0.0000100 epoch_Time:26997.0min: [2024-01-04 03:57:38,345][model8_pretrain.py][INFO] Epoch:[0/2](325000/4588595) loss:2.835 lr:0.0000100 epoch_Time:26997.0min: [2024-01-04 03:58:15,273][model8_pretrain.py][INFO] Epoch:[0/2](325100/4588595) loss:2.864 lr:0.0000100 epoch_Time:26995.0min: [2024-01-04 03:58:15,273][model8_pretrain.py][INFO] Epoch:[0/2](325100/4588595) loss:3.042 lr:0.0000100 epoch_Time:26995.0min: [2024-01-04 03:58:15,273][model8_pretrain.py][INFO] Epoch:[0/2](325100/4588595) loss:2.704 lr:0.0000100 epoch_Time:26995.0min: [2024-01-04 03:58:15,273][model8_pretrain.py][INFO] Epoch:[0/2](325100/4588595) loss:2.994 lr:0.0000100 epoch_Time:26995.0min: [2024-01-04 03:58:15,273][model8_pretrain.py][INFO] Epoch:[0/2](325100/4588595) loss:3.052 lr:0.0000100 epoch_Time:26995.0min: [2024-01-04 03:58:15,273][model8_pretrain.py][INFO] Epoch:[0/2](325100/4588595) loss:2.829 lr:0.0000100 epoch_Time:26995.0min: [2024-01-04 03:58:15,273][model8_pretrain.py][INFO] Epoch:[0/2](325100/4588595) loss:2.664 lr:0.0000100 epoch_Time:26995.0min: [2024-01-04 03:58:15,273][model8_pretrain.py][INFO] 
Epoch:[0/2](325100/4588595) loss:2.625 lr:0.0000100 epoch_Time:26995.0min: [2024-01-04 03:59:02,677][model8_pretrain.py][INFO] Epoch:[0/2](325200/4588595) loss:2.750 lr:0.0000100 epoch_Time:26997.0min: [2024-01-04 03:59:02,677][model8_pretrain.py][INFO] Epoch:[0/2](325200/4588595) loss:2.662 lr:0.0000100 epoch_Time:26997.0min: [2024-01-04 03:59:02,677][model8_pretrain.py][INFO] Epoch:[0/2](325200/4588595) loss:2.677 lr:0.0000100 epoch_Time:26997.0min: [2024-01-04 03:59:02,678][model8_pretrain.py][INFO] Epoch:[0/2](325200/4588595) loss:2.649 lr:0.0000100 epoch_Time:26997.0min: [2024-01-04 03:59:02,678][model8_pretrain.py][INFO] Epoch:[0/2](325200/4588595) loss:3.279 lr:0.0000100 epoch_Time:26997.0min: [2024-01-04 03:59:02,678][model8_pretrain.py][INFO] Epoch:[0/2](325200/4588595) loss:2.872 lr:0.0000100 epoch_Time:26997.0min: [2024-01-04 03:59:02,678][model8_pretrain.py][INFO] Epoch:[0/2](325200/4588595) loss:2.753 lr:0.0000100 epoch_Time:26997.0min: [2024-01-04 03:59:02,678][model8_pretrain.py][INFO] Epoch:[0/2](325200/4588595) loss:2.411 lr:0.0000100 epoch_Time:26997.0min: [2024-01-04 03:59:39,598][model8_pretrain.py][INFO] Epoch:[0/2](325300/4588595) loss:2.722 lr:0.0000100 epoch_Time:26996.0min: [2024-01-04 03:59:39,598][model8_pretrain.py][INFO] Epoch:[0/2](325300/4588595) loss:3.111 lr:0.0000100 epoch_Time:26996.0min: [2024-01-04 03:59:39,598][model8_pretrain.py][INFO] Epoch:[0/2](325300/4588595) loss:2.957 lr:0.0000100 epoch_Time:26996.0min: [2024-01-04 03:59:39,598][model8_pretrain.py][INFO] Epoch:[0/2](325300/4588595) loss:2.750 lr:0.0000100 epoch_Time:26996.0min: [2024-01-04 03:59:39,598][model8_pretrain.py][INFO] Epoch:[0/2](325300/4588595) loss:3.259 lr:0.0000100 epoch_Time:26996.0min: [2024-01-04 03:59:39,598][model8_pretrain.py][INFO] Epoch:[0/2](325300/4588595) loss:3.221 lr:0.0000100 epoch_Time:26996.0min: [2024-01-04 03:59:39,598][model8_pretrain.py][INFO] Epoch:[0/2](325300/4588595) loss:3.162 lr:0.0000100 epoch_Time:26996.0min: [2024-01-04 
03:59:39,598][model8_pretrain.py][INFO] Epoch:[0/2](325300/4588595) loss:3.276 lr:0.0000100 epoch_Time:26996.0min: [2024-01-04 04:00:16,553][model8_pretrain.py][INFO] Epoch:[0/2](325400/4588595) loss:2.850 lr:0.0000100 epoch_Time:26995.0min: [2024-01-04 04:00:16,553][model8_pretrain.py][INFO] Epoch:[0/2](325400/4588595) loss:2.736 lr:0.0000100 epoch_Time:26995.0min: [2024-01-04 04:00:16,553][model8_pretrain.py][INFO] Epoch:[0/2](325400/4588595) loss:2.555 lr:0.0000100 epoch_Time:26995.0min: [2024-01-04 04:00:16,553][model8_pretrain.py][INFO] Epoch:[0/2](325400/4588595) loss:2.816 lr:0.0000100 epoch_Time:26995.0min: [2024-01-04 04:00:16,553][model8_pretrain.py][INFO] Epoch:[0/2](325400/4588595) loss:3.215 lr:0.0000100 epoch_Time:26995.0min: [2024-01-04 04:00:16,553][model8_pretrain.py][INFO] Epoch:[0/2](325400/4588595) loss:3.266 lr:0.0000100 epoch_Time:26995.0min: [2024-01-04 04:00:16,554][model8_pretrain.py][INFO] Epoch:[0/2](325400/4588595) loss:2.790 lr:0.0000100 epoch_Time:26995.0min: [2024-01-04 04:00:16,554][model8_pretrain.py][INFO] Epoch:[0/2](325400/4588595) loss:2.915 lr:0.0000100 epoch_Time:26995.0min: [2024-01-04 04:00:53,509][model8_pretrain.py][INFO] Epoch:[0/2](325500/4588595) loss:2.587 lr:0.0000100 epoch_Time:26994.0min: [2024-01-04 04:00:53,509][model8_pretrain.py][INFO] Epoch:[0/2](325500/4588595) loss:2.730 lr:0.0000100 epoch_Time:26994.0min: [2024-01-04 04:00:53,509][model8_pretrain.py][INFO] Epoch:[0/2](325500/4588595) loss:3.460 lr:0.0000100 epoch_Time:26994.0min: [2024-01-04 04:00:53,509][model8_pretrain.py][INFO] Epoch:[0/2](325500/4588595) loss:2.184 lr:0.0000100 epoch_Time:26994.0min: [2024-01-04 04:00:53,509][model8_pretrain.py][INFO] Epoch:[0/2](325500/4588595) loss:2.754 lr:0.0000100 epoch_Time:26994.0min: [2024-01-04 04:00:53,509][model8_pretrain.py][INFO] Epoch:[0/2](325500/4588595) loss:3.533 lr:0.0000100 epoch_Time:26994.0min: [2024-01-04 04:00:53,509][model8_pretrain.py][INFO] Epoch:[0/2](325500/4588595) loss:2.862 lr:0.0000100 
epoch_Time:26994.0min: [2024-01-04 04:00:53,509][model8_pretrain.py][INFO] Epoch:[0/2](325500/4588595) loss:2.411 lr:0.0000100 epoch_Time:26994.0min: [2024-01-04 04:01:30,452][model8_pretrain.py][INFO] Epoch:[0/2](325600/4588595) loss:2.988 lr:0.0000100 epoch_Time:26994.0min: [2024-01-04 04:01:30,452][model8_pretrain.py][INFO] Epoch:[0/2](325600/4588595) loss:2.441 lr:0.0000100 epoch_Time:26994.0min: [2024-01-04 04:01:30,452][model8_pretrain.py][INFO] Epoch:[0/2](325600/4588595) loss:3.149 lr:0.0000100 epoch_Time:26994.0min: [2024-01-04 04:01:30,452][model8_pretrain.py][INFO] Epoch:[0/2](325600/4588595) loss:3.309 lr:0.0000100 epoch_Time:26994.0min: [2024-01-04 04:01:30,452][model8_pretrain.py][INFO] Epoch:[0/2](325600/4588595) loss:2.717 lr:0.0000100 epoch_Time:26994.0min: [2024-01-04 04:01:30,452][model8_pretrain.py][INFO] Epoch:[0/2](325600/4588595) loss:2.713 lr:0.0000100 epoch_Time:26994.0min: [2024-01-04 04:01:30,452][model8_pretrain.py][INFO] Epoch:[0/2](325600/4588595) loss:3.119 lr:0.0000100 epoch_Time:26994.0min: [2024-01-04 04:01:30,452][model8_pretrain.py][INFO] Epoch:[0/2](325600/4588595) loss:3.486 lr:0.0000100 epoch_Time:26994.0min: [2024-01-04 04:02:07,406][model8_pretrain.py][INFO] Epoch:[0/2](325700/4588595) loss:2.601 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:02:07,406][model8_pretrain.py][INFO] Epoch:[0/2](325700/4588595) loss:2.554 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:02:07,406][model8_pretrain.py][INFO] Epoch:[0/2](325700/4588595) loss:3.319 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:02:07,406][model8_pretrain.py][INFO] Epoch:[0/2](325700/4588595) loss:3.106 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:02:07,406][model8_pretrain.py][INFO] Epoch:[0/2](325700/4588595) loss:3.285 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:02:07,406][model8_pretrain.py][INFO] Epoch:[0/2](325700/4588595) loss:2.421 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:02:07,406][model8_pretrain.py][INFO] 
Epoch:[0/2](325700/4588595) loss:2.977 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:02:07,406][model8_pretrain.py][INFO] Epoch:[0/2](325700/4588595) loss:2.559 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:02:44,361][model8_pretrain.py][INFO] Epoch:[0/2](325800/4588595) loss:2.711 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:02:44,361][model8_pretrain.py][INFO] Epoch:[0/2](325800/4588595) loss:3.470 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:02:44,361][model8_pretrain.py][INFO] Epoch:[0/2](325800/4588595) loss:3.550 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:02:44,361][model8_pretrain.py][INFO] Epoch:[0/2](325800/4588595) loss:2.838 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:02:44,361][model8_pretrain.py][INFO] Epoch:[0/2](325800/4588595) loss:3.178 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:02:44,361][model8_pretrain.py][INFO] Epoch:[0/2](325800/4588595) loss:2.948 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:02:44,361][model8_pretrain.py][INFO] Epoch:[0/2](325800/4588595) loss:3.121 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:02:44,361][model8_pretrain.py][INFO] Epoch:[0/2](325800/4588595) loss:2.890 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:03:21,316][model8_pretrain.py][INFO] Epoch:[0/2](325900/4588595) loss:2.890 lr:0.0000100 epoch_Time:26991.0min: [2024-01-04 04:03:21,316][model8_pretrain.py][INFO] Epoch:[0/2](325900/4588595) loss:3.278 lr:0.0000100 epoch_Time:26991.0min: [2024-01-04 04:03:21,316][model8_pretrain.py][INFO] Epoch:[0/2](325900/4588595) loss:2.744 lr:0.0000100 epoch_Time:26991.0min: [2024-01-04 04:03:21,316][model8_pretrain.py][INFO] Epoch:[0/2](325900/4588595) loss:3.040 lr:0.0000100 epoch_Time:26991.0min: [2024-01-04 04:03:21,316][model8_pretrain.py][INFO] Epoch:[0/2](325900/4588595) loss:3.440 lr:0.0000100 epoch_Time:26991.0min: [2024-01-04 04:03:21,316][model8_pretrain.py][INFO] Epoch:[0/2](325900/4588595) loss:2.927 lr:0.0000100 epoch_Time:26991.0min: [2024-01-04 
04:03:21,316][model8_pretrain.py][INFO] Epoch:[0/2](325900/4588595) loss:2.587 lr:0.0000100 epoch_Time:26991.0min: [2024-01-04 04:03:21,316][model8_pretrain.py][INFO] Epoch:[0/2](325900/4588595) loss:3.274 lr:0.0000100 epoch_Time:26991.0min: [2024-01-04 04:04:08,702][model8_pretrain.py][INFO] Epoch:[0/2](326000/4588595) loss:2.658 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:04:08,702][model8_pretrain.py][INFO] Epoch:[0/2](326000/4588595) loss:3.177 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:04:08,702][model8_pretrain.py][INFO] Epoch:[0/2](326000/4588595) loss:3.053 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:04:08,702][model8_pretrain.py][INFO] Epoch:[0/2](326000/4588595) loss:2.933 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:04:08,702][model8_pretrain.py][INFO] Epoch:[0/2](326000/4588595) loss:2.907 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:04:08,702][model8_pretrain.py][INFO] Epoch:[0/2](326000/4588595) loss:3.296 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:04:08,702][model8_pretrain.py][INFO] Epoch:[0/2](326000/4588595) loss:3.062 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:04:08,702][model8_pretrain.py][INFO] Epoch:[0/2](326000/4588595) loss:2.457 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:04:45,596][model8_pretrain.py][INFO] Epoch:[0/2](326100/4588595) loss:2.303 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:04:45,596][model8_pretrain.py][INFO] Epoch:[0/2](326100/4588595) loss:3.063 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:04:45,596][model8_pretrain.py][INFO] Epoch:[0/2](326100/4588595) loss:2.932 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:04:45,596][model8_pretrain.py][INFO] Epoch:[0/2](326100/4588595) loss:3.110 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:04:45,596][model8_pretrain.py][INFO] Epoch:[0/2](326100/4588595) loss:2.668 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:04:45,596][model8_pretrain.py][INFO] Epoch:[0/2](326100/4588595) loss:2.882 lr:0.0000100 
epoch_Time:26992.0min: [2024-01-04 04:04:45,596][model8_pretrain.py][INFO] Epoch:[0/2](326100/4588595) loss:2.980 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:04:45,596][model8_pretrain.py][INFO] Epoch:[0/2](326100/4588595) loss:3.222 lr:0.0000100 epoch_Time:26992.0min: [2024-01-04 04:05:22,533][model8_pretrain.py][INFO] Epoch:[0/2](326200/4588595) loss:2.996 lr:0.0000100 epoch_Time:26991.0min: [2024-01-04 04:05:22,533][model8_pretrain.py][INFO] Epoch:[0/2](326200/4588595) loss:3.313 lr:0.0000100 epoch_Time:26991.0min: [2024-01-04 04:05:22,533][model8_pretrain.py][INFO] Epoch:[0/2](326200/4588595) loss:3.182 lr:0.0000100 epoch_Time:26991.0min: [2024-01-04 04:05:22,533][model8_pretrain.py][INFO] Epoch:[0/2](326200/4588595) loss:2.697 lr:0.0000100 epoch_Time:26991.0min: [2024-01-04 04:05:22,533][model8_pretrain.py][INFO] Epoch:[0/2](326200/4588595) loss:3.230 lr:0.0000100 epoch_Time:26991.0min: [2024-01-04 04:05:22,534][model8_pretrain.py][INFO] Epoch:[0/2](326200/4588595) loss:3.091 lr:0.0000100 epoch_Time:26991.0min: [2024-01-04 04:05:22,534][model8_pretrain.py][INFO] Epoch:[0/2](326200/4588595) loss:3.201 lr:0.0000100 epoch_Time:26991.0min: [2024-01-04 04:05:22,534][model8_pretrain.py][INFO] Epoch:[0/2](326200/4588595) loss:2.861 lr:0.0000100 epoch_Time:26991.0min: [2024-01-04 04:05:59,484][model8_pretrain.py][INFO] Epoch:[0/2](326300/4588595) loss:2.950 lr:0.0000100 epoch_Time:26989.0min: [2024-01-04 04:05:59,484][model8_pretrain.py][INFO] Epoch:[0/2](326300/4588595) loss:2.970 lr:0.0000100 epoch_Time:26989.0min: [2024-01-04 04:05:59,484][model8_pretrain.py][INFO] Epoch:[0/2](326300/4588595) loss:2.675 lr:0.0000100 epoch_Time:26989.0min: [2024-01-04 04:05:59,484][model8_pretrain.py][INFO] Epoch:[0/2](326300/4588595) loss:2.938 lr:0.0000100 epoch_Time:26989.0min: [2024-01-04 04:05:59,484][model8_pretrain.py][INFO] Epoch:[0/2](326300/4588595) loss:2.934 lr:0.0000100 epoch_Time:26989.0min: [2024-01-04 04:05:59,484][model8_pretrain.py][INFO] 
Epoch:[0/2](326300/4588595) loss:2.636 lr:0.0000100 epoch_Time:26989.0min: [2024-01-04 04:05:59,484][model8_pretrain.py][INFO] Epoch:[0/2](326300/4588595) loss:3.089 lr:0.0000100 epoch_Time:26989.0min: [2024-01-04 04:05:59,484][model8_pretrain.py][INFO] Epoch:[0/2](326300/4588595) loss:3.051 lr:0.0000100 epoch_Time:26989.0min: [2024-01-04 04:06:36,428][model8_pretrain.py][INFO] Epoch:[0/2](326400/4588595) loss:2.627 lr:0.0000100 epoch_Time:26989.0min: [2024-01-04 04:06:36,428][model8_pretrain.py][INFO] Epoch:[0/2](326400/4588595) loss:3.158 lr:0.0000100 epoch_Time:26989.0min: [2024-01-04 04:06:36,428][model8_pretrain.py][INFO] Epoch:[0/2](326400/4588595) loss:2.987 lr:0.0000100 epoch_Time:26989.0min: [2024-01-04 04:06:36,428][model8_pretrain.py][INFO] Epoch:[0/2](326400/4588595) loss:2.498 lr:0.0000100 epoch_Time:26989.0min: [2024-01-04 04:06:36,428][model8_pretrain.py][INFO] Epoch:[0/2](326400/4588595) loss:2.881 lr:0.0000100 epoch_Time:26989.0min: [2024-01-04 04:06:36,428][model8_pretrain.py][INFO] Epoch:[0/2](326400/4588595) loss:2.589 lr:0.0000100 epoch_Time:26989.0min: [2024-01-04 04:06:36,428][model8_pretrain.py][INFO] Epoch:[0/2](326400/4588595) loss:3.472 lr:0.0000100 epoch_Time:26989.0min: [2024-01-04 04:06:36,428][model8_pretrain.py][INFO] Epoch:[0/2](326400/4588595) loss:2.834 lr:0.0000100 epoch_Time:26989.0min: [2024-01-04 04:07:13,364][model8_pretrain.py][INFO] Epoch:[0/2](326500/4588595) loss:3.288 lr:0.0000100 epoch_Time:26988.0min: [2024-01-04 04:07:13,364][model8_pretrain.py][INFO] Epoch:[0/2](326500/4588595) loss:3.003 lr:0.0000100 epoch_Time:26988.0min: [2024-01-04 04:07:13,364][model8_pretrain.py][INFO] Epoch:[0/2](326500/4588595) loss:2.857 lr:0.0000100 epoch_Time:26988.0min: [2024-01-04 04:07:13,364][model8_pretrain.py][INFO] Epoch:[0/2](326500/4588595) loss:3.092 lr:0.0000100 epoch_Time:26988.0min: [2024-01-04 04:07:13,364][model8_pretrain.py][INFO] Epoch:[0/2](326500/4588595) loss:2.956 lr:0.0000100 epoch_Time:26988.0min: [2024-01-04 
04:07:13,364][model8_pretrain.py][INFO] Epoch:[0/2](326500/4588595) loss:2.807 lr:0.0000100 epoch_Time:26988.0min: [2024-01-04 04:07:13,364][model8_pretrain.py][INFO] Epoch:[0/2](326500/4588595) loss:3.131 lr:0.0000100 epoch_Time:26988.0min: [2024-01-04 04:07:13,364][model8_pretrain.py][INFO] Epoch:[0/2](326500/4588595) loss:3.189 lr:0.0000100 epoch_Time:26988.0min: [2024-01-04 04:07:50,299][model8_pretrain.py][INFO] Epoch:[0/2](326600/4588595) loss:3.324 lr:0.0000100 epoch_Time:26987.0min: [2024-01-04 04:07:50,299][model8_pretrain.py][INFO] Epoch:[0/2](326600/4588595) loss:2.663 lr:0.0000100 epoch_Time:26987.0min: [2024-01-04 04:07:50,299][model8_pretrain.py][INFO] Epoch:[0/2](326600/4588595) loss:2.690 lr:0.0000100 epoch_Time:26987.0min: [2024-01-04 04:07:50,299][model8_pretrain.py][INFO] Epoch:[0/2](326600/4588595) loss:2.848 lr:0.0000100 epoch_Time:26987.0min: [2024-01-04 04:07:50,299][model8_pretrain.py][INFO] Epoch:[0/2](326600/4588595) loss:2.914 lr:0.0000100 epoch_Time:26987.0min: [2024-01-04 04:07:50,299][model8_pretrain.py][INFO] Epoch:[0/2](326600/4588595) loss:3.040 lr:0.0000100 epoch_Time:26987.0min: [2024-01-04 04:07:50,299][model8_pretrain.py][INFO] Epoch:[0/2](326600/4588595) loss:2.796 lr:0.0000100 epoch_Time:26987.0min: [2024-01-04 04:07:50,299][model8_pretrain.py][INFO] Epoch:[0/2](326600/4588595) loss:2.791 lr:0.0000100 epoch_Time:26987.0min: [2024-01-04 04:08:27,250][model8_pretrain.py][INFO] Epoch:[0/2](326700/4588595) loss:2.945 lr:0.0000100 epoch_Time:26986.0min: [2024-01-04 04:08:27,250][model8_pretrain.py][INFO] Epoch:[0/2](326700/4588595) loss:2.927 lr:0.0000100 epoch_Time:26986.0min: [2024-01-04 04:08:27,250][model8_pretrain.py][INFO] Epoch:[0/2](326700/4588595) loss:2.383 lr:0.0000100 epoch_Time:26986.0min: [2024-01-04 04:08:27,250][model8_pretrain.py][INFO] Epoch:[0/2](326700/4588595) loss:2.793 lr:0.0000100 epoch_Time:26986.0min: [2024-01-04 04:08:27,250][model8_pretrain.py][INFO] Epoch:[0/2](326700/4588595) loss:2.935 lr:0.0000100 
epoch_Time:26986.0min: [2024-01-04 04:08:27,250][model8_pretrain.py][INFO] Epoch:[0/2](326700/4588595) loss:2.951 lr:0.0000100 epoch_Time:26986.0min: [2024-01-04 04:08:27,250][model8_pretrain.py][INFO] Epoch:[0/2](326700/4588595) loss:3.041 lr:0.0000100 epoch_Time:26986.0min: [2024-01-04 04:08:27,250][model8_pretrain.py][INFO] Epoch:[0/2](326700/4588595) loss:3.272 lr:0.0000100 epoch_Time:26986.0min: [2024-01-04 04:09:14,557][model8_pretrain.py][INFO] Epoch:[0/2](326800/4588595) loss:3.089 lr:0.0000100 epoch_Time:26988.0min: [2024-01-04 04:09:14,557][model8_pretrain.py][INFO] Epoch:[0/2](326800/4588595) loss:2.917 lr:0.0000100 epoch_Time:26988.0min: [2024-01-04 04:09:14,557][model8_pretrain.py][INFO] Epoch:[0/2](326800/4588595) loss:2.912 lr:0.0000100 epoch_Time:26988.0min: [2024-01-04 04:09:14,557][model8_pretrain.py][INFO] Epoch:[0/2](326800/4588595) loss:2.886 lr:0.0000100 epoch_Time:26988.0min: [2024-01-04 04:09:14,557][model8_pretrain.py][INFO] Epoch:[0/2](326800/4588595) loss:3.198 lr:0.0000100 epoch_Time:26988.0min: [2024-01-04 04:09:14,557][model8_pretrain.py][INFO] Epoch:[0/2](326800/4588595) loss:3.123 lr:0.0000100 epoch_Time:26988.0min: [2024-01-04 04:09:14,557][model8_pretrain.py][INFO] Epoch:[0/2](326800/4588595) loss:2.416 lr:0.0000100 epoch_Time:26988.0min: [2024-01-04 04:09:14,557][model8_pretrain.py][INFO] Epoch:[0/2](326800/4588595) loss:2.991 lr:0.0000100 epoch_Time:26988.0min: [2024-01-04 04:09:51,482][model8_pretrain.py][INFO] Epoch:[0/2](326900/4588595) loss:2.759 lr:0.0000100 epoch_Time:26986.0min: [2024-01-04 04:09:51,482][model8_pretrain.py][INFO] Epoch:[0/2](326900/4588595) loss:2.802 lr:0.0000100 epoch_Time:26986.0min: [2024-01-04 04:09:51,482][model8_pretrain.py][INFO] Epoch:[0/2](326900/4588595) loss:2.665 lr:0.0000100 epoch_Time:26986.0min: [2024-01-04 04:09:51,482][model8_pretrain.py][INFO] Epoch:[0/2](326900/4588595) loss:2.916 lr:0.0000100 epoch_Time:26986.0min: [2024-01-04 04:09:51,482][model8_pretrain.py][INFO] 
Epoch:[0/2](326900/4588595) loss:2.923 lr:0.0000100 epoch_Time:26986.0min: [2024-01-04 04:09:51,482][model8_pretrain.py][INFO] Epoch:[0/2](326900/4588595) loss:3.325 lr:0.0000100 epoch_Time:26986.0min: [2024-01-04 04:09:51,482][model8_pretrain.py][INFO] Epoch:[0/2](326900/4588595) loss:2.911 lr:0.0000100 epoch_Time:26986.0min: [2024-01-04 04:09:51,482][model8_pretrain.py][INFO] Epoch:[0/2](326900/4588595) loss:2.056 lr:0.0000100 epoch_Time:26986.0min: [2024-01-04 04:10:28,436][model8_pretrain.py][INFO] Epoch:[0/2](327000/4588595) loss:2.639 lr:0.0000100 epoch_Time:26986.0min: [2024-01-04 04:10:28,436][model8_pretrain.py][INFO] Epoch:[0/2](327000/4588595) loss:2.858 lr:0.0000100 epoch_Time:26986.0min: [2024-01-04 04:10:28,436][model8_pretrain.py][INFO] Epoch:[0/2](327000/4588595) loss:2.958 lr:0.0000100 epoch_Time:26986.0min: [2024-01-04 04:10:28,436][model8_pretrain.py][INFO] Epoch:[0/2](327000/4588595) loss:3.084 lr:0.0000100 epoch_Time:26986.0min: [2024-01-04 04:10:28,436][model8_pretrain.py][INFO] Epoch:[0/2](327000/4588595) loss:2.536 lr:0.0000100 epoch_Time:26986.0min: [2024-01-04 04:10:28,436][model8_pretrain.py][INFO] Epoch:[0/2](327000/4588595) loss:2.787 lr:0.0000100 epoch_Time:26986.0min: [2024-01-04 04:10:28,436][model8_pretrain.py][INFO] Epoch:[0/2](327000/4588595) loss:2.795 lr:0.0000100 epoch_Time:26986.0min: [2024-01-04 04:10:28,436][model8_pretrain.py][INFO] Epoch:[0/2](327000/4588595) loss:2.984 lr:0.0000100 epoch_Time:26986.0min: [2024-01-04 04:11:05,401][model8_pretrain.py][INFO] Epoch:[0/2](327100/4588595) loss:3.058 lr:0.0000100 epoch_Time:26985.0min: [2024-01-04 04:11:05,401][model8_pretrain.py][INFO] Epoch:[0/2](327100/4588595) loss:3.470 lr:0.0000100 epoch_Time:26985.0min: [2024-01-04 04:11:05,401][model8_pretrain.py][INFO] Epoch:[0/2](327100/4588595) loss:3.064 lr:0.0000100 epoch_Time:26985.0min: [2024-01-04 04:11:05,401][model8_pretrain.py][INFO] Epoch:[0/2](327100/4588595) loss:2.708 lr:0.0000100 epoch_Time:26985.0min: [2024-01-04 
04:11:05,401][model8_pretrain.py][INFO] Epoch:[0/2](327100/4588595) loss:3.006 lr:0.0000100 epoch_Time:26985.0min: [2024-01-04 04:11:05,401][model8_pretrain.py][INFO] Epoch:[0/2](327100/4588595) loss:2.595 lr:0.0000100 epoch_Time:26985.0min: [2024-01-04 04:11:05,401][model8_pretrain.py][INFO] Epoch:[0/2](327100/4588595) loss:2.815 lr:0.0000100 epoch_Time:26985.0min: [2024-01-04 04:11:05,401][model8_pretrain.py][INFO] Epoch:[0/2](327100/4588595) loss:3.101 lr:0.0000100 epoch_Time:26985.0min: [2024-01-04 04:11:42,350][model8_pretrain.py][INFO] Epoch:[0/2](327200/4588595) loss:3.200 lr:0.0000100 epoch_Time:26985.0min: [2024-01-04 04:11:42,350][model8_pretrain.py][INFO] Epoch:[0/2](327200/4588595) loss:3.018 lr:0.0000100 epoch_Time:26985.0min: [2024-01-04 04:11:42,350][model8_pretrain.py][INFO] Epoch:[0/2](327200/4588595) loss:2.631 lr:0.0000100 epoch_Time:26985.0min: [2024-01-04 04:11:42,350][model8_pretrain.py][INFO] Epoch:[0/2](327200/4588595) loss:3.028 lr:0.0000100 epoch_Time:26985.0min: [2024-01-04 04:11:42,350][model8_pretrain.py][INFO] Epoch:[0/2](327200/4588595) loss:2.787 lr:0.0000100 epoch_Time:26985.0min: [2024-01-04 04:11:42,350][model8_pretrain.py][INFO] Epoch:[0/2](327200/4588595) loss:2.369 lr:0.0000100 epoch_Time:26985.0min: [2024-01-04 04:11:42,350][model8_pretrain.py][INFO] Epoch:[0/2](327200/4588595) loss:3.283 lr:0.0000100 epoch_Time:26985.0min: [2024-01-04 04:11:42,351][model8_pretrain.py][INFO] Epoch:[0/2](327200/4588595) loss:2.566 lr:0.0000100 epoch_Time:26985.0min: [2024-01-04 04:12:19,299][model8_pretrain.py][INFO] Epoch:[0/2](327300/4588595) loss:2.982 lr:0.0000100 epoch_Time:26983.0min: [2024-01-04 04:12:19,299][model8_pretrain.py][INFO] Epoch:[0/2](327300/4588595) loss:2.619 lr:0.0000100 epoch_Time:26983.0min: [2024-01-04 04:12:19,299][model8_pretrain.py][INFO] Epoch:[0/2](327300/4588595) loss:2.363 lr:0.0000100 epoch_Time:26983.0min: [2024-01-04 04:12:19,299][model8_pretrain.py][INFO] Epoch:[0/2](327300/4588595) loss:2.653 lr:0.0000100 
epoch_Time:26983.0min: [2024-01-04 04:12:19,299][model8_pretrain.py][INFO] Epoch:[0/2](327300/4588595) loss:3.039 lr:0.0000100 epoch_Time:26983.0min: [2024-01-04 04:12:19,299][model8_pretrain.py][INFO] Epoch:[0/2](327300/4588595) loss:2.168 lr:0.0000100 epoch_Time:26983.0min: [2024-01-04 04:12:19,299][model8_pretrain.py][INFO] Epoch:[0/2](327300/4588595) loss:2.599 lr:0.0000100 epoch_Time:26983.0min: [2024-01-04 04:12:19,299][model8_pretrain.py][INFO] Epoch:[0/2](327300/4588595) loss:2.771 lr:0.0000100 epoch_Time:26983.0min: [2024-01-04 04:12:56,248][model8_pretrain.py][INFO] Epoch:[0/2](327400/4588595) loss:2.612 lr:0.0000100 epoch_Time:26982.0min: [2024-01-04 04:12:56,248][model8_pretrain.py][INFO] Epoch:[0/2](327400/4588595) loss:2.775 lr:0.0000100 epoch_Time:26982.0min: [2024-01-04 04:12:56,248][model8_pretrain.py][INFO] Epoch:[0/2](327400/4588595) loss:2.643 lr:0.0000100 epoch_Time:26982.0min: [2024-01-04 04:12:56,248][model8_pretrain.py][INFO] Epoch:[0/2](327400/4588595) loss:2.678 lr:0.0000100 epoch_Time:26982.0min: [2024-01-04 04:12:56,248][model8_pretrain.py][INFO] Epoch:[0/2](327400/4588595) loss:2.628 lr:0.0000100 epoch_Time:26982.0min: [2024-01-04 04:12:56,248][model8_pretrain.py][INFO] Epoch:[0/2](327400/4588595) loss:2.738 lr:0.0000100 epoch_Time:26982.0min: [2024-01-04 04:12:56,248][model8_pretrain.py][INFO] Epoch:[0/2](327400/4588595) loss:2.635 lr:0.0000100 epoch_Time:26982.0min: [2024-01-04 04:12:56,248][model8_pretrain.py][INFO] Epoch:[0/2](327400/4588595) loss:2.902 lr:0.0000100 epoch_Time:26982.0min: [2024-01-04 04:13:33,195][model8_pretrain.py][INFO] Epoch:[0/2](327500/4588595) loss:3.136 lr:0.0000100 epoch_Time:26982.0min: [2024-01-04 04:13:33,195][model8_pretrain.py][INFO] Epoch:[0/2](327500/4588595) loss:2.302 lr:0.0000100 epoch_Time:26982.0min: [2024-01-04 04:13:33,195][model8_pretrain.py][INFO] Epoch:[0/2](327500/4588595) loss:3.647 lr:0.0000100 epoch_Time:26982.0min: [2024-01-04 04:13:33,195][model8_pretrain.py][INFO] 
Epoch:[0/2](327500/4588595) loss:2.805 lr:0.0000100 epoch_Time:26982.0min: [2024-01-04 04:13:33,195][model8_pretrain.py][INFO] Epoch:[0/2](327500/4588595) loss:2.786 lr:0.0000100 epoch_Time:26982.0min: [2024-01-04 04:13:33,195][model8_pretrain.py][INFO] Epoch:[0/2](327500/4588595) loss:2.464 lr:0.0000100 epoch_Time:26982.0min: [2024-01-04 04:13:33,195][model8_pretrain.py][INFO] Epoch:[0/2](327500/4588595) loss:3.341 lr:0.0000100 epoch_Time:26982.0min: [2024-01-04 04:13:33,195][model8_pretrain.py][INFO] Epoch:[0/2](327500/4588595) loss:3.077 lr:0.0000100 epoch_Time:26982.0min: [2024-01-04 04:14:18,847][model8_pretrain.py][INFO] Epoch:[0/2](327600/4588595) loss:2.456 lr:0.0000100 epoch_Time:26983.0min: [2024-01-04 04:14:18,847][model8_pretrain.py][INFO] Epoch:[0/2](327600/4588595) loss:2.884 lr:0.0000100 epoch_Time:26983.0min: [2024-01-04 04:14:18,847][model8_pretrain.py][INFO] Epoch:[0/2](327600/4588595) loss:2.830 lr:0.0000100 epoch_Time:26983.0min: [2024-01-04 04:14:18,847][model8_pretrain.py][INFO] Epoch:[0/2](327600/4588595) loss:3.024 lr:0.0000100 epoch_Time:26983.0min: [2024-01-04 04:14:18,847][model8_pretrain.py][INFO] Epoch:[0/2](327600/4588595) loss:2.840 lr:0.0000100 epoch_Time:26983.0min: [2024-01-04 04:14:18,847][model8_pretrain.py][INFO] Epoch:[0/2](327600/4588595) loss:3.025 lr:0.0000100 epoch_Time:26983.0min: [2024-01-04 04:14:18,848][model8_pretrain.py][INFO] Epoch:[0/2](327600/4588595) loss:3.251 lr:0.0000100 epoch_Time:26983.0min: [2024-01-04 04:14:18,848][model8_pretrain.py][INFO] Epoch:[0/2](327600/4588595) loss:3.042 lr:0.0000100 epoch_Time:26983.0min: [2024-01-04 04:14:57,451][model8_pretrain.py][INFO] Epoch:[0/2](327700/4588595) loss:2.438 lr:0.0000100 epoch_Time:26982.0min: [2024-01-04 04:14:57,451][model8_pretrain.py][INFO] Epoch:[0/2](327700/4588595) loss:2.627 lr:0.0000100 epoch_Time:26982.0min: [2024-01-04 04:14:57,451][model8_pretrain.py][INFO] Epoch:[0/2](327700/4588595) loss:2.619 lr:0.0000100 epoch_Time:26982.0min: [2024-01-04 
04:14:57,451][model8_pretrain.py][INFO] Epoch:[0/2](327700/4588595) loss:3.202 lr:0.0000100 epoch_Time:26982.0min: [2024-01-04 04:14:57,451][model8_pretrain.py][INFO] Epoch:[0/2](327700/4588595) loss:2.932 lr:0.0000100 epoch_Time:26982.0min: [2024-01-04 04:14:57,451][model8_pretrain.py][INFO] Epoch:[0/2](327700/4588595) loss:3.467 lr:0.0000100 epoch_Time:26982.0min: [2024-01-04 04:14:57,451][model8_pretrain.py][INFO] Epoch:[0/2](327700/4588595) loss:2.622 lr:0.0000100 epoch_Time:26982.0min: [2024-01-04 04:14:57,451][model8_pretrain.py][INFO] Epoch:[0/2](327700/4588595) loss:2.659 lr:0.0000100 epoch_Time:26982.0min: [2024-01-04 04:15:34,398][model8_pretrain.py][INFO] Epoch:[0/2](327800/4588595) loss:2.980 lr:0.0000100 epoch_Time:26982.0min: [2024-01-04 04:15:34,398][model8_pretrain.py][INFO] Epoch:[0/2](327800/4588595) loss:2.897 lr:0.0000100 epoch_Time:26981.0min: [2024-01-04 04:15:34,398][model8_pretrain.py][INFO] Epoch:[0/2](327800/4588595) loss:2.839 lr:0.0000100 epoch_Time:26981.0min: [2024-01-04 04:15:34,398][model8_pretrain.py][INFO] Epoch:[0/2](327800/4588595) loss:2.911 lr:0.0000100 epoch_Time:26981.0min: [2024-01-04 04:15:34,398][model8_pretrain.py][INFO] Epoch:[0/2](327800/4588595) loss:3.135 lr:0.0000100 epoch_Time:26981.0min: [2024-01-04 04:15:34,398][model8_pretrain.py][INFO] Epoch:[0/2](327800/4588595) loss:2.932 lr:0.0000100 epoch_Time:26981.0min: [2024-01-04 04:15:34,398][model8_pretrain.py][INFO] Epoch:[0/2](327800/4588595) loss:3.151 lr:0.0000100 epoch_Time:26982.0min: [2024-01-04 04:15:34,398][model8_pretrain.py][INFO] Epoch:[0/2](327800/4588595) loss:3.144 lr:0.0000100 epoch_Time:26981.0min: [2024-01-04 04:16:11,350][model8_pretrain.py][INFO] Epoch:[0/2](327900/4588595) loss:3.263 lr:0.0000100 epoch_Time:26980.0min: [2024-01-04 04:16:11,350][model8_pretrain.py][INFO] Epoch:[0/2](327900/4588595) loss:2.349 lr:0.0000100 epoch_Time:26980.0min: [2024-01-04 04:16:11,350][model8_pretrain.py][INFO] Epoch:[0/2](327900/4588595) loss:2.835 lr:0.0000100 
epoch_Time:26980.0min: [2024-01-04 04:16:11,350][model8_pretrain.py][INFO] Epoch:[0/2](327900/4588595) loss:2.684 lr:0.0000100 epoch_Time:26980.0min: [2024-01-04 04:16:11,350][model8_pretrain.py][INFO] Epoch:[0/2](327900/4588595) loss:2.769 lr:0.0000100 epoch_Time:26980.0min: [2024-01-04 04:16:11,350][model8_pretrain.py][INFO] Epoch:[0/2](327900/4588595) loss:2.958 lr:0.0000100 epoch_Time:26980.0min: [2024-01-04 04:16:11,350][model8_pretrain.py][INFO] Epoch:[0/2](327900/4588595) loss:2.877 lr:0.0000100 epoch_Time:26980.0min: [2024-01-04 04:16:11,351][model8_pretrain.py][INFO] Epoch:[0/2](327900/4588595) loss:2.802 lr:0.0000100 epoch_Time:26980.0min: [2024-01-04 04:16:48,292][model8_pretrain.py][INFO] Epoch:[0/2](328000/4588595) loss:2.697 lr:0.0000100 epoch_Time:26979.0min: [2024-01-04 04:16:48,292][model8_pretrain.py][INFO] Epoch:[0/2](328000/4588595) loss:2.450 lr:0.0000100 epoch_Time:26979.0min: [2024-01-04 04:16:48,292][model8_pretrain.py][INFO] Epoch:[0/2](328000/4588595) loss:2.948 lr:0.0000100 epoch_Time:26979.0min: [2024-01-04 04:16:48,292][model8_pretrain.py][INFO] Epoch:[0/2](328000/4588595) loss:3.036 lr:0.0000100 epoch_Time:26979.0min: [2024-01-04 04:16:48,292][model8_pretrain.py][INFO] Epoch:[0/2](328000/4588595) loss:2.807 lr:0.0000100 epoch_Time:26979.0min: [2024-01-04 04:16:48,292][model8_pretrain.py][INFO] Epoch:[0/2](328000/4588595) loss:2.903 lr:0.0000100 epoch_Time:26979.0min: [2024-01-04 04:16:48,292][model8_pretrain.py][INFO] Epoch:[0/2](328000/4588595) loss:3.125 lr:0.0000100 epoch_Time:26979.0min: [2024-01-04 04:16:48,292][model8_pretrain.py][INFO] Epoch:[0/2](328000/4588595) loss:2.863 lr:0.0000100 epoch_Time:26979.0min: [2024-01-04 04:17:25,231][model8_pretrain.py][INFO] Epoch:[0/2](328100/4588595) loss:2.849 lr:0.0000100 epoch_Time:26979.0min: [2024-01-04 04:17:25,231][model8_pretrain.py][INFO] Epoch:[0/2](328100/4588595) loss:3.503 lr:0.0000100 epoch_Time:26979.0min: [2024-01-04 04:17:25,231][model8_pretrain.py][INFO] 
Epoch:[0/2](328100/4588595) loss:2.716 lr:0.0000100 epoch_Time:26979.0min: [2024-01-04 04:17:25,231][model8_pretrain.py][INFO] Epoch:[0/2](328100/4588595) loss:3.430 lr:0.0000100 epoch_Time:26979.0min: [2024-01-04 04:17:25,231][model8_pretrain.py][INFO] Epoch:[0/2](328100/4588595) loss:3.319 lr:0.0000100 epoch_Time:26979.0min: [2024-01-04 04:17:25,231][model8_pretrain.py][INFO] Epoch:[0/2](328100/4588595) loss:2.848 lr:0.0000100 epoch_Time:26979.0min: [2024-01-04 04:17:25,231][model8_pretrain.py][INFO] Epoch:[0/2](328100/4588595) loss:2.984 lr:0.0000100 epoch_Time:26979.0min: [2024-01-04 04:17:25,231][model8_pretrain.py][INFO] Epoch:[0/2](328100/4588595) loss:2.942 lr:0.0000100 epoch_Time:26979.0min: [2024-01-04 04:18:02,168][model8_pretrain.py][INFO] Epoch:[0/2](328200/4588595) loss:2.888 lr:0.0000100 epoch_Time:26978.0min: [2024-01-04 04:18:02,168][model8_pretrain.py][INFO] Epoch:[0/2](328200/4588595) loss:3.167 lr:0.0000100 epoch_Time:26978.0min: [2024-01-04 04:18:02,168][model8_pretrain.py][INFO] Epoch:[0/2](328200/4588595) loss:2.898 lr:0.0000100 epoch_Time:26978.0min: [2024-01-04 04:18:02,168][model8_pretrain.py][INFO] Epoch:[0/2](328200/4588595) loss:2.637 lr:0.0000100 epoch_Time:26978.0min: [2024-01-04 04:18:02,168][model8_pretrain.py][INFO] Epoch:[0/2](328200/4588595) loss:2.634 lr:0.0000100 epoch_Time:26978.0min: [2024-01-04 04:18:02,168][model8_pretrain.py][INFO] Epoch:[0/2](328200/4588595) loss:3.166 lr:0.0000100 epoch_Time:26978.0min: [2024-01-04 04:18:02,169][model8_pretrain.py][INFO] Epoch:[0/2](328200/4588595) loss:3.071 lr:0.0000100 epoch_Time:26978.0min: [2024-01-04 04:18:02,169][model8_pretrain.py][INFO] Epoch:[0/2](328200/4588595) loss:2.835 lr:0.0000100 epoch_Time:26978.0min: [2024-01-04 04:18:39,116][model8_pretrain.py][INFO] Epoch:[0/2](328300/4588595) loss:2.911 lr:0.0000100 epoch_Time:26977.0min: [2024-01-04 04:18:39,116][model8_pretrain.py][INFO] Epoch:[0/2](328300/4588595) loss:2.354 lr:0.0000100 epoch_Time:26977.0min: [2024-01-04 
04:18:39,116][model8_pretrain.py][INFO] Epoch:[0/2](328300/4588595) loss:3.246 lr:0.0000100 epoch_Time:26977.0min: [2024-01-04 04:18:39,116][model8_pretrain.py][INFO] Epoch:[0/2](328300/4588595) loss:2.970 lr:0.0000100 epoch_Time:26977.0min: [2024-01-04 04:18:39,116][model8_pretrain.py][INFO] Epoch:[0/2](328300/4588595) loss:2.875 lr:0.0000100 epoch_Time:26977.0min: [2024-01-04 04:18:39,116][model8_pretrain.py][INFO] Epoch:[0/2](328300/4588595) loss:2.867 lr:0.0000100 epoch_Time:26977.0min: [2024-01-04 04:18:39,116][model8_pretrain.py][INFO] Epoch:[0/2](328300/4588595) loss:3.333 lr:0.0000100 epoch_Time:26977.0min: [2024-01-04 04:18:39,116][model8_pretrain.py][INFO] Epoch:[0/2](328300/4588595) loss:2.966 lr:0.0000100 epoch_Time:26977.0min: [2024-01-04 04:19:24,728][model8_pretrain.py][INFO] Epoch:[0/2](328400/4588595) loss:2.832 lr:0.0000100 epoch_Time:26978.0min: [2024-01-04 04:19:24,728][model8_pretrain.py][INFO] Epoch:[0/2](328400/4588595) loss:3.144 lr:0.0000100 epoch_Time:26978.0min: [2024-01-04 04:19:24,728][model8_pretrain.py][INFO] Epoch:[0/2](328400/4588595) loss:3.169 lr:0.0000100 epoch_Time:26978.0min: [2024-01-04 04:19:24,733][model8_pretrain.py][INFO] Epoch:[0/2](328400/4588595) loss:2.788 lr:0.0000100 epoch_Time:26978.0min: [2024-01-04 04:19:24,733][model8_pretrain.py][INFO] Epoch:[0/2](328400/4588595) loss:2.494 lr:0.0000100 epoch_Time:26978.0min: [2024-01-04 04:19:24,733][model8_pretrain.py][INFO] Epoch:[0/2](328400/4588595) loss:3.040 lr:0.0000100 epoch_Time:26978.0min: [2024-01-04 04:19:24,733][model8_pretrain.py][INFO] Epoch:[0/2](328400/4588595) loss:3.078 lr:0.0000100 epoch_Time:26978.0min: [2024-01-04 04:19:24,733][model8_pretrain.py][INFO] Epoch:[0/2](328400/4588595) loss:3.159 lr:0.0000100 epoch_Time:26978.0min: [2024-01-04 04:20:03,360][model8_pretrain.py][INFO] Epoch:[0/2](328500/4588595) loss:2.869 lr:0.0000100 epoch_Time:26977.0min: [2024-01-04 04:20:03,360][model8_pretrain.py][INFO] Epoch:[0/2](328500/4588595) loss:2.737 lr:0.0000100 
epoch_Time:26977.0min: [2024-01-04 04:20:03,360][model8_pretrain.py][INFO] Epoch:[0/2](328500/4588595) loss:2.313 lr:0.0000100 epoch_Time:26977.0min: [2024-01-04 04:20:03,360][model8_pretrain.py][INFO] Epoch:[0/2](328500/4588595) loss:3.061 lr:0.0000100 epoch_Time:26977.0min: [2024-01-04 04:20:03,360][model8_pretrain.py][INFO] Epoch:[0/2](328500/4588595) loss:3.030 lr:0.0000100 epoch_Time:26977.0min: [2024-01-04 04:20:03,360][model8_pretrain.py][INFO] Epoch:[0/2](328500/4588595) loss:2.907 lr:0.0000100 epoch_Time:26977.0min: [2024-01-04 04:20:03,360][model8_pretrain.py][INFO] Epoch:[0/2](328500/4588595) loss:2.238 lr:0.0000100 epoch_Time:26977.0min: [2024-01-04 04:20:03,360][model8_pretrain.py][INFO] Epoch:[0/2](328500/4588595) loss:2.817 lr:0.0000100 epoch_Time:26977.0min: [2024-01-04 04:20:40,289][model8_pretrain.py][INFO] Epoch:[0/2](328600/4588595) loss:2.683 lr:0.0000100 epoch_Time:26977.0min: [2024-01-04 04:20:40,289][model8_pretrain.py][INFO] Epoch:[0/2](328600/4588595) loss:2.687 lr:0.0000100 epoch_Time:26977.0min: [2024-01-04 04:20:40,289][model8_pretrain.py][INFO] Epoch:[0/2](328600/4588595) loss:3.153 lr:0.0000100 epoch_Time:26977.0min: [2024-01-04 04:20:40,289][model8_pretrain.py][INFO] Epoch:[0/2](328600/4588595) loss:2.848 lr:0.0000100 epoch_Time:26977.0min: [2024-01-04 04:20:40,289][model8_pretrain.py][INFO] Epoch:[0/2](328600/4588595) loss:3.056 lr:0.0000100 epoch_Time:26977.0min: [2024-01-04 04:20:40,289][model8_pretrain.py][INFO] Epoch:[0/2](328600/4588595) loss:3.395 lr:0.0000100 epoch_Time:26977.0min: [2024-01-04 04:20:40,289][model8_pretrain.py][INFO] Epoch:[0/2](328600/4588595) loss:2.666 lr:0.0000100 epoch_Time:26977.0min: [2024-01-04 04:20:40,289][model8_pretrain.py][INFO] Epoch:[0/2](328600/4588595) loss:3.134 lr:0.0000100 epoch_Time:26977.0min: [2024-01-04 04:21:17,228][model8_pretrain.py][INFO] Epoch:[0/2](328700/4588595) loss:2.845 lr:0.0000100 epoch_Time:26976.0min: [2024-01-04 04:21:17,228][model8_pretrain.py][INFO] 
Epoch:[0/2](328700/4588595) loss:2.721 lr:0.0000100 epoch_Time:26976.0min: [2024-01-04 04:21:17,228][model8_pretrain.py][INFO] Epoch:[0/2](328700/4588595) loss:2.978 lr:0.0000100 epoch_Time:26976.0min: [2024-01-04 04:21:17,228][model8_pretrain.py][INFO] Epoch:[0/2](328700/4588595) loss:2.975 lr:0.0000100 epoch_Time:26976.0min: [2024-01-04 04:21:17,228][model8_pretrain.py][INFO] Epoch:[0/2](328700/4588595) loss:3.174 lr:0.0000100 epoch_Time:26976.0min: [2024-01-04 04:21:17,229][model8_pretrain.py][INFO] Epoch:[0/2](328700/4588595) loss:2.868 lr:0.0000100 epoch_Time:26976.0min: [2024-01-04 04:21:17,229][model8_pretrain.py][INFO] Epoch:[0/2](328700/4588595) loss:2.792 lr:0.0000100 epoch_Time:26976.0min: [2024-01-04 04:21:17,229][model8_pretrain.py][INFO] Epoch:[0/2](328700/4588595) loss:3.525 lr:0.0000100 epoch_Time:26976.0min: [2024-01-04 04:21:54,155][model8_pretrain.py][INFO] Epoch:[0/2](328800/4588595) loss:2.954 lr:0.0000100 epoch_Time:26974.0min: [2024-01-04 04:21:54,155][model8_pretrain.py][INFO] Epoch:[0/2](328800/4588595) loss:3.358 lr:0.0000100 epoch_Time:26974.0min: [2024-01-04 04:21:54,155][model8_pretrain.py][INFO] Epoch:[0/2](328800/4588595) loss:2.822 lr:0.0000100 epoch_Time:26974.0min: [2024-01-04 04:21:54,155][model8_pretrain.py][INFO] Epoch:[0/2](328800/4588595) loss:2.877 lr:0.0000100 epoch_Time:26974.0min: [2024-01-04 04:21:54,156][model8_pretrain.py][INFO] Epoch:[0/2](328800/4588595) loss:3.190 lr:0.0000100 epoch_Time:26974.0min: [2024-01-04 04:21:54,156][model8_pretrain.py][INFO] Epoch:[0/2](328800/4588595) loss:2.636 lr:0.0000100 epoch_Time:26974.0min: [2024-01-04 04:21:54,156][model8_pretrain.py][INFO] Epoch:[0/2](328800/4588595) loss:3.115 lr:0.0000100 epoch_Time:26974.0min: [2024-01-04 04:21:54,156][model8_pretrain.py][INFO] Epoch:[0/2](328800/4588595) loss:2.776 lr:0.0000100 epoch_Time:26974.0min: [2024-01-04 04:22:31,089][model8_pretrain.py][INFO] Epoch:[0/2](328900/4588595) loss:2.630 lr:0.0000100 epoch_Time:26974.0min: [2024-01-04 
04:22:31,089][model8_pretrain.py][INFO] Epoch:[0/2](328900/4588595) loss:2.523 lr:0.0000100 epoch_Time:26974.0min: [2024-01-04 04:22:31,089][model8_pretrain.py][INFO] Epoch:[0/2](328900/4588595) loss:2.895 lr:0.0000100 epoch_Time:26974.0min: [2024-01-04 04:22:31,089][model8_pretrain.py][INFO] Epoch:[0/2](328900/4588595) loss:2.658 lr:0.0000100 epoch_Time:26974.0min: [2024-01-04 04:22:31,089][model8_pretrain.py][INFO] Epoch:[0/2](328900/4588595) loss:2.961 lr:0.0000100 epoch_Time:26974.0min: [2024-01-04 04:22:31,089][model8_pretrain.py][INFO] Epoch:[0/2](328900/4588595) loss:2.777 lr:0.0000100 epoch_Time:26974.0min: [2024-01-04 04:22:31,089][model8_pretrain.py][INFO] Epoch:[0/2](328900/4588595) loss:2.907 lr:0.0000100 epoch_Time:26974.0min: [2024-01-04 04:22:31,089][model8_pretrain.py][INFO] Epoch:[0/2](328900/4588595) loss:2.686 lr:0.0000100 epoch_Time:26974.0min: [2024-01-04 04:23:08,026][model8_pretrain.py][INFO] Epoch:[0/2](329000/4588595) loss:3.093 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:23:08,026][model8_pretrain.py][INFO] Epoch:[0/2](329000/4588595) loss:3.152 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:23:08,026][model8_pretrain.py][INFO] Epoch:[0/2](329000/4588595) loss:2.893 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:23:08,026][model8_pretrain.py][INFO] Epoch:[0/2](329000/4588595) loss:3.337 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:23:08,026][model8_pretrain.py][INFO] Epoch:[0/2](329000/4588595) loss:2.716 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:23:08,026][model8_pretrain.py][INFO] Epoch:[0/2](329000/4588595) loss:3.350 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:23:08,026][model8_pretrain.py][INFO] Epoch:[0/2](329000/4588595) loss:2.346 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:23:08,026][model8_pretrain.py][INFO] Epoch:[0/2](329000/4588595) loss:2.864 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:23:44,957][model8_pretrain.py][INFO] Epoch:[0/2](329100/4588595) loss:3.068 lr:0.0000100 
epoch_Time:26973.0min: [2024-01-04 04:23:44,957][model8_pretrain.py][INFO] Epoch:[0/2](329100/4588595) loss:2.984 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:23:44,957][model8_pretrain.py][INFO] Epoch:[0/2](329100/4588595) loss:2.515 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:23:44,957][model8_pretrain.py][INFO] Epoch:[0/2](329100/4588595) loss:2.730 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:23:44,957][model8_pretrain.py][INFO] Epoch:[0/2](329100/4588595) loss:2.727 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:23:44,957][model8_pretrain.py][INFO] Epoch:[0/2](329100/4588595) loss:3.098 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:23:44,957][model8_pretrain.py][INFO] Epoch:[0/2](329100/4588595) loss:3.247 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:23:44,958][model8_pretrain.py][INFO] Epoch:[0/2](329100/4588595) loss:3.059 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:24:27,083][model8_pretrain.py][INFO] Epoch:[0/2](329200/4588595) loss:2.528 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:24:27,083][model8_pretrain.py][INFO] Epoch:[0/2](329200/4588595) loss:2.830 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:24:27,083][model8_pretrain.py][INFO] Epoch:[0/2](329200/4588595) loss:2.998 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:24:27,083][model8_pretrain.py][INFO] Epoch:[0/2](329200/4588595) loss:2.933 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:24:27,083][model8_pretrain.py][INFO] Epoch:[0/2](329200/4588595) loss:2.947 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:24:27,083][model8_pretrain.py][INFO] Epoch:[0/2](329200/4588595) loss:2.821 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:24:27,083][model8_pretrain.py][INFO] Epoch:[0/2](329200/4588595) loss:2.533 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:24:27,083][model8_pretrain.py][INFO] Epoch:[0/2](329200/4588595) loss:2.952 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:25:09,130][model8_pretrain.py][INFO] 
Epoch:[0/2](329300/4588595) loss:2.770 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:25:09,130][model8_pretrain.py][INFO] Epoch:[0/2](329300/4588595) loss:3.211 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:25:09,130][model8_pretrain.py][INFO] Epoch:[0/2](329300/4588595) loss:3.258 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:25:09,130][model8_pretrain.py][INFO] Epoch:[0/2](329300/4588595) loss:2.817 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:25:09,130][model8_pretrain.py][INFO] Epoch:[0/2](329300/4588595) loss:2.876 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:25:09,130][model8_pretrain.py][INFO] Epoch:[0/2](329300/4588595) loss:3.074 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:25:09,130][model8_pretrain.py][INFO] Epoch:[0/2](329300/4588595) loss:2.732 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:25:09,130][model8_pretrain.py][INFO] Epoch:[0/2](329300/4588595) loss:3.285 lr:0.0000100 epoch_Time:26973.0min: [2024-01-04 04:25:46,086][model8_pretrain.py][INFO] Epoch:[0/2](329400/4588595) loss:2.825 lr:0.0000100 epoch_Time:26972.0min: [2024-01-04 04:25:46,086][model8_pretrain.py][INFO] Epoch:[0/2](329400/4588595) loss:2.710 lr:0.0000100 epoch_Time:26972.0min: [2024-01-04 04:25:46,086][model8_pretrain.py][INFO] Epoch:[0/2](329400/4588595) loss:2.903 lr:0.0000100 epoch_Time:26972.0min: [2024-01-04 04:25:46,086][model8_pretrain.py][INFO] Epoch:[0/2](329400/4588595) loss:2.949 lr:0.0000100 epoch_Time:26972.0min: [2024-01-04 04:25:46,086][model8_pretrain.py][INFO] Epoch:[0/2](329400/4588595) loss:2.654 lr:0.0000100 epoch_Time:26972.0min: [2024-01-04 04:25:46,086][model8_pretrain.py][INFO] Epoch:[0/2](329400/4588595) loss:3.011 lr:0.0000100 epoch_Time:26972.0min: [2024-01-04 04:25:46,086][model8_pretrain.py][INFO] Epoch:[0/2](329400/4588595) loss:3.197 lr:0.0000100 epoch_Time:26972.0min: [2024-01-04 04:25:46,086][model8_pretrain.py][INFO] Epoch:[0/2](329400/4588595) loss:3.116 lr:0.0000100 epoch_Time:26972.0min: [2024-01-04 
04:26:23,032][model8_pretrain.py][INFO] Epoch:[0/2](329500/4588595) loss:3.067 lr:0.0000100 epoch_Time:26971.0min: [2024-01-04 04:26:23,032][model8_pretrain.py][INFO] Epoch:[0/2](329500/4588595) loss:3.030 lr:0.0000100 epoch_Time:26971.0min: [2024-01-04 04:26:23,032][model8_pretrain.py][INFO] Epoch:[0/2](329500/4588595) loss:2.511 lr:0.0000100 epoch_Time:26971.0min: [2024-01-04 04:26:23,032][model8_pretrain.py][INFO] Epoch:[0/2](329500/4588595) loss:2.572 lr:0.0000100 epoch_Time:26971.0min: [2024-01-04 04:26:23,032][model8_pretrain.py][INFO] Epoch:[0/2](329500/4588595) loss:2.288 lr:0.0000100 epoch_Time:26971.0min: [2024-01-04 04:26:23,032][model8_pretrain.py][INFO] Epoch:[0/2](329500/4588595) loss:3.156 lr:0.0000100 epoch_Time:26971.0min: [2024-01-04 04:26:23,033][model8_pretrain.py][INFO] Epoch:[0/2](329500/4588595) loss:2.451 lr:0.0000100 epoch_Time:26971.0min: [2024-01-04 04:26:23,033][model8_pretrain.py][INFO] Epoch:[0/2](329500/4588595) loss:3.368 lr:0.0000100 epoch_Time:26971.0min: [2024-01-04 04:26:59,978][model8_pretrain.py][INFO] Epoch:[0/2](329600/4588595) loss:2.305 lr:0.0000100 epoch_Time:26970.0min: [2024-01-04 04:26:59,978][model8_pretrain.py][INFO] Epoch:[0/2](329600/4588595) loss:3.117 lr:0.0000100 epoch_Time:26970.0min: [2024-01-04 04:26:59,978][model8_pretrain.py][INFO] Epoch:[0/2](329600/4588595) loss:1.900 lr:0.0000100 epoch_Time:26970.0min: [2024-01-04 04:26:59,978][model8_pretrain.py][INFO] Epoch:[0/2](329600/4588595) loss:2.534 lr:0.0000100 epoch_Time:26970.0min: [2024-01-04 04:26:59,978][model8_pretrain.py][INFO] Epoch:[0/2](329600/4588595) loss:2.722 lr:0.0000100 epoch_Time:26970.0min: [2024-01-04 04:26:59,978][model8_pretrain.py][INFO] Epoch:[0/2](329600/4588595) loss:2.789 lr:0.0000100 epoch_Time:26970.0min: [2024-01-04 04:26:59,978][model8_pretrain.py][INFO] Epoch:[0/2](329600/4588595) loss:2.897 lr:0.0000100 epoch_Time:26970.0min: [2024-01-04 04:26:59,978][model8_pretrain.py][INFO] Epoch:[0/2](329600/4588595) loss:2.939 lr:0.0000100 
epoch_Time:26970.0min: [2024-01-04 04:27:36,937][model8_pretrain.py][INFO] Epoch:[0/2](329700/4588595) loss:2.811 lr:0.0000100 epoch_Time:26970.0min: [2024-01-04 04:27:36,937][model8_pretrain.py][INFO] Epoch:[0/2](329700/4588595) loss:2.706 lr:0.0000100 epoch_Time:26970.0min: [2024-01-04 04:27:36,937][model8_pretrain.py][INFO] Epoch:[0/2](329700/4588595) loss:3.030 lr:0.0000100 epoch_Time:26970.0min: [2024-01-04 04:27:36,937][model8_pretrain.py][INFO] Epoch:[0/2](329700/4588595) loss:3.079 lr:0.0000100 epoch_Time:26970.0min: [2024-01-04 04:27:36,937][model8_pretrain.py][INFO] Epoch:[0/2](329700/4588595) loss:2.831 lr:0.0000100 epoch_Time:26970.0min: [2024-01-04 04:27:36,937][model8_pretrain.py][INFO] Epoch:[0/2](329700/4588595) loss:2.494 lr:0.0000100 epoch_Time:26970.0min: [2024-01-04 04:27:36,938][model8_pretrain.py][INFO] Epoch:[0/2](329700/4588595) loss:2.957 lr:0.0000100 epoch_Time:26970.0min: [2024-01-04 04:27:36,938][model8_pretrain.py][INFO] Epoch:[0/2](329700/4588595) loss:3.063 lr:0.0000100 epoch_Time:26970.0min: [2024-01-04 04:28:13,884][model8_pretrain.py][INFO] Epoch:[0/2](329800/4588595) loss:3.023 lr:0.0000100 epoch_Time:26968.0min: [2024-01-04 04:28:13,884][model8_pretrain.py][INFO] Epoch:[0/2](329800/4588595) loss:3.055 lr:0.0000100 epoch_Time:26968.0min: [2024-01-04 04:28:13,884][model8_pretrain.py][INFO] Epoch:[0/2](329800/4588595) loss:3.251 lr:0.0000100 epoch_Time:26968.0min: [2024-01-04 04:28:13,884][model8_pretrain.py][INFO] Epoch:[0/2](329800/4588595) loss:3.051 lr:0.0000100 epoch_Time:26968.0min: [2024-01-04 04:28:13,884][model8_pretrain.py][INFO] Epoch:[0/2](329800/4588595) loss:3.196 lr:0.0000100 epoch_Time:26968.0min: [2024-01-04 04:28:13,884][model8_pretrain.py][INFO] Epoch:[0/2](329800/4588595) loss:3.057 lr:0.0000100 epoch_Time:26968.0min: [2024-01-04 04:28:13,884][model8_pretrain.py][INFO] Epoch:[0/2](329800/4588595) loss:2.012 lr:0.0000100 epoch_Time:26968.0min: [2024-01-04 04:28:13,885][model8_pretrain.py][INFO] 
Epoch:[0/2](329800/4588595) loss:2.966 lr:0.0000100 epoch_Time:26968.0min: [2024-01-04 04:28:50,826][model8_pretrain.py][INFO] Epoch:[0/2](329900/4588595) loss:3.148 lr:0.0000100 epoch_Time:26967.0min: [2024-01-04 04:28:50,826][model8_pretrain.py][INFO] Epoch:[0/2](329900/4588595) loss:2.896 lr:0.0000100 epoch_Time:26967.0min: [2024-01-04 04:28:50,826][model8_pretrain.py][INFO] Epoch:[0/2](329900/4588595) loss:3.230 lr:0.0000100 epoch_Time:26967.0min: [2024-01-04 04:28:50,826][model8_pretrain.py][INFO] Epoch:[0/2](329900/4588595) loss:2.725 lr:0.0000100 epoch_Time:26967.0min: [2024-01-04 04:28:50,826][model8_pretrain.py][INFO] Epoch:[0/2](329900/4588595) loss:2.468 lr:0.0000100 epoch_Time:26967.0min: [2024-01-04 04:28:50,826][model8_pretrain.py][INFO] Epoch:[0/2](329900/4588595) loss:3.000 lr:0.0000100 epoch_Time:26967.0min: [2024-01-04 04:28:50,826][model8_pretrain.py][INFO] Epoch:[0/2](329900/4588595) loss:3.129 lr:0.0000100 epoch_Time:26967.0min: [2024-01-04 04:28:50,826][model8_pretrain.py][INFO] Epoch:[0/2](329900/4588595) loss:2.981 lr:0.0000100 epoch_Time:26967.0min: [2024-01-04 04:29:32,986][model8_pretrain.py][INFO] Epoch:[0/2](330000/4588595) loss:2.975 lr:0.0000100 epoch_Time:26968.0min: [2024-01-04 04:29:32,986][model8_pretrain.py][INFO] Epoch:[0/2](330000/4588595) loss:2.217 lr:0.0000100 epoch_Time:26968.0min: [2024-01-04 04:29:32,986][model8_pretrain.py][INFO] Epoch:[0/2](330000/4588595) loss:2.624 lr:0.0000100 epoch_Time:26968.0min: [2024-01-04 04:29:32,986][model8_pretrain.py][INFO] Epoch:[0/2](330000/4588595) loss:2.742 lr:0.0000100 epoch_Time:26968.0min: [2024-01-04 04:29:32,986][model8_pretrain.py][INFO] Epoch:[0/2](330000/4588595) loss:3.199 lr:0.0000100 epoch_Time:26968.0min: [2024-01-04 04:29:32,987][model8_pretrain.py][INFO] Epoch:[0/2](330000/4588595) loss:3.056 lr:0.0000100 epoch_Time:26968.0min: [2024-01-04 04:29:32,987][model8_pretrain.py][INFO] Epoch:[0/2](330000/4588595) loss:2.398 lr:0.0000100 epoch_Time:26968.0min: [2024-01-04 
04:29:32,987][model8_pretrain.py][INFO] Epoch:[0/2](330000/4588595) loss:3.110 lr:0.0000100 epoch_Time:26968.0min: [2024-01-04 04:30:17,915][model8_pretrain.py][INFO] Epoch:[0/2](330100/4588595) loss:2.387 lr:0.0000100 epoch_Time:26969.0min: [2024-01-04 04:30:17,915][model8_pretrain.py][INFO] Epoch:[0/2](330100/4588595) loss:3.156 lr:0.0000100 epoch_Time:26969.0min: [2024-01-04 04:30:17,915][model8_pretrain.py][INFO] Epoch:[0/2](330100/4588595) loss:2.906 lr:0.0000100 epoch_Time:26969.0min: [2024-01-04 04:30:17,915][model8_pretrain.py][INFO] Epoch:[0/2](330100/4588595) loss:3.196 lr:0.0000100 epoch_Time:26969.0min: [2024-01-04 04:30:17,915][model8_pretrain.py][INFO] Epoch:[0/2](330100/4588595) loss:3.139 lr:0.0000100 epoch_Time:26969.0min: [2024-01-04 04:30:17,915][model8_pretrain.py][INFO] Epoch:[0/2](330100/4588595) loss:3.100 lr:0.0000100 epoch_Time:26969.0min: [2024-01-04 04:30:17,915][model8_pretrain.py][INFO] Epoch:[0/2](330100/4588595) loss:2.704 lr:0.0000100 epoch_Time:26969.0min: [2024-01-04 04:30:17,915][model8_pretrain.py][INFO] Epoch:[0/2](330100/4588595) loss:2.902 lr:0.0000100 epoch_Time:26969.0min: [2024-01-04 04:30:54,844][model8_pretrain.py][INFO] Epoch:[0/2](330200/4588595) loss:3.065 lr:0.0000100 epoch_Time:26967.0min: [2024-01-04 04:30:54,844][model8_pretrain.py][INFO] Epoch:[0/2](330200/4588595) loss:2.544 lr:0.0000100 epoch_Time:26967.0min: [2024-01-04 04:30:54,844][model8_pretrain.py][INFO] Epoch:[0/2](330200/4588595) loss:3.055 lr:0.0000100 epoch_Time:26967.0min: [2024-01-04 04:30:54,844][model8_pretrain.py][INFO] Epoch:[0/2](330200/4588595) loss:3.176 lr:0.0000100 epoch_Time:26967.0min: [2024-01-04 04:30:54,844][model8_pretrain.py][INFO] Epoch:[0/2](330200/4588595) loss:2.857 lr:0.0000100 epoch_Time:26967.0min: [2024-01-04 04:30:54,844][model8_pretrain.py][INFO] Epoch:[0/2](330200/4588595) loss:2.659 lr:0.0000100 epoch_Time:26967.0min: [2024-01-04 04:30:54,844][model8_pretrain.py][INFO] Epoch:[0/2](330200/4588595) loss:2.912 lr:0.0000100 
epoch_Time:26967.0min: [2024-01-04 04:30:54,845][model8_pretrain.py][INFO] Epoch:[0/2](330200/4588595) loss:2.615 lr:0.0000100 epoch_Time:26967.0min: [2024-01-04 04:31:31,756][model8_pretrain.py][INFO] Epoch:[0/2](330300/4588595) loss:2.982 lr:0.0000100 epoch_Time:26967.0min: [2024-01-04 04:31:31,756][model8_pretrain.py][INFO] Epoch:[0/2](330300/4588595) loss:2.637 lr:0.0000100 epoch_Time:26967.0min: [2024-01-04 04:31:31,756][model8_pretrain.py][INFO] Epoch:[0/2](330300/4588595) loss:2.799 lr:0.0000100 epoch_Time:26967.0min: [2024-01-04 04:31:31,756][model8_pretrain.py][INFO] Epoch:[0/2](330300/4588595) loss:2.969 lr:0.0000100 epoch_Time:26967.0min: [2024-01-04 04:31:31,756][model8_pretrain.py][INFO] Epoch:[0/2](330300/4588595) loss:2.913 lr:0.0000100 epoch_Time:26967.0min: [2024-01-04 04:31:31,756][model8_pretrain.py][INFO] Epoch:[0/2](330300/4588595) loss:2.490 lr:0.0000100 epoch_Time:26967.0min: [2024-01-04 04:31:31,756][model8_pretrain.py][INFO] Epoch:[0/2](330300/4588595) loss:3.011 lr:0.0000100 epoch_Time:26967.0min: [2024-01-04 04:31:31,756][model8_pretrain.py][INFO] Epoch:[0/2](330300/4588595) loss:2.770 lr:0.0000100 epoch_Time:26967.0min: [2024-01-04 04:32:08,702][model8_pretrain.py][INFO] Epoch:[0/2](330400/4588595) loss:2.744 lr:0.0000100 epoch_Time:26966.0min: [2024-01-04 04:32:08,702][model8_pretrain.py][INFO] Epoch:[0/2](330400/4588595) loss:2.751 lr:0.0000100 epoch_Time:26966.0min: [2024-01-04 04:32:08,702][model8_pretrain.py][INFO] Epoch:[0/2](330400/4588595) loss:3.398 lr:0.0000100 epoch_Time:26966.0min: [2024-01-04 04:32:08,702][model8_pretrain.py][INFO] Epoch:[0/2](330400/4588595) loss:2.661 lr:0.0000100 epoch_Time:26966.0min: [2024-01-04 04:32:08,702][model8_pretrain.py][INFO] Epoch:[0/2](330400/4588595) loss:2.537 lr:0.0000100 epoch_Time:26966.0min: [2024-01-04 04:32:08,702][model8_pretrain.py][INFO] Epoch:[0/2](330400/4588595) loss:3.081 lr:0.0000100 epoch_Time:26966.0min: [2024-01-04 04:32:08,702][model8_pretrain.py][INFO] 
Epoch:[0/2](330400/4588595) loss:2.245 lr:0.0000100 epoch_Time:26966.0min: [2024-01-04 04:32:08,702][model8_pretrain.py][INFO] Epoch:[0/2](330400/4588595) loss:3.165 lr:0.0000100 epoch_Time:26966.0min: [2024-01-04 04:32:45,627][model8_pretrain.py][INFO] Epoch:[0/2](330500/4588595) loss:3.284 lr:0.0000100 epoch_Time:26966.0min: [2024-01-04 04:32:45,627][model8_pretrain.py][INFO] Epoch:[0/2](330500/4588595) loss:3.331 lr:0.0000100 epoch_Time:26966.0min: [2024-01-04 04:32:45,627][model8_pretrain.py][INFO] Epoch:[0/2](330500/4588595) loss:2.830 lr:0.0000100 epoch_Time:26966.0min: [2024-01-04 04:32:45,627][model8_pretrain.py][INFO] Epoch:[0/2](330500/4588595) loss:2.464 lr:0.0000100 epoch_Time:26966.0min: [2024-01-04 04:32:45,627][model8_pretrain.py][INFO] Epoch:[0/2](330500/4588595) loss:3.155 lr:0.0000100 epoch_Time:26966.0min: [2024-01-04 04:32:45,627][model8_pretrain.py][INFO] Epoch:[0/2](330500/4588595) loss:3.261 lr:0.0000100 epoch_Time:26966.0min: [2024-01-04 04:32:45,627][model8_pretrain.py][INFO] Epoch:[0/2](330500/4588595) loss:2.202 lr:0.0000100 epoch_Time:26966.0min: [2024-01-04 04:32:45,628][model8_pretrain.py][INFO] Epoch:[0/2](330500/4588595) loss:2.679 lr:0.0000100 epoch_Time:26966.0min: [2024-01-04 04:33:22,552][model8_pretrain.py][INFO] Epoch:[0/2](330600/4588595) loss:2.517 lr:0.0000100 epoch_Time:26964.0min: [2024-01-04 04:33:22,552][model8_pretrain.py][INFO] Epoch:[0/2](330600/4588595) loss:2.932 lr:0.0000100 epoch_Time:26964.0min: [2024-01-04 04:33:22,552][model8_pretrain.py][INFO] Epoch:[0/2](330600/4588595) loss:3.373 lr:0.0000100 epoch_Time:26964.0min: [2024-01-04 04:33:22,552][model8_pretrain.py][INFO] Epoch:[0/2](330600/4588595) loss:3.089 lr:0.0000100 epoch_Time:26964.0min: [2024-01-04 04:33:22,552][model8_pretrain.py][INFO] Epoch:[0/2](330600/4588595) loss:2.384 lr:0.0000100 epoch_Time:26964.0min: [2024-01-04 04:33:22,552][model8_pretrain.py][INFO] Epoch:[0/2](330600/4588595) loss:3.389 lr:0.0000100 epoch_Time:26965.0min: [2024-01-04 
04:33:22,552][model8_pretrain.py][INFO] Epoch:[0/2](330600/4588595) loss:3.261 lr:0.0000100 epoch_Time:26964.0min: [2024-01-04 04:33:22,552][model8_pretrain.py][INFO] Epoch:[0/2](330600/4588595) loss:2.906 lr:0.0000100 epoch_Time:26964.0min: [2024-01-04 04:33:59,515][model8_pretrain.py][INFO] Epoch:[0/2](330700/4588595) loss:2.850 lr:0.0000100 epoch_Time:26963.0min: [2024-01-04 04:33:59,515][model8_pretrain.py][INFO] Epoch:[0/2](330700/4588595) loss:2.912 lr:0.0000100 epoch_Time:26963.0min: [2024-01-04 04:33:59,515][model8_pretrain.py][INFO] Epoch:[0/2](330700/4588595) loss:2.726 lr:0.0000100 epoch_Time:26963.0min: [2024-01-04 04:33:59,515][model8_pretrain.py][INFO] Epoch:[0/2](330700/4588595) loss:3.274 lr:0.0000100 epoch_Time:26963.0min: [2024-01-04 04:33:59,515][model8_pretrain.py][INFO] Epoch:[0/2](330700/4588595) loss:2.769 lr:0.0000100 epoch_Time:26963.0min: [2024-01-04 04:33:59,515][model8_pretrain.py][INFO] Epoch:[0/2](330700/4588595) loss:3.300 lr:0.0000100 epoch_Time:26963.0min: [2024-01-04 04:33:59,515][model8_pretrain.py][INFO] Epoch:[0/2](330700/4588595) loss:3.032 lr:0.0000100 epoch_Time:26963.0min: [2024-01-04 04:33:59,515][model8_pretrain.py][INFO] Epoch:[0/2](330700/4588595) loss:2.547 lr:0.0000100 epoch_Time:26963.0min: [2024-01-04 04:34:41,494][model8_pretrain.py][INFO] Epoch:[0/2](330800/4588595) loss:2.962 lr:0.0000100 epoch_Time:26964.0min: [2024-01-04 04:34:41,494][model8_pretrain.py][INFO] Epoch:[0/2](330800/4588595) loss:2.928 lr:0.0000100 epoch_Time:26964.0min: [2024-01-04 04:34:41,495][model8_pretrain.py][INFO] Epoch:[0/2](330800/4588595) loss:3.143 lr:0.0000100 epoch_Time:26964.0min: [2024-01-04 04:34:41,496][model8_pretrain.py][INFO] Epoch:[0/2](330800/4588595) loss:3.343 lr:0.0000100 epoch_Time:26964.0min: [2024-01-04 04:34:41,499][model8_pretrain.py][INFO] Epoch:[0/2](330800/4588595) loss:3.179 lr:0.0000100 epoch_Time:26964.0min: [2024-01-04 04:34:41,499][model8_pretrain.py][INFO] Epoch:[0/2](330800/4588595) loss:3.259 lr:0.0000100 
epoch_Time:26964.0min: [2024-01-04 04:34:41,499][model8_pretrain.py][INFO] Epoch:[0/2](330800/4588595) loss:2.371 lr:0.0000100 epoch_Time:26964.0min: [2024-01-04 04:34:41,499][model8_pretrain.py][INFO] Epoch:[0/2](330800/4588595) loss:3.279 lr:0.0000100 epoch_Time:26964.0min: [2024-01-04 04:35:23,657][model8_pretrain.py][INFO] Epoch:[0/2](330900/4588595) loss:2.755 lr:0.0000100 epoch_Time:26964.0min: [2024-01-04 04:35:23,657][model8_pretrain.py][INFO] Epoch:[0/2](330900/4588595) loss:2.730 lr:0.0000100 epoch_Time:26964.0min: [2024-01-04 04:35:23,657][model8_pretrain.py][INFO] Epoch:[0/2](330900/4588595) loss:2.797 lr:0.0000100 epoch_Time:26964.0min: [2024-01-04 04:35:23,657][model8_pretrain.py][INFO] Epoch:[0/2](330900/4588595) loss:2.558 lr:0.0000100 epoch_Time:26964.0min: [2024-01-04 04:35:23,657][model8_pretrain.py][INFO] Epoch:[0/2](330900/4588595) loss:2.882 lr:0.0000100 epoch_Time:26964.0min: [2024-01-04 04:35:23,657][model8_pretrain.py][INFO] Epoch:[0/2](330900/4588595) loss:2.589 lr:0.0000100 epoch_Time:26964.0min: [2024-01-04 04:35:23,657][model8_pretrain.py][INFO] Epoch:[0/2](330900/4588595) loss:2.141 lr:0.0000100 epoch_Time:26964.0min: [2024-01-04 04:35:23,657][model8_pretrain.py][INFO] Epoch:[0/2](330900/4588595) loss:2.750 lr:0.0000100 epoch_Time:26964.0min: [2024-01-04 04:36:00,593][model8_pretrain.py][INFO] Epoch:[0/2](331000/4588595) loss:2.888 lr:0.0000100 epoch_Time:26963.0min: [2024-01-04 04:36:00,593][model8_pretrain.py][INFO] Epoch:[0/2](331000/4588595) loss:2.801 lr:0.0000100 epoch_Time:26963.0min: [2024-01-04 04:36:00,593][model8_pretrain.py][INFO] Epoch:[0/2](331000/4588595) loss:3.015 lr:0.0000100 epoch_Time:26963.0min: [2024-01-04 04:36:00,593][model8_pretrain.py][INFO] Epoch:[0/2](331000/4588595) loss:2.977 lr:0.0000100 epoch_Time:26963.0min: [2024-01-04 04:36:00,593][model8_pretrain.py][INFO] Epoch:[0/2](331000/4588595) loss:2.779 lr:0.0000100 epoch_Time:26963.0min: [2024-01-04 04:36:00,593][model8_pretrain.py][INFO] 
Epoch:[0/2](331000/4588595) loss:3.129 lr:0.0000100 epoch_Time:26963.0min: [2024-01-04 04:36:00,593][model8_pretrain.py][INFO] Epoch:[0/2](331000/4588595) loss:3.161 lr:0.0000100 epoch_Time:26963.0min: [2024-01-04 04:36:00,593][model8_pretrain.py][INFO] Epoch:[0/2](331000/4588595) loss:3.285 lr:0.0000100 epoch_Time:26963.0min: [2024-01-04 04:36:37,542][model8_pretrain.py][INFO] Epoch:[0/2](331100/4588595) loss:2.985 lr:0.0000100 epoch_Time:26963.0min: [2024-01-04 04:36:37,542][model8_pretrain.py][INFO] Epoch:[0/2](331100/4588595) loss:2.450 lr:0.0000100 epoch_Time:26963.0min: [2024-01-04 04:36:37,542][model8_pretrain.py][INFO] Epoch:[0/2](331100/4588595) loss:3.052 lr:0.0000100 epoch_Time:26963.0min: [2024-01-04 04:36:37,542][model8_pretrain.py][INFO] Epoch:[0/2](331100/4588595) loss:2.908 lr:0.0000100 epoch_Time:26963.0min: [2024-01-04 04:36:37,542][model8_pretrain.py][INFO] Epoch:[0/2](331100/4588595) loss:2.881 lr:0.0000100 epoch_Time:26963.0min: [2024-01-04 04:36:37,542][model8_pretrain.py][INFO] Epoch:[0/2](331100/4588595) loss:2.581 lr:0.0000100 epoch_Time:26963.0min: [2024-01-04 04:36:37,542][model8_pretrain.py][INFO] Epoch:[0/2](331100/4588595) loss:2.902 lr:0.0000100 epoch_Time:26963.0min: [2024-01-04 04:36:37,542][model8_pretrain.py][INFO] Epoch:[0/2](331100/4588595) loss:2.432 lr:0.0000100 epoch_Time:26963.0min: [2024-01-04 04:37:14,468][model8_pretrain.py][INFO] Epoch:[0/2](331200/4588595) loss:2.578 lr:0.0000100 epoch_Time:26961.0min: [2024-01-04 04:37:14,468][model8_pretrain.py][INFO] Epoch:[0/2](331200/4588595) loss:2.927 lr:0.0000100 epoch_Time:26961.0min: [2024-01-04 04:37:14,468][model8_pretrain.py][INFO] Epoch:[0/2](331200/4588595) loss:3.088 lr:0.0000100 epoch_Time:26961.0min: [2024-01-04 04:37:14,468][model8_pretrain.py][INFO] Epoch:[0/2](331200/4588595) loss:2.332 lr:0.0000100 epoch_Time:26961.0min: [2024-01-04 04:37:14,468][model8_pretrain.py][INFO] Epoch:[0/2](331200/4588595) loss:3.203 lr:0.0000100 epoch_Time:26961.0min: [2024-01-04 
04:37:14,468][model8_pretrain.py][INFO] Epoch:[0/2](331200/4588595) loss:2.895 lr:0.0000100 epoch_Time:26961.0min: [2024-01-04 04:37:14,468][model8_pretrain.py][INFO] Epoch:[0/2](331200/4588595) loss:3.327 lr:0.0000100 epoch_Time:26961.0min: [2024-01-04 04:37:14,468][model8_pretrain.py][INFO] Epoch:[0/2](331200/4588595) loss:2.996 lr:0.0000100 epoch_Time:26961.0min: [2024-01-04 04:37:51,391][model8_pretrain.py][INFO] Epoch:[0/2](331300/4588595) loss:3.181 lr:0.0000100 epoch_Time:26960.0min: [2024-01-04 04:37:51,391][model8_pretrain.py][INFO] Epoch:[0/2](331300/4588595) loss:2.895 lr:0.0000100 epoch_Time:26960.0min: [2024-01-04 04:37:51,391][model8_pretrain.py][INFO] Epoch:[0/2](331300/4588595) loss:2.902 lr:0.0000100 epoch_Time:26960.0min: [2024-01-04 04:37:51,392][model8_pretrain.py][INFO] Epoch:[0/2](331300/4588595) loss:2.911 lr:0.0000100 epoch_Time:26960.0min: [2024-01-04 04:37:51,392][model8_pretrain.py][INFO] Epoch:[0/2](331300/4588595) loss:3.191 lr:0.0000100 epoch_Time:26960.0min: [2024-01-04 04:37:51,392][model8_pretrain.py][INFO] Epoch:[0/2](331300/4588595) loss:3.072 lr:0.0000100 epoch_Time:26960.0min: [2024-01-04 04:37:51,392][model8_pretrain.py][INFO] Epoch:[0/2](331300/4588595) loss:2.934 lr:0.0000100 epoch_Time:26960.0min: [2024-01-04 04:37:51,392][model8_pretrain.py][INFO] Epoch:[0/2](331300/4588595) loss:2.617 lr:0.0000100 epoch_Time:26960.0min: [2024-01-04 04:38:28,363][model8_pretrain.py][INFO] Epoch:[0/2](331400/4588595) loss:3.229 lr:0.0000100 epoch_Time:26960.0min: [2024-01-04 04:38:28,364][model8_pretrain.py][INFO] Epoch:[0/2](331400/4588595) loss:3.254 lr:0.0000100 epoch_Time:26960.0min: [2024-01-04 04:38:28,364][model8_pretrain.py][INFO] Epoch:[0/2](331400/4588595) loss:3.017 lr:0.0000100 epoch_Time:26960.0min: [2024-01-04 04:38:28,364][model8_pretrain.py][INFO] Epoch:[0/2](331400/4588595) loss:3.124 lr:0.0000100 epoch_Time:26960.0min: [2024-01-04 04:38:28,364][model8_pretrain.py][INFO] Epoch:[0/2](331400/4588595) loss:2.914 lr:0.0000100 
epoch_Time:26960.0min: [2024-01-04 04:38:28,364][model8_pretrain.py][INFO] Epoch:[0/2](331400/4588595) loss:2.650 lr:0.0000100 epoch_Time:26960.0min: [2024-01-04 04:38:28,364][model8_pretrain.py][INFO] Epoch:[0/2](331400/4588595) loss:2.659 lr:0.0000100 epoch_Time:26960.0min: [2024-01-04 04:38:28,364][model8_pretrain.py][INFO] Epoch:[0/2](331400/4588595) loss:2.890 lr:0.0000100 epoch_Time:26960.0min: [2024-01-04 04:39:05,304][model8_pretrain.py][INFO] Epoch:[0/2](331500/4588595) loss:2.646 lr:0.0000100 epoch_Time:26959.0min: [2024-01-04 04:39:05,304][model8_pretrain.py][INFO] Epoch:[0/2](331500/4588595) loss:3.172 lr:0.0000100 epoch_Time:26959.0min: [2024-01-04 04:39:05,304][model8_pretrain.py][INFO] Epoch:[0/2](331500/4588595) loss:3.109 lr:0.0000100 epoch_Time:26959.0min: [2024-01-04 04:39:05,304][model8_pretrain.py][INFO] Epoch:[0/2](331500/4588595) loss:2.633 lr:0.0000100 epoch_Time:26959.0min: [2024-01-04 04:39:05,304][model8_pretrain.py][INFO] Epoch:[0/2](331500/4588595) loss:2.642 lr:0.0000100 epoch_Time:26959.0min: [2024-01-04 04:39:05,304][model8_pretrain.py][INFO] Epoch:[0/2](331500/4588595) loss:2.787 lr:0.0000100 epoch_Time:26959.0min: [2024-01-04 04:39:05,304][model8_pretrain.py][INFO] Epoch:[0/2](331500/4588595) loss:3.275 lr:0.0000100 epoch_Time:26959.0min: [2024-01-04 04:39:05,304][model8_pretrain.py][INFO] Epoch:[0/2](331500/4588595) loss:2.524 lr:0.0000100 epoch_Time:26959.0min: [2024-01-04 04:39:43,980][model8_pretrain.py][INFO] Epoch:[0/2](331600/4588595) loss:3.519 lr:0.0000100 epoch_Time:26959.0min: [2024-01-04 04:39:43,980][model8_pretrain.py][INFO] Epoch:[0/2](331600/4588595) loss:3.251 lr:0.0000100 epoch_Time:26959.0min: [2024-01-04 04:39:43,980][model8_pretrain.py][INFO] Epoch:[0/2](331600/4588595) loss:2.860 lr:0.0000100 epoch_Time:26959.0min: [2024-01-04 04:39:43,980][model8_pretrain.py][INFO] Epoch:[0/2](331600/4588595) loss:3.381 lr:0.0000100 epoch_Time:26959.0min: [2024-01-04 04:39:43,980][model8_pretrain.py][INFO] 
Epoch:[0/2](331600/4588595) loss:2.838 lr:0.0000100 epoch_Time:26959.0min: [2024-01-04 04:39:43,980][model8_pretrain.py][INFO] Epoch:[0/2](331600/4588595) loss:2.742 lr:0.0000100 epoch_Time:26959.0min: [2024-01-04 04:39:43,981][model8_pretrain.py][INFO] Epoch:[0/2](331600/4588595) loss:3.033 lr:0.0000100 epoch_Time:26959.0min: [2024-01-04 04:39:43,981][model8_pretrain.py][INFO] Epoch:[0/2](331600/4588595) loss:2.977 lr:0.0000100 epoch_Time:26959.0min: [2024-01-04 04:40:31,154][model8_pretrain.py][INFO] Epoch:[0/2](331700/4588595) loss:3.215 lr:0.0000100 epoch_Time:26960.0min: [2024-01-04 04:40:31,154][model8_pretrain.py][INFO] Epoch:[0/2](331700/4588595) loss:3.009 lr:0.0000100 epoch_Time:26960.0min: [2024-01-04 04:40:31,154][model8_pretrain.py][INFO] Epoch:[0/2](331700/4588595) loss:2.587 lr:0.0000100 epoch_Time:26960.0min: [2024-01-04 04:40:31,154][model8_pretrain.py][INFO] Epoch:[0/2](331700/4588595) loss:2.949 lr:0.0000100 epoch_Time:26960.0min: [2024-01-04 04:40:31,154][model8_pretrain.py][INFO] Epoch:[0/2](331700/4588595) loss:3.629 lr:0.0000100 epoch_Time:26960.0min: [2024-01-04 04:40:31,154][model8_pretrain.py][INFO] Epoch:[0/2](331700/4588595) loss:2.829 lr:0.0000100 epoch_Time:26960.0min: [2024-01-04 04:40:31,154][model8_pretrain.py][INFO] Epoch:[0/2](331700/4588595) loss:3.076 lr:0.0000100 epoch_Time:26960.0min: [2024-01-04 04:40:31,155][model8_pretrain.py][INFO] Epoch:[0/2](331700/4588595) loss:2.962 lr:0.0000100 epoch_Time:26960.0min: [2024-01-04 04:41:08,097][model8_pretrain.py][INFO] Epoch:[0/2](331800/4588595) loss:3.012 lr:0.0000100 epoch_Time:26959.0min: [2024-01-04 04:41:08,097][model8_pretrain.py][INFO] Epoch:[0/2](331800/4588595) loss:2.829 lr:0.0000100 epoch_Time:26959.0min: [2024-01-04 04:41:08,097][model8_pretrain.py][INFO] Epoch:[0/2](331800/4588595) loss:3.059 lr:0.0000100 epoch_Time:26959.0min: [2024-01-04 04:41:08,097][model8_pretrain.py][INFO] Epoch:[0/2](331800/4588595) loss:3.030 lr:0.0000100 epoch_Time:26959.0min: [2024-01-04 
04:41:08,097][model8_pretrain.py][INFO] Epoch:[0/2](331800/4588595) loss:3.313 lr:0.0000100 epoch_Time:26959.0min: [2024-01-04 04:41:08,097][model8_pretrain.py][INFO] Epoch:[0/2](331800/4588595) loss:3.139 lr:0.0000100 epoch_Time:26959.0min: [2024-01-04 04:41:08,097][model8_pretrain.py][INFO] Epoch:[0/2](331800/4588595) loss:2.321 lr:0.0000100 epoch_Time:26959.0min: [2024-01-04 04:41:08,097][model8_pretrain.py][INFO] Epoch:[0/2](331800/4588595) loss:3.408 lr:0.0000100 epoch_Time:26959.0min: [2024-01-04 04:41:45,041][model8_pretrain.py][INFO] Epoch:[0/2](331900/4588595) loss:2.946 lr:0.0000100 epoch_Time:26958.0min: [2024-01-04 04:41:45,041][model8_pretrain.py][INFO] Epoch:[0/2](331900/4588595) loss:3.129 lr:0.0000100 epoch_Time:26958.0min: [2024-01-04 04:41:45,041][model8_pretrain.py][INFO] Epoch:[0/2](331900/4588595) loss:2.666 lr:0.0000100 epoch_Time:26958.0min: [2024-01-04 04:41:45,041][model8_pretrain.py][INFO] Epoch:[0/2](331900/4588595) loss:2.793 lr:0.0000100 epoch_Time:26958.0min: [2024-01-04 04:41:45,041][model8_pretrain.py][INFO] Epoch:[0/2](331900/4588595) loss:2.917 lr:0.0000100 epoch_Time:26958.0min: [2024-01-04 04:41:45,041][model8_pretrain.py][INFO] Epoch:[0/2](331900/4588595) loss:2.270 lr:0.0000100 epoch_Time:26958.0min: [2024-01-04 04:41:45,041][model8_pretrain.py][INFO] Epoch:[0/2](331900/4588595) loss:3.154 lr:0.0000100 epoch_Time:26958.0min: [2024-01-04 04:41:45,041][model8_pretrain.py][INFO] Epoch:[0/2](331900/4588595) loss:3.066 lr:0.0000100 epoch_Time:26958.0min: [2024-01-04 04:42:21,974][model8_pretrain.py][INFO] Epoch:[0/2](332000/4588595) loss:3.063 lr:0.0000100 epoch_Time:26957.0min: [2024-01-04 04:42:21,974][model8_pretrain.py][INFO] Epoch:[0/2](332000/4588595) loss:3.051 lr:0.0000100 epoch_Time:26957.0min: [2024-01-04 04:42:21,974][model8_pretrain.py][INFO] Epoch:[0/2](332000/4588595) loss:2.666 lr:0.0000100 epoch_Time:26957.0min: [2024-01-04 04:42:21,974][model8_pretrain.py][INFO] Epoch:[0/2](332000/4588595) loss:2.623 lr:0.0000100 
epoch_Time:26957.0min: [2024-01-04 04:42:21,974][model8_pretrain.py][INFO] Epoch:[0/2](332000/4588595) loss:3.159 lr:0.0000100 epoch_Time:26957.0min: [2024-01-04 04:42:21,974][model8_pretrain.py][INFO] Epoch:[0/2](332000/4588595) loss:2.683 lr:0.0000100 epoch_Time:26957.0min: [2024-01-04 04:42:21,974][model8_pretrain.py][INFO] Epoch:[0/2](332000/4588595) loss:2.670 lr:0.0000100 epoch_Time:26957.0min: [2024-01-04 04:42:21,974][model8_pretrain.py][INFO] Epoch:[0/2](332000/4588595) loss:2.992 lr:0.0000100 epoch_Time:26957.0min: [2024-01-04 04:42:58,907][model8_pretrain.py][INFO] Epoch:[0/2](332100/4588595) loss:3.252 lr:0.0000100 epoch_Time:26956.0min: [2024-01-04 04:42:58,907][model8_pretrain.py][INFO] Epoch:[0/2](332100/4588595) loss:2.522 lr:0.0000100 epoch_Time:26956.0min: [2024-01-04 04:42:58,907][model8_pretrain.py][INFO] Epoch:[0/2](332100/4588595) loss:3.213 lr:0.0000100 epoch_Time:26956.0min: [2024-01-04 04:42:58,907][model8_pretrain.py][INFO] Epoch:[0/2](332100/4588595) loss:2.915 lr:0.0000100 epoch_Time:26956.0min: [2024-01-04 04:42:58,908][model8_pretrain.py][INFO] Epoch:[0/2](332100/4588595) loss:3.011 lr:0.0000100 epoch_Time:26956.0min: [2024-01-04 04:42:58,908][model8_pretrain.py][INFO] Epoch:[0/2](332100/4588595) loss:3.277 lr:0.0000100 epoch_Time:26956.0min: [2024-01-04 04:42:58,908][model8_pretrain.py][INFO] Epoch:[0/2](332100/4588595) loss:2.849 lr:0.0000100 epoch_Time:26956.0min: [2024-01-04 04:42:58,908][model8_pretrain.py][INFO] Epoch:[0/2](332100/4588595) loss:3.162 lr:0.0000100 epoch_Time:26956.0min: [2024-01-04 04:43:35,838][model8_pretrain.py][INFO] Epoch:[0/2](332200/4588595) loss:2.664 lr:0.0000100 epoch_Time:26956.0min: [2024-01-04 04:43:35,838][model8_pretrain.py][INFO] Epoch:[0/2](332200/4588595) loss:2.803 lr:0.0000100 epoch_Time:26956.0min: [2024-01-04 04:43:35,838][model8_pretrain.py][INFO] Epoch:[0/2](332200/4588595) loss:2.447 lr:0.0000100 epoch_Time:26956.0min: [2024-01-04 04:43:35,838][model8_pretrain.py][INFO] 
Epoch:[0/2](332200/4588595) loss:2.744 lr:0.0000100 epoch_Time:26956.0min: [2024-01-04 04:43:35,838][model8_pretrain.py][INFO] Epoch:[0/2](332200/4588595) loss:2.599 lr:0.0000100 epoch_Time:26956.0min: [2024-01-04 04:43:35,838][model8_pretrain.py][INFO] Epoch:[0/2](332200/4588595) loss:2.182 lr:0.0000100 epoch_Time:26956.0min: [2024-01-04 04:43:35,838][model8_pretrain.py][INFO] Epoch:[0/2](332200/4588595) loss:2.851 lr:0.0000100 epoch_Time:26956.0min: [2024-01-04 04:43:35,838][model8_pretrain.py][INFO] Epoch:[0/2](332200/4588595) loss:2.786 lr:0.0000100 epoch_Time:26956.0min: [2024-01-04 04:44:12,774][model8_pretrain.py][INFO] Epoch:[0/2](332300/4588595) loss:2.917 lr:0.0000100 epoch_Time:26954.0min: [2024-01-04 04:44:12,774][model8_pretrain.py][INFO] Epoch:[0/2](332300/4588595) loss:3.064 lr:0.0000100 epoch_Time:26954.0min: [2024-01-04 04:44:12,774][model8_pretrain.py][INFO] Epoch:[0/2](332300/4588595) loss:2.548 lr:0.0000100 epoch_Time:26954.0min: [2024-01-04 04:44:12,774][model8_pretrain.py][INFO] Epoch:[0/2](332300/4588595) loss:2.021 lr:0.0000100 epoch_Time:26954.0min: [2024-01-04 04:44:12,774][model8_pretrain.py][INFO] Epoch:[0/2](332300/4588595) loss:3.468 lr:0.0000100 epoch_Time:26954.0min: [2024-01-04 04:44:12,774][model8_pretrain.py][INFO] Epoch:[0/2](332300/4588595) loss:3.569 lr:0.0000100 epoch_Time:26954.0min: [2024-01-04 04:44:12,774][model8_pretrain.py][INFO] Epoch:[0/2](332300/4588595) loss:2.687 lr:0.0000100 epoch_Time:26954.0min: [2024-01-04 04:44:12,775][model8_pretrain.py][INFO] Epoch:[0/2](332300/4588595) loss:2.850 lr:0.0000100 epoch_Time:26954.0min: [2024-01-04 04:44:51,440][model8_pretrain.py][INFO] Epoch:[0/2](332400/4588595) loss:2.698 lr:0.0000100 epoch_Time:26954.0min: [2024-01-04 04:44:51,440][model8_pretrain.py][INFO] Epoch:[0/2](332400/4588595) loss:3.125 lr:0.0000100 epoch_Time:26954.0min: [2024-01-04 04:44:51,440][model8_pretrain.py][INFO] Epoch:[0/2](332400/4588595) loss:2.661 lr:0.0000100 epoch_Time:26954.0min: [2024-01-04 
04:44:51,440][model8_pretrain.py][INFO] Epoch:[0/2](332400/4588595) loss:2.771 lr:0.0000100 epoch_Time:26954.0min: [2024-01-04 04:44:51,440][model8_pretrain.py][INFO] Epoch:[0/2](332400/4588595) loss:2.864 lr:0.0000100 epoch_Time:26954.0min: [2024-01-04 04:44:51,440][model8_pretrain.py][INFO] Epoch:[0/2](332400/4588595) loss:3.066 lr:0.0000100 epoch_Time:26954.0min: [2024-01-04 04:44:51,440][model8_pretrain.py][INFO] Epoch:[0/2](332400/4588595) loss:2.540 lr:0.0000100 epoch_Time:26954.0min: [2024-01-04 04:44:51,440][model8_pretrain.py][INFO] Epoch:[0/2](332400/4588595) loss:2.896 lr:0.0000100 epoch_Time:26954.0min: [2024-01-04 04:45:36,519][model8_pretrain.py][INFO] Epoch:[0/2](332500/4588595) loss:3.130 lr:0.0000100 epoch_Time:26955.0min: [2024-01-04 04:45:36,519][model8_pretrain.py][INFO] Epoch:[0/2](332500/4588595) loss:2.734 lr:0.0000100 epoch_Time:26955.0min: [2024-01-04 04:45:36,519][model8_pretrain.py][INFO] Epoch:[0/2](332500/4588595) loss:2.681 lr:0.0000100 epoch_Time:26955.0min: [2024-01-04 04:45:36,519][model8_pretrain.py][INFO] Epoch:[0/2](332500/4588595) loss:3.158 lr:0.0000100 epoch_Time:26955.0min: [2024-01-04 04:45:36,522][model8_pretrain.py][INFO] Epoch:[0/2](332500/4588595) loss:2.545 lr:0.0000100 epoch_Time:26955.0min: [2024-01-04 04:45:36,522][model8_pretrain.py][INFO] Epoch:[0/2](332500/4588595) loss:2.636 lr:0.0000100 epoch_Time:26955.0min: [2024-01-04 04:45:36,522][model8_pretrain.py][INFO] Epoch:[0/2](332500/4588595) loss:2.783 lr:0.0000100 epoch_Time:26955.0min: [2024-01-04 04:45:36,522][model8_pretrain.py][INFO] Epoch:[0/2](332500/4588595) loss:3.125 lr:0.0000100 epoch_Time:26955.0min: [2024-01-04 04:46:13,461][model8_pretrain.py][INFO] Epoch:[0/2](332600/4588595) loss:3.116 lr:0.0000100 epoch_Time:26954.0min: [2024-01-04 04:46:13,461][model8_pretrain.py][INFO] Epoch:[0/2](332600/4588595) loss:3.272 lr:0.0000100 epoch_Time:26954.0min: [2024-01-04 04:46:13,461][model8_pretrain.py][INFO] Epoch:[0/2](332600/4588595) loss:2.929 lr:0.0000100 
epoch_Time:26954.0min: [2024-01-04 04:46:13,461][model8_pretrain.py][INFO] Epoch:[0/2](332600/4588595) loss:3.451 lr:0.0000100 epoch_Time:26954.0min: [2024-01-04 04:46:13,461][model8_pretrain.py][INFO] Epoch:[0/2](332600/4588595) loss:3.115 lr:0.0000100 epoch_Time:26954.0min: [2024-01-04 04:46:13,461][model8_pretrain.py][INFO] Epoch:[0/2](332600/4588595) loss:3.164 lr:0.0000100 epoch_Time:26954.0min: [2024-01-04 04:46:13,461][model8_pretrain.py][INFO] Epoch:[0/2](332600/4588595) loss:3.259 lr:0.0000100 epoch_Time:26954.0min: [2024-01-04 04:46:13,461][model8_pretrain.py][INFO] Epoch:[0/2](332600/4588595) loss:2.555 lr:0.0000100 epoch_Time:26954.0min: [2024-01-04 04:46:50,396][model8_pretrain.py][INFO] Epoch:[0/2](332700/4588595) loss:3.250 lr:0.0000100 epoch_Time:26953.0min: [2024-01-04 04:46:50,396][model8_pretrain.py][INFO] Epoch:[0/2](332700/4588595) loss:2.610 lr:0.0000100 epoch_Time:26953.0min: [2024-01-04 04:46:50,396][model8_pretrain.py][INFO] Epoch:[0/2](332700/4588595) loss:2.873 lr:0.0000100 epoch_Time:26953.0min: [2024-01-04 04:46:50,396][model8_pretrain.py][INFO] Epoch:[0/2](332700/4588595) loss:2.915 lr:0.0000100 epoch_Time:26953.0min: [2024-01-04 04:46:50,396][model8_pretrain.py][INFO] Epoch:[0/2](332700/4588595) loss:2.856 lr:0.0000100 epoch_Time:26953.0min: [2024-01-04 04:46:50,396][model8_pretrain.py][INFO] Epoch:[0/2](332700/4588595) loss:2.963 lr:0.0000100 epoch_Time:26953.0min: [2024-01-04 04:46:50,396][model8_pretrain.py][INFO] Epoch:[0/2](332700/4588595) loss:2.628 lr:0.0000100 epoch_Time:26953.0min: [2024-01-04 04:46:50,397][model8_pretrain.py][INFO] Epoch:[0/2](332700/4588595) loss:3.255 lr:0.0000100 epoch_Time:26953.0min: [2024-01-04 04:47:27,357][model8_pretrain.py][INFO] Epoch:[0/2](332800/4588595) loss:2.926 lr:0.0000100 epoch_Time:26953.0min: [2024-01-04 04:47:27,357][model8_pretrain.py][INFO] Epoch:[0/2](332800/4588595) loss:2.625 lr:0.0000100 epoch_Time:26953.0min: [2024-01-04 04:47:27,357][model8_pretrain.py][INFO] 
Epoch:[0/2](332800/4588595) loss:2.979 lr:0.0000100 epoch_Time:26953.0min: [2024-01-04 04:47:27,357][model8_pretrain.py][INFO] Epoch:[0/2](332800/4588595) loss:3.160 lr:0.0000100 epoch_Time:26953.0min: [2024-01-04 04:47:27,357][model8_pretrain.py][INFO] Epoch:[0/2](332800/4588595) loss:2.983 lr:0.0000100 epoch_Time:26953.0min: [2024-01-04 04:47:27,357][model8_pretrain.py][INFO] Epoch:[0/2](332800/4588595) loss:3.099 lr:0.0000100 epoch_Time:26953.0min: [2024-01-04 04:47:27,357][model8_pretrain.py][INFO] Epoch:[0/2](332800/4588595) loss:3.267 lr:0.0000100 epoch_Time:26953.0min: [2024-01-04 04:47:27,358][model8_pretrain.py][INFO] Epoch:[0/2](332800/4588595) loss:3.245 lr:0.0000100 epoch_Time:26953.0min: [2024-01-04 04:48:04,300][model8_pretrain.py][INFO] Epoch:[0/2](332900/4588595) loss:3.266 lr:0.0000100 epoch_Time:26951.0min: [2024-01-04 04:48:04,300][model8_pretrain.py][INFO] Epoch:[0/2](332900/4588595) loss:3.241 lr:0.0000100 epoch_Time:26951.0min: [2024-01-04 04:48:04,300][model8_pretrain.py][INFO] Epoch:[0/2](332900/4588595) loss:2.787 lr:0.0000100 epoch_Time:26951.0min: [2024-01-04 04:48:04,300][model8_pretrain.py][INFO] Epoch:[0/2](332900/4588595) loss:2.366 lr:0.0000100 epoch_Time:26951.0min: [2024-01-04 04:48:04,300][model8_pretrain.py][INFO] Epoch:[0/2](332900/4588595) loss:2.647 lr:0.0000100 epoch_Time:26951.0min: [2024-01-04 04:48:04,300][model8_pretrain.py][INFO] Epoch:[0/2](332900/4588595) loss:2.959 lr:0.0000100 epoch_Time:26951.0min: [2024-01-04 04:48:04,300][model8_pretrain.py][INFO] Epoch:[0/2](332900/4588595) loss:2.481 lr:0.0000100 epoch_Time:26951.0min: [2024-01-04 04:48:04,300][model8_pretrain.py][INFO] Epoch:[0/2](332900/4588595) loss:2.474 lr:0.0000100 epoch_Time:26951.0min: [2024-01-04 04:48:41,244][model8_pretrain.py][INFO] Epoch:[0/2](333000/4588595) loss:3.136 lr:0.0000100 epoch_Time:26951.0min: [2024-01-04 04:48:41,244][model8_pretrain.py][INFO] Epoch:[0/2](333000/4588595) loss:1.890 lr:0.0000100 epoch_Time:26951.0min: [2024-01-04 
04:48:41,244][model8_pretrain.py][INFO] Epoch:[0/2](333000/4588595) loss:2.410 lr:0.0000100 epoch_Time:26951.0min: [2024-01-04 04:48:41,244][model8_pretrain.py][INFO] Epoch:[0/2](333000/4588595) loss:2.383 lr:0.0000100 epoch_Time:26951.0min: [2024-01-04 04:48:41,244][model8_pretrain.py][INFO] Epoch:[0/2](333000/4588595) loss:3.156 lr:0.0000100 epoch_Time:26951.0min: [2024-01-04 04:48:41,244][model8_pretrain.py][INFO] Epoch:[0/2](333000/4588595) loss:2.385 lr:0.0000100 epoch_Time:26951.0min: [2024-01-04 04:48:41,244][model8_pretrain.py][INFO] Epoch:[0/2](333000/4588595) loss:2.892 lr:0.0000100 epoch_Time:26951.0min: [2024-01-04 04:48:41,244][model8_pretrain.py][INFO] Epoch:[0/2](333000/4588595) loss:3.130 lr:0.0000100 epoch_Time:26951.0min: [2024-01-04 04:49:18,181][model8_pretrain.py][INFO] Epoch:[0/2](333100/4588595) loss:3.277 lr:0.0000100 epoch_Time:26950.0min: [2024-01-04 04:49:18,181][model8_pretrain.py][INFO] Epoch:[0/2](333100/4588595) loss:3.091 lr:0.0000100 epoch_Time:26950.0min: [2024-01-04 04:49:18,181][model8_pretrain.py][INFO] Epoch:[0/2](333100/4588595) loss:2.418 lr:0.0000100 epoch_Time:26950.0min: [2024-01-04 04:49:18,181][model8_pretrain.py][INFO] Epoch:[0/2](333100/4588595) loss:2.915 lr:0.0000100 epoch_Time:26950.0min: [2024-01-04 04:49:18,181][model8_pretrain.py][INFO] Epoch:[0/2](333100/4588595) loss:2.940 lr:0.0000100 epoch_Time:26950.0min: [2024-01-04 04:49:18,181][model8_pretrain.py][INFO] Epoch:[0/2](333100/4588595) loss:2.110 lr:0.0000100 epoch_Time:26950.0min: [2024-01-04 04:49:18,181][model8_pretrain.py][INFO] Epoch:[0/2](333100/4588595) loss:3.364 lr:0.0000100 epoch_Time:26950.0min: [2024-01-04 04:49:18,181][model8_pretrain.py][INFO] Epoch:[0/2](333100/4588595) loss:2.199 lr:0.0000100 epoch_Time:26950.0min: [2024-01-04 04:49:56,894][model8_pretrain.py][INFO] Epoch:[0/2](333200/4588595) loss:3.243 lr:0.0000100 epoch_Time:26949.0min: [2024-01-04 04:49:56,894][model8_pretrain.py][INFO] Epoch:[0/2](333200/4588595) loss:3.172 lr:0.0000100 
epoch_Time:26949.0min: [2024-01-04 04:49:56,894][model8_pretrain.py][INFO] Epoch:[0/2](333200/4588595) loss:2.654 lr:0.0000100 epoch_Time:26949.0min: [2024-01-04 04:49:56,894][model8_pretrain.py][INFO] Epoch:[0/2](333200/4588595) loss:3.326 lr:0.0000100 epoch_Time:26949.0min: [2024-01-04 04:49:56,894][model8_pretrain.py][INFO] Epoch:[0/2](333200/4588595) loss:2.692 lr:0.0000100 epoch_Time:26949.0min: [2024-01-04 04:49:56,896][model8_pretrain.py][INFO] Epoch:[0/2](333200/4588595) loss:2.880 lr:0.0000100 epoch_Time:26949.0min: [2024-01-04 04:49:56,897][model8_pretrain.py][INFO] Epoch:[0/2](333200/4588595) loss:2.990 lr:0.0000100 epoch_Time:26949.0min: [2024-01-04 04:49:56,897][model8_pretrain.py][INFO] Epoch:[0/2](333200/4588595) loss:2.619 lr:0.0000100 epoch_Time:26949.0min: [2024-01-04 04:50:42,058][model8_pretrain.py][INFO] Epoch:[0/2](333300/4588595) loss:3.078 lr:0.0000100 epoch_Time:26951.0min: [2024-01-04 04:50:42,058][model8_pretrain.py][INFO] Epoch:[0/2](333300/4588595) loss:3.185 lr:0.0000100 epoch_Time:26951.0min: [2024-01-04 04:50:42,058][model8_pretrain.py][INFO] Epoch:[0/2](333300/4588595) loss:3.161 lr:0.0000100 epoch_Time:26951.0min: [2024-01-04 04:50:42,058][model8_pretrain.py][INFO] Epoch:[0/2](333300/4588595) loss:2.751 lr:0.0000100 epoch_Time:26951.0min: [2024-01-04 04:50:42,058][model8_pretrain.py][INFO] Epoch:[0/2](333300/4588595) loss:2.742 lr:0.0000100 epoch_Time:26951.0min: [2024-01-04 04:50:42,058][model8_pretrain.py][INFO] Epoch:[0/2](333300/4588595) loss:2.380 lr:0.0000100 epoch_Time:26951.0min: [2024-01-04 04:50:42,058][model8_pretrain.py][INFO] Epoch:[0/2](333300/4588595) loss:3.657 lr:0.0000100 epoch_Time:26951.0min: [2024-01-04 04:50:42,058][model8_pretrain.py][INFO] Epoch:[0/2](333300/4588595) loss:3.099 lr:0.0000100 epoch_Time:26951.0min: [2024-01-04 04:51:19,007][model8_pretrain.py][INFO] Epoch:[0/2](333400/4588595) loss:2.992 lr:0.0000100 epoch_Time:26949.0min: [2024-01-04 04:51:19,007][model8_pretrain.py][INFO] 
Epoch:[0/2](333400/4588595) loss:2.766 lr:0.0000100 epoch_Time:26949.0min: [2024-01-04 04:51:19,007][model8_pretrain.py][INFO] Epoch:[0/2](333400/4588595) loss:2.202 lr:0.0000100 epoch_Time:26949.0min: [2024-01-04 04:51:19,007][model8_pretrain.py][INFO] Epoch:[0/2](333400/4588595) loss:2.848 lr:0.0000100 epoch_Time:26949.0min: [2024-01-04 04:51:19,007][model8_pretrain.py][INFO] Epoch:[0/2](333400/4588595) loss:2.915 lr:0.0000100 epoch_Time:26949.0min: [2024-01-04 04:51:19,007][model8_pretrain.py][INFO] Epoch:[0/2](333400/4588595) loss:3.121 lr:0.0000100 epoch_Time:26949.0min: [2024-01-04 04:51:19,008][model8_pretrain.py][INFO] Epoch:[0/2](333400/4588595) loss:2.653 lr:0.0000100 epoch_Time:26949.0min: [2024-01-04 04:51:19,008][model8_pretrain.py][INFO] Epoch:[0/2](333400/4588595) loss:3.261 lr:0.0000100 epoch_Time:26949.0min: [2024-01-04 04:51:55,947][model8_pretrain.py][INFO] Epoch:[0/2](333500/4588595) loss:2.661 lr:0.0000100 epoch_Time:26948.0min: [2024-01-04 04:51:55,947][model8_pretrain.py][INFO] Epoch:[0/2](333500/4588595) loss:2.788 lr:0.0000100 epoch_Time:26948.0min: [2024-01-04 04:51:55,947][model8_pretrain.py][INFO] Epoch:[0/2](333500/4588595) loss:2.650 lr:0.0000100 epoch_Time:26948.0min: [2024-01-04 04:51:55,947][model8_pretrain.py][INFO] Epoch:[0/2](333500/4588595) loss:2.458 lr:0.0000100 epoch_Time:26948.0min: [2024-01-04 04:51:55,947][model8_pretrain.py][INFO] Epoch:[0/2](333500/4588595) loss:2.558 lr:0.0000100 epoch_Time:26948.0min: [2024-01-04 04:51:55,947][model8_pretrain.py][INFO] Epoch:[0/2](333500/4588595) loss:3.108 lr:0.0000100 epoch_Time:26948.0min: [2024-01-04 04:51:55,947][model8_pretrain.py][INFO] Epoch:[0/2](333500/4588595) loss:2.990 lr:0.0000100 epoch_Time:26948.0min: [2024-01-04 04:51:55,947][model8_pretrain.py][INFO] Epoch:[0/2](333500/4588595) loss:2.713 lr:0.0000100 epoch_Time:26948.0min: [2024-01-04 04:52:32,906][model8_pretrain.py][INFO] Epoch:[0/2](333600/4588595) loss:2.731 lr:0.0000100 epoch_Time:26948.0min: [2024-01-04 
04:52:32,906][model8_pretrain.py][INFO] Epoch:[0/2](333600/4588595) loss:2.787 lr:0.0000100 epoch_Time:26948.0min: [2024-01-04 04:52:32,906][model8_pretrain.py][INFO] Epoch:[0/2](333600/4588595) loss:2.496 lr:0.0000100 epoch_Time:26948.0min: [2024-01-04 04:52:32,906][model8_pretrain.py][INFO] Epoch:[0/2](333600/4588595) loss:3.100 lr:0.0000100 epoch_Time:26948.0min: [2024-01-04 04:52:32,906][model8_pretrain.py][INFO] Epoch:[0/2](333600/4588595) loss:3.080 lr:0.0000100 epoch_Time:26948.0min: [2024-01-04 04:52:32,906][model8_pretrain.py][INFO] Epoch:[0/2](333600/4588595) loss:2.967 lr:0.0000100 epoch_Time:26948.0min: [2024-01-04 04:52:32,906][model8_pretrain.py][INFO] Epoch:[0/2](333600/4588595) loss:2.782 lr:0.0000100 epoch_Time:26948.0min: [2024-01-04 04:52:32,906][model8_pretrain.py][INFO] Epoch:[0/2](333600/4588595) loss:2.750 lr:0.0000100 epoch_Time:26948.0min: [2024-01-04 04:53:09,851][model8_pretrain.py][INFO] Epoch:[0/2](333700/4588595) loss:2.564 lr:0.0000100 epoch_Time:26947.0min: [2024-01-04 04:53:09,851][model8_pretrain.py][INFO] Epoch:[0/2](333700/4588595) loss:3.025 lr:0.0000100 epoch_Time:26947.0min: [2024-01-04 04:53:09,851][model8_pretrain.py][INFO] Epoch:[0/2](333700/4588595) loss:3.282 lr:0.0000100 epoch_Time:26947.0min: [2024-01-04 04:53:09,851][model8_pretrain.py][INFO] Epoch:[0/2](333700/4588595) loss:2.577 lr:0.0000100 epoch_Time:26947.0min: [2024-01-04 04:53:09,851][model8_pretrain.py][INFO] Epoch:[0/2](333700/4588595) loss:3.170 lr:0.0000100 epoch_Time:26947.0min: [2024-01-04 04:53:09,851][model8_pretrain.py][INFO] Epoch:[0/2](333700/4588595) loss:3.075 lr:0.0000100 epoch_Time:26947.0min: [2024-01-04 04:53:09,851][model8_pretrain.py][INFO] Epoch:[0/2](333700/4588595) loss:2.922 lr:0.0000100 epoch_Time:26947.0min: [2024-01-04 04:53:09,851][model8_pretrain.py][INFO] Epoch:[0/2](333700/4588595) loss:2.923 lr:0.0000100 epoch_Time:26947.0min: [2024-01-04 04:53:46,794][model8_pretrain.py][INFO] Epoch:[0/2](333800/4588595) loss:2.905 lr:0.0000100 
epoch_Time:26946.0min: [2024-01-04 04:53:46,795][model8_pretrain.py][INFO] Epoch:[0/2](333800/4588595) loss:2.589 lr:0.0000100 epoch_Time:26946.0min: [2024-01-04 04:53:46,795][model8_pretrain.py][INFO] Epoch:[0/2](333800/4588595) loss:3.192 lr:0.0000100 epoch_Time:26946.0min: [2024-01-04 04:53:46,795][model8_pretrain.py][INFO] Epoch:[0/2](333800/4588595) loss:3.159 lr:0.0000100 epoch_Time:26946.0min: [2024-01-04 04:53:46,795][model8_pretrain.py][INFO] Epoch:[0/2](333800/4588595) loss:3.272 lr:0.0000100 epoch_Time:26946.0min: [2024-01-04 04:53:46,795][model8_pretrain.py][INFO] Epoch:[0/2](333800/4588595) loss:3.125 lr:0.0000100 epoch_Time:26946.0min: [2024-01-04 04:53:46,795][model8_pretrain.py][INFO] Epoch:[0/2](333800/4588595) loss:2.757 lr:0.0000100 epoch_Time:26946.0min: [2024-01-04 04:53:46,795][model8_pretrain.py][INFO] Epoch:[0/2](333800/4588595) loss:3.231 lr:0.0000100 epoch_Time:26946.0min: [2024-01-04 04:54:23,720][model8_pretrain.py][INFO] Epoch:[0/2](333900/4588595) loss:2.759 lr:0.0000100 epoch_Time:26945.0min: [2024-01-04 04:54:23,720][model8_pretrain.py][INFO] Epoch:[0/2](333900/4588595) loss:2.808 lr:0.0000100 epoch_Time:26945.0min: [2024-01-04 04:54:23,720][model8_pretrain.py][INFO] Epoch:[0/2](333900/4588595) loss:3.283 lr:0.0000100 epoch_Time:26945.0min: [2024-01-04 04:54:23,720][model8_pretrain.py][INFO] Epoch:[0/2](333900/4588595) loss:3.388 lr:0.0000100 epoch_Time:26945.0min: [2024-01-04 04:54:23,721][model8_pretrain.py][INFO] Epoch:[0/2](333900/4588595) loss:2.909 lr:0.0000100 epoch_Time:26945.0min: [2024-01-04 04:54:23,721][model8_pretrain.py][INFO] Epoch:[0/2](333900/4588595) loss:2.807 lr:0.0000100 epoch_Time:26945.0min: [2024-01-04 04:54:23,721][model8_pretrain.py][INFO] Epoch:[0/2](333900/4588595) loss:3.190 lr:0.0000100 epoch_Time:26945.0min: [2024-01-04 04:54:23,721][model8_pretrain.py][INFO] Epoch:[0/2](333900/4588595) loss:2.840 lr:0.0000100 epoch_Time:26945.0min: [2024-01-04 04:55:00,665][model8_pretrain.py][INFO] 
Epoch:[0/2](334000/4588595) loss:3.167 lr:0.0000100 epoch_Time:26944.0min: [2024-01-04 04:55:00,665][model8_pretrain.py][INFO] Epoch:[0/2](334000/4588595) loss:3.021 lr:0.0000100 epoch_Time:26944.0min: [2024-01-04 04:55:00,665][model8_pretrain.py][INFO] Epoch:[0/2](334000/4588595) loss:2.798 lr:0.0000100 epoch_Time:26944.0min: [2024-01-04 04:55:00,665][model8_pretrain.py][INFO] Epoch:[0/2](334000/4588595) loss:2.918 lr:0.0000100 epoch_Time:26944.0min: [2024-01-04 04:55:00,665][model8_pretrain.py][INFO] Epoch:[0/2](334000/4588595) loss:2.726 lr:0.0000100 epoch_Time:26944.0min: [2024-01-04 04:55:00,665][model8_pretrain.py][INFO] Epoch:[0/2](334000/4588595) loss:2.712 lr:0.0000100 epoch_Time:26944.0min: [2024-01-04 04:55:00,665][model8_pretrain.py][INFO] Epoch:[0/2](334000/4588595) loss:3.171 lr:0.0000100 epoch_Time:26944.0min: [2024-01-04 04:55:00,665][model8_pretrain.py][INFO] Epoch:[0/2](334000/4588595) loss:2.841 lr:0.0000100 epoch_Time:26944.0min: [2024-01-04 04:55:47,868][model8_pretrain.py][INFO] Epoch:[0/2](334100/4588595) loss:2.603 lr:0.0000100 epoch_Time:26945.0min: [2024-01-04 04:55:47,868][model8_pretrain.py][INFO] Epoch:[0/2](334100/4588595) loss:2.853 lr:0.0000100 epoch_Time:26945.0min: [2024-01-04 04:55:47,869][model8_pretrain.py][INFO] Epoch:[0/2](334100/4588595) loss:3.228 lr:0.0000100 epoch_Time:26945.0min: [2024-01-04 04:55:47,869][model8_pretrain.py][INFO] Epoch:[0/2](334100/4588595) loss:2.551 lr:0.0000100 epoch_Time:26945.0min: [2024-01-04 04:55:47,868][model8_pretrain.py][INFO] Epoch:[0/2](334100/4588595) loss:2.857 lr:0.0000100 epoch_Time:26945.0min: [2024-01-04 04:55:47,869][model8_pretrain.py][INFO] Epoch:[0/2](334100/4588595) loss:2.993 lr:0.0000100 epoch_Time:26945.0min: [2024-01-04 04:55:47,869][model8_pretrain.py][INFO] Epoch:[0/2](334100/4588595) loss:2.901 lr:0.0000100 epoch_Time:26945.0min: [2024-01-04 04:55:47,869][model8_pretrain.py][INFO] Epoch:[0/2](334100/4588595) loss:2.607 lr:0.0000100 epoch_Time:26945.0min: [2024-01-04 
04:56:24,805][model8_pretrain.py][INFO] Epoch:[0/2](334200/4588595) loss:2.524 lr:0.0000100 epoch_Time:26945.0min: [2024-01-04 04:56:24,805][model8_pretrain.py][INFO] Epoch:[0/2](334200/4588595) loss:2.279 lr:0.0000100 epoch_Time:26945.0min: [2024-01-04 04:56:24,805][model8_pretrain.py][INFO] Epoch:[0/2](334200/4588595) loss:2.823 lr:0.0000100 epoch_Time:26945.0min: [2024-01-04 04:56:24,805][model8_pretrain.py][INFO] Epoch:[0/2](334200/4588595) loss:2.823 lr:0.0000100 epoch_Time:26945.0min: [2024-01-04 04:56:24,805][model8_pretrain.py][INFO] Epoch:[0/2](334200/4588595) loss:3.120 lr:0.0000100 epoch_Time:26945.0min: [2024-01-04 04:56:24,805][model8_pretrain.py][INFO] Epoch:[0/2](334200/4588595) loss:3.105 lr:0.0000100 epoch_Time:26945.0min: [2024-01-04 04:56:24,805][model8_pretrain.py][INFO] Epoch:[0/2](334200/4588595) loss:3.205 lr:0.0000100 epoch_Time:26945.0min: [2024-01-04 04:56:24,805][model8_pretrain.py][INFO] Epoch:[0/2](334200/4588595) loss:3.188 lr:0.0000100 epoch_Time:26945.0min: [2024-01-04 04:57:01,751][model8_pretrain.py][INFO] Epoch:[0/2](334300/4588595) loss:2.398 lr:0.0000100 epoch_Time:26944.0min: [2024-01-04 04:57:01,751][model8_pretrain.py][INFO] Epoch:[0/2](334300/4588595) loss:2.956 lr:0.0000100 epoch_Time:26944.0min: [2024-01-04 04:57:01,751][model8_pretrain.py][INFO] Epoch:[0/2](334300/4588595) loss:2.596 lr:0.0000100 epoch_Time:26944.0min: [2024-01-04 04:57:01,751][model8_pretrain.py][INFO] Epoch:[0/2](334300/4588595) loss:3.220 lr:0.0000100 epoch_Time:26944.0min: [2024-01-04 04:57:01,751][model8_pretrain.py][INFO] Epoch:[0/2](334300/4588595) loss:3.017 lr:0.0000100 epoch_Time:26944.0min: [2024-01-04 04:57:01,751][model8_pretrain.py][INFO] Epoch:[0/2](334300/4588595) loss:2.435 lr:0.0000100 epoch_Time:26944.0min: [2024-01-04 04:57:01,751][model8_pretrain.py][INFO] Epoch:[0/2](334300/4588595) loss:3.208 lr:0.0000100 epoch_Time:26944.0min: [2024-01-04 04:57:01,752][model8_pretrain.py][INFO] Epoch:[0/2](334300/4588595) loss:2.780 lr:0.0000100 
epoch_Time:26944.0min: [2024-01-04 04:57:38,703][model8_pretrain.py][INFO] Epoch:[0/2](334400/4588595) loss:2.154 lr:0.0000100 epoch_Time:26943.0min: [2024-01-04 04:57:38,703][model8_pretrain.py][INFO] Epoch:[0/2](334400/4588595) loss:2.710 lr:0.0000100 epoch_Time:26943.0min: [2024-01-04 04:57:38,703][model8_pretrain.py][INFO] Epoch:[0/2](334400/4588595) loss:2.633 lr:0.0000100 epoch_Time:26943.0min: [2024-01-04 04:57:38,703][model8_pretrain.py][INFO] Epoch:[0/2](334400/4588595) loss:3.179 lr:0.0000100 epoch_Time:26943.0min: [2024-01-04 04:57:38,703][model8_pretrain.py][INFO] Epoch:[0/2](334400/4588595) loss:3.146 lr:0.0000100 epoch_Time:26943.0min: [2024-01-04 04:57:38,703][model8_pretrain.py][INFO] Epoch:[0/2](334400/4588595) loss:2.811 lr:0.0000100 epoch_Time:26943.0min: [2024-01-04 04:57:38,704][model8_pretrain.py][INFO] Epoch:[0/2](334400/4588595) loss:2.989 lr:0.0000100 epoch_Time:26943.0min: [2024-01-04 04:57:38,704][model8_pretrain.py][INFO] Epoch:[0/2](334400/4588595) loss:3.042 lr:0.0000100 epoch_Time:26943.0min: [2024-01-04 04:58:15,672][model8_pretrain.py][INFO] Epoch:[0/2](334500/4588595) loss:3.299 lr:0.0000100 epoch_Time:26942.0min: [2024-01-04 04:58:15,672][model8_pretrain.py][INFO] Epoch:[0/2](334500/4588595) loss:3.213 lr:0.0000100 epoch_Time:26942.0min: [2024-01-04 04:58:15,672][model8_pretrain.py][INFO] Epoch:[0/2](334500/4588595) loss:2.503 lr:0.0000100 epoch_Time:26942.0min: [2024-01-04 04:58:15,672][model8_pretrain.py][INFO] Epoch:[0/2](334500/4588595) loss:3.014 lr:0.0000100 epoch_Time:26942.0min: [2024-01-04 04:58:15,672][model8_pretrain.py][INFO] Epoch:[0/2](334500/4588595) loss:2.976 lr:0.0000100 epoch_Time:26942.0min: [2024-01-04 04:58:15,672][model8_pretrain.py][INFO] Epoch:[0/2](334500/4588595) loss:3.223 lr:0.0000100 epoch_Time:26942.0min: [2024-01-04 04:58:15,673][model8_pretrain.py][INFO] Epoch:[0/2](334500/4588595) loss:3.224 lr:0.0000100 epoch_Time:26942.0min: [2024-01-04 04:58:15,673][model8_pretrain.py][INFO] 
Epoch:[0/2](334500/4588595) loss:2.938 lr:0.0000100 epoch_Time:26942.0min: [2024-01-04 04:58:52,631][model8_pretrain.py][INFO] Epoch:[0/2](334600/4588595) loss:2.380 lr:0.0000100 epoch_Time:26941.0min: [2024-01-04 04:58:52,631][model8_pretrain.py][INFO] Epoch:[0/2](334600/4588595) loss:2.630 lr:0.0000100 epoch_Time:26941.0min: [2024-01-04 04:58:52,631][model8_pretrain.py][INFO] Epoch:[0/2](334600/4588595) loss:2.775 lr:0.0000100 epoch_Time:26941.0min: [2024-01-04 04:58:52,631][model8_pretrain.py][INFO] Epoch:[0/2](334600/4588595) loss:2.559 lr:0.0000100 epoch_Time:26941.0min: [2024-01-04 04:58:52,631][model8_pretrain.py][INFO] Epoch:[0/2](334600/4588595) loss:3.284 lr:0.0000100 epoch_Time:26941.0min: [2024-01-04 04:58:52,631][model8_pretrain.py][INFO] Epoch:[0/2](334600/4588595) loss:2.994 lr:0.0000100 epoch_Time:26941.0min: [2024-01-04 04:58:52,631][model8_pretrain.py][INFO] Epoch:[0/2](334600/4588595) loss:2.823 lr:0.0000100 epoch_Time:26941.0min: [2024-01-04 04:58:52,631][model8_pretrain.py][INFO] Epoch:[0/2](334600/4588595) loss:2.854 lr:0.0000100 epoch_Time:26941.0min: [2024-01-04 04:59:29,572][model8_pretrain.py][INFO] Epoch:[0/2](334700/4588595) loss:2.653 lr:0.0000100 epoch_Time:26941.0min: [2024-01-04 04:59:29,572][model8_pretrain.py][INFO] Epoch:[0/2](334700/4588595) loss:3.205 lr:0.0000100 epoch_Time:26941.0min: [2024-01-04 04:59:29,572][model8_pretrain.py][INFO] Epoch:[0/2](334700/4588595) loss:2.527 lr:0.0000100 epoch_Time:26941.0min: [2024-01-04 04:59:29,572][model8_pretrain.py][INFO] Epoch:[0/2](334700/4588595) loss:2.670 lr:0.0000100 epoch_Time:26941.0min: [2024-01-04 04:59:29,572][model8_pretrain.py][INFO] Epoch:[0/2](334700/4588595) loss:2.891 lr:0.0000100 epoch_Time:26941.0min: [2024-01-04 04:59:29,572][model8_pretrain.py][INFO] Epoch:[0/2](334700/4588595) loss:2.692 lr:0.0000100 epoch_Time:26941.0min: [2024-01-04 04:59:29,572][model8_pretrain.py][INFO] Epoch:[0/2](334700/4588595) loss:3.130 lr:0.0000100 epoch_Time:26941.0min: [2024-01-04 
04:59:29,572][model8_pretrain.py][INFO] Epoch:[0/2](334700/4588595) loss:2.615 lr:0.0000100 epoch_Time:26941.0min: [2024-01-04 05:00:06,522][model8_pretrain.py][INFO] Epoch:[0/2](334800/4588595) loss:2.486 lr:0.0000100 epoch_Time:26939.0min: [2024-01-04 05:00:06,522][model8_pretrain.py][INFO] Epoch:[0/2](334800/4588595) loss:2.977 lr:0.0000100 epoch_Time:26939.0min: [2024-01-04 05:00:06,522][model8_pretrain.py][INFO] Epoch:[0/2](334800/4588595) loss:2.881 lr:0.0000100 epoch_Time:26939.0min: [2024-01-04 05:00:06,522][model8_pretrain.py][INFO] Epoch:[0/2](334800/4588595) loss:3.205 lr:0.0000100 epoch_Time:26939.0min: [2024-01-04 05:00:06,522][model8_pretrain.py][INFO] Epoch:[0/2](334800/4588595) loss:2.765 lr:0.0000100 epoch_Time:26939.0min: [2024-01-04 05:00:06,522][model8_pretrain.py][INFO] Epoch:[0/2](334800/4588595) loss:2.527 lr:0.0000100 epoch_Time:26939.0min: [2024-01-04 05:00:06,522][model8_pretrain.py][INFO] Epoch:[0/2](334800/4588595) loss:3.029 lr:0.0000100 epoch_Time:26939.0min: [2024-01-04 05:00:06,522][model8_pretrain.py][INFO] Epoch:[0/2](334800/4588595) loss:3.312 lr:0.0000100 epoch_Time:26939.0min: [2024-01-04 05:00:53,866][model8_pretrain.py][INFO] Epoch:[0/2](334900/4588595) loss:3.119 lr:0.0000100 epoch_Time:26940.0min: [2024-01-04 05:00:53,866][model8_pretrain.py][INFO] Epoch:[0/2](334900/4588595) loss:3.503 lr:0.0000100 epoch_Time:26940.0min: [2024-01-04 05:00:53,866][model8_pretrain.py][INFO] Epoch:[0/2](334900/4588595) loss:2.811 lr:0.0000100 epoch_Time:26940.0min: [2024-01-04 05:00:53,866][model8_pretrain.py][INFO] Epoch:[0/2](334900/4588595) loss:3.219 lr:0.0000100 epoch_Time:26940.0min: [2024-01-04 05:00:53,866][model8_pretrain.py][INFO] Epoch:[0/2](334900/4588595) loss:2.958 lr:0.0000100 epoch_Time:26940.0min: [2024-01-04 05:00:53,866][model8_pretrain.py][INFO] Epoch:[0/2](334900/4588595) loss:3.227 lr:0.0000100 epoch_Time:26940.0min: [2024-01-04 05:00:53,866][model8_pretrain.py][INFO] Epoch:[0/2](334900/4588595) loss:3.040 lr:0.0000100 
epoch_Time:26940.0min: [2024-01-04 05:00:53,866][model8_pretrain.py][INFO] Epoch:[0/2](334900/4588595) loss:2.703 lr:0.0000100 epoch_Time:26940.0min: [2024-01-04 05:01:30,802][model8_pretrain.py][INFO] Epoch:[0/2](335000/4588595) loss:3.026 lr:0.0000100 epoch_Time:26940.0min: [2024-01-04 05:01:30,802][model8_pretrain.py][INFO] Epoch:[0/2](335000/4588595) loss:3.008 lr:0.0000100 epoch_Time:26940.0min: [2024-01-04 05:01:30,802][model8_pretrain.py][INFO] Epoch:[0/2](335000/4588595) loss:2.881 lr:0.0000100 epoch_Time:26940.0min: [2024-01-04 05:01:30,802][model8_pretrain.py][INFO] Epoch:[0/2](335000/4588595) loss:2.799 lr:0.0000100 epoch_Time:26940.0min: [2024-01-04 05:01:30,802][model8_pretrain.py][INFO] Epoch:[0/2](335000/4588595) loss:3.044 lr:0.0000100 epoch_Time:26940.0min: [2024-01-04 05:01:30,802][model8_pretrain.py][INFO] Epoch:[0/2](335000/4588595) loss:3.208 lr:0.0000100 epoch_Time:26940.0min: [2024-01-04 05:01:30,802][model8_pretrain.py][INFO] Epoch:[0/2](335000/4588595) loss:2.709 lr:0.0000100 epoch_Time:26940.0min: [2024-01-04 05:01:30,802][model8_pretrain.py][INFO] Epoch:[0/2](335000/4588595) loss:2.397 lr:0.0000100 epoch_Time:26940.0min: [2024-01-04 05:02:07,743][model8_pretrain.py][INFO] Epoch:[0/2](335100/4588595) loss:3.057 lr:0.0000100 epoch_Time:26939.0min: [2024-01-04 05:02:07,743][model8_pretrain.py][INFO] Epoch:[0/2](335100/4588595) loss:2.921 lr:0.0000100 epoch_Time:26939.0min: [2024-01-04 05:02:07,743][model8_pretrain.py][INFO] Epoch:[0/2](335100/4588595) loss:3.181 lr:0.0000100 epoch_Time:26939.0min: [2024-01-04 05:02:07,743][model8_pretrain.py][INFO] Epoch:[0/2](335100/4588595) loss:3.026 lr:0.0000100 epoch_Time:26939.0min: [2024-01-04 05:02:07,743][model8_pretrain.py][INFO] Epoch:[0/2](335100/4588595) loss:3.329 lr:0.0000100 epoch_Time:26939.0min: [2024-01-04 05:02:07,743][model8_pretrain.py][INFO] Epoch:[0/2](335100/4588595) loss:2.755 lr:0.0000100 epoch_Time:26939.0min: [2024-01-04 05:02:07,743][model8_pretrain.py][INFO] 
Epoch:[0/2](335100/4588595) loss:2.861 lr:0.0000100 epoch_Time:26939.0min: [2024-01-04 05:02:07,744][model8_pretrain.py][INFO] Epoch:[0/2](335100/4588595) loss:3.107 lr:0.0000100 epoch_Time:26939.0min: [2024-01-04 05:02:44,682][model8_pretrain.py][INFO] Epoch:[0/2](335200/4588595) loss:2.823 lr:0.0000100 epoch_Time:26939.0min: [2024-01-04 05:02:44,682][model8_pretrain.py][INFO] Epoch:[0/2](335200/4588595) loss:2.469 lr:0.0000100 epoch_Time:26939.0min: [2024-01-04 05:02:44,683][model8_pretrain.py][INFO] Epoch:[0/2](335200/4588595) loss:3.077 lr:0.0000100 epoch_Time:26939.0min: [2024-01-04 05:02:44,683][model8_pretrain.py][INFO] Epoch:[0/2](335200/4588595) loss:2.718 lr:0.0000100 epoch_Time:26939.0min: [2024-01-04 05:02:44,683][model8_pretrain.py][INFO] Epoch:[0/2](335200/4588595) loss:2.777 lr:0.0000100 epoch_Time:26939.0min: [2024-01-04 05:02:44,683][model8_pretrain.py][INFO] Epoch:[0/2](335200/4588595) loss:3.286 lr:0.0000100 epoch_Time:26939.0min: [2024-01-04 05:02:44,683][model8_pretrain.py][INFO] Epoch:[0/2](335200/4588595) loss:2.708 lr:0.0000100 epoch_Time:26939.0min: [2024-01-04 05:02:44,683][model8_pretrain.py][INFO] Epoch:[0/2](335200/4588595) loss:3.106 lr:0.0000100 epoch_Time:26939.0min: [2024-01-04 05:03:21,621][model8_pretrain.py][INFO] Epoch:[0/2](335300/4588595) loss:2.825 lr:0.0000100 epoch_Time:26938.0min: [2024-01-04 05:03:21,621][model8_pretrain.py][INFO] Epoch:[0/2](335300/4588595) loss:3.200 lr:0.0000100 epoch_Time:26938.0min: [2024-01-04 05:03:21,621][model8_pretrain.py][INFO] Epoch:[0/2](335300/4588595) loss:2.251 lr:0.0000100 epoch_Time:26938.0min: [2024-01-04 05:03:21,621][model8_pretrain.py][INFO] Epoch:[0/2](335300/4588595) loss:2.601 lr:0.0000100 epoch_Time:26938.0min: [2024-01-04 05:03:21,621][model8_pretrain.py][INFO] Epoch:[0/2](335300/4588595) loss:3.268 lr:0.0000100 epoch_Time:26938.0min: [2024-01-04 05:03:21,621][model8_pretrain.py][INFO] Epoch:[0/2](335300/4588595) loss:2.756 lr:0.0000100 epoch_Time:26938.0min: [2024-01-04 
05:03:21,621][model8_pretrain.py][INFO] Epoch:[0/2](335300/4588595) loss:2.428 lr:0.0000100 epoch_Time:26938.0min: [2024-01-04 05:03:21,622][model8_pretrain.py][INFO] Epoch:[0/2](335300/4588595) loss:2.803 lr:0.0000100 epoch_Time:26938.0min: [2024-01-04 05:03:58,558][model8_pretrain.py][INFO] Epoch:[0/2](335400/4588595) loss:2.944 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:03:58,558][model8_pretrain.py][INFO] Epoch:[0/2](335400/4588595) loss:2.810 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:03:58,558][model8_pretrain.py][INFO] Epoch:[0/2](335400/4588595) loss:3.035 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:03:58,558][model8_pretrain.py][INFO] Epoch:[0/2](335400/4588595) loss:2.724 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:03:58,558][model8_pretrain.py][INFO] Epoch:[0/2](335400/4588595) loss:2.455 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:03:58,559][model8_pretrain.py][INFO] Epoch:[0/2](335400/4588595) loss:2.999 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:03:58,559][model8_pretrain.py][INFO] Epoch:[0/2](335400/4588595) loss:3.130 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:03:58,559][model8_pretrain.py][INFO] Epoch:[0/2](335400/4588595) loss:3.147 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:04:35,485][model8_pretrain.py][INFO] Epoch:[0/2](335500/4588595) loss:2.723 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:04:35,485][model8_pretrain.py][INFO] Epoch:[0/2](335500/4588595) loss:3.212 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:04:35,485][model8_pretrain.py][INFO] Epoch:[0/2](335500/4588595) loss:2.410 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:04:35,485][model8_pretrain.py][INFO] Epoch:[0/2](335500/4588595) loss:3.133 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:04:35,485][model8_pretrain.py][INFO] Epoch:[0/2](335500/4588595) loss:3.130 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:04:35,485][model8_pretrain.py][INFO] Epoch:[0/2](335500/4588595) loss:2.406 lr:0.0000100 
epoch_Time:26936.0min: [2024-01-04 05:04:35,485][model8_pretrain.py][INFO] Epoch:[0/2](335500/4588595) loss:2.854 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:04:35,485][model8_pretrain.py][INFO] Epoch:[0/2](335500/4588595) loss:3.185 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:05:12,413][model8_pretrain.py][INFO] Epoch:[0/2](335600/4588595) loss:2.722 lr:0.0000100 epoch_Time:26935.0min: [2024-01-04 05:05:12,413][model8_pretrain.py][INFO] Epoch:[0/2](335600/4588595) loss:3.159 lr:0.0000100 epoch_Time:26935.0min: [2024-01-04 05:05:12,413][model8_pretrain.py][INFO] Epoch:[0/2](335600/4588595) loss:2.866 lr:0.0000100 epoch_Time:26935.0min: [2024-01-04 05:05:12,413][model8_pretrain.py][INFO] Epoch:[0/2](335600/4588595) loss:3.144 lr:0.0000100 epoch_Time:26935.0min: [2024-01-04 05:05:12,413][model8_pretrain.py][INFO] Epoch:[0/2](335600/4588595) loss:2.863 lr:0.0000100 epoch_Time:26935.0min: [2024-01-04 05:05:12,413][model8_pretrain.py][INFO] Epoch:[0/2](335600/4588595) loss:2.970 lr:0.0000100 epoch_Time:26935.0min: [2024-01-04 05:05:12,413][model8_pretrain.py][INFO] Epoch:[0/2](335600/4588595) loss:3.013 lr:0.0000100 epoch_Time:26935.0min: [2024-01-04 05:05:12,413][model8_pretrain.py][INFO] Epoch:[0/2](335600/4588595) loss:3.513 lr:0.0000100 epoch_Time:26935.0min: [2024-01-04 05:05:59,709][model8_pretrain.py][INFO] Epoch:[0/2](335700/4588595) loss:2.654 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:05:59,709][model8_pretrain.py][INFO] Epoch:[0/2](335700/4588595) loss:2.435 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:05:59,709][model8_pretrain.py][INFO] Epoch:[0/2](335700/4588595) loss:3.139 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:05:59,709][model8_pretrain.py][INFO] Epoch:[0/2](335700/4588595) loss:2.659 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:05:59,709][model8_pretrain.py][INFO] Epoch:[0/2](335700/4588595) loss:2.573 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:05:59,709][model8_pretrain.py][INFO] 
Epoch:[0/2](335700/4588595) loss:2.965 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:05:59,709][model8_pretrain.py][INFO] Epoch:[0/2](335700/4588595) loss:2.952 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:05:59,709][model8_pretrain.py][INFO] Epoch:[0/2](335700/4588595) loss:2.946 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:06:36,653][model8_pretrain.py][INFO] Epoch:[0/2](335800/4588595) loss:3.085 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:06:36,653][model8_pretrain.py][INFO] Epoch:[0/2](335800/4588595) loss:3.160 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:06:36,653][model8_pretrain.py][INFO] Epoch:[0/2](335800/4588595) loss:2.497 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:06:36,653][model8_pretrain.py][INFO] Epoch:[0/2](335800/4588595) loss:2.966 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:06:36,653][model8_pretrain.py][INFO] Epoch:[0/2](335800/4588595) loss:2.160 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:06:36,653][model8_pretrain.py][INFO] Epoch:[0/2](335800/4588595) loss:2.779 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:06:36,653][model8_pretrain.py][INFO] Epoch:[0/2](335800/4588595) loss:2.428 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:06:36,653][model8_pretrain.py][INFO] Epoch:[0/2](335800/4588595) loss:2.827 lr:0.0000100 epoch_Time:26936.0min: [2024-01-04 05:07:13,574][model8_pretrain.py][INFO] Epoch:[0/2](335900/4588595) loss:3.128 lr:0.0000100 epoch_Time:26934.0min: [2024-01-04 05:07:13,574][model8_pretrain.py][INFO] Epoch:[0/2](335900/4588595) loss:2.998 lr:0.0000100 epoch_Time:26934.0min: [2024-01-04 05:07:13,574][model8_pretrain.py][INFO] Epoch:[0/2](335900/4588595) loss:2.822 lr:0.0000100 epoch_Time:26934.0min: [2024-01-04 05:07:13,574][model8_pretrain.py][INFO] Epoch:[0/2](335900/4588595) loss:3.177 lr:0.0000100 epoch_Time:26934.0min: [2024-01-04 05:07:13,574][model8_pretrain.py][INFO] Epoch:[0/2](335900/4588595) loss:2.087 lr:0.0000100 epoch_Time:26934.0min: [2024-01-04 
05:07:13,574][model8_pretrain.py][INFO] Epoch:[0/2](335900/4588595) loss:3.325 lr:0.0000100 epoch_Time:26934.0min: [2024-01-04 05:07:13,575][model8_pretrain.py][INFO] Epoch:[0/2](335900/4588595) loss:2.950 lr:0.0000100 epoch_Time:26934.0min: [2024-01-04 05:07:13,575][model8_pretrain.py][INFO] Epoch:[0/2](335900/4588595) loss:2.697 lr:0.0000100 epoch_Time:26934.0min: [2024-01-04 05:07:50,485][model8_pretrain.py][INFO] Epoch:[0/2](336000/4588595) loss:2.680 lr:0.0000100 epoch_Time:26933.0min: [2024-01-04 05:07:50,485][model8_pretrain.py][INFO] Epoch:[0/2](336000/4588595) loss:2.645 lr:0.0000100 epoch_Time:26933.0min: [2024-01-04 05:07:50,485][model8_pretrain.py][INFO] Epoch:[0/2](336000/4588595) loss:3.075 lr:0.0000100 epoch_Time:26933.0min: [2024-01-04 05:07:50,485][model8_pretrain.py][INFO] Epoch:[0/2](336000/4588595) loss:2.938 lr:0.0000100 epoch_Time:26933.0min: [2024-01-04 05:07:50,485][model8_pretrain.py][INFO] Epoch:[0/2](336000/4588595) loss:3.187 lr:0.0000100 epoch_Time:26933.0min: [2024-01-04 05:07:50,485][model8_pretrain.py][INFO] Epoch:[0/2](336000/4588595) loss:2.311 lr:0.0000100 epoch_Time:26933.0min: [2024-01-04 05:07:50,486][model8_pretrain.py][INFO] Epoch:[0/2](336000/4588595) loss:3.355 lr:0.0000100 epoch_Time:26933.0min: [2024-01-04 05:07:50,486][model8_pretrain.py][INFO] Epoch:[0/2](336000/4588595) loss:2.603 lr:0.0000100 epoch_Time:26933.0min: [2024-01-04 05:08:27,423][model8_pretrain.py][INFO] Epoch:[0/2](336100/4588595) loss:3.274 lr:0.0000100 epoch_Time:26933.0min: [2024-01-04 05:08:27,423][model8_pretrain.py][INFO] Epoch:[0/2](336100/4588595) loss:3.098 lr:0.0000100 epoch_Time:26933.0min: [2024-01-04 05:08:27,423][model8_pretrain.py][INFO] Epoch:[0/2](336100/4588595) loss:3.026 lr:0.0000100 epoch_Time:26933.0min: [2024-01-04 05:08:27,423][model8_pretrain.py][INFO] Epoch:[0/2](336100/4588595) loss:2.907 lr:0.0000100 epoch_Time:26933.0min: [2024-01-04 05:08:27,423][model8_pretrain.py][INFO] Epoch:[0/2](336100/4588595) loss:2.723 lr:0.0000100 
epoch_Time:26933.0min: [2024-01-04 05:08:27,423][model8_pretrain.py][INFO] Epoch:[0/2](336100/4588595) loss:3.084 lr:0.0000100 epoch_Time:26933.0min: [2024-01-04 05:08:27,423][model8_pretrain.py][INFO] Epoch:[0/2](336100/4588595) loss:2.504 lr:0.0000100 epoch_Time:26933.0min: [2024-01-04 05:08:27,424][model8_pretrain.py][INFO] Epoch:[0/2](336100/4588595) loss:2.624 lr:0.0000100 epoch_Time:26933.0min: [2024-01-04 05:09:04,350][model8_pretrain.py][INFO] Epoch:[0/2](336200/4588595) loss:3.028 lr:0.0000100 epoch_Time:26932.0min: [2024-01-04 05:09:04,350][model8_pretrain.py][INFO] Epoch:[0/2](336200/4588595) loss:2.559 lr:0.0000100 epoch_Time:26932.0min: [2024-01-04 05:09:04,350][model8_pretrain.py][INFO] Epoch:[0/2](336200/4588595) loss:3.239 lr:0.0000100 epoch_Time:26932.0min: [2024-01-04 05:09:04,350][model8_pretrain.py][INFO] Epoch:[0/2](336200/4588595) loss:3.260 lr:0.0000100 epoch_Time:26932.0min: [2024-01-04 05:09:04,350][model8_pretrain.py][INFO] Epoch:[0/2](336200/4588595) loss:2.742 lr:0.0000100 epoch_Time:26932.0min: [2024-01-04 05:09:04,350][model8_pretrain.py][INFO] Epoch:[0/2](336200/4588595) loss:3.215 lr:0.0000100 epoch_Time:26932.0min: [2024-01-04 05:09:04,350][model8_pretrain.py][INFO] Epoch:[0/2](336200/4588595) loss:3.264 lr:0.0000100 epoch_Time:26932.0min: [2024-01-04 05:09:04,350][model8_pretrain.py][INFO] Epoch:[0/2](336200/4588595) loss:2.968 lr:0.0000100 epoch_Time:26932.0min: [2024-01-04 05:09:41,280][model8_pretrain.py][INFO] Epoch:[0/2](336300/4588595) loss:3.093 lr:0.0000100 epoch_Time:26931.0min: [2024-01-04 05:09:41,280][model8_pretrain.py][INFO] Epoch:[0/2](336300/4588595) loss:3.284 lr:0.0000100 epoch_Time:26931.0min: [2024-01-04 05:09:41,280][model8_pretrain.py][INFO] Epoch:[0/2](336300/4588595) loss:3.132 lr:0.0000100 epoch_Time:26931.0min: [2024-01-04 05:09:41,280][model8_pretrain.py][INFO] Epoch:[0/2](336300/4588595) loss:2.935 lr:0.0000100 epoch_Time:26931.0min: [2024-01-04 05:09:41,280][model8_pretrain.py][INFO] 
Epoch:[0/2](336300/4588595) loss:3.087 lr:0.0000100 epoch_Time:26931.0min: [2024-01-04 05:09:41,280][model8_pretrain.py][INFO] Epoch:[0/2](336300/4588595) loss:2.831 lr:0.0000100 epoch_Time:26931.0min: [2024-01-04 05:09:41,280][model8_pretrain.py][INFO] Epoch:[0/2](336300/4588595) loss:2.772 lr:0.0000100 epoch_Time:26931.0min: [2024-01-04 05:09:41,280][model8_pretrain.py][INFO] Epoch:[0/2](336300/4588595) loss:2.379 lr:0.0000100 epoch_Time:26931.0min: [2024-01-04 05:10:18,237][model8_pretrain.py][INFO] Epoch:[0/2](336400/4588595) loss:3.203 lr:0.0000100 epoch_Time:26930.0min: [2024-01-04 05:10:18,237][model8_pretrain.py][INFO] Epoch:[0/2](336400/4588595) loss:3.086 lr:0.0000100 epoch_Time:26930.0min: [2024-01-04 05:10:18,237][model8_pretrain.py][INFO] Epoch:[0/2](336400/4588595) loss:2.760 lr:0.0000100 epoch_Time:26930.0min: [2024-01-04 05:10:18,237][model8_pretrain.py][INFO] Epoch:[0/2](336400/4588595) loss:2.663 lr:0.0000100 epoch_Time:26930.0min: [2024-01-04 05:10:18,237][model8_pretrain.py][INFO] Epoch:[0/2](336400/4588595) loss:2.637 lr:0.0000100 epoch_Time:26930.0min: [2024-01-04 05:10:18,237][model8_pretrain.py][INFO] Epoch:[0/2](336400/4588595) loss:3.117 lr:0.0000100 epoch_Time:26930.0min: [2024-01-04 05:10:18,237][model8_pretrain.py][INFO] Epoch:[0/2](336400/4588595) loss:2.617 lr:0.0000100 epoch_Time:26930.0min: [2024-01-04 05:10:18,237][model8_pretrain.py][INFO] Epoch:[0/2](336400/4588595) loss:2.532 lr:0.0000100 epoch_Time:26930.0min: [2024-01-04 05:11:05,152][model8_pretrain.py][INFO] Epoch:[0/2](336500/4588595) loss:2.911 lr:0.0000100 epoch_Time:26931.0min: [2024-01-04 05:11:05,152][model8_pretrain.py][INFO] Epoch:[0/2](336500/4588595) loss:2.881 lr:0.0000100 epoch_Time:26931.0min: [2024-01-04 05:11:05,152][model8_pretrain.py][INFO] Epoch:[0/2](336500/4588595) loss:2.667 lr:0.0000100 epoch_Time:26931.0min: [2024-01-04 05:11:05,152][model8_pretrain.py][INFO] Epoch:[0/2](336500/4588595) loss:2.896 lr:0.0000100 epoch_Time:26931.0min: [2024-01-04 
05:11:05,152][model8_pretrain.py][INFO] Epoch:[0/2](336500/4588595) loss:3.146 lr:0.0000100 epoch_Time:26931.0min: [2024-01-04 05:11:05,152][model8_pretrain.py][INFO] Epoch:[0/2](336500/4588595) loss:3.106 lr:0.0000100 epoch_Time:26931.0min: [2024-01-04 05:11:05,152][model8_pretrain.py][INFO] Epoch:[0/2](336500/4588595) loss:3.135 lr:0.0000100 epoch_Time:26931.0min: [2024-01-04 05:11:05,152][model8_pretrain.py][INFO] Epoch:[0/2](336500/4588595) loss:2.658 lr:0.0000100 epoch_Time:26931.0min: [2024-01-04 05:11:42,079][model8_pretrain.py][INFO] Epoch:[0/2](336600/4588595) loss:3.030 lr:0.0000100 epoch_Time:26931.0min: [2024-01-04 05:11:42,079][model8_pretrain.py][INFO] Epoch:[0/2](336600/4588595) loss:2.661 lr:0.0000100 epoch_Time:26931.0min: [2024-01-04 05:11:42,079][model8_pretrain.py][INFO] Epoch:[0/2](336600/4588595) loss:2.418 lr:0.0000100 epoch_Time:26931.0min: [2024-01-04 05:11:42,079][model8_pretrain.py][INFO] Epoch:[0/2](336600/4588595) loss:3.393 lr:0.0000100 epoch_Time:26931.0min: [2024-01-04 05:11:42,079][model8_pretrain.py][INFO] Epoch:[0/2](336600/4588595) loss:2.771 lr:0.0000100 epoch_Time:26931.0min: [2024-01-04 05:11:42,079][model8_pretrain.py][INFO] Epoch:[0/2](336600/4588595) loss:2.913 lr:0.0000100 epoch_Time:26931.0min: [2024-01-04 05:11:42,079][model8_pretrain.py][INFO] Epoch:[0/2](336600/4588595) loss:3.121 lr:0.0000100 epoch_Time:26931.0min: [2024-01-04 05:11:42,079][model8_pretrain.py][INFO] Epoch:[0/2](336600/4588595) loss:3.178 lr:0.0000100 epoch_Time:26931.0min: [2024-01-04 05:12:19,008][model8_pretrain.py][INFO] Epoch:[0/2](336700/4588595) loss:3.117 lr:0.0000100 epoch_Time:26930.0min: [2024-01-04 05:12:19,008][model8_pretrain.py][INFO] Epoch:[0/2](336700/4588595) loss:3.397 lr:0.0000100 epoch_Time:26930.0min: [2024-01-04 05:12:19,008][model8_pretrain.py][INFO] Epoch:[0/2](336700/4588595) loss:3.622 lr:0.0000100 epoch_Time:26930.0min: [2024-01-04 05:12:19,008][model8_pretrain.py][INFO] Epoch:[0/2](336700/4588595) loss:2.542 lr:0.0000100 
epoch_Time:26930.0min: [2024-01-04 05:12:19,008][model8_pretrain.py][INFO] Epoch:[0/2](336700/4588595) loss:2.628 lr:0.0000100 epoch_Time:26930.0min: [2024-01-04 05:12:19,008][model8_pretrain.py][INFO] Epoch:[0/2](336700/4588595) loss:3.343 lr:0.0000100 epoch_Time:26930.0min: [2024-01-04 05:12:19,008][model8_pretrain.py][INFO] Epoch:[0/2](336700/4588595) loss:2.567 lr:0.0000100 epoch_Time:26930.0min: [2024-01-04 05:12:19,008][model8_pretrain.py][INFO] Epoch:[0/2](336700/4588595) loss:2.729 lr:0.0000100 epoch_Time:26930.0min: [2024-01-04 05:12:55,931][model8_pretrain.py][INFO] Epoch:[0/2](336800/4588595) loss:2.976 lr:0.0000100 epoch_Time:26928.0min: [2024-01-04 05:12:55,931][model8_pretrain.py][INFO] Epoch:[0/2](336800/4588595) loss:3.189 lr:0.0000100 epoch_Time:26928.0min: [2024-01-04 05:12:55,931][model8_pretrain.py][INFO] Epoch:[0/2](336800/4588595) loss:2.971 lr:0.0000100 epoch_Time:26929.0min: [2024-01-04 05:12:55,931][model8_pretrain.py][INFO] Epoch:[0/2](336800/4588595) loss:3.041 lr:0.0000100 epoch_Time:26928.0min: [2024-01-04 05:12:55,931][model8_pretrain.py][INFO] Epoch:[0/2](336800/4588595) loss:2.193 lr:0.0000100 epoch_Time:26928.0min: [2024-01-04 05:12:55,931][model8_pretrain.py][INFO] Epoch:[0/2](336800/4588595) loss:3.130 lr:0.0000100 epoch_Time:26928.0min: [2024-01-04 05:12:55,931][model8_pretrain.py][INFO] Epoch:[0/2](336800/4588595) loss:2.833 lr:0.0000100 epoch_Time:26929.0min: [2024-01-04 05:12:55,931][model8_pretrain.py][INFO] Epoch:[0/2](336800/4588595) loss:3.499 lr:0.0000100 epoch_Time:26928.0min: [2024-01-04 05:13:32,860][model8_pretrain.py][INFO] Epoch:[0/2](336900/4588595) loss:2.731 lr:0.0000100 epoch_Time:26928.0min: [2024-01-04 05:13:32,860][model8_pretrain.py][INFO] Epoch:[0/2](336900/4588595) loss:2.693 lr:0.0000100 epoch_Time:26928.0min: [2024-01-04 05:13:32,860][model8_pretrain.py][INFO] Epoch:[0/2](336900/4588595) loss:2.786 lr:0.0000100 epoch_Time:26928.0min: [2024-01-04 05:13:32,860][model8_pretrain.py][INFO] 
Epoch:[0/2](336900/4588595) loss:2.709 lr:0.0000100 epoch_Time:26928.0min: [2024-01-04 05:13:32,860][model8_pretrain.py][INFO] Epoch:[0/2](336900/4588595) loss:2.795 lr:0.0000100 epoch_Time:26928.0min: [2024-01-04 05:13:32,860][model8_pretrain.py][INFO] Epoch:[0/2](336900/4588595) loss:2.568 lr:0.0000100 epoch_Time:26928.0min: [2024-01-04 05:13:32,860][model8_pretrain.py][INFO] Epoch:[0/2](336900/4588595) loss:2.586 lr:0.0000100 epoch_Time:26928.0min: [2024-01-04 05:13:32,861][model8_pretrain.py][INFO] Epoch:[0/2](336900/4588595) loss:2.933 lr:0.0000100 epoch_Time:26928.0min: [2024-01-04 05:14:09,791][model8_pretrain.py][INFO] Epoch:[0/2](337000/4588595) loss:3.340 lr:0.0000100 epoch_Time:26927.0min: [2024-01-04 05:14:09,791][model8_pretrain.py][INFO] Epoch:[0/2](337000/4588595) loss:2.758 lr:0.0000100 epoch_Time:26927.0min: [2024-01-04 05:14:09,791][model8_pretrain.py][INFO] Epoch:[0/2](337000/4588595) loss:2.745 lr:0.0000100 epoch_Time:26927.0min: [2024-01-04 05:14:09,791][model8_pretrain.py][INFO] Epoch:[0/2](337000/4588595) loss:3.168 lr:0.0000100 epoch_Time:26927.0min: [2024-01-04 05:14:09,791][model8_pretrain.py][INFO] Epoch:[0/2](337000/4588595) loss:2.853 lr:0.0000100 epoch_Time:26927.0min: [2024-01-04 05:14:09,791][model8_pretrain.py][INFO] Epoch:[0/2](337000/4588595) loss:2.808 lr:0.0000100 epoch_Time:26927.0min: [2024-01-04 05:14:09,791][model8_pretrain.py][INFO] Epoch:[0/2](337000/4588595) loss:2.511 lr:0.0000100 epoch_Time:26927.0min: [2024-01-04 05:14:09,791][model8_pretrain.py][INFO] Epoch:[0/2](337000/4588595) loss:3.064 lr:0.0000100 epoch_Time:26927.0min: [2024-01-04 05:14:46,704][model8_pretrain.py][INFO] Epoch:[0/2](337100/4588595) loss:3.087 lr:0.0000100 epoch_Time:26927.0min: [2024-01-04 05:14:46,704][model8_pretrain.py][INFO] Epoch:[0/2](337100/4588595) loss:3.075 lr:0.0000100 epoch_Time:26927.0min: [2024-01-04 05:14:46,704][model8_pretrain.py][INFO] Epoch:[0/2](337100/4588595) loss:3.051 lr:0.0000100 epoch_Time:26927.0min: [2024-01-04 
05:14:46,704][model8_pretrain.py][INFO] Epoch:[0/2](337100/4588595) loss:2.547 lr:0.0000100 epoch_Time:26927.0min: [2024-01-04 05:14:46,704][model8_pretrain.py][INFO] Epoch:[0/2](337100/4588595) loss:2.993 lr:0.0000100 epoch_Time:26927.0min: [2024-01-04 05:14:46,704][model8_pretrain.py][INFO] Epoch:[0/2](337100/4588595) loss:2.631 lr:0.0000100 epoch_Time:26927.0min: [2024-01-04 05:14:46,704][model8_pretrain.py][INFO] Epoch:[0/2](337100/4588595) loss:3.304 lr:0.0000100 epoch_Time:26927.0min: [2024-01-04 05:14:46,705][model8_pretrain.py][INFO] Epoch:[0/2](337100/4588595) loss:2.939 lr:0.0000100 epoch_Time:26927.0min: [2024-01-04 05:15:23,642][model8_pretrain.py][INFO] Epoch:[0/2](337200/4588595) loss:3.135 lr:0.0000100 epoch_Time:26926.0min: [2024-01-04 05:15:23,642][model8_pretrain.py][INFO] Epoch:[0/2](337200/4588595) loss:3.014 lr:0.0000100 epoch_Time:26926.0min: [2024-01-04 05:15:23,642][model8_pretrain.py][INFO] Epoch:[0/2](337200/4588595) loss:3.075 lr:0.0000100 epoch_Time:26926.0min: [2024-01-04 05:15:23,643][model8_pretrain.py][INFO] Epoch:[0/2](337200/4588595) loss:2.778 lr:0.0000100 epoch_Time:26926.0min: [2024-01-04 05:15:23,643][model8_pretrain.py][INFO] Epoch:[0/2](337200/4588595) loss:2.973 lr:0.0000100 epoch_Time:26926.0min: [2024-01-04 05:15:23,643][model8_pretrain.py][INFO] Epoch:[0/2](337200/4588595) loss:3.013 lr:0.0000100 epoch_Time:26926.0min: [2024-01-04 05:15:23,643][model8_pretrain.py][INFO] Epoch:[0/2](337200/4588595) loss:3.071 lr:0.0000100 epoch_Time:26926.0min: [2024-01-04 05:15:23,644][model8_pretrain.py][INFO] Epoch:[0/2](337200/4588595) loss:2.881 lr:0.0000100 epoch_Time:26926.0min: [2024-01-04 05:16:10,547][model8_pretrain.py][INFO] Epoch:[0/2](337300/4588595) loss:2.891 lr:0.0000100 epoch_Time:26927.0min: [2024-01-04 05:16:10,547][model8_pretrain.py][INFO] Epoch:[0/2](337300/4588595) loss:2.415 lr:0.0000100 epoch_Time:26927.0min: [2024-01-04 05:16:10,547][model8_pretrain.py][INFO] Epoch:[0/2](337300/4588595) loss:2.733 lr:0.0000100 
epoch_Time:26927.0min: [2024-01-04 05:16:10,547][model8_pretrain.py][INFO] Epoch:[0/2](337300/4588595) loss:2.631 lr:0.0000100 epoch_Time:26927.0min: [2024-01-04 05:16:10,547][model8_pretrain.py][INFO] Epoch:[0/2](337300/4588595) loss:3.154 lr:0.0000100 epoch_Time:26927.0min: [2024-01-04 05:16:10,547][model8_pretrain.py][INFO] Epoch:[0/2](337300/4588595) loss:3.393 lr:0.0000100 epoch_Time:26927.0min: [2024-01-04 05:16:10,547][model8_pretrain.py][INFO] Epoch:[0/2](337300/4588595) loss:2.676 lr:0.0000100 epoch_Time:26927.0min: [2024-01-04 05:16:10,547][model8_pretrain.py][INFO] Epoch:[0/2](337300/4588595) loss:3.469 lr:0.0000100 epoch_Time:26927.0min: [2024-01-04 05:16:47,467][model8_pretrain.py][INFO] Epoch:[0/2](337400/4588595) loss:2.448 lr:0.0000100 epoch_Time:26926.0min: [2024-01-04 05:16:47,467][model8_pretrain.py][INFO] Epoch:[0/2](337400/4588595) loss:3.106 lr:0.0000100 epoch_Time:26926.0min: [2024-01-04 05:16:47,468][model8_pretrain.py][INFO] Epoch:[0/2](337400/4588595) loss:3.241 lr:0.0000100 epoch_Time:26926.0min: [2024-01-04 05:16:47,468][model8_pretrain.py][INFO] Epoch:[0/2](337400/4588595) loss:2.887 lr:0.0000100 epoch_Time:26926.0min: [2024-01-04 05:16:47,467][model8_pretrain.py][INFO] Epoch:[0/2](337400/4588595) loss:2.480 lr:0.0000100 epoch_Time:26926.0min: [2024-01-04 05:16:47,468][model8_pretrain.py][INFO] Epoch:[0/2](337400/4588595) loss:3.236 lr:0.0000100 epoch_Time:26926.0min: [2024-01-04 05:16:47,468][model8_pretrain.py][INFO] Epoch:[0/2](337400/4588595) loss:2.511 lr:0.0000100 epoch_Time:26926.0min: [2024-01-04 05:16:47,468][model8_pretrain.py][INFO] Epoch:[0/2](337400/4588595) loss:2.955 lr:0.0000100 epoch_Time:26926.0min: [2024-01-04 05:17:24,395][model8_pretrain.py][INFO] Epoch:[0/2](337500/4588595) loss:2.999 lr:0.0000100 epoch_Time:26925.0min: [2024-01-04 05:17:24,396][model8_pretrain.py][INFO] Epoch:[0/2](337500/4588595) loss:1.958 lr:0.0000100 epoch_Time:26925.0min: [2024-01-04 05:17:24,396][model8_pretrain.py][INFO] 
Epoch:[0/2](337500/4588595) loss:2.380 lr:0.0000100 epoch_Time:26925.0min: [2024-01-04 05:17:24,396][model8_pretrain.py][INFO] Epoch:[0/2](337500/4588595) loss:2.986 lr:0.0000100 epoch_Time:26925.0min: [2024-01-04 05:17:24,396][model8_pretrain.py][INFO] Epoch:[0/2](337500/4588595) loss:3.102 lr:0.0000100 epoch_Time:26925.0min: [2024-01-04 05:17:24,396][model8_pretrain.py][INFO] Epoch:[0/2](337500/4588595) loss:3.468 lr:0.0000100 epoch_Time:26925.0min: [2024-01-04 05:17:24,396][model8_pretrain.py][INFO] Epoch:[0/2](337500/4588595) loss:2.674 lr:0.0000100 epoch_Time:26925.0min: [2024-01-04 05:17:24,396][model8_pretrain.py][INFO] Epoch:[0/2](337500/4588595) loss:2.810 lr:0.0000100 epoch_Time:26925.0min: [2024-01-04 05:18:01,331][model8_pretrain.py][INFO] Epoch:[0/2](337600/4588595) loss:2.975 lr:0.0000100 epoch_Time:26924.0min: [2024-01-04 05:18:01,331][model8_pretrain.py][INFO] Epoch:[0/2](337600/4588595) loss:3.116 lr:0.0000100 epoch_Time:26924.0min: [2024-01-04 05:18:01,331][model8_pretrain.py][INFO] Epoch:[0/2](337600/4588595) loss:3.028 lr:0.0000100 epoch_Time:26924.0min: [2024-01-04 05:18:01,331][model8_pretrain.py][INFO] Epoch:[0/2](337600/4588595) loss:2.816 lr:0.0000100 epoch_Time:26924.0min: [2024-01-04 05:18:01,332][model8_pretrain.py][INFO] Epoch:[0/2](337600/4588595) loss:3.603 lr:0.0000100 epoch_Time:26924.0min: [2024-01-04 05:18:01,332][model8_pretrain.py][INFO] Epoch:[0/2](337600/4588595) loss:3.083 lr:0.0000100 epoch_Time:26924.0min: [2024-01-04 05:18:01,332][model8_pretrain.py][INFO] Epoch:[0/2](337600/4588595) loss:2.639 lr:0.0000100 epoch_Time:26924.0min: [2024-01-04 05:18:01,333][model8_pretrain.py][INFO] Epoch:[0/2](337600/4588595) loss:2.844 lr:0.0000100 epoch_Time:26924.0min: [2024-01-04 05:18:38,269][model8_pretrain.py][INFO] Epoch:[0/2](337700/4588595) loss:3.142 lr:0.0000100 epoch_Time:26924.0min: [2024-01-04 05:18:38,269][model8_pretrain.py][INFO] Epoch:[0/2](337700/4588595) loss:2.861 lr:0.0000100 epoch_Time:26924.0min: [2024-01-04 
05:18:38,269][model8_pretrain.py][INFO] Epoch:[0/2](337700/4588595) loss:3.118 lr:0.0000100 epoch_Time:26924.0min: [2024-01-04 05:18:38,269][model8_pretrain.py][INFO] Epoch:[0/2](337700/4588595) loss:2.743 lr:0.0000100 epoch_Time:26924.0min: [2024-01-04 05:18:38,269][model8_pretrain.py][INFO] Epoch:[0/2](337700/4588595) loss:2.623 lr:0.0000100 epoch_Time:26924.0min: [2024-01-04 05:18:38,269][model8_pretrain.py][INFO] Epoch:[0/2](337700/4588595) loss:2.823 lr:0.0000100 epoch_Time:26924.0min: [2024-01-04 05:18:38,269][model8_pretrain.py][INFO] Epoch:[0/2](337700/4588595) loss:2.596 lr:0.0000100 epoch_Time:26924.0min: [2024-01-04 05:18:38,269][model8_pretrain.py][INFO] Epoch:[0/2](337700/4588595) loss:3.198 lr:0.0000100 epoch_Time:26924.0min: [2024-01-04 05:19:15,205][model8_pretrain.py][INFO] Epoch:[0/2](337800/4588595) loss:2.337 lr:0.0000100 epoch_Time:26922.0min: [2024-01-04 05:19:15,205][model8_pretrain.py][INFO] Epoch:[0/2](337800/4588595) loss:3.009 lr:0.0000100 epoch_Time:26922.0min: [2024-01-04 05:19:15,206][model8_pretrain.py][INFO] Epoch:[0/2](337800/4588595) loss:2.786 lr:0.0000100 epoch_Time:26922.0min: [2024-01-04 05:19:15,206][model8_pretrain.py][INFO] Epoch:[0/2](337800/4588595) loss:2.476 lr:0.0000100 epoch_Time:26922.0min: [2024-01-04 05:19:15,206][model8_pretrain.py][INFO] Epoch:[0/2](337800/4588595) loss:2.745 lr:0.0000100 epoch_Time:26922.0min: [2024-01-04 05:19:15,206][model8_pretrain.py][INFO] Epoch:[0/2](337800/4588595) loss:3.021 lr:0.0000100 epoch_Time:26922.0min: [2024-01-04 05:19:15,206][model8_pretrain.py][INFO] Epoch:[0/2](337800/4588595) loss:2.141 lr:0.0000100 epoch_Time:26922.0min: [2024-01-04 05:19:15,206][model8_pretrain.py][INFO] Epoch:[0/2](337800/4588595) loss:2.752 lr:0.0000100 epoch_Time:26922.0min: [2024-01-04 05:19:52,132][model8_pretrain.py][INFO] Epoch:[0/2](337900/4588595) loss:2.808 lr:0.0000100 epoch_Time:26921.0min: [2024-01-04 05:19:52,132][model8_pretrain.py][INFO] Epoch:[0/2](337900/4588595) loss:3.136 lr:0.0000100 
epoch_Time:26921.0min: [2024-01-04 05:19:52,132][model8_pretrain.py][INFO] Epoch:[0/2](337900/4588595) loss:3.076 lr:0.0000100 epoch_Time:26921.0min: [2024-01-04 05:19:52,132][model8_pretrain.py][INFO] Epoch:[0/2](337900/4588595) loss:2.967 lr:0.0000100 epoch_Time:26921.0min: [2024-01-04 05:19:52,132][model8_pretrain.py][INFO] Epoch:[0/2](337900/4588595) loss:2.755 lr:0.0000100 epoch_Time:26921.0min: [2024-01-04 05:19:52,132][model8_pretrain.py][INFO] Epoch:[0/2](337900/4588595) loss:2.518 lr:0.0000100 epoch_Time:26921.0min: [2024-01-04 05:19:52,132][model8_pretrain.py][INFO] Epoch:[0/2](337900/4588595) loss:3.419 lr:0.0000100 epoch_Time:26921.0min: [2024-01-04 05:19:52,132][model8_pretrain.py][INFO] Epoch:[0/2](337900/4588595) loss:3.020 lr:0.0000100 epoch_Time:26921.0min: [2024-01-04 05:20:29,062][model8_pretrain.py][INFO] Epoch:[0/2](338000/4588595) loss:2.742 lr:0.0000100 epoch_Time:26921.0min: [2024-01-04 05:20:29,062][model8_pretrain.py][INFO] Epoch:[0/2](338000/4588595) loss:2.662 lr:0.0000100 epoch_Time:26921.0min: [2024-01-04 05:20:29,062][model8_pretrain.py][INFO] Epoch:[0/2](338000/4588595) loss:3.094 lr:0.0000100 epoch_Time:26921.0min: [2024-01-04 05:20:29,062][model8_pretrain.py][INFO] Epoch:[0/2](338000/4588595) loss:3.009 lr:0.0000100 epoch_Time:26921.0min: [2024-01-04 05:20:29,062][model8_pretrain.py][INFO] Epoch:[0/2](338000/4588595) loss:3.158 lr:0.0000100 epoch_Time:26921.0min: [2024-01-04 05:20:29,062][model8_pretrain.py][INFO] Epoch:[0/2](338000/4588595) loss:2.595 lr:0.0000100 epoch_Time:26921.0min: [2024-01-04 05:20:29,062][model8_pretrain.py][INFO] Epoch:[0/2](338000/4588595) loss:3.053 lr:0.0000100 epoch_Time:26921.0min: [2024-01-04 05:20:29,062][model8_pretrain.py][INFO] Epoch:[0/2](338000/4588595) loss:3.356 lr:0.0000100 epoch_Time:26921.0min: [2024-01-04 05:21:15,910][model8_pretrain.py][INFO] Epoch:[0/2](338100/4588595) loss:2.346 lr:0.0000100 epoch_Time:26922.0min: [2024-01-04 05:21:15,910][model8_pretrain.py][INFO] 
Epoch:[0/2](338100/4588595) loss:3.065 lr:0.0000100 epoch_Time:26922.0min: [2024-01-04 05:21:15,910][model8_pretrain.py][INFO] Epoch:[0/2](338100/4588595) loss:2.593 lr:0.0000100 epoch_Time:26922.0min: [2024-01-04 05:21:15,910][model8_pretrain.py][INFO] Epoch:[0/2](338100/4588595) loss:2.674 lr:0.0000100 epoch_Time:26922.0min: [2024-01-04 05:21:15,910][model8_pretrain.py][INFO] Epoch:[0/2](338100/4588595) loss:2.912 lr:0.0000100 epoch_Time:26922.0min: [2024-01-04 05:21:15,910][model8_pretrain.py][INFO] Epoch:[0/2](338100/4588595) loss:3.045 lr:0.0000100 epoch_Time:26922.0min: [2024-01-04 05:21:15,910][model8_pretrain.py][INFO] Epoch:[0/2](338100/4588595) loss:2.808 lr:0.0000100 epoch_Time:26922.0min: [2024-01-04 05:21:15,910][model8_pretrain.py][INFO] Epoch:[0/2](338100/4588595) loss:2.509 lr:0.0000100 epoch_Time:26922.0min: [2024-01-04 05:21:52,868][model8_pretrain.py][INFO] Epoch:[0/2](338200/4588595) loss:2.977 lr:0.0000100 epoch_Time:26921.0min: [2024-01-04 05:21:52,868][model8_pretrain.py][INFO] Epoch:[0/2](338200/4588595) loss:2.498 lr:0.0000100 epoch_Time:26921.0min: [2024-01-04 05:21:52,868][model8_pretrain.py][INFO] Epoch:[0/2](338200/4588595) loss:2.998 lr:0.0000100 epoch_Time:26921.0min: [2024-01-04 05:21:52,868][model8_pretrain.py][INFO] Epoch:[0/2](338200/4588595) loss:3.151 lr:0.0000100 epoch_Time:26921.0min: [2024-01-04 05:21:52,868][model8_pretrain.py][INFO] Epoch:[0/2](338200/4588595) loss:3.200 lr:0.0000100 epoch_Time:26921.0min: [2024-01-04 05:21:52,868][model8_pretrain.py][INFO] Epoch:[0/2](338200/4588595) loss:2.478 lr:0.0000100 epoch_Time:26921.0min: [2024-01-04 05:21:52,868][model8_pretrain.py][INFO] Epoch:[0/2](338200/4588595) loss:3.060 lr:0.0000100 epoch_Time:26921.0min: [2024-01-04 05:21:52,869][model8_pretrain.py][INFO] Epoch:[0/2](338200/4588595) loss:3.081 lr:0.0000100 epoch_Time:26921.0min: [2024-01-04 05:22:29,820][model8_pretrain.py][INFO] Epoch:[0/2](338300/4588595) loss:2.978 lr:0.0000100 epoch_Time:26920.0min: [2024-01-04 
05:22:29,820][model8_pretrain.py][INFO] Epoch:[0/2](338300/4588595) loss:3.598 lr:0.0000100 epoch_Time:26920.0min: [2024-01-04 05:22:29,820][model8_pretrain.py][INFO] Epoch:[0/2](338300/4588595) loss:3.134 lr:0.0000100 epoch_Time:26920.0min: [2024-01-04 05:22:29,820][model8_pretrain.py][INFO] Epoch:[0/2](338300/4588595) loss:2.266 lr:0.0000100 epoch_Time:26920.0min: [2024-01-04 05:22:29,820][model8_pretrain.py][INFO] Epoch:[0/2](338300/4588595) loss:2.941 lr:0.0000100 epoch_Time:26920.0min: [2024-01-04 05:22:29,820][model8_pretrain.py][INFO] Epoch:[0/2](338300/4588595) loss:3.060 lr:0.0000100 epoch_Time:26920.0min: [2024-01-04 05:22:29,820][model8_pretrain.py][INFO] Epoch:[0/2](338300/4588595) loss:3.037 lr:0.0000100 epoch_Time:26920.0min: [2024-01-04 05:22:29,821][model8_pretrain.py][INFO] Epoch:[0/2](338300/4588595) loss:2.855 lr:0.0000100 epoch_Time:26920.0min: [2024-01-04 05:23:06,763][model8_pretrain.py][INFO] Epoch:[0/2](338400/4588595) loss:2.749 lr:0.0000100 epoch_Time:26919.0min: [2024-01-04 05:23:06,763][model8_pretrain.py][INFO] Epoch:[0/2](338400/4588595) loss:3.109 lr:0.0000100 epoch_Time:26919.0min: [2024-01-04 05:23:06,763][model8_pretrain.py][INFO] Epoch:[0/2](338400/4588595) loss:2.998 lr:0.0000100 epoch_Time:26919.0min: [2024-01-04 05:23:06,763][model8_pretrain.py][INFO] Epoch:[0/2](338400/4588595) loss:2.977 lr:0.0000100 epoch_Time:26919.0min: [2024-01-04 05:23:06,763][model8_pretrain.py][INFO] Epoch:[0/2](338400/4588595) loss:2.824 lr:0.0000100 epoch_Time:26919.0min: [2024-01-04 05:23:06,763][model8_pretrain.py][INFO] Epoch:[0/2](338400/4588595) loss:3.175 lr:0.0000100 epoch_Time:26919.0min: [2024-01-04 05:23:06,763][model8_pretrain.py][INFO] Epoch:[0/2](338400/4588595) loss:3.187 lr:0.0000100 epoch_Time:26919.0min: [2024-01-04 05:23:06,763][model8_pretrain.py][INFO] Epoch:[0/2](338400/4588595) loss:2.945 lr:0.0000100 epoch_Time:26919.0min: [2024-01-04 05:23:43,691][model8_pretrain.py][INFO] Epoch:[0/2](338500/4588595) loss:3.271 lr:0.0000100 
epoch_Time:26919.0min: [2024-01-04 05:23:43,691][model8_pretrain.py][INFO] Epoch:[0/2](338500/4588595) loss:3.210 lr:0.0000100 epoch_Time:26919.0min: [2024-01-04 05:23:43,691][model8_pretrain.py][INFO] Epoch:[0/2](338500/4588595) loss:2.744 lr:0.0000100 epoch_Time:26919.0min: [2024-01-04 05:23:43,691][model8_pretrain.py][INFO] Epoch:[0/2](338500/4588595) loss:2.938 lr:0.0000100 epoch_Time:26919.0min: [2024-01-04 05:23:43,691][model8_pretrain.py][INFO] Epoch:[0/2](338500/4588595) loss:3.463 lr:0.0000100 epoch_Time:26919.0min: [2024-01-04 05:23:43,691][model8_pretrain.py][INFO] Epoch:[0/2](338500/4588595) loss:3.105 lr:0.0000100 epoch_Time:26919.0min: [2024-01-04 05:23:43,691][model8_pretrain.py][INFO] Epoch:[0/2](338500/4588595) loss:2.826 lr:0.0000100 epoch_Time:26919.0min: [2024-01-04 05:23:43,691][model8_pretrain.py][INFO] Epoch:[0/2](338500/4588595) loss:3.071 lr:0.0000100 epoch_Time:26919.0min: [2024-01-04 05:24:20,624][model8_pretrain.py][INFO] Epoch:[0/2](338600/4588595) loss:2.978 lr:0.0000100 epoch_Time:26918.0min: [2024-01-04 05:24:20,624][model8_pretrain.py][INFO] Epoch:[0/2](338600/4588595) loss:2.958 lr:0.0000100 epoch_Time:26918.0min: [2024-01-04 05:24:20,624][model8_pretrain.py][INFO] Epoch:[0/2](338600/4588595) loss:3.301 lr:0.0000100 epoch_Time:26918.0min: [2024-01-04 05:24:20,624][model8_pretrain.py][INFO] Epoch:[0/2](338600/4588595) loss:2.878 lr:0.0000100 epoch_Time:26918.0min: [2024-01-04 05:24:20,624][model8_pretrain.py][INFO] Epoch:[0/2](338600/4588595) loss:3.051 lr:0.0000100 epoch_Time:26918.0min: [2024-01-04 05:24:20,624][model8_pretrain.py][INFO] Epoch:[0/2](338600/4588595) loss:3.160 lr:0.0000100 epoch_Time:26918.0min: [2024-01-04 05:24:20,624][model8_pretrain.py][INFO] Epoch:[0/2](338600/4588595) loss:3.159 lr:0.0000100 epoch_Time:26918.0min: [2024-01-04 05:24:20,624][model8_pretrain.py][INFO] Epoch:[0/2](338600/4588595) loss:3.233 lr:0.0000100 epoch_Time:26918.0min: [2024-01-04 05:24:57,559][model8_pretrain.py][INFO] 
Epoch:[0/2](338700/4588595) loss:2.706 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:24:57,559][model8_pretrain.py][INFO] Epoch:[0/2](338700/4588595) loss:2.997 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:24:57,559][model8_pretrain.py][INFO] Epoch:[0/2](338700/4588595) loss:3.365 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:24:57,559][model8_pretrain.py][INFO] Epoch:[0/2](338700/4588595) loss:2.842 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:24:57,559][model8_pretrain.py][INFO] Epoch:[0/2](338700/4588595) loss:2.743 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:24:57,559][model8_pretrain.py][INFO] Epoch:[0/2](338700/4588595) loss:2.554 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:24:57,559][model8_pretrain.py][INFO] Epoch:[0/2](338700/4588595) loss:2.105 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:24:57,560][model8_pretrain.py][INFO] Epoch:[0/2](338700/4588595) loss:2.822 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:25:34,488][model8_pretrain.py][INFO] Epoch:[0/2](338800/4588595) loss:3.147 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:25:34,488][model8_pretrain.py][INFO] Epoch:[0/2](338800/4588595) loss:2.569 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:25:34,488][model8_pretrain.py][INFO] Epoch:[0/2](338800/4588595) loss:2.864 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:25:34,488][model8_pretrain.py][INFO] Epoch:[0/2](338800/4588595) loss:2.578 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:25:34,488][model8_pretrain.py][INFO] Epoch:[0/2](338800/4588595) loss:2.729 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:25:34,488][model8_pretrain.py][INFO] Epoch:[0/2](338800/4588595) loss:3.328 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:25:34,488][model8_pretrain.py][INFO] Epoch:[0/2](338800/4588595) loss:2.774 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:25:34,488][model8_pretrain.py][INFO] Epoch:[0/2](338800/4588595) loss:3.223 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 
05:26:21,288][model8_pretrain.py][INFO] Epoch:[0/2](338900/4588595) loss:2.386 lr:0.0000100 epoch_Time:26917.0min: [2024-01-04 05:26:21,288][model8_pretrain.py][INFO] Epoch:[0/2](338900/4588595) loss:3.156 lr:0.0000100 epoch_Time:26917.0min: [2024-01-04 05:26:21,288][model8_pretrain.py][INFO] Epoch:[0/2](338900/4588595) loss:2.038 lr:0.0000100 epoch_Time:26917.0min: [2024-01-04 05:26:21,288][model8_pretrain.py][INFO] Epoch:[0/2](338900/4588595) loss:2.520 lr:0.0000100 epoch_Time:26917.0min: [2024-01-04 05:26:21,288][model8_pretrain.py][INFO] Epoch:[0/2](338900/4588595) loss:3.382 lr:0.0000100 epoch_Time:26917.0min: [2024-01-04 05:26:21,288][model8_pretrain.py][INFO] Epoch:[0/2](338900/4588595) loss:2.429 lr:0.0000100 epoch_Time:26917.0min: [2024-01-04 05:26:21,288][model8_pretrain.py][INFO] Epoch:[0/2](338900/4588595) loss:2.852 lr:0.0000100 epoch_Time:26917.0min: [2024-01-04 05:26:21,288][model8_pretrain.py][INFO] Epoch:[0/2](338900/4588595) loss:2.855 lr:0.0000100 epoch_Time:26917.0min: [2024-01-04 05:26:58,219][model8_pretrain.py][INFO] Epoch:[0/2](339000/4588595) loss:2.288 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:26:58,219][model8_pretrain.py][INFO] Epoch:[0/2](339000/4588595) loss:2.743 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:26:58,219][model8_pretrain.py][INFO] Epoch:[0/2](339000/4588595) loss:2.557 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:26:58,219][model8_pretrain.py][INFO] Epoch:[0/2](339000/4588595) loss:3.045 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:26:58,219][model8_pretrain.py][INFO] Epoch:[0/2](339000/4588595) loss:3.307 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:26:58,219][model8_pretrain.py][INFO] Epoch:[0/2](339000/4588595) loss:2.792 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:26:58,219][model8_pretrain.py][INFO] Epoch:[0/2](339000/4588595) loss:2.876 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:26:58,219][model8_pretrain.py][INFO] Epoch:[0/2](339000/4588595) loss:2.554 lr:0.0000100 
epoch_Time:26916.0min: [2024-01-04 05:27:35,153][model8_pretrain.py][INFO] Epoch:[0/2](339100/4588595) loss:2.518 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:27:35,153][model8_pretrain.py][INFO] Epoch:[0/2](339100/4588595) loss:3.022 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:27:35,153][model8_pretrain.py][INFO] Epoch:[0/2](339100/4588595) loss:2.608 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:27:35,153][model8_pretrain.py][INFO] Epoch:[0/2](339100/4588595) loss:2.741 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:27:35,153][model8_pretrain.py][INFO] Epoch:[0/2](339100/4588595) loss:2.999 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:27:35,153][model8_pretrain.py][INFO] Epoch:[0/2](339100/4588595) loss:2.837 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:27:35,153][model8_pretrain.py][INFO] Epoch:[0/2](339100/4588595) loss:2.790 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:27:35,153][model8_pretrain.py][INFO] Epoch:[0/2](339100/4588595) loss:2.552 lr:0.0000100 epoch_Time:26916.0min: [2024-01-04 05:28:12,086][model8_pretrain.py][INFO] Epoch:[0/2](339200/4588595) loss:2.974 lr:0.0000100 epoch_Time:26914.0min: [2024-01-04 05:28:12,086][model8_pretrain.py][INFO] Epoch:[0/2](339200/4588595) loss:2.949 lr:0.0000100 epoch_Time:26914.0min: [2024-01-04 05:28:12,086][model8_pretrain.py][INFO] Epoch:[0/2](339200/4588595) loss:3.019 lr:0.0000100 epoch_Time:26914.0min: [2024-01-04 05:28:12,086][model8_pretrain.py][INFO] Epoch:[0/2](339200/4588595) loss:2.989 lr:0.0000100 epoch_Time:26914.0min: [2024-01-04 05:28:12,086][model8_pretrain.py][INFO] Epoch:[0/2](339200/4588595) loss:3.484 lr:0.0000100 epoch_Time:26914.0min: [2024-01-04 05:28:12,086][model8_pretrain.py][INFO] Epoch:[0/2](339200/4588595) loss:3.235 lr:0.0000100 epoch_Time:26914.0min: [2024-01-04 05:28:12,086][model8_pretrain.py][INFO] Epoch:[0/2](339200/4588595) loss:2.548 lr:0.0000100 epoch_Time:26914.0min: [2024-01-04 05:28:12,086][model8_pretrain.py][INFO] 
Epoch:[0/2](339200/4588595) loss:2.861 lr:0.0000100 epoch_Time:26914.0min: [2024-01-04 05:28:49,014][model8_pretrain.py][INFO] Epoch:[0/2](339300/4588595) loss:2.959 lr:0.0000100 epoch_Time:26913.0min: [2024-01-04 05:28:49,014][model8_pretrain.py][INFO] Epoch:[0/2](339300/4588595) loss:2.552 lr:0.0000100 epoch_Time:26913.0min: [2024-01-04 05:28:49,014][model8_pretrain.py][INFO] Epoch:[0/2](339300/4588595) loss:2.620 lr:0.0000100 epoch_Time:26913.0min: [2024-01-04 05:28:49,014][model8_pretrain.py][INFO] Epoch:[0/2](339300/4588595) loss:3.144 lr:0.0000100 epoch_Time:26913.0min: [2024-01-04 05:28:49,014][model8_pretrain.py][INFO] Epoch:[0/2](339300/4588595) loss:3.447 lr:0.0000100 epoch_Time:26913.0min: [2024-01-04 05:28:49,014][model8_pretrain.py][INFO] Epoch:[0/2](339300/4588595) loss:2.996 lr:0.0000100 epoch_Time:26913.0min: [2024-01-04 05:28:49,014][model8_pretrain.py][INFO] Epoch:[0/2](339300/4588595) loss:2.289 lr:0.0000100 epoch_Time:26913.0min: [2024-01-04 05:28:49,015][model8_pretrain.py][INFO] Epoch:[0/2](339300/4588595) loss:2.021 lr:0.0000100 epoch_Time:26913.0min: [2024-01-04 05:29:25,942][model8_pretrain.py][INFO] Epoch:[0/2](339400/4588595) loss:3.324 lr:0.0000100 epoch_Time:26913.0min: [2024-01-04 05:29:25,942][model8_pretrain.py][INFO] Epoch:[0/2](339400/4588595) loss:2.751 lr:0.0000100 epoch_Time:26913.0min: [2024-01-04 05:29:25,942][model8_pretrain.py][INFO] Epoch:[0/2](339400/4588595) loss:2.344 lr:0.0000100 epoch_Time:26913.0min: [2024-01-04 05:29:25,942][model8_pretrain.py][INFO] Epoch:[0/2](339400/4588595) loss:2.341 lr:0.0000100 epoch_Time:26913.0min: [2024-01-04 05:29:25,942][model8_pretrain.py][INFO] Epoch:[0/2](339400/4588595) loss:2.949 lr:0.0000100 epoch_Time:26913.0min: [2024-01-04 05:29:25,942][model8_pretrain.py][INFO] Epoch:[0/2](339400/4588595) loss:2.531 lr:0.0000100 epoch_Time:26913.0min: [2024-01-04 05:29:25,942][model8_pretrain.py][INFO] Epoch:[0/2](339400/4588595) loss:2.963 lr:0.0000100 epoch_Time:26913.0min: [2024-01-04 
05:29:25,943][model8_pretrain.py][INFO] Epoch:[0/2](339400/4588595) loss:2.663 lr:0.0000100 epoch_Time:26913.0min: [2024-01-04 05:30:02,886][model8_pretrain.py][INFO] Epoch:[0/2](339500/4588595) loss:2.929 lr:0.0000100 epoch_Time:26912.0min: [2024-01-04 05:30:02,886][model8_pretrain.py][INFO] Epoch:[0/2](339500/4588595) loss:2.987 lr:0.0000100 epoch_Time:26912.0min: [2024-01-04 05:30:02,886][model8_pretrain.py][INFO] Epoch:[0/2](339500/4588595) loss:2.872 lr:0.0000100 epoch_Time:26912.0min: [2024-01-04 05:30:02,886][model8_pretrain.py][INFO] Epoch:[0/2](339500/4588595) loss:3.419 lr:0.0000100 epoch_Time:26912.0min: [2024-01-04 05:30:02,886][model8_pretrain.py][INFO] Epoch:[0/2](339500/4588595) loss:2.494 lr:0.0000100 epoch_Time:26912.0min: [2024-01-04 05:30:02,886][model8_pretrain.py][INFO] Epoch:[0/2](339500/4588595) loss:3.084 lr:0.0000100 epoch_Time:26912.0min: [2024-01-04 05:30:02,886][model8_pretrain.py][INFO] Epoch:[0/2](339500/4588595) loss:3.066 lr:0.0000100 epoch_Time:26912.0min: [2024-01-04 05:30:02,886][model8_pretrain.py][INFO] Epoch:[0/2](339500/4588595) loss:2.626 lr:0.0000100 epoch_Time:26912.0min: [2024-01-04 05:30:39,823][model8_pretrain.py][INFO] Epoch:[0/2](339600/4588595) loss:3.053 lr:0.0000100 epoch_Time:26911.0min: [2024-01-04 05:30:39,823][model8_pretrain.py][INFO] Epoch:[0/2](339600/4588595) loss:2.851 lr:0.0000100 epoch_Time:26911.0min: [2024-01-04 05:30:39,823][model8_pretrain.py][INFO] Epoch:[0/2](339600/4588595) loss:2.698 lr:0.0000100 epoch_Time:26911.0min: [2024-01-04 05:30:39,823][model8_pretrain.py][INFO] Epoch:[0/2](339600/4588595) loss:3.127 lr:0.0000100 epoch_Time:26911.0min: [2024-01-04 05:30:39,823][model8_pretrain.py][INFO] Epoch:[0/2](339600/4588595) loss:3.016 lr:0.0000100 epoch_Time:26911.0min: [2024-01-04 05:30:39,823][model8_pretrain.py][INFO] Epoch:[0/2](339600/4588595) loss:2.739 lr:0.0000100 epoch_Time:26911.0min: [2024-01-04 05:30:39,823][model8_pretrain.py][INFO] Epoch:[0/2](339600/4588595) loss:2.673 lr:0.0000100 
epoch_Time:26911.0min: [2024-01-04 05:30:39,823][model8_pretrain.py][INFO] Epoch:[0/2](339600/4588595) loss:2.579 lr:0.0000100 epoch_Time:26911.0min: [2024-01-04 05:31:26,960][model8_pretrain.py][INFO] Epoch:[0/2](339700/4588595) loss:2.203 lr:0.0000100 epoch_Time:26913.0min: [2024-01-04 05:31:26,960][model8_pretrain.py][INFO] Epoch:[0/2](339700/4588595) loss:2.944 lr:0.0000100 epoch_Time:26913.0min: [2024-01-04 05:31:26,960][model8_pretrain.py][INFO] Epoch:[0/2](339700/4588595) loss:3.147 lr:0.0000100 epoch_Time:26913.0min: [2024-01-04 05:31:26,960][model8_pretrain.py][INFO] Epoch:[0/2](339700/4588595) loss:2.768 lr:0.0000100 epoch_Time:26913.0min: [2024-01-04 05:31:26,961][model8_pretrain.py][INFO] Epoch:[0/2](339700/4588595) loss:2.706 lr:0.0000100 epoch_Time:26913.0min: [2024-01-04 05:31:26,961][model8_pretrain.py][INFO] Epoch:[0/2](339700/4588595) loss:2.297 lr:0.0000100 epoch_Time:26913.0min: [2024-01-04 05:31:26,961][model8_pretrain.py][INFO] Epoch:[0/2](339700/4588595) loss:2.889 lr:0.0000100 epoch_Time:26913.0min: [2024-01-04 05:31:26,961][model8_pretrain.py][INFO] Epoch:[0/2](339700/4588595) loss:2.988 lr:0.0000100 epoch_Time:26913.0min: [2024-01-04 05:32:03,887][model8_pretrain.py][INFO] Epoch:[0/2](339800/4588595) loss:3.067 lr:0.0000100 epoch_Time:26911.0min: [2024-01-04 05:32:03,887][model8_pretrain.py][INFO] Epoch:[0/2](339800/4588595) loss:3.000 lr:0.0000100 epoch_Time:26911.0min: [2024-01-04 05:32:03,887][model8_pretrain.py][INFO] Epoch:[0/2](339800/4588595) loss:3.035 lr:0.0000100 epoch_Time:26911.0min: [2024-01-04 05:32:03,887][model8_pretrain.py][INFO] Epoch:[0/2](339800/4588595) loss:3.189 lr:0.0000100 epoch_Time:26911.0min: [2024-01-04 05:32:03,887][model8_pretrain.py][INFO] Epoch:[0/2](339800/4588595) loss:2.934 lr:0.0000100 epoch_Time:26911.0min: [2024-01-04 05:32:03,887][model8_pretrain.py][INFO] Epoch:[0/2](339800/4588595) loss:2.976 lr:0.0000100 epoch_Time:26911.0min: [2024-01-04 05:32:03,887][model8_pretrain.py][INFO] 
Epoch:[0/2](339800/4588595) loss:2.912 lr:0.0000100 epoch_Time:26911.0min: [2024-01-04 05:32:03,887][model8_pretrain.py][INFO] Epoch:[0/2](339800/4588595) loss:3.093 lr:0.0000100 epoch_Time:26911.0min: [2024-01-04 05:32:40,819][model8_pretrain.py][INFO] Epoch:[0/2](339900/4588595) loss:2.790 lr:0.0000100 epoch_Time:26911.0min: [2024-01-04 05:32:40,820][model8_pretrain.py][INFO] Epoch:[0/2](339900/4588595) loss:2.163 lr:0.0000100 epoch_Time:26911.0min: [2024-01-04 05:32:40,820][model8_pretrain.py][INFO] Epoch:[0/2](339900/4588595) loss:2.818 lr:0.0000100 epoch_Time:26911.0min: [2024-01-04 05:32:40,820][model8_pretrain.py][INFO] Epoch:[0/2](339900/4588595) loss:2.654 lr:0.0000100 epoch_Time:26911.0min: [2024-01-04 05:32:40,820][model8_pretrain.py][INFO] Epoch:[0/2](339900/4588595) loss:3.204 lr:0.0000100 epoch_Time:26911.0min: [2024-01-04 05:32:40,820][model8_pretrain.py][INFO] Epoch:[0/2](339900/4588595) loss:2.391 lr:0.0000100 epoch_Time:26911.0min: [2024-01-04 05:32:40,820][model8_pretrain.py][INFO] Epoch:[0/2](339900/4588595) loss:2.772 lr:0.0000100 epoch_Time:26911.0min: [2024-01-04 05:32:40,820][model8_pretrain.py][INFO] Epoch:[0/2](339900/4588595) loss:2.834 lr:0.0000100 epoch_Time:26911.0min: [2024-01-04 05:33:17,752][model8_pretrain.py][INFO] Epoch:[0/2](340000/4588595) loss:2.905 lr:0.0000100 epoch_Time:26910.0min: [2024-01-04 05:33:17,752][model8_pretrain.py][INFO] Epoch:[0/2](340000/4588595) loss:3.111 lr:0.0000100 epoch_Time:26910.0min: [2024-01-04 05:33:17,752][model8_pretrain.py][INFO] Epoch:[0/2](340000/4588595) loss:1.980 lr:0.0000100 epoch_Time:26910.0min: [2024-01-04 05:33:17,752][model8_pretrain.py][INFO] Epoch:[0/2](340000/4588595) loss:3.104 lr:0.0000100 epoch_Time:26910.0min: [2024-01-04 05:33:17,752][model8_pretrain.py][INFO] Epoch:[0/2](340000/4588595) loss:2.754 lr:0.0000100 epoch_Time:26910.0min: [2024-01-04 05:33:17,752][model8_pretrain.py][INFO] Epoch:[0/2](340000/4588595) loss:3.213 lr:0.0000100 epoch_Time:26910.0min: [2024-01-04 
05:33:17,752][model8_pretrain.py][INFO] Epoch:[0/2](340000/4588595) loss:2.531 lr:0.0000100 epoch_Time:26910.0min: [2024-01-04 05:33:17,752][model8_pretrain.py][INFO] Epoch:[0/2](340000/4588595) loss:2.874 lr:0.0000100 epoch_Time:26910.0min: [2024-01-04 05:33:54,677][model8_pretrain.py][INFO] Epoch:[0/2](340100/4588595) loss:3.571 lr:0.0000100 epoch_Time:26909.0min: [2024-01-04 05:33:54,677][model8_pretrain.py][INFO] Epoch:[0/2](340100/4588595) loss:3.300 lr:0.0000100 epoch_Time:26909.0min: [2024-01-04 05:33:54,677][model8_pretrain.py][INFO] Epoch:[0/2](340100/4588595) loss:2.722 lr:0.0000100 epoch_Time:26909.0min: [2024-01-04 05:33:54,677][model8_pretrain.py][INFO] Epoch:[0/2](340100/4588595) loss:2.727 lr:0.0000100 epoch_Time:26909.0min: [2024-01-04 05:33:54,677][model8_pretrain.py][INFO] Epoch:[0/2](340100/4588595) loss:3.308 lr:0.0000100 epoch_Time:26909.0min: [2024-01-04 05:33:54,677][model8_pretrain.py][INFO] Epoch:[0/2](340100/4588595) loss:3.283 lr:0.0000100 epoch_Time:26909.0min: [2024-01-04 05:33:54,677][model8_pretrain.py][INFO] Epoch:[0/2](340100/4588595) loss:3.201 lr:0.0000100 epoch_Time:26909.0min: [2024-01-04 05:33:54,677][model8_pretrain.py][INFO] Epoch:[0/2](340100/4588595) loss:2.295 lr:0.0000100 epoch_Time:26909.0min: [2024-01-04 05:34:31,602][model8_pretrain.py][INFO] Epoch:[0/2](340200/4588595) loss:3.237 lr:0.0000100 epoch_Time:26908.0min: [2024-01-04 05:34:31,602][model8_pretrain.py][INFO] Epoch:[0/2](340200/4588595) loss:3.124 lr:0.0000100 epoch_Time:26908.0min: [2024-01-04 05:34:31,602][model8_pretrain.py][INFO] Epoch:[0/2](340200/4588595) loss:3.546 lr:0.0000100 epoch_Time:26908.0min: [2024-01-04 05:34:31,602][model8_pretrain.py][INFO] Epoch:[0/2](340200/4588595) loss:2.613 lr:0.0000100 epoch_Time:26908.0min: [2024-01-04 05:34:31,602][model8_pretrain.py][INFO] Epoch:[0/2](340200/4588595) loss:3.002 lr:0.0000100 epoch_Time:26908.0min: [2024-01-04 05:34:31,602][model8_pretrain.py][INFO] Epoch:[0/2](340200/4588595) loss:3.228 lr:0.0000100 
epoch_Time:26908.0min: [2024-01-04 05:34:31,603][model8_pretrain.py][INFO] Epoch:[0/2](340200/4588595) loss:2.658 lr:0.0000100 epoch_Time:26908.0min: [2024-01-04 05:34:31,603][model8_pretrain.py][INFO] Epoch:[0/2](340200/4588595) loss:2.878 lr:0.0000100 epoch_Time:26908.0min: [2024-01-04 05:35:08,525][model8_pretrain.py][INFO] Epoch:[0/2](340300/4588595) loss:2.514 lr:0.0000100 epoch_Time:26907.0min: [2024-01-04 05:35:08,526][model8_pretrain.py][INFO] Epoch:[0/2](340300/4588595) loss:3.178 lr:0.0000100 epoch_Time:26907.0min: [2024-01-04 05:35:08,526][model8_pretrain.py][INFO] Epoch:[0/2](340300/4588595) loss:2.909 lr:0.0000100 epoch_Time:26907.0min: [2024-01-04 05:35:08,526][model8_pretrain.py][INFO] Epoch:[0/2](340300/4588595) loss:2.450 lr:0.0000100 epoch_Time:26907.0min: [2024-01-04 05:35:08,526][model8_pretrain.py][INFO] Epoch:[0/2](340300/4588595) loss:2.406 lr:0.0000100 epoch_Time:26907.0min: [2024-01-04 05:35:08,526][model8_pretrain.py][INFO] Epoch:[0/2](340300/4588595) loss:3.203 lr:0.0000100 epoch_Time:26907.0min: [2024-01-04 05:35:08,526][model8_pretrain.py][INFO] Epoch:[0/2](340300/4588595) loss:3.405 lr:0.0000100 epoch_Time:26907.0min: [2024-01-04 05:35:08,526][model8_pretrain.py][INFO] Epoch:[0/2](340300/4588595) loss:2.956 lr:0.0000100 epoch_Time:26907.0min: [2024-01-04 05:35:45,447][model8_pretrain.py][INFO] Epoch:[0/2](340400/4588595) loss:2.703 lr:0.0000100 epoch_Time:26907.0min: [2024-01-04 05:35:45,447][model8_pretrain.py][INFO] Epoch:[0/2](340400/4588595) loss:2.714 lr:0.0000100 epoch_Time:26907.0min: [2024-01-04 05:35:45,447][model8_pretrain.py][INFO] Epoch:[0/2](340400/4588595) loss:3.021 lr:0.0000100 epoch_Time:26907.0min: [2024-01-04 05:35:45,447][model8_pretrain.py][INFO] Epoch:[0/2](340400/4588595) loss:3.516 lr:0.0000100 epoch_Time:26907.0min: [2024-01-04 05:35:45,447][model8_pretrain.py][INFO] Epoch:[0/2](340400/4588595) loss:2.424 lr:0.0000100 epoch_Time:26907.0min: [2024-01-04 05:35:45,447][model8_pretrain.py][INFO] 
Epoch:[0/2](340400/4588595) loss:2.978 lr:0.0000100 epoch_Time:26907.0min: [2024-01-04 05:35:45,447][model8_pretrain.py][INFO] Epoch:[0/2](340400/4588595) loss:2.766 lr:0.0000100 epoch_Time:26907.0min: [2024-01-04 05:35:45,447][model8_pretrain.py][INFO] Epoch:[0/2](340400/4588595) loss:2.993 lr:0.0000100 epoch_Time:26907.0min: [2024-01-04 05:36:32,764][model8_pretrain.py][INFO] Epoch:[0/2](340500/4588595) loss:2.980 lr:0.0000100 epoch_Time:26908.0min: [2024-01-04 05:36:32,764][model8_pretrain.py][INFO] Epoch:[0/2](340500/4588595) loss:2.697 lr:0.0000100 epoch_Time:26908.0min: [2024-01-04 05:36:32,764][model8_pretrain.py][INFO] Epoch:[0/2](340500/4588595) loss:3.129 lr:0.0000100 epoch_Time:26908.0min: [2024-01-04 05:36:32,764][model8_pretrain.py][INFO] Epoch:[0/2](340500/4588595) loss:3.053 lr:0.0000100 epoch_Time:26908.0min: [2024-01-04 05:36:32,764][model8_pretrain.py][INFO] Epoch:[0/2](340500/4588595) loss:2.675 lr:0.0000100 epoch_Time:26908.0min: [2024-01-04 05:36:32,764][model8_pretrain.py][INFO] Epoch:[0/2](340500/4588595) loss:2.822 lr:0.0000100 epoch_Time:26908.0min: [2024-01-04 05:36:32,764][model8_pretrain.py][INFO] Epoch:[0/2](340500/4588595) loss:2.940 lr:0.0000100 epoch_Time:26908.0min: [2024-01-04 05:36:32,764][model8_pretrain.py][INFO] Epoch:[0/2](340500/4588595) loss:2.523 lr:0.0000100 epoch_Time:26908.0min: [2024-01-04 05:37:09,699][model8_pretrain.py][INFO] Epoch:[0/2](340600/4588595) loss:2.605 lr:0.0000100 epoch_Time:26907.0min: [2024-01-04 05:37:09,699][model8_pretrain.py][INFO] Epoch:[0/2](340600/4588595) loss:3.211 lr:0.0000100 epoch_Time:26907.0min: [2024-01-04 05:37:09,699][model8_pretrain.py][INFO] Epoch:[0/2](340600/4588595) loss:2.735 lr:0.0000100 epoch_Time:26907.0min: [2024-01-04 05:37:09,699][model8_pretrain.py][INFO] Epoch:[0/2](340600/4588595) loss:3.256 lr:0.0000100 epoch_Time:26907.0min: [2024-01-04 05:37:09,699][model8_pretrain.py][INFO] Epoch:[0/2](340600/4588595) loss:3.435 lr:0.0000100 epoch_Time:26907.0min: [2024-01-04 
05:37:09,699][model8_pretrain.py][INFO] Epoch:[0/2](340600/4588595) loss:3.125 lr:0.0000100 epoch_Time:26907.0min: [2024-01-04 05:37:09,699][model8_pretrain.py][INFO] Epoch:[0/2](340600/4588595) loss:2.368 lr:0.0000100 epoch_Time:26907.0min: [2024-01-04 05:37:09,699][model8_pretrain.py][INFO] Epoch:[0/2](340600/4588595) loss:2.729 lr:0.0000100 epoch_Time:26907.0min: [2024-01-04 05:37:46,640][model8_pretrain.py][INFO] Epoch:[0/2](340700/4588595) loss:2.376 lr:0.0000100 epoch_Time:26906.0min: [2024-01-04 05:37:46,640][model8_pretrain.py][INFO] Epoch:[0/2](340700/4588595) loss:2.770 lr:0.0000100 epoch_Time:26906.0min: [2024-01-04 05:37:46,640][model8_pretrain.py][INFO] Epoch:[0/2](340700/4588595) loss:3.046 lr:0.0000100 epoch_Time:26906.0min: [2024-01-04 05:37:46,640][model8_pretrain.py][INFO] Epoch:[0/2](340700/4588595) loss:3.171 lr:0.0000100 epoch_Time:26906.0min: [2024-01-04 05:37:46,640][model8_pretrain.py][INFO] Epoch:[0/2](340700/4588595) loss:3.198 lr:0.0000100 epoch_Time:26906.0min: [2024-01-04 05:37:46,640][model8_pretrain.py][INFO] Epoch:[0/2](340700/4588595) loss:3.259 lr:0.0000100 epoch_Time:26906.0min: [2024-01-04 05:37:46,640][model8_pretrain.py][INFO] Epoch:[0/2](340700/4588595) loss:2.854 lr:0.0000100 epoch_Time:26906.0min: [2024-01-04 05:37:46,640][model8_pretrain.py][INFO] Epoch:[0/2](340700/4588595) loss:2.341 lr:0.0000100 epoch_Time:26906.0min: [2024-01-04 05:38:23,588][model8_pretrain.py][INFO] Epoch:[0/2](340800/4588595) loss:2.870 lr:0.0000100 epoch_Time:26905.0min: [2024-01-04 05:38:23,588][model8_pretrain.py][INFO] Epoch:[0/2](340800/4588595) loss:3.475 lr:0.0000100 epoch_Time:26905.0min: [2024-01-04 05:38:23,588][model8_pretrain.py][INFO] Epoch:[0/2](340800/4588595) loss:3.645 lr:0.0000100 epoch_Time:26905.0min: [2024-01-04 05:38:23,588][model8_pretrain.py][INFO] Epoch:[0/2](340800/4588595) loss:2.940 lr:0.0000100 epoch_Time:26905.0min: [2024-01-04 05:38:23,588][model8_pretrain.py][INFO] Epoch:[0/2](340800/4588595) loss:3.003 lr:0.0000100 
epoch_Time:26905.0min: [2024-01-04 05:38:23,588][model8_pretrain.py][INFO] Epoch:[0/2](340800/4588595) loss:3.021 lr:0.0000100 epoch_Time:26905.0min: [2024-01-04 05:38:23,588][model8_pretrain.py][INFO] Epoch:[0/2](340800/4588595) loss:2.845 lr:0.0000100 epoch_Time:26905.0min: [2024-01-04 05:38:23,589][model8_pretrain.py][INFO] Epoch:[0/2](340800/4588595) loss:2.644 lr:0.0000100 epoch_Time:26905.0min: [2024-01-04 05:39:00,540][model8_pretrain.py][INFO] Epoch:[0/2](340900/4588595) loss:3.061 lr:0.0000100 epoch_Time:26904.0min: [2024-01-04 05:39:00,540][model8_pretrain.py][INFO] Epoch:[0/2](340900/4588595) loss:2.772 lr:0.0000100 epoch_Time:26904.0min: [2024-01-04 05:39:00,540][model8_pretrain.py][INFO] Epoch:[0/2](340900/4588595) loss:2.947 lr:0.0000100 epoch_Time:26904.0min: [2024-01-04 05:39:00,540][model8_pretrain.py][INFO] Epoch:[0/2](340900/4588595) loss:3.504 lr:0.0000100 epoch_Time:26904.0min: [2024-01-04 05:39:00,540][model8_pretrain.py][INFO] Epoch:[0/2](340900/4588595) loss:2.963 lr:0.0000100 epoch_Time:26904.0min: [2024-01-04 05:39:00,540][model8_pretrain.py][INFO] Epoch:[0/2](340900/4588595) loss:2.329 lr:0.0000100 epoch_Time:26904.0min: [2024-01-04 05:39:00,540][model8_pretrain.py][INFO] Epoch:[0/2](340900/4588595) loss:3.024 lr:0.0000100 epoch_Time:26904.0min: [2024-01-04 05:39:00,540][model8_pretrain.py][INFO] Epoch:[0/2](340900/4588595) loss:3.309 lr:0.0000100 epoch_Time:26904.0min: [2024-01-04 05:39:37,476][model8_pretrain.py][INFO] Epoch:[0/2](341000/4588595) loss:3.019 lr:0.0000100 epoch_Time:26904.0min: [2024-01-04 05:39:37,476][model8_pretrain.py][INFO] Epoch:[0/2](341000/4588595) loss:2.623 lr:0.0000100 epoch_Time:26904.0min: [2024-01-04 05:39:37,476][model8_pretrain.py][INFO] Epoch:[0/2](341000/4588595) loss:2.129 lr:0.0000100 epoch_Time:26904.0min: [2024-01-04 05:39:37,476][model8_pretrain.py][INFO] Epoch:[0/2](341000/4588595) loss:3.072 lr:0.0000100 epoch_Time:26904.0min: [2024-01-04 05:39:37,476][model8_pretrain.py][INFO] 
Epoch:[0/2](341000/4588595) loss:2.636 lr:0.0000100 epoch_Time:26904.0min: [2024-01-04 05:39:37,477][model8_pretrain.py][INFO] Epoch:[0/2](341000/4588595) loss:2.820 lr:0.0000100 epoch_Time:26904.0min: [2024-01-04 05:39:37,477][model8_pretrain.py][INFO] Epoch:[0/2](341000/4588595) loss:3.035 lr:0.0000100 epoch_Time:26904.0min: [2024-01-04 05:39:37,477][model8_pretrain.py][INFO] Epoch:[0/2](341000/4588595) loss:2.691 lr:0.0000100 epoch_Time:26904.0min: [2024-01-04 05:40:14,404][model8_pretrain.py][INFO] Epoch:[0/2](341100/4588595) loss:3.146 lr:0.0000100 epoch_Time:26902.0min: [2024-01-04 05:40:14,404][model8_pretrain.py][INFO] Epoch:[0/2](341100/4588595) loss:2.843 lr:0.0000100 epoch_Time:26902.0min: [2024-01-04 05:40:14,404][model8_pretrain.py][INFO] Epoch:[0/2](341100/4588595) loss:2.935 lr:0.0000100 epoch_Time:26902.0min: [2024-01-04 05:40:14,404][model8_pretrain.py][INFO] Epoch:[0/2](341100/4588595) loss:2.747 lr:0.0000100 epoch_Time:26902.0min: [2024-01-04 05:40:14,404][model8_pretrain.py][INFO] Epoch:[0/2](341100/4588595) loss:2.836 lr:0.0000100 epoch_Time:26902.0min: [2024-01-04 05:40:14,404][model8_pretrain.py][INFO] Epoch:[0/2](341100/4588595) loss:2.595 lr:0.0000100 epoch_Time:26902.0min: [2024-01-04 05:40:14,404][model8_pretrain.py][INFO] Epoch:[0/2](341100/4588595) loss:2.897 lr:0.0000100 epoch_Time:26902.0min: [2024-01-04 05:40:14,404][model8_pretrain.py][INFO] Epoch:[0/2](341100/4588595) loss:2.973 lr:0.0000100 epoch_Time:26902.0min: [2024-01-04 05:40:51,340][model8_pretrain.py][INFO] Epoch:[0/2](341200/4588595) loss:3.236 lr:0.0000100 epoch_Time:26901.0min: [2024-01-04 05:40:51,340][model8_pretrain.py][INFO] Epoch:[0/2](341200/4588595) loss:3.082 lr:0.0000100 epoch_Time:26901.0min: [2024-01-04 05:40:51,340][model8_pretrain.py][INFO] Epoch:[0/2](341200/4588595) loss:2.363 lr:0.0000100 epoch_Time:26901.0min: [2024-01-04 05:40:51,340][model8_pretrain.py][INFO] Epoch:[0/2](341200/4588595) loss:2.532 lr:0.0000100 epoch_Time:26901.0min: [2024-01-04 
05:40:51,340][model8_pretrain.py][INFO] Epoch:[0/2](341200/4588595) loss:2.953 lr:0.0000100 epoch_Time:26901.0min: [2024-01-04 05:40:51,340][model8_pretrain.py][INFO] Epoch:[0/2](341200/4588595) loss:2.940 lr:0.0000100 epoch_Time:26901.0min: [2024-01-04 05:40:51,340][model8_pretrain.py][INFO] Epoch:[0/2](341200/4588595) loss:2.502 lr:0.0000100 epoch_Time:26901.0min: [2024-01-04 05:40:51,340][model8_pretrain.py][INFO] Epoch:[0/2](341200/4588595) loss:3.473 lr:0.0000100 epoch_Time:26901.0min: [2024-01-04 05:41:38,708][model8_pretrain.py][INFO] Epoch:[0/2](341300/4588595) loss:3.237 lr:0.0000100 epoch_Time:26903.0min: [2024-01-04 05:41:38,708][model8_pretrain.py][INFO] Epoch:[0/2](341300/4588595) loss:3.164 lr:0.0000100 epoch_Time:26903.0min: [2024-01-04 05:41:38,708][model8_pretrain.py][INFO] Epoch:[0/2](341300/4588595) loss:2.991 lr:0.0000100 epoch_Time:26903.0min: [2024-01-04 05:41:38,708][model8_pretrain.py][INFO] Epoch:[0/2](341300/4588595) loss:2.899 lr:0.0000100 epoch_Time:26903.0min: [2024-01-04 05:41:38,708][model8_pretrain.py][INFO] Epoch:[0/2](341300/4588595) loss:2.855 lr:0.0000100 epoch_Time:26903.0min: [2024-01-04 05:41:38,708][model8_pretrain.py][INFO] Epoch:[0/2](341300/4588595) loss:2.628 lr:0.0000100 epoch_Time:26903.0min: [2024-01-04 05:41:38,708][model8_pretrain.py][INFO] Epoch:[0/2](341300/4588595) loss:3.082 lr:0.0000100 epoch_Time:26903.0min: [2024-01-04 05:41:38,708][model8_pretrain.py][INFO] Epoch:[0/2](341300/4588595) loss:2.532 lr:0.0000100 epoch_Time:26903.0min: [2024-01-04 05:42:15,627][model8_pretrain.py][INFO] Epoch:[0/2](341400/4588595) loss:3.017 lr:0.0000100 epoch_Time:26902.0min: [2024-01-04 05:42:15,627][model8_pretrain.py][INFO] Epoch:[0/2](341400/4588595) loss:2.320 lr:0.0000100 epoch_Time:26902.0min: [2024-01-04 05:42:15,627][model8_pretrain.py][INFO] Epoch:[0/2](341400/4588595) loss:2.599 lr:0.0000100 epoch_Time:26902.0min: [2024-01-04 05:42:15,627][model8_pretrain.py][INFO] Epoch:[0/2](341400/4588595) loss:2.968 lr:0.0000100 
epoch_Time:26902.0min: [2024-01-04 05:42:15,627][model8_pretrain.py][INFO] Epoch:[0/2](341400/4588595) loss:3.452 lr:0.0000100 epoch_Time:26902.0min: [2024-01-04 05:42:15,627][model8_pretrain.py][INFO] Epoch:[0/2](341400/4588595) loss:2.733 lr:0.0000100 epoch_Time:26902.0min: [2024-01-04 05:42:15,627][model8_pretrain.py][INFO] Epoch:[0/2](341400/4588595) loss:3.236 lr:0.0000100 epoch_Time:26902.0min: [2024-01-04 05:42:15,627][model8_pretrain.py][INFO] Epoch:[0/2](341400/4588595) loss:2.241 lr:0.0000100 epoch_Time:26902.0min: [2024-01-04 05:42:52,558][model8_pretrain.py][INFO] Epoch:[0/2](341500/4588595) loss:2.744 lr:0.0000100 epoch_Time:26901.0min: [2024-01-04 05:42:52,558][model8_pretrain.py][INFO] Epoch:[0/2](341500/4588595) loss:2.392 lr:0.0000100 epoch_Time:26901.0min: [2024-01-04 05:42:52,558][model8_pretrain.py][INFO] Epoch:[0/2](341500/4588595) loss:3.127 lr:0.0000100 epoch_Time:26901.0min: [2024-01-04 05:42:52,558][model8_pretrain.py][INFO] Epoch:[0/2](341500/4588595) loss:2.974 lr:0.0000100 epoch_Time:26901.0min: [2024-01-04 05:42:52,558][model8_pretrain.py][INFO] Epoch:[0/2](341500/4588595) loss:3.235 lr:0.0000100 epoch_Time:26901.0min: [2024-01-04 05:42:52,558][model8_pretrain.py][INFO] Epoch:[0/2](341500/4588595) loss:3.377 lr:0.0000100 epoch_Time:26901.0min: [2024-01-04 05:42:52,558][model8_pretrain.py][INFO] Epoch:[0/2](341500/4588595) loss:3.064 lr:0.0000100 epoch_Time:26901.0min: [2024-01-04 05:42:52,558][model8_pretrain.py][INFO] Epoch:[0/2](341500/4588595) loss:2.809 lr:0.0000100 epoch_Time:26901.0min: [2024-01-04 05:43:29,523][model8_pretrain.py][INFO] Epoch:[0/2](341600/4588595) loss:3.019 lr:0.0000100 epoch_Time:26901.0min: [2024-01-04 05:43:29,523][model8_pretrain.py][INFO] Epoch:[0/2](341600/4588595) loss:2.730 lr:0.0000100 epoch_Time:26901.0min: [2024-01-04 05:43:29,523][model8_pretrain.py][INFO] Epoch:[0/2](341600/4588595) loss:3.001 lr:0.0000100 epoch_Time:26901.0min: [2024-01-04 05:43:29,523][model8_pretrain.py][INFO] 
Epoch:[0/2](341600/4588595) loss:2.424 lr:0.0000100 epoch_Time:26901.0min: [2024-01-04 05:43:29,523][model8_pretrain.py][INFO] Epoch:[0/2](341600/4588595) loss:2.868 lr:0.0000100 epoch_Time:26901.0min: [2024-01-04 05:43:29,523][model8_pretrain.py][INFO] Epoch:[0/2](341600/4588595) loss:3.009 lr:0.0000100 epoch_Time:26901.0min: [2024-01-04 05:43:29,523][model8_pretrain.py][INFO] Epoch:[0/2](341600/4588595) loss:2.761 lr:0.0000100 epoch_Time:26901.0min: [2024-01-04 05:43:29,523][model8_pretrain.py][INFO] Epoch:[0/2](341600/4588595) loss:2.999 lr:0.0000100 epoch_Time:26901.0min: [2024-01-04 05:44:06,488][model8_pretrain.py][INFO] Epoch:[0/2](341700/4588595) loss:2.765 lr:0.0000100 epoch_Time:26899.0min: [2024-01-04 05:44:06,488][model8_pretrain.py][INFO] Epoch:[0/2](341700/4588595) loss:2.997 lr:0.0000100 epoch_Time:26899.0min: [2024-01-04 05:44:06,488][model8_pretrain.py][INFO] Epoch:[0/2](341700/4588595) loss:2.419 lr:0.0000100 epoch_Time:26899.0min: [2024-01-04 05:44:06,488][model8_pretrain.py][INFO] Epoch:[0/2](341700/4588595) loss:3.039 lr:0.0000100 epoch_Time:26899.0min: [2024-01-04 05:44:06,488][model8_pretrain.py][INFO] Epoch:[0/2](341700/4588595) loss:2.750 lr:0.0000100 epoch_Time:26899.0min: [2024-01-04 05:44:06,488][model8_pretrain.py][INFO] Epoch:[0/2](341700/4588595) loss:3.181 lr:0.0000100 epoch_Time:26899.0min: [2024-01-04 05:44:06,488][model8_pretrain.py][INFO] Epoch:[0/2](341700/4588595) loss:2.765 lr:0.0000100 epoch_Time:26899.0min: [2024-01-04 05:44:06,488][model8_pretrain.py][INFO] Epoch:[0/2](341700/4588595) loss:3.068 lr:0.0000100 epoch_Time:26899.0min: [2024-01-04 05:44:43,437][model8_pretrain.py][INFO] Epoch:[0/2](341800/4588595) loss:2.354 lr:0.0000100 epoch_Time:26899.0min: [2024-01-04 05:44:43,437][model8_pretrain.py][INFO] Epoch:[0/2](341800/4588595) loss:3.189 lr:0.0000100 epoch_Time:26899.0min: [2024-01-04 05:44:43,437][model8_pretrain.py][INFO] Epoch:[0/2](341800/4588595) loss:2.986 lr:0.0000100 epoch_Time:26899.0min: [2024-01-04 
05:44:43,437][model8_pretrain.py][INFO] Epoch:[0/2](341800/4588595) loss:2.952 lr:0.0000100 epoch_Time:26899.0min: [2024-01-04 05:44:43,437][model8_pretrain.py][INFO] Epoch:[0/2](341800/4588595) loss:3.217 lr:0.0000100 epoch_Time:26899.0min: [2024-01-04 05:44:43,437][model8_pretrain.py][INFO] Epoch:[0/2](341800/4588595) loss:2.789 lr:0.0000100 epoch_Time:26899.0min: [2024-01-04 05:44:43,437][model8_pretrain.py][INFO] Epoch:[0/2](341800/4588595) loss:3.544 lr:0.0000100 epoch_Time:26899.0min: [2024-01-04 05:44:43,437][model8_pretrain.py][INFO] Epoch:[0/2](341800/4588595) loss:3.166 lr:0.0000100 epoch_Time:26899.0min: [2024-01-04 05:45:20,396][model8_pretrain.py][INFO] Epoch:[0/2](341900/4588595) loss:2.576 lr:0.0000100 epoch_Time:26898.0min: [2024-01-04 05:45:20,396][model8_pretrain.py][INFO] Epoch:[0/2](341900/4588595) loss:2.476 lr:0.0000100 epoch_Time:26898.0min: [2024-01-04 05:45:20,396][model8_pretrain.py][INFO] Epoch:[0/2](341900/4588595) loss:2.857 lr:0.0000100 epoch_Time:26898.0min: [2024-01-04 05:45:20,396][model8_pretrain.py][INFO] Epoch:[0/2](341900/4588595) loss:2.433 lr:0.0000100 epoch_Time:26898.0min: [2024-01-04 05:45:20,396][model8_pretrain.py][INFO] Epoch:[0/2](341900/4588595) loss:2.585 lr:0.0000100 epoch_Time:26898.0min: [2024-01-04 05:45:20,396][model8_pretrain.py][INFO] Epoch:[0/2](341900/4588595) loss:3.098 lr:0.0000100 epoch_Time:26898.0min: [2024-01-04 05:45:20,396][model8_pretrain.py][INFO] Epoch:[0/2](341900/4588595) loss:3.512 lr:0.0000100 epoch_Time:26898.0min: [2024-01-04 05:45:20,396][model8_pretrain.py][INFO] Epoch:[0/2](341900/4588595) loss:2.991 lr:0.0000100 epoch_Time:26898.0min: [2024-01-04 05:45:57,335][model8_pretrain.py][INFO] Epoch:[0/2](342000/4588595) loss:3.200 lr:0.0000100 epoch_Time:26897.0min: [2024-01-04 05:45:57,335][model8_pretrain.py][INFO] Epoch:[0/2](342000/4588595) loss:3.071 lr:0.0000100 epoch_Time:26897.0min: [2024-01-04 05:45:57,335][model8_pretrain.py][INFO] Epoch:[0/2](342000/4588595) loss:2.348 lr:0.0000100 
epoch_Time:26897.0min: [2024-01-04 05:45:57,335][model8_pretrain.py][INFO] Epoch:[0/2](342000/4588595) loss:2.640 lr:0.0000100 epoch_Time:26897.0min: [2024-01-04 05:45:57,335][model8_pretrain.py][INFO] Epoch:[0/2](342000/4588595) loss:2.075 lr:0.0000100 epoch_Time:26897.0min: [2024-01-04 05:45:57,335][model8_pretrain.py][INFO] Epoch:[0/2](342000/4588595) loss:3.079 lr:0.0000100 epoch_Time:26897.0min: [2024-01-04 05:45:57,335][model8_pretrain.py][INFO] Epoch:[0/2](342000/4588595) loss:2.814 lr:0.0000100 epoch_Time:26897.0min: [2024-01-04 05:45:57,336][model8_pretrain.py][INFO] Epoch:[0/2](342000/4588595) loss:2.781 lr:0.0000100 epoch_Time:26897.0min: [2024-01-04 05:46:44,652][model8_pretrain.py][INFO] Epoch:[0/2](342100/4588595) loss:3.126 lr:0.0000100 epoch_Time:26899.0min: [2024-01-04 05:46:44,652][model8_pretrain.py][INFO] Epoch:[0/2](342100/4588595) loss:3.068 lr:0.0000100 epoch_Time:26899.0min: [2024-01-04 05:46:44,652][model8_pretrain.py][INFO] Epoch:[0/2](342100/4588595) loss:2.822 lr:0.0000100 epoch_Time:26899.0min: [2024-01-04 05:46:44,652][model8_pretrain.py][INFO] Epoch:[0/2](342100/4588595) loss:2.718 lr:0.0000100 epoch_Time:26899.0min: [2024-01-04 05:46:44,652][model8_pretrain.py][INFO] Epoch:[0/2](342100/4588595) loss:2.748 lr:0.0000100 epoch_Time:26899.0min: [2024-01-04 05:46:44,652][model8_pretrain.py][INFO] Epoch:[0/2](342100/4588595) loss:2.953 lr:0.0000100 epoch_Time:26899.0min: [2024-01-04 05:46:44,652][model8_pretrain.py][INFO] Epoch:[0/2](342100/4588595) loss:2.997 lr:0.0000100 epoch_Time:26899.0min: [2024-01-04 05:46:44,653][model8_pretrain.py][INFO] Epoch:[0/2](342100/4588595) loss:3.037 lr:0.0000100 epoch_Time:26899.0min: [2024-01-04 05:47:21,590][model8_pretrain.py][INFO] Epoch:[0/2](342200/4588595) loss:2.624 lr:0.0000100 epoch_Time:26898.0min: [2024-01-04 05:47:21,590][model8_pretrain.py][INFO] Epoch:[0/2](342200/4588595) loss:3.617 lr:0.0000100 epoch_Time:26898.0min: [2024-01-04 05:47:21,590][model8_pretrain.py][INFO] 
Epoch:[0/2](342200/4588595) loss:2.902 lr:0.0000100 epoch_Time:26898.0min: [2024-01-04 05:47:21,590][model8_pretrain.py][INFO] Epoch:[0/2](342200/4588595) loss:2.480 lr:0.0000100 epoch_Time:26898.0min: [2024-01-04 05:47:21,590][model8_pretrain.py][INFO] Epoch:[0/2](342200/4588595) loss:3.205 lr:0.0000100 epoch_Time:26898.0min: [2024-01-04 05:47:21,590][model8_pretrain.py][INFO] Epoch:[0/2](342200/4588595) loss:3.174 lr:0.0000100 epoch_Time:26898.0min: [2024-01-04 05:47:21,590][model8_pretrain.py][INFO] Epoch:[0/2](342200/4588595) loss:2.704 lr:0.0000100 epoch_Time:26898.0min: [2024-01-04 05:47:21,591][model8_pretrain.py][INFO] Epoch:[0/2](342200/4588595) loss:3.066 lr:0.0000100 epoch_Time:26898.0min: [2024-01-04 05:47:58,594][model8_pretrain.py][INFO] Epoch:[0/2](342300/4588595) loss:3.306 lr:0.0000100 epoch_Time:26896.0min: [2024-01-04 05:47:58,594][model8_pretrain.py][INFO] Epoch:[0/2](342300/4588595) loss:3.466 lr:0.0000100 epoch_Time:26896.0min: [2024-01-04 05:47:58,594][model8_pretrain.py][INFO] Epoch:[0/2](342300/4588595) loss:3.177 lr:0.0000100 epoch_Time:26896.0min: [2024-01-04 05:47:58,594][model8_pretrain.py][INFO] Epoch:[0/2](342300/4588595) loss:3.495 lr:0.0000100 epoch_Time:26896.0min: [2024-01-04 05:47:58,594][model8_pretrain.py][INFO] Epoch:[0/2](342300/4588595) loss:2.527 lr:0.0000100 epoch_Time:26896.0min: [2024-01-04 05:47:58,595][model8_pretrain.py][INFO] Epoch:[0/2](342300/4588595) loss:2.366 lr:0.0000100 epoch_Time:26896.0min: [2024-01-04 05:47:58,595][model8_pretrain.py][INFO] Epoch:[0/2](342300/4588595) loss:3.363 lr:0.0000100 epoch_Time:26896.0min: [2024-01-04 05:47:58,596][model8_pretrain.py][INFO] Epoch:[0/2](342300/4588595) loss:2.552 lr:0.0000100 epoch_Time:26896.0min: [2024-01-04 05:48:35,576][model8_pretrain.py][INFO] Epoch:[0/2](342400/4588595) loss:2.075 lr:0.0000100 epoch_Time:26896.0min: [2024-01-04 05:48:35,576][model8_pretrain.py][INFO] Epoch:[0/2](342400/4588595) loss:3.095 lr:0.0000100 epoch_Time:26896.0min: [2024-01-04 
05:48:35,576][model8_pretrain.py][INFO] Epoch:[0/2](342400/4588595) loss:3.303 lr:0.0000100 epoch_Time:26896.0min: [2024-01-04 05:48:35,576][model8_pretrain.py][INFO] Epoch:[0/2](342400/4588595) loss:3.115 lr:0.0000100 epoch_Time:26896.0min: [2024-01-04 05:48:35,576][model8_pretrain.py][INFO] Epoch:[0/2](342400/4588595) loss:2.940 lr:0.0000100 epoch_Time:26896.0min: [2024-01-04 05:48:35,576][model8_pretrain.py][INFO] Epoch:[0/2](342400/4588595) loss:2.216 lr:0.0000100 epoch_Time:26896.0min: [2024-01-04 05:48:35,576][model8_pretrain.py][INFO] Epoch:[0/2](342400/4588595) loss:2.657 lr:0.0000100 epoch_Time:26896.0min: [2024-01-04 05:48:35,576][model8_pretrain.py][INFO] Epoch:[0/2](342400/4588595) loss:2.563 lr:0.0000100 epoch_Time:26896.0min: [2024-01-04 05:49:12,552][model8_pretrain.py][INFO] Epoch:[0/2](342500/4588595) loss:3.189 lr:0.0000100 epoch_Time:26895.0min: [2024-01-04 05:49:12,552][model8_pretrain.py][INFO] Epoch:[0/2](342500/4588595) loss:2.762 lr:0.0000100 epoch_Time:26895.0min: [2024-01-04 05:49:12,552][model8_pretrain.py][INFO] Epoch:[0/2](342500/4588595) loss:2.836 lr:0.0000100 epoch_Time:26895.0min: [2024-01-04 05:49:12,552][model8_pretrain.py][INFO] Epoch:[0/2](342500/4588595) loss:3.070 lr:0.0000100 epoch_Time:26895.0min: [2024-01-04 05:49:12,552][model8_pretrain.py][INFO] Epoch:[0/2](342500/4588595) loss:2.629 lr:0.0000100 epoch_Time:26895.0min: [2024-01-04 05:49:12,552][model8_pretrain.py][INFO] Epoch:[0/2](342500/4588595) loss:2.705 lr:0.0000100 epoch_Time:26895.0min: [2024-01-04 05:49:12,552][model8_pretrain.py][INFO] Epoch:[0/2](342500/4588595) loss:2.868 lr:0.0000100 epoch_Time:26895.0min: [2024-01-04 05:49:12,552][model8_pretrain.py][INFO] Epoch:[0/2](342500/4588595) loss:3.209 lr:0.0000100 epoch_Time:26895.0min: [2024-01-04 05:49:49,510][model8_pretrain.py][INFO] Epoch:[0/2](342600/4588595) loss:3.244 lr:0.0000100 epoch_Time:26894.0min: [2024-01-04 05:49:49,510][model8_pretrain.py][INFO] Epoch:[0/2](342600/4588595) loss:3.179 lr:0.0000100 
epoch_Time:26894.0min: [2024-01-04 05:49:49,510][model8_pretrain.py][INFO] Epoch:[0/2](342600/4588595) loss:3.070 lr:0.0000100 epoch_Time:26894.0min: [2024-01-04 05:49:49,510][model8_pretrain.py][INFO] Epoch:[0/2](342600/4588595) loss:3.256 lr:0.0000100 epoch_Time:26894.0min: [2024-01-04 05:49:49,510][model8_pretrain.py][INFO] Epoch:[0/2](342600/4588595) loss:2.903 lr:0.0000100 epoch_Time:26894.0min: [2024-01-04 05:49:49,510][model8_pretrain.py][INFO] Epoch:[0/2](342600/4588595) loss:2.726 lr:0.0000100 epoch_Time:26894.0min: [2024-01-04 05:49:49,510][model8_pretrain.py][INFO] Epoch:[0/2](342600/4588595) loss:2.864 lr:0.0000100 epoch_Time:26894.0min: [2024-01-04 05:49:49,510][model8_pretrain.py][INFO] Epoch:[0/2](342600/4588595) loss:2.654 lr:0.0000100 epoch_Time:26894.0min: [2024-01-04 05:50:26,464][model8_pretrain.py][INFO] Epoch:[0/2](342700/4588595) loss:3.159 lr:0.0000100 epoch_Time:26893.0min: [2024-01-04 05:50:26,464][model8_pretrain.py][INFO] Epoch:[0/2](342700/4588595) loss:2.990 lr:0.0000100 epoch_Time:26893.0min: [2024-01-04 05:50:26,464][model8_pretrain.py][INFO] Epoch:[0/2](342700/4588595) loss:2.639 lr:0.0000100 epoch_Time:26893.0min: [2024-01-04 05:50:26,464][model8_pretrain.py][INFO] Epoch:[0/2](342700/4588595) loss:1.906 lr:0.0000100 epoch_Time:26893.0min: [2024-01-04 05:50:26,464][model8_pretrain.py][INFO] Epoch:[0/2](342700/4588595) loss:3.016 lr:0.0000100 epoch_Time:26893.0min: [2024-01-04 05:50:26,464][model8_pretrain.py][INFO] Epoch:[0/2](342700/4588595) loss:3.236 lr:0.0000100 epoch_Time:26893.0min: [2024-01-04 05:50:26,464][model8_pretrain.py][INFO] Epoch:[0/2](342700/4588595) loss:2.951 lr:0.0000100 epoch_Time:26893.0min: [2024-01-04 05:50:26,464][model8_pretrain.py][INFO] Epoch:[0/2](342700/4588595) loss:3.120 lr:0.0000100 epoch_Time:26893.0min: [2024-01-04 05:51:03,464][model8_pretrain.py][INFO] Epoch:[0/2](342800/4588595) loss:3.381 lr:0.0000100 epoch_Time:26892.0min: [2024-01-04 05:51:03,464][model8_pretrain.py][INFO] 
Epoch:[0/2](342800/4588595) loss:3.174 lr:0.0000100 epoch_Time:26892.0min: [2024-01-04 05:51:03,464][model8_pretrain.py][INFO] Epoch:[0/2](342800/4588595) loss:3.222 lr:0.0000100 epoch_Time:26892.0min: [2024-01-04 05:51:03,464][model8_pretrain.py][INFO] Epoch:[0/2](342800/4588595) loss:3.116 lr:0.0000100 epoch_Time:26892.0min: [2024-01-04 05:51:03,464][model8_pretrain.py][INFO] Epoch:[0/2](342800/4588595) loss:2.975 lr:0.0000100 epoch_Time:26892.0min: [2024-01-04 05:51:03,464][model8_pretrain.py][INFO] Epoch:[0/2](342800/4588595) loss:2.582 lr:0.0000100 epoch_Time:26892.0min: [2024-01-04 05:51:03,464][model8_pretrain.py][INFO] Epoch:[0/2](342800/4588595) loss:2.210 lr:0.0000100 epoch_Time:26892.0min: [2024-01-04 05:51:03,465][model8_pretrain.py][INFO] Epoch:[0/2](342800/4588595) loss:2.655 lr:0.0000100 epoch_Time:26892.0min: [2024-01-04 05:51:50,823][model8_pretrain.py][INFO] Epoch:[0/2](342900/4588595) loss:3.077 lr:0.0000100 epoch_Time:26893.0min: [2024-01-04 05:51:50,823][model8_pretrain.py][INFO] Epoch:[0/2](342900/4588595) loss:3.233 lr:0.0000100 epoch_Time:26893.0min: [2024-01-04 05:51:50,823][model8_pretrain.py][INFO] Epoch:[0/2](342900/4588595) loss:2.525 lr:0.0000100 epoch_Time:26893.0min: [2024-01-04 05:51:50,823][model8_pretrain.py][INFO] Epoch:[0/2](342900/4588595) loss:2.775 lr:0.0000100 epoch_Time:26893.0min: [2024-01-04 05:51:50,823][model8_pretrain.py][INFO] Epoch:[0/2](342900/4588595) loss:3.164 lr:0.0000100 epoch_Time:26893.0min: [2024-01-04 05:51:50,823][model8_pretrain.py][INFO] Epoch:[0/2](342900/4588595) loss:3.519 lr:0.0000100 epoch_Time:26893.0min: [2024-01-04 05:51:50,823][model8_pretrain.py][INFO] Epoch:[0/2](342900/4588595) loss:3.158 lr:0.0000100 epoch_Time:26893.0min: [2024-01-04 05:51:50,823][model8_pretrain.py][INFO] Epoch:[0/2](342900/4588595) loss:2.341 lr:0.0000100 epoch_Time:26893.0min: [2024-01-04 05:52:27,722][model8_pretrain.py][INFO] Epoch:[0/2](343000/4588595) loss:3.263 lr:0.0000100 epoch_Time:26893.0min: [2024-01-04 
05:52:27,722][model8_pretrain.py][INFO] Epoch:[0/2](343000/4588595) loss:2.556 lr:0.0000100 epoch_Time:26893.0min: [2024-01-04 05:52:27,722][model8_pretrain.py][INFO] Epoch:[0/2](343000/4588595) loss:2.857 lr:0.0000100 epoch_Time:26893.0min: [2024-01-04 05:52:27,722][model8_pretrain.py][INFO] Epoch:[0/2](343000/4588595) loss:2.692 lr:0.0000100 epoch_Time:26893.0min: [2024-01-04 05:52:27,722][model8_pretrain.py][INFO] Epoch:[0/2](343000/4588595) loss:3.014 lr:0.0000100 epoch_Time:26893.0min: [2024-01-04 05:52:27,722][model8_pretrain.py][INFO] Epoch:[0/2](343000/4588595) loss:2.839 lr:0.0000100 epoch_Time:26893.0min: [2024-01-04 05:52:27,722][model8_pretrain.py][INFO] Epoch:[0/2](343000/4588595) loss:2.651 lr:0.0000100 epoch_Time:26893.0min: [2024-01-04 05:52:27,722][model8_pretrain.py][INFO] Epoch:[0/2](343000/4588595) loss:2.543 lr:0.0000100 epoch_Time:26893.0min: [2024-01-04 05:53:04,685][model8_pretrain.py][INFO] Epoch:[0/2](343100/4588595) loss:3.076 lr:0.0000100 epoch_Time:26892.0min: [2024-01-04 05:53:04,685][model8_pretrain.py][INFO] Epoch:[0/2](343100/4588595) loss:3.037 lr:0.0000100 epoch_Time:26892.0min: [2024-01-04 05:53:04,685][model8_pretrain.py][INFO] Epoch:[0/2](343100/4588595) loss:2.331 lr:0.0000100 epoch_Time:26892.0min: [2024-01-04 05:53:04,685][model8_pretrain.py][INFO] Epoch:[0/2](343100/4588595) loss:3.106 lr:0.0000100 epoch_Time:26892.0min: [2024-01-04 05:53:04,685][model8_pretrain.py][INFO] Epoch:[0/2](343100/4588595) loss:2.746 lr:0.0000100 epoch_Time:26892.0min: [2024-01-04 05:53:04,685][model8_pretrain.py][INFO] Epoch:[0/2](343100/4588595) loss:3.246 lr:0.0000100 epoch_Time:26892.0min: [2024-01-04 05:53:04,685][model8_pretrain.py][INFO] Epoch:[0/2](343100/4588595) loss:2.439 lr:0.0000100 epoch_Time:26892.0min: [2024-01-04 05:53:04,685][model8_pretrain.py][INFO] Epoch:[0/2](343100/4588595) loss:3.187 lr:0.0000100 epoch_Time:26892.0min: [2024-01-04 05:53:41,613][model8_pretrain.py][INFO] Epoch:[0/2](343200/4588595) loss:2.756 lr:0.0000100 
epoch_Time:26892.0min: [2024-01-04 05:53:41,613][model8_pretrain.py][INFO] Epoch:[0/2](343200/4588595) loss:3.160 lr:0.0000100 epoch_Time:26892.0min: [2024-01-04 05:53:41,613][model8_pretrain.py][INFO] Epoch:[0/2](343200/4588595) loss:3.011 lr:0.0000100 epoch_Time:26892.0min: [2024-01-04 05:53:41,613][model8_pretrain.py][INFO] Epoch:[0/2](343200/4588595) loss:2.279 lr:0.0000100 epoch_Time:26892.0min: [2024-01-04 05:53:41,613][model8_pretrain.py][INFO] Epoch:[0/2](343200/4588595) loss:3.298 lr:0.0000100 epoch_Time:26892.0min: [2024-01-04 05:53:41,613][model8_pretrain.py][INFO] Epoch:[0/2](343200/4588595) loss:3.014 lr:0.0000100 epoch_Time:26892.0min: [2024-01-04 05:53:41,613][model8_pretrain.py][INFO] Epoch:[0/2](343200/4588595) loss:3.301 lr:0.0000100 epoch_Time:26892.0min: [2024-01-04 05:53:41,613][model8_pretrain.py][INFO] Epoch:[0/2](343200/4588595) loss:2.759 lr:0.0000100 epoch_Time:26892.0min: [2024-01-04 05:54:18,556][model8_pretrain.py][INFO] Epoch:[0/2](343300/4588595) loss:2.444 lr:0.0000100 epoch_Time:26890.0min: [2024-01-04 05:54:18,556][model8_pretrain.py][INFO] Epoch:[0/2](343300/4588595) loss:2.759 lr:0.0000100 epoch_Time:26890.0min: [2024-01-04 05:54:18,556][model8_pretrain.py][INFO] Epoch:[0/2](343300/4588595) loss:2.978 lr:0.0000100 epoch_Time:26890.0min: [2024-01-04 05:54:18,556][model8_pretrain.py][INFO] Epoch:[0/2](343300/4588595) loss:2.925 lr:0.0000100 epoch_Time:26890.0min: [2024-01-04 05:54:18,556][model8_pretrain.py][INFO] Epoch:[0/2](343300/4588595) loss:2.599 lr:0.0000100 epoch_Time:26890.0min: [2024-01-04 05:54:18,556][model8_pretrain.py][INFO] Epoch:[0/2](343300/4588595) loss:2.662 lr:0.0000100 epoch_Time:26890.0min: [2024-01-04 05:54:18,556][model8_pretrain.py][INFO] Epoch:[0/2](343300/4588595) loss:2.806 lr:0.0000100 epoch_Time:26890.0min: [2024-01-04 05:54:18,557][model8_pretrain.py][INFO] Epoch:[0/2](343300/4588595) loss:3.376 lr:0.0000100 epoch_Time:26890.0min: [2024-01-04 05:54:55,494][model8_pretrain.py][INFO] 
Epoch:[0/2](343400/4588595) loss:2.887 lr:0.0000100 epoch_Time:26889.0min: [2024-01-04 05:54:55,494][model8_pretrain.py][INFO] Epoch:[0/2](343400/4588595) loss:3.136 lr:0.0000100 epoch_Time:26889.0min: [2024-01-04 05:54:55,494][model8_pretrain.py][INFO] Epoch:[0/2](343400/4588595) loss:3.165 lr:0.0000100 epoch_Time:26889.0min: [2024-01-04 05:54:55,494][model8_pretrain.py][INFO] Epoch:[0/2](343400/4588595) loss:3.088 lr:0.0000100 epoch_Time:26889.0min: [2024-01-04 05:54:55,494][model8_pretrain.py][INFO] Epoch:[0/2](343400/4588595) loss:2.883 lr:0.0000100 epoch_Time:26889.0min: [2024-01-04 05:54:55,494][model8_pretrain.py][INFO] Epoch:[0/2](343400/4588595) loss:3.547 lr:0.0000100 epoch_Time:26889.0min: [2024-01-04 05:54:55,494][model8_pretrain.py][INFO] Epoch:[0/2](343400/4588595) loss:2.752 lr:0.0000100 epoch_Time:26889.0min: [2024-01-04 05:54:55,494][model8_pretrain.py][INFO] Epoch:[0/2](343400/4588595) loss:2.929 lr:0.0000100 epoch_Time:26889.0min: [2024-01-04 05:55:32,433][model8_pretrain.py][INFO] Epoch:[0/2](343500/4588595) loss:2.604 lr:0.0000100 epoch_Time:26889.0min: [2024-01-04 05:55:32,433][model8_pretrain.py][INFO] Epoch:[0/2](343500/4588595) loss:2.893 lr:0.0000100 epoch_Time:26889.0min: [2024-01-04 05:55:32,433][model8_pretrain.py][INFO] Epoch:[0/2](343500/4588595) loss:3.043 lr:0.0000100 epoch_Time:26889.0min: [2024-01-04 05:55:32,433][model8_pretrain.py][INFO] Epoch:[0/2](343500/4588595) loss:2.991 lr:0.0000100 epoch_Time:26889.0min: [2024-01-04 05:55:32,433][model8_pretrain.py][INFO] Epoch:[0/2](343500/4588595) loss:3.036 lr:0.0000100 epoch_Time:26889.0min: [2024-01-04 05:55:32,433][model8_pretrain.py][INFO] Epoch:[0/2](343500/4588595) loss:2.977 lr:0.0000100 epoch_Time:26889.0min: [2024-01-04 05:55:32,433][model8_pretrain.py][INFO] Epoch:[0/2](343500/4588595) loss:3.310 lr:0.0000100 epoch_Time:26889.0min: [2024-01-04 05:55:32,433][model8_pretrain.py][INFO] Epoch:[0/2](343500/4588595) loss:2.770 lr:0.0000100 epoch_Time:26889.0min: [2024-01-04 
05:56:09,374][model8_pretrain.py][INFO] Epoch:[0/2](343600/4588595) loss:2.860 lr:0.0000100 epoch_Time:26888.0min: [2024-01-04 05:56:09,374][model8_pretrain.py][INFO] Epoch:[0/2](343600/4588595) loss:2.654 lr:0.0000100 epoch_Time:26888.0min: [2024-01-04 05:56:09,374][model8_pretrain.py][INFO] Epoch:[0/2](343600/4588595) loss:2.968 lr:0.0000100 epoch_Time:26888.0min: [2024-01-04 05:56:09,374][model8_pretrain.py][INFO] Epoch:[0/2](343600/4588595) loss:2.340 lr:0.0000100 epoch_Time:26888.0min: [2024-01-04 05:56:09,374][model8_pretrain.py][INFO] Epoch:[0/2](343600/4588595) loss:2.706 lr:0.0000100 epoch_Time:26888.0min: [2024-01-04 05:56:09,374][model8_pretrain.py][INFO] Epoch:[0/2](343600/4588595) loss:3.012 lr:0.0000100 epoch_Time:26888.0min: [2024-01-04 05:56:09,374][model8_pretrain.py][INFO] Epoch:[0/2](343600/4588595) loss:2.857 lr:0.0000100 epoch_Time:26888.0min: [2024-01-04 05:56:09,375][model8_pretrain.py][INFO] Epoch:[0/2](343600/4588595) loss:2.671 lr:0.0000100 epoch_Time:26888.0min: [2024-01-04 05:56:56,653][model8_pretrain.py][INFO] Epoch:[0/2](343700/4588595) loss:2.812 lr:0.0000100 epoch_Time:26889.0min: [2024-01-04 05:56:56,653][model8_pretrain.py][INFO] Epoch:[0/2](343700/4588595) loss:2.783 lr:0.0000100 epoch_Time:26889.0min: [2024-01-04 05:56:56,653][model8_pretrain.py][INFO] Epoch:[0/2](343700/4588595) loss:3.055 lr:0.0000100 epoch_Time:26889.0min: [2024-01-04 05:56:56,653][model8_pretrain.py][INFO] Epoch:[0/2](343700/4588595) loss:3.153 lr:0.0000100 epoch_Time:26889.0min: [2024-01-04 05:56:56,654][model8_pretrain.py][INFO] Epoch:[0/2](343700/4588595) loss:3.012 lr:0.0000100 epoch_Time:26889.0min: [2024-01-04 05:56:56,654][model8_pretrain.py][INFO] Epoch:[0/2](343700/4588595) loss:2.482 lr:0.0000100 epoch_Time:26889.0min: [2024-01-04 05:56:56,654][model8_pretrain.py][INFO] Epoch:[0/2](343700/4588595) loss:2.961 lr:0.0000100 epoch_Time:26889.0min: [2024-01-04 05:56:56,654][model8_pretrain.py][INFO] Epoch:[0/2](343700/4588595) loss:2.701 lr:0.0000100 
epoch_Time:26889.0min: [2024-01-04 05:57:33,580][model8_pretrain.py][INFO] Epoch:[0/2](343800/4588595) loss:2.204 lr:0.0000100 epoch_Time:26888.0min: [2024-01-04 05:57:33,580][model8_pretrain.py][INFO] Epoch:[0/2](343800/4588595) loss:2.959 lr:0.0000100 epoch_Time:26888.0min: [2024-01-04 05:57:33,580][model8_pretrain.py][INFO] Epoch:[0/2](343800/4588595) loss:2.561 lr:0.0000100 epoch_Time:26888.0min: [2024-01-04 05:57:33,580][model8_pretrain.py][INFO] Epoch:[0/2](343800/4588595) loss:3.035 lr:0.0000100 epoch_Time:26888.0min: [2024-01-04 05:57:33,580][model8_pretrain.py][INFO] Epoch:[0/2](343800/4588595) loss:2.639 lr:0.0000100 epoch_Time:26888.0min: [2024-01-04 05:57:33,580][model8_pretrain.py][INFO] Epoch:[0/2](343800/4588595) loss:3.519 lr:0.0000100 epoch_Time:26888.0min: [2024-01-04 05:57:33,580][model8_pretrain.py][INFO] Epoch:[0/2](343800/4588595) loss:2.599 lr:0.0000100 epoch_Time:26888.0min: [2024-01-04 05:57:33,580][model8_pretrain.py][INFO] Epoch:[0/2](343800/4588595) loss:2.806 lr:0.0000100 epoch_Time:26888.0min: [2024-01-04 05:58:10,518][model8_pretrain.py][INFO] Epoch:[0/2](343900/4588595) loss:3.003 lr:0.0000100 epoch_Time:26887.0min: [2024-01-04 05:58:10,518][model8_pretrain.py][INFO] Epoch:[0/2](343900/4588595) loss:3.186 lr:0.0000100 epoch_Time:26887.0min: [2024-01-04 05:58:10,518][model8_pretrain.py][INFO] Epoch:[0/2](343900/4588595) loss:3.320 lr:0.0000100 epoch_Time:26887.0min: [2024-01-04 05:58:10,518][model8_pretrain.py][INFO] Epoch:[0/2](343900/4588595) loss:3.261 lr:0.0000100 epoch_Time:26887.0min: [2024-01-04 05:58:10,518][model8_pretrain.py][INFO] Epoch:[0/2](343900/4588595) loss:3.050 lr:0.0000100 epoch_Time:26887.0min: [2024-01-04 05:58:10,518][model8_pretrain.py][INFO] Epoch:[0/2](343900/4588595) loss:2.581 lr:0.0000100 epoch_Time:26887.0min: [2024-01-04 05:58:10,518][model8_pretrain.py][INFO] Epoch:[0/2](343900/4588595) loss:2.159 lr:0.0000100 epoch_Time:26887.0min: [2024-01-04 05:58:10,518][model8_pretrain.py][INFO] 
Epoch:[0/2](343900/4588595) loss:3.035 lr:0.0000100 epoch_Time:26887.0min: [2024-01-04 05:58:47,453][model8_pretrain.py][INFO] Epoch:[0/2](344000/4588595) loss:2.488 lr:0.0000100 epoch_Time:26887.0min: [2024-01-04 05:58:47,453][model8_pretrain.py][INFO] Epoch:[0/2](344000/4588595) loss:2.982 lr:0.0000100 epoch_Time:26887.0min: [2024-01-04 05:58:47,453][model8_pretrain.py][INFO] Epoch:[0/2](344000/4588595) loss:2.876 lr:0.0000100 epoch_Time:26887.0min: [2024-01-04 05:58:47,453][model8_pretrain.py][INFO] Epoch:[0/2](344000/4588595) loss:1.659 lr:0.0000100 epoch_Time:26887.0min: [2024-01-04 05:58:47,453][model8_pretrain.py][INFO] Epoch:[0/2](344000/4588595) loss:3.035 lr:0.0000100 epoch_Time:26887.0min: [2024-01-04 05:58:47,453][model8_pretrain.py][INFO] Epoch:[0/2](344000/4588595) loss:2.828 lr:0.0000100 epoch_Time:26887.0min: [2024-01-04 05:58:47,453][model8_pretrain.py][INFO] Epoch:[0/2](344000/4588595) loss:3.246 lr:0.0000100 epoch_Time:26887.0min: [2024-01-04 05:58:47,453][model8_pretrain.py][INFO] Epoch:[0/2](344000/4588595) loss:3.108 lr:0.0000100 epoch_Time:26887.0min: [2024-01-04 05:59:24,404][model8_pretrain.py][INFO] Epoch:[0/2](344100/4588595) loss:2.578 lr:0.0000100 epoch_Time:26886.0min: [2024-01-04 05:59:24,404][model8_pretrain.py][INFO] Epoch:[0/2](344100/4588595) loss:3.112 lr:0.0000100 epoch_Time:26886.0min: [2024-01-04 05:59:24,404][model8_pretrain.py][INFO] Epoch:[0/2](344100/4588595) loss:2.884 lr:0.0000100 epoch_Time:26886.0min: [2024-01-04 05:59:24,404][model8_pretrain.py][INFO] Epoch:[0/2](344100/4588595) loss:3.390 lr:0.0000100 epoch_Time:26886.0min: [2024-01-04 05:59:24,405][model8_pretrain.py][INFO] Epoch:[0/2](344100/4588595) loss:2.466 lr:0.0000100 epoch_Time:26886.0min: [2024-01-04 05:59:24,405][model8_pretrain.py][INFO] Epoch:[0/2](344100/4588595) loss:3.021 lr:0.0000100 epoch_Time:26886.0min: [2024-01-04 05:59:24,405][model8_pretrain.py][INFO] Epoch:[0/2](344100/4588595) loss:2.162 lr:0.0000100 epoch_Time:26886.0min: [2024-01-04 
05:59:24,405][model8_pretrain.py][INFO] Epoch:[0/2](344100/4588595) loss:2.620 lr:0.0000100 epoch_Time:26886.0min: [2024-01-04 06:00:01,355][model8_pretrain.py][INFO] Epoch:[0/2](344200/4588595) loss:2.956 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:00:01,355][model8_pretrain.py][INFO] Epoch:[0/2](344200/4588595) loss:3.175 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:00:01,355][model8_pretrain.py][INFO] Epoch:[0/2](344200/4588595) loss:2.630 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:00:01,355][model8_pretrain.py][INFO] Epoch:[0/2](344200/4588595) loss:2.780 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:00:01,355][model8_pretrain.py][INFO] Epoch:[0/2](344200/4588595) loss:2.766 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:00:01,355][model8_pretrain.py][INFO] Epoch:[0/2](344200/4588595) loss:3.065 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:00:01,355][model8_pretrain.py][INFO] Epoch:[0/2](344200/4588595) loss:2.887 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:00:01,355][model8_pretrain.py][INFO] Epoch:[0/2](344200/4588595) loss:3.478 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:00:38,291][model8_pretrain.py][INFO] Epoch:[0/2](344300/4588595) loss:2.924 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:00:38,291][model8_pretrain.py][INFO] Epoch:[0/2](344300/4588595) loss:3.044 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:00:38,292][model8_pretrain.py][INFO] Epoch:[0/2](344300/4588595) loss:2.416 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:00:38,292][model8_pretrain.py][INFO] Epoch:[0/2](344300/4588595) loss:3.111 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:00:38,292][model8_pretrain.py][INFO] Epoch:[0/2](344300/4588595) loss:3.012 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:00:38,292][model8_pretrain.py][INFO] Epoch:[0/2](344300/4588595) loss:2.966 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:00:38,292][model8_pretrain.py][INFO] Epoch:[0/2](344300/4588595) loss:3.102 lr:0.0000100 
epoch_Time:26884.0min: [2024-01-04 06:00:38,292][model8_pretrain.py][INFO] Epoch:[0/2](344300/4588595) loss:2.827 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:01:15,222][model8_pretrain.py][INFO] Epoch:[0/2](344400/4588595) loss:2.415 lr:0.0000100 epoch_Time:26883.0min: [2024-01-04 06:01:15,222][model8_pretrain.py][INFO] Epoch:[0/2](344400/4588595) loss:2.110 lr:0.0000100 epoch_Time:26883.0min: [2024-01-04 06:01:15,222][model8_pretrain.py][INFO] Epoch:[0/2](344400/4588595) loss:2.993 lr:0.0000100 epoch_Time:26883.0min: [2024-01-04 06:01:15,222][model8_pretrain.py][INFO] Epoch:[0/2](344400/4588595) loss:2.905 lr:0.0000100 epoch_Time:26883.0min: [2024-01-04 06:01:15,222][model8_pretrain.py][INFO] Epoch:[0/2](344400/4588595) loss:2.676 lr:0.0000100 epoch_Time:26883.0min: [2024-01-04 06:01:15,222][model8_pretrain.py][INFO] Epoch:[0/2](344400/4588595) loss:2.386 lr:0.0000100 epoch_Time:26883.0min: [2024-01-04 06:01:15,222][model8_pretrain.py][INFO] Epoch:[0/2](344400/4588595) loss:3.133 lr:0.0000100 epoch_Time:26883.0min: [2024-01-04 06:01:15,223][model8_pretrain.py][INFO] Epoch:[0/2](344400/4588595) loss:2.433 lr:0.0000100 epoch_Time:26883.0min: [2024-01-04 06:02:02,806][model8_pretrain.py][INFO] Epoch:[0/2](344500/4588595) loss:2.887 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:02:02,806][model8_pretrain.py][INFO] Epoch:[0/2](344500/4588595) loss:3.066 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:02:02,806][model8_pretrain.py][INFO] Epoch:[0/2](344500/4588595) loss:3.252 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:02:02,806][model8_pretrain.py][INFO] Epoch:[0/2](344500/4588595) loss:3.053 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:02:02,806][model8_pretrain.py][INFO] Epoch:[0/2](344500/4588595) loss:2.838 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:02:02,806][model8_pretrain.py][INFO] Epoch:[0/2](344500/4588595) loss:2.690 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:02:02,806][model8_pretrain.py][INFO] 
Epoch:[0/2](344500/4588595) loss:3.022 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:02:02,806][model8_pretrain.py][INFO] Epoch:[0/2](344500/4588595) loss:3.292 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:02:39,725][model8_pretrain.py][INFO] Epoch:[0/2](344600/4588595) loss:2.997 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:02:39,725][model8_pretrain.py][INFO] Epoch:[0/2](344600/4588595) loss:2.869 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:02:39,725][model8_pretrain.py][INFO] Epoch:[0/2](344600/4588595) loss:3.000 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:02:39,725][model8_pretrain.py][INFO] Epoch:[0/2](344600/4588595) loss:2.602 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:02:39,725][model8_pretrain.py][INFO] Epoch:[0/2](344600/4588595) loss:2.596 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:02:39,725][model8_pretrain.py][INFO] Epoch:[0/2](344600/4588595) loss:2.810 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:02:39,725][model8_pretrain.py][INFO] Epoch:[0/2](344600/4588595) loss:3.118 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:02:39,725][model8_pretrain.py][INFO] Epoch:[0/2](344600/4588595) loss:3.107 lr:0.0000100 epoch_Time:26884.0min: [2024-01-04 06:03:16,651][model8_pretrain.py][INFO] Epoch:[0/2](344700/4588595) loss:2.974 lr:0.0000100 epoch_Time:26883.0min: [2024-01-04 06:03:16,651][model8_pretrain.py][INFO] Epoch:[0/2](344700/4588595) loss:3.212 lr:0.0000100 epoch_Time:26883.0min: [2024-01-04 06:03:16,651][model8_pretrain.py][INFO] Epoch:[0/2](344700/4588595) loss:3.419 lr:0.0000100 epoch_Time:26883.0min: [2024-01-04 06:03:16,651][model8_pretrain.py][INFO] Epoch:[0/2](344700/4588595) loss:3.079 lr:0.0000100 epoch_Time:26883.0min: [2024-01-04 06:03:16,651][model8_pretrain.py][INFO] Epoch:[0/2](344700/4588595) loss:2.659 lr:0.0000100 epoch_Time:26883.0min: [2024-01-04 06:03:16,651][model8_pretrain.py][INFO] Epoch:[0/2](344700/4588595) loss:2.960 lr:0.0000100 epoch_Time:26883.0min: [2024-01-04 
06:03:16,652][model8_pretrain.py][INFO] Epoch:[0/2](344700/4588595) loss:2.829 lr:0.0000100 epoch_Time:26883.0min: [2024-01-04 06:03:16,652][model8_pretrain.py][INFO] Epoch:[0/2](344700/4588595) loss:2.863 lr:0.0000100 epoch_Time:26883.0min: [2024-01-04 06:03:53,588][model8_pretrain.py][INFO] Epoch:[0/2](344800/4588595) loss:3.309 lr:0.0000100 epoch_Time:26881.0min: [2024-01-04 06:03:53,588][model8_pretrain.py][INFO] Epoch:[0/2](344800/4588595) loss:3.086 lr:0.0000100 epoch_Time:26881.0min: [2024-01-04 06:03:53,588][model8_pretrain.py][INFO] Epoch:[0/2](344800/4588595) loss:2.809 lr:0.0000100 epoch_Time:26881.0min: [2024-01-04 06:03:53,588][model8_pretrain.py][INFO] Epoch:[0/2](344800/4588595) loss:2.891 lr:0.0000100 epoch_Time:26881.0min: [2024-01-04 06:03:53,588][model8_pretrain.py][INFO] Epoch:[0/2](344800/4588595) loss:3.560 lr:0.0000100 epoch_Time:26881.0min: [2024-01-04 06:03:53,588][model8_pretrain.py][INFO] Epoch:[0/2](344800/4588595) loss:2.964 lr:0.0000100 epoch_Time:26881.0min: [2024-01-04 06:03:53,588][model8_pretrain.py][INFO] Epoch:[0/2](344800/4588595) loss:2.723 lr:0.0000100 epoch_Time:26881.0min: [2024-01-04 06:03:53,588][model8_pretrain.py][INFO] Epoch:[0/2](344800/4588595) loss:2.843 lr:0.0000100 epoch_Time:26881.0min: [2024-01-04 06:04:30,522][model8_pretrain.py][INFO] Epoch:[0/2](344900/4588595) loss:3.181 lr:0.0000100 epoch_Time:26881.0min: [2024-01-04 06:04:30,522][model8_pretrain.py][INFO] Epoch:[0/2](344900/4588595) loss:3.259 lr:0.0000100 epoch_Time:26881.0min: [2024-01-04 06:04:30,522][model8_pretrain.py][INFO] Epoch:[0/2](344900/4588595) loss:2.751 lr:0.0000100 epoch_Time:26881.0min: [2024-01-04 06:04:30,522][model8_pretrain.py][INFO] Epoch:[0/2](344900/4588595) loss:2.951 lr:0.0000100 epoch_Time:26881.0min: [2024-01-04 06:04:30,522][model8_pretrain.py][INFO] Epoch:[0/2](344900/4588595) loss:3.029 lr:0.0000100 epoch_Time:26881.0min: [2024-01-04 06:04:30,522][model8_pretrain.py][INFO] Epoch:[0/2](344900/4588595) loss:3.198 lr:0.0000100 
epoch_Time:26881.0min: [2024-01-04 06:04:30,522][model8_pretrain.py][INFO] Epoch:[0/2](344900/4588595) loss:2.775 lr:0.0000100 epoch_Time:26881.0min: [2024-01-04 06:04:30,522][model8_pretrain.py][INFO] Epoch:[0/2](344900/4588595) loss:3.486 lr:0.0000100 epoch_Time:26881.0min: [2024-01-04 06:05:07,474][model8_pretrain.py][INFO] Epoch:[0/2](345000/4588595) loss:2.661 lr:0.0000100 epoch_Time:26880.0min: [2024-01-04 06:05:07,474][model8_pretrain.py][INFO] Epoch:[0/2](345000/4588595) loss:3.132 lr:0.0000100 epoch_Time:26880.0min: [2024-01-04 06:05:07,474][model8_pretrain.py][INFO] Epoch:[0/2](345000/4588595) loss:3.257 lr:0.0000100 epoch_Time:26880.0min: [2024-01-04 06:05:07,474][model8_pretrain.py][INFO] Epoch:[0/2](345000/4588595) loss:3.018 lr:0.0000100 epoch_Time:26880.0min: [2024-01-04 06:05:07,474][model8_pretrain.py][INFO] Epoch:[0/2](345000/4588595) loss:2.845 lr:0.0000100 epoch_Time:26880.0min: [2024-01-04 06:05:07,474][model8_pretrain.py][INFO] Epoch:[0/2](345000/4588595) loss:2.639 lr:0.0000100 epoch_Time:26880.0min: [2024-01-04 06:05:07,474][model8_pretrain.py][INFO] Epoch:[0/2](345000/4588595) loss:2.749 lr:0.0000100 epoch_Time:26880.0min: [2024-01-04 06:05:07,474][model8_pretrain.py][INFO] Epoch:[0/2](345000/4588595) loss:3.141 lr:0.0000100 epoch_Time:26880.0min: [2024-01-04 06:05:44,399][model8_pretrain.py][INFO] Epoch:[0/2](345100/4588595) loss:3.688 lr:0.0000100 epoch_Time:26880.0min: [2024-01-04 06:05:44,399][model8_pretrain.py][INFO] Epoch:[0/2](345100/4588595) loss:2.904 lr:0.0000100 epoch_Time:26880.0min: [2024-01-04 06:05:44,399][model8_pretrain.py][INFO] Epoch:[0/2](345100/4588595) loss:2.763 lr:0.0000100 epoch_Time:26880.0min: [2024-01-04 06:05:44,399][model8_pretrain.py][INFO] Epoch:[0/2](345100/4588595) loss:3.076 lr:0.0000100 epoch_Time:26880.0min: [2024-01-04 06:05:44,400][model8_pretrain.py][INFO] Epoch:[0/2](345100/4588595) loss:3.328 lr:0.0000100 epoch_Time:26880.0min: [2024-01-04 06:05:44,400][model8_pretrain.py][INFO] 
Epoch:[0/2](345100/4588595) loss:2.918 lr:0.0000100 epoch_Time:26880.0min: [2024-01-04 06:05:44,400][model8_pretrain.py][INFO] Epoch:[0/2](345100/4588595) loss:2.775 lr:0.0000100 epoch_Time:26880.0min: [2024-01-04 06:05:44,400][model8_pretrain.py][INFO] Epoch:[0/2](345100/4588595) loss:3.163 lr:0.0000100 epoch_Time:26880.0min: [2024-01-04 06:06:21,351][model8_pretrain.py][INFO] Epoch:[0/2](345200/4588595) loss:2.309 lr:0.0000100 epoch_Time:26878.0min: [2024-01-04 06:06:21,351][model8_pretrain.py][INFO] Epoch:[0/2](345200/4588595) loss:2.538 lr:0.0000100 epoch_Time:26878.0min: [2024-01-04 06:06:21,351][model8_pretrain.py][INFO] Epoch:[0/2](345200/4588595) loss:2.675 lr:0.0000100 epoch_Time:26878.0min: [2024-01-04 06:06:21,351][model8_pretrain.py][INFO] Epoch:[0/2](345200/4588595) loss:2.968 lr:0.0000100 epoch_Time:26878.0min: [2024-01-04 06:06:21,351][model8_pretrain.py][INFO] Epoch:[0/2](345200/4588595) loss:2.555 lr:0.0000100 epoch_Time:26878.0min: [2024-01-04 06:06:21,351][model8_pretrain.py][INFO] Epoch:[0/2](345200/4588595) loss:3.050 lr:0.0000100 epoch_Time:26878.0min: [2024-01-04 06:06:21,351][model8_pretrain.py][INFO] Epoch:[0/2](345200/4588595) loss:2.985 lr:0.0000100 epoch_Time:26878.0min: [2024-01-04 06:06:21,351][model8_pretrain.py][INFO] Epoch:[0/2](345200/4588595) loss:2.817 lr:0.0000100 epoch_Time:26878.0min: [2024-01-04 06:07:08,648][model8_pretrain.py][INFO] Epoch:[0/2](345300/4588595) loss:3.240 lr:0.0000100 epoch_Time:26880.0min: [2024-01-04 06:07:08,649][model8_pretrain.py][INFO] Epoch:[0/2](345300/4588595) loss:3.044 lr:0.0000100 epoch_Time:26880.0min: [2024-01-04 06:07:08,649][model8_pretrain.py][INFO] Epoch:[0/2](345300/4588595) loss:3.110 lr:0.0000100 epoch_Time:26880.0min: [2024-01-04 06:07:08,649][model8_pretrain.py][INFO] Epoch:[0/2](345300/4588595) loss:2.640 lr:0.0000100 epoch_Time:26880.0min: [2024-01-04 06:07:08,649][model8_pretrain.py][INFO] Epoch:[0/2](345300/4588595) loss:2.205 lr:0.0000100 epoch_Time:26880.0min: [2024-01-04 
06:07:08,649][model8_pretrain.py][INFO] Epoch:[0/2](345300/4588595) loss:3.198 lr:0.0000100 epoch_Time:26880.0min: [2024-01-04 06:07:08,649][model8_pretrain.py][INFO] Epoch:[0/2](345300/4588595) loss:3.351 lr:0.0000100 epoch_Time:26880.0min: [2024-01-04 06:07:08,649][model8_pretrain.py][INFO] Epoch:[0/2](345300/4588595) loss:3.152 lr:0.0000100 epoch_Time:26880.0min: [2024-01-04 06:07:45,571][model8_pretrain.py][INFO] Epoch:[0/2](345400/4588595) loss:2.852 lr:0.0000100 epoch_Time:26879.0min: [2024-01-04 06:07:45,571][model8_pretrain.py][INFO] Epoch:[0/2](345400/4588595) loss:2.795 lr:0.0000100 epoch_Time:26879.0min: [2024-01-04 06:07:45,571][model8_pretrain.py][INFO] Epoch:[0/2](345400/4588595) loss:2.628 lr:0.0000100 epoch_Time:26879.0min: [2024-01-04 06:07:45,571][model8_pretrain.py][INFO] Epoch:[0/2](345400/4588595) loss:2.843 lr:0.0000100 epoch_Time:26879.0min: [2024-01-04 06:07:45,571][model8_pretrain.py][INFO] Epoch:[0/2](345400/4588595) loss:3.475 lr:0.0000100 epoch_Time:26879.0min: [2024-01-04 06:07:45,571][model8_pretrain.py][INFO] Epoch:[0/2](345400/4588595) loss:3.047 lr:0.0000100 epoch_Time:26879.0min: [2024-01-04 06:07:45,571][model8_pretrain.py][INFO] Epoch:[0/2](345400/4588595) loss:2.727 lr:0.0000100 epoch_Time:26879.0min: [2024-01-04 06:07:45,571][model8_pretrain.py][INFO] Epoch:[0/2](345400/4588595) loss:2.872 lr:0.0000100 epoch_Time:26879.0min: [2024-01-04 06:08:22,510][model8_pretrain.py][INFO] Epoch:[0/2](345500/4588595) loss:2.699 lr:0.0000100 epoch_Time:26878.0min: [2024-01-04 06:08:22,510][model8_pretrain.py][INFO] Epoch:[0/2](345500/4588595) loss:2.679 lr:0.0000100 epoch_Time:26878.0min: [2024-01-04 06:08:22,510][model8_pretrain.py][INFO] Epoch:[0/2](345500/4588595) loss:2.411 lr:0.0000100 epoch_Time:26878.0min: [2024-01-04 06:08:22,510][model8_pretrain.py][INFO] Epoch:[0/2](345500/4588595) loss:3.028 lr:0.0000100 epoch_Time:26878.0min: [2024-01-04 06:08:22,510][model8_pretrain.py][INFO] Epoch:[0/2](345500/4588595) loss:2.964 lr:0.0000100 
epoch_Time:26878.0min: [2024-01-04 06:08:22,511][model8_pretrain.py][INFO] Epoch:[0/2](345500/4588595) loss:2.231 lr:0.0000100 epoch_Time:26878.0min: [2024-01-04 06:08:22,511][model8_pretrain.py][INFO] Epoch:[0/2](345500/4588595) loss:2.882 lr:0.0000100 epoch_Time:26878.0min: [2024-01-04 06:08:22,511][model8_pretrain.py][INFO] Epoch:[0/2](345500/4588595) loss:2.828 lr:0.0000100 epoch_Time:26878.0min: [2024-01-04 06:08:59,450][model8_pretrain.py][INFO] Epoch:[0/2](345600/4588595) loss:2.776 lr:0.0000100 epoch_Time:26877.0min: [2024-01-04 06:08:59,450][model8_pretrain.py][INFO] Epoch:[0/2](345600/4588595) loss:3.089 lr:0.0000100 epoch_Time:26877.0min: [2024-01-04 06:08:59,450][model8_pretrain.py][INFO] Epoch:[0/2](345600/4588595) loss:2.774 lr:0.0000100 epoch_Time:26877.0min: [2024-01-04 06:08:59,450][model8_pretrain.py][INFO] Epoch:[0/2](345600/4588595) loss:3.147 lr:0.0000100 epoch_Time:26877.0min: [2024-01-04 06:08:59,450][model8_pretrain.py][INFO] Epoch:[0/2](345600/4588595) loss:3.016 lr:0.0000100 epoch_Time:26877.0min: [2024-01-04 06:08:59,450][model8_pretrain.py][INFO] Epoch:[0/2](345600/4588595) loss:2.795 lr:0.0000100 epoch_Time:26877.0min: [2024-01-04 06:08:59,450][model8_pretrain.py][INFO] Epoch:[0/2](345600/4588595) loss:3.111 lr:0.0000100 epoch_Time:26877.0min: [2024-01-04 06:08:59,449][model8_pretrain.py][INFO] Epoch:[0/2](345600/4588595) loss:3.104 lr:0.0000100 epoch_Time:26877.0min: [2024-01-04 06:09:36,388][model8_pretrain.py][INFO] Epoch:[0/2](345700/4588595) loss:2.844 lr:0.0000100 epoch_Time:26877.0min: [2024-01-04 06:09:36,388][model8_pretrain.py][INFO] Epoch:[0/2](345700/4588595) loss:3.129 lr:0.0000100 epoch_Time:26877.0min: [2024-01-04 06:09:36,388][model8_pretrain.py][INFO] Epoch:[0/2](345700/4588595) loss:2.458 lr:0.0000100 epoch_Time:26877.0min: [2024-01-04 06:09:36,388][model8_pretrain.py][INFO] Epoch:[0/2](345700/4588595) loss:2.850 lr:0.0000100 epoch_Time:26877.0min: [2024-01-04 06:09:36,388][model8_pretrain.py][INFO] 
Epoch:[0/2](345700/4588595) loss:2.781 lr:0.0000100 epoch_Time:26877.0min: [2024-01-04 06:09:36,388][model8_pretrain.py][INFO] Epoch:[0/2](345700/4588595) loss:2.860 lr:0.0000100 epoch_Time:26877.0min: [2024-01-04 06:09:36,388][model8_pretrain.py][INFO] Epoch:[0/2](345700/4588595) loss:3.023 lr:0.0000100 epoch_Time:26877.0min: [2024-01-04 06:09:36,388][model8_pretrain.py][INFO] Epoch:[0/2](345700/4588595) loss:3.004 lr:0.0000100 epoch_Time:26877.0min: [2024-01-04 06:10:13,335][model8_pretrain.py][INFO] Epoch:[0/2](345800/4588595) loss:2.681 lr:0.0000100 epoch_Time:26875.0min: [2024-01-04 06:10:13,335][model8_pretrain.py][INFO] Epoch:[0/2](345800/4588595) loss:3.168 lr:0.0000100 epoch_Time:26875.0min: [2024-01-04 06:10:13,335][model8_pretrain.py][INFO] Epoch:[0/2](345800/4588595) loss:3.123 lr:0.0000100 epoch_Time:26875.0min: [2024-01-04 06:10:13,335][model8_pretrain.py][INFO] Epoch:[0/2](345800/4588595) loss:2.721 lr:0.0000100 epoch_Time:26875.0min: [2024-01-04 06:10:13,335][model8_pretrain.py][INFO] Epoch:[0/2](345800/4588595) loss:3.229 lr:0.0000100 epoch_Time:26875.0min: [2024-01-04 06:10:13,335][model8_pretrain.py][INFO] Epoch:[0/2](345800/4588595) loss:2.782 lr:0.0000100 epoch_Time:26875.0min: [2024-01-04 06:10:13,335][model8_pretrain.py][INFO] Epoch:[0/2](345800/4588595) loss:2.804 lr:0.0000100 epoch_Time:26875.0min: [2024-01-04 06:10:13,335][model8_pretrain.py][INFO] Epoch:[0/2](345800/4588595) loss:2.179 lr:0.0000100 epoch_Time:26875.0min: [2024-01-04 06:10:50,277][model8_pretrain.py][INFO] Epoch:[0/2](345900/4588595) loss:2.769 lr:0.0000100 epoch_Time:26874.0min: [2024-01-04 06:10:50,277][model8_pretrain.py][INFO] Epoch:[0/2](345900/4588595) loss:2.967 lr:0.0000100 epoch_Time:26874.0min: [2024-01-04 06:10:50,277][model8_pretrain.py][INFO] Epoch:[0/2](345900/4588595) loss:2.713 lr:0.0000100 epoch_Time:26874.0min: [2024-01-04 06:10:50,277][model8_pretrain.py][INFO] Epoch:[0/2](345900/4588595) loss:2.865 lr:0.0000100 epoch_Time:26874.0min: [2024-01-04 
06:10:50,277][model8_pretrain.py][INFO] Epoch:[0/2](345900/4588595) loss:2.448 lr:0.0000100 epoch_Time:26874.0min: [2024-01-04 06:10:50,277][model8_pretrain.py][INFO] Epoch:[0/2](345900/4588595) loss:2.917 lr:0.0000100 epoch_Time:26874.0min: [2024-01-04 06:10:50,277][model8_pretrain.py][INFO] Epoch:[0/2](345900/4588595) loss:3.423 lr:0.0000100 epoch_Time:26874.0min: [2024-01-04 06:10:50,277][model8_pretrain.py][INFO] Epoch:[0/2](345900/4588595) loss:3.033 lr:0.0000100 epoch_Time:26874.0min: [2024-01-04 06:11:27,219][model8_pretrain.py][INFO] Epoch:[0/2](346000/4588595) loss:3.194 lr:0.0000100 epoch_Time:26874.0min: [2024-01-04 06:11:27,219][model8_pretrain.py][INFO] Epoch:[0/2](346000/4588595) loss:2.166 lr:0.0000100 epoch_Time:26874.0min: [2024-01-04 06:11:27,219][model8_pretrain.py][INFO] Epoch:[0/2](346000/4588595) loss:2.868 lr:0.0000100 epoch_Time:26874.0min: [2024-01-04 06:11:27,219][model8_pretrain.py][INFO] Epoch:[0/2](346000/4588595) loss:3.155 lr:0.0000100 epoch_Time:26874.0min: [2024-01-04 06:11:27,219][model8_pretrain.py][INFO] Epoch:[0/2](346000/4588595) loss:3.337 lr:0.0000100 epoch_Time:26874.0min: [2024-01-04 06:11:27,219][model8_pretrain.py][INFO] Epoch:[0/2](346000/4588595) loss:3.448 lr:0.0000100 epoch_Time:26874.0min: [2024-01-04 06:11:27,219][model8_pretrain.py][INFO] Epoch:[0/2](346000/4588595) loss:2.891 lr:0.0000100 epoch_Time:26874.0min: [2024-01-04 06:11:27,219][model8_pretrain.py][INFO] Epoch:[0/2](346000/4588595) loss:2.707 lr:0.0000100 epoch_Time:26874.0min: [2024-01-04 06:12:14,556][model8_pretrain.py][INFO] Epoch:[0/2](346100/4588595) loss:2.765 lr:0.0000100 epoch_Time:26875.0min: [2024-01-04 06:12:14,556][model8_pretrain.py][INFO] Epoch:[0/2](346100/4588595) loss:2.968 lr:0.0000100 epoch_Time:26875.0min: [2024-01-04 06:12:14,556][model8_pretrain.py][INFO] Epoch:[0/2](346100/4588595) loss:3.198 lr:0.0000100 epoch_Time:26875.0min: [2024-01-04 06:12:14,556][model8_pretrain.py][INFO] Epoch:[0/2](346100/4588595) loss:2.579 lr:0.0000100 
epoch_Time:26875.0min: [2024-01-04 06:12:14,556][model8_pretrain.py][INFO] Epoch:[0/2](346100/4588595) loss:3.352 lr:0.0000100 epoch_Time:26875.0min: [2024-01-04 06:12:14,556][model8_pretrain.py][INFO] Epoch:[0/2](346100/4588595) loss:3.606 lr:0.0000100 epoch_Time:26875.0min: [2024-01-04 06:12:14,556][model8_pretrain.py][INFO] Epoch:[0/2](346100/4588595) loss:2.958 lr:0.0000100 epoch_Time:26875.0min: [2024-01-04 06:12:14,556][model8_pretrain.py][INFO] Epoch:[0/2](346100/4588595) loss:2.704 lr:0.0000100 epoch_Time:26875.0min: [2024-01-04 06:12:51,483][model8_pretrain.py][INFO] Epoch:[0/2](346200/4588595) loss:3.149 lr:0.0000100 epoch_Time:26874.0min: [2024-01-04 06:12:51,483][model8_pretrain.py][INFO] Epoch:[0/2](346200/4588595) loss:2.560 lr:0.0000100 epoch_Time:26874.0min: [2024-01-04 06:12:51,483][model8_pretrain.py][INFO] Epoch:[0/2](346200/4588595) loss:2.858 lr:0.0000100 epoch_Time:26874.0min: [2024-01-04 06:12:51,483][model8_pretrain.py][INFO] Epoch:[0/2](346200/4588595) loss:2.485 lr:0.0000100 epoch_Time:26874.0min: [2024-01-04 06:12:51,483][model8_pretrain.py][INFO] Epoch:[0/2](346200/4588595) loss:3.156 lr:0.0000100 epoch_Time:26874.0min: [2024-01-04 06:12:51,483][model8_pretrain.py][INFO] Epoch:[0/2](346200/4588595) loss:2.883 lr:0.0000100 epoch_Time:26874.0min: [2024-01-04 06:12:51,483][model8_pretrain.py][INFO] Epoch:[0/2](346200/4588595) loss:2.791 lr:0.0000100 epoch_Time:26874.0min: [2024-01-04 06:12:51,484][model8_pretrain.py][INFO] Epoch:[0/2](346200/4588595) loss:2.890 lr:0.0000100 epoch_Time:26874.0min: [2024-01-04 06:13:28,424][model8_pretrain.py][INFO] Epoch:[0/2](346300/4588595) loss:3.230 lr:0.0000100 epoch_Time:26873.0min: [2024-01-04 06:13:28,424][model8_pretrain.py][INFO] Epoch:[0/2](346300/4588595) loss:3.085 lr:0.0000100 epoch_Time:26873.0min: [2024-01-04 06:13:28,424][model8_pretrain.py][INFO] Epoch:[0/2](346300/4588595) loss:2.678 lr:0.0000100 epoch_Time:26873.0min: [2024-01-04 06:13:28,424][model8_pretrain.py][INFO] 
Epoch:[0/2](346300/4588595) loss:3.190 lr:0.0000100 epoch_Time:26873.0min: [2024-01-04 06:13:28,424][model8_pretrain.py][INFO] Epoch:[0/2](346300/4588595) loss:3.106 lr:0.0000100 epoch_Time:26873.0min: [2024-01-04 06:13:28,424][model8_pretrain.py][INFO] Epoch:[0/2](346300/4588595) loss:2.614 lr:0.0000100 epoch_Time:26873.0min: [2024-01-04 06:13:28,424][model8_pretrain.py][INFO] Epoch:[0/2](346300/4588595) loss:2.905 lr:0.0000100 epoch_Time:26873.0min: [2024-01-04 06:13:28,424][model8_pretrain.py][INFO] Epoch:[0/2](346300/4588595) loss:3.058 lr:0.0000100 epoch_Time:26873.0min: [2024-01-04 06:14:05,364][model8_pretrain.py][INFO] Epoch:[0/2](346400/4588595) loss:2.665 lr:0.0000100 epoch_Time:26872.0min: [2024-01-04 06:14:05,364][model8_pretrain.py][INFO] Epoch:[0/2](346400/4588595) loss:2.159 lr:0.0000100 epoch_Time:26872.0min: [2024-01-04 06:14:05,364][model8_pretrain.py][INFO] Epoch:[0/2](346400/4588595) loss:2.502 lr:0.0000100 epoch_Time:26872.0min: [2024-01-04 06:14:05,364][model8_pretrain.py][INFO] Epoch:[0/2](346400/4588595) loss:2.508 lr:0.0000100 epoch_Time:26872.0min: [2024-01-04 06:14:05,364][model8_pretrain.py][INFO] Epoch:[0/2](346400/4588595) loss:2.867 lr:0.0000100 epoch_Time:26872.0min: [2024-01-04 06:14:05,364][model8_pretrain.py][INFO] Epoch:[0/2](346400/4588595) loss:2.976 lr:0.0000100 epoch_Time:26872.0min: [2024-01-04 06:14:05,364][model8_pretrain.py][INFO] Epoch:[0/2](346400/4588595) loss:2.924 lr:0.0000100 epoch_Time:26872.0min: [2024-01-04 06:14:05,365][model8_pretrain.py][INFO] Epoch:[0/2](346400/4588595) loss:3.154 lr:0.0000100 epoch_Time:26872.0min: [2024-01-04 06:14:42,304][model8_pretrain.py][INFO] Epoch:[0/2](346500/4588595) loss:2.754 lr:0.0000100 epoch_Time:26872.0min: [2024-01-04 06:14:42,305][model8_pretrain.py][INFO] Epoch:[0/2](346500/4588595) loss:2.465 lr:0.0000100 epoch_Time:26872.0min: [2024-01-04 06:14:42,305][model8_pretrain.py][INFO] Epoch:[0/2](346500/4588595) loss:2.910 lr:0.0000100 epoch_Time:26872.0min: [2024-01-04 
06:14:42,305][model8_pretrain.py][INFO] Epoch:[0/2](346500/4588595) loss:3.559 lr:0.0000100 epoch_Time:26872.0min: [2024-01-04 06:14:42,305][model8_pretrain.py][INFO] Epoch:[0/2](346500/4588595) loss:2.613 lr:0.0000100 epoch_Time:26872.0min: [2024-01-04 06:14:42,305][model8_pretrain.py][INFO] Epoch:[0/2](346500/4588595) loss:3.094 lr:0.0000100 epoch_Time:26872.0min: [2024-01-04 06:14:42,305][model8_pretrain.py][INFO] Epoch:[0/2](346500/4588595) loss:2.611 lr:0.0000100 epoch_Time:26872.0min: [2024-01-04 06:14:42,305][model8_pretrain.py][INFO] Epoch:[0/2](346500/4588595) loss:3.040 lr:0.0000100 epoch_Time:26872.0min: [2024-01-04 06:15:19,243][model8_pretrain.py][INFO] Epoch:[0/2](346600/4588595) loss:2.690 lr:0.0000100 epoch_Time:26871.0min: [2024-01-04 06:15:19,243][model8_pretrain.py][INFO] Epoch:[0/2](346600/4588595) loss:2.653 lr:0.0000100 epoch_Time:26871.0min: [2024-01-04 06:15:19,243][model8_pretrain.py][INFO] Epoch:[0/2](346600/4588595) loss:3.169 lr:0.0000100 epoch_Time:26871.0min: [2024-01-04 06:15:19,243][model8_pretrain.py][INFO] Epoch:[0/2](346600/4588595) loss:2.528 lr:0.0000100 epoch_Time:26871.0min: [2024-01-04 06:15:19,243][model8_pretrain.py][INFO] Epoch:[0/2](346600/4588595) loss:2.852 lr:0.0000100 epoch_Time:26871.0min: [2024-01-04 06:15:19,243][model8_pretrain.py][INFO] Epoch:[0/2](346600/4588595) loss:3.203 lr:0.0000100 epoch_Time:26871.0min: [2024-01-04 06:15:19,243][model8_pretrain.py][INFO] Epoch:[0/2](346600/4588595) loss:2.641 lr:0.0000100 epoch_Time:26871.0min: [2024-01-04 06:15:19,243][model8_pretrain.py][INFO] Epoch:[0/2](346600/4588595) loss:2.599 lr:0.0000100 epoch_Time:26871.0min: [2024-01-04 06:15:56,180][model8_pretrain.py][INFO] Epoch:[0/2](346700/4588595) loss:2.613 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:15:56,180][model8_pretrain.py][INFO] Epoch:[0/2](346700/4588595) loss:2.312 lr:0.0000100 epoch_Time:26870.0min: [2024-01-04 06:15:56,180][model8_pretrain.py][INFO] Epoch:[0/2](346700/4588595) loss:3.009 lr:0.0000100 
epoch_Time:26870.0min: [2024-01-04 06:15:56,180][model8_pretrain.py][INFO] Epoch:[0/2](346700/4588595) loss:3.067 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:15:56,180][model8_pretrain.py][INFO] Epoch:[0/2](346700/4588595) loss:2.100 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:15:56,180][model8_pretrain.py][INFO] Epoch:[0/2](346700/4588595) loss:3.287 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:15:56,180][model8_pretrain.py][INFO] Epoch:[0/2](346700/4588595) loss:3.186 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:15:56,180][model8_pretrain.py][INFO] Epoch:[0/2](346700/4588595) loss:2.626 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:16:33,125][model8_pretrain.py][INFO] Epoch:[0/2](346800/4588595) loss:3.217 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:16:33,125][model8_pretrain.py][INFO] Epoch:[0/2](346800/4588595) loss:3.265 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:16:33,125][model8_pretrain.py][INFO] Epoch:[0/2](346800/4588595) loss:2.138 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:16:33,125][model8_pretrain.py][INFO] Epoch:[0/2](346800/4588595) loss:3.149 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:16:33,125][model8_pretrain.py][INFO] Epoch:[0/2](346800/4588595) loss:2.640 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:16:33,125][model8_pretrain.py][INFO] Epoch:[0/2](346800/4588595) loss:2.888 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:16:33,125][model8_pretrain.py][INFO] Epoch:[0/2](346800/4588595) loss:2.847 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:16:33,125][model8_pretrain.py][INFO] Epoch:[0/2](346800/4588595) loss:2.562 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:17:20,393][model8_pretrain.py][INFO] Epoch:[0/2](346900/4588595) loss:2.872 lr:0.0000100 epoch_Time:26870.0min: [2024-01-04 06:17:20,393][model8_pretrain.py][INFO] Epoch:[0/2](346900/4588595) loss:2.852 lr:0.0000100 epoch_Time:26870.0min: [2024-01-04 06:17:20,393][model8_pretrain.py][INFO] 
Epoch:[0/2](346900/4588595) loss:2.809 lr:0.0000100 epoch_Time:26870.0min: [2024-01-04 06:17:20,393][model8_pretrain.py][INFO] Epoch:[0/2](346900/4588595) loss:3.156 lr:0.0000100 epoch_Time:26870.0min: [2024-01-04 06:17:20,393][model8_pretrain.py][INFO] Epoch:[0/2](346900/4588595) loss:2.554 lr:0.0000100 epoch_Time:26870.0min: [2024-01-04 06:17:20,393][model8_pretrain.py][INFO] Epoch:[0/2](346900/4588595) loss:3.436 lr:0.0000100 epoch_Time:26870.0min: [2024-01-04 06:17:20,393][model8_pretrain.py][INFO] Epoch:[0/2](346900/4588595) loss:2.714 lr:0.0000100 epoch_Time:26870.0min: [2024-01-04 06:17:20,393][model8_pretrain.py][INFO] Epoch:[0/2](346900/4588595) loss:2.868 lr:0.0000100 epoch_Time:26870.0min: [2024-01-04 06:17:57,318][model8_pretrain.py][INFO] Epoch:[0/2](347000/4588595) loss:3.088 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:17:57,318][model8_pretrain.py][INFO] Epoch:[0/2](347000/4588595) loss:2.759 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:17:57,318][model8_pretrain.py][INFO] Epoch:[0/2](347000/4588595) loss:3.415 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:17:57,318][model8_pretrain.py][INFO] Epoch:[0/2](347000/4588595) loss:2.441 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:17:57,318][model8_pretrain.py][INFO] Epoch:[0/2](347000/4588595) loss:3.177 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:17:57,318][model8_pretrain.py][INFO] Epoch:[0/2](347000/4588595) loss:2.946 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:17:57,319][model8_pretrain.py][INFO] Epoch:[0/2](347000/4588595) loss:3.035 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:17:57,319][model8_pretrain.py][INFO] Epoch:[0/2](347000/4588595) loss:2.849 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:18:34,260][model8_pretrain.py][INFO] Epoch:[0/2](347100/4588595) loss:2.373 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:18:34,260][model8_pretrain.py][INFO] Epoch:[0/2](347100/4588595) loss:2.562 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 
06:18:34,260][model8_pretrain.py][INFO] Epoch:[0/2](347100/4588595) loss:2.825 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:18:34,260][model8_pretrain.py][INFO] Epoch:[0/2](347100/4588595) loss:2.765 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:18:34,260][model8_pretrain.py][INFO] Epoch:[0/2](347100/4588595) loss:2.522 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:18:34,260][model8_pretrain.py][INFO] Epoch:[0/2](347100/4588595) loss:2.957 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:18:34,260][model8_pretrain.py][INFO] Epoch:[0/2](347100/4588595) loss:3.097 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:18:34,260][model8_pretrain.py][INFO] Epoch:[0/2](347100/4588595) loss:3.113 lr:0.0000100 epoch_Time:26869.0min: [2024-01-04 06:19:11,197][model8_pretrain.py][INFO] Epoch:[0/2](347200/4588595) loss:3.199 lr:0.0000100 epoch_Time:26868.0min: [2024-01-04 06:19:11,197][model8_pretrain.py][INFO] Epoch:[0/2](347200/4588595) loss:2.674 lr:0.0000100 epoch_Time:26868.0min: [2024-01-04 06:19:11,197][model8_pretrain.py][INFO] Epoch:[0/2](347200/4588595) loss:3.037 lr:0.0000100 epoch_Time:26868.0min: [2024-01-04 06:19:11,197][model8_pretrain.py][INFO] Epoch:[0/2](347200/4588595) loss:2.668 lr:0.0000100 epoch_Time:26868.0min: [2024-01-04 06:19:11,197][model8_pretrain.py][INFO] Epoch:[0/2](347200/4588595) loss:3.247 lr:0.0000100 epoch_Time:26868.0min: [2024-01-04 06:19:11,197][model8_pretrain.py][INFO] Epoch:[0/2](347200/4588595) loss:2.734 lr:0.0000100 epoch_Time:26868.0min: [2024-01-04 06:19:11,197][model8_pretrain.py][INFO] Epoch:[0/2](347200/4588595) loss:3.533 lr:0.0000100 epoch_Time:26868.0min: [2024-01-04 06:19:11,197][model8_pretrain.py][INFO] Epoch:[0/2](347200/4588595) loss:2.666 lr:0.0000100 epoch_Time:26868.0min: [2024-01-04 06:19:48,139][model8_pretrain.py][INFO] Epoch:[0/2](347300/4588595) loss:3.172 lr:0.0000100 epoch_Time:26866.0min: [2024-01-04 06:19:48,139][model8_pretrain.py][INFO] Epoch:[0/2](347300/4588595) loss:2.993 lr:0.0000100 
epoch_Time:26866.0min: [2024-01-04 06:19:48,139][model8_pretrain.py][INFO] Epoch:[0/2](347300/4588595) loss:2.129 lr:0.0000100 epoch_Time:26866.0min: [2024-01-04 06:19:48,139][model8_pretrain.py][INFO] Epoch:[0/2](347300/4588595) loss:2.609 lr:0.0000100 epoch_Time:26866.0min: [2024-01-04 06:19:48,139][model8_pretrain.py][INFO] Epoch:[0/2](347300/4588595) loss:2.700 lr:0.0000100 epoch_Time:26866.0min: [2024-01-04 06:19:48,139][model8_pretrain.py][INFO] Epoch:[0/2](347300/4588595) loss:3.088 lr:0.0000100 epoch_Time:26866.0min: [2024-01-04 06:19:48,139][model8_pretrain.py][INFO] Epoch:[0/2](347300/4588595) loss:3.288 lr:0.0000100 epoch_Time:26866.0min: [2024-01-04 06:19:48,140][model8_pretrain.py][INFO] Epoch:[0/2](347300/4588595) loss:2.579 lr:0.0000100 epoch_Time:26866.0min: [2024-01-04 06:20:25,078][model8_pretrain.py][INFO] Epoch:[0/2](347400/4588595) loss:3.474 lr:0.0000100 epoch_Time:26866.0min: [2024-01-04 06:20:25,078][model8_pretrain.py][INFO] Epoch:[0/2](347400/4588595) loss:2.737 lr:0.0000100 epoch_Time:26866.0min: [2024-01-04 06:20:25,078][model8_pretrain.py][INFO] Epoch:[0/2](347400/4588595) loss:2.929 lr:0.0000100 epoch_Time:26866.0min: [2024-01-04 06:20:25,078][model8_pretrain.py][INFO] Epoch:[0/2](347400/4588595) loss:3.223 lr:0.0000100 epoch_Time:26866.0min: [2024-01-04 06:20:25,079][model8_pretrain.py][INFO] Epoch:[0/2](347400/4588595) loss:2.991 lr:0.0000100 epoch_Time:26866.0min: [2024-01-04 06:20:25,079][model8_pretrain.py][INFO] Epoch:[0/2](347400/4588595) loss:3.295 lr:0.0000100 epoch_Time:26866.0min: [2024-01-04 06:20:25,079][model8_pretrain.py][INFO] Epoch:[0/2](347400/4588595) loss:3.000 lr:0.0000100 epoch_Time:26866.0min: [2024-01-04 06:20:25,079][model8_pretrain.py][INFO] Epoch:[0/2](347400/4588595) loss:3.110 lr:0.0000100 epoch_Time:26866.0min: [2024-01-04 06:21:02,022][model8_pretrain.py][INFO] Epoch:[0/2](347500/4588595) loss:2.461 lr:0.0000100 epoch_Time:26865.0min: [2024-01-04 06:21:02,022][model8_pretrain.py][INFO] 
Epoch:[0/2](347500/4588595) loss:3.169 lr:0.0000100 epoch_Time:26865.0min: [2024-01-04 06:21:02,022][model8_pretrain.py][INFO] Epoch:[0/2](347500/4588595) loss:2.760 lr:0.0000100 epoch_Time:26865.0min: [2024-01-04 06:21:02,022][model8_pretrain.py][INFO] Epoch:[0/2](347500/4588595) loss:2.771 lr:0.0000100 epoch_Time:26865.0min: [2024-01-04 06:21:02,022][model8_pretrain.py][INFO] Epoch:[0/2](347500/4588595) loss:3.060 lr:0.0000100 epoch_Time:26865.0min: [2024-01-04 06:21:02,022][model8_pretrain.py][INFO] Epoch:[0/2](347500/4588595) loss:2.509 lr:0.0000100 epoch_Time:26865.0min: [2024-01-04 06:21:02,022][model8_pretrain.py][INFO] Epoch:[0/2](347500/4588595) loss:3.259 lr:0.0000100 epoch_Time:26865.0min: [2024-01-04 06:21:02,022][model8_pretrain.py][INFO] Epoch:[0/2](347500/4588595) loss:2.591 lr:0.0000100 epoch_Time:26865.0min: [2024-01-04 06:21:38,961][model8_pretrain.py][INFO] Epoch:[0/2](347600/4588595) loss:2.540 lr:0.0000100 epoch_Time:26865.0min: [2024-01-04 06:21:38,961][model8_pretrain.py][INFO] Epoch:[0/2](347600/4588595) loss:2.968 lr:0.0000100 epoch_Time:26865.0min: [2024-01-04 06:21:38,961][model8_pretrain.py][INFO] Epoch:[0/2](347600/4588595) loss:3.303 lr:0.0000100 epoch_Time:26865.0min: [2024-01-04 06:21:38,961][model8_pretrain.py][INFO] Epoch:[0/2](347600/4588595) loss:2.910 lr:0.0000100 epoch_Time:26865.0min: [2024-01-04 06:21:38,961][model8_pretrain.py][INFO] Epoch:[0/2](347600/4588595) loss:3.017 lr:0.0000100 epoch_Time:26865.0min: [2024-01-04 06:21:38,961][model8_pretrain.py][INFO] Epoch:[0/2](347600/4588595) loss:2.442 lr:0.0000100 epoch_Time:26865.0min: [2024-01-04 06:21:38,961][model8_pretrain.py][INFO] Epoch:[0/2](347600/4588595) loss:2.902 lr:0.0000100 epoch_Time:26865.0min: [2024-01-04 06:21:38,961][model8_pretrain.py][INFO] Epoch:[0/2](347600/4588595) loss:2.958 lr:0.0000100 epoch_Time:26865.0min: [2024-01-04 06:22:24,320][model8_pretrain.py][INFO] Epoch:[0/2](347700/4588595) loss:3.319 lr:0.0000100 epoch_Time:26865.0min: [2024-01-04 
06:22:24,320][model8_pretrain.py][INFO] Epoch:[0/2](347700/4588595) loss:2.717 lr:0.0000100 epoch_Time:26865.0min: [2024-01-04 06:22:24,320][model8_pretrain.py][INFO] Epoch:[0/2](347700/4588595) loss:2.631 lr:0.0000100 epoch_Time:26865.0min: [2024-01-04 06:22:24,320][model8_pretrain.py][INFO] Epoch:[0/2](347700/4588595) loss:2.781 lr:0.0000100 epoch_Time:26865.0min: [2024-01-04 06:22:24,320][model8_pretrain.py][INFO] Epoch:[0/2](347700/4588595) loss:2.662 lr:0.0000100 epoch_Time:26865.0min: [2024-01-04 06:22:24,320][model8_pretrain.py][INFO] Epoch:[0/2](347700/4588595) loss:3.363 lr:0.0000100 epoch_Time:26865.0min: [2024-01-04 06:22:24,320][model8_pretrain.py][INFO] Epoch:[0/2](347700/4588595) loss:2.337 lr:0.0000100 epoch_Time:26865.0min: [2024-01-04 06:22:24,320][model8_pretrain.py][INFO] Epoch:[0/2](347700/4588595) loss:2.414 lr:0.0000100 epoch_Time:26865.0min: [2024-01-04 06:23:02,990][model8_pretrain.py][INFO] Epoch:[0/2](347800/4588595) loss:2.504 lr:0.0000100 epoch_Time:26864.0min: [2024-01-04 06:23:02,990][model8_pretrain.py][INFO] Epoch:[0/2](347800/4588595) loss:3.158 lr:0.0000100 epoch_Time:26864.0min: [2024-01-04 06:23:02,990][model8_pretrain.py][INFO] Epoch:[0/2](347800/4588595) loss:2.729 lr:0.0000100 epoch_Time:26864.0min: [2024-01-04 06:23:02,990][model8_pretrain.py][INFO] Epoch:[0/2](347800/4588595) loss:3.051 lr:0.0000100 epoch_Time:26864.0min: [2024-01-04 06:23:02,990][model8_pretrain.py][INFO] Epoch:[0/2](347800/4588595) loss:2.747 lr:0.0000100 epoch_Time:26864.0min: [2024-01-04 06:23:02,990][model8_pretrain.py][INFO] Epoch:[0/2](347800/4588595) loss:2.613 lr:0.0000100 epoch_Time:26864.0min: [2024-01-04 06:23:02,990][model8_pretrain.py][INFO] Epoch:[0/2](347800/4588595) loss:3.100 lr:0.0000100 epoch_Time:26864.0min: [2024-01-04 06:23:02,990][model8_pretrain.py][INFO] Epoch:[0/2](347800/4588595) loss:2.706 lr:0.0000100 epoch_Time:26864.0min: [2024-01-04 06:23:39,944][model8_pretrain.py][INFO] Epoch:[0/2](347900/4588595) loss:2.924 lr:0.0000100 
epoch_Time:26864.0min: [2024-01-04 06:23:39,944][model8_pretrain.py][INFO] Epoch:[0/2](347900/4588595) loss:2.497 lr:0.0000100 epoch_Time:26864.0min: [2024-01-04 06:23:39,944][model8_pretrain.py][INFO] Epoch:[0/2](347900/4588595) loss:2.588 lr:0.0000100 epoch_Time:26864.0min: [2024-01-04 06:23:39,944][model8_pretrain.py][INFO] Epoch:[0/2](347900/4588595) loss:2.720 lr:0.0000100 epoch_Time:26864.0min: [2024-01-04 06:23:39,944][model8_pretrain.py][INFO] Epoch:[0/2](347900/4588595) loss:3.126 lr:0.0000100 epoch_Time:26864.0min: [2024-01-04 06:23:39,944][model8_pretrain.py][INFO] Epoch:[0/2](347900/4588595) loss:2.872 lr:0.0000100 epoch_Time:26864.0min: [2024-01-04 06:23:39,944][model8_pretrain.py][INFO] Epoch:[0/2](347900/4588595) loss:2.868 lr:0.0000100 epoch_Time:26864.0min: [2024-01-04 06:23:39,944][model8_pretrain.py][INFO] Epoch:[0/2](347900/4588595) loss:2.958 lr:0.0000100 epoch_Time:26864.0min: [2024-01-04 06:24:16,898][model8_pretrain.py][INFO] Epoch:[0/2](348000/4588595) loss:2.927 lr:0.0000100 epoch_Time:26863.0min: [2024-01-04 06:24:16,898][model8_pretrain.py][INFO] Epoch:[0/2](348000/4588595) loss:3.417 lr:0.0000100 epoch_Time:26863.0min: [2024-01-04 06:24:16,898][model8_pretrain.py][INFO] Epoch:[0/2](348000/4588595) loss:2.611 lr:0.0000100 epoch_Time:26863.0min: [2024-01-04 06:24:16,898][model8_pretrain.py][INFO] Epoch:[0/2](348000/4588595) loss:2.800 lr:0.0000100 epoch_Time:26863.0min: [2024-01-04 06:24:16,898][model8_pretrain.py][INFO] Epoch:[0/2](348000/4588595) loss:2.268 lr:0.0000100 epoch_Time:26863.0min: [2024-01-04 06:24:16,898][model8_pretrain.py][INFO] Epoch:[0/2](348000/4588595) loss:2.554 lr:0.0000100 epoch_Time:26863.0min: [2024-01-04 06:24:16,898][model8_pretrain.py][INFO] Epoch:[0/2](348000/4588595) loss:3.056 lr:0.0000100 epoch_Time:26863.0min: [2024-01-04 06:24:16,898][model8_pretrain.py][INFO] Epoch:[0/2](348000/4588595) loss:2.724 lr:0.0000100 epoch_Time:26863.0min: [2024-01-04 06:24:53,864][model8_pretrain.py][INFO] 
Epoch:[0/2](348100/4588595) loss:3.099 lr:0.0000100 epoch_Time:26862.0min: [2024-01-04 06:24:53,864][model8_pretrain.py][INFO] Epoch:[0/2](348100/4588595) loss:2.944 lr:0.0000100 epoch_Time:26862.0min: [2024-01-04 06:24:53,864][model8_pretrain.py][INFO] Epoch:[0/2](348100/4588595) loss:3.243 lr:0.0000100 epoch_Time:26862.0min: [2024-01-04 06:24:53,864][model8_pretrain.py][INFO] Epoch:[0/2](348100/4588595) loss:2.774 lr:0.0000100 epoch_Time:26862.0min: [2024-01-04 06:24:53,864][model8_pretrain.py][INFO] Epoch:[0/2](348100/4588595) loss:2.728 lr:0.0000100 epoch_Time:26862.0min: [2024-01-04 06:24:53,864][model8_pretrain.py][INFO] Epoch:[0/2](348100/4588595) loss:2.949 lr:0.0000100 epoch_Time:26862.0min: [2024-01-04 06:24:53,864][model8_pretrain.py][INFO] Epoch:[0/2](348100/4588595) loss:2.905 lr:0.0000100 epoch_Time:26862.0min: [2024-01-04 06:24:53,864][model8_pretrain.py][INFO] Epoch:[0/2](348100/4588595) loss:3.057 lr:0.0000100 epoch_Time:26862.0min: [2024-01-04 06:25:30,831][model8_pretrain.py][INFO] Epoch:[0/2](348200/4588595) loss:3.191 lr:0.0000100 epoch_Time:26862.0min: [2024-01-04 06:25:30,831][model8_pretrain.py][INFO] Epoch:[0/2](348200/4588595) loss:2.684 lr:0.0000100 epoch_Time:26861.0min: [2024-01-04 06:25:30,831][model8_pretrain.py][INFO] Epoch:[0/2](348200/4588595) loss:3.035 lr:0.0000100 epoch_Time:26861.0min: [2024-01-04 06:25:30,831][model8_pretrain.py][INFO] Epoch:[0/2](348200/4588595) loss:2.898 lr:0.0000100 epoch_Time:26861.0min: [2024-01-04 06:25:30,831][model8_pretrain.py][INFO] Epoch:[0/2](348200/4588595) loss:2.980 lr:0.0000100 epoch_Time:26862.0min: [2024-01-04 06:25:30,831][model8_pretrain.py][INFO] Epoch:[0/2](348200/4588595) loss:2.781 lr:0.0000100 epoch_Time:26861.0min: [2024-01-04 06:25:30,831][model8_pretrain.py][INFO] Epoch:[0/2](348200/4588595) loss:2.661 lr:0.0000100 epoch_Time:26861.0min: [2024-01-04 06:25:30,831][model8_pretrain.py][INFO] Epoch:[0/2](348200/4588595) loss:3.101 lr:0.0000100 epoch_Time:26861.0min: [2024-01-04 
06:26:07,788][model8_pretrain.py][INFO] Epoch:[0/2](348300/4588595) loss:2.851 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:26:07,788][model8_pretrain.py][INFO] Epoch:[0/2](348300/4588595) loss:3.268 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:26:07,788][model8_pretrain.py][INFO] Epoch:[0/2](348300/4588595) loss:2.550 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:26:07,788][model8_pretrain.py][INFO] Epoch:[0/2](348300/4588595) loss:3.340 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:26:07,788][model8_pretrain.py][INFO] Epoch:[0/2](348300/4588595) loss:2.534 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:26:07,788][model8_pretrain.py][INFO] Epoch:[0/2](348300/4588595) loss:2.683 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:26:07,788][model8_pretrain.py][INFO] Epoch:[0/2](348300/4588595) loss:2.665 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:26:07,788][model8_pretrain.py][INFO] Epoch:[0/2](348300/4588595) loss:2.706 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:26:44,745][model8_pretrain.py][INFO] Epoch:[0/2](348400/4588595) loss:3.105 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:26:44,745][model8_pretrain.py][INFO] Epoch:[0/2](348400/4588595) loss:2.869 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:26:44,745][model8_pretrain.py][INFO] Epoch:[0/2](348400/4588595) loss:3.220 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:26:44,745][model8_pretrain.py][INFO] Epoch:[0/2](348400/4588595) loss:1.547 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:26:44,745][model8_pretrain.py][INFO] Epoch:[0/2](348400/4588595) loss:2.228 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:26:44,745][model8_pretrain.py][INFO] Epoch:[0/2](348400/4588595) loss:3.403 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:26:44,745][model8_pretrain.py][INFO] Epoch:[0/2](348400/4588595) loss:2.536 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:26:44,745][model8_pretrain.py][INFO] Epoch:[0/2](348400/4588595) loss:3.077 lr:0.0000100 
epoch_Time:26860.0min: [2024-01-04 06:27:30,191][model8_pretrain.py][INFO] Epoch:[0/2](348500/4588595) loss:2.834 lr:0.0000100 epoch_Time:26861.0min: [2024-01-04 06:27:30,191][model8_pretrain.py][INFO] Epoch:[0/2](348500/4588595) loss:2.864 lr:0.0000100 epoch_Time:26861.0min: [2024-01-04 06:27:30,191][model8_pretrain.py][INFO] Epoch:[0/2](348500/4588595) loss:3.135 lr:0.0000100 epoch_Time:26861.0min: [2024-01-04 06:27:30,196][model8_pretrain.py][INFO] Epoch:[0/2](348500/4588595) loss:3.513 lr:0.0000100 epoch_Time:26861.0min: [2024-01-04 06:27:30,196][model8_pretrain.py][INFO] Epoch:[0/2](348500/4588595) loss:2.619 lr:0.0000100 epoch_Time:26861.0min: [2024-01-04 06:27:30,196][model8_pretrain.py][INFO] Epoch:[0/2](348500/4588595) loss:3.027 lr:0.0000100 epoch_Time:26861.0min: [2024-01-04 06:27:30,196][model8_pretrain.py][INFO] Epoch:[0/2](348500/4588595) loss:2.544 lr:0.0000100 epoch_Time:26861.0min: [2024-01-04 06:27:30,196][model8_pretrain.py][INFO] Epoch:[0/2](348500/4588595) loss:2.959 lr:0.0000100 epoch_Time:26861.0min: [2024-01-04 06:28:08,779][model8_pretrain.py][INFO] Epoch:[0/2](348600/4588595) loss:2.874 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:28:08,779][model8_pretrain.py][INFO] Epoch:[0/2](348600/4588595) loss:3.189 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:28:08,779][model8_pretrain.py][INFO] Epoch:[0/2](348600/4588595) loss:3.040 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:28:08,779][model8_pretrain.py][INFO] Epoch:[0/2](348600/4588595) loss:2.168 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:28:08,779][model8_pretrain.py][INFO] Epoch:[0/2](348600/4588595) loss:2.882 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:28:08,780][model8_pretrain.py][INFO] Epoch:[0/2](348600/4588595) loss:3.247 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:28:08,780][model8_pretrain.py][INFO] Epoch:[0/2](348600/4588595) loss:1.723 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:28:08,780][model8_pretrain.py][INFO] 
Epoch:[0/2](348600/4588595) loss:2.142 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:28:45,713][model8_pretrain.py][INFO] Epoch:[0/2](348700/4588595) loss:2.903 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:28:45,713][model8_pretrain.py][INFO] Epoch:[0/2](348700/4588595) loss:2.930 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:28:45,713][model8_pretrain.py][INFO] Epoch:[0/2](348700/4588595) loss:3.051 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:28:45,713][model8_pretrain.py][INFO] Epoch:[0/2](348700/4588595) loss:2.751 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:28:45,713][model8_pretrain.py][INFO] Epoch:[0/2](348700/4588595) loss:2.966 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:28:45,713][model8_pretrain.py][INFO] Epoch:[0/2](348700/4588595) loss:2.854 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:28:45,713][model8_pretrain.py][INFO] Epoch:[0/2](348700/4588595) loss:2.795 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:28:45,713][model8_pretrain.py][INFO] Epoch:[0/2](348700/4588595) loss:2.861 lr:0.0000100 epoch_Time:26860.0min: [2024-01-04 06:29:22,643][model8_pretrain.py][INFO] Epoch:[0/2](348800/4588595) loss:3.088 lr:0.0000100 epoch_Time:26858.0min: [2024-01-04 06:29:22,643][model8_pretrain.py][INFO] Epoch:[0/2](348800/4588595) loss:2.814 lr:0.0000100 epoch_Time:26858.0min: [2024-01-04 06:29:22,643][model8_pretrain.py][INFO] Epoch:[0/2](348800/4588595) loss:2.503 lr:0.0000100 epoch_Time:26858.0min: [2024-01-04 06:29:22,643][model8_pretrain.py][INFO] Epoch:[0/2](348800/4588595) loss:2.627 lr:0.0000100 epoch_Time:26858.0min: [2024-01-04 06:29:22,643][model8_pretrain.py][INFO] Epoch:[0/2](348800/4588595) loss:2.801 lr:0.0000100 epoch_Time:26858.0min: [2024-01-04 06:29:22,643][model8_pretrain.py][INFO] Epoch:[0/2](348800/4588595) loss:2.848 lr:0.0000100 epoch_Time:26858.0min: [2024-01-04 06:29:22,644][model8_pretrain.py][INFO] Epoch:[0/2](348800/4588595) loss:3.027 lr:0.0000100 epoch_Time:26858.0min: [2024-01-04 
06:29:22,644][model8_pretrain.py][INFO] Epoch:[0/2](348800/4588595) loss:3.030 lr:0.0000100 epoch_Time:26858.0min: [2024-01-04 06:29:59,563][model8_pretrain.py][INFO] Epoch:[0/2](348900/4588595) loss:2.577 lr:0.0000100 epoch_Time:26857.0min: [2024-01-04 06:29:59,563][model8_pretrain.py][INFO] Epoch:[0/2](348900/4588595) loss:3.262 lr:0.0000100 epoch_Time:26857.0min: [2024-01-04 06:29:59,563][model8_pretrain.py][INFO] Epoch:[0/2](348900/4588595) loss:3.192 lr:0.0000100 epoch_Time:26857.0min: [2024-01-04 06:29:59,563][model8_pretrain.py][INFO] Epoch:[0/2](348900/4588595) loss:2.232 lr:0.0000100 epoch_Time:26857.0min: [2024-01-04 06:29:59,563][model8_pretrain.py][INFO] Epoch:[0/2](348900/4588595) loss:2.616 lr:0.0000100 epoch_Time:26857.0min: [2024-01-04 06:29:59,563][model8_pretrain.py][INFO] Epoch:[0/2](348900/4588595) loss:3.007 lr:0.0000100 epoch_Time:26857.0min: [2024-01-04 06:29:59,563][model8_pretrain.py][INFO] Epoch:[0/2](348900/4588595) loss:2.754 lr:0.0000100 epoch_Time:26857.0min: [2024-01-04 06:29:59,564][model8_pretrain.py][INFO] Epoch:[0/2](348900/4588595) loss:2.602 lr:0.0000100 epoch_Time:26857.0min: [2024-01-04 06:30:36,478][model8_pretrain.py][INFO] Epoch:[0/2](349000/4588595) loss:2.811 lr:0.0000100 epoch_Time:26857.0min: [2024-01-04 06:30:36,478][model8_pretrain.py][INFO] Epoch:[0/2](349000/4588595) loss:2.832 lr:0.0000100 epoch_Time:26857.0min: [2024-01-04 06:30:36,478][model8_pretrain.py][INFO] Epoch:[0/2](349000/4588595) loss:2.662 lr:0.0000100 epoch_Time:26857.0min: [2024-01-04 06:30:36,478][model8_pretrain.py][INFO] Epoch:[0/2](349000/4588595) loss:3.047 lr:0.0000100 epoch_Time:26857.0min: [2024-01-04 06:30:36,478][model8_pretrain.py][INFO] Epoch:[0/2](349000/4588595) loss:2.484 lr:0.0000100 epoch_Time:26857.0min: [2024-01-04 06:30:36,478][model8_pretrain.py][INFO] Epoch:[0/2](349000/4588595) loss:3.443 lr:0.0000100 epoch_Time:26857.0min: [2024-01-04 06:30:36,478][model8_pretrain.py][INFO] Epoch:[0/2](349000/4588595) loss:2.879 lr:0.0000100 
epoch_Time:26857.0min: [2024-01-04 06:30:36,478][model8_pretrain.py][INFO] Epoch:[0/2](349000/4588595) loss:2.903 lr:0.0000100 epoch_Time:26857.0min: [2024-01-04 06:31:13,416][model8_pretrain.py][INFO] Epoch:[0/2](349100/4588595) loss:3.010 lr:0.0000100 epoch_Time:26856.0min: [2024-01-04 06:31:13,416][model8_pretrain.py][INFO] Epoch:[0/2](349100/4588595) loss:3.447 lr:0.0000100 epoch_Time:26856.0min: [2024-01-04 06:31:13,416][model8_pretrain.py][INFO] Epoch:[0/2](349100/4588595) loss:2.611 lr:0.0000100 epoch_Time:26856.0min: [2024-01-04 06:31:13,417][model8_pretrain.py][INFO] Epoch:[0/2](349100/4588595) loss:2.865 lr:0.0000100 epoch_Time:26856.0min: [2024-01-04 06:31:13,417][model8_pretrain.py][INFO] Epoch:[0/2](349100/4588595) loss:3.130 lr:0.0000100 epoch_Time:26856.0min: [2024-01-04 06:31:13,417][model8_pretrain.py][INFO] Epoch:[0/2](349100/4588595) loss:2.750 lr:0.0000100 epoch_Time:26856.0min: [2024-01-04 06:31:13,417][model8_pretrain.py][INFO] Epoch:[0/2](349100/4588595) loss:2.812 lr:0.0000100 epoch_Time:26856.0min: [2024-01-04 06:31:13,417][model8_pretrain.py][INFO] Epoch:[0/2](349100/4588595) loss:2.738 lr:0.0000100 epoch_Time:26856.0min: [2024-01-04 06:31:50,350][model8_pretrain.py][INFO] Epoch:[0/2](349200/4588595) loss:3.319 lr:0.0000100 epoch_Time:26854.0min: [2024-01-04 06:31:50,350][model8_pretrain.py][INFO] Epoch:[0/2](349200/4588595) loss:3.429 lr:0.0000100 epoch_Time:26854.0min: [2024-01-04 06:31:50,350][model8_pretrain.py][INFO] Epoch:[0/2](349200/4588595) loss:3.388 lr:0.0000100 epoch_Time:26854.0min: [2024-01-04 06:31:50,350][model8_pretrain.py][INFO] Epoch:[0/2](349200/4588595) loss:2.369 lr:0.0000100 epoch_Time:26854.0min: [2024-01-04 06:31:50,350][model8_pretrain.py][INFO] Epoch:[0/2](349200/4588595) loss:3.162 lr:0.0000100 epoch_Time:26854.0min: [2024-01-04 06:31:50,350][model8_pretrain.py][INFO] Epoch:[0/2](349200/4588595) loss:2.974 lr:0.0000100 epoch_Time:26854.0min: [2024-01-04 06:31:50,351][model8_pretrain.py][INFO] 
Epoch:[0/2](349200/4588595) loss:2.457 lr:0.0000100 epoch_Time:26854.0min: [2024-01-04 06:31:50,351][model8_pretrain.py][INFO] Epoch:[0/2](349200/4588595) loss:2.867 lr:0.0000100 epoch_Time:26854.0min: [2024-01-04 06:32:32,251][model8_pretrain.py][INFO] Epoch:[0/2](349300/4588595) loss:3.099 lr:0.0000100 epoch_Time:26855.0min: [2024-01-04 06:32:32,251][model8_pretrain.py][INFO] Epoch:[0/2](349300/4588595) loss:2.553 lr:0.0000100 epoch_Time:26855.0min: [2024-01-04 06:32:32,252][model8_pretrain.py][INFO] Epoch:[0/2](349300/4588595) loss:2.182 lr:0.0000100 epoch_Time:26855.0min: [2024-01-04 06:32:32,252][model8_pretrain.py][INFO] Epoch:[0/2](349300/4588595) loss:3.272 lr:0.0000100 epoch_Time:26855.0min: [2024-01-04 06:32:32,252][model8_pretrain.py][INFO] Epoch:[0/2](349300/4588595) loss:2.529 lr:0.0000100 epoch_Time:26855.0min: [2024-01-04 06:32:32,252][model8_pretrain.py][INFO] Epoch:[0/2](349300/4588595) loss:2.779 lr:0.0000100 epoch_Time:26855.0min: [2024-01-04 06:32:32,252][model8_pretrain.py][INFO] Epoch:[0/2](349300/4588595) loss:2.820 lr:0.0000100 epoch_Time:26855.0min: [2024-01-04 06:32:32,252][model8_pretrain.py][INFO] Epoch:[0/2](349300/4588595) loss:3.078 lr:0.0000100 epoch_Time:26855.0min: [2024-01-04 06:33:14,332][model8_pretrain.py][INFO] Epoch:[0/2](349400/4588595) loss:3.468 lr:0.0000100 epoch_Time:26855.0min: [2024-01-04 06:33:14,332][model8_pretrain.py][INFO] Epoch:[0/2](349400/4588595) loss:2.404 lr:0.0000100 epoch_Time:26855.0min: [2024-01-04 06:33:14,332][model8_pretrain.py][INFO] Epoch:[0/2](349400/4588595) loss:2.972 lr:0.0000100 epoch_Time:26855.0min: [2024-01-04 06:33:14,332][model8_pretrain.py][INFO] Epoch:[0/2](349400/4588595) loss:3.080 lr:0.0000100 epoch_Time:26855.0min: [2024-01-04 06:33:14,332][model8_pretrain.py][INFO] Epoch:[0/2](349400/4588595) loss:3.351 lr:0.0000100 epoch_Time:26855.0min: [2024-01-04 06:33:14,332][model8_pretrain.py][INFO] Epoch:[0/2](349400/4588595) loss:2.609 lr:0.0000100 epoch_Time:26855.0min: [2024-01-04 
06:33:14,332][model8_pretrain.py][INFO] Epoch:[0/2](349400/4588595) loss:2.122 lr:0.0000100 epoch_Time:26855.0min: [2024-01-04 06:33:14,332][model8_pretrain.py][INFO] Epoch:[0/2](349400/4588595) loss:2.747 lr:0.0000100 epoch_Time:26855.0min: [2024-01-04 06:33:51,263][model8_pretrain.py][INFO] Epoch:[0/2](349500/4588595) loss:2.910 lr:0.0000100 epoch_Time:26854.0min: [2024-01-04 06:33:51,263][model8_pretrain.py][INFO] Epoch:[0/2](349500/4588595) loss:2.630 lr:0.0000100 epoch_Time:26854.0min: [2024-01-04 06:33:51,263][model8_pretrain.py][INFO] Epoch:[0/2](349500/4588595) loss:2.645 lr:0.0000100 epoch_Time:26854.0min: [2024-01-04 06:33:51,263][model8_pretrain.py][INFO] Epoch:[0/2](349500/4588595) loss:3.245 lr:0.0000100 epoch_Time:26854.0min: [2024-01-04 06:33:51,263][model8_pretrain.py][INFO] Epoch:[0/2](349500/4588595) loss:3.692 lr:0.0000100 epoch_Time:26854.0min: [2024-01-04 06:33:51,263][model8_pretrain.py][INFO] Epoch:[0/2](349500/4588595) loss:2.877 lr:0.0000100 epoch_Time:26854.0min: [2024-01-04 06:33:51,263][model8_pretrain.py][INFO] Epoch:[0/2](349500/4588595) loss:2.513 lr:0.0000100 epoch_Time:26854.0min: [2024-01-04 06:33:51,263][model8_pretrain.py][INFO] Epoch:[0/2](349500/4588595) loss:2.591 lr:0.0000100 epoch_Time:26854.0min: [2024-01-04 06:34:28,204][model8_pretrain.py][INFO] Epoch:[0/2](349600/4588595) loss:3.303 lr:0.0000100 epoch_Time:26854.0min: [2024-01-04 06:34:28,204][model8_pretrain.py][INFO] Epoch:[0/2](349600/4588595) loss:2.546 lr:0.0000100 epoch_Time:26854.0min: [2024-01-04 06:34:28,204][model8_pretrain.py][INFO] Epoch:[0/2](349600/4588595) loss:3.188 lr:0.0000100 epoch_Time:26854.0min: [2024-01-04 06:34:28,204][model8_pretrain.py][INFO] Epoch:[0/2](349600/4588595) loss:3.046 lr:0.0000100 epoch_Time:26854.0min: [2024-01-04 06:34:28,204][model8_pretrain.py][INFO] Epoch:[0/2](349600/4588595) loss:3.272 lr:0.0000100 epoch_Time:26854.0min: [2024-01-04 06:34:28,204][model8_pretrain.py][INFO] Epoch:[0/2](349600/4588595) loss:3.333 lr:0.0000100 
epoch_Time:26854.0min: [2024-01-04 06:34:28,204][model8_pretrain.py][INFO] Epoch:[0/2](349600/4588595) loss:2.204 lr:0.0000100 epoch_Time:26854.0min: [2024-01-04 06:34:28,204][model8_pretrain.py][INFO] Epoch:[0/2](349600/4588595) loss:2.308 lr:0.0000100 epoch_Time:26854.0min: [2024-01-04 06:35:05,141][model8_pretrain.py][INFO] Epoch:[0/2](349700/4588595) loss:2.980 lr:0.0000100 epoch_Time:26852.0min: [2024-01-04 06:35:05,141][model8_pretrain.py][INFO] Epoch:[0/2](349700/4588595) loss:2.822 lr:0.0000100 epoch_Time:26852.0min: [2024-01-04 06:35:05,141][model8_pretrain.py][INFO] Epoch:[0/2](349700/4588595) loss:2.201 lr:0.0000100 epoch_Time:26852.0min: [2024-01-04 06:35:05,141][model8_pretrain.py][INFO] Epoch:[0/2](349700/4588595) loss:2.809 lr:0.0000100 epoch_Time:26852.0min: [2024-01-04 06:35:05,141][model8_pretrain.py][INFO] Epoch:[0/2](349700/4588595) loss:2.569 lr:0.0000100 epoch_Time:26852.0min: [2024-01-04 06:35:05,141][model8_pretrain.py][INFO] Epoch:[0/2](349700/4588595) loss:2.832 lr:0.0000100 epoch_Time:26852.0min: [2024-01-04 06:35:05,141][model8_pretrain.py][INFO] Epoch:[0/2](349700/4588595) loss:2.586 lr:0.0000100 epoch_Time:26852.0min: [2024-01-04 06:35:05,142][model8_pretrain.py][INFO] Epoch:[0/2](349700/4588595) loss:2.863 lr:0.0000100 epoch_Time:26852.0min: [2024-01-04 06:35:42,085][model8_pretrain.py][INFO] Epoch:[0/2](349800/4588595) loss:3.195 lr:0.0000100 epoch_Time:26852.0min: [2024-01-04 06:35:42,085][model8_pretrain.py][INFO] Epoch:[0/2](349800/4588595) loss:2.818 lr:0.0000100 epoch_Time:26852.0min: [2024-01-04 06:35:42,086][model8_pretrain.py][INFO] Epoch:[0/2](349800/4588595) loss:2.713 lr:0.0000100 epoch_Time:26852.0min: [2024-01-04 06:35:42,086][model8_pretrain.py][INFO] Epoch:[0/2](349800/4588595) loss:2.337 lr:0.0000100 epoch_Time:26852.0min: [2024-01-04 06:35:42,086][model8_pretrain.py][INFO] Epoch:[0/2](349800/4588595) loss:2.633 lr:0.0000100 epoch_Time:26852.0min: [2024-01-04 06:35:42,086][model8_pretrain.py][INFO] 
Epoch:[0/2](349800/4588595) loss:2.670 lr:0.0000100 epoch_Time:26852.0min: [2024-01-04 06:35:42,086][model8_pretrain.py][INFO] Epoch:[0/2](349800/4588595) loss:2.575 lr:0.0000100 epoch_Time:26852.0min: [2024-01-04 06:35:42,086][model8_pretrain.py][INFO] Epoch:[0/2](349800/4588595) loss:2.792 lr:0.0000100 epoch_Time:26852.0min: [2024-01-04 06:36:19,016][model8_pretrain.py][INFO] Epoch:[0/2](349900/4588595) loss:2.502 lr:0.0000100 epoch_Time:26851.0min: [2024-01-04 06:36:19,017][model8_pretrain.py][INFO] Epoch:[0/2](349900/4588595) loss:2.999 lr:0.0000100 epoch_Time:26851.0min: [2024-01-04 06:36:19,017][model8_pretrain.py][INFO] Epoch:[0/2](349900/4588595) loss:3.593 lr:0.0000100 epoch_Time:26851.0min: [2024-01-04 06:36:19,017][model8_pretrain.py][INFO] Epoch:[0/2](349900/4588595) loss:3.015 lr:0.0000100 epoch_Time:26851.0min: [2024-01-04 06:36:19,017][model8_pretrain.py][INFO] Epoch:[0/2](349900/4588595) loss:2.296 lr:0.0000100 epoch_Time:26851.0min: [2024-01-04 06:36:19,017][model8_pretrain.py][INFO] Epoch:[0/2](349900/4588595) loss:3.011 lr:0.0000100 epoch_Time:26851.0min: [2024-01-04 06:36:19,017][model8_pretrain.py][INFO] Epoch:[0/2](349900/4588595) loss:2.679 lr:0.0000100 epoch_Time:26851.0min: [2024-01-04 06:36:19,017][model8_pretrain.py][INFO] Epoch:[0/2](349900/4588595) loss:3.568 lr:0.0000100 epoch_Time:26851.0min: [2024-01-04 06:36:55,950][model8_pretrain.py][INFO] Epoch:[0/2](350000/4588595) loss:2.972 lr:0.0000100 epoch_Time:26850.0min: [2024-01-04 06:36:55,950][model8_pretrain.py][INFO] Epoch:[0/2](350000/4588595) loss:2.532 lr:0.0000100 epoch_Time:26850.0min: [2024-01-04 06:36:55,950][model8_pretrain.py][INFO] Epoch:[0/2](350000/4588595) loss:2.897 lr:0.0000100 epoch_Time:26850.0min: [2024-01-04 06:36:55,950][model8_pretrain.py][INFO] Epoch:[0/2](350000/4588595) loss:2.843 lr:0.0000100 epoch_Time:26850.0min: [2024-01-04 06:36:55,951][model8_pretrain.py][INFO] Epoch:[0/2](350000/4588595) loss:3.056 lr:0.0000100 epoch_Time:26850.0min: [2024-01-04 
06:36:55,950][model8_pretrain.py][INFO] Epoch:[0/2](350000/4588595) loss:2.813 lr:0.0000100 epoch_Time:26850.0min: [2024-01-04 06:36:55,951][model8_pretrain.py][INFO] Epoch:[0/2](350000/4588595) loss:3.404 lr:0.0000100 epoch_Time:26850.0min: [2024-01-04 06:36:55,951][model8_pretrain.py][INFO] Epoch:[0/2](350000/4588595) loss:2.308 lr:0.0000100 epoch_Time:26850.0min: [2024-01-04 06:37:37,916][model8_pretrain.py][INFO] Epoch:[0/2](350100/4588595) loss:2.622 lr:0.0000100 epoch_Time:26851.0min: [2024-01-04 06:37:37,916][model8_pretrain.py][INFO] Epoch:[0/2](350100/4588595) loss:2.991 lr:0.0000100 epoch_Time:26851.0min: [2024-01-04 06:37:37,916][model8_pretrain.py][INFO] Epoch:[0/2](350100/4588595) loss:3.228 lr:0.0000100 epoch_Time:26851.0min: [2024-01-04 06:37:37,916][model8_pretrain.py][INFO] Epoch:[0/2](350100/4588595) loss:2.581 lr:0.0000100 epoch_Time:26851.0min: [2024-01-04 06:37:37,916][model8_pretrain.py][INFO] Epoch:[0/2](350100/4588595) loss:2.799 lr:0.0000100 epoch_Time:26851.0min: [2024-01-04 06:37:37,916][model8_pretrain.py][INFO] Epoch:[0/2](350100/4588595) loss:3.132 lr:0.0000100 epoch_Time:26851.0min: [2024-01-04 06:37:37,916][model8_pretrain.py][INFO] Epoch:[0/2](350100/4588595) loss:2.734 lr:0.0000100 epoch_Time:26851.0min: [2024-01-04 06:37:37,916][model8_pretrain.py][INFO] Epoch:[0/2](350100/4588595) loss:2.580 lr:0.0000100 epoch_Time:26851.0min: [2024-01-04 06:38:19,756][model8_pretrain.py][INFO] Epoch:[0/2](350200/4588595) loss:2.728 lr:0.0000100 epoch_Time:26850.0min: [2024-01-04 06:38:19,756][model8_pretrain.py][INFO] Epoch:[0/2](350200/4588595) loss:2.315 lr:0.0000100 epoch_Time:26850.0min: [2024-01-04 06:38:19,756][model8_pretrain.py][INFO] Epoch:[0/2](350200/4588595) loss:3.228 lr:0.0000100 epoch_Time:26850.0min: [2024-01-04 06:38:19,756][model8_pretrain.py][INFO] Epoch:[0/2](350200/4588595) loss:2.828 lr:0.0000100 epoch_Time:26850.0min: [2024-01-04 06:38:19,756][model8_pretrain.py][INFO] Epoch:[0/2](350200/4588595) loss:2.999 lr:0.0000100 
epoch_Time:26850.0min: [2024-01-04 06:38:19,756][model8_pretrain.py][INFO] Epoch:[0/2](350200/4588595) loss:2.952 lr:0.0000100 epoch_Time:26850.0min: [2024-01-04 06:38:19,756][model8_pretrain.py][INFO] Epoch:[0/2](350200/4588595) loss:3.018 lr:0.0000100 epoch_Time:26850.0min: [2024-01-04 06:38:19,756][model8_pretrain.py][INFO] Epoch:[0/2](350200/4588595) loss:2.825 lr:0.0000100 epoch_Time:26850.0min: [2024-01-04 06:38:56,686][model8_pretrain.py][INFO] Epoch:[0/2](350300/4588595) loss:2.891 lr:0.0000100 epoch_Time:26849.0min: [2024-01-04 06:38:56,686][model8_pretrain.py][INFO] Epoch:[0/2](350300/4588595) loss:2.530 lr:0.0000100 epoch_Time:26849.0min: [2024-01-04 06:38:56,686][model8_pretrain.py][INFO] Epoch:[0/2](350300/4588595) loss:2.907 lr:0.0000100 epoch_Time:26849.0min: [2024-01-04 06:38:56,686][model8_pretrain.py][INFO] Epoch:[0/2](350300/4588595) loss:3.129 lr:0.0000100 epoch_Time:26849.0min: [2024-01-04 06:38:56,686][model8_pretrain.py][INFO] Epoch:[0/2](350300/4588595) loss:3.152 lr:0.0000100 epoch_Time:26849.0min: [2024-01-04 06:38:56,686][model8_pretrain.py][INFO] Epoch:[0/2](350300/4588595) loss:2.993 lr:0.0000100 epoch_Time:26849.0min: [2024-01-04 06:38:56,686][model8_pretrain.py][INFO] Epoch:[0/2](350300/4588595) loss:3.034 lr:0.0000100 epoch_Time:26849.0min: [2024-01-04 06:38:56,686][model8_pretrain.py][INFO] Epoch:[0/2](350300/4588595) loss:2.872 lr:0.0000100 epoch_Time:26849.0min: [2024-01-04 06:39:33,607][model8_pretrain.py][INFO] Epoch:[0/2](350400/4588595) loss:2.472 lr:0.0000100 epoch_Time:26849.0min: [2024-01-04 06:39:33,607][model8_pretrain.py][INFO] Epoch:[0/2](350400/4588595) loss:3.109 lr:0.0000100 epoch_Time:26849.0min: [2024-01-04 06:39:33,607][model8_pretrain.py][INFO] Epoch:[0/2](350400/4588595) loss:2.646 lr:0.0000100 epoch_Time:26849.0min: [2024-01-04 06:39:33,607][model8_pretrain.py][INFO] Epoch:[0/2](350400/4588595) loss:2.063 lr:0.0000100 epoch_Time:26849.0min: [2024-01-04 06:39:33,607][model8_pretrain.py][INFO] 
Epoch:[0/2](350400/4588595) loss:3.317 lr:0.0000100 epoch_Time:26849.0min: [2024-01-04 06:39:33,607][model8_pretrain.py][INFO] Epoch:[0/2](350400/4588595) loss:2.740 lr:0.0000100 epoch_Time:26849.0min: [2024-01-04 06:39:33,607][model8_pretrain.py][INFO] Epoch:[0/2](350400/4588595) loss:2.715 lr:0.0000100 epoch_Time:26849.0min: [2024-01-04 06:39:33,607][model8_pretrain.py][INFO] Epoch:[0/2](350400/4588595) loss:2.542 lr:0.0000100 epoch_Time:26849.0min: [2024-01-04 06:40:10,541][model8_pretrain.py][INFO] Epoch:[0/2](350500/4588595) loss:3.334 lr:0.0000100 epoch_Time:26848.0min: [2024-01-04 06:40:10,541][model8_pretrain.py][INFO] Epoch:[0/2](350500/4588595) loss:3.089 lr:0.0000100 epoch_Time:26848.0min: [2024-01-04 06:40:10,541][model8_pretrain.py][INFO] Epoch:[0/2](350500/4588595) loss:2.872 lr:0.0000100 epoch_Time:26848.0min: [2024-01-04 06:40:10,541][model8_pretrain.py][INFO] Epoch:[0/2](350500/4588595) loss:2.553 lr:0.0000100 epoch_Time:26848.0min: [2024-01-04 06:40:10,541][model8_pretrain.py][INFO] Epoch:[0/2](350500/4588595) loss:2.899 lr:0.0000100 epoch_Time:26848.0min: [2024-01-04 06:40:10,541][model8_pretrain.py][INFO] Epoch:[0/2](350500/4588595) loss:2.532 lr:0.0000100 epoch_Time:26848.0min: [2024-01-04 06:40:10,541][model8_pretrain.py][INFO] Epoch:[0/2](350500/4588595) loss:2.443 lr:0.0000100 epoch_Time:26848.0min: [2024-01-04 06:40:10,541][model8_pretrain.py][INFO] Epoch:[0/2](350500/4588595) loss:2.716 lr:0.0000100 epoch_Time:26848.0min: [2024-01-04 06:40:47,472][model8_pretrain.py][INFO] Epoch:[0/2](350600/4588595) loss:3.096 lr:0.0000100 epoch_Time:26847.0min: [2024-01-04 06:40:47,472][model8_pretrain.py][INFO] Epoch:[0/2](350600/4588595) loss:3.275 lr:0.0000100 epoch_Time:26847.0min: [2024-01-04 06:40:47,472][model8_pretrain.py][INFO] Epoch:[0/2](350600/4588595) loss:2.708 lr:0.0000100 epoch_Time:26847.0min: [2024-01-04 06:40:47,472][model8_pretrain.py][INFO] Epoch:[0/2](350600/4588595) loss:2.983 lr:0.0000100 epoch_Time:26847.0min: [2024-01-04 
06:40:47,472][model8_pretrain.py][INFO] Epoch:[0/2](350600/4588595) loss:2.400 lr:0.0000100 epoch_Time:26847.0min: [2024-01-04 06:40:47,472][model8_pretrain.py][INFO] Epoch:[0/2](350600/4588595) loss:2.835 lr:0.0000100 epoch_Time:26847.0min: [2024-01-04 06:40:47,472][model8_pretrain.py][INFO] Epoch:[0/2](350600/4588595) loss:2.673 lr:0.0000100 epoch_Time:26847.0min: [2024-01-04 06:40:47,472][model8_pretrain.py][INFO] Epoch:[0/2](350600/4588595) loss:3.094 lr:0.0000100 epoch_Time:26847.0min: [2024-01-04 06:41:24,409][model8_pretrain.py][INFO] Epoch:[0/2](350700/4588595) loss:3.257 lr:0.0000100 epoch_Time:26846.0min: [2024-01-04 06:41:24,409][model8_pretrain.py][INFO] Epoch:[0/2](350700/4588595) loss:3.389 lr:0.0000100 epoch_Time:26846.0min: [2024-01-04 06:41:24,409][model8_pretrain.py][INFO] Epoch:[0/2](350700/4588595) loss:2.875 lr:0.0000100 epoch_Time:26846.0min: [2024-01-04 06:41:24,409][model8_pretrain.py][INFO] Epoch:[0/2](350700/4588595) loss:2.963 lr:0.0000100 epoch_Time:26846.0min: [2024-01-04 06:41:24,409][model8_pretrain.py][INFO] Epoch:[0/2](350700/4588595) loss:3.166 lr:0.0000100 epoch_Time:26846.0min: [2024-01-04 06:41:24,409][model8_pretrain.py][INFO] Epoch:[0/2](350700/4588595) loss:2.411 lr:0.0000100 epoch_Time:26846.0min: [2024-01-04 06:41:24,410][model8_pretrain.py][INFO] Epoch:[0/2](350700/4588595) loss:2.647 lr:0.0000100 epoch_Time:26846.0min: [2024-01-04 06:41:24,410][model8_pretrain.py][INFO] Epoch:[0/2](350700/4588595) loss:2.789 lr:0.0000100 epoch_Time:26846.0min: [2024-01-04 06:42:01,363][model8_pretrain.py][INFO] Epoch:[0/2](350800/4588595) loss:2.517 lr:0.0000100 epoch_Time:26845.0min: [2024-01-04 06:42:01,363][model8_pretrain.py][INFO] Epoch:[0/2](350800/4588595) loss:3.215 lr:0.0000100 epoch_Time:26845.0min: [2024-01-04 06:42:01,363][model8_pretrain.py][INFO] Epoch:[0/2](350800/4588595) loss:2.881 lr:0.0000100 epoch_Time:26845.0min: [2024-01-04 06:42:01,363][model8_pretrain.py][INFO] Epoch:[0/2](350800/4588595) loss:2.815 lr:0.0000100 
epoch_Time:26845.0min: [2024-01-04 06:42:01,363][model8_pretrain.py][INFO] Epoch:[0/2](350800/4588595) loss:2.888 lr:0.0000100 epoch_Time:26845.0min: [2024-01-04 06:42:01,363][model8_pretrain.py][INFO] Epoch:[0/2](350800/4588595) loss:2.728 lr:0.0000100 epoch_Time:26845.0min: [2024-01-04 06:42:01,363][model8_pretrain.py][INFO] Epoch:[0/2](350800/4588595) loss:2.700 lr:0.0000100 epoch_Time:26845.0min: [2024-01-04 06:42:01,363][model8_pretrain.py][INFO] Epoch:[0/2](350800/4588595) loss:2.948 lr:0.0000100 epoch_Time:26845.0min: [2024-01-04 06:42:43,512][model8_pretrain.py][INFO] Epoch:[0/2](350900/4588595) loss:2.994 lr:0.0000100 epoch_Time:26846.0min: [2024-01-04 06:42:43,512][model8_pretrain.py][INFO] Epoch:[0/2](350900/4588595) loss:2.701 lr:0.0000100 epoch_Time:26846.0min: [2024-01-04 06:42:43,512][model8_pretrain.py][INFO] Epoch:[0/2](350900/4588595) loss:2.910 lr:0.0000100 epoch_Time:26846.0min: [2024-01-04 06:42:43,512][model8_pretrain.py][INFO] Epoch:[0/2](350900/4588595) loss:2.733 lr:0.0000100 epoch_Time:26846.0min: [2024-01-04 06:42:43,516][model8_pretrain.py][INFO] Epoch:[0/2](350900/4588595) loss:2.853 lr:0.0000100 epoch_Time:26846.0min: [2024-01-04 06:42:43,517][model8_pretrain.py][INFO] Epoch:[0/2](350900/4588595) loss:3.049 lr:0.0000100 epoch_Time:26846.0min: [2024-01-04 06:42:43,517][model8_pretrain.py][INFO] Epoch:[0/2](350900/4588595) loss:2.902 lr:0.0000100 epoch_Time:26846.0min: [2024-01-04 06:42:43,517][model8_pretrain.py][INFO] Epoch:[0/2](350900/4588595) loss:2.380 lr:0.0000100 epoch_Time:26846.0min: [2024-01-04 06:43:25,360][model8_pretrain.py][INFO] Epoch:[0/2](351000/4588595) loss:2.372 lr:0.0000100 epoch_Time:26846.0min: [2024-01-04 06:43:25,360][model8_pretrain.py][INFO] Epoch:[0/2](351000/4588595) loss:2.837 lr:0.0000100 epoch_Time:26846.0min: [2024-01-04 06:43:25,360][model8_pretrain.py][INFO] Epoch:[0/2](351000/4588595) loss:2.851 lr:0.0000100 epoch_Time:26846.0min: [2024-01-04 06:43:25,360][model8_pretrain.py][INFO] 
Epoch:[0/2](351000/4588595) loss:2.518 lr:0.0000100 epoch_Time:26846.0min: [2024-01-04 06:43:25,360][model8_pretrain.py][INFO] Epoch:[0/2](351000/4588595) loss:2.992 lr:0.0000100 epoch_Time:26846.0min: [2024-01-04 06:43:25,360][model8_pretrain.py][INFO] Epoch:[0/2](351000/4588595) loss:3.101 lr:0.0000100 epoch_Time:26846.0min: [2024-01-04 06:43:25,360][model8_pretrain.py][INFO] Epoch:[0/2](351000/4588595) loss:3.200 lr:0.0000100 epoch_Time:26846.0min: [2024-01-04 06:43:25,361][model8_pretrain.py][INFO] Epoch:[0/2](351000/4588595) loss:3.407 lr:0.0000100 epoch_Time:26846.0min: [2024-01-04 06:44:02,302][model8_pretrain.py][INFO] Epoch:[0/2](351100/4588595) loss:2.722 lr:0.0000100 epoch_Time:26845.0min: [2024-01-04 06:44:02,302][model8_pretrain.py][INFO] Epoch:[0/2](351100/4588595) loss:2.854 lr:0.0000100 epoch_Time:26845.0min: [2024-01-04 06:44:02,302][model8_pretrain.py][INFO] Epoch:[0/2](351100/4588595) loss:3.082 lr:0.0000100 epoch_Time:26845.0min: [2024-01-04 06:44:02,302][model8_pretrain.py][INFO] Epoch:[0/2](351100/4588595) loss:3.319 lr:0.0000100 epoch_Time:26845.0min: [2024-01-04 06:44:02,302][model8_pretrain.py][INFO] Epoch:[0/2](351100/4588595) loss:3.083 lr:0.0000100 epoch_Time:26845.0min: [2024-01-04 06:44:02,302][model8_pretrain.py][INFO] Epoch:[0/2](351100/4588595) loss:2.479 lr:0.0000100 epoch_Time:26845.0min: [2024-01-04 06:44:02,302][model8_pretrain.py][INFO] Epoch:[0/2](351100/4588595) loss:2.780 lr:0.0000100 epoch_Time:26845.0min: [2024-01-04 06:44:02,302][model8_pretrain.py][INFO] Epoch:[0/2](351100/4588595) loss:3.045 lr:0.0000100 epoch_Time:26845.0min: [2024-01-04 06:44:39,250][model8_pretrain.py][INFO] Epoch:[0/2](351200/4588595) loss:2.759 lr:0.0000100 epoch_Time:26844.0min: [2024-01-04 06:44:39,250][model8_pretrain.py][INFO] Epoch:[0/2](351200/4588595) loss:3.140 lr:0.0000100 epoch_Time:26844.0min: [2024-01-04 06:44:39,250][model8_pretrain.py][INFO] Epoch:[0/2](351200/4588595) loss:2.876 lr:0.0000100 epoch_Time:26844.0min: [2024-01-04 
06:44:39,250][model8_pretrain.py][INFO] Epoch:[0/2](351200/4588595) loss:2.809 lr:0.0000100 epoch_Time:26844.0min: [2024-01-04 06:44:39,250][model8_pretrain.py][INFO] Epoch:[0/2](351200/4588595) loss:2.913 lr:0.0000100 epoch_Time:26844.0min: [2024-01-04 06:44:39,250][model8_pretrain.py][INFO] Epoch:[0/2](351200/4588595) loss:2.594 lr:0.0000100 epoch_Time:26844.0min: [2024-01-04 06:44:39,250][model8_pretrain.py][INFO] Epoch:[0/2](351200/4588595) loss:2.701 lr:0.0000100 epoch_Time:26844.0min: [2024-01-04 06:44:39,250][model8_pretrain.py][INFO] Epoch:[0/2](351200/4588595) loss:2.782 lr:0.0000100 epoch_Time:26844.0min: [2024-01-04 06:45:16,188][model8_pretrain.py][INFO] Epoch:[0/2](351300/4588595) loss:2.786 lr:0.0000100 epoch_Time:26843.0min: [2024-01-04 06:45:16,188][model8_pretrain.py][INFO] Epoch:[0/2](351300/4588595) loss:2.551 lr:0.0000100 epoch_Time:26843.0min: [2024-01-04 06:45:16,188][model8_pretrain.py][INFO] Epoch:[0/2](351300/4588595) loss:2.932 lr:0.0000100 epoch_Time:26843.0min: [2024-01-04 06:45:16,188][model8_pretrain.py][INFO] Epoch:[0/2](351300/4588595) loss:3.246 lr:0.0000100 epoch_Time:26843.0min: [2024-01-04 06:45:16,188][model8_pretrain.py][INFO] Epoch:[0/2](351300/4588595) loss:3.154 lr:0.0000100 epoch_Time:26843.0min: [2024-01-04 06:45:16,188][model8_pretrain.py][INFO] Epoch:[0/2](351300/4588595) loss:2.199 lr:0.0000100 epoch_Time:26843.0min: [2024-01-04 06:45:16,188][model8_pretrain.py][INFO] Epoch:[0/2](351300/4588595) loss:2.683 lr:0.0000100 epoch_Time:26843.0min: [2024-01-04 06:45:16,189][model8_pretrain.py][INFO] Epoch:[0/2](351300/4588595) loss:2.612 lr:0.0000100 epoch_Time:26843.0min: [2024-01-04 06:45:53,132][model8_pretrain.py][INFO] Epoch:[0/2](351400/4588595) loss:2.388 lr:0.0000100 epoch_Time:26842.0min: [2024-01-04 06:45:53,132][model8_pretrain.py][INFO] Epoch:[0/2](351400/4588595) loss:2.557 lr:0.0000100 epoch_Time:26842.0min: [2024-01-04 06:45:53,132][model8_pretrain.py][INFO] Epoch:[0/2](351400/4588595) loss:2.954 lr:0.0000100 
epoch_Time:26842.0min: [2024-01-04 06:45:53,132][model8_pretrain.py][INFO] Epoch:[0/2](351400/4588595) loss:2.541 lr:0.0000100 epoch_Time:26842.0min: [2024-01-04 06:45:53,132][model8_pretrain.py][INFO] Epoch:[0/2](351400/4588595) loss:2.943 lr:0.0000100 epoch_Time:26842.0min: [2024-01-04 06:45:53,132][model8_pretrain.py][INFO] Epoch:[0/2](351400/4588595) loss:2.667 lr:0.0000100 epoch_Time:26842.0min: [2024-01-04 06:45:53,132][model8_pretrain.py][INFO] Epoch:[0/2](351400/4588595) loss:3.001 lr:0.0000100 epoch_Time:26842.0min: [2024-01-04 06:45:53,132][model8_pretrain.py][INFO] Epoch:[0/2](351400/4588595) loss:2.370 lr:0.0000100 epoch_Time:26842.0min: [2024-01-04 06:46:30,076][model8_pretrain.py][INFO] Epoch:[0/2](351500/4588595) loss:2.336 lr:0.0000100 epoch_Time:26842.0min: [2024-01-04 06:46:30,076][model8_pretrain.py][INFO] Epoch:[0/2](351500/4588595) loss:2.999 lr:0.0000100 epoch_Time:26842.0min: [2024-01-04 06:46:30,076][model8_pretrain.py][INFO] Epoch:[0/2](351500/4588595) loss:3.533 lr:0.0000100 epoch_Time:26842.0min: [2024-01-04 06:46:30,076][model8_pretrain.py][INFO] Epoch:[0/2](351500/4588595) loss:3.239 lr:0.0000100 epoch_Time:26842.0min: [2024-01-04 06:46:30,076][model8_pretrain.py][INFO] Epoch:[0/2](351500/4588595) loss:2.991 lr:0.0000100 epoch_Time:26842.0min: [2024-01-04 06:46:30,076][model8_pretrain.py][INFO] Epoch:[0/2](351500/4588595) loss:2.929 lr:0.0000100 epoch_Time:26842.0min: [2024-01-04 06:46:30,076][model8_pretrain.py][INFO] Epoch:[0/2](351500/4588595) loss:2.608 lr:0.0000100 epoch_Time:26842.0min: [2024-01-04 06:46:30,077][model8_pretrain.py][INFO] Epoch:[0/2](351500/4588595) loss:2.709 lr:0.0000100 epoch_Time:26842.0min: [2024-01-04 06:47:07,024][model8_pretrain.py][INFO] Epoch:[0/2](351600/4588595) loss:2.558 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:47:07,024][model8_pretrain.py][INFO] Epoch:[0/2](351600/4588595) loss:2.582 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:47:07,024][model8_pretrain.py][INFO] 
Epoch:[0/2](351600/4588595) loss:2.989 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:47:07,024][model8_pretrain.py][INFO] Epoch:[0/2](351600/4588595) loss:3.200 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:47:07,024][model8_pretrain.py][INFO] Epoch:[0/2](351600/4588595) loss:2.901 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:47:07,024][model8_pretrain.py][INFO] Epoch:[0/2](351600/4588595) loss:2.801 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:47:07,024][model8_pretrain.py][INFO] Epoch:[0/2](351600/4588595) loss:2.305 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:47:07,025][model8_pretrain.py][INFO] Epoch:[0/2](351600/4588595) loss:2.495 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:47:45,701][model8_pretrain.py][INFO] Epoch:[0/2](351700/4588595) loss:2.891 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:47:45,701][model8_pretrain.py][INFO] Epoch:[0/2](351700/4588595) loss:2.591 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:47:45,701][model8_pretrain.py][INFO] Epoch:[0/2](351700/4588595) loss:3.175 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:47:45,701][model8_pretrain.py][INFO] Epoch:[0/2](351700/4588595) loss:2.690 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:47:45,701][model8_pretrain.py][INFO] Epoch:[0/2](351700/4588595) loss:2.817 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:47:45,701][model8_pretrain.py][INFO] Epoch:[0/2](351700/4588595) loss:3.287 lr:0.0000100 epoch_Time:26841.0min: [2024-01-04 06:47:45,701][model8_pretrain.py][INFO] Epoch:[0/2](351700/4588595) loss:2.826 lr:0.0000100 epoch_Time:26841.0min: [2024-01-04 06:47:45,701][model8_pretrain.py][INFO] Epoch:[0/2](351700/4588595) loss:3.221 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:48:31,140][model8_pretrain.py][INFO] Epoch:[0/2](351800/4588595) loss:2.304 lr:0.0000100 epoch_Time:26841.0min: [2024-01-04 06:48:31,140][model8_pretrain.py][INFO] Epoch:[0/2](351800/4588595) loss:3.169 lr:0.0000100 epoch_Time:26841.0min: [2024-01-04 
06:48:31,140][model8_pretrain.py][INFO] Epoch:[0/2](351800/4588595) loss:2.901 lr:0.0000100 epoch_Time:26841.0min: [2024-01-04 06:48:31,140][model8_pretrain.py][INFO] Epoch:[0/2](351800/4588595) loss:3.308 lr:0.0000100 epoch_Time:26841.0min: [2024-01-04 06:48:31,140][model8_pretrain.py][INFO] Epoch:[0/2](351800/4588595) loss:2.974 lr:0.0000100 epoch_Time:26841.0min: [2024-01-04 06:48:31,140][model8_pretrain.py][INFO] Epoch:[0/2](351800/4588595) loss:2.968 lr:0.0000100 epoch_Time:26841.0min: [2024-01-04 06:48:31,140][model8_pretrain.py][INFO] Epoch:[0/2](351800/4588595) loss:3.288 lr:0.0000100 epoch_Time:26841.0min: [2024-01-04 06:48:31,140][model8_pretrain.py][INFO] Epoch:[0/2](351800/4588595) loss:2.933 lr:0.0000100 epoch_Time:26841.0min: [2024-01-04 06:49:08,088][model8_pretrain.py][INFO] Epoch:[0/2](351900/4588595) loss:3.243 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:49:08,088][model8_pretrain.py][INFO] Epoch:[0/2](351900/4588595) loss:3.050 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:49:08,088][model8_pretrain.py][INFO] Epoch:[0/2](351900/4588595) loss:2.987 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:49:08,088][model8_pretrain.py][INFO] Epoch:[0/2](351900/4588595) loss:2.784 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:49:08,088][model8_pretrain.py][INFO] Epoch:[0/2](351900/4588595) loss:2.416 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:49:08,088][model8_pretrain.py][INFO] Epoch:[0/2](351900/4588595) loss:2.739 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:49:08,088][model8_pretrain.py][INFO] Epoch:[0/2](351900/4588595) loss:3.327 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:49:08,088][model8_pretrain.py][INFO] Epoch:[0/2](351900/4588595) loss:2.416 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:49:45,026][model8_pretrain.py][INFO] Epoch:[0/2](352000/4588595) loss:2.647 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:49:45,026][model8_pretrain.py][INFO] Epoch:[0/2](352000/4588595) loss:3.063 lr:0.0000100 
epoch_Time:26840.0min: [2024-01-04 06:49:45,026][model8_pretrain.py][INFO] Epoch:[0/2](352000/4588595) loss:3.054 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:49:45,026][model8_pretrain.py][INFO] Epoch:[0/2](352000/4588595) loss:2.917 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:49:45,026][model8_pretrain.py][INFO] Epoch:[0/2](352000/4588595) loss:3.447 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:49:45,026][model8_pretrain.py][INFO] Epoch:[0/2](352000/4588595) loss:3.013 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:49:45,026][model8_pretrain.py][INFO] Epoch:[0/2](352000/4588595) loss:2.531 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:49:45,026][model8_pretrain.py][INFO] Epoch:[0/2](352000/4588595) loss:2.577 lr:0.0000100 epoch_Time:26840.0min: [2024-01-04 06:50:21,975][model8_pretrain.py][INFO] Epoch:[0/2](352100/4588595) loss:2.886 lr:0.0000100 epoch_Time:26838.0min: [2024-01-04 06:50:21,975][model8_pretrain.py][INFO] Epoch:[0/2](352100/4588595) loss:3.196 lr:0.0000100 epoch_Time:26838.0min: [2024-01-04 06:50:21,975][model8_pretrain.py][INFO] Epoch:[0/2](352100/4588595) loss:2.750 lr:0.0000100 epoch_Time:26838.0min: [2024-01-04 06:50:21,975][model8_pretrain.py][INFO] Epoch:[0/2](352100/4588595) loss:2.818 lr:0.0000100 epoch_Time:26838.0min: [2024-01-04 06:50:21,975][model8_pretrain.py][INFO] Epoch:[0/2](352100/4588595) loss:2.665 lr:0.0000100 epoch_Time:26838.0min: [2024-01-04 06:50:21,975][model8_pretrain.py][INFO] Epoch:[0/2](352100/4588595) loss:2.981 lr:0.0000100 epoch_Time:26838.0min: [2024-01-04 06:50:21,975][model8_pretrain.py][INFO] Epoch:[0/2](352100/4588595) loss:2.691 lr:0.0000100 epoch_Time:26838.0min: [2024-01-04 06:50:21,975][model8_pretrain.py][INFO] Epoch:[0/2](352100/4588595) loss:2.139 lr:0.0000100 epoch_Time:26838.0min: [2024-01-04 06:50:58,905][model8_pretrain.py][INFO] Epoch:[0/2](352200/4588595) loss:3.142 lr:0.0000100 epoch_Time:26837.0min: [2024-01-04 06:50:58,905][model8_pretrain.py][INFO] 
Epoch:[0/2](352200/4588595) loss:2.723 lr:0.0000100 epoch_Time:26837.0min: [2024-01-04 06:50:58,905][model8_pretrain.py][INFO] Epoch:[0/2](352200/4588595) loss:2.432 lr:0.0000100 epoch_Time:26837.0min: [2024-01-04 06:50:58,905][model8_pretrain.py][INFO] Epoch:[0/2](352200/4588595) loss:3.153 lr:0.0000100 epoch_Time:26837.0min: [2024-01-04 06:50:58,905][model8_pretrain.py][INFO] Epoch:[0/2](352200/4588595) loss:2.539 lr:0.0000100 epoch_Time:26837.0min: [2024-01-04 06:50:58,905][model8_pretrain.py][INFO] Epoch:[0/2](352200/4588595) loss:3.081 lr:0.0000100 epoch_Time:26837.0min: [2024-01-04 06:50:58,905][model8_pretrain.py][INFO] Epoch:[0/2](352200/4588595) loss:3.190 lr:0.0000100 epoch_Time:26837.0min: [2024-01-04 06:50:58,905][model8_pretrain.py][INFO] Epoch:[0/2](352200/4588595) loss:3.536 lr:0.0000100 epoch_Time:26837.0min: [2024-01-04 06:51:35,844][model8_pretrain.py][INFO] Epoch:[0/2](352300/4588595) loss:2.925 lr:0.0000100 epoch_Time:26837.0min: [2024-01-04 06:51:35,844][model8_pretrain.py][INFO] Epoch:[0/2](352300/4588595) loss:3.293 lr:0.0000100 epoch_Time:26837.0min: [2024-01-04 06:51:35,844][model8_pretrain.py][INFO] Epoch:[0/2](352300/4588595) loss:2.796 lr:0.0000100 epoch_Time:26837.0min: [2024-01-04 06:51:35,844][model8_pretrain.py][INFO] Epoch:[0/2](352300/4588595) loss:2.822 lr:0.0000100 epoch_Time:26837.0min: [2024-01-04 06:51:35,844][model8_pretrain.py][INFO] Epoch:[0/2](352300/4588595) loss:3.226 lr:0.0000100 epoch_Time:26837.0min: [2024-01-04 06:51:35,844][model8_pretrain.py][INFO] Epoch:[0/2](352300/4588595) loss:3.186 lr:0.0000100 epoch_Time:26837.0min: [2024-01-04 06:51:35,845][model8_pretrain.py][INFO] Epoch:[0/2](352300/4588595) loss:2.673 lr:0.0000100 epoch_Time:26837.0min: [2024-01-04 06:51:35,845][model8_pretrain.py][INFO] Epoch:[0/2](352300/4588595) loss:3.196 lr:0.0000100 epoch_Time:26837.0min: [2024-01-04 06:52:12,790][model8_pretrain.py][INFO] Epoch:[0/2](352400/4588595) loss:2.429 lr:0.0000100 epoch_Time:26836.0min: [2024-01-04 
06:52:12,790][model8_pretrain.py][INFO] Epoch:[0/2](352400/4588595) loss:2.390 lr:0.0000100 epoch_Time:26836.0min: [2024-01-04 06:52:12,790][model8_pretrain.py][INFO] Epoch:[0/2](352400/4588595) loss:3.188 lr:0.0000100 epoch_Time:26836.0min: [2024-01-04 06:52:12,790][model8_pretrain.py][INFO] Epoch:[0/2](352400/4588595) loss:2.803 lr:0.0000100 epoch_Time:26836.0min: [2024-01-04 06:52:12,790][model8_pretrain.py][INFO] Epoch:[0/2](352400/4588595) loss:3.226 lr:0.0000100 epoch_Time:26836.0min: [2024-01-04 06:52:12,790][model8_pretrain.py][INFO] Epoch:[0/2](352400/4588595) loss:3.034 lr:0.0000100 epoch_Time:26836.0min: [2024-01-04 06:52:12,791][model8_pretrain.py][INFO] Epoch:[0/2](352400/4588595) loss:2.606 lr:0.0000100 epoch_Time:26836.0min: [2024-01-04 06:52:12,791][model8_pretrain.py][INFO] Epoch:[0/2](352400/4588595) loss:2.602 lr:0.0000100 epoch_Time:26836.0min: [2024-01-04 06:52:51,447][model8_pretrain.py][INFO] Epoch:[0/2](352500/4588595) loss:2.565 lr:0.0000100 epoch_Time:26835.0min: [2024-01-04 06:52:51,447][model8_pretrain.py][INFO] Epoch:[0/2](352500/4588595) loss:2.992 lr:0.0000100 epoch_Time:26835.0min: [2024-01-04 06:52:51,447][model8_pretrain.py][INFO] Epoch:[0/2](352500/4588595) loss:2.943 lr:0.0000100 epoch_Time:26835.0min: [2024-01-04 06:52:51,447][model8_pretrain.py][INFO] Epoch:[0/2](352500/4588595) loss:3.302 lr:0.0000100 epoch_Time:26835.0min: [2024-01-04 06:52:51,447][model8_pretrain.py][INFO] Epoch:[0/2](352500/4588595) loss:3.148 lr:0.0000100 epoch_Time:26835.0min: [2024-01-04 06:52:51,447][model8_pretrain.py][INFO] Epoch:[0/2](352500/4588595) loss:3.022 lr:0.0000100 epoch_Time:26835.0min: [2024-01-04 06:52:51,447][model8_pretrain.py][INFO] Epoch:[0/2](352500/4588595) loss:2.900 lr:0.0000100 epoch_Time:26835.0min: [2024-01-04 06:52:51,447][model8_pretrain.py][INFO] Epoch:[0/2](352500/4588595) loss:2.670 lr:0.0000100 epoch_Time:26835.0min: [2024-01-04 06:53:37,168][model8_pretrain.py][INFO] Epoch:[0/2](352600/4588595) loss:2.817 lr:0.0000100 
epoch_Time:26837.0min: [2024-01-04 06:53:37,169][model8_pretrain.py][INFO] Epoch:[0/2](352600/4588595) loss:2.926 lr:0.0000100 epoch_Time:26837.0min: [2024-01-04 06:53:37,169][model8_pretrain.py][INFO] Epoch:[0/2](352600/4588595) loss:2.082 lr:0.0000100 epoch_Time:26837.0min: [2024-01-04 06:53:37,169][model8_pretrain.py][INFO] Epoch:[0/2](352600/4588595) loss:2.470 lr:0.0000100 epoch_Time:26837.0min: [2024-01-04 06:53:37,169][model8_pretrain.py][INFO] Epoch:[0/2](352600/4588595) loss:2.822 lr:0.0000100 epoch_Time:26837.0min: [2024-01-04 06:53:37,169][model8_pretrain.py][INFO] Epoch:[0/2](352600/4588595) loss:3.070 lr:0.0000100 epoch_Time:26837.0min: [2024-01-04 06:53:37,169][model8_pretrain.py][INFO] Epoch:[0/2](352600/4588595) loss:2.877 lr:0.0000100 epoch_Time:26837.0min: [2024-01-04 06:53:37,169][model8_pretrain.py][INFO] Epoch:[0/2](352600/4588595) loss:2.971 lr:0.0000100 epoch_Time:26837.0min: [2024-01-04 06:54:14,191][model8_pretrain.py][INFO] Epoch:[0/2](352700/4588595) loss:2.308 lr:0.0000100 epoch_Time:26835.0min: [2024-01-04 06:54:14,191][model8_pretrain.py][INFO] Epoch:[0/2](352700/4588595) loss:3.179 lr:0.0000100 epoch_Time:26835.0min: [2024-01-04 06:54:14,191][model8_pretrain.py][INFO] Epoch:[0/2](352700/4588595) loss:2.607 lr:0.0000100 epoch_Time:26835.0min: [2024-01-04 06:54:14,191][model8_pretrain.py][INFO] Epoch:[0/2](352700/4588595) loss:2.817 lr:0.0000100 epoch_Time:26835.0min: [2024-01-04 06:54:14,191][model8_pretrain.py][INFO] Epoch:[0/2](352700/4588595) loss:2.701 lr:0.0000100 epoch_Time:26835.0min: [2024-01-04 06:54:14,191][model8_pretrain.py][INFO] Epoch:[0/2](352700/4588595) loss:3.079 lr:0.0000100 epoch_Time:26835.0min: [2024-01-04 06:54:14,192][model8_pretrain.py][INFO] Epoch:[0/2](352700/4588595) loss:3.105 lr:0.0000100 epoch_Time:26835.0min: [2024-01-04 06:54:14,192][model8_pretrain.py][INFO] Epoch:[0/2](352700/4588595) loss:2.751 lr:0.0000100 epoch_Time:26835.0min: [2024-01-04 06:54:51,137][model8_pretrain.py][INFO] 
Epoch:[0/2](352800/4588595) loss:2.869 lr:0.0000100 epoch_Time:26834.0min: [2024-01-04 06:54:51,137][model8_pretrain.py][INFO] Epoch:[0/2](352800/4588595) loss:2.496 lr:0.0000100 epoch_Time:26834.0min: [2024-01-04 06:54:51,137][model8_pretrain.py][INFO] Epoch:[0/2](352800/4588595) loss:3.098 lr:0.0000100 epoch_Time:26834.0min: [2024-01-04 06:54:51,137][model8_pretrain.py][INFO] Epoch:[0/2](352800/4588595) loss:3.077 lr:0.0000100 epoch_Time:26834.0min: [2024-01-04 06:54:51,137][model8_pretrain.py][INFO] Epoch:[0/2](352800/4588595) loss:3.088 lr:0.0000100 epoch_Time:26834.0min: [2024-01-04 06:54:51,137][model8_pretrain.py][INFO] Epoch:[0/2](352800/4588595) loss:3.110 lr:0.0000100 epoch_Time:26834.0min: [2024-01-04 06:54:51,137][model8_pretrain.py][INFO] Epoch:[0/2](352800/4588595) loss:3.551 lr:0.0000100 epoch_Time:26834.0min: [2024-01-04 06:54:51,137][model8_pretrain.py][INFO] Epoch:[0/2](352800/4588595) loss:2.605 lr:0.0000100 epoch_Time:26834.0min: [2024-01-04 06:55:28,093][model8_pretrain.py][INFO] Epoch:[0/2](352900/4588595) loss:3.015 lr:0.0000100 epoch_Time:26834.0min: [2024-01-04 06:55:28,093][model8_pretrain.py][INFO] Epoch:[0/2](352900/4588595) loss:3.268 lr:0.0000100 epoch_Time:26834.0min: [2024-01-04 06:55:28,094][model8_pretrain.py][INFO] Epoch:[0/2](352900/4588595) loss:2.978 lr:0.0000100 epoch_Time:26834.0min: [2024-01-04 06:55:28,094][model8_pretrain.py][INFO] Epoch:[0/2](352900/4588595) loss:2.934 lr:0.0000100 epoch_Time:26834.0min: [2024-01-04 06:55:28,094][model8_pretrain.py][INFO] Epoch:[0/2](352900/4588595) loss:2.948 lr:0.0000100 epoch_Time:26834.0min: [2024-01-04 06:55:28,094][model8_pretrain.py][INFO] Epoch:[0/2](352900/4588595) loss:3.191 lr:0.0000100 epoch_Time:26834.0min: [2024-01-04 06:55:28,094][model8_pretrain.py][INFO] Epoch:[0/2](352900/4588595) loss:2.773 lr:0.0000100 epoch_Time:26834.0min: [2024-01-04 06:55:28,094][model8_pretrain.py][INFO] Epoch:[0/2](352900/4588595) loss:2.574 lr:0.0000100 epoch_Time:26834.0min: [2024-01-04 
06:56:05,049][model8_pretrain.py][INFO] Epoch:[0/2](353000/4588595) loss:3.250 lr:0.0000100 epoch_Time:26833.0min: [2024-01-04 06:56:05,049][model8_pretrain.py][INFO] Epoch:[0/2](353000/4588595) loss:2.969 lr:0.0000100 epoch_Time:26833.0min: [2024-01-04 06:56:05,049][model8_pretrain.py][INFO] Epoch:[0/2](353000/4588595) loss:3.009 lr:0.0000100 epoch_Time:26833.0min: [2024-01-04 06:56:05,049][model8_pretrain.py][INFO] Epoch:[0/2](353000/4588595) loss:2.602 lr:0.0000100 epoch_Time:26833.0min: [2024-01-04 06:56:05,049][model8_pretrain.py][INFO] Epoch:[0/2](353000/4588595) loss:2.992 lr:0.0000100 epoch_Time:26833.0min: [2024-01-04 06:56:05,049][model8_pretrain.py][INFO] Epoch:[0/2](353000/4588595) loss:2.994 lr:0.0000100 epoch_Time:26833.0min: [2024-01-04 06:56:05,049][model8_pretrain.py][INFO] Epoch:[0/2](353000/4588595) loss:2.837 lr:0.0000100 epoch_Time:26833.0min: [2024-01-04 06:56:05,049][model8_pretrain.py][INFO] Epoch:[0/2](353000/4588595) loss:2.967 lr:0.0000100 epoch_Time:26833.0min: [2024-01-04 06:56:41,994][model8_pretrain.py][INFO] Epoch:[0/2](353100/4588595) loss:3.315 lr:0.0000100 epoch_Time:26832.0min: [2024-01-04 06:56:41,994][model8_pretrain.py][INFO] Epoch:[0/2](353100/4588595) loss:2.892 lr:0.0000100 epoch_Time:26832.0min: [2024-01-04 06:56:41,994][model8_pretrain.py][INFO] Epoch:[0/2](353100/4588595) loss:2.813 lr:0.0000100 epoch_Time:26832.0min: [2024-01-04 06:56:41,994][model8_pretrain.py][INFO] Epoch:[0/2](353100/4588595) loss:2.978 lr:0.0000100 epoch_Time:26832.0min: [2024-01-04 06:56:41,994][model8_pretrain.py][INFO] Epoch:[0/2](353100/4588595) loss:2.672 lr:0.0000100 epoch_Time:26832.0min: [2024-01-04 06:56:41,994][model8_pretrain.py][INFO] Epoch:[0/2](353100/4588595) loss:2.746 lr:0.0000100 epoch_Time:26832.0min: [2024-01-04 06:56:41,994][model8_pretrain.py][INFO] Epoch:[0/2](353100/4588595) loss:3.034 lr:0.0000100 epoch_Time:26832.0min: [2024-01-04 06:56:41,994][model8_pretrain.py][INFO] Epoch:[0/2](353100/4588595) loss:2.599 lr:0.0000100 
epoch_Time:26832.0min: [2024-01-04 06:57:18,909][model8_pretrain.py][INFO] Epoch:[0/2](353200/4588595) loss:3.011 lr:0.0000100 epoch_Time:26831.0min: [2024-01-04 06:57:18,909][model8_pretrain.py][INFO] Epoch:[0/2](353200/4588595) loss:2.870 lr:0.0000100 epoch_Time:26831.0min: [2024-01-04 06:57:18,909][model8_pretrain.py][INFO] Epoch:[0/2](353200/4588595) loss:3.161 lr:0.0000100 epoch_Time:26831.0min: [2024-01-04 06:57:18,909][model8_pretrain.py][INFO] Epoch:[0/2](353200/4588595) loss:3.237 lr:0.0000100 epoch_Time:26831.0min: [2024-01-04 06:57:18,909][model8_pretrain.py][INFO] Epoch:[0/2](353200/4588595) loss:2.493 lr:0.0000100 epoch_Time:26831.0min: [2024-01-04 06:57:18,909][model8_pretrain.py][INFO] Epoch:[0/2](353200/4588595) loss:3.590 lr:0.0000100 epoch_Time:26831.0min: [2024-01-04 06:57:18,909][model8_pretrain.py][INFO] Epoch:[0/2](353200/4588595) loss:2.712 lr:0.0000100 epoch_Time:26831.0min: [2024-01-04 06:57:18,909][model8_pretrain.py][INFO] Epoch:[0/2](353200/4588595) loss:2.499 lr:0.0000100 epoch_Time:26831.0min: [2024-01-04 06:57:57,610][model8_pretrain.py][INFO] Epoch:[0/2](353300/4588595) loss:2.802 lr:0.0000100 epoch_Time:26830.0min: [2024-01-04 06:57:57,610][model8_pretrain.py][INFO] Epoch:[0/2](353300/4588595) loss:3.021 lr:0.0000100 epoch_Time:26830.0min: [2024-01-04 06:57:57,610][model8_pretrain.py][INFO] Epoch:[0/2](353300/4588595) loss:2.953 lr:0.0000100 epoch_Time:26830.0min: [2024-01-04 06:57:57,610][model8_pretrain.py][INFO] Epoch:[0/2](353300/4588595) loss:2.754 lr:0.0000100 epoch_Time:26830.0min: [2024-01-04 06:57:57,610][model8_pretrain.py][INFO] Epoch:[0/2](353300/4588595) loss:2.853 lr:0.0000100 epoch_Time:26830.0min: [2024-01-04 06:57:57,615][model8_pretrain.py][INFO] Epoch:[0/2](353300/4588595) loss:2.891 lr:0.0000100 epoch_Time:26830.0min: [2024-01-04 06:57:57,615][model8_pretrain.py][INFO] Epoch:[0/2](353300/4588595) loss:2.907 lr:0.0000100 epoch_Time:26830.0min: [2024-01-04 06:57:57,615][model8_pretrain.py][INFO] 
Epoch:[0/2](353300/4588595) loss:2.892 lr:0.0000100 epoch_Time:26830.0min: [2024-01-04 06:58:42,950][model8_pretrain.py][INFO] Epoch:[0/2](353400/4588595) loss:2.564 lr:0.0000100 epoch_Time:26832.0min: [2024-01-04 06:58:42,950][model8_pretrain.py][INFO] Epoch:[0/2](353400/4588595) loss:2.683 lr:0.0000100 epoch_Time:26832.0min: [2024-01-04 06:58:42,950][model8_pretrain.py][INFO] Epoch:[0/2](353400/4588595) loss:3.154 lr:0.0000100 epoch_Time:26832.0min: [2024-01-04 06:58:42,950][model8_pretrain.py][INFO] Epoch:[0/2](353400/4588595) loss:2.511 lr:0.0000100 epoch_Time:26832.0min: [2024-01-04 06:58:42,950][model8_pretrain.py][INFO] Epoch:[0/2](353400/4588595) loss:2.754 lr:0.0000100 epoch_Time:26832.0min: [2024-01-04 06:58:42,950][model8_pretrain.py][INFO] Epoch:[0/2](353400/4588595) loss:2.663 lr:0.0000100 epoch_Time:26832.0min: [2024-01-04 06:58:42,950][model8_pretrain.py][INFO] Epoch:[0/2](353400/4588595) loss:3.034 lr:0.0000100 epoch_Time:26832.0min: [2024-01-04 06:58:42,950][model8_pretrain.py][INFO] Epoch:[0/2](353400/4588595) loss:2.723 lr:0.0000100 epoch_Time:26832.0min: [2024-01-04 06:59:19,888][model8_pretrain.py][INFO] Epoch:[0/2](353500/4588595) loss:3.084 lr:0.0000100 epoch_Time:26831.0min: [2024-01-04 06:59:19,888][model8_pretrain.py][INFO] Epoch:[0/2](353500/4588595) loss:2.898 lr:0.0000100 epoch_Time:26831.0min: [2024-01-04 06:59:19,888][model8_pretrain.py][INFO] Epoch:[0/2](353500/4588595) loss:3.009 lr:0.0000100 epoch_Time:26831.0min: [2024-01-04 06:59:19,888][model8_pretrain.py][INFO] Epoch:[0/2](353500/4588595) loss:2.821 lr:0.0000100 epoch_Time:26831.0min: [2024-01-04 06:59:19,888][model8_pretrain.py][INFO] Epoch:[0/2](353500/4588595) loss:2.634 lr:0.0000100 epoch_Time:26831.0min: [2024-01-04 06:59:19,888][model8_pretrain.py][INFO] Epoch:[0/2](353500/4588595) loss:3.289 lr:0.0000100 epoch_Time:26831.0min: [2024-01-04 06:59:19,888][model8_pretrain.py][INFO] Epoch:[0/2](353500/4588595) loss:3.300 lr:0.0000100 epoch_Time:26831.0min: [2024-01-04 
06:59:19,889][model8_pretrain.py][INFO] Epoch:[0/2](353500/4588595) loss:3.094 lr:0.0000100 epoch_Time:26831.0min: [2024-01-04 06:59:56,823][model8_pretrain.py][INFO] Epoch:[0/2](353600/4588595) loss:2.559 lr:0.0000100 epoch_Time:26829.0min: [2024-01-04 06:59:56,823][model8_pretrain.py][INFO] Epoch:[0/2](353600/4588595) loss:2.374 lr:0.0000100 epoch_Time:26829.0min: [2024-01-04 06:59:56,823][model8_pretrain.py][INFO] Epoch:[0/2](353600/4588595) loss:2.576 lr:0.0000100 epoch_Time:26829.0min: [2024-01-04 06:59:56,823][model8_pretrain.py][INFO] Epoch:[0/2](353600/4588595) loss:3.054 lr:0.0000100 epoch_Time:26829.0min: [2024-01-04 06:59:56,823][model8_pretrain.py][INFO] Epoch:[0/2](353600/4588595) loss:2.748 lr:0.0000100 epoch_Time:26829.0min: [2024-01-04 06:59:56,823][model8_pretrain.py][INFO] Epoch:[0/2](353600/4588595) loss:3.006 lr:0.0000100 epoch_Time:26829.0min: [2024-01-04 06:59:56,823][model8_pretrain.py][INFO] Epoch:[0/2](353600/4588595) loss:3.512 lr:0.0000100 epoch_Time:26829.0min: [2024-01-04 06:59:56,823][model8_pretrain.py][INFO] Epoch:[0/2](353600/4588595) loss:3.267 lr:0.0000100 epoch_Time:26829.0min: [2024-01-04 07:00:33,731][model8_pretrain.py][INFO] Epoch:[0/2](353700/4588595) loss:2.716 lr:0.0000100 epoch_Time:26829.0min: [2024-01-04 07:00:33,731][model8_pretrain.py][INFO] Epoch:[0/2](353700/4588595) loss:2.995 lr:0.0000100 epoch_Time:26829.0min: [2024-01-04 07:00:33,731][model8_pretrain.py][INFO] Epoch:[0/2](353700/4588595) loss:2.831 lr:0.0000100 epoch_Time:26829.0min: [2024-01-04 07:00:33,731][model8_pretrain.py][INFO] Epoch:[0/2](353700/4588595) loss:3.158 lr:0.0000100 epoch_Time:26829.0min: [2024-01-04 07:00:33,731][model8_pretrain.py][INFO] Epoch:[0/2](353700/4588595) loss:2.976 lr:0.0000100 epoch_Time:26829.0min: [2024-01-04 07:00:33,731][model8_pretrain.py][INFO] Epoch:[0/2](353700/4588595) loss:2.314 lr:0.0000100 epoch_Time:26829.0min: [2024-01-04 07:00:33,731][model8_pretrain.py][INFO] Epoch:[0/2](353700/4588595) loss:2.747 lr:0.0000100 
epoch_Time:26829.0min: [2024-01-04 07:00:33,732][model8_pretrain.py][INFO] Epoch:[0/2](353700/4588595) loss:2.949 lr:0.0000100 epoch_Time:26829.0min: [2024-01-04 07:01:10,675][model8_pretrain.py][INFO] Epoch:[0/2](353800/4588595) loss:2.579 lr:0.0000100 epoch_Time:26828.0min: [2024-01-04 07:01:10,675][model8_pretrain.py][INFO] Epoch:[0/2](353800/4588595) loss:3.273 lr:0.0000100 epoch_Time:26828.0min: [2024-01-04 07:01:10,675][model8_pretrain.py][INFO] Epoch:[0/2](353800/4588595) loss:2.675 lr:0.0000100 epoch_Time:26828.0min: [2024-01-04 07:01:10,675][model8_pretrain.py][INFO] Epoch:[0/2](353800/4588595) loss:3.009 lr:0.0000100 epoch_Time:26828.0min: [2024-01-04 07:01:10,675][model8_pretrain.py][INFO] Epoch:[0/2](353800/4588595) loss:2.891 lr:0.0000100 epoch_Time:26828.0min: [2024-01-04 07:01:10,675][model8_pretrain.py][INFO] Epoch:[0/2](353800/4588595) loss:3.491 lr:0.0000100 epoch_Time:26828.0min: [2024-01-04 07:01:10,675][model8_pretrain.py][INFO] Epoch:[0/2](353800/4588595) loss:2.893 lr:0.0000100 epoch_Time:26828.0min: [2024-01-04 07:01:10,675][model8_pretrain.py][INFO] Epoch:[0/2](353800/4588595) loss:3.025 lr:0.0000100 epoch_Time:26828.0min: [2024-01-04 07:01:47,612][model8_pretrain.py][INFO] Epoch:[0/2](353900/4588595) loss:3.058 lr:0.0000100 epoch_Time:26827.0min: [2024-01-04 07:01:47,612][model8_pretrain.py][INFO] Epoch:[0/2](353900/4588595) loss:3.039 lr:0.0000100 epoch_Time:26828.0min: [2024-01-04 07:01:47,612][model8_pretrain.py][INFO] Epoch:[0/2](353900/4588595) loss:3.187 lr:0.0000100 epoch_Time:26827.0min: [2024-01-04 07:01:47,612][model8_pretrain.py][INFO] Epoch:[0/2](353900/4588595) loss:2.512 lr:0.0000100 epoch_Time:26827.0min: [2024-01-04 07:01:47,612][model8_pretrain.py][INFO] Epoch:[0/2](353900/4588595) loss:3.008 lr:0.0000100 epoch_Time:26827.0min: [2024-01-04 07:01:47,612][model8_pretrain.py][INFO] Epoch:[0/2](353900/4588595) loss:3.457 lr:0.0000100 epoch_Time:26828.0min: [2024-01-04 07:01:47,612][model8_pretrain.py][INFO] 
Epoch:[0/2](353900/4588595) loss:2.410 lr:0.0000100 epoch_Time:26827.0min: [2024-01-04 07:01:47,612][model8_pretrain.py][INFO] Epoch:[0/2](353900/4588595) loss:1.913 lr:0.0000100 epoch_Time:26827.0min: [2024-01-04 07:02:24,560][model8_pretrain.py][INFO] Epoch:[0/2](354000/4588595) loss:2.542 lr:0.0000100 epoch_Time:26827.0min: [2024-01-04 07:02:24,560][model8_pretrain.py][INFO] Epoch:[0/2](354000/4588595) loss:2.335 lr:0.0000100 epoch_Time:26827.0min: [2024-01-04 07:02:24,560][model8_pretrain.py][INFO] Epoch:[0/2](354000/4588595) loss:2.311 lr:0.0000100 epoch_Time:26827.0min: [2024-01-04 07:02:24,560][model8_pretrain.py][INFO] Epoch:[0/2](354000/4588595) loss:3.298 lr:0.0000100 epoch_Time:26827.0min: [2024-01-04 07:02:24,560][model8_pretrain.py][INFO] Epoch:[0/2](354000/4588595) loss:2.770 lr:0.0000100 epoch_Time:26827.0min: [2024-01-04 07:02:24,560][model8_pretrain.py][INFO] Epoch:[0/2](354000/4588595) loss:2.827 lr:0.0000100 epoch_Time:26827.0min: [2024-01-04 07:02:24,560][model8_pretrain.py][INFO] Epoch:[0/2](354000/4588595) loss:2.764 lr:0.0000100 epoch_Time:26827.0min: [2024-01-04 07:02:24,560][model8_pretrain.py][INFO] Epoch:[0/2](354000/4588595) loss:3.309 lr:0.0000100 epoch_Time:26827.0min: [2024-01-04 07:03:01,502][model8_pretrain.py][INFO] Epoch:[0/2](354100/4588595) loss:3.276 lr:0.0000100 epoch_Time:26825.0min: [2024-01-04 07:03:01,502][model8_pretrain.py][INFO] Epoch:[0/2](354100/4588595) loss:2.445 lr:0.0000100 epoch_Time:26825.0min: [2024-01-04 07:03:01,502][model8_pretrain.py][INFO] Epoch:[0/2](354100/4588595) loss:3.018 lr:0.0000100 epoch_Time:26825.0min: [2024-01-04 07:03:01,502][model8_pretrain.py][INFO] Epoch:[0/2](354100/4588595) loss:2.803 lr:0.0000100 epoch_Time:26825.0min: [2024-01-04 07:03:01,502][model8_pretrain.py][INFO] Epoch:[0/2](354100/4588595) loss:3.090 lr:0.0000100 epoch_Time:26825.0min: [2024-01-04 07:03:01,502][model8_pretrain.py][INFO] Epoch:[0/2](354100/4588595) loss:2.046 lr:0.0000100 epoch_Time:26825.0min: [2024-01-04 
07:03:01,503][model8_pretrain.py][INFO] Epoch:[0/2](354100/4588595) loss:2.804 lr:0.0000100 epoch_Time:26825.0min: [2024-01-04 07:03:01,503][model8_pretrain.py][INFO] Epoch:[0/2](354100/4588595) loss:3.159 lr:0.0000100 epoch_Time:26825.0min: [2024-01-04 07:03:48,544][model8_pretrain.py][INFO] Epoch:[0/2](354200/4588595) loss:2.598 lr:0.0000100 epoch_Time:26826.0min: [2024-01-04 07:03:48,544][model8_pretrain.py][INFO] Epoch:[0/2](354200/4588595) loss:3.092 lr:0.0000100 epoch_Time:26826.0min: [2024-01-04 07:03:48,544][model8_pretrain.py][INFO] Epoch:[0/2](354200/4588595) loss:3.114 lr:0.0000100 epoch_Time:26826.0min: [2024-01-04 07:03:48,544][model8_pretrain.py][INFO] Epoch:[0/2](354200/4588595) loss:2.644 lr:0.0000100 epoch_Time:26826.0min: [2024-01-04 07:03:48,544][model8_pretrain.py][INFO] Epoch:[0/2](354200/4588595) loss:2.790 lr:0.0000100 epoch_Time:26826.0min: [2024-01-04 07:03:48,544][model8_pretrain.py][INFO] Epoch:[0/2](354200/4588595) loss:2.796 lr:0.0000100 epoch_Time:26826.0min: [2024-01-04 07:03:48,544][model8_pretrain.py][INFO] Epoch:[0/2](354200/4588595) loss:2.719 lr:0.0000100 epoch_Time:26826.0min: [2024-01-04 07:03:48,544][model8_pretrain.py][INFO] Epoch:[0/2](354200/4588595) loss:3.009 lr:0.0000100 epoch_Time:26826.0min: [2024-01-04 07:04:25,482][model8_pretrain.py][INFO] Epoch:[0/2](354300/4588595) loss:2.282 lr:0.0000100 epoch_Time:26826.0min: [2024-01-04 07:04:25,482][model8_pretrain.py][INFO] Epoch:[0/2](354300/4588595) loss:3.504 lr:0.0000100 epoch_Time:26826.0min: [2024-01-04 07:04:25,482][model8_pretrain.py][INFO] Epoch:[0/2](354300/4588595) loss:3.215 lr:0.0000100 epoch_Time:26826.0min: [2024-01-04 07:04:25,482][model8_pretrain.py][INFO] Epoch:[0/2](354300/4588595) loss:3.236 lr:0.0000100 epoch_Time:26826.0min: [2024-01-04 07:04:25,482][model8_pretrain.py][INFO] Epoch:[0/2](354300/4588595) loss:3.153 lr:0.0000100 epoch_Time:26826.0min: [2024-01-04 07:04:25,482][model8_pretrain.py][INFO] Epoch:[0/2](354300/4588595) loss:2.741 lr:0.0000100 
epoch_Time:26826.0min: [2024-01-04 07:04:25,482][model8_pretrain.py][INFO] Epoch:[0/2](354300/4588595) loss:2.930 lr:0.0000100 epoch_Time:26826.0min: [2024-01-04 07:04:25,483][model8_pretrain.py][INFO] Epoch:[0/2](354300/4588595) loss:2.927 lr:0.0000100 epoch_Time:26826.0min: [2024-01-04 07:05:02,423][model8_pretrain.py][INFO] Epoch:[0/2](354400/4588595) loss:2.594 lr:0.0000100 epoch_Time:26825.0min: [2024-01-04 07:05:02,423][model8_pretrain.py][INFO] Epoch:[0/2](354400/4588595) loss:2.984 lr:0.0000100 epoch_Time:26825.0min: [2024-01-04 07:05:02,423][model8_pretrain.py][INFO] Epoch:[0/2](354400/4588595) loss:3.167 lr:0.0000100 epoch_Time:26825.0min: [2024-01-04 07:05:02,423][model8_pretrain.py][INFO] Epoch:[0/2](354400/4588595) loss:2.700 lr:0.0000100 epoch_Time:26825.0min: [2024-01-04 07:05:02,423][model8_pretrain.py][INFO] Epoch:[0/2](354400/4588595) loss:2.350 lr:0.0000100 epoch_Time:26825.0min: [2024-01-04 07:05:02,423][model8_pretrain.py][INFO] Epoch:[0/2](354400/4588595) loss:3.143 lr:0.0000100 epoch_Time:26825.0min: [2024-01-04 07:05:02,423][model8_pretrain.py][INFO] Epoch:[0/2](354400/4588595) loss:2.956 lr:0.0000100 epoch_Time:26825.0min: [2024-01-04 07:05:02,423][model8_pretrain.py][INFO] Epoch:[0/2](354400/4588595) loss:2.748 lr:0.0000100 epoch_Time:26825.0min: [2024-01-04 07:05:39,360][model8_pretrain.py][INFO] Epoch:[0/2](354500/4588595) loss:2.807 lr:0.0000100 epoch_Time:26825.0min: [2024-01-04 07:05:39,360][model8_pretrain.py][INFO] Epoch:[0/2](354500/4588595) loss:2.797 lr:0.0000100 epoch_Time:26825.0min: [2024-01-04 07:05:39,360][model8_pretrain.py][INFO] Epoch:[0/2](354500/4588595) loss:2.639 lr:0.0000100 epoch_Time:26825.0min: [2024-01-04 07:05:39,360][model8_pretrain.py][INFO] Epoch:[0/2](354500/4588595) loss:3.124 lr:0.0000100 epoch_Time:26825.0min: [2024-01-04 07:05:39,360][model8_pretrain.py][INFO] Epoch:[0/2](354500/4588595) loss:2.445 lr:0.0000100 epoch_Time:26825.0min: [2024-01-04 07:05:39,360][model8_pretrain.py][INFO] 
Epoch:[0/2](354500/4588595) loss:2.833 lr:0.0000100 epoch_Time:26825.0min: [2024-01-04 07:05:39,360][model8_pretrain.py][INFO] Epoch:[0/2](354500/4588595) loss:2.316 lr:0.0000100 epoch_Time:26825.0min: [2024-01-04 07:05:39,360][model8_pretrain.py][INFO] Epoch:[0/2](354500/4588595) loss:2.610 lr:0.0000100 epoch_Time:26825.0min: [2024-01-04 07:06:16,295][model8_pretrain.py][INFO] Epoch:[0/2](354600/4588595) loss:2.607 lr:0.0000100 epoch_Time:26823.0min: [2024-01-04 07:06:16,295][model8_pretrain.py][INFO] Epoch:[0/2](354600/4588595) loss:3.127 lr:0.0000100 epoch_Time:26823.0min: [2024-01-04 07:06:16,295][model8_pretrain.py][INFO] Epoch:[0/2](354600/4588595) loss:3.053 lr:0.0000100 epoch_Time:26823.0min: [2024-01-04 07:06:16,295][model8_pretrain.py][INFO] Epoch:[0/2](354600/4588595) loss:2.387 lr:0.0000100 epoch_Time:26823.0min: [2024-01-04 07:06:16,295][model8_pretrain.py][INFO] Epoch:[0/2](354600/4588595) loss:3.128 lr:0.0000100 epoch_Time:26823.0min: [2024-01-04 07:06:16,295][model8_pretrain.py][INFO] Epoch:[0/2](354600/4588595) loss:2.637 lr:0.0000100 epoch_Time:26823.0min: [2024-01-04 07:06:16,295][model8_pretrain.py][INFO] Epoch:[0/2](354600/4588595) loss:2.472 lr:0.0000100 epoch_Time:26823.0min: [2024-01-04 07:06:16,296][model8_pretrain.py][INFO] Epoch:[0/2](354600/4588595) loss:3.196 lr:0.0000100 epoch_Time:26823.0min: [2024-01-04 07:06:53,252][model8_pretrain.py][INFO] Epoch:[0/2](354700/4588595) loss:2.969 lr:0.0000100 epoch_Time:26822.0min: [2024-01-04 07:06:53,252][model8_pretrain.py][INFO] Epoch:[0/2](354700/4588595) loss:2.818 lr:0.0000100 epoch_Time:26822.0min: [2024-01-04 07:06:53,252][model8_pretrain.py][INFO] Epoch:[0/2](354700/4588595) loss:2.647 lr:0.0000100 epoch_Time:26822.0min: [2024-01-04 07:06:53,252][model8_pretrain.py][INFO] Epoch:[0/2](354700/4588595) loss:3.147 lr:0.0000100 epoch_Time:26822.0min: [2024-01-04 07:06:53,252][model8_pretrain.py][INFO] Epoch:[0/2](354700/4588595) loss:3.228 lr:0.0000100 epoch_Time:26822.0min: [2024-01-04 
07:06:53,252][model8_pretrain.py][INFO] Epoch:[0/2](354700/4588595) loss:2.784 lr:0.0000100 epoch_Time:26822.0min: [2024-01-04 07:06:53,252][model8_pretrain.py][INFO] Epoch:[0/2](354700/4588595) loss:3.350 lr:0.0000100 epoch_Time:26822.0min: [2024-01-04 07:06:53,252][model8_pretrain.py][INFO] Epoch:[0/2](354700/4588595) loss:2.586 lr:0.0000100 epoch_Time:26822.0min: [2024-01-04 07:07:30,195][model8_pretrain.py][INFO] Epoch:[0/2](354800/4588595) loss:2.330 lr:0.0000100 epoch_Time:26822.0min: [2024-01-04 07:07:30,195][model8_pretrain.py][INFO] Epoch:[0/2](354800/4588595) loss:2.845 lr:0.0000100 epoch_Time:26822.0min: [2024-01-04 07:07:30,195][model8_pretrain.py][INFO] Epoch:[0/2](354800/4588595) loss:2.359 lr:0.0000100 epoch_Time:26822.0min: [2024-01-04 07:07:30,195][model8_pretrain.py][INFO] Epoch:[0/2](354800/4588595) loss:3.536 lr:0.0000100 epoch_Time:26822.0min: [2024-01-04 07:07:30,195][model8_pretrain.py][INFO] Epoch:[0/2](354800/4588595) loss:3.024 lr:0.0000100 epoch_Time:26822.0min: [2024-01-04 07:07:30,195][model8_pretrain.py][INFO] Epoch:[0/2](354800/4588595) loss:2.805 lr:0.0000100 epoch_Time:26822.0min: [2024-01-04 07:07:30,195][model8_pretrain.py][INFO] Epoch:[0/2](354800/4588595) loss:3.083 lr:0.0000100 epoch_Time:26822.0min: [2024-01-04 07:07:30,195][model8_pretrain.py][INFO] Epoch:[0/2](354800/4588595) loss:2.695 lr:0.0000100 epoch_Time:26822.0min: [2024-01-04 07:08:07,133][model8_pretrain.py][INFO] Epoch:[0/2](354900/4588595) loss:2.099 lr:0.0000100 epoch_Time:26821.0min: [2024-01-04 07:08:07,133][model8_pretrain.py][INFO] Epoch:[0/2](354900/4588595) loss:2.707 lr:0.0000100 epoch_Time:26821.0min: [2024-01-04 07:08:07,133][model8_pretrain.py][INFO] Epoch:[0/2](354900/4588595) loss:2.587 lr:0.0000100 epoch_Time:26821.0min: [2024-01-04 07:08:07,133][model8_pretrain.py][INFO] Epoch:[0/2](354900/4588595) loss:2.870 lr:0.0000100 epoch_Time:26821.0min: [2024-01-04 07:08:07,133][model8_pretrain.py][INFO] Epoch:[0/2](354900/4588595) loss:3.051 lr:0.0000100 
epoch_Time:26821.0min: [2024-01-04 07:08:07,133][model8_pretrain.py][INFO] Epoch:[0/2](354900/4588595) loss:2.901 lr:0.0000100 epoch_Time:26821.0min: [2024-01-04 07:08:07,133][model8_pretrain.py][INFO] Epoch:[0/2](354900/4588595) loss:3.276 lr:0.0000100 epoch_Time:26821.0min: [2024-01-04 07:08:07,133][model8_pretrain.py][INFO] Epoch:[0/2](354900/4588595) loss:2.905 lr:0.0000100 epoch_Time:26821.0min: [2024-01-04 07:08:54,239][model8_pretrain.py][INFO] Epoch:[0/2](355000/4588595) loss:2.482 lr:0.0000100 epoch_Time:26822.0min: [2024-01-04 07:08:54,239][model8_pretrain.py][INFO] Epoch:[0/2](355000/4588595) loss:3.384 lr:0.0000100 epoch_Time:26822.0min: [2024-01-04 07:08:54,239][model8_pretrain.py][INFO] Epoch:[0/2](355000/4588595) loss:2.626 lr:0.0000100 epoch_Time:26822.0min: [2024-01-04 07:08:54,239][model8_pretrain.py][INFO] Epoch:[0/2](355000/4588595) loss:3.078 lr:0.0000100 epoch_Time:26822.0min: [2024-01-04 07:08:54,239][model8_pretrain.py][INFO] Epoch:[0/2](355000/4588595) loss:2.422 lr:0.0000100 epoch_Time:26822.0min: [2024-01-04 07:08:54,239][model8_pretrain.py][INFO] Epoch:[0/2](355000/4588595) loss:2.632 lr:0.0000100 epoch_Time:26822.0min: [2024-01-04 07:08:54,239][model8_pretrain.py][INFO] Epoch:[0/2](355000/4588595) loss:2.230 lr:0.0000100 epoch_Time:26822.0min: [2024-01-04 07:08:54,239][model8_pretrain.py][INFO] Epoch:[0/2](355000/4588595) loss:2.640 lr:0.0000100 epoch_Time:26822.0min: [2024-01-04 07:09:31,174][model8_pretrain.py][INFO] Epoch:[0/2](355100/4588595) loss:3.162 lr:0.0000100 epoch_Time:26821.0min: [2024-01-04 07:09:31,174][model8_pretrain.py][INFO] Epoch:[0/2](355100/4588595) loss:2.969 lr:0.0000100 epoch_Time:26821.0min: [2024-01-04 07:09:31,174][model8_pretrain.py][INFO] Epoch:[0/2](355100/4588595) loss:3.138 lr:0.0000100 epoch_Time:26821.0min: [2024-01-04 07:09:31,174][model8_pretrain.py][INFO] Epoch:[0/2](355100/4588595) loss:2.367 lr:0.0000100 epoch_Time:26821.0min: [2024-01-04 07:09:31,174][model8_pretrain.py][INFO] 
Epoch:[0/2](355100/4588595) loss:2.905 lr:0.0000100 epoch_Time:26821.0min: [2024-01-04 07:09:31,174][model8_pretrain.py][INFO] Epoch:[0/2](355100/4588595) loss:2.956 lr:0.0000100 epoch_Time:26821.0min: [2024-01-04 07:09:31,174][model8_pretrain.py][INFO] Epoch:[0/2](355100/4588595) loss:2.853 lr:0.0000100 epoch_Time:26821.0min: [2024-01-04 07:09:31,174][model8_pretrain.py][INFO] Epoch:[0/2](355100/4588595) loss:2.517 lr:0.0000100 epoch_Time:26821.0min: [2024-01-04 07:10:08,113][model8_pretrain.py][INFO] Epoch:[0/2](355200/4588595) loss:3.136 lr:0.0000100 epoch_Time:26820.0min: [2024-01-04 07:10:08,113][model8_pretrain.py][INFO] Epoch:[0/2](355200/4588595) loss:2.573 lr:0.0000100 epoch_Time:26820.0min: [2024-01-04 07:10:08,113][model8_pretrain.py][INFO] Epoch:[0/2](355200/4588595) loss:2.770 lr:0.0000100 epoch_Time:26820.0min: [2024-01-04 07:10:08,113][model8_pretrain.py][INFO] Epoch:[0/2](355200/4588595) loss:2.598 lr:0.0000100 epoch_Time:26820.0min: [2024-01-04 07:10:08,113][model8_pretrain.py][INFO] Epoch:[0/2](355200/4588595) loss:2.934 lr:0.0000100 epoch_Time:26820.0min: [2024-01-04 07:10:08,114][model8_pretrain.py][INFO] Epoch:[0/2](355200/4588595) loss:2.450 lr:0.0000100 epoch_Time:26820.0min: [2024-01-04 07:10:08,114][model8_pretrain.py][INFO] Epoch:[0/2](355200/4588595) loss:2.499 lr:0.0000100 epoch_Time:26820.0min: [2024-01-04 07:10:08,114][model8_pretrain.py][INFO] Epoch:[0/2](355200/4588595) loss:2.625 lr:0.0000100 epoch_Time:26820.0min: [2024-01-04 07:10:45,052][model8_pretrain.py][INFO] Epoch:[0/2](355300/4588595) loss:2.833 lr:0.0000100 epoch_Time:26820.0min: [2024-01-04 07:10:45,052][model8_pretrain.py][INFO] Epoch:[0/2](355300/4588595) loss:2.602 lr:0.0000100 epoch_Time:26820.0min: [2024-01-04 07:10:45,052][model8_pretrain.py][INFO] Epoch:[0/2](355300/4588595) loss:3.507 lr:0.0000100 epoch_Time:26820.0min: [2024-01-04 07:10:45,052][model8_pretrain.py][INFO] Epoch:[0/2](355300/4588595) loss:3.309 lr:0.0000100 epoch_Time:26820.0min: [2024-01-04 
07:10:45,053][model8_pretrain.py][INFO] Epoch:[0/2](355300/4588595) loss:3.054 lr:0.0000100 epoch_Time:26820.0min: [2024-01-04 07:10:45,053][model8_pretrain.py][INFO] Epoch:[0/2](355300/4588595) loss:2.941 lr:0.0000100 epoch_Time:26820.0min: [2024-01-04 07:10:45,053][model8_pretrain.py][INFO] Epoch:[0/2](355300/4588595) loss:3.033 lr:0.0000100 epoch_Time:26820.0min: [2024-01-04 07:10:45,053][model8_pretrain.py][INFO] Epoch:[0/2](355300/4588595) loss:2.784 lr:0.0000100 epoch_Time:26820.0min: [2024-01-04 07:11:21,998][model8_pretrain.py][INFO] Epoch:[0/2](355400/4588595) loss:3.142 lr:0.0000100 epoch_Time:26819.0min: [2024-01-04 07:11:21,998][model8_pretrain.py][INFO] Epoch:[0/2](355400/4588595) loss:3.308 lr:0.0000100 epoch_Time:26819.0min: [2024-01-04 07:11:21,998][model8_pretrain.py][INFO] Epoch:[0/2](355400/4588595) loss:3.115 lr:0.0000100 epoch_Time:26819.0min: [2024-01-04 07:11:21,998][model8_pretrain.py][INFO] Epoch:[0/2](355400/4588595) loss:3.317 lr:0.0000100 epoch_Time:26819.0min: [2024-01-04 07:11:21,998][model8_pretrain.py][INFO] Epoch:[0/2](355400/4588595) loss:2.869 lr:0.0000100 epoch_Time:26819.0min: [2024-01-04 07:11:21,998][model8_pretrain.py][INFO] Epoch:[0/2](355400/4588595) loss:2.988 lr:0.0000100 epoch_Time:26819.0min: [2024-01-04 07:11:21,998][model8_pretrain.py][INFO] Epoch:[0/2](355400/4588595) loss:2.945 lr:0.0000100 epoch_Time:26819.0min: [2024-01-04 07:11:21,998][model8_pretrain.py][INFO] Epoch:[0/2](355400/4588595) loss:3.095 lr:0.0000100 epoch_Time:26819.0min: [2024-01-04 07:11:58,934][model8_pretrain.py][INFO] Epoch:[0/2](355500/4588595) loss:2.799 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:11:58,934][model8_pretrain.py][INFO] Epoch:[0/2](355500/4588595) loss:2.960 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:11:58,934][model8_pretrain.py][INFO] Epoch:[0/2](355500/4588595) loss:2.705 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:11:58,934][model8_pretrain.py][INFO] Epoch:[0/2](355500/4588595) loss:3.256 lr:0.0000100 
epoch_Time:26817.0min: [2024-01-04 07:11:58,934][model8_pretrain.py][INFO] Epoch:[0/2](355500/4588595) loss:2.697 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:11:58,934][model8_pretrain.py][INFO] Epoch:[0/2](355500/4588595) loss:2.860 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:11:58,934][model8_pretrain.py][INFO] Epoch:[0/2](355500/4588595) loss:2.566 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:11:58,934][model8_pretrain.py][INFO] Epoch:[0/2](355500/4588595) loss:3.442 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:12:35,863][model8_pretrain.py][INFO] Epoch:[0/2](355600/4588595) loss:2.789 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:12:35,863][model8_pretrain.py][INFO] Epoch:[0/2](355600/4588595) loss:2.596 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:12:35,863][model8_pretrain.py][INFO] Epoch:[0/2](355600/4588595) loss:2.675 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:12:35,863][model8_pretrain.py][INFO] Epoch:[0/2](355600/4588595) loss:2.635 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:12:35,863][model8_pretrain.py][INFO] Epoch:[0/2](355600/4588595) loss:2.679 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:12:35,863][model8_pretrain.py][INFO] Epoch:[0/2](355600/4588595) loss:3.348 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:12:35,864][model8_pretrain.py][INFO] Epoch:[0/2](355600/4588595) loss:3.024 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:12:35,864][model8_pretrain.py][INFO] Epoch:[0/2](355600/4588595) loss:2.541 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:13:12,793][model8_pretrain.py][INFO] Epoch:[0/2](355700/4588595) loss:2.978 lr:0.0000100 epoch_Time:26816.0min: [2024-01-04 07:13:12,793][model8_pretrain.py][INFO] Epoch:[0/2](355700/4588595) loss:2.865 lr:0.0000100 epoch_Time:26816.0min: [2024-01-04 07:13:12,793][model8_pretrain.py][INFO] Epoch:[0/2](355700/4588595) loss:2.978 lr:0.0000100 epoch_Time:26816.0min: [2024-01-04 07:13:12,793][model8_pretrain.py][INFO] 
Epoch:[0/2](355700/4588595) loss:2.993 lr:0.0000100 epoch_Time:26816.0min: [2024-01-04 07:13:12,793][model8_pretrain.py][INFO] Epoch:[0/2](355700/4588595) loss:3.285 lr:0.0000100 epoch_Time:26816.0min: [2024-01-04 07:13:12,793][model8_pretrain.py][INFO] Epoch:[0/2](355700/4588595) loss:2.842 lr:0.0000100 epoch_Time:26816.0min: [2024-01-04 07:13:12,793][model8_pretrain.py][INFO] Epoch:[0/2](355700/4588595) loss:3.023 lr:0.0000100 epoch_Time:26816.0min: [2024-01-04 07:13:12,793][model8_pretrain.py][INFO] Epoch:[0/2](355700/4588595) loss:2.690 lr:0.0000100 epoch_Time:26816.0min: [2024-01-04 07:13:59,568][model8_pretrain.py][INFO] Epoch:[0/2](355800/4588595) loss:2.192 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:13:59,568][model8_pretrain.py][INFO] Epoch:[0/2](355800/4588595) loss:2.743 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:13:59,568][model8_pretrain.py][INFO] Epoch:[0/2](355800/4588595) loss:2.892 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:13:59,568][model8_pretrain.py][INFO] Epoch:[0/2](355800/4588595) loss:3.014 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:13:59,568][model8_pretrain.py][INFO] Epoch:[0/2](355800/4588595) loss:3.188 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:13:59,568][model8_pretrain.py][INFO] Epoch:[0/2](355800/4588595) loss:2.782 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:13:59,568][model8_pretrain.py][INFO] Epoch:[0/2](355800/4588595) loss:3.004 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:13:59,568][model8_pretrain.py][INFO] Epoch:[0/2](355800/4588595) loss:2.487 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:14:36,520][model8_pretrain.py][INFO] Epoch:[0/2](355900/4588595) loss:2.892 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:14:36,520][model8_pretrain.py][INFO] Epoch:[0/2](355900/4588595) loss:3.140 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:14:36,520][model8_pretrain.py][INFO] Epoch:[0/2](355900/4588595) loss:2.505 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 
07:14:36,520][model8_pretrain.py][INFO] Epoch:[0/2](355900/4588595) loss:2.184 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:14:36,520][model8_pretrain.py][INFO] Epoch:[0/2](355900/4588595) loss:3.062 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:14:36,520][model8_pretrain.py][INFO] Epoch:[0/2](355900/4588595) loss:2.900 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:14:36,520][model8_pretrain.py][INFO] Epoch:[0/2](355900/4588595) loss:2.469 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:14:36,521][model8_pretrain.py][INFO] Epoch:[0/2](355900/4588595) loss:2.968 lr:0.0000100 epoch_Time:26817.0min: [2024-01-04 07:15:13,458][model8_pretrain.py][INFO] Epoch:[0/2](356000/4588595) loss:3.024 lr:0.0000100 epoch_Time:26815.0min: [2024-01-04 07:15:13,458][model8_pretrain.py][INFO] Epoch:[0/2](356000/4588595) loss:3.187 lr:0.0000100 epoch_Time:26815.0min: [2024-01-04 07:15:13,458][model8_pretrain.py][INFO] Epoch:[0/2](356000/4588595) loss:3.110 lr:0.0000100 epoch_Time:26815.0min: [2024-01-04 07:15:13,458][model8_pretrain.py][INFO] Epoch:[0/2](356000/4588595) loss:3.079 lr:0.0000100 epoch_Time:26815.0min: [2024-01-04 07:15:13,458][model8_pretrain.py][INFO] Epoch:[0/2](356000/4588595) loss:2.773 lr:0.0000100 epoch_Time:26815.0min: [2024-01-04 07:15:13,458][model8_pretrain.py][INFO] Epoch:[0/2](356000/4588595) loss:2.967 lr:0.0000100 epoch_Time:26815.0min: [2024-01-04 07:15:13,458][model8_pretrain.py][INFO] Epoch:[0/2](356000/4588595) loss:3.159 lr:0.0000100 epoch_Time:26815.0min: [2024-01-04 07:15:13,458][model8_pretrain.py][INFO] Epoch:[0/2](356000/4588595) loss:3.526 lr:0.0000100 epoch_Time:26815.0min: [2024-01-04 07:15:50,395][model8_pretrain.py][INFO] Epoch:[0/2](356100/4588595) loss:2.819 lr:0.0000100 epoch_Time:26814.0min: [2024-01-04 07:15:50,396][model8_pretrain.py][INFO] Epoch:[0/2](356100/4588595) loss:3.118 lr:0.0000100 epoch_Time:26814.0min: [2024-01-04 07:15:50,396][model8_pretrain.py][INFO] Epoch:[0/2](356100/4588595) loss:2.252 lr:0.0000100 
epoch_Time:26814.0min: [2024-01-04 07:15:50,396][model8_pretrain.py][INFO] Epoch:[0/2](356100/4588595) loss:3.123 lr:0.0000100 epoch_Time:26814.0min: [2024-01-04 07:15:50,396][model8_pretrain.py][INFO] Epoch:[0/2](356100/4588595) loss:2.981 lr:0.0000100 epoch_Time:26814.0min: [2024-01-04 07:15:50,396][model8_pretrain.py][INFO] Epoch:[0/2](356100/4588595) loss:2.668 lr:0.0000100 epoch_Time:26814.0min: [2024-01-04 07:15:50,396][model8_pretrain.py][INFO] Epoch:[0/2](356100/4588595) loss:3.141 lr:0.0000100 epoch_Time:26814.0min: [2024-01-04 07:15:50,396][model8_pretrain.py][INFO] Epoch:[0/2](356100/4588595) loss:2.177 lr:0.0000100 epoch_Time:26814.0min: [2024-01-04 07:16:27,337][model8_pretrain.py][INFO] Epoch:[0/2](356200/4588595) loss:3.173 lr:0.0000100 epoch_Time:26814.0min: [2024-01-04 07:16:27,337][model8_pretrain.py][INFO] Epoch:[0/2](356200/4588595) loss:2.742 lr:0.0000100 epoch_Time:26814.0min: [2024-01-04 07:16:27,337][model8_pretrain.py][INFO] Epoch:[0/2](356200/4588595) loss:2.853 lr:0.0000100 epoch_Time:26814.0min: [2024-01-04 07:16:27,337][model8_pretrain.py][INFO] Epoch:[0/2](356200/4588595) loss:3.227 lr:0.0000100 epoch_Time:26814.0min: [2024-01-04 07:16:27,337][model8_pretrain.py][INFO] Epoch:[0/2](356200/4588595) loss:2.875 lr:0.0000100 epoch_Time:26814.0min: [2024-01-04 07:16:27,337][model8_pretrain.py][INFO] Epoch:[0/2](356200/4588595) loss:2.943 lr:0.0000100 epoch_Time:26814.0min: [2024-01-04 07:16:27,337][model8_pretrain.py][INFO] Epoch:[0/2](356200/4588595) loss:3.001 lr:0.0000100 epoch_Time:26814.0min: [2024-01-04 07:16:27,337][model8_pretrain.py][INFO] Epoch:[0/2](356200/4588595) loss:2.883 lr:0.0000100 epoch_Time:26814.0min: [2024-01-04 07:17:04,271][model8_pretrain.py][INFO] Epoch:[0/2](356300/4588595) loss:2.714 lr:0.0000100 epoch_Time:26813.0min: [2024-01-04 07:17:04,271][model8_pretrain.py][INFO] Epoch:[0/2](356300/4588595) loss:2.914 lr:0.0000100 epoch_Time:26813.0min: [2024-01-04 07:17:04,271][model8_pretrain.py][INFO] 
Epoch:[0/2](356300/4588595) loss:3.041 lr:0.0000100 epoch_Time:26813.0min: [2024-01-04 07:17:04,271][model8_pretrain.py][INFO] Epoch:[0/2](356300/4588595) loss:2.452 lr:0.0000100 epoch_Time:26813.0min: [2024-01-04 07:17:04,271][model8_pretrain.py][INFO] Epoch:[0/2](356300/4588595) loss:2.522 lr:0.0000100 epoch_Time:26813.0min: [2024-01-04 07:17:04,271][model8_pretrain.py][INFO] Epoch:[0/2](356300/4588595) loss:2.742 lr:0.0000100 epoch_Time:26813.0min: [2024-01-04 07:17:04,272][model8_pretrain.py][INFO] Epoch:[0/2](356300/4588595) loss:2.885 lr:0.0000100 epoch_Time:26813.0min: [2024-01-04 07:17:04,272][model8_pretrain.py][INFO] Epoch:[0/2](356300/4588595) loss:3.322 lr:0.0000100 epoch_Time:26813.0min: [2024-01-04 07:17:41,206][model8_pretrain.py][INFO] Epoch:[0/2](356400/4588595) loss:3.243 lr:0.0000100 epoch_Time:26812.0min: [2024-01-04 07:17:41,206][model8_pretrain.py][INFO] Epoch:[0/2](356400/4588595) loss:2.937 lr:0.0000100 epoch_Time:26812.0min: [2024-01-04 07:17:41,206][model8_pretrain.py][INFO] Epoch:[0/2](356400/4588595) loss:2.982 lr:0.0000100 epoch_Time:26812.0min: [2024-01-04 07:17:41,206][model8_pretrain.py][INFO] Epoch:[0/2](356400/4588595) loss:2.936 lr:0.0000100 epoch_Time:26812.0min: [2024-01-04 07:17:41,206][model8_pretrain.py][INFO] Epoch:[0/2](356400/4588595) loss:2.901 lr:0.0000100 epoch_Time:26812.0min: [2024-01-04 07:17:41,206][model8_pretrain.py][INFO] Epoch:[0/2](356400/4588595) loss:2.693 lr:0.0000100 epoch_Time:26812.0min: [2024-01-04 07:17:41,206][model8_pretrain.py][INFO] Epoch:[0/2](356400/4588595) loss:2.718 lr:0.0000100 epoch_Time:26812.0min: [2024-01-04 07:17:41,206][model8_pretrain.py][INFO] Epoch:[0/2](356400/4588595) loss:2.810 lr:0.0000100 epoch_Time:26812.0min: [2024-01-04 07:18:18,150][model8_pretrain.py][INFO] Epoch:[0/2](356500/4588595) loss:2.807 lr:0.0000100 epoch_Time:26811.0min: [2024-01-04 07:18:18,150][model8_pretrain.py][INFO] Epoch:[0/2](356500/4588595) loss:2.307 lr:0.0000100 epoch_Time:26811.0min: [2024-01-04 
07:18:18,150][model8_pretrain.py][INFO] Epoch:[0/2](356500/4588595) loss:2.860 lr:0.0000100 epoch_Time:26811.0min: [2024-01-04 07:18:18,150][model8_pretrain.py][INFO] Epoch:[0/2](356500/4588595) loss:2.236 lr:0.0000100 epoch_Time:26811.0min: [2024-01-04 07:18:18,150][model8_pretrain.py][INFO] Epoch:[0/2](356500/4588595) loss:2.953 lr:0.0000100 epoch_Time:26811.0min: [2024-01-04 07:18:18,150][model8_pretrain.py][INFO] Epoch:[0/2](356500/4588595) loss:2.858 lr:0.0000100 epoch_Time:26811.0min: [2024-01-04 07:18:18,151][model8_pretrain.py][INFO] Epoch:[0/2](356500/4588595) loss:2.598 lr:0.0000100 epoch_Time:26811.0min: [2024-01-04 07:18:18,151][model8_pretrain.py][INFO] Epoch:[0/2](356500/4588595) loss:2.785 lr:0.0000100 epoch_Time:26811.0min: [2024-01-04 07:19:05,309][model8_pretrain.py][INFO] Epoch:[0/2](356600/4588595) loss:2.845 lr:0.0000100 epoch_Time:26812.0min: [2024-01-04 07:19:05,309][model8_pretrain.py][INFO] Epoch:[0/2](356600/4588595) loss:2.987 lr:0.0000100 epoch_Time:26812.0min: [2024-01-04 07:19:05,309][model8_pretrain.py][INFO] Epoch:[0/2](356600/4588595) loss:2.862 lr:0.0000100 epoch_Time:26812.0min: [2024-01-04 07:19:05,309][model8_pretrain.py][INFO] Epoch:[0/2](356600/4588595) loss:2.682 lr:0.0000100 epoch_Time:26812.0min: [2024-01-04 07:19:05,309][model8_pretrain.py][INFO] Epoch:[0/2](356600/4588595) loss:2.286 lr:0.0000100 epoch_Time:26812.0min: [2024-01-04 07:19:05,309][model8_pretrain.py][INFO] Epoch:[0/2](356600/4588595) loss:2.837 lr:0.0000100 epoch_Time:26812.0min: [2024-01-04 07:19:05,309][model8_pretrain.py][INFO] Epoch:[0/2](356600/4588595) loss:3.356 lr:0.0000100 epoch_Time:26812.0min: [2024-01-04 07:19:05,309][model8_pretrain.py][INFO] Epoch:[0/2](356600/4588595) loss:3.010 lr:0.0000100 epoch_Time:26812.0min: [2024-01-04 07:19:42,248][model8_pretrain.py][INFO] Epoch:[0/2](356700/4588595) loss:2.762 lr:0.0000100 epoch_Time:26812.0min: [2024-01-04 07:19:42,248][model8_pretrain.py][INFO] Epoch:[0/2](356700/4588595) loss:2.562 lr:0.0000100 
epoch_Time:26812.0min: [2024-01-04 07:19:42,248][model8_pretrain.py][INFO] Epoch:[0/2](356700/4588595) loss:2.876 lr:0.0000100 epoch_Time:26812.0min: [2024-01-04 07:19:42,248][model8_pretrain.py][INFO] Epoch:[0/2](356700/4588595) loss:3.217 lr:0.0000100 epoch_Time:26812.0min: [2024-01-04 07:19:42,248][model8_pretrain.py][INFO] Epoch:[0/2](356700/4588595) loss:2.983 lr:0.0000100 epoch_Time:26812.0min: [2024-01-04 07:19:42,248][model8_pretrain.py][INFO] Epoch:[0/2](356700/4588595) loss:3.123 lr:0.0000100 epoch_Time:26812.0min: [2024-01-04 07:19:42,248][model8_pretrain.py][INFO] Epoch:[0/2](356700/4588595) loss:3.092 lr:0.0000100 epoch_Time:26812.0min: [2024-01-04 07:19:42,248][model8_pretrain.py][INFO] Epoch:[0/2](356700/4588595) loss:3.211 lr:0.0000100 epoch_Time:26812.0min: [2024-01-04 07:20:19,194][model8_pretrain.py][INFO] Epoch:[0/2](356800/4588595) loss:2.825 lr:0.0000100 epoch_Time:26811.0min: [2024-01-04 07:20:19,194][model8_pretrain.py][INFO] Epoch:[0/2](356800/4588595) loss:2.903 lr:0.0000100 epoch_Time:26811.0min: [2024-01-04 07:20:19,194][model8_pretrain.py][INFO] Epoch:[0/2](356800/4588595) loss:3.067 lr:0.0000100 epoch_Time:26811.0min: [2024-01-04 07:20:19,194][model8_pretrain.py][INFO] Epoch:[0/2](356800/4588595) loss:3.002 lr:0.0000100 epoch_Time:26811.0min: [2024-01-04 07:20:19,194][model8_pretrain.py][INFO] Epoch:[0/2](356800/4588595) loss:3.230 lr:0.0000100 epoch_Time:26811.0min: [2024-01-04 07:20:19,194][model8_pretrain.py][INFO] Epoch:[0/2](356800/4588595) loss:2.979 lr:0.0000100 epoch_Time:26811.0min: [2024-01-04 07:20:19,194][model8_pretrain.py][INFO] Epoch:[0/2](356800/4588595) loss:2.472 lr:0.0000100 epoch_Time:26811.0min: [2024-01-04 07:20:19,194][model8_pretrain.py][INFO] Epoch:[0/2](356800/4588595) loss:3.183 lr:0.0000100 epoch_Time:26811.0min: [2024-01-04 07:20:56,132][model8_pretrain.py][INFO] Epoch:[0/2](356900/4588595) loss:3.007 lr:0.0000100 epoch_Time:26809.0min: [2024-01-04 07:20:56,132][model8_pretrain.py][INFO] 
Epoch:[0/2](356900/4588595) loss:2.516 lr:0.0000100 epoch_Time:26809.0min: [2024-01-04 07:20:56,132][model8_pretrain.py][INFO] Epoch:[0/2](356900/4588595) loss:2.225 lr:0.0000100 epoch_Time:26809.0min: [2024-01-04 07:20:56,132][model8_pretrain.py][INFO] Epoch:[0/2](356900/4588595) loss:2.100 lr:0.0000100 epoch_Time:26809.0min: [2024-01-04 07:20:56,132][model8_pretrain.py][INFO] Epoch:[0/2](356900/4588595) loss:3.118 lr:0.0000100 epoch_Time:26809.0min: [2024-01-04 07:20:56,132][model8_pretrain.py][INFO] Epoch:[0/2](356900/4588595) loss:2.382 lr:0.0000100 epoch_Time:26809.0min: [2024-01-04 07:20:56,132][model8_pretrain.py][INFO] Epoch:[0/2](356900/4588595) loss:2.807 lr:0.0000100 epoch_Time:26809.0min: [2024-01-04 07:20:56,133][model8_pretrain.py][INFO] Epoch:[0/2](356900/4588595) loss:2.893 lr:0.0000100 epoch_Time:26809.0min: [2024-01-04 07:21:33,071][model8_pretrain.py][INFO] Epoch:[0/2](357000/4588595) loss:3.307 lr:0.0000100 epoch_Time:26809.0min: [2024-01-04 07:21:33,071][model8_pretrain.py][INFO] Epoch:[0/2](357000/4588595) loss:2.684 lr:0.0000100 epoch_Time:26809.0min: [2024-01-04 07:21:33,071][model8_pretrain.py][INFO] Epoch:[0/2](357000/4588595) loss:2.891 lr:0.0000100 epoch_Time:26809.0min: [2024-01-04 07:21:33,071][model8_pretrain.py][INFO] Epoch:[0/2](357000/4588595) loss:2.640 lr:0.0000100 epoch_Time:26809.0min: [2024-01-04 07:21:33,071][model8_pretrain.py][INFO] Epoch:[0/2](357000/4588595) loss:2.720 lr:0.0000100 epoch_Time:26809.0min: [2024-01-04 07:21:33,071][model8_pretrain.py][INFO] Epoch:[0/2](357000/4588595) loss:2.970 lr:0.0000100 epoch_Time:26809.0min: [2024-01-04 07:21:33,071][model8_pretrain.py][INFO] Epoch:[0/2](357000/4588595) loss:2.606 lr:0.0000100 epoch_Time:26809.0min: [2024-01-04 07:21:33,071][model8_pretrain.py][INFO] Epoch:[0/2](357000/4588595) loss:2.790 lr:0.0000100 epoch_Time:26809.0min: [2024-01-04 07:22:10,009][model8_pretrain.py][INFO] Epoch:[0/2](357100/4588595) loss:3.106 lr:0.0000100 epoch_Time:26808.0min: [2024-01-04 
07:22:10,009][model8_pretrain.py][INFO] Epoch:[0/2](357100/4588595) loss:2.973 lr:0.0000100 epoch_Time:26808.0min: [2024-01-04 07:22:10,009][model8_pretrain.py][INFO] Epoch:[0/2](357100/4588595) loss:3.064 lr:0.0000100 epoch_Time:26808.0min: [2024-01-04 07:22:10,009][model8_pretrain.py][INFO] Epoch:[0/2](357100/4588595) loss:2.708 lr:0.0000100 epoch_Time:26808.0min: [2024-01-04 07:22:10,009][model8_pretrain.py][INFO] Epoch:[0/2](357100/4588595) loss:2.495 lr:0.0000100 epoch_Time:26808.0min: [2024-01-04 07:22:10,009][model8_pretrain.py][INFO] Epoch:[0/2](357100/4588595) loss:3.162 lr:0.0000100 epoch_Time:26808.0min: [2024-01-04 07:22:10,009][model8_pretrain.py][INFO] Epoch:[0/2](357100/4588595) loss:2.784 lr:0.0000100 epoch_Time:26808.0min: [2024-01-04 07:22:10,009][model8_pretrain.py][INFO] Epoch:[0/2](357100/4588595) loss:2.778 lr:0.0000100 epoch_Time:26808.0min: [2024-01-04 07:22:46,950][model8_pretrain.py][INFO] Epoch:[0/2](357200/4588595) loss:3.352 lr:0.0000100 epoch_Time:26808.0min: [2024-01-04 07:22:46,950][model8_pretrain.py][INFO] Epoch:[0/2](357200/4588595) loss:2.562 lr:0.0000100 epoch_Time:26808.0min: [2024-01-04 07:22:46,950][model8_pretrain.py][INFO] Epoch:[0/2](357200/4588595) loss:2.991 lr:0.0000100 epoch_Time:26808.0min: [2024-01-04 07:22:46,950][model8_pretrain.py][INFO] Epoch:[0/2](357200/4588595) loss:2.648 lr:0.0000100 epoch_Time:26808.0min: [2024-01-04 07:22:46,950][model8_pretrain.py][INFO] Epoch:[0/2](357200/4588595) loss:2.663 lr:0.0000100 epoch_Time:26808.0min: [2024-01-04 07:22:46,950][model8_pretrain.py][INFO] Epoch:[0/2](357200/4588595) loss:3.207 lr:0.0000100 epoch_Time:26808.0min: [2024-01-04 07:22:46,950][model8_pretrain.py][INFO] Epoch:[0/2](357200/4588595) loss:3.380 lr:0.0000100 epoch_Time:26808.0min: [2024-01-04 07:22:46,950][model8_pretrain.py][INFO] Epoch:[0/2](357200/4588595) loss:2.763 lr:0.0000100 epoch_Time:26808.0min: [2024-01-04 07:23:23,891][model8_pretrain.py][INFO] Epoch:[0/2](357300/4588595) loss:2.704 lr:0.0000100 
epoch_Time:26807.0min: [2024-01-04 07:23:23,891][model8_pretrain.py][INFO] Epoch:[0/2](357300/4588595) loss:2.725 lr:0.0000100 epoch_Time:26807.0min: [2024-01-04 07:23:23,891][model8_pretrain.py][INFO] Epoch:[0/2](357300/4588595) loss:3.193 lr:0.0000100 epoch_Time:26807.0min: [2024-01-04 07:23:23,891][model8_pretrain.py][INFO] Epoch:[0/2](357300/4588595) loss:2.650 lr:0.0000100 epoch_Time:26807.0min: [2024-01-04 07:23:23,891][model8_pretrain.py][INFO] Epoch:[0/2](357300/4588595) loss:2.657 lr:0.0000100 epoch_Time:26807.0min: [2024-01-04 07:23:23,891][model8_pretrain.py][INFO] Epoch:[0/2](357300/4588595) loss:2.713 lr:0.0000100 epoch_Time:26807.0min: [2024-01-04 07:23:23,892][model8_pretrain.py][INFO] Epoch:[0/2](357300/4588595) loss:2.650 lr:0.0000100 epoch_Time:26807.0min: [2024-01-04 07:23:23,892][model8_pretrain.py][INFO] Epoch:[0/2](357300/4588595) loss:3.241 lr:0.0000100 epoch_Time:26807.0min: [2024-01-04 07:24:11,058][model8_pretrain.py][INFO] Epoch:[0/2](357400/4588595) loss:2.639 lr:0.0000100 epoch_Time:26808.0min: [2024-01-04 07:24:11,058][model8_pretrain.py][INFO] Epoch:[0/2](357400/4588595) loss:2.982 lr:0.0000100 epoch_Time:26808.0min: [2024-01-04 07:24:11,058][model8_pretrain.py][INFO] Epoch:[0/2](357400/4588595) loss:2.753 lr:0.0000100 epoch_Time:26808.0min: [2024-01-04 07:24:11,058][model8_pretrain.py][INFO] Epoch:[0/2](357400/4588595) loss:2.686 lr:0.0000100 epoch_Time:26808.0min: [2024-01-04 07:24:11,058][model8_pretrain.py][INFO] Epoch:[0/2](357400/4588595) loss:2.498 lr:0.0000100 epoch_Time:26808.0min: [2024-01-04 07:24:11,058][model8_pretrain.py][INFO] Epoch:[0/2](357400/4588595) loss:2.524 lr:0.0000100 epoch_Time:26808.0min: [2024-01-04 07:24:11,058][model8_pretrain.py][INFO] Epoch:[0/2](357400/4588595) loss:2.153 lr:0.0000100 epoch_Time:26808.0min: [2024-01-04 07:24:11,058][model8_pretrain.py][INFO] Epoch:[0/2](357400/4588595) loss:2.872 lr:0.0000100 epoch_Time:26808.0min: [2024-01-04 07:24:47,998][model8_pretrain.py][INFO] 
Epoch:[0/2](357500/4588595) loss:1.916 lr:0.0000100 epoch_Time:26806.0min: [2024-01-04 07:24:47,998][model8_pretrain.py][INFO] Epoch:[0/2](357500/4588595) loss:2.813 lr:0.0000100 epoch_Time:26806.0min: [2024-01-04 07:24:47,998][model8_pretrain.py][INFO] Epoch:[0/2](357500/4588595) loss:2.576 lr:0.0000100 epoch_Time:26806.0min: [2024-01-04 07:24:47,999][model8_pretrain.py][INFO] Epoch:[0/2](357500/4588595) loss:2.805 lr:0.0000100 epoch_Time:26806.0min: [2024-01-04 07:24:47,998][model8_pretrain.py][INFO] Epoch:[0/2](357500/4588595) loss:2.902 lr:0.0000100 epoch_Time:26806.0min: [2024-01-04 07:24:47,998][model8_pretrain.py][INFO] Epoch:[0/2](357500/4588595) loss:2.546 lr:0.0000100 epoch_Time:26806.0min: [2024-01-04 07:24:47,999][model8_pretrain.py][INFO] Epoch:[0/2](357500/4588595) loss:3.616 lr:0.0000100 epoch_Time:26806.0min: [2024-01-04 07:24:47,999][model8_pretrain.py][INFO] Epoch:[0/2](357500/4588595) loss:2.902 lr:0.0000100 epoch_Time:26806.0min: [2024-01-04 07:25:24,943][model8_pretrain.py][INFO] Epoch:[0/2](357600/4588595) loss:2.490 lr:0.0000100 epoch_Time:26806.0min: [2024-01-04 07:25:24,943][model8_pretrain.py][INFO] Epoch:[0/2](357600/4588595) loss:2.984 lr:0.0000100 epoch_Time:26806.0min: [2024-01-04 07:25:24,943][model8_pretrain.py][INFO] Epoch:[0/2](357600/4588595) loss:2.751 lr:0.0000100 epoch_Time:26806.0min: [2024-01-04 07:25:24,943][model8_pretrain.py][INFO] Epoch:[0/2](357600/4588595) loss:2.795 lr:0.0000100 epoch_Time:26806.0min: [2024-01-04 07:25:24,943][model8_pretrain.py][INFO] Epoch:[0/2](357600/4588595) loss:3.016 lr:0.0000100 epoch_Time:26806.0min: [2024-01-04 07:25:24,943][model8_pretrain.py][INFO] Epoch:[0/2](357600/4588595) loss:3.214 lr:0.0000100 epoch_Time:26806.0min: [2024-01-04 07:25:24,943][model8_pretrain.py][INFO] Epoch:[0/2](357600/4588595) loss:2.921 lr:0.0000100 epoch_Time:26806.0min: [2024-01-04 07:25:24,943][model8_pretrain.py][INFO] Epoch:[0/2](357600/4588595) loss:2.081 lr:0.0000100 epoch_Time:26806.0min: [2024-01-04 
07:26:01,890][model8_pretrain.py][INFO] Epoch:[0/2](357700/4588595) loss:2.751 lr:0.0000100 epoch_Time:26805.0min: [2024-01-04 07:26:01,890][model8_pretrain.py][INFO] Epoch:[0/2](357700/4588595) loss:3.137 lr:0.0000100 epoch_Time:26805.0min: [2024-01-04 07:26:01,890][model8_pretrain.py][INFO] Epoch:[0/2](357700/4588595) loss:2.944 lr:0.0000100 epoch_Time:26805.0min: [2024-01-04 07:26:01,890][model8_pretrain.py][INFO] Epoch:[0/2](357700/4588595) loss:2.907 lr:0.0000100 epoch_Time:26805.0min: [2024-01-04 07:26:01,891][model8_pretrain.py][INFO] Epoch:[0/2](357700/4588595) loss:2.380 lr:0.0000100 epoch_Time:26805.0min: [2024-01-04 07:26:01,891][model8_pretrain.py][INFO] Epoch:[0/2](357700/4588595) loss:2.783 lr:0.0000100 epoch_Time:26805.0min: [2024-01-04 07:26:01,891][model8_pretrain.py][INFO] Epoch:[0/2](357700/4588595) loss:3.046 lr:0.0000100 epoch_Time:26805.0min: [2024-01-04 07:26:01,891][model8_pretrain.py][INFO] Epoch:[0/2](357700/4588595) loss:3.293 lr:0.0000100 epoch_Time:26805.0min: [2024-01-04 07:26:38,837][model8_pretrain.py][INFO] Epoch:[0/2](357800/4588595) loss:3.121 lr:0.0000100 epoch_Time:26805.0min: [2024-01-04 07:26:38,837][model8_pretrain.py][INFO] Epoch:[0/2](357800/4588595) loss:3.079 lr:0.0000100 epoch_Time:26805.0min: [2024-01-04 07:26:38,837][model8_pretrain.py][INFO] Epoch:[0/2](357800/4588595) loss:2.794 lr:0.0000100 epoch_Time:26805.0min: [2024-01-04 07:26:38,837][model8_pretrain.py][INFO] Epoch:[0/2](357800/4588595) loss:2.881 lr:0.0000100 epoch_Time:26805.0min: [2024-01-04 07:26:38,837][model8_pretrain.py][INFO] Epoch:[0/2](357800/4588595) loss:2.159 lr:0.0000100 epoch_Time:26805.0min: [2024-01-04 07:26:38,837][model8_pretrain.py][INFO] Epoch:[0/2](357800/4588595) loss:2.894 lr:0.0000100 epoch_Time:26805.0min: [2024-01-04 07:26:38,837][model8_pretrain.py][INFO] Epoch:[0/2](357800/4588595) loss:2.763 lr:0.0000100 epoch_Time:26805.0min: [2024-01-04 07:26:38,837][model8_pretrain.py][INFO] Epoch:[0/2](357800/4588595) loss:2.291 lr:0.0000100 
epoch_Time:26805.0min: [2024-01-04 07:27:15,788][model8_pretrain.py][INFO] Epoch:[0/2](357900/4588595) loss:2.904 lr:0.0000100 epoch_Time:26803.0min: [2024-01-04 07:27:15,788][model8_pretrain.py][INFO] Epoch:[0/2](357900/4588595) loss:2.848 lr:0.0000100 epoch_Time:26803.0min: [2024-01-04 07:27:15,788][model8_pretrain.py][INFO] Epoch:[0/2](357900/4588595) loss:2.909 lr:0.0000100 epoch_Time:26803.0min: [2024-01-04 07:27:15,788][model8_pretrain.py][INFO] Epoch:[0/2](357900/4588595) loss:2.262 lr:0.0000100 epoch_Time:26803.0min: [2024-01-04 07:27:15,788][model8_pretrain.py][INFO] Epoch:[0/2](357900/4588595) loss:2.670 lr:0.0000100 epoch_Time:26803.0min: [2024-01-04 07:27:15,788][model8_pretrain.py][INFO] Epoch:[0/2](357900/4588595) loss:3.064 lr:0.0000100 epoch_Time:26803.0min: [2024-01-04 07:27:15,788][model8_pretrain.py][INFO] Epoch:[0/2](357900/4588595) loss:1.984 lr:0.0000100 epoch_Time:26803.0min: [2024-01-04 07:27:15,788][model8_pretrain.py][INFO] Epoch:[0/2](357900/4588595) loss:2.762 lr:0.0000100 epoch_Time:26803.0min: [2024-01-04 07:27:52,733][model8_pretrain.py][INFO] Epoch:[0/2](358000/4588595) loss:3.298 lr:0.0000100 epoch_Time:26802.0min: [2024-01-04 07:27:52,733][model8_pretrain.py][INFO] Epoch:[0/2](358000/4588595) loss:2.493 lr:0.0000100 epoch_Time:26802.0min: [2024-01-04 07:27:52,733][model8_pretrain.py][INFO] Epoch:[0/2](358000/4588595) loss:3.016 lr:0.0000100 epoch_Time:26802.0min: [2024-01-04 07:27:52,733][model8_pretrain.py][INFO] Epoch:[0/2](358000/4588595) loss:3.250 lr:0.0000100 epoch_Time:26802.0min: [2024-01-04 07:27:52,733][model8_pretrain.py][INFO] Epoch:[0/2](358000/4588595) loss:3.023 lr:0.0000100 epoch_Time:26802.0min: [2024-01-04 07:27:52,733][model8_pretrain.py][INFO] Epoch:[0/2](358000/4588595) loss:2.939 lr:0.0000100 epoch_Time:26802.0min: [2024-01-04 07:27:52,733][model8_pretrain.py][INFO] Epoch:[0/2](358000/4588595) loss:2.570 lr:0.0000100 epoch_Time:26802.0min: [2024-01-04 07:27:52,733][model8_pretrain.py][INFO] 
Epoch:[0/2](358000/4588595) loss:2.970 lr:0.0000100 epoch_Time:26802.0min: [2024-01-04 07:28:29,681][model8_pretrain.py][INFO] Epoch:[0/2](358100/4588595) loss:3.116 lr:0.0000100 epoch_Time:26802.0min: [2024-01-04 07:28:29,681][model8_pretrain.py][INFO] Epoch:[0/2](358100/4588595) loss:2.852 lr:0.0000100 epoch_Time:26802.0min: [2024-01-04 07:28:29,682][model8_pretrain.py][INFO] Epoch:[0/2](358100/4588595) loss:3.105 lr:0.0000100 epoch_Time:26802.0min: [2024-01-04 07:28:29,682][model8_pretrain.py][INFO] Epoch:[0/2](358100/4588595) loss:2.670 lr:0.0000100 epoch_Time:26802.0min: [2024-01-04 07:28:29,682][model8_pretrain.py][INFO] Epoch:[0/2](358100/4588595) loss:3.361 lr:0.0000100 epoch_Time:26802.0min: [2024-01-04 07:28:29,682][model8_pretrain.py][INFO] Epoch:[0/2](358100/4588595) loss:2.979 lr:0.0000100 epoch_Time:26802.0min: [2024-01-04 07:28:29,682][model8_pretrain.py][INFO] Epoch:[0/2](358100/4588595) loss:2.682 lr:0.0000100 epoch_Time:26802.0min: [2024-01-04 07:28:29,682][model8_pretrain.py][INFO] Epoch:[0/2](358100/4588595) loss:3.143 lr:0.0000100 epoch_Time:26802.0min: [2024-01-04 07:29:16,926][model8_pretrain.py][INFO] Epoch:[0/2](358200/4588595) loss:2.864 lr:0.0000100 epoch_Time:26803.0min: [2024-01-04 07:29:16,926][model8_pretrain.py][INFO] Epoch:[0/2](358200/4588595) loss:3.087 lr:0.0000100 epoch_Time:26803.0min: [2024-01-04 07:29:16,926][model8_pretrain.py][INFO] Epoch:[0/2](358200/4588595) loss:3.406 lr:0.0000100 epoch_Time:26803.0min: [2024-01-04 07:29:16,926][model8_pretrain.py][INFO] Epoch:[0/2](358200/4588595) loss:3.130 lr:0.0000100 epoch_Time:26803.0min: [2024-01-04 07:29:16,926][model8_pretrain.py][INFO] Epoch:[0/2](358200/4588595) loss:2.455 lr:0.0000100 epoch_Time:26803.0min: [2024-01-04 07:29:16,926][model8_pretrain.py][INFO] Epoch:[0/2](358200/4588595) loss:2.892 lr:0.0000100 epoch_Time:26803.0min: [2024-01-04 07:29:16,926][model8_pretrain.py][INFO] Epoch:[0/2](358200/4588595) loss:3.165 lr:0.0000100 epoch_Time:26803.0min: [2024-01-04 
07:29:16,926][model8_pretrain.py][INFO] Epoch:[0/2](358200/4588595) loss:2.329 lr:0.0000100 epoch_Time:26803.0min: [2024-01-04 07:29:53,825][model8_pretrain.py][INFO] Epoch:[0/2](358300/4588595) loss:3.396 lr:0.0000100 epoch_Time:26802.0min: [2024-01-04 07:29:53,825][model8_pretrain.py][INFO] Epoch:[0/2](358300/4588595) loss:2.828 lr:0.0000100 epoch_Time:26802.0min: [2024-01-04 07:29:53,825][model8_pretrain.py][INFO] Epoch:[0/2](358300/4588595) loss:3.066 lr:0.0000100 epoch_Time:26802.0min: [2024-01-04 07:29:53,825][model8_pretrain.py][INFO] Epoch:[0/2](358300/4588595) loss:2.987 lr:0.0000100 epoch_Time:26802.0min: [2024-01-04 07:29:53,825][model8_pretrain.py][INFO] Epoch:[0/2](358300/4588595) loss:2.725 lr:0.0000100 epoch_Time:26802.0min: [2024-01-04 07:29:53,825][model8_pretrain.py][INFO] Epoch:[0/2](358300/4588595) loss:2.761 lr:0.0000100 epoch_Time:26802.0min: [2024-01-04 07:29:53,825][model8_pretrain.py][INFO] Epoch:[0/2](358300/4588595) loss:2.963 lr:0.0000100 epoch_Time:26802.0min: [2024-01-04 07:29:53,825][model8_pretrain.py][INFO] Epoch:[0/2](358300/4588595) loss:2.606 lr:0.0000100 epoch_Time:26802.0min: [2024-01-04 07:30:30,767][model8_pretrain.py][INFO] Epoch:[0/2](358400/4588595) loss:3.146 lr:0.0000100 epoch_Time:26801.0min: [2024-01-04 07:30:30,767][model8_pretrain.py][INFO] Epoch:[0/2](358400/4588595) loss:3.151 lr:0.0000100 epoch_Time:26801.0min: [2024-01-04 07:30:30,767][model8_pretrain.py][INFO] Epoch:[0/2](358400/4588595) loss:2.576 lr:0.0000100 epoch_Time:26801.0min: [2024-01-04 07:30:30,767][model8_pretrain.py][INFO] Epoch:[0/2](358400/4588595) loss:3.357 lr:0.0000100 epoch_Time:26801.0min: [2024-01-04 07:30:30,767][model8_pretrain.py][INFO] Epoch:[0/2](358400/4588595) loss:2.267 lr:0.0000100 epoch_Time:26801.0min: [2024-01-04 07:30:30,767][model8_pretrain.py][INFO] Epoch:[0/2](358400/4588595) loss:3.286 lr:0.0000100 epoch_Time:26801.0min: [2024-01-04 07:30:30,767][model8_pretrain.py][INFO] Epoch:[0/2](358400/4588595) loss:3.559 lr:0.0000100 
epoch_Time:26801.0min: [2024-01-04 07:30:30,768][model8_pretrain.py][INFO] Epoch:[0/2](358400/4588595) loss:2.759 lr:0.0000100 epoch_Time:26801.0min: [2024-01-04 07:31:07,709][model8_pretrain.py][INFO] Epoch:[0/2](358500/4588595) loss:3.072 lr:0.0000100 epoch_Time:26800.0min: [2024-01-04 07:31:07,709][model8_pretrain.py][INFO] Epoch:[0/2](358500/4588595) loss:3.171 lr:0.0000100 epoch_Time:26800.0min: [2024-01-04 07:31:07,709][model8_pretrain.py][INFO] Epoch:[0/2](358500/4588595) loss:2.738 lr:0.0000100 epoch_Time:26800.0min: [2024-01-04 07:31:07,709][model8_pretrain.py][INFO] Epoch:[0/2](358500/4588595) loss:3.142 lr:0.0000100 epoch_Time:26800.0min: [2024-01-04 07:31:07,709][model8_pretrain.py][INFO] Epoch:[0/2](358500/4588595) loss:2.746 lr:0.0000100 epoch_Time:26800.0min: [2024-01-04 07:31:07,709][model8_pretrain.py][INFO] Epoch:[0/2](358500/4588595) loss:3.210 lr:0.0000100 epoch_Time:26800.0min: [2024-01-04 07:31:07,709][model8_pretrain.py][INFO] Epoch:[0/2](358500/4588595) loss:2.736 lr:0.0000100 epoch_Time:26800.0min: [2024-01-04 07:31:07,709][model8_pretrain.py][INFO] Epoch:[0/2](358500/4588595) loss:3.036 lr:0.0000100 epoch_Time:26800.0min: [2024-01-04 07:31:44,647][model8_pretrain.py][INFO] Epoch:[0/2](358600/4588595) loss:3.050 lr:0.0000100 epoch_Time:26800.0min: [2024-01-04 07:31:44,647][model8_pretrain.py][INFO] Epoch:[0/2](358600/4588595) loss:1.545 lr:0.0000100 epoch_Time:26800.0min: [2024-01-04 07:31:44,647][model8_pretrain.py][INFO] Epoch:[0/2](358600/4588595) loss:2.826 lr:0.0000100 epoch_Time:26800.0min: [2024-01-04 07:31:44,647][model8_pretrain.py][INFO] Epoch:[0/2](358600/4588595) loss:2.449 lr:0.0000100 epoch_Time:26800.0min: [2024-01-04 07:31:44,647][model8_pretrain.py][INFO] Epoch:[0/2](358600/4588595) loss:3.015 lr:0.0000100 epoch_Time:26800.0min: [2024-01-04 07:31:44,647][model8_pretrain.py][INFO] Epoch:[0/2](358600/4588595) loss:2.806 lr:0.0000100 epoch_Time:26800.0min: [2024-01-04 07:31:44,647][model8_pretrain.py][INFO] 
Epoch:[0/2](358600/4588595) loss:2.369 lr:0.0000100 epoch_Time:26800.0min: [2024-01-04 07:31:44,647][model8_pretrain.py][INFO] Epoch:[0/2](358600/4588595) loss:3.406 lr:0.0000100 epoch_Time:26800.0min: [2024-01-04 07:32:21,588][model8_pretrain.py][INFO] Epoch:[0/2](358700/4588595) loss:2.862 lr:0.0000100 epoch_Time:26799.0min: [2024-01-04 07:32:21,588][model8_pretrain.py][INFO] Epoch:[0/2](358700/4588595) loss:2.574 lr:0.0000100 epoch_Time:26799.0min: [2024-01-04 07:32:21,588][model8_pretrain.py][INFO] Epoch:[0/2](358700/4588595) loss:3.139 lr:0.0000100 epoch_Time:26799.0min: [2024-01-04 07:32:21,588][model8_pretrain.py][INFO] Epoch:[0/2](358700/4588595) loss:3.364 lr:0.0000100 epoch_Time:26799.0min: [2024-01-04 07:32:21,589][model8_pretrain.py][INFO] Epoch:[0/2](358700/4588595) loss:2.974 lr:0.0000100 epoch_Time:26799.0min: [2024-01-04 07:32:21,589][model8_pretrain.py][INFO] Epoch:[0/2](358700/4588595) loss:2.911 lr:0.0000100 epoch_Time:26799.0min: [2024-01-04 07:32:21,589][model8_pretrain.py][INFO] Epoch:[0/2](358700/4588595) loss:2.676 lr:0.0000100 epoch_Time:26799.0min: [2024-01-04 07:32:21,589][model8_pretrain.py][INFO] Epoch:[0/2](358700/4588595) loss:3.168 lr:0.0000100 epoch_Time:26799.0min: [2024-01-04 07:32:58,529][model8_pretrain.py][INFO] Epoch:[0/2](358800/4588595) loss:2.765 lr:0.0000100 epoch_Time:26798.0min: [2024-01-04 07:32:58,529][model8_pretrain.py][INFO] Epoch:[0/2](358800/4588595) loss:2.399 lr:0.0000100 epoch_Time:26798.0min: [2024-01-04 07:32:58,529][model8_pretrain.py][INFO] Epoch:[0/2](358800/4588595) loss:2.493 lr:0.0000100 epoch_Time:26798.0min: [2024-01-04 07:32:58,529][model8_pretrain.py][INFO] Epoch:[0/2](358800/4588595) loss:2.819 lr:0.0000100 epoch_Time:26798.0min: [2024-01-04 07:32:58,529][model8_pretrain.py][INFO] Epoch:[0/2](358800/4588595) loss:3.563 lr:0.0000100 epoch_Time:26798.0min: [2024-01-04 07:32:58,529][model8_pretrain.py][INFO] Epoch:[0/2](358800/4588595) loss:2.839 lr:0.0000100 epoch_Time:26798.0min: [2024-01-04 
07:32:58,529][model8_pretrain.py][INFO] Epoch:[0/2](358800/4588595) loss:2.816 lr:0.0000100 epoch_Time:26798.0min: [2024-01-04 07:32:58,529][model8_pretrain.py][INFO] Epoch:[0/2](358800/4588595) loss:3.037 lr:0.0000100 epoch_Time:26798.0min: [2024-01-04 07:33:35,472][model8_pretrain.py][INFO] Epoch:[0/2](358900/4588595) loss:2.755 lr:0.0000100 epoch_Time:26797.0min: [2024-01-04 07:33:35,472][model8_pretrain.py][INFO] Epoch:[0/2](358900/4588595) loss:3.099 lr:0.0000100 epoch_Time:26797.0min: [2024-01-04 07:33:35,472][model8_pretrain.py][INFO] Epoch:[0/2](358900/4588595) loss:2.983 lr:0.0000100 epoch_Time:26797.0min: [2024-01-04 07:33:35,472][model8_pretrain.py][INFO] Epoch:[0/2](358900/4588595) loss:2.867 lr:0.0000100 epoch_Time:26797.0min: [2024-01-04 07:33:35,472][model8_pretrain.py][INFO] Epoch:[0/2](358900/4588595) loss:2.858 lr:0.0000100 epoch_Time:26797.0min: [2024-01-04 07:33:35,472][model8_pretrain.py][INFO] Epoch:[0/2](358900/4588595) loss:3.436 lr:0.0000100 epoch_Time:26797.0min: [2024-01-04 07:33:35,472][model8_pretrain.py][INFO] Epoch:[0/2](358900/4588595) loss:2.535 lr:0.0000100 epoch_Time:26797.0min: [2024-01-04 07:33:35,472][model8_pretrain.py][INFO] Epoch:[0/2](358900/4588595) loss:3.082 lr:0.0000100 epoch_Time:26797.0min: [2024-01-04 07:34:22,800][model8_pretrain.py][INFO] Epoch:[0/2](359000/4588595) loss:2.726 lr:0.0000100 epoch_Time:26798.0min: [2024-01-04 07:34:22,800][model8_pretrain.py][INFO] Epoch:[0/2](359000/4588595) loss:3.082 lr:0.0000100 epoch_Time:26798.0min: [2024-01-04 07:34:22,800][model8_pretrain.py][INFO] Epoch:[0/2](359000/4588595) loss:2.694 lr:0.0000100 epoch_Time:26798.0min: [2024-01-04 07:34:22,800][model8_pretrain.py][INFO] Epoch:[0/2](359000/4588595) loss:2.538 lr:0.0000100 epoch_Time:26798.0min: [2024-01-04 07:34:22,800][model8_pretrain.py][INFO] Epoch:[0/2](359000/4588595) loss:3.486 lr:0.0000100 epoch_Time:26798.0min: [2024-01-04 07:34:22,800][model8_pretrain.py][INFO] Epoch:[0/2](359000/4588595) loss:3.072 lr:0.0000100 
epoch_Time:26798.0min: [2024-01-04 07:34:22,800][model8_pretrain.py][INFO] Epoch:[0/2](359000/4588595) loss:3.012 lr:0.0000100 epoch_Time:26798.0min: [2024-01-04 07:34:22,800][model8_pretrain.py][INFO] Epoch:[0/2](359000/4588595) loss:3.085 lr:0.0000100 epoch_Time:26798.0min: [2024-01-04 07:34:59,725][model8_pretrain.py][INFO] Epoch:[0/2](359100/4588595) loss:3.526 lr:0.0000100 epoch_Time:26797.0min: [2024-01-04 07:34:59,725][model8_pretrain.py][INFO] Epoch:[0/2](359100/4588595) loss:2.459 lr:0.0000100 epoch_Time:26797.0min: [2024-01-04 07:34:59,725][model8_pretrain.py][INFO] Epoch:[0/2](359100/4588595) loss:3.356 lr:0.0000100 epoch_Time:26797.0min: [2024-01-04 07:34:59,725][model8_pretrain.py][INFO] Epoch:[0/2](359100/4588595) loss:2.484 lr:0.0000100 epoch_Time:26797.0min: [2024-01-04 07:34:59,725][model8_pretrain.py][INFO] Epoch:[0/2](359100/4588595) loss:3.216 lr:0.0000100 epoch_Time:26797.0min: [2024-01-04 07:34:59,725][model8_pretrain.py][INFO] Epoch:[0/2](359100/4588595) loss:2.596 lr:0.0000100 epoch_Time:26797.0min: [2024-01-04 07:34:59,725][model8_pretrain.py][INFO] Epoch:[0/2](359100/4588595) loss:2.858 lr:0.0000100 epoch_Time:26797.0min: [2024-01-04 07:34:59,725][model8_pretrain.py][INFO] Epoch:[0/2](359100/4588595) loss:3.234 lr:0.0000100 epoch_Time:26797.0min: [2024-01-04 07:35:36,670][model8_pretrain.py][INFO] Epoch:[0/2](359200/4588595) loss:3.226 lr:0.0000100 epoch_Time:26797.0min: [2024-01-04 07:35:36,670][model8_pretrain.py][INFO] Epoch:[0/2](359200/4588595) loss:2.644 lr:0.0000100 epoch_Time:26797.0min: [2024-01-04 07:35:36,670][model8_pretrain.py][INFO] Epoch:[0/2](359200/4588595) loss:3.526 lr:0.0000100 epoch_Time:26797.0min: [2024-01-04 07:35:36,670][model8_pretrain.py][INFO] Epoch:[0/2](359200/4588595) loss:2.484 lr:0.0000100 epoch_Time:26797.0min: [2024-01-04 07:35:36,670][model8_pretrain.py][INFO] Epoch:[0/2](359200/4588595) loss:2.703 lr:0.0000100 epoch_Time:26797.0min: [2024-01-04 07:35:36,670][model8_pretrain.py][INFO] 
Epoch:[0/2](359200/4588595) loss:2.689 lr:0.0000100 epoch_Time:26797.0min: [2024-01-04 07:35:36,670][model8_pretrain.py][INFO] Epoch:[0/2](359200/4588595) loss:3.016 lr:0.0000100 epoch_Time:26797.0min: [2024-01-04 07:35:36,670][model8_pretrain.py][INFO] Epoch:[0/2](359200/4588595) loss:2.861 lr:0.0000100 epoch_Time:26797.0min: [2024-01-04 07:36:13,616][model8_pretrain.py][INFO] Epoch:[0/2](359300/4588595) loss:2.917 lr:0.0000100 epoch_Time:26796.0min: [2024-01-04 07:36:13,616][model8_pretrain.py][INFO] Epoch:[0/2](359300/4588595) loss:2.441 lr:0.0000100 epoch_Time:26796.0min: [2024-01-04 07:36:13,616][model8_pretrain.py][INFO] Epoch:[0/2](359300/4588595) loss:3.186 lr:0.0000100 epoch_Time:26796.0min: [2024-01-04 07:36:13,616][model8_pretrain.py][INFO] Epoch:[0/2](359300/4588595) loss:2.477 lr:0.0000100 epoch_Time:26796.0min: [2024-01-04 07:36:13,616][model8_pretrain.py][INFO] Epoch:[0/2](359300/4588595) loss:2.583 lr:0.0000100 epoch_Time:26796.0min: [2024-01-04 07:36:13,616][model8_pretrain.py][INFO] Epoch:[0/2](359300/4588595) loss:2.729 lr:0.0000100 epoch_Time:26796.0min: [2024-01-04 07:36:13,616][model8_pretrain.py][INFO] Epoch:[0/2](359300/4588595) loss:2.970 lr:0.0000100 epoch_Time:26796.0min: [2024-01-04 07:36:13,617][model8_pretrain.py][INFO] Epoch:[0/2](359300/4588595) loss:3.299 lr:0.0000100 epoch_Time:26796.0min: [2024-01-04 07:36:50,552][model8_pretrain.py][INFO] Epoch:[0/2](359400/4588595) loss:2.850 lr:0.0000100 epoch_Time:26794.0min: [2024-01-04 07:36:50,552][model8_pretrain.py][INFO] Epoch:[0/2](359400/4588595) loss:3.144 lr:0.0000100 epoch_Time:26794.0min: [2024-01-04 07:36:50,552][model8_pretrain.py][INFO] Epoch:[0/2](359400/4588595) loss:2.806 lr:0.0000100 epoch_Time:26794.0min: [2024-01-04 07:36:50,552][model8_pretrain.py][INFO] Epoch:[0/2](359400/4588595) loss:2.496 lr:0.0000100 epoch_Time:26794.0min: [2024-01-04 07:36:50,552][model8_pretrain.py][INFO] Epoch:[0/2](359400/4588595) loss:3.117 lr:0.0000100 epoch_Time:26794.0min: [2024-01-04 
07:36:50,552][model8_pretrain.py][INFO] Epoch:[0/2](359400/4588595) loss:2.111 lr:0.0000100 epoch_Time:26794.0min: [2024-01-04 07:36:50,552][model8_pretrain.py][INFO] Epoch:[0/2](359400/4588595) loss:2.734 lr:0.0000100 epoch_Time:26794.0min: [2024-01-04 07:36:50,552][model8_pretrain.py][INFO] Epoch:[0/2](359400/4588595) loss:3.102 lr:0.0000100 epoch_Time:26794.0min: [2024-01-04 07:37:27,488][model8_pretrain.py][INFO] Epoch:[0/2](359500/4588595) loss:2.951 lr:0.0000100 epoch_Time:26794.0min: [2024-01-04 07:37:27,488][model8_pretrain.py][INFO] Epoch:[0/2](359500/4588595) loss:2.450 lr:0.0000100 epoch_Time:26794.0min: [2024-01-04 07:37:27,488][model8_pretrain.py][INFO] Epoch:[0/2](359500/4588595) loss:2.272 lr:0.0000100 epoch_Time:26794.0min: [2024-01-04 07:37:27,488][model8_pretrain.py][INFO] Epoch:[0/2](359500/4588595) loss:2.768 lr:0.0000100 epoch_Time:26794.0min: [2024-01-04 07:37:27,488][model8_pretrain.py][INFO] Epoch:[0/2](359500/4588595) loss:2.427 lr:0.0000100 epoch_Time:26794.0min: [2024-01-04 07:37:27,488][model8_pretrain.py][INFO] Epoch:[0/2](359500/4588595) loss:2.968 lr:0.0000100 epoch_Time:26794.0min: [2024-01-04 07:37:27,488][model8_pretrain.py][INFO] Epoch:[0/2](359500/4588595) loss:2.858 lr:0.0000100 epoch_Time:26794.0min: [2024-01-04 07:37:27,490][model8_pretrain.py][INFO] Epoch:[0/2](359500/4588595) loss:1.930 lr:0.0000100 epoch_Time:26794.0min: [2024-01-04 07:38:04,430][model8_pretrain.py][INFO] Epoch:[0/2](359600/4588595) loss:3.228 lr:0.0000100 epoch_Time:26793.0min: [2024-01-04 07:38:04,430][model8_pretrain.py][INFO] Epoch:[0/2](359600/4588595) loss:2.588 lr:0.0000100 epoch_Time:26793.0min: [2024-01-04 07:38:04,430][model8_pretrain.py][INFO] Epoch:[0/2](359600/4588595) loss:2.915 lr:0.0000100 epoch_Time:26793.0min: [2024-01-04 07:38:04,430][model8_pretrain.py][INFO] Epoch:[0/2](359600/4588595) loss:2.753 lr:0.0000100 epoch_Time:26793.0min: [2024-01-04 07:38:04,430][model8_pretrain.py][INFO] Epoch:[0/2](359600/4588595) loss:2.175 lr:0.0000100 
epoch_Time:26793.0min: [2024-01-04 07:38:04,430][model8_pretrain.py][INFO] Epoch:[0/2](359600/4588595) loss:2.199 lr:0.0000100 epoch_Time:26793.0min: [2024-01-04 07:38:04,430][model8_pretrain.py][INFO] Epoch:[0/2](359600/4588595) loss:2.777 lr:0.0000100 epoch_Time:26793.0min: [2024-01-04 07:38:04,430][model8_pretrain.py][INFO] Epoch:[0/2](359600/4588595) loss:3.011 lr:0.0000100 epoch_Time:26793.0min: [2024-01-04 07:38:41,374][model8_pretrain.py][INFO] Epoch:[0/2](359700/4588595) loss:2.664 lr:0.0000100 epoch_Time:26793.0min: [2024-01-04 07:38:41,374][model8_pretrain.py][INFO] Epoch:[0/2](359700/4588595) loss:2.817 lr:0.0000100 epoch_Time:26793.0min: [2024-01-04 07:38:41,374][model8_pretrain.py][INFO] Epoch:[0/2](359700/4588595) loss:2.774 lr:0.0000100 epoch_Time:26793.0min: [2024-01-04 07:38:41,374][model8_pretrain.py][INFO] Epoch:[0/2](359700/4588595) loss:3.096 lr:0.0000100 epoch_Time:26793.0min: [2024-01-04 07:38:41,375][model8_pretrain.py][INFO] Epoch:[0/2](359700/4588595) loss:3.435 lr:0.0000100 epoch_Time:26793.0min: [2024-01-04 07:38:41,374][model8_pretrain.py][INFO] Epoch:[0/2](359700/4588595) loss:2.763 lr:0.0000100 epoch_Time:26793.0min: [2024-01-04 07:38:41,375][model8_pretrain.py][INFO] Epoch:[0/2](359700/4588595) loss:3.216 lr:0.0000100 epoch_Time:26793.0min: [2024-01-04 07:38:41,375][model8_pretrain.py][INFO] Epoch:[0/2](359700/4588595) loss:2.551 lr:0.0000100 epoch_Time:26793.0min: [2024-01-04 07:39:28,716][model8_pretrain.py][INFO] Epoch:[0/2](359800/4588595) loss:3.137 lr:0.0000100 epoch_Time:26794.0min: [2024-01-04 07:39:28,716][model8_pretrain.py][INFO] Epoch:[0/2](359800/4588595) loss:2.663 lr:0.0000100 epoch_Time:26794.0min: [2024-01-04 07:39:28,716][model8_pretrain.py][INFO] Epoch:[0/2](359800/4588595) loss:3.288 lr:0.0000100 epoch_Time:26794.0min: [2024-01-04 07:39:28,716][model8_pretrain.py][INFO] Epoch:[0/2](359800/4588595) loss:3.074 lr:0.0000100 epoch_Time:26794.0min: [2024-01-04 07:39:28,716][model8_pretrain.py][INFO] 
Epoch:[0/2](359800/4588595) loss:2.781 lr:0.0000100 epoch_Time:26794.0min: [2024-01-04 07:39:28,716][model8_pretrain.py][INFO] Epoch:[0/2](359800/4588595) loss:2.962 lr:0.0000100 epoch_Time:26794.0min: [2024-01-04 07:39:28,716][model8_pretrain.py][INFO] Epoch:[0/2](359800/4588595) loss:2.771 lr:0.0000100 epoch_Time:26794.0min: [2024-01-04 07:39:28,716][model8_pretrain.py][INFO] Epoch:[0/2](359800/4588595) loss:2.822 lr:0.0000100 epoch_Time:26794.0min: [2024-01-04 07:40:05,652][model8_pretrain.py][INFO] Epoch:[0/2](359900/4588595) loss:3.405 lr:0.0000100 epoch_Time:26792.0min: [2024-01-04 07:40:05,652][model8_pretrain.py][INFO] Epoch:[0/2](359900/4588595) loss:2.361 lr:0.0000100 epoch_Time:26792.0min: [2024-01-04 07:40:05,652][model8_pretrain.py][INFO] Epoch:[0/2](359900/4588595) loss:3.094 lr:0.0000100 epoch_Time:26792.0min: [2024-01-04 07:40:05,652][model8_pretrain.py][INFO] Epoch:[0/2](359900/4588595) loss:3.002 lr:0.0000100 epoch_Time:26792.0min: [2024-01-04 07:40:05,652][model8_pretrain.py][INFO] Epoch:[0/2](359900/4588595) loss:3.201 lr:0.0000100 epoch_Time:26792.0min: [2024-01-04 07:40:05,652][model8_pretrain.py][INFO] Epoch:[0/2](359900/4588595) loss:3.225 lr:0.0000100 epoch_Time:26792.0min: [2024-01-04 07:40:05,652][model8_pretrain.py][INFO] Epoch:[0/2](359900/4588595) loss:3.256 lr:0.0000100 epoch_Time:26792.0min: [2024-01-04 07:40:05,652][model8_pretrain.py][INFO] Epoch:[0/2](359900/4588595) loss:3.332 lr:0.0000100 epoch_Time:26792.0min: [2024-01-04 07:40:42,611][model8_pretrain.py][INFO] Epoch:[0/2](360000/4588595) loss:2.622 lr:0.0000100 epoch_Time:26792.0min: [2024-01-04 07:40:42,611][model8_pretrain.py][INFO] Epoch:[0/2](360000/4588595) loss:2.636 lr:0.0000100 epoch_Time:26792.0min: [2024-01-04 07:40:42,611][model8_pretrain.py][INFO] Epoch:[0/2](360000/4588595) loss:2.848 lr:0.0000100 epoch_Time:26792.0min: [2024-01-04 07:40:42,611][model8_pretrain.py][INFO] Epoch:[0/2](360000/4588595) loss:2.668 lr:0.0000100 epoch_Time:26792.0min: [2024-01-04 
07:40:42,611][model8_pretrain.py][INFO] Epoch:[0/2](360000/4588595) loss:2.951 lr:0.0000100 epoch_Time:26792.0min: [2024-01-04 07:40:42,611][model8_pretrain.py][INFO] Epoch:[0/2](360000/4588595) loss:2.977 lr:0.0000100 epoch_Time:26792.0min: [2024-01-04 07:40:42,611][model8_pretrain.py][INFO] Epoch:[0/2](360000/4588595) loss:2.962 lr:0.0000100 epoch_Time:26792.0min: [2024-01-04 07:40:42,611][model8_pretrain.py][INFO] Epoch:[0/2](360000/4588595) loss:2.687 lr:0.0000100 epoch_Time:26792.0min: [2024-01-04 07:41:19,566][model8_pretrain.py][INFO] Epoch:[0/2](360100/4588595) loss:3.199 lr:0.0000100 epoch_Time:26791.0min: [2024-01-04 07:41:19,566][model8_pretrain.py][INFO] Epoch:[0/2](360100/4588595) loss:2.600 lr:0.0000100 epoch_Time:26791.0min: [2024-01-04 07:41:19,566][model8_pretrain.py][INFO] Epoch:[0/2](360100/4588595) loss:2.825 lr:0.0000100 epoch_Time:26791.0min: [2024-01-04 07:41:19,566][model8_pretrain.py][INFO] Epoch:[0/2](360100/4588595) loss:3.141 lr:0.0000100 epoch_Time:26791.0min: [2024-01-04 07:41:19,566][model8_pretrain.py][INFO] Epoch:[0/2](360100/4588595) loss:2.427 lr:0.0000100 epoch_Time:26791.0min: [2024-01-04 07:41:19,566][model8_pretrain.py][INFO] Epoch:[0/2](360100/4588595) loss:2.943 lr:0.0000100 epoch_Time:26791.0min: [2024-01-04 07:41:19,566][model8_pretrain.py][INFO] Epoch:[0/2](360100/4588595) loss:3.166 lr:0.0000100 epoch_Time:26791.0min: [2024-01-04 07:41:19,566][model8_pretrain.py][INFO] Epoch:[0/2](360100/4588595) loss:3.135 lr:0.0000100 epoch_Time:26791.0min: [2024-01-04 07:41:56,519][model8_pretrain.py][INFO] Epoch:[0/2](360200/4588595) loss:3.188 lr:0.0000100 epoch_Time:26790.0min: [2024-01-04 07:41:56,519][model8_pretrain.py][INFO] Epoch:[0/2](360200/4588595) loss:2.944 lr:0.0000100 epoch_Time:26790.0min: [2024-01-04 07:41:56,519][model8_pretrain.py][INFO] Epoch:[0/2](360200/4588595) loss:3.060 lr:0.0000100 epoch_Time:26790.0min: [2024-01-04 07:41:56,519][model8_pretrain.py][INFO] Epoch:[0/2](360200/4588595) loss:2.738 lr:0.0000100 
epoch_Time:26790.0min: [2024-01-04 07:41:56,519][model8_pretrain.py][INFO] Epoch:[0/2](360200/4588595) loss:3.514 lr:0.0000100 epoch_Time:26790.0min: [2024-01-04 07:41:56,519][model8_pretrain.py][INFO] Epoch:[0/2](360200/4588595) loss:3.144 lr:0.0000100 epoch_Time:26790.0min: [2024-01-04 07:41:56,519][model8_pretrain.py][INFO] Epoch:[0/2](360200/4588595) loss:3.527 lr:0.0000100 epoch_Time:26790.0min: [2024-01-04 07:41:56,519][model8_pretrain.py][INFO] Epoch:[0/2](360200/4588595) loss:3.299 lr:0.0000100 epoch_Time:26790.0min: [2024-01-04 07:42:33,482][model8_pretrain.py][INFO] Epoch:[0/2](360300/4588595) loss:2.982 lr:0.0000100 epoch_Time:26790.0min: [2024-01-04 07:42:33,482][model8_pretrain.py][INFO] Epoch:[0/2](360300/4588595) loss:2.792 lr:0.0000100 epoch_Time:26790.0min: [2024-01-04 07:42:33,482][model8_pretrain.py][INFO] Epoch:[0/2](360300/4588595) loss:2.845 lr:0.0000100 epoch_Time:26790.0min: [2024-01-04 07:42:33,482][model8_pretrain.py][INFO] Epoch:[0/2](360300/4588595) loss:3.238 lr:0.0000100 epoch_Time:26790.0min: [2024-01-04 07:42:33,482][model8_pretrain.py][INFO] Epoch:[0/2](360300/4588595) loss:2.803 lr:0.0000100 epoch_Time:26790.0min: [2024-01-04 07:42:33,482][model8_pretrain.py][INFO] Epoch:[0/2](360300/4588595) loss:2.781 lr:0.0000100 epoch_Time:26790.0min: [2024-01-04 07:42:33,482][model8_pretrain.py][INFO] Epoch:[0/2](360300/4588595) loss:2.581 lr:0.0000100 epoch_Time:26790.0min: [2024-01-04 07:42:33,482][model8_pretrain.py][INFO] Epoch:[0/2](360300/4588595) loss:2.434 lr:0.0000100 epoch_Time:26790.0min: [2024-01-04 07:43:10,438][model8_pretrain.py][INFO] Epoch:[0/2](360400/4588595) loss:2.631 lr:0.0000100 epoch_Time:26788.0min: [2024-01-04 07:43:10,438][model8_pretrain.py][INFO] Epoch:[0/2](360400/4588595) loss:2.896 lr:0.0000100 epoch_Time:26788.0min: [2024-01-04 07:43:10,438][model8_pretrain.py][INFO] Epoch:[0/2](360400/4588595) loss:2.870 lr:0.0000100 epoch_Time:26788.0min: [2024-01-04 07:43:10,438][model8_pretrain.py][INFO] 
Epoch:[0/2](360400/4588595) loss:3.230 lr:0.0000100 epoch_Time:26788.0min: [2024-01-04 07:43:10,438][model8_pretrain.py][INFO] Epoch:[0/2](360400/4588595) loss:2.621 lr:0.0000100 epoch_Time:26788.0min: [2024-01-04 07:43:10,438][model8_pretrain.py][INFO] Epoch:[0/2](360400/4588595) loss:3.247 lr:0.0000100 epoch_Time:26788.0min: [2024-01-04 07:43:10,439][model8_pretrain.py][INFO] Epoch:[0/2](360400/4588595) loss:3.194 lr:0.0000100 epoch_Time:26788.0min: [2024-01-04 07:43:10,439][model8_pretrain.py][INFO] Epoch:[0/2](360400/4588595) loss:2.964 lr:0.0000100 epoch_Time:26788.0min: [2024-01-04 07:43:47,377][model8_pretrain.py][INFO] Epoch:[0/2](360500/4588595) loss:3.207 lr:0.0000100 epoch_Time:26788.0min: [2024-01-04 07:43:47,377][model8_pretrain.py][INFO] Epoch:[0/2](360500/4588595) loss:2.668 lr:0.0000100 epoch_Time:26788.0min: [2024-01-04 07:43:47,377][model8_pretrain.py][INFO] Epoch:[0/2](360500/4588595) loss:2.878 lr:0.0000100 epoch_Time:26788.0min: [2024-01-04 07:43:47,377][model8_pretrain.py][INFO] Epoch:[0/2](360500/4588595) loss:3.361 lr:0.0000100 epoch_Time:26788.0min: [2024-01-04 07:43:47,377][model8_pretrain.py][INFO] Epoch:[0/2](360500/4588595) loss:2.674 lr:0.0000100 epoch_Time:26788.0min: [2024-01-04 07:43:47,377][model8_pretrain.py][INFO] Epoch:[0/2](360500/4588595) loss:2.794 lr:0.0000100 epoch_Time:26788.0min: [2024-01-04 07:43:47,377][model8_pretrain.py][INFO] Epoch:[0/2](360500/4588595) loss:2.915 lr:0.0000100 epoch_Time:26788.0min: [2024-01-04 07:43:47,377][model8_pretrain.py][INFO] Epoch:[0/2](360500/4588595) loss:3.276 lr:0.0000100 epoch_Time:26788.0min: [2024-01-04 07:44:34,728][model8_pretrain.py][INFO] Epoch:[0/2](360600/4588595) loss:3.123 lr:0.0000100 epoch_Time:26789.0min: [2024-01-04 07:44:34,728][model8_pretrain.py][INFO] Epoch:[0/2](360600/4588595) loss:3.117 lr:0.0000100 epoch_Time:26789.0min: [2024-01-04 07:44:34,728][model8_pretrain.py][INFO] Epoch:[0/2](360600/4588595) loss:3.223 lr:0.0000100 epoch_Time:26789.0min: [2024-01-04 
07:44:34,728][model8_pretrain.py][INFO] Epoch:[0/2](360600/4588595) loss:2.911 lr:0.0000100 epoch_Time:26789.0min: [2024-01-04 07:44:34,728][model8_pretrain.py][INFO] Epoch:[0/2](360600/4588595) loss:3.025 lr:0.0000100 epoch_Time:26789.0min: [2024-01-04 07:44:34,728][model8_pretrain.py][INFO] Epoch:[0/2](360600/4588595) loss:2.526 lr:0.0000100 epoch_Time:26789.0min: [2024-01-04 07:44:34,728][model8_pretrain.py][INFO] Epoch:[0/2](360600/4588595) loss:2.412 lr:0.0000100 epoch_Time:26789.0min: [2024-01-04 07:44:34,728][model8_pretrain.py][INFO] Epoch:[0/2](360600/4588595) loss:3.105 lr:0.0000100 epoch_Time:26789.0min: [2024-01-04 07:45:11,649][model8_pretrain.py][INFO] Epoch:[0/2](360700/4588595) loss:2.908 lr:0.0000100 epoch_Time:26788.0min: [2024-01-04 07:45:11,649][model8_pretrain.py][INFO] Epoch:[0/2](360700/4588595) loss:2.841 lr:0.0000100 epoch_Time:26788.0min: [2024-01-04 07:45:11,649][model8_pretrain.py][INFO] Epoch:[0/2](360700/4588595) loss:3.033 lr:0.0000100 epoch_Time:26788.0min: [2024-01-04 07:45:11,649][model8_pretrain.py][INFO] Epoch:[0/2](360700/4588595) loss:2.888 lr:0.0000100 epoch_Time:26788.0min: [2024-01-04 07:45:11,650][model8_pretrain.py][INFO] Epoch:[0/2](360700/4588595) loss:2.749 lr:0.0000100 epoch_Time:26788.0min: [2024-01-04 07:45:11,650][model8_pretrain.py][INFO] Epoch:[0/2](360700/4588595) loss:2.296 lr:0.0000100 epoch_Time:26788.0min: [2024-01-04 07:45:11,650][model8_pretrain.py][INFO] Epoch:[0/2](360700/4588595) loss:2.466 lr:0.0000100 epoch_Time:26788.0min: [2024-01-04 07:45:11,650][model8_pretrain.py][INFO] Epoch:[0/2](360700/4588595) loss:3.172 lr:0.0000100 epoch_Time:26788.0min: [2024-01-04 07:45:48,572][model8_pretrain.py][INFO] Epoch:[0/2](360800/4588595) loss:2.641 lr:0.0000100 epoch_Time:26787.0min: [2024-01-04 07:45:48,572][model8_pretrain.py][INFO] Epoch:[0/2](360800/4588595) loss:3.289 lr:0.0000100 epoch_Time:26787.0min: [2024-01-04 07:45:48,572][model8_pretrain.py][INFO] Epoch:[0/2](360800/4588595) loss:3.011 lr:0.0000100 
epoch_Time:26787.0min: [2024-01-04 07:45:48,572][model8_pretrain.py][INFO] Epoch:[0/2](360800/4588595) loss:2.766 lr:0.0000100 epoch_Time:26787.0min: [2024-01-04 07:45:48,572][model8_pretrain.py][INFO] Epoch:[0/2](360800/4588595) loss:2.634 lr:0.0000100 epoch_Time:26787.0min: [2024-01-04 07:45:48,572][model8_pretrain.py][INFO] Epoch:[0/2](360800/4588595) loss:3.172 lr:0.0000100 epoch_Time:26787.0min: [2024-01-04 07:45:48,572][model8_pretrain.py][INFO] Epoch:[0/2](360800/4588595) loss:3.037 lr:0.0000100 epoch_Time:26787.0min: [2024-01-04 07:45:48,572][model8_pretrain.py][INFO] Epoch:[0/2](360800/4588595) loss:3.080 lr:0.0000100 epoch_Time:26787.0min: [2024-01-04 07:46:25,506][model8_pretrain.py][INFO] Epoch:[0/2](360900/4588595) loss:2.679 lr:0.0000100 epoch_Time:26786.0min: [2024-01-04 07:46:25,506][model8_pretrain.py][INFO] Epoch:[0/2](360900/4588595) loss:3.021 lr:0.0000100 epoch_Time:26786.0min: [2024-01-04 07:46:25,506][model8_pretrain.py][INFO] Epoch:[0/2](360900/4588595) loss:2.469 lr:0.0000100 epoch_Time:26786.0min: [2024-01-04 07:46:25,506][model8_pretrain.py][INFO] Epoch:[0/2](360900/4588595) loss:2.973 lr:0.0000100 epoch_Time:26786.0min: [2024-01-04 07:46:25,506][model8_pretrain.py][INFO] Epoch:[0/2](360900/4588595) loss:3.066 lr:0.0000100 epoch_Time:26786.0min: [2024-01-04 07:46:25,506][model8_pretrain.py][INFO] Epoch:[0/2](360900/4588595) loss:3.137 lr:0.0000100 epoch_Time:26786.0min: [2024-01-04 07:46:25,506][model8_pretrain.py][INFO] Epoch:[0/2](360900/4588595) loss:2.897 lr:0.0000100 epoch_Time:26786.0min: [2024-01-04 07:46:25,506][model8_pretrain.py][INFO] Epoch:[0/2](360900/4588595) loss:3.227 lr:0.0000100 epoch_Time:26786.0min: [2024-01-04 07:47:02,441][model8_pretrain.py][INFO] Epoch:[0/2](361000/4588595) loss:3.050 lr:0.0000100 epoch_Time:26785.0min: [2024-01-04 07:47:02,441][model8_pretrain.py][INFO] Epoch:[0/2](361000/4588595) loss:3.183 lr:0.0000100 epoch_Time:26785.0min: [2024-01-04 07:47:02,441][model8_pretrain.py][INFO] 
Epoch:[0/2](361000/4588595) loss:3.308 lr:0.0000100 epoch_Time:26785.0min: [2024-01-04 07:47:02,441][model8_pretrain.py][INFO] Epoch:[0/2](361000/4588595) loss:2.477 lr:0.0000100 epoch_Time:26785.0min: [2024-01-04 07:47:02,442][model8_pretrain.py][INFO] Epoch:[0/2](361000/4588595) loss:2.646 lr:0.0000100 epoch_Time:26785.0min: [2024-01-04 07:47:02,442][model8_pretrain.py][INFO] Epoch:[0/2](361000/4588595) loss:2.805 lr:0.0000100 epoch_Time:26785.0min: [2024-01-04 07:47:02,442][model8_pretrain.py][INFO] Epoch:[0/2](361000/4588595) loss:3.059 lr:0.0000100 epoch_Time:26785.0min: [2024-01-04 07:47:02,442][model8_pretrain.py][INFO] Epoch:[0/2](361000/4588595) loss:2.818 lr:0.0000100 epoch_Time:26785.0min: [2024-01-04 07:47:39,371][model8_pretrain.py][INFO] Epoch:[0/2](361100/4588595) loss:2.715 lr:0.0000100 epoch_Time:26785.0min: [2024-01-04 07:47:39,371][model8_pretrain.py][INFO] Epoch:[0/2](361100/4588595) loss:2.741 lr:0.0000100 epoch_Time:26785.0min: [2024-01-04 07:47:39,371][model8_pretrain.py][INFO] Epoch:[0/2](361100/4588595) loss:2.764 lr:0.0000100 epoch_Time:26785.0min: [2024-01-04 07:47:39,372][model8_pretrain.py][INFO] Epoch:[0/2](361100/4588595) loss:2.827 lr:0.0000100 epoch_Time:26785.0min: [2024-01-04 07:47:39,372][model8_pretrain.py][INFO] Epoch:[0/2](361100/4588595) loss:2.929 lr:0.0000100 epoch_Time:26785.0min: [2024-01-04 07:47:39,372][model8_pretrain.py][INFO] Epoch:[0/2](361100/4588595) loss:2.606 lr:0.0000100 epoch_Time:26785.0min: [2024-01-04 07:47:39,372][model8_pretrain.py][INFO] Epoch:[0/2](361100/4588595) loss:3.013 lr:0.0000100 epoch_Time:26785.0min: [2024-01-04 07:47:39,372][model8_pretrain.py][INFO] Epoch:[0/2](361100/4588595) loss:3.311 lr:0.0000100 epoch_Time:26785.0min: [2024-01-04 07:48:16,306][model8_pretrain.py][INFO] Epoch:[0/2](361200/4588595) loss:2.864 lr:0.0000100 epoch_Time:26784.0min: [2024-01-04 07:48:16,306][model8_pretrain.py][INFO] Epoch:[0/2](361200/4588595) loss:2.393 lr:0.0000100 epoch_Time:26784.0min: [2024-01-04 
07:48:16,306][model8_pretrain.py][INFO] Epoch:[0/2](361200/4588595) loss:2.911 lr:0.0000100 epoch_Time:26784.0min: [2024-01-04 07:48:16,306][model8_pretrain.py][INFO] Epoch:[0/2](361200/4588595) loss:3.028 lr:0.0000100 epoch_Time:26784.0min: [2024-01-04 07:48:16,306][model8_pretrain.py][INFO] Epoch:[0/2](361200/4588595) loss:2.572 lr:0.0000100 epoch_Time:26784.0min: [2024-01-04 07:48:16,306][model8_pretrain.py][INFO] Epoch:[0/2](361200/4588595) loss:3.065 lr:0.0000100 epoch_Time:26784.0min: [2024-01-04 07:48:16,306][model8_pretrain.py][INFO] Epoch:[0/2](361200/4588595) loss:3.063 lr:0.0000100 epoch_Time:26784.0min: [2024-01-04 07:48:16,306][model8_pretrain.py][INFO] Epoch:[0/2](361200/4588595) loss:2.792 lr:0.0000100 epoch_Time:26784.0min: [2024-01-04 07:48:53,238][model8_pretrain.py][INFO] Epoch:[0/2](361300/4588595) loss:3.177 lr:0.0000100 epoch_Time:26782.0min: [2024-01-04 07:48:53,238][model8_pretrain.py][INFO] Epoch:[0/2](361300/4588595) loss:3.117 lr:0.0000100 epoch_Time:26782.0min: [2024-01-04 07:48:53,238][model8_pretrain.py][INFO] Epoch:[0/2](361300/4588595) loss:2.066 lr:0.0000100 epoch_Time:26782.0min: [2024-01-04 07:48:53,238][model8_pretrain.py][INFO] Epoch:[0/2](361300/4588595) loss:2.749 lr:0.0000100 epoch_Time:26782.0min: [2024-01-04 07:48:53,238][model8_pretrain.py][INFO] Epoch:[0/2](361300/4588595) loss:2.914 lr:0.0000100 epoch_Time:26782.0min: [2024-01-04 07:48:53,238][model8_pretrain.py][INFO] Epoch:[0/2](361300/4588595) loss:2.990 lr:0.0000100 epoch_Time:26782.0min: [2024-01-04 07:48:53,238][model8_pretrain.py][INFO] Epoch:[0/2](361300/4588595) loss:3.030 lr:0.0000100 epoch_Time:26782.0min: [2024-01-04 07:48:53,238][model8_pretrain.py][INFO] Epoch:[0/2](361300/4588595) loss:3.399 lr:0.0000100 epoch_Time:26782.0min: [2024-01-04 07:49:40,614][model8_pretrain.py][INFO] Epoch:[0/2](361400/4588595) loss:2.782 lr:0.0000100 epoch_Time:26784.0min: [2024-01-04 07:49:40,614][model8_pretrain.py][INFO] Epoch:[0/2](361400/4588595) loss:2.363 lr:0.0000100 
epoch_Time:26784.0min: [2024-01-04 07:49:40,614][model8_pretrain.py][INFO] Epoch:[0/2](361400/4588595) loss:2.560 lr:0.0000100 epoch_Time:26784.0min: [2024-01-04 07:49:40,614][model8_pretrain.py][INFO] Epoch:[0/2](361400/4588595) loss:2.908 lr:0.0000100 epoch_Time:26784.0min: [2024-01-04 07:49:40,614][model8_pretrain.py][INFO] Epoch:[0/2](361400/4588595) loss:2.764 lr:0.0000100 epoch_Time:26784.0min: [2024-01-04 07:49:40,614][model8_pretrain.py][INFO] Epoch:[0/2](361400/4588595) loss:2.180 lr:0.0000100 epoch_Time:26784.0min: [2024-01-04 07:49:40,614][model8_pretrain.py][INFO] Epoch:[0/2](361400/4588595) loss:3.191 lr:0.0000100 epoch_Time:26784.0min: [2024-01-04 07:49:40,614][model8_pretrain.py][INFO] Epoch:[0/2](361400/4588595) loss:2.603 lr:0.0000100 epoch_Time:26784.0min: [2024-01-04 07:50:17,547][model8_pretrain.py][INFO] Epoch:[0/2](361500/4588595) loss:2.948 lr:0.0000100 epoch_Time:26783.0min: [2024-01-04 07:50:17,547][model8_pretrain.py][INFO] Epoch:[0/2](361500/4588595) loss:3.214 lr:0.0000100 epoch_Time:26783.0min: [2024-01-04 07:50:17,547][model8_pretrain.py][INFO] Epoch:[0/2](361500/4588595) loss:2.794 lr:0.0000100 epoch_Time:26783.0min: [2024-01-04 07:50:17,547][model8_pretrain.py][INFO] Epoch:[0/2](361500/4588595) loss:3.171 lr:0.0000100 epoch_Time:26783.0min: [2024-01-04 07:50:17,547][model8_pretrain.py][INFO] Epoch:[0/2](361500/4588595) loss:2.639 lr:0.0000100 epoch_Time:26783.0min: [2024-01-04 07:50:17,547][model8_pretrain.py][INFO] Epoch:[0/2](361500/4588595) loss:3.351 lr:0.0000100 epoch_Time:26783.0min: [2024-01-04 07:50:17,547][model8_pretrain.py][INFO] Epoch:[0/2](361500/4588595) loss:3.000 lr:0.0000100 epoch_Time:26783.0min: [2024-01-04 07:50:17,548][model8_pretrain.py][INFO] Epoch:[0/2](361500/4588595) loss:2.677 lr:0.0000100 epoch_Time:26783.0min: [2024-01-04 07:50:54,480][model8_pretrain.py][INFO] Epoch:[0/2](361600/4588595) loss:2.974 lr:0.0000100 epoch_Time:26782.0min: [2024-01-04 07:50:54,480][model8_pretrain.py][INFO] 
Epoch:[0/2](361600/4588595) loss:3.256 lr:0.0000100 epoch_Time:26782.0min: [2024-01-04 07:50:54,480][model8_pretrain.py][INFO] Epoch:[0/2](361600/4588595) loss:3.125 lr:0.0000100 epoch_Time:26782.0min: [2024-01-04 07:50:54,480][model8_pretrain.py][INFO] Epoch:[0/2](361600/4588595) loss:3.099 lr:0.0000100 epoch_Time:26782.0min: [2024-01-04 07:50:54,480][model8_pretrain.py][INFO] Epoch:[0/2](361600/4588595) loss:2.359 lr:0.0000100 epoch_Time:26782.0min: [2024-01-04 07:50:54,480][model8_pretrain.py][INFO] Epoch:[0/2](361600/4588595) loss:3.247 lr:0.0000100 epoch_Time:26782.0min: [2024-01-04 07:50:54,480][model8_pretrain.py][INFO] Epoch:[0/2](361600/4588595) loss:3.126 lr:0.0000100 epoch_Time:26782.0min: [2024-01-04 07:50:54,480][model8_pretrain.py][INFO] Epoch:[0/2](361600/4588595) loss:3.152 lr:0.0000100 epoch_Time:26782.0min: [2024-01-04 07:51:31,416][model8_pretrain.py][INFO] Epoch:[0/2](361700/4588595) loss:2.756 lr:0.0000100 epoch_Time:26782.0min: [2024-01-04 07:51:31,416][model8_pretrain.py][INFO] Epoch:[0/2](361700/4588595) loss:2.347 lr:0.0000100 epoch_Time:26782.0min: [2024-01-04 07:51:31,416][model8_pretrain.py][INFO] Epoch:[0/2](361700/4588595) loss:3.005 lr:0.0000100 epoch_Time:26782.0min: [2024-01-04 07:51:31,416][model8_pretrain.py][INFO] Epoch:[0/2](361700/4588595) loss:3.073 lr:0.0000100 epoch_Time:26782.0min: [2024-01-04 07:51:31,416][model8_pretrain.py][INFO] Epoch:[0/2](361700/4588595) loss:2.829 lr:0.0000100 epoch_Time:26782.0min: [2024-01-04 07:51:31,416][model8_pretrain.py][INFO] Epoch:[0/2](361700/4588595) loss:3.013 lr:0.0000100 epoch_Time:26782.0min: [2024-01-04 07:51:31,416][model8_pretrain.py][INFO] Epoch:[0/2](361700/4588595) loss:2.819 lr:0.0000100 epoch_Time:26782.0min: [2024-01-04 07:51:31,416][model8_pretrain.py][INFO] Epoch:[0/2](361700/4588595) loss:2.178 lr:0.0000100 epoch_Time:26782.0min: [2024-01-04 07:52:08,360][model8_pretrain.py][INFO] Epoch:[0/2](361800/4588595) loss:2.929 lr:0.0000100 epoch_Time:26781.0min: [2024-01-04 
07:52:08,360][model8_pretrain.py][INFO] Epoch:[0/2](361800/4588595) loss:2.722 lr:0.0000100 epoch_Time:26781.0min: [2024-01-04 07:52:08,360][model8_pretrain.py][INFO] Epoch:[0/2](361800/4588595) loss:2.636 lr:0.0000100 epoch_Time:26781.0min: [2024-01-04 07:52:08,360][model8_pretrain.py][INFO] Epoch:[0/2](361800/4588595) loss:2.972 lr:0.0000100 epoch_Time:26781.0min: [2024-01-04 07:52:08,360][model8_pretrain.py][INFO] Epoch:[0/2](361800/4588595) loss:3.190 lr:0.0000100 epoch_Time:26781.0min: [2024-01-04 07:52:08,360][model8_pretrain.py][INFO] Epoch:[0/2](361800/4588595) loss:3.060 lr:0.0000100 epoch_Time:26781.0min: [2024-01-04 07:52:08,360][model8_pretrain.py][INFO] Epoch:[0/2](361800/4588595) loss:2.620 lr:0.0000100 epoch_Time:26781.0min: [2024-01-04 07:52:08,360][model8_pretrain.py][INFO] Epoch:[0/2](361800/4588595) loss:3.041 lr:0.0000100 epoch_Time:26781.0min: [2024-01-04 07:52:45,300][model8_pretrain.py][INFO] Epoch:[0/2](361900/4588595) loss:3.013 lr:0.0000100 epoch_Time:26780.0min: [2024-01-04 07:52:45,300][model8_pretrain.py][INFO] Epoch:[0/2](361900/4588595) loss:2.868 lr:0.0000100 epoch_Time:26780.0min: [2024-01-04 07:52:45,300][model8_pretrain.py][INFO] Epoch:[0/2](361900/4588595) loss:2.962 lr:0.0000100 epoch_Time:26780.0min: [2024-01-04 07:52:45,300][model8_pretrain.py][INFO] Epoch:[0/2](361900/4588595) loss:2.913 lr:0.0000100 epoch_Time:26780.0min: [2024-01-04 07:52:45,300][model8_pretrain.py][INFO] Epoch:[0/2](361900/4588595) loss:2.787 lr:0.0000100 epoch_Time:26780.0min: [2024-01-04 07:52:45,300][model8_pretrain.py][INFO] Epoch:[0/2](361900/4588595) loss:2.840 lr:0.0000100 epoch_Time:26780.0min: [2024-01-04 07:52:45,300][model8_pretrain.py][INFO] Epoch:[0/2](361900/4588595) loss:2.677 lr:0.0000100 epoch_Time:26780.0min: [2024-01-04 07:52:45,300][model8_pretrain.py][INFO] Epoch:[0/2](361900/4588595) loss:3.017 lr:0.0000100 epoch_Time:26780.0min: [2024-01-04 07:53:22,244][model8_pretrain.py][INFO] Epoch:[0/2](362000/4588595) loss:2.770 lr:0.0000100 
epoch_Time:26779.0min: [2024-01-04 07:53:22,244][model8_pretrain.py][INFO] Epoch:[0/2](362000/4588595) loss:2.427 lr:0.0000100 epoch_Time:26779.0min: [2024-01-04 07:53:22,244][model8_pretrain.py][INFO] Epoch:[0/2](362000/4588595) loss:3.221 lr:0.0000100 epoch_Time:26779.0min: [2024-01-04 07:53:22,244][model8_pretrain.py][INFO] Epoch:[0/2](362000/4588595) loss:3.542 lr:0.0000100 epoch_Time:26779.0min: [2024-01-04 07:53:22,244][model8_pretrain.py][INFO] Epoch:[0/2](362000/4588595) loss:2.854 lr:0.0000100 epoch_Time:26779.0min: [2024-01-04 07:53:22,244][model8_pretrain.py][INFO] Epoch:[0/2](362000/4588595) loss:2.933 lr:0.0000100 epoch_Time:26779.0min: [2024-01-04 07:53:22,244][model8_pretrain.py][INFO] Epoch:[0/2](362000/4588595) loss:2.885 lr:0.0000100 epoch_Time:26779.0min: [2024-01-04 07:53:22,244][model8_pretrain.py][INFO] Epoch:[0/2](362000/4588595) loss:2.748 lr:0.0000100 epoch_Time:26779.0min: [2024-01-04 07:53:59,195][model8_pretrain.py][INFO] Epoch:[0/2](362100/4588595) loss:2.937 lr:0.0000100 epoch_Time:26778.0min: [2024-01-04 07:53:59,195][model8_pretrain.py][INFO] Epoch:[0/2](362100/4588595) loss:3.239 lr:0.0000100 epoch_Time:26778.0min: [2024-01-04 07:53:59,195][model8_pretrain.py][INFO] Epoch:[0/2](362100/4588595) loss:3.085 lr:0.0000100 epoch_Time:26778.0min: [2024-01-04 07:53:59,195][model8_pretrain.py][INFO] Epoch:[0/2](362100/4588595) loss:3.108 lr:0.0000100 epoch_Time:26778.0min: [2024-01-04 07:53:59,195][model8_pretrain.py][INFO] Epoch:[0/2](362100/4588595) loss:2.718 lr:0.0000100 epoch_Time:26778.0min: [2024-01-04 07:53:59,195][model8_pretrain.py][INFO] Epoch:[0/2](362100/4588595) loss:3.205 lr:0.0000100 epoch_Time:26778.0min: [2024-01-04 07:53:59,196][model8_pretrain.py][INFO] Epoch:[0/2](362100/4588595) loss:2.907 lr:0.0000100 epoch_Time:26778.0min: [2024-01-04 07:53:59,196][model8_pretrain.py][INFO] Epoch:[0/2](362100/4588595) loss:3.150 lr:0.0000100 epoch_Time:26778.0min: [2024-01-04 07:54:46,479][model8_pretrain.py][INFO] 
Epoch:[0/2](362200/4588595) loss:3.149 lr:0.0000100 epoch_Time:26780.0min: [2024-01-04 07:54:46,479][model8_pretrain.py][INFO] Epoch:[0/2](362200/4588595) loss:3.119 lr:0.0000100 epoch_Time:26780.0min: [2024-01-04 07:54:46,479][model8_pretrain.py][INFO] Epoch:[0/2](362200/4588595) loss:2.755 lr:0.0000100 epoch_Time:26780.0min: [2024-01-04 07:54:46,479][model8_pretrain.py][INFO] Epoch:[0/2](362200/4588595) loss:3.168 lr:0.0000100 epoch_Time:26780.0min: [2024-01-04 07:54:46,479][model8_pretrain.py][INFO] Epoch:[0/2](362200/4588595) loss:3.133 lr:0.0000100 epoch_Time:26780.0min: [2024-01-04 07:54:46,479][model8_pretrain.py][INFO] Epoch:[0/2](362200/4588595) loss:2.754 lr:0.0000100 epoch_Time:26780.0min: [2024-01-04 07:54:46,479][model8_pretrain.py][INFO] Epoch:[0/2](362200/4588595) loss:2.760 lr:0.0000100 epoch_Time:26780.0min: [2024-01-04 07:54:46,479][model8_pretrain.py][INFO] Epoch:[0/2](362200/4588595) loss:2.811 lr:0.0000100 epoch_Time:26780.0min: [2024-01-04 07:55:23,407][model8_pretrain.py][INFO] Epoch:[0/2](362300/4588595) loss:2.808 lr:0.0000100 epoch_Time:26779.0min: [2024-01-04 07:55:23,407][model8_pretrain.py][INFO] Epoch:[0/2](362300/4588595) loss:2.624 lr:0.0000100 epoch_Time:26779.0min: [2024-01-04 07:55:23,407][model8_pretrain.py][INFO] Epoch:[0/2](362300/4588595) loss:2.286 lr:0.0000100 epoch_Time:26779.0min: [2024-01-04 07:55:23,407][model8_pretrain.py][INFO] Epoch:[0/2](362300/4588595) loss:2.747 lr:0.0000100 epoch_Time:26779.0min: [2024-01-04 07:55:23,407][model8_pretrain.py][INFO] Epoch:[0/2](362300/4588595) loss:3.088 lr:0.0000100 epoch_Time:26779.0min: [2024-01-04 07:55:23,407][model8_pretrain.py][INFO] Epoch:[0/2](362300/4588595) loss:2.920 lr:0.0000100 epoch_Time:26779.0min: [2024-01-04 07:55:23,407][model8_pretrain.py][INFO] Epoch:[0/2](362300/4588595) loss:3.490 lr:0.0000100 epoch_Time:26779.0min: [2024-01-04 07:55:23,407][model8_pretrain.py][INFO] Epoch:[0/2](362300/4588595) loss:2.913 lr:0.0000100 epoch_Time:26779.0min: [2024-01-04 
07:56:00,343][model8_pretrain.py][INFO] Epoch:[0/2](362400/4588595) loss:2.840 lr:0.0000100 epoch_Time:26777.0min: [2024-01-04 07:56:00,343][model8_pretrain.py][INFO] Epoch:[0/2](362400/4588595) loss:2.774 lr:0.0000100 epoch_Time:26777.0min: [2024-01-04 07:56:00,343][model8_pretrain.py][INFO] Epoch:[0/2](362400/4588595) loss:3.068 lr:0.0000100 epoch_Time:26777.0min: [2024-01-04 07:56:00,343][model8_pretrain.py][INFO] Epoch:[0/2](362400/4588595) loss:2.638 lr:0.0000100 epoch_Time:26777.0min: [2024-01-04 07:56:00,343][model8_pretrain.py][INFO] Epoch:[0/2](362400/4588595) loss:3.022 lr:0.0000100 epoch_Time:26777.0min: [2024-01-04 07:56:00,343][model8_pretrain.py][INFO] Epoch:[0/2](362400/4588595) loss:3.080 lr:0.0000100 epoch_Time:26777.0min: [2024-01-04 07:56:00,343][model8_pretrain.py][INFO] Epoch:[0/2](362400/4588595) loss:3.120 lr:0.0000100 epoch_Time:26777.0min: [2024-01-04 07:56:00,343][model8_pretrain.py][INFO] Epoch:[0/2](362400/4588595) loss:2.728 lr:0.0000100 epoch_Time:26777.0min: [2024-01-04 07:56:37,269][model8_pretrain.py][INFO] Epoch:[0/2](362500/4588595) loss:2.553 lr:0.0000100 epoch_Time:26777.0min: [2024-01-04 07:56:37,269][model8_pretrain.py][INFO] Epoch:[0/2](362500/4588595) loss:3.145 lr:0.0000100 epoch_Time:26777.0min: [2024-01-04 07:56:37,269][model8_pretrain.py][INFO] Epoch:[0/2](362500/4588595) loss:2.788 lr:0.0000100 epoch_Time:26777.0min: [2024-01-04 07:56:37,269][model8_pretrain.py][INFO] Epoch:[0/2](362500/4588595) loss:3.238 lr:0.0000100 epoch_Time:26777.0min: [2024-01-04 07:56:37,269][model8_pretrain.py][INFO] Epoch:[0/2](362500/4588595) loss:3.483 lr:0.0000100 epoch_Time:26777.0min: [2024-01-04 07:56:37,269][model8_pretrain.py][INFO] Epoch:[0/2](362500/4588595) loss:3.083 lr:0.0000100 epoch_Time:26777.0min: [2024-01-04 07:56:37,269][model8_pretrain.py][INFO] Epoch:[0/2](362500/4588595) loss:2.812 lr:0.0000100 epoch_Time:26777.0min: [2024-01-04 07:56:37,269][model8_pretrain.py][INFO] Epoch:[0/2](362500/4588595) loss:3.139 lr:0.0000100 
epoch_Time:26777.0min: [2024-01-04 07:57:14,203][model8_pretrain.py][INFO] Epoch:[0/2](362600/4588595) loss:2.909 lr:0.0000100 epoch_Time:26776.0min: [2024-01-04 07:57:14,203][model8_pretrain.py][INFO] Epoch:[0/2](362600/4588595) loss:2.867 lr:0.0000100 epoch_Time:26776.0min: [2024-01-04 07:57:14,203][model8_pretrain.py][INFO] Epoch:[0/2](362600/4588595) loss:3.401 lr:0.0000100 epoch_Time:26776.0min: [2024-01-04 07:57:14,203][model8_pretrain.py][INFO] Epoch:[0/2](362600/4588595) loss:3.114 lr:0.0000100 epoch_Time:26776.0min: [2024-01-04 07:57:14,203][model8_pretrain.py][INFO] Epoch:[0/2](362600/4588595) loss:2.997 lr:0.0000100 epoch_Time:26776.0min: [2024-01-04 07:57:14,203][model8_pretrain.py][INFO] Epoch:[0/2](362600/4588595) loss:2.469 lr:0.0000100 epoch_Time:26776.0min: [2024-01-04 07:57:14,203][model8_pretrain.py][INFO] Epoch:[0/2](362600/4588595) loss:2.327 lr:0.0000100 epoch_Time:26776.0min: [2024-01-04 07:57:14,203][model8_pretrain.py][INFO] Epoch:[0/2](362600/4588595) loss:2.884 lr:0.0000100 epoch_Time:26776.0min: [2024-01-04 07:57:51,150][model8_pretrain.py][INFO] Epoch:[0/2](362700/4588595) loss:2.859 lr:0.0000100 epoch_Time:26775.0min: [2024-01-04 07:57:51,150][model8_pretrain.py][INFO] Epoch:[0/2](362700/4588595) loss:2.741 lr:0.0000100 epoch_Time:26775.0min: [2024-01-04 07:57:51,150][model8_pretrain.py][INFO] Epoch:[0/2](362700/4588595) loss:3.256 lr:0.0000100 epoch_Time:26775.0min: [2024-01-04 07:57:51,150][model8_pretrain.py][INFO] Epoch:[0/2](362700/4588595) loss:2.955 lr:0.0000100 epoch_Time:26775.0min: [2024-01-04 07:57:51,150][model8_pretrain.py][INFO] Epoch:[0/2](362700/4588595) loss:3.146 lr:0.0000100 epoch_Time:26775.0min: [2024-01-04 07:57:51,150][model8_pretrain.py][INFO] Epoch:[0/2](362700/4588595) loss:2.837 lr:0.0000100 epoch_Time:26775.0min: [2024-01-04 07:57:51,150][model8_pretrain.py][INFO] Epoch:[0/2](362700/4588595) loss:2.736 lr:0.0000100 epoch_Time:26775.0min: [2024-01-04 07:57:51,150][model8_pretrain.py][INFO] 
Epoch:[0/2](362700/4588595) loss:2.605 lr:0.0000100 epoch_Time:26775.0min: [2024-01-04 07:58:28,095][model8_pretrain.py][INFO] Epoch:[0/2](362800/4588595) loss:3.232 lr:0.0000100 epoch_Time:26774.0min: [2024-01-04 07:58:28,095][model8_pretrain.py][INFO] Epoch:[0/2](362800/4588595) loss:2.415 lr:0.0000100 epoch_Time:26774.0min: [2024-01-04 07:58:28,095][model8_pretrain.py][INFO] Epoch:[0/2](362800/4588595) loss:2.823 lr:0.0000100 epoch_Time:26774.0min: [2024-01-04 07:58:28,096][model8_pretrain.py][INFO] Epoch:[0/2](362800/4588595) loss:2.901 lr:0.0000100 epoch_Time:26774.0min: [2024-01-04 07:58:28,096][model8_pretrain.py][INFO] Epoch:[0/2](362800/4588595) loss:2.698 lr:0.0000100 epoch_Time:26774.0min: [2024-01-04 07:58:28,096][model8_pretrain.py][INFO] Epoch:[0/2](362800/4588595) loss:3.094 lr:0.0000100 epoch_Time:26774.0min: [2024-01-04 07:58:28,096][model8_pretrain.py][INFO] Epoch:[0/2](362800/4588595) loss:3.215 lr:0.0000100 epoch_Time:26774.0min: [2024-01-04 07:58:28,096][model8_pretrain.py][INFO] Epoch:[0/2](362800/4588595) loss:3.181 lr:0.0000100 epoch_Time:26774.0min: [2024-01-04 07:59:05,035][model8_pretrain.py][INFO] Epoch:[0/2](362900/4588595) loss:3.451 lr:0.0000100 epoch_Time:26773.0min: [2024-01-04 07:59:05,035][model8_pretrain.py][INFO] Epoch:[0/2](362900/4588595) loss:2.877 lr:0.0000100 epoch_Time:26773.0min: [2024-01-04 07:59:05,035][model8_pretrain.py][INFO] Epoch:[0/2](362900/4588595) loss:2.753 lr:0.0000100 epoch_Time:26773.0min: [2024-01-04 07:59:05,035][model8_pretrain.py][INFO] Epoch:[0/2](362900/4588595) loss:3.391 lr:0.0000100 epoch_Time:26773.0min: [2024-01-04 07:59:05,035][model8_pretrain.py][INFO] Epoch:[0/2](362900/4588595) loss:2.665 lr:0.0000100 epoch_Time:26773.0min: [2024-01-04 07:59:05,035][model8_pretrain.py][INFO] Epoch:[0/2](362900/4588595) loss:3.339 lr:0.0000100 epoch_Time:26773.0min: [2024-01-04 07:59:05,035][model8_pretrain.py][INFO] Epoch:[0/2](362900/4588595) loss:2.561 lr:0.0000100 epoch_Time:26773.0min: [2024-01-04 
07:59:05,036][model8_pretrain.py][INFO] Epoch:[0/2](362900/4588595) loss:3.119 lr:0.0000100 epoch_Time:26773.0min: [2024-01-04 07:59:52,348][model8_pretrain.py][INFO] Epoch:[0/2](363000/4588595) loss:2.544 lr:0.0000100 epoch_Time:26774.0min: [2024-01-04 07:59:52,348][model8_pretrain.py][INFO] Epoch:[0/2](363000/4588595) loss:2.655 lr:0.0000100 epoch_Time:26774.0min: [2024-01-04 07:59:52,348][model8_pretrain.py][INFO] Epoch:[0/2](363000/4588595) loss:2.906 lr:0.0000100 epoch_Time:26774.0min: [2024-01-04 07:59:52,348][model8_pretrain.py][INFO] Epoch:[0/2](363000/4588595) loss:2.981 lr:0.0000100 epoch_Time:26774.0min: [2024-01-04 07:59:52,348][model8_pretrain.py][INFO] Epoch:[0/2](363000/4588595) loss:2.682 lr:0.0000100 epoch_Time:26774.0min: [2024-01-04 07:59:52,348][model8_pretrain.py][INFO] Epoch:[0/2](363000/4588595) loss:3.235 lr:0.0000100 epoch_Time:26774.0min: [2024-01-04 07:59:52,348][model8_pretrain.py][INFO] Epoch:[0/2](363000/4588595) loss:2.817 lr:0.0000100 epoch_Time:26774.0min: [2024-01-04 07:59:52,348][model8_pretrain.py][INFO] Epoch:[0/2](363000/4588595) loss:2.791 lr:0.0000100 epoch_Time:26774.0min: [2024-01-04 08:00:29,326][model8_pretrain.py][INFO] Epoch:[0/2](363100/4588595) loss:2.883 lr:0.0000100 epoch_Time:26774.0min: [2024-01-04 08:00:29,326][model8_pretrain.py][INFO] Epoch:[0/2](363100/4588595) loss:2.893 lr:0.0000100 epoch_Time:26774.0min: [2024-01-04 08:00:29,326][model8_pretrain.py][INFO] Epoch:[0/2](363100/4588595) loss:2.533 lr:0.0000100 epoch_Time:26774.0min: [2024-01-04 08:00:29,326][model8_pretrain.py][INFO] Epoch:[0/2](363100/4588595) loss:2.019 lr:0.0000100 epoch_Time:26774.0min: [2024-01-04 08:00:29,326][model8_pretrain.py][INFO] Epoch:[0/2](363100/4588595) loss:2.899 lr:0.0000100 epoch_Time:26774.0min: [2024-01-04 08:00:29,326][model8_pretrain.py][INFO] Epoch:[0/2](363100/4588595) loss:2.886 lr:0.0000100 epoch_Time:26774.0min: [2024-01-04 08:00:29,326][model8_pretrain.py][INFO] Epoch:[0/2](363100/4588595) loss:2.934 lr:0.0000100 
epoch_Time:26774.0min: [2024-01-04 08:00:29,326][model8_pretrain.py][INFO] Epoch:[0/2](363100/4588595) loss:2.180 lr:0.0000100 epoch_Time:26774.0min: [2024-01-04 08:01:06,335][model8_pretrain.py][INFO] Epoch:[0/2](363200/4588595) loss:2.297 lr:0.0000100 epoch_Time:26773.0min: [2024-01-04 08:01:06,335][model8_pretrain.py][INFO] Epoch:[0/2](363200/4588595) loss:3.352 lr:0.0000100 epoch_Time:26773.0min: [2024-01-04 08:01:06,335][model8_pretrain.py][INFO] Epoch:[0/2](363200/4588595) loss:2.679 lr:0.0000100 epoch_Time:26773.0min: [2024-01-04 08:01:06,335][model8_pretrain.py][INFO] Epoch:[0/2](363200/4588595) loss:2.416 lr:0.0000100 epoch_Time:26773.0min: [2024-01-04 08:01:06,335][model8_pretrain.py][INFO] Epoch:[0/2](363200/4588595) loss:2.769 lr:0.0000100 epoch_Time:26773.0min: [2024-01-04 08:01:06,335][model8_pretrain.py][INFO] Epoch:[0/2](363200/4588595) loss:2.227 lr:0.0000100 epoch_Time:26773.0min: [2024-01-04 08:01:06,335][model8_pretrain.py][INFO] Epoch:[0/2](363200/4588595) loss:3.044 lr:0.0000100 epoch_Time:26773.0min: [2024-01-04 08:01:06,335][model8_pretrain.py][INFO] Epoch:[0/2](363200/4588595) loss:3.162 lr:0.0000100 epoch_Time:26773.0min: [2024-01-04 08:01:43,313][model8_pretrain.py][INFO] Epoch:[0/2](363300/4588595) loss:3.251 lr:0.0000100 epoch_Time:26773.0min: [2024-01-04 08:01:43,313][model8_pretrain.py][INFO] Epoch:[0/2](363300/4588595) loss:2.897 lr:0.0000100 epoch_Time:26773.0min: [2024-01-04 08:01:43,313][model8_pretrain.py][INFO] Epoch:[0/2](363300/4588595) loss:3.019 lr:0.0000100 epoch_Time:26773.0min: [2024-01-04 08:01:43,313][model8_pretrain.py][INFO] Epoch:[0/2](363300/4588595) loss:2.826 lr:0.0000100 epoch_Time:26773.0min: [2024-01-04 08:01:43,313][model8_pretrain.py][INFO] Epoch:[0/2](363300/4588595) loss:2.812 lr:0.0000100 epoch_Time:26773.0min: [2024-01-04 08:01:43,313][model8_pretrain.py][INFO] Epoch:[0/2](363300/4588595) loss:2.921 lr:0.0000100 epoch_Time:26773.0min: [2024-01-04 08:01:43,314][model8_pretrain.py][INFO] 
Epoch:[0/2](363300/4588595) loss:3.254 lr:0.0000100 epoch_Time:26773.0min: [2024-01-04 08:01:43,314][model8_pretrain.py][INFO] Epoch:[0/2](363300/4588595) loss:3.143 lr:0.0000100 epoch_Time:26773.0min: [2024-01-04 08:02:20,256][model8_pretrain.py][INFO] Epoch:[0/2](363400/4588595) loss:3.148 lr:0.0000100 epoch_Time:26771.0min: [2024-01-04 08:02:20,256][model8_pretrain.py][INFO] Epoch:[0/2](363400/4588595) loss:2.498 lr:0.0000100 epoch_Time:26771.0min: [2024-01-04 08:02:20,256][model8_pretrain.py][INFO] Epoch:[0/2](363400/4588595) loss:2.791 lr:0.0000100 epoch_Time:26771.0min: [2024-01-04 08:02:20,256][model8_pretrain.py][INFO] Epoch:[0/2](363400/4588595) loss:3.241 lr:0.0000100 epoch_Time:26771.0min: [2024-01-04 08:02:20,256][model8_pretrain.py][INFO] Epoch:[0/2](363400/4588595) loss:2.564 lr:0.0000100 epoch_Time:26771.0min: [2024-01-04 08:02:20,256][model8_pretrain.py][INFO] Epoch:[0/2](363400/4588595) loss:3.074 lr:0.0000100 epoch_Time:26771.0min: [2024-01-04 08:02:20,256][model8_pretrain.py][INFO] Epoch:[0/2](363400/4588595) loss:2.700 lr:0.0000100 epoch_Time:26771.0min: [2024-01-04 08:02:20,256][model8_pretrain.py][INFO] Epoch:[0/2](363400/4588595) loss:3.314 lr:0.0000100 epoch_Time:26771.0min: [2024-01-04 08:02:57,186][model8_pretrain.py][INFO] Epoch:[0/2](363500/4588595) loss:2.820 lr:0.0000100 epoch_Time:26770.0min: [2024-01-04 08:02:57,186][model8_pretrain.py][INFO] Epoch:[0/2](363500/4588595) loss:2.411 lr:0.0000100 epoch_Time:26770.0min: [2024-01-04 08:02:57,186][model8_pretrain.py][INFO] Epoch:[0/2](363500/4588595) loss:2.672 lr:0.0000100 epoch_Time:26770.0min: [2024-01-04 08:02:57,186][model8_pretrain.py][INFO] Epoch:[0/2](363500/4588595) loss:3.200 lr:0.0000100 epoch_Time:26770.0min: [2024-01-04 08:02:57,186][model8_pretrain.py][INFO] Epoch:[0/2](363500/4588595) loss:2.821 lr:0.0000100 epoch_Time:26770.0min: [2024-01-04 08:02:57,186][model8_pretrain.py][INFO] Epoch:[0/2](363500/4588595) loss:3.023 lr:0.0000100 epoch_Time:26770.0min: [2024-01-04 
08:02:57,186][model8_pretrain.py][INFO] Epoch:[0/2](363500/4588595) loss:3.329 lr:0.0000100 epoch_Time:26770.0min: [2024-01-04 08:02:57,186][model8_pretrain.py][INFO] Epoch:[0/2](363500/4588595) loss:2.748 lr:0.0000100 epoch_Time:26770.0min: [2024-01-04 08:03:34,137][model8_pretrain.py][INFO] Epoch:[0/2](363600/4588595) loss:2.945 lr:0.0000100 epoch_Time:26770.0min: [2024-01-04 08:03:34,137][model8_pretrain.py][INFO] Epoch:[0/2](363600/4588595) loss:2.653 lr:0.0000100 epoch_Time:26770.0min: [2024-01-04 08:03:34,138][model8_pretrain.py][INFO] Epoch:[0/2](363600/4588595) loss:2.950 lr:0.0000100 epoch_Time:26770.0min: [2024-01-04 08:03:34,138][model8_pretrain.py][INFO] Epoch:[0/2](363600/4588595) loss:2.949 lr:0.0000100 epoch_Time:26770.0min: [2024-01-04 08:03:34,138][model8_pretrain.py][INFO] Epoch:[0/2](363600/4588595) loss:2.610 lr:0.0000100 epoch_Time:26770.0min: [2024-01-04 08:03:34,138][model8_pretrain.py][INFO] Epoch:[0/2](363600/4588595) loss:3.007 lr:0.0000100 epoch_Time:26770.0min: [2024-01-04 08:03:34,138][model8_pretrain.py][INFO] Epoch:[0/2](363600/4588595) loss:2.780 lr:0.0000100 epoch_Time:26770.0min: [2024-01-04 08:03:34,138][model8_pretrain.py][INFO] Epoch:[0/2](363600/4588595) loss:3.199 lr:0.0000100 epoch_Time:26770.0min: [2024-01-04 08:04:11,086][model8_pretrain.py][INFO] Epoch:[0/2](363700/4588595) loss:3.026 lr:0.0000100 epoch_Time:26769.0min: [2024-01-04 08:04:11,086][model8_pretrain.py][INFO] Epoch:[0/2](363700/4588595) loss:3.061 lr:0.0000100 epoch_Time:26769.0min: [2024-01-04 08:04:11,086][model8_pretrain.py][INFO] Epoch:[0/2](363700/4588595) loss:2.958 lr:0.0000100 epoch_Time:26769.0min: [2024-01-04 08:04:11,086][model8_pretrain.py][INFO] Epoch:[0/2](363700/4588595) loss:2.997 lr:0.0000100 epoch_Time:26769.0min: [2024-01-04 08:04:11,086][model8_pretrain.py][INFO] Epoch:[0/2](363700/4588595) loss:3.057 lr:0.0000100 epoch_Time:26769.0min: [2024-01-04 08:04:11,086][model8_pretrain.py][INFO] Epoch:[0/2](363700/4588595) loss:2.518 lr:0.0000100 
epoch_Time:26769.0min: [2024-01-04 08:04:11,086][model8_pretrain.py][INFO] Epoch:[0/2](363700/4588595) loss:3.288 lr:0.0000100 epoch_Time:26769.0min: [2024-01-04 08:04:11,086][model8_pretrain.py][INFO] Epoch:[0/2](363700/4588595) loss:2.800 lr:0.0000100 epoch_Time:26769.0min: [2024-01-04 08:04:58,080][model8_pretrain.py][INFO] Epoch:[0/2](363800/4588595) loss:2.710 lr:0.0000100 epoch_Time:26770.0min: [2024-01-04 08:04:58,080][model8_pretrain.py][INFO] Epoch:[0/2](363800/4588595) loss:2.917 lr:0.0000100 epoch_Time:26770.0min: [2024-01-04 08:04:58,080][model8_pretrain.py][INFO] Epoch:[0/2](363800/4588595) loss:2.917 lr:0.0000100 epoch_Time:26769.0min: [2024-01-04 08:04:58,080][model8_pretrain.py][INFO] Epoch:[0/2](363800/4588595) loss:2.860 lr:0.0000100 epoch_Time:26769.0min: [2024-01-04 08:04:58,080][model8_pretrain.py][INFO] Epoch:[0/2](363800/4588595) loss:3.114 lr:0.0000100 epoch_Time:26769.0min: [2024-01-04 08:04:58,080][model8_pretrain.py][INFO] Epoch:[0/2](363800/4588595) loss:2.976 lr:0.0000100 epoch_Time:26769.0min: [2024-01-04 08:04:58,080][model8_pretrain.py][INFO] Epoch:[0/2](363800/4588595) loss:2.838 lr:0.0000100 epoch_Time:26769.0min: [2024-01-04 08:04:58,080][model8_pretrain.py][INFO] Epoch:[0/2](363800/4588595) loss:3.146 lr:0.0000100 epoch_Time:26769.0min: [2024-01-04 08:05:35,046][model8_pretrain.py][INFO] Epoch:[0/2](363900/4588595) loss:3.267 lr:0.0000100 epoch_Time:26769.0min: [2024-01-04 08:05:35,046][model8_pretrain.py][INFO] Epoch:[0/2](363900/4588595) loss:2.809 lr:0.0000100 epoch_Time:26769.0min: [2024-01-04 08:05:35,046][model8_pretrain.py][INFO] Epoch:[0/2](363900/4588595) loss:2.902 lr:0.0000100 epoch_Time:26769.0min: [2024-01-04 08:05:35,046][model8_pretrain.py][INFO] Epoch:[0/2](363900/4588595) loss:3.232 lr:0.0000100 epoch_Time:26769.0min: [2024-01-04 08:05:35,046][model8_pretrain.py][INFO] Epoch:[0/2](363900/4588595) loss:3.169 lr:0.0000100 epoch_Time:26769.0min: [2024-01-04 08:05:35,046][model8_pretrain.py][INFO] 
Epoch:[0/2](363900/4588595) loss:3.140 lr:0.0000100 epoch_Time:26769.0min: [2024-01-04 08:05:35,046][model8_pretrain.py][INFO] Epoch:[0/2](363900/4588595) loss:3.205 lr:0.0000100 epoch_Time:26769.0min: [2024-01-04 08:05:35,047][model8_pretrain.py][INFO] Epoch:[0/2](363900/4588595) loss:2.863 lr:0.0000100 epoch_Time:26769.0min: [2024-01-04 08:06:11,987][model8_pretrain.py][INFO] Epoch:[0/2](364000/4588595) loss:3.146 lr:0.0000100 epoch_Time:26768.0min: [2024-01-04 08:06:11,987][model8_pretrain.py][INFO] Epoch:[0/2](364000/4588595) loss:2.533 lr:0.0000100 epoch_Time:26768.0min: [2024-01-04 08:06:11,987][model8_pretrain.py][INFO] Epoch:[0/2](364000/4588595) loss:2.701 lr:0.0000100 epoch_Time:26768.0min: [2024-01-04 08:06:11,987][model8_pretrain.py][INFO] Epoch:[0/2](364000/4588595) loss:3.031 lr:0.0000100 epoch_Time:26768.0min: [2024-01-04 08:06:11,987][model8_pretrain.py][INFO] Epoch:[0/2](364000/4588595) loss:2.521 lr:0.0000100 epoch_Time:26768.0min: [2024-01-04 08:06:11,987][model8_pretrain.py][INFO] Epoch:[0/2](364000/4588595) loss:2.776 lr:0.0000100 epoch_Time:26768.0min: [2024-01-04 08:06:11,987][model8_pretrain.py][INFO] Epoch:[0/2](364000/4588595) loss:3.209 lr:0.0000100 epoch_Time:26768.0min: [2024-01-04 08:06:11,987][model8_pretrain.py][INFO] Epoch:[0/2](364000/4588595) loss:3.127 lr:0.0000100 epoch_Time:26768.0min: [2024-01-04 08:06:48,924][model8_pretrain.py][INFO] Epoch:[0/2](364100/4588595) loss:2.772 lr:0.0000100 epoch_Time:26767.0min: [2024-01-04 08:06:48,924][model8_pretrain.py][INFO] Epoch:[0/2](364100/4588595) loss:2.650 lr:0.0000100 epoch_Time:26767.0min: [2024-01-04 08:06:48,924][model8_pretrain.py][INFO] Epoch:[0/2](364100/4588595) loss:3.062 lr:0.0000100 epoch_Time:26767.0min: [2024-01-04 08:06:48,924][model8_pretrain.py][INFO] Epoch:[0/2](364100/4588595) loss:2.919 lr:0.0000100 epoch_Time:26767.0min: [2024-01-04 08:06:48,924][model8_pretrain.py][INFO] Epoch:[0/2](364100/4588595) loss:2.385 lr:0.0000100 epoch_Time:26767.0min: [2024-01-04 
08:06:48,924][model8_pretrain.py][INFO] Epoch:[0/2](364100/4588595) loss:3.303 lr:0.0000100 epoch_Time:26767.0min: [2024-01-04 08:06:48,924][model8_pretrain.py][INFO] Epoch:[0/2](364100/4588595) loss:2.687 lr:0.0000100 epoch_Time:26767.0min: [2024-01-04 08:06:48,925][model8_pretrain.py][INFO] Epoch:[0/2](364100/4588595) loss:2.974 lr:0.0000100 epoch_Time:26767.0min: [2024-01-04 08:07:25,859][model8_pretrain.py][INFO] Epoch:[0/2](364200/4588595) loss:2.798 lr:0.0000100 epoch_Time:26767.0min: [2024-01-04 08:07:25,859][model8_pretrain.py][INFO] Epoch:[0/2](364200/4588595) loss:2.763 lr:0.0000100 epoch_Time:26767.0min: [2024-01-04 08:07:25,859][model8_pretrain.py][INFO] Epoch:[0/2](364200/4588595) loss:2.779 lr:0.0000100 epoch_Time:26767.0min: [2024-01-04 08:07:25,859][model8_pretrain.py][INFO] Epoch:[0/2](364200/4588595) loss:3.330 lr:0.0000100 epoch_Time:26767.0min: [2024-01-04 08:07:25,859][model8_pretrain.py][INFO] Epoch:[0/2](364200/4588595) loss:2.575 lr:0.0000100 epoch_Time:26767.0min: [2024-01-04 08:07:25,859][model8_pretrain.py][INFO] Epoch:[0/2](364200/4588595) loss:2.375 lr:0.0000100 epoch_Time:26767.0min: [2024-01-04 08:07:25,859][model8_pretrain.py][INFO] Epoch:[0/2](364200/4588595) loss:2.771 lr:0.0000100 epoch_Time:26767.0min: [2024-01-04 08:07:25,859][model8_pretrain.py][INFO] Epoch:[0/2](364200/4588595) loss:2.786 lr:0.0000100 epoch_Time:26767.0min: [2024-01-04 08:08:02,816][model8_pretrain.py][INFO] Epoch:[0/2](364300/4588595) loss:3.092 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:08:02,816][model8_pretrain.py][INFO] Epoch:[0/2](364300/4588595) loss:3.222 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:08:02,816][model8_pretrain.py][INFO] Epoch:[0/2](364300/4588595) loss:3.099 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:08:02,816][model8_pretrain.py][INFO] Epoch:[0/2](364300/4588595) loss:2.886 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:08:02,816][model8_pretrain.py][INFO] Epoch:[0/2](364300/4588595) loss:2.793 lr:0.0000100 
epoch_Time:26765.0min: [2024-01-04 08:08:02,816][model8_pretrain.py][INFO] Epoch:[0/2](364300/4588595) loss:2.878 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:08:02,816][model8_pretrain.py][INFO] Epoch:[0/2](364300/4588595) loss:2.308 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:08:02,816][model8_pretrain.py][INFO] Epoch:[0/2](364300/4588595) loss:2.419 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:08:39,749][model8_pretrain.py][INFO] Epoch:[0/2](364400/4588595) loss:2.858 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:08:39,749][model8_pretrain.py][INFO] Epoch:[0/2](364400/4588595) loss:2.623 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:08:39,749][model8_pretrain.py][INFO] Epoch:[0/2](364400/4588595) loss:2.649 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:08:39,749][model8_pretrain.py][INFO] Epoch:[0/2](364400/4588595) loss:3.075 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:08:39,749][model8_pretrain.py][INFO] Epoch:[0/2](364400/4588595) loss:2.418 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:08:39,749][model8_pretrain.py][INFO] Epoch:[0/2](364400/4588595) loss:2.496 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:08:39,749][model8_pretrain.py][INFO] Epoch:[0/2](364400/4588595) loss:2.816 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:08:39,749][model8_pretrain.py][INFO] Epoch:[0/2](364400/4588595) loss:3.060 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:09:16,688][model8_pretrain.py][INFO] Epoch:[0/2](364500/4588595) loss:2.778 lr:0.0000100 epoch_Time:26764.0min: [2024-01-04 08:09:16,688][model8_pretrain.py][INFO] Epoch:[0/2](364500/4588595) loss:3.050 lr:0.0000100 epoch_Time:26764.0min: [2024-01-04 08:09:16,688][model8_pretrain.py][INFO] Epoch:[0/2](364500/4588595) loss:3.062 lr:0.0000100 epoch_Time:26764.0min: [2024-01-04 08:09:16,688][model8_pretrain.py][INFO] Epoch:[0/2](364500/4588595) loss:3.131 lr:0.0000100 epoch_Time:26764.0min: [2024-01-04 08:09:16,689][model8_pretrain.py][INFO] 
Epoch:[0/2](364500/4588595) loss:2.756 lr:0.0000100 epoch_Time:26764.0min: [2024-01-04 08:09:16,689][model8_pretrain.py][INFO] Epoch:[0/2](364500/4588595) loss:2.567 lr:0.0000100 epoch_Time:26764.0min: [2024-01-04 08:09:16,689][model8_pretrain.py][INFO] Epoch:[0/2](364500/4588595) loss:3.311 lr:0.0000100 epoch_Time:26764.0min: [2024-01-04 08:09:16,689][model8_pretrain.py][INFO] Epoch:[0/2](364500/4588595) loss:2.708 lr:0.0000100 epoch_Time:26764.0min: [2024-01-04 08:10:03,896][model8_pretrain.py][INFO] Epoch:[0/2](364600/4588595) loss:3.189 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:10:03,896][model8_pretrain.py][INFO] Epoch:[0/2](364600/4588595) loss:2.635 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:10:03,896][model8_pretrain.py][INFO] Epoch:[0/2](364600/4588595) loss:2.880 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:10:03,896][model8_pretrain.py][INFO] Epoch:[0/2](364600/4588595) loss:3.231 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:10:03,896][model8_pretrain.py][INFO] Epoch:[0/2](364600/4588595) loss:2.938 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:10:03,896][model8_pretrain.py][INFO] Epoch:[0/2](364600/4588595) loss:3.286 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:10:03,896][model8_pretrain.py][INFO] Epoch:[0/2](364600/4588595) loss:2.694 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:10:03,896][model8_pretrain.py][INFO] Epoch:[0/2](364600/4588595) loss:3.159 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:10:40,822][model8_pretrain.py][INFO] Epoch:[0/2](364700/4588595) loss:2.511 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:10:40,822][model8_pretrain.py][INFO] Epoch:[0/2](364700/4588595) loss:3.082 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:10:40,822][model8_pretrain.py][INFO] Epoch:[0/2](364700/4588595) loss:2.690 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:10:40,822][model8_pretrain.py][INFO] Epoch:[0/2](364700/4588595) loss:2.802 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 
08:10:40,822][model8_pretrain.py][INFO] Epoch:[0/2](364700/4588595) loss:3.433 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:10:40,822][model8_pretrain.py][INFO] Epoch:[0/2](364700/4588595) loss:3.164 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:10:40,822][model8_pretrain.py][INFO] Epoch:[0/2](364700/4588595) loss:2.921 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:10:40,822][model8_pretrain.py][INFO] Epoch:[0/2](364700/4588595) loss:2.713 lr:0.0000100 epoch_Time:26765.0min: [2024-01-04 08:11:17,756][model8_pretrain.py][INFO] Epoch:[0/2](364800/4588595) loss:2.710 lr:0.0000100 epoch_Time:26763.0min: [2024-01-04 08:11:17,756][model8_pretrain.py][INFO] Epoch:[0/2](364800/4588595) loss:2.893 lr:0.0000100 epoch_Time:26763.0min: [2024-01-04 08:11:17,756][model8_pretrain.py][INFO] Epoch:[0/2](364800/4588595) loss:2.584 lr:0.0000100 epoch_Time:26763.0min: [2024-01-04 08:11:17,756][model8_pretrain.py][INFO] Epoch:[0/2](364800/4588595) loss:3.028 lr:0.0000100 epoch_Time:26763.0min: [2024-01-04 08:11:17,756][model8_pretrain.py][INFO] Epoch:[0/2](364800/4588595) loss:3.403 lr:0.0000100 epoch_Time:26763.0min: [2024-01-04 08:11:17,756][model8_pretrain.py][INFO] Epoch:[0/2](364800/4588595) loss:2.447 lr:0.0000100 epoch_Time:26763.0min: [2024-01-04 08:11:17,756][model8_pretrain.py][INFO] Epoch:[0/2](364800/4588595) loss:3.147 lr:0.0000100 epoch_Time:26763.0min: [2024-01-04 08:11:17,756][model8_pretrain.py][INFO] Epoch:[0/2](364800/4588595) loss:2.834 lr:0.0000100 epoch_Time:26763.0min: [2024-01-04 08:11:54,693][model8_pretrain.py][INFO] Epoch:[0/2](364900/4588595) loss:2.694 lr:0.0000100 epoch_Time:26762.0min: [2024-01-04 08:11:54,693][model8_pretrain.py][INFO] Epoch:[0/2](364900/4588595) loss:2.977 lr:0.0000100 epoch_Time:26762.0min: [2024-01-04 08:11:54,693][model8_pretrain.py][INFO] Epoch:[0/2](364900/4588595) loss:2.570 lr:0.0000100 epoch_Time:26762.0min: [2024-01-04 08:11:54,693][model8_pretrain.py][INFO] Epoch:[0/2](364900/4588595) loss:2.824 lr:0.0000100 
epoch_Time:26762.0min: [2024-01-04 08:11:54,693][model8_pretrain.py][INFO] Epoch:[0/2](364900/4588595) loss:2.765 lr:0.0000100 epoch_Time:26762.0min: [2024-01-04 08:11:54,693][model8_pretrain.py][INFO] Epoch:[0/2](364900/4588595) loss:2.999 lr:0.0000100 epoch_Time:26762.0min: [2024-01-04 08:11:54,694][model8_pretrain.py][INFO] Epoch:[0/2](364900/4588595) loss:3.016 lr:0.0000100 epoch_Time:26762.0min: [2024-01-04 08:11:54,694][model8_pretrain.py][INFO] Epoch:[0/2](364900/4588595) loss:2.302 lr:0.0000100 epoch_Time:26762.0min: [2024-01-04 08:12:31,644][model8_pretrain.py][INFO] Epoch:[0/2](365000/4588595) loss:3.276 lr:0.0000100 epoch_Time:26762.0min: [2024-01-04 08:12:31,644][model8_pretrain.py][INFO] Epoch:[0/2](365000/4588595) loss:3.003 lr:0.0000100 epoch_Time:26762.0min: [2024-01-04 08:12:31,644][model8_pretrain.py][INFO] Epoch:[0/2](365000/4588595) loss:2.908 lr:0.0000100 epoch_Time:26762.0min: [2024-01-04 08:12:31,644][model8_pretrain.py][INFO] Epoch:[0/2](365000/4588595) loss:2.790 lr:0.0000100 epoch_Time:26762.0min: [2024-01-04 08:12:31,644][model8_pretrain.py][INFO] Epoch:[0/2](365000/4588595) loss:2.780 lr:0.0000100 epoch_Time:26762.0min: [2024-01-04 08:12:31,644][model8_pretrain.py][INFO] Epoch:[0/2](365000/4588595) loss:2.543 lr:0.0000100 epoch_Time:26762.0min: [2024-01-04 08:12:31,644][model8_pretrain.py][INFO] Epoch:[0/2](365000/4588595) loss:2.934 lr:0.0000100 epoch_Time:26762.0min: [2024-01-04 08:12:31,644][model8_pretrain.py][INFO] Epoch:[0/2](365000/4588595) loss:3.327 lr:0.0000100 epoch_Time:26762.0min: [2024-01-04 08:13:08,581][model8_pretrain.py][INFO] Epoch:[0/2](365100/4588595) loss:3.082 lr:0.0000100 epoch_Time:26761.0min: [2024-01-04 08:13:08,581][model8_pretrain.py][INFO] Epoch:[0/2](365100/4588595) loss:2.721 lr:0.0000100 epoch_Time:26761.0min: [2024-01-04 08:13:08,581][model8_pretrain.py][INFO] Epoch:[0/2](365100/4588595) loss:2.871 lr:0.0000100 epoch_Time:26761.0min: [2024-01-04 08:13:08,581][model8_pretrain.py][INFO] 
Epoch:[0/2](365100/4588595) loss:2.877 lr:0.0000100 epoch_Time:26761.0min: [2024-01-04 08:13:08,581][model8_pretrain.py][INFO] Epoch:[0/2](365100/4588595) loss:3.450 lr:0.0000100 epoch_Time:26761.0min: [2024-01-04 08:13:08,581][model8_pretrain.py][INFO] Epoch:[0/2](365100/4588595) loss:2.578 lr:0.0000100 epoch_Time:26761.0min: [2024-01-04 08:13:08,581][model8_pretrain.py][INFO] Epoch:[0/2](365100/4588595) loss:2.960 lr:0.0000100 epoch_Time:26761.0min: [2024-01-04 08:13:08,581][model8_pretrain.py][INFO] Epoch:[0/2](365100/4588595) loss:2.529 lr:0.0000100 epoch_Time:26761.0min: [2024-01-04 08:13:45,518][model8_pretrain.py][INFO] Epoch:[0/2](365200/4588595) loss:3.151 lr:0.0000100 epoch_Time:26760.0min: [2024-01-04 08:13:45,518][model8_pretrain.py][INFO] Epoch:[0/2](365200/4588595) loss:2.743 lr:0.0000100 epoch_Time:26760.0min: [2024-01-04 08:13:45,518][model8_pretrain.py][INFO] Epoch:[0/2](365200/4588595) loss:2.989 lr:0.0000100 epoch_Time:26760.0min: [2024-01-04 08:13:45,518][model8_pretrain.py][INFO] Epoch:[0/2](365200/4588595) loss:2.521 lr:0.0000100 epoch_Time:26760.0min: [2024-01-04 08:13:45,518][model8_pretrain.py][INFO] Epoch:[0/2](365200/4588595) loss:2.610 lr:0.0000100 epoch_Time:26760.0min: [2024-01-04 08:13:45,518][model8_pretrain.py][INFO] Epoch:[0/2](365200/4588595) loss:2.748 lr:0.0000100 epoch_Time:26760.0min: [2024-01-04 08:13:45,518][model8_pretrain.py][INFO] Epoch:[0/2](365200/4588595) loss:2.237 lr:0.0000100 epoch_Time:26760.0min: [2024-01-04 08:13:45,518][model8_pretrain.py][INFO] Epoch:[0/2](365200/4588595) loss:2.684 lr:0.0000100 epoch_Time:26760.0min: [2024-01-04 08:14:22,466][model8_pretrain.py][INFO] Epoch:[0/2](365300/4588595) loss:2.773 lr:0.0000100 epoch_Time:26759.0min: [2024-01-04 08:14:22,466][model8_pretrain.py][INFO] Epoch:[0/2](365300/4588595) loss:2.675 lr:0.0000100 epoch_Time:26759.0min: [2024-01-04 08:14:22,466][model8_pretrain.py][INFO] Epoch:[0/2](365300/4588595) loss:3.369 lr:0.0000100 epoch_Time:26759.0min: [2024-01-04 
08:14:22,466][model8_pretrain.py][INFO] Epoch:[0/2](365300/4588595) loss:2.821 lr:0.0000100 epoch_Time:26759.0min: [2024-01-04 08:14:22,466][model8_pretrain.py][INFO] Epoch:[0/2](365300/4588595) loss:2.346 lr:0.0000100 epoch_Time:26759.0min: [2024-01-04 08:14:22,466][model8_pretrain.py][INFO] Epoch:[0/2](365300/4588595) loss:2.158 lr:0.0000100 epoch_Time:26759.0min: [2024-01-04 08:14:22,466][model8_pretrain.py][INFO] Epoch:[0/2](365300/4588595) loss:3.083 lr:0.0000100 epoch_Time:26759.0min: [2024-01-04 08:14:22,466][model8_pretrain.py][INFO] Epoch:[0/2](365300/4588595) loss:2.497 lr:0.0000100 epoch_Time:26759.0min: [2024-01-04 08:15:09,427][model8_pretrain.py][INFO] Epoch:[0/2](365400/4588595) loss:3.027 lr:0.0000100 epoch_Time:26760.0min: [2024-01-04 08:15:09,427][model8_pretrain.py][INFO] Epoch:[0/2](365400/4588595) loss:3.189 lr:0.0000100 epoch_Time:26760.0min: [2024-01-04 08:15:09,427][model8_pretrain.py][INFO] Epoch:[0/2](365400/4588595) loss:2.681 lr:0.0000100 epoch_Time:26760.0min: [2024-01-04 08:15:09,427][model8_pretrain.py][INFO] Epoch:[0/2](365400/4588595) loss:2.975 lr:0.0000100 epoch_Time:26760.0min: [2024-01-04 08:15:09,427][model8_pretrain.py][INFO] Epoch:[0/2](365400/4588595) loss:2.990 lr:0.0000100 epoch_Time:26760.0min: [2024-01-04 08:15:09,427][model8_pretrain.py][INFO] Epoch:[0/2](365400/4588595) loss:3.071 lr:0.0000100 epoch_Time:26760.0min: [2024-01-04 08:15:09,427][model8_pretrain.py][INFO] Epoch:[0/2](365400/4588595) loss:3.177 lr:0.0000100 epoch_Time:26760.0min: [2024-01-04 08:15:09,427][model8_pretrain.py][INFO] Epoch:[0/2](365400/4588595) loss:2.962 lr:0.0000100 epoch_Time:26760.0min: [2024-01-04 08:15:46,343][model8_pretrain.py][INFO] Epoch:[0/2](365500/4588595) loss:2.842 lr:0.0000100 epoch_Time:26760.0min: [2024-01-04 08:15:46,343][model8_pretrain.py][INFO] Epoch:[0/2](365500/4588595) loss:3.014 lr:0.0000100 epoch_Time:26760.0min: [2024-01-04 08:15:46,343][model8_pretrain.py][INFO] Epoch:[0/2](365500/4588595) loss:2.847 lr:0.0000100 
epoch_Time:26760.0min: [2024-01-04 08:15:46,343][model8_pretrain.py][INFO] Epoch:[0/2](365500/4588595) loss:2.863 lr:0.0000100 epoch_Time:26760.0min: [2024-01-04 08:15:46,343][model8_pretrain.py][INFO] Epoch:[0/2](365500/4588595) loss:3.163 lr:0.0000100 epoch_Time:26760.0min: [2024-01-04 08:15:46,343][model8_pretrain.py][INFO] Epoch:[0/2](365500/4588595) loss:3.324 lr:0.0000100 epoch_Time:26760.0min: [2024-01-04 08:15:46,343][model8_pretrain.py][INFO] Epoch:[0/2](365500/4588595) loss:2.897 lr:0.0000100 epoch_Time:26760.0min: [2024-01-04 08:15:46,343][model8_pretrain.py][INFO] Epoch:[0/2](365500/4588595) loss:3.223 lr:0.0000100 epoch_Time:26760.0min: [2024-01-04 08:16:23,281][model8_pretrain.py][INFO] Epoch:[0/2](365600/4588595) loss:2.835 lr:0.0000100 epoch_Time:26759.0min: [2024-01-04 08:16:23,281][model8_pretrain.py][INFO] Epoch:[0/2](365600/4588595) loss:3.222 lr:0.0000100 epoch_Time:26759.0min: [2024-01-04 08:16:23,281][model8_pretrain.py][INFO] Epoch:[0/2](365600/4588595) loss:2.820 lr:0.0000100 epoch_Time:26759.0min: [2024-01-04 08:16:23,281][model8_pretrain.py][INFO] Epoch:[0/2](365600/4588595) loss:3.457 lr:0.0000100 epoch_Time:26759.0min: [2024-01-04 08:16:23,281][model8_pretrain.py][INFO] Epoch:[0/2](365600/4588595) loss:3.234 lr:0.0000100 epoch_Time:26759.0min: [2024-01-04 08:16:23,281][model8_pretrain.py][INFO] Epoch:[0/2](365600/4588595) loss:3.513 lr:0.0000100 epoch_Time:26759.0min: [2024-01-04 08:16:23,281][model8_pretrain.py][INFO] Epoch:[0/2](365600/4588595) loss:3.283 lr:0.0000100 epoch_Time:26759.0min: [2024-01-04 08:16:23,281][model8_pretrain.py][INFO] Epoch:[0/2](365600/4588595) loss:3.278 lr:0.0000100 epoch_Time:26759.0min: [2024-01-04 08:17:00,226][model8_pretrain.py][INFO] Epoch:[0/2](365700/4588595) loss:3.276 lr:0.0000100 epoch_Time:26757.0min: [2024-01-04 08:17:00,226][model8_pretrain.py][INFO] Epoch:[0/2](365700/4588595) loss:3.169 lr:0.0000100 epoch_Time:26757.0min: [2024-01-04 08:17:00,226][model8_pretrain.py][INFO] 
Epoch:[0/2](365700/4588595) loss:3.020 lr:0.0000100 epoch_Time:26757.0min: [2024-01-04 08:17:00,226][model8_pretrain.py][INFO] Epoch:[0/2](365700/4588595) loss:2.969 lr:0.0000100 epoch_Time:26757.0min: [2024-01-04 08:17:00,226][model8_pretrain.py][INFO] Epoch:[0/2](365700/4588595) loss:2.718 lr:0.0000100 epoch_Time:26757.0min: [2024-01-04 08:17:00,226][model8_pretrain.py][INFO] Epoch:[0/2](365700/4588595) loss:2.874 lr:0.0000100 epoch_Time:26757.0min: [2024-01-04 08:17:00,226][model8_pretrain.py][INFO] Epoch:[0/2](365700/4588595) loss:2.960 lr:0.0000100 epoch_Time:26757.0min: [2024-01-04 08:17:00,227][model8_pretrain.py][INFO] Epoch:[0/2](365700/4588595) loss:2.972 lr:0.0000100 epoch_Time:26757.0min: [2024-01-04 08:17:37,164][model8_pretrain.py][INFO] Epoch:[0/2](365800/4588595) loss:3.275 lr:0.0000100 epoch_Time:26757.0min: [2024-01-04 08:17:37,164][model8_pretrain.py][INFO] Epoch:[0/2](365800/4588595) loss:2.954 lr:0.0000100 epoch_Time:26757.0min: [2024-01-04 08:17:37,164][model8_pretrain.py][INFO] Epoch:[0/2](365800/4588595) loss:3.034 lr:0.0000100 epoch_Time:26757.0min: [2024-01-04 08:17:37,164][model8_pretrain.py][INFO] Epoch:[0/2](365800/4588595) loss:3.001 lr:0.0000100 epoch_Time:26757.0min: [2024-01-04 08:17:37,164][model8_pretrain.py][INFO] Epoch:[0/2](365800/4588595) loss:2.774 lr:0.0000100 epoch_Time:26757.0min: [2024-01-04 08:17:37,164][model8_pretrain.py][INFO] Epoch:[0/2](365800/4588595) loss:2.858 lr:0.0000100 epoch_Time:26757.0min: [2024-01-04 08:17:37,164][model8_pretrain.py][INFO] Epoch:[0/2](365800/4588595) loss:3.344 lr:0.0000100 epoch_Time:26757.0min: [2024-01-04 08:17:37,164][model8_pretrain.py][INFO] Epoch:[0/2](365800/4588595) loss:2.434 lr:0.0000100 epoch_Time:26757.0min: [2024-01-04 08:18:14,110][model8_pretrain.py][INFO] Epoch:[0/2](365900/4588595) loss:2.350 lr:0.0000100 epoch_Time:26756.0min: [2024-01-04 08:18:14,110][model8_pretrain.py][INFO] Epoch:[0/2](365900/4588595) loss:2.795 lr:0.0000100 epoch_Time:26756.0min: [2024-01-04 
08:18:14,110][model8_pretrain.py][INFO] Epoch:[0/2](365900/4588595) loss:2.748 lr:0.0000100 epoch_Time:26756.0min: [2024-01-04 08:18:14,110][model8_pretrain.py][INFO] Epoch:[0/2](365900/4588595) loss:2.965 lr:0.0000100 epoch_Time:26756.0min: [2024-01-04 08:18:14,110][model8_pretrain.py][INFO] Epoch:[0/2](365900/4588595) loss:2.872 lr:0.0000100 epoch_Time:26756.0min: [2024-01-04 08:18:14,110][model8_pretrain.py][INFO] Epoch:[0/2](365900/4588595) loss:3.216 lr:0.0000100 epoch_Time:26756.0min: [2024-01-04 08:18:14,110][model8_pretrain.py][INFO] Epoch:[0/2](365900/4588595) loss:2.719 lr:0.0000100 epoch_Time:26756.0min: [2024-01-04 08:18:14,111][model8_pretrain.py][INFO] Epoch:[0/2](365900/4588595) loss:2.760 lr:0.0000100 epoch_Time:26756.0min: [2024-01-04 08:18:51,057][model8_pretrain.py][INFO] Epoch:[0/2](366000/4588595) loss:2.794 lr:0.0000100 epoch_Time:26755.0min: [2024-01-04 08:18:51,057][model8_pretrain.py][INFO] Epoch:[0/2](366000/4588595) loss:3.259 lr:0.0000100 epoch_Time:26755.0min: [2024-01-04 08:18:51,057][model8_pretrain.py][INFO] Epoch:[0/2](366000/4588595) loss:2.746 lr:0.0000100 epoch_Time:26755.0min: [2024-01-04 08:18:51,057][model8_pretrain.py][INFO] Epoch:[0/2](366000/4588595) loss:3.168 lr:0.0000100 epoch_Time:26755.0min: [2024-01-04 08:18:51,057][model8_pretrain.py][INFO] Epoch:[0/2](366000/4588595) loss:3.013 lr:0.0000100 epoch_Time:26755.0min: [2024-01-04 08:18:51,057][model8_pretrain.py][INFO] Epoch:[0/2](366000/4588595) loss:2.946 lr:0.0000100 epoch_Time:26755.0min: [2024-01-04 08:18:51,057][model8_pretrain.py][INFO] Epoch:[0/2](366000/4588595) loss:3.364 lr:0.0000100 epoch_Time:26755.0min: [2024-01-04 08:18:51,057][model8_pretrain.py][INFO] Epoch:[0/2](366000/4588595) loss:2.674 lr:0.0000100 epoch_Time:26755.0min: [2024-01-04 08:19:28,010][model8_pretrain.py][INFO] Epoch:[0/2](366100/4588595) loss:2.426 lr:0.0000100 epoch_Time:26755.0min: [2024-01-04 08:19:28,010][model8_pretrain.py][INFO] Epoch:[0/2](366100/4588595) loss:3.159 lr:0.0000100 
epoch_Time:26755.0min: [2024-01-04 08:19:28,010][model8_pretrain.py][INFO] Epoch:[0/2](366100/4588595) loss:3.100 lr:0.0000100 epoch_Time:26755.0min: [2024-01-04 08:19:28,010][model8_pretrain.py][INFO] Epoch:[0/2](366100/4588595) loss:3.125 lr:0.0000100 epoch_Time:26755.0min: [2024-01-04 08:19:28,010][model8_pretrain.py][INFO] Epoch:[0/2](366100/4588595) loss:2.311 lr:0.0000100 epoch_Time:26755.0min: [2024-01-04 08:19:28,010][model8_pretrain.py][INFO] Epoch:[0/2](366100/4588595) loss:3.027 lr:0.0000100 epoch_Time:26755.0min: [2024-01-04 08:19:28,010][model8_pretrain.py][INFO] Epoch:[0/2](366100/4588595) loss:2.762 lr:0.0000100 epoch_Time:26755.0min: [2024-01-04 08:19:28,010][model8_pretrain.py][INFO] Epoch:[0/2](366100/4588595) loss:2.817 lr:0.0000100 epoch_Time:26755.0min: [2024-01-04 08:20:15,209][model8_pretrain.py][INFO] Epoch:[0/2](366200/4588595) loss:3.275 lr:0.0000100 epoch_Time:26755.0min: [2024-01-04 08:20:15,209][model8_pretrain.py][INFO] Epoch:[0/2](366200/4588595) loss:2.926 lr:0.0000100 epoch_Time:26755.0min: [2024-01-04 08:20:15,209][model8_pretrain.py][INFO] Epoch:[0/2](366200/4588595) loss:2.978 lr:0.0000100 epoch_Time:26755.0min: [2024-01-04 08:20:15,209][model8_pretrain.py][INFO] Epoch:[0/2](366200/4588595) loss:2.293 lr:0.0000100 epoch_Time:26755.0min: [2024-01-04 08:20:15,209][model8_pretrain.py][INFO] Epoch:[0/2](366200/4588595) loss:3.020 lr:0.0000100 epoch_Time:26755.0min: [2024-01-04 08:20:15,209][model8_pretrain.py][INFO] Epoch:[0/2](366200/4588595) loss:3.058 lr:0.0000100 epoch_Time:26755.0min: [2024-01-04 08:20:15,209][model8_pretrain.py][INFO] Epoch:[0/2](366200/4588595) loss:3.102 lr:0.0000100 epoch_Time:26755.0min: [2024-01-04 08:20:15,209][model8_pretrain.py][INFO] Epoch:[0/2](366200/4588595) loss:2.462 lr:0.0000100 epoch_Time:26755.0min: [2024-01-04 08:20:52,136][model8_pretrain.py][INFO] Epoch:[0/2](366300/4588595) loss:2.751 lr:0.0000100 epoch_Time:26754.0min: [2024-01-04 08:20:52,136][model8_pretrain.py][INFO] 
Epoch:[0/2](366300/4588595) loss:3.349 lr:0.0000100 epoch_Time:26754.0min: [2024-01-04 08:20:52,136][model8_pretrain.py][INFO] Epoch:[0/2](366300/4588595) loss:3.367 lr:0.0000100 epoch_Time:26754.0min: [2024-01-04 08:20:52,136][model8_pretrain.py][INFO] Epoch:[0/2](366300/4588595) loss:2.446 lr:0.0000100 epoch_Time:26754.0min: [2024-01-04 08:20:52,136][model8_pretrain.py][INFO] Epoch:[0/2](366300/4588595) loss:3.262 lr:0.0000100 epoch_Time:26754.0min: [2024-01-04 08:20:52,136][model8_pretrain.py][INFO] Epoch:[0/2](366300/4588595) loss:2.222 lr:0.0000100 epoch_Time:26754.0min: [2024-01-04 08:20:52,136][model8_pretrain.py][INFO] Epoch:[0/2](366300/4588595) loss:2.391 lr:0.0000100 epoch_Time:26754.0min: [2024-01-04 08:20:52,137][model8_pretrain.py][INFO] Epoch:[0/2](366300/4588595) loss:2.937 lr:0.0000100 epoch_Time:26754.0min: [2024-01-04 08:21:29,059][model8_pretrain.py][INFO] Epoch:[0/2](366400/4588595) loss:3.043 lr:0.0000100 epoch_Time:26754.0min: [2024-01-04 08:21:29,059][model8_pretrain.py][INFO] Epoch:[0/2](366400/4588595) loss:2.772 lr:0.0000100 epoch_Time:26754.0min: [2024-01-04 08:21:29,059][model8_pretrain.py][INFO] Epoch:[0/2](366400/4588595) loss:3.202 lr:0.0000100 epoch_Time:26754.0min: [2024-01-04 08:21:29,059][model8_pretrain.py][INFO] Epoch:[0/2](366400/4588595) loss:2.160 lr:0.0000100 epoch_Time:26754.0min: [2024-01-04 08:21:29,059][model8_pretrain.py][INFO] Epoch:[0/2](366400/4588595) loss:2.773 lr:0.0000100 epoch_Time:26754.0min: [2024-01-04 08:21:29,059][model8_pretrain.py][INFO] Epoch:[0/2](366400/4588595) loss:2.494 lr:0.0000100 epoch_Time:26754.0min: [2024-01-04 08:21:29,059][model8_pretrain.py][INFO] Epoch:[0/2](366400/4588595) loss:3.341 lr:0.0000100 epoch_Time:26754.0min: [2024-01-04 08:21:29,059][model8_pretrain.py][INFO] Epoch:[0/2](366400/4588595) loss:3.353 lr:0.0000100 epoch_Time:26754.0min: [2024-01-04 08:22:05,985][model8_pretrain.py][INFO] Epoch:[0/2](366500/4588595) loss:3.023 lr:0.0000100 epoch_Time:26753.0min: [2024-01-04 
08:22:05,985][model8_pretrain.py][INFO] Epoch:[0/2](366500/4588595) loss:3.414 lr:0.0000100 epoch_Time:26753.0min: [2024-01-04 08:22:05,985][model8_pretrain.py][INFO] Epoch:[0/2](366500/4588595) loss:2.287 lr:0.0000100 epoch_Time:26753.0min: [2024-01-04 08:22:05,985][model8_pretrain.py][INFO] Epoch:[0/2](366500/4588595) loss:2.573 lr:0.0000100 epoch_Time:26753.0min: [2024-01-04 08:22:05,985][model8_pretrain.py][INFO] Epoch:[0/2](366500/4588595) loss:3.465 lr:0.0000100 epoch_Time:26753.0min: [2024-01-04 08:22:05,985][model8_pretrain.py][INFO] Epoch:[0/2](366500/4588595) loss:3.157 lr:0.0000100 epoch_Time:26753.0min: [2024-01-04 08:22:05,985][model8_pretrain.py][INFO] Epoch:[0/2](366500/4588595) loss:2.526 lr:0.0000100 epoch_Time:26753.0min: [2024-01-04 08:22:05,985][model8_pretrain.py][INFO] Epoch:[0/2](366500/4588595) loss:2.779 lr:0.0000100 epoch_Time:26753.0min: [2024-01-04 08:22:42,914][model8_pretrain.py][INFO] Epoch:[0/2](366600/4588595) loss:2.914 lr:0.0000100 epoch_Time:26753.0min: [2024-01-04 08:22:42,914][model8_pretrain.py][INFO] Epoch:[0/2](366600/4588595) loss:2.618 lr:0.0000100 epoch_Time:26753.0min: [2024-01-04 08:22:42,914][model8_pretrain.py][INFO] Epoch:[0/2](366600/4588595) loss:2.689 lr:0.0000100 epoch_Time:26753.0min: [2024-01-04 08:22:42,914][model8_pretrain.py][INFO] Epoch:[0/2](366600/4588595) loss:3.005 lr:0.0000100 epoch_Time:26753.0min: [2024-01-04 08:22:42,914][model8_pretrain.py][INFO] Epoch:[0/2](366600/4588595) loss:2.855 lr:0.0000100 epoch_Time:26753.0min: [2024-01-04 08:22:42,914][model8_pretrain.py][INFO] Epoch:[0/2](366600/4588595) loss:3.012 lr:0.0000100 epoch_Time:26753.0min: [2024-01-04 08:22:42,914][model8_pretrain.py][INFO] Epoch:[0/2](366600/4588595) loss:2.773 lr:0.0000100 epoch_Time:26753.0min: [2024-01-04 08:22:42,914][model8_pretrain.py][INFO] Epoch:[0/2](366600/4588595) loss:3.098 lr:0.0000100 epoch_Time:26753.0min: [2024-01-04 08:23:19,844][model8_pretrain.py][INFO] Epoch:[0/2](366700/4588595) loss:3.055 lr:0.0000100 
epoch_Time:26751.0min: [2024-01-04 08:23:19,845][model8_pretrain.py][INFO] Epoch:[0/2](366700/4588595) loss:3.281 lr:0.0000100 epoch_Time:26751.0min: [2024-01-04 08:23:19,845][model8_pretrain.py][INFO] Epoch:[0/2](366700/4588595) loss:3.076 lr:0.0000100 epoch_Time:26751.0min: [2024-01-04 08:23:19,845][model8_pretrain.py][INFO] Epoch:[0/2](366700/4588595) loss:2.919 lr:0.0000100 epoch_Time:26751.0min: [2024-01-04 08:23:19,845][model8_pretrain.py][INFO] Epoch:[0/2](366700/4588595) loss:2.924 lr:0.0000100 epoch_Time:26751.0min: [2024-01-04 08:23:19,845][model8_pretrain.py][INFO] Epoch:[0/2](366700/4588595) loss:2.649 lr:0.0000100 epoch_Time:26751.0min: [2024-01-04 08:23:19,845][model8_pretrain.py][INFO] Epoch:[0/2](366700/4588595) loss:2.577 lr:0.0000100 epoch_Time:26751.0min: [2024-01-04 08:23:19,845][model8_pretrain.py][INFO] Epoch:[0/2](366700/4588595) loss:2.984 lr:0.0000100 epoch_Time:26751.0min: [2024-01-04 08:23:56,789][model8_pretrain.py][INFO] Epoch:[0/2](366800/4588595) loss:2.803 lr:0.0000100 epoch_Time:26750.0min: [2024-01-04 08:23:56,789][model8_pretrain.py][INFO] Epoch:[0/2](366800/4588595) loss:3.473 lr:0.0000100 epoch_Time:26750.0min: [2024-01-04 08:23:56,789][model8_pretrain.py][INFO] Epoch:[0/2](366800/4588595) loss:3.177 lr:0.0000100 epoch_Time:26750.0min: [2024-01-04 08:23:56,789][model8_pretrain.py][INFO] Epoch:[0/2](366800/4588595) loss:3.319 lr:0.0000100 epoch_Time:26750.0min: [2024-01-04 08:23:56,789][model8_pretrain.py][INFO] Epoch:[0/2](366800/4588595) loss:3.260 lr:0.0000100 epoch_Time:26750.0min: [2024-01-04 08:23:56,789][model8_pretrain.py][INFO] Epoch:[0/2](366800/4588595) loss:2.993 lr:0.0000100 epoch_Time:26750.0min: [2024-01-04 08:23:56,789][model8_pretrain.py][INFO] Epoch:[0/2](366800/4588595) loss:3.344 lr:0.0000100 epoch_Time:26750.0min: [2024-01-04 08:23:56,790][model8_pretrain.py][INFO] Epoch:[0/2](366800/4588595) loss:2.547 lr:0.0000100 epoch_Time:26750.0min: [2024-01-04 08:24:33,722][model8_pretrain.py][INFO] 
Epoch:[0/2](366900/4588595) loss:2.528 lr:0.0000100 epoch_Time:26750.0min: [2024-01-04 08:24:33,722][model8_pretrain.py][INFO] Epoch:[0/2](366900/4588595) loss:3.644 lr:0.0000100 epoch_Time:26750.0min: [2024-01-04 08:24:33,722][model8_pretrain.py][INFO] Epoch:[0/2](366900/4588595) loss:2.336 lr:0.0000100 epoch_Time:26750.0min: [2024-01-04 08:24:33,722][model8_pretrain.py][INFO] Epoch:[0/2](366900/4588595) loss:2.837 lr:0.0000100 epoch_Time:26750.0min: [2024-01-04 08:24:33,722][model8_pretrain.py][INFO] Epoch:[0/2](366900/4588595) loss:2.784 lr:0.0000100 epoch_Time:26750.0min: [2024-01-04 08:24:33,722][model8_pretrain.py][INFO] Epoch:[0/2](366900/4588595) loss:2.616 lr:0.0000100 epoch_Time:26750.0min: [2024-01-04 08:24:33,722][model8_pretrain.py][INFO] Epoch:[0/2](366900/4588595) loss:2.928 lr:0.0000100 epoch_Time:26750.0min: [2024-01-04 08:24:33,722][model8_pretrain.py][INFO] Epoch:[0/2](366900/4588595) loss:2.932 lr:0.0000100 epoch_Time:26750.0min: [2024-01-04 08:25:20,844][model8_pretrain.py][INFO] Epoch:[0/2](367000/4588595) loss:2.971 lr:0.0000100 epoch_Time:26751.0min: [2024-01-04 08:25:20,844][model8_pretrain.py][INFO] Epoch:[0/2](367000/4588595) loss:3.031 lr:0.0000100 epoch_Time:26751.0min: [2024-01-04 08:25:20,844][model8_pretrain.py][INFO] Epoch:[0/2](367000/4588595) loss:2.884 lr:0.0000100 epoch_Time:26751.0min: [2024-01-04 08:25:20,844][model8_pretrain.py][INFO] Epoch:[0/2](367000/4588595) loss:3.110 lr:0.0000100 epoch_Time:26751.0min: [2024-01-04 08:25:20,844][model8_pretrain.py][INFO] Epoch:[0/2](367000/4588595) loss:2.163 lr:0.0000100 epoch_Time:26751.0min: [2024-01-04 08:25:20,844][model8_pretrain.py][INFO] Epoch:[0/2](367000/4588595) loss:2.845 lr:0.0000100 epoch_Time:26751.0min: [2024-01-04 08:25:20,844][model8_pretrain.py][INFO] Epoch:[0/2](367000/4588595) loss:3.209 lr:0.0000100 epoch_Time:26751.0min: [2024-01-04 08:25:20,844][model8_pretrain.py][INFO] Epoch:[0/2](367000/4588595) loss:2.514 lr:0.0000100 epoch_Time:26751.0min: [2024-01-04 
08:25:57,767][model8_pretrain.py][INFO] Epoch:[0/2](367100/4588595) loss:3.071 lr:0.0000100 epoch_Time:26750.0min: [2024-01-04 08:25:57,767][model8_pretrain.py][INFO] Epoch:[0/2](367100/4588595) loss:3.460 lr:0.0000100 epoch_Time:26750.0min: [2024-01-04 08:25:57,767][model8_pretrain.py][INFO] Epoch:[0/2](367100/4588595) loss:2.277 lr:0.0000100 epoch_Time:26750.0min: [2024-01-04 08:25:57,767][model8_pretrain.py][INFO] Epoch:[0/2](367100/4588595) loss:3.107 lr:0.0000100 epoch_Time:26750.0min: [2024-01-04 08:25:57,767][model8_pretrain.py][INFO] Epoch:[0/2](367100/4588595) loss:2.868 lr:0.0000100 epoch_Time:26750.0min: [2024-01-04 08:25:57,767][model8_pretrain.py][INFO] Epoch:[0/2](367100/4588595) loss:3.099 lr:0.0000100 epoch_Time:26750.0min: [2024-01-04 08:25:57,767][model8_pretrain.py][INFO] Epoch:[0/2](367100/4588595) loss:3.325 lr:0.0000100 epoch_Time:26750.0min: [2024-01-04 08:25:57,768][model8_pretrain.py][INFO] Epoch:[0/2](367100/4588595) loss:2.908 lr:0.0000100 epoch_Time:26750.0min: [2024-01-04 08:26:34,712][model8_pretrain.py][INFO] Epoch:[0/2](367200/4588595) loss:2.614 lr:0.0000100 epoch_Time:26749.0min: [2024-01-04 08:26:34,712][model8_pretrain.py][INFO] Epoch:[0/2](367200/4588595) loss:3.166 lr:0.0000100 epoch_Time:26749.0min: [2024-01-04 08:26:34,712][model8_pretrain.py][INFO] Epoch:[0/2](367200/4588595) loss:2.852 lr:0.0000100 epoch_Time:26749.0min: [2024-01-04 08:26:34,712][model8_pretrain.py][INFO] Epoch:[0/2](367200/4588595) loss:3.216 lr:0.0000100 epoch_Time:26749.0min: [2024-01-04 08:26:34,712][model8_pretrain.py][INFO] Epoch:[0/2](367200/4588595) loss:2.717 lr:0.0000100 epoch_Time:26749.0min: [2024-01-04 08:26:34,712][model8_pretrain.py][INFO] Epoch:[0/2](367200/4588595) loss:2.735 lr:0.0000100 epoch_Time:26749.0min: [2024-01-04 08:26:34,712][model8_pretrain.py][INFO] Epoch:[0/2](367200/4588595) loss:3.118 lr:0.0000100 epoch_Time:26749.0min: [2024-01-04 08:26:34,712][model8_pretrain.py][INFO] Epoch:[0/2](367200/4588595) loss:2.730 lr:0.0000100 
epoch_Time:26749.0min: [2024-01-04 08:27:11,655][model8_pretrain.py][INFO] Epoch:[0/2](367300/4588595) loss:2.899 lr:0.0000100 epoch_Time:26748.0min: [2024-01-04 08:27:11,655][model8_pretrain.py][INFO] Epoch:[0/2](367300/4588595) loss:2.511 lr:0.0000100 epoch_Time:26748.0min: [2024-01-04 08:27:11,655][model8_pretrain.py][INFO] Epoch:[0/2](367300/4588595) loss:2.826 lr:0.0000100 epoch_Time:26748.0min: [2024-01-04 08:27:11,655][model8_pretrain.py][INFO] Epoch:[0/2](367300/4588595) loss:3.125 lr:0.0000100 epoch_Time:26748.0min: [2024-01-04 08:27:11,655][model8_pretrain.py][INFO] Epoch:[0/2](367300/4588595) loss:2.723 lr:0.0000100 epoch_Time:26748.0min: [2024-01-04 08:27:11,655][model8_pretrain.py][INFO] Epoch:[0/2](367300/4588595) loss:2.337 lr:0.0000100 epoch_Time:26748.0min: [2024-01-04 08:27:11,655][model8_pretrain.py][INFO] Epoch:[0/2](367300/4588595) loss:2.961 lr:0.0000100 epoch_Time:26748.0min: [2024-01-04 08:27:11,656][model8_pretrain.py][INFO] Epoch:[0/2](367300/4588595) loss:3.107 lr:0.0000100 epoch_Time:26748.0min: [2024-01-04 08:27:48,597][model8_pretrain.py][INFO] Epoch:[0/2](367400/4588595) loss:3.470 lr:0.0000100 epoch_Time:26747.0min: [2024-01-04 08:27:48,597][model8_pretrain.py][INFO] Epoch:[0/2](367400/4588595) loss:2.866 lr:0.0000100 epoch_Time:26747.0min: [2024-01-04 08:27:48,597][model8_pretrain.py][INFO] Epoch:[0/2](367400/4588595) loss:2.936 lr:0.0000100 epoch_Time:26747.0min: [2024-01-04 08:27:48,597][model8_pretrain.py][INFO] Epoch:[0/2](367400/4588595) loss:3.296 lr:0.0000100 epoch_Time:26747.0min: [2024-01-04 08:27:48,597][model8_pretrain.py][INFO] Epoch:[0/2](367400/4588595) loss:2.610 lr:0.0000100 epoch_Time:26747.0min: [2024-01-04 08:27:48,597][model8_pretrain.py][INFO] Epoch:[0/2](367400/4588595) loss:3.256 lr:0.0000100 epoch_Time:26747.0min: [2024-01-04 08:27:48,597][model8_pretrain.py][INFO] Epoch:[0/2](367400/4588595) loss:3.190 lr:0.0000100 epoch_Time:26747.0min: [2024-01-04 08:27:48,598][model8_pretrain.py][INFO] 
Epoch:[0/2](367400/4588595) loss:3.122 lr:0.0000100 epoch_Time:26747.0min: [2024-01-04 08:28:25,514][model8_pretrain.py][INFO] Epoch:[0/2](367500/4588595) loss:3.013 lr:0.0000100 epoch_Time:26747.0min: [2024-01-04 08:28:25,514][model8_pretrain.py][INFO] Epoch:[0/2](367500/4588595) loss:2.952 lr:0.0000100 epoch_Time:26747.0min: [2024-01-04 08:28:25,514][model8_pretrain.py][INFO] Epoch:[0/2](367500/4588595) loss:3.132 lr:0.0000100 epoch_Time:26747.0min: [2024-01-04 08:28:25,514][model8_pretrain.py][INFO] Epoch:[0/2](367500/4588595) loss:3.533 lr:0.0000100 epoch_Time:26747.0min: [2024-01-04 08:28:25,514][model8_pretrain.py][INFO] Epoch:[0/2](367500/4588595) loss:2.558 lr:0.0000100 epoch_Time:26747.0min: [2024-01-04 08:28:25,514][model8_pretrain.py][INFO] Epoch:[0/2](367500/4588595) loss:2.782 lr:0.0000100 epoch_Time:26747.0min: [2024-01-04 08:28:25,514][model8_pretrain.py][INFO] Epoch:[0/2](367500/4588595) loss:3.591 lr:0.0000100 epoch_Time:26747.0min: [2024-01-04 08:28:25,514][model8_pretrain.py][INFO] Epoch:[0/2](367500/4588595) loss:2.425 lr:0.0000100 epoch_Time:26747.0min: [2024-01-04 08:29:02,460][model8_pretrain.py][INFO] Epoch:[0/2](367600/4588595) loss:2.489 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:29:02,460][model8_pretrain.py][INFO] Epoch:[0/2](367600/4588595) loss:3.200 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:29:02,460][model8_pretrain.py][INFO] Epoch:[0/2](367600/4588595) loss:3.151 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:29:02,460][model8_pretrain.py][INFO] Epoch:[0/2](367600/4588595) loss:2.763 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:29:02,460][model8_pretrain.py][INFO] Epoch:[0/2](367600/4588595) loss:2.721 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:29:02,460][model8_pretrain.py][INFO] Epoch:[0/2](367600/4588595) loss:3.333 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:29:02,460][model8_pretrain.py][INFO] Epoch:[0/2](367600/4588595) loss:3.215 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 
08:29:02,461][model8_pretrain.py][INFO] Epoch:[0/2](367600/4588595) loss:2.461 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:29:39,422][model8_pretrain.py][INFO] Epoch:[0/2](367700/4588595) loss:3.013 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:29:39,423][model8_pretrain.py][INFO] Epoch:[0/2](367700/4588595) loss:3.275 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:29:39,423][model8_pretrain.py][INFO] Epoch:[0/2](367700/4588595) loss:3.180 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:29:39,423][model8_pretrain.py][INFO] Epoch:[0/2](367700/4588595) loss:2.928 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:29:39,423][model8_pretrain.py][INFO] Epoch:[0/2](367700/4588595) loss:2.361 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:29:39,423][model8_pretrain.py][INFO] Epoch:[0/2](367700/4588595) loss:2.930 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:29:39,423][model8_pretrain.py][INFO] Epoch:[0/2](367700/4588595) loss:3.335 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:29:39,423][model8_pretrain.py][INFO] Epoch:[0/2](367700/4588595) loss:2.646 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:30:25,791][model8_pretrain.py][INFO] Epoch:[0/2](367800/4588595) loss:2.746 lr:0.0000100 epoch_Time:26746.0min: [2024-01-04 08:30:25,791][model8_pretrain.py][INFO] Epoch:[0/2](367800/4588595) loss:2.761 lr:0.0000100 epoch_Time:26746.0min: [2024-01-04 08:30:25,791][model8_pretrain.py][INFO] Epoch:[0/2](367800/4588595) loss:2.676 lr:0.0000100 epoch_Time:26746.0min: [2024-01-04 08:30:25,791][model8_pretrain.py][INFO] Epoch:[0/2](367800/4588595) loss:3.398 lr:0.0000100 epoch_Time:26746.0min: [2024-01-04 08:30:25,792][model8_pretrain.py][INFO] Epoch:[0/2](367800/4588595) loss:2.533 lr:0.0000100 epoch_Time:26746.0min: [2024-01-04 08:30:25,792][model8_pretrain.py][INFO] Epoch:[0/2](367800/4588595) loss:2.847 lr:0.0000100 epoch_Time:26746.0min: [2024-01-04 08:30:25,792][model8_pretrain.py][INFO] Epoch:[0/2](367800/4588595) loss:2.532 lr:0.0000100 
epoch_Time:26746.0min: [2024-01-04 08:30:25,792][model8_pretrain.py][INFO] Epoch:[0/2](367800/4588595) loss:2.628 lr:0.0000100 epoch_Time:26746.0min: [2024-01-04 08:31:04,305][model8_pretrain.py][INFO] Epoch:[0/2](367900/4588595) loss:2.528 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:31:04,305][model8_pretrain.py][INFO] Epoch:[0/2](367900/4588595) loss:2.364 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:31:04,305][model8_pretrain.py][INFO] Epoch:[0/2](367900/4588595) loss:3.346 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:31:04,305][model8_pretrain.py][INFO] Epoch:[0/2](367900/4588595) loss:2.879 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:31:04,305][model8_pretrain.py][INFO] Epoch:[0/2](367900/4588595) loss:2.651 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:31:04,305][model8_pretrain.py][INFO] Epoch:[0/2](367900/4588595) loss:3.096 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:31:04,305][model8_pretrain.py][INFO] Epoch:[0/2](367900/4588595) loss:3.001 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:31:04,305][model8_pretrain.py][INFO] Epoch:[0/2](367900/4588595) loss:2.366 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:31:41,236][model8_pretrain.py][INFO] Epoch:[0/2](368000/4588595) loss:3.522 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:31:41,236][model8_pretrain.py][INFO] Epoch:[0/2](368000/4588595) loss:2.713 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:31:41,236][model8_pretrain.py][INFO] Epoch:[0/2](368000/4588595) loss:2.631 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:31:41,236][model8_pretrain.py][INFO] Epoch:[0/2](368000/4588595) loss:3.134 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:31:41,236][model8_pretrain.py][INFO] Epoch:[0/2](368000/4588595) loss:2.934 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:31:41,236][model8_pretrain.py][INFO] Epoch:[0/2](368000/4588595) loss:3.221 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:31:41,236][model8_pretrain.py][INFO] 
Epoch:[0/2](368000/4588595) loss:2.523 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:31:41,236][model8_pretrain.py][INFO] Epoch:[0/2](368000/4588595) loss:3.566 lr:0.0000100 epoch_Time:26745.0min: [2024-01-04 08:32:18,167][model8_pretrain.py][INFO] Epoch:[0/2](368100/4588595) loss:2.955 lr:0.0000100 epoch_Time:26744.0min: [2024-01-04 08:32:18,167][model8_pretrain.py][INFO] Epoch:[0/2](368100/4588595) loss:2.813 lr:0.0000100 epoch_Time:26744.0min: [2024-01-04 08:32:18,167][model8_pretrain.py][INFO] Epoch:[0/2](368100/4588595) loss:3.020 lr:0.0000100 epoch_Time:26744.0min: [2024-01-04 08:32:18,167][model8_pretrain.py][INFO] Epoch:[0/2](368100/4588595) loss:2.674 lr:0.0000100 epoch_Time:26744.0min: [2024-01-04 08:32:18,167][model8_pretrain.py][INFO] Epoch:[0/2](368100/4588595) loss:2.999 lr:0.0000100 epoch_Time:26744.0min: [2024-01-04 08:32:18,167][model8_pretrain.py][INFO] Epoch:[0/2](368100/4588595) loss:1.983 lr:0.0000100 epoch_Time:26744.0min: [2024-01-04 08:32:18,167][model8_pretrain.py][INFO] Epoch:[0/2](368100/4588595) loss:2.963 lr:0.0000100 epoch_Time:26744.0min: [2024-01-04 08:32:18,168][model8_pretrain.py][INFO] Epoch:[0/2](368100/4588595) loss:2.834 lr:0.0000100 epoch_Time:26744.0min: [2024-01-04 08:32:55,115][model8_pretrain.py][INFO] Epoch:[0/2](368200/4588595) loss:2.905 lr:0.0000100 epoch_Time:26742.0min: [2024-01-04 08:32:55,115][model8_pretrain.py][INFO] Epoch:[0/2](368200/4588595) loss:2.911 lr:0.0000100 epoch_Time:26742.0min: [2024-01-04 08:32:55,115][model8_pretrain.py][INFO] Epoch:[0/2](368200/4588595) loss:2.514 lr:0.0000100 epoch_Time:26742.0min: [2024-01-04 08:32:55,115][model8_pretrain.py][INFO] Epoch:[0/2](368200/4588595) loss:3.082 lr:0.0000100 epoch_Time:26742.0min: [2024-01-04 08:32:55,115][model8_pretrain.py][INFO] Epoch:[0/2](368200/4588595) loss:2.911 lr:0.0000100 epoch_Time:26742.0min: [2024-01-04 08:32:55,115][model8_pretrain.py][INFO] Epoch:[0/2](368200/4588595) loss:2.956 lr:0.0000100 epoch_Time:26742.0min: [2024-01-04 
08:32:55,115][model8_pretrain.py][INFO] Epoch:[0/2](368200/4588595) loss:2.621 lr:0.0000100 epoch_Time:26742.0min: [2024-01-04 08:32:55,115][model8_pretrain.py][INFO] Epoch:[0/2](368200/4588595) loss:2.921 lr:0.0000100 epoch_Time:26742.0min: [2024-01-04 08:33:32,063][model8_pretrain.py][INFO] Epoch:[0/2](368300/4588595) loss:2.998 lr:0.0000100 epoch_Time:26742.0min: [2024-01-04 08:33:32,063][model8_pretrain.py][INFO] Epoch:[0/2](368300/4588595) loss:3.363 lr:0.0000100 epoch_Time:26742.0min: [2024-01-04 08:33:32,063][model8_pretrain.py][INFO] Epoch:[0/2](368300/4588595) loss:3.170 lr:0.0000100 epoch_Time:26742.0min: [2024-01-04 08:33:32,063][model8_pretrain.py][INFO] Epoch:[0/2](368300/4588595) loss:2.911 lr:0.0000100 epoch_Time:26742.0min: [2024-01-04 08:33:32,063][model8_pretrain.py][INFO] Epoch:[0/2](368300/4588595) loss:2.336 lr:0.0000100 epoch_Time:26742.0min: [2024-01-04 08:33:32,063][model8_pretrain.py][INFO] Epoch:[0/2](368300/4588595) loss:2.879 lr:0.0000100 epoch_Time:26742.0min: [2024-01-04 08:33:32,064][model8_pretrain.py][INFO] Epoch:[0/2](368300/4588595) loss:3.205 lr:0.0000100 epoch_Time:26742.0min: [2024-01-04 08:33:32,063][model8_pretrain.py][INFO] Epoch:[0/2](368300/4588595) loss:3.311 lr:0.0000100 epoch_Time:26742.0min: [2024-01-04 08:34:09,011][model8_pretrain.py][INFO] Epoch:[0/2](368400/4588595) loss:2.872 lr:0.0000100 epoch_Time:26741.0min: [2024-01-04 08:34:09,011][model8_pretrain.py][INFO] Epoch:[0/2](368400/4588595) loss:2.714 lr:0.0000100 epoch_Time:26741.0min: [2024-01-04 08:34:09,011][model8_pretrain.py][INFO] Epoch:[0/2](368400/4588595) loss:2.576 lr:0.0000100 epoch_Time:26741.0min: [2024-01-04 08:34:09,011][model8_pretrain.py][INFO] Epoch:[0/2](368400/4588595) loss:2.878 lr:0.0000100 epoch_Time:26741.0min: [2024-01-04 08:34:09,012][model8_pretrain.py][INFO] Epoch:[0/2](368400/4588595) loss:3.235 lr:0.0000100 epoch_Time:26741.0min: [2024-01-04 08:34:09,011][model8_pretrain.py][INFO] Epoch:[0/2](368400/4588595) loss:2.636 lr:0.0000100 
epoch_Time:26741.0min: [2024-01-04 08:34:09,012][model8_pretrain.py][INFO] Epoch:[0/2](368400/4588595) loss:2.615 lr:0.0000100 epoch_Time:26741.0min: [2024-01-04 08:34:09,012][model8_pretrain.py][INFO] Epoch:[0/2](368400/4588595) loss:3.089 lr:0.0000100 epoch_Time:26741.0min: [2024-01-04 08:34:45,987][model8_pretrain.py][INFO] Epoch:[0/2](368500/4588595) loss:3.455 lr:0.0000100 epoch_Time:26741.0min: [2024-01-04 08:34:45,987][model8_pretrain.py][INFO] Epoch:[0/2](368500/4588595) loss:2.660 lr:0.0000100 epoch_Time:26741.0min: [2024-01-04 08:34:45,987][model8_pretrain.py][INFO] Epoch:[0/2](368500/4588595) loss:2.568 lr:0.0000100 epoch_Time:26741.0min: [2024-01-04 08:34:45,987][model8_pretrain.py][INFO] Epoch:[0/2](368500/4588595) loss:3.103 lr:0.0000100 epoch_Time:26741.0min: [2024-01-04 08:34:45,987][model8_pretrain.py][INFO] Epoch:[0/2](368500/4588595) loss:2.574 lr:0.0000100 epoch_Time:26741.0min: [2024-01-04 08:34:45,987][model8_pretrain.py][INFO] Epoch:[0/2](368500/4588595) loss:2.986 lr:0.0000100 epoch_Time:26741.0min: [2024-01-04 08:34:45,987][model8_pretrain.py][INFO] Epoch:[0/2](368500/4588595) loss:2.360 lr:0.0000100 epoch_Time:26741.0min: [2024-01-04 08:34:45,987][model8_pretrain.py][INFO] Epoch:[0/2](368500/4588595) loss:2.757 lr:0.0000100 epoch_Time:26741.0min: [2024-01-04 08:35:31,625][model8_pretrain.py][INFO] Epoch:[0/2](368600/4588595) loss:2.888 lr:0.0000100 epoch_Time:26741.0min: [2024-01-04 08:35:31,625][model8_pretrain.py][INFO] Epoch:[0/2](368600/4588595) loss:3.046 lr:0.0000100 epoch_Time:26741.0min: [2024-01-04 08:35:31,625][model8_pretrain.py][INFO] Epoch:[0/2](368600/4588595) loss:2.234 lr:0.0000100 epoch_Time:26741.0min: [2024-01-04 08:35:31,625][model8_pretrain.py][INFO] Epoch:[0/2](368600/4588595) loss:2.272 lr:0.0000100 epoch_Time:26741.0min: [2024-01-04 08:35:31,625][model8_pretrain.py][INFO] Epoch:[0/2](368600/4588595) loss:2.524 lr:0.0000100 epoch_Time:26741.0min: [2024-01-04 08:35:31,625][model8_pretrain.py][INFO] 
Epoch:[0/2](368600/4588595) loss:2.478 lr:0.0000100 epoch_Time:26741.0min: [2024-01-04 08:35:31,629][model8_pretrain.py][INFO] Epoch:[0/2](368600/4588595) loss:3.038 lr:0.0000100 epoch_Time:26741.0min: [2024-01-04 08:35:31,630][model8_pretrain.py][INFO] Epoch:[0/2](368600/4588595) loss:3.075 lr:0.0000100 epoch_Time:26741.0min: [2024-01-04 08:36:10,009][model8_pretrain.py][INFO] Epoch:[0/2](368700/4588595) loss:3.118 lr:0.0000100 epoch_Time:26740.0min: [2024-01-04 08:36:10,009][model8_pretrain.py][INFO] Epoch:[0/2](368700/4588595) loss:2.592 lr:0.0000100 epoch_Time:26740.0min: [2024-01-04 08:36:10,009][model8_pretrain.py][INFO] Epoch:[0/2](368700/4588595) loss:2.502 lr:0.0000100 epoch_Time:26740.0min: [2024-01-04 08:36:10,009][model8_pretrain.py][INFO] Epoch:[0/2](368700/4588595) loss:2.560 lr:0.0000100 epoch_Time:26740.0min: [2024-01-04 08:36:10,009][model8_pretrain.py][INFO] Epoch:[0/2](368700/4588595) loss:3.094 lr:0.0000100 epoch_Time:26740.0min: [2024-01-04 08:36:10,009][model8_pretrain.py][INFO] Epoch:[0/2](368700/4588595) loss:2.972 lr:0.0000100 epoch_Time:26740.0min: [2024-01-04 08:36:10,009][model8_pretrain.py][INFO] Epoch:[0/2](368700/4588595) loss:2.906 lr:0.0000100 epoch_Time:26740.0min: [2024-01-04 08:36:10,009][model8_pretrain.py][INFO] Epoch:[0/2](368700/4588595) loss:2.453 lr:0.0000100 epoch_Time:26740.0min: [2024-01-04 08:36:46,939][model8_pretrain.py][INFO] Epoch:[0/2](368800/4588595) loss:2.911 lr:0.0000100 epoch_Time:26740.0min: [2024-01-04 08:36:46,939][model8_pretrain.py][INFO] Epoch:[0/2](368800/4588595) loss:2.776 lr:0.0000100 epoch_Time:26740.0min: [2024-01-04 08:36:46,939][model8_pretrain.py][INFO] Epoch:[0/2](368800/4588595) loss:2.949 lr:0.0000100 epoch_Time:26740.0min: [2024-01-04 08:36:46,939][model8_pretrain.py][INFO] Epoch:[0/2](368800/4588595) loss:2.765 lr:0.0000100 epoch_Time:26740.0min: [2024-01-04 08:36:46,939][model8_pretrain.py][INFO] Epoch:[0/2](368800/4588595) loss:3.182 lr:0.0000100 epoch_Time:26740.0min: [2024-01-04 
08:36:46,939][model8_pretrain.py][INFO] Epoch:[0/2](368800/4588595) loss:2.737 lr:0.0000100 epoch_Time:26740.0min: [2024-01-04 08:36:46,940][model8_pretrain.py][INFO] Epoch:[0/2](368800/4588595) loss:2.943 lr:0.0000100 epoch_Time:26740.0min: [2024-01-04 08:36:46,940][model8_pretrain.py][INFO] Epoch:[0/2](368800/4588595) loss:2.561 lr:0.0000100 epoch_Time:26740.0min: [2024-01-04 08:37:23,864][model8_pretrain.py][INFO] Epoch:[0/2](368900/4588595) loss:2.493 lr:0.0000100 epoch_Time:26739.0min: [2024-01-04 08:37:23,864][model8_pretrain.py][INFO] Epoch:[0/2](368900/4588595) loss:3.080 lr:0.0000100 epoch_Time:26739.0min: [2024-01-04 08:37:23,864][model8_pretrain.py][INFO] Epoch:[0/2](368900/4588595) loss:1.678 lr:0.0000100 epoch_Time:26739.0min: [2024-01-04 08:37:23,864][model8_pretrain.py][INFO] Epoch:[0/2](368900/4588595) loss:2.876 lr:0.0000100 epoch_Time:26739.0min: [2024-01-04 08:37:23,864][model8_pretrain.py][INFO] Epoch:[0/2](368900/4588595) loss:2.559 lr:0.0000100 epoch_Time:26739.0min: [2024-01-04 08:37:23,865][model8_pretrain.py][INFO] Epoch:[0/2](368900/4588595) loss:3.130 lr:0.0000100 epoch_Time:26739.0min: [2024-01-04 08:37:23,865][model8_pretrain.py][INFO] Epoch:[0/2](368900/4588595) loss:2.928 lr:0.0000100 epoch_Time:26739.0min: [2024-01-04 08:37:23,865][model8_pretrain.py][INFO] Epoch:[0/2](368900/4588595) loss:3.167 lr:0.0000100 epoch_Time:26739.0min: [2024-01-04 08:38:00,797][model8_pretrain.py][INFO] Epoch:[0/2](369000/4588595) loss:2.808 lr:0.0000100 epoch_Time:26738.0min: [2024-01-04 08:38:00,797][model8_pretrain.py][INFO] Epoch:[0/2](369000/4588595) loss:2.993 lr:0.0000100 epoch_Time:26738.0min: [2024-01-04 08:38:00,797][model8_pretrain.py][INFO] Epoch:[0/2](369000/4588595) loss:2.852 lr:0.0000100 epoch_Time:26738.0min: [2024-01-04 08:38:00,797][model8_pretrain.py][INFO] Epoch:[0/2](369000/4588595) loss:2.645 lr:0.0000100 epoch_Time:26738.0min: [2024-01-04 08:38:00,797][model8_pretrain.py][INFO] Epoch:[0/2](369000/4588595) loss:2.652 lr:0.0000100 
epoch_Time:26738.0min: [2024-01-04 08:38:00,798][model8_pretrain.py][INFO] Epoch:[0/2](369000/4588595) loss:2.948 lr:0.0000100 epoch_Time:26738.0min: [2024-01-04 08:38:00,798][model8_pretrain.py][INFO] Epoch:[0/2](369000/4588595) loss:2.134 lr:0.0000100 epoch_Time:26738.0min: [2024-01-04 08:38:00,798][model8_pretrain.py][INFO] Epoch:[0/2](369000/4588595) loss:3.112 lr:0.0000100 epoch_Time:26738.0min: [2024-01-04 08:38:37,740][model8_pretrain.py][INFO] Epoch:[0/2](369100/4588595) loss:2.873 lr:0.0000100 epoch_Time:26737.0min: [2024-01-04 08:38:37,740][model8_pretrain.py][INFO] Epoch:[0/2](369100/4588595) loss:3.045 lr:0.0000100 epoch_Time:26737.0min: [2024-01-04 08:38:37,740][model8_pretrain.py][INFO] Epoch:[0/2](369100/4588595) loss:3.044 lr:0.0000100 epoch_Time:26737.0min: [2024-01-04 08:38:37,740][model8_pretrain.py][INFO] Epoch:[0/2](369100/4588595) loss:2.832 lr:0.0000100 epoch_Time:26737.0min: [2024-01-04 08:38:37,740][model8_pretrain.py][INFO] Epoch:[0/2](369100/4588595) loss:2.945 lr:0.0000100 epoch_Time:26737.0min: [2024-01-04 08:38:37,740][model8_pretrain.py][INFO] Epoch:[0/2](369100/4588595) loss:3.368 lr:0.0000100 epoch_Time:26737.0min: [2024-01-04 08:38:37,740][model8_pretrain.py][INFO] Epoch:[0/2](369100/4588595) loss:2.611 lr:0.0000100 epoch_Time:26737.0min: [2024-01-04 08:38:37,740][model8_pretrain.py][INFO] Epoch:[0/2](369100/4588595) loss:2.133 lr:0.0000100 epoch_Time:26737.0min: [2024-01-04 08:39:14,677][model8_pretrain.py][INFO] Epoch:[0/2](369200/4588595) loss:3.214 lr:0.0000100 epoch_Time:26736.0min: [2024-01-04 08:39:14,677][model8_pretrain.py][INFO] Epoch:[0/2](369200/4588595) loss:3.335 lr:0.0000100 epoch_Time:26736.0min: [2024-01-04 08:39:14,677][model8_pretrain.py][INFO] Epoch:[0/2](369200/4588595) loss:2.963 lr:0.0000100 epoch_Time:26736.0min: [2024-01-04 08:39:14,677][model8_pretrain.py][INFO] Epoch:[0/2](369200/4588595) loss:3.428 lr:0.0000100 epoch_Time:26736.0min: [2024-01-04 08:39:14,677][model8_pretrain.py][INFO] 
Epoch:[0/2](369200/4588595) loss:2.842 lr:0.0000100 epoch_Time:26736.0min: [2024-01-04 08:39:14,677][model8_pretrain.py][INFO] Epoch:[0/2](369200/4588595) loss:2.495 lr:0.0000100 epoch_Time:26736.0min: [2024-01-04 08:39:14,677][model8_pretrain.py][INFO] Epoch:[0/2](369200/4588595) loss:3.193 lr:0.0000100 epoch_Time:26736.0min: [2024-01-04 08:39:14,677][model8_pretrain.py][INFO] Epoch:[0/2](369200/4588595) loss:3.558 lr:0.0000100 epoch_Time:26736.0min: [2024-01-04 08:39:51,615][model8_pretrain.py][INFO] Epoch:[0/2](369300/4588595) loss:2.990 lr:0.0000100 epoch_Time:26735.0min: [2024-01-04 08:39:51,615][model8_pretrain.py][INFO] Epoch:[0/2](369300/4588595) loss:3.035 lr:0.0000100 epoch_Time:26735.0min: [2024-01-04 08:39:51,615][model8_pretrain.py][INFO] Epoch:[0/2](369300/4588595) loss:3.350 lr:0.0000100 epoch_Time:26735.0min: [2024-01-04 08:39:51,616][model8_pretrain.py][INFO] Epoch:[0/2](369300/4588595) loss:2.634 lr:0.0000100 epoch_Time:26735.0min: [2024-01-04 08:39:51,616][model8_pretrain.py][INFO] Epoch:[0/2](369300/4588595) loss:2.625 lr:0.0000100 epoch_Time:26735.0min: [2024-01-04 08:39:51,616][model8_pretrain.py][INFO] Epoch:[0/2](369300/4588595) loss:2.344 lr:0.0000100 epoch_Time:26735.0min: [2024-01-04 08:39:51,616][model8_pretrain.py][INFO] Epoch:[0/2](369300/4588595) loss:2.912 lr:0.0000100 epoch_Time:26735.0min: [2024-01-04 08:39:51,616][model8_pretrain.py][INFO] Epoch:[0/2](369300/4588595) loss:2.758 lr:0.0000100 epoch_Time:26735.0min: [2024-01-04 08:40:33,750][model8_pretrain.py][INFO] Epoch:[0/2](369400/4588595) loss:3.197 lr:0.0000100 epoch_Time:26736.0min: [2024-01-04 08:40:33,750][model8_pretrain.py][INFO] Epoch:[0/2](369400/4588595) loss:2.072 lr:0.0000100 epoch_Time:26736.0min: [2024-01-04 08:40:33,750][model8_pretrain.py][INFO] Epoch:[0/2](369400/4588595) loss:2.778 lr:0.0000100 epoch_Time:26736.0min: [2024-01-04 08:40:33,750][model8_pretrain.py][INFO] Epoch:[0/2](369400/4588595) loss:2.974 lr:0.0000100 epoch_Time:26736.0min: [2024-01-04 
08:40:33,750][model8_pretrain.py][INFO] Epoch:[0/2](369400/4588595) loss:2.966 lr:0.0000100 epoch_Time:26736.0min: [2024-01-04 08:40:33,750][model8_pretrain.py][INFO] Epoch:[0/2](369400/4588595) loss:3.052 lr:0.0000100 epoch_Time:26736.0min: [2024-01-04 08:40:33,750][model8_pretrain.py][INFO] Epoch:[0/2](369400/4588595) loss:2.852 lr:0.0000100 epoch_Time:26736.0min: [2024-01-04 08:40:33,750][model8_pretrain.py][INFO] Epoch:[0/2](369400/4588595) loss:2.293 lr:0.0000100 epoch_Time:26736.0min: [2024-01-04 08:41:15,883][model8_pretrain.py][INFO] Epoch:[0/2](369500/4588595) loss:2.959 lr:0.0000100 epoch_Time:26736.0min: [2024-01-04 08:41:15,883][model8_pretrain.py][INFO] Epoch:[0/2](369500/4588595) loss:2.412 lr:0.0000100 epoch_Time:26736.0min: [2024-01-04 08:41:15,883][model8_pretrain.py][INFO] Epoch:[0/2](369500/4588595) loss:3.337 lr:0.0000100 epoch_Time:26736.0min: [2024-01-04 08:41:15,883][model8_pretrain.py][INFO] Epoch:[0/2](369500/4588595) loss:2.463 lr:0.0000100 epoch_Time:26736.0min: [2024-01-04 08:41:15,883][model8_pretrain.py][INFO] Epoch:[0/2](369500/4588595) loss:3.016 lr:0.0000100 epoch_Time:26736.0min: [2024-01-04 08:41:15,883][model8_pretrain.py][INFO] Epoch:[0/2](369500/4588595) loss:3.044 lr:0.0000100 epoch_Time:26736.0min: [2024-01-04 08:41:15,883][model8_pretrain.py][INFO] Epoch:[0/2](369500/4588595) loss:2.772 lr:0.0000100 epoch_Time:26736.0min: [2024-01-04 08:41:15,884][model8_pretrain.py][INFO] Epoch:[0/2](369500/4588595) loss:2.615 lr:0.0000100 epoch_Time:26736.0min: [2024-01-04 08:41:52,794][model8_pretrain.py][INFO] Epoch:[0/2](369600/4588595) loss:2.844 lr:0.0000100 epoch_Time:26734.0min: [2024-01-04 08:41:52,794][model8_pretrain.py][INFO] Epoch:[0/2](369600/4588595) loss:2.963 lr:0.0000100 epoch_Time:26734.0min: [2024-01-04 08:41:52,794][model8_pretrain.py][INFO] Epoch:[0/2](369600/4588595) loss:3.155 lr:0.0000100 epoch_Time:26734.0min: [2024-01-04 08:41:52,794][model8_pretrain.py][INFO] Epoch:[0/2](369600/4588595) loss:2.688 lr:0.0000100 
epoch_Time:26734.0min: [2024-01-04 08:41:52,794][model8_pretrain.py][INFO] Epoch:[0/2](369600/4588595) loss:3.312 lr:0.0000100 epoch_Time:26735.0min: [2024-01-04 08:41:52,794][model8_pretrain.py][INFO] Epoch:[0/2](369600/4588595) loss:2.409 lr:0.0000100 epoch_Time:26734.0min: [2024-01-04 08:41:52,794][model8_pretrain.py][INFO] Epoch:[0/2](369600/4588595) loss:3.043 lr:0.0000100 epoch_Time:26734.0min: [2024-01-04 08:41:52,794][model8_pretrain.py][INFO] Epoch:[0/2](369600/4588595) loss:3.064 lr:0.0000100 epoch_Time:26734.0min: [2024-01-04 08:42:29,728][model8_pretrain.py][INFO] Epoch:[0/2](369700/4588595) loss:3.133 lr:0.0000100 epoch_Time:26734.0min: [2024-01-04 08:42:29,728][model8_pretrain.py][INFO] Epoch:[0/2](369700/4588595) loss:2.253 lr:0.0000100 epoch_Time:26734.0min: [2024-01-04 08:42:29,728][model8_pretrain.py][INFO] Epoch:[0/2](369700/4588595) loss:2.603 lr:0.0000100 epoch_Time:26734.0min: [2024-01-04 08:42:29,728][model8_pretrain.py][INFO] Epoch:[0/2](369700/4588595) loss:2.715 lr:0.0000100 epoch_Time:26734.0min: [2024-01-04 08:42:29,728][model8_pretrain.py][INFO] Epoch:[0/2](369700/4588595) loss:3.628 lr:0.0000100 epoch_Time:26734.0min: [2024-01-04 08:42:29,728][model8_pretrain.py][INFO] Epoch:[0/2](369700/4588595) loss:2.950 lr:0.0000100 epoch_Time:26734.0min: [2024-01-04 08:42:29,728][model8_pretrain.py][INFO] Epoch:[0/2](369700/4588595) loss:3.050 lr:0.0000100 epoch_Time:26734.0min: [2024-01-04 08:42:29,728][model8_pretrain.py][INFO] Epoch:[0/2](369700/4588595) loss:3.056 lr:0.0000100 epoch_Time:26734.0min: [2024-01-04 08:43:06,658][model8_pretrain.py][INFO] Epoch:[0/2](369800/4588595) loss:2.845 lr:0.0000100 epoch_Time:26733.0min: [2024-01-04 08:43:06,659][model8_pretrain.py][INFO] Epoch:[0/2](369800/4588595) loss:2.938 lr:0.0000100 epoch_Time:26733.0min: [2024-01-04 08:43:06,659][model8_pretrain.py][INFO] Epoch:[0/2](369800/4588595) loss:3.060 lr:0.0000100 epoch_Time:26733.0min: [2024-01-04 08:43:06,659][model8_pretrain.py][INFO] 
Epoch:[0/2](369800/4588595) loss:2.973 lr:0.0000100 epoch_Time:26733.0min: [2024-01-04 08:43:06,659][model8_pretrain.py][INFO] Epoch:[0/2](369800/4588595) loss:2.678 lr:0.0000100 epoch_Time:26733.0min: [2024-01-04 08:43:06,659][model8_pretrain.py][INFO] Epoch:[0/2](369800/4588595) loss:2.603 lr:0.0000100 epoch_Time:26733.0min: [2024-01-04 08:43:06,659][model8_pretrain.py][INFO] Epoch:[0/2](369800/4588595) loss:2.831 lr:0.0000100 epoch_Time:26733.0min: [2024-01-04 08:43:06,659][model8_pretrain.py][INFO] Epoch:[0/2](369800/4588595) loss:2.990 lr:0.0000100 epoch_Time:26733.0min: [2024-01-04 08:43:43,603][model8_pretrain.py][INFO] Epoch:[0/2](369900/4588595) loss:3.059 lr:0.0000100 epoch_Time:26733.0min: [2024-01-04 08:43:43,603][model8_pretrain.py][INFO] Epoch:[0/2](369900/4588595) loss:2.259 lr:0.0000100 epoch_Time:26733.0min: [2024-01-04 08:43:43,603][model8_pretrain.py][INFO] Epoch:[0/2](369900/4588595) loss:2.254 lr:0.0000100 epoch_Time:26733.0min: [2024-01-04 08:43:43,603][model8_pretrain.py][INFO] Epoch:[0/2](369900/4588595) loss:2.763 lr:0.0000100 epoch_Time:26733.0min: [2024-01-04 08:43:43,603][model8_pretrain.py][INFO] Epoch:[0/2](369900/4588595) loss:2.952 lr:0.0000100 epoch_Time:26733.0min: [2024-01-04 08:43:43,603][model8_pretrain.py][INFO] Epoch:[0/2](369900/4588595) loss:3.441 lr:0.0000100 epoch_Time:26733.0min: [2024-01-04 08:43:43,603][model8_pretrain.py][INFO] Epoch:[0/2](369900/4588595) loss:2.844 lr:0.0000100 epoch_Time:26733.0min: [2024-01-04 08:43:43,603][model8_pretrain.py][INFO] Epoch:[0/2](369900/4588595) loss:2.415 lr:0.0000100 epoch_Time:26733.0min: [2024-01-04 08:44:20,644][model8_pretrain.py][INFO] Epoch:[0/2](370000/4588595) loss:2.858 lr:0.0000100 epoch_Time:26732.0min: [2024-01-04 08:44:20,644][model8_pretrain.py][INFO] Epoch:[0/2](370000/4588595) loss:2.307 lr:0.0000100 epoch_Time:26732.0min: [2024-01-04 08:44:20,644][model8_pretrain.py][INFO] Epoch:[0/2](370000/4588595) loss:2.653 lr:0.0000100 epoch_Time:26732.0min: [2024-01-04 
08:44:20,644][model8_pretrain.py][INFO] Epoch:[0/2](370000/4588595) loss:2.641 lr:0.0000100 epoch_Time:26732.0min: [2024-01-04 08:44:20,644][model8_pretrain.py][INFO] Epoch:[0/2](370000/4588595) loss:2.943 lr:0.0000100 epoch_Time:26732.0min: [2024-01-04 08:44:20,644][model8_pretrain.py][INFO] Epoch:[0/2](370000/4588595) loss:3.690 lr:0.0000100 epoch_Time:26732.0min: [2024-01-04 08:44:20,644][model8_pretrain.py][INFO] Epoch:[0/2](370000/4588595) loss:3.199 lr:0.0000100 epoch_Time:26732.0min: [2024-01-04 08:44:20,644][model8_pretrain.py][INFO] Epoch:[0/2](370000/4588595) loss:2.833 lr:0.0000100 epoch_Time:26732.0min: [2024-01-04 08:44:57,609][model8_pretrain.py][INFO] Epoch:[0/2](370100/4588595) loss:3.304 lr:0.0000100 epoch_Time:26730.0min: [2024-01-04 08:44:57,609][model8_pretrain.py][INFO] Epoch:[0/2](370100/4588595) loss:2.749 lr:0.0000100 epoch_Time:26730.0min: [2024-01-04 08:44:57,609][model8_pretrain.py][INFO] Epoch:[0/2](370100/4588595) loss:2.703 lr:0.0000100 epoch_Time:26730.0min: [2024-01-04 08:44:57,609][model8_pretrain.py][INFO] Epoch:[0/2](370100/4588595) loss:3.053 lr:0.0000100 epoch_Time:26730.0min: [2024-01-04 08:44:57,609][model8_pretrain.py][INFO] Epoch:[0/2](370100/4588595) loss:3.293 lr:0.0000100 epoch_Time:26730.0min: [2024-01-04 08:44:57,609][model8_pretrain.py][INFO] Epoch:[0/2](370100/4588595) loss:2.264 lr:0.0000100 epoch_Time:26730.0min: [2024-01-04 08:44:57,609][model8_pretrain.py][INFO] Epoch:[0/2](370100/4588595) loss:3.083 lr:0.0000100 epoch_Time:26730.0min: [2024-01-04 08:44:57,609][model8_pretrain.py][INFO] Epoch:[0/2](370100/4588595) loss:3.038 lr:0.0000100 epoch_Time:26730.0min: [2024-01-04 08:45:39,766][model8_pretrain.py][INFO] Epoch:[0/2](370200/4588595) loss:2.931 lr:0.0000100 epoch_Time:26731.0min: [2024-01-04 08:45:39,766][model8_pretrain.py][INFO] Epoch:[0/2](370200/4588595) loss:2.663 lr:0.0000100 epoch_Time:26731.0min: [2024-01-04 08:45:39,766][model8_pretrain.py][INFO] Epoch:[0/2](370200/4588595) loss:2.970 lr:0.0000100 
epoch_Time:26731.0min: [2024-01-04 08:45:39,766][model8_pretrain.py][INFO] Epoch:[0/2](370200/4588595) loss:2.885 lr:0.0000100 epoch_Time:26731.0min: [2024-01-04 08:45:39,766][model8_pretrain.py][INFO] Epoch:[0/2](370200/4588595) loss:2.943 lr:0.0000100 epoch_Time:26731.0min: [2024-01-04 08:45:39,766][model8_pretrain.py][INFO] Epoch:[0/2](370200/4588595) loss:2.522 lr:0.0000100 epoch_Time:26731.0min: [2024-01-04 08:45:39,766][model8_pretrain.py][INFO] Epoch:[0/2](370200/4588595) loss:2.981 lr:0.0000100 epoch_Time:26731.0min: [2024-01-04 08:45:39,766][model8_pretrain.py][INFO] Epoch:[0/2](370200/4588595) loss:2.807 lr:0.0000100 epoch_Time:26731.0min: [2024-01-04 08:46:21,727][model8_pretrain.py][INFO] Epoch:[0/2](370300/4588595) loss:2.630 lr:0.0000100 epoch_Time:26731.0min: [2024-01-04 08:46:21,727][model8_pretrain.py][INFO] Epoch:[0/2](370300/4588595) loss:2.975 lr:0.0000100 epoch_Time:26731.0min: [2024-01-04 08:46:21,727][model8_pretrain.py][INFO] Epoch:[0/2](370300/4588595) loss:1.997 lr:0.0000100 epoch_Time:26731.0min: [2024-01-04 08:46:21,727][model8_pretrain.py][INFO] Epoch:[0/2](370300/4588595) loss:2.563 lr:0.0000100 epoch_Time:26731.0min: [2024-01-04 08:46:21,727][model8_pretrain.py][INFO] Epoch:[0/2](370300/4588595) loss:2.366 lr:0.0000100 epoch_Time:26731.0min: [2024-01-04 08:46:21,727][model8_pretrain.py][INFO] Epoch:[0/2](370300/4588595) loss:2.418 lr:0.0000100 epoch_Time:26731.0min: [2024-01-04 08:46:21,727][model8_pretrain.py][INFO] Epoch:[0/2](370300/4588595) loss:2.558 lr:0.0000100 epoch_Time:26731.0min: [2024-01-04 08:46:21,727][model8_pretrain.py][INFO] Epoch:[0/2](370300/4588595) loss:3.033 lr:0.0000100 epoch_Time:26731.0min: [2024-01-04 08:46:58,661][model8_pretrain.py][INFO] Epoch:[0/2](370400/4588595) loss:2.765 lr:0.0000100 epoch_Time:26730.0min: [2024-01-04 08:46:58,661][model8_pretrain.py][INFO] Epoch:[0/2](370400/4588595) loss:2.743 lr:0.0000100 epoch_Time:26730.0min: [2024-01-04 08:46:58,662][model8_pretrain.py][INFO] 
Epoch:[0/2](370400/4588595) loss:2.630 lr:0.0000100 epoch_Time:26730.0min: [2024-01-04 08:46:58,662][model8_pretrain.py][INFO] Epoch:[0/2](370400/4588595) loss:2.787 lr:0.0000100 epoch_Time:26730.0min: [2024-01-04 08:46:58,661][model8_pretrain.py][INFO] Epoch:[0/2](370400/4588595) loss:2.997 lr:0.0000100 epoch_Time:26730.0min: [2024-01-04 08:46:58,662][model8_pretrain.py][INFO] Epoch:[0/2](370400/4588595) loss:3.080 lr:0.0000100 epoch_Time:26730.0min: [2024-01-04 08:46:58,662][model8_pretrain.py][INFO] Epoch:[0/2](370400/4588595) loss:2.995 lr:0.0000100 epoch_Time:26730.0min: [2024-01-04 08:46:58,662][model8_pretrain.py][INFO] Epoch:[0/2](370400/4588595) loss:2.525 lr:0.0000100 epoch_Time:26730.0min: [2024-01-04 08:47:35,599][model8_pretrain.py][INFO] Epoch:[0/2](370500/4588595) loss:2.516 lr:0.0000100 epoch_Time:26730.0min: [2024-01-04 08:47:35,599][model8_pretrain.py][INFO] Epoch:[0/2](370500/4588595) loss:3.088 lr:0.0000100 epoch_Time:26730.0min: [2024-01-04 08:47:35,599][model8_pretrain.py][INFO] Epoch:[0/2](370500/4588595) loss:3.136 lr:0.0000100 epoch_Time:26730.0min: [2024-01-04 08:47:35,599][model8_pretrain.py][INFO] Epoch:[0/2](370500/4588595) loss:2.645 lr:0.0000100 epoch_Time:26730.0min: [2024-01-04 08:47:35,599][model8_pretrain.py][INFO] Epoch:[0/2](370500/4588595) loss:2.600 lr:0.0000100 epoch_Time:26730.0min: [2024-01-04 08:47:35,599][model8_pretrain.py][INFO] Epoch:[0/2](370500/4588595) loss:2.867 lr:0.0000100 epoch_Time:26730.0min: [2024-01-04 08:47:35,599][model8_pretrain.py][INFO] Epoch:[0/2](370500/4588595) loss:2.803 lr:0.0000100 epoch_Time:26730.0min: [2024-01-04 08:47:35,599][model8_pretrain.py][INFO] Epoch:[0/2](370500/4588595) loss:2.902 lr:0.0000100 epoch_Time:26730.0min: [2024-01-04 08:48:12,536][model8_pretrain.py][INFO] Epoch:[0/2](370600/4588595) loss:2.814 lr:0.0000100 epoch_Time:26728.0min: [2024-01-04 08:48:12,537][model8_pretrain.py][INFO] Epoch:[0/2](370600/4588595) loss:2.779 lr:0.0000100 epoch_Time:26728.0min: [2024-01-04 
08:48:12,537][model8_pretrain.py][INFO] Epoch:[0/2](370600/4588595) loss:2.861 lr:0.0000100 epoch_Time:26728.0min: [2024-01-04 08:48:12,537][model8_pretrain.py][INFO] Epoch:[0/2](370600/4588595) loss:3.151 lr:0.0000100 epoch_Time:26728.0min: [2024-01-04 08:48:12,537][model8_pretrain.py][INFO] Epoch:[0/2](370600/4588595) loss:2.478 lr:0.0000100 epoch_Time:26728.0min: [2024-01-04 08:48:12,537][model8_pretrain.py][INFO] Epoch:[0/2](370600/4588595) loss:3.139 lr:0.0000100 epoch_Time:26728.0min: [2024-01-04 08:48:12,537][model8_pretrain.py][INFO] Epoch:[0/2](370600/4588595) loss:2.852 lr:0.0000100 epoch_Time:26728.0min: [2024-01-04 08:48:12,537][model8_pretrain.py][INFO] Epoch:[0/2](370600/4588595) loss:2.754 lr:0.0000100 epoch_Time:26728.0min: [2024-01-04 08:48:49,473][model8_pretrain.py][INFO] Epoch:[0/2](370700/4588595) loss:2.587 lr:0.0000100 epoch_Time:26727.0min: [2024-01-04 08:48:49,473][model8_pretrain.py][INFO] Epoch:[0/2](370700/4588595) loss:2.563 lr:0.0000100 epoch_Time:26727.0min: [2024-01-04 08:48:49,473][model8_pretrain.py][INFO] Epoch:[0/2](370700/4588595) loss:2.785 lr:0.0000100 epoch_Time:26727.0min: [2024-01-04 08:48:49,473][model8_pretrain.py][INFO] Epoch:[0/2](370700/4588595) loss:3.046 lr:0.0000100 epoch_Time:26727.0min: [2024-01-04 08:48:49,473][model8_pretrain.py][INFO] Epoch:[0/2](370700/4588595) loss:2.666 lr:0.0000100 epoch_Time:26727.0min: [2024-01-04 08:48:49,473][model8_pretrain.py][INFO] Epoch:[0/2](370700/4588595) loss:3.139 lr:0.0000100 epoch_Time:26727.0min: [2024-01-04 08:48:49,473][model8_pretrain.py][INFO] Epoch:[0/2](370700/4588595) loss:3.161 lr:0.0000100 epoch_Time:26727.0min: [2024-01-04 08:48:49,473][model8_pretrain.py][INFO] Epoch:[0/2](370700/4588595) loss:2.991 lr:0.0000100 epoch_Time:26727.0min: [2024-01-04 08:49:26,410][model8_pretrain.py][INFO] Epoch:[0/2](370800/4588595) loss:3.004 lr:0.0000100 epoch_Time:26727.0min: [2024-01-04 08:49:26,410][model8_pretrain.py][INFO] Epoch:[0/2](370800/4588595) loss:3.237 lr:0.0000100 
epoch_Time:26727.0min: [2024-01-04 08:49:26,410][model8_pretrain.py][INFO] Epoch:[0/2](370800/4588595) loss:2.876 lr:0.0000100 epoch_Time:26727.0min: [2024-01-04 08:49:26,410][model8_pretrain.py][INFO] Epoch:[0/2](370800/4588595) loss:2.807 lr:0.0000100 epoch_Time:26727.0min: [2024-01-04 08:49:26,410][model8_pretrain.py][INFO] Epoch:[0/2](370800/4588595) loss:2.878 lr:0.0000100 epoch_Time:26727.0min: [2024-01-04 08:49:26,410][model8_pretrain.py][INFO] Epoch:[0/2](370800/4588595) loss:2.977 lr:0.0000100 epoch_Time:26727.0min: [2024-01-04 08:49:26,410][model8_pretrain.py][INFO] Epoch:[0/2](370800/4588595) loss:2.864 lr:0.0000100 epoch_Time:26727.0min: [2024-01-04 08:49:26,410][model8_pretrain.py][INFO] Epoch:[0/2](370800/4588595) loss:2.614 lr:0.0000100 epoch_Time:26727.0min: [2024-01-04 08:50:03,346][model8_pretrain.py][INFO] Epoch:[0/2](370900/4588595) loss:3.011 lr:0.0000100 epoch_Time:26726.0min: [2024-01-04 08:50:03,346][model8_pretrain.py][INFO] Epoch:[0/2](370900/4588595) loss:2.889 lr:0.0000100 epoch_Time:26726.0min: [2024-01-04 08:50:03,346][model8_pretrain.py][INFO] Epoch:[0/2](370900/4588595) loss:3.047 lr:0.0000100 epoch_Time:26726.0min: [2024-01-04 08:50:03,346][model8_pretrain.py][INFO] Epoch:[0/2](370900/4588595) loss:2.309 lr:0.0000100 epoch_Time:26726.0min: [2024-01-04 08:50:03,346][model8_pretrain.py][INFO] Epoch:[0/2](370900/4588595) loss:2.983 lr:0.0000100 epoch_Time:26726.0min: [2024-01-04 08:50:03,346][model8_pretrain.py][INFO] Epoch:[0/2](370900/4588595) loss:2.944 lr:0.0000100 epoch_Time:26726.0min: [2024-01-04 08:50:03,346][model8_pretrain.py][INFO] Epoch:[0/2](370900/4588595) loss:2.904 lr:0.0000100 epoch_Time:26726.0min: [2024-01-04 08:50:03,347][model8_pretrain.py][INFO] Epoch:[0/2](370900/4588595) loss:2.684 lr:0.0000100 epoch_Time:26726.0min: [2024-01-04 08:50:45,500][model8_pretrain.py][INFO] Epoch:[0/2](371000/4588595) loss:2.405 lr:0.0000100 epoch_Time:26727.0min: [2024-01-04 08:50:45,500][model8_pretrain.py][INFO] 
Epoch:[0/2](371000/4588595) loss:2.856 lr:0.0000100 epoch_Time:26727.0min: [2024-01-04 08:50:45,500][model8_pretrain.py][INFO] Epoch:[0/2](371000/4588595) loss:3.445 lr:0.0000100 epoch_Time:26727.0min: [2024-01-04 08:50:45,500][model8_pretrain.py][INFO] Epoch:[0/2](371000/4588595) loss:3.183 lr:0.0000100 epoch_Time:26727.0min: [2024-01-04 08:50:45,502][model8_pretrain.py][INFO] Epoch:[0/2](371000/4588595) loss:3.132 lr:0.0000100 epoch_Time:26727.0min: [2024-01-04 08:50:45,504][model8_pretrain.py][INFO] Epoch:[0/2](371000/4588595) loss:2.827 lr:0.0000100 epoch_Time:26727.0min: [2024-01-04 08:50:45,505][model8_pretrain.py][INFO] Epoch:[0/2](371000/4588595) loss:2.725 lr:0.0000100 epoch_Time:26727.0min: [2024-01-04 08:50:45,505][model8_pretrain.py][INFO] Epoch:[0/2](371000/4588595) loss:2.639 lr:0.0000100 epoch_Time:26727.0min: [2024-01-04 08:51:27,382][model8_pretrain.py][INFO] Epoch:[0/2](371100/4588595) loss:3.112 lr:0.0000100 epoch_Time:26726.0min: [2024-01-04 08:51:27,382][model8_pretrain.py][INFO] Epoch:[0/2](371100/4588595) loss:3.330 lr:0.0000100 epoch_Time:26726.0min: [2024-01-04 08:51:27,382][model8_pretrain.py][INFO] Epoch:[0/2](371100/4588595) loss:3.168 lr:0.0000100 epoch_Time:26726.0min: [2024-01-04 08:51:27,382][model8_pretrain.py][INFO] Epoch:[0/2](371100/4588595) loss:3.131 lr:0.0000100 epoch_Time:26726.0min: [2024-01-04 08:51:27,382][model8_pretrain.py][INFO] Epoch:[0/2](371100/4588595) loss:2.667 lr:0.0000100 epoch_Time:26726.0min: [2024-01-04 08:51:27,382][model8_pretrain.py][INFO] Epoch:[0/2](371100/4588595) loss:2.658 lr:0.0000100 epoch_Time:26726.0min: [2024-01-04 08:51:27,382][model8_pretrain.py][INFO] Epoch:[0/2](371100/4588595) loss:2.909 lr:0.0000100 epoch_Time:26726.0min: [2024-01-04 08:51:27,382][model8_pretrain.py][INFO] Epoch:[0/2](371100/4588595) loss:3.087 lr:0.0000100 epoch_Time:26726.0min: [2024-01-04 08:52:04,328][model8_pretrain.py][INFO] Epoch:[0/2](371200/4588595) loss:3.251 lr:0.0000100 epoch_Time:26725.0min: [2024-01-04 
08:52:04,328][model8_pretrain.py][INFO] Epoch:[0/2](371200/4588595) loss:3.037 lr:0.0000100 epoch_Time:26725.0min: [2024-01-04 08:52:04,328][model8_pretrain.py][INFO] Epoch:[0/2](371200/4588595) loss:2.246 lr:0.0000100 epoch_Time:26725.0min: [2024-01-04 08:52:04,328][model8_pretrain.py][INFO] Epoch:[0/2](371200/4588595) loss:2.066 lr:0.0000100 epoch_Time:26725.0min: [2024-01-04 08:52:04,328][model8_pretrain.py][INFO] Epoch:[0/2](371200/4588595) loss:2.596 lr:0.0000100 epoch_Time:26725.0min: [2024-01-04 08:52:04,328][model8_pretrain.py][INFO] Epoch:[0/2](371200/4588595) loss:3.262 lr:0.0000100 epoch_Time:26725.0min: [2024-01-04 08:52:04,328][model8_pretrain.py][INFO] Epoch:[0/2](371200/4588595) loss:2.873 lr:0.0000100 epoch_Time:26725.0min: [2024-01-04 08:52:04,328][model8_pretrain.py][INFO] Epoch:[0/2](371200/4588595) loss:3.278 lr:0.0000100 epoch_Time:26725.0min: [2024-01-04 08:52:41,262][model8_pretrain.py][INFO] Epoch:[0/2](371300/4588595) loss:2.790 lr:0.0000100 epoch_Time:26725.0min: [2024-01-04 08:52:41,262][model8_pretrain.py][INFO] Epoch:[0/2](371300/4588595) loss:2.573 lr:0.0000100 epoch_Time:26725.0min: [2024-01-04 08:52:41,262][model8_pretrain.py][INFO] Epoch:[0/2](371300/4588595) loss:3.079 lr:0.0000100 epoch_Time:26725.0min: [2024-01-04 08:52:41,262][model8_pretrain.py][INFO] Epoch:[0/2](371300/4588595) loss:2.707 lr:0.0000100 epoch_Time:26725.0min: [2024-01-04 08:52:41,262][model8_pretrain.py][INFO] Epoch:[0/2](371300/4588595) loss:2.688 lr:0.0000100 epoch_Time:26725.0min: [2024-01-04 08:52:41,262][model8_pretrain.py][INFO] Epoch:[0/2](371300/4588595) loss:2.737 lr:0.0000100 epoch_Time:26725.0min: [2024-01-04 08:52:41,262][model8_pretrain.py][INFO] Epoch:[0/2](371300/4588595) loss:3.245 lr:0.0000100 epoch_Time:26725.0min: [2024-01-04 08:52:41,263][model8_pretrain.py][INFO] Epoch:[0/2](371300/4588595) loss:2.890 lr:0.0000100 epoch_Time:26725.0min: [2024-01-04 08:53:18,205][model8_pretrain.py][INFO] Epoch:[0/2](371400/4588595) loss:3.153 lr:0.0000100 
epoch_Time:26724.0min: [2024-01-04 08:53:18,205][model8_pretrain.py][INFO] Epoch:[0/2](371400/4588595) loss:2.983 lr:0.0000100 epoch_Time:26724.0min: [2024-01-04 08:53:18,205][model8_pretrain.py][INFO] Epoch:[0/2](371400/4588595) loss:2.993 lr:0.0000100 epoch_Time:26724.0min: [2024-01-04 08:53:18,205][model8_pretrain.py][INFO] Epoch:[0/2](371400/4588595) loss:2.640 lr:0.0000100 epoch_Time:26724.0min: [2024-01-04 08:53:18,205][model8_pretrain.py][INFO] Epoch:[0/2](371400/4588595) loss:2.762 lr:0.0000100 epoch_Time:26724.0min: [2024-01-04 08:53:18,205][model8_pretrain.py][INFO] Epoch:[0/2](371400/4588595) loss:3.077 lr:0.0000100 epoch_Time:26724.0min: [2024-01-04 08:53:18,205][model8_pretrain.py][INFO] Epoch:[0/2](371400/4588595) loss:2.164 lr:0.0000100 epoch_Time:26724.0min: [2024-01-04 08:53:18,205][model8_pretrain.py][INFO] Epoch:[0/2](371400/4588595) loss:2.875 lr:0.0000100 epoch_Time:26724.0min: [2024-01-04 08:53:55,140][model8_pretrain.py][INFO] Epoch:[0/2](371500/4588595) loss:2.786 lr:0.0000100 epoch_Time:26722.0min: [2024-01-04 08:53:55,140][model8_pretrain.py][INFO] Epoch:[0/2](371500/4588595) loss:2.705 lr:0.0000100 epoch_Time:26722.0min: [2024-01-04 08:53:55,140][model8_pretrain.py][INFO] Epoch:[0/2](371500/4588595) loss:3.238 lr:0.0000100 epoch_Time:26722.0min: [2024-01-04 08:53:55,140][model8_pretrain.py][INFO] Epoch:[0/2](371500/4588595) loss:2.436 lr:0.0000100 epoch_Time:26722.0min: [2024-01-04 08:53:55,140][model8_pretrain.py][INFO] Epoch:[0/2](371500/4588595) loss:2.930 lr:0.0000100 epoch_Time:26722.0min: [2024-01-04 08:53:55,140][model8_pretrain.py][INFO] Epoch:[0/2](371500/4588595) loss:3.089 lr:0.0000100 epoch_Time:26722.0min: [2024-01-04 08:53:55,140][model8_pretrain.py][INFO] Epoch:[0/2](371500/4588595) loss:3.027 lr:0.0000100 epoch_Time:26722.0min: [2024-01-04 08:53:55,140][model8_pretrain.py][INFO] Epoch:[0/2](371500/4588595) loss:3.039 lr:0.0000100 epoch_Time:26722.0min: [2024-01-04 08:54:32,082][model8_pretrain.py][INFO] 
Epoch:[0/2](371600/4588595) loss:2.897 lr:0.0000100 epoch_Time:26722.0min: [2024-01-04 08:54:32,082][model8_pretrain.py][INFO] Epoch:[0/2](371600/4588595) loss:2.968 lr:0.0000100 epoch_Time:26722.0min: [2024-01-04 08:54:32,083][model8_pretrain.py][INFO] Epoch:[0/2](371600/4588595) loss:2.798 lr:0.0000100 epoch_Time:26722.0min: [2024-01-04 08:54:32,083][model8_pretrain.py][INFO] Epoch:[0/2](371600/4588595) loss:2.852 lr:0.0000100 epoch_Time:26722.0min: [2024-01-04 08:54:32,083][model8_pretrain.py][INFO] Epoch:[0/2](371600/4588595) loss:2.894 lr:0.0000100 epoch_Time:26722.0min: [2024-01-04 08:54:32,083][model8_pretrain.py][INFO] Epoch:[0/2](371600/4588595) loss:3.259 lr:0.0000100 epoch_Time:26722.0min: [2024-01-04 08:54:32,083][model8_pretrain.py][INFO] Epoch:[0/2](371600/4588595) loss:2.522 lr:0.0000100 epoch_Time:26722.0min: [2024-01-04 08:54:32,083][model8_pretrain.py][INFO] Epoch:[0/2](371600/4588595) loss:2.895 lr:0.0000100 epoch_Time:26722.0min: [2024-01-04 08:55:09,021][model8_pretrain.py][INFO] Epoch:[0/2](371700/4588595) loss:2.952 lr:0.0000100 epoch_Time:26721.0min: [2024-01-04 08:55:09,021][model8_pretrain.py][INFO] Epoch:[0/2](371700/4588595) loss:3.071 lr:0.0000100 epoch_Time:26721.0min: [2024-01-04 08:55:09,021][model8_pretrain.py][INFO] Epoch:[0/2](371700/4588595) loss:2.726 lr:0.0000100 epoch_Time:26721.0min: [2024-01-04 08:55:09,021][model8_pretrain.py][INFO] Epoch:[0/2](371700/4588595) loss:2.547 lr:0.0000100 epoch_Time:26721.0min: [2024-01-04 08:55:09,021][model8_pretrain.py][INFO] Epoch:[0/2](371700/4588595) loss:3.317 lr:0.0000100 epoch_Time:26721.0min: [2024-01-04 08:55:09,021][model8_pretrain.py][INFO] Epoch:[0/2](371700/4588595) loss:2.304 lr:0.0000100 epoch_Time:26721.0min: [2024-01-04 08:55:09,021][model8_pretrain.py][INFO] Epoch:[0/2](371700/4588595) loss:3.040 lr:0.0000100 epoch_Time:26721.0min: [2024-01-04 08:55:09,022][model8_pretrain.py][INFO] Epoch:[0/2](371700/4588595) loss:2.968 lr:0.0000100 epoch_Time:26721.0min: [2024-01-04 
08:55:47,810][model8_pretrain.py][INFO] Epoch:[0/2](371800/4588595) loss:2.853 lr:0.0000100 epoch_Time:26720.0min: [2024-01-04 08:55:47,810][model8_pretrain.py][INFO] Epoch:[0/2](371800/4588595) loss:3.095 lr:0.0000100 epoch_Time:26720.0min: [2024-01-04 08:55:47,810][model8_pretrain.py][INFO] Epoch:[0/2](371800/4588595) loss:3.454 lr:0.0000100 epoch_Time:26720.0min: [2024-01-04 08:55:47,810][model8_pretrain.py][INFO] Epoch:[0/2](371800/4588595) loss:2.829 lr:0.0000100 epoch_Time:26720.0min: [2024-01-04 08:55:47,810][model8_pretrain.py][INFO] Epoch:[0/2](371800/4588595) loss:2.018 lr:0.0000100 epoch_Time:26720.0min: [2024-01-04 08:55:47,810][model8_pretrain.py][INFO] Epoch:[0/2](371800/4588595) loss:3.020 lr:0.0000100 epoch_Time:26720.0min: [2024-01-04 08:55:47,810][model8_pretrain.py][INFO] Epoch:[0/2](371800/4588595) loss:2.201 lr:0.0000100 epoch_Time:26720.0min: [2024-01-04 08:55:47,810][model8_pretrain.py][INFO] Epoch:[0/2](371800/4588595) loss:2.913 lr:0.0000100 epoch_Time:26720.0min: [2024-01-04 08:56:33,168][model8_pretrain.py][INFO] Epoch:[0/2](371900/4588595) loss:2.960 lr:0.0000100 epoch_Time:26722.0min: [2024-01-04 08:56:33,168][model8_pretrain.py][INFO] Epoch:[0/2](371900/4588595) loss:2.786 lr:0.0000100 epoch_Time:26722.0min: [2024-01-04 08:56:33,168][model8_pretrain.py][INFO] Epoch:[0/2](371900/4588595) loss:2.748 lr:0.0000100 epoch_Time:26722.0min: [2024-01-04 08:56:33,168][model8_pretrain.py][INFO] Epoch:[0/2](371900/4588595) loss:2.683 lr:0.0000100 epoch_Time:26722.0min: [2024-01-04 08:56:33,168][model8_pretrain.py][INFO] Epoch:[0/2](371900/4588595) loss:2.579 lr:0.0000100 epoch_Time:26722.0min: [2024-01-04 08:56:33,168][model8_pretrain.py][INFO] Epoch:[0/2](371900/4588595) loss:2.753 lr:0.0000100 epoch_Time:26722.0min: [2024-01-04 08:56:33,168][model8_pretrain.py][INFO] Epoch:[0/2](371900/4588595) loss:2.737 lr:0.0000100 epoch_Time:26722.0min: [2024-01-04 08:56:33,169][model8_pretrain.py][INFO] Epoch:[0/2](371900/4588595) loss:2.971 lr:0.0000100 
epoch_Time:26722.0min: [2024-01-04 08:57:10,102][model8_pretrain.py][INFO] Epoch:[0/2](372000/4588595) loss:2.772 lr:0.0000100 epoch_Time:26720.0min: [2024-01-04 08:57:10,102][model8_pretrain.py][INFO] Epoch:[0/2](372000/4588595) loss:2.887 lr:0.0000100 epoch_Time:26720.0min: [2024-01-04 08:57:10,102][model8_pretrain.py][INFO] Epoch:[0/2](372000/4588595) loss:3.089 lr:0.0000100 epoch_Time:26720.0min: [2024-01-04 08:57:10,102][model8_pretrain.py][INFO] Epoch:[0/2](372000/4588595) loss:2.599 lr:0.0000100 epoch_Time:26720.0min: [2024-01-04 08:57:10,102][model8_pretrain.py][INFO] Epoch:[0/2](372000/4588595) loss:2.462 lr:0.0000100 epoch_Time:26720.0min: [2024-01-04 08:57:10,102][model8_pretrain.py][INFO] Epoch:[0/2](372000/4588595) loss:2.917 lr:0.0000100 epoch_Time:26720.0min: [2024-01-04 08:57:10,102][model8_pretrain.py][INFO] Epoch:[0/2](372000/4588595) loss:2.768 lr:0.0000100 epoch_Time:26720.0min: [2024-01-04 08:57:10,102][model8_pretrain.py][INFO] Epoch:[0/2](372000/4588595) loss:3.383 lr:0.0000100 epoch_Time:26720.0min: [2024-01-04 08:57:47,041][model8_pretrain.py][INFO] Epoch:[0/2](372100/4588595) loss:2.727 lr:0.0000100 epoch_Time:26720.0min: [2024-01-04 08:57:47,041][model8_pretrain.py][INFO] Epoch:[0/2](372100/4588595) loss:2.670 lr:0.0000100 epoch_Time:26720.0min: [2024-01-04 08:57:47,041][model8_pretrain.py][INFO] Epoch:[0/2](372100/4588595) loss:2.951 lr:0.0000100 epoch_Time:26720.0min: [2024-01-04 08:57:47,041][model8_pretrain.py][INFO] Epoch:[0/2](372100/4588595) loss:2.674 lr:0.0000100 epoch_Time:26720.0min: [2024-01-04 08:57:47,041][model8_pretrain.py][INFO] Epoch:[0/2](372100/4588595) loss:3.077 lr:0.0000100 epoch_Time:26720.0min: [2024-01-04 08:57:47,041][model8_pretrain.py][INFO] Epoch:[0/2](372100/4588595) loss:3.320 lr:0.0000100 epoch_Time:26720.0min: [2024-01-04 08:57:47,041][model8_pretrain.py][INFO] Epoch:[0/2](372100/4588595) loss:3.357 lr:0.0000100 epoch_Time:26720.0min: [2024-01-04 08:57:47,041][model8_pretrain.py][INFO] 
Epoch:[0/2](372100/4588595) loss:2.405 lr:0.0000100 epoch_Time:26720.0min: [2024-01-04 08:58:23,987][model8_pretrain.py][INFO] Epoch:[0/2](372200/4588595) loss:2.486 lr:0.0000100 epoch_Time:26719.0min: [2024-01-04 08:58:23,987][model8_pretrain.py][INFO] Epoch:[0/2](372200/4588595) loss:3.128 lr:0.0000100 epoch_Time:26719.0min: [2024-01-04 08:58:23,987][model8_pretrain.py][INFO] Epoch:[0/2](372200/4588595) loss:2.855 lr:0.0000100 epoch_Time:26719.0min: [2024-01-04 08:58:23,987][model8_pretrain.py][INFO] Epoch:[0/2](372200/4588595) loss:3.237 lr:0.0000100 epoch_Time:26719.0min: [2024-01-04 08:58:23,987][model8_pretrain.py][INFO] Epoch:[0/2](372200/4588595) loss:2.983 lr:0.0000100 epoch_Time:26719.0min: [2024-01-04 08:58:23,987][model8_pretrain.py][INFO] Epoch:[0/2](372200/4588595) loss:2.898 lr:0.0000100 epoch_Time:26719.0min: [2024-01-04 08:58:23,987][model8_pretrain.py][INFO] Epoch:[0/2](372200/4588595) loss:2.894 lr:0.0000100 epoch_Time:26719.0min: [2024-01-04 08:58:23,987][model8_pretrain.py][INFO] Epoch:[0/2](372200/4588595) loss:2.841 lr:0.0000100 epoch_Time:26719.0min: [2024-01-04 08:59:00,933][model8_pretrain.py][INFO] Epoch:[0/2](372300/4588595) loss:3.002 lr:0.0000100 epoch_Time:26718.0min: [2024-01-04 08:59:00,933][model8_pretrain.py][INFO] Epoch:[0/2](372300/4588595) loss:2.855 lr:0.0000100 epoch_Time:26718.0min: [2024-01-04 08:59:00,933][model8_pretrain.py][INFO] Epoch:[0/2](372300/4588595) loss:2.941 lr:0.0000100 epoch_Time:26718.0min: [2024-01-04 08:59:00,933][model8_pretrain.py][INFO] Epoch:[0/2](372300/4588595) loss:2.819 lr:0.0000100 epoch_Time:26718.0min: [2024-01-04 08:59:00,934][model8_pretrain.py][INFO] Epoch:[0/2](372300/4588595) loss:2.758 lr:0.0000100 epoch_Time:26718.0min: [2024-01-04 08:59:00,933][model8_pretrain.py][INFO] Epoch:[0/2](372300/4588595) loss:2.955 lr:0.0000100 epoch_Time:26718.0min: [2024-01-04 08:59:00,934][model8_pretrain.py][INFO] Epoch:[0/2](372300/4588595) loss:3.145 lr:0.0000100 epoch_Time:26718.0min: [2024-01-04 
08:59:00,934][model8_pretrain.py][INFO] Epoch:[0/2](372300/4588595) loss:3.079 lr:0.0000100 epoch_Time:26718.0min: [2024-01-04 08:59:37,878][model8_pretrain.py][INFO] Epoch:[0/2](372400/4588595) loss:2.674 lr:0.0000100 epoch_Time:26718.0min: [2024-01-04 08:59:37,878][model8_pretrain.py][INFO] Epoch:[0/2](372400/4588595) loss:3.080 lr:0.0000100 epoch_Time:26718.0min: [2024-01-04 08:59:37,878][model8_pretrain.py][INFO] Epoch:[0/2](372400/4588595) loss:3.000 lr:0.0000100 epoch_Time:26718.0min: [2024-01-04 08:59:37,878][model8_pretrain.py][INFO] Epoch:[0/2](372400/4588595) loss:3.333 lr:0.0000100 epoch_Time:26718.0min: [2024-01-04 08:59:37,878][model8_pretrain.py][INFO] Epoch:[0/2](372400/4588595) loss:3.494 lr:0.0000100 epoch_Time:26718.0min: [2024-01-04 08:59:37,878][model8_pretrain.py][INFO] Epoch:[0/2](372400/4588595) loss:3.395 lr:0.0000100 epoch_Time:26718.0min: [2024-01-04 08:59:37,878][model8_pretrain.py][INFO] Epoch:[0/2](372400/4588595) loss:2.702 lr:0.0000100 epoch_Time:26718.0min: [2024-01-04 08:59:37,878][model8_pretrain.py][INFO] Epoch:[0/2](372400/4588595) loss:2.717 lr:0.0000100 epoch_Time:26718.0min: [2024-01-04 09:00:14,813][model8_pretrain.py][INFO] Epoch:[0/2](372500/4588595) loss:3.013 lr:0.0000100 epoch_Time:26716.0min: [2024-01-04 09:00:14,813][model8_pretrain.py][INFO] Epoch:[0/2](372500/4588595) loss:2.786 lr:0.0000100 epoch_Time:26716.0min: [2024-01-04 09:00:14,813][model8_pretrain.py][INFO] Epoch:[0/2](372500/4588595) loss:2.920 lr:0.0000100 epoch_Time:26716.0min: [2024-01-04 09:00:14,813][model8_pretrain.py][INFO] Epoch:[0/2](372500/4588595) loss:2.493 lr:0.0000100 epoch_Time:26716.0min: [2024-01-04 09:00:14,813][model8_pretrain.py][INFO] Epoch:[0/2](372500/4588595) loss:3.180 lr:0.0000100 epoch_Time:26716.0min: [2024-01-04 09:00:14,813][model8_pretrain.py][INFO] Epoch:[0/2](372500/4588595) loss:3.000 lr:0.0000100 epoch_Time:26716.0min: [2024-01-04 09:00:14,814][model8_pretrain.py][INFO] Epoch:[0/2](372500/4588595) loss:3.054 lr:0.0000100 
epoch_Time:26716.0min: [2024-01-04 09:00:14,814][model8_pretrain.py][INFO] Epoch:[0/2](372500/4588595) loss:3.055 lr:0.0000100 epoch_Time:26716.0min: [2024-01-04 09:00:53,567][model8_pretrain.py][INFO] Epoch:[0/2](372600/4588595) loss:3.127 lr:0.0000100 epoch_Time:26716.0min: [2024-01-04 09:00:53,567][model8_pretrain.py][INFO] Epoch:[0/2](372600/4588595) loss:3.137 lr:0.0000100 epoch_Time:26716.0min: [2024-01-04 09:00:53,567][model8_pretrain.py][INFO] Epoch:[0/2](372600/4588595) loss:3.212 lr:0.0000100 epoch_Time:26716.0min: [2024-01-04 09:00:53,567][model8_pretrain.py][INFO] Epoch:[0/2](372600/4588595) loss:3.188 lr:0.0000100 epoch_Time:26716.0min: [2024-01-04 09:00:53,567][model8_pretrain.py][INFO] Epoch:[0/2](372600/4588595) loss:3.087 lr:0.0000100 epoch_Time:26716.0min: [2024-01-04 09:00:53,567][model8_pretrain.py][INFO] Epoch:[0/2](372600/4588595) loss:2.463 lr:0.0000100 epoch_Time:26716.0min: [2024-01-04 09:00:53,567][model8_pretrain.py][INFO] Epoch:[0/2](372600/4588595) loss:2.682 lr:0.0000100 epoch_Time:26716.0min: [2024-01-04 09:00:53,567][model8_pretrain.py][INFO] Epoch:[0/2](372600/4588595) loss:3.033 lr:0.0000100 epoch_Time:26716.0min: [2024-01-04 09:01:38,663][model8_pretrain.py][INFO] Epoch:[0/2](372700/4588595) loss:3.143 lr:0.0000100 epoch_Time:26717.0min: [2024-01-04 09:01:38,663][model8_pretrain.py][INFO] Epoch:[0/2](372700/4588595) loss:3.478 lr:0.0000100 epoch_Time:26717.0min: [2024-01-04 09:01:38,663][model8_pretrain.py][INFO] Epoch:[0/2](372700/4588595) loss:2.955 lr:0.0000100 epoch_Time:26717.0min: [2024-01-04 09:01:38,663][model8_pretrain.py][INFO] Epoch:[0/2](372700/4588595) loss:2.209 lr:0.0000100 epoch_Time:26717.0min: [2024-01-04 09:01:38,663][model8_pretrain.py][INFO] Epoch:[0/2](372700/4588595) loss:3.342 lr:0.0000100 epoch_Time:26717.0min: [2024-01-04 09:01:38,663][model8_pretrain.py][INFO] Epoch:[0/2](372700/4588595) loss:2.304 lr:0.0000100 epoch_Time:26717.0min: [2024-01-04 09:01:38,663][model8_pretrain.py][INFO] 
Epoch:[0/2](372700/4588595) loss:2.710 lr:0.0000100 epoch_Time:26717.0min: [2024-01-04 09:01:38,663][model8_pretrain.py][INFO] Epoch:[0/2](372700/4588595) loss:3.233 lr:0.0000100 epoch_Time:26717.0min: [2024-01-04 09:02:15,599][model8_pretrain.py][INFO] Epoch:[0/2](372800/4588595) loss:2.775 lr:0.0000100 epoch_Time:26716.0min: [2024-01-04 09:02:15,599][model8_pretrain.py][INFO] Epoch:[0/2](372800/4588595) loss:2.348 lr:0.0000100 epoch_Time:26716.0min: [2024-01-04 09:02:15,599][model8_pretrain.py][INFO] Epoch:[0/2](372800/4588595) loss:3.219 lr:0.0000100 epoch_Time:26716.0min: [2024-01-04 09:02:15,599][model8_pretrain.py][INFO] Epoch:[0/2](372800/4588595) loss:2.969 lr:0.0000100 epoch_Time:26716.0min: [2024-01-04 09:02:15,599][model8_pretrain.py][INFO] Epoch:[0/2](372800/4588595) loss:3.159 lr:0.0000100 epoch_Time:26716.0min: [2024-01-04 09:02:15,599][model8_pretrain.py][INFO] Epoch:[0/2](372800/4588595) loss:2.704 lr:0.0000100 epoch_Time:26716.0min: [2024-01-04 09:02:15,599][model8_pretrain.py][INFO] Epoch:[0/2](372800/4588595) loss:3.236 lr:0.0000100 epoch_Time:26716.0min: [2024-01-04 09:02:15,599][model8_pretrain.py][INFO] Epoch:[0/2](372800/4588595) loss:2.826 lr:0.0000100 epoch_Time:26716.0min: [2024-01-04 09:02:52,545][model8_pretrain.py][INFO] Epoch:[0/2](372900/4588595) loss:3.731 lr:0.0000100 epoch_Time:26715.0min: [2024-01-04 09:02:52,545][model8_pretrain.py][INFO] Epoch:[0/2](372900/4588595) loss:2.906 lr:0.0000100 epoch_Time:26715.0min: [2024-01-04 09:02:52,545][model8_pretrain.py][INFO] Epoch:[0/2](372900/4588595) loss:2.550 lr:0.0000100 epoch_Time:26715.0min: [2024-01-04 09:02:52,545][model8_pretrain.py][INFO] Epoch:[0/2](372900/4588595) loss:2.812 lr:0.0000100 epoch_Time:26715.0min: [2024-01-04 09:02:52,545][model8_pretrain.py][INFO] Epoch:[0/2](372900/4588595) loss:2.850 lr:0.0000100 epoch_Time:26715.0min: [2024-01-04 09:02:52,545][model8_pretrain.py][INFO] Epoch:[0/2](372900/4588595) loss:2.403 lr:0.0000100 epoch_Time:26715.0min: [2024-01-04 
09:02:52,545][model8_pretrain.py][INFO] Epoch:[0/2](372900/4588595) loss:2.818 lr:0.0000100 epoch_Time:26715.0min: [2024-01-04 09:02:52,545][model8_pretrain.py][INFO] Epoch:[0/2](372900/4588595) loss:2.566 lr:0.0000100 epoch_Time:26715.0min: [2024-01-04 09:03:29,499][model8_pretrain.py][INFO] Epoch:[0/2](373000/4588595) loss:3.369 lr:0.0000100 epoch_Time:26714.0min: [2024-01-04 09:03:29,499][model8_pretrain.py][INFO] Epoch:[0/2](373000/4588595) loss:2.477 lr:0.0000100 epoch_Time:26714.0min: [2024-01-04 09:03:29,499][model8_pretrain.py][INFO] Epoch:[0/2](373000/4588595) loss:2.905 lr:0.0000100 epoch_Time:26714.0min: [2024-01-04 09:03:29,499][model8_pretrain.py][INFO] Epoch:[0/2](373000/4588595) loss:3.121 lr:0.0000100 epoch_Time:26714.0min: [2024-01-04 09:03:29,499][model8_pretrain.py][INFO] Epoch:[0/2](373000/4588595) loss:3.016 lr:0.0000100 epoch_Time:26714.0min: [2024-01-04 09:03:29,499][model8_pretrain.py][INFO] Epoch:[0/2](373000/4588595) loss:2.932 lr:0.0000100 epoch_Time:26714.0min: [2024-01-04 09:03:29,499][model8_pretrain.py][INFO] Epoch:[0/2](373000/4588595) loss:2.616 lr:0.0000100 epoch_Time:26714.0min: [2024-01-04 09:03:29,499][model8_pretrain.py][INFO] Epoch:[0/2](373000/4588595) loss:2.956 lr:0.0000100 epoch_Time:26714.0min: [2024-01-04 09:04:06,433][model8_pretrain.py][INFO] Epoch:[0/2](373100/4588595) loss:2.746 lr:0.0000100 epoch_Time:26713.0min: [2024-01-04 09:04:06,433][model8_pretrain.py][INFO] Epoch:[0/2](373100/4588595) loss:3.205 lr:0.0000100 epoch_Time:26713.0min: [2024-01-04 09:04:06,433][model8_pretrain.py][INFO] Epoch:[0/2](373100/4588595) loss:3.312 lr:0.0000100 epoch_Time:26713.0min: [2024-01-04 09:04:06,433][model8_pretrain.py][INFO] Epoch:[0/2](373100/4588595) loss:3.162 lr:0.0000100 epoch_Time:26713.0min: [2024-01-04 09:04:06,433][model8_pretrain.py][INFO] Epoch:[0/2](373100/4588595) loss:3.026 lr:0.0000100 epoch_Time:26713.0min: [2024-01-04 09:04:06,433][model8_pretrain.py][INFO] Epoch:[0/2](373100/4588595) loss:2.187 lr:0.0000100 
epoch_Time:26713.0min: [2024-01-04 09:04:06,433][model8_pretrain.py][INFO] Epoch:[0/2](373100/4588595) loss:2.335 lr:0.0000100 epoch_Time:26713.0min: [2024-01-04 09:04:06,433][model8_pretrain.py][INFO] Epoch:[0/2](373100/4588595) loss:3.022 lr:0.0000100 epoch_Time:26713.0min: [2024-01-04 09:04:43,374][model8_pretrain.py][INFO] Epoch:[0/2](373200/4588595) loss:2.880 lr:0.0000100 epoch_Time:26713.0min: [2024-01-04 09:04:43,374][model8_pretrain.py][INFO] Epoch:[0/2](373200/4588595) loss:2.681 lr:0.0000100 epoch_Time:26713.0min: [2024-01-04 09:04:43,374][model8_pretrain.py][INFO] Epoch:[0/2](373200/4588595) loss:3.415 lr:0.0000100 epoch_Time:26713.0min: [2024-01-04 09:04:43,374][model8_pretrain.py][INFO] Epoch:[0/2](373200/4588595) loss:2.570 lr:0.0000100 epoch_Time:26713.0min: [2024-01-04 09:04:43,374][model8_pretrain.py][INFO] Epoch:[0/2](373200/4588595) loss:3.251 lr:0.0000100 epoch_Time:26713.0min: [2024-01-04 09:04:43,374][model8_pretrain.py][INFO] Epoch:[0/2](373200/4588595) loss:3.116 lr:0.0000100 epoch_Time:26713.0min: [2024-01-04 09:04:43,374][model8_pretrain.py][INFO] Epoch:[0/2](373200/4588595) loss:2.801 lr:0.0000100 epoch_Time:26713.0min: [2024-01-04 09:04:43,374][model8_pretrain.py][INFO] Epoch:[0/2](373200/4588595) loss:3.001 lr:0.0000100 epoch_Time:26713.0min: [2024-01-04 09:05:20,323][model8_pretrain.py][INFO] Epoch:[0/2](373300/4588595) loss:2.755 lr:0.0000100 epoch_Time:26712.0min: [2024-01-04 09:05:20,323][model8_pretrain.py][INFO] Epoch:[0/2](373300/4588595) loss:2.568 lr:0.0000100 epoch_Time:26712.0min: [2024-01-04 09:05:20,323][model8_pretrain.py][INFO] Epoch:[0/2](373300/4588595) loss:2.813 lr:0.0000100 epoch_Time:26712.0min: [2024-01-04 09:05:20,323][model8_pretrain.py][INFO] Epoch:[0/2](373300/4588595) loss:2.933 lr:0.0000100 epoch_Time:26712.0min: [2024-01-04 09:05:20,323][model8_pretrain.py][INFO] Epoch:[0/2](373300/4588595) loss:3.207 lr:0.0000100 epoch_Time:26712.0min: [2024-01-04 09:05:20,323][model8_pretrain.py][INFO] 
Epoch:[0/2](373300/4588595) loss:2.649 lr:0.0000100 epoch_Time:26712.0min: [2024-01-04 09:05:20,323][model8_pretrain.py][INFO] Epoch:[0/2](373300/4588595) loss:3.167 lr:0.0000100 epoch_Time:26712.0min: [2024-01-04 09:05:20,323][model8_pretrain.py][INFO] Epoch:[0/2](373300/4588595) loss:3.284 lr:0.0000100 epoch_Time:26712.0min: [2024-01-04 09:05:59,036][model8_pretrain.py][INFO] Epoch:[0/2](373400/4588595) loss:3.013 lr:0.0000100 epoch_Time:26711.0min: [2024-01-04 09:05:59,036][model8_pretrain.py][INFO] Epoch:[0/2](373400/4588595) loss:3.029 lr:0.0000100 epoch_Time:26711.0min: [2024-01-04 09:05:59,036][model8_pretrain.py][INFO] Epoch:[0/2](373400/4588595) loss:2.636 lr:0.0000100 epoch_Time:26711.0min: [2024-01-04 09:05:59,036][model8_pretrain.py][INFO] Epoch:[0/2](373400/4588595) loss:2.704 lr:0.0000100 epoch_Time:26711.0min: [2024-01-04 09:05:59,036][model8_pretrain.py][INFO] Epoch:[0/2](373400/4588595) loss:2.827 lr:0.0000100 epoch_Time:26711.0min: [2024-01-04 09:05:59,041][model8_pretrain.py][INFO] Epoch:[0/2](373400/4588595) loss:2.760 lr:0.0000100 epoch_Time:26711.0min: [2024-01-04 09:05:59,041][model8_pretrain.py][INFO] Epoch:[0/2](373400/4588595) loss:2.867 lr:0.0000100 epoch_Time:26711.0min: [2024-01-04 09:05:59,041][model8_pretrain.py][INFO] Epoch:[0/2](373400/4588595) loss:2.638 lr:0.0000100 epoch_Time:26711.0min: [2024-01-04 09:06:44,316][model8_pretrain.py][INFO] Epoch:[0/2](373500/4588595) loss:3.099 lr:0.0000100 epoch_Time:26712.0min: [2024-01-04 09:06:44,316][model8_pretrain.py][INFO] Epoch:[0/2](373500/4588595) loss:2.682 lr:0.0000100 epoch_Time:26712.0min: [2024-01-04 09:06:44,316][model8_pretrain.py][INFO] Epoch:[0/2](373500/4588595) loss:2.771 lr:0.0000100 epoch_Time:26712.0min: [2024-01-04 09:06:44,316][model8_pretrain.py][INFO] Epoch:[0/2](373500/4588595) loss:2.969 lr:0.0000100 epoch_Time:26712.0min: [2024-01-04 09:06:44,316][model8_pretrain.py][INFO] Epoch:[0/2](373500/4588595) loss:2.755 lr:0.0000100 epoch_Time:26712.0min: [2024-01-04 
09:06:44,316][model8_pretrain.py][INFO] Epoch:[0/2](373500/4588595) loss:2.959 lr:0.0000100 epoch_Time:26712.0min: [2024-01-04 09:06:44,316][model8_pretrain.py][INFO] Epoch:[0/2](373500/4588595) loss:2.782 lr:0.0000100 epoch_Time:26712.0min: [2024-01-04 09:06:44,316][model8_pretrain.py][INFO] Epoch:[0/2](373500/4588595) loss:3.226 lr:0.0000100 epoch_Time:26712.0min: [2024-01-04 09:07:21,252][model8_pretrain.py][INFO] Epoch:[0/2](373600/4588595) loss:3.186 lr:0.0000100 epoch_Time:26711.0min: [2024-01-04 09:07:21,253][model8_pretrain.py][INFO] Epoch:[0/2](373600/4588595) loss:2.784 lr:0.0000100 epoch_Time:26711.0min: [2024-01-04 09:07:21,253][model8_pretrain.py][INFO] Epoch:[0/2](373600/4588595) loss:3.087 lr:0.0000100 epoch_Time:26711.0min: [2024-01-04 09:07:21,253][model8_pretrain.py][INFO] Epoch:[0/2](373600/4588595) loss:3.119 lr:0.0000100 epoch_Time:26711.0min: [2024-01-04 09:07:21,253][model8_pretrain.py][INFO] Epoch:[0/2](373600/4588595) loss:3.224 lr:0.0000100 epoch_Time:26711.0min: [2024-01-04 09:07:21,253][model8_pretrain.py][INFO] Epoch:[0/2](373600/4588595) loss:3.421 lr:0.0000100 epoch_Time:26711.0min: [2024-01-04 09:07:21,253][model8_pretrain.py][INFO] Epoch:[0/2](373600/4588595) loss:2.986 lr:0.0000100 epoch_Time:26711.0min: [2024-01-04 09:07:21,253][model8_pretrain.py][INFO] Epoch:[0/2](373600/4588595) loss:2.974 lr:0.0000100 epoch_Time:26711.0min: [2024-01-04 09:07:58,198][model8_pretrain.py][INFO] Epoch:[0/2](373700/4588595) loss:3.327 lr:0.0000100 epoch_Time:26710.0min: [2024-01-04 09:07:58,198][model8_pretrain.py][INFO] Epoch:[0/2](373700/4588595) loss:3.076 lr:0.0000100 epoch_Time:26710.0min: [2024-01-04 09:07:58,198][model8_pretrain.py][INFO] Epoch:[0/2](373700/4588595) loss:3.133 lr:0.0000100 epoch_Time:26710.0min: [2024-01-04 09:07:58,198][model8_pretrain.py][INFO] Epoch:[0/2](373700/4588595) loss:3.000 lr:0.0000100 epoch_Time:26710.0min: [2024-01-04 09:07:58,198][model8_pretrain.py][INFO] Epoch:[0/2](373700/4588595) loss:3.029 lr:0.0000100 
epoch_Time:26710.0min: [2024-01-04 09:07:58,198][model8_pretrain.py][INFO] Epoch:[0/2](373700/4588595) loss:2.813 lr:0.0000100 epoch_Time:26710.0min: [2024-01-04 09:07:58,198][model8_pretrain.py][INFO] Epoch:[0/2](373700/4588595) loss:2.722 lr:0.0000100 epoch_Time:26710.0min: [2024-01-04 09:07:58,198][model8_pretrain.py][INFO] Epoch:[0/2](373700/4588595) loss:2.246 lr:0.0000100 epoch_Time:26710.0min: [2024-01-04 09:08:35,137][model8_pretrain.py][INFO] Epoch:[0/2](373800/4588595) loss:2.347 lr:0.0000100 epoch_Time:26710.0min: [2024-01-04 09:08:35,137][model8_pretrain.py][INFO] Epoch:[0/2](373800/4588595) loss:3.251 lr:0.0000100 epoch_Time:26710.0min: [2024-01-04 09:08:35,137][model8_pretrain.py][INFO] Epoch:[0/2](373800/4588595) loss:2.592 lr:0.0000100 epoch_Time:26710.0min: [2024-01-04 09:08:35,137][model8_pretrain.py][INFO] Epoch:[0/2](373800/4588595) loss:2.685 lr:0.0000100 epoch_Time:26710.0min: [2024-01-04 09:08:35,137][model8_pretrain.py][INFO] Epoch:[0/2](373800/4588595) loss:2.884 lr:0.0000100 epoch_Time:26710.0min: [2024-01-04 09:08:35,137][model8_pretrain.py][INFO] Epoch:[0/2](373800/4588595) loss:2.542 lr:0.0000100 epoch_Time:26710.0min: [2024-01-04 09:08:35,137][model8_pretrain.py][INFO] Epoch:[0/2](373800/4588595) loss:3.076 lr:0.0000100 epoch_Time:26710.0min: [2024-01-04 09:08:35,137][model8_pretrain.py][INFO] Epoch:[0/2](373800/4588595) loss:3.317 lr:0.0000100 epoch_Time:26710.0min: [2024-01-04 09:09:12,089][model8_pretrain.py][INFO] Epoch:[0/2](373900/4588595) loss:2.622 lr:0.0000100 epoch_Time:26708.0min: [2024-01-04 09:09:12,090][model8_pretrain.py][INFO] Epoch:[0/2](373900/4588595) loss:2.673 lr:0.0000100 epoch_Time:26708.0min: [2024-01-04 09:09:12,090][model8_pretrain.py][INFO] Epoch:[0/2](373900/4588595) loss:2.897 lr:0.0000100 epoch_Time:26708.0min: [2024-01-04 09:09:12,090][model8_pretrain.py][INFO] Epoch:[0/2](373900/4588595) loss:2.673 lr:0.0000100 epoch_Time:26708.0min: [2024-01-04 09:09:12,090][model8_pretrain.py][INFO] 
Epoch:[0/2](373900/4588595) loss:3.249 lr:0.0000100 epoch_Time:26708.0min: [2024-01-04 09:09:12,090][model8_pretrain.py][INFO] Epoch:[0/2](373900/4588595) loss:3.150 lr:0.0000100 epoch_Time:26708.0min: [2024-01-04 09:09:12,090][model8_pretrain.py][INFO] Epoch:[0/2](373900/4588595) loss:2.644 lr:0.0000100 epoch_Time:26708.0min: [2024-01-04 09:09:12,090][model8_pretrain.py][INFO] Epoch:[0/2](373900/4588595) loss:2.847 lr:0.0000100 epoch_Time:26708.0min: [2024-01-04 09:09:49,029][model8_pretrain.py][INFO] Epoch:[0/2](374000/4588595) loss:2.313 lr:0.0000100 epoch_Time:26707.0min: [2024-01-04 09:09:49,029][model8_pretrain.py][INFO] Epoch:[0/2](374000/4588595) loss:2.783 lr:0.0000100 epoch_Time:26707.0min: [2024-01-04 09:09:49,029][model8_pretrain.py][INFO] Epoch:[0/2](374000/4588595) loss:3.009 lr:0.0000100 epoch_Time:26707.0min: [2024-01-04 09:09:49,029][model8_pretrain.py][INFO] Epoch:[0/2](374000/4588595) loss:3.067 lr:0.0000100 epoch_Time:26707.0min: [2024-01-04 09:09:49,029][model8_pretrain.py][INFO] Epoch:[0/2](374000/4588595) loss:3.200 lr:0.0000100 epoch_Time:26707.0min: [2024-01-04 09:09:49,029][model8_pretrain.py][INFO] Epoch:[0/2](374000/4588595) loss:3.184 lr:0.0000100 epoch_Time:26707.0min: [2024-01-04 09:09:49,029][model8_pretrain.py][INFO] Epoch:[0/2](374000/4588595) loss:2.830 lr:0.0000100 epoch_Time:26707.0min: [2024-01-04 09:09:49,029][model8_pretrain.py][INFO] Epoch:[0/2](374000/4588595) loss:3.173 lr:0.0000100 epoch_Time:26707.0min: [2024-01-04 09:10:25,970][model8_pretrain.py][INFO] Epoch:[0/2](374100/4588595) loss:2.936 lr:0.0000100 epoch_Time:26707.0min: [2024-01-04 09:10:25,970][model8_pretrain.py][INFO] Epoch:[0/2](374100/4588595) loss:3.063 lr:0.0000100 epoch_Time:26707.0min: [2024-01-04 09:10:25,970][model8_pretrain.py][INFO] Epoch:[0/2](374100/4588595) loss:2.808 lr:0.0000100 epoch_Time:26707.0min: [2024-01-04 09:10:25,970][model8_pretrain.py][INFO] Epoch:[0/2](374100/4588595) loss:2.977 lr:0.0000100 epoch_Time:26707.0min: [2024-01-04 
09:10:25,970][model8_pretrain.py][INFO] Epoch:[0/2](374100/4588595) loss:2.035 lr:0.0000100 epoch_Time:26707.0min: [2024-01-04 09:10:25,970][model8_pretrain.py][INFO] Epoch:[0/2](374100/4588595) loss:2.921 lr:0.0000100 epoch_Time:26707.0min: [2024-01-04 09:10:25,970][model8_pretrain.py][INFO] Epoch:[0/2](374100/4588595) loss:2.313 lr:0.0000100 epoch_Time:26707.0min: [2024-01-04 09:10:25,970][model8_pretrain.py][INFO] Epoch:[0/2](374100/4588595) loss:2.617 lr:0.0000100 epoch_Time:26707.0min: [2024-01-04 09:11:02,915][model8_pretrain.py][INFO] Epoch:[0/2](374200/4588595) loss:3.103 lr:0.0000100 epoch_Time:26706.0min: [2024-01-04 09:11:02,915][model8_pretrain.py][INFO] Epoch:[0/2](374200/4588595) loss:2.928 lr:0.0000100 epoch_Time:26706.0min: [2024-01-04 09:11:02,915][model8_pretrain.py][INFO] Epoch:[0/2](374200/4588595) loss:3.022 lr:0.0000100 epoch_Time:26706.0min: [2024-01-04 09:11:02,915][model8_pretrain.py][INFO] Epoch:[0/2](374200/4588595) loss:3.206 lr:0.0000100 epoch_Time:26706.0min: [2024-01-04 09:11:02,915][model8_pretrain.py][INFO] Epoch:[0/2](374200/4588595) loss:2.657 lr:0.0000100 epoch_Time:26706.0min: [2024-01-04 09:11:02,915][model8_pretrain.py][INFO] Epoch:[0/2](374200/4588595) loss:2.309 lr:0.0000100 epoch_Time:26706.0min: [2024-01-04 09:11:02,915][model8_pretrain.py][INFO] Epoch:[0/2](374200/4588595) loss:2.993 lr:0.0000100 epoch_Time:26706.0min: [2024-01-04 09:11:02,915][model8_pretrain.py][INFO] Epoch:[0/2](374200/4588595) loss:3.150 lr:0.0000100 epoch_Time:26706.0min: [2024-01-04 09:11:50,027][model8_pretrain.py][INFO] Epoch:[0/2](374300/4588595) loss:3.014 lr:0.0000100 epoch_Time:26707.0min: [2024-01-04 09:11:50,027][model8_pretrain.py][INFO] Epoch:[0/2](374300/4588595) loss:2.387 lr:0.0000100 epoch_Time:26707.0min: [2024-01-04 09:11:50,027][model8_pretrain.py][INFO] Epoch:[0/2](374300/4588595) loss:2.556 lr:0.0000100 epoch_Time:26707.0min: [2024-01-04 09:11:50,027][model8_pretrain.py][INFO] Epoch:[0/2](374300/4588595) loss:2.674 lr:0.0000100 
epoch_Time:26707.0min: [2024-01-04 09:11:50,027][model8_pretrain.py][INFO] Epoch:[0/2](374300/4588595) loss:3.078 lr:0.0000100 epoch_Time:26707.0min: [2024-01-04 09:11:50,027][model8_pretrain.py][INFO] Epoch:[0/2](374300/4588595) loss:2.914 lr:0.0000100 epoch_Time:26707.0min: [2024-01-04 09:11:50,027][model8_pretrain.py][INFO] Epoch:[0/2](374300/4588595) loss:2.660 lr:0.0000100 epoch_Time:26707.0min: [2024-01-04 09:11:50,027][model8_pretrain.py][INFO] Epoch:[0/2](374300/4588595) loss:2.678 lr:0.0000100 epoch_Time:26707.0min: [2024-01-04 09:12:26,953][model8_pretrain.py][INFO] Epoch:[0/2](374400/4588595) loss:2.702 lr:0.0000100 epoch_Time:26706.0min: [2024-01-04 09:12:26,953][model8_pretrain.py][INFO] Epoch:[0/2](374400/4588595) loss:2.398 lr:0.0000100 epoch_Time:26706.0min: [2024-01-04 09:12:26,953][model8_pretrain.py][INFO] Epoch:[0/2](374400/4588595) loss:2.523 lr:0.0000100 epoch_Time:26706.0min: [2024-01-04 09:12:26,953][model8_pretrain.py][INFO] Epoch:[0/2](374400/4588595) loss:2.663 lr:0.0000100 epoch_Time:26706.0min: [2024-01-04 09:12:26,953][model8_pretrain.py][INFO] Epoch:[0/2](374400/4588595) loss:3.113 lr:0.0000100 epoch_Time:26706.0min: [2024-01-04 09:12:26,953][model8_pretrain.py][INFO] Epoch:[0/2](374400/4588595) loss:2.215 lr:0.0000100 epoch_Time:26706.0min: [2024-01-04 09:12:26,953][model8_pretrain.py][INFO] Epoch:[0/2](374400/4588595) loss:2.442 lr:0.0000100 epoch_Time:26706.0min: [2024-01-04 09:12:26,954][model8_pretrain.py][INFO] Epoch:[0/2](374400/4588595) loss:3.089 lr:0.0000100 epoch_Time:26706.0min: [2024-01-04 09:13:03,903][model8_pretrain.py][INFO] Epoch:[0/2](374500/4588595) loss:2.177 lr:0.0000100 epoch_Time:26705.0min: [2024-01-04 09:13:03,904][model8_pretrain.py][INFO] Epoch:[0/2](374500/4588595) loss:2.571 lr:0.0000100 epoch_Time:26705.0min: [2024-01-04 09:13:03,904][model8_pretrain.py][INFO] Epoch:[0/2](374500/4588595) loss:2.956 lr:0.0000100 epoch_Time:26705.0min: [2024-01-04 09:13:03,904][model8_pretrain.py][INFO] 
Epoch:[0/2](374500/4588595) loss:2.985 lr:0.0000100 epoch_Time:26705.0min: [2024-01-04 09:13:03,904][model8_pretrain.py][INFO] Epoch:[0/2](374500/4588595) loss:2.939 lr:0.0000100 epoch_Time:26705.0min: [2024-01-04 09:13:03,904][model8_pretrain.py][INFO] Epoch:[0/2](374500/4588595) loss:2.409 lr:0.0000100 epoch_Time:26705.0min: [2024-01-04 09:13:03,904][model8_pretrain.py][INFO] Epoch:[0/2](374500/4588595) loss:2.910 lr:0.0000100 epoch_Time:26705.0min: [2024-01-04 09:13:03,904][model8_pretrain.py][INFO] Epoch:[0/2](374500/4588595) loss:3.031 lr:0.0000100 epoch_Time:26705.0min: [2024-01-04 09:13:40,841][model8_pretrain.py][INFO] Epoch:[0/2](374600/4588595) loss:3.320 lr:0.0000100 epoch_Time:26705.0min: [2024-01-04 09:13:40,841][model8_pretrain.py][INFO] Epoch:[0/2](374600/4588595) loss:2.781 lr:0.0000100 epoch_Time:26705.0min: [2024-01-04 09:13:40,841][model8_pretrain.py][INFO] Epoch:[0/2](374600/4588595) loss:2.587 lr:0.0000100 epoch_Time:26705.0min: [2024-01-04 09:13:40,841][model8_pretrain.py][INFO] Epoch:[0/2](374600/4588595) loss:2.469 lr:0.0000100 epoch_Time:26705.0min: [2024-01-04 09:13:40,841][model8_pretrain.py][INFO] Epoch:[0/2](374600/4588595) loss:3.181 lr:0.0000100 epoch_Time:26705.0min: [2024-01-04 09:13:40,841][model8_pretrain.py][INFO] Epoch:[0/2](374600/4588595) loss:3.382 lr:0.0000100 epoch_Time:26705.0min: [2024-01-04 09:13:40,841][model8_pretrain.py][INFO] Epoch:[0/2](374600/4588595) loss:2.978 lr:0.0000100 epoch_Time:26705.0min: [2024-01-04 09:13:40,841][model8_pretrain.py][INFO] Epoch:[0/2](374600/4588595) loss:2.886 lr:0.0000100 epoch_Time:26705.0min: [2024-01-04 09:14:17,799][model8_pretrain.py][INFO] Epoch:[0/2](374700/4588595) loss:3.442 lr:0.0000100 epoch_Time:26704.0min: [2024-01-04 09:14:17,799][model8_pretrain.py][INFO] Epoch:[0/2](374700/4588595) loss:2.850 lr:0.0000100 epoch_Time:26704.0min: [2024-01-04 09:14:17,799][model8_pretrain.py][INFO] Epoch:[0/2](374700/4588595) loss:2.872 lr:0.0000100 epoch_Time:26704.0min: [2024-01-04 
09:14:17,799][model8_pretrain.py][INFO] Epoch:[0/2](374700/4588595) loss:2.892 lr:0.0000100 epoch_Time:26704.0min: [2024-01-04 09:14:17,800][model8_pretrain.py][INFO] Epoch:[0/2](374700/4588595) loss:2.648 lr:0.0000100 epoch_Time:26704.0min: [2024-01-04 09:14:17,800][model8_pretrain.py][INFO] Epoch:[0/2](374700/4588595) loss:2.948 lr:0.0000100 epoch_Time:26704.0min: [2024-01-04 09:14:17,800][model8_pretrain.py][INFO] Epoch:[0/2](374700/4588595) loss:2.389 lr:0.0000100 epoch_Time:26704.0min: [2024-01-04 09:14:17,800][model8_pretrain.py][INFO] Epoch:[0/2](374700/4588595) loss:3.195 lr:0.0000100 epoch_Time:26704.0min: [2024-01-04 09:14:54,718][model8_pretrain.py][INFO] Epoch:[0/2](374800/4588595) loss:3.100 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:14:54,718][model8_pretrain.py][INFO] Epoch:[0/2](374800/4588595) loss:3.193 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:14:54,718][model8_pretrain.py][INFO] Epoch:[0/2](374800/4588595) loss:2.560 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:14:54,718][model8_pretrain.py][INFO] Epoch:[0/2](374800/4588595) loss:2.842 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:14:54,718][model8_pretrain.py][INFO] Epoch:[0/2](374800/4588595) loss:2.405 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:14:54,718][model8_pretrain.py][INFO] Epoch:[0/2](374800/4588595) loss:3.386 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:14:54,718][model8_pretrain.py][INFO] Epoch:[0/2](374800/4588595) loss:2.536 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:14:54,718][model8_pretrain.py][INFO] Epoch:[0/2](374800/4588595) loss:2.301 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:15:31,661][model8_pretrain.py][INFO] Epoch:[0/2](374900/4588595) loss:3.031 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:15:31,661][model8_pretrain.py][INFO] Epoch:[0/2](374900/4588595) loss:3.014 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:15:31,661][model8_pretrain.py][INFO] Epoch:[0/2](374900/4588595) loss:3.127 lr:0.0000100 
epoch_Time:26702.0min: [2024-01-04 09:15:31,661][model8_pretrain.py][INFO] Epoch:[0/2](374900/4588595) loss:2.579 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:15:31,661][model8_pretrain.py][INFO] Epoch:[0/2](374900/4588595) loss:3.197 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:15:31,661][model8_pretrain.py][INFO] Epoch:[0/2](374900/4588595) loss:3.123 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:15:31,661][model8_pretrain.py][INFO] Epoch:[0/2](374900/4588595) loss:2.912 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:15:31,661][model8_pretrain.py][INFO] Epoch:[0/2](374900/4588595) loss:3.324 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:16:08,576][model8_pretrain.py][INFO] Epoch:[0/2](375000/4588595) loss:2.719 lr:0.0000100 epoch_Time:26701.0min: [2024-01-04 09:16:08,576][model8_pretrain.py][INFO] Epoch:[0/2](375000/4588595) loss:3.218 lr:0.0000100 epoch_Time:26701.0min: [2024-01-04 09:16:08,576][model8_pretrain.py][INFO] Epoch:[0/2](375000/4588595) loss:3.195 lr:0.0000100 epoch_Time:26701.0min: [2024-01-04 09:16:08,576][model8_pretrain.py][INFO] Epoch:[0/2](375000/4588595) loss:2.592 lr:0.0000100 epoch_Time:26701.0min: [2024-01-04 09:16:08,576][model8_pretrain.py][INFO] Epoch:[0/2](375000/4588595) loss:3.104 lr:0.0000100 epoch_Time:26701.0min: [2024-01-04 09:16:08,576][model8_pretrain.py][INFO] Epoch:[0/2](375000/4588595) loss:3.288 lr:0.0000100 epoch_Time:26701.0min: [2024-01-04 09:16:08,576][model8_pretrain.py][INFO] Epoch:[0/2](375000/4588595) loss:2.717 lr:0.0000100 epoch_Time:26701.0min: [2024-01-04 09:16:08,576][model8_pretrain.py][INFO] Epoch:[0/2](375000/4588595) loss:2.638 lr:0.0000100 epoch_Time:26701.0min: [2024-01-04 09:16:55,716][model8_pretrain.py][INFO] Epoch:[0/2](375100/4588595) loss:2.676 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:16:55,716][model8_pretrain.py][INFO] Epoch:[0/2](375100/4588595) loss:2.866 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:16:55,716][model8_pretrain.py][INFO] 
Epoch:[0/2](375100/4588595) loss:2.586 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:16:55,716][model8_pretrain.py][INFO] Epoch:[0/2](375100/4588595) loss:2.648 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:16:55,716][model8_pretrain.py][INFO] Epoch:[0/2](375100/4588595) loss:2.674 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:16:55,716][model8_pretrain.py][INFO] Epoch:[0/2](375100/4588595) loss:2.922 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:16:55,716][model8_pretrain.py][INFO] Epoch:[0/2](375100/4588595) loss:2.733 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:16:55,716][model8_pretrain.py][INFO] Epoch:[0/2](375100/4588595) loss:2.936 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:17:32,648][model8_pretrain.py][INFO] Epoch:[0/2](375200/4588595) loss:3.070 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:17:32,648][model8_pretrain.py][INFO] Epoch:[0/2](375200/4588595) loss:3.070 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:17:32,648][model8_pretrain.py][INFO] Epoch:[0/2](375200/4588595) loss:3.282 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:17:32,648][model8_pretrain.py][INFO] Epoch:[0/2](375200/4588595) loss:2.338 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:17:32,648][model8_pretrain.py][INFO] Epoch:[0/2](375200/4588595) loss:2.990 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:17:32,648][model8_pretrain.py][INFO] Epoch:[0/2](375200/4588595) loss:3.090 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:17:32,648][model8_pretrain.py][INFO] Epoch:[0/2](375200/4588595) loss:3.116 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:17:32,649][model8_pretrain.py][INFO] Epoch:[0/2](375200/4588595) loss:2.399 lr:0.0000100 epoch_Time:26702.0min: [2024-01-04 09:18:09,562][model8_pretrain.py][INFO] Epoch:[0/2](375300/4588595) loss:2.882 lr:0.0000100 epoch_Time:26700.0min: [2024-01-04 09:18:09,562][model8_pretrain.py][INFO] Epoch:[0/2](375300/4588595) loss:3.097 lr:0.0000100 epoch_Time:26700.0min: [2024-01-04 
09:18:09,562][model8_pretrain.py][INFO] Epoch:[0/2](375300/4588595) loss:2.452 lr:0.0000100 epoch_Time:26700.0min: [2024-01-04 09:18:09,562][model8_pretrain.py][INFO] Epoch:[0/2](375300/4588595) loss:2.535 lr:0.0000100 epoch_Time:26700.0min: [2024-01-04 09:18:09,562][model8_pretrain.py][INFO] Epoch:[0/2](375300/4588595) loss:3.039 lr:0.0000100 epoch_Time:26700.0min: [2024-01-04 09:18:09,562][model8_pretrain.py][INFO] Epoch:[0/2](375300/4588595) loss:2.904 lr:0.0000100 epoch_Time:26700.0min: [2024-01-04 09:18:09,563][model8_pretrain.py][INFO] Epoch:[0/2](375300/4588595) loss:2.940 lr:0.0000100 epoch_Time:26700.0min: [2024-01-04 09:18:09,563][model8_pretrain.py][INFO] Epoch:[0/2](375300/4588595) loss:3.195 lr:0.0000100 epoch_Time:26700.0min: [2024-01-04 09:18:46,498][model8_pretrain.py][INFO] Epoch:[0/2](375400/4588595) loss:2.823 lr:0.0000100 epoch_Time:26700.0min: [2024-01-04 09:18:46,498][model8_pretrain.py][INFO] Epoch:[0/2](375400/4588595) loss:3.007 lr:0.0000100 epoch_Time:26700.0min: [2024-01-04 09:18:46,498][model8_pretrain.py][INFO] Epoch:[0/2](375400/4588595) loss:2.505 lr:0.0000100 epoch_Time:26700.0min: [2024-01-04 09:18:46,498][model8_pretrain.py][INFO] Epoch:[0/2](375400/4588595) loss:2.143 lr:0.0000100 epoch_Time:26700.0min: [2024-01-04 09:18:46,498][model8_pretrain.py][INFO] Epoch:[0/2](375400/4588595) loss:2.863 lr:0.0000100 epoch_Time:26700.0min: [2024-01-04 09:18:46,498][model8_pretrain.py][INFO] Epoch:[0/2](375400/4588595) loss:2.767 lr:0.0000100 epoch_Time:26700.0min: [2024-01-04 09:18:46,498][model8_pretrain.py][INFO] Epoch:[0/2](375400/4588595) loss:2.506 lr:0.0000100 epoch_Time:26700.0min: [2024-01-04 09:18:46,498][model8_pretrain.py][INFO] Epoch:[0/2](375400/4588595) loss:2.867 lr:0.0000100 epoch_Time:26700.0min: [2024-01-04 09:19:23,435][model8_pretrain.py][INFO] Epoch:[0/2](375500/4588595) loss:3.049 lr:0.0000100 epoch_Time:26699.0min: [2024-01-04 09:19:23,435][model8_pretrain.py][INFO] Epoch:[0/2](375500/4588595) loss:2.276 lr:0.0000100 
epoch_Time:26699.0min: [2024-01-04 09:19:23,435][model8_pretrain.py][INFO] Epoch:[0/2](375500/4588595) loss:2.797 lr:0.0000100 epoch_Time:26699.0min: [2024-01-04 09:19:23,435][model8_pretrain.py][INFO] Epoch:[0/2](375500/4588595) loss:3.167 lr:0.0000100 epoch_Time:26699.0min: [2024-01-04 09:19:23,435][model8_pretrain.py][INFO] Epoch:[0/2](375500/4588595) loss:3.273 lr:0.0000100 epoch_Time:26699.0min: [2024-01-04 09:19:23,435][model8_pretrain.py][INFO] Epoch:[0/2](375500/4588595) loss:2.941 lr:0.0000100 epoch_Time:26699.0min: [2024-01-04 09:19:23,435][model8_pretrain.py][INFO] Epoch:[0/2](375500/4588595) loss:2.988 lr:0.0000100 epoch_Time:26699.0min: [2024-01-04 09:19:23,435][model8_pretrain.py][INFO] Epoch:[0/2](375500/4588595) loss:3.043 lr:0.0000100 epoch_Time:26699.0min: [2024-01-04 09:20:00,375][model8_pretrain.py][INFO] Epoch:[0/2](375600/4588595) loss:2.975 lr:0.0000100 epoch_Time:26698.0min: [2024-01-04 09:20:00,375][model8_pretrain.py][INFO] Epoch:[0/2](375600/4588595) loss:2.795 lr:0.0000100 epoch_Time:26698.0min: [2024-01-04 09:20:00,375][model8_pretrain.py][INFO] Epoch:[0/2](375600/4588595) loss:2.926 lr:0.0000100 epoch_Time:26698.0min: [2024-01-04 09:20:00,375][model8_pretrain.py][INFO] Epoch:[0/2](375600/4588595) loss:2.779 lr:0.0000100 epoch_Time:26698.0min: [2024-01-04 09:20:00,375][model8_pretrain.py][INFO] Epoch:[0/2](375600/4588595) loss:2.194 lr:0.0000100 epoch_Time:26698.0min: [2024-01-04 09:20:00,375][model8_pretrain.py][INFO] Epoch:[0/2](375600/4588595) loss:3.330 lr:0.0000100 epoch_Time:26698.0min: [2024-01-04 09:20:00,376][model8_pretrain.py][INFO] Epoch:[0/2](375600/4588595) loss:3.154 lr:0.0000100 epoch_Time:26698.0min: [2024-01-04 09:20:00,376][model8_pretrain.py][INFO] Epoch:[0/2](375600/4588595) loss:2.891 lr:0.0000100 epoch_Time:26698.0min: [2024-01-04 09:20:37,304][model8_pretrain.py][INFO] Epoch:[0/2](375700/4588595) loss:3.012 lr:0.0000100 epoch_Time:26698.0min: [2024-01-04 09:20:37,305][model8_pretrain.py][INFO] 
Epoch:[0/2](375700/4588595) loss:2.686 lr:0.0000100 epoch_Time:26698.0min: [2024-01-04 09:20:37,305][model8_pretrain.py][INFO] Epoch:[0/2](375700/4588595) loss:2.985 lr:0.0000100 epoch_Time:26698.0min: [2024-01-04 09:20:37,305][model8_pretrain.py][INFO] Epoch:[0/2](375700/4588595) loss:2.876 lr:0.0000100 epoch_Time:26698.0min: [2024-01-04 09:20:37,305][model8_pretrain.py][INFO] Epoch:[0/2](375700/4588595) loss:3.161 lr:0.0000100 epoch_Time:26698.0min: [2024-01-04 09:20:37,305][model8_pretrain.py][INFO] Epoch:[0/2](375700/4588595) loss:2.859 lr:0.0000100 epoch_Time:26698.0min: [2024-01-04 09:20:37,305][model8_pretrain.py][INFO] Epoch:[0/2](375700/4588595) loss:2.897 lr:0.0000100 epoch_Time:26698.0min: [2024-01-04 09:20:37,305][model8_pretrain.py][INFO] Epoch:[0/2](375700/4588595) loss:3.174 lr:0.0000100 epoch_Time:26698.0min: [2024-01-04 09:21:14,248][model8_pretrain.py][INFO] Epoch:[0/2](375800/4588595) loss:2.918 lr:0.0000100 epoch_Time:26696.0min: [2024-01-04 09:21:14,248][model8_pretrain.py][INFO] Epoch:[0/2](375800/4588595) loss:2.551 lr:0.0000100 epoch_Time:26696.0min: [2024-01-04 09:21:14,248][model8_pretrain.py][INFO] Epoch:[0/2](375800/4588595) loss:3.150 lr:0.0000100 epoch_Time:26696.0min: [2024-01-04 09:21:14,248][model8_pretrain.py][INFO] Epoch:[0/2](375800/4588595) loss:2.365 lr:0.0000100 epoch_Time:26696.0min: [2024-01-04 09:21:14,248][model8_pretrain.py][INFO] Epoch:[0/2](375800/4588595) loss:3.121 lr:0.0000100 epoch_Time:26696.0min: [2024-01-04 09:21:14,248][model8_pretrain.py][INFO] Epoch:[0/2](375800/4588595) loss:3.398 lr:0.0000100 epoch_Time:26696.0min: [2024-01-04 09:21:14,248][model8_pretrain.py][INFO] Epoch:[0/2](375800/4588595) loss:2.650 lr:0.0000100 epoch_Time:26696.0min: [2024-01-04 09:21:14,248][model8_pretrain.py][INFO] Epoch:[0/2](375800/4588595) loss:2.103 lr:0.0000100 epoch_Time:26696.0min: [2024-01-04 09:22:01,526][model8_pretrain.py][INFO] Epoch:[0/2](375900/4588595) loss:2.495 lr:0.0000100 epoch_Time:26697.0min: [2024-01-04 
09:22:01,526][model8_pretrain.py][INFO] Epoch:[0/2](375900/4588595) loss:2.437 lr:0.0000100 epoch_Time:26697.0min: [2024-01-04 09:22:01,526][model8_pretrain.py][INFO] Epoch:[0/2](375900/4588595) loss:3.414 lr:0.0000100 epoch_Time:26697.0min: [2024-01-04 09:22:01,526][model8_pretrain.py][INFO] Epoch:[0/2](375900/4588595) loss:2.684 lr:0.0000100 epoch_Time:26697.0min: [2024-01-04 09:22:01,526][model8_pretrain.py][INFO] Epoch:[0/2](375900/4588595) loss:2.987 lr:0.0000100 epoch_Time:26697.0min: [2024-01-04 09:22:01,526][model8_pretrain.py][INFO] Epoch:[0/2](375900/4588595) loss:3.008 lr:0.0000100 epoch_Time:26697.0min: [2024-01-04 09:22:01,526][model8_pretrain.py][INFO] Epoch:[0/2](375900/4588595) loss:2.893 lr:0.0000100 epoch_Time:26697.0min: [2024-01-04 09:22:01,526][model8_pretrain.py][INFO] Epoch:[0/2](375900/4588595) loss:2.568 lr:0.0000100 epoch_Time:26697.0min: [2024-01-04 09:22:38,467][model8_pretrain.py][INFO] Epoch:[0/2](376000/4588595) loss:3.284 lr:0.0000100 epoch_Time:26697.0min: [2024-01-04 09:22:38,467][model8_pretrain.py][INFO] Epoch:[0/2](376000/4588595) loss:2.929 lr:0.0000100 epoch_Time:26697.0min: [2024-01-04 09:22:38,467][model8_pretrain.py][INFO] Epoch:[0/2](376000/4588595) loss:2.818 lr:0.0000100 epoch_Time:26697.0min: [2024-01-04 09:22:38,467][model8_pretrain.py][INFO] Epoch:[0/2](376000/4588595) loss:2.524 lr:0.0000100 epoch_Time:26697.0min: [2024-01-04 09:22:38,467][model8_pretrain.py][INFO] Epoch:[0/2](376000/4588595) loss:3.449 lr:0.0000100 epoch_Time:26697.0min: [2024-01-04 09:22:38,467][model8_pretrain.py][INFO] Epoch:[0/2](376000/4588595) loss:2.976 lr:0.0000100 epoch_Time:26697.0min: [2024-01-04 09:22:38,467][model8_pretrain.py][INFO] Epoch:[0/2](376000/4588595) loss:3.329 lr:0.0000100 epoch_Time:26697.0min: [2024-01-04 09:22:38,467][model8_pretrain.py][INFO] Epoch:[0/2](376000/4588595) loss:3.194 lr:0.0000100 epoch_Time:26697.0min: [2024-01-04 09:23:15,406][model8_pretrain.py][INFO] Epoch:[0/2](376100/4588595) loss:2.613 lr:0.0000100 
epoch_Time:26696.0min: [2024-01-04 09:23:15,406][model8_pretrain.py][INFO] Epoch:[0/2](376100/4588595) loss:3.144 lr:0.0000100 epoch_Time:26696.0min: [2024-01-04 09:23:15,406][model8_pretrain.py][INFO] Epoch:[0/2](376100/4588595) loss:2.671 lr:0.0000100 epoch_Time:26696.0min: [2024-01-04 09:23:15,406][model8_pretrain.py][INFO] Epoch:[0/2](376100/4588595) loss:2.802 lr:0.0000100 epoch_Time:26696.0min: [2024-01-04 09:23:15,405][model8_pretrain.py][INFO] Epoch:[0/2](376100/4588595) loss:2.697 lr:0.0000100 epoch_Time:26696.0min: [2024-01-04 09:23:15,406][model8_pretrain.py][INFO] Epoch:[0/2](376100/4588595) loss:3.252 lr:0.0000100 epoch_Time:26696.0min: [2024-01-04 09:23:15,406][model8_pretrain.py][INFO] Epoch:[0/2](376100/4588595) loss:2.939 lr:0.0000100 epoch_Time:26696.0min: [2024-01-04 09:23:15,406][model8_pretrain.py][INFO] Epoch:[0/2](376100/4588595) loss:3.137 lr:0.0000100 epoch_Time:26696.0min: [2024-01-04 09:23:52,352][model8_pretrain.py][INFO] Epoch:[0/2](376200/4588595) loss:2.747 lr:0.0000100 epoch_Time:26695.0min: [2024-01-04 09:23:52,352][model8_pretrain.py][INFO] Epoch:[0/2](376200/4588595) loss:2.731 lr:0.0000100 epoch_Time:26695.0min: [2024-01-04 09:23:52,352][model8_pretrain.py][INFO] Epoch:[0/2](376200/4588595) loss:2.608 lr:0.0000100 epoch_Time:26695.0min: [2024-01-04 09:23:52,352][model8_pretrain.py][INFO] Epoch:[0/2](376200/4588595) loss:2.660 lr:0.0000100 epoch_Time:26695.0min: [2024-01-04 09:23:52,352][model8_pretrain.py][INFO] Epoch:[0/2](376200/4588595) loss:2.261 lr:0.0000100 epoch_Time:26695.0min: [2024-01-04 09:23:52,352][model8_pretrain.py][INFO] Epoch:[0/2](376200/4588595) loss:2.949 lr:0.0000100 epoch_Time:26695.0min: [2024-01-04 09:23:52,352][model8_pretrain.py][INFO] Epoch:[0/2](376200/4588595) loss:2.523 lr:0.0000100 epoch_Time:26695.0min: [2024-01-04 09:23:52,353][model8_pretrain.py][INFO] Epoch:[0/2](376200/4588595) loss:2.620 lr:0.0000100 epoch_Time:26695.0min: [2024-01-04 09:24:29,286][model8_pretrain.py][INFO] 
Epoch:[0/2](376300/4588595) loss:2.818 lr:0.0000100 epoch_Time:26694.0min: [2024-01-04 09:24:29,286][model8_pretrain.py][INFO] Epoch:[0/2](376300/4588595) loss:3.008 lr:0.0000100 epoch_Time:26694.0min: [2024-01-04 09:24:29,286][model8_pretrain.py][INFO] Epoch:[0/2](376300/4588595) loss:3.206 lr:0.0000100 epoch_Time:26694.0min: [2024-01-04 09:24:29,286][model8_pretrain.py][INFO] Epoch:[0/2](376300/4588595) loss:3.115 lr:0.0000100 epoch_Time:26694.0min: [2024-01-04 09:24:29,286][model8_pretrain.py][INFO] Epoch:[0/2](376300/4588595) loss:2.737 lr:0.0000100 epoch_Time:26694.0min: [2024-01-04 09:24:29,286][model8_pretrain.py][INFO] Epoch:[0/2](376300/4588595) loss:2.710 lr:0.0000100 epoch_Time:26694.0min: [2024-01-04 09:24:29,286][model8_pretrain.py][INFO] Epoch:[0/2](376300/4588595) loss:2.593 lr:0.0000100 epoch_Time:26694.0min: [2024-01-04 09:24:29,286][model8_pretrain.py][INFO] Epoch:[0/2](376300/4588595) loss:3.206 lr:0.0000100 epoch_Time:26694.0min: [2024-01-04 09:25:06,222][model8_pretrain.py][INFO] Epoch:[0/2](376400/4588595) loss:2.582 lr:0.0000100 epoch_Time:26693.0min: [2024-01-04 09:25:06,222][model8_pretrain.py][INFO] Epoch:[0/2](376400/4588595) loss:2.669 lr:0.0000100 epoch_Time:26693.0min: [2024-01-04 09:25:06,222][model8_pretrain.py][INFO] Epoch:[0/2](376400/4588595) loss:2.740 lr:0.0000100 epoch_Time:26693.0min: [2024-01-04 09:25:06,222][model8_pretrain.py][INFO] Epoch:[0/2](376400/4588595) loss:2.531 lr:0.0000100 epoch_Time:26693.0min: [2024-01-04 09:25:06,222][model8_pretrain.py][INFO] Epoch:[0/2](376400/4588595) loss:2.840 lr:0.0000100 epoch_Time:26693.0min: [2024-01-04 09:25:06,222][model8_pretrain.py][INFO] Epoch:[0/2](376400/4588595) loss:2.894 lr:0.0000100 epoch_Time:26693.0min: [2024-01-04 09:25:06,222][model8_pretrain.py][INFO] Epoch:[0/2](376400/4588595) loss:2.231 lr:0.0000100 epoch_Time:26693.0min: [2024-01-04 09:25:06,223][model8_pretrain.py][INFO] Epoch:[0/2](376400/4588595) loss:3.215 lr:0.0000100 epoch_Time:26693.0min: [2024-01-04 
09:25:43,161][model8_pretrain.py][INFO] Epoch:[0/2](376500/4588595) loss:2.688 lr:0.0000100 epoch_Time:26693.0min: [2024-01-04 09:25:43,161][model8_pretrain.py][INFO] Epoch:[0/2](376500/4588595) loss:3.515 lr:0.0000100 epoch_Time:26693.0min: [2024-01-04 09:25:43,161][model8_pretrain.py][INFO] Epoch:[0/2](376500/4588595) loss:2.752 lr:0.0000100 epoch_Time:26693.0min: [2024-01-04 09:25:43,161][model8_pretrain.py][INFO] Epoch:[0/2](376500/4588595) loss:2.810 lr:0.0000100 epoch_Time:26693.0min: [2024-01-04 09:25:43,161][model8_pretrain.py][INFO] Epoch:[0/2](376500/4588595) loss:2.728 lr:0.0000100 epoch_Time:26693.0min: [2024-01-04 09:25:43,161][model8_pretrain.py][INFO] Epoch:[0/2](376500/4588595) loss:2.711 lr:0.0000100 epoch_Time:26693.0min: [2024-01-04 09:25:43,161][model8_pretrain.py][INFO] Epoch:[0/2](376500/4588595) loss:2.708 lr:0.0000100 epoch_Time:26693.0min: [2024-01-04 09:25:43,161][model8_pretrain.py][INFO] Epoch:[0/2](376500/4588595) loss:3.417 lr:0.0000100 epoch_Time:26693.0min: [2024-01-04 09:26:20,095][model8_pretrain.py][INFO] Epoch:[0/2](376600/4588595) loss:3.139 lr:0.0000100 epoch_Time:26692.0min: [2024-01-04 09:26:20,095][model8_pretrain.py][INFO] Epoch:[0/2](376600/4588595) loss:3.067 lr:0.0000100 epoch_Time:26692.0min: [2024-01-04 09:26:20,095][model8_pretrain.py][INFO] Epoch:[0/2](376600/4588595) loss:3.116 lr:0.0000100 epoch_Time:26692.0min: [2024-01-04 09:26:20,095][model8_pretrain.py][INFO] Epoch:[0/2](376600/4588595) loss:3.119 lr:0.0000100 epoch_Time:26692.0min: [2024-01-04 09:26:20,095][model8_pretrain.py][INFO] Epoch:[0/2](376600/4588595) loss:2.787 lr:0.0000100 epoch_Time:26692.0min: [2024-01-04 09:26:20,095][model8_pretrain.py][INFO] Epoch:[0/2](376600/4588595) loss:3.348 lr:0.0000100 epoch_Time:26692.0min: [2024-01-04 09:26:20,095][model8_pretrain.py][INFO] Epoch:[0/2](376600/4588595) loss:3.121 lr:0.0000100 epoch_Time:26692.0min: [2024-01-04 09:26:20,095][model8_pretrain.py][INFO] Epoch:[0/2](376600/4588595) loss:2.979 lr:0.0000100 
epoch_Time:26692.0min: [2024-01-04 09:27:07,475][model8_pretrain.py][INFO] Epoch:[0/2](376700/4588595) loss:2.810 lr:0.0000100 epoch_Time:26693.0min: [2024-01-04 09:27:07,475][model8_pretrain.py][INFO] Epoch:[0/2](376700/4588595) loss:2.906 lr:0.0000100 epoch_Time:26693.0min: [2024-01-04 09:27:07,475][model8_pretrain.py][INFO] Epoch:[0/2](376700/4588595) loss:2.742 lr:0.0000100 epoch_Time:26693.0min: [2024-01-04 09:27:07,475][model8_pretrain.py][INFO] Epoch:[0/2](376700/4588595) loss:2.335 lr:0.0000100 epoch_Time:26693.0min: [2024-01-04 09:27:07,475][model8_pretrain.py][INFO] Epoch:[0/2](376700/4588595) loss:2.491 lr:0.0000100 epoch_Time:26693.0min: [2024-01-04 09:27:07,475][model8_pretrain.py][INFO] Epoch:[0/2](376700/4588595) loss:3.463 lr:0.0000100 epoch_Time:26693.0min: [2024-01-04 09:27:07,475][model8_pretrain.py][INFO] Epoch:[0/2](376700/4588595) loss:2.756 lr:0.0000100 epoch_Time:26693.0min: [2024-01-04 09:27:07,475][model8_pretrain.py][INFO] Epoch:[0/2](376700/4588595) loss:2.889 lr:0.0000100 epoch_Time:26693.0min: [2024-01-04 09:27:44,397][model8_pretrain.py][INFO] Epoch:[0/2](376800/4588595) loss:2.693 lr:0.0000100 epoch_Time:26692.0min: [2024-01-04 09:27:44,397][model8_pretrain.py][INFO] Epoch:[0/2](376800/4588595) loss:2.768 lr:0.0000100 epoch_Time:26692.0min: [2024-01-04 09:27:44,397][model8_pretrain.py][INFO] Epoch:[0/2](376800/4588595) loss:2.591 lr:0.0000100 epoch_Time:26692.0min: [2024-01-04 09:27:44,397][model8_pretrain.py][INFO] Epoch:[0/2](376800/4588595) loss:2.741 lr:0.0000100 epoch_Time:26692.0min: [2024-01-04 09:27:44,397][model8_pretrain.py][INFO] Epoch:[0/2](376800/4588595) loss:3.029 lr:0.0000100 epoch_Time:26692.0min: [2024-01-04 09:27:44,397][model8_pretrain.py][INFO] Epoch:[0/2](376800/4588595) loss:2.394 lr:0.0000100 epoch_Time:26692.0min: [2024-01-04 09:27:44,397][model8_pretrain.py][INFO] Epoch:[0/2](376800/4588595) loss:2.670 lr:0.0000100 epoch_Time:26692.0min: [2024-01-04 09:27:44,397][model8_pretrain.py][INFO] 
Epoch:[0/2](376800/4588595) loss:2.384 lr:0.0000100 epoch_Time:26692.0min: [2024-01-04 09:28:21,329][model8_pretrain.py][INFO] Epoch:[0/2](376900/4588595) loss:3.198 lr:0.0000100 epoch_Time:26691.0min: [2024-01-04 09:28:21,329][model8_pretrain.py][INFO] Epoch:[0/2](376900/4588595) loss:2.591 lr:0.0000100 epoch_Time:26691.0min: [2024-01-04 09:28:21,329][model8_pretrain.py][INFO] Epoch:[0/2](376900/4588595) loss:2.702 lr:0.0000100 epoch_Time:26691.0min: [2024-01-04 09:28:21,329][model8_pretrain.py][INFO] Epoch:[0/2](376900/4588595) loss:2.878 lr:0.0000100 epoch_Time:26691.0min: [2024-01-04 09:28:21,329][model8_pretrain.py][INFO] Epoch:[0/2](376900/4588595) loss:2.470 lr:0.0000100 epoch_Time:26691.0min: [2024-01-04 09:28:21,329][model8_pretrain.py][INFO] Epoch:[0/2](376900/4588595) loss:2.512 lr:0.0000100 epoch_Time:26691.0min: [2024-01-04 09:28:21,329][model8_pretrain.py][INFO] Epoch:[0/2](376900/4588595) loss:2.674 lr:0.0000100 epoch_Time:26691.0min: [2024-01-04 09:28:21,329][model8_pretrain.py][INFO] Epoch:[0/2](376900/4588595) loss:3.268 lr:0.0000100 epoch_Time:26691.0min: [2024-01-04 09:28:58,261][model8_pretrain.py][INFO] Epoch:[0/2](377000/4588595) loss:2.754 lr:0.0000100 epoch_Time:26690.0min: [2024-01-04 09:28:58,261][model8_pretrain.py][INFO] Epoch:[0/2](377000/4588595) loss:3.011 lr:0.0000100 epoch_Time:26690.0min: [2024-01-04 09:28:58,261][model8_pretrain.py][INFO] Epoch:[0/2](377000/4588595) loss:3.048 lr:0.0000100 epoch_Time:26690.0min: [2024-01-04 09:28:58,261][model8_pretrain.py][INFO] Epoch:[0/2](377000/4588595) loss:2.776 lr:0.0000100 epoch_Time:26690.0min: [2024-01-04 09:28:58,261][model8_pretrain.py][INFO] Epoch:[0/2](377000/4588595) loss:3.195 lr:0.0000100 epoch_Time:26690.0min: [2024-01-04 09:28:58,261][model8_pretrain.py][INFO] Epoch:[0/2](377000/4588595) loss:2.578 lr:0.0000100 epoch_Time:26690.0min: [2024-01-04 09:28:58,261][model8_pretrain.py][INFO] Epoch:[0/2](377000/4588595) loss:2.732 lr:0.0000100 epoch_Time:26690.0min: [2024-01-04 
09:28:58,261][model8_pretrain.py][INFO] Epoch:[0/2](377000/4588595) loss:2.907 lr:0.0000100 epoch_Time:26690.0min: [2024-01-04 09:29:35,197][model8_pretrain.py][INFO] Epoch:[0/2](377100/4588595) loss:2.368 lr:0.0000100 epoch_Time:26690.0min: [2024-01-04 09:29:35,197][model8_pretrain.py][INFO] Epoch:[0/2](377100/4588595) loss:2.877 lr:0.0000100 epoch_Time:26690.0min: [2024-01-04 09:29:35,197][model8_pretrain.py][INFO] Epoch:[0/2](377100/4588595) loss:2.424 lr:0.0000100 epoch_Time:26690.0min: [2024-01-04 09:29:35,197][model8_pretrain.py][INFO] Epoch:[0/2](377100/4588595) loss:2.503 lr:0.0000100 epoch_Time:26690.0min: [2024-01-04 09:29:35,197][model8_pretrain.py][INFO] Epoch:[0/2](377100/4588595) loss:2.612 lr:0.0000100 epoch_Time:26690.0min: [2024-01-04 09:29:35,197][model8_pretrain.py][INFO] Epoch:[0/2](377100/4588595) loss:3.401 lr:0.0000100 epoch_Time:26690.0min: [2024-01-04 09:29:35,197][model8_pretrain.py][INFO] Epoch:[0/2](377100/4588595) loss:2.945 lr:0.0000100 epoch_Time:26690.0min: [2024-01-04 09:29:35,197][model8_pretrain.py][INFO] Epoch:[0/2](377100/4588595) loss:2.819 lr:0.0000100 epoch_Time:26690.0min: [2024-01-04 09:30:12,178][model8_pretrain.py][INFO] Epoch:[0/2](377200/4588595) loss:2.716 lr:0.0000100 epoch_Time:26688.0min: [2024-01-04 09:30:12,178][model8_pretrain.py][INFO] Epoch:[0/2](377200/4588595) loss:3.346 lr:0.0000100 epoch_Time:26688.0min: [2024-01-04 09:30:12,178][model8_pretrain.py][INFO] Epoch:[0/2](377200/4588595) loss:2.487 lr:0.0000100 epoch_Time:26688.0min: [2024-01-04 09:30:12,178][model8_pretrain.py][INFO] Epoch:[0/2](377200/4588595) loss:2.716 lr:0.0000100 epoch_Time:26688.0min: [2024-01-04 09:30:12,178][model8_pretrain.py][INFO] Epoch:[0/2](377200/4588595) loss:3.033 lr:0.0000100 epoch_Time:26688.0min: [2024-01-04 09:30:12,178][model8_pretrain.py][INFO] Epoch:[0/2](377200/4588595) loss:2.742 lr:0.0000100 epoch_Time:26688.0min: [2024-01-04 09:30:12,178][model8_pretrain.py][INFO] Epoch:[0/2](377200/4588595) loss:2.361 lr:0.0000100 
epoch_Time:26688.0min: [2024-01-04 09:30:12,178][model8_pretrain.py][INFO] Epoch:[0/2](377200/4588595) loss:3.293 lr:0.0000100 epoch_Time:26688.0min: [2024-01-04 09:30:49,120][model8_pretrain.py][INFO] Epoch:[0/2](377300/4588595) loss:3.130 lr:0.0000100 epoch_Time:26687.0min: [2024-01-04 09:30:49,120][model8_pretrain.py][INFO] Epoch:[0/2](377300/4588595) loss:2.448 lr:0.0000100 epoch_Time:26687.0min: [2024-01-04 09:30:49,120][model8_pretrain.py][INFO] Epoch:[0/2](377300/4588595) loss:2.900 lr:0.0000100 epoch_Time:26687.0min: [2024-01-04 09:30:49,120][model8_pretrain.py][INFO] Epoch:[0/2](377300/4588595) loss:2.495 lr:0.0000100 epoch_Time:26687.0min: [2024-01-04 09:30:49,120][model8_pretrain.py][INFO] Epoch:[0/2](377300/4588595) loss:3.012 lr:0.0000100 epoch_Time:26687.0min: [2024-01-04 09:30:49,120][model8_pretrain.py][INFO] Epoch:[0/2](377300/4588595) loss:3.146 lr:0.0000100 epoch_Time:26687.0min: [2024-01-04 09:30:49,120][model8_pretrain.py][INFO] Epoch:[0/2](377300/4588595) loss:2.280 lr:0.0000100 epoch_Time:26687.0min: [2024-01-04 09:30:49,120][model8_pretrain.py][INFO] Epoch:[0/2](377300/4588595) loss:2.780 lr:0.0000100 epoch_Time:26687.0min: [2024-01-04 09:31:26,056][model8_pretrain.py][INFO] Epoch:[0/2](377400/4588595) loss:2.939 lr:0.0000100 epoch_Time:26687.0min: [2024-01-04 09:31:26,056][model8_pretrain.py][INFO] Epoch:[0/2](377400/4588595) loss:2.651 lr:0.0000100 epoch_Time:26687.0min: [2024-01-04 09:31:26,056][model8_pretrain.py][INFO] Epoch:[0/2](377400/4588595) loss:2.574 lr:0.0000100 epoch_Time:26687.0min: [2024-01-04 09:31:26,056][model8_pretrain.py][INFO] Epoch:[0/2](377400/4588595) loss:3.081 lr:0.0000100 epoch_Time:26687.0min: [2024-01-04 09:31:26,056][model8_pretrain.py][INFO] Epoch:[0/2](377400/4588595) loss:2.939 lr:0.0000100 epoch_Time:26687.0min: [2024-01-04 09:31:26,056][model8_pretrain.py][INFO] Epoch:[0/2](377400/4588595) loss:2.429 lr:0.0000100 epoch_Time:26687.0min: [2024-01-04 09:31:26,056][model8_pretrain.py][INFO] 
Epoch:[0/2](377400/4588595) loss:2.824 lr:0.0000100 epoch_Time:26687.0min: [2024-01-04 09:31:26,056][model8_pretrain.py][INFO] Epoch:[0/2](377400/4588595) loss:2.849 lr:0.0000100 epoch_Time:26687.0min: [2024-01-04 09:32:13,327][model8_pretrain.py][INFO] Epoch:[0/2](377500/4588595) loss:2.715 lr:0.0000100 epoch_Time:26688.0min: [2024-01-04 09:32:13,327][model8_pretrain.py][INFO] Epoch:[0/2](377500/4588595) loss:2.658 lr:0.0000100 epoch_Time:26688.0min: [2024-01-04 09:32:13,327][model8_pretrain.py][INFO] Epoch:[0/2](377500/4588595) loss:3.004 lr:0.0000100 epoch_Time:26688.0min: [2024-01-04 09:32:13,327][model8_pretrain.py][INFO] Epoch:[0/2](377500/4588595) loss:3.262 lr:0.0000100 epoch_Time:26688.0min: [2024-01-04 09:32:13,327][model8_pretrain.py][INFO] Epoch:[0/2](377500/4588595) loss:2.424 lr:0.0000100 epoch_Time:26688.0min: [2024-01-04 09:32:13,327][model8_pretrain.py][INFO] Epoch:[0/2](377500/4588595) loss:2.869 lr:0.0000100 epoch_Time:26688.0min: [2024-01-04 09:32:13,327][model8_pretrain.py][INFO] Epoch:[0/2](377500/4588595) loss:2.994 lr:0.0000100 epoch_Time:26688.0min: [2024-01-04 09:32:13,327][model8_pretrain.py][INFO] Epoch:[0/2](377500/4588595) loss:3.297 lr:0.0000100 epoch_Time:26688.0min: [2024-01-04 09:32:50,263][model8_pretrain.py][INFO] Epoch:[0/2](377600/4588595) loss:3.128 lr:0.0000100 epoch_Time:26687.0min: [2024-01-04 09:32:50,263][model8_pretrain.py][INFO] Epoch:[0/2](377600/4588595) loss:2.615 lr:0.0000100 epoch_Time:26687.0min: [2024-01-04 09:32:50,263][model8_pretrain.py][INFO] Epoch:[0/2](377600/4588595) loss:2.507 lr:0.0000100 epoch_Time:26687.0min: [2024-01-04 09:32:50,263][model8_pretrain.py][INFO] Epoch:[0/2](377600/4588595) loss:2.675 lr:0.0000100 epoch_Time:26687.0min: [2024-01-04 09:32:50,263][model8_pretrain.py][INFO] Epoch:[0/2](377600/4588595) loss:3.010 lr:0.0000100 epoch_Time:26687.0min: [2024-01-04 09:32:50,263][model8_pretrain.py][INFO] Epoch:[0/2](377600/4588595) loss:3.104 lr:0.0000100 epoch_Time:26687.0min: [2024-01-04 
09:32:50,263][model8_pretrain.py][INFO] Epoch:[0/2](377600/4588595) loss:2.619 lr:0.0000100 epoch_Time:26687.0min: [2024-01-04 09:32:50,263][model8_pretrain.py][INFO] Epoch:[0/2](377600/4588595) loss:2.972 lr:0.0000100 epoch_Time:26687.0min: [2024-01-04 09:33:27,197][model8_pretrain.py][INFO] Epoch:[0/2](377700/4588595) loss:3.155 lr:0.0000100 epoch_Time:26686.0min: [2024-01-04 09:33:27,197][model8_pretrain.py][INFO] Epoch:[0/2](377700/4588595) loss:3.111 lr:0.0000100 epoch_Time:26686.0min: [2024-01-04 09:33:27,197][model8_pretrain.py][INFO] Epoch:[0/2](377700/4588595) loss:2.921 lr:0.0000100 epoch_Time:26686.0min: [2024-01-04 09:33:27,197][model8_pretrain.py][INFO] Epoch:[0/2](377700/4588595) loss:3.285 lr:0.0000100 epoch_Time:26686.0min: [2024-01-04 09:33:27,197][model8_pretrain.py][INFO] Epoch:[0/2](377700/4588595) loss:2.708 lr:0.0000100 epoch_Time:26686.0min: [2024-01-04 09:33:27,197][model8_pretrain.py][INFO] Epoch:[0/2](377700/4588595) loss:3.000 lr:0.0000100 epoch_Time:26686.0min: [2024-01-04 09:33:27,197][model8_pretrain.py][INFO] Epoch:[0/2](377700/4588595) loss:3.309 lr:0.0000100 epoch_Time:26686.0min: [2024-01-04 09:33:27,197][model8_pretrain.py][INFO] Epoch:[0/2](377700/4588595) loss:2.556 lr:0.0000100 epoch_Time:26686.0min: [2024-01-04 09:34:04,135][model8_pretrain.py][INFO] Epoch:[0/2](377800/4588595) loss:3.089 lr:0.0000100 epoch_Time:26685.0min: [2024-01-04 09:34:04,135][model8_pretrain.py][INFO] Epoch:[0/2](377800/4588595) loss:3.142 lr:0.0000100 epoch_Time:26685.0min: [2024-01-04 09:34:04,135][model8_pretrain.py][INFO] Epoch:[0/2](377800/4588595) loss:2.882 lr:0.0000100 epoch_Time:26685.0min: [2024-01-04 09:34:04,135][model8_pretrain.py][INFO] Epoch:[0/2](377800/4588595) loss:2.923 lr:0.0000100 epoch_Time:26685.0min: [2024-01-04 09:34:04,136][model8_pretrain.py][INFO] Epoch:[0/2](377800/4588595) loss:2.695 lr:0.0000100 epoch_Time:26685.0min: [2024-01-04 09:34:04,136][model8_pretrain.py][INFO] Epoch:[0/2](377800/4588595) loss:3.104 lr:0.0000100 
epoch_Time:26685.0min: [2024-01-04 09:34:04,136][model8_pretrain.py][INFO] Epoch:[0/2](377800/4588595) loss:2.726 lr:0.0000100 epoch_Time:26685.0min: [2024-01-04 09:34:04,136][model8_pretrain.py][INFO] Epoch:[0/2](377800/4588595) loss:2.688 lr:0.0000100 epoch_Time:26685.0min: [2024-01-04 09:34:41,085][model8_pretrain.py][INFO] Epoch:[0/2](377900/4588595) loss:3.135 lr:0.0000100 epoch_Time:26685.0min: [2024-01-04 09:34:41,085][model8_pretrain.py][INFO] Epoch:[0/2](377900/4588595) loss:3.014 lr:0.0000100 epoch_Time:26685.0min: [2024-01-04 09:34:41,085][model8_pretrain.py][INFO] Epoch:[0/2](377900/4588595) loss:2.957 lr:0.0000100 epoch_Time:26685.0min: [2024-01-04 09:34:41,085][model8_pretrain.py][INFO] Epoch:[0/2](377900/4588595) loss:3.321 lr:0.0000100 epoch_Time:26685.0min: [2024-01-04 09:34:41,085][model8_pretrain.py][INFO] Epoch:[0/2](377900/4588595) loss:2.961 lr:0.0000100 epoch_Time:26685.0min: [2024-01-04 09:34:41,085][model8_pretrain.py][INFO] Epoch:[0/2](377900/4588595) loss:2.690 lr:0.0000100 epoch_Time:26685.0min: [2024-01-04 09:34:41,085][model8_pretrain.py][INFO] Epoch:[0/2](377900/4588595) loss:2.464 lr:0.0000100 epoch_Time:26685.0min: [2024-01-04 09:34:41,086][model8_pretrain.py][INFO] Epoch:[0/2](377900/4588595) loss:2.787 lr:0.0000100 epoch_Time:26685.0min: [2024-01-04 09:35:18,023][model8_pretrain.py][INFO] Epoch:[0/2](378000/4588595) loss:2.716 lr:0.0000100 epoch_Time:26684.0min: [2024-01-04 09:35:18,023][model8_pretrain.py][INFO] Epoch:[0/2](378000/4588595) loss:2.936 lr:0.0000100 epoch_Time:26684.0min: [2024-01-04 09:35:18,023][model8_pretrain.py][INFO] Epoch:[0/2](378000/4588595) loss:3.025 lr:0.0000100 epoch_Time:26684.0min: [2024-01-04 09:35:18,023][model8_pretrain.py][INFO] Epoch:[0/2](378000/4588595) loss:2.425 lr:0.0000100 epoch_Time:26684.0min: [2024-01-04 09:35:18,023][model8_pretrain.py][INFO] Epoch:[0/2](378000/4588595) loss:2.786 lr:0.0000100 epoch_Time:26684.0min: [2024-01-04 09:35:18,023][model8_pretrain.py][INFO] 
Epoch:[0/2](378000/4588595) loss:3.259 lr:0.0000100 epoch_Time:26684.0min: [2024-01-04 09:35:18,023][model8_pretrain.py][INFO] Epoch:[0/2](378000/4588595) loss:2.655 lr:0.0000100 epoch_Time:26684.0min: [2024-01-04 09:35:18,023][model8_pretrain.py][INFO] Epoch:[0/2](378000/4588595) loss:2.998 lr:0.0000100 epoch_Time:26684.0min: [2024-01-04 09:35:54,958][model8_pretrain.py][INFO] Epoch:[0/2](378100/4588595) loss:2.319 lr:0.0000100 epoch_Time:26683.0min: [2024-01-04 09:35:54,958][model8_pretrain.py][INFO] Epoch:[0/2](378100/4588595) loss:2.762 lr:0.0000100 epoch_Time:26683.0min: [2024-01-04 09:35:54,958][model8_pretrain.py][INFO] Epoch:[0/2](378100/4588595) loss:2.939 lr:0.0000100 epoch_Time:26683.0min: [2024-01-04 09:35:54,958][model8_pretrain.py][INFO] Epoch:[0/2](378100/4588595) loss:3.089 lr:0.0000100 epoch_Time:26683.0min: [2024-01-04 09:35:54,958][model8_pretrain.py][INFO] Epoch:[0/2](378100/4588595) loss:3.540 lr:0.0000100 epoch_Time:26683.0min: [2024-01-04 09:35:54,958][model8_pretrain.py][INFO] Epoch:[0/2](378100/4588595) loss:3.179 lr:0.0000100 epoch_Time:26683.0min: [2024-01-04 09:35:54,958][model8_pretrain.py][INFO] Epoch:[0/2](378100/4588595) loss:3.077 lr:0.0000100 epoch_Time:26683.0min: [2024-01-04 09:35:54,958][model8_pretrain.py][INFO] Epoch:[0/2](378100/4588595) loss:2.840 lr:0.0000100 epoch_Time:26683.0min: [2024-01-04 09:36:31,889][model8_pretrain.py][INFO] Epoch:[0/2](378200/4588595) loss:3.510 lr:0.0000100 epoch_Time:26682.0min: [2024-01-04 09:36:31,889][model8_pretrain.py][INFO] Epoch:[0/2](378200/4588595) loss:3.202 lr:0.0000100 epoch_Time:26682.0min: [2024-01-04 09:36:31,889][model8_pretrain.py][INFO] Epoch:[0/2](378200/4588595) loss:3.110 lr:0.0000100 epoch_Time:26682.0min: [2024-01-04 09:36:31,889][model8_pretrain.py][INFO] Epoch:[0/2](378200/4588595) loss:2.943 lr:0.0000100 epoch_Time:26682.0min: [2024-01-04 09:36:31,889][model8_pretrain.py][INFO] Epoch:[0/2](378200/4588595) loss:2.782 lr:0.0000100 epoch_Time:26682.0min: [2024-01-04 
09:36:31,889][model8_pretrain.py][INFO] Epoch:[0/2](378200/4588595) loss:3.017 lr:0.0000100 epoch_Time:26682.0min: [2024-01-04 09:36:31,889][model8_pretrain.py][INFO] Epoch:[0/2](378200/4588595) loss:2.768 lr:0.0000100 epoch_Time:26682.0min: [2024-01-04 09:36:31,889][model8_pretrain.py][INFO] Epoch:[0/2](378200/4588595) loss:2.473 lr:0.0000100 epoch_Time:26682.0min: [2024-01-04 09:37:19,346][model8_pretrain.py][INFO] Epoch:[0/2](378300/4588595) loss:3.285 lr:0.0000100 epoch_Time:26683.0min: [2024-01-04 09:37:19,346][model8_pretrain.py][INFO] Epoch:[0/2](378300/4588595) loss:2.654 lr:0.0000100 epoch_Time:26683.0min: [2024-01-04 09:37:19,346][model8_pretrain.py][INFO] Epoch:[0/2](378300/4588595) loss:2.981 lr:0.0000100 epoch_Time:26683.0min: [2024-01-04 09:37:19,346][model8_pretrain.py][INFO] Epoch:[0/2](378300/4588595) loss:2.875 lr:0.0000100 epoch_Time:26683.0min: [2024-01-04 09:37:19,347][model8_pretrain.py][INFO] Epoch:[0/2](378300/4588595) loss:3.313 lr:0.0000100 epoch_Time:26683.0min: [2024-01-04 09:37:19,347][model8_pretrain.py][INFO] Epoch:[0/2](378300/4588595) loss:2.822 lr:0.0000100 epoch_Time:26683.0min: [2024-01-04 09:37:19,347][model8_pretrain.py][INFO] Epoch:[0/2](378300/4588595) loss:2.973 lr:0.0000100 epoch_Time:26683.0min: [2024-01-04 09:37:19,347][model8_pretrain.py][INFO] Epoch:[0/2](378300/4588595) loss:2.585 lr:0.0000100 epoch_Time:26683.0min: [2024-01-04 09:37:56,261][model8_pretrain.py][INFO] Epoch:[0/2](378400/4588595) loss:2.372 lr:0.0000100 epoch_Time:26682.0min: [2024-01-04 09:37:56,261][model8_pretrain.py][INFO] Epoch:[0/2](378400/4588595) loss:2.961 lr:0.0000100 epoch_Time:26682.0min: [2024-01-04 09:37:56,261][model8_pretrain.py][INFO] Epoch:[0/2](378400/4588595) loss:3.254 lr:0.0000100 epoch_Time:26682.0min: [2024-01-04 09:37:56,261][model8_pretrain.py][INFO] Epoch:[0/2](378400/4588595) loss:2.766 lr:0.0000100 epoch_Time:26682.0min: [2024-01-04 09:37:56,261][model8_pretrain.py][INFO] Epoch:[0/2](378400/4588595) loss:2.551 lr:0.0000100 
epoch_Time:26682.0min: [2024-01-04 09:37:56,261][model8_pretrain.py][INFO] Epoch:[0/2](378400/4588595) loss:3.078 lr:0.0000100 epoch_Time:26682.0min: [2024-01-04 09:37:56,261][model8_pretrain.py][INFO] Epoch:[0/2](378400/4588595) loss:3.362 lr:0.0000100 epoch_Time:26682.0min: [2024-01-04 09:37:56,261][model8_pretrain.py][INFO] Epoch:[0/2](378400/4588595) loss:2.252 lr:0.0000100 epoch_Time:26682.0min: [2024-01-04 09:38:33,189][model8_pretrain.py][INFO] Epoch:[0/2](378500/4588595) loss:2.413 lr:0.0000100 epoch_Time:26682.0min: [2024-01-04 09:38:33,189][model8_pretrain.py][INFO] Epoch:[0/2](378500/4588595) loss:2.464 lr:0.0000100 epoch_Time:26682.0min: [2024-01-04 09:38:33,189][model8_pretrain.py][INFO] Epoch:[0/2](378500/4588595) loss:2.907 lr:0.0000100 epoch_Time:26682.0min: [2024-01-04 09:38:33,189][model8_pretrain.py][INFO] Epoch:[0/2](378500/4588595) loss:2.969 lr:0.0000100 epoch_Time:26682.0min: [2024-01-04 09:38:33,189][model8_pretrain.py][INFO] Epoch:[0/2](378500/4588595) loss:3.052 lr:0.0000100 epoch_Time:26682.0min: [2024-01-04 09:38:33,189][model8_pretrain.py][INFO] Epoch:[0/2](378500/4588595) loss:3.294 lr:0.0000100 epoch_Time:26682.0min: [2024-01-04 09:38:33,189][model8_pretrain.py][INFO] Epoch:[0/2](378500/4588595) loss:2.901 lr:0.0000100 epoch_Time:26682.0min: [2024-01-04 09:38:33,190][model8_pretrain.py][INFO] Epoch:[0/2](378500/4588595) loss:2.688 lr:0.0000100 epoch_Time:26682.0min: [2024-01-04 09:39:10,119][model8_pretrain.py][INFO] Epoch:[0/2](378600/4588595) loss:2.994 lr:0.0000100 epoch_Time:26681.0min: [2024-01-04 09:39:10,119][model8_pretrain.py][INFO] Epoch:[0/2](378600/4588595) loss:3.234 lr:0.0000100 epoch_Time:26681.0min: [2024-01-04 09:39:10,119][model8_pretrain.py][INFO] Epoch:[0/2](378600/4588595) loss:3.177 lr:0.0000100 epoch_Time:26681.0min: [2024-01-04 09:39:10,119][model8_pretrain.py][INFO] Epoch:[0/2](378600/4588595) loss:3.362 lr:0.0000100 epoch_Time:26681.0min: [2024-01-04 09:39:10,119][model8_pretrain.py][INFO] 
Epoch:[0/2](378600/4588595) loss:2.832 lr:0.0000100 epoch_Time:26681.0min: [2024-01-04 09:39:10,119][model8_pretrain.py][INFO] Epoch:[0/2](378600/4588595) loss:2.489 lr:0.0000100 epoch_Time:26681.0min: [2024-01-04 09:39:10,119][model8_pretrain.py][INFO] Epoch:[0/2](378600/4588595) loss:2.392 lr:0.0000100 epoch_Time:26681.0min: [2024-01-04 09:39:10,119][model8_pretrain.py][INFO] Epoch:[0/2](378600/4588595) loss:3.043 lr:0.0000100 epoch_Time:26681.0min: [2024-01-04 09:39:47,049][model8_pretrain.py][INFO] Epoch:[0/2](378700/4588595) loss:2.922 lr:0.0000100 epoch_Time:26680.0min: [2024-01-04 09:39:47,049][model8_pretrain.py][INFO] Epoch:[0/2](378700/4588595) loss:2.501 lr:0.0000100 epoch_Time:26680.0min: [2024-01-04 09:39:47,049][model8_pretrain.py][INFO] Epoch:[0/2](378700/4588595) loss:3.034 lr:0.0000100 epoch_Time:26680.0min: [2024-01-04 09:39:47,049][model8_pretrain.py][INFO] Epoch:[0/2](378700/4588595) loss:2.864 lr:0.0000100 epoch_Time:26680.0min: [2024-01-04 09:39:47,049][model8_pretrain.py][INFO] Epoch:[0/2](378700/4588595) loss:2.558 lr:0.0000100 epoch_Time:26680.0min: [2024-01-04 09:39:47,049][model8_pretrain.py][INFO] Epoch:[0/2](378700/4588595) loss:3.032 lr:0.0000100 epoch_Time:26680.0min: [2024-01-04 09:39:47,049][model8_pretrain.py][INFO] Epoch:[0/2](378700/4588595) loss:2.596 lr:0.0000100 epoch_Time:26680.0min: [2024-01-04 09:39:47,049][model8_pretrain.py][INFO] Epoch:[0/2](378700/4588595) loss:3.124 lr:0.0000100 epoch_Time:26680.0min: [2024-01-04 09:40:23,980][model8_pretrain.py][INFO] Epoch:[0/2](378800/4588595) loss:2.929 lr:0.0000100 epoch_Time:26679.0min: [2024-01-04 09:40:23,980][model8_pretrain.py][INFO] Epoch:[0/2](378800/4588595) loss:3.075 lr:0.0000100 epoch_Time:26679.0min: [2024-01-04 09:40:23,980][model8_pretrain.py][INFO] Epoch:[0/2](378800/4588595) loss:2.839 lr:0.0000100 epoch_Time:26679.0min: [2024-01-04 09:40:23,980][model8_pretrain.py][INFO] Epoch:[0/2](378800/4588595) loss:2.309 lr:0.0000100 epoch_Time:26679.0min: [2024-01-04 
09:40:23,980][model8_pretrain.py][INFO] Epoch:[0/2](378800/4588595) loss:2.852 lr:0.0000100 epoch_Time:26679.0min: [2024-01-04 09:40:23,980][model8_pretrain.py][INFO] Epoch:[0/2](378800/4588595) loss:3.037 lr:0.0000100 epoch_Time:26679.0min: [2024-01-04 09:40:23,980][model8_pretrain.py][INFO] Epoch:[0/2](378800/4588595) loss:2.901 lr:0.0000100 epoch_Time:26679.0min: [2024-01-04 09:40:23,980][model8_pretrain.py][INFO] Epoch:[0/2](378800/4588595) loss:2.677 lr:0.0000100 epoch_Time:26679.0min: [2024-01-04 09:41:00,905][model8_pretrain.py][INFO] Epoch:[0/2](378900/4588595) loss:2.846 lr:0.0000100 epoch_Time:26678.0min: [2024-01-04 09:41:00,905][model8_pretrain.py][INFO] Epoch:[0/2](378900/4588595) loss:2.545 lr:0.0000100 epoch_Time:26678.0min: [2024-01-04 09:41:00,906][model8_pretrain.py][INFO] Epoch:[0/2](378900/4588595) loss:2.953 lr:0.0000100 epoch_Time:26678.0min: [2024-01-04 09:41:00,906][model8_pretrain.py][INFO] Epoch:[0/2](378900/4588595) loss:2.836 lr:0.0000100 epoch_Time:26678.0min: [2024-01-04 09:41:00,906][model8_pretrain.py][INFO] Epoch:[0/2](378900/4588595) loss:2.528 lr:0.0000100 epoch_Time:26678.0min: [2024-01-04 09:41:00,906][model8_pretrain.py][INFO] Epoch:[0/2](378900/4588595) loss:2.966 lr:0.0000100 epoch_Time:26678.0min: [2024-01-04 09:41:00,906][model8_pretrain.py][INFO] Epoch:[0/2](378900/4588595) loss:2.752 lr:0.0000100 epoch_Time:26678.0min: [2024-01-04 09:41:00,906][model8_pretrain.py][INFO] Epoch:[0/2](378900/4588595) loss:2.362 lr:0.0000100 epoch_Time:26678.0min: [2024-01-04 09:41:37,848][model8_pretrain.py][INFO] Epoch:[0/2](379000/4588595) loss:3.368 lr:0.0000100 epoch_Time:26678.0min: [2024-01-04 09:41:37,848][model8_pretrain.py][INFO] Epoch:[0/2](379000/4588595) loss:2.522 lr:0.0000100 epoch_Time:26678.0min: [2024-01-04 09:41:37,848][model8_pretrain.py][INFO] Epoch:[0/2](379000/4588595) loss:3.683 lr:0.0000100 epoch_Time:26678.0min: [2024-01-04 09:41:37,848][model8_pretrain.py][INFO] Epoch:[0/2](379000/4588595) loss:2.872 lr:0.0000100 
epoch_Time:26678.0min: [2024-01-04 09:41:37,848][model8_pretrain.py][INFO] Epoch:[0/2](379000/4588595) loss:2.855 lr:0.0000100 epoch_Time:26678.0min: [2024-01-04 09:41:37,848][model8_pretrain.py][INFO] Epoch:[0/2](379000/4588595) loss:2.265 lr:0.0000100 epoch_Time:26678.0min: [2024-01-04 09:41:37,848][model8_pretrain.py][INFO] Epoch:[0/2](379000/4588595) loss:3.181 lr:0.0000100 epoch_Time:26678.0min: [2024-01-04 09:41:37,848][model8_pretrain.py][INFO] Epoch:[0/2](379000/4588595) loss:2.710 lr:0.0000100 epoch_Time:26678.0min: [2024-01-04 09:42:25,020][model8_pretrain.py][INFO] Epoch:[0/2](379100/4588595) loss:3.006 lr:0.0000100 epoch_Time:26679.0min: [2024-01-04 09:42:25,020][model8_pretrain.py][INFO] Epoch:[0/2](379100/4588595) loss:3.348 lr:0.0000100 epoch_Time:26679.0min: [2024-01-04 09:42:25,020][model8_pretrain.py][INFO] Epoch:[0/2](379100/4588595) loss:2.441 lr:0.0000100 epoch_Time:26679.0min: [2024-01-04 09:42:25,020][model8_pretrain.py][INFO] Epoch:[0/2](379100/4588595) loss:3.294 lr:0.0000100 epoch_Time:26679.0min: [2024-01-04 09:42:25,020][model8_pretrain.py][INFO] Epoch:[0/2](379100/4588595) loss:3.031 lr:0.0000100 epoch_Time:26679.0min: [2024-01-04 09:42:25,020][model8_pretrain.py][INFO] Epoch:[0/2](379100/4588595) loss:3.188 lr:0.0000100 epoch_Time:26679.0min: [2024-01-04 09:42:25,020][model8_pretrain.py][INFO] Epoch:[0/2](379100/4588595) loss:3.304 lr:0.0000100 epoch_Time:26679.0min: [2024-01-04 09:42:25,020][model8_pretrain.py][INFO] Epoch:[0/2](379100/4588595) loss:3.167 lr:0.0000100 epoch_Time:26679.0min: [2024-01-04 09:43:01,940][model8_pretrain.py][INFO] Epoch:[0/2](379200/4588595) loss:3.131 lr:0.0000100 epoch_Time:26677.0min: [2024-01-04 09:43:01,940][model8_pretrain.py][INFO] Epoch:[0/2](379200/4588595) loss:2.419 lr:0.0000100 epoch_Time:26677.0min: [2024-01-04 09:43:01,940][model8_pretrain.py][INFO] Epoch:[0/2](379200/4588595) loss:2.372 lr:0.0000100 epoch_Time:26677.0min: [2024-01-04 09:43:01,940][model8_pretrain.py][INFO] 
Epoch:[0/2](379200/4588595) loss:3.028 lr:0.0000100 epoch_Time:26677.0min: [2024-01-04 09:43:01,940][model8_pretrain.py][INFO] Epoch:[0/2](379200/4588595) loss:2.515 lr:0.0000100 epoch_Time:26677.0min: [2024-01-04 09:43:01,940][model8_pretrain.py][INFO] Epoch:[0/2](379200/4588595) loss:2.774 lr:0.0000100 epoch_Time:26677.0min: [2024-01-04 09:43:01,940][model8_pretrain.py][INFO] Epoch:[0/2](379200/4588595) loss:2.783 lr:0.0000100 epoch_Time:26677.0min: [2024-01-04 09:43:01,941][model8_pretrain.py][INFO] Epoch:[0/2](379200/4588595) loss:3.200 lr:0.0000100 epoch_Time:26677.0min: [2024-01-04 09:43:38,872][model8_pretrain.py][INFO] Epoch:[0/2](379300/4588595) loss:2.962 lr:0.0000100 epoch_Time:26677.0min: [2024-01-04 09:43:38,872][model8_pretrain.py][INFO] Epoch:[0/2](379300/4588595) loss:2.851 lr:0.0000100 epoch_Time:26677.0min: [2024-01-04 09:43:38,872][model8_pretrain.py][INFO] Epoch:[0/2](379300/4588595) loss:3.547 lr:0.0000100 epoch_Time:26677.0min: [2024-01-04 09:43:38,872][model8_pretrain.py][INFO] Epoch:[0/2](379300/4588595) loss:2.757 lr:0.0000100 epoch_Time:26677.0min: [2024-01-04 09:43:38,872][model8_pretrain.py][INFO] Epoch:[0/2](379300/4588595) loss:2.924 lr:0.0000100 epoch_Time:26677.0min: [2024-01-04 09:43:38,872][model8_pretrain.py][INFO] Epoch:[0/2](379300/4588595) loss:2.904 lr:0.0000100 epoch_Time:26677.0min: [2024-01-04 09:43:38,872][model8_pretrain.py][INFO] Epoch:[0/2](379300/4588595) loss:2.774 lr:0.0000100 epoch_Time:26677.0min: [2024-01-04 09:43:38,872][model8_pretrain.py][INFO] Epoch:[0/2](379300/4588595) loss:3.516 lr:0.0000100 epoch_Time:26677.0min: [2024-01-04 09:44:15,804][model8_pretrain.py][INFO] Epoch:[0/2](379400/4588595) loss:2.943 lr:0.0000100 epoch_Time:26676.0min: [2024-01-04 09:44:15,804][model8_pretrain.py][INFO] Epoch:[0/2](379400/4588595) loss:2.854 lr:0.0000100 epoch_Time:26676.0min: [2024-01-04 09:44:15,804][model8_pretrain.py][INFO] Epoch:[0/2](379400/4588595) loss:2.989 lr:0.0000100 epoch_Time:26676.0min: [2024-01-04 
09:44:15,804][model8_pretrain.py][INFO] Epoch:[0/2](379400/4588595) loss:3.193 lr:0.0000100 epoch_Time:26676.0min: [2024-01-04 09:44:15,804][model8_pretrain.py][INFO] Epoch:[0/2](379400/4588595) loss:2.634 lr:0.0000100 epoch_Time:26676.0min: [2024-01-04 09:44:15,804][model8_pretrain.py][INFO] Epoch:[0/2](379400/4588595) loss:2.417 lr:0.0000100 epoch_Time:26676.0min: [2024-01-04 09:44:15,804][model8_pretrain.py][INFO] Epoch:[0/2](379400/4588595) loss:2.830 lr:0.0000100 epoch_Time:26676.0min: [2024-01-04 09:44:15,804][model8_pretrain.py][INFO] Epoch:[0/2](379400/4588595) loss:2.678 lr:0.0000100 epoch_Time:26676.0min: [2024-01-04 09:44:52,729][model8_pretrain.py][INFO] Epoch:[0/2](379500/4588595) loss:2.583 lr:0.0000100 epoch_Time:26675.0min: [2024-01-04 09:44:52,729][model8_pretrain.py][INFO] Epoch:[0/2](379500/4588595) loss:2.953 lr:0.0000100 epoch_Time:26675.0min: [2024-01-04 09:44:52,729][model8_pretrain.py][INFO] Epoch:[0/2](379500/4588595) loss:2.734 lr:0.0000100 epoch_Time:26675.0min: [2024-01-04 09:44:52,729][model8_pretrain.py][INFO] Epoch:[0/2](379500/4588595) loss:3.092 lr:0.0000100 epoch_Time:26675.0min: [2024-01-04 09:44:52,729][model8_pretrain.py][INFO] Epoch:[0/2](379500/4588595) loss:2.774 lr:0.0000100 epoch_Time:26675.0min: [2024-01-04 09:44:52,730][model8_pretrain.py][INFO] Epoch:[0/2](379500/4588595) loss:2.646 lr:0.0000100 epoch_Time:26675.0min: [2024-01-04 09:44:52,730][model8_pretrain.py][INFO] Epoch:[0/2](379500/4588595) loss:3.070 lr:0.0000100 epoch_Time:26675.0min: [2024-01-04 09:44:52,729][model8_pretrain.py][INFO] Epoch:[0/2](379500/4588595) loss:2.513 lr:0.0000100 epoch_Time:26675.0min: [2024-01-04 09:45:29,667][model8_pretrain.py][INFO] Epoch:[0/2](379600/4588595) loss:2.765 lr:0.0000100 epoch_Time:26674.0min: [2024-01-04 09:45:29,667][model8_pretrain.py][INFO] Epoch:[0/2](379600/4588595) loss:2.718 lr:0.0000100 epoch_Time:26674.0min: [2024-01-04 09:45:29,667][model8_pretrain.py][INFO] Epoch:[0/2](379600/4588595) loss:2.682 lr:0.0000100 
epoch_Time:26674.0min: [2024-01-04 09:45:29,667][model8_pretrain.py][INFO] Epoch:[0/2](379600/4588595) loss:3.352 lr:0.0000100 epoch_Time:26674.0min: [2024-01-04 09:45:29,667][model8_pretrain.py][INFO] Epoch:[0/2](379600/4588595) loss:2.775 lr:0.0000100 epoch_Time:26674.0min: [2024-01-04 09:45:29,667][model8_pretrain.py][INFO] Epoch:[0/2](379600/4588595) loss:2.570 lr:0.0000100 epoch_Time:26674.0min: [2024-01-04 09:45:29,667][model8_pretrain.py][INFO] Epoch:[0/2](379600/4588595) loss:2.887 lr:0.0000100 epoch_Time:26674.0min: [2024-01-04 09:45:29,667][model8_pretrain.py][INFO] Epoch:[0/2](379600/4588595) loss:2.790 lr:0.0000100 epoch_Time:26674.0min: [2024-01-04 09:46:06,614][model8_pretrain.py][INFO] Epoch:[0/2](379700/4588595) loss:3.003 lr:0.0000100 epoch_Time:26673.0min: [2024-01-04 09:46:06,614][model8_pretrain.py][INFO] Epoch:[0/2](379700/4588595) loss:2.938 lr:0.0000100 epoch_Time:26673.0min: [2024-01-04 09:46:06,614][model8_pretrain.py][INFO] Epoch:[0/2](379700/4588595) loss:2.451 lr:0.0000100 epoch_Time:26673.0min: [2024-01-04 09:46:06,614][model8_pretrain.py][INFO] Epoch:[0/2](379700/4588595) loss:3.106 lr:0.0000100 epoch_Time:26673.0min: [2024-01-04 09:46:06,614][model8_pretrain.py][INFO] Epoch:[0/2](379700/4588595) loss:2.526 lr:0.0000100 epoch_Time:26673.0min: [2024-01-04 09:46:06,614][model8_pretrain.py][INFO] Epoch:[0/2](379700/4588595) loss:2.663 lr:0.0000100 epoch_Time:26673.0min: [2024-01-04 09:46:06,614][model8_pretrain.py][INFO] Epoch:[0/2](379700/4588595) loss:3.388 lr:0.0000100 epoch_Time:26673.0min: [2024-01-04 09:46:06,614][model8_pretrain.py][INFO] Epoch:[0/2](379700/4588595) loss:3.060 lr:0.0000100 epoch_Time:26673.0min: [2024-01-04 09:46:43,563][model8_pretrain.py][INFO] Epoch:[0/2](379800/4588595) loss:2.767 lr:0.0000100 epoch_Time:26673.0min: [2024-01-04 09:46:43,563][model8_pretrain.py][INFO] Epoch:[0/2](379800/4588595) loss:2.619 lr:0.0000100 epoch_Time:26673.0min: [2024-01-04 09:46:43,563][model8_pretrain.py][INFO] 
Epoch:[0/2](379800/4588595) loss:3.232 lr:0.0000100 epoch_Time:26673.0min: [2024-01-04 09:46:43,563][model8_pretrain.py][INFO] Epoch:[0/2](379800/4588595) loss:3.170 lr:0.0000100 epoch_Time:26673.0min: [2024-01-04 09:46:43,563][model8_pretrain.py][INFO] Epoch:[0/2](379800/4588595) loss:3.103 lr:0.0000100 epoch_Time:26673.0min: [2024-01-04 09:46:43,564][model8_pretrain.py][INFO] Epoch:[0/2](379800/4588595) loss:2.650 lr:0.0000100 epoch_Time:26673.0min: [2024-01-04 09:46:43,564][model8_pretrain.py][INFO] Epoch:[0/2](379800/4588595) loss:3.149 lr:0.0000100 epoch_Time:26673.0min: [2024-01-04 09:46:43,564][model8_pretrain.py][INFO] Epoch:[0/2](379800/4588595) loss:2.514 lr:0.0000100 epoch_Time:26673.0min: [2024-01-04 09:47:30,593][model8_pretrain.py][INFO] Epoch:[0/2](379900/4588595) loss:3.235 lr:0.0000100 epoch_Time:26674.0min: [2024-01-04 09:47:30,593][model8_pretrain.py][INFO] Epoch:[0/2](379900/4588595) loss:2.893 lr:0.0000100 epoch_Time:26674.0min: [2024-01-04 09:47:30,593][model8_pretrain.py][INFO] Epoch:[0/2](379900/4588595) loss:2.953 lr:0.0000100 epoch_Time:26674.0min: [2024-01-04 09:47:30,593][model8_pretrain.py][INFO] Epoch:[0/2](379900/4588595) loss:2.576 lr:0.0000100 epoch_Time:26674.0min: [2024-01-04 09:47:30,593][model8_pretrain.py][INFO] Epoch:[0/2](379900/4588595) loss:2.722 lr:0.0000100 epoch_Time:26674.0min: [2024-01-04 09:47:30,593][model8_pretrain.py][INFO] Epoch:[0/2](379900/4588595) loss:2.536 lr:0.0000100 epoch_Time:26674.0min: [2024-01-04 09:47:30,593][model8_pretrain.py][INFO] Epoch:[0/2](379900/4588595) loss:2.912 lr:0.0000100 epoch_Time:26674.0min: [2024-01-04 09:47:30,593][model8_pretrain.py][INFO] Epoch:[0/2](379900/4588595) loss:2.821 lr:0.0000100 epoch_Time:26674.0min: [2024-01-04 09:48:07,523][model8_pretrain.py][INFO] Epoch:[0/2](380000/4588595) loss:3.217 lr:0.0000100 epoch_Time:26673.0min: [2024-01-04 09:48:07,523][model8_pretrain.py][INFO] Epoch:[0/2](380000/4588595) loss:2.500 lr:0.0000100 epoch_Time:26673.0min: [2024-01-04 
09:48:07,523][model8_pretrain.py][INFO] Epoch:[0/2](380000/4588595) loss:2.641 lr:0.0000100 epoch_Time:26673.0min: [2024-01-04 09:48:07,523][model8_pretrain.py][INFO] Epoch:[0/2](380000/4588595) loss:2.963 lr:0.0000100 epoch_Time:26673.0min: [2024-01-04 09:48:07,523][model8_pretrain.py][INFO] Epoch:[0/2](380000/4588595) loss:2.712 lr:0.0000100 epoch_Time:26673.0min: [2024-01-04 09:48:07,523][model8_pretrain.py][INFO] Epoch:[0/2](380000/4588595) loss:3.190 lr:0.0000100 epoch_Time:26673.0min: [2024-01-04 09:48:07,523][model8_pretrain.py][INFO] Epoch:[0/2](380000/4588595) loss:3.263 lr:0.0000100 epoch_Time:26673.0min: [2024-01-04 09:48:07,523][model8_pretrain.py][INFO] Epoch:[0/2](380000/4588595) loss:2.759 lr:0.0000100 epoch_Time:26673.0min: [2024-01-04 09:48:44,473][model8_pretrain.py][INFO] Epoch:[0/2](380100/4588595) loss:2.813 lr:0.0000100 epoch_Time:26672.0min: [2024-01-04 09:48:44,473][model8_pretrain.py][INFO] Epoch:[0/2](380100/4588595) loss:3.279 lr:0.0000100 epoch_Time:26672.0min: [2024-01-04 09:48:44,473][model8_pretrain.py][INFO] Epoch:[0/2](380100/4588595) loss:2.682 lr:0.0000100 epoch_Time:26672.0min: [2024-01-04 09:48:44,473][model8_pretrain.py][INFO] Epoch:[0/2](380100/4588595) loss:2.447 lr:0.0000100 epoch_Time:26672.0min: [2024-01-04 09:48:44,473][model8_pretrain.py][INFO] Epoch:[0/2](380100/4588595) loss:2.552 lr:0.0000100 epoch_Time:26672.0min: [2024-01-04 09:48:44,473][model8_pretrain.py][INFO] Epoch:[0/2](380100/4588595) loss:2.472 lr:0.0000100 epoch_Time:26672.0min: [2024-01-04 09:48:44,473][model8_pretrain.py][INFO] Epoch:[0/2](380100/4588595) loss:2.901 lr:0.0000100 epoch_Time:26672.0min: [2024-01-04 09:48:44,473][model8_pretrain.py][INFO] Epoch:[0/2](380100/4588595) loss:2.729 lr:0.0000100 epoch_Time:26672.0min: [2024-01-04 09:49:21,422][model8_pretrain.py][INFO] Epoch:[0/2](380200/4588595) loss:2.874 lr:0.0000100 epoch_Time:26671.0min: [2024-01-04 09:49:21,422][model8_pretrain.py][INFO] Epoch:[0/2](380200/4588595) loss:2.900 lr:0.0000100 
epoch_Time:26671.0min: [2024-01-04 09:49:21,422][model8_pretrain.py][INFO] Epoch:[0/2](380200/4588595) loss:2.948 lr:0.0000100 epoch_Time:26671.0min: [2024-01-04 09:49:21,422][model8_pretrain.py][INFO] Epoch:[0/2](380200/4588595) loss:2.586 lr:0.0000100 epoch_Time:26671.0min: [2024-01-04 09:49:21,422][model8_pretrain.py][INFO] Epoch:[0/2](380200/4588595) loss:2.663 lr:0.0000100 epoch_Time:26671.0min: [2024-01-04 09:49:21,422][model8_pretrain.py][INFO] Epoch:[0/2](380200/4588595) loss:2.804 lr:0.0000100 epoch_Time:26671.0min: [2024-01-04 09:49:21,422][model8_pretrain.py][INFO] Epoch:[0/2](380200/4588595) loss:2.801 lr:0.0000100 epoch_Time:26671.0min: [2024-01-04 09:49:21,422][model8_pretrain.py][INFO] Epoch:[0/2](380200/4588595) loss:2.905 lr:0.0000100 epoch_Time:26671.0min: [2024-01-04 09:49:58,370][model8_pretrain.py][INFO] Epoch:[0/2](380300/4588595) loss:3.220 lr:0.0000100 epoch_Time:26670.0min: [2024-01-04 09:49:58,370][model8_pretrain.py][INFO] Epoch:[0/2](380300/4588595) loss:2.636 lr:0.0000100 epoch_Time:26670.0min: [2024-01-04 09:49:58,370][model8_pretrain.py][INFO] Epoch:[0/2](380300/4588595) loss:3.380 lr:0.0000100 epoch_Time:26670.0min: [2024-01-04 09:49:58,370][model8_pretrain.py][INFO] Epoch:[0/2](380300/4588595) loss:2.525 lr:0.0000100 epoch_Time:26670.0min: [2024-01-04 09:49:58,370][model8_pretrain.py][INFO] Epoch:[0/2](380300/4588595) loss:2.395 lr:0.0000100 epoch_Time:26670.0min: [2024-01-04 09:49:58,370][model8_pretrain.py][INFO] Epoch:[0/2](380300/4588595) loss:3.304 lr:0.0000100 epoch_Time:26670.0min: [2024-01-04 09:49:58,370][model8_pretrain.py][INFO] Epoch:[0/2](380300/4588595) loss:2.643 lr:0.0000100 epoch_Time:26670.0min: [2024-01-04 09:49:58,370][model8_pretrain.py][INFO] Epoch:[0/2](380300/4588595) loss:2.704 lr:0.0000100 epoch_Time:26670.0min: [2024-01-04 09:50:35,323][model8_pretrain.py][INFO] Epoch:[0/2](380400/4588595) loss:2.909 lr:0.0000100 epoch_Time:26670.0min: [2024-01-04 09:50:35,323][model8_pretrain.py][INFO] 
Epoch:[0/2](380400/4588595) loss:2.876 lr:0.0000100 epoch_Time:26670.0min: [2024-01-04 09:50:35,323][model8_pretrain.py][INFO] Epoch:[0/2](380400/4588595) loss:2.886 lr:0.0000100 epoch_Time:26670.0min: [2024-01-04 09:50:35,323][model8_pretrain.py][INFO] Epoch:[0/2](380400/4588595) loss:2.745 lr:0.0000100 epoch_Time:26670.0min: [2024-01-04 09:50:35,323][model8_pretrain.py][INFO] Epoch:[0/2](380400/4588595) loss:2.952 lr:0.0000100 epoch_Time:26670.0min: [2024-01-04 09:50:35,323][model8_pretrain.py][INFO] Epoch:[0/2](380400/4588595) loss:2.838 lr:0.0000100 epoch_Time:26670.0min: [2024-01-04 09:50:35,323][model8_pretrain.py][INFO] Epoch:[0/2](380400/4588595) loss:3.267 lr:0.0000100 epoch_Time:26670.0min: [2024-01-04 09:50:35,323][model8_pretrain.py][INFO] Epoch:[0/2](380400/4588595) loss:3.081 lr:0.0000100 epoch_Time:26670.0min: [2024-01-04 09:51:12,262][model8_pretrain.py][INFO] Epoch:[0/2](380500/4588595) loss:2.951 lr:0.0000100 epoch_Time:26669.0min: [2024-01-04 09:51:12,262][model8_pretrain.py][INFO] Epoch:[0/2](380500/4588595) loss:2.670 lr:0.0000100 epoch_Time:26669.0min: [2024-01-04 09:51:12,262][model8_pretrain.py][INFO] Epoch:[0/2](380500/4588595) loss:2.456 lr:0.0000100 epoch_Time:26669.0min: [2024-01-04 09:51:12,262][model8_pretrain.py][INFO] Epoch:[0/2](380500/4588595) loss:2.853 lr:0.0000100 epoch_Time:26669.0min: [2024-01-04 09:51:12,262][model8_pretrain.py][INFO] Epoch:[0/2](380500/4588595) loss:2.908 lr:0.0000100 epoch_Time:26669.0min: [2024-01-04 09:51:12,262][model8_pretrain.py][INFO] Epoch:[0/2](380500/4588595) loss:2.534 lr:0.0000100 epoch_Time:26669.0min: [2024-01-04 09:51:12,262][model8_pretrain.py][INFO] Epoch:[0/2](380500/4588595) loss:2.601 lr:0.0000100 epoch_Time:26669.0min: [2024-01-04 09:51:12,262][model8_pretrain.py][INFO] Epoch:[0/2](380500/4588595) loss:3.147 lr:0.0000100 epoch_Time:26669.0min: [2024-01-04 09:51:49,199][model8_pretrain.py][INFO] Epoch:[0/2](380600/4588595) loss:2.272 lr:0.0000100 epoch_Time:26667.0min: [2024-01-04 
09:51:49,199][model8_pretrain.py][INFO] Epoch:[0/2](380600/4588595) loss:2.725 lr:0.0000100 epoch_Time:26667.0min: [2024-01-04 09:51:49,199][model8_pretrain.py][INFO] Epoch:[0/2](380600/4588595) loss:2.659 lr:0.0000100 epoch_Time:26667.0min: [2024-01-04 09:51:49,199][model8_pretrain.py][INFO] Epoch:[0/2](380600/4588595) loss:2.585 lr:0.0000100 epoch_Time:26667.0min: [2024-01-04 09:51:49,199][model8_pretrain.py][INFO] Epoch:[0/2](380600/4588595) loss:3.122 lr:0.0000100 epoch_Time:26667.0min: [2024-01-04 09:51:49,199][model8_pretrain.py][INFO] Epoch:[0/2](380600/4588595) loss:2.747 lr:0.0000100 epoch_Time:26667.0min: [2024-01-04 09:51:49,199][model8_pretrain.py][INFO] Epoch:[0/2](380600/4588595) loss:2.411 lr:0.0000100 epoch_Time:26667.0min: [2024-01-04 09:51:49,199][model8_pretrain.py][INFO] Epoch:[0/2](380600/4588595) loss:3.376 lr:0.0000100 epoch_Time:26667.0min: [2024-01-04 09:52:36,538][model8_pretrain.py][INFO] Epoch:[0/2](380700/4588595) loss:2.686 lr:0.0000100 epoch_Time:26669.0min: [2024-01-04 09:52:36,538][model8_pretrain.py][INFO] Epoch:[0/2](380700/4588595) loss:2.959 lr:0.0000100 epoch_Time:26669.0min: [2024-01-04 09:52:36,538][model8_pretrain.py][INFO] Epoch:[0/2](380700/4588595) loss:3.181 lr:0.0000100 epoch_Time:26669.0min: [2024-01-04 09:52:36,538][model8_pretrain.py][INFO] Epoch:[0/2](380700/4588595) loss:3.002 lr:0.0000100 epoch_Time:26669.0min: [2024-01-04 09:52:36,538][model8_pretrain.py][INFO] Epoch:[0/2](380700/4588595) loss:3.226 lr:0.0000100 epoch_Time:26669.0min: [2024-01-04 09:52:36,538][model8_pretrain.py][INFO] Epoch:[0/2](380700/4588595) loss:3.280 lr:0.0000100 epoch_Time:26669.0min: [2024-01-04 09:52:36,538][model8_pretrain.py][INFO] Epoch:[0/2](380700/4588595) loss:3.131 lr:0.0000100 epoch_Time:26669.0min: [2024-01-04 09:52:36,538][model8_pretrain.py][INFO] Epoch:[0/2](380700/4588595) loss:2.593 lr:0.0000100 epoch_Time:26669.0min: [2024-01-04 09:53:13,469][model8_pretrain.py][INFO] Epoch:[0/2](380800/4588595) loss:3.295 lr:0.0000100 
epoch_Time:26668.0min: [2024-01-04 09:53:13,469][model8_pretrain.py][INFO] Epoch:[0/2](380800/4588595) loss:2.738 lr:0.0000100 epoch_Time:26668.0min: [2024-01-04 09:53:13,469][model8_pretrain.py][INFO] Epoch:[0/2](380800/4588595) loss:2.690 lr:0.0000100 epoch_Time:26668.0min: [2024-01-04 09:53:13,469][model8_pretrain.py][INFO] Epoch:[0/2](380800/4588595) loss:2.842 lr:0.0000100 epoch_Time:26668.0min: [2024-01-04 09:53:13,469][model8_pretrain.py][INFO] Epoch:[0/2](380800/4588595) loss:2.797 lr:0.0000100 epoch_Time:26668.0min: [2024-01-04 09:53:13,469][model8_pretrain.py][INFO] Epoch:[0/2](380800/4588595) loss:2.942 lr:0.0000100 epoch_Time:26668.0min: [2024-01-04 09:53:13,469][model8_pretrain.py][INFO] Epoch:[0/2](380800/4588595) loss:2.563 lr:0.0000100 epoch_Time:26668.0min: [2024-01-04 09:53:13,469][model8_pretrain.py][INFO] Epoch:[0/2](380800/4588595) loss:3.069 lr:0.0000100 epoch_Time:26668.0min: [2024-01-04 09:53:50,398][model8_pretrain.py][INFO] Epoch:[0/2](380900/4588595) loss:3.249 lr:0.0000100 epoch_Time:26667.0min: [2024-01-04 09:53:50,398][model8_pretrain.py][INFO] Epoch:[0/2](380900/4588595) loss:2.909 lr:0.0000100 epoch_Time:26667.0min: [2024-01-04 09:53:50,398][model8_pretrain.py][INFO] Epoch:[0/2](380900/4588595) loss:3.232 lr:0.0000100 epoch_Time:26667.0min: [2024-01-04 09:53:50,398][model8_pretrain.py][INFO] Epoch:[0/2](380900/4588595) loss:2.863 lr:0.0000100 epoch_Time:26667.0min: [2024-01-04 09:53:50,398][model8_pretrain.py][INFO] Epoch:[0/2](380900/4588595) loss:2.718 lr:0.0000100 epoch_Time:26667.0min: [2024-01-04 09:53:50,398][model8_pretrain.py][INFO] Epoch:[0/2](380900/4588595) loss:2.578 lr:0.0000100 epoch_Time:26667.0min: [2024-01-04 09:53:50,398][model8_pretrain.py][INFO] Epoch:[0/2](380900/4588595) loss:3.277 lr:0.0000100 epoch_Time:26667.0min: [2024-01-04 09:53:50,398][model8_pretrain.py][INFO] Epoch:[0/2](380900/4588595) loss:2.967 lr:0.0000100 epoch_Time:26667.0min: [2024-01-04 09:54:27,339][model8_pretrain.py][INFO] 
Epoch:[0/2](381000/4588595) loss:2.765 lr:0.0000100 epoch_Time:26667.0min: [2024-01-04 09:54:27,339][model8_pretrain.py][INFO] Epoch:[0/2](381000/4588595) loss:3.211 lr:0.0000100 epoch_Time:26667.0min: [2024-01-04 09:54:27,339][model8_pretrain.py][INFO] Epoch:[0/2](381000/4588595) loss:3.239 lr:0.0000100 epoch_Time:26667.0min: [2024-01-04 09:54:27,339][model8_pretrain.py][INFO] Epoch:[0/2](381000/4588595) loss:3.131 lr:0.0000100 epoch_Time:26667.0min: [2024-01-04 09:54:27,339][model8_pretrain.py][INFO] Epoch:[0/2](381000/4588595) loss:3.050 lr:0.0000100 epoch_Time:26667.0min: [2024-01-04 09:54:27,339][model8_pretrain.py][INFO] Epoch:[0/2](381000/4588595) loss:2.978 lr:0.0000100 epoch_Time:26667.0min: [2024-01-04 09:54:27,339][model8_pretrain.py][INFO] Epoch:[0/2](381000/4588595) loss:2.843 lr:0.0000100 epoch_Time:26667.0min: [2024-01-04 09:54:27,339][model8_pretrain.py][INFO] Epoch:[0/2](381000/4588595) loss:2.202 lr:0.0000100 epoch_Time:26667.0min: [2024-01-04 09:55:04,278][model8_pretrain.py][INFO] Epoch:[0/2](381100/4588595) loss:2.390 lr:0.0000100 epoch_Time:26665.0min: [2024-01-04 09:55:04,278][model8_pretrain.py][INFO] Epoch:[0/2](381100/4588595) loss:2.553 lr:0.0000100 epoch_Time:26665.0min: [2024-01-04 09:55:04,278][model8_pretrain.py][INFO] Epoch:[0/2](381100/4588595) loss:2.852 lr:0.0000100 epoch_Time:26665.0min: [2024-01-04 09:55:04,278][model8_pretrain.py][INFO] Epoch:[0/2](381100/4588595) loss:2.782 lr:0.0000100 epoch_Time:26665.0min: [2024-01-04 09:55:04,278][model8_pretrain.py][INFO] Epoch:[0/2](381100/4588595) loss:3.257 lr:0.0000100 epoch_Time:26665.0min: [2024-01-04 09:55:04,278][model8_pretrain.py][INFO] Epoch:[0/2](381100/4588595) loss:3.301 lr:0.0000100 epoch_Time:26665.0min: [2024-01-04 09:55:04,278][model8_pretrain.py][INFO] Epoch:[0/2](381100/4588595) loss:2.973 lr:0.0000100 epoch_Time:26665.0min: [2024-01-04 09:55:04,278][model8_pretrain.py][INFO] Epoch:[0/2](381100/4588595) loss:2.700 lr:0.0000100 epoch_Time:26665.0min: [2024-01-04 
09:55:41,211][model8_pretrain.py][INFO] Epoch:[0/2](381200/4588595) loss:2.813 lr:0.0000100 epoch_Time:26665.0min: [2024-01-04 09:55:41,211][model8_pretrain.py][INFO] Epoch:[0/2](381200/4588595) loss:3.066 lr:0.0000100 epoch_Time:26665.0min: [2024-01-04 09:55:41,211][model8_pretrain.py][INFO] Epoch:[0/2](381200/4588595) loss:2.790 lr:0.0000100 epoch_Time:26665.0min: [2024-01-04 09:55:41,211][model8_pretrain.py][INFO] Epoch:[0/2](381200/4588595) loss:2.678 lr:0.0000100 epoch_Time:26665.0min: [2024-01-04 09:55:41,211][model8_pretrain.py][INFO] Epoch:[0/2](381200/4588595) loss:2.574 lr:0.0000100 epoch_Time:26665.0min: [2024-01-04 09:55:41,212][model8_pretrain.py][INFO] Epoch:[0/2](381200/4588595) loss:2.865 lr:0.0000100 epoch_Time:26665.0min: [2024-01-04 09:55:41,212][model8_pretrain.py][INFO] Epoch:[0/2](381200/4588595) loss:3.124 lr:0.0000100 epoch_Time:26665.0min: [2024-01-04 09:55:41,212][model8_pretrain.py][INFO] Epoch:[0/2](381200/4588595) loss:2.143 lr:0.0000100 epoch_Time:26665.0min: [2024-01-04 09:56:18,145][model8_pretrain.py][INFO] Epoch:[0/2](381300/4588595) loss:2.893 lr:0.0000100 epoch_Time:26664.0min: [2024-01-04 09:56:18,145][model8_pretrain.py][INFO] Epoch:[0/2](381300/4588595) loss:2.559 lr:0.0000100 epoch_Time:26664.0min: [2024-01-04 09:56:18,145][model8_pretrain.py][INFO] Epoch:[0/2](381300/4588595) loss:2.815 lr:0.0000100 epoch_Time:26664.0min: [2024-01-04 09:56:18,145][model8_pretrain.py][INFO] Epoch:[0/2](381300/4588595) loss:2.815 lr:0.0000100 epoch_Time:26664.0min: [2024-01-04 09:56:18,145][model8_pretrain.py][INFO] Epoch:[0/2](381300/4588595) loss:3.112 lr:0.0000100 epoch_Time:26664.0min: [2024-01-04 09:56:18,145][model8_pretrain.py][INFO] Epoch:[0/2](381300/4588595) loss:2.412 lr:0.0000100 epoch_Time:26664.0min: [2024-01-04 09:56:18,146][model8_pretrain.py][INFO] Epoch:[0/2](381300/4588595) loss:2.850 lr:0.0000100 epoch_Time:26664.0min: [2024-01-04 09:56:18,146][model8_pretrain.py][INFO] Epoch:[0/2](381300/4588595) loss:2.992 lr:0.0000100 
epoch_Time:26664.0min: [2024-01-04 09:56:55,084][model8_pretrain.py][INFO] Epoch:[0/2](381400/4588595) loss:3.111 lr:0.0000100 epoch_Time:26663.0min: [2024-01-04 09:56:55,084][model8_pretrain.py][INFO] Epoch:[0/2](381400/4588595) loss:2.561 lr:0.0000100 epoch_Time:26663.0min: [2024-01-04 09:56:55,084][model8_pretrain.py][INFO] Epoch:[0/2](381400/4588595) loss:2.913 lr:0.0000100 epoch_Time:26663.0min: [2024-01-04 09:56:55,084][model8_pretrain.py][INFO] Epoch:[0/2](381400/4588595) loss:3.049 lr:0.0000100 epoch_Time:26663.0min: [2024-01-04 09:56:55,084][model8_pretrain.py][INFO] Epoch:[0/2](381400/4588595) loss:2.996 lr:0.0000100 epoch_Time:26663.0min: [2024-01-04 09:56:55,084][model8_pretrain.py][INFO] Epoch:[0/2](381400/4588595) loss:2.631 lr:0.0000100 epoch_Time:26663.0min: [2024-01-04 09:56:55,084][model8_pretrain.py][INFO] Epoch:[0/2](381400/4588595) loss:3.028 lr:0.0000100 epoch_Time:26663.0min: [2024-01-04 09:56:55,084][model8_pretrain.py][INFO] Epoch:[0/2](381400/4588595) loss:2.804 lr:0.0000100 epoch_Time:26663.0min: [2024-01-04 09:57:42,350][model8_pretrain.py][INFO] Epoch:[0/2](381500/4588595) loss:3.518 lr:0.0000100 epoch_Time:26665.0min: [2024-01-04 09:57:42,350][model8_pretrain.py][INFO] Epoch:[0/2](381500/4588595) loss:2.739 lr:0.0000100 epoch_Time:26665.0min: [2024-01-04 09:57:42,350][model8_pretrain.py][INFO] Epoch:[0/2](381500/4588595) loss:3.114 lr:0.0000100 epoch_Time:26665.0min: [2024-01-04 09:57:42,350][model8_pretrain.py][INFO] Epoch:[0/2](381500/4588595) loss:2.862 lr:0.0000100 epoch_Time:26665.0min: [2024-01-04 09:57:42,350][model8_pretrain.py][INFO] Epoch:[0/2](381500/4588595) loss:2.806 lr:0.0000100 epoch_Time:26665.0min: [2024-01-04 09:57:42,350][model8_pretrain.py][INFO] Epoch:[0/2](381500/4588595) loss:3.053 lr:0.0000100 epoch_Time:26665.0min: [2024-01-04 09:57:42,351][model8_pretrain.py][INFO] Epoch:[0/2](381500/4588595) loss:3.197 lr:0.0000100 epoch_Time:26665.0min: [2024-01-04 09:57:42,351][model8_pretrain.py][INFO] 
Epoch:[0/2](381500/4588595) loss:3.242 lr:0.0000100 epoch_Time:26665.0min: [2024-01-04 09:58:19,266][model8_pretrain.py][INFO] Epoch:[0/2](381600/4588595) loss:3.248 lr:0.0000100 epoch_Time:26663.0min: [2024-01-04 09:58:19,266][model8_pretrain.py][INFO] Epoch:[0/2](381600/4588595) loss:2.979 lr:0.0000100 epoch_Time:26663.0min: [2024-01-04 09:58:19,266][model8_pretrain.py][INFO] Epoch:[0/2](381600/4588595) loss:2.794 lr:0.0000100 epoch_Time:26663.0min: [2024-01-04 09:58:19,266][model8_pretrain.py][INFO] Epoch:[0/2](381600/4588595) loss:3.207 lr:0.0000100 epoch_Time:26663.0min: [2024-01-04 09:58:19,266][model8_pretrain.py][INFO] Epoch:[0/2](381600/4588595) loss:2.448 lr:0.0000100 epoch_Time:26663.0min: [2024-01-04 09:58:19,266][model8_pretrain.py][INFO] Epoch:[0/2](381600/4588595) loss:3.024 lr:0.0000100 epoch_Time:26663.0min: [2024-01-04 09:58:19,266][model8_pretrain.py][INFO] Epoch:[0/2](381600/4588595) loss:2.534 lr:0.0000100 epoch_Time:26663.0min: [2024-01-04 09:58:19,267][model8_pretrain.py][INFO] Epoch:[0/2](381600/4588595) loss:3.151 lr:0.0000100 epoch_Time:26663.0min: [2024-01-04 09:58:56,217][model8_pretrain.py][INFO] Epoch:[0/2](381700/4588595) loss:2.876 lr:0.0000100 epoch_Time:26662.0min: [2024-01-04 09:58:56,217][model8_pretrain.py][INFO] Epoch:[0/2](381700/4588595) loss:3.089 lr:0.0000100 epoch_Time:26662.0min: [2024-01-04 09:58:56,217][model8_pretrain.py][INFO] Epoch:[0/2](381700/4588595) loss:2.663 lr:0.0000100 epoch_Time:26662.0min: [2024-01-04 09:58:56,217][model8_pretrain.py][INFO] Epoch:[0/2](381700/4588595) loss:2.969 lr:0.0000100 epoch_Time:26662.0min: [2024-01-04 09:58:56,217][model8_pretrain.py][INFO] Epoch:[0/2](381700/4588595) loss:2.794 lr:0.0000100 epoch_Time:26662.0min: [2024-01-04 09:58:56,217][model8_pretrain.py][INFO] Epoch:[0/2](381700/4588595) loss:2.829 lr:0.0000100 epoch_Time:26662.0min: [2024-01-04 09:58:56,217][model8_pretrain.py][INFO] Epoch:[0/2](381700/4588595) loss:2.937 lr:0.0000100 epoch_Time:26662.0min: [2024-01-04 
09:58:56,217][model8_pretrain.py][INFO] Epoch:[0/2](381700/4588595) loss:2.800 lr:0.0000100 epoch_Time:26662.0min: [2024-01-04 09:59:33,167][model8_pretrain.py][INFO] Epoch:[0/2](381800/4588595) loss:2.568 lr:0.0000100 epoch_Time:26662.0min: [2024-01-04 09:59:33,167][model8_pretrain.py][INFO] Epoch:[0/2](381800/4588595) loss:1.877 lr:0.0000100 epoch_Time:26662.0min: [2024-01-04 09:59:33,167][model8_pretrain.py][INFO] Epoch:[0/2](381800/4588595) loss:2.918 lr:0.0000100 epoch_Time:26662.0min: [2024-01-04 09:59:33,167][model8_pretrain.py][INFO] Epoch:[0/2](381800/4588595) loss:3.238 lr:0.0000100 epoch_Time:26662.0min: [2024-01-04 09:59:33,167][model8_pretrain.py][INFO] Epoch:[0/2](381800/4588595) loss:2.170 lr:0.0000100 epoch_Time:26662.0min: [2024-01-04 09:59:33,167][model8_pretrain.py][INFO] Epoch:[0/2](381800/4588595) loss:3.120 lr:0.0000100 epoch_Time:26662.0min: [2024-01-04 09:59:33,167][model8_pretrain.py][INFO] Epoch:[0/2](381800/4588595) loss:2.768 lr:0.0000100 epoch_Time:26662.0min: [2024-01-04 09:59:33,167][model8_pretrain.py][INFO] Epoch:[0/2](381800/4588595) loss:2.904 lr:0.0000100 epoch_Time:26662.0min: [2024-01-04 10:00:10,124][model8_pretrain.py][INFO] Epoch:[0/2](381900/4588595) loss:3.438 lr:0.0000100 epoch_Time:26661.0min: [2024-01-04 10:00:10,124][model8_pretrain.py][INFO] Epoch:[0/2](381900/4588595) loss:2.965 lr:0.0000100 epoch_Time:26661.0min: [2024-01-04 10:00:10,124][model8_pretrain.py][INFO] Epoch:[0/2](381900/4588595) loss:3.030 lr:0.0000100 epoch_Time:26661.0min: [2024-01-04 10:00:10,124][model8_pretrain.py][INFO] Epoch:[0/2](381900/4588595) loss:3.210 lr:0.0000100 epoch_Time:26661.0min: [2024-01-04 10:00:10,124][model8_pretrain.py][INFO] Epoch:[0/2](381900/4588595) loss:2.539 lr:0.0000100 epoch_Time:26661.0min: [2024-01-04 10:00:10,124][model8_pretrain.py][INFO] Epoch:[0/2](381900/4588595) loss:2.590 lr:0.0000100 epoch_Time:26661.0min: [2024-01-04 10:00:10,124][model8_pretrain.py][INFO] Epoch:[0/2](381900/4588595) loss:3.271 lr:0.0000100 
epoch_Time:26661.0min: [2024-01-04 10:00:10,124][model8_pretrain.py][INFO] Epoch:[0/2](381900/4588595) loss:2.258 lr:0.0000100 epoch_Time:26661.0min: [2024-01-04 10:00:47,076][model8_pretrain.py][INFO] Epoch:[0/2](382000/4588595) loss:3.049 lr:0.0000100 epoch_Time:26660.0min: [2024-01-04 10:00:47,076][model8_pretrain.py][INFO] Epoch:[0/2](382000/4588595) loss:3.031 lr:0.0000100 epoch_Time:26660.0min: [2024-01-04 10:00:47,076][model8_pretrain.py][INFO] Epoch:[0/2](382000/4588595) loss:2.951 lr:0.0000100 epoch_Time:26660.0min: [2024-01-04 10:00:47,076][model8_pretrain.py][INFO] Epoch:[0/2](382000/4588595) loss:2.806 lr:0.0000100 epoch_Time:26660.0min: [2024-01-04 10:00:47,076][model8_pretrain.py][INFO] Epoch:[0/2](382000/4588595) loss:3.359 lr:0.0000100 epoch_Time:26660.0min: [2024-01-04 10:00:47,076][model8_pretrain.py][INFO] Epoch:[0/2](382000/4588595) loss:2.742 lr:0.0000100 epoch_Time:26660.0min: [2024-01-04 10:00:47,076][model8_pretrain.py][INFO] Epoch:[0/2](382000/4588595) loss:2.777 lr:0.0000100 epoch_Time:26660.0min: [2024-01-04 10:00:47,076][model8_pretrain.py][INFO] Epoch:[0/2](382000/4588595) loss:3.191 lr:0.0000100 epoch_Time:26660.0min: [2024-01-04 10:01:24,042][model8_pretrain.py][INFO] Epoch:[0/2](382100/4588595) loss:3.294 lr:0.0000100 epoch_Time:26659.0min: [2024-01-04 10:01:24,042][model8_pretrain.py][INFO] Epoch:[0/2](382100/4588595) loss:2.410 lr:0.0000100 epoch_Time:26659.0min: [2024-01-04 10:01:24,042][model8_pretrain.py][INFO] Epoch:[0/2](382100/4588595) loss:2.231 lr:0.0000100 epoch_Time:26659.0min: [2024-01-04 10:01:24,042][model8_pretrain.py][INFO] Epoch:[0/2](382100/4588595) loss:3.016 lr:0.0000100 epoch_Time:26659.0min: [2024-01-04 10:01:24,042][model8_pretrain.py][INFO] Epoch:[0/2](382100/4588595) loss:3.036 lr:0.0000100 epoch_Time:26659.0min: [2024-01-04 10:01:24,042][model8_pretrain.py][INFO] Epoch:[0/2](382100/4588595) loss:3.388 lr:0.0000100 epoch_Time:26659.0min: [2024-01-04 10:01:24,042][model8_pretrain.py][INFO] 
Epoch:[0/2](382100/4588595) loss:2.891 lr:0.0000100 epoch_Time:26659.0min: [2024-01-04 10:01:24,042][model8_pretrain.py][INFO] Epoch:[0/2](382100/4588595) loss:2.670 lr:0.0000100 epoch_Time:26659.0min: [2024-01-04 10:02:01,007][model8_pretrain.py][INFO] Epoch:[0/2](382200/4588595) loss:3.188 lr:0.0000100 epoch_Time:26658.0min: [2024-01-04 10:02:01,007][model8_pretrain.py][INFO] Epoch:[0/2](382200/4588595) loss:3.335 lr:0.0000100 epoch_Time:26658.0min: [2024-01-04 10:02:01,007][model8_pretrain.py][INFO] Epoch:[0/2](382200/4588595) loss:2.556 lr:0.0000100 epoch_Time:26658.0min: [2024-01-04 10:02:01,007][model8_pretrain.py][INFO] Epoch:[0/2](382200/4588595) loss:2.839 lr:0.0000100 epoch_Time:26658.0min: [2024-01-04 10:02:01,007][model8_pretrain.py][INFO] Epoch:[0/2](382200/4588595) loss:2.988 lr:0.0000100 epoch_Time:26658.0min: [2024-01-04 10:02:01,007][model8_pretrain.py][INFO] Epoch:[0/2](382200/4588595) loss:2.634 lr:0.0000100 epoch_Time:26658.0min: [2024-01-04 10:02:01,008][model8_pretrain.py][INFO] Epoch:[0/2](382200/4588595) loss:3.242 lr:0.0000100 epoch_Time:26658.0min: [2024-01-04 10:02:01,008][model8_pretrain.py][INFO] Epoch:[0/2](382200/4588595) loss:3.109 lr:0.0000100 epoch_Time:26658.0min: [2024-01-04 10:02:48,402][model8_pretrain.py][INFO] Epoch:[0/2](382300/4588595) loss:2.986 lr:0.0000100 epoch_Time:26659.0min: [2024-01-04 10:02:48,402][model8_pretrain.py][INFO] Epoch:[0/2](382300/4588595) loss:2.933 lr:0.0000100 epoch_Time:26659.0min: [2024-01-04 10:02:48,402][model8_pretrain.py][INFO] Epoch:[0/2](382300/4588595) loss:2.590 lr:0.0000100 epoch_Time:26659.0min: [2024-01-04 10:02:48,402][model8_pretrain.py][INFO] Epoch:[0/2](382300/4588595) loss:2.940 lr:0.0000100 epoch_Time:26659.0min: [2024-01-04 10:02:48,402][model8_pretrain.py][INFO] Epoch:[0/2](382300/4588595) loss:3.144 lr:0.0000100 epoch_Time:26659.0min: [2024-01-04 10:02:48,402][model8_pretrain.py][INFO] Epoch:[0/2](382300/4588595) loss:2.611 lr:0.0000100 epoch_Time:26659.0min: [2024-01-04 
10:02:48,402][model8_pretrain.py][INFO] Epoch:[0/2](382300/4588595) loss:3.200 lr:0.0000100 epoch_Time:26659.0min: [2024-01-04 10:02:48,402][model8_pretrain.py][INFO] Epoch:[0/2](382300/4588595) loss:2.831 lr:0.0000100 epoch_Time:26659.0min: [2024-01-04 10:03:25,319][model8_pretrain.py][INFO] Epoch:[0/2](382400/4588595) loss:2.919 lr:0.0000100 epoch_Time:26659.0min: [2024-01-04 10:03:25,319][model8_pretrain.py][INFO] Epoch:[0/2](382400/4588595) loss:2.977 lr:0.0000100 epoch_Time:26659.0min: [2024-01-04 10:03:25,319][model8_pretrain.py][INFO] Epoch:[0/2](382400/4588595) loss:3.028 lr:0.0000100 epoch_Time:26659.0min: [2024-01-04 10:03:25,319][model8_pretrain.py][INFO] Epoch:[0/2](382400/4588595) loss:2.130 lr:0.0000100 epoch_Time:26659.0min: [2024-01-04 10:03:25,319][model8_pretrain.py][INFO] Epoch:[0/2](382400/4588595) loss:3.401 lr:0.0000100 epoch_Time:26659.0min: [2024-01-04 10:03:25,319][model8_pretrain.py][INFO] Epoch:[0/2](382400/4588595) loss:2.454 lr:0.0000100 epoch_Time:26659.0min: [2024-01-04 10:03:25,320][model8_pretrain.py][INFO] Epoch:[0/2](382400/4588595) loss:2.586 lr:0.0000100 epoch_Time:26659.0min: [2024-01-04 10:03:25,320][model8_pretrain.py][INFO] Epoch:[0/2](382400/4588595) loss:3.177 lr:0.0000100 epoch_Time:26659.0min: [2024-01-04 10:04:02,260][model8_pretrain.py][INFO] Epoch:[0/2](382500/4588595) loss:2.803 lr:0.0000100 epoch_Time:26657.0min: [2024-01-04 10:04:02,260][model8_pretrain.py][INFO] Epoch:[0/2](382500/4588595) loss:3.039 lr:0.0000100 epoch_Time:26657.0min: [2024-01-04 10:04:02,260][model8_pretrain.py][INFO] Epoch:[0/2](382500/4588595) loss:2.911 lr:0.0000100 epoch_Time:26657.0min: [2024-01-04 10:04:02,260][model8_pretrain.py][INFO] Epoch:[0/2](382500/4588595) loss:2.810 lr:0.0000100 epoch_Time:26657.0min: [2024-01-04 10:04:02,260][model8_pretrain.py][INFO] Epoch:[0/2](382500/4588595) loss:2.781 lr:0.0000100 epoch_Time:26657.0min: [2024-01-04 10:04:02,260][model8_pretrain.py][INFO] Epoch:[0/2](382500/4588595) loss:2.353 lr:0.0000100 
epoch_Time:26657.0min: [2024-01-04 10:04:02,260][model8_pretrain.py][INFO] Epoch:[0/2](382500/4588595) loss:3.119 lr:0.0000100 epoch_Time:26657.0min: [2024-01-04 10:04:02,260][model8_pretrain.py][INFO] Epoch:[0/2](382500/4588595) loss:3.105 lr:0.0000100 epoch_Time:26657.0min: [2024-01-04 10:04:39,209][model8_pretrain.py][INFO] Epoch:[0/2](382600/4588595) loss:2.792 lr:0.0000100 epoch_Time:26657.0min: [2024-01-04 10:04:39,209][model8_pretrain.py][INFO] Epoch:[0/2](382600/4588595) loss:3.488 lr:0.0000100 epoch_Time:26657.0min: [2024-01-04 10:04:39,209][model8_pretrain.py][INFO] Epoch:[0/2](382600/4588595) loss:2.940 lr:0.0000100 epoch_Time:26657.0min: [2024-01-04 10:04:39,209][model8_pretrain.py][INFO] Epoch:[0/2](382600/4588595) loss:2.610 lr:0.0000100 epoch_Time:26657.0min: [2024-01-04 10:04:39,209][model8_pretrain.py][INFO] Epoch:[0/2](382600/4588595) loss:2.314 lr:0.0000100 epoch_Time:26657.0min: [2024-01-04 10:04:39,209][model8_pretrain.py][INFO] Epoch:[0/2](382600/4588595) loss:2.756 lr:0.0000100 epoch_Time:26657.0min: [2024-01-04 10:04:39,209][model8_pretrain.py][INFO] Epoch:[0/2](382600/4588595) loss:2.818 lr:0.0000100 epoch_Time:26657.0min: [2024-01-04 10:04:39,209][model8_pretrain.py][INFO] Epoch:[0/2](382600/4588595) loss:3.186 lr:0.0000100 epoch_Time:26657.0min: [2024-01-04 10:05:16,137][model8_pretrain.py][INFO] Epoch:[0/2](382700/4588595) loss:2.895 lr:0.0000100 epoch_Time:26656.0min: [2024-01-04 10:05:16,137][model8_pretrain.py][INFO] Epoch:[0/2](382700/4588595) loss:2.810 lr:0.0000100 epoch_Time:26656.0min: [2024-01-04 10:05:16,137][model8_pretrain.py][INFO] Epoch:[0/2](382700/4588595) loss:2.991 lr:0.0000100 epoch_Time:26656.0min: [2024-01-04 10:05:16,137][model8_pretrain.py][INFO] Epoch:[0/2](382700/4588595) loss:2.552 lr:0.0000100 epoch_Time:26656.0min: [2024-01-04 10:05:16,137][model8_pretrain.py][INFO] Epoch:[0/2](382700/4588595) loss:3.628 lr:0.0000100 epoch_Time:26656.0min: [2024-01-04 10:05:16,137][model8_pretrain.py][INFO] 
Epoch:[0/2](382700/4588595) loss:2.924 lr:0.0000100 epoch_Time:26656.0min: [2024-01-04 10:05:16,137][model8_pretrain.py][INFO] Epoch:[0/2](382700/4588595) loss:2.730 lr:0.0000100 epoch_Time:26656.0min: [2024-01-04 10:05:16,138][model8_pretrain.py][INFO] Epoch:[0/2](382700/4588595) loss:2.701 lr:0.0000100 epoch_Time:26656.0min: [2024-01-04 10:05:53,068][model8_pretrain.py][INFO] Epoch:[0/2](382800/4588595) loss:2.866 lr:0.0000100 epoch_Time:26655.0min: [2024-01-04 10:05:53,068][model8_pretrain.py][INFO] Epoch:[0/2](382800/4588595) loss:3.269 lr:0.0000100 epoch_Time:26655.0min: [2024-01-04 10:05:53,068][model8_pretrain.py][INFO] Epoch:[0/2](382800/4588595) loss:2.734 lr:0.0000100 epoch_Time:26655.0min: [2024-01-04 10:05:53,068][model8_pretrain.py][INFO] Epoch:[0/2](382800/4588595) loss:2.884 lr:0.0000100 epoch_Time:26655.0min: [2024-01-04 10:05:53,068][model8_pretrain.py][INFO] Epoch:[0/2](382800/4588595) loss:2.870 lr:0.0000100 epoch_Time:26655.0min: [2024-01-04 10:05:53,068][model8_pretrain.py][INFO] Epoch:[0/2](382800/4588595) loss:3.246 lr:0.0000100 epoch_Time:26655.0min: [2024-01-04 10:05:53,069][model8_pretrain.py][INFO] Epoch:[0/2](382800/4588595) loss:2.737 lr:0.0000100 epoch_Time:26655.0min: [2024-01-04 10:05:53,069][model8_pretrain.py][INFO] Epoch:[0/2](382800/4588595) loss:3.134 lr:0.0000100 epoch_Time:26655.0min: [2024-01-04 10:06:30,006][model8_pretrain.py][INFO] Epoch:[0/2](382900/4588595) loss:2.794 lr:0.0000100 epoch_Time:26655.0min: [2024-01-04 10:06:30,006][model8_pretrain.py][INFO] Epoch:[0/2](382900/4588595) loss:2.832 lr:0.0000100 epoch_Time:26655.0min: [2024-01-04 10:06:30,006][model8_pretrain.py][INFO] Epoch:[0/2](382900/4588595) loss:2.874 lr:0.0000100 epoch_Time:26655.0min: [2024-01-04 10:06:30,006][model8_pretrain.py][INFO] Epoch:[0/2](382900/4588595) loss:3.305 lr:0.0000100 epoch_Time:26655.0min: [2024-01-04 10:06:30,006][model8_pretrain.py][INFO] Epoch:[0/2](382900/4588595) loss:2.657 lr:0.0000100 epoch_Time:26655.0min: [2024-01-04 
10:06:30,006][model8_pretrain.py][INFO] Epoch:[0/2](382900/4588595) loss:2.873 lr:0.0000100 epoch_Time:26655.0min: [2024-01-04 10:06:30,007][model8_pretrain.py][INFO] Epoch:[0/2](382900/4588595) loss:2.923 lr:0.0000100 epoch_Time:26655.0min: [2024-01-04 10:06:30,008][model8_pretrain.py][INFO] Epoch:[0/2](382900/4588595) loss:2.228 lr:0.0000100 epoch_Time:26655.0min: [2024-01-04 10:07:06,957][model8_pretrain.py][INFO] Epoch:[0/2](383000/4588595) loss:2.618 lr:0.0000100 epoch_Time:26653.0min: [2024-01-04 10:07:06,957][model8_pretrain.py][INFO] Epoch:[0/2](383000/4588595) loss:3.052 lr:0.0000100 epoch_Time:26653.0min: [2024-01-04 10:07:06,957][model8_pretrain.py][INFO] Epoch:[0/2](383000/4588595) loss:2.848 lr:0.0000100 epoch_Time:26653.0min: [2024-01-04 10:07:06,957][model8_pretrain.py][INFO] Epoch:[0/2](383000/4588595) loss:3.009 lr:0.0000100 epoch_Time:26653.0min: [2024-01-04 10:07:06,957][model8_pretrain.py][INFO] Epoch:[0/2](383000/4588595) loss:2.915 lr:0.0000100 epoch_Time:26653.0min: [2024-01-04 10:07:06,957][model8_pretrain.py][INFO] Epoch:[0/2](383000/4588595) loss:2.914 lr:0.0000100 epoch_Time:26653.0min: [2024-01-04 10:07:06,957][model8_pretrain.py][INFO] Epoch:[0/2](383000/4588595) loss:3.323 lr:0.0000100 epoch_Time:26653.0min: [2024-01-04 10:07:06,957][model8_pretrain.py][INFO] Epoch:[0/2](383000/4588595) loss:3.300 lr:0.0000100 epoch_Time:26653.0min: [2024-01-04 10:07:54,385][model8_pretrain.py][INFO] Epoch:[0/2](383100/4588595) loss:2.517 lr:0.0000100 epoch_Time:26654.0min: [2024-01-04 10:07:54,385][model8_pretrain.py][INFO] Epoch:[0/2](383100/4588595) loss:2.947 lr:0.0000100 epoch_Time:26654.0min: [2024-01-04 10:07:54,385][model8_pretrain.py][INFO] Epoch:[0/2](383100/4588595) loss:2.695 lr:0.0000100 epoch_Time:26654.0min: [2024-01-04 10:07:54,385][model8_pretrain.py][INFO] Epoch:[0/2](383100/4588595) loss:2.467 lr:0.0000100 epoch_Time:26654.0min: [2024-01-04 10:07:54,385][model8_pretrain.py][INFO] Epoch:[0/2](383100/4588595) loss:3.305 lr:0.0000100 
epoch_Time:26654.0min: [2024-01-04 10:07:54,385][model8_pretrain.py][INFO] Epoch:[0/2](383100/4588595) loss:2.289 lr:0.0000100 epoch_Time:26654.0min: [2024-01-04 10:07:54,385][model8_pretrain.py][INFO] Epoch:[0/2](383100/4588595) loss:3.367 lr:0.0000100 epoch_Time:26654.0min: [2024-01-04 10:07:54,385][model8_pretrain.py][INFO] Epoch:[0/2](383100/4588595) loss:2.867 lr:0.0000100 epoch_Time:26654.0min: [2024-01-04 10:08:31,306][model8_pretrain.py][INFO] Epoch:[0/2](383200/4588595) loss:3.295 lr:0.0000100 epoch_Time:26654.0min: [2024-01-04 10:08:31,306][model8_pretrain.py][INFO] Epoch:[0/2](383200/4588595) loss:2.795 lr:0.0000100 epoch_Time:26654.0min: [2024-01-04 10:08:31,306][model8_pretrain.py][INFO] Epoch:[0/2](383200/4588595) loss:2.821 lr:0.0000100 epoch_Time:26654.0min: [2024-01-04 10:08:31,306][model8_pretrain.py][INFO] Epoch:[0/2](383200/4588595) loss:3.200 lr:0.0000100 epoch_Time:26654.0min: [2024-01-04 10:08:31,306][model8_pretrain.py][INFO] Epoch:[0/2](383200/4588595) loss:3.183 lr:0.0000100 epoch_Time:26654.0min: [2024-01-04 10:08:31,306][model8_pretrain.py][INFO] Epoch:[0/2](383200/4588595) loss:2.732 lr:0.0000100 epoch_Time:26654.0min: [2024-01-04 10:08:31,306][model8_pretrain.py][INFO] Epoch:[0/2](383200/4588595) loss:3.067 lr:0.0000100 epoch_Time:26654.0min: [2024-01-04 10:08:31,306][model8_pretrain.py][INFO] Epoch:[0/2](383200/4588595) loss:2.469 lr:0.0000100 epoch_Time:26654.0min: [2024-01-04 10:09:08,232][model8_pretrain.py][INFO] Epoch:[0/2](383300/4588595) loss:2.740 lr:0.0000100 epoch_Time:26653.0min: [2024-01-04 10:09:08,233][model8_pretrain.py][INFO] Epoch:[0/2](383300/4588595) loss:2.463 lr:0.0000100 epoch_Time:26653.0min: [2024-01-04 10:09:08,233][model8_pretrain.py][INFO] Epoch:[0/2](383300/4588595) loss:3.376 lr:0.0000100 epoch_Time:26653.0min: [2024-01-04 10:09:08,233][model8_pretrain.py][INFO] Epoch:[0/2](383300/4588595) loss:3.239 lr:0.0000100 epoch_Time:26653.0min: [2024-01-04 10:09:08,233][model8_pretrain.py][INFO] 
Epoch:[0/2](383300/4588595) loss:2.458 lr:0.0000100 epoch_Time:26653.0min: [2024-01-04 10:09:08,233][model8_pretrain.py][INFO] Epoch:[0/2](383300/4588595) loss:2.779 lr:0.0000100 epoch_Time:26653.0min: [2024-01-04 10:09:08,233][model8_pretrain.py][INFO] Epoch:[0/2](383300/4588595) loss:2.827 lr:0.0000100 epoch_Time:26653.0min: [2024-01-04 10:09:08,234][model8_pretrain.py][INFO] Epoch:[0/2](383300/4588595) loss:2.843 lr:0.0000100 epoch_Time:26653.0min: [2024-01-04 10:09:45,165][model8_pretrain.py][INFO] Epoch:[0/2](383400/4588595) loss:2.720 lr:0.0000100 epoch_Time:26653.0min: [2024-01-04 10:09:45,165][model8_pretrain.py][INFO] Epoch:[0/2](383400/4588595) loss:2.787 lr:0.0000100 epoch_Time:26653.0min: [2024-01-04 10:09:45,165][model8_pretrain.py][INFO] Epoch:[0/2](383400/4588595) loss:2.082 lr:0.0000100 epoch_Time:26653.0min: [2024-01-04 10:09:45,165][model8_pretrain.py][INFO] Epoch:[0/2](383400/4588595) loss:2.948 lr:0.0000100 epoch_Time:26653.0min: [2024-01-04 10:09:45,166][model8_pretrain.py][INFO] Epoch:[0/2](383400/4588595) loss:3.334 lr:0.0000100 epoch_Time:26653.0min: [2024-01-04 10:09:45,166][model8_pretrain.py][INFO] Epoch:[0/2](383400/4588595) loss:2.828 lr:0.0000100 epoch_Time:26653.0min: [2024-01-04 10:09:45,166][model8_pretrain.py][INFO] Epoch:[0/2](383400/4588595) loss:2.536 lr:0.0000100 epoch_Time:26653.0min: [2024-01-04 10:09:45,166][model8_pretrain.py][INFO] Epoch:[0/2](383400/4588595) loss:2.758 lr:0.0000100 epoch_Time:26653.0min: [2024-01-04 10:10:22,100][model8_pretrain.py][INFO] Epoch:[0/2](383500/4588595) loss:3.344 lr:0.0000100 epoch_Time:26651.0min: [2024-01-04 10:10:22,100][model8_pretrain.py][INFO] Epoch:[0/2](383500/4588595) loss:2.826 lr:0.0000100 epoch_Time:26651.0min: [2024-01-04 10:10:22,100][model8_pretrain.py][INFO] Epoch:[0/2](383500/4588595) loss:2.691 lr:0.0000100 epoch_Time:26651.0min: [2024-01-04 10:10:22,100][model8_pretrain.py][INFO] Epoch:[0/2](383500/4588595) loss:2.745 lr:0.0000100 epoch_Time:26651.0min: [2024-01-04 
10:10:22,101][model8_pretrain.py][INFO] Epoch:[0/2](383500/4588595) loss:2.893 lr:0.0000100 epoch_Time:26651.0min: [2024-01-04 10:10:22,101][model8_pretrain.py][INFO] Epoch:[0/2](383500/4588595) loss:2.739 lr:0.0000100 epoch_Time:26651.0min: [2024-01-04 10:10:22,101][model8_pretrain.py][INFO] Epoch:[0/2](383500/4588595) loss:3.136 lr:0.0000100 epoch_Time:26651.0min: [2024-01-04 10:10:22,101][model8_pretrain.py][INFO] Epoch:[0/2](383500/4588595) loss:2.777 lr:0.0000100 epoch_Time:26651.0min: [2024-01-04 10:10:59,036][model8_pretrain.py][INFO] Epoch:[0/2](383600/4588595) loss:3.095 lr:0.0000100 epoch_Time:26650.0min: [2024-01-04 10:10:59,036][model8_pretrain.py][INFO] Epoch:[0/2](383600/4588595) loss:2.189 lr:0.0000100 epoch_Time:26650.0min: [2024-01-04 10:10:59,036][model8_pretrain.py][INFO] Epoch:[0/2](383600/4588595) loss:3.014 lr:0.0000100 epoch_Time:26650.0min: [2024-01-04 10:10:59,036][model8_pretrain.py][INFO] Epoch:[0/2](383600/4588595) loss:3.017 lr:0.0000100 epoch_Time:26650.0min: [2024-01-04 10:10:59,036][model8_pretrain.py][INFO] Epoch:[0/2](383600/4588595) loss:3.352 lr:0.0000100 epoch_Time:26650.0min: [2024-01-04 10:10:59,036][model8_pretrain.py][INFO] Epoch:[0/2](383600/4588595) loss:2.520 lr:0.0000100 epoch_Time:26650.0min: [2024-01-04 10:10:59,036][model8_pretrain.py][INFO] Epoch:[0/2](383600/4588595) loss:2.664 lr:0.0000100 epoch_Time:26650.0min: [2024-01-04 10:10:59,036][model8_pretrain.py][INFO] Epoch:[0/2](383600/4588595) loss:2.993 lr:0.0000100 epoch_Time:26650.0min: [2024-01-04 10:11:35,947][model8_pretrain.py][INFO] Epoch:[0/2](383700/4588595) loss:2.722 lr:0.0000100 epoch_Time:26650.0min: [2024-01-04 10:11:35,947][model8_pretrain.py][INFO] Epoch:[0/2](383700/4588595) loss:3.049 lr:0.0000100 epoch_Time:26650.0min: [2024-01-04 10:11:35,947][model8_pretrain.py][INFO] Epoch:[0/2](383700/4588595) loss:3.312 lr:0.0000100 epoch_Time:26650.0min: [2024-01-04 10:11:35,947][model8_pretrain.py][INFO] Epoch:[0/2](383700/4588595) loss:2.551 lr:0.0000100 
epoch_Time:26650.0min: [2024-01-04 10:11:35,947][model8_pretrain.py][INFO] Epoch:[0/2](383700/4588595) loss:2.727 lr:0.0000100 epoch_Time:26650.0min: [2024-01-04 10:11:35,947][model8_pretrain.py][INFO] Epoch:[0/2](383700/4588595) loss:3.118 lr:0.0000100 epoch_Time:26650.0min: [2024-01-04 10:11:35,947][model8_pretrain.py][INFO] Epoch:[0/2](383700/4588595) loss:2.669 lr:0.0000100 epoch_Time:26650.0min: [2024-01-04 10:11:35,948][model8_pretrain.py][INFO] Epoch:[0/2](383700/4588595) loss:2.777 lr:0.0000100 epoch_Time:26650.0min: [2024-01-04 10:12:12,887][model8_pretrain.py][INFO] Epoch:[0/2](383800/4588595) loss:2.797 lr:0.0000100 epoch_Time:26649.0min: [2024-01-04 10:12:12,887][model8_pretrain.py][INFO] Epoch:[0/2](383800/4588595) loss:2.884 lr:0.0000100 epoch_Time:26649.0min: [2024-01-04 10:12:12,888][model8_pretrain.py][INFO] Epoch:[0/2](383800/4588595) loss:2.821 lr:0.0000100 epoch_Time:26649.0min: [2024-01-04 10:12:12,888][model8_pretrain.py][INFO] Epoch:[0/2](383800/4588595) loss:3.233 lr:0.0000100 epoch_Time:26649.0min: [2024-01-04 10:12:12,888][model8_pretrain.py][INFO] Epoch:[0/2](383800/4588595) loss:2.378 lr:0.0000100 epoch_Time:26649.0min: [2024-01-04 10:12:12,888][model8_pretrain.py][INFO] Epoch:[0/2](383800/4588595) loss:3.241 lr:0.0000100 epoch_Time:26649.0min: [2024-01-04 10:12:12,888][model8_pretrain.py][INFO] Epoch:[0/2](383800/4588595) loss:2.520 lr:0.0000100 epoch_Time:26649.0min: [2024-01-04 10:12:12,888][model8_pretrain.py][INFO] Epoch:[0/2](383800/4588595) loss:2.664 lr:0.0000100 epoch_Time:26649.0min: [2024-01-04 10:13:00,110][model8_pretrain.py][INFO] Epoch:[0/2](383900/4588595) loss:2.904 lr:0.0000100 epoch_Time:26650.0min: [2024-01-04 10:13:00,110][model8_pretrain.py][INFO] Epoch:[0/2](383900/4588595) loss:2.997 lr:0.0000100 epoch_Time:26650.0min: [2024-01-04 10:13:00,110][model8_pretrain.py][INFO] Epoch:[0/2](383900/4588595) loss:2.016 lr:0.0000100 epoch_Time:26650.0min: [2024-01-04 10:13:00,110][model8_pretrain.py][INFO] 
Epoch:[0/2](383900/4588595) loss:3.240 lr:0.0000100 epoch_Time:26650.0min: [2024-01-04 10:13:00,110][model8_pretrain.py][INFO] Epoch:[0/2](383900/4588595) loss:2.481 lr:0.0000100 epoch_Time:26650.0min: [2024-01-04 10:13:00,110][model8_pretrain.py][INFO] Epoch:[0/2](383900/4588595) loss:3.001 lr:0.0000100 epoch_Time:26650.0min: [2024-01-04 10:13:00,110][model8_pretrain.py][INFO] Epoch:[0/2](383900/4588595) loss:3.374 lr:0.0000100 epoch_Time:26650.0min: [2024-01-04 10:13:00,110][model8_pretrain.py][INFO] Epoch:[0/2](383900/4588595) loss:3.259 lr:0.0000100 epoch_Time:26650.0min: [2024-01-04 10:13:37,028][model8_pretrain.py][INFO] Epoch:[0/2](384000/4588595) loss:3.206 lr:0.0000100 epoch_Time:26649.0min: [2024-01-04 10:13:37,028][model8_pretrain.py][INFO] Epoch:[0/2](384000/4588595) loss:2.821 lr:0.0000100 epoch_Time:26649.0min: [2024-01-04 10:13:37,028][model8_pretrain.py][INFO] Epoch:[0/2](384000/4588595) loss:2.736 lr:0.0000100 epoch_Time:26649.0min: [2024-01-04 10:13:37,028][model8_pretrain.py][INFO] Epoch:[0/2](384000/4588595) loss:2.963 lr:0.0000100 epoch_Time:26649.0min: [2024-01-04 10:13:37,028][model8_pretrain.py][INFO] Epoch:[0/2](384000/4588595) loss:2.710 lr:0.0000100 epoch_Time:26649.0min: [2024-01-04 10:13:37,028][model8_pretrain.py][INFO] Epoch:[0/2](384000/4588595) loss:3.002 lr:0.0000100 epoch_Time:26649.0min: [2024-01-04 10:13:37,028][model8_pretrain.py][INFO] Epoch:[0/2](384000/4588595) loss:2.746 lr:0.0000100 epoch_Time:26649.0min: [2024-01-04 10:13:37,028][model8_pretrain.py][INFO] Epoch:[0/2](384000/4588595) loss:3.235 lr:0.0000100 epoch_Time:26649.0min: [2024-01-04 10:14:13,974][model8_pretrain.py][INFO] Epoch:[0/2](384100/4588595) loss:2.952 lr:0.0000100 epoch_Time:26648.0min: [2024-01-04 10:14:13,974][model8_pretrain.py][INFO] Epoch:[0/2](384100/4588595) loss:3.283 lr:0.0000100 epoch_Time:26648.0min: [2024-01-04 10:14:13,974][model8_pretrain.py][INFO] Epoch:[0/2](384100/4588595) loss:3.223 lr:0.0000100 epoch_Time:26648.0min: [2024-01-04 
10:14:13,974][model8_pretrain.py][INFO] Epoch:[0/2](384100/4588595) loss:2.895 lr:0.0000100 epoch_Time:26648.0min: [2024-01-04 10:14:13,974][model8_pretrain.py][INFO] Epoch:[0/2](384100/4588595) loss:3.163 lr:0.0000100 epoch_Time:26648.0min: [2024-01-04 10:14:13,974][model8_pretrain.py][INFO] Epoch:[0/2](384100/4588595) loss:2.786 lr:0.0000100 epoch_Time:26648.0min: [2024-01-04 10:14:13,974][model8_pretrain.py][INFO] Epoch:[0/2](384100/4588595) loss:2.256 lr:0.0000100 epoch_Time:26648.0min: [2024-01-04 10:14:13,975][model8_pretrain.py][INFO] Epoch:[0/2](384100/4588595) loss:2.554 lr:0.0000100 epoch_Time:26648.0min: [2024-01-04 10:14:50,908][model8_pretrain.py][INFO] Epoch:[0/2](384200/4588595) loss:3.015 lr:0.0000100 epoch_Time:26647.0min: [2024-01-04 10:14:50,908][model8_pretrain.py][INFO] Epoch:[0/2](384200/4588595) loss:3.059 lr:0.0000100 epoch_Time:26647.0min: [2024-01-04 10:14:50,908][model8_pretrain.py][INFO] Epoch:[0/2](384200/4588595) loss:2.384 lr:0.0000100 epoch_Time:26647.0min: [2024-01-04 10:14:50,908][model8_pretrain.py][INFO] Epoch:[0/2](384200/4588595) loss:2.789 lr:0.0000100 epoch_Time:26647.0min: [2024-01-04 10:14:50,908][model8_pretrain.py][INFO] Epoch:[0/2](384200/4588595) loss:2.979 lr:0.0000100 epoch_Time:26647.0min: [2024-01-04 10:14:50,908][model8_pretrain.py][INFO] Epoch:[0/2](384200/4588595) loss:2.570 lr:0.0000100 epoch_Time:26647.0min: [2024-01-04 10:14:50,908][model8_pretrain.py][INFO] Epoch:[0/2](384200/4588595) loss:2.799 lr:0.0000100 epoch_Time:26647.0min: [2024-01-04 10:14:50,908][model8_pretrain.py][INFO] Epoch:[0/2](384200/4588595) loss:3.288 lr:0.0000100 epoch_Time:26647.0min: [2024-01-04 10:15:27,837][model8_pretrain.py][INFO] Epoch:[0/2](384300/4588595) loss:2.443 lr:0.0000100 epoch_Time:26647.0min: [2024-01-04 10:15:27,837][model8_pretrain.py][INFO] Epoch:[0/2](384300/4588595) loss:2.840 lr:0.0000100 epoch_Time:26647.0min: [2024-01-04 10:15:27,837][model8_pretrain.py][INFO] Epoch:[0/2](384300/4588595) loss:2.591 lr:0.0000100 
epoch_Time:26647.0min: [2024-01-04 10:15:27,837][model8_pretrain.py][INFO] Epoch:[0/2](384300/4588595) loss:3.286 lr:0.0000100 epoch_Time:26647.0min: [2024-01-04 10:15:27,837][model8_pretrain.py][INFO] Epoch:[0/2](384300/4588595) loss:2.915 lr:0.0000100 epoch_Time:26647.0min: [2024-01-04 10:15:27,837][model8_pretrain.py][INFO] Epoch:[0/2](384300/4588595) loss:3.138 lr:0.0000100 epoch_Time:26647.0min: [2024-01-04 10:15:27,837][model8_pretrain.py][INFO] Epoch:[0/2](384300/4588595) loss:2.870 lr:0.0000100 epoch_Time:26647.0min: [2024-01-04 10:15:27,837][model8_pretrain.py][INFO] Epoch:[0/2](384300/4588595) loss:2.934 lr:0.0000100 epoch_Time:26647.0min: [2024-01-04 10:16:04,781][model8_pretrain.py][INFO] Epoch:[0/2](384400/4588595) loss:3.294 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:16:04,781][model8_pretrain.py][INFO] Epoch:[0/2](384400/4588595) loss:2.569 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:16:04,781][model8_pretrain.py][INFO] Epoch:[0/2](384400/4588595) loss:2.885 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:16:04,781][model8_pretrain.py][INFO] Epoch:[0/2](384400/4588595) loss:3.295 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:16:04,781][model8_pretrain.py][INFO] Epoch:[0/2](384400/4588595) loss:3.127 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:16:04,781][model8_pretrain.py][INFO] Epoch:[0/2](384400/4588595) loss:2.841 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:16:04,781][model8_pretrain.py][INFO] Epoch:[0/2](384400/4588595) loss:3.253 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:16:04,781][model8_pretrain.py][INFO] Epoch:[0/2](384400/4588595) loss:2.650 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:16:41,773][model8_pretrain.py][INFO] Epoch:[0/2](384500/4588595) loss:3.069 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:16:41,773][model8_pretrain.py][INFO] Epoch:[0/2](384500/4588595) loss:3.196 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:16:41,773][model8_pretrain.py][INFO] 
Epoch:[0/2](384500/4588595) loss:2.625 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:16:41,773][model8_pretrain.py][INFO] Epoch:[0/2](384500/4588595) loss:2.993 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:16:41,773][model8_pretrain.py][INFO] Epoch:[0/2](384500/4588595) loss:3.369 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:16:41,773][model8_pretrain.py][INFO] Epoch:[0/2](384500/4588595) loss:2.803 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:16:41,773][model8_pretrain.py][INFO] Epoch:[0/2](384500/4588595) loss:2.452 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:16:41,773][model8_pretrain.py][INFO] Epoch:[0/2](384500/4588595) loss:2.651 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:17:18,711][model8_pretrain.py][INFO] Epoch:[0/2](384600/4588595) loss:2.817 lr:0.0000100 epoch_Time:26644.0min: [2024-01-04 10:17:18,711][model8_pretrain.py][INFO] Epoch:[0/2](384600/4588595) loss:2.497 lr:0.0000100 epoch_Time:26644.0min: [2024-01-04 10:17:18,711][model8_pretrain.py][INFO] Epoch:[0/2](384600/4588595) loss:3.055 lr:0.0000100 epoch_Time:26644.0min: [2024-01-04 10:17:18,711][model8_pretrain.py][INFO] Epoch:[0/2](384600/4588595) loss:3.074 lr:0.0000100 epoch_Time:26644.0min: [2024-01-04 10:17:18,711][model8_pretrain.py][INFO] Epoch:[0/2](384600/4588595) loss:3.015 lr:0.0000100 epoch_Time:26644.0min: [2024-01-04 10:17:18,711][model8_pretrain.py][INFO] Epoch:[0/2](384600/4588595) loss:2.601 lr:0.0000100 epoch_Time:26644.0min: [2024-01-04 10:17:18,711][model8_pretrain.py][INFO] Epoch:[0/2](384600/4588595) loss:3.205 lr:0.0000100 epoch_Time:26644.0min: [2024-01-04 10:17:18,711][model8_pretrain.py][INFO] Epoch:[0/2](384600/4588595) loss:2.468 lr:0.0000100 epoch_Time:26644.0min: [2024-01-04 10:18:05,812][model8_pretrain.py][INFO] Epoch:[0/2](384700/4588595) loss:2.978 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:18:05,812][model8_pretrain.py][INFO] Epoch:[0/2](384700/4588595) loss:2.870 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 
10:18:05,812][model8_pretrain.py][INFO] Epoch:[0/2](384700/4588595) loss:2.738 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:18:05,812][model8_pretrain.py][INFO] Epoch:[0/2](384700/4588595) loss:2.836 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:18:05,812][model8_pretrain.py][INFO] Epoch:[0/2](384700/4588595) loss:2.756 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:18:05,812][model8_pretrain.py][INFO] Epoch:[0/2](384700/4588595) loss:3.152 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:18:05,812][model8_pretrain.py][INFO] Epoch:[0/2](384700/4588595) loss:2.695 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:18:05,812][model8_pretrain.py][INFO] Epoch:[0/2](384700/4588595) loss:2.810 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:18:42,723][model8_pretrain.py][INFO] Epoch:[0/2](384800/4588595) loss:2.742 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:18:42,723][model8_pretrain.py][INFO] Epoch:[0/2](384800/4588595) loss:3.133 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:18:42,723][model8_pretrain.py][INFO] Epoch:[0/2](384800/4588595) loss:3.343 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:18:42,723][model8_pretrain.py][INFO] Epoch:[0/2](384800/4588595) loss:3.350 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:18:42,723][model8_pretrain.py][INFO] Epoch:[0/2](384800/4588595) loss:2.565 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:18:42,723][model8_pretrain.py][INFO] Epoch:[0/2](384800/4588595) loss:2.678 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:18:42,723][model8_pretrain.py][INFO] Epoch:[0/2](384800/4588595) loss:2.897 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:18:42,723][model8_pretrain.py][INFO] Epoch:[0/2](384800/4588595) loss:3.096 lr:0.0000100 epoch_Time:26645.0min: [2024-01-04 10:19:19,646][model8_pretrain.py][INFO] Epoch:[0/2](384900/4588595) loss:2.985 lr:0.0000100 epoch_Time:26643.0min: [2024-01-04 10:19:19,646][model8_pretrain.py][INFO] Epoch:[0/2](384900/4588595) loss:2.924 lr:0.0000100 
epoch_Time:26643.0min: [2024-01-04 10:19:19,646][model8_pretrain.py][INFO] Epoch:[0/2](384900/4588595) loss:3.166 lr:0.0000100 epoch_Time:26643.0min: [2024-01-04 10:19:19,646][model8_pretrain.py][INFO] Epoch:[0/2](384900/4588595) loss:3.230 lr:0.0000100 epoch_Time:26643.0min: [2024-01-04 10:19:19,646][model8_pretrain.py][INFO] Epoch:[0/2](384900/4588595) loss:2.723 lr:0.0000100 epoch_Time:26643.0min: [2024-01-04 10:19:19,646][model8_pretrain.py][INFO] Epoch:[0/2](384900/4588595) loss:2.820 lr:0.0000100 epoch_Time:26643.0min: [2024-01-04 10:19:19,646][model8_pretrain.py][INFO] Epoch:[0/2](384900/4588595) loss:3.062 lr:0.0000100 epoch_Time:26643.0min: [2024-01-04 10:19:19,647][model8_pretrain.py][INFO] Epoch:[0/2](384900/4588595) loss:2.951 lr:0.0000100 epoch_Time:26643.0min: [2024-01-04 10:19:56,577][model8_pretrain.py][INFO] Epoch:[0/2](385000/4588595) loss:2.986 lr:0.0000100 epoch_Time:26642.0min: [2024-01-04 10:19:56,577][model8_pretrain.py][INFO] Epoch:[0/2](385000/4588595) loss:3.010 lr:0.0000100 epoch_Time:26642.0min: [2024-01-04 10:19:56,577][model8_pretrain.py][INFO] Epoch:[0/2](385000/4588595) loss:2.754 lr:0.0000100 epoch_Time:26642.0min: [2024-01-04 10:19:56,577][model8_pretrain.py][INFO] Epoch:[0/2](385000/4588595) loss:3.403 lr:0.0000100 epoch_Time:26642.0min: [2024-01-04 10:19:56,577][model8_pretrain.py][INFO] Epoch:[0/2](385000/4588595) loss:2.806 lr:0.0000100 epoch_Time:26642.0min: [2024-01-04 10:19:56,577][model8_pretrain.py][INFO] Epoch:[0/2](385000/4588595) loss:3.129 lr:0.0000100 epoch_Time:26642.0min: [2024-01-04 10:19:56,577][model8_pretrain.py][INFO] Epoch:[0/2](385000/4588595) loss:2.894 lr:0.0000100 epoch_Time:26642.0min: [2024-01-04 10:19:56,577][model8_pretrain.py][INFO] Epoch:[0/2](385000/4588595) loss:3.135 lr:0.0000100 epoch_Time:26642.0min: [2024-01-04 10:20:33,507][model8_pretrain.py][INFO] Epoch:[0/2](385100/4588595) loss:2.758 lr:0.0000100 epoch_Time:26642.0min: [2024-01-04 10:20:33,507][model8_pretrain.py][INFO] 
Epoch:[0/2](385100/4588595) loss:3.295 lr:0.0000100 epoch_Time:26642.0min: [2024-01-04 10:20:33,507][model8_pretrain.py][INFO] Epoch:[0/2](385100/4588595) loss:3.031 lr:0.0000100 epoch_Time:26642.0min: [2024-01-04 10:20:33,507][model8_pretrain.py][INFO] Epoch:[0/2](385100/4588595) loss:2.631 lr:0.0000100 epoch_Time:26642.0min: [2024-01-04 10:20:33,507][model8_pretrain.py][INFO] Epoch:[0/2](385100/4588595) loss:3.289 lr:0.0000100 epoch_Time:26642.0min: [2024-01-04 10:20:33,507][model8_pretrain.py][INFO] Epoch:[0/2](385100/4588595) loss:3.013 lr:0.0000100 epoch_Time:26642.0min: [2024-01-04 10:20:33,507][model8_pretrain.py][INFO] Epoch:[0/2](385100/4588595) loss:2.873 lr:0.0000100 epoch_Time:26642.0min: [2024-01-04 10:20:33,507][model8_pretrain.py][INFO] Epoch:[0/2](385100/4588595) loss:2.764 lr:0.0000100 epoch_Time:26642.0min: [2024-01-04 10:21:10,451][model8_pretrain.py][INFO] Epoch:[0/2](385200/4588595) loss:3.299 lr:0.0000100 epoch_Time:26641.0min: [2024-01-04 10:21:10,451][model8_pretrain.py][INFO] Epoch:[0/2](385200/4588595) loss:2.782 lr:0.0000100 epoch_Time:26641.0min: [2024-01-04 10:21:10,451][model8_pretrain.py][INFO] Epoch:[0/2](385200/4588595) loss:3.054 lr:0.0000100 epoch_Time:26641.0min: [2024-01-04 10:21:10,451][model8_pretrain.py][INFO] Epoch:[0/2](385200/4588595) loss:2.674 lr:0.0000100 epoch_Time:26641.0min: [2024-01-04 10:21:10,451][model8_pretrain.py][INFO] Epoch:[0/2](385200/4588595) loss:3.051 lr:0.0000100 epoch_Time:26641.0min: [2024-01-04 10:21:10,451][model8_pretrain.py][INFO] Epoch:[0/2](385200/4588595) loss:2.670 lr:0.0000100 epoch_Time:26641.0min: [2024-01-04 10:21:10,451][model8_pretrain.py][INFO] Epoch:[0/2](385200/4588595) loss:2.675 lr:0.0000100 epoch_Time:26641.0min: [2024-01-04 10:21:10,451][model8_pretrain.py][INFO] Epoch:[0/2](385200/4588595) loss:2.428 lr:0.0000100 epoch_Time:26641.0min: [2024-01-04 10:21:47,406][model8_pretrain.py][INFO] Epoch:[0/2](385300/4588595) loss:3.313 lr:0.0000100 epoch_Time:26641.0min: [2024-01-04 
10:21:47,406][model8_pretrain.py][INFO] Epoch:[0/2](385300/4588595) loss:3.184 lr:0.0000100 epoch_Time:26641.0min: [2024-01-04 10:21:47,406][model8_pretrain.py][INFO] Epoch:[0/2](385300/4588595) loss:3.466 lr:0.0000100 epoch_Time:26641.0min: [2024-01-04 10:21:47,407][model8_pretrain.py][INFO] Epoch:[0/2](385300/4588595) loss:3.029 lr:0.0000100 epoch_Time:26641.0min: [2024-01-04 10:21:47,407][model8_pretrain.py][INFO] Epoch:[0/2](385300/4588595) loss:3.141 lr:0.0000100 epoch_Time:26641.0min: [2024-01-04 10:21:47,407][model8_pretrain.py][INFO] Epoch:[0/2](385300/4588595) loss:2.911 lr:0.0000100 epoch_Time:26641.0min: [2024-01-04 10:21:47,407][model8_pretrain.py][INFO] Epoch:[0/2](385300/4588595) loss:2.850 lr:0.0000100 epoch_Time:26641.0min: [2024-01-04 10:21:47,407][model8_pretrain.py][INFO] Epoch:[0/2](385300/4588595) loss:2.209 lr:0.0000100 epoch_Time:26641.0min: [2024-01-04 10:22:24,382][model8_pretrain.py][INFO] Epoch:[0/2](385400/4588595) loss:2.657 lr:0.0000100 epoch_Time:26639.0min: [2024-01-04 10:22:24,382][model8_pretrain.py][INFO] Epoch:[0/2](385400/4588595) loss:2.973 lr:0.0000100 epoch_Time:26639.0min: [2024-01-04 10:22:24,382][model8_pretrain.py][INFO] Epoch:[0/2](385400/4588595) loss:2.796 lr:0.0000100 epoch_Time:26639.0min: [2024-01-04 10:22:24,382][model8_pretrain.py][INFO] Epoch:[0/2](385400/4588595) loss:2.868 lr:0.0000100 epoch_Time:26639.0min: [2024-01-04 10:22:24,382][model8_pretrain.py][INFO] Epoch:[0/2](385400/4588595) loss:2.636 lr:0.0000100 epoch_Time:26639.0min: [2024-01-04 10:22:24,382][model8_pretrain.py][INFO] Epoch:[0/2](385400/4588595) loss:2.744 lr:0.0000100 epoch_Time:26639.0min: [2024-01-04 10:22:24,382][model8_pretrain.py][INFO] Epoch:[0/2](385400/4588595) loss:2.884 lr:0.0000100 epoch_Time:26639.0min: [2024-01-04 10:22:24,383][model8_pretrain.py][INFO] Epoch:[0/2](385400/4588595) loss:2.840 lr:0.0000100 epoch_Time:26639.0min: [2024-01-04 10:23:11,507][model8_pretrain.py][INFO] Epoch:[0/2](385500/4588595) loss:3.359 lr:0.0000100 
epoch_Time:26640.0min: [2024-01-04 10:23:11,507][model8_pretrain.py][INFO] Epoch:[0/2](385500/4588595) loss:2.917 lr:0.0000100 epoch_Time:26640.0min: [2024-01-04 10:23:11,507][model8_pretrain.py][INFO] Epoch:[0/2](385500/4588595) loss:3.064 lr:0.0000100 epoch_Time:26640.0min: [2024-01-04 10:23:11,507][model8_pretrain.py][INFO] Epoch:[0/2](385500/4588595) loss:3.351 lr:0.0000100 epoch_Time:26640.0min: [2024-01-04 10:23:11,507][model8_pretrain.py][INFO] Epoch:[0/2](385500/4588595) loss:2.996 lr:0.0000100 epoch_Time:26640.0min: [2024-01-04 10:23:11,507][model8_pretrain.py][INFO] Epoch:[0/2](385500/4588595) loss:2.645 lr:0.0000100 epoch_Time:26640.0min: [2024-01-04 10:23:11,507][model8_pretrain.py][INFO] Epoch:[0/2](385500/4588595) loss:3.157 lr:0.0000100 epoch_Time:26640.0min: [2024-01-04 10:23:11,508][model8_pretrain.py][INFO] Epoch:[0/2](385500/4588595) loss:2.996 lr:0.0000100 epoch_Time:26640.0min: [2024-01-04 10:23:48,430][model8_pretrain.py][INFO] Epoch:[0/2](385600/4588595) loss:3.105 lr:0.0000100 epoch_Time:26639.0min: [2024-01-04 10:23:48,430][model8_pretrain.py][INFO] Epoch:[0/2](385600/4588595) loss:2.552 lr:0.0000100 epoch_Time:26639.0min: [2024-01-04 10:23:48,430][model8_pretrain.py][INFO] Epoch:[0/2](385600/4588595) loss:2.906 lr:0.0000100 epoch_Time:26639.0min: [2024-01-04 10:23:48,430][model8_pretrain.py][INFO] Epoch:[0/2](385600/4588595) loss:2.760 lr:0.0000100 epoch_Time:26639.0min: [2024-01-04 10:23:48,430][model8_pretrain.py][INFO] Epoch:[0/2](385600/4588595) loss:2.965 lr:0.0000100 epoch_Time:26639.0min: [2024-01-04 10:23:48,430][model8_pretrain.py][INFO] Epoch:[0/2](385600/4588595) loss:2.880 lr:0.0000100 epoch_Time:26639.0min: [2024-01-04 10:23:48,430][model8_pretrain.py][INFO] Epoch:[0/2](385600/4588595) loss:2.797 lr:0.0000100 epoch_Time:26639.0min: [2024-01-04 10:23:48,430][model8_pretrain.py][INFO] Epoch:[0/2](385600/4588595) loss:3.038 lr:0.0000100 epoch_Time:26639.0min: [2024-01-04 10:24:25,358][model8_pretrain.py][INFO] 
Epoch:[0/2](385700/4588595) loss:2.260 lr:0.0000100 epoch_Time:26639.0min: [2024-01-04 10:24:25,358][model8_pretrain.py][INFO] Epoch:[0/2](385700/4588595) loss:2.728 lr:0.0000100 epoch_Time:26639.0min: [2024-01-04 10:24:25,358][model8_pretrain.py][INFO] Epoch:[0/2](385700/4588595) loss:2.366 lr:0.0000100 epoch_Time:26639.0min: [2024-01-04 10:24:25,358][model8_pretrain.py][INFO] Epoch:[0/2](385700/4588595) loss:3.001 lr:0.0000100 epoch_Time:26639.0min: [2024-01-04 10:24:25,358][model8_pretrain.py][INFO] Epoch:[0/2](385700/4588595) loss:2.970 lr:0.0000100 epoch_Time:26639.0min: [2024-01-04 10:24:25,358][model8_pretrain.py][INFO] Epoch:[0/2](385700/4588595) loss:2.940 lr:0.0000100 epoch_Time:26639.0min: [2024-01-04 10:24:25,358][model8_pretrain.py][INFO] Epoch:[0/2](385700/4588595) loss:2.625 lr:0.0000100 epoch_Time:26639.0min: [2024-01-04 10:24:25,359][model8_pretrain.py][INFO] Epoch:[0/2](385700/4588595) loss:2.842 lr:0.0000100 epoch_Time:26639.0min: [2024-01-04 10:25:02,289][model8_pretrain.py][INFO] Epoch:[0/2](385800/4588595) loss:2.093 lr:0.0000100 epoch_Time:26637.0min: [2024-01-04 10:25:02,289][model8_pretrain.py][INFO] Epoch:[0/2](385800/4588595) loss:3.052 lr:0.0000100 epoch_Time:26637.0min: [2024-01-04 10:25:02,290][model8_pretrain.py][INFO] Epoch:[0/2](385800/4588595) loss:2.764 lr:0.0000100 epoch_Time:26637.0min: [2024-01-04 10:25:02,290][model8_pretrain.py][INFO] Epoch:[0/2](385800/4588595) loss:3.025 lr:0.0000100 epoch_Time:26637.0min: [2024-01-04 10:25:02,290][model8_pretrain.py][INFO] Epoch:[0/2](385800/4588595) loss:2.633 lr:0.0000100 epoch_Time:26637.0min: [2024-01-04 10:25:02,290][model8_pretrain.py][INFO] Epoch:[0/2](385800/4588595) loss:2.588 lr:0.0000100 epoch_Time:26637.0min: [2024-01-04 10:25:02,290][model8_pretrain.py][INFO] Epoch:[0/2](385800/4588595) loss:2.391 lr:0.0000100 epoch_Time:26637.0min: [2024-01-04 10:25:02,290][model8_pretrain.py][INFO] Epoch:[0/2](385800/4588595) loss:3.161 lr:0.0000100 epoch_Time:26637.0min: [2024-01-04 
10:25:39,221][model8_pretrain.py][INFO] Epoch:[0/2](385900/4588595) loss:3.101 lr:0.0000100 epoch_Time:26637.0min: [2024-01-04 10:25:39,221][model8_pretrain.py][INFO] Epoch:[0/2](385900/4588595) loss:3.312 lr:0.0000100 epoch_Time:26637.0min: [2024-01-04 10:25:39,221][model8_pretrain.py][INFO] Epoch:[0/2](385900/4588595) loss:2.559 lr:0.0000100 epoch_Time:26637.0min: [2024-01-04 10:25:39,221][model8_pretrain.py][INFO] Epoch:[0/2](385900/4588595) loss:2.920 lr:0.0000100 epoch_Time:26637.0min: [2024-01-04 10:25:39,221][model8_pretrain.py][INFO] Epoch:[0/2](385900/4588595) loss:2.465 lr:0.0000100 epoch_Time:26637.0min: [2024-01-04 10:25:39,221][model8_pretrain.py][INFO] Epoch:[0/2](385900/4588595) loss:3.073 lr:0.0000100 epoch_Time:26637.0min: [2024-01-04 10:25:39,221][model8_pretrain.py][INFO] Epoch:[0/2](385900/4588595) loss:2.774 lr:0.0000100 epoch_Time:26637.0min: [2024-01-04 10:25:39,222][model8_pretrain.py][INFO] Epoch:[0/2](385900/4588595) loss:2.743 lr:0.0000100 epoch_Time:26637.0min: [2024-01-04 10:26:16,153][model8_pretrain.py][INFO] Epoch:[0/2](386000/4588595) loss:2.585 lr:0.0000100 epoch_Time:26636.0min: [2024-01-04 10:26:16,153][model8_pretrain.py][INFO] Epoch:[0/2](386000/4588595) loss:2.944 lr:0.0000100 epoch_Time:26636.0min: [2024-01-04 10:26:16,153][model8_pretrain.py][INFO] Epoch:[0/2](386000/4588595) loss:2.275 lr:0.0000100 epoch_Time:26636.0min: [2024-01-04 10:26:16,153][model8_pretrain.py][INFO] Epoch:[0/2](386000/4588595) loss:2.871 lr:0.0000100 epoch_Time:26636.0min: [2024-01-04 10:26:16,153][model8_pretrain.py][INFO] Epoch:[0/2](386000/4588595) loss:2.596 lr:0.0000100 epoch_Time:26636.0min: [2024-01-04 10:26:16,153][model8_pretrain.py][INFO] Epoch:[0/2](386000/4588595) loss:3.437 lr:0.0000100 epoch_Time:26636.0min: [2024-01-04 10:26:16,153][model8_pretrain.py][INFO] Epoch:[0/2](386000/4588595) loss:2.839 lr:0.0000100 epoch_Time:26636.0min: [2024-01-04 10:26:16,154][model8_pretrain.py][INFO] Epoch:[0/2](386000/4588595) loss:2.840 lr:0.0000100 
epoch_Time:26636.0min: [2024-01-04 10:26:53,079][model8_pretrain.py][INFO] Epoch:[0/2](386100/4588595) loss:3.280 lr:0.0000100 epoch_Time:26635.0min: [2024-01-04 10:26:53,079][model8_pretrain.py][INFO] Epoch:[0/2](386100/4588595) loss:3.346 lr:0.0000100 epoch_Time:26635.0min: [2024-01-04 10:26:53,079][model8_pretrain.py][INFO] Epoch:[0/2](386100/4588595) loss:3.057 lr:0.0000100 epoch_Time:26635.0min: [2024-01-04 10:26:53,079][model8_pretrain.py][INFO] Epoch:[0/2](386100/4588595) loss:2.363 lr:0.0000100 epoch_Time:26635.0min: [2024-01-04 10:26:53,079][model8_pretrain.py][INFO] Epoch:[0/2](386100/4588595) loss:2.868 lr:0.0000100 epoch_Time:26635.0min: [2024-01-04 10:26:53,079][model8_pretrain.py][INFO] Epoch:[0/2](386100/4588595) loss:2.587 lr:0.0000100 epoch_Time:26635.0min: [2024-01-04 10:26:53,079][model8_pretrain.py][INFO] Epoch:[0/2](386100/4588595) loss:2.825 lr:0.0000100 epoch_Time:26635.0min: [2024-01-04 10:26:53,079][model8_pretrain.py][INFO] Epoch:[0/2](386100/4588595) loss:3.110 lr:0.0000100 epoch_Time:26635.0min: [2024-01-04 10:27:30,001][model8_pretrain.py][INFO] Epoch:[0/2](386200/4588595) loss:3.090 lr:0.0000100 epoch_Time:26635.0min: [2024-01-04 10:27:30,001][model8_pretrain.py][INFO] Epoch:[0/2](386200/4588595) loss:2.703 lr:0.0000100 epoch_Time:26635.0min: [2024-01-04 10:27:30,002][model8_pretrain.py][INFO] Epoch:[0/2](386200/4588595) loss:3.545 lr:0.0000100 epoch_Time:26635.0min: [2024-01-04 10:27:30,002][model8_pretrain.py][INFO] Epoch:[0/2](386200/4588595) loss:2.895 lr:0.0000100 epoch_Time:26635.0min: [2024-01-04 10:27:30,002][model8_pretrain.py][INFO] Epoch:[0/2](386200/4588595) loss:3.156 lr:0.0000100 epoch_Time:26635.0min: [2024-01-04 10:27:30,002][model8_pretrain.py][INFO] Epoch:[0/2](386200/4588595) loss:3.376 lr:0.0000100 epoch_Time:26635.0min: [2024-01-04 10:27:30,002][model8_pretrain.py][INFO] Epoch:[0/2](386200/4588595) loss:3.103 lr:0.0000100 epoch_Time:26635.0min: [2024-01-04 10:27:30,002][model8_pretrain.py][INFO] 
Epoch:[0/2](386200/4588595) loss:1.940 lr:0.0000100 epoch_Time:26635.0min: [2024-01-04 10:28:17,070][model8_pretrain.py][INFO] Epoch:[0/2](386300/4588595) loss:2.301 lr:0.0000100 epoch_Time:26635.0min: [2024-01-04 10:28:17,070][model8_pretrain.py][INFO] Epoch:[0/2](386300/4588595) loss:3.278 lr:0.0000100 epoch_Time:26635.0min: [2024-01-04 10:28:17,070][model8_pretrain.py][INFO] Epoch:[0/2](386300/4588595) loss:2.120 lr:0.0000100 epoch_Time:26635.0min: [2024-01-04 10:28:17,070][model8_pretrain.py][INFO] Epoch:[0/2](386300/4588595) loss:3.426 lr:0.0000100 epoch_Time:26635.0min: [2024-01-04 10:28:17,070][model8_pretrain.py][INFO] Epoch:[0/2](386300/4588595) loss:2.744 lr:0.0000100 epoch_Time:26635.0min: [2024-01-04 10:28:17,070][model8_pretrain.py][INFO] Epoch:[0/2](386300/4588595) loss:2.719 lr:0.0000100 epoch_Time:26635.0min: [2024-01-04 10:28:17,070][model8_pretrain.py][INFO] Epoch:[0/2](386300/4588595) loss:2.844 lr:0.0000100 epoch_Time:26635.0min: [2024-01-04 10:28:17,070][model8_pretrain.py][INFO] Epoch:[0/2](386300/4588595) loss:3.126 lr:0.0000100 epoch_Time:26635.0min: [2024-01-04 10:28:53,996][model8_pretrain.py][INFO] Epoch:[0/2](386400/4588595) loss:2.896 lr:0.0000100 epoch_Time:26634.0min: [2024-01-04 10:28:53,996][model8_pretrain.py][INFO] Epoch:[0/2](386400/4588595) loss:2.486 lr:0.0000100 epoch_Time:26634.0min: [2024-01-04 10:28:53,996][model8_pretrain.py][INFO] Epoch:[0/2](386400/4588595) loss:2.639 lr:0.0000100 epoch_Time:26634.0min: [2024-01-04 10:28:53,996][model8_pretrain.py][INFO] Epoch:[0/2](386400/4588595) loss:2.272 lr:0.0000100 epoch_Time:26634.0min: [2024-01-04 10:28:53,996][model8_pretrain.py][INFO] Epoch:[0/2](386400/4588595) loss:3.003 lr:0.0000100 epoch_Time:26634.0min: [2024-01-04 10:28:53,996][model8_pretrain.py][INFO] Epoch:[0/2](386400/4588595) loss:2.943 lr:0.0000100 epoch_Time:26634.0min: [2024-01-04 10:28:53,996][model8_pretrain.py][INFO] Epoch:[0/2](386400/4588595) loss:2.781 lr:0.0000100 epoch_Time:26634.0min: [2024-01-04 
10:28:53,997][model8_pretrain.py][INFO] Epoch:[0/2](386400/4588595) loss:2.792 lr:0.0000100 epoch_Time:26634.0min: [2024-01-04 10:29:30,914][model8_pretrain.py][INFO] Epoch:[0/2](386500/4588595) loss:3.048 lr:0.0000100 epoch_Time:26634.0min: [2024-01-04 10:29:30,914][model8_pretrain.py][INFO] Epoch:[0/2](386500/4588595) loss:2.814 lr:0.0000100 epoch_Time:26634.0min: [2024-01-04 10:29:30,914][model8_pretrain.py][INFO] Epoch:[0/2](386500/4588595) loss:3.171 lr:0.0000100 epoch_Time:26634.0min: [2024-01-04 10:29:30,914][model8_pretrain.py][INFO] Epoch:[0/2](386500/4588595) loss:2.807 lr:0.0000100 epoch_Time:26634.0min: [2024-01-04 10:29:30,914][model8_pretrain.py][INFO] Epoch:[0/2](386500/4588595) loss:3.527 lr:0.0000100 epoch_Time:26634.0min: [2024-01-04 10:29:30,914][model8_pretrain.py][INFO] Epoch:[0/2](386500/4588595) loss:3.085 lr:0.0000100 epoch_Time:26634.0min: [2024-01-04 10:29:30,914][model8_pretrain.py][INFO] Epoch:[0/2](386500/4588595) loss:3.114 lr:0.0000100 epoch_Time:26634.0min: [2024-01-04 10:29:30,914][model8_pretrain.py][INFO] Epoch:[0/2](386500/4588595) loss:2.697 lr:0.0000100 epoch_Time:26634.0min: [2024-01-04 10:30:07,857][model8_pretrain.py][INFO] Epoch:[0/2](386600/4588595) loss:2.907 lr:0.0000100 epoch_Time:26633.0min: [2024-01-04 10:30:07,857][model8_pretrain.py][INFO] Epoch:[0/2](386600/4588595) loss:2.993 lr:0.0000100 epoch_Time:26633.0min: [2024-01-04 10:30:07,857][model8_pretrain.py][INFO] Epoch:[0/2](386600/4588595) loss:2.587 lr:0.0000100 epoch_Time:26633.0min: [2024-01-04 10:30:07,857][model8_pretrain.py][INFO] Epoch:[0/2](386600/4588595) loss:3.166 lr:0.0000100 epoch_Time:26633.0min: [2024-01-04 10:30:07,857][model8_pretrain.py][INFO] Epoch:[0/2](386600/4588595) loss:2.663 lr:0.0000100 epoch_Time:26633.0min: [2024-01-04 10:30:07,857][model8_pretrain.py][INFO] Epoch:[0/2](386600/4588595) loss:2.590 lr:0.0000100 epoch_Time:26633.0min: [2024-01-04 10:30:07,857][model8_pretrain.py][INFO] Epoch:[0/2](386600/4588595) loss:3.100 lr:0.0000100 
epoch_Time:26633.0min: [2024-01-04 10:30:07,857][model8_pretrain.py][INFO] Epoch:[0/2](386600/4588595) loss:2.678 lr:0.0000100 epoch_Time:26633.0min: [2024-01-04 10:30:44,796][model8_pretrain.py][INFO] Epoch:[0/2](386700/4588595) loss:3.073 lr:0.0000100 epoch_Time:26633.0min: [2024-01-04 10:30:44,797][model8_pretrain.py][INFO] Epoch:[0/2](386700/4588595) loss:2.966 lr:0.0000100 epoch_Time:26633.0min: [2024-01-04 10:30:44,797][model8_pretrain.py][INFO] Epoch:[0/2](386700/4588595) loss:3.071 lr:0.0000100 epoch_Time:26633.0min: [2024-01-04 10:30:44,797][model8_pretrain.py][INFO] Epoch:[0/2](386700/4588595) loss:2.736 lr:0.0000100 epoch_Time:26633.0min: [2024-01-04 10:30:44,797][model8_pretrain.py][INFO] Epoch:[0/2](386700/4588595) loss:2.412 lr:0.0000100 epoch_Time:26633.0min: [2024-01-04 10:30:44,797][model8_pretrain.py][INFO] Epoch:[0/2](386700/4588595) loss:2.763 lr:0.0000100 epoch_Time:26633.0min: [2024-01-04 10:30:44,797][model8_pretrain.py][INFO] Epoch:[0/2](386700/4588595) loss:2.566 lr:0.0000100 epoch_Time:26633.0min: [2024-01-04 10:30:44,797][model8_pretrain.py][INFO] Epoch:[0/2](386700/4588595) loss:3.117 lr:0.0000100 epoch_Time:26633.0min: [2024-01-04 10:31:21,753][model8_pretrain.py][INFO] Epoch:[0/2](386800/4588595) loss:2.655 lr:0.0000100 epoch_Time:26631.0min: [2024-01-04 10:31:21,753][model8_pretrain.py][INFO] Epoch:[0/2](386800/4588595) loss:2.265 lr:0.0000100 epoch_Time:26631.0min: [2024-01-04 10:31:21,753][model8_pretrain.py][INFO] Epoch:[0/2](386800/4588595) loss:2.651 lr:0.0000100 epoch_Time:26631.0min: [2024-01-04 10:31:21,753][model8_pretrain.py][INFO] Epoch:[0/2](386800/4588595) loss:2.928 lr:0.0000100 epoch_Time:26631.0min: [2024-01-04 10:31:21,753][model8_pretrain.py][INFO] Epoch:[0/2](386800/4588595) loss:3.588 lr:0.0000100 epoch_Time:26631.0min: [2024-01-04 10:31:21,753][model8_pretrain.py][INFO] Epoch:[0/2](386800/4588595) loss:3.109 lr:0.0000100 epoch_Time:26631.0min: [2024-01-04 10:31:21,754][model8_pretrain.py][INFO] 
Epoch:[0/2](386800/4588595) loss:2.564 lr:0.0000100 epoch_Time:26631.0min: [2024-01-04 10:31:21,754][model8_pretrain.py][INFO] Epoch:[0/2](386800/4588595) loss:3.076 lr:0.0000100 epoch_Time:26631.0min: [2024-01-04 10:31:58,701][model8_pretrain.py][INFO] Epoch:[0/2](386900/4588595) loss:2.734 lr:0.0000100 epoch_Time:26630.0min: [2024-01-04 10:31:58,702][model8_pretrain.py][INFO] Epoch:[0/2](386900/4588595) loss:3.188 lr:0.0000100 epoch_Time:26630.0min: [2024-01-04 10:31:58,701][model8_pretrain.py][INFO] Epoch:[0/2](386900/4588595) loss:3.282 lr:0.0000100 epoch_Time:26630.0min: [2024-01-04 10:31:58,702][model8_pretrain.py][INFO] Epoch:[0/2](386900/4588595) loss:2.890 lr:0.0000100 epoch_Time:26630.0min: [2024-01-04 10:31:58,702][model8_pretrain.py][INFO] Epoch:[0/2](386900/4588595) loss:3.343 lr:0.0000100 epoch_Time:26630.0min: [2024-01-04 10:31:58,702][model8_pretrain.py][INFO] Epoch:[0/2](386900/4588595) loss:3.361 lr:0.0000100 epoch_Time:26630.0min: [2024-01-04 10:31:58,702][model8_pretrain.py][INFO] Epoch:[0/2](386900/4588595) loss:3.039 lr:0.0000100 epoch_Time:26630.0min: [2024-01-04 10:31:58,701][model8_pretrain.py][INFO] Epoch:[0/2](386900/4588595) loss:2.781 lr:0.0000100 epoch_Time:26630.0min: [2024-01-04 10:32:35,642][model8_pretrain.py][INFO] Epoch:[0/2](387000/4588595) loss:2.986 lr:0.0000100 epoch_Time:26630.0min: [2024-01-04 10:32:35,642][model8_pretrain.py][INFO] Epoch:[0/2](387000/4588595) loss:2.756 lr:0.0000100 epoch_Time:26630.0min: [2024-01-04 10:32:35,642][model8_pretrain.py][INFO] Epoch:[0/2](387000/4588595) loss:2.781 lr:0.0000100 epoch_Time:26630.0min: [2024-01-04 10:32:35,642][model8_pretrain.py][INFO] Epoch:[0/2](387000/4588595) loss:2.437 lr:0.0000100 epoch_Time:26630.0min: [2024-01-04 10:32:35,642][model8_pretrain.py][INFO] Epoch:[0/2](387000/4588595) loss:3.062 lr:0.0000100 epoch_Time:26630.0min: [2024-01-04 10:32:35,642][model8_pretrain.py][INFO] Epoch:[0/2](387000/4588595) loss:3.168 lr:0.0000100 epoch_Time:26630.0min: [2024-01-04 
10:32:35,642][model8_pretrain.py][INFO] Epoch:[0/2](387000/4588595) loss:3.007 lr:0.0000100 epoch_Time:26630.0min: [2024-01-04 10:32:35,642][model8_pretrain.py][INFO] Epoch:[0/2](387000/4588595) loss:3.395 lr:0.0000100 epoch_Time:26630.0min: [2024-01-04 10:33:23,247][model8_pretrain.py][INFO] Epoch:[0/2](387100/4588595) loss:2.857 lr:0.0000100 epoch_Time:26631.0min: [2024-01-04 10:33:23,247][model8_pretrain.py][INFO] Epoch:[0/2](387100/4588595) loss:3.186 lr:0.0000100 epoch_Time:26631.0min: [2024-01-04 10:33:23,247][model8_pretrain.py][INFO] Epoch:[0/2](387100/4588595) loss:2.845 lr:0.0000100 epoch_Time:26631.0min: [2024-01-04 10:33:23,247][model8_pretrain.py][INFO] Epoch:[0/2](387100/4588595) loss:2.956 lr:0.0000100 epoch_Time:26631.0min: [2024-01-04 10:33:23,247][model8_pretrain.py][INFO] Epoch:[0/2](387100/4588595) loss:3.238 lr:0.0000100 epoch_Time:26631.0min: [2024-01-04 10:33:23,247][model8_pretrain.py][INFO] Epoch:[0/2](387100/4588595) loss:3.004 lr:0.0000100 epoch_Time:26631.0min: [2024-01-04 10:33:23,247][model8_pretrain.py][INFO] Epoch:[0/2](387100/4588595) loss:3.099 lr:0.0000100 epoch_Time:26631.0min: [2024-01-04 10:33:23,247][model8_pretrain.py][INFO] Epoch:[0/2](387100/4588595) loss:2.620 lr:0.0000100 epoch_Time:26631.0min: [2024-01-04 10:34:00,126][model8_pretrain.py][INFO] Epoch:[0/2](387200/4588595) loss:3.024 lr:0.0000100 epoch_Time:26630.0min: [2024-01-04 10:34:00,126][model8_pretrain.py][INFO] Epoch:[0/2](387200/4588595) loss:3.445 lr:0.0000100 epoch_Time:26630.0min: [2024-01-04 10:34:00,126][model8_pretrain.py][INFO] Epoch:[0/2](387200/4588595) loss:2.147 lr:0.0000100 epoch_Time:26630.0min: [2024-01-04 10:34:00,126][model8_pretrain.py][INFO] Epoch:[0/2](387200/4588595) loss:2.842 lr:0.0000100 epoch_Time:26630.0min: [2024-01-04 10:34:00,127][model8_pretrain.py][INFO] Epoch:[0/2](387200/4588595) loss:2.523 lr:0.0000100 epoch_Time:26630.0min: [2024-01-04 10:34:00,127][model8_pretrain.py][INFO] Epoch:[0/2](387200/4588595) loss:3.196 lr:0.0000100 
epoch_Time:26630.0min: [2024-01-04 10:34:00,127][model8_pretrain.py][INFO] Epoch:[0/2](387200/4588595) loss:2.888 lr:0.0000100 epoch_Time:26630.0min: [2024-01-04 10:34:00,127][model8_pretrain.py][INFO] Epoch:[0/2](387200/4588595) loss:2.560 lr:0.0000100 epoch_Time:26630.0min: [2024-01-04 10:34:37,047][model8_pretrain.py][INFO] Epoch:[0/2](387300/4588595) loss:2.882 lr:0.0000100 epoch_Time:26629.0min: [2024-01-04 10:34:37,047][model8_pretrain.py][INFO] Epoch:[0/2](387300/4588595) loss:3.332 lr:0.0000100 epoch_Time:26629.0min: [2024-01-04 10:34:37,047][model8_pretrain.py][INFO] Epoch:[0/2](387300/4588595) loss:3.104 lr:0.0000100 epoch_Time:26629.0min: [2024-01-04 10:34:37,047][model8_pretrain.py][INFO] Epoch:[0/2](387300/4588595) loss:2.949 lr:0.0000100 epoch_Time:26629.0min: [2024-01-04 10:34:37,047][model8_pretrain.py][INFO] Epoch:[0/2](387300/4588595) loss:2.635 lr:0.0000100 epoch_Time:26629.0min: [2024-01-04 10:34:37,047][model8_pretrain.py][INFO] Epoch:[0/2](387300/4588595) loss:3.328 lr:0.0000100 epoch_Time:26629.0min: [2024-01-04 10:34:37,047][model8_pretrain.py][INFO] Epoch:[0/2](387300/4588595) loss:2.965 lr:0.0000100 epoch_Time:26629.0min: [2024-01-04 10:34:37,047][model8_pretrain.py][INFO] Epoch:[0/2](387300/4588595) loss:3.435 lr:0.0000100 epoch_Time:26629.0min: [2024-01-04 10:35:13,975][model8_pretrain.py][INFO] Epoch:[0/2](387400/4588595) loss:3.129 lr:0.0000100 epoch_Time:26628.0min: [2024-01-04 10:35:13,975][model8_pretrain.py][INFO] Epoch:[0/2](387400/4588595) loss:2.453 lr:0.0000100 epoch_Time:26628.0min: [2024-01-04 10:35:13,975][model8_pretrain.py][INFO] Epoch:[0/2](387400/4588595) loss:2.911 lr:0.0000100 epoch_Time:26628.0min: [2024-01-04 10:35:13,975][model8_pretrain.py][INFO] Epoch:[0/2](387400/4588595) loss:3.060 lr:0.0000100 epoch_Time:26628.0min: [2024-01-04 10:35:13,975][model8_pretrain.py][INFO] Epoch:[0/2](387400/4588595) loss:3.169 lr:0.0000100 epoch_Time:26628.0min: [2024-01-04 10:35:13,975][model8_pretrain.py][INFO] 
Epoch:[0/2](387400/4588595) loss:2.513 lr:0.0000100 epoch_Time:26628.0min: [2024-01-04 10:35:13,975][model8_pretrain.py][INFO] Epoch:[0/2](387400/4588595) loss:3.094 lr:0.0000100 epoch_Time:26628.0min: [2024-01-04 10:35:13,975][model8_pretrain.py][INFO] Epoch:[0/2](387400/4588595) loss:2.787 lr:0.0000100 epoch_Time:26628.0min: [2024-01-04 10:35:50,890][model8_pretrain.py][INFO] Epoch:[0/2](387500/4588595) loss:2.902 lr:0.0000100 epoch_Time:26627.0min: [2024-01-04 10:35:50,890][model8_pretrain.py][INFO] Epoch:[0/2](387500/4588595) loss:3.068 lr:0.0000100 epoch_Time:26627.0min: [2024-01-04 10:35:50,890][model8_pretrain.py][INFO] Epoch:[0/2](387500/4588595) loss:3.357 lr:0.0000100 epoch_Time:26627.0min: [2024-01-04 10:35:50,890][model8_pretrain.py][INFO] Epoch:[0/2](387500/4588595) loss:2.823 lr:0.0000100 epoch_Time:26627.0min: [2024-01-04 10:35:50,890][model8_pretrain.py][INFO] Epoch:[0/2](387500/4588595) loss:2.702 lr:0.0000100 epoch_Time:26627.0min: [2024-01-04 10:35:50,890][model8_pretrain.py][INFO] Epoch:[0/2](387500/4588595) loss:3.175 lr:0.0000100 epoch_Time:26627.0min: [2024-01-04 10:35:50,890][model8_pretrain.py][INFO] Epoch:[0/2](387500/4588595) loss:2.803 lr:0.0000100 epoch_Time:26627.0min: [2024-01-04 10:35:50,890][model8_pretrain.py][INFO] Epoch:[0/2](387500/4588595) loss:2.822 lr:0.0000100 epoch_Time:26627.0min: [2024-01-04 10:36:27,805][model8_pretrain.py][INFO] Epoch:[0/2](387600/4588595) loss:2.938 lr:0.0000100 epoch_Time:26627.0min: [2024-01-04 10:36:27,805][model8_pretrain.py][INFO] Epoch:[0/2](387600/4588595) loss:3.063 lr:0.0000100 epoch_Time:26627.0min: [2024-01-04 10:36:27,805][model8_pretrain.py][INFO] Epoch:[0/2](387600/4588595) loss:2.764 lr:0.0000100 epoch_Time:26627.0min: [2024-01-04 10:36:27,805][model8_pretrain.py][INFO] Epoch:[0/2](387600/4588595) loss:2.993 lr:0.0000100 epoch_Time:26627.0min: [2024-01-04 10:36:27,805][model8_pretrain.py][INFO] Epoch:[0/2](387600/4588595) loss:2.912 lr:0.0000100 epoch_Time:26627.0min: [2024-01-04 
10:36:27,805][model8_pretrain.py][INFO] Epoch:[0/2](387600/4588595) loss:2.786 lr:0.0000100 epoch_Time:26627.0min: [2024-01-04 10:36:27,806][model8_pretrain.py][INFO] Epoch:[0/2](387600/4588595) loss:3.268 lr:0.0000100 epoch_Time:26627.0min: [2024-01-04 10:36:27,806][model8_pretrain.py][INFO] Epoch:[0/2](387600/4588595) loss:2.809 lr:0.0000100 epoch_Time:26627.0min: [2024-01-04 10:37:04,728][model8_pretrain.py][INFO] Epoch:[0/2](387700/4588595) loss:3.598 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:37:04,728][model8_pretrain.py][INFO] Epoch:[0/2](387700/4588595) loss:2.337 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:37:04,728][model8_pretrain.py][INFO] Epoch:[0/2](387700/4588595) loss:3.103 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:37:04,728][model8_pretrain.py][INFO] Epoch:[0/2](387700/4588595) loss:2.906 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:37:04,728][model8_pretrain.py][INFO] Epoch:[0/2](387700/4588595) loss:2.514 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:37:04,728][model8_pretrain.py][INFO] Epoch:[0/2](387700/4588595) loss:2.875 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:37:04,729][model8_pretrain.py][INFO] Epoch:[0/2](387700/4588595) loss:2.871 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:37:04,729][model8_pretrain.py][INFO] Epoch:[0/2](387700/4588595) loss:2.608 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:37:41,660][model8_pretrain.py][INFO] Epoch:[0/2](387800/4588595) loss:2.847 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:37:41,660][model8_pretrain.py][INFO] Epoch:[0/2](387800/4588595) loss:3.134 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:37:41,660][model8_pretrain.py][INFO] Epoch:[0/2](387800/4588595) loss:2.296 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:37:41,660][model8_pretrain.py][INFO] Epoch:[0/2](387800/4588595) loss:2.988 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:37:41,660][model8_pretrain.py][INFO] Epoch:[0/2](387800/4588595) loss:2.726 lr:0.0000100 
epoch_Time:26625.0min: [2024-01-04 10:37:41,660][model8_pretrain.py][INFO] Epoch:[0/2](387800/4588595) loss:3.132 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:37:41,660][model8_pretrain.py][INFO] Epoch:[0/2](387800/4588595) loss:3.035 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:37:41,660][model8_pretrain.py][INFO] Epoch:[0/2](387800/4588595) loss:2.098 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:38:27,367][model8_pretrain.py][INFO] Epoch:[0/2](387900/4588595) loss:2.725 lr:0.0000100 epoch_Time:26626.0min: [2024-01-04 10:38:27,367][model8_pretrain.py][INFO] Epoch:[0/2](387900/4588595) loss:3.471 lr:0.0000100 epoch_Time:26626.0min: [2024-01-04 10:38:27,367][model8_pretrain.py][INFO] Epoch:[0/2](387900/4588595) loss:2.569 lr:0.0000100 epoch_Time:26626.0min: [2024-01-04 10:38:27,367][model8_pretrain.py][INFO] Epoch:[0/2](387900/4588595) loss:3.305 lr:0.0000100 epoch_Time:26626.0min: [2024-01-04 10:38:27,367][model8_pretrain.py][INFO] Epoch:[0/2](387900/4588595) loss:3.091 lr:0.0000100 epoch_Time:26626.0min: [2024-01-04 10:38:27,367][model8_pretrain.py][INFO] Epoch:[0/2](387900/4588595) loss:2.725 lr:0.0000100 epoch_Time:26626.0min: [2024-01-04 10:38:27,367][model8_pretrain.py][INFO] Epoch:[0/2](387900/4588595) loss:3.021 lr:0.0000100 epoch_Time:26626.0min: [2024-01-04 10:38:27,367][model8_pretrain.py][INFO] Epoch:[0/2](387900/4588595) loss:2.679 lr:0.0000100 epoch_Time:26626.0min: [2024-01-04 10:39:06,013][model8_pretrain.py][INFO] Epoch:[0/2](388000/4588595) loss:3.140 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:39:06,013][model8_pretrain.py][INFO] Epoch:[0/2](388000/4588595) loss:2.944 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:39:06,013][model8_pretrain.py][INFO] Epoch:[0/2](388000/4588595) loss:3.385 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:39:06,013][model8_pretrain.py][INFO] Epoch:[0/2](388000/4588595) loss:3.296 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:39:06,013][model8_pretrain.py][INFO] 
Epoch:[0/2](388000/4588595) loss:3.234 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:39:06,013][model8_pretrain.py][INFO] Epoch:[0/2](388000/4588595) loss:3.269 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:39:06,013][model8_pretrain.py][INFO] Epoch:[0/2](388000/4588595) loss:2.406 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:39:06,014][model8_pretrain.py][INFO] Epoch:[0/2](388000/4588595) loss:2.979 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:39:42,941][model8_pretrain.py][INFO] Epoch:[0/2](388100/4588595) loss:3.223 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:39:42,941][model8_pretrain.py][INFO] Epoch:[0/2](388100/4588595) loss:2.585 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:39:42,941][model8_pretrain.py][INFO] Epoch:[0/2](388100/4588595) loss:3.023 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:39:42,941][model8_pretrain.py][INFO] Epoch:[0/2](388100/4588595) loss:2.493 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:39:42,941][model8_pretrain.py][INFO] Epoch:[0/2](388100/4588595) loss:2.888 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:39:42,941][model8_pretrain.py][INFO] Epoch:[0/2](388100/4588595) loss:2.872 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:39:42,941][model8_pretrain.py][INFO] Epoch:[0/2](388100/4588595) loss:2.700 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:39:42,941][model8_pretrain.py][INFO] Epoch:[0/2](388100/4588595) loss:2.712 lr:0.0000100 epoch_Time:26625.0min: [2024-01-04 10:40:19,866][model8_pretrain.py][INFO] Epoch:[0/2](388200/4588595) loss:3.109 lr:0.0000100 epoch_Time:26623.0min: [2024-01-04 10:40:19,866][model8_pretrain.py][INFO] Epoch:[0/2](388200/4588595) loss:2.726 lr:0.0000100 epoch_Time:26623.0min: [2024-01-04 10:40:19,866][model8_pretrain.py][INFO] Epoch:[0/2](388200/4588595) loss:2.694 lr:0.0000100 epoch_Time:26623.0min: [2024-01-04 10:40:19,866][model8_pretrain.py][INFO] Epoch:[0/2](388200/4588595) loss:2.561 lr:0.0000100 epoch_Time:26623.0min: [2024-01-04 
10:40:19,866][model8_pretrain.py][INFO] Epoch:[0/2](388200/4588595) loss:2.832 lr:0.0000100 epoch_Time:26623.0min: [2024-01-04 10:40:19,866][model8_pretrain.py][INFO] Epoch:[0/2](388200/4588595) loss:2.870 lr:0.0000100 epoch_Time:26623.0min: [2024-01-04 10:40:19,866][model8_pretrain.py][INFO] Epoch:[0/2](388200/4588595) loss:2.838 lr:0.0000100 epoch_Time:26623.0min: [2024-01-04 10:40:19,866][model8_pretrain.py][INFO] Epoch:[0/2](388200/4588595) loss:2.903 lr:0.0000100 epoch_Time:26623.0min: [2024-01-04 10:40:56,787][model8_pretrain.py][INFO] Epoch:[0/2](388300/4588595) loss:2.660 lr:0.0000100 epoch_Time:26622.0min: [2024-01-04 10:40:56,787][model8_pretrain.py][INFO] Epoch:[0/2](388300/4588595) loss:3.166 lr:0.0000100 epoch_Time:26622.0min: [2024-01-04 10:40:56,787][model8_pretrain.py][INFO] Epoch:[0/2](388300/4588595) loss:3.069 lr:0.0000100 epoch_Time:26622.0min: [2024-01-04 10:40:56,787][model8_pretrain.py][INFO] Epoch:[0/2](388300/4588595) loss:2.871 lr:0.0000100 epoch_Time:26622.0min: [2024-01-04 10:40:56,787][model8_pretrain.py][INFO] Epoch:[0/2](388300/4588595) loss:3.036 lr:0.0000100 epoch_Time:26622.0min: [2024-01-04 10:40:56,787][model8_pretrain.py][INFO] Epoch:[0/2](388300/4588595) loss:2.485 lr:0.0000100 epoch_Time:26622.0min: [2024-01-04 10:40:56,788][model8_pretrain.py][INFO] Epoch:[0/2](388300/4588595) loss:3.364 lr:0.0000100 epoch_Time:26622.0min: [2024-01-04 10:40:56,788][model8_pretrain.py][INFO] Epoch:[0/2](388300/4588595) loss:2.983 lr:0.0000100 epoch_Time:26622.0min: [2024-01-04 10:41:33,712][model8_pretrain.py][INFO] Epoch:[0/2](388400/4588595) loss:3.188 lr:0.0000100 epoch_Time:26622.0min: [2024-01-04 10:41:33,712][model8_pretrain.py][INFO] Epoch:[0/2](388400/4588595) loss:2.672 lr:0.0000100 epoch_Time:26622.0min: [2024-01-04 10:41:33,712][model8_pretrain.py][INFO] Epoch:[0/2](388400/4588595) loss:2.589 lr:0.0000100 epoch_Time:26622.0min: [2024-01-04 10:41:33,712][model8_pretrain.py][INFO] Epoch:[0/2](388400/4588595) loss:2.760 lr:0.0000100 
epoch_Time:26622.0min: [2024-01-04 10:41:33,712][model8_pretrain.py][INFO] Epoch:[0/2](388400/4588595) loss:3.036 lr:0.0000100 epoch_Time:26622.0min: [2024-01-04 10:41:33,712][model8_pretrain.py][INFO] Epoch:[0/2](388400/4588595) loss:2.634 lr:0.0000100 epoch_Time:26622.0min: [2024-01-04 10:41:33,713][model8_pretrain.py][INFO] Epoch:[0/2](388400/4588595) loss:3.060 lr:0.0000100 epoch_Time:26622.0min: [2024-01-04 10:41:33,713][model8_pretrain.py][INFO] Epoch:[0/2](388400/4588595) loss:2.915 lr:0.0000100 epoch_Time:26622.0min: [2024-01-04 10:42:10,649][model8_pretrain.py][INFO] Epoch:[0/2](388500/4588595) loss:2.737 lr:0.0000100 epoch_Time:26621.0min: [2024-01-04 10:42:10,649][model8_pretrain.py][INFO] Epoch:[0/2](388500/4588595) loss:2.866 lr:0.0000100 epoch_Time:26621.0min: [2024-01-04 10:42:10,649][model8_pretrain.py][INFO] Epoch:[0/2](388500/4588595) loss:3.021 lr:0.0000100 epoch_Time:26621.0min: [2024-01-04 10:42:10,649][model8_pretrain.py][INFO] Epoch:[0/2](388500/4588595) loss:2.238 lr:0.0000100 epoch_Time:26621.0min: [2024-01-04 10:42:10,649][model8_pretrain.py][INFO] Epoch:[0/2](388500/4588595) loss:2.973 lr:0.0000100 epoch_Time:26621.0min: [2024-01-04 10:42:10,649][model8_pretrain.py][INFO] Epoch:[0/2](388500/4588595) loss:2.653 lr:0.0000100 epoch_Time:26621.0min: [2024-01-04 10:42:10,649][model8_pretrain.py][INFO] Epoch:[0/2](388500/4588595) loss:2.585 lr:0.0000100 epoch_Time:26621.0min: [2024-01-04 10:42:10,649][model8_pretrain.py][INFO] Epoch:[0/2](388500/4588595) loss:3.062 lr:0.0000100 epoch_Time:26621.0min: [2024-01-04 10:42:47,580][model8_pretrain.py][INFO] Epoch:[0/2](388600/4588595) loss:2.380 lr:0.0000100 epoch_Time:26621.0min: [2024-01-04 10:42:47,580][model8_pretrain.py][INFO] Epoch:[0/2](388600/4588595) loss:3.017 lr:0.0000100 epoch_Time:26621.0min: [2024-01-04 10:42:47,581][model8_pretrain.py][INFO] Epoch:[0/2](388600/4588595) loss:2.901 lr:0.0000100 epoch_Time:26620.0min: [2024-01-04 10:42:47,581][model8_pretrain.py][INFO] 
Epoch:[0/2](388600/4588595) loss:2.636 lr:0.0000100 epoch_Time:26621.0min: [2024-01-04 10:42:47,580][model8_pretrain.py][INFO] Epoch:[0/2](388600/4588595) loss:2.586 lr:0.0000100 epoch_Time:26621.0min: [2024-01-04 10:42:47,581][model8_pretrain.py][INFO] Epoch:[0/2](388600/4588595) loss:3.105 lr:0.0000100 epoch_Time:26621.0min: [2024-01-04 10:42:47,581][model8_pretrain.py][INFO] Epoch:[0/2](388600/4588595) loss:2.934 lr:0.0000100 epoch_Time:26621.0min: [2024-01-04 10:42:47,581][model8_pretrain.py][INFO] Epoch:[0/2](388600/4588595) loss:3.267 lr:0.0000100 epoch_Time:26621.0min: [2024-01-04 10:43:33,097][model8_pretrain.py][INFO] Epoch:[0/2](388700/4588595) loss:2.614 lr:0.0000100 epoch_Time:26621.0min: [2024-01-04 10:43:33,097][model8_pretrain.py][INFO] Epoch:[0/2](388700/4588595) loss:2.683 lr:0.0000100 epoch_Time:26621.0min: [2024-01-04 10:43:33,097][model8_pretrain.py][INFO] Epoch:[0/2](388700/4588595) loss:3.005 lr:0.0000100 epoch_Time:26621.0min: [2024-01-04 10:43:33,097][model8_pretrain.py][INFO] Epoch:[0/2](388700/4588595) loss:3.269 lr:0.0000100 epoch_Time:26621.0min: [2024-01-04 10:43:33,097][model8_pretrain.py][INFO] Epoch:[0/2](388700/4588595) loss:2.575 lr:0.0000100 epoch_Time:26621.0min: [2024-01-04 10:43:33,098][model8_pretrain.py][INFO] Epoch:[0/2](388700/4588595) loss:2.618 lr:0.0000100 epoch_Time:26621.0min: [2024-01-04 10:43:33,100][model8_pretrain.py][INFO] Epoch:[0/2](388700/4588595) loss:3.111 lr:0.0000100 epoch_Time:26621.0min: [2024-01-04 10:43:33,102][model8_pretrain.py][INFO] Epoch:[0/2](388700/4588595) loss:2.631 lr:0.0000100 epoch_Time:26621.0min: [2024-01-04 10:44:11,694][model8_pretrain.py][INFO] Epoch:[0/2](388800/4588595) loss:2.332 lr:0.0000100 epoch_Time:26620.0min: [2024-01-04 10:44:11,694][model8_pretrain.py][INFO] Epoch:[0/2](388800/4588595) loss:3.010 lr:0.0000100 epoch_Time:26620.0min: [2024-01-04 10:44:11,694][model8_pretrain.py][INFO] Epoch:[0/2](388800/4588595) loss:2.887 lr:0.0000100 epoch_Time:26620.0min: [2024-01-04 
10:44:11,694][model8_pretrain.py][INFO] Epoch:[0/2](388800/4588595) loss:2.777 lr:0.0000100 epoch_Time:26620.0min: [2024-01-04 10:44:11,694][model8_pretrain.py][INFO] Epoch:[0/2](388800/4588595) loss:2.512 lr:0.0000100 epoch_Time:26620.0min: [2024-01-04 10:44:11,694][model8_pretrain.py][INFO] Epoch:[0/2](388800/4588595) loss:2.791 lr:0.0000100 epoch_Time:26620.0min: [2024-01-04 10:44:11,694][model8_pretrain.py][INFO] Epoch:[0/2](388800/4588595) loss:2.898 lr:0.0000100 epoch_Time:26620.0min: [2024-01-04 10:44:11,695][model8_pretrain.py][INFO] Epoch:[0/2](388800/4588595) loss:2.982 lr:0.0000100 epoch_Time:26620.0min: [2024-01-04 10:44:48,622][model8_pretrain.py][INFO] Epoch:[0/2](388900/4588595) loss:2.978 lr:0.0000100 epoch_Time:26619.0min: [2024-01-04 10:44:48,622][model8_pretrain.py][INFO] Epoch:[0/2](388900/4588595) loss:2.663 lr:0.0000100 epoch_Time:26619.0min: [2024-01-04 10:44:48,622][model8_pretrain.py][INFO] Epoch:[0/2](388900/4588595) loss:2.621 lr:0.0000100 epoch_Time:26619.0min: [2024-01-04 10:44:48,622][model8_pretrain.py][INFO] Epoch:[0/2](388900/4588595) loss:2.701 lr:0.0000100 epoch_Time:26619.0min: [2024-01-04 10:44:48,622][model8_pretrain.py][INFO] Epoch:[0/2](388900/4588595) loss:3.378 lr:0.0000100 epoch_Time:26619.0min: [2024-01-04 10:44:48,622][model8_pretrain.py][INFO] Epoch:[0/2](388900/4588595) loss:2.779 lr:0.0000100 epoch_Time:26619.0min: [2024-01-04 10:44:48,622][model8_pretrain.py][INFO] Epoch:[0/2](388900/4588595) loss:2.998 lr:0.0000100 epoch_Time:26619.0min: [2024-01-04 10:44:48,622][model8_pretrain.py][INFO] Epoch:[0/2](388900/4588595) loss:3.113 lr:0.0000100 epoch_Time:26619.0min: [2024-01-04 10:45:25,547][model8_pretrain.py][INFO] Epoch:[0/2](389000/4588595) loss:3.214 lr:0.0000100 epoch_Time:26619.0min: [2024-01-04 10:45:25,547][model8_pretrain.py][INFO] Epoch:[0/2](389000/4588595) loss:2.628 lr:0.0000100 epoch_Time:26619.0min: [2024-01-04 10:45:25,547][model8_pretrain.py][INFO] Epoch:[0/2](389000/4588595) loss:2.512 lr:0.0000100 
epoch_Time:26619.0min: [2024-01-04 10:45:25,547][model8_pretrain.py][INFO] Epoch:[0/2](389000/4588595) loss:2.871 lr:0.0000100 epoch_Time:26619.0min: [2024-01-04 10:45:25,547][model8_pretrain.py][INFO] Epoch:[0/2](389000/4588595) loss:2.691 lr:0.0000100 epoch_Time:26619.0min: [2024-01-04 10:45:25,547][model8_pretrain.py][INFO] Epoch:[0/2](389000/4588595) loss:1.955 lr:0.0000100 epoch_Time:26619.0min: [2024-01-04 10:45:25,547][model8_pretrain.py][INFO] Epoch:[0/2](389000/4588595) loss:2.511 lr:0.0000100 epoch_Time:26619.0min: [2024-01-04 10:45:25,547][model8_pretrain.py][INFO] Epoch:[0/2](389000/4588595) loss:3.013 lr:0.0000100 epoch_Time:26619.0min: [2024-01-04 10:46:02,477][model8_pretrain.py][INFO] Epoch:[0/2](389100/4588595) loss:2.928 lr:0.0000100 epoch_Time:26618.0min: [2024-01-04 10:46:02,477][model8_pretrain.py][INFO] Epoch:[0/2](389100/4588595) loss:2.768 lr:0.0000100 epoch_Time:26618.0min: [2024-01-04 10:46:02,477][model8_pretrain.py][INFO] Epoch:[0/2](389100/4588595) loss:1.986 lr:0.0000100 epoch_Time:26617.0min: [2024-01-04 10:46:02,477][model8_pretrain.py][INFO] Epoch:[0/2](389100/4588595) loss:2.357 lr:0.0000100 epoch_Time:26617.0min: [2024-01-04 10:46:02,477][model8_pretrain.py][INFO] Epoch:[0/2](389100/4588595) loss:2.886 lr:0.0000100 epoch_Time:26617.0min: [2024-01-04 10:46:02,477][model8_pretrain.py][INFO] Epoch:[0/2](389100/4588595) loss:3.130 lr:0.0000100 epoch_Time:26617.0min: [2024-01-04 10:46:02,477][model8_pretrain.py][INFO] Epoch:[0/2](389100/4588595) loss:3.300 lr:0.0000100 epoch_Time:26617.0min: [2024-01-04 10:46:02,477][model8_pretrain.py][INFO] Epoch:[0/2](389100/4588595) loss:2.441 lr:0.0000100 epoch_Time:26617.0min: [2024-01-04 10:46:39,420][model8_pretrain.py][INFO] Epoch:[0/2](389200/4588595) loss:2.838 lr:0.0000100 epoch_Time:26617.0min: [2024-01-04 10:46:39,420][model8_pretrain.py][INFO] Epoch:[0/2](389200/4588595) loss:3.344 lr:0.0000100 epoch_Time:26617.0min: [2024-01-04 10:46:39,420][model8_pretrain.py][INFO] 
Epoch:[0/2](389200/4588595) loss:2.349 lr:0.0000100 epoch_Time:26617.0min: [2024-01-04 10:46:39,420][model8_pretrain.py][INFO] Epoch:[0/2](389200/4588595) loss:2.083 lr:0.0000100 epoch_Time:26617.0min: [2024-01-04 10:46:39,420][model8_pretrain.py][INFO] Epoch:[0/2](389200/4588595) loss:3.250 lr:0.0000100 epoch_Time:26617.0min: [2024-01-04 10:46:39,420][model8_pretrain.py][INFO] Epoch:[0/2](389200/4588595) loss:2.863 lr:0.0000100 epoch_Time:26617.0min: [2024-01-04 10:46:39,420][model8_pretrain.py][INFO] Epoch:[0/2](389200/4588595) loss:2.603 lr:0.0000100 epoch_Time:26617.0min: [2024-01-04 10:46:39,420][model8_pretrain.py][INFO] Epoch:[0/2](389200/4588595) loss:2.935 lr:0.0000100 epoch_Time:26617.0min: [2024-01-04 10:47:16,350][model8_pretrain.py][INFO] Epoch:[0/2](389300/4588595) loss:2.763 lr:0.0000100 epoch_Time:26616.0min: [2024-01-04 10:47:16,350][model8_pretrain.py][INFO] Epoch:[0/2](389300/4588595) loss:3.455 lr:0.0000100 epoch_Time:26616.0min: [2024-01-04 10:47:16,350][model8_pretrain.py][INFO] Epoch:[0/2](389300/4588595) loss:3.397 lr:0.0000100 epoch_Time:26616.0min: [2024-01-04 10:47:16,350][model8_pretrain.py][INFO] Epoch:[0/2](389300/4588595) loss:2.736 lr:0.0000100 epoch_Time:26616.0min: [2024-01-04 10:47:16,350][model8_pretrain.py][INFO] Epoch:[0/2](389300/4588595) loss:3.107 lr:0.0000100 epoch_Time:26616.0min: [2024-01-04 10:47:16,350][model8_pretrain.py][INFO] Epoch:[0/2](389300/4588595) loss:3.363 lr:0.0000100 epoch_Time:26616.0min: [2024-01-04 10:47:16,351][model8_pretrain.py][INFO] Epoch:[0/2](389300/4588595) loss:2.262 lr:0.0000100 epoch_Time:26616.0min: [2024-01-04 10:47:16,351][model8_pretrain.py][INFO] Epoch:[0/2](389300/4588595) loss:2.955 lr:0.0000100 epoch_Time:26616.0min: [2024-01-04 10:47:53,280][model8_pretrain.py][INFO] Epoch:[0/2](389400/4588595) loss:2.774 lr:0.0000100 epoch_Time:26615.0min: [2024-01-04 10:47:53,280][model8_pretrain.py][INFO] Epoch:[0/2](389400/4588595) loss:2.878 lr:0.0000100 epoch_Time:26615.0min: [2024-01-04 
10:47:53,280][model8_pretrain.py][INFO] Epoch:[0/2](389400/4588595) loss:2.646 lr:0.0000100 epoch_Time:26615.0min: [2024-01-04 10:47:53,280][model8_pretrain.py][INFO] Epoch:[0/2](389400/4588595) loss:3.025 lr:0.0000100 epoch_Time:26615.0min: [2024-01-04 10:47:53,280][model8_pretrain.py][INFO] Epoch:[0/2](389400/4588595) loss:2.745 lr:0.0000100 epoch_Time:26615.0min: [2024-01-04 10:47:53,280][model8_pretrain.py][INFO] Epoch:[0/2](389400/4588595) loss:3.555 lr:0.0000100 epoch_Time:26615.0min: [2024-01-04 10:47:53,280][model8_pretrain.py][INFO] Epoch:[0/2](389400/4588595) loss:1.978 lr:0.0000100 epoch_Time:26615.0min: [2024-01-04 10:47:53,280][model8_pretrain.py][INFO] Epoch:[0/2](389400/4588595) loss:2.528 lr:0.0000100 epoch_Time:26615.0min: [2024-01-04 10:48:35,447][model8_pretrain.py][INFO] Epoch:[0/2](389500/4588595) loss:3.377 lr:0.0000100 epoch_Time:26616.0min: [2024-01-04 10:48:35,447][model8_pretrain.py][INFO] Epoch:[0/2](389500/4588595) loss:3.332 lr:0.0000100 epoch_Time:26616.0min: [2024-01-04 10:48:35,447][model8_pretrain.py][INFO] Epoch:[0/2](389500/4588595) loss:3.004 lr:0.0000100 epoch_Time:26616.0min: [2024-01-04 10:48:35,447][model8_pretrain.py][INFO] Epoch:[0/2](389500/4588595) loss:2.880 lr:0.0000100 epoch_Time:26616.0min: [2024-01-04 10:48:35,447][model8_pretrain.py][INFO] Epoch:[0/2](389500/4588595) loss:3.012 lr:0.0000100 epoch_Time:26616.0min: [2024-01-04 10:48:35,447][model8_pretrain.py][INFO] Epoch:[0/2](389500/4588595) loss:2.651 lr:0.0000100 epoch_Time:26616.0min: [2024-01-04 10:48:35,447][model8_pretrain.py][INFO] Epoch:[0/2](389500/4588595) loss:2.774 lr:0.0000100 epoch_Time:26616.0min: [2024-01-04 10:48:35,447][model8_pretrain.py][INFO] Epoch:[0/2](389500/4588595) loss:3.343 lr:0.0000100 epoch_Time:26616.0min: [2024-01-04 10:49:17,353][model8_pretrain.py][INFO] Epoch:[0/2](389600/4588595) loss:2.633 lr:0.0000100 epoch_Time:26615.0min: [2024-01-04 10:49:17,353][model8_pretrain.py][INFO] Epoch:[0/2](389600/4588595) loss:3.288 lr:0.0000100 
epoch_Time:26615.0min: [2024-01-04 10:49:17,353][model8_pretrain.py][INFO] Epoch:[0/2](389600/4588595) loss:3.212 lr:0.0000100 epoch_Time:26615.0min: [2024-01-04 10:49:17,353][model8_pretrain.py][INFO] Epoch:[0/2](389600/4588595) loss:2.939 lr:0.0000100 epoch_Time:26615.0min: [2024-01-04 10:49:17,353][model8_pretrain.py][INFO] Epoch:[0/2](389600/4588595) loss:2.777 lr:0.0000100 epoch_Time:26615.0min: [2024-01-04 10:49:17,353][model8_pretrain.py][INFO] Epoch:[0/2](389600/4588595) loss:3.371 lr:0.0000100 epoch_Time:26615.0min: [2024-01-04 10:49:17,353][model8_pretrain.py][INFO] Epoch:[0/2](389600/4588595) loss:3.025 lr:0.0000100 epoch_Time:26615.0min: [2024-01-04 10:49:17,353][model8_pretrain.py][INFO] Epoch:[0/2](389600/4588595) loss:2.787 lr:0.0000100 epoch_Time:26615.0min: [2024-01-04 10:49:54,288][model8_pretrain.py][INFO] Epoch:[0/2](389700/4588595) loss:3.086 lr:0.0000100 epoch_Time:26614.0min: [2024-01-04 10:49:54,288][model8_pretrain.py][INFO] Epoch:[0/2](389700/4588595) loss:2.337 lr:0.0000100 epoch_Time:26614.0min: [2024-01-04 10:49:54,288][model8_pretrain.py][INFO] Epoch:[0/2](389700/4588595) loss:2.942 lr:0.0000100 epoch_Time:26614.0min: [2024-01-04 10:49:54,288][model8_pretrain.py][INFO] Epoch:[0/2](389700/4588595) loss:2.961 lr:0.0000100 epoch_Time:26614.0min: [2024-01-04 10:49:54,288][model8_pretrain.py][INFO] Epoch:[0/2](389700/4588595) loss:2.795 lr:0.0000100 epoch_Time:26614.0min: [2024-01-04 10:49:54,288][model8_pretrain.py][INFO] Epoch:[0/2](389700/4588595) loss:2.714 lr:0.0000100 epoch_Time:26614.0min: [2024-01-04 10:49:54,288][model8_pretrain.py][INFO] Epoch:[0/2](389700/4588595) loss:2.556 lr:0.0000100 epoch_Time:26614.0min: [2024-01-04 10:49:54,288][model8_pretrain.py][INFO] Epoch:[0/2](389700/4588595) loss:3.046 lr:0.0000100 epoch_Time:26614.0min: [2024-01-04 10:50:31,257][model8_pretrain.py][INFO] Epoch:[0/2](389800/4588595) loss:3.309 lr:0.0000100 epoch_Time:26614.0min: [2024-01-04 10:50:31,257][model8_pretrain.py][INFO] 
Epoch:[0/2](389800/4588595) loss:2.387 lr:0.0000100 epoch_Time:26614.0min: [2024-01-04 10:50:31,257][model8_pretrain.py][INFO] Epoch:[0/2](389800/4588595) loss:3.055 lr:0.0000100 epoch_Time:26614.0min: [2024-01-04 10:50:31,257][model8_pretrain.py][INFO] Epoch:[0/2](389800/4588595) loss:2.887 lr:0.0000100 epoch_Time:26614.0min: [2024-01-04 10:50:31,257][model8_pretrain.py][INFO] Epoch:[0/2](389800/4588595) loss:2.874 lr:0.0000100 epoch_Time:26614.0min: [2024-01-04 10:50:31,257][model8_pretrain.py][INFO] Epoch:[0/2](389800/4588595) loss:2.585 lr:0.0000100 epoch_Time:26614.0min: [2024-01-04 10:50:31,257][model8_pretrain.py][INFO] Epoch:[0/2](389800/4588595) loss:2.661 lr:0.0000100 epoch_Time:26614.0min: [2024-01-04 10:50:31,257][model8_pretrain.py][INFO] Epoch:[0/2](389800/4588595) loss:3.158 lr:0.0000100 epoch_Time:26614.0min: [2024-01-04 10:51:08,195][model8_pretrain.py][INFO] Epoch:[0/2](389900/4588595) loss:2.813 lr:0.0000100 epoch_Time:26613.0min: [2024-01-04 10:51:08,195][model8_pretrain.py][INFO] Epoch:[0/2](389900/4588595) loss:3.367 lr:0.0000100 epoch_Time:26613.0min: [2024-01-04 10:51:08,195][model8_pretrain.py][INFO] Epoch:[0/2](389900/4588595) loss:2.809 lr:0.0000100 epoch_Time:26613.0min: [2024-01-04 10:51:08,195][model8_pretrain.py][INFO] Epoch:[0/2](389900/4588595) loss:2.852 lr:0.0000100 epoch_Time:26613.0min: [2024-01-04 10:51:08,195][model8_pretrain.py][INFO] Epoch:[0/2](389900/4588595) loss:2.502 lr:0.0000100 epoch_Time:26613.0min: [2024-01-04 10:51:08,195][model8_pretrain.py][INFO] Epoch:[0/2](389900/4588595) loss:3.139 lr:0.0000100 epoch_Time:26613.0min: [2024-01-04 10:51:08,195][model8_pretrain.py][INFO] Epoch:[0/2](389900/4588595) loss:2.438 lr:0.0000100 epoch_Time:26613.0min: [2024-01-04 10:51:08,195][model8_pretrain.py][INFO] Epoch:[0/2](389900/4588595) loss:2.813 lr:0.0000100 epoch_Time:26613.0min: [2024-01-04 10:51:45,124][model8_pretrain.py][INFO] Epoch:[0/2](390000/4588595) loss:3.168 lr:0.0000100 epoch_Time:26613.0min: [2024-01-04 
10:51:45,124][model8_pretrain.py][INFO] Epoch:[0/2](390000/4588595) loss:3.117 lr:0.0000100 epoch_Time:26613.0min: [2024-01-04 10:51:45,124][model8_pretrain.py][INFO] Epoch:[0/2](390000/4588595) loss:2.881 lr:0.0000100 epoch_Time:26613.0min: [2024-01-04 10:51:45,124][model8_pretrain.py][INFO] Epoch:[0/2](390000/4588595) loss:1.990 lr:0.0000100 epoch_Time:26613.0min: [2024-01-04 10:51:45,124][model8_pretrain.py][INFO] Epoch:[0/2](390000/4588595) loss:3.008 lr:0.0000100 epoch_Time:26613.0min: [2024-01-04 10:51:45,124][model8_pretrain.py][INFO] Epoch:[0/2](390000/4588595) loss:2.406 lr:0.0000100 epoch_Time:26613.0min: [2024-01-04 10:51:45,124][model8_pretrain.py][INFO] Epoch:[0/2](390000/4588595) loss:2.759 lr:0.0000100 epoch_Time:26613.0min: [2024-01-04 10:51:45,125][model8_pretrain.py][INFO] Epoch:[0/2](390000/4588595) loss:2.883 lr:0.0000100 epoch_Time:26613.0min: [2024-01-04 10:52:22,061][model8_pretrain.py][INFO] Epoch:[0/2](390100/4588595) loss:3.153 lr:0.0000100 epoch_Time:26611.0min: [2024-01-04 10:52:22,061][model8_pretrain.py][INFO] Epoch:[0/2](390100/4588595) loss:3.317 lr:0.0000100 epoch_Time:26611.0min: [2024-01-04 10:52:22,061][model8_pretrain.py][INFO] Epoch:[0/2](390100/4588595) loss:3.131 lr:0.0000100 epoch_Time:26611.0min: [2024-01-04 10:52:22,061][model8_pretrain.py][INFO] Epoch:[0/2](390100/4588595) loss:2.958 lr:0.0000100 epoch_Time:26611.0min: [2024-01-04 10:52:22,061][model8_pretrain.py][INFO] Epoch:[0/2](390100/4588595) loss:2.186 lr:0.0000100 epoch_Time:26611.0min: [2024-01-04 10:52:22,061][model8_pretrain.py][INFO] Epoch:[0/2](390100/4588595) loss:2.779 lr:0.0000100 epoch_Time:26611.0min: [2024-01-04 10:52:22,061][model8_pretrain.py][INFO] Epoch:[0/2](390100/4588595) loss:2.584 lr:0.0000100 epoch_Time:26611.0min: [2024-01-04 10:52:22,062][model8_pretrain.py][INFO] Epoch:[0/2](390100/4588595) loss:2.783 lr:0.0000100 epoch_Time:26611.0min: [2024-01-04 10:52:58,991][model8_pretrain.py][INFO] Epoch:[0/2](390200/4588595) loss:2.595 lr:0.0000100 
epoch_Time:26610.0min: [2024-01-04 10:52:58,991][model8_pretrain.py][INFO] Epoch:[0/2](390200/4588595) loss:2.135 lr:0.0000100 epoch_Time:26610.0min: [2024-01-04 10:52:58,991][model8_pretrain.py][INFO] Epoch:[0/2](390200/4588595) loss:2.551 lr:0.0000100 epoch_Time:26610.0min: [2024-01-04 10:52:58,991][model8_pretrain.py][INFO] Epoch:[0/2](390200/4588595) loss:2.442 lr:0.0000100 epoch_Time:26610.0min: [2024-01-04 10:52:58,991][model8_pretrain.py][INFO] Epoch:[0/2](390200/4588595) loss:3.158 lr:0.0000100 epoch_Time:26610.0min: [2024-01-04 10:52:58,991][model8_pretrain.py][INFO] Epoch:[0/2](390200/4588595) loss:3.176 lr:0.0000100 epoch_Time:26610.0min: [2024-01-04 10:52:58,991][model8_pretrain.py][INFO] Epoch:[0/2](390200/4588595) loss:2.383 lr:0.0000100 epoch_Time:26610.0min: [2024-01-04 10:52:58,991][model8_pretrain.py][INFO] Epoch:[0/2](390200/4588595) loss:3.040 lr:0.0000100 epoch_Time:26610.0min: [2024-01-04 10:53:41,039][model8_pretrain.py][INFO] Epoch:[0/2](390300/4588595) loss:2.796 lr:0.0000100 epoch_Time:26611.0min: [2024-01-04 10:53:41,039][model8_pretrain.py][INFO] Epoch:[0/2](390300/4588595) loss:3.162 lr:0.0000100 epoch_Time:26611.0min: [2024-01-04 10:53:41,039][model8_pretrain.py][INFO] Epoch:[0/2](390300/4588595) loss:3.369 lr:0.0000100 epoch_Time:26611.0min: [2024-01-04 10:53:41,039][model8_pretrain.py][INFO] Epoch:[0/2](390300/4588595) loss:2.783 lr:0.0000100 epoch_Time:26611.0min: [2024-01-04 10:53:41,039][model8_pretrain.py][INFO] Epoch:[0/2](390300/4588595) loss:2.668 lr:0.0000100 epoch_Time:26611.0min: [2024-01-04 10:53:41,039][model8_pretrain.py][INFO] Epoch:[0/2](390300/4588595) loss:2.801 lr:0.0000100 epoch_Time:26611.0min: [2024-01-04 10:53:41,039][model8_pretrain.py][INFO] Epoch:[0/2](390300/4588595) loss:2.320 lr:0.0000100 epoch_Time:26611.0min: [2024-01-04 10:53:41,039][model8_pretrain.py][INFO] Epoch:[0/2](390300/4588595) loss:3.043 lr:0.0000100 epoch_Time:26611.0min: [2024-01-04 10:54:22,935][model8_pretrain.py][INFO] 
Epoch:[0/2](390400/4588595) loss:2.544 lr:0.0000100 epoch_Time:26611.0min: [2024-01-04 10:54:22,935][model8_pretrain.py][INFO] Epoch:[0/2](390400/4588595) loss:2.703 lr:0.0000100 epoch_Time:26611.0min: [2024-01-04 10:54:22,935][model8_pretrain.py][INFO] Epoch:[0/2](390400/4588595) loss:2.309 lr:0.0000100 epoch_Time:26611.0min: [2024-01-04 10:54:22,935][model8_pretrain.py][INFO] Epoch:[0/2](390400/4588595) loss:2.148 lr:0.0000100 epoch_Time:26611.0min: [2024-01-04 10:54:22,935][model8_pretrain.py][INFO] Epoch:[0/2](390400/4588595) loss:2.693 lr:0.0000100 epoch_Time:26611.0min: [2024-01-04 10:54:22,935][model8_pretrain.py][INFO] Epoch:[0/2](390400/4588595) loss:3.248 lr:0.0000100 epoch_Time:26611.0min: [2024-01-04 10:54:22,935][model8_pretrain.py][INFO] Epoch:[0/2](390400/4588595) loss:2.607 lr:0.0000100 epoch_Time:26611.0min: [2024-01-04 10:54:22,935][model8_pretrain.py][INFO] Epoch:[0/2](390400/4588595) loss:3.402 lr:0.0000100 epoch_Time:26611.0min: [2024-01-04 10:54:59,876][model8_pretrain.py][INFO] Epoch:[0/2](390500/4588595) loss:3.088 lr:0.0000100 epoch_Time:26609.0min: [2024-01-04 10:54:59,876][model8_pretrain.py][INFO] Epoch:[0/2](390500/4588595) loss:3.078 lr:0.0000100 epoch_Time:26609.0min: [2024-01-04 10:54:59,876][model8_pretrain.py][INFO] Epoch:[0/2](390500/4588595) loss:2.491 lr:0.0000100 epoch_Time:26609.0min: [2024-01-04 10:54:59,876][model8_pretrain.py][INFO] Epoch:[0/2](390500/4588595) loss:3.338 lr:0.0000100 epoch_Time:26609.0min: [2024-01-04 10:54:59,876][model8_pretrain.py][INFO] Epoch:[0/2](390500/4588595) loss:2.774 lr:0.0000100 epoch_Time:26609.0min: [2024-01-04 10:54:59,876][model8_pretrain.py][INFO] Epoch:[0/2](390500/4588595) loss:2.523 lr:0.0000100 epoch_Time:26609.0min: [2024-01-04 10:54:59,876][model8_pretrain.py][INFO] Epoch:[0/2](390500/4588595) loss:3.073 lr:0.0000100 epoch_Time:26609.0min: [2024-01-04 10:54:59,876][model8_pretrain.py][INFO] Epoch:[0/2](390500/4588595) loss:2.682 lr:0.0000100 epoch_Time:26609.0min: [2024-01-04 
10:55:36,807][model8_pretrain.py][INFO] Epoch:[0/2](390600/4588595) loss:3.003 lr:0.0000100 epoch_Time:26609.0min: [2024-01-04 10:55:36,807][model8_pretrain.py][INFO] Epoch:[0/2](390600/4588595) loss:2.664 lr:0.0000100 epoch_Time:26609.0min: [2024-01-04 10:55:36,808][model8_pretrain.py][INFO] Epoch:[0/2](390600/4588595) loss:2.901 lr:0.0000100 epoch_Time:26609.0min: [2024-01-04 10:55:36,808][model8_pretrain.py][INFO] Epoch:[0/2](390600/4588595) loss:3.291 lr:0.0000100 epoch_Time:26609.0min: [2024-01-04 10:55:36,808][model8_pretrain.py][INFO] Epoch:[0/2](390600/4588595) loss:2.672 lr:0.0000100 epoch_Time:26609.0min: [2024-01-04 10:55:36,808][model8_pretrain.py][INFO] Epoch:[0/2](390600/4588595) loss:2.609 lr:0.0000100 epoch_Time:26609.0min: [2024-01-04 10:55:36,808][model8_pretrain.py][INFO] Epoch:[0/2](390600/4588595) loss:3.463 lr:0.0000100 epoch_Time:26609.0min: [2024-01-04 10:55:36,808][model8_pretrain.py][INFO] Epoch:[0/2](390600/4588595) loss:3.064 lr:0.0000100 epoch_Time:26609.0min: [2024-01-04 10:56:13,738][model8_pretrain.py][INFO] Epoch:[0/2](390700/4588595) loss:2.103 lr:0.0000100 epoch_Time:26608.0min: [2024-01-04 10:56:13,738][model8_pretrain.py][INFO] Epoch:[0/2](390700/4588595) loss:2.846 lr:0.0000100 epoch_Time:26608.0min: [2024-01-04 10:56:13,738][model8_pretrain.py][INFO] Epoch:[0/2](390700/4588595) loss:3.231 lr:0.0000100 epoch_Time:26608.0min: [2024-01-04 10:56:13,738][model8_pretrain.py][INFO] Epoch:[0/2](390700/4588595) loss:3.335 lr:0.0000100 epoch_Time:26608.0min: [2024-01-04 10:56:13,738][model8_pretrain.py][INFO] Epoch:[0/2](390700/4588595) loss:2.868 lr:0.0000100 epoch_Time:26608.0min: [2024-01-04 10:56:13,738][model8_pretrain.py][INFO] Epoch:[0/2](390700/4588595) loss:2.239 lr:0.0000100 epoch_Time:26608.0min: [2024-01-04 10:56:13,738][model8_pretrain.py][INFO] Epoch:[0/2](390700/4588595) loss:3.208 lr:0.0000100 epoch_Time:26608.0min: [2024-01-04 10:56:13,738][model8_pretrain.py][INFO] Epoch:[0/2](390700/4588595) loss:3.207 lr:0.0000100 
epoch_Time:26608.0min: [2024-01-04 10:56:50,667][model8_pretrain.py][INFO] Epoch:[0/2](390800/4588595) loss:2.172 lr:0.0000100 epoch_Time:26607.0min: [2024-01-04 10:56:50,667][model8_pretrain.py][INFO] Epoch:[0/2](390800/4588595) loss:3.067 lr:0.0000100 epoch_Time:26607.0min: [2024-01-04 10:56:50,667][model8_pretrain.py][INFO] Epoch:[0/2](390800/4588595) loss:2.809 lr:0.0000100 epoch_Time:26607.0min: [2024-01-04 10:56:50,667][model8_pretrain.py][INFO] Epoch:[0/2](390800/4588595) loss:3.109 lr:0.0000100 epoch_Time:26607.0min: [2024-01-04 10:56:50,667][model8_pretrain.py][INFO] Epoch:[0/2](390800/4588595) loss:2.910 lr:0.0000100 epoch_Time:26607.0min: [2024-01-04 10:56:50,667][model8_pretrain.py][INFO] Epoch:[0/2](390800/4588595) loss:2.955 lr:0.0000100 epoch_Time:26607.0min: [2024-01-04 10:56:50,667][model8_pretrain.py][INFO] Epoch:[0/2](390800/4588595) loss:2.548 lr:0.0000100 epoch_Time:26607.0min: [2024-01-04 10:56:50,667][model8_pretrain.py][INFO] Epoch:[0/2](390800/4588595) loss:3.312 lr:0.0000100 epoch_Time:26607.0min: [2024-01-04 10:57:27,603][model8_pretrain.py][INFO] Epoch:[0/2](390900/4588595) loss:2.708 lr:0.0000100 epoch_Time:26607.0min: [2024-01-04 10:57:27,603][model8_pretrain.py][INFO] Epoch:[0/2](390900/4588595) loss:2.790 lr:0.0000100 epoch_Time:26607.0min: [2024-01-04 10:57:27,603][model8_pretrain.py][INFO] Epoch:[0/2](390900/4588595) loss:3.126 lr:0.0000100 epoch_Time:26607.0min: [2024-01-04 10:57:27,603][model8_pretrain.py][INFO] Epoch:[0/2](390900/4588595) loss:3.068 lr:0.0000100 epoch_Time:26607.0min: [2024-01-04 10:57:27,604][model8_pretrain.py][INFO] Epoch:[0/2](390900/4588595) loss:2.742 lr:0.0000100 epoch_Time:26607.0min: [2024-01-04 10:57:27,604][model8_pretrain.py][INFO] Epoch:[0/2](390900/4588595) loss:3.126 lr:0.0000100 epoch_Time:26607.0min: [2024-01-04 10:57:27,603][model8_pretrain.py][INFO] Epoch:[0/2](390900/4588595) loss:3.159 lr:0.0000100 epoch_Time:26607.0min: [2024-01-04 10:57:27,604][model8_pretrain.py][INFO] 
Epoch:[0/2](390900/4588595) loss:2.372 lr:0.0000100 epoch_Time:26607.0min: [2024-01-04 10:58:04,538][model8_pretrain.py][INFO] Epoch:[0/2](391000/4588595) loss:3.084 lr:0.0000100 epoch_Time:26605.0min: [2024-01-04 10:58:04,538][model8_pretrain.py][INFO] Epoch:[0/2](391000/4588595) loss:2.685 lr:0.0000100 epoch_Time:26605.0min: [2024-01-04 10:58:04,538][model8_pretrain.py][INFO] Epoch:[0/2](391000/4588595) loss:2.941 lr:0.0000100 epoch_Time:26605.0min: [2024-01-04 10:58:04,538][model8_pretrain.py][INFO] Epoch:[0/2](391000/4588595) loss:2.956 lr:0.0000100 epoch_Time:26605.0min: [2024-01-04 10:58:04,538][model8_pretrain.py][INFO] Epoch:[0/2](391000/4588595) loss:2.442 lr:0.0000100 epoch_Time:26605.0min: [2024-01-04 10:58:04,538][model8_pretrain.py][INFO] Epoch:[0/2](391000/4588595) loss:2.446 lr:0.0000100 epoch_Time:26605.0min: [2024-01-04 10:58:04,538][model8_pretrain.py][INFO] Epoch:[0/2](391000/4588595) loss:3.076 lr:0.0000100 epoch_Time:26605.0min: [2024-01-04 10:58:04,538][model8_pretrain.py][INFO] Epoch:[0/2](391000/4588595) loss:2.487 lr:0.0000100 epoch_Time:26605.0min: [2024-01-04 10:58:46,485][model8_pretrain.py][INFO] Epoch:[0/2](391100/4588595) loss:2.705 lr:0.0000100 epoch_Time:26606.0min: [2024-01-04 10:58:46,485][model8_pretrain.py][INFO] Epoch:[0/2](391100/4588595) loss:2.878 lr:0.0000100 epoch_Time:26606.0min: [2024-01-04 10:58:46,485][model8_pretrain.py][INFO] Epoch:[0/2](391100/4588595) loss:2.514 lr:0.0000100 epoch_Time:26606.0min: [2024-01-04 10:58:46,485][model8_pretrain.py][INFO] Epoch:[0/2](391100/4588595) loss:2.695 lr:0.0000100 epoch_Time:26606.0min: [2024-01-04 10:58:46,489][model8_pretrain.py][INFO] Epoch:[0/2](391100/4588595) loss:3.060 lr:0.0000100 epoch_Time:26606.0min: [2024-01-04 10:58:46,490][model8_pretrain.py][INFO] Epoch:[0/2](391100/4588595) loss:3.426 lr:0.0000100 epoch_Time:26606.0min: [2024-01-04 10:58:46,490][model8_pretrain.py][INFO] Epoch:[0/2](391100/4588595) loss:2.690 lr:0.0000100 epoch_Time:26606.0min: [2024-01-04 
10:58:46,490][model8_pretrain.py][INFO] Epoch:[0/2](391100/4588595) loss:2.776 lr:0.0000100 epoch_Time:26606.0min: [2024-01-04 10:59:28,437][model8_pretrain.py][INFO] Epoch:[0/2](391200/4588595) loss:2.520 lr:0.0000100 epoch_Time:26606.0min: [2024-01-04 10:59:28,437][model8_pretrain.py][INFO] Epoch:[0/2](391200/4588595) loss:2.820 lr:0.0000100 epoch_Time:26606.0min: [2024-01-04 10:59:28,437][model8_pretrain.py][INFO] Epoch:[0/2](391200/4588595) loss:2.235 lr:0.0000100 epoch_Time:26606.0min: [2024-01-04 10:59:28,437][model8_pretrain.py][INFO] Epoch:[0/2](391200/4588595) loss:2.995 lr:0.0000100 epoch_Time:26606.0min: [2024-01-04 10:59:28,437][model8_pretrain.py][INFO] Epoch:[0/2](391200/4588595) loss:3.191 lr:0.0000100 epoch_Time:26606.0min: [2024-01-04 10:59:28,437][model8_pretrain.py][INFO] Epoch:[0/2](391200/4588595) loss:2.977 lr:0.0000100 epoch_Time:26606.0min: [2024-01-04 10:59:28,437][model8_pretrain.py][INFO] Epoch:[0/2](391200/4588595) loss:2.959 lr:0.0000100 epoch_Time:26606.0min: [2024-01-04 10:59:28,437][model8_pretrain.py][INFO] Epoch:[0/2](391200/4588595) loss:2.847 lr:0.0000100 epoch_Time:26606.0min: [2024-01-04 11:00:05,380][model8_pretrain.py][INFO] Epoch:[0/2](391300/4588595) loss:3.038 lr:0.0000100 epoch_Time:26605.0min: [2024-01-04 11:00:05,380][model8_pretrain.py][INFO] Epoch:[0/2](391300/4588595) loss:2.529 lr:0.0000100 epoch_Time:26605.0min: [2024-01-04 11:00:05,380][model8_pretrain.py][INFO] Epoch:[0/2](391300/4588595) loss:2.817 lr:0.0000100 epoch_Time:26605.0min: [2024-01-04 11:00:05,380][model8_pretrain.py][INFO] Epoch:[0/2](391300/4588595) loss:3.008 lr:0.0000100 epoch_Time:26605.0min: [2024-01-04 11:00:05,380][model8_pretrain.py][INFO] Epoch:[0/2](391300/4588595) loss:2.723 lr:0.0000100 epoch_Time:26605.0min: [2024-01-04 11:00:05,380][model8_pretrain.py][INFO] Epoch:[0/2](391300/4588595) loss:2.206 lr:0.0000100 epoch_Time:26605.0min: [2024-01-04 11:00:05,380][model8_pretrain.py][INFO] Epoch:[0/2](391300/4588595) loss:3.208 lr:0.0000100 
epoch_Time:26605.0min: [2024-01-04 11:00:05,380][model8_pretrain.py][INFO] Epoch:[0/2](391300/4588595) loss:2.882 lr:0.0000100 epoch_Time:26605.0min: [2024-01-04 11:00:42,327][model8_pretrain.py][INFO] Epoch:[0/2](391400/4588595) loss:2.089 lr:0.0000100 epoch_Time:26605.0min: [2024-01-04 11:00:42,327][model8_pretrain.py][INFO] Epoch:[0/2](391400/4588595) loss:3.118 lr:0.0000100 epoch_Time:26605.0min: [2024-01-04 11:00:42,327][model8_pretrain.py][INFO] Epoch:[0/2](391400/4588595) loss:3.162 lr:0.0000100 epoch_Time:26605.0min: [2024-01-04 11:00:42,327][model8_pretrain.py][INFO] Epoch:[0/2](391400/4588595) loss:2.655 lr:0.0000100 epoch_Time:26605.0min: [2024-01-04 11:00:42,327][model8_pretrain.py][INFO] Epoch:[0/2](391400/4588595) loss:2.821 lr:0.0000100 epoch_Time:26605.0min: [2024-01-04 11:00:42,327][model8_pretrain.py][INFO] Epoch:[0/2](391400/4588595) loss:2.881 lr:0.0000100 epoch_Time:26605.0min: [2024-01-04 11:00:42,327][model8_pretrain.py][INFO] Epoch:[0/2](391400/4588595) loss:2.943 lr:0.0000100 epoch_Time:26605.0min: [2024-01-04 11:00:42,327][model8_pretrain.py][INFO] Epoch:[0/2](391400/4588595) loss:3.095 lr:0.0000100 epoch_Time:26605.0min: [2024-01-04 11:01:19,278][model8_pretrain.py][INFO] Epoch:[0/2](391500/4588595) loss:2.722 lr:0.0000100 epoch_Time:26603.0min: [2024-01-04 11:01:19,278][model8_pretrain.py][INFO] Epoch:[0/2](391500/4588595) loss:3.377 lr:0.0000100 epoch_Time:26603.0min: [2024-01-04 11:01:19,278][model8_pretrain.py][INFO] Epoch:[0/2](391500/4588595) loss:2.768 lr:0.0000100 epoch_Time:26603.0min: [2024-01-04 11:01:19,278][model8_pretrain.py][INFO] Epoch:[0/2](391500/4588595) loss:3.039 lr:0.0000100 epoch_Time:26603.0min: [2024-01-04 11:01:19,278][model8_pretrain.py][INFO] Epoch:[0/2](391500/4588595) loss:2.627 lr:0.0000100 epoch_Time:26603.0min: [2024-01-04 11:01:19,278][model8_pretrain.py][INFO] Epoch:[0/2](391500/4588595) loss:2.907 lr:0.0000100 epoch_Time:26603.0min: [2024-01-04 11:01:19,278][model8_pretrain.py][INFO] 
Epoch:[0/2](391500/4588595) loss:2.813 lr:0.0000100 epoch_Time:26603.0min: [2024-01-04 11:01:19,278][model8_pretrain.py][INFO] Epoch:[0/2](391500/4588595) loss:3.168 lr:0.0000100 epoch_Time:26603.0min: [2024-01-04 11:01:56,230][model8_pretrain.py][INFO] Epoch:[0/2](391600/4588595) loss:2.913 lr:0.0000100 epoch_Time:26602.0min: [2024-01-04 11:01:56,230][model8_pretrain.py][INFO] Epoch:[0/2](391600/4588595) loss:2.608 lr:0.0000100 epoch_Time:26602.0min: [2024-01-04 11:01:56,230][model8_pretrain.py][INFO] Epoch:[0/2](391600/4588595) loss:2.692 lr:0.0000100 epoch_Time:26602.0min: [2024-01-04 11:01:56,230][model8_pretrain.py][INFO] Epoch:[0/2](391600/4588595) loss:2.541 lr:0.0000100 epoch_Time:26602.0min: [2024-01-04 11:01:56,230][model8_pretrain.py][INFO] Epoch:[0/2](391600/4588595) loss:3.100 lr:0.0000100 epoch_Time:26602.0min: [2024-01-04 11:01:56,230][model8_pretrain.py][INFO] Epoch:[0/2](391600/4588595) loss:2.985 lr:0.0000100 epoch_Time:26602.0min: [2024-01-04 11:01:56,230][model8_pretrain.py][INFO] Epoch:[0/2](391600/4588595) loss:2.465 lr:0.0000100 epoch_Time:26602.0min: [2024-01-04 11:01:56,230][model8_pretrain.py][INFO] Epoch:[0/2](391600/4588595) loss:2.534 lr:0.0000100 epoch_Time:26602.0min: [2024-01-04 11:02:33,192][model8_pretrain.py][INFO] Epoch:[0/2](391700/4588595) loss:2.783 lr:0.0000100 epoch_Time:26602.0min: [2024-01-04 11:02:33,192][model8_pretrain.py][INFO] Epoch:[0/2](391700/4588595) loss:2.647 lr:0.0000100 epoch_Time:26602.0min: [2024-01-04 11:02:33,192][model8_pretrain.py][INFO] Epoch:[0/2](391700/4588595) loss:2.871 lr:0.0000100 epoch_Time:26602.0min: [2024-01-04 11:02:33,192][model8_pretrain.py][INFO] Epoch:[0/2](391700/4588595) loss:2.679 lr:0.0000100 epoch_Time:26602.0min: [2024-01-04 11:02:33,192][model8_pretrain.py][INFO] Epoch:[0/2](391700/4588595) loss:3.378 lr:0.0000100 epoch_Time:26602.0min: [2024-01-04 11:02:33,192][model8_pretrain.py][INFO] Epoch:[0/2](391700/4588595) loss:2.880 lr:0.0000100 epoch_Time:26602.0min: [2024-01-04 
11:02:33,192][model8_pretrain.py][INFO] Epoch:[0/2](391700/4588595) loss:2.689 lr:0.0000100 epoch_Time:26602.0min: [2024-01-04 11:02:33,192][model8_pretrain.py][INFO] Epoch:[0/2](391700/4588595) loss:2.664 lr:0.0000100 epoch_Time:26602.0min: [2024-01-04 11:03:10,146][model8_pretrain.py][INFO] Epoch:[0/2](391800/4588595) loss:3.245 lr:0.0000100 epoch_Time:26601.0min: [2024-01-04 11:03:10,146][model8_pretrain.py][INFO] Epoch:[0/2](391800/4588595) loss:2.895 lr:0.0000100 epoch_Time:26601.0min: [2024-01-04 11:03:10,146][model8_pretrain.py][INFO] Epoch:[0/2](391800/4588595) loss:2.959 lr:0.0000100 epoch_Time:26601.0min: [2024-01-04 11:03:10,146][model8_pretrain.py][INFO] Epoch:[0/2](391800/4588595) loss:2.844 lr:0.0000100 epoch_Time:26601.0min: [2024-01-04 11:03:10,146][model8_pretrain.py][INFO] Epoch:[0/2](391800/4588595) loss:2.281 lr:0.0000100 epoch_Time:26601.0min: [2024-01-04 11:03:10,146][model8_pretrain.py][INFO] Epoch:[0/2](391800/4588595) loss:3.171 lr:0.0000100 epoch_Time:26601.0min: [2024-01-04 11:03:10,147][model8_pretrain.py][INFO] Epoch:[0/2](391800/4588595) loss:3.182 lr:0.0000100 epoch_Time:26601.0min: [2024-01-04 11:03:10,146][model8_pretrain.py][INFO] Epoch:[0/2](391800/4588595) loss:2.688 lr:0.0000100 epoch_Time:26601.0min: [2024-01-04 11:03:48,826][model8_pretrain.py][INFO] Epoch:[0/2](391900/4588595) loss:3.377 lr:0.0000100 epoch_Time:26600.0min: [2024-01-04 11:03:48,826][model8_pretrain.py][INFO] Epoch:[0/2](391900/4588595) loss:2.223 lr:0.0000100 epoch_Time:26600.0min: [2024-01-04 11:03:48,826][model8_pretrain.py][INFO] Epoch:[0/2](391900/4588595) loss:3.164 lr:0.0000100 epoch_Time:26600.0min: [2024-01-04 11:03:48,826][model8_pretrain.py][INFO] Epoch:[0/2](391900/4588595) loss:3.182 lr:0.0000100 epoch_Time:26600.0min: [2024-01-04 11:03:48,826][model8_pretrain.py][INFO] Epoch:[0/2](391900/4588595) loss:3.122 lr:0.0000100 epoch_Time:26600.0min: [2024-01-04 11:03:48,826][model8_pretrain.py][INFO] Epoch:[0/2](391900/4588595) loss:2.171 lr:0.0000100 
epoch_Time:26600.0min: [2024-01-04 11:03:48,826][model8_pretrain.py][INFO] Epoch:[0/2](391900/4588595) loss:3.233 lr:0.0000100 epoch_Time:26600.0min: [2024-01-04 11:03:48,827][model8_pretrain.py][INFO] Epoch:[0/2](391900/4588595) loss:3.330 lr:0.0000100 epoch_Time:26600.0min: [2024-01-04 11:04:32,678][model8_pretrain.py][INFO] Epoch:[0/2](392000/4588595) loss:2.492 lr:0.0000100 epoch_Time:26601.0min: [2024-01-04 11:04:32,678][model8_pretrain.py][INFO] Epoch:[0/2](392000/4588595) loss:2.821 lr:0.0000100 epoch_Time:26601.0min: [2024-01-04 11:04:32,678][model8_pretrain.py][INFO] Epoch:[0/2](392000/4588595) loss:2.940 lr:0.0000100 epoch_Time:26601.0min: [2024-01-04 11:04:32,678][model8_pretrain.py][INFO] Epoch:[0/2](392000/4588595) loss:3.188 lr:0.0000100 epoch_Time:26601.0min: [2024-01-04 11:04:32,678][model8_pretrain.py][INFO] Epoch:[0/2](392000/4588595) loss:2.515 lr:0.0000100 epoch_Time:26601.0min: [2024-01-04 11:04:32,678][model8_pretrain.py][INFO] Epoch:[0/2](392000/4588595) loss:2.758 lr:0.0000100 epoch_Time:26601.0min: [2024-01-04 11:04:32,678][model8_pretrain.py][INFO] Epoch:[0/2](392000/4588595) loss:3.096 lr:0.0000100 epoch_Time:26601.0min: [2024-01-04 11:04:32,678][model8_pretrain.py][INFO] Epoch:[0/2](392000/4588595) loss:2.688 lr:0.0000100 epoch_Time:26601.0min: [2024-01-04 11:05:09,619][model8_pretrain.py][INFO] Epoch:[0/2](392100/4588595) loss:2.884 lr:0.0000100 epoch_Time:26600.0min: [2024-01-04 11:05:09,619][model8_pretrain.py][INFO] Epoch:[0/2](392100/4588595) loss:2.642 lr:0.0000100 epoch_Time:26600.0min: [2024-01-04 11:05:09,619][model8_pretrain.py][INFO] Epoch:[0/2](392100/4588595) loss:3.609 lr:0.0000100 epoch_Time:26600.0min: [2024-01-04 11:05:09,619][model8_pretrain.py][INFO] Epoch:[0/2](392100/4588595) loss:3.178 lr:0.0000100 epoch_Time:26600.0min: [2024-01-04 11:05:09,619][model8_pretrain.py][INFO] Epoch:[0/2](392100/4588595) loss:2.712 lr:0.0000100 epoch_Time:26600.0min: [2024-01-04 11:05:09,619][model8_pretrain.py][INFO] 
Epoch:[0/2](392100/4588595) loss:3.012 lr:0.0000100 epoch_Time:26600.0min: [2024-01-04 11:05:09,619][model8_pretrain.py][INFO] Epoch:[0/2](392100/4588595) loss:2.762 lr:0.0000100 epoch_Time:26600.0min: [2024-01-04 11:05:09,619][model8_pretrain.py][INFO] Epoch:[0/2](392100/4588595) loss:3.033 lr:0.0000100 epoch_Time:26600.0min: [2024-01-04 11:05:46,573][model8_pretrain.py][INFO] Epoch:[0/2](392200/4588595) loss:2.864 lr:0.0000100 epoch_Time:26600.0min: [2024-01-04 11:05:46,573][model8_pretrain.py][INFO] Epoch:[0/2](392200/4588595) loss:2.916 lr:0.0000100 epoch_Time:26600.0min: [2024-01-04 11:05:46,573][model8_pretrain.py][INFO] Epoch:[0/2](392200/4588595) loss:2.561 lr:0.0000100 epoch_Time:26600.0min: [2024-01-04 11:05:46,573][model8_pretrain.py][INFO] Epoch:[0/2](392200/4588595) loss:2.988 lr:0.0000100 epoch_Time:26600.0min: [2024-01-04 11:05:46,573][model8_pretrain.py][INFO] Epoch:[0/2](392200/4588595) loss:2.903 lr:0.0000100 epoch_Time:26600.0min: [2024-01-04 11:05:46,573][model8_pretrain.py][INFO] Epoch:[0/2](392200/4588595) loss:3.274 lr:0.0000100 epoch_Time:26600.0min: [2024-01-04 11:05:46,573][model8_pretrain.py][INFO] Epoch:[0/2](392200/4588595) loss:2.815 lr:0.0000100 epoch_Time:26600.0min: [2024-01-04 11:05:46,573][model8_pretrain.py][INFO] Epoch:[0/2](392200/4588595) loss:2.152 lr:0.0000100 epoch_Time:26600.0min: [2024-01-04 11:06:23,528][model8_pretrain.py][INFO] Epoch:[0/2](392300/4588595) loss:3.120 lr:0.0000100 epoch_Time:26598.0min: [2024-01-04 11:06:23,528][model8_pretrain.py][INFO] Epoch:[0/2](392300/4588595) loss:2.483 lr:0.0000100 epoch_Time:26598.0min: [2024-01-04 11:06:23,528][model8_pretrain.py][INFO] Epoch:[0/2](392300/4588595) loss:2.372 lr:0.0000100 epoch_Time:26598.0min: [2024-01-04 11:06:23,528][model8_pretrain.py][INFO] Epoch:[0/2](392300/4588595) loss:2.299 lr:0.0000100 epoch_Time:26598.0min: [2024-01-04 11:06:23,528][model8_pretrain.py][INFO] Epoch:[0/2](392300/4588595) loss:2.961 lr:0.0000100 epoch_Time:26598.0min: [2024-01-04 
11:06:23,528][model8_pretrain.py][INFO] Epoch:[0/2](392300/4588595) loss:2.800 lr:0.0000100 epoch_Time:26598.0min: [2024-01-04 11:06:23,528][model8_pretrain.py][INFO] Epoch:[0/2](392300/4588595) loss:2.560 lr:0.0000100 epoch_Time:26598.0min: [2024-01-04 11:06:23,528][model8_pretrain.py][INFO] Epoch:[0/2](392300/4588595) loss:3.299 lr:0.0000100 epoch_Time:26598.0min: [2024-01-04 11:07:00,481][model8_pretrain.py][INFO] Epoch:[0/2](392400/4588595) loss:2.877 lr:0.0000100 epoch_Time:26597.0min: [2024-01-04 11:07:00,481][model8_pretrain.py][INFO] Epoch:[0/2](392400/4588595) loss:2.378 lr:0.0000100 epoch_Time:26597.0min: [2024-01-04 11:07:00,481][model8_pretrain.py][INFO] Epoch:[0/2](392400/4588595) loss:3.194 lr:0.0000100 epoch_Time:26597.0min: [2024-01-04 11:07:00,481][model8_pretrain.py][INFO] Epoch:[0/2](392400/4588595) loss:2.944 lr:0.0000100 epoch_Time:26597.0min: [2024-01-04 11:07:00,481][model8_pretrain.py][INFO] Epoch:[0/2](392400/4588595) loss:3.207 lr:0.0000100 epoch_Time:26597.0min: [2024-01-04 11:07:00,481][model8_pretrain.py][INFO] Epoch:[0/2](392400/4588595) loss:1.864 lr:0.0000100 epoch_Time:26597.0min: [2024-01-04 11:07:00,481][model8_pretrain.py][INFO] Epoch:[0/2](392400/4588595) loss:2.774 lr:0.0000100 epoch_Time:26597.0min: [2024-01-04 11:07:00,481][model8_pretrain.py][INFO] Epoch:[0/2](392400/4588595) loss:3.315 lr:0.0000100 epoch_Time:26597.0min: [2024-01-04 11:07:37,403][model8_pretrain.py][INFO] Epoch:[0/2](392500/4588595) loss:2.876 lr:0.0000100 epoch_Time:26597.0min: [2024-01-04 11:07:37,403][model8_pretrain.py][INFO] Epoch:[0/2](392500/4588595) loss:2.761 lr:0.0000100 epoch_Time:26597.0min: [2024-01-04 11:07:37,403][model8_pretrain.py][INFO] Epoch:[0/2](392500/4588595) loss:2.203 lr:0.0000100 epoch_Time:26597.0min: [2024-01-04 11:07:37,403][model8_pretrain.py][INFO] Epoch:[0/2](392500/4588595) loss:2.825 lr:0.0000100 epoch_Time:26597.0min: [2024-01-04 11:07:37,403][model8_pretrain.py][INFO] Epoch:[0/2](392500/4588595) loss:2.954 lr:0.0000100 
epoch_Time:26597.0min: [2024-01-04 11:07:37,403][model8_pretrain.py][INFO] Epoch:[0/2](392500/4588595) loss:1.933 lr:0.0000100 epoch_Time:26597.0min: [2024-01-04 11:07:37,403][model8_pretrain.py][INFO] Epoch:[0/2](392500/4588595) loss:3.383 lr:0.0000100 epoch_Time:26597.0min: [2024-01-04 11:07:37,403][model8_pretrain.py][INFO] Epoch:[0/2](392500/4588595) loss:2.932 lr:0.0000100 epoch_Time:26597.0min: [2024-01-04 11:08:14,361][model8_pretrain.py][INFO] Epoch:[0/2](392600/4588595) loss:2.624 lr:0.0000100 epoch_Time:26596.0min: [2024-01-04 11:08:14,361][model8_pretrain.py][INFO] Epoch:[0/2](392600/4588595) loss:2.966 lr:0.0000100 epoch_Time:26596.0min: [2024-01-04 11:08:14,361][model8_pretrain.py][INFO] Epoch:[0/2](392600/4588595) loss:3.008 lr:0.0000100 epoch_Time:26596.0min: [2024-01-04 11:08:14,361][model8_pretrain.py][INFO] Epoch:[0/2](392600/4588595) loss:2.753 lr:0.0000100 epoch_Time:26596.0min: [2024-01-04 11:08:14,361][model8_pretrain.py][INFO] Epoch:[0/2](392600/4588595) loss:3.006 lr:0.0000100 epoch_Time:26596.0min: [2024-01-04 11:08:14,361][model8_pretrain.py][INFO] Epoch:[0/2](392600/4588595) loss:2.730 lr:0.0000100 epoch_Time:26596.0min: [2024-01-04 11:08:14,361][model8_pretrain.py][INFO] Epoch:[0/2](392600/4588595) loss:2.516 lr:0.0000100 epoch_Time:26596.0min: [2024-01-04 11:08:14,361][model8_pretrain.py][INFO] Epoch:[0/2](392600/4588595) loss:2.853 lr:0.0000100 epoch_Time:26596.0min: [2024-01-04 11:08:53,082][model8_pretrain.py][INFO] Epoch:[0/2](392700/4588595) loss:3.057 lr:0.0000100 epoch_Time:26595.0min: [2024-01-04 11:08:53,082][model8_pretrain.py][INFO] Epoch:[0/2](392700/4588595) loss:3.455 lr:0.0000100 epoch_Time:26595.0min: [2024-01-04 11:08:53,082][model8_pretrain.py][INFO] Epoch:[0/2](392700/4588595) loss:2.412 lr:0.0000100 epoch_Time:26595.0min: [2024-01-04 11:08:53,082][model8_pretrain.py][INFO] Epoch:[0/2](392700/4588595) loss:2.612 lr:0.0000100 epoch_Time:26595.0min: [2024-01-04 11:08:53,082][model8_pretrain.py][INFO] 
Epoch:[0/2](392700/4588595) loss:2.777 lr:0.0000100 epoch_Time:26595.0min: [2024-01-04 11:08:53,082][model8_pretrain.py][INFO] Epoch:[0/2](392700/4588595) loss:2.658 lr:0.0000100 epoch_Time:26595.0min: [2024-01-04 11:08:53,082][model8_pretrain.py][INFO] Epoch:[0/2](392700/4588595) loss:2.732 lr:0.0000100 epoch_Time:26595.0min: [2024-01-04 11:08:53,082][model8_pretrain.py][INFO] Epoch:[0/2](392700/4588595) loss:3.168 lr:0.0000100 epoch_Time:26595.0min: [2024-01-04 11:09:37,053][model8_pretrain.py][INFO] Epoch:[0/2](392800/4588595) loss:3.358 lr:0.0000100 epoch_Time:26596.0min: [2024-01-04 11:09:37,053][model8_pretrain.py][INFO] Epoch:[0/2](392800/4588595) loss:3.184 lr:0.0000100 epoch_Time:26596.0min: [2024-01-04 11:09:37,053][model8_pretrain.py][INFO] Epoch:[0/2](392800/4588595) loss:2.967 lr:0.0000100 epoch_Time:26596.0min: [2024-01-04 11:09:37,053][model8_pretrain.py][INFO] Epoch:[0/2](392800/4588595) loss:2.795 lr:0.0000100 epoch_Time:26596.0min: [2024-01-04 11:09:37,053][model8_pretrain.py][INFO] Epoch:[0/2](392800/4588595) loss:2.955 lr:0.0000100 epoch_Time:26596.0min: [2024-01-04 11:09:37,053][model8_pretrain.py][INFO] Epoch:[0/2](392800/4588595) loss:2.256 lr:0.0000100 epoch_Time:26596.0min: [2024-01-04 11:09:37,053][model8_pretrain.py][INFO] Epoch:[0/2](392800/4588595) loss:2.609 lr:0.0000100 epoch_Time:26596.0min: [2024-01-04 11:09:37,053][model8_pretrain.py][INFO] Epoch:[0/2](392800/4588595) loss:3.145 lr:0.0000100 epoch_Time:26596.0min: [2024-01-04 11:10:13,982][model8_pretrain.py][INFO] Epoch:[0/2](392900/4588595) loss:2.662 lr:0.0000100 epoch_Time:26595.0min: [2024-01-04 11:10:13,982][model8_pretrain.py][INFO] Epoch:[0/2](392900/4588595) loss:3.038 lr:0.0000100 epoch_Time:26595.0min: [2024-01-04 11:10:13,982][model8_pretrain.py][INFO] Epoch:[0/2](392900/4588595) loss:2.949 lr:0.0000100 epoch_Time:26595.0min: [2024-01-04 11:10:13,982][model8_pretrain.py][INFO] Epoch:[0/2](392900/4588595) loss:3.122 lr:0.0000100 epoch_Time:26595.0min: [2024-01-04 
11:10:13,982][model8_pretrain.py][INFO] Epoch:[0/2](392900/4588595) loss:2.804 lr:0.0000100 epoch_Time:26595.0min: [2024-01-04 11:10:13,982][model8_pretrain.py][INFO] Epoch:[0/2](392900/4588595) loss:3.031 lr:0.0000100 epoch_Time:26595.0min: [2024-01-04 11:10:13,982][model8_pretrain.py][INFO] Epoch:[0/2](392900/4588595) loss:2.979 lr:0.0000100 epoch_Time:26595.0min: [2024-01-04 11:10:13,982][model8_pretrain.py][INFO] Epoch:[0/2](392900/4588595) loss:2.137 lr:0.0000100 epoch_Time:26595.0min: [2024-01-04 11:10:50,918][model8_pretrain.py][INFO] Epoch:[0/2](393000/4588595) loss:2.754 lr:0.0000100 epoch_Time:26594.0min: [2024-01-04 11:10:50,918][model8_pretrain.py][INFO] Epoch:[0/2](393000/4588595) loss:3.100 lr:0.0000100 epoch_Time:26594.0min: [2024-01-04 11:10:50,918][model8_pretrain.py][INFO] Epoch:[0/2](393000/4588595) loss:3.051 lr:0.0000100 epoch_Time:26594.0min: [2024-01-04 11:10:50,918][model8_pretrain.py][INFO] Epoch:[0/2](393000/4588595) loss:2.987 lr:0.0000100 epoch_Time:26594.0min: [2024-01-04 11:10:50,918][model8_pretrain.py][INFO] Epoch:[0/2](393000/4588595) loss:2.754 lr:0.0000100 epoch_Time:26594.0min: [2024-01-04 11:10:50,918][model8_pretrain.py][INFO] Epoch:[0/2](393000/4588595) loss:2.588 lr:0.0000100 epoch_Time:26594.0min: [2024-01-04 11:10:50,918][model8_pretrain.py][INFO] Epoch:[0/2](393000/4588595) loss:3.262 lr:0.0000100 epoch_Time:26594.0min: [2024-01-04 11:10:50,918][model8_pretrain.py][INFO] Epoch:[0/2](393000/4588595) loss:2.986 lr:0.0000100 epoch_Time:26594.0min: [2024-01-04 11:11:27,843][model8_pretrain.py][INFO] Epoch:[0/2](393100/4588595) loss:2.859 lr:0.0000100 epoch_Time:26593.0min: [2024-01-04 11:11:27,843][model8_pretrain.py][INFO] Epoch:[0/2](393100/4588595) loss:3.104 lr:0.0000100 epoch_Time:26593.0min: [2024-01-04 11:11:27,843][model8_pretrain.py][INFO] Epoch:[0/2](393100/4588595) loss:2.336 lr:0.0000100 epoch_Time:26593.0min: [2024-01-04 11:11:27,843][model8_pretrain.py][INFO] Epoch:[0/2](393100/4588595) loss:2.924 lr:0.0000100 
epoch_Time:26593.0min: [2024-01-04 11:11:27,843][model8_pretrain.py][INFO] Epoch:[0/2](393100/4588595) loss:2.539 lr:0.0000100 epoch_Time:26593.0min: [2024-01-04 11:11:27,843][model8_pretrain.py][INFO] Epoch:[0/2](393100/4588595) loss:3.147 lr:0.0000100 epoch_Time:26593.0min: [2024-01-04 11:11:27,843][model8_pretrain.py][INFO] Epoch:[0/2](393100/4588595) loss:3.040 lr:0.0000100 epoch_Time:26593.0min: [2024-01-04 11:11:27,843][model8_pretrain.py][INFO] Epoch:[0/2](393100/4588595) loss:2.746 lr:0.0000100 epoch_Time:26593.0min: [2024-01-04 11:12:04,779][model8_pretrain.py][INFO] Epoch:[0/2](393200/4588595) loss:2.983 lr:0.0000100 epoch_Time:26592.0min: [2024-01-04 11:12:04,779][model8_pretrain.py][INFO] Epoch:[0/2](393200/4588595) loss:2.795 lr:0.0000100 epoch_Time:26592.0min: [2024-01-04 11:12:04,779][model8_pretrain.py][INFO] Epoch:[0/2](393200/4588595) loss:2.774 lr:0.0000100 epoch_Time:26592.0min: [2024-01-04 11:12:04,779][model8_pretrain.py][INFO] Epoch:[0/2](393200/4588595) loss:3.155 lr:0.0000100 epoch_Time:26592.0min: [2024-01-04 11:12:04,779][model8_pretrain.py][INFO] Epoch:[0/2](393200/4588595) loss:2.960 lr:0.0000100 epoch_Time:26592.0min: [2024-01-04 11:12:04,779][model8_pretrain.py][INFO] Epoch:[0/2](393200/4588595) loss:2.356 lr:0.0000100 epoch_Time:26592.0min: [2024-01-04 11:12:04,779][model8_pretrain.py][INFO] Epoch:[0/2](393200/4588595) loss:3.228 lr:0.0000100 epoch_Time:26592.0min: [2024-01-04 11:12:04,780][model8_pretrain.py][INFO] Epoch:[0/2](393200/4588595) loss:2.706 lr:0.0000100 epoch_Time:26592.0min: [2024-01-04 11:12:41,712][model8_pretrain.py][INFO] Epoch:[0/2](393300/4588595) loss:3.023 lr:0.0000100 epoch_Time:26592.0min: [2024-01-04 11:12:41,712][model8_pretrain.py][INFO] Epoch:[0/2](393300/4588595) loss:3.118 lr:0.0000100 epoch_Time:26592.0min: [2024-01-04 11:12:41,712][model8_pretrain.py][INFO] Epoch:[0/2](393300/4588595) loss:2.751 lr:0.0000100 epoch_Time:26592.0min: [2024-01-04 11:12:41,712][model8_pretrain.py][INFO] 
Epoch:[0/2](393300/4588595) loss:2.940 lr:0.0000100 epoch_Time:26592.0min: [2024-01-04 11:12:41,712][model8_pretrain.py][INFO] Epoch:[0/2](393300/4588595) loss:3.087 lr:0.0000100 epoch_Time:26592.0min: [2024-01-04 11:12:41,712][model8_pretrain.py][INFO] Epoch:[0/2](393300/4588595) loss:3.513 lr:0.0000100 epoch_Time:26592.0min: [2024-01-04 11:12:41,712][model8_pretrain.py][INFO] Epoch:[0/2](393300/4588595) loss:3.268 lr:0.0000100 epoch_Time:26592.0min: [2024-01-04 11:12:41,712][model8_pretrain.py][INFO] Epoch:[0/2](393300/4588595) loss:2.945 lr:0.0000100 epoch_Time:26592.0min: [2024-01-04 11:13:18,646][model8_pretrain.py][INFO] Epoch:[0/2](393400/4588595) loss:2.504 lr:0.0000100 epoch_Time:26591.0min: [2024-01-04 11:13:18,646][model8_pretrain.py][INFO] Epoch:[0/2](393400/4588595) loss:2.438 lr:0.0000100 epoch_Time:26591.0min: [2024-01-04 11:13:18,646][model8_pretrain.py][INFO] Epoch:[0/2](393400/4588595) loss:2.638 lr:0.0000100 epoch_Time:26591.0min: [2024-01-04 11:13:18,646][model8_pretrain.py][INFO] Epoch:[0/2](393400/4588595) loss:3.082 lr:0.0000100 epoch_Time:26591.0min: [2024-01-04 11:13:18,646][model8_pretrain.py][INFO] Epoch:[0/2](393400/4588595) loss:2.807 lr:0.0000100 epoch_Time:26591.0min: [2024-01-04 11:13:18,646][model8_pretrain.py][INFO] Epoch:[0/2](393400/4588595) loss:2.732 lr:0.0000100 epoch_Time:26591.0min: [2024-01-04 11:13:18,646][model8_pretrain.py][INFO] Epoch:[0/2](393400/4588595) loss:2.997 lr:0.0000100 epoch_Time:26591.0min: [2024-01-04 11:13:18,646][model8_pretrain.py][INFO] Epoch:[0/2](393400/4588595) loss:2.393 lr:0.0000100 epoch_Time:26591.0min: [2024-01-04 11:13:57,303][model8_pretrain.py][INFO] Epoch:[0/2](393500/4588595) loss:2.455 lr:0.0000100 epoch_Time:26590.0min: [2024-01-04 11:13:57,303][model8_pretrain.py][INFO] Epoch:[0/2](393500/4588595) loss:3.585 lr:0.0000100 epoch_Time:26590.0min: [2024-01-04 11:13:57,303][model8_pretrain.py][INFO] Epoch:[0/2](393500/4588595) loss:3.018 lr:0.0000100 epoch_Time:26590.0min: [2024-01-04 
11:13:57,303][model8_pretrain.py][INFO] Epoch:[0/2](393500/4588595) loss:2.824 lr:0.0000100 epoch_Time:26590.0min: [2024-01-04 11:13:57,303][model8_pretrain.py][INFO] Epoch:[0/2](393500/4588595) loss:3.005 lr:0.0000100 epoch_Time:26590.0min: [2024-01-04 11:13:57,307][model8_pretrain.py][INFO] Epoch:[0/2](393500/4588595) loss:2.359 lr:0.0000100 epoch_Time:26590.0min: [2024-01-04 11:13:57,308][model8_pretrain.py][INFO] Epoch:[0/2](393500/4588595) loss:2.817 lr:0.0000100 epoch_Time:26590.0min: [2024-01-04 11:13:57,308][model8_pretrain.py][INFO] Epoch:[0/2](393500/4588595) loss:3.340 lr:0.0000100 epoch_Time:26590.0min: [2024-01-04 11:14:41,192][model8_pretrain.py][INFO] Epoch:[0/2](393600/4588595) loss:3.366 lr:0.0000100 epoch_Time:26591.0min: [2024-01-04 11:14:41,192][model8_pretrain.py][INFO] Epoch:[0/2](393600/4588595) loss:2.569 lr:0.0000100 epoch_Time:26591.0min: [2024-01-04 11:14:41,192][model8_pretrain.py][INFO] Epoch:[0/2](393600/4588595) loss:2.721 lr:0.0000100 epoch_Time:26591.0min: [2024-01-04 11:14:41,192][model8_pretrain.py][INFO] Epoch:[0/2](393600/4588595) loss:2.829 lr:0.0000100 epoch_Time:26591.0min: [2024-01-04 11:14:41,192][model8_pretrain.py][INFO] Epoch:[0/2](393600/4588595) loss:3.372 lr:0.0000100 epoch_Time:26591.0min: [2024-01-04 11:14:41,192][model8_pretrain.py][INFO] Epoch:[0/2](393600/4588595) loss:3.060 lr:0.0000100 epoch_Time:26591.0min: [2024-01-04 11:14:41,192][model8_pretrain.py][INFO] Epoch:[0/2](393600/4588595) loss:2.806 lr:0.0000100 epoch_Time:26591.0min: [2024-01-04 11:14:41,192][model8_pretrain.py][INFO] Epoch:[0/2](393600/4588595) loss:3.436 lr:0.0000100 epoch_Time:26591.0min: [2024-01-04 11:15:18,130][model8_pretrain.py][INFO] Epoch:[0/2](393700/4588595) loss:2.864 lr:0.0000100 epoch_Time:26590.0min: [2024-01-04 11:15:18,130][model8_pretrain.py][INFO] Epoch:[0/2](393700/4588595) loss:3.347 lr:0.0000100 epoch_Time:26590.0min: [2024-01-04 11:15:18,130][model8_pretrain.py][INFO] Epoch:[0/2](393700/4588595) loss:2.558 lr:0.0000100 
epoch_Time:26590.0min: [2024-01-04 11:15:18,130][model8_pretrain.py][INFO] Epoch:[0/2](393700/4588595) loss:3.425 lr:0.0000100 epoch_Time:26590.0min: [2024-01-04 11:15:18,130][model8_pretrain.py][INFO] Epoch:[0/2](393700/4588595) loss:2.857 lr:0.0000100 epoch_Time:26590.0min: [2024-01-04 11:15:18,130][model8_pretrain.py][INFO] Epoch:[0/2](393700/4588595) loss:3.097 lr:0.0000100 epoch_Time:26590.0min: [2024-01-04 11:15:18,131][model8_pretrain.py][INFO] Epoch:[0/2](393700/4588595) loss:2.814 lr:0.0000100 epoch_Time:26590.0min: [2024-01-04 11:15:18,131][model8_pretrain.py][INFO] Epoch:[0/2](393700/4588595) loss:2.707 lr:0.0000100 epoch_Time:26590.0min: [2024-01-04 11:15:55,067][model8_pretrain.py][INFO] Epoch:[0/2](393800/4588595) loss:2.776 lr:0.0000100 epoch_Time:26589.0min: [2024-01-04 11:15:55,067][model8_pretrain.py][INFO] Epoch:[0/2](393800/4588595) loss:3.289 lr:0.0000100 epoch_Time:26589.0min: [2024-01-04 11:15:55,067][model8_pretrain.py][INFO] Epoch:[0/2](393800/4588595) loss:2.940 lr:0.0000100 epoch_Time:26589.0min: [2024-01-04 11:15:55,068][model8_pretrain.py][INFO] Epoch:[0/2](393800/4588595) loss:3.011 lr:0.0000100 epoch_Time:26589.0min: [2024-01-04 11:15:55,068][model8_pretrain.py][INFO] Epoch:[0/2](393800/4588595) loss:2.647 lr:0.0000100 epoch_Time:26589.0min: [2024-01-04 11:15:55,068][model8_pretrain.py][INFO] Epoch:[0/2](393800/4588595) loss:2.934 lr:0.0000100 epoch_Time:26589.0min: [2024-01-04 11:15:55,068][model8_pretrain.py][INFO] Epoch:[0/2](393800/4588595) loss:2.867 lr:0.0000100 epoch_Time:26589.0min: [2024-01-04 11:15:55,068][model8_pretrain.py][INFO] Epoch:[0/2](393800/4588595) loss:2.638 lr:0.0000100 epoch_Time:26589.0min: [2024-01-04 11:16:32,004][model8_pretrain.py][INFO] Epoch:[0/2](393900/4588595) loss:3.206 lr:0.0000100 epoch_Time:26588.0min: [2024-01-04 11:16:32,004][model8_pretrain.py][INFO] Epoch:[0/2](393900/4588595) loss:2.958 lr:0.0000100 epoch_Time:26588.0min: [2024-01-04 11:16:32,004][model8_pretrain.py][INFO] 
Epoch:[0/2](393900/4588595) loss:2.812 lr:0.0000100 epoch_Time:26588.0min: [2024-01-04 11:16:32,004][model8_pretrain.py][INFO] Epoch:[0/2](393900/4588595) loss:3.338 lr:0.0000100 epoch_Time:26588.0min: [2024-01-04 11:16:32,004][model8_pretrain.py][INFO] Epoch:[0/2](393900/4588595) loss:2.364 lr:0.0000100 epoch_Time:26588.0min: [2024-01-04 11:16:32,004][model8_pretrain.py][INFO] Epoch:[0/2](393900/4588595) loss:3.030 lr:0.0000100 epoch_Time:26588.0min: [2024-01-04 11:16:32,004][model8_pretrain.py][INFO] Epoch:[0/2](393900/4588595) loss:3.103 lr:0.0000100 epoch_Time:26588.0min: [2024-01-04 11:16:32,005][model8_pretrain.py][INFO] Epoch:[0/2](393900/4588595) loss:2.633 lr:0.0000100 epoch_Time:26588.0min: [2024-01-04 11:17:08,937][model8_pretrain.py][INFO] Epoch:[0/2](394000/4588595) loss:2.705 lr:0.0000100 epoch_Time:26587.0min: [2024-01-04 11:17:08,937][model8_pretrain.py][INFO] Epoch:[0/2](394000/4588595) loss:2.776 lr:0.0000100 epoch_Time:26587.0min: [2024-01-04 11:17:08,937][model8_pretrain.py][INFO] Epoch:[0/2](394000/4588595) loss:2.383 lr:0.0000100 epoch_Time:26587.0min: [2024-01-04 11:17:08,937][model8_pretrain.py][INFO] Epoch:[0/2](394000/4588595) loss:3.158 lr:0.0000100 epoch_Time:26587.0min: [2024-01-04 11:17:08,937][model8_pretrain.py][INFO] Epoch:[0/2](394000/4588595) loss:2.500 lr:0.0000100 epoch_Time:26587.0min: [2024-01-04 11:17:08,938][model8_pretrain.py][INFO] Epoch:[0/2](394000/4588595) loss:2.978 lr:0.0000100 epoch_Time:26587.0min: [2024-01-04 11:17:08,938][model8_pretrain.py][INFO] Epoch:[0/2](394000/4588595) loss:2.949 lr:0.0000100 epoch_Time:26587.0min: [2024-01-04 11:17:08,938][model8_pretrain.py][INFO] Epoch:[0/2](394000/4588595) loss:3.066 lr:0.0000100 epoch_Time:26587.0min: [2024-01-04 11:17:45,871][model8_pretrain.py][INFO] Epoch:[0/2](394100/4588595) loss:2.884 lr:0.0000100 epoch_Time:26587.0min: [2024-01-04 11:17:45,871][model8_pretrain.py][INFO] Epoch:[0/2](394100/4588595) loss:3.201 lr:0.0000100 epoch_Time:26587.0min: [2024-01-04 
11:17:45,871][model8_pretrain.py][INFO] Epoch:[0/2](394100/4588595) loss:2.023 lr:0.0000100 epoch_Time:26587.0min: [2024-01-04 11:17:45,871][model8_pretrain.py][INFO] Epoch:[0/2](394100/4588595) loss:2.793 lr:0.0000100 epoch_Time:26587.0min: [2024-01-04 11:17:45,871][model8_pretrain.py][INFO] Epoch:[0/2](394100/4588595) loss:2.797 lr:0.0000100 epoch_Time:26587.0min: [2024-01-04 11:17:45,871][model8_pretrain.py][INFO] Epoch:[0/2](394100/4588595) loss:2.691 lr:0.0000100 epoch_Time:26587.0min: [2024-01-04 11:17:45,871][model8_pretrain.py][INFO] Epoch:[0/2](394100/4588595) loss:2.763 lr:0.0000100 epoch_Time:26587.0min: [2024-01-04 11:17:45,871][model8_pretrain.py][INFO] Epoch:[0/2](394100/4588595) loss:3.072 lr:0.0000100 epoch_Time:26587.0min: [2024-01-04 11:18:22,809][model8_pretrain.py][INFO] Epoch:[0/2](394200/4588595) loss:3.102 lr:0.0000100 epoch_Time:26586.0min: [2024-01-04 11:18:22,809][model8_pretrain.py][INFO] Epoch:[0/2](394200/4588595) loss:2.596 lr:0.0000100 epoch_Time:26586.0min: [2024-01-04 11:18:22,809][model8_pretrain.py][INFO] Epoch:[0/2](394200/4588595) loss:3.035 lr:0.0000100 epoch_Time:26586.0min: [2024-01-04 11:18:22,809][model8_pretrain.py][INFO] Epoch:[0/2](394200/4588595) loss:3.227 lr:0.0000100 epoch_Time:26586.0min: [2024-01-04 11:18:22,809][model8_pretrain.py][INFO] Epoch:[0/2](394200/4588595) loss:2.390 lr:0.0000100 epoch_Time:26586.0min: [2024-01-04 11:18:22,809][model8_pretrain.py][INFO] Epoch:[0/2](394200/4588595) loss:2.947 lr:0.0000100 epoch_Time:26586.0min: [2024-01-04 11:18:22,809][model8_pretrain.py][INFO] Epoch:[0/2](394200/4588595) loss:3.271 lr:0.0000100 epoch_Time:26586.0min: [2024-01-04 11:18:22,809][model8_pretrain.py][INFO] Epoch:[0/2](394200/4588595) loss:3.193 lr:0.0000100 epoch_Time:26586.0min: [2024-01-04 11:18:59,711][model8_pretrain.py][INFO] Epoch:[0/2](394300/4588595) loss:2.573 lr:0.0000100 epoch_Time:26584.0min: [2024-01-04 11:18:59,711][model8_pretrain.py][INFO] Epoch:[0/2](394300/4588595) loss:2.889 lr:0.0000100 
epoch_Time:26584.0min: [2024-01-04 11:18:59,711][model8_pretrain.py][INFO] Epoch:[0/2](394300/4588595) loss:2.959 lr:0.0000100 epoch_Time:26584.0min: [2024-01-04 11:18:59,711][model8_pretrain.py][INFO] Epoch:[0/2](394300/4588595) loss:2.680 lr:0.0000100 epoch_Time:26584.0min: [2024-01-04 11:18:59,711][model8_pretrain.py][INFO] Epoch:[0/2](394300/4588595) loss:2.780 lr:0.0000100 epoch_Time:26584.0min: [2024-01-04 11:18:59,711][model8_pretrain.py][INFO] Epoch:[0/2](394300/4588595) loss:3.013 lr:0.0000100 epoch_Time:26584.0min: [2024-01-04 11:18:59,711][model8_pretrain.py][INFO] Epoch:[0/2](394300/4588595) loss:2.327 lr:0.0000100 epoch_Time:26584.0min: [2024-01-04 11:18:59,711][model8_pretrain.py][INFO] Epoch:[0/2](394300/4588595) loss:3.173 lr:0.0000100 epoch_Time:26584.0min: [2024-01-04 11:19:45,321][model8_pretrain.py][INFO] Epoch:[0/2](394400/4588595) loss:3.054 lr:0.0000100 epoch_Time:26586.0min: [2024-01-04 11:19:45,321][model8_pretrain.py][INFO] Epoch:[0/2](394400/4588595) loss:2.829 lr:0.0000100 epoch_Time:26586.0min: [2024-01-04 11:19:45,321][model8_pretrain.py][INFO] Epoch:[0/2](394400/4588595) loss:3.339 lr:0.0000100 epoch_Time:26586.0min: [2024-01-04 11:19:45,321][model8_pretrain.py][INFO] Epoch:[0/2](394400/4588595) loss:3.108 lr:0.0000100 epoch_Time:26586.0min: [2024-01-04 11:19:45,321][model8_pretrain.py][INFO] Epoch:[0/2](394400/4588595) loss:2.403 lr:0.0000100 epoch_Time:26586.0min: [2024-01-04 11:19:45,321][model8_pretrain.py][INFO] Epoch:[0/2](394400/4588595) loss:2.414 lr:0.0000100 epoch_Time:26586.0min: [2024-01-04 11:19:45,321][model8_pretrain.py][INFO] Epoch:[0/2](394400/4588595) loss:2.300 lr:0.0000100 epoch_Time:26586.0min: [2024-01-04 11:19:45,322][model8_pretrain.py][INFO] Epoch:[0/2](394400/4588595) loss:2.287 lr:0.0000100 epoch_Time:26586.0min: [2024-01-04 11:20:22,262][model8_pretrain.py][INFO] Epoch:[0/2](394500/4588595) loss:3.013 lr:0.0000100 epoch_Time:26585.0min: [2024-01-04 11:20:22,262][model8_pretrain.py][INFO] 
Epoch:[0/2](394500/4588595) loss:2.833 lr:0.0000100 epoch_Time:26585.0min: [2024-01-04 11:20:22,262][model8_pretrain.py][INFO] Epoch:[0/2](394500/4588595) loss:3.109 lr:0.0000100 epoch_Time:26585.0min: [2024-01-04 11:20:22,262][model8_pretrain.py][INFO] Epoch:[0/2](394500/4588595) loss:2.580 lr:0.0000100 epoch_Time:26585.0min: [2024-01-04 11:20:22,262][model8_pretrain.py][INFO] Epoch:[0/2](394500/4588595) loss:2.556 lr:0.0000100 epoch_Time:26585.0min: [2024-01-04 11:20:22,262][model8_pretrain.py][INFO] Epoch:[0/2](394500/4588595) loss:2.680 lr:0.0000100 epoch_Time:26585.0min: [2024-01-04 11:20:22,263][model8_pretrain.py][INFO] Epoch:[0/2](394500/4588595) loss:3.300 lr:0.0000100 epoch_Time:26585.0min: [2024-01-04 11:20:22,263][model8_pretrain.py][INFO] Epoch:[0/2](394500/4588595) loss:2.883 lr:0.0000100 epoch_Time:26585.0min: [2024-01-04 11:20:59,215][model8_pretrain.py][INFO] Epoch:[0/2](394600/4588595) loss:2.722 lr:0.0000100 epoch_Time:26583.0min: [2024-01-04 11:20:59,215][model8_pretrain.py][INFO] Epoch:[0/2](394600/4588595) loss:2.991 lr:0.0000100 epoch_Time:26583.0min: [2024-01-04 11:20:59,215][model8_pretrain.py][INFO] Epoch:[0/2](394600/4588595) loss:2.609 lr:0.0000100 epoch_Time:26583.0min: [2024-01-04 11:20:59,215][model8_pretrain.py][INFO] Epoch:[0/2](394600/4588595) loss:3.013 lr:0.0000100 epoch_Time:26583.0min: [2024-01-04 11:20:59,215][model8_pretrain.py][INFO] Epoch:[0/2](394600/4588595) loss:2.777 lr:0.0000100 epoch_Time:26583.0min: [2024-01-04 11:20:59,215][model8_pretrain.py][INFO] Epoch:[0/2](394600/4588595) loss:2.951 lr:0.0000100 epoch_Time:26583.0min: [2024-01-04 11:20:59,215][model8_pretrain.py][INFO] Epoch:[0/2](394600/4588595) loss:2.699 lr:0.0000100 epoch_Time:26583.0min: [2024-01-04 11:20:59,215][model8_pretrain.py][INFO] Epoch:[0/2](394600/4588595) loss:3.208 lr:0.0000100 epoch_Time:26583.0min: [2024-01-04 11:21:36,155][model8_pretrain.py][INFO] Epoch:[0/2](394700/4588595) loss:2.623 lr:0.0000100 epoch_Time:26583.0min: [2024-01-04 
11:21:36,155][model8_pretrain.py][INFO] Epoch:[0/2](394700/4588595) loss:2.533 lr:0.0000100 epoch_Time:26583.0min: [2024-01-04 11:21:36,155][model8_pretrain.py][INFO] Epoch:[0/2](394700/4588595) loss:3.344 lr:0.0000100 epoch_Time:26583.0min: [2024-01-04 11:21:36,155][model8_pretrain.py][INFO] Epoch:[0/2](394700/4588595) loss:3.075 lr:0.0000100 epoch_Time:26583.0min: [2024-01-04 11:21:36,155][model8_pretrain.py][INFO] Epoch:[0/2](394700/4588595) loss:3.055 lr:0.0000100 epoch_Time:26583.0min: [2024-01-04 11:21:36,155][model8_pretrain.py][INFO] Epoch:[0/2](394700/4588595) loss:3.039 lr:0.0000100 epoch_Time:26583.0min: [2024-01-04 11:21:36,155][model8_pretrain.py][INFO] Epoch:[0/2](394700/4588595) loss:2.985 lr:0.0000100 epoch_Time:26583.0min: [2024-01-04 11:21:36,155][model8_pretrain.py][INFO] Epoch:[0/2](394700/4588595) loss:2.961 lr:0.0000100 epoch_Time:26583.0min: [2024-01-04 11:22:13,104][model8_pretrain.py][INFO] Epoch:[0/2](394800/4588595) loss:3.051 lr:0.0000100 epoch_Time:26582.0min: [2024-01-04 11:22:13,104][model8_pretrain.py][INFO] Epoch:[0/2](394800/4588595) loss:3.319 lr:0.0000100 epoch_Time:26582.0min: [2024-01-04 11:22:13,105][model8_pretrain.py][INFO] Epoch:[0/2](394800/4588595) loss:2.765 lr:0.0000100 epoch_Time:26582.0min: [2024-01-04 11:22:13,105][model8_pretrain.py][INFO] Epoch:[0/2](394800/4588595) loss:2.743 lr:0.0000100 epoch_Time:26582.0min: [2024-01-04 11:22:13,105][model8_pretrain.py][INFO] Epoch:[0/2](394800/4588595) loss:3.124 lr:0.0000100 epoch_Time:26582.0min: [2024-01-04 11:22:13,105][model8_pretrain.py][INFO] Epoch:[0/2](394800/4588595) loss:2.277 lr:0.0000100 epoch_Time:26582.0min: [2024-01-04 11:22:13,105][model8_pretrain.py][INFO] Epoch:[0/2](394800/4588595) loss:2.954 lr:0.0000100 epoch_Time:26582.0min: [2024-01-04 11:22:13,105][model8_pretrain.py][INFO] Epoch:[0/2](394800/4588595) loss:2.799 lr:0.0000100 epoch_Time:26582.0min: [2024-01-04 11:22:50,050][model8_pretrain.py][INFO] Epoch:[0/2](394900/4588595) loss:3.394 lr:0.0000100 
epoch_Time:26581.0min: [2024-01-04 11:22:50,050][model8_pretrain.py][INFO] Epoch:[0/2](394900/4588595) loss:2.951 lr:0.0000100 epoch_Time:26581.0min: [2024-01-04 11:22:50,050][model8_pretrain.py][INFO] Epoch:[0/2](394900/4588595) loss:2.290 lr:0.0000100 epoch_Time:26581.0min: [2024-01-04 11:22:50,050][model8_pretrain.py][INFO] Epoch:[0/2](394900/4588595) loss:3.331 lr:0.0000100 epoch_Time:26581.0min: [2024-01-04 11:22:50,050][model8_pretrain.py][INFO] Epoch:[0/2](394900/4588595) loss:3.201 lr:0.0000100 epoch_Time:26581.0min: [2024-01-04 11:22:50,050][model8_pretrain.py][INFO] Epoch:[0/2](394900/4588595) loss:2.765 lr:0.0000100 epoch_Time:26581.0min: [2024-01-04 11:22:50,050][model8_pretrain.py][INFO] Epoch:[0/2](394900/4588595) loss:2.932 lr:0.0000100 epoch_Time:26581.0min: [2024-01-04 11:22:50,050][model8_pretrain.py][INFO] Epoch:[0/2](394900/4588595) loss:2.740 lr:0.0000100 epoch_Time:26581.0min: [2024-01-04 11:23:26,993][model8_pretrain.py][INFO] Epoch:[0/2](395000/4588595) loss:2.590 lr:0.0000100 epoch_Time:26581.0min: [2024-01-04 11:23:26,993][model8_pretrain.py][INFO] Epoch:[0/2](395000/4588595) loss:2.766 lr:0.0000100 epoch_Time:26581.0min: [2024-01-04 11:23:26,993][model8_pretrain.py][INFO] Epoch:[0/2](395000/4588595) loss:2.874 lr:0.0000100 epoch_Time:26581.0min: [2024-01-04 11:23:26,993][model8_pretrain.py][INFO] Epoch:[0/2](395000/4588595) loss:2.822 lr:0.0000100 epoch_Time:26581.0min: [2024-01-04 11:23:26,993][model8_pretrain.py][INFO] Epoch:[0/2](395000/4588595) loss:3.009 lr:0.0000100 epoch_Time:26581.0min: [2024-01-04 11:23:26,993][model8_pretrain.py][INFO] Epoch:[0/2](395000/4588595) loss:3.383 lr:0.0000100 epoch_Time:26581.0min: [2024-01-04 11:23:26,993][model8_pretrain.py][INFO] Epoch:[0/2](395000/4588595) loss:2.546 lr:0.0000100 epoch_Time:26581.0min: [2024-01-04 11:23:26,993][model8_pretrain.py][INFO] Epoch:[0/2](395000/4588595) loss:3.100 lr:0.0000100 epoch_Time:26581.0min: [2024-01-04 11:24:03,932][model8_pretrain.py][INFO] 
Epoch:[0/2](395100/4588595) loss:2.861 lr:0.0000100 epoch_Time:26579.0min: [2024-01-04 11:24:03,932][model8_pretrain.py][INFO] Epoch:[0/2](395100/4588595) loss:2.933 lr:0.0000100 epoch_Time:26579.0min: [2024-01-04 11:24:03,932][model8_pretrain.py][INFO] Epoch:[0/2](395100/4588595) loss:2.847 lr:0.0000100 epoch_Time:26579.0min: [2024-01-04 11:24:03,932][model8_pretrain.py][INFO] Epoch:[0/2](395100/4588595) loss:3.335 lr:0.0000100 epoch_Time:26579.0min: [2024-01-04 11:24:03,932][model8_pretrain.py][INFO] Epoch:[0/2](395100/4588595) loss:2.405 lr:0.0000100 epoch_Time:26579.0min: [2024-01-04 11:24:03,932][model8_pretrain.py][INFO] Epoch:[0/2](395100/4588595) loss:2.391 lr:0.0000100 epoch_Time:26579.0min: [2024-01-04 11:24:03,932][model8_pretrain.py][INFO] Epoch:[0/2](395100/4588595) loss:2.734 lr:0.0000100 epoch_Time:26579.0min: [2024-01-04 11:24:03,932][model8_pretrain.py][INFO] Epoch:[0/2](395100/4588595) loss:3.062 lr:0.0000100 epoch_Time:26579.0min: [2024-01-04 11:24:49,681][model8_pretrain.py][INFO] Epoch:[0/2](395200/4588595) loss:3.375 lr:0.0000100 epoch_Time:26580.0min: [2024-01-04 11:24:49,681][model8_pretrain.py][INFO] Epoch:[0/2](395200/4588595) loss:3.090 lr:0.0000100 epoch_Time:26580.0min: [2024-01-04 11:24:49,681][model8_pretrain.py][INFO] Epoch:[0/2](395200/4588595) loss:2.457 lr:0.0000100 epoch_Time:26580.0min: [2024-01-04 11:24:49,681][model8_pretrain.py][INFO] Epoch:[0/2](395200/4588595) loss:3.059 lr:0.0000100 epoch_Time:26580.0min: [2024-01-04 11:24:49,681][model8_pretrain.py][INFO] Epoch:[0/2](395200/4588595) loss:2.174 lr:0.0000100 epoch_Time:26580.0min: [2024-01-04 11:24:49,681][model8_pretrain.py][INFO] Epoch:[0/2](395200/4588595) loss:2.914 lr:0.0000100 epoch_Time:26580.0min: [2024-01-04 11:24:49,681][model8_pretrain.py][INFO] Epoch:[0/2](395200/4588595) loss:3.161 lr:0.0000100 epoch_Time:26580.0min: [2024-01-04 11:24:49,681][model8_pretrain.py][INFO] Epoch:[0/2](395200/4588595) loss:2.792 lr:0.0000100 epoch_Time:26580.0min: [2024-01-04 
11:25:26,613][model8_pretrain.py][INFO] Epoch:[0/2](395300/4588595) loss:2.921 lr:0.0000100 epoch_Time:26580.0min: [2024-01-04 11:25:26,613][model8_pretrain.py][INFO] Epoch:[0/2](395300/4588595) loss:2.795 lr:0.0000100 epoch_Time:26580.0min: [2024-01-04 11:25:26,613][model8_pretrain.py][INFO] Epoch:[0/2](395300/4588595) loss:2.288 lr:0.0000100 epoch_Time:26580.0min: [2024-01-04 11:25:26,613][model8_pretrain.py][INFO] Epoch:[0/2](395300/4588595) loss:2.778 lr:0.0000100 epoch_Time:26580.0min: [2024-01-04 11:25:26,613][model8_pretrain.py][INFO] Epoch:[0/2](395300/4588595) loss:2.627 lr:0.0000100 epoch_Time:26580.0min: [2024-01-04 11:25:26,613][model8_pretrain.py][INFO] Epoch:[0/2](395300/4588595) loss:2.851 lr:0.0000100 epoch_Time:26580.0min: [2024-01-04 11:25:26,613][model8_pretrain.py][INFO] Epoch:[0/2](395300/4588595) loss:3.597 lr:0.0000100 epoch_Time:26580.0min: [2024-01-04 11:25:26,613][model8_pretrain.py][INFO] Epoch:[0/2](395300/4588595) loss:2.406 lr:0.0000100 epoch_Time:26580.0min: [2024-01-04 11:26:03,546][model8_pretrain.py][INFO] Epoch:[0/2](395400/4588595) loss:2.840 lr:0.0000100 epoch_Time:26578.0min: [2024-01-04 11:26:03,546][model8_pretrain.py][INFO] Epoch:[0/2](395400/4588595) loss:3.017 lr:0.0000100 epoch_Time:26579.0min: [2024-01-04 11:26:03,546][model8_pretrain.py][INFO] Epoch:[0/2](395400/4588595) loss:2.519 lr:0.0000100 epoch_Time:26579.0min: [2024-01-04 11:26:03,546][model8_pretrain.py][INFO] Epoch:[0/2](395400/4588595) loss:2.634 lr:0.0000100 epoch_Time:26579.0min: [2024-01-04 11:26:03,546][model8_pretrain.py][INFO] Epoch:[0/2](395400/4588595) loss:2.589 lr:0.0000100 epoch_Time:26579.0min: [2024-01-04 11:26:03,546][model8_pretrain.py][INFO] Epoch:[0/2](395400/4588595) loss:3.074 lr:0.0000100 epoch_Time:26578.0min: [2024-01-04 11:26:03,546][model8_pretrain.py][INFO] Epoch:[0/2](395400/4588595) loss:2.653 lr:0.0000100 epoch_Time:26579.0min: [2024-01-04 11:26:03,546][model8_pretrain.py][INFO] Epoch:[0/2](395400/4588595) loss:2.448 lr:0.0000100 
epoch_Time:26579.0min: [2024-01-04 11:26:40,477][model8_pretrain.py][INFO] Epoch:[0/2](395500/4588595) loss:2.818 lr:0.0000100 epoch_Time:26578.0min: [2024-01-04 11:26:40,477][model8_pretrain.py][INFO] Epoch:[0/2](395500/4588595) loss:3.081 lr:0.0000100 epoch_Time:26578.0min: [2024-01-04 11:26:40,477][model8_pretrain.py][INFO] Epoch:[0/2](395500/4588595) loss:3.018 lr:0.0000100 epoch_Time:26578.0min: [2024-01-04 11:26:40,477][model8_pretrain.py][INFO] Epoch:[0/2](395500/4588595) loss:2.756 lr:0.0000100 epoch_Time:26578.0min: [2024-01-04 11:26:40,477][model8_pretrain.py][INFO] Epoch:[0/2](395500/4588595) loss:3.138 lr:0.0000100 epoch_Time:26578.0min: [2024-01-04 11:26:40,477][model8_pretrain.py][INFO] Epoch:[0/2](395500/4588595) loss:2.623 lr:0.0000100 epoch_Time:26578.0min: [2024-01-04 11:26:40,477][model8_pretrain.py][INFO] Epoch:[0/2](395500/4588595) loss:2.520 lr:0.0000100 epoch_Time:26578.0min: [2024-01-04 11:26:40,477][model8_pretrain.py][INFO] Epoch:[0/2](395500/4588595) loss:2.424 lr:0.0000100 epoch_Time:26578.0min: [2024-01-04 11:27:17,416][model8_pretrain.py][INFO] Epoch:[0/2](395600/4588595) loss:2.457 lr:0.0000100 epoch_Time:26577.0min: [2024-01-04 11:27:17,416][model8_pretrain.py][INFO] Epoch:[0/2](395600/4588595) loss:2.461 lr:0.0000100 epoch_Time:26577.0min: [2024-01-04 11:27:17,416][model8_pretrain.py][INFO] Epoch:[0/2](395600/4588595) loss:2.765 lr:0.0000100 epoch_Time:26577.0min: [2024-01-04 11:27:17,416][model8_pretrain.py][INFO] Epoch:[0/2](395600/4588595) loss:3.072 lr:0.0000100 epoch_Time:26577.0min: [2024-01-04 11:27:17,416][model8_pretrain.py][INFO] Epoch:[0/2](395600/4588595) loss:2.839 lr:0.0000100 epoch_Time:26577.0min: [2024-01-04 11:27:17,416][model8_pretrain.py][INFO] Epoch:[0/2](395600/4588595) loss:2.868 lr:0.0000100 epoch_Time:26577.0min: [2024-01-04 11:27:17,416][model8_pretrain.py][INFO] Epoch:[0/2](395600/4588595) loss:2.546 lr:0.0000100 epoch_Time:26577.0min: [2024-01-04 11:27:17,417][model8_pretrain.py][INFO] 
Epoch:[0/2](395600/4588595) loss:3.253 lr:0.0000100 epoch_Time:26577.0min: [2024-01-04 11:27:54,354][model8_pretrain.py][INFO] Epoch:[0/2](395700/4588595) loss:2.295 lr:0.0000100 epoch_Time:26576.0min: [2024-01-04 11:27:54,354][model8_pretrain.py][INFO] Epoch:[0/2](395700/4588595) loss:3.071 lr:0.0000100 epoch_Time:26576.0min: [2024-01-04 11:27:54,354][model8_pretrain.py][INFO] Epoch:[0/2](395700/4588595) loss:2.360 lr:0.0000100 epoch_Time:26576.0min: [2024-01-04 11:27:54,354][model8_pretrain.py][INFO] Epoch:[0/2](395700/4588595) loss:3.222 lr:0.0000100 epoch_Time:26576.0min: [2024-01-04 11:27:54,354][model8_pretrain.py][INFO] Epoch:[0/2](395700/4588595) loss:2.917 lr:0.0000100 epoch_Time:26576.0min: [2024-01-04 11:27:54,354][model8_pretrain.py][INFO] Epoch:[0/2](395700/4588595) loss:3.055 lr:0.0000100 epoch_Time:26576.0min: [2024-01-04 11:27:54,354][model8_pretrain.py][INFO] Epoch:[0/2](395700/4588595) loss:3.480 lr:0.0000100 epoch_Time:26576.0min: [2024-01-04 11:27:54,355][model8_pretrain.py][INFO] Epoch:[0/2](395700/4588595) loss:2.701 lr:0.0000100 epoch_Time:26576.0min: [2024-01-04 11:28:31,317][model8_pretrain.py][INFO] Epoch:[0/2](395800/4588595) loss:3.058 lr:0.0000100 epoch_Time:26576.0min: [2024-01-04 11:28:31,317][model8_pretrain.py][INFO] Epoch:[0/2](395800/4588595) loss:3.044 lr:0.0000100 epoch_Time:26576.0min: [2024-01-04 11:28:31,317][model8_pretrain.py][INFO] Epoch:[0/2](395800/4588595) loss:2.629 lr:0.0000100 epoch_Time:26576.0min: [2024-01-04 11:28:31,317][model8_pretrain.py][INFO] Epoch:[0/2](395800/4588595) loss:2.423 lr:0.0000100 epoch_Time:26576.0min: [2024-01-04 11:28:31,317][model8_pretrain.py][INFO] Epoch:[0/2](395800/4588595) loss:2.877 lr:0.0000100 epoch_Time:26576.0min: [2024-01-04 11:28:31,317][model8_pretrain.py][INFO] Epoch:[0/2](395800/4588595) loss:2.598 lr:0.0000100 epoch_Time:26576.0min: [2024-01-04 11:28:31,317][model8_pretrain.py][INFO] Epoch:[0/2](395800/4588595) loss:3.011 lr:0.0000100 epoch_Time:26576.0min: [2024-01-04 
11:28:31,317][model8_pretrain.py][INFO] Epoch:[0/2](395800/4588595) loss:3.286 lr:0.0000100 epoch_Time:26576.0min: [2024-01-04 11:29:08,251][model8_pretrain.py][INFO] Epoch:[0/2](395900/4588595) loss:2.265 lr:0.0000100 epoch_Time:26574.0min: [2024-01-04 11:29:08,251][model8_pretrain.py][INFO] Epoch:[0/2](395900/4588595) loss:2.980 lr:0.0000100 epoch_Time:26574.0min: [2024-01-04 11:29:08,251][model8_pretrain.py][INFO] Epoch:[0/2](395900/4588595) loss:3.204 lr:0.0000100 epoch_Time:26574.0min: [2024-01-04 11:29:08,251][model8_pretrain.py][INFO] Epoch:[0/2](395900/4588595) loss:3.228 lr:0.0000100 epoch_Time:26574.0min: [2024-01-04 11:29:08,251][model8_pretrain.py][INFO] Epoch:[0/2](395900/4588595) loss:2.785 lr:0.0000100 epoch_Time:26574.0min: [2024-01-04 11:29:08,251][model8_pretrain.py][INFO] Epoch:[0/2](395900/4588595) loss:2.793 lr:0.0000100 epoch_Time:26574.0min: [2024-01-04 11:29:08,251][model8_pretrain.py][INFO] Epoch:[0/2](395900/4588595) loss:2.803 lr:0.0000100 epoch_Time:26574.0min: [2024-01-04 11:29:08,252][model8_pretrain.py][INFO] Epoch:[0/2](395900/4588595) loss:3.174 lr:0.0000100 epoch_Time:26574.0min: [2024-01-04 11:29:53,994][model8_pretrain.py][INFO] Epoch:[0/2](396000/4588595) loss:2.663 lr:0.0000100 epoch_Time:26575.0min: [2024-01-04 11:29:53,994][model8_pretrain.py][INFO] Epoch:[0/2](396000/4588595) loss:3.153 lr:0.0000100 epoch_Time:26575.0min: [2024-01-04 11:29:53,994][model8_pretrain.py][INFO] Epoch:[0/2](396000/4588595) loss:3.254 lr:0.0000100 epoch_Time:26575.0min: [2024-01-04 11:29:53,994][model8_pretrain.py][INFO] Epoch:[0/2](396000/4588595) loss:3.095 lr:0.0000100 epoch_Time:26575.0min: [2024-01-04 11:29:53,994][model8_pretrain.py][INFO] Epoch:[0/2](396000/4588595) loss:3.160 lr:0.0000100 epoch_Time:26575.0min: [2024-01-04 11:29:53,994][model8_pretrain.py][INFO] Epoch:[0/2](396000/4588595) loss:2.829 lr:0.0000100 epoch_Time:26575.0min: [2024-01-04 11:29:53,994][model8_pretrain.py][INFO] Epoch:[0/2](396000/4588595) loss:2.430 lr:0.0000100 
epoch_Time:26575.0min: [2024-01-04 11:29:53,994][model8_pretrain.py][INFO] Epoch:[0/2](396000/4588595) loss:3.207 lr:0.0000100 epoch_Time:26575.0min: [2024-01-04 11:30:30,928][model8_pretrain.py][INFO] Epoch:[0/2](396100/4588595) loss:2.849 lr:0.0000100 epoch_Time:26575.0min: [2024-01-04 11:30:30,928][model8_pretrain.py][INFO] Epoch:[0/2](396100/4588595) loss:3.312 lr:0.0000100 epoch_Time:26575.0min: [2024-01-04 11:30:30,928][model8_pretrain.py][INFO] Epoch:[0/2](396100/4588595) loss:2.136 lr:0.0000100 epoch_Time:26575.0min: [2024-01-04 11:30:30,928][model8_pretrain.py][INFO] Epoch:[0/2](396100/4588595) loss:3.269 lr:0.0000100 epoch_Time:26575.0min: [2024-01-04 11:30:30,928][model8_pretrain.py][INFO] Epoch:[0/2](396100/4588595) loss:2.962 lr:0.0000100 epoch_Time:26575.0min: [2024-01-04 11:30:30,928][model8_pretrain.py][INFO] Epoch:[0/2](396100/4588595) loss:2.879 lr:0.0000100 epoch_Time:26575.0min: [2024-01-04 11:30:30,928][model8_pretrain.py][INFO] Epoch:[0/2](396100/4588595) loss:2.399 lr:0.0000100 epoch_Time:26575.0min: [2024-01-04 11:30:30,928][model8_pretrain.py][INFO] Epoch:[0/2](396100/4588595) loss:3.035 lr:0.0000100 epoch_Time:26575.0min: [2024-01-04 11:31:07,868][model8_pretrain.py][INFO] Epoch:[0/2](396200/4588595) loss:3.016 lr:0.0000100 epoch_Time:26574.0min: [2024-01-04 11:31:07,868][model8_pretrain.py][INFO] Epoch:[0/2](396200/4588595) loss:2.547 lr:0.0000100 epoch_Time:26574.0min: [2024-01-04 11:31:07,868][model8_pretrain.py][INFO] Epoch:[0/2](396200/4588595) loss:2.699 lr:0.0000100 epoch_Time:26574.0min: [2024-01-04 11:31:07,868][model8_pretrain.py][INFO] Epoch:[0/2](396200/4588595) loss:2.677 lr:0.0000100 epoch_Time:26574.0min: [2024-01-04 11:31:07,868][model8_pretrain.py][INFO] Epoch:[0/2](396200/4588595) loss:2.671 lr:0.0000100 epoch_Time:26574.0min: [2024-01-04 11:31:07,868][model8_pretrain.py][INFO] Epoch:[0/2](396200/4588595) loss:2.876 lr:0.0000100 epoch_Time:26574.0min: [2024-01-04 11:31:07,868][model8_pretrain.py][INFO] 
Epoch:[0/2](396200/4588595) loss:2.595 lr:0.0000100 epoch_Time:26574.0min: [2024-01-04 11:31:07,868][model8_pretrain.py][INFO] Epoch:[0/2](396200/4588595) loss:2.400 lr:0.0000100 epoch_Time:26574.0min: [2024-01-04 11:31:44,800][model8_pretrain.py][INFO] Epoch:[0/2](396300/4588595) loss:2.514 lr:0.0000100 epoch_Time:26573.0min: [2024-01-04 11:31:44,800][model8_pretrain.py][INFO] Epoch:[0/2](396300/4588595) loss:2.638 lr:0.0000100 epoch_Time:26573.0min: [2024-01-04 11:31:44,800][model8_pretrain.py][INFO] Epoch:[0/2](396300/4588595) loss:2.434 lr:0.0000100 epoch_Time:26573.0min: [2024-01-04 11:31:44,800][model8_pretrain.py][INFO] Epoch:[0/2](396300/4588595) loss:2.644 lr:0.0000100 epoch_Time:26573.0min: [2024-01-04 11:31:44,800][model8_pretrain.py][INFO] Epoch:[0/2](396300/4588595) loss:2.346 lr:0.0000100 epoch_Time:26573.0min: [2024-01-04 11:31:44,800][model8_pretrain.py][INFO] Epoch:[0/2](396300/4588595) loss:2.696 lr:0.0000100 epoch_Time:26573.0min: [2024-01-04 11:31:44,800][model8_pretrain.py][INFO] Epoch:[0/2](396300/4588595) loss:3.395 lr:0.0000100 epoch_Time:26573.0min: [2024-01-04 11:31:44,800][model8_pretrain.py][INFO] Epoch:[0/2](396300/4588595) loss:2.890 lr:0.0000100 epoch_Time:26573.0min: [2024-01-04 11:32:21,748][model8_pretrain.py][INFO] Epoch:[0/2](396400/4588595) loss:2.856 lr:0.0000100 epoch_Time:26572.0min: [2024-01-04 11:32:21,748][model8_pretrain.py][INFO] Epoch:[0/2](396400/4588595) loss:3.062 lr:0.0000100 epoch_Time:26572.0min: [2024-01-04 11:32:21,748][model8_pretrain.py][INFO] Epoch:[0/2](396400/4588595) loss:2.828 lr:0.0000100 epoch_Time:26572.0min: [2024-01-04 11:32:21,748][model8_pretrain.py][INFO] Epoch:[0/2](396400/4588595) loss:2.529 lr:0.0000100 epoch_Time:26572.0min: [2024-01-04 11:32:21,748][model8_pretrain.py][INFO] Epoch:[0/2](396400/4588595) loss:3.152 lr:0.0000100 epoch_Time:26572.0min: [2024-01-04 11:32:21,749][model8_pretrain.py][INFO] Epoch:[0/2](396400/4588595) loss:2.845 lr:0.0000100 epoch_Time:26572.0min: [2024-01-04 
11:32:21,748][model8_pretrain.py][INFO] Epoch:[0/2](396400/4588595) loss:2.792 lr:0.0000100 epoch_Time:26572.0min: [2024-01-04 11:32:21,749][model8_pretrain.py][INFO] Epoch:[0/2](396400/4588595) loss:2.033 lr:0.0000100 epoch_Time:26572.0min: [2024-01-04 11:32:58,689][model8_pretrain.py][INFO] Epoch:[0/2](396500/4588595) loss:2.833 lr:0.0000100 epoch_Time:26571.0min: [2024-01-04 11:32:58,689][model8_pretrain.py][INFO] Epoch:[0/2](396500/4588595) loss:3.280 lr:0.0000100 epoch_Time:26571.0min: [2024-01-04 11:32:58,689][model8_pretrain.py][INFO] Epoch:[0/2](396500/4588595) loss:2.524 lr:0.0000100 epoch_Time:26571.0min: [2024-01-04 11:32:58,689][model8_pretrain.py][INFO] Epoch:[0/2](396500/4588595) loss:2.990 lr:0.0000100 epoch_Time:26571.0min: [2024-01-04 11:32:58,689][model8_pretrain.py][INFO] Epoch:[0/2](396500/4588595) loss:2.597 lr:0.0000100 epoch_Time:26571.0min: [2024-01-04 11:32:58,690][model8_pretrain.py][INFO] Epoch:[0/2](396500/4588595) loss:2.714 lr:0.0000100 epoch_Time:26571.0min: [2024-01-04 11:32:58,690][model8_pretrain.py][INFO] Epoch:[0/2](396500/4588595) loss:3.227 lr:0.0000100 epoch_Time:26571.0min: [2024-01-04 11:32:58,691][model8_pretrain.py][INFO] Epoch:[0/2](396500/4588595) loss:2.981 lr:0.0000100 epoch_Time:26571.0min: [2024-01-04 11:33:35,640][model8_pretrain.py][INFO] Epoch:[0/2](396600/4588595) loss:2.713 lr:0.0000100 epoch_Time:26571.0min: [2024-01-04 11:33:35,640][model8_pretrain.py][INFO] Epoch:[0/2](396600/4588595) loss:3.005 lr:0.0000100 epoch_Time:26571.0min: [2024-01-04 11:33:35,640][model8_pretrain.py][INFO] Epoch:[0/2](396600/4588595) loss:3.225 lr:0.0000100 epoch_Time:26571.0min: [2024-01-04 11:33:35,640][model8_pretrain.py][INFO] Epoch:[0/2](396600/4588595) loss:2.933 lr:0.0000100 epoch_Time:26571.0min: [2024-01-04 11:33:35,640][model8_pretrain.py][INFO] Epoch:[0/2](396600/4588595) loss:2.267 lr:0.0000100 epoch_Time:26571.0min: [2024-01-04 11:33:35,640][model8_pretrain.py][INFO] Epoch:[0/2](396600/4588595) loss:3.452 lr:0.0000100 
epoch_Time:26571.0min: [2024-01-04 11:33:35,640][model8_pretrain.py][INFO] Epoch:[0/2](396600/4588595) loss:3.276 lr:0.0000100 epoch_Time:26571.0min: [2024-01-04 11:33:35,640][model8_pretrain.py][INFO] Epoch:[0/2](396600/4588595) loss:3.230 lr:0.0000100 epoch_Time:26571.0min: [2024-01-04 11:34:12,593][model8_pretrain.py][INFO] Epoch:[0/2](396700/4588595) loss:2.915 lr:0.0000100 epoch_Time:26569.0min: [2024-01-04 11:34:12,593][model8_pretrain.py][INFO] Epoch:[0/2](396700/4588595) loss:2.833 lr:0.0000100 epoch_Time:26569.0min: [2024-01-04 11:34:12,593][model8_pretrain.py][INFO] Epoch:[0/2](396700/4588595) loss:2.907 lr:0.0000100 epoch_Time:26569.0min: [2024-01-04 11:34:12,593][model8_pretrain.py][INFO] Epoch:[0/2](396700/4588595) loss:3.323 lr:0.0000100 epoch_Time:26569.0min: [2024-01-04 11:34:12,593][model8_pretrain.py][INFO] Epoch:[0/2](396700/4588595) loss:2.805 lr:0.0000100 epoch_Time:26569.0min: [2024-01-04 11:34:12,593][model8_pretrain.py][INFO] Epoch:[0/2](396700/4588595) loss:3.157 lr:0.0000100 epoch_Time:26569.0min: [2024-01-04 11:34:12,593][model8_pretrain.py][INFO] Epoch:[0/2](396700/4588595) loss:2.736 lr:0.0000100 epoch_Time:26569.0min: [2024-01-04 11:34:12,593][model8_pretrain.py][INFO] Epoch:[0/2](396700/4588595) loss:2.366 lr:0.0000100 epoch_Time:26569.0min: [2024-01-04 11:34:58,320][model8_pretrain.py][INFO] Epoch:[0/2](396800/4588595) loss:3.115 lr:0.0000100 epoch_Time:26570.0min: [2024-01-04 11:34:58,320][model8_pretrain.py][INFO] Epoch:[0/2](396800/4588595) loss:2.943 lr:0.0000100 epoch_Time:26570.0min: [2024-01-04 11:34:58,320][model8_pretrain.py][INFO] Epoch:[0/2](396800/4588595) loss:3.134 lr:0.0000100 epoch_Time:26570.0min: [2024-01-04 11:34:58,320][model8_pretrain.py][INFO] Epoch:[0/2](396800/4588595) loss:2.929 lr:0.0000100 epoch_Time:26570.0min: [2024-01-04 11:34:58,320][model8_pretrain.py][INFO] Epoch:[0/2](396800/4588595) loss:2.936 lr:0.0000100 epoch_Time:26570.0min: [2024-01-04 11:34:58,320][model8_pretrain.py][INFO] 
Epoch:[0/2](396800/4588595) loss:2.957 lr:0.0000100 epoch_Time:26570.0min: [2024-01-04 11:34:58,321][model8_pretrain.py][INFO] Epoch:[0/2](396800/4588595) loss:2.910 lr:0.0000100 epoch_Time:26570.0min: [2024-01-04 11:34:58,321][model8_pretrain.py][INFO] Epoch:[0/2](396800/4588595) loss:2.896 lr:0.0000100 epoch_Time:26570.0min: [2024-01-04 11:35:35,250][model8_pretrain.py][INFO] Epoch:[0/2](396900/4588595) loss:3.112 lr:0.0000100 epoch_Time:26570.0min: [2024-01-04 11:35:35,250][model8_pretrain.py][INFO] Epoch:[0/2](396900/4588595) loss:2.431 lr:0.0000100 epoch_Time:26570.0min: [2024-01-04 11:35:35,250][model8_pretrain.py][INFO] Epoch:[0/2](396900/4588595) loss:3.112 lr:0.0000100 epoch_Time:26570.0min: [2024-01-04 11:35:35,250][model8_pretrain.py][INFO] Epoch:[0/2](396900/4588595) loss:2.639 lr:0.0000100 epoch_Time:26570.0min: [2024-01-04 11:35:35,250][model8_pretrain.py][INFO] Epoch:[0/2](396900/4588595) loss:2.556 lr:0.0000100 epoch_Time:26570.0min: [2024-01-04 11:35:35,250][model8_pretrain.py][INFO] Epoch:[0/2](396900/4588595) loss:2.641 lr:0.0000100 epoch_Time:26570.0min: [2024-01-04 11:35:35,251][model8_pretrain.py][INFO] Epoch:[0/2](396900/4588595) loss:3.266 lr:0.0000100 epoch_Time:26570.0min: [2024-01-04 11:35:35,251][model8_pretrain.py][INFO] Epoch:[0/2](396900/4588595) loss:2.742 lr:0.0000100 epoch_Time:26570.0min: [2024-01-04 11:36:12,200][model8_pretrain.py][INFO] Epoch:[0/2](397000/4588595) loss:2.757 lr:0.0000100 epoch_Time:26569.0min: [2024-01-04 11:36:12,200][model8_pretrain.py][INFO] Epoch:[0/2](397000/4588595) loss:3.072 lr:0.0000100 epoch_Time:26569.0min: [2024-01-04 11:36:12,200][model8_pretrain.py][INFO] Epoch:[0/2](397000/4588595) loss:2.545 lr:0.0000100 epoch_Time:26569.0min: [2024-01-04 11:36:12,200][model8_pretrain.py][INFO] Epoch:[0/2](397000/4588595) loss:3.397 lr:0.0000100 epoch_Time:26569.0min: [2024-01-04 11:36:12,200][model8_pretrain.py][INFO] Epoch:[0/2](397000/4588595) loss:2.578 lr:0.0000100 epoch_Time:26569.0min: [2024-01-04 
11:36:12,200][model8_pretrain.py][INFO] Epoch:[0/2](397000/4588595) loss:2.452 lr:0.0000100 epoch_Time:26569.0min: [2024-01-04 11:36:12,200][model8_pretrain.py][INFO] Epoch:[0/2](397000/4588595) loss:3.314 lr:0.0000100 epoch_Time:26569.0min: [2024-01-04 11:36:12,201][model8_pretrain.py][INFO] Epoch:[0/2](397000/4588595) loss:2.900 lr:0.0000100 epoch_Time:26569.0min: [2024-01-04 11:36:49,145][model8_pretrain.py][INFO] Epoch:[0/2](397100/4588595) loss:2.863 lr:0.0000100 epoch_Time:26567.0min: [2024-01-04 11:36:49,145][model8_pretrain.py][INFO] Epoch:[0/2](397100/4588595) loss:2.798 lr:0.0000100 epoch_Time:26567.0min: [2024-01-04 11:36:49,145][model8_pretrain.py][INFO] Epoch:[0/2](397100/4588595) loss:2.655 lr:0.0000100 epoch_Time:26567.0min: [2024-01-04 11:36:49,145][model8_pretrain.py][INFO] Epoch:[0/2](397100/4588595) loss:3.030 lr:0.0000100 epoch_Time:26567.0min: [2024-01-04 11:36:49,146][model8_pretrain.py][INFO] Epoch:[0/2](397100/4588595) loss:2.678 lr:0.0000100 epoch_Time:26567.0min: [2024-01-04 11:36:49,146][model8_pretrain.py][INFO] Epoch:[0/2](397100/4588595) loss:3.069 lr:0.0000100 epoch_Time:26567.0min: [2024-01-04 11:36:49,146][model8_pretrain.py][INFO] Epoch:[0/2](397100/4588595) loss:2.676 lr:0.0000100 epoch_Time:26567.0min: [2024-01-04 11:36:49,146][model8_pretrain.py][INFO] Epoch:[0/2](397100/4588595) loss:2.436 lr:0.0000100 epoch_Time:26567.0min: [2024-01-04 11:37:26,082][model8_pretrain.py][INFO] Epoch:[0/2](397200/4588595) loss:2.831 lr:0.0000100 epoch_Time:26567.0min: [2024-01-04 11:37:26,082][model8_pretrain.py][INFO] Epoch:[0/2](397200/4588595) loss:2.581 lr:0.0000100 epoch_Time:26567.0min: [2024-01-04 11:37:26,082][model8_pretrain.py][INFO] Epoch:[0/2](397200/4588595) loss:2.965 lr:0.0000100 epoch_Time:26567.0min: [2024-01-04 11:37:26,082][model8_pretrain.py][INFO] Epoch:[0/2](397200/4588595) loss:3.022 lr:0.0000100 epoch_Time:26567.0min: [2024-01-04 11:37:26,082][model8_pretrain.py][INFO] Epoch:[0/2](397200/4588595) loss:2.654 lr:0.0000100 
epoch_Time:26567.0min: [2024-01-04 11:37:26,082][model8_pretrain.py][INFO] Epoch:[0/2](397200/4588595) loss:3.103 lr:0.0000100 epoch_Time:26567.0min: [2024-01-04 11:37:26,082][model8_pretrain.py][INFO] Epoch:[0/2](397200/4588595) loss:3.356 lr:0.0000100 epoch_Time:26567.0min: [2024-01-04 11:37:26,082][model8_pretrain.py][INFO] Epoch:[0/2](397200/4588595) loss:2.851 lr:0.0000100 epoch_Time:26567.0min: [2024-01-04 11:38:03,021][model8_pretrain.py][INFO] Epoch:[0/2](397300/4588595) loss:3.272 lr:0.0000100 epoch_Time:26566.0min: [2024-01-04 11:38:03,021][model8_pretrain.py][INFO] Epoch:[0/2](397300/4588595) loss:3.054 lr:0.0000100 epoch_Time:26566.0min: [2024-01-04 11:38:03,021][model8_pretrain.py][INFO] Epoch:[0/2](397300/4588595) loss:2.710 lr:0.0000100 epoch_Time:26566.0min: [2024-01-04 11:38:03,021][model8_pretrain.py][INFO] Epoch:[0/2](397300/4588595) loss:2.768 lr:0.0000100 epoch_Time:26566.0min: [2024-01-04 11:38:03,021][model8_pretrain.py][INFO] Epoch:[0/2](397300/4588595) loss:2.616 lr:0.0000100 epoch_Time:26566.0min: [2024-01-04 11:38:03,021][model8_pretrain.py][INFO] Epoch:[0/2](397300/4588595) loss:2.670 lr:0.0000100 epoch_Time:26566.0min: [2024-01-04 11:38:03,021][model8_pretrain.py][INFO] Epoch:[0/2](397300/4588595) loss:3.079 lr:0.0000100 epoch_Time:26566.0min: [2024-01-04 11:38:03,021][model8_pretrain.py][INFO] Epoch:[0/2](397300/4588595) loss:2.789 lr:0.0000100 epoch_Time:26566.0min: [2024-01-04 11:38:39,980][model8_pretrain.py][INFO] Epoch:[0/2](397400/4588595) loss:3.377 lr:0.0000100 epoch_Time:26566.0min: [2024-01-04 11:38:39,981][model8_pretrain.py][INFO] Epoch:[0/2](397400/4588595) loss:3.083 lr:0.0000100 epoch_Time:26566.0min: [2024-01-04 11:38:39,981][model8_pretrain.py][INFO] Epoch:[0/2](397400/4588595) loss:2.663 lr:0.0000100 epoch_Time:26566.0min: [2024-01-04 11:38:39,981][model8_pretrain.py][INFO] Epoch:[0/2](397400/4588595) loss:3.317 lr:0.0000100 epoch_Time:26566.0min: [2024-01-04 11:38:39,981][model8_pretrain.py][INFO] 
Epoch:[0/2](397400/4588595) loss:2.613 lr:0.0000100 epoch_Time:26566.0min: [2024-01-04 11:38:39,981][model8_pretrain.py][INFO] Epoch:[0/2](397400/4588595) loss:3.316 lr:0.0000100 epoch_Time:26566.0min: [2024-01-04 11:38:39,981][model8_pretrain.py][INFO] Epoch:[0/2](397400/4588595) loss:2.682 lr:0.0000100 epoch_Time:26566.0min: [2024-01-04 11:38:39,981][model8_pretrain.py][INFO] Epoch:[0/2](397400/4588595) loss:2.990 lr:0.0000100 epoch_Time:26566.0min: [2024-01-04 11:39:16,916][model8_pretrain.py][INFO] Epoch:[0/2](397500/4588595) loss:2.998 lr:0.0000100 epoch_Time:26564.0min: [2024-01-04 11:39:16,916][model8_pretrain.py][INFO] Epoch:[0/2](397500/4588595) loss:2.965 lr:0.0000100 epoch_Time:26564.0min: [2024-01-04 11:39:16,916][model8_pretrain.py][INFO] Epoch:[0/2](397500/4588595) loss:2.259 lr:0.0000100 epoch_Time:26564.0min: [2024-01-04 11:39:16,916][model8_pretrain.py][INFO] Epoch:[0/2](397500/4588595) loss:2.257 lr:0.0000100 epoch_Time:26564.0min: [2024-01-04 11:39:16,916][model8_pretrain.py][INFO] Epoch:[0/2](397500/4588595) loss:2.872 lr:0.0000100 epoch_Time:26564.0min: [2024-01-04 11:39:16,916][model8_pretrain.py][INFO] Epoch:[0/2](397500/4588595) loss:3.311 lr:0.0000100 epoch_Time:26564.0min: [2024-01-04 11:39:16,916][model8_pretrain.py][INFO] Epoch:[0/2](397500/4588595) loss:3.358 lr:0.0000100 epoch_Time:26564.0min: [2024-01-04 11:39:16,916][model8_pretrain.py][INFO] Epoch:[0/2](397500/4588595) loss:2.660 lr:0.0000100 epoch_Time:26564.0min: [2024-01-04 11:40:02,620][model8_pretrain.py][INFO] Epoch:[0/2](397600/4588595) loss:3.297 lr:0.0000100 epoch_Time:26565.0min: [2024-01-04 11:40:02,620][model8_pretrain.py][INFO] Epoch:[0/2](397600/4588595) loss:2.839 lr:0.0000100 epoch_Time:26565.0min: [2024-01-04 11:40:02,620][model8_pretrain.py][INFO] Epoch:[0/2](397600/4588595) loss:3.152 lr:0.0000100 epoch_Time:26565.0min: [2024-01-04 11:40:02,620][model8_pretrain.py][INFO] Epoch:[0/2](397600/4588595) loss:2.755 lr:0.0000100 epoch_Time:26565.0min: [2024-01-04 
11:40:02,620][model8_pretrain.py][INFO] Epoch:[0/2](397600/4588595) loss:2.852 lr:0.0000100 epoch_Time:26565.0min: [2024-01-04 11:40:02,621][model8_pretrain.py][INFO] Epoch:[0/2](397600/4588595) loss:3.055 lr:0.0000100 epoch_Time:26565.0min: [2024-01-04 11:40:02,621][model8_pretrain.py][INFO] Epoch:[0/2](397600/4588595) loss:2.963 lr:0.0000100 epoch_Time:26565.0min: [2024-01-04 11:40:02,622][model8_pretrain.py][INFO] Epoch:[0/2](397600/4588595) loss:3.293 lr:0.0000100 epoch_Time:26565.0min: [2024-01-04 11:40:39,561][model8_pretrain.py][INFO] Epoch:[0/2](397700/4588595) loss:3.352 lr:0.0000100 epoch_Time:26565.0min: [2024-01-04 11:40:39,561][model8_pretrain.py][INFO] Epoch:[0/2](397700/4588595) loss:2.202 lr:0.0000100 epoch_Time:26565.0min: [2024-01-04 11:40:39,561][model8_pretrain.py][INFO] Epoch:[0/2](397700/4588595) loss:2.892 lr:0.0000100 epoch_Time:26565.0min: [2024-01-04 11:40:39,561][model8_pretrain.py][INFO] Epoch:[0/2](397700/4588595) loss:2.967 lr:0.0000100 epoch_Time:26565.0min: [2024-01-04 11:40:39,561][model8_pretrain.py][INFO] Epoch:[0/2](397700/4588595) loss:2.731 lr:0.0000100 epoch_Time:26565.0min: [2024-01-04 11:40:39,561][model8_pretrain.py][INFO] Epoch:[0/2](397700/4588595) loss:2.883 lr:0.0000100 epoch_Time:26565.0min: [2024-01-04 11:40:39,562][model8_pretrain.py][INFO] Epoch:[0/2](397700/4588595) loss:3.097 lr:0.0000100 epoch_Time:26565.0min: [2024-01-04 11:40:39,563][model8_pretrain.py][INFO] Epoch:[0/2](397700/4588595) loss:2.982 lr:0.0000100 epoch_Time:26565.0min: [2024-01-04 11:41:16,501][model8_pretrain.py][INFO] Epoch:[0/2](397800/4588595) loss:3.210 lr:0.0000100 epoch_Time:26564.0min: [2024-01-04 11:41:16,501][model8_pretrain.py][INFO] Epoch:[0/2](397800/4588595) loss:2.821 lr:0.0000100 epoch_Time:26564.0min: [2024-01-04 11:41:16,501][model8_pretrain.py][INFO] Epoch:[0/2](397800/4588595) loss:2.677 lr:0.0000100 epoch_Time:26564.0min: [2024-01-04 11:41:16,501][model8_pretrain.py][INFO] Epoch:[0/2](397800/4588595) loss:2.614 lr:0.0000100 
epoch_Time:26564.0min: [2024-01-04 11:41:16,501][model8_pretrain.py][INFO] Epoch:[0/2](397800/4588595) loss:2.938 lr:0.0000100 epoch_Time:26564.0min: [2024-01-04 11:41:16,501][model8_pretrain.py][INFO] Epoch:[0/2](397800/4588595) loss:3.210 lr:0.0000100 epoch_Time:26564.0min: [2024-01-04 11:41:16,501][model8_pretrain.py][INFO] Epoch:[0/2](397800/4588595) loss:2.464 lr:0.0000100 epoch_Time:26564.0min: [2024-01-04 11:41:16,501][model8_pretrain.py][INFO] Epoch:[0/2](397800/4588595) loss:3.197 lr:0.0000100 epoch_Time:26564.0min: [2024-01-04 11:41:53,437][model8_pretrain.py][INFO] Epoch:[0/2](397900/4588595) loss:3.014 lr:0.0000100 epoch_Time:26562.0min: [2024-01-04 11:41:53,437][model8_pretrain.py][INFO] Epoch:[0/2](397900/4588595) loss:3.029 lr:0.0000100 epoch_Time:26562.0min: [2024-01-04 11:41:53,437][model8_pretrain.py][INFO] Epoch:[0/2](397900/4588595) loss:2.855 lr:0.0000100 epoch_Time:26562.0min: [2024-01-04 11:41:53,437][model8_pretrain.py][INFO] Epoch:[0/2](397900/4588595) loss:2.714 lr:0.0000100 epoch_Time:26562.0min: [2024-01-04 11:41:53,437][model8_pretrain.py][INFO] Epoch:[0/2](397900/4588595) loss:2.915 lr:0.0000100 epoch_Time:26562.0min: [2024-01-04 11:41:53,437][model8_pretrain.py][INFO] Epoch:[0/2](397900/4588595) loss:2.826 lr:0.0000100 epoch_Time:26562.0min: [2024-01-04 11:41:53,437][model8_pretrain.py][INFO] Epoch:[0/2](397900/4588595) loss:2.732 lr:0.0000100 epoch_Time:26562.0min: [2024-01-04 11:41:53,438][model8_pretrain.py][INFO] Epoch:[0/2](397900/4588595) loss:2.518 lr:0.0000100 epoch_Time:26562.0min: [2024-01-04 11:42:30,394][model8_pretrain.py][INFO] Epoch:[0/2](398000/4588595) loss:2.639 lr:0.0000100 epoch_Time:26562.0min: [2024-01-04 11:42:30,394][model8_pretrain.py][INFO] Epoch:[0/2](398000/4588595) loss:2.944 lr:0.0000100 epoch_Time:26562.0min: [2024-01-04 11:42:30,395][model8_pretrain.py][INFO] Epoch:[0/2](398000/4588595) loss:2.852 lr:0.0000100 epoch_Time:26562.0min: [2024-01-04 11:42:30,395][model8_pretrain.py][INFO] 
Epoch:[0/2](398000/4588595) loss:2.470 lr:0.0000100 epoch_Time:26562.0min: [2024-01-04 11:42:30,395][model8_pretrain.py][INFO] Epoch:[0/2](398000/4588595) loss:2.966 lr:0.0000100 epoch_Time:26562.0min: [2024-01-04 11:42:30,395][model8_pretrain.py][INFO] Epoch:[0/2](398000/4588595) loss:3.358 lr:0.0000100 epoch_Time:26562.0min: [2024-01-04 11:42:30,395][model8_pretrain.py][INFO] Epoch:[0/2](398000/4588595) loss:2.698 lr:0.0000100 epoch_Time:26562.0min: [2024-01-04 11:42:30,395][model8_pretrain.py][INFO] Epoch:[0/2](398000/4588595) loss:2.663 lr:0.0000100 epoch_Time:26562.0min: [2024-01-04 11:43:07,336][model8_pretrain.py][INFO] Epoch:[0/2](398100/4588595) loss:2.672 lr:0.0000100 epoch_Time:26561.0min: [2024-01-04 11:43:07,336][model8_pretrain.py][INFO] Epoch:[0/2](398100/4588595) loss:2.315 lr:0.0000100 epoch_Time:26561.0min: [2024-01-04 11:43:07,336][model8_pretrain.py][INFO] Epoch:[0/2](398100/4588595) loss:2.909 lr:0.0000100 epoch_Time:26561.0min: [2024-01-04 11:43:07,336][model8_pretrain.py][INFO] Epoch:[0/2](398100/4588595) loss:3.017 lr:0.0000100 epoch_Time:26561.0min: [2024-01-04 11:43:07,336][model8_pretrain.py][INFO] Epoch:[0/2](398100/4588595) loss:2.927 lr:0.0000100 epoch_Time:26561.0min: [2024-01-04 11:43:07,336][model8_pretrain.py][INFO] Epoch:[0/2](398100/4588595) loss:3.187 lr:0.0000100 epoch_Time:26561.0min: [2024-01-04 11:43:07,336][model8_pretrain.py][INFO] Epoch:[0/2](398100/4588595) loss:2.566 lr:0.0000100 epoch_Time:26561.0min: [2024-01-04 11:43:07,336][model8_pretrain.py][INFO] Epoch:[0/2](398100/4588595) loss:3.194 lr:0.0000100 epoch_Time:26561.0min: [2024-01-04 11:43:44,287][model8_pretrain.py][INFO] Epoch:[0/2](398200/4588595) loss:2.564 lr:0.0000100 epoch_Time:26561.0min: [2024-01-04 11:43:44,287][model8_pretrain.py][INFO] Epoch:[0/2](398200/4588595) loss:3.492 lr:0.0000100 epoch_Time:26561.0min: [2024-01-04 11:43:44,287][model8_pretrain.py][INFO] Epoch:[0/2](398200/4588595) loss:3.193 lr:0.0000100 epoch_Time:26561.0min: [2024-01-04 
11:43:44,287][model8_pretrain.py][INFO] Epoch:[0/2](398200/4588595) loss:3.132 lr:0.0000100 epoch_Time:26561.0min: [2024-01-04 11:43:44,287][model8_pretrain.py][INFO] Epoch:[0/2](398200/4588595) loss:3.249 lr:0.0000100 epoch_Time:26561.0min: [2024-01-04 11:43:44,287][model8_pretrain.py][INFO] Epoch:[0/2](398200/4588595) loss:2.840 lr:0.0000100 epoch_Time:26561.0min: [2024-01-04 11:43:44,287][model8_pretrain.py][INFO] Epoch:[0/2](398200/4588595) loss:3.004 lr:0.0000100 epoch_Time:26561.0min: [2024-01-04 11:43:44,287][model8_pretrain.py][INFO] Epoch:[0/2](398200/4588595) loss:3.063 lr:0.0000100 epoch_Time:26561.0min: [2024-01-04 11:44:21,223][model8_pretrain.py][INFO] Epoch:[0/2](398300/4588595) loss:2.532 lr:0.0000100 epoch_Time:26560.0min: [2024-01-04 11:44:21,223][model8_pretrain.py][INFO] Epoch:[0/2](398300/4588595) loss:3.128 lr:0.0000100 epoch_Time:26559.0min: [2024-01-04 11:44:21,223][model8_pretrain.py][INFO] Epoch:[0/2](398300/4588595) loss:2.543 lr:0.0000100 epoch_Time:26559.0min: [2024-01-04 11:44:21,223][model8_pretrain.py][INFO] Epoch:[0/2](398300/4588595) loss:2.950 lr:0.0000100 epoch_Time:26559.0min: [2024-01-04 11:44:21,223][model8_pretrain.py][INFO] Epoch:[0/2](398300/4588595) loss:2.412 lr:0.0000100 epoch_Time:26560.0min: [2024-01-04 11:44:21,223][model8_pretrain.py][INFO] Epoch:[0/2](398300/4588595) loss:3.283 lr:0.0000100 epoch_Time:26559.0min: [2024-01-04 11:44:21,223][model8_pretrain.py][INFO] Epoch:[0/2](398300/4588595) loss:2.706 lr:0.0000100 epoch_Time:26559.0min: [2024-01-04 11:44:21,223][model8_pretrain.py][INFO] Epoch:[0/2](398300/4588595) loss:2.735 lr:0.0000100 epoch_Time:26559.0min: [2024-01-04 11:45:06,924][model8_pretrain.py][INFO] Epoch:[0/2](398400/4588595) loss:3.018 lr:0.0000100 epoch_Time:26560.0min: [2024-01-04 11:45:06,924][model8_pretrain.py][INFO] Epoch:[0/2](398400/4588595) loss:2.510 lr:0.0000100 epoch_Time:26560.0min: [2024-01-04 11:45:06,924][model8_pretrain.py][INFO] Epoch:[0/2](398400/4588595) loss:3.120 lr:0.0000100 
epoch_Time:26560.0min: [2024-01-04 11:45:06,924][model8_pretrain.py][INFO] Epoch:[0/2](398400/4588595) loss:3.205 lr:0.0000100 epoch_Time:26560.0min: [2024-01-04 11:45:06,924][model8_pretrain.py][INFO] Epoch:[0/2](398400/4588595) loss:3.147 lr:0.0000100 epoch_Time:26560.0min: [2024-01-04 11:45:06,924][model8_pretrain.py][INFO] Epoch:[0/2](398400/4588595) loss:2.365 lr:0.0000100 epoch_Time:26560.0min: [2024-01-04 11:45:06,924][model8_pretrain.py][INFO] Epoch:[0/2](398400/4588595) loss:3.007 lr:0.0000100 epoch_Time:26560.0min: [2024-01-04 11:45:06,925][model8_pretrain.py][INFO] Epoch:[0/2](398400/4588595) loss:2.917 lr:0.0000100 epoch_Time:26560.0min: [2024-01-04 11:45:43,855][model8_pretrain.py][INFO] Epoch:[0/2](398500/4588595) loss:3.137 lr:0.0000100 epoch_Time:26560.0min: [2024-01-04 11:45:43,855][model8_pretrain.py][INFO] Epoch:[0/2](398500/4588595) loss:2.868 lr:0.0000100 epoch_Time:26560.0min: [2024-01-04 11:45:43,855][model8_pretrain.py][INFO] Epoch:[0/2](398500/4588595) loss:2.921 lr:0.0000100 epoch_Time:26560.0min: [2024-01-04 11:45:43,855][model8_pretrain.py][INFO] Epoch:[0/2](398500/4588595) loss:2.668 lr:0.0000100 epoch_Time:26560.0min: [2024-01-04 11:45:43,855][model8_pretrain.py][INFO] Epoch:[0/2](398500/4588595) loss:2.516 lr:0.0000100 epoch_Time:26560.0min: [2024-01-04 11:45:43,855][model8_pretrain.py][INFO] Epoch:[0/2](398500/4588595) loss:3.150 lr:0.0000100 epoch_Time:26560.0min: [2024-01-04 11:45:43,855][model8_pretrain.py][INFO] Epoch:[0/2](398500/4588595) loss:2.952 lr:0.0000100 epoch_Time:26560.0min: [2024-01-04 11:45:43,855][model8_pretrain.py][INFO] Epoch:[0/2](398500/4588595) loss:2.937 lr:0.0000100 epoch_Time:26560.0min: [2024-01-04 11:46:20,794][model8_pretrain.py][INFO] Epoch:[0/2](398600/4588595) loss:3.187 lr:0.0000100 epoch_Time:26559.0min: [2024-01-04 11:46:20,794][model8_pretrain.py][INFO] Epoch:[0/2](398600/4588595) loss:3.433 lr:0.0000100 epoch_Time:26559.0min: [2024-01-04 11:46:20,794][model8_pretrain.py][INFO] 
Epoch:[0/2](398600/4588595) loss:2.775 lr:0.0000100 epoch_Time:26559.0min: [2024-01-04 11:46:20,794][model8_pretrain.py][INFO] Epoch:[0/2](398600/4588595) loss:3.396 lr:0.0000100 epoch_Time:26559.0min: [2024-01-04 11:46:20,794][model8_pretrain.py][INFO] Epoch:[0/2](398600/4588595) loss:3.077 lr:0.0000100 epoch_Time:26559.0min: [2024-01-04 11:46:20,794][model8_pretrain.py][INFO] Epoch:[0/2](398600/4588595) loss:3.075 lr:0.0000100 epoch_Time:26559.0min: [2024-01-04 11:46:20,794][model8_pretrain.py][INFO] Epoch:[0/2](398600/4588595) loss:2.543 lr:0.0000100 epoch_Time:26559.0min: [2024-01-04 11:46:20,794][model8_pretrain.py][INFO] Epoch:[0/2](398600/4588595) loss:2.543 lr:0.0000100 epoch_Time:26559.0min: [2024-01-04 11:46:57,735][model8_pretrain.py][INFO] Epoch:[0/2](398700/4588595) loss:3.187 lr:0.0000100 epoch_Time:26557.0min: [2024-01-04 11:46:57,735][model8_pretrain.py][INFO] Epoch:[0/2](398700/4588595) loss:2.695 lr:0.0000100 epoch_Time:26557.0min: [2024-01-04 11:46:57,735][model8_pretrain.py][INFO] Epoch:[0/2](398700/4588595) loss:2.849 lr:0.0000100 epoch_Time:26557.0min: [2024-01-04 11:46:57,735][model8_pretrain.py][INFO] Epoch:[0/2](398700/4588595) loss:2.775 lr:0.0000100 epoch_Time:26557.0min: [2024-01-04 11:46:57,735][model8_pretrain.py][INFO] Epoch:[0/2](398700/4588595) loss:2.788 lr:0.0000100 epoch_Time:26557.0min: [2024-01-04 11:46:57,735][model8_pretrain.py][INFO] Epoch:[0/2](398700/4588595) loss:2.931 lr:0.0000100 epoch_Time:26557.0min: [2024-01-04 11:46:57,735][model8_pretrain.py][INFO] Epoch:[0/2](398700/4588595) loss:3.186 lr:0.0000100 epoch_Time:26557.0min: [2024-01-04 11:46:57,735][model8_pretrain.py][INFO] Epoch:[0/2](398700/4588595) loss:3.041 lr:0.0000100 epoch_Time:26557.0min: [2024-01-04 11:47:34,681][model8_pretrain.py][INFO] Epoch:[0/2](398800/4588595) loss:2.589 lr:0.0000100 epoch_Time:26557.0min: [2024-01-04 11:47:34,681][model8_pretrain.py][INFO] Epoch:[0/2](398800/4588595) loss:3.051 lr:0.0000100 epoch_Time:26557.0min: [2024-01-04 
11:47:34,681][model8_pretrain.py][INFO] Epoch:[0/2](398800/4588595) loss:2.836 lr:0.0000100 epoch_Time:26557.0min: [2024-01-04 11:47:34,681][model8_pretrain.py][INFO] Epoch:[0/2](398800/4588595) loss:2.844 lr:0.0000100 epoch_Time:26557.0min: [2024-01-04 11:47:34,681][model8_pretrain.py][INFO] Epoch:[0/2](398800/4588595) loss:2.569 lr:0.0000100 epoch_Time:26557.0min: [2024-01-04 11:47:34,681][model8_pretrain.py][INFO] Epoch:[0/2](398800/4588595) loss:2.658 lr:0.0000100 epoch_Time:26557.0min: [2024-01-04 11:47:34,681][model8_pretrain.py][INFO] Epoch:[0/2](398800/4588595) loss:2.569 lr:0.0000100 epoch_Time:26557.0min: [2024-01-04 11:47:34,681][model8_pretrain.py][INFO] Epoch:[0/2](398800/4588595) loss:2.668 lr:0.0000100 epoch_Time:26557.0min: [2024-01-04 11:48:11,627][model8_pretrain.py][INFO] Epoch:[0/2](398900/4588595) loss:2.563 lr:0.0000100 epoch_Time:26556.0min: [2024-01-04 11:48:11,627][model8_pretrain.py][INFO] Epoch:[0/2](398900/4588595) loss:3.120 lr:0.0000100 epoch_Time:26556.0min: [2024-01-04 11:48:11,627][model8_pretrain.py][INFO] Epoch:[0/2](398900/4588595) loss:2.840 lr:0.0000100 epoch_Time:26556.0min: [2024-01-04 11:48:11,627][model8_pretrain.py][INFO] Epoch:[0/2](398900/4588595) loss:3.030 lr:0.0000100 epoch_Time:26556.0min: [2024-01-04 11:48:11,627][model8_pretrain.py][INFO] Epoch:[0/2](398900/4588595) loss:2.795 lr:0.0000100 epoch_Time:26556.0min: [2024-01-04 11:48:11,627][model8_pretrain.py][INFO] Epoch:[0/2](398900/4588595) loss:2.688 lr:0.0000100 epoch_Time:26556.0min: [2024-01-04 11:48:11,627][model8_pretrain.py][INFO] Epoch:[0/2](398900/4588595) loss:3.049 lr:0.0000100 epoch_Time:26556.0min: [2024-01-04 11:48:11,627][model8_pretrain.py][INFO] Epoch:[0/2](398900/4588595) loss:2.240 lr:0.0000100 epoch_Time:26556.0min: [2024-01-04 11:48:48,567][model8_pretrain.py][INFO] Epoch:[0/2](399000/4588595) loss:3.078 lr:0.0000100 epoch_Time:26555.0min: [2024-01-04 11:48:48,567][model8_pretrain.py][INFO] Epoch:[0/2](399000/4588595) loss:2.873 lr:0.0000100 
epoch_Time:26555.0min: [2024-01-04 11:48:48,567][model8_pretrain.py][INFO] Epoch:[0/2](399000/4588595) loss:2.753 lr:0.0000100 epoch_Time:26555.0min: [2024-01-04 11:48:48,567][model8_pretrain.py][INFO] Epoch:[0/2](399000/4588595) loss:2.535 lr:0.0000100 epoch_Time:26555.0min: [2024-01-04 11:48:48,567][model8_pretrain.py][INFO] Epoch:[0/2](399000/4588595) loss:2.655 lr:0.0000100 epoch_Time:26555.0min: [2024-01-04 11:48:48,567][model8_pretrain.py][INFO] Epoch:[0/2](399000/4588595) loss:2.618 lr:0.0000100 epoch_Time:26555.0min: [2024-01-04 11:48:48,567][model8_pretrain.py][INFO] Epoch:[0/2](399000/4588595) loss:2.714 lr:0.0000100 epoch_Time:26555.0min: [2024-01-04 11:48:48,567][model8_pretrain.py][INFO] Epoch:[0/2](399000/4588595) loss:3.314 lr:0.0000100 epoch_Time:26555.0min: [2024-01-04 11:49:25,512][model8_pretrain.py][INFO] Epoch:[0/2](399100/4588595) loss:3.164 lr:0.0000100 epoch_Time:26555.0min: [2024-01-04 11:49:25,512][model8_pretrain.py][INFO] Epoch:[0/2](399100/4588595) loss:2.923 lr:0.0000100 epoch_Time:26555.0min: [2024-01-04 11:49:25,512][model8_pretrain.py][INFO] Epoch:[0/2](399100/4588595) loss:3.094 lr:0.0000100 epoch_Time:26555.0min: [2024-01-04 11:49:25,512][model8_pretrain.py][INFO] Epoch:[0/2](399100/4588595) loss:2.754 lr:0.0000100 epoch_Time:26555.0min: [2024-01-04 11:49:25,512][model8_pretrain.py][INFO] Epoch:[0/2](399100/4588595) loss:2.720 lr:0.0000100 epoch_Time:26555.0min: [2024-01-04 11:49:25,512][model8_pretrain.py][INFO] Epoch:[0/2](399100/4588595) loss:2.981 lr:0.0000100 epoch_Time:26555.0min: [2024-01-04 11:49:25,512][model8_pretrain.py][INFO] Epoch:[0/2](399100/4588595) loss:2.651 lr:0.0000100 epoch_Time:26555.0min: [2024-01-04 11:49:25,512][model8_pretrain.py][INFO] Epoch:[0/2](399100/4588595) loss:2.874 lr:0.0000100 epoch_Time:26555.0min: [2024-01-04 11:50:11,245][model8_pretrain.py][INFO] Epoch:[0/2](399200/4588595) loss:2.851 lr:0.0000100 epoch_Time:26555.0min: [2024-01-04 11:50:11,245][model8_pretrain.py][INFO] 
Epoch:[0/2](399200/4588595) loss:3.280 lr:0.0000100 epoch_Time:26555.0min: [2024-01-04 11:50:11,245][model8_pretrain.py][INFO] Epoch:[0/2](399200/4588595) loss:2.888 lr:0.0000100 epoch_Time:26555.0min: [2024-01-04 11:50:11,245][model8_pretrain.py][INFO] Epoch:[0/2](399200/4588595) loss:2.509 lr:0.0000100 epoch_Time:26555.0min: [2024-01-04 11:50:11,245][model8_pretrain.py][INFO] Epoch:[0/2](399200/4588595) loss:2.767 lr:0.0000100 epoch_Time:26555.0min: [2024-01-04 11:50:11,245][model8_pretrain.py][INFO] Epoch:[0/2](399200/4588595) loss:3.276 lr:0.0000100 epoch_Time:26555.0min: [2024-01-04 11:50:11,245][model8_pretrain.py][INFO] Epoch:[0/2](399200/4588595) loss:3.186 lr:0.0000100 epoch_Time:26555.0min: [2024-01-04 11:50:11,245][model8_pretrain.py][INFO] Epoch:[0/2](399200/4588595) loss:2.671 lr:0.0000100 epoch_Time:26555.0min: [2024-01-04 11:50:48,174][model8_pretrain.py][INFO] Epoch:[0/2](399300/4588595) loss:2.447 lr:0.0000100 epoch_Time:26554.0min: [2024-01-04 11:50:48,174][model8_pretrain.py][INFO] Epoch:[0/2](399300/4588595) loss:2.701 lr:0.0000100 epoch_Time:26554.0min: [2024-01-04 11:50:48,174][model8_pretrain.py][INFO] Epoch:[0/2](399300/4588595) loss:3.075 lr:0.0000100 epoch_Time:26554.0min: [2024-01-04 11:50:48,175][model8_pretrain.py][INFO] Epoch:[0/2](399300/4588595) loss:2.886 lr:0.0000100 epoch_Time:26554.0min: [2024-01-04 11:50:48,176][model8_pretrain.py][INFO] Epoch:[0/2](399300/4588595) loss:2.391 lr:0.0000100 epoch_Time:26554.0min: [2024-01-04 11:50:48,176][model8_pretrain.py][INFO] Epoch:[0/2](399300/4588595) loss:2.988 lr:0.0000100 epoch_Time:26554.0min: [2024-01-04 11:50:48,176][model8_pretrain.py][INFO] Epoch:[0/2](399300/4588595) loss:2.870 lr:0.0000100 epoch_Time:26554.0min: [2024-01-04 11:50:48,176][model8_pretrain.py][INFO] Epoch:[0/2](399300/4588595) loss:2.977 lr:0.0000100 epoch_Time:26554.0min: [2024-01-04 11:51:25,136][model8_pretrain.py][INFO] Epoch:[0/2](399400/4588595) loss:2.927 lr:0.0000100 epoch_Time:26554.0min: [2024-01-04 
11:51:25,136][model8_pretrain.py][INFO] Epoch:[0/2](399400/4588595) loss:2.758 lr:0.0000100 epoch_Time:26554.0min: [2024-01-04 11:51:25,136][model8_pretrain.py][INFO] Epoch:[0/2](399400/4588595) loss:2.957 lr:0.0000100 epoch_Time:26554.0min: [2024-01-04 11:51:25,136][model8_pretrain.py][INFO] Epoch:[0/2](399400/4588595) loss:2.779 lr:0.0000100 epoch_Time:26554.0min: [2024-01-04 11:51:25,136][model8_pretrain.py][INFO] Epoch:[0/2](399400/4588595) loss:2.786 lr:0.0000100 epoch_Time:26554.0min: [2024-01-04 11:51:25,136][model8_pretrain.py][INFO] Epoch:[0/2](399400/4588595) loss:2.979 lr:0.0000100 epoch_Time:26554.0min: [2024-01-04 11:51:25,136][model8_pretrain.py][INFO] Epoch:[0/2](399400/4588595) loss:2.742 lr:0.0000100 epoch_Time:26554.0min: [2024-01-04 11:51:25,137][model8_pretrain.py][INFO] Epoch:[0/2](399400/4588595) loss:2.672 lr:0.0000100 epoch_Time:26554.0min: [2024-01-04 11:52:02,091][model8_pretrain.py][INFO] Epoch:[0/2](399500/4588595) loss:2.649 lr:0.0000100 epoch_Time:26552.0min: [2024-01-04 11:52:02,091][model8_pretrain.py][INFO] Epoch:[0/2](399500/4588595) loss:2.988 lr:0.0000100 epoch_Time:26552.0min: [2024-01-04 11:52:02,091][model8_pretrain.py][INFO] Epoch:[0/2](399500/4588595) loss:3.185 lr:0.0000100 epoch_Time:26552.0min: [2024-01-04 11:52:02,091][model8_pretrain.py][INFO] Epoch:[0/2](399500/4588595) loss:2.699 lr:0.0000100 epoch_Time:26552.0min: [2024-01-04 11:52:02,091][model8_pretrain.py][INFO] Epoch:[0/2](399500/4588595) loss:3.085 lr:0.0000100 epoch_Time:26552.0min: [2024-01-04 11:52:02,091][model8_pretrain.py][INFO] Epoch:[0/2](399500/4588595) loss:3.296 lr:0.0000100 epoch_Time:26552.0min: [2024-01-04 11:52:02,091][model8_pretrain.py][INFO] Epoch:[0/2](399500/4588595) loss:2.786 lr:0.0000100 epoch_Time:26552.0min: [2024-01-04 11:52:02,091][model8_pretrain.py][INFO] Epoch:[0/2](399500/4588595) loss:3.018 lr:0.0000100 epoch_Time:26552.0min: [2024-01-04 11:52:39,041][model8_pretrain.py][INFO] Epoch:[0/2](399600/4588595) loss:3.203 lr:0.0000100 
epoch_Time:26552.0min: [2024-01-04 11:52:39,041][model8_pretrain.py][INFO] Epoch:[0/2](399600/4588595) loss:3.344 lr:0.0000100 epoch_Time:26552.0min: [2024-01-04 11:52:39,041][model8_pretrain.py][INFO] Epoch:[0/2](399600/4588595) loss:2.604 lr:0.0000100 epoch_Time:26552.0min: [2024-01-04 11:52:39,041][model8_pretrain.py][INFO] Epoch:[0/2](399600/4588595) loss:2.917 lr:0.0000100 epoch_Time:26552.0min: [2024-01-04 11:52:39,041][model8_pretrain.py][INFO] Epoch:[0/2](399600/4588595) loss:3.471 lr:0.0000100 epoch_Time:26552.0min: [2024-01-04 11:52:39,041][model8_pretrain.py][INFO] Epoch:[0/2](399600/4588595) loss:2.858 lr:0.0000100 epoch_Time:26552.0min: [2024-01-04 11:52:39,041][model8_pretrain.py][INFO] Epoch:[0/2](399600/4588595) loss:2.658 lr:0.0000100 epoch_Time:26552.0min: [2024-01-04 11:52:39,041][model8_pretrain.py][INFO] Epoch:[0/2](399600/4588595) loss:2.292 lr:0.0000100 epoch_Time:26552.0min: [2024-01-04 11:53:16,011][model8_pretrain.py][INFO] Epoch:[0/2](399700/4588595) loss:2.519 lr:0.0000100 epoch_Time:26551.0min: [2024-01-04 11:53:16,011][model8_pretrain.py][INFO] Epoch:[0/2](399700/4588595) loss:2.877 lr:0.0000100 epoch_Time:26551.0min: [2024-01-04 11:53:16,011][model8_pretrain.py][INFO] Epoch:[0/2](399700/4588595) loss:3.016 lr:0.0000100 epoch_Time:26551.0min: [2024-01-04 11:53:16,011][model8_pretrain.py][INFO] Epoch:[0/2](399700/4588595) loss:3.071 lr:0.0000100 epoch_Time:26551.0min: [2024-01-04 11:53:16,011][model8_pretrain.py][INFO] Epoch:[0/2](399700/4588595) loss:2.861 lr:0.0000100 epoch_Time:26551.0min: [2024-01-04 11:53:16,011][model8_pretrain.py][INFO] Epoch:[0/2](399700/4588595) loss:2.453 lr:0.0000100 epoch_Time:26551.0min: [2024-01-04 11:53:16,011][model8_pretrain.py][INFO] Epoch:[0/2](399700/4588595) loss:2.900 lr:0.0000100 epoch_Time:26551.0min: [2024-01-04 11:53:16,011][model8_pretrain.py][INFO] Epoch:[0/2](399700/4588595) loss:3.023 lr:0.0000100 epoch_Time:26551.0min: [2024-01-04 11:53:52,965][model8_pretrain.py][INFO] 
Epoch:[0/2](399800/4588595) loss:2.700 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:53:52,965][model8_pretrain.py][INFO] Epoch:[0/2](399800/4588595) loss:3.016 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:53:52,965][model8_pretrain.py][INFO] Epoch:[0/2](399800/4588595) loss:2.640 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:53:52,965][model8_pretrain.py][INFO] Epoch:[0/2](399800/4588595) loss:2.826 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:53:52,965][model8_pretrain.py][INFO] Epoch:[0/2](399800/4588595) loss:2.682 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:53:52,965][model8_pretrain.py][INFO] Epoch:[0/2](399800/4588595) loss:2.738 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:53:52,965][model8_pretrain.py][INFO] Epoch:[0/2](399800/4588595) loss:2.764 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:53:52,965][model8_pretrain.py][INFO] Epoch:[0/2](399800/4588595) loss:2.639 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:54:29,927][model8_pretrain.py][INFO] Epoch:[0/2](399900/4588595) loss:2.715 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:54:29,927][model8_pretrain.py][INFO] Epoch:[0/2](399900/4588595) loss:3.115 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:54:29,927][model8_pretrain.py][INFO] Epoch:[0/2](399900/4588595) loss:2.992 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:54:29,927][model8_pretrain.py][INFO] Epoch:[0/2](399900/4588595) loss:2.517 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:54:29,927][model8_pretrain.py][INFO] Epoch:[0/2](399900/4588595) loss:2.960 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:54:29,927][model8_pretrain.py][INFO] Epoch:[0/2](399900/4588595) loss:2.627 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:54:29,928][model8_pretrain.py][INFO] Epoch:[0/2](399900/4588595) loss:2.734 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:54:29,928][model8_pretrain.py][INFO] Epoch:[0/2](399900/4588595) loss:2.734 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 
11:55:15,528][model8_pretrain.py][INFO] Epoch:[0/2](400000/4588595) loss:2.834 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:55:15,528][model8_pretrain.py][INFO] Epoch:[0/2](400000/4588595) loss:2.825 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:55:15,528][model8_pretrain.py][INFO] Epoch:[0/2](400000/4588595) loss:3.221 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:55:15,528][model8_pretrain.py][INFO] Epoch:[0/2](400000/4588595) loss:2.368 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:55:15,528][model8_pretrain.py][INFO] Epoch:[0/2](400000/4588595) loss:3.297 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:55:15,528][model8_pretrain.py][INFO] Epoch:[0/2](400000/4588595) loss:2.359 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:55:15,529][model8_pretrain.py][INFO] Epoch:[0/2](400000/4588595) loss:2.789 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:55:15,529][model8_pretrain.py][INFO] Epoch:[0/2](400000/4588595) loss:2.609 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:55:35,300][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_400000.pth [2024-01-04 11:55:35,300][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_400000.pth [2024-01-04 11:55:35,300][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_400000.pth [2024-01-04 11:55:35,300][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_400000.pth [2024-01-04 11:55:35,300][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_400000.pth [2024-01-04 11:55:35,300][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_400000.pth [2024-01-04 11:55:35,302][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_400000.pth [2024-01-04 11:55:35,303][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_400000.pth [2024-01-04 11:56:12,244][model8_pretrain.py][INFO] Epoch:[0/2](400100/4588595) loss:2.700 lr:0.0000100 
epoch_Time:26553.0min: [2024-01-04 11:56:12,244][model8_pretrain.py][INFO] Epoch:[0/2](400100/4588595) loss:3.053 lr:0.0000100 epoch_Time:26553.0min: [2024-01-04 11:56:12,244][model8_pretrain.py][INFO] Epoch:[0/2](400100/4588595) loss:3.094 lr:0.0000100 epoch_Time:26553.0min: [2024-01-04 11:56:12,244][model8_pretrain.py][INFO] Epoch:[0/2](400100/4588595) loss:3.183 lr:0.0000100 epoch_Time:26553.0min: [2024-01-04 11:56:12,244][model8_pretrain.py][INFO] Epoch:[0/2](400100/4588595) loss:2.997 lr:0.0000100 epoch_Time:26553.0min: [2024-01-04 11:56:12,244][model8_pretrain.py][INFO] Epoch:[0/2](400100/4588595) loss:3.163 lr:0.0000100 epoch_Time:26553.0min: [2024-01-04 11:56:12,244][model8_pretrain.py][INFO] Epoch:[0/2](400100/4588595) loss:2.520 lr:0.0000100 epoch_Time:26553.0min: [2024-01-04 11:56:12,244][model8_pretrain.py][INFO] Epoch:[0/2](400100/4588595) loss:3.000 lr:0.0000100 epoch_Time:26553.0min: [2024-01-04 11:56:49,170][model8_pretrain.py][INFO] Epoch:[0/2](400200/4588595) loss:2.230 lr:0.0000100 epoch_Time:26551.0min: [2024-01-04 11:56:49,170][model8_pretrain.py][INFO] Epoch:[0/2](400200/4588595) loss:3.208 lr:0.0000100 epoch_Time:26551.0min: [2024-01-04 11:56:49,170][model8_pretrain.py][INFO] Epoch:[0/2](400200/4588595) loss:2.868 lr:0.0000100 epoch_Time:26551.0min: [2024-01-04 11:56:49,170][model8_pretrain.py][INFO] Epoch:[0/2](400200/4588595) loss:3.012 lr:0.0000100 epoch_Time:26551.0min: [2024-01-04 11:56:49,171][model8_pretrain.py][INFO] Epoch:[0/2](400200/4588595) loss:2.952 lr:0.0000100 epoch_Time:26551.0min: [2024-01-04 11:56:49,171][model8_pretrain.py][INFO] Epoch:[0/2](400200/4588595) loss:3.082 lr:0.0000100 epoch_Time:26551.0min: [2024-01-04 11:56:49,171][model8_pretrain.py][INFO] Epoch:[0/2](400200/4588595) loss:2.670 lr:0.0000100 epoch_Time:26551.0min: [2024-01-04 11:56:49,171][model8_pretrain.py][INFO] Epoch:[0/2](400200/4588595) loss:2.901 lr:0.0000100 epoch_Time:26551.0min: [2024-01-04 11:57:26,097][model8_pretrain.py][INFO] 
Epoch:[0/2](400300/4588595) loss:2.971 lr:0.0000100 epoch_Time:26551.0min: [2024-01-04 11:57:26,097][model8_pretrain.py][INFO] Epoch:[0/2](400300/4588595) loss:2.527 lr:0.0000100 epoch_Time:26551.0min: [2024-01-04 11:57:26,097][model8_pretrain.py][INFO] Epoch:[0/2](400300/4588595) loss:2.472 lr:0.0000100 epoch_Time:26551.0min: [2024-01-04 11:57:26,097][model8_pretrain.py][INFO] Epoch:[0/2](400300/4588595) loss:3.312 lr:0.0000100 epoch_Time:26551.0min: [2024-01-04 11:57:26,097][model8_pretrain.py][INFO] Epoch:[0/2](400300/4588595) loss:3.158 lr:0.0000100 epoch_Time:26551.0min: [2024-01-04 11:57:26,097][model8_pretrain.py][INFO] Epoch:[0/2](400300/4588595) loss:2.941 lr:0.0000100 epoch_Time:26551.0min: [2024-01-04 11:57:26,097][model8_pretrain.py][INFO] Epoch:[0/2](400300/4588595) loss:2.827 lr:0.0000100 epoch_Time:26551.0min: [2024-01-04 11:57:26,097][model8_pretrain.py][INFO] Epoch:[0/2](400300/4588595) loss:2.700 lr:0.0000100 epoch_Time:26551.0min: [2024-01-04 11:58:03,038][model8_pretrain.py][INFO] Epoch:[0/2](400400/4588595) loss:2.413 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:58:03,038][model8_pretrain.py][INFO] Epoch:[0/2](400400/4588595) loss:2.833 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:58:03,038][model8_pretrain.py][INFO] Epoch:[0/2](400400/4588595) loss:3.043 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:58:03,038][model8_pretrain.py][INFO] Epoch:[0/2](400400/4588595) loss:3.192 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:58:03,039][model8_pretrain.py][INFO] Epoch:[0/2](400400/4588595) loss:2.535 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:58:03,039][model8_pretrain.py][INFO] Epoch:[0/2](400400/4588595) loss:2.911 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:58:03,039][model8_pretrain.py][INFO] Epoch:[0/2](400400/4588595) loss:3.326 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:58:03,039][model8_pretrain.py][INFO] Epoch:[0/2](400400/4588595) loss:3.170 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 
11:58:39,973][model8_pretrain.py][INFO] Epoch:[0/2](400500/4588595) loss:2.844 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:58:39,973][model8_pretrain.py][INFO] Epoch:[0/2](400500/4588595) loss:2.728 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:58:39,973][model8_pretrain.py][INFO] Epoch:[0/2](400500/4588595) loss:2.490 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:58:39,973][model8_pretrain.py][INFO] Epoch:[0/2](400500/4588595) loss:2.923 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:58:39,973][model8_pretrain.py][INFO] Epoch:[0/2](400500/4588595) loss:1.746 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:58:39,973][model8_pretrain.py][INFO] Epoch:[0/2](400500/4588595) loss:3.155 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:58:39,974][model8_pretrain.py][INFO] Epoch:[0/2](400500/4588595) loss:3.246 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:58:39,974][model8_pretrain.py][INFO] Epoch:[0/2](400500/4588595) loss:2.684 lr:0.0000100 epoch_Time:26550.0min: [2024-01-04 11:59:16,919][model8_pretrain.py][INFO] Epoch:[0/2](400600/4588595) loss:3.202 lr:0.0000100 epoch_Time:26549.0min: [2024-01-04 11:59:16,919][model8_pretrain.py][INFO] Epoch:[0/2](400600/4588595) loss:2.996 lr:0.0000100 epoch_Time:26549.0min: [2024-01-04 11:59:16,919][model8_pretrain.py][INFO] Epoch:[0/2](400600/4588595) loss:2.637 lr:0.0000100 epoch_Time:26549.0min: [2024-01-04 11:59:16,919][model8_pretrain.py][INFO] Epoch:[0/2](400600/4588595) loss:2.689 lr:0.0000100 epoch_Time:26549.0min: [2024-01-04 11:59:16,919][model8_pretrain.py][INFO] Epoch:[0/2](400600/4588595) loss:2.777 lr:0.0000100 epoch_Time:26549.0min: [2024-01-04 11:59:16,919][model8_pretrain.py][INFO] Epoch:[0/2](400600/4588595) loss:2.613 lr:0.0000100 epoch_Time:26549.0min: [2024-01-04 11:59:16,919][model8_pretrain.py][INFO] Epoch:[0/2](400600/4588595) loss:2.420 lr:0.0000100 epoch_Time:26549.0min: [2024-01-04 11:59:16,919][model8_pretrain.py][INFO] Epoch:[0/2](400600/4588595) loss:3.050 lr:0.0000100 
epoch_Time:26549.0min: [2024-01-04 11:59:53,869][model8_pretrain.py][INFO] Epoch:[0/2](400700/4588595) loss:2.911 lr:0.0000100 epoch_Time:26547.0min: [2024-01-04 11:59:53,868][model8_pretrain.py][INFO] Epoch:[0/2](400700/4588595) loss:2.798 lr:0.0000100 epoch_Time:26547.0min: [2024-01-04 11:59:53,869][model8_pretrain.py][INFO] Epoch:[0/2](400700/4588595) loss:2.846 lr:0.0000100 epoch_Time:26547.0min: [2024-01-04 11:59:53,869][model8_pretrain.py][INFO] Epoch:[0/2](400700/4588595) loss:3.107 lr:0.0000100 epoch_Time:26547.0min: [2024-01-04 11:59:53,869][model8_pretrain.py][INFO] Epoch:[0/2](400700/4588595) loss:2.941 lr:0.0000100 epoch_Time:26547.0min: [2024-01-04 11:59:53,869][model8_pretrain.py][INFO] Epoch:[0/2](400700/4588595) loss:2.305 lr:0.0000100 epoch_Time:26547.0min: [2024-01-04 11:59:53,869][model8_pretrain.py][INFO] Epoch:[0/2](400700/4588595) loss:3.160 lr:0.0000100 epoch_Time:26547.0min: [2024-01-04 11:59:53,869][model8_pretrain.py][INFO] Epoch:[0/2](400700/4588595) loss:2.573 lr:0.0000100 epoch_Time:26547.0min: [2024-01-04 12:00:41,102][model8_pretrain.py][INFO] Epoch:[0/2](400800/4588595) loss:2.138 lr:0.0000100 epoch_Time:26549.0min: [2024-01-04 12:00:41,102][model8_pretrain.py][INFO] Epoch:[0/2](400800/4588595) loss:3.281 lr:0.0000100 epoch_Time:26549.0min: [2024-01-04 12:00:41,102][model8_pretrain.py][INFO] Epoch:[0/2](400800/4588595) loss:3.011 lr:0.0000100 epoch_Time:26549.0min: [2024-01-04 12:00:41,102][model8_pretrain.py][INFO] Epoch:[0/2](400800/4588595) loss:2.788 lr:0.0000100 epoch_Time:26549.0min: [2024-01-04 12:00:41,102][model8_pretrain.py][INFO] Epoch:[0/2](400800/4588595) loss:2.751 lr:0.0000100 epoch_Time:26549.0min: [2024-01-04 12:00:41,102][model8_pretrain.py][INFO] Epoch:[0/2](400800/4588595) loss:2.368 lr:0.0000100 epoch_Time:26549.0min: [2024-01-04 12:00:41,102][model8_pretrain.py][INFO] Epoch:[0/2](400800/4588595) loss:2.920 lr:0.0000100 epoch_Time:26549.0min: [2024-01-04 12:00:41,102][model8_pretrain.py][INFO] 
Epoch:[0/2](400800/4588595) loss:2.725 lr:0.0000100 epoch_Time:26549.0min: [2024-01-04 12:01:18,000][model8_pretrain.py][INFO] Epoch:[0/2](400900/4588595) loss:2.987 lr:0.0000100 epoch_Time:26548.0min: [2024-01-04 12:01:18,000][model8_pretrain.py][INFO] Epoch:[0/2](400900/4588595) loss:2.725 lr:0.0000100 epoch_Time:26548.0min: [2024-01-04 12:01:18,000][model8_pretrain.py][INFO] Epoch:[0/2](400900/4588595) loss:2.418 lr:0.0000100 epoch_Time:26548.0min: [2024-01-04 12:01:18,000][model8_pretrain.py][INFO] Epoch:[0/2](400900/4588595) loss:2.627 lr:0.0000100 epoch_Time:26548.0min: [2024-01-04 12:01:18,000][model8_pretrain.py][INFO] Epoch:[0/2](400900/4588595) loss:2.443 lr:0.0000100 epoch_Time:26548.0min: [2024-01-04 12:01:18,000][model8_pretrain.py][INFO] Epoch:[0/2](400900/4588595) loss:2.919 lr:0.0000100 epoch_Time:26548.0min: [2024-01-04 12:01:18,000][model8_pretrain.py][INFO] Epoch:[0/2](400900/4588595) loss:2.853 lr:0.0000100 epoch_Time:26548.0min: [2024-01-04 12:01:18,001][model8_pretrain.py][INFO] Epoch:[0/2](400900/4588595) loss:2.477 lr:0.0000100 epoch_Time:26548.0min: [2024-01-04 12:01:54,934][model8_pretrain.py][INFO] Epoch:[0/2](401000/4588595) loss:2.950 lr:0.0000100 epoch_Time:26547.0min: [2024-01-04 12:01:54,934][model8_pretrain.py][INFO] Epoch:[0/2](401000/4588595) loss:2.719 lr:0.0000100 epoch_Time:26547.0min: [2024-01-04 12:01:54,934][model8_pretrain.py][INFO] Epoch:[0/2](401000/4588595) loss:2.603 lr:0.0000100 epoch_Time:26547.0min: [2024-01-04 12:01:54,934][model8_pretrain.py][INFO] Epoch:[0/2](401000/4588595) loss:2.476 lr:0.0000100 epoch_Time:26547.0min: [2024-01-04 12:01:54,934][model8_pretrain.py][INFO] Epoch:[0/2](401000/4588595) loss:2.900 lr:0.0000100 epoch_Time:26547.0min: [2024-01-04 12:01:54,934][model8_pretrain.py][INFO] Epoch:[0/2](401000/4588595) loss:2.823 lr:0.0000100 epoch_Time:26547.0min: [2024-01-04 12:01:54,934][model8_pretrain.py][INFO] Epoch:[0/2](401000/4588595) loss:3.136 lr:0.0000100 epoch_Time:26547.0min: [2024-01-04 
12:01:54,934][model8_pretrain.py][INFO] Epoch:[0/2](401000/4588595) loss:3.284 lr:0.0000100 epoch_Time:26547.0min: [2024-01-04 12:02:31,880][model8_pretrain.py][INFO] Epoch:[0/2](401100/4588595) loss:3.122 lr:0.0000100 epoch_Time:26546.0min: [2024-01-04 12:02:31,880][model8_pretrain.py][INFO] Epoch:[0/2](401100/4588595) loss:2.553 lr:0.0000100 epoch_Time:26546.0min: [2024-01-04 12:02:31,880][model8_pretrain.py][INFO] Epoch:[0/2](401100/4588595) loss:2.609 lr:0.0000100 epoch_Time:26546.0min: [2024-01-04 12:02:31,880][model8_pretrain.py][INFO] Epoch:[0/2](401100/4588595) loss:3.148 lr:0.0000100 epoch_Time:26546.0min: [2024-01-04 12:02:31,880][model8_pretrain.py][INFO] Epoch:[0/2](401100/4588595) loss:3.215 lr:0.0000100 epoch_Time:26546.0min: [2024-01-04 12:02:31,880][model8_pretrain.py][INFO] Epoch:[0/2](401100/4588595) loss:2.742 lr:0.0000100 epoch_Time:26546.0min: [2024-01-04 12:02:31,880][model8_pretrain.py][INFO] Epoch:[0/2](401100/4588595) loss:2.590 lr:0.0000100 epoch_Time:26546.0min: [2024-01-04 12:02:31,880][model8_pretrain.py][INFO] Epoch:[0/2](401100/4588595) loss:2.891 lr:0.0000100 epoch_Time:26546.0min: [2024-01-04 12:03:08,818][model8_pretrain.py][INFO] Epoch:[0/2](401200/4588595) loss:2.922 lr:0.0000100 epoch_Time:26545.0min: [2024-01-04 12:03:08,818][model8_pretrain.py][INFO] Epoch:[0/2](401200/4588595) loss:2.778 lr:0.0000100 epoch_Time:26545.0min: [2024-01-04 12:03:08,818][model8_pretrain.py][INFO] Epoch:[0/2](401200/4588595) loss:2.444 lr:0.0000100 epoch_Time:26545.0min: [2024-01-04 12:03:08,818][model8_pretrain.py][INFO] Epoch:[0/2](401200/4588595) loss:2.771 lr:0.0000100 epoch_Time:26545.0min: [2024-01-04 12:03:08,818][model8_pretrain.py][INFO] Epoch:[0/2](401200/4588595) loss:2.952 lr:0.0000100 epoch_Time:26545.0min: [2024-01-04 12:03:08,818][model8_pretrain.py][INFO] Epoch:[0/2](401200/4588595) loss:2.915 lr:0.0000100 epoch_Time:26545.0min: [2024-01-04 12:03:08,818][model8_pretrain.py][INFO] Epoch:[0/2](401200/4588595) loss:3.239 lr:0.0000100 
epoch_Time:26545.0min: [2024-01-04 12:03:08,818][model8_pretrain.py][INFO] Epoch:[0/2](401200/4588595) loss:3.145 lr:0.0000100 epoch_Time:26545.0min: [2024-01-04 12:03:45,752][model8_pretrain.py][INFO] Epoch:[0/2](401300/4588595) loss:2.871 lr:0.0000100 epoch_Time:26545.0min: [2024-01-04 12:03:45,752][model8_pretrain.py][INFO] Epoch:[0/2](401300/4588595) loss:3.209 lr:0.0000100 epoch_Time:26545.0min: [2024-01-04 12:03:45,752][model8_pretrain.py][INFO] Epoch:[0/2](401300/4588595) loss:2.680 lr:0.0000100 epoch_Time:26545.0min: [2024-01-04 12:03:45,752][model8_pretrain.py][INFO] Epoch:[0/2](401300/4588595) loss:2.768 lr:0.0000100 epoch_Time:26545.0min: [2024-01-04 12:03:45,752][model8_pretrain.py][INFO] Epoch:[0/2](401300/4588595) loss:1.900 lr:0.0000100 epoch_Time:26545.0min: [2024-01-04 12:03:45,752][model8_pretrain.py][INFO] Epoch:[0/2](401300/4588595) loss:2.950 lr:0.0000100 epoch_Time:26545.0min: [2024-01-04 12:03:45,752][model8_pretrain.py][INFO] Epoch:[0/2](401300/4588595) loss:3.099 lr:0.0000100 epoch_Time:26545.0min: [2024-01-04 12:03:45,753][model8_pretrain.py][INFO] Epoch:[0/2](401300/4588595) loss:2.578 lr:0.0000100 epoch_Time:26545.0min: [2024-01-04 12:04:22,692][model8_pretrain.py][INFO] Epoch:[0/2](401400/4588595) loss:3.046 lr:0.0000100 epoch_Time:26544.0min: [2024-01-04 12:04:22,692][model8_pretrain.py][INFO] Epoch:[0/2](401400/4588595) loss:2.830 lr:0.0000100 epoch_Time:26544.0min: [2024-01-04 12:04:22,692][model8_pretrain.py][INFO] Epoch:[0/2](401400/4588595) loss:2.561 lr:0.0000100 epoch_Time:26544.0min: [2024-01-04 12:04:22,692][model8_pretrain.py][INFO] Epoch:[0/2](401400/4588595) loss:3.261 lr:0.0000100 epoch_Time:26544.0min: [2024-01-04 12:04:22,692][model8_pretrain.py][INFO] Epoch:[0/2](401400/4588595) loss:3.055 lr:0.0000100 epoch_Time:26544.0min: [2024-01-04 12:04:22,692][model8_pretrain.py][INFO] Epoch:[0/2](401400/4588595) loss:3.144 lr:0.0000100 epoch_Time:26544.0min: [2024-01-04 12:04:22,693][model8_pretrain.py][INFO] 
Epoch:[0/2](401400/4588595) loss:3.294 lr:0.0000100 epoch_Time:26544.0min: [2024-01-04 12:04:22,692][model8_pretrain.py][INFO] Epoch:[0/2](401400/4588595) loss:2.107 lr:0.0000100 epoch_Time:26544.0min: [2024-01-04 12:04:59,635][model8_pretrain.py][INFO] Epoch:[0/2](401500/4588595) loss:2.950 lr:0.0000100 epoch_Time:26543.0min: [2024-01-04 12:04:59,635][model8_pretrain.py][INFO] Epoch:[0/2](401500/4588595) loss:2.396 lr:0.0000100 epoch_Time:26543.0min: [2024-01-04 12:04:59,635][model8_pretrain.py][INFO] Epoch:[0/2](401500/4588595) loss:3.006 lr:0.0000100 epoch_Time:26543.0min: [2024-01-04 12:04:59,635][model8_pretrain.py][INFO] Epoch:[0/2](401500/4588595) loss:2.618 lr:0.0000100 epoch_Time:26543.0min: [2024-01-04 12:04:59,635][model8_pretrain.py][INFO] Epoch:[0/2](401500/4588595) loss:3.264 lr:0.0000100 epoch_Time:26543.0min: [2024-01-04 12:04:59,635][model8_pretrain.py][INFO] Epoch:[0/2](401500/4588595) loss:3.455 lr:0.0000100 epoch_Time:26543.0min: [2024-01-04 12:04:59,635][model8_pretrain.py][INFO] Epoch:[0/2](401500/4588595) loss:2.610 lr:0.0000100 epoch_Time:26543.0min: [2024-01-04 12:04:59,635][model8_pretrain.py][INFO] Epoch:[0/2](401500/4588595) loss:3.050 lr:0.0000100 epoch_Time:26543.0min: [2024-01-04 12:05:46,809][model8_pretrain.py][INFO] Epoch:[0/2](401600/4588595) loss:2.138 lr:0.0000100 epoch_Time:26544.0min: [2024-01-04 12:05:46,809][model8_pretrain.py][INFO] Epoch:[0/2](401600/4588595) loss:2.619 lr:0.0000100 epoch_Time:26544.0min: [2024-01-04 12:05:46,809][model8_pretrain.py][INFO] Epoch:[0/2](401600/4588595) loss:3.425 lr:0.0000100 epoch_Time:26544.0min: [2024-01-04 12:05:46,809][model8_pretrain.py][INFO] Epoch:[0/2](401600/4588595) loss:2.725 lr:0.0000100 epoch_Time:26544.0min: [2024-01-04 12:05:46,809][model8_pretrain.py][INFO] Epoch:[0/2](401600/4588595) loss:2.965 lr:0.0000100 epoch_Time:26544.0min: [2024-01-04 12:05:46,809][model8_pretrain.py][INFO] Epoch:[0/2](401600/4588595) loss:3.120 lr:0.0000100 epoch_Time:26544.0min: [2024-01-04 
12:05:46,809][model8_pretrain.py][INFO] Epoch:[0/2](401600/4588595) loss:2.829 lr:0.0000100 epoch_Time:26544.0min: [2024-01-04 12:05:46,809][model8_pretrain.py][INFO] Epoch:[0/2](401600/4588595) loss:2.644 lr:0.0000100 epoch_Time:26544.0min: [2024-01-04 12:06:23,740][model8_pretrain.py][INFO] Epoch:[0/2](401700/4588595) loss:3.387 lr:0.0000100 epoch_Time:26543.0min: [2024-01-04 12:06:23,740][model8_pretrain.py][INFO] Epoch:[0/2](401700/4588595) loss:2.561 lr:0.0000100 epoch_Time:26543.0min: [2024-01-04 12:06:23,740][model8_pretrain.py][INFO] Epoch:[0/2](401700/4588595) loss:2.489 lr:0.0000100 epoch_Time:26543.0min: [2024-01-04 12:06:23,740][model8_pretrain.py][INFO] Epoch:[0/2](401700/4588595) loss:3.268 lr:0.0000100 epoch_Time:26543.0min: [2024-01-04 12:06:23,740][model8_pretrain.py][INFO] Epoch:[0/2](401700/4588595) loss:2.654 lr:0.0000100 epoch_Time:26543.0min: [2024-01-04 12:06:23,740][model8_pretrain.py][INFO] Epoch:[0/2](401700/4588595) loss:1.416 lr:0.0000100 epoch_Time:26543.0min: [2024-01-04 12:06:23,740][model8_pretrain.py][INFO] Epoch:[0/2](401700/4588595) loss:2.736 lr:0.0000100 epoch_Time:26543.0min: [2024-01-04 12:06:23,740][model8_pretrain.py][INFO] Epoch:[0/2](401700/4588595) loss:2.860 lr:0.0000100 epoch_Time:26543.0min: [2024-01-04 12:07:00,679][model8_pretrain.py][INFO] Epoch:[0/2](401800/4588595) loss:2.468 lr:0.0000100 epoch_Time:26542.0min: [2024-01-04 12:07:00,679][model8_pretrain.py][INFO] Epoch:[0/2](401800/4588595) loss:2.709 lr:0.0000100 epoch_Time:26542.0min: [2024-01-04 12:07:00,679][model8_pretrain.py][INFO] Epoch:[0/2](401800/4588595) loss:2.518 lr:0.0000100 epoch_Time:26542.0min: [2024-01-04 12:07:00,679][model8_pretrain.py][INFO] Epoch:[0/2](401800/4588595) loss:3.056 lr:0.0000100 epoch_Time:26542.0min: [2024-01-04 12:07:00,679][model8_pretrain.py][INFO] Epoch:[0/2](401800/4588595) loss:2.938 lr:0.0000100 epoch_Time:26542.0min: [2024-01-04 12:07:00,679][model8_pretrain.py][INFO] Epoch:[0/2](401800/4588595) loss:2.470 lr:0.0000100 
epoch_Time:26542.0min: [2024-01-04 12:07:00,679][model8_pretrain.py][INFO] Epoch:[0/2](401800/4588595) loss:2.965 lr:0.0000100 epoch_Time:26542.0min: [2024-01-04 12:07:00,679][model8_pretrain.py][INFO] Epoch:[0/2](401800/4588595) loss:2.913 lr:0.0000100 epoch_Time:26542.0min: [2024-01-04 12:07:37,613][model8_pretrain.py][INFO] Epoch:[0/2](401900/4588595) loss:3.450 lr:0.0000100 epoch_Time:26542.0min: [2024-01-04 12:07:37,613][model8_pretrain.py][INFO] Epoch:[0/2](401900/4588595) loss:2.780 lr:0.0000100 epoch_Time:26542.0min: [2024-01-04 12:07:37,613][model8_pretrain.py][INFO] Epoch:[0/2](401900/4588595) loss:2.828 lr:0.0000100 epoch_Time:26542.0min: [2024-01-04 12:07:37,613][model8_pretrain.py][INFO] Epoch:[0/2](401900/4588595) loss:2.642 lr:0.0000100 epoch_Time:26542.0min: [2024-01-04 12:07:37,613][model8_pretrain.py][INFO] Epoch:[0/2](401900/4588595) loss:2.867 lr:0.0000100 epoch_Time:26542.0min: [2024-01-04 12:07:37,613][model8_pretrain.py][INFO] Epoch:[0/2](401900/4588595) loss:2.967 lr:0.0000100 epoch_Time:26542.0min: [2024-01-04 12:07:37,613][model8_pretrain.py][INFO] Epoch:[0/2](401900/4588595) loss:3.084 lr:0.0000100 epoch_Time:26542.0min: [2024-01-04 12:07:37,613][model8_pretrain.py][INFO] Epoch:[0/2](401900/4588595) loss:3.154 lr:0.0000100 epoch_Time:26542.0min: [2024-01-04 12:08:14,552][model8_pretrain.py][INFO] Epoch:[0/2](402000/4588595) loss:2.527 lr:0.0000100 epoch_Time:26540.0min: [2024-01-04 12:08:14,552][model8_pretrain.py][INFO] Epoch:[0/2](402000/4588595) loss:2.390 lr:0.0000100 epoch_Time:26540.0min: [2024-01-04 12:08:14,552][model8_pretrain.py][INFO] Epoch:[0/2](402000/4588595) loss:2.953 lr:0.0000100 epoch_Time:26540.0min: [2024-01-04 12:08:14,552][model8_pretrain.py][INFO] Epoch:[0/2](402000/4588595) loss:3.364 lr:0.0000100 epoch_Time:26540.0min: [2024-01-04 12:08:14,552][model8_pretrain.py][INFO] Epoch:[0/2](402000/4588595) loss:2.834 lr:0.0000100 epoch_Time:26540.0min: [2024-01-04 12:08:14,553][model8_pretrain.py][INFO] 
Epoch:[0/2](402000/4588595) loss:2.406 lr:0.0000100 epoch_Time:26540.0min: [2024-01-04 12:08:14,553][model8_pretrain.py][INFO] Epoch:[0/2](402000/4588595) loss:2.587 lr:0.0000100 epoch_Time:26540.0min: [2024-01-04 12:08:14,553][model8_pretrain.py][INFO] Epoch:[0/2](402000/4588595) loss:2.839 lr:0.0000100 epoch_Time:26540.0min: [2024-01-04 12:08:51,490][model8_pretrain.py][INFO] Epoch:[0/2](402100/4588595) loss:3.058 lr:0.0000100 epoch_Time:26539.0min: [2024-01-04 12:08:51,490][model8_pretrain.py][INFO] Epoch:[0/2](402100/4588595) loss:2.961 lr:0.0000100 epoch_Time:26539.0min: [2024-01-04 12:08:51,490][model8_pretrain.py][INFO] Epoch:[0/2](402100/4588595) loss:3.228 lr:0.0000100 epoch_Time:26539.0min: [2024-01-04 12:08:51,490][model8_pretrain.py][INFO] Epoch:[0/2](402100/4588595) loss:2.947 lr:0.0000100 epoch_Time:26539.0min: [2024-01-04 12:08:51,490][model8_pretrain.py][INFO] Epoch:[0/2](402100/4588595) loss:3.162 lr:0.0000100 epoch_Time:26539.0min: [2024-01-04 12:08:51,490][model8_pretrain.py][INFO] Epoch:[0/2](402100/4588595) loss:2.861 lr:0.0000100 epoch_Time:26539.0min: [2024-01-04 12:08:51,490][model8_pretrain.py][INFO] Epoch:[0/2](402100/4588595) loss:2.246 lr:0.0000100 epoch_Time:26539.0min: [2024-01-04 12:08:51,491][model8_pretrain.py][INFO] Epoch:[0/2](402100/4588595) loss:2.578 lr:0.0000100 epoch_Time:26539.0min: [2024-01-04 12:09:28,435][model8_pretrain.py][INFO] Epoch:[0/2](402200/4588595) loss:3.143 lr:0.0000100 epoch_Time:26539.0min: [2024-01-04 12:09:28,435][model8_pretrain.py][INFO] Epoch:[0/2](402200/4588595) loss:3.044 lr:0.0000100 epoch_Time:26539.0min: [2024-01-04 12:09:28,435][model8_pretrain.py][INFO] Epoch:[0/2](402200/4588595) loss:3.592 lr:0.0000100 epoch_Time:26539.0min: [2024-01-04 12:09:28,435][model8_pretrain.py][INFO] Epoch:[0/2](402200/4588595) loss:3.015 lr:0.0000100 epoch_Time:26539.0min: [2024-01-04 12:09:28,435][model8_pretrain.py][INFO] Epoch:[0/2](402200/4588595) loss:2.245 lr:0.0000100 epoch_Time:26539.0min: [2024-01-04 
12:09:28,435][model8_pretrain.py][INFO] Epoch:[0/2](402200/4588595) loss:3.259 lr:0.0000100 epoch_Time:26539.0min: [2024-01-04 12:09:28,435][model8_pretrain.py][INFO] Epoch:[0/2](402200/4588595) loss:2.929 lr:0.0000100 epoch_Time:26539.0min: [2024-01-04 12:09:28,435][model8_pretrain.py][INFO] Epoch:[0/2](402200/4588595) loss:3.135 lr:0.0000100 epoch_Time:26539.0min: [2024-01-04 12:10:05,375][model8_pretrain.py][INFO] Epoch:[0/2](402300/4588595) loss:2.754 lr:0.0000100 epoch_Time:26538.0min: [2024-01-04 12:10:05,375][model8_pretrain.py][INFO] Epoch:[0/2](402300/4588595) loss:3.020 lr:0.0000100 epoch_Time:26538.0min: [2024-01-04 12:10:05,375][model8_pretrain.py][INFO] Epoch:[0/2](402300/4588595) loss:2.808 lr:0.0000100 epoch_Time:26538.0min: [2024-01-04 12:10:05,375][model8_pretrain.py][INFO] Epoch:[0/2](402300/4588595) loss:3.105 lr:0.0000100 epoch_Time:26538.0min: [2024-01-04 12:10:05,375][model8_pretrain.py][INFO] Epoch:[0/2](402300/4588595) loss:2.496 lr:0.0000100 epoch_Time:26538.0min: [2024-01-04 12:10:05,375][model8_pretrain.py][INFO] Epoch:[0/2](402300/4588595) loss:2.253 lr:0.0000100 epoch_Time:26538.0min: [2024-01-04 12:10:05,375][model8_pretrain.py][INFO] Epoch:[0/2](402300/4588595) loss:3.371 lr:0.0000100 epoch_Time:26538.0min: [2024-01-04 12:10:05,375][model8_pretrain.py][INFO] Epoch:[0/2](402300/4588595) loss:2.802 lr:0.0000100 epoch_Time:26538.0min: [2024-01-04 12:10:52,481][model8_pretrain.py][INFO] Epoch:[0/2](402400/4588595) loss:3.455 lr:0.0000100 epoch_Time:26539.0min: [2024-01-04 12:10:52,481][model8_pretrain.py][INFO] Epoch:[0/2](402400/4588595) loss:2.287 lr:0.0000100 epoch_Time:26539.0min: [2024-01-04 12:10:52,481][model8_pretrain.py][INFO] Epoch:[0/2](402400/4588595) loss:2.395 lr:0.0000100 epoch_Time:26539.0min: [2024-01-04 12:10:52,481][model8_pretrain.py][INFO] Epoch:[0/2](402400/4588595) loss:2.509 lr:0.0000100 epoch_Time:26539.0min: [2024-01-04 12:10:52,482][model8_pretrain.py][INFO] Epoch:[0/2](402400/4588595) loss:3.422 lr:0.0000100 
epoch_Time:26539.0min: [2024-01-04 12:10:52,482][model8_pretrain.py][INFO] Epoch:[0/2](402400/4588595) loss:2.761 lr:0.0000100 epoch_Time:26539.0min: [2024-01-04 12:10:52,482][model8_pretrain.py][INFO] Epoch:[0/2](402400/4588595) loss:3.146 lr:0.0000100 epoch_Time:26539.0min: [2024-01-04 12:10:52,482][model8_pretrain.py][INFO] Epoch:[0/2](402400/4588595) loss:2.615 lr:0.0000100 epoch_Time:26539.0min: [2024-01-04 12:11:29,413][model8_pretrain.py][INFO] Epoch:[0/2](402500/4588595) loss:2.246 lr:0.0000100 epoch_Time:26538.0min: [2024-01-04 12:11:29,413][model8_pretrain.py][INFO] Epoch:[0/2](402500/4588595) loss:3.474 lr:0.0000100 epoch_Time:26538.0min: [2024-01-04 12:11:29,413][model8_pretrain.py][INFO] Epoch:[0/2](402500/4588595) loss:3.151 lr:0.0000100 epoch_Time:26538.0min: [2024-01-04 12:11:29,413][model8_pretrain.py][INFO] Epoch:[0/2](402500/4588595) loss:2.824 lr:0.0000100 epoch_Time:26538.0min: [2024-01-04 12:11:29,413][model8_pretrain.py][INFO] Epoch:[0/2](402500/4588595) loss:2.801 lr:0.0000100 epoch_Time:26538.0min: [2024-01-04 12:11:29,413][model8_pretrain.py][INFO] Epoch:[0/2](402500/4588595) loss:3.090 lr:0.0000100 epoch_Time:26538.0min: [2024-01-04 12:11:29,414][model8_pretrain.py][INFO] Epoch:[0/2](402500/4588595) loss:2.863 lr:0.0000100 epoch_Time:26538.0min: [2024-01-04 12:11:29,414][model8_pretrain.py][INFO] Epoch:[0/2](402500/4588595) loss:2.655 lr:0.0000100 epoch_Time:26538.0min: [2024-01-04 12:12:06,354][model8_pretrain.py][INFO] Epoch:[0/2](402600/4588595) loss:2.692 lr:0.0000100 epoch_Time:26537.0min: [2024-01-04 12:12:06,355][model8_pretrain.py][INFO] Epoch:[0/2](402600/4588595) loss:3.165 lr:0.0000100 epoch_Time:26537.0min: [2024-01-04 12:12:06,355][model8_pretrain.py][INFO] Epoch:[0/2](402600/4588595) loss:2.909 lr:0.0000100 epoch_Time:26537.0min: [2024-01-04 12:12:06,355][model8_pretrain.py][INFO] Epoch:[0/2](402600/4588595) loss:3.058 lr:0.0000100 epoch_Time:26537.0min: [2024-01-04 12:12:06,355][model8_pretrain.py][INFO] 
Epoch:[0/2](402600/4588595) loss:2.522 lr:0.0000100 epoch_Time:26537.0min: [2024-01-04 12:12:06,355][model8_pretrain.py][INFO] Epoch:[0/2](402600/4588595) loss:2.924 lr:0.0000100 epoch_Time:26537.0min: [2024-01-04 12:12:06,355][model8_pretrain.py][INFO] Epoch:[0/2](402600/4588595) loss:2.765 lr:0.0000100 epoch_Time:26537.0min: [2024-01-04 12:12:06,355][model8_pretrain.py][INFO] Epoch:[0/2](402600/4588595) loss:3.095 lr:0.0000100 epoch_Time:26537.0min: [2024-01-04 12:12:43,307][model8_pretrain.py][INFO] Epoch:[0/2](402700/4588595) loss:2.728 lr:0.0000100 epoch_Time:26537.0min: [2024-01-04 12:12:43,307][model8_pretrain.py][INFO] Epoch:[0/2](402700/4588595) loss:2.851 lr:0.0000100 epoch_Time:26537.0min: [2024-01-04 12:12:43,307][model8_pretrain.py][INFO] Epoch:[0/2](402700/4588595) loss:2.960 lr:0.0000100 epoch_Time:26537.0min: [2024-01-04 12:12:43,307][model8_pretrain.py][INFO] Epoch:[0/2](402700/4588595) loss:3.083 lr:0.0000100 epoch_Time:26537.0min: [2024-01-04 12:12:43,307][model8_pretrain.py][INFO] Epoch:[0/2](402700/4588595) loss:2.779 lr:0.0000100 epoch_Time:26537.0min: [2024-01-04 12:12:43,307][model8_pretrain.py][INFO] Epoch:[0/2](402700/4588595) loss:2.861 lr:0.0000100 epoch_Time:26537.0min: [2024-01-04 12:12:43,307][model8_pretrain.py][INFO] Epoch:[0/2](402700/4588595) loss:2.948 lr:0.0000100 epoch_Time:26537.0min: [2024-01-04 12:12:43,308][model8_pretrain.py][INFO] Epoch:[0/2](402700/4588595) loss:2.546 lr:0.0000100 epoch_Time:26537.0min: [2024-01-04 12:13:20,228][model8_pretrain.py][INFO] Epoch:[0/2](402800/4588595) loss:3.094 lr:0.0000100 epoch_Time:26536.0min: [2024-01-04 12:13:20,229][model8_pretrain.py][INFO] Epoch:[0/2](402800/4588595) loss:2.540 lr:0.0000100 epoch_Time:26536.0min: [2024-01-04 12:13:20,229][model8_pretrain.py][INFO] Epoch:[0/2](402800/4588595) loss:3.010 lr:0.0000100 epoch_Time:26536.0min: [2024-01-04 12:13:20,229][model8_pretrain.py][INFO] Epoch:[0/2](402800/4588595) loss:3.202 lr:0.0000100 epoch_Time:26536.0min: [2024-01-04 
12:13:20,229][model8_pretrain.py][INFO] Epoch:[0/2](402800/4588595) loss:3.523 lr:0.0000100 epoch_Time:26536.0min: [2024-01-04 12:13:20,229][model8_pretrain.py][INFO] Epoch:[0/2](402800/4588595) loss:3.127 lr:0.0000100 epoch_Time:26536.0min: [2024-01-04 12:13:20,229][model8_pretrain.py][INFO] Epoch:[0/2](402800/4588595) loss:3.241 lr:0.0000100 epoch_Time:26536.0min: [2024-01-04 12:13:20,229][model8_pretrain.py][INFO] Epoch:[0/2](402800/4588595) loss:2.555 lr:0.0000100 epoch_Time:26536.0min: [2024-01-04 12:13:57,164][model8_pretrain.py][INFO] Epoch:[0/2](402900/4588595) loss:2.729 lr:0.0000100 epoch_Time:26535.0min: [2024-01-04 12:13:57,164][model8_pretrain.py][INFO] Epoch:[0/2](402900/4588595) loss:2.086 lr:0.0000100 epoch_Time:26535.0min: [2024-01-04 12:13:57,164][model8_pretrain.py][INFO] Epoch:[0/2](402900/4588595) loss:3.082 lr:0.0000100 epoch_Time:26535.0min: [2024-01-04 12:13:57,164][model8_pretrain.py][INFO] Epoch:[0/2](402900/4588595) loss:2.830 lr:0.0000100 epoch_Time:26535.0min: [2024-01-04 12:13:57,164][model8_pretrain.py][INFO] Epoch:[0/2](402900/4588595) loss:3.084 lr:0.0000100 epoch_Time:26535.0min: [2024-01-04 12:13:57,164][model8_pretrain.py][INFO] Epoch:[0/2](402900/4588595) loss:2.683 lr:0.0000100 epoch_Time:26535.0min: [2024-01-04 12:13:57,164][model8_pretrain.py][INFO] Epoch:[0/2](402900/4588595) loss:2.315 lr:0.0000100 epoch_Time:26535.0min: [2024-01-04 12:13:57,165][model8_pretrain.py][INFO] Epoch:[0/2](402900/4588595) loss:3.080 lr:0.0000100 epoch_Time:26535.0min: [2024-01-04 12:14:34,116][model8_pretrain.py][INFO] Epoch:[0/2](403000/4588595) loss:2.697 lr:0.0000100 epoch_Time:26534.0min: [2024-01-04 12:14:34,116][model8_pretrain.py][INFO] Epoch:[0/2](403000/4588595) loss:2.622 lr:0.0000100 epoch_Time:26534.0min: [2024-01-04 12:14:34,116][model8_pretrain.py][INFO] Epoch:[0/2](403000/4588595) loss:2.449 lr:0.0000100 epoch_Time:26534.0min: [2024-01-04 12:14:34,116][model8_pretrain.py][INFO] Epoch:[0/2](403000/4588595) loss:3.152 lr:0.0000100 
epoch_Time:26534.0min: [2024-01-04 12:14:34,116][model8_pretrain.py][INFO] Epoch:[0/2](403000/4588595) loss:2.696 lr:0.0000100 epoch_Time:26534.0min: [2024-01-04 12:14:34,116][model8_pretrain.py][INFO] Epoch:[0/2](403000/4588595) loss:3.125 lr:0.0000100 epoch_Time:26534.0min: [2024-01-04 12:14:34,116][model8_pretrain.py][INFO] Epoch:[0/2](403000/4588595) loss:2.861 lr:0.0000100 epoch_Time:26534.0min: [2024-01-04 12:14:34,116][model8_pretrain.py][INFO] Epoch:[0/2](403000/4588595) loss:3.159 lr:0.0000100 epoch_Time:26534.0min: [2024-01-04 12:15:11,063][model8_pretrain.py][INFO] Epoch:[0/2](403100/4588595) loss:2.610 lr:0.0000100 epoch_Time:26533.0min: [2024-01-04 12:15:11,063][model8_pretrain.py][INFO] Epoch:[0/2](403100/4588595) loss:2.859 lr:0.0000100 epoch_Time:26533.0min: [2024-01-04 12:15:11,063][model8_pretrain.py][INFO] Epoch:[0/2](403100/4588595) loss:2.524 lr:0.0000100 epoch_Time:26533.0min: [2024-01-04 12:15:11,063][model8_pretrain.py][INFO] Epoch:[0/2](403100/4588595) loss:1.945 lr:0.0000100 epoch_Time:26533.0min: [2024-01-04 12:15:11,063][model8_pretrain.py][INFO] Epoch:[0/2](403100/4588595) loss:2.588 lr:0.0000100 epoch_Time:26533.0min: [2024-01-04 12:15:11,064][model8_pretrain.py][INFO] Epoch:[0/2](403100/4588595) loss:2.660 lr:0.0000100 epoch_Time:26533.0min: [2024-01-04 12:15:11,064][model8_pretrain.py][INFO] Epoch:[0/2](403100/4588595) loss:3.162 lr:0.0000100 epoch_Time:26533.0min: [2024-01-04 12:15:11,064][model8_pretrain.py][INFO] Epoch:[0/2](403100/4588595) loss:2.791 lr:0.0000100 epoch_Time:26533.0min: [2024-01-04 12:15:58,201][model8_pretrain.py][INFO] Epoch:[0/2](403200/4588595) loss:2.972 lr:0.0000100 epoch_Time:26534.0min: [2024-01-04 12:15:58,201][model8_pretrain.py][INFO] Epoch:[0/2](403200/4588595) loss:3.074 lr:0.0000100 epoch_Time:26534.0min: [2024-01-04 12:15:58,201][model8_pretrain.py][INFO] Epoch:[0/2](403200/4588595) loss:2.353 lr:0.0000100 epoch_Time:26534.0min: [2024-01-04 12:15:58,201][model8_pretrain.py][INFO] 
Epoch:[0/2](403200/4588595) loss:2.669 lr:0.0000100 epoch_Time:26534.0min: [2024-01-04 12:15:58,201][model8_pretrain.py][INFO] Epoch:[0/2](403200/4588595) loss:2.942 lr:0.0000100 epoch_Time:26534.0min: [2024-01-04 12:15:58,201][model8_pretrain.py][INFO] Epoch:[0/2](403200/4588595) loss:2.460 lr:0.0000100 epoch_Time:26534.0min: [2024-01-04 12:15:58,201][model8_pretrain.py][INFO] Epoch:[0/2](403200/4588595) loss:2.601 lr:0.0000100 epoch_Time:26534.0min: [2024-01-04 12:15:58,201][model8_pretrain.py][INFO] Epoch:[0/2](403200/4588595) loss:2.802 lr:0.0000100 epoch_Time:26534.0min: [2024-01-04 12:16:35,128][model8_pretrain.py][INFO] Epoch:[0/2](403300/4588595) loss:3.346 lr:0.0000100 epoch_Time:26534.0min: [2024-01-04 12:16:35,129][model8_pretrain.py][INFO] Epoch:[0/2](403300/4588595) loss:2.639 lr:0.0000100 epoch_Time:26534.0min: [2024-01-04 12:16:35,129][model8_pretrain.py][INFO] Epoch:[0/2](403300/4588595) loss:2.522 lr:0.0000100 epoch_Time:26534.0min: [2024-01-04 12:16:35,129][model8_pretrain.py][INFO] Epoch:[0/2](403300/4588595) loss:2.869 lr:0.0000100 epoch_Time:26534.0min: [2024-01-04 12:16:35,129][model8_pretrain.py][INFO] Epoch:[0/2](403300/4588595) loss:2.361 lr:0.0000100 epoch_Time:26534.0min: [2024-01-04 12:16:35,129][model8_pretrain.py][INFO] Epoch:[0/2](403300/4588595) loss:2.433 lr:0.0000100 epoch_Time:26534.0min: [2024-01-04 12:16:35,129][model8_pretrain.py][INFO] Epoch:[0/2](403300/4588595) loss:3.230 lr:0.0000100 epoch_Time:26534.0min: [2024-01-04 12:16:35,129][model8_pretrain.py][INFO] Epoch:[0/2](403300/4588595) loss:2.453 lr:0.0000100 epoch_Time:26534.0min: [2024-01-04 12:17:12,063][model8_pretrain.py][INFO] Epoch:[0/2](403400/4588595) loss:3.009 lr:0.0000100 epoch_Time:26532.0min: [2024-01-04 12:17:12,063][model8_pretrain.py][INFO] Epoch:[0/2](403400/4588595) loss:2.619 lr:0.0000100 epoch_Time:26532.0min: [2024-01-04 12:17:12,063][model8_pretrain.py][INFO] Epoch:[0/2](403400/4588595) loss:3.231 lr:0.0000100 epoch_Time:26532.0min: [2024-01-04 
12:17:12,063][model8_pretrain.py][INFO] Epoch:[0/2](403400/4588595) loss:2.820 lr:0.0000100 epoch_Time:26532.0min: [2024-01-04 12:17:12,063][model8_pretrain.py][INFO] Epoch:[0/2](403400/4588595) loss:3.302 lr:0.0000100 epoch_Time:26532.0min: [2024-01-04 12:17:12,063][model8_pretrain.py][INFO] Epoch:[0/2](403400/4588595) loss:2.843 lr:0.0000100 epoch_Time:26532.0min: [2024-01-04 12:17:12,063][model8_pretrain.py][INFO] Epoch:[0/2](403400/4588595) loss:3.228 lr:0.0000100 epoch_Time:26532.0min: [2024-01-04 12:17:12,063][model8_pretrain.py][INFO] Epoch:[0/2](403400/4588595) loss:3.105 lr:0.0000100 epoch_Time:26532.0min: [2024-01-04 12:17:48,998][model8_pretrain.py][INFO] Epoch:[0/2](403500/4588595) loss:2.987 lr:0.0000100 epoch_Time:26531.0min: [2024-01-04 12:17:48,998][model8_pretrain.py][INFO] Epoch:[0/2](403500/4588595) loss:2.866 lr:0.0000100 epoch_Time:26531.0min: [2024-01-04 12:17:48,998][model8_pretrain.py][INFO] Epoch:[0/2](403500/4588595) loss:2.619 lr:0.0000100 epoch_Time:26531.0min: [2024-01-04 12:17:48,998][model8_pretrain.py][INFO] Epoch:[0/2](403500/4588595) loss:2.757 lr:0.0000100 epoch_Time:26531.0min: [2024-01-04 12:17:48,998][model8_pretrain.py][INFO] Epoch:[0/2](403500/4588595) loss:2.555 lr:0.0000100 epoch_Time:26531.0min: [2024-01-04 12:17:48,998][model8_pretrain.py][INFO] Epoch:[0/2](403500/4588595) loss:2.842 lr:0.0000100 epoch_Time:26531.0min: [2024-01-04 12:17:48,998][model8_pretrain.py][INFO] Epoch:[0/2](403500/4588595) loss:2.821 lr:0.0000100 epoch_Time:26531.0min: [2024-01-04 12:17:48,998][model8_pretrain.py][INFO] Epoch:[0/2](403500/4588595) loss:3.360 lr:0.0000100 epoch_Time:26531.0min: [2024-01-04 12:18:25,942][model8_pretrain.py][INFO] Epoch:[0/2](403600/4588595) loss:2.469 lr:0.0000100 epoch_Time:26531.0min: [2024-01-04 12:18:25,942][model8_pretrain.py][INFO] Epoch:[0/2](403600/4588595) loss:2.981 lr:0.0000100 epoch_Time:26531.0min: [2024-01-04 12:18:25,942][model8_pretrain.py][INFO] Epoch:[0/2](403600/4588595) loss:3.208 lr:0.0000100 
epoch_Time:26531.0min: [2024-01-04 12:18:25,942][model8_pretrain.py][INFO] Epoch:[0/2](403600/4588595) loss:3.005 lr:0.0000100 epoch_Time:26531.0min: [2024-01-04 12:18:25,942][model8_pretrain.py][INFO] Epoch:[0/2](403600/4588595) loss:2.567 lr:0.0000100 epoch_Time:26531.0min: [2024-01-04 12:18:25,942][model8_pretrain.py][INFO] Epoch:[0/2](403600/4588595) loss:2.674 lr:0.0000100 epoch_Time:26531.0min: [2024-01-04 12:18:25,942][model8_pretrain.py][INFO] Epoch:[0/2](403600/4588595) loss:3.457 lr:0.0000100 epoch_Time:26531.0min: [2024-01-04 12:18:25,943][model8_pretrain.py][INFO] Epoch:[0/2](403600/4588595) loss:2.617 lr:0.0000100 epoch_Time:26531.0min: [2024-01-04 12:19:02,879][model8_pretrain.py][INFO] Epoch:[0/2](403700/4588595) loss:3.124 lr:0.0000100 epoch_Time:26530.0min: [2024-01-04 12:19:02,879][model8_pretrain.py][INFO] Epoch:[0/2](403700/4588595) loss:2.194 lr:0.0000100 epoch_Time:26530.0min: [2024-01-04 12:19:02,879][model8_pretrain.py][INFO] Epoch:[0/2](403700/4588595) loss:2.888 lr:0.0000100 epoch_Time:26530.0min: [2024-01-04 12:19:02,879][model8_pretrain.py][INFO] Epoch:[0/2](403700/4588595) loss:2.838 lr:0.0000100 epoch_Time:26530.0min: [2024-01-04 12:19:02,879][model8_pretrain.py][INFO] Epoch:[0/2](403700/4588595) loss:2.791 lr:0.0000100 epoch_Time:26530.0min: [2024-01-04 12:19:02,879][model8_pretrain.py][INFO] Epoch:[0/2](403700/4588595) loss:3.053 lr:0.0000100 epoch_Time:26530.0min: [2024-01-04 12:19:02,879][model8_pretrain.py][INFO] Epoch:[0/2](403700/4588595) loss:2.908 lr:0.0000100 epoch_Time:26530.0min: [2024-01-04 12:19:02,880][model8_pretrain.py][INFO] Epoch:[0/2](403700/4588595) loss:2.742 lr:0.0000100 epoch_Time:26530.0min: [2024-01-04 12:19:39,809][model8_pretrain.py][INFO] Epoch:[0/2](403800/4588595) loss:3.416 lr:0.0000100 epoch_Time:26530.0min: [2024-01-04 12:19:39,809][model8_pretrain.py][INFO] Epoch:[0/2](403800/4588595) loss:3.179 lr:0.0000100 epoch_Time:26530.0min: [2024-01-04 12:19:39,809][model8_pretrain.py][INFO] 
Epoch:[0/2](403800/4588595) loss:2.942 lr:0.0000100 epoch_Time:26530.0min: [2024-01-04 12:19:39,809][model8_pretrain.py][INFO] Epoch:[0/2](403800/4588595) loss:2.943 lr:0.0000100 epoch_Time:26530.0min: [2024-01-04 12:19:39,809][model8_pretrain.py][INFO] Epoch:[0/2](403800/4588595) loss:3.475 lr:0.0000100 epoch_Time:26530.0min: [2024-01-04 12:19:39,809][model8_pretrain.py][INFO] Epoch:[0/2](403800/4588595) loss:3.557 lr:0.0000100 epoch_Time:26530.0min: [2024-01-04 12:19:39,809][model8_pretrain.py][INFO] Epoch:[0/2](403800/4588595) loss:3.014 lr:0.0000100 epoch_Time:26530.0min: [2024-01-04 12:19:39,809][model8_pretrain.py][INFO] Epoch:[0/2](403800/4588595) loss:2.495 lr:0.0000100 epoch_Time:26530.0min: [2024-01-04 12:20:16,719][model8_pretrain.py][INFO] Epoch:[0/2](403900/4588595) loss:3.057 lr:0.0000100 epoch_Time:26528.0min: [2024-01-04 12:20:16,719][model8_pretrain.py][INFO] Epoch:[0/2](403900/4588595) loss:3.212 lr:0.0000100 epoch_Time:26528.0min: [2024-01-04 12:20:16,719][model8_pretrain.py][INFO] Epoch:[0/2](403900/4588595) loss:3.072 lr:0.0000100 epoch_Time:26528.0min: [2024-01-04 12:20:16,719][model8_pretrain.py][INFO] Epoch:[0/2](403900/4588595) loss:2.911 lr:0.0000100 epoch_Time:26528.0min: [2024-01-04 12:20:16,719][model8_pretrain.py][INFO] Epoch:[0/2](403900/4588595) loss:2.407 lr:0.0000100 epoch_Time:26528.0min: [2024-01-04 12:20:16,719][model8_pretrain.py][INFO] Epoch:[0/2](403900/4588595) loss:2.914 lr:0.0000100 epoch_Time:26528.0min: [2024-01-04 12:20:16,719][model8_pretrain.py][INFO] Epoch:[0/2](403900/4588595) loss:2.776 lr:0.0000100 epoch_Time:26528.0min: [2024-01-04 12:20:16,719][model8_pretrain.py][INFO] Epoch:[0/2](403900/4588595) loss:2.688 lr:0.0000100 epoch_Time:26528.0min: [2024-01-04 12:21:03,648][model8_pretrain.py][INFO] Epoch:[0/2](404000/4588595) loss:2.787 lr:0.0000100 epoch_Time:26529.0min: [2024-01-04 12:21:03,648][model8_pretrain.py][INFO] Epoch:[0/2](404000/4588595) loss:3.138 lr:0.0000100 epoch_Time:26529.0min: [2024-01-04 
12:21:03,648][model8_pretrain.py][INFO] Epoch:[0/2](404000/4588595) loss:2.809 lr:0.0000100 epoch_Time:26529.0min: [2024-01-04 12:21:03,648][model8_pretrain.py][INFO] Epoch:[0/2](404000/4588595) loss:2.960 lr:0.0000100 epoch_Time:26529.0min: [2024-01-04 12:21:03,648][model8_pretrain.py][INFO] Epoch:[0/2](404000/4588595) loss:2.373 lr:0.0000100 epoch_Time:26529.0min: [2024-01-04 12:21:03,648][model8_pretrain.py][INFO] Epoch:[0/2](404000/4588595) loss:2.356 lr:0.0000100 epoch_Time:26529.0min: [2024-01-04 12:21:03,648][model8_pretrain.py][INFO] Epoch:[0/2](404000/4588595) loss:2.660 lr:0.0000100 epoch_Time:26529.0min: [2024-01-04 12:21:03,648][model8_pretrain.py][INFO] Epoch:[0/2](404000/4588595) loss:3.089 lr:0.0000100 epoch_Time:26529.0min: [2024-01-04 12:21:40,576][model8_pretrain.py][INFO] Epoch:[0/2](404100/4588595) loss:2.590 lr:0.0000100 epoch_Time:26529.0min: [2024-01-04 12:21:40,576][model8_pretrain.py][INFO] Epoch:[0/2](404100/4588595) loss:2.608 lr:0.0000100 epoch_Time:26529.0min: [2024-01-04 12:21:40,576][model8_pretrain.py][INFO] Epoch:[0/2](404100/4588595) loss:2.971 lr:0.0000100 epoch_Time:26529.0min: [2024-01-04 12:21:40,576][model8_pretrain.py][INFO] Epoch:[0/2](404100/4588595) loss:3.311 lr:0.0000100 epoch_Time:26529.0min: [2024-01-04 12:21:40,576][model8_pretrain.py][INFO] Epoch:[0/2](404100/4588595) loss:2.801 lr:0.0000100 epoch_Time:26529.0min: [2024-01-04 12:21:40,576][model8_pretrain.py][INFO] Epoch:[0/2](404100/4588595) loss:2.925 lr:0.0000100 epoch_Time:26529.0min: [2024-01-04 12:21:40,576][model8_pretrain.py][INFO] Epoch:[0/2](404100/4588595) loss:3.120 lr:0.0000100 epoch_Time:26529.0min: [2024-01-04 12:21:40,576][model8_pretrain.py][INFO] Epoch:[0/2](404100/4588595) loss:3.145 lr:0.0000100 epoch_Time:26529.0min: [2024-01-04 12:22:17,512][model8_pretrain.py][INFO] Epoch:[0/2](404200/4588595) loss:3.115 lr:0.0000100 epoch_Time:26528.0min: [2024-01-04 12:22:17,512][model8_pretrain.py][INFO] Epoch:[0/2](404200/4588595) loss:3.011 lr:0.0000100 
epoch_Time:26528.0min: [2024-01-04 12:22:17,512][model8_pretrain.py][INFO] Epoch:[0/2](404200/4588595) loss:2.670 lr:0.0000100 epoch_Time:26528.0min: [2024-01-04 12:22:17,512][model8_pretrain.py][INFO] Epoch:[0/2](404200/4588595) loss:3.324 lr:0.0000100 epoch_Time:26528.0min: [2024-01-04 12:22:17,512][model8_pretrain.py][INFO] Epoch:[0/2](404200/4588595) loss:3.340 lr:0.0000100 epoch_Time:26528.0min: [2024-01-04 12:22:17,512][model8_pretrain.py][INFO] Epoch:[0/2](404200/4588595) loss:3.212 lr:0.0000100 epoch_Time:26528.0min: [2024-01-04 12:22:17,512][model8_pretrain.py][INFO] Epoch:[0/2](404200/4588595) loss:2.961 lr:0.0000100 epoch_Time:26528.0min: [2024-01-04 12:22:17,512][model8_pretrain.py][INFO] Epoch:[0/2](404200/4588595) loss:3.658 lr:0.0000100 epoch_Time:26528.0min: [2024-01-04 12:22:54,449][model8_pretrain.py][INFO] Epoch:[0/2](404300/4588595) loss:3.088 lr:0.0000100 epoch_Time:26526.0min: [2024-01-04 12:22:54,449][model8_pretrain.py][INFO] Epoch:[0/2](404300/4588595) loss:3.079 lr:0.0000100 epoch_Time:26526.0min: [2024-01-04 12:22:54,449][model8_pretrain.py][INFO] Epoch:[0/2](404300/4588595) loss:2.697 lr:0.0000100 epoch_Time:26526.0min: [2024-01-04 12:22:54,449][model8_pretrain.py][INFO] Epoch:[0/2](404300/4588595) loss:2.617 lr:0.0000100 epoch_Time:26526.0min: [2024-01-04 12:22:54,450][model8_pretrain.py][INFO] Epoch:[0/2](404300/4588595) loss:3.180 lr:0.0000100 epoch_Time:26526.0min: [2024-01-04 12:22:54,450][model8_pretrain.py][INFO] Epoch:[0/2](404300/4588595) loss:2.843 lr:0.0000100 epoch_Time:26526.0min: [2024-01-04 12:22:54,450][model8_pretrain.py][INFO] Epoch:[0/2](404300/4588595) loss:2.839 lr:0.0000100 epoch_Time:26526.0min: [2024-01-04 12:22:54,450][model8_pretrain.py][INFO] Epoch:[0/2](404300/4588595) loss:2.809 lr:0.0000100 epoch_Time:26526.0min: [2024-01-04 12:23:31,397][model8_pretrain.py][INFO] Epoch:[0/2](404400/4588595) loss:3.017 lr:0.0000100 epoch_Time:26526.0min: [2024-01-04 12:23:31,397][model8_pretrain.py][INFO] 
Epoch:[0/2](404400/4588595) loss:3.082 lr:0.0000100 epoch_Time:26526.0min: [2024-01-04 12:23:31,397][model8_pretrain.py][INFO] Epoch:[0/2](404400/4588595) loss:3.291 lr:0.0000100 epoch_Time:26526.0min: [2024-01-04 12:23:31,397][model8_pretrain.py][INFO] Epoch:[0/2](404400/4588595) loss:2.872 lr:0.0000100 epoch_Time:26526.0min: [2024-01-04 12:23:31,397][model8_pretrain.py][INFO] Epoch:[0/2](404400/4588595) loss:3.137 lr:0.0000100 epoch_Time:26526.0min: [2024-01-04 12:23:31,397][model8_pretrain.py][INFO] Epoch:[0/2](404400/4588595) loss:2.964 lr:0.0000100 epoch_Time:26526.0min: [2024-01-04 12:23:31,397][model8_pretrain.py][INFO] Epoch:[0/2](404400/4588595) loss:2.900 lr:0.0000100 epoch_Time:26526.0min: [2024-01-04 12:23:31,397][model8_pretrain.py][INFO] Epoch:[0/2](404400/4588595) loss:2.736 lr:0.0000100 epoch_Time:26526.0min: [2024-01-04 12:24:08,341][model8_pretrain.py][INFO] Epoch:[0/2](404500/4588595) loss:2.957 lr:0.0000100 epoch_Time:26525.0min: [2024-01-04 12:24:08,341][model8_pretrain.py][INFO] Epoch:[0/2](404500/4588595) loss:2.805 lr:0.0000100 epoch_Time:26525.0min: [2024-01-04 12:24:08,341][model8_pretrain.py][INFO] Epoch:[0/2](404500/4588595) loss:2.928 lr:0.0000100 epoch_Time:26525.0min: [2024-01-04 12:24:08,341][model8_pretrain.py][INFO] Epoch:[0/2](404500/4588595) loss:2.912 lr:0.0000100 epoch_Time:26525.0min: [2024-01-04 12:24:08,341][model8_pretrain.py][INFO] Epoch:[0/2](404500/4588595) loss:2.848 lr:0.0000100 epoch_Time:26525.0min: [2024-01-04 12:24:08,341][model8_pretrain.py][INFO] Epoch:[0/2](404500/4588595) loss:3.194 lr:0.0000100 epoch_Time:26525.0min: [2024-01-04 12:24:08,341][model8_pretrain.py][INFO] Epoch:[0/2](404500/4588595) loss:2.767 lr:0.0000100 epoch_Time:26525.0min: [2024-01-04 12:24:08,341][model8_pretrain.py][INFO] Epoch:[0/2](404500/4588595) loss:1.739 lr:0.0000100 epoch_Time:26525.0min: [2024-01-04 12:24:45,281][model8_pretrain.py][INFO] Epoch:[0/2](404600/4588595) loss:2.966 lr:0.0000100 epoch_Time:26525.0min: [2024-01-04 
12:24:45,281][model8_pretrain.py][INFO] Epoch:[0/2](404600/4588595) loss:2.361 lr:0.0000100 epoch_Time:26525.0min: [2024-01-04 12:24:45,281][model8_pretrain.py][INFO] Epoch:[0/2](404600/4588595) loss:2.564 lr:0.0000100 epoch_Time:26525.0min: [2024-01-04 12:24:45,281][model8_pretrain.py][INFO] Epoch:[0/2](404600/4588595) loss:2.802 lr:0.0000100 epoch_Time:26525.0min: [2024-01-04 12:24:45,281][model8_pretrain.py][INFO] Epoch:[0/2](404600/4588595) loss:2.003 lr:0.0000100 epoch_Time:26525.0min: [2024-01-04 12:24:45,281][model8_pretrain.py][INFO] Epoch:[0/2](404600/4588595) loss:2.402 lr:0.0000100 epoch_Time:26525.0min: [2024-01-04 12:24:45,281][model8_pretrain.py][INFO] Epoch:[0/2](404600/4588595) loss:3.069 lr:0.0000100 epoch_Time:26525.0min: [2024-01-04 12:24:45,281][model8_pretrain.py][INFO] Epoch:[0/2](404600/4588595) loss:3.098 lr:0.0000100 epoch_Time:26525.0min: [2024-01-04 12:25:22,221][model8_pretrain.py][INFO] Epoch:[0/2](404700/4588595) loss:2.959 lr:0.0000100 epoch_Time:26524.0min: [2024-01-04 12:25:22,221][model8_pretrain.py][INFO] Epoch:[0/2](404700/4588595) loss:2.821 lr:0.0000100 epoch_Time:26524.0min: [2024-01-04 12:25:22,221][model8_pretrain.py][INFO] Epoch:[0/2](404700/4588595) loss:2.509 lr:0.0000100 epoch_Time:26524.0min: [2024-01-04 12:25:22,221][model8_pretrain.py][INFO] Epoch:[0/2](404700/4588595) loss:2.901 lr:0.0000100 epoch_Time:26524.0min: [2024-01-04 12:25:22,221][model8_pretrain.py][INFO] Epoch:[0/2](404700/4588595) loss:2.454 lr:0.0000100 epoch_Time:26524.0min: [2024-01-04 12:25:22,221][model8_pretrain.py][INFO] Epoch:[0/2](404700/4588595) loss:2.734 lr:0.0000100 epoch_Time:26524.0min: [2024-01-04 12:25:22,221][model8_pretrain.py][INFO] Epoch:[0/2](404700/4588595) loss:2.936 lr:0.0000100 epoch_Time:26524.0min: [2024-01-04 12:25:22,221][model8_pretrain.py][INFO] Epoch:[0/2](404700/4588595) loss:2.886 lr:0.0000100 epoch_Time:26524.0min: [2024-01-04 12:26:08,962][model8_pretrain.py][INFO] Epoch:[0/2](404800/4588595) loss:2.895 lr:0.0000100 
epoch_Time:26524.0min: [2024-01-04 12:26:08,962][model8_pretrain.py][INFO] Epoch:[0/2](404800/4588595) loss:2.877 lr:0.0000100 epoch_Time:26524.0min: [2024-01-04 12:26:08,962][model8_pretrain.py][INFO] Epoch:[0/2](404800/4588595) loss:2.548 lr:0.0000100 epoch_Time:26524.0min: [2024-01-04 12:26:08,962][model8_pretrain.py][INFO] Epoch:[0/2](404800/4588595) loss:3.024 lr:0.0000100 epoch_Time:26524.0min: [2024-01-04 12:26:08,962][model8_pretrain.py][INFO] Epoch:[0/2](404800/4588595) loss:2.751 lr:0.0000100 epoch_Time:26524.0min: [2024-01-04 12:26:08,962][model8_pretrain.py][INFO] Epoch:[0/2](404800/4588595) loss:2.817 lr:0.0000100 epoch_Time:26524.0min: [2024-01-04 12:26:08,962][model8_pretrain.py][INFO] Epoch:[0/2](404800/4588595) loss:2.977 lr:0.0000100 epoch_Time:26524.0min: [2024-01-04 12:26:08,962][model8_pretrain.py][INFO] Epoch:[0/2](404800/4588595) loss:2.966 lr:0.0000100 epoch_Time:26524.0min: [2024-01-04 12:26:45,885][model8_pretrain.py][INFO] Epoch:[0/2](404900/4588595) loss:2.109 lr:0.0000100 epoch_Time:26524.0min: [2024-01-04 12:26:45,885][model8_pretrain.py][INFO] Epoch:[0/2](404900/4588595) loss:2.820 lr:0.0000100 epoch_Time:26524.0min: [2024-01-04 12:26:45,885][model8_pretrain.py][INFO] Epoch:[0/2](404900/4588595) loss:2.944 lr:0.0000100 epoch_Time:26524.0min: [2024-01-04 12:26:45,885][model8_pretrain.py][INFO] Epoch:[0/2](404900/4588595) loss:2.615 lr:0.0000100 epoch_Time:26524.0min: [2024-01-04 12:26:45,885][model8_pretrain.py][INFO] Epoch:[0/2](404900/4588595) loss:2.689 lr:0.0000100 epoch_Time:26524.0min: [2024-01-04 12:26:45,885][model8_pretrain.py][INFO] Epoch:[0/2](404900/4588595) loss:3.079 lr:0.0000100 epoch_Time:26524.0min: [2024-01-04 12:26:45,885][model8_pretrain.py][INFO] Epoch:[0/2](404900/4588595) loss:2.955 lr:0.0000100 epoch_Time:26524.0min: [2024-01-04 12:26:45,885][model8_pretrain.py][INFO] Epoch:[0/2](404900/4588595) loss:3.014 lr:0.0000100 epoch_Time:26524.0min: [2024-01-04 12:27:22,817][model8_pretrain.py][INFO] 
Epoch:[0/2](405000/4588595) loss:3.069 lr:0.0000100 epoch_Time:26523.0min: [2024-01-04 12:27:22,817][model8_pretrain.py][INFO] Epoch:[0/2](405000/4588595) loss:2.404 lr:0.0000100 epoch_Time:26523.0min: [2024-01-04 12:27:22,817][model8_pretrain.py][INFO] Epoch:[0/2](405000/4588595) loss:3.107 lr:0.0000100 epoch_Time:26523.0min: [2024-01-04 12:27:22,817][model8_pretrain.py][INFO] Epoch:[0/2](405000/4588595) loss:3.338 lr:0.0000100 epoch_Time:26523.0min: [2024-01-04 12:27:22,817][model8_pretrain.py][INFO] Epoch:[0/2](405000/4588595) loss:3.355 lr:0.0000100 epoch_Time:26523.0min: [2024-01-04 12:27:22,818][model8_pretrain.py][INFO] Epoch:[0/2](405000/4588595) loss:2.685 lr:0.0000100 epoch_Time:26523.0min: [2024-01-04 12:27:22,818][model8_pretrain.py][INFO] Epoch:[0/2](405000/4588595) loss:2.886 lr:0.0000100 epoch_Time:26523.0min: [2024-01-04 12:27:22,818][model8_pretrain.py][INFO] Epoch:[0/2](405000/4588595) loss:2.709 lr:0.0000100 epoch_Time:26523.0min: [2024-01-04 12:27:59,746][model8_pretrain.py][INFO] Epoch:[0/2](405100/4588595) loss:2.434 lr:0.0000100 epoch_Time:26522.0min: [2024-01-04 12:27:59,746][model8_pretrain.py][INFO] Epoch:[0/2](405100/4588595) loss:2.944 lr:0.0000100 epoch_Time:26522.0min: [2024-01-04 12:27:59,746][model8_pretrain.py][INFO] Epoch:[0/2](405100/4588595) loss:2.834 lr:0.0000100 epoch_Time:26522.0min: [2024-01-04 12:27:59,746][model8_pretrain.py][INFO] Epoch:[0/2](405100/4588595) loss:3.311 lr:0.0000100 epoch_Time:26522.0min: [2024-01-04 12:27:59,746][model8_pretrain.py][INFO] Epoch:[0/2](405100/4588595) loss:3.105 lr:0.0000100 epoch_Time:26522.0min: [2024-01-04 12:27:59,746][model8_pretrain.py][INFO] Epoch:[0/2](405100/4588595) loss:2.865 lr:0.0000100 epoch_Time:26522.0min: [2024-01-04 12:27:59,746][model8_pretrain.py][INFO] Epoch:[0/2](405100/4588595) loss:2.903 lr:0.0000100 epoch_Time:26522.0min: [2024-01-04 12:27:59,746][model8_pretrain.py][INFO] Epoch:[0/2](405100/4588595) loss:3.149 lr:0.0000100 epoch_Time:26522.0min: [2024-01-04 
12:28:36,684][model8_pretrain.py][INFO] Epoch:[0/2](405200/4588595) loss:2.722 lr:0.0000100 epoch_Time:26521.0min: [2024-01-04 12:28:36,683][model8_pretrain.py][INFO] Epoch:[0/2](405200/4588595) loss:2.846 lr:0.0000100 epoch_Time:26521.0min: [2024-01-04 12:28:36,684][model8_pretrain.py][INFO] Epoch:[0/2](405200/4588595) loss:2.086 lr:0.0000100 epoch_Time:26521.0min: [2024-01-04 12:28:36,684][model8_pretrain.py][INFO] Epoch:[0/2](405200/4588595) loss:3.066 lr:0.0000100 epoch_Time:26521.0min: [2024-01-04 12:28:36,684][model8_pretrain.py][INFO] Epoch:[0/2](405200/4588595) loss:2.935 lr:0.0000100 epoch_Time:26521.0min: [2024-01-04 12:28:36,684][model8_pretrain.py][INFO] Epoch:[0/2](405200/4588595) loss:2.633 lr:0.0000100 epoch_Time:26521.0min: [2024-01-04 12:28:36,684][model8_pretrain.py][INFO] Epoch:[0/2](405200/4588595) loss:2.944 lr:0.0000100 epoch_Time:26521.0min: [2024-01-04 12:28:36,684][model8_pretrain.py][INFO] Epoch:[0/2](405200/4588595) loss:3.179 lr:0.0000100 epoch_Time:26521.0min: [2024-01-04 12:29:13,621][model8_pretrain.py][INFO] Epoch:[0/2](405300/4588595) loss:2.248 lr:0.0000100 epoch_Time:26520.0min: [2024-01-04 12:29:13,621][model8_pretrain.py][INFO] Epoch:[0/2](405300/4588595) loss:2.826 lr:0.0000100 epoch_Time:26520.0min: [2024-01-04 12:29:13,621][model8_pretrain.py][INFO] Epoch:[0/2](405300/4588595) loss:3.204 lr:0.0000100 epoch_Time:26520.0min: [2024-01-04 12:29:13,621][model8_pretrain.py][INFO] Epoch:[0/2](405300/4588595) loss:2.776 lr:0.0000100 epoch_Time:26520.0min: [2024-01-04 12:29:13,621][model8_pretrain.py][INFO] Epoch:[0/2](405300/4588595) loss:2.786 lr:0.0000100 epoch_Time:26520.0min: [2024-01-04 12:29:13,621][model8_pretrain.py][INFO] Epoch:[0/2](405300/4588595) loss:3.249 lr:0.0000100 epoch_Time:26520.0min: [2024-01-04 12:29:13,621][model8_pretrain.py][INFO] Epoch:[0/2](405300/4588595) loss:3.200 lr:0.0000100 epoch_Time:26520.0min: [2024-01-04 12:29:13,621][model8_pretrain.py][INFO] Epoch:[0/2](405300/4588595) loss:2.987 lr:0.0000100 
epoch_Time:26520.0min: [2024-01-04 12:29:50,560][model8_pretrain.py][INFO] Epoch:[0/2](405400/4588595) loss:2.606 lr:0.0000100 epoch_Time:26519.0min: [2024-01-04 12:29:50,560][model8_pretrain.py][INFO] Epoch:[0/2](405400/4588595) loss:2.277 lr:0.0000100 epoch_Time:26519.0min: [2024-01-04 12:29:50,560][model8_pretrain.py][INFO] Epoch:[0/2](405400/4588595) loss:2.900 lr:0.0000100 epoch_Time:26519.0min: [2024-01-04 12:29:50,560][model8_pretrain.py][INFO] Epoch:[0/2](405400/4588595) loss:3.457 lr:0.0000100 epoch_Time:26519.0min: [2024-01-04 12:29:50,560][model8_pretrain.py][INFO] Epoch:[0/2](405400/4588595) loss:2.512 lr:0.0000100 epoch_Time:26519.0min: [2024-01-04 12:29:50,560][model8_pretrain.py][INFO] Epoch:[0/2](405400/4588595) loss:2.992 lr:0.0000100 epoch_Time:26519.0min: [2024-01-04 12:29:50,560][model8_pretrain.py][INFO] Epoch:[0/2](405400/4588595) loss:2.654 lr:0.0000100 epoch_Time:26519.0min: [2024-01-04 12:29:50,560][model8_pretrain.py][INFO] Epoch:[0/2](405400/4588595) loss:3.136 lr:0.0000100 epoch_Time:26519.0min: [2024-01-04 12:30:27,507][model8_pretrain.py][INFO] Epoch:[0/2](405500/4588595) loss:2.625 lr:0.0000100 epoch_Time:26519.0min: [2024-01-04 12:30:27,507][model8_pretrain.py][INFO] Epoch:[0/2](405500/4588595) loss:2.946 lr:0.0000100 epoch_Time:26519.0min: [2024-01-04 12:30:27,507][model8_pretrain.py][INFO] Epoch:[0/2](405500/4588595) loss:2.681 lr:0.0000100 epoch_Time:26519.0min: [2024-01-04 12:30:27,507][model8_pretrain.py][INFO] Epoch:[0/2](405500/4588595) loss:2.952 lr:0.0000100 epoch_Time:26519.0min: [2024-01-04 12:30:27,508][model8_pretrain.py][INFO] Epoch:[0/2](405500/4588595) loss:2.868 lr:0.0000100 epoch_Time:26519.0min: [2024-01-04 12:30:27,508][model8_pretrain.py][INFO] Epoch:[0/2](405500/4588595) loss:2.960 lr:0.0000100 epoch_Time:26519.0min: [2024-01-04 12:30:27,508][model8_pretrain.py][INFO] Epoch:[0/2](405500/4588595) loss:2.551 lr:0.0000100 epoch_Time:26519.0min: [2024-01-04 12:30:27,508][model8_pretrain.py][INFO] 
Epoch:[0/2](405500/4588595) loss:2.725 lr:0.0000100 epoch_Time:26519.0min: [2024-01-04 12:31:14,604][model8_pretrain.py][INFO] Epoch:[0/2](405600/4588595) loss:2.148 lr:0.0000100 epoch_Time:26520.0min: [2024-01-04 12:31:14,604][model8_pretrain.py][INFO] Epoch:[0/2](405600/4588595) loss:2.483 lr:0.0000100 epoch_Time:26520.0min: [2024-01-04 12:31:14,604][model8_pretrain.py][INFO] Epoch:[0/2](405600/4588595) loss:2.918 lr:0.0000100 epoch_Time:26520.0min: [2024-01-04 12:31:14,604][model8_pretrain.py][INFO] Epoch:[0/2](405600/4588595) loss:2.278 lr:0.0000100 epoch_Time:26520.0min: [2024-01-04 12:31:14,604][model8_pretrain.py][INFO] Epoch:[0/2](405600/4588595) loss:2.577 lr:0.0000100 epoch_Time:26520.0min: [2024-01-04 12:31:14,604][model8_pretrain.py][INFO] Epoch:[0/2](405600/4588595) loss:3.263 lr:0.0000100 epoch_Time:26520.0min: [2024-01-04 12:31:14,604][model8_pretrain.py][INFO] Epoch:[0/2](405600/4588595) loss:3.050 lr:0.0000100 epoch_Time:26520.0min: [2024-01-04 12:31:14,604][model8_pretrain.py][INFO] Epoch:[0/2](405600/4588595) loss:3.378 lr:0.0000100 epoch_Time:26520.0min: [2024-01-04 12:31:51,530][model8_pretrain.py][INFO] Epoch:[0/2](405700/4588595) loss:2.964 lr:0.0000100 epoch_Time:26518.0min: [2024-01-04 12:31:51,530][model8_pretrain.py][INFO] Epoch:[0/2](405700/4588595) loss:2.792 lr:0.0000100 epoch_Time:26518.0min: [2024-01-04 12:31:51,530][model8_pretrain.py][INFO] Epoch:[0/2](405700/4588595) loss:3.036 lr:0.0000100 epoch_Time:26518.0min: [2024-01-04 12:31:51,530][model8_pretrain.py][INFO] Epoch:[0/2](405700/4588595) loss:2.474 lr:0.0000100 epoch_Time:26518.0min: [2024-01-04 12:31:51,530][model8_pretrain.py][INFO] Epoch:[0/2](405700/4588595) loss:3.354 lr:0.0000100 epoch_Time:26518.0min: [2024-01-04 12:31:51,530][model8_pretrain.py][INFO] Epoch:[0/2](405700/4588595) loss:3.143 lr:0.0000100 epoch_Time:26518.0min: [2024-01-04 12:31:51,530][model8_pretrain.py][INFO] Epoch:[0/2](405700/4588595) loss:2.373 lr:0.0000100 epoch_Time:26518.0min: [2024-01-04 
12:31:51,530][model8_pretrain.py][INFO] Epoch:[0/2](405700/4588595) loss:3.372 lr:0.0000100 epoch_Time:26518.0min: [2024-01-04 12:32:28,471][model8_pretrain.py][INFO] Epoch:[0/2](405800/4588595) loss:3.002 lr:0.0000100 epoch_Time:26518.0min: [2024-01-04 12:32:28,471][model8_pretrain.py][INFO] Epoch:[0/2](405800/4588595) loss:2.700 lr:0.0000100 epoch_Time:26518.0min: [2024-01-04 12:32:28,471][model8_pretrain.py][INFO] Epoch:[0/2](405800/4588595) loss:3.093 lr:0.0000100 epoch_Time:26518.0min: [2024-01-04 12:32:28,471][model8_pretrain.py][INFO] Epoch:[0/2](405800/4588595) loss:2.597 lr:0.0000100 epoch_Time:26518.0min: [2024-01-04 12:32:28,471][model8_pretrain.py][INFO] Epoch:[0/2](405800/4588595) loss:3.117 lr:0.0000100 epoch_Time:26518.0min: [2024-01-04 12:32:28,471][model8_pretrain.py][INFO] Epoch:[0/2](405800/4588595) loss:2.746 lr:0.0000100 epoch_Time:26518.0min: [2024-01-04 12:32:28,471][model8_pretrain.py][INFO] Epoch:[0/2](405800/4588595) loss:3.321 lr:0.0000100 epoch_Time:26518.0min: [2024-01-04 12:32:28,471][model8_pretrain.py][INFO] Epoch:[0/2](405800/4588595) loss:3.204 lr:0.0000100 epoch_Time:26518.0min: [2024-01-04 12:33:05,417][model8_pretrain.py][INFO] Epoch:[0/2](405900/4588595) loss:3.162 lr:0.0000100 epoch_Time:26517.0min: [2024-01-04 12:33:05,417][model8_pretrain.py][INFO] Epoch:[0/2](405900/4588595) loss:3.508 lr:0.0000100 epoch_Time:26517.0min: [2024-01-04 12:33:05,417][model8_pretrain.py][INFO] Epoch:[0/2](405900/4588595) loss:2.734 lr:0.0000100 epoch_Time:26517.0min: [2024-01-04 12:33:05,417][model8_pretrain.py][INFO] Epoch:[0/2](405900/4588595) loss:3.006 lr:0.0000100 epoch_Time:26517.0min: [2024-01-04 12:33:05,417][model8_pretrain.py][INFO] Epoch:[0/2](405900/4588595) loss:3.083 lr:0.0000100 epoch_Time:26517.0min: [2024-01-04 12:33:05,417][model8_pretrain.py][INFO] Epoch:[0/2](405900/4588595) loss:2.797 lr:0.0000100 epoch_Time:26517.0min: [2024-01-04 12:33:05,417][model8_pretrain.py][INFO] Epoch:[0/2](405900/4588595) loss:2.601 lr:0.0000100 
epoch_Time:26517.0min: [2024-01-04 12:33:05,417][model8_pretrain.py][INFO] Epoch:[0/2](405900/4588595) loss:3.103 lr:0.0000100 epoch_Time:26517.0min: [2024-01-04 12:33:42,364][model8_pretrain.py][INFO] Epoch:[0/2](406000/4588595) loss:2.178 lr:0.0000100 epoch_Time:26517.0min: [2024-01-04 12:33:42,364][model8_pretrain.py][INFO] Epoch:[0/2](406000/4588595) loss:2.469 lr:0.0000100 epoch_Time:26517.0min: [2024-01-04 12:33:42,364][model8_pretrain.py][INFO] Epoch:[0/2](406000/4588595) loss:2.312 lr:0.0000100 epoch_Time:26517.0min: [2024-01-04 12:33:42,364][model8_pretrain.py][INFO] Epoch:[0/2](406000/4588595) loss:3.448 lr:0.0000100 epoch_Time:26517.0min: [2024-01-04 12:33:42,364][model8_pretrain.py][INFO] Epoch:[0/2](406000/4588595) loss:3.149 lr:0.0000100 epoch_Time:26517.0min: [2024-01-04 12:33:42,364][model8_pretrain.py][INFO] Epoch:[0/2](406000/4588595) loss:2.578 lr:0.0000100 epoch_Time:26517.0min: [2024-01-04 12:33:42,364][model8_pretrain.py][INFO] Epoch:[0/2](406000/4588595) loss:3.387 lr:0.0000100 epoch_Time:26517.0min: [2024-01-04 12:33:42,364][model8_pretrain.py][INFO] Epoch:[0/2](406000/4588595) loss:2.550 lr:0.0000100 epoch_Time:26517.0min: [2024-01-04 12:34:19,304][model8_pretrain.py][INFO] Epoch:[0/2](406100/4588595) loss:2.966 lr:0.0000100 epoch_Time:26515.0min: [2024-01-04 12:34:19,304][model8_pretrain.py][INFO] Epoch:[0/2](406100/4588595) loss:2.591 lr:0.0000100 epoch_Time:26515.0min: [2024-01-04 12:34:19,304][model8_pretrain.py][INFO] Epoch:[0/2](406100/4588595) loss:3.060 lr:0.0000100 epoch_Time:26515.0min: [2024-01-04 12:34:19,304][model8_pretrain.py][INFO] Epoch:[0/2](406100/4588595) loss:2.761 lr:0.0000100 epoch_Time:26515.0min: [2024-01-04 12:34:19,304][model8_pretrain.py][INFO] Epoch:[0/2](406100/4588595) loss:1.950 lr:0.0000100 epoch_Time:26515.0min: [2024-01-04 12:34:19,304][model8_pretrain.py][INFO] Epoch:[0/2](406100/4588595) loss:2.835 lr:0.0000100 epoch_Time:26515.0min: [2024-01-04 12:34:19,304][model8_pretrain.py][INFO] 
Epoch:[0/2](406100/4588595) loss:2.880 lr:0.0000100 epoch_Time:26515.0min: [2024-01-04 12:34:19,304][model8_pretrain.py][INFO] Epoch:[0/2](406100/4588595) loss:2.886 lr:0.0000100 epoch_Time:26515.0min: [2024-01-04 12:34:56,243][model8_pretrain.py][INFO] Epoch:[0/2](406200/4588595) loss:2.845 lr:0.0000100 epoch_Time:26514.0min: [2024-01-04 12:34:56,243][model8_pretrain.py][INFO] Epoch:[0/2](406200/4588595) loss:2.322 lr:0.0000100 epoch_Time:26514.0min: [2024-01-04 12:34:56,243][model8_pretrain.py][INFO] Epoch:[0/2](406200/4588595) loss:2.818 lr:0.0000100 epoch_Time:26514.0min: [2024-01-04 12:34:56,243][model8_pretrain.py][INFO] Epoch:[0/2](406200/4588595) loss:2.590 lr:0.0000100 epoch_Time:26514.0min: [2024-01-04 12:34:56,243][model8_pretrain.py][INFO] Epoch:[0/2](406200/4588595) loss:2.556 lr:0.0000100 epoch_Time:26514.0min: [2024-01-04 12:34:56,243][model8_pretrain.py][INFO] Epoch:[0/2](406200/4588595) loss:2.571 lr:0.0000100 epoch_Time:26514.0min: [2024-01-04 12:34:56,243][model8_pretrain.py][INFO] Epoch:[0/2](406200/4588595) loss:2.827 lr:0.0000100 epoch_Time:26514.0min: [2024-01-04 12:34:56,243][model8_pretrain.py][INFO] Epoch:[0/2](406200/4588595) loss:2.954 lr:0.0000100 epoch_Time:26514.0min: [2024-01-04 12:35:33,190][model8_pretrain.py][INFO] Epoch:[0/2](406300/4588595) loss:2.772 lr:0.0000100 epoch_Time:26514.0min: [2024-01-04 12:35:33,190][model8_pretrain.py][INFO] Epoch:[0/2](406300/4588595) loss:2.996 lr:0.0000100 epoch_Time:26514.0min: [2024-01-04 12:35:33,190][model8_pretrain.py][INFO] Epoch:[0/2](406300/4588595) loss:3.393 lr:0.0000100 epoch_Time:26514.0min: [2024-01-04 12:35:33,190][model8_pretrain.py][INFO] Epoch:[0/2](406300/4588595) loss:2.610 lr:0.0000100 epoch_Time:26514.0min: [2024-01-04 12:35:33,190][model8_pretrain.py][INFO] Epoch:[0/2](406300/4588595) loss:2.568 lr:0.0000100 epoch_Time:26514.0min: [2024-01-04 12:35:33,190][model8_pretrain.py][INFO] Epoch:[0/2](406300/4588595) loss:2.690 lr:0.0000100 epoch_Time:26514.0min: [2024-01-04 
12:35:33,191][model8_pretrain.py][INFO] Epoch:[0/2](406300/4588595) loss:2.361 lr:0.0000100 epoch_Time:26514.0min: [2024-01-04 12:35:33,191][model8_pretrain.py][INFO] Epoch:[0/2](406300/4588595) loss:2.299 lr:0.0000100 epoch_Time:26514.0min: [2024-01-04 12:36:20,445][model8_pretrain.py][INFO] Epoch:[0/2](406400/4588595) loss:2.913 lr:0.0000100 epoch_Time:26515.0min: [2024-01-04 12:36:20,445][model8_pretrain.py][INFO] Epoch:[0/2](406400/4588595) loss:3.118 lr:0.0000100 epoch_Time:26515.0min: [2024-01-04 12:36:20,445][model8_pretrain.py][INFO] Epoch:[0/2](406400/4588595) loss:2.660 lr:0.0000100 epoch_Time:26515.0min: [2024-01-04 12:36:20,445][model8_pretrain.py][INFO] Epoch:[0/2](406400/4588595) loss:3.002 lr:0.0000100 epoch_Time:26515.0min: [2024-01-04 12:36:20,445][model8_pretrain.py][INFO] Epoch:[0/2](406400/4588595) loss:3.191 lr:0.0000100 epoch_Time:26515.0min: [2024-01-04 12:36:20,445][model8_pretrain.py][INFO] Epoch:[0/2](406400/4588595) loss:2.571 lr:0.0000100 epoch_Time:26515.0min: [2024-01-04 12:36:20,445][model8_pretrain.py][INFO] Epoch:[0/2](406400/4588595) loss:3.022 lr:0.0000100 epoch_Time:26515.0min: [2024-01-04 12:36:20,445][model8_pretrain.py][INFO] Epoch:[0/2](406400/4588595) loss:3.286 lr:0.0000100 epoch_Time:26515.0min: [2024-01-04 12:36:57,366][model8_pretrain.py][INFO] Epoch:[0/2](406500/4588595) loss:2.699 lr:0.0000100 epoch_Time:26514.0min: [2024-01-04 12:36:57,366][model8_pretrain.py][INFO] Epoch:[0/2](406500/4588595) loss:2.791 lr:0.0000100 epoch_Time:26514.0min: [2024-01-04 12:36:57,366][model8_pretrain.py][INFO] Epoch:[0/2](406500/4588595) loss:3.076 lr:0.0000100 epoch_Time:26514.0min: [2024-01-04 12:36:57,366][model8_pretrain.py][INFO] Epoch:[0/2](406500/4588595) loss:2.396 lr:0.0000100 epoch_Time:26514.0min: [2024-01-04 12:36:57,366][model8_pretrain.py][INFO] Epoch:[0/2](406500/4588595) loss:2.570 lr:0.0000100 epoch_Time:26514.0min: [2024-01-04 12:36:57,366][model8_pretrain.py][INFO] Epoch:[0/2](406500/4588595) loss:3.303 lr:0.0000100 
epoch_Time:26514.0min: [2024-01-04 12:36:57,367][model8_pretrain.py][INFO] Epoch:[0/2](406500/4588595) loss:2.594 lr:0.0000100 epoch_Time:26514.0min: [2024-01-04 12:36:57,367][model8_pretrain.py][INFO] Epoch:[0/2](406500/4588595) loss:3.197 lr:0.0000100 epoch_Time:26514.0min: [2024-01-04 12:37:34,302][model8_pretrain.py][INFO] Epoch:[0/2](406600/4588595) loss:2.750 lr:0.0000100 epoch_Time:26513.0min: [2024-01-04 12:37:34,302][model8_pretrain.py][INFO] Epoch:[0/2](406600/4588595) loss:2.971 lr:0.0000100 epoch_Time:26513.0min: [2024-01-04 12:37:34,302][model8_pretrain.py][INFO] Epoch:[0/2](406600/4588595) loss:2.950 lr:0.0000100 epoch_Time:26513.0min: [2024-01-04 12:37:34,302][model8_pretrain.py][INFO] Epoch:[0/2](406600/4588595) loss:2.767 lr:0.0000100 epoch_Time:26513.0min: [2024-01-04 12:37:34,302][model8_pretrain.py][INFO] Epoch:[0/2](406600/4588595) loss:2.618 lr:0.0000100 epoch_Time:26513.0min: [2024-01-04 12:37:34,302][model8_pretrain.py][INFO] Epoch:[0/2](406600/4588595) loss:2.945 lr:0.0000100 epoch_Time:26513.0min: [2024-01-04 12:37:34,302][model8_pretrain.py][INFO] Epoch:[0/2](406600/4588595) loss:3.078 lr:0.0000100 epoch_Time:26513.0min: [2024-01-04 12:37:34,302][model8_pretrain.py][INFO] Epoch:[0/2](406600/4588595) loss:2.829 lr:0.0000100 epoch_Time:26513.0min: [2024-01-04 12:38:11,245][model8_pretrain.py][INFO] Epoch:[0/2](406700/4588595) loss:2.707 lr:0.0000100 epoch_Time:26512.0min: [2024-01-04 12:38:11,245][model8_pretrain.py][INFO] Epoch:[0/2](406700/4588595) loss:2.691 lr:0.0000100 epoch_Time:26512.0min: [2024-01-04 12:38:11,245][model8_pretrain.py][INFO] Epoch:[0/2](406700/4588595) loss:2.960 lr:0.0000100 epoch_Time:26512.0min: [2024-01-04 12:38:11,245][model8_pretrain.py][INFO] Epoch:[0/2](406700/4588595) loss:3.453 lr:0.0000100 epoch_Time:26512.0min: [2024-01-04 12:38:11,245][model8_pretrain.py][INFO] Epoch:[0/2](406700/4588595) loss:2.912 lr:0.0000100 epoch_Time:26512.0min: [2024-01-04 12:38:11,245][model8_pretrain.py][INFO] 
Epoch:[0/2](406700/4588595) loss:3.027 lr:0.0000100 epoch_Time:26512.0min: [2024-01-04 12:38:11,246][model8_pretrain.py][INFO] Epoch:[0/2](406700/4588595) loss:3.094 lr:0.0000100 epoch_Time:26512.0min: [2024-01-04 12:38:11,246][model8_pretrain.py][INFO] Epoch:[0/2](406700/4588595) loss:3.286 lr:0.0000100 epoch_Time:26512.0min: [2024-01-04 12:38:48,188][model8_pretrain.py][INFO] Epoch:[0/2](406800/4588595) loss:2.924 lr:0.0000100 epoch_Time:26511.0min: [2024-01-04 12:38:48,188][model8_pretrain.py][INFO] Epoch:[0/2](406800/4588595) loss:3.264 lr:0.0000100 epoch_Time:26511.0min: [2024-01-04 12:38:48,188][model8_pretrain.py][INFO] Epoch:[0/2](406800/4588595) loss:3.233 lr:0.0000100 epoch_Time:26511.0min: [2024-01-04 12:38:48,188][model8_pretrain.py][INFO] Epoch:[0/2](406800/4588595) loss:2.676 lr:0.0000100 epoch_Time:26511.0min: [2024-01-04 12:38:48,188][model8_pretrain.py][INFO] Epoch:[0/2](406800/4588595) loss:2.618 lr:0.0000100 epoch_Time:26511.0min: [2024-01-04 12:38:48,189][model8_pretrain.py][INFO] Epoch:[0/2](406800/4588595) loss:2.858 lr:0.0000100 epoch_Time:26511.0min: [2024-01-04 12:38:48,189][model8_pretrain.py][INFO] Epoch:[0/2](406800/4588595) loss:2.827 lr:0.0000100 epoch_Time:26511.0min: [2024-01-04 12:38:48,191][model8_pretrain.py][INFO] Epoch:[0/2](406800/4588595) loss:2.803 lr:0.0000100 epoch_Time:26511.0min: [2024-01-04 12:39:25,133][model8_pretrain.py][INFO] Epoch:[0/2](406900/4588595) loss:2.812 lr:0.0000100 epoch_Time:26511.0min: [2024-01-04 12:39:25,133][model8_pretrain.py][INFO] Epoch:[0/2](406900/4588595) loss:2.550 lr:0.0000100 epoch_Time:26511.0min: [2024-01-04 12:39:25,133][model8_pretrain.py][INFO] Epoch:[0/2](406900/4588595) loss:2.805 lr:0.0000100 epoch_Time:26511.0min: [2024-01-04 12:39:25,133][model8_pretrain.py][INFO] Epoch:[0/2](406900/4588595) loss:3.638 lr:0.0000100 epoch_Time:26511.0min: [2024-01-04 12:39:25,133][model8_pretrain.py][INFO] Epoch:[0/2](406900/4588595) loss:2.670 lr:0.0000100 epoch_Time:26511.0min: [2024-01-04 
12:39:25,133][model8_pretrain.py][INFO] Epoch:[0/2](406900/4588595) loss:2.982 lr:0.0000100 epoch_Time:26511.0min: [2024-01-04 12:39:25,133][model8_pretrain.py][INFO] Epoch:[0/2](406900/4588595) loss:2.895 lr:0.0000100 epoch_Time:26511.0min: [2024-01-04 12:39:25,133][model8_pretrain.py][INFO] Epoch:[0/2](406900/4588595) loss:2.668 lr:0.0000100 epoch_Time:26511.0min: [2024-01-04 12:40:02,064][model8_pretrain.py][INFO] Epoch:[0/2](407000/4588595) loss:3.008 lr:0.0000100 epoch_Time:26510.0min: [2024-01-04 12:40:02,064][model8_pretrain.py][INFO] Epoch:[0/2](407000/4588595) loss:2.445 lr:0.0000100 epoch_Time:26510.0min: [2024-01-04 12:40:02,064][model8_pretrain.py][INFO] Epoch:[0/2](407000/4588595) loss:2.872 lr:0.0000100 epoch_Time:26510.0min: [2024-01-04 12:40:02,064][model8_pretrain.py][INFO] Epoch:[0/2](407000/4588595) loss:3.278 lr:0.0000100 epoch_Time:26510.0min: [2024-01-04 12:40:02,064][model8_pretrain.py][INFO] Epoch:[0/2](407000/4588595) loss:2.412 lr:0.0000100 epoch_Time:26510.0min: [2024-01-04 12:40:02,064][model8_pretrain.py][INFO] Epoch:[0/2](407000/4588595) loss:2.479 lr:0.0000100 epoch_Time:26510.0min: [2024-01-04 12:40:02,065][model8_pretrain.py][INFO] Epoch:[0/2](407000/4588595) loss:2.420 lr:0.0000100 epoch_Time:26510.0min: [2024-01-04 12:40:02,065][model8_pretrain.py][INFO] Epoch:[0/2](407000/4588595) loss:3.216 lr:0.0000100 epoch_Time:26510.0min: [2024-01-04 12:40:38,989][model8_pretrain.py][INFO] Epoch:[0/2](407100/4588595) loss:2.324 lr:0.0000100 epoch_Time:26509.0min: [2024-01-04 12:40:38,989][model8_pretrain.py][INFO] Epoch:[0/2](407100/4588595) loss:2.927 lr:0.0000100 epoch_Time:26509.0min: [2024-01-04 12:40:38,989][model8_pretrain.py][INFO] Epoch:[0/2](407100/4588595) loss:2.482 lr:0.0000100 epoch_Time:26509.0min: [2024-01-04 12:40:38,989][model8_pretrain.py][INFO] Epoch:[0/2](407100/4588595) loss:2.734 lr:0.0000100 epoch_Time:26509.0min: [2024-01-04 12:40:38,989][model8_pretrain.py][INFO] Epoch:[0/2](407100/4588595) loss:3.021 lr:0.0000100 
epoch_Time:26509.0min: [2024-01-04 12:40:38,989][model8_pretrain.py][INFO] Epoch:[0/2](407100/4588595) loss:2.771 lr:0.0000100 epoch_Time:26509.0min: [2024-01-04 12:40:38,989][model8_pretrain.py][INFO] Epoch:[0/2](407100/4588595) loss:2.620 lr:0.0000100 epoch_Time:26509.0min: [2024-01-04 12:40:38,989][model8_pretrain.py][INFO] Epoch:[0/2](407100/4588595) loss:3.367 lr:0.0000100 epoch_Time:26509.0min: [2024-01-04 12:41:26,125][model8_pretrain.py][INFO] Epoch:[0/2](407200/4588595) loss:3.408 lr:0.0000100 epoch_Time:26510.0min: [2024-01-04 12:41:26,125][model8_pretrain.py][INFO] Epoch:[0/2](407200/4588595) loss:2.355 lr:0.0000100 epoch_Time:26510.0min: [2024-01-04 12:41:26,125][model8_pretrain.py][INFO] Epoch:[0/2](407200/4588595) loss:3.127 lr:0.0000100 epoch_Time:26510.0min: [2024-01-04 12:41:26,125][model8_pretrain.py][INFO] Epoch:[0/2](407200/4588595) loss:2.839 lr:0.0000100 epoch_Time:26510.0min: [2024-01-04 12:41:26,125][model8_pretrain.py][INFO] Epoch:[0/2](407200/4588595) loss:2.521 lr:0.0000100 epoch_Time:26510.0min: [2024-01-04 12:41:26,125][model8_pretrain.py][INFO] Epoch:[0/2](407200/4588595) loss:2.589 lr:0.0000100 epoch_Time:26510.0min: [2024-01-04 12:41:26,125][model8_pretrain.py][INFO] Epoch:[0/2](407200/4588595) loss:2.816 lr:0.0000100 epoch_Time:26510.0min: [2024-01-04 12:41:26,125][model8_pretrain.py][INFO] Epoch:[0/2](407200/4588595) loss:2.771 lr:0.0000100 epoch_Time:26510.0min: [2024-01-04 12:42:03,049][model8_pretrain.py][INFO] Epoch:[0/2](407300/4588595) loss:3.122 lr:0.0000100 epoch_Time:26509.0min: [2024-01-04 12:42:03,049][model8_pretrain.py][INFO] Epoch:[0/2](407300/4588595) loss:2.633 lr:0.0000100 epoch_Time:26509.0min: [2024-01-04 12:42:03,049][model8_pretrain.py][INFO] Epoch:[0/2](407300/4588595) loss:3.361 lr:0.0000100 epoch_Time:26509.0min: [2024-01-04 12:42:03,049][model8_pretrain.py][INFO] Epoch:[0/2](407300/4588595) loss:3.108 lr:0.0000100 epoch_Time:26509.0min: [2024-01-04 12:42:03,049][model8_pretrain.py][INFO] 
Epoch:[0/2](407300/4588595) loss:3.224 lr:0.0000100 epoch_Time:26509.0min: [2024-01-04 12:42:03,049][model8_pretrain.py][INFO] Epoch:[0/2](407300/4588595) loss:3.094 lr:0.0000100 epoch_Time:26509.0min: [2024-01-04 12:42:03,049][model8_pretrain.py][INFO] Epoch:[0/2](407300/4588595) loss:2.394 lr:0.0000100 epoch_Time:26509.0min: [2024-01-04 12:42:03,049][model8_pretrain.py][INFO] Epoch:[0/2](407300/4588595) loss:2.767 lr:0.0000100 epoch_Time:26509.0min: [2024-01-04 12:42:40,000][model8_pretrain.py][INFO] Epoch:[0/2](407400/4588595) loss:2.979 lr:0.0000100 epoch_Time:26509.0min: [2024-01-04 12:42:40,000][model8_pretrain.py][INFO] Epoch:[0/2](407400/4588595) loss:3.027 lr:0.0000100 epoch_Time:26509.0min: [2024-01-04 12:42:40,000][model8_pretrain.py][INFO] Epoch:[0/2](407400/4588595) loss:2.383 lr:0.0000100 epoch_Time:26509.0min: [2024-01-04 12:42:40,000][model8_pretrain.py][INFO] Epoch:[0/2](407400/4588595) loss:2.797 lr:0.0000100 epoch_Time:26509.0min: [2024-01-04 12:42:40,000][model8_pretrain.py][INFO] Epoch:[0/2](407400/4588595) loss:2.846 lr:0.0000100 epoch_Time:26509.0min: [2024-01-04 12:42:40,000][model8_pretrain.py][INFO] Epoch:[0/2](407400/4588595) loss:3.157 lr:0.0000100 epoch_Time:26509.0min: [2024-01-04 12:42:40,001][model8_pretrain.py][INFO] Epoch:[0/2](407400/4588595) loss:2.772 lr:0.0000100 epoch_Time:26509.0min: [2024-01-04 12:42:40,001][model8_pretrain.py][INFO] Epoch:[0/2](407400/4588595) loss:2.948 lr:0.0000100 epoch_Time:26509.0min: [2024-01-04 12:43:16,957][model8_pretrain.py][INFO] Epoch:[0/2](407500/4588595) loss:2.902 lr:0.0000100 epoch_Time:26507.0min: [2024-01-04 12:43:16,957][model8_pretrain.py][INFO] Epoch:[0/2](407500/4588595) loss:2.739 lr:0.0000100 epoch_Time:26507.0min: [2024-01-04 12:43:16,957][model8_pretrain.py][INFO] Epoch:[0/2](407500/4588595) loss:2.788 lr:0.0000100 epoch_Time:26507.0min: [2024-01-04 12:43:16,957][model8_pretrain.py][INFO] Epoch:[0/2](407500/4588595) loss:2.337 lr:0.0000100 epoch_Time:26507.0min: [2024-01-04 
12:43:16,957][model8_pretrain.py][INFO] Epoch:[0/2](407500/4588595) loss:3.211 lr:0.0000100 epoch_Time:26507.0min: [2024-01-04 12:43:16,957][model8_pretrain.py][INFO] Epoch:[0/2](407500/4588595) loss:3.084 lr:0.0000100 epoch_Time:26507.0min: [2024-01-04 12:43:16,957][model8_pretrain.py][INFO] Epoch:[0/2](407500/4588595) loss:2.401 lr:0.0000100 epoch_Time:26507.0min: [2024-01-04 12:43:16,957][model8_pretrain.py][INFO] Epoch:[0/2](407500/4588595) loss:2.594 lr:0.0000100 epoch_Time:26507.0min: [2024-01-04 12:43:53,926][model8_pretrain.py][INFO] Epoch:[0/2](407600/4588595) loss:3.035 lr:0.0000100 epoch_Time:26506.0min: [2024-01-04 12:43:53,926][model8_pretrain.py][INFO] Epoch:[0/2](407600/4588595) loss:2.781 lr:0.0000100 epoch_Time:26506.0min: [2024-01-04 12:43:53,926][model8_pretrain.py][INFO] Epoch:[0/2](407600/4588595) loss:2.903 lr:0.0000100 epoch_Time:26506.0min: [2024-01-04 12:43:53,926][model8_pretrain.py][INFO] Epoch:[0/2](407600/4588595) loss:3.224 lr:0.0000100 epoch_Time:26506.0min: [2024-01-04 12:43:53,926][model8_pretrain.py][INFO] Epoch:[0/2](407600/4588595) loss:2.986 lr:0.0000100 epoch_Time:26506.0min: [2024-01-04 12:43:53,926][model8_pretrain.py][INFO] Epoch:[0/2](407600/4588595) loss:3.086 lr:0.0000100 epoch_Time:26506.0min: [2024-01-04 12:43:53,926][model8_pretrain.py][INFO] Epoch:[0/2](407600/4588595) loss:2.823 lr:0.0000100 epoch_Time:26506.0min: [2024-01-04 12:43:53,927][model8_pretrain.py][INFO] Epoch:[0/2](407600/4588595) loss:2.730 lr:0.0000100 epoch_Time:26506.0min: [2024-01-04 12:44:30,876][model8_pretrain.py][INFO] Epoch:[0/2](407700/4588595) loss:2.630 lr:0.0000100 epoch_Time:26506.0min: [2024-01-04 12:44:30,876][model8_pretrain.py][INFO] Epoch:[0/2](407700/4588595) loss:3.481 lr:0.0000100 epoch_Time:26506.0min: [2024-01-04 12:44:30,876][model8_pretrain.py][INFO] Epoch:[0/2](407700/4588595) loss:2.350 lr:0.0000100 epoch_Time:26506.0min: [2024-01-04 12:44:30,876][model8_pretrain.py][INFO] Epoch:[0/2](407700/4588595) loss:3.298 lr:0.0000100 
epoch_Time:26506.0min: [2024-01-04 12:44:30,876][model8_pretrain.py][INFO] Epoch:[0/2](407700/4588595) loss:3.059 lr:0.0000100 epoch_Time:26506.0min: [2024-01-04 12:44:30,876][model8_pretrain.py][INFO] Epoch:[0/2](407700/4588595) loss:2.665 lr:0.0000100 epoch_Time:26506.0min: [2024-01-04 12:44:30,876][model8_pretrain.py][INFO] Epoch:[0/2](407700/4588595) loss:2.969 lr:0.0000100 epoch_Time:26506.0min: [2024-01-04 12:44:30,876][model8_pretrain.py][INFO] Epoch:[0/2](407700/4588595) loss:2.692 lr:0.0000100 epoch_Time:26506.0min: [2024-01-04 12:45:07,818][model8_pretrain.py][INFO] Epoch:[0/2](407800/4588595) loss:2.824 lr:0.0000100 epoch_Time:26505.0min: [2024-01-04 12:45:07,818][model8_pretrain.py][INFO] Epoch:[0/2](407800/4588595) loss:2.978 lr:0.0000100 epoch_Time:26505.0min: [2024-01-04 12:45:07,818][model8_pretrain.py][INFO] Epoch:[0/2](407800/4588595) loss:2.630 lr:0.0000100 epoch_Time:26505.0min: [2024-01-04 12:45:07,818][model8_pretrain.py][INFO] Epoch:[0/2](407800/4588595) loss:2.681 lr:0.0000100 epoch_Time:26505.0min: [2024-01-04 12:45:07,818][model8_pretrain.py][INFO] Epoch:[0/2](407800/4588595) loss:2.694 lr:0.0000100 epoch_Time:26505.0min: [2024-01-04 12:45:07,818][model8_pretrain.py][INFO] Epoch:[0/2](407800/4588595) loss:2.075 lr:0.0000100 epoch_Time:26505.0min: [2024-01-04 12:45:07,818][model8_pretrain.py][INFO] Epoch:[0/2](407800/4588595) loss:3.026 lr:0.0000100 epoch_Time:26505.0min: [2024-01-04 12:45:07,819][model8_pretrain.py][INFO] Epoch:[0/2](407800/4588595) loss:3.388 lr:0.0000100 epoch_Time:26505.0min: [2024-01-04 12:45:44,776][model8_pretrain.py][INFO] Epoch:[0/2](407900/4588595) loss:2.897 lr:0.0000100 epoch_Time:26505.0min: [2024-01-04 12:45:44,776][model8_pretrain.py][INFO] Epoch:[0/2](407900/4588595) loss:2.912 lr:0.0000100 epoch_Time:26505.0min: [2024-01-04 12:45:44,776][model8_pretrain.py][INFO] Epoch:[0/2](407900/4588595) loss:3.128 lr:0.0000100 epoch_Time:26505.0min: [2024-01-04 12:45:44,776][model8_pretrain.py][INFO] 
Epoch:[0/2](407900/4588595) loss:2.781 lr:0.0000100 epoch_Time:26505.0min: [2024-01-04 12:45:44,776][model8_pretrain.py][INFO] Epoch:[0/2](407900/4588595) loss:2.945 lr:0.0000100 epoch_Time:26505.0min: [2024-01-04 12:45:44,776][model8_pretrain.py][INFO] Epoch:[0/2](407900/4588595) loss:3.067 lr:0.0000100 epoch_Time:26505.0min: [2024-01-04 12:45:44,776][model8_pretrain.py][INFO] Epoch:[0/2](407900/4588595) loss:2.616 lr:0.0000100 epoch_Time:26505.0min: [2024-01-04 12:45:44,776][model8_pretrain.py][INFO] Epoch:[0/2](407900/4588595) loss:2.775 lr:0.0000100 epoch_Time:26505.0min: [2024-01-04 12:46:31,909][model8_pretrain.py][INFO] Epoch:[0/2](408000/4588595) loss:3.341 lr:0.0000100 epoch_Time:26505.0min: [2024-01-04 12:46:31,909][model8_pretrain.py][INFO] Epoch:[0/2](408000/4588595) loss:2.417 lr:0.0000100 epoch_Time:26505.0min: [2024-01-04 12:46:31,909][model8_pretrain.py][INFO] Epoch:[0/2](408000/4588595) loss:3.124 lr:0.0000100 epoch_Time:26505.0min: [2024-01-04 12:46:31,909][model8_pretrain.py][INFO] Epoch:[0/2](408000/4588595) loss:2.376 lr:0.0000100 epoch_Time:26505.0min: [2024-01-04 12:46:31,909][model8_pretrain.py][INFO] Epoch:[0/2](408000/4588595) loss:2.571 lr:0.0000100 epoch_Time:26505.0min: [2024-01-04 12:46:31,909][model8_pretrain.py][INFO] Epoch:[0/2](408000/4588595) loss:3.051 lr:0.0000100 epoch_Time:26505.0min: [2024-01-04 12:46:31,909][model8_pretrain.py][INFO] Epoch:[0/2](408000/4588595) loss:2.780 lr:0.0000100 epoch_Time:26505.0min: [2024-01-04 12:46:31,910][model8_pretrain.py][INFO] Epoch:[0/2](408000/4588595) loss:3.068 lr:0.0000100 epoch_Time:26505.0min: [2024-01-04 12:47:08,846][model8_pretrain.py][INFO] Epoch:[0/2](408100/4588595) loss:2.928 lr:0.0000100 epoch_Time:26504.0min: [2024-01-04 12:47:08,846][model8_pretrain.py][INFO] Epoch:[0/2](408100/4588595) loss:2.945 lr:0.0000100 epoch_Time:26504.0min: [2024-01-04 12:47:08,846][model8_pretrain.py][INFO] Epoch:[0/2](408100/4588595) loss:2.952 lr:0.0000100 epoch_Time:26504.0min: [2024-01-04 
12:47:08,846][model8_pretrain.py][INFO] Epoch:[0/2](408100/4588595) loss:2.612 lr:0.0000100 epoch_Time:26504.0min: [2024-01-04 12:47:08,846][model8_pretrain.py][INFO] Epoch:[0/2](408100/4588595) loss:3.180 lr:0.0000100 epoch_Time:26504.0min: [2024-01-04 12:47:08,846][model8_pretrain.py][INFO] Epoch:[0/2](408100/4588595) loss:2.609 lr:0.0000100 epoch_Time:26504.0min: [2024-01-04 12:47:08,846][model8_pretrain.py][INFO] Epoch:[0/2](408100/4588595) loss:3.002 lr:0.0000100 epoch_Time:26504.0min: [2024-01-04 12:47:08,846][model8_pretrain.py][INFO] Epoch:[0/2](408100/4588595) loss:2.381 lr:0.0000100 epoch_Time:26504.0min: [2024-01-04 12:47:45,809][model8_pretrain.py][INFO] Epoch:[0/2](408200/4588595) loss:2.956 lr:0.0000100 epoch_Time:26504.0min: [2024-01-04 12:47:45,809][model8_pretrain.py][INFO] Epoch:[0/2](408200/4588595) loss:2.615 lr:0.0000100 epoch_Time:26504.0min: [2024-01-04 12:47:45,809][model8_pretrain.py][INFO] Epoch:[0/2](408200/4588595) loss:3.082 lr:0.0000100 epoch_Time:26504.0min: [2024-01-04 12:47:45,809][model8_pretrain.py][INFO] Epoch:[0/2](408200/4588595) loss:2.176 lr:0.0000100 epoch_Time:26504.0min: [2024-01-04 12:47:45,809][model8_pretrain.py][INFO] Epoch:[0/2](408200/4588595) loss:2.760 lr:0.0000100 epoch_Time:26504.0min: [2024-01-04 12:47:45,809][model8_pretrain.py][INFO] Epoch:[0/2](408200/4588595) loss:2.631 lr:0.0000100 epoch_Time:26504.0min: [2024-01-04 12:47:45,809][model8_pretrain.py][INFO] Epoch:[0/2](408200/4588595) loss:2.346 lr:0.0000100 epoch_Time:26504.0min: [2024-01-04 12:47:45,809][model8_pretrain.py][INFO] Epoch:[0/2](408200/4588595) loss:2.626 lr:0.0000100 epoch_Time:26504.0min: [2024-01-04 12:48:22,749][model8_pretrain.py][INFO] Epoch:[0/2](408300/4588595) loss:3.248 lr:0.0000100 epoch_Time:26503.0min: [2024-01-04 12:48:22,749][model8_pretrain.py][INFO] Epoch:[0/2](408300/4588595) loss:2.658 lr:0.0000100 epoch_Time:26503.0min: [2024-01-04 12:48:22,749][model8_pretrain.py][INFO] Epoch:[0/2](408300/4588595) loss:3.191 lr:0.0000100 
epoch_Time:26503.0min: [2024-01-04 12:48:22,749][model8_pretrain.py][INFO] Epoch:[0/2](408300/4588595) loss:3.266 lr:0.0000100 epoch_Time:26503.0min: [2024-01-04 12:48:22,749][model8_pretrain.py][INFO] Epoch:[0/2](408300/4588595) loss:2.951 lr:0.0000100 epoch_Time:26503.0min: [2024-01-04 12:48:22,749][model8_pretrain.py][INFO] Epoch:[0/2](408300/4588595) loss:3.244 lr:0.0000100 epoch_Time:26503.0min: [2024-01-04 12:48:22,750][model8_pretrain.py][INFO] Epoch:[0/2](408300/4588595) loss:2.904 lr:0.0000100 epoch_Time:26503.0min: [2024-01-04 12:48:22,750][model8_pretrain.py][INFO] Epoch:[0/2](408300/4588595) loss:2.716 lr:0.0000100 epoch_Time:26503.0min: [2024-01-04 12:48:59,700][model8_pretrain.py][INFO] Epoch:[0/2](408400/4588595) loss:2.792 lr:0.0000100 epoch_Time:26502.0min: [2024-01-04 12:48:59,700][model8_pretrain.py][INFO] Epoch:[0/2](408400/4588595) loss:3.088 lr:0.0000100 epoch_Time:26502.0min: [2024-01-04 12:48:59,700][model8_pretrain.py][INFO] Epoch:[0/2](408400/4588595) loss:2.529 lr:0.0000100 epoch_Time:26502.0min: [2024-01-04 12:48:59,700][model8_pretrain.py][INFO] Epoch:[0/2](408400/4588595) loss:3.078 lr:0.0000100 epoch_Time:26502.0min: [2024-01-04 12:48:59,700][model8_pretrain.py][INFO] Epoch:[0/2](408400/4588595) loss:3.021 lr:0.0000100 epoch_Time:26502.0min: [2024-01-04 12:48:59,700][model8_pretrain.py][INFO] Epoch:[0/2](408400/4588595) loss:3.139 lr:0.0000100 epoch_Time:26502.0min: [2024-01-04 12:48:59,700][model8_pretrain.py][INFO] Epoch:[0/2](408400/4588595) loss:3.307 lr:0.0000100 epoch_Time:26502.0min: [2024-01-04 12:48:59,700][model8_pretrain.py][INFO] Epoch:[0/2](408400/4588595) loss:3.203 lr:0.0000100 epoch_Time:26502.0min: [2024-01-04 12:49:36,649][model8_pretrain.py][INFO] Epoch:[0/2](408500/4588595) loss:3.415 lr:0.0000100 epoch_Time:26501.0min: [2024-01-04 12:49:36,649][model8_pretrain.py][INFO] Epoch:[0/2](408500/4588595) loss:2.787 lr:0.0000100 epoch_Time:26501.0min: [2024-01-04 12:49:36,649][model8_pretrain.py][INFO] 
Epoch:[0/2](408500/4588595) loss:3.281 lr:0.0000100 epoch_Time:26501.0min: [2024-01-04 12:49:36,649][model8_pretrain.py][INFO] Epoch:[0/2](408500/4588595) loss:2.892 lr:0.0000100 epoch_Time:26501.0min: [2024-01-04 12:49:36,649][model8_pretrain.py][INFO] Epoch:[0/2](408500/4588595) loss:2.588 lr:0.0000100 epoch_Time:26501.0min: [2024-01-04 12:49:36,649][model8_pretrain.py][INFO] Epoch:[0/2](408500/4588595) loss:3.058 lr:0.0000100 epoch_Time:26501.0min: [2024-01-04 12:49:36,649][model8_pretrain.py][INFO] Epoch:[0/2](408500/4588595) loss:2.922 lr:0.0000100 epoch_Time:26501.0min: [2024-01-04 12:49:36,649][model8_pretrain.py][INFO] Epoch:[0/2](408500/4588595) loss:2.922 lr:0.0000100 epoch_Time:26501.0min: [2024-01-04 12:50:13,601][model8_pretrain.py][INFO] Epoch:[0/2](408600/4588595) loss:2.388 lr:0.0000100 epoch_Time:26500.0min: [2024-01-04 12:50:13,601][model8_pretrain.py][INFO] Epoch:[0/2](408600/4588595) loss:3.082 lr:0.0000100 epoch_Time:26500.0min: [2024-01-04 12:50:13,601][model8_pretrain.py][INFO] Epoch:[0/2](408600/4588595) loss:3.176 lr:0.0000100 epoch_Time:26500.0min: [2024-01-04 12:50:13,601][model8_pretrain.py][INFO] Epoch:[0/2](408600/4588595) loss:3.195 lr:0.0000100 epoch_Time:26500.0min: [2024-01-04 12:50:13,601][model8_pretrain.py][INFO] Epoch:[0/2](408600/4588595) loss:2.579 lr:0.0000100 epoch_Time:26500.0min: [2024-01-04 12:50:13,601][model8_pretrain.py][INFO] Epoch:[0/2](408600/4588595) loss:3.393 lr:0.0000100 epoch_Time:26500.0min: [2024-01-04 12:50:13,601][model8_pretrain.py][INFO] Epoch:[0/2](408600/4588595) loss:3.117 lr:0.0000100 epoch_Time:26500.0min: [2024-01-04 12:50:13,601][model8_pretrain.py][INFO] Epoch:[0/2](408600/4588595) loss:2.904 lr:0.0000100 epoch_Time:26500.0min: [2024-01-04 12:50:50,539][model8_pretrain.py][INFO] Epoch:[0/2](408700/4588595) loss:2.776 lr:0.0000100 epoch_Time:26499.0min: [2024-01-04 12:50:50,539][model8_pretrain.py][INFO] Epoch:[0/2](408700/4588595) loss:2.634 lr:0.0000100 epoch_Time:26499.0min: [2024-01-04 
12:50:50,539][model8_pretrain.py][INFO] Epoch:[0/2](408700/4588595) loss:3.155 lr:0.0000100 epoch_Time:26499.0min: [2024-01-04 12:50:50,539][model8_pretrain.py][INFO] Epoch:[0/2](408700/4588595) loss:2.592 lr:0.0000100 epoch_Time:26499.0min: [2024-01-04 12:50:50,539][model8_pretrain.py][INFO] Epoch:[0/2](408700/4588595) loss:3.065 lr:0.0000100 epoch_Time:26499.0min: [2024-01-04 12:50:50,539][model8_pretrain.py][INFO] Epoch:[0/2](408700/4588595) loss:2.637 lr:0.0000100 epoch_Time:26499.0min: [2024-01-04 12:50:50,540][model8_pretrain.py][INFO] Epoch:[0/2](408700/4588595) loss:2.892 lr:0.0000100 epoch_Time:26499.0min: [2024-01-04 12:50:50,540][model8_pretrain.py][INFO] Epoch:[0/2](408700/4588595) loss:3.436 lr:0.0000100 epoch_Time:26499.0min: [2024-01-04 12:51:35,931][model8_pretrain.py][INFO] Epoch:[0/2](408800/4588595) loss:2.699 lr:0.0000100 epoch_Time:26500.0min: [2024-01-04 12:51:35,931][model8_pretrain.py][INFO] Epoch:[0/2](408800/4588595) loss:3.050 lr:0.0000100 epoch_Time:26500.0min: [2024-01-04 12:51:35,931][model8_pretrain.py][INFO] Epoch:[0/2](408800/4588595) loss:2.749 lr:0.0000100 epoch_Time:26500.0min: [2024-01-04 12:51:35,931][model8_pretrain.py][INFO] Epoch:[0/2](408800/4588595) loss:3.228 lr:0.0000100 epoch_Time:26500.0min: [2024-01-04 12:51:35,931][model8_pretrain.py][INFO] Epoch:[0/2](408800/4588595) loss:2.977 lr:0.0000100 epoch_Time:26500.0min: [2024-01-04 12:51:35,931][model8_pretrain.py][INFO] Epoch:[0/2](408800/4588595) loss:3.317 lr:0.0000100 epoch_Time:26500.0min: [2024-01-04 12:51:35,931][model8_pretrain.py][INFO] Epoch:[0/2](408800/4588595) loss:3.012 lr:0.0000100 epoch_Time:26500.0min: [2024-01-04 12:51:35,931][model8_pretrain.py][INFO] Epoch:[0/2](408800/4588595) loss:3.058 lr:0.0000100 epoch_Time:26500.0min: [2024-01-04 12:52:14,565][model8_pretrain.py][INFO] Epoch:[0/2](408900/4588595) loss:2.761 lr:0.0000100 epoch_Time:26499.0min: [2024-01-04 12:52:14,565][model8_pretrain.py][INFO] Epoch:[0/2](408900/4588595) loss:3.376 lr:0.0000100 
epoch_Time:26499.0min: [2024-01-04 12:52:14,565][model8_pretrain.py][INFO] Epoch:[0/2](408900/4588595) loss:2.575 lr:0.0000100 epoch_Time:26499.0min: [2024-01-04 12:52:14,565][model8_pretrain.py][INFO] Epoch:[0/2](408900/4588595) loss:2.843 lr:0.0000100 epoch_Time:26499.0min: [2024-01-04 12:52:14,565][model8_pretrain.py][INFO] Epoch:[0/2](408900/4588595) loss:2.431 lr:0.0000100 epoch_Time:26499.0min: [2024-01-04 12:52:14,565][model8_pretrain.py][INFO] Epoch:[0/2](408900/4588595) loss:2.656 lr:0.0000100 epoch_Time:26499.0min: [2024-01-04 12:52:14,565][model8_pretrain.py][INFO] Epoch:[0/2](408900/4588595) loss:3.056 lr:0.0000100 epoch_Time:26499.0min: [2024-01-04 12:52:14,565][model8_pretrain.py][INFO] Epoch:[0/2](408900/4588595) loss:3.172 lr:0.0000100 epoch_Time:26499.0min: [2024-01-04 12:52:51,502][model8_pretrain.py][INFO] Epoch:[0/2](409000/4588595) loss:2.334 lr:0.0000100 epoch_Time:26498.0min: [2024-01-04 12:52:51,502][model8_pretrain.py][INFO] Epoch:[0/2](409000/4588595) loss:2.712 lr:0.0000100 epoch_Time:26498.0min: [2024-01-04 12:52:51,502][model8_pretrain.py][INFO] Epoch:[0/2](409000/4588595) loss:2.977 lr:0.0000100 epoch_Time:26498.0min: [2024-01-04 12:52:51,502][model8_pretrain.py][INFO] Epoch:[0/2](409000/4588595) loss:1.828 lr:0.0000100 epoch_Time:26498.0min: [2024-01-04 12:52:51,502][model8_pretrain.py][INFO] Epoch:[0/2](409000/4588595) loss:2.791 lr:0.0000100 epoch_Time:26498.0min: [2024-01-04 12:52:51,502][model8_pretrain.py][INFO] Epoch:[0/2](409000/4588595) loss:3.029 lr:0.0000100 epoch_Time:26498.0min: [2024-01-04 12:52:51,502][model8_pretrain.py][INFO] Epoch:[0/2](409000/4588595) loss:2.754 lr:0.0000100 epoch_Time:26498.0min: [2024-01-04 12:52:51,502][model8_pretrain.py][INFO] Epoch:[0/2](409000/4588595) loss:2.720 lr:0.0000100 epoch_Time:26498.0min: [2024-01-04 12:53:28,444][model8_pretrain.py][INFO] Epoch:[0/2](409100/4588595) loss:2.466 lr:0.0000100 epoch_Time:26498.0min: [2024-01-04 12:53:28,445][model8_pretrain.py][INFO] 
Epoch:[0/2](409100/4588595) loss:3.261 lr:0.0000100 epoch_Time:26498.0min: [2024-01-04 12:53:28,445][model8_pretrain.py][INFO] Epoch:[0/2](409100/4588595) loss:2.562 lr:0.0000100 epoch_Time:26498.0min: [2024-01-04 12:53:28,445][model8_pretrain.py][INFO] Epoch:[0/2](409100/4588595) loss:2.810 lr:0.0000100 epoch_Time:26498.0min: [2024-01-04 12:53:28,445][model8_pretrain.py][INFO] Epoch:[0/2](409100/4588595) loss:2.898 lr:0.0000100 epoch_Time:26498.0min: [2024-01-04 12:53:28,445][model8_pretrain.py][INFO] Epoch:[0/2](409100/4588595) loss:2.741 lr:0.0000100 epoch_Time:26498.0min: [2024-01-04 12:53:28,445][model8_pretrain.py][INFO] Epoch:[0/2](409100/4588595) loss:3.095 lr:0.0000100 epoch_Time:26498.0min: [2024-01-04 12:53:28,445][model8_pretrain.py][INFO] Epoch:[0/2](409100/4588595) loss:2.838 lr:0.0000100 epoch_Time:26498.0min: [2024-01-04 12:54:05,390][model8_pretrain.py][INFO] Epoch:[0/2](409200/4588595) loss:2.797 lr:0.0000100 epoch_Time:26497.0min: [2024-01-04 12:54:05,390][model8_pretrain.py][INFO] Epoch:[0/2](409200/4588595) loss:2.482 lr:0.0000100 epoch_Time:26497.0min: [2024-01-04 12:54:05,390][model8_pretrain.py][INFO] Epoch:[0/2](409200/4588595) loss:3.080 lr:0.0000100 epoch_Time:26497.0min: [2024-01-04 12:54:05,390][model8_pretrain.py][INFO] Epoch:[0/2](409200/4588595) loss:2.949 lr:0.0000100 epoch_Time:26497.0min: [2024-01-04 12:54:05,390][model8_pretrain.py][INFO] Epoch:[0/2](409200/4588595) loss:3.159 lr:0.0000100 epoch_Time:26497.0min: [2024-01-04 12:54:05,390][model8_pretrain.py][INFO] Epoch:[0/2](409200/4588595) loss:3.292 lr:0.0000100 epoch_Time:26497.0min: [2024-01-04 12:54:05,390][model8_pretrain.py][INFO] Epoch:[0/2](409200/4588595) loss:2.765 lr:0.0000100 epoch_Time:26497.0min: [2024-01-04 12:54:05,390][model8_pretrain.py][INFO] Epoch:[0/2](409200/4588595) loss:3.011 lr:0.0000100 epoch_Time:26497.0min: [2024-01-04 12:54:42,332][model8_pretrain.py][INFO] Epoch:[0/2](409300/4588595) loss:3.365 lr:0.0000100 epoch_Time:26497.0min: [2024-01-04 
12:54:42,332][model8_pretrain.py][INFO] Epoch:[0/2](409300/4588595) loss:2.644 lr:0.0000100 epoch_Time:26497.0min: [2024-01-04 12:54:42,332][model8_pretrain.py][INFO] Epoch:[0/2](409300/4588595) loss:3.239 lr:0.0000100 epoch_Time:26497.0min: [2024-01-04 12:54:42,332][model8_pretrain.py][INFO] Epoch:[0/2](409300/4588595) loss:3.154 lr:0.0000100 epoch_Time:26497.0min: [2024-01-04 12:54:42,332][model8_pretrain.py][INFO] Epoch:[0/2](409300/4588595) loss:2.779 lr:0.0000100 epoch_Time:26497.0min: [2024-01-04 12:54:42,332][model8_pretrain.py][INFO] Epoch:[0/2](409300/4588595) loss:2.651 lr:0.0000100 epoch_Time:26497.0min: [2024-01-04 12:54:42,332][model8_pretrain.py][INFO] Epoch:[0/2](409300/4588595) loss:2.853 lr:0.0000100 epoch_Time:26497.0min: [2024-01-04 12:54:42,332][model8_pretrain.py][INFO] Epoch:[0/2](409300/4588595) loss:2.881 lr:0.0000100 epoch_Time:26497.0min: [2024-01-04 12:55:19,266][model8_pretrain.py][INFO] Epoch:[0/2](409400/4588595) loss:2.618 lr:0.0000100 epoch_Time:26495.0min: [2024-01-04 12:55:19,266][model8_pretrain.py][INFO] Epoch:[0/2](409400/4588595) loss:2.256 lr:0.0000100 epoch_Time:26495.0min: [2024-01-04 12:55:19,266][model8_pretrain.py][INFO] Epoch:[0/2](409400/4588595) loss:3.324 lr:0.0000100 epoch_Time:26495.0min: [2024-01-04 12:55:19,266][model8_pretrain.py][INFO] Epoch:[0/2](409400/4588595) loss:2.894 lr:0.0000100 epoch_Time:26495.0min: [2024-01-04 12:55:19,266][model8_pretrain.py][INFO] Epoch:[0/2](409400/4588595) loss:2.867 lr:0.0000100 epoch_Time:26495.0min: [2024-01-04 12:55:19,267][model8_pretrain.py][INFO] Epoch:[0/2](409400/4588595) loss:3.147 lr:0.0000100 epoch_Time:26495.0min: [2024-01-04 12:55:19,267][model8_pretrain.py][INFO] Epoch:[0/2](409400/4588595) loss:2.694 lr:0.0000100 epoch_Time:26495.0min: [2024-01-04 12:55:19,267][model8_pretrain.py][INFO] Epoch:[0/2](409400/4588595) loss:3.305 lr:0.0000100 epoch_Time:26495.0min: [2024-01-04 12:55:56,189][model8_pretrain.py][INFO] Epoch:[0/2](409500/4588595) loss:2.707 lr:0.0000100 
epoch_Time:26494.0min: [2024-01-04 12:55:56,189][model8_pretrain.py][INFO] Epoch:[0/2](409500/4588595) loss:2.941 lr:0.0000100 epoch_Time:26494.0min: [2024-01-04 12:55:56,190][model8_pretrain.py][INFO] Epoch:[0/2](409500/4588595) loss:2.689 lr:0.0000100 epoch_Time:26494.0min: [2024-01-04 12:55:56,190][model8_pretrain.py][INFO] Epoch:[0/2](409500/4588595) loss:2.935 lr:0.0000100 epoch_Time:26494.0min: [2024-01-04 12:55:56,190][model8_pretrain.py][INFO] Epoch:[0/2](409500/4588595) loss:2.552 lr:0.0000100 epoch_Time:26494.0min: [2024-01-04 12:55:56,190][model8_pretrain.py][INFO] Epoch:[0/2](409500/4588595) loss:3.149 lr:0.0000100 epoch_Time:26494.0min: [2024-01-04 12:55:56,190][model8_pretrain.py][INFO] Epoch:[0/2](409500/4588595) loss:2.907 lr:0.0000100 epoch_Time:26494.0min: [2024-01-04 12:55:56,190][model8_pretrain.py][INFO] Epoch:[0/2](409500/4588595) loss:2.910 lr:0.0000100 epoch_Time:26494.0min: [2024-01-04 12:56:41,592][model8_pretrain.py][INFO] Epoch:[0/2](409600/4588595) loss:3.104 lr:0.0000100 epoch_Time:26496.0min: [2024-01-04 12:56:41,592][model8_pretrain.py][INFO] Epoch:[0/2](409600/4588595) loss:2.772 lr:0.0000100 epoch_Time:26496.0min: [2024-01-04 12:56:41,592][model8_pretrain.py][INFO] Epoch:[0/2](409600/4588595) loss:2.687 lr:0.0000100 epoch_Time:26496.0min: [2024-01-04 12:56:41,592][model8_pretrain.py][INFO] Epoch:[0/2](409600/4588595) loss:3.121 lr:0.0000100 epoch_Time:26496.0min: [2024-01-04 12:56:41,593][model8_pretrain.py][INFO] Epoch:[0/2](409600/4588595) loss:2.681 lr:0.0000100 epoch_Time:26496.0min: [2024-01-04 12:56:41,593][model8_pretrain.py][INFO] Epoch:[0/2](409600/4588595) loss:3.101 lr:0.0000100 epoch_Time:26496.0min: [2024-01-04 12:56:41,593][model8_pretrain.py][INFO] Epoch:[0/2](409600/4588595) loss:3.203 lr:0.0000100 epoch_Time:26496.0min: [2024-01-04 12:56:41,593][model8_pretrain.py][INFO] Epoch:[0/2](409600/4588595) loss:3.155 lr:0.0000100 epoch_Time:26496.0min: [2024-01-04 12:57:20,205][model8_pretrain.py][INFO] 
Epoch:[0/2](409700/4588595) loss:2.687 lr:0.0000100 epoch_Time:26495.0min: [2024-01-04 12:57:20,205][model8_pretrain.py][INFO] Epoch:[0/2](409700/4588595) loss:2.767 lr:0.0000100 epoch_Time:26495.0min: [2024-01-04 12:57:20,205][model8_pretrain.py][INFO] Epoch:[0/2](409700/4588595) loss:2.667 lr:0.0000100 epoch_Time:26495.0min: [2024-01-04 12:57:20,205][model8_pretrain.py][INFO] Epoch:[0/2](409700/4588595) loss:3.316 lr:0.0000100 epoch_Time:26495.0min: [2024-01-04 12:57:20,205][model8_pretrain.py][INFO] Epoch:[0/2](409700/4588595) loss:2.731 lr:0.0000100 epoch_Time:26495.0min: [2024-01-04 12:57:20,205][model8_pretrain.py][INFO] Epoch:[0/2](409700/4588595) loss:2.383 lr:0.0000100 epoch_Time:26495.0min: [2024-01-04 12:57:20,205][model8_pretrain.py][INFO] Epoch:[0/2](409700/4588595) loss:2.954 lr:0.0000100 epoch_Time:26495.0min: [2024-01-04 12:57:20,205][model8_pretrain.py][INFO] Epoch:[0/2](409700/4588595) loss:2.854 lr:0.0000100 epoch_Time:26495.0min: [2024-01-04 12:57:57,105][model8_pretrain.py][INFO] Epoch:[0/2](409800/4588595) loss:3.435 lr:0.0000100 epoch_Time:26493.0min: [2024-01-04 12:57:57,105][model8_pretrain.py][INFO] Epoch:[0/2](409800/4588595) loss:2.174 lr:0.0000100 epoch_Time:26493.0min: [2024-01-04 12:57:57,105][model8_pretrain.py][INFO] Epoch:[0/2](409800/4588595) loss:3.386 lr:0.0000100 epoch_Time:26493.0min: [2024-01-04 12:57:57,105][model8_pretrain.py][INFO] Epoch:[0/2](409800/4588595) loss:2.726 lr:0.0000100 epoch_Time:26493.0min: [2024-01-04 12:57:57,105][model8_pretrain.py][INFO] Epoch:[0/2](409800/4588595) loss:2.630 lr:0.0000100 epoch_Time:26493.0min: [2024-01-04 12:57:57,105][model8_pretrain.py][INFO] Epoch:[0/2](409800/4588595) loss:2.241 lr:0.0000100 epoch_Time:26493.0min: [2024-01-04 12:57:57,105][model8_pretrain.py][INFO] Epoch:[0/2](409800/4588595) loss:2.834 lr:0.0000100 epoch_Time:26493.0min: [2024-01-04 12:57:57,105][model8_pretrain.py][INFO] Epoch:[0/2](409800/4588595) loss:3.085 lr:0.0000100 epoch_Time:26493.0min: [2024-01-04 
12:58:34,047][model8_pretrain.py][INFO] Epoch:[0/2](409900/4588595) loss:3.221 lr:0.0000100 epoch_Time:26493.0min: [2024-01-04 12:58:34,047][model8_pretrain.py][INFO] Epoch:[0/2](409900/4588595) loss:3.332 lr:0.0000100 epoch_Time:26493.0min: [2024-01-04 12:58:34,047][model8_pretrain.py][INFO] Epoch:[0/2](409900/4588595) loss:3.225 lr:0.0000100 epoch_Time:26493.0min: [2024-01-04 12:58:34,047][model8_pretrain.py][INFO] Epoch:[0/2](409900/4588595) loss:2.567 lr:0.0000100 epoch_Time:26493.0min: [2024-01-04 12:58:34,047][model8_pretrain.py][INFO] Epoch:[0/2](409900/4588595) loss:3.347 lr:0.0000100 epoch_Time:26493.0min: [2024-01-04 12:58:34,048][model8_pretrain.py][INFO] Epoch:[0/2](409900/4588595) loss:2.966 lr:0.0000100 epoch_Time:26493.0min: [2024-01-04 12:58:34,048][model8_pretrain.py][INFO] Epoch:[0/2](409900/4588595) loss:2.952 lr:0.0000100 epoch_Time:26493.0min: [2024-01-04 12:58:34,049][model8_pretrain.py][INFO] Epoch:[0/2](409900/4588595) loss:3.258 lr:0.0000100 epoch_Time:26493.0min: [2024-01-04 12:59:11,049][model8_pretrain.py][INFO] Epoch:[0/2](410000/4588595) loss:2.793 lr:0.0000100 epoch_Time:26492.0min: [2024-01-04 12:59:11,049][model8_pretrain.py][INFO] Epoch:[0/2](410000/4588595) loss:3.083 lr:0.0000100 epoch_Time:26492.0min: [2024-01-04 12:59:11,049][model8_pretrain.py][INFO] Epoch:[0/2](410000/4588595) loss:2.984 lr:0.0000100 epoch_Time:26492.0min: [2024-01-04 12:59:11,049][model8_pretrain.py][INFO] Epoch:[0/2](410000/4588595) loss:2.799 lr:0.0000100 epoch_Time:26492.0min: [2024-01-04 12:59:11,049][model8_pretrain.py][INFO] Epoch:[0/2](410000/4588595) loss:2.844 lr:0.0000100 epoch_Time:26492.0min: [2024-01-04 12:59:11,049][model8_pretrain.py][INFO] Epoch:[0/2](410000/4588595) loss:2.035 lr:0.0000100 epoch_Time:26492.0min: [2024-01-04 12:59:11,049][model8_pretrain.py][INFO] Epoch:[0/2](410000/4588595) loss:3.418 lr:0.0000100 epoch_Time:26492.0min: [2024-01-04 12:59:11,049][model8_pretrain.py][INFO] Epoch:[0/2](410000/4588595) loss:3.126 lr:0.0000100 
epoch_Time:26492.0min: [2024-01-04 12:59:47,979][model8_pretrain.py][INFO] Epoch:[0/2](410100/4588595) loss:3.210 lr:0.0000100 epoch_Time:26491.0min: [2024-01-04 12:59:47,980][model8_pretrain.py][INFO] Epoch:[0/2](410100/4588595) loss:2.989 lr:0.0000100 epoch_Time:26491.0min: [2024-01-04 12:59:47,980][model8_pretrain.py][INFO] Epoch:[0/2](410100/4588595) loss:3.032 lr:0.0000100 epoch_Time:26491.0min: [2024-01-04 12:59:47,980][model8_pretrain.py][INFO] Epoch:[0/2](410100/4588595) loss:2.712 lr:0.0000100 epoch_Time:26491.0min: [2024-01-04 12:59:47,980][model8_pretrain.py][INFO] Epoch:[0/2](410100/4588595) loss:2.429 lr:0.0000100 epoch_Time:26491.0min: [2024-01-04 12:59:47,980][model8_pretrain.py][INFO] Epoch:[0/2](410100/4588595) loss:3.262 lr:0.0000100 epoch_Time:26491.0min: [2024-01-04 12:59:47,980][model8_pretrain.py][INFO] Epoch:[0/2](410100/4588595) loss:3.260 lr:0.0000100 epoch_Time:26491.0min: [2024-01-04 12:59:47,980][model8_pretrain.py][INFO] Epoch:[0/2](410100/4588595) loss:3.233 lr:0.0000100 epoch_Time:26491.0min: [2024-01-04 13:00:24,926][model8_pretrain.py][INFO] Epoch:[0/2](410200/4588595) loss:2.844 lr:0.0000100 epoch_Time:26491.0min: [2024-01-04 13:00:24,926][model8_pretrain.py][INFO] Epoch:[0/2](410200/4588595) loss:2.889 lr:0.0000100 epoch_Time:26491.0min: [2024-01-04 13:00:24,926][model8_pretrain.py][INFO] Epoch:[0/2](410200/4588595) loss:2.792 lr:0.0000100 epoch_Time:26491.0min: [2024-01-04 13:00:24,926][model8_pretrain.py][INFO] Epoch:[0/2](410200/4588595) loss:2.836 lr:0.0000100 epoch_Time:26491.0min: [2024-01-04 13:00:24,926][model8_pretrain.py][INFO] Epoch:[0/2](410200/4588595) loss:2.931 lr:0.0000100 epoch_Time:26491.0min: [2024-01-04 13:00:24,926][model8_pretrain.py][INFO] Epoch:[0/2](410200/4588595) loss:2.576 lr:0.0000100 epoch_Time:26491.0min: [2024-01-04 13:00:24,926][model8_pretrain.py][INFO] Epoch:[0/2](410200/4588595) loss:3.003 lr:0.0000100 epoch_Time:26491.0min: [2024-01-04 13:00:24,926][model8_pretrain.py][INFO] 
Epoch:[0/2](410200/4588595) loss:2.536 lr:0.0000100 epoch_Time:26491.0min: [2024-01-04 13:01:01,851][model8_pretrain.py][INFO] Epoch:[0/2](410300/4588595) loss:2.653 lr:0.0000100 epoch_Time:26489.0min: [2024-01-04 13:01:01,851][model8_pretrain.py][INFO] Epoch:[0/2](410300/4588595) loss:2.920 lr:0.0000100 epoch_Time:26489.0min: [2024-01-04 13:01:01,851][model8_pretrain.py][INFO] Epoch:[0/2](410300/4588595) loss:2.520 lr:0.0000100 epoch_Time:26489.0min: [2024-01-04 13:01:01,851][model8_pretrain.py][INFO] Epoch:[0/2](410300/4588595) loss:2.993 lr:0.0000100 epoch_Time:26489.0min: [2024-01-04 13:01:01,851][model8_pretrain.py][INFO] Epoch:[0/2](410300/4588595) loss:2.789 lr:0.0000100 epoch_Time:26489.0min: [2024-01-04 13:01:01,851][model8_pretrain.py][INFO] Epoch:[0/2](410300/4588595) loss:2.975 lr:0.0000100 epoch_Time:26489.0min: [2024-01-04 13:01:01,851][model8_pretrain.py][INFO] Epoch:[0/2](410300/4588595) loss:3.113 lr:0.0000100 epoch_Time:26489.0min: [2024-01-04 13:01:01,851][model8_pretrain.py][INFO] Epoch:[0/2](410300/4588595) loss:2.797 lr:0.0000100 epoch_Time:26489.0min: [2024-01-04 13:01:43,820][model8_pretrain.py][INFO] Epoch:[0/2](410400/4588595) loss:2.541 lr:0.0000100 epoch_Time:26490.0min: [2024-01-04 13:01:43,820][model8_pretrain.py][INFO] Epoch:[0/2](410400/4588595) loss:2.991 lr:0.0000100 epoch_Time:26490.0min: [2024-01-04 13:01:43,820][model8_pretrain.py][INFO] Epoch:[0/2](410400/4588595) loss:3.067 lr:0.0000100 epoch_Time:26490.0min: [2024-01-04 13:01:43,820][model8_pretrain.py][INFO] Epoch:[0/2](410400/4588595) loss:3.240 lr:0.0000100 epoch_Time:26490.0min: [2024-01-04 13:01:43,820][model8_pretrain.py][INFO] Epoch:[0/2](410400/4588595) loss:2.487 lr:0.0000100 epoch_Time:26490.0min: [2024-01-04 13:01:43,820][model8_pretrain.py][INFO] Epoch:[0/2](410400/4588595) loss:3.232 lr:0.0000100 epoch_Time:26490.0min: [2024-01-04 13:01:43,820][model8_pretrain.py][INFO] Epoch:[0/2](410400/4588595) loss:2.716 lr:0.0000100 epoch_Time:26490.0min: [2024-01-04 
13:01:45,509][model8_pretrain.py][INFO] Epoch:[0/2](410400/4588595) loss:3.321 lr:0.0000100 epoch_Time:26490.0min: [2024-01-04 13:02:25,916][model8_pretrain.py][INFO] Epoch:[0/2](410500/4588595) loss:2.875 lr:0.0000100 epoch_Time:26490.0min: [2024-01-04 13:02:25,916][model8_pretrain.py][INFO] Epoch:[0/2](410500/4588595) loss:2.858 lr:0.0000100 epoch_Time:26490.0min: [2024-01-04 13:02:25,916][model8_pretrain.py][INFO] Epoch:[0/2](410500/4588595) loss:2.977 lr:0.0000100 epoch_Time:26490.0min: [2024-01-04 13:02:25,916][model8_pretrain.py][INFO] Epoch:[0/2](410500/4588595) loss:3.012 lr:0.0000100 epoch_Time:26490.0min: [2024-01-04 13:02:25,916][model8_pretrain.py][INFO] Epoch:[0/2](410500/4588595) loss:2.844 lr:0.0000100 epoch_Time:26490.0min: [2024-01-04 13:02:25,916][model8_pretrain.py][INFO] Epoch:[0/2](410500/4588595) loss:3.057 lr:0.0000100 epoch_Time:26490.0min: [2024-01-04 13:02:25,916][model8_pretrain.py][INFO] Epoch:[0/2](410500/4588595) loss:3.383 lr:0.0000100 epoch_Time:26490.0min: [2024-01-04 13:02:25,916][model8_pretrain.py][INFO] Epoch:[0/2](410500/4588595) loss:2.925 lr:0.0000100 epoch_Time:26490.0min: [2024-01-04 13:03:02,860][model8_pretrain.py][INFO] Epoch:[0/2](410600/4588595) loss:2.511 lr:0.0000100 epoch_Time:26489.0min: [2024-01-04 13:03:02,860][model8_pretrain.py][INFO] Epoch:[0/2](410600/4588595) loss:3.180 lr:0.0000100 epoch_Time:26489.0min: [2024-01-04 13:03:02,860][model8_pretrain.py][INFO] Epoch:[0/2](410600/4588595) loss:2.865 lr:0.0000100 epoch_Time:26489.0min: [2024-01-04 13:03:02,860][model8_pretrain.py][INFO] Epoch:[0/2](410600/4588595) loss:3.685 lr:0.0000100 epoch_Time:26489.0min: [2024-01-04 13:03:02,860][model8_pretrain.py][INFO] Epoch:[0/2](410600/4588595) loss:2.440 lr:0.0000100 epoch_Time:26489.0min: [2024-01-04 13:03:02,860][model8_pretrain.py][INFO] Epoch:[0/2](410600/4588595) loss:2.732 lr:0.0000100 epoch_Time:26489.0min: [2024-01-04 13:03:02,860][model8_pretrain.py][INFO] Epoch:[0/2](410600/4588595) loss:3.079 lr:0.0000100 
epoch_Time:26489.0min: [2024-01-04 13:03:02,861][model8_pretrain.py][INFO] Epoch:[0/2](410600/4588595) loss:3.176 lr:0.0000100 epoch_Time:26489.0min: [2024-01-04 13:03:39,798][model8_pretrain.py][INFO] Epoch:[0/2](410700/4588595) loss:2.934 lr:0.0000100 epoch_Time:26488.0min: [2024-01-04 13:03:39,798][model8_pretrain.py][INFO] Epoch:[0/2](410700/4588595) loss:2.827 lr:0.0000100 epoch_Time:26488.0min: [2024-01-04 13:03:39,798][model8_pretrain.py][INFO] Epoch:[0/2](410700/4588595) loss:2.717 lr:0.0000100 epoch_Time:26488.0min: [2024-01-04 13:03:39,798][model8_pretrain.py][INFO] Epoch:[0/2](410700/4588595) loss:2.640 lr:0.0000100 epoch_Time:26488.0min: [2024-01-04 13:03:39,798][model8_pretrain.py][INFO] Epoch:[0/2](410700/4588595) loss:2.613 lr:0.0000100 epoch_Time:26488.0min: [2024-01-04 13:03:39,798][model8_pretrain.py][INFO] Epoch:[0/2](410700/4588595) loss:2.852 lr:0.0000100 epoch_Time:26488.0min: [2024-01-04 13:03:39,798][model8_pretrain.py][INFO] Epoch:[0/2](410700/4588595) loss:3.058 lr:0.0000100 epoch_Time:26488.0min: [2024-01-04 13:03:39,798][model8_pretrain.py][INFO] Epoch:[0/2](410700/4588595) loss:2.707 lr:0.0000100 epoch_Time:26488.0min: [2024-01-04 13:04:16,738][model8_pretrain.py][INFO] Epoch:[0/2](410800/4588595) loss:2.701 lr:0.0000100 epoch_Time:26487.0min: [2024-01-04 13:04:16,738][model8_pretrain.py][INFO] Epoch:[0/2](410800/4588595) loss:2.747 lr:0.0000100 epoch_Time:26487.0min: [2024-01-04 13:04:16,739][model8_pretrain.py][INFO] Epoch:[0/2](410800/4588595) loss:2.847 lr:0.0000100 epoch_Time:26487.0min: [2024-01-04 13:04:16,739][model8_pretrain.py][INFO] Epoch:[0/2](410800/4588595) loss:2.815 lr:0.0000100 epoch_Time:26487.0min: [2024-01-04 13:04:16,739][model8_pretrain.py][INFO] Epoch:[0/2](410800/4588595) loss:3.051 lr:0.0000100 epoch_Time:26487.0min: [2024-01-04 13:04:16,739][model8_pretrain.py][INFO] Epoch:[0/2](410800/4588595) loss:2.781 lr:0.0000100 epoch_Time:26487.0min: [2024-01-04 13:04:16,739][model8_pretrain.py][INFO] 
Epoch:[0/2](410800/4588595) loss:3.131 lr:0.0000100 epoch_Time:26487.0min: [2024-01-04 13:04:16,739][model8_pretrain.py][INFO] Epoch:[0/2](410800/4588595) loss:2.512 lr:0.0000100 epoch_Time:26487.0min: [2024-01-04 13:04:53,677][model8_pretrain.py][INFO] Epoch:[0/2](410900/4588595) loss:2.082 lr:0.0000100 epoch_Time:26486.0min: [2024-01-04 13:04:53,677][model8_pretrain.py][INFO] Epoch:[0/2](410900/4588595) loss:3.182 lr:0.0000100 epoch_Time:26486.0min: [2024-01-04 13:04:53,677][model8_pretrain.py][INFO] Epoch:[0/2](410900/4588595) loss:3.052 lr:0.0000100 epoch_Time:26486.0min: [2024-01-04 13:04:53,677][model8_pretrain.py][INFO] Epoch:[0/2](410900/4588595) loss:2.733 lr:0.0000100 epoch_Time:26486.0min: [2024-01-04 13:04:53,677][model8_pretrain.py][INFO] Epoch:[0/2](410900/4588595) loss:2.687 lr:0.0000100 epoch_Time:26486.0min: [2024-01-04 13:04:53,677][model8_pretrain.py][INFO] Epoch:[0/2](410900/4588595) loss:2.777 lr:0.0000100 epoch_Time:26486.0min: [2024-01-04 13:04:53,677][model8_pretrain.py][INFO] Epoch:[0/2](410900/4588595) loss:2.885 lr:0.0000100 epoch_Time:26486.0min: [2024-01-04 13:04:53,677][model8_pretrain.py][INFO] Epoch:[0/2](410900/4588595) loss:3.162 lr:0.0000100 epoch_Time:26486.0min: [2024-01-04 13:05:30,625][model8_pretrain.py][INFO] Epoch:[0/2](411000/4588595) loss:2.593 lr:0.0000100 epoch_Time:26486.0min: [2024-01-04 13:05:30,625][model8_pretrain.py][INFO] Epoch:[0/2](411000/4588595) loss:2.285 lr:0.0000100 epoch_Time:26486.0min: [2024-01-04 13:05:30,625][model8_pretrain.py][INFO] Epoch:[0/2](411000/4588595) loss:3.098 lr:0.0000100 epoch_Time:26486.0min: [2024-01-04 13:05:30,625][model8_pretrain.py][INFO] Epoch:[0/2](411000/4588595) loss:2.882 lr:0.0000100 epoch_Time:26486.0min: [2024-01-04 13:05:30,625][model8_pretrain.py][INFO] Epoch:[0/2](411000/4588595) loss:2.762 lr:0.0000100 epoch_Time:26486.0min: [2024-01-04 13:05:30,625][model8_pretrain.py][INFO] Epoch:[0/2](411000/4588595) loss:2.762 lr:0.0000100 epoch_Time:26486.0min: [2024-01-04 
13:05:30,625][model8_pretrain.py][INFO] Epoch:[0/2](411000/4588595) loss:2.387 lr:0.0000100 epoch_Time:26486.0min: [2024-01-04 13:05:30,625][model8_pretrain.py][INFO] Epoch:[0/2](411000/4588595) loss:2.551 lr:0.0000100 epoch_Time:26486.0min: [2024-01-04 13:06:07,578][model8_pretrain.py][INFO] Epoch:[0/2](411100/4588595) loss:3.238 lr:0.0000100 epoch_Time:26485.0min: [2024-01-04 13:06:07,578][model8_pretrain.py][INFO] Epoch:[0/2](411100/4588595) loss:2.961 lr:0.0000100 epoch_Time:26485.0min: [2024-01-04 13:06:07,578][model8_pretrain.py][INFO] Epoch:[0/2](411100/4588595) loss:3.376 lr:0.0000100 epoch_Time:26485.0min: [2024-01-04 13:06:07,578][model8_pretrain.py][INFO] Epoch:[0/2](411100/4588595) loss:2.599 lr:0.0000100 epoch_Time:26485.0min: [2024-01-04 13:06:07,578][model8_pretrain.py][INFO] Epoch:[0/2](411100/4588595) loss:2.631 lr:0.0000100 epoch_Time:26485.0min: [2024-01-04 13:06:07,578][model8_pretrain.py][INFO] Epoch:[0/2](411100/4588595) loss:3.172 lr:0.0000100 epoch_Time:26485.0min: [2024-01-04 13:06:07,578][model8_pretrain.py][INFO] Epoch:[0/2](411100/4588595) loss:2.711 lr:0.0000100 epoch_Time:26485.0min: [2024-01-04 13:06:07,578][model8_pretrain.py][INFO] Epoch:[0/2](411100/4588595) loss:1.860 lr:0.0000100 epoch_Time:26485.0min: [2024-01-04 13:06:49,757][model8_pretrain.py][INFO] Epoch:[0/2](411200/4588595) loss:2.522 lr:0.0000100 epoch_Time:26484.0min: [2024-01-04 13:06:49,757][model8_pretrain.py][INFO] Epoch:[0/2](411200/4588595) loss:2.924 lr:0.0000100 epoch_Time:26484.0min: [2024-01-04 13:06:49,757][model8_pretrain.py][INFO] Epoch:[0/2](411200/4588595) loss:2.621 lr:0.0000100 epoch_Time:26484.0min: [2024-01-04 13:06:49,757][model8_pretrain.py][INFO] Epoch:[0/2](411200/4588595) loss:2.624 lr:0.0000100 epoch_Time:26484.0min: [2024-01-04 13:06:49,758][model8_pretrain.py][INFO] Epoch:[0/2](411200/4588595) loss:2.926 lr:0.0000100 epoch_Time:26484.0min: [2024-01-04 13:06:49,758][model8_pretrain.py][INFO] Epoch:[0/2](411200/4588595) loss:2.757 lr:0.0000100 
epoch_Time:26484.0min: [2024-01-04 13:06:49,758][model8_pretrain.py][INFO] Epoch:[0/2](411200/4588595) loss:2.820 lr:0.0000100 epoch_Time:26484.0min: [2024-01-04 13:06:49,760][model8_pretrain.py][INFO] Epoch:[0/2](411200/4588595) loss:2.937 lr:0.0000100 epoch_Time:26484.0min: [2024-01-04 13:07:31,866][model8_pretrain.py][INFO] Epoch:[0/2](411300/4588595) loss:2.523 lr:0.0000100 epoch_Time:26485.0min: [2024-01-04 13:07:31,866][model8_pretrain.py][INFO] Epoch:[0/2](411300/4588595) loss:2.785 lr:0.0000100 epoch_Time:26485.0min: [2024-01-04 13:07:31,866][model8_pretrain.py][INFO] Epoch:[0/2](411300/4588595) loss:3.058 lr:0.0000100 epoch_Time:26485.0min: [2024-01-04 13:07:31,866][model8_pretrain.py][INFO] Epoch:[0/2](411300/4588595) loss:2.883 lr:0.0000100 epoch_Time:26485.0min: [2024-01-04 13:07:31,866][model8_pretrain.py][INFO] Epoch:[0/2](411300/4588595) loss:3.116 lr:0.0000100 epoch_Time:26485.0min: [2024-01-04 13:07:31,866][model8_pretrain.py][INFO] Epoch:[0/2](411300/4588595) loss:2.242 lr:0.0000100 epoch_Time:26485.0min: [2024-01-04 13:07:31,866][model8_pretrain.py][INFO] Epoch:[0/2](411300/4588595) loss:2.891 lr:0.0000100 epoch_Time:26485.0min: [2024-01-04 13:07:31,866][model8_pretrain.py][INFO] Epoch:[0/2](411300/4588595) loss:3.087 lr:0.0000100 epoch_Time:26485.0min: [2024-01-04 13:08:08,804][model8_pretrain.py][INFO] Epoch:[0/2](411400/4588595) loss:2.399 lr:0.0000100 epoch_Time:26484.0min: [2024-01-04 13:08:08,804][model8_pretrain.py][INFO] Epoch:[0/2](411400/4588595) loss:2.717 lr:0.0000100 epoch_Time:26484.0min: [2024-01-04 13:08:08,804][model8_pretrain.py][INFO] Epoch:[0/2](411400/4588595) loss:2.988 lr:0.0000100 epoch_Time:26484.0min: [2024-01-04 13:08:08,804][model8_pretrain.py][INFO] Epoch:[0/2](411400/4588595) loss:3.329 lr:0.0000100 epoch_Time:26484.0min: [2024-01-04 13:08:08,805][model8_pretrain.py][INFO] Epoch:[0/2](411400/4588595) loss:2.550 lr:0.0000100 epoch_Time:26484.0min: [2024-01-04 13:08:08,805][model8_pretrain.py][INFO] 
Epoch:[0/2](411400/4588595) loss:3.088 lr:0.0000100 epoch_Time:26484.0min: [2024-01-04 13:08:08,805][model8_pretrain.py][INFO] Epoch:[0/2](411400/4588595) loss:3.294 lr:0.0000100 epoch_Time:26484.0min: [2024-01-04 13:08:08,805][model8_pretrain.py][INFO] Epoch:[0/2](411400/4588595) loss:2.706 lr:0.0000100 epoch_Time:26484.0min: [2024-01-04 13:08:45,739][model8_pretrain.py][INFO] Epoch:[0/2](411500/4588595) loss:2.134 lr:0.0000100 epoch_Time:26484.0min: [2024-01-04 13:08:45,739][model8_pretrain.py][INFO] Epoch:[0/2](411500/4588595) loss:2.393 lr:0.0000100 epoch_Time:26484.0min: [2024-01-04 13:08:45,739][model8_pretrain.py][INFO] Epoch:[0/2](411500/4588595) loss:3.260 lr:0.0000100 epoch_Time:26484.0min: [2024-01-04 13:08:45,739][model8_pretrain.py][INFO] Epoch:[0/2](411500/4588595) loss:2.958 lr:0.0000100 epoch_Time:26484.0min: [2024-01-04 13:08:45,739][model8_pretrain.py][INFO] Epoch:[0/2](411500/4588595) loss:2.376 lr:0.0000100 epoch_Time:26484.0min: [2024-01-04 13:08:45,739][model8_pretrain.py][INFO] Epoch:[0/2](411500/4588595) loss:3.017 lr:0.0000100 epoch_Time:26484.0min: [2024-01-04 13:08:45,739][model8_pretrain.py][INFO] Epoch:[0/2](411500/4588595) loss:2.754 lr:0.0000100 epoch_Time:26484.0min: [2024-01-04 13:08:45,740][model8_pretrain.py][INFO] Epoch:[0/2](411500/4588595) loss:2.921 lr:0.0000100 epoch_Time:26484.0min: [2024-01-04 13:09:22,680][model8_pretrain.py][INFO] Epoch:[0/2](411600/4588595) loss:2.608 lr:0.0000100 epoch_Time:26483.0min: [2024-01-04 13:09:22,680][model8_pretrain.py][INFO] Epoch:[0/2](411600/4588595) loss:2.746 lr:0.0000100 epoch_Time:26483.0min: [2024-01-04 13:09:22,680][model8_pretrain.py][INFO] Epoch:[0/2](411600/4588595) loss:2.805 lr:0.0000100 epoch_Time:26483.0min: [2024-01-04 13:09:22,680][model8_pretrain.py][INFO] Epoch:[0/2](411600/4588595) loss:3.052 lr:0.0000100 epoch_Time:26483.0min: [2024-01-04 13:09:22,680][model8_pretrain.py][INFO] Epoch:[0/2](411600/4588595) loss:2.684 lr:0.0000100 epoch_Time:26483.0min: [2024-01-04 
13:09:22,680][model8_pretrain.py][INFO] Epoch:[0/2](411600/4588595) loss:2.891 lr:0.0000100 epoch_Time:26483.0min: [2024-01-04 13:09:22,680][model8_pretrain.py][INFO] Epoch:[0/2](411600/4588595) loss:2.927 lr:0.0000100 epoch_Time:26483.0min: [2024-01-04 13:09:22,680][model8_pretrain.py][INFO] Epoch:[0/2](411600/4588595) loss:3.035 lr:0.0000100 epoch_Time:26483.0min: [2024-01-04 13:09:59,618][model8_pretrain.py][INFO] Epoch:[0/2](411700/4588595) loss:3.243 lr:0.0000100 epoch_Time:26481.0min: [2024-01-04 13:09:59,618][model8_pretrain.py][INFO] Epoch:[0/2](411700/4588595) loss:3.230 lr:0.0000100 epoch_Time:26481.0min: [2024-01-04 13:09:59,618][model8_pretrain.py][INFO] Epoch:[0/2](411700/4588595) loss:2.788 lr:0.0000100 epoch_Time:26481.0min: [2024-01-04 13:09:59,618][model8_pretrain.py][INFO] Epoch:[0/2](411700/4588595) loss:2.939 lr:0.0000100 epoch_Time:26481.0min: [2024-01-04 13:09:59,618][model8_pretrain.py][INFO] Epoch:[0/2](411700/4588595) loss:2.990 lr:0.0000100 epoch_Time:26481.0min: [2024-01-04 13:09:59,618][model8_pretrain.py][INFO] Epoch:[0/2](411700/4588595) loss:3.094 lr:0.0000100 epoch_Time:26481.0min: [2024-01-04 13:09:59,618][model8_pretrain.py][INFO] Epoch:[0/2](411700/4588595) loss:2.888 lr:0.0000100 epoch_Time:26481.0min: [2024-01-04 13:09:59,618][model8_pretrain.py][INFO] Epoch:[0/2](411700/4588595) loss:3.425 lr:0.0000100 epoch_Time:26481.0min: [2024-01-04 13:10:36,550][model8_pretrain.py][INFO] Epoch:[0/2](411800/4588595) loss:2.584 lr:0.0000100 epoch_Time:26481.0min: [2024-01-04 13:10:36,550][model8_pretrain.py][INFO] Epoch:[0/2](411800/4588595) loss:3.065 lr:0.0000100 epoch_Time:26481.0min: [2024-01-04 13:10:36,550][model8_pretrain.py][INFO] Epoch:[0/2](411800/4588595) loss:2.919 lr:0.0000100 epoch_Time:26481.0min: [2024-01-04 13:10:36,550][model8_pretrain.py][INFO] Epoch:[0/2](411800/4588595) loss:2.656 lr:0.0000100 epoch_Time:26481.0min: [2024-01-04 13:10:36,550][model8_pretrain.py][INFO] Epoch:[0/2](411800/4588595) loss:2.840 lr:0.0000100 
epoch_Time:26481.0min: [2024-01-04 13:10:36,550][model8_pretrain.py][INFO] Epoch:[0/2](411800/4588595) loss:2.751 lr:0.0000100 epoch_Time:26481.0min: [2024-01-04 13:10:36,550][model8_pretrain.py][INFO] Epoch:[0/2](411800/4588595) loss:2.474 lr:0.0000100 epoch_Time:26481.0min: [2024-01-04 13:10:36,550][model8_pretrain.py][INFO] Epoch:[0/2](411800/4588595) loss:2.931 lr:0.0000100 epoch_Time:26481.0min: [2024-01-04 13:11:13,476][model8_pretrain.py][INFO] Epoch:[0/2](411900/4588595) loss:2.410 lr:0.0000100 epoch_Time:26480.0min: [2024-01-04 13:11:13,476][model8_pretrain.py][INFO] Epoch:[0/2](411900/4588595) loss:2.960 lr:0.0000100 epoch_Time:26480.0min: [2024-01-04 13:11:13,476][model8_pretrain.py][INFO] Epoch:[0/2](411900/4588595) loss:2.855 lr:0.0000100 epoch_Time:26480.0min: [2024-01-04 13:11:13,476][model8_pretrain.py][INFO] Epoch:[0/2](411900/4588595) loss:2.447 lr:0.0000100 epoch_Time:26480.0min: [2024-01-04 13:11:13,476][model8_pretrain.py][INFO] Epoch:[0/2](411900/4588595) loss:2.276 lr:0.0000100 epoch_Time:26480.0min: [2024-01-04 13:11:13,476][model8_pretrain.py][INFO] Epoch:[0/2](411900/4588595) loss:3.012 lr:0.0000100 epoch_Time:26480.0min: [2024-01-04 13:11:13,476][model8_pretrain.py][INFO] Epoch:[0/2](411900/4588595) loss:2.669 lr:0.0000100 epoch_Time:26480.0min: [2024-01-04 13:11:13,476][model8_pretrain.py][INFO] Epoch:[0/2](411900/4588595) loss:3.043 lr:0.0000100 epoch_Time:26480.0min: [2024-01-04 13:11:55,563][model8_pretrain.py][INFO] Epoch:[0/2](412000/4588595) loss:2.815 lr:0.0000100 epoch_Time:26480.0min: [2024-01-04 13:11:55,563][model8_pretrain.py][INFO] Epoch:[0/2](412000/4588595) loss:2.577 lr:0.0000100 epoch_Time:26480.0min: [2024-01-04 13:11:55,563][model8_pretrain.py][INFO] Epoch:[0/2](412000/4588595) loss:2.577 lr:0.0000100 epoch_Time:26480.0min: [2024-01-04 13:11:55,563][model8_pretrain.py][INFO] Epoch:[0/2](412000/4588595) loss:2.755 lr:0.0000100 epoch_Time:26480.0min: [2024-01-04 13:11:55,563][model8_pretrain.py][INFO] 
Epoch:[0/2](412000/4588595) loss:2.236 lr:0.0000100 epoch_Time:26480.0min: [2024-01-04 13:11:55,563][model8_pretrain.py][INFO] Epoch:[0/2](412000/4588595) loss:2.790 lr:0.0000100 epoch_Time:26480.0min: [2024-01-04 13:11:55,563][model8_pretrain.py][INFO] Epoch:[0/2](412000/4588595) loss:2.692 lr:0.0000100 epoch_Time:26480.0min: [2024-01-04 13:11:55,563][model8_pretrain.py][INFO] Epoch:[0/2](412000/4588595) loss:3.188 lr:0.0000100 epoch_Time:26480.0min: [2024-01-04 13:12:37,649][model8_pretrain.py][INFO] Epoch:[0/2](412100/4588595) loss:3.119 lr:0.0000100 epoch_Time:26480.0min: [2024-01-04 13:12:37,649][model8_pretrain.py][INFO] Epoch:[0/2](412100/4588595) loss:3.119 lr:0.0000100 epoch_Time:26480.0min: [2024-01-04 13:12:37,649][model8_pretrain.py][INFO] Epoch:[0/2](412100/4588595) loss:3.041 lr:0.0000100 epoch_Time:26480.0min: [2024-01-04 13:12:37,649][model8_pretrain.py][INFO] Epoch:[0/2](412100/4588595) loss:2.555 lr:0.0000100 epoch_Time:26480.0min: [2024-01-04 13:12:37,649][model8_pretrain.py][INFO] Epoch:[0/2](412100/4588595) loss:3.051 lr:0.0000100 epoch_Time:26480.0min: [2024-01-04 13:12:37,649][model8_pretrain.py][INFO] Epoch:[0/2](412100/4588595) loss:2.987 lr:0.0000100 epoch_Time:26480.0min: [2024-01-04 13:12:37,649][model8_pretrain.py][INFO] Epoch:[0/2](412100/4588595) loss:3.351 lr:0.0000100 epoch_Time:26480.0min: [2024-01-04 13:12:37,649][model8_pretrain.py][INFO] Epoch:[0/2](412100/4588595) loss:2.908 lr:0.0000100 epoch_Time:26480.0min: [2024-01-04 13:13:14,574][model8_pretrain.py][INFO] Epoch:[0/2](412200/4588595) loss:2.342 lr:0.0000100 epoch_Time:26479.0min: [2024-01-04 13:13:14,574][model8_pretrain.py][INFO] Epoch:[0/2](412200/4588595) loss:3.071 lr:0.0000100 epoch_Time:26479.0min: [2024-01-04 13:13:14,574][model8_pretrain.py][INFO] Epoch:[0/2](412200/4588595) loss:2.784 lr:0.0000100 epoch_Time:26479.0min: [2024-01-04 13:13:14,574][model8_pretrain.py][INFO] Epoch:[0/2](412200/4588595) loss:3.032 lr:0.0000100 epoch_Time:26479.0min: [2024-01-04 
13:13:14,574][model8_pretrain.py][INFO] Epoch:[0/2](412200/4588595) loss:3.026 lr:0.0000100 epoch_Time:26479.0min: [2024-01-04 13:13:14,574][model8_pretrain.py][INFO] Epoch:[0/2](412200/4588595) loss:3.569 lr:0.0000100 epoch_Time:26479.0min: [2024-01-04 13:13:14,574][model8_pretrain.py][INFO] Epoch:[0/2](412200/4588595) loss:3.066 lr:0.0000100 epoch_Time:26479.0min: [2024-01-04 13:13:14,574][model8_pretrain.py][INFO] Epoch:[0/2](412200/4588595) loss:3.049 lr:0.0000100 epoch_Time:26479.0min: [2024-01-04 13:13:51,507][model8_pretrain.py][INFO] Epoch:[0/2](412300/4588595) loss:3.129 lr:0.0000100 epoch_Time:26478.0min: [2024-01-04 13:13:51,507][model8_pretrain.py][INFO] Epoch:[0/2](412300/4588595) loss:2.902 lr:0.0000100 epoch_Time:26478.0min: [2024-01-04 13:13:51,507][model8_pretrain.py][INFO] Epoch:[0/2](412300/4588595) loss:3.245 lr:0.0000100 epoch_Time:26478.0min: [2024-01-04 13:13:51,507][model8_pretrain.py][INFO] Epoch:[0/2](412300/4588595) loss:2.930 lr:0.0000100 epoch_Time:26478.0min: [2024-01-04 13:13:51,507][model8_pretrain.py][INFO] Epoch:[0/2](412300/4588595) loss:2.526 lr:0.0000100 epoch_Time:26478.0min: [2024-01-04 13:13:51,507][model8_pretrain.py][INFO] Epoch:[0/2](412300/4588595) loss:2.938 lr:0.0000100 epoch_Time:26478.0min: [2024-01-04 13:13:51,507][model8_pretrain.py][INFO] Epoch:[0/2](412300/4588595) loss:2.806 lr:0.0000100 epoch_Time:26478.0min: [2024-01-04 13:13:51,507][model8_pretrain.py][INFO] Epoch:[0/2](412300/4588595) loss:2.313 lr:0.0000100 epoch_Time:26478.0min: [2024-01-04 13:14:28,446][model8_pretrain.py][INFO] Epoch:[0/2](412400/4588595) loss:3.056 lr:0.0000100 epoch_Time:26478.0min: [2024-01-04 13:14:28,446][model8_pretrain.py][INFO] Epoch:[0/2](412400/4588595) loss:2.833 lr:0.0000100 epoch_Time:26478.0min: [2024-01-04 13:14:28,446][model8_pretrain.py][INFO] Epoch:[0/2](412400/4588595) loss:2.880 lr:0.0000100 epoch_Time:26478.0min: [2024-01-04 13:14:28,446][model8_pretrain.py][INFO] Epoch:[0/2](412400/4588595) loss:2.561 lr:0.0000100 
epoch_Time:26478.0min: [2024-01-04 13:14:28,446][model8_pretrain.py][INFO] Epoch:[0/2](412400/4588595) loss:2.534 lr:0.0000100 epoch_Time:26478.0min: [2024-01-04 13:14:28,446][model8_pretrain.py][INFO] Epoch:[0/2](412400/4588595) loss:2.886 lr:0.0000100 epoch_Time:26478.0min: [2024-01-04 13:14:28,446][model8_pretrain.py][INFO] Epoch:[0/2](412400/4588595) loss:3.015 lr:0.0000100 epoch_Time:26478.0min: [2024-01-04 13:14:28,446][model8_pretrain.py][INFO] Epoch:[0/2](412400/4588595) loss:3.154 lr:0.0000100 epoch_Time:26478.0min: [2024-01-04 13:15:05,371][model8_pretrain.py][INFO] Epoch:[0/2](412500/4588595) loss:2.539 lr:0.0000100 epoch_Time:26477.0min: [2024-01-04 13:15:05,371][model8_pretrain.py][INFO] Epoch:[0/2](412500/4588595) loss:2.964 lr:0.0000100 epoch_Time:26477.0min: [2024-01-04 13:15:05,371][model8_pretrain.py][INFO] Epoch:[0/2](412500/4588595) loss:2.520 lr:0.0000100 epoch_Time:26477.0min: [2024-01-04 13:15:05,371][model8_pretrain.py][INFO] Epoch:[0/2](412500/4588595) loss:2.421 lr:0.0000100 epoch_Time:26477.0min: [2024-01-04 13:15:05,371][model8_pretrain.py][INFO] Epoch:[0/2](412500/4588595) loss:3.055 lr:0.0000100 epoch_Time:26477.0min: [2024-01-04 13:15:05,371][model8_pretrain.py][INFO] Epoch:[0/2](412500/4588595) loss:2.741 lr:0.0000100 epoch_Time:26477.0min: [2024-01-04 13:15:05,371][model8_pretrain.py][INFO] Epoch:[0/2](412500/4588595) loss:3.506 lr:0.0000100 epoch_Time:26477.0min: [2024-01-04 13:15:05,372][model8_pretrain.py][INFO] Epoch:[0/2](412500/4588595) loss:2.302 lr:0.0000100 epoch_Time:26477.0min: [2024-01-04 13:15:42,296][model8_pretrain.py][INFO] Epoch:[0/2](412600/4588595) loss:2.668 lr:0.0000100 epoch_Time:26476.0min: [2024-01-04 13:15:42,296][model8_pretrain.py][INFO] Epoch:[0/2](412600/4588595) loss:3.399 lr:0.0000100 epoch_Time:26476.0min: [2024-01-04 13:15:42,296][model8_pretrain.py][INFO] Epoch:[0/2](412600/4588595) loss:2.621 lr:0.0000100 epoch_Time:26476.0min: [2024-01-04 13:15:42,296][model8_pretrain.py][INFO] 
Epoch:[0/2](412600/4588595) loss:3.071 lr:0.0000100 epoch_Time:26476.0min: [2024-01-04 13:15:42,296][model8_pretrain.py][INFO] Epoch:[0/2](412600/4588595) loss:3.118 lr:0.0000100 epoch_Time:26476.0min: [2024-01-04 13:15:42,296][model8_pretrain.py][INFO] Epoch:[0/2](412600/4588595) loss:3.109 lr:0.0000100 epoch_Time:26476.0min: [2024-01-04 13:15:42,296][model8_pretrain.py][INFO] Epoch:[0/2](412600/4588595) loss:2.873 lr:0.0000100 epoch_Time:26476.0min: [2024-01-04 13:15:42,296][model8_pretrain.py][INFO] Epoch:[0/2](412600/4588595) loss:2.259 lr:0.0000100 epoch_Time:26476.0min: [2024-01-04 13:16:19,218][model8_pretrain.py][INFO] Epoch:[0/2](412700/4588595) loss:3.240 lr:0.0000100 epoch_Time:26475.0min: [2024-01-04 13:16:19,218][model8_pretrain.py][INFO] Epoch:[0/2](412700/4588595) loss:3.096 lr:0.0000100 epoch_Time:26475.0min: [2024-01-04 13:16:19,218][model8_pretrain.py][INFO] Epoch:[0/2](412700/4588595) loss:3.032 lr:0.0000100 epoch_Time:26475.0min: [2024-01-04 13:16:19,218][model8_pretrain.py][INFO] Epoch:[0/2](412700/4588595) loss:3.111 lr:0.0000100 epoch_Time:26475.0min: [2024-01-04 13:16:19,218][model8_pretrain.py][INFO] Epoch:[0/2](412700/4588595) loss:3.084 lr:0.0000100 epoch_Time:26475.0min: [2024-01-04 13:16:19,218][model8_pretrain.py][INFO] Epoch:[0/2](412700/4588595) loss:2.757 lr:0.0000100 epoch_Time:26475.0min: [2024-01-04 13:16:19,219][model8_pretrain.py][INFO] Epoch:[0/2](412700/4588595) loss:3.068 lr:0.0000100 epoch_Time:26475.0min: [2024-01-04 13:16:19,219][model8_pretrain.py][INFO] Epoch:[0/2](412700/4588595) loss:3.196 lr:0.0000100 epoch_Time:26475.0min: [2024-01-04 13:16:57,879][model8_pretrain.py][INFO] Epoch:[0/2](412800/4588595) loss:2.936 lr:0.0000100 epoch_Time:26474.0min: [2024-01-04 13:16:57,879][model8_pretrain.py][INFO] Epoch:[0/2](412800/4588595) loss:2.813 lr:0.0000100 epoch_Time:26474.0min: [2024-01-04 13:16:57,879][model8_pretrain.py][INFO] Epoch:[0/2](412800/4588595) loss:2.599 lr:0.0000100 epoch_Time:26474.0min: [2024-01-04 
13:16:57,879][model8_pretrain.py][INFO] Epoch:[0/2](412800/4588595) loss:2.971 lr:0.0000100 epoch_Time:26474.0min: [2024-01-04 13:16:57,879][model8_pretrain.py][INFO] Epoch:[0/2](412800/4588595) loss:2.844 lr:0.0000100 epoch_Time:26474.0min: [2024-01-04 13:16:57,879][model8_pretrain.py][INFO] Epoch:[0/2](412800/4588595) loss:3.142 lr:0.0000100 epoch_Time:26474.0min: [2024-01-04 13:16:57,879][model8_pretrain.py][INFO] Epoch:[0/2](412800/4588595) loss:2.776 lr:0.0000100 epoch_Time:26474.0min: [2024-01-04 13:16:57,879][model8_pretrain.py][INFO] Epoch:[0/2](412800/4588595) loss:2.728 lr:0.0000100 epoch_Time:26474.0min: [2024-01-04 13:17:43,392][model8_pretrain.py][INFO] Epoch:[0/2](412900/4588595) loss:2.695 lr:0.0000100 epoch_Time:26476.0min: [2024-01-04 13:17:43,392][model8_pretrain.py][INFO] Epoch:[0/2](412900/4588595) loss:3.242 lr:0.0000100 epoch_Time:26476.0min: [2024-01-04 13:17:43,392][model8_pretrain.py][INFO] Epoch:[0/2](412900/4588595) loss:2.860 lr:0.0000100 epoch_Time:26476.0min: [2024-01-04 13:17:43,392][model8_pretrain.py][INFO] Epoch:[0/2](412900/4588595) loss:2.808 lr:0.0000100 epoch_Time:26476.0min: [2024-01-04 13:17:43,392][model8_pretrain.py][INFO] Epoch:[0/2](412900/4588595) loss:3.151 lr:0.0000100 epoch_Time:26476.0min: [2024-01-04 13:17:43,392][model8_pretrain.py][INFO] Epoch:[0/2](412900/4588595) loss:2.926 lr:0.0000100 epoch_Time:26476.0min: [2024-01-04 13:17:43,392][model8_pretrain.py][INFO] Epoch:[0/2](412900/4588595) loss:2.984 lr:0.0000100 epoch_Time:26476.0min: [2024-01-04 13:17:43,392][model8_pretrain.py][INFO] Epoch:[0/2](412900/4588595) loss:2.941 lr:0.0000100 epoch_Time:26476.0min: [2024-01-04 13:18:20,341][model8_pretrain.py][INFO] Epoch:[0/2](413000/4588595) loss:2.365 lr:0.0000100 epoch_Time:26475.0min: [2024-01-04 13:18:20,341][model8_pretrain.py][INFO] Epoch:[0/2](413000/4588595) loss:2.636 lr:0.0000100 epoch_Time:26475.0min: [2024-01-04 13:18:20,341][model8_pretrain.py][INFO] Epoch:[0/2](413000/4588595) loss:2.936 lr:0.0000100 
epoch_Time:26475.0min: [2024-01-04 13:18:20,341][model8_pretrain.py][INFO] Epoch:[0/2](413000/4588595) loss:3.347 lr:0.0000100 epoch_Time:26475.0min: [2024-01-04 13:18:20,341][model8_pretrain.py][INFO] Epoch:[0/2](413000/4588595) loss:2.269 lr:0.0000100 epoch_Time:26475.0min: [2024-01-04 13:18:20,341][model8_pretrain.py][INFO] Epoch:[0/2](413000/4588595) loss:2.675 lr:0.0000100 epoch_Time:26475.0min: [2024-01-04 13:18:20,341][model8_pretrain.py][INFO] Epoch:[0/2](413000/4588595) loss:2.875 lr:0.0000100 epoch_Time:26475.0min: [2024-01-04 13:18:20,341][model8_pretrain.py][INFO] Epoch:[0/2](413000/4588595) loss:2.830 lr:0.0000100 epoch_Time:26475.0min: [2024-01-04 13:18:57,281][model8_pretrain.py][INFO] Epoch:[0/2](413100/4588595) loss:3.396 lr:0.0000100 epoch_Time:26473.0min: [2024-01-04 13:18:57,281][model8_pretrain.py][INFO] Epoch:[0/2](413100/4588595) loss:3.021 lr:0.0000100 epoch_Time:26473.0min: [2024-01-04 13:18:57,281][model8_pretrain.py][INFO] Epoch:[0/2](413100/4588595) loss:2.441 lr:0.0000100 epoch_Time:26473.0min: [2024-01-04 13:18:57,281][model8_pretrain.py][INFO] Epoch:[0/2](413100/4588595) loss:2.977 lr:0.0000100 epoch_Time:26473.0min: [2024-01-04 13:18:57,281][model8_pretrain.py][INFO] Epoch:[0/2](413100/4588595) loss:3.165 lr:0.0000100 epoch_Time:26473.0min: [2024-01-04 13:18:57,281][model8_pretrain.py][INFO] Epoch:[0/2](413100/4588595) loss:2.678 lr:0.0000100 epoch_Time:26473.0min: [2024-01-04 13:18:57,281][model8_pretrain.py][INFO] Epoch:[0/2](413100/4588595) loss:2.795 lr:0.0000100 epoch_Time:26473.0min: [2024-01-04 13:18:57,282][model8_pretrain.py][INFO] Epoch:[0/2](413100/4588595) loss:2.572 lr:0.0000100 epoch_Time:26473.0min: [2024-01-04 13:19:34,228][model8_pretrain.py][INFO] Epoch:[0/2](413200/4588595) loss:2.565 lr:0.0000100 epoch_Time:26473.0min: [2024-01-04 13:19:34,228][model8_pretrain.py][INFO] Epoch:[0/2](413200/4588595) loss:2.779 lr:0.0000100 epoch_Time:26473.0min: [2024-01-04 13:19:34,228][model8_pretrain.py][INFO] 
Epoch:[0/2](413200/4588595) loss:2.615 lr:0.0000100 epoch_Time:26473.0min: [2024-01-04 13:19:34,228][model8_pretrain.py][INFO] Epoch:[0/2](413200/4588595) loss:3.368 lr:0.0000100 epoch_Time:26473.0min: [2024-01-04 13:19:34,228][model8_pretrain.py][INFO] Epoch:[0/2](413200/4588595) loss:2.784 lr:0.0000100 epoch_Time:26473.0min: [2024-01-04 13:19:34,228][model8_pretrain.py][INFO] Epoch:[0/2](413200/4588595) loss:2.965 lr:0.0000100 epoch_Time:26473.0min: [2024-01-04 13:19:34,228][model8_pretrain.py][INFO] Epoch:[0/2](413200/4588595) loss:2.540 lr:0.0000100 epoch_Time:26473.0min: [2024-01-04 13:19:34,228][model8_pretrain.py][INFO] Epoch:[0/2](413200/4588595) loss:2.926 lr:0.0000100 epoch_Time:26473.0min: [2024-01-04 13:20:11,171][model8_pretrain.py][INFO] Epoch:[0/2](413300/4588595) loss:2.776 lr:0.0000100 epoch_Time:26472.0min: [2024-01-04 13:20:11,171][model8_pretrain.py][INFO] Epoch:[0/2](413300/4588595) loss:3.360 lr:0.0000100 epoch_Time:26472.0min: [2024-01-04 13:20:11,171][model8_pretrain.py][INFO] Epoch:[0/2](413300/4588595) loss:2.997 lr:0.0000100 epoch_Time:26472.0min: [2024-01-04 13:20:11,171][model8_pretrain.py][INFO] Epoch:[0/2](413300/4588595) loss:2.539 lr:0.0000100 epoch_Time:26472.0min: [2024-01-04 13:20:11,171][model8_pretrain.py][INFO] Epoch:[0/2](413300/4588595) loss:2.618 lr:0.0000100 epoch_Time:26472.0min: [2024-01-04 13:20:11,171][model8_pretrain.py][INFO] Epoch:[0/2](413300/4588595) loss:3.083 lr:0.0000100 epoch_Time:26472.0min: [2024-01-04 13:20:11,171][model8_pretrain.py][INFO] Epoch:[0/2](413300/4588595) loss:3.540 lr:0.0000100 epoch_Time:26472.0min: [2024-01-04 13:20:11,172][model8_pretrain.py][INFO] Epoch:[0/2](413300/4588595) loss:2.981 lr:0.0000100 epoch_Time:26472.0min: [2024-01-04 13:20:48,109][model8_pretrain.py][INFO] Epoch:[0/2](413400/4588595) loss:2.691 lr:0.0000100 epoch_Time:26471.0min: [2024-01-04 13:20:48,110][model8_pretrain.py][INFO] Epoch:[0/2](413400/4588595) loss:2.745 lr:0.0000100 epoch_Time:26471.0min: [2024-01-04 
13:20:48,110][model8_pretrain.py][INFO] Epoch:[0/2](413400/4588595) loss:3.237 lr:0.0000100 epoch_Time:26471.0min: [2024-01-04 13:20:48,110][model8_pretrain.py][INFO] Epoch:[0/2](413400/4588595) loss:3.125 lr:0.0000100 epoch_Time:26471.0min: [2024-01-04 13:20:48,110][model8_pretrain.py][INFO] Epoch:[0/2](413400/4588595) loss:2.876 lr:0.0000100 epoch_Time:26471.0min: [2024-01-04 13:20:48,110][model8_pretrain.py][INFO] Epoch:[0/2](413400/4588595) loss:2.921 lr:0.0000100 epoch_Time:26471.0min: [2024-01-04 13:20:48,110][model8_pretrain.py][INFO] Epoch:[0/2](413400/4588595) loss:2.159 lr:0.0000100 epoch_Time:26471.0min: [2024-01-04 13:20:48,110][model8_pretrain.py][INFO] Epoch:[0/2](413400/4588595) loss:2.770 lr:0.0000100 epoch_Time:26471.0min: [2024-01-04 13:21:25,050][model8_pretrain.py][INFO] Epoch:[0/2](413500/4588595) loss:3.173 lr:0.0000100 epoch_Time:26470.0min: [2024-01-04 13:21:25,050][model8_pretrain.py][INFO] Epoch:[0/2](413500/4588595) loss:3.495 lr:0.0000100 epoch_Time:26471.0min: [2024-01-04 13:21:25,050][model8_pretrain.py][INFO] Epoch:[0/2](413500/4588595) loss:3.099 lr:0.0000100 epoch_Time:26471.0min: [2024-01-04 13:21:25,050][model8_pretrain.py][INFO] Epoch:[0/2](413500/4588595) loss:2.522 lr:0.0000100 epoch_Time:26471.0min: [2024-01-04 13:21:25,050][model8_pretrain.py][INFO] Epoch:[0/2](413500/4588595) loss:2.965 lr:0.0000100 epoch_Time:26470.0min: [2024-01-04 13:21:25,050][model8_pretrain.py][INFO] Epoch:[0/2](413500/4588595) loss:2.991 lr:0.0000100 epoch_Time:26471.0min: [2024-01-04 13:21:25,050][model8_pretrain.py][INFO] Epoch:[0/2](413500/4588595) loss:3.019 lr:0.0000100 epoch_Time:26471.0min: [2024-01-04 13:21:25,050][model8_pretrain.py][INFO] Epoch:[0/2](413500/4588595) loss:3.155 lr:0.0000100 epoch_Time:26471.0min: [2024-01-04 13:22:03,744][model8_pretrain.py][INFO] Epoch:[0/2](413600/4588595) loss:2.417 lr:0.0000100 epoch_Time:26470.0min: [2024-01-04 13:22:03,744][model8_pretrain.py][INFO] Epoch:[0/2](413600/4588595) loss:2.996 lr:0.0000100 
epoch_Time:26470.0min: [2024-01-04 13:22:03,744][model8_pretrain.py][INFO] Epoch:[0/2](413600/4588595) loss:3.101 lr:0.0000100 epoch_Time:26470.0min: [2024-01-04 13:22:03,744][model8_pretrain.py][INFO] Epoch:[0/2](413600/4588595) loss:3.220 lr:0.0000100 epoch_Time:26470.0min: [2024-01-04 13:22:03,744][model8_pretrain.py][INFO] Epoch:[0/2](413600/4588595) loss:2.779 lr:0.0000100 epoch_Time:26470.0min: [2024-01-04 13:22:03,744][model8_pretrain.py][INFO] Epoch:[0/2](413600/4588595) loss:2.582 lr:0.0000100 epoch_Time:26470.0min: [2024-01-04 13:22:03,744][model8_pretrain.py][INFO] Epoch:[0/2](413600/4588595) loss:2.834 lr:0.0000100 epoch_Time:26470.0min: [2024-01-04 13:22:03,744][model8_pretrain.py][INFO] Epoch:[0/2](413600/4588595) loss:2.544 lr:0.0000100 epoch_Time:26470.0min: [2024-01-04 13:22:49,305][model8_pretrain.py][INFO] Epoch:[0/2](413700/4588595) loss:2.195 lr:0.0000100 epoch_Time:26470.0min: [2024-01-04 13:22:49,305][model8_pretrain.py][INFO] Epoch:[0/2](413700/4588595) loss:3.061 lr:0.0000100 epoch_Time:26470.0min: [2024-01-04 13:22:49,305][model8_pretrain.py][INFO] Epoch:[0/2](413700/4588595) loss:3.205 lr:0.0000100 epoch_Time:26470.0min: [2024-01-04 13:22:49,305][model8_pretrain.py][INFO] Epoch:[0/2](413700/4588595) loss:2.881 lr:0.0000100 epoch_Time:26470.0min: [2024-01-04 13:22:49,305][model8_pretrain.py][INFO] Epoch:[0/2](413700/4588595) loss:3.311 lr:0.0000100 epoch_Time:26470.0min: [2024-01-04 13:22:49,305][model8_pretrain.py][INFO] Epoch:[0/2](413700/4588595) loss:2.904 lr:0.0000100 epoch_Time:26470.0min: [2024-01-04 13:22:49,305][model8_pretrain.py][INFO] Epoch:[0/2](413700/4588595) loss:3.031 lr:0.0000100 epoch_Time:26470.0min: [2024-01-04 13:22:49,305][model8_pretrain.py][INFO] Epoch:[0/2](413700/4588595) loss:3.076 lr:0.0000100 epoch_Time:26470.0min: [2024-01-04 13:23:26,242][model8_pretrain.py][INFO] Epoch:[0/2](413800/4588595) loss:2.957 lr:0.0000100 epoch_Time:26470.0min: [2024-01-04 13:23:26,242][model8_pretrain.py][INFO] 
Epoch:[0/2](413800/4588595) loss:2.599 lr:0.0000100 epoch_Time:26470.0min: [2024-01-04 13:23:26,242][model8_pretrain.py][INFO] Epoch:[0/2](413800/4588595) loss:3.299 lr:0.0000100 epoch_Time:26470.0min: [2024-01-04 13:23:26,242][model8_pretrain.py][INFO] Epoch:[0/2](413800/4588595) loss:3.175 lr:0.0000100 epoch_Time:26470.0min: [2024-01-04 13:23:26,242][model8_pretrain.py][INFO] Epoch:[0/2](413800/4588595) loss:2.859 lr:0.0000100 epoch_Time:26470.0min: [2024-01-04 13:23:26,242][model8_pretrain.py][INFO] Epoch:[0/2](413800/4588595) loss:3.463 lr:0.0000100 epoch_Time:26470.0min: [2024-01-04 13:23:26,242][model8_pretrain.py][INFO] Epoch:[0/2](413800/4588595) loss:3.275 lr:0.0000100 epoch_Time:26470.0min: [2024-01-04 13:23:26,242][model8_pretrain.py][INFO] Epoch:[0/2](413800/4588595) loss:2.811 lr:0.0000100 epoch_Time:26470.0min: [2024-01-04 13:24:03,187][model8_pretrain.py][INFO] Epoch:[0/2](413900/4588595) loss:2.751 lr:0.0000100 epoch_Time:26469.0min: [2024-01-04 13:24:03,187][model8_pretrain.py][INFO] Epoch:[0/2](413900/4588595) loss:3.070 lr:0.0000100 epoch_Time:26469.0min: [2024-01-04 13:24:03,187][model8_pretrain.py][INFO] Epoch:[0/2](413900/4588595) loss:2.872 lr:0.0000100 epoch_Time:26469.0min: [2024-01-04 13:24:03,187][model8_pretrain.py][INFO] Epoch:[0/2](413900/4588595) loss:2.357 lr:0.0000100 epoch_Time:26469.0min: [2024-01-04 13:24:03,187][model8_pretrain.py][INFO] Epoch:[0/2](413900/4588595) loss:2.507 lr:0.0000100 epoch_Time:26469.0min: [2024-01-04 13:24:03,187][model8_pretrain.py][INFO] Epoch:[0/2](413900/4588595) loss:2.354 lr:0.0000100 epoch_Time:26469.0min: [2024-01-04 13:24:03,187][model8_pretrain.py][INFO] Epoch:[0/2](413900/4588595) loss:2.140 lr:0.0000100 epoch_Time:26469.0min: [2024-01-04 13:24:03,187][model8_pretrain.py][INFO] Epoch:[0/2](413900/4588595) loss:2.749 lr:0.0000100 epoch_Time:26469.0min: [2024-01-04 13:24:40,092][model8_pretrain.py][INFO] Epoch:[0/2](414000/4588595) loss:2.514 lr:0.0000100 epoch_Time:26468.0min: [2024-01-04 
13:24:40,092][model8_pretrain.py][INFO] Epoch:[0/2](414000/4588595) loss:3.014 lr:0.0000100 epoch_Time:26468.0min: [2024-01-04 13:24:40,092][model8_pretrain.py][INFO] Epoch:[0/2](414000/4588595) loss:2.991 lr:0.0000100 epoch_Time:26468.0min: [2024-01-04 13:24:40,092][model8_pretrain.py][INFO] Epoch:[0/2](414000/4588595) loss:3.073 lr:0.0000100 epoch_Time:26468.0min: [2024-01-04 13:24:40,092][model8_pretrain.py][INFO] Epoch:[0/2](414000/4588595) loss:2.654 lr:0.0000100 epoch_Time:26468.0min: [2024-01-04 13:24:40,092][model8_pretrain.py][INFO] Epoch:[0/2](414000/4588595) loss:2.750 lr:0.0000100 epoch_Time:26468.0min: [2024-01-04 13:24:40,092][model8_pretrain.py][INFO] Epoch:[0/2](414000/4588595) loss:3.158 lr:0.0000100 epoch_Time:26468.0min: [2024-01-04 13:24:40,093][model8_pretrain.py][INFO] Epoch:[0/2](414000/4588595) loss:2.552 lr:0.0000100 epoch_Time:26468.0min: [2024-01-04 13:25:17,035][model8_pretrain.py][INFO] Epoch:[0/2](414100/4588595) loss:2.992 lr:0.0000100 epoch_Time:26467.0min: [2024-01-04 13:25:17,035][model8_pretrain.py][INFO] Epoch:[0/2](414100/4588595) loss:3.318 lr:0.0000100 epoch_Time:26467.0min: [2024-01-04 13:25:17,035][model8_pretrain.py][INFO] Epoch:[0/2](414100/4588595) loss:2.899 lr:0.0000100 epoch_Time:26467.0min: [2024-01-04 13:25:17,035][model8_pretrain.py][INFO] Epoch:[0/2](414100/4588595) loss:2.529 lr:0.0000100 epoch_Time:26467.0min: [2024-01-04 13:25:17,035][model8_pretrain.py][INFO] Epoch:[0/2](414100/4588595) loss:2.749 lr:0.0000100 epoch_Time:26467.0min: [2024-01-04 13:25:17,036][model8_pretrain.py][INFO] Epoch:[0/2](414100/4588595) loss:2.999 lr:0.0000100 epoch_Time:26467.0min: [2024-01-04 13:25:17,036][model8_pretrain.py][INFO] Epoch:[0/2](414100/4588595) loss:3.277 lr:0.0000100 epoch_Time:26467.0min: [2024-01-04 13:25:17,036][model8_pretrain.py][INFO] Epoch:[0/2](414100/4588595) loss:3.171 lr:0.0000100 epoch_Time:26467.0min: [2024-01-04 13:25:53,977][model8_pretrain.py][INFO] Epoch:[0/2](414200/4588595) loss:3.176 lr:0.0000100 
epoch_Time:26466.0min: [2024-01-04 13:25:53,977][model8_pretrain.py][INFO] Epoch:[0/2](414200/4588595) loss:2.818 lr:0.0000100 epoch_Time:26466.0min: [2024-01-04 13:25:53,977][model8_pretrain.py][INFO] Epoch:[0/2](414200/4588595) loss:3.032 lr:0.0000100 epoch_Time:26466.0min: [2024-01-04 13:25:53,977][model8_pretrain.py][INFO] Epoch:[0/2](414200/4588595) loss:2.895 lr:0.0000100 epoch_Time:26466.0min: [2024-01-04 13:25:53,977][model8_pretrain.py][INFO] Epoch:[0/2](414200/4588595) loss:3.181 lr:0.0000100 epoch_Time:26466.0min: [2024-01-04 13:25:53,977][model8_pretrain.py][INFO] Epoch:[0/2](414200/4588595) loss:2.862 lr:0.0000100 epoch_Time:26466.0min: [2024-01-04 13:25:53,977][model8_pretrain.py][INFO] Epoch:[0/2](414200/4588595) loss:2.993 lr:0.0000100 epoch_Time:26466.0min: [2024-01-04 13:25:53,978][model8_pretrain.py][INFO] Epoch:[0/2](414200/4588595) loss:2.734 lr:0.0000100 epoch_Time:26466.0min: [2024-01-04 13:26:30,935][model8_pretrain.py][INFO] Epoch:[0/2](414300/4588595) loss:2.485 lr:0.0000100 epoch_Time:26466.0min: [2024-01-04 13:26:30,935][model8_pretrain.py][INFO] Epoch:[0/2](414300/4588595) loss:3.019 lr:0.0000100 epoch_Time:26466.0min: [2024-01-04 13:26:30,935][model8_pretrain.py][INFO] Epoch:[0/2](414300/4588595) loss:3.207 lr:0.0000100 epoch_Time:26466.0min: [2024-01-04 13:26:30,935][model8_pretrain.py][INFO] Epoch:[0/2](414300/4588595) loss:2.877 lr:0.0000100 epoch_Time:26466.0min: [2024-01-04 13:26:30,935][model8_pretrain.py][INFO] Epoch:[0/2](414300/4588595) loss:2.554 lr:0.0000100 epoch_Time:26466.0min: [2024-01-04 13:26:30,935][model8_pretrain.py][INFO] Epoch:[0/2](414300/4588595) loss:2.735 lr:0.0000100 epoch_Time:26466.0min: [2024-01-04 13:26:30,935][model8_pretrain.py][INFO] Epoch:[0/2](414300/4588595) loss:2.867 lr:0.0000100 epoch_Time:26466.0min: [2024-01-04 13:26:30,935][model8_pretrain.py][INFO] Epoch:[0/2](414300/4588595) loss:2.456 lr:0.0000100 epoch_Time:26466.0min: [2024-01-04 13:27:09,635][model8_pretrain.py][INFO] 
Epoch:[0/2](414400/4588595) loss:2.858 lr:0.0000100 epoch_Time:26465.0min: [2024-01-04 13:27:09,635][model8_pretrain.py][INFO] Epoch:[0/2](414400/4588595) loss:2.350 lr:0.0000100 epoch_Time:26465.0min: [2024-01-04 13:27:09,635][model8_pretrain.py][INFO] Epoch:[0/2](414400/4588595) loss:2.761 lr:0.0000100 epoch_Time:26465.0min: [2024-01-04 13:27:09,635][model8_pretrain.py][INFO] Epoch:[0/2](414400/4588595) loss:3.185 lr:0.0000100 epoch_Time:26465.0min: [2024-01-04 13:27:09,635][model8_pretrain.py][INFO] Epoch:[0/2](414400/4588595) loss:2.882 lr:0.0000100 epoch_Time:26465.0min: [2024-01-04 13:27:09,636][model8_pretrain.py][INFO] Epoch:[0/2](414400/4588595) loss:2.778 lr:0.0000100 epoch_Time:26465.0min: [2024-01-04 13:27:09,635][model8_pretrain.py][INFO] Epoch:[0/2](414400/4588595) loss:2.481 lr:0.0000100 epoch_Time:26465.0min: [2024-01-04 13:27:09,636][model8_pretrain.py][INFO] Epoch:[0/2](414400/4588595) loss:2.533 lr:0.0000100 epoch_Time:26465.0min: [2024-01-04 13:27:55,599][model8_pretrain.py][INFO] Epoch:[0/2](414500/4588595) loss:2.765 lr:0.0000100 epoch_Time:26465.0min: [2024-01-04 13:27:55,599][model8_pretrain.py][INFO] Epoch:[0/2](414500/4588595) loss:2.392 lr:0.0000100 epoch_Time:26465.0min: [2024-01-04 13:27:55,599][model8_pretrain.py][INFO] Epoch:[0/2](414500/4588595) loss:3.181 lr:0.0000100 epoch_Time:26465.0min: [2024-01-04 13:27:55,599][model8_pretrain.py][INFO] Epoch:[0/2](414500/4588595) loss:3.063 lr:0.0000100 epoch_Time:26465.0min: [2024-01-04 13:27:55,599][model8_pretrain.py][INFO] Epoch:[0/2](414500/4588595) loss:2.797 lr:0.0000100 epoch_Time:26465.0min: [2024-01-04 13:27:55,599][model8_pretrain.py][INFO] Epoch:[0/2](414500/4588595) loss:2.728 lr:0.0000100 epoch_Time:26465.0min: [2024-01-04 13:27:55,599][model8_pretrain.py][INFO] Epoch:[0/2](414500/4588595) loss:3.134 lr:0.0000100 epoch_Time:26465.0min: [2024-01-04 13:27:55,599][model8_pretrain.py][INFO] Epoch:[0/2](414500/4588595) loss:3.196 lr:0.0000100 epoch_Time:26465.0min: [2024-01-04 
13:28:32,535][model8_pretrain.py][INFO] Epoch:[0/2](414600/4588595) loss:3.006 lr:0.0000100 epoch_Time:26465.0min: [2024-01-04 13:28:32,535][model8_pretrain.py][INFO] Epoch:[0/2](414600/4588595) loss:2.767 lr:0.0000100 epoch_Time:26465.0min: [2024-01-04 13:28:32,535][model8_pretrain.py][INFO] Epoch:[0/2](414600/4588595) loss:3.038 lr:0.0000100 epoch_Time:26465.0min: [2024-01-04 13:28:32,535][model8_pretrain.py][INFO] Epoch:[0/2](414600/4588595) loss:3.451 lr:0.0000100 epoch_Time:26465.0min: [2024-01-04 13:28:32,535][model8_pretrain.py][INFO] Epoch:[0/2](414600/4588595) loss:2.859 lr:0.0000100 epoch_Time:26465.0min: [2024-01-04 13:28:32,535][model8_pretrain.py][INFO] Epoch:[0/2](414600/4588595) loss:3.283 lr:0.0000100 epoch_Time:26465.0min: [2024-01-04 13:28:32,535][model8_pretrain.py][INFO] Epoch:[0/2](414600/4588595) loss:2.869 lr:0.0000100 epoch_Time:26465.0min: [2024-01-04 13:28:32,535][model8_pretrain.py][INFO] Epoch:[0/2](414600/4588595) loss:2.959 lr:0.0000100 epoch_Time:26465.0min: [2024-01-04 13:29:09,476][model8_pretrain.py][INFO] Epoch:[0/2](414700/4588595) loss:2.011 lr:0.0000100 epoch_Time:26464.0min: [2024-01-04 13:29:09,476][model8_pretrain.py][INFO] Epoch:[0/2](414700/4588595) loss:2.594 lr:0.0000100 epoch_Time:26464.0min: [2024-01-04 13:29:09,476][model8_pretrain.py][INFO] Epoch:[0/2](414700/4588595) loss:2.871 lr:0.0000100 epoch_Time:26464.0min: [2024-01-04 13:29:09,476][model8_pretrain.py][INFO] Epoch:[0/2](414700/4588595) loss:2.560 lr:0.0000100 epoch_Time:26464.0min: [2024-01-04 13:29:09,476][model8_pretrain.py][INFO] Epoch:[0/2](414700/4588595) loss:3.217 lr:0.0000100 epoch_Time:26464.0min: [2024-01-04 13:29:09,476][model8_pretrain.py][INFO] Epoch:[0/2](414700/4588595) loss:2.668 lr:0.0000100 epoch_Time:26464.0min: [2024-01-04 13:29:09,477][model8_pretrain.py][INFO] Epoch:[0/2](414700/4588595) loss:2.990 lr:0.0000100 epoch_Time:26464.0min: [2024-01-04 13:29:09,477][model8_pretrain.py][INFO] Epoch:[0/2](414700/4588595) loss:2.669 lr:0.0000100 
epoch_Time:26464.0min: [2024-01-04 13:29:46,408][model8_pretrain.py][INFO] Epoch:[0/2](414800/4588595) loss:3.007 lr:0.0000100 epoch_Time:26464.0min: [2024-01-04 13:29:46,408][model8_pretrain.py][INFO] Epoch:[0/2](414800/4588595) loss:3.061 lr:0.0000100 epoch_Time:26464.0min: [2024-01-04 13:29:46,408][model8_pretrain.py][INFO] Epoch:[0/2](414800/4588595) loss:3.167 lr:0.0000100 epoch_Time:26464.0min: [2024-01-04 13:29:46,408][model8_pretrain.py][INFO] Epoch:[0/2](414800/4588595) loss:3.186 lr:0.0000100 epoch_Time:26464.0min: [2024-01-04 13:29:46,409][model8_pretrain.py][INFO] Epoch:[0/2](414800/4588595) loss:2.607 lr:0.0000100 epoch_Time:26464.0min: [2024-01-04 13:29:46,409][model8_pretrain.py][INFO] Epoch:[0/2](414800/4588595) loss:2.911 lr:0.0000100 epoch_Time:26464.0min: [2024-01-04 13:29:46,409][model8_pretrain.py][INFO] Epoch:[0/2](414800/4588595) loss:2.885 lr:0.0000100 epoch_Time:26464.0min: [2024-01-04 13:29:46,409][model8_pretrain.py][INFO] Epoch:[0/2](414800/4588595) loss:2.136 lr:0.0000100 epoch_Time:26464.0min: [2024-01-04 13:30:23,355][model8_pretrain.py][INFO] Epoch:[0/2](414900/4588595) loss:2.524 lr:0.0000100 epoch_Time:26463.0min: [2024-01-04 13:30:23,355][model8_pretrain.py][INFO] Epoch:[0/2](414900/4588595) loss:2.696 lr:0.0000100 epoch_Time:26463.0min: [2024-01-04 13:30:23,355][model8_pretrain.py][INFO] Epoch:[0/2](414900/4588595) loss:2.569 lr:0.0000100 epoch_Time:26463.0min: [2024-01-04 13:30:23,355][model8_pretrain.py][INFO] Epoch:[0/2](414900/4588595) loss:3.139 lr:0.0000100 epoch_Time:26463.0min: [2024-01-04 13:30:23,355][model8_pretrain.py][INFO] Epoch:[0/2](414900/4588595) loss:2.973 lr:0.0000100 epoch_Time:26463.0min: [2024-01-04 13:30:23,355][model8_pretrain.py][INFO] Epoch:[0/2](414900/4588595) loss:3.377 lr:0.0000100 epoch_Time:26463.0min: [2024-01-04 13:30:23,355][model8_pretrain.py][INFO] Epoch:[0/2](414900/4588595) loss:3.020 lr:0.0000100 epoch_Time:26463.0min: [2024-01-04 13:30:23,356][model8_pretrain.py][INFO] 
Epoch:[0/2](414900/4588595) loss:2.737 lr:0.0000100 epoch_Time:26463.0min: [2024-01-04 13:31:00,299][model8_pretrain.py][INFO] Epoch:[0/2](415000/4588595) loss:2.841 lr:0.0000100 epoch_Time:26461.0min: [2024-01-04 13:31:00,299][model8_pretrain.py][INFO] Epoch:[0/2](415000/4588595) loss:2.859 lr:0.0000100 epoch_Time:26461.0min: [2024-01-04 13:31:00,299][model8_pretrain.py][INFO] Epoch:[0/2](415000/4588595) loss:2.754 lr:0.0000100 epoch_Time:26461.0min: [2024-01-04 13:31:00,299][model8_pretrain.py][INFO] Epoch:[0/2](415000/4588595) loss:2.871 lr:0.0000100 epoch_Time:26461.0min: [2024-01-04 13:31:00,299][model8_pretrain.py][INFO] Epoch:[0/2](415000/4588595) loss:2.615 lr:0.0000100 epoch_Time:26461.0min: [2024-01-04 13:31:00,299][model8_pretrain.py][INFO] Epoch:[0/2](415000/4588595) loss:2.815 lr:0.0000100 epoch_Time:26461.0min: [2024-01-04 13:31:00,299][model8_pretrain.py][INFO] Epoch:[0/2](415000/4588595) loss:3.291 lr:0.0000100 epoch_Time:26461.0min: [2024-01-04 13:31:00,299][model8_pretrain.py][INFO] Epoch:[0/2](415000/4588595) loss:2.519 lr:0.0000100 epoch_Time:26461.0min: [2024-01-04 13:31:37,232][model8_pretrain.py][INFO] Epoch:[0/2](415100/4588595) loss:2.498 lr:0.0000100 epoch_Time:26461.0min: [2024-01-04 13:31:37,232][model8_pretrain.py][INFO] Epoch:[0/2](415100/4588595) loss:2.646 lr:0.0000100 epoch_Time:26461.0min: [2024-01-04 13:31:37,232][model8_pretrain.py][INFO] Epoch:[0/2](415100/4588595) loss:2.986 lr:0.0000100 epoch_Time:26461.0min: [2024-01-04 13:31:37,232][model8_pretrain.py][INFO] Epoch:[0/2](415100/4588595) loss:2.589 lr:0.0000100 epoch_Time:26461.0min: [2024-01-04 13:31:37,232][model8_pretrain.py][INFO] Epoch:[0/2](415100/4588595) loss:2.971 lr:0.0000100 epoch_Time:26461.0min: [2024-01-04 13:31:37,232][model8_pretrain.py][INFO] Epoch:[0/2](415100/4588595) loss:3.549 lr:0.0000100 epoch_Time:26461.0min: [2024-01-04 13:31:37,232][model8_pretrain.py][INFO] Epoch:[0/2](415100/4588595) loss:3.132 lr:0.0000100 epoch_Time:26461.0min: [2024-01-04 
13:31:37,232][model8_pretrain.py][INFO] Epoch:[0/2](415100/4588595) loss:2.528 lr:0.0000100 epoch_Time:26461.0min: [2024-01-04 13:32:14,169][model8_pretrain.py][INFO] Epoch:[0/2](415200/4588595) loss:2.783 lr:0.0000100 epoch_Time:26460.0min: [2024-01-04 13:32:14,169][model8_pretrain.py][INFO] Epoch:[0/2](415200/4588595) loss:2.133 lr:0.0000100 epoch_Time:26460.0min: [2024-01-04 13:32:14,169][model8_pretrain.py][INFO] Epoch:[0/2](415200/4588595) loss:2.956 lr:0.0000100 epoch_Time:26460.0min: [2024-01-04 13:32:14,169][model8_pretrain.py][INFO] Epoch:[0/2](415200/4588595) loss:2.813 lr:0.0000100 epoch_Time:26460.0min: [2024-01-04 13:32:14,169][model8_pretrain.py][INFO] Epoch:[0/2](415200/4588595) loss:3.371 lr:0.0000100 epoch_Time:26460.0min: [2024-01-04 13:32:14,169][model8_pretrain.py][INFO] Epoch:[0/2](415200/4588595) loss:2.878 lr:0.0000100 epoch_Time:26460.0min: [2024-01-04 13:32:14,169][model8_pretrain.py][INFO] Epoch:[0/2](415200/4588595) loss:2.552 lr:0.0000100 epoch_Time:26460.0min: [2024-01-04 13:32:14,169][model8_pretrain.py][INFO] Epoch:[0/2](415200/4588595) loss:2.363 lr:0.0000100 epoch_Time:26460.0min: [2024-01-04 13:33:01,502][model8_pretrain.py][INFO] Epoch:[0/2](415300/4588595) loss:2.952 lr:0.0000100 epoch_Time:26461.0min: [2024-01-04 13:33:01,502][model8_pretrain.py][INFO] Epoch:[0/2](415300/4588595) loss:2.525 lr:0.0000100 epoch_Time:26461.0min: [2024-01-04 13:33:01,502][model8_pretrain.py][INFO] Epoch:[0/2](415300/4588595) loss:2.859 lr:0.0000100 epoch_Time:26461.0min: [2024-01-04 13:33:01,502][model8_pretrain.py][INFO] Epoch:[0/2](415300/4588595) loss:3.106 lr:0.0000100 epoch_Time:26461.0min: [2024-01-04 13:33:01,503][model8_pretrain.py][INFO] Epoch:[0/2](415300/4588595) loss:3.016 lr:0.0000100 epoch_Time:26461.0min: [2024-01-04 13:33:01,502][model8_pretrain.py][INFO] Epoch:[0/2](415300/4588595) loss:2.890 lr:0.0000100 epoch_Time:26461.0min: [2024-01-04 13:33:01,502][model8_pretrain.py][INFO] Epoch:[0/2](415300/4588595) loss:2.872 lr:0.0000100 
epoch_Time:26461.0min: [2024-01-04 13:33:01,503][model8_pretrain.py][INFO] Epoch:[0/2](415300/4588595) loss:2.791 lr:0.0000100 epoch_Time:26461.0min: [2024-01-04 13:33:38,435][model8_pretrain.py][INFO] Epoch:[0/2](415400/4588595) loss:2.636 lr:0.0000100 epoch_Time:26460.0min: [2024-01-04 13:33:38,435][model8_pretrain.py][INFO] Epoch:[0/2](415400/4588595) loss:2.647 lr:0.0000100 epoch_Time:26460.0min: [2024-01-04 13:33:38,435][model8_pretrain.py][INFO] Epoch:[0/2](415400/4588595) loss:3.235 lr:0.0000100 epoch_Time:26460.0min: [2024-01-04 13:33:38,435][model8_pretrain.py][INFO] Epoch:[0/2](415400/4588595) loss:2.842 lr:0.0000100 epoch_Time:26460.0min: [2024-01-04 13:33:38,435][model8_pretrain.py][INFO] Epoch:[0/2](415400/4588595) loss:2.625 lr:0.0000100 epoch_Time:26460.0min: [2024-01-04 13:33:38,435][model8_pretrain.py][INFO] Epoch:[0/2](415400/4588595) loss:2.369 lr:0.0000100 epoch_Time:26460.0min: [2024-01-04 13:33:38,435][model8_pretrain.py][INFO] Epoch:[0/2](415400/4588595) loss:3.261 lr:0.0000100 epoch_Time:26460.0min: [2024-01-04 13:33:38,435][model8_pretrain.py][INFO] Epoch:[0/2](415400/4588595) loss:2.313 lr:0.0000100 epoch_Time:26460.0min: [2024-01-04 13:34:15,377][model8_pretrain.py][INFO] Epoch:[0/2](415500/4588595) loss:2.820 lr:0.0000100 epoch_Time:26459.0min: [2024-01-04 13:34:15,377][model8_pretrain.py][INFO] Epoch:[0/2](415500/4588595) loss:2.692 lr:0.0000100 epoch_Time:26459.0min: [2024-01-04 13:34:15,377][model8_pretrain.py][INFO] Epoch:[0/2](415500/4588595) loss:3.203 lr:0.0000100 epoch_Time:26459.0min: [2024-01-04 13:34:15,377][model8_pretrain.py][INFO] Epoch:[0/2](415500/4588595) loss:2.698 lr:0.0000100 epoch_Time:26459.0min: [2024-01-04 13:34:15,377][model8_pretrain.py][INFO] Epoch:[0/2](415500/4588595) loss:2.965 lr:0.0000100 epoch_Time:26459.0min: [2024-01-04 13:34:15,377][model8_pretrain.py][INFO] Epoch:[0/2](415500/4588595) loss:2.947 lr:0.0000100 epoch_Time:26459.0min: [2024-01-04 13:34:15,377][model8_pretrain.py][INFO] 
Epoch:[0/2](415500/4588595) loss:3.086 lr:0.0000100 epoch_Time:26459.0min: [2024-01-04 13:34:15,377][model8_pretrain.py][INFO] Epoch:[0/2](415500/4588595) loss:3.372 lr:0.0000100 epoch_Time:26459.0min: [2024-01-04 13:34:52,312][model8_pretrain.py][INFO] Epoch:[0/2](415600/4588595) loss:3.154 lr:0.0000100 epoch_Time:26458.0min: [2024-01-04 13:34:52,312][model8_pretrain.py][INFO] Epoch:[0/2](415600/4588595) loss:3.119 lr:0.0000100 epoch_Time:26458.0min: [2024-01-04 13:34:52,312][model8_pretrain.py][INFO] Epoch:[0/2](415600/4588595) loss:2.499 lr:0.0000100 epoch_Time:26458.0min: [2024-01-04 13:34:52,312][model8_pretrain.py][INFO] Epoch:[0/2](415600/4588595) loss:3.181 lr:0.0000100 epoch_Time:26458.0min: [2024-01-04 13:34:52,312][model8_pretrain.py][INFO] Epoch:[0/2](415600/4588595) loss:2.710 lr:0.0000100 epoch_Time:26458.0min: [2024-01-04 13:34:52,312][model8_pretrain.py][INFO] Epoch:[0/2](415600/4588595) loss:2.540 lr:0.0000100 epoch_Time:26458.0min: [2024-01-04 13:34:52,312][model8_pretrain.py][INFO] Epoch:[0/2](415600/4588595) loss:2.878 lr:0.0000100 epoch_Time:26458.0min: [2024-01-04 13:34:52,312][model8_pretrain.py][INFO] Epoch:[0/2](415600/4588595) loss:2.962 lr:0.0000100 epoch_Time:26458.0min: [2024-01-04 13:35:29,264][model8_pretrain.py][INFO] Epoch:[0/2](415700/4588595) loss:2.844 lr:0.0000100 epoch_Time:26458.0min: [2024-01-04 13:35:29,265][model8_pretrain.py][INFO] Epoch:[0/2](415700/4588595) loss:2.784 lr:0.0000100 epoch_Time:26458.0min: [2024-01-04 13:35:29,264][model8_pretrain.py][INFO] Epoch:[0/2](415700/4588595) loss:3.075 lr:0.0000100 epoch_Time:26458.0min: [2024-01-04 13:35:29,265][model8_pretrain.py][INFO] Epoch:[0/2](415700/4588595) loss:3.235 lr:0.0000100 epoch_Time:26458.0min: [2024-01-04 13:35:29,265][model8_pretrain.py][INFO] Epoch:[0/2](415700/4588595) loss:3.123 lr:0.0000100 epoch_Time:26458.0min: [2024-01-04 13:35:29,265][model8_pretrain.py][INFO] Epoch:[0/2](415700/4588595) loss:2.786 lr:0.0000100 epoch_Time:26458.0min: [2024-01-04 
13:35:29,265][model8_pretrain.py][INFO] Epoch:[0/2](415700/4588595) loss:2.841 lr:0.0000100 epoch_Time:26458.0min: [2024-01-04 13:35:29,265][model8_pretrain.py][INFO] Epoch:[0/2](415700/4588595) loss:2.669 lr:0.0000100 epoch_Time:26458.0min: [2024-01-04 13:36:06,223][model8_pretrain.py][INFO] Epoch:[0/2](415800/4588595) loss:3.125 lr:0.0000100 epoch_Time:26457.0min: [2024-01-04 13:36:06,223][model8_pretrain.py][INFO] Epoch:[0/2](415800/4588595) loss:3.155 lr:0.0000100 epoch_Time:26457.0min: [2024-01-04 13:36:06,223][model8_pretrain.py][INFO] Epoch:[0/2](415800/4588595) loss:2.160 lr:0.0000100 epoch_Time:26457.0min: [2024-01-04 13:36:06,223][model8_pretrain.py][INFO] Epoch:[0/2](415800/4588595) loss:2.796 lr:0.0000100 epoch_Time:26457.0min: [2024-01-04 13:36:06,223][model8_pretrain.py][INFO] Epoch:[0/2](415800/4588595) loss:3.083 lr:0.0000100 epoch_Time:26457.0min: [2024-01-04 13:36:06,223][model8_pretrain.py][INFO] Epoch:[0/2](415800/4588595) loss:2.842 lr:0.0000100 epoch_Time:26457.0min: [2024-01-04 13:36:06,223][model8_pretrain.py][INFO] Epoch:[0/2](415800/4588595) loss:3.228 lr:0.0000100 epoch_Time:26457.0min: [2024-01-04 13:36:06,223][model8_pretrain.py][INFO] Epoch:[0/2](415800/4588595) loss:3.443 lr:0.0000100 epoch_Time:26457.0min: [2024-01-04 13:36:43,165][model8_pretrain.py][INFO] Epoch:[0/2](415900/4588595) loss:2.848 lr:0.0000100 epoch_Time:26456.0min: [2024-01-04 13:36:43,165][model8_pretrain.py][INFO] Epoch:[0/2](415900/4588595) loss:2.814 lr:0.0000100 epoch_Time:26456.0min: [2024-01-04 13:36:43,165][model8_pretrain.py][INFO] Epoch:[0/2](415900/4588595) loss:2.389 lr:0.0000100 epoch_Time:26456.0min: [2024-01-04 13:36:43,165][model8_pretrain.py][INFO] Epoch:[0/2](415900/4588595) loss:3.196 lr:0.0000100 epoch_Time:26456.0min: [2024-01-04 13:36:43,165][model8_pretrain.py][INFO] Epoch:[0/2](415900/4588595) loss:2.798 lr:0.0000100 epoch_Time:26456.0min: [2024-01-04 13:36:43,165][model8_pretrain.py][INFO] Epoch:[0/2](415900/4588595) loss:2.903 lr:0.0000100 
epoch_Time:26456.0min: [2024-01-04 13:36:43,165][model8_pretrain.py][INFO] Epoch:[0/2](415900/4588595) loss:2.305 lr:0.0000100 epoch_Time:26456.0min: [2024-01-04 13:36:43,166][model8_pretrain.py][INFO] Epoch:[0/2](415900/4588595) loss:2.903 lr:0.0000100 epoch_Time:26456.0min: [2024-01-04 13:37:20,104][model8_pretrain.py][INFO] Epoch:[0/2](416000/4588595) loss:2.338 lr:0.0000100 epoch_Time:26455.0min: [2024-01-04 13:37:20,104][model8_pretrain.py][INFO] Epoch:[0/2](416000/4588595) loss:2.841 lr:0.0000100 epoch_Time:26455.0min: [2024-01-04 13:37:20,104][model8_pretrain.py][INFO] Epoch:[0/2](416000/4588595) loss:3.320 lr:0.0000100 epoch_Time:26455.0min: [2024-01-04 13:37:20,105][model8_pretrain.py][INFO] Epoch:[0/2](416000/4588595) loss:3.280 lr:0.0000100 epoch_Time:26455.0min: [2024-01-04 13:37:20,105][model8_pretrain.py][INFO] Epoch:[0/2](416000/4588595) loss:3.154 lr:0.0000100 epoch_Time:26455.0min: [2024-01-04 13:37:20,105][model8_pretrain.py][INFO] Epoch:[0/2](416000/4588595) loss:2.906 lr:0.0000100 epoch_Time:26455.0min: [2024-01-04 13:37:20,105][model8_pretrain.py][INFO] Epoch:[0/2](416000/4588595) loss:2.657 lr:0.0000100 epoch_Time:26455.0min: [2024-01-04 13:37:20,105][model8_pretrain.py][INFO] Epoch:[0/2](416000/4588595) loss:2.981 lr:0.0000100 epoch_Time:26455.0min: [2024-01-04 13:38:07,404][model8_pretrain.py][INFO] Epoch:[0/2](416100/4588595) loss:2.782 lr:0.0000100 epoch_Time:26456.0min: [2024-01-04 13:38:07,404][model8_pretrain.py][INFO] Epoch:[0/2](416100/4588595) loss:3.194 lr:0.0000100 epoch_Time:26456.0min: [2024-01-04 13:38:07,405][model8_pretrain.py][INFO] Epoch:[0/2](416100/4588595) loss:2.916 lr:0.0000100 epoch_Time:26456.0min: [2024-01-04 13:38:07,405][model8_pretrain.py][INFO] Epoch:[0/2](416100/4588595) loss:2.487 lr:0.0000100 epoch_Time:26456.0min: [2024-01-04 13:38:07,405][model8_pretrain.py][INFO] Epoch:[0/2](416100/4588595) loss:3.026 lr:0.0000100 epoch_Time:26456.0min: [2024-01-04 13:38:07,405][model8_pretrain.py][INFO] 
Epoch:[0/2](416100/4588595) loss:3.084 lr:0.0000100 epoch_Time:26456.0min: [2024-01-04 13:38:07,405][model8_pretrain.py][INFO] Epoch:[0/2](416100/4588595) loss:3.193 lr:0.0000100 epoch_Time:26456.0min: [2024-01-04 13:38:07,405][model8_pretrain.py][INFO] Epoch:[0/2](416100/4588595) loss:3.022 lr:0.0000100 epoch_Time:26456.0min: [2024-01-04 13:38:44,350][model8_pretrain.py][INFO] Epoch:[0/2](416200/4588595) loss:3.360 lr:0.0000100 epoch_Time:26456.0min: [2024-01-04 13:38:44,350][model8_pretrain.py][INFO] Epoch:[0/2](416200/4588595) loss:2.298 lr:0.0000100 epoch_Time:26456.0min: [2024-01-04 13:38:44,350][model8_pretrain.py][INFO] Epoch:[0/2](416200/4588595) loss:3.163 lr:0.0000100 epoch_Time:26456.0min: [2024-01-04 13:38:44,350][model8_pretrain.py][INFO] Epoch:[0/2](416200/4588595) loss:2.868 lr:0.0000100 epoch_Time:26456.0min: [2024-01-04 13:38:44,350][model8_pretrain.py][INFO] Epoch:[0/2](416200/4588595) loss:2.944 lr:0.0000100 epoch_Time:26456.0min: [2024-01-04 13:38:44,350][model8_pretrain.py][INFO] Epoch:[0/2](416200/4588595) loss:2.885 lr:0.0000100 epoch_Time:26456.0min: [2024-01-04 13:38:44,350][model8_pretrain.py][INFO] Epoch:[0/2](416200/4588595) loss:2.890 lr:0.0000100 epoch_Time:26456.0min: [2024-01-04 13:38:44,350][model8_pretrain.py][INFO] Epoch:[0/2](416200/4588595) loss:2.833 lr:0.0000100 epoch_Time:26456.0min: [2024-01-04 13:39:21,305][model8_pretrain.py][INFO] Epoch:[0/2](416300/4588595) loss:3.200 lr:0.0000100 epoch_Time:26455.0min: [2024-01-04 13:39:21,305][model8_pretrain.py][INFO] Epoch:[0/2](416300/4588595) loss:3.146 lr:0.0000100 epoch_Time:26455.0min: [2024-01-04 13:39:21,305][model8_pretrain.py][INFO] Epoch:[0/2](416300/4588595) loss:3.035 lr:0.0000100 epoch_Time:26455.0min: [2024-01-04 13:39:21,305][model8_pretrain.py][INFO] Epoch:[0/2](416300/4588595) loss:3.004 lr:0.0000100 epoch_Time:26455.0min: [2024-01-04 13:39:21,305][model8_pretrain.py][INFO] Epoch:[0/2](416300/4588595) loss:2.704 lr:0.0000100 epoch_Time:26455.0min: [2024-01-04 
13:39:21,305][model8_pretrain.py][INFO] Epoch:[0/2](416300/4588595) loss:3.344 lr:0.0000100 epoch_Time:26455.0min: [2024-01-04 13:39:21,305][model8_pretrain.py][INFO] Epoch:[0/2](416300/4588595) loss:2.784 lr:0.0000100 epoch_Time:26455.0min: [2024-01-04 13:39:21,305][model8_pretrain.py][INFO] Epoch:[0/2](416300/4588595) loss:2.666 lr:0.0000100 epoch_Time:26455.0min: [2024-01-04 13:39:58,253][model8_pretrain.py][INFO] Epoch:[0/2](416400/4588595) loss:2.876 lr:0.0000100 epoch_Time:26453.0min: [2024-01-04 13:39:58,253][model8_pretrain.py][INFO] Epoch:[0/2](416400/4588595) loss:2.717 lr:0.0000100 epoch_Time:26453.0min: [2024-01-04 13:39:58,253][model8_pretrain.py][INFO] Epoch:[0/2](416400/4588595) loss:2.284 lr:0.0000100 epoch_Time:26453.0min: [2024-01-04 13:39:58,253][model8_pretrain.py][INFO] Epoch:[0/2](416400/4588595) loss:2.912 lr:0.0000100 epoch_Time:26453.0min: [2024-01-04 13:39:58,253][model8_pretrain.py][INFO] Epoch:[0/2](416400/4588595) loss:3.184 lr:0.0000100 epoch_Time:26453.0min: [2024-01-04 13:39:58,254][model8_pretrain.py][INFO] Epoch:[0/2](416400/4588595) loss:3.165 lr:0.0000100 epoch_Time:26453.0min: [2024-01-04 13:39:58,254][model8_pretrain.py][INFO] Epoch:[0/2](416400/4588595) loss:2.505 lr:0.0000100 epoch_Time:26453.0min: [2024-01-04 13:39:58,254][model8_pretrain.py][INFO] Epoch:[0/2](416400/4588595) loss:3.015 lr:0.0000100 epoch_Time:26453.0min: [2024-01-04 13:40:35,209][model8_pretrain.py][INFO] Epoch:[0/2](416500/4588595) loss:3.048 lr:0.0000100 epoch_Time:26453.0min: [2024-01-04 13:40:35,209][model8_pretrain.py][INFO] Epoch:[0/2](416500/4588595) loss:2.729 lr:0.0000100 epoch_Time:26453.0min: [2024-01-04 13:40:35,209][model8_pretrain.py][INFO] Epoch:[0/2](416500/4588595) loss:2.492 lr:0.0000100 epoch_Time:26453.0min: [2024-01-04 13:40:35,209][model8_pretrain.py][INFO] Epoch:[0/2](416500/4588595) loss:2.615 lr:0.0000100 epoch_Time:26453.0min: [2024-01-04 13:40:35,209][model8_pretrain.py][INFO] Epoch:[0/2](416500/4588595) loss:2.956 lr:0.0000100 
epoch_Time:26453.0min: [2024-01-04 13:40:35,209][model8_pretrain.py][INFO] Epoch:[0/2](416500/4588595) loss:2.941 lr:0.0000100 epoch_Time:26453.0min: [2024-01-04 13:40:35,209][model8_pretrain.py][INFO] Epoch:[0/2](416500/4588595) loss:2.744 lr:0.0000100 epoch_Time:26453.0min: [2024-01-04 13:40:35,209][model8_pretrain.py][INFO] Epoch:[0/2](416500/4588595) loss:2.937 lr:0.0000100 epoch_Time:26453.0min: [2024-01-04 13:41:12,167][model8_pretrain.py][INFO] Epoch:[0/2](416600/4588595) loss:2.275 lr:0.0000100 epoch_Time:26452.0min: [2024-01-04 13:41:12,167][model8_pretrain.py][INFO] Epoch:[0/2](416600/4588595) loss:3.214 lr:0.0000100 epoch_Time:26452.0min: [2024-01-04 13:41:12,167][model8_pretrain.py][INFO] Epoch:[0/2](416600/4588595) loss:3.064 lr:0.0000100 epoch_Time:26452.0min: [2024-01-04 13:41:12,167][model8_pretrain.py][INFO] Epoch:[0/2](416600/4588595) loss:2.940 lr:0.0000100 epoch_Time:26452.0min: [2024-01-04 13:41:12,167][model8_pretrain.py][INFO] Epoch:[0/2](416600/4588595) loss:2.387 lr:0.0000100 epoch_Time:26452.0min: [2024-01-04 13:41:12,167][model8_pretrain.py][INFO] Epoch:[0/2](416600/4588595) loss:3.531 lr:0.0000100 epoch_Time:26452.0min: [2024-01-04 13:41:12,167][model8_pretrain.py][INFO] Epoch:[0/2](416600/4588595) loss:3.098 lr:0.0000100 epoch_Time:26452.0min: [2024-01-04 13:41:12,167][model8_pretrain.py][INFO] Epoch:[0/2](416600/4588595) loss:2.809 lr:0.0000100 epoch_Time:26452.0min: [2024-01-04 13:41:49,131][model8_pretrain.py][INFO] Epoch:[0/2](416700/4588595) loss:2.883 lr:0.0000100 epoch_Time:26451.0min: [2024-01-04 13:41:49,131][model8_pretrain.py][INFO] Epoch:[0/2](416700/4588595) loss:2.919 lr:0.0000100 epoch_Time:26451.0min: [2024-01-04 13:41:49,131][model8_pretrain.py][INFO] Epoch:[0/2](416700/4588595) loss:2.836 lr:0.0000100 epoch_Time:26451.0min: [2024-01-04 13:41:49,131][model8_pretrain.py][INFO] Epoch:[0/2](416700/4588595) loss:2.422 lr:0.0000100 epoch_Time:26451.0min: [2024-01-04 13:41:49,131][model8_pretrain.py][INFO] 
Epoch:[0/2](416700/4588595) loss:2.473 lr:0.0000100 epoch_Time:26451.0min: [2024-01-04 13:41:49,131][model8_pretrain.py][INFO] Epoch:[0/2](416700/4588595) loss:2.630 lr:0.0000100 epoch_Time:26451.0min: [2024-01-04 13:41:49,131][model8_pretrain.py][INFO] Epoch:[0/2](416700/4588595) loss:2.882 lr:0.0000100 epoch_Time:26451.0min: [2024-01-04 13:41:49,131][model8_pretrain.py][INFO] Epoch:[0/2](416700/4588595) loss:2.783 lr:0.0000100 epoch_Time:26451.0min: [2024-01-04 13:42:26,086][model8_pretrain.py][INFO] Epoch:[0/2](416800/4588595) loss:3.190 lr:0.0000100 epoch_Time:26451.0min: [2024-01-04 13:42:26,086][model8_pretrain.py][INFO] Epoch:[0/2](416800/4588595) loss:3.425 lr:0.0000100 epoch_Time:26451.0min: [2024-01-04 13:42:26,086][model8_pretrain.py][INFO] Epoch:[0/2](416800/4588595) loss:2.749 lr:0.0000100 epoch_Time:26451.0min: [2024-01-04 13:42:26,086][model8_pretrain.py][INFO] Epoch:[0/2](416800/4588595) loss:2.852 lr:0.0000100 epoch_Time:26451.0min: [2024-01-04 13:42:26,086][model8_pretrain.py][INFO] Epoch:[0/2](416800/4588595) loss:2.831 lr:0.0000100 epoch_Time:26451.0min: [2024-01-04 13:42:26,086][model8_pretrain.py][INFO] Epoch:[0/2](416800/4588595) loss:2.813 lr:0.0000100 epoch_Time:26451.0min: [2024-01-04 13:42:26,086][model8_pretrain.py][INFO] Epoch:[0/2](416800/4588595) loss:3.022 lr:0.0000100 epoch_Time:26451.0min: [2024-01-04 13:42:26,086][model8_pretrain.py][INFO] Epoch:[0/2](416800/4588595) loss:2.843 lr:0.0000100 epoch_Time:26451.0min: [2024-01-04 13:43:13,395][model8_pretrain.py][INFO] Epoch:[0/2](416900/4588595) loss:2.835 lr:0.0000100 epoch_Time:26451.0min: [2024-01-04 13:43:13,395][model8_pretrain.py][INFO] Epoch:[0/2](416900/4588595) loss:2.676 lr:0.0000100 epoch_Time:26451.0min: [2024-01-04 13:43:13,395][model8_pretrain.py][INFO] Epoch:[0/2](416900/4588595) loss:2.579 lr:0.0000100 epoch_Time:26451.0min: [2024-01-04 13:43:13,395][model8_pretrain.py][INFO] Epoch:[0/2](416900/4588595) loss:3.082 lr:0.0000100 epoch_Time:26451.0min: [2024-01-04 
13:43:13,395][model8_pretrain.py][INFO] Epoch:[0/2](416900/4588595) loss:2.865 lr:0.0000100 epoch_Time:26451.0min: [2024-01-04 13:43:13,395][model8_pretrain.py][INFO] Epoch:[0/2](416900/4588595) loss:2.582 lr:0.0000100 epoch_Time:26451.0min: [2024-01-04 13:43:13,395][model8_pretrain.py][INFO] Epoch:[0/2](416900/4588595) loss:3.258 lr:0.0000100 epoch_Time:26451.0min: [2024-01-04 13:43:13,395][model8_pretrain.py][INFO] Epoch:[0/2](416900/4588595) loss:2.756 lr:0.0000100 epoch_Time:26451.0min: [2024-01-04 13:43:50,328][model8_pretrain.py][INFO] Epoch:[0/2](417000/4588595) loss:2.802 lr:0.0000100 epoch_Time:26450.0min: [2024-01-04 13:43:50,328][model8_pretrain.py][INFO] Epoch:[0/2](417000/4588595) loss:2.559 lr:0.0000100 epoch_Time:26450.0min: [2024-01-04 13:43:50,328][model8_pretrain.py][INFO] Epoch:[0/2](417000/4588595) loss:2.650 lr:0.0000100 epoch_Time:26450.0min: [2024-01-04 13:43:50,328][model8_pretrain.py][INFO] Epoch:[0/2](417000/4588595) loss:3.075 lr:0.0000100 epoch_Time:26450.0min: [2024-01-04 13:43:50,328][model8_pretrain.py][INFO] Epoch:[0/2](417000/4588595) loss:3.250 lr:0.0000100 epoch_Time:26450.0min: [2024-01-04 13:43:50,328][model8_pretrain.py][INFO] Epoch:[0/2](417000/4588595) loss:3.200 lr:0.0000100 epoch_Time:26450.0min: [2024-01-04 13:43:50,328][model8_pretrain.py][INFO] Epoch:[0/2](417000/4588595) loss:2.755 lr:0.0000100 epoch_Time:26450.0min: [2024-01-04 13:43:50,328][model8_pretrain.py][INFO] Epoch:[0/2](417000/4588595) loss:3.068 lr:0.0000100 epoch_Time:26450.0min: [2024-01-04 13:44:27,261][model8_pretrain.py][INFO] Epoch:[0/2](417100/4588595) loss:2.792 lr:0.0000100 epoch_Time:26450.0min: [2024-01-04 13:44:27,261][model8_pretrain.py][INFO] Epoch:[0/2](417100/4588595) loss:2.873 lr:0.0000100 epoch_Time:26450.0min: [2024-01-04 13:44:27,261][model8_pretrain.py][INFO] Epoch:[0/2](417100/4588595) loss:2.943 lr:0.0000100 epoch_Time:26450.0min: [2024-01-04 13:44:27,261][model8_pretrain.py][INFO] Epoch:[0/2](417100/4588595) loss:2.708 lr:0.0000100 
epoch_Time:26450.0min: [2024-01-04 13:44:27,261][model8_pretrain.py][INFO] Epoch:[0/2](417100/4588595) loss:2.882 lr:0.0000100 epoch_Time:26450.0min: [2024-01-04 13:44:27,261][model8_pretrain.py][INFO] Epoch:[0/2](417100/4588595) loss:2.993 lr:0.0000100 epoch_Time:26450.0min: [2024-01-04 13:44:27,261][model8_pretrain.py][INFO] Epoch:[0/2](417100/4588595) loss:2.621 lr:0.0000100 epoch_Time:26450.0min: [2024-01-04 13:44:27,262][model8_pretrain.py][INFO] Epoch:[0/2](417100/4588595) loss:2.608 lr:0.0000100 epoch_Time:26450.0min: [2024-01-04 13:45:04,201][model8_pretrain.py][INFO] Epoch:[0/2](417200/4588595) loss:2.947 lr:0.0000100 epoch_Time:26449.0min: [2024-01-04 13:45:04,200][model8_pretrain.py][INFO] Epoch:[0/2](417200/4588595) loss:2.429 lr:0.0000100 epoch_Time:26449.0min: [2024-01-04 13:45:04,201][model8_pretrain.py][INFO] Epoch:[0/2](417200/4588595) loss:3.148 lr:0.0000100 epoch_Time:26449.0min: [2024-01-04 13:45:04,201][model8_pretrain.py][INFO] Epoch:[0/2](417200/4588595) loss:3.133 lr:0.0000100 epoch_Time:26449.0min: [2024-01-04 13:45:04,201][model8_pretrain.py][INFO] Epoch:[0/2](417200/4588595) loss:2.882 lr:0.0000100 epoch_Time:26449.0min: [2024-01-04 13:45:04,201][model8_pretrain.py][INFO] Epoch:[0/2](417200/4588595) loss:2.703 lr:0.0000100 epoch_Time:26449.0min: [2024-01-04 13:45:04,201][model8_pretrain.py][INFO] Epoch:[0/2](417200/4588595) loss:2.507 lr:0.0000100 epoch_Time:26449.0min: [2024-01-04 13:45:04,201][model8_pretrain.py][INFO] Epoch:[0/2](417200/4588595) loss:2.512 lr:0.0000100 epoch_Time:26449.0min: [2024-01-04 13:45:41,163][model8_pretrain.py][INFO] Epoch:[0/2](417300/4588595) loss:3.108 lr:0.0000100 epoch_Time:26448.0min: [2024-01-04 13:45:41,163][model8_pretrain.py][INFO] Epoch:[0/2](417300/4588595) loss:3.396 lr:0.0000100 epoch_Time:26448.0min: [2024-01-04 13:45:41,163][model8_pretrain.py][INFO] Epoch:[0/2](417300/4588595) loss:2.938 lr:0.0000100 epoch_Time:26448.0min: [2024-01-04 13:45:41,163][model8_pretrain.py][INFO] 
Epoch:[0/2](417300/4588595) loss:3.097 lr:0.0000100 epoch_Time:26448.0min: [2024-01-04 13:45:41,163][model8_pretrain.py][INFO] Epoch:[0/2](417300/4588595) loss:2.926 lr:0.0000100 epoch_Time:26448.0min: [2024-01-04 13:45:41,163][model8_pretrain.py][INFO] Epoch:[0/2](417300/4588595) loss:2.727 lr:0.0000100 epoch_Time:26448.0min: [2024-01-04 13:45:41,163][model8_pretrain.py][INFO] Epoch:[0/2](417300/4588595) loss:3.015 lr:0.0000100 epoch_Time:26448.0min: [2024-01-04 13:45:41,163][model8_pretrain.py][INFO] Epoch:[0/2](417300/4588595) loss:3.223 lr:0.0000100 epoch_Time:26448.0min: [2024-01-04 13:46:18,128][model8_pretrain.py][INFO] Epoch:[0/2](417400/4588595) loss:2.899 lr:0.0000100 epoch_Time:26447.0min: [2024-01-04 13:46:18,128][model8_pretrain.py][INFO] Epoch:[0/2](417400/4588595) loss:3.000 lr:0.0000100 epoch_Time:26447.0min: [2024-01-04 13:46:18,128][model8_pretrain.py][INFO] Epoch:[0/2](417400/4588595) loss:2.992 lr:0.0000100 epoch_Time:26447.0min: [2024-01-04 13:46:18,128][model8_pretrain.py][INFO] Epoch:[0/2](417400/4588595) loss:2.951 lr:0.0000100 epoch_Time:26447.0min: [2024-01-04 13:46:18,128][model8_pretrain.py][INFO] Epoch:[0/2](417400/4588595) loss:2.538 lr:0.0000100 epoch_Time:26447.0min: [2024-01-04 13:46:18,129][model8_pretrain.py][INFO] Epoch:[0/2](417400/4588595) loss:2.669 lr:0.0000100 epoch_Time:26447.0min: [2024-01-04 13:46:18,129][model8_pretrain.py][INFO] Epoch:[0/2](417400/4588595) loss:2.773 lr:0.0000100 epoch_Time:26447.0min: [2024-01-04 13:46:18,129][model8_pretrain.py][INFO] Epoch:[0/2](417400/4588595) loss:2.563 lr:0.0000100 epoch_Time:26447.0min: [2024-01-04 13:46:55,081][model8_pretrain.py][INFO] Epoch:[0/2](417500/4588595) loss:2.780 lr:0.0000100 epoch_Time:26446.0min: [2024-01-04 13:46:55,081][model8_pretrain.py][INFO] Epoch:[0/2](417500/4588595) loss:2.934 lr:0.0000100 epoch_Time:26446.0min: [2024-01-04 13:46:55,081][model8_pretrain.py][INFO] Epoch:[0/2](417500/4588595) loss:3.234 lr:0.0000100 epoch_Time:26446.0min: [2024-01-04 
13:46:55,081][model8_pretrain.py][INFO] Epoch:[0/2](417500/4588595) loss:2.925 lr:0.0000100 epoch_Time:26446.0min: [2024-01-04 13:46:55,081][model8_pretrain.py][INFO] Epoch:[0/2](417500/4588595) loss:2.759 lr:0.0000100 epoch_Time:26446.0min: [2024-01-04 13:46:55,081][model8_pretrain.py][INFO] Epoch:[0/2](417500/4588595) loss:2.328 lr:0.0000100 epoch_Time:26446.0min: [2024-01-04 13:46:55,081][model8_pretrain.py][INFO] Epoch:[0/2](417500/4588595) loss:2.920 lr:0.0000100 epoch_Time:26446.0min: [2024-01-04 13:46:55,081][model8_pretrain.py][INFO] Epoch:[0/2](417500/4588595) loss:2.581 lr:0.0000100 epoch_Time:26446.0min: [2024-01-04 13:47:31,985][model8_pretrain.py][INFO] Epoch:[0/2](417600/4588595) loss:2.815 lr:0.0000100 epoch_Time:26446.0min: [2024-01-04 13:47:31,985][model8_pretrain.py][INFO] Epoch:[0/2](417600/4588595) loss:2.899 lr:0.0000100 epoch_Time:26446.0min: [2024-01-04 13:47:31,985][model8_pretrain.py][INFO] Epoch:[0/2](417600/4588595) loss:3.267 lr:0.0000100 epoch_Time:26446.0min: [2024-01-04 13:47:31,985][model8_pretrain.py][INFO] Epoch:[0/2](417600/4588595) loss:2.977 lr:0.0000100 epoch_Time:26446.0min: [2024-01-04 13:47:31,985][model8_pretrain.py][INFO] Epoch:[0/2](417600/4588595) loss:2.767 lr:0.0000100 epoch_Time:26446.0min: [2024-01-04 13:47:31,985][model8_pretrain.py][INFO] Epoch:[0/2](417600/4588595) loss:3.138 lr:0.0000100 epoch_Time:26446.0min: [2024-01-04 13:47:31,985][model8_pretrain.py][INFO] Epoch:[0/2](417600/4588595) loss:2.787 lr:0.0000100 epoch_Time:26446.0min: [2024-01-04 13:47:31,986][model8_pretrain.py][INFO] Epoch:[0/2](417600/4588595) loss:2.686 lr:0.0000100 epoch_Time:26446.0min: [2024-01-04 13:48:19,366][model8_pretrain.py][INFO] Epoch:[0/2](417700/4588595) loss:2.131 lr:0.0000100 epoch_Time:26447.0min: [2024-01-04 13:48:19,366][model8_pretrain.py][INFO] Epoch:[0/2](417700/4588595) loss:3.423 lr:0.0000100 epoch_Time:26447.0min: [2024-01-04 13:48:19,366][model8_pretrain.py][INFO] Epoch:[0/2](417700/4588595) loss:2.676 lr:0.0000100 
epoch_Time:26447.0min: [2024-01-04 13:48:19,366][model8_pretrain.py][INFO] Epoch:[0/2](417700/4588595) loss:2.887 lr:0.0000100 epoch_Time:26447.0min: [2024-01-04 13:48:19,366][model8_pretrain.py][INFO] Epoch:[0/2](417700/4588595) loss:2.196 lr:0.0000100 epoch_Time:26447.0min: [2024-01-04 13:48:19,366][model8_pretrain.py][INFO] Epoch:[0/2](417700/4588595) loss:3.183 lr:0.0000100 epoch_Time:26447.0min: [2024-01-04 13:48:19,366][model8_pretrain.py][INFO] Epoch:[0/2](417700/4588595) loss:2.871 lr:0.0000100 epoch_Time:26447.0min: [2024-01-04 13:48:19,366][model8_pretrain.py][INFO] Epoch:[0/2](417700/4588595) loss:3.128 lr:0.0000100 epoch_Time:26447.0min: [2024-01-04 13:48:56,297][model8_pretrain.py][INFO] Epoch:[0/2](417800/4588595) loss:2.719 lr:0.0000100 epoch_Time:26445.0min: [2024-01-04 13:48:56,297][model8_pretrain.py][INFO] Epoch:[0/2](417800/4588595) loss:2.910 lr:0.0000100 epoch_Time:26445.0min: [2024-01-04 13:48:56,297][model8_pretrain.py][INFO] Epoch:[0/2](417800/4588595) loss:3.369 lr:0.0000100 epoch_Time:26445.0min: [2024-01-04 13:48:56,297][model8_pretrain.py][INFO] Epoch:[0/2](417800/4588595) loss:3.018 lr:0.0000100 epoch_Time:26445.0min: [2024-01-04 13:48:56,297][model8_pretrain.py][INFO] Epoch:[0/2](417800/4588595) loss:2.706 lr:0.0000100 epoch_Time:26445.0min: [2024-01-04 13:48:56,297][model8_pretrain.py][INFO] Epoch:[0/2](417800/4588595) loss:3.269 lr:0.0000100 epoch_Time:26445.0min: [2024-01-04 13:48:56,297][model8_pretrain.py][INFO] Epoch:[0/2](417800/4588595) loss:2.743 lr:0.0000100 epoch_Time:26445.0min: [2024-01-04 13:48:56,297][model8_pretrain.py][INFO] Epoch:[0/2](417800/4588595) loss:2.895 lr:0.0000100 epoch_Time:26445.0min: [2024-01-04 13:49:33,233][model8_pretrain.py][INFO] Epoch:[0/2](417900/4588595) loss:2.617 lr:0.0000100 epoch_Time:26445.0min: [2024-01-04 13:49:33,233][model8_pretrain.py][INFO] Epoch:[0/2](417900/4588595) loss:2.916 lr:0.0000100 epoch_Time:26445.0min: [2024-01-04 13:49:33,234][model8_pretrain.py][INFO] 
Epoch:[0/2](417900/4588595) loss:2.856 lr:0.0000100 epoch_Time:26445.0min: [2024-01-04 13:49:33,234][model8_pretrain.py][INFO] Epoch:[0/2](417900/4588595) loss:3.344 lr:0.0000100 epoch_Time:26445.0min: [2024-01-04 13:49:33,234][model8_pretrain.py][INFO] Epoch:[0/2](417900/4588595) loss:3.656 lr:0.0000100 epoch_Time:26445.0min: [2024-01-04 13:49:33,234][model8_pretrain.py][INFO] Epoch:[0/2](417900/4588595) loss:2.647 lr:0.0000100 epoch_Time:26445.0min: [2024-01-04 13:49:33,234][model8_pretrain.py][INFO] Epoch:[0/2](417900/4588595) loss:2.543 lr:0.0000100 epoch_Time:26445.0min: [2024-01-04 13:49:33,234][model8_pretrain.py][INFO] Epoch:[0/2](417900/4588595) loss:3.205 lr:0.0000100 epoch_Time:26445.0min: [2024-01-04 13:50:10,175][model8_pretrain.py][INFO] Epoch:[0/2](418000/4588595) loss:3.092 lr:0.0000100 epoch_Time:26444.0min: [2024-01-04 13:50:10,175][model8_pretrain.py][INFO] Epoch:[0/2](418000/4588595) loss:2.957 lr:0.0000100 epoch_Time:26444.0min: [2024-01-04 13:50:10,175][model8_pretrain.py][INFO] Epoch:[0/2](418000/4588595) loss:2.940 lr:0.0000100 epoch_Time:26444.0min: [2024-01-04 13:50:10,175][model8_pretrain.py][INFO] Epoch:[0/2](418000/4588595) loss:2.371 lr:0.0000100 epoch_Time:26444.0min: [2024-01-04 13:50:10,175][model8_pretrain.py][INFO] Epoch:[0/2](418000/4588595) loss:2.992 lr:0.0000100 epoch_Time:26444.0min: [2024-01-04 13:50:10,175][model8_pretrain.py][INFO] Epoch:[0/2](418000/4588595) loss:2.786 lr:0.0000100 epoch_Time:26444.0min: [2024-01-04 13:50:10,176][model8_pretrain.py][INFO] Epoch:[0/2](418000/4588595) loss:2.640 lr:0.0000100 epoch_Time:26444.0min: [2024-01-04 13:50:10,176][model8_pretrain.py][INFO] Epoch:[0/2](418000/4588595) loss:3.476 lr:0.0000100 epoch_Time:26444.0min: [2024-01-04 13:50:47,106][model8_pretrain.py][INFO] Epoch:[0/2](418100/4588595) loss:3.308 lr:0.0000100 epoch_Time:26444.0min: [2024-01-04 13:50:47,106][model8_pretrain.py][INFO] Epoch:[0/2](418100/4588595) loss:3.037 lr:0.0000100 epoch_Time:26444.0min: [2024-01-04 
13:50:47,106][model8_pretrain.py][INFO] Epoch:[0/2](418100/4588595) loss:2.727 lr:0.0000100 epoch_Time:26444.0min: [2024-01-04 13:50:47,106][model8_pretrain.py][INFO] Epoch:[0/2](418100/4588595) loss:2.851 lr:0.0000100 epoch_Time:26444.0min: [2024-01-04 13:50:47,106][model8_pretrain.py][INFO] Epoch:[0/2](418100/4588595) loss:2.795 lr:0.0000100 epoch_Time:26444.0min: [2024-01-04 13:50:47,106][model8_pretrain.py][INFO] Epoch:[0/2](418100/4588595) loss:2.685 lr:0.0000100 epoch_Time:26444.0min: [2024-01-04 13:50:47,106][model8_pretrain.py][INFO] Epoch:[0/2](418100/4588595) loss:3.064 lr:0.0000100 epoch_Time:26444.0min: [2024-01-04 13:50:47,106][model8_pretrain.py][INFO] Epoch:[0/2](418100/4588595) loss:2.834 lr:0.0000100 epoch_Time:26444.0min: [2024-01-04 13:51:24,048][model8_pretrain.py][INFO] Epoch:[0/2](418200/4588595) loss:2.612 lr:0.0000100 epoch_Time:26443.0min: [2024-01-04 13:51:24,048][model8_pretrain.py][INFO] Epoch:[0/2](418200/4588595) loss:3.030 lr:0.0000100 epoch_Time:26443.0min: [2024-01-04 13:51:24,048][model8_pretrain.py][INFO] Epoch:[0/2](418200/4588595) loss:2.653 lr:0.0000100 epoch_Time:26443.0min: [2024-01-04 13:51:24,048][model8_pretrain.py][INFO] Epoch:[0/2](418200/4588595) loss:3.273 lr:0.0000100 epoch_Time:26443.0min: [2024-01-04 13:51:24,048][model8_pretrain.py][INFO] Epoch:[0/2](418200/4588595) loss:3.072 lr:0.0000100 epoch_Time:26443.0min: [2024-01-04 13:51:24,048][model8_pretrain.py][INFO] Epoch:[0/2](418200/4588595) loss:2.992 lr:0.0000100 epoch_Time:26443.0min: [2024-01-04 13:51:24,048][model8_pretrain.py][INFO] Epoch:[0/2](418200/4588595) loss:2.627 lr:0.0000100 epoch_Time:26443.0min: [2024-01-04 13:51:24,048][model8_pretrain.py][INFO] Epoch:[0/2](418200/4588595) loss:2.761 lr:0.0000100 epoch_Time:26443.0min: [2024-01-04 13:52:00,983][model8_pretrain.py][INFO] Epoch:[0/2](418300/4588595) loss:3.282 lr:0.0000100 epoch_Time:26441.0min: [2024-01-04 13:52:00,983][model8_pretrain.py][INFO] Epoch:[0/2](418300/4588595) loss:2.603 lr:0.0000100 
epoch_Time:26441.0min: [2024-01-04 13:52:00,983][model8_pretrain.py][INFO] Epoch:[0/2](418300/4588595) loss:2.133 lr:0.0000100 epoch_Time:26441.0min: [2024-01-04 13:52:00,983][model8_pretrain.py][INFO] Epoch:[0/2](418300/4588595) loss:3.139 lr:0.0000100 epoch_Time:26441.0min: [2024-01-04 13:52:00,983][model8_pretrain.py][INFO] Epoch:[0/2](418300/4588595) loss:2.937 lr:0.0000100 epoch_Time:26441.0min: [2024-01-04 13:52:00,983][model8_pretrain.py][INFO] Epoch:[0/2](418300/4588595) loss:2.772 lr:0.0000100 epoch_Time:26441.0min: [2024-01-04 13:52:00,983][model8_pretrain.py][INFO] Epoch:[0/2](418300/4588595) loss:3.182 lr:0.0000100 epoch_Time:26441.0min: [2024-01-04 13:52:00,983][model8_pretrain.py][INFO] Epoch:[0/2](418300/4588595) loss:2.806 lr:0.0000100 epoch_Time:26441.0min: [2024-01-04 13:52:37,916][model8_pretrain.py][INFO] Epoch:[0/2](418400/4588595) loss:2.693 lr:0.0000100 epoch_Time:26441.0min: [2024-01-04 13:52:37,916][model8_pretrain.py][INFO] Epoch:[0/2](418400/4588595) loss:2.632 lr:0.0000100 epoch_Time:26441.0min: [2024-01-04 13:52:37,916][model8_pretrain.py][INFO] Epoch:[0/2](418400/4588595) loss:2.761 lr:0.0000100 epoch_Time:26441.0min: [2024-01-04 13:52:37,916][model8_pretrain.py][INFO] Epoch:[0/2](418400/4588595) loss:2.908 lr:0.0000100 epoch_Time:26441.0min: [2024-01-04 13:52:37,916][model8_pretrain.py][INFO] Epoch:[0/2](418400/4588595) loss:3.050 lr:0.0000100 epoch_Time:26441.0min: [2024-01-04 13:52:37,916][model8_pretrain.py][INFO] Epoch:[0/2](418400/4588595) loss:2.634 lr:0.0000100 epoch_Time:26441.0min: [2024-01-04 13:52:37,916][model8_pretrain.py][INFO] Epoch:[0/2](418400/4588595) loss:3.373 lr:0.0000100 epoch_Time:26441.0min: [2024-01-04 13:52:37,916][model8_pretrain.py][INFO] Epoch:[0/2](418400/4588595) loss:2.661 lr:0.0000100 epoch_Time:26441.0min: [2024-01-04 13:53:25,143][model8_pretrain.py][INFO] Epoch:[0/2](418500/4588595) loss:2.564 lr:0.0000100 epoch_Time:26442.0min: [2024-01-04 13:53:25,143][model8_pretrain.py][INFO] 
Epoch:[0/2](418500/4588595) loss:2.755 lr:0.0000100 epoch_Time:26442.0min: [2024-01-04 13:53:25,144][model8_pretrain.py][INFO] Epoch:[0/2](418500/4588595) loss:2.789 lr:0.0000100 epoch_Time:26442.0min: [2024-01-04 13:53:25,144][model8_pretrain.py][INFO] Epoch:[0/2](418500/4588595) loss:3.391 lr:0.0000100 epoch_Time:26442.0min: [2024-01-04 13:53:25,144][model8_pretrain.py][INFO] Epoch:[0/2](418500/4588595) loss:2.915 lr:0.0000100 epoch_Time:26442.0min: [2024-01-04 13:53:25,144][model8_pretrain.py][INFO] Epoch:[0/2](418500/4588595) loss:2.679 lr:0.0000100 epoch_Time:26442.0min: [2024-01-04 13:53:25,144][model8_pretrain.py][INFO] Epoch:[0/2](418500/4588595) loss:3.104 lr:0.0000100 epoch_Time:26442.0min: [2024-01-04 13:53:25,144][model8_pretrain.py][INFO] Epoch:[0/2](418500/4588595) loss:3.064 lr:0.0000100 epoch_Time:26442.0min: [2024-01-04 13:54:02,076][model8_pretrain.py][INFO] Epoch:[0/2](418600/4588595) loss:2.861 lr:0.0000100 epoch_Time:26441.0min: [2024-01-04 13:54:02,076][model8_pretrain.py][INFO] Epoch:[0/2](418600/4588595) loss:2.576 lr:0.0000100 epoch_Time:26441.0min: [2024-01-04 13:54:02,076][model8_pretrain.py][INFO] Epoch:[0/2](418600/4588595) loss:2.267 lr:0.0000100 epoch_Time:26441.0min: [2024-01-04 13:54:02,076][model8_pretrain.py][INFO] Epoch:[0/2](418600/4588595) loss:2.894 lr:0.0000100 epoch_Time:26441.0min: [2024-01-04 13:54:02,076][model8_pretrain.py][INFO] Epoch:[0/2](418600/4588595) loss:2.261 lr:0.0000100 epoch_Time:26441.0min: [2024-01-04 13:54:02,076][model8_pretrain.py][INFO] Epoch:[0/2](418600/4588595) loss:2.734 lr:0.0000100 epoch_Time:26441.0min: [2024-01-04 13:54:02,076][model8_pretrain.py][INFO] Epoch:[0/2](418600/4588595) loss:3.152 lr:0.0000100 epoch_Time:26441.0min: [2024-01-04 13:54:02,076][model8_pretrain.py][INFO] Epoch:[0/2](418600/4588595) loss:2.963 lr:0.0000100 epoch_Time:26441.0min: [2024-01-04 13:54:38,990][model8_pretrain.py][INFO] Epoch:[0/2](418700/4588595) loss:3.378 lr:0.0000100 epoch_Time:26440.0min: [2024-01-04 
13:54:38,990][model8_pretrain.py][INFO] Epoch:[0/2](418700/4588595) loss:2.119 lr:0.0000100 epoch_Time:26440.0min: [2024-01-04 13:54:38,990][model8_pretrain.py][INFO] Epoch:[0/2](418700/4588595) loss:2.924 lr:0.0000100 epoch_Time:26440.0min: [2024-01-04 13:54:38,990][model8_pretrain.py][INFO] Epoch:[0/2](418700/4588595) loss:2.780 lr:0.0000100 epoch_Time:26440.0min: [2024-01-04 13:54:38,990][model8_pretrain.py][INFO] Epoch:[0/2](418700/4588595) loss:2.829 lr:0.0000100 epoch_Time:26440.0min: [2024-01-04 13:54:38,990][model8_pretrain.py][INFO] Epoch:[0/2](418700/4588595) loss:2.887 lr:0.0000100 epoch_Time:26440.0min: [2024-01-04 13:54:38,990][model8_pretrain.py][INFO] Epoch:[0/2](418700/4588595) loss:2.419 lr:0.0000100 epoch_Time:26440.0min: [2024-01-04 13:54:38,990][model8_pretrain.py][INFO] Epoch:[0/2](418700/4588595) loss:2.969 lr:0.0000100 epoch_Time:26440.0min: [2024-01-04 13:55:15,946][model8_pretrain.py][INFO] Epoch:[0/2](418800/4588595) loss:2.870 lr:0.0000100 epoch_Time:26439.0min: [2024-01-04 13:55:15,946][model8_pretrain.py][INFO] Epoch:[0/2](418800/4588595) loss:3.095 lr:0.0000100 epoch_Time:26439.0min: [2024-01-04 13:55:15,946][model8_pretrain.py][INFO] Epoch:[0/2](418800/4588595) loss:2.678 lr:0.0000100 epoch_Time:26439.0min: [2024-01-04 13:55:15,946][model8_pretrain.py][INFO] Epoch:[0/2](418800/4588595) loss:2.657 lr:0.0000100 epoch_Time:26439.0min: [2024-01-04 13:55:15,946][model8_pretrain.py][INFO] Epoch:[0/2](418800/4588595) loss:2.724 lr:0.0000100 epoch_Time:26439.0min: [2024-01-04 13:55:15,946][model8_pretrain.py][INFO] Epoch:[0/2](418800/4588595) loss:2.879 lr:0.0000100 epoch_Time:26439.0min: [2024-01-04 13:55:15,946][model8_pretrain.py][INFO] Epoch:[0/2](418800/4588595) loss:3.202 lr:0.0000100 epoch_Time:26439.0min: [2024-01-04 13:55:15,946][model8_pretrain.py][INFO] Epoch:[0/2](418800/4588595) loss:2.949 lr:0.0000100 epoch_Time:26439.0min: [2024-01-04 13:55:52,883][model8_pretrain.py][INFO] Epoch:[0/2](418900/4588595) loss:2.915 lr:0.0000100 
epoch_Time:26438.0min: [2024-01-04 13:55:52,883][model8_pretrain.py][INFO] Epoch:[0/2](418900/4588595) loss:2.579 lr:0.0000100 epoch_Time:26438.0min: [2024-01-04 13:55:52,883][model8_pretrain.py][INFO] Epoch:[0/2](418900/4588595) loss:2.943 lr:0.0000100 epoch_Time:26438.0min: [2024-01-04 13:55:52,884][model8_pretrain.py][INFO] Epoch:[0/2](418900/4588595) loss:2.764 lr:0.0000100 epoch_Time:26438.0min: [2024-01-04 13:55:52,884][model8_pretrain.py][INFO] Epoch:[0/2](418900/4588595) loss:2.563 lr:0.0000100 epoch_Time:26438.0min: [2024-01-04 13:55:52,884][model8_pretrain.py][INFO] Epoch:[0/2](418900/4588595) loss:2.798 lr:0.0000100 epoch_Time:26438.0min: [2024-01-04 13:55:52,884][model8_pretrain.py][INFO] Epoch:[0/2](418900/4588595) loss:2.711 lr:0.0000100 epoch_Time:26438.0min: [2024-01-04 13:55:52,884][model8_pretrain.py][INFO] Epoch:[0/2](418900/4588595) loss:2.395 lr:0.0000100 epoch_Time:26438.0min: [2024-01-04 13:56:29,830][model8_pretrain.py][INFO] Epoch:[0/2](419000/4588595) loss:2.681 lr:0.0000100 epoch_Time:26438.0min: [2024-01-04 13:56:29,830][model8_pretrain.py][INFO] Epoch:[0/2](419000/4588595) loss:3.126 lr:0.0000100 epoch_Time:26438.0min: [2024-01-04 13:56:29,830][model8_pretrain.py][INFO] Epoch:[0/2](419000/4588595) loss:3.028 lr:0.0000100 epoch_Time:26438.0min: [2024-01-04 13:56:29,830][model8_pretrain.py][INFO] Epoch:[0/2](419000/4588595) loss:3.226 lr:0.0000100 epoch_Time:26438.0min: [2024-01-04 13:56:29,830][model8_pretrain.py][INFO] Epoch:[0/2](419000/4588595) loss:2.695 lr:0.0000100 epoch_Time:26438.0min: [2024-01-04 13:56:29,830][model8_pretrain.py][INFO] Epoch:[0/2](419000/4588595) loss:2.731 lr:0.0000100 epoch_Time:26438.0min: [2024-01-04 13:56:29,831][model8_pretrain.py][INFO] Epoch:[0/2](419000/4588595) loss:3.132 lr:0.0000100 epoch_Time:26438.0min: [2024-01-04 13:56:29,831][model8_pretrain.py][INFO] Epoch:[0/2](419000/4588595) loss:2.545 lr:0.0000100 epoch_Time:26438.0min: [2024-01-04 13:57:06,776][model8_pretrain.py][INFO] 
Epoch:[0/2](419100/4588595) loss:3.103 lr:0.0000100 epoch_Time:26437.0min: [2024-01-04 13:57:06,776][model8_pretrain.py][INFO] Epoch:[0/2](419100/4588595) loss:2.617 lr:0.0000100 epoch_Time:26437.0min: [2024-01-04 13:57:06,776][model8_pretrain.py][INFO] Epoch:[0/2](419100/4588595) loss:2.491 lr:0.0000100 epoch_Time:26437.0min: [2024-01-04 13:57:06,777][model8_pretrain.py][INFO] Epoch:[0/2](419100/4588595) loss:2.953 lr:0.0000100 epoch_Time:26437.0min: [2024-01-04 13:57:06,777][model8_pretrain.py][INFO] Epoch:[0/2](419100/4588595) loss:2.754 lr:0.0000100 epoch_Time:26437.0min: [2024-01-04 13:57:06,777][model8_pretrain.py][INFO] Epoch:[0/2](419100/4588595) loss:2.903 lr:0.0000100 epoch_Time:26437.0min: [2024-01-04 13:57:06,777][model8_pretrain.py][INFO] Epoch:[0/2](419100/4588595) loss:3.104 lr:0.0000100 epoch_Time:26437.0min: [2024-01-04 13:57:06,777][model8_pretrain.py][INFO] Epoch:[0/2](419100/4588595) loss:2.733 lr:0.0000100 epoch_Time:26437.0min: [2024-01-04 13:57:43,722][model8_pretrain.py][INFO] Epoch:[0/2](419200/4588595) loss:2.939 lr:0.0000100 epoch_Time:26436.0min: [2024-01-04 13:57:43,722][model8_pretrain.py][INFO] Epoch:[0/2](419200/4588595) loss:2.991 lr:0.0000100 epoch_Time:26436.0min: [2024-01-04 13:57:43,722][model8_pretrain.py][INFO] Epoch:[0/2](419200/4588595) loss:2.594 lr:0.0000100 epoch_Time:26436.0min: [2024-01-04 13:57:43,722][model8_pretrain.py][INFO] Epoch:[0/2](419200/4588595) loss:2.898 lr:0.0000100 epoch_Time:26436.0min: [2024-01-04 13:57:43,723][model8_pretrain.py][INFO] Epoch:[0/2](419200/4588595) loss:2.817 lr:0.0000100 epoch_Time:26436.0min: [2024-01-04 13:57:43,722][model8_pretrain.py][INFO] Epoch:[0/2](419200/4588595) loss:3.029 lr:0.0000100 epoch_Time:26436.0min: [2024-01-04 13:57:43,723][model8_pretrain.py][INFO] Epoch:[0/2](419200/4588595) loss:2.798 lr:0.0000100 epoch_Time:26436.0min: [2024-01-04 13:57:43,723][model8_pretrain.py][INFO] Epoch:[0/2](419200/4588595) loss:3.125 lr:0.0000100 epoch_Time:26436.0min: [2024-01-04 
13:58:30,993][model8_pretrain.py][INFO] Epoch:[0/2](419300/4588595) loss:3.485 lr:0.0000100 epoch_Time:26437.0min: [2024-01-04 13:58:30,993][model8_pretrain.py][INFO] Epoch:[0/2](419300/4588595) loss:2.971 lr:0.0000100 epoch_Time:26437.0min: [2024-01-04 13:58:30,993][model8_pretrain.py][INFO] Epoch:[0/2](419300/4588595) loss:2.259 lr:0.0000100 epoch_Time:26437.0min: [2024-01-04 13:58:30,993][model8_pretrain.py][INFO] Epoch:[0/2](419300/4588595) loss:2.843 lr:0.0000100 epoch_Time:26437.0min: [2024-01-04 13:58:30,993][model8_pretrain.py][INFO] Epoch:[0/2](419300/4588595) loss:2.748 lr:0.0000100 epoch_Time:26437.0min: [2024-01-04 13:58:30,993][model8_pretrain.py][INFO] Epoch:[0/2](419300/4588595) loss:2.856 lr:0.0000100 epoch_Time:26437.0min: [2024-01-04 13:58:30,993][model8_pretrain.py][INFO] Epoch:[0/2](419300/4588595) loss:2.034 lr:0.0000100 epoch_Time:26437.0min: [2024-01-04 13:58:30,993][model8_pretrain.py][INFO] Epoch:[0/2](419300/4588595) loss:3.149 lr:0.0000100 epoch_Time:26437.0min: [2024-01-04 13:59:07,940][model8_pretrain.py][INFO] Epoch:[0/2](419400/4588595) loss:2.428 lr:0.0000100 epoch_Time:26436.0min: [2024-01-04 13:59:07,940][model8_pretrain.py][INFO] Epoch:[0/2](419400/4588595) loss:2.854 lr:0.0000100 epoch_Time:26436.0min: [2024-01-04 13:59:07,940][model8_pretrain.py][INFO] Epoch:[0/2](419400/4588595) loss:2.840 lr:0.0000100 epoch_Time:26436.0min: [2024-01-04 13:59:07,940][model8_pretrain.py][INFO] Epoch:[0/2](419400/4588595) loss:2.699 lr:0.0000100 epoch_Time:26436.0min: [2024-01-04 13:59:07,940][model8_pretrain.py][INFO] Epoch:[0/2](419400/4588595) loss:2.767 lr:0.0000100 epoch_Time:26436.0min: [2024-01-04 13:59:07,940][model8_pretrain.py][INFO] Epoch:[0/2](419400/4588595) loss:2.698 lr:0.0000100 epoch_Time:26436.0min: [2024-01-04 13:59:07,940][model8_pretrain.py][INFO] Epoch:[0/2](419400/4588595) loss:2.504 lr:0.0000100 epoch_Time:26436.0min: [2024-01-04 13:59:07,940][model8_pretrain.py][INFO] Epoch:[0/2](419400/4588595) loss:2.799 lr:0.0000100 
epoch_Time:26436.0min: [2024-01-04 13:59:44,896][model8_pretrain.py][INFO] Epoch:[0/2](419500/4588595) loss:2.537 lr:0.0000100 epoch_Time:26436.0min: [2024-01-04 13:59:44,896][model8_pretrain.py][INFO] Epoch:[0/2](419500/4588595) loss:3.312 lr:0.0000100 epoch_Time:26436.0min: [2024-01-04 13:59:44,896][model8_pretrain.py][INFO] Epoch:[0/2](419500/4588595) loss:2.727 lr:0.0000100 epoch_Time:26436.0min: [2024-01-04 13:59:44,896][model8_pretrain.py][INFO] Epoch:[0/2](419500/4588595) loss:3.079 lr:0.0000100 epoch_Time:26436.0min: [2024-01-04 13:59:44,896][model8_pretrain.py][INFO] Epoch:[0/2](419500/4588595) loss:3.237 lr:0.0000100 epoch_Time:26436.0min: [2024-01-04 13:59:44,896][model8_pretrain.py][INFO] Epoch:[0/2](419500/4588595) loss:2.883 lr:0.0000100 epoch_Time:26436.0min: [2024-01-04 13:59:44,896][model8_pretrain.py][INFO] Epoch:[0/2](419500/4588595) loss:2.712 lr:0.0000100 epoch_Time:26436.0min: [2024-01-04 13:59:44,896][model8_pretrain.py][INFO] Epoch:[0/2](419500/4588595) loss:3.289 lr:0.0000100 epoch_Time:26436.0min: [2024-01-04 14:00:21,850][model8_pretrain.py][INFO] Epoch:[0/2](419600/4588595) loss:2.362 lr:0.0000100 epoch_Time:26434.0min: [2024-01-04 14:00:21,850][model8_pretrain.py][INFO] Epoch:[0/2](419600/4588595) loss:3.103 lr:0.0000100 epoch_Time:26434.0min: [2024-01-04 14:00:21,850][model8_pretrain.py][INFO] Epoch:[0/2](419600/4588595) loss:2.372 lr:0.0000100 epoch_Time:26434.0min: [2024-01-04 14:00:21,850][model8_pretrain.py][INFO] Epoch:[0/2](419600/4588595) loss:2.558 lr:0.0000100 epoch_Time:26434.0min: [2024-01-04 14:00:21,850][model8_pretrain.py][INFO] Epoch:[0/2](419600/4588595) loss:3.615 lr:0.0000100 epoch_Time:26434.0min: [2024-01-04 14:00:21,850][model8_pretrain.py][INFO] Epoch:[0/2](419600/4588595) loss:2.864 lr:0.0000100 epoch_Time:26434.0min: [2024-01-04 14:00:21,850][model8_pretrain.py][INFO] Epoch:[0/2](419600/4588595) loss:3.232 lr:0.0000100 epoch_Time:26434.0min: [2024-01-04 14:00:21,851][model8_pretrain.py][INFO] 
Epoch:[0/2](419600/4588595) loss:2.797 lr:0.0000100 epoch_Time:26434.0min: [2024-01-04 14:00:58,800][model8_pretrain.py][INFO] Epoch:[0/2](419700/4588595) loss:3.016 lr:0.0000100 epoch_Time:26433.0min: [2024-01-04 14:00:58,800][model8_pretrain.py][INFO] Epoch:[0/2](419700/4588595) loss:2.450 lr:0.0000100 epoch_Time:26433.0min: [2024-01-04 14:00:58,800][model8_pretrain.py][INFO] Epoch:[0/2](419700/4588595) loss:2.940 lr:0.0000100 epoch_Time:26433.0min: [2024-01-04 14:00:58,800][model8_pretrain.py][INFO] Epoch:[0/2](419700/4588595) loss:2.840 lr:0.0000100 epoch_Time:26433.0min: [2024-01-04 14:00:58,800][model8_pretrain.py][INFO] Epoch:[0/2](419700/4588595) loss:3.303 lr:0.0000100 epoch_Time:26433.0min: [2024-01-04 14:00:58,800][model8_pretrain.py][INFO] Epoch:[0/2](419700/4588595) loss:3.041 lr:0.0000100 epoch_Time:26433.0min: [2024-01-04 14:00:58,800][model8_pretrain.py][INFO] Epoch:[0/2](419700/4588595) loss:3.101 lr:0.0000100 epoch_Time:26433.0min: [2024-01-04 14:00:58,801][model8_pretrain.py][INFO] Epoch:[0/2](419700/4588595) loss:3.114 lr:0.0000100 epoch_Time:26433.0min: [2024-01-04 14:01:35,756][model8_pretrain.py][INFO] Epoch:[0/2](419800/4588595) loss:2.946 lr:0.0000100 epoch_Time:26433.0min: [2024-01-04 14:01:35,756][model8_pretrain.py][INFO] Epoch:[0/2](419800/4588595) loss:2.632 lr:0.0000100 epoch_Time:26433.0min: [2024-01-04 14:01:35,756][model8_pretrain.py][INFO] Epoch:[0/2](419800/4588595) loss:2.459 lr:0.0000100 epoch_Time:26433.0min: [2024-01-04 14:01:35,756][model8_pretrain.py][INFO] Epoch:[0/2](419800/4588595) loss:3.080 lr:0.0000100 epoch_Time:26433.0min: [2024-01-04 14:01:35,756][model8_pretrain.py][INFO] Epoch:[0/2](419800/4588595) loss:2.565 lr:0.0000100 epoch_Time:26433.0min: [2024-01-04 14:01:35,756][model8_pretrain.py][INFO] Epoch:[0/2](419800/4588595) loss:3.248 lr:0.0000100 epoch_Time:26433.0min: [2024-01-04 14:01:35,756][model8_pretrain.py][INFO] Epoch:[0/2](419800/4588595) loss:3.430 lr:0.0000100 epoch_Time:26433.0min: [2024-01-04 
14:01:35,756][model8_pretrain.py][INFO] Epoch:[0/2](419800/4588595) loss:2.846 lr:0.0000100 epoch_Time:26433.0min: [2024-01-04 14:02:12,719][model8_pretrain.py][INFO] Epoch:[0/2](419900/4588595) loss:2.965 lr:0.0000100 epoch_Time:26432.0min: [2024-01-04 14:02:12,719][model8_pretrain.py][INFO] Epoch:[0/2](419900/4588595) loss:2.352 lr:0.0000100 epoch_Time:26432.0min: [2024-01-04 14:02:12,719][model8_pretrain.py][INFO] Epoch:[0/2](419900/4588595) loss:2.955 lr:0.0000100 epoch_Time:26432.0min: [2024-01-04 14:02:12,719][model8_pretrain.py][INFO] Epoch:[0/2](419900/4588595) loss:3.309 lr:0.0000100 epoch_Time:26432.0min: [2024-01-04 14:02:12,719][model8_pretrain.py][INFO] Epoch:[0/2](419900/4588595) loss:2.559 lr:0.0000100 epoch_Time:26432.0min: [2024-01-04 14:02:12,719][model8_pretrain.py][INFO] Epoch:[0/2](419900/4588595) loss:2.979 lr:0.0000100 epoch_Time:26432.0min: [2024-01-04 14:02:12,719][model8_pretrain.py][INFO] Epoch:[0/2](419900/4588595) loss:3.291 lr:0.0000100 epoch_Time:26432.0min: [2024-01-04 14:02:12,719][model8_pretrain.py][INFO] Epoch:[0/2](419900/4588595) loss:2.788 lr:0.0000100 epoch_Time:26432.0min: [2024-01-04 14:02:49,680][model8_pretrain.py][INFO] Epoch:[0/2](420000/4588595) loss:3.202 lr:0.0000100 epoch_Time:26431.0min: [2024-01-04 14:02:49,680][model8_pretrain.py][INFO] Epoch:[0/2](420000/4588595) loss:2.863 lr:0.0000100 epoch_Time:26431.0min: [2024-01-04 14:02:49,680][model8_pretrain.py][INFO] Epoch:[0/2](420000/4588595) loss:2.862 lr:0.0000100 epoch_Time:26431.0min: [2024-01-04 14:02:49,680][model8_pretrain.py][INFO] Epoch:[0/2](420000/4588595) loss:3.004 lr:0.0000100 epoch_Time:26431.0min: [2024-01-04 14:02:49,680][model8_pretrain.py][INFO] Epoch:[0/2](420000/4588595) loss:3.080 lr:0.0000100 epoch_Time:26431.0min: [2024-01-04 14:02:49,681][model8_pretrain.py][INFO] Epoch:[0/2](420000/4588595) loss:2.666 lr:0.0000100 epoch_Time:26431.0min: [2024-01-04 14:02:49,681][model8_pretrain.py][INFO] Epoch:[0/2](420000/4588595) loss:2.316 lr:0.0000100 
epoch_Time:26431.0min: [2024-01-04 14:02:49,681][model8_pretrain.py][INFO] Epoch:[0/2](420000/4588595) loss:2.931 lr:0.0000100 epoch_Time:26431.0min: [2024-01-04 14:03:36,518][model8_pretrain.py][INFO] Epoch:[0/2](420100/4588595) loss:2.240 lr:0.0000100 epoch_Time:26432.0min: [2024-01-04 14:03:36,518][model8_pretrain.py][INFO] Epoch:[0/2](420100/4588595) loss:2.984 lr:0.0000100 epoch_Time:26432.0min: [2024-01-04 14:03:36,518][model8_pretrain.py][INFO] Epoch:[0/2](420100/4588595) loss:3.129 lr:0.0000100 epoch_Time:26432.0min: [2024-01-04 14:03:36,518][model8_pretrain.py][INFO] Epoch:[0/2](420100/4588595) loss:2.759 lr:0.0000100 epoch_Time:26432.0min: [2024-01-04 14:03:36,518][model8_pretrain.py][INFO] Epoch:[0/2](420100/4588595) loss:2.957 lr:0.0000100 epoch_Time:26432.0min: [2024-01-04 14:03:36,518][model8_pretrain.py][INFO] Epoch:[0/2](420100/4588595) loss:3.532 lr:0.0000100 epoch_Time:26432.0min: [2024-01-04 14:03:36,518][model8_pretrain.py][INFO] Epoch:[0/2](420100/4588595) loss:3.178 lr:0.0000100 epoch_Time:26432.0min: [2024-01-04 14:03:36,518][model8_pretrain.py][INFO] Epoch:[0/2](420100/4588595) loss:3.091 lr:0.0000100 epoch_Time:26432.0min: [2024-01-04 14:04:13,445][model8_pretrain.py][INFO] Epoch:[0/2](420200/4588595) loss:2.677 lr:0.0000100 epoch_Time:26431.0min: [2024-01-04 14:04:13,445][model8_pretrain.py][INFO] Epoch:[0/2](420200/4588595) loss:2.861 lr:0.0000100 epoch_Time:26431.0min: [2024-01-04 14:04:13,445][model8_pretrain.py][INFO] Epoch:[0/2](420200/4588595) loss:2.779 lr:0.0000100 epoch_Time:26431.0min: [2024-01-04 14:04:13,445][model8_pretrain.py][INFO] Epoch:[0/2](420200/4588595) loss:3.784 lr:0.0000100 epoch_Time:26431.0min: [2024-01-04 14:04:13,445][model8_pretrain.py][INFO] Epoch:[0/2](420200/4588595) loss:2.876 lr:0.0000100 epoch_Time:26431.0min: [2024-01-04 14:04:13,445][model8_pretrain.py][INFO] Epoch:[0/2](420200/4588595) loss:2.385 lr:0.0000100 epoch_Time:26431.0min: [2024-01-04 14:04:13,445][model8_pretrain.py][INFO] 
Epoch:[0/2](420200/4588595) loss:2.552 lr:0.0000100 epoch_Time:26431.0min: [2024-01-04 14:04:13,446][model8_pretrain.py][INFO] Epoch:[0/2](420200/4588595) loss:2.501 lr:0.0000100 epoch_Time:26431.0min: [2024-01-04 14:04:50,385][model8_pretrain.py][INFO] Epoch:[0/2](420300/4588595) loss:3.039 lr:0.0000100 epoch_Time:26430.0min: [2024-01-04 14:04:50,385][model8_pretrain.py][INFO] Epoch:[0/2](420300/4588595) loss:2.409 lr:0.0000100 epoch_Time:26430.0min: [2024-01-04 14:04:50,385][model8_pretrain.py][INFO] Epoch:[0/2](420300/4588595) loss:2.304 lr:0.0000100 epoch_Time:26430.0min: [2024-01-04 14:04:50,385][model8_pretrain.py][INFO] Epoch:[0/2](420300/4588595) loss:2.982 lr:0.0000100 epoch_Time:26430.0min: [2024-01-04 14:04:50,385][model8_pretrain.py][INFO] Epoch:[0/2](420300/4588595) loss:3.075 lr:0.0000100 epoch_Time:26430.0min: [2024-01-04 14:04:50,385][model8_pretrain.py][INFO] Epoch:[0/2](420300/4588595) loss:2.723 lr:0.0000100 epoch_Time:26430.0min: [2024-01-04 14:04:50,385][model8_pretrain.py][INFO] Epoch:[0/2](420300/4588595) loss:3.197 lr:0.0000100 epoch_Time:26430.0min: [2024-01-04 14:04:50,385][model8_pretrain.py][INFO] Epoch:[0/2](420300/4588595) loss:3.045 lr:0.0000100 epoch_Time:26430.0min: [2024-01-04 14:05:27,322][model8_pretrain.py][INFO] Epoch:[0/2](420400/4588595) loss:3.184 lr:0.0000100 epoch_Time:26430.0min: [2024-01-04 14:05:27,322][model8_pretrain.py][INFO] Epoch:[0/2](420400/4588595) loss:2.919 lr:0.0000100 epoch_Time:26430.0min: [2024-01-04 14:05:27,322][model8_pretrain.py][INFO] Epoch:[0/2](420400/4588595) loss:2.420 lr:0.0000100 epoch_Time:26430.0min: [2024-01-04 14:05:27,322][model8_pretrain.py][INFO] Epoch:[0/2](420400/4588595) loss:2.755 lr:0.0000100 epoch_Time:26430.0min: [2024-01-04 14:05:27,322][model8_pretrain.py][INFO] Epoch:[0/2](420400/4588595) loss:2.213 lr:0.0000100 epoch_Time:26430.0min: [2024-01-04 14:05:27,322][model8_pretrain.py][INFO] Epoch:[0/2](420400/4588595) loss:2.670 lr:0.0000100 epoch_Time:26430.0min: [2024-01-04 
14:05:27,322][model8_pretrain.py][INFO] Epoch:[0/2](420400/4588595) loss:3.051 lr:0.0000100 epoch_Time:26430.0min: [2024-01-04 14:05:27,322][model8_pretrain.py][INFO] Epoch:[0/2](420400/4588595) loss:3.007 lr:0.0000100 epoch_Time:26430.0min: [2024-01-04 14:06:04,259][model8_pretrain.py][INFO] Epoch:[0/2](420500/4588595) loss:3.270 lr:0.0000100 epoch_Time:26428.0min: [2024-01-04 14:06:04,259][model8_pretrain.py][INFO] Epoch:[0/2](420500/4588595) loss:2.706 lr:0.0000100 epoch_Time:26428.0min: [2024-01-04 14:06:04,259][model8_pretrain.py][INFO] Epoch:[0/2](420500/4588595) loss:2.659 lr:0.0000100 epoch_Time:26428.0min: [2024-01-04 14:06:04,259][model8_pretrain.py][INFO] Epoch:[0/2](420500/4588595) loss:3.047 lr:0.0000100 epoch_Time:26428.0min: [2024-01-04 14:06:04,260][model8_pretrain.py][INFO] Epoch:[0/2](420500/4588595) loss:2.734 lr:0.0000100 epoch_Time:26428.0min: [2024-01-04 14:06:04,260][model8_pretrain.py][INFO] Epoch:[0/2](420500/4588595) loss:2.546 lr:0.0000100 epoch_Time:26428.0min: [2024-01-04 14:06:04,260][model8_pretrain.py][INFO] Epoch:[0/2](420500/4588595) loss:2.980 lr:0.0000100 epoch_Time:26428.0min: [2024-01-04 14:06:04,260][model8_pretrain.py][INFO] Epoch:[0/2](420500/4588595) loss:3.140 lr:0.0000100 epoch_Time:26428.0min: [2024-01-04 14:06:41,193][model8_pretrain.py][INFO] Epoch:[0/2](420600/4588595) loss:3.216 lr:0.0000100 epoch_Time:26428.0min: [2024-01-04 14:06:41,193][model8_pretrain.py][INFO] Epoch:[0/2](420600/4588595) loss:2.575 lr:0.0000100 epoch_Time:26428.0min: [2024-01-04 14:06:41,193][model8_pretrain.py][INFO] Epoch:[0/2](420600/4588595) loss:3.133 lr:0.0000100 epoch_Time:26428.0min: [2024-01-04 14:06:41,193][model8_pretrain.py][INFO] Epoch:[0/2](420600/4588595) loss:2.922 lr:0.0000100 epoch_Time:26428.0min: [2024-01-04 14:06:41,193][model8_pretrain.py][INFO] Epoch:[0/2](420600/4588595) loss:3.003 lr:0.0000100 epoch_Time:26428.0min: [2024-01-04 14:06:41,193][model8_pretrain.py][INFO] Epoch:[0/2](420600/4588595) loss:2.868 lr:0.0000100 
epoch_Time:26428.0min: [2024-01-04 14:06:41,193][model8_pretrain.py][INFO] Epoch:[0/2](420600/4588595) loss:2.497 lr:0.0000100 epoch_Time:26428.0min: [2024-01-04 14:06:41,193][model8_pretrain.py][INFO] Epoch:[0/2](420600/4588595) loss:3.170 lr:0.0000100 epoch_Time:26428.0min: [2024-01-04 14:07:18,126][model8_pretrain.py][INFO] Epoch:[0/2](420700/4588595) loss:2.299 lr:0.0000100 epoch_Time:26427.0min: [2024-01-04 14:07:18,126][model8_pretrain.py][INFO] Epoch:[0/2](420700/4588595) loss:2.468 lr:0.0000100 epoch_Time:26427.0min: [2024-01-04 14:07:18,126][model8_pretrain.py][INFO] Epoch:[0/2](420700/4588595) loss:2.441 lr:0.0000100 epoch_Time:26427.0min: [2024-01-04 14:07:18,126][model8_pretrain.py][INFO] Epoch:[0/2](420700/4588595) loss:1.731 lr:0.0000100 epoch_Time:26427.0min: [2024-01-04 14:07:18,126][model8_pretrain.py][INFO] Epoch:[0/2](420700/4588595) loss:3.564 lr:0.0000100 epoch_Time:26427.0min: [2024-01-04 14:07:18,126][model8_pretrain.py][INFO] Epoch:[0/2](420700/4588595) loss:3.332 lr:0.0000100 epoch_Time:26427.0min: [2024-01-04 14:07:18,126][model8_pretrain.py][INFO] Epoch:[0/2](420700/4588595) loss:3.146 lr:0.0000100 epoch_Time:26427.0min: [2024-01-04 14:07:18,127][model8_pretrain.py][INFO] Epoch:[0/2](420700/4588595) loss:2.803 lr:0.0000100 epoch_Time:26427.0min: [2024-01-04 14:07:55,080][model8_pretrain.py][INFO] Epoch:[0/2](420800/4588595) loss:2.976 lr:0.0000100 epoch_Time:26426.0min: [2024-01-04 14:07:55,080][model8_pretrain.py][INFO] Epoch:[0/2](420800/4588595) loss:3.000 lr:0.0000100 epoch_Time:26426.0min: [2024-01-04 14:07:55,080][model8_pretrain.py][INFO] Epoch:[0/2](420800/4588595) loss:3.089 lr:0.0000100 epoch_Time:26426.0min: [2024-01-04 14:07:55,080][model8_pretrain.py][INFO] Epoch:[0/2](420800/4588595) loss:2.639 lr:0.0000100 epoch_Time:26426.0min: [2024-01-04 14:07:55,081][model8_pretrain.py][INFO] Epoch:[0/2](420800/4588595) loss:3.121 lr:0.0000100 epoch_Time:26426.0min: [2024-01-04 14:07:55,081][model8_pretrain.py][INFO] 
Epoch:[0/2](420800/4588595) loss:2.561 lr:0.0000100 epoch_Time:26426.0min: [2024-01-04 14:07:55,081][model8_pretrain.py][INFO] Epoch:[0/2](420800/4588595) loss:2.770 lr:0.0000100 epoch_Time:26426.0min: [2024-01-04 14:07:55,081][model8_pretrain.py][INFO] Epoch:[0/2](420800/4588595) loss:2.882 lr:0.0000100 epoch_Time:26426.0min: [2024-01-04 14:08:41,838][model8_pretrain.py][INFO] Epoch:[0/2](420900/4588595) loss:2.788 lr:0.0000100 epoch_Time:26427.0min: [2024-01-04 14:08:41,838][model8_pretrain.py][INFO] Epoch:[0/2](420900/4588595) loss:2.545 lr:0.0000100 epoch_Time:26427.0min: [2024-01-04 14:08:41,838][model8_pretrain.py][INFO] Epoch:[0/2](420900/4588595) loss:2.628 lr:0.0000100 epoch_Time:26427.0min: [2024-01-04 14:08:41,838][model8_pretrain.py][INFO] Epoch:[0/2](420900/4588595) loss:2.654 lr:0.0000100 epoch_Time:26427.0min: [2024-01-04 14:08:41,838][model8_pretrain.py][INFO] Epoch:[0/2](420900/4588595) loss:3.120 lr:0.0000100 epoch_Time:26427.0min: [2024-01-04 14:08:41,838][model8_pretrain.py][INFO] Epoch:[0/2](420900/4588595) loss:2.580 lr:0.0000100 epoch_Time:26427.0min: [2024-01-04 14:08:41,838][model8_pretrain.py][INFO] Epoch:[0/2](420900/4588595) loss:2.947 lr:0.0000100 epoch_Time:26427.0min: [2024-01-04 14:08:41,840][model8_pretrain.py][INFO] Epoch:[0/2](420900/4588595) loss:2.667 lr:0.0000100 epoch_Time:26427.0min: [2024-01-04 14:09:18,783][model8_pretrain.py][INFO] Epoch:[0/2](421000/4588595) loss:3.009 lr:0.0000100 epoch_Time:26426.0min: [2024-01-04 14:09:18,783][model8_pretrain.py][INFO] Epoch:[0/2](421000/4588595) loss:3.350 lr:0.0000100 epoch_Time:26426.0min: [2024-01-04 14:09:18,783][model8_pretrain.py][INFO] Epoch:[0/2](421000/4588595) loss:2.776 lr:0.0000100 epoch_Time:26426.0min: [2024-01-04 14:09:18,783][model8_pretrain.py][INFO] Epoch:[0/2](421000/4588595) loss:2.945 lr:0.0000100 epoch_Time:26426.0min: [2024-01-04 14:09:18,783][model8_pretrain.py][INFO] Epoch:[0/2](421000/4588595) loss:2.322 lr:0.0000100 epoch_Time:26426.0min: [2024-01-04 
14:09:18,783][model8_pretrain.py][INFO] Epoch:[0/2](421000/4588595) loss:3.490 lr:0.0000100 epoch_Time:26426.0min: [2024-01-04 14:09:18,783][model8_pretrain.py][INFO] Epoch:[0/2](421000/4588595) loss:2.885 lr:0.0000100 epoch_Time:26426.0min: [2024-01-04 14:09:18,783][model8_pretrain.py][INFO] Epoch:[0/2](421000/4588595) loss:2.255 lr:0.0000100 epoch_Time:26426.0min: [2024-01-04 14:09:55,732][model8_pretrain.py][INFO] Epoch:[0/2](421100/4588595) loss:2.827 lr:0.0000100 epoch_Time:26425.0min: [2024-01-04 14:09:55,732][model8_pretrain.py][INFO] Epoch:[0/2](421100/4588595) loss:2.909 lr:0.0000100 epoch_Time:26425.0min: [2024-01-04 14:09:55,732][model8_pretrain.py][INFO] Epoch:[0/2](421100/4588595) loss:2.905 lr:0.0000100 epoch_Time:26425.0min: [2024-01-04 14:09:55,732][model8_pretrain.py][INFO] Epoch:[0/2](421100/4588595) loss:3.004 lr:0.0000100 epoch_Time:26425.0min: [2024-01-04 14:09:55,732][model8_pretrain.py][INFO] Epoch:[0/2](421100/4588595) loss:2.625 lr:0.0000100 epoch_Time:26425.0min: [2024-01-04 14:09:55,732][model8_pretrain.py][INFO] Epoch:[0/2](421100/4588595) loss:2.810 lr:0.0000100 epoch_Time:26425.0min: [2024-01-04 14:09:55,732][model8_pretrain.py][INFO] Epoch:[0/2](421100/4588595) loss:2.658 lr:0.0000100 epoch_Time:26425.0min: [2024-01-04 14:09:55,732][model8_pretrain.py][INFO] Epoch:[0/2](421100/4588595) loss:2.719 lr:0.0000100 epoch_Time:26425.0min: [2024-01-04 14:10:32,688][model8_pretrain.py][INFO] Epoch:[0/2](421200/4588595) loss:3.206 lr:0.0000100 epoch_Time:26425.0min: [2024-01-04 14:10:32,688][model8_pretrain.py][INFO] Epoch:[0/2](421200/4588595) loss:3.200 lr:0.0000100 epoch_Time:26425.0min: [2024-01-04 14:10:32,688][model8_pretrain.py][INFO] Epoch:[0/2](421200/4588595) loss:2.851 lr:0.0000100 epoch_Time:26425.0min: [2024-01-04 14:10:32,688][model8_pretrain.py][INFO] Epoch:[0/2](421200/4588595) loss:2.976 lr:0.0000100 epoch_Time:26425.0min: [2024-01-04 14:10:32,688][model8_pretrain.py][INFO] Epoch:[0/2](421200/4588595) loss:2.840 lr:0.0000100 
epoch_Time:26425.0min: [2024-01-04 14:10:32,688][model8_pretrain.py][INFO] Epoch:[0/2](421200/4588595) loss:3.000 lr:0.0000100 epoch_Time:26425.0min: [2024-01-04 14:10:32,689][model8_pretrain.py][INFO] Epoch:[0/2](421200/4588595) loss:2.415 lr:0.0000100 epoch_Time:26425.0min: [2024-01-04 14:10:32,689][model8_pretrain.py][INFO] Epoch:[0/2](421200/4588595) loss:3.448 lr:0.0000100 epoch_Time:26425.0min: [2024-01-04 14:11:09,648][model8_pretrain.py][INFO] Epoch:[0/2](421300/4588595) loss:2.973 lr:0.0000100 epoch_Time:26424.0min: [2024-01-04 14:11:09,648][model8_pretrain.py][INFO] Epoch:[0/2](421300/4588595) loss:3.061 lr:0.0000100 epoch_Time:26424.0min: [2024-01-04 14:11:09,648][model8_pretrain.py][INFO] Epoch:[0/2](421300/4588595) loss:2.834 lr:0.0000100 epoch_Time:26424.0min: [2024-01-04 14:11:09,648][model8_pretrain.py][INFO] Epoch:[0/2](421300/4588595) loss:1.834 lr:0.0000100 epoch_Time:26424.0min: [2024-01-04 14:11:09,648][model8_pretrain.py][INFO] Epoch:[0/2](421300/4588595) loss:2.762 lr:0.0000100 epoch_Time:26424.0min: [2024-01-04 14:11:09,648][model8_pretrain.py][INFO] Epoch:[0/2](421300/4588595) loss:3.465 lr:0.0000100 epoch_Time:26424.0min: [2024-01-04 14:11:09,649][model8_pretrain.py][INFO] Epoch:[0/2](421300/4588595) loss:2.786 lr:0.0000100 epoch_Time:26424.0min: [2024-01-04 14:11:09,649][model8_pretrain.py][INFO] Epoch:[0/2](421300/4588595) loss:3.059 lr:0.0000100 epoch_Time:26424.0min: [2024-01-04 14:11:46,595][model8_pretrain.py][INFO] Epoch:[0/2](421400/4588595) loss:2.801 lr:0.0000100 epoch_Time:26423.0min: [2024-01-04 14:11:46,595][model8_pretrain.py][INFO] Epoch:[0/2](421400/4588595) loss:2.816 lr:0.0000100 epoch_Time:26423.0min: [2024-01-04 14:11:46,595][model8_pretrain.py][INFO] Epoch:[0/2](421400/4588595) loss:2.830 lr:0.0000100 epoch_Time:26423.0min: [2024-01-04 14:11:46,595][model8_pretrain.py][INFO] Epoch:[0/2](421400/4588595) loss:3.264 lr:0.0000100 epoch_Time:26423.0min: [2024-01-04 14:11:46,595][model8_pretrain.py][INFO] 
Epoch:[0/2](421400/4588595) loss:2.418 lr:0.0000100 epoch_Time:26423.0min: [2024-01-04 14:11:46,595][model8_pretrain.py][INFO] Epoch:[0/2](421400/4588595) loss:2.779 lr:0.0000100 epoch_Time:26423.0min: [2024-01-04 14:11:46,595][model8_pretrain.py][INFO] Epoch:[0/2](421400/4588595) loss:3.075 lr:0.0000100 epoch_Time:26423.0min: [2024-01-04 14:11:46,595][model8_pretrain.py][INFO] Epoch:[0/2](421400/4588595) loss:3.086 lr:0.0000100 epoch_Time:26423.0min: [2024-01-04 14:12:23,548][model8_pretrain.py][INFO] Epoch:[0/2](421500/4588595) loss:2.824 lr:0.0000100 epoch_Time:26422.0min: [2024-01-04 14:12:23,548][model8_pretrain.py][INFO] Epoch:[0/2](421500/4588595) loss:2.564 lr:0.0000100 epoch_Time:26422.0min: [2024-01-04 14:12:23,548][model8_pretrain.py][INFO] Epoch:[0/2](421500/4588595) loss:3.122 lr:0.0000100 epoch_Time:26422.0min: [2024-01-04 14:12:23,548][model8_pretrain.py][INFO] Epoch:[0/2](421500/4588595) loss:2.668 lr:0.0000100 epoch_Time:26422.0min: [2024-01-04 14:12:23,548][model8_pretrain.py][INFO] Epoch:[0/2](421500/4588595) loss:3.424 lr:0.0000100 epoch_Time:26422.0min: [2024-01-04 14:12:23,548][model8_pretrain.py][INFO] Epoch:[0/2](421500/4588595) loss:2.994 lr:0.0000100 epoch_Time:26422.0min: [2024-01-04 14:12:23,548][model8_pretrain.py][INFO] Epoch:[0/2](421500/4588595) loss:3.265 lr:0.0000100 epoch_Time:26422.0min: [2024-01-04 14:12:23,548][model8_pretrain.py][INFO] Epoch:[0/2](421500/4588595) loss:2.347 lr:0.0000100 epoch_Time:26422.0min: [2024-01-04 14:13:00,501][model8_pretrain.py][INFO] Epoch:[0/2](421600/4588595) loss:2.712 lr:0.0000100 epoch_Time:26421.0min: [2024-01-04 14:13:00,501][model8_pretrain.py][INFO] Epoch:[0/2](421600/4588595) loss:3.106 lr:0.0000100 epoch_Time:26421.0min: [2024-01-04 14:13:00,501][model8_pretrain.py][INFO] Epoch:[0/2](421600/4588595) loss:2.707 lr:0.0000100 epoch_Time:26421.0min: [2024-01-04 14:13:00,501][model8_pretrain.py][INFO] Epoch:[0/2](421600/4588595) loss:3.153 lr:0.0000100 epoch_Time:26421.0min: [2024-01-04 
14:13:00,501][model8_pretrain.py][INFO] Epoch:[0/2](421600/4588595) loss:2.659 lr:0.0000100 epoch_Time:26421.0min: [2024-01-04 14:13:00,501][model8_pretrain.py][INFO] Epoch:[0/2](421600/4588595) loss:3.065 lr:0.0000100 epoch_Time:26421.0min: [2024-01-04 14:13:00,501][model8_pretrain.py][INFO] Epoch:[0/2](421600/4588595) loss:2.841 lr:0.0000100 epoch_Time:26421.0min: [2024-01-04 14:13:00,501][model8_pretrain.py][INFO] Epoch:[0/2](421600/4588595) loss:2.285 lr:0.0000100 epoch_Time:26421.0min: [2024-01-04 14:13:47,117][model8_pretrain.py][INFO] Epoch:[0/2](421700/4588595) loss:2.951 lr:0.0000100 epoch_Time:26423.0min: [2024-01-04 14:13:47,117][model8_pretrain.py][INFO] Epoch:[0/2](421700/4588595) loss:3.132 lr:0.0000100 epoch_Time:26423.0min: [2024-01-04 14:13:47,117][model8_pretrain.py][INFO] Epoch:[0/2](421700/4588595) loss:3.021 lr:0.0000100 epoch_Time:26423.0min: [2024-01-04 14:13:47,117][model8_pretrain.py][INFO] Epoch:[0/2](421700/4588595) loss:2.604 lr:0.0000100 epoch_Time:26423.0min: [2024-01-04 14:13:47,117][model8_pretrain.py][INFO] Epoch:[0/2](421700/4588595) loss:2.944 lr:0.0000100 epoch_Time:26423.0min: [2024-01-04 14:13:47,117][model8_pretrain.py][INFO] Epoch:[0/2](421700/4588595) loss:3.201 lr:0.0000100 epoch_Time:26423.0min: [2024-01-04 14:13:47,117][model8_pretrain.py][INFO] Epoch:[0/2](421700/4588595) loss:2.890 lr:0.0000100 epoch_Time:26423.0min: [2024-01-04 14:13:47,118][model8_pretrain.py][INFO] Epoch:[0/2](421700/4588595) loss:2.771 lr:0.0000100 epoch_Time:26423.0min: [2024-01-04 14:14:24,060][model8_pretrain.py][INFO] Epoch:[0/2](421800/4588595) loss:3.126 lr:0.0000100 epoch_Time:26421.0min: [2024-01-04 14:14:24,061][model8_pretrain.py][INFO] Epoch:[0/2](421800/4588595) loss:3.214 lr:0.0000100 epoch_Time:26421.0min: [2024-01-04 14:14:24,061][model8_pretrain.py][INFO] Epoch:[0/2](421800/4588595) loss:2.900 lr:0.0000100 epoch_Time:26421.0min: [2024-01-04 14:14:24,061][model8_pretrain.py][INFO] Epoch:[0/2](421800/4588595) loss:2.554 lr:0.0000100 
epoch_Time:26421.0min: [2024-01-04 14:14:24,061][model8_pretrain.py][INFO] Epoch:[0/2](421800/4588595) loss:3.044 lr:0.0000100 epoch_Time:26421.0min: [2024-01-04 14:14:24,061][model8_pretrain.py][INFO] Epoch:[0/2](421800/4588595) loss:2.626 lr:0.0000100 epoch_Time:26421.0min: [2024-01-04 14:14:24,061][model8_pretrain.py][INFO] Epoch:[0/2](421800/4588595) loss:2.962 lr:0.0000100 epoch_Time:26421.0min: [2024-01-04 14:14:24,061][model8_pretrain.py][INFO] Epoch:[0/2](421800/4588595) loss:3.064 lr:0.0000100 epoch_Time:26421.0min: [2024-01-04 14:15:01,023][model8_pretrain.py][INFO] Epoch:[0/2](421900/4588595) loss:3.144 lr:0.0000100 epoch_Time:26420.0min: [2024-01-04 14:15:01,023][model8_pretrain.py][INFO] Epoch:[0/2](421900/4588595) loss:2.781 lr:0.0000100 epoch_Time:26420.0min: [2024-01-04 14:15:01,023][model8_pretrain.py][INFO] Epoch:[0/2](421900/4588595) loss:3.191 lr:0.0000100 epoch_Time:26420.0min: [2024-01-04 14:15:01,023][model8_pretrain.py][INFO] Epoch:[0/2](421900/4588595) loss:2.862 lr:0.0000100 epoch_Time:26420.0min: [2024-01-04 14:15:01,023][model8_pretrain.py][INFO] Epoch:[0/2](421900/4588595) loss:2.827 lr:0.0000100 epoch_Time:26420.0min: [2024-01-04 14:15:01,023][model8_pretrain.py][INFO] Epoch:[0/2](421900/4588595) loss:2.685 lr:0.0000100 epoch_Time:26420.0min: [2024-01-04 14:15:01,023][model8_pretrain.py][INFO] Epoch:[0/2](421900/4588595) loss:2.951 lr:0.0000100 epoch_Time:26420.0min: [2024-01-04 14:15:01,023][model8_pretrain.py][INFO] Epoch:[0/2](421900/4588595) loss:2.914 lr:0.0000100 epoch_Time:26420.0min: [2024-01-04 14:15:37,979][model8_pretrain.py][INFO] Epoch:[0/2](422000/4588595) loss:3.148 lr:0.0000100 epoch_Time:26420.0min: [2024-01-04 14:15:37,979][model8_pretrain.py][INFO] Epoch:[0/2](422000/4588595) loss:2.924 lr:0.0000100 epoch_Time:26420.0min: [2024-01-04 14:15:37,979][model8_pretrain.py][INFO] Epoch:[0/2](422000/4588595) loss:2.994 lr:0.0000100 epoch_Time:26420.0min: [2024-01-04 14:15:37,979][model8_pretrain.py][INFO] 
Epoch:[0/2](422000/4588595) loss:2.448 lr:0.0000100 epoch_Time:26420.0min: [2024-01-04 14:15:37,979][model8_pretrain.py][INFO] Epoch:[0/2](422000/4588595) loss:2.475 lr:0.0000100 epoch_Time:26420.0min: [2024-01-04 14:15:37,979][model8_pretrain.py][INFO] Epoch:[0/2](422000/4588595) loss:3.341 lr:0.0000100 epoch_Time:26420.0min: [2024-01-04 14:15:37,979][model8_pretrain.py][INFO] Epoch:[0/2](422000/4588595) loss:2.961 lr:0.0000100 epoch_Time:26420.0min: [2024-01-04 14:15:37,979][model8_pretrain.py][INFO] Epoch:[0/2](422000/4588595) loss:2.495 lr:0.0000100 epoch_Time:26420.0min: [2024-01-04 14:16:14,937][model8_pretrain.py][INFO] Epoch:[0/2](422100/4588595) loss:2.516 lr:0.0000100 epoch_Time:26419.0min: [2024-01-04 14:16:14,937][model8_pretrain.py][INFO] Epoch:[0/2](422100/4588595) loss:3.067 lr:0.0000100 epoch_Time:26419.0min: [2024-01-04 14:16:14,937][model8_pretrain.py][INFO] Epoch:[0/2](422100/4588595) loss:2.836 lr:0.0000100 epoch_Time:26419.0min: [2024-01-04 14:16:14,937][model8_pretrain.py][INFO] Epoch:[0/2](422100/4588595) loss:2.939 lr:0.0000100 epoch_Time:26419.0min: [2024-01-04 14:16:14,937][model8_pretrain.py][INFO] Epoch:[0/2](422100/4588595) loss:2.839 lr:0.0000100 epoch_Time:26419.0min: [2024-01-04 14:16:14,937][model8_pretrain.py][INFO] Epoch:[0/2](422100/4588595) loss:3.002 lr:0.0000100 epoch_Time:26419.0min: [2024-01-04 14:16:14,937][model8_pretrain.py][INFO] Epoch:[0/2](422100/4588595) loss:2.750 lr:0.0000100 epoch_Time:26419.0min: [2024-01-04 14:16:14,937][model8_pretrain.py][INFO] Epoch:[0/2](422100/4588595) loss:2.531 lr:0.0000100 epoch_Time:26419.0min: [2024-01-04 14:16:51,908][model8_pretrain.py][INFO] Epoch:[0/2](422200/4588595) loss:2.999 lr:0.0000100 epoch_Time:26418.0min: [2024-01-04 14:16:51,908][model8_pretrain.py][INFO] Epoch:[0/2](422200/4588595) loss:2.811 lr:0.0000100 epoch_Time:26418.0min: [2024-01-04 14:16:51,908][model8_pretrain.py][INFO] Epoch:[0/2](422200/4588595) loss:3.392 lr:0.0000100 epoch_Time:26418.0min: [2024-01-04 
14:16:51,908][model8_pretrain.py][INFO] Epoch:[0/2](422200/4588595) loss:2.442 lr:0.0000100 epoch_Time:26418.0min: [2024-01-04 14:16:51,908][model8_pretrain.py][INFO] Epoch:[0/2](422200/4588595) loss:3.124 lr:0.0000100 epoch_Time:26418.0min: [2024-01-04 14:16:51,908][model8_pretrain.py][INFO] Epoch:[0/2](422200/4588595) loss:2.697 lr:0.0000100 epoch_Time:26418.0min: [2024-01-04 14:16:51,908][model8_pretrain.py][INFO] Epoch:[0/2](422200/4588595) loss:2.682 lr:0.0000100 epoch_Time:26418.0min: [2024-01-04 14:16:51,909][model8_pretrain.py][INFO] Epoch:[0/2](422200/4588595) loss:2.818 lr:0.0000100 epoch_Time:26418.0min: [2024-01-04 14:17:28,876][model8_pretrain.py][INFO] Epoch:[0/2](422300/4588595) loss:3.108 lr:0.0000100 epoch_Time:26417.0min: [2024-01-04 14:17:28,876][model8_pretrain.py][INFO] Epoch:[0/2](422300/4588595) loss:3.243 lr:0.0000100 epoch_Time:26417.0min: [2024-01-04 14:17:28,876][model8_pretrain.py][INFO] Epoch:[0/2](422300/4588595) loss:3.275 lr:0.0000100 epoch_Time:26417.0min: [2024-01-04 14:17:28,876][model8_pretrain.py][INFO] Epoch:[0/2](422300/4588595) loss:2.690 lr:0.0000100 epoch_Time:26417.0min: [2024-01-04 14:17:28,876][model8_pretrain.py][INFO] Epoch:[0/2](422300/4588595) loss:2.351 lr:0.0000100 epoch_Time:26417.0min: [2024-01-04 14:17:28,876][model8_pretrain.py][INFO] Epoch:[0/2](422300/4588595) loss:3.197 lr:0.0000100 epoch_Time:26417.0min: [2024-01-04 14:17:28,876][model8_pretrain.py][INFO] Epoch:[0/2](422300/4588595) loss:2.784 lr:0.0000100 epoch_Time:26417.0min: [2024-01-04 14:17:28,876][model8_pretrain.py][INFO] Epoch:[0/2](422300/4588595) loss:2.692 lr:0.0000100 epoch_Time:26417.0min: [2024-01-04 14:18:05,839][model8_pretrain.py][INFO] Epoch:[0/2](422400/4588595) loss:2.558 lr:0.0000100 epoch_Time:26416.0min: [2024-01-04 14:18:05,839][model8_pretrain.py][INFO] Epoch:[0/2](422400/4588595) loss:3.136 lr:0.0000100 epoch_Time:26416.0min: [2024-01-04 14:18:05,839][model8_pretrain.py][INFO] Epoch:[0/2](422400/4588595) loss:2.154 lr:0.0000100 
epoch_Time:26416.0min: [2024-01-04 14:18:05,839][model8_pretrain.py][INFO] Epoch:[0/2](422400/4588595) loss:3.470 lr:0.0000100 epoch_Time:26416.0min: [2024-01-04 14:18:05,839][model8_pretrain.py][INFO] Epoch:[0/2](422400/4588595) loss:2.997 lr:0.0000100 epoch_Time:26416.0min: [2024-01-04 14:18:05,839][model8_pretrain.py][INFO] Epoch:[0/2](422400/4588595) loss:3.227 lr:0.0000100 epoch_Time:26416.0min: [2024-01-04 14:18:05,839][model8_pretrain.py][INFO] Epoch:[0/2](422400/4588595) loss:3.093 lr:0.0000100 epoch_Time:26416.0min: [2024-01-04 14:18:05,839][model8_pretrain.py][INFO] Epoch:[0/2](422400/4588595) loss:3.064 lr:0.0000100 epoch_Time:26416.0min: [2024-01-04 14:18:53,476][model8_pretrain.py][INFO] Epoch:[0/2](422500/4588595) loss:2.924 lr:0.0000100 epoch_Time:26417.0min: [2024-01-04 14:18:53,476][model8_pretrain.py][INFO] Epoch:[0/2](422500/4588595) loss:3.063 lr:0.0000100 epoch_Time:26417.0min: [2024-01-04 14:18:53,476][model8_pretrain.py][INFO] Epoch:[0/2](422500/4588595) loss:3.479 lr:0.0000100 epoch_Time:26417.0min: [2024-01-04 14:18:53,476][model8_pretrain.py][INFO] Epoch:[0/2](422500/4588595) loss:2.000 lr:0.0000100 epoch_Time:26417.0min: [2024-01-04 14:18:53,476][model8_pretrain.py][INFO] Epoch:[0/2](422500/4588595) loss:2.659 lr:0.0000100 epoch_Time:26417.0min: [2024-01-04 14:18:53,476][model8_pretrain.py][INFO] Epoch:[0/2](422500/4588595) loss:2.696 lr:0.0000100 epoch_Time:26417.0min: [2024-01-04 14:18:53,477][model8_pretrain.py][INFO] Epoch:[0/2](422500/4588595) loss:2.587 lr:0.0000100 epoch_Time:26417.0min: [2024-01-04 14:18:53,477][model8_pretrain.py][INFO] Epoch:[0/2](422500/4588595) loss:3.274 lr:0.0000100 epoch_Time:26417.0min: [2024-01-04 14:19:30,406][model8_pretrain.py][INFO] Epoch:[0/2](422600/4588595) loss:2.922 lr:0.0000100 epoch_Time:26417.0min: [2024-01-04 14:19:30,406][model8_pretrain.py][INFO] Epoch:[0/2](422600/4588595) loss:3.184 lr:0.0000100 epoch_Time:26417.0min: [2024-01-04 14:19:30,406][model8_pretrain.py][INFO] 
Epoch:[0/2](422600/4588595) loss:3.068 lr:0.0000100 epoch_Time:26417.0min: [2024-01-04 14:19:30,406][model8_pretrain.py][INFO] Epoch:[0/2](422600/4588595) loss:2.876 lr:0.0000100 epoch_Time:26417.0min: [2024-01-04 14:19:30,406][model8_pretrain.py][INFO] Epoch:[0/2](422600/4588595) loss:3.206 lr:0.0000100 epoch_Time:26417.0min: [2024-01-04 14:19:30,407][model8_pretrain.py][INFO] Epoch:[0/2](422600/4588595) loss:2.328 lr:0.0000100 epoch_Time:26417.0min: [2024-01-04 14:19:30,407][model8_pretrain.py][INFO] Epoch:[0/2](422600/4588595) loss:2.844 lr:0.0000100 epoch_Time:26417.0min: [2024-01-04 14:19:30,407][model8_pretrain.py][INFO] Epoch:[0/2](422600/4588595) loss:3.071 lr:0.0000100 epoch_Time:26417.0min: [2024-01-04 14:20:07,344][model8_pretrain.py][INFO] Epoch:[0/2](422700/4588595) loss:2.550 lr:0.0000100 epoch_Time:26416.0min: [2024-01-04 14:20:07,344][model8_pretrain.py][INFO] Epoch:[0/2](422700/4588595) loss:2.995 lr:0.0000100 epoch_Time:26416.0min: [2024-01-04 14:20:07,344][model8_pretrain.py][INFO] Epoch:[0/2](422700/4588595) loss:2.269 lr:0.0000100 epoch_Time:26416.0min: [2024-01-04 14:20:07,344][model8_pretrain.py][INFO] Epoch:[0/2](422700/4588595) loss:2.700 lr:0.0000100 epoch_Time:26416.0min: [2024-01-04 14:20:07,344][model8_pretrain.py][INFO] Epoch:[0/2](422700/4588595) loss:2.684 lr:0.0000100 epoch_Time:26416.0min: [2024-01-04 14:20:07,344][model8_pretrain.py][INFO] Epoch:[0/2](422700/4588595) loss:2.546 lr:0.0000100 epoch_Time:26416.0min: [2024-01-04 14:20:07,344][model8_pretrain.py][INFO] Epoch:[0/2](422700/4588595) loss:2.906 lr:0.0000100 epoch_Time:26416.0min: [2024-01-04 14:20:07,344][model8_pretrain.py][INFO] Epoch:[0/2](422700/4588595) loss:3.346 lr:0.0000100 epoch_Time:26416.0min: [2024-01-04 14:20:44,282][model8_pretrain.py][INFO] Epoch:[0/2](422800/4588595) loss:2.460 lr:0.0000100 epoch_Time:26415.0min: [2024-01-04 14:20:44,282][model8_pretrain.py][INFO] Epoch:[0/2](422800/4588595) loss:2.842 lr:0.0000100 epoch_Time:26415.0min: [2024-01-04 
14:20:44,282][model8_pretrain.py][INFO] Epoch:[0/2](422800/4588595) loss:2.874 lr:0.0000100 epoch_Time:26415.0min: [2024-01-04 14:20:44,282][model8_pretrain.py][INFO] Epoch:[0/2](422800/4588595) loss:2.799 lr:0.0000100 epoch_Time:26415.0min: [2024-01-04 14:20:44,282][model8_pretrain.py][INFO] Epoch:[0/2](422800/4588595) loss:2.324 lr:0.0000100 epoch_Time:26415.0min: [2024-01-04 14:20:44,282][model8_pretrain.py][INFO] Epoch:[0/2](422800/4588595) loss:2.744 lr:0.0000100 epoch_Time:26415.0min: [2024-01-04 14:20:44,282][model8_pretrain.py][INFO] Epoch:[0/2](422800/4588595) loss:2.202 lr:0.0000100 epoch_Time:26415.0min: [2024-01-04 14:20:44,283][model8_pretrain.py][INFO] Epoch:[0/2](422800/4588595) loss:2.728 lr:0.0000100 epoch_Time:26415.0min: [2024-01-04 14:21:21,225][model8_pretrain.py][INFO] Epoch:[0/2](422900/4588595) loss:2.667 lr:0.0000100 epoch_Time:26414.0min: [2024-01-04 14:21:21,225][model8_pretrain.py][INFO] Epoch:[0/2](422900/4588595) loss:3.072 lr:0.0000100 epoch_Time:26414.0min: [2024-01-04 14:21:21,225][model8_pretrain.py][INFO] Epoch:[0/2](422900/4588595) loss:3.120 lr:0.0000100 epoch_Time:26414.0min: [2024-01-04 14:21:21,225][model8_pretrain.py][INFO] Epoch:[0/2](422900/4588595) loss:3.086 lr:0.0000100 epoch_Time:26414.0min: [2024-01-04 14:21:21,225][model8_pretrain.py][INFO] Epoch:[0/2](422900/4588595) loss:3.034 lr:0.0000100 epoch_Time:26414.0min: [2024-01-04 14:21:21,225][model8_pretrain.py][INFO] Epoch:[0/2](422900/4588595) loss:2.643 lr:0.0000100 epoch_Time:26414.0min: [2024-01-04 14:21:21,225][model8_pretrain.py][INFO] Epoch:[0/2](422900/4588595) loss:2.616 lr:0.0000100 epoch_Time:26414.0min: [2024-01-04 14:21:21,225][model8_pretrain.py][INFO] Epoch:[0/2](422900/4588595) loss:2.814 lr:0.0000100 epoch_Time:26414.0min: [2024-01-04 14:21:58,174][model8_pretrain.py][INFO] Epoch:[0/2](423000/4588595) loss:2.961 lr:0.0000100 epoch_Time:26413.0min: [2024-01-04 14:21:58,175][model8_pretrain.py][INFO] Epoch:[0/2](423000/4588595) loss:3.511 lr:0.0000100 
epoch_Time:26413.0min: [2024-01-04 14:21:58,175][model8_pretrain.py][INFO] Epoch:[0/2](423000/4588595) loss:2.571 lr:0.0000100 epoch_Time:26413.0min: [2024-01-04 14:21:58,175][model8_pretrain.py][INFO] Epoch:[0/2](423000/4588595) loss:2.617 lr:0.0000100 epoch_Time:26413.0min: [2024-01-04 14:21:58,175][model8_pretrain.py][INFO] Epoch:[0/2](423000/4588595) loss:3.188 lr:0.0000100 epoch_Time:26413.0min: [2024-01-04 14:21:58,176][model8_pretrain.py][INFO] Epoch:[0/2](423000/4588595) loss:2.856 lr:0.0000100 epoch_Time:26413.0min: [2024-01-04 14:21:58,176][model8_pretrain.py][INFO] Epoch:[0/2](423000/4588595) loss:3.273 lr:0.0000100 epoch_Time:26413.0min: [2024-01-04 14:21:58,176][model8_pretrain.py][INFO] Epoch:[0/2](423000/4588595) loss:3.254 lr:0.0000100 epoch_Time:26413.0min: [2024-01-04 14:22:35,113][model8_pretrain.py][INFO] Epoch:[0/2](423100/4588595) loss:3.081 lr:0.0000100 epoch_Time:26413.0min: [2024-01-04 14:22:35,113][model8_pretrain.py][INFO] Epoch:[0/2](423100/4588595) loss:2.599 lr:0.0000100 epoch_Time:26413.0min: [2024-01-04 14:22:35,113][model8_pretrain.py][INFO] Epoch:[0/2](423100/4588595) loss:2.753 lr:0.0000100 epoch_Time:26413.0min: [2024-01-04 14:22:35,113][model8_pretrain.py][INFO] Epoch:[0/2](423100/4588595) loss:2.447 lr:0.0000100 epoch_Time:26413.0min: [2024-01-04 14:22:35,113][model8_pretrain.py][INFO] Epoch:[0/2](423100/4588595) loss:2.785 lr:0.0000100 epoch_Time:26413.0min: [2024-01-04 14:22:35,113][model8_pretrain.py][INFO] Epoch:[0/2](423100/4588595) loss:2.708 lr:0.0000100 epoch_Time:26413.0min: [2024-01-04 14:22:35,113][model8_pretrain.py][INFO] Epoch:[0/2](423100/4588595) loss:2.943 lr:0.0000100 epoch_Time:26413.0min: [2024-01-04 14:22:35,113][model8_pretrain.py][INFO] Epoch:[0/2](423100/4588595) loss:2.819 lr:0.0000100 epoch_Time:26413.0min: [2024-01-04 14:23:12,091][model8_pretrain.py][INFO] Epoch:[0/2](423200/4588595) loss:3.592 lr:0.0000100 epoch_Time:26412.0min: [2024-01-04 14:23:12,091][model8_pretrain.py][INFO] 
Epoch:[0/2](423200/4588595) loss:2.497 lr:0.0000100 epoch_Time:26412.0min: [2024-01-04 14:23:12,091][model8_pretrain.py][INFO] Epoch:[0/2](423200/4588595) loss:2.867 lr:0.0000100 epoch_Time:26412.0min: [2024-01-04 14:23:12,091][model8_pretrain.py][INFO] Epoch:[0/2](423200/4588595) loss:2.864 lr:0.0000100 epoch_Time:26412.0min: [2024-01-04 14:23:12,091][model8_pretrain.py][INFO] Epoch:[0/2](423200/4588595) loss:2.970 lr:0.0000100 epoch_Time:26412.0min: [2024-01-04 14:23:12,091][model8_pretrain.py][INFO] Epoch:[0/2](423200/4588595) loss:2.798 lr:0.0000100 epoch_Time:26412.0min: [2024-01-04 14:23:12,091][model8_pretrain.py][INFO] Epoch:[0/2](423200/4588595) loss:2.784 lr:0.0000100 epoch_Time:26412.0min: [2024-01-04 14:23:12,092][model8_pretrain.py][INFO] Epoch:[0/2](423200/4588595) loss:3.186 lr:0.0000100 epoch_Time:26412.0min: [2024-01-04 14:23:59,211][model8_pretrain.py][INFO] Epoch:[0/2](423300/4588595) loss:3.280 lr:0.0000100 epoch_Time:26412.0min: [2024-01-04 14:23:59,211][model8_pretrain.py][INFO] Epoch:[0/2](423300/4588595) loss:2.329 lr:0.0000100 epoch_Time:26412.0min: [2024-01-04 14:23:59,211][model8_pretrain.py][INFO] Epoch:[0/2](423300/4588595) loss:2.838 lr:0.0000100 epoch_Time:26412.0min: [2024-01-04 14:23:59,211][model8_pretrain.py][INFO] Epoch:[0/2](423300/4588595) loss:3.247 lr:0.0000100 epoch_Time:26412.0min: [2024-01-04 14:23:59,211][model8_pretrain.py][INFO] Epoch:[0/2](423300/4588595) loss:2.639 lr:0.0000100 epoch_Time:26412.0min: [2024-01-04 14:23:59,211][model8_pretrain.py][INFO] Epoch:[0/2](423300/4588595) loss:3.260 lr:0.0000100 epoch_Time:26412.0min: [2024-01-04 14:23:59,211][model8_pretrain.py][INFO] Epoch:[0/2](423300/4588595) loss:2.822 lr:0.0000100 epoch_Time:26412.0min: [2024-01-04 14:23:59,211][model8_pretrain.py][INFO] Epoch:[0/2](423300/4588595) loss:2.947 lr:0.0000100 epoch_Time:26412.0min: [2024-01-04 14:24:36,134][model8_pretrain.py][INFO] Epoch:[0/2](423400/4588595) loss:3.214 lr:0.0000100 epoch_Time:26412.0min: [2024-01-04 
14:24:36,134][model8_pretrain.py][INFO] Epoch:[0/2](423400/4588595) loss:2.819 lr:0.0000100 epoch_Time:26412.0min: [2024-01-04 14:24:36,134][model8_pretrain.py][INFO] Epoch:[0/2](423400/4588595) loss:3.179 lr:0.0000100 epoch_Time:26412.0min: [2024-01-04 14:24:36,134][model8_pretrain.py][INFO] Epoch:[0/2](423400/4588595) loss:2.826 lr:0.0000100 epoch_Time:26412.0min: [2024-01-04 14:24:36,134][model8_pretrain.py][INFO] Epoch:[0/2](423400/4588595) loss:2.638 lr:0.0000100 epoch_Time:26412.0min: [2024-01-04 14:24:36,134][model8_pretrain.py][INFO] Epoch:[0/2](423400/4588595) loss:2.832 lr:0.0000100 epoch_Time:26412.0min: [2024-01-04 14:24:36,134][model8_pretrain.py][INFO] Epoch:[0/2](423400/4588595) loss:3.116 lr:0.0000100 epoch_Time:26412.0min: [2024-01-04 14:24:36,134][model8_pretrain.py][INFO] Epoch:[0/2](423400/4588595) loss:2.879 lr:0.0000100 epoch_Time:26412.0min: [2024-01-04 14:25:13,036][model8_pretrain.py][INFO] Epoch:[0/2](423500/4588595) loss:2.772 lr:0.0000100 epoch_Time:26411.0min: [2024-01-04 14:25:13,036][model8_pretrain.py][INFO] Epoch:[0/2](423500/4588595) loss:2.641 lr:0.0000100 epoch_Time:26411.0min: [2024-01-04 14:25:13,036][model8_pretrain.py][INFO] Epoch:[0/2](423500/4588595) loss:2.638 lr:0.0000100 epoch_Time:26411.0min: [2024-01-04 14:25:13,036][model8_pretrain.py][INFO] Epoch:[0/2](423500/4588595) loss:2.846 lr:0.0000100 epoch_Time:26411.0min: [2024-01-04 14:25:13,036][model8_pretrain.py][INFO] Epoch:[0/2](423500/4588595) loss:2.545 lr:0.0000100 epoch_Time:26411.0min: [2024-01-04 14:25:13,036][model8_pretrain.py][INFO] Epoch:[0/2](423500/4588595) loss:3.131 lr:0.0000100 epoch_Time:26411.0min: [2024-01-04 14:25:13,036][model8_pretrain.py][INFO] Epoch:[0/2](423500/4588595) loss:3.207 lr:0.0000100 epoch_Time:26411.0min: [2024-01-04 14:25:13,036][model8_pretrain.py][INFO] Epoch:[0/2](423500/4588595) loss:3.181 lr:0.0000100 epoch_Time:26411.0min: [2024-01-04 14:25:49,964][model8_pretrain.py][INFO] Epoch:[0/2](423600/4588595) loss:2.773 lr:0.0000100 
epoch_Time:26410.0min: [2024-01-04 14:25:49,964][model8_pretrain.py][INFO] Epoch:[0/2](423600/4588595) loss:2.999 lr:0.0000100 epoch_Time:26410.0min: [2024-01-04 14:25:49,964][model8_pretrain.py][INFO] Epoch:[0/2](423600/4588595) loss:2.813 lr:0.0000100 epoch_Time:26410.0min: [2024-01-04 14:25:49,964][model8_pretrain.py][INFO] Epoch:[0/2](423600/4588595) loss:2.698 lr:0.0000100 epoch_Time:26410.0min: [2024-01-04 14:25:49,964][model8_pretrain.py][INFO] Epoch:[0/2](423600/4588595) loss:3.160 lr:0.0000100 epoch_Time:26410.0min: [2024-01-04 14:25:49,964][model8_pretrain.py][INFO] Epoch:[0/2](423600/4588595) loss:1.962 lr:0.0000100 epoch_Time:26410.0min: [2024-01-04 14:25:49,965][model8_pretrain.py][INFO] Epoch:[0/2](423600/4588595) loss:3.507 lr:0.0000100 epoch_Time:26410.0min: [2024-01-04 14:25:49,965][model8_pretrain.py][INFO] Epoch:[0/2](423600/4588595) loss:2.886 lr:0.0000100 epoch_Time:26410.0min: [2024-01-04 14:26:26,901][model8_pretrain.py][INFO] Epoch:[0/2](423700/4588595) loss:2.890 lr:0.0000100 epoch_Time:26409.0min: [2024-01-04 14:26:26,901][model8_pretrain.py][INFO] Epoch:[0/2](423700/4588595) loss:2.025 lr:0.0000100 epoch_Time:26409.0min: [2024-01-04 14:26:26,901][model8_pretrain.py][INFO] Epoch:[0/2](423700/4588595) loss:2.585 lr:0.0000100 epoch_Time:26409.0min: [2024-01-04 14:26:26,901][model8_pretrain.py][INFO] Epoch:[0/2](423700/4588595) loss:2.746 lr:0.0000100 epoch_Time:26409.0min: [2024-01-04 14:26:26,901][model8_pretrain.py][INFO] Epoch:[0/2](423700/4588595) loss:2.761 lr:0.0000100 epoch_Time:26409.0min: [2024-01-04 14:26:26,901][model8_pretrain.py][INFO] Epoch:[0/2](423700/4588595) loss:3.271 lr:0.0000100 epoch_Time:26409.0min: [2024-01-04 14:26:26,901][model8_pretrain.py][INFO] Epoch:[0/2](423700/4588595) loss:2.595 lr:0.0000100 epoch_Time:26409.0min: [2024-01-04 14:26:26,901][model8_pretrain.py][INFO] Epoch:[0/2](423700/4588595) loss:2.963 lr:0.0000100 epoch_Time:26409.0min: [2024-01-04 14:27:03,826][model8_pretrain.py][INFO] 
Epoch:[0/2](423800/4588595) loss:2.895 lr:0.0000100 epoch_Time:26408.0min: [2024-01-04 14:27:03,826][model8_pretrain.py][INFO] Epoch:[0/2](423800/4588595) loss:3.053 lr:0.0000100 epoch_Time:26408.0min: [2024-01-04 14:27:03,826][model8_pretrain.py][INFO] Epoch:[0/2](423800/4588595) loss:3.074 lr:0.0000100 epoch_Time:26408.0min: [2024-01-04 14:27:03,826][model8_pretrain.py][INFO] Epoch:[0/2](423800/4588595) loss:2.774 lr:0.0000100 epoch_Time:26408.0min: [2024-01-04 14:27:03,826][model8_pretrain.py][INFO] Epoch:[0/2](423800/4588595) loss:2.877 lr:0.0000100 epoch_Time:26408.0min: [2024-01-04 14:27:03,826][model8_pretrain.py][INFO] Epoch:[0/2](423800/4588595) loss:2.441 lr:0.0000100 epoch_Time:26408.0min: [2024-01-04 14:27:03,826][model8_pretrain.py][INFO] Epoch:[0/2](423800/4588595) loss:3.139 lr:0.0000100 epoch_Time:26408.0min: [2024-01-04 14:27:03,826][model8_pretrain.py][INFO] Epoch:[0/2](423800/4588595) loss:3.348 lr:0.0000100 epoch_Time:26408.0min: [2024-01-04 14:27:40,782][model8_pretrain.py][INFO] Epoch:[0/2](423900/4588595) loss:2.407 lr:0.0000100 epoch_Time:26408.0min: [2024-01-04 14:27:40,782][model8_pretrain.py][INFO] Epoch:[0/2](423900/4588595) loss:3.077 lr:0.0000100 epoch_Time:26408.0min: [2024-01-04 14:27:40,782][model8_pretrain.py][INFO] Epoch:[0/2](423900/4588595) loss:2.968 lr:0.0000100 epoch_Time:26408.0min: [2024-01-04 14:27:40,782][model8_pretrain.py][INFO] Epoch:[0/2](423900/4588595) loss:2.342 lr:0.0000100 epoch_Time:26408.0min: [2024-01-04 14:27:40,782][model8_pretrain.py][INFO] Epoch:[0/2](423900/4588595) loss:3.515 lr:0.0000100 epoch_Time:26408.0min: [2024-01-04 14:27:40,782][model8_pretrain.py][INFO] Epoch:[0/2](423900/4588595) loss:2.860 lr:0.0000100 epoch_Time:26408.0min: [2024-01-04 14:27:40,782][model8_pretrain.py][INFO] Epoch:[0/2](423900/4588595) loss:2.797 lr:0.0000100 epoch_Time:26408.0min: [2024-01-04 14:27:40,782][model8_pretrain.py][INFO] Epoch:[0/2](423900/4588595) loss:2.394 lr:0.0000100 epoch_Time:26408.0min: [2024-01-04 
14:28:17,720][model8_pretrain.py][INFO] Epoch:[0/2](424000/4588595) loss:3.044 lr:0.0000100 epoch_Time:26407.0min: [2024-01-04 14:28:17,720][model8_pretrain.py][INFO] Epoch:[0/2](424000/4588595) loss:2.549 lr:0.0000100 epoch_Time:26407.0min: [2024-01-04 14:28:17,720][model8_pretrain.py][INFO] Epoch:[0/2](424000/4588595) loss:2.948 lr:0.0000100 epoch_Time:26407.0min: [2024-01-04 14:28:17,720][model8_pretrain.py][INFO] Epoch:[0/2](424000/4588595) loss:3.133 lr:0.0000100 epoch_Time:26407.0min: [2024-01-04 14:28:17,720][model8_pretrain.py][INFO] Epoch:[0/2](424000/4588595) loss:2.595 lr:0.0000100 epoch_Time:26407.0min: [2024-01-04 14:28:17,720][model8_pretrain.py][INFO] Epoch:[0/2](424000/4588595) loss:3.187 lr:0.0000100 epoch_Time:26407.0min: [2024-01-04 14:28:17,720][model8_pretrain.py][INFO] Epoch:[0/2](424000/4588595) loss:3.347 lr:0.0000100 epoch_Time:26407.0min: [2024-01-04 14:28:17,720][model8_pretrain.py][INFO] Epoch:[0/2](424000/4588595) loss:2.679 lr:0.0000100 epoch_Time:26407.0min: [2024-01-04 14:29:04,807][model8_pretrain.py][INFO] Epoch:[0/2](424100/4588595) loss:2.608 lr:0.0000100 epoch_Time:26407.0min: [2024-01-04 14:29:04,807][model8_pretrain.py][INFO] Epoch:[0/2](424100/4588595) loss:2.332 lr:0.0000100 epoch_Time:26407.0min: [2024-01-04 14:29:04,807][model8_pretrain.py][INFO] Epoch:[0/2](424100/4588595) loss:2.689 lr:0.0000100 epoch_Time:26407.0min: [2024-01-04 14:29:04,807][model8_pretrain.py][INFO] Epoch:[0/2](424100/4588595) loss:3.316 lr:0.0000100 epoch_Time:26407.0min: [2024-01-04 14:29:04,807][model8_pretrain.py][INFO] Epoch:[0/2](424100/4588595) loss:2.700 lr:0.0000100 epoch_Time:26407.0min: [2024-01-04 14:29:04,807][model8_pretrain.py][INFO] Epoch:[0/2](424100/4588595) loss:2.737 lr:0.0000100 epoch_Time:26407.0min: [2024-01-04 14:29:04,807][model8_pretrain.py][INFO] Epoch:[0/2](424100/4588595) loss:3.124 lr:0.0000100 epoch_Time:26407.0min: [2024-01-04 14:29:04,807][model8_pretrain.py][INFO] Epoch:[0/2](424100/4588595) loss:3.055 lr:0.0000100 
epoch_Time:26407.0min: [2024-01-04 14:29:41,706][model8_pretrain.py][INFO] Epoch:[0/2](424200/4588595) loss:2.966 lr:0.0000100 epoch_Time:26407.0min: [2024-01-04 14:29:41,706][model8_pretrain.py][INFO] Epoch:[0/2](424200/4588595) loss:2.740 lr:0.0000100 epoch_Time:26407.0min: [2024-01-04 14:29:41,706][model8_pretrain.py][INFO] Epoch:[0/2](424200/4588595) loss:2.969 lr:0.0000100 epoch_Time:26407.0min: [2024-01-04 14:29:41,706][model8_pretrain.py][INFO] Epoch:[0/2](424200/4588595) loss:2.975 lr:0.0000100 epoch_Time:26407.0min: [2024-01-04 14:29:41,706][model8_pretrain.py][INFO] Epoch:[0/2](424200/4588595) loss:2.827 lr:0.0000100 epoch_Time:26407.0min: [2024-01-04 14:29:41,706][model8_pretrain.py][INFO] Epoch:[0/2](424200/4588595) loss:2.773 lr:0.0000100 epoch_Time:26407.0min: [2024-01-04 14:29:41,706][model8_pretrain.py][INFO] Epoch:[0/2](424200/4588595) loss:2.895 lr:0.0000100 epoch_Time:26407.0min: [2024-01-04 14:29:41,706][model8_pretrain.py][INFO] Epoch:[0/2](424200/4588595) loss:2.642 lr:0.0000100 epoch_Time:26407.0min: [2024-01-04 14:30:18,657][model8_pretrain.py][INFO] Epoch:[0/2](424300/4588595) loss:3.053 lr:0.0000100 epoch_Time:26406.0min: [2024-01-04 14:30:18,657][model8_pretrain.py][INFO] Epoch:[0/2](424300/4588595) loss:3.026 lr:0.0000100 epoch_Time:26406.0min: [2024-01-04 14:30:18,657][model8_pretrain.py][INFO] Epoch:[0/2](424300/4588595) loss:2.523 lr:0.0000100 epoch_Time:26406.0min: [2024-01-04 14:30:18,657][model8_pretrain.py][INFO] Epoch:[0/2](424300/4588595) loss:3.308 lr:0.0000100 epoch_Time:26406.0min: [2024-01-04 14:30:18,657][model8_pretrain.py][INFO] Epoch:[0/2](424300/4588595) loss:2.943 lr:0.0000100 epoch_Time:26406.0min: [2024-01-04 14:30:18,657][model8_pretrain.py][INFO] Epoch:[0/2](424300/4588595) loss:2.535 lr:0.0000100 epoch_Time:26406.0min: [2024-01-04 14:30:18,657][model8_pretrain.py][INFO] Epoch:[0/2](424300/4588595) loss:2.787 lr:0.0000100 epoch_Time:26406.0min: [2024-01-04 14:30:18,657][model8_pretrain.py][INFO] 
Epoch:[0/2](424300/4588595) loss:3.448 lr:0.0000100 epoch_Time:26406.0min: [2024-01-04 14:30:55,598][model8_pretrain.py][INFO] Epoch:[0/2](424400/4588595) loss:3.150 lr:0.0000100 epoch_Time:26405.0min: [2024-01-04 14:30:55,598][model8_pretrain.py][INFO] Epoch:[0/2](424400/4588595) loss:2.786 lr:0.0000100 epoch_Time:26405.0min: [2024-01-04 14:30:55,598][model8_pretrain.py][INFO] Epoch:[0/2](424400/4588595) loss:3.193 lr:0.0000100 epoch_Time:26405.0min: [2024-01-04 14:30:55,598][model8_pretrain.py][INFO] Epoch:[0/2](424400/4588595) loss:2.916 lr:0.0000100 epoch_Time:26405.0min: [2024-01-04 14:30:55,598][model8_pretrain.py][INFO] Epoch:[0/2](424400/4588595) loss:2.971 lr:0.0000100 epoch_Time:26405.0min: [2024-01-04 14:30:55,598][model8_pretrain.py][INFO] Epoch:[0/2](424400/4588595) loss:3.352 lr:0.0000100 epoch_Time:26405.0min: [2024-01-04 14:30:55,598][model8_pretrain.py][INFO] Epoch:[0/2](424400/4588595) loss:2.514 lr:0.0000100 epoch_Time:26405.0min: [2024-01-04 14:30:55,598][model8_pretrain.py][INFO] Epoch:[0/2](424400/4588595) loss:2.527 lr:0.0000100 epoch_Time:26405.0min: [2024-01-04 14:31:32,545][model8_pretrain.py][INFO] Epoch:[0/2](424500/4588595) loss:2.781 lr:0.0000100 epoch_Time:26405.0min: [2024-01-04 14:31:32,545][model8_pretrain.py][INFO] Epoch:[0/2](424500/4588595) loss:3.015 lr:0.0000100 epoch_Time:26405.0min: [2024-01-04 14:31:32,545][model8_pretrain.py][INFO] Epoch:[0/2](424500/4588595) loss:3.123 lr:0.0000100 epoch_Time:26405.0min: [2024-01-04 14:31:32,545][model8_pretrain.py][INFO] Epoch:[0/2](424500/4588595) loss:2.392 lr:0.0000100 epoch_Time:26405.0min: [2024-01-04 14:31:32,545][model8_pretrain.py][INFO] Epoch:[0/2](424500/4588595) loss:3.169 lr:0.0000100 epoch_Time:26405.0min: [2024-01-04 14:31:32,545][model8_pretrain.py][INFO] Epoch:[0/2](424500/4588595) loss:3.401 lr:0.0000100 epoch_Time:26405.0min: [2024-01-04 14:31:32,545][model8_pretrain.py][INFO] Epoch:[0/2](424500/4588595) loss:2.362 lr:0.0000100 epoch_Time:26405.0min: [2024-01-04 
14:31:32,545][model8_pretrain.py][INFO] Epoch:[0/2](424500/4588595) loss:2.563 lr:0.0000100 epoch_Time:26405.0min: [2024-01-04 14:32:09,507][model8_pretrain.py][INFO] Epoch:[0/2](424600/4588595) loss:2.892 lr:0.0000100 epoch_Time:26403.0min: [2024-01-04 14:32:09,507][model8_pretrain.py][INFO] Epoch:[0/2](424600/4588595) loss:2.262 lr:0.0000100 epoch_Time:26403.0min: [2024-01-04 14:32:09,507][model8_pretrain.py][INFO] Epoch:[0/2](424600/4588595) loss:2.495 lr:0.0000100 epoch_Time:26403.0min: [2024-01-04 14:32:09,507][model8_pretrain.py][INFO] Epoch:[0/2](424600/4588595) loss:2.772 lr:0.0000100 epoch_Time:26403.0min: [2024-01-04 14:32:09,507][model8_pretrain.py][INFO] Epoch:[0/2](424600/4588595) loss:3.317 lr:0.0000100 epoch_Time:26403.0min: [2024-01-04 14:32:09,507][model8_pretrain.py][INFO] Epoch:[0/2](424600/4588595) loss:3.017 lr:0.0000100 epoch_Time:26403.0min: [2024-01-04 14:32:09,507][model8_pretrain.py][INFO] Epoch:[0/2](424600/4588595) loss:3.249 lr:0.0000100 epoch_Time:26403.0min: [2024-01-04 14:32:09,507][model8_pretrain.py][INFO] Epoch:[0/2](424600/4588595) loss:2.638 lr:0.0000100 epoch_Time:26403.0min: [2024-01-04 14:32:46,465][model8_pretrain.py][INFO] Epoch:[0/2](424700/4588595) loss:2.887 lr:0.0000100 epoch_Time:26403.0min: [2024-01-04 14:32:46,465][model8_pretrain.py][INFO] Epoch:[0/2](424700/4588595) loss:2.764 lr:0.0000100 epoch_Time:26403.0min: [2024-01-04 14:32:46,465][model8_pretrain.py][INFO] Epoch:[0/2](424700/4588595) loss:2.542 lr:0.0000100 epoch_Time:26403.0min: [2024-01-04 14:32:46,465][model8_pretrain.py][INFO] Epoch:[0/2](424700/4588595) loss:3.261 lr:0.0000100 epoch_Time:26403.0min: [2024-01-04 14:32:46,465][model8_pretrain.py][INFO] Epoch:[0/2](424700/4588595) loss:3.214 lr:0.0000100 epoch_Time:26403.0min: [2024-01-04 14:32:46,465][model8_pretrain.py][INFO] Epoch:[0/2](424700/4588595) loss:2.977 lr:0.0000100 epoch_Time:26403.0min: [2024-01-04 14:32:46,465][model8_pretrain.py][INFO] Epoch:[0/2](424700/4588595) loss:2.992 lr:0.0000100 
epoch_Time:26403.0min: [2024-01-04 14:32:46,466][model8_pretrain.py][INFO] Epoch:[0/2](424700/4588595) loss:2.381 lr:0.0000100 epoch_Time:26403.0min: [2024-01-04 14:33:23,415][model8_pretrain.py][INFO] Epoch:[0/2](424800/4588595) loss:2.946 lr:0.0000100 epoch_Time:26402.0min: [2024-01-04 14:33:23,415][model8_pretrain.py][INFO] Epoch:[0/2](424800/4588595) loss:2.493 lr:0.0000100 epoch_Time:26402.0min: [2024-01-04 14:33:23,415][model8_pretrain.py][INFO] Epoch:[0/2](424800/4588595) loss:3.006 lr:0.0000100 epoch_Time:26402.0min: [2024-01-04 14:33:23,415][model8_pretrain.py][INFO] Epoch:[0/2](424800/4588595) loss:3.127 lr:0.0000100 epoch_Time:26402.0min: [2024-01-04 14:33:23,415][model8_pretrain.py][INFO] Epoch:[0/2](424800/4588595) loss:3.318 lr:0.0000100 epoch_Time:26402.0min: [2024-01-04 14:33:23,415][model8_pretrain.py][INFO] Epoch:[0/2](424800/4588595) loss:2.811 lr:0.0000100 epoch_Time:26402.0min: [2024-01-04 14:33:23,415][model8_pretrain.py][INFO] Epoch:[0/2](424800/4588595) loss:3.146 lr:0.0000100 epoch_Time:26402.0min: [2024-01-04 14:33:23,415][model8_pretrain.py][INFO] Epoch:[0/2](424800/4588595) loss:3.401 lr:0.0000100 epoch_Time:26402.0min: [2024-01-04 14:34:10,417][model8_pretrain.py][INFO] Epoch:[0/2](424900/4588595) loss:2.813 lr:0.0000100 epoch_Time:26403.0min: [2024-01-04 14:34:10,417][model8_pretrain.py][INFO] Epoch:[0/2](424900/4588595) loss:2.881 lr:0.0000100 epoch_Time:26403.0min: [2024-01-04 14:34:10,417][model8_pretrain.py][INFO] Epoch:[0/2](424900/4588595) loss:3.031 lr:0.0000100 epoch_Time:26403.0min: [2024-01-04 14:34:10,417][model8_pretrain.py][INFO] Epoch:[0/2](424900/4588595) loss:3.036 lr:0.0000100 epoch_Time:26403.0min: [2024-01-04 14:34:10,417][model8_pretrain.py][INFO] Epoch:[0/2](424900/4588595) loss:2.173 lr:0.0000100 epoch_Time:26403.0min: [2024-01-04 14:34:10,417][model8_pretrain.py][INFO] Epoch:[0/2](424900/4588595) loss:2.641 lr:0.0000100 epoch_Time:26403.0min: [2024-01-04 14:34:10,417][model8_pretrain.py][INFO] 
Epoch:[0/2](424900/4588595) loss:3.082 lr:0.0000100 epoch_Time:26403.0min: [2024-01-04 14:34:10,417][model8_pretrain.py][INFO] Epoch:[0/2](424900/4588595) loss:3.064 lr:0.0000100 epoch_Time:26403.0min: [2024-01-04 14:34:47,345][model8_pretrain.py][INFO] Epoch:[0/2](425000/4588595) loss:3.114 lr:0.0000100 epoch_Time:26402.0min: [2024-01-04 14:34:47,345][model8_pretrain.py][INFO] Epoch:[0/2](425000/4588595) loss:3.163 lr:0.0000100 epoch_Time:26402.0min: [2024-01-04 14:34:47,345][model8_pretrain.py][INFO] Epoch:[0/2](425000/4588595) loss:3.051 lr:0.0000100 epoch_Time:26402.0min: [2024-01-04 14:34:47,345][model8_pretrain.py][INFO] Epoch:[0/2](425000/4588595) loss:3.140 lr:0.0000100 epoch_Time:26402.0min: [2024-01-04 14:34:47,345][model8_pretrain.py][INFO] Epoch:[0/2](425000/4588595) loss:2.388 lr:0.0000100 epoch_Time:26402.0min: [2024-01-04 14:34:47,345][model8_pretrain.py][INFO] Epoch:[0/2](425000/4588595) loss:2.865 lr:0.0000100 epoch_Time:26402.0min: [2024-01-04 14:34:47,345][model8_pretrain.py][INFO] Epoch:[0/2](425000/4588595) loss:2.416 lr:0.0000100 epoch_Time:26402.0min: [2024-01-04 14:34:47,345][model8_pretrain.py][INFO] Epoch:[0/2](425000/4588595) loss:3.135 lr:0.0000100 epoch_Time:26402.0min: [2024-01-04 14:35:24,271][model8_pretrain.py][INFO] Epoch:[0/2](425100/4588595) loss:3.309 lr:0.0000100 epoch_Time:26401.0min: [2024-01-04 14:35:24,271][model8_pretrain.py][INFO] Epoch:[0/2](425100/4588595) loss:3.091 lr:0.0000100 epoch_Time:26401.0min: [2024-01-04 14:35:24,271][model8_pretrain.py][INFO] Epoch:[0/2](425100/4588595) loss:2.883 lr:0.0000100 epoch_Time:26401.0min: [2024-01-04 14:35:24,271][model8_pretrain.py][INFO] Epoch:[0/2](425100/4588595) loss:2.867 lr:0.0000100 epoch_Time:26401.0min: [2024-01-04 14:35:24,271][model8_pretrain.py][INFO] Epoch:[0/2](425100/4588595) loss:2.990 lr:0.0000100 epoch_Time:26401.0min: [2024-01-04 14:35:24,271][model8_pretrain.py][INFO] Epoch:[0/2](425100/4588595) loss:3.017 lr:0.0000100 epoch_Time:26401.0min: [2024-01-04 
14:35:24,272][model8_pretrain.py][INFO] Epoch:[0/2](425100/4588595) loss:1.565 lr:0.0000100 epoch_Time:26401.0min: [2024-01-04 14:35:24,272][model8_pretrain.py][INFO] Epoch:[0/2](425100/4588595) loss:2.826 lr:0.0000100 epoch_Time:26401.0min: [2024-01-04 14:36:01,197][model8_pretrain.py][INFO] Epoch:[0/2](425200/4588595) loss:2.441 lr:0.0000100 epoch_Time:26400.0min: [2024-01-04 14:36:01,197][model8_pretrain.py][INFO] Epoch:[0/2](425200/4588595) loss:3.025 lr:0.0000100 epoch_Time:26400.0min: [2024-01-04 14:36:01,197][model8_pretrain.py][INFO] Epoch:[0/2](425200/4588595) loss:2.861 lr:0.0000100 epoch_Time:26400.0min: [2024-01-04 14:36:01,197][model8_pretrain.py][INFO] Epoch:[0/2](425200/4588595) loss:2.843 lr:0.0000100 epoch_Time:26400.0min: [2024-01-04 14:36:01,197][model8_pretrain.py][INFO] Epoch:[0/2](425200/4588595) loss:3.352 lr:0.0000100 epoch_Time:26400.0min: [2024-01-04 14:36:01,197][model8_pretrain.py][INFO] Epoch:[0/2](425200/4588595) loss:3.205 lr:0.0000100 epoch_Time:26400.0min: [2024-01-04 14:36:01,197][model8_pretrain.py][INFO] Epoch:[0/2](425200/4588595) loss:2.839 lr:0.0000100 epoch_Time:26400.0min: [2024-01-04 14:36:01,197][model8_pretrain.py][INFO] Epoch:[0/2](425200/4588595) loss:3.289 lr:0.0000100 epoch_Time:26400.0min: [2024-01-04 14:36:38,127][model8_pretrain.py][INFO] Epoch:[0/2](425300/4588595) loss:3.037 lr:0.0000100 epoch_Time:26400.0min: [2024-01-04 14:36:38,127][model8_pretrain.py][INFO] Epoch:[0/2](425300/4588595) loss:2.631 lr:0.0000100 epoch_Time:26400.0min: [2024-01-04 14:36:38,127][model8_pretrain.py][INFO] Epoch:[0/2](425300/4588595) loss:2.671 lr:0.0000100 epoch_Time:26400.0min: [2024-01-04 14:36:38,127][model8_pretrain.py][INFO] Epoch:[0/2](425300/4588595) loss:3.180 lr:0.0000100 epoch_Time:26400.0min: [2024-01-04 14:36:38,127][model8_pretrain.py][INFO] Epoch:[0/2](425300/4588595) loss:3.368 lr:0.0000100 epoch_Time:26400.0min: [2024-01-04 14:36:38,127][model8_pretrain.py][INFO] Epoch:[0/2](425300/4588595) loss:2.556 lr:0.0000100 
epoch_Time:26400.0min: [2024-01-04 14:36:38,128][model8_pretrain.py][INFO] Epoch:[0/2](425300/4588595) loss:3.236 lr:0.0000100 epoch_Time:26400.0min: [2024-01-04 14:36:38,128][model8_pretrain.py][INFO] Epoch:[0/2](425300/4588595) loss:2.490 lr:0.0000100 epoch_Time:26400.0min: [2024-01-04 14:37:15,054][model8_pretrain.py][INFO] Epoch:[0/2](425400/4588595) loss:2.870 lr:0.0000100 epoch_Time:26399.0min: [2024-01-04 14:37:15,054][model8_pretrain.py][INFO] Epoch:[0/2](425400/4588595) loss:2.974 lr:0.0000100 epoch_Time:26399.0min: [2024-01-04 14:37:15,054][model8_pretrain.py][INFO] Epoch:[0/2](425400/4588595) loss:3.090 lr:0.0000100 epoch_Time:26399.0min: [2024-01-04 14:37:15,054][model8_pretrain.py][INFO] Epoch:[0/2](425400/4588595) loss:3.327 lr:0.0000100 epoch_Time:26399.0min: [2024-01-04 14:37:15,054][model8_pretrain.py][INFO] Epoch:[0/2](425400/4588595) loss:2.550 lr:0.0000100 epoch_Time:26399.0min: [2024-01-04 14:37:15,054][model8_pretrain.py][INFO] Epoch:[0/2](425400/4588595) loss:2.788 lr:0.0000100 epoch_Time:26399.0min: [2024-01-04 14:37:15,054][model8_pretrain.py][INFO] Epoch:[0/2](425400/4588595) loss:3.129 lr:0.0000100 epoch_Time:26399.0min: [2024-01-04 14:37:15,054][model8_pretrain.py][INFO] Epoch:[0/2](425400/4588595) loss:3.059 lr:0.0000100 epoch_Time:26399.0min: [2024-01-04 14:37:51,980][model8_pretrain.py][INFO] Epoch:[0/2](425500/4588595) loss:2.867 lr:0.0000100 epoch_Time:26397.0min: [2024-01-04 14:37:51,980][model8_pretrain.py][INFO] Epoch:[0/2](425500/4588595) loss:2.527 lr:0.0000100 epoch_Time:26397.0min: [2024-01-04 14:37:51,980][model8_pretrain.py][INFO] Epoch:[0/2](425500/4588595) loss:3.128 lr:0.0000100 epoch_Time:26397.0min: [2024-01-04 14:37:51,980][model8_pretrain.py][INFO] Epoch:[0/2](425500/4588595) loss:2.909 lr:0.0000100 epoch_Time:26397.0min: [2024-01-04 14:37:51,980][model8_pretrain.py][INFO] Epoch:[0/2](425500/4588595) loss:3.422 lr:0.0000100 epoch_Time:26397.0min: [2024-01-04 14:37:51,981][model8_pretrain.py][INFO] 
Epoch:[0/2](425500/4588595) loss:2.971 lr:0.0000100 epoch_Time:26397.0min: [2024-01-04 14:37:51,980][model8_pretrain.py][INFO] Epoch:[0/2](425500/4588595) loss:2.991 lr:0.0000100 epoch_Time:26397.0min: [2024-01-04 14:37:51,981][model8_pretrain.py][INFO] Epoch:[0/2](425500/4588595) loss:2.916 lr:0.0000100 epoch_Time:26397.0min: [2024-01-04 14:38:28,913][model8_pretrain.py][INFO] Epoch:[0/2](425600/4588595) loss:2.735 lr:0.0000100 epoch_Time:26397.0min: [2024-01-04 14:38:28,913][model8_pretrain.py][INFO] Epoch:[0/2](425600/4588595) loss:2.984 lr:0.0000100 epoch_Time:26397.0min: [2024-01-04 14:38:28,913][model8_pretrain.py][INFO] Epoch:[0/2](425600/4588595) loss:2.941 lr:0.0000100 epoch_Time:26397.0min: [2024-01-04 14:38:28,913][model8_pretrain.py][INFO] Epoch:[0/2](425600/4588595) loss:3.709 lr:0.0000100 epoch_Time:26397.0min: [2024-01-04 14:38:28,913][model8_pretrain.py][INFO] Epoch:[0/2](425600/4588595) loss:2.944 lr:0.0000100 epoch_Time:26397.0min: [2024-01-04 14:38:28,913][model8_pretrain.py][INFO] Epoch:[0/2](425600/4588595) loss:2.935 lr:0.0000100 epoch_Time:26397.0min: [2024-01-04 14:38:28,913][model8_pretrain.py][INFO] Epoch:[0/2](425600/4588595) loss:2.160 lr:0.0000100 epoch_Time:26397.0min: [2024-01-04 14:38:28,913][model8_pretrain.py][INFO] Epoch:[0/2](425600/4588595) loss:3.053 lr:0.0000100 epoch_Time:26397.0min: [2024-01-04 14:39:15,923][model8_pretrain.py][INFO] Epoch:[0/2](425700/4588595) loss:2.990 lr:0.0000100 epoch_Time:26398.0min: [2024-01-04 14:39:15,923][model8_pretrain.py][INFO] Epoch:[0/2](425700/4588595) loss:3.320 lr:0.0000100 epoch_Time:26398.0min: [2024-01-04 14:39:15,923][model8_pretrain.py][INFO] Epoch:[0/2](425700/4588595) loss:2.422 lr:0.0000100 epoch_Time:26398.0min: [2024-01-04 14:39:15,923][model8_pretrain.py][INFO] Epoch:[0/2](425700/4588595) loss:3.219 lr:0.0000100 epoch_Time:26398.0min: [2024-01-04 14:39:15,923][model8_pretrain.py][INFO] Epoch:[0/2](425700/4588595) loss:2.874 lr:0.0000100 epoch_Time:26398.0min: [2024-01-04 
14:39:15,923][model8_pretrain.py][INFO] Epoch:[0/2](425700/4588595) loss:3.013 lr:0.0000100 epoch_Time:26398.0min: [2024-01-04 14:39:15,923][model8_pretrain.py][INFO] Epoch:[0/2](425700/4588595) loss:2.510 lr:0.0000100 epoch_Time:26398.0min: [2024-01-04 14:39:15,923][model8_pretrain.py][INFO] Epoch:[0/2](425700/4588595) loss:2.638 lr:0.0000100 epoch_Time:26398.0min: [2024-01-04 14:39:52,852][model8_pretrain.py][INFO] Epoch:[0/2](425800/4588595) loss:2.751 lr:0.0000100 epoch_Time:26397.0min: [2024-01-04 14:39:52,852][model8_pretrain.py][INFO] Epoch:[0/2](425800/4588595) loss:3.187 lr:0.0000100 epoch_Time:26397.0min: [2024-01-04 14:39:52,852][model8_pretrain.py][INFO] Epoch:[0/2](425800/4588595) loss:2.176 lr:0.0000100 epoch_Time:26397.0min: [2024-01-04 14:39:52,852][model8_pretrain.py][INFO] Epoch:[0/2](425800/4588595) loss:2.939 lr:0.0000100 epoch_Time:26397.0min: [2024-01-04 14:39:52,852][model8_pretrain.py][INFO] Epoch:[0/2](425800/4588595) loss:3.352 lr:0.0000100 epoch_Time:26397.0min: [2024-01-04 14:39:52,852][model8_pretrain.py][INFO] Epoch:[0/2](425800/4588595) loss:2.677 lr:0.0000100 epoch_Time:26397.0min: [2024-01-04 14:39:52,852][model8_pretrain.py][INFO] Epoch:[0/2](425800/4588595) loss:3.150 lr:0.0000100 epoch_Time:26397.0min: [2024-01-04 14:39:52,852][model8_pretrain.py][INFO] Epoch:[0/2](425800/4588595) loss:3.000 lr:0.0000100 epoch_Time:26397.0min: [2024-01-04 14:40:29,784][model8_pretrain.py][INFO] Epoch:[0/2](425900/4588595) loss:3.252 lr:0.0000100 epoch_Time:26396.0min: [2024-01-04 14:40:29,784][model8_pretrain.py][INFO] Epoch:[0/2](425900/4588595) loss:3.335 lr:0.0000100 epoch_Time:26396.0min: [2024-01-04 14:40:29,784][model8_pretrain.py][INFO] Epoch:[0/2](425900/4588595) loss:2.772 lr:0.0000100 epoch_Time:26396.0min: [2024-01-04 14:40:29,784][model8_pretrain.py][INFO] Epoch:[0/2](425900/4588595) loss:2.916 lr:0.0000100 epoch_Time:26396.0min: [2024-01-04 14:40:29,784][model8_pretrain.py][INFO] Epoch:[0/2](425900/4588595) loss:3.041 lr:0.0000100 
epoch_Time:26396.0min: [2024-01-04 14:40:29,784][model8_pretrain.py][INFO] Epoch:[0/2](425900/4588595) loss:2.900 lr:0.0000100 epoch_Time:26396.0min: [2024-01-04 14:40:29,784][model8_pretrain.py][INFO] Epoch:[0/2](425900/4588595) loss:2.835 lr:0.0000100 epoch_Time:26396.0min: [2024-01-04 14:40:29,784][model8_pretrain.py][INFO] Epoch:[0/2](425900/4588595) loss:2.739 lr:0.0000100 epoch_Time:26396.0min: [2024-01-04 14:41:06,714][model8_pretrain.py][INFO] Epoch:[0/2](426000/4588595) loss:2.439 lr:0.0000100 epoch_Time:26395.0min: [2024-01-04 14:41:06,714][model8_pretrain.py][INFO] Epoch:[0/2](426000/4588595) loss:2.931 lr:0.0000100 epoch_Time:26395.0min: [2024-01-04 14:41:06,714][model8_pretrain.py][INFO] Epoch:[0/2](426000/4588595) loss:2.882 lr:0.0000100 epoch_Time:26395.0min: [2024-01-04 14:41:06,714][model8_pretrain.py][INFO] Epoch:[0/2](426000/4588595) loss:2.587 lr:0.0000100 epoch_Time:26395.0min: [2024-01-04 14:41:06,714][model8_pretrain.py][INFO] Epoch:[0/2](426000/4588595) loss:2.930 lr:0.0000100 epoch_Time:26395.0min: [2024-01-04 14:41:06,714][model8_pretrain.py][INFO] Epoch:[0/2](426000/4588595) loss:2.828 lr:0.0000100 epoch_Time:26395.0min: [2024-01-04 14:41:06,714][model8_pretrain.py][INFO] Epoch:[0/2](426000/4588595) loss:2.406 lr:0.0000100 epoch_Time:26395.0min: [2024-01-04 14:41:06,714][model8_pretrain.py][INFO] Epoch:[0/2](426000/4588595) loss:2.985 lr:0.0000100 epoch_Time:26395.0min: [2024-01-04 14:41:43,648][model8_pretrain.py][INFO] Epoch:[0/2](426100/4588595) loss:2.687 lr:0.0000100 epoch_Time:26395.0min: [2024-01-04 14:41:43,648][model8_pretrain.py][INFO] Epoch:[0/2](426100/4588595) loss:3.229 lr:0.0000100 epoch_Time:26395.0min: [2024-01-04 14:41:43,648][model8_pretrain.py][INFO] Epoch:[0/2](426100/4588595) loss:3.198 lr:0.0000100 epoch_Time:26395.0min: [2024-01-04 14:41:43,648][model8_pretrain.py][INFO] Epoch:[0/2](426100/4588595) loss:2.676 lr:0.0000100 epoch_Time:26395.0min: [2024-01-04 14:41:43,648][model8_pretrain.py][INFO] 
Epoch:[0/2](426100/4588595) loss:2.824 lr:0.0000100 epoch_Time:26395.0min: [2024-01-04 14:41:43,648][model8_pretrain.py][INFO] Epoch:[0/2](426100/4588595) loss:2.761 lr:0.0000100 epoch_Time:26395.0min: [2024-01-04 14:41:43,648][model8_pretrain.py][INFO] Epoch:[0/2](426100/4588595) loss:2.758 lr:0.0000100 epoch_Time:26395.0min: [2024-01-04 14:41:43,649][model8_pretrain.py][INFO] Epoch:[0/2](426100/4588595) loss:3.270 lr:0.0000100 epoch_Time:26395.0min: [2024-01-04 14:42:20,590][model8_pretrain.py][INFO] Epoch:[0/2](426200/4588595) loss:3.100 lr:0.0000100 epoch_Time:26394.0min: [2024-01-04 14:42:20,590][model8_pretrain.py][INFO] Epoch:[0/2](426200/4588595) loss:2.623 lr:0.0000100 epoch_Time:26394.0min: [2024-01-04 14:42:20,590][model8_pretrain.py][INFO] Epoch:[0/2](426200/4588595) loss:3.149 lr:0.0000100 epoch_Time:26394.0min: [2024-01-04 14:42:20,590][model8_pretrain.py][INFO] Epoch:[0/2](426200/4588595) loss:2.927 lr:0.0000100 epoch_Time:26394.0min: [2024-01-04 14:42:20,590][model8_pretrain.py][INFO] Epoch:[0/2](426200/4588595) loss:2.961 lr:0.0000100 epoch_Time:26394.0min: [2024-01-04 14:42:20,590][model8_pretrain.py][INFO] Epoch:[0/2](426200/4588595) loss:2.880 lr:0.0000100 epoch_Time:26394.0min: [2024-01-04 14:42:20,590][model8_pretrain.py][INFO] Epoch:[0/2](426200/4588595) loss:3.517 lr:0.0000100 epoch_Time:26394.0min: [2024-01-04 14:42:20,591][model8_pretrain.py][INFO] Epoch:[0/2](426200/4588595) loss:2.828 lr:0.0000100 epoch_Time:26394.0min: [2024-01-04 14:42:57,531][model8_pretrain.py][INFO] Epoch:[0/2](426300/4588595) loss:2.999 lr:0.0000100 epoch_Time:26393.0min: [2024-01-04 14:42:57,531][model8_pretrain.py][INFO] Epoch:[0/2](426300/4588595) loss:2.605 lr:0.0000100 epoch_Time:26393.0min: [2024-01-04 14:42:57,531][model8_pretrain.py][INFO] Epoch:[0/2](426300/4588595) loss:2.686 lr:0.0000100 epoch_Time:26393.0min: [2024-01-04 14:42:57,531][model8_pretrain.py][INFO] Epoch:[0/2](426300/4588595) loss:3.193 lr:0.0000100 epoch_Time:26393.0min: [2024-01-04 
14:42:57,531][model8_pretrain.py][INFO] Epoch:[0/2](426300/4588595) loss:2.734 lr:0.0000100 epoch_Time:26393.0min: [2024-01-04 14:42:57,531][model8_pretrain.py][INFO] Epoch:[0/2](426300/4588595) loss:2.761 lr:0.0000100 epoch_Time:26393.0min: [2024-01-04 14:42:57,531][model8_pretrain.py][INFO] Epoch:[0/2](426300/4588595) loss:2.915 lr:0.0000100 epoch_Time:26393.0min: [2024-01-04 14:42:57,532][model8_pretrain.py][INFO] Epoch:[0/2](426300/4588595) loss:3.183 lr:0.0000100 epoch_Time:26393.0min: [2024-01-04 14:43:34,476][model8_pretrain.py][INFO] Epoch:[0/2](426400/4588595) loss:2.476 lr:0.0000100 epoch_Time:26392.0min: [2024-01-04 14:43:34,476][model8_pretrain.py][INFO] Epoch:[0/2](426400/4588595) loss:2.883 lr:0.0000100 epoch_Time:26392.0min: [2024-01-04 14:43:34,476][model8_pretrain.py][INFO] Epoch:[0/2](426400/4588595) loss:3.173 lr:0.0000100 epoch_Time:26392.0min: [2024-01-04 14:43:34,476][model8_pretrain.py][INFO] Epoch:[0/2](426400/4588595) loss:3.499 lr:0.0000100 epoch_Time:26392.0min: [2024-01-04 14:43:34,476][model8_pretrain.py][INFO] Epoch:[0/2](426400/4588595) loss:3.121 lr:0.0000100 epoch_Time:26392.0min: [2024-01-04 14:43:34,477][model8_pretrain.py][INFO] Epoch:[0/2](426400/4588595) loss:3.101 lr:0.0000100 epoch_Time:26392.0min: [2024-01-04 14:43:34,477][model8_pretrain.py][INFO] Epoch:[0/2](426400/4588595) loss:3.373 lr:0.0000100 epoch_Time:26392.0min: [2024-01-04 14:43:34,477][model8_pretrain.py][INFO] Epoch:[0/2](426400/4588595) loss:2.774 lr:0.0000100 epoch_Time:26392.0min: [2024-01-04 14:44:21,482][model8_pretrain.py][INFO] Epoch:[0/2](426500/4588595) loss:3.378 lr:0.0000100 epoch_Time:26393.0min: [2024-01-04 14:44:21,482][model8_pretrain.py][INFO] Epoch:[0/2](426500/4588595) loss:3.075 lr:0.0000100 epoch_Time:26393.0min: [2024-01-04 14:44:21,482][model8_pretrain.py][INFO] Epoch:[0/2](426500/4588595) loss:3.320 lr:0.0000100 epoch_Time:26393.0min: [2024-01-04 14:44:21,482][model8_pretrain.py][INFO] Epoch:[0/2](426500/4588595) loss:2.763 lr:0.0000100 
epoch_Time:26393.0min: [2024-01-04 14:44:21,482][model8_pretrain.py][INFO] Epoch:[0/2](426500/4588595) loss:2.597 lr:0.0000100 epoch_Time:26393.0min: [2024-01-04 14:44:21,482][model8_pretrain.py][INFO] Epoch:[0/2](426500/4588595) loss:3.012 lr:0.0000100 epoch_Time:26393.0min: [2024-01-04 14:44:21,482][model8_pretrain.py][INFO] Epoch:[0/2](426500/4588595) loss:2.804 lr:0.0000100 epoch_Time:26393.0min: [2024-01-04 14:44:21,482][model8_pretrain.py][INFO] Epoch:[0/2](426500/4588595) loss:2.256 lr:0.0000100 epoch_Time:26393.0min: [2024-01-04 14:44:58,406][model8_pretrain.py][INFO] Epoch:[0/2](426600/4588595) loss:3.087 lr:0.0000100 epoch_Time:26392.0min: [2024-01-04 14:44:58,406][model8_pretrain.py][INFO] Epoch:[0/2](426600/4588595) loss:2.903 lr:0.0000100 epoch_Time:26392.0min: [2024-01-04 14:44:58,406][model8_pretrain.py][INFO] Epoch:[0/2](426600/4588595) loss:3.272 lr:0.0000100 epoch_Time:26392.0min: [2024-01-04 14:44:58,406][model8_pretrain.py][INFO] Epoch:[0/2](426600/4588595) loss:2.731 lr:0.0000100 epoch_Time:26392.0min: [2024-01-04 14:44:58,406][model8_pretrain.py][INFO] Epoch:[0/2](426600/4588595) loss:2.597 lr:0.0000100 epoch_Time:26392.0min: [2024-01-04 14:44:58,407][model8_pretrain.py][INFO] Epoch:[0/2](426600/4588595) loss:3.152 lr:0.0000100 epoch_Time:26392.0min: [2024-01-04 14:44:58,407][model8_pretrain.py][INFO] Epoch:[0/2](426600/4588595) loss:2.978 lr:0.0000100 epoch_Time:26392.0min: [2024-01-04 14:44:58,407][model8_pretrain.py][INFO] Epoch:[0/2](426600/4588595) loss:2.380 lr:0.0000100 epoch_Time:26392.0min: [2024-01-04 14:45:35,335][model8_pretrain.py][INFO] Epoch:[0/2](426700/4588595) loss:3.303 lr:0.0000100 epoch_Time:26392.0min: [2024-01-04 14:45:35,335][model8_pretrain.py][INFO] Epoch:[0/2](426700/4588595) loss:2.999 lr:0.0000100 epoch_Time:26392.0min: [2024-01-04 14:45:35,335][model8_pretrain.py][INFO] Epoch:[0/2](426700/4588595) loss:3.081 lr:0.0000100 epoch_Time:26392.0min: [2024-01-04 14:45:35,335][model8_pretrain.py][INFO] 
Epoch:[0/2](426700/4588595) loss:3.106 lr:0.0000100 epoch_Time:26392.0min: [2024-01-04 14:45:35,335][model8_pretrain.py][INFO] Epoch:[0/2](426700/4588595) loss:2.707 lr:0.0000100 epoch_Time:26392.0min: [2024-01-04 14:45:35,335][model8_pretrain.py][INFO] Epoch:[0/2](426700/4588595) loss:2.933 lr:0.0000100 epoch_Time:26392.0min: [2024-01-04 14:45:35,335][model8_pretrain.py][INFO] Epoch:[0/2](426700/4588595) loss:2.823 lr:0.0000100 epoch_Time:26392.0min: [2024-01-04 14:45:35,335][model8_pretrain.py][INFO] Epoch:[0/2](426700/4588595) loss:2.851 lr:0.0000100 epoch_Time:26392.0min: [2024-01-04 14:46:12,273][model8_pretrain.py][INFO] Epoch:[0/2](426800/4588595) loss:2.296 lr:0.0000100 epoch_Time:26390.0min: [2024-01-04 14:46:12,273][model8_pretrain.py][INFO] Epoch:[0/2](426800/4588595) loss:2.828 lr:0.0000100 epoch_Time:26390.0min: [2024-01-04 14:46:12,273][model8_pretrain.py][INFO] Epoch:[0/2](426800/4588595) loss:2.481 lr:0.0000100 epoch_Time:26390.0min: [2024-01-04 14:46:12,273][model8_pretrain.py][INFO] Epoch:[0/2](426800/4588595) loss:2.848 lr:0.0000100 epoch_Time:26390.0min: [2024-01-04 14:46:12,273][model8_pretrain.py][INFO] Epoch:[0/2](426800/4588595) loss:2.499 lr:0.0000100 epoch_Time:26390.0min: [2024-01-04 14:46:12,273][model8_pretrain.py][INFO] Epoch:[0/2](426800/4588595) loss:3.456 lr:0.0000100 epoch_Time:26390.0min: [2024-01-04 14:46:12,273][model8_pretrain.py][INFO] Epoch:[0/2](426800/4588595) loss:2.810 lr:0.0000100 epoch_Time:26390.0min: [2024-01-04 14:46:12,273][model8_pretrain.py][INFO] Epoch:[0/2](426800/4588595) loss:2.567 lr:0.0000100 epoch_Time:26390.0min: [2024-01-04 14:46:49,206][model8_pretrain.py][INFO] Epoch:[0/2](426900/4588595) loss:2.803 lr:0.0000100 epoch_Time:26389.0min: [2024-01-04 14:46:49,206][model8_pretrain.py][INFO] Epoch:[0/2](426900/4588595) loss:3.050 lr:0.0000100 epoch_Time:26389.0min: [2024-01-04 14:46:49,206][model8_pretrain.py][INFO] Epoch:[0/2](426900/4588595) loss:2.947 lr:0.0000100 epoch_Time:26389.0min: [2024-01-04 
14:46:49,206][model8_pretrain.py][INFO] Epoch:[0/2](426900/4588595) loss:2.204 lr:0.0000100 epoch_Time:26389.0min: [2024-01-04 14:46:49,206][model8_pretrain.py][INFO] Epoch:[0/2](426900/4588595) loss:2.828 lr:0.0000100 epoch_Time:26389.0min: [2024-01-04 14:46:49,206][model8_pretrain.py][INFO] Epoch:[0/2](426900/4588595) loss:3.282 lr:0.0000100 epoch_Time:26389.0min: [2024-01-04 14:46:49,206][model8_pretrain.py][INFO] Epoch:[0/2](426900/4588595) loss:3.040 lr:0.0000100 epoch_Time:26389.0min: [2024-01-04 14:46:49,207][model8_pretrain.py][INFO] Epoch:[0/2](426900/4588595) loss:2.741 lr:0.0000100 epoch_Time:26389.0min: [2024-01-04 14:47:26,129][model8_pretrain.py][INFO] Epoch:[0/2](427000/4588595) loss:3.105 lr:0.0000100 epoch_Time:26389.0min: [2024-01-04 14:47:26,129][model8_pretrain.py][INFO] Epoch:[0/2](427000/4588595) loss:3.073 lr:0.0000100 epoch_Time:26389.0min: [2024-01-04 14:47:26,129][model8_pretrain.py][INFO] Epoch:[0/2](427000/4588595) loss:2.803 lr:0.0000100 epoch_Time:26389.0min: [2024-01-04 14:47:26,129][model8_pretrain.py][INFO] Epoch:[0/2](427000/4588595) loss:2.635 lr:0.0000100 epoch_Time:26389.0min: [2024-01-04 14:47:26,129][model8_pretrain.py][INFO] Epoch:[0/2](427000/4588595) loss:2.579 lr:0.0000100 epoch_Time:26389.0min: [2024-01-04 14:47:26,129][model8_pretrain.py][INFO] Epoch:[0/2](427000/4588595) loss:2.615 lr:0.0000100 epoch_Time:26389.0min: [2024-01-04 14:47:26,129][model8_pretrain.py][INFO] Epoch:[0/2](427000/4588595) loss:2.656 lr:0.0000100 epoch_Time:26389.0min: [2024-01-04 14:47:26,129][model8_pretrain.py][INFO] Epoch:[0/2](427000/4588595) loss:3.047 lr:0.0000100 epoch_Time:26389.0min: [2024-01-04 14:48:03,070][model8_pretrain.py][INFO] Epoch:[0/2](427100/4588595) loss:2.379 lr:0.0000100 epoch_Time:26388.0min: [2024-01-04 14:48:03,070][model8_pretrain.py][INFO] Epoch:[0/2](427100/4588595) loss:3.066 lr:0.0000100 epoch_Time:26388.0min: [2024-01-04 14:48:03,070][model8_pretrain.py][INFO] Epoch:[0/2](427100/4588595) loss:3.064 lr:0.0000100 
epoch_Time:26388.0min: [2024-01-04 14:48:03,070][model8_pretrain.py][INFO] Epoch:[0/2](427100/4588595) loss:2.586 lr:0.0000100 epoch_Time:26388.0min: [2024-01-04 14:48:03,070][model8_pretrain.py][INFO] Epoch:[0/2](427100/4588595) loss:2.897 lr:0.0000100 epoch_Time:26388.0min: [2024-01-04 14:48:03,070][model8_pretrain.py][INFO] Epoch:[0/2](427100/4588595) loss:2.446 lr:0.0000100 epoch_Time:26388.0min: [2024-01-04 14:48:03,070][model8_pretrain.py][INFO] Epoch:[0/2](427100/4588595) loss:2.913 lr:0.0000100 epoch_Time:26388.0min: [2024-01-04 14:48:03,070][model8_pretrain.py][INFO] Epoch:[0/2](427100/4588595) loss:2.142 lr:0.0000100 epoch_Time:26388.0min: [2024-01-04 14:48:39,996][model8_pretrain.py][INFO] Epoch:[0/2](427200/4588595) loss:3.340 lr:0.0000100 epoch_Time:26388.0min: [2024-01-04 14:48:39,996][model8_pretrain.py][INFO] Epoch:[0/2](427200/4588595) loss:3.088 lr:0.0000100 epoch_Time:26388.0min: [2024-01-04 14:48:39,996][model8_pretrain.py][INFO] Epoch:[0/2](427200/4588595) loss:2.901 lr:0.0000100 epoch_Time:26388.0min: [2024-01-04 14:48:39,996][model8_pretrain.py][INFO] Epoch:[0/2](427200/4588595) loss:3.280 lr:0.0000100 epoch_Time:26388.0min: [2024-01-04 14:48:39,996][model8_pretrain.py][INFO] Epoch:[0/2](427200/4588595) loss:2.164 lr:0.0000100 epoch_Time:26388.0min: [2024-01-04 14:48:39,996][model8_pretrain.py][INFO] Epoch:[0/2](427200/4588595) loss:2.721 lr:0.0000100 epoch_Time:26388.0min: [2024-01-04 14:48:39,996][model8_pretrain.py][INFO] Epoch:[0/2](427200/4588595) loss:2.974 lr:0.0000100 epoch_Time:26388.0min: [2024-01-04 14:48:39,996][model8_pretrain.py][INFO] Epoch:[0/2](427200/4588595) loss:3.148 lr:0.0000100 epoch_Time:26388.0min: [2024-01-04 14:49:27,082][model8_pretrain.py][INFO] Epoch:[0/2](427300/4588595) loss:3.073 lr:0.0000100 epoch_Time:26388.0min: [2024-01-04 14:49:27,082][model8_pretrain.py][INFO] Epoch:[0/2](427300/4588595) loss:2.823 lr:0.0000100 epoch_Time:26388.0min: [2024-01-04 14:49:27,082][model8_pretrain.py][INFO] 
Epoch:[0/2](427300/4588595) loss:2.542 lr:0.0000100 epoch_Time:26388.0min: [2024-01-04 14:49:27,082][model8_pretrain.py][INFO] Epoch:[0/2](427300/4588595) loss:2.980 lr:0.0000100 epoch_Time:26388.0min: [2024-01-04 14:49:27,082][model8_pretrain.py][INFO] Epoch:[0/2](427300/4588595) loss:2.610 lr:0.0000100 epoch_Time:26388.0min: [2024-01-04 14:49:27,082][model8_pretrain.py][INFO] Epoch:[0/2](427300/4588595) loss:2.307 lr:0.0000100 epoch_Time:26388.0min: [2024-01-04 14:49:27,082][model8_pretrain.py][INFO] Epoch:[0/2](427300/4588595) loss:3.003 lr:0.0000100 epoch_Time:26388.0min: [2024-01-04 14:49:27,082][model8_pretrain.py][INFO] Epoch:[0/2](427300/4588595) loss:3.044 lr:0.0000100 epoch_Time:26388.0min: [2024-01-04 14:50:04,017][model8_pretrain.py][INFO] Epoch:[0/2](427400/4588595) loss:2.604 lr:0.0000100 epoch_Time:26387.0min: [2024-01-04 14:50:04,017][model8_pretrain.py][INFO] Epoch:[0/2](427400/4588595) loss:2.424 lr:0.0000100 epoch_Time:26387.0min: [2024-01-04 14:50:04,017][model8_pretrain.py][INFO] Epoch:[0/2](427400/4588595) loss:3.116 lr:0.0000100 epoch_Time:26387.0min: [2024-01-04 14:50:04,017][model8_pretrain.py][INFO] Epoch:[0/2](427400/4588595) loss:2.824 lr:0.0000100 epoch_Time:26387.0min: [2024-01-04 14:50:04,017][model8_pretrain.py][INFO] Epoch:[0/2](427400/4588595) loss:3.555 lr:0.0000100 epoch_Time:26387.0min: [2024-01-04 14:50:04,017][model8_pretrain.py][INFO] Epoch:[0/2](427400/4588595) loss:2.630 lr:0.0000100 epoch_Time:26387.0min: [2024-01-04 14:50:04,017][model8_pretrain.py][INFO] Epoch:[0/2](427400/4588595) loss:2.122 lr:0.0000100 epoch_Time:26387.0min: [2024-01-04 14:50:04,017][model8_pretrain.py][INFO] Epoch:[0/2](427400/4588595) loss:2.957 lr:0.0000100 epoch_Time:26387.0min: [2024-01-04 14:50:40,966][model8_pretrain.py][INFO] Epoch:[0/2](427500/4588595) loss:3.560 lr:0.0000100 epoch_Time:26387.0min: [2024-01-04 14:50:40,966][model8_pretrain.py][INFO] Epoch:[0/2](427500/4588595) loss:2.671 lr:0.0000100 epoch_Time:26387.0min: [2024-01-04 
14:50:40,966][model8_pretrain.py][INFO] Epoch:[0/2](427500/4588595) loss:2.740 lr:0.0000100 epoch_Time:26387.0min: [2024-01-04 14:50:40,966][model8_pretrain.py][INFO] Epoch:[0/2](427500/4588595) loss:2.597 lr:0.0000100 epoch_Time:26387.0min: [2024-01-04 14:50:40,966][model8_pretrain.py][INFO] Epoch:[0/2](427500/4588595) loss:2.875 lr:0.0000100 epoch_Time:26387.0min: [2024-01-04 14:50:40,966][model8_pretrain.py][INFO] Epoch:[0/2](427500/4588595) loss:2.870 lr:0.0000100 epoch_Time:26387.0min: [2024-01-04 14:50:40,966][model8_pretrain.py][INFO] Epoch:[0/2](427500/4588595) loss:2.841 lr:0.0000100 epoch_Time:26387.0min: [2024-01-04 14:50:40,966][model8_pretrain.py][INFO] Epoch:[0/2](427500/4588595) loss:2.531 lr:0.0000100 epoch_Time:26387.0min: [2024-01-04 14:51:17,911][model8_pretrain.py][INFO] Epoch:[0/2](427600/4588595) loss:2.752 lr:0.0000100 epoch_Time:26386.0min: [2024-01-04 14:51:17,911][model8_pretrain.py][INFO] Epoch:[0/2](427600/4588595) loss:3.052 lr:0.0000100 epoch_Time:26386.0min: [2024-01-04 14:51:17,911][model8_pretrain.py][INFO] Epoch:[0/2](427600/4588595) loss:3.117 lr:0.0000100 epoch_Time:26386.0min: [2024-01-04 14:51:17,911][model8_pretrain.py][INFO] Epoch:[0/2](427600/4588595) loss:3.183 lr:0.0000100 epoch_Time:26386.0min: [2024-01-04 14:51:17,911][model8_pretrain.py][INFO] Epoch:[0/2](427600/4588595) loss:2.786 lr:0.0000100 epoch_Time:26386.0min: [2024-01-04 14:51:17,911][model8_pretrain.py][INFO] Epoch:[0/2](427600/4588595) loss:3.019 lr:0.0000100 epoch_Time:26386.0min: [2024-01-04 14:51:17,911][model8_pretrain.py][INFO] Epoch:[0/2](427600/4588595) loss:3.170 lr:0.0000100 epoch_Time:26386.0min: [2024-01-04 14:51:17,911][model8_pretrain.py][INFO] Epoch:[0/2](427600/4588595) loss:3.161 lr:0.0000100 epoch_Time:26386.0min: [2024-01-04 14:51:54,861][model8_pretrain.py][INFO] Epoch:[0/2](427700/4588595) loss:2.737 lr:0.0000100 epoch_Time:26385.0min: [2024-01-04 14:51:54,861][model8_pretrain.py][INFO] Epoch:[0/2](427700/4588595) loss:3.025 lr:0.0000100 
epoch_Time:26385.0min: [2024-01-04 14:51:54,861][model8_pretrain.py][INFO] Epoch:[0/2](427700/4588595) loss:3.284 lr:0.0000100 epoch_Time:26385.0min: [2024-01-04 14:51:54,861][model8_pretrain.py][INFO] Epoch:[0/2](427700/4588595) loss:3.412 lr:0.0000100 epoch_Time:26385.0min: [2024-01-04 14:51:54,861][model8_pretrain.py][INFO] Epoch:[0/2](427700/4588595) loss:2.952 lr:0.0000100 epoch_Time:26385.0min: [2024-01-04 14:51:54,861][model8_pretrain.py][INFO] Epoch:[0/2](427700/4588595) loss:3.256 lr:0.0000100 epoch_Time:26385.0min: [2024-01-04 14:51:54,861][model8_pretrain.py][INFO] Epoch:[0/2](427700/4588595) loss:3.584 lr:0.0000100 epoch_Time:26385.0min: [2024-01-04 14:51:54,861][model8_pretrain.py][INFO] Epoch:[0/2](427700/4588595) loss:2.527 lr:0.0000100 epoch_Time:26385.0min: [2024-01-04 14:52:31,820][model8_pretrain.py][INFO] Epoch:[0/2](427800/4588595) loss:2.527 lr:0.0000100 epoch_Time:26384.0min: [2024-01-04 14:52:31,820][model8_pretrain.py][INFO] Epoch:[0/2](427800/4588595) loss:2.752 lr:0.0000100 epoch_Time:26384.0min: [2024-01-04 14:52:31,820][model8_pretrain.py][INFO] Epoch:[0/2](427800/4588595) loss:3.029 lr:0.0000100 epoch_Time:26384.0min: [2024-01-04 14:52:31,820][model8_pretrain.py][INFO] Epoch:[0/2](427800/4588595) loss:2.992 lr:0.0000100 epoch_Time:26384.0min: [2024-01-04 14:52:31,820][model8_pretrain.py][INFO] Epoch:[0/2](427800/4588595) loss:3.322 lr:0.0000100 epoch_Time:26384.0min: [2024-01-04 14:52:31,820][model8_pretrain.py][INFO] Epoch:[0/2](427800/4588595) loss:1.809 lr:0.0000100 epoch_Time:26384.0min: [2024-01-04 14:52:31,820][model8_pretrain.py][INFO] Epoch:[0/2](427800/4588595) loss:3.005 lr:0.0000100 epoch_Time:26384.0min: [2024-01-04 14:52:31,820][model8_pretrain.py][INFO] Epoch:[0/2](427800/4588595) loss:1.757 lr:0.0000100 epoch_Time:26384.0min: [2024-01-04 14:53:08,776][model8_pretrain.py][INFO] Epoch:[0/2](427900/4588595) loss:3.031 lr:0.0000100 epoch_Time:26383.0min: [2024-01-04 14:53:08,776][model8_pretrain.py][INFO] 
Epoch:[0/2](427900/4588595) loss:2.810 lr:0.0000100 epoch_Time:26383.0min: [2024-01-04 14:53:08,776][model8_pretrain.py][INFO] Epoch:[0/2](427900/4588595) loss:3.063 lr:0.0000100 epoch_Time:26383.0min: [2024-01-04 14:53:08,776][model8_pretrain.py][INFO] Epoch:[0/2](427900/4588595) loss:3.156 lr:0.0000100 epoch_Time:26383.0min: [2024-01-04 14:53:08,776][model8_pretrain.py][INFO] Epoch:[0/2](427900/4588595) loss:3.394 lr:0.0000100 epoch_Time:26383.0min: [2024-01-04 14:53:08,776][model8_pretrain.py][INFO] Epoch:[0/2](427900/4588595) loss:2.570 lr:0.0000100 epoch_Time:26383.0min: [2024-01-04 14:53:08,776][model8_pretrain.py][INFO] Epoch:[0/2](427900/4588595) loss:3.538 lr:0.0000100 epoch_Time:26383.0min: [2024-01-04 14:53:08,776][model8_pretrain.py][INFO] Epoch:[0/2](427900/4588595) loss:3.073 lr:0.0000100 epoch_Time:26383.0min: [2024-01-04 14:53:45,728][model8_pretrain.py][INFO] Epoch:[0/2](428000/4588595) loss:3.351 lr:0.0000100 epoch_Time:26383.0min: [2024-01-04 14:53:45,728][model8_pretrain.py][INFO] Epoch:[0/2](428000/4588595) loss:2.804 lr:0.0000100 epoch_Time:26383.0min: [2024-01-04 14:53:45,728][model8_pretrain.py][INFO] Epoch:[0/2](428000/4588595) loss:3.310 lr:0.0000100 epoch_Time:26383.0min: [2024-01-04 14:53:45,728][model8_pretrain.py][INFO] Epoch:[0/2](428000/4588595) loss:3.029 lr:0.0000100 epoch_Time:26383.0min: [2024-01-04 14:53:45,728][model8_pretrain.py][INFO] Epoch:[0/2](428000/4588595) loss:3.229 lr:0.0000100 epoch_Time:26383.0min: [2024-01-04 14:53:45,728][model8_pretrain.py][INFO] Epoch:[0/2](428000/4588595) loss:3.117 lr:0.0000100 epoch_Time:26383.0min: [2024-01-04 14:53:45,728][model8_pretrain.py][INFO] Epoch:[0/2](428000/4588595) loss:3.146 lr:0.0000100 epoch_Time:26383.0min: [2024-01-04 14:53:45,728][model8_pretrain.py][INFO] Epoch:[0/2](428000/4588595) loss:2.698 lr:0.0000100 epoch_Time:26383.0min: [2024-01-04 14:54:33,019][model8_pretrain.py][INFO] Epoch:[0/2](428100/4588595) loss:2.841 lr:0.0000100 epoch_Time:26384.0min: [2024-01-04 
14:54:33,019][model8_pretrain.py][INFO] Epoch:[0/2](428100/4588595) loss:2.921 lr:0.0000100 epoch_Time:26384.0min: [2024-01-04 14:54:33,019][model8_pretrain.py][INFO] Epoch:[0/2](428100/4588595) loss:2.965 lr:0.0000100 epoch_Time:26384.0min: [2024-01-04 14:54:33,019][model8_pretrain.py][INFO] Epoch:[0/2](428100/4588595) loss:2.981 lr:0.0000100 epoch_Time:26384.0min: [2024-01-04 14:54:33,019][model8_pretrain.py][INFO] Epoch:[0/2](428100/4588595) loss:2.899 lr:0.0000100 epoch_Time:26384.0min: [2024-01-04 14:54:33,019][model8_pretrain.py][INFO] Epoch:[0/2](428100/4588595) loss:3.007 lr:0.0000100 epoch_Time:26384.0min: [2024-01-04 14:54:33,019][model8_pretrain.py][INFO] Epoch:[0/2](428100/4588595) loss:3.501 lr:0.0000100 epoch_Time:26384.0min: [2024-01-04 14:54:33,019][model8_pretrain.py][INFO] Epoch:[0/2](428100/4588595) loss:2.625 lr:0.0000100 epoch_Time:26384.0min: [2024-01-04 14:55:09,952][model8_pretrain.py][INFO] Epoch:[0/2](428200/4588595) loss:2.623 lr:0.0000100 epoch_Time:26382.0min: [2024-01-04 14:55:09,952][model8_pretrain.py][INFO] Epoch:[0/2](428200/4588595) loss:2.893 lr:0.0000100 epoch_Time:26382.0min: [2024-01-04 14:55:09,952][model8_pretrain.py][INFO] Epoch:[0/2](428200/4588595) loss:2.958 lr:0.0000100 epoch_Time:26382.0min: [2024-01-04 14:55:09,952][model8_pretrain.py][INFO] Epoch:[0/2](428200/4588595) loss:3.126 lr:0.0000100 epoch_Time:26382.0min: [2024-01-04 14:55:09,952][model8_pretrain.py][INFO] Epoch:[0/2](428200/4588595) loss:2.070 lr:0.0000100 epoch_Time:26382.0min: [2024-01-04 14:55:09,952][model8_pretrain.py][INFO] Epoch:[0/2](428200/4588595) loss:3.044 lr:0.0000100 epoch_Time:26382.0min: [2024-01-04 14:55:09,952][model8_pretrain.py][INFO] Epoch:[0/2](428200/4588595) loss:2.744 lr:0.0000100 epoch_Time:26382.0min: [2024-01-04 14:55:09,952][model8_pretrain.py][INFO] Epoch:[0/2](428200/4588595) loss:2.290 lr:0.0000100 epoch_Time:26382.0min: [2024-01-04 14:55:46,889][model8_pretrain.py][INFO] Epoch:[0/2](428300/4588595) loss:2.964 lr:0.0000100 
epoch_Time:26382.0min: [2024-01-04 14:55:46,889][model8_pretrain.py][INFO] Epoch:[0/2](428300/4588595) loss:2.843 lr:0.0000100 epoch_Time:26382.0min: [2024-01-04 14:55:46,889][model8_pretrain.py][INFO] Epoch:[0/2](428300/4588595) loss:3.361 lr:0.0000100 epoch_Time:26382.0min: [2024-01-04 14:55:46,889][model8_pretrain.py][INFO] Epoch:[0/2](428300/4588595) loss:2.749 lr:0.0000100 epoch_Time:26382.0min: [2024-01-04 14:55:46,889][model8_pretrain.py][INFO] Epoch:[0/2](428300/4588595) loss:3.338 lr:0.0000100 epoch_Time:26382.0min: [2024-01-04 14:55:46,889][model8_pretrain.py][INFO] Epoch:[0/2](428300/4588595) loss:3.174 lr:0.0000100 epoch_Time:26382.0min: [2024-01-04 14:55:46,890][model8_pretrain.py][INFO] Epoch:[0/2](428300/4588595) loss:2.630 lr:0.0000100 epoch_Time:26382.0min: [2024-01-04 14:55:46,890][model8_pretrain.py][INFO] Epoch:[0/2](428300/4588595) loss:2.835 lr:0.0000100 epoch_Time:26382.0min: [2024-01-04 14:56:23,801][model8_pretrain.py][INFO] Epoch:[0/2](428400/4588595) loss:2.939 lr:0.0000100 epoch_Time:26381.0min: [2024-01-04 14:56:23,801][model8_pretrain.py][INFO] Epoch:[0/2](428400/4588595) loss:2.860 lr:0.0000100 epoch_Time:26381.0min: [2024-01-04 14:56:23,801][model8_pretrain.py][INFO] Epoch:[0/2](428400/4588595) loss:2.816 lr:0.0000100 epoch_Time:26381.0min: [2024-01-04 14:56:23,801][model8_pretrain.py][INFO] Epoch:[0/2](428400/4588595) loss:2.997 lr:0.0000100 epoch_Time:26381.0min: [2024-01-04 14:56:23,801][model8_pretrain.py][INFO] Epoch:[0/2](428400/4588595) loss:2.805 lr:0.0000100 epoch_Time:26381.0min: [2024-01-04 14:56:23,801][model8_pretrain.py][INFO] Epoch:[0/2](428400/4588595) loss:2.204 lr:0.0000100 epoch_Time:26381.0min: [2024-01-04 14:56:23,801][model8_pretrain.py][INFO] Epoch:[0/2](428400/4588595) loss:2.223 lr:0.0000100 epoch_Time:26381.0min: [2024-01-04 14:56:23,801][model8_pretrain.py][INFO] Epoch:[0/2](428400/4588595) loss:2.387 lr:0.0000100 epoch_Time:26381.0min: [2024-01-04 14:57:00,738][model8_pretrain.py][INFO] 
Epoch:[0/2](428500/4588595) loss:2.973 lr:0.0000100 epoch_Time:26380.0min: [2024-01-04 14:57:00,738][model8_pretrain.py][INFO] Epoch:[0/2](428500/4588595) loss:2.336 lr:0.0000100 epoch_Time:26380.0min: [2024-01-04 14:57:00,738][model8_pretrain.py][INFO] Epoch:[0/2](428500/4588595) loss:3.277 lr:0.0000100 epoch_Time:26380.0min: [2024-01-04 14:57:00,738][model8_pretrain.py][INFO] Epoch:[0/2](428500/4588595) loss:2.589 lr:0.0000100 epoch_Time:26380.0min: [2024-01-04 14:57:00,738][model8_pretrain.py][INFO] Epoch:[0/2](428500/4588595) loss:3.398 lr:0.0000100 epoch_Time:26380.0min: [2024-01-04 14:57:00,738][model8_pretrain.py][INFO] Epoch:[0/2](428500/4588595) loss:3.232 lr:0.0000100 epoch_Time:26380.0min: [2024-01-04 14:57:00,738][model8_pretrain.py][INFO] Epoch:[0/2](428500/4588595) loss:3.155 lr:0.0000100 epoch_Time:26380.0min: [2024-01-04 14:57:00,738][model8_pretrain.py][INFO] Epoch:[0/2](428500/4588595) loss:3.105 lr:0.0000100 epoch_Time:26380.0min: [2024-01-04 14:57:37,666][model8_pretrain.py][INFO] Epoch:[0/2](428600/4588595) loss:2.980 lr:0.0000100 epoch_Time:26380.0min: [2024-01-04 14:57:37,666][model8_pretrain.py][INFO] Epoch:[0/2](428600/4588595) loss:2.716 lr:0.0000100 epoch_Time:26380.0min: [2024-01-04 14:57:37,666][model8_pretrain.py][INFO] Epoch:[0/2](428600/4588595) loss:3.075 lr:0.0000100 epoch_Time:26380.0min: [2024-01-04 14:57:37,666][model8_pretrain.py][INFO] Epoch:[0/2](428600/4588595) loss:2.997 lr:0.0000100 epoch_Time:26380.0min: [2024-01-04 14:57:37,666][model8_pretrain.py][INFO] Epoch:[0/2](428600/4588595) loss:2.734 lr:0.0000100 epoch_Time:26380.0min: [2024-01-04 14:57:37,666][model8_pretrain.py][INFO] Epoch:[0/2](428600/4588595) loss:3.132 lr:0.0000100 epoch_Time:26380.0min: [2024-01-04 14:57:37,666][model8_pretrain.py][INFO] Epoch:[0/2](428600/4588595) loss:3.092 lr:0.0000100 epoch_Time:26380.0min: [2024-01-04 14:57:37,666][model8_pretrain.py][INFO] Epoch:[0/2](428600/4588595) loss:2.871 lr:0.0000100 epoch_Time:26380.0min: [2024-01-04 
14:58:14,602][model8_pretrain.py][INFO] Epoch:[0/2](428700/4588595) loss:2.870 lr:0.0000100 epoch_Time:26378.0min: [2024-01-04 14:58:14,602][model8_pretrain.py][INFO] Epoch:[0/2](428700/4588595) loss:3.071 lr:0.0000100 epoch_Time:26378.0min: [2024-01-04 14:58:14,602][model8_pretrain.py][INFO] Epoch:[0/2](428700/4588595) loss:3.273 lr:0.0000100 epoch_Time:26378.0min: [2024-01-04 14:58:14,602][model8_pretrain.py][INFO] Epoch:[0/2](428700/4588595) loss:3.025 lr:0.0000100 epoch_Time:26378.0min: [2024-01-04 14:58:14,602][model8_pretrain.py][INFO] Epoch:[0/2](428700/4588595) loss:2.772 lr:0.0000100 epoch_Time:26378.0min: [2024-01-04 14:58:14,602][model8_pretrain.py][INFO] Epoch:[0/2](428700/4588595) loss:3.096 lr:0.0000100 epoch_Time:26378.0min: [2024-01-04 14:58:14,602][model8_pretrain.py][INFO] Epoch:[0/2](428700/4588595) loss:3.089 lr:0.0000100 epoch_Time:26378.0min: [2024-01-04 14:58:14,602][model8_pretrain.py][INFO] Epoch:[0/2](428700/4588595) loss:2.529 lr:0.0000100 epoch_Time:26378.0min: [2024-01-04 14:58:51,542][model8_pretrain.py][INFO] Epoch:[0/2](428800/4588595) loss:2.398 lr:0.0000100 epoch_Time:26377.0min: [2024-01-04 14:58:51,542][model8_pretrain.py][INFO] Epoch:[0/2](428800/4588595) loss:3.040 lr:0.0000100 epoch_Time:26377.0min: [2024-01-04 14:58:51,542][model8_pretrain.py][INFO] Epoch:[0/2](428800/4588595) loss:3.008 lr:0.0000100 epoch_Time:26377.0min: [2024-01-04 14:58:51,542][model8_pretrain.py][INFO] Epoch:[0/2](428800/4588595) loss:3.153 lr:0.0000100 epoch_Time:26377.0min: [2024-01-04 14:58:51,543][model8_pretrain.py][INFO] Epoch:[0/2](428800/4588595) loss:2.617 lr:0.0000100 epoch_Time:26377.0min: [2024-01-04 14:58:51,543][model8_pretrain.py][INFO] Epoch:[0/2](428800/4588595) loss:3.273 lr:0.0000100 epoch_Time:26377.0min: [2024-01-04 14:58:51,543][model8_pretrain.py][INFO] Epoch:[0/2](428800/4588595) loss:3.136 lr:0.0000100 epoch_Time:26377.0min: [2024-01-04 14:58:51,543][model8_pretrain.py][INFO] Epoch:[0/2](428800/4588595) loss:2.689 lr:0.0000100 
epoch_Time:26377.0min: [2024-01-04 14:59:37,213][model8_pretrain.py][INFO] Epoch:[0/2](428900/4588595) loss:2.320 lr:0.0000100 epoch_Time:26379.0min: [2024-01-04 14:59:37,213][model8_pretrain.py][INFO] Epoch:[0/2](428900/4588595) loss:3.072 lr:0.0000100 epoch_Time:26379.0min: [2024-01-04 14:59:37,213][model8_pretrain.py][INFO] Epoch:[0/2](428900/4588595) loss:2.379 lr:0.0000100 epoch_Time:26379.0min: [2024-01-04 14:59:37,213][model8_pretrain.py][INFO] Epoch:[0/2](428900/4588595) loss:3.143 lr:0.0000100 epoch_Time:26379.0min: [2024-01-04 14:59:37,213][model8_pretrain.py][INFO] Epoch:[0/2](428900/4588595) loss:3.158 lr:0.0000100 epoch_Time:26379.0min: [2024-01-04 14:59:37,213][model8_pretrain.py][INFO] Epoch:[0/2](428900/4588595) loss:2.897 lr:0.0000100 epoch_Time:26379.0min: [2024-01-04 14:59:37,214][model8_pretrain.py][INFO] Epoch:[0/2](428900/4588595) loss:3.315 lr:0.0000100 epoch_Time:26379.0min: [2024-01-04 14:59:37,215][model8_pretrain.py][INFO] Epoch:[0/2](428900/4588595) loss:2.681 lr:0.0000100 epoch_Time:26379.0min: [2024-01-04 15:00:15,864][model8_pretrain.py][INFO] Epoch:[0/2](429000/4588595) loss:2.634 lr:0.0000100 epoch_Time:26378.0min: [2024-01-04 15:00:15,864][model8_pretrain.py][INFO] Epoch:[0/2](429000/4588595) loss:3.029 lr:0.0000100 epoch_Time:26378.0min: [2024-01-04 15:00:15,864][model8_pretrain.py][INFO] Epoch:[0/2](429000/4588595) loss:3.105 lr:0.0000100 epoch_Time:26378.0min: [2024-01-04 15:00:15,864][model8_pretrain.py][INFO] Epoch:[0/2](429000/4588595) loss:2.945 lr:0.0000100 epoch_Time:26378.0min: [2024-01-04 15:00:15,864][model8_pretrain.py][INFO] Epoch:[0/2](429000/4588595) loss:2.680 lr:0.0000100 epoch_Time:26378.0min: [2024-01-04 15:00:15,864][model8_pretrain.py][INFO] Epoch:[0/2](429000/4588595) loss:2.154 lr:0.0000100 epoch_Time:26378.0min: [2024-01-04 15:00:15,864][model8_pretrain.py][INFO] Epoch:[0/2](429000/4588595) loss:3.039 lr:0.0000100 epoch_Time:26378.0min: [2024-01-04 15:00:15,864][model8_pretrain.py][INFO] 
Epoch:[0/2](429000/4588595) loss:2.369 lr:0.0000100 epoch_Time:26378.0min: [2024-01-04 15:00:52,795][model8_pretrain.py][INFO] Epoch:[0/2](429100/4588595) loss:2.676 lr:0.0000100 epoch_Time:26376.0min: [2024-01-04 15:00:52,795][model8_pretrain.py][INFO] Epoch:[0/2](429100/4588595) loss:2.922 lr:0.0000100 epoch_Time:26376.0min: [2024-01-04 15:00:52,795][model8_pretrain.py][INFO] Epoch:[0/2](429100/4588595) loss:2.826 lr:0.0000100 epoch_Time:26376.0min: [2024-01-04 15:00:52,795][model8_pretrain.py][INFO] Epoch:[0/2](429100/4588595) loss:3.447 lr:0.0000100 epoch_Time:26376.0min: [2024-01-04 15:00:52,795][model8_pretrain.py][INFO] Epoch:[0/2](429100/4588595) loss:2.996 lr:0.0000100 epoch_Time:26376.0min: [2024-01-04 15:00:52,795][model8_pretrain.py][INFO] Epoch:[0/2](429100/4588595) loss:2.643 lr:0.0000100 epoch_Time:26376.0min: [2024-01-04 15:00:52,795][model8_pretrain.py][INFO] Epoch:[0/2](429100/4588595) loss:2.443 lr:0.0000100 epoch_Time:26376.0min: [2024-01-04 15:00:52,795][model8_pretrain.py][INFO] Epoch:[0/2](429100/4588595) loss:2.742 lr:0.0000100 epoch_Time:26376.0min: [2024-01-04 15:01:29,737][model8_pretrain.py][INFO] Epoch:[0/2](429200/4588595) loss:3.123 lr:0.0000100 epoch_Time:26376.0min: [2024-01-04 15:01:29,737][model8_pretrain.py][INFO] Epoch:[0/2](429200/4588595) loss:3.209 lr:0.0000100 epoch_Time:26376.0min: [2024-01-04 15:01:29,737][model8_pretrain.py][INFO] Epoch:[0/2](429200/4588595) loss:2.841 lr:0.0000100 epoch_Time:26376.0min: [2024-01-04 15:01:29,737][model8_pretrain.py][INFO] Epoch:[0/2](429200/4588595) loss:2.851 lr:0.0000100 epoch_Time:26376.0min: [2024-01-04 15:01:29,737][model8_pretrain.py][INFO] Epoch:[0/2](429200/4588595) loss:3.091 lr:0.0000100 epoch_Time:26376.0min: [2024-01-04 15:01:29,737][model8_pretrain.py][INFO] Epoch:[0/2](429200/4588595) loss:2.840 lr:0.0000100 epoch_Time:26376.0min: [2024-01-04 15:01:29,737][model8_pretrain.py][INFO] Epoch:[0/2](429200/4588595) loss:3.078 lr:0.0000100 epoch_Time:26376.0min: [2024-01-04 
15:01:29,737][model8_pretrain.py][INFO] Epoch:[0/2](429200/4588595) loss:2.551 lr:0.0000100 epoch_Time:26376.0min: [2024-01-04 15:02:06,682][model8_pretrain.py][INFO] Epoch:[0/2](429300/4588595) loss:2.421 lr:0.0000100 epoch_Time:26375.0min: [2024-01-04 15:02:06,682][model8_pretrain.py][INFO] Epoch:[0/2](429300/4588595) loss:3.142 lr:0.0000100 epoch_Time:26375.0min: [2024-01-04 15:02:06,682][model8_pretrain.py][INFO] Epoch:[0/2](429300/4588595) loss:3.109 lr:0.0000100 epoch_Time:26375.0min: [2024-01-04 15:02:06,682][model8_pretrain.py][INFO] Epoch:[0/2](429300/4588595) loss:2.125 lr:0.0000100 epoch_Time:26375.0min: [2024-01-04 15:02:06,682][model8_pretrain.py][INFO] Epoch:[0/2](429300/4588595) loss:2.901 lr:0.0000100 epoch_Time:26375.0min: [2024-01-04 15:02:06,682][model8_pretrain.py][INFO] Epoch:[0/2](429300/4588595) loss:2.680 lr:0.0000100 epoch_Time:26375.0min: [2024-01-04 15:02:06,682][model8_pretrain.py][INFO] Epoch:[0/2](429300/4588595) loss:2.214 lr:0.0000100 epoch_Time:26375.0min: [2024-01-04 15:02:06,682][model8_pretrain.py][INFO] Epoch:[0/2](429300/4588595) loss:3.046 lr:0.0000100 epoch_Time:26375.0min: [2024-01-04 15:02:43,628][model8_pretrain.py][INFO] Epoch:[0/2](429400/4588595) loss:2.993 lr:0.0000100 epoch_Time:26375.0min: [2024-01-04 15:02:43,628][model8_pretrain.py][INFO] Epoch:[0/2](429400/4588595) loss:3.146 lr:0.0000100 epoch_Time:26375.0min: [2024-01-04 15:02:43,628][model8_pretrain.py][INFO] Epoch:[0/2](429400/4588595) loss:3.093 lr:0.0000100 epoch_Time:26375.0min: [2024-01-04 15:02:43,628][model8_pretrain.py][INFO] Epoch:[0/2](429400/4588595) loss:3.091 lr:0.0000100 epoch_Time:26375.0min: [2024-01-04 15:02:43,628][model8_pretrain.py][INFO] Epoch:[0/2](429400/4588595) loss:3.009 lr:0.0000100 epoch_Time:26375.0min: [2024-01-04 15:02:43,628][model8_pretrain.py][INFO] Epoch:[0/2](429400/4588595) loss:2.890 lr:0.0000100 epoch_Time:26375.0min: [2024-01-04 15:02:43,628][model8_pretrain.py][INFO] Epoch:[0/2](429400/4588595) loss:2.569 lr:0.0000100 
epoch_Time:26375.0min: [2024-01-04 15:02:43,628][model8_pretrain.py][INFO] Epoch:[0/2](429400/4588595) loss:3.023 lr:0.0000100 epoch_Time:26375.0min: [2024-01-04 15:03:20,578][model8_pretrain.py][INFO] Epoch:[0/2](429500/4588595) loss:2.505 lr:0.0000100 epoch_Time:26374.0min: [2024-01-04 15:03:20,578][model8_pretrain.py][INFO] Epoch:[0/2](429500/4588595) loss:2.677 lr:0.0000100 epoch_Time:26374.0min: [2024-01-04 15:03:20,578][model8_pretrain.py][INFO] Epoch:[0/2](429500/4588595) loss:2.641 lr:0.0000100 epoch_Time:26374.0min: [2024-01-04 15:03:20,578][model8_pretrain.py][INFO] Epoch:[0/2](429500/4588595) loss:3.044 lr:0.0000100 epoch_Time:26374.0min: [2024-01-04 15:03:20,579][model8_pretrain.py][INFO] Epoch:[0/2](429500/4588595) loss:2.750 lr:0.0000100 epoch_Time:26374.0min: [2024-01-04 15:03:20,579][model8_pretrain.py][INFO] Epoch:[0/2](429500/4588595) loss:2.880 lr:0.0000100 epoch_Time:26374.0min: [2024-01-04 15:03:20,579][model8_pretrain.py][INFO] Epoch:[0/2](429500/4588595) loss:2.767 lr:0.0000100 epoch_Time:26374.0min: [2024-01-04 15:03:20,578][model8_pretrain.py][INFO] Epoch:[0/2](429500/4588595) loss:3.077 lr:0.0000100 epoch_Time:26374.0min: [2024-01-04 15:03:57,517][model8_pretrain.py][INFO] Epoch:[0/2](429600/4588595) loss:3.005 lr:0.0000100 epoch_Time:26372.0min: [2024-01-04 15:03:57,517][model8_pretrain.py][INFO] Epoch:[0/2](429600/4588595) loss:3.313 lr:0.0000100 epoch_Time:26372.0min: [2024-01-04 15:03:57,517][model8_pretrain.py][INFO] Epoch:[0/2](429600/4588595) loss:2.652 lr:0.0000100 epoch_Time:26372.0min: [2024-01-04 15:03:57,517][model8_pretrain.py][INFO] Epoch:[0/2](429600/4588595) loss:3.449 lr:0.0000100 epoch_Time:26372.0min: [2024-01-04 15:03:57,517][model8_pretrain.py][INFO] Epoch:[0/2](429600/4588595) loss:2.720 lr:0.0000100 epoch_Time:26372.0min: [2024-01-04 15:03:57,518][model8_pretrain.py][INFO] Epoch:[0/2](429600/4588595) loss:2.995 lr:0.0000100 epoch_Time:26372.0min: [2024-01-04 15:03:57,518][model8_pretrain.py][INFO] 
Epoch:[0/2](429600/4588595) loss:2.888 lr:0.0000100 epoch_Time:26372.0min: [2024-01-04 15:03:57,518][model8_pretrain.py][INFO] Epoch:[0/2](429600/4588595) loss:2.657 lr:0.0000100 epoch_Time:26372.0min: [2024-01-04 15:04:43,267][model8_pretrain.py][INFO] Epoch:[0/2](429700/4588595) loss:2.669 lr:0.0000100 epoch_Time:26374.0min: [2024-01-04 15:04:43,267][model8_pretrain.py][INFO] Epoch:[0/2](429700/4588595) loss:2.479 lr:0.0000100 epoch_Time:26374.0min: [2024-01-04 15:04:43,267][model8_pretrain.py][INFO] Epoch:[0/2](429700/4588595) loss:2.696 lr:0.0000100 epoch_Time:26374.0min: [2024-01-04 15:04:43,267][model8_pretrain.py][INFO] Epoch:[0/2](429700/4588595) loss:2.601 lr:0.0000100 epoch_Time:26374.0min: [2024-01-04 15:04:43,267][model8_pretrain.py][INFO] Epoch:[0/2](429700/4588595) loss:3.079 lr:0.0000100 epoch_Time:26374.0min: [2024-01-04 15:04:43,267][model8_pretrain.py][INFO] Epoch:[0/2](429700/4588595) loss:3.094 lr:0.0000100 epoch_Time:26374.0min: [2024-01-04 15:04:43,267][model8_pretrain.py][INFO] Epoch:[0/2](429700/4588595) loss:3.255 lr:0.0000100 epoch_Time:26374.0min: [2024-01-04 15:04:43,267][model8_pretrain.py][INFO] Epoch:[0/2](429700/4588595) loss:2.550 lr:0.0000100 epoch_Time:26374.0min: [2024-01-04 15:05:21,873][model8_pretrain.py][INFO] Epoch:[0/2](429800/4588595) loss:2.937 lr:0.0000100 epoch_Time:26373.0min: [2024-01-04 15:05:21,873][model8_pretrain.py][INFO] Epoch:[0/2](429800/4588595) loss:3.267 lr:0.0000100 epoch_Time:26373.0min: [2024-01-04 15:05:21,873][model8_pretrain.py][INFO] Epoch:[0/2](429800/4588595) loss:2.388 lr:0.0000100 epoch_Time:26373.0min: [2024-01-04 15:05:21,873][model8_pretrain.py][INFO] Epoch:[0/2](429800/4588595) loss:2.009 lr:0.0000100 epoch_Time:26373.0min: [2024-01-04 15:05:21,873][model8_pretrain.py][INFO] Epoch:[0/2](429800/4588595) loss:2.901 lr:0.0000100 epoch_Time:26373.0min: [2024-01-04 15:05:21,873][model8_pretrain.py][INFO] Epoch:[0/2](429800/4588595) loss:2.525 lr:0.0000100 epoch_Time:26373.0min: [2024-01-04 
15:05:21,873][model8_pretrain.py][INFO] Epoch:[0/2](429800/4588595) loss:2.973 lr:0.0000100 epoch_Time:26373.0min: [2024-01-04 15:05:21,874][model8_pretrain.py][INFO] Epoch:[0/2](429800/4588595) loss:2.543 lr:0.0000100 epoch_Time:26373.0min: [2024-01-04 15:05:58,810][model8_pretrain.py][INFO] Epoch:[0/2](429900/4588595) loss:3.017 lr:0.0000100 epoch_Time:26372.0min: [2024-01-04 15:05:58,810][model8_pretrain.py][INFO] Epoch:[0/2](429900/4588595) loss:2.618 lr:0.0000100 epoch_Time:26372.0min: [2024-01-04 15:05:58,810][model8_pretrain.py][INFO] Epoch:[0/2](429900/4588595) loss:3.148 lr:0.0000100 epoch_Time:26372.0min: [2024-01-04 15:05:58,810][model8_pretrain.py][INFO] Epoch:[0/2](429900/4588595) loss:2.880 lr:0.0000100 epoch_Time:26372.0min: [2024-01-04 15:05:58,810][model8_pretrain.py][INFO] Epoch:[0/2](429900/4588595) loss:2.882 lr:0.0000100 epoch_Time:26372.0min: [2024-01-04 15:05:58,810][model8_pretrain.py][INFO] Epoch:[0/2](429900/4588595) loss:3.160 lr:0.0000100 epoch_Time:26372.0min: [2024-01-04 15:05:58,810][model8_pretrain.py][INFO] Epoch:[0/2](429900/4588595) loss:2.662 lr:0.0000100 epoch_Time:26372.0min: [2024-01-04 15:05:58,810][model8_pretrain.py][INFO] Epoch:[0/2](429900/4588595) loss:3.028 lr:0.0000100 epoch_Time:26372.0min: [2024-01-04 15:06:35,752][model8_pretrain.py][INFO] Epoch:[0/2](430000/4588595) loss:2.943 lr:0.0000100 epoch_Time:26372.0min: [2024-01-04 15:06:35,752][model8_pretrain.py][INFO] Epoch:[0/2](430000/4588595) loss:2.978 lr:0.0000100 epoch_Time:26372.0min: [2024-01-04 15:06:35,752][model8_pretrain.py][INFO] Epoch:[0/2](430000/4588595) loss:2.449 lr:0.0000100 epoch_Time:26372.0min: [2024-01-04 15:06:35,752][model8_pretrain.py][INFO] Epoch:[0/2](430000/4588595) loss:3.010 lr:0.0000100 epoch_Time:26372.0min: [2024-01-04 15:06:35,752][model8_pretrain.py][INFO] Epoch:[0/2](430000/4588595) loss:2.957 lr:0.0000100 epoch_Time:26372.0min: [2024-01-04 15:06:35,752][model8_pretrain.py][INFO] Epoch:[0/2](430000/4588595) loss:2.617 lr:0.0000100 
epoch_Time:26372.0min: [2024-01-04 15:06:35,752][model8_pretrain.py][INFO] Epoch:[0/2](430000/4588595) loss:2.263 lr:0.0000100 epoch_Time:26372.0min: [2024-01-04 15:06:35,752][model8_pretrain.py][INFO] Epoch:[0/2](430000/4588595) loss:2.258 lr:0.0000100 epoch_Time:26372.0min: [2024-01-04 15:07:12,692][model8_pretrain.py][INFO] Epoch:[0/2](430100/4588595) loss:3.084 lr:0.0000100 epoch_Time:26370.0min: [2024-01-04 15:07:12,692][model8_pretrain.py][INFO] Epoch:[0/2](430100/4588595) loss:3.173 lr:0.0000100 epoch_Time:26370.0min: [2024-01-04 15:07:12,692][model8_pretrain.py][INFO] Epoch:[0/2](430100/4588595) loss:2.288 lr:0.0000100 epoch_Time:26370.0min: [2024-01-04 15:07:12,692][model8_pretrain.py][INFO] Epoch:[0/2](430100/4588595) loss:2.870 lr:0.0000100 epoch_Time:26370.0min: [2024-01-04 15:07:12,692][model8_pretrain.py][INFO] Epoch:[0/2](430100/4588595) loss:2.588 lr:0.0000100 epoch_Time:26370.0min: [2024-01-04 15:07:12,692][model8_pretrain.py][INFO] Epoch:[0/2](430100/4588595) loss:3.073 lr:0.0000100 epoch_Time:26370.0min: [2024-01-04 15:07:12,692][model8_pretrain.py][INFO] Epoch:[0/2](430100/4588595) loss:1.893 lr:0.0000100 epoch_Time:26370.0min: [2024-01-04 15:07:12,693][model8_pretrain.py][INFO] Epoch:[0/2](430100/4588595) loss:2.816 lr:0.0000100 epoch_Time:26370.0min: [2024-01-04 15:07:49,664][model8_pretrain.py][INFO] Epoch:[0/2](430200/4588595) loss:2.764 lr:0.0000100 epoch_Time:26369.0min: [2024-01-04 15:07:49,664][model8_pretrain.py][INFO] Epoch:[0/2](430200/4588595) loss:2.782 lr:0.0000100 epoch_Time:26369.0min: [2024-01-04 15:07:49,664][model8_pretrain.py][INFO] Epoch:[0/2](430200/4588595) loss:2.672 lr:0.0000100 epoch_Time:26369.0min: [2024-01-04 15:07:49,664][model8_pretrain.py][INFO] Epoch:[0/2](430200/4588595) loss:2.937 lr:0.0000100 epoch_Time:26369.0min: [2024-01-04 15:07:49,664][model8_pretrain.py][INFO] Epoch:[0/2](430200/4588595) loss:3.066 lr:0.0000100 epoch_Time:26369.0min: [2024-01-04 15:07:49,664][model8_pretrain.py][INFO] 
Epoch:[0/2](430200/4588595) loss:2.655 lr:0.0000100 epoch_Time:26369.0min: [2024-01-04 15:07:49,664][model8_pretrain.py][INFO] Epoch:[0/2](430200/4588595) loss:2.568 lr:0.0000100 epoch_Time:26369.0min: [2024-01-04 15:07:49,664][model8_pretrain.py][INFO] Epoch:[0/2](430200/4588595) loss:3.009 lr:0.0000100 epoch_Time:26369.0min: [2024-01-04 15:08:26,608][model8_pretrain.py][INFO] Epoch:[0/2](430300/4588595) loss:3.287 lr:0.0000100 epoch_Time:26369.0min: [2024-01-04 15:08:26,608][model8_pretrain.py][INFO] Epoch:[0/2](430300/4588595) loss:2.322 lr:0.0000100 epoch_Time:26369.0min: [2024-01-04 15:08:26,608][model8_pretrain.py][INFO] Epoch:[0/2](430300/4588595) loss:3.453 lr:0.0000100 epoch_Time:26369.0min: [2024-01-04 15:08:26,608][model8_pretrain.py][INFO] Epoch:[0/2](430300/4588595) loss:2.831 lr:0.0000100 epoch_Time:26369.0min: [2024-01-04 15:08:26,608][model8_pretrain.py][INFO] Epoch:[0/2](430300/4588595) loss:2.869 lr:0.0000100 epoch_Time:26369.0min: [2024-01-04 15:08:26,608][model8_pretrain.py][INFO] Epoch:[0/2](430300/4588595) loss:2.995 lr:0.0000100 epoch_Time:26369.0min: [2024-01-04 15:08:26,608][model8_pretrain.py][INFO] Epoch:[0/2](430300/4588595) loss:2.857 lr:0.0000100 epoch_Time:26369.0min: [2024-01-04 15:08:26,609][model8_pretrain.py][INFO] Epoch:[0/2](430300/4588595) loss:3.360 lr:0.0000100 epoch_Time:26369.0min: [2024-01-04 15:09:03,543][model8_pretrain.py][INFO] Epoch:[0/2](430400/4588595) loss:3.378 lr:0.0000100 epoch_Time:26368.0min: [2024-01-04 15:09:03,543][model8_pretrain.py][INFO] Epoch:[0/2](430400/4588595) loss:3.013 lr:0.0000100 epoch_Time:26368.0min: [2024-01-04 15:09:03,543][model8_pretrain.py][INFO] Epoch:[0/2](430400/4588595) loss:3.040 lr:0.0000100 epoch_Time:26368.0min: [2024-01-04 15:09:03,543][model8_pretrain.py][INFO] Epoch:[0/2](430400/4588595) loss:2.993 lr:0.0000100 epoch_Time:26368.0min: [2024-01-04 15:09:03,543][model8_pretrain.py][INFO] Epoch:[0/2](430400/4588595) loss:2.847 lr:0.0000100 epoch_Time:26368.0min: [2024-01-04 
15:09:03,543][model8_pretrain.py][INFO] Epoch:[0/2](430400/4588595) loss:2.591 lr:0.0000100 epoch_Time:26368.0min: [2024-01-04 15:09:03,543][model8_pretrain.py][INFO] Epoch:[0/2](430400/4588595) loss:2.764 lr:0.0000100 epoch_Time:26368.0min: [2024-01-04 15:09:03,543][model8_pretrain.py][INFO] Epoch:[0/2](430400/4588595) loss:3.701 lr:0.0000100 epoch_Time:26368.0min: [2024-01-04 15:09:45,652][model8_pretrain.py][INFO] Epoch:[0/2](430500/4588595) loss:3.385 lr:0.0000100 epoch_Time:26368.0min: [2024-01-04 15:09:45,652][model8_pretrain.py][INFO] Epoch:[0/2](430500/4588595) loss:2.127 lr:0.0000100 epoch_Time:26368.0min: [2024-01-04 15:09:45,652][model8_pretrain.py][INFO] Epoch:[0/2](430500/4588595) loss:2.712 lr:0.0000100 epoch_Time:26368.0min: [2024-01-04 15:09:45,652][model8_pretrain.py][INFO] Epoch:[0/2](430500/4588595) loss:3.366 lr:0.0000100 epoch_Time:26368.0min: [2024-01-04 15:09:45,652][model8_pretrain.py][INFO] Epoch:[0/2](430500/4588595) loss:3.285 lr:0.0000100 epoch_Time:26368.0min: [2024-01-04 15:09:45,652][model8_pretrain.py][INFO] Epoch:[0/2](430500/4588595) loss:2.679 lr:0.0000100 epoch_Time:26368.0min: [2024-01-04 15:09:45,652][model8_pretrain.py][INFO] Epoch:[0/2](430500/4588595) loss:2.638 lr:0.0000100 epoch_Time:26368.0min: [2024-01-04 15:09:47,339][model8_pretrain.py][INFO] Epoch:[0/2](430500/4588595) loss:2.821 lr:0.0000100 epoch_Time:26369.0min: [2024-01-04 15:10:27,779][model8_pretrain.py][INFO] Epoch:[0/2](430600/4588595) loss:2.869 lr:0.0000100 epoch_Time:26368.0min: [2024-01-04 15:10:27,779][model8_pretrain.py][INFO] Epoch:[0/2](430600/4588595) loss:3.061 lr:0.0000100 epoch_Time:26368.0min: [2024-01-04 15:10:27,779][model8_pretrain.py][INFO] Epoch:[0/2](430600/4588595) loss:2.841 lr:0.0000100 epoch_Time:26368.0min: [2024-01-04 15:10:27,779][model8_pretrain.py][INFO] Epoch:[0/2](430600/4588595) loss:2.978 lr:0.0000100 epoch_Time:26368.0min: [2024-01-04 15:10:27,779][model8_pretrain.py][INFO] Epoch:[0/2](430600/4588595) loss:2.479 lr:0.0000100 
epoch_Time:26368.0min: [2024-01-04 15:10:27,779][model8_pretrain.py][INFO] Epoch:[0/2](430600/4588595) loss:2.889 lr:0.0000100 epoch_Time:26368.0min: [2024-01-04 15:10:27,779][model8_pretrain.py][INFO] Epoch:[0/2](430600/4588595) loss:2.752 lr:0.0000100 epoch_Time:26368.0min: [2024-01-04 15:10:27,779][model8_pretrain.py][INFO] Epoch:[0/2](430600/4588595) loss:2.590 lr:0.0000100 epoch_Time:26368.0min: [2024-01-04 15:11:04,737][model8_pretrain.py][INFO] Epoch:[0/2](430700/4588595) loss:2.888 lr:0.0000100 epoch_Time:26367.0min: [2024-01-04 15:11:04,737][model8_pretrain.py][INFO] Epoch:[0/2](430700/4588595) loss:3.109 lr:0.0000100 epoch_Time:26367.0min: [2024-01-04 15:11:04,737][model8_pretrain.py][INFO] Epoch:[0/2](430700/4588595) loss:3.044 lr:0.0000100 epoch_Time:26367.0min: [2024-01-04 15:11:04,737][model8_pretrain.py][INFO] Epoch:[0/2](430700/4588595) loss:2.934 lr:0.0000100 epoch_Time:26367.0min: [2024-01-04 15:11:04,737][model8_pretrain.py][INFO] Epoch:[0/2](430700/4588595) loss:1.901 lr:0.0000100 epoch_Time:26367.0min: [2024-01-04 15:11:04,737][model8_pretrain.py][INFO] Epoch:[0/2](430700/4588595) loss:2.770 lr:0.0000100 epoch_Time:26367.0min: [2024-01-04 15:11:04,737][model8_pretrain.py][INFO] Epoch:[0/2](430700/4588595) loss:2.590 lr:0.0000100 epoch_Time:26367.0min: [2024-01-04 15:11:04,738][model8_pretrain.py][INFO] Epoch:[0/2](430700/4588595) loss:2.686 lr:0.0000100 epoch_Time:26367.0min: [2024-01-04 15:11:41,692][model8_pretrain.py][INFO] Epoch:[0/2](430800/4588595) loss:2.696 lr:0.0000100 epoch_Time:26367.0min: [2024-01-04 15:11:41,692][model8_pretrain.py][INFO] Epoch:[0/2](430800/4588595) loss:2.125 lr:0.0000100 epoch_Time:26367.0min: [2024-01-04 15:11:41,692][model8_pretrain.py][INFO] Epoch:[0/2](430800/4588595) loss:2.871 lr:0.0000100 epoch_Time:26367.0min: [2024-01-04 15:11:41,692][model8_pretrain.py][INFO] Epoch:[0/2](430800/4588595) loss:2.744 lr:0.0000100 epoch_Time:26367.0min: [2024-01-04 15:11:41,692][model8_pretrain.py][INFO] 
Epoch:[0/2](430800/4588595) loss:2.668 lr:0.0000100 epoch_Time:26367.0min: [2024-01-04 15:11:41,692][model8_pretrain.py][INFO] Epoch:[0/2](430800/4588595) loss:3.278 lr:0.0000100 epoch_Time:26367.0min: [2024-01-04 15:11:41,692][model8_pretrain.py][INFO] Epoch:[0/2](430800/4588595) loss:2.525 lr:0.0000100 epoch_Time:26367.0min: [2024-01-04 15:11:41,692][model8_pretrain.py][INFO] Epoch:[0/2](430800/4588595) loss:2.790 lr:0.0000100 epoch_Time:26367.0min: [2024-01-04 15:12:18,652][model8_pretrain.py][INFO] Epoch:[0/2](430900/4588595) loss:2.962 lr:0.0000100 epoch_Time:26366.0min: [2024-01-04 15:12:18,652][model8_pretrain.py][INFO] Epoch:[0/2](430900/4588595) loss:2.962 lr:0.0000100 epoch_Time:26366.0min: [2024-01-04 15:12:18,652][model8_pretrain.py][INFO] Epoch:[0/2](430900/4588595) loss:2.378 lr:0.0000100 epoch_Time:26366.0min: [2024-01-04 15:12:18,652][model8_pretrain.py][INFO] Epoch:[0/2](430900/4588595) loss:2.686 lr:0.0000100 epoch_Time:26366.0min: [2024-01-04 15:12:18,652][model8_pretrain.py][INFO] Epoch:[0/2](430900/4588595) loss:3.142 lr:0.0000100 epoch_Time:26366.0min: [2024-01-04 15:12:18,652][model8_pretrain.py][INFO] Epoch:[0/2](430900/4588595) loss:2.305 lr:0.0000100 epoch_Time:26366.0min: [2024-01-04 15:12:18,652][model8_pretrain.py][INFO] Epoch:[0/2](430900/4588595) loss:2.886 lr:0.0000100 epoch_Time:26366.0min: [2024-01-04 15:12:18,652][model8_pretrain.py][INFO] Epoch:[0/2](430900/4588595) loss:2.365 lr:0.0000100 epoch_Time:26366.0min: [2024-01-04 15:12:55,622][model8_pretrain.py][INFO] Epoch:[0/2](431000/4588595) loss:2.925 lr:0.0000100 epoch_Time:26364.0min: [2024-01-04 15:12:55,622][model8_pretrain.py][INFO] Epoch:[0/2](431000/4588595) loss:2.579 lr:0.0000100 epoch_Time:26364.0min: [2024-01-04 15:12:55,622][model8_pretrain.py][INFO] Epoch:[0/2](431000/4588595) loss:3.150 lr:0.0000100 epoch_Time:26364.0min: [2024-01-04 15:12:55,622][model8_pretrain.py][INFO] Epoch:[0/2](431000/4588595) loss:2.888 lr:0.0000100 epoch_Time:26364.0min: [2024-01-04 
15:12:55,622][model8_pretrain.py][INFO] Epoch:[0/2](431000/4588595) loss:2.332 lr:0.0000100 epoch_Time:26364.0min: [2024-01-04 15:12:55,622][model8_pretrain.py][INFO] Epoch:[0/2](431000/4588595) loss:2.980 lr:0.0000100 epoch_Time:26364.0min: [2024-01-04 15:12:55,622][model8_pretrain.py][INFO] Epoch:[0/2](431000/4588595) loss:3.522 lr:0.0000100 epoch_Time:26364.0min: [2024-01-04 15:12:55,622][model8_pretrain.py][INFO] Epoch:[0/2](431000/4588595) loss:2.687 lr:0.0000100 epoch_Time:26364.0min: [2024-01-04 15:13:32,580][model8_pretrain.py][INFO] Epoch:[0/2](431100/4588595) loss:3.185 lr:0.0000100 epoch_Time:26364.0min: [2024-01-04 15:13:32,580][model8_pretrain.py][INFO] Epoch:[0/2](431100/4588595) loss:2.980 lr:0.0000100 epoch_Time:26364.0min: [2024-01-04 15:13:32,580][model8_pretrain.py][INFO] Epoch:[0/2](431100/4588595) loss:3.155 lr:0.0000100 epoch_Time:26364.0min: [2024-01-04 15:13:32,580][model8_pretrain.py][INFO] Epoch:[0/2](431100/4588595) loss:3.256 lr:0.0000100 epoch_Time:26364.0min: [2024-01-04 15:13:32,580][model8_pretrain.py][INFO] Epoch:[0/2](431100/4588595) loss:2.648 lr:0.0000100 epoch_Time:26364.0min: [2024-01-04 15:13:32,580][model8_pretrain.py][INFO] Epoch:[0/2](431100/4588595) loss:2.850 lr:0.0000100 epoch_Time:26364.0min: [2024-01-04 15:13:32,580][model8_pretrain.py][INFO] Epoch:[0/2](431100/4588595) loss:3.141 lr:0.0000100 epoch_Time:26364.0min: [2024-01-04 15:13:32,581][model8_pretrain.py][INFO] Epoch:[0/2](431100/4588595) loss:3.071 lr:0.0000100 epoch_Time:26364.0min: [2024-01-04 15:14:09,546][model8_pretrain.py][INFO] Epoch:[0/2](431200/4588595) loss:3.141 lr:0.0000100 epoch_Time:26363.0min: [2024-01-04 15:14:09,546][model8_pretrain.py][INFO] Epoch:[0/2](431200/4588595) loss:3.217 lr:0.0000100 epoch_Time:26363.0min: [2024-01-04 15:14:09,546][model8_pretrain.py][INFO] Epoch:[0/2](431200/4588595) loss:3.135 lr:0.0000100 epoch_Time:26363.0min: [2024-01-04 15:14:09,546][model8_pretrain.py][INFO] Epoch:[0/2](431200/4588595) loss:2.576 lr:0.0000100 
epoch_Time:26363.0min: [2024-01-04 15:14:09,546][model8_pretrain.py][INFO] Epoch:[0/2](431200/4588595) loss:2.438 lr:0.0000100 epoch_Time:26363.0min: [2024-01-04 15:14:09,546][model8_pretrain.py][INFO] Epoch:[0/2](431200/4588595) loss:2.895 lr:0.0000100 epoch_Time:26363.0min: [2024-01-04 15:14:09,546][model8_pretrain.py][INFO] Epoch:[0/2](431200/4588595) loss:3.206 lr:0.0000100 epoch_Time:26363.0min: [2024-01-04 15:14:09,546][model8_pretrain.py][INFO] Epoch:[0/2](431200/4588595) loss:2.939 lr:0.0000100 epoch_Time:26363.0min: [2024-01-04 15:14:51,665][model8_pretrain.py][INFO] Epoch:[0/2](431300/4588595) loss:3.004 lr:0.0000100 epoch_Time:26363.0min: [2024-01-04 15:14:51,665][model8_pretrain.py][INFO] Epoch:[0/2](431300/4588595) loss:3.321 lr:0.0000100 epoch_Time:26363.0min: [2024-01-04 15:14:51,665][model8_pretrain.py][INFO] Epoch:[0/2](431300/4588595) loss:2.307 lr:0.0000100 epoch_Time:26363.0min: [2024-01-04 15:14:51,665][model8_pretrain.py][INFO] Epoch:[0/2](431300/4588595) loss:2.941 lr:0.0000100 epoch_Time:26363.0min: [2024-01-04 15:14:51,665][model8_pretrain.py][INFO] Epoch:[0/2](431300/4588595) loss:3.170 lr:0.0000100 epoch_Time:26363.0min: [2024-01-04 15:14:51,665][model8_pretrain.py][INFO] Epoch:[0/2](431300/4588595) loss:3.025 lr:0.0000100 epoch_Time:26363.0min: [2024-01-04 15:14:51,665][model8_pretrain.py][INFO] Epoch:[0/2](431300/4588595) loss:2.901 lr:0.0000100 epoch_Time:26363.0min: [2024-01-04 15:14:51,665][model8_pretrain.py][INFO] Epoch:[0/2](431300/4588595) loss:2.191 lr:0.0000100 epoch_Time:26363.0min: [2024-01-04 15:15:33,742][model8_pretrain.py][INFO] Epoch:[0/2](431400/4588595) loss:2.894 lr:0.0000100 epoch_Time:26363.0min: [2024-01-04 15:15:33,742][model8_pretrain.py][INFO] Epoch:[0/2](431400/4588595) loss:2.687 lr:0.0000100 epoch_Time:26363.0min: [2024-01-04 15:15:33,742][model8_pretrain.py][INFO] Epoch:[0/2](431400/4588595) loss:3.144 lr:0.0000100 epoch_Time:26363.0min: [2024-01-04 15:15:33,742][model8_pretrain.py][INFO] 
Epoch:[0/2](431400/4588595) loss:2.765 lr:0.0000100 epoch_Time:26363.0min: [2024-01-04 15:15:33,742][model8_pretrain.py][INFO] Epoch:[0/2](431400/4588595) loss:2.337 lr:0.0000100 epoch_Time:26363.0min: [2024-01-04 15:15:33,742][model8_pretrain.py][INFO] Epoch:[0/2](431400/4588595) loss:2.710 lr:0.0000100 epoch_Time:26363.0min: [2024-01-04 15:15:33,742][model8_pretrain.py][INFO] Epoch:[0/2](431400/4588595) loss:2.615 lr:0.0000100 epoch_Time:26364.0min: [2024-01-04 15:15:33,742][model8_pretrain.py][INFO] Epoch:[0/2](431400/4588595) loss:2.989 lr:0.0000100 epoch_Time:26363.0min: [2024-01-04 15:16:10,677][model8_pretrain.py][INFO] Epoch:[0/2](431500/4588595) loss:2.550 lr:0.0000100 epoch_Time:26362.0min: [2024-01-04 15:16:10,677][model8_pretrain.py][INFO] Epoch:[0/2](431500/4588595) loss:2.727 lr:0.0000100 epoch_Time:26362.0min: [2024-01-04 15:16:10,678][model8_pretrain.py][INFO] Epoch:[0/2](431500/4588595) loss:3.014 lr:0.0000100 epoch_Time:26362.0min: [2024-01-04 15:16:10,678][model8_pretrain.py][INFO] Epoch:[0/2](431500/4588595) loss:2.487 lr:0.0000100 epoch_Time:26362.0min: [2024-01-04 15:16:10,678][model8_pretrain.py][INFO] Epoch:[0/2](431500/4588595) loss:2.981 lr:0.0000100 epoch_Time:26362.0min: [2024-01-04 15:16:10,678][model8_pretrain.py][INFO] Epoch:[0/2](431500/4588595) loss:3.169 lr:0.0000100 epoch_Time:26362.0min: [2024-01-04 15:16:10,678][model8_pretrain.py][INFO] Epoch:[0/2](431500/4588595) loss:2.992 lr:0.0000100 epoch_Time:26362.0min: [2024-01-04 15:16:10,678][model8_pretrain.py][INFO] Epoch:[0/2](431500/4588595) loss:3.029 lr:0.0000100 epoch_Time:26362.0min: [2024-01-04 15:16:47,612][model8_pretrain.py][INFO] Epoch:[0/2](431600/4588595) loss:2.925 lr:0.0000100 epoch_Time:26362.0min: [2024-01-04 15:16:47,612][model8_pretrain.py][INFO] Epoch:[0/2](431600/4588595) loss:2.555 lr:0.0000100 epoch_Time:26361.0min: [2024-01-04 15:16:47,612][model8_pretrain.py][INFO] Epoch:[0/2](431600/4588595) loss:3.138 lr:0.0000100 epoch_Time:26361.0min: [2024-01-04 
15:16:47,612][model8_pretrain.py][INFO] Epoch:[0/2](431600/4588595) loss:3.241 lr:0.0000100 epoch_Time:26361.0min: [2024-01-04 15:16:47,612][model8_pretrain.py][INFO] Epoch:[0/2](431600/4588595) loss:3.372 lr:0.0000100 epoch_Time:26361.0min: [2024-01-04 15:16:47,612][model8_pretrain.py][INFO] Epoch:[0/2](431600/4588595) loss:2.876 lr:0.0000100 epoch_Time:26361.0min: [2024-01-04 15:16:47,612][model8_pretrain.py][INFO] Epoch:[0/2](431600/4588595) loss:3.001 lr:0.0000100 epoch_Time:26361.0min: [2024-01-04 15:16:47,612][model8_pretrain.py][INFO] Epoch:[0/2](431600/4588595) loss:3.000 lr:0.0000100 epoch_Time:26362.0min: [2024-01-04 15:17:24,517][model8_pretrain.py][INFO] Epoch:[0/2](431700/4588595) loss:3.141 lr:0.0000100 epoch_Time:26361.0min: [2024-01-04 15:17:24,517][model8_pretrain.py][INFO] Epoch:[0/2](431700/4588595) loss:3.100 lr:0.0000100 epoch_Time:26361.0min: [2024-01-04 15:17:24,517][model8_pretrain.py][INFO] Epoch:[0/2](431700/4588595) loss:2.759 lr:0.0000100 epoch_Time:26361.0min: [2024-01-04 15:17:24,517][model8_pretrain.py][INFO] Epoch:[0/2](431700/4588595) loss:2.967 lr:0.0000100 epoch_Time:26361.0min: [2024-01-04 15:17:24,517][model8_pretrain.py][INFO] Epoch:[0/2](431700/4588595) loss:2.993 lr:0.0000100 epoch_Time:26361.0min: [2024-01-04 15:17:24,517][model8_pretrain.py][INFO] Epoch:[0/2](431700/4588595) loss:2.858 lr:0.0000100 epoch_Time:26361.0min: [2024-01-04 15:17:24,517][model8_pretrain.py][INFO] Epoch:[0/2](431700/4588595) loss:2.995 lr:0.0000100 epoch_Time:26361.0min: [2024-01-04 15:17:24,517][model8_pretrain.py][INFO] Epoch:[0/2](431700/4588595) loss:2.574 lr:0.0000100 epoch_Time:26361.0min: [2024-01-04 15:18:01,460][model8_pretrain.py][INFO] Epoch:[0/2](431800/4588595) loss:3.263 lr:0.0000100 epoch_Time:26360.0min: [2024-01-04 15:18:01,460][model8_pretrain.py][INFO] Epoch:[0/2](431800/4588595) loss:3.359 lr:0.0000100 epoch_Time:26360.0min: [2024-01-04 15:18:01,460][model8_pretrain.py][INFO] Epoch:[0/2](431800/4588595) loss:2.951 lr:0.0000100 
epoch_Time:26360.0min: [2024-01-04 15:18:01,460][model8_pretrain.py][INFO] Epoch:[0/2](431800/4588595) loss:2.565 lr:0.0000100 epoch_Time:26360.0min: [2024-01-04 15:18:01,460][model8_pretrain.py][INFO] Epoch:[0/2](431800/4588595) loss:2.479 lr:0.0000100 epoch_Time:26360.0min: [2024-01-04 15:18:01,460][model8_pretrain.py][INFO] Epoch:[0/2](431800/4588595) loss:3.113 lr:0.0000100 epoch_Time:26360.0min: [2024-01-04 15:18:01,460][model8_pretrain.py][INFO] Epoch:[0/2](431800/4588595) loss:2.344 lr:0.0000100 epoch_Time:26360.0min: [2024-01-04 15:18:01,460][model8_pretrain.py][INFO] Epoch:[0/2](431800/4588595) loss:2.667 lr:0.0000100 epoch_Time:26360.0min: [2024-01-04 15:18:38,415][model8_pretrain.py][INFO] Epoch:[0/2](431900/4588595) loss:3.529 lr:0.0000100 epoch_Time:26360.0min: [2024-01-04 15:18:38,415][model8_pretrain.py][INFO] Epoch:[0/2](431900/4588595) loss:2.831 lr:0.0000100 epoch_Time:26360.0min: [2024-01-04 15:18:38,415][model8_pretrain.py][INFO] Epoch:[0/2](431900/4588595) loss:2.673 lr:0.0000100 epoch_Time:26360.0min: [2024-01-04 15:18:38,415][model8_pretrain.py][INFO] Epoch:[0/2](431900/4588595) loss:2.591 lr:0.0000100 epoch_Time:26360.0min: [2024-01-04 15:18:38,415][model8_pretrain.py][INFO] Epoch:[0/2](431900/4588595) loss:2.888 lr:0.0000100 epoch_Time:26360.0min: [2024-01-04 15:18:38,415][model8_pretrain.py][INFO] Epoch:[0/2](431900/4588595) loss:2.571 lr:0.0000100 epoch_Time:26360.0min: [2024-01-04 15:18:38,415][model8_pretrain.py][INFO] Epoch:[0/2](431900/4588595) loss:2.567 lr:0.0000100 epoch_Time:26360.0min: [2024-01-04 15:18:38,416][model8_pretrain.py][INFO] Epoch:[0/2](431900/4588595) loss:2.328 lr:0.0000100 epoch_Time:26360.0min: [2024-01-04 15:19:15,347][model8_pretrain.py][INFO] Epoch:[0/2](432000/4588595) loss:2.477 lr:0.0000100 epoch_Time:26358.0min: [2024-01-04 15:19:15,348][model8_pretrain.py][INFO] Epoch:[0/2](432000/4588595) loss:2.577 lr:0.0000100 epoch_Time:26358.0min: [2024-01-04 15:19:15,348][model8_pretrain.py][INFO] 
Epoch:[0/2](432000/4588595) loss:2.693 lr:0.0000100 epoch_Time:26358.0min: [2024-01-04 15:19:15,348][model8_pretrain.py][INFO] Epoch:[0/2](432000/4588595) loss:2.325 lr:0.0000100 epoch_Time:26358.0min: [2024-01-04 15:19:15,348][model8_pretrain.py][INFO] Epoch:[0/2](432000/4588595) loss:2.954 lr:0.0000100 epoch_Time:26358.0min: [2024-01-04 15:19:15,348][model8_pretrain.py][INFO] Epoch:[0/2](432000/4588595) loss:3.118 lr:0.0000100 epoch_Time:26358.0min: [2024-01-04 15:19:15,348][model8_pretrain.py][INFO] Epoch:[0/2](432000/4588595) loss:2.462 lr:0.0000100 epoch_Time:26358.0min: [2024-01-04 15:19:15,348][model8_pretrain.py][INFO] Epoch:[0/2](432000/4588595) loss:2.865 lr:0.0000100 epoch_Time:26358.0min: [2024-01-04 15:19:57,477][model8_pretrain.py][INFO] Epoch:[0/2](432100/4588595) loss:2.451 lr:0.0000100 epoch_Time:26358.0min: [2024-01-04 15:19:57,477][model8_pretrain.py][INFO] Epoch:[0/2](432100/4588595) loss:3.515 lr:0.0000100 epoch_Time:26358.0min: [2024-01-04 15:19:57,477][model8_pretrain.py][INFO] Epoch:[0/2](432100/4588595) loss:3.350 lr:0.0000100 epoch_Time:26358.0min: [2024-01-04 15:19:57,477][model8_pretrain.py][INFO] Epoch:[0/2](432100/4588595) loss:2.932 lr:0.0000100 epoch_Time:26358.0min: [2024-01-04 15:19:57,477][model8_pretrain.py][INFO] Epoch:[0/2](432100/4588595) loss:2.598 lr:0.0000100 epoch_Time:26358.0min: [2024-01-04 15:19:57,477][model8_pretrain.py][INFO] Epoch:[0/2](432100/4588595) loss:2.723 lr:0.0000100 epoch_Time:26358.0min: [2024-01-04 15:19:57,477][model8_pretrain.py][INFO] Epoch:[0/2](432100/4588595) loss:2.771 lr:0.0000100 epoch_Time:26358.0min: [2024-01-04 15:19:57,477][model8_pretrain.py][INFO] Epoch:[0/2](432100/4588595) loss:2.398 lr:0.0000100 epoch_Time:26358.0min: [2024-01-04 15:20:39,610][model8_pretrain.py][INFO] Epoch:[0/2](432200/4588595) loss:2.911 lr:0.0000100 epoch_Time:26359.0min: [2024-01-04 15:20:39,610][model8_pretrain.py][INFO] Epoch:[0/2](432200/4588595) loss:3.309 lr:0.0000100 epoch_Time:26359.0min: [2024-01-04 
15:20:39,610][model8_pretrain.py][INFO] Epoch:[0/2](432200/4588595) loss:2.395 lr:0.0000100 epoch_Time:26359.0min: [2024-01-04 15:20:39,610][model8_pretrain.py][INFO] Epoch:[0/2](432200/4588595) loss:2.736 lr:0.0000100 epoch_Time:26359.0min: [2024-01-04 15:20:39,610][model8_pretrain.py][INFO] Epoch:[0/2](432200/4588595) loss:3.055 lr:0.0000100 epoch_Time:26359.0min: [2024-01-04 15:20:39,610][model8_pretrain.py][INFO] Epoch:[0/2](432200/4588595) loss:2.804 lr:0.0000100 epoch_Time:26359.0min: [2024-01-04 15:20:39,610][model8_pretrain.py][INFO] Epoch:[0/2](432200/4588595) loss:2.901 lr:0.0000100 epoch_Time:26359.0min: [2024-01-04 15:20:39,610][model8_pretrain.py][INFO] Epoch:[0/2](432200/4588595) loss:2.912 lr:0.0000100 epoch_Time:26359.0min: [2024-01-04 15:21:16,556][model8_pretrain.py][INFO] Epoch:[0/2](432300/4588595) loss:2.818 lr:0.0000100 epoch_Time:26358.0min: [2024-01-04 15:21:16,556][model8_pretrain.py][INFO] Epoch:[0/2](432300/4588595) loss:2.673 lr:0.0000100 epoch_Time:26358.0min: [2024-01-04 15:21:16,556][model8_pretrain.py][INFO] Epoch:[0/2](432300/4588595) loss:2.948 lr:0.0000100 epoch_Time:26358.0min: [2024-01-04 15:21:16,556][model8_pretrain.py][INFO] Epoch:[0/2](432300/4588595) loss:2.978 lr:0.0000100 epoch_Time:26358.0min: [2024-01-04 15:21:16,556][model8_pretrain.py][INFO] Epoch:[0/2](432300/4588595) loss:3.073 lr:0.0000100 epoch_Time:26358.0min: [2024-01-04 15:21:16,556][model8_pretrain.py][INFO] Epoch:[0/2](432300/4588595) loss:3.042 lr:0.0000100 epoch_Time:26358.0min: [2024-01-04 15:21:16,556][model8_pretrain.py][INFO] Epoch:[0/2](432300/4588595) loss:2.544 lr:0.0000100 epoch_Time:26358.0min: [2024-01-04 15:21:16,556][model8_pretrain.py][INFO] Epoch:[0/2](432300/4588595) loss:2.369 lr:0.0000100 epoch_Time:26358.0min: [2024-01-04 15:21:53,492][model8_pretrain.py][INFO] Epoch:[0/2](432400/4588595) loss:2.895 lr:0.0000100 epoch_Time:26356.0min: [2024-01-04 15:21:53,492][model8_pretrain.py][INFO] Epoch:[0/2](432400/4588595) loss:2.938 lr:0.0000100 
epoch_Time:26356.0min: [2024-01-04 15:21:53,492][model8_pretrain.py][INFO] Epoch:[0/2](432400/4588595) loss:3.103 lr:0.0000100 epoch_Time:26356.0min: [2024-01-04 15:21:53,492][model8_pretrain.py][INFO] Epoch:[0/2](432400/4588595) loss:2.687 lr:0.0000100 epoch_Time:26356.0min: [2024-01-04 15:21:53,492][model8_pretrain.py][INFO] Epoch:[0/2](432400/4588595) loss:2.894 lr:0.0000100 epoch_Time:26356.0min: [2024-01-04 15:21:53,492][model8_pretrain.py][INFO] Epoch:[0/2](432400/4588595) loss:3.517 lr:0.0000100 epoch_Time:26356.0min: [2024-01-04 15:21:53,492][model8_pretrain.py][INFO] Epoch:[0/2](432400/4588595) loss:2.890 lr:0.0000100 epoch_Time:26356.0min: [2024-01-04 15:21:53,492][model8_pretrain.py][INFO] Epoch:[0/2](432400/4588595) loss:2.903 lr:0.0000100 epoch_Time:26356.0min: [2024-01-04 15:22:30,432][model8_pretrain.py][INFO] Epoch:[0/2](432500/4588595) loss:3.195 lr:0.0000100 epoch_Time:26356.0min: [2024-01-04 15:22:30,432][model8_pretrain.py][INFO] Epoch:[0/2](432500/4588595) loss:2.703 lr:0.0000100 epoch_Time:26356.0min: [2024-01-04 15:22:30,432][model8_pretrain.py][INFO] Epoch:[0/2](432500/4588595) loss:2.964 lr:0.0000100 epoch_Time:26356.0min: [2024-01-04 15:22:30,432][model8_pretrain.py][INFO] Epoch:[0/2](432500/4588595) loss:3.001 lr:0.0000100 epoch_Time:26356.0min: [2024-01-04 15:22:30,432][model8_pretrain.py][INFO] Epoch:[0/2](432500/4588595) loss:2.758 lr:0.0000100 epoch_Time:26356.0min: [2024-01-04 15:22:30,432][model8_pretrain.py][INFO] Epoch:[0/2](432500/4588595) loss:3.078 lr:0.0000100 epoch_Time:26356.0min: [2024-01-04 15:22:30,432][model8_pretrain.py][INFO] Epoch:[0/2](432500/4588595) loss:2.955 lr:0.0000100 epoch_Time:26356.0min: [2024-01-04 15:22:30,432][model8_pretrain.py][INFO] Epoch:[0/2](432500/4588595) loss:2.965 lr:0.0000100 epoch_Time:26356.0min: [2024-01-04 15:23:07,377][model8_pretrain.py][INFO] Epoch:[0/2](432600/4588595) loss:3.402 lr:0.0000100 epoch_Time:26355.0min: [2024-01-04 15:23:07,377][model8_pretrain.py][INFO] 
Epoch:[0/2](432600/4588595) loss:3.127 lr:0.0000100 epoch_Time:26355.0min: [2024-01-04 15:23:07,377][model8_pretrain.py][INFO] Epoch:[0/2](432600/4588595) loss:2.967 lr:0.0000100 epoch_Time:26355.0min: [2024-01-04 15:23:07,377][model8_pretrain.py][INFO] Epoch:[0/2](432600/4588595) loss:2.941 lr:0.0000100 epoch_Time:26355.0min: [2024-01-04 15:23:07,377][model8_pretrain.py][INFO] Epoch:[0/2](432600/4588595) loss:2.861 lr:0.0000100 epoch_Time:26355.0min: [2024-01-04 15:23:07,377][model8_pretrain.py][INFO] Epoch:[0/2](432600/4588595) loss:2.218 lr:0.0000100 epoch_Time:26355.0min: [2024-01-04 15:23:07,377][model8_pretrain.py][INFO] Epoch:[0/2](432600/4588595) loss:3.013 lr:0.0000100 epoch_Time:26355.0min: [2024-01-04 15:23:07,378][model8_pretrain.py][INFO] Epoch:[0/2](432600/4588595) loss:2.511 lr:0.0000100 epoch_Time:26355.0min: [2024-01-04 15:23:44,380][model8_pretrain.py][INFO] Epoch:[0/2](432700/4588595) loss:2.955 lr:0.0000100 epoch_Time:26355.0min: [2024-01-04 15:23:44,380][model8_pretrain.py][INFO] Epoch:[0/2](432700/4588595) loss:2.463 lr:0.0000100 epoch_Time:26355.0min: [2024-01-04 15:23:44,380][model8_pretrain.py][INFO] Epoch:[0/2](432700/4588595) loss:2.605 lr:0.0000100 epoch_Time:26355.0min: [2024-01-04 15:23:44,380][model8_pretrain.py][INFO] Epoch:[0/2](432700/4588595) loss:3.224 lr:0.0000100 epoch_Time:26355.0min: [2024-01-04 15:23:44,380][model8_pretrain.py][INFO] Epoch:[0/2](432700/4588595) loss:3.391 lr:0.0000100 epoch_Time:26355.0min: [2024-01-04 15:23:44,380][model8_pretrain.py][INFO] Epoch:[0/2](432700/4588595) loss:2.503 lr:0.0000100 epoch_Time:26355.0min: [2024-01-04 15:23:44,380][model8_pretrain.py][INFO] Epoch:[0/2](432700/4588595) loss:3.061 lr:0.0000100 epoch_Time:26355.0min: [2024-01-04 15:23:44,381][model8_pretrain.py][INFO] Epoch:[0/2](432700/4588595) loss:2.947 lr:0.0000100 epoch_Time:26355.0min: [2024-01-04 15:24:21,331][model8_pretrain.py][INFO] Epoch:[0/2](432800/4588595) loss:2.843 lr:0.0000100 epoch_Time:26354.0min: [2024-01-04 
15:24:21,331][model8_pretrain.py][INFO] Epoch:[0/2](432800/4588595) loss:3.010 lr:0.0000100 epoch_Time:26354.0min: [2024-01-04 15:24:21,331][model8_pretrain.py][INFO] Epoch:[0/2](432800/4588595) loss:3.083 lr:0.0000100 epoch_Time:26354.0min: [2024-01-04 15:24:21,331][model8_pretrain.py][INFO] Epoch:[0/2](432800/4588595) loss:2.954 lr:0.0000100 epoch_Time:26354.0min: [2024-01-04 15:24:21,331][model8_pretrain.py][INFO] Epoch:[0/2](432800/4588595) loss:3.195 lr:0.0000100 epoch_Time:26354.0min: [2024-01-04 15:24:21,331][model8_pretrain.py][INFO] Epoch:[0/2](432800/4588595) loss:3.341 lr:0.0000100 epoch_Time:26354.0min: [2024-01-04 15:24:21,331][model8_pretrain.py][INFO] Epoch:[0/2](432800/4588595) loss:3.080 lr:0.0000100 epoch_Time:26354.0min: [2024-01-04 15:24:21,331][model8_pretrain.py][INFO] Epoch:[0/2](432800/4588595) loss:2.571 lr:0.0000100 epoch_Time:26354.0min: [2024-01-04 15:25:00,006][model8_pretrain.py][INFO] Epoch:[0/2](432900/4588595) loss:2.881 lr:0.0000100 epoch_Time:26353.0min: [2024-01-04 15:25:00,006][model8_pretrain.py][INFO] Epoch:[0/2](432900/4588595) loss:2.598 lr:0.0000100 epoch_Time:26353.0min: [2024-01-04 15:25:00,006][model8_pretrain.py][INFO] Epoch:[0/2](432900/4588595) loss:3.193 lr:0.0000100 epoch_Time:26353.0min: [2024-01-04 15:25:00,006][model8_pretrain.py][INFO] Epoch:[0/2](432900/4588595) loss:3.056 lr:0.0000100 epoch_Time:26353.0min: [2024-01-04 15:25:00,006][model8_pretrain.py][INFO] Epoch:[0/2](432900/4588595) loss:3.078 lr:0.0000100 epoch_Time:26353.0min: [2024-01-04 15:25:00,006][model8_pretrain.py][INFO] Epoch:[0/2](432900/4588595) loss:2.795 lr:0.0000100 epoch_Time:26353.0min: [2024-01-04 15:25:00,006][model8_pretrain.py][INFO] Epoch:[0/2](432900/4588595) loss:2.854 lr:0.0000100 epoch_Time:26353.0min: [2024-01-04 15:25:00,006][model8_pretrain.py][INFO] Epoch:[0/2](432900/4588595) loss:3.109 lr:0.0000100 epoch_Time:26353.0min: [2024-01-04 15:25:45,670][model8_pretrain.py][INFO] Epoch:[0/2](433000/4588595) loss:2.888 lr:0.0000100 
epoch_Time:26354.0min: [2024-01-04 15:25:45,670][model8_pretrain.py][INFO] Epoch:[0/2](433000/4588595) loss:2.483 lr:0.0000100 epoch_Time:26354.0min: [2024-01-04 15:25:45,670][model8_pretrain.py][INFO] Epoch:[0/2](433000/4588595) loss:2.773 lr:0.0000100 epoch_Time:26354.0min: [2024-01-04 15:25:45,670][model8_pretrain.py][INFO] Epoch:[0/2](433000/4588595) loss:2.912 lr:0.0000100 epoch_Time:26354.0min: [2024-01-04 15:25:45,670][model8_pretrain.py][INFO] Epoch:[0/2](433000/4588595) loss:2.288 lr:0.0000100 epoch_Time:26354.0min: [2024-01-04 15:25:45,670][model8_pretrain.py][INFO] Epoch:[0/2](433000/4588595) loss:3.303 lr:0.0000100 epoch_Time:26354.0min: [2024-01-04 15:25:45,670][model8_pretrain.py][INFO] Epoch:[0/2](433000/4588595) loss:3.024 lr:0.0000100 epoch_Time:26354.0min: [2024-01-04 15:25:45,670][model8_pretrain.py][INFO] Epoch:[0/2](433000/4588595) loss:2.647 lr:0.0000100 epoch_Time:26354.0min: [2024-01-04 15:26:22,610][model8_pretrain.py][INFO] Epoch:[0/2](433100/4588595) loss:2.654 lr:0.0000100 epoch_Time:26353.0min: [2024-01-04 15:26:22,610][model8_pretrain.py][INFO] Epoch:[0/2](433100/4588595) loss:2.871 lr:0.0000100 epoch_Time:26353.0min: [2024-01-04 15:26:22,610][model8_pretrain.py][INFO] Epoch:[0/2](433100/4588595) loss:3.281 lr:0.0000100 epoch_Time:26353.0min: [2024-01-04 15:26:22,610][model8_pretrain.py][INFO] Epoch:[0/2](433100/4588595) loss:2.068 lr:0.0000100 epoch_Time:26353.0min: [2024-01-04 15:26:22,610][model8_pretrain.py][INFO] Epoch:[0/2](433100/4588595) loss:2.729 lr:0.0000100 epoch_Time:26353.0min: [2024-01-04 15:26:22,610][model8_pretrain.py][INFO] Epoch:[0/2](433100/4588595) loss:2.526 lr:0.0000100 epoch_Time:26353.0min: [2024-01-04 15:26:22,611][model8_pretrain.py][INFO] Epoch:[0/2](433100/4588595) loss:2.969 lr:0.0000100 epoch_Time:26353.0min: [2024-01-04 15:26:22,611][model8_pretrain.py][INFO] Epoch:[0/2](433100/4588595) loss:3.213 lr:0.0000100 epoch_Time:26353.0min: [2024-01-04 15:26:59,548][model8_pretrain.py][INFO] 
Epoch:[0/2](433200/4588595) loss:3.023 lr:0.0000100 epoch_Time:26352.0min: [2024-01-04 15:26:59,548][model8_pretrain.py][INFO] Epoch:[0/2](433200/4588595) loss:2.812 lr:0.0000100 epoch_Time:26352.0min: [2024-01-04 15:26:59,548][model8_pretrain.py][INFO] Epoch:[0/2](433200/4588595) loss:2.653 lr:0.0000100 epoch_Time:26352.0min: [2024-01-04 15:26:59,548][model8_pretrain.py][INFO] Epoch:[0/2](433200/4588595) loss:3.088 lr:0.0000100 epoch_Time:26352.0min: [2024-01-04 15:26:59,548][model8_pretrain.py][INFO] Epoch:[0/2](433200/4588595) loss:2.823 lr:0.0000100 epoch_Time:26352.0min: [2024-01-04 15:26:59,548][model8_pretrain.py][INFO] Epoch:[0/2](433200/4588595) loss:3.062 lr:0.0000100 epoch_Time:26352.0min: [2024-01-04 15:26:59,548][model8_pretrain.py][INFO] Epoch:[0/2](433200/4588595) loss:2.606 lr:0.0000100 epoch_Time:26352.0min: [2024-01-04 15:26:59,549][model8_pretrain.py][INFO] Epoch:[0/2](433200/4588595) loss:2.591 lr:0.0000100 epoch_Time:26352.0min: [2024-01-04 15:27:36,484][model8_pretrain.py][INFO] Epoch:[0/2](433300/4588595) loss:2.950 lr:0.0000100 epoch_Time:26351.0min: [2024-01-04 15:27:36,485][model8_pretrain.py][INFO] Epoch:[0/2](433300/4588595) loss:2.477 lr:0.0000100 epoch_Time:26351.0min: [2024-01-04 15:27:36,485][model8_pretrain.py][INFO] Epoch:[0/2](433300/4588595) loss:2.018 lr:0.0000100 epoch_Time:26351.0min: [2024-01-04 15:27:36,485][model8_pretrain.py][INFO] Epoch:[0/2](433300/4588595) loss:2.325 lr:0.0000100 epoch_Time:26351.0min: [2024-01-04 15:27:36,485][model8_pretrain.py][INFO] Epoch:[0/2](433300/4588595) loss:2.657 lr:0.0000100 epoch_Time:26351.0min: [2024-01-04 15:27:36,485][model8_pretrain.py][INFO] Epoch:[0/2](433300/4588595) loss:2.925 lr:0.0000100 epoch_Time:26351.0min: [2024-01-04 15:27:36,485][model8_pretrain.py][INFO] Epoch:[0/2](433300/4588595) loss:2.979 lr:0.0000100 epoch_Time:26351.0min: [2024-01-04 15:27:36,485][model8_pretrain.py][INFO] Epoch:[0/2](433300/4588595) loss:2.890 lr:0.0000100 epoch_Time:26351.0min: [2024-01-04 
15:28:13,420][model8_pretrain.py][INFO] Epoch:[0/2](433400/4588595) loss:2.498 lr:0.0000100 epoch_Time:26350.0min: [2024-01-04 15:28:13,420][model8_pretrain.py][INFO] Epoch:[0/2](433400/4588595) loss:3.104 lr:0.0000100 epoch_Time:26350.0min: [2024-01-04 15:28:13,420][model8_pretrain.py][INFO] Epoch:[0/2](433400/4588595) loss:2.450 lr:0.0000100 epoch_Time:26350.0min: [2024-01-04 15:28:13,420][model8_pretrain.py][INFO] Epoch:[0/2](433400/4588595) loss:2.871 lr:0.0000100 epoch_Time:26350.0min: [2024-01-04 15:28:13,421][model8_pretrain.py][INFO] Epoch:[0/2](433400/4588595) loss:2.784 lr:0.0000100 epoch_Time:26350.0min: [2024-01-04 15:28:13,421][model8_pretrain.py][INFO] Epoch:[0/2](433400/4588595) loss:2.884 lr:0.0000100 epoch_Time:26350.0min: [2024-01-04 15:28:13,421][model8_pretrain.py][INFO] Epoch:[0/2](433400/4588595) loss:2.960 lr:0.0000100 epoch_Time:26350.0min: [2024-01-04 15:28:13,421][model8_pretrain.py][INFO] Epoch:[0/2](433400/4588595) loss:2.901 lr:0.0000100 epoch_Time:26350.0min: [2024-01-04 15:28:50,360][model8_pretrain.py][INFO] Epoch:[0/2](433500/4588595) loss:2.739 lr:0.0000100 epoch_Time:26349.0min: [2024-01-04 15:28:50,360][model8_pretrain.py][INFO] Epoch:[0/2](433500/4588595) loss:2.884 lr:0.0000100 epoch_Time:26349.0min: [2024-01-04 15:28:50,360][model8_pretrain.py][INFO] Epoch:[0/2](433500/4588595) loss:3.246 lr:0.0000100 epoch_Time:26349.0min: [2024-01-04 15:28:50,360][model8_pretrain.py][INFO] Epoch:[0/2](433500/4588595) loss:2.709 lr:0.0000100 epoch_Time:26349.0min: [2024-01-04 15:28:50,360][model8_pretrain.py][INFO] Epoch:[0/2](433500/4588595) loss:2.726 lr:0.0000100 epoch_Time:26349.0min: [2024-01-04 15:28:50,360][model8_pretrain.py][INFO] Epoch:[0/2](433500/4588595) loss:2.535 lr:0.0000100 epoch_Time:26349.0min: [2024-01-04 15:28:50,360][model8_pretrain.py][INFO] Epoch:[0/2](433500/4588595) loss:3.388 lr:0.0000100 epoch_Time:26349.0min: [2024-01-04 15:28:50,360][model8_pretrain.py][INFO] Epoch:[0/2](433500/4588595) loss:2.950 lr:0.0000100 
epoch_Time:26349.0min: [2024-01-04 15:29:27,299][model8_pretrain.py][INFO] Epoch:[0/2](433600/4588595) loss:2.975 lr:0.0000100 epoch_Time:26349.0min: [2024-01-04 15:29:27,299][model8_pretrain.py][INFO] Epoch:[0/2](433600/4588595) loss:2.859 lr:0.0000100 epoch_Time:26349.0min: [2024-01-04 15:29:27,299][model8_pretrain.py][INFO] Epoch:[0/2](433600/4588595) loss:2.077 lr:0.0000100 epoch_Time:26349.0min: [2024-01-04 15:29:27,299][model8_pretrain.py][INFO] Epoch:[0/2](433600/4588595) loss:2.763 lr:0.0000100 epoch_Time:26349.0min: [2024-01-04 15:29:27,299][model8_pretrain.py][INFO] Epoch:[0/2](433600/4588595) loss:3.085 lr:0.0000100 epoch_Time:26349.0min: [2024-01-04 15:29:27,299][model8_pretrain.py][INFO] Epoch:[0/2](433600/4588595) loss:3.334 lr:0.0000100 epoch_Time:26349.0min: [2024-01-04 15:29:27,299][model8_pretrain.py][INFO] Epoch:[0/2](433600/4588595) loss:2.904 lr:0.0000100 epoch_Time:26349.0min: [2024-01-04 15:29:27,299][model8_pretrain.py][INFO] Epoch:[0/2](433600/4588595) loss:3.191 lr:0.0000100 epoch_Time:26349.0min: [2024-01-04 15:30:06,055][model8_pretrain.py][INFO] Epoch:[0/2](433700/4588595) loss:2.620 lr:0.0000100 epoch_Time:26348.0min: [2024-01-04 15:30:06,055][model8_pretrain.py][INFO] Epoch:[0/2](433700/4588595) loss:3.244 lr:0.0000100 epoch_Time:26348.0min: [2024-01-04 15:30:06,055][model8_pretrain.py][INFO] Epoch:[0/2](433700/4588595) loss:1.984 lr:0.0000100 epoch_Time:26348.0min: [2024-01-04 15:30:06,055][model8_pretrain.py][INFO] Epoch:[0/2](433700/4588595) loss:3.364 lr:0.0000100 epoch_Time:26348.0min: [2024-01-04 15:30:06,055][model8_pretrain.py][INFO] Epoch:[0/2](433700/4588595) loss:3.002 lr:0.0000100 epoch_Time:26348.0min: [2024-01-04 15:30:06,055][model8_pretrain.py][INFO] Epoch:[0/2](433700/4588595) loss:2.590 lr:0.0000100 epoch_Time:26348.0min: [2024-01-04 15:30:06,055][model8_pretrain.py][INFO] Epoch:[0/2](433700/4588595) loss:2.830 lr:0.0000100 epoch_Time:26348.0min: [2024-01-04 15:30:06,056][model8_pretrain.py][INFO] 
Epoch:[0/2](433700/4588595) loss:1.953 lr:0.0000100 epoch_Time:26348.0min: [2024-01-04 15:30:51,617][model8_pretrain.py][INFO] Epoch:[0/2](433800/4588595) loss:2.650 lr:0.0000100 epoch_Time:26348.0min: [2024-01-04 15:30:51,617][model8_pretrain.py][INFO] Epoch:[0/2](433800/4588595) loss:3.060 lr:0.0000100 epoch_Time:26348.0min: [2024-01-04 15:30:51,617][model8_pretrain.py][INFO] Epoch:[0/2](433800/4588595) loss:3.081 lr:0.0000100 epoch_Time:26348.0min: [2024-01-04 15:30:51,617][model8_pretrain.py][INFO] Epoch:[0/2](433800/4588595) loss:2.793 lr:0.0000100 epoch_Time:26348.0min: [2024-01-04 15:30:51,617][model8_pretrain.py][INFO] Epoch:[0/2](433800/4588595) loss:3.130 lr:0.0000100 epoch_Time:26348.0min: [2024-01-04 15:30:51,617][model8_pretrain.py][INFO] Epoch:[0/2](433800/4588595) loss:2.782 lr:0.0000100 epoch_Time:26348.0min: [2024-01-04 15:30:51,617][model8_pretrain.py][INFO] Epoch:[0/2](433800/4588595) loss:3.120 lr:0.0000100 epoch_Time:26348.0min: [2024-01-04 15:30:51,617][model8_pretrain.py][INFO] Epoch:[0/2](433800/4588595) loss:2.015 lr:0.0000100 epoch_Time:26348.0min: [2024-01-04 15:31:28,558][model8_pretrain.py][INFO] Epoch:[0/2](433900/4588595) loss:2.323 lr:0.0000100 epoch_Time:26348.0min: [2024-01-04 15:31:28,559][model8_pretrain.py][INFO] Epoch:[0/2](433900/4588595) loss:2.844 lr:0.0000100 epoch_Time:26348.0min: [2024-01-04 15:31:28,559][model8_pretrain.py][INFO] Epoch:[0/2](433900/4588595) loss:3.271 lr:0.0000100 epoch_Time:26348.0min: [2024-01-04 15:31:28,559][model8_pretrain.py][INFO] Epoch:[0/2](433900/4588595) loss:3.146 lr:0.0000100 epoch_Time:26348.0min: [2024-01-04 15:31:28,559][model8_pretrain.py][INFO] Epoch:[0/2](433900/4588595) loss:2.871 lr:0.0000100 epoch_Time:26348.0min: [2024-01-04 15:31:28,559][model8_pretrain.py][INFO] Epoch:[0/2](433900/4588595) loss:2.702 lr:0.0000100 epoch_Time:26348.0min: [2024-01-04 15:31:28,559][model8_pretrain.py][INFO] Epoch:[0/2](433900/4588595) loss:3.217 lr:0.0000100 epoch_Time:26348.0min: [2024-01-04 
15:31:28,559][model8_pretrain.py][INFO] Epoch:[0/2](433900/4588595) loss:2.438 lr:0.0000100 epoch_Time:26348.0min: [2024-01-04 15:32:05,510][model8_pretrain.py][INFO] Epoch:[0/2](434000/4588595) loss:3.192 lr:0.0000100 epoch_Time:26347.0min: [2024-01-04 15:32:05,510][model8_pretrain.py][INFO] Epoch:[0/2](434000/4588595) loss:2.804 lr:0.0000100 epoch_Time:26347.0min: [2024-01-04 15:32:05,510][model8_pretrain.py][INFO] Epoch:[0/2](434000/4588595) loss:2.966 lr:0.0000100 epoch_Time:26347.0min: [2024-01-04 15:32:05,510][model8_pretrain.py][INFO] Epoch:[0/2](434000/4588595) loss:2.888 lr:0.0000100 epoch_Time:26347.0min: [2024-01-04 15:32:05,510][model8_pretrain.py][INFO] Epoch:[0/2](434000/4588595) loss:2.525 lr:0.0000100 epoch_Time:26347.0min: [2024-01-04 15:32:05,510][model8_pretrain.py][INFO] Epoch:[0/2](434000/4588595) loss:2.610 lr:0.0000100 epoch_Time:26347.0min: [2024-01-04 15:32:05,510][model8_pretrain.py][INFO] Epoch:[0/2](434000/4588595) loss:3.013 lr:0.0000100 epoch_Time:26347.0min: [2024-01-04 15:32:05,510][model8_pretrain.py][INFO] Epoch:[0/2](434000/4588595) loss:2.796 lr:0.0000100 epoch_Time:26347.0min: [2024-01-04 15:32:42,473][model8_pretrain.py][INFO] Epoch:[0/2](434100/4588595) loss:2.765 lr:0.0000100 epoch_Time:26347.0min: [2024-01-04 15:32:42,473][model8_pretrain.py][INFO] Epoch:[0/2](434100/4588595) loss:2.754 lr:0.0000100 epoch_Time:26347.0min: [2024-01-04 15:32:42,473][model8_pretrain.py][INFO] Epoch:[0/2](434100/4588595) loss:2.991 lr:0.0000100 epoch_Time:26347.0min: [2024-01-04 15:32:42,473][model8_pretrain.py][INFO] Epoch:[0/2](434100/4588595) loss:2.764 lr:0.0000100 epoch_Time:26347.0min: [2024-01-04 15:32:42,473][model8_pretrain.py][INFO] Epoch:[0/2](434100/4588595) loss:3.274 lr:0.0000100 epoch_Time:26347.0min: [2024-01-04 15:32:42,474][model8_pretrain.py][INFO] Epoch:[0/2](434100/4588595) loss:2.816 lr:0.0000100 epoch_Time:26347.0min: [2024-01-04 15:32:42,474][model8_pretrain.py][INFO] Epoch:[0/2](434100/4588595) loss:2.497 lr:0.0000100 
epoch_Time:26347.0min: [2024-01-04 15:32:42,474][model8_pretrain.py][INFO] Epoch:[0/2](434100/4588595) loss:2.718 lr:0.0000100 epoch_Time:26347.0min: [2024-01-04 15:33:19,426][model8_pretrain.py][INFO] Epoch:[0/2](434200/4588595) loss:3.145 lr:0.0000100 epoch_Time:26346.0min: [2024-01-04 15:33:19,426][model8_pretrain.py][INFO] Epoch:[0/2](434200/4588595) loss:2.855 lr:0.0000100 epoch_Time:26346.0min: [2024-01-04 15:33:19,426][model8_pretrain.py][INFO] Epoch:[0/2](434200/4588595) loss:2.931 lr:0.0000100 epoch_Time:26346.0min: [2024-01-04 15:33:19,426][model8_pretrain.py][INFO] Epoch:[0/2](434200/4588595) loss:2.832 lr:0.0000100 epoch_Time:26346.0min: [2024-01-04 15:33:19,426][model8_pretrain.py][INFO] Epoch:[0/2](434200/4588595) loss:2.966 lr:0.0000100 epoch_Time:26346.0min: [2024-01-04 15:33:19,426][model8_pretrain.py][INFO] Epoch:[0/2](434200/4588595) loss:2.932 lr:0.0000100 epoch_Time:26346.0min: [2024-01-04 15:33:19,426][model8_pretrain.py][INFO] Epoch:[0/2](434200/4588595) loss:2.747 lr:0.0000100 epoch_Time:26346.0min: [2024-01-04 15:33:19,426][model8_pretrain.py][INFO] Epoch:[0/2](434200/4588595) loss:3.065 lr:0.0000100 epoch_Time:26346.0min: [2024-01-04 15:33:56,381][model8_pretrain.py][INFO] Epoch:[0/2](434300/4588595) loss:2.790 lr:0.0000100 epoch_Time:26344.0min: [2024-01-04 15:33:56,381][model8_pretrain.py][INFO] Epoch:[0/2](434300/4588595) loss:3.078 lr:0.0000100 epoch_Time:26344.0min: [2024-01-04 15:33:56,381][model8_pretrain.py][INFO] Epoch:[0/2](434300/4588595) loss:3.023 lr:0.0000100 epoch_Time:26344.0min: [2024-01-04 15:33:56,381][model8_pretrain.py][INFO] Epoch:[0/2](434300/4588595) loss:3.050 lr:0.0000100 epoch_Time:26344.0min: [2024-01-04 15:33:56,381][model8_pretrain.py][INFO] Epoch:[0/2](434300/4588595) loss:2.732 lr:0.0000100 epoch_Time:26344.0min: [2024-01-04 15:33:56,381][model8_pretrain.py][INFO] Epoch:[0/2](434300/4588595) loss:2.592 lr:0.0000100 epoch_Time:26344.0min: [2024-01-04 15:33:56,381][model8_pretrain.py][INFO] 
Epoch:[0/2](434300/4588595) loss:3.020 lr:0.0000100 epoch_Time:26344.0min: [2024-01-04 15:33:56,381][model8_pretrain.py][INFO] Epoch:[0/2](434300/4588595) loss:2.663 lr:0.0000100 epoch_Time:26344.0min: [2024-01-04 15:34:33,341][model8_pretrain.py][INFO] Epoch:[0/2](434400/4588595) loss:2.753 lr:0.0000100 epoch_Time:26344.0min: [2024-01-04 15:34:33,341][model8_pretrain.py][INFO] Epoch:[0/2](434400/4588595) loss:2.822 lr:0.0000100 epoch_Time:26344.0min: [2024-01-04 15:34:33,341][model8_pretrain.py][INFO] Epoch:[0/2](434400/4588595) loss:3.053 lr:0.0000100 epoch_Time:26344.0min: [2024-01-04 15:34:33,342][model8_pretrain.py][INFO] Epoch:[0/2](434400/4588595) loss:3.211 lr:0.0000100 epoch_Time:26344.0min: [2024-01-04 15:34:33,342][model8_pretrain.py][INFO] Epoch:[0/2](434400/4588595) loss:2.950 lr:0.0000100 epoch_Time:26344.0min: [2024-01-04 15:34:33,342][model8_pretrain.py][INFO] Epoch:[0/2](434400/4588595) loss:2.631 lr:0.0000100 epoch_Time:26344.0min: [2024-01-04 15:34:33,342][model8_pretrain.py][INFO] Epoch:[0/2](434400/4588595) loss:3.312 lr:0.0000100 epoch_Time:26344.0min: [2024-01-04 15:34:33,343][model8_pretrain.py][INFO] Epoch:[0/2](434400/4588595) loss:3.175 lr:0.0000100 epoch_Time:26344.0min: [2024-01-04 15:35:12,035][model8_pretrain.py][INFO] Epoch:[0/2](434500/4588595) loss:3.328 lr:0.0000100 epoch_Time:26343.0min: [2024-01-04 15:35:12,035][model8_pretrain.py][INFO] Epoch:[0/2](434500/4588595) loss:3.009 lr:0.0000100 epoch_Time:26343.0min: [2024-01-04 15:35:12,035][model8_pretrain.py][INFO] Epoch:[0/2](434500/4588595) loss:2.527 lr:0.0000100 epoch_Time:26343.0min: [2024-01-04 15:35:12,035][model8_pretrain.py][INFO] Epoch:[0/2](434500/4588595) loss:3.108 lr:0.0000100 epoch_Time:26343.0min: [2024-01-04 15:35:12,035][model8_pretrain.py][INFO] Epoch:[0/2](434500/4588595) loss:2.716 lr:0.0000100 epoch_Time:26343.0min: [2024-01-04 15:35:12,035][model8_pretrain.py][INFO] Epoch:[0/2](434500/4588595) loss:2.619 lr:0.0000100 epoch_Time:26343.0min: [2024-01-04 
15:35:12,035][model8_pretrain.py][INFO] Epoch:[0/2](434500/4588595) loss:2.904 lr:0.0000100 epoch_Time:26343.0min: [2024-01-04 15:35:12,035][model8_pretrain.py][INFO] Epoch:[0/2](434500/4588595) loss:2.780 lr:0.0000100 epoch_Time:26343.0min: [2024-01-04 15:35:57,635][model8_pretrain.py][INFO] Epoch:[0/2](434600/4588595) loss:3.263 lr:0.0000100 epoch_Time:26344.0min: [2024-01-04 15:35:57,635][model8_pretrain.py][INFO] Epoch:[0/2](434600/4588595) loss:2.991 lr:0.0000100 epoch_Time:26344.0min: [2024-01-04 15:35:57,635][model8_pretrain.py][INFO] Epoch:[0/2](434600/4588595) loss:2.227 lr:0.0000100 epoch_Time:26344.0min: [2024-01-04 15:35:57,635][model8_pretrain.py][INFO] Epoch:[0/2](434600/4588595) loss:2.901 lr:0.0000100 epoch_Time:26344.0min: [2024-01-04 15:35:57,635][model8_pretrain.py][INFO] Epoch:[0/2](434600/4588595) loss:2.496 lr:0.0000100 epoch_Time:26344.0min: [2024-01-04 15:35:57,635][model8_pretrain.py][INFO] Epoch:[0/2](434600/4588595) loss:2.728 lr:0.0000100 epoch_Time:26344.0min: [2024-01-04 15:35:57,635][model8_pretrain.py][INFO] Epoch:[0/2](434600/4588595) loss:2.250 lr:0.0000100 epoch_Time:26344.0min: [2024-01-04 15:35:57,635][model8_pretrain.py][INFO] Epoch:[0/2](434600/4588595) loss:3.033 lr:0.0000100 epoch_Time:26344.0min: [2024-01-04 15:36:34,575][model8_pretrain.py][INFO] Epoch:[0/2](434700/4588595) loss:3.086 lr:0.0000100 epoch_Time:26343.0min: [2024-01-04 15:36:34,575][model8_pretrain.py][INFO] Epoch:[0/2](434700/4588595) loss:3.313 lr:0.0000100 epoch_Time:26343.0min: [2024-01-04 15:36:34,575][model8_pretrain.py][INFO] Epoch:[0/2](434700/4588595) loss:2.415 lr:0.0000100 epoch_Time:26343.0min: [2024-01-04 15:36:34,575][model8_pretrain.py][INFO] Epoch:[0/2](434700/4588595) loss:2.720 lr:0.0000100 epoch_Time:26343.0min: [2024-01-04 15:36:34,575][model8_pretrain.py][INFO] Epoch:[0/2](434700/4588595) loss:3.153 lr:0.0000100 epoch_Time:26343.0min: [2024-01-04 15:36:34,575][model8_pretrain.py][INFO] Epoch:[0/2](434700/4588595) loss:2.830 lr:0.0000100 
epoch_Time:26343.0min: [2024-01-04 15:36:34,575][model8_pretrain.py][INFO] Epoch:[0/2](434700/4588595) loss:3.161 lr:0.0000100 epoch_Time:26343.0min: [2024-01-04 15:36:34,576][model8_pretrain.py][INFO] Epoch:[0/2](434700/4588595) loss:3.124 lr:0.0000100 epoch_Time:26343.0min: [2024-01-04 15:37:11,526][model8_pretrain.py][INFO] Epoch:[0/2](434800/4588595) loss:3.107 lr:0.0000100 epoch_Time:26342.0min: [2024-01-04 15:37:11,526][model8_pretrain.py][INFO] Epoch:[0/2](434800/4588595) loss:3.149 lr:0.0000100 epoch_Time:26342.0min: [2024-01-04 15:37:11,526][model8_pretrain.py][INFO] Epoch:[0/2](434800/4588595) loss:3.046 lr:0.0000100 epoch_Time:26342.0min: [2024-01-04 15:37:11,526][model8_pretrain.py][INFO] Epoch:[0/2](434800/4588595) loss:3.125 lr:0.0000100 epoch_Time:26342.0min: [2024-01-04 15:37:11,526][model8_pretrain.py][INFO] Epoch:[0/2](434800/4588595) loss:3.083 lr:0.0000100 epoch_Time:26342.0min: [2024-01-04 15:37:11,526][model8_pretrain.py][INFO] Epoch:[0/2](434800/4588595) loss:3.009 lr:0.0000100 epoch_Time:26342.0min: [2024-01-04 15:37:11,526][model8_pretrain.py][INFO] Epoch:[0/2](434800/4588595) loss:3.269 lr:0.0000100 epoch_Time:26342.0min: [2024-01-04 15:37:11,526][model8_pretrain.py][INFO] Epoch:[0/2](434800/4588595) loss:3.153 lr:0.0000100 epoch_Time:26342.0min: [2024-01-04 15:37:48,484][model8_pretrain.py][INFO] Epoch:[0/2](434900/4588595) loss:2.433 lr:0.0000100 epoch_Time:26341.0min: [2024-01-04 15:37:48,484][model8_pretrain.py][INFO] Epoch:[0/2](434900/4588595) loss:3.247 lr:0.0000100 epoch_Time:26341.0min: [2024-01-04 15:37:48,484][model8_pretrain.py][INFO] Epoch:[0/2](434900/4588595) loss:2.706 lr:0.0000100 epoch_Time:26341.0min: [2024-01-04 15:37:48,484][model8_pretrain.py][INFO] Epoch:[0/2](434900/4588595) loss:2.587 lr:0.0000100 epoch_Time:26341.0min: [2024-01-04 15:37:48,484][model8_pretrain.py][INFO] Epoch:[0/2](434900/4588595) loss:2.082 lr:0.0000100 epoch_Time:26341.0min: [2024-01-04 15:37:48,484][model8_pretrain.py][INFO] 
Epoch:[0/2](434900/4588595) loss:2.706 lr:0.0000100 epoch_Time:26341.0min: [2024-01-04 15:37:48,484][model8_pretrain.py][INFO] Epoch:[0/2](434900/4588595) loss:2.710 lr:0.0000100 epoch_Time:26341.0min: [2024-01-04 15:37:48,484][model8_pretrain.py][INFO] Epoch:[0/2](434900/4588595) loss:3.149 lr:0.0000100 epoch_Time:26341.0min: [2024-01-04 15:38:25,443][model8_pretrain.py][INFO] Epoch:[0/2](435000/4588595) loss:3.184 lr:0.0000100 epoch_Time:26341.0min: [2024-01-04 15:38:25,443][model8_pretrain.py][INFO] Epoch:[0/2](435000/4588595) loss:2.449 lr:0.0000100 epoch_Time:26341.0min: [2024-01-04 15:38:25,443][model8_pretrain.py][INFO] Epoch:[0/2](435000/4588595) loss:2.845 lr:0.0000100 epoch_Time:26341.0min: [2024-01-04 15:38:25,443][model8_pretrain.py][INFO] Epoch:[0/2](435000/4588595) loss:3.383 lr:0.0000100 epoch_Time:26341.0min: [2024-01-04 15:38:25,443][model8_pretrain.py][INFO] Epoch:[0/2](435000/4588595) loss:2.624 lr:0.0000100 epoch_Time:26341.0min: [2024-01-04 15:38:25,443][model8_pretrain.py][INFO] Epoch:[0/2](435000/4588595) loss:2.268 lr:0.0000100 epoch_Time:26341.0min: [2024-01-04 15:38:25,444][model8_pretrain.py][INFO] Epoch:[0/2](435000/4588595) loss:2.538 lr:0.0000100 epoch_Time:26341.0min: [2024-01-04 15:38:25,444][model8_pretrain.py][INFO] Epoch:[0/2](435000/4588595) loss:3.220 lr:0.0000100 epoch_Time:26341.0min: [2024-01-04 15:39:02,387][model8_pretrain.py][INFO] Epoch:[0/2](435100/4588595) loss:2.320 lr:0.0000100 epoch_Time:26340.0min: [2024-01-04 15:39:02,387][model8_pretrain.py][INFO] Epoch:[0/2](435100/4588595) loss:2.551 lr:0.0000100 epoch_Time:26340.0min: [2024-01-04 15:39:02,387][model8_pretrain.py][INFO] Epoch:[0/2](435100/4588595) loss:2.575 lr:0.0000100 epoch_Time:26340.0min: [2024-01-04 15:39:02,387][model8_pretrain.py][INFO] Epoch:[0/2](435100/4588595) loss:2.234 lr:0.0000100 epoch_Time:26340.0min: [2024-01-04 15:39:02,387][model8_pretrain.py][INFO] Epoch:[0/2](435100/4588595) loss:2.978 lr:0.0000100 epoch_Time:26340.0min: [2024-01-04 
15:39:02,388][model8_pretrain.py][INFO] Epoch:[0/2](435100/4588595) loss:3.021 lr:0.0000100 epoch_Time:26340.0min: [2024-01-04 15:39:02,387][model8_pretrain.py][INFO] Epoch:[0/2](435100/4588595) loss:2.535 lr:0.0000100 epoch_Time:26340.0min: [2024-01-04 15:39:02,388][model8_pretrain.py][INFO] Epoch:[0/2](435100/4588595) loss:2.801 lr:0.0000100 epoch_Time:26340.0min: [2024-01-04 15:39:39,349][model8_pretrain.py][INFO] Epoch:[0/2](435200/4588595) loss:3.029 lr:0.0000100 epoch_Time:26339.0min: [2024-01-04 15:39:39,349][model8_pretrain.py][INFO] Epoch:[0/2](435200/4588595) loss:3.421 lr:0.0000100 epoch_Time:26339.0min: [2024-01-04 15:39:39,349][model8_pretrain.py][INFO] Epoch:[0/2](435200/4588595) loss:3.152 lr:0.0000100 epoch_Time:26339.0min: [2024-01-04 15:39:39,349][model8_pretrain.py][INFO] Epoch:[0/2](435200/4588595) loss:2.760 lr:0.0000100 epoch_Time:26339.0min: [2024-01-04 15:39:39,349][model8_pretrain.py][INFO] Epoch:[0/2](435200/4588595) loss:2.038 lr:0.0000100 epoch_Time:26339.0min: [2024-01-04 15:39:39,349][model8_pretrain.py][INFO] Epoch:[0/2](435200/4588595) loss:2.942 lr:0.0000100 epoch_Time:26339.0min: [2024-01-04 15:39:39,349][model8_pretrain.py][INFO] Epoch:[0/2](435200/4588595) loss:2.992 lr:0.0000100 epoch_Time:26339.0min: [2024-01-04 15:39:39,349][model8_pretrain.py][INFO] Epoch:[0/2](435200/4588595) loss:2.683 lr:0.0000100 epoch_Time:26339.0min: [2024-01-04 15:40:16,302][model8_pretrain.py][INFO] Epoch:[0/2](435300/4588595) loss:2.685 lr:0.0000100 epoch_Time:26338.0min: [2024-01-04 15:40:16,302][model8_pretrain.py][INFO] Epoch:[0/2](435300/4588595) loss:3.049 lr:0.0000100 epoch_Time:26338.0min: [2024-01-04 15:40:16,302][model8_pretrain.py][INFO] Epoch:[0/2](435300/4588595) loss:2.582 lr:0.0000100 epoch_Time:26338.0min: [2024-01-04 15:40:16,302][model8_pretrain.py][INFO] Epoch:[0/2](435300/4588595) loss:2.969 lr:0.0000100 epoch_Time:26338.0min: [2024-01-04 15:40:16,302][model8_pretrain.py][INFO] Epoch:[0/2](435300/4588595) loss:3.035 lr:0.0000100 
epoch_Time:26338.0min: [2024-01-04 15:40:16,302][model8_pretrain.py][INFO] Epoch:[0/2](435300/4588595) loss:3.173 lr:0.0000100 epoch_Time:26338.0min: [2024-01-04 15:40:16,302][model8_pretrain.py][INFO] Epoch:[0/2](435300/4588595) loss:2.772 lr:0.0000100 epoch_Time:26338.0min: [2024-01-04 15:40:16,302][model8_pretrain.py][INFO] Epoch:[0/2](435300/4588595) loss:3.269 lr:0.0000100 epoch_Time:26338.0min: [2024-01-04 15:41:03,692][model8_pretrain.py][INFO] Epoch:[0/2](435400/4588595) loss:2.238 lr:0.0000100 epoch_Time:26339.0min: [2024-01-04 15:41:03,692][model8_pretrain.py][INFO] Epoch:[0/2](435400/4588595) loss:1.899 lr:0.0000100 epoch_Time:26339.0min: [2024-01-04 15:41:03,692][model8_pretrain.py][INFO] Epoch:[0/2](435400/4588595) loss:2.963 lr:0.0000100 epoch_Time:26339.0min: [2024-01-04 15:41:03,692][model8_pretrain.py][INFO] Epoch:[0/2](435400/4588595) loss:2.622 lr:0.0000100 epoch_Time:26339.0min: [2024-01-04 15:41:03,692][model8_pretrain.py][INFO] Epoch:[0/2](435400/4588595) loss:2.909 lr:0.0000100 epoch_Time:26339.0min: [2024-01-04 15:41:03,692][model8_pretrain.py][INFO] Epoch:[0/2](435400/4588595) loss:2.429 lr:0.0000100 epoch_Time:26339.0min: [2024-01-04 15:41:03,692][model8_pretrain.py][INFO] Epoch:[0/2](435400/4588595) loss:3.153 lr:0.0000100 epoch_Time:26339.0min: [2024-01-04 15:41:03,692][model8_pretrain.py][INFO] Epoch:[0/2](435400/4588595) loss:2.382 lr:0.0000100 epoch_Time:26339.0min: [2024-01-04 15:41:40,633][model8_pretrain.py][INFO] Epoch:[0/2](435500/4588595) loss:2.747 lr:0.0000100 epoch_Time:26339.0min: [2024-01-04 15:41:40,633][model8_pretrain.py][INFO] Epoch:[0/2](435500/4588595) loss:2.857 lr:0.0000100 epoch_Time:26339.0min: [2024-01-04 15:41:40,633][model8_pretrain.py][INFO] Epoch:[0/2](435500/4588595) loss:3.265 lr:0.0000100 epoch_Time:26339.0min: [2024-01-04 15:41:40,633][model8_pretrain.py][INFO] Epoch:[0/2](435500/4588595) loss:1.930 lr:0.0000100 epoch_Time:26339.0min: [2024-01-04 15:41:40,633][model8_pretrain.py][INFO] 
Epoch:[0/2](435500/4588595) loss:2.863 lr:0.0000100 epoch_Time:26339.0min: [2024-01-04 15:41:40,633][model8_pretrain.py][INFO] Epoch:[0/2](435500/4588595) loss:2.756 lr:0.0000100 epoch_Time:26339.0min: [2024-01-04 15:41:40,633][model8_pretrain.py][INFO] Epoch:[0/2](435500/4588595) loss:2.704 lr:0.0000100 epoch_Time:26339.0min: [2024-01-04 15:41:40,633][model8_pretrain.py][INFO] Epoch:[0/2](435500/4588595) loss:3.185 lr:0.0000100 epoch_Time:26339.0min: [2024-01-04 15:42:17,574][model8_pretrain.py][INFO] Epoch:[0/2](435600/4588595) loss:2.662 lr:0.0000100 epoch_Time:26338.0min: [2024-01-04 15:42:17,574][model8_pretrain.py][INFO] Epoch:[0/2](435600/4588595) loss:1.822 lr:0.0000100 epoch_Time:26338.0min: [2024-01-04 15:42:17,574][model8_pretrain.py][INFO] Epoch:[0/2](435600/4588595) loss:2.796 lr:0.0000100 epoch_Time:26338.0min: [2024-01-04 15:42:17,574][model8_pretrain.py][INFO] Epoch:[0/2](435600/4588595) loss:2.593 lr:0.0000100 epoch_Time:26338.0min: [2024-01-04 15:42:17,574][model8_pretrain.py][INFO] Epoch:[0/2](435600/4588595) loss:2.740 lr:0.0000100 epoch_Time:26338.0min: [2024-01-04 15:42:17,574][model8_pretrain.py][INFO] Epoch:[0/2](435600/4588595) loss:3.141 lr:0.0000100 epoch_Time:26338.0min: [2024-01-04 15:42:17,574][model8_pretrain.py][INFO] Epoch:[0/2](435600/4588595) loss:2.491 lr:0.0000100 epoch_Time:26338.0min: [2024-01-04 15:42:17,574][model8_pretrain.py][INFO] Epoch:[0/2](435600/4588595) loss:3.122 lr:0.0000100 epoch_Time:26338.0min: [2024-01-04 15:42:54,518][model8_pretrain.py][INFO] Epoch:[0/2](435700/4588595) loss:2.383 lr:0.0000100 epoch_Time:26336.0min: [2024-01-04 15:42:54,518][model8_pretrain.py][INFO] Epoch:[0/2](435700/4588595) loss:3.506 lr:0.0000100 epoch_Time:26336.0min: [2024-01-04 15:42:54,518][model8_pretrain.py][INFO] Epoch:[0/2](435700/4588595) loss:3.160 lr:0.0000100 epoch_Time:26336.0min: [2024-01-04 15:42:54,518][model8_pretrain.py][INFO] Epoch:[0/2](435700/4588595) loss:3.083 lr:0.0000100 epoch_Time:26336.0min: [2024-01-04 
15:42:54,518][model8_pretrain.py][INFO] Epoch:[0/2](435700/4588595) loss:2.928 lr:0.0000100 epoch_Time:26336.0min: [2024-01-04 15:42:54,518][model8_pretrain.py][INFO] Epoch:[0/2](435700/4588595) loss:2.638 lr:0.0000100 epoch_Time:26336.0min: [2024-01-04 15:42:54,518][model8_pretrain.py][INFO] Epoch:[0/2](435700/4588595) loss:2.829 lr:0.0000100 epoch_Time:26336.0min: [2024-01-04 15:42:54,518][model8_pretrain.py][INFO] Epoch:[0/2](435700/4588595) loss:2.696 lr:0.0000100 epoch_Time:26336.0min: [2024-01-04 15:43:31,460][model8_pretrain.py][INFO] Epoch:[0/2](435800/4588595) loss:2.791 lr:0.0000100 epoch_Time:26336.0min: [2024-01-04 15:43:31,460][model8_pretrain.py][INFO] Epoch:[0/2](435800/4588595) loss:2.906 lr:0.0000100 epoch_Time:26336.0min: [2024-01-04 15:43:31,460][model8_pretrain.py][INFO] Epoch:[0/2](435800/4588595) loss:2.704 lr:0.0000100 epoch_Time:26336.0min: [2024-01-04 15:43:31,460][model8_pretrain.py][INFO] Epoch:[0/2](435800/4588595) loss:2.866 lr:0.0000100 epoch_Time:26336.0min: [2024-01-04 15:43:31,460][model8_pretrain.py][INFO] Epoch:[0/2](435800/4588595) loss:2.640 lr:0.0000100 epoch_Time:26336.0min: [2024-01-04 15:43:31,460][model8_pretrain.py][INFO] Epoch:[0/2](435800/4588595) loss:2.542 lr:0.0000100 epoch_Time:26336.0min: [2024-01-04 15:43:31,460][model8_pretrain.py][INFO] Epoch:[0/2](435800/4588595) loss:2.864 lr:0.0000100 epoch_Time:26336.0min: [2024-01-04 15:43:31,461][model8_pretrain.py][INFO] Epoch:[0/2](435800/4588595) loss:3.071 lr:0.0000100 epoch_Time:26336.0min: [2024-01-04 15:44:08,417][model8_pretrain.py][INFO] Epoch:[0/2](435900/4588595) loss:2.840 lr:0.0000100 epoch_Time:26335.0min: [2024-01-04 15:44:08,417][model8_pretrain.py][INFO] Epoch:[0/2](435900/4588595) loss:2.798 lr:0.0000100 epoch_Time:26335.0min: [2024-01-04 15:44:08,417][model8_pretrain.py][INFO] Epoch:[0/2](435900/4588595) loss:3.376 lr:0.0000100 epoch_Time:26335.0min: [2024-01-04 15:44:08,417][model8_pretrain.py][INFO] Epoch:[0/2](435900/4588595) loss:2.366 lr:0.0000100 
epoch_Time:26335.0min: [2024-01-04 15:44:08,417][model8_pretrain.py][INFO] Epoch:[0/2](435900/4588595) loss:2.684 lr:0.0000100 epoch_Time:26335.0min: [2024-01-04 15:44:08,417][model8_pretrain.py][INFO] Epoch:[0/2](435900/4588595) loss:2.612 lr:0.0000100 epoch_Time:26335.0min: [2024-01-04 15:44:08,417][model8_pretrain.py][INFO] Epoch:[0/2](435900/4588595) loss:3.053 lr:0.0000100 epoch_Time:26335.0min: [2024-01-04 15:44:08,417][model8_pretrain.py][INFO] Epoch:[0/2](435900/4588595) loss:2.965 lr:0.0000100 epoch_Time:26335.0min: [2024-01-04 15:44:45,365][model8_pretrain.py][INFO] Epoch:[0/2](436000/4588595) loss:2.472 lr:0.0000100 epoch_Time:26335.0min: [2024-01-04 15:44:45,365][model8_pretrain.py][INFO] Epoch:[0/2](436000/4588595) loss:2.815 lr:0.0000100 epoch_Time:26335.0min: [2024-01-04 15:44:45,366][model8_pretrain.py][INFO] Epoch:[0/2](436000/4588595) loss:2.750 lr:0.0000100 epoch_Time:26335.0min: [2024-01-04 15:44:45,366][model8_pretrain.py][INFO] Epoch:[0/2](436000/4588595) loss:2.650 lr:0.0000100 epoch_Time:26335.0min: [2024-01-04 15:44:45,366][model8_pretrain.py][INFO] Epoch:[0/2](436000/4588595) loss:3.007 lr:0.0000100 epoch_Time:26335.0min: [2024-01-04 15:44:45,366][model8_pretrain.py][INFO] Epoch:[0/2](436000/4588595) loss:2.863 lr:0.0000100 epoch_Time:26335.0min: [2024-01-04 15:44:45,366][model8_pretrain.py][INFO] Epoch:[0/2](436000/4588595) loss:2.893 lr:0.0000100 epoch_Time:26335.0min: [2024-01-04 15:44:45,366][model8_pretrain.py][INFO] Epoch:[0/2](436000/4588595) loss:2.766 lr:0.0000100 epoch_Time:26335.0min: [2024-01-04 15:45:22,314][model8_pretrain.py][INFO] Epoch:[0/2](436100/4588595) loss:1.872 lr:0.0000100 epoch_Time:26334.0min: [2024-01-04 15:45:22,314][model8_pretrain.py][INFO] Epoch:[0/2](436100/4588595) loss:2.902 lr:0.0000100 epoch_Time:26334.0min: [2024-01-04 15:45:22,314][model8_pretrain.py][INFO] Epoch:[0/2](436100/4588595) loss:3.310 lr:0.0000100 epoch_Time:26334.0min: [2024-01-04 15:45:22,314][model8_pretrain.py][INFO] 
Epoch:[0/2](436100/4588595) loss:2.810 lr:0.0000100 epoch_Time:26334.0min: [2024-01-04 15:45:22,314][model8_pretrain.py][INFO] Epoch:[0/2](436100/4588595) loss:2.943 lr:0.0000100 epoch_Time:26334.0min: [2024-01-04 15:45:22,314][model8_pretrain.py][INFO] Epoch:[0/2](436100/4588595) loss:2.684 lr:0.0000100 epoch_Time:26334.0min: [2024-01-04 15:45:22,314][model8_pretrain.py][INFO] Epoch:[0/2](436100/4588595) loss:2.894 lr:0.0000100 epoch_Time:26334.0min: [2024-01-04 15:45:22,314][model8_pretrain.py][INFO] Epoch:[0/2](436100/4588595) loss:2.946 lr:0.0000100 epoch_Time:26334.0min: [2024-01-04 15:46:09,645][model8_pretrain.py][INFO] Epoch:[0/2](436200/4588595) loss:2.976 lr:0.0000100 epoch_Time:26334.0min: [2024-01-04 15:46:09,645][model8_pretrain.py][INFO] Epoch:[0/2](436200/4588595) loss:2.962 lr:0.0000100 epoch_Time:26334.0min: [2024-01-04 15:46:09,645][model8_pretrain.py][INFO] Epoch:[0/2](436200/4588595) loss:3.124 lr:0.0000100 epoch_Time:26334.0min: [2024-01-04 15:46:09,645][model8_pretrain.py][INFO] Epoch:[0/2](436200/4588595) loss:2.854 lr:0.0000100 epoch_Time:26334.0min: [2024-01-04 15:46:09,645][model8_pretrain.py][INFO] Epoch:[0/2](436200/4588595) loss:2.480 lr:0.0000100 epoch_Time:26334.0min: [2024-01-04 15:46:09,645][model8_pretrain.py][INFO] Epoch:[0/2](436200/4588595) loss:2.878 lr:0.0000100 epoch_Time:26334.0min: [2024-01-04 15:46:09,646][model8_pretrain.py][INFO] Epoch:[0/2](436200/4588595) loss:2.888 lr:0.0000100 epoch_Time:26334.0min: [2024-01-04 15:46:09,645][model8_pretrain.py][INFO] Epoch:[0/2](436200/4588595) loss:2.982 lr:0.0000100 epoch_Time:26334.0min: [2024-01-04 15:46:46,581][model8_pretrain.py][INFO] Epoch:[0/2](436300/4588595) loss:3.126 lr:0.0000100 epoch_Time:26334.0min: [2024-01-04 15:46:46,581][model8_pretrain.py][INFO] Epoch:[0/2](436300/4588595) loss:3.228 lr:0.0000100 epoch_Time:26334.0min: [2024-01-04 15:46:46,581][model8_pretrain.py][INFO] Epoch:[0/2](436300/4588595) loss:2.786 lr:0.0000100 epoch_Time:26334.0min: [2024-01-04 
15:46:46,581][model8_pretrain.py][INFO] Epoch:[0/2](436300/4588595) loss:3.384 lr:0.0000100 epoch_Time:26334.0min: [2024-01-04 15:46:46,581][model8_pretrain.py][INFO] Epoch:[0/2](436300/4588595) loss:2.621 lr:0.0000100 epoch_Time:26334.0min: [2024-01-04 15:46:46,581][model8_pretrain.py][INFO] Epoch:[0/2](436300/4588595) loss:2.522 lr:0.0000100 epoch_Time:26334.0min: [2024-01-04 15:46:46,581][model8_pretrain.py][INFO] Epoch:[0/2](436300/4588595) loss:3.233 lr:0.0000100 epoch_Time:26334.0min: [2024-01-04 15:46:46,581][model8_pretrain.py][INFO] Epoch:[0/2](436300/4588595) loss:2.956 lr:0.0000100 epoch_Time:26334.0min: [2024-01-04 15:47:23,483][model8_pretrain.py][INFO] Epoch:[0/2](436400/4588595) loss:2.765 lr:0.0000100 epoch_Time:26333.0min: [2024-01-04 15:47:23,483][model8_pretrain.py][INFO] Epoch:[0/2](436400/4588595) loss:3.112 lr:0.0000100 epoch_Time:26333.0min: [2024-01-04 15:47:23,483][model8_pretrain.py][INFO] Epoch:[0/2](436400/4588595) loss:2.560 lr:0.0000100 epoch_Time:26333.0min: [2024-01-04 15:47:23,483][model8_pretrain.py][INFO] Epoch:[0/2](436400/4588595) loss:2.526 lr:0.0000100 epoch_Time:26333.0min: [2024-01-04 15:47:23,483][model8_pretrain.py][INFO] Epoch:[0/2](436400/4588595) loss:3.129 lr:0.0000100 epoch_Time:26333.0min: [2024-01-04 15:47:23,483][model8_pretrain.py][INFO] Epoch:[0/2](436400/4588595) loss:2.471 lr:0.0000100 epoch_Time:26333.0min: [2024-01-04 15:47:23,483][model8_pretrain.py][INFO] Epoch:[0/2](436400/4588595) loss:2.888 lr:0.0000100 epoch_Time:26333.0min: [2024-01-04 15:47:23,484][model8_pretrain.py][INFO] Epoch:[0/2](436400/4588595) loss:2.986 lr:0.0000100 epoch_Time:26333.0min: [2024-01-04 15:48:00,412][model8_pretrain.py][INFO] Epoch:[0/2](436500/4588595) loss:3.186 lr:0.0000100 epoch_Time:26332.0min: [2024-01-04 15:48:00,412][model8_pretrain.py][INFO] Epoch:[0/2](436500/4588595) loss:3.150 lr:0.0000100 epoch_Time:26332.0min: [2024-01-04 15:48:00,412][model8_pretrain.py][INFO] Epoch:[0/2](436500/4588595) loss:2.494 lr:0.0000100 
epoch_Time:26332.0min: [2024-01-04 15:48:00,412][model8_pretrain.py][INFO] Epoch:[0/2](436500/4588595) loss:2.745 lr:0.0000100 epoch_Time:26332.0min: [2024-01-04 15:48:00,412][model8_pretrain.py][INFO] Epoch:[0/2](436500/4588595) loss:2.528 lr:0.0000100 epoch_Time:26332.0min: [2024-01-04 15:48:00,412][model8_pretrain.py][INFO] Epoch:[0/2](436500/4588595) loss:3.311 lr:0.0000100 epoch_Time:26332.0min: [2024-01-04 15:48:00,412][model8_pretrain.py][INFO] Epoch:[0/2](436500/4588595) loss:2.843 lr:0.0000100 epoch_Time:26332.0min: [2024-01-04 15:48:00,412][model8_pretrain.py][INFO] Epoch:[0/2](436500/4588595) loss:3.063 lr:0.0000100 epoch_Time:26332.0min: [2024-01-04 15:48:37,355][model8_pretrain.py][INFO] Epoch:[0/2](436600/4588595) loss:3.199 lr:0.0000100 epoch_Time:26331.0min: [2024-01-04 15:48:37,355][model8_pretrain.py][INFO] Epoch:[0/2](436600/4588595) loss:2.495 lr:0.0000100 epoch_Time:26331.0min: [2024-01-04 15:48:37,355][model8_pretrain.py][INFO] Epoch:[0/2](436600/4588595) loss:3.347 lr:0.0000100 epoch_Time:26331.0min: [2024-01-04 15:48:37,355][model8_pretrain.py][INFO] Epoch:[0/2](436600/4588595) loss:2.927 lr:0.0000100 epoch_Time:26331.0min: [2024-01-04 15:48:37,355][model8_pretrain.py][INFO] Epoch:[0/2](436600/4588595) loss:2.596 lr:0.0000100 epoch_Time:26331.0min: [2024-01-04 15:48:37,355][model8_pretrain.py][INFO] Epoch:[0/2](436600/4588595) loss:2.930 lr:0.0000100 epoch_Time:26331.0min: [2024-01-04 15:48:37,355][model8_pretrain.py][INFO] Epoch:[0/2](436600/4588595) loss:2.566 lr:0.0000100 epoch_Time:26331.0min: [2024-01-04 15:48:37,355][model8_pretrain.py][INFO] Epoch:[0/2](436600/4588595) loss:3.067 lr:0.0000100 epoch_Time:26331.0min: [2024-01-04 15:49:14,303][model8_pretrain.py][INFO] Epoch:[0/2](436700/4588595) loss:2.749 lr:0.0000100 epoch_Time:26330.0min: [2024-01-04 15:49:14,303][model8_pretrain.py][INFO] Epoch:[0/2](436700/4588595) loss:3.126 lr:0.0000100 epoch_Time:26330.0min: [2024-01-04 15:49:14,303][model8_pretrain.py][INFO] 
Epoch:[0/2](436700/4588595) loss:2.839 lr:0.0000100 epoch_Time:26330.0min: [2024-01-04 15:49:14,303][model8_pretrain.py][INFO] Epoch:[0/2](436700/4588595) loss:2.550 lr:0.0000100 epoch_Time:26330.0min: [2024-01-04 15:49:14,303][model8_pretrain.py][INFO] Epoch:[0/2](436700/4588595) loss:3.056 lr:0.0000100 epoch_Time:26330.0min: [2024-01-04 15:49:14,303][model8_pretrain.py][INFO] Epoch:[0/2](436700/4588595) loss:2.587 lr:0.0000100 epoch_Time:26330.0min: [2024-01-04 15:49:14,303][model8_pretrain.py][INFO] Epoch:[0/2](436700/4588595) loss:2.690 lr:0.0000100 epoch_Time:26330.0min: [2024-01-04 15:49:14,303][model8_pretrain.py][INFO] Epoch:[0/2](436700/4588595) loss:3.469 lr:0.0000100 epoch_Time:26330.0min: [2024-01-04 15:49:51,239][model8_pretrain.py][INFO] Epoch:[0/2](436800/4588595) loss:3.218 lr:0.0000100 epoch_Time:26329.0min: [2024-01-04 15:49:51,239][model8_pretrain.py][INFO] Epoch:[0/2](436800/4588595) loss:3.126 lr:0.0000100 epoch_Time:26329.0min: [2024-01-04 15:49:51,239][model8_pretrain.py][INFO] Epoch:[0/2](436800/4588595) loss:2.960 lr:0.0000100 epoch_Time:26329.0min: [2024-01-04 15:49:51,239][model8_pretrain.py][INFO] Epoch:[0/2](436800/4588595) loss:2.827 lr:0.0000100 epoch_Time:26329.0min: [2024-01-04 15:49:51,239][model8_pretrain.py][INFO] Epoch:[0/2](436800/4588595) loss:2.679 lr:0.0000100 epoch_Time:26329.0min: [2024-01-04 15:49:51,240][model8_pretrain.py][INFO] Epoch:[0/2](436800/4588595) loss:2.692 lr:0.0000100 epoch_Time:26329.0min: [2024-01-04 15:49:51,240][model8_pretrain.py][INFO] Epoch:[0/2](436800/4588595) loss:2.730 lr:0.0000100 epoch_Time:26329.0min: [2024-01-04 15:49:51,240][model8_pretrain.py][INFO] Epoch:[0/2](436800/4588595) loss:2.849 lr:0.0000100 epoch_Time:26329.0min: [2024-01-04 15:50:28,197][model8_pretrain.py][INFO] Epoch:[0/2](436900/4588595) loss:2.963 lr:0.0000100 epoch_Time:26329.0min: [2024-01-04 15:50:28,197][model8_pretrain.py][INFO] Epoch:[0/2](436900/4588595) loss:2.970 lr:0.0000100 epoch_Time:26329.0min: [2024-01-04 
15:50:28,197][model8_pretrain.py][INFO] Epoch:[0/2](436900/4588595) loss:3.156 lr:0.0000100 epoch_Time:26329.0min: [2024-01-04 15:50:28,197][model8_pretrain.py][INFO] Epoch:[0/2](436900/4588595) loss:2.959 lr:0.0000100 epoch_Time:26329.0min: [2024-01-04 15:50:28,197][model8_pretrain.py][INFO] Epoch:[0/2](436900/4588595) loss:3.110 lr:0.0000100 epoch_Time:26329.0min: [2024-01-04 15:50:28,197][model8_pretrain.py][INFO] Epoch:[0/2](436900/4588595) loss:2.651 lr:0.0000100 epoch_Time:26329.0min: [2024-01-04 15:50:28,197][model8_pretrain.py][INFO] Epoch:[0/2](436900/4588595) loss:3.199 lr:0.0000100 epoch_Time:26329.0min: [2024-01-04 15:50:28,197][model8_pretrain.py][INFO] Epoch:[0/2](436900/4588595) loss:3.196 lr:0.0000100 epoch_Time:26329.0min: [2024-01-04 15:51:15,254][model8_pretrain.py][INFO] Epoch:[0/2](437000/4588595) loss:2.923 lr:0.0000100 epoch_Time:26329.0min: [2024-01-04 15:51:15,254][model8_pretrain.py][INFO] Epoch:[0/2](437000/4588595) loss:3.047 lr:0.0000100 epoch_Time:26329.0min: [2024-01-04 15:51:15,254][model8_pretrain.py][INFO] Epoch:[0/2](437000/4588595) loss:2.942 lr:0.0000100 epoch_Time:26329.0min: [2024-01-04 15:51:15,254][model8_pretrain.py][INFO] Epoch:[0/2](437000/4588595) loss:3.135 lr:0.0000100 epoch_Time:26329.0min: [2024-01-04 15:51:15,254][model8_pretrain.py][INFO] Epoch:[0/2](437000/4588595) loss:3.088 lr:0.0000100 epoch_Time:26329.0min: [2024-01-04 15:51:15,254][model8_pretrain.py][INFO] Epoch:[0/2](437000/4588595) loss:2.920 lr:0.0000100 epoch_Time:26329.0min: [2024-01-04 15:51:15,254][model8_pretrain.py][INFO] Epoch:[0/2](437000/4588595) loss:2.605 lr:0.0000100 epoch_Time:26329.0min: [2024-01-04 15:51:15,254][model8_pretrain.py][INFO] Epoch:[0/2](437000/4588595) loss:2.893 lr:0.0000100 epoch_Time:26329.0min: [2024-01-04 15:51:52,183][model8_pretrain.py][INFO] Epoch:[0/2](437100/4588595) loss:2.887 lr:0.0000100 epoch_Time:26328.0min: [2024-01-04 15:51:52,184][model8_pretrain.py][INFO] Epoch:[0/2](437100/4588595) loss:1.725 lr:0.0000100 
epoch_Time:26328.0min: [2024-01-04 15:51:52,184][model8_pretrain.py][INFO] Epoch:[0/2](437100/4588595) loss:2.918 lr:0.0000100 epoch_Time:26328.0min: [2024-01-04 15:51:52,184][model8_pretrain.py][INFO] Epoch:[0/2](437100/4588595) loss:3.070 lr:0.0000100 epoch_Time:26328.0min: [2024-01-04 15:51:52,184][model8_pretrain.py][INFO] Epoch:[0/2](437100/4588595) loss:2.831 lr:0.0000100 epoch_Time:26328.0min: [2024-01-04 15:51:52,184][model8_pretrain.py][INFO] Epoch:[0/2](437100/4588595) loss:2.858 lr:0.0000100 epoch_Time:26328.0min: [2024-01-04 15:51:52,184][model8_pretrain.py][INFO] Epoch:[0/2](437100/4588595) loss:2.638 lr:0.0000100 epoch_Time:26328.0min: [2024-01-04 15:51:52,184][model8_pretrain.py][INFO] Epoch:[0/2](437100/4588595) loss:3.383 lr:0.0000100 epoch_Time:26328.0min: [2024-01-04 15:52:29,126][model8_pretrain.py][INFO] Epoch:[0/2](437200/4588595) loss:2.619 lr:0.0000100 epoch_Time:26328.0min: [2024-01-04 15:52:29,126][model8_pretrain.py][INFO] Epoch:[0/2](437200/4588595) loss:2.856 lr:0.0000100 epoch_Time:26328.0min: [2024-01-04 15:52:29,126][model8_pretrain.py][INFO] Epoch:[0/2](437200/4588595) loss:2.931 lr:0.0000100 epoch_Time:26328.0min: [2024-01-04 15:52:29,126][model8_pretrain.py][INFO] Epoch:[0/2](437200/4588595) loss:3.069 lr:0.0000100 epoch_Time:26328.0min: [2024-01-04 15:52:29,126][model8_pretrain.py][INFO] Epoch:[0/2](437200/4588595) loss:2.768 lr:0.0000100 epoch_Time:26328.0min: [2024-01-04 15:52:29,126][model8_pretrain.py][INFO] Epoch:[0/2](437200/4588595) loss:3.008 lr:0.0000100 epoch_Time:26328.0min: [2024-01-04 15:52:29,126][model8_pretrain.py][INFO] Epoch:[0/2](437200/4588595) loss:3.423 lr:0.0000100 epoch_Time:26328.0min: [2024-01-04 15:52:29,126][model8_pretrain.py][INFO] Epoch:[0/2](437200/4588595) loss:2.511 lr:0.0000100 epoch_Time:26328.0min: [2024-01-04 15:53:06,048][model8_pretrain.py][INFO] Epoch:[0/2](437300/4588595) loss:3.015 lr:0.0000100 epoch_Time:26327.0min: [2024-01-04 15:53:06,048][model8_pretrain.py][INFO] 
Epoch:[0/2](437300/4588595) loss:3.044 lr:0.0000100 epoch_Time:26327.0min: [2024-01-04 15:53:06,048][model8_pretrain.py][INFO] Epoch:[0/2](437300/4588595) loss:2.810 lr:0.0000100 epoch_Time:26327.0min: [2024-01-04 15:53:06,048][model8_pretrain.py][INFO] Epoch:[0/2](437300/4588595) loss:3.170 lr:0.0000100 epoch_Time:26327.0min: [2024-01-04 15:53:06,048][model8_pretrain.py][INFO] Epoch:[0/2](437300/4588595) loss:2.620 lr:0.0000100 epoch_Time:26327.0min: [2024-01-04 15:53:06,048][model8_pretrain.py][INFO] Epoch:[0/2](437300/4588595) loss:3.083 lr:0.0000100 epoch_Time:26327.0min: [2024-01-04 15:53:06,048][model8_pretrain.py][INFO] Epoch:[0/2](437300/4588595) loss:3.020 lr:0.0000100 epoch_Time:26327.0min: [2024-01-04 15:53:06,048][model8_pretrain.py][INFO] Epoch:[0/2](437300/4588595) loss:3.362 lr:0.0000100 epoch_Time:26327.0min: [2024-01-04 15:53:42,989][model8_pretrain.py][INFO] Epoch:[0/2](437400/4588595) loss:2.954 lr:0.0000100 epoch_Time:26327.0min: [2024-01-04 15:53:42,989][model8_pretrain.py][INFO] Epoch:[0/2](437400/4588595) loss:2.651 lr:0.0000100 epoch_Time:26327.0min: [2024-01-04 15:53:42,989][model8_pretrain.py][INFO] Epoch:[0/2](437400/4588595) loss:2.863 lr:0.0000100 epoch_Time:26327.0min: [2024-01-04 15:53:42,989][model8_pretrain.py][INFO] Epoch:[0/2](437400/4588595) loss:2.638 lr:0.0000100 epoch_Time:26327.0min: [2024-01-04 15:53:42,989][model8_pretrain.py][INFO] Epoch:[0/2](437400/4588595) loss:3.136 lr:0.0000100 epoch_Time:26327.0min: [2024-01-04 15:53:42,989][model8_pretrain.py][INFO] Epoch:[0/2](437400/4588595) loss:3.147 lr:0.0000100 epoch_Time:26327.0min: [2024-01-04 15:53:42,989][model8_pretrain.py][INFO] Epoch:[0/2](437400/4588595) loss:2.853 lr:0.0000100 epoch_Time:26327.0min: [2024-01-04 15:53:42,989][model8_pretrain.py][INFO] Epoch:[0/2](437400/4588595) loss:2.951 lr:0.0000100 epoch_Time:26327.0min: [2024-01-04 15:54:19,933][model8_pretrain.py][INFO] Epoch:[0/2](437500/4588595) loss:2.621 lr:0.0000100 epoch_Time:26325.0min: [2024-01-04 
15:54:19,933][model8_pretrain.py][INFO] Epoch:[0/2](437500/4588595) loss:2.902 lr:0.0000100 epoch_Time:26325.0min: [2024-01-04 15:54:19,933][model8_pretrain.py][INFO] Epoch:[0/2](437500/4588595) loss:2.836 lr:0.0000100 epoch_Time:26325.0min: [2024-01-04 15:54:19,933][model8_pretrain.py][INFO] Epoch:[0/2](437500/4588595) loss:2.736 lr:0.0000100 epoch_Time:26325.0min: [2024-01-04 15:54:19,933][model8_pretrain.py][INFO] Epoch:[0/2](437500/4588595) loss:3.208 lr:0.0000100 epoch_Time:26325.0min: [2024-01-04 15:54:19,933][model8_pretrain.py][INFO] Epoch:[0/2](437500/4588595) loss:2.672 lr:0.0000100 epoch_Time:26325.0min: [2024-01-04 15:54:19,933][model8_pretrain.py][INFO] Epoch:[0/2](437500/4588595) loss:2.364 lr:0.0000100 epoch_Time:26325.0min: [2024-01-04 15:54:19,933][model8_pretrain.py][INFO] Epoch:[0/2](437500/4588595) loss:3.249 lr:0.0000100 epoch_Time:26325.0min: [2024-01-04 15:54:56,892][model8_pretrain.py][INFO] Epoch:[0/2](437600/4588595) loss:2.920 lr:0.0000100 epoch_Time:26324.0min: [2024-01-04 15:54:56,892][model8_pretrain.py][INFO] Epoch:[0/2](437600/4588595) loss:2.502 lr:0.0000100 epoch_Time:26324.0min: [2024-01-04 15:54:56,892][model8_pretrain.py][INFO] Epoch:[0/2](437600/4588595) loss:2.381 lr:0.0000100 epoch_Time:26324.0min: [2024-01-04 15:54:56,892][model8_pretrain.py][INFO] Epoch:[0/2](437600/4588595) loss:2.929 lr:0.0000100 epoch_Time:26324.0min: [2024-01-04 15:54:56,892][model8_pretrain.py][INFO] Epoch:[0/2](437600/4588595) loss:2.696 lr:0.0000100 epoch_Time:26324.0min: [2024-01-04 15:54:56,892][model8_pretrain.py][INFO] Epoch:[0/2](437600/4588595) loss:3.478 lr:0.0000100 epoch_Time:26324.0min: [2024-01-04 15:54:56,892][model8_pretrain.py][INFO] Epoch:[0/2](437600/4588595) loss:2.543 lr:0.0000100 epoch_Time:26324.0min: [2024-01-04 15:54:56,892][model8_pretrain.py][INFO] Epoch:[0/2](437600/4588595) loss:2.095 lr:0.0000100 epoch_Time:26324.0min: [2024-01-04 15:55:33,839][model8_pretrain.py][INFO] Epoch:[0/2](437700/4588595) loss:2.864 lr:0.0000100 
epoch_Time:26324.0min: [2024-01-04 15:55:33,839][model8_pretrain.py][INFO] Epoch:[0/2](437700/4588595) loss:2.975 lr:0.0000100 epoch_Time:26324.0min: [2024-01-04 15:55:33,839][model8_pretrain.py][INFO] Epoch:[0/2](437700/4588595) loss:2.801 lr:0.0000100 epoch_Time:26324.0min: [2024-01-04 15:55:33,839][model8_pretrain.py][INFO] Epoch:[0/2](437700/4588595) loss:2.717 lr:0.0000100 epoch_Time:26324.0min: [2024-01-04 15:55:33,839][model8_pretrain.py][INFO] Epoch:[0/2](437700/4588595) loss:2.763 lr:0.0000100 epoch_Time:26324.0min: [2024-01-04 15:55:33,839][model8_pretrain.py][INFO] Epoch:[0/2](437700/4588595) loss:2.609 lr:0.0000100 epoch_Time:26324.0min: [2024-01-04 15:55:33,839][model8_pretrain.py][INFO] Epoch:[0/2](437700/4588595) loss:2.987 lr:0.0000100 epoch_Time:26324.0min: [2024-01-04 15:55:33,839][model8_pretrain.py][INFO] Epoch:[0/2](437700/4588595) loss:2.850 lr:0.0000100 epoch_Time:26324.0min: [2024-01-04 15:56:20,702][model8_pretrain.py][INFO] Epoch:[0/2](437800/4588595) loss:2.733 lr:0.0000100 epoch_Time:26325.0min: [2024-01-04 15:56:20,702][model8_pretrain.py][INFO] Epoch:[0/2](437800/4588595) loss:2.693 lr:0.0000100 epoch_Time:26325.0min: [2024-01-04 15:56:20,702][model8_pretrain.py][INFO] Epoch:[0/2](437800/4588595) loss:2.631 lr:0.0000100 epoch_Time:26325.0min: [2024-01-04 15:56:20,702][model8_pretrain.py][INFO] Epoch:[0/2](437800/4588595) loss:2.763 lr:0.0000100 epoch_Time:26325.0min: [2024-01-04 15:56:20,702][model8_pretrain.py][INFO] Epoch:[0/2](437800/4588595) loss:2.512 lr:0.0000100 epoch_Time:26325.0min: [2024-01-04 15:56:20,702][model8_pretrain.py][INFO] Epoch:[0/2](437800/4588595) loss:2.630 lr:0.0000100 epoch_Time:26325.0min: [2024-01-04 15:56:20,703][model8_pretrain.py][INFO] Epoch:[0/2](437800/4588595) loss:3.219 lr:0.0000100 epoch_Time:26325.0min: [2024-01-04 15:56:20,703][model8_pretrain.py][INFO] Epoch:[0/2](437800/4588595) loss:2.997 lr:0.0000100 epoch_Time:26325.0min: [2024-01-04 15:56:57,643][model8_pretrain.py][INFO] 
Epoch:[0/2](437900/4588595) loss:3.056 lr:0.0000100 epoch_Time:26323.0min: [2024-01-04 15:56:57,643][model8_pretrain.py][INFO] Epoch:[0/2](437900/4588595) loss:2.573 lr:0.0000100 epoch_Time:26323.0min: [2024-01-04 15:56:57,643][model8_pretrain.py][INFO] Epoch:[0/2](437900/4588595) loss:2.941 lr:0.0000100 epoch_Time:26323.0min: [2024-01-04 15:56:57,643][model8_pretrain.py][INFO] Epoch:[0/2](437900/4588595) loss:2.575 lr:0.0000100 epoch_Time:26323.0min: [2024-01-04 15:56:57,643][model8_pretrain.py][INFO] Epoch:[0/2](437900/4588595) loss:3.091 lr:0.0000100 epoch_Time:26323.0min: [2024-01-04 15:56:57,643][model8_pretrain.py][INFO] Epoch:[0/2](437900/4588595) loss:3.260 lr:0.0000100 epoch_Time:26323.0min: [2024-01-04 15:56:57,643][model8_pretrain.py][INFO] Epoch:[0/2](437900/4588595) loss:2.701 lr:0.0000100 epoch_Time:26323.0min: [2024-01-04 15:56:57,643][model8_pretrain.py][INFO] Epoch:[0/2](437900/4588595) loss:2.637 lr:0.0000100 epoch_Time:26323.0min: [2024-01-04 15:57:34,589][model8_pretrain.py][INFO] Epoch:[0/2](438000/4588595) loss:2.354 lr:0.0000100 epoch_Time:26323.0min: [2024-01-04 15:57:34,590][model8_pretrain.py][INFO] Epoch:[0/2](438000/4588595) loss:2.650 lr:0.0000100 epoch_Time:26323.0min: [2024-01-04 15:57:34,590][model8_pretrain.py][INFO] Epoch:[0/2](438000/4588595) loss:2.942 lr:0.0000100 epoch_Time:26323.0min: [2024-01-04 15:57:34,590][model8_pretrain.py][INFO] Epoch:[0/2](438000/4588595) loss:3.155 lr:0.0000100 epoch_Time:26323.0min: [2024-01-04 15:57:34,590][model8_pretrain.py][INFO] Epoch:[0/2](438000/4588595) loss:2.498 lr:0.0000100 epoch_Time:26323.0min: [2024-01-04 15:57:34,590][model8_pretrain.py][INFO] Epoch:[0/2](438000/4588595) loss:2.648 lr:0.0000100 epoch_Time:26323.0min: [2024-01-04 15:57:34,590][model8_pretrain.py][INFO] Epoch:[0/2](438000/4588595) loss:3.061 lr:0.0000100 epoch_Time:26323.0min: [2024-01-04 15:57:34,590][model8_pretrain.py][INFO] Epoch:[0/2](438000/4588595) loss:2.958 lr:0.0000100 epoch_Time:26323.0min: [2024-01-04 
15:58:11,534][model8_pretrain.py][INFO] Epoch:[0/2](438100/4588595) loss:2.855 lr:0.0000100 epoch_Time:26322.0min: [2024-01-04 15:58:11,534][model8_pretrain.py][INFO] Epoch:[0/2](438100/4588595) loss:2.974 lr:0.0000100 epoch_Time:26322.0min: [2024-01-04 15:58:11,534][model8_pretrain.py][INFO] Epoch:[0/2](438100/4588595) loss:3.069 lr:0.0000100 epoch_Time:26322.0min: [2024-01-04 15:58:11,534][model8_pretrain.py][INFO] Epoch:[0/2](438100/4588595) loss:2.997 lr:0.0000100 epoch_Time:26322.0min: [2024-01-04 15:58:11,534][model8_pretrain.py][INFO] Epoch:[0/2](438100/4588595) loss:2.143 lr:0.0000100 epoch_Time:26322.0min: [2024-01-04 15:58:11,534][model8_pretrain.py][INFO] Epoch:[0/2](438100/4588595) loss:3.086 lr:0.0000100 epoch_Time:26322.0min: [2024-01-04 15:58:11,534][model8_pretrain.py][INFO] Epoch:[0/2](438100/4588595) loss:2.610 lr:0.0000100 epoch_Time:26322.0min: [2024-01-04 15:58:11,534][model8_pretrain.py][INFO] Epoch:[0/2](438100/4588595) loss:2.994 lr:0.0000100 epoch_Time:26322.0min: [2024-01-04 15:58:48,472][model8_pretrain.py][INFO] Epoch:[0/2](438200/4588595) loss:3.296 lr:0.0000100 epoch_Time:26321.0min: [2024-01-04 15:58:48,472][model8_pretrain.py][INFO] Epoch:[0/2](438200/4588595) loss:3.181 lr:0.0000100 epoch_Time:26321.0min: [2024-01-04 15:58:48,472][model8_pretrain.py][INFO] Epoch:[0/2](438200/4588595) loss:3.233 lr:0.0000100 epoch_Time:26321.0min: [2024-01-04 15:58:48,472][model8_pretrain.py][INFO] Epoch:[0/2](438200/4588595) loss:3.051 lr:0.0000100 epoch_Time:26321.0min: [2024-01-04 15:58:48,472][model8_pretrain.py][INFO] Epoch:[0/2](438200/4588595) loss:2.386 lr:0.0000100 epoch_Time:26321.0min: [2024-01-04 15:58:48,472][model8_pretrain.py][INFO] Epoch:[0/2](438200/4588595) loss:2.829 lr:0.0000100 epoch_Time:26321.0min: [2024-01-04 15:58:48,472][model8_pretrain.py][INFO] Epoch:[0/2](438200/4588595) loss:3.178 lr:0.0000100 epoch_Time:26321.0min: [2024-01-04 15:58:48,472][model8_pretrain.py][INFO] Epoch:[0/2](438200/4588595) loss:2.848 lr:0.0000100 
epoch_Time:26321.0min: [2024-01-04 15:59:25,436][model8_pretrain.py][INFO] Epoch:[0/2](438300/4588595) loss:2.239 lr:0.0000100 epoch_Time:26321.0min: [2024-01-04 15:59:25,436][model8_pretrain.py][INFO] Epoch:[0/2](438300/4588595) loss:2.886 lr:0.0000100 epoch_Time:26321.0min: [2024-01-04 15:59:25,436][model8_pretrain.py][INFO] Epoch:[0/2](438300/4588595) loss:2.826 lr:0.0000100 epoch_Time:26321.0min: [2024-01-04 15:59:25,436][model8_pretrain.py][INFO] Epoch:[0/2](438300/4588595) loss:2.901 lr:0.0000100 epoch_Time:26321.0min: [2024-01-04 15:59:25,436][model8_pretrain.py][INFO] Epoch:[0/2](438300/4588595) loss:3.006 lr:0.0000100 epoch_Time:26321.0min: [2024-01-04 15:59:25,436][model8_pretrain.py][INFO] Epoch:[0/2](438300/4588595) loss:2.655 lr:0.0000100 epoch_Time:26321.0min: [2024-01-04 15:59:25,436][model8_pretrain.py][INFO] Epoch:[0/2](438300/4588595) loss:3.204 lr:0.0000100 epoch_Time:26321.0min: [2024-01-04 15:59:25,436][model8_pretrain.py][INFO] Epoch:[0/2](438300/4588595) loss:3.316 lr:0.0000100 epoch_Time:26321.0min: [2024-01-04 16:00:02,357][model8_pretrain.py][INFO] Epoch:[0/2](438400/4588595) loss:2.347 lr:0.0000100 epoch_Time:26319.0min: [2024-01-04 16:00:02,357][model8_pretrain.py][INFO] Epoch:[0/2](438400/4588595) loss:2.789 lr:0.0000100 epoch_Time:26319.0min: [2024-01-04 16:00:02,357][model8_pretrain.py][INFO] Epoch:[0/2](438400/4588595) loss:3.326 lr:0.0000100 epoch_Time:26319.0min: [2024-01-04 16:00:02,357][model8_pretrain.py][INFO] Epoch:[0/2](438400/4588595) loss:3.235 lr:0.0000100 epoch_Time:26319.0min: [2024-01-04 16:00:02,357][model8_pretrain.py][INFO] Epoch:[0/2](438400/4588595) loss:3.145 lr:0.0000100 epoch_Time:26319.0min: [2024-01-04 16:00:02,357][model8_pretrain.py][INFO] Epoch:[0/2](438400/4588595) loss:3.025 lr:0.0000100 epoch_Time:26319.0min: [2024-01-04 16:00:02,357][model8_pretrain.py][INFO] Epoch:[0/2](438400/4588595) loss:2.961 lr:0.0000100 epoch_Time:26319.0min: [2024-01-04 16:00:02,357][model8_pretrain.py][INFO] 
Epoch:[0/2](438400/4588595) loss:2.725 lr:0.0000100 epoch_Time:26319.0min: [2024-01-04 16:00:39,299][model8_pretrain.py][INFO] Epoch:[0/2](438500/4588595) loss:3.042 lr:0.0000100 epoch_Time:26319.0min: [2024-01-04 16:00:39,299][model8_pretrain.py][INFO] Epoch:[0/2](438500/4588595) loss:2.803 lr:0.0000100 epoch_Time:26319.0min: [2024-01-04 16:00:39,299][model8_pretrain.py][INFO] Epoch:[0/2](438500/4588595) loss:2.588 lr:0.0000100 epoch_Time:26319.0min: [2024-01-04 16:00:39,299][model8_pretrain.py][INFO] Epoch:[0/2](438500/4588595) loss:2.916 lr:0.0000100 epoch_Time:26319.0min: [2024-01-04 16:00:39,299][model8_pretrain.py][INFO] Epoch:[0/2](438500/4588595) loss:2.685 lr:0.0000100 epoch_Time:26319.0min: [2024-01-04 16:00:39,299][model8_pretrain.py][INFO] Epoch:[0/2](438500/4588595) loss:3.402 lr:0.0000100 epoch_Time:26319.0min: [2024-01-04 16:00:39,299][model8_pretrain.py][INFO] Epoch:[0/2](438500/4588595) loss:3.255 lr:0.0000100 epoch_Time:26319.0min: [2024-01-04 16:00:39,299][model8_pretrain.py][INFO] Epoch:[0/2](438500/4588595) loss:3.026 lr:0.0000100 epoch_Time:26319.0min: [2024-01-04 16:01:26,134][model8_pretrain.py][INFO] Epoch:[0/2](438600/4588595) loss:2.865 lr:0.0000100 epoch_Time:26320.0min: [2024-01-04 16:01:26,134][model8_pretrain.py][INFO] Epoch:[0/2](438600/4588595) loss:2.658 lr:0.0000100 epoch_Time:26320.0min: [2024-01-04 16:01:26,134][model8_pretrain.py][INFO] Epoch:[0/2](438600/4588595) loss:2.869 lr:0.0000100 epoch_Time:26320.0min: [2024-01-04 16:01:26,134][model8_pretrain.py][INFO] Epoch:[0/2](438600/4588595) loss:2.801 lr:0.0000100 epoch_Time:26320.0min: [2024-01-04 16:01:26,134][model8_pretrain.py][INFO] Epoch:[0/2](438600/4588595) loss:2.877 lr:0.0000100 epoch_Time:26320.0min: [2024-01-04 16:01:26,134][model8_pretrain.py][INFO] Epoch:[0/2](438600/4588595) loss:2.739 lr:0.0000100 epoch_Time:26320.0min: [2024-01-04 16:01:26,134][model8_pretrain.py][INFO] Epoch:[0/2](438600/4588595) loss:2.189 lr:0.0000100 epoch_Time:26320.0min: [2024-01-04 
16:01:26,134][model8_pretrain.py][INFO] Epoch:[0/2](438600/4588595) loss:3.249 lr:0.0000100 epoch_Time:26320.0min: [2024-01-04 16:02:03,082][model8_pretrain.py][INFO] Epoch:[0/2](438700/4588595) loss:2.849 lr:0.0000100 epoch_Time:26319.0min: [2024-01-04 16:02:03,082][model8_pretrain.py][INFO] Epoch:[0/2](438700/4588595) loss:2.529 lr:0.0000100 epoch_Time:26319.0min: [2024-01-04 16:02:03,083][model8_pretrain.py][INFO] Epoch:[0/2](438700/4588595) loss:3.627 lr:0.0000100 epoch_Time:26319.0min: [2024-01-04 16:02:03,083][model8_pretrain.py][INFO] Epoch:[0/2](438700/4588595) loss:2.724 lr:0.0000100 epoch_Time:26319.0min: [2024-01-04 16:02:03,083][model8_pretrain.py][INFO] Epoch:[0/2](438700/4588595) loss:2.859 lr:0.0000100 epoch_Time:26319.0min: [2024-01-04 16:02:03,083][model8_pretrain.py][INFO] Epoch:[0/2](438700/4588595) loss:2.448 lr:0.0000100 epoch_Time:26319.0min: [2024-01-04 16:02:03,083][model8_pretrain.py][INFO] Epoch:[0/2](438700/4588595) loss:2.816 lr:0.0000100 epoch_Time:26319.0min: [2024-01-04 16:02:03,083][model8_pretrain.py][INFO] Epoch:[0/2](438700/4588595) loss:3.238 lr:0.0000100 epoch_Time:26319.0min: [2024-01-04 16:02:40,026][model8_pretrain.py][INFO] Epoch:[0/2](438800/4588595) loss:2.586 lr:0.0000100 epoch_Time:26318.0min: [2024-01-04 16:02:40,026][model8_pretrain.py][INFO] Epoch:[0/2](438800/4588595) loss:2.951 lr:0.0000100 epoch_Time:26318.0min: [2024-01-04 16:02:40,026][model8_pretrain.py][INFO] Epoch:[0/2](438800/4588595) loss:3.159 lr:0.0000100 epoch_Time:26318.0min: [2024-01-04 16:02:40,026][model8_pretrain.py][INFO] Epoch:[0/2](438800/4588595) loss:3.634 lr:0.0000100 epoch_Time:26318.0min: [2024-01-04 16:02:40,026][model8_pretrain.py][INFO] Epoch:[0/2](438800/4588595) loss:2.063 lr:0.0000100 epoch_Time:26318.0min: [2024-01-04 16:02:40,026][model8_pretrain.py][INFO] Epoch:[0/2](438800/4588595) loss:3.223 lr:0.0000100 epoch_Time:26318.0min: [2024-01-04 16:02:40,026][model8_pretrain.py][INFO] Epoch:[0/2](438800/4588595) loss:3.261 lr:0.0000100 
epoch_Time:26318.0min: [2024-01-04 16:02:40,026][model8_pretrain.py][INFO] Epoch:[0/2](438800/4588595) loss:2.968 lr:0.0000100 epoch_Time:26318.0min: [2024-01-04 16:03:16,971][model8_pretrain.py][INFO] Epoch:[0/2](438900/4588595) loss:3.082 lr:0.0000100 epoch_Time:26317.0min: [2024-01-04 16:03:16,971][model8_pretrain.py][INFO] Epoch:[0/2](438900/4588595) loss:2.932 lr:0.0000100 epoch_Time:26317.0min: [2024-01-04 16:03:16,971][model8_pretrain.py][INFO] Epoch:[0/2](438900/4588595) loss:3.165 lr:0.0000100 epoch_Time:26317.0min: [2024-01-04 16:03:16,971][model8_pretrain.py][INFO] Epoch:[0/2](438900/4588595) loss:3.035 lr:0.0000100 epoch_Time:26317.0min: [2024-01-04 16:03:16,971][model8_pretrain.py][INFO] Epoch:[0/2](438900/4588595) loss:2.301 lr:0.0000100 epoch_Time:26317.0min: [2024-01-04 16:03:16,971][model8_pretrain.py][INFO] Epoch:[0/2](438900/4588595) loss:3.619 lr:0.0000100 epoch_Time:26317.0min: [2024-01-04 16:03:16,971][model8_pretrain.py][INFO] Epoch:[0/2](438900/4588595) loss:2.896 lr:0.0000100 epoch_Time:26317.0min: [2024-01-04 16:03:16,971][model8_pretrain.py][INFO] Epoch:[0/2](438900/4588595) loss:2.740 lr:0.0000100 epoch_Time:26317.0min: [2024-01-04 16:03:53,901][model8_pretrain.py][INFO] Epoch:[0/2](439000/4588595) loss:3.133 lr:0.0000100 epoch_Time:26316.0min: [2024-01-04 16:03:53,901][model8_pretrain.py][INFO] Epoch:[0/2](439000/4588595) loss:2.725 lr:0.0000100 epoch_Time:26316.0min: [2024-01-04 16:03:53,901][model8_pretrain.py][INFO] Epoch:[0/2](439000/4588595) loss:3.353 lr:0.0000100 epoch_Time:26316.0min: [2024-01-04 16:03:53,901][model8_pretrain.py][INFO] Epoch:[0/2](439000/4588595) loss:3.073 lr:0.0000100 epoch_Time:26316.0min: [2024-01-04 16:03:53,901][model8_pretrain.py][INFO] Epoch:[0/2](439000/4588595) loss:2.804 lr:0.0000100 epoch_Time:26316.0min: [2024-01-04 16:03:53,901][model8_pretrain.py][INFO] Epoch:[0/2](439000/4588595) loss:2.458 lr:0.0000100 epoch_Time:26316.0min: [2024-01-04 16:03:53,901][model8_pretrain.py][INFO] 
Epoch:[0/2](439000/4588595) loss:2.966 lr:0.0000100 epoch_Time:26316.0min: [2024-01-04 16:03:53,901][model8_pretrain.py][INFO] Epoch:[0/2](439000/4588595) loss:2.903 lr:0.0000100 epoch_Time:26316.0min: [2024-01-04 16:04:30,835][model8_pretrain.py][INFO] Epoch:[0/2](439100/4588595) loss:2.782 lr:0.0000100 epoch_Time:26316.0min: [2024-01-04 16:04:30,835][model8_pretrain.py][INFO] Epoch:[0/2](439100/4588595) loss:2.985 lr:0.0000100 epoch_Time:26316.0min: [2024-01-04 16:04:30,835][model8_pretrain.py][INFO] Epoch:[0/2](439100/4588595) loss:2.960 lr:0.0000100 epoch_Time:26316.0min: [2024-01-04 16:04:30,835][model8_pretrain.py][INFO] Epoch:[0/2](439100/4588595) loss:2.734 lr:0.0000100 epoch_Time:26316.0min: [2024-01-04 16:04:30,835][model8_pretrain.py][INFO] Epoch:[0/2](439100/4588595) loss:3.207 lr:0.0000100 epoch_Time:26316.0min: [2024-01-04 16:04:30,835][model8_pretrain.py][INFO] Epoch:[0/2](439100/4588595) loss:3.404 lr:0.0000100 epoch_Time:26316.0min: [2024-01-04 16:04:30,835][model8_pretrain.py][INFO] Epoch:[0/2](439100/4588595) loss:2.932 lr:0.0000100 epoch_Time:26316.0min: [2024-01-04 16:04:30,835][model8_pretrain.py][INFO] Epoch:[0/2](439100/4588595) loss:3.122 lr:0.0000100 epoch_Time:26316.0min: [2024-01-04 16:05:07,770][model8_pretrain.py][INFO] Epoch:[0/2](439200/4588595) loss:3.296 lr:0.0000100 epoch_Time:26315.0min: [2024-01-04 16:05:07,770][model8_pretrain.py][INFO] Epoch:[0/2](439200/4588595) loss:2.200 lr:0.0000100 epoch_Time:26315.0min: [2024-01-04 16:05:07,770][model8_pretrain.py][INFO] Epoch:[0/2](439200/4588595) loss:2.952 lr:0.0000100 epoch_Time:26315.0min: [2024-01-04 16:05:07,770][model8_pretrain.py][INFO] Epoch:[0/2](439200/4588595) loss:2.988 lr:0.0000100 epoch_Time:26315.0min: [2024-01-04 16:05:07,770][model8_pretrain.py][INFO] Epoch:[0/2](439200/4588595) loss:2.653 lr:0.0000100 epoch_Time:26315.0min: [2024-01-04 16:05:07,770][model8_pretrain.py][INFO] Epoch:[0/2](439200/4588595) loss:3.036 lr:0.0000100 epoch_Time:26315.0min: [2024-01-04 
16:05:07,770][model8_pretrain.py][INFO] Epoch:[0/2](439200/4588595) loss:2.827 lr:0.0000100 epoch_Time:26315.0min: [2024-01-04 16:05:07,771][model8_pretrain.py][INFO] Epoch:[0/2](439200/4588595) loss:3.125 lr:0.0000100 epoch_Time:26315.0min: [2024-01-04 16:05:44,720][model8_pretrain.py][INFO] Epoch:[0/2](439300/4588595) loss:2.693 lr:0.0000100 epoch_Time:26314.0min: [2024-01-04 16:05:44,720][model8_pretrain.py][INFO] Epoch:[0/2](439300/4588595) loss:2.424 lr:0.0000100 epoch_Time:26314.0min: [2024-01-04 16:05:44,720][model8_pretrain.py][INFO] Epoch:[0/2](439300/4588595) loss:3.069 lr:0.0000100 epoch_Time:26314.0min: [2024-01-04 16:05:44,720][model8_pretrain.py][INFO] Epoch:[0/2](439300/4588595) loss:3.050 lr:0.0000100 epoch_Time:26314.0min: [2024-01-04 16:05:44,720][model8_pretrain.py][INFO] Epoch:[0/2](439300/4588595) loss:3.031 lr:0.0000100 epoch_Time:26314.0min: [2024-01-04 16:05:44,720][model8_pretrain.py][INFO] Epoch:[0/2](439300/4588595) loss:2.678 lr:0.0000100 epoch_Time:26314.0min: [2024-01-04 16:05:44,720][model8_pretrain.py][INFO] Epoch:[0/2](439300/4588595) loss:3.078 lr:0.0000100 epoch_Time:26314.0min: [2024-01-04 16:05:44,720][model8_pretrain.py][INFO] Epoch:[0/2](439300/4588595) loss:3.102 lr:0.0000100 epoch_Time:26314.0min: [2024-01-04 16:06:31,531][model8_pretrain.py][INFO] Epoch:[0/2](439400/4588595) loss:2.979 lr:0.0000100 epoch_Time:26315.0min: [2024-01-04 16:06:31,531][model8_pretrain.py][INFO] Epoch:[0/2](439400/4588595) loss:3.247 lr:0.0000100 epoch_Time:26315.0min: [2024-01-04 16:06:31,531][model8_pretrain.py][INFO] Epoch:[0/2](439400/4588595) loss:3.067 lr:0.0000100 epoch_Time:26315.0min: [2024-01-04 16:06:31,531][model8_pretrain.py][INFO] Epoch:[0/2](439400/4588595) loss:2.714 lr:0.0000100 epoch_Time:26315.0min: [2024-01-04 16:06:31,531][model8_pretrain.py][INFO] Epoch:[0/2](439400/4588595) loss:2.792 lr:0.0000100 epoch_Time:26315.0min: [2024-01-04 16:06:31,531][model8_pretrain.py][INFO] Epoch:[0/2](439400/4588595) loss:2.332 lr:0.0000100 
epoch_Time:26315.0min: [2024-01-04 16:06:31,531][model8_pretrain.py][INFO] Epoch:[0/2](439400/4588595) loss:2.614 lr:0.0000100 epoch_Time:26315.0min: [2024-01-04 16:06:31,531][model8_pretrain.py][INFO] Epoch:[0/2](439400/4588595) loss:3.074 lr:0.0000100 epoch_Time:26315.0min: [2024-01-04 16:07:08,472][model8_pretrain.py][INFO] Epoch:[0/2](439500/4588595) loss:3.143 lr:0.0000100 epoch_Time:26314.0min: [2024-01-04 16:07:08,472][model8_pretrain.py][INFO] Epoch:[0/2](439500/4588595) loss:2.656 lr:0.0000100 epoch_Time:26314.0min: [2024-01-04 16:07:08,472][model8_pretrain.py][INFO] Epoch:[0/2](439500/4588595) loss:2.531 lr:0.0000100 epoch_Time:26314.0min: [2024-01-04 16:07:08,472][model8_pretrain.py][INFO] Epoch:[0/2](439500/4588595) loss:2.761 lr:0.0000100 epoch_Time:26314.0min: [2024-01-04 16:07:08,472][model8_pretrain.py][INFO] Epoch:[0/2](439500/4588595) loss:3.012 lr:0.0000100 epoch_Time:26314.0min: [2024-01-04 16:07:08,472][model8_pretrain.py][INFO] Epoch:[0/2](439500/4588595) loss:2.728 lr:0.0000100 epoch_Time:26314.0min: [2024-01-04 16:07:08,472][model8_pretrain.py][INFO] Epoch:[0/2](439500/4588595) loss:3.229 lr:0.0000100 epoch_Time:26314.0min: [2024-01-04 16:07:08,472][model8_pretrain.py][INFO] Epoch:[0/2](439500/4588595) loss:3.206 lr:0.0000100 epoch_Time:26314.0min: [2024-01-04 16:07:45,401][model8_pretrain.py][INFO] Epoch:[0/2](439600/4588595) loss:3.071 lr:0.0000100 epoch_Time:26314.0min: [2024-01-04 16:07:45,401][model8_pretrain.py][INFO] Epoch:[0/2](439600/4588595) loss:2.721 lr:0.0000100 epoch_Time:26314.0min: [2024-01-04 16:07:45,401][model8_pretrain.py][INFO] Epoch:[0/2](439600/4588595) loss:2.851 lr:0.0000100 epoch_Time:26314.0min: [2024-01-04 16:07:45,401][model8_pretrain.py][INFO] Epoch:[0/2](439600/4588595) loss:2.514 lr:0.0000100 epoch_Time:26314.0min: [2024-01-04 16:07:45,401][model8_pretrain.py][INFO] Epoch:[0/2](439600/4588595) loss:3.252 lr:0.0000100 epoch_Time:26314.0min: [2024-01-04 16:07:45,401][model8_pretrain.py][INFO] 
Epoch:[0/2](439600/4588595) loss:2.661 lr:0.0000100 epoch_Time:26314.0min: [2024-01-04 16:07:45,401][model8_pretrain.py][INFO] Epoch:[0/2](439600/4588595) loss:3.316 lr:0.0000100 epoch_Time:26314.0min: [2024-01-04 16:07:45,401][model8_pretrain.py][INFO] Epoch:[0/2](439600/4588595) loss:2.959 lr:0.0000100 epoch_Time:26314.0min: [2024-01-04 16:08:22,332][model8_pretrain.py][INFO] Epoch:[0/2](439700/4588595) loss:2.980 lr:0.0000100 epoch_Time:26312.0min: [2024-01-04 16:08:22,332][model8_pretrain.py][INFO] Epoch:[0/2](439700/4588595) loss:2.680 lr:0.0000100 epoch_Time:26312.0min: [2024-01-04 16:08:22,332][model8_pretrain.py][INFO] Epoch:[0/2](439700/4588595) loss:3.666 lr:0.0000100 epoch_Time:26312.0min: [2024-01-04 16:08:22,332][model8_pretrain.py][INFO] Epoch:[0/2](439700/4588595) loss:3.069 lr:0.0000100 epoch_Time:26312.0min: [2024-01-04 16:08:22,332][model8_pretrain.py][INFO] Epoch:[0/2](439700/4588595) loss:3.287 lr:0.0000100 epoch_Time:26312.0min: [2024-01-04 16:08:22,332][model8_pretrain.py][INFO] Epoch:[0/2](439700/4588595) loss:3.161 lr:0.0000100 epoch_Time:26312.0min: [2024-01-04 16:08:22,332][model8_pretrain.py][INFO] Epoch:[0/2](439700/4588595) loss:3.012 lr:0.0000100 epoch_Time:26312.0min: [2024-01-04 16:08:22,332][model8_pretrain.py][INFO] Epoch:[0/2](439700/4588595) loss:3.287 lr:0.0000100 epoch_Time:26312.0min: [2024-01-04 16:08:59,265][model8_pretrain.py][INFO] Epoch:[0/2](439800/4588595) loss:2.814 lr:0.0000100 epoch_Time:26311.0min: [2024-01-04 16:08:59,265][model8_pretrain.py][INFO] Epoch:[0/2](439800/4588595) loss:2.744 lr:0.0000100 epoch_Time:26311.0min: [2024-01-04 16:08:59,265][model8_pretrain.py][INFO] Epoch:[0/2](439800/4588595) loss:2.622 lr:0.0000100 epoch_Time:26311.0min: [2024-01-04 16:08:59,265][model8_pretrain.py][INFO] Epoch:[0/2](439800/4588595) loss:2.502 lr:0.0000100 epoch_Time:26311.0min: [2024-01-04 16:08:59,265][model8_pretrain.py][INFO] Epoch:[0/2](439800/4588595) loss:2.593 lr:0.0000100 epoch_Time:26311.0min: [2024-01-04 
16:08:59,266][model8_pretrain.py][INFO] Epoch:[0/2](439800/4588595) loss:3.097 lr:0.0000100 epoch_Time:26311.0min: [2024-01-04 16:08:59,265][model8_pretrain.py][INFO] Epoch:[0/2](439800/4588595) loss:2.886 lr:0.0000100 epoch_Time:26311.0min: [2024-01-04 16:08:59,266][model8_pretrain.py][INFO] Epoch:[0/2](439800/4588595) loss:2.409 lr:0.0000100 epoch_Time:26311.0min: [2024-01-04 16:09:36,190][model8_pretrain.py][INFO] Epoch:[0/2](439900/4588595) loss:2.985 lr:0.0000100 epoch_Time:26311.0min: [2024-01-04 16:09:36,190][model8_pretrain.py][INFO] Epoch:[0/2](439900/4588595) loss:2.940 lr:0.0000100 epoch_Time:26311.0min: [2024-01-04 16:09:36,190][model8_pretrain.py][INFO] Epoch:[0/2](439900/4588595) loss:3.312 lr:0.0000100 epoch_Time:26311.0min: [2024-01-04 16:09:36,190][model8_pretrain.py][INFO] Epoch:[0/2](439900/4588595) loss:2.913 lr:0.0000100 epoch_Time:26311.0min: [2024-01-04 16:09:36,190][model8_pretrain.py][INFO] Epoch:[0/2](439900/4588595) loss:2.674 lr:0.0000100 epoch_Time:26311.0min: [2024-01-04 16:09:36,190][model8_pretrain.py][INFO] Epoch:[0/2](439900/4588595) loss:1.796 lr:0.0000100 epoch_Time:26311.0min: [2024-01-04 16:09:36,190][model8_pretrain.py][INFO] Epoch:[0/2](439900/4588595) loss:3.049 lr:0.0000100 epoch_Time:26311.0min: [2024-01-04 16:09:36,191][model8_pretrain.py][INFO] Epoch:[0/2](439900/4588595) loss:3.231 lr:0.0000100 epoch_Time:26311.0min: [2024-01-04 16:10:13,124][model8_pretrain.py][INFO] Epoch:[0/2](440000/4588595) loss:3.030 lr:0.0000100 epoch_Time:26310.0min: [2024-01-04 16:10:13,124][model8_pretrain.py][INFO] Epoch:[0/2](440000/4588595) loss:3.003 lr:0.0000100 epoch_Time:26310.0min: [2024-01-04 16:10:13,124][model8_pretrain.py][INFO] Epoch:[0/2](440000/4588595) loss:2.756 lr:0.0000100 epoch_Time:26310.0min: [2024-01-04 16:10:13,124][model8_pretrain.py][INFO] Epoch:[0/2](440000/4588595) loss:3.312 lr:0.0000100 epoch_Time:26310.0min: [2024-01-04 16:10:13,124][model8_pretrain.py][INFO] Epoch:[0/2](440000/4588595) loss:2.718 lr:0.0000100 
epoch_Time:26310.0min: [2024-01-04 16:10:13,124][model8_pretrain.py][INFO] Epoch:[0/2](440000/4588595) loss:3.244 lr:0.0000100 epoch_Time:26310.0min: [2024-01-04 16:10:13,124][model8_pretrain.py][INFO] Epoch:[0/2](440000/4588595) loss:2.345 lr:0.0000100 epoch_Time:26310.0min: [2024-01-04 16:10:13,124][model8_pretrain.py][INFO] Epoch:[0/2](440000/4588595) loss:2.728 lr:0.0000100 epoch_Time:26310.0min: [2024-01-04 16:10:50,072][model8_pretrain.py][INFO] Epoch:[0/2](440100/4588595) loss:3.191 lr:0.0000100 epoch_Time:26309.0min: [2024-01-04 16:10:50,072][model8_pretrain.py][INFO] Epoch:[0/2](440100/4588595) loss:2.673 lr:0.0000100 epoch_Time:26309.0min: [2024-01-04 16:10:50,072][model8_pretrain.py][INFO] Epoch:[0/2](440100/4588595) loss:2.919 lr:0.0000100 epoch_Time:26309.0min: [2024-01-04 16:10:50,072][model8_pretrain.py][INFO] Epoch:[0/2](440100/4588595) loss:2.710 lr:0.0000100 epoch_Time:26309.0min: [2024-01-04 16:10:50,072][model8_pretrain.py][INFO] Epoch:[0/2](440100/4588595) loss:3.021 lr:0.0000100 epoch_Time:26309.0min: [2024-01-04 16:10:50,072][model8_pretrain.py][INFO] Epoch:[0/2](440100/4588595) loss:2.712 lr:0.0000100 epoch_Time:26309.0min: [2024-01-04 16:10:50,072][model8_pretrain.py][INFO] Epoch:[0/2](440100/4588595) loss:3.011 lr:0.0000100 epoch_Time:26309.0min: [2024-01-04 16:10:50,072][model8_pretrain.py][INFO] Epoch:[0/2](440100/4588595) loss:2.843 lr:0.0000100 epoch_Time:26309.0min: [2024-01-04 16:11:37,142][model8_pretrain.py][INFO] Epoch:[0/2](440200/4588595) loss:2.182 lr:0.0000100 epoch_Time:26310.0min: [2024-01-04 16:11:37,142][model8_pretrain.py][INFO] Epoch:[0/2](440200/4588595) loss:3.039 lr:0.0000100 epoch_Time:26310.0min: [2024-01-04 16:11:37,142][model8_pretrain.py][INFO] Epoch:[0/2](440200/4588595) loss:2.491 lr:0.0000100 epoch_Time:26310.0min: [2024-01-04 16:11:37,142][model8_pretrain.py][INFO] Epoch:[0/2](440200/4588595) loss:3.113 lr:0.0000100 epoch_Time:26310.0min: [2024-01-04 16:11:37,142][model8_pretrain.py][INFO] 
Epoch:[0/2](440200/4588595) loss:3.262 lr:0.0000100 epoch_Time:26310.0min: [2024-01-04 16:11:37,142][model8_pretrain.py][INFO] Epoch:[0/2](440200/4588595) loss:2.917 lr:0.0000100 epoch_Time:26310.0min: [2024-01-04 16:11:37,142][model8_pretrain.py][INFO] Epoch:[0/2](440200/4588595) loss:2.343 lr:0.0000100 epoch_Time:26310.0min: [2024-01-04 16:11:37,142][model8_pretrain.py][INFO] Epoch:[0/2](440200/4588595) loss:2.889 lr:0.0000100 epoch_Time:26310.0min: [2024-01-04 16:12:14,075][model8_pretrain.py][INFO] Epoch:[0/2](440300/4588595) loss:3.291 lr:0.0000100 epoch_Time:26309.0min: [2024-01-04 16:12:14,075][model8_pretrain.py][INFO] Epoch:[0/2](440300/4588595) loss:3.233 lr:0.0000100 epoch_Time:26309.0min: [2024-01-04 16:12:14,075][model8_pretrain.py][INFO] Epoch:[0/2](440300/4588595) loss:3.014 lr:0.0000100 epoch_Time:26309.0min: [2024-01-04 16:12:14,075][model8_pretrain.py][INFO] Epoch:[0/2](440300/4588595) loss:3.049 lr:0.0000100 epoch_Time:26309.0min: [2024-01-04 16:12:14,075][model8_pretrain.py][INFO] Epoch:[0/2](440300/4588595) loss:2.695 lr:0.0000100 epoch_Time:26309.0min: [2024-01-04 16:12:14,075][model8_pretrain.py][INFO] Epoch:[0/2](440300/4588595) loss:2.671 lr:0.0000100 epoch_Time:26309.0min: [2024-01-04 16:12:14,075][model8_pretrain.py][INFO] Epoch:[0/2](440300/4588595) loss:3.074 lr:0.0000100 epoch_Time:26309.0min: [2024-01-04 16:12:14,075][model8_pretrain.py][INFO] Epoch:[0/2](440300/4588595) loss:2.898 lr:0.0000100 epoch_Time:26309.0min: [2024-01-04 16:12:51,002][model8_pretrain.py][INFO] Epoch:[0/2](440400/4588595) loss:2.865 lr:0.0000100 epoch_Time:26308.0min: [2024-01-04 16:12:51,002][model8_pretrain.py][INFO] Epoch:[0/2](440400/4588595) loss:2.831 lr:0.0000100 epoch_Time:26308.0min: [2024-01-04 16:12:51,002][model8_pretrain.py][INFO] Epoch:[0/2](440400/4588595) loss:3.264 lr:0.0000100 epoch_Time:26308.0min: [2024-01-04 16:12:51,002][model8_pretrain.py][INFO] Epoch:[0/2](440400/4588595) loss:2.818 lr:0.0000100 epoch_Time:26308.0min: [2024-01-04 
16:12:51,002][model8_pretrain.py][INFO] Epoch:[0/2](440400/4588595) loss:2.647 lr:0.0000100 epoch_Time:26308.0min: [2024-01-04 16:12:51,002][model8_pretrain.py][INFO] Epoch:[0/2](440400/4588595) loss:2.621 lr:0.0000100 epoch_Time:26308.0min: [2024-01-04 16:12:51,002][model8_pretrain.py][INFO] Epoch:[0/2](440400/4588595) loss:2.433 lr:0.0000100 epoch_Time:26308.0min: [2024-01-04 16:12:51,003][model8_pretrain.py][INFO] Epoch:[0/2](440400/4588595) loss:3.001 lr:0.0000100 epoch_Time:26308.0min: [2024-01-04 16:13:27,929][model8_pretrain.py][INFO] Epoch:[0/2](440500/4588595) loss:2.927 lr:0.0000100 epoch_Time:26308.0min: [2024-01-04 16:13:27,929][model8_pretrain.py][INFO] Epoch:[0/2](440500/4588595) loss:3.387 lr:0.0000100 epoch_Time:26308.0min: [2024-01-04 16:13:27,929][model8_pretrain.py][INFO] Epoch:[0/2](440500/4588595) loss:3.112 lr:0.0000100 epoch_Time:26308.0min: [2024-01-04 16:13:27,929][model8_pretrain.py][INFO] Epoch:[0/2](440500/4588595) loss:2.895 lr:0.0000100 epoch_Time:26308.0min: [2024-01-04 16:13:27,929][model8_pretrain.py][INFO] Epoch:[0/2](440500/4588595) loss:3.401 lr:0.0000100 epoch_Time:26308.0min: [2024-01-04 16:13:27,929][model8_pretrain.py][INFO] Epoch:[0/2](440500/4588595) loss:2.275 lr:0.0000100 epoch_Time:26308.0min: [2024-01-04 16:13:27,929][model8_pretrain.py][INFO] Epoch:[0/2](440500/4588595) loss:3.165 lr:0.0000100 epoch_Time:26308.0min: [2024-01-04 16:13:27,929][model8_pretrain.py][INFO] Epoch:[0/2](440500/4588595) loss:2.791 lr:0.0000100 epoch_Time:26308.0min: [2024-01-04 16:14:04,878][model8_pretrain.py][INFO] Epoch:[0/2](440600/4588595) loss:2.599 lr:0.0000100 epoch_Time:26306.0min: [2024-01-04 16:14:04,878][model8_pretrain.py][INFO] Epoch:[0/2](440600/4588595) loss:3.465 lr:0.0000100 epoch_Time:26306.0min: [2024-01-04 16:14:04,878][model8_pretrain.py][INFO] Epoch:[0/2](440600/4588595) loss:2.714 lr:0.0000100 epoch_Time:26306.0min: [2024-01-04 16:14:04,877][model8_pretrain.py][INFO] Epoch:[0/2](440600/4588595) loss:2.434 lr:0.0000100 
epoch_Time:26306.0min: [2024-01-04 16:14:04,878][model8_pretrain.py][INFO] Epoch:[0/2](440600/4588595) loss:2.582 lr:0.0000100 epoch_Time:26306.0min: [2024-01-04 16:14:04,878][model8_pretrain.py][INFO] Epoch:[0/2](440600/4588595) loss:3.029 lr:0.0000100 epoch_Time:26306.0min: [2024-01-04 16:14:04,878][model8_pretrain.py][INFO] Epoch:[0/2](440600/4588595) loss:2.829 lr:0.0000100 epoch_Time:26306.0min: [2024-01-04 16:14:04,878][model8_pretrain.py][INFO] Epoch:[0/2](440600/4588595) loss:2.085 lr:0.0000100 epoch_Time:26306.0min: [2024-01-04 16:14:41,817][model8_pretrain.py][INFO] Epoch:[0/2](440700/4588595) loss:2.721 lr:0.0000100 epoch_Time:26306.0min: [2024-01-04 16:14:41,817][model8_pretrain.py][INFO] Epoch:[0/2](440700/4588595) loss:2.738 lr:0.0000100 epoch_Time:26306.0min: [2024-01-04 16:14:41,817][model8_pretrain.py][INFO] Epoch:[0/2](440700/4588595) loss:2.874 lr:0.0000100 epoch_Time:26306.0min: [2024-01-04 16:14:41,817][model8_pretrain.py][INFO] Epoch:[0/2](440700/4588595) loss:2.844 lr:0.0000100 epoch_Time:26306.0min: [2024-01-04 16:14:41,817][model8_pretrain.py][INFO] Epoch:[0/2](440700/4588595) loss:2.880 lr:0.0000100 epoch_Time:26306.0min: [2024-01-04 16:14:41,817][model8_pretrain.py][INFO] Epoch:[0/2](440700/4588595) loss:2.954 lr:0.0000100 epoch_Time:26306.0min: [2024-01-04 16:14:41,817][model8_pretrain.py][INFO] Epoch:[0/2](440700/4588595) loss:3.134 lr:0.0000100 epoch_Time:26306.0min: [2024-01-04 16:14:41,817][model8_pretrain.py][INFO] Epoch:[0/2](440700/4588595) loss:1.823 lr:0.0000100 epoch_Time:26306.0min: [2024-01-04 16:15:18,758][model8_pretrain.py][INFO] Epoch:[0/2](440800/4588595) loss:3.044 lr:0.0000100 epoch_Time:26305.0min: [2024-01-04 16:15:18,758][model8_pretrain.py][INFO] Epoch:[0/2](440800/4588595) loss:2.750 lr:0.0000100 epoch_Time:26305.0min: [2024-01-04 16:15:18,758][model8_pretrain.py][INFO] Epoch:[0/2](440800/4588595) loss:3.108 lr:0.0000100 epoch_Time:26305.0min: [2024-01-04 16:15:18,758][model8_pretrain.py][INFO] 
Epoch:[0/2](440800/4588595) loss:2.668 lr:0.0000100 epoch_Time:26305.0min: [2024-01-04 16:15:18,758][model8_pretrain.py][INFO] Epoch:[0/2](440800/4588595) loss:3.207 lr:0.0000100 epoch_Time:26305.0min: [2024-01-04 16:15:18,758][model8_pretrain.py][INFO] Epoch:[0/2](440800/4588595) loss:2.872 lr:0.0000100 epoch_Time:26305.0min: [2024-01-04 16:15:18,759][model8_pretrain.py][INFO] Epoch:[0/2](440800/4588595) loss:2.527 lr:0.0000100 epoch_Time:26305.0min: [2024-01-04 16:15:18,759][model8_pretrain.py][INFO] Epoch:[0/2](440800/4588595) loss:2.562 lr:0.0000100 epoch_Time:26305.0min: [2024-01-04 16:15:55,693][model8_pretrain.py][INFO] Epoch:[0/2](440900/4588595) loss:2.375 lr:0.0000100 epoch_Time:26304.0min: [2024-01-04 16:15:55,694][model8_pretrain.py][INFO] Epoch:[0/2](440900/4588595) loss:2.620 lr:0.0000100 epoch_Time:26304.0min: [2024-01-04 16:15:55,694][model8_pretrain.py][INFO] Epoch:[0/2](440900/4588595) loss:2.593 lr:0.0000100 epoch_Time:26304.0min: [2024-01-04 16:15:55,694][model8_pretrain.py][INFO] Epoch:[0/2](440900/4588595) loss:2.774 lr:0.0000100 epoch_Time:26304.0min: [2024-01-04 16:15:55,694][model8_pretrain.py][INFO] Epoch:[0/2](440900/4588595) loss:3.150 lr:0.0000100 epoch_Time:26304.0min: [2024-01-04 16:15:55,694][model8_pretrain.py][INFO] Epoch:[0/2](440900/4588595) loss:2.552 lr:0.0000100 epoch_Time:26304.0min: [2024-01-04 16:15:55,694][model8_pretrain.py][INFO] Epoch:[0/2](440900/4588595) loss:2.793 lr:0.0000100 epoch_Time:26304.0min: [2024-01-04 16:15:55,694][model8_pretrain.py][INFO] Epoch:[0/2](440900/4588595) loss:2.276 lr:0.0000100 epoch_Time:26304.0min: [2024-01-04 16:16:43,007][model8_pretrain.py][INFO] Epoch:[0/2](441000/4588595) loss:3.069 lr:0.0000100 epoch_Time:26305.0min: [2024-01-04 16:16:43,007][model8_pretrain.py][INFO] Epoch:[0/2](441000/4588595) loss:2.606 lr:0.0000100 epoch_Time:26305.0min: [2024-01-04 16:16:43,007][model8_pretrain.py][INFO] Epoch:[0/2](441000/4588595) loss:3.040 lr:0.0000100 epoch_Time:26305.0min: [2024-01-04 
16:16:43,007][model8_pretrain.py][INFO] Epoch:[0/2](441000/4588595) loss:3.336 lr:0.0000100 epoch_Time:26305.0min: [2024-01-04 16:16:43,007][model8_pretrain.py][INFO] Epoch:[0/2](441000/4588595) loss:3.004 lr:0.0000100 epoch_Time:26305.0min: [2024-01-04 16:16:43,007][model8_pretrain.py][INFO] Epoch:[0/2](441000/4588595) loss:3.098 lr:0.0000100 epoch_Time:26305.0min: [2024-01-04 16:16:43,007][model8_pretrain.py][INFO] Epoch:[0/2](441000/4588595) loss:3.277 lr:0.0000100 epoch_Time:26305.0min: [2024-01-04 16:16:43,007][model8_pretrain.py][INFO] Epoch:[0/2](441000/4588595) loss:2.235 lr:0.0000100 epoch_Time:26305.0min: [2024-01-04 16:17:19,942][model8_pretrain.py][INFO] Epoch:[0/2](441100/4588595) loss:2.609 lr:0.0000100 epoch_Time:26304.0min: [2024-01-04 16:17:19,942][model8_pretrain.py][INFO] Epoch:[0/2](441100/4588595) loss:2.881 lr:0.0000100 epoch_Time:26304.0min: [2024-01-04 16:17:19,942][model8_pretrain.py][INFO] Epoch:[0/2](441100/4588595) loss:2.134 lr:0.0000100 epoch_Time:26304.0min: [2024-01-04 16:17:19,942][model8_pretrain.py][INFO] Epoch:[0/2](441100/4588595) loss:2.798 lr:0.0000100 epoch_Time:26304.0min: [2024-01-04 16:17:19,942][model8_pretrain.py][INFO] Epoch:[0/2](441100/4588595) loss:2.618 lr:0.0000100 epoch_Time:26304.0min: [2024-01-04 16:17:19,942][model8_pretrain.py][INFO] Epoch:[0/2](441100/4588595) loss:2.709 lr:0.0000100 epoch_Time:26304.0min: [2024-01-04 16:17:19,942][model8_pretrain.py][INFO] Epoch:[0/2](441100/4588595) loss:1.990 lr:0.0000100 epoch_Time:26304.0min: [2024-01-04 16:17:19,942][model8_pretrain.py][INFO] Epoch:[0/2](441100/4588595) loss:3.218 lr:0.0000100 epoch_Time:26304.0min: [2024-01-04 16:17:56,890][model8_pretrain.py][INFO] Epoch:[0/2](441200/4588595) loss:2.303 lr:0.0000100 epoch_Time:26303.0min: [2024-01-04 16:17:56,890][model8_pretrain.py][INFO] Epoch:[0/2](441200/4588595) loss:2.966 lr:0.0000100 epoch_Time:26303.0min: [2024-01-04 16:17:56,890][model8_pretrain.py][INFO] Epoch:[0/2](441200/4588595) loss:2.529 lr:0.0000100 
epoch_Time:26303.0min: [2024-01-04 16:17:56,890][model8_pretrain.py][INFO] Epoch:[0/2](441200/4588595) loss:2.767 lr:0.0000100 epoch_Time:26303.0min: [2024-01-04 16:17:56,890][model8_pretrain.py][INFO] Epoch:[0/2](441200/4588595) loss:3.061 lr:0.0000100 epoch_Time:26303.0min: [2024-01-04 16:17:56,890][model8_pretrain.py][INFO] Epoch:[0/2](441200/4588595) loss:3.079 lr:0.0000100 epoch_Time:26303.0min: [2024-01-04 16:17:56,890][model8_pretrain.py][INFO] Epoch:[0/2](441200/4588595) loss:1.626 lr:0.0000100 epoch_Time:26303.0min: [2024-01-04 16:17:56,890][model8_pretrain.py][INFO] Epoch:[0/2](441200/4588595) loss:2.623 lr:0.0000100 epoch_Time:26303.0min: [2024-01-04 16:18:33,837][model8_pretrain.py][INFO] Epoch:[0/2](441300/4588595) loss:3.121 lr:0.0000100 epoch_Time:26303.0min: [2024-01-04 16:18:33,837][model8_pretrain.py][INFO] Epoch:[0/2](441300/4588595) loss:3.130 lr:0.0000100 epoch_Time:26303.0min: [2024-01-04 16:18:33,837][model8_pretrain.py][INFO] Epoch:[0/2](441300/4588595) loss:2.599 lr:0.0000100 epoch_Time:26303.0min: [2024-01-04 16:18:33,837][model8_pretrain.py][INFO] Epoch:[0/2](441300/4588595) loss:2.528 lr:0.0000100 epoch_Time:26303.0min: [2024-01-04 16:18:33,837][model8_pretrain.py][INFO] Epoch:[0/2](441300/4588595) loss:2.906 lr:0.0000100 epoch_Time:26303.0min: [2024-01-04 16:18:33,838][model8_pretrain.py][INFO] Epoch:[0/2](441300/4588595) loss:2.887 lr:0.0000100 epoch_Time:26303.0min: [2024-01-04 16:18:33,838][model8_pretrain.py][INFO] Epoch:[0/2](441300/4588595) loss:2.788 lr:0.0000100 epoch_Time:26303.0min: [2024-01-04 16:18:33,838][model8_pretrain.py][INFO] Epoch:[0/2](441300/4588595) loss:3.254 lr:0.0000100 epoch_Time:26303.0min: [2024-01-04 16:19:10,776][model8_pretrain.py][INFO] Epoch:[0/2](441400/4588595) loss:3.388 lr:0.0000100 epoch_Time:26302.0min: [2024-01-04 16:19:10,776][model8_pretrain.py][INFO] Epoch:[0/2](441400/4588595) loss:2.561 lr:0.0000100 epoch_Time:26302.0min: [2024-01-04 16:19:10,776][model8_pretrain.py][INFO] 
Epoch:[0/2](441400/4588595) loss:2.978 lr:0.0000100 epoch_Time:26302.0min: [2024-01-04 16:19:10,776][model8_pretrain.py][INFO] Epoch:[0/2](441400/4588595) loss:2.986 lr:0.0000100 epoch_Time:26302.0min: [2024-01-04 16:19:10,776][model8_pretrain.py][INFO] Epoch:[0/2](441400/4588595) loss:2.278 lr:0.0000100 epoch_Time:26302.0min: [2024-01-04 16:19:10,776][model8_pretrain.py][INFO] Epoch:[0/2](441400/4588595) loss:2.521 lr:0.0000100 epoch_Time:26302.0min: [2024-01-04 16:19:10,776][model8_pretrain.py][INFO] Epoch:[0/2](441400/4588595) loss:2.476 lr:0.0000100 epoch_Time:26302.0min: [2024-01-04 16:19:10,777][model8_pretrain.py][INFO] Epoch:[0/2](441400/4588595) loss:2.697 lr:0.0000100 epoch_Time:26302.0min: [2024-01-04 16:19:47,731][model8_pretrain.py][INFO] Epoch:[0/2](441500/4588595) loss:2.957 lr:0.0000100 epoch_Time:26300.0min: [2024-01-04 16:19:47,731][model8_pretrain.py][INFO] Epoch:[0/2](441500/4588595) loss:2.891 lr:0.0000100 epoch_Time:26300.0min: [2024-01-04 16:19:47,731][model8_pretrain.py][INFO] Epoch:[0/2](441500/4588595) loss:2.618 lr:0.0000100 epoch_Time:26300.0min: [2024-01-04 16:19:47,731][model8_pretrain.py][INFO] Epoch:[0/2](441500/4588595) loss:2.920 lr:0.0000100 epoch_Time:26300.0min: [2024-01-04 16:19:47,731][model8_pretrain.py][INFO] Epoch:[0/2](441500/4588595) loss:2.889 lr:0.0000100 epoch_Time:26300.0min: [2024-01-04 16:19:47,731][model8_pretrain.py][INFO] Epoch:[0/2](441500/4588595) loss:2.877 lr:0.0000100 epoch_Time:26300.0min: [2024-01-04 16:19:47,731][model8_pretrain.py][INFO] Epoch:[0/2](441500/4588595) loss:2.734 lr:0.0000100 epoch_Time:26300.0min: [2024-01-04 16:19:47,731][model8_pretrain.py][INFO] Epoch:[0/2](441500/4588595) loss:2.915 lr:0.0000100 epoch_Time:26300.0min: [2024-01-04 16:20:24,682][model8_pretrain.py][INFO] Epoch:[0/2](441600/4588595) loss:2.945 lr:0.0000100 epoch_Time:26300.0min: [2024-01-04 16:20:24,682][model8_pretrain.py][INFO] Epoch:[0/2](441600/4588595) loss:2.286 lr:0.0000100 epoch_Time:26300.0min: [2024-01-04 
16:20:24,682][model8_pretrain.py][INFO] Epoch:[0/2](441600/4588595) loss:3.193 lr:0.0000100 epoch_Time:26300.0min: [2024-01-04 16:20:24,682][model8_pretrain.py][INFO] Epoch:[0/2](441600/4588595) loss:2.737 lr:0.0000100 epoch_Time:26300.0min: [2024-01-04 16:20:24,682][model8_pretrain.py][INFO] Epoch:[0/2](441600/4588595) loss:2.758 lr:0.0000100 epoch_Time:26300.0min: [2024-01-04 16:20:24,682][model8_pretrain.py][INFO] Epoch:[0/2](441600/4588595) loss:2.937 lr:0.0000100 epoch_Time:26300.0min: [2024-01-04 16:20:24,682][model8_pretrain.py][INFO] Epoch:[0/2](441600/4588595) loss:2.616 lr:0.0000100 epoch_Time:26300.0min: [2024-01-04 16:20:24,682][model8_pretrain.py][INFO] Epoch:[0/2](441600/4588595) loss:3.040 lr:0.0000100 epoch_Time:26300.0min: [2024-01-04 16:21:01,645][model8_pretrain.py][INFO] Epoch:[0/2](441700/4588595) loss:2.663 lr:0.0000100 epoch_Time:26299.0min: [2024-01-04 16:21:01,645][model8_pretrain.py][INFO] Epoch:[0/2](441700/4588595) loss:3.277 lr:0.0000100 epoch_Time:26299.0min: [2024-01-04 16:21:01,645][model8_pretrain.py][INFO] Epoch:[0/2](441700/4588595) loss:2.994 lr:0.0000100 epoch_Time:26299.0min: [2024-01-04 16:21:01,645][model8_pretrain.py][INFO] Epoch:[0/2](441700/4588595) loss:2.928 lr:0.0000100 epoch_Time:26299.0min: [2024-01-04 16:21:01,645][model8_pretrain.py][INFO] Epoch:[0/2](441700/4588595) loss:3.421 lr:0.0000100 epoch_Time:26299.0min: [2024-01-04 16:21:01,645][model8_pretrain.py][INFO] Epoch:[0/2](441700/4588595) loss:3.040 lr:0.0000100 epoch_Time:26299.0min: [2024-01-04 16:21:01,645][model8_pretrain.py][INFO] Epoch:[0/2](441700/4588595) loss:2.615 lr:0.0000100 epoch_Time:26299.0min: [2024-01-04 16:21:01,645][model8_pretrain.py][INFO] Epoch:[0/2](441700/4588595) loss:2.524 lr:0.0000100 epoch_Time:26299.0min: [2024-01-04 16:21:48,902][model8_pretrain.py][INFO] Epoch:[0/2](441800/4588595) loss:2.776 lr:0.0000100 epoch_Time:26300.0min: [2024-01-04 16:21:48,902][model8_pretrain.py][INFO] Epoch:[0/2](441800/4588595) loss:2.689 lr:0.0000100 
epoch_Time:26300.0min: [2024-01-04 16:21:48,902][model8_pretrain.py][INFO] Epoch:[0/2](441800/4588595) loss:3.205 lr:0.0000100 epoch_Time:26300.0min: [2024-01-04 16:21:48,902][model8_pretrain.py][INFO] Epoch:[0/2](441800/4588595) loss:3.063 lr:0.0000100 epoch_Time:26300.0min: [2024-01-04 16:21:48,902][model8_pretrain.py][INFO] Epoch:[0/2](441800/4588595) loss:3.146 lr:0.0000100 epoch_Time:26300.0min: [2024-01-04 16:21:48,902][model8_pretrain.py][INFO] Epoch:[0/2](441800/4588595) loss:3.087 lr:0.0000100 epoch_Time:26300.0min: [2024-01-04 16:21:48,902][model8_pretrain.py][INFO] Epoch:[0/2](441800/4588595) loss:2.573 lr:0.0000100 epoch_Time:26300.0min: [2024-01-04 16:21:48,902][model8_pretrain.py][INFO] Epoch:[0/2](441800/4588595) loss:3.303 lr:0.0000100 epoch_Time:26300.0min: [2024-01-04 16:22:25,832][model8_pretrain.py][INFO] Epoch:[0/2](441900/4588595) loss:2.275 lr:0.0000100 epoch_Time:26299.0min: [2024-01-04 16:22:25,832][model8_pretrain.py][INFO] Epoch:[0/2](441900/4588595) loss:2.404 lr:0.0000100 epoch_Time:26299.0min: [2024-01-04 16:22:25,832][model8_pretrain.py][INFO] Epoch:[0/2](441900/4588595) loss:2.833 lr:0.0000100 epoch_Time:26299.0min: [2024-01-04 16:22:25,832][model8_pretrain.py][INFO] Epoch:[0/2](441900/4588595) loss:2.988 lr:0.0000100 epoch_Time:26299.0min: [2024-01-04 16:22:25,832][model8_pretrain.py][INFO] Epoch:[0/2](441900/4588595) loss:3.027 lr:0.0000100 epoch_Time:26299.0min: [2024-01-04 16:22:25,832][model8_pretrain.py][INFO] Epoch:[0/2](441900/4588595) loss:3.244 lr:0.0000100 epoch_Time:26299.0min: [2024-01-04 16:22:25,832][model8_pretrain.py][INFO] Epoch:[0/2](441900/4588595) loss:3.130 lr:0.0000100 epoch_Time:26299.0min: [2024-01-04 16:22:25,832][model8_pretrain.py][INFO] Epoch:[0/2](441900/4588595) loss:2.570 lr:0.0000100 epoch_Time:26299.0min: [2024-01-04 16:23:02,768][model8_pretrain.py][INFO] Epoch:[0/2](442000/4588595) loss:2.982 lr:0.0000100 epoch_Time:26298.0min: [2024-01-04 16:23:02,768][model8_pretrain.py][INFO] 
Epoch:[0/2](442000/4588595) loss:2.613 lr:0.0000100 epoch_Time:26298.0min: [2024-01-04 16:23:02,768][model8_pretrain.py][INFO] Epoch:[0/2](442000/4588595) loss:3.087 lr:0.0000100 epoch_Time:26298.0min: [2024-01-04 16:23:02,768][model8_pretrain.py][INFO] Epoch:[0/2](442000/4588595) loss:3.111 lr:0.0000100 epoch_Time:26298.0min: [2024-01-04 16:23:02,768][model8_pretrain.py][INFO] Epoch:[0/2](442000/4588595) loss:3.303 lr:0.0000100 epoch_Time:26298.0min: [2024-01-04 16:23:02,768][model8_pretrain.py][INFO] Epoch:[0/2](442000/4588595) loss:2.409 lr:0.0000100 epoch_Time:26298.0min: [2024-01-04 16:23:02,768][model8_pretrain.py][INFO] Epoch:[0/2](442000/4588595) loss:3.510 lr:0.0000100 epoch_Time:26298.0min: [2024-01-04 16:23:02,768][model8_pretrain.py][INFO] Epoch:[0/2](442000/4588595) loss:1.952 lr:0.0000100 epoch_Time:26298.0min: [2024-01-04 16:23:39,693][model8_pretrain.py][INFO] Epoch:[0/2](442100/4588595) loss:2.802 lr:0.0000100 epoch_Time:26298.0min: [2024-01-04 16:23:39,693][model8_pretrain.py][INFO] Epoch:[0/2](442100/4588595) loss:3.036 lr:0.0000100 epoch_Time:26298.0min: [2024-01-04 16:23:39,693][model8_pretrain.py][INFO] Epoch:[0/2](442100/4588595) loss:2.988 lr:0.0000100 epoch_Time:26298.0min: [2024-01-04 16:23:39,693][model8_pretrain.py][INFO] Epoch:[0/2](442100/4588595) loss:2.904 lr:0.0000100 epoch_Time:26298.0min: [2024-01-04 16:23:39,693][model8_pretrain.py][INFO] Epoch:[0/2](442100/4588595) loss:3.061 lr:0.0000100 epoch_Time:26298.0min: [2024-01-04 16:23:39,693][model8_pretrain.py][INFO] Epoch:[0/2](442100/4588595) loss:3.286 lr:0.0000100 epoch_Time:26298.0min: [2024-01-04 16:23:39,693][model8_pretrain.py][INFO] Epoch:[0/2](442100/4588595) loss:2.133 lr:0.0000100 epoch_Time:26298.0min: [2024-01-04 16:23:39,693][model8_pretrain.py][INFO] Epoch:[0/2](442100/4588595) loss:2.669 lr:0.0000100 epoch_Time:26298.0min: [2024-01-04 16:24:16,628][model8_pretrain.py][INFO] Epoch:[0/2](442200/4588595) loss:2.971 lr:0.0000100 epoch_Time:26297.0min: [2024-01-04 
16:24:16,628][model8_pretrain.py][INFO] Epoch:[0/2](442200/4588595) loss:3.126 lr:0.0000100 epoch_Time:26297.0min: [2024-01-04 16:24:16,628][model8_pretrain.py][INFO] Epoch:[0/2](442200/4588595) loss:2.925 lr:0.0000100 epoch_Time:26297.0min: [2024-01-04 16:24:16,628][model8_pretrain.py][INFO] Epoch:[0/2](442200/4588595) loss:3.400 lr:0.0000100 epoch_Time:26297.0min: [2024-01-04 16:24:16,628][model8_pretrain.py][INFO] Epoch:[0/2](442200/4588595) loss:2.773 lr:0.0000100 epoch_Time:26297.0min: [2024-01-04 16:24:16,628][model8_pretrain.py][INFO] Epoch:[0/2](442200/4588595) loss:3.455 lr:0.0000100 epoch_Time:26297.0min: [2024-01-04 16:24:16,628][model8_pretrain.py][INFO] Epoch:[0/2](442200/4588595) loss:2.990 lr:0.0000100 epoch_Time:26297.0min: [2024-01-04 16:24:16,628][model8_pretrain.py][INFO] Epoch:[0/2](442200/4588595) loss:3.109 lr:0.0000100 epoch_Time:26297.0min: [2024-01-04 16:24:53,562][model8_pretrain.py][INFO] Epoch:[0/2](442300/4588595) loss:2.729 lr:0.0000100 epoch_Time:26296.0min: [2024-01-04 16:24:53,562][model8_pretrain.py][INFO] Epoch:[0/2](442300/4588595) loss:3.024 lr:0.0000100 epoch_Time:26296.0min: [2024-01-04 16:24:53,562][model8_pretrain.py][INFO] Epoch:[0/2](442300/4588595) loss:2.874 lr:0.0000100 epoch_Time:26296.0min: [2024-01-04 16:24:53,562][model8_pretrain.py][INFO] Epoch:[0/2](442300/4588595) loss:2.174 lr:0.0000100 epoch_Time:26296.0min: [2024-01-04 16:24:53,562][model8_pretrain.py][INFO] Epoch:[0/2](442300/4588595) loss:2.680 lr:0.0000100 epoch_Time:26296.0min: [2024-01-04 16:24:53,562][model8_pretrain.py][INFO] Epoch:[0/2](442300/4588595) loss:1.967 lr:0.0000100 epoch_Time:26296.0min: [2024-01-04 16:24:53,562][model8_pretrain.py][INFO] Epoch:[0/2](442300/4588595) loss:2.883 lr:0.0000100 epoch_Time:26296.0min: [2024-01-04 16:24:53,562][model8_pretrain.py][INFO] Epoch:[0/2](442300/4588595) loss:3.025 lr:0.0000100 epoch_Time:26296.0min: [2024-01-04 16:25:30,499][model8_pretrain.py][INFO] Epoch:[0/2](442400/4588595) loss:3.330 lr:0.0000100 
epoch_Time:26295.0min: [2024-01-04 16:25:30,499][model8_pretrain.py][INFO] Epoch:[0/2](442400/4588595) loss:2.687 lr:0.0000100 epoch_Time:26295.0min: [2024-01-04 16:25:30,499][model8_pretrain.py][INFO] Epoch:[0/2](442400/4588595) loss:3.133 lr:0.0000100 epoch_Time:26295.0min: [2024-01-04 16:25:30,499][model8_pretrain.py][INFO] Epoch:[0/2](442400/4588595) loss:2.773 lr:0.0000100 epoch_Time:26295.0min: [2024-01-04 16:25:30,499][model8_pretrain.py][INFO] Epoch:[0/2](442400/4588595) loss:3.084 lr:0.0000100 epoch_Time:26295.0min: [2024-01-04 16:25:30,499][model8_pretrain.py][INFO] Epoch:[0/2](442400/4588595) loss:2.953 lr:0.0000100 epoch_Time:26295.0min: [2024-01-04 16:25:30,499][model8_pretrain.py][INFO] Epoch:[0/2](442400/4588595) loss:3.033 lr:0.0000100 epoch_Time:26295.0min: [2024-01-04 16:25:30,499][model8_pretrain.py][INFO] Epoch:[0/2](442400/4588595) loss:3.269 lr:0.0000100 epoch_Time:26295.0min: [2024-01-04 16:26:07,441][model8_pretrain.py][INFO] Epoch:[0/2](442500/4588595) loss:2.719 lr:0.0000100 epoch_Time:26294.0min: [2024-01-04 16:26:07,441][model8_pretrain.py][INFO] Epoch:[0/2](442500/4588595) loss:2.478 lr:0.0000100 epoch_Time:26294.0min: [2024-01-04 16:26:07,441][model8_pretrain.py][INFO] Epoch:[0/2](442500/4588595) loss:3.038 lr:0.0000100 epoch_Time:26294.0min: [2024-01-04 16:26:07,441][model8_pretrain.py][INFO] Epoch:[0/2](442500/4588595) loss:3.276 lr:0.0000100 epoch_Time:26294.0min: [2024-01-04 16:26:07,441][model8_pretrain.py][INFO] Epoch:[0/2](442500/4588595) loss:2.900 lr:0.0000100 epoch_Time:26294.0min: [2024-01-04 16:26:07,441][model8_pretrain.py][INFO] Epoch:[0/2](442500/4588595) loss:2.628 lr:0.0000100 epoch_Time:26294.0min: [2024-01-04 16:26:07,441][model8_pretrain.py][INFO] Epoch:[0/2](442500/4588595) loss:2.723 lr:0.0000100 epoch_Time:26294.0min: [2024-01-04 16:26:07,441][model8_pretrain.py][INFO] Epoch:[0/2](442500/4588595) loss:3.128 lr:0.0000100 epoch_Time:26294.0min: [2024-01-04 16:26:54,560][model8_pretrain.py][INFO] 
Epoch:[0/2](442600/4588595) loss:2.247 lr:0.0000100 epoch_Time:26295.0min: [2024-01-04 16:26:54,560][model8_pretrain.py][INFO] Epoch:[0/2](442600/4588595) loss:2.917 lr:0.0000100 epoch_Time:26295.0min: [2024-01-04 16:26:54,560][model8_pretrain.py][INFO] Epoch:[0/2](442600/4588595) loss:3.200 lr:0.0000100 epoch_Time:26295.0min: [2024-01-04 16:26:54,560][model8_pretrain.py][INFO] Epoch:[0/2](442600/4588595) loss:3.088 lr:0.0000100 epoch_Time:26295.0min: [2024-01-04 16:26:54,560][model8_pretrain.py][INFO] Epoch:[0/2](442600/4588595) loss:2.853 lr:0.0000100 epoch_Time:26295.0min: [2024-01-04 16:26:54,560][model8_pretrain.py][INFO] Epoch:[0/2](442600/4588595) loss:3.076 lr:0.0000100 epoch_Time:26295.0min: [2024-01-04 16:26:54,560][model8_pretrain.py][INFO] Epoch:[0/2](442600/4588595) loss:2.670 lr:0.0000100 epoch_Time:26295.0min: [2024-01-04 16:26:54,560][model8_pretrain.py][INFO] Epoch:[0/2](442600/4588595) loss:2.615 lr:0.0000100 epoch_Time:26295.0min: [2024-01-04 16:27:31,493][model8_pretrain.py][INFO] Epoch:[0/2](442700/4588595) loss:3.323 lr:0.0000100 epoch_Time:26295.0min: [2024-01-04 16:27:31,493][model8_pretrain.py][INFO] Epoch:[0/2](442700/4588595) loss:2.854 lr:0.0000100 epoch_Time:26295.0min: [2024-01-04 16:27:31,493][model8_pretrain.py][INFO] Epoch:[0/2](442700/4588595) loss:3.174 lr:0.0000100 epoch_Time:26295.0min: [2024-01-04 16:27:31,493][model8_pretrain.py][INFO] Epoch:[0/2](442700/4588595) loss:2.520 lr:0.0000100 epoch_Time:26295.0min: [2024-01-04 16:27:31,493][model8_pretrain.py][INFO] Epoch:[0/2](442700/4588595) loss:2.706 lr:0.0000100 epoch_Time:26295.0min: [2024-01-04 16:27:31,493][model8_pretrain.py][INFO] Epoch:[0/2](442700/4588595) loss:3.025 lr:0.0000100 epoch_Time:26295.0min: [2024-01-04 16:27:31,493][model8_pretrain.py][INFO] Epoch:[0/2](442700/4588595) loss:2.553 lr:0.0000100 epoch_Time:26295.0min: [2024-01-04 16:27:31,493][model8_pretrain.py][INFO] Epoch:[0/2](442700/4588595) loss:2.629 lr:0.0000100 epoch_Time:26295.0min: [2024-01-04 
16:28:08,420][model8_pretrain.py][INFO] Epoch:[0/2](442800/4588595) loss:2.643 lr:0.0000100 epoch_Time:26293.0min: [2024-01-04 16:28:08,420][model8_pretrain.py][INFO] Epoch:[0/2](442800/4588595) loss:3.114 lr:0.0000100 epoch_Time:26293.0min: [2024-01-04 16:28:08,420][model8_pretrain.py][INFO] Epoch:[0/2](442800/4588595) loss:2.717 lr:0.0000100 epoch_Time:26293.0min: [2024-01-04 16:28:08,420][model8_pretrain.py][INFO] Epoch:[0/2](442800/4588595) loss:2.837 lr:0.0000100 epoch_Time:26293.0min: [2024-01-04 16:28:08,420][model8_pretrain.py][INFO] Epoch:[0/2](442800/4588595) loss:2.849 lr:0.0000100 epoch_Time:26293.0min: [2024-01-04 16:28:08,420][model8_pretrain.py][INFO] Epoch:[0/2](442800/4588595) loss:2.856 lr:0.0000100 epoch_Time:26293.0min: [2024-01-04 16:28:08,420][model8_pretrain.py][INFO] Epoch:[0/2](442800/4588595) loss:2.986 lr:0.0000100 epoch_Time:26293.0min: [2024-01-04 16:28:08,420][model8_pretrain.py][INFO] Epoch:[0/2](442800/4588595) loss:2.641 lr:0.0000100 epoch_Time:26293.0min: [2024-01-04 16:28:45,356][model8_pretrain.py][INFO] Epoch:[0/2](442900/4588595) loss:2.760 lr:0.0000100 epoch_Time:26293.0min: [2024-01-04 16:28:45,356][model8_pretrain.py][INFO] Epoch:[0/2](442900/4588595) loss:2.596 lr:0.0000100 epoch_Time:26293.0min: [2024-01-04 16:28:45,356][model8_pretrain.py][INFO] Epoch:[0/2](442900/4588595) loss:2.787 lr:0.0000100 epoch_Time:26293.0min: [2024-01-04 16:28:45,356][model8_pretrain.py][INFO] Epoch:[0/2](442900/4588595) loss:3.200 lr:0.0000100 epoch_Time:26293.0min: [2024-01-04 16:28:45,356][model8_pretrain.py][INFO] Epoch:[0/2](442900/4588595) loss:3.112 lr:0.0000100 epoch_Time:26293.0min: [2024-01-04 16:28:45,356][model8_pretrain.py][INFO] Epoch:[0/2](442900/4588595) loss:2.641 lr:0.0000100 epoch_Time:26293.0min: [2024-01-04 16:28:45,356][model8_pretrain.py][INFO] Epoch:[0/2](442900/4588595) loss:2.600 lr:0.0000100 epoch_Time:26293.0min: [2024-01-04 16:28:45,356][model8_pretrain.py][INFO] Epoch:[0/2](442900/4588595) loss:2.915 lr:0.0000100 
epoch_Time:26293.0min: [2024-01-04 16:29:22,290][model8_pretrain.py][INFO] Epoch:[0/2](443000/4588595) loss:3.117 lr:0.0000100 epoch_Time:26292.0min: [2024-01-04 16:29:22,291][model8_pretrain.py][INFO] Epoch:[0/2](443000/4588595) loss:3.042 lr:0.0000100 epoch_Time:26292.0min: [2024-01-04 16:29:22,291][model8_pretrain.py][INFO] Epoch:[0/2](443000/4588595) loss:2.693 lr:0.0000100 epoch_Time:26292.0min: [2024-01-04 16:29:22,291][model8_pretrain.py][INFO] Epoch:[0/2](443000/4588595) loss:2.806 lr:0.0000100 epoch_Time:26292.0min: [2024-01-04 16:29:22,291][model8_pretrain.py][INFO] Epoch:[0/2](443000/4588595) loss:2.384 lr:0.0000100 epoch_Time:26292.0min: [2024-01-04 16:29:22,291][model8_pretrain.py][INFO] Epoch:[0/2](443000/4588595) loss:3.045 lr:0.0000100 epoch_Time:26292.0min: [2024-01-04 16:29:22,291][model8_pretrain.py][INFO] Epoch:[0/2](443000/4588595) loss:3.103 lr:0.0000100 epoch_Time:26292.0min: [2024-01-04 16:29:22,291][model8_pretrain.py][INFO] Epoch:[0/2](443000/4588595) loss:2.644 lr:0.0000100 epoch_Time:26292.0min: [2024-01-04 16:29:59,207][model8_pretrain.py][INFO] Epoch:[0/2](443100/4588595) loss:2.662 lr:0.0000100 epoch_Time:26291.0min: [2024-01-04 16:29:59,207][model8_pretrain.py][INFO] Epoch:[0/2](443100/4588595) loss:3.134 lr:0.0000100 epoch_Time:26291.0min: [2024-01-04 16:29:59,207][model8_pretrain.py][INFO] Epoch:[0/2](443100/4588595) loss:2.515 lr:0.0000100 epoch_Time:26291.0min: [2024-01-04 16:29:59,207][model8_pretrain.py][INFO] Epoch:[0/2](443100/4588595) loss:3.254 lr:0.0000100 epoch_Time:26291.0min: [2024-01-04 16:29:59,207][model8_pretrain.py][INFO] Epoch:[0/2](443100/4588595) loss:2.381 lr:0.0000100 epoch_Time:26291.0min: [2024-01-04 16:29:59,207][model8_pretrain.py][INFO] Epoch:[0/2](443100/4588595) loss:2.929 lr:0.0000100 epoch_Time:26291.0min: [2024-01-04 16:29:59,207][model8_pretrain.py][INFO] Epoch:[0/2](443100/4588595) loss:3.093 lr:0.0000100 epoch_Time:26291.0min: [2024-01-04 16:29:59,207][model8_pretrain.py][INFO] 
Epoch:[0/2](443100/4588595) loss:3.190 lr:0.0000100 epoch_Time:26291.0min: [2024-01-04 16:30:36,144][model8_pretrain.py][INFO] Epoch:[0/2](443200/4588595) loss:2.556 lr:0.0000100 epoch_Time:26291.0min: [2024-01-04 16:30:36,144][model8_pretrain.py][INFO] Epoch:[0/2](443200/4588595) loss:3.008 lr:0.0000100 epoch_Time:26291.0min: [2024-01-04 16:30:36,144][model8_pretrain.py][INFO] Epoch:[0/2](443200/4588595) loss:3.351 lr:0.0000100 epoch_Time:26291.0min: [2024-01-04 16:30:36,144][model8_pretrain.py][INFO] Epoch:[0/2](443200/4588595) loss:3.180 lr:0.0000100 epoch_Time:26291.0min: [2024-01-04 16:30:36,144][model8_pretrain.py][INFO] Epoch:[0/2](443200/4588595) loss:3.236 lr:0.0000100 epoch_Time:26291.0min: [2024-01-04 16:30:36,144][model8_pretrain.py][INFO] Epoch:[0/2](443200/4588595) loss:3.124 lr:0.0000100 epoch_Time:26291.0min: [2024-01-04 16:30:36,144][model8_pretrain.py][INFO] Epoch:[0/2](443200/4588595) loss:2.859 lr:0.0000100 epoch_Time:26291.0min: [2024-01-04 16:30:36,144][model8_pretrain.py][INFO] Epoch:[0/2](443200/4588595) loss:3.240 lr:0.0000100 epoch_Time:26291.0min: [2024-01-04 16:31:13,075][model8_pretrain.py][INFO] Epoch:[0/2](443300/4588595) loss:2.655 lr:0.0000100 epoch_Time:26289.0min: [2024-01-04 16:31:13,075][model8_pretrain.py][INFO] Epoch:[0/2](443300/4588595) loss:2.350 lr:0.0000100 epoch_Time:26289.0min: [2024-01-04 16:31:13,075][model8_pretrain.py][INFO] Epoch:[0/2](443300/4588595) loss:2.814 lr:0.0000100 epoch_Time:26289.0min: [2024-01-04 16:31:13,075][model8_pretrain.py][INFO] Epoch:[0/2](443300/4588595) loss:3.467 lr:0.0000100 epoch_Time:26289.0min: [2024-01-04 16:31:13,075][model8_pretrain.py][INFO] Epoch:[0/2](443300/4588595) loss:3.109 lr:0.0000100 epoch_Time:26289.0min: [2024-01-04 16:31:13,075][model8_pretrain.py][INFO] Epoch:[0/2](443300/4588595) loss:2.941 lr:0.0000100 epoch_Time:26289.0min: [2024-01-04 16:31:13,075][model8_pretrain.py][INFO] Epoch:[0/2](443300/4588595) loss:2.810 lr:0.0000100 epoch_Time:26289.0min: [2024-01-04 
16:31:13,075][model8_pretrain.py][INFO] Epoch:[0/2](443300/4588595) loss:2.780 lr:0.0000100 epoch_Time:26289.0min: [2024-01-04 16:32:00,139][model8_pretrain.py][INFO] Epoch:[0/2](443400/4588595) loss:3.231 lr:0.0000100 epoch_Time:26290.0min: [2024-01-04 16:32:00,139][model8_pretrain.py][INFO] Epoch:[0/2](443400/4588595) loss:2.663 lr:0.0000100 epoch_Time:26290.0min: [2024-01-04 16:32:00,139][model8_pretrain.py][INFO] Epoch:[0/2](443400/4588595) loss:2.835 lr:0.0000100 epoch_Time:26290.0min: [2024-01-04 16:32:00,140][model8_pretrain.py][INFO] Epoch:[0/2](443400/4588595) loss:2.290 lr:0.0000100 epoch_Time:26290.0min: [2024-01-04 16:32:00,140][model8_pretrain.py][INFO] Epoch:[0/2](443400/4588595) loss:3.044 lr:0.0000100 epoch_Time:26290.0min: [2024-01-04 16:32:00,140][model8_pretrain.py][INFO] Epoch:[0/2](443400/4588595) loss:2.846 lr:0.0000100 epoch_Time:26290.0min: [2024-01-04 16:32:00,140][model8_pretrain.py][INFO] Epoch:[0/2](443400/4588595) loss:3.290 lr:0.0000100 epoch_Time:26290.0min: [2024-01-04 16:32:00,140][model8_pretrain.py][INFO] Epoch:[0/2](443400/4588595) loss:2.650 lr:0.0000100 epoch_Time:26290.0min: [2024-01-04 16:32:37,032][model8_pretrain.py][INFO] Epoch:[0/2](443500/4588595) loss:2.830 lr:0.0000100 epoch_Time:26290.0min: [2024-01-04 16:32:37,032][model8_pretrain.py][INFO] Epoch:[0/2](443500/4588595) loss:3.149 lr:0.0000100 epoch_Time:26290.0min: [2024-01-04 16:32:37,032][model8_pretrain.py][INFO] Epoch:[0/2](443500/4588595) loss:3.140 lr:0.0000100 epoch_Time:26290.0min: [2024-01-04 16:32:37,032][model8_pretrain.py][INFO] Epoch:[0/2](443500/4588595) loss:2.926 lr:0.0000100 epoch_Time:26290.0min: [2024-01-04 16:32:37,032][model8_pretrain.py][INFO] Epoch:[0/2](443500/4588595) loss:3.190 lr:0.0000100 epoch_Time:26290.0min: [2024-01-04 16:32:37,032][model8_pretrain.py][INFO] Epoch:[0/2](443500/4588595) loss:3.294 lr:0.0000100 epoch_Time:26290.0min: [2024-01-04 16:32:37,032][model8_pretrain.py][INFO] Epoch:[0/2](443500/4588595) loss:3.137 lr:0.0000100 
epoch_Time:26290.0min: [2024-01-04 16:32:37,032][model8_pretrain.py][INFO] Epoch:[0/2](443500/4588595) loss:3.305 lr:0.0000100 epoch_Time:26290.0min: [2024-01-04 16:33:13,955][model8_pretrain.py][INFO] Epoch:[0/2](443600/4588595) loss:2.506 lr:0.0000100 epoch_Time:26289.0min: [2024-01-04 16:33:13,955][model8_pretrain.py][INFO] Epoch:[0/2](443600/4588595) loss:3.270 lr:0.0000100 epoch_Time:26289.0min: [2024-01-04 16:33:13,955][model8_pretrain.py][INFO] Epoch:[0/2](443600/4588595) loss:2.398 lr:0.0000100 epoch_Time:26289.0min: [2024-01-04 16:33:13,955][model8_pretrain.py][INFO] Epoch:[0/2](443600/4588595) loss:2.595 lr:0.0000100 epoch_Time:26289.0min: [2024-01-04 16:33:13,955][model8_pretrain.py][INFO] Epoch:[0/2](443600/4588595) loss:3.074 lr:0.0000100 epoch_Time:26289.0min: [2024-01-04 16:33:13,955][model8_pretrain.py][INFO] Epoch:[0/2](443600/4588595) loss:2.946 lr:0.0000100 epoch_Time:26289.0min: [2024-01-04 16:33:13,955][model8_pretrain.py][INFO] Epoch:[0/2](443600/4588595) loss:3.058 lr:0.0000100 epoch_Time:26289.0min: [2024-01-04 16:33:13,955][model8_pretrain.py][INFO] Epoch:[0/2](443600/4588595) loss:2.738 lr:0.0000100 epoch_Time:26289.0min: [2024-01-04 16:33:50,869][model8_pretrain.py][INFO] Epoch:[0/2](443700/4588595) loss:2.695 lr:0.0000100 epoch_Time:26287.0min: [2024-01-04 16:33:50,869][model8_pretrain.py][INFO] Epoch:[0/2](443700/4588595) loss:2.737 lr:0.0000100 epoch_Time:26287.0min: [2024-01-04 16:33:50,869][model8_pretrain.py][INFO] Epoch:[0/2](443700/4588595) loss:2.787 lr:0.0000100 epoch_Time:26287.0min: [2024-01-04 16:33:50,869][model8_pretrain.py][INFO] Epoch:[0/2](443700/4588595) loss:2.593 lr:0.0000100 epoch_Time:26287.0min: [2024-01-04 16:33:50,869][model8_pretrain.py][INFO] Epoch:[0/2](443700/4588595) loss:2.605 lr:0.0000100 epoch_Time:26287.0min: [2024-01-04 16:33:50,869][model8_pretrain.py][INFO] Epoch:[0/2](443700/4588595) loss:3.011 lr:0.0000100 epoch_Time:26287.0min: [2024-01-04 16:33:50,869][model8_pretrain.py][INFO] 
Epoch:[0/2](443700/4588595) loss:3.233 lr:0.0000100 epoch_Time:26287.0min: [2024-01-04 16:33:50,869][model8_pretrain.py][INFO] Epoch:[0/2](443700/4588595) loss:3.053 lr:0.0000100 epoch_Time:26287.0min: [2024-01-04 16:34:27,799][model8_pretrain.py][INFO] Epoch:[0/2](443800/4588595) loss:3.140 lr:0.0000100 epoch_Time:26287.0min: [2024-01-04 16:34:27,799][model8_pretrain.py][INFO] Epoch:[0/2](443800/4588595) loss:2.769 lr:0.0000100 epoch_Time:26287.0min: [2024-01-04 16:34:27,799][model8_pretrain.py][INFO] Epoch:[0/2](443800/4588595) loss:3.053 lr:0.0000100 epoch_Time:26287.0min: [2024-01-04 16:34:27,799][model8_pretrain.py][INFO] Epoch:[0/2](443800/4588595) loss:2.852 lr:0.0000100 epoch_Time:26287.0min: [2024-01-04 16:34:27,799][model8_pretrain.py][INFO] Epoch:[0/2](443800/4588595) loss:2.959 lr:0.0000100 epoch_Time:26287.0min: [2024-01-04 16:34:27,799][model8_pretrain.py][INFO] Epoch:[0/2](443800/4588595) loss:2.673 lr:0.0000100 epoch_Time:26287.0min: [2024-01-04 16:34:27,799][model8_pretrain.py][INFO] Epoch:[0/2](443800/4588595) loss:3.261 lr:0.0000100 epoch_Time:26287.0min: [2024-01-04 16:34:27,799][model8_pretrain.py][INFO] Epoch:[0/2](443800/4588595) loss:2.184 lr:0.0000100 epoch_Time:26287.0min: [2024-01-04 16:35:04,729][model8_pretrain.py][INFO] Epoch:[0/2](443900/4588595) loss:2.778 lr:0.0000100 epoch_Time:26286.0min: [2024-01-04 16:35:04,729][model8_pretrain.py][INFO] Epoch:[0/2](443900/4588595) loss:2.859 lr:0.0000100 epoch_Time:26286.0min: [2024-01-04 16:35:04,729][model8_pretrain.py][INFO] Epoch:[0/2](443900/4588595) loss:2.369 lr:0.0000100 epoch_Time:26286.0min: [2024-01-04 16:35:04,729][model8_pretrain.py][INFO] Epoch:[0/2](443900/4588595) loss:2.752 lr:0.0000100 epoch_Time:26286.0min: [2024-01-04 16:35:04,729][model8_pretrain.py][INFO] Epoch:[0/2](443900/4588595) loss:2.847 lr:0.0000100 epoch_Time:26286.0min: [2024-01-04 16:35:04,729][model8_pretrain.py][INFO] Epoch:[0/2](443900/4588595) loss:2.905 lr:0.0000100 epoch_Time:26286.0min: [2024-01-04 
16:35:04,729][model8_pretrain.py][INFO] Epoch:[0/2](443900/4588595) loss:2.633 lr:0.0000100 epoch_Time:26286.0min: [2024-01-04 16:35:04,730][model8_pretrain.py][INFO] Epoch:[0/2](443900/4588595) loss:2.640 lr:0.0000100 epoch_Time:26286.0min: [2024-01-04 16:35:41,658][model8_pretrain.py][INFO] Epoch:[0/2](444000/4588595) loss:3.270 lr:0.0000100 epoch_Time:26286.0min: [2024-01-04 16:35:41,658][model8_pretrain.py][INFO] Epoch:[0/2](444000/4588595) loss:2.303 lr:0.0000100 epoch_Time:26286.0min: [2024-01-04 16:35:41,658][model8_pretrain.py][INFO] Epoch:[0/2](444000/4588595) loss:2.819 lr:0.0000100 epoch_Time:26286.0min: [2024-01-04 16:35:41,658][model8_pretrain.py][INFO] Epoch:[0/2](444000/4588595) loss:2.632 lr:0.0000100 epoch_Time:26286.0min: [2024-01-04 16:35:41,658][model8_pretrain.py][INFO] Epoch:[0/2](444000/4588595) loss:2.967 lr:0.0000100 epoch_Time:26286.0min: [2024-01-04 16:35:41,658][model8_pretrain.py][INFO] Epoch:[0/2](444000/4588595) loss:2.980 lr:0.0000100 epoch_Time:26286.0min: [2024-01-04 16:35:41,658][model8_pretrain.py][INFO] Epoch:[0/2](444000/4588595) loss:2.985 lr:0.0000100 epoch_Time:26286.0min: [2024-01-04 16:35:41,658][model8_pretrain.py][INFO] Epoch:[0/2](444000/4588595) loss:2.657 lr:0.0000100 epoch_Time:26286.0min: [2024-01-04 16:36:18,579][model8_pretrain.py][INFO] Epoch:[0/2](444100/4588595) loss:3.307 lr:0.0000100 epoch_Time:26285.0min: [2024-01-04 16:36:18,579][model8_pretrain.py][INFO] Epoch:[0/2](444100/4588595) loss:2.874 lr:0.0000100 epoch_Time:26285.0min: [2024-01-04 16:36:18,579][model8_pretrain.py][INFO] Epoch:[0/2](444100/4588595) loss:2.800 lr:0.0000100 epoch_Time:26285.0min: [2024-01-04 16:36:18,579][model8_pretrain.py][INFO] Epoch:[0/2](444100/4588595) loss:2.141 lr:0.0000100 epoch_Time:26285.0min: [2024-01-04 16:36:18,579][model8_pretrain.py][INFO] Epoch:[0/2](444100/4588595) loss:3.037 lr:0.0000100 epoch_Time:26285.0min: [2024-01-04 16:36:18,579][model8_pretrain.py][INFO] Epoch:[0/2](444100/4588595) loss:2.663 lr:0.0000100 
epoch_Time:26285.0min: [2024-01-04 16:36:18,579][model8_pretrain.py][INFO] Epoch:[0/2](444100/4588595) loss:2.673 lr:0.0000100 epoch_Time:26285.0min: [2024-01-04 16:36:18,579][model8_pretrain.py][INFO] Epoch:[0/2](444100/4588595) loss:2.867 lr:0.0000100 epoch_Time:26285.0min: [2024-01-04 16:37:05,639][model8_pretrain.py][INFO] Epoch:[0/2](444200/4588595) loss:2.806 lr:0.0000100 epoch_Time:26285.0min: [2024-01-04 16:37:05,639][model8_pretrain.py][INFO] Epoch:[0/2](444200/4588595) loss:2.915 lr:0.0000100 epoch_Time:26285.0min: [2024-01-04 16:37:05,639][model8_pretrain.py][INFO] Epoch:[0/2](444200/4588595) loss:2.423 lr:0.0000100 epoch_Time:26285.0min: [2024-01-04 16:37:05,639][model8_pretrain.py][INFO] Epoch:[0/2](444200/4588595) loss:3.138 lr:0.0000100 epoch_Time:26285.0min: [2024-01-04 16:37:05,639][model8_pretrain.py][INFO] Epoch:[0/2](444200/4588595) loss:3.214 lr:0.0000100 epoch_Time:26285.0min: [2024-01-04 16:37:05,639][model8_pretrain.py][INFO] Epoch:[0/2](444200/4588595) loss:2.808 lr:0.0000100 epoch_Time:26285.0min: [2024-01-04 16:37:05,639][model8_pretrain.py][INFO] Epoch:[0/2](444200/4588595) loss:2.674 lr:0.0000100 epoch_Time:26285.0min: [2024-01-04 16:37:05,639][model8_pretrain.py][INFO] Epoch:[0/2](444200/4588595) loss:2.500 lr:0.0000100 epoch_Time:26285.0min: [2024-01-04 16:37:42,531][model8_pretrain.py][INFO] Epoch:[0/2](444300/4588595) loss:3.171 lr:0.0000100 epoch_Time:26285.0min: [2024-01-04 16:37:42,531][model8_pretrain.py][INFO] Epoch:[0/2](444300/4588595) loss:2.888 lr:0.0000100 epoch_Time:26285.0min: [2024-01-04 16:37:42,531][model8_pretrain.py][INFO] Epoch:[0/2](444300/4588595) loss:3.215 lr:0.0000100 epoch_Time:26285.0min: [2024-01-04 16:37:42,531][model8_pretrain.py][INFO] Epoch:[0/2](444300/4588595) loss:3.063 lr:0.0000100 epoch_Time:26285.0min: [2024-01-04 16:37:42,531][model8_pretrain.py][INFO] Epoch:[0/2](444300/4588595) loss:3.024 lr:0.0000100 epoch_Time:26285.0min: [2024-01-04 16:37:42,531][model8_pretrain.py][INFO] 
Epoch:[0/2](444300/4588595) loss:2.956 lr:0.0000100 epoch_Time:26285.0min: [2024-01-04 16:37:42,531][model8_pretrain.py][INFO] Epoch:[0/2](444300/4588595) loss:2.425 lr:0.0000100 epoch_Time:26285.0min: [2024-01-04 16:37:42,531][model8_pretrain.py][INFO] Epoch:[0/2](444300/4588595) loss:2.995 lr:0.0000100 epoch_Time:26285.0min: [2024-01-04 16:38:19,466][model8_pretrain.py][INFO] Epoch:[0/2](444400/4588595) loss:2.889 lr:0.0000100 epoch_Time:26284.0min: [2024-01-04 16:38:19,466][model8_pretrain.py][INFO] Epoch:[0/2](444400/4588595) loss:3.096 lr:0.0000100 epoch_Time:26284.0min: [2024-01-04 16:38:19,466][model8_pretrain.py][INFO] Epoch:[0/2](444400/4588595) loss:2.713 lr:0.0000100 epoch_Time:26284.0min: [2024-01-04 16:38:19,466][model8_pretrain.py][INFO] Epoch:[0/2](444400/4588595) loss:2.836 lr:0.0000100 epoch_Time:26284.0min: [2024-01-04 16:38:19,466][model8_pretrain.py][INFO] Epoch:[0/2](444400/4588595) loss:2.876 lr:0.0000100 epoch_Time:26284.0min: [2024-01-04 16:38:19,466][model8_pretrain.py][INFO] Epoch:[0/2](444400/4588595) loss:2.806 lr:0.0000100 epoch_Time:26284.0min: [2024-01-04 16:38:19,466][model8_pretrain.py][INFO] Epoch:[0/2](444400/4588595) loss:2.591 lr:0.0000100 epoch_Time:26284.0min: [2024-01-04 16:38:19,466][model8_pretrain.py][INFO] Epoch:[0/2](444400/4588595) loss:3.073 lr:0.0000100 epoch_Time:26284.0min: [2024-01-04 16:38:56,390][model8_pretrain.py][INFO] Epoch:[0/2](444500/4588595) loss:2.442 lr:0.0000100 epoch_Time:26283.0min: [2024-01-04 16:38:56,390][model8_pretrain.py][INFO] Epoch:[0/2](444500/4588595) loss:3.135 lr:0.0000100 epoch_Time:26283.0min: [2024-01-04 16:38:56,390][model8_pretrain.py][INFO] Epoch:[0/2](444500/4588595) loss:2.993 lr:0.0000100 epoch_Time:26283.0min: [2024-01-04 16:38:56,390][model8_pretrain.py][INFO] Epoch:[0/2](444500/4588595) loss:2.844 lr:0.0000100 epoch_Time:26283.0min: [2024-01-04 16:38:56,390][model8_pretrain.py][INFO] Epoch:[0/2](444500/4588595) loss:2.742 lr:0.0000100 epoch_Time:26283.0min: [2024-01-04 
16:38:56,390][model8_pretrain.py][INFO] Epoch:[0/2](444500/4588595) loss:3.172 lr:0.0000100 epoch_Time:26283.0min: [2024-01-04 16:38:56,390][model8_pretrain.py][INFO] Epoch:[0/2](444500/4588595) loss:2.865 lr:0.0000100 epoch_Time:26283.0min: [2024-01-04 16:38:56,390][model8_pretrain.py][INFO] Epoch:[0/2](444500/4588595) loss:3.166 lr:0.0000100 epoch_Time:26283.0min: [2024-01-04 16:39:33,325][model8_pretrain.py][INFO] Epoch:[0/2](444600/4588595) loss:3.263 lr:0.0000100 epoch_Time:26282.0min: [2024-01-04 16:39:33,325][model8_pretrain.py][INFO] Epoch:[0/2](444600/4588595) loss:2.440 lr:0.0000100 epoch_Time:26282.0min: [2024-01-04 16:39:33,325][model8_pretrain.py][INFO] Epoch:[0/2](444600/4588595) loss:2.659 lr:0.0000100 epoch_Time:26282.0min: [2024-01-04 16:39:33,326][model8_pretrain.py][INFO] Epoch:[0/2](444600/4588595) loss:2.941 lr:0.0000100 epoch_Time:26282.0min: [2024-01-04 16:39:33,326][model8_pretrain.py][INFO] Epoch:[0/2](444600/4588595) loss:3.038 lr:0.0000100 epoch_Time:26282.0min: [2024-01-04 16:39:33,326][model8_pretrain.py][INFO] Epoch:[0/2](444600/4588595) loss:3.125 lr:0.0000100 epoch_Time:26282.0min: [2024-01-04 16:39:33,326][model8_pretrain.py][INFO] Epoch:[0/2](444600/4588595) loss:2.663 lr:0.0000100 epoch_Time:26282.0min: [2024-01-04 16:39:33,326][model8_pretrain.py][INFO] Epoch:[0/2](444600/4588595) loss:3.290 lr:0.0000100 epoch_Time:26282.0min: [2024-01-04 16:40:10,268][model8_pretrain.py][INFO] Epoch:[0/2](444700/4588595) loss:2.464 lr:0.0000100 epoch_Time:26281.0min: [2024-01-04 16:40:10,268][model8_pretrain.py][INFO] Epoch:[0/2](444700/4588595) loss:2.495 lr:0.0000100 epoch_Time:26281.0min: [2024-01-04 16:40:10,268][model8_pretrain.py][INFO] Epoch:[0/2](444700/4588595) loss:2.620 lr:0.0000100 epoch_Time:26281.0min: [2024-01-04 16:40:10,268][model8_pretrain.py][INFO] Epoch:[0/2](444700/4588595) loss:2.676 lr:0.0000100 epoch_Time:26281.0min: [2024-01-04 16:40:10,268][model8_pretrain.py][INFO] Epoch:[0/2](444700/4588595) loss:3.265 lr:0.0000100 
epoch_Time:26281.0min: [2024-01-04 16:40:10,268][model8_pretrain.py][INFO] Epoch:[0/2](444700/4588595) loss:2.986 lr:0.0000100 epoch_Time:26281.0min: [2024-01-04 16:40:10,268][model8_pretrain.py][INFO] Epoch:[0/2](444700/4588595) loss:2.734 lr:0.0000100 epoch_Time:26281.0min: [2024-01-04 16:40:10,268][model8_pretrain.py][INFO] Epoch:[0/2](444700/4588595) loss:2.911 lr:0.0000100 epoch_Time:26281.0min: [2024-01-04 16:40:47,204][model8_pretrain.py][INFO] Epoch:[0/2](444800/4588595) loss:2.996 lr:0.0000100 epoch_Time:26281.0min: [2024-01-04 16:40:47,204][model8_pretrain.py][INFO] Epoch:[0/2](444800/4588595) loss:3.183 lr:0.0000100 epoch_Time:26281.0min: [2024-01-04 16:40:47,204][model8_pretrain.py][INFO] Epoch:[0/2](444800/4588595) loss:2.991 lr:0.0000100 epoch_Time:26281.0min: [2024-01-04 16:40:47,204][model8_pretrain.py][INFO] Epoch:[0/2](444800/4588595) loss:2.944 lr:0.0000100 epoch_Time:26281.0min: [2024-01-04 16:40:47,204][model8_pretrain.py][INFO] Epoch:[0/2](444800/4588595) loss:2.873 lr:0.0000100 epoch_Time:26281.0min: [2024-01-04 16:40:47,204][model8_pretrain.py][INFO] Epoch:[0/2](444800/4588595) loss:2.761 lr:0.0000100 epoch_Time:26281.0min: [2024-01-04 16:40:47,204][model8_pretrain.py][INFO] Epoch:[0/2](444800/4588595) loss:2.783 lr:0.0000100 epoch_Time:26281.0min: [2024-01-04 16:40:47,204][model8_pretrain.py][INFO] Epoch:[0/2](444800/4588595) loss:3.291 lr:0.0000100 epoch_Time:26281.0min: [2024-01-04 16:41:24,139][model8_pretrain.py][INFO] Epoch:[0/2](444900/4588595) loss:3.237 lr:0.0000100 epoch_Time:26280.0min: [2024-01-04 16:41:24,139][model8_pretrain.py][INFO] Epoch:[0/2](444900/4588595) loss:2.999 lr:0.0000100 epoch_Time:26280.0min: [2024-01-04 16:41:24,139][model8_pretrain.py][INFO] Epoch:[0/2](444900/4588595) loss:2.664 lr:0.0000100 epoch_Time:26280.0min: [2024-01-04 16:41:24,139][model8_pretrain.py][INFO] Epoch:[0/2](444900/4588595) loss:2.503 lr:0.0000100 epoch_Time:26280.0min: [2024-01-04 16:41:24,139][model8_pretrain.py][INFO] 
Epoch:[0/2](444900/4588595) loss:2.713 lr:0.0000100 epoch_Time:26280.0min: [2024-01-04 16:41:24,139][model8_pretrain.py][INFO] Epoch:[0/2](444900/4588595) loss:3.151 lr:0.0000100 epoch_Time:26280.0min: [2024-01-04 16:41:24,139][model8_pretrain.py][INFO] Epoch:[0/2](444900/4588595) loss:3.095 lr:0.0000100 epoch_Time:26280.0min: [2024-01-04 16:41:24,139][model8_pretrain.py][INFO] Epoch:[0/2](444900/4588595) loss:3.201 lr:0.0000100 epoch_Time:26280.0min: [2024-01-04 16:42:11,217][model8_pretrain.py][INFO] Epoch:[0/2](445000/4588595) loss:2.779 lr:0.0000100 epoch_Time:26280.0min: [2024-01-04 16:42:11,217][model8_pretrain.py][INFO] Epoch:[0/2](445000/4588595) loss:2.598 lr:0.0000100 epoch_Time:26280.0min: [2024-01-04 16:42:11,217][model8_pretrain.py][INFO] Epoch:[0/2](445000/4588595) loss:2.646 lr:0.0000100 epoch_Time:26280.0min: [2024-01-04 16:42:11,217][model8_pretrain.py][INFO] Epoch:[0/2](445000/4588595) loss:3.066 lr:0.0000100 epoch_Time:26280.0min: [2024-01-04 16:42:11,217][model8_pretrain.py][INFO] Epoch:[0/2](445000/4588595) loss:2.684 lr:0.0000100 epoch_Time:26280.0min: [2024-01-04 16:42:11,217][model8_pretrain.py][INFO] Epoch:[0/2](445000/4588595) loss:3.644 lr:0.0000100 epoch_Time:26280.0min: [2024-01-04 16:42:11,217][model8_pretrain.py][INFO] Epoch:[0/2](445000/4588595) loss:3.092 lr:0.0000100 epoch_Time:26280.0min: [2024-01-04 16:42:11,217][model8_pretrain.py][INFO] Epoch:[0/2](445000/4588595) loss:3.116 lr:0.0000100 epoch_Time:26280.0min: [2024-01-04 16:42:48,127][model8_pretrain.py][INFO] Epoch:[0/2](445100/4588595) loss:2.632 lr:0.0000100 epoch_Time:26279.0min: [2024-01-04 16:42:48,127][model8_pretrain.py][INFO] Epoch:[0/2](445100/4588595) loss:2.972 lr:0.0000100 epoch_Time:26279.0min: [2024-01-04 16:42:48,128][model8_pretrain.py][INFO] Epoch:[0/2](445100/4588595) loss:2.951 lr:0.0000100 epoch_Time:26279.0min: [2024-01-04 16:42:48,127][model8_pretrain.py][INFO] Epoch:[0/2](445100/4588595) loss:2.898 lr:0.0000100 epoch_Time:26279.0min: [2024-01-04 
16:42:48,128][model8_pretrain.py][INFO] Epoch:[0/2](445100/4588595) loss:2.802 lr:0.0000100 epoch_Time:26279.0min: [2024-01-04 16:42:48,128][model8_pretrain.py][INFO] Epoch:[0/2](445100/4588595) loss:2.827 lr:0.0000100 epoch_Time:26279.0min: [2024-01-04 16:42:48,128][model8_pretrain.py][INFO] Epoch:[0/2](445100/4588595) loss:3.341 lr:0.0000100 epoch_Time:26279.0min: [2024-01-04 16:42:48,128][model8_pretrain.py][INFO] Epoch:[0/2](445100/4588595) loss:2.822 lr:0.0000100 epoch_Time:26279.0min: [2024-01-04 16:43:25,055][model8_pretrain.py][INFO] Epoch:[0/2](445200/4588595) loss:3.193 lr:0.0000100 epoch_Time:26279.0min: [2024-01-04 16:43:25,055][model8_pretrain.py][INFO] Epoch:[0/2](445200/4588595) loss:2.301 lr:0.0000100 epoch_Time:26279.0min: [2024-01-04 16:43:25,055][model8_pretrain.py][INFO] Epoch:[0/2](445200/4588595) loss:3.616 lr:0.0000100 epoch_Time:26279.0min: [2024-01-04 16:43:25,055][model8_pretrain.py][INFO] Epoch:[0/2](445200/4588595) loss:3.376 lr:0.0000100 epoch_Time:26279.0min: [2024-01-04 16:43:25,055][model8_pretrain.py][INFO] Epoch:[0/2](445200/4588595) loss:1.920 lr:0.0000100 epoch_Time:26279.0min: [2024-01-04 16:43:25,055][model8_pretrain.py][INFO] Epoch:[0/2](445200/4588595) loss:2.951 lr:0.0000100 epoch_Time:26279.0min: [2024-01-04 16:43:25,055][model8_pretrain.py][INFO] Epoch:[0/2](445200/4588595) loss:2.699 lr:0.0000100 epoch_Time:26279.0min: [2024-01-04 16:43:25,056][model8_pretrain.py][INFO] Epoch:[0/2](445200/4588595) loss:2.778 lr:0.0000100 epoch_Time:26279.0min: [2024-01-04 16:44:01,977][model8_pretrain.py][INFO] Epoch:[0/2](445300/4588595) loss:3.236 lr:0.0000100 epoch_Time:26278.0min: [2024-01-04 16:44:01,977][model8_pretrain.py][INFO] Epoch:[0/2](445300/4588595) loss:2.858 lr:0.0000100 epoch_Time:26278.0min: [2024-01-04 16:44:01,977][model8_pretrain.py][INFO] Epoch:[0/2](445300/4588595) loss:2.942 lr:0.0000100 epoch_Time:26278.0min: [2024-01-04 16:44:01,977][model8_pretrain.py][INFO] Epoch:[0/2](445300/4588595) loss:2.962 lr:0.0000100 
epoch_Time:26278.0min: [2024-01-04 16:44:01,977][model8_pretrain.py][INFO] Epoch:[0/2](445300/4588595) loss:3.069 lr:0.0000100 epoch_Time:26278.0min: [2024-01-04 16:44:01,977][model8_pretrain.py][INFO] Epoch:[0/2](445300/4588595) loss:2.794 lr:0.0000100 epoch_Time:26278.0min: [2024-01-04 16:44:01,977][model8_pretrain.py][INFO] Epoch:[0/2](445300/4588595) loss:3.178 lr:0.0000100 epoch_Time:26278.0min: [2024-01-04 16:44:01,978][model8_pretrain.py][INFO] Epoch:[0/2](445300/4588595) loss:2.794 lr:0.0000100 epoch_Time:26278.0min: [2024-01-04 16:44:38,908][model8_pretrain.py][INFO] Epoch:[0/2](445400/4588595) loss:3.163 lr:0.0000100 epoch_Time:26278.0min: [2024-01-04 16:44:38,908][model8_pretrain.py][INFO] Epoch:[0/2](445400/4588595) loss:3.374 lr:0.0000100 epoch_Time:26278.0min: [2024-01-04 16:44:38,908][model8_pretrain.py][INFO] Epoch:[0/2](445400/4588595) loss:3.071 lr:0.0000100 epoch_Time:26278.0min: [2024-01-04 16:44:38,908][model8_pretrain.py][INFO] Epoch:[0/2](445400/4588595) loss:2.826 lr:0.0000100 epoch_Time:26278.0min: [2024-01-04 16:44:38,908][model8_pretrain.py][INFO] Epoch:[0/2](445400/4588595) loss:2.471 lr:0.0000100 epoch_Time:26278.0min: [2024-01-04 16:44:38,908][model8_pretrain.py][INFO] Epoch:[0/2](445400/4588595) loss:2.497 lr:0.0000100 epoch_Time:26278.0min: [2024-01-04 16:44:38,908][model8_pretrain.py][INFO] Epoch:[0/2](445400/4588595) loss:2.820 lr:0.0000100 epoch_Time:26278.0min: [2024-01-04 16:44:38,908][model8_pretrain.py][INFO] Epoch:[0/2](445400/4588595) loss:2.908 lr:0.0000100 epoch_Time:26278.0min: [2024-01-04 16:45:15,847][model8_pretrain.py][INFO] Epoch:[0/2](445500/4588595) loss:3.084 lr:0.0000100 epoch_Time:26276.0min: [2024-01-04 16:45:15,847][model8_pretrain.py][INFO] Epoch:[0/2](445500/4588595) loss:2.692 lr:0.0000100 epoch_Time:26276.0min: [2024-01-04 16:45:15,847][model8_pretrain.py][INFO] Epoch:[0/2](445500/4588595) loss:3.265 lr:0.0000100 epoch_Time:26276.0min: [2024-01-04 16:45:15,847][model8_pretrain.py][INFO] 
Epoch:[0/2](445500/4588595) loss:3.074 lr:0.0000100 epoch_Time:26276.0min: [2024-01-04 16:45:15,847][model8_pretrain.py][INFO] Epoch:[0/2](445500/4588595) loss:3.087 lr:0.0000100 epoch_Time:26276.0min: [2024-01-04 16:45:15,847][model8_pretrain.py][INFO] Epoch:[0/2](445500/4588595) loss:3.167 lr:0.0000100 epoch_Time:26276.0min: [2024-01-04 16:45:15,847][model8_pretrain.py][INFO] Epoch:[0/2](445500/4588595) loss:3.072 lr:0.0000100 epoch_Time:26276.0min: [2024-01-04 16:45:15,847][model8_pretrain.py][INFO] Epoch:[0/2](445500/4588595) loss:3.257 lr:0.0000100 epoch_Time:26276.0min: [2024-01-04 16:45:52,779][model8_pretrain.py][INFO] Epoch:[0/2](445600/4588595) loss:2.852 lr:0.0000100 epoch_Time:26275.0min: [2024-01-04 16:45:52,779][model8_pretrain.py][INFO] Epoch:[0/2](445600/4588595) loss:2.479 lr:0.0000100 epoch_Time:26275.0min: [2024-01-04 16:45:52,779][model8_pretrain.py][INFO] Epoch:[0/2](445600/4588595) loss:2.536 lr:0.0000100 epoch_Time:26275.0min: [2024-01-04 16:45:52,780][model8_pretrain.py][INFO] Epoch:[0/2](445600/4588595) loss:3.283 lr:0.0000100 epoch_Time:26275.0min: [2024-01-04 16:45:52,780][model8_pretrain.py][INFO] Epoch:[0/2](445600/4588595) loss:2.780 lr:0.0000100 epoch_Time:26275.0min: [2024-01-04 16:45:52,780][model8_pretrain.py][INFO] Epoch:[0/2](445600/4588595) loss:3.134 lr:0.0000100 epoch_Time:26275.0min: [2024-01-04 16:45:52,780][model8_pretrain.py][INFO] Epoch:[0/2](445600/4588595) loss:2.914 lr:0.0000100 epoch_Time:26275.0min: [2024-01-04 16:45:52,781][model8_pretrain.py][INFO] Epoch:[0/2](445600/4588595) loss:2.995 lr:0.0000100 epoch_Time:26275.0min: [2024-01-04 16:46:29,706][model8_pretrain.py][INFO] Epoch:[0/2](445700/4588595) loss:3.182 lr:0.0000100 epoch_Time:26275.0min: [2024-01-04 16:46:29,706][model8_pretrain.py][INFO] Epoch:[0/2](445700/4588595) loss:3.075 lr:0.0000100 epoch_Time:26275.0min: [2024-01-04 16:46:29,706][model8_pretrain.py][INFO] Epoch:[0/2](445700/4588595) loss:2.642 lr:0.0000100 epoch_Time:26275.0min: [2024-01-04 
16:46:29,706][model8_pretrain.py][INFO] Epoch:[0/2](445700/4588595) loss:2.944 lr:0.0000100 epoch_Time:26275.0min: [2024-01-04 16:46:29,707][model8_pretrain.py][INFO] Epoch:[0/2](445700/4588595) loss:3.029 lr:0.0000100 epoch_Time:26275.0min: [2024-01-04 16:46:29,707][model8_pretrain.py][INFO] Epoch:[0/2](445700/4588595) loss:2.570 lr:0.0000100 epoch_Time:26275.0min: [2024-01-04 16:46:29,707][model8_pretrain.py][INFO] Epoch:[0/2](445700/4588595) loss:2.561 lr:0.0000100 epoch_Time:26275.0min: [2024-01-04 16:46:29,707][model8_pretrain.py][INFO] Epoch:[0/2](445700/4588595) loss:2.667 lr:0.0000100 epoch_Time:26275.0min: [2024-01-04 16:47:17,031][model8_pretrain.py][INFO] Epoch:[0/2](445800/4588595) loss:2.824 lr:0.0000100 epoch_Time:26276.0min: [2024-01-04 16:47:17,031][model8_pretrain.py][INFO] Epoch:[0/2](445800/4588595) loss:3.110 lr:0.0000100 epoch_Time:26276.0min: [2024-01-04 16:47:17,031][model8_pretrain.py][INFO] Epoch:[0/2](445800/4588595) loss:2.897 lr:0.0000100 epoch_Time:26276.0min: [2024-01-04 16:47:17,031][model8_pretrain.py][INFO] Epoch:[0/2](445800/4588595) loss:2.951 lr:0.0000100 epoch_Time:26276.0min: [2024-01-04 16:47:17,031][model8_pretrain.py][INFO] Epoch:[0/2](445800/4588595) loss:2.578 lr:0.0000100 epoch_Time:26276.0min: [2024-01-04 16:47:17,031][model8_pretrain.py][INFO] Epoch:[0/2](445800/4588595) loss:2.634 lr:0.0000100 epoch_Time:26276.0min: [2024-01-04 16:47:17,031][model8_pretrain.py][INFO] Epoch:[0/2](445800/4588595) loss:2.981 lr:0.0000100 epoch_Time:26276.0min: [2024-01-04 16:47:17,031][model8_pretrain.py][INFO] Epoch:[0/2](445800/4588595) loss:1.754 lr:0.0000100 epoch_Time:26276.0min: [2024-01-04 16:47:54,002][model8_pretrain.py][INFO] Epoch:[0/2](445900/4588595) loss:2.742 lr:0.0000100 epoch_Time:26274.0min: [2024-01-04 16:47:54,002][model8_pretrain.py][INFO] Epoch:[0/2](445900/4588595) loss:2.207 lr:0.0000100 epoch_Time:26274.0min: [2024-01-04 16:47:54,002][model8_pretrain.py][INFO] Epoch:[0/2](445900/4588595) loss:2.627 lr:0.0000100 
epoch_Time:26274.0min: [2024-01-04 16:47:54,002][model8_pretrain.py][INFO] Epoch:[0/2](445900/4588595) loss:3.092 lr:0.0000100 epoch_Time:26274.0min: [2024-01-04 16:47:54,002][model8_pretrain.py][INFO] Epoch:[0/2](445900/4588595) loss:2.699 lr:0.0000100 epoch_Time:26274.0min: [2024-01-04 16:47:54,002][model8_pretrain.py][INFO] Epoch:[0/2](445900/4588595) loss:2.586 lr:0.0000100 epoch_Time:26274.0min: [2024-01-04 16:47:54,002][model8_pretrain.py][INFO] Epoch:[0/2](445900/4588595) loss:2.763 lr:0.0000100 epoch_Time:26274.0min: [2024-01-04 16:47:54,003][model8_pretrain.py][INFO] Epoch:[0/2](445900/4588595) loss:3.319 lr:0.0000100 epoch_Time:26274.0min: [2024-01-04 16:48:30,957][model8_pretrain.py][INFO] Epoch:[0/2](446000/4588595) loss:2.780 lr:0.0000100 epoch_Time:26274.0min: [2024-01-04 16:48:30,957][model8_pretrain.py][INFO] Epoch:[0/2](446000/4588595) loss:3.020 lr:0.0000100 epoch_Time:26274.0min: [2024-01-04 16:48:30,957][model8_pretrain.py][INFO] Epoch:[0/2](446000/4588595) loss:2.575 lr:0.0000100 epoch_Time:26274.0min: [2024-01-04 16:48:30,957][model8_pretrain.py][INFO] Epoch:[0/2](446000/4588595) loss:2.809 lr:0.0000100 epoch_Time:26274.0min: [2024-01-04 16:48:30,957][model8_pretrain.py][INFO] Epoch:[0/2](446000/4588595) loss:2.871 lr:0.0000100 epoch_Time:26274.0min: [2024-01-04 16:48:30,957][model8_pretrain.py][INFO] Epoch:[0/2](446000/4588595) loss:3.076 lr:0.0000100 epoch_Time:26274.0min: [2024-01-04 16:48:30,957][model8_pretrain.py][INFO] Epoch:[0/2](446000/4588595) loss:2.862 lr:0.0000100 epoch_Time:26274.0min: [2024-01-04 16:48:30,957][model8_pretrain.py][INFO] Epoch:[0/2](446000/4588595) loss:1.975 lr:0.0000100 epoch_Time:26274.0min: [2024-01-04 16:49:07,898][model8_pretrain.py][INFO] Epoch:[0/2](446100/4588595) loss:3.094 lr:0.0000100 epoch_Time:26273.0min: [2024-01-04 16:49:07,899][model8_pretrain.py][INFO] Epoch:[0/2](446100/4588595) loss:2.473 lr:0.0000100 epoch_Time:26273.0min: [2024-01-04 16:49:07,899][model8_pretrain.py][INFO] 
Epoch:[0/2](446100/4588595) loss:2.741 lr:0.0000100 epoch_Time:26273.0min: [2024-01-04 16:49:07,899][model8_pretrain.py][INFO] Epoch:[0/2](446100/4588595) loss:2.735 lr:0.0000100 epoch_Time:26273.0min: [2024-01-04 16:49:07,899][model8_pretrain.py][INFO] Epoch:[0/2](446100/4588595) loss:2.719 lr:0.0000100 epoch_Time:26273.0min: [2024-01-04 16:49:07,899][model8_pretrain.py][INFO] Epoch:[0/2](446100/4588595) loss:2.585 lr:0.0000100 epoch_Time:26273.0min: [2024-01-04 16:49:07,899][model8_pretrain.py][INFO] Epoch:[0/2](446100/4588595) loss:3.119 lr:0.0000100 epoch_Time:26273.0min: [2024-01-04 16:49:07,899][model8_pretrain.py][INFO] Epoch:[0/2](446100/4588595) loss:3.020 lr:0.0000100 epoch_Time:26273.0min: [2024-01-04 16:49:44,842][model8_pretrain.py][INFO] Epoch:[0/2](446200/4588595) loss:2.145 lr:0.0000100 epoch_Time:26273.0min: [2024-01-04 16:49:44,842][model8_pretrain.py][INFO] Epoch:[0/2](446200/4588595) loss:3.058 lr:0.0000100 epoch_Time:26273.0min: [2024-01-04 16:49:44,842][model8_pretrain.py][INFO] Epoch:[0/2](446200/4588595) loss:3.241 lr:0.0000100 epoch_Time:26273.0min: [2024-01-04 16:49:44,842][model8_pretrain.py][INFO] Epoch:[0/2](446200/4588595) loss:3.139 lr:0.0000100 epoch_Time:26273.0min: [2024-01-04 16:49:44,842][model8_pretrain.py][INFO] Epoch:[0/2](446200/4588595) loss:2.693 lr:0.0000100 epoch_Time:26273.0min: [2024-01-04 16:49:44,842][model8_pretrain.py][INFO] Epoch:[0/2](446200/4588595) loss:2.708 lr:0.0000100 epoch_Time:26273.0min: [2024-01-04 16:49:44,842][model8_pretrain.py][INFO] Epoch:[0/2](446200/4588595) loss:3.019 lr:0.0000100 epoch_Time:26273.0min: [2024-01-04 16:49:44,842][model8_pretrain.py][INFO] Epoch:[0/2](446200/4588595) loss:2.944 lr:0.0000100 epoch_Time:26273.0min: [2024-01-04 16:50:21,807][model8_pretrain.py][INFO] Epoch:[0/2](446300/4588595) loss:2.995 lr:0.0000100 epoch_Time:26272.0min: [2024-01-04 16:50:21,807][model8_pretrain.py][INFO] Epoch:[0/2](446300/4588595) loss:2.791 lr:0.0000100 epoch_Time:26272.0min: [2024-01-04 
16:50:21,807][model8_pretrain.py][INFO] Epoch:[0/2](446300/4588595) loss:2.780 lr:0.0000100 epoch_Time:26272.0min: [2024-01-04 16:50:21,807][model8_pretrain.py][INFO] Epoch:[0/2](446300/4588595) loss:2.955 lr:0.0000100 epoch_Time:26272.0min: [2024-01-04 16:50:21,807][model8_pretrain.py][INFO] Epoch:[0/2](446300/4588595) loss:2.917 lr:0.0000100 epoch_Time:26272.0min: [2024-01-04 16:50:21,808][model8_pretrain.py][INFO] Epoch:[0/2](446300/4588595) loss:2.785 lr:0.0000100 epoch_Time:26272.0min: [2024-01-04 16:50:21,808][model8_pretrain.py][INFO] Epoch:[0/2](446300/4588595) loss:3.021 lr:0.0000100 epoch_Time:26272.0min: [2024-01-04 16:50:21,808][model8_pretrain.py][INFO] Epoch:[0/2](446300/4588595) loss:2.792 lr:0.0000100 epoch_Time:26272.0min: [2024-01-04 16:50:58,737][model8_pretrain.py][INFO] Epoch:[0/2](446400/4588595) loss:2.725 lr:0.0000100 epoch_Time:26271.0min: [2024-01-04 16:50:58,737][model8_pretrain.py][INFO] Epoch:[0/2](446400/4588595) loss:2.889 lr:0.0000100 epoch_Time:26271.0min: [2024-01-04 16:50:58,737][model8_pretrain.py][INFO] Epoch:[0/2](446400/4588595) loss:2.747 lr:0.0000100 epoch_Time:26271.0min: [2024-01-04 16:50:58,737][model8_pretrain.py][INFO] Epoch:[0/2](446400/4588595) loss:2.972 lr:0.0000100 epoch_Time:26271.0min: [2024-01-04 16:50:58,737][model8_pretrain.py][INFO] Epoch:[0/2](446400/4588595) loss:3.265 lr:0.0000100 epoch_Time:26271.0min: [2024-01-04 16:50:58,737][model8_pretrain.py][INFO] Epoch:[0/2](446400/4588595) loss:2.531 lr:0.0000100 epoch_Time:26271.0min: [2024-01-04 16:50:58,737][model8_pretrain.py][INFO] Epoch:[0/2](446400/4588595) loss:2.631 lr:0.0000100 epoch_Time:26271.0min: [2024-01-04 16:50:58,737][model8_pretrain.py][INFO] Epoch:[0/2](446400/4588595) loss:2.414 lr:0.0000100 epoch_Time:26271.0min: [2024-01-04 16:51:35,667][model8_pretrain.py][INFO] Epoch:[0/2](446500/4588595) loss:2.737 lr:0.0000100 epoch_Time:26270.0min: [2024-01-04 16:51:35,667][model8_pretrain.py][INFO] Epoch:[0/2](446500/4588595) loss:2.049 lr:0.0000100 
epoch_Time:26270.0min: [2024-01-04 16:51:35,667][model8_pretrain.py][INFO] Epoch:[0/2](446500/4588595) loss:2.893 lr:0.0000100 epoch_Time:26270.0min: [2024-01-04 16:51:35,667][model8_pretrain.py][INFO] Epoch:[0/2](446500/4588595) loss:2.808 lr:0.0000100 epoch_Time:26270.0min: [2024-01-04 16:51:35,667][model8_pretrain.py][INFO] Epoch:[0/2](446500/4588595) loss:3.180 lr:0.0000100 epoch_Time:26270.0min: [2024-01-04 16:51:35,667][model8_pretrain.py][INFO] Epoch:[0/2](446500/4588595) loss:3.026 lr:0.0000100 epoch_Time:26270.0min: [2024-01-04 16:51:35,668][model8_pretrain.py][INFO] Epoch:[0/2](446500/4588595) loss:2.760 lr:0.0000100 epoch_Time:26270.0min: [2024-01-04 16:51:35,669][model8_pretrain.py][INFO] Epoch:[0/2](446500/4588595) loss:2.893 lr:0.0000100 epoch_Time:26270.0min: [2024-01-04 16:52:22,824][model8_pretrain.py][INFO] Epoch:[0/2](446600/4588595) loss:2.899 lr:0.0000100 epoch_Time:26271.0min: [2024-01-04 16:52:22,824][model8_pretrain.py][INFO] Epoch:[0/2](446600/4588595) loss:2.830 lr:0.0000100 epoch_Time:26271.0min: [2024-01-04 16:52:22,824][model8_pretrain.py][INFO] Epoch:[0/2](446600/4588595) loss:2.919 lr:0.0000100 epoch_Time:26271.0min: [2024-01-04 16:52:22,824][model8_pretrain.py][INFO] Epoch:[0/2](446600/4588595) loss:2.590 lr:0.0000100 epoch_Time:26271.0min: [2024-01-04 16:52:22,824][model8_pretrain.py][INFO] Epoch:[0/2](446600/4588595) loss:3.434 lr:0.0000100 epoch_Time:26271.0min: [2024-01-04 16:52:22,824][model8_pretrain.py][INFO] Epoch:[0/2](446600/4588595) loss:2.793 lr:0.0000100 epoch_Time:26271.0min: [2024-01-04 16:52:22,824][model8_pretrain.py][INFO] Epoch:[0/2](446600/4588595) loss:2.322 lr:0.0000100 epoch_Time:26271.0min: [2024-01-04 16:52:22,824][model8_pretrain.py][INFO] Epoch:[0/2](446600/4588595) loss:3.266 lr:0.0000100 epoch_Time:26271.0min: [2024-01-04 16:52:59,748][model8_pretrain.py][INFO] Epoch:[0/2](446700/4588595) loss:2.988 lr:0.0000100 epoch_Time:26270.0min: [2024-01-04 16:52:59,748][model8_pretrain.py][INFO] 
Epoch:[0/2](446700/4588595) loss:2.196 lr:0.0000100 epoch_Time:26270.0min: [2024-01-04 16:52:59,748][model8_pretrain.py][INFO] Epoch:[0/2](446700/4588595) loss:3.018 lr:0.0000100 epoch_Time:26270.0min: [2024-01-04 16:52:59,748][model8_pretrain.py][INFO] Epoch:[0/2](446700/4588595) loss:2.613 lr:0.0000100 epoch_Time:26270.0min: [2024-01-04 16:52:59,748][model8_pretrain.py][INFO] Epoch:[0/2](446700/4588595) loss:2.861 lr:0.0000100 epoch_Time:26270.0min: [2024-01-04 16:52:59,748][model8_pretrain.py][INFO] Epoch:[0/2](446700/4588595) loss:3.165 lr:0.0000100 epoch_Time:26270.0min: [2024-01-04 16:52:59,749][model8_pretrain.py][INFO] Epoch:[0/2](446700/4588595) loss:2.503 lr:0.0000100 epoch_Time:26270.0min: [2024-01-04 16:52:59,749][model8_pretrain.py][INFO] Epoch:[0/2](446700/4588595) loss:3.137 lr:0.0000100 epoch_Time:26270.0min: [2024-01-04 16:53:36,673][model8_pretrain.py][INFO] Epoch:[0/2](446800/4588595) loss:2.917 lr:0.0000100 epoch_Time:26269.0min: [2024-01-04 16:53:36,673][model8_pretrain.py][INFO] Epoch:[0/2](446800/4588595) loss:3.065 lr:0.0000100 epoch_Time:26269.0min: [2024-01-04 16:53:36,673][model8_pretrain.py][INFO] Epoch:[0/2](446800/4588595) loss:2.630 lr:0.0000100 epoch_Time:26269.0min: [2024-01-04 16:53:36,673][model8_pretrain.py][INFO] Epoch:[0/2](446800/4588595) loss:2.842 lr:0.0000100 epoch_Time:26269.0min: [2024-01-04 16:53:36,673][model8_pretrain.py][INFO] Epoch:[0/2](446800/4588595) loss:3.186 lr:0.0000100 epoch_Time:26269.0min: [2024-01-04 16:53:36,673][model8_pretrain.py][INFO] Epoch:[0/2](446800/4588595) loss:3.394 lr:0.0000100 epoch_Time:26269.0min: [2024-01-04 16:53:36,673][model8_pretrain.py][INFO] Epoch:[0/2](446800/4588595) loss:2.506 lr:0.0000100 epoch_Time:26269.0min: [2024-01-04 16:53:36,673][model8_pretrain.py][INFO] Epoch:[0/2](446800/4588595) loss:3.062 lr:0.0000100 epoch_Time:26269.0min: [2024-01-04 16:54:13,608][model8_pretrain.py][INFO] Epoch:[0/2](446900/4588595) loss:2.752 lr:0.0000100 epoch_Time:26268.0min: [2024-01-04 
16:54:13,609][model8_pretrain.py][INFO] Epoch:[0/2](446900/4588595) loss:2.714 lr:0.0000100 epoch_Time:26268.0min: [2024-01-04 16:54:13,609][model8_pretrain.py][INFO] Epoch:[0/2](446900/4588595) loss:3.235 lr:0.0000100 epoch_Time:26268.0min: [2024-01-04 16:54:13,609][model8_pretrain.py][INFO] Epoch:[0/2](446900/4588595) loss:3.343 lr:0.0000100 epoch_Time:26268.0min: [2024-01-04 16:54:13,609][model8_pretrain.py][INFO] Epoch:[0/2](446900/4588595) loss:3.230 lr:0.0000100 epoch_Time:26268.0min: [2024-01-04 16:54:13,609][model8_pretrain.py][INFO] Epoch:[0/2](446900/4588595) loss:2.945 lr:0.0000100 epoch_Time:26268.0min: [2024-01-04 16:54:13,609][model8_pretrain.py][INFO] Epoch:[0/2](446900/4588595) loss:2.177 lr:0.0000100 epoch_Time:26268.0min: [2024-01-04 16:54:13,609][model8_pretrain.py][INFO] Epoch:[0/2](446900/4588595) loss:2.807 lr:0.0000100 epoch_Time:26268.0min: [2024-01-04 16:54:50,547][model8_pretrain.py][INFO] Epoch:[0/2](447000/4588595) loss:3.136 lr:0.0000100 epoch_Time:26267.0min: [2024-01-04 16:54:50,547][model8_pretrain.py][INFO] Epoch:[0/2](447000/4588595) loss:3.249 lr:0.0000100 epoch_Time:26267.0min: [2024-01-04 16:54:50,547][model8_pretrain.py][INFO] Epoch:[0/2](447000/4588595) loss:2.874 lr:0.0000100 epoch_Time:26267.0min: [2024-01-04 16:54:50,547][model8_pretrain.py][INFO] Epoch:[0/2](447000/4588595) loss:3.151 lr:0.0000100 epoch_Time:26267.0min: [2024-01-04 16:54:50,547][model8_pretrain.py][INFO] Epoch:[0/2](447000/4588595) loss:3.158 lr:0.0000100 epoch_Time:26267.0min: [2024-01-04 16:54:50,547][model8_pretrain.py][INFO] Epoch:[0/2](447000/4588595) loss:3.156 lr:0.0000100 epoch_Time:26267.0min: [2024-01-04 16:54:50,547][model8_pretrain.py][INFO] Epoch:[0/2](447000/4588595) loss:2.774 lr:0.0000100 epoch_Time:26267.0min: [2024-01-04 16:54:50,547][model8_pretrain.py][INFO] Epoch:[0/2](447000/4588595) loss:2.683 lr:0.0000100 epoch_Time:26267.0min: [2024-01-04 16:55:27,475][model8_pretrain.py][INFO] Epoch:[0/2](447100/4588595) loss:2.584 lr:0.0000100 
epoch_Time:26267.0min: [2024-01-04 16:55:27,475][model8_pretrain.py][INFO] Epoch:[0/2](447100/4588595) loss:2.604 lr:0.0000100 epoch_Time:26267.0min: [2024-01-04 16:55:27,475][model8_pretrain.py][INFO] Epoch:[0/2](447100/4588595) loss:2.725 lr:0.0000100 epoch_Time:26267.0min: [2024-01-04 16:55:27,475][model8_pretrain.py][INFO] Epoch:[0/2](447100/4588595) loss:2.862 lr:0.0000100 epoch_Time:26267.0min: [2024-01-04 16:55:27,475][model8_pretrain.py][INFO] Epoch:[0/2](447100/4588595) loss:2.766 lr:0.0000100 epoch_Time:26267.0min: [2024-01-04 16:55:27,475][model8_pretrain.py][INFO] Epoch:[0/2](447100/4588595) loss:2.690 lr:0.0000100 epoch_Time:26267.0min: [2024-01-04 16:55:27,476][model8_pretrain.py][INFO] Epoch:[0/2](447100/4588595) loss:3.020 lr:0.0000100 epoch_Time:26267.0min: [2024-01-04 16:55:27,476][model8_pretrain.py][INFO] Epoch:[0/2](447100/4588595) loss:2.082 lr:0.0000100 epoch_Time:26267.0min: [2024-01-04 16:56:04,415][model8_pretrain.py][INFO] Epoch:[0/2](447200/4588595) loss:2.874 lr:0.0000100 epoch_Time:26266.0min: [2024-01-04 16:56:04,415][model8_pretrain.py][INFO] Epoch:[0/2](447200/4588595) loss:3.575 lr:0.0000100 epoch_Time:26266.0min: [2024-01-04 16:56:04,415][model8_pretrain.py][INFO] Epoch:[0/2](447200/4588595) loss:2.870 lr:0.0000100 epoch_Time:26266.0min: [2024-01-04 16:56:04,415][model8_pretrain.py][INFO] Epoch:[0/2](447200/4588595) loss:3.123 lr:0.0000100 epoch_Time:26266.0min: [2024-01-04 16:56:04,415][model8_pretrain.py][INFO] Epoch:[0/2](447200/4588595) loss:2.982 lr:0.0000100 epoch_Time:26266.0min: [2024-01-04 16:56:04,415][model8_pretrain.py][INFO] Epoch:[0/2](447200/4588595) loss:2.865 lr:0.0000100 epoch_Time:26266.0min: [2024-01-04 16:56:04,415][model8_pretrain.py][INFO] Epoch:[0/2](447200/4588595) loss:2.372 lr:0.0000100 epoch_Time:26266.0min: [2024-01-04 16:56:04,415][model8_pretrain.py][INFO] Epoch:[0/2](447200/4588595) loss:2.749 lr:0.0000100 epoch_Time:26266.0min: [2024-01-04 16:56:41,372][model8_pretrain.py][INFO] 
Epoch:[0/2](447300/4588595) loss:2.563 lr:0.0000100 epoch_Time:26266.0min: [2024-01-04 16:56:41,372][model8_pretrain.py][INFO] Epoch:[0/2](447300/4588595) loss:2.524 lr:0.0000100 epoch_Time:26266.0min: [2024-01-04 16:56:41,372][model8_pretrain.py][INFO] Epoch:[0/2](447300/4588595) loss:3.061 lr:0.0000100 epoch_Time:26266.0min: [2024-01-04 16:56:41,372][model8_pretrain.py][INFO] Epoch:[0/2](447300/4588595) loss:3.036 lr:0.0000100 epoch_Time:26266.0min: [2024-01-04 16:56:41,372][model8_pretrain.py][INFO] Epoch:[0/2](447300/4588595) loss:2.969 lr:0.0000100 epoch_Time:26266.0min: [2024-01-04 16:56:41,372][model8_pretrain.py][INFO] Epoch:[0/2](447300/4588595) loss:2.612 lr:0.0000100 epoch_Time:26266.0min: [2024-01-04 16:56:41,372][model8_pretrain.py][INFO] Epoch:[0/2](447300/4588595) loss:3.038 lr:0.0000100 epoch_Time:26266.0min: [2024-01-04 16:56:41,372][model8_pretrain.py][INFO] Epoch:[0/2](447300/4588595) loss:3.504 lr:0.0000100 epoch_Time:26266.0min: [2024-01-04 16:57:28,511][model8_pretrain.py][INFO] Epoch:[0/2](447400/4588595) loss:3.312 lr:0.0000100 epoch_Time:26266.0min: [2024-01-04 16:57:28,511][model8_pretrain.py][INFO] Epoch:[0/2](447400/4588595) loss:3.241 lr:0.0000100 epoch_Time:26266.0min: [2024-01-04 16:57:28,511][model8_pretrain.py][INFO] Epoch:[0/2](447400/4588595) loss:2.593 lr:0.0000100 epoch_Time:26266.0min: [2024-01-04 16:57:28,511][model8_pretrain.py][INFO] Epoch:[0/2](447400/4588595) loss:2.289 lr:0.0000100 epoch_Time:26266.0min: [2024-01-04 16:57:28,511][model8_pretrain.py][INFO] Epoch:[0/2](447400/4588595) loss:2.956 lr:0.0000100 epoch_Time:26266.0min: [2024-01-04 16:57:28,511][model8_pretrain.py][INFO] Epoch:[0/2](447400/4588595) loss:2.876 lr:0.0000100 epoch_Time:26266.0min: [2024-01-04 16:57:28,511][model8_pretrain.py][INFO] Epoch:[0/2](447400/4588595) loss:2.440 lr:0.0000100 epoch_Time:26266.0min: [2024-01-04 16:57:28,511][model8_pretrain.py][INFO] Epoch:[0/2](447400/4588595) loss:2.761 lr:0.0000100 epoch_Time:26266.0min: [2024-01-04 
16:58:05,436][model8_pretrain.py][INFO] Epoch:[0/2](447500/4588595) loss:3.257 lr:0.0000100 epoch_Time:26265.0min: [2024-01-04 16:58:05,436][model8_pretrain.py][INFO] Epoch:[0/2](447500/4588595) loss:3.082 lr:0.0000100 epoch_Time:26265.0min: [2024-01-04 16:58:05,436][model8_pretrain.py][INFO] Epoch:[0/2](447500/4588595) loss:3.282 lr:0.0000100 epoch_Time:26265.0min: [2024-01-04 16:58:05,436][model8_pretrain.py][INFO] Epoch:[0/2](447500/4588595) loss:2.448 lr:0.0000100 epoch_Time:26265.0min: [2024-01-04 16:58:05,436][model8_pretrain.py][INFO] Epoch:[0/2](447500/4588595) loss:3.510 lr:0.0000100 epoch_Time:26265.0min: [2024-01-04 16:58:05,436][model8_pretrain.py][INFO] Epoch:[0/2](447500/4588595) loss:2.371 lr:0.0000100 epoch_Time:26265.0min: [2024-01-04 16:58:05,436][model8_pretrain.py][INFO] Epoch:[0/2](447500/4588595) loss:2.555 lr:0.0000100 epoch_Time:26265.0min: [2024-01-04 16:58:05,436][model8_pretrain.py][INFO] Epoch:[0/2](447500/4588595) loss:3.160 lr:0.0000100 epoch_Time:26265.0min: [2024-01-04 16:58:42,369][model8_pretrain.py][INFO] Epoch:[0/2](447600/4588595) loss:2.050 lr:0.0000100 epoch_Time:26265.0min: [2024-01-04 16:58:42,369][model8_pretrain.py][INFO] Epoch:[0/2](447600/4588595) loss:2.568 lr:0.0000100 epoch_Time:26265.0min: [2024-01-04 16:58:42,369][model8_pretrain.py][INFO] Epoch:[0/2](447600/4588595) loss:2.820 lr:0.0000100 epoch_Time:26265.0min: [2024-01-04 16:58:42,369][model8_pretrain.py][INFO] Epoch:[0/2](447600/4588595) loss:3.064 lr:0.0000100 epoch_Time:26265.0min: [2024-01-04 16:58:42,369][model8_pretrain.py][INFO] Epoch:[0/2](447600/4588595) loss:3.099 lr:0.0000100 epoch_Time:26265.0min: [2024-01-04 16:58:42,369][model8_pretrain.py][INFO] Epoch:[0/2](447600/4588595) loss:2.949 lr:0.0000100 epoch_Time:26265.0min: [2024-01-04 16:58:42,369][model8_pretrain.py][INFO] Epoch:[0/2](447600/4588595) loss:3.021 lr:0.0000100 epoch_Time:26265.0min: [2024-01-04 16:58:42,369][model8_pretrain.py][INFO] Epoch:[0/2](447600/4588595) loss:3.127 lr:0.0000100 
epoch_Time:26265.0min: [2024-01-04 16:59:19,295][model8_pretrain.py][INFO] Epoch:[0/2](447700/4588595) loss:3.157 lr:0.0000100 epoch_Time:26264.0min: [2024-01-04 16:59:19,295][model8_pretrain.py][INFO] Epoch:[0/2](447700/4588595) loss:2.795 lr:0.0000100 epoch_Time:26264.0min: [2024-01-04 16:59:19,295][model8_pretrain.py][INFO] Epoch:[0/2](447700/4588595) loss:2.429 lr:0.0000100 epoch_Time:26264.0min: [2024-01-04 16:59:19,295][model8_pretrain.py][INFO] Epoch:[0/2](447700/4588595) loss:3.311 lr:0.0000100 epoch_Time:26264.0min: [2024-01-04 16:59:19,295][model8_pretrain.py][INFO] Epoch:[0/2](447700/4588595) loss:2.967 lr:0.0000100 epoch_Time:26264.0min: [2024-01-04 16:59:19,295][model8_pretrain.py][INFO] Epoch:[0/2](447700/4588595) loss:2.858 lr:0.0000100 epoch_Time:26264.0min: [2024-01-04 16:59:19,295][model8_pretrain.py][INFO] Epoch:[0/2](447700/4588595) loss:2.876 lr:0.0000100 epoch_Time:26264.0min: [2024-01-04 16:59:19,295][model8_pretrain.py][INFO] Epoch:[0/2](447700/4588595) loss:2.637 lr:0.0000100 epoch_Time:26264.0min: [2024-01-04 16:59:56,223][model8_pretrain.py][INFO] Epoch:[0/2](447800/4588595) loss:3.022 lr:0.0000100 epoch_Time:26262.0min: [2024-01-04 16:59:56,223][model8_pretrain.py][INFO] Epoch:[0/2](447800/4588595) loss:3.087 lr:0.0000100 epoch_Time:26262.0min: [2024-01-04 16:59:56,223][model8_pretrain.py][INFO] Epoch:[0/2](447800/4588595) loss:2.347 lr:0.0000100 epoch_Time:26262.0min: [2024-01-04 16:59:56,223][model8_pretrain.py][INFO] Epoch:[0/2](447800/4588595) loss:2.806 lr:0.0000100 epoch_Time:26262.0min: [2024-01-04 16:59:56,223][model8_pretrain.py][INFO] Epoch:[0/2](447800/4588595) loss:2.779 lr:0.0000100 epoch_Time:26262.0min: [2024-01-04 16:59:56,224][model8_pretrain.py][INFO] Epoch:[0/2](447800/4588595) loss:3.113 lr:0.0000100 epoch_Time:26262.0min: [2024-01-04 16:59:56,224][model8_pretrain.py][INFO] Epoch:[0/2](447800/4588595) loss:1.926 lr:0.0000100 epoch_Time:26262.0min: [2024-01-04 16:59:56,224][model8_pretrain.py][INFO] 
Epoch:[0/2](447800/4588595) loss:2.536 lr:0.0000100 epoch_Time:26262.0min: [2024-01-04 17:00:33,161][model8_pretrain.py][INFO] Epoch:[0/2](447900/4588595) loss:2.688 lr:0.0000100 epoch_Time:26262.0min: [2024-01-04 17:00:33,161][model8_pretrain.py][INFO] Epoch:[0/2](447900/4588595) loss:3.027 lr:0.0000100 epoch_Time:26262.0min: [2024-01-04 17:00:33,161][model8_pretrain.py][INFO] Epoch:[0/2](447900/4588595) loss:2.451 lr:0.0000100 epoch_Time:26262.0min: [2024-01-04 17:00:33,161][model8_pretrain.py][INFO] Epoch:[0/2](447900/4588595) loss:2.379 lr:0.0000100 epoch_Time:26262.0min: [2024-01-04 17:00:33,161][model8_pretrain.py][INFO] Epoch:[0/2](447900/4588595) loss:2.798 lr:0.0000100 epoch_Time:26262.0min: [2024-01-04 17:00:33,161][model8_pretrain.py][INFO] Epoch:[0/2](447900/4588595) loss:3.403 lr:0.0000100 epoch_Time:26262.0min: [2024-01-04 17:00:33,161][model8_pretrain.py][INFO] Epoch:[0/2](447900/4588595) loss:2.897 lr:0.0000100 epoch_Time:26262.0min: [2024-01-04 17:00:33,161][model8_pretrain.py][INFO] Epoch:[0/2](447900/4588595) loss:2.689 lr:0.0000100 epoch_Time:26262.0min: [2024-01-04 17:01:10,096][model8_pretrain.py][INFO] Epoch:[0/2](448000/4588595) loss:2.996 lr:0.0000100 epoch_Time:26261.0min: [2024-01-04 17:01:10,096][model8_pretrain.py][INFO] Epoch:[0/2](448000/4588595) loss:3.155 lr:0.0000100 epoch_Time:26261.0min: [2024-01-04 17:01:10,096][model8_pretrain.py][INFO] Epoch:[0/2](448000/4588595) loss:2.371 lr:0.0000100 epoch_Time:26261.0min: [2024-01-04 17:01:10,096][model8_pretrain.py][INFO] Epoch:[0/2](448000/4588595) loss:2.591 lr:0.0000100 epoch_Time:26261.0min: [2024-01-04 17:01:10,096][model8_pretrain.py][INFO] Epoch:[0/2](448000/4588595) loss:2.825 lr:0.0000100 epoch_Time:26261.0min: [2024-01-04 17:01:10,096][model8_pretrain.py][INFO] Epoch:[0/2](448000/4588595) loss:3.403 lr:0.0000100 epoch_Time:26261.0min: [2024-01-04 17:01:10,097][model8_pretrain.py][INFO] Epoch:[0/2](448000/4588595) loss:3.083 lr:0.0000100 epoch_Time:26261.0min: [2024-01-04 
17:01:10,096][model8_pretrain.py][INFO] Epoch:[0/2](448000/4588595) loss:2.424 lr:0.0000100 epoch_Time:26261.0min: [2024-01-04 17:01:47,028][model8_pretrain.py][INFO] Epoch:[0/2](448100/4588595) loss:1.876 lr:0.0000100 epoch_Time:26261.0min: [2024-01-04 17:01:47,028][model8_pretrain.py][INFO] Epoch:[0/2](448100/4588595) loss:2.937 lr:0.0000100 epoch_Time:26261.0min: [2024-01-04 17:01:47,028][model8_pretrain.py][INFO] Epoch:[0/2](448100/4588595) loss:2.968 lr:0.0000100 epoch_Time:26261.0min: [2024-01-04 17:01:47,028][model8_pretrain.py][INFO] Epoch:[0/2](448100/4588595) loss:3.273 lr:0.0000100 epoch_Time:26261.0min: [2024-01-04 17:01:47,028][model8_pretrain.py][INFO] Epoch:[0/2](448100/4588595) loss:3.037 lr:0.0000100 epoch_Time:26261.0min: [2024-01-04 17:01:47,028][model8_pretrain.py][INFO] Epoch:[0/2](448100/4588595) loss:3.151 lr:0.0000100 epoch_Time:26261.0min: [2024-01-04 17:01:47,028][model8_pretrain.py][INFO] Epoch:[0/2](448100/4588595) loss:2.936 lr:0.0000100 epoch_Time:26261.0min: [2024-01-04 17:01:47,028][model8_pretrain.py][INFO] Epoch:[0/2](448100/4588595) loss:2.253 lr:0.0000100 epoch_Time:26261.0min: [2024-01-04 17:02:34,214][model8_pretrain.py][INFO] Epoch:[0/2](448200/4588595) loss:2.835 lr:0.0000100 epoch_Time:26261.0min: [2024-01-04 17:02:34,214][model8_pretrain.py][INFO] Epoch:[0/2](448200/4588595) loss:2.873 lr:0.0000100 epoch_Time:26261.0min: [2024-01-04 17:02:34,214][model8_pretrain.py][INFO] Epoch:[0/2](448200/4588595) loss:3.027 lr:0.0000100 epoch_Time:26261.0min: [2024-01-04 17:02:34,214][model8_pretrain.py][INFO] Epoch:[0/2](448200/4588595) loss:2.997 lr:0.0000100 epoch_Time:26261.0min: [2024-01-04 17:02:34,214][model8_pretrain.py][INFO] Epoch:[0/2](448200/4588595) loss:2.535 lr:0.0000100 epoch_Time:26261.0min: [2024-01-04 17:02:34,214][model8_pretrain.py][INFO] Epoch:[0/2](448200/4588595) loss:3.209 lr:0.0000100 epoch_Time:26261.0min: [2024-01-04 17:02:34,214][model8_pretrain.py][INFO] Epoch:[0/2](448200/4588595) loss:2.700 lr:0.0000100 
epoch_Time:26261.0min: [2024-01-04 17:02:34,214][model8_pretrain.py][INFO] Epoch:[0/2](448200/4588595) loss:3.118 lr:0.0000100 epoch_Time:26261.0min: [2024-01-04 17:03:11,121][model8_pretrain.py][INFO] Epoch:[0/2](448300/4588595) loss:2.539 lr:0.0000100 epoch_Time:26260.0min: [2024-01-04 17:03:11,121][model8_pretrain.py][INFO] Epoch:[0/2](448300/4588595) loss:2.624 lr:0.0000100 epoch_Time:26260.0min: [2024-01-04 17:03:11,122][model8_pretrain.py][INFO] Epoch:[0/2](448300/4588595) loss:2.528 lr:0.0000100 epoch_Time:26260.0min: [2024-01-04 17:03:11,122][model8_pretrain.py][INFO] Epoch:[0/2](448300/4588595) loss:2.878 lr:0.0000100 epoch_Time:26260.0min: [2024-01-04 17:03:11,122][model8_pretrain.py][INFO] Epoch:[0/2](448300/4588595) loss:3.418 lr:0.0000100 epoch_Time:26260.0min: [2024-01-04 17:03:11,122][model8_pretrain.py][INFO] Epoch:[0/2](448300/4588595) loss:2.736 lr:0.0000100 epoch_Time:26260.0min: [2024-01-04 17:03:11,122][model8_pretrain.py][INFO] Epoch:[0/2](448300/4588595) loss:2.755 lr:0.0000100 epoch_Time:26260.0min: [2024-01-04 17:03:11,122][model8_pretrain.py][INFO] Epoch:[0/2](448300/4588595) loss:3.217 lr:0.0000100 epoch_Time:26260.0min: [2024-01-04 17:03:48,041][model8_pretrain.py][INFO] Epoch:[0/2](448400/4588595) loss:3.390 lr:0.0000100 epoch_Time:26259.0min: [2024-01-04 17:03:48,041][model8_pretrain.py][INFO] Epoch:[0/2](448400/4588595) loss:1.898 lr:0.0000100 epoch_Time:26259.0min: [2024-01-04 17:03:48,041][model8_pretrain.py][INFO] Epoch:[0/2](448400/4588595) loss:3.235 lr:0.0000100 epoch_Time:26259.0min: [2024-01-04 17:03:48,041][model8_pretrain.py][INFO] Epoch:[0/2](448400/4588595) loss:3.032 lr:0.0000100 epoch_Time:26259.0min: [2024-01-04 17:03:48,041][model8_pretrain.py][INFO] Epoch:[0/2](448400/4588595) loss:2.259 lr:0.0000100 epoch_Time:26259.0min: [2024-01-04 17:03:48,041][model8_pretrain.py][INFO] Epoch:[0/2](448400/4588595) loss:2.456 lr:0.0000100 epoch_Time:26259.0min: [2024-01-04 17:03:48,041][model8_pretrain.py][INFO] 
Epoch:[0/2](448400/4588595) loss:2.978 lr:0.0000100 epoch_Time:26259.0min: [2024-01-04 17:03:48,041][model8_pretrain.py][INFO] Epoch:[0/2](448400/4588595) loss:2.763 lr:0.0000100 epoch_Time:26259.0min: [2024-01-04 17:04:24,970][model8_pretrain.py][INFO] Epoch:[0/2](448500/4588595) loss:3.131 lr:0.0000100 epoch_Time:26259.0min: [2024-01-04 17:04:24,970][model8_pretrain.py][INFO] Epoch:[0/2](448500/4588595) loss:2.900 lr:0.0000100 epoch_Time:26259.0min: [2024-01-04 17:04:24,970][model8_pretrain.py][INFO] Epoch:[0/2](448500/4588595) loss:2.827 lr:0.0000100 epoch_Time:26259.0min: [2024-01-04 17:04:24,970][model8_pretrain.py][INFO] Epoch:[0/2](448500/4588595) loss:3.230 lr:0.0000100 epoch_Time:26259.0min: [2024-01-04 17:04:24,970][model8_pretrain.py][INFO] Epoch:[0/2](448500/4588595) loss:2.668 lr:0.0000100 epoch_Time:26259.0min: [2024-01-04 17:04:24,970][model8_pretrain.py][INFO] Epoch:[0/2](448500/4588595) loss:2.719 lr:0.0000100 epoch_Time:26259.0min: [2024-01-04 17:04:24,970][model8_pretrain.py][INFO] Epoch:[0/2](448500/4588595) loss:2.724 lr:0.0000100 epoch_Time:26259.0min: [2024-01-04 17:04:24,970][model8_pretrain.py][INFO] Epoch:[0/2](448500/4588595) loss:2.763 lr:0.0000100 epoch_Time:26259.0min: [2024-01-04 17:05:01,899][model8_pretrain.py][INFO] Epoch:[0/2](448600/4588595) loss:2.499 lr:0.0000100 epoch_Time:26258.0min: [2024-01-04 17:05:01,899][model8_pretrain.py][INFO] Epoch:[0/2](448600/4588595) loss:3.133 lr:0.0000100 epoch_Time:26258.0min: [2024-01-04 17:05:01,899][model8_pretrain.py][INFO] Epoch:[0/2](448600/4588595) loss:2.480 lr:0.0000100 epoch_Time:26258.0min: [2024-01-04 17:05:01,900][model8_pretrain.py][INFO] Epoch:[0/2](448600/4588595) loss:2.912 lr:0.0000100 epoch_Time:26258.0min: [2024-01-04 17:05:01,899][model8_pretrain.py][INFO] Epoch:[0/2](448600/4588595) loss:3.507 lr:0.0000100 epoch_Time:26258.0min: [2024-01-04 17:05:01,899][model8_pretrain.py][INFO] Epoch:[0/2](448600/4588595) loss:3.708 lr:0.0000100 epoch_Time:26258.0min: [2024-01-04 
17:05:01,900][model8_pretrain.py][INFO] Epoch:[0/2](448600/4588595) loss:3.239 lr:0.0000100 epoch_Time:26258.0min: [2024-01-04 17:05:01,900][model8_pretrain.py][INFO] Epoch:[0/2](448600/4588595) loss:3.119 lr:0.0000100 epoch_Time:26258.0min: [2024-01-04 17:05:38,837][model8_pretrain.py][INFO] Epoch:[0/2](448700/4588595) loss:2.736 lr:0.0000100 epoch_Time:26257.0min: [2024-01-04 17:05:38,837][model8_pretrain.py][INFO] Epoch:[0/2](448700/4588595) loss:2.835 lr:0.0000100 epoch_Time:26257.0min: [2024-01-04 17:05:38,837][model8_pretrain.py][INFO] Epoch:[0/2](448700/4588595) loss:2.632 lr:0.0000100 epoch_Time:26257.0min: [2024-01-04 17:05:38,837][model8_pretrain.py][INFO] Epoch:[0/2](448700/4588595) loss:2.293 lr:0.0000100 epoch_Time:26257.0min: [2024-01-04 17:05:38,837][model8_pretrain.py][INFO] Epoch:[0/2](448700/4588595) loss:2.587 lr:0.0000100 epoch_Time:26257.0min: [2024-01-04 17:05:38,837][model8_pretrain.py][INFO] Epoch:[0/2](448700/4588595) loss:2.704 lr:0.0000100 epoch_Time:26257.0min: [2024-01-04 17:05:38,837][model8_pretrain.py][INFO] Epoch:[0/2](448700/4588595) loss:3.298 lr:0.0000100 epoch_Time:26257.0min: [2024-01-04 17:05:38,837][model8_pretrain.py][INFO] Epoch:[0/2](448700/4588595) loss:2.672 lr:0.0000100 epoch_Time:26257.0min: [2024-01-04 17:06:15,777][model8_pretrain.py][INFO] Epoch:[0/2](448800/4588595) loss:2.989 lr:0.0000100 epoch_Time:26256.0min: [2024-01-04 17:06:15,777][model8_pretrain.py][INFO] Epoch:[0/2](448800/4588595) loss:3.319 lr:0.0000100 epoch_Time:26256.0min: [2024-01-04 17:06:15,777][model8_pretrain.py][INFO] Epoch:[0/2](448800/4588595) loss:2.808 lr:0.0000100 epoch_Time:26256.0min: [2024-01-04 17:06:15,777][model8_pretrain.py][INFO] Epoch:[0/2](448800/4588595) loss:2.579 lr:0.0000100 epoch_Time:26256.0min: [2024-01-04 17:06:15,777][model8_pretrain.py][INFO] Epoch:[0/2](448800/4588595) loss:2.709 lr:0.0000100 epoch_Time:26256.0min: [2024-01-04 17:06:15,777][model8_pretrain.py][INFO] Epoch:[0/2](448800/4588595) loss:2.919 lr:0.0000100 
epoch_Time:26256.0min: [2024-01-04 17:06:15,777][model8_pretrain.py][INFO] Epoch:[0/2](448800/4588595) loss:3.140 lr:0.0000100 epoch_Time:26256.0min: [2024-01-04 17:06:15,777][model8_pretrain.py][INFO] Epoch:[0/2](448800/4588595) loss:3.007 lr:0.0000100 epoch_Time:26256.0min: [2024-01-04 17:06:52,700][model8_pretrain.py][INFO] Epoch:[0/2](448900/4588595) loss:2.530 lr:0.0000100 epoch_Time:26255.0min: [2024-01-04 17:06:52,700][model8_pretrain.py][INFO] Epoch:[0/2](448900/4588595) loss:3.037 lr:0.0000100 epoch_Time:26255.0min: [2024-01-04 17:06:52,700][model8_pretrain.py][INFO] Epoch:[0/2](448900/4588595) loss:2.779 lr:0.0000100 epoch_Time:26255.0min: [2024-01-04 17:06:52,701][model8_pretrain.py][INFO] Epoch:[0/2](448900/4588595) loss:2.870 lr:0.0000100 epoch_Time:26255.0min: [2024-01-04 17:06:52,701][model8_pretrain.py][INFO] Epoch:[0/2](448900/4588595) loss:2.907 lr:0.0000100 epoch_Time:26255.0min: [2024-01-04 17:06:52,701][model8_pretrain.py][INFO] Epoch:[0/2](448900/4588595) loss:2.886 lr:0.0000100 epoch_Time:26255.0min: [2024-01-04 17:06:52,701][model8_pretrain.py][INFO] Epoch:[0/2](448900/4588595) loss:2.475 lr:0.0000100 epoch_Time:26255.0min: [2024-01-04 17:06:52,701][model8_pretrain.py][INFO] Epoch:[0/2](448900/4588595) loss:2.589 lr:0.0000100 epoch_Time:26255.0min: [2024-01-04 17:07:38,078][model8_pretrain.py][INFO] Epoch:[0/2](449000/4588595) loss:3.268 lr:0.0000100 epoch_Time:26256.0min: [2024-01-04 17:07:38,078][model8_pretrain.py][INFO] Epoch:[0/2](449000/4588595) loss:2.770 lr:0.0000100 epoch_Time:26256.0min: [2024-01-04 17:07:38,078][model8_pretrain.py][INFO] Epoch:[0/2](449000/4588595) loss:2.973 lr:0.0000100 epoch_Time:26256.0min: [2024-01-04 17:07:38,079][model8_pretrain.py][INFO] Epoch:[0/2](449000/4588595) loss:3.194 lr:0.0000100 epoch_Time:26256.0min: [2024-01-04 17:07:38,078][model8_pretrain.py][INFO] Epoch:[0/2](449000/4588595) loss:2.485 lr:0.0000100 epoch_Time:26256.0min: [2024-01-04 17:07:38,079][model8_pretrain.py][INFO] 
Epoch:[0/2](449000/4588595) loss:3.150 lr:0.0000100 epoch_Time:26256.0min: [2024-01-04 17:07:38,079][model8_pretrain.py][INFO] Epoch:[0/2](449000/4588595) loss:2.892 lr:0.0000100 epoch_Time:26256.0min: [2024-01-04 17:07:38,079][model8_pretrain.py][INFO] Epoch:[0/2](449000/4588595) loss:2.805 lr:0.0000100 epoch_Time:26256.0min: [2024-01-04 17:08:16,678][model8_pretrain.py][INFO] Epoch:[0/2](449100/4588595) loss:2.795 lr:0.0000100 epoch_Time:26255.0min: [2024-01-04 17:08:16,678][model8_pretrain.py][INFO] Epoch:[0/2](449100/4588595) loss:2.900 lr:0.0000100 epoch_Time:26255.0min: [2024-01-04 17:08:16,678][model8_pretrain.py][INFO] Epoch:[0/2](449100/4588595) loss:2.746 lr:0.0000100 epoch_Time:26255.0min: [2024-01-04 17:08:16,678][model8_pretrain.py][INFO] Epoch:[0/2](449100/4588595) loss:3.110 lr:0.0000100 epoch_Time:26255.0min: [2024-01-04 17:08:16,678][model8_pretrain.py][INFO] Epoch:[0/2](449100/4588595) loss:3.132 lr:0.0000100 epoch_Time:26255.0min: [2024-01-04 17:08:16,678][model8_pretrain.py][INFO] Epoch:[0/2](449100/4588595) loss:3.045 lr:0.0000100 epoch_Time:26255.0min: [2024-01-04 17:08:16,678][model8_pretrain.py][INFO] Epoch:[0/2](449100/4588595) loss:2.831 lr:0.0000100 epoch_Time:26255.0min: [2024-01-04 17:08:16,678][model8_pretrain.py][INFO] Epoch:[0/2](449100/4588595) loss:2.557 lr:0.0000100 epoch_Time:26255.0min: [2024-01-04 17:08:53,607][model8_pretrain.py][INFO] Epoch:[0/2](449200/4588595) loss:3.093 lr:0.0000100 epoch_Time:26254.0min: [2024-01-04 17:08:53,607][model8_pretrain.py][INFO] Epoch:[0/2](449200/4588595) loss:2.568 lr:0.0000100 epoch_Time:26254.0min: [2024-01-04 17:08:53,607][model8_pretrain.py][INFO] Epoch:[0/2](449200/4588595) loss:2.903 lr:0.0000100 epoch_Time:26254.0min: [2024-01-04 17:08:53,607][model8_pretrain.py][INFO] Epoch:[0/2](449200/4588595) loss:3.661 lr:0.0000100 epoch_Time:26254.0min: [2024-01-04 17:08:53,607][model8_pretrain.py][INFO] Epoch:[0/2](449200/4588595) loss:2.851 lr:0.0000100 epoch_Time:26254.0min: [2024-01-04 
17:08:53,607][model8_pretrain.py][INFO] Epoch:[0/2](449200/4588595) loss:2.704 lr:0.0000100 epoch_Time:26254.0min: [2024-01-04 17:08:53,607][model8_pretrain.py][INFO] Epoch:[0/2](449200/4588595) loss:2.752 lr:0.0000100 epoch_Time:26254.0min: [2024-01-04 17:08:53,608][model8_pretrain.py][INFO] Epoch:[0/2](449200/4588595) loss:2.130 lr:0.0000100 epoch_Time:26254.0min: [2024-01-04 17:09:30,540][model8_pretrain.py][INFO] Epoch:[0/2](449300/4588595) loss:2.997 lr:0.0000100 epoch_Time:26254.0min: [2024-01-04 17:09:30,540][model8_pretrain.py][INFO] Epoch:[0/2](449300/4588595) loss:2.921 lr:0.0000100 epoch_Time:26254.0min: [2024-01-04 17:09:30,540][model8_pretrain.py][INFO] Epoch:[0/2](449300/4588595) loss:2.887 lr:0.0000100 epoch_Time:26254.0min: [2024-01-04 17:09:30,540][model8_pretrain.py][INFO] Epoch:[0/2](449300/4588595) loss:2.639 lr:0.0000100 epoch_Time:26254.0min: [2024-01-04 17:09:30,540][model8_pretrain.py][INFO] Epoch:[0/2](449300/4588595) loss:3.254 lr:0.0000100 epoch_Time:26254.0min: [2024-01-04 17:09:30,540][model8_pretrain.py][INFO] Epoch:[0/2](449300/4588595) loss:2.895 lr:0.0000100 epoch_Time:26254.0min: [2024-01-04 17:09:30,540][model8_pretrain.py][INFO] Epoch:[0/2](449300/4588595) loss:2.586 lr:0.0000100 epoch_Time:26254.0min: [2024-01-04 17:09:30,540][model8_pretrain.py][INFO] Epoch:[0/2](449300/4588595) loss:2.883 lr:0.0000100 epoch_Time:26254.0min: [2024-01-04 17:10:07,480][model8_pretrain.py][INFO] Epoch:[0/2](449400/4588595) loss:2.919 lr:0.0000100 epoch_Time:26253.0min: [2024-01-04 17:10:07,480][model8_pretrain.py][INFO] Epoch:[0/2](449400/4588595) loss:2.349 lr:0.0000100 epoch_Time:26253.0min: [2024-01-04 17:10:07,480][model8_pretrain.py][INFO] Epoch:[0/2](449400/4588595) loss:2.940 lr:0.0000100 epoch_Time:26253.0min: [2024-01-04 17:10:07,480][model8_pretrain.py][INFO] Epoch:[0/2](449400/4588595) loss:2.950 lr:0.0000100 epoch_Time:26253.0min: [2024-01-04 17:10:07,480][model8_pretrain.py][INFO] Epoch:[0/2](449400/4588595) loss:3.043 lr:0.0000100 
epoch_Time:26253.0min: [2024-01-04 17:10:07,480][model8_pretrain.py][INFO] Epoch:[0/2](449400/4588595) loss:2.831 lr:0.0000100 epoch_Time:26253.0min: [2024-01-04 17:10:07,480][model8_pretrain.py][INFO] Epoch:[0/2](449400/4588595) loss:3.296 lr:0.0000100 epoch_Time:26253.0min: [2024-01-04 17:10:07,480][model8_pretrain.py][INFO] Epoch:[0/2](449400/4588595) loss:2.644 lr:0.0000100 epoch_Time:26253.0min: [2024-01-04 17:10:44,413][model8_pretrain.py][INFO] Epoch:[0/2](449500/4588595) loss:3.111 lr:0.0000100 epoch_Time:26253.0min: [2024-01-04 17:10:44,413][model8_pretrain.py][INFO] Epoch:[0/2](449500/4588595) loss:2.819 lr:0.0000100 epoch_Time:26253.0min: [2024-01-04 17:10:44,413][model8_pretrain.py][INFO] Epoch:[0/2](449500/4588595) loss:3.130 lr:0.0000100 epoch_Time:26253.0min: [2024-01-04 17:10:44,413][model8_pretrain.py][INFO] Epoch:[0/2](449500/4588595) loss:2.531 lr:0.0000100 epoch_Time:26253.0min: [2024-01-04 17:10:44,414][model8_pretrain.py][INFO] Epoch:[0/2](449500/4588595) loss:2.495 lr:0.0000100 epoch_Time:26253.0min: [2024-01-04 17:10:44,414][model8_pretrain.py][INFO] Epoch:[0/2](449500/4588595) loss:3.007 lr:0.0000100 epoch_Time:26253.0min: [2024-01-04 17:10:44,414][model8_pretrain.py][INFO] Epoch:[0/2](449500/4588595) loss:2.742 lr:0.0000100 epoch_Time:26253.0min: [2024-01-04 17:10:44,414][model8_pretrain.py][INFO] Epoch:[0/2](449500/4588595) loss:2.284 lr:0.0000100 epoch_Time:26253.0min: [2024-01-04 17:11:21,347][model8_pretrain.py][INFO] Epoch:[0/2](449600/4588595) loss:2.891 lr:0.0000100 epoch_Time:26251.0min: [2024-01-04 17:11:21,347][model8_pretrain.py][INFO] Epoch:[0/2](449600/4588595) loss:2.953 lr:0.0000100 epoch_Time:26251.0min: [2024-01-04 17:11:21,347][model8_pretrain.py][INFO] Epoch:[0/2](449600/4588595) loss:3.251 lr:0.0000100 epoch_Time:26251.0min: [2024-01-04 17:11:21,347][model8_pretrain.py][INFO] Epoch:[0/2](449600/4588595) loss:3.075 lr:0.0000100 epoch_Time:26251.0min: [2024-01-04 17:11:21,347][model8_pretrain.py][INFO] 
Epoch:[0/2](449600/4588595) loss:3.015 lr:0.0000100 epoch_Time:26251.0min: [2024-01-04 17:11:21,347][model8_pretrain.py][INFO] Epoch:[0/2](449600/4588595) loss:2.797 lr:0.0000100 epoch_Time:26251.0min: [2024-01-04 17:11:21,348][model8_pretrain.py][INFO] Epoch:[0/2](449600/4588595) loss:2.420 lr:0.0000100 epoch_Time:26251.0min: [2024-01-04 17:11:21,348][model8_pretrain.py][INFO] Epoch:[0/2](449600/4588595) loss:2.172 lr:0.0000100 epoch_Time:26251.0min: [2024-01-04 17:11:58,272][model8_pretrain.py][INFO] Epoch:[0/2](449700/4588595) loss:2.769 lr:0.0000100 epoch_Time:26250.0min: [2024-01-04 17:11:58,272][model8_pretrain.py][INFO] Epoch:[0/2](449700/4588595) loss:2.931 lr:0.0000100 epoch_Time:26250.0min: [2024-01-04 17:11:58,272][model8_pretrain.py][INFO] Epoch:[0/2](449700/4588595) loss:2.325 lr:0.0000100 epoch_Time:26250.0min: [2024-01-04 17:11:58,272][model8_pretrain.py][INFO] Epoch:[0/2](449700/4588595) loss:3.224 lr:0.0000100 epoch_Time:26250.0min: [2024-01-04 17:11:58,273][model8_pretrain.py][INFO] Epoch:[0/2](449700/4588595) loss:3.380 lr:0.0000100 epoch_Time:26250.0min: [2024-01-04 17:11:58,273][model8_pretrain.py][INFO] Epoch:[0/2](449700/4588595) loss:3.084 lr:0.0000100 epoch_Time:26250.0min: [2024-01-04 17:11:58,273][model8_pretrain.py][INFO] Epoch:[0/2](449700/4588595) loss:2.851 lr:0.0000100 epoch_Time:26250.0min: [2024-01-04 17:11:58,273][model8_pretrain.py][INFO] Epoch:[0/2](449700/4588595) loss:2.312 lr:0.0000100 epoch_Time:26250.0min: [2024-01-04 17:12:43,755][model8_pretrain.py][INFO] Epoch:[0/2](449800/4588595) loss:2.813 lr:0.0000100 epoch_Time:26251.0min: [2024-01-04 17:12:43,755][model8_pretrain.py][INFO] Epoch:[0/2](449800/4588595) loss:3.140 lr:0.0000100 epoch_Time:26251.0min: [2024-01-04 17:12:43,755][model8_pretrain.py][INFO] Epoch:[0/2](449800/4588595) loss:2.373 lr:0.0000100 epoch_Time:26251.0min: [2024-01-04 17:12:43,755][model8_pretrain.py][INFO] Epoch:[0/2](449800/4588595) loss:2.618 lr:0.0000100 epoch_Time:26251.0min: [2024-01-04 
17:12:43,755][model8_pretrain.py][INFO] Epoch:[0/2](449800/4588595) loss:2.235 lr:0.0000100 epoch_Time:26251.0min: [2024-01-04 17:12:43,755][model8_pretrain.py][INFO] Epoch:[0/2](449800/4588595) loss:3.204 lr:0.0000100 epoch_Time:26251.0min: [2024-01-04 17:12:43,755][model8_pretrain.py][INFO] Epoch:[0/2](449800/4588595) loss:2.989 lr:0.0000100 epoch_Time:26251.0min: [2024-01-04 17:12:43,755][model8_pretrain.py][INFO] Epoch:[0/2](449800/4588595) loss:3.216 lr:0.0000100 epoch_Time:26251.0min: [2024-01-04 17:13:22,358][model8_pretrain.py][INFO] Epoch:[0/2](449900/4588595) loss:2.776 lr:0.0000100 epoch_Time:26251.0min: [2024-01-04 17:13:22,358][model8_pretrain.py][INFO] Epoch:[0/2](449900/4588595) loss:3.163 lr:0.0000100 epoch_Time:26250.0min: [2024-01-04 17:13:22,358][model8_pretrain.py][INFO] Epoch:[0/2](449900/4588595) loss:3.097 lr:0.0000100 epoch_Time:26250.0min: [2024-01-04 17:13:22,358][model8_pretrain.py][INFO] Epoch:[0/2](449900/4588595) loss:2.961 lr:0.0000100 epoch_Time:26251.0min: [2024-01-04 17:13:22,358][model8_pretrain.py][INFO] Epoch:[0/2](449900/4588595) loss:3.009 lr:0.0000100 epoch_Time:26251.0min: [2024-01-04 17:13:22,358][model8_pretrain.py][INFO] Epoch:[0/2](449900/4588595) loss:2.393 lr:0.0000100 epoch_Time:26251.0min: [2024-01-04 17:13:22,358][model8_pretrain.py][INFO] Epoch:[0/2](449900/4588595) loss:2.631 lr:0.0000100 epoch_Time:26251.0min: [2024-01-04 17:13:22,358][model8_pretrain.py][INFO] Epoch:[0/2](449900/4588595) loss:3.161 lr:0.0000100 epoch_Time:26251.0min: [2024-01-04 17:13:59,290][model8_pretrain.py][INFO] Epoch:[0/2](450000/4588595) loss:2.950 lr:0.0000100 epoch_Time:26249.0min: [2024-01-04 17:13:59,290][model8_pretrain.py][INFO] Epoch:[0/2](450000/4588595) loss:2.323 lr:0.0000100 epoch_Time:26249.0min: [2024-01-04 17:13:59,290][model8_pretrain.py][INFO] Epoch:[0/2](450000/4588595) loss:3.264 lr:0.0000100 epoch_Time:26249.0min: [2024-01-04 17:13:59,290][model8_pretrain.py][INFO] Epoch:[0/2](450000/4588595) loss:2.590 lr:0.0000100 
epoch_Time:26249.0min: [2024-01-04 17:13:59,290][model8_pretrain.py][INFO] Epoch:[0/2](450000/4588595) loss:2.462 lr:0.0000100 epoch_Time:26249.0min: [2024-01-04 17:13:59,290][model8_pretrain.py][INFO] Epoch:[0/2](450000/4588595) loss:2.903 lr:0.0000100 epoch_Time:26249.0min: [2024-01-04 17:13:59,291][model8_pretrain.py][INFO] Epoch:[0/2](450000/4588595) loss:3.275 lr:0.0000100 epoch_Time:26249.0min: [2024-01-04 17:13:59,291][model8_pretrain.py][INFO] Epoch:[0/2](450000/4588595) loss:3.173 lr:0.0000100 epoch_Time:26249.0min: [2024-01-04 17:14:36,225][model8_pretrain.py][INFO] Epoch:[0/2](450100/4588595) loss:2.369 lr:0.0000100 epoch_Time:26249.0min: [2024-01-04 17:14:36,225][model8_pretrain.py][INFO] Epoch:[0/2](450100/4588595) loss:3.362 lr:0.0000100 epoch_Time:26249.0min: [2024-01-04 17:14:36,225][model8_pretrain.py][INFO] Epoch:[0/2](450100/4588595) loss:2.626 lr:0.0000100 epoch_Time:26249.0min: [2024-01-04 17:14:36,225][model8_pretrain.py][INFO] Epoch:[0/2](450100/4588595) loss:2.570 lr:0.0000100 epoch_Time:26249.0min: [2024-01-04 17:14:36,225][model8_pretrain.py][INFO] Epoch:[0/2](450100/4588595) loss:2.788 lr:0.0000100 epoch_Time:26249.0min: [2024-01-04 17:14:36,226][model8_pretrain.py][INFO] Epoch:[0/2](450100/4588595) loss:2.866 lr:0.0000100 epoch_Time:26249.0min: [2024-01-04 17:14:36,225][model8_pretrain.py][INFO] Epoch:[0/2](450100/4588595) loss:2.771 lr:0.0000100 epoch_Time:26249.0min: [2024-01-04 17:14:36,226][model8_pretrain.py][INFO] Epoch:[0/2](450100/4588595) loss:2.651 lr:0.0000100 epoch_Time:26249.0min: [2024-01-04 17:15:13,172][model8_pretrain.py][INFO] Epoch:[0/2](450200/4588595) loss:3.027 lr:0.0000100 epoch_Time:26248.0min: [2024-01-04 17:15:13,172][model8_pretrain.py][INFO] Epoch:[0/2](450200/4588595) loss:3.049 lr:0.0000100 epoch_Time:26248.0min: [2024-01-04 17:15:13,173][model8_pretrain.py][INFO] Epoch:[0/2](450200/4588595) loss:2.785 lr:0.0000100 epoch_Time:26248.0min: [2024-01-04 17:15:13,173][model8_pretrain.py][INFO] 
Epoch:[0/2](450200/4588595) loss:3.047 lr:0.0000100 epoch_Time:26248.0min: [2024-01-04 17:15:13,173][model8_pretrain.py][INFO] Epoch:[0/2](450200/4588595) loss:3.008 lr:0.0000100 epoch_Time:26248.0min: [2024-01-04 17:15:13,173][model8_pretrain.py][INFO] Epoch:[0/2](450200/4588595) loss:2.436 lr:0.0000100 epoch_Time:26248.0min: [2024-01-04 17:15:13,173][model8_pretrain.py][INFO] Epoch:[0/2](450200/4588595) loss:3.099 lr:0.0000100 epoch_Time:26248.0min: [2024-01-04 17:15:13,173][model8_pretrain.py][INFO] Epoch:[0/2](450200/4588595) loss:3.076 lr:0.0000100 epoch_Time:26248.0min: [2024-01-04 17:15:50,107][model8_pretrain.py][INFO] Epoch:[0/2](450300/4588595) loss:2.907 lr:0.0000100 epoch_Time:26247.0min: [2024-01-04 17:15:50,107][model8_pretrain.py][INFO] Epoch:[0/2](450300/4588595) loss:2.522 lr:0.0000100 epoch_Time:26247.0min: [2024-01-04 17:15:50,107][model8_pretrain.py][INFO] Epoch:[0/2](450300/4588595) loss:2.962 lr:0.0000100 epoch_Time:26247.0min: [2024-01-04 17:15:50,107][model8_pretrain.py][INFO] Epoch:[0/2](450300/4588595) loss:2.778 lr:0.0000100 epoch_Time:26247.0min: [2024-01-04 17:15:50,107][model8_pretrain.py][INFO] Epoch:[0/2](450300/4588595) loss:2.970 lr:0.0000100 epoch_Time:26247.0min: [2024-01-04 17:15:50,107][model8_pretrain.py][INFO] Epoch:[0/2](450300/4588595) loss:2.783 lr:0.0000100 epoch_Time:26247.0min: [2024-01-04 17:15:50,107][model8_pretrain.py][INFO] Epoch:[0/2](450300/4588595) loss:2.690 lr:0.0000100 epoch_Time:26247.0min: [2024-01-04 17:15:50,107][model8_pretrain.py][INFO] Epoch:[0/2](450300/4588595) loss:3.274 lr:0.0000100 epoch_Time:26247.0min: [2024-01-04 17:16:27,044][model8_pretrain.py][INFO] Epoch:[0/2](450400/4588595) loss:2.759 lr:0.0000100 epoch_Time:26247.0min: [2024-01-04 17:16:27,044][model8_pretrain.py][INFO] Epoch:[0/2](450400/4588595) loss:2.439 lr:0.0000100 epoch_Time:26247.0min: [2024-01-04 17:16:27,044][model8_pretrain.py][INFO] Epoch:[0/2](450400/4588595) loss:3.257 lr:0.0000100 epoch_Time:26247.0min: [2024-01-04 
17:16:27,044][model8_pretrain.py][INFO] Epoch:[0/2](450400/4588595) loss:2.856 lr:0.0000100 epoch_Time:26247.0min: [2024-01-04 17:16:27,044][model8_pretrain.py][INFO] Epoch:[0/2](450400/4588595) loss:3.134 lr:0.0000100 epoch_Time:26247.0min: [2024-01-04 17:16:27,044][model8_pretrain.py][INFO] Epoch:[0/2](450400/4588595) loss:2.534 lr:0.0000100 epoch_Time:26247.0min: [2024-01-04 17:16:27,044][model8_pretrain.py][INFO] Epoch:[0/2](450400/4588595) loss:2.727 lr:0.0000100 epoch_Time:26247.0min: [2024-01-04 17:16:27,044][model8_pretrain.py][INFO] Epoch:[0/2](450400/4588595) loss:2.568 lr:0.0000100 epoch_Time:26247.0min: [2024-01-04 17:17:03,986][model8_pretrain.py][INFO] Epoch:[0/2](450500/4588595) loss:2.456 lr:0.0000100 epoch_Time:26245.0min: [2024-01-04 17:17:03,986][model8_pretrain.py][INFO] Epoch:[0/2](450500/4588595) loss:2.632 lr:0.0000100 epoch_Time:26245.0min: [2024-01-04 17:17:03,986][model8_pretrain.py][INFO] Epoch:[0/2](450500/4588595) loss:2.156 lr:0.0000100 epoch_Time:26245.0min: [2024-01-04 17:17:03,986][model8_pretrain.py][INFO] Epoch:[0/2](450500/4588595) loss:2.417 lr:0.0000100 epoch_Time:26245.0min: [2024-01-04 17:17:03,986][model8_pretrain.py][INFO] Epoch:[0/2](450500/4588595) loss:3.093 lr:0.0000100 epoch_Time:26245.0min: [2024-01-04 17:17:03,986][model8_pretrain.py][INFO] Epoch:[0/2](450500/4588595) loss:2.949 lr:0.0000100 epoch_Time:26245.0min: [2024-01-04 17:17:03,987][model8_pretrain.py][INFO] Epoch:[0/2](450500/4588595) loss:2.633 lr:0.0000100 epoch_Time:26245.0min: [2024-01-04 17:17:03,987][model8_pretrain.py][INFO] Epoch:[0/2](450500/4588595) loss:3.074 lr:0.0000100 epoch_Time:26245.0min: [2024-01-04 17:17:46,104][model8_pretrain.py][INFO] Epoch:[0/2](450600/4588595) loss:2.486 lr:0.0000100 epoch_Time:26246.0min: [2024-01-04 17:17:46,104][model8_pretrain.py][INFO] Epoch:[0/2](450600/4588595) loss:3.016 lr:0.0000100 epoch_Time:26246.0min: [2024-01-04 17:17:46,104][model8_pretrain.py][INFO] Epoch:[0/2](450600/4588595) loss:2.943 lr:0.0000100 
epoch_Time:26246.0min: [2024-01-04 17:17:46,104][model8_pretrain.py][INFO] Epoch:[0/2](450600/4588595) loss:3.452 lr:0.0000100 epoch_Time:26246.0min: [2024-01-04 17:17:46,104][model8_pretrain.py][INFO] Epoch:[0/2](450600/4588595) loss:3.518 lr:0.0000100 epoch_Time:26246.0min: [2024-01-04 17:17:46,105][model8_pretrain.py][INFO] Epoch:[0/2](450600/4588595) loss:2.746 lr:0.0000100 epoch_Time:26246.0min: [2024-01-04 17:17:46,105][model8_pretrain.py][INFO] Epoch:[0/2](450600/4588595) loss:2.740 lr:0.0000100 epoch_Time:26246.0min: [2024-01-04 17:17:47,779][model8_pretrain.py][INFO] Epoch:[0/2](450600/4588595) loss:2.081 lr:0.0000100 epoch_Time:26245.0min: [2024-01-04 17:18:28,167][model8_pretrain.py][INFO] Epoch:[0/2](450700/4588595) loss:3.086 lr:0.0000100 epoch_Time:26246.0min: [2024-01-04 17:18:28,168][model8_pretrain.py][INFO] Epoch:[0/2](450700/4588595) loss:2.974 lr:0.0000100 epoch_Time:26246.0min: [2024-01-04 17:18:28,168][model8_pretrain.py][INFO] Epoch:[0/2](450700/4588595) loss:2.367 lr:0.0000100 epoch_Time:26246.0min: [2024-01-04 17:18:28,168][model8_pretrain.py][INFO] Epoch:[0/2](450700/4588595) loss:3.405 lr:0.0000100 epoch_Time:26246.0min: [2024-01-04 17:18:28,168][model8_pretrain.py][INFO] Epoch:[0/2](450700/4588595) loss:3.232 lr:0.0000100 epoch_Time:26246.0min: [2024-01-04 17:18:28,168][model8_pretrain.py][INFO] Epoch:[0/2](450700/4588595) loss:2.997 lr:0.0000100 epoch_Time:26246.0min: [2024-01-04 17:18:28,168][model8_pretrain.py][INFO] Epoch:[0/2](450700/4588595) loss:2.901 lr:0.0000100 epoch_Time:26246.0min: [2024-01-04 17:18:28,168][model8_pretrain.py][INFO] Epoch:[0/2](450700/4588595) loss:3.309 lr:0.0000100 epoch_Time:26246.0min: [2024-01-04 17:19:05,098][model8_pretrain.py][INFO] Epoch:[0/2](450800/4588595) loss:2.490 lr:0.0000100 epoch_Time:26245.0min: [2024-01-04 17:19:05,098][model8_pretrain.py][INFO] Epoch:[0/2](450800/4588595) loss:3.020 lr:0.0000100 epoch_Time:26245.0min: [2024-01-04 17:19:05,098][model8_pretrain.py][INFO] 
Epoch:[0/2](450800/4588595) loss:2.877 lr:0.0000100 epoch_Time:26245.0min: [2024-01-04 17:19:05,098][model8_pretrain.py][INFO] Epoch:[0/2](450800/4588595) loss:2.365 lr:0.0000100 epoch_Time:26245.0min: [2024-01-04 17:19:05,098][model8_pretrain.py][INFO] Epoch:[0/2](450800/4588595) loss:2.376 lr:0.0000100 epoch_Time:26245.0min: [2024-01-04 17:19:05,098][model8_pretrain.py][INFO] Epoch:[0/2](450800/4588595) loss:3.138 lr:0.0000100 epoch_Time:26245.0min: [2024-01-04 17:19:05,098][model8_pretrain.py][INFO] Epoch:[0/2](450800/4588595) loss:2.570 lr:0.0000100 epoch_Time:26245.0min: [2024-01-04 17:19:05,099][model8_pretrain.py][INFO] Epoch:[0/2](450800/4588595) loss:2.733 lr:0.0000100 epoch_Time:26245.0min: [2024-01-04 17:19:42,045][model8_pretrain.py][INFO] Epoch:[0/2](450900/4588595) loss:2.907 lr:0.0000100 epoch_Time:26244.0min: [2024-01-04 17:19:42,045][model8_pretrain.py][INFO] Epoch:[0/2](450900/4588595) loss:2.234 lr:0.0000100 epoch_Time:26244.0min: [2024-01-04 17:19:42,045][model8_pretrain.py][INFO] Epoch:[0/2](450900/4588595) loss:2.706 lr:0.0000100 epoch_Time:26244.0min: [2024-01-04 17:19:42,045][model8_pretrain.py][INFO] Epoch:[0/2](450900/4588595) loss:2.818 lr:0.0000100 epoch_Time:26244.0min: [2024-01-04 17:19:42,045][model8_pretrain.py][INFO] Epoch:[0/2](450900/4588595) loss:2.475 lr:0.0000100 epoch_Time:26244.0min: [2024-01-04 17:19:42,046][model8_pretrain.py][INFO] Epoch:[0/2](450900/4588595) loss:2.813 lr:0.0000100 epoch_Time:26244.0min: [2024-01-04 17:19:42,046][model8_pretrain.py][INFO] Epoch:[0/2](450900/4588595) loss:3.249 lr:0.0000100 epoch_Time:26244.0min: [2024-01-04 17:19:42,046][model8_pretrain.py][INFO] Epoch:[0/2](450900/4588595) loss:2.987 lr:0.0000100 epoch_Time:26244.0min: [2024-01-04 17:20:18,961][model8_pretrain.py][INFO] Epoch:[0/2](451000/4588595) loss:3.275 lr:0.0000100 epoch_Time:26243.0min: [2024-01-04 17:20:18,961][model8_pretrain.py][INFO] Epoch:[0/2](451000/4588595) loss:2.934 lr:0.0000100 epoch_Time:26243.0min: [2024-01-04 
17:20:18,961][model8_pretrain.py][INFO] Epoch:[0/2](451000/4588595) loss:2.798 lr:0.0000100 epoch_Time:26243.0min: [2024-01-04 17:20:18,961][model8_pretrain.py][INFO] Epoch:[0/2](451000/4588595) loss:2.535 lr:0.0000100 epoch_Time:26243.0min: [2024-01-04 17:20:18,961][model8_pretrain.py][INFO] Epoch:[0/2](451000/4588595) loss:2.718 lr:0.0000100 epoch_Time:26243.0min: [2024-01-04 17:20:18,961][model8_pretrain.py][INFO] Epoch:[0/2](451000/4588595) loss:2.490 lr:0.0000100 epoch_Time:26243.0min: [2024-01-04 17:20:18,962][model8_pretrain.py][INFO] Epoch:[0/2](451000/4588595) loss:2.739 lr:0.0000100 epoch_Time:26243.0min: [2024-01-04 17:20:18,962][model8_pretrain.py][INFO] Epoch:[0/2](451000/4588595) loss:2.248 lr:0.0000100 epoch_Time:26243.0min: [2024-01-04 17:20:55,901][model8_pretrain.py][INFO] Epoch:[0/2](451100/4588595) loss:2.777 lr:0.0000100 epoch_Time:26242.0min: [2024-01-04 17:20:55,901][model8_pretrain.py][INFO] Epoch:[0/2](451100/4588595) loss:3.383 lr:0.0000100 epoch_Time:26242.0min: [2024-01-04 17:20:55,901][model8_pretrain.py][INFO] Epoch:[0/2](451100/4588595) loss:2.729 lr:0.0000100 epoch_Time:26242.0min: [2024-01-04 17:20:55,901][model8_pretrain.py][INFO] Epoch:[0/2](451100/4588595) loss:3.176 lr:0.0000100 epoch_Time:26242.0min: [2024-01-04 17:20:55,901][model8_pretrain.py][INFO] Epoch:[0/2](451100/4588595) loss:3.229 lr:0.0000100 epoch_Time:26242.0min: [2024-01-04 17:20:55,901][model8_pretrain.py][INFO] Epoch:[0/2](451100/4588595) loss:3.180 lr:0.0000100 epoch_Time:26242.0min: [2024-01-04 17:20:55,901][model8_pretrain.py][INFO] Epoch:[0/2](451100/4588595) loss:2.528 lr:0.0000100 epoch_Time:26242.0min: [2024-01-04 17:20:55,901][model8_pretrain.py][INFO] Epoch:[0/2](451100/4588595) loss:2.779 lr:0.0000100 epoch_Time:26242.0min: [2024-01-04 17:21:32,840][model8_pretrain.py][INFO] Epoch:[0/2](451200/4588595) loss:2.097 lr:0.0000100 epoch_Time:26242.0min: [2024-01-04 17:21:32,840][model8_pretrain.py][INFO] Epoch:[0/2](451200/4588595) loss:2.850 lr:0.0000100 
epoch_Time:26242.0min: [2024-01-04 17:21:32,840][model8_pretrain.py][INFO] Epoch:[0/2](451200/4588595) loss:2.800 lr:0.0000100 epoch_Time:26242.0min: [2024-01-04 17:21:32,840][model8_pretrain.py][INFO] Epoch:[0/2](451200/4588595) loss:3.091 lr:0.0000100 epoch_Time:26242.0min: [2024-01-04 17:21:32,840][model8_pretrain.py][INFO] Epoch:[0/2](451200/4588595) loss:2.876 lr:0.0000100 epoch_Time:26242.0min: [2024-01-04 17:21:32,840][model8_pretrain.py][INFO] Epoch:[0/2](451200/4588595) loss:2.874 lr:0.0000100 epoch_Time:26242.0min: [2024-01-04 17:21:32,840][model8_pretrain.py][INFO] Epoch:[0/2](451200/4588595) loss:3.047 lr:0.0000100 epoch_Time:26242.0min: [2024-01-04 17:21:32,840][model8_pretrain.py][INFO] Epoch:[0/2](451200/4588595) loss:2.821 lr:0.0000100 epoch_Time:26242.0min: [2024-01-04 17:22:09,775][model8_pretrain.py][INFO] Epoch:[0/2](451300/4588595) loss:2.904 lr:0.0000100 epoch_Time:26241.0min: [2024-01-04 17:22:09,775][model8_pretrain.py][INFO] Epoch:[0/2](451300/4588595) loss:2.892 lr:0.0000100 epoch_Time:26241.0min: [2024-01-04 17:22:09,775][model8_pretrain.py][INFO] Epoch:[0/2](451300/4588595) loss:2.766 lr:0.0000100 epoch_Time:26241.0min: [2024-01-04 17:22:09,775][model8_pretrain.py][INFO] Epoch:[0/2](451300/4588595) loss:2.744 lr:0.0000100 epoch_Time:26241.0min: [2024-01-04 17:22:09,775][model8_pretrain.py][INFO] Epoch:[0/2](451300/4588595) loss:2.879 lr:0.0000100 epoch_Time:26241.0min: [2024-01-04 17:22:09,775][model8_pretrain.py][INFO] Epoch:[0/2](451300/4588595) loss:3.097 lr:0.0000100 epoch_Time:26241.0min: [2024-01-04 17:22:09,775][model8_pretrain.py][INFO] Epoch:[0/2](451300/4588595) loss:2.751 lr:0.0000100 epoch_Time:26241.0min: [2024-01-04 17:22:09,775][model8_pretrain.py][INFO] Epoch:[0/2](451300/4588595) loss:3.266 lr:0.0000100 epoch_Time:26241.0min: [2024-01-04 17:22:51,896][model8_pretrain.py][INFO] Epoch:[0/2](451400/4588595) loss:2.543 lr:0.0000100 epoch_Time:26240.0min: [2024-01-04 17:22:51,896][model8_pretrain.py][INFO] 
Epoch:[0/2](451400/4588595) loss:2.345 lr:0.0000100 epoch_Time:26240.0min: [2024-01-04 17:22:51,896][model8_pretrain.py][INFO] Epoch:[0/2](451400/4588595) loss:2.843 lr:0.0000100 epoch_Time:26240.0min: [2024-01-04 17:22:51,896][model8_pretrain.py][INFO] Epoch:[0/2](451400/4588595) loss:2.560 lr:0.0000100 epoch_Time:26240.0min: [2024-01-04 17:22:51,896][model8_pretrain.py][INFO] Epoch:[0/2](451400/4588595) loss:2.921 lr:0.0000100 epoch_Time:26240.0min: [2024-01-04 17:22:51,896][model8_pretrain.py][INFO] Epoch:[0/2](451400/4588595) loss:2.997 lr:0.0000100 epoch_Time:26240.0min: [2024-01-04 17:22:51,896][model8_pretrain.py][INFO] Epoch:[0/2](451400/4588595) loss:2.639 lr:0.0000100 epoch_Time:26240.0min: [2024-01-04 17:22:51,896][model8_pretrain.py][INFO] Epoch:[0/2](451400/4588595) loss:2.731 lr:0.0000100 epoch_Time:26240.0min: [2024-01-04 17:23:33,924][model8_pretrain.py][INFO] Epoch:[0/2](451500/4588595) loss:2.970 lr:0.0000100 epoch_Time:26241.0min: [2024-01-04 17:23:33,924][model8_pretrain.py][INFO] Epoch:[0/2](451500/4588595) loss:3.119 lr:0.0000100 epoch_Time:26241.0min: [2024-01-04 17:23:33,924][model8_pretrain.py][INFO] Epoch:[0/2](451500/4588595) loss:2.891 lr:0.0000100 epoch_Time:26241.0min: [2024-01-04 17:23:33,924][model8_pretrain.py][INFO] Epoch:[0/2](451500/4588595) loss:3.205 lr:0.0000100 epoch_Time:26241.0min: [2024-01-04 17:23:33,924][model8_pretrain.py][INFO] Epoch:[0/2](451500/4588595) loss:2.971 lr:0.0000100 epoch_Time:26241.0min: [2024-01-04 17:23:33,924][model8_pretrain.py][INFO] Epoch:[0/2](451500/4588595) loss:3.073 lr:0.0000100 epoch_Time:26241.0min: [2024-01-04 17:23:33,924][model8_pretrain.py][INFO] Epoch:[0/2](451500/4588595) loss:2.791 lr:0.0000100 epoch_Time:26241.0min: [2024-01-04 17:23:33,924][model8_pretrain.py][INFO] Epoch:[0/2](451500/4588595) loss:2.685 lr:0.0000100 epoch_Time:26241.0min: [2024-01-04 17:24:10,856][model8_pretrain.py][INFO] Epoch:[0/2](451600/4588595) loss:3.035 lr:0.0000100 epoch_Time:26240.0min: [2024-01-04 
17:24:10,856][model8_pretrain.py][INFO] Epoch:[0/2](451600/4588595) loss:2.800 lr:0.0000100 epoch_Time:26240.0min: [2024-01-04 17:24:10,856][model8_pretrain.py][INFO] Epoch:[0/2](451600/4588595) loss:3.156 lr:0.0000100 epoch_Time:26240.0min: [2024-01-04 17:24:10,856][model8_pretrain.py][INFO] Epoch:[0/2](451600/4588595) loss:2.357 lr:0.0000100 epoch_Time:26240.0min: [2024-01-04 17:24:10,856][model8_pretrain.py][INFO] Epoch:[0/2](451600/4588595) loss:2.334 lr:0.0000100 epoch_Time:26240.0min: [2024-01-04 17:24:10,856][model8_pretrain.py][INFO] Epoch:[0/2](451600/4588595) loss:2.916 lr:0.0000100 epoch_Time:26240.0min: [2024-01-04 17:24:10,856][model8_pretrain.py][INFO] Epoch:[0/2](451600/4588595) loss:2.577 lr:0.0000100 epoch_Time:26240.0min: [2024-01-04 17:24:10,856][model8_pretrain.py][INFO] Epoch:[0/2](451600/4588595) loss:2.838 lr:0.0000100 epoch_Time:26240.0min: [2024-01-04 17:24:47,791][model8_pretrain.py][INFO] Epoch:[0/2](451700/4588595) loss:2.878 lr:0.0000100 epoch_Time:26239.0min: [2024-01-04 17:24:47,791][model8_pretrain.py][INFO] Epoch:[0/2](451700/4588595) loss:2.788 lr:0.0000100 epoch_Time:26239.0min: [2024-01-04 17:24:47,791][model8_pretrain.py][INFO] Epoch:[0/2](451700/4588595) loss:2.552 lr:0.0000100 epoch_Time:26239.0min: [2024-01-04 17:24:47,791][model8_pretrain.py][INFO] Epoch:[0/2](451700/4588595) loss:2.239 lr:0.0000100 epoch_Time:26239.0min: [2024-01-04 17:24:47,791][model8_pretrain.py][INFO] Epoch:[0/2](451700/4588595) loss:3.538 lr:0.0000100 epoch_Time:26239.0min: [2024-01-04 17:24:47,791][model8_pretrain.py][INFO] Epoch:[0/2](451700/4588595) loss:2.648 lr:0.0000100 epoch_Time:26239.0min: [2024-01-04 17:24:47,791][model8_pretrain.py][INFO] Epoch:[0/2](451700/4588595) loss:2.834 lr:0.0000100 epoch_Time:26239.0min: [2024-01-04 17:24:47,791][model8_pretrain.py][INFO] Epoch:[0/2](451700/4588595) loss:2.513 lr:0.0000100 epoch_Time:26239.0min: [2024-01-04 17:25:24,735][model8_pretrain.py][INFO] Epoch:[0/2](451800/4588595) loss:2.870 lr:0.0000100 
epoch_Time:26238.0min: [2024-01-04 17:25:24,735][model8_pretrain.py][INFO] Epoch:[0/2](451800/4588595) loss:2.779 lr:0.0000100 epoch_Time:26238.0min: [2024-01-04 17:25:24,735][model8_pretrain.py][INFO] Epoch:[0/2](451800/4588595) loss:3.124 lr:0.0000100 epoch_Time:26238.0min: [2024-01-04 17:25:24,735][model8_pretrain.py][INFO] Epoch:[0/2](451800/4588595) loss:3.309 lr:0.0000100 epoch_Time:26238.0min: [2024-01-04 17:25:24,735][model8_pretrain.py][INFO] Epoch:[0/2](451800/4588595) loss:3.080 lr:0.0000100 epoch_Time:26238.0min: [2024-01-04 17:25:24,735][model8_pretrain.py][INFO] Epoch:[0/2](451800/4588595) loss:2.757 lr:0.0000100 epoch_Time:26238.0min: [2024-01-04 17:25:24,735][model8_pretrain.py][INFO] Epoch:[0/2](451800/4588595) loss:2.337 lr:0.0000100 epoch_Time:26238.0min: [2024-01-04 17:25:24,735][model8_pretrain.py][INFO] Epoch:[0/2](451800/4588595) loss:1.814 lr:0.0000100 epoch_Time:26238.0min: [2024-01-04 17:26:01,668][model8_pretrain.py][INFO] Epoch:[0/2](451900/4588595) loss:3.129 lr:0.0000100 epoch_Time:26237.0min: [2024-01-04 17:26:01,668][model8_pretrain.py][INFO] Epoch:[0/2](451900/4588595) loss:2.796 lr:0.0000100 epoch_Time:26237.0min: [2024-01-04 17:26:01,668][model8_pretrain.py][INFO] Epoch:[0/2](451900/4588595) loss:2.895 lr:0.0000100 epoch_Time:26237.0min: [2024-01-04 17:26:01,668][model8_pretrain.py][INFO] Epoch:[0/2](451900/4588595) loss:3.091 lr:0.0000100 epoch_Time:26237.0min: [2024-01-04 17:26:01,668][model8_pretrain.py][INFO] Epoch:[0/2](451900/4588595) loss:2.547 lr:0.0000100 epoch_Time:26237.0min: [2024-01-04 17:26:01,668][model8_pretrain.py][INFO] Epoch:[0/2](451900/4588595) loss:3.163 lr:0.0000100 epoch_Time:26237.0min: [2024-01-04 17:26:01,668][model8_pretrain.py][INFO] Epoch:[0/2](451900/4588595) loss:3.019 lr:0.0000100 epoch_Time:26237.0min: [2024-01-04 17:26:01,668][model8_pretrain.py][INFO] Epoch:[0/2](451900/4588595) loss:2.583 lr:0.0000100 epoch_Time:26237.0min: [2024-01-04 17:26:38,633][model8_pretrain.py][INFO] 
Epoch:[0/2](452000/4588595) loss:2.764 lr:0.0000100 epoch_Time:26237.0min: [2024-01-04 17:26:38,633][model8_pretrain.py][INFO] Epoch:[0/2](452000/4588595) loss:3.100 lr:0.0000100 epoch_Time:26237.0min: [2024-01-04 17:26:38,633][model8_pretrain.py][INFO] Epoch:[0/2](452000/4588595) loss:3.027 lr:0.0000100 epoch_Time:26237.0min: [2024-01-04 17:26:38,633][model8_pretrain.py][INFO] Epoch:[0/2](452000/4588595) loss:2.258 lr:0.0000100 epoch_Time:26237.0min: [2024-01-04 17:26:38,633][model8_pretrain.py][INFO] Epoch:[0/2](452000/4588595) loss:3.047 lr:0.0000100 epoch_Time:26237.0min: [2024-01-04 17:26:38,633][model8_pretrain.py][INFO] Epoch:[0/2](452000/4588595) loss:2.324 lr:0.0000100 epoch_Time:26237.0min: [2024-01-04 17:26:38,633][model8_pretrain.py][INFO] Epoch:[0/2](452000/4588595) loss:2.907 lr:0.0000100 epoch_Time:26237.0min: [2024-01-04 17:26:38,633][model8_pretrain.py][INFO] Epoch:[0/2](452000/4588595) loss:3.093 lr:0.0000100 epoch_Time:26237.0min: [2024-01-04 17:27:15,592][model8_pretrain.py][INFO] Epoch:[0/2](452100/4588595) loss:3.015 lr:0.0000100 epoch_Time:26236.0min: [2024-01-04 17:27:15,592][model8_pretrain.py][INFO] Epoch:[0/2](452100/4588595) loss:2.656 lr:0.0000100 epoch_Time:26236.0min: [2024-01-04 17:27:15,592][model8_pretrain.py][INFO] Epoch:[0/2](452100/4588595) loss:3.381 lr:0.0000100 epoch_Time:26236.0min: [2024-01-04 17:27:15,592][model8_pretrain.py][INFO] Epoch:[0/2](452100/4588595) loss:3.347 lr:0.0000100 epoch_Time:26236.0min: [2024-01-04 17:27:15,592][model8_pretrain.py][INFO] Epoch:[0/2](452100/4588595) loss:3.226 lr:0.0000100 epoch_Time:26236.0min: [2024-01-04 17:27:15,592][model8_pretrain.py][INFO] Epoch:[0/2](452100/4588595) loss:2.560 lr:0.0000100 epoch_Time:26236.0min: [2024-01-04 17:27:15,592][model8_pretrain.py][INFO] Epoch:[0/2](452100/4588595) loss:3.094 lr:0.0000100 epoch_Time:26236.0min: [2024-01-04 17:27:15,592][model8_pretrain.py][INFO] Epoch:[0/2](452100/4588595) loss:2.839 lr:0.0000100 epoch_Time:26236.0min: [2024-01-04 
17:27:57,693][model8_pretrain.py][INFO] Epoch:[0/2](452200/4588595) loss:1.959 lr:0.0000100 epoch_Time:26236.0min: [2024-01-04 17:27:57,693][model8_pretrain.py][INFO] Epoch:[0/2](452200/4588595) loss:2.731 lr:0.0000100 epoch_Time:26236.0min: [2024-01-04 17:27:57,693][model8_pretrain.py][INFO] Epoch:[0/2](452200/4588595) loss:2.875 lr:0.0000100 epoch_Time:26236.0min: [2024-01-04 17:27:57,694][model8_pretrain.py][INFO] Epoch:[0/2](452200/4588595) loss:2.685 lr:0.0000100 epoch_Time:26236.0min: [2024-01-04 17:27:57,694][model8_pretrain.py][INFO] Epoch:[0/2](452200/4588595) loss:3.060 lr:0.0000100 epoch_Time:26236.0min: [2024-01-04 17:27:57,694][model8_pretrain.py][INFO] Epoch:[0/2](452200/4588595) loss:2.736 lr:0.0000100 epoch_Time:26236.0min: [2024-01-04 17:27:57,694][model8_pretrain.py][INFO] Epoch:[0/2](452200/4588595) loss:3.000 lr:0.0000100 epoch_Time:26236.0min: [2024-01-04 17:27:57,694][model8_pretrain.py][INFO] Epoch:[0/2](452200/4588595) loss:2.894 lr:0.0000100 epoch_Time:26236.0min: [2024-01-04 17:28:39,816][model8_pretrain.py][INFO] Epoch:[0/2](452300/4588595) loss:2.216 lr:0.0000100 epoch_Time:26236.0min: [2024-01-04 17:28:39,816][model8_pretrain.py][INFO] Epoch:[0/2](452300/4588595) loss:2.597 lr:0.0000100 epoch_Time:26236.0min: [2024-01-04 17:28:39,816][model8_pretrain.py][INFO] Epoch:[0/2](452300/4588595) loss:3.090 lr:0.0000100 epoch_Time:26236.0min: [2024-01-04 17:28:39,816][model8_pretrain.py][INFO] Epoch:[0/2](452300/4588595) loss:2.736 lr:0.0000100 epoch_Time:26236.0min: [2024-01-04 17:28:39,816][model8_pretrain.py][INFO] Epoch:[0/2](452300/4588595) loss:2.977 lr:0.0000100 epoch_Time:26236.0min: [2024-01-04 17:28:39,816][model8_pretrain.py][INFO] Epoch:[0/2](452300/4588595) loss:2.671 lr:0.0000100 epoch_Time:26236.0min: [2024-01-04 17:28:39,816][model8_pretrain.py][INFO] Epoch:[0/2](452300/4588595) loss:2.824 lr:0.0000100 epoch_Time:26236.0min: [2024-01-04 17:28:39,830][model8_pretrain.py][INFO] Epoch:[0/2](452300/4588595) loss:3.322 lr:0.0000100 
epoch_Time:26236.0min: [2024-01-04 17:29:16,797][model8_pretrain.py][INFO] Epoch:[0/2](452400/4588595) loss:2.892 lr:0.0000100 epoch_Time:26235.0min: [2024-01-04 17:29:16,798][model8_pretrain.py][INFO] Epoch:[0/2](452400/4588595) loss:2.919 lr:0.0000100 epoch_Time:26235.0min: [2024-01-04 17:29:16,798][model8_pretrain.py][INFO] Epoch:[0/2](452400/4588595) loss:3.105 lr:0.0000100 epoch_Time:26235.0min: [2024-01-04 17:29:16,798][model8_pretrain.py][INFO] Epoch:[0/2](452400/4588595) loss:3.305 lr:0.0000100 epoch_Time:26235.0min: [2024-01-04 17:29:16,798][model8_pretrain.py][INFO] Epoch:[0/2](452400/4588595) loss:2.527 lr:0.0000100 epoch_Time:26235.0min: [2024-01-04 17:29:16,798][model8_pretrain.py][INFO] Epoch:[0/2](452400/4588595) loss:2.515 lr:0.0000100 epoch_Time:26235.0min: [2024-01-04 17:29:16,798][model8_pretrain.py][INFO] Epoch:[0/2](452400/4588595) loss:2.770 lr:0.0000100 epoch_Time:26235.0min: [2024-01-04 17:29:16,798][model8_pretrain.py][INFO] Epoch:[0/2](452400/4588595) loss:3.023 lr:0.0000100 epoch_Time:26235.0min: [2024-01-04 17:29:53,734][model8_pretrain.py][INFO] Epoch:[0/2](452500/4588595) loss:3.047 lr:0.0000100 epoch_Time:26234.0min: [2024-01-04 17:29:53,734][model8_pretrain.py][INFO] Epoch:[0/2](452500/4588595) loss:2.342 lr:0.0000100 epoch_Time:26234.0min: [2024-01-04 17:29:53,734][model8_pretrain.py][INFO] Epoch:[0/2](452500/4588595) loss:3.163 lr:0.0000100 epoch_Time:26234.0min: [2024-01-04 17:29:53,734][model8_pretrain.py][INFO] Epoch:[0/2](452500/4588595) loss:2.654 lr:0.0000100 epoch_Time:26234.0min: [2024-01-04 17:29:53,734][model8_pretrain.py][INFO] Epoch:[0/2](452500/4588595) loss:2.638 lr:0.0000100 epoch_Time:26234.0min: [2024-01-04 17:29:53,734][model8_pretrain.py][INFO] Epoch:[0/2](452500/4588595) loss:2.992 lr:0.0000100 epoch_Time:26234.0min: [2024-01-04 17:29:53,734][model8_pretrain.py][INFO] Epoch:[0/2](452500/4588595) loss:3.018 lr:0.0000100 epoch_Time:26234.0min: [2024-01-04 17:29:53,735][model8_pretrain.py][INFO] 
Epoch:[0/2](452500/4588595) loss:2.934 lr:0.0000100 epoch_Time:26234.0min: [2024-01-04 17:30:30,670][model8_pretrain.py][INFO] Epoch:[0/2](452600/4588595) loss:2.938 lr:0.0000100 epoch_Time:26234.0min: [2024-01-04 17:30:30,670][model8_pretrain.py][INFO] Epoch:[0/2](452600/4588595) loss:2.662 lr:0.0000100 epoch_Time:26234.0min: [2024-01-04 17:30:30,670][model8_pretrain.py][INFO] Epoch:[0/2](452600/4588595) loss:2.821 lr:0.0000100 epoch_Time:26234.0min: [2024-01-04 17:30:30,670][model8_pretrain.py][INFO] Epoch:[0/2](452600/4588595) loss:2.838 lr:0.0000100 epoch_Time:26234.0min: [2024-01-04 17:30:30,670][model8_pretrain.py][INFO] Epoch:[0/2](452600/4588595) loss:2.646 lr:0.0000100 epoch_Time:26234.0min: [2024-01-04 17:30:30,670][model8_pretrain.py][INFO] Epoch:[0/2](452600/4588595) loss:2.554 lr:0.0000100 epoch_Time:26234.0min: [2024-01-04 17:30:30,670][model8_pretrain.py][INFO] Epoch:[0/2](452600/4588595) loss:2.382 lr:0.0000100 epoch_Time:26234.0min: [2024-01-04 17:30:30,670][model8_pretrain.py][INFO] Epoch:[0/2](452600/4588595) loss:2.542 lr:0.0000100 epoch_Time:26234.0min: [2024-01-04 17:31:07,604][model8_pretrain.py][INFO] Epoch:[0/2](452700/4588595) loss:2.890 lr:0.0000100 epoch_Time:26232.0min: [2024-01-04 17:31:07,604][model8_pretrain.py][INFO] Epoch:[0/2](452700/4588595) loss:2.984 lr:0.0000100 epoch_Time:26232.0min: [2024-01-04 17:31:07,604][model8_pretrain.py][INFO] Epoch:[0/2](452700/4588595) loss:2.680 lr:0.0000100 epoch_Time:26232.0min: [2024-01-04 17:31:07,604][model8_pretrain.py][INFO] Epoch:[0/2](452700/4588595) loss:3.342 lr:0.0000100 epoch_Time:26232.0min: [2024-01-04 17:31:07,604][model8_pretrain.py][INFO] Epoch:[0/2](452700/4588595) loss:3.265 lr:0.0000100 epoch_Time:26232.0min: [2024-01-04 17:31:07,604][model8_pretrain.py][INFO] Epoch:[0/2](452700/4588595) loss:2.897 lr:0.0000100 epoch_Time:26232.0min: [2024-01-04 17:31:07,604][model8_pretrain.py][INFO] Epoch:[0/2](452700/4588595) loss:2.340 lr:0.0000100 epoch_Time:26232.0min: [2024-01-04 
17:31:07,605][model8_pretrain.py][INFO] Epoch:[0/2](452700/4588595) loss:2.624 lr:0.0000100 epoch_Time:26232.0min: [2024-01-04 17:31:44,532][model8_pretrain.py][INFO] Epoch:[0/2](452800/4588595) loss:2.856 lr:0.0000100 epoch_Time:26232.0min: [2024-01-04 17:31:44,532][model8_pretrain.py][INFO] Epoch:[0/2](452800/4588595) loss:3.249 lr:0.0000100 epoch_Time:26232.0min: [2024-01-04 17:31:44,532][model8_pretrain.py][INFO] Epoch:[0/2](452800/4588595) loss:2.561 lr:0.0000100 epoch_Time:26232.0min: [2024-01-04 17:31:44,532][model8_pretrain.py][INFO] Epoch:[0/2](452800/4588595) loss:2.992 lr:0.0000100 epoch_Time:26232.0min: [2024-01-04 17:31:44,532][model8_pretrain.py][INFO] Epoch:[0/2](452800/4588595) loss:2.875 lr:0.0000100 epoch_Time:26232.0min: [2024-01-04 17:31:44,532][model8_pretrain.py][INFO] Epoch:[0/2](452800/4588595) loss:2.670 lr:0.0000100 epoch_Time:26232.0min: [2024-01-04 17:31:44,533][model8_pretrain.py][INFO] Epoch:[0/2](452800/4588595) loss:2.465 lr:0.0000100 epoch_Time:26232.0min: [2024-01-04 17:31:44,533][model8_pretrain.py][INFO] Epoch:[0/2](452800/4588595) loss:2.947 lr:0.0000100 epoch_Time:26232.0min: [2024-01-04 17:32:21,465][model8_pretrain.py][INFO] Epoch:[0/2](452900/4588595) loss:3.001 lr:0.0000100 epoch_Time:26231.0min: [2024-01-04 17:32:21,465][model8_pretrain.py][INFO] Epoch:[0/2](452900/4588595) loss:2.477 lr:0.0000100 epoch_Time:26231.0min: [2024-01-04 17:32:21,465][model8_pretrain.py][INFO] Epoch:[0/2](452900/4588595) loss:2.942 lr:0.0000100 epoch_Time:26231.0min: [2024-01-04 17:32:21,465][model8_pretrain.py][INFO] Epoch:[0/2](452900/4588595) loss:3.001 lr:0.0000100 epoch_Time:26231.0min: [2024-01-04 17:32:21,466][model8_pretrain.py][INFO] Epoch:[0/2](452900/4588595) loss:2.366 lr:0.0000100 epoch_Time:26231.0min: [2024-01-04 17:32:21,465][model8_pretrain.py][INFO] Epoch:[0/2](452900/4588595) loss:2.809 lr:0.0000100 epoch_Time:26231.0min: [2024-01-04 17:32:21,465][model8_pretrain.py][INFO] Epoch:[0/2](452900/4588595) loss:2.709 lr:0.0000100 
epoch_Time:26231.0min: [2024-01-04 17:32:21,466][model8_pretrain.py][INFO] Epoch:[0/2](452900/4588595) loss:3.158 lr:0.0000100 epoch_Time:26231.0min: [2024-01-04 17:33:00,164][model8_pretrain.py][INFO] Epoch:[0/2](453000/4588595) loss:3.470 lr:0.0000100 epoch_Time:26230.0min: [2024-01-04 17:33:00,164][model8_pretrain.py][INFO] Epoch:[0/2](453000/4588595) loss:2.591 lr:0.0000100 epoch_Time:26230.0min: [2024-01-04 17:33:00,164][model8_pretrain.py][INFO] Epoch:[0/2](453000/4588595) loss:3.054 lr:0.0000100 epoch_Time:26230.0min: [2024-01-04 17:33:00,165][model8_pretrain.py][INFO] Epoch:[0/2](453000/4588595) loss:2.689 lr:0.0000100 epoch_Time:26230.0min: [2024-01-04 17:33:00,165][model8_pretrain.py][INFO] Epoch:[0/2](453000/4588595) loss:2.665 lr:0.0000100 epoch_Time:26230.0min: [2024-01-04 17:33:00,165][model8_pretrain.py][INFO] Epoch:[0/2](453000/4588595) loss:2.907 lr:0.0000100 epoch_Time:26230.0min: [2024-01-04 17:33:00,165][model8_pretrain.py][INFO] Epoch:[0/2](453000/4588595) loss:2.553 lr:0.0000100 epoch_Time:26230.0min: [2024-01-04 17:33:00,165][model8_pretrain.py][INFO] Epoch:[0/2](453000/4588595) loss:2.727 lr:0.0000100 epoch_Time:26230.0min: [2024-01-04 17:33:45,706][model8_pretrain.py][INFO] Epoch:[0/2](453100/4588595) loss:3.110 lr:0.0000100 epoch_Time:26231.0min: [2024-01-04 17:33:45,706][model8_pretrain.py][INFO] Epoch:[0/2](453100/4588595) loss:2.893 lr:0.0000100 epoch_Time:26231.0min: [2024-01-04 17:33:45,706][model8_pretrain.py][INFO] Epoch:[0/2](453100/4588595) loss:2.939 lr:0.0000100 epoch_Time:26231.0min: [2024-01-04 17:33:45,706][model8_pretrain.py][INFO] Epoch:[0/2](453100/4588595) loss:2.790 lr:0.0000100 epoch_Time:26231.0min: [2024-01-04 17:33:45,706][model8_pretrain.py][INFO] Epoch:[0/2](453100/4588595) loss:2.737 lr:0.0000100 epoch_Time:26231.0min: [2024-01-04 17:33:45,706][model8_pretrain.py][INFO] Epoch:[0/2](453100/4588595) loss:2.601 lr:0.0000100 epoch_Time:26231.0min: [2024-01-04 17:33:45,706][model8_pretrain.py][INFO] 
Epoch:[0/2](453100/4588595) loss:2.806 lr:0.0000100 epoch_Time:26231.0min: [2024-01-04 17:33:45,707][model8_pretrain.py][INFO] Epoch:[0/2](453100/4588595) loss:2.510 lr:0.0000100 epoch_Time:26231.0min: [2024-01-04 17:34:22,688][model8_pretrain.py][INFO] Epoch:[0/2](453200/4588595) loss:3.180 lr:0.0000100 epoch_Time:26230.0min: [2024-01-04 17:34:22,688][model8_pretrain.py][INFO] Epoch:[0/2](453200/4588595) loss:2.634 lr:0.0000100 epoch_Time:26230.0min: [2024-01-04 17:34:22,688][model8_pretrain.py][INFO] Epoch:[0/2](453200/4588595) loss:3.002 lr:0.0000100 epoch_Time:26230.0min: [2024-01-04 17:34:22,688][model8_pretrain.py][INFO] Epoch:[0/2](453200/4588595) loss:2.674 lr:0.0000100 epoch_Time:26230.0min: [2024-01-04 17:34:22,688][model8_pretrain.py][INFO] Epoch:[0/2](453200/4588595) loss:2.575 lr:0.0000100 epoch_Time:26230.0min: [2024-01-04 17:34:22,688][model8_pretrain.py][INFO] Epoch:[0/2](453200/4588595) loss:2.990 lr:0.0000100 epoch_Time:26230.0min: [2024-01-04 17:34:22,688][model8_pretrain.py][INFO] Epoch:[0/2](453200/4588595) loss:2.427 lr:0.0000100 epoch_Time:26230.0min: [2024-01-04 17:34:22,688][model8_pretrain.py][INFO] Epoch:[0/2](453200/4588595) loss:2.422 lr:0.0000100 epoch_Time:26230.0min: [2024-01-04 17:34:59,660][model8_pretrain.py][INFO] Epoch:[0/2](453300/4588595) loss:2.596 lr:0.0000100 epoch_Time:26229.0min: [2024-01-04 17:34:59,660][model8_pretrain.py][INFO] Epoch:[0/2](453300/4588595) loss:3.156 lr:0.0000100 epoch_Time:26229.0min: [2024-01-04 17:34:59,660][model8_pretrain.py][INFO] Epoch:[0/2](453300/4588595) loss:2.892 lr:0.0000100 epoch_Time:26229.0min: [2024-01-04 17:34:59,660][model8_pretrain.py][INFO] Epoch:[0/2](453300/4588595) loss:3.002 lr:0.0000100 epoch_Time:26229.0min: [2024-01-04 17:34:59,660][model8_pretrain.py][INFO] Epoch:[0/2](453300/4588595) loss:2.532 lr:0.0000100 epoch_Time:26229.0min: [2024-01-04 17:34:59,661][model8_pretrain.py][INFO] Epoch:[0/2](453300/4588595) loss:2.414 lr:0.0000100 epoch_Time:26229.0min: [2024-01-04 
17:34:59,661][model8_pretrain.py][INFO] Epoch:[0/2](453300/4588595) loss:3.201 lr:0.0000100 epoch_Time:26229.0min: [2024-01-04 17:34:59,661][model8_pretrain.py][INFO] Epoch:[0/2](453300/4588595) loss:2.923 lr:0.0000100 epoch_Time:26229.0min: [2024-01-04 17:35:36,630][model8_pretrain.py][INFO] Epoch:[0/2](453400/4588595) loss:2.829 lr:0.0000100 epoch_Time:26229.0min: [2024-01-04 17:35:36,630][model8_pretrain.py][INFO] Epoch:[0/2](453400/4588595) loss:2.705 lr:0.0000100 epoch_Time:26229.0min: [2024-01-04 17:35:36,630][model8_pretrain.py][INFO] Epoch:[0/2](453400/4588595) loss:3.027 lr:0.0000100 epoch_Time:26229.0min: [2024-01-04 17:35:36,630][model8_pretrain.py][INFO] Epoch:[0/2](453400/4588595) loss:3.578 lr:0.0000100 epoch_Time:26229.0min: [2024-01-04 17:35:36,630][model8_pretrain.py][INFO] Epoch:[0/2](453400/4588595) loss:3.308 lr:0.0000100 epoch_Time:26229.0min: [2024-01-04 17:35:36,630][model8_pretrain.py][INFO] Epoch:[0/2](453400/4588595) loss:2.927 lr:0.0000100 epoch_Time:26229.0min: [2024-01-04 17:35:36,630][model8_pretrain.py][INFO] Epoch:[0/2](453400/4588595) loss:2.881 lr:0.0000100 epoch_Time:26229.0min: [2024-01-04 17:35:36,630][model8_pretrain.py][INFO] Epoch:[0/2](453400/4588595) loss:2.925 lr:0.0000100 epoch_Time:26229.0min: [2024-01-04 17:36:13,598][model8_pretrain.py][INFO] Epoch:[0/2](453500/4588595) loss:2.602 lr:0.0000100 epoch_Time:26228.0min: [2024-01-04 17:36:13,598][model8_pretrain.py][INFO] Epoch:[0/2](453500/4588595) loss:2.832 lr:0.0000100 epoch_Time:26228.0min: [2024-01-04 17:36:13,598][model8_pretrain.py][INFO] Epoch:[0/2](453500/4588595) loss:2.883 lr:0.0000100 epoch_Time:26228.0min: [2024-01-04 17:36:13,598][model8_pretrain.py][INFO] Epoch:[0/2](453500/4588595) loss:2.706 lr:0.0000100 epoch_Time:26228.0min: [2024-01-04 17:36:13,598][model8_pretrain.py][INFO] Epoch:[0/2](453500/4588595) loss:2.966 lr:0.0000100 epoch_Time:26228.0min: [2024-01-04 17:36:13,598][model8_pretrain.py][INFO] Epoch:[0/2](453500/4588595) loss:2.731 lr:0.0000100 
epoch_Time:26228.0min: [2024-01-04 17:36:13,598][model8_pretrain.py][INFO] Epoch:[0/2](453500/4588595) loss:3.100 lr:0.0000100 epoch_Time:26228.0min: [2024-01-04 17:36:13,599][model8_pretrain.py][INFO] Epoch:[0/2](453500/4588595) loss:3.085 lr:0.0000100 epoch_Time:26228.0min: [2024-01-04 17:36:50,546][model8_pretrain.py][INFO] Epoch:[0/2](453600/4588595) loss:2.983 lr:0.0000100 epoch_Time:26227.0min: [2024-01-04 17:36:50,546][model8_pretrain.py][INFO] Epoch:[0/2](453600/4588595) loss:3.106 lr:0.0000100 epoch_Time:26227.0min: [2024-01-04 17:36:50,546][model8_pretrain.py][INFO] Epoch:[0/2](453600/4588595) loss:2.974 lr:0.0000100 epoch_Time:26227.0min: [2024-01-04 17:36:50,546][model8_pretrain.py][INFO] Epoch:[0/2](453600/4588595) loss:2.633 lr:0.0000100 epoch_Time:26227.0min: [2024-01-04 17:36:50,546][model8_pretrain.py][INFO] Epoch:[0/2](453600/4588595) loss:2.790 lr:0.0000100 epoch_Time:26227.0min: [2024-01-04 17:36:50,546][model8_pretrain.py][INFO] Epoch:[0/2](453600/4588595) loss:3.423 lr:0.0000100 epoch_Time:26227.0min: [2024-01-04 17:36:50,546][model8_pretrain.py][INFO] Epoch:[0/2](453600/4588595) loss:2.724 lr:0.0000100 epoch_Time:26227.0min: [2024-01-04 17:36:50,546][model8_pretrain.py][INFO] Epoch:[0/2](453600/4588595) loss:3.254 lr:0.0000100 epoch_Time:26227.0min: [2024-01-04 17:37:27,485][model8_pretrain.py][INFO] Epoch:[0/2](453700/4588595) loss:3.086 lr:0.0000100 epoch_Time:26226.0min: [2024-01-04 17:37:27,485][model8_pretrain.py][INFO] Epoch:[0/2](453700/4588595) loss:2.142 lr:0.0000100 epoch_Time:26226.0min: [2024-01-04 17:37:27,485][model8_pretrain.py][INFO] Epoch:[0/2](453700/4588595) loss:2.446 lr:0.0000100 epoch_Time:26226.0min: [2024-01-04 17:37:27,485][model8_pretrain.py][INFO] Epoch:[0/2](453700/4588595) loss:2.729 lr:0.0000100 epoch_Time:26226.0min: [2024-01-04 17:37:27,485][model8_pretrain.py][INFO] Epoch:[0/2](453700/4588595) loss:2.837 lr:0.0000100 epoch_Time:26226.0min: [2024-01-04 17:37:27,485][model8_pretrain.py][INFO] 
Epoch:[0/2](453700/4588595) loss:2.927 lr:0.0000100 epoch_Time:26226.0min: [2024-01-04 17:37:27,485][model8_pretrain.py][INFO] Epoch:[0/2](453700/4588595) loss:2.902 lr:0.0000100 epoch_Time:26226.0min: [2024-01-04 17:37:27,486][model8_pretrain.py][INFO] Epoch:[0/2](453700/4588595) loss:3.408 lr:0.0000100 epoch_Time:26226.0min: [2024-01-04 17:38:06,174][model8_pretrain.py][INFO] Epoch:[0/2](453800/4588595) loss:3.622 lr:0.0000100 epoch_Time:26225.0min: [2024-01-04 17:38:06,174][model8_pretrain.py][INFO] Epoch:[0/2](453800/4588595) loss:3.386 lr:0.0000100 epoch_Time:26225.0min: [2024-01-04 17:38:06,174][model8_pretrain.py][INFO] Epoch:[0/2](453800/4588595) loss:2.647 lr:0.0000100 epoch_Time:26225.0min: [2024-01-04 17:38:06,174][model8_pretrain.py][INFO] Epoch:[0/2](453800/4588595) loss:2.905 lr:0.0000100 epoch_Time:26225.0min: [2024-01-04 17:38:06,174][model8_pretrain.py][INFO] Epoch:[0/2](453800/4588595) loss:2.891 lr:0.0000100 epoch_Time:26225.0min: [2024-01-04 17:38:06,174][model8_pretrain.py][INFO] Epoch:[0/2](453800/4588595) loss:3.048 lr:0.0000100 epoch_Time:26225.0min: [2024-01-04 17:38:06,175][model8_pretrain.py][INFO] Epoch:[0/2](453800/4588595) loss:3.312 lr:0.0000100 epoch_Time:26225.0min: [2024-01-04 17:38:06,175][model8_pretrain.py][INFO] Epoch:[0/2](453800/4588595) loss:3.033 lr:0.0000100 epoch_Time:26225.0min: [2024-01-04 17:38:51,804][model8_pretrain.py][INFO] Epoch:[0/2](453900/4588595) loss:3.609 lr:0.0000100 epoch_Time:26226.0min: [2024-01-04 17:38:51,804][model8_pretrain.py][INFO] Epoch:[0/2](453900/4588595) loss:2.386 lr:0.0000100 epoch_Time:26226.0min: [2024-01-04 17:38:51,804][model8_pretrain.py][INFO] Epoch:[0/2](453900/4588595) loss:2.512 lr:0.0000100 epoch_Time:26226.0min: [2024-01-04 17:38:51,804][model8_pretrain.py][INFO] Epoch:[0/2](453900/4588595) loss:2.756 lr:0.0000100 epoch_Time:26226.0min: [2024-01-04 17:38:51,804][model8_pretrain.py][INFO] Epoch:[0/2](453900/4588595) loss:2.543 lr:0.0000100 epoch_Time:26226.0min: [2024-01-04 
17:38:51,804][model8_pretrain.py][INFO] Epoch:[0/2](453900/4588595) loss:3.176 lr:0.0000100 epoch_Time:26226.0min: [2024-01-04 17:38:51,804][model8_pretrain.py][INFO] Epoch:[0/2](453900/4588595) loss:3.192 lr:0.0000100 epoch_Time:26226.0min: [2024-01-04 17:38:51,804][model8_pretrain.py][INFO] Epoch:[0/2](453900/4588595) loss:3.028 lr:0.0000100 epoch_Time:26226.0min: [2024-01-04 17:39:28,740][model8_pretrain.py][INFO] Epoch:[0/2](454000/4588595) loss:2.895 lr:0.0000100 epoch_Time:26226.0min: [2024-01-04 17:39:28,740][model8_pretrain.py][INFO] Epoch:[0/2](454000/4588595) loss:2.674 lr:0.0000100 epoch_Time:26226.0min: [2024-01-04 17:39:28,740][model8_pretrain.py][INFO] Epoch:[0/2](454000/4588595) loss:2.720 lr:0.0000100 epoch_Time:26226.0min: [2024-01-04 17:39:28,740][model8_pretrain.py][INFO] Epoch:[0/2](454000/4588595) loss:2.759 lr:0.0000100 epoch_Time:26226.0min: [2024-01-04 17:39:28,740][model8_pretrain.py][INFO] Epoch:[0/2](454000/4588595) loss:2.220 lr:0.0000100 epoch_Time:26226.0min: [2024-01-04 17:39:28,740][model8_pretrain.py][INFO] Epoch:[0/2](454000/4588595) loss:2.596 lr:0.0000100 epoch_Time:26226.0min: [2024-01-04 17:39:28,740][model8_pretrain.py][INFO] Epoch:[0/2](454000/4588595) loss:3.073 lr:0.0000100 epoch_Time:26226.0min: [2024-01-04 17:39:28,740][model8_pretrain.py][INFO] Epoch:[0/2](454000/4588595) loss:3.173 lr:0.0000100 epoch_Time:26226.0min: [2024-01-04 17:40:05,672][model8_pretrain.py][INFO] Epoch:[0/2](454100/4588595) loss:2.832 lr:0.0000100 epoch_Time:26224.0min: [2024-01-04 17:40:05,672][model8_pretrain.py][INFO] Epoch:[0/2](454100/4588595) loss:2.696 lr:0.0000100 epoch_Time:26224.0min: [2024-01-04 17:40:05,672][model8_pretrain.py][INFO] Epoch:[0/2](454100/4588595) loss:2.930 lr:0.0000100 epoch_Time:26224.0min: [2024-01-04 17:40:05,672][model8_pretrain.py][INFO] Epoch:[0/2](454100/4588595) loss:2.787 lr:0.0000100 epoch_Time:26224.0min: [2024-01-04 17:40:05,672][model8_pretrain.py][INFO] Epoch:[0/2](454100/4588595) loss:3.009 lr:0.0000100 
epoch_Time:26224.0min: [2024-01-04 17:40:05,672][model8_pretrain.py][INFO] Epoch:[0/2](454100/4588595) loss:2.776 lr:0.0000100 epoch_Time:26224.0min: [2024-01-04 17:40:05,672][model8_pretrain.py][INFO] Epoch:[0/2](454100/4588595) loss:2.845 lr:0.0000100 epoch_Time:26224.0min: [2024-01-04 17:40:05,672][model8_pretrain.py][INFO] Epoch:[0/2](454100/4588595) loss:2.736 lr:0.0000100 epoch_Time:26224.0min: [2024-01-04 17:40:42,610][model8_pretrain.py][INFO] Epoch:[0/2](454200/4588595) loss:3.007 lr:0.0000100 epoch_Time:26224.0min: [2024-01-04 17:40:42,610][model8_pretrain.py][INFO] Epoch:[0/2](454200/4588595) loss:2.679 lr:0.0000100 epoch_Time:26224.0min: [2024-01-04 17:40:42,610][model8_pretrain.py][INFO] Epoch:[0/2](454200/4588595) loss:2.895 lr:0.0000100 epoch_Time:26224.0min: [2024-01-04 17:40:42,610][model8_pretrain.py][INFO] Epoch:[0/2](454200/4588595) loss:2.948 lr:0.0000100 epoch_Time:26224.0min: [2024-01-04 17:40:42,610][model8_pretrain.py][INFO] Epoch:[0/2](454200/4588595) loss:2.697 lr:0.0000100 epoch_Time:26224.0min: [2024-01-04 17:40:42,610][model8_pretrain.py][INFO] Epoch:[0/2](454200/4588595) loss:3.129 lr:0.0000100 epoch_Time:26224.0min: [2024-01-04 17:40:42,610][model8_pretrain.py][INFO] Epoch:[0/2](454200/4588595) loss:2.363 lr:0.0000100 epoch_Time:26224.0min: [2024-01-04 17:40:42,610][model8_pretrain.py][INFO] Epoch:[0/2](454200/4588595) loss:2.681 lr:0.0000100 epoch_Time:26224.0min: [2024-01-04 17:41:19,537][model8_pretrain.py][INFO] Epoch:[0/2](454300/4588595) loss:2.696 lr:0.0000100 epoch_Time:26223.0min: [2024-01-04 17:41:19,537][model8_pretrain.py][INFO] Epoch:[0/2](454300/4588595) loss:2.825 lr:0.0000100 epoch_Time:26223.0min: [2024-01-04 17:41:19,537][model8_pretrain.py][INFO] Epoch:[0/2](454300/4588595) loss:3.022 lr:0.0000100 epoch_Time:26223.0min: [2024-01-04 17:41:19,537][model8_pretrain.py][INFO] Epoch:[0/2](454300/4588595) loss:2.984 lr:0.0000100 epoch_Time:26223.0min: [2024-01-04 17:41:19,537][model8_pretrain.py][INFO] 
Epoch:[0/2](454300/4588595) loss:2.794 lr:0.0000100 epoch_Time:26223.0min: [2024-01-04 17:41:19,537][model8_pretrain.py][INFO] Epoch:[0/2](454300/4588595) loss:3.011 lr:0.0000100 epoch_Time:26223.0min: [2024-01-04 17:41:19,537][model8_pretrain.py][INFO] Epoch:[0/2](454300/4588595) loss:3.130 lr:0.0000100 epoch_Time:26223.0min: [2024-01-04 17:41:19,537][model8_pretrain.py][INFO] Epoch:[0/2](454300/4588595) loss:3.046 lr:0.0000100 epoch_Time:26223.0min: [2024-01-04 17:41:56,467][model8_pretrain.py][INFO] Epoch:[0/2](454400/4588595) loss:3.166 lr:0.0000100 epoch_Time:26222.0min: [2024-01-04 17:41:56,467][model8_pretrain.py][INFO] Epoch:[0/2](454400/4588595) loss:2.792 lr:0.0000100 epoch_Time:26222.0min: [2024-01-04 17:41:56,468][model8_pretrain.py][INFO] Epoch:[0/2](454400/4588595) loss:2.573 lr:0.0000100 epoch_Time:26222.0min: [2024-01-04 17:41:56,468][model8_pretrain.py][INFO] Epoch:[0/2](454400/4588595) loss:2.725 lr:0.0000100 epoch_Time:26222.0min: [2024-01-04 17:41:56,468][model8_pretrain.py][INFO] Epoch:[0/2](454400/4588595) loss:3.167 lr:0.0000100 epoch_Time:26222.0min: [2024-01-04 17:41:56,468][model8_pretrain.py][INFO] Epoch:[0/2](454400/4588595) loss:3.221 lr:0.0000100 epoch_Time:26222.0min: [2024-01-04 17:41:56,468][model8_pretrain.py][INFO] Epoch:[0/2](454400/4588595) loss:3.056 lr:0.0000100 epoch_Time:26222.0min: [2024-01-04 17:41:56,468][model8_pretrain.py][INFO] Epoch:[0/2](454400/4588595) loss:3.094 lr:0.0000100 epoch_Time:26222.0min: [2024-01-04 17:42:33,405][model8_pretrain.py][INFO] Epoch:[0/2](454500/4588595) loss:3.096 lr:0.0000100 epoch_Time:26222.0min: [2024-01-04 17:42:33,405][model8_pretrain.py][INFO] Epoch:[0/2](454500/4588595) loss:3.133 lr:0.0000100 epoch_Time:26222.0min: [2024-01-04 17:42:33,405][model8_pretrain.py][INFO] Epoch:[0/2](454500/4588595) loss:2.716 lr:0.0000100 epoch_Time:26222.0min: [2024-01-04 17:42:33,405][model8_pretrain.py][INFO] Epoch:[0/2](454500/4588595) loss:2.855 lr:0.0000100 epoch_Time:26222.0min: [2024-01-04 
17:42:33,405][model8_pretrain.py][INFO] Epoch:[0/2](454500/4588595) loss:3.271 lr:0.0000100 epoch_Time:26222.0min: [2024-01-04 17:42:33,405][model8_pretrain.py][INFO] Epoch:[0/2](454500/4588595) loss:2.980 lr:0.0000100 epoch_Time:26222.0min: [2024-01-04 17:42:33,405][model8_pretrain.py][INFO] Epoch:[0/2](454500/4588595) loss:2.741 lr:0.0000100 epoch_Time:26222.0min: [2024-01-04 17:42:33,405][model8_pretrain.py][INFO] Epoch:[0/2](454500/4588595) loss:2.646 lr:0.0000100 epoch_Time:26222.0min: [2024-01-04 17:43:12,097][model8_pretrain.py][INFO] Epoch:[0/2](454600/4588595) loss:3.141 lr:0.0000100 epoch_Time:26221.0min: [2024-01-04 17:43:12,097][model8_pretrain.py][INFO] Epoch:[0/2](454600/4588595) loss:2.481 lr:0.0000100 epoch_Time:26221.0min: [2024-01-04 17:43:12,097][model8_pretrain.py][INFO] Epoch:[0/2](454600/4588595) loss:2.711 lr:0.0000100 epoch_Time:26221.0min: [2024-01-04 17:43:12,097][model8_pretrain.py][INFO] Epoch:[0/2](454600/4588595) loss:3.374 lr:0.0000100 epoch_Time:26221.0min: [2024-01-04 17:43:12,097][model8_pretrain.py][INFO] Epoch:[0/2](454600/4588595) loss:3.096 lr:0.0000100 epoch_Time:26221.0min: [2024-01-04 17:43:12,097][model8_pretrain.py][INFO] Epoch:[0/2](454600/4588595) loss:3.034 lr:0.0000100 epoch_Time:26221.0min: [2024-01-04 17:43:12,097][model8_pretrain.py][INFO] Epoch:[0/2](454600/4588595) loss:2.996 lr:0.0000100 epoch_Time:26221.0min: [2024-01-04 17:43:12,097][model8_pretrain.py][INFO] Epoch:[0/2](454600/4588595) loss:3.080 lr:0.0000100 epoch_Time:26221.0min: [2024-01-04 17:43:57,493][model8_pretrain.py][INFO] Epoch:[0/2](454700/4588595) loss:2.827 lr:0.0000100 epoch_Time:26221.0min: [2024-01-04 17:43:57,493][model8_pretrain.py][INFO] Epoch:[0/2](454700/4588595) loss:2.843 lr:0.0000100 epoch_Time:26221.0min: [2024-01-04 17:43:57,493][model8_pretrain.py][INFO] Epoch:[0/2](454700/4588595) loss:2.190 lr:0.0000100 epoch_Time:26221.0min: [2024-01-04 17:43:57,493][model8_pretrain.py][INFO] Epoch:[0/2](454700/4588595) loss:3.272 lr:0.0000100 
epoch_Time:26221.0min: [2024-01-04 17:43:57,493][model8_pretrain.py][INFO] Epoch:[0/2](454700/4588595) loss:2.896 lr:0.0000100 epoch_Time:26221.0min: [2024-01-04 17:43:57,493][model8_pretrain.py][INFO] Epoch:[0/2](454700/4588595) loss:2.884 lr:0.0000100 epoch_Time:26221.0min: [2024-01-04 17:43:57,493][model8_pretrain.py][INFO] Epoch:[0/2](454700/4588595) loss:2.569 lr:0.0000100 epoch_Time:26221.0min: [2024-01-04 17:43:57,493][model8_pretrain.py][INFO] Epoch:[0/2](454700/4588595) loss:3.085 lr:0.0000100 epoch_Time:26221.0min: [2024-01-04 17:44:34,428][model8_pretrain.py][INFO] Epoch:[0/2](454800/4588595) loss:2.882 lr:0.0000100 epoch_Time:26221.0min: [2024-01-04 17:44:34,428][model8_pretrain.py][INFO] Epoch:[0/2](454800/4588595) loss:2.618 lr:0.0000100 epoch_Time:26221.0min: [2024-01-04 17:44:34,428][model8_pretrain.py][INFO] Epoch:[0/2](454800/4588595) loss:2.635 lr:0.0000100 epoch_Time:26221.0min: [2024-01-04 17:44:34,428][model8_pretrain.py][INFO] Epoch:[0/2](454800/4588595) loss:2.576 lr:0.0000100 epoch_Time:26221.0min: [2024-01-04 17:44:34,428][model8_pretrain.py][INFO] Epoch:[0/2](454800/4588595) loss:2.590 lr:0.0000100 epoch_Time:26221.0min: [2024-01-04 17:44:34,428][model8_pretrain.py][INFO] Epoch:[0/2](454800/4588595) loss:3.142 lr:0.0000100 epoch_Time:26221.0min: [2024-01-04 17:44:34,428][model8_pretrain.py][INFO] Epoch:[0/2](454800/4588595) loss:2.938 lr:0.0000100 epoch_Time:26221.0min: [2024-01-04 17:44:34,428][model8_pretrain.py][INFO] Epoch:[0/2](454800/4588595) loss:2.810 lr:0.0000100 epoch_Time:26221.0min: [2024-01-04 17:45:11,361][model8_pretrain.py][INFO] Epoch:[0/2](454900/4588595) loss:2.820 lr:0.0000100 epoch_Time:26220.0min: [2024-01-04 17:45:11,361][model8_pretrain.py][INFO] Epoch:[0/2](454900/4588595) loss:3.230 lr:0.0000100 epoch_Time:26220.0min: [2024-01-04 17:45:11,361][model8_pretrain.py][INFO] Epoch:[0/2](454900/4588595) loss:3.371 lr:0.0000100 epoch_Time:26220.0min: [2024-01-04 17:45:11,361][model8_pretrain.py][INFO] 
Epoch:[0/2](454900/4588595) loss:2.896 lr:0.0000100 epoch_Time:26220.0min: [2024-01-04 17:45:11,361][model8_pretrain.py][INFO] Epoch:[0/2](454900/4588595) loss:2.774 lr:0.0000100 epoch_Time:26220.0min: [2024-01-04 17:45:11,361][model8_pretrain.py][INFO] Epoch:[0/2](454900/4588595) loss:2.711 lr:0.0000100 epoch_Time:26220.0min: [2024-01-04 17:45:11,361][model8_pretrain.py][INFO] Epoch:[0/2](454900/4588595) loss:2.966 lr:0.0000100 epoch_Time:26220.0min: [2024-01-04 17:45:11,361][model8_pretrain.py][INFO] Epoch:[0/2](454900/4588595) loss:3.111 lr:0.0000100 epoch_Time:26220.0min: [2024-01-04 17:45:48,331][model8_pretrain.py][INFO] Epoch:[0/2](455000/4588595) loss:2.623 lr:0.0000100 epoch_Time:26218.0min: [2024-01-04 17:45:48,331][model8_pretrain.py][INFO] Epoch:[0/2](455000/4588595) loss:2.370 lr:0.0000100 epoch_Time:26218.0min: [2024-01-04 17:45:48,331][model8_pretrain.py][INFO] Epoch:[0/2](455000/4588595) loss:2.772 lr:0.0000100 epoch_Time:26218.0min: [2024-01-04 17:45:48,331][model8_pretrain.py][INFO] Epoch:[0/2](455000/4588595) loss:3.056 lr:0.0000100 epoch_Time:26218.0min: [2024-01-04 17:45:48,331][model8_pretrain.py][INFO] Epoch:[0/2](455000/4588595) loss:3.147 lr:0.0000100 epoch_Time:26218.0min: [2024-01-04 17:45:48,331][model8_pretrain.py][INFO] Epoch:[0/2](455000/4588595) loss:2.960 lr:0.0000100 epoch_Time:26218.0min: [2024-01-04 17:45:48,331][model8_pretrain.py][INFO] Epoch:[0/2](455000/4588595) loss:2.790 lr:0.0000100 epoch_Time:26218.0min: [2024-01-04 17:45:48,331][model8_pretrain.py][INFO] Epoch:[0/2](455000/4588595) loss:2.998 lr:0.0000100 epoch_Time:26218.0min: [2024-01-04 17:46:25,272][model8_pretrain.py][INFO] Epoch:[0/2](455100/4588595) loss:2.573 lr:0.0000100 epoch_Time:26218.0min: [2024-01-04 17:46:25,272][model8_pretrain.py][INFO] Epoch:[0/2](455100/4588595) loss:3.224 lr:0.0000100 epoch_Time:26218.0min: [2024-01-04 17:46:25,272][model8_pretrain.py][INFO] Epoch:[0/2](455100/4588595) loss:3.075 lr:0.0000100 epoch_Time:26218.0min: [2024-01-04 
17:46:25,272][model8_pretrain.py][INFO] Epoch:[0/2](455100/4588595) loss:3.024 lr:0.0000100 epoch_Time:26218.0min: [2024-01-04 17:46:25,272][model8_pretrain.py][INFO] Epoch:[0/2](455100/4588595) loss:3.263 lr:0.0000100 epoch_Time:26218.0min: [2024-01-04 17:46:25,272][model8_pretrain.py][INFO] Epoch:[0/2](455100/4588595) loss:3.087 lr:0.0000100 epoch_Time:26218.0min: [2024-01-04 17:46:25,272][model8_pretrain.py][INFO] Epoch:[0/2](455100/4588595) loss:2.384 lr:0.0000100 epoch_Time:26218.0min: [2024-01-04 17:46:25,272][model8_pretrain.py][INFO] Epoch:[0/2](455100/4588595) loss:3.285 lr:0.0000100 epoch_Time:26218.0min: [2024-01-04 17:47:02,209][model8_pretrain.py][INFO] Epoch:[0/2](455200/4588595) loss:3.001 lr:0.0000100 epoch_Time:26217.0min: [2024-01-04 17:47:02,209][model8_pretrain.py][INFO] Epoch:[0/2](455200/4588595) loss:2.615 lr:0.0000100 epoch_Time:26217.0min: [2024-01-04 17:47:02,209][model8_pretrain.py][INFO] Epoch:[0/2](455200/4588595) loss:2.988 lr:0.0000100 epoch_Time:26217.0min: [2024-01-04 17:47:02,209][model8_pretrain.py][INFO] Epoch:[0/2](455200/4588595) loss:3.161 lr:0.0000100 epoch_Time:26217.0min: [2024-01-04 17:47:02,209][model8_pretrain.py][INFO] Epoch:[0/2](455200/4588595) loss:2.783 lr:0.0000100 epoch_Time:26217.0min: [2024-01-04 17:47:02,209][model8_pretrain.py][INFO] Epoch:[0/2](455200/4588595) loss:2.708 lr:0.0000100 epoch_Time:26217.0min: [2024-01-04 17:47:02,209][model8_pretrain.py][INFO] Epoch:[0/2](455200/4588595) loss:3.132 lr:0.0000100 epoch_Time:26217.0min: [2024-01-04 17:47:02,209][model8_pretrain.py][INFO] Epoch:[0/2](455200/4588595) loss:3.435 lr:0.0000100 epoch_Time:26217.0min: [2024-01-04 17:47:39,136][model8_pretrain.py][INFO] Epoch:[0/2](455300/4588595) loss:3.025 lr:0.0000100 epoch_Time:26217.0min: [2024-01-04 17:47:39,136][model8_pretrain.py][INFO] Epoch:[0/2](455300/4588595) loss:3.161 lr:0.0000100 epoch_Time:26217.0min: [2024-01-04 17:47:39,136][model8_pretrain.py][INFO] Epoch:[0/2](455300/4588595) loss:3.401 lr:0.0000100 
epoch_Time:26217.0min: [2024-01-04 17:47:39,136][model8_pretrain.py][INFO] Epoch:[0/2](455300/4588595) loss:2.234 lr:0.0000100 epoch_Time:26217.0min: [2024-01-04 17:47:39,136][model8_pretrain.py][INFO] Epoch:[0/2](455300/4588595) loss:2.875 lr:0.0000100 epoch_Time:26217.0min: [2024-01-04 17:47:39,136][model8_pretrain.py][INFO] Epoch:[0/2](455300/4588595) loss:2.749 lr:0.0000100 epoch_Time:26217.0min: [2024-01-04 17:47:39,136][model8_pretrain.py][INFO] Epoch:[0/2](455300/4588595) loss:3.304 lr:0.0000100 epoch_Time:26217.0min: [2024-01-04 17:47:39,136][model8_pretrain.py][INFO] Epoch:[0/2](455300/4588595) loss:2.834 lr:0.0000100 epoch_Time:26217.0min: [2024-01-04 17:48:16,088][model8_pretrain.py][INFO] Epoch:[0/2](455400/4588595) loss:2.696 lr:0.0000100 epoch_Time:26216.0min: [2024-01-04 17:48:16,088][model8_pretrain.py][INFO] Epoch:[0/2](455400/4588595) loss:3.113 lr:0.0000100 epoch_Time:26216.0min: [2024-01-04 17:48:16,088][model8_pretrain.py][INFO] Epoch:[0/2](455400/4588595) loss:2.709 lr:0.0000100 epoch_Time:26216.0min: [2024-01-04 17:48:16,088][model8_pretrain.py][INFO] Epoch:[0/2](455400/4588595) loss:2.580 lr:0.0000100 epoch_Time:26216.0min: [2024-01-04 17:48:16,088][model8_pretrain.py][INFO] Epoch:[0/2](455400/4588595) loss:2.333 lr:0.0000100 epoch_Time:26216.0min: [2024-01-04 17:48:16,088][model8_pretrain.py][INFO] Epoch:[0/2](455400/4588595) loss:2.463 lr:0.0000100 epoch_Time:26216.0min: [2024-01-04 17:48:16,088][model8_pretrain.py][INFO] Epoch:[0/2](455400/4588595) loss:2.650 lr:0.0000100 epoch_Time:26216.0min: [2024-01-04 17:48:16,088][model8_pretrain.py][INFO] Epoch:[0/2](455400/4588595) loss:3.041 lr:0.0000100 epoch_Time:26216.0min: [2024-01-04 17:49:03,000][model8_pretrain.py][INFO] Epoch:[0/2](455500/4588595) loss:2.513 lr:0.0000100 epoch_Time:26216.0min: [2024-01-04 17:49:03,000][model8_pretrain.py][INFO] Epoch:[0/2](455500/4588595) loss:3.075 lr:0.0000100 epoch_Time:26216.0min: [2024-01-04 17:49:03,000][model8_pretrain.py][INFO] 
Epoch:[0/2](455500/4588595) loss:3.154 lr:0.0000100 epoch_Time:26216.0min: [2024-01-04 17:49:03,000][model8_pretrain.py][INFO] Epoch:[0/2](455500/4588595) loss:3.014 lr:0.0000100 epoch_Time:26216.0min: [2024-01-04 17:49:03,000][model8_pretrain.py][INFO] Epoch:[0/2](455500/4588595) loss:3.010 lr:0.0000100 epoch_Time:26216.0min: [2024-01-04 17:49:03,000][model8_pretrain.py][INFO] Epoch:[0/2](455500/4588595) loss:3.099 lr:0.0000100 epoch_Time:26216.0min: [2024-01-04 17:49:03,000][model8_pretrain.py][INFO] Epoch:[0/2](455500/4588595) loss:2.626 lr:0.0000100 epoch_Time:26216.0min: [2024-01-04 17:49:03,000][model8_pretrain.py][INFO] Epoch:[0/2](455500/4588595) loss:2.760 lr:0.0000100 epoch_Time:26216.0min: [2024-01-04 17:49:39,915][model8_pretrain.py][INFO] Epoch:[0/2](455600/4588595) loss:2.978 lr:0.0000100 epoch_Time:26216.0min: [2024-01-04 17:49:39,915][model8_pretrain.py][INFO] Epoch:[0/2](455600/4588595) loss:2.684 lr:0.0000100 epoch_Time:26216.0min: [2024-01-04 17:49:39,915][model8_pretrain.py][INFO] Epoch:[0/2](455600/4588595) loss:2.837 lr:0.0000100 epoch_Time:26216.0min: [2024-01-04 17:49:39,915][model8_pretrain.py][INFO] Epoch:[0/2](455600/4588595) loss:2.663 lr:0.0000100 epoch_Time:26216.0min: [2024-01-04 17:49:39,915][model8_pretrain.py][INFO] Epoch:[0/2](455600/4588595) loss:2.469 lr:0.0000100 epoch_Time:26216.0min: [2024-01-04 17:49:39,915][model8_pretrain.py][INFO] Epoch:[0/2](455600/4588595) loss:3.513 lr:0.0000100 epoch_Time:26216.0min: [2024-01-04 17:49:39,915][model8_pretrain.py][INFO] Epoch:[0/2](455600/4588595) loss:3.264 lr:0.0000100 epoch_Time:26216.0min: [2024-01-04 17:49:39,915][model8_pretrain.py][INFO] Epoch:[0/2](455600/4588595) loss:2.636 lr:0.0000100 epoch_Time:26216.0min: [2024-01-04 17:50:16,844][model8_pretrain.py][INFO] Epoch:[0/2](455700/4588595) loss:2.507 lr:0.0000100 epoch_Time:26215.0min: [2024-01-04 17:50:16,844][model8_pretrain.py][INFO] Epoch:[0/2](455700/4588595) loss:2.665 lr:0.0000100 epoch_Time:26215.0min: [2024-01-04 
17:50:16,844][model8_pretrain.py][INFO] Epoch:[0/2](455700/4588595) loss:2.881 lr:0.0000100 epoch_Time:26215.0min: [2024-01-04 17:50:16,844][model8_pretrain.py][INFO] Epoch:[0/2](455700/4588595) loss:2.786 lr:0.0000100 epoch_Time:26215.0min: [2024-01-04 17:50:16,844][model8_pretrain.py][INFO] Epoch:[0/2](455700/4588595) loss:2.598 lr:0.0000100 epoch_Time:26215.0min: [2024-01-04 17:50:16,844][model8_pretrain.py][INFO] Epoch:[0/2](455700/4588595) loss:3.327 lr:0.0000100 epoch_Time:26215.0min: [2024-01-04 17:50:16,844][model8_pretrain.py][INFO] Epoch:[0/2](455700/4588595) loss:2.602 lr:0.0000100 epoch_Time:26215.0min: [2024-01-04 17:50:16,844][model8_pretrain.py][INFO] Epoch:[0/2](455700/4588595) loss:2.564 lr:0.0000100 epoch_Time:26215.0min: [2024-01-04 17:50:53,754][model8_pretrain.py][INFO] Epoch:[0/2](455800/4588595) loss:2.437 lr:0.0000100 epoch_Time:26214.0min: [2024-01-04 17:50:53,754][model8_pretrain.py][INFO] Epoch:[0/2](455800/4588595) loss:2.818 lr:0.0000100 epoch_Time:26214.0min: [2024-01-04 17:50:53,754][model8_pretrain.py][INFO] Epoch:[0/2](455800/4588595) loss:3.413 lr:0.0000100 epoch_Time:26214.0min: [2024-01-04 17:50:53,754][model8_pretrain.py][INFO] Epoch:[0/2](455800/4588595) loss:3.147 lr:0.0000100 epoch_Time:26214.0min: [2024-01-04 17:50:53,754][model8_pretrain.py][INFO] Epoch:[0/2](455800/4588595) loss:2.759 lr:0.0000100 epoch_Time:26214.0min: [2024-01-04 17:50:53,754][model8_pretrain.py][INFO] Epoch:[0/2](455800/4588595) loss:3.092 lr:0.0000100 epoch_Time:26214.0min: [2024-01-04 17:50:53,754][model8_pretrain.py][INFO] Epoch:[0/2](455800/4588595) loss:2.474 lr:0.0000100 epoch_Time:26214.0min: [2024-01-04 17:50:53,754][model8_pretrain.py][INFO] Epoch:[0/2](455800/4588595) loss:2.688 lr:0.0000100 epoch_Time:26214.0min: [2024-01-04 17:51:30,691][model8_pretrain.py][INFO] Epoch:[0/2](455900/4588595) loss:3.226 lr:0.0000100 epoch_Time:26213.0min: [2024-01-04 17:51:30,691][model8_pretrain.py][INFO] Epoch:[0/2](455900/4588595) loss:2.501 lr:0.0000100 
epoch_Time:26213.0min: [2024-01-04 17:51:30,691][model8_pretrain.py][INFO] Epoch:[0/2](455900/4588595) loss:3.135 lr:0.0000100 epoch_Time:26213.0min: [2024-01-04 17:51:30,691][model8_pretrain.py][INFO] Epoch:[0/2](455900/4588595) loss:3.120 lr:0.0000100 epoch_Time:26213.0min: [2024-01-04 17:51:30,691][model8_pretrain.py][INFO] Epoch:[0/2](455900/4588595) loss:3.237 lr:0.0000100 epoch_Time:26213.0min: [2024-01-04 17:51:30,691][model8_pretrain.py][INFO] Epoch:[0/2](455900/4588595) loss:2.399 lr:0.0000100 epoch_Time:26213.0min: [2024-01-04 17:51:30,691][model8_pretrain.py][INFO] Epoch:[0/2](455900/4588595) loss:2.571 lr:0.0000100 epoch_Time:26213.0min: [2024-01-04 17:51:30,691][model8_pretrain.py][INFO] Epoch:[0/2](455900/4588595) loss:2.408 lr:0.0000100 epoch_Time:26213.0min: [2024-01-04 17:52:07,632][model8_pretrain.py][INFO] Epoch:[0/2](456000/4588595) loss:2.840 lr:0.0000100 epoch_Time:26212.0min: [2024-01-04 17:52:07,632][model8_pretrain.py][INFO] Epoch:[0/2](456000/4588595) loss:2.998 lr:0.0000100 epoch_Time:26212.0min: [2024-01-04 17:52:07,632][model8_pretrain.py][INFO] Epoch:[0/2](456000/4588595) loss:2.953 lr:0.0000100 epoch_Time:26212.0min: [2024-01-04 17:52:07,632][model8_pretrain.py][INFO] Epoch:[0/2](456000/4588595) loss:2.714 lr:0.0000100 epoch_Time:26212.0min: [2024-01-04 17:52:07,632][model8_pretrain.py][INFO] Epoch:[0/2](456000/4588595) loss:2.425 lr:0.0000100 epoch_Time:26212.0min: [2024-01-04 17:52:07,632][model8_pretrain.py][INFO] Epoch:[0/2](456000/4588595) loss:2.737 lr:0.0000100 epoch_Time:26212.0min: [2024-01-04 17:52:07,632][model8_pretrain.py][INFO] Epoch:[0/2](456000/4588595) loss:2.720 lr:0.0000100 epoch_Time:26212.0min: [2024-01-04 17:52:07,632][model8_pretrain.py][INFO] Epoch:[0/2](456000/4588595) loss:2.786 lr:0.0000100 epoch_Time:26212.0min: [2024-01-04 17:52:44,560][model8_pretrain.py][INFO] Epoch:[0/2](456100/4588595) loss:3.084 lr:0.0000100 epoch_Time:26212.0min: [2024-01-04 17:52:44,560][model8_pretrain.py][INFO] 
Epoch:[0/2](456100/4588595) loss:2.805 lr:0.0000100 epoch_Time:26212.0min: [2024-01-04 17:52:44,560][model8_pretrain.py][INFO] Epoch:[0/2](456100/4588595) loss:2.700 lr:0.0000100 epoch_Time:26212.0min: [2024-01-04 17:52:44,560][model8_pretrain.py][INFO] Epoch:[0/2](456100/4588595) loss:2.991 lr:0.0000100 epoch_Time:26212.0min: [2024-01-04 17:52:44,560][model8_pretrain.py][INFO] Epoch:[0/2](456100/4588595) loss:3.119 lr:0.0000100 epoch_Time:26212.0min: [2024-01-04 17:52:44,560][model8_pretrain.py][INFO] Epoch:[0/2](456100/4588595) loss:2.769 lr:0.0000100 epoch_Time:26212.0min: [2024-01-04 17:52:44,560][model8_pretrain.py][INFO] Epoch:[0/2](456100/4588595) loss:2.772 lr:0.0000100 epoch_Time:26212.0min: [2024-01-04 17:52:44,561][model8_pretrain.py][INFO] Epoch:[0/2](456100/4588595) loss:3.393 lr:0.0000100 epoch_Time:26212.0min: [2024-01-04 17:53:21,498][model8_pretrain.py][INFO] Epoch:[0/2](456200/4588595) loss:2.560 lr:0.0000100 epoch_Time:26211.0min: [2024-01-04 17:53:21,498][model8_pretrain.py][INFO] Epoch:[0/2](456200/4588595) loss:3.427 lr:0.0000100 epoch_Time:26211.0min: [2024-01-04 17:53:21,498][model8_pretrain.py][INFO] Epoch:[0/2](456200/4588595) loss:2.204 lr:0.0000100 epoch_Time:26211.0min: [2024-01-04 17:53:21,498][model8_pretrain.py][INFO] Epoch:[0/2](456200/4588595) loss:2.916 lr:0.0000100 epoch_Time:26211.0min: [2024-01-04 17:53:21,498][model8_pretrain.py][INFO] Epoch:[0/2](456200/4588595) loss:2.747 lr:0.0000100 epoch_Time:26211.0min: [2024-01-04 17:53:21,498][model8_pretrain.py][INFO] Epoch:[0/2](456200/4588595) loss:3.094 lr:0.0000100 epoch_Time:26211.0min: [2024-01-04 17:53:21,498][model8_pretrain.py][INFO] Epoch:[0/2](456200/4588595) loss:2.817 lr:0.0000100 epoch_Time:26211.0min: [2024-01-04 17:53:21,498][model8_pretrain.py][INFO] Epoch:[0/2](456200/4588595) loss:3.578 lr:0.0000100 epoch_Time:26211.0min: [2024-01-04 17:54:10,259][model8_pretrain.py][INFO] Epoch:[0/2](456300/4588595) loss:2.699 lr:0.0000100 epoch_Time:26212.0min: [2024-01-04 
17:54:10,259][model8_pretrain.py][INFO] Epoch:[0/2](456300/4588595) loss:2.915 lr:0.0000100 epoch_Time:26212.0min: [2024-01-04 17:54:10,259][model8_pretrain.py][INFO] Epoch:[0/2](456300/4588595) loss:2.926 lr:0.0000100 epoch_Time:26212.0min: [2024-01-04 17:54:10,259][model8_pretrain.py][INFO] Epoch:[0/2](456300/4588595) loss:3.019 lr:0.0000100 epoch_Time:26212.0min: [2024-01-04 17:54:10,260][model8_pretrain.py][INFO] Epoch:[0/2](456300/4588595) loss:3.449 lr:0.0000100 epoch_Time:26212.0min: [2024-01-04 17:54:10,259][model8_pretrain.py][INFO] Epoch:[0/2](456300/4588595) loss:2.292 lr:0.0000100 epoch_Time:26212.0min: [2024-01-04 17:54:10,260][model8_pretrain.py][INFO] Epoch:[0/2](456300/4588595) loss:2.997 lr:0.0000100 epoch_Time:26212.0min: [2024-01-04 17:54:10,260][model8_pretrain.py][INFO] Epoch:[0/2](456300/4588595) loss:2.797 lr:0.0000100 epoch_Time:26212.0min: [2024-01-04 17:54:47,173][model8_pretrain.py][INFO] Epoch:[0/2](456400/4588595) loss:3.467 lr:0.0000100 epoch_Time:26211.0min: [2024-01-04 17:54:47,173][model8_pretrain.py][INFO] Epoch:[0/2](456400/4588595) loss:2.706 lr:0.0000100 epoch_Time:26211.0min: [2024-01-04 17:54:47,173][model8_pretrain.py][INFO] Epoch:[0/2](456400/4588595) loss:3.136 lr:0.0000100 epoch_Time:26211.0min: [2024-01-04 17:54:47,173][model8_pretrain.py][INFO] Epoch:[0/2](456400/4588595) loss:3.105 lr:0.0000100 epoch_Time:26211.0min: [2024-01-04 17:54:47,173][model8_pretrain.py][INFO] Epoch:[0/2](456400/4588595) loss:2.783 lr:0.0000100 epoch_Time:26211.0min: [2024-01-04 17:54:47,173][model8_pretrain.py][INFO] Epoch:[0/2](456400/4588595) loss:3.207 lr:0.0000100 epoch_Time:26211.0min: [2024-01-04 17:54:47,173][model8_pretrain.py][INFO] Epoch:[0/2](456400/4588595) loss:2.721 lr:0.0000100 epoch_Time:26211.0min: [2024-01-04 17:54:47,174][model8_pretrain.py][INFO] Epoch:[0/2](456400/4588595) loss:2.341 lr:0.0000100 epoch_Time:26211.0min: [2024-01-04 17:55:24,095][model8_pretrain.py][INFO] Epoch:[0/2](456500/4588595) loss:2.990 lr:0.0000100 
epoch_Time:26210.0min: [2024-01-04 17:55:24,095][model8_pretrain.py][INFO] Epoch:[0/2](456500/4588595) loss:2.842 lr:0.0000100 epoch_Time:26210.0min: [2024-01-04 17:55:24,095][model8_pretrain.py][INFO] Epoch:[0/2](456500/4588595) loss:2.990 lr:0.0000100 epoch_Time:26210.0min: [2024-01-04 17:55:24,095][model8_pretrain.py][INFO] Epoch:[0/2](456500/4588595) loss:2.711 lr:0.0000100 epoch_Time:26210.0min: [2024-01-04 17:55:24,095][model8_pretrain.py][INFO] Epoch:[0/2](456500/4588595) loss:2.845 lr:0.0000100 epoch_Time:26210.0min: [2024-01-04 17:55:24,096][model8_pretrain.py][INFO] Epoch:[0/2](456500/4588595) loss:2.284 lr:0.0000100 epoch_Time:26210.0min: [2024-01-04 17:55:24,096][model8_pretrain.py][INFO] Epoch:[0/2](456500/4588595) loss:2.748 lr:0.0000100 epoch_Time:26210.0min: [2024-01-04 17:55:24,096][model8_pretrain.py][INFO] Epoch:[0/2](456500/4588595) loss:2.834 lr:0.0000100 epoch_Time:26210.0min: [2024-01-04 17:56:01,032][model8_pretrain.py][INFO] Epoch:[0/2](456600/4588595) loss:2.866 lr:0.0000100 epoch_Time:26209.0min: [2024-01-04 17:56:01,032][model8_pretrain.py][INFO] Epoch:[0/2](456600/4588595) loss:2.981 lr:0.0000100 epoch_Time:26209.0min: [2024-01-04 17:56:01,032][model8_pretrain.py][INFO] Epoch:[0/2](456600/4588595) loss:2.792 lr:0.0000100 epoch_Time:26209.0min: [2024-01-04 17:56:01,032][model8_pretrain.py][INFO] Epoch:[0/2](456600/4588595) loss:2.366 lr:0.0000100 epoch_Time:26209.0min: [2024-01-04 17:56:01,032][model8_pretrain.py][INFO] Epoch:[0/2](456600/4588595) loss:2.875 lr:0.0000100 epoch_Time:26209.0min: [2024-01-04 17:56:01,032][model8_pretrain.py][INFO] Epoch:[0/2](456600/4588595) loss:2.760 lr:0.0000100 epoch_Time:26209.0min: [2024-01-04 17:56:01,032][model8_pretrain.py][INFO] Epoch:[0/2](456600/4588595) loss:3.294 lr:0.0000100 epoch_Time:26209.0min: [2024-01-04 17:56:01,032][model8_pretrain.py][INFO] Epoch:[0/2](456600/4588595) loss:2.686 lr:0.0000100 epoch_Time:26209.0min: [2024-01-04 17:56:37,952][model8_pretrain.py][INFO] 
Epoch:[0/2](456700/4588595) loss:2.925 lr:0.0000100 epoch_Time:26209.0min: [2024-01-04 17:56:37,952][model8_pretrain.py][INFO] Epoch:[0/2](456700/4588595) loss:2.546 lr:0.0000100 epoch_Time:26209.0min: [2024-01-04 17:56:37,952][model8_pretrain.py][INFO] Epoch:[0/2](456700/4588595) loss:2.957 lr:0.0000100 epoch_Time:26209.0min: [2024-01-04 17:56:37,952][model8_pretrain.py][INFO] Epoch:[0/2](456700/4588595) loss:2.999 lr:0.0000100 epoch_Time:26209.0min: [2024-01-04 17:56:37,952][model8_pretrain.py][INFO] Epoch:[0/2](456700/4588595) loss:2.608 lr:0.0000100 epoch_Time:26209.0min: [2024-01-04 17:56:37,952][model8_pretrain.py][INFO] Epoch:[0/2](456700/4588595) loss:2.751 lr:0.0000100 epoch_Time:26209.0min: [2024-01-04 17:56:37,952][model8_pretrain.py][INFO] Epoch:[0/2](456700/4588595) loss:2.934 lr:0.0000100 epoch_Time:26209.0min: [2024-01-04 17:56:37,952][model8_pretrain.py][INFO] Epoch:[0/2](456700/4588595) loss:3.189 lr:0.0000100 epoch_Time:26209.0min: [2024-01-04 17:57:14,886][model8_pretrain.py][INFO] Epoch:[0/2](456800/4588595) loss:2.742 lr:0.0000100 epoch_Time:26208.0min: [2024-01-04 17:57:14,886][model8_pretrain.py][INFO] Epoch:[0/2](456800/4588595) loss:3.108 lr:0.0000100 epoch_Time:26208.0min: [2024-01-04 17:57:14,886][model8_pretrain.py][INFO] Epoch:[0/2](456800/4588595) loss:2.494 lr:0.0000100 epoch_Time:26208.0min: [2024-01-04 17:57:14,886][model8_pretrain.py][INFO] Epoch:[0/2](456800/4588595) loss:2.088 lr:0.0000100 epoch_Time:26208.0min: [2024-01-04 17:57:14,886][model8_pretrain.py][INFO] Epoch:[0/2](456800/4588595) loss:2.629 lr:0.0000100 epoch_Time:26208.0min: [2024-01-04 17:57:14,886][model8_pretrain.py][INFO] Epoch:[0/2](456800/4588595) loss:2.925 lr:0.0000100 epoch_Time:26208.0min: [2024-01-04 17:57:14,886][model8_pretrain.py][INFO] Epoch:[0/2](456800/4588595) loss:2.688 lr:0.0000100 epoch_Time:26208.0min: [2024-01-04 17:57:14,886][model8_pretrain.py][INFO] Epoch:[0/2](456800/4588595) loss:3.112 lr:0.0000100 epoch_Time:26208.0min: [2024-01-04 
17:57:51,815][model8_pretrain.py][INFO] Epoch:[0/2](456900/4588595) loss:2.810 lr:0.0000100 epoch_Time:26206.0min: [2024-01-04 17:57:51,815][model8_pretrain.py][INFO] Epoch:[0/2](456900/4588595) loss:2.624 lr:0.0000100 epoch_Time:26206.0min: [2024-01-04 17:57:51,815][model8_pretrain.py][INFO] Epoch:[0/2](456900/4588595) loss:2.668 lr:0.0000100 epoch_Time:26206.0min: [2024-01-04 17:57:51,815][model8_pretrain.py][INFO] Epoch:[0/2](456900/4588595) loss:2.587 lr:0.0000100 epoch_Time:26206.0min: [2024-01-04 17:57:51,815][model8_pretrain.py][INFO] Epoch:[0/2](456900/4588595) loss:3.184 lr:0.0000100 epoch_Time:26206.0min: [2024-01-04 17:57:51,815][model8_pretrain.py][INFO] Epoch:[0/2](456900/4588595) loss:2.704 lr:0.0000100 epoch_Time:26206.0min: [2024-01-04 17:57:51,815][model8_pretrain.py][INFO] Epoch:[0/2](456900/4588595) loss:2.838 lr:0.0000100 epoch_Time:26206.0min: [2024-01-04 17:57:51,815][model8_pretrain.py][INFO] Epoch:[0/2](456900/4588595) loss:2.796 lr:0.0000100 epoch_Time:26206.0min: [2024-01-04 17:58:28,773][model8_pretrain.py][INFO] Epoch:[0/2](457000/4588595) loss:3.139 lr:0.0000100 epoch_Time:26206.0min: [2024-01-04 17:58:28,773][model8_pretrain.py][INFO] Epoch:[0/2](457000/4588595) loss:2.855 lr:0.0000100 epoch_Time:26206.0min: [2024-01-04 17:58:28,773][model8_pretrain.py][INFO] Epoch:[0/2](457000/4588595) loss:2.893 lr:0.0000100 epoch_Time:26206.0min: [2024-01-04 17:58:28,773][model8_pretrain.py][INFO] Epoch:[0/2](457000/4588595) loss:2.942 lr:0.0000100 epoch_Time:26206.0min: [2024-01-04 17:58:28,773][model8_pretrain.py][INFO] Epoch:[0/2](457000/4588595) loss:3.177 lr:0.0000100 epoch_Time:26206.0min: [2024-01-04 17:58:28,773][model8_pretrain.py][INFO] Epoch:[0/2](457000/4588595) loss:2.869 lr:0.0000100 epoch_Time:26206.0min: [2024-01-04 17:58:28,773][model8_pretrain.py][INFO] Epoch:[0/2](457000/4588595) loss:2.682 lr:0.0000100 epoch_Time:26206.0min: [2024-01-04 17:58:28,773][model8_pretrain.py][INFO] Epoch:[0/2](457000/4588595) loss:2.757 lr:0.0000100 
epoch_Time:26206.0min: [2024-01-04 17:59:15,650][model8_pretrain.py][INFO] Epoch:[0/2](457100/4588595) loss:3.342 lr:0.0000100 epoch_Time:26207.0min: [2024-01-04 17:59:15,650][model8_pretrain.py][INFO] Epoch:[0/2](457100/4588595) loss:2.745 lr:0.0000100 epoch_Time:26207.0min: [2024-01-04 17:59:15,650][model8_pretrain.py][INFO] Epoch:[0/2](457100/4588595) loss:2.504 lr:0.0000100 epoch_Time:26207.0min: [2024-01-04 17:59:15,650][model8_pretrain.py][INFO] Epoch:[0/2](457100/4588595) loss:2.428 lr:0.0000100 epoch_Time:26207.0min: [2024-01-04 17:59:15,650][model8_pretrain.py][INFO] Epoch:[0/2](457100/4588595) loss:3.236 lr:0.0000100 epoch_Time:26207.0min: [2024-01-04 17:59:15,650][model8_pretrain.py][INFO] Epoch:[0/2](457100/4588595) loss:3.109 lr:0.0000100 epoch_Time:26207.0min: [2024-01-04 17:59:15,650][model8_pretrain.py][INFO] Epoch:[0/2](457100/4588595) loss:2.583 lr:0.0000100 epoch_Time:26207.0min: [2024-01-04 17:59:15,650][model8_pretrain.py][INFO] Epoch:[0/2](457100/4588595) loss:2.737 lr:0.0000100 epoch_Time:26207.0min: [2024-01-04 17:59:52,595][model8_pretrain.py][INFO] Epoch:[0/2](457200/4588595) loss:2.662 lr:0.0000100 epoch_Time:26206.0min: [2024-01-04 17:59:52,595][model8_pretrain.py][INFO] Epoch:[0/2](457200/4588595) loss:3.314 lr:0.0000100 epoch_Time:26206.0min: [2024-01-04 17:59:52,595][model8_pretrain.py][INFO] Epoch:[0/2](457200/4588595) loss:2.598 lr:0.0000100 epoch_Time:26206.0min: [2024-01-04 17:59:52,595][model8_pretrain.py][INFO] Epoch:[0/2](457200/4588595) loss:2.607 lr:0.0000100 epoch_Time:26206.0min: [2024-01-04 17:59:52,595][model8_pretrain.py][INFO] Epoch:[0/2](457200/4588595) loss:2.449 lr:0.0000100 epoch_Time:26206.0min: [2024-01-04 17:59:52,595][model8_pretrain.py][INFO] Epoch:[0/2](457200/4588595) loss:3.005 lr:0.0000100 epoch_Time:26206.0min: [2024-01-04 17:59:52,595][model8_pretrain.py][INFO] Epoch:[0/2](457200/4588595) loss:3.185 lr:0.0000100 epoch_Time:26206.0min: [2024-01-04 17:59:52,595][model8_pretrain.py][INFO] 
Epoch:[0/2](457200/4588595) loss:3.272 lr:0.0000100 epoch_Time:26206.0min: [2024-01-04 18:00:29,536][model8_pretrain.py][INFO] Epoch:[0/2](457300/4588595) loss:2.998 lr:0.0000100 epoch_Time:26205.0min: [2024-01-04 18:00:29,536][model8_pretrain.py][INFO] Epoch:[0/2](457300/4588595) loss:2.796 lr:0.0000100 epoch_Time:26205.0min: [2024-01-04 18:00:29,536][model8_pretrain.py][INFO] Epoch:[0/2](457300/4588595) loss:2.523 lr:0.0000100 epoch_Time:26205.0min: [2024-01-04 18:00:29,536][model8_pretrain.py][INFO] Epoch:[0/2](457300/4588595) loss:2.579 lr:0.0000100 epoch_Time:26205.0min: [2024-01-04 18:00:29,536][model8_pretrain.py][INFO] Epoch:[0/2](457300/4588595) loss:2.304 lr:0.0000100 epoch_Time:26205.0min: [2024-01-04 18:00:29,536][model8_pretrain.py][INFO] Epoch:[0/2](457300/4588595) loss:2.885 lr:0.0000100 epoch_Time:26205.0min: [2024-01-04 18:00:29,536][model8_pretrain.py][INFO] Epoch:[0/2](457300/4588595) loss:2.766 lr:0.0000100 epoch_Time:26205.0min: [2024-01-04 18:00:29,536][model8_pretrain.py][INFO] Epoch:[0/2](457300/4588595) loss:3.052 lr:0.0000100 epoch_Time:26205.0min: [2024-01-04 18:01:06,469][model8_pretrain.py][INFO] Epoch:[0/2](457400/4588595) loss:2.433 lr:0.0000100 epoch_Time:26204.0min: [2024-01-04 18:01:06,469][model8_pretrain.py][INFO] Epoch:[0/2](457400/4588595) loss:3.001 lr:0.0000100 epoch_Time:26204.0min: [2024-01-04 18:01:06,469][model8_pretrain.py][INFO] Epoch:[0/2](457400/4588595) loss:2.949 lr:0.0000100 epoch_Time:26204.0min: [2024-01-04 18:01:06,469][model8_pretrain.py][INFO] Epoch:[0/2](457400/4588595) loss:2.746 lr:0.0000100 epoch_Time:26204.0min: [2024-01-04 18:01:06,469][model8_pretrain.py][INFO] Epoch:[0/2](457400/4588595) loss:2.743 lr:0.0000100 epoch_Time:26204.0min: [2024-01-04 18:01:06,469][model8_pretrain.py][INFO] Epoch:[0/2](457400/4588595) loss:2.651 lr:0.0000100 epoch_Time:26204.0min: [2024-01-04 18:01:06,469][model8_pretrain.py][INFO] Epoch:[0/2](457400/4588595) loss:2.497 lr:0.0000100 epoch_Time:26204.0min: [2024-01-04 
18:01:06,469][model8_pretrain.py][INFO] Epoch:[0/2](457400/4588595) loss:3.082 lr:0.0000100 epoch_Time:26204.0min: [2024-01-04 18:01:43,433][model8_pretrain.py][INFO] Epoch:[0/2](457500/4588595) loss:2.649 lr:0.0000100 epoch_Time:26204.0min: [2024-01-04 18:01:43,433][model8_pretrain.py][INFO] Epoch:[0/2](457500/4588595) loss:2.491 lr:0.0000100 epoch_Time:26204.0min: [2024-01-04 18:01:43,433][model8_pretrain.py][INFO] Epoch:[0/2](457500/4588595) loss:3.166 lr:0.0000100 epoch_Time:26204.0min: [2024-01-04 18:01:43,433][model8_pretrain.py][INFO] Epoch:[0/2](457500/4588595) loss:3.285 lr:0.0000100 epoch_Time:26204.0min: [2024-01-04 18:01:43,433][model8_pretrain.py][INFO] Epoch:[0/2](457500/4588595) loss:2.411 lr:0.0000100 epoch_Time:26204.0min: [2024-01-04 18:01:43,433][model8_pretrain.py][INFO] Epoch:[0/2](457500/4588595) loss:2.611 lr:0.0000100 epoch_Time:26204.0min: [2024-01-04 18:01:43,433][model8_pretrain.py][INFO] Epoch:[0/2](457500/4588595) loss:2.449 lr:0.0000100 epoch_Time:26204.0min: [2024-01-04 18:01:43,434][model8_pretrain.py][INFO] Epoch:[0/2](457500/4588595) loss:2.931 lr:0.0000100 epoch_Time:26204.0min: [2024-01-04 18:02:20,376][model8_pretrain.py][INFO] Epoch:[0/2](457600/4588595) loss:3.252 lr:0.0000100 epoch_Time:26203.0min: [2024-01-04 18:02:20,376][model8_pretrain.py][INFO] Epoch:[0/2](457600/4588595) loss:3.017 lr:0.0000100 epoch_Time:26203.0min: [2024-01-04 18:02:20,376][model8_pretrain.py][INFO] Epoch:[0/2](457600/4588595) loss:2.995 lr:0.0000100 epoch_Time:26203.0min: [2024-01-04 18:02:20,376][model8_pretrain.py][INFO] Epoch:[0/2](457600/4588595) loss:2.649 lr:0.0000100 epoch_Time:26203.0min: [2024-01-04 18:02:20,376][model8_pretrain.py][INFO] Epoch:[0/2](457600/4588595) loss:2.338 lr:0.0000100 epoch_Time:26203.0min: [2024-01-04 18:02:20,376][model8_pretrain.py][INFO] Epoch:[0/2](457600/4588595) loss:2.935 lr:0.0000100 epoch_Time:26203.0min: [2024-01-04 18:02:20,376][model8_pretrain.py][INFO] Epoch:[0/2](457600/4588595) loss:2.604 lr:0.0000100 
epoch_Time:26203.0min: [2024-01-04 18:02:20,376][model8_pretrain.py][INFO] Epoch:[0/2](457600/4588595) loss:3.067 lr:0.0000100 epoch_Time:26203.0min: [2024-01-04 18:02:57,330][model8_pretrain.py][INFO] Epoch:[0/2](457700/4588595) loss:3.032 lr:0.0000100 epoch_Time:26202.0min: [2024-01-04 18:02:57,330][model8_pretrain.py][INFO] Epoch:[0/2](457700/4588595) loss:2.918 lr:0.0000100 epoch_Time:26202.0min: [2024-01-04 18:02:57,330][model8_pretrain.py][INFO] Epoch:[0/2](457700/4588595) loss:2.381 lr:0.0000100 epoch_Time:26202.0min: [2024-01-04 18:02:57,330][model8_pretrain.py][INFO] Epoch:[0/2](457700/4588595) loss:3.339 lr:0.0000100 epoch_Time:26202.0min: [2024-01-04 18:02:57,330][model8_pretrain.py][INFO] Epoch:[0/2](457700/4588595) loss:3.215 lr:0.0000100 epoch_Time:26202.0min: [2024-01-04 18:02:57,330][model8_pretrain.py][INFO] Epoch:[0/2](457700/4588595) loss:2.660 lr:0.0000100 epoch_Time:26202.0min: [2024-01-04 18:02:57,330][model8_pretrain.py][INFO] Epoch:[0/2](457700/4588595) loss:2.974 lr:0.0000100 epoch_Time:26202.0min: [2024-01-04 18:02:57,330][model8_pretrain.py][INFO] Epoch:[0/2](457700/4588595) loss:2.823 lr:0.0000100 epoch_Time:26202.0min: [2024-01-04 18:03:34,278][model8_pretrain.py][INFO] Epoch:[0/2](457800/4588595) loss:2.922 lr:0.0000100 epoch_Time:26201.0min: [2024-01-04 18:03:34,278][model8_pretrain.py][INFO] Epoch:[0/2](457800/4588595) loss:2.863 lr:0.0000100 epoch_Time:26201.0min: [2024-01-04 18:03:34,278][model8_pretrain.py][INFO] Epoch:[0/2](457800/4588595) loss:2.254 lr:0.0000100 epoch_Time:26201.0min: [2024-01-04 18:03:34,278][model8_pretrain.py][INFO] Epoch:[0/2](457800/4588595) loss:2.777 lr:0.0000100 epoch_Time:26201.0min: [2024-01-04 18:03:34,278][model8_pretrain.py][INFO] Epoch:[0/2](457800/4588595) loss:2.726 lr:0.0000100 epoch_Time:26201.0min: [2024-01-04 18:03:34,278][model8_pretrain.py][INFO] Epoch:[0/2](457800/4588595) loss:2.562 lr:0.0000100 epoch_Time:26201.0min: [2024-01-04 18:03:34,278][model8_pretrain.py][INFO] 
Epoch:[0/2](457800/4588595) loss:3.358 lr:0.0000100 epoch_Time:26201.0min: [2024-01-04 18:03:34,278][model8_pretrain.py][INFO] Epoch:[0/2](457800/4588595) loss:3.056 lr:0.0000100 epoch_Time:26201.0min: [2024-01-04 18:04:22,064][model8_pretrain.py][INFO] Epoch:[0/2](457900/4588595) loss:3.516 lr:0.0000100 epoch_Time:26202.0min: [2024-01-04 18:04:22,064][model8_pretrain.py][INFO] Epoch:[0/2](457900/4588595) loss:2.733 lr:0.0000100 epoch_Time:26202.0min: [2024-01-04 18:04:22,064][model8_pretrain.py][INFO] Epoch:[0/2](457900/4588595) loss:3.209 lr:0.0000100 epoch_Time:26202.0min: [2024-01-04 18:04:22,064][model8_pretrain.py][INFO] Epoch:[0/2](457900/4588595) loss:2.964 lr:0.0000100 epoch_Time:26202.0min: [2024-01-04 18:04:22,064][model8_pretrain.py][INFO] Epoch:[0/2](457900/4588595) loss:3.098 lr:0.0000100 epoch_Time:26202.0min: [2024-01-04 18:04:22,065][model8_pretrain.py][INFO] Epoch:[0/2](457900/4588595) loss:2.750 lr:0.0000100 epoch_Time:26202.0min: [2024-01-04 18:04:22,065][model8_pretrain.py][INFO] Epoch:[0/2](457900/4588595) loss:2.930 lr:0.0000100 epoch_Time:26202.0min: [2024-01-04 18:04:22,066][model8_pretrain.py][INFO] Epoch:[0/2](457900/4588595) loss:2.969 lr:0.0000100 epoch_Time:26202.0min: [2024-01-04 18:04:58,996][model8_pretrain.py][INFO] Epoch:[0/2](458000/4588595) loss:3.021 lr:0.0000100 epoch_Time:26201.0min: [2024-01-04 18:04:58,996][model8_pretrain.py][INFO] Epoch:[0/2](458000/4588595) loss:3.148 lr:0.0000100 epoch_Time:26201.0min: [2024-01-04 18:04:58,996][model8_pretrain.py][INFO] Epoch:[0/2](458000/4588595) loss:2.616 lr:0.0000100 epoch_Time:26201.0min: [2024-01-04 18:04:58,996][model8_pretrain.py][INFO] Epoch:[0/2](458000/4588595) loss:2.729 lr:0.0000100 epoch_Time:26201.0min: [2024-01-04 18:04:58,996][model8_pretrain.py][INFO] Epoch:[0/2](458000/4588595) loss:2.884 lr:0.0000100 epoch_Time:26201.0min: [2024-01-04 18:04:58,996][model8_pretrain.py][INFO] Epoch:[0/2](458000/4588595) loss:2.543 lr:0.0000100 epoch_Time:26201.0min: [2024-01-04 
18:04:58,996][model8_pretrain.py][INFO] Epoch:[0/2](458000/4588595) loss:3.171 lr:0.0000100 epoch_Time:26201.0min: [2024-01-04 18:04:58,998][model8_pretrain.py][INFO] Epoch:[0/2](458000/4588595) loss:2.828 lr:0.0000100 epoch_Time:26201.0min: [2024-01-04 18:05:35,931][model8_pretrain.py][INFO] Epoch:[0/2](458100/4588595) loss:2.833 lr:0.0000100 epoch_Time:26201.0min: [2024-01-04 18:05:35,931][model8_pretrain.py][INFO] Epoch:[0/2](458100/4588595) loss:2.714 lr:0.0000100 epoch_Time:26201.0min: [2024-01-04 18:05:35,931][model8_pretrain.py][INFO] Epoch:[0/2](458100/4588595) loss:2.938 lr:0.0000100 epoch_Time:26201.0min: [2024-01-04 18:05:35,931][model8_pretrain.py][INFO] Epoch:[0/2](458100/4588595) loss:2.792 lr:0.0000100 epoch_Time:26201.0min: [2024-01-04 18:05:35,931][model8_pretrain.py][INFO] Epoch:[0/2](458100/4588595) loss:2.921 lr:0.0000100 epoch_Time:26201.0min: [2024-01-04 18:05:35,931][model8_pretrain.py][INFO] Epoch:[0/2](458100/4588595) loss:3.088 lr:0.0000100 epoch_Time:26201.0min: [2024-01-04 18:05:35,931][model8_pretrain.py][INFO] Epoch:[0/2](458100/4588595) loss:2.586 lr:0.0000100 epoch_Time:26201.0min: [2024-01-04 18:05:35,931][model8_pretrain.py][INFO] Epoch:[0/2](458100/4588595) loss:2.761 lr:0.0000100 epoch_Time:26201.0min: [2024-01-04 18:06:12,869][model8_pretrain.py][INFO] Epoch:[0/2](458200/4588595) loss:2.767 lr:0.0000100 epoch_Time:26199.0min: [2024-01-04 18:06:12,869][model8_pretrain.py][INFO] Epoch:[0/2](458200/4588595) loss:2.591 lr:0.0000100 epoch_Time:26199.0min: [2024-01-04 18:06:12,869][model8_pretrain.py][INFO] Epoch:[0/2](458200/4588595) loss:2.853 lr:0.0000100 epoch_Time:26199.0min: [2024-01-04 18:06:12,869][model8_pretrain.py][INFO] Epoch:[0/2](458200/4588595) loss:2.209 lr:0.0000100 epoch_Time:26199.0min: [2024-01-04 18:06:12,869][model8_pretrain.py][INFO] Epoch:[0/2](458200/4588595) loss:2.583 lr:0.0000100 epoch_Time:26199.0min: [2024-01-04 18:06:12,869][model8_pretrain.py][INFO] Epoch:[0/2](458200/4588595) loss:2.879 lr:0.0000100 
epoch_Time:26199.0min: [2024-01-04 18:06:12,869][model8_pretrain.py][INFO] Epoch:[0/2](458200/4588595) loss:2.750 lr:0.0000100 epoch_Time:26199.0min: [2024-01-04 18:06:12,870][model8_pretrain.py][INFO] Epoch:[0/2](458200/4588595) loss:2.558 lr:0.0000100 epoch_Time:26199.0min: [2024-01-04 18:06:49,808][model8_pretrain.py][INFO] Epoch:[0/2](458300/4588595) loss:2.688 lr:0.0000100 epoch_Time:26198.0min: [2024-01-04 18:06:49,808][model8_pretrain.py][INFO] Epoch:[0/2](458300/4588595) loss:3.150 lr:0.0000100 epoch_Time:26198.0min: [2024-01-04 18:06:49,808][model8_pretrain.py][INFO] Epoch:[0/2](458300/4588595) loss:2.767 lr:0.0000100 epoch_Time:26198.0min: [2024-01-04 18:06:49,808][model8_pretrain.py][INFO] Epoch:[0/2](458300/4588595) loss:3.031 lr:0.0000100 epoch_Time:26198.0min: [2024-01-04 18:06:49,808][model8_pretrain.py][INFO] Epoch:[0/2](458300/4588595) loss:2.313 lr:0.0000100 epoch_Time:26198.0min: [2024-01-04 18:06:49,808][model8_pretrain.py][INFO] Epoch:[0/2](458300/4588595) loss:2.303 lr:0.0000100 epoch_Time:26198.0min: [2024-01-04 18:06:49,809][model8_pretrain.py][INFO] Epoch:[0/2](458300/4588595) loss:2.458 lr:0.0000100 epoch_Time:26198.0min: [2024-01-04 18:06:49,809][model8_pretrain.py][INFO] Epoch:[0/2](458300/4588595) loss:2.903 lr:0.0000100 epoch_Time:26198.0min: [2024-01-04 18:07:26,741][model8_pretrain.py][INFO] Epoch:[0/2](458400/4588595) loss:2.863 lr:0.0000100 epoch_Time:26198.0min: [2024-01-04 18:07:26,741][model8_pretrain.py][INFO] Epoch:[0/2](458400/4588595) loss:2.932 lr:0.0000100 epoch_Time:26198.0min: [2024-01-04 18:07:26,741][model8_pretrain.py][INFO] Epoch:[0/2](458400/4588595) loss:2.802 lr:0.0000100 epoch_Time:26198.0min: [2024-01-04 18:07:26,741][model8_pretrain.py][INFO] Epoch:[0/2](458400/4588595) loss:3.093 lr:0.0000100 epoch_Time:26198.0min: [2024-01-04 18:07:26,741][model8_pretrain.py][INFO] Epoch:[0/2](458400/4588595) loss:2.959 lr:0.0000100 epoch_Time:26198.0min: [2024-01-04 18:07:26,741][model8_pretrain.py][INFO] 
Epoch:[0/2](458400/4588595) loss:2.044 lr:0.0000100 epoch_Time:26198.0min: [2024-01-04 18:07:26,741][model8_pretrain.py][INFO] Epoch:[0/2](458400/4588595) loss:2.600 lr:0.0000100 epoch_Time:26198.0min: [2024-01-04 18:07:26,741][model8_pretrain.py][INFO] Epoch:[0/2](458400/4588595) loss:2.546 lr:0.0000100 epoch_Time:26198.0min: [2024-01-04 18:08:03,683][model8_pretrain.py][INFO] Epoch:[0/2](458500/4588595) loss:2.971 lr:0.0000100 epoch_Time:26197.0min: [2024-01-04 18:08:03,683][model8_pretrain.py][INFO] Epoch:[0/2](458500/4588595) loss:2.348 lr:0.0000100 epoch_Time:26197.0min: [2024-01-04 18:08:03,683][model8_pretrain.py][INFO] Epoch:[0/2](458500/4588595) loss:2.840 lr:0.0000100 epoch_Time:26197.0min: [2024-01-04 18:08:03,683][model8_pretrain.py][INFO] Epoch:[0/2](458500/4588595) loss:2.584 lr:0.0000100 epoch_Time:26197.0min: [2024-01-04 18:08:03,683][model8_pretrain.py][INFO] Epoch:[0/2](458500/4588595) loss:2.668 lr:0.0000100 epoch_Time:26197.0min: [2024-01-04 18:08:03,683][model8_pretrain.py][INFO] Epoch:[0/2](458500/4588595) loss:2.430 lr:0.0000100 epoch_Time:26197.0min: [2024-01-04 18:08:03,683][model8_pretrain.py][INFO] Epoch:[0/2](458500/4588595) loss:3.026 lr:0.0000100 epoch_Time:26197.0min: [2024-01-04 18:08:03,684][model8_pretrain.py][INFO] Epoch:[0/2](458500/4588595) loss:2.626 lr:0.0000100 epoch_Time:26197.0min: [2024-01-04 18:08:40,628][model8_pretrain.py][INFO] Epoch:[0/2](458600/4588595) loss:3.303 lr:0.0000100 epoch_Time:26197.0min: [2024-01-04 18:08:40,628][model8_pretrain.py][INFO] Epoch:[0/2](458600/4588595) loss:3.361 lr:0.0000100 epoch_Time:26197.0min: [2024-01-04 18:08:40,628][model8_pretrain.py][INFO] Epoch:[0/2](458600/4588595) loss:3.145 lr:0.0000100 epoch_Time:26197.0min: [2024-01-04 18:08:40,628][model8_pretrain.py][INFO] Epoch:[0/2](458600/4588595) loss:3.353 lr:0.0000100 epoch_Time:26197.0min: [2024-01-04 18:08:40,628][model8_pretrain.py][INFO] Epoch:[0/2](458600/4588595) loss:2.653 lr:0.0000100 epoch_Time:26197.0min: [2024-01-04 
18:08:40,628][model8_pretrain.py][INFO] Epoch:[0/2](458600/4588595) loss:3.230 lr:0.0000100 epoch_Time:26197.0min: [2024-01-04 18:08:40,628][model8_pretrain.py][INFO] Epoch:[0/2](458600/4588595) loss:2.394 lr:0.0000100 epoch_Time:26197.0min: [2024-01-04 18:08:40,628][model8_pretrain.py][INFO] Epoch:[0/2](458600/4588595) loss:2.946 lr:0.0000100 epoch_Time:26197.0min: [2024-01-04 18:09:28,434][model8_pretrain.py][INFO] Epoch:[0/2](458700/4588595) loss:2.271 lr:0.0000100 epoch_Time:26197.0min: [2024-01-04 18:09:28,434][model8_pretrain.py][INFO] Epoch:[0/2](458700/4588595) loss:3.208 lr:0.0000100 epoch_Time:26197.0min: [2024-01-04 18:09:28,434][model8_pretrain.py][INFO] Epoch:[0/2](458700/4588595) loss:2.846 lr:0.0000100 epoch_Time:26197.0min: [2024-01-04 18:09:28,434][model8_pretrain.py][INFO] Epoch:[0/2](458700/4588595) loss:2.525 lr:0.0000100 epoch_Time:26197.0min: [2024-01-04 18:09:28,434][model8_pretrain.py][INFO] Epoch:[0/2](458700/4588595) loss:3.054 lr:0.0000100 epoch_Time:26197.0min: [2024-01-04 18:09:28,434][model8_pretrain.py][INFO] Epoch:[0/2](458700/4588595) loss:3.048 lr:0.0000100 epoch_Time:26197.0min: [2024-01-04 18:09:28,434][model8_pretrain.py][INFO] Epoch:[0/2](458700/4588595) loss:2.638 lr:0.0000100 epoch_Time:26197.0min: [2024-01-04 18:09:28,434][model8_pretrain.py][INFO] Epoch:[0/2](458700/4588595) loss:3.058 lr:0.0000100 epoch_Time:26197.0min: [2024-01-04 18:10:05,376][model8_pretrain.py][INFO] Epoch:[0/2](458800/4588595) loss:2.701 lr:0.0000100 epoch_Time:26196.0min: [2024-01-04 18:10:05,376][model8_pretrain.py][INFO] Epoch:[0/2](458800/4588595) loss:2.697 lr:0.0000100 epoch_Time:26196.0min: [2024-01-04 18:10:05,376][model8_pretrain.py][INFO] Epoch:[0/2](458800/4588595) loss:2.243 lr:0.0000100 epoch_Time:26196.0min: [2024-01-04 18:10:05,376][model8_pretrain.py][INFO] Epoch:[0/2](458800/4588595) loss:3.164 lr:0.0000100 epoch_Time:26196.0min: [2024-01-04 18:10:05,376][model8_pretrain.py][INFO] Epoch:[0/2](458800/4588595) loss:3.005 lr:0.0000100 
epoch_Time:26196.0min: [2024-01-04 18:10:05,377][model8_pretrain.py][INFO] Epoch:[0/2](458800/4588595) loss:2.659 lr:0.0000100 epoch_Time:26196.0min: [2024-01-04 18:10:05,377][model8_pretrain.py][INFO] Epoch:[0/2](458800/4588595) loss:3.108 lr:0.0000100 epoch_Time:26196.0min: [2024-01-04 18:10:05,377][model8_pretrain.py][INFO] Epoch:[0/2](458800/4588595) loss:3.177 lr:0.0000100 epoch_Time:26196.0min: [2024-01-04 18:10:42,328][model8_pretrain.py][INFO] Epoch:[0/2](458900/4588595) loss:2.816 lr:0.0000100 epoch_Time:26196.0min: [2024-01-04 18:10:42,328][model8_pretrain.py][INFO] Epoch:[0/2](458900/4588595) loss:2.923 lr:0.0000100 epoch_Time:26196.0min: [2024-01-04 18:10:42,328][model8_pretrain.py][INFO] Epoch:[0/2](458900/4588595) loss:2.594 lr:0.0000100 epoch_Time:26196.0min: [2024-01-04 18:10:42,328][model8_pretrain.py][INFO] Epoch:[0/2](458900/4588595) loss:2.654 lr:0.0000100 epoch_Time:26196.0min: [2024-01-04 18:10:42,328][model8_pretrain.py][INFO] Epoch:[0/2](458900/4588595) loss:2.604 lr:0.0000100 epoch_Time:26196.0min: [2024-01-04 18:10:42,328][model8_pretrain.py][INFO] Epoch:[0/2](458900/4588595) loss:2.661 lr:0.0000100 epoch_Time:26196.0min: [2024-01-04 18:10:42,328][model8_pretrain.py][INFO] Epoch:[0/2](458900/4588595) loss:2.208 lr:0.0000100 epoch_Time:26196.0min: [2024-01-04 18:10:42,328][model8_pretrain.py][INFO] Epoch:[0/2](458900/4588595) loss:3.173 lr:0.0000100 epoch_Time:26196.0min: [2024-01-04 18:11:19,280][model8_pretrain.py][INFO] Epoch:[0/2](459000/4588595) loss:2.651 lr:0.0000100 epoch_Time:26195.0min: [2024-01-04 18:11:19,280][model8_pretrain.py][INFO] Epoch:[0/2](459000/4588595) loss:3.060 lr:0.0000100 epoch_Time:26195.0min: [2024-01-04 18:11:19,280][model8_pretrain.py][INFO] Epoch:[0/2](459000/4588595) loss:3.171 lr:0.0000100 epoch_Time:26195.0min: [2024-01-04 18:11:19,280][model8_pretrain.py][INFO] Epoch:[0/2](459000/4588595) loss:2.433 lr:0.0000100 epoch_Time:26195.0min: [2024-01-04 18:11:19,280][model8_pretrain.py][INFO] 
Epoch:[0/2](459000/4588595) loss:3.085 lr:0.0000100 epoch_Time:26195.0min: [2024-01-04 18:11:19,280][model8_pretrain.py][INFO] Epoch:[0/2](459000/4588595) loss:3.057 lr:0.0000100 epoch_Time:26195.0min: [2024-01-04 18:11:19,280][model8_pretrain.py][INFO] Epoch:[0/2](459000/4588595) loss:2.749 lr:0.0000100 epoch_Time:26195.0min: [2024-01-04 18:11:19,280][model8_pretrain.py][INFO] Epoch:[0/2](459000/4588595) loss:2.946 lr:0.0000100 epoch_Time:26195.0min: [2024-01-04 18:11:56,238][model8_pretrain.py][INFO] Epoch:[0/2](459100/4588595) loss:3.155 lr:0.0000100 epoch_Time:26194.0min: [2024-01-04 18:11:56,238][model8_pretrain.py][INFO] Epoch:[0/2](459100/4588595) loss:2.877 lr:0.0000100 epoch_Time:26194.0min: [2024-01-04 18:11:56,238][model8_pretrain.py][INFO] Epoch:[0/2](459100/4588595) loss:3.297 lr:0.0000100 epoch_Time:26194.0min: [2024-01-04 18:11:56,238][model8_pretrain.py][INFO] Epoch:[0/2](459100/4588595) loss:3.024 lr:0.0000100 epoch_Time:26194.0min: [2024-01-04 18:11:56,238][model8_pretrain.py][INFO] Epoch:[0/2](459100/4588595) loss:3.030 lr:0.0000100 epoch_Time:26194.0min: [2024-01-04 18:11:56,238][model8_pretrain.py][INFO] Epoch:[0/2](459100/4588595) loss:2.958 lr:0.0000100 epoch_Time:26194.0min: [2024-01-04 18:11:56,238][model8_pretrain.py][INFO] Epoch:[0/2](459100/4588595) loss:3.071 lr:0.0000100 epoch_Time:26194.0min: [2024-01-04 18:11:56,238][model8_pretrain.py][INFO] Epoch:[0/2](459100/4588595) loss:3.242 lr:0.0000100 epoch_Time:26194.0min: [2024-01-04 18:12:33,193][model8_pretrain.py][INFO] Epoch:[0/2](459200/4588595) loss:2.848 lr:0.0000100 epoch_Time:26193.0min: [2024-01-04 18:12:33,193][model8_pretrain.py][INFO] Epoch:[0/2](459200/4588595) loss:2.856 lr:0.0000100 epoch_Time:26193.0min: [2024-01-04 18:12:33,193][model8_pretrain.py][INFO] Epoch:[0/2](459200/4588595) loss:3.274 lr:0.0000100 epoch_Time:26193.0min: [2024-01-04 18:12:33,193][model8_pretrain.py][INFO] Epoch:[0/2](459200/4588595) loss:2.789 lr:0.0000100 epoch_Time:26193.0min: [2024-01-04 
18:12:33,193][model8_pretrain.py][INFO] Epoch:[0/2](459200/4588595) loss:2.828 lr:0.0000100 epoch_Time:26193.0min: [2024-01-04 18:12:33,193][model8_pretrain.py][INFO] Epoch:[0/2](459200/4588595) loss:3.156 lr:0.0000100 epoch_Time:26193.0min: [2024-01-04 18:12:33,193][model8_pretrain.py][INFO] Epoch:[0/2](459200/4588595) loss:2.811 lr:0.0000100 epoch_Time:26193.0min: [2024-01-04 18:12:33,193][model8_pretrain.py][INFO] Epoch:[0/2](459200/4588595) loss:3.139 lr:0.0000100 epoch_Time:26193.0min: [2024-01-04 18:13:10,176][model8_pretrain.py][INFO] Epoch:[0/2](459300/4588595) loss:2.580 lr:0.0000100 epoch_Time:26192.0min: [2024-01-04 18:13:10,176][model8_pretrain.py][INFO] Epoch:[0/2](459300/4588595) loss:3.059 lr:0.0000100 epoch_Time:26192.0min: [2024-01-04 18:13:10,176][model8_pretrain.py][INFO] Epoch:[0/2](459300/4588595) loss:2.455 lr:0.0000100 epoch_Time:26192.0min: [2024-01-04 18:13:10,176][model8_pretrain.py][INFO] Epoch:[0/2](459300/4588595) loss:2.244 lr:0.0000100 epoch_Time:26192.0min: [2024-01-04 18:13:10,176][model8_pretrain.py][INFO] Epoch:[0/2](459300/4588595) loss:2.800 lr:0.0000100 epoch_Time:26192.0min: [2024-01-04 18:13:10,176][model8_pretrain.py][INFO] Epoch:[0/2](459300/4588595) loss:2.675 lr:0.0000100 epoch_Time:26192.0min: [2024-01-04 18:13:10,176][model8_pretrain.py][INFO] Epoch:[0/2](459300/4588595) loss:2.966 lr:0.0000100 epoch_Time:26192.0min: [2024-01-04 18:13:10,176][model8_pretrain.py][INFO] Epoch:[0/2](459300/4588595) loss:2.698 lr:0.0000100 epoch_Time:26192.0min: [2024-01-04 18:13:47,122][model8_pretrain.py][INFO] Epoch:[0/2](459400/4588595) loss:2.863 lr:0.0000100 epoch_Time:26192.0min: [2024-01-04 18:13:47,122][model8_pretrain.py][INFO] Epoch:[0/2](459400/4588595) loss:2.888 lr:0.0000100 epoch_Time:26192.0min: [2024-01-04 18:13:47,122][model8_pretrain.py][INFO] Epoch:[0/2](459400/4588595) loss:3.072 lr:0.0000100 epoch_Time:26192.0min: [2024-01-04 18:13:47,122][model8_pretrain.py][INFO] Epoch:[0/2](459400/4588595) loss:2.821 lr:0.0000100 
epoch_Time:26192.0min: [2024-01-04 18:13:47,122][model8_pretrain.py][INFO] Epoch:[0/2](459400/4588595) loss:2.691 lr:0.0000100 epoch_Time:26192.0min: [2024-01-04 18:13:47,122][model8_pretrain.py][INFO] Epoch:[0/2](459400/4588595) loss:2.778 lr:0.0000100 epoch_Time:26192.0min: [2024-01-04 18:13:47,122][model8_pretrain.py][INFO] Epoch:[0/2](459400/4588595) loss:1.927 lr:0.0000100 epoch_Time:26192.0min: [2024-01-04 18:13:47,122][model8_pretrain.py][INFO] Epoch:[0/2](459400/4588595) loss:2.796 lr:0.0000100 epoch_Time:26192.0min: [2024-01-04 18:14:34,250][model8_pretrain.py][INFO] Epoch:[0/2](459500/4588595) loss:3.036 lr:0.0000100 epoch_Time:26193.0min: [2024-01-04 18:14:34,250][model8_pretrain.py][INFO] Epoch:[0/2](459500/4588595) loss:2.575 lr:0.0000100 epoch_Time:26193.0min: [2024-01-04 18:14:34,250][model8_pretrain.py][INFO] Epoch:[0/2](459500/4588595) loss:3.222 lr:0.0000100 epoch_Time:26193.0min: [2024-01-04 18:14:34,250][model8_pretrain.py][INFO] Epoch:[0/2](459500/4588595) loss:2.980 lr:0.0000100 epoch_Time:26193.0min: [2024-01-04 18:14:34,250][model8_pretrain.py][INFO] Epoch:[0/2](459500/4588595) loss:2.825 lr:0.0000100 epoch_Time:26193.0min: [2024-01-04 18:14:34,250][model8_pretrain.py][INFO] Epoch:[0/2](459500/4588595) loss:2.743 lr:0.0000100 epoch_Time:26193.0min: [2024-01-04 18:14:34,250][model8_pretrain.py][INFO] Epoch:[0/2](459500/4588595) loss:3.043 lr:0.0000100 epoch_Time:26193.0min: [2024-01-04 18:14:34,250][model8_pretrain.py][INFO] Epoch:[0/2](459500/4588595) loss:3.312 lr:0.0000100 epoch_Time:26193.0min: [2024-01-04 18:15:11,191][model8_pretrain.py][INFO] Epoch:[0/2](459600/4588595) loss:2.226 lr:0.0000100 epoch_Time:26191.0min: [2024-01-04 18:15:11,191][model8_pretrain.py][INFO] Epoch:[0/2](459600/4588595) loss:2.874 lr:0.0000100 epoch_Time:26191.0min: [2024-01-04 18:15:11,191][model8_pretrain.py][INFO] Epoch:[0/2](459600/4588595) loss:2.549 lr:0.0000100 epoch_Time:26191.0min: [2024-01-04 18:15:11,191][model8_pretrain.py][INFO] 
Epoch:[0/2](459600/4588595) loss:2.610 lr:0.0000100 epoch_Time:26191.0min: [2024-01-04 18:15:11,191][model8_pretrain.py][INFO] Epoch:[0/2](459600/4588595) loss:3.449 lr:0.0000100 epoch_Time:26191.0min: [2024-01-04 18:15:11,191][model8_pretrain.py][INFO] Epoch:[0/2](459600/4588595) loss:2.748 lr:0.0000100 epoch_Time:26191.0min: [2024-01-04 18:15:11,191][model8_pretrain.py][INFO] Epoch:[0/2](459600/4588595) loss:2.962 lr:0.0000100 epoch_Time:26191.0min: [2024-01-04 18:15:11,192][model8_pretrain.py][INFO] Epoch:[0/2](459600/4588595) loss:2.573 lr:0.0000100 epoch_Time:26191.0min: [2024-01-04 18:15:48,134][model8_pretrain.py][INFO] Epoch:[0/2](459700/4588595) loss:2.238 lr:0.0000100 epoch_Time:26190.0min: [2024-01-04 18:15:48,134][model8_pretrain.py][INFO] Epoch:[0/2](459700/4588595) loss:3.208 lr:0.0000100 epoch_Time:26190.0min: [2024-01-04 18:15:48,134][model8_pretrain.py][INFO] Epoch:[0/2](459700/4588595) loss:3.241 lr:0.0000100 epoch_Time:26190.0min: [2024-01-04 18:15:48,134][model8_pretrain.py][INFO] Epoch:[0/2](459700/4588595) loss:2.503 lr:0.0000100 epoch_Time:26190.0min: [2024-01-04 18:15:48,134][model8_pretrain.py][INFO] Epoch:[0/2](459700/4588595) loss:2.536 lr:0.0000100 epoch_Time:26190.0min: [2024-01-04 18:15:48,134][model8_pretrain.py][INFO] Epoch:[0/2](459700/4588595) loss:3.050 lr:0.0000100 epoch_Time:26190.0min: [2024-01-04 18:15:48,134][model8_pretrain.py][INFO] Epoch:[0/2](459700/4588595) loss:2.262 lr:0.0000100 epoch_Time:26190.0min: [2024-01-04 18:15:48,134][model8_pretrain.py][INFO] Epoch:[0/2](459700/4588595) loss:2.488 lr:0.0000100 epoch_Time:26190.0min: [2024-01-04 18:16:25,092][model8_pretrain.py][INFO] Epoch:[0/2](459800/4588595) loss:2.607 lr:0.0000100 epoch_Time:26190.0min: [2024-01-04 18:16:25,092][model8_pretrain.py][INFO] Epoch:[0/2](459800/4588595) loss:2.628 lr:0.0000100 epoch_Time:26190.0min: [2024-01-04 18:16:25,092][model8_pretrain.py][INFO] Epoch:[0/2](459800/4588595) loss:3.018 lr:0.0000100 epoch_Time:26190.0min: [2024-01-04 
18:16:25,092][model8_pretrain.py][INFO] Epoch:[0/2](459800/4588595) loss:2.422 lr:0.0000100 epoch_Time:26190.0min: [2024-01-04 18:16:25,092][model8_pretrain.py][INFO] Epoch:[0/2](459800/4588595) loss:3.152 lr:0.0000100 epoch_Time:26190.0min: [2024-01-04 18:16:25,092][model8_pretrain.py][INFO] Epoch:[0/2](459800/4588595) loss:2.989 lr:0.0000100 epoch_Time:26190.0min: [2024-01-04 18:16:25,093][model8_pretrain.py][INFO] Epoch:[0/2](459800/4588595) loss:2.885 lr:0.0000100 epoch_Time:26190.0min: [2024-01-04 18:16:25,093][model8_pretrain.py][INFO] Epoch:[0/2](459800/4588595) loss:3.052 lr:0.0000100 epoch_Time:26190.0min: [2024-01-04 18:17:02,057][model8_pretrain.py][INFO] Epoch:[0/2](459900/4588595) loss:2.708 lr:0.0000100 epoch_Time:26189.0min: [2024-01-04 18:17:02,057][model8_pretrain.py][INFO] Epoch:[0/2](459900/4588595) loss:2.902 lr:0.0000100 epoch_Time:26189.0min: [2024-01-04 18:17:02,057][model8_pretrain.py][INFO] Epoch:[0/2](459900/4588595) loss:2.964 lr:0.0000100 epoch_Time:26189.0min: [2024-01-04 18:17:02,057][model8_pretrain.py][INFO] Epoch:[0/2](459900/4588595) loss:2.974 lr:0.0000100 epoch_Time:26189.0min: [2024-01-04 18:17:02,057][model8_pretrain.py][INFO] Epoch:[0/2](459900/4588595) loss:2.358 lr:0.0000100 epoch_Time:26189.0min: [2024-01-04 18:17:02,057][model8_pretrain.py][INFO] Epoch:[0/2](459900/4588595) loss:3.142 lr:0.0000100 epoch_Time:26189.0min: [2024-01-04 18:17:02,057][model8_pretrain.py][INFO] Epoch:[0/2](459900/4588595) loss:2.490 lr:0.0000100 epoch_Time:26189.0min: [2024-01-04 18:17:02,057][model8_pretrain.py][INFO] Epoch:[0/2](459900/4588595) loss:3.146 lr:0.0000100 epoch_Time:26189.0min: [2024-01-04 18:17:39,005][model8_pretrain.py][INFO] Epoch:[0/2](460000/4588595) loss:3.095 lr:0.0000100 epoch_Time:26189.0min: [2024-01-04 18:17:39,005][model8_pretrain.py][INFO] Epoch:[0/2](460000/4588595) loss:2.848 lr:0.0000100 epoch_Time:26189.0min: [2024-01-04 18:17:39,005][model8_pretrain.py][INFO] Epoch:[0/2](460000/4588595) loss:2.745 lr:0.0000100 
epoch_Time:26189.0min: [2024-01-04 18:17:39,005][model8_pretrain.py][INFO] Epoch:[0/2](460000/4588595) loss:3.246 lr:0.0000100 epoch_Time:26189.0min: [2024-01-04 18:17:39,005][model8_pretrain.py][INFO] Epoch:[0/2](460000/4588595) loss:2.535 lr:0.0000100 epoch_Time:26189.0min: [2024-01-04 18:17:39,005][model8_pretrain.py][INFO] Epoch:[0/2](460000/4588595) loss:3.019 lr:0.0000100 epoch_Time:26189.0min: [2024-01-04 18:17:39,005][model8_pretrain.py][INFO] Epoch:[0/2](460000/4588595) loss:1.939 lr:0.0000100 epoch_Time:26189.0min: [2024-01-04 18:17:39,005][model8_pretrain.py][INFO] Epoch:[0/2](460000/4588595) loss:2.728 lr:0.0000100 epoch_Time:26189.0min: [2024-01-04 18:18:15,964][model8_pretrain.py][INFO] Epoch:[0/2](460100/4588595) loss:2.565 lr:0.0000100 epoch_Time:26187.0min: [2024-01-04 18:18:15,964][model8_pretrain.py][INFO] Epoch:[0/2](460100/4588595) loss:2.777 lr:0.0000100 epoch_Time:26187.0min: [2024-01-04 18:18:15,964][model8_pretrain.py][INFO] Epoch:[0/2](460100/4588595) loss:3.075 lr:0.0000100 epoch_Time:26187.0min: [2024-01-04 18:18:15,964][model8_pretrain.py][INFO] Epoch:[0/2](460100/4588595) loss:2.931 lr:0.0000100 epoch_Time:26187.0min: [2024-01-04 18:18:15,964][model8_pretrain.py][INFO] Epoch:[0/2](460100/4588595) loss:2.435 lr:0.0000100 epoch_Time:26187.0min: [2024-01-04 18:18:15,965][model8_pretrain.py][INFO] Epoch:[0/2](460100/4588595) loss:2.469 lr:0.0000100 epoch_Time:26187.0min: [2024-01-04 18:18:15,965][model8_pretrain.py][INFO] Epoch:[0/2](460100/4588595) loss:2.258 lr:0.0000100 epoch_Time:26187.0min: [2024-01-04 18:18:15,965][model8_pretrain.py][INFO] Epoch:[0/2](460100/4588595) loss:2.766 lr:0.0000100 epoch_Time:26187.0min: [2024-01-04 18:18:52,913][model8_pretrain.py][INFO] Epoch:[0/2](460200/4588595) loss:3.068 lr:0.0000100 epoch_Time:26186.0min: [2024-01-04 18:18:52,913][model8_pretrain.py][INFO] Epoch:[0/2](460200/4588595) loss:2.958 lr:0.0000100 epoch_Time:26186.0min: [2024-01-04 18:18:52,913][model8_pretrain.py][INFO] 
Epoch:[0/2](460200/4588595) loss:2.998 lr:0.0000100 epoch_Time:26186.0min: [2024-01-04 18:18:52,913][model8_pretrain.py][INFO] Epoch:[0/2](460200/4588595) loss:3.224 lr:0.0000100 epoch_Time:26186.0min: [2024-01-04 18:18:52,913][model8_pretrain.py][INFO] Epoch:[0/2](460200/4588595) loss:2.956 lr:0.0000100 epoch_Time:26186.0min: [2024-01-04 18:18:52,913][model8_pretrain.py][INFO] Epoch:[0/2](460200/4588595) loss:3.283 lr:0.0000100 epoch_Time:26186.0min: [2024-01-04 18:18:52,913][model8_pretrain.py][INFO] Epoch:[0/2](460200/4588595) loss:3.041 lr:0.0000100 epoch_Time:26186.0min: [2024-01-04 18:18:52,913][model8_pretrain.py][INFO] Epoch:[0/2](460200/4588595) loss:2.763 lr:0.0000100 epoch_Time:26186.0min: [2024-01-04 18:19:40,087][model8_pretrain.py][INFO] Epoch:[0/2](460300/4588595) loss:2.997 lr:0.0000100 epoch_Time:26188.0min: [2024-01-04 18:19:40,087][model8_pretrain.py][INFO] Epoch:[0/2](460300/4588595) loss:3.367 lr:0.0000100 epoch_Time:26188.0min: [2024-01-04 18:19:40,087][model8_pretrain.py][INFO] Epoch:[0/2](460300/4588595) loss:3.168 lr:0.0000100 epoch_Time:26188.0min: [2024-01-04 18:19:40,087][model8_pretrain.py][INFO] Epoch:[0/2](460300/4588595) loss:2.339 lr:0.0000100 epoch_Time:26188.0min: [2024-01-04 18:19:40,087][model8_pretrain.py][INFO] Epoch:[0/2](460300/4588595) loss:2.673 lr:0.0000100 epoch_Time:26188.0min: [2024-01-04 18:19:40,087][model8_pretrain.py][INFO] Epoch:[0/2](460300/4588595) loss:3.090 lr:0.0000100 epoch_Time:26188.0min: [2024-01-04 18:19:40,087][model8_pretrain.py][INFO] Epoch:[0/2](460300/4588595) loss:2.536 lr:0.0000100 epoch_Time:26188.0min: [2024-01-04 18:19:40,087][model8_pretrain.py][INFO] Epoch:[0/2](460300/4588595) loss:2.612 lr:0.0000100 epoch_Time:26188.0min: [2024-01-04 18:20:17,035][model8_pretrain.py][INFO] Epoch:[0/2](460400/4588595) loss:3.110 lr:0.0000100 epoch_Time:26187.0min: [2024-01-04 18:20:17,035][model8_pretrain.py][INFO] Epoch:[0/2](460400/4588595) loss:2.682 lr:0.0000100 epoch_Time:26187.0min: [2024-01-04 
18:20:17,035][model8_pretrain.py][INFO] Epoch:[0/2](460400/4588595) loss:3.095 lr:0.0000100 epoch_Time:26187.0min: [2024-01-04 18:20:17,036][model8_pretrain.py][INFO] Epoch:[0/2](460400/4588595) loss:3.015 lr:0.0000100 epoch_Time:26187.0min: [2024-01-04 18:20:17,035][model8_pretrain.py][INFO] Epoch:[0/2](460400/4588595) loss:2.725 lr:0.0000100 epoch_Time:26187.0min: [2024-01-04 18:20:17,036][model8_pretrain.py][INFO] Epoch:[0/2](460400/4588595) loss:2.966 lr:0.0000100 epoch_Time:26187.0min: [2024-01-04 18:20:17,036][model8_pretrain.py][INFO] Epoch:[0/2](460400/4588595) loss:2.965 lr:0.0000100 epoch_Time:26187.0min: [2024-01-04 18:20:17,036][model8_pretrain.py][INFO] Epoch:[0/2](460400/4588595) loss:2.805 lr:0.0000100 epoch_Time:26187.0min: [2024-01-04 18:20:53,984][model8_pretrain.py][INFO] Epoch:[0/2](460500/4588595) loss:2.981 lr:0.0000100 epoch_Time:26185.0min: [2024-01-04 18:20:53,984][model8_pretrain.py][INFO] Epoch:[0/2](460500/4588595) loss:2.943 lr:0.0000100 epoch_Time:26185.0min: [2024-01-04 18:20:53,984][model8_pretrain.py][INFO] Epoch:[0/2](460500/4588595) loss:2.513 lr:0.0000100 epoch_Time:26185.0min: [2024-01-04 18:20:53,984][model8_pretrain.py][INFO] Epoch:[0/2](460500/4588595) loss:3.068 lr:0.0000100 epoch_Time:26185.0min: [2024-01-04 18:20:53,984][model8_pretrain.py][INFO] Epoch:[0/2](460500/4588595) loss:3.223 lr:0.0000100 epoch_Time:26185.0min: [2024-01-04 18:20:53,984][model8_pretrain.py][INFO] Epoch:[0/2](460500/4588595) loss:2.991 lr:0.0000100 epoch_Time:26185.0min: [2024-01-04 18:20:53,984][model8_pretrain.py][INFO] Epoch:[0/2](460500/4588595) loss:3.419 lr:0.0000100 epoch_Time:26185.0min: [2024-01-04 18:20:53,984][model8_pretrain.py][INFO] Epoch:[0/2](460500/4588595) loss:3.042 lr:0.0000100 epoch_Time:26185.0min: [2024-01-04 18:21:30,933][model8_pretrain.py][INFO] Epoch:[0/2](460600/4588595) loss:2.960 lr:0.0000100 epoch_Time:26185.0min: [2024-01-04 18:21:30,933][model8_pretrain.py][INFO] Epoch:[0/2](460600/4588595) loss:1.868 lr:0.0000100 
epoch_Time:26185.0min: [2024-01-04 18:21:30,933][model8_pretrain.py][INFO] Epoch:[0/2](460600/4588595) loss:2.979 lr:0.0000100 epoch_Time:26185.0min: [2024-01-04 18:21:30,933][model8_pretrain.py][INFO] Epoch:[0/2](460600/4588595) loss:3.592 lr:0.0000100 epoch_Time:26185.0min: [2024-01-04 18:21:30,933][model8_pretrain.py][INFO] Epoch:[0/2](460600/4588595) loss:2.658 lr:0.0000100 epoch_Time:26185.0min: [2024-01-04 18:21:30,933][model8_pretrain.py][INFO] Epoch:[0/2](460600/4588595) loss:2.173 lr:0.0000100 epoch_Time:26185.0min: [2024-01-04 18:21:30,933][model8_pretrain.py][INFO] Epoch:[0/2](460600/4588595) loss:3.401 lr:0.0000100 epoch_Time:26185.0min: [2024-01-04 18:21:30,933][model8_pretrain.py][INFO] Epoch:[0/2](460600/4588595) loss:2.646 lr:0.0000100 epoch_Time:26185.0min: [2024-01-04 18:22:07,883][model8_pretrain.py][INFO] Epoch:[0/2](460700/4588595) loss:3.084 lr:0.0000100 epoch_Time:26184.0min: [2024-01-04 18:22:07,883][model8_pretrain.py][INFO] Epoch:[0/2](460700/4588595) loss:2.523 lr:0.0000100 epoch_Time:26184.0min: [2024-01-04 18:22:07,883][model8_pretrain.py][INFO] Epoch:[0/2](460700/4588595) loss:2.856 lr:0.0000100 epoch_Time:26184.0min: [2024-01-04 18:22:07,883][model8_pretrain.py][INFO] Epoch:[0/2](460700/4588595) loss:3.152 lr:0.0000100 epoch_Time:26184.0min: [2024-01-04 18:22:07,883][model8_pretrain.py][INFO] Epoch:[0/2](460700/4588595) loss:2.816 lr:0.0000100 epoch_Time:26184.0min: [2024-01-04 18:22:07,883][model8_pretrain.py][INFO] Epoch:[0/2](460700/4588595) loss:2.840 lr:0.0000100 epoch_Time:26184.0min: [2024-01-04 18:22:07,884][model8_pretrain.py][INFO] Epoch:[0/2](460700/4588595) loss:2.734 lr:0.0000100 epoch_Time:26184.0min: [2024-01-04 18:22:07,884][model8_pretrain.py][INFO] Epoch:[0/2](460700/4588595) loss:2.746 lr:0.0000100 epoch_Time:26184.0min: [2024-01-04 18:22:44,848][model8_pretrain.py][INFO] Epoch:[0/2](460800/4588595) loss:2.972 lr:0.0000100 epoch_Time:26184.0min: [2024-01-04 18:22:44,848][model8_pretrain.py][INFO] 
Epoch:[0/2](460800/4588595) loss:2.548 lr:0.0000100 epoch_Time:26184.0min: [2024-01-04 18:22:44,848][model8_pretrain.py][INFO] Epoch:[0/2](460800/4588595) loss:2.733 lr:0.0000100 epoch_Time:26184.0min: [2024-01-04 18:22:44,848][model8_pretrain.py][INFO] Epoch:[0/2](460800/4588595) loss:3.419 lr:0.0000100 epoch_Time:26184.0min: [2024-01-04 18:22:44,848][model8_pretrain.py][INFO] Epoch:[0/2](460800/4588595) loss:3.088 lr:0.0000100 epoch_Time:26184.0min: [2024-01-04 18:22:44,848][model8_pretrain.py][INFO] Epoch:[0/2](460800/4588595) loss:2.782 lr:0.0000100 epoch_Time:26184.0min: [2024-01-04 18:22:44,848][model8_pretrain.py][INFO] Epoch:[0/2](460800/4588595) loss:2.874 lr:0.0000100 epoch_Time:26184.0min: [2024-01-04 18:22:44,848][model8_pretrain.py][INFO] Epoch:[0/2](460800/4588595) loss:2.963 lr:0.0000100 epoch_Time:26184.0min: [2024-01-04 18:23:21,819][model8_pretrain.py][INFO] Epoch:[0/2](460900/4588595) loss:2.661 lr:0.0000100 epoch_Time:26183.0min: [2024-01-04 18:23:21,819][model8_pretrain.py][INFO] Epoch:[0/2](460900/4588595) loss:2.929 lr:0.0000100 epoch_Time:26183.0min: [2024-01-04 18:23:21,819][model8_pretrain.py][INFO] Epoch:[0/2](460900/4588595) loss:3.218 lr:0.0000100 epoch_Time:26183.0min: [2024-01-04 18:23:21,819][model8_pretrain.py][INFO] Epoch:[0/2](460900/4588595) loss:3.384 lr:0.0000100 epoch_Time:26183.0min: [2024-01-04 18:23:21,819][model8_pretrain.py][INFO] Epoch:[0/2](460900/4588595) loss:2.090 lr:0.0000100 epoch_Time:26183.0min: [2024-01-04 18:23:21,819][model8_pretrain.py][INFO] Epoch:[0/2](460900/4588595) loss:3.284 lr:0.0000100 epoch_Time:26183.0min: [2024-01-04 18:23:21,819][model8_pretrain.py][INFO] Epoch:[0/2](460900/4588595) loss:3.106 lr:0.0000100 epoch_Time:26183.0min: [2024-01-04 18:23:21,820][model8_pretrain.py][INFO] Epoch:[0/2](460900/4588595) loss:3.180 lr:0.0000100 epoch_Time:26183.0min: [2024-01-04 18:23:58,784][model8_pretrain.py][INFO] Epoch:[0/2](461000/4588595) loss:3.012 lr:0.0000100 epoch_Time:26182.0min: [2024-01-04 
18:23:58,784][model8_pretrain.py][INFO] Epoch:[0/2](461000/4588595) loss:2.911 lr:0.0000100 epoch_Time:26182.0min: [2024-01-04 18:23:58,784][model8_pretrain.py][INFO] Epoch:[0/2](461000/4588595) loss:3.454 lr:0.0000100 epoch_Time:26182.0min: [2024-01-04 18:23:58,784][model8_pretrain.py][INFO] Epoch:[0/2](461000/4588595) loss:2.357 lr:0.0000100 epoch_Time:26182.0min: [2024-01-04 18:23:58,784][model8_pretrain.py][INFO] Epoch:[0/2](461000/4588595) loss:2.612 lr:0.0000100 epoch_Time:26182.0min: [2024-01-04 18:23:58,784][model8_pretrain.py][INFO] Epoch:[0/2](461000/4588595) loss:2.613 lr:0.0000100 epoch_Time:26182.0min: [2024-01-04 18:23:58,784][model8_pretrain.py][INFO] Epoch:[0/2](461000/4588595) loss:2.806 lr:0.0000100 epoch_Time:26182.0min: [2024-01-04 18:23:58,785][model8_pretrain.py][INFO] Epoch:[0/2](461000/4588595) loss:2.526 lr:0.0000100 epoch_Time:26182.0min: [2024-01-04 18:24:45,993][model8_pretrain.py][INFO] Epoch:[0/2](461100/4588595) loss:1.859 lr:0.0000100 epoch_Time:26183.0min: [2024-01-04 18:24:45,993][model8_pretrain.py][INFO] Epoch:[0/2](461100/4588595) loss:2.542 lr:0.0000100 epoch_Time:26183.0min: [2024-01-04 18:24:45,993][model8_pretrain.py][INFO] Epoch:[0/2](461100/4588595) loss:2.844 lr:0.0000100 epoch_Time:26183.0min: [2024-01-04 18:24:45,993][model8_pretrain.py][INFO] Epoch:[0/2](461100/4588595) loss:2.774 lr:0.0000100 epoch_Time:26183.0min: [2024-01-04 18:24:45,993][model8_pretrain.py][INFO] Epoch:[0/2](461100/4588595) loss:3.222 lr:0.0000100 epoch_Time:26183.0min: [2024-01-04 18:24:45,993][model8_pretrain.py][INFO] Epoch:[0/2](461100/4588595) loss:2.306 lr:0.0000100 epoch_Time:26183.0min: [2024-01-04 18:24:45,993][model8_pretrain.py][INFO] Epoch:[0/2](461100/4588595) loss:2.626 lr:0.0000100 epoch_Time:26183.0min: [2024-01-04 18:24:45,993][model8_pretrain.py][INFO] Epoch:[0/2](461100/4588595) loss:2.594 lr:0.0000100 epoch_Time:26183.0min: [2024-01-04 18:25:22,924][model8_pretrain.py][INFO] Epoch:[0/2](461200/4588595) loss:2.524 lr:0.0000100 
epoch_Time:26182.0min: [2024-01-04 18:25:22,924][model8_pretrain.py][INFO] Epoch:[0/2](461200/4588595) loss:2.774 lr:0.0000100 epoch_Time:26182.0min: [2024-01-04 18:25:22,924][model8_pretrain.py][INFO] Epoch:[0/2](461200/4588595) loss:3.022 lr:0.0000100 epoch_Time:26182.0min: [2024-01-04 18:25:22,924][model8_pretrain.py][INFO] Epoch:[0/2](461200/4588595) loss:2.792 lr:0.0000100 epoch_Time:26182.0min: [2024-01-04 18:25:22,924][model8_pretrain.py][INFO] Epoch:[0/2](461200/4588595) loss:2.981 lr:0.0000100 epoch_Time:26182.0min: [2024-01-04 18:25:22,924][model8_pretrain.py][INFO] Epoch:[0/2](461200/4588595) loss:3.254 lr:0.0000100 epoch_Time:26182.0min: [2024-01-04 18:25:22,924][model8_pretrain.py][INFO] Epoch:[0/2](461200/4588595) loss:2.296 lr:0.0000100 epoch_Time:26182.0min: [2024-01-04 18:25:22,924][model8_pretrain.py][INFO] Epoch:[0/2](461200/4588595) loss:3.208 lr:0.0000100 epoch_Time:26182.0min: [2024-01-04 18:25:59,856][model8_pretrain.py][INFO] Epoch:[0/2](461300/4588595) loss:2.754 lr:0.0000100 epoch_Time:26181.0min: [2024-01-04 18:25:59,856][model8_pretrain.py][INFO] Epoch:[0/2](461300/4588595) loss:2.811 lr:0.0000100 epoch_Time:26181.0min: [2024-01-04 18:25:59,856][model8_pretrain.py][INFO] Epoch:[0/2](461300/4588595) loss:3.120 lr:0.0000100 epoch_Time:26181.0min: [2024-01-04 18:25:59,856][model8_pretrain.py][INFO] Epoch:[0/2](461300/4588595) loss:3.047 lr:0.0000100 epoch_Time:26181.0min: [2024-01-04 18:25:59,856][model8_pretrain.py][INFO] Epoch:[0/2](461300/4588595) loss:3.188 lr:0.0000100 epoch_Time:26181.0min: [2024-01-04 18:25:59,856][model8_pretrain.py][INFO] Epoch:[0/2](461300/4588595) loss:2.559 lr:0.0000100 epoch_Time:26181.0min: [2024-01-04 18:25:59,856][model8_pretrain.py][INFO] Epoch:[0/2](461300/4588595) loss:2.730 lr:0.0000100 epoch_Time:26181.0min: [2024-01-04 18:25:59,856][model8_pretrain.py][INFO] Epoch:[0/2](461300/4588595) loss:2.696 lr:0.0000100 epoch_Time:26181.0min: [2024-01-04 18:26:36,788][model8_pretrain.py][INFO] 
Epoch:[0/2](461400/4588595) loss:3.053 lr:0.0000100 epoch_Time:26180.0min: [2024-01-04 18:26:36,788][model8_pretrain.py][INFO] Epoch:[0/2](461400/4588595) loss:2.927 lr:0.0000100 epoch_Time:26180.0min: [2024-01-04 18:26:36,788][model8_pretrain.py][INFO] Epoch:[0/2](461400/4588595) loss:2.948 lr:0.0000100 epoch_Time:26180.0min: [2024-01-04 18:26:36,788][model8_pretrain.py][INFO] Epoch:[0/2](461400/4588595) loss:3.053 lr:0.0000100 epoch_Time:26180.0min: [2024-01-04 18:26:36,788][model8_pretrain.py][INFO] Epoch:[0/2](461400/4588595) loss:3.017 lr:0.0000100 epoch_Time:26180.0min: [2024-01-04 18:26:36,788][model8_pretrain.py][INFO] Epoch:[0/2](461400/4588595) loss:3.112 lr:0.0000100 epoch_Time:26180.0min: [2024-01-04 18:26:36,788][model8_pretrain.py][INFO] Epoch:[0/2](461400/4588595) loss:2.862 lr:0.0000100 epoch_Time:26180.0min: [2024-01-04 18:26:36,789][model8_pretrain.py][INFO] Epoch:[0/2](461400/4588595) loss:3.096 lr:0.0000100 epoch_Time:26180.0min: [2024-01-04 18:27:13,727][model8_pretrain.py][INFO] Epoch:[0/2](461500/4588595) loss:2.799 lr:0.0000100 epoch_Time:26179.0min: [2024-01-04 18:27:13,727][model8_pretrain.py][INFO] Epoch:[0/2](461500/4588595) loss:2.379 lr:0.0000100 epoch_Time:26179.0min: [2024-01-04 18:27:13,727][model8_pretrain.py][INFO] Epoch:[0/2](461500/4588595) loss:3.347 lr:0.0000100 epoch_Time:26179.0min: [2024-01-04 18:27:13,727][model8_pretrain.py][INFO] Epoch:[0/2](461500/4588595) loss:3.190 lr:0.0000100 epoch_Time:26179.0min: [2024-01-04 18:27:13,727][model8_pretrain.py][INFO] Epoch:[0/2](461500/4588595) loss:2.696 lr:0.0000100 epoch_Time:26179.0min: [2024-01-04 18:27:13,727][model8_pretrain.py][INFO] Epoch:[0/2](461500/4588595) loss:2.509 lr:0.0000100 epoch_Time:26179.0min: [2024-01-04 18:27:13,727][model8_pretrain.py][INFO] Epoch:[0/2](461500/4588595) loss:2.743 lr:0.0000100 epoch_Time:26179.0min: [2024-01-04 18:27:13,728][model8_pretrain.py][INFO] Epoch:[0/2](461500/4588595) loss:3.153 lr:0.0000100 epoch_Time:26179.0min: [2024-01-04 
18:27:50,602][model8_pretrain.py][INFO] Epoch:[0/2](461600/4588595) loss:2.820 lr:0.0000100 epoch_Time:26178.0min: [2024-01-04 18:27:50,602][model8_pretrain.py][INFO] Epoch:[0/2](461600/4588595) loss:2.724 lr:0.0000100 epoch_Time:26178.0min: [2024-01-04 18:27:50,602][model8_pretrain.py][INFO] Epoch:[0/2](461600/4588595) loss:3.137 lr:0.0000100 epoch_Time:26178.0min: [2024-01-04 18:27:50,602][model8_pretrain.py][INFO] Epoch:[0/2](461600/4588595) loss:2.448 lr:0.0000100 epoch_Time:26178.0min: [2024-01-04 18:27:50,602][model8_pretrain.py][INFO] Epoch:[0/2](461600/4588595) loss:2.653 lr:0.0000100 epoch_Time:26178.0min: [2024-01-04 18:27:50,602][model8_pretrain.py][INFO] Epoch:[0/2](461600/4588595) loss:3.092 lr:0.0000100 epoch_Time:26178.0min: [2024-01-04 18:27:50,603][model8_pretrain.py][INFO] Epoch:[0/2](461600/4588595) loss:2.879 lr:0.0000100 epoch_Time:26178.0min: [2024-01-04 18:27:50,603][model8_pretrain.py][INFO] Epoch:[0/2](461600/4588595) loss:2.982 lr:0.0000100 epoch_Time:26178.0min: [2024-01-04 18:28:27,533][model8_pretrain.py][INFO] Epoch:[0/2](461700/4588595) loss:3.011 lr:0.0000100 epoch_Time:26178.0min: [2024-01-04 18:28:27,533][model8_pretrain.py][INFO] Epoch:[0/2](461700/4588595) loss:2.799 lr:0.0000100 epoch_Time:26178.0min: [2024-01-04 18:28:27,533][model8_pretrain.py][INFO] Epoch:[0/2](461700/4588595) loss:2.503 lr:0.0000100 epoch_Time:26178.0min: [2024-01-04 18:28:27,533][model8_pretrain.py][INFO] Epoch:[0/2](461700/4588595) loss:2.447 lr:0.0000100 epoch_Time:26178.0min: [2024-01-04 18:28:27,533][model8_pretrain.py][INFO] Epoch:[0/2](461700/4588595) loss:3.220 lr:0.0000100 epoch_Time:26178.0min: [2024-01-04 18:28:27,533][model8_pretrain.py][INFO] Epoch:[0/2](461700/4588595) loss:2.635 lr:0.0000100 epoch_Time:26178.0min: [2024-01-04 18:28:27,533][model8_pretrain.py][INFO] Epoch:[0/2](461700/4588595) loss:2.760 lr:0.0000100 epoch_Time:26178.0min: [2024-01-04 18:28:27,533][model8_pretrain.py][INFO] Epoch:[0/2](461700/4588595) loss:3.212 lr:0.0000100 
epoch_Time:26178.0min: [2024-01-04 18:29:04,465][model8_pretrain.py][INFO] Epoch:[0/2](461800/4588595) loss:2.264 lr:0.0000100 epoch_Time:26177.0min: [2024-01-04 18:29:04,465][model8_pretrain.py][INFO] Epoch:[0/2](461800/4588595) loss:2.730 lr:0.0000100 epoch_Time:26177.0min: [2024-01-04 18:29:04,465][model8_pretrain.py][INFO] Epoch:[0/2](461800/4588595) loss:2.820 lr:0.0000100 epoch_Time:26177.0min: [2024-01-04 18:29:04,465][model8_pretrain.py][INFO] Epoch:[0/2](461800/4588595) loss:2.287 lr:0.0000100 epoch_Time:26177.0min: [2024-01-04 18:29:04,465][model8_pretrain.py][INFO] Epoch:[0/2](461800/4588595) loss:3.019 lr:0.0000100 epoch_Time:26177.0min: [2024-01-04 18:29:04,465][model8_pretrain.py][INFO] Epoch:[0/2](461800/4588595) loss:3.159 lr:0.0000100 epoch_Time:26177.0min: [2024-01-04 18:29:04,465][model8_pretrain.py][INFO] Epoch:[0/2](461800/4588595) loss:2.802 lr:0.0000100 epoch_Time:26177.0min: [2024-01-04 18:29:04,465][model8_pretrain.py][INFO] Epoch:[0/2](461800/4588595) loss:3.245 lr:0.0000100 epoch_Time:26177.0min: [2024-01-04 18:29:51,610][model8_pretrain.py][INFO] Epoch:[0/2](461900/4588595) loss:3.364 lr:0.0000100 epoch_Time:26177.0min: [2024-01-04 18:29:51,611][model8_pretrain.py][INFO] Epoch:[0/2](461900/4588595) loss:2.898 lr:0.0000100 epoch_Time:26177.0min: [2024-01-04 18:29:51,611][model8_pretrain.py][INFO] Epoch:[0/2](461900/4588595) loss:2.524 lr:0.0000100 epoch_Time:26177.0min: [2024-01-04 18:29:51,611][model8_pretrain.py][INFO] Epoch:[0/2](461900/4588595) loss:3.212 lr:0.0000100 epoch_Time:26177.0min: [2024-01-04 18:29:51,611][model8_pretrain.py][INFO] Epoch:[0/2](461900/4588595) loss:2.806 lr:0.0000100 epoch_Time:26177.0min: [2024-01-04 18:29:51,611][model8_pretrain.py][INFO] Epoch:[0/2](461900/4588595) loss:2.811 lr:0.0000100 epoch_Time:26177.0min: [2024-01-04 18:29:51,611][model8_pretrain.py][INFO] Epoch:[0/2](461900/4588595) loss:2.756 lr:0.0000100 epoch_Time:26177.0min: [2024-01-04 18:29:51,611][model8_pretrain.py][INFO] 
Epoch:[0/2](461900/4588595) loss:3.365 lr:0.0000100 epoch_Time:26177.0min: [2024-01-04 18:30:28,545][model8_pretrain.py][INFO] Epoch:[0/2](462000/4588595) loss:2.034 lr:0.0000100 epoch_Time:26177.0min: [2024-01-04 18:30:28,546][model8_pretrain.py][INFO] Epoch:[0/2](462000/4588595) loss:3.153 lr:0.0000100 epoch_Time:26177.0min: [2024-01-04 18:30:28,546][model8_pretrain.py][INFO] Epoch:[0/2](462000/4588595) loss:2.410 lr:0.0000100 epoch_Time:26177.0min: [2024-01-04 18:30:28,546][model8_pretrain.py][INFO] Epoch:[0/2](462000/4588595) loss:2.715 lr:0.0000100 epoch_Time:26177.0min: [2024-01-04 18:30:28,546][model8_pretrain.py][INFO] Epoch:[0/2](462000/4588595) loss:2.717 lr:0.0000100 epoch_Time:26177.0min: [2024-01-04 18:30:28,546][model8_pretrain.py][INFO] Epoch:[0/2](462000/4588595) loss:2.781 lr:0.0000100 epoch_Time:26177.0min: [2024-01-04 18:30:28,546][model8_pretrain.py][INFO] Epoch:[0/2](462000/4588595) loss:2.913 lr:0.0000100 epoch_Time:26177.0min: [2024-01-04 18:30:28,546][model8_pretrain.py][INFO] Epoch:[0/2](462000/4588595) loss:2.977 lr:0.0000100 epoch_Time:26177.0min: [2024-01-04 18:31:05,476][model8_pretrain.py][INFO] Epoch:[0/2](462100/4588595) loss:2.793 lr:0.0000100 epoch_Time:26176.0min: [2024-01-04 18:31:05,476][model8_pretrain.py][INFO] Epoch:[0/2](462100/4588595) loss:2.704 lr:0.0000100 epoch_Time:26176.0min: [2024-01-04 18:31:05,476][model8_pretrain.py][INFO] Epoch:[0/2](462100/4588595) loss:3.025 lr:0.0000100 epoch_Time:26176.0min: [2024-01-04 18:31:05,476][model8_pretrain.py][INFO] Epoch:[0/2](462100/4588595) loss:2.854 lr:0.0000100 epoch_Time:26176.0min: [2024-01-04 18:31:05,476][model8_pretrain.py][INFO] Epoch:[0/2](462100/4588595) loss:2.994 lr:0.0000100 epoch_Time:26176.0min: [2024-01-04 18:31:05,476][model8_pretrain.py][INFO] Epoch:[0/2](462100/4588595) loss:2.220 lr:0.0000100 epoch_Time:26176.0min: [2024-01-04 18:31:05,476][model8_pretrain.py][INFO] Epoch:[0/2](462100/4588595) loss:2.880 lr:0.0000100 epoch_Time:26176.0min: [2024-01-04 
18:31:05,476][model8_pretrain.py][INFO] Epoch:[0/2](462100/4588595) loss:2.783 lr:0.0000100 epoch_Time:26176.0min: [2024-01-04 18:31:42,406][model8_pretrain.py][INFO] Epoch:[0/2](462200/4588595) loss:2.980 lr:0.0000100 epoch_Time:26176.0min: [2024-01-04 18:31:42,406][model8_pretrain.py][INFO] Epoch:[0/2](462200/4588595) loss:3.042 lr:0.0000100 epoch_Time:26176.0min: [2024-01-04 18:31:42,406][model8_pretrain.py][INFO] Epoch:[0/2](462200/4588595) loss:2.855 lr:0.0000100 epoch_Time:26176.0min: [2024-01-04 18:31:42,406][model8_pretrain.py][INFO] Epoch:[0/2](462200/4588595) loss:2.461 lr:0.0000100 epoch_Time:26176.0min: [2024-01-04 18:31:42,406][model8_pretrain.py][INFO] Epoch:[0/2](462200/4588595) loss:3.560 lr:0.0000100 epoch_Time:26176.0min: [2024-01-04 18:31:42,406][model8_pretrain.py][INFO] Epoch:[0/2](462200/4588595) loss:2.521 lr:0.0000100 epoch_Time:26176.0min: [2024-01-04 18:31:42,407][model8_pretrain.py][INFO] Epoch:[0/2](462200/4588595) loss:2.712 lr:0.0000100 epoch_Time:26176.0min: [2024-01-04 18:31:42,407][model8_pretrain.py][INFO] Epoch:[0/2](462200/4588595) loss:2.375 lr:0.0000100 epoch_Time:26176.0min: [2024-01-04 18:32:19,342][model8_pretrain.py][INFO] Epoch:[0/2](462300/4588595) loss:3.000 lr:0.0000100 epoch_Time:26174.0min: [2024-01-04 18:32:19,342][model8_pretrain.py][INFO] Epoch:[0/2](462300/4588595) loss:3.052 lr:0.0000100 epoch_Time:26174.0min: [2024-01-04 18:32:19,342][model8_pretrain.py][INFO] Epoch:[0/2](462300/4588595) loss:3.169 lr:0.0000100 epoch_Time:26174.0min: [2024-01-04 18:32:19,342][model8_pretrain.py][INFO] Epoch:[0/2](462300/4588595) loss:2.236 lr:0.0000100 epoch_Time:26174.0min: [2024-01-04 18:32:19,342][model8_pretrain.py][INFO] Epoch:[0/2](462300/4588595) loss:2.913 lr:0.0000100 epoch_Time:26174.0min: [2024-01-04 18:32:19,342][model8_pretrain.py][INFO] Epoch:[0/2](462300/4588595) loss:3.021 lr:0.0000100 epoch_Time:26174.0min: [2024-01-04 18:32:19,342][model8_pretrain.py][INFO] Epoch:[0/2](462300/4588595) loss:3.047 lr:0.0000100 
epoch_Time:26174.0min: [2024-01-04 18:32:19,342][model8_pretrain.py][INFO] Epoch:[0/2](462300/4588595) loss:3.068 lr:0.0000100 epoch_Time:26174.0min: [2024-01-04 18:32:56,268][model8_pretrain.py][INFO] Epoch:[0/2](462400/4588595) loss:2.959 lr:0.0000100 epoch_Time:26173.0min: [2024-01-04 18:32:56,268][model8_pretrain.py][INFO] Epoch:[0/2](462400/4588595) loss:2.828 lr:0.0000100 epoch_Time:26173.0min: [2024-01-04 18:32:56,268][model8_pretrain.py][INFO] Epoch:[0/2](462400/4588595) loss:2.552 lr:0.0000100 epoch_Time:26173.0min: [2024-01-04 18:32:56,268][model8_pretrain.py][INFO] Epoch:[0/2](462400/4588595) loss:2.799 lr:0.0000100 epoch_Time:26173.0min: [2024-01-04 18:32:56,268][model8_pretrain.py][INFO] Epoch:[0/2](462400/4588595) loss:3.133 lr:0.0000100 epoch_Time:26173.0min: [2024-01-04 18:32:56,268][model8_pretrain.py][INFO] Epoch:[0/2](462400/4588595) loss:2.636 lr:0.0000100 epoch_Time:26173.0min: [2024-01-04 18:32:56,268][model8_pretrain.py][INFO] Epoch:[0/2](462400/4588595) loss:2.622 lr:0.0000100 epoch_Time:26173.0min: [2024-01-04 18:32:56,268][model8_pretrain.py][INFO] Epoch:[0/2](462400/4588595) loss:3.006 lr:0.0000100 epoch_Time:26173.0min: [2024-01-04 18:33:33,203][model8_pretrain.py][INFO] Epoch:[0/2](462500/4588595) loss:2.377 lr:0.0000100 epoch_Time:26173.0min: [2024-01-04 18:33:33,203][model8_pretrain.py][INFO] Epoch:[0/2](462500/4588595) loss:2.799 lr:0.0000100 epoch_Time:26173.0min: [2024-01-04 18:33:33,203][model8_pretrain.py][INFO] Epoch:[0/2](462500/4588595) loss:2.786 lr:0.0000100 epoch_Time:26173.0min: [2024-01-04 18:33:33,203][model8_pretrain.py][INFO] Epoch:[0/2](462500/4588595) loss:2.996 lr:0.0000100 epoch_Time:26173.0min: [2024-01-04 18:33:33,203][model8_pretrain.py][INFO] Epoch:[0/2](462500/4588595) loss:3.108 lr:0.0000100 epoch_Time:26173.0min: [2024-01-04 18:33:33,203][model8_pretrain.py][INFO] Epoch:[0/2](462500/4588595) loss:3.129 lr:0.0000100 epoch_Time:26173.0min: [2024-01-04 18:33:33,204][model8_pretrain.py][INFO] 
Epoch:[0/2](462500/4588595) loss:2.543 lr:0.0000100 epoch_Time:26173.0min: [2024-01-04 18:33:33,204][model8_pretrain.py][INFO] Epoch:[0/2](462500/4588595) loss:3.025 lr:0.0000100 epoch_Time:26173.0min: [2024-01-04 18:34:10,176][model8_pretrain.py][INFO] Epoch:[0/2](462600/4588595) loss:2.882 lr:0.0000100 epoch_Time:26172.0min: [2024-01-04 18:34:10,176][model8_pretrain.py][INFO] Epoch:[0/2](462600/4588595) loss:2.313 lr:0.0000100 epoch_Time:26172.0min: [2024-01-04 18:34:10,176][model8_pretrain.py][INFO] Epoch:[0/2](462600/4588595) loss:2.811 lr:0.0000100 epoch_Time:26172.0min: [2024-01-04 18:34:10,176][model8_pretrain.py][INFO] Epoch:[0/2](462600/4588595) loss:2.635 lr:0.0000100 epoch_Time:26172.0min: [2024-01-04 18:34:10,176][model8_pretrain.py][INFO] Epoch:[0/2](462600/4588595) loss:2.817 lr:0.0000100 epoch_Time:26172.0min: [2024-01-04 18:34:10,176][model8_pretrain.py][INFO] Epoch:[0/2](462600/4588595) loss:2.624 lr:0.0000100 epoch_Time:26172.0min: [2024-01-04 18:34:10,176][model8_pretrain.py][INFO] Epoch:[0/2](462600/4588595) loss:2.831 lr:0.0000100 epoch_Time:26172.0min: [2024-01-04 18:34:10,176][model8_pretrain.py][INFO] Epoch:[0/2](462600/4588595) loss:3.131 lr:0.0000100 epoch_Time:26172.0min: [2024-01-04 18:34:58,072][model8_pretrain.py][INFO] Epoch:[0/2](462700/4588595) loss:2.542 lr:0.0000100 epoch_Time:26173.0min: [2024-01-04 18:34:58,072][model8_pretrain.py][INFO] Epoch:[0/2](462700/4588595) loss:3.128 lr:0.0000100 epoch_Time:26173.0min: [2024-01-04 18:34:58,072][model8_pretrain.py][INFO] Epoch:[0/2](462700/4588595) loss:3.175 lr:0.0000100 epoch_Time:26173.0min: [2024-01-04 18:34:58,072][model8_pretrain.py][INFO] Epoch:[0/2](462700/4588595) loss:2.753 lr:0.0000100 epoch_Time:26173.0min: [2024-01-04 18:34:58,072][model8_pretrain.py][INFO] Epoch:[0/2](462700/4588595) loss:2.656 lr:0.0000100 epoch_Time:26173.0min: [2024-01-04 18:34:58,072][model8_pretrain.py][INFO] Epoch:[0/2](462700/4588595) loss:3.145 lr:0.0000100 epoch_Time:26173.0min: [2024-01-04 
18:34:58,073][model8_pretrain.py][INFO] Epoch:[0/2](462700/4588595) loss:3.120 lr:0.0000100 epoch_Time:26173.0min: [2024-01-04 18:34:58,073][model8_pretrain.py][INFO] Epoch:[0/2](462700/4588595) loss:2.596 lr:0.0000100 epoch_Time:26173.0min: [2024-01-04 18:35:34,997][model8_pretrain.py][INFO] Epoch:[0/2](462800/4588595) loss:3.374 lr:0.0000100 epoch_Time:26172.0min: [2024-01-04 18:35:34,997][model8_pretrain.py][INFO] Epoch:[0/2](462800/4588595) loss:2.871 lr:0.0000100 epoch_Time:26172.0min: [2024-01-04 18:35:34,997][model8_pretrain.py][INFO] Epoch:[0/2](462800/4588595) loss:1.964 lr:0.0000100 epoch_Time:26172.0min: [2024-01-04 18:35:34,997][model8_pretrain.py][INFO] Epoch:[0/2](462800/4588595) loss:2.834 lr:0.0000100 epoch_Time:26172.0min: [2024-01-04 18:35:34,997][model8_pretrain.py][INFO] Epoch:[0/2](462800/4588595) loss:3.091 lr:0.0000100 epoch_Time:26172.0min: [2024-01-04 18:35:34,997][model8_pretrain.py][INFO] Epoch:[0/2](462800/4588595) loss:3.125 lr:0.0000100 epoch_Time:26172.0min: [2024-01-04 18:35:34,997][model8_pretrain.py][INFO] Epoch:[0/2](462800/4588595) loss:2.796 lr:0.0000100 epoch_Time:26172.0min: [2024-01-04 18:35:34,997][model8_pretrain.py][INFO] Epoch:[0/2](462800/4588595) loss:2.933 lr:0.0000100 epoch_Time:26172.0min: [2024-01-04 18:36:11,938][model8_pretrain.py][INFO] Epoch:[0/2](462900/4588595) loss:3.198 lr:0.0000100 epoch_Time:26171.0min: [2024-01-04 18:36:11,938][model8_pretrain.py][INFO] Epoch:[0/2](462900/4588595) loss:3.133 lr:0.0000100 epoch_Time:26171.0min: [2024-01-04 18:36:11,938][model8_pretrain.py][INFO] Epoch:[0/2](462900/4588595) loss:2.815 lr:0.0000100 epoch_Time:26171.0min: [2024-01-04 18:36:11,938][model8_pretrain.py][INFO] Epoch:[0/2](462900/4588595) loss:2.334 lr:0.0000100 epoch_Time:26171.0min: [2024-01-04 18:36:11,938][model8_pretrain.py][INFO] Epoch:[0/2](462900/4588595) loss:3.169 lr:0.0000100 epoch_Time:26171.0min: [2024-01-04 18:36:11,938][model8_pretrain.py][INFO] Epoch:[0/2](462900/4588595) loss:2.720 lr:0.0000100 
epoch_Time:26171.0min: [2024-01-04 18:36:11,938][model8_pretrain.py][INFO] Epoch:[0/2](462900/4588595) loss:3.099 lr:0.0000100 epoch_Time:26171.0min: [2024-01-04 18:36:11,938][model8_pretrain.py][INFO] Epoch:[0/2](462900/4588595) loss:3.402 lr:0.0000100 epoch_Time:26171.0min: [2024-01-04 18:36:48,876][model8_pretrain.py][INFO] Epoch:[0/2](463000/4588595) loss:2.972 lr:0.0000100 epoch_Time:26170.0min: [2024-01-04 18:36:48,876][model8_pretrain.py][INFO] Epoch:[0/2](463000/4588595) loss:2.529 lr:0.0000100 epoch_Time:26170.0min: [2024-01-04 18:36:48,876][model8_pretrain.py][INFO] Epoch:[0/2](463000/4588595) loss:2.605 lr:0.0000100 epoch_Time:26170.0min: [2024-01-04 18:36:48,876][model8_pretrain.py][INFO] Epoch:[0/2](463000/4588595) loss:1.932 lr:0.0000100 epoch_Time:26170.0min: [2024-01-04 18:36:48,876][model8_pretrain.py][INFO] Epoch:[0/2](463000/4588595) loss:2.946 lr:0.0000100 epoch_Time:26170.0min: [2024-01-04 18:36:48,876][model8_pretrain.py][INFO] Epoch:[0/2](463000/4588595) loss:2.668 lr:0.0000100 epoch_Time:26170.0min: [2024-01-04 18:36:48,876][model8_pretrain.py][INFO] Epoch:[0/2](463000/4588595) loss:2.331 lr:0.0000100 epoch_Time:26170.0min: [2024-01-04 18:36:48,876][model8_pretrain.py][INFO] Epoch:[0/2](463000/4588595) loss:3.125 lr:0.0000100 epoch_Time:26170.0min: [2024-01-04 18:37:25,819][model8_pretrain.py][INFO] Epoch:[0/2](463100/4588595) loss:2.470 lr:0.0000100 epoch_Time:26170.0min: [2024-01-04 18:37:25,819][model8_pretrain.py][INFO] Epoch:[0/2](463100/4588595) loss:3.311 lr:0.0000100 epoch_Time:26170.0min: [2024-01-04 18:37:25,819][model8_pretrain.py][INFO] Epoch:[0/2](463100/4588595) loss:2.730 lr:0.0000100 epoch_Time:26170.0min: [2024-01-04 18:37:25,819][model8_pretrain.py][INFO] Epoch:[0/2](463100/4588595) loss:2.861 lr:0.0000100 epoch_Time:26170.0min: [2024-01-04 18:37:25,819][model8_pretrain.py][INFO] Epoch:[0/2](463100/4588595) loss:2.612 lr:0.0000100 epoch_Time:26170.0min: [2024-01-04 18:37:25,819][model8_pretrain.py][INFO] 
Epoch:[0/2](463100/4588595) loss:2.449 lr:0.0000100 epoch_Time:26170.0min: [2024-01-04 18:37:25,819][model8_pretrain.py][INFO] Epoch:[0/2](463100/4588595) loss:3.142 lr:0.0000100 epoch_Time:26170.0min: [2024-01-04 18:37:25,819][model8_pretrain.py][INFO] Epoch:[0/2](463100/4588595) loss:3.471 lr:0.0000100 epoch_Time:26170.0min: [2024-01-04 18:38:02,760][model8_pretrain.py][INFO] Epoch:[0/2](463200/4588595) loss:2.748 lr:0.0000100 epoch_Time:26169.0min: [2024-01-04 18:38:02,760][model8_pretrain.py][INFO] Epoch:[0/2](463200/4588595) loss:2.890 lr:0.0000100 epoch_Time:26169.0min: [2024-01-04 18:38:02,760][model8_pretrain.py][INFO] Epoch:[0/2](463200/4588595) loss:2.628 lr:0.0000100 epoch_Time:26169.0min: [2024-01-04 18:38:02,760][model8_pretrain.py][INFO] Epoch:[0/2](463200/4588595) loss:3.385 lr:0.0000100 epoch_Time:26169.0min: [2024-01-04 18:38:02,760][model8_pretrain.py][INFO] Epoch:[0/2](463200/4588595) loss:2.772 lr:0.0000100 epoch_Time:26169.0min: [2024-01-04 18:38:02,760][model8_pretrain.py][INFO] Epoch:[0/2](463200/4588595) loss:2.487 lr:0.0000100 epoch_Time:26169.0min: [2024-01-04 18:38:02,760][model8_pretrain.py][INFO] Epoch:[0/2](463200/4588595) loss:2.875 lr:0.0000100 epoch_Time:26169.0min: [2024-01-04 18:38:02,760][model8_pretrain.py][INFO] Epoch:[0/2](463200/4588595) loss:3.288 lr:0.0000100 epoch_Time:26169.0min: [2024-01-04 18:38:39,699][model8_pretrain.py][INFO] Epoch:[0/2](463300/4588595) loss:2.800 lr:0.0000100 epoch_Time:26168.0min: [2024-01-04 18:38:39,699][model8_pretrain.py][INFO] Epoch:[0/2](463300/4588595) loss:3.059 lr:0.0000100 epoch_Time:26168.0min: [2024-01-04 18:38:39,699][model8_pretrain.py][INFO] Epoch:[0/2](463300/4588595) loss:2.863 lr:0.0000100 epoch_Time:26168.0min: [2024-01-04 18:38:39,699][model8_pretrain.py][INFO] Epoch:[0/2](463300/4588595) loss:2.840 lr:0.0000100 epoch_Time:26168.0min: [2024-01-04 18:38:39,699][model8_pretrain.py][INFO] Epoch:[0/2](463300/4588595) loss:2.595 lr:0.0000100 epoch_Time:26168.0min: [2024-01-04 
18:38:39,699][model8_pretrain.py][INFO] Epoch:[0/2](463300/4588595) loss:2.732 lr:0.0000100 epoch_Time:26168.0min: [2024-01-04 18:38:39,699][model8_pretrain.py][INFO] Epoch:[0/2](463300/4588595) loss:2.720 lr:0.0000100 epoch_Time:26168.0min: [2024-01-04 18:38:39,699][model8_pretrain.py][INFO] Epoch:[0/2](463300/4588595) loss:3.051 lr:0.0000100 epoch_Time:26168.0min: [2024-01-04 18:39:16,645][model8_pretrain.py][INFO] Epoch:[0/2](463400/4588595) loss:2.668 lr:0.0000100 epoch_Time:26167.0min: [2024-01-04 18:39:16,645][model8_pretrain.py][INFO] Epoch:[0/2](463400/4588595) loss:2.919 lr:0.0000100 epoch_Time:26167.0min: [2024-01-04 18:39:16,645][model8_pretrain.py][INFO] Epoch:[0/2](463400/4588595) loss:3.169 lr:0.0000100 epoch_Time:26167.0min: [2024-01-04 18:39:16,645][model8_pretrain.py][INFO] Epoch:[0/2](463400/4588595) loss:2.866 lr:0.0000100 epoch_Time:26167.0min: [2024-01-04 18:39:16,645][model8_pretrain.py][INFO] Epoch:[0/2](463400/4588595) loss:3.115 lr:0.0000100 epoch_Time:26167.0min: [2024-01-04 18:39:16,646][model8_pretrain.py][INFO] Epoch:[0/2](463400/4588595) loss:3.028 lr:0.0000100 epoch_Time:26167.0min: [2024-01-04 18:39:16,646][model8_pretrain.py][INFO] Epoch:[0/2](463400/4588595) loss:2.450 lr:0.0000100 epoch_Time:26167.0min: [2024-01-04 18:39:16,646][model8_pretrain.py][INFO] Epoch:[0/2](463400/4588595) loss:2.717 lr:0.0000100 epoch_Time:26167.0min: [2024-01-04 18:40:04,031][model8_pretrain.py][INFO] Epoch:[0/2](463500/4588595) loss:2.615 lr:0.0000100 epoch_Time:26168.0min: [2024-01-04 18:40:04,031][model8_pretrain.py][INFO] Epoch:[0/2](463500/4588595) loss:2.879 lr:0.0000100 epoch_Time:26168.0min: [2024-01-04 18:40:04,031][model8_pretrain.py][INFO] Epoch:[0/2](463500/4588595) loss:2.532 lr:0.0000100 epoch_Time:26168.0min: [2024-01-04 18:40:04,031][model8_pretrain.py][INFO] Epoch:[0/2](463500/4588595) loss:3.026 lr:0.0000100 epoch_Time:26168.0min: [2024-01-04 18:40:04,031][model8_pretrain.py][INFO] Epoch:[0/2](463500/4588595) loss:2.616 lr:0.0000100 
epoch_Time:26168.0min: [2024-01-04 18:40:04,031][model8_pretrain.py][INFO] Epoch:[0/2](463500/4588595) loss:2.604 lr:0.0000100 epoch_Time:26168.0min: [2024-01-04 18:40:04,031][model8_pretrain.py][INFO] Epoch:[0/2](463500/4588595) loss:3.000 lr:0.0000100 epoch_Time:26168.0min: [2024-01-04 18:40:04,031][model8_pretrain.py][INFO] Epoch:[0/2](463500/4588595) loss:3.175 lr:0.0000100 epoch_Time:26168.0min: [2024-01-04 18:40:40,948][model8_pretrain.py][INFO] Epoch:[0/2](463600/4588595) loss:3.104 lr:0.0000100 epoch_Time:26168.0min: [2024-01-04 18:40:40,948][model8_pretrain.py][INFO] Epoch:[0/2](463600/4588595) loss:2.680 lr:0.0000100 epoch_Time:26168.0min: [2024-01-04 18:40:40,948][model8_pretrain.py][INFO] Epoch:[0/2](463600/4588595) loss:2.753 lr:0.0000100 epoch_Time:26168.0min: [2024-01-04 18:40:40,948][model8_pretrain.py][INFO] Epoch:[0/2](463600/4588595) loss:3.105 lr:0.0000100 epoch_Time:26168.0min: [2024-01-04 18:40:40,948][model8_pretrain.py][INFO] Epoch:[0/2](463600/4588595) loss:3.009 lr:0.0000100 epoch_Time:26168.0min: [2024-01-04 18:40:40,948][model8_pretrain.py][INFO] Epoch:[0/2](463600/4588595) loss:2.686 lr:0.0000100 epoch_Time:26168.0min: [2024-01-04 18:40:40,948][model8_pretrain.py][INFO] Epoch:[0/2](463600/4588595) loss:2.610 lr:0.0000100 epoch_Time:26168.0min: [2024-01-04 18:40:40,949][model8_pretrain.py][INFO] Epoch:[0/2](463600/4588595) loss:2.987 lr:0.0000100 epoch_Time:26168.0min: [2024-01-04 18:41:17,875][model8_pretrain.py][INFO] Epoch:[0/2](463700/4588595) loss:2.515 lr:0.0000100 epoch_Time:26166.0min: [2024-01-04 18:41:17,875][model8_pretrain.py][INFO] Epoch:[0/2](463700/4588595) loss:3.176 lr:0.0000100 epoch_Time:26166.0min: [2024-01-04 18:41:17,875][model8_pretrain.py][INFO] Epoch:[0/2](463700/4588595) loss:2.952 lr:0.0000100 epoch_Time:26166.0min: [2024-01-04 18:41:17,875][model8_pretrain.py][INFO] Epoch:[0/2](463700/4588595) loss:3.026 lr:0.0000100 epoch_Time:26166.0min: [2024-01-04 18:41:17,875][model8_pretrain.py][INFO] 
Epoch:[0/2](463700/4588595) loss:3.043 lr:0.0000100 epoch_Time:26166.0min: [2024-01-04 18:41:17,875][model8_pretrain.py][INFO] Epoch:[0/2](463700/4588595) loss:3.252 lr:0.0000100 epoch_Time:26166.0min: [2024-01-04 18:41:17,875][model8_pretrain.py][INFO] Epoch:[0/2](463700/4588595) loss:3.078 lr:0.0000100 epoch_Time:26166.0min: [2024-01-04 18:41:17,875][model8_pretrain.py][INFO] Epoch:[0/2](463700/4588595) loss:2.891 lr:0.0000100 epoch_Time:26166.0min: [2024-01-04 18:41:54,810][model8_pretrain.py][INFO] Epoch:[0/2](463800/4588595) loss:3.040 lr:0.0000100 epoch_Time:26165.0min: [2024-01-04 18:41:54,810][model8_pretrain.py][INFO] Epoch:[0/2](463800/4588595) loss:3.136 lr:0.0000100 epoch_Time:26165.0min: [2024-01-04 18:41:54,810][model8_pretrain.py][INFO] Epoch:[0/2](463800/4588595) loss:2.977 lr:0.0000100 epoch_Time:26165.0min: [2024-01-04 18:41:54,810][model8_pretrain.py][INFO] Epoch:[0/2](463800/4588595) loss:2.723 lr:0.0000100 epoch_Time:26165.0min: [2024-01-04 18:41:54,810][model8_pretrain.py][INFO] Epoch:[0/2](463800/4588595) loss:2.979 lr:0.0000100 epoch_Time:26165.0min: [2024-01-04 18:41:54,810][model8_pretrain.py][INFO] Epoch:[0/2](463800/4588595) loss:2.617 lr:0.0000100 epoch_Time:26165.0min: [2024-01-04 18:41:54,810][model8_pretrain.py][INFO] Epoch:[0/2](463800/4588595) loss:2.432 lr:0.0000100 epoch_Time:26165.0min: [2024-01-04 18:41:54,811][model8_pretrain.py][INFO] Epoch:[0/2](463800/4588595) loss:3.272 lr:0.0000100 epoch_Time:26165.0min: [2024-01-04 18:42:31,752][model8_pretrain.py][INFO] Epoch:[0/2](463900/4588595) loss:3.210 lr:0.0000100 epoch_Time:26165.0min: [2024-01-04 18:42:31,752][model8_pretrain.py][INFO] Epoch:[0/2](463900/4588595) loss:3.653 lr:0.0000100 epoch_Time:26165.0min: [2024-01-04 18:42:31,752][model8_pretrain.py][INFO] Epoch:[0/2](463900/4588595) loss:3.113 lr:0.0000100 epoch_Time:26165.0min: [2024-01-04 18:42:31,752][model8_pretrain.py][INFO] Epoch:[0/2](463900/4588595) loss:3.244 lr:0.0000100 epoch_Time:26165.0min: [2024-01-04 
18:42:31,752][model8_pretrain.py][INFO] Epoch:[0/2](463900/4588595) loss:3.088 lr:0.0000100 epoch_Time:26165.0min: [2024-01-04 18:42:31,752][model8_pretrain.py][INFO] Epoch:[0/2](463900/4588595) loss:2.799 lr:0.0000100 epoch_Time:26165.0min: [2024-01-04 18:42:31,752][model8_pretrain.py][INFO] Epoch:[0/2](463900/4588595) loss:2.770 lr:0.0000100 epoch_Time:26165.0min: [2024-01-04 18:42:31,752][model8_pretrain.py][INFO] Epoch:[0/2](463900/4588595) loss:2.351 lr:0.0000100 epoch_Time:26165.0min: [2024-01-04 18:43:08,674][model8_pretrain.py][INFO] Epoch:[0/2](464000/4588595) loss:3.165 lr:0.0000100 epoch_Time:26164.0min: [2024-01-04 18:43:08,674][model8_pretrain.py][INFO] Epoch:[0/2](464000/4588595) loss:3.126 lr:0.0000100 epoch_Time:26164.0min: [2024-01-04 18:43:08,674][model8_pretrain.py][INFO] Epoch:[0/2](464000/4588595) loss:2.949 lr:0.0000100 epoch_Time:26164.0min: [2024-01-04 18:43:08,674][model8_pretrain.py][INFO] Epoch:[0/2](464000/4588595) loss:2.777 lr:0.0000100 epoch_Time:26164.0min: [2024-01-04 18:43:08,674][model8_pretrain.py][INFO] Epoch:[0/2](464000/4588595) loss:2.858 lr:0.0000100 epoch_Time:26164.0min: [2024-01-04 18:43:08,674][model8_pretrain.py][INFO] Epoch:[0/2](464000/4588595) loss:2.677 lr:0.0000100 epoch_Time:26164.0min: [2024-01-04 18:43:08,674][model8_pretrain.py][INFO] Epoch:[0/2](464000/4588595) loss:2.600 lr:0.0000100 epoch_Time:26164.0min: [2024-01-04 18:43:08,674][model8_pretrain.py][INFO] Epoch:[0/2](464000/4588595) loss:3.151 lr:0.0000100 epoch_Time:26164.0min: [2024-01-04 18:43:45,597][model8_pretrain.py][INFO] Epoch:[0/2](464100/4588595) loss:2.530 lr:0.0000100 epoch_Time:26164.0min: [2024-01-04 18:43:45,597][model8_pretrain.py][INFO] Epoch:[0/2](464100/4588595) loss:3.135 lr:0.0000100 epoch_Time:26164.0min: [2024-01-04 18:43:45,597][model8_pretrain.py][INFO] Epoch:[0/2](464100/4588595) loss:2.615 lr:0.0000100 epoch_Time:26164.0min: [2024-01-04 18:43:45,597][model8_pretrain.py][INFO] Epoch:[0/2](464100/4588595) loss:3.078 lr:0.0000100 
epoch_Time:26164.0min: [2024-01-04 18:43:45,597][model8_pretrain.py][INFO] Epoch:[0/2](464100/4588595) loss:2.633 lr:0.0000100 epoch_Time:26164.0min: [2024-01-04 18:43:45,597][model8_pretrain.py][INFO] Epoch:[0/2](464100/4588595) loss:2.823 lr:0.0000100 epoch_Time:26164.0min: [2024-01-04 18:43:45,597][model8_pretrain.py][INFO] Epoch:[0/2](464100/4588595) loss:3.127 lr:0.0000100 epoch_Time:26164.0min: [2024-01-04 18:43:45,598][model8_pretrain.py][INFO] Epoch:[0/2](464100/4588595) loss:2.865 lr:0.0000100 epoch_Time:26164.0min: [2024-01-04 18:44:22,526][model8_pretrain.py][INFO] Epoch:[0/2](464200/4588595) loss:2.453 lr:0.0000100 epoch_Time:26162.0min: [2024-01-04 18:44:22,526][model8_pretrain.py][INFO] Epoch:[0/2](464200/4588595) loss:3.123 lr:0.0000100 epoch_Time:26162.0min: [2024-01-04 18:44:22,526][model8_pretrain.py][INFO] Epoch:[0/2](464200/4588595) loss:3.157 lr:0.0000100 epoch_Time:26162.0min: [2024-01-04 18:44:22,526][model8_pretrain.py][INFO] Epoch:[0/2](464200/4588595) loss:2.891 lr:0.0000100 epoch_Time:26162.0min: [2024-01-04 18:44:22,527][model8_pretrain.py][INFO] Epoch:[0/2](464200/4588595) loss:3.278 lr:0.0000100 epoch_Time:26162.0min: [2024-01-04 18:44:22,527][model8_pretrain.py][INFO] Epoch:[0/2](464200/4588595) loss:2.793 lr:0.0000100 epoch_Time:26162.0min: [2024-01-04 18:44:22,527][model8_pretrain.py][INFO] Epoch:[0/2](464200/4588595) loss:2.973 lr:0.0000100 epoch_Time:26162.0min: [2024-01-04 18:44:22,527][model8_pretrain.py][INFO] Epoch:[0/2](464200/4588595) loss:3.233 lr:0.0000100 epoch_Time:26162.0min: [2024-01-04 18:45:09,950][model8_pretrain.py][INFO] Epoch:[0/2](464300/4588595) loss:2.845 lr:0.0000100 epoch_Time:26163.0min: [2024-01-04 18:45:09,950][model8_pretrain.py][INFO] Epoch:[0/2](464300/4588595) loss:2.836 lr:0.0000100 epoch_Time:26163.0min: [2024-01-04 18:45:09,950][model8_pretrain.py][INFO] Epoch:[0/2](464300/4588595) loss:2.778 lr:0.0000100 epoch_Time:26163.0min: [2024-01-04 18:45:09,950][model8_pretrain.py][INFO] 
Epoch:[0/2](464300/4588595) loss:2.922 lr:0.0000100 epoch_Time:26163.0min: [2024-01-04 18:45:09,950][model8_pretrain.py][INFO] Epoch:[0/2](464300/4588595) loss:2.854 lr:0.0000100 epoch_Time:26163.0min: [2024-01-04 18:45:09,950][model8_pretrain.py][INFO] Epoch:[0/2](464300/4588595) loss:2.575 lr:0.0000100 epoch_Time:26163.0min: [2024-01-04 18:45:09,950][model8_pretrain.py][INFO] Epoch:[0/2](464300/4588595) loss:2.474 lr:0.0000100 epoch_Time:26163.0min: [2024-01-04 18:45:09,950][model8_pretrain.py][INFO] Epoch:[0/2](464300/4588595) loss:2.457 lr:0.0000100 epoch_Time:26163.0min: [2024-01-04 18:45:46,863][model8_pretrain.py][INFO] Epoch:[0/2](464400/4588595) loss:2.475 lr:0.0000100 epoch_Time:26163.0min: [2024-01-04 18:45:46,863][model8_pretrain.py][INFO] Epoch:[0/2](464400/4588595) loss:2.924 lr:0.0000100 epoch_Time:26163.0min: [2024-01-04 18:45:46,863][model8_pretrain.py][INFO] Epoch:[0/2](464400/4588595) loss:2.596 lr:0.0000100 epoch_Time:26163.0min: [2024-01-04 18:45:46,863][model8_pretrain.py][INFO] Epoch:[0/2](464400/4588595) loss:2.551 lr:0.0000100 epoch_Time:26163.0min: [2024-01-04 18:45:46,863][model8_pretrain.py][INFO] Epoch:[0/2](464400/4588595) loss:3.349 lr:0.0000100 epoch_Time:26163.0min: [2024-01-04 18:45:46,863][model8_pretrain.py][INFO] Epoch:[0/2](464400/4588595) loss:3.013 lr:0.0000100 epoch_Time:26163.0min: [2024-01-04 18:45:46,863][model8_pretrain.py][INFO] Epoch:[0/2](464400/4588595) loss:2.666 lr:0.0000100 epoch_Time:26163.0min: [2024-01-04 18:45:46,864][model8_pretrain.py][INFO] Epoch:[0/2](464400/4588595) loss:2.257 lr:0.0000100 epoch_Time:26163.0min: [2024-01-04 18:46:23,789][model8_pretrain.py][INFO] Epoch:[0/2](464500/4588595) loss:2.883 lr:0.0000100 epoch_Time:26162.0min: [2024-01-04 18:46:23,789][model8_pretrain.py][INFO] Epoch:[0/2](464500/4588595) loss:2.721 lr:0.0000100 epoch_Time:26162.0min: [2024-01-04 18:46:23,789][model8_pretrain.py][INFO] Epoch:[0/2](464500/4588595) loss:2.539 lr:0.0000100 epoch_Time:26162.0min: [2024-01-04 
18:46:23,789][model8_pretrain.py][INFO] Epoch:[0/2](464500/4588595) loss:2.472 lr:0.0000100 epoch_Time:26162.0min: [2024-01-04 18:46:23,789][model8_pretrain.py][INFO] Epoch:[0/2](464500/4588595) loss:3.030 lr:0.0000100 epoch_Time:26162.0min: [2024-01-04 18:46:23,789][model8_pretrain.py][INFO] Epoch:[0/2](464500/4588595) loss:2.767 lr:0.0000100 epoch_Time:26162.0min: [2024-01-04 18:46:23,789][model8_pretrain.py][INFO] Epoch:[0/2](464500/4588595) loss:2.530 lr:0.0000100 epoch_Time:26162.0min: [2024-01-04 18:46:23,789][model8_pretrain.py][INFO] Epoch:[0/2](464500/4588595) loss:2.650 lr:0.0000100 epoch_Time:26162.0min: [2024-01-04 18:47:00,716][model8_pretrain.py][INFO] Epoch:[0/2](464600/4588595) loss:2.670 lr:0.0000100 epoch_Time:26160.0min: [2024-01-04 18:47:00,717][model8_pretrain.py][INFO] Epoch:[0/2](464600/4588595) loss:3.079 lr:0.0000100 epoch_Time:26160.0min: [2024-01-04 18:47:00,717][model8_pretrain.py][INFO] Epoch:[0/2](464600/4588595) loss:2.666 lr:0.0000100 epoch_Time:26160.0min: [2024-01-04 18:47:00,717][model8_pretrain.py][INFO] Epoch:[0/2](464600/4588595) loss:3.109 lr:0.0000100 epoch_Time:26160.0min: [2024-01-04 18:47:00,717][model8_pretrain.py][INFO] Epoch:[0/2](464600/4588595) loss:2.850 lr:0.0000100 epoch_Time:26160.0min: [2024-01-04 18:47:00,717][model8_pretrain.py][INFO] Epoch:[0/2](464600/4588595) loss:2.371 lr:0.0000100 epoch_Time:26160.0min: [2024-01-04 18:47:00,717][model8_pretrain.py][INFO] Epoch:[0/2](464600/4588595) loss:2.659 lr:0.0000100 epoch_Time:26160.0min: [2024-01-04 18:47:00,717][model8_pretrain.py][INFO] Epoch:[0/2](464600/4588595) loss:2.764 lr:0.0000100 epoch_Time:26160.0min: [2024-01-04 18:47:37,651][model8_pretrain.py][INFO] Epoch:[0/2](464700/4588595) loss:1.979 lr:0.0000100 epoch_Time:26160.0min: [2024-01-04 18:47:37,651][model8_pretrain.py][INFO] Epoch:[0/2](464700/4588595) loss:3.267 lr:0.0000100 epoch_Time:26160.0min: [2024-01-04 18:47:37,651][model8_pretrain.py][INFO] Epoch:[0/2](464700/4588595) loss:3.087 lr:0.0000100 
epoch_Time:26160.0min: [2024-01-04 18:47:37,651][model8_pretrain.py][INFO] Epoch:[0/2](464700/4588595) loss:2.804 lr:0.0000100 epoch_Time:26160.0min: [2024-01-04 18:47:37,651][model8_pretrain.py][INFO] Epoch:[0/2](464700/4588595) loss:2.099 lr:0.0000100 epoch_Time:26160.0min: [2024-01-04 18:47:37,651][model8_pretrain.py][INFO] Epoch:[0/2](464700/4588595) loss:2.544 lr:0.0000100 epoch_Time:26160.0min: [2024-01-04 18:47:37,651][model8_pretrain.py][INFO] Epoch:[0/2](464700/4588595) loss:2.883 lr:0.0000100 epoch_Time:26160.0min: [2024-01-04 18:47:37,651][model8_pretrain.py][INFO] Epoch:[0/2](464700/4588595) loss:2.670 lr:0.0000100 epoch_Time:26160.0min: [2024-01-04 18:48:14,631][model8_pretrain.py][INFO] Epoch:[0/2](464800/4588595) loss:3.179 lr:0.0000100 epoch_Time:26159.0min: [2024-01-04 18:48:14,631][model8_pretrain.py][INFO] Epoch:[0/2](464800/4588595) loss:2.594 lr:0.0000100 epoch_Time:26159.0min: [2024-01-04 18:48:14,631][model8_pretrain.py][INFO] Epoch:[0/2](464800/4588595) loss:2.882 lr:0.0000100 epoch_Time:26159.0min: [2024-01-04 18:48:14,631][model8_pretrain.py][INFO] Epoch:[0/2](464800/4588595) loss:2.844 lr:0.0000100 epoch_Time:26159.0min: [2024-01-04 18:48:14,631][model8_pretrain.py][INFO] Epoch:[0/2](464800/4588595) loss:3.197 lr:0.0000100 epoch_Time:26159.0min: [2024-01-04 18:48:14,631][model8_pretrain.py][INFO] Epoch:[0/2](464800/4588595) loss:2.884 lr:0.0000100 epoch_Time:26159.0min: [2024-01-04 18:48:14,631][model8_pretrain.py][INFO] Epoch:[0/2](464800/4588595) loss:2.601 lr:0.0000100 epoch_Time:26159.0min: [2024-01-04 18:48:14,632][model8_pretrain.py][INFO] Epoch:[0/2](464800/4588595) loss:3.181 lr:0.0000100 epoch_Time:26159.0min: [2024-01-04 18:48:51,570][model8_pretrain.py][INFO] Epoch:[0/2](464900/4588595) loss:2.833 lr:0.0000100 epoch_Time:26158.0min: [2024-01-04 18:48:51,570][model8_pretrain.py][INFO] Epoch:[0/2](464900/4588595) loss:2.622 lr:0.0000100 epoch_Time:26158.0min: [2024-01-04 18:48:51,570][model8_pretrain.py][INFO] 
Epoch:[0/2](464900/4588595) loss:2.730 lr:0.0000100 epoch_Time:26158.0min: [2024-01-04 18:48:51,571][model8_pretrain.py][INFO] Epoch:[0/2](464900/4588595) loss:2.045 lr:0.0000100 epoch_Time:26158.0min: [2024-01-04 18:48:51,570][model8_pretrain.py][INFO] Epoch:[0/2](464900/4588595) loss:2.982 lr:0.0000100 epoch_Time:26158.0min: [2024-01-04 18:48:51,571][model8_pretrain.py][INFO] Epoch:[0/2](464900/4588595) loss:2.900 lr:0.0000100 epoch_Time:26158.0min: [2024-01-04 18:48:51,571][model8_pretrain.py][INFO] Epoch:[0/2](464900/4588595) loss:2.915 lr:0.0000100 epoch_Time:26158.0min: [2024-01-04 18:48:51,571][model8_pretrain.py][INFO] Epoch:[0/2](464900/4588595) loss:2.990 lr:0.0000100 epoch_Time:26158.0min: [2024-01-04 18:49:28,508][model8_pretrain.py][INFO] Epoch:[0/2](465000/4588595) loss:2.904 lr:0.0000100 epoch_Time:26158.0min: [2024-01-04 18:49:28,508][model8_pretrain.py][INFO] Epoch:[0/2](465000/4588595) loss:2.998 lr:0.0000100 epoch_Time:26158.0min: [2024-01-04 18:49:28,508][model8_pretrain.py][INFO] Epoch:[0/2](465000/4588595) loss:2.599 lr:0.0000100 epoch_Time:26158.0min: [2024-01-04 18:49:28,508][model8_pretrain.py][INFO] Epoch:[0/2](465000/4588595) loss:3.031 lr:0.0000100 epoch_Time:26158.0min: [2024-01-04 18:49:28,508][model8_pretrain.py][INFO] Epoch:[0/2](465000/4588595) loss:3.133 lr:0.0000100 epoch_Time:26158.0min: [2024-01-04 18:49:28,508][model8_pretrain.py][INFO] Epoch:[0/2](465000/4588595) loss:2.856 lr:0.0000100 epoch_Time:26158.0min: [2024-01-04 18:49:28,508][model8_pretrain.py][INFO] Epoch:[0/2](465000/4588595) loss:3.030 lr:0.0000100 epoch_Time:26158.0min: [2024-01-04 18:49:28,508][model8_pretrain.py][INFO] Epoch:[0/2](465000/4588595) loss:3.388 lr:0.0000100 epoch_Time:26158.0min: [2024-01-04 18:50:15,828][model8_pretrain.py][INFO] Epoch:[0/2](465100/4588595) loss:2.761 lr:0.0000100 epoch_Time:26158.0min: [2024-01-04 18:50:15,828][model8_pretrain.py][INFO] Epoch:[0/2](465100/4588595) loss:2.841 lr:0.0000100 epoch_Time:26158.0min: [2024-01-04 
18:50:15,828][model8_pretrain.py][INFO] Epoch:[0/2](465100/4588595) loss:2.858 lr:0.0000100 epoch_Time:26158.0min: [2024-01-04 18:50:15,828][model8_pretrain.py][INFO] Epoch:[0/2](465100/4588595) loss:3.323 lr:0.0000100 epoch_Time:26158.0min: [2024-01-04 18:50:15,828][model8_pretrain.py][INFO] Epoch:[0/2](465100/4588595) loss:2.642 lr:0.0000100 epoch_Time:26158.0min: [2024-01-04 18:50:15,828][model8_pretrain.py][INFO] Epoch:[0/2](465100/4588595) loss:2.707 lr:0.0000100 epoch_Time:26158.0min: [2024-01-04 18:50:15,828][model8_pretrain.py][INFO] Epoch:[0/2](465100/4588595) loss:2.679 lr:0.0000100 epoch_Time:26158.0min: [2024-01-04 18:50:15,828][model8_pretrain.py][INFO] Epoch:[0/2](465100/4588595) loss:3.056 lr:0.0000100 epoch_Time:26158.0min: [2024-01-04 18:50:52,741][model8_pretrain.py][INFO] Epoch:[0/2](465200/4588595) loss:2.919 lr:0.0000100 epoch_Time:26157.0min: [2024-01-04 18:50:52,741][model8_pretrain.py][INFO] Epoch:[0/2](465200/4588595) loss:2.932 lr:0.0000100 epoch_Time:26157.0min: [2024-01-04 18:50:52,741][model8_pretrain.py][INFO] Epoch:[0/2](465200/4588595) loss:2.656 lr:0.0000100 epoch_Time:26157.0min: [2024-01-04 18:50:52,741][model8_pretrain.py][INFO] Epoch:[0/2](465200/4588595) loss:2.885 lr:0.0000100 epoch_Time:26157.0min: [2024-01-04 18:50:52,741][model8_pretrain.py][INFO] Epoch:[0/2](465200/4588595) loss:3.048 lr:0.0000100 epoch_Time:26157.0min: [2024-01-04 18:50:52,741][model8_pretrain.py][INFO] Epoch:[0/2](465200/4588595) loss:2.838 lr:0.0000100 epoch_Time:26157.0min: [2024-01-04 18:50:52,741][model8_pretrain.py][INFO] Epoch:[0/2](465200/4588595) loss:3.349 lr:0.0000100 epoch_Time:26157.0min: [2024-01-04 18:50:52,741][model8_pretrain.py][INFO] Epoch:[0/2](465200/4588595) loss:3.064 lr:0.0000100 epoch_Time:26157.0min: [2024-01-04 18:51:29,684][model8_pretrain.py][INFO] Epoch:[0/2](465300/4588595) loss:3.324 lr:0.0000100 epoch_Time:26157.0min: [2024-01-04 18:51:29,684][model8_pretrain.py][INFO] Epoch:[0/2](465300/4588595) loss:3.057 lr:0.0000100 
epoch_Time:26157.0min: [2024-01-04 18:51:29,684][model8_pretrain.py][INFO] Epoch:[0/2](465300/4588595) loss:3.208 lr:0.0000100 epoch_Time:26157.0min: [2024-01-04 18:51:29,684][model8_pretrain.py][INFO] Epoch:[0/2](465300/4588595) loss:2.695 lr:0.0000100 epoch_Time:26157.0min: [2024-01-04 18:51:29,684][model8_pretrain.py][INFO] Epoch:[0/2](465300/4588595) loss:2.639 lr:0.0000100 epoch_Time:26157.0min: [2024-01-04 18:51:29,684][model8_pretrain.py][INFO] Epoch:[0/2](465300/4588595) loss:2.876 lr:0.0000100 epoch_Time:26157.0min: [2024-01-04 18:51:29,684][model8_pretrain.py][INFO] Epoch:[0/2](465300/4588595) loss:2.650 lr:0.0000100 epoch_Time:26157.0min: [2024-01-04 18:51:29,684][model8_pretrain.py][INFO] Epoch:[0/2](465300/4588595) loss:2.540 lr:0.0000100 epoch_Time:26157.0min: [2024-01-04 18:52:06,619][model8_pretrain.py][INFO] Epoch:[0/2](465400/4588595) loss:2.492 lr:0.0000100 epoch_Time:26156.0min: [2024-01-04 18:52:06,620][model8_pretrain.py][INFO] Epoch:[0/2](465400/4588595) loss:2.765 lr:0.0000100 epoch_Time:26156.0min: [2024-01-04 18:52:06,620][model8_pretrain.py][INFO] Epoch:[0/2](465400/4588595) loss:2.983 lr:0.0000100 epoch_Time:26156.0min: [2024-01-04 18:52:06,620][model8_pretrain.py][INFO] Epoch:[0/2](465400/4588595) loss:3.134 lr:0.0000100 epoch_Time:26156.0min: [2024-01-04 18:52:06,620][model8_pretrain.py][INFO] Epoch:[0/2](465400/4588595) loss:2.425 lr:0.0000100 epoch_Time:26156.0min: [2024-01-04 18:52:06,620][model8_pretrain.py][INFO] Epoch:[0/2](465400/4588595) loss:3.263 lr:0.0000100 epoch_Time:26156.0min: [2024-01-04 18:52:06,620][model8_pretrain.py][INFO] Epoch:[0/2](465400/4588595) loss:2.818 lr:0.0000100 epoch_Time:26156.0min: [2024-01-04 18:52:06,620][model8_pretrain.py][INFO] Epoch:[0/2](465400/4588595) loss:2.674 lr:0.0000100 epoch_Time:26156.0min: [2024-01-04 18:52:43,550][model8_pretrain.py][INFO] Epoch:[0/2](465500/4588595) loss:3.094 lr:0.0000100 epoch_Time:26156.0min: [2024-01-04 18:52:43,550][model8_pretrain.py][INFO] 
Epoch:[0/2](465500/4588595) loss:2.977 lr:0.0000100 epoch_Time:26156.0min: [2024-01-04 18:52:43,550][model8_pretrain.py][INFO] Epoch:[0/2](465500/4588595) loss:2.842 lr:0.0000100 epoch_Time:26156.0min: [2024-01-04 18:52:43,550][model8_pretrain.py][INFO] Epoch:[0/2](465500/4588595) loss:2.769 lr:0.0000100 epoch_Time:26156.0min: [2024-01-04 18:52:43,550][model8_pretrain.py][INFO] Epoch:[0/2](465500/4588595) loss:2.474 lr:0.0000100 epoch_Time:26156.0min: [2024-01-04 18:52:43,550][model8_pretrain.py][INFO] Epoch:[0/2](465500/4588595) loss:2.995 lr:0.0000100 epoch_Time:26156.0min: [2024-01-04 18:52:43,550][model8_pretrain.py][INFO] Epoch:[0/2](465500/4588595) loss:2.909 lr:0.0000100 epoch_Time:26156.0min: [2024-01-04 18:52:43,551][model8_pretrain.py][INFO] Epoch:[0/2](465500/4588595) loss:3.072 lr:0.0000100 epoch_Time:26156.0min: [2024-01-04 18:53:20,487][model8_pretrain.py][INFO] Epoch:[0/2](465600/4588595) loss:2.935 lr:0.0000100 epoch_Time:26154.0min: [2024-01-04 18:53:20,487][model8_pretrain.py][INFO] Epoch:[0/2](465600/4588595) loss:2.266 lr:0.0000100 epoch_Time:26154.0min: [2024-01-04 18:53:20,487][model8_pretrain.py][INFO] Epoch:[0/2](465600/4588595) loss:2.366 lr:0.0000100 epoch_Time:26154.0min: [2024-01-04 18:53:20,487][model8_pretrain.py][INFO] Epoch:[0/2](465600/4588595) loss:2.865 lr:0.0000100 epoch_Time:26154.0min: [2024-01-04 18:53:20,487][model8_pretrain.py][INFO] Epoch:[0/2](465600/4588595) loss:2.776 lr:0.0000100 epoch_Time:26154.0min: [2024-01-04 18:53:20,487][model8_pretrain.py][INFO] Epoch:[0/2](465600/4588595) loss:2.218 lr:0.0000100 epoch_Time:26154.0min: [2024-01-04 18:53:20,487][model8_pretrain.py][INFO] Epoch:[0/2](465600/4588595) loss:2.489 lr:0.0000100 epoch_Time:26154.0min: [2024-01-04 18:53:20,487][model8_pretrain.py][INFO] Epoch:[0/2](465600/4588595) loss:3.391 lr:0.0000100 epoch_Time:26154.0min: [2024-01-04 18:53:57,423][model8_pretrain.py][INFO] Epoch:[0/2](465700/4588595) loss:2.785 lr:0.0000100 epoch_Time:26153.0min: [2024-01-04 
18:53:57,423][model8_pretrain.py][INFO] Epoch:[0/2](465700/4588595) loss:2.725 lr:0.0000100 epoch_Time:26153.0min: [2024-01-04 18:53:57,423][model8_pretrain.py][INFO] Epoch:[0/2](465700/4588595) loss:2.498 lr:0.0000100 epoch_Time:26153.0min: [2024-01-04 18:53:57,423][model8_pretrain.py][INFO] Epoch:[0/2](465700/4588595) loss:3.305 lr:0.0000100 epoch_Time:26153.0min: [2024-01-04 18:53:57,423][model8_pretrain.py][INFO] Epoch:[0/2](465700/4588595) loss:2.714 lr:0.0000100 epoch_Time:26153.0min: [2024-01-04 18:53:57,423][model8_pretrain.py][INFO] Epoch:[0/2](465700/4588595) loss:2.417 lr:0.0000100 epoch_Time:26153.0min: [2024-01-04 18:53:57,423][model8_pretrain.py][INFO] Epoch:[0/2](465700/4588595) loss:2.580 lr:0.0000100 epoch_Time:26153.0min: [2024-01-04 18:53:57,423][model8_pretrain.py][INFO] Epoch:[0/2](465700/4588595) loss:2.611 lr:0.0000100 epoch_Time:26153.0min: [2024-01-04 18:54:34,363][model8_pretrain.py][INFO] Epoch:[0/2](465800/4588595) loss:2.960 lr:0.0000100 epoch_Time:26153.0min: [2024-01-04 18:54:34,363][model8_pretrain.py][INFO] Epoch:[0/2](465800/4588595) loss:2.673 lr:0.0000100 epoch_Time:26153.0min: [2024-01-04 18:54:34,363][model8_pretrain.py][INFO] Epoch:[0/2](465800/4588595) loss:3.039 lr:0.0000100 epoch_Time:26153.0min: [2024-01-04 18:54:34,363][model8_pretrain.py][INFO] Epoch:[0/2](465800/4588595) loss:2.807 lr:0.0000100 epoch_Time:26153.0min: [2024-01-04 18:54:34,363][model8_pretrain.py][INFO] Epoch:[0/2](465800/4588595) loss:3.104 lr:0.0000100 epoch_Time:26153.0min: [2024-01-04 18:54:34,363][model8_pretrain.py][INFO] Epoch:[0/2](465800/4588595) loss:3.040 lr:0.0000100 epoch_Time:26153.0min: [2024-01-04 18:54:34,363][model8_pretrain.py][INFO] Epoch:[0/2](465800/4588595) loss:2.770 lr:0.0000100 epoch_Time:26153.0min: [2024-01-04 18:54:34,363][model8_pretrain.py][INFO] Epoch:[0/2](465800/4588595) loss:2.467 lr:0.0000100 epoch_Time:26153.0min: [2024-01-04 18:55:21,635][model8_pretrain.py][INFO] Epoch:[0/2](465900/4588595) loss:2.497 lr:0.0000100 
epoch_Time:26153.0min: [2024-01-04 18:55:21,635][model8_pretrain.py][INFO] Epoch:[0/2](465900/4588595) loss:3.020 lr:0.0000100 epoch_Time:26153.0min: [2024-01-04 18:55:21,635][model8_pretrain.py][INFO] Epoch:[0/2](465900/4588595) loss:2.785 lr:0.0000100 epoch_Time:26153.0min: [2024-01-04 18:55:21,635][model8_pretrain.py][INFO] Epoch:[0/2](465900/4588595) loss:2.364 lr:0.0000100 epoch_Time:26153.0min: [2024-01-04 18:55:21,635][model8_pretrain.py][INFO] Epoch:[0/2](465900/4588595) loss:2.726 lr:0.0000100 epoch_Time:26153.0min: [2024-01-04 18:55:21,635][model8_pretrain.py][INFO] Epoch:[0/2](465900/4588595) loss:2.798 lr:0.0000100 epoch_Time:26153.0min: [2024-01-04 18:55:21,635][model8_pretrain.py][INFO] Epoch:[0/2](465900/4588595) loss:3.366 lr:0.0000100 epoch_Time:26153.0min: [2024-01-04 18:55:21,635][model8_pretrain.py][INFO] Epoch:[0/2](465900/4588595) loss:2.828 lr:0.0000100 epoch_Time:26153.0min: [2024-01-04 18:55:58,547][model8_pretrain.py][INFO] Epoch:[0/2](466000/4588595) loss:2.284 lr:0.0000100 epoch_Time:26152.0min: [2024-01-04 18:55:58,547][model8_pretrain.py][INFO] Epoch:[0/2](466000/4588595) loss:2.763 lr:0.0000100 epoch_Time:26152.0min: [2024-01-04 18:55:58,547][model8_pretrain.py][INFO] Epoch:[0/2](466000/4588595) loss:3.127 lr:0.0000100 epoch_Time:26152.0min: [2024-01-04 18:55:58,547][model8_pretrain.py][INFO] Epoch:[0/2](466000/4588595) loss:3.258 lr:0.0000100 epoch_Time:26152.0min: [2024-01-04 18:55:58,547][model8_pretrain.py][INFO] Epoch:[0/2](466000/4588595) loss:3.529 lr:0.0000100 epoch_Time:26152.0min: [2024-01-04 18:55:58,547][model8_pretrain.py][INFO] Epoch:[0/2](466000/4588595) loss:3.256 lr:0.0000100 epoch_Time:26152.0min: [2024-01-04 18:55:58,548][model8_pretrain.py][INFO] Epoch:[0/2](466000/4588595) loss:3.016 lr:0.0000100 epoch_Time:26152.0min: [2024-01-04 18:55:58,548][model8_pretrain.py][INFO] Epoch:[0/2](466000/4588595) loss:3.509 lr:0.0000100 epoch_Time:26152.0min: [2024-01-04 18:56:35,472][model8_pretrain.py][INFO] 
Epoch:[0/2](466100/4588595) loss:2.946 lr:0.0000100 epoch_Time:26152.0min: [2024-01-04 18:56:35,472][model8_pretrain.py][INFO] Epoch:[0/2](466100/4588595) loss:3.322 lr:0.0000100 epoch_Time:26152.0min: [2024-01-04 18:56:35,473][model8_pretrain.py][INFO] Epoch:[0/2](466100/4588595) loss:2.298 lr:0.0000100 epoch_Time:26152.0min: [2024-01-04 18:56:35,473][model8_pretrain.py][INFO] Epoch:[0/2](466100/4588595) loss:2.659 lr:0.0000100 epoch_Time:26152.0min: [2024-01-04 18:56:35,473][model8_pretrain.py][INFO] Epoch:[0/2](466100/4588595) loss:2.706 lr:0.0000100 epoch_Time:26152.0min: [2024-01-04 18:56:35,473][model8_pretrain.py][INFO] Epoch:[0/2](466100/4588595) loss:2.627 lr:0.0000100 epoch_Time:26152.0min: [2024-01-04 18:56:35,473][model8_pretrain.py][INFO] Epoch:[0/2](466100/4588595) loss:2.266 lr:0.0000100 epoch_Time:26152.0min: [2024-01-04 18:56:35,474][model8_pretrain.py][INFO] Epoch:[0/2](466100/4588595) loss:2.957 lr:0.0000100 epoch_Time:26152.0min: [2024-01-04 18:57:12,401][model8_pretrain.py][INFO] Epoch:[0/2](466200/4588595) loss:2.482 lr:0.0000100 epoch_Time:26151.0min: [2024-01-04 18:57:12,401][model8_pretrain.py][INFO] Epoch:[0/2](466200/4588595) loss:3.200 lr:0.0000100 epoch_Time:26151.0min: [2024-01-04 18:57:12,401][model8_pretrain.py][INFO] Epoch:[0/2](466200/4588595) loss:3.093 lr:0.0000100 epoch_Time:26151.0min: [2024-01-04 18:57:12,401][model8_pretrain.py][INFO] Epoch:[0/2](466200/4588595) loss:2.672 lr:0.0000100 epoch_Time:26151.0min: [2024-01-04 18:57:12,401][model8_pretrain.py][INFO] Epoch:[0/2](466200/4588595) loss:2.284 lr:0.0000100 epoch_Time:26151.0min: [2024-01-04 18:57:12,401][model8_pretrain.py][INFO] Epoch:[0/2](466200/4588595) loss:3.225 lr:0.0000100 epoch_Time:26151.0min: [2024-01-04 18:57:12,401][model8_pretrain.py][INFO] Epoch:[0/2](466200/4588595) loss:2.825 lr:0.0000100 epoch_Time:26151.0min: [2024-01-04 18:57:12,401][model8_pretrain.py][INFO] Epoch:[0/2](466200/4588595) loss:3.166 lr:0.0000100 epoch_Time:26151.0min: [2024-01-04 
18:57:49,329][model8_pretrain.py][INFO] Epoch:[0/2](466300/4588595) loss:2.812 lr:0.0000100 epoch_Time:26150.0min: [2024-01-04 18:57:49,329][model8_pretrain.py][INFO] Epoch:[0/2](466300/4588595) loss:2.946 lr:0.0000100 epoch_Time:26150.0min: [2024-01-04 18:57:49,329][model8_pretrain.py][INFO] Epoch:[0/2](466300/4588595) loss:2.997 lr:0.0000100 epoch_Time:26150.0min: [2024-01-04 18:57:49,329][model8_pretrain.py][INFO] Epoch:[0/2](466300/4588595) loss:3.296 lr:0.0000100 epoch_Time:26150.0min: [2024-01-04 18:57:49,330][model8_pretrain.py][INFO] Epoch:[0/2](466300/4588595) loss:2.426 lr:0.0000100 epoch_Time:26150.0min: [2024-01-04 18:57:49,330][model8_pretrain.py][INFO] Epoch:[0/2](466300/4588595) loss:2.597 lr:0.0000100 epoch_Time:26150.0min: [2024-01-04 18:57:49,330][model8_pretrain.py][INFO] Epoch:[0/2](466300/4588595) loss:2.996 lr:0.0000100 epoch_Time:26150.0min: [2024-01-04 18:57:49,330][model8_pretrain.py][INFO] Epoch:[0/2](466300/4588595) loss:2.862 lr:0.0000100 epoch_Time:26150.0min: [2024-01-04 18:58:26,262][model8_pretrain.py][INFO] Epoch:[0/2](466400/4588595) loss:3.096 lr:0.0000100 epoch_Time:26150.0min: [2024-01-04 18:58:26,262][model8_pretrain.py][INFO] Epoch:[0/2](466400/4588595) loss:3.072 lr:0.0000100 epoch_Time:26150.0min: [2024-01-04 18:58:26,262][model8_pretrain.py][INFO] Epoch:[0/2](466400/4588595) loss:3.247 lr:0.0000100 epoch_Time:26150.0min: [2024-01-04 18:58:26,262][model8_pretrain.py][INFO] Epoch:[0/2](466400/4588595) loss:2.636 lr:0.0000100 epoch_Time:26150.0min: [2024-01-04 18:58:26,262][model8_pretrain.py][INFO] Epoch:[0/2](466400/4588595) loss:2.999 lr:0.0000100 epoch_Time:26150.0min: [2024-01-04 18:58:26,262][model8_pretrain.py][INFO] Epoch:[0/2](466400/4588595) loss:2.328 lr:0.0000100 epoch_Time:26150.0min: [2024-01-04 18:58:26,262][model8_pretrain.py][INFO] Epoch:[0/2](466400/4588595) loss:2.712 lr:0.0000100 epoch_Time:26150.0min: [2024-01-04 18:58:26,262][model8_pretrain.py][INFO] Epoch:[0/2](466400/4588595) loss:3.116 lr:0.0000100 
epoch_Time:26150.0min: [2024-01-04 18:59:03,193][model8_pretrain.py][INFO] Epoch:[0/2](466500/4588595) loss:3.015 lr:0.0000100 epoch_Time:26148.0min: [2024-01-04 18:59:03,193][model8_pretrain.py][INFO] Epoch:[0/2](466500/4588595) loss:2.631 lr:0.0000100 epoch_Time:26148.0min: [2024-01-04 18:59:03,193][model8_pretrain.py][INFO] Epoch:[0/2](466500/4588595) loss:2.742 lr:0.0000100 epoch_Time:26148.0min: [2024-01-04 18:59:03,193][model8_pretrain.py][INFO] Epoch:[0/2](466500/4588595) loss:2.999 lr:0.0000100 epoch_Time:26148.0min: [2024-01-04 18:59:03,193][model8_pretrain.py][INFO] Epoch:[0/2](466500/4588595) loss:3.069 lr:0.0000100 epoch_Time:26148.0min: [2024-01-04 18:59:03,193][model8_pretrain.py][INFO] Epoch:[0/2](466500/4588595) loss:3.057 lr:0.0000100 epoch_Time:26148.0min: [2024-01-04 18:59:03,193][model8_pretrain.py][INFO] Epoch:[0/2](466500/4588595) loss:3.226 lr:0.0000100 epoch_Time:26148.0min: [2024-01-04 18:59:03,193][model8_pretrain.py][INFO] Epoch:[0/2](466500/4588595) loss:3.018 lr:0.0000100 epoch_Time:26148.0min: [2024-01-04 18:59:40,119][model8_pretrain.py][INFO] Epoch:[0/2](466600/4588595) loss:3.047 lr:0.0000100 epoch_Time:26148.0min: [2024-01-04 18:59:40,119][model8_pretrain.py][INFO] Epoch:[0/2](466600/4588595) loss:2.909 lr:0.0000100 epoch_Time:26148.0min: [2024-01-04 18:59:40,119][model8_pretrain.py][INFO] Epoch:[0/2](466600/4588595) loss:3.080 lr:0.0000100 epoch_Time:26148.0min: [2024-01-04 18:59:40,119][model8_pretrain.py][INFO] Epoch:[0/2](466600/4588595) loss:3.234 lr:0.0000100 epoch_Time:26148.0min: [2024-01-04 18:59:40,119][model8_pretrain.py][INFO] Epoch:[0/2](466600/4588595) loss:2.574 lr:0.0000100 epoch_Time:26148.0min: [2024-01-04 18:59:40,119][model8_pretrain.py][INFO] Epoch:[0/2](466600/4588595) loss:2.257 lr:0.0000100 epoch_Time:26148.0min: [2024-01-04 18:59:40,120][model8_pretrain.py][INFO] Epoch:[0/2](466600/4588595) loss:2.924 lr:0.0000100 epoch_Time:26148.0min: [2024-01-04 18:59:40,120][model8_pretrain.py][INFO] 
Epoch:[0/2](466600/4588595) loss:2.552 lr:0.0000100 epoch_Time:26148.0min: [2024-01-04 19:00:27,485][model8_pretrain.py][INFO] Epoch:[0/2](466700/4588595) loss:2.982 lr:0.0000100 epoch_Time:26149.0min: [2024-01-04 19:00:27,485][model8_pretrain.py][INFO] Epoch:[0/2](466700/4588595) loss:2.996 lr:0.0000100 epoch_Time:26149.0min: [2024-01-04 19:00:27,485][model8_pretrain.py][INFO] Epoch:[0/2](466700/4588595) loss:2.991 lr:0.0000100 epoch_Time:26149.0min: [2024-01-04 19:00:27,485][model8_pretrain.py][INFO] Epoch:[0/2](466700/4588595) loss:2.789 lr:0.0000100 epoch_Time:26149.0min: [2024-01-04 19:00:27,485][model8_pretrain.py][INFO] Epoch:[0/2](466700/4588595) loss:2.049 lr:0.0000100 epoch_Time:26149.0min: [2024-01-04 19:00:27,485][model8_pretrain.py][INFO] Epoch:[0/2](466700/4588595) loss:3.027 lr:0.0000100 epoch_Time:26149.0min: [2024-01-04 19:00:27,485][model8_pretrain.py][INFO] Epoch:[0/2](466700/4588595) loss:2.452 lr:0.0000100 epoch_Time:26149.0min: [2024-01-04 19:00:27,485][model8_pretrain.py][INFO] Epoch:[0/2](466700/4588595) loss:3.021 lr:0.0000100 epoch_Time:26149.0min: [2024-01-04 19:01:04,379][model8_pretrain.py][INFO] Epoch:[0/2](466800/4588595) loss:2.576 lr:0.0000100 epoch_Time:26148.0min: [2024-01-04 19:01:04,379][model8_pretrain.py][INFO] Epoch:[0/2](466800/4588595) loss:2.927 lr:0.0000100 epoch_Time:26148.0min: [2024-01-04 19:01:04,379][model8_pretrain.py][INFO] Epoch:[0/2](466800/4588595) loss:1.854 lr:0.0000100 epoch_Time:26148.0min: [2024-01-04 19:01:04,379][model8_pretrain.py][INFO] Epoch:[0/2](466800/4588595) loss:2.662 lr:0.0000100 epoch_Time:26148.0min: [2024-01-04 19:01:04,379][model8_pretrain.py][INFO] Epoch:[0/2](466800/4588595) loss:2.910 lr:0.0000100 epoch_Time:26148.0min: [2024-01-04 19:01:04,379][model8_pretrain.py][INFO] Epoch:[0/2](466800/4588595) loss:3.093 lr:0.0000100 epoch_Time:26148.0min: [2024-01-04 19:01:04,379][model8_pretrain.py][INFO] Epoch:[0/2](466800/4588595) loss:3.285 lr:0.0000100 epoch_Time:26148.0min: [2024-01-04 
19:01:04,379][model8_pretrain.py][INFO] Epoch:[0/2](466800/4588595) loss:3.003 lr:0.0000100 epoch_Time:26148.0min: [2024-01-04 19:01:41,304][model8_pretrain.py][INFO] Epoch:[0/2](466900/4588595) loss:3.094 lr:0.0000100 epoch_Time:26147.0min: [2024-01-04 19:01:41,304][model8_pretrain.py][INFO] Epoch:[0/2](466900/4588595) loss:2.656 lr:0.0000100 epoch_Time:26147.0min: [2024-01-04 19:01:41,304][model8_pretrain.py][INFO] Epoch:[0/2](466900/4588595) loss:2.969 lr:0.0000100 epoch_Time:26147.0min: [2024-01-04 19:01:41,304][model8_pretrain.py][INFO] Epoch:[0/2](466900/4588595) loss:3.063 lr:0.0000100 epoch_Time:26147.0min: [2024-01-04 19:01:41,304][model8_pretrain.py][INFO] Epoch:[0/2](466900/4588595) loss:2.582 lr:0.0000100 epoch_Time:26147.0min: [2024-01-04 19:01:41,305][model8_pretrain.py][INFO] Epoch:[0/2](466900/4588595) loss:2.524 lr:0.0000100 epoch_Time:26147.0min: [2024-01-04 19:01:41,305][model8_pretrain.py][INFO] Epoch:[0/2](466900/4588595) loss:2.224 lr:0.0000100 epoch_Time:26147.0min: [2024-01-04 19:01:41,305][model8_pretrain.py][INFO] Epoch:[0/2](466900/4588595) loss:2.861 lr:0.0000100 epoch_Time:26147.0min: [2024-01-04 19:02:18,232][model8_pretrain.py][INFO] Epoch:[0/2](467000/4588595) loss:2.677 lr:0.0000100 epoch_Time:26146.0min: [2024-01-04 19:02:18,232][model8_pretrain.py][INFO] Epoch:[0/2](467000/4588595) loss:2.590 lr:0.0000100 epoch_Time:26146.0min: [2024-01-04 19:02:18,232][model8_pretrain.py][INFO] Epoch:[0/2](467000/4588595) loss:2.194 lr:0.0000100 epoch_Time:26146.0min: [2024-01-04 19:02:18,232][model8_pretrain.py][INFO] Epoch:[0/2](467000/4588595) loss:3.430 lr:0.0000100 epoch_Time:26146.0min: [2024-01-04 19:02:18,232][model8_pretrain.py][INFO] Epoch:[0/2](467000/4588595) loss:2.689 lr:0.0000100 epoch_Time:26146.0min: [2024-01-04 19:02:18,232][model8_pretrain.py][INFO] Epoch:[0/2](467000/4588595) loss:2.851 lr:0.0000100 epoch_Time:26146.0min: [2024-01-04 19:02:18,232][model8_pretrain.py][INFO] Epoch:[0/2](467000/4588595) loss:2.909 lr:0.0000100 
epoch_Time:26146.0min: [2024-01-04 19:02:18,232][model8_pretrain.py][INFO] Epoch:[0/2](467000/4588595) loss:2.716 lr:0.0000100 epoch_Time:26146.0min: [2024-01-04 19:02:55,166][model8_pretrain.py][INFO] Epoch:[0/2](467100/4588595) loss:2.911 lr:0.0000100 epoch_Time:26145.0min: [2024-01-04 19:02:55,166][model8_pretrain.py][INFO] Epoch:[0/2](467100/4588595) loss:3.188 lr:0.0000100 epoch_Time:26145.0min: [2024-01-04 19:02:55,166][model8_pretrain.py][INFO] Epoch:[0/2](467100/4588595) loss:3.161 lr:0.0000100 epoch_Time:26145.0min: [2024-01-04 19:02:55,166][model8_pretrain.py][INFO] Epoch:[0/2](467100/4588595) loss:3.415 lr:0.0000100 epoch_Time:26145.0min: [2024-01-04 19:02:55,166][model8_pretrain.py][INFO] Epoch:[0/2](467100/4588595) loss:2.630 lr:0.0000100 epoch_Time:26145.0min: [2024-01-04 19:02:55,166][model8_pretrain.py][INFO] Epoch:[0/2](467100/4588595) loss:2.818 lr:0.0000100 epoch_Time:26145.0min: [2024-01-04 19:02:55,166][model8_pretrain.py][INFO] Epoch:[0/2](467100/4588595) loss:3.014 lr:0.0000100 epoch_Time:26145.0min: [2024-01-04 19:02:55,166][model8_pretrain.py][INFO] Epoch:[0/2](467100/4588595) loss:3.030 lr:0.0000100 epoch_Time:26145.0min: [2024-01-04 19:03:32,120][model8_pretrain.py][INFO] Epoch:[0/2](467200/4588595) loss:3.316 lr:0.0000100 epoch_Time:26145.0min: [2024-01-04 19:03:32,120][model8_pretrain.py][INFO] Epoch:[0/2](467200/4588595) loss:3.108 lr:0.0000100 epoch_Time:26145.0min: [2024-01-04 19:03:32,120][model8_pretrain.py][INFO] Epoch:[0/2](467200/4588595) loss:2.586 lr:0.0000100 epoch_Time:26145.0min: [2024-01-04 19:03:32,120][model8_pretrain.py][INFO] Epoch:[0/2](467200/4588595) loss:3.133 lr:0.0000100 epoch_Time:26145.0min: [2024-01-04 19:03:32,120][model8_pretrain.py][INFO] Epoch:[0/2](467200/4588595) loss:3.059 lr:0.0000100 epoch_Time:26145.0min: [2024-01-04 19:03:32,120][model8_pretrain.py][INFO] Epoch:[0/2](467200/4588595) loss:2.780 lr:0.0000100 epoch_Time:26145.0min: [2024-01-04 19:03:32,120][model8_pretrain.py][INFO] 
Epoch:[0/2](467200/4588595) loss:2.643 lr:0.0000100 epoch_Time:26145.0min: [2024-01-04 19:03:32,120][model8_pretrain.py][INFO] Epoch:[0/2](467200/4588595) loss:3.206 lr:0.0000100 epoch_Time:26145.0min: [2024-01-04 19:04:09,048][model8_pretrain.py][INFO] Epoch:[0/2](467300/4588595) loss:2.487 lr:0.0000100 epoch_Time:26144.0min: [2024-01-04 19:04:09,048][model8_pretrain.py][INFO] Epoch:[0/2](467300/4588595) loss:2.760 lr:0.0000100 epoch_Time:26144.0min: [2024-01-04 19:04:09,048][model8_pretrain.py][INFO] Epoch:[0/2](467300/4588595) loss:2.681 lr:0.0000100 epoch_Time:26144.0min: [2024-01-04 19:04:09,048][model8_pretrain.py][INFO] Epoch:[0/2](467300/4588595) loss:2.415 lr:0.0000100 epoch_Time:26144.0min: [2024-01-04 19:04:09,048][model8_pretrain.py][INFO] Epoch:[0/2](467300/4588595) loss:2.748 lr:0.0000100 epoch_Time:26144.0min: [2024-01-04 19:04:09,048][model8_pretrain.py][INFO] Epoch:[0/2](467300/4588595) loss:3.135 lr:0.0000100 epoch_Time:26144.0min: [2024-01-04 19:04:09,048][model8_pretrain.py][INFO] Epoch:[0/2](467300/4588595) loss:2.534 lr:0.0000100 epoch_Time:26144.0min: [2024-01-04 19:04:09,048][model8_pretrain.py][INFO] Epoch:[0/2](467300/4588595) loss:3.509 lr:0.0000100 epoch_Time:26144.0min: [2024-01-04 19:04:45,971][model8_pretrain.py][INFO] Epoch:[0/2](467400/4588595) loss:3.398 lr:0.0000100 epoch_Time:26143.0min: [2024-01-04 19:04:45,971][model8_pretrain.py][INFO] Epoch:[0/2](467400/4588595) loss:2.945 lr:0.0000100 epoch_Time:26143.0min: [2024-01-04 19:04:45,971][model8_pretrain.py][INFO] Epoch:[0/2](467400/4588595) loss:2.515 lr:0.0000100 epoch_Time:26143.0min: [2024-01-04 19:04:45,971][model8_pretrain.py][INFO] Epoch:[0/2](467400/4588595) loss:3.059 lr:0.0000100 epoch_Time:26143.0min: [2024-01-04 19:04:45,971][model8_pretrain.py][INFO] Epoch:[0/2](467400/4588595) loss:1.999 lr:0.0000100 epoch_Time:26143.0min: [2024-01-04 19:04:45,971][model8_pretrain.py][INFO] Epoch:[0/2](467400/4588595) loss:2.676 lr:0.0000100 epoch_Time:26143.0min: [2024-01-04 
19:04:45,971][model8_pretrain.py][INFO] Epoch:[0/2](467400/4588595) loss:3.217 lr:0.0000100 epoch_Time:26143.0min: [2024-01-04 19:04:45,971][model8_pretrain.py][INFO] Epoch:[0/2](467400/4588595) loss:1.768 lr:0.0000100 epoch_Time:26143.0min: [2024-01-04 19:05:33,231][model8_pretrain.py][INFO] Epoch:[0/2](467500/4588595) loss:2.437 lr:0.0000100 epoch_Time:26144.0min: [2024-01-04 19:05:33,231][model8_pretrain.py][INFO] Epoch:[0/2](467500/4588595) loss:2.988 lr:0.0000100 epoch_Time:26144.0min: [2024-01-04 19:05:33,231][model8_pretrain.py][INFO] Epoch:[0/2](467500/4588595) loss:2.996 lr:0.0000100 epoch_Time:26144.0min: [2024-01-04 19:05:33,231][model8_pretrain.py][INFO] Epoch:[0/2](467500/4588595) loss:2.992 lr:0.0000100 epoch_Time:26144.0min: [2024-01-04 19:05:33,231][model8_pretrain.py][INFO] Epoch:[0/2](467500/4588595) loss:2.501 lr:0.0000100 epoch_Time:26144.0min: [2024-01-04 19:05:33,231][model8_pretrain.py][INFO] Epoch:[0/2](467500/4588595) loss:3.086 lr:0.0000100 epoch_Time:26144.0min: [2024-01-04 19:05:33,231][model8_pretrain.py][INFO] Epoch:[0/2](467500/4588595) loss:2.866 lr:0.0000100 epoch_Time:26144.0min: [2024-01-04 19:05:33,231][model8_pretrain.py][INFO] Epoch:[0/2](467500/4588595) loss:2.913 lr:0.0000100 epoch_Time:26144.0min: [2024-01-04 19:06:10,147][model8_pretrain.py][INFO] Epoch:[0/2](467600/4588595) loss:3.229 lr:0.0000100 epoch_Time:26143.0min: [2024-01-04 19:06:10,147][model8_pretrain.py][INFO] Epoch:[0/2](467600/4588595) loss:3.168 lr:0.0000100 epoch_Time:26143.0min: [2024-01-04 19:06:10,147][model8_pretrain.py][INFO] Epoch:[0/2](467600/4588595) loss:2.671 lr:0.0000100 epoch_Time:26143.0min: [2024-01-04 19:06:10,147][model8_pretrain.py][INFO] Epoch:[0/2](467600/4588595) loss:2.988 lr:0.0000100 epoch_Time:26143.0min: [2024-01-04 19:06:10,147][model8_pretrain.py][INFO] Epoch:[0/2](467600/4588595) loss:2.986 lr:0.0000100 epoch_Time:26143.0min: [2024-01-04 19:06:10,147][model8_pretrain.py][INFO] Epoch:[0/2](467600/4588595) loss:3.137 lr:0.0000100 
epoch_Time:26143.0min: [2024-01-04 19:06:10,147][model8_pretrain.py][INFO] Epoch:[0/2](467600/4588595) loss:2.150 lr:0.0000100 epoch_Time:26143.0min: [2024-01-04 19:06:10,147][model8_pretrain.py][INFO] Epoch:[0/2](467600/4588595) loss:3.231 lr:0.0000100 epoch_Time:26143.0min: [2024-01-04 19:06:47,079][model8_pretrain.py][INFO] Epoch:[0/2](467700/4588595) loss:2.704 lr:0.0000100 epoch_Time:26143.0min: [2024-01-04 19:06:47,079][model8_pretrain.py][INFO] Epoch:[0/2](467700/4588595) loss:2.834 lr:0.0000100 epoch_Time:26143.0min: [2024-01-04 19:06:47,079][model8_pretrain.py][INFO] Epoch:[0/2](467700/4588595) loss:2.309 lr:0.0000100 epoch_Time:26143.0min: [2024-01-04 19:06:47,079][model8_pretrain.py][INFO] Epoch:[0/2](467700/4588595) loss:2.894 lr:0.0000100 epoch_Time:26143.0min: [2024-01-04 19:06:47,079][model8_pretrain.py][INFO] Epoch:[0/2](467700/4588595) loss:3.120 lr:0.0000100 epoch_Time:26143.0min: [2024-01-04 19:06:47,079][model8_pretrain.py][INFO] Epoch:[0/2](467700/4588595) loss:3.091 lr:0.0000100 epoch_Time:26143.0min: [2024-01-04 19:06:47,079][model8_pretrain.py][INFO] Epoch:[0/2](467700/4588595) loss:2.868 lr:0.0000100 epoch_Time:26143.0min: [2024-01-04 19:06:47,079][model8_pretrain.py][INFO] Epoch:[0/2](467700/4588595) loss:3.000 lr:0.0000100 epoch_Time:26143.0min: [2024-01-04 19:07:24,014][model8_pretrain.py][INFO] Epoch:[0/2](467800/4588595) loss:3.154 lr:0.0000100 epoch_Time:26141.0min: [2024-01-04 19:07:24,014][model8_pretrain.py][INFO] Epoch:[0/2](467800/4588595) loss:3.307 lr:0.0000100 epoch_Time:26141.0min: [2024-01-04 19:07:24,014][model8_pretrain.py][INFO] Epoch:[0/2](467800/4588595) loss:3.039 lr:0.0000100 epoch_Time:26141.0min: [2024-01-04 19:07:24,014][model8_pretrain.py][INFO] Epoch:[0/2](467800/4588595) loss:2.595 lr:0.0000100 epoch_Time:26141.0min: [2024-01-04 19:07:24,014][model8_pretrain.py][INFO] Epoch:[0/2](467800/4588595) loss:3.113 lr:0.0000100 epoch_Time:26141.0min: [2024-01-04 19:07:24,015][model8_pretrain.py][INFO] 
Epoch:[0/2](467800/4588595) loss:3.023 lr:0.0000100 epoch_Time:26141.0min: [2024-01-04 19:07:24,015][model8_pretrain.py][INFO] Epoch:[0/2](467800/4588595) loss:2.713 lr:0.0000100 epoch_Time:26141.0min: [2024-01-04 19:07:24,015][model8_pretrain.py][INFO] Epoch:[0/2](467800/4588595) loss:2.643 lr:0.0000100 epoch_Time:26141.0min: [2024-01-04 19:08:00,950][model8_pretrain.py][INFO] Epoch:[0/2](467900/4588595) loss:2.948 lr:0.0000100 epoch_Time:26140.0min: [2024-01-04 19:08:00,950][model8_pretrain.py][INFO] Epoch:[0/2](467900/4588595) loss:3.360 lr:0.0000100 epoch_Time:26140.0min: [2024-01-04 19:08:00,950][model8_pretrain.py][INFO] Epoch:[0/2](467900/4588595) loss:2.746 lr:0.0000100 epoch_Time:26140.0min: [2024-01-04 19:08:00,950][model8_pretrain.py][INFO] Epoch:[0/2](467900/4588595) loss:3.104 lr:0.0000100 epoch_Time:26140.0min: [2024-01-04 19:08:00,950][model8_pretrain.py][INFO] Epoch:[0/2](467900/4588595) loss:2.365 lr:0.0000100 epoch_Time:26140.0min: [2024-01-04 19:08:00,950][model8_pretrain.py][INFO] Epoch:[0/2](467900/4588595) loss:2.896 lr:0.0000100 epoch_Time:26140.0min: [2024-01-04 19:08:00,950][model8_pretrain.py][INFO] Epoch:[0/2](467900/4588595) loss:3.236 lr:0.0000100 epoch_Time:26140.0min: [2024-01-04 19:08:00,950][model8_pretrain.py][INFO] Epoch:[0/2](467900/4588595) loss:2.819 lr:0.0000100 epoch_Time:26140.0min: [2024-01-04 19:08:37,926][model8_pretrain.py][INFO] Epoch:[0/2](468000/4588595) loss:3.099 lr:0.0000100 epoch_Time:26140.0min: [2024-01-04 19:08:37,926][model8_pretrain.py][INFO] Epoch:[0/2](468000/4588595) loss:2.801 lr:0.0000100 epoch_Time:26140.0min: [2024-01-04 19:08:37,926][model8_pretrain.py][INFO] Epoch:[0/2](468000/4588595) loss:2.820 lr:0.0000100 epoch_Time:26140.0min: [2024-01-04 19:08:37,926][model8_pretrain.py][INFO] Epoch:[0/2](468000/4588595) loss:3.099 lr:0.0000100 epoch_Time:26140.0min: [2024-01-04 19:08:37,926][model8_pretrain.py][INFO] Epoch:[0/2](468000/4588595) loss:2.757 lr:0.0000100 epoch_Time:26140.0min: [2024-01-04 
19:08:37,926][model8_pretrain.py][INFO] Epoch:[0/2](468000/4588595) loss:3.188 lr:0.0000100 epoch_Time:26140.0min: [2024-01-04 19:08:37,926][model8_pretrain.py][INFO] Epoch:[0/2](468000/4588595) loss:2.683 lr:0.0000100 epoch_Time:26140.0min: [2024-01-04 19:08:37,927][model8_pretrain.py][INFO] Epoch:[0/2](468000/4588595) loss:2.686 lr:0.0000100 epoch_Time:26140.0min: [2024-01-04 19:09:14,864][model8_pretrain.py][INFO] Epoch:[0/2](468100/4588595) loss:2.948 lr:0.0000100 epoch_Time:26139.0min: [2024-01-04 19:09:14,864][model8_pretrain.py][INFO] Epoch:[0/2](468100/4588595) loss:2.809 lr:0.0000100 epoch_Time:26139.0min: [2024-01-04 19:09:14,864][model8_pretrain.py][INFO] Epoch:[0/2](468100/4588595) loss:3.144 lr:0.0000100 epoch_Time:26139.0min: [2024-01-04 19:09:14,864][model8_pretrain.py][INFO] Epoch:[0/2](468100/4588595) loss:2.834 lr:0.0000100 epoch_Time:26139.0min: [2024-01-04 19:09:14,864][model8_pretrain.py][INFO] Epoch:[0/2](468100/4588595) loss:3.181 lr:0.0000100 epoch_Time:26139.0min: [2024-01-04 19:09:14,864][model8_pretrain.py][INFO] Epoch:[0/2](468100/4588595) loss:3.211 lr:0.0000100 epoch_Time:26139.0min: [2024-01-04 19:09:14,864][model8_pretrain.py][INFO] Epoch:[0/2](468100/4588595) loss:2.826 lr:0.0000100 epoch_Time:26139.0min: [2024-01-04 19:09:14,865][model8_pretrain.py][INFO] Epoch:[0/2](468100/4588595) loss:2.842 lr:0.0000100 epoch_Time:26139.0min: [2024-01-04 19:09:51,805][model8_pretrain.py][INFO] Epoch:[0/2](468200/4588595) loss:2.906 lr:0.0000100 epoch_Time:26138.0min: [2024-01-04 19:09:51,805][model8_pretrain.py][INFO] Epoch:[0/2](468200/4588595) loss:2.498 lr:0.0000100 epoch_Time:26138.0min: [2024-01-04 19:09:51,805][model8_pretrain.py][INFO] Epoch:[0/2](468200/4588595) loss:2.740 lr:0.0000100 epoch_Time:26138.0min: [2024-01-04 19:09:51,805][model8_pretrain.py][INFO] Epoch:[0/2](468200/4588595) loss:3.439 lr:0.0000100 epoch_Time:26138.0min: [2024-01-04 19:09:51,805][model8_pretrain.py][INFO] Epoch:[0/2](468200/4588595) loss:2.771 lr:0.0000100 
epoch_Time:26138.0min: [2024-01-04 19:09:51,805][model8_pretrain.py][INFO] Epoch:[0/2](468200/4588595) loss:2.740 lr:0.0000100 epoch_Time:26138.0min: [2024-01-04 19:09:51,805][model8_pretrain.py][INFO] Epoch:[0/2](468200/4588595) loss:3.238 lr:0.0000100 epoch_Time:26138.0min: [2024-01-04 19:09:51,805][model8_pretrain.py][INFO] Epoch:[0/2](468200/4588595) loss:3.176 lr:0.0000100 epoch_Time:26138.0min: [2024-01-04 19:10:39,077][model8_pretrain.py][INFO] Epoch:[0/2](468300/4588595) loss:3.139 lr:0.0000100 epoch_Time:26139.0min: [2024-01-04 19:10:39,077][model8_pretrain.py][INFO] Epoch:[0/2](468300/4588595) loss:2.512 lr:0.0000100 epoch_Time:26139.0min: [2024-01-04 19:10:39,077][model8_pretrain.py][INFO] Epoch:[0/2](468300/4588595) loss:2.900 lr:0.0000100 epoch_Time:26139.0min: [2024-01-04 19:10:39,077][model8_pretrain.py][INFO] Epoch:[0/2](468300/4588595) loss:2.439 lr:0.0000100 epoch_Time:26139.0min: [2024-01-04 19:10:39,077][model8_pretrain.py][INFO] Epoch:[0/2](468300/4588595) loss:3.106 lr:0.0000100 epoch_Time:26139.0min: [2024-01-04 19:10:39,077][model8_pretrain.py][INFO] Epoch:[0/2](468300/4588595) loss:2.473 lr:0.0000100 epoch_Time:26139.0min: [2024-01-04 19:10:39,077][model8_pretrain.py][INFO] Epoch:[0/2](468300/4588595) loss:3.173 lr:0.0000100 epoch_Time:26139.0min: [2024-01-04 19:10:39,077][model8_pretrain.py][INFO] Epoch:[0/2](468300/4588595) loss:2.916 lr:0.0000100 epoch_Time:26139.0min: [2024-01-04 19:11:16,032][model8_pretrain.py][INFO] Epoch:[0/2](468400/4588595) loss:2.934 lr:0.0000100 epoch_Time:26138.0min: [2024-01-04 19:11:16,032][model8_pretrain.py][INFO] Epoch:[0/2](468400/4588595) loss:2.768 lr:0.0000100 epoch_Time:26138.0min: [2024-01-04 19:11:16,032][model8_pretrain.py][INFO] Epoch:[0/2](468400/4588595) loss:2.809 lr:0.0000100 epoch_Time:26138.0min: [2024-01-04 19:11:16,032][model8_pretrain.py][INFO] Epoch:[0/2](468400/4588595) loss:2.859 lr:0.0000100 epoch_Time:26138.0min: [2024-01-04 19:11:16,032][model8_pretrain.py][INFO] 
Epoch:[0/2](468400/4588595) loss:2.521 lr:0.0000100 epoch_Time:26138.0min: [2024-01-04 19:11:16,032][model8_pretrain.py][INFO] Epoch:[0/2](468400/4588595) loss:3.425 lr:0.0000100 epoch_Time:26138.0min: [2024-01-04 19:11:16,032][model8_pretrain.py][INFO] Epoch:[0/2](468400/4588595) loss:2.999 lr:0.0000100 epoch_Time:26138.0min: [2024-01-04 19:11:16,032][model8_pretrain.py][INFO] Epoch:[0/2](468400/4588595) loss:2.882 lr:0.0000100 epoch_Time:26138.0min: [2024-01-04 19:11:52,966][model8_pretrain.py][INFO] Epoch:[0/2](468500/4588595) loss:3.047 lr:0.0000100 epoch_Time:26137.0min: [2024-01-04 19:11:52,966][model8_pretrain.py][INFO] Epoch:[0/2](468500/4588595) loss:2.952 lr:0.0000100 epoch_Time:26137.0min: [2024-01-04 19:11:52,966][model8_pretrain.py][INFO] Epoch:[0/2](468500/4588595) loss:2.804 lr:0.0000100 epoch_Time:26137.0min: [2024-01-04 19:11:52,966][model8_pretrain.py][INFO] Epoch:[0/2](468500/4588595) loss:2.801 lr:0.0000100 epoch_Time:26137.0min: [2024-01-04 19:11:52,967][model8_pretrain.py][INFO] Epoch:[0/2](468500/4588595) loss:3.053 lr:0.0000100 epoch_Time:26137.0min: [2024-01-04 19:11:52,967][model8_pretrain.py][INFO] Epoch:[0/2](468500/4588595) loss:2.576 lr:0.0000100 epoch_Time:26137.0min: [2024-01-04 19:11:52,967][model8_pretrain.py][INFO] Epoch:[0/2](468500/4588595) loss:2.926 lr:0.0000100 epoch_Time:26137.0min: [2024-01-04 19:11:52,967][model8_pretrain.py][INFO] Epoch:[0/2](468500/4588595) loss:3.074 lr:0.0000100 epoch_Time:26137.0min: [2024-01-04 19:12:29,904][model8_pretrain.py][INFO] Epoch:[0/2](468600/4588595) loss:2.899 lr:0.0000100 epoch_Time:26137.0min: [2024-01-04 19:12:29,904][model8_pretrain.py][INFO] Epoch:[0/2](468600/4588595) loss:3.029 lr:0.0000100 epoch_Time:26137.0min: [2024-01-04 19:12:29,904][model8_pretrain.py][INFO] Epoch:[0/2](468600/4588595) loss:2.906 lr:0.0000100 epoch_Time:26137.0min: [2024-01-04 19:12:29,904][model8_pretrain.py][INFO] Epoch:[0/2](468600/4588595) loss:2.783 lr:0.0000100 epoch_Time:26137.0min: [2024-01-04 
19:12:29,904][model8_pretrain.py][INFO] Epoch:[0/2](468600/4588595) loss:2.267 lr:0.0000100 epoch_Time:26137.0min: [2024-01-04 19:12:29,904][model8_pretrain.py][INFO] Epoch:[0/2](468600/4588595) loss:2.421 lr:0.0000100 epoch_Time:26137.0min: [2024-01-04 19:12:29,904][model8_pretrain.py][INFO] Epoch:[0/2](468600/4588595) loss:2.914 lr:0.0000100 epoch_Time:26137.0min: [2024-01-04 19:12:29,904][model8_pretrain.py][INFO] Epoch:[0/2](468600/4588595) loss:2.668 lr:0.0000100 epoch_Time:26137.0min: [2024-01-04 19:13:06,840][model8_pretrain.py][INFO] Epoch:[0/2](468700/4588595) loss:2.180 lr:0.0000100 epoch_Time:26135.0min: [2024-01-04 19:13:06,840][model8_pretrain.py][INFO] Epoch:[0/2](468700/4588595) loss:2.132 lr:0.0000100 epoch_Time:26135.0min: [2024-01-04 19:13:06,840][model8_pretrain.py][INFO] Epoch:[0/2](468700/4588595) loss:2.073 lr:0.0000100 epoch_Time:26135.0min: [2024-01-04 19:13:06,840][model8_pretrain.py][INFO] Epoch:[0/2](468700/4588595) loss:2.803 lr:0.0000100 epoch_Time:26135.0min: [2024-01-04 19:13:06,840][model8_pretrain.py][INFO] Epoch:[0/2](468700/4588595) loss:2.794 lr:0.0000100 epoch_Time:26135.0min: [2024-01-04 19:13:06,840][model8_pretrain.py][INFO] Epoch:[0/2](468700/4588595) loss:3.199 lr:0.0000100 epoch_Time:26135.0min: [2024-01-04 19:13:06,840][model8_pretrain.py][INFO] Epoch:[0/2](468700/4588595) loss:3.020 lr:0.0000100 epoch_Time:26135.0min: [2024-01-04 19:13:06,840][model8_pretrain.py][INFO] Epoch:[0/2](468700/4588595) loss:3.229 lr:0.0000100 epoch_Time:26135.0min: [2024-01-04 19:13:43,775][model8_pretrain.py][INFO] Epoch:[0/2](468800/4588595) loss:2.742 lr:0.0000100 epoch_Time:26135.0min: [2024-01-04 19:13:43,776][model8_pretrain.py][INFO] Epoch:[0/2](468800/4588595) loss:2.770 lr:0.0000100 epoch_Time:26135.0min: [2024-01-04 19:13:43,776][model8_pretrain.py][INFO] Epoch:[0/2](468800/4588595) loss:2.826 lr:0.0000100 epoch_Time:26135.0min: [2024-01-04 19:13:43,776][model8_pretrain.py][INFO] Epoch:[0/2](468800/4588595) loss:2.897 lr:0.0000100 
epoch_Time:26135.0min: [2024-01-04 19:13:43,776][model8_pretrain.py][INFO] Epoch:[0/2](468800/4588595) loss:2.883 lr:0.0000100 epoch_Time:26135.0min: [2024-01-04 19:13:43,776][model8_pretrain.py][INFO] Epoch:[0/2](468800/4588595) loss:2.668 lr:0.0000100 epoch_Time:26135.0min: [2024-01-04 19:13:43,776][model8_pretrain.py][INFO] Epoch:[0/2](468800/4588595) loss:3.239 lr:0.0000100 epoch_Time:26135.0min: [2024-01-04 19:13:43,776][model8_pretrain.py][INFO] Epoch:[0/2](468800/4588595) loss:3.326 lr:0.0000100 epoch_Time:26135.0min: [2024-01-04 19:14:20,710][model8_pretrain.py][INFO] Epoch:[0/2](468900/4588595) loss:2.741 lr:0.0000100 epoch_Time:26134.0min: [2024-01-04 19:14:20,710][model8_pretrain.py][INFO] Epoch:[0/2](468900/4588595) loss:2.621 lr:0.0000100 epoch_Time:26134.0min: [2024-01-04 19:14:20,710][model8_pretrain.py][INFO] Epoch:[0/2](468900/4588595) loss:2.947 lr:0.0000100 epoch_Time:26134.0min: [2024-01-04 19:14:20,710][model8_pretrain.py][INFO] Epoch:[0/2](468900/4588595) loss:2.891 lr:0.0000100 epoch_Time:26134.0min: [2024-01-04 19:14:20,710][model8_pretrain.py][INFO] Epoch:[0/2](468900/4588595) loss:2.720 lr:0.0000100 epoch_Time:26134.0min: [2024-01-04 19:14:20,710][model8_pretrain.py][INFO] Epoch:[0/2](468900/4588595) loss:2.835 lr:0.0000100 epoch_Time:26134.0min: [2024-01-04 19:14:20,711][model8_pretrain.py][INFO] Epoch:[0/2](468900/4588595) loss:3.318 lr:0.0000100 epoch_Time:26134.0min: [2024-01-04 19:14:20,711][model8_pretrain.py][INFO] Epoch:[0/2](468900/4588595) loss:3.019 lr:0.0000100 epoch_Time:26134.0min: [2024-01-04 19:14:57,647][model8_pretrain.py][INFO] Epoch:[0/2](469000/4588595) loss:2.892 lr:0.0000100 epoch_Time:26133.0min: [2024-01-04 19:14:57,647][model8_pretrain.py][INFO] Epoch:[0/2](469000/4588595) loss:2.865 lr:0.0000100 epoch_Time:26133.0min: [2024-01-04 19:14:57,647][model8_pretrain.py][INFO] Epoch:[0/2](469000/4588595) loss:2.737 lr:0.0000100 epoch_Time:26133.0min: [2024-01-04 19:14:57,647][model8_pretrain.py][INFO] 
Epoch:[0/2](469000/4588595) loss:3.001 lr:0.0000100 epoch_Time:26133.0min: [2024-01-04 19:14:57,647][model8_pretrain.py][INFO] Epoch:[0/2](469000/4588595) loss:3.009 lr:0.0000100 epoch_Time:26133.0min: [2024-01-04 19:14:57,647][model8_pretrain.py][INFO] Epoch:[0/2](469000/4588595) loss:3.046 lr:0.0000100 epoch_Time:26133.0min: [2024-01-04 19:14:57,647][model8_pretrain.py][INFO] Epoch:[0/2](469000/4588595) loss:3.345 lr:0.0000100 epoch_Time:26133.0min: [2024-01-04 19:14:57,647][model8_pretrain.py][INFO] Epoch:[0/2](469000/4588595) loss:3.057 lr:0.0000100 epoch_Time:26133.0min: [2024-01-04 19:15:43,318][model8_pretrain.py][INFO] Epoch:[0/2](469100/4588595) loss:2.652 lr:0.0000100 epoch_Time:26134.0min: [2024-01-04 19:15:43,318][model8_pretrain.py][INFO] Epoch:[0/2](469100/4588595) loss:2.540 lr:0.0000100 epoch_Time:26134.0min: [2024-01-04 19:15:43,318][model8_pretrain.py][INFO] Epoch:[0/2](469100/4588595) loss:3.394 lr:0.0000100 epoch_Time:26134.0min: [2024-01-04 19:15:43,318][model8_pretrain.py][INFO] Epoch:[0/2](469100/4588595) loss:2.751 lr:0.0000100 epoch_Time:26134.0min: [2024-01-04 19:15:43,318][model8_pretrain.py][INFO] Epoch:[0/2](469100/4588595) loss:3.171 lr:0.0000100 epoch_Time:26134.0min: [2024-01-04 19:15:43,318][model8_pretrain.py][INFO] Epoch:[0/2](469100/4588595) loss:2.781 lr:0.0000100 epoch_Time:26134.0min: [2024-01-04 19:15:43,318][model8_pretrain.py][INFO] Epoch:[0/2](469100/4588595) loss:3.054 lr:0.0000100 epoch_Time:26134.0min: [2024-01-04 19:15:43,318][model8_pretrain.py][INFO] Epoch:[0/2](469100/4588595) loss:2.677 lr:0.0000100 epoch_Time:26134.0min: [2024-01-04 19:16:21,904][model8_pretrain.py][INFO] Epoch:[0/2](469200/4588595) loss:3.135 lr:0.0000100 epoch_Time:26133.0min: [2024-01-04 19:16:21,904][model8_pretrain.py][INFO] Epoch:[0/2](469200/4588595) loss:3.084 lr:0.0000100 epoch_Time:26133.0min: [2024-01-04 19:16:21,904][model8_pretrain.py][INFO] Epoch:[0/2](469200/4588595) loss:3.385 lr:0.0000100 epoch_Time:26133.0min: [2024-01-04 
19:16:21,904][model8_pretrain.py][INFO] Epoch:[0/2](469200/4588595) loss:2.757 lr:0.0000100 epoch_Time:26133.0min: [2024-01-04 19:16:21,904][model8_pretrain.py][INFO] Epoch:[0/2](469200/4588595) loss:2.704 lr:0.0000100 epoch_Time:26133.0min: [2024-01-04 19:16:21,904][model8_pretrain.py][INFO] Epoch:[0/2](469200/4588595) loss:3.189 lr:0.0000100 epoch_Time:26133.0min: [2024-01-04 19:16:21,904][model8_pretrain.py][INFO] Epoch:[0/2](469200/4588595) loss:3.102 lr:0.0000100 epoch_Time:26133.0min: [2024-01-04 19:16:21,904][model8_pretrain.py][INFO] Epoch:[0/2](469200/4588595) loss:3.045 lr:0.0000100 epoch_Time:26133.0min: [2024-01-04 19:16:58,834][model8_pretrain.py][INFO] Epoch:[0/2](469300/4588595) loss:2.864 lr:0.0000100 epoch_Time:26132.0min: [2024-01-04 19:16:58,834][model8_pretrain.py][INFO] Epoch:[0/2](469300/4588595) loss:3.047 lr:0.0000100 epoch_Time:26132.0min: [2024-01-04 19:16:58,834][model8_pretrain.py][INFO] Epoch:[0/2](469300/4588595) loss:2.968 lr:0.0000100 epoch_Time:26132.0min: [2024-01-04 19:16:58,834][model8_pretrain.py][INFO] Epoch:[0/2](469300/4588595) loss:3.108 lr:0.0000100 epoch_Time:26132.0min: [2024-01-04 19:16:58,834][model8_pretrain.py][INFO] Epoch:[0/2](469300/4588595) loss:2.503 lr:0.0000100 epoch_Time:26132.0min: [2024-01-04 19:16:58,834][model8_pretrain.py][INFO] Epoch:[0/2](469300/4588595) loss:2.816 lr:0.0000100 epoch_Time:26132.0min: [2024-01-04 19:16:58,834][model8_pretrain.py][INFO] Epoch:[0/2](469300/4588595) loss:2.665 lr:0.0000100 epoch_Time:26132.0min: [2024-01-04 19:16:58,834][model8_pretrain.py][INFO] Epoch:[0/2](469300/4588595) loss:2.670 lr:0.0000100 epoch_Time:26132.0min: [2024-01-04 19:17:35,766][model8_pretrain.py][INFO] Epoch:[0/2](469400/4588595) loss:3.026 lr:0.0000100 epoch_Time:26132.0min: [2024-01-04 19:17:35,766][model8_pretrain.py][INFO] Epoch:[0/2](469400/4588595) loss:2.604 lr:0.0000100 epoch_Time:26132.0min: [2024-01-04 19:17:35,766][model8_pretrain.py][INFO] Epoch:[0/2](469400/4588595) loss:3.134 lr:0.0000100 
epoch_Time:26132.0min: [2024-01-04 19:17:35,766][model8_pretrain.py][INFO] Epoch:[0/2](469400/4588595) loss:3.180 lr:0.0000100 epoch_Time:26132.0min: [2024-01-04 19:17:35,766][model8_pretrain.py][INFO] Epoch:[0/2](469400/4588595) loss:3.379 lr:0.0000100 epoch_Time:26132.0min: [2024-01-04 19:17:35,766][model8_pretrain.py][INFO] Epoch:[0/2](469400/4588595) loss:3.073 lr:0.0000100 epoch_Time:26132.0min: [2024-01-04 19:17:35,766][model8_pretrain.py][INFO] Epoch:[0/2](469400/4588595) loss:2.660 lr:0.0000100 epoch_Time:26132.0min: [2024-01-04 19:17:35,766][model8_pretrain.py][INFO] Epoch:[0/2](469400/4588595) loss:2.943 lr:0.0000100 epoch_Time:26132.0min: [2024-01-04 19:18:12,701][model8_pretrain.py][INFO] Epoch:[0/2](469500/4588595) loss:3.151 lr:0.0000100 epoch_Time:26131.0min: [2024-01-04 19:18:12,701][model8_pretrain.py][INFO] Epoch:[0/2](469500/4588595) loss:2.964 lr:0.0000100 epoch_Time:26131.0min: [2024-01-04 19:18:12,701][model8_pretrain.py][INFO] Epoch:[0/2](469500/4588595) loss:3.294 lr:0.0000100 epoch_Time:26131.0min: [2024-01-04 19:18:12,701][model8_pretrain.py][INFO] Epoch:[0/2](469500/4588595) loss:2.811 lr:0.0000100 epoch_Time:26131.0min: [2024-01-04 19:18:12,701][model8_pretrain.py][INFO] Epoch:[0/2](469500/4588595) loss:2.880 lr:0.0000100 epoch_Time:26131.0min: [2024-01-04 19:18:12,701][model8_pretrain.py][INFO] Epoch:[0/2](469500/4588595) loss:3.546 lr:0.0000100 epoch_Time:26131.0min: [2024-01-04 19:18:12,701][model8_pretrain.py][INFO] Epoch:[0/2](469500/4588595) loss:2.875 lr:0.0000100 epoch_Time:26131.0min: [2024-01-04 19:18:12,701][model8_pretrain.py][INFO] Epoch:[0/2](469500/4588595) loss:3.143 lr:0.0000100 epoch_Time:26131.0min: [2024-01-04 19:18:49,641][model8_pretrain.py][INFO] Epoch:[0/2](469600/4588595) loss:3.095 lr:0.0000100 epoch_Time:26129.0min: [2024-01-04 19:18:49,641][model8_pretrain.py][INFO] Epoch:[0/2](469600/4588595) loss:2.706 lr:0.0000100 epoch_Time:26129.0min: [2024-01-04 19:18:49,641][model8_pretrain.py][INFO] 
Epoch:[0/2](469600/4588595) loss:2.431 lr:0.0000100 epoch_Time:26129.0min: [2024-01-04 19:18:49,641][model8_pretrain.py][INFO] Epoch:[0/2](469600/4588595) loss:2.964 lr:0.0000100 epoch_Time:26129.0min: [2024-01-04 19:18:49,641][model8_pretrain.py][INFO] Epoch:[0/2](469600/4588595) loss:2.914 lr:0.0000100 epoch_Time:26129.0min: [2024-01-04 19:18:49,641][model8_pretrain.py][INFO] Epoch:[0/2](469600/4588595) loss:3.259 lr:0.0000100 epoch_Time:26129.0min: [2024-01-04 19:18:49,641][model8_pretrain.py][INFO] Epoch:[0/2](469600/4588595) loss:2.933 lr:0.0000100 epoch_Time:26129.0min: [2024-01-04 19:18:49,641][model8_pretrain.py][INFO] Epoch:[0/2](469600/4588595) loss:2.842 lr:0.0000100 epoch_Time:26129.0min: [2024-01-04 19:19:26,546][model8_pretrain.py][INFO] Epoch:[0/2](469700/4588595) loss:2.765 lr:0.0000100 epoch_Time:26129.0min: [2024-01-04 19:19:26,546][model8_pretrain.py][INFO] Epoch:[0/2](469700/4588595) loss:2.525 lr:0.0000100 epoch_Time:26129.0min: [2024-01-04 19:19:26,546][model8_pretrain.py][INFO] Epoch:[0/2](469700/4588595) loss:2.999 lr:0.0000100 epoch_Time:26129.0min: [2024-01-04 19:19:26,546][model8_pretrain.py][INFO] Epoch:[0/2](469700/4588595) loss:2.809 lr:0.0000100 epoch_Time:26129.0min: [2024-01-04 19:19:26,546][model8_pretrain.py][INFO] Epoch:[0/2](469700/4588595) loss:2.452 lr:0.0000100 epoch_Time:26129.0min: [2024-01-04 19:19:26,546][model8_pretrain.py][INFO] Epoch:[0/2](469700/4588595) loss:3.253 lr:0.0000100 epoch_Time:26129.0min: [2024-01-04 19:19:26,546][model8_pretrain.py][INFO] Epoch:[0/2](469700/4588595) loss:3.334 lr:0.0000100 epoch_Time:26129.0min: [2024-01-04 19:19:26,546][model8_pretrain.py][INFO] Epoch:[0/2](469700/4588595) loss:2.975 lr:0.0000100 epoch_Time:26129.0min: [2024-01-04 19:20:03,476][model8_pretrain.py][INFO] Epoch:[0/2](469800/4588595) loss:3.100 lr:0.0000100 epoch_Time:26128.0min: [2024-01-04 19:20:03,476][model8_pretrain.py][INFO] Epoch:[0/2](469800/4588595) loss:2.785 lr:0.0000100 epoch_Time:26128.0min: [2024-01-04 
19:20:03,476][model8_pretrain.py][INFO] Epoch:[0/2](469800/4588595) loss:2.536 lr:0.0000100 epoch_Time:26128.0min: [2024-01-04 19:20:03,476][model8_pretrain.py][INFO] Epoch:[0/2](469800/4588595) loss:2.618 lr:0.0000100 epoch_Time:26128.0min: [2024-01-04 19:20:03,476][model8_pretrain.py][INFO] Epoch:[0/2](469800/4588595) loss:2.955 lr:0.0000100 epoch_Time:26128.0min: [2024-01-04 19:20:03,476][model8_pretrain.py][INFO] Epoch:[0/2](469800/4588595) loss:3.215 lr:0.0000100 epoch_Time:26128.0min: [2024-01-04 19:20:03,476][model8_pretrain.py][INFO] Epoch:[0/2](469800/4588595) loss:2.824 lr:0.0000100 epoch_Time:26128.0min: [2024-01-04 19:20:03,476][model8_pretrain.py][INFO] Epoch:[0/2](469800/4588595) loss:2.379 lr:0.0000100 epoch_Time:26128.0min: [2024-01-04 19:20:49,082][model8_pretrain.py][INFO] Epoch:[0/2](469900/4588595) loss:2.762 lr:0.0000100 epoch_Time:26128.0min: [2024-01-04 19:20:49,082][model8_pretrain.py][INFO] Epoch:[0/2](469900/4588595) loss:2.513 lr:0.0000100 epoch_Time:26128.0min: [2024-01-04 19:20:49,082][model8_pretrain.py][INFO] Epoch:[0/2](469900/4588595) loss:2.397 lr:0.0000100 epoch_Time:26128.0min: [2024-01-04 19:20:49,082][model8_pretrain.py][INFO] Epoch:[0/2](469900/4588595) loss:2.651 lr:0.0000100 epoch_Time:26128.0min: [2024-01-04 19:20:49,082][model8_pretrain.py][INFO] Epoch:[0/2](469900/4588595) loss:2.850 lr:0.0000100 epoch_Time:26128.0min: [2024-01-04 19:20:49,082][model8_pretrain.py][INFO] Epoch:[0/2](469900/4588595) loss:2.744 lr:0.0000100 epoch_Time:26128.0min: [2024-01-04 19:20:49,082][model8_pretrain.py][INFO] Epoch:[0/2](469900/4588595) loss:3.034 lr:0.0000100 epoch_Time:26128.0min: [2024-01-04 19:20:49,082][model8_pretrain.py][INFO] Epoch:[0/2](469900/4588595) loss:2.998 lr:0.0000100 epoch_Time:26128.0min: [2024-01-04 19:21:27,669][model8_pretrain.py][INFO] Epoch:[0/2](470000/4588595) loss:3.005 lr:0.0000100 epoch_Time:26128.0min: [2024-01-04 19:21:27,669][model8_pretrain.py][INFO] Epoch:[0/2](470000/4588595) loss:2.884 lr:0.0000100 
epoch_Time:26128.0min: [2024-01-04 19:21:27,669][model8_pretrain.py][INFO] Epoch:[0/2](470000/4588595) loss:2.604 lr:0.0000100 epoch_Time:26128.0min: [2024-01-04 19:21:27,669][model8_pretrain.py][INFO] Epoch:[0/2](470000/4588595) loss:2.995 lr:0.0000100 epoch_Time:26128.0min: [2024-01-04 19:21:27,669][model8_pretrain.py][INFO] Epoch:[0/2](470000/4588595) loss:2.770 lr:0.0000100 epoch_Time:26128.0min: [2024-01-04 19:21:27,669][model8_pretrain.py][INFO] Epoch:[0/2](470000/4588595) loss:2.779 lr:0.0000100 epoch_Time:26128.0min: [2024-01-04 19:21:27,669][model8_pretrain.py][INFO] Epoch:[0/2](470000/4588595) loss:2.968 lr:0.0000100 epoch_Time:26128.0min: [2024-01-04 19:21:27,669][model8_pretrain.py][INFO] Epoch:[0/2](470000/4588595) loss:2.881 lr:0.0000100 epoch_Time:26128.0min: [2024-01-04 19:22:04,601][model8_pretrain.py][INFO] Epoch:[0/2](470100/4588595) loss:3.039 lr:0.0000100 epoch_Time:26127.0min: [2024-01-04 19:22:04,601][model8_pretrain.py][INFO] Epoch:[0/2](470100/4588595) loss:2.509 lr:0.0000100 epoch_Time:26127.0min: [2024-01-04 19:22:04,601][model8_pretrain.py][INFO] Epoch:[0/2](470100/4588595) loss:2.735 lr:0.0000100 epoch_Time:26127.0min: [2024-01-04 19:22:04,601][model8_pretrain.py][INFO] Epoch:[0/2](470100/4588595) loss:2.739 lr:0.0000100 epoch_Time:26127.0min: [2024-01-04 19:22:04,601][model8_pretrain.py][INFO] Epoch:[0/2](470100/4588595) loss:3.291 lr:0.0000100 epoch_Time:26127.0min: [2024-01-04 19:22:04,601][model8_pretrain.py][INFO] Epoch:[0/2](470100/4588595) loss:2.416 lr:0.0000100 epoch_Time:26127.0min: [2024-01-04 19:22:04,602][model8_pretrain.py][INFO] Epoch:[0/2](470100/4588595) loss:2.765 lr:0.0000100 epoch_Time:26127.0min: [2024-01-04 19:22:04,602][model8_pretrain.py][INFO] Epoch:[0/2](470100/4588595) loss:2.747 lr:0.0000100 epoch_Time:26127.0min: [2024-01-04 19:22:41,540][model8_pretrain.py][INFO] Epoch:[0/2](470200/4588595) loss:3.015 lr:0.0000100 epoch_Time:26127.0min: [2024-01-04 19:22:41,540][model8_pretrain.py][INFO] 
Epoch:[0/2](470200/4588595) loss:2.976 lr:0.0000100 epoch_Time:26127.0min: [2024-01-04 19:22:41,540][model8_pretrain.py][INFO] Epoch:[0/2](470200/4588595) loss:3.251 lr:0.0000100 epoch_Time:26127.0min: [2024-01-04 19:22:41,540][model8_pretrain.py][INFO] Epoch:[0/2](470200/4588595) loss:2.685 lr:0.0000100 epoch_Time:26127.0min: [2024-01-04 19:22:41,540][model8_pretrain.py][INFO] Epoch:[0/2](470200/4588595) loss:2.317 lr:0.0000100 epoch_Time:26127.0min: [2024-01-04 19:22:41,540][model8_pretrain.py][INFO] Epoch:[0/2](470200/4588595) loss:2.564 lr:0.0000100 epoch_Time:26127.0min: [2024-01-04 19:22:41,540][model8_pretrain.py][INFO] Epoch:[0/2](470200/4588595) loss:2.852 lr:0.0000100 epoch_Time:26127.0min: [2024-01-04 19:22:41,540][model8_pretrain.py][INFO] Epoch:[0/2](470200/4588595) loss:2.663 lr:0.0000100 epoch_Time:26127.0min: [2024-01-04 19:23:18,476][model8_pretrain.py][INFO] Epoch:[0/2](470300/4588595) loss:3.352 lr:0.0000100 epoch_Time:26126.0min: [2024-01-04 19:23:18,476][model8_pretrain.py][INFO] Epoch:[0/2](470300/4588595) loss:2.447 lr:0.0000100 epoch_Time:26126.0min: [2024-01-04 19:23:18,476][model8_pretrain.py][INFO] Epoch:[0/2](470300/4588595) loss:2.731 lr:0.0000100 epoch_Time:26126.0min: [2024-01-04 19:23:18,476][model8_pretrain.py][INFO] Epoch:[0/2](470300/4588595) loss:3.196 lr:0.0000100 epoch_Time:26126.0min: [2024-01-04 19:23:18,476][model8_pretrain.py][INFO] Epoch:[0/2](470300/4588595) loss:3.030 lr:0.0000100 epoch_Time:26126.0min: [2024-01-04 19:23:18,476][model8_pretrain.py][INFO] Epoch:[0/2](470300/4588595) loss:3.120 lr:0.0000100 epoch_Time:26126.0min: [2024-01-04 19:23:18,476][model8_pretrain.py][INFO] Epoch:[0/2](470300/4588595) loss:2.987 lr:0.0000100 epoch_Time:26126.0min: [2024-01-04 19:23:18,477][model8_pretrain.py][INFO] Epoch:[0/2](470300/4588595) loss:2.707 lr:0.0000100 epoch_Time:26126.0min: [2024-01-04 19:23:55,417][model8_pretrain.py][INFO] Epoch:[0/2](470400/4588595) loss:3.260 lr:0.0000100 epoch_Time:26125.0min: [2024-01-04 
19:23:55,417][model8_pretrain.py][INFO] Epoch:[0/2](470400/4588595) loss:1.937 lr:0.0000100 epoch_Time:26125.0min: [2024-01-04 19:23:55,417][model8_pretrain.py][INFO] Epoch:[0/2](470400/4588595) loss:2.882 lr:0.0000100 epoch_Time:26125.0min: [2024-01-04 19:23:55,417][model8_pretrain.py][INFO] Epoch:[0/2](470400/4588595) loss:3.306 lr:0.0000100 epoch_Time:26125.0min: [2024-01-04 19:23:55,417][model8_pretrain.py][INFO] Epoch:[0/2](470400/4588595) loss:2.411 lr:0.0000100 epoch_Time:26125.0min: [2024-01-04 19:23:55,417][model8_pretrain.py][INFO] Epoch:[0/2](470400/4588595) loss:2.652 lr:0.0000100 epoch_Time:26125.0min: [2024-01-04 19:23:55,417][model8_pretrain.py][INFO] Epoch:[0/2](470400/4588595) loss:2.680 lr:0.0000100 epoch_Time:26125.0min: [2024-01-04 19:23:55,417][model8_pretrain.py][INFO] Epoch:[0/2](470400/4588595) loss:2.890 lr:0.0000100 epoch_Time:26125.0min: [2024-01-04 19:24:32,353][model8_pretrain.py][INFO] Epoch:[0/2](470500/4588595) loss:2.827 lr:0.0000100 epoch_Time:26124.0min: [2024-01-04 19:24:32,353][model8_pretrain.py][INFO] Epoch:[0/2](470500/4588595) loss:3.005 lr:0.0000100 epoch_Time:26124.0min: [2024-01-04 19:24:32,353][model8_pretrain.py][INFO] Epoch:[0/2](470500/4588595) loss:3.285 lr:0.0000100 epoch_Time:26124.0min: [2024-01-04 19:24:32,353][model8_pretrain.py][INFO] Epoch:[0/2](470500/4588595) loss:2.973 lr:0.0000100 epoch_Time:26124.0min: [2024-01-04 19:24:32,353][model8_pretrain.py][INFO] Epoch:[0/2](470500/4588595) loss:2.563 lr:0.0000100 epoch_Time:26124.0min: [2024-01-04 19:24:32,353][model8_pretrain.py][INFO] Epoch:[0/2](470500/4588595) loss:2.798 lr:0.0000100 epoch_Time:26124.0min: [2024-01-04 19:24:32,353][model8_pretrain.py][INFO] Epoch:[0/2](470500/4588595) loss:2.723 lr:0.0000100 epoch_Time:26124.0min: [2024-01-04 19:24:32,354][model8_pretrain.py][INFO] Epoch:[0/2](470500/4588595) loss:2.825 lr:0.0000100 epoch_Time:26124.0min: [2024-01-04 19:25:09,301][model8_pretrain.py][INFO] Epoch:[0/2](470600/4588595) loss:2.483 lr:0.0000100 
epoch_Time:26123.0min: [2024-01-04 19:25:09,301][model8_pretrain.py][INFO] Epoch:[0/2](470600/4588595) loss:2.699 lr:0.0000100 epoch_Time:26123.0min: [2024-01-04 19:25:09,301][model8_pretrain.py][INFO] Epoch:[0/2](470600/4588595) loss:2.818 lr:0.0000100 epoch_Time:26123.0min: [2024-01-04 19:25:09,301][model8_pretrain.py][INFO] Epoch:[0/2](470600/4588595) loss:2.532 lr:0.0000100 epoch_Time:26123.0min: [2024-01-04 19:25:09,301][model8_pretrain.py][INFO] Epoch:[0/2](470600/4588595) loss:3.042 lr:0.0000100 epoch_Time:26123.0min: [2024-01-04 19:25:09,301][model8_pretrain.py][INFO] Epoch:[0/2](470600/4588595) loss:2.812 lr:0.0000100 epoch_Time:26123.0min: [2024-01-04 19:25:09,302][model8_pretrain.py][INFO] Epoch:[0/2](470600/4588595) loss:2.977 lr:0.0000100 epoch_Time:26123.0min: [2024-01-04 19:25:09,302][model8_pretrain.py][INFO] Epoch:[0/2](470600/4588595) loss:2.967 lr:0.0000100 epoch_Time:26123.0min: [2024-01-04 19:25:51,520][model8_pretrain.py][INFO] Epoch:[0/2](470700/4588595) loss:2.789 lr:0.0000100 epoch_Time:26123.0min: [2024-01-04 19:25:51,520][model8_pretrain.py][INFO] Epoch:[0/2](470700/4588595) loss:3.043 lr:0.0000100 epoch_Time:26123.0min: [2024-01-04 19:25:51,520][model8_pretrain.py][INFO] Epoch:[0/2](470700/4588595) loss:2.345 lr:0.0000100 epoch_Time:26123.0min: [2024-01-04 19:25:51,520][model8_pretrain.py][INFO] Epoch:[0/2](470700/4588595) loss:2.981 lr:0.0000100 epoch_Time:26123.0min: [2024-01-04 19:25:51,520][model8_pretrain.py][INFO] Epoch:[0/2](470700/4588595) loss:2.545 lr:0.0000100 epoch_Time:26123.0min: [2024-01-04 19:25:51,520][model8_pretrain.py][INFO] Epoch:[0/2](470700/4588595) loss:2.902 lr:0.0000100 epoch_Time:26123.0min: [2024-01-04 19:25:51,520][model8_pretrain.py][INFO] Epoch:[0/2](470700/4588595) loss:2.933 lr:0.0000100 epoch_Time:26123.0min: [2024-01-04 19:25:51,520][model8_pretrain.py][INFO] Epoch:[0/2](470700/4588595) loss:2.837 lr:0.0000100 epoch_Time:26123.0min: [2024-01-04 19:26:33,583][model8_pretrain.py][INFO] 
Epoch:[0/2](470800/4588595) loss:3.075 lr:0.0000100 epoch_Time:26124.0min: [2024-01-04 19:26:33,583][model8_pretrain.py][INFO] Epoch:[0/2](470800/4588595) loss:2.393 lr:0.0000100 epoch_Time:26124.0min: [2024-01-04 19:26:33,583][model8_pretrain.py][INFO] Epoch:[0/2](470800/4588595) loss:2.550 lr:0.0000100 epoch_Time:26124.0min: [2024-01-04 19:26:33,583][model8_pretrain.py][INFO] Epoch:[0/2](470800/4588595) loss:2.903 lr:0.0000100 epoch_Time:26124.0min: [2024-01-04 19:26:33,583][model8_pretrain.py][INFO] Epoch:[0/2](470800/4588595) loss:1.711 lr:0.0000100 epoch_Time:26124.0min: [2024-01-04 19:26:33,584][model8_pretrain.py][INFO] Epoch:[0/2](470800/4588595) loss:2.458 lr:0.0000100 epoch_Time:26124.0min: [2024-01-04 19:26:33,583][model8_pretrain.py][INFO] Epoch:[0/2](470800/4588595) loss:2.759 lr:0.0000100 epoch_Time:26124.0min: [2024-01-04 19:26:33,584][model8_pretrain.py][INFO] Epoch:[0/2](470800/4588595) loss:2.311 lr:0.0000100 epoch_Time:26124.0min: [2024-01-04 19:27:10,484][model8_pretrain.py][INFO] Epoch:[0/2](470900/4588595) loss:2.232 lr:0.0000100 epoch_Time:26122.0min: [2024-01-04 19:27:10,484][model8_pretrain.py][INFO] Epoch:[0/2](470900/4588595) loss:2.814 lr:0.0000100 epoch_Time:26122.0min: [2024-01-04 19:27:10,484][model8_pretrain.py][INFO] Epoch:[0/2](470900/4588595) loss:2.737 lr:0.0000100 epoch_Time:26122.0min: [2024-01-04 19:27:10,484][model8_pretrain.py][INFO] Epoch:[0/2](470900/4588595) loss:3.329 lr:0.0000100 epoch_Time:26122.0min: [2024-01-04 19:27:10,484][model8_pretrain.py][INFO] Epoch:[0/2](470900/4588595) loss:3.175 lr:0.0000100 epoch_Time:26122.0min: [2024-01-04 19:27:10,484][model8_pretrain.py][INFO] Epoch:[0/2](470900/4588595) loss:2.915 lr:0.0000100 epoch_Time:26122.0min: [2024-01-04 19:27:10,484][model8_pretrain.py][INFO] Epoch:[0/2](470900/4588595) loss:3.134 lr:0.0000100 epoch_Time:26122.0min: [2024-01-04 19:27:10,484][model8_pretrain.py][INFO] Epoch:[0/2](470900/4588595) loss:2.620 lr:0.0000100 epoch_Time:26122.0min: [2024-01-04 
19:27:47,420][model8_pretrain.py][INFO] Epoch:[0/2](471000/4588595) loss:3.008 lr:0.0000100 epoch_Time:26122.0min: [2024-01-04 19:27:47,420][model8_pretrain.py][INFO] Epoch:[0/2](471000/4588595) loss:2.753 lr:0.0000100 epoch_Time:26122.0min: [2024-01-04 19:27:47,420][model8_pretrain.py][INFO] Epoch:[0/2](471000/4588595) loss:2.862 lr:0.0000100 epoch_Time:26122.0min: [2024-01-04 19:27:47,420][model8_pretrain.py][INFO] Epoch:[0/2](471000/4588595) loss:3.095 lr:0.0000100 epoch_Time:26122.0min: [2024-01-04 19:27:47,420][model8_pretrain.py][INFO] Epoch:[0/2](471000/4588595) loss:2.710 lr:0.0000100 epoch_Time:26122.0min: [2024-01-04 19:27:47,420][model8_pretrain.py][INFO] Epoch:[0/2](471000/4588595) loss:2.344 lr:0.0000100 epoch_Time:26122.0min: [2024-01-04 19:27:47,420][model8_pretrain.py][INFO] Epoch:[0/2](471000/4588595) loss:2.497 lr:0.0000100 epoch_Time:26122.0min: [2024-01-04 19:27:47,420][model8_pretrain.py][INFO] Epoch:[0/2](471000/4588595) loss:3.017 lr:0.0000100 epoch_Time:26122.0min: [2024-01-04 19:28:24,361][model8_pretrain.py][INFO] Epoch:[0/2](471100/4588595) loss:2.624 lr:0.0000100 epoch_Time:26121.0min: [2024-01-04 19:28:24,361][model8_pretrain.py][INFO] Epoch:[0/2](471100/4588595) loss:2.930 lr:0.0000100 epoch_Time:26121.0min: [2024-01-04 19:28:24,361][model8_pretrain.py][INFO] Epoch:[0/2](471100/4588595) loss:3.279 lr:0.0000100 epoch_Time:26121.0min: [2024-01-04 19:28:24,361][model8_pretrain.py][INFO] Epoch:[0/2](471100/4588595) loss:2.748 lr:0.0000100 epoch_Time:26121.0min: [2024-01-04 19:28:24,361][model8_pretrain.py][INFO] Epoch:[0/2](471100/4588595) loss:2.597 lr:0.0000100 epoch_Time:26121.0min: [2024-01-04 19:28:24,361][model8_pretrain.py][INFO] Epoch:[0/2](471100/4588595) loss:2.894 lr:0.0000100 epoch_Time:26121.0min: [2024-01-04 19:28:24,361][model8_pretrain.py][INFO] Epoch:[0/2](471100/4588595) loss:3.136 lr:0.0000100 epoch_Time:26121.0min: [2024-01-04 19:28:24,361][model8_pretrain.py][INFO] Epoch:[0/2](471100/4588595) loss:2.861 lr:0.0000100 
epoch_Time:26121.0min: [2024-01-04 19:29:01,322][model8_pretrain.py][INFO] Epoch:[0/2](471200/4588595) loss:3.268 lr:0.0000100 epoch_Time:26120.0min: [2024-01-04 19:29:01,322][model8_pretrain.py][INFO] Epoch:[0/2](471200/4588595) loss:2.363 lr:0.0000100 epoch_Time:26120.0min: [2024-01-04 19:29:01,322][model8_pretrain.py][INFO] Epoch:[0/2](471200/4588595) loss:3.008 lr:0.0000100 epoch_Time:26120.0min: [2024-01-04 19:29:01,322][model8_pretrain.py][INFO] Epoch:[0/2](471200/4588595) loss:2.484 lr:0.0000100 epoch_Time:26120.0min: [2024-01-04 19:29:01,322][model8_pretrain.py][INFO] Epoch:[0/2](471200/4588595) loss:2.673 lr:0.0000100 epoch_Time:26120.0min: [2024-01-04 19:29:01,322][model8_pretrain.py][INFO] Epoch:[0/2](471200/4588595) loss:3.168 lr:0.0000100 epoch_Time:26120.0min: [2024-01-04 19:29:01,322][model8_pretrain.py][INFO] Epoch:[0/2](471200/4588595) loss:3.266 lr:0.0000100 epoch_Time:26120.0min: [2024-01-04 19:29:01,322][model8_pretrain.py][INFO] Epoch:[0/2](471200/4588595) loss:2.989 lr:0.0000100 epoch_Time:26120.0min: [2024-01-04 19:29:38,271][model8_pretrain.py][INFO] Epoch:[0/2](471300/4588595) loss:2.516 lr:0.0000100 epoch_Time:26120.0min: [2024-01-04 19:29:38,271][model8_pretrain.py][INFO] Epoch:[0/2](471300/4588595) loss:3.300 lr:0.0000100 epoch_Time:26120.0min: [2024-01-04 19:29:38,271][model8_pretrain.py][INFO] Epoch:[0/2](471300/4588595) loss:2.479 lr:0.0000100 epoch_Time:26120.0min: [2024-01-04 19:29:38,271][model8_pretrain.py][INFO] Epoch:[0/2](471300/4588595) loss:2.357 lr:0.0000100 epoch_Time:26120.0min: [2024-01-04 19:29:38,271][model8_pretrain.py][INFO] Epoch:[0/2](471300/4588595) loss:3.005 lr:0.0000100 epoch_Time:26120.0min: [2024-01-04 19:29:38,271][model8_pretrain.py][INFO] Epoch:[0/2](471300/4588595) loss:2.646 lr:0.0000100 epoch_Time:26120.0min: [2024-01-04 19:29:38,271][model8_pretrain.py][INFO] Epoch:[0/2](471300/4588595) loss:2.756 lr:0.0000100 epoch_Time:26120.0min: [2024-01-04 19:29:38,271][model8_pretrain.py][INFO] 
Epoch:[0/2](471300/4588595) loss:2.652 lr:0.0000100 epoch_Time:26120.0min: [2024-01-04 19:30:15,224][model8_pretrain.py][INFO] Epoch:[0/2](471400/4588595) loss:2.684 lr:0.0000100 epoch_Time:26118.0min: [2024-01-04 19:30:15,224][model8_pretrain.py][INFO] Epoch:[0/2](471400/4588595) loss:1.964 lr:0.0000100 epoch_Time:26119.0min: [2024-01-04 19:30:15,224][model8_pretrain.py][INFO] Epoch:[0/2](471400/4588595) loss:2.925 lr:0.0000100 epoch_Time:26118.0min: [2024-01-04 19:30:15,224][model8_pretrain.py][INFO] Epoch:[0/2](471400/4588595) loss:2.618 lr:0.0000100 epoch_Time:26118.0min: [2024-01-04 19:30:15,224][model8_pretrain.py][INFO] Epoch:[0/2](471400/4588595) loss:3.405 lr:0.0000100 epoch_Time:26119.0min: [2024-01-04 19:30:15,224][model8_pretrain.py][INFO] Epoch:[0/2](471400/4588595) loss:2.545 lr:0.0000100 epoch_Time:26118.0min: [2024-01-04 19:30:15,224][model8_pretrain.py][INFO] Epoch:[0/2](471400/4588595) loss:2.760 lr:0.0000100 epoch_Time:26119.0min: [2024-01-04 19:30:15,224][model8_pretrain.py][INFO] Epoch:[0/2](471400/4588595) loss:2.369 lr:0.0000100 epoch_Time:26118.0min: [2024-01-04 19:30:57,322][model8_pretrain.py][INFO] Epoch:[0/2](471500/4588595) loss:3.023 lr:0.0000100 epoch_Time:26118.0min: [2024-01-04 19:30:57,322][model8_pretrain.py][INFO] Epoch:[0/2](471500/4588595) loss:2.807 lr:0.0000100 epoch_Time:26118.0min: [2024-01-04 19:30:57,322][model8_pretrain.py][INFO] Epoch:[0/2](471500/4588595) loss:2.498 lr:0.0000100 epoch_Time:26118.0min: [2024-01-04 19:30:57,322][model8_pretrain.py][INFO] Epoch:[0/2](471500/4588595) loss:2.958 lr:0.0000100 epoch_Time:26118.0min: [2024-01-04 19:30:57,322][model8_pretrain.py][INFO] Epoch:[0/2](471500/4588595) loss:2.878 lr:0.0000100 epoch_Time:26118.0min: [2024-01-04 19:30:57,322][model8_pretrain.py][INFO] Epoch:[0/2](471500/4588595) loss:3.107 lr:0.0000100 epoch_Time:26118.0min: [2024-01-04 19:30:57,322][model8_pretrain.py][INFO] Epoch:[0/2](471500/4588595) loss:2.746 lr:0.0000100 epoch_Time:26118.0min: [2024-01-04 
19:30:57,323][model8_pretrain.py][INFO] Epoch:[0/2](471500/4588595) loss:2.400 lr:0.0000100 epoch_Time:26118.0min: [2024-01-04 19:31:39,382][model8_pretrain.py][INFO] Epoch:[0/2](471600/4588595) loss:3.003 lr:0.0000100 epoch_Time:26119.0min: [2024-01-04 19:31:39,382][model8_pretrain.py][INFO] Epoch:[0/2](471600/4588595) loss:2.429 lr:0.0000100 epoch_Time:26119.0min: [2024-01-04 19:31:39,382][model8_pretrain.py][INFO] Epoch:[0/2](471600/4588595) loss:3.466 lr:0.0000100 epoch_Time:26119.0min: [2024-01-04 19:31:39,382][model8_pretrain.py][INFO] Epoch:[0/2](471600/4588595) loss:2.974 lr:0.0000100 epoch_Time:26119.0min: [2024-01-04 19:31:39,382][model8_pretrain.py][INFO] Epoch:[0/2](471600/4588595) loss:2.324 lr:0.0000100 epoch_Time:26119.0min: [2024-01-04 19:31:39,382][model8_pretrain.py][INFO] Epoch:[0/2](471600/4588595) loss:2.986 lr:0.0000100 epoch_Time:26119.0min: [2024-01-04 19:31:39,382][model8_pretrain.py][INFO] Epoch:[0/2](471600/4588595) loss:2.670 lr:0.0000100 epoch_Time:26119.0min: [2024-01-04 19:31:39,382][model8_pretrain.py][INFO] Epoch:[0/2](471600/4588595) loss:2.820 lr:0.0000100 epoch_Time:26119.0min: [2024-01-04 19:32:16,324][model8_pretrain.py][INFO] Epoch:[0/2](471700/4588595) loss:2.453 lr:0.0000100 epoch_Time:26118.0min: [2024-01-04 19:32:16,324][model8_pretrain.py][INFO] Epoch:[0/2](471700/4588595) loss:2.886 lr:0.0000100 epoch_Time:26118.0min: [2024-01-04 19:32:16,324][model8_pretrain.py][INFO] Epoch:[0/2](471700/4588595) loss:3.084 lr:0.0000100 epoch_Time:26118.0min: [2024-01-04 19:32:16,324][model8_pretrain.py][INFO] Epoch:[0/2](471700/4588595) loss:2.958 lr:0.0000100 epoch_Time:26118.0min: [2024-01-04 19:32:16,324][model8_pretrain.py][INFO] Epoch:[0/2](471700/4588595) loss:2.612 lr:0.0000100 epoch_Time:26118.0min: [2024-01-04 19:32:16,324][model8_pretrain.py][INFO] Epoch:[0/2](471700/4588595) loss:3.334 lr:0.0000100 epoch_Time:26118.0min: [2024-01-04 19:32:16,324][model8_pretrain.py][INFO] Epoch:[0/2](471700/4588595) loss:3.005 lr:0.0000100 
epoch_Time:26118.0min: [2024-01-04 19:32:16,324][model8_pretrain.py][INFO] Epoch:[0/2](471700/4588595) loss:3.392 lr:0.0000100 epoch_Time:26118.0min: [2024-01-04 19:32:53,272][model8_pretrain.py][INFO] Epoch:[0/2](471800/4588595) loss:3.103 lr:0.0000100 epoch_Time:26116.0min: [2024-01-04 19:32:53,272][model8_pretrain.py][INFO] Epoch:[0/2](471800/4588595) loss:2.384 lr:0.0000100 epoch_Time:26116.0min: [2024-01-04 19:32:53,272][model8_pretrain.py][INFO] Epoch:[0/2](471800/4588595) loss:3.317 lr:0.0000100 epoch_Time:26116.0min: [2024-01-04 19:32:53,272][model8_pretrain.py][INFO] Epoch:[0/2](471800/4588595) loss:2.639 lr:0.0000100 epoch_Time:26116.0min: [2024-01-04 19:32:53,272][model8_pretrain.py][INFO] Epoch:[0/2](471800/4588595) loss:3.008 lr:0.0000100 epoch_Time:26116.0min: [2024-01-04 19:32:53,272][model8_pretrain.py][INFO] Epoch:[0/2](471800/4588595) loss:3.060 lr:0.0000100 epoch_Time:26116.0min: [2024-01-04 19:32:53,272][model8_pretrain.py][INFO] Epoch:[0/2](471800/4588595) loss:2.745 lr:0.0000100 epoch_Time:26116.0min: [2024-01-04 19:32:53,272][model8_pretrain.py][INFO] Epoch:[0/2](471800/4588595) loss:3.019 lr:0.0000100 epoch_Time:26116.0min: [2024-01-04 19:33:30,222][model8_pretrain.py][INFO] Epoch:[0/2](471900/4588595) loss:2.351 lr:0.0000100 epoch_Time:26116.0min: [2024-01-04 19:33:30,222][model8_pretrain.py][INFO] Epoch:[0/2](471900/4588595) loss:2.226 lr:0.0000100 epoch_Time:26116.0min: [2024-01-04 19:33:30,222][model8_pretrain.py][INFO] Epoch:[0/2](471900/4588595) loss:3.028 lr:0.0000100 epoch_Time:26116.0min: [2024-01-04 19:33:30,222][model8_pretrain.py][INFO] Epoch:[0/2](471900/4588595) loss:2.878 lr:0.0000100 epoch_Time:26116.0min: [2024-01-04 19:33:30,222][model8_pretrain.py][INFO] Epoch:[0/2](471900/4588595) loss:2.728 lr:0.0000100 epoch_Time:26116.0min: [2024-01-04 19:33:30,222][model8_pretrain.py][INFO] Epoch:[0/2](471900/4588595) loss:3.045 lr:0.0000100 epoch_Time:26116.0min: [2024-01-04 19:33:30,222][model8_pretrain.py][INFO] 
Epoch:[0/2](471900/4588595) loss:2.390 lr:0.0000100 epoch_Time:26116.0min: [2024-01-04 19:33:30,222][model8_pretrain.py][INFO] Epoch:[0/2](471900/4588595) loss:2.879 lr:0.0000100 epoch_Time:26116.0min: [2024-01-04 19:34:07,170][model8_pretrain.py][INFO] Epoch:[0/2](472000/4588595) loss:3.187 lr:0.0000100 epoch_Time:26115.0min: [2024-01-04 19:34:07,170][model8_pretrain.py][INFO] Epoch:[0/2](472000/4588595) loss:2.568 lr:0.0000100 epoch_Time:26115.0min: [2024-01-04 19:34:07,170][model8_pretrain.py][INFO] Epoch:[0/2](472000/4588595) loss:3.566 lr:0.0000100 epoch_Time:26115.0min: [2024-01-04 19:34:07,170][model8_pretrain.py][INFO] Epoch:[0/2](472000/4588595) loss:3.113 lr:0.0000100 epoch_Time:26115.0min: [2024-01-04 19:34:07,170][model8_pretrain.py][INFO] Epoch:[0/2](472000/4588595) loss:2.750 lr:0.0000100 epoch_Time:26115.0min: [2024-01-04 19:34:07,170][model8_pretrain.py][INFO] Epoch:[0/2](472000/4588595) loss:2.532 lr:0.0000100 epoch_Time:26115.0min: [2024-01-04 19:34:07,170][model8_pretrain.py][INFO] Epoch:[0/2](472000/4588595) loss:2.804 lr:0.0000100 epoch_Time:26115.0min: [2024-01-04 19:34:07,170][model8_pretrain.py][INFO] Epoch:[0/2](472000/4588595) loss:2.412 lr:0.0000100 epoch_Time:26115.0min: [2024-01-04 19:34:44,117][model8_pretrain.py][INFO] Epoch:[0/2](472100/4588595) loss:2.914 lr:0.0000100 epoch_Time:26115.0min: [2024-01-04 19:34:44,117][model8_pretrain.py][INFO] Epoch:[0/2](472100/4588595) loss:2.676 lr:0.0000100 epoch_Time:26115.0min: [2024-01-04 19:34:44,117][model8_pretrain.py][INFO] Epoch:[0/2](472100/4588595) loss:2.837 lr:0.0000100 epoch_Time:26115.0min: [2024-01-04 19:34:44,117][model8_pretrain.py][INFO] Epoch:[0/2](472100/4588595) loss:2.580 lr:0.0000100 epoch_Time:26115.0min: [2024-01-04 19:34:44,117][model8_pretrain.py][INFO] Epoch:[0/2](472100/4588595) loss:2.844 lr:0.0000100 epoch_Time:26115.0min: [2024-01-04 19:34:44,117][model8_pretrain.py][INFO] Epoch:[0/2](472100/4588595) loss:2.701 lr:0.0000100 epoch_Time:26115.0min: [2024-01-04 
19:34:44,117][model8_pretrain.py][INFO] Epoch:[0/2](472100/4588595) loss:2.728 lr:0.0000100 epoch_Time:26115.0min: [2024-01-04 19:34:44,117][model8_pretrain.py][INFO] Epoch:[0/2](472100/4588595) loss:2.967 lr:0.0000100 epoch_Time:26115.0min: [2024-01-04 19:35:21,076][model8_pretrain.py][INFO] Epoch:[0/2](472200/4588595) loss:2.714 lr:0.0000100 epoch_Time:26114.0min: [2024-01-04 19:35:21,076][model8_pretrain.py][INFO] Epoch:[0/2](472200/4588595) loss:3.104 lr:0.0000100 epoch_Time:26114.0min: [2024-01-04 19:35:21,076][model8_pretrain.py][INFO] Epoch:[0/2](472200/4588595) loss:2.817 lr:0.0000100 epoch_Time:26114.0min: [2024-01-04 19:35:21,076][model8_pretrain.py][INFO] Epoch:[0/2](472200/4588595) loss:2.655 lr:0.0000100 epoch_Time:26114.0min: [2024-01-04 19:35:21,076][model8_pretrain.py][INFO] Epoch:[0/2](472200/4588595) loss:3.022 lr:0.0000100 epoch_Time:26114.0min: [2024-01-04 19:35:21,076][model8_pretrain.py][INFO] Epoch:[0/2](472200/4588595) loss:2.562 lr:0.0000100 epoch_Time:26114.0min: [2024-01-04 19:35:21,076][model8_pretrain.py][INFO] Epoch:[0/2](472200/4588595) loss:2.771 lr:0.0000100 epoch_Time:26114.0min: [2024-01-04 19:35:21,076][model8_pretrain.py][INFO] Epoch:[0/2](472200/4588595) loss:2.485 lr:0.0000100 epoch_Time:26114.0min: [2024-01-04 19:36:03,405][model8_pretrain.py][INFO] Epoch:[0/2](472300/4588595) loss:2.582 lr:0.0000100 epoch_Time:26113.0min: [2024-01-04 19:36:03,405][model8_pretrain.py][INFO] Epoch:[0/2](472300/4588595) loss:3.009 lr:0.0000100 epoch_Time:26113.0min: [2024-01-04 19:36:03,405][model8_pretrain.py][INFO] Epoch:[0/2](472300/4588595) loss:2.726 lr:0.0000100 epoch_Time:26113.0min: [2024-01-04 19:36:03,405][model8_pretrain.py][INFO] Epoch:[0/2](472300/4588595) loss:2.421 lr:0.0000100 epoch_Time:26113.0min: [2024-01-04 19:36:03,405][model8_pretrain.py][INFO] Epoch:[0/2](472300/4588595) loss:2.762 lr:0.0000100 epoch_Time:26113.0min: [2024-01-04 19:36:03,406][model8_pretrain.py][INFO] Epoch:[0/2](472300/4588595) loss:3.214 lr:0.0000100 
epoch_Time:26113.0min: [2024-01-04 19:36:03,406][model8_pretrain.py][INFO] Epoch:[0/2](472300/4588595) loss:2.500 lr:0.0000100 epoch_Time:26113.0min: [2024-01-04 19:36:03,407][model8_pretrain.py][INFO] Epoch:[0/2](472300/4588595) loss:2.287 lr:0.0000100 epoch_Time:26113.0min: [2024-01-04 19:36:45,454][model8_pretrain.py][INFO] Epoch:[0/2](472400/4588595) loss:3.075 lr:0.0000100 epoch_Time:26114.0min: [2024-01-04 19:36:45,454][model8_pretrain.py][INFO] Epoch:[0/2](472400/4588595) loss:3.017 lr:0.0000100 epoch_Time:26114.0min: [2024-01-04 19:36:45,454][model8_pretrain.py][INFO] Epoch:[0/2](472400/4588595) loss:2.382 lr:0.0000100 epoch_Time:26114.0min: [2024-01-04 19:36:45,454][model8_pretrain.py][INFO] Epoch:[0/2](472400/4588595) loss:3.570 lr:0.0000100 epoch_Time:26114.0min: [2024-01-04 19:36:45,454][model8_pretrain.py][INFO] Epoch:[0/2](472400/4588595) loss:3.068 lr:0.0000100 epoch_Time:26114.0min: [2024-01-04 19:36:45,454][model8_pretrain.py][INFO] Epoch:[0/2](472400/4588595) loss:2.141 lr:0.0000100 epoch_Time:26114.0min: [2024-01-04 19:36:45,454][model8_pretrain.py][INFO] Epoch:[0/2](472400/4588595) loss:2.896 lr:0.0000100 epoch_Time:26114.0min: [2024-01-04 19:36:45,454][model8_pretrain.py][INFO] Epoch:[0/2](472400/4588595) loss:2.708 lr:0.0000100 epoch_Time:26114.0min: [2024-01-04 19:37:22,423][model8_pretrain.py][INFO] Epoch:[0/2](472500/4588595) loss:2.571 lr:0.0000100 epoch_Time:26113.0min: [2024-01-04 19:37:22,423][model8_pretrain.py][INFO] Epoch:[0/2](472500/4588595) loss:2.778 lr:0.0000100 epoch_Time:26113.0min: [2024-01-04 19:37:22,423][model8_pretrain.py][INFO] Epoch:[0/2](472500/4588595) loss:2.434 lr:0.0000100 epoch_Time:26113.0min: [2024-01-04 19:37:22,423][model8_pretrain.py][INFO] Epoch:[0/2](472500/4588595) loss:3.080 lr:0.0000100 epoch_Time:26113.0min: [2024-01-04 19:37:22,423][model8_pretrain.py][INFO] Epoch:[0/2](472500/4588595) loss:2.681 lr:0.0000100 epoch_Time:26113.0min: [2024-01-04 19:37:22,423][model8_pretrain.py][INFO] 
Epoch:[0/2](472500/4588595) loss:2.928 lr:0.0000100 epoch_Time:26113.0min: [2024-01-04 19:37:22,423][model8_pretrain.py][INFO] Epoch:[0/2](472500/4588595) loss:2.695 lr:0.0000100 epoch_Time:26113.0min: [2024-01-04 19:37:22,423][model8_pretrain.py][INFO] Epoch:[0/2](472500/4588595) loss:2.988 lr:0.0000100 epoch_Time:26113.0min: [2024-01-04 19:37:59,371][model8_pretrain.py][INFO] Epoch:[0/2](472600/4588595) loss:3.172 lr:0.0000100 epoch_Time:26112.0min: [2024-01-04 19:37:59,371][model8_pretrain.py][INFO] Epoch:[0/2](472600/4588595) loss:3.062 lr:0.0000100 epoch_Time:26112.0min: [2024-01-04 19:37:59,371][model8_pretrain.py][INFO] Epoch:[0/2](472600/4588595) loss:2.934 lr:0.0000100 epoch_Time:26112.0min: [2024-01-04 19:37:59,371][model8_pretrain.py][INFO] Epoch:[0/2](472600/4588595) loss:2.618 lr:0.0000100 epoch_Time:26112.0min: [2024-01-04 19:37:59,371][model8_pretrain.py][INFO] Epoch:[0/2](472600/4588595) loss:2.487 lr:0.0000100 epoch_Time:26112.0min: [2024-01-04 19:37:59,371][model8_pretrain.py][INFO] Epoch:[0/2](472600/4588595) loss:2.472 lr:0.0000100 epoch_Time:26112.0min: [2024-01-04 19:37:59,371][model8_pretrain.py][INFO] Epoch:[0/2](472600/4588595) loss:2.817 lr:0.0000100 epoch_Time:26112.0min: [2024-01-04 19:37:59,371][model8_pretrain.py][INFO] Epoch:[0/2](472600/4588595) loss:2.041 lr:0.0000100 epoch_Time:26112.0min: [2024-01-04 19:38:36,309][model8_pretrain.py][INFO] Epoch:[0/2](472700/4588595) loss:2.569 lr:0.0000100 epoch_Time:26112.0min: [2024-01-04 19:38:36,309][model8_pretrain.py][INFO] Epoch:[0/2](472700/4588595) loss:3.118 lr:0.0000100 epoch_Time:26112.0min: [2024-01-04 19:38:36,309][model8_pretrain.py][INFO] Epoch:[0/2](472700/4588595) loss:2.461 lr:0.0000100 epoch_Time:26112.0min: [2024-01-04 19:38:36,309][model8_pretrain.py][INFO] Epoch:[0/2](472700/4588595) loss:2.721 lr:0.0000100 epoch_Time:26112.0min: [2024-01-04 19:38:36,309][model8_pretrain.py][INFO] Epoch:[0/2](472700/4588595) loss:2.891 lr:0.0000100 epoch_Time:26112.0min: [2024-01-04 
19:38:36,309][model8_pretrain.py][INFO] Epoch:[0/2](472700/4588595) loss:2.956 lr:0.0000100 epoch_Time:26112.0min: [2024-01-04 19:38:36,309][model8_pretrain.py][INFO] Epoch:[0/2](472700/4588595) loss:2.406 lr:0.0000100 epoch_Time:26112.0min: [2024-01-04 19:38:36,309][model8_pretrain.py][INFO] Epoch:[0/2](472700/4588595) loss:2.925 lr:0.0000100 epoch_Time:26112.0min: [2024-01-04 19:39:13,255][model8_pretrain.py][INFO] Epoch:[0/2](472800/4588595) loss:2.286 lr:0.0000100 epoch_Time:26110.0min: [2024-01-04 19:39:13,255][model8_pretrain.py][INFO] Epoch:[0/2](472800/4588595) loss:2.732 lr:0.0000100 epoch_Time:26110.0min: [2024-01-04 19:39:13,255][model8_pretrain.py][INFO] Epoch:[0/2](472800/4588595) loss:2.717 lr:0.0000100 epoch_Time:26110.0min: [2024-01-04 19:39:13,255][model8_pretrain.py][INFO] Epoch:[0/2](472800/4588595) loss:2.809 lr:0.0000100 epoch_Time:26110.0min: [2024-01-04 19:39:13,255][model8_pretrain.py][INFO] Epoch:[0/2](472800/4588595) loss:2.615 lr:0.0000100 epoch_Time:26110.0min: [2024-01-04 19:39:13,255][model8_pretrain.py][INFO] Epoch:[0/2](472800/4588595) loss:3.250 lr:0.0000100 epoch_Time:26110.0min: [2024-01-04 19:39:13,255][model8_pretrain.py][INFO] Epoch:[0/2](472800/4588595) loss:2.816 lr:0.0000100 epoch_Time:26110.0min: [2024-01-04 19:39:13,255][model8_pretrain.py][INFO] Epoch:[0/2](472800/4588595) loss:1.981 lr:0.0000100 epoch_Time:26110.0min: [2024-01-04 19:39:50,192][model8_pretrain.py][INFO] Epoch:[0/2](472900/4588595) loss:2.759 lr:0.0000100 epoch_Time:26109.0min: [2024-01-04 19:39:50,192][model8_pretrain.py][INFO] Epoch:[0/2](472900/4588595) loss:2.986 lr:0.0000100 epoch_Time:26109.0min: [2024-01-04 19:39:50,192][model8_pretrain.py][INFO] Epoch:[0/2](472900/4588595) loss:2.549 lr:0.0000100 epoch_Time:26109.0min: [2024-01-04 19:39:50,192][model8_pretrain.py][INFO] Epoch:[0/2](472900/4588595) loss:3.208 lr:0.0000100 epoch_Time:26109.0min: [2024-01-04 19:39:50,192][model8_pretrain.py][INFO] Epoch:[0/2](472900/4588595) loss:2.600 lr:0.0000100 
epoch_Time:26109.0min: [2024-01-04 19:39:50,192][model8_pretrain.py][INFO] Epoch:[0/2](472900/4588595) loss:3.132 lr:0.0000100 epoch_Time:26109.0min: [2024-01-04 19:39:50,193][model8_pretrain.py][INFO] Epoch:[0/2](472900/4588595) loss:2.803 lr:0.0000100 epoch_Time:26109.0min: [2024-01-04 19:39:50,193][model8_pretrain.py][INFO] Epoch:[0/2](472900/4588595) loss:2.808 lr:0.0000100 epoch_Time:26109.0min: [2024-01-04 19:40:27,136][model8_pretrain.py][INFO] Epoch:[0/2](473000/4588595) loss:3.205 lr:0.0000100 epoch_Time:26109.0min: [2024-01-04 19:40:27,136][model8_pretrain.py][INFO] Epoch:[0/2](473000/4588595) loss:2.934 lr:0.0000100 epoch_Time:26109.0min: [2024-01-04 19:40:27,136][model8_pretrain.py][INFO] Epoch:[0/2](473000/4588595) loss:2.893 lr:0.0000100 epoch_Time:26109.0min: [2024-01-04 19:40:27,137][model8_pretrain.py][INFO] Epoch:[0/2](473000/4588595) loss:3.088 lr:0.0000100 epoch_Time:26109.0min: [2024-01-04 19:40:27,137][model8_pretrain.py][INFO] Epoch:[0/2](473000/4588595) loss:2.676 lr:0.0000100 epoch_Time:26109.0min: [2024-01-04 19:40:27,137][model8_pretrain.py][INFO] Epoch:[0/2](473000/4588595) loss:2.916 lr:0.0000100 epoch_Time:26109.0min: [2024-01-04 19:40:27,137][model8_pretrain.py][INFO] Epoch:[0/2](473000/4588595) loss:2.967 lr:0.0000100 epoch_Time:26109.0min: [2024-01-04 19:40:27,138][model8_pretrain.py][INFO] Epoch:[0/2](473000/4588595) loss:3.160 lr:0.0000100 epoch_Time:26109.0min: [2024-01-04 19:41:05,831][model8_pretrain.py][INFO] Epoch:[0/2](473100/4588595) loss:2.867 lr:0.0000100 epoch_Time:26108.0min: [2024-01-04 19:41:05,831][model8_pretrain.py][INFO] Epoch:[0/2](473100/4588595) loss:3.016 lr:0.0000100 epoch_Time:26108.0min: [2024-01-04 19:41:05,831][model8_pretrain.py][INFO] Epoch:[0/2](473100/4588595) loss:2.330 lr:0.0000100 epoch_Time:26108.0min: [2024-01-04 19:41:05,831][model8_pretrain.py][INFO] Epoch:[0/2](473100/4588595) loss:2.704 lr:0.0000100 epoch_Time:26108.0min: [2024-01-04 19:41:05,831][model8_pretrain.py][INFO] 
Epoch:[0/2](473100/4588595) loss:2.599 lr:0.0000100 epoch_Time:26108.0min: [2024-01-04 19:41:05,831][model8_pretrain.py][INFO] Epoch:[0/2](473100/4588595) loss:3.370 lr:0.0000100 epoch_Time:26108.0min: [2024-01-04 19:41:05,831][model8_pretrain.py][INFO] Epoch:[0/2](473100/4588595) loss:3.024 lr:0.0000100 epoch_Time:26108.0min: [2024-01-04 19:41:05,831][model8_pretrain.py][INFO] Epoch:[0/2](473100/4588595) loss:2.870 lr:0.0000100 epoch_Time:26108.0min: [2024-01-04 19:41:50,849][model8_pretrain.py][INFO] Epoch:[0/2](473200/4588595) loss:2.848 lr:0.0000100 epoch_Time:26108.0min: [2024-01-04 19:41:50,850][model8_pretrain.py][INFO] Epoch:[0/2](473200/4588595) loss:2.624 lr:0.0000100 epoch_Time:26108.0min: [2024-01-04 19:41:50,850][model8_pretrain.py][INFO] Epoch:[0/2](473200/4588595) loss:2.538 lr:0.0000100 epoch_Time:26108.0min: [2024-01-04 19:41:50,850][model8_pretrain.py][INFO] Epoch:[0/2](473200/4588595) loss:2.264 lr:0.0000100 epoch_Time:26108.0min: [2024-01-04 19:41:50,850][model8_pretrain.py][INFO] Epoch:[0/2](473200/4588595) loss:3.334 lr:0.0000100 epoch_Time:26108.0min: [2024-01-04 19:41:50,850][model8_pretrain.py][INFO] Epoch:[0/2](473200/4588595) loss:2.779 lr:0.0000100 epoch_Time:26108.0min: [2024-01-04 19:41:50,850][model8_pretrain.py][INFO] Epoch:[0/2](473200/4588595) loss:2.776 lr:0.0000100 epoch_Time:26108.0min: [2024-01-04 19:41:50,850][model8_pretrain.py][INFO] Epoch:[0/2](473200/4588595) loss:2.819 lr:0.0000100 epoch_Time:26108.0min: [2024-01-04 19:42:27,779][model8_pretrain.py][INFO] Epoch:[0/2](473300/4588595) loss:2.825 lr:0.0000100 epoch_Time:26108.0min: [2024-01-04 19:42:27,779][model8_pretrain.py][INFO] Epoch:[0/2](473300/4588595) loss:2.789 lr:0.0000100 epoch_Time:26108.0min: [2024-01-04 19:42:27,779][model8_pretrain.py][INFO] Epoch:[0/2](473300/4588595) loss:2.783 lr:0.0000100 epoch_Time:26108.0min: [2024-01-04 19:42:27,779][model8_pretrain.py][INFO] Epoch:[0/2](473300/4588595) loss:3.015 lr:0.0000100 epoch_Time:26108.0min: [2024-01-04 
19:42:27,779][model8_pretrain.py][INFO] Epoch:[0/2](473300/4588595) loss:2.517 lr:0.0000100 epoch_Time:26108.0min: [2024-01-04 19:42:27,779][model8_pretrain.py][INFO] Epoch:[0/2](473300/4588595) loss:2.870 lr:0.0000100 epoch_Time:26108.0min: [2024-01-04 19:42:27,779][model8_pretrain.py][INFO] Epoch:[0/2](473300/4588595) loss:3.136 lr:0.0000100 epoch_Time:26108.0min: [2024-01-04 19:42:27,779][model8_pretrain.py][INFO] Epoch:[0/2](473300/4588595) loss:3.427 lr:0.0000100 epoch_Time:26108.0min: [2024-01-04 19:43:04,711][model8_pretrain.py][INFO] Epoch:[0/2](473400/4588595) loss:2.145 lr:0.0000100 epoch_Time:26107.0min: [2024-01-04 19:43:04,711][model8_pretrain.py][INFO] Epoch:[0/2](473400/4588595) loss:2.744 lr:0.0000100 epoch_Time:26107.0min: [2024-01-04 19:43:04,711][model8_pretrain.py][INFO] Epoch:[0/2](473400/4588595) loss:2.652 lr:0.0000100 epoch_Time:26107.0min: [2024-01-04 19:43:04,711][model8_pretrain.py][INFO] Epoch:[0/2](473400/4588595) loss:2.996 lr:0.0000100 epoch_Time:26107.0min: [2024-01-04 19:43:04,711][model8_pretrain.py][INFO] Epoch:[0/2](473400/4588595) loss:3.285 lr:0.0000100 epoch_Time:26107.0min: [2024-01-04 19:43:04,711][model8_pretrain.py][INFO] Epoch:[0/2](473400/4588595) loss:3.001 lr:0.0000100 epoch_Time:26107.0min: [2024-01-04 19:43:04,711][model8_pretrain.py][INFO] Epoch:[0/2](473400/4588595) loss:2.055 lr:0.0000100 epoch_Time:26107.0min: [2024-01-04 19:43:04,711][model8_pretrain.py][INFO] Epoch:[0/2](473400/4588595) loss:3.166 lr:0.0000100 epoch_Time:26107.0min: [2024-01-04 19:43:41,641][model8_pretrain.py][INFO] Epoch:[0/2](473500/4588595) loss:3.247 lr:0.0000100 epoch_Time:26107.0min: [2024-01-04 19:43:41,641][model8_pretrain.py][INFO] Epoch:[0/2](473500/4588595) loss:2.000 lr:0.0000100 epoch_Time:26107.0min: [2024-01-04 19:43:41,641][model8_pretrain.py][INFO] Epoch:[0/2](473500/4588595) loss:3.193 lr:0.0000100 epoch_Time:26107.0min: [2024-01-04 19:43:41,641][model8_pretrain.py][INFO] Epoch:[0/2](473500/4588595) loss:2.639 lr:0.0000100 
epoch_Time:26107.0min: [2024-01-04 19:43:41,641][model8_pretrain.py][INFO] Epoch:[0/2](473500/4588595) loss:2.380 lr:0.0000100 epoch_Time:26107.0min: [2024-01-04 19:43:41,641][model8_pretrain.py][INFO] Epoch:[0/2](473500/4588595) loss:3.220 lr:0.0000100 epoch_Time:26107.0min: [2024-01-04 19:43:41,641][model8_pretrain.py][INFO] Epoch:[0/2](473500/4588595) loss:2.464 lr:0.0000100 epoch_Time:26107.0min: [2024-01-04 19:43:41,641][model8_pretrain.py][INFO] Epoch:[0/2](473500/4588595) loss:2.679 lr:0.0000100 epoch_Time:26107.0min: [2024-01-04 19:44:18,579][model8_pretrain.py][INFO] Epoch:[0/2](473600/4588595) loss:3.039 lr:0.0000100 epoch_Time:26105.0min: [2024-01-04 19:44:18,579][model8_pretrain.py][INFO] Epoch:[0/2](473600/4588595) loss:2.388 lr:0.0000100 epoch_Time:26105.0min: [2024-01-04 19:44:18,580][model8_pretrain.py][INFO] Epoch:[0/2](473600/4588595) loss:2.640 lr:0.0000100 epoch_Time:26105.0min: [2024-01-04 19:44:18,580][model8_pretrain.py][INFO] Epoch:[0/2](473600/4588595) loss:3.207 lr:0.0000100 epoch_Time:26105.0min: [2024-01-04 19:44:18,579][model8_pretrain.py][INFO] Epoch:[0/2](473600/4588595) loss:2.790 lr:0.0000100 epoch_Time:26105.0min: [2024-01-04 19:44:18,580][model8_pretrain.py][INFO] Epoch:[0/2](473600/4588595) loss:3.241 lr:0.0000100 epoch_Time:26105.0min: [2024-01-04 19:44:18,580][model8_pretrain.py][INFO] Epoch:[0/2](473600/4588595) loss:3.488 lr:0.0000100 epoch_Time:26105.0min: [2024-01-04 19:44:18,580][model8_pretrain.py][INFO] Epoch:[0/2](473600/4588595) loss:2.899 lr:0.0000100 epoch_Time:26105.0min: [2024-01-04 19:44:55,507][model8_pretrain.py][INFO] Epoch:[0/2](473700/4588595) loss:2.991 lr:0.0000100 epoch_Time:26104.0min: [2024-01-04 19:44:55,507][model8_pretrain.py][INFO] Epoch:[0/2](473700/4588595) loss:3.501 lr:0.0000100 epoch_Time:26104.0min: [2024-01-04 19:44:55,507][model8_pretrain.py][INFO] Epoch:[0/2](473700/4588595) loss:3.032 lr:0.0000100 epoch_Time:26104.0min: [2024-01-04 19:44:55,507][model8_pretrain.py][INFO] 
Epoch:[0/2](473700/4588595) loss:2.318 lr:0.0000100 epoch_Time:26104.0min: [2024-01-04 19:44:55,507][model8_pretrain.py][INFO] Epoch:[0/2](473700/4588595) loss:3.293 lr:0.0000100 epoch_Time:26104.0min: [2024-01-04 19:44:55,507][model8_pretrain.py][INFO] Epoch:[0/2](473700/4588595) loss:3.031 lr:0.0000100 epoch_Time:26104.0min: [2024-01-04 19:44:55,507][model8_pretrain.py][INFO] Epoch:[0/2](473700/4588595) loss:2.350 lr:0.0000100 epoch_Time:26104.0min: [2024-01-04 19:44:55,507][model8_pretrain.py][INFO] Epoch:[0/2](473700/4588595) loss:3.392 lr:0.0000100 epoch_Time:26104.0min: [2024-01-04 19:45:32,439][model8_pretrain.py][INFO] Epoch:[0/2](473800/4588595) loss:2.908 lr:0.0000100 epoch_Time:26104.0min: [2024-01-04 19:45:32,439][model8_pretrain.py][INFO] Epoch:[0/2](473800/4588595) loss:3.213 lr:0.0000100 epoch_Time:26104.0min: [2024-01-04 19:45:32,439][model8_pretrain.py][INFO] Epoch:[0/2](473800/4588595) loss:2.978 lr:0.0000100 epoch_Time:26104.0min: [2024-01-04 19:45:32,439][model8_pretrain.py][INFO] Epoch:[0/2](473800/4588595) loss:2.283 lr:0.0000100 epoch_Time:26104.0min: [2024-01-04 19:45:32,439][model8_pretrain.py][INFO] Epoch:[0/2](473800/4588595) loss:2.944 lr:0.0000100 epoch_Time:26104.0min: [2024-01-04 19:45:32,439][model8_pretrain.py][INFO] Epoch:[0/2](473800/4588595) loss:2.941 lr:0.0000100 epoch_Time:26104.0min: [2024-01-04 19:45:32,439][model8_pretrain.py][INFO] Epoch:[0/2](473800/4588595) loss:2.459 lr:0.0000100 epoch_Time:26104.0min: [2024-01-04 19:45:32,439][model8_pretrain.py][INFO] Epoch:[0/2](473800/4588595) loss:2.589 lr:0.0000100 epoch_Time:26104.0min: [2024-01-04 19:46:11,172][model8_pretrain.py][INFO] Epoch:[0/2](473900/4588595) loss:2.902 lr:0.0000100 epoch_Time:26103.0min: [2024-01-04 19:46:11,172][model8_pretrain.py][INFO] Epoch:[0/2](473900/4588595) loss:2.904 lr:0.0000100 epoch_Time:26103.0min: [2024-01-04 19:46:11,172][model8_pretrain.py][INFO] Epoch:[0/2](473900/4588595) loss:3.082 lr:0.0000100 epoch_Time:26103.0min: [2024-01-04 
19:46:11,172][model8_pretrain.py][INFO] Epoch:[0/2](473900/4588595) loss:2.986 lr:0.0000100 epoch_Time:26103.0min: [2024-01-04 19:46:11,172][model8_pretrain.py][INFO] Epoch:[0/2](473900/4588595) loss:3.090 lr:0.0000100 epoch_Time:26103.0min: [2024-01-04 19:46:11,172][model8_pretrain.py][INFO] Epoch:[0/2](473900/4588595) loss:2.857 lr:0.0000100 epoch_Time:26103.0min: [2024-01-04 19:46:11,172][model8_pretrain.py][INFO] Epoch:[0/2](473900/4588595) loss:2.942 lr:0.0000100 epoch_Time:26103.0min: [2024-01-04 19:46:11,172][model8_pretrain.py][INFO] Epoch:[0/2](473900/4588595) loss:3.234 lr:0.0000100 epoch_Time:26103.0min: [2024-01-04 19:46:55,971][model8_pretrain.py][INFO] Epoch:[0/2](474000/4588595) loss:2.798 lr:0.0000100 epoch_Time:26103.0min: [2024-01-04 19:46:55,971][model8_pretrain.py][INFO] Epoch:[0/2](474000/4588595) loss:3.061 lr:0.0000100 epoch_Time:26103.0min: [2024-01-04 19:46:55,971][model8_pretrain.py][INFO] Epoch:[0/2](474000/4588595) loss:2.532 lr:0.0000100 epoch_Time:26103.0min: [2024-01-04 19:46:55,971][model8_pretrain.py][INFO] Epoch:[0/2](474000/4588595) loss:2.270 lr:0.0000100 epoch_Time:26103.0min: [2024-01-04 19:46:55,971][model8_pretrain.py][INFO] Epoch:[0/2](474000/4588595) loss:2.431 lr:0.0000100 epoch_Time:26103.0min: [2024-01-04 19:46:55,971][model8_pretrain.py][INFO] Epoch:[0/2](474000/4588595) loss:2.501 lr:0.0000100 epoch_Time:26103.0min: [2024-01-04 19:46:55,971][model8_pretrain.py][INFO] Epoch:[0/2](474000/4588595) loss:2.576 lr:0.0000100 epoch_Time:26103.0min: [2024-01-04 19:46:55,971][model8_pretrain.py][INFO] Epoch:[0/2](474000/4588595) loss:2.952 lr:0.0000100 epoch_Time:26103.0min: [2024-01-04 19:47:32,898][model8_pretrain.py][INFO] Epoch:[0/2](474100/4588595) loss:2.826 lr:0.0000100 epoch_Time:26103.0min: [2024-01-04 19:47:32,898][model8_pretrain.py][INFO] Epoch:[0/2](474100/4588595) loss:2.489 lr:0.0000100 epoch_Time:26103.0min: [2024-01-04 19:47:32,898][model8_pretrain.py][INFO] Epoch:[0/2](474100/4588595) loss:2.748 lr:0.0000100 
epoch_Time:26103.0min: [2024-01-04 19:47:32,898][model8_pretrain.py][INFO] Epoch:[0/2](474100/4588595) loss:3.200 lr:0.0000100 epoch_Time:26103.0min: [2024-01-04 19:47:32,898][model8_pretrain.py][INFO] Epoch:[0/2](474100/4588595) loss:2.936 lr:0.0000100 epoch_Time:26103.0min: [2024-01-04 19:47:32,898][model8_pretrain.py][INFO] Epoch:[0/2](474100/4588595) loss:3.242 lr:0.0000100 epoch_Time:26103.0min: [2024-01-04 19:47:32,898][model8_pretrain.py][INFO] Epoch:[0/2](474100/4588595) loss:3.127 lr:0.0000100 epoch_Time:26103.0min: [2024-01-04 19:47:32,898][model8_pretrain.py][INFO] Epoch:[0/2](474100/4588595) loss:3.394 lr:0.0000100 epoch_Time:26103.0min: [2024-01-04 19:48:09,835][model8_pretrain.py][INFO] Epoch:[0/2](474200/4588595) loss:3.140 lr:0.0000100 epoch_Time:26102.0min: [2024-01-04 19:48:09,835][model8_pretrain.py][INFO] Epoch:[0/2](474200/4588595) loss:1.796 lr:0.0000100 epoch_Time:26102.0min: [2024-01-04 19:48:09,835][model8_pretrain.py][INFO] Epoch:[0/2](474200/4588595) loss:3.040 lr:0.0000100 epoch_Time:26102.0min: [2024-01-04 19:48:09,835][model8_pretrain.py][INFO] Epoch:[0/2](474200/4588595) loss:2.958 lr:0.0000100 epoch_Time:26102.0min: [2024-01-04 19:48:09,835][model8_pretrain.py][INFO] Epoch:[0/2](474200/4588595) loss:2.968 lr:0.0000100 epoch_Time:26102.0min: [2024-01-04 19:48:09,835][model8_pretrain.py][INFO] Epoch:[0/2](474200/4588595) loss:1.998 lr:0.0000100 epoch_Time:26102.0min: [2024-01-04 19:48:09,835][model8_pretrain.py][INFO] Epoch:[0/2](474200/4588595) loss:2.830 lr:0.0000100 epoch_Time:26102.0min: [2024-01-04 19:48:09,835][model8_pretrain.py][INFO] Epoch:[0/2](474200/4588595) loss:2.365 lr:0.0000100 epoch_Time:26102.0min: [2024-01-04 19:48:46,771][model8_pretrain.py][INFO] Epoch:[0/2](474300/4588595) loss:3.472 lr:0.0000100 epoch_Time:26102.0min: [2024-01-04 19:48:46,771][model8_pretrain.py][INFO] Epoch:[0/2](474300/4588595) loss:2.570 lr:0.0000100 epoch_Time:26102.0min: [2024-01-04 19:48:46,771][model8_pretrain.py][INFO] 
Epoch:[0/2](474300/4588595) loss:3.149 lr:0.0000100 epoch_Time:26102.0min: [2024-01-04 19:48:46,771][model8_pretrain.py][INFO] Epoch:[0/2](474300/4588595) loss:2.870 lr:0.0000100 epoch_Time:26102.0min: [2024-01-04 19:48:46,771][model8_pretrain.py][INFO] Epoch:[0/2](474300/4588595) loss:3.276 lr:0.0000100 epoch_Time:26102.0min: [2024-01-04 19:48:46,771][model8_pretrain.py][INFO] Epoch:[0/2](474300/4588595) loss:2.931 lr:0.0000100 epoch_Time:26102.0min: [2024-01-04 19:48:46,771][model8_pretrain.py][INFO] Epoch:[0/2](474300/4588595) loss:2.951 lr:0.0000100 epoch_Time:26102.0min: [2024-01-04 19:48:46,772][model8_pretrain.py][INFO] Epoch:[0/2](474300/4588595) loss:2.911 lr:0.0000100 epoch_Time:26102.0min: [2024-01-04 19:49:23,716][model8_pretrain.py][INFO] Epoch:[0/2](474400/4588595) loss:3.286 lr:0.0000100 epoch_Time:26101.0min: [2024-01-04 19:49:23,716][model8_pretrain.py][INFO] Epoch:[0/2](474400/4588595) loss:3.147 lr:0.0000100 epoch_Time:26101.0min: [2024-01-04 19:49:23,716][model8_pretrain.py][INFO] Epoch:[0/2](474400/4588595) loss:3.015 lr:0.0000100 epoch_Time:26101.0min: [2024-01-04 19:49:23,716][model8_pretrain.py][INFO] Epoch:[0/2](474400/4588595) loss:3.065 lr:0.0000100 epoch_Time:26101.0min: [2024-01-04 19:49:23,716][model8_pretrain.py][INFO] Epoch:[0/2](474400/4588595) loss:2.582 lr:0.0000100 epoch_Time:26101.0min: [2024-01-04 19:49:23,716][model8_pretrain.py][INFO] Epoch:[0/2](474400/4588595) loss:3.109 lr:0.0000100 epoch_Time:26101.0min: [2024-01-04 19:49:23,716][model8_pretrain.py][INFO] Epoch:[0/2](474400/4588595) loss:3.199 lr:0.0000100 epoch_Time:26101.0min: [2024-01-04 19:49:23,716][model8_pretrain.py][INFO] Epoch:[0/2](474400/4588595) loss:2.951 lr:0.0000100 epoch_Time:26101.0min: [2024-01-04 19:50:00,656][model8_pretrain.py][INFO] Epoch:[0/2](474500/4588595) loss:2.763 lr:0.0000100 epoch_Time:26099.0min: [2024-01-04 19:50:00,656][model8_pretrain.py][INFO] Epoch:[0/2](474500/4588595) loss:3.016 lr:0.0000100 epoch_Time:26099.0min: [2024-01-04 
19:50:00,656][model8_pretrain.py][INFO] Epoch:[0/2](474500/4588595) loss:2.599 lr:0.0000100 epoch_Time:26099.0min: [2024-01-04 19:50:00,656][model8_pretrain.py][INFO] Epoch:[0/2](474500/4588595) loss:2.511 lr:0.0000100 epoch_Time:26099.0min: [2024-01-04 19:50:00,656][model8_pretrain.py][INFO] Epoch:[0/2](474500/4588595) loss:2.674 lr:0.0000100 epoch_Time:26099.0min: [2024-01-04 19:50:00,656][model8_pretrain.py][INFO] Epoch:[0/2](474500/4588595) loss:3.242 lr:0.0000100 epoch_Time:26099.0min: [2024-01-04 19:50:00,656][model8_pretrain.py][INFO] Epoch:[0/2](474500/4588595) loss:2.660 lr:0.0000100 epoch_Time:26099.0min: [2024-01-04 19:50:00,656][model8_pretrain.py][INFO] Epoch:[0/2](474500/4588595) loss:3.180 lr:0.0000100 epoch_Time:26099.0min: [2024-01-04 19:50:37,596][model8_pretrain.py][INFO] Epoch:[0/2](474600/4588595) loss:3.437 lr:0.0000100 epoch_Time:26099.0min: [2024-01-04 19:50:37,596][model8_pretrain.py][INFO] Epoch:[0/2](474600/4588595) loss:2.978 lr:0.0000100 epoch_Time:26099.0min: [2024-01-04 19:50:37,596][model8_pretrain.py][INFO] Epoch:[0/2](474600/4588595) loss:3.275 lr:0.0000100 epoch_Time:26099.0min: [2024-01-04 19:50:37,596][model8_pretrain.py][INFO] Epoch:[0/2](474600/4588595) loss:2.911 lr:0.0000100 epoch_Time:26099.0min: [2024-01-04 19:50:37,596][model8_pretrain.py][INFO] Epoch:[0/2](474600/4588595) loss:3.108 lr:0.0000100 epoch_Time:26099.0min: [2024-01-04 19:50:37,596][model8_pretrain.py][INFO] Epoch:[0/2](474600/4588595) loss:3.027 lr:0.0000100 epoch_Time:26099.0min: [2024-01-04 19:50:37,596][model8_pretrain.py][INFO] Epoch:[0/2](474600/4588595) loss:2.892 lr:0.0000100 epoch_Time:26099.0min: [2024-01-04 19:50:37,596][model8_pretrain.py][INFO] Epoch:[0/2](474600/4588595) loss:3.035 lr:0.0000100 epoch_Time:26099.0min: [2024-01-04 19:51:16,260][model8_pretrain.py][INFO] Epoch:[0/2](474700/4588595) loss:2.658 lr:0.0000100 epoch_Time:26098.0min: [2024-01-04 19:51:16,260][model8_pretrain.py][INFO] Epoch:[0/2](474700/4588595) loss:3.096 lr:0.0000100 
epoch_Time:26098.0min: [2024-01-04 19:51:16,260][model8_pretrain.py][INFO] Epoch:[0/2](474700/4588595) loss:2.670 lr:0.0000100 epoch_Time:26098.0min: [2024-01-04 19:51:16,260][model8_pretrain.py][INFO] Epoch:[0/2](474700/4588595) loss:3.195 lr:0.0000100 epoch_Time:26098.0min: [2024-01-04 19:51:16,260][model8_pretrain.py][INFO] Epoch:[0/2](474700/4588595) loss:3.206 lr:0.0000100 epoch_Time:26098.0min: [2024-01-04 19:51:16,260][model8_pretrain.py][INFO] Epoch:[0/2](474700/4588595) loss:2.985 lr:0.0000100 epoch_Time:26098.0min: [2024-01-04 19:51:16,260][model8_pretrain.py][INFO] Epoch:[0/2](474700/4588595) loss:2.515 lr:0.0000100 epoch_Time:26098.0min: [2024-01-04 19:51:16,260][model8_pretrain.py][INFO] Epoch:[0/2](474700/4588595) loss:2.808 lr:0.0000100 epoch_Time:26098.0min: [2024-01-04 19:52:01,204][model8_pretrain.py][INFO] Epoch:[0/2](474800/4588595) loss:2.898 lr:0.0000100 epoch_Time:26098.0min: [2024-01-04 19:52:01,205][model8_pretrain.py][INFO] Epoch:[0/2](474800/4588595) loss:2.685 lr:0.0000100 epoch_Time:26098.0min: [2024-01-04 19:52:01,205][model8_pretrain.py][INFO] Epoch:[0/2](474800/4588595) loss:2.918 lr:0.0000100 epoch_Time:26098.0min: [2024-01-04 19:52:01,205][model8_pretrain.py][INFO] Epoch:[0/2](474800/4588595) loss:2.955 lr:0.0000100 epoch_Time:26098.0min: [2024-01-04 19:52:01,205][model8_pretrain.py][INFO] Epoch:[0/2](474800/4588595) loss:3.234 lr:0.0000100 epoch_Time:26098.0min: [2024-01-04 19:52:01,205][model8_pretrain.py][INFO] Epoch:[0/2](474800/4588595) loss:2.158 lr:0.0000100 epoch_Time:26098.0min: [2024-01-04 19:52:01,205][model8_pretrain.py][INFO] Epoch:[0/2](474800/4588595) loss:2.697 lr:0.0000100 epoch_Time:26098.0min: [2024-01-04 19:52:01,205][model8_pretrain.py][INFO] Epoch:[0/2](474800/4588595) loss:2.791 lr:0.0000100 epoch_Time:26098.0min: [2024-01-04 19:52:38,141][model8_pretrain.py][INFO] Epoch:[0/2](474900/4588595) loss:2.998 lr:0.0000100 epoch_Time:26098.0min: [2024-01-04 19:52:38,141][model8_pretrain.py][INFO] 
Epoch:[0/2](474900/4588595) loss:3.005 lr:0.0000100 epoch_Time:26098.0min: [2024-01-04 19:52:38,141][model8_pretrain.py][INFO] Epoch:[0/2](474900/4588595) loss:3.065 lr:0.0000100 epoch_Time:26098.0min: [2024-01-04 19:52:38,141][model8_pretrain.py][INFO] Epoch:[0/2](474900/4588595) loss:2.790 lr:0.0000100 epoch_Time:26098.0min: [2024-01-04 19:52:38,141][model8_pretrain.py][INFO] Epoch:[0/2](474900/4588595) loss:2.875 lr:0.0000100 epoch_Time:26098.0min: [2024-01-04 19:52:38,141][model8_pretrain.py][INFO] Epoch:[0/2](474900/4588595) loss:2.939 lr:0.0000100 epoch_Time:26098.0min: [2024-01-04 19:52:38,141][model8_pretrain.py][INFO] Epoch:[0/2](474900/4588595) loss:2.742 lr:0.0000100 epoch_Time:26098.0min: [2024-01-04 19:52:38,142][model8_pretrain.py][INFO] Epoch:[0/2](474900/4588595) loss:2.866 lr:0.0000100 epoch_Time:26098.0min: [2024-01-04 19:53:15,086][model8_pretrain.py][INFO] Epoch:[0/2](475000/4588595) loss:2.587 lr:0.0000100 epoch_Time:26097.0min: [2024-01-04 19:53:15,086][model8_pretrain.py][INFO] Epoch:[0/2](475000/4588595) loss:2.631 lr:0.0000100 epoch_Time:26097.0min: [2024-01-04 19:53:15,086][model8_pretrain.py][INFO] Epoch:[0/2](475000/4588595) loss:2.846 lr:0.0000100 epoch_Time:26097.0min: [2024-01-04 19:53:15,086][model8_pretrain.py][INFO] Epoch:[0/2](475000/4588595) loss:3.257 lr:0.0000100 epoch_Time:26097.0min: [2024-01-04 19:53:15,086][model8_pretrain.py][INFO] Epoch:[0/2](475000/4588595) loss:2.639 lr:0.0000100 epoch_Time:26097.0min: [2024-01-04 19:53:15,086][model8_pretrain.py][INFO] Epoch:[0/2](475000/4588595) loss:2.640 lr:0.0000100 epoch_Time:26097.0min: [2024-01-04 19:53:15,086][model8_pretrain.py][INFO] Epoch:[0/2](475000/4588595) loss:2.867 lr:0.0000100 epoch_Time:26097.0min: [2024-01-04 19:53:15,087][model8_pretrain.py][INFO] Epoch:[0/2](475000/4588595) loss:2.648 lr:0.0000100 epoch_Time:26097.0min: [2024-01-04 19:53:52,021][model8_pretrain.py][INFO] Epoch:[0/2](475100/4588595) loss:2.604 lr:0.0000100 epoch_Time:26096.0min: [2024-01-04 
19:53:52,021][model8_pretrain.py][INFO] Epoch:[0/2](475100/4588595) loss:3.017 lr:0.0000100 epoch_Time:26096.0min: [2024-01-04 19:53:52,021][model8_pretrain.py][INFO] Epoch:[0/2](475100/4588595) loss:2.713 lr:0.0000100 epoch_Time:26096.0min: [2024-01-04 19:53:52,021][model8_pretrain.py][INFO] Epoch:[0/2](475100/4588595) loss:2.742 lr:0.0000100 epoch_Time:26096.0min: [2024-01-04 19:53:52,021][model8_pretrain.py][INFO] Epoch:[0/2](475100/4588595) loss:3.628 lr:0.0000100 epoch_Time:26096.0min: [2024-01-04 19:53:52,021][model8_pretrain.py][INFO] Epoch:[0/2](475100/4588595) loss:2.974 lr:0.0000100 epoch_Time:26096.0min: [2024-01-04 19:53:52,021][model8_pretrain.py][INFO] Epoch:[0/2](475100/4588595) loss:2.863 lr:0.0000100 epoch_Time:26096.0min: [2024-01-04 19:53:52,021][model8_pretrain.py][INFO] Epoch:[0/2](475100/4588595) loss:2.675 lr:0.0000100 epoch_Time:26096.0min: [2024-01-04 19:54:28,966][model8_pretrain.py][INFO] Epoch:[0/2](475200/4588595) loss:3.179 lr:0.0000100 epoch_Time:26096.0min: [2024-01-04 19:54:28,966][model8_pretrain.py][INFO] Epoch:[0/2](475200/4588595) loss:3.211 lr:0.0000100 epoch_Time:26096.0min: [2024-01-04 19:54:28,966][model8_pretrain.py][INFO] Epoch:[0/2](475200/4588595) loss:3.236 lr:0.0000100 epoch_Time:26096.0min: [2024-01-04 19:54:28,967][model8_pretrain.py][INFO] Epoch:[0/2](475200/4588595) loss:3.168 lr:0.0000100 epoch_Time:26096.0min: [2024-01-04 19:54:28,967][model8_pretrain.py][INFO] Epoch:[0/2](475200/4588595) loss:2.848 lr:0.0000100 epoch_Time:26096.0min: [2024-01-04 19:54:28,967][model8_pretrain.py][INFO] Epoch:[0/2](475200/4588595) loss:2.912 lr:0.0000100 epoch_Time:26096.0min: [2024-01-04 19:54:28,967][model8_pretrain.py][INFO] Epoch:[0/2](475200/4588595) loss:2.352 lr:0.0000100 epoch_Time:26096.0min: [2024-01-04 19:54:28,967][model8_pretrain.py][INFO] Epoch:[0/2](475200/4588595) loss:2.759 lr:0.0000100 epoch_Time:26096.0min: [2024-01-04 19:55:05,904][model8_pretrain.py][INFO] Epoch:[0/2](475300/4588595) loss:3.188 lr:0.0000100 
epoch_Time:26095.0min: [2024-01-04 19:55:05,904][model8_pretrain.py][INFO] Epoch:[0/2](475300/4588595) loss:2.646 lr:0.0000100 epoch_Time:26095.0min: [2024-01-04 19:55:05,904][model8_pretrain.py][INFO] Epoch:[0/2](475300/4588595) loss:2.996 lr:0.0000100 epoch_Time:26095.0min: [2024-01-04 19:55:05,904][model8_pretrain.py][INFO] Epoch:[0/2](475300/4588595) loss:3.115 lr:0.0000100 epoch_Time:26095.0min: [2024-01-04 19:55:05,904][model8_pretrain.py][INFO] Epoch:[0/2](475300/4588595) loss:2.499 lr:0.0000100 epoch_Time:26095.0min: [2024-01-04 19:55:05,904][model8_pretrain.py][INFO] Epoch:[0/2](475300/4588595) loss:3.227 lr:0.0000100 epoch_Time:26095.0min: [2024-01-04 19:55:05,904][model8_pretrain.py][INFO] Epoch:[0/2](475300/4588595) loss:2.001 lr:0.0000100 epoch_Time:26095.0min: [2024-01-04 19:55:05,904][model8_pretrain.py][INFO] Epoch:[0/2](475300/4588595) loss:2.757 lr:0.0000100 epoch_Time:26095.0min: [2024-01-04 19:55:42,853][model8_pretrain.py][INFO] Epoch:[0/2](475400/4588595) loss:2.419 lr:0.0000100 epoch_Time:26094.0min: [2024-01-04 19:55:42,853][model8_pretrain.py][INFO] Epoch:[0/2](475400/4588595) loss:2.304 lr:0.0000100 epoch_Time:26094.0min: [2024-01-04 19:55:42,853][model8_pretrain.py][INFO] Epoch:[0/2](475400/4588595) loss:3.304 lr:0.0000100 epoch_Time:26094.0min: [2024-01-04 19:55:42,853][model8_pretrain.py][INFO] Epoch:[0/2](475400/4588595) loss:2.696 lr:0.0000100 epoch_Time:26094.0min: [2024-01-04 19:55:42,853][model8_pretrain.py][INFO] Epoch:[0/2](475400/4588595) loss:2.727 lr:0.0000100 epoch_Time:26094.0min: [2024-01-04 19:55:42,853][model8_pretrain.py][INFO] Epoch:[0/2](475400/4588595) loss:3.035 lr:0.0000100 epoch_Time:26094.0min: [2024-01-04 19:55:42,853][model8_pretrain.py][INFO] Epoch:[0/2](475400/4588595) loss:2.783 lr:0.0000100 epoch_Time:26094.0min: [2024-01-04 19:55:42,853][model8_pretrain.py][INFO] Epoch:[0/2](475400/4588595) loss:2.857 lr:0.0000100 epoch_Time:26094.0min: [2024-01-04 19:56:19,797][model8_pretrain.py][INFO] 
Epoch:[0/2](475500/4588595) loss:2.685 lr:0.0000100 epoch_Time:26093.0min: [2024-01-04 19:56:19,797][model8_pretrain.py][INFO] Epoch:[0/2](475500/4588595) loss:2.591 lr:0.0000100 epoch_Time:26093.0min: [2024-01-04 19:56:19,797][model8_pretrain.py][INFO] Epoch:[0/2](475500/4588595) loss:3.187 lr:0.0000100 epoch_Time:26093.0min: [2024-01-04 19:56:19,797][model8_pretrain.py][INFO] Epoch:[0/2](475500/4588595) loss:3.067 lr:0.0000100 epoch_Time:26093.0min: [2024-01-04 19:56:19,797][model8_pretrain.py][INFO] Epoch:[0/2](475500/4588595) loss:3.140 lr:0.0000100 epoch_Time:26093.0min: [2024-01-04 19:56:19,797][model8_pretrain.py][INFO] Epoch:[0/2](475500/4588595) loss:2.734 lr:0.0000100 epoch_Time:26093.0min: [2024-01-04 19:56:19,798][model8_pretrain.py][INFO] Epoch:[0/2](475500/4588595) loss:2.734 lr:0.0000100 epoch_Time:26093.0min: [2024-01-04 19:56:19,798][model8_pretrain.py][INFO] Epoch:[0/2](475500/4588595) loss:2.803 lr:0.0000100 epoch_Time:26093.0min: [2024-01-04 19:57:06,480][model8_pretrain.py][INFO] Epoch:[0/2](475600/4588595) loss:3.192 lr:0.0000100 epoch_Time:26094.0min: [2024-01-04 19:57:06,480][model8_pretrain.py][INFO] Epoch:[0/2](475600/4588595) loss:2.615 lr:0.0000100 epoch_Time:26094.0min: [2024-01-04 19:57:06,480][model8_pretrain.py][INFO] Epoch:[0/2](475600/4588595) loss:2.229 lr:0.0000100 epoch_Time:26094.0min: [2024-01-04 19:57:06,480][model8_pretrain.py][INFO] Epoch:[0/2](475600/4588595) loss:3.051 lr:0.0000100 epoch_Time:26094.0min: [2024-01-04 19:57:06,480][model8_pretrain.py][INFO] Epoch:[0/2](475600/4588595) loss:2.810 lr:0.0000100 epoch_Time:26094.0min: [2024-01-04 19:57:06,480][model8_pretrain.py][INFO] Epoch:[0/2](475600/4588595) loss:2.906 lr:0.0000100 epoch_Time:26094.0min: [2024-01-04 19:57:06,480][model8_pretrain.py][INFO] Epoch:[0/2](475600/4588595) loss:2.510 lr:0.0000100 epoch_Time:26094.0min: [2024-01-04 19:57:06,480][model8_pretrain.py][INFO] Epoch:[0/2](475600/4588595) loss:2.912 lr:0.0000100 epoch_Time:26094.0min: [2024-01-04 
19:57:43,445][model8_pretrain.py][INFO] Epoch:[0/2](475700/4588595) loss:2.830 lr:0.0000100 epoch_Time:26093.0min: [2024-01-04 19:57:43,445][model8_pretrain.py][INFO] Epoch:[0/2](475700/4588595) loss:3.242 lr:0.0000100 epoch_Time:26093.0min: [2024-01-04 19:57:43,445][model8_pretrain.py][INFO] Epoch:[0/2](475700/4588595) loss:2.938 lr:0.0000100 epoch_Time:26093.0min: [2024-01-04 19:57:43,445][model8_pretrain.py][INFO] Epoch:[0/2](475700/4588595) loss:2.567 lr:0.0000100 epoch_Time:26093.0min: [2024-01-04 19:57:43,445][model8_pretrain.py][INFO] Epoch:[0/2](475700/4588595) loss:2.667 lr:0.0000100 epoch_Time:26093.0min: [2024-01-04 19:57:43,445][model8_pretrain.py][INFO] Epoch:[0/2](475700/4588595) loss:2.748 lr:0.0000100 epoch_Time:26093.0min: [2024-01-04 19:57:43,445][model8_pretrain.py][INFO] Epoch:[0/2](475700/4588595) loss:2.967 lr:0.0000100 epoch_Time:26093.0min: [2024-01-04 19:57:43,445][model8_pretrain.py][INFO] Epoch:[0/2](475700/4588595) loss:2.828 lr:0.0000100 epoch_Time:26093.0min: [2024-01-04 19:58:20,395][model8_pretrain.py][INFO] Epoch:[0/2](475800/4588595) loss:3.050 lr:0.0000100 epoch_Time:26092.0min: [2024-01-04 19:58:20,395][model8_pretrain.py][INFO] Epoch:[0/2](475800/4588595) loss:2.756 lr:0.0000100 epoch_Time:26092.0min: [2024-01-04 19:58:20,395][model8_pretrain.py][INFO] Epoch:[0/2](475800/4588595) loss:2.905 lr:0.0000100 epoch_Time:26092.0min: [2024-01-04 19:58:20,395][model8_pretrain.py][INFO] Epoch:[0/2](475800/4588595) loss:2.607 lr:0.0000100 epoch_Time:26092.0min: [2024-01-04 19:58:20,395][model8_pretrain.py][INFO] Epoch:[0/2](475800/4588595) loss:2.715 lr:0.0000100 epoch_Time:26092.0min: [2024-01-04 19:58:20,395][model8_pretrain.py][INFO] Epoch:[0/2](475800/4588595) loss:2.996 lr:0.0000100 epoch_Time:26092.0min: [2024-01-04 19:58:20,395][model8_pretrain.py][INFO] Epoch:[0/2](475800/4588595) loss:3.518 lr:0.0000100 epoch_Time:26092.0min: [2024-01-04 19:58:20,395][model8_pretrain.py][INFO] Epoch:[0/2](475800/4588595) loss:3.050 lr:0.0000100 
epoch_Time:26092.0min: [2024-01-04 19:58:57,358][model8_pretrain.py][INFO] Epoch:[0/2](475900/4588595) loss:3.001 lr:0.0000100 epoch_Time:26091.0min: [2024-01-04 19:58:57,358][model8_pretrain.py][INFO] Epoch:[0/2](475900/4588595) loss:2.714 lr:0.0000100 epoch_Time:26091.0min: [2024-01-04 19:58:57,358][model8_pretrain.py][INFO] Epoch:[0/2](475900/4588595) loss:3.043 lr:0.0000100 epoch_Time:26091.0min: [2024-01-04 19:58:57,358][model8_pretrain.py][INFO] Epoch:[0/2](475900/4588595) loss:2.698 lr:0.0000100 epoch_Time:26091.0min: [2024-01-04 19:58:57,358][model8_pretrain.py][INFO] Epoch:[0/2](475900/4588595) loss:3.077 lr:0.0000100 epoch_Time:26091.0min: [2024-01-04 19:58:57,358][model8_pretrain.py][INFO] Epoch:[0/2](475900/4588595) loss:2.661 lr:0.0000100 epoch_Time:26091.0min: [2024-01-04 19:58:57,358][model8_pretrain.py][INFO] Epoch:[0/2](475900/4588595) loss:2.776 lr:0.0000100 epoch_Time:26091.0min: [2024-01-04 19:58:57,358][model8_pretrain.py][INFO] Epoch:[0/2](475900/4588595) loss:2.994 lr:0.0000100 epoch_Time:26091.0min: [2024-01-04 19:59:34,319][model8_pretrain.py][INFO] Epoch:[0/2](476000/4588595) loss:2.705 lr:0.0000100 epoch_Time:26091.0min: [2024-01-04 19:59:34,319][model8_pretrain.py][INFO] Epoch:[0/2](476000/4588595) loss:2.821 lr:0.0000100 epoch_Time:26091.0min: [2024-01-04 19:59:34,319][model8_pretrain.py][INFO] Epoch:[0/2](476000/4588595) loss:2.872 lr:0.0000100 epoch_Time:26091.0min: [2024-01-04 19:59:34,319][model8_pretrain.py][INFO] Epoch:[0/2](476000/4588595) loss:2.502 lr:0.0000100 epoch_Time:26091.0min: [2024-01-04 19:59:34,319][model8_pretrain.py][INFO] Epoch:[0/2](476000/4588595) loss:2.756 lr:0.0000100 epoch_Time:26091.0min: [2024-01-04 19:59:34,319][model8_pretrain.py][INFO] Epoch:[0/2](476000/4588595) loss:2.175 lr:0.0000100 epoch_Time:26091.0min: [2024-01-04 19:59:34,320][model8_pretrain.py][INFO] Epoch:[0/2](476000/4588595) loss:3.127 lr:0.0000100 epoch_Time:26091.0min: [2024-01-04 19:59:34,320][model8_pretrain.py][INFO] 
Epoch:[0/2](476000/4588595) loss:2.651 lr:0.0000100 epoch_Time:26091.0min: [2024-01-04 20:00:11,265][model8_pretrain.py][INFO] Epoch:[0/2](476100/4588595) loss:3.081 lr:0.0000100 epoch_Time:26090.0min: [2024-01-04 20:00:11,265][model8_pretrain.py][INFO] Epoch:[0/2](476100/4588595) loss:3.153 lr:0.0000100 epoch_Time:26090.0min: [2024-01-04 20:00:11,265][model8_pretrain.py][INFO] Epoch:[0/2](476100/4588595) loss:3.043 lr:0.0000100 epoch_Time:26090.0min: [2024-01-04 20:00:11,265][model8_pretrain.py][INFO] Epoch:[0/2](476100/4588595) loss:2.767 lr:0.0000100 epoch_Time:26090.0min: [2024-01-04 20:00:11,265][model8_pretrain.py][INFO] Epoch:[0/2](476100/4588595) loss:2.850 lr:0.0000100 epoch_Time:26090.0min: [2024-01-04 20:00:11,265][model8_pretrain.py][INFO] Epoch:[0/2](476100/4588595) loss:2.730 lr:0.0000100 epoch_Time:26090.0min: [2024-01-04 20:00:11,265][model8_pretrain.py][INFO] Epoch:[0/2](476100/4588595) loss:2.620 lr:0.0000100 epoch_Time:26090.0min: [2024-01-04 20:00:11,265][model8_pretrain.py][INFO] Epoch:[0/2](476100/4588595) loss:3.261 lr:0.0000100 epoch_Time:26090.0min: [2024-01-04 20:00:48,205][model8_pretrain.py][INFO] Epoch:[0/2](476200/4588595) loss:3.190 lr:0.0000100 epoch_Time:26088.0min: [2024-01-04 20:00:48,205][model8_pretrain.py][INFO] Epoch:[0/2](476200/4588595) loss:3.115 lr:0.0000100 epoch_Time:26088.0min: [2024-01-04 20:00:48,205][model8_pretrain.py][INFO] Epoch:[0/2](476200/4588595) loss:3.072 lr:0.0000100 epoch_Time:26088.0min: [2024-01-04 20:00:48,205][model8_pretrain.py][INFO] Epoch:[0/2](476200/4588595) loss:2.111 lr:0.0000100 epoch_Time:26088.0min: [2024-01-04 20:00:48,205][model8_pretrain.py][INFO] Epoch:[0/2](476200/4588595) loss:2.980 lr:0.0000100 epoch_Time:26088.0min: [2024-01-04 20:00:48,205][model8_pretrain.py][INFO] Epoch:[0/2](476200/4588595) loss:2.367 lr:0.0000100 epoch_Time:26088.0min: [2024-01-04 20:00:48,205][model8_pretrain.py][INFO] Epoch:[0/2](476200/4588595) loss:2.772 lr:0.0000100 epoch_Time:26088.0min: [2024-01-04 
20:00:48,205][model8_pretrain.py][INFO] Epoch:[0/2](476200/4588595) loss:3.195 lr:0.0000100 epoch_Time:26088.0min: [2024-01-04 20:01:25,128][model8_pretrain.py][INFO] Epoch:[0/2](476300/4588595) loss:2.916 lr:0.0000100 epoch_Time:26088.0min: [2024-01-04 20:01:25,128][model8_pretrain.py][INFO] Epoch:[0/2](476300/4588595) loss:2.861 lr:0.0000100 epoch_Time:26088.0min: [2024-01-04 20:01:25,128][model8_pretrain.py][INFO] Epoch:[0/2](476300/4588595) loss:3.210 lr:0.0000100 epoch_Time:26088.0min: [2024-01-04 20:01:25,128][model8_pretrain.py][INFO] Epoch:[0/2](476300/4588595) loss:3.159 lr:0.0000100 epoch_Time:26088.0min: [2024-01-04 20:01:25,128][model8_pretrain.py][INFO] Epoch:[0/2](476300/4588595) loss:3.417 lr:0.0000100 epoch_Time:26088.0min: [2024-01-04 20:01:25,128][model8_pretrain.py][INFO] Epoch:[0/2](476300/4588595) loss:2.555 lr:0.0000100 epoch_Time:26088.0min: [2024-01-04 20:01:25,128][model8_pretrain.py][INFO] Epoch:[0/2](476300/4588595) loss:3.287 lr:0.0000100 epoch_Time:26088.0min: [2024-01-04 20:01:25,128][model8_pretrain.py][INFO] Epoch:[0/2](476300/4588595) loss:3.139 lr:0.0000100 epoch_Time:26088.0min: [2024-01-04 20:02:12,451][model8_pretrain.py][INFO] Epoch:[0/2](476400/4588595) loss:2.705 lr:0.0000100 epoch_Time:26089.0min: [2024-01-04 20:02:12,451][model8_pretrain.py][INFO] Epoch:[0/2](476400/4588595) loss:3.109 lr:0.0000100 epoch_Time:26089.0min: [2024-01-04 20:02:12,451][model8_pretrain.py][INFO] Epoch:[0/2](476400/4588595) loss:2.474 lr:0.0000100 epoch_Time:26089.0min: [2024-01-04 20:02:12,452][model8_pretrain.py][INFO] Epoch:[0/2](476400/4588595) loss:2.496 lr:0.0000100 epoch_Time:26089.0min: [2024-01-04 20:02:12,451][model8_pretrain.py][INFO] Epoch:[0/2](476400/4588595) loss:3.331 lr:0.0000100 epoch_Time:26089.0min: [2024-01-04 20:02:12,452][model8_pretrain.py][INFO] Epoch:[0/2](476400/4588595) loss:2.631 lr:0.0000100 epoch_Time:26089.0min: [2024-01-04 20:02:12,452][model8_pretrain.py][INFO] Epoch:[0/2](476400/4588595) loss:3.269 lr:0.0000100 
epoch_Time:26089.0min: [2024-01-04 20:02:12,452][model8_pretrain.py][INFO] Epoch:[0/2](476400/4588595) loss:2.825 lr:0.0000100 epoch_Time:26089.0min: [2024-01-04 20:02:49,379][model8_pretrain.py][INFO] Epoch:[0/2](476500/4588595) loss:3.243 lr:0.0000100 epoch_Time:26088.0min: [2024-01-04 20:02:49,379][model8_pretrain.py][INFO] Epoch:[0/2](476500/4588595) loss:3.000 lr:0.0000100 epoch_Time:26088.0min: [2024-01-04 20:02:49,379][model8_pretrain.py][INFO] Epoch:[0/2](476500/4588595) loss:3.075 lr:0.0000100 epoch_Time:26088.0min: [2024-01-04 20:02:49,379][model8_pretrain.py][INFO] Epoch:[0/2](476500/4588595) loss:3.311 lr:0.0000100 epoch_Time:26088.0min: [2024-01-04 20:02:49,379][model8_pretrain.py][INFO] Epoch:[0/2](476500/4588595) loss:3.257 lr:0.0000100 epoch_Time:26088.0min: [2024-01-04 20:02:49,379][model8_pretrain.py][INFO] Epoch:[0/2](476500/4588595) loss:3.118 lr:0.0000100 epoch_Time:26088.0min: [2024-01-04 20:02:49,379][model8_pretrain.py][INFO] Epoch:[0/2](476500/4588595) loss:2.310 lr:0.0000100 epoch_Time:26088.0min: [2024-01-04 20:02:49,379][model8_pretrain.py][INFO] Epoch:[0/2](476500/4588595) loss:3.076 lr:0.0000100 epoch_Time:26088.0min: [2024-01-04 20:03:26,309][model8_pretrain.py][INFO] Epoch:[0/2](476600/4588595) loss:2.966 lr:0.0000100 epoch_Time:26087.0min: [2024-01-04 20:03:26,309][model8_pretrain.py][INFO] Epoch:[0/2](476600/4588595) loss:2.729 lr:0.0000100 epoch_Time:26087.0min: [2024-01-04 20:03:26,309][model8_pretrain.py][INFO] Epoch:[0/2](476600/4588595) loss:3.301 lr:0.0000100 epoch_Time:26087.0min: [2024-01-04 20:03:26,309][model8_pretrain.py][INFO] Epoch:[0/2](476600/4588595) loss:3.020 lr:0.0000100 epoch_Time:26087.0min: [2024-01-04 20:03:26,309][model8_pretrain.py][INFO] Epoch:[0/2](476600/4588595) loss:3.035 lr:0.0000100 epoch_Time:26087.0min: [2024-01-04 20:03:26,309][model8_pretrain.py][INFO] Epoch:[0/2](476600/4588595) loss:2.652 lr:0.0000100 epoch_Time:26087.0min: [2024-01-04 20:03:26,309][model8_pretrain.py][INFO] 
Epoch:[0/2](476600/4588595) loss:2.706 lr:0.0000100 epoch_Time:26087.0min: [2024-01-04 20:03:26,309][model8_pretrain.py][INFO] Epoch:[0/2](476600/4588595) loss:3.027 lr:0.0000100 epoch_Time:26087.0min: [2024-01-04 20:04:03,241][model8_pretrain.py][INFO] Epoch:[0/2](476700/4588595) loss:2.395 lr:0.0000100 epoch_Time:26086.0min: [2024-01-04 20:04:03,241][model8_pretrain.py][INFO] Epoch:[0/2](476700/4588595) loss:2.887 lr:0.0000100 epoch_Time:26086.0min: [2024-01-04 20:04:03,241][model8_pretrain.py][INFO] Epoch:[0/2](476700/4588595) loss:2.988 lr:0.0000100 epoch_Time:26086.0min: [2024-01-04 20:04:03,241][model8_pretrain.py][INFO] Epoch:[0/2](476700/4588595) loss:3.202 lr:0.0000100 epoch_Time:26086.0min: [2024-01-04 20:04:03,241][model8_pretrain.py][INFO] Epoch:[0/2](476700/4588595) loss:2.592 lr:0.0000100 epoch_Time:26086.0min: [2024-01-04 20:04:03,241][model8_pretrain.py][INFO] Epoch:[0/2](476700/4588595) loss:2.812 lr:0.0000100 epoch_Time:26086.0min: [2024-01-04 20:04:03,241][model8_pretrain.py][INFO] Epoch:[0/2](476700/4588595) loss:2.642 lr:0.0000100 epoch_Time:26086.0min: [2024-01-04 20:04:03,241][model8_pretrain.py][INFO] Epoch:[0/2](476700/4588595) loss:3.130 lr:0.0000100 epoch_Time:26086.0min: [2024-01-04 20:04:40,175][model8_pretrain.py][INFO] Epoch:[0/2](476800/4588595) loss:3.125 lr:0.0000100 epoch_Time:26086.0min: [2024-01-04 20:04:40,175][model8_pretrain.py][INFO] Epoch:[0/2](476800/4588595) loss:3.158 lr:0.0000100 epoch_Time:26086.0min: [2024-01-04 20:04:40,175][model8_pretrain.py][INFO] Epoch:[0/2](476800/4588595) loss:2.922 lr:0.0000100 epoch_Time:26086.0min: [2024-01-04 20:04:40,175][model8_pretrain.py][INFO] Epoch:[0/2](476800/4588595) loss:2.404 lr:0.0000100 epoch_Time:26086.0min: [2024-01-04 20:04:40,175][model8_pretrain.py][INFO] Epoch:[0/2](476800/4588595) loss:3.184 lr:0.0000100 epoch_Time:26086.0min: [2024-01-04 20:04:40,175][model8_pretrain.py][INFO] Epoch:[0/2](476800/4588595) loss:2.227 lr:0.0000100 epoch_Time:26086.0min: [2024-01-04 
20:04:40,175][model8_pretrain.py][INFO] Epoch:[0/2](476800/4588595) loss:2.755 lr:0.0000100 epoch_Time:26086.0min: [2024-01-04 20:04:40,175][model8_pretrain.py][INFO] Epoch:[0/2](476800/4588595) loss:2.653 lr:0.0000100 epoch_Time:26086.0min: [2024-01-04 20:05:17,105][model8_pretrain.py][INFO] Epoch:[0/2](476900/4588595) loss:2.582 lr:0.0000100 epoch_Time:26085.0min: [2024-01-04 20:05:17,105][model8_pretrain.py][INFO] Epoch:[0/2](476900/4588595) loss:2.939 lr:0.0000100 epoch_Time:26085.0min: [2024-01-04 20:05:17,105][model8_pretrain.py][INFO] Epoch:[0/2](476900/4588595) loss:3.075 lr:0.0000100 epoch_Time:26085.0min: [2024-01-04 20:05:17,105][model8_pretrain.py][INFO] Epoch:[0/2](476900/4588595) loss:2.686 lr:0.0000100 epoch_Time:26085.0min: [2024-01-04 20:05:17,105][model8_pretrain.py][INFO] Epoch:[0/2](476900/4588595) loss:3.163 lr:0.0000100 epoch_Time:26085.0min: [2024-01-04 20:05:17,105][model8_pretrain.py][INFO] Epoch:[0/2](476900/4588595) loss:2.978 lr:0.0000100 epoch_Time:26085.0min: [2024-01-04 20:05:17,105][model8_pretrain.py][INFO] Epoch:[0/2](476900/4588595) loss:2.924 lr:0.0000100 epoch_Time:26085.0min: [2024-01-04 20:05:17,106][model8_pretrain.py][INFO] Epoch:[0/2](476900/4588595) loss:2.965 lr:0.0000100 epoch_Time:26085.0min: [2024-01-04 20:05:54,047][model8_pretrain.py][INFO] Epoch:[0/2](477000/4588595) loss:2.863 lr:0.0000100 epoch_Time:26084.0min: [2024-01-04 20:05:54,047][model8_pretrain.py][INFO] Epoch:[0/2](477000/4588595) loss:3.138 lr:0.0000100 epoch_Time:26084.0min: [2024-01-04 20:05:54,047][model8_pretrain.py][INFO] Epoch:[0/2](477000/4588595) loss:2.984 lr:0.0000100 epoch_Time:26084.0min: [2024-01-04 20:05:54,047][model8_pretrain.py][INFO] Epoch:[0/2](477000/4588595) loss:3.139 lr:0.0000100 epoch_Time:26084.0min: [2024-01-04 20:05:54,047][model8_pretrain.py][INFO] Epoch:[0/2](477000/4588595) loss:3.027 lr:0.0000100 epoch_Time:26084.0min: [2024-01-04 20:05:54,048][model8_pretrain.py][INFO] Epoch:[0/2](477000/4588595) loss:3.269 lr:0.0000100 
epoch_Time:26084.0min: [2024-01-04 20:05:54,048][model8_pretrain.py][INFO] Epoch:[0/2](477000/4588595) loss:3.104 lr:0.0000100 epoch_Time:26084.0min: [2024-01-04 20:05:54,049][model8_pretrain.py][INFO] Epoch:[0/2](477000/4588595) loss:2.229 lr:0.0000100 epoch_Time:26084.0min: [2024-01-04 20:06:30,987][model8_pretrain.py][INFO] Epoch:[0/2](477100/4588595) loss:2.832 lr:0.0000100 epoch_Time:26083.0min: [2024-01-04 20:06:30,987][model8_pretrain.py][INFO] Epoch:[0/2](477100/4588595) loss:2.438 lr:0.0000100 epoch_Time:26083.0min: [2024-01-04 20:06:30,987][model8_pretrain.py][INFO] Epoch:[0/2](477100/4588595) loss:3.371 lr:0.0000100 epoch_Time:26083.0min: [2024-01-04 20:06:30,987][model8_pretrain.py][INFO] Epoch:[0/2](477100/4588595) loss:3.173 lr:0.0000100 epoch_Time:26083.0min: [2024-01-04 20:06:30,987][model8_pretrain.py][INFO] Epoch:[0/2](477100/4588595) loss:3.018 lr:0.0000100 epoch_Time:26083.0min: [2024-01-04 20:06:30,987][model8_pretrain.py][INFO] Epoch:[0/2](477100/4588595) loss:3.271 lr:0.0000100 epoch_Time:26083.0min: [2024-01-04 20:06:30,988][model8_pretrain.py][INFO] Epoch:[0/2](477100/4588595) loss:2.691 lr:0.0000100 epoch_Time:26083.0min: [2024-01-04 20:06:30,988][model8_pretrain.py][INFO] Epoch:[0/2](477100/4588595) loss:2.925 lr:0.0000100 epoch_Time:26083.0min: [2024-01-04 20:07:18,228][model8_pretrain.py][INFO] Epoch:[0/2](477200/4588595) loss:2.962 lr:0.0000100 epoch_Time:26084.0min: [2024-01-04 20:07:18,228][model8_pretrain.py][INFO] Epoch:[0/2](477200/4588595) loss:2.311 lr:0.0000100 epoch_Time:26084.0min: [2024-01-04 20:07:18,228][model8_pretrain.py][INFO] Epoch:[0/2](477200/4588595) loss:2.979 lr:0.0000100 epoch_Time:26084.0min: [2024-01-04 20:07:18,228][model8_pretrain.py][INFO] Epoch:[0/2](477200/4588595) loss:2.945 lr:0.0000100 epoch_Time:26084.0min: [2024-01-04 20:07:18,228][model8_pretrain.py][INFO] Epoch:[0/2](477200/4588595) loss:2.875 lr:0.0000100 epoch_Time:26084.0min: [2024-01-04 20:07:18,228][model8_pretrain.py][INFO] 
Epoch:[0/2](477200/4588595) loss:3.202 lr:0.0000100 epoch_Time:26084.0min: [2024-01-04 20:07:18,228][model8_pretrain.py][INFO] Epoch:[0/2](477200/4588595) loss:2.552 lr:0.0000100 epoch_Time:26084.0min: [2024-01-04 20:07:18,228][model8_pretrain.py][INFO] Epoch:[0/2](477200/4588595) loss:2.944 lr:0.0000100 epoch_Time:26084.0min: [2024-01-04 20:07:55,154][model8_pretrain.py][INFO] Epoch:[0/2](477300/4588595) loss:2.992 lr:0.0000100 epoch_Time:26083.0min: [2024-01-04 20:07:55,154][model8_pretrain.py][INFO] Epoch:[0/2](477300/4588595) loss:2.545 lr:0.0000100 epoch_Time:26083.0min: [2024-01-04 20:07:55,154][model8_pretrain.py][INFO] Epoch:[0/2](477300/4588595) loss:2.808 lr:0.0000100 epoch_Time:26083.0min: [2024-01-04 20:07:55,154][model8_pretrain.py][INFO] Epoch:[0/2](477300/4588595) loss:2.772 lr:0.0000100 epoch_Time:26083.0min: [2024-01-04 20:07:55,154][model8_pretrain.py][INFO] Epoch:[0/2](477300/4588595) loss:2.423 lr:0.0000100 epoch_Time:26083.0min: [2024-01-04 20:07:55,154][model8_pretrain.py][INFO] Epoch:[0/2](477300/4588595) loss:3.465 lr:0.0000100 epoch_Time:26083.0min: [2024-01-04 20:07:55,154][model8_pretrain.py][INFO] Epoch:[0/2](477300/4588595) loss:2.005 lr:0.0000100 epoch_Time:26083.0min: [2024-01-04 20:07:55,154][model8_pretrain.py][INFO] Epoch:[0/2](477300/4588595) loss:3.069 lr:0.0000100 epoch_Time:26083.0min: [2024-01-04 20:08:32,090][model8_pretrain.py][INFO] Epoch:[0/2](477400/4588595) loss:3.150 lr:0.0000100 epoch_Time:26083.0min: [2024-01-04 20:08:32,090][model8_pretrain.py][INFO] Epoch:[0/2](477400/4588595) loss:3.440 lr:0.0000100 epoch_Time:26083.0min: [2024-01-04 20:08:32,090][model8_pretrain.py][INFO] Epoch:[0/2](477400/4588595) loss:2.568 lr:0.0000100 epoch_Time:26083.0min: [2024-01-04 20:08:32,090][model8_pretrain.py][INFO] Epoch:[0/2](477400/4588595) loss:2.670 lr:0.0000100 epoch_Time:26083.0min: [2024-01-04 20:08:32,090][model8_pretrain.py][INFO] Epoch:[0/2](477400/4588595) loss:2.105 lr:0.0000100 epoch_Time:26083.0min: [2024-01-04 
20:08:32,090][model8_pretrain.py][INFO] Epoch:[0/2](477400/4588595) loss:1.878 lr:0.0000100 epoch_Time:26083.0min: [2024-01-04 20:08:32,090][model8_pretrain.py][INFO] Epoch:[0/2](477400/4588595) loss:3.170 lr:0.0000100 epoch_Time:26083.0min: [2024-01-04 20:08:32,090][model8_pretrain.py][INFO] Epoch:[0/2](477400/4588595) loss:3.002 lr:0.0000100 epoch_Time:26083.0min: [2024-01-04 20:09:09,024][model8_pretrain.py][INFO] Epoch:[0/2](477500/4588595) loss:2.932 lr:0.0000100 epoch_Time:26081.0min: [2024-01-04 20:09:09,024][model8_pretrain.py][INFO] Epoch:[0/2](477500/4588595) loss:2.579 lr:0.0000100 epoch_Time:26081.0min: [2024-01-04 20:09:09,024][model8_pretrain.py][INFO] Epoch:[0/2](477500/4588595) loss:2.775 lr:0.0000100 epoch_Time:26081.0min: [2024-01-04 20:09:09,024][model8_pretrain.py][INFO] Epoch:[0/2](477500/4588595) loss:2.973 lr:0.0000100 epoch_Time:26081.0min: [2024-01-04 20:09:09,024][model8_pretrain.py][INFO] Epoch:[0/2](477500/4588595) loss:2.805 lr:0.0000100 epoch_Time:26081.0min: [2024-01-04 20:09:09,024][model8_pretrain.py][INFO] Epoch:[0/2](477500/4588595) loss:2.843 lr:0.0000100 epoch_Time:26081.0min: [2024-01-04 20:09:09,024][model8_pretrain.py][INFO] Epoch:[0/2](477500/4588595) loss:2.743 lr:0.0000100 epoch_Time:26081.0min: [2024-01-04 20:09:09,024][model8_pretrain.py][INFO] Epoch:[0/2](477500/4588595) loss:3.060 lr:0.0000100 epoch_Time:26081.0min: [2024-01-04 20:09:45,957][model8_pretrain.py][INFO] Epoch:[0/2](477600/4588595) loss:3.076 lr:0.0000100 epoch_Time:26081.0min: [2024-01-04 20:09:45,957][model8_pretrain.py][INFO] Epoch:[0/2](477600/4588595) loss:3.134 lr:0.0000100 epoch_Time:26081.0min: [2024-01-04 20:09:45,957][model8_pretrain.py][INFO] Epoch:[0/2](477600/4588595) loss:3.280 lr:0.0000100 epoch_Time:26081.0min: [2024-01-04 20:09:45,957][model8_pretrain.py][INFO] Epoch:[0/2](477600/4588595) loss:2.366 lr:0.0000100 epoch_Time:26081.0min: [2024-01-04 20:09:45,957][model8_pretrain.py][INFO] Epoch:[0/2](477600/4588595) loss:2.548 lr:0.0000100 
epoch_Time:26081.0min: [2024-01-04 20:09:45,957][model8_pretrain.py][INFO] Epoch:[0/2](477600/4588595) loss:2.998 lr:0.0000100 epoch_Time:26081.0min: [2024-01-04 20:09:45,957][model8_pretrain.py][INFO] Epoch:[0/2](477600/4588595) loss:2.650 lr:0.0000100 epoch_Time:26081.0min: [2024-01-04 20:09:45,957][model8_pretrain.py][INFO] Epoch:[0/2](477600/4588595) loss:3.252 lr:0.0000100 epoch_Time:26081.0min: [2024-01-04 20:10:22,877][model8_pretrain.py][INFO] Epoch:[0/2](477700/4588595) loss:3.120 lr:0.0000100 epoch_Time:26080.0min: [2024-01-04 20:10:22,877][model8_pretrain.py][INFO] Epoch:[0/2](477700/4588595) loss:3.239 lr:0.0000100 epoch_Time:26080.0min: [2024-01-04 20:10:22,877][model8_pretrain.py][INFO] Epoch:[0/2](477700/4588595) loss:3.289 lr:0.0000100 epoch_Time:26080.0min: [2024-01-04 20:10:22,877][model8_pretrain.py][INFO] Epoch:[0/2](477700/4588595) loss:2.630 lr:0.0000100 epoch_Time:26080.0min: [2024-01-04 20:10:22,877][model8_pretrain.py][INFO] Epoch:[0/2](477700/4588595) loss:2.873 lr:0.0000100 epoch_Time:26080.0min: [2024-01-04 20:10:22,877][model8_pretrain.py][INFO] Epoch:[0/2](477700/4588595) loss:3.031 lr:0.0000100 epoch_Time:26080.0min: [2024-01-04 20:10:22,877][model8_pretrain.py][INFO] Epoch:[0/2](477700/4588595) loss:3.119 lr:0.0000100 epoch_Time:26080.0min: [2024-01-04 20:10:22,877][model8_pretrain.py][INFO] Epoch:[0/2](477700/4588595) loss:3.139 lr:0.0000100 epoch_Time:26080.0min: [2024-01-04 20:10:59,814][model8_pretrain.py][INFO] Epoch:[0/2](477800/4588595) loss:3.202 lr:0.0000100 epoch_Time:26079.0min: [2024-01-04 20:10:59,814][model8_pretrain.py][INFO] Epoch:[0/2](477800/4588595) loss:2.544 lr:0.0000100 epoch_Time:26079.0min: [2024-01-04 20:10:59,814][model8_pretrain.py][INFO] Epoch:[0/2](477800/4588595) loss:2.659 lr:0.0000100 epoch_Time:26079.0min: [2024-01-04 20:10:59,814][model8_pretrain.py][INFO] Epoch:[0/2](477800/4588595) loss:2.835 lr:0.0000100 epoch_Time:26079.0min: [2024-01-04 20:10:59,814][model8_pretrain.py][INFO] 
Epoch:[0/2](477800/4588595) loss:3.125 lr:0.0000100 epoch_Time:26079.0min: [2024-01-04 20:10:59,814][model8_pretrain.py][INFO] Epoch:[0/2](477800/4588595) loss:2.983 lr:0.0000100 epoch_Time:26079.0min: [2024-01-04 20:10:59,814][model8_pretrain.py][INFO] Epoch:[0/2](477800/4588595) loss:2.865 lr:0.0000100 epoch_Time:26079.0min: [2024-01-04 20:10:59,815][model8_pretrain.py][INFO] Epoch:[0/2](477800/4588595) loss:2.778 lr:0.0000100 epoch_Time:26079.0min: [2024-01-04 20:11:36,748][model8_pretrain.py][INFO] Epoch:[0/2](477900/4588595) loss:3.238 lr:0.0000100 epoch_Time:26079.0min: [2024-01-04 20:11:36,749][model8_pretrain.py][INFO] Epoch:[0/2](477900/4588595) loss:2.925 lr:0.0000100 epoch_Time:26079.0min: [2024-01-04 20:11:36,749][model8_pretrain.py][INFO] Epoch:[0/2](477900/4588595) loss:3.099 lr:0.0000100 epoch_Time:26079.0min: [2024-01-04 20:11:36,748][model8_pretrain.py][INFO] Epoch:[0/2](477900/4588595) loss:2.202 lr:0.0000100 epoch_Time:26079.0min: [2024-01-04 20:11:36,749][model8_pretrain.py][INFO] Epoch:[0/2](477900/4588595) loss:2.448 lr:0.0000100 epoch_Time:26079.0min: [2024-01-04 20:11:36,749][model8_pretrain.py][INFO] Epoch:[0/2](477900/4588595) loss:2.562 lr:0.0000100 epoch_Time:26079.0min: [2024-01-04 20:11:36,749][model8_pretrain.py][INFO] Epoch:[0/2](477900/4588595) loss:3.076 lr:0.0000100 epoch_Time:26079.0min: [2024-01-04 20:11:36,749][model8_pretrain.py][INFO] Epoch:[0/2](477900/4588595) loss:3.017 lr:0.0000100 epoch_Time:26079.0min: [2024-01-04 20:12:24,172][model8_pretrain.py][INFO] Epoch:[0/2](478000/4588595) loss:2.834 lr:0.0000100 epoch_Time:26079.0min: [2024-01-04 20:12:24,172][model8_pretrain.py][INFO] Epoch:[0/2](478000/4588595) loss:3.348 lr:0.0000100 epoch_Time:26079.0min: [2024-01-04 20:12:24,172][model8_pretrain.py][INFO] Epoch:[0/2](478000/4588595) loss:2.921 lr:0.0000100 epoch_Time:26079.0min: [2024-01-04 20:12:24,172][model8_pretrain.py][INFO] Epoch:[0/2](478000/4588595) loss:2.994 lr:0.0000100 epoch_Time:26079.0min: [2024-01-04 
20:12:24,172][model8_pretrain.py][INFO] Epoch:[0/2](478000/4588595) loss:2.580 lr:0.0000100 epoch_Time:26079.0min: [2024-01-04 20:12:24,172][model8_pretrain.py][INFO] Epoch:[0/2](478000/4588595) loss:2.800 lr:0.0000100 epoch_Time:26079.0min: [2024-01-04 20:12:24,172][model8_pretrain.py][INFO] Epoch:[0/2](478000/4588595) loss:2.576 lr:0.0000100 epoch_Time:26079.0min: [2024-01-04 20:12:24,173][model8_pretrain.py][INFO] Epoch:[0/2](478000/4588595) loss:2.965 lr:0.0000100 epoch_Time:26079.0min: [2024-01-04 20:13:01,101][model8_pretrain.py][INFO] Epoch:[0/2](478100/4588595) loss:2.677 lr:0.0000100 epoch_Time:26078.0min: [2024-01-04 20:13:01,101][model8_pretrain.py][INFO] Epoch:[0/2](478100/4588595) loss:2.668 lr:0.0000100 epoch_Time:26078.0min: [2024-01-04 20:13:01,102][model8_pretrain.py][INFO] Epoch:[0/2](478100/4588595) loss:2.805 lr:0.0000100 epoch_Time:26078.0min: [2024-01-04 20:13:01,102][model8_pretrain.py][INFO] Epoch:[0/2](478100/4588595) loss:3.136 lr:0.0000100 epoch_Time:26078.0min: [2024-01-04 20:13:01,102][model8_pretrain.py][INFO] Epoch:[0/2](478100/4588595) loss:3.114 lr:0.0000100 epoch_Time:26078.0min: [2024-01-04 20:13:01,101][model8_pretrain.py][INFO] Epoch:[0/2](478100/4588595) loss:2.646 lr:0.0000100 epoch_Time:26078.0min: [2024-01-04 20:13:01,102][model8_pretrain.py][INFO] Epoch:[0/2](478100/4588595) loss:2.555 lr:0.0000100 epoch_Time:26078.0min: [2024-01-04 20:13:01,102][model8_pretrain.py][INFO] Epoch:[0/2](478100/4588595) loss:2.929 lr:0.0000100 epoch_Time:26078.0min: [2024-01-04 20:13:38,025][model8_pretrain.py][INFO] Epoch:[0/2](478200/4588595) loss:3.250 lr:0.0000100 epoch_Time:26078.0min: [2024-01-04 20:13:38,025][model8_pretrain.py][INFO] Epoch:[0/2](478200/4588595) loss:2.753 lr:0.0000100 epoch_Time:26078.0min: [2024-01-04 20:13:38,025][model8_pretrain.py][INFO] Epoch:[0/2](478200/4588595) loss:2.769 lr:0.0000100 epoch_Time:26078.0min: [2024-01-04 20:13:38,025][model8_pretrain.py][INFO] Epoch:[0/2](478200/4588595) loss:2.635 lr:0.0000100 
epoch_Time:26078.0min: [2024-01-04 20:13:38,026][model8_pretrain.py][INFO] Epoch:[0/2](478200/4588595) loss:2.865 lr:0.0000100 epoch_Time:26078.0min: [2024-01-04 20:13:38,026][model8_pretrain.py][INFO] Epoch:[0/2](478200/4588595) loss:2.316 lr:0.0000100 epoch_Time:26078.0min: [2024-01-04 20:13:38,027][model8_pretrain.py][INFO] Epoch:[0/2](478200/4588595) loss:3.134 lr:0.0000100 epoch_Time:26078.0min: [2024-01-04 20:13:38,027][model8_pretrain.py][INFO] Epoch:[0/2](478200/4588595) loss:2.867 lr:0.0000100 epoch_Time:26078.0min: [2024-01-04 20:14:15,007][model8_pretrain.py][INFO] Epoch:[0/2](478300/4588595) loss:2.769 lr:0.0000100 epoch_Time:26077.0min: [2024-01-04 20:14:15,007][model8_pretrain.py][INFO] Epoch:[0/2](478300/4588595) loss:2.943 lr:0.0000100 epoch_Time:26077.0min: [2024-01-04 20:14:15,007][model8_pretrain.py][INFO] Epoch:[0/2](478300/4588595) loss:2.736 lr:0.0000100 epoch_Time:26077.0min: [2024-01-04 20:14:15,007][model8_pretrain.py][INFO] Epoch:[0/2](478300/4588595) loss:3.151 lr:0.0000100 epoch_Time:26077.0min: [2024-01-04 20:14:15,007][model8_pretrain.py][INFO] Epoch:[0/2](478300/4588595) loss:2.835 lr:0.0000100 epoch_Time:26077.0min: [2024-01-04 20:14:15,007][model8_pretrain.py][INFO] Epoch:[0/2](478300/4588595) loss:2.939 lr:0.0000100 epoch_Time:26077.0min: [2024-01-04 20:14:15,007][model8_pretrain.py][INFO] Epoch:[0/2](478300/4588595) loss:2.413 lr:0.0000100 epoch_Time:26077.0min: [2024-01-04 20:14:15,007][model8_pretrain.py][INFO] Epoch:[0/2](478300/4588595) loss:2.666 lr:0.0000100 epoch_Time:26077.0min: [2024-01-04 20:14:51,963][model8_pretrain.py][INFO] Epoch:[0/2](478400/4588595) loss:3.451 lr:0.0000100 epoch_Time:26075.0min: [2024-01-04 20:14:51,963][model8_pretrain.py][INFO] Epoch:[0/2](478400/4588595) loss:3.209 lr:0.0000100 epoch_Time:26075.0min: [2024-01-04 20:14:51,963][model8_pretrain.py][INFO] Epoch:[0/2](478400/4588595) loss:3.270 lr:0.0000100 epoch_Time:26075.0min: [2024-01-04 20:14:51,963][model8_pretrain.py][INFO] 
Epoch:[0/2](478400/4588595) loss:2.583 lr:0.0000100 epoch_Time:26075.0min: [2024-01-04 20:14:51,963][model8_pretrain.py][INFO] Epoch:[0/2](478400/4588595) loss:2.620 lr:0.0000100 epoch_Time:26075.0min: [2024-01-04 20:14:51,963][model8_pretrain.py][INFO] Epoch:[0/2](478400/4588595) loss:3.248 lr:0.0000100 epoch_Time:26075.0min: [2024-01-04 20:14:51,963][model8_pretrain.py][INFO] Epoch:[0/2](478400/4588595) loss:2.585 lr:0.0000100 epoch_Time:26075.0min: [2024-01-04 20:14:51,963][model8_pretrain.py][INFO] Epoch:[0/2](478400/4588595) loss:3.157 lr:0.0000100 epoch_Time:26075.0min: [2024-01-04 20:15:28,901][model8_pretrain.py][INFO] Epoch:[0/2](478500/4588595) loss:2.746 lr:0.0000100 epoch_Time:26075.0min: [2024-01-04 20:15:28,901][model8_pretrain.py][INFO] Epoch:[0/2](478500/4588595) loss:3.045 lr:0.0000100 epoch_Time:26075.0min: [2024-01-04 20:15:28,901][model8_pretrain.py][INFO] Epoch:[0/2](478500/4588595) loss:2.694 lr:0.0000100 epoch_Time:26075.0min: [2024-01-04 20:15:28,901][model8_pretrain.py][INFO] Epoch:[0/2](478500/4588595) loss:2.942 lr:0.0000100 epoch_Time:26075.0min: [2024-01-04 20:15:28,902][model8_pretrain.py][INFO] Epoch:[0/2](478500/4588595) loss:3.212 lr:0.0000100 epoch_Time:26075.0min: [2024-01-04 20:15:28,902][model8_pretrain.py][INFO] Epoch:[0/2](478500/4588595) loss:3.132 lr:0.0000100 epoch_Time:26075.0min: [2024-01-04 20:15:28,902][model8_pretrain.py][INFO] Epoch:[0/2](478500/4588595) loss:2.238 lr:0.0000100 epoch_Time:26075.0min: [2024-01-04 20:15:28,902][model8_pretrain.py][INFO] Epoch:[0/2](478500/4588595) loss:2.615 lr:0.0000100 epoch_Time:26075.0min: [2024-01-04 20:16:05,831][model8_pretrain.py][INFO] Epoch:[0/2](478600/4588595) loss:2.780 lr:0.0000100 epoch_Time:26074.0min: [2024-01-04 20:16:05,831][model8_pretrain.py][INFO] Epoch:[0/2](478600/4588595) loss:2.653 lr:0.0000100 epoch_Time:26074.0min: [2024-01-04 20:16:05,831][model8_pretrain.py][INFO] Epoch:[0/2](478600/4588595) loss:2.485 lr:0.0000100 epoch_Time:26074.0min: [2024-01-04 
20:16:05,831][model8_pretrain.py][INFO] Epoch:[0/2](478600/4588595) loss:2.636 lr:0.0000100 epoch_Time:26074.0min: [2024-01-04 20:16:05,831][model8_pretrain.py][INFO] Epoch:[0/2](478600/4588595) loss:2.781 lr:0.0000100 epoch_Time:26074.0min: [2024-01-04 20:16:05,831][model8_pretrain.py][INFO] Epoch:[0/2](478600/4588595) loss:2.483 lr:0.0000100 epoch_Time:26074.0min: [2024-01-04 20:16:05,832][model8_pretrain.py][INFO] Epoch:[0/2](478600/4588595) loss:2.697 lr:0.0000100 epoch_Time:26074.0min: [2024-01-04 20:16:05,832][model8_pretrain.py][INFO] Epoch:[0/2](478600/4588595) loss:2.485 lr:0.0000100 epoch_Time:26074.0min: [2024-01-04 20:16:42,754][model8_pretrain.py][INFO] Epoch:[0/2](478700/4588595) loss:2.992 lr:0.0000100 epoch_Time:26074.0min: [2024-01-04 20:16:42,754][model8_pretrain.py][INFO] Epoch:[0/2](478700/4588595) loss:2.709 lr:0.0000100 epoch_Time:26074.0min: [2024-01-04 20:16:42,754][model8_pretrain.py][INFO] Epoch:[0/2](478700/4588595) loss:2.692 lr:0.0000100 epoch_Time:26074.0min: [2024-01-04 20:16:42,754][model8_pretrain.py][INFO] Epoch:[0/2](478700/4588595) loss:3.078 lr:0.0000100 epoch_Time:26074.0min: [2024-01-04 20:16:42,754][model8_pretrain.py][INFO] Epoch:[0/2](478700/4588595) loss:3.098 lr:0.0000100 epoch_Time:26074.0min: [2024-01-04 20:16:42,754][model8_pretrain.py][INFO] Epoch:[0/2](478700/4588595) loss:3.187 lr:0.0000100 epoch_Time:26074.0min: [2024-01-04 20:16:42,754][model8_pretrain.py][INFO] Epoch:[0/2](478700/4588595) loss:2.816 lr:0.0000100 epoch_Time:26074.0min: [2024-01-04 20:16:42,754][model8_pretrain.py][INFO] Epoch:[0/2](478700/4588595) loss:2.759 lr:0.0000100 epoch_Time:26074.0min: [2024-01-04 20:17:30,042][model8_pretrain.py][INFO] Epoch:[0/2](478800/4588595) loss:2.656 lr:0.0000100 epoch_Time:26074.0min: [2024-01-04 20:17:30,043][model8_pretrain.py][INFO] Epoch:[0/2](478800/4588595) loss:2.319 lr:0.0000100 epoch_Time:26074.0min: [2024-01-04 20:17:30,043][model8_pretrain.py][INFO] Epoch:[0/2](478800/4588595) loss:2.633 lr:0.0000100 
epoch_Time:26074.0min: [2024-01-04 20:17:30,043][model8_pretrain.py][INFO] Epoch:[0/2](478800/4588595) loss:2.102 lr:0.0000100 epoch_Time:26074.0min: [2024-01-04 20:17:30,043][model8_pretrain.py][INFO] Epoch:[0/2](478800/4588595) loss:3.246 lr:0.0000100 epoch_Time:26074.0min: [2024-01-04 20:17:30,043][model8_pretrain.py][INFO] Epoch:[0/2](478800/4588595) loss:3.087 lr:0.0000100 epoch_Time:26074.0min: [2024-01-04 20:17:30,043][model8_pretrain.py][INFO] Epoch:[0/2](478800/4588595) loss:2.392 lr:0.0000100 epoch_Time:26074.0min: [2024-01-04 20:17:30,043][model8_pretrain.py][INFO] Epoch:[0/2](478800/4588595) loss:2.680 lr:0.0000100 epoch_Time:26074.0min: [2024-01-04 20:18:06,972][model8_pretrain.py][INFO] Epoch:[0/2](478900/4588595) loss:2.733 lr:0.0000100 epoch_Time:26073.0min: [2024-01-04 20:18:06,972][model8_pretrain.py][INFO] Epoch:[0/2](478900/4588595) loss:3.315 lr:0.0000100 epoch_Time:26073.0min: [2024-01-04 20:18:06,972][model8_pretrain.py][INFO] Epoch:[0/2](478900/4588595) loss:2.619 lr:0.0000100 epoch_Time:26073.0min: [2024-01-04 20:18:06,972][model8_pretrain.py][INFO] Epoch:[0/2](478900/4588595) loss:2.682 lr:0.0000100 epoch_Time:26073.0min: [2024-01-04 20:18:06,972][model8_pretrain.py][INFO] Epoch:[0/2](478900/4588595) loss:3.129 lr:0.0000100 epoch_Time:26073.0min: [2024-01-04 20:18:06,972][model8_pretrain.py][INFO] Epoch:[0/2](478900/4588595) loss:2.784 lr:0.0000100 epoch_Time:26073.0min: [2024-01-04 20:18:06,972][model8_pretrain.py][INFO] Epoch:[0/2](478900/4588595) loss:2.780 lr:0.0000100 epoch_Time:26073.0min: [2024-01-04 20:18:06,973][model8_pretrain.py][INFO] Epoch:[0/2](478900/4588595) loss:2.995 lr:0.0000100 epoch_Time:26073.0min: [2024-01-04 20:18:43,900][model8_pretrain.py][INFO] Epoch:[0/2](479000/4588595) loss:2.804 lr:0.0000100 epoch_Time:26073.0min: [2024-01-04 20:18:43,900][model8_pretrain.py][INFO] Epoch:[0/2](479000/4588595) loss:2.548 lr:0.0000100 epoch_Time:26073.0min: [2024-01-04 20:18:43,900][model8_pretrain.py][INFO] 
Epoch:[0/2](479000/4588595) loss:2.762 lr:0.0000100 epoch_Time:26073.0min: [2024-01-04 20:18:43,900][model8_pretrain.py][INFO] Epoch:[0/2](479000/4588595) loss:3.008 lr:0.0000100 epoch_Time:26073.0min: [2024-01-04 20:18:43,900][model8_pretrain.py][INFO] Epoch:[0/2](479000/4588595) loss:3.088 lr:0.0000100 epoch_Time:26073.0min: [2024-01-04 20:18:43,900][model8_pretrain.py][INFO] Epoch:[0/2](479000/4588595) loss:3.329 lr:0.0000100 epoch_Time:26073.0min: [2024-01-04 20:18:43,900][model8_pretrain.py][INFO] Epoch:[0/2](479000/4588595) loss:2.425 lr:0.0000100 epoch_Time:26073.0min: [2024-01-04 20:18:43,901][model8_pretrain.py][INFO] Epoch:[0/2](479000/4588595) loss:2.478 lr:0.0000100 epoch_Time:26073.0min: [2024-01-04 20:19:20,835][model8_pretrain.py][INFO] Epoch:[0/2](479100/4588595) loss:2.811 lr:0.0000100 epoch_Time:26072.0min: [2024-01-04 20:19:20,835][model8_pretrain.py][INFO] Epoch:[0/2](479100/4588595) loss:2.847 lr:0.0000100 epoch_Time:26072.0min: [2024-01-04 20:19:20,835][model8_pretrain.py][INFO] Epoch:[0/2](479100/4588595) loss:2.754 lr:0.0000100 epoch_Time:26072.0min: [2024-01-04 20:19:20,835][model8_pretrain.py][INFO] Epoch:[0/2](479100/4588595) loss:3.069 lr:0.0000100 epoch_Time:26072.0min: [2024-01-04 20:19:20,835][model8_pretrain.py][INFO] Epoch:[0/2](479100/4588595) loss:2.963 lr:0.0000100 epoch_Time:26072.0min: [2024-01-04 20:19:20,835][model8_pretrain.py][INFO] Epoch:[0/2](479100/4588595) loss:2.700 lr:0.0000100 epoch_Time:26072.0min: [2024-01-04 20:19:20,835][model8_pretrain.py][INFO] Epoch:[0/2](479100/4588595) loss:3.051 lr:0.0000100 epoch_Time:26072.0min: [2024-01-04 20:19:20,835][model8_pretrain.py][INFO] Epoch:[0/2](479100/4588595) loss:3.265 lr:0.0000100 epoch_Time:26072.0min: [2024-01-04 20:19:57,766][model8_pretrain.py][INFO] Epoch:[0/2](479200/4588595) loss:2.620 lr:0.0000100 epoch_Time:26071.0min: [2024-01-04 20:19:57,766][model8_pretrain.py][INFO] Epoch:[0/2](479200/4588595) loss:3.182 lr:0.0000100 epoch_Time:26071.0min: [2024-01-04 
20:19:57,766][model8_pretrain.py][INFO] Epoch:[0/2](479200/4588595) loss:3.021 lr:0.0000100 epoch_Time:26071.0min: [2024-01-04 20:19:57,766][model8_pretrain.py][INFO] Epoch:[0/2](479200/4588595) loss:3.166 lr:0.0000100 epoch_Time:26071.0min: [2024-01-04 20:19:57,766][model8_pretrain.py][INFO] Epoch:[0/2](479200/4588595) loss:3.058 lr:0.0000100 epoch_Time:26071.0min: [2024-01-04 20:19:57,766][model8_pretrain.py][INFO] Epoch:[0/2](479200/4588595) loss:2.460 lr:0.0000100 epoch_Time:26071.0min: [2024-01-04 20:19:57,766][model8_pretrain.py][INFO] Epoch:[0/2](479200/4588595) loss:3.313 lr:0.0000100 epoch_Time:26071.0min: [2024-01-04 20:19:57,766][model8_pretrain.py][INFO] Epoch:[0/2](479200/4588595) loss:3.342 lr:0.0000100 epoch_Time:26071.0min: [2024-01-04 20:20:34,706][model8_pretrain.py][INFO] Epoch:[0/2](479300/4588595) loss:2.162 lr:0.0000100 epoch_Time:26070.0min: [2024-01-04 20:20:34,706][model8_pretrain.py][INFO] Epoch:[0/2](479300/4588595) loss:3.000 lr:0.0000100 epoch_Time:26071.0min: [2024-01-04 20:20:34,706][model8_pretrain.py][INFO] Epoch:[0/2](479300/4588595) loss:2.914 lr:0.0000100 epoch_Time:26070.0min: [2024-01-04 20:20:34,706][model8_pretrain.py][INFO] Epoch:[0/2](479300/4588595) loss:3.065 lr:0.0000100 epoch_Time:26071.0min: [2024-01-04 20:20:34,706][model8_pretrain.py][INFO] Epoch:[0/2](479300/4588595) loss:2.624 lr:0.0000100 epoch_Time:26070.0min: [2024-01-04 20:20:34,706][model8_pretrain.py][INFO] Epoch:[0/2](479300/4588595) loss:3.172 lr:0.0000100 epoch_Time:26071.0min: [2024-01-04 20:20:34,707][model8_pretrain.py][INFO] Epoch:[0/2](479300/4588595) loss:2.542 lr:0.0000100 epoch_Time:26070.0min: [2024-01-04 20:20:34,707][model8_pretrain.py][INFO] Epoch:[0/2](479300/4588595) loss:2.898 lr:0.0000100 epoch_Time:26070.0min: [2024-01-04 20:21:11,644][model8_pretrain.py][INFO] Epoch:[0/2](479400/4588595) loss:2.988 lr:0.0000100 epoch_Time:26069.0min: [2024-01-04 20:21:11,644][model8_pretrain.py][INFO] Epoch:[0/2](479400/4588595) loss:2.786 lr:0.0000100 
epoch_Time:26069.0min: [2024-01-04 20:21:11,644][model8_pretrain.py][INFO] Epoch:[0/2](479400/4588595) loss:2.864 lr:0.0000100 epoch_Time:26069.0min: [2024-01-04 20:21:11,644][model8_pretrain.py][INFO] Epoch:[0/2](479400/4588595) loss:2.686 lr:0.0000100 epoch_Time:26069.0min: [2024-01-04 20:21:11,644][model8_pretrain.py][INFO] Epoch:[0/2](479400/4588595) loss:3.215 lr:0.0000100 epoch_Time:26069.0min: [2024-01-04 20:21:11,644][model8_pretrain.py][INFO] Epoch:[0/2](479400/4588595) loss:3.364 lr:0.0000100 epoch_Time:26069.0min: [2024-01-04 20:21:11,644][model8_pretrain.py][INFO] Epoch:[0/2](479400/4588595) loss:2.744 lr:0.0000100 epoch_Time:26069.0min: [2024-01-04 20:21:11,644][model8_pretrain.py][INFO] Epoch:[0/2](479400/4588595) loss:2.962 lr:0.0000100 epoch_Time:26069.0min: [2024-01-04 20:21:48,579][model8_pretrain.py][INFO] Epoch:[0/2](479500/4588595) loss:3.083 lr:0.0000100 epoch_Time:26068.0min: [2024-01-04 20:21:48,579][model8_pretrain.py][INFO] Epoch:[0/2](479500/4588595) loss:3.131 lr:0.0000100 epoch_Time:26068.0min: [2024-01-04 20:21:48,579][model8_pretrain.py][INFO] Epoch:[0/2](479500/4588595) loss:3.099 lr:0.0000100 epoch_Time:26068.0min: [2024-01-04 20:21:48,579][model8_pretrain.py][INFO] Epoch:[0/2](479500/4588595) loss:3.041 lr:0.0000100 epoch_Time:26068.0min: [2024-01-04 20:21:48,579][model8_pretrain.py][INFO] Epoch:[0/2](479500/4588595) loss:3.104 lr:0.0000100 epoch_Time:26068.0min: [2024-01-04 20:21:48,579][model8_pretrain.py][INFO] Epoch:[0/2](479500/4588595) loss:2.625 lr:0.0000100 epoch_Time:26068.0min: [2024-01-04 20:21:48,579][model8_pretrain.py][INFO] Epoch:[0/2](479500/4588595) loss:3.444 lr:0.0000100 epoch_Time:26068.0min: [2024-01-04 20:21:48,579][model8_pretrain.py][INFO] Epoch:[0/2](479500/4588595) loss:2.994 lr:0.0000100 epoch_Time:26068.0min: [2024-01-04 20:22:35,994][model8_pretrain.py][INFO] Epoch:[0/2](479600/4588595) loss:2.883 lr:0.0000100 epoch_Time:26070.0min: [2024-01-04 20:22:35,994][model8_pretrain.py][INFO] 
Epoch:[0/2](479600/4588595) loss:2.746 lr:0.0000100 epoch_Time:26070.0min: [2024-01-04 20:22:35,994][model8_pretrain.py][INFO] Epoch:[0/2](479600/4588595) loss:2.596 lr:0.0000100 epoch_Time:26070.0min: [2024-01-04 20:22:35,994][model8_pretrain.py][INFO] Epoch:[0/2](479600/4588595) loss:2.583 lr:0.0000100 epoch_Time:26070.0min: [2024-01-04 20:22:35,994][model8_pretrain.py][INFO] Epoch:[0/2](479600/4588595) loss:2.722 lr:0.0000100 epoch_Time:26070.0min: [2024-01-04 20:22:35,994][model8_pretrain.py][INFO] Epoch:[0/2](479600/4588595) loss:2.836 lr:0.0000100 epoch_Time:26070.0min: [2024-01-04 20:22:35,994][model8_pretrain.py][INFO] Epoch:[0/2](479600/4588595) loss:2.859 lr:0.0000100 epoch_Time:26070.0min: [2024-01-04 20:22:35,994][model8_pretrain.py][INFO] Epoch:[0/2](479600/4588595) loss:2.911 lr:0.0000100 epoch_Time:26070.0min: [2024-01-04 20:23:12,926][model8_pretrain.py][INFO] Epoch:[0/2](479700/4588595) loss:3.151 lr:0.0000100 epoch_Time:26068.0min: [2024-01-04 20:23:12,926][model8_pretrain.py][INFO] Epoch:[0/2](479700/4588595) loss:2.325 lr:0.0000100 epoch_Time:26068.0min: [2024-01-04 20:23:12,926][model8_pretrain.py][INFO] Epoch:[0/2](479700/4588595) loss:2.786 lr:0.0000100 epoch_Time:26068.0min: [2024-01-04 20:23:12,926][model8_pretrain.py][INFO] Epoch:[0/2](479700/4588595) loss:2.646 lr:0.0000100 epoch_Time:26068.0min: [2024-01-04 20:23:12,926][model8_pretrain.py][INFO] Epoch:[0/2](479700/4588595) loss:3.102 lr:0.0000100 epoch_Time:26068.0min: [2024-01-04 20:23:12,926][model8_pretrain.py][INFO] Epoch:[0/2](479700/4588595) loss:2.318 lr:0.0000100 epoch_Time:26068.0min: [2024-01-04 20:23:12,926][model8_pretrain.py][INFO] Epoch:[0/2](479700/4588595) loss:2.362 lr:0.0000100 epoch_Time:26068.0min: [2024-01-04 20:23:12,926][model8_pretrain.py][INFO] Epoch:[0/2](479700/4588595) loss:3.090 lr:0.0000100 epoch_Time:26068.0min: [2024-01-04 20:23:49,861][model8_pretrain.py][INFO] Epoch:[0/2](479800/4588595) loss:2.713 lr:0.0000100 epoch_Time:26067.0min: [2024-01-04 
20:23:49,861][model8_pretrain.py][INFO] Epoch:[0/2](479800/4588595) loss:3.334 lr:0.0000100 epoch_Time:26067.0min: [2024-01-04 20:23:49,861][model8_pretrain.py][INFO] Epoch:[0/2](479800/4588595) loss:2.834 lr:0.0000100 epoch_Time:26067.0min: [2024-01-04 20:23:49,861][model8_pretrain.py][INFO] Epoch:[0/2](479800/4588595) loss:2.723 lr:0.0000100 epoch_Time:26067.0min: [2024-01-04 20:23:49,861][model8_pretrain.py][INFO] Epoch:[0/2](479800/4588595) loss:2.790 lr:0.0000100 epoch_Time:26067.0min: [2024-01-04 20:23:49,861][model8_pretrain.py][INFO] Epoch:[0/2](479800/4588595) loss:2.704 lr:0.0000100 epoch_Time:26067.0min: [2024-01-04 20:23:49,861][model8_pretrain.py][INFO] Epoch:[0/2](479800/4588595) loss:1.925 lr:0.0000100 epoch_Time:26067.0min: [2024-01-04 20:23:49,861][model8_pretrain.py][INFO] Epoch:[0/2](479800/4588595) loss:2.573 lr:0.0000100 epoch_Time:26067.0min: [2024-01-04 20:24:26,798][model8_pretrain.py][INFO] Epoch:[0/2](479900/4588595) loss:2.426 lr:0.0000100 epoch_Time:26067.0min: [2024-01-04 20:24:26,798][model8_pretrain.py][INFO] Epoch:[0/2](479900/4588595) loss:3.106 lr:0.0000100 epoch_Time:26067.0min: [2024-01-04 20:24:26,798][model8_pretrain.py][INFO] Epoch:[0/2](479900/4588595) loss:2.891 lr:0.0000100 epoch_Time:26067.0min: [2024-01-04 20:24:26,798][model8_pretrain.py][INFO] Epoch:[0/2](479900/4588595) loss:2.751 lr:0.0000100 epoch_Time:26067.0min: [2024-01-04 20:24:26,798][model8_pretrain.py][INFO] Epoch:[0/2](479900/4588595) loss:2.610 lr:0.0000100 epoch_Time:26067.0min: [2024-01-04 20:24:26,798][model8_pretrain.py][INFO] Epoch:[0/2](479900/4588595) loss:2.947 lr:0.0000100 epoch_Time:26067.0min: [2024-01-04 20:24:26,798][model8_pretrain.py][INFO] Epoch:[0/2](479900/4588595) loss:3.160 lr:0.0000100 epoch_Time:26067.0min: [2024-01-04 20:24:26,798][model8_pretrain.py][INFO] Epoch:[0/2](479900/4588595) loss:3.081 lr:0.0000100 epoch_Time:26067.0min: [2024-01-04 20:25:03,721][model8_pretrain.py][INFO] Epoch:[0/2](480000/4588595) loss:2.730 lr:0.0000100 
epoch_Time:26066.0min: [2024-01-04 20:25:03,721][model8_pretrain.py][INFO] Epoch:[0/2](480000/4588595) loss:2.833 lr:0.0000100 epoch_Time:26066.0min: [2024-01-04 20:25:03,721][model8_pretrain.py][INFO] Epoch:[0/2](480000/4588595) loss:1.901 lr:0.0000100 epoch_Time:26066.0min: [2024-01-04 20:25:03,722][model8_pretrain.py][INFO] Epoch:[0/2](480000/4588595) loss:3.196 lr:0.0000100 epoch_Time:26066.0min: [2024-01-04 20:25:03,722][model8_pretrain.py][INFO] Epoch:[0/2](480000/4588595) loss:2.784 lr:0.0000100 epoch_Time:26066.0min: [2024-01-04 20:25:03,722][model8_pretrain.py][INFO] Epoch:[0/2](480000/4588595) loss:3.474 lr:0.0000100 epoch_Time:26066.0min: [2024-01-04 20:25:03,722][model8_pretrain.py][INFO] Epoch:[0/2](480000/4588595) loss:2.736 lr:0.0000100 epoch_Time:26066.0min: [2024-01-04 20:25:03,722][model8_pretrain.py][INFO] Epoch:[0/2](480000/4588595) loss:3.030 lr:0.0000100 epoch_Time:26066.0min: [2024-01-04 20:25:40,654][model8_pretrain.py][INFO] Epoch:[0/2](480100/4588595) loss:3.029 lr:0.0000100 epoch_Time:26066.0min: [2024-01-04 20:25:40,654][model8_pretrain.py][INFO] Epoch:[0/2](480100/4588595) loss:2.462 lr:0.0000100 epoch_Time:26066.0min: [2024-01-04 20:25:40,654][model8_pretrain.py][INFO] Epoch:[0/2](480100/4588595) loss:3.225 lr:0.0000100 epoch_Time:26066.0min: [2024-01-04 20:25:40,654][model8_pretrain.py][INFO] Epoch:[0/2](480100/4588595) loss:2.753 lr:0.0000100 epoch_Time:26066.0min: [2024-01-04 20:25:40,654][model8_pretrain.py][INFO] Epoch:[0/2](480100/4588595) loss:3.237 lr:0.0000100 epoch_Time:26066.0min: [2024-01-04 20:25:40,654][model8_pretrain.py][INFO] Epoch:[0/2](480100/4588595) loss:3.081 lr:0.0000100 epoch_Time:26066.0min: [2024-01-04 20:25:40,654][model8_pretrain.py][INFO] Epoch:[0/2](480100/4588595) loss:3.377 lr:0.0000100 epoch_Time:26066.0min: [2024-01-04 20:25:40,654][model8_pretrain.py][INFO] Epoch:[0/2](480100/4588595) loss:2.525 lr:0.0000100 epoch_Time:26066.0min: [2024-01-04 20:26:17,575][model8_pretrain.py][INFO] 
Epoch:[0/2](480200/4588595) loss:2.912 lr:0.0000100 epoch_Time:26065.0min: [2024-01-04 20:26:17,575][model8_pretrain.py][INFO] Epoch:[0/2](480200/4588595) loss:3.393 lr:0.0000100 epoch_Time:26065.0min: [2024-01-04 20:26:17,575][model8_pretrain.py][INFO] Epoch:[0/2](480200/4588595) loss:2.893 lr:0.0000100 epoch_Time:26065.0min: [2024-01-04 20:26:17,575][model8_pretrain.py][INFO] Epoch:[0/2](480200/4588595) loss:2.727 lr:0.0000100 epoch_Time:26065.0min: [2024-01-04 20:26:17,575][model8_pretrain.py][INFO] Epoch:[0/2](480200/4588595) loss:3.116 lr:0.0000100 epoch_Time:26065.0min: [2024-01-04 20:26:17,575][model8_pretrain.py][INFO] Epoch:[0/2](480200/4588595) loss:2.715 lr:0.0000100 epoch_Time:26065.0min: [2024-01-04 20:26:17,575][model8_pretrain.py][INFO] Epoch:[0/2](480200/4588595) loss:2.557 lr:0.0000100 epoch_Time:26065.0min: [2024-01-04 20:26:17,575][model8_pretrain.py][INFO] Epoch:[0/2](480200/4588595) loss:2.633 lr:0.0000100 epoch_Time:26065.0min: [2024-01-04 20:26:54,506][model8_pretrain.py][INFO] Epoch:[0/2](480300/4588595) loss:2.925 lr:0.0000100 epoch_Time:26063.0min: [2024-01-04 20:26:54,506][model8_pretrain.py][INFO] Epoch:[0/2](480300/4588595) loss:2.310 lr:0.0000100 epoch_Time:26063.0min: [2024-01-04 20:26:54,506][model8_pretrain.py][INFO] Epoch:[0/2](480300/4588595) loss:2.581 lr:0.0000100 epoch_Time:26063.0min: [2024-01-04 20:26:54,506][model8_pretrain.py][INFO] Epoch:[0/2](480300/4588595) loss:2.568 lr:0.0000100 epoch_Time:26063.0min: [2024-01-04 20:26:54,506][model8_pretrain.py][INFO] Epoch:[0/2](480300/4588595) loss:2.891 lr:0.0000100 epoch_Time:26063.0min: [2024-01-04 20:26:54,506][model8_pretrain.py][INFO] Epoch:[0/2](480300/4588595) loss:2.962 lr:0.0000100 epoch_Time:26063.0min: [2024-01-04 20:26:54,506][model8_pretrain.py][INFO] Epoch:[0/2](480300/4588595) loss:2.448 lr:0.0000100 epoch_Time:26063.0min: [2024-01-04 20:26:54,506][model8_pretrain.py][INFO] Epoch:[0/2](480300/4588595) loss:3.073 lr:0.0000100 epoch_Time:26063.0min: [2024-01-04 
20:27:41,945][model8_pretrain.py][INFO] Epoch:[0/2](480400/4588595) loss:2.874 lr:0.0000100 epoch_Time:26065.0min: [2024-01-04 20:27:41,945][model8_pretrain.py][INFO] Epoch:[0/2](480400/4588595) loss:3.112 lr:0.0000100 epoch_Time:26065.0min: [2024-01-04 20:27:41,945][model8_pretrain.py][INFO] Epoch:[0/2](480400/4588595) loss:2.924 lr:0.0000100 epoch_Time:26065.0min: [2024-01-04 20:27:41,945][model8_pretrain.py][INFO] Epoch:[0/2](480400/4588595) loss:2.944 lr:0.0000100 epoch_Time:26065.0min: [2024-01-04 20:27:41,945][model8_pretrain.py][INFO] Epoch:[0/2](480400/4588595) loss:2.613 lr:0.0000100 epoch_Time:26065.0min: [2024-01-04 20:27:41,945][model8_pretrain.py][INFO] Epoch:[0/2](480400/4588595) loss:2.826 lr:0.0000100 epoch_Time:26065.0min: [2024-01-04 20:27:41,945][model8_pretrain.py][INFO] Epoch:[0/2](480400/4588595) loss:2.988 lr:0.0000100 epoch_Time:26065.0min: [2024-01-04 20:27:41,945][model8_pretrain.py][INFO] Epoch:[0/2](480400/4588595) loss:3.049 lr:0.0000100 epoch_Time:26065.0min: [2024-01-04 20:28:18,883][model8_pretrain.py][INFO] Epoch:[0/2](480500/4588595) loss:2.817 lr:0.0000100 epoch_Time:26064.0min: [2024-01-04 20:28:18,883][model8_pretrain.py][INFO] Epoch:[0/2](480500/4588595) loss:2.736 lr:0.0000100 epoch_Time:26064.0min: [2024-01-04 20:28:18,883][model8_pretrain.py][INFO] Epoch:[0/2](480500/4588595) loss:3.007 lr:0.0000100 epoch_Time:26064.0min: [2024-01-04 20:28:18,883][model8_pretrain.py][INFO] Epoch:[0/2](480500/4588595) loss:2.972 lr:0.0000100 epoch_Time:26064.0min: [2024-01-04 20:28:18,883][model8_pretrain.py][INFO] Epoch:[0/2](480500/4588595) loss:2.608 lr:0.0000100 epoch_Time:26064.0min: [2024-01-04 20:28:18,884][model8_pretrain.py][INFO] Epoch:[0/2](480500/4588595) loss:2.642 lr:0.0000100 epoch_Time:26064.0min: [2024-01-04 20:28:18,884][model8_pretrain.py][INFO] Epoch:[0/2](480500/4588595) loss:3.085 lr:0.0000100 epoch_Time:26064.0min: [2024-01-04 20:28:18,884][model8_pretrain.py][INFO] Epoch:[0/2](480500/4588595) loss:3.501 lr:0.0000100 
epoch_Time:26064.0min: [2024-01-04 20:28:55,842][model8_pretrain.py][INFO] Epoch:[0/2](480600/4588595) loss:2.902 lr:0.0000100 epoch_Time:26062.0min: [2024-01-04 20:28:55,842][model8_pretrain.py][INFO] Epoch:[0/2](480600/4588595) loss:2.931 lr:0.0000100 epoch_Time:26062.0min: [2024-01-04 20:28:55,842][model8_pretrain.py][INFO] Epoch:[0/2](480600/4588595) loss:2.768 lr:0.0000100 epoch_Time:26063.0min: [2024-01-04 20:28:55,842][model8_pretrain.py][INFO] Epoch:[0/2](480600/4588595) loss:2.626 lr:0.0000100 epoch_Time:26063.0min: [2024-01-04 20:28:55,842][model8_pretrain.py][INFO] Epoch:[0/2](480600/4588595) loss:2.491 lr:0.0000100 epoch_Time:26063.0min: [2024-01-04 20:28:55,842][model8_pretrain.py][INFO] Epoch:[0/2](480600/4588595) loss:3.120 lr:0.0000100 epoch_Time:26063.0min: [2024-01-04 20:28:55,842][model8_pretrain.py][INFO] Epoch:[0/2](480600/4588595) loss:2.997 lr:0.0000100 epoch_Time:26063.0min: [2024-01-04 20:28:55,842][model8_pretrain.py][INFO] Epoch:[0/2](480600/4588595) loss:3.148 lr:0.0000100 epoch_Time:26063.0min: [2024-01-04 20:29:32,799][model8_pretrain.py][INFO] Epoch:[0/2](480700/4588595) loss:3.225 lr:0.0000100 epoch_Time:26062.0min: [2024-01-04 20:29:32,799][model8_pretrain.py][INFO] Epoch:[0/2](480700/4588595) loss:3.033 lr:0.0000100 epoch_Time:26062.0min: [2024-01-04 20:29:32,799][model8_pretrain.py][INFO] Epoch:[0/2](480700/4588595) loss:3.049 lr:0.0000100 epoch_Time:26062.0min: [2024-01-04 20:29:32,799][model8_pretrain.py][INFO] Epoch:[0/2](480700/4588595) loss:2.103 lr:0.0000100 epoch_Time:26062.0min: [2024-01-04 20:29:32,799][model8_pretrain.py][INFO] Epoch:[0/2](480700/4588595) loss:2.921 lr:0.0000100 epoch_Time:26062.0min: [2024-01-04 20:29:32,799][model8_pretrain.py][INFO] Epoch:[0/2](480700/4588595) loss:2.229 lr:0.0000100 epoch_Time:26062.0min: [2024-01-04 20:29:32,799][model8_pretrain.py][INFO] Epoch:[0/2](480700/4588595) loss:2.596 lr:0.0000100 epoch_Time:26062.0min: [2024-01-04 20:29:32,799][model8_pretrain.py][INFO] 
Epoch:[0/2](480700/4588595) loss:2.593 lr:0.0000100 epoch_Time:26062.0min: [2024-01-04 20:30:09,764][model8_pretrain.py][INFO] Epoch:[0/2](480800/4588595) loss:2.862 lr:0.0000100 epoch_Time:26061.0min: [2024-01-04 20:30:09,764][model8_pretrain.py][INFO] Epoch:[0/2](480800/4588595) loss:3.257 lr:0.0000100 epoch_Time:26061.0min: [2024-01-04 20:30:09,764][model8_pretrain.py][INFO] Epoch:[0/2](480800/4588595) loss:2.504 lr:0.0000100 epoch_Time:26061.0min: [2024-01-04 20:30:09,764][model8_pretrain.py][INFO] Epoch:[0/2](480800/4588595) loss:2.979 lr:0.0000100 epoch_Time:26061.0min: [2024-01-04 20:30:09,764][model8_pretrain.py][INFO] Epoch:[0/2](480800/4588595) loss:3.161 lr:0.0000100 epoch_Time:26061.0min: [2024-01-04 20:30:09,764][model8_pretrain.py][INFO] Epoch:[0/2](480800/4588595) loss:2.855 lr:0.0000100 epoch_Time:26061.0min: [2024-01-04 20:30:09,764][model8_pretrain.py][INFO] Epoch:[0/2](480800/4588595) loss:2.686 lr:0.0000100 epoch_Time:26061.0min: [2024-01-04 20:30:09,765][model8_pretrain.py][INFO] Epoch:[0/2](480800/4588595) loss:2.917 lr:0.0000100 epoch_Time:26061.0min: [2024-01-04 20:30:46,713][model8_pretrain.py][INFO] Epoch:[0/2](480900/4588595) loss:2.697 lr:0.0000100 epoch_Time:26061.0min: [2024-01-04 20:30:46,713][model8_pretrain.py][INFO] Epoch:[0/2](480900/4588595) loss:2.604 lr:0.0000100 epoch_Time:26061.0min: [2024-01-04 20:30:46,713][model8_pretrain.py][INFO] Epoch:[0/2](480900/4588595) loss:3.132 lr:0.0000100 epoch_Time:26061.0min: [2024-01-04 20:30:46,713][model8_pretrain.py][INFO] Epoch:[0/2](480900/4588595) loss:3.103 lr:0.0000100 epoch_Time:26061.0min: [2024-01-04 20:30:46,713][model8_pretrain.py][INFO] Epoch:[0/2](480900/4588595) loss:3.013 lr:0.0000100 epoch_Time:26061.0min: [2024-01-04 20:30:46,713][model8_pretrain.py][INFO] Epoch:[0/2](480900/4588595) loss:2.936 lr:0.0000100 epoch_Time:26061.0min: [2024-01-04 20:30:46,713][model8_pretrain.py][INFO] Epoch:[0/2](480900/4588595) loss:3.032 lr:0.0000100 epoch_Time:26061.0min: [2024-01-04 
20:30:46,713][model8_pretrain.py][INFO] Epoch:[0/2](480900/4588595) loss:3.455 lr:0.0000100 epoch_Time:26061.0min: [2024-01-04 20:31:23,672][model8_pretrain.py][INFO] Epoch:[0/2](481000/4588595) loss:2.808 lr:0.0000100 epoch_Time:26060.0min: [2024-01-04 20:31:23,672][model8_pretrain.py][INFO] Epoch:[0/2](481000/4588595) loss:2.914 lr:0.0000100 epoch_Time:26060.0min: [2024-01-04 20:31:23,672][model8_pretrain.py][INFO] Epoch:[0/2](481000/4588595) loss:2.469 lr:0.0000100 epoch_Time:26060.0min: [2024-01-04 20:31:23,672][model8_pretrain.py][INFO] Epoch:[0/2](481000/4588595) loss:2.608 lr:0.0000100 epoch_Time:26060.0min: [2024-01-04 20:31:23,672][model8_pretrain.py][INFO] Epoch:[0/2](481000/4588595) loss:3.026 lr:0.0000100 epoch_Time:26060.0min: [2024-01-04 20:31:23,672][model8_pretrain.py][INFO] Epoch:[0/2](481000/4588595) loss:2.798 lr:0.0000100 epoch_Time:26060.0min: [2024-01-04 20:31:23,672][model8_pretrain.py][INFO] Epoch:[0/2](481000/4588595) loss:2.463 lr:0.0000100 epoch_Time:26060.0min: [2024-01-04 20:31:23,672][model8_pretrain.py][INFO] Epoch:[0/2](481000/4588595) loss:2.578 lr:0.0000100 epoch_Time:26060.0min: [2024-01-04 20:32:00,633][model8_pretrain.py][INFO] Epoch:[0/2](481100/4588595) loss:2.832 lr:0.0000100 epoch_Time:26059.0min: [2024-01-04 20:32:00,633][model8_pretrain.py][INFO] Epoch:[0/2](481100/4588595) loss:2.409 lr:0.0000100 epoch_Time:26059.0min: [2024-01-04 20:32:00,633][model8_pretrain.py][INFO] Epoch:[0/2](481100/4588595) loss:2.550 lr:0.0000100 epoch_Time:26059.0min: [2024-01-04 20:32:00,633][model8_pretrain.py][INFO] Epoch:[0/2](481100/4588595) loss:2.458 lr:0.0000100 epoch_Time:26059.0min: [2024-01-04 20:32:00,633][model8_pretrain.py][INFO] Epoch:[0/2](481100/4588595) loss:2.920 lr:0.0000100 epoch_Time:26059.0min: [2024-01-04 20:32:00,633][model8_pretrain.py][INFO] Epoch:[0/2](481100/4588595) loss:3.076 lr:0.0000100 epoch_Time:26059.0min: [2024-01-04 20:32:00,633][model8_pretrain.py][INFO] Epoch:[0/2](481100/4588595) loss:2.559 lr:0.0000100 
epoch_Time:26059.0min: [2024-01-04 20:32:00,634][model8_pretrain.py][INFO] Epoch:[0/2](481100/4588595) loss:3.317 lr:0.0000100 epoch_Time:26059.0min: [2024-01-04 20:32:48,038][model8_pretrain.py][INFO] Epoch:[0/2](481200/4588595) loss:2.519 lr:0.0000100 epoch_Time:26059.0min: [2024-01-04 20:32:48,038][model8_pretrain.py][INFO] Epoch:[0/2](481200/4588595) loss:2.860 lr:0.0000100 epoch_Time:26059.0min: [2024-01-04 20:32:48,038][model8_pretrain.py][INFO] Epoch:[0/2](481200/4588595) loss:2.903 lr:0.0000100 epoch_Time:26059.0min: [2024-01-04 20:32:48,038][model8_pretrain.py][INFO] Epoch:[0/2](481200/4588595) loss:2.833 lr:0.0000100 epoch_Time:26059.0min: [2024-01-04 20:32:48,038][model8_pretrain.py][INFO] Epoch:[0/2](481200/4588595) loss:2.776 lr:0.0000100 epoch_Time:26059.0min: [2024-01-04 20:32:48,038][model8_pretrain.py][INFO] Epoch:[0/2](481200/4588595) loss:3.022 lr:0.0000100 epoch_Time:26059.0min: [2024-01-04 20:32:48,038][model8_pretrain.py][INFO] Epoch:[0/2](481200/4588595) loss:2.979 lr:0.0000100 epoch_Time:26059.0min: [2024-01-04 20:32:48,038][model8_pretrain.py][INFO] Epoch:[0/2](481200/4588595) loss:3.072 lr:0.0000100 epoch_Time:26059.0min: [2024-01-04 20:33:24,983][model8_pretrain.py][INFO] Epoch:[0/2](481300/4588595) loss:3.013 lr:0.0000100 epoch_Time:26059.0min: [2024-01-04 20:33:24,983][model8_pretrain.py][INFO] Epoch:[0/2](481300/4588595) loss:3.048 lr:0.0000100 epoch_Time:26059.0min: [2024-01-04 20:33:24,983][model8_pretrain.py][INFO] Epoch:[0/2](481300/4588595) loss:2.978 lr:0.0000100 epoch_Time:26059.0min: [2024-01-04 20:33:24,983][model8_pretrain.py][INFO] Epoch:[0/2](481300/4588595) loss:2.469 lr:0.0000100 epoch_Time:26059.0min: [2024-01-04 20:33:24,983][model8_pretrain.py][INFO] Epoch:[0/2](481300/4588595) loss:2.915 lr:0.0000100 epoch_Time:26059.0min: [2024-01-04 20:33:24,984][model8_pretrain.py][INFO] Epoch:[0/2](481300/4588595) loss:2.672 lr:0.0000100 epoch_Time:26059.0min: [2024-01-04 20:33:24,984][model8_pretrain.py][INFO] 
Epoch:[0/2](481300/4588595) loss:3.053 lr:0.0000100 epoch_Time:26059.0min: [2024-01-04 20:33:24,984][model8_pretrain.py][INFO] Epoch:[0/2](481300/4588595) loss:2.470 lr:0.0000100 epoch_Time:26059.0min: [2024-01-04 20:34:01,922][model8_pretrain.py][INFO] Epoch:[0/2](481400/4588595) loss:2.987 lr:0.0000100 epoch_Time:26058.0min: [2024-01-04 20:34:01,922][model8_pretrain.py][INFO] Epoch:[0/2](481400/4588595) loss:3.249 lr:0.0000100 epoch_Time:26058.0min: [2024-01-04 20:34:01,922][model8_pretrain.py][INFO] Epoch:[0/2](481400/4588595) loss:2.964 lr:0.0000100 epoch_Time:26058.0min: [2024-01-04 20:34:01,922][model8_pretrain.py][INFO] Epoch:[0/2](481400/4588595) loss:3.052 lr:0.0000100 epoch_Time:26058.0min: [2024-01-04 20:34:01,922][model8_pretrain.py][INFO] Epoch:[0/2](481400/4588595) loss:2.074 lr:0.0000100 epoch_Time:26058.0min: [2024-01-04 20:34:01,922][model8_pretrain.py][INFO] Epoch:[0/2](481400/4588595) loss:3.104 lr:0.0000100 epoch_Time:26058.0min: [2024-01-04 20:34:01,922][model8_pretrain.py][INFO] Epoch:[0/2](481400/4588595) loss:2.640 lr:0.0000100 epoch_Time:26058.0min: [2024-01-04 20:34:01,922][model8_pretrain.py][INFO] Epoch:[0/2](481400/4588595) loss:2.864 lr:0.0000100 epoch_Time:26058.0min: [2024-01-04 20:34:38,872][model8_pretrain.py][INFO] Epoch:[0/2](481500/4588595) loss:2.665 lr:0.0000100 epoch_Time:26058.0min: [2024-01-04 20:34:38,872][model8_pretrain.py][INFO] Epoch:[0/2](481500/4588595) loss:3.017 lr:0.0000100 epoch_Time:26058.0min: [2024-01-04 20:34:38,872][model8_pretrain.py][INFO] Epoch:[0/2](481500/4588595) loss:2.564 lr:0.0000100 epoch_Time:26058.0min: [2024-01-04 20:34:38,872][model8_pretrain.py][INFO] Epoch:[0/2](481500/4588595) loss:2.537 lr:0.0000100 epoch_Time:26058.0min: [2024-01-04 20:34:38,872][model8_pretrain.py][INFO] Epoch:[0/2](481500/4588595) loss:2.393 lr:0.0000100 epoch_Time:26058.0min: [2024-01-04 20:34:38,872][model8_pretrain.py][INFO] Epoch:[0/2](481500/4588595) loss:2.983 lr:0.0000100 epoch_Time:26058.0min: [2024-01-04 
20:34:38,872][model8_pretrain.py][INFO] Epoch:[0/2](481500/4588595) loss:2.335 lr:0.0000100 epoch_Time:26058.0min: [2024-01-04 20:34:38,872][model8_pretrain.py][INFO] Epoch:[0/2](481500/4588595) loss:2.160 lr:0.0000100 epoch_Time:26058.0min: [2024-01-04 20:35:15,818][model8_pretrain.py][INFO] Epoch:[0/2](481600/4588595) loss:2.928 lr:0.0000100 epoch_Time:26056.0min: [2024-01-04 20:35:15,818][model8_pretrain.py][INFO] Epoch:[0/2](481600/4588595) loss:3.123 lr:0.0000100 epoch_Time:26056.0min: [2024-01-04 20:35:15,818][model8_pretrain.py][INFO] Epoch:[0/2](481600/4588595) loss:2.783 lr:0.0000100 epoch_Time:26056.0min: [2024-01-04 20:35:15,818][model8_pretrain.py][INFO] Epoch:[0/2](481600/4588595) loss:2.144 lr:0.0000100 epoch_Time:26056.0min: [2024-01-04 20:35:15,818][model8_pretrain.py][INFO] Epoch:[0/2](481600/4588595) loss:3.162 lr:0.0000100 epoch_Time:26056.0min: [2024-01-04 20:35:15,818][model8_pretrain.py][INFO] Epoch:[0/2](481600/4588595) loss:2.724 lr:0.0000100 epoch_Time:26056.0min: [2024-01-04 20:35:15,818][model8_pretrain.py][INFO] Epoch:[0/2](481600/4588595) loss:2.890 lr:0.0000100 epoch_Time:26056.0min: [2024-01-04 20:35:15,818][model8_pretrain.py][INFO] Epoch:[0/2](481600/4588595) loss:2.456 lr:0.0000100 epoch_Time:26056.0min: [2024-01-04 20:35:52,760][model8_pretrain.py][INFO] Epoch:[0/2](481700/4588595) loss:2.389 lr:0.0000100 epoch_Time:26055.0min: [2024-01-04 20:35:52,760][model8_pretrain.py][INFO] Epoch:[0/2](481700/4588595) loss:2.927 lr:0.0000100 epoch_Time:26055.0min: [2024-01-04 20:35:52,760][model8_pretrain.py][INFO] Epoch:[0/2](481700/4588595) loss:2.869 lr:0.0000100 epoch_Time:26055.0min: [2024-01-04 20:35:52,760][model8_pretrain.py][INFO] Epoch:[0/2](481700/4588595) loss:2.937 lr:0.0000100 epoch_Time:26055.0min: [2024-01-04 20:35:52,760][model8_pretrain.py][INFO] Epoch:[0/2](481700/4588595) loss:3.016 lr:0.0000100 epoch_Time:26055.0min: [2024-01-04 20:35:52,760][model8_pretrain.py][INFO] Epoch:[0/2](481700/4588595) loss:3.076 lr:0.0000100 
epoch_Time:26055.0min: [2024-01-04 20:35:52,760][model8_pretrain.py][INFO] Epoch:[0/2](481700/4588595) loss:3.039 lr:0.0000100 epoch_Time:26055.0min: [2024-01-04 20:35:52,760][model8_pretrain.py][INFO] Epoch:[0/2](481700/4588595) loss:3.219 lr:0.0000100 epoch_Time:26055.0min: [2024-01-04 20:36:29,707][model8_pretrain.py][INFO] Epoch:[0/2](481800/4588595) loss:2.970 lr:0.0000100 epoch_Time:26055.0min: [2024-01-04 20:36:29,707][model8_pretrain.py][INFO] Epoch:[0/2](481800/4588595) loss:2.877 lr:0.0000100 epoch_Time:26055.0min: [2024-01-04 20:36:29,707][model8_pretrain.py][INFO] Epoch:[0/2](481800/4588595) loss:2.139 lr:0.0000100 epoch_Time:26055.0min: [2024-01-04 20:36:29,707][model8_pretrain.py][INFO] Epoch:[0/2](481800/4588595) loss:2.709 lr:0.0000100 epoch_Time:26055.0min: [2024-01-04 20:36:29,707][model8_pretrain.py][INFO] Epoch:[0/2](481800/4588595) loss:2.577 lr:0.0000100 epoch_Time:26055.0min: [2024-01-04 20:36:29,707][model8_pretrain.py][INFO] Epoch:[0/2](481800/4588595) loss:2.673 lr:0.0000100 epoch_Time:26055.0min: [2024-01-04 20:36:29,707][model8_pretrain.py][INFO] Epoch:[0/2](481800/4588595) loss:2.957 lr:0.0000100 epoch_Time:26055.0min: [2024-01-04 20:36:29,707][model8_pretrain.py][INFO] Epoch:[0/2](481800/4588595) loss:2.645 lr:0.0000100 epoch_Time:26055.0min: [2024-01-04 20:37:06,650][model8_pretrain.py][INFO] Epoch:[0/2](481900/4588595) loss:3.023 lr:0.0000100 epoch_Time:26054.0min: [2024-01-04 20:37:06,650][model8_pretrain.py][INFO] Epoch:[0/2](481900/4588595) loss:3.084 lr:0.0000100 epoch_Time:26054.0min: [2024-01-04 20:37:06,650][model8_pretrain.py][INFO] Epoch:[0/2](481900/4588595) loss:3.407 lr:0.0000100 epoch_Time:26054.0min: [2024-01-04 20:37:06,650][model8_pretrain.py][INFO] Epoch:[0/2](481900/4588595) loss:2.496 lr:0.0000100 epoch_Time:26054.0min: [2024-01-04 20:37:06,650][model8_pretrain.py][INFO] Epoch:[0/2](481900/4588595) loss:2.985 lr:0.0000100 epoch_Time:26054.0min: [2024-01-04 20:37:06,650][model8_pretrain.py][INFO] 
Epoch:[0/2](481900/4588595) loss:3.101 lr:0.0000100 epoch_Time:26054.0min: [2024-01-04 20:37:06,650][model8_pretrain.py][INFO] Epoch:[0/2](481900/4588595) loss:2.980 lr:0.0000100 epoch_Time:26054.0min: [2024-01-04 20:37:06,650][model8_pretrain.py][INFO] Epoch:[0/2](481900/4588595) loss:3.246 lr:0.0000100 epoch_Time:26054.0min: [2024-01-04 20:37:54,001][model8_pretrain.py][INFO] Epoch:[0/2](482000/4588595) loss:2.589 lr:0.0000100 epoch_Time:26054.0min: [2024-01-04 20:37:54,001][model8_pretrain.py][INFO] Epoch:[0/2](482000/4588595) loss:3.347 lr:0.0000100 epoch_Time:26054.0min: [2024-01-04 20:37:54,001][model8_pretrain.py][INFO] Epoch:[0/2](482000/4588595) loss:3.197 lr:0.0000100 epoch_Time:26054.0min: [2024-01-04 20:37:54,001][model8_pretrain.py][INFO] Epoch:[0/2](482000/4588595) loss:2.998 lr:0.0000100 epoch_Time:26054.0min: [2024-01-04 20:37:54,001][model8_pretrain.py][INFO] Epoch:[0/2](482000/4588595) loss:3.340 lr:0.0000100 epoch_Time:26054.0min: [2024-01-04 20:37:54,001][model8_pretrain.py][INFO] Epoch:[0/2](482000/4588595) loss:2.334 lr:0.0000100 epoch_Time:26054.0min: [2024-01-04 20:37:54,001][model8_pretrain.py][INFO] Epoch:[0/2](482000/4588595) loss:2.820 lr:0.0000100 epoch_Time:26054.0min: [2024-01-04 20:37:54,002][model8_pretrain.py][INFO] Epoch:[0/2](482000/4588595) loss:2.749 lr:0.0000100 epoch_Time:26054.0min: [2024-01-04 20:38:30,930][model8_pretrain.py][INFO] Epoch:[0/2](482100/4588595) loss:2.517 lr:0.0000100 epoch_Time:26054.0min: [2024-01-04 20:38:30,930][model8_pretrain.py][INFO] Epoch:[0/2](482100/4588595) loss:2.936 lr:0.0000100 epoch_Time:26054.0min: [2024-01-04 20:38:30,930][model8_pretrain.py][INFO] Epoch:[0/2](482100/4588595) loss:3.132 lr:0.0000100 epoch_Time:26054.0min: [2024-01-04 20:38:30,930][model8_pretrain.py][INFO] Epoch:[0/2](482100/4588595) loss:2.935 lr:0.0000100 epoch_Time:26054.0min: [2024-01-04 20:38:30,930][model8_pretrain.py][INFO] Epoch:[0/2](482100/4588595) loss:2.583 lr:0.0000100 epoch_Time:26054.0min: [2024-01-04 
20:38:30,930][model8_pretrain.py][INFO] Epoch:[0/2](482100/4588595) loss:2.385 lr:0.0000100 epoch_Time:26054.0min: [2024-01-04 20:38:30,930][model8_pretrain.py][INFO] Epoch:[0/2](482100/4588595) loss:2.601 lr:0.0000100 epoch_Time:26054.0min: [2024-01-04 20:38:30,930][model8_pretrain.py][INFO] Epoch:[0/2](482100/4588595) loss:2.915 lr:0.0000100 epoch_Time:26054.0min: [2024-01-04 20:39:07,868][model8_pretrain.py][INFO] Epoch:[0/2](482200/4588595) loss:2.885 lr:0.0000100 epoch_Time:26053.0min: [2024-01-04 20:39:07,868][model8_pretrain.py][INFO] Epoch:[0/2](482200/4588595) loss:2.783 lr:0.0000100 epoch_Time:26053.0min: [2024-01-04 20:39:07,868][model8_pretrain.py][INFO] Epoch:[0/2](482200/4588595) loss:2.917 lr:0.0000100 epoch_Time:26053.0min: [2024-01-04 20:39:07,868][model8_pretrain.py][INFO] Epoch:[0/2](482200/4588595) loss:2.938 lr:0.0000100 epoch_Time:26053.0min: [2024-01-04 20:39:07,868][model8_pretrain.py][INFO] Epoch:[0/2](482200/4588595) loss:2.949 lr:0.0000100 epoch_Time:26053.0min: [2024-01-04 20:39:07,868][model8_pretrain.py][INFO] Epoch:[0/2](482200/4588595) loss:3.230 lr:0.0000100 epoch_Time:26053.0min: [2024-01-04 20:39:07,868][model8_pretrain.py][INFO] Epoch:[0/2](482200/4588595) loss:2.705 lr:0.0000100 epoch_Time:26053.0min: [2024-01-04 20:39:07,868][model8_pretrain.py][INFO] Epoch:[0/2](482200/4588595) loss:2.057 lr:0.0000100 epoch_Time:26053.0min: [2024-01-04 20:39:44,818][model8_pretrain.py][INFO] Epoch:[0/2](482300/4588595) loss:2.963 lr:0.0000100 epoch_Time:26053.0min: [2024-01-04 20:39:44,818][model8_pretrain.py][INFO] Epoch:[0/2](482300/4588595) loss:2.726 lr:0.0000100 epoch_Time:26053.0min: [2024-01-04 20:39:44,818][model8_pretrain.py][INFO] Epoch:[0/2](482300/4588595) loss:2.740 lr:0.0000100 epoch_Time:26053.0min: [2024-01-04 20:39:44,818][model8_pretrain.py][INFO] Epoch:[0/2](482300/4588595) loss:3.126 lr:0.0000100 epoch_Time:26053.0min: [2024-01-04 20:39:44,818][model8_pretrain.py][INFO] Epoch:[0/2](482300/4588595) loss:3.239 lr:0.0000100 
epoch_Time:26053.0min: [2024-01-04 20:39:44,818][model8_pretrain.py][INFO] Epoch:[0/2](482300/4588595) loss:2.899 lr:0.0000100 epoch_Time:26053.0min: [2024-01-04 20:39:44,818][model8_pretrain.py][INFO] Epoch:[0/2](482300/4588595) loss:3.059 lr:0.0000100 epoch_Time:26053.0min: [2024-01-04 20:39:44,818][model8_pretrain.py][INFO] Epoch:[0/2](482300/4588595) loss:2.549 lr:0.0000100 epoch_Time:26053.0min: [2024-01-04 20:40:21,742][model8_pretrain.py][INFO] Epoch:[0/2](482400/4588595) loss:3.017 lr:0.0000100 epoch_Time:26052.0min: [2024-01-04 20:40:21,742][model8_pretrain.py][INFO] Epoch:[0/2](482400/4588595) loss:2.911 lr:0.0000100 epoch_Time:26052.0min: [2024-01-04 20:40:21,742][model8_pretrain.py][INFO] Epoch:[0/2](482400/4588595) loss:3.260 lr:0.0000100 epoch_Time:26052.0min: [2024-01-04 20:40:21,742][model8_pretrain.py][INFO] Epoch:[0/2](482400/4588595) loss:2.930 lr:0.0000100 epoch_Time:26052.0min: [2024-01-04 20:40:21,743][model8_pretrain.py][INFO] Epoch:[0/2](482400/4588595) loss:2.695 lr:0.0000100 epoch_Time:26052.0min: [2024-01-04 20:40:21,743][model8_pretrain.py][INFO] Epoch:[0/2](482400/4588595) loss:2.701 lr:0.0000100 epoch_Time:26052.0min: [2024-01-04 20:40:21,743][model8_pretrain.py][INFO] Epoch:[0/2](482400/4588595) loss:2.754 lr:0.0000100 epoch_Time:26052.0min: [2024-01-04 20:40:21,743][model8_pretrain.py][INFO] Epoch:[0/2](482400/4588595) loss:2.573 lr:0.0000100 epoch_Time:26052.0min: [2024-01-04 20:40:58,690][model8_pretrain.py][INFO] Epoch:[0/2](482500/4588595) loss:3.229 lr:0.0000100 epoch_Time:26050.0min: [2024-01-04 20:40:58,690][model8_pretrain.py][INFO] Epoch:[0/2](482500/4588595) loss:3.029 lr:0.0000100 epoch_Time:26050.0min: [2024-01-04 20:40:58,690][model8_pretrain.py][INFO] Epoch:[0/2](482500/4588595) loss:2.761 lr:0.0000100 epoch_Time:26050.0min: [2024-01-04 20:40:58,690][model8_pretrain.py][INFO] Epoch:[0/2](482500/4588595) loss:2.971 lr:0.0000100 epoch_Time:26050.0min: [2024-01-04 20:40:58,691][model8_pretrain.py][INFO] 
Epoch:[0/2](482500/4588595) loss:2.758 lr:0.0000100 epoch_Time:26050.0min: [2024-01-04 20:40:58,691][model8_pretrain.py][INFO] Epoch:[0/2](482500/4588595) loss:2.337 lr:0.0000100 epoch_Time:26050.0min: [2024-01-04 20:40:58,691][model8_pretrain.py][INFO] Epoch:[0/2](482500/4588595) loss:2.899 lr:0.0000100 epoch_Time:26050.0min: [2024-01-04 20:40:58,691][model8_pretrain.py][INFO] Epoch:[0/2](482500/4588595) loss:3.270 lr:0.0000100 epoch_Time:26050.0min: [2024-01-04 20:41:35,626][model8_pretrain.py][INFO] Epoch:[0/2](482600/4588595) loss:3.206 lr:0.0000100 epoch_Time:26050.0min: [2024-01-04 20:41:35,626][model8_pretrain.py][INFO] Epoch:[0/2](482600/4588595) loss:2.850 lr:0.0000100 epoch_Time:26050.0min: [2024-01-04 20:41:35,626][model8_pretrain.py][INFO] Epoch:[0/2](482600/4588595) loss:3.337 lr:0.0000100 epoch_Time:26050.0min: [2024-01-04 20:41:35,626][model8_pretrain.py][INFO] Epoch:[0/2](482600/4588595) loss:2.640 lr:0.0000100 epoch_Time:26050.0min: [2024-01-04 20:41:35,626][model8_pretrain.py][INFO] Epoch:[0/2](482600/4588595) loss:3.248 lr:0.0000100 epoch_Time:26050.0min: [2024-01-04 20:41:35,626][model8_pretrain.py][INFO] Epoch:[0/2](482600/4588595) loss:2.601 lr:0.0000100 epoch_Time:26050.0min: [2024-01-04 20:41:35,626][model8_pretrain.py][INFO] Epoch:[0/2](482600/4588595) loss:2.735 lr:0.0000100 epoch_Time:26050.0min: [2024-01-04 20:41:35,626][model8_pretrain.py][INFO] Epoch:[0/2](482600/4588595) loss:2.942 lr:0.0000100 epoch_Time:26050.0min: [2024-01-04 20:42:12,552][model8_pretrain.py][INFO] Epoch:[0/2](482700/4588595) loss:2.653 lr:0.0000100 epoch_Time:26049.0min: [2024-01-04 20:42:12,552][model8_pretrain.py][INFO] Epoch:[0/2](482700/4588595) loss:3.085 lr:0.0000100 epoch_Time:26049.0min: [2024-01-04 20:42:12,552][model8_pretrain.py][INFO] Epoch:[0/2](482700/4588595) loss:3.119 lr:0.0000100 epoch_Time:26049.0min: [2024-01-04 20:42:12,552][model8_pretrain.py][INFO] Epoch:[0/2](482700/4588595) loss:2.539 lr:0.0000100 epoch_Time:26049.0min: [2024-01-04 
20:42:12,552][model8_pretrain.py][INFO] Epoch:[0/2](482700/4588595) loss:3.174 lr:0.0000100 epoch_Time:26049.0min: [2024-01-04 20:42:12,553][model8_pretrain.py][INFO] Epoch:[0/2](482700/4588595) loss:2.627 lr:0.0000100 epoch_Time:26049.0min: [2024-01-04 20:42:12,553][model8_pretrain.py][INFO] Epoch:[0/2](482700/4588595) loss:3.347 lr:0.0000100 epoch_Time:26049.0min: [2024-01-04 20:42:12,553][model8_pretrain.py][INFO] Epoch:[0/2](482700/4588595) loss:2.974 lr:0.0000100 epoch_Time:26049.0min: [2024-01-04 20:42:59,989][model8_pretrain.py][INFO] Epoch:[0/2](482800/4588595) loss:2.223 lr:0.0000100 epoch_Time:26050.0min: [2024-01-04 20:42:59,989][model8_pretrain.py][INFO] Epoch:[0/2](482800/4588595) loss:2.954 lr:0.0000100 epoch_Time:26050.0min: [2024-01-04 20:42:59,989][model8_pretrain.py][INFO] Epoch:[0/2](482800/4588595) loss:2.681 lr:0.0000100 epoch_Time:26050.0min: [2024-01-04 20:42:59,989][model8_pretrain.py][INFO] Epoch:[0/2](482800/4588595) loss:2.623 lr:0.0000100 epoch_Time:26050.0min: [2024-01-04 20:42:59,989][model8_pretrain.py][INFO] Epoch:[0/2](482800/4588595) loss:3.056 lr:0.0000100 epoch_Time:26050.0min: [2024-01-04 20:42:59,989][model8_pretrain.py][INFO] Epoch:[0/2](482800/4588595) loss:3.329 lr:0.0000100 epoch_Time:26050.0min: [2024-01-04 20:42:59,990][model8_pretrain.py][INFO] Epoch:[0/2](482800/4588595) loss:2.037 lr:0.0000100 epoch_Time:26050.0min: [2024-01-04 20:42:59,990][model8_pretrain.py][INFO] Epoch:[0/2](482800/4588595) loss:2.656 lr:0.0000100 epoch_Time:26050.0min: [2024-01-04 20:43:36,914][model8_pretrain.py][INFO] Epoch:[0/2](482900/4588595) loss:3.277 lr:0.0000100 epoch_Time:26049.0min: [2024-01-04 20:43:36,914][model8_pretrain.py][INFO] Epoch:[0/2](482900/4588595) loss:2.107 lr:0.0000100 epoch_Time:26049.0min: [2024-01-04 20:43:36,914][model8_pretrain.py][INFO] Epoch:[0/2](482900/4588595) loss:2.747 lr:0.0000100 epoch_Time:26049.0min: [2024-01-04 20:43:36,914][model8_pretrain.py][INFO] Epoch:[0/2](482900/4588595) loss:2.752 lr:0.0000100 
epoch_Time:26049.0min: [2024-01-04 20:43:36,914][model8_pretrain.py][INFO] Epoch:[0/2](482900/4588595) loss:2.922 lr:0.0000100 epoch_Time:26049.0min: [2024-01-04 20:43:36,914][model8_pretrain.py][INFO] Epoch:[0/2](482900/4588595) loss:2.990 lr:0.0000100 epoch_Time:26049.0min: [2024-01-04 20:43:36,914][model8_pretrain.py][INFO] Epoch:[0/2](482900/4588595) loss:3.315 lr:0.0000100 epoch_Time:26049.0min: [2024-01-04 20:43:36,915][model8_pretrain.py][INFO] Epoch:[0/2](482900/4588595) loss:3.166 lr:0.0000100 epoch_Time:26049.0min: [2024-01-04 20:44:13,845][model8_pretrain.py][INFO] Epoch:[0/2](483000/4588595) loss:3.164 lr:0.0000100 epoch_Time:26048.0min: [2024-01-04 20:44:13,845][model8_pretrain.py][INFO] Epoch:[0/2](483000/4588595) loss:3.058 lr:0.0000100 epoch_Time:26048.0min: [2024-01-04 20:44:13,845][model8_pretrain.py][INFO] Epoch:[0/2](483000/4588595) loss:3.282 lr:0.0000100 epoch_Time:26048.0min: [2024-01-04 20:44:13,845][model8_pretrain.py][INFO] Epoch:[0/2](483000/4588595) loss:3.255 lr:0.0000100 epoch_Time:26048.0min: [2024-01-04 20:44:13,845][model8_pretrain.py][INFO] Epoch:[0/2](483000/4588595) loss:2.547 lr:0.0000100 epoch_Time:26048.0min: [2024-01-04 20:44:13,845][model8_pretrain.py][INFO] Epoch:[0/2](483000/4588595) loss:2.447 lr:0.0000100 epoch_Time:26048.0min: [2024-01-04 20:44:13,845][model8_pretrain.py][INFO] Epoch:[0/2](483000/4588595) loss:2.965 lr:0.0000100 epoch_Time:26048.0min: [2024-01-04 20:44:13,845][model8_pretrain.py][INFO] Epoch:[0/2](483000/4588595) loss:2.166 lr:0.0000100 epoch_Time:26048.0min: [2024-01-04 20:44:50,780][model8_pretrain.py][INFO] Epoch:[0/2](483100/4588595) loss:2.663 lr:0.0000100 epoch_Time:26047.0min: [2024-01-04 20:44:50,781][model8_pretrain.py][INFO] Epoch:[0/2](483100/4588595) loss:2.645 lr:0.0000100 epoch_Time:26047.0min: [2024-01-04 20:44:50,780][model8_pretrain.py][INFO] Epoch:[0/2](483100/4588595) loss:3.297 lr:0.0000100 epoch_Time:26047.0min: [2024-01-04 20:44:50,780][model8_pretrain.py][INFO] 
Epoch:[0/2](483100/4588595) loss:2.845 lr:0.0000100 epoch_Time:26047.0min: [2024-01-04 20:44:50,780][model8_pretrain.py][INFO] Epoch:[0/2](483100/4588595) loss:3.535 lr:0.0000100 epoch_Time:26047.0min: [2024-01-04 20:44:50,780][model8_pretrain.py][INFO] Epoch:[0/2](483100/4588595) loss:2.724 lr:0.0000100 epoch_Time:26047.0min: [2024-01-04 20:44:50,780][model8_pretrain.py][INFO] Epoch:[0/2](483100/4588595) loss:2.798 lr:0.0000100 epoch_Time:26047.0min: [2024-01-04 20:44:50,780][model8_pretrain.py][INFO] Epoch:[0/2](483100/4588595) loss:2.565 lr:0.0000100 epoch_Time:26047.0min: [2024-01-04 20:45:27,725][model8_pretrain.py][INFO] Epoch:[0/2](483200/4588595) loss:3.379 lr:0.0000100 epoch_Time:26047.0min: [2024-01-04 20:45:27,725][model8_pretrain.py][INFO] Epoch:[0/2](483200/4588595) loss:3.059 lr:0.0000100 epoch_Time:26047.0min: [2024-01-04 20:45:27,726][model8_pretrain.py][INFO] Epoch:[0/2](483200/4588595) loss:2.679 lr:0.0000100 epoch_Time:26047.0min: [2024-01-04 20:45:27,726][model8_pretrain.py][INFO] Epoch:[0/2](483200/4588595) loss:2.954 lr:0.0000100 epoch_Time:26047.0min: [2024-01-04 20:45:27,726][model8_pretrain.py][INFO] Epoch:[0/2](483200/4588595) loss:2.560 lr:0.0000100 epoch_Time:26047.0min: [2024-01-04 20:45:27,726][model8_pretrain.py][INFO] Epoch:[0/2](483200/4588595) loss:2.654 lr:0.0000100 epoch_Time:26047.0min: [2024-01-04 20:45:27,726][model8_pretrain.py][INFO] Epoch:[0/2](483200/4588595) loss:3.015 lr:0.0000100 epoch_Time:26047.0min: [2024-01-04 20:45:27,726][model8_pretrain.py][INFO] Epoch:[0/2](483200/4588595) loss:2.855 lr:0.0000100 epoch_Time:26047.0min: [2024-01-04 20:46:04,670][model8_pretrain.py][INFO] Epoch:[0/2](483300/4588595) loss:3.233 lr:0.0000100 epoch_Time:26046.0min: [2024-01-04 20:46:04,670][model8_pretrain.py][INFO] Epoch:[0/2](483300/4588595) loss:2.024 lr:0.0000100 epoch_Time:26046.0min: [2024-01-04 20:46:04,670][model8_pretrain.py][INFO] Epoch:[0/2](483300/4588595) loss:2.107 lr:0.0000100 epoch_Time:26046.0min: [2024-01-04 
20:46:04,670][model8_pretrain.py][INFO] Epoch:[0/2](483300/4588595) loss:3.270 lr:0.0000100 epoch_Time:26046.0min: [2024-01-04 20:46:04,670][model8_pretrain.py][INFO] Epoch:[0/2](483300/4588595) loss:2.667 lr:0.0000100 epoch_Time:26046.0min: [2024-01-04 20:46:04,670][model8_pretrain.py][INFO] Epoch:[0/2](483300/4588595) loss:2.601 lr:0.0000100 epoch_Time:26046.0min: [2024-01-04 20:46:04,670][model8_pretrain.py][INFO] Epoch:[0/2](483300/4588595) loss:2.801 lr:0.0000100 epoch_Time:26046.0min: [2024-01-04 20:46:04,671][model8_pretrain.py][INFO] Epoch:[0/2](483300/4588595) loss:2.679 lr:0.0000100 epoch_Time:26046.0min: [2024-01-04 20:46:41,614][model8_pretrain.py][INFO] Epoch:[0/2](483400/4588595) loss:2.960 lr:0.0000100 epoch_Time:26045.0min: [2024-01-04 20:46:41,614][model8_pretrain.py][INFO] Epoch:[0/2](483400/4588595) loss:2.579 lr:0.0000100 epoch_Time:26045.0min: [2024-01-04 20:46:41,614][model8_pretrain.py][INFO] Epoch:[0/2](483400/4588595) loss:2.667 lr:0.0000100 epoch_Time:26045.0min: [2024-01-04 20:46:41,614][model8_pretrain.py][INFO] Epoch:[0/2](483400/4588595) loss:2.786 lr:0.0000100 epoch_Time:26045.0min: [2024-01-04 20:46:41,614][model8_pretrain.py][INFO] Epoch:[0/2](483400/4588595) loss:2.276 lr:0.0000100 epoch_Time:26045.0min: [2024-01-04 20:46:41,614][model8_pretrain.py][INFO] Epoch:[0/2](483400/4588595) loss:3.275 lr:0.0000100 epoch_Time:26045.0min: [2024-01-04 20:46:41,614][model8_pretrain.py][INFO] Epoch:[0/2](483400/4588595) loss:2.260 lr:0.0000100 epoch_Time:26045.0min: [2024-01-04 20:46:41,615][model8_pretrain.py][INFO] Epoch:[0/2](483400/4588595) loss:2.870 lr:0.0000100 epoch_Time:26045.0min: [2024-01-04 20:47:18,537][model8_pretrain.py][INFO] Epoch:[0/2](483500/4588595) loss:3.305 lr:0.0000100 epoch_Time:26044.0min: [2024-01-04 20:47:18,537][model8_pretrain.py][INFO] Epoch:[0/2](483500/4588595) loss:2.760 lr:0.0000100 epoch_Time:26044.0min: [2024-01-04 20:47:18,537][model8_pretrain.py][INFO] Epoch:[0/2](483500/4588595) loss:2.930 lr:0.0000100 
epoch_Time:26044.0min: [2024-01-04 20:47:18,537][model8_pretrain.py][INFO] Epoch:[0/2](483500/4588595) loss:2.886 lr:0.0000100 epoch_Time:26044.0min: [2024-01-04 20:47:18,537][model8_pretrain.py][INFO] Epoch:[0/2](483500/4588595) loss:2.565 lr:0.0000100 epoch_Time:26044.0min: [2024-01-04 20:47:18,537][model8_pretrain.py][INFO] Epoch:[0/2](483500/4588595) loss:2.326 lr:0.0000100 epoch_Time:26044.0min: [2024-01-04 20:47:18,537][model8_pretrain.py][INFO] Epoch:[0/2](483500/4588595) loss:2.628 lr:0.0000100 epoch_Time:26044.0min: [2024-01-04 20:47:18,537][model8_pretrain.py][INFO] Epoch:[0/2](483500/4588595) loss:2.663 lr:0.0000100 epoch_Time:26044.0min: [2024-01-04 20:48:05,944][model8_pretrain.py][INFO] Epoch:[0/2](483600/4588595) loss:3.031 lr:0.0000100 epoch_Time:26045.0min: [2024-01-04 20:48:05,944][model8_pretrain.py][INFO] Epoch:[0/2](483600/4588595) loss:3.282 lr:0.0000100 epoch_Time:26045.0min: [2024-01-04 20:48:05,944][model8_pretrain.py][INFO] Epoch:[0/2](483600/4588595) loss:2.432 lr:0.0000100 epoch_Time:26045.0min: [2024-01-04 20:48:05,944][model8_pretrain.py][INFO] Epoch:[0/2](483600/4588595) loss:3.411 lr:0.0000100 epoch_Time:26045.0min: [2024-01-04 20:48:05,944][model8_pretrain.py][INFO] Epoch:[0/2](483600/4588595) loss:2.404 lr:0.0000100 epoch_Time:26045.0min: [2024-01-04 20:48:05,944][model8_pretrain.py][INFO] Epoch:[0/2](483600/4588595) loss:3.266 lr:0.0000100 epoch_Time:26045.0min: [2024-01-04 20:48:05,944][model8_pretrain.py][INFO] Epoch:[0/2](483600/4588595) loss:2.782 lr:0.0000100 epoch_Time:26045.0min: [2024-01-04 20:48:05,944][model8_pretrain.py][INFO] Epoch:[0/2](483600/4588595) loss:3.007 lr:0.0000100 epoch_Time:26045.0min: [2024-01-04 20:48:42,881][model8_pretrain.py][INFO] Epoch:[0/2](483700/4588595) loss:2.637 lr:0.0000100 epoch_Time:26045.0min: [2024-01-04 20:48:42,881][model8_pretrain.py][INFO] Epoch:[0/2](483700/4588595) loss:2.692 lr:0.0000100 epoch_Time:26045.0min: [2024-01-04 20:48:42,881][model8_pretrain.py][INFO] 
Epoch:[0/2](483700/4588595) loss:2.939 lr:0.0000100 epoch_Time:26045.0min: [2024-01-04 20:48:42,881][model8_pretrain.py][INFO] Epoch:[0/2](483700/4588595) loss:2.665 lr:0.0000100 epoch_Time:26045.0min: [2024-01-04 20:48:42,881][model8_pretrain.py][INFO] Epoch:[0/2](483700/4588595) loss:2.982 lr:0.0000100 epoch_Time:26045.0min: [2024-01-04 20:48:42,881][model8_pretrain.py][INFO] Epoch:[0/2](483700/4588595) loss:2.840 lr:0.0000100 epoch_Time:26045.0min: [2024-01-04 20:48:42,881][model8_pretrain.py][INFO] Epoch:[0/2](483700/4588595) loss:3.070 lr:0.0000100 epoch_Time:26045.0min: [2024-01-04 20:48:42,882][model8_pretrain.py][INFO] Epoch:[0/2](483700/4588595) loss:1.845 lr:0.0000100 epoch_Time:26045.0min: [2024-01-04 20:49:19,827][model8_pretrain.py][INFO] Epoch:[0/2](483800/4588595) loss:3.025 lr:0.0000100 epoch_Time:26043.0min: [2024-01-04 20:49:19,827][model8_pretrain.py][INFO] Epoch:[0/2](483800/4588595) loss:2.989 lr:0.0000100 epoch_Time:26043.0min: [2024-01-04 20:49:19,827][model8_pretrain.py][INFO] Epoch:[0/2](483800/4588595) loss:2.732 lr:0.0000100 epoch_Time:26043.0min: [2024-01-04 20:49:19,827][model8_pretrain.py][INFO] Epoch:[0/2](483800/4588595) loss:2.671 lr:0.0000100 epoch_Time:26043.0min: [2024-01-04 20:49:19,827][model8_pretrain.py][INFO] Epoch:[0/2](483800/4588595) loss:2.616 lr:0.0000100 epoch_Time:26043.0min: [2024-01-04 20:49:19,827][model8_pretrain.py][INFO] Epoch:[0/2](483800/4588595) loss:3.468 lr:0.0000100 epoch_Time:26043.0min: [2024-01-04 20:49:19,827][model8_pretrain.py][INFO] Epoch:[0/2](483800/4588595) loss:2.856 lr:0.0000100 epoch_Time:26043.0min: [2024-01-04 20:49:19,827][model8_pretrain.py][INFO] Epoch:[0/2](483800/4588595) loss:2.230 lr:0.0000100 epoch_Time:26043.0min: [2024-01-04 20:49:56,762][model8_pretrain.py][INFO] Epoch:[0/2](483900/4588595) loss:2.997 lr:0.0000100 epoch_Time:26042.0min: [2024-01-04 20:49:56,762][model8_pretrain.py][INFO] Epoch:[0/2](483900/4588595) loss:3.221 lr:0.0000100 epoch_Time:26042.0min: [2024-01-04 
20:49:56,762][model8_pretrain.py][INFO] Epoch:[0/2](483900/4588595) loss:2.752 lr:0.0000100 epoch_Time:26042.0min: [2024-01-04 20:49:56,762][model8_pretrain.py][INFO] Epoch:[0/2](483900/4588595) loss:2.610 lr:0.0000100 epoch_Time:26042.0min: [2024-01-04 20:49:56,762][model8_pretrain.py][INFO] Epoch:[0/2](483900/4588595) loss:3.197 lr:0.0000100 epoch_Time:26042.0min: [2024-01-04 20:49:56,762][model8_pretrain.py][INFO] Epoch:[0/2](483900/4588595) loss:2.634 lr:0.0000100 epoch_Time:26042.0min: [2024-01-04 20:49:56,762][model8_pretrain.py][INFO] Epoch:[0/2](483900/4588595) loss:3.135 lr:0.0000100 epoch_Time:26042.0min: [2024-01-04 20:49:56,763][model8_pretrain.py][INFO] Epoch:[0/2](483900/4588595) loss:3.133 lr:0.0000100 epoch_Time:26042.0min: [2024-01-04 20:50:33,698][model8_pretrain.py][INFO] Epoch:[0/2](484000/4588595) loss:3.121 lr:0.0000100 epoch_Time:26042.0min: [2024-01-04 20:50:33,698][model8_pretrain.py][INFO] Epoch:[0/2](484000/4588595) loss:2.950 lr:0.0000100 epoch_Time:26042.0min: [2024-01-04 20:50:33,698][model8_pretrain.py][INFO] Epoch:[0/2](484000/4588595) loss:3.064 lr:0.0000100 epoch_Time:26042.0min: [2024-01-04 20:50:33,698][model8_pretrain.py][INFO] Epoch:[0/2](484000/4588595) loss:3.288 lr:0.0000100 epoch_Time:26042.0min: [2024-01-04 20:50:33,698][model8_pretrain.py][INFO] Epoch:[0/2](484000/4588595) loss:2.645 lr:0.0000100 epoch_Time:26042.0min: [2024-01-04 20:50:33,698][model8_pretrain.py][INFO] Epoch:[0/2](484000/4588595) loss:2.764 lr:0.0000100 epoch_Time:26042.0min: [2024-01-04 20:50:33,698][model8_pretrain.py][INFO] Epoch:[0/2](484000/4588595) loss:3.110 lr:0.0000100 epoch_Time:26042.0min: [2024-01-04 20:50:33,699][model8_pretrain.py][INFO] Epoch:[0/2](484000/4588595) loss:2.877 lr:0.0000100 epoch_Time:26042.0min: [2024-01-04 20:51:10,658][model8_pretrain.py][INFO] Epoch:[0/2](484100/4588595) loss:2.825 lr:0.0000100 epoch_Time:26041.0min: [2024-01-04 20:51:10,658][model8_pretrain.py][INFO] Epoch:[0/2](484100/4588595) loss:2.683 lr:0.0000100 
epoch_Time:26041.0min: [2024-01-04 20:51:10,658][model8_pretrain.py][INFO] Epoch:[0/2](484100/4588595) loss:2.725 lr:0.0000100 epoch_Time:26041.0min: [2024-01-04 20:51:10,658][model8_pretrain.py][INFO] Epoch:[0/2](484100/4588595) loss:2.929 lr:0.0000100 epoch_Time:26041.0min: [2024-01-04 20:51:10,658][model8_pretrain.py][INFO] Epoch:[0/2](484100/4588595) loss:3.068 lr:0.0000100 epoch_Time:26041.0min: [2024-01-04 20:51:10,658][model8_pretrain.py][INFO] Epoch:[0/2](484100/4588595) loss:1.956 lr:0.0000100 epoch_Time:26041.0min: [2024-01-04 20:51:10,658][model8_pretrain.py][INFO] Epoch:[0/2](484100/4588595) loss:2.597 lr:0.0000100 epoch_Time:26041.0min: [2024-01-04 20:51:10,658][model8_pretrain.py][INFO] Epoch:[0/2](484100/4588595) loss:3.359 lr:0.0000100 epoch_Time:26041.0min: [2024-01-04 20:51:47,594][model8_pretrain.py][INFO] Epoch:[0/2](484200/4588595) loss:3.056 lr:0.0000100 epoch_Time:26041.0min: [2024-01-04 20:51:47,594][model8_pretrain.py][INFO] Epoch:[0/2](484200/4588595) loss:2.510 lr:0.0000100 epoch_Time:26041.0min: [2024-01-04 20:51:47,594][model8_pretrain.py][INFO] Epoch:[0/2](484200/4588595) loss:2.950 lr:0.0000100 epoch_Time:26040.0min: [2024-01-04 20:51:47,594][model8_pretrain.py][INFO] Epoch:[0/2](484200/4588595) loss:2.866 lr:0.0000100 epoch_Time:26041.0min: [2024-01-04 20:51:47,594][model8_pretrain.py][INFO] Epoch:[0/2](484200/4588595) loss:2.753 lr:0.0000100 epoch_Time:26041.0min: [2024-01-04 20:51:47,594][model8_pretrain.py][INFO] Epoch:[0/2](484200/4588595) loss:2.960 lr:0.0000100 epoch_Time:26041.0min: [2024-01-04 20:51:47,594][model8_pretrain.py][INFO] Epoch:[0/2](484200/4588595) loss:2.867 lr:0.0000100 epoch_Time:26041.0min: [2024-01-04 20:51:47,594][model8_pretrain.py][INFO] Epoch:[0/2](484200/4588595) loss:2.771 lr:0.0000100 epoch_Time:26040.0min: [2024-01-04 20:52:24,528][model8_pretrain.py][INFO] Epoch:[0/2](484300/4588595) loss:2.597 lr:0.0000100 epoch_Time:26040.0min: [2024-01-04 20:52:24,528][model8_pretrain.py][INFO] 
Epoch:[0/2](484300/4588595) loss:3.227 lr:0.0000100 epoch_Time:26040.0min: [2024-01-04 20:52:24,528][model8_pretrain.py][INFO] Epoch:[0/2](484300/4588595) loss:2.954 lr:0.0000100 epoch_Time:26040.0min: [2024-01-04 20:52:24,528][model8_pretrain.py][INFO] Epoch:[0/2](484300/4588595) loss:3.142 lr:0.0000100 epoch_Time:26040.0min: [2024-01-04 20:52:24,528][model8_pretrain.py][INFO] Epoch:[0/2](484300/4588595) loss:2.652 lr:0.0000100 epoch_Time:26040.0min: [2024-01-04 20:52:24,528][model8_pretrain.py][INFO] Epoch:[0/2](484300/4588595) loss:2.710 lr:0.0000100 epoch_Time:26040.0min: [2024-01-04 20:52:24,528][model8_pretrain.py][INFO] Epoch:[0/2](484300/4588595) loss:3.277 lr:0.0000100 epoch_Time:26040.0min: [2024-01-04 20:52:24,528][model8_pretrain.py][INFO] Epoch:[0/2](484300/4588595) loss:2.675 lr:0.0000100 epoch_Time:26040.0min: [2024-01-04 20:53:11,939][model8_pretrain.py][INFO] Epoch:[0/2](484400/4588595) loss:2.815 lr:0.0000100 epoch_Time:26040.0min: [2024-01-04 20:53:11,940][model8_pretrain.py][INFO] Epoch:[0/2](484400/4588595) loss:2.770 lr:0.0000100 epoch_Time:26040.0min: [2024-01-04 20:53:11,940][model8_pretrain.py][INFO] Epoch:[0/2](484400/4588595) loss:2.815 lr:0.0000100 epoch_Time:26040.0min: [2024-01-04 20:53:11,940][model8_pretrain.py][INFO] Epoch:[0/2](484400/4588595) loss:2.934 lr:0.0000100 epoch_Time:26040.0min: [2024-01-04 20:53:11,940][model8_pretrain.py][INFO] Epoch:[0/2](484400/4588595) loss:2.664 lr:0.0000100 epoch_Time:26040.0min: [2024-01-04 20:53:11,940][model8_pretrain.py][INFO] Epoch:[0/2](484400/4588595) loss:2.922 lr:0.0000100 epoch_Time:26040.0min: [2024-01-04 20:53:11,940][model8_pretrain.py][INFO] Epoch:[0/2](484400/4588595) loss:2.730 lr:0.0000100 epoch_Time:26040.0min: [2024-01-04 20:53:11,940][model8_pretrain.py][INFO] Epoch:[0/2](484400/4588595) loss:2.842 lr:0.0000100 epoch_Time:26040.0min: [2024-01-04 20:53:48,876][model8_pretrain.py][INFO] Epoch:[0/2](484500/4588595) loss:2.690 lr:0.0000100 epoch_Time:26039.0min: [2024-01-04 
20:53:48,877][model8_pretrain.py][INFO] Epoch:[0/2](484500/4588595) loss:3.257 lr:0.0000100 epoch_Time:26039.0min: [2024-01-04 20:53:48,877][model8_pretrain.py][INFO] Epoch:[0/2](484500/4588595) loss:2.994 lr:0.0000100 epoch_Time:26039.0min: [2024-01-04 20:53:48,877][model8_pretrain.py][INFO] Epoch:[0/2](484500/4588595) loss:3.112 lr:0.0000100 epoch_Time:26039.0min: [2024-01-04 20:53:48,877][model8_pretrain.py][INFO] Epoch:[0/2](484500/4588595) loss:2.499 lr:0.0000100 epoch_Time:26039.0min: [2024-01-04 20:53:48,877][model8_pretrain.py][INFO] Epoch:[0/2](484500/4588595) loss:2.818 lr:0.0000100 epoch_Time:26039.0min: [2024-01-04 20:53:48,877][model8_pretrain.py][INFO] Epoch:[0/2](484500/4588595) loss:3.285 lr:0.0000100 epoch_Time:26039.0min: [2024-01-04 20:53:48,877][model8_pretrain.py][INFO] Epoch:[0/2](484500/4588595) loss:2.935 lr:0.0000100 epoch_Time:26039.0min: [2024-01-04 20:54:25,828][model8_pretrain.py][INFO] Epoch:[0/2](484600/4588595) loss:1.836 lr:0.0000100 epoch_Time:26039.0min: [2024-01-04 20:54:25,828][model8_pretrain.py][INFO] Epoch:[0/2](484600/4588595) loss:2.651 lr:0.0000100 epoch_Time:26039.0min: [2024-01-04 20:54:25,828][model8_pretrain.py][INFO] Epoch:[0/2](484600/4588595) loss:2.288 lr:0.0000100 epoch_Time:26039.0min: [2024-01-04 20:54:25,828][model8_pretrain.py][INFO] Epoch:[0/2](484600/4588595) loss:2.145 lr:0.0000100 epoch_Time:26039.0min: [2024-01-04 20:54:25,828][model8_pretrain.py][INFO] Epoch:[0/2](484600/4588595) loss:2.730 lr:0.0000100 epoch_Time:26039.0min: [2024-01-04 20:54:25,828][model8_pretrain.py][INFO] Epoch:[0/2](484600/4588595) loss:3.214 lr:0.0000100 epoch_Time:26039.0min: [2024-01-04 20:54:25,829][model8_pretrain.py][INFO] Epoch:[0/2](484600/4588595) loss:3.124 lr:0.0000100 epoch_Time:26039.0min: [2024-01-04 20:54:25,828][model8_pretrain.py][INFO] Epoch:[0/2](484600/4588595) loss:3.274 lr:0.0000100 epoch_Time:26039.0min: [2024-01-04 20:55:02,784][model8_pretrain.py][INFO] Epoch:[0/2](484700/4588595) loss:2.396 lr:0.0000100 
epoch_Time:26037.0min: [2024-01-04 20:55:02,784][model8_pretrain.py][INFO] Epoch:[0/2](484700/4588595) loss:2.615 lr:0.0000100 epoch_Time:26037.0min: [2024-01-04 20:55:02,784][model8_pretrain.py][INFO] Epoch:[0/2](484700/4588595) loss:2.769 lr:0.0000100 epoch_Time:26037.0min: [2024-01-04 20:55:02,784][model8_pretrain.py][INFO] Epoch:[0/2](484700/4588595) loss:2.677 lr:0.0000100 epoch_Time:26037.0min: [2024-01-04 20:55:02,784][model8_pretrain.py][INFO] Epoch:[0/2](484700/4588595) loss:3.045 lr:0.0000100 epoch_Time:26037.0min: [2024-01-04 20:55:02,784][model8_pretrain.py][INFO] Epoch:[0/2](484700/4588595) loss:2.812 lr:0.0000100 epoch_Time:26037.0min: [2024-01-04 20:55:02,784][model8_pretrain.py][INFO] Epoch:[0/2](484700/4588595) loss:2.898 lr:0.0000100 epoch_Time:26037.0min: [2024-01-04 20:55:02,784][model8_pretrain.py][INFO] Epoch:[0/2](484700/4588595) loss:2.935 lr:0.0000100 epoch_Time:26037.0min: [2024-01-04 20:55:39,719][model8_pretrain.py][INFO] Epoch:[0/2](484800/4588595) loss:2.819 lr:0.0000100 epoch_Time:26037.0min: [2024-01-04 20:55:39,719][model8_pretrain.py][INFO] Epoch:[0/2](484800/4588595) loss:2.920 lr:0.0000100 epoch_Time:26037.0min: [2024-01-04 20:55:39,719][model8_pretrain.py][INFO] Epoch:[0/2](484800/4588595) loss:3.116 lr:0.0000100 epoch_Time:26037.0min: [2024-01-04 20:55:39,719][model8_pretrain.py][INFO] Epoch:[0/2](484800/4588595) loss:2.724 lr:0.0000100 epoch_Time:26037.0min: [2024-01-04 20:55:39,719][model8_pretrain.py][INFO] Epoch:[0/2](484800/4588595) loss:3.039 lr:0.0000100 epoch_Time:26037.0min: [2024-01-04 20:55:39,719][model8_pretrain.py][INFO] Epoch:[0/2](484800/4588595) loss:2.894 lr:0.0000100 epoch_Time:26037.0min: [2024-01-04 20:55:39,719][model8_pretrain.py][INFO] Epoch:[0/2](484800/4588595) loss:2.953 lr:0.0000100 epoch_Time:26037.0min: [2024-01-04 20:55:39,719][model8_pretrain.py][INFO] Epoch:[0/2](484800/4588595) loss:3.401 lr:0.0000100 epoch_Time:26037.0min: [2024-01-04 20:56:16,658][model8_pretrain.py][INFO] 
Epoch:[0/2](484900/4588595) loss:3.133 lr:0.0000100 epoch_Time:26036.0min: [2024-01-04 20:56:16,658][model8_pretrain.py][INFO] Epoch:[0/2](484900/4588595) loss:3.214 lr:0.0000100 epoch_Time:26036.0min: [2024-01-04 20:56:16,659][model8_pretrain.py][INFO] Epoch:[0/2](484900/4588595) loss:2.567 lr:0.0000100 epoch_Time:26036.0min: [2024-01-04 20:56:16,659][model8_pretrain.py][INFO] Epoch:[0/2](484900/4588595) loss:2.333 lr:0.0000100 epoch_Time:26036.0min: [2024-01-04 20:56:16,659][model8_pretrain.py][INFO] Epoch:[0/2](484900/4588595) loss:3.091 lr:0.0000100 epoch_Time:26036.0min: [2024-01-04 20:56:16,659][model8_pretrain.py][INFO] Epoch:[0/2](484900/4588595) loss:3.024 lr:0.0000100 epoch_Time:26036.0min: [2024-01-04 20:56:16,659][model8_pretrain.py][INFO] Epoch:[0/2](484900/4588595) loss:2.941 lr:0.0000100 epoch_Time:26036.0min: [2024-01-04 20:56:16,659][model8_pretrain.py][INFO] Epoch:[0/2](484900/4588595) loss:3.097 lr:0.0000100 epoch_Time:26036.0min: [2024-01-04 20:56:53,610][model8_pretrain.py][INFO] Epoch:[0/2](485000/4588595) loss:2.836 lr:0.0000100 epoch_Time:26035.0min: [2024-01-04 20:56:53,610][model8_pretrain.py][INFO] Epoch:[0/2](485000/4588595) loss:3.346 lr:0.0000100 epoch_Time:26035.0min: [2024-01-04 20:56:53,610][model8_pretrain.py][INFO] Epoch:[0/2](485000/4588595) loss:3.162 lr:0.0000100 epoch_Time:26035.0min: [2024-01-04 20:56:53,611][model8_pretrain.py][INFO] Epoch:[0/2](485000/4588595) loss:3.240 lr:0.0000100 epoch_Time:26035.0min: [2024-01-04 20:56:53,611][model8_pretrain.py][INFO] Epoch:[0/2](485000/4588595) loss:2.763 lr:0.0000100 epoch_Time:26035.0min: [2024-01-04 20:56:53,611][model8_pretrain.py][INFO] Epoch:[0/2](485000/4588595) loss:2.490 lr:0.0000100 epoch_Time:26035.0min: [2024-01-04 20:56:53,611][model8_pretrain.py][INFO] Epoch:[0/2](485000/4588595) loss:2.222 lr:0.0000100 epoch_Time:26035.0min: [2024-01-04 20:56:53,611][model8_pretrain.py][INFO] Epoch:[0/2](485000/4588595) loss:3.415 lr:0.0000100 epoch_Time:26035.0min: [2024-01-04 
20:57:30,548][model8_pretrain.py][INFO] Epoch:[0/2](485100/4588595) loss:3.071 lr:0.0000100 epoch_Time:26035.0min: [2024-01-04 20:57:30,548][model8_pretrain.py][INFO] Epoch:[0/2](485100/4588595) loss:3.190 lr:0.0000100 epoch_Time:26035.0min: [2024-01-04 20:57:30,548][model8_pretrain.py][INFO] Epoch:[0/2](485100/4588595) loss:2.406 lr:0.0000100 epoch_Time:26035.0min: [2024-01-04 20:57:30,548][model8_pretrain.py][INFO] Epoch:[0/2](485100/4588595) loss:2.929 lr:0.0000100 epoch_Time:26035.0min: [2024-01-04 20:57:30,548][model8_pretrain.py][INFO] Epoch:[0/2](485100/4588595) loss:3.088 lr:0.0000100 epoch_Time:26035.0min: [2024-01-04 20:57:30,548][model8_pretrain.py][INFO] Epoch:[0/2](485100/4588595) loss:3.425 lr:0.0000100 epoch_Time:26035.0min: [2024-01-04 20:57:30,548][model8_pretrain.py][INFO] Epoch:[0/2](485100/4588595) loss:3.020 lr:0.0000100 epoch_Time:26035.0min: [2024-01-04 20:57:30,548][model8_pretrain.py][INFO] Epoch:[0/2](485100/4588595) loss:2.799 lr:0.0000100 epoch_Time:26035.0min: [2024-01-04 20:58:17,904][model8_pretrain.py][INFO] Epoch:[0/2](485200/4588595) loss:2.583 lr:0.0000100 epoch_Time:26035.0min: [2024-01-04 20:58:17,904][model8_pretrain.py][INFO] Epoch:[0/2](485200/4588595) loss:2.336 lr:0.0000100 epoch_Time:26035.0min: [2024-01-04 20:58:17,904][model8_pretrain.py][INFO] Epoch:[0/2](485200/4588595) loss:2.907 lr:0.0000100 epoch_Time:26035.0min: [2024-01-04 20:58:17,904][model8_pretrain.py][INFO] Epoch:[0/2](485200/4588595) loss:2.577 lr:0.0000100 epoch_Time:26035.0min: [2024-01-04 20:58:17,904][model8_pretrain.py][INFO] Epoch:[0/2](485200/4588595) loss:2.662 lr:0.0000100 epoch_Time:26035.0min: [2024-01-04 20:58:17,905][model8_pretrain.py][INFO] Epoch:[0/2](485200/4588595) loss:3.381 lr:0.0000100 epoch_Time:26035.0min: [2024-01-04 20:58:17,905][model8_pretrain.py][INFO] Epoch:[0/2](485200/4588595) loss:2.982 lr:0.0000100 epoch_Time:26035.0min: [2024-01-04 20:58:17,905][model8_pretrain.py][INFO] Epoch:[0/2](485200/4588595) loss:2.472 lr:0.0000100 
epoch_Time:26035.0min: [2024-01-04 20:58:54,834][model8_pretrain.py][INFO] Epoch:[0/2](485300/4588595) loss:2.786 lr:0.0000100 epoch_Time:26034.0min: [2024-01-04 20:58:54,834][model8_pretrain.py][INFO] Epoch:[0/2](485300/4588595) loss:2.862 lr:0.0000100 epoch_Time:26034.0min: [2024-01-04 20:58:54,834][model8_pretrain.py][INFO] Epoch:[0/2](485300/4588595) loss:3.432 lr:0.0000100 epoch_Time:26034.0min: [2024-01-04 20:58:54,834][model8_pretrain.py][INFO] Epoch:[0/2](485300/4588595) loss:2.341 lr:0.0000100 epoch_Time:26034.0min: [2024-01-04 20:58:54,834][model8_pretrain.py][INFO] Epoch:[0/2](485300/4588595) loss:2.541 lr:0.0000100 epoch_Time:26034.0min: [2024-01-04 20:58:54,834][model8_pretrain.py][INFO] Epoch:[0/2](485300/4588595) loss:3.129 lr:0.0000100 epoch_Time:26034.0min: [2024-01-04 20:58:54,834][model8_pretrain.py][INFO] Epoch:[0/2](485300/4588595) loss:3.064 lr:0.0000100 epoch_Time:26034.0min: [2024-01-04 20:58:54,834][model8_pretrain.py][INFO] Epoch:[0/2](485300/4588595) loss:3.141 lr:0.0000100 epoch_Time:26034.0min: [2024-01-04 20:59:31,783][model8_pretrain.py][INFO] Epoch:[0/2](485400/4588595) loss:2.798 lr:0.0000100 epoch_Time:26034.0min: [2024-01-04 20:59:31,783][model8_pretrain.py][INFO] Epoch:[0/2](485400/4588595) loss:2.849 lr:0.0000100 epoch_Time:26034.0min: [2024-01-04 20:59:31,783][model8_pretrain.py][INFO] Epoch:[0/2](485400/4588595) loss:2.929 lr:0.0000100 epoch_Time:26034.0min: [2024-01-04 20:59:31,783][model8_pretrain.py][INFO] Epoch:[0/2](485400/4588595) loss:2.697 lr:0.0000100 epoch_Time:26034.0min: [2024-01-04 20:59:31,783][model8_pretrain.py][INFO] Epoch:[0/2](485400/4588595) loss:3.088 lr:0.0000100 epoch_Time:26034.0min: [2024-01-04 20:59:31,783][model8_pretrain.py][INFO] Epoch:[0/2](485400/4588595) loss:3.122 lr:0.0000100 epoch_Time:26034.0min: [2024-01-04 20:59:31,783][model8_pretrain.py][INFO] Epoch:[0/2](485400/4588595) loss:3.352 lr:0.0000100 epoch_Time:26034.0min: [2024-01-04 20:59:31,783][model8_pretrain.py][INFO] 
Epoch:[0/2](485400/4588595) loss:3.219 lr:0.0000100 epoch_Time:26034.0min: [2024-01-04 21:00:08,720][model8_pretrain.py][INFO] Epoch:[0/2](485500/4588595) loss:2.907 lr:0.0000100 epoch_Time:26033.0min: [2024-01-04 21:00:08,720][model8_pretrain.py][INFO] Epoch:[0/2](485500/4588595) loss:2.993 lr:0.0000100 epoch_Time:26033.0min: [2024-01-04 21:00:08,720][model8_pretrain.py][INFO] Epoch:[0/2](485500/4588595) loss:2.974 lr:0.0000100 epoch_Time:26033.0min: [2024-01-04 21:00:08,720][model8_pretrain.py][INFO] Epoch:[0/2](485500/4588595) loss:1.968 lr:0.0000100 epoch_Time:26033.0min: [2024-01-04 21:00:08,720][model8_pretrain.py][INFO] Epoch:[0/2](485500/4588595) loss:2.682 lr:0.0000100 epoch_Time:26033.0min: [2024-01-04 21:00:08,720][model8_pretrain.py][INFO] Epoch:[0/2](485500/4588595) loss:2.203 lr:0.0000100 epoch_Time:26033.0min: [2024-01-04 21:00:08,720][model8_pretrain.py][INFO] Epoch:[0/2](485500/4588595) loss:2.625 lr:0.0000100 epoch_Time:26033.0min: [2024-01-04 21:00:08,720][model8_pretrain.py][INFO] Epoch:[0/2](485500/4588595) loss:3.470 lr:0.0000100 epoch_Time:26033.0min: [2024-01-04 21:00:45,649][model8_pretrain.py][INFO] Epoch:[0/2](485600/4588595) loss:3.135 lr:0.0000100 epoch_Time:26033.0min: [2024-01-04 21:00:45,649][model8_pretrain.py][INFO] Epoch:[0/2](485600/4588595) loss:3.332 lr:0.0000100 epoch_Time:26033.0min: [2024-01-04 21:00:45,649][model8_pretrain.py][INFO] Epoch:[0/2](485600/4588595) loss:2.837 lr:0.0000100 epoch_Time:26033.0min: [2024-01-04 21:00:45,649][model8_pretrain.py][INFO] Epoch:[0/2](485600/4588595) loss:3.244 lr:0.0000100 epoch_Time:26033.0min: [2024-01-04 21:00:45,649][model8_pretrain.py][INFO] Epoch:[0/2](485600/4588595) loss:3.120 lr:0.0000100 epoch_Time:26033.0min: [2024-01-04 21:00:45,649][model8_pretrain.py][INFO] Epoch:[0/2](485600/4588595) loss:2.565 lr:0.0000100 epoch_Time:26033.0min: [2024-01-04 21:00:45,649][model8_pretrain.py][INFO] Epoch:[0/2](485600/4588595) loss:3.355 lr:0.0000100 epoch_Time:26033.0min: [2024-01-04 
21:00:45,649][model8_pretrain.py][INFO] Epoch:[0/2](485600/4588595) loss:2.819 lr:0.0000100 epoch_Time:26033.0min: [2024-01-04 21:01:22,589][model8_pretrain.py][INFO] Epoch:[0/2](485700/4588595) loss:3.182 lr:0.0000100 epoch_Time:26031.0min: [2024-01-04 21:01:22,589][model8_pretrain.py][INFO] Epoch:[0/2](485700/4588595) loss:2.377 lr:0.0000100 epoch_Time:26031.0min: [2024-01-04 21:01:22,589][model8_pretrain.py][INFO] Epoch:[0/2](485700/4588595) loss:3.024 lr:0.0000100 epoch_Time:26031.0min: [2024-01-04 21:01:22,590][model8_pretrain.py][INFO] Epoch:[0/2](485700/4588595) loss:3.359 lr:0.0000100 epoch_Time:26031.0min: [2024-01-04 21:01:22,590][model8_pretrain.py][INFO] Epoch:[0/2](485700/4588595) loss:2.511 lr:0.0000100 epoch_Time:26031.0min: [2024-01-04 21:01:22,590][model8_pretrain.py][INFO] Epoch:[0/2](485700/4588595) loss:2.448 lr:0.0000100 epoch_Time:26031.0min: [2024-01-04 21:01:22,590][model8_pretrain.py][INFO] Epoch:[0/2](485700/4588595) loss:2.246 lr:0.0000100 epoch_Time:26031.0min: [2024-01-04 21:01:22,590][model8_pretrain.py][INFO] Epoch:[0/2](485700/4588595) loss:2.604 lr:0.0000100 epoch_Time:26031.0min: [2024-01-04 21:01:59,527][model8_pretrain.py][INFO] Epoch:[0/2](485800/4588595) loss:3.033 lr:0.0000100 epoch_Time:26030.0min: [2024-01-04 21:01:59,527][model8_pretrain.py][INFO] Epoch:[0/2](485800/4588595) loss:3.038 lr:0.0000100 epoch_Time:26030.0min: [2024-01-04 21:01:59,527][model8_pretrain.py][INFO] Epoch:[0/2](485800/4588595) loss:2.751 lr:0.0000100 epoch_Time:26030.0min: [2024-01-04 21:01:59,527][model8_pretrain.py][INFO] Epoch:[0/2](485800/4588595) loss:2.633 lr:0.0000100 epoch_Time:26030.0min: [2024-01-04 21:01:59,527][model8_pretrain.py][INFO] Epoch:[0/2](485800/4588595) loss:2.846 lr:0.0000100 epoch_Time:26030.0min: [2024-01-04 21:01:59,527][model8_pretrain.py][INFO] Epoch:[0/2](485800/4588595) loss:2.804 lr:0.0000100 epoch_Time:26030.0min: [2024-01-04 21:01:59,527][model8_pretrain.py][INFO] Epoch:[0/2](485800/4588595) loss:2.712 lr:0.0000100 
epoch_Time:26030.0min: [2024-01-04 21:01:59,527][model8_pretrain.py][INFO] Epoch:[0/2](485800/4588595) loss:2.852 lr:0.0000100 epoch_Time:26030.0min: [2024-01-04 21:02:36,459][model8_pretrain.py][INFO] Epoch:[0/2](485900/4588595) loss:3.318 lr:0.0000100 epoch_Time:26030.0min: [2024-01-04 21:02:36,459][model8_pretrain.py][INFO] Epoch:[0/2](485900/4588595) loss:2.134 lr:0.0000100 epoch_Time:26030.0min: [2024-01-04 21:02:36,459][model8_pretrain.py][INFO] Epoch:[0/2](485900/4588595) loss:3.030 lr:0.0000100 epoch_Time:26030.0min: [2024-01-04 21:02:36,459][model8_pretrain.py][INFO] Epoch:[0/2](485900/4588595) loss:2.488 lr:0.0000100 epoch_Time:26030.0min: [2024-01-04 21:02:36,459][model8_pretrain.py][INFO] Epoch:[0/2](485900/4588595) loss:2.951 lr:0.0000100 epoch_Time:26030.0min: [2024-01-04 21:02:36,459][model8_pretrain.py][INFO] Epoch:[0/2](485900/4588595) loss:3.122 lr:0.0000100 epoch_Time:26030.0min: [2024-01-04 21:02:36,459][model8_pretrain.py][INFO] Epoch:[0/2](485900/4588595) loss:3.056 lr:0.0000100 epoch_Time:26030.0min: [2024-01-04 21:02:36,459][model8_pretrain.py][INFO] Epoch:[0/2](485900/4588595) loss:2.995 lr:0.0000100 epoch_Time:26030.0min: [2024-01-04 21:03:24,062][model8_pretrain.py][INFO] Epoch:[0/2](486000/4588595) loss:3.480 lr:0.0000100 epoch_Time:26030.0min: [2024-01-04 21:03:24,062][model8_pretrain.py][INFO] Epoch:[0/2](486000/4588595) loss:2.812 lr:0.0000100 epoch_Time:26030.0min: [2024-01-04 21:03:24,062][model8_pretrain.py][INFO] Epoch:[0/2](486000/4588595) loss:3.141 lr:0.0000100 epoch_Time:26030.0min: [2024-01-04 21:03:24,062][model8_pretrain.py][INFO] Epoch:[0/2](486000/4588595) loss:2.723 lr:0.0000100 epoch_Time:26030.0min: [2024-01-04 21:03:24,062][model8_pretrain.py][INFO] Epoch:[0/2](486000/4588595) loss:2.853 lr:0.0000100 epoch_Time:26030.0min: [2024-01-04 21:03:24,062][model8_pretrain.py][INFO] Epoch:[0/2](486000/4588595) loss:3.088 lr:0.0000100 epoch_Time:26030.0min: [2024-01-04 21:03:24,062][model8_pretrain.py][INFO] 
Epoch:[0/2](486000/4588595) loss:2.672 lr:0.0000100 epoch_Time:26030.0min: [2024-01-04 21:03:24,063][model8_pretrain.py][INFO] Epoch:[0/2](486000/4588595) loss:3.258 lr:0.0000100 epoch_Time:26030.0min: [2024-01-04 21:04:01,003][model8_pretrain.py][INFO] Epoch:[0/2](486100/4588595) loss:2.443 lr:0.0000100 epoch_Time:26029.0min: [2024-01-04 21:04:01,003][model8_pretrain.py][INFO] Epoch:[0/2](486100/4588595) loss:2.762 lr:0.0000100 epoch_Time:26029.0min: [2024-01-04 21:04:01,003][model8_pretrain.py][INFO] Epoch:[0/2](486100/4588595) loss:3.020 lr:0.0000100 epoch_Time:26029.0min: [2024-01-04 21:04:01,003][model8_pretrain.py][INFO] Epoch:[0/2](486100/4588595) loss:2.925 lr:0.0000100 epoch_Time:26029.0min: [2024-01-04 21:04:01,003][model8_pretrain.py][INFO] Epoch:[0/2](486100/4588595) loss:3.397 lr:0.0000100 epoch_Time:26029.0min: [2024-01-04 21:04:01,003][model8_pretrain.py][INFO] Epoch:[0/2](486100/4588595) loss:3.056 lr:0.0000100 epoch_Time:26029.0min: [2024-01-04 21:04:01,003][model8_pretrain.py][INFO] Epoch:[0/2](486100/4588595) loss:2.517 lr:0.0000100 epoch_Time:26029.0min: [2024-01-04 21:04:01,003][model8_pretrain.py][INFO] Epoch:[0/2](486100/4588595) loss:3.493 lr:0.0000100 epoch_Time:26029.0min: [2024-01-04 21:04:37,952][model8_pretrain.py][INFO] Epoch:[0/2](486200/4588595) loss:2.886 lr:0.0000100 epoch_Time:26029.0min: [2024-01-04 21:04:37,952][model8_pretrain.py][INFO] Epoch:[0/2](486200/4588595) loss:3.164 lr:0.0000100 epoch_Time:26029.0min: [2024-01-04 21:04:37,952][model8_pretrain.py][INFO] Epoch:[0/2](486200/4588595) loss:3.094 lr:0.0000100 epoch_Time:26029.0min: [2024-01-04 21:04:37,952][model8_pretrain.py][INFO] Epoch:[0/2](486200/4588595) loss:2.874 lr:0.0000100 epoch_Time:26029.0min: [2024-01-04 21:04:37,952][model8_pretrain.py][INFO] Epoch:[0/2](486200/4588595) loss:2.865 lr:0.0000100 epoch_Time:26029.0min: [2024-01-04 21:04:37,952][model8_pretrain.py][INFO] Epoch:[0/2](486200/4588595) loss:2.716 lr:0.0000100 epoch_Time:26029.0min: [2024-01-04 
21:04:37,952][model8_pretrain.py][INFO] Epoch:[0/2](486200/4588595) loss:2.836 lr:0.0000100 epoch_Time:26029.0min: [2024-01-04 21:04:37,952][model8_pretrain.py][INFO] Epoch:[0/2](486200/4588595) loss:2.905 lr:0.0000100 epoch_Time:26029.0min: [2024-01-04 21:05:14,896][model8_pretrain.py][INFO] Epoch:[0/2](486300/4588595) loss:3.398 lr:0.0000100 epoch_Time:26028.0min: [2024-01-04 21:05:14,896][model8_pretrain.py][INFO] Epoch:[0/2](486300/4588595) loss:2.904 lr:0.0000100 epoch_Time:26028.0min: [2024-01-04 21:05:14,896][model8_pretrain.py][INFO] Epoch:[0/2](486300/4588595) loss:2.749 lr:0.0000100 epoch_Time:26028.0min: [2024-01-04 21:05:14,896][model8_pretrain.py][INFO] Epoch:[0/2](486300/4588595) loss:3.056 lr:0.0000100 epoch_Time:26028.0min: [2024-01-04 21:05:14,896][model8_pretrain.py][INFO] Epoch:[0/2](486300/4588595) loss:2.507 lr:0.0000100 epoch_Time:26028.0min: [2024-01-04 21:05:14,896][model8_pretrain.py][INFO] Epoch:[0/2](486300/4588595) loss:2.950 lr:0.0000100 epoch_Time:26028.0min: [2024-01-04 21:05:14,896][model8_pretrain.py][INFO] Epoch:[0/2](486300/4588595) loss:2.766 lr:0.0000100 epoch_Time:26028.0min: [2024-01-04 21:05:14,896][model8_pretrain.py][INFO] Epoch:[0/2](486300/4588595) loss:2.900 lr:0.0000100 epoch_Time:26028.0min: [2024-01-04 21:05:51,837][model8_pretrain.py][INFO] Epoch:[0/2](486400/4588595) loss:2.476 lr:0.0000100 epoch_Time:26027.0min: [2024-01-04 21:05:51,837][model8_pretrain.py][INFO] Epoch:[0/2](486400/4588595) loss:2.677 lr:0.0000100 epoch_Time:26027.0min: [2024-01-04 21:05:51,837][model8_pretrain.py][INFO] Epoch:[0/2](486400/4588595) loss:3.090 lr:0.0000100 epoch_Time:26027.0min: [2024-01-04 21:05:51,837][model8_pretrain.py][INFO] Epoch:[0/2](486400/4588595) loss:2.610 lr:0.0000100 epoch_Time:26027.0min: [2024-01-04 21:05:51,837][model8_pretrain.py][INFO] Epoch:[0/2](486400/4588595) loss:3.139 lr:0.0000100 epoch_Time:26027.0min: [2024-01-04 21:05:51,837][model8_pretrain.py][INFO] Epoch:[0/2](486400/4588595) loss:2.844 lr:0.0000100 
epoch_Time:26027.0min: [2024-01-04 21:05:51,837][model8_pretrain.py][INFO] Epoch:[0/2](486400/4588595) loss:3.036 lr:0.0000100 epoch_Time:26027.0min: [2024-01-04 21:05:51,837][model8_pretrain.py][INFO] Epoch:[0/2](486400/4588595) loss:1.872 lr:0.0000100 epoch_Time:26027.0min: [2024-01-04 21:06:28,794][model8_pretrain.py][INFO] Epoch:[0/2](486500/4588595) loss:2.617 lr:0.0000100 epoch_Time:26027.0min: [2024-01-04 21:06:28,794][model8_pretrain.py][INFO] Epoch:[0/2](486500/4588595) loss:3.111 lr:0.0000100 epoch_Time:26027.0min: [2024-01-04 21:06:28,794][model8_pretrain.py][INFO] Epoch:[0/2](486500/4588595) loss:3.062 lr:0.0000100 epoch_Time:26027.0min: [2024-01-04 21:06:28,794][model8_pretrain.py][INFO] Epoch:[0/2](486500/4588595) loss:2.607 lr:0.0000100 epoch_Time:26027.0min: [2024-01-04 21:06:28,794][model8_pretrain.py][INFO] Epoch:[0/2](486500/4588595) loss:2.598 lr:0.0000100 epoch_Time:26027.0min: [2024-01-04 21:06:28,794][model8_pretrain.py][INFO] Epoch:[0/2](486500/4588595) loss:2.911 lr:0.0000100 epoch_Time:26027.0min: [2024-01-04 21:06:28,794][model8_pretrain.py][INFO] Epoch:[0/2](486500/4588595) loss:2.848 lr:0.0000100 epoch_Time:26027.0min: [2024-01-04 21:06:28,794][model8_pretrain.py][INFO] Epoch:[0/2](486500/4588595) loss:3.095 lr:0.0000100 epoch_Time:26027.0min: [2024-01-04 21:07:05,732][model8_pretrain.py][INFO] Epoch:[0/2](486600/4588595) loss:2.904 lr:0.0000100 epoch_Time:26025.0min: [2024-01-04 21:07:05,732][model8_pretrain.py][INFO] Epoch:[0/2](486600/4588595) loss:2.742 lr:0.0000100 epoch_Time:26025.0min: [2024-01-04 21:07:05,732][model8_pretrain.py][INFO] Epoch:[0/2](486600/4588595) loss:2.621 lr:0.0000100 epoch_Time:26025.0min: [2024-01-04 21:07:05,732][model8_pretrain.py][INFO] Epoch:[0/2](486600/4588595) loss:2.942 lr:0.0000100 epoch_Time:26025.0min: [2024-01-04 21:07:05,732][model8_pretrain.py][INFO] Epoch:[0/2](486600/4588595) loss:2.775 lr:0.0000100 epoch_Time:26025.0min: [2024-01-04 21:07:05,732][model8_pretrain.py][INFO] 
Epoch:[0/2](486600/4588595) loss:2.828 lr:0.0000100 epoch_Time:26025.0min: [2024-01-04 21:07:05,733][model8_pretrain.py][INFO] Epoch:[0/2](486600/4588595) loss:2.447 lr:0.0000100 epoch_Time:26025.0min: [2024-01-04 21:07:05,733][model8_pretrain.py][INFO] Epoch:[0/2](486600/4588595) loss:2.118 lr:0.0000100 epoch_Time:26025.0min: [2024-01-04 21:07:42,668][model8_pretrain.py][INFO] Epoch:[0/2](486700/4588595) loss:2.723 lr:0.0000100 epoch_Time:26025.0min: [2024-01-04 21:07:42,668][model8_pretrain.py][INFO] Epoch:[0/2](486700/4588595) loss:2.811 lr:0.0000100 epoch_Time:26025.0min: [2024-01-04 21:07:42,668][model8_pretrain.py][INFO] Epoch:[0/2](486700/4588595) loss:3.093 lr:0.0000100 epoch_Time:26025.0min: [2024-01-04 21:07:42,668][model8_pretrain.py][INFO] Epoch:[0/2](486700/4588595) loss:3.132 lr:0.0000100 epoch_Time:26025.0min: [2024-01-04 21:07:42,668][model8_pretrain.py][INFO] Epoch:[0/2](486700/4588595) loss:2.572 lr:0.0000100 epoch_Time:26025.0min: [2024-01-04 21:07:42,669][model8_pretrain.py][INFO] Epoch:[0/2](486700/4588595) loss:2.858 lr:0.0000100 epoch_Time:26025.0min: [2024-01-04 21:07:42,669][model8_pretrain.py][INFO] Epoch:[0/2](486700/4588595) loss:2.822 lr:0.0000100 epoch_Time:26025.0min: [2024-01-04 21:07:42,669][model8_pretrain.py][INFO] Epoch:[0/2](486700/4588595) loss:3.120 lr:0.0000100 epoch_Time:26025.0min: [2024-01-04 21:08:30,042][model8_pretrain.py][INFO] Epoch:[0/2](486800/4588595) loss:2.585 lr:0.0000100 epoch_Time:26026.0min: [2024-01-04 21:08:30,042][model8_pretrain.py][INFO] Epoch:[0/2](486800/4588595) loss:2.454 lr:0.0000100 epoch_Time:26026.0min: [2024-01-04 21:08:30,042][model8_pretrain.py][INFO] Epoch:[0/2](486800/4588595) loss:3.176 lr:0.0000100 epoch_Time:26026.0min: [2024-01-04 21:08:30,042][model8_pretrain.py][INFO] Epoch:[0/2](486800/4588595) loss:2.728 lr:0.0000100 epoch_Time:26026.0min: [2024-01-04 21:08:30,042][model8_pretrain.py][INFO] Epoch:[0/2](486800/4588595) loss:3.288 lr:0.0000100 epoch_Time:26026.0min: [2024-01-04 
21:08:30,042][model8_pretrain.py][INFO] Epoch:[0/2](486800/4588595) loss:3.150 lr:0.0000100 epoch_Time:26026.0min: [2024-01-04 21:08:30,042][model8_pretrain.py][INFO] Epoch:[0/2](486800/4588595) loss:2.904 lr:0.0000100 epoch_Time:26026.0min: [2024-01-04 21:08:30,042][model8_pretrain.py][INFO] Epoch:[0/2](486800/4588595) loss:3.142 lr:0.0000100 epoch_Time:26026.0min: [2024-01-04 21:09:06,938][model8_pretrain.py][INFO] Epoch:[0/2](486900/4588595) loss:2.451 lr:0.0000100 epoch_Time:26025.0min: [2024-01-04 21:09:06,938][model8_pretrain.py][INFO] Epoch:[0/2](486900/4588595) loss:3.120 lr:0.0000100 epoch_Time:26025.0min: [2024-01-04 21:09:06,938][model8_pretrain.py][INFO] Epoch:[0/2](486900/4588595) loss:3.096 lr:0.0000100 epoch_Time:26025.0min: [2024-01-04 21:09:06,938][model8_pretrain.py][INFO] Epoch:[0/2](486900/4588595) loss:3.171 lr:0.0000100 epoch_Time:26025.0min: [2024-01-04 21:09:06,938][model8_pretrain.py][INFO] Epoch:[0/2](486900/4588595) loss:2.522 lr:0.0000100 epoch_Time:26025.0min: [2024-01-04 21:09:06,938][model8_pretrain.py][INFO] Epoch:[0/2](486900/4588595) loss:2.672 lr:0.0000100 epoch_Time:26025.0min: [2024-01-04 21:09:06,938][model8_pretrain.py][INFO] Epoch:[0/2](486900/4588595) loss:2.985 lr:0.0000100 epoch_Time:26025.0min: [2024-01-04 21:09:06,938][model8_pretrain.py][INFO] Epoch:[0/2](486900/4588595) loss:2.612 lr:0.0000100 epoch_Time:26025.0min: [2024-01-04 21:09:43,873][model8_pretrain.py][INFO] Epoch:[0/2](487000/4588595) loss:2.889 lr:0.0000100 epoch_Time:26024.0min: [2024-01-04 21:09:43,873][model8_pretrain.py][INFO] Epoch:[0/2](487000/4588595) loss:2.724 lr:0.0000100 epoch_Time:26024.0min: [2024-01-04 21:09:43,873][model8_pretrain.py][INFO] Epoch:[0/2](487000/4588595) loss:2.718 lr:0.0000100 epoch_Time:26024.0min: [2024-01-04 21:09:43,873][model8_pretrain.py][INFO] Epoch:[0/2](487000/4588595) loss:3.254 lr:0.0000100 epoch_Time:26024.0min: [2024-01-04 21:09:43,873][model8_pretrain.py][INFO] Epoch:[0/2](487000/4588595) loss:2.487 lr:0.0000100 
epoch_Time:26024.0min: [2024-01-04 21:09:43,873][model8_pretrain.py][INFO] Epoch:[0/2](487000/4588595) loss:2.461 lr:0.0000100 epoch_Time:26024.0min: [2024-01-04 21:09:43,874][model8_pretrain.py][INFO] Epoch:[0/2](487000/4588595) loss:3.005 lr:0.0000100 epoch_Time:26024.0min: [2024-01-04 21:09:43,873][model8_pretrain.py][INFO] Epoch:[0/2](487000/4588595) loss:3.338 lr:0.0000100 epoch_Time:26024.0min: [2024-01-04 21:10:20,807][model8_pretrain.py][INFO] Epoch:[0/2](487100/4588595) loss:2.587 lr:0.0000100 epoch_Time:26023.0min: [2024-01-04 21:10:20,807][model8_pretrain.py][INFO] Epoch:[0/2](487100/4588595) loss:3.140 lr:0.0000100 epoch_Time:26023.0min: [2024-01-04 21:10:20,807][model8_pretrain.py][INFO] Epoch:[0/2](487100/4588595) loss:2.903 lr:0.0000100 epoch_Time:26023.0min: [2024-01-04 21:10:20,807][model8_pretrain.py][INFO] Epoch:[0/2](487100/4588595) loss:3.024 lr:0.0000100 epoch_Time:26023.0min: [2024-01-04 21:10:20,807][model8_pretrain.py][INFO] Epoch:[0/2](487100/4588595) loss:2.462 lr:0.0000100 epoch_Time:26023.0min: [2024-01-04 21:10:20,807][model8_pretrain.py][INFO] Epoch:[0/2](487100/4588595) loss:2.710 lr:0.0000100 epoch_Time:26023.0min: [2024-01-04 21:10:20,808][model8_pretrain.py][INFO] Epoch:[0/2](487100/4588595) loss:2.891 lr:0.0000100 epoch_Time:26023.0min: [2024-01-04 21:10:20,808][model8_pretrain.py][INFO] Epoch:[0/2](487100/4588595) loss:3.435 lr:0.0000100 epoch_Time:26023.0min: [2024-01-04 21:10:57,745][model8_pretrain.py][INFO] Epoch:[0/2](487200/4588595) loss:3.149 lr:0.0000100 epoch_Time:26022.0min: [2024-01-04 21:10:57,745][model8_pretrain.py][INFO] Epoch:[0/2](487200/4588595) loss:2.712 lr:0.0000100 epoch_Time:26022.0min: [2024-01-04 21:10:57,745][model8_pretrain.py][INFO] Epoch:[0/2](487200/4588595) loss:2.730 lr:0.0000100 epoch_Time:26022.0min: [2024-01-04 21:10:57,745][model8_pretrain.py][INFO] Epoch:[0/2](487200/4588595) loss:2.929 lr:0.0000100 epoch_Time:26022.0min: [2024-01-04 21:10:57,745][model8_pretrain.py][INFO] 
Epoch:[0/2](487200/4588595) loss:2.764 lr:0.0000100 epoch_Time:26022.0min: [2024-01-04 21:10:57,745][model8_pretrain.py][INFO] Epoch:[0/2](487200/4588595) loss:3.125 lr:0.0000100 epoch_Time:26022.0min: [2024-01-04 21:10:57,745][model8_pretrain.py][INFO] Epoch:[0/2](487200/4588595) loss:2.877 lr:0.0000100 epoch_Time:26022.0min: [2024-01-04 21:10:57,745][model8_pretrain.py][INFO] Epoch:[0/2](487200/4588595) loss:2.967 lr:0.0000100 epoch_Time:26022.0min: [2024-01-04 21:11:34,678][model8_pretrain.py][INFO] Epoch:[0/2](487300/4588595) loss:2.956 lr:0.0000100 epoch_Time:26022.0min: [2024-01-04 21:11:34,678][model8_pretrain.py][INFO] Epoch:[0/2](487300/4588595) loss:2.953 lr:0.0000100 epoch_Time:26022.0min: [2024-01-04 21:11:34,679][model8_pretrain.py][INFO] Epoch:[0/2](487300/4588595) loss:3.091 lr:0.0000100 epoch_Time:26022.0min: [2024-01-04 21:11:34,679][model8_pretrain.py][INFO] Epoch:[0/2](487300/4588595) loss:2.658 lr:0.0000100 epoch_Time:26022.0min: [2024-01-04 21:11:34,679][model8_pretrain.py][INFO] Epoch:[0/2](487300/4588595) loss:3.333 lr:0.0000100 epoch_Time:26022.0min: [2024-01-04 21:11:34,679][model8_pretrain.py][INFO] Epoch:[0/2](487300/4588595) loss:3.019 lr:0.0000100 epoch_Time:26022.0min: [2024-01-04 21:11:34,679][model8_pretrain.py][INFO] Epoch:[0/2](487300/4588595) loss:2.922 lr:0.0000100 epoch_Time:26022.0min: [2024-01-04 21:11:34,679][model8_pretrain.py][INFO] Epoch:[0/2](487300/4588595) loss:3.023 lr:0.0000100 epoch_Time:26022.0min: [2024-01-04 21:12:11,610][model8_pretrain.py][INFO] Epoch:[0/2](487400/4588595) loss:2.493 lr:0.0000100 epoch_Time:26021.0min: [2024-01-04 21:12:11,610][model8_pretrain.py][INFO] Epoch:[0/2](487400/4588595) loss:2.708 lr:0.0000100 epoch_Time:26021.0min: [2024-01-04 21:12:11,610][model8_pretrain.py][INFO] Epoch:[0/2](487400/4588595) loss:2.499 lr:0.0000100 epoch_Time:26021.0min: [2024-01-04 21:12:11,610][model8_pretrain.py][INFO] Epoch:[0/2](487400/4588595) loss:3.389 lr:0.0000100 epoch_Time:26021.0min: [2024-01-04 
21:12:11,610][model8_pretrain.py][INFO] Epoch:[0/2](487400/4588595) loss:2.684 lr:0.0000100 epoch_Time:26021.0min: [2024-01-04 21:12:11,610][model8_pretrain.py][INFO] Epoch:[0/2](487400/4588595) loss:2.691 lr:0.0000100 epoch_Time:26021.0min: [2024-01-04 21:12:11,610][model8_pretrain.py][INFO] Epoch:[0/2](487400/4588595) loss:2.724 lr:0.0000100 epoch_Time:26021.0min: [2024-01-04 21:12:11,610][model8_pretrain.py][INFO] Epoch:[0/2](487400/4588595) loss:3.029 lr:0.0000100 epoch_Time:26021.0min: [2024-01-04 21:12:48,550][model8_pretrain.py][INFO] Epoch:[0/2](487500/4588595) loss:3.147 lr:0.0000100 epoch_Time:26019.0min: [2024-01-04 21:12:48,550][model8_pretrain.py][INFO] Epoch:[0/2](487500/4588595) loss:2.195 lr:0.0000100 epoch_Time:26019.0min: [2024-01-04 21:12:48,550][model8_pretrain.py][INFO] Epoch:[0/2](487500/4588595) loss:2.874 lr:0.0000100 epoch_Time:26019.0min: [2024-01-04 21:12:48,550][model8_pretrain.py][INFO] Epoch:[0/2](487500/4588595) loss:2.579 lr:0.0000100 epoch_Time:26019.0min: [2024-01-04 21:12:48,550][model8_pretrain.py][INFO] Epoch:[0/2](487500/4588595) loss:2.709 lr:0.0000100 epoch_Time:26019.0min: [2024-01-04 21:12:48,550][model8_pretrain.py][INFO] Epoch:[0/2](487500/4588595) loss:2.804 lr:0.0000100 epoch_Time:26019.0min: [2024-01-04 21:12:48,550][model8_pretrain.py][INFO] Epoch:[0/2](487500/4588595) loss:3.247 lr:0.0000100 epoch_Time:26019.0min: [2024-01-04 21:12:48,550][model8_pretrain.py][INFO] Epoch:[0/2](487500/4588595) loss:2.782 lr:0.0000100 epoch_Time:26019.0min: [2024-01-04 21:13:35,973][model8_pretrain.py][INFO] Epoch:[0/2](487600/4588595) loss:2.685 lr:0.0000100 epoch_Time:26021.0min: [2024-01-04 21:13:35,973][model8_pretrain.py][INFO] Epoch:[0/2](487600/4588595) loss:2.236 lr:0.0000100 epoch_Time:26021.0min: [2024-01-04 21:13:35,973][model8_pretrain.py][INFO] Epoch:[0/2](487600/4588595) loss:2.881 lr:0.0000100 epoch_Time:26021.0min: [2024-01-04 21:13:35,974][model8_pretrain.py][INFO] Epoch:[0/2](487600/4588595) loss:2.968 lr:0.0000100 
epoch_Time:26021.0min: [2024-01-04 21:13:35,974][model8_pretrain.py][INFO] Epoch:[0/2](487600/4588595) loss:3.283 lr:0.0000100 epoch_Time:26021.0min: [2024-01-04 21:13:35,974][model8_pretrain.py][INFO] Epoch:[0/2](487600/4588595) loss:3.064 lr:0.0000100 epoch_Time:26021.0min: [2024-01-04 21:13:35,974][model8_pretrain.py][INFO] Epoch:[0/2](487600/4588595) loss:2.893 lr:0.0000100 epoch_Time:26021.0min: [2024-01-04 21:13:35,974][model8_pretrain.py][INFO] Epoch:[0/2](487600/4588595) loss:2.588 lr:0.0000100 epoch_Time:26021.0min: [2024-01-04 21:14:12,900][model8_pretrain.py][INFO] Epoch:[0/2](487700/4588595) loss:2.663 lr:0.0000100 epoch_Time:26020.0min: [2024-01-04 21:14:12,900][model8_pretrain.py][INFO] Epoch:[0/2](487700/4588595) loss:2.071 lr:0.0000100 epoch_Time:26020.0min: [2024-01-04 21:14:12,900][model8_pretrain.py][INFO] Epoch:[0/2](487700/4588595) loss:3.204 lr:0.0000100 epoch_Time:26020.0min: [2024-01-04 21:14:12,900][model8_pretrain.py][INFO] Epoch:[0/2](487700/4588595) loss:3.103 lr:0.0000100 epoch_Time:26020.0min: [2024-01-04 21:14:12,900][model8_pretrain.py][INFO] Epoch:[0/2](487700/4588595) loss:2.982 lr:0.0000100 epoch_Time:26020.0min: [2024-01-04 21:14:12,900][model8_pretrain.py][INFO] Epoch:[0/2](487700/4588595) loss:2.838 lr:0.0000100 epoch_Time:26020.0min: [2024-01-04 21:14:12,900][model8_pretrain.py][INFO] Epoch:[0/2](487700/4588595) loss:2.519 lr:0.0000100 epoch_Time:26020.0min: [2024-01-04 21:14:12,900][model8_pretrain.py][INFO] Epoch:[0/2](487700/4588595) loss:2.202 lr:0.0000100 epoch_Time:26020.0min: [2024-01-04 21:14:49,851][model8_pretrain.py][INFO] Epoch:[0/2](487800/4588595) loss:3.162 lr:0.0000100 epoch_Time:26019.0min: [2024-01-04 21:14:49,851][model8_pretrain.py][INFO] Epoch:[0/2](487800/4588595) loss:2.734 lr:0.0000100 epoch_Time:26019.0min: [2024-01-04 21:14:49,851][model8_pretrain.py][INFO] Epoch:[0/2](487800/4588595) loss:3.302 lr:0.0000100 epoch_Time:26019.0min: [2024-01-04 21:14:49,851][model8_pretrain.py][INFO] 
Epoch:[0/2](487800/4588595) loss:2.827 lr:0.0000100 epoch_Time:26019.0min: [2024-01-04 21:14:49,852][model8_pretrain.py][INFO] Epoch:[0/2](487800/4588595) loss:2.865 lr:0.0000100 epoch_Time:26019.0min: [2024-01-04 21:14:49,852][model8_pretrain.py][INFO] Epoch:[0/2](487800/4588595) loss:2.405 lr:0.0000100 epoch_Time:26019.0min: [2024-01-04 21:14:49,852][model8_pretrain.py][INFO] Epoch:[0/2](487800/4588595) loss:2.623 lr:0.0000100 epoch_Time:26019.0min: [2024-01-04 21:14:49,852][model8_pretrain.py][INFO] Epoch:[0/2](487800/4588595) loss:3.092 lr:0.0000100 epoch_Time:26019.0min: [2024-01-04 21:15:26,793][model8_pretrain.py][INFO] Epoch:[0/2](487900/4588595) loss:3.108 lr:0.0000100 epoch_Time:26018.0min: [2024-01-04 21:15:26,793][model8_pretrain.py][INFO] Epoch:[0/2](487900/4588595) loss:2.352 lr:0.0000100 epoch_Time:26018.0min: [2024-01-04 21:15:26,793][model8_pretrain.py][INFO] Epoch:[0/2](487900/4588595) loss:3.502 lr:0.0000100 epoch_Time:26018.0min: [2024-01-04 21:15:26,793][model8_pretrain.py][INFO] Epoch:[0/2](487900/4588595) loss:2.707 lr:0.0000100 epoch_Time:26018.0min: [2024-01-04 21:15:26,793][model8_pretrain.py][INFO] Epoch:[0/2](487900/4588595) loss:2.563 lr:0.0000100 epoch_Time:26018.0min: [2024-01-04 21:15:26,794][model8_pretrain.py][INFO] Epoch:[0/2](487900/4588595) loss:2.919 lr:0.0000100 epoch_Time:26018.0min: [2024-01-04 21:15:26,794][model8_pretrain.py][INFO] Epoch:[0/2](487900/4588595) loss:2.768 lr:0.0000100 epoch_Time:26018.0min: [2024-01-04 21:15:26,794][model8_pretrain.py][INFO] Epoch:[0/2](487900/4588595) loss:3.188 lr:0.0000100 epoch_Time:26018.0min: [2024-01-04 21:16:03,731][model8_pretrain.py][INFO] Epoch:[0/2](488000/4588595) loss:2.128 lr:0.0000100 epoch_Time:26017.0min: [2024-01-04 21:16:03,731][model8_pretrain.py][INFO] Epoch:[0/2](488000/4588595) loss:2.138 lr:0.0000100 epoch_Time:26017.0min: [2024-01-04 21:16:03,732][model8_pretrain.py][INFO] Epoch:[0/2](488000/4588595) loss:2.743 lr:0.0000100 epoch_Time:26017.0min: [2024-01-04 
21:16:03,732][model8_pretrain.py][INFO] Epoch:[0/2](488000/4588595) loss:2.780 lr:0.0000100 epoch_Time:26017.0min: [2024-01-04 21:16:03,732][model8_pretrain.py][INFO] Epoch:[0/2](488000/4588595) loss:2.532 lr:0.0000100 epoch_Time:26017.0min: [2024-01-04 21:16:03,732][model8_pretrain.py][INFO] Epoch:[0/2](488000/4588595) loss:2.871 lr:0.0000100 epoch_Time:26017.0min: [2024-01-04 21:16:03,732][model8_pretrain.py][INFO] Epoch:[0/2](488000/4588595) loss:2.963 lr:0.0000100 epoch_Time:26017.0min: [2024-01-04 21:16:03,732][model8_pretrain.py][INFO] Epoch:[0/2](488000/4588595) loss:3.062 lr:0.0000100 epoch_Time:26017.0min: [2024-01-04 21:16:40,692][model8_pretrain.py][INFO] Epoch:[0/2](488100/4588595) loss:2.667 lr:0.0000100 epoch_Time:26017.0min: [2024-01-04 21:16:40,692][model8_pretrain.py][INFO] Epoch:[0/2](488100/4588595) loss:3.115 lr:0.0000100 epoch_Time:26017.0min: [2024-01-04 21:16:40,692][model8_pretrain.py][INFO] Epoch:[0/2](488100/4588595) loss:2.385 lr:0.0000100 epoch_Time:26017.0min: [2024-01-04 21:16:40,692][model8_pretrain.py][INFO] Epoch:[0/2](488100/4588595) loss:2.776 lr:0.0000100 epoch_Time:26017.0min: [2024-01-04 21:16:40,692][model8_pretrain.py][INFO] Epoch:[0/2](488100/4588595) loss:2.783 lr:0.0000100 epoch_Time:26017.0min: [2024-01-04 21:16:40,692][model8_pretrain.py][INFO] Epoch:[0/2](488100/4588595) loss:3.299 lr:0.0000100 epoch_Time:26017.0min: [2024-01-04 21:16:40,692][model8_pretrain.py][INFO] Epoch:[0/2](488100/4588595) loss:3.032 lr:0.0000100 epoch_Time:26017.0min: [2024-01-04 21:16:40,692][model8_pretrain.py][INFO] Epoch:[0/2](488100/4588595) loss:2.996 lr:0.0000100 epoch_Time:26017.0min: [2024-01-04 21:17:17,623][model8_pretrain.py][INFO] Epoch:[0/2](488200/4588595) loss:2.622 lr:0.0000100 epoch_Time:26016.0min: [2024-01-04 21:17:17,623][model8_pretrain.py][INFO] Epoch:[0/2](488200/4588595) loss:2.182 lr:0.0000100 epoch_Time:26016.0min: [2024-01-04 21:17:17,623][model8_pretrain.py][INFO] Epoch:[0/2](488200/4588595) loss:2.624 lr:0.0000100 
epoch_Time:26016.0min: [2024-01-04 21:17:17,623][model8_pretrain.py][INFO] Epoch:[0/2](488200/4588595) loss:3.140 lr:0.0000100 epoch_Time:26016.0min: [2024-01-04 21:17:17,623][model8_pretrain.py][INFO] Epoch:[0/2](488200/4588595) loss:2.586 lr:0.0000100 epoch_Time:26016.0min: [2024-01-04 21:17:17,623][model8_pretrain.py][INFO] Epoch:[0/2](488200/4588595) loss:2.207 lr:0.0000100 epoch_Time:26016.0min: [2024-01-04 21:17:17,623][model8_pretrain.py][INFO] Epoch:[0/2](488200/4588595) loss:3.221 lr:0.0000100 epoch_Time:26016.0min: [2024-01-04 21:17:17,623][model8_pretrain.py][INFO] Epoch:[0/2](488200/4588595) loss:2.926 lr:0.0000100 epoch_Time:26016.0min: [2024-01-04 21:17:54,556][model8_pretrain.py][INFO] Epoch:[0/2](488300/4588595) loss:3.322 lr:0.0000100 epoch_Time:26015.0min: [2024-01-04 21:17:54,556][model8_pretrain.py][INFO] Epoch:[0/2](488300/4588595) loss:2.840 lr:0.0000100 epoch_Time:26015.0min: [2024-01-04 21:17:54,556][model8_pretrain.py][INFO] Epoch:[0/2](488300/4588595) loss:3.177 lr:0.0000100 epoch_Time:26015.0min: [2024-01-04 21:17:54,556][model8_pretrain.py][INFO] Epoch:[0/2](488300/4588595) loss:2.923 lr:0.0000100 epoch_Time:26015.0min: [2024-01-04 21:17:54,556][model8_pretrain.py][INFO] Epoch:[0/2](488300/4588595) loss:3.110 lr:0.0000100 epoch_Time:26015.0min: [2024-01-04 21:17:54,556][model8_pretrain.py][INFO] Epoch:[0/2](488300/4588595) loss:2.595 lr:0.0000100 epoch_Time:26015.0min: [2024-01-04 21:17:54,556][model8_pretrain.py][INFO] Epoch:[0/2](488300/4588595) loss:2.930 lr:0.0000100 epoch_Time:26015.0min: [2024-01-04 21:17:54,556][model8_pretrain.py][INFO] Epoch:[0/2](488300/4588595) loss:2.595 lr:0.0000100 epoch_Time:26015.0min: [2024-01-04 21:18:41,935][model8_pretrain.py][INFO] Epoch:[0/2](488400/4588595) loss:2.637 lr:0.0000100 epoch_Time:26016.0min: [2024-01-04 21:18:41,935][model8_pretrain.py][INFO] Epoch:[0/2](488400/4588595) loss:3.117 lr:0.0000100 epoch_Time:26016.0min: [2024-01-04 21:18:41,935][model8_pretrain.py][INFO] 
Epoch:[0/2](488400/4588595) loss:2.297 lr:0.0000100 epoch_Time:26016.0min: [2024-01-04 21:18:41,935][model8_pretrain.py][INFO] Epoch:[0/2](488400/4588595) loss:2.976 lr:0.0000100 epoch_Time:26016.0min: [2024-01-04 21:18:41,935][model8_pretrain.py][INFO] Epoch:[0/2](488400/4588595) loss:2.824 lr:0.0000100 epoch_Time:26016.0min: [2024-01-04 21:18:41,935][model8_pretrain.py][INFO] Epoch:[0/2](488400/4588595) loss:2.811 lr:0.0000100 epoch_Time:26016.0min: [2024-01-04 21:18:41,935][model8_pretrain.py][INFO] Epoch:[0/2](488400/4588595) loss:2.812 lr:0.0000100 epoch_Time:26016.0min: [2024-01-04 21:18:41,935][model8_pretrain.py][INFO] Epoch:[0/2](488400/4588595) loss:3.194 lr:0.0000100 epoch_Time:26016.0min: [2024-01-04 21:19:18,859][model8_pretrain.py][INFO] Epoch:[0/2](488500/4588595) loss:3.075 lr:0.0000100 epoch_Time:26015.0min: [2024-01-04 21:19:18,859][model8_pretrain.py][INFO] Epoch:[0/2](488500/4588595) loss:2.807 lr:0.0000100 epoch_Time:26015.0min: [2024-01-04 21:19:18,859][model8_pretrain.py][INFO] Epoch:[0/2](488500/4588595) loss:3.057 lr:0.0000100 epoch_Time:26015.0min: [2024-01-04 21:19:18,859][model8_pretrain.py][INFO] Epoch:[0/2](488500/4588595) loss:3.028 lr:0.0000100 epoch_Time:26015.0min: [2024-01-04 21:19:18,859][model8_pretrain.py][INFO] Epoch:[0/2](488500/4588595) loss:2.503 lr:0.0000100 epoch_Time:26015.0min: [2024-01-04 21:19:18,859][model8_pretrain.py][INFO] Epoch:[0/2](488500/4588595) loss:2.725 lr:0.0000100 epoch_Time:26015.0min: [2024-01-04 21:19:18,859][model8_pretrain.py][INFO] Epoch:[0/2](488500/4588595) loss:2.473 lr:0.0000100 epoch_Time:26015.0min: [2024-01-04 21:19:18,859][model8_pretrain.py][INFO] Epoch:[0/2](488500/4588595) loss:3.280 lr:0.0000100 epoch_Time:26015.0min: [2024-01-04 21:19:55,802][model8_pretrain.py][INFO] Epoch:[0/2](488600/4588595) loss:2.950 lr:0.0000100 epoch_Time:26014.0min: [2024-01-04 21:19:55,802][model8_pretrain.py][INFO] Epoch:[0/2](488600/4588595) loss:3.134 lr:0.0000100 epoch_Time:26014.0min: [2024-01-04 
21:19:55,802][model8_pretrain.py][INFO] Epoch:[0/2](488600/4588595) loss:2.458 lr:0.0000100 epoch_Time:26014.0min: [2024-01-04 21:19:55,802][model8_pretrain.py][INFO] Epoch:[0/2](488600/4588595) loss:3.002 lr:0.0000100 epoch_Time:26014.0min: [2024-01-04 21:19:55,802][model8_pretrain.py][INFO] Epoch:[0/2](488600/4588595) loss:2.924 lr:0.0000100 epoch_Time:26014.0min: [2024-01-04 21:19:55,802][model8_pretrain.py][INFO] Epoch:[0/2](488600/4588595) loss:2.677 lr:0.0000100 epoch_Time:26014.0min: [2024-01-04 21:19:55,802][model8_pretrain.py][INFO] Epoch:[0/2](488600/4588595) loss:3.226 lr:0.0000100 epoch_Time:26014.0min: [2024-01-04 21:19:55,802][model8_pretrain.py][INFO] Epoch:[0/2](488600/4588595) loss:2.602 lr:0.0000100 epoch_Time:26014.0min: [2024-01-04 21:20:32,752][model8_pretrain.py][INFO] Epoch:[0/2](488700/4588595) loss:2.985 lr:0.0000100 epoch_Time:26014.0min: [2024-01-04 21:20:32,752][model8_pretrain.py][INFO] Epoch:[0/2](488700/4588595) loss:3.266 lr:0.0000100 epoch_Time:26014.0min: [2024-01-04 21:20:32,752][model8_pretrain.py][INFO] Epoch:[0/2](488700/4588595) loss:2.759 lr:0.0000100 epoch_Time:26014.0min: [2024-01-04 21:20:32,752][model8_pretrain.py][INFO] Epoch:[0/2](488700/4588595) loss:2.787 lr:0.0000100 epoch_Time:26014.0min: [2024-01-04 21:20:32,752][model8_pretrain.py][INFO] Epoch:[0/2](488700/4588595) loss:3.370 lr:0.0000100 epoch_Time:26014.0min: [2024-01-04 21:20:32,752][model8_pretrain.py][INFO] Epoch:[0/2](488700/4588595) loss:2.633 lr:0.0000100 epoch_Time:26014.0min: [2024-01-04 21:20:32,752][model8_pretrain.py][INFO] Epoch:[0/2](488700/4588595) loss:2.707 lr:0.0000100 epoch_Time:26014.0min: [2024-01-04 21:20:32,752][model8_pretrain.py][INFO] Epoch:[0/2](488700/4588595) loss:3.642 lr:0.0000100 epoch_Time:26014.0min: [2024-01-04 21:21:09,693][model8_pretrain.py][INFO] Epoch:[0/2](488800/4588595) loss:2.905 lr:0.0000100 epoch_Time:26012.0min: [2024-01-04 21:21:09,693][model8_pretrain.py][INFO] Epoch:[0/2](488800/4588595) loss:3.081 lr:0.0000100 
epoch_Time:26012.0min: [2024-01-04 21:21:09,693][model8_pretrain.py][INFO] Epoch:[0/2](488800/4588595) loss:3.268 lr:0.0000100 epoch_Time:26012.0min: [2024-01-04 21:21:09,693][model8_pretrain.py][INFO] Epoch:[0/2](488800/4588595) loss:2.780 lr:0.0000100 epoch_Time:26012.0min: [2024-01-04 21:21:09,693][model8_pretrain.py][INFO] Epoch:[0/2](488800/4588595) loss:2.887 lr:0.0000100 epoch_Time:26012.0min: [2024-01-04 21:21:09,693][model8_pretrain.py][INFO] Epoch:[0/2](488800/4588595) loss:2.905 lr:0.0000100 epoch_Time:26012.0min: [2024-01-04 21:21:09,693][model8_pretrain.py][INFO] Epoch:[0/2](488800/4588595) loss:3.292 lr:0.0000100 epoch_Time:26012.0min: [2024-01-04 21:21:09,693][model8_pretrain.py][INFO] Epoch:[0/2](488800/4588595) loss:2.719 lr:0.0000100 epoch_Time:26012.0min: [2024-01-04 21:21:46,629][model8_pretrain.py][INFO] Epoch:[0/2](488900/4588595) loss:3.285 lr:0.0000100 epoch_Time:26012.0min: [2024-01-04 21:21:46,629][model8_pretrain.py][INFO] Epoch:[0/2](488900/4588595) loss:2.780 lr:0.0000100 epoch_Time:26012.0min: [2024-01-04 21:21:46,629][model8_pretrain.py][INFO] Epoch:[0/2](488900/4588595) loss:3.172 lr:0.0000100 epoch_Time:26012.0min: [2024-01-04 21:21:46,629][model8_pretrain.py][INFO] Epoch:[0/2](488900/4588595) loss:2.719 lr:0.0000100 epoch_Time:26012.0min: [2024-01-04 21:21:46,629][model8_pretrain.py][INFO] Epoch:[0/2](488900/4588595) loss:2.567 lr:0.0000100 epoch_Time:26012.0min: [2024-01-04 21:21:46,629][model8_pretrain.py][INFO] Epoch:[0/2](488900/4588595) loss:3.511 lr:0.0000100 epoch_Time:26012.0min: [2024-01-04 21:21:46,629][model8_pretrain.py][INFO] Epoch:[0/2](488900/4588595) loss:3.295 lr:0.0000100 epoch_Time:26012.0min: [2024-01-04 21:21:46,629][model8_pretrain.py][INFO] Epoch:[0/2](488900/4588595) loss:2.566 lr:0.0000100 epoch_Time:26012.0min: [2024-01-04 21:22:23,568][model8_pretrain.py][INFO] Epoch:[0/2](489000/4588595) loss:2.771 lr:0.0000100 epoch_Time:26011.0min: [2024-01-04 21:22:23,568][model8_pretrain.py][INFO] 
Epoch:[0/2](489000/4588595) loss:2.762 lr:0.0000100 epoch_Time:26011.0min: [2024-01-04 21:22:23,568][model8_pretrain.py][INFO] Epoch:[0/2](489000/4588595) loss:3.356 lr:0.0000100 epoch_Time:26011.0min: [2024-01-04 21:22:23,568][model8_pretrain.py][INFO] Epoch:[0/2](489000/4588595) loss:2.817 lr:0.0000100 epoch_Time:26011.0min: [2024-01-04 21:22:23,568][model8_pretrain.py][INFO] Epoch:[0/2](489000/4588595) loss:2.233 lr:0.0000100 epoch_Time:26011.0min: [2024-01-04 21:22:23,568][model8_pretrain.py][INFO] Epoch:[0/2](489000/4588595) loss:2.700 lr:0.0000100 epoch_Time:26011.0min: [2024-01-04 21:22:23,568][model8_pretrain.py][INFO] Epoch:[0/2](489000/4588595) loss:2.978 lr:0.0000100 epoch_Time:26011.0min: [2024-01-04 21:22:23,568][model8_pretrain.py][INFO] Epoch:[0/2](489000/4588595) loss:2.811 lr:0.0000100 epoch_Time:26011.0min: [2024-01-04 21:23:00,497][model8_pretrain.py][INFO] Epoch:[0/2](489100/4588595) loss:3.196 lr:0.0000100 epoch_Time:26010.0min: [2024-01-04 21:23:00,498][model8_pretrain.py][INFO] Epoch:[0/2](489100/4588595) loss:3.243 lr:0.0000100 epoch_Time:26010.0min: [2024-01-04 21:23:00,498][model8_pretrain.py][INFO] Epoch:[0/2](489100/4588595) loss:2.877 lr:0.0000100 epoch_Time:26010.0min: [2024-01-04 21:23:00,498][model8_pretrain.py][INFO] Epoch:[0/2](489100/4588595) loss:3.078 lr:0.0000100 epoch_Time:26010.0min: [2024-01-04 21:23:00,498][model8_pretrain.py][INFO] Epoch:[0/2](489100/4588595) loss:3.255 lr:0.0000100 epoch_Time:26010.0min: [2024-01-04 21:23:00,498][model8_pretrain.py][INFO] Epoch:[0/2](489100/4588595) loss:2.970 lr:0.0000100 epoch_Time:26010.0min: [2024-01-04 21:23:00,498][model8_pretrain.py][INFO] Epoch:[0/2](489100/4588595) loss:3.265 lr:0.0000100 epoch_Time:26010.0min: [2024-01-04 21:23:00,498][model8_pretrain.py][INFO] Epoch:[0/2](489100/4588595) loss:3.104 lr:0.0000100 epoch_Time:26010.0min: [2024-01-04 21:23:46,189][model8_pretrain.py][INFO] Epoch:[0/2](489200/4588595) loss:2.919 lr:0.0000100 epoch_Time:26011.0min: [2024-01-04 
21:23:46,189][model8_pretrain.py][INFO] Epoch:[0/2](489200/4588595) loss:3.220 lr:0.0000100 epoch_Time:26011.0min: [2024-01-04 21:23:46,189][model8_pretrain.py][INFO] Epoch:[0/2](489200/4588595) loss:2.738 lr:0.0000100 epoch_Time:26011.0min: [2024-01-04 21:23:46,189][model8_pretrain.py][INFO] Epoch:[0/2](489200/4588595) loss:3.082 lr:0.0000100 epoch_Time:26011.0min: [2024-01-04 21:23:46,189][model8_pretrain.py][INFO] Epoch:[0/2](489200/4588595) loss:2.999 lr:0.0000100 epoch_Time:26011.0min: [2024-01-04 21:23:46,189][model8_pretrain.py][INFO] Epoch:[0/2](489200/4588595) loss:2.965 lr:0.0000100 epoch_Time:26011.0min: [2024-01-04 21:23:46,189][model8_pretrain.py][INFO] Epoch:[0/2](489200/4588595) loss:2.959 lr:0.0000100 epoch_Time:26011.0min: [2024-01-04 21:23:46,189][model8_pretrain.py][INFO] Epoch:[0/2](489200/4588595) loss:2.647 lr:0.0000100 epoch_Time:26011.0min: [2024-01-04 21:24:24,812][model8_pretrain.py][INFO] Epoch:[0/2](489300/4588595) loss:1.834 lr:0.0000100 epoch_Time:26010.0min: [2024-01-04 21:24:24,812][model8_pretrain.py][INFO] Epoch:[0/2](489300/4588595) loss:2.114 lr:0.0000100 epoch_Time:26010.0min: [2024-01-04 21:24:24,812][model8_pretrain.py][INFO] Epoch:[0/2](489300/4588595) loss:3.025 lr:0.0000100 epoch_Time:26010.0min: [2024-01-04 21:24:24,812][model8_pretrain.py][INFO] Epoch:[0/2](489300/4588595) loss:3.023 lr:0.0000100 epoch_Time:26010.0min: [2024-01-04 21:24:24,812][model8_pretrain.py][INFO] Epoch:[0/2](489300/4588595) loss:3.071 lr:0.0000100 epoch_Time:26010.0min: [2024-01-04 21:24:24,812][model8_pretrain.py][INFO] Epoch:[0/2](489300/4588595) loss:2.964 lr:0.0000100 epoch_Time:26010.0min: [2024-01-04 21:24:24,813][model8_pretrain.py][INFO] Epoch:[0/2](489300/4588595) loss:2.958 lr:0.0000100 epoch_Time:26010.0min: [2024-01-04 21:24:24,812][model8_pretrain.py][INFO] Epoch:[0/2](489300/4588595) loss:2.358 lr:0.0000100 epoch_Time:26010.0min: [2024-01-04 21:25:01,768][model8_pretrain.py][INFO] Epoch:[0/2](489400/4588595) loss:2.778 lr:0.0000100 
epoch_Time:26009.0min: [2024-01-04 21:25:01,768][model8_pretrain.py][INFO] Epoch:[0/2](489400/4588595) loss:2.279 lr:0.0000100 epoch_Time:26009.0min: [2024-01-04 21:25:01,768][model8_pretrain.py][INFO] Epoch:[0/2](489400/4588595) loss:2.925 lr:0.0000100 epoch_Time:26009.0min: [2024-01-04 21:25:01,768][model8_pretrain.py][INFO] Epoch:[0/2](489400/4588595) loss:2.750 lr:0.0000100 epoch_Time:26009.0min: [2024-01-04 21:25:01,768][model8_pretrain.py][INFO] Epoch:[0/2](489400/4588595) loss:2.946 lr:0.0000100 epoch_Time:26009.0min: [2024-01-04 21:25:01,768][model8_pretrain.py][INFO] Epoch:[0/2](489400/4588595) loss:3.035 lr:0.0000100 epoch_Time:26009.0min: [2024-01-04 21:25:01,768][model8_pretrain.py][INFO] Epoch:[0/2](489400/4588595) loss:2.693 lr:0.0000100 epoch_Time:26009.0min: [2024-01-04 21:25:01,768][model8_pretrain.py][INFO] Epoch:[0/2](489400/4588595) loss:3.135 lr:0.0000100 epoch_Time:26009.0min: [2024-01-04 21:25:38,722][model8_pretrain.py][INFO] Epoch:[0/2](489500/4588595) loss:2.397 lr:0.0000100 epoch_Time:26009.0min: [2024-01-04 21:25:38,722][model8_pretrain.py][INFO] Epoch:[0/2](489500/4588595) loss:2.880 lr:0.0000100 epoch_Time:26009.0min: [2024-01-04 21:25:38,722][model8_pretrain.py][INFO] Epoch:[0/2](489500/4588595) loss:3.223 lr:0.0000100 epoch_Time:26009.0min: [2024-01-04 21:25:38,722][model8_pretrain.py][INFO] Epoch:[0/2](489500/4588595) loss:2.847 lr:0.0000100 epoch_Time:26009.0min: [2024-01-04 21:25:38,723][model8_pretrain.py][INFO] Epoch:[0/2](489500/4588595) loss:2.912 lr:0.0000100 epoch_Time:26009.0min: [2024-01-04 21:25:38,723][model8_pretrain.py][INFO] Epoch:[0/2](489500/4588595) loss:2.860 lr:0.0000100 epoch_Time:26009.0min: [2024-01-04 21:25:38,723][model8_pretrain.py][INFO] Epoch:[0/2](489500/4588595) loss:3.261 lr:0.0000100 epoch_Time:26009.0min: [2024-01-04 21:25:38,723][model8_pretrain.py][INFO] Epoch:[0/2](489500/4588595) loss:2.463 lr:0.0000100 epoch_Time:26009.0min: [2024-01-04 21:26:15,673][model8_pretrain.py][INFO] 
Epoch:[0/2](489600/4588595) loss:2.786 lr:0.0000100 epoch_Time:26008.0min: [2024-01-04 21:26:15,673][model8_pretrain.py][INFO] Epoch:[0/2](489600/4588595) loss:2.630 lr:0.0000100 epoch_Time:26008.0min: [2024-01-04 21:26:15,673][model8_pretrain.py][INFO] Epoch:[0/2](489600/4588595) loss:2.434 lr:0.0000100 epoch_Time:26008.0min: [2024-01-04 21:26:15,673][model8_pretrain.py][INFO] Epoch:[0/2](489600/4588595) loss:3.430 lr:0.0000100 epoch_Time:26008.0min: [2024-01-04 21:26:15,673][model8_pretrain.py][INFO] Epoch:[0/2](489600/4588595) loss:2.952 lr:0.0000100 epoch_Time:26008.0min: [2024-01-04 21:26:15,673][model8_pretrain.py][INFO] Epoch:[0/2](489600/4588595) loss:3.527 lr:0.0000100 epoch_Time:26008.0min: [2024-01-04 21:26:15,673][model8_pretrain.py][INFO] Epoch:[0/2](489600/4588595) loss:3.018 lr:0.0000100 epoch_Time:26008.0min: [2024-01-04 21:26:15,673][model8_pretrain.py][INFO] Epoch:[0/2](489600/4588595) loss:2.885 lr:0.0000100 epoch_Time:26008.0min: [2024-01-04 21:26:52,634][model8_pretrain.py][INFO] Epoch:[0/2](489700/4588595) loss:2.692 lr:0.0000100 epoch_Time:26006.0min: [2024-01-04 21:26:52,634][model8_pretrain.py][INFO] Epoch:[0/2](489700/4588595) loss:3.053 lr:0.0000100 epoch_Time:26006.0min: [2024-01-04 21:26:52,635][model8_pretrain.py][INFO] Epoch:[0/2](489700/4588595) loss:2.624 lr:0.0000100 epoch_Time:26006.0min: [2024-01-04 21:26:52,635][model8_pretrain.py][INFO] Epoch:[0/2](489700/4588595) loss:2.209 lr:0.0000100 epoch_Time:26006.0min: [2024-01-04 21:26:52,635][model8_pretrain.py][INFO] Epoch:[0/2](489700/4588595) loss:3.056 lr:0.0000100 epoch_Time:26006.0min: [2024-01-04 21:26:52,635][model8_pretrain.py][INFO] Epoch:[0/2](489700/4588595) loss:2.929 lr:0.0000100 epoch_Time:26006.0min: [2024-01-04 21:26:52,635][model8_pretrain.py][INFO] Epoch:[0/2](489700/4588595) loss:3.314 lr:0.0000100 epoch_Time:26006.0min: [2024-01-04 21:26:52,635][model8_pretrain.py][INFO] Epoch:[0/2](489700/4588595) loss:2.664 lr:0.0000100 epoch_Time:26006.0min: [2024-01-04 
21:27:29,601][model8_pretrain.py][INFO] Epoch:[0/2](489800/4588595) loss:2.656 lr:0.0000100 epoch_Time:26006.0min: [2024-01-04 21:27:29,601][model8_pretrain.py][INFO] Epoch:[0/2](489800/4588595) loss:2.793 lr:0.0000100 epoch_Time:26006.0min: [2024-01-04 21:27:29,601][model8_pretrain.py][INFO] Epoch:[0/2](489800/4588595) loss:3.429 lr:0.0000100 epoch_Time:26006.0min: [2024-01-04 21:27:29,601][model8_pretrain.py][INFO] Epoch:[0/2](489800/4588595) loss:2.288 lr:0.0000100 epoch_Time:26006.0min: [2024-01-04 21:27:29,601][model8_pretrain.py][INFO] Epoch:[0/2](489800/4588595) loss:2.780 lr:0.0000100 epoch_Time:26006.0min: [2024-01-04 21:27:29,601][model8_pretrain.py][INFO] Epoch:[0/2](489800/4588595) loss:3.057 lr:0.0000100 epoch_Time:26006.0min: [2024-01-04 21:27:29,601][model8_pretrain.py][INFO] Epoch:[0/2](489800/4588595) loss:2.954 lr:0.0000100 epoch_Time:26006.0min: [2024-01-04 21:27:29,601][model8_pretrain.py][INFO] Epoch:[0/2](489800/4588595) loss:2.801 lr:0.0000100 epoch_Time:26006.0min: [2024-01-04 21:28:06,547][model8_pretrain.py][INFO] Epoch:[0/2](489900/4588595) loss:3.017 lr:0.0000100 epoch_Time:26005.0min: [2024-01-04 21:28:06,547][model8_pretrain.py][INFO] Epoch:[0/2](489900/4588595) loss:2.659 lr:0.0000100 epoch_Time:26005.0min: [2024-01-04 21:28:06,547][model8_pretrain.py][INFO] Epoch:[0/2](489900/4588595) loss:2.766 lr:0.0000100 epoch_Time:26005.0min: [2024-01-04 21:28:06,547][model8_pretrain.py][INFO] Epoch:[0/2](489900/4588595) loss:2.746 lr:0.0000100 epoch_Time:26005.0min: [2024-01-04 21:28:06,547][model8_pretrain.py][INFO] Epoch:[0/2](489900/4588595) loss:2.839 lr:0.0000100 epoch_Time:26005.0min: [2024-01-04 21:28:06,547][model8_pretrain.py][INFO] Epoch:[0/2](489900/4588595) loss:2.633 lr:0.0000100 epoch_Time:26005.0min: [2024-01-04 21:28:06,547][model8_pretrain.py][INFO] Epoch:[0/2](489900/4588595) loss:2.853 lr:0.0000100 epoch_Time:26005.0min: [2024-01-04 21:28:06,548][model8_pretrain.py][INFO] Epoch:[0/2](489900/4588595) loss:2.534 lr:0.0000100 
epoch_Time:26005.0min: [2024-01-04 21:28:52,258][model8_pretrain.py][INFO] Epoch:[0/2](490000/4588595) loss:2.813 lr:0.0000100 epoch_Time:26005.0min: [2024-01-04 21:28:52,258][model8_pretrain.py][INFO] Epoch:[0/2](490000/4588595) loss:2.921 lr:0.0000100 epoch_Time:26005.0min: [2024-01-04 21:28:52,258][model8_pretrain.py][INFO] Epoch:[0/2](490000/4588595) loss:3.001 lr:0.0000100 epoch_Time:26005.0min: [2024-01-04 21:28:52,258][model8_pretrain.py][INFO] Epoch:[0/2](490000/4588595) loss:2.868 lr:0.0000100 epoch_Time:26005.0min: [2024-01-04 21:28:52,258][model8_pretrain.py][INFO] Epoch:[0/2](490000/4588595) loss:2.481 lr:0.0000100 epoch_Time:26005.0min: [2024-01-04 21:28:52,258][model8_pretrain.py][INFO] Epoch:[0/2](490000/4588595) loss:3.143 lr:0.0000100 epoch_Time:26005.0min: [2024-01-04 21:28:52,258][model8_pretrain.py][INFO] Epoch:[0/2](490000/4588595) loss:2.961 lr:0.0000100 epoch_Time:26005.0min: [2024-01-04 21:28:52,258][model8_pretrain.py][INFO] Epoch:[0/2](490000/4588595) loss:2.325 lr:0.0000100 epoch_Time:26005.0min: [2024-01-04 21:29:30,878][model8_pretrain.py][INFO] Epoch:[0/2](490100/4588595) loss:2.648 lr:0.0000100 epoch_Time:26005.0min: [2024-01-04 21:29:30,877][model8_pretrain.py][INFO] Epoch:[0/2](490100/4588595) loss:2.932 lr:0.0000100 epoch_Time:26005.0min: [2024-01-04 21:29:30,878][model8_pretrain.py][INFO] Epoch:[0/2](490100/4588595) loss:2.850 lr:0.0000100 epoch_Time:26005.0min: [2024-01-04 21:29:30,878][model8_pretrain.py][INFO] Epoch:[0/2](490100/4588595) loss:2.970 lr:0.0000100 epoch_Time:26005.0min: [2024-01-04 21:29:30,878][model8_pretrain.py][INFO] Epoch:[0/2](490100/4588595) loss:3.104 lr:0.0000100 epoch_Time:26005.0min: [2024-01-04 21:29:30,878][model8_pretrain.py][INFO] Epoch:[0/2](490100/4588595) loss:3.011 lr:0.0000100 epoch_Time:26005.0min: [2024-01-04 21:29:30,878][model8_pretrain.py][INFO] Epoch:[0/2](490100/4588595) loss:3.354 lr:0.0000100 epoch_Time:26005.0min: [2024-01-04 21:29:30,878][model8_pretrain.py][INFO] 
Epoch:[0/2](490100/4588595) loss:3.034 lr:0.0000100 epoch_Time:26005.0min: [2024-01-04 21:30:07,823][model8_pretrain.py][INFO] Epoch:[0/2](490200/4588595) loss:2.899 lr:0.0000100 epoch_Time:26004.0min: [2024-01-04 21:30:07,822][model8_pretrain.py][INFO] Epoch:[0/2](490200/4588595) loss:3.448 lr:0.0000100 epoch_Time:26004.0min: [2024-01-04 21:30:07,823][model8_pretrain.py][INFO] Epoch:[0/2](490200/4588595) loss:2.697 lr:0.0000100 epoch_Time:26004.0min: [2024-01-04 21:30:07,823][model8_pretrain.py][INFO] Epoch:[0/2](490200/4588595) loss:2.796 lr:0.0000100 epoch_Time:26004.0min: [2024-01-04 21:30:07,823][model8_pretrain.py][INFO] Epoch:[0/2](490200/4588595) loss:2.662 lr:0.0000100 epoch_Time:26004.0min: [2024-01-04 21:30:07,823][model8_pretrain.py][INFO] Epoch:[0/2](490200/4588595) loss:2.707 lr:0.0000100 epoch_Time:26004.0min: [2024-01-04 21:30:07,823][model8_pretrain.py][INFO] Epoch:[0/2](490200/4588595) loss:2.599 lr:0.0000100 epoch_Time:26004.0min: [2024-01-04 21:30:07,823][model8_pretrain.py][INFO] Epoch:[0/2](490200/4588595) loss:3.038 lr:0.0000100 epoch_Time:26004.0min: [2024-01-04 21:30:44,794][model8_pretrain.py][INFO] Epoch:[0/2](490300/4588595) loss:3.195 lr:0.0000100 epoch_Time:26004.0min: [2024-01-04 21:30:44,794][model8_pretrain.py][INFO] Epoch:[0/2](490300/4588595) loss:3.270 lr:0.0000100 epoch_Time:26004.0min: [2024-01-04 21:30:44,794][model8_pretrain.py][INFO] Epoch:[0/2](490300/4588595) loss:3.143 lr:0.0000100 epoch_Time:26004.0min: [2024-01-04 21:30:44,794][model8_pretrain.py][INFO] Epoch:[0/2](490300/4588595) loss:2.508 lr:0.0000100 epoch_Time:26004.0min: [2024-01-04 21:30:44,794][model8_pretrain.py][INFO] Epoch:[0/2](490300/4588595) loss:3.031 lr:0.0000100 epoch_Time:26004.0min: [2024-01-04 21:30:44,794][model8_pretrain.py][INFO] Epoch:[0/2](490300/4588595) loss:2.672 lr:0.0000100 epoch_Time:26004.0min: [2024-01-04 21:30:44,794][model8_pretrain.py][INFO] Epoch:[0/2](490300/4588595) loss:2.523 lr:0.0000100 epoch_Time:26004.0min: [2024-01-04 
21:30:44,794][model8_pretrain.py][INFO] Epoch:[0/2](490300/4588595) loss:2.752 lr:0.0000100 epoch_Time:26004.0min: [2024-01-04 21:31:21,756][model8_pretrain.py][INFO] Epoch:[0/2](490400/4588595) loss:3.178 lr:0.0000100 epoch_Time:26003.0min: [2024-01-04 21:31:21,756][model8_pretrain.py][INFO] Epoch:[0/2](490400/4588595) loss:3.392 lr:0.0000100 epoch_Time:26003.0min: [2024-01-04 21:31:21,756][model8_pretrain.py][INFO] Epoch:[0/2](490400/4588595) loss:2.790 lr:0.0000100 epoch_Time:26003.0min: [2024-01-04 21:31:21,757][model8_pretrain.py][INFO] Epoch:[0/2](490400/4588595) loss:2.299 lr:0.0000100 epoch_Time:26003.0min: [2024-01-04 21:31:21,757][model8_pretrain.py][INFO] Epoch:[0/2](490400/4588595) loss:2.480 lr:0.0000100 epoch_Time:26003.0min: [2024-01-04 21:31:21,757][model8_pretrain.py][INFO] Epoch:[0/2](490400/4588595) loss:3.161 lr:0.0000100 epoch_Time:26003.0min: [2024-01-04 21:31:21,757][model8_pretrain.py][INFO] Epoch:[0/2](490400/4588595) loss:2.892 lr:0.0000100 epoch_Time:26003.0min: [2024-01-04 21:31:21,757][model8_pretrain.py][INFO] Epoch:[0/2](490400/4588595) loss:2.882 lr:0.0000100 epoch_Time:26003.0min: [2024-01-04 21:31:58,695][model8_pretrain.py][INFO] Epoch:[0/2](490500/4588595) loss:2.979 lr:0.0000100 epoch_Time:26002.0min: [2024-01-04 21:31:58,695][model8_pretrain.py][INFO] Epoch:[0/2](490500/4588595) loss:3.190 lr:0.0000100 epoch_Time:26002.0min: [2024-01-04 21:31:58,695][model8_pretrain.py][INFO] Epoch:[0/2](490500/4588595) loss:2.942 lr:0.0000100 epoch_Time:26002.0min: [2024-01-04 21:31:58,695][model8_pretrain.py][INFO] Epoch:[0/2](490500/4588595) loss:2.624 lr:0.0000100 epoch_Time:26002.0min: [2024-01-04 21:31:58,695][model8_pretrain.py][INFO] Epoch:[0/2](490500/4588595) loss:2.315 lr:0.0000100 epoch_Time:26002.0min: [2024-01-04 21:31:58,695][model8_pretrain.py][INFO] Epoch:[0/2](490500/4588595) loss:3.005 lr:0.0000100 epoch_Time:26002.0min: [2024-01-04 21:31:58,695][model8_pretrain.py][INFO] Epoch:[0/2](490500/4588595) loss:3.048 lr:0.0000100 
epoch_Time:26002.0min: [2024-01-04 21:31:58,695][model8_pretrain.py][INFO] Epoch:[0/2](490500/4588595) loss:3.227 lr:0.0000100 epoch_Time:26002.0min: [2024-01-04 21:32:35,624][model8_pretrain.py][INFO] Epoch:[0/2](490600/4588595) loss:3.392 lr:0.0000100 epoch_Time:26002.0min: [2024-01-04 21:32:35,624][model8_pretrain.py][INFO] Epoch:[0/2](490600/4588595) loss:2.497 lr:0.0000100 epoch_Time:26002.0min: [2024-01-04 21:32:35,624][model8_pretrain.py][INFO] Epoch:[0/2](490600/4588595) loss:3.589 lr:0.0000100 epoch_Time:26002.0min: [2024-01-04 21:32:35,624][model8_pretrain.py][INFO] Epoch:[0/2](490600/4588595) loss:3.093 lr:0.0000100 epoch_Time:26002.0min: [2024-01-04 21:32:35,624][model8_pretrain.py][INFO] Epoch:[0/2](490600/4588595) loss:2.196 lr:0.0000100 epoch_Time:26002.0min: [2024-01-04 21:32:35,624][model8_pretrain.py][INFO] Epoch:[0/2](490600/4588595) loss:2.796 lr:0.0000100 epoch_Time:26002.0min: [2024-01-04 21:32:35,624][model8_pretrain.py][INFO] Epoch:[0/2](490600/4588595) loss:3.342 lr:0.0000100 epoch_Time:26002.0min: [2024-01-04 21:32:35,624][model8_pretrain.py][INFO] Epoch:[0/2](490600/4588595) loss:2.545 lr:0.0000100 epoch_Time:26002.0min: [2024-01-04 21:33:12,566][model8_pretrain.py][INFO] Epoch:[0/2](490700/4588595) loss:2.578 lr:0.0000100 epoch_Time:26000.0min: [2024-01-04 21:33:12,566][model8_pretrain.py][INFO] Epoch:[0/2](490700/4588595) loss:2.023 lr:0.0000100 epoch_Time:26000.0min: [2024-01-04 21:33:12,566][model8_pretrain.py][INFO] Epoch:[0/2](490700/4588595) loss:2.578 lr:0.0000100 epoch_Time:26000.0min: [2024-01-04 21:33:12,566][model8_pretrain.py][INFO] Epoch:[0/2](490700/4588595) loss:3.142 lr:0.0000100 epoch_Time:26000.0min: [2024-01-04 21:33:12,566][model8_pretrain.py][INFO] Epoch:[0/2](490700/4588595) loss:2.417 lr:0.0000100 epoch_Time:26000.0min: [2024-01-04 21:33:12,566][model8_pretrain.py][INFO] Epoch:[0/2](490700/4588595) loss:2.232 lr:0.0000100 epoch_Time:26000.0min: [2024-01-04 21:33:12,566][model8_pretrain.py][INFO] 
Epoch:[0/2](490700/4588595) loss:3.280 lr:0.0000100 epoch_Time:26000.0min: [2024-01-04 21:33:12,566][model8_pretrain.py][INFO] Epoch:[0/2](490700/4588595) loss:2.504 lr:0.0000100 epoch_Time:26000.0min: [2024-01-04 21:33:54,837][model8_pretrain.py][INFO] Epoch:[0/2](490800/4588595) loss:2.672 lr:0.0000100 epoch_Time:26000.0min: [2024-01-04 21:33:54,837][model8_pretrain.py][INFO] Epoch:[0/2](490800/4588595) loss:3.356 lr:0.0000100 epoch_Time:26000.0min: [2024-01-04 21:33:54,837][model8_pretrain.py][INFO] Epoch:[0/2](490800/4588595) loss:3.151 lr:0.0000100 epoch_Time:26000.0min: [2024-01-04 21:33:54,837][model8_pretrain.py][INFO] Epoch:[0/2](490800/4588595) loss:3.229 lr:0.0000100 epoch_Time:26000.0min: [2024-01-04 21:33:54,837][model8_pretrain.py][INFO] Epoch:[0/2](490800/4588595) loss:3.262 lr:0.0000100 epoch_Time:26000.0min: [2024-01-04 21:33:54,837][model8_pretrain.py][INFO] Epoch:[0/2](490800/4588595) loss:2.980 lr:0.0000100 epoch_Time:26000.0min: [2024-01-04 21:33:54,838][model8_pretrain.py][INFO] Epoch:[0/2](490800/4588595) loss:2.867 lr:0.0000100 epoch_Time:26000.0min: [2024-01-04 21:33:54,838][model8_pretrain.py][INFO] Epoch:[0/2](490800/4588595) loss:3.041 lr:0.0000100 epoch_Time:26000.0min: [2024-01-04 21:34:36,705][model8_pretrain.py][INFO] Epoch:[0/2](490900/4588595) loss:2.735 lr:0.0000100 epoch_Time:26001.0min: [2024-01-04 21:34:36,705][model8_pretrain.py][INFO] Epoch:[0/2](490900/4588595) loss:3.142 lr:0.0000100 epoch_Time:26001.0min: [2024-01-04 21:34:36,705][model8_pretrain.py][INFO] Epoch:[0/2](490900/4588595) loss:3.083 lr:0.0000100 epoch_Time:26001.0min: [2024-01-04 21:34:36,705][model8_pretrain.py][INFO] Epoch:[0/2](490900/4588595) loss:2.604 lr:0.0000100 epoch_Time:26001.0min: [2024-01-04 21:34:36,705][model8_pretrain.py][INFO] Epoch:[0/2](490900/4588595) loss:3.108 lr:0.0000100 epoch_Time:26001.0min: [2024-01-04 21:34:36,705][model8_pretrain.py][INFO] Epoch:[0/2](490900/4588595) loss:2.829 lr:0.0000100 epoch_Time:26001.0min: [2024-01-04 
21:34:36,705][model8_pretrain.py][INFO] Epoch:[0/2](490900/4588595) loss:2.161 lr:0.0000100 epoch_Time:26001.0min: [2024-01-04 21:34:36,705][model8_pretrain.py][INFO] Epoch:[0/2](490900/4588595) loss:3.149 lr:0.0000100 epoch_Time:26001.0min: [2024-01-04 21:35:13,643][model8_pretrain.py][INFO] Epoch:[0/2](491000/4588595) loss:3.152 lr:0.0000100 epoch_Time:25999.0min: [2024-01-04 21:35:13,643][model8_pretrain.py][INFO] Epoch:[0/2](491000/4588595) loss:2.730 lr:0.0000100 epoch_Time:25999.0min: [2024-01-04 21:35:13,643][model8_pretrain.py][INFO] Epoch:[0/2](491000/4588595) loss:3.009 lr:0.0000100 epoch_Time:25999.0min: [2024-01-04 21:35:13,643][model8_pretrain.py][INFO] Epoch:[0/2](491000/4588595) loss:3.011 lr:0.0000100 epoch_Time:25999.0min: [2024-01-04 21:35:13,643][model8_pretrain.py][INFO] Epoch:[0/2](491000/4588595) loss:2.545 lr:0.0000100 epoch_Time:25999.0min: [2024-01-04 21:35:13,643][model8_pretrain.py][INFO] Epoch:[0/2](491000/4588595) loss:2.774 lr:0.0000100 epoch_Time:25999.0min: [2024-01-04 21:35:13,644][model8_pretrain.py][INFO] Epoch:[0/2](491000/4588595) loss:2.710 lr:0.0000100 epoch_Time:25999.0min: [2024-01-04 21:35:13,644][model8_pretrain.py][INFO] Epoch:[0/2](491000/4588595) loss:3.042 lr:0.0000100 epoch_Time:25999.0min: [2024-01-04 21:35:50,596][model8_pretrain.py][INFO] Epoch:[0/2](491100/4588595) loss:3.101 lr:0.0000100 epoch_Time:25998.0min: [2024-01-04 21:35:50,596][model8_pretrain.py][INFO] Epoch:[0/2](491100/4588595) loss:2.352 lr:0.0000100 epoch_Time:25998.0min: [2024-01-04 21:35:50,596][model8_pretrain.py][INFO] Epoch:[0/2](491100/4588595) loss:3.109 lr:0.0000100 epoch_Time:25998.0min: [2024-01-04 21:35:50,596][model8_pretrain.py][INFO] Epoch:[0/2](491100/4588595) loss:2.618 lr:0.0000100 epoch_Time:25998.0min: [2024-01-04 21:35:50,596][model8_pretrain.py][INFO] Epoch:[0/2](491100/4588595) loss:3.140 lr:0.0000100 epoch_Time:25998.0min: [2024-01-04 21:35:50,596][model8_pretrain.py][INFO] Epoch:[0/2](491100/4588595) loss:3.222 lr:0.0000100 
epoch_Time:25998.0min: [2024-01-04 21:35:50,596][model8_pretrain.py][INFO] Epoch:[0/2](491100/4588595) loss:2.439 lr:0.0000100 epoch_Time:25998.0min: [2024-01-04 21:35:50,597][model8_pretrain.py][INFO] Epoch:[0/2](491100/4588595) loss:3.446 lr:0.0000100 epoch_Time:25998.0min: [2024-01-04 21:36:27,535][model8_pretrain.py][INFO] Epoch:[0/2](491200/4588595) loss:3.310 lr:0.0000100 epoch_Time:25998.0min: [2024-01-04 21:36:27,535][model8_pretrain.py][INFO] Epoch:[0/2](491200/4588595) loss:2.745 lr:0.0000100 epoch_Time:25998.0min: [2024-01-04 21:36:27,535][model8_pretrain.py][INFO] Epoch:[0/2](491200/4588595) loss:2.553 lr:0.0000100 epoch_Time:25998.0min: [2024-01-04 21:36:27,535][model8_pretrain.py][INFO] Epoch:[0/2](491200/4588595) loss:2.992 lr:0.0000100 epoch_Time:25998.0min: [2024-01-04 21:36:27,535][model8_pretrain.py][INFO] Epoch:[0/2](491200/4588595) loss:2.637 lr:0.0000100 epoch_Time:25998.0min: [2024-01-04 21:36:27,535][model8_pretrain.py][INFO] Epoch:[0/2](491200/4588595) loss:3.105 lr:0.0000100 epoch_Time:25998.0min: [2024-01-04 21:36:27,535][model8_pretrain.py][INFO] Epoch:[0/2](491200/4588595) loss:3.314 lr:0.0000100 epoch_Time:25998.0min: [2024-01-04 21:36:27,535][model8_pretrain.py][INFO] Epoch:[0/2](491200/4588595) loss:3.330 lr:0.0000100 epoch_Time:25998.0min: [2024-01-04 21:37:04,479][model8_pretrain.py][INFO] Epoch:[0/2](491300/4588595) loss:2.680 lr:0.0000100 epoch_Time:25997.0min: [2024-01-04 21:37:04,479][model8_pretrain.py][INFO] Epoch:[0/2](491300/4588595) loss:2.593 lr:0.0000100 epoch_Time:25997.0min: [2024-01-04 21:37:04,479][model8_pretrain.py][INFO] Epoch:[0/2](491300/4588595) loss:2.983 lr:0.0000100 epoch_Time:25997.0min: [2024-01-04 21:37:04,479][model8_pretrain.py][INFO] Epoch:[0/2](491300/4588595) loss:3.031 lr:0.0000100 epoch_Time:25997.0min: [2024-01-04 21:37:04,479][model8_pretrain.py][INFO] Epoch:[0/2](491300/4588595) loss:3.086 lr:0.0000100 epoch_Time:25997.0min: [2024-01-04 21:37:04,479][model8_pretrain.py][INFO] 
Epoch:[0/2](491300/4588595) loss:3.206 lr:0.0000100 epoch_Time:25997.0min: [2024-01-04 21:37:04,479][model8_pretrain.py][INFO] Epoch:[0/2](491300/4588595) loss:3.032 lr:0.0000100 epoch_Time:25997.0min: [2024-01-04 21:37:04,479][model8_pretrain.py][INFO] Epoch:[0/2](491300/4588595) loss:2.955 lr:0.0000100 epoch_Time:25997.0min: [2024-01-04 21:37:41,419][model8_pretrain.py][INFO] Epoch:[0/2](491400/4588595) loss:2.650 lr:0.0000100 epoch_Time:25997.0min: [2024-01-04 21:37:41,419][model8_pretrain.py][INFO] Epoch:[0/2](491400/4588595) loss:2.690 lr:0.0000100 epoch_Time:25997.0min: [2024-01-04 21:37:41,419][model8_pretrain.py][INFO] Epoch:[0/2](491400/4588595) loss:2.696 lr:0.0000100 epoch_Time:25997.0min: [2024-01-04 21:37:41,419][model8_pretrain.py][INFO] Epoch:[0/2](491400/4588595) loss:2.659 lr:0.0000100 epoch_Time:25997.0min: [2024-01-04 21:37:41,419][model8_pretrain.py][INFO] Epoch:[0/2](491400/4588595) loss:2.840 lr:0.0000100 epoch_Time:25997.0min: [2024-01-04 21:37:41,419][model8_pretrain.py][INFO] Epoch:[0/2](491400/4588595) loss:3.176 lr:0.0000100 epoch_Time:25997.0min: [2024-01-04 21:37:41,420][model8_pretrain.py][INFO] Epoch:[0/2](491400/4588595) loss:2.862 lr:0.0000100 epoch_Time:25997.0min: [2024-01-04 21:37:41,420][model8_pretrain.py][INFO] Epoch:[0/2](491400/4588595) loss:2.670 lr:0.0000100 epoch_Time:25997.0min: [2024-01-04 21:38:18,355][model8_pretrain.py][INFO] Epoch:[0/2](491500/4588595) loss:3.064 lr:0.0000100 epoch_Time:25996.0min: [2024-01-04 21:38:18,355][model8_pretrain.py][INFO] Epoch:[0/2](491500/4588595) loss:2.911 lr:0.0000100 epoch_Time:25996.0min: [2024-01-04 21:38:18,355][model8_pretrain.py][INFO] Epoch:[0/2](491500/4588595) loss:2.885 lr:0.0000100 epoch_Time:25996.0min: [2024-01-04 21:38:18,355][model8_pretrain.py][INFO] Epoch:[0/2](491500/4588595) loss:2.698 lr:0.0000100 epoch_Time:25996.0min: [2024-01-04 21:38:18,355][model8_pretrain.py][INFO] Epoch:[0/2](491500/4588595) loss:2.975 lr:0.0000100 epoch_Time:25996.0min: [2024-01-04 
21:38:18,355][model8_pretrain.py][INFO] Epoch:[0/2](491500/4588595) loss:2.272 lr:0.0000100 epoch_Time:25996.0min: [2024-01-04 21:38:18,356][model8_pretrain.py][INFO] Epoch:[0/2](491500/4588595) loss:2.671 lr:0.0000100 epoch_Time:25996.0min: [2024-01-04 21:38:18,356][model8_pretrain.py][INFO] Epoch:[0/2](491500/4588595) loss:2.796 lr:0.0000100 epoch_Time:25996.0min: [2024-01-04 21:39:00,581][model8_pretrain.py][INFO] Epoch:[0/2](491600/4588595) loss:2.647 lr:0.0000100 epoch_Time:25995.0min: [2024-01-04 21:39:00,581][model8_pretrain.py][INFO] Epoch:[0/2](491600/4588595) loss:3.206 lr:0.0000100 epoch_Time:25995.0min: [2024-01-04 21:39:00,581][model8_pretrain.py][INFO] Epoch:[0/2](491600/4588595) loss:3.078 lr:0.0000100 epoch_Time:25995.0min: [2024-01-04 21:39:00,581][model8_pretrain.py][INFO] Epoch:[0/2](491600/4588595) loss:2.617 lr:0.0000100 epoch_Time:25995.0min: [2024-01-04 21:39:00,581][model8_pretrain.py][INFO] Epoch:[0/2](491600/4588595) loss:2.556 lr:0.0000100 epoch_Time:25995.0min: [2024-01-04 21:39:00,581][model8_pretrain.py][INFO] Epoch:[0/2](491600/4588595) loss:3.145 lr:0.0000100 epoch_Time:25995.0min: [2024-01-04 21:39:00,581][model8_pretrain.py][INFO] Epoch:[0/2](491600/4588595) loss:2.509 lr:0.0000100 epoch_Time:25995.0min: [2024-01-04 21:39:00,581][model8_pretrain.py][INFO] Epoch:[0/2](491600/4588595) loss:2.488 lr:0.0000100 epoch_Time:25995.0min: [2024-01-04 21:39:42,233][model8_pretrain.py][INFO] Epoch:[0/2](491700/4588595) loss:2.772 lr:0.0000100 epoch_Time:25996.0min: [2024-01-04 21:39:42,233][model8_pretrain.py][INFO] Epoch:[0/2](491700/4588595) loss:3.012 lr:0.0000100 epoch_Time:25996.0min: [2024-01-04 21:39:42,233][model8_pretrain.py][INFO] Epoch:[0/2](491700/4588595) loss:2.983 lr:0.0000100 epoch_Time:25996.0min: [2024-01-04 21:39:42,233][model8_pretrain.py][INFO] Epoch:[0/2](491700/4588595) loss:2.883 lr:0.0000100 epoch_Time:25996.0min: [2024-01-04 21:39:42,233][model8_pretrain.py][INFO] Epoch:[0/2](491700/4588595) loss:2.853 lr:0.0000100 
epoch_Time:25996.0min: [2024-01-04 21:39:42,233][model8_pretrain.py][INFO] Epoch:[0/2](491700/4588595) loss:3.059 lr:0.0000100 epoch_Time:25996.0min: [2024-01-04 21:39:42,233][model8_pretrain.py][INFO] Epoch:[0/2](491700/4588595) loss:2.465 lr:0.0000100 epoch_Time:25996.0min: [2024-01-04 21:39:42,233][model8_pretrain.py][INFO] Epoch:[0/2](491700/4588595) loss:2.496 lr:0.0000100 epoch_Time:25996.0min: [2024-01-04 21:40:19,196][model8_pretrain.py][INFO] Epoch:[0/2](491800/4588595) loss:2.827 lr:0.0000100 epoch_Time:25995.0min: [2024-01-04 21:40:19,196][model8_pretrain.py][INFO] Epoch:[0/2](491800/4588595) loss:2.627 lr:0.0000100 epoch_Time:25995.0min: [2024-01-04 21:40:19,196][model8_pretrain.py][INFO] Epoch:[0/2](491800/4588595) loss:3.307 lr:0.0000100 epoch_Time:25995.0min: [2024-01-04 21:40:19,196][model8_pretrain.py][INFO] Epoch:[0/2](491800/4588595) loss:2.146 lr:0.0000100 epoch_Time:25995.0min: [2024-01-04 21:40:19,196][model8_pretrain.py][INFO] Epoch:[0/2](491800/4588595) loss:3.351 lr:0.0000100 epoch_Time:25995.0min: [2024-01-04 21:40:19,196][model8_pretrain.py][INFO] Epoch:[0/2](491800/4588595) loss:2.998 lr:0.0000100 epoch_Time:25995.0min: [2024-01-04 21:40:19,196][model8_pretrain.py][INFO] Epoch:[0/2](491800/4588595) loss:2.825 lr:0.0000100 epoch_Time:25995.0min: [2024-01-04 21:40:19,196][model8_pretrain.py][INFO] Epoch:[0/2](491800/4588595) loss:2.544 lr:0.0000100 epoch_Time:25995.0min: [2024-01-04 21:40:56,134][model8_pretrain.py][INFO] Epoch:[0/2](491900/4588595) loss:2.948 lr:0.0000100 epoch_Time:25993.0min: [2024-01-04 21:40:56,134][model8_pretrain.py][INFO] Epoch:[0/2](491900/4588595) loss:2.662 lr:0.0000100 epoch_Time:25993.0min: [2024-01-04 21:40:56,134][model8_pretrain.py][INFO] Epoch:[0/2](491900/4588595) loss:3.223 lr:0.0000100 epoch_Time:25993.0min: [2024-01-04 21:40:56,134][model8_pretrain.py][INFO] Epoch:[0/2](491900/4588595) loss:2.938 lr:0.0000100 epoch_Time:25993.0min: [2024-01-04 21:40:56,134][model8_pretrain.py][INFO] 
Epoch:[0/2](491900/4588595) loss:3.169 lr:0.0000100 epoch_Time:25993.0min: [2024-01-04 21:40:56,134][model8_pretrain.py][INFO] Epoch:[0/2](491900/4588595) loss:3.126 lr:0.0000100 epoch_Time:25993.0min: [2024-01-04 21:40:56,134][model8_pretrain.py][INFO] Epoch:[0/2](491900/4588595) loss:1.738 lr:0.0000100 epoch_Time:25993.0min: [2024-01-04 21:40:56,134][model8_pretrain.py][INFO] Epoch:[0/2](491900/4588595) loss:2.970 lr:0.0000100 epoch_Time:25993.0min: [2024-01-04 21:41:33,074][model8_pretrain.py][INFO] Epoch:[0/2](492000/4588595) loss:2.833 lr:0.0000100 epoch_Time:25993.0min: [2024-01-04 21:41:33,074][model8_pretrain.py][INFO] Epoch:[0/2](492000/4588595) loss:3.120 lr:0.0000100 epoch_Time:25993.0min: [2024-01-04 21:41:33,074][model8_pretrain.py][INFO] Epoch:[0/2](492000/4588595) loss:2.722 lr:0.0000100 epoch_Time:25993.0min: [2024-01-04 21:41:33,074][model8_pretrain.py][INFO] Epoch:[0/2](492000/4588595) loss:2.769 lr:0.0000100 epoch_Time:25993.0min: [2024-01-04 21:41:33,074][model8_pretrain.py][INFO] Epoch:[0/2](492000/4588595) loss:2.600 lr:0.0000100 epoch_Time:25993.0min: [2024-01-04 21:41:33,074][model8_pretrain.py][INFO] Epoch:[0/2](492000/4588595) loss:3.164 lr:0.0000100 epoch_Time:25993.0min: [2024-01-04 21:41:33,074][model8_pretrain.py][INFO] Epoch:[0/2](492000/4588595) loss:2.963 lr:0.0000100 epoch_Time:25993.0min: [2024-01-04 21:41:33,074][model8_pretrain.py][INFO] Epoch:[0/2](492000/4588595) loss:3.101 lr:0.0000100 epoch_Time:25993.0min: [2024-01-04 21:42:10,010][model8_pretrain.py][INFO] Epoch:[0/2](492100/4588595) loss:2.843 lr:0.0000100 epoch_Time:25992.0min: [2024-01-04 21:42:10,010][model8_pretrain.py][INFO] Epoch:[0/2](492100/4588595) loss:3.177 lr:0.0000100 epoch_Time:25992.0min: [2024-01-04 21:42:10,010][model8_pretrain.py][INFO] Epoch:[0/2](492100/4588595) loss:2.222 lr:0.0000100 epoch_Time:25992.0min: [2024-01-04 21:42:10,010][model8_pretrain.py][INFO] Epoch:[0/2](492100/4588595) loss:3.142 lr:0.0000100 epoch_Time:25992.0min: [2024-01-04 
21:42:10,010][model8_pretrain.py][INFO] Epoch:[0/2](492100/4588595) loss:2.277 lr:0.0000100 epoch_Time:25992.0min: [2024-01-04 21:42:10,010][model8_pretrain.py][INFO] Epoch:[0/2](492100/4588595) loss:2.910 lr:0.0000100 epoch_Time:25992.0min: [2024-01-04 21:42:10,010][model8_pretrain.py][INFO] Epoch:[0/2](492100/4588595) loss:2.535 lr:0.0000100 epoch_Time:25992.0min: [2024-01-04 21:42:10,010][model8_pretrain.py][INFO] Epoch:[0/2](492100/4588595) loss:2.264 lr:0.0000100 epoch_Time:25992.0min: [2024-01-04 21:42:46,950][model8_pretrain.py][INFO] Epoch:[0/2](492200/4588595) loss:2.590 lr:0.0000100 epoch_Time:25992.0min: [2024-01-04 21:42:46,950][model8_pretrain.py][INFO] Epoch:[0/2](492200/4588595) loss:2.422 lr:0.0000100 epoch_Time:25992.0min: [2024-01-04 21:42:46,950][model8_pretrain.py][INFO] Epoch:[0/2](492200/4588595) loss:2.925 lr:0.0000100 epoch_Time:25992.0min: [2024-01-04 21:42:46,950][model8_pretrain.py][INFO] Epoch:[0/2](492200/4588595) loss:3.374 lr:0.0000100 epoch_Time:25992.0min: [2024-01-04 21:42:46,950][model8_pretrain.py][INFO] Epoch:[0/2](492200/4588595) loss:2.707 lr:0.0000100 epoch_Time:25992.0min: [2024-01-04 21:42:46,950][model8_pretrain.py][INFO] Epoch:[0/2](492200/4588595) loss:3.178 lr:0.0000100 epoch_Time:25992.0min: [2024-01-04 21:42:46,950][model8_pretrain.py][INFO] Epoch:[0/2](492200/4588595) loss:2.484 lr:0.0000100 epoch_Time:25992.0min: [2024-01-04 21:42:46,950][model8_pretrain.py][INFO] Epoch:[0/2](492200/4588595) loss:3.212 lr:0.0000100 epoch_Time:25992.0min: [2024-01-04 21:43:23,888][model8_pretrain.py][INFO] Epoch:[0/2](492300/4588595) loss:2.986 lr:0.0000100 epoch_Time:25991.0min: [2024-01-04 21:43:23,888][model8_pretrain.py][INFO] Epoch:[0/2](492300/4588595) loss:2.745 lr:0.0000100 epoch_Time:25991.0min: [2024-01-04 21:43:23,888][model8_pretrain.py][INFO] Epoch:[0/2](492300/4588595) loss:2.745 lr:0.0000100 epoch_Time:25991.0min: [2024-01-04 21:43:23,888][model8_pretrain.py][INFO] Epoch:[0/2](492300/4588595) loss:2.345 lr:0.0000100 
epoch_Time:25991.0min: [2024-01-04 21:43:23,888][model8_pretrain.py][INFO] Epoch:[0/2](492300/4588595) loss:2.755 lr:0.0000100 epoch_Time:25991.0min: [2024-01-04 21:43:23,888][model8_pretrain.py][INFO] Epoch:[0/2](492300/4588595) loss:2.250 lr:0.0000100 epoch_Time:25991.0min: [2024-01-04 21:43:23,888][model8_pretrain.py][INFO] Epoch:[0/2](492300/4588595) loss:2.891 lr:0.0000100 epoch_Time:25991.0min: [2024-01-04 21:43:23,889][model8_pretrain.py][INFO] Epoch:[0/2](492300/4588595) loss:2.553 lr:0.0000100 epoch_Time:25991.0min: [2024-01-04 21:44:06,035][model8_pretrain.py][INFO] Epoch:[0/2](492400/4588595) loss:2.287 lr:0.0000100 epoch_Time:25990.0min: [2024-01-04 21:44:06,035][model8_pretrain.py][INFO] Epoch:[0/2](492400/4588595) loss:2.786 lr:0.0000100 epoch_Time:25990.0min: [2024-01-04 21:44:06,035][model8_pretrain.py][INFO] Epoch:[0/2](492400/4588595) loss:2.700 lr:0.0000100 epoch_Time:25990.0min: [2024-01-04 21:44:06,038][model8_pretrain.py][INFO] Epoch:[0/2](492400/4588595) loss:2.905 lr:0.0000100 epoch_Time:25990.0min: [2024-01-04 21:44:06,038][model8_pretrain.py][INFO] Epoch:[0/2](492400/4588595) loss:2.833 lr:0.0000100 epoch_Time:25990.0min: [2024-01-04 21:44:06,038][model8_pretrain.py][INFO] Epoch:[0/2](492400/4588595) loss:2.161 lr:0.0000100 epoch_Time:25990.0min: [2024-01-04 21:44:06,039][model8_pretrain.py][INFO] Epoch:[0/2](492400/4588595) loss:3.137 lr:0.0000100 epoch_Time:25990.0min: [2024-01-04 21:44:06,040][model8_pretrain.py][INFO] Epoch:[0/2](492400/4588595) loss:2.642 lr:0.0000100 epoch_Time:25990.0min: [2024-01-04 21:44:47,710][model8_pretrain.py][INFO] Epoch:[0/2](492500/4588595) loss:2.771 lr:0.0000100 epoch_Time:25990.0min: [2024-01-04 21:44:47,710][model8_pretrain.py][INFO] Epoch:[0/2](492500/4588595) loss:3.163 lr:0.0000100 epoch_Time:25990.0min: [2024-01-04 21:44:47,710][model8_pretrain.py][INFO] Epoch:[0/2](492500/4588595) loss:2.656 lr:0.0000100 epoch_Time:25990.0min: [2024-01-04 21:44:47,710][model8_pretrain.py][INFO] 
Epoch:[0/2](492500/4588595) loss:3.211 lr:0.0000100 epoch_Time:25990.0min: [2024-01-04 21:44:47,710][model8_pretrain.py][INFO] Epoch:[0/2](492500/4588595) loss:3.021 lr:0.0000100 epoch_Time:25990.0min: [2024-01-04 21:44:47,710][model8_pretrain.py][INFO] Epoch:[0/2](492500/4588595) loss:2.417 lr:0.0000100 epoch_Time:25990.0min: [2024-01-04 21:44:47,710][model8_pretrain.py][INFO] Epoch:[0/2](492500/4588595) loss:2.378 lr:0.0000100 epoch_Time:25990.0min: [2024-01-04 21:44:47,710][model8_pretrain.py][INFO] Epoch:[0/2](492500/4588595) loss:2.901 lr:0.0000100 epoch_Time:25990.0min: [2024-01-04 21:45:24,654][model8_pretrain.py][INFO] Epoch:[0/2](492600/4588595) loss:2.789 lr:0.0000100 epoch_Time:25990.0min: [2024-01-04 21:45:24,654][model8_pretrain.py][INFO] Epoch:[0/2](492600/4588595) loss:2.595 lr:0.0000100 epoch_Time:25990.0min: [2024-01-04 21:45:24,654][model8_pretrain.py][INFO] Epoch:[0/2](492600/4588595) loss:3.098 lr:0.0000100 epoch_Time:25990.0min: [2024-01-04 21:45:24,654][model8_pretrain.py][INFO] Epoch:[0/2](492600/4588595) loss:2.359 lr:0.0000100 epoch_Time:25990.0min: [2024-01-04 21:45:24,654][model8_pretrain.py][INFO] Epoch:[0/2](492600/4588595) loss:2.794 lr:0.0000100 epoch_Time:25990.0min: [2024-01-04 21:45:24,654][model8_pretrain.py][INFO] Epoch:[0/2](492600/4588595) loss:2.889 lr:0.0000100 epoch_Time:25990.0min: [2024-01-04 21:45:24,654][model8_pretrain.py][INFO] Epoch:[0/2](492600/4588595) loss:2.836 lr:0.0000100 epoch_Time:25990.0min: [2024-01-04 21:45:24,654][model8_pretrain.py][INFO] Epoch:[0/2](492600/4588595) loss:2.719 lr:0.0000100 epoch_Time:25990.0min: [2024-01-04 21:46:01,597][model8_pretrain.py][INFO] Epoch:[0/2](492700/4588595) loss:2.708 lr:0.0000100 epoch_Time:25989.0min: [2024-01-04 21:46:01,597][model8_pretrain.py][INFO] Epoch:[0/2](492700/4588595) loss:2.646 lr:0.0000100 epoch_Time:25989.0min: [2024-01-04 21:46:01,598][model8_pretrain.py][INFO] Epoch:[0/2](492700/4588595) loss:2.897 lr:0.0000100 epoch_Time:25989.0min: [2024-01-04 
21:46:01,598][model8_pretrain.py][INFO] Epoch:[0/2](492700/4588595) loss:2.574 lr:0.0000100 epoch_Time:25989.0min: [2024-01-04 21:46:01,598][model8_pretrain.py][INFO] Epoch:[0/2](492700/4588595) loss:2.991 lr:0.0000100 epoch_Time:25989.0min: [2024-01-04 21:46:01,598][model8_pretrain.py][INFO] Epoch:[0/2](492700/4588595) loss:2.764 lr:0.0000100 epoch_Time:25989.0min: [2024-01-04 21:46:01,598][model8_pretrain.py][INFO] Epoch:[0/2](492700/4588595) loss:3.210 lr:0.0000100 epoch_Time:25989.0min: [2024-01-04 21:46:01,598][model8_pretrain.py][INFO] Epoch:[0/2](492700/4588595) loss:2.797 lr:0.0000100 epoch_Time:25989.0min: [2024-01-04 21:46:38,542][model8_pretrain.py][INFO] Epoch:[0/2](492800/4588595) loss:3.410 lr:0.0000100 epoch_Time:25988.0min: [2024-01-04 21:46:38,542][model8_pretrain.py][INFO] Epoch:[0/2](492800/4588595) loss:2.165 lr:0.0000100 epoch_Time:25988.0min: [2024-01-04 21:46:38,542][model8_pretrain.py][INFO] Epoch:[0/2](492800/4588595) loss:3.223 lr:0.0000100 epoch_Time:25988.0min: [2024-01-04 21:46:38,542][model8_pretrain.py][INFO] Epoch:[0/2](492800/4588595) loss:2.290 lr:0.0000100 epoch_Time:25988.0min: [2024-01-04 21:46:38,542][model8_pretrain.py][INFO] Epoch:[0/2](492800/4588595) loss:2.761 lr:0.0000100 epoch_Time:25988.0min: [2024-01-04 21:46:38,542][model8_pretrain.py][INFO] Epoch:[0/2](492800/4588595) loss:2.859 lr:0.0000100 epoch_Time:25988.0min: [2024-01-04 21:46:38,542][model8_pretrain.py][INFO] Epoch:[0/2](492800/4588595) loss:2.874 lr:0.0000100 epoch_Time:25988.0min: [2024-01-04 21:46:38,542][model8_pretrain.py][INFO] Epoch:[0/2](492800/4588595) loss:2.789 lr:0.0000100 epoch_Time:25988.0min: [2024-01-04 21:47:15,488][model8_pretrain.py][INFO] Epoch:[0/2](492900/4588595) loss:2.451 lr:0.0000100 epoch_Time:25987.0min: [2024-01-04 21:47:15,488][model8_pretrain.py][INFO] Epoch:[0/2](492900/4588595) loss:2.982 lr:0.0000100 epoch_Time:25987.0min: [2024-01-04 21:47:15,488][model8_pretrain.py][INFO] Epoch:[0/2](492900/4588595) loss:3.227 lr:0.0000100 
epoch_Time:25987.0min: [2024-01-04 21:47:15,488][model8_pretrain.py][INFO] Epoch:[0/2](492900/4588595) loss:2.736 lr:0.0000100 epoch_Time:25987.0min: [2024-01-04 21:47:15,488][model8_pretrain.py][INFO] Epoch:[0/2](492900/4588595) loss:3.189 lr:0.0000100 epoch_Time:25987.0min: [2024-01-04 21:47:15,488][model8_pretrain.py][INFO] Epoch:[0/2](492900/4588595) loss:2.616 lr:0.0000100 epoch_Time:25987.0min: [2024-01-04 21:47:15,488][model8_pretrain.py][INFO] Epoch:[0/2](492900/4588595) loss:2.734 lr:0.0000100 epoch_Time:25987.0min: [2024-01-04 21:47:15,488][model8_pretrain.py][INFO] Epoch:[0/2](492900/4588595) loss:1.921 lr:0.0000100 epoch_Time:25987.0min: [2024-01-04 21:47:52,417][model8_pretrain.py][INFO] Epoch:[0/2](493000/4588595) loss:3.217 lr:0.0000100 epoch_Time:25986.0min: [2024-01-04 21:47:52,417][model8_pretrain.py][INFO] Epoch:[0/2](493000/4588595) loss:2.674 lr:0.0000100 epoch_Time:25986.0min: [2024-01-04 21:47:52,417][model8_pretrain.py][INFO] Epoch:[0/2](493000/4588595) loss:2.844 lr:0.0000100 epoch_Time:25986.0min: [2024-01-04 21:47:52,417][model8_pretrain.py][INFO] Epoch:[0/2](493000/4588595) loss:3.207 lr:0.0000100 epoch_Time:25986.0min: [2024-01-04 21:47:52,417][model8_pretrain.py][INFO] Epoch:[0/2](493000/4588595) loss:2.704 lr:0.0000100 epoch_Time:25986.0min: [2024-01-04 21:47:52,417][model8_pretrain.py][INFO] Epoch:[0/2](493000/4588595) loss:2.639 lr:0.0000100 epoch_Time:25986.0min: [2024-01-04 21:47:52,417][model8_pretrain.py][INFO] Epoch:[0/2](493000/4588595) loss:2.786 lr:0.0000100 epoch_Time:25986.0min: [2024-01-04 21:47:52,417][model8_pretrain.py][INFO] Epoch:[0/2](493000/4588595) loss:2.879 lr:0.0000100 epoch_Time:25986.0min: [2024-01-04 21:48:29,353][model8_pretrain.py][INFO] Epoch:[0/2](493100/4588595) loss:3.112 lr:0.0000100 epoch_Time:25986.0min: [2024-01-04 21:48:29,353][model8_pretrain.py][INFO] Epoch:[0/2](493100/4588595) loss:2.748 lr:0.0000100 epoch_Time:25986.0min: [2024-01-04 21:48:29,353][model8_pretrain.py][INFO] 
Epoch:[0/2](493100/4588595) loss:3.103 lr:0.0000100 epoch_Time:25986.0min: [2024-01-04 21:48:29,353][model8_pretrain.py][INFO] Epoch:[0/2](493100/4588595) loss:2.805 lr:0.0000100 epoch_Time:25986.0min: [2024-01-04 21:48:29,353][model8_pretrain.py][INFO] Epoch:[0/2](493100/4588595) loss:2.978 lr:0.0000100 epoch_Time:25986.0min: [2024-01-04 21:48:29,353][model8_pretrain.py][INFO] Epoch:[0/2](493100/4588595) loss:2.665 lr:0.0000100 epoch_Time:25986.0min: [2024-01-04 21:48:29,353][model8_pretrain.py][INFO] Epoch:[0/2](493100/4588595) loss:2.755 lr:0.0000100 epoch_Time:25986.0min: [2024-01-04 21:48:29,353][model8_pretrain.py][INFO] Epoch:[0/2](493100/4588595) loss:2.899 lr:0.0000100 epoch_Time:25986.0min: [2024-01-04 21:49:08,053][model8_pretrain.py][INFO] Epoch:[0/2](493200/4588595) loss:2.663 lr:0.0000100 epoch_Time:25985.0min: [2024-01-04 21:49:08,053][model8_pretrain.py][INFO] Epoch:[0/2](493200/4588595) loss:2.893 lr:0.0000100 epoch_Time:25985.0min: [2024-01-04 21:49:08,053][model8_pretrain.py][INFO] Epoch:[0/2](493200/4588595) loss:2.657 lr:0.0000100 epoch_Time:25985.0min: [2024-01-04 21:49:08,053][model8_pretrain.py][INFO] Epoch:[0/2](493200/4588595) loss:2.508 lr:0.0000100 epoch_Time:25985.0min: [2024-01-04 21:49:08,053][model8_pretrain.py][INFO] Epoch:[0/2](493200/4588595) loss:2.804 lr:0.0000100 epoch_Time:25985.0min: [2024-01-04 21:49:08,054][model8_pretrain.py][INFO] Epoch:[0/2](493200/4588595) loss:2.801 lr:0.0000100 epoch_Time:25985.0min: [2024-01-04 21:49:08,054][model8_pretrain.py][INFO] Epoch:[0/2](493200/4588595) loss:2.786 lr:0.0000100 epoch_Time:25985.0min: [2024-01-04 21:49:08,054][model8_pretrain.py][INFO] Epoch:[0/2](493200/4588595) loss:2.888 lr:0.0000100 epoch_Time:25985.0min: [2024-01-04 21:49:53,129][model8_pretrain.py][INFO] Epoch:[0/2](493300/4588595) loss:3.064 lr:0.0000100 epoch_Time:25985.0min: [2024-01-04 21:49:53,129][model8_pretrain.py][INFO] Epoch:[0/2](493300/4588595) loss:3.226 lr:0.0000100 epoch_Time:25985.0min: [2024-01-04 
21:49:53,129][model8_pretrain.py][INFO] Epoch:[0/2](493300/4588595) loss:2.521 lr:0.0000100 epoch_Time:25985.0min: [2024-01-04 21:49:53,129][model8_pretrain.py][INFO] Epoch:[0/2](493300/4588595) loss:2.916 lr:0.0000100 epoch_Time:25985.0min: [2024-01-04 21:49:53,129][model8_pretrain.py][INFO] Epoch:[0/2](493300/4588595) loss:2.951 lr:0.0000100 epoch_Time:25985.0min: [2024-01-04 21:49:53,129][model8_pretrain.py][INFO] Epoch:[0/2](493300/4588595) loss:2.672 lr:0.0000100 epoch_Time:25985.0min: [2024-01-04 21:49:53,130][model8_pretrain.py][INFO] Epoch:[0/2](493300/4588595) loss:2.632 lr:0.0000100 epoch_Time:25985.0min: [2024-01-04 21:49:53,130][model8_pretrain.py][INFO] Epoch:[0/2](493300/4588595) loss:2.732 lr:0.0000100 epoch_Time:25985.0min: [2024-01-04 21:50:30,080][model8_pretrain.py][INFO] Epoch:[0/2](493400/4588595) loss:3.197 lr:0.0000100 epoch_Time:25985.0min: [2024-01-04 21:50:30,080][model8_pretrain.py][INFO] Epoch:[0/2](493400/4588595) loss:2.777 lr:0.0000100 epoch_Time:25985.0min: [2024-01-04 21:50:30,080][model8_pretrain.py][INFO] Epoch:[0/2](493400/4588595) loss:2.737 lr:0.0000100 epoch_Time:25985.0min: [2024-01-04 21:50:30,080][model8_pretrain.py][INFO] Epoch:[0/2](493400/4588595) loss:3.062 lr:0.0000100 epoch_Time:25985.0min: [2024-01-04 21:50:30,080][model8_pretrain.py][INFO] Epoch:[0/2](493400/4588595) loss:3.062 lr:0.0000100 epoch_Time:25985.0min: [2024-01-04 21:50:30,080][model8_pretrain.py][INFO] Epoch:[0/2](493400/4588595) loss:2.895 lr:0.0000100 epoch_Time:25985.0min: [2024-01-04 21:50:30,080][model8_pretrain.py][INFO] Epoch:[0/2](493400/4588595) loss:2.971 lr:0.0000100 epoch_Time:25985.0min: [2024-01-04 21:50:30,080][model8_pretrain.py][INFO] Epoch:[0/2](493400/4588595) loss:2.232 lr:0.0000100 epoch_Time:25985.0min: [2024-01-04 21:51:07,021][model8_pretrain.py][INFO] Epoch:[0/2](493500/4588595) loss:2.984 lr:0.0000100 epoch_Time:25984.0min: [2024-01-04 21:51:07,021][model8_pretrain.py][INFO] Epoch:[0/2](493500/4588595) loss:3.338 lr:0.0000100 
epoch_Time:25984.0min: [2024-01-04 21:51:07,021][model8_pretrain.py][INFO] Epoch:[0/2](493500/4588595) loss:2.973 lr:0.0000100 epoch_Time:25984.0min: [2024-01-04 21:51:07,021][model8_pretrain.py][INFO] Epoch:[0/2](493500/4588595) loss:2.950 lr:0.0000100 epoch_Time:25984.0min: [2024-01-04 21:51:07,021][model8_pretrain.py][INFO] Epoch:[0/2](493500/4588595) loss:2.736 lr:0.0000100 epoch_Time:25984.0min: [2024-01-04 21:51:07,021][model8_pretrain.py][INFO] Epoch:[0/2](493500/4588595) loss:2.931 lr:0.0000100 epoch_Time:25984.0min: [2024-01-04 21:51:07,021][model8_pretrain.py][INFO] Epoch:[0/2](493500/4588595) loss:3.214 lr:0.0000100 epoch_Time:25984.0min: [2024-01-04 21:51:07,021][model8_pretrain.py][INFO] Epoch:[0/2](493500/4588595) loss:2.664 lr:0.0000100 epoch_Time:25984.0min: [2024-01-04 21:51:43,963][model8_pretrain.py][INFO] Epoch:[0/2](493600/4588595) loss:2.995 lr:0.0000100 epoch_Time:25984.0min: [2024-01-04 21:51:43,963][model8_pretrain.py][INFO] Epoch:[0/2](493600/4588595) loss:3.085 lr:0.0000100 epoch_Time:25984.0min: [2024-01-04 21:51:43,963][model8_pretrain.py][INFO] Epoch:[0/2](493600/4588595) loss:2.729 lr:0.0000100 epoch_Time:25984.0min: [2024-01-04 21:51:43,963][model8_pretrain.py][INFO] Epoch:[0/2](493600/4588595) loss:2.843 lr:0.0000100 epoch_Time:25984.0min: [2024-01-04 21:51:43,963][model8_pretrain.py][INFO] Epoch:[0/2](493600/4588595) loss:3.059 lr:0.0000100 epoch_Time:25984.0min: [2024-01-04 21:51:43,963][model8_pretrain.py][INFO] Epoch:[0/2](493600/4588595) loss:3.025 lr:0.0000100 epoch_Time:25984.0min: [2024-01-04 21:51:43,963][model8_pretrain.py][INFO] Epoch:[0/2](493600/4588595) loss:3.097 lr:0.0000100 epoch_Time:25984.0min: [2024-01-04 21:51:43,964][model8_pretrain.py][INFO] Epoch:[0/2](493600/4588595) loss:3.210 lr:0.0000100 epoch_Time:25984.0min: [2024-01-04 21:52:20,883][model8_pretrain.py][INFO] Epoch:[0/2](493700/4588595) loss:3.270 lr:0.0000100 epoch_Time:25982.0min: [2024-01-04 21:52:20,883][model8_pretrain.py][INFO] 
Epoch:[0/2](493700/4588595) loss:3.112 lr:0.0000100 epoch_Time:25982.0min: [2024-01-04 21:52:20,883][model8_pretrain.py][INFO] Epoch:[0/2](493700/4588595) loss:2.325 lr:0.0000100 epoch_Time:25982.0min: [2024-01-04 21:52:20,883][model8_pretrain.py][INFO] Epoch:[0/2](493700/4588595) loss:2.915 lr:0.0000100 epoch_Time:25982.0min: [2024-01-04 21:52:20,883][model8_pretrain.py][INFO] Epoch:[0/2](493700/4588595) loss:3.188 lr:0.0000100 epoch_Time:25982.0min: [2024-01-04 21:52:20,883][model8_pretrain.py][INFO] Epoch:[0/2](493700/4588595) loss:3.236 lr:0.0000100 epoch_Time:25982.0min: [2024-01-04 21:52:20,883][model8_pretrain.py][INFO] Epoch:[0/2](493700/4588595) loss:2.743 lr:0.0000100 epoch_Time:25982.0min: [2024-01-04 21:52:20,883][model8_pretrain.py][INFO] Epoch:[0/2](493700/4588595) loss:3.023 lr:0.0000100 epoch_Time:25982.0min: [2024-01-04 21:52:57,815][model8_pretrain.py][INFO] Epoch:[0/2](493800/4588595) loss:2.891 lr:0.0000100 epoch_Time:25981.0min: [2024-01-04 21:52:57,815][model8_pretrain.py][INFO] Epoch:[0/2](493800/4588595) loss:2.645 lr:0.0000100 epoch_Time:25981.0min: [2024-01-04 21:52:57,815][model8_pretrain.py][INFO] Epoch:[0/2](493800/4588595) loss:3.183 lr:0.0000100 epoch_Time:25981.0min: [2024-01-04 21:52:57,815][model8_pretrain.py][INFO] Epoch:[0/2](493800/4588595) loss:3.213 lr:0.0000100 epoch_Time:25981.0min: [2024-01-04 21:52:57,815][model8_pretrain.py][INFO] Epoch:[0/2](493800/4588595) loss:2.792 lr:0.0000100 epoch_Time:25981.0min: [2024-01-04 21:52:57,815][model8_pretrain.py][INFO] Epoch:[0/2](493800/4588595) loss:2.498 lr:0.0000100 epoch_Time:25981.0min: [2024-01-04 21:52:57,816][model8_pretrain.py][INFO] Epoch:[0/2](493800/4588595) loss:2.813 lr:0.0000100 epoch_Time:25981.0min: [2024-01-04 21:52:57,816][model8_pretrain.py][INFO] Epoch:[0/2](493800/4588595) loss:2.874 lr:0.0000100 epoch_Time:25981.0min: [2024-01-04 21:53:34,742][model8_pretrain.py][INFO] Epoch:[0/2](493900/4588595) loss:3.235 lr:0.0000100 epoch_Time:25981.0min: [2024-01-04 
21:53:34,742][model8_pretrain.py][INFO] Epoch:[0/2](493900/4588595) loss:2.966 lr:0.0000100 epoch_Time:25981.0min: [2024-01-04 21:53:34,742][model8_pretrain.py][INFO] Epoch:[0/2](493900/4588595) loss:2.614 lr:0.0000100 epoch_Time:25981.0min: [2024-01-04 21:53:34,742][model8_pretrain.py][INFO] Epoch:[0/2](493900/4588595) loss:3.005 lr:0.0000100 epoch_Time:25981.0min: [2024-01-04 21:53:34,742][model8_pretrain.py][INFO] Epoch:[0/2](493900/4588595) loss:3.126 lr:0.0000100 epoch_Time:25981.0min: [2024-01-04 21:53:34,742][model8_pretrain.py][INFO] Epoch:[0/2](493900/4588595) loss:2.781 lr:0.0000100 epoch_Time:25981.0min: [2024-01-04 21:53:34,742][model8_pretrain.py][INFO] Epoch:[0/2](493900/4588595) loss:2.416 lr:0.0000100 epoch_Time:25981.0min: [2024-01-04 21:53:34,742][model8_pretrain.py][INFO] Epoch:[0/2](493900/4588595) loss:2.972 lr:0.0000100 epoch_Time:25981.0min: [2024-01-04 21:54:13,500][model8_pretrain.py][INFO] Epoch:[0/2](494000/4588595) loss:3.009 lr:0.0000100 epoch_Time:25980.0min: [2024-01-04 21:54:13,500][model8_pretrain.py][INFO] Epoch:[0/2](494000/4588595) loss:2.613 lr:0.0000100 epoch_Time:25980.0min: [2024-01-04 21:54:13,500][model8_pretrain.py][INFO] Epoch:[0/2](494000/4588595) loss:3.019 lr:0.0000100 epoch_Time:25980.0min: [2024-01-04 21:54:13,500][model8_pretrain.py][INFO] Epoch:[0/2](494000/4588595) loss:3.241 lr:0.0000100 epoch_Time:25980.0min: [2024-01-04 21:54:13,500][model8_pretrain.py][INFO] Epoch:[0/2](494000/4588595) loss:2.461 lr:0.0000100 epoch_Time:25980.0min: [2024-01-04 21:54:13,500][model8_pretrain.py][INFO] Epoch:[0/2](494000/4588595) loss:2.759 lr:0.0000100 epoch_Time:25980.0min: [2024-01-04 21:54:13,500][model8_pretrain.py][INFO] Epoch:[0/2](494000/4588595) loss:3.202 lr:0.0000100 epoch_Time:25980.0min: [2024-01-04 21:54:13,500][model8_pretrain.py][INFO] Epoch:[0/2](494000/4588595) loss:2.798 lr:0.0000100 epoch_Time:25980.0min: [2024-01-04 21:54:58,458][model8_pretrain.py][INFO] Epoch:[0/2](494100/4588595) loss:2.666 lr:0.0000100 
epoch_Time:25980.0min: [2024-01-04 21:54:58,458][model8_pretrain.py][INFO] Epoch:[0/2](494100/4588595) loss:3.415 lr:0.0000100 epoch_Time:25980.0min: [2024-01-04 21:54:58,458][model8_pretrain.py][INFO] Epoch:[0/2](494100/4588595) loss:2.896 lr:0.0000100 epoch_Time:25980.0min: [2024-01-04 21:54:58,458][model8_pretrain.py][INFO] Epoch:[0/2](494100/4588595) loss:2.768 lr:0.0000100 epoch_Time:25980.0min: [2024-01-04 21:54:58,458][model8_pretrain.py][INFO] Epoch:[0/2](494100/4588595) loss:3.166 lr:0.0000100 epoch_Time:25980.0min: [2024-01-04 21:54:58,458][model8_pretrain.py][INFO] Epoch:[0/2](494100/4588595) loss:3.081 lr:0.0000100 epoch_Time:25980.0min: [2024-01-04 21:54:58,458][model8_pretrain.py][INFO] Epoch:[0/2](494100/4588595) loss:2.811 lr:0.0000100 epoch_Time:25980.0min: [2024-01-04 21:54:58,458][model8_pretrain.py][INFO] Epoch:[0/2](494100/4588595) loss:3.353 lr:0.0000100 epoch_Time:25980.0min: [2024-01-04 21:55:35,390][model8_pretrain.py][INFO] Epoch:[0/2](494200/4588595) loss:3.061 lr:0.0000100 epoch_Time:25980.0min: [2024-01-04 21:55:35,390][model8_pretrain.py][INFO] Epoch:[0/2](494200/4588595) loss:2.938 lr:0.0000100 epoch_Time:25980.0min: [2024-01-04 21:55:35,390][model8_pretrain.py][INFO] Epoch:[0/2](494200/4588595) loss:3.057 lr:0.0000100 epoch_Time:25980.0min: [2024-01-04 21:55:35,390][model8_pretrain.py][INFO] Epoch:[0/2](494200/4588595) loss:2.992 lr:0.0000100 epoch_Time:25980.0min: [2024-01-04 21:55:35,390][model8_pretrain.py][INFO] Epoch:[0/2](494200/4588595) loss:3.220 lr:0.0000100 epoch_Time:25980.0min: [2024-01-04 21:55:35,390][model8_pretrain.py][INFO] Epoch:[0/2](494200/4588595) loss:2.958 lr:0.0000100 epoch_Time:25980.0min: [2024-01-04 21:55:35,390][model8_pretrain.py][INFO] Epoch:[0/2](494200/4588595) loss:3.199 lr:0.0000100 epoch_Time:25980.0min: [2024-01-04 21:55:35,390][model8_pretrain.py][INFO] Epoch:[0/2](494200/4588595) loss:2.824 lr:0.0000100 epoch_Time:25980.0min: [2024-01-04 21:56:12,332][model8_pretrain.py][INFO] 
Epoch:[0/2](494300/4588595) loss:3.110 lr:0.0000100 epoch_Time:25979.0min: [2024-01-04 21:56:12,332][model8_pretrain.py][INFO] Epoch:[0/2](494300/4588595) loss:2.064 lr:0.0000100 epoch_Time:25979.0min: [2024-01-04 21:56:12,332][model8_pretrain.py][INFO] Epoch:[0/2](494300/4588595) loss:2.877 lr:0.0000100 epoch_Time:25979.0min: [2024-01-04 21:56:12,332][model8_pretrain.py][INFO] Epoch:[0/2](494300/4588595) loss:2.471 lr:0.0000100 epoch_Time:25979.0min: [2024-01-04 21:56:12,332][model8_pretrain.py][INFO] Epoch:[0/2](494300/4588595) loss:2.865 lr:0.0000100 epoch_Time:25979.0min: [2024-01-04 21:56:12,332][model8_pretrain.py][INFO] Epoch:[0/2](494300/4588595) loss:2.565 lr:0.0000100 epoch_Time:25979.0min: [2024-01-04 21:56:12,332][model8_pretrain.py][INFO] Epoch:[0/2](494300/4588595) loss:2.873 lr:0.0000100 epoch_Time:25979.0min: [2024-01-04 21:56:12,332][model8_pretrain.py][INFO] Epoch:[0/2](494300/4588595) loss:2.777 lr:0.0000100 epoch_Time:25979.0min: [2024-01-04 21:56:49,271][model8_pretrain.py][INFO] Epoch:[0/2](494400/4588595) loss:3.173 lr:0.0000100 epoch_Time:25978.0min: [2024-01-04 21:56:49,271][model8_pretrain.py][INFO] Epoch:[0/2](494400/4588595) loss:2.520 lr:0.0000100 epoch_Time:25978.0min: [2024-01-04 21:56:49,271][model8_pretrain.py][INFO] Epoch:[0/2](494400/4588595) loss:3.226 lr:0.0000100 epoch_Time:25978.0min: [2024-01-04 21:56:49,271][model8_pretrain.py][INFO] Epoch:[0/2](494400/4588595) loss:2.727 lr:0.0000100 epoch_Time:25978.0min: [2024-01-04 21:56:49,271][model8_pretrain.py][INFO] Epoch:[0/2](494400/4588595) loss:2.734 lr:0.0000100 epoch_Time:25978.0min: [2024-01-04 21:56:49,271][model8_pretrain.py][INFO] Epoch:[0/2](494400/4588595) loss:2.635 lr:0.0000100 epoch_Time:25978.0min: [2024-01-04 21:56:49,271][model8_pretrain.py][INFO] Epoch:[0/2](494400/4588595) loss:2.699 lr:0.0000100 epoch_Time:25978.0min: [2024-01-04 21:56:49,271][model8_pretrain.py][INFO] Epoch:[0/2](494400/4588595) loss:2.561 lr:0.0000100 epoch_Time:25978.0min: [2024-01-04 
21:57:26,200][model8_pretrain.py][INFO] Epoch:[0/2](494500/4588595) loss:2.581 lr:0.0000100 epoch_Time:25977.0min: [2024-01-04 21:57:26,200][model8_pretrain.py][INFO] Epoch:[0/2](494500/4588595) loss:2.959 lr:0.0000100 epoch_Time:25977.0min: [2024-01-04 21:57:26,200][model8_pretrain.py][INFO] Epoch:[0/2](494500/4588595) loss:2.841 lr:0.0000100 epoch_Time:25977.0min: [2024-01-04 21:57:26,200][model8_pretrain.py][INFO] Epoch:[0/2](494500/4588595) loss:2.539 lr:0.0000100 epoch_Time:25977.0min: [2024-01-04 21:57:26,200][model8_pretrain.py][INFO] Epoch:[0/2](494500/4588595) loss:2.766 lr:0.0000100 epoch_Time:25977.0min: [2024-01-04 21:57:26,200][model8_pretrain.py][INFO] Epoch:[0/2](494500/4588595) loss:3.276 lr:0.0000100 epoch_Time:25977.0min: [2024-01-04 21:57:26,200][model8_pretrain.py][INFO] Epoch:[0/2](494500/4588595) loss:2.813 lr:0.0000100 epoch_Time:25977.0min: [2024-01-04 21:57:26,200][model8_pretrain.py][INFO] Epoch:[0/2](494500/4588595) loss:3.280 lr:0.0000100 epoch_Time:25977.0min: [2024-01-04 21:58:03,133][model8_pretrain.py][INFO] Epoch:[0/2](494600/4588595) loss:2.808 lr:0.0000100 epoch_Time:25976.0min: [2024-01-04 21:58:03,133][model8_pretrain.py][INFO] Epoch:[0/2](494600/4588595) loss:3.419 lr:0.0000100 epoch_Time:25976.0min: [2024-01-04 21:58:03,133][model8_pretrain.py][INFO] Epoch:[0/2](494600/4588595) loss:3.015 lr:0.0000100 epoch_Time:25976.0min: [2024-01-04 21:58:03,133][model8_pretrain.py][INFO] Epoch:[0/2](494600/4588595) loss:3.064 lr:0.0000100 epoch_Time:25976.0min: [2024-01-04 21:58:03,133][model8_pretrain.py][INFO] Epoch:[0/2](494600/4588595) loss:2.525 lr:0.0000100 epoch_Time:25976.0min: [2024-01-04 21:58:03,133][model8_pretrain.py][INFO] Epoch:[0/2](494600/4588595) loss:2.803 lr:0.0000100 epoch_Time:25976.0min: [2024-01-04 21:58:03,133][model8_pretrain.py][INFO] Epoch:[0/2](494600/4588595) loss:3.447 lr:0.0000100 epoch_Time:25976.0min: [2024-01-04 21:58:03,133][model8_pretrain.py][INFO] Epoch:[0/2](494600/4588595) loss:2.670 lr:0.0000100 
epoch_Time:25976.0min: [2024-01-04 21:58:40,066][model8_pretrain.py][INFO] Epoch:[0/2](494700/4588595) loss:2.963 lr:0.0000100 epoch_Time:25976.0min: [2024-01-04 21:58:40,066][model8_pretrain.py][INFO] Epoch:[0/2](494700/4588595) loss:3.052 lr:0.0000100 epoch_Time:25976.0min: [2024-01-04 21:58:40,066][model8_pretrain.py][INFO] Epoch:[0/2](494700/4588595) loss:2.909 lr:0.0000100 epoch_Time:25976.0min: [2024-01-04 21:58:40,066][model8_pretrain.py][INFO] Epoch:[0/2](494700/4588595) loss:2.968 lr:0.0000100 epoch_Time:25976.0min: [2024-01-04 21:58:40,066][model8_pretrain.py][INFO] Epoch:[0/2](494700/4588595) loss:3.028 lr:0.0000100 epoch_Time:25976.0min: [2024-01-04 21:58:40,066][model8_pretrain.py][INFO] Epoch:[0/2](494700/4588595) loss:3.017 lr:0.0000100 epoch_Time:25976.0min: [2024-01-04 21:58:40,066][model8_pretrain.py][INFO] Epoch:[0/2](494700/4588595) loss:2.884 lr:0.0000100 epoch_Time:25976.0min: [2024-01-04 21:58:40,067][model8_pretrain.py][INFO] Epoch:[0/2](494700/4588595) loss:2.960 lr:0.0000100 epoch_Time:25976.0min: [2024-01-04 21:59:18,769][model8_pretrain.py][INFO] Epoch:[0/2](494800/4588595) loss:3.122 lr:0.0000100 epoch_Time:25975.0min: [2024-01-04 21:59:18,769][model8_pretrain.py][INFO] Epoch:[0/2](494800/4588595) loss:2.799 lr:0.0000100 epoch_Time:25975.0min: [2024-01-04 21:59:18,769][model8_pretrain.py][INFO] Epoch:[0/2](494800/4588595) loss:3.211 lr:0.0000100 epoch_Time:25975.0min: [2024-01-04 21:59:18,769][model8_pretrain.py][INFO] Epoch:[0/2](494800/4588595) loss:2.816 lr:0.0000100 epoch_Time:25975.0min: [2024-01-04 21:59:18,769][model8_pretrain.py][INFO] Epoch:[0/2](494800/4588595) loss:2.643 lr:0.0000100 epoch_Time:25975.0min: [2024-01-04 21:59:18,769][model8_pretrain.py][INFO] Epoch:[0/2](494800/4588595) loss:3.102 lr:0.0000100 epoch_Time:25975.0min: [2024-01-04 21:59:18,769][model8_pretrain.py][INFO] Epoch:[0/2](494800/4588595) loss:2.779 lr:0.0000100 epoch_Time:25975.0min: [2024-01-04 21:59:18,769][model8_pretrain.py][INFO] 
Epoch:[0/2](494800/4588595) loss:3.050 lr:0.0000100 epoch_Time:25975.0min: [2024-01-04 22:00:04,242][model8_pretrain.py][INFO] Epoch:[0/2](494900/4588595) loss:2.762 lr:0.0000100 epoch_Time:25975.0min: [2024-01-04 22:00:04,242][model8_pretrain.py][INFO] Epoch:[0/2](494900/4588595) loss:3.065 lr:0.0000100 epoch_Time:25975.0min: [2024-01-04 22:00:04,242][model8_pretrain.py][INFO] Epoch:[0/2](494900/4588595) loss:2.407 lr:0.0000100 epoch_Time:25975.0min: [2024-01-04 22:00:04,242][model8_pretrain.py][INFO] Epoch:[0/2](494900/4588595) loss:3.137 lr:0.0000100 epoch_Time:25975.0min: [2024-01-04 22:00:04,242][model8_pretrain.py][INFO] Epoch:[0/2](494900/4588595) loss:2.868 lr:0.0000100 epoch_Time:25975.0min: [2024-01-04 22:00:04,242][model8_pretrain.py][INFO] Epoch:[0/2](494900/4588595) loss:2.829 lr:0.0000100 epoch_Time:25975.0min: [2024-01-04 22:00:04,242][model8_pretrain.py][INFO] Epoch:[0/2](494900/4588595) loss:2.803 lr:0.0000100 epoch_Time:25975.0min: [2024-01-04 22:00:04,242][model8_pretrain.py][INFO] Epoch:[0/2](494900/4588595) loss:2.481 lr:0.0000100 epoch_Time:25975.0min: [2024-01-04 22:00:41,178][model8_pretrain.py][INFO] Epoch:[0/2](495000/4588595) loss:3.094 lr:0.0000100 epoch_Time:25975.0min: [2024-01-04 22:00:41,178][model8_pretrain.py][INFO] Epoch:[0/2](495000/4588595) loss:3.050 lr:0.0000100 epoch_Time:25975.0min: [2024-01-04 22:00:41,178][model8_pretrain.py][INFO] Epoch:[0/2](495000/4588595) loss:2.933 lr:0.0000100 epoch_Time:25975.0min: [2024-01-04 22:00:41,178][model8_pretrain.py][INFO] Epoch:[0/2](495000/4588595) loss:2.784 lr:0.0000100 epoch_Time:25975.0min: [2024-01-04 22:00:41,178][model8_pretrain.py][INFO] Epoch:[0/2](495000/4588595) loss:2.868 lr:0.0000100 epoch_Time:25975.0min: [2024-01-04 22:00:41,178][model8_pretrain.py][INFO] Epoch:[0/2](495000/4588595) loss:3.041 lr:0.0000100 epoch_Time:25975.0min: [2024-01-04 22:00:41,178][model8_pretrain.py][INFO] Epoch:[0/2](495000/4588595) loss:2.423 lr:0.0000100 epoch_Time:25975.0min: [2024-01-04 
22:00:41,178][model8_pretrain.py][INFO] Epoch:[0/2](495000/4588595) loss:2.937 lr:0.0000100 epoch_Time:25975.0min: [2024-01-04 22:01:18,112][model8_pretrain.py][INFO] Epoch:[0/2](495100/4588595) loss:2.445 lr:0.0000100 epoch_Time:25974.0min: [2024-01-04 22:01:18,112][model8_pretrain.py][INFO] Epoch:[0/2](495100/4588595) loss:3.031 lr:0.0000100 epoch_Time:25974.0min: [2024-01-04 22:01:18,112][model8_pretrain.py][INFO] Epoch:[0/2](495100/4588595) loss:3.320 lr:0.0000100 epoch_Time:25974.0min: [2024-01-04 22:01:18,112][model8_pretrain.py][INFO] Epoch:[0/2](495100/4588595) loss:2.357 lr:0.0000100 epoch_Time:25974.0min: [2024-01-04 22:01:18,112][model8_pretrain.py][INFO] Epoch:[0/2](495100/4588595) loss:2.875 lr:0.0000100 epoch_Time:25974.0min: [2024-01-04 22:01:18,112][model8_pretrain.py][INFO] Epoch:[0/2](495100/4588595) loss:2.993 lr:0.0000100 epoch_Time:25974.0min: [2024-01-04 22:01:18,113][model8_pretrain.py][INFO] Epoch:[0/2](495100/4588595) loss:2.818 lr:0.0000100 epoch_Time:25974.0min: [2024-01-04 22:01:18,113][model8_pretrain.py][INFO] Epoch:[0/2](495100/4588595) loss:3.096 lr:0.0000100 epoch_Time:25974.0min: [2024-01-04 22:01:55,042][model8_pretrain.py][INFO] Epoch:[0/2](495200/4588595) loss:2.164 lr:0.0000100 epoch_Time:25973.0min: [2024-01-04 22:01:55,042][model8_pretrain.py][INFO] Epoch:[0/2](495200/4588595) loss:2.633 lr:0.0000100 epoch_Time:25973.0min: [2024-01-04 22:01:55,042][model8_pretrain.py][INFO] Epoch:[0/2](495200/4588595) loss:2.466 lr:0.0000100 epoch_Time:25973.0min: [2024-01-04 22:01:55,042][model8_pretrain.py][INFO] Epoch:[0/2](495200/4588595) loss:3.358 lr:0.0000100 epoch_Time:25973.0min: [2024-01-04 22:01:55,042][model8_pretrain.py][INFO] Epoch:[0/2](495200/4588595) loss:2.332 lr:0.0000100 epoch_Time:25973.0min: [2024-01-04 22:01:55,042][model8_pretrain.py][INFO] Epoch:[0/2](495200/4588595) loss:2.685 lr:0.0000100 epoch_Time:25973.0min: [2024-01-04 22:01:55,042][model8_pretrain.py][INFO] Epoch:[0/2](495200/4588595) loss:2.962 lr:0.0000100 
epoch_Time:25973.0min: [2024-01-04 22:01:55,042][model8_pretrain.py][INFO] Epoch:[0/2](495200/4588595) loss:2.659 lr:0.0000100 epoch_Time:25973.0min: [2024-01-04 22:02:31,971][model8_pretrain.py][INFO] Epoch:[0/2](495300/4588595) loss:2.537 lr:0.0000100 epoch_Time:25973.0min: [2024-01-04 22:02:31,971][model8_pretrain.py][INFO] Epoch:[0/2](495300/4588595) loss:2.484 lr:0.0000100 epoch_Time:25973.0min: [2024-01-04 22:02:31,971][model8_pretrain.py][INFO] Epoch:[0/2](495300/4588595) loss:3.275 lr:0.0000100 epoch_Time:25973.0min: [2024-01-04 22:02:31,971][model8_pretrain.py][INFO] Epoch:[0/2](495300/4588595) loss:3.396 lr:0.0000100 epoch_Time:25973.0min: [2024-01-04 22:02:31,971][model8_pretrain.py][INFO] Epoch:[0/2](495300/4588595) loss:2.589 lr:0.0000100 epoch_Time:25973.0min: [2024-01-04 22:02:31,971][model8_pretrain.py][INFO] Epoch:[0/2](495300/4588595) loss:3.041 lr:0.0000100 epoch_Time:25973.0min: [2024-01-04 22:02:31,971][model8_pretrain.py][INFO] Epoch:[0/2](495300/4588595) loss:2.400 lr:0.0000100 epoch_Time:25973.0min: [2024-01-04 22:02:31,971][model8_pretrain.py][INFO] Epoch:[0/2](495300/4588595) loss:2.826 lr:0.0000100 epoch_Time:25973.0min: [2024-01-04 22:03:08,901][model8_pretrain.py][INFO] Epoch:[0/2](495400/4588595) loss:3.299 lr:0.0000100 epoch_Time:25971.0min: [2024-01-04 22:03:08,901][model8_pretrain.py][INFO] Epoch:[0/2](495400/4588595) loss:3.129 lr:0.0000100 epoch_Time:25971.0min: [2024-01-04 22:03:08,902][model8_pretrain.py][INFO] Epoch:[0/2](495400/4588595) loss:3.166 lr:0.0000100 epoch_Time:25971.0min: [2024-01-04 22:03:08,902][model8_pretrain.py][INFO] Epoch:[0/2](495400/4588595) loss:2.854 lr:0.0000100 epoch_Time:25971.0min: [2024-01-04 22:03:08,901][model8_pretrain.py][INFO] Epoch:[0/2](495400/4588595) loss:2.563 lr:0.0000100 epoch_Time:25971.0min: [2024-01-04 22:03:08,902][model8_pretrain.py][INFO] Epoch:[0/2](495400/4588595) loss:3.351 lr:0.0000100 epoch_Time:25971.0min: [2024-01-04 22:03:08,903][model8_pretrain.py][INFO] 
Epoch:[0/2](495400/4588595) loss:3.455 lr:0.0000100 epoch_Time:25971.0min: [2024-01-04 22:03:08,904][model8_pretrain.py][INFO] Epoch:[0/2](495400/4588595) loss:2.236 lr:0.0000100 epoch_Time:25971.0min: [2024-01-04 22:03:45,831][model8_pretrain.py][INFO] Epoch:[0/2](495500/4588595) loss:2.919 lr:0.0000100 epoch_Time:25971.0min: [2024-01-04 22:03:45,831][model8_pretrain.py][INFO] Epoch:[0/2](495500/4588595) loss:2.660 lr:0.0000100 epoch_Time:25971.0min: [2024-01-04 22:03:45,831][model8_pretrain.py][INFO] Epoch:[0/2](495500/4588595) loss:2.623 lr:0.0000100 epoch_Time:25971.0min: [2024-01-04 22:03:45,831][model8_pretrain.py][INFO] Epoch:[0/2](495500/4588595) loss:2.844 lr:0.0000100 epoch_Time:25971.0min: [2024-01-04 22:03:45,831][model8_pretrain.py][INFO] Epoch:[0/2](495500/4588595) loss:3.131 lr:0.0000100 epoch_Time:25971.0min: [2024-01-04 22:03:45,831][model8_pretrain.py][INFO] Epoch:[0/2](495500/4588595) loss:2.927 lr:0.0000100 epoch_Time:25971.0min: [2024-01-04 22:03:45,831][model8_pretrain.py][INFO] Epoch:[0/2](495500/4588595) loss:2.858 lr:0.0000100 epoch_Time:25971.0min: [2024-01-04 22:03:45,831][model8_pretrain.py][INFO] Epoch:[0/2](495500/4588595) loss:3.306 lr:0.0000100 epoch_Time:25971.0min: [2024-01-04 22:04:22,756][model8_pretrain.py][INFO] Epoch:[0/2](495600/4588595) loss:2.866 lr:0.0000100 epoch_Time:25970.0min: [2024-01-04 22:04:22,756][model8_pretrain.py][INFO] Epoch:[0/2](495600/4588595) loss:2.423 lr:0.0000100 epoch_Time:25970.0min: [2024-01-04 22:04:22,756][model8_pretrain.py][INFO] Epoch:[0/2](495600/4588595) loss:2.765 lr:0.0000100 epoch_Time:25970.0min: [2024-01-04 22:04:22,756][model8_pretrain.py][INFO] Epoch:[0/2](495600/4588595) loss:2.854 lr:0.0000100 epoch_Time:25970.0min: [2024-01-04 22:04:22,756][model8_pretrain.py][INFO] Epoch:[0/2](495600/4588595) loss:2.441 lr:0.0000100 epoch_Time:25970.0min: [2024-01-04 22:04:22,756][model8_pretrain.py][INFO] Epoch:[0/2](495600/4588595) loss:2.701 lr:0.0000100 epoch_Time:25970.0min: [2024-01-04 
22:04:22,756][model8_pretrain.py][INFO] Epoch:[0/2](495600/4588595) loss:3.205 lr:0.0000100 epoch_Time:25970.0min: [2024-01-04 22:04:22,756][model8_pretrain.py][INFO] Epoch:[0/2](495600/4588595) loss:2.975 lr:0.0000100 epoch_Time:25970.0min: [2024-01-04 22:05:10,111][model8_pretrain.py][INFO] Epoch:[0/2](495700/4588595) loss:3.251 lr:0.0000100 epoch_Time:25971.0min: [2024-01-04 22:05:10,111][model8_pretrain.py][INFO] Epoch:[0/2](495700/4588595) loss:3.078 lr:0.0000100 epoch_Time:25971.0min: [2024-01-04 22:05:10,111][model8_pretrain.py][INFO] Epoch:[0/2](495700/4588595) loss:2.817 lr:0.0000100 epoch_Time:25971.0min: [2024-01-04 22:05:10,111][model8_pretrain.py][INFO] Epoch:[0/2](495700/4588595) loss:3.128 lr:0.0000100 epoch_Time:25971.0min: [2024-01-04 22:05:10,111][model8_pretrain.py][INFO] Epoch:[0/2](495700/4588595) loss:2.428 lr:0.0000100 epoch_Time:25971.0min: [2024-01-04 22:05:10,111][model8_pretrain.py][INFO] Epoch:[0/2](495700/4588595) loss:2.387 lr:0.0000100 epoch_Time:25971.0min: [2024-01-04 22:05:10,111][model8_pretrain.py][INFO] Epoch:[0/2](495700/4588595) loss:2.081 lr:0.0000100 epoch_Time:25971.0min: [2024-01-04 22:05:10,112][model8_pretrain.py][INFO] Epoch:[0/2](495700/4588595) loss:2.595 lr:0.0000100 epoch_Time:25971.0min: [2024-01-04 22:05:47,022][model8_pretrain.py][INFO] Epoch:[0/2](495800/4588595) loss:2.490 lr:0.0000100 epoch_Time:25970.0min: [2024-01-04 22:05:47,022][model8_pretrain.py][INFO] Epoch:[0/2](495800/4588595) loss:2.638 lr:0.0000100 epoch_Time:25970.0min: [2024-01-04 22:05:47,022][model8_pretrain.py][INFO] Epoch:[0/2](495800/4588595) loss:2.806 lr:0.0000100 epoch_Time:25970.0min: [2024-01-04 22:05:47,022][model8_pretrain.py][INFO] Epoch:[0/2](495800/4588595) loss:3.224 lr:0.0000100 epoch_Time:25970.0min: [2024-01-04 22:05:47,022][model8_pretrain.py][INFO] Epoch:[0/2](495800/4588595) loss:3.320 lr:0.0000100 epoch_Time:25970.0min: [2024-01-04 22:05:47,022][model8_pretrain.py][INFO] Epoch:[0/2](495800/4588595) loss:2.576 lr:0.0000100 
epoch_Time:25970.0min: [2024-01-04 22:05:47,022][model8_pretrain.py][INFO] Epoch:[0/2](495800/4588595) loss:2.324 lr:0.0000100 epoch_Time:25970.0min: [2024-01-04 22:05:47,023][model8_pretrain.py][INFO] Epoch:[0/2](495800/4588595) loss:2.781 lr:0.0000100 epoch_Time:25970.0min: [2024-01-04 22:06:23,965][model8_pretrain.py][INFO] Epoch:[0/2](495900/4588595) loss:2.693 lr:0.0000100 epoch_Time:25969.0min: [2024-01-04 22:06:23,965][model8_pretrain.py][INFO] Epoch:[0/2](495900/4588595) loss:2.340 lr:0.0000100 epoch_Time:25969.0min: [2024-01-04 22:06:23,965][model8_pretrain.py][INFO] Epoch:[0/2](495900/4588595) loss:3.424 lr:0.0000100 epoch_Time:25969.0min: [2024-01-04 22:06:23,965][model8_pretrain.py][INFO] Epoch:[0/2](495900/4588595) loss:2.957 lr:0.0000100 epoch_Time:25969.0min: [2024-01-04 22:06:23,965][model8_pretrain.py][INFO] Epoch:[0/2](495900/4588595) loss:2.822 lr:0.0000100 epoch_Time:25969.0min: [2024-01-04 22:06:23,965][model8_pretrain.py][INFO] Epoch:[0/2](495900/4588595) loss:2.767 lr:0.0000100 epoch_Time:25969.0min: [2024-01-04 22:06:23,965][model8_pretrain.py][INFO] Epoch:[0/2](495900/4588595) loss:2.962 lr:0.0000100 epoch_Time:25969.0min: [2024-01-04 22:06:23,965][model8_pretrain.py][INFO] Epoch:[0/2](495900/4588595) loss:2.909 lr:0.0000100 epoch_Time:25969.0min: [2024-01-04 22:07:00,895][model8_pretrain.py][INFO] Epoch:[0/2](496000/4588595) loss:3.237 lr:0.0000100 epoch_Time:25968.0min: [2024-01-04 22:07:00,895][model8_pretrain.py][INFO] Epoch:[0/2](496000/4588595) loss:2.879 lr:0.0000100 epoch_Time:25968.0min: [2024-01-04 22:07:00,895][model8_pretrain.py][INFO] Epoch:[0/2](496000/4588595) loss:2.724 lr:0.0000100 epoch_Time:25968.0min: [2024-01-04 22:07:00,895][model8_pretrain.py][INFO] Epoch:[0/2](496000/4588595) loss:3.041 lr:0.0000100 epoch_Time:25968.0min: [2024-01-04 22:07:00,895][model8_pretrain.py][INFO] Epoch:[0/2](496000/4588595) loss:3.267 lr:0.0000100 epoch_Time:25968.0min: [2024-01-04 22:07:00,895][model8_pretrain.py][INFO] 
Epoch:[0/2](496000/4588595) loss:3.231 lr:0.0000100 epoch_Time:25968.0min: [2024-01-04 22:07:00,895][model8_pretrain.py][INFO] Epoch:[0/2](496000/4588595) loss:3.162 lr:0.0000100 epoch_Time:25968.0min: [2024-01-04 22:07:00,895][model8_pretrain.py][INFO] Epoch:[0/2](496000/4588595) loss:2.861 lr:0.0000100 epoch_Time:25968.0min: [2024-01-04 22:07:37,831][model8_pretrain.py][INFO] Epoch:[0/2](496100/4588595) loss:3.156 lr:0.0000100 epoch_Time:25968.0min: [2024-01-04 22:07:37,831][model8_pretrain.py][INFO] Epoch:[0/2](496100/4588595) loss:2.398 lr:0.0000100 epoch_Time:25968.0min: [2024-01-04 22:07:37,831][model8_pretrain.py][INFO] Epoch:[0/2](496100/4588595) loss:2.990 lr:0.0000100 epoch_Time:25968.0min: [2024-01-04 22:07:37,831][model8_pretrain.py][INFO] Epoch:[0/2](496100/4588595) loss:3.197 lr:0.0000100 epoch_Time:25968.0min: [2024-01-04 22:07:37,831][model8_pretrain.py][INFO] Epoch:[0/2](496100/4588595) loss:2.470 lr:0.0000100 epoch_Time:25968.0min: [2024-01-04 22:07:37,831][model8_pretrain.py][INFO] Epoch:[0/2](496100/4588595) loss:3.264 lr:0.0000100 epoch_Time:25968.0min: [2024-01-04 22:07:37,831][model8_pretrain.py][INFO] Epoch:[0/2](496100/4588595) loss:2.971 lr:0.0000100 epoch_Time:25968.0min: [2024-01-04 22:07:37,831][model8_pretrain.py][INFO] Epoch:[0/2](496100/4588595) loss:2.547 lr:0.0000100 epoch_Time:25968.0min: [2024-01-04 22:08:14,784][model8_pretrain.py][INFO] Epoch:[0/2](496200/4588595) loss:3.296 lr:0.0000100 epoch_Time:25967.0min: [2024-01-04 22:08:14,784][model8_pretrain.py][INFO] Epoch:[0/2](496200/4588595) loss:2.786 lr:0.0000100 epoch_Time:25967.0min: [2024-01-04 22:08:14,784][model8_pretrain.py][INFO] Epoch:[0/2](496200/4588595) loss:2.864 lr:0.0000100 epoch_Time:25967.0min: [2024-01-04 22:08:14,784][model8_pretrain.py][INFO] Epoch:[0/2](496200/4588595) loss:2.448 lr:0.0000100 epoch_Time:25967.0min: [2024-01-04 22:08:14,784][model8_pretrain.py][INFO] Epoch:[0/2](496200/4588595) loss:3.188 lr:0.0000100 epoch_Time:25967.0min: [2024-01-04 
22:08:14,785][model8_pretrain.py][INFO] Epoch:[0/2](496200/4588595) loss:3.069 lr:0.0000100 epoch_Time:25967.0min: [2024-01-04 22:08:14,785][model8_pretrain.py][INFO] Epoch:[0/2](496200/4588595) loss:3.053 lr:0.0000100 epoch_Time:25967.0min: [2024-01-04 22:08:14,785][model8_pretrain.py][INFO] Epoch:[0/2](496200/4588595) loss:2.731 lr:0.0000100 epoch_Time:25967.0min: [2024-01-04 22:08:51,711][model8_pretrain.py][INFO] Epoch:[0/2](496300/4588595) loss:2.466 lr:0.0000100 epoch_Time:25965.0min: [2024-01-04 22:08:51,711][model8_pretrain.py][INFO] Epoch:[0/2](496300/4588595) loss:3.066 lr:0.0000100 epoch_Time:25965.0min: [2024-01-04 22:08:51,711][model8_pretrain.py][INFO] Epoch:[0/2](496300/4588595) loss:2.875 lr:0.0000100 epoch_Time:25965.0min: [2024-01-04 22:08:51,711][model8_pretrain.py][INFO] Epoch:[0/2](496300/4588595) loss:2.689 lr:0.0000100 epoch_Time:25965.0min: [2024-01-04 22:08:51,711][model8_pretrain.py][INFO] Epoch:[0/2](496300/4588595) loss:3.078 lr:0.0000100 epoch_Time:25965.0min: [2024-01-04 22:08:51,711][model8_pretrain.py][INFO] Epoch:[0/2](496300/4588595) loss:2.966 lr:0.0000100 epoch_Time:25965.0min: [2024-01-04 22:08:51,711][model8_pretrain.py][INFO] Epoch:[0/2](496300/4588595) loss:3.016 lr:0.0000100 epoch_Time:25965.0min: [2024-01-04 22:08:51,711][model8_pretrain.py][INFO] Epoch:[0/2](496300/4588595) loss:2.879 lr:0.0000100 epoch_Time:25965.0min: [2024-01-04 22:09:28,633][model8_pretrain.py][INFO] Epoch:[0/2](496400/4588595) loss:2.598 lr:0.0000100 epoch_Time:25965.0min: [2024-01-04 22:09:28,633][model8_pretrain.py][INFO] Epoch:[0/2](496400/4588595) loss:2.522 lr:0.0000100 epoch_Time:25965.0min: [2024-01-04 22:09:28,633][model8_pretrain.py][INFO] Epoch:[0/2](496400/4588595) loss:3.063 lr:0.0000100 epoch_Time:25965.0min: [2024-01-04 22:09:28,633][model8_pretrain.py][INFO] Epoch:[0/2](496400/4588595) loss:2.673 lr:0.0000100 epoch_Time:25965.0min: [2024-01-04 22:09:28,633][model8_pretrain.py][INFO] Epoch:[0/2](496400/4588595) loss:2.717 lr:0.0000100 
epoch_Time:25965.0min: [2024-01-04 22:09:28,633][model8_pretrain.py][INFO] Epoch:[0/2](496400/4588595) loss:2.883 lr:0.0000100 epoch_Time:25965.0min: [2024-01-04 22:09:28,633][model8_pretrain.py][INFO] Epoch:[0/2](496400/4588595) loss:2.551 lr:0.0000100 epoch_Time:25965.0min: [2024-01-04 22:09:28,633][model8_pretrain.py][INFO] Epoch:[0/2](496400/4588595) loss:2.914 lr:0.0000100 epoch_Time:25965.0min: [2024-01-04 22:10:15,992][model8_pretrain.py][INFO] Epoch:[0/2](496500/4588595) loss:3.330 lr:0.0000100 epoch_Time:25966.0min: [2024-01-04 22:10:15,992][model8_pretrain.py][INFO] Epoch:[0/2](496500/4588595) loss:3.002 lr:0.0000100 epoch_Time:25966.0min: [2024-01-04 22:10:15,992][model8_pretrain.py][INFO] Epoch:[0/2](496500/4588595) loss:1.727 lr:0.0000100 epoch_Time:25966.0min: [2024-01-04 22:10:15,992][model8_pretrain.py][INFO] Epoch:[0/2](496500/4588595) loss:2.478 lr:0.0000100 epoch_Time:25966.0min: [2024-01-04 22:10:15,992][model8_pretrain.py][INFO] Epoch:[0/2](496500/4588595) loss:3.264 lr:0.0000100 epoch_Time:25966.0min: [2024-01-04 22:10:15,992][model8_pretrain.py][INFO] Epoch:[0/2](496500/4588595) loss:2.563 lr:0.0000100 epoch_Time:25966.0min: [2024-01-04 22:10:15,992][model8_pretrain.py][INFO] Epoch:[0/2](496500/4588595) loss:2.657 lr:0.0000100 epoch_Time:25966.0min: [2024-01-04 22:10:15,992][model8_pretrain.py][INFO] Epoch:[0/2](496500/4588595) loss:2.685 lr:0.0000100 epoch_Time:25966.0min: [2024-01-04 22:10:52,934][model8_pretrain.py][INFO] Epoch:[0/2](496600/4588595) loss:2.919 lr:0.0000100 epoch_Time:25965.0min: [2024-01-04 22:10:52,934][model8_pretrain.py][INFO] Epoch:[0/2](496600/4588595) loss:3.188 lr:0.0000100 epoch_Time:25965.0min: [2024-01-04 22:10:52,934][model8_pretrain.py][INFO] Epoch:[0/2](496600/4588595) loss:2.838 lr:0.0000100 epoch_Time:25965.0min: [2024-01-04 22:10:52,934][model8_pretrain.py][INFO] Epoch:[0/2](496600/4588595) loss:2.790 lr:0.0000100 epoch_Time:25965.0min: [2024-01-04 22:10:52,934][model8_pretrain.py][INFO] 
Epoch:[0/2](496600/4588595) loss:3.018 lr:0.0000100 epoch_Time:25965.0min: [2024-01-04 22:10:52,934][model8_pretrain.py][INFO] Epoch:[0/2](496600/4588595) loss:2.683 lr:0.0000100 epoch_Time:25965.0min: [2024-01-04 22:10:52,934][model8_pretrain.py][INFO] Epoch:[0/2](496600/4588595) loss:2.832 lr:0.0000100 epoch_Time:25965.0min: [2024-01-04 22:10:52,934][model8_pretrain.py][INFO] Epoch:[0/2](496600/4588595) loss:2.736 lr:0.0000100 epoch_Time:25965.0min: [2024-01-04 22:11:29,885][model8_pretrain.py][INFO] Epoch:[0/2](496700/4588595) loss:2.909 lr:0.0000100 epoch_Time:25964.0min: [2024-01-04 22:11:29,885][model8_pretrain.py][INFO] Epoch:[0/2](496700/4588595) loss:2.533 lr:0.0000100 epoch_Time:25964.0min: [2024-01-04 22:11:29,885][model8_pretrain.py][INFO] Epoch:[0/2](496700/4588595) loss:3.314 lr:0.0000100 epoch_Time:25964.0min: [2024-01-04 22:11:29,885][model8_pretrain.py][INFO] Epoch:[0/2](496700/4588595) loss:2.858 lr:0.0000100 epoch_Time:25964.0min: [2024-01-04 22:11:29,885][model8_pretrain.py][INFO] Epoch:[0/2](496700/4588595) loss:2.596 lr:0.0000100 epoch_Time:25964.0min: [2024-01-04 22:11:29,885][model8_pretrain.py][INFO] Epoch:[0/2](496700/4588595) loss:3.075 lr:0.0000100 epoch_Time:25964.0min: [2024-01-04 22:11:29,885][model8_pretrain.py][INFO] Epoch:[0/2](496700/4588595) loss:3.195 lr:0.0000100 epoch_Time:25964.0min: [2024-01-04 22:11:29,885][model8_pretrain.py][INFO] Epoch:[0/2](496700/4588595) loss:3.079 lr:0.0000100 epoch_Time:25964.0min: [2024-01-04 22:12:06,790][model8_pretrain.py][INFO] Epoch:[0/2](496800/4588595) loss:2.983 lr:0.0000100 epoch_Time:25963.0min: [2024-01-04 22:12:06,790][model8_pretrain.py][INFO] Epoch:[0/2](496800/4588595) loss:2.859 lr:0.0000100 epoch_Time:25963.0min: [2024-01-04 22:12:06,790][model8_pretrain.py][INFO] Epoch:[0/2](496800/4588595) loss:3.018 lr:0.0000100 epoch_Time:25963.0min: [2024-01-04 22:12:06,790][model8_pretrain.py][INFO] Epoch:[0/2](496800/4588595) loss:3.015 lr:0.0000100 epoch_Time:25963.0min: [2024-01-04 
22:12:06,790][model8_pretrain.py][INFO] Epoch:[0/2](496800/4588595) loss:3.219 lr:0.0000100 epoch_Time:25963.0min: [2024-01-04 22:12:06,790][model8_pretrain.py][INFO] Epoch:[0/2](496800/4588595) loss:2.451 lr:0.0000100 epoch_Time:25963.0min: [2024-01-04 22:12:06,790][model8_pretrain.py][INFO] Epoch:[0/2](496800/4588595) loss:2.464 lr:0.0000100 epoch_Time:25963.0min: [2024-01-04 22:12:06,790][model8_pretrain.py][INFO] Epoch:[0/2](496800/4588595) loss:3.119 lr:0.0000100 epoch_Time:25963.0min: [2024-01-04 22:12:43,723][model8_pretrain.py][INFO] Epoch:[0/2](496900/4588595) loss:3.072 lr:0.0000100 epoch_Time:25963.0min: [2024-01-04 22:12:43,723][model8_pretrain.py][INFO] Epoch:[0/2](496900/4588595) loss:2.361 lr:0.0000100 epoch_Time:25963.0min: [2024-01-04 22:12:43,723][model8_pretrain.py][INFO] Epoch:[0/2](496900/4588595) loss:2.610 lr:0.0000100 epoch_Time:25963.0min: [2024-01-04 22:12:43,724][model8_pretrain.py][INFO] Epoch:[0/2](496900/4588595) loss:2.634 lr:0.0000100 epoch_Time:25963.0min: [2024-01-04 22:12:43,724][model8_pretrain.py][INFO] Epoch:[0/2](496900/4588595) loss:3.175 lr:0.0000100 epoch_Time:25963.0min: [2024-01-04 22:12:43,724][model8_pretrain.py][INFO] Epoch:[0/2](496900/4588595) loss:3.053 lr:0.0000100 epoch_Time:25963.0min: [2024-01-04 22:12:43,724][model8_pretrain.py][INFO] Epoch:[0/2](496900/4588595) loss:2.723 lr:0.0000100 epoch_Time:25963.0min: [2024-01-04 22:12:43,724][model8_pretrain.py][INFO] Epoch:[0/2](496900/4588595) loss:2.936 lr:0.0000100 epoch_Time:25963.0min: [2024-01-04 22:13:20,648][model8_pretrain.py][INFO] Epoch:[0/2](497000/4588595) loss:2.374 lr:0.0000100 epoch_Time:25962.0min: [2024-01-04 22:13:20,648][model8_pretrain.py][INFO] Epoch:[0/2](497000/4588595) loss:2.369 lr:0.0000100 epoch_Time:25962.0min: [2024-01-04 22:13:20,648][model8_pretrain.py][INFO] Epoch:[0/2](497000/4588595) loss:3.103 lr:0.0000100 epoch_Time:25962.0min: [2024-01-04 22:13:20,648][model8_pretrain.py][INFO] Epoch:[0/2](497000/4588595) loss:2.911 lr:0.0000100 
epoch_Time:25962.0min: [2024-01-04 22:13:20,648][model8_pretrain.py][INFO] Epoch:[0/2](497000/4588595) loss:2.905 lr:0.0000100 epoch_Time:25962.0min: [2024-01-04 22:13:20,648][model8_pretrain.py][INFO] Epoch:[0/2](497000/4588595) loss:3.130 lr:0.0000100 epoch_Time:25962.0min: [2024-01-04 22:13:20,649][model8_pretrain.py][INFO] Epoch:[0/2](497000/4588595) loss:2.487 lr:0.0000100 epoch_Time:25962.0min: [2024-01-04 22:13:20,649][model8_pretrain.py][INFO] Epoch:[0/2](497000/4588595) loss:3.339 lr:0.0000100 epoch_Time:25962.0min: [2024-01-04 22:13:57,575][model8_pretrain.py][INFO] Epoch:[0/2](497100/4588595) loss:2.685 lr:0.0000100 epoch_Time:25961.0min: [2024-01-04 22:13:57,575][model8_pretrain.py][INFO] Epoch:[0/2](497100/4588595) loss:2.913 lr:0.0000100 epoch_Time:25961.0min: [2024-01-04 22:13:57,575][model8_pretrain.py][INFO] Epoch:[0/2](497100/4588595) loss:2.890 lr:0.0000100 epoch_Time:25961.0min: [2024-01-04 22:13:57,575][model8_pretrain.py][INFO] Epoch:[0/2](497100/4588595) loss:2.411 lr:0.0000100 epoch_Time:25961.0min: [2024-01-04 22:13:57,575][model8_pretrain.py][INFO] Epoch:[0/2](497100/4588595) loss:2.985 lr:0.0000100 epoch_Time:25961.0min: [2024-01-04 22:13:57,575][model8_pretrain.py][INFO] Epoch:[0/2](497100/4588595) loss:2.191 lr:0.0000100 epoch_Time:25961.0min: [2024-01-04 22:13:57,575][model8_pretrain.py][INFO] Epoch:[0/2](497100/4588595) loss:2.694 lr:0.0000100 epoch_Time:25961.0min: [2024-01-04 22:13:57,575][model8_pretrain.py][INFO] Epoch:[0/2](497100/4588595) loss:2.804 lr:0.0000100 epoch_Time:25961.0min: [2024-01-04 22:14:34,521][model8_pretrain.py][INFO] Epoch:[0/2](497200/4588595) loss:2.716 lr:0.0000100 epoch_Time:25961.0min: [2024-01-04 22:14:34,521][model8_pretrain.py][INFO] Epoch:[0/2](497200/4588595) loss:2.740 lr:0.0000100 epoch_Time:25961.0min: [2024-01-04 22:14:34,521][model8_pretrain.py][INFO] Epoch:[0/2](497200/4588595) loss:3.650 lr:0.0000100 epoch_Time:25961.0min: [2024-01-04 22:14:34,521][model8_pretrain.py][INFO] 
Epoch:[0/2](497200/4588595) loss:2.596 lr:0.0000100 epoch_Time:25961.0min: [2024-01-04 22:14:34,521][model8_pretrain.py][INFO] Epoch:[0/2](497200/4588595) loss:2.630 lr:0.0000100 epoch_Time:25961.0min: [2024-01-04 22:14:34,521][model8_pretrain.py][INFO] Epoch:[0/2](497200/4588595) loss:2.852 lr:0.0000100 epoch_Time:25961.0min: [2024-01-04 22:14:34,521][model8_pretrain.py][INFO] Epoch:[0/2](497200/4588595) loss:2.905 lr:0.0000100 epoch_Time:25961.0min: [2024-01-04 22:14:34,521][model8_pretrain.py][INFO] Epoch:[0/2](497200/4588595) loss:3.227 lr:0.0000100 epoch_Time:25961.0min: [2024-01-04 22:15:21,885][model8_pretrain.py][INFO] Epoch:[0/2](497300/4588595) loss:3.100 lr:0.0000100 epoch_Time:25961.0min: [2024-01-04 22:15:21,886][model8_pretrain.py][INFO] Epoch:[0/2](497300/4588595) loss:2.943 lr:0.0000100 epoch_Time:25961.0min: [2024-01-04 22:15:21,886][model8_pretrain.py][INFO] Epoch:[0/2](497300/4588595) loss:2.734 lr:0.0000100 epoch_Time:25961.0min: [2024-01-04 22:15:21,886][model8_pretrain.py][INFO] Epoch:[0/2](497300/4588595) loss:3.128 lr:0.0000100 epoch_Time:25961.0min: [2024-01-04 22:15:21,886][model8_pretrain.py][INFO] Epoch:[0/2](497300/4588595) loss:2.921 lr:0.0000100 epoch_Time:25961.0min: [2024-01-04 22:15:21,886][model8_pretrain.py][INFO] Epoch:[0/2](497300/4588595) loss:2.646 lr:0.0000100 epoch_Time:25961.0min: [2024-01-04 22:15:21,886][model8_pretrain.py][INFO] Epoch:[0/2](497300/4588595) loss:2.906 lr:0.0000100 epoch_Time:25961.0min: [2024-01-04 22:15:21,886][model8_pretrain.py][INFO] Epoch:[0/2](497300/4588595) loss:2.811 lr:0.0000100 epoch_Time:25961.0min: [2024-01-04 22:15:58,819][model8_pretrain.py][INFO] Epoch:[0/2](497400/4588595) loss:2.962 lr:0.0000100 epoch_Time:25960.0min: [2024-01-04 22:15:58,820][model8_pretrain.py][INFO] Epoch:[0/2](497400/4588595) loss:3.151 lr:0.0000100 epoch_Time:25960.0min: [2024-01-04 22:15:58,819][model8_pretrain.py][INFO] Epoch:[0/2](497400/4588595) loss:3.015 lr:0.0000100 epoch_Time:25960.0min: [2024-01-04 
22:15:58,820][model8_pretrain.py][INFO] Epoch:[0/2](497400/4588595) loss:2.658 lr:0.0000100 epoch_Time:25960.0min: [2024-01-04 22:15:58,820][model8_pretrain.py][INFO] Epoch:[0/2](497400/4588595) loss:1.800 lr:0.0000100 epoch_Time:25960.0min: [2024-01-04 22:15:58,820][model8_pretrain.py][INFO] Epoch:[0/2](497400/4588595) loss:2.438 lr:0.0000100 epoch_Time:25960.0min: [2024-01-04 22:15:58,820][model8_pretrain.py][INFO] Epoch:[0/2](497400/4588595) loss:3.459 lr:0.0000100 epoch_Time:25960.0min: [2024-01-04 22:15:58,820][model8_pretrain.py][INFO] Epoch:[0/2](497400/4588595) loss:2.658 lr:0.0000100 epoch_Time:25960.0min: [2024-01-04 22:16:35,763][model8_pretrain.py][INFO] Epoch:[0/2](497500/4588595) loss:3.122 lr:0.0000100 epoch_Time:25960.0min: [2024-01-04 22:16:35,763][model8_pretrain.py][INFO] Epoch:[0/2](497500/4588595) loss:3.003 lr:0.0000100 epoch_Time:25960.0min: [2024-01-04 22:16:35,763][model8_pretrain.py][INFO] Epoch:[0/2](497500/4588595) loss:2.803 lr:0.0000100 epoch_Time:25960.0min: [2024-01-04 22:16:35,763][model8_pretrain.py][INFO] Epoch:[0/2](497500/4588595) loss:2.386 lr:0.0000100 epoch_Time:25960.0min: [2024-01-04 22:16:35,763][model8_pretrain.py][INFO] Epoch:[0/2](497500/4588595) loss:2.854 lr:0.0000100 epoch_Time:25960.0min: [2024-01-04 22:16:35,763][model8_pretrain.py][INFO] Epoch:[0/2](497500/4588595) loss:2.762 lr:0.0000100 epoch_Time:25960.0min: [2024-01-04 22:16:35,764][model8_pretrain.py][INFO] Epoch:[0/2](497500/4588595) loss:2.853 lr:0.0000100 epoch_Time:25960.0min: [2024-01-04 22:16:35,764][model8_pretrain.py][INFO] Epoch:[0/2](497500/4588595) loss:2.885 lr:0.0000100 epoch_Time:25960.0min: [2024-01-04 22:17:12,706][model8_pretrain.py][INFO] Epoch:[0/2](497600/4588595) loss:2.761 lr:0.0000100 epoch_Time:25958.0min: [2024-01-04 22:17:12,706][model8_pretrain.py][INFO] Epoch:[0/2](497600/4588595) loss:2.781 lr:0.0000100 epoch_Time:25958.0min: [2024-01-04 22:17:12,706][model8_pretrain.py][INFO] Epoch:[0/2](497600/4588595) loss:2.891 lr:0.0000100 
epoch_Time:25958.0min: [2024-01-04 22:17:12,706][model8_pretrain.py][INFO] Epoch:[0/2](497600/4588595) loss:3.088 lr:0.0000100 epoch_Time:25958.0min: [2024-01-04 22:17:12,706][model8_pretrain.py][INFO] Epoch:[0/2](497600/4588595) loss:2.742 lr:0.0000100 epoch_Time:25958.0min: [2024-01-04 22:17:12,706][model8_pretrain.py][INFO] Epoch:[0/2](497600/4588595) loss:2.409 lr:0.0000100 epoch_Time:25958.0min: [2024-01-04 22:17:12,706][model8_pretrain.py][INFO] Epoch:[0/2](497600/4588595) loss:3.160 lr:0.0000100 epoch_Time:25958.0min: [2024-01-04 22:17:12,707][model8_pretrain.py][INFO] Epoch:[0/2](497600/4588595) loss:2.846 lr:0.0000100 epoch_Time:25958.0min: [2024-01-04 22:17:49,647][model8_pretrain.py][INFO] Epoch:[0/2](497700/4588595) loss:2.567 lr:0.0000100 epoch_Time:25957.0min: [2024-01-04 22:17:49,647][model8_pretrain.py][INFO] Epoch:[0/2](497700/4588595) loss:2.803 lr:0.0000100 epoch_Time:25957.0min: [2024-01-04 22:17:49,647][model8_pretrain.py][INFO] Epoch:[0/2](497700/4588595) loss:2.630 lr:0.0000100 epoch_Time:25957.0min: [2024-01-04 22:17:49,647][model8_pretrain.py][INFO] Epoch:[0/2](497700/4588595) loss:2.839 lr:0.0000100 epoch_Time:25957.0min: [2024-01-04 22:17:49,647][model8_pretrain.py][INFO] Epoch:[0/2](497700/4588595) loss:2.174 lr:0.0000100 epoch_Time:25957.0min: [2024-01-04 22:17:49,647][model8_pretrain.py][INFO] Epoch:[0/2](497700/4588595) loss:3.288 lr:0.0000100 epoch_Time:25957.0min: [2024-01-04 22:17:49,647][model8_pretrain.py][INFO] Epoch:[0/2](497700/4588595) loss:2.999 lr:0.0000100 epoch_Time:25957.0min: [2024-01-04 22:17:49,647][model8_pretrain.py][INFO] Epoch:[0/2](497700/4588595) loss:2.990 lr:0.0000100 epoch_Time:25957.0min: [2024-01-04 22:18:26,586][model8_pretrain.py][INFO] Epoch:[0/2](497800/4588595) loss:2.562 lr:0.0000100 epoch_Time:25957.0min: [2024-01-04 22:18:26,586][model8_pretrain.py][INFO] Epoch:[0/2](497800/4588595) loss:2.669 lr:0.0000100 epoch_Time:25957.0min: [2024-01-04 22:18:26,586][model8_pretrain.py][INFO] 
Epoch:[0/2](497800/4588595) loss:2.569 lr:0.0000100 epoch_Time:25957.0min: [2024-01-04 22:18:26,586][model8_pretrain.py][INFO] Epoch:[0/2](497800/4588595) loss:2.207 lr:0.0000100 epoch_Time:25957.0min: [2024-01-04 22:18:26,586][model8_pretrain.py][INFO] Epoch:[0/2](497800/4588595) loss:2.496 lr:0.0000100 epoch_Time:25957.0min: [2024-01-04 22:18:26,586][model8_pretrain.py][INFO] Epoch:[0/2](497800/4588595) loss:2.563 lr:0.0000100 epoch_Time:25957.0min: [2024-01-04 22:18:26,586][model8_pretrain.py][INFO] Epoch:[0/2](497800/4588595) loss:2.891 lr:0.0000100 epoch_Time:25957.0min: [2024-01-04 22:18:26,586][model8_pretrain.py][INFO] Epoch:[0/2](497800/4588595) loss:2.864 lr:0.0000100 epoch_Time:25957.0min: [2024-01-04 22:19:03,536][model8_pretrain.py][INFO] Epoch:[0/2](497900/4588595) loss:2.675 lr:0.0000100 epoch_Time:25956.0min: [2024-01-04 22:19:03,536][model8_pretrain.py][INFO] Epoch:[0/2](497900/4588595) loss:3.482 lr:0.0000100 epoch_Time:25956.0min: [2024-01-04 22:19:03,536][model8_pretrain.py][INFO] Epoch:[0/2](497900/4588595) loss:2.343 lr:0.0000100 epoch_Time:25956.0min: [2024-01-04 22:19:03,536][model8_pretrain.py][INFO] Epoch:[0/2](497900/4588595) loss:3.381 lr:0.0000100 epoch_Time:25956.0min: [2024-01-04 22:19:03,536][model8_pretrain.py][INFO] Epoch:[0/2](497900/4588595) loss:3.146 lr:0.0000100 epoch_Time:25956.0min: [2024-01-04 22:19:03,536][model8_pretrain.py][INFO] Epoch:[0/2](497900/4588595) loss:2.766 lr:0.0000100 epoch_Time:25956.0min: [2024-01-04 22:19:03,536][model8_pretrain.py][INFO] Epoch:[0/2](497900/4588595) loss:2.814 lr:0.0000100 epoch_Time:25956.0min: [2024-01-04 22:19:03,537][model8_pretrain.py][INFO] Epoch:[0/2](497900/4588595) loss:2.747 lr:0.0000100 epoch_Time:25956.0min: [2024-01-04 22:19:40,469][model8_pretrain.py][INFO] Epoch:[0/2](498000/4588595) loss:2.775 lr:0.0000100 epoch_Time:25956.0min: [2024-01-04 22:19:40,469][model8_pretrain.py][INFO] Epoch:[0/2](498000/4588595) loss:2.735 lr:0.0000100 epoch_Time:25956.0min: [2024-01-04 
22:19:40,469][model8_pretrain.py][INFO] Epoch:[0/2](498000/4588595) loss:3.090 lr:0.0000100 epoch_Time:25956.0min: [2024-01-04 22:19:40,469][model8_pretrain.py][INFO] Epoch:[0/2](498000/4588595) loss:2.406 lr:0.0000100 epoch_Time:25956.0min: [2024-01-04 22:19:40,469][model8_pretrain.py][INFO] Epoch:[0/2](498000/4588595) loss:3.072 lr:0.0000100 epoch_Time:25956.0min: [2024-01-04 22:19:40,469][model8_pretrain.py][INFO] Epoch:[0/2](498000/4588595) loss:2.900 lr:0.0000100 epoch_Time:25956.0min: [2024-01-04 22:19:40,469][model8_pretrain.py][INFO] Epoch:[0/2](498000/4588595) loss:2.541 lr:0.0000100 epoch_Time:25956.0min: [2024-01-04 22:19:40,469][model8_pretrain.py][INFO] Epoch:[0/2](498000/4588595) loss:3.065 lr:0.0000100 epoch_Time:25956.0min: [2024-01-04 22:20:27,820][model8_pretrain.py][INFO] Epoch:[0/2](498100/4588595) loss:2.863 lr:0.0000100 epoch_Time:25956.0min: [2024-01-04 22:20:27,820][model8_pretrain.py][INFO] Epoch:[0/2](498100/4588595) loss:2.391 lr:0.0000100 epoch_Time:25956.0min: [2024-01-04 22:20:27,820][model8_pretrain.py][INFO] Epoch:[0/2](498100/4588595) loss:3.230 lr:0.0000100 epoch_Time:25956.0min: [2024-01-04 22:20:27,820][model8_pretrain.py][INFO] Epoch:[0/2](498100/4588595) loss:3.098 lr:0.0000100 epoch_Time:25956.0min: [2024-01-04 22:20:27,820][model8_pretrain.py][INFO] Epoch:[0/2](498100/4588595) loss:3.233 lr:0.0000100 epoch_Time:25956.0min: [2024-01-04 22:20:27,820][model8_pretrain.py][INFO] Epoch:[0/2](498100/4588595) loss:2.702 lr:0.0000100 epoch_Time:25956.0min: [2024-01-04 22:20:27,820][model8_pretrain.py][INFO] Epoch:[0/2](498100/4588595) loss:2.458 lr:0.0000100 epoch_Time:25956.0min: [2024-01-04 22:20:27,820][model8_pretrain.py][INFO] Epoch:[0/2](498100/4588595) loss:2.543 lr:0.0000100 epoch_Time:25956.0min: [2024-01-04 22:21:04,771][model8_pretrain.py][INFO] Epoch:[0/2](498200/4588595) loss:3.134 lr:0.0000100 epoch_Time:25955.0min: [2024-01-04 22:21:04,771][model8_pretrain.py][INFO] Epoch:[0/2](498200/4588595) loss:2.879 lr:0.0000100 
epoch_Time:25955.0min: [2024-01-04 22:21:04,771][model8_pretrain.py][INFO] Epoch:[0/2](498200/4588595) loss:2.999 lr:0.0000100 epoch_Time:25955.0min: [2024-01-04 22:21:04,771][model8_pretrain.py][INFO] Epoch:[0/2](498200/4588595) loss:3.083 lr:0.0000100 epoch_Time:25955.0min: [2024-01-04 22:21:04,771][model8_pretrain.py][INFO] Epoch:[0/2](498200/4588595) loss:2.846 lr:0.0000100 epoch_Time:25955.0min: [2024-01-04 22:21:04,771][model8_pretrain.py][INFO] Epoch:[0/2](498200/4588595) loss:3.378 lr:0.0000100 epoch_Time:25955.0min: [2024-01-04 22:21:04,771][model8_pretrain.py][INFO] Epoch:[0/2](498200/4588595) loss:2.717 lr:0.0000100 epoch_Time:25955.0min: [2024-01-04 22:21:04,771][model8_pretrain.py][INFO] Epoch:[0/2](498200/4588595) loss:2.983 lr:0.0000100 epoch_Time:25955.0min: [2024-01-04 22:21:41,709][model8_pretrain.py][INFO] Epoch:[0/2](498300/4588595) loss:2.068 lr:0.0000100 epoch_Time:25955.0min: [2024-01-04 22:21:41,709][model8_pretrain.py][INFO] Epoch:[0/2](498300/4588595) loss:3.103 lr:0.0000100 epoch_Time:25955.0min: [2024-01-04 22:21:41,709][model8_pretrain.py][INFO] Epoch:[0/2](498300/4588595) loss:2.362 lr:0.0000100 epoch_Time:25955.0min: [2024-01-04 22:21:41,709][model8_pretrain.py][INFO] Epoch:[0/2](498300/4588595) loss:2.529 lr:0.0000100 epoch_Time:25955.0min: [2024-01-04 22:21:41,709][model8_pretrain.py][INFO] Epoch:[0/2](498300/4588595) loss:2.996 lr:0.0000100 epoch_Time:25955.0min: [2024-01-04 22:21:41,709][model8_pretrain.py][INFO] Epoch:[0/2](498300/4588595) loss:2.799 lr:0.0000100 epoch_Time:25955.0min: [2024-01-04 22:21:41,709][model8_pretrain.py][INFO] Epoch:[0/2](498300/4588595) loss:2.528 lr:0.0000100 epoch_Time:25955.0min: [2024-01-04 22:21:41,710][model8_pretrain.py][INFO] Epoch:[0/2](498300/4588595) loss:2.829 lr:0.0000100 epoch_Time:25955.0min: [2024-01-04 22:22:18,652][model8_pretrain.py][INFO] Epoch:[0/2](498400/4588595) loss:2.835 lr:0.0000100 epoch_Time:25954.0min: [2024-01-04 22:22:18,652][model8_pretrain.py][INFO] 
Epoch:[0/2](498400/4588595) loss:2.900 lr:0.0000100 epoch_Time:25954.0min: [2024-01-04 22:22:18,652][model8_pretrain.py][INFO] Epoch:[0/2](498400/4588595) loss:3.225 lr:0.0000100 epoch_Time:25954.0min: [2024-01-04 22:22:18,652][model8_pretrain.py][INFO] Epoch:[0/2](498400/4588595) loss:3.334 lr:0.0000100 epoch_Time:25954.0min: [2024-01-04 22:22:18,652][model8_pretrain.py][INFO] Epoch:[0/2](498400/4588595) loss:3.063 lr:0.0000100 epoch_Time:25954.0min: [2024-01-04 22:22:18,652][model8_pretrain.py][INFO] Epoch:[0/2](498400/4588595) loss:2.692 lr:0.0000100 epoch_Time:25954.0min: [2024-01-04 22:22:18,652][model8_pretrain.py][INFO] Epoch:[0/2](498400/4588595) loss:3.133 lr:0.0000100 epoch_Time:25954.0min: [2024-01-04 22:22:18,652][model8_pretrain.py][INFO] Epoch:[0/2](498400/4588595) loss:2.893 lr:0.0000100 epoch_Time:25954.0min: [2024-01-04 22:22:55,590][model8_pretrain.py][INFO] Epoch:[0/2](498500/4588595) loss:2.948 lr:0.0000100 epoch_Time:25952.0min: [2024-01-04 22:22:55,590][model8_pretrain.py][INFO] Epoch:[0/2](498500/4588595) loss:2.836 lr:0.0000100 epoch_Time:25952.0min: [2024-01-04 22:22:55,590][model8_pretrain.py][INFO] Epoch:[0/2](498500/4588595) loss:3.017 lr:0.0000100 epoch_Time:25952.0min: [2024-01-04 22:22:55,590][model8_pretrain.py][INFO] Epoch:[0/2](498500/4588595) loss:2.915 lr:0.0000100 epoch_Time:25952.0min: [2024-01-04 22:22:55,590][model8_pretrain.py][INFO] Epoch:[0/2](498500/4588595) loss:3.308 lr:0.0000100 epoch_Time:25952.0min: [2024-01-04 22:22:55,590][model8_pretrain.py][INFO] Epoch:[0/2](498500/4588595) loss:2.586 lr:0.0000100 epoch_Time:25952.0min: [2024-01-04 22:22:55,590][model8_pretrain.py][INFO] Epoch:[0/2](498500/4588595) loss:2.728 lr:0.0000100 epoch_Time:25952.0min: [2024-01-04 22:22:55,591][model8_pretrain.py][INFO] Epoch:[0/2](498500/4588595) loss:2.991 lr:0.0000100 epoch_Time:25952.0min: [2024-01-04 22:23:32,532][model8_pretrain.py][INFO] Epoch:[0/2](498600/4588595) loss:2.820 lr:0.0000100 epoch_Time:25952.0min: [2024-01-04 
22:23:32,533][model8_pretrain.py][INFO] Epoch:[0/2](498600/4588595) loss:2.261 lr:0.0000100 epoch_Time:25952.0min: [2024-01-04 22:23:32,533][model8_pretrain.py][INFO] Epoch:[0/2](498600/4588595) loss:3.011 lr:0.0000100 epoch_Time:25952.0min: [2024-01-04 22:23:32,533][model8_pretrain.py][INFO] Epoch:[0/2](498600/4588595) loss:2.695 lr:0.0000100 epoch_Time:25952.0min: [2024-01-04 22:23:32,533][model8_pretrain.py][INFO] Epoch:[0/2](498600/4588595) loss:3.269 lr:0.0000100 epoch_Time:25952.0min: [2024-01-04 22:23:32,533][model8_pretrain.py][INFO] Epoch:[0/2](498600/4588595) loss:3.158 lr:0.0000100 epoch_Time:25952.0min: [2024-01-04 22:23:32,533][model8_pretrain.py][INFO] Epoch:[0/2](498600/4588595) loss:3.319 lr:0.0000100 epoch_Time:25952.0min: [2024-01-04 22:23:32,533][model8_pretrain.py][INFO] Epoch:[0/2](498600/4588595) loss:2.881 lr:0.0000100 epoch_Time:25952.0min: [2024-01-04 22:24:09,460][model8_pretrain.py][INFO] Epoch:[0/2](498700/4588595) loss:2.987 lr:0.0000100 epoch_Time:25951.0min: [2024-01-04 22:24:09,460][model8_pretrain.py][INFO] Epoch:[0/2](498700/4588595) loss:3.028 lr:0.0000100 epoch_Time:25951.0min: [2024-01-04 22:24:09,460][model8_pretrain.py][INFO] Epoch:[0/2](498700/4588595) loss:3.062 lr:0.0000100 epoch_Time:25951.0min: [2024-01-04 22:24:09,460][model8_pretrain.py][INFO] Epoch:[0/2](498700/4588595) loss:3.450 lr:0.0000100 epoch_Time:25951.0min: [2024-01-04 22:24:09,460][model8_pretrain.py][INFO] Epoch:[0/2](498700/4588595) loss:2.538 lr:0.0000100 epoch_Time:25951.0min: [2024-01-04 22:24:09,460][model8_pretrain.py][INFO] Epoch:[0/2](498700/4588595) loss:2.197 lr:0.0000100 epoch_Time:25951.0min: [2024-01-04 22:24:09,462][model8_pretrain.py][INFO] Epoch:[0/2](498700/4588595) loss:2.804 lr:0.0000100 epoch_Time:25951.0min: [2024-01-04 22:24:09,462][model8_pretrain.py][INFO] Epoch:[0/2](498700/4588595) loss:3.484 lr:0.0000100 epoch_Time:25951.0min: [2024-01-04 22:24:46,377][model8_pretrain.py][INFO] Epoch:[0/2](498800/4588595) loss:2.630 lr:0.0000100 
epoch_Time:25951.0min: [2024-01-04 22:24:46,377][model8_pretrain.py][INFO] Epoch:[0/2](498800/4588595) loss:2.712 lr:0.0000100 epoch_Time:25951.0min: [2024-01-04 22:24:46,377][model8_pretrain.py][INFO] Epoch:[0/2](498800/4588595) loss:2.891 lr:0.0000100 epoch_Time:25951.0min: [2024-01-04 22:24:46,377][model8_pretrain.py][INFO] Epoch:[0/2](498800/4588595) loss:3.172 lr:0.0000100 epoch_Time:25951.0min: [2024-01-04 22:24:46,377][model8_pretrain.py][INFO] Epoch:[0/2](498800/4588595) loss:2.782 lr:0.0000100 epoch_Time:25951.0min: [2024-01-04 22:24:46,377][model8_pretrain.py][INFO] Epoch:[0/2](498800/4588595) loss:3.053 lr:0.0000100 epoch_Time:25951.0min: [2024-01-04 22:24:46,377][model8_pretrain.py][INFO] Epoch:[0/2](498800/4588595) loss:3.231 lr:0.0000100 epoch_Time:25951.0min: [2024-01-04 22:24:46,378][model8_pretrain.py][INFO] Epoch:[0/2](498800/4588595) loss:2.911 lr:0.0000100 epoch_Time:25951.0min: [2024-01-04 22:25:33,788][model8_pretrain.py][INFO] Epoch:[0/2](498900/4588595) loss:3.004 lr:0.0000100 epoch_Time:25951.0min: [2024-01-04 22:25:33,788][model8_pretrain.py][INFO] Epoch:[0/2](498900/4588595) loss:2.507 lr:0.0000100 epoch_Time:25951.0min: [2024-01-04 22:25:33,788][model8_pretrain.py][INFO] Epoch:[0/2](498900/4588595) loss:2.771 lr:0.0000100 epoch_Time:25951.0min: [2024-01-04 22:25:33,788][model8_pretrain.py][INFO] Epoch:[0/2](498900/4588595) loss:3.017 lr:0.0000100 epoch_Time:25951.0min: [2024-01-04 22:25:33,788][model8_pretrain.py][INFO] Epoch:[0/2](498900/4588595) loss:2.549 lr:0.0000100 epoch_Time:25951.0min: [2024-01-04 22:25:33,788][model8_pretrain.py][INFO] Epoch:[0/2](498900/4588595) loss:3.373 lr:0.0000100 epoch_Time:25951.0min: [2024-01-04 22:25:33,788][model8_pretrain.py][INFO] Epoch:[0/2](498900/4588595) loss:2.914 lr:0.0000100 epoch_Time:25951.0min: [2024-01-04 22:25:33,789][model8_pretrain.py][INFO] Epoch:[0/2](498900/4588595) loss:3.004 lr:0.0000100 epoch_Time:25951.0min: [2024-01-04 22:26:10,722][model8_pretrain.py][INFO] 
Epoch:[0/2](499000/4588595) loss:2.497 lr:0.0000100 epoch_Time:25950.0min: [2024-01-04 22:26:10,722][model8_pretrain.py][INFO] Epoch:[0/2](499000/4588595) loss:2.412 lr:0.0000100 epoch_Time:25950.0min: [2024-01-04 22:26:10,722][model8_pretrain.py][INFO] Epoch:[0/2](499000/4588595) loss:2.475 lr:0.0000100 epoch_Time:25950.0min: [2024-01-04 22:26:10,722][model8_pretrain.py][INFO] Epoch:[0/2](499000/4588595) loss:3.165 lr:0.0000100 epoch_Time:25950.0min: [2024-01-04 22:26:10,722][model8_pretrain.py][INFO] Epoch:[0/2](499000/4588595) loss:3.220 lr:0.0000100 epoch_Time:25950.0min: [2024-01-04 22:26:10,722][model8_pretrain.py][INFO] Epoch:[0/2](499000/4588595) loss:2.929 lr:0.0000100 epoch_Time:25950.0min: [2024-01-04 22:26:10,722][model8_pretrain.py][INFO] Epoch:[0/2](499000/4588595) loss:2.821 lr:0.0000100 epoch_Time:25950.0min: [2024-01-04 22:26:10,722][model8_pretrain.py][INFO] Epoch:[0/2](499000/4588595) loss:3.259 lr:0.0000100 epoch_Time:25950.0min: [2024-01-04 22:26:47,667][model8_pretrain.py][INFO] Epoch:[0/2](499100/4588595) loss:1.915 lr:0.0000100 epoch_Time:25949.0min: [2024-01-04 22:26:47,667][model8_pretrain.py][INFO] Epoch:[0/2](499100/4588595) loss:2.761 lr:0.0000100 epoch_Time:25949.0min: [2024-01-04 22:26:47,667][model8_pretrain.py][INFO] Epoch:[0/2](499100/4588595) loss:3.439 lr:0.0000100 epoch_Time:25949.0min: [2024-01-04 22:26:47,667][model8_pretrain.py][INFO] Epoch:[0/2](499100/4588595) loss:2.640 lr:0.0000100 epoch_Time:25949.0min: [2024-01-04 22:26:47,667][model8_pretrain.py][INFO] Epoch:[0/2](499100/4588595) loss:2.626 lr:0.0000100 epoch_Time:25949.0min: [2024-01-04 22:26:47,667][model8_pretrain.py][INFO] Epoch:[0/2](499100/4588595) loss:2.704 lr:0.0000100 epoch_Time:25949.0min: [2024-01-04 22:26:47,668][model8_pretrain.py][INFO] Epoch:[0/2](499100/4588595) loss:2.810 lr:0.0000100 epoch_Time:25949.0min: [2024-01-04 22:26:47,668][model8_pretrain.py][INFO] Epoch:[0/2](499100/4588595) loss:2.567 lr:0.0000100 epoch_Time:25949.0min: [2024-01-04 
22:27:24,599][model8_pretrain.py][INFO] Epoch:[0/2](499200/4588595) loss:2.960 lr:0.0000100 epoch_Time:25949.0min: [2024-01-04 22:27:24,599][model8_pretrain.py][INFO] Epoch:[0/2](499200/4588595) loss:2.453 lr:0.0000100 epoch_Time:25949.0min: [2024-01-04 22:27:24,599][model8_pretrain.py][INFO] Epoch:[0/2](499200/4588595) loss:3.086 lr:0.0000100 epoch_Time:25949.0min: [2024-01-04 22:27:24,599][model8_pretrain.py][INFO] Epoch:[0/2](499200/4588595) loss:2.798 lr:0.0000100 epoch_Time:25949.0min: [2024-01-04 22:27:24,599][model8_pretrain.py][INFO] Epoch:[0/2](499200/4588595) loss:3.371 lr:0.0000100 epoch_Time:25949.0min: [2024-01-04 22:27:24,599][model8_pretrain.py][INFO] Epoch:[0/2](499200/4588595) loss:3.018 lr:0.0000100 epoch_Time:25949.0min: [2024-01-04 22:27:24,599][model8_pretrain.py][INFO] Epoch:[0/2](499200/4588595) loss:2.956 lr:0.0000100 epoch_Time:25949.0min: [2024-01-04 22:27:24,599][model8_pretrain.py][INFO] Epoch:[0/2](499200/4588595) loss:2.718 lr:0.0000100 epoch_Time:25949.0min: [2024-01-04 22:28:01,543][model8_pretrain.py][INFO] Epoch:[0/2](499300/4588595) loss:3.311 lr:0.0000100 epoch_Time:25948.0min: [2024-01-04 22:28:01,543][model8_pretrain.py][INFO] Epoch:[0/2](499300/4588595) loss:3.102 lr:0.0000100 epoch_Time:25948.0min: [2024-01-04 22:28:01,544][model8_pretrain.py][INFO] Epoch:[0/2](499300/4588595) loss:2.152 lr:0.0000100 epoch_Time:25948.0min: [2024-01-04 22:28:01,544][model8_pretrain.py][INFO] Epoch:[0/2](499300/4588595) loss:3.255 lr:0.0000100 epoch_Time:25948.0min: [2024-01-04 22:28:01,543][model8_pretrain.py][INFO] Epoch:[0/2](499300/4588595) loss:2.372 lr:0.0000100 epoch_Time:25948.0min: [2024-01-04 22:28:01,544][model8_pretrain.py][INFO] Epoch:[0/2](499300/4588595) loss:2.302 lr:0.0000100 epoch_Time:25948.0min: [2024-01-04 22:28:01,544][model8_pretrain.py][INFO] Epoch:[0/2](499300/4588595) loss:3.253 lr:0.0000100 epoch_Time:25948.0min: [2024-01-04 22:28:01,544][model8_pretrain.py][INFO] Epoch:[0/2](499300/4588595) loss:2.342 lr:0.0000100 
epoch_Time:25948.0min: [2024-01-04 22:28:38,478][model8_pretrain.py][INFO] Epoch:[0/2](499400/4588595) loss:3.217 lr:0.0000100 epoch_Time:25947.0min: [2024-01-04 22:28:38,478][model8_pretrain.py][INFO] Epoch:[0/2](499400/4588595) loss:2.435 lr:0.0000100 epoch_Time:25948.0min: [2024-01-04 22:28:38,478][model8_pretrain.py][INFO] Epoch:[0/2](499400/4588595) loss:3.053 lr:0.0000100 epoch_Time:25948.0min: [2024-01-04 22:28:38,478][model8_pretrain.py][INFO] Epoch:[0/2](499400/4588595) loss:3.154 lr:0.0000100 epoch_Time:25947.0min: [2024-01-04 22:28:38,478][model8_pretrain.py][INFO] Epoch:[0/2](499400/4588595) loss:2.779 lr:0.0000100 epoch_Time:25948.0min: [2024-01-04 22:28:38,478][model8_pretrain.py][INFO] Epoch:[0/2](499400/4588595) loss:2.519 lr:0.0000100 epoch_Time:25948.0min: [2024-01-04 22:28:38,478][model8_pretrain.py][INFO] Epoch:[0/2](499400/4588595) loss:2.679 lr:0.0000100 epoch_Time:25948.0min: [2024-01-04 22:28:38,478][model8_pretrain.py][INFO] Epoch:[0/2](499400/4588595) loss:2.439 lr:0.0000100 epoch_Time:25948.0min: [2024-01-04 22:29:15,402][model8_pretrain.py][INFO] Epoch:[0/2](499500/4588595) loss:2.946 lr:0.0000100 epoch_Time:25946.0min: [2024-01-04 22:29:15,402][model8_pretrain.py][INFO] Epoch:[0/2](499500/4588595) loss:2.660 lr:0.0000100 epoch_Time:25946.0min: [2024-01-04 22:29:15,402][model8_pretrain.py][INFO] Epoch:[0/2](499500/4588595) loss:2.579 lr:0.0000100 epoch_Time:25946.0min: [2024-01-04 22:29:15,402][model8_pretrain.py][INFO] Epoch:[0/2](499500/4588595) loss:2.100 lr:0.0000100 epoch_Time:25946.0min: [2024-01-04 22:29:15,402][model8_pretrain.py][INFO] Epoch:[0/2](499500/4588595) loss:2.872 lr:0.0000100 epoch_Time:25946.0min: [2024-01-04 22:29:15,402][model8_pretrain.py][INFO] Epoch:[0/2](499500/4588595) loss:3.044 lr:0.0000100 epoch_Time:25946.0min: [2024-01-04 22:29:15,402][model8_pretrain.py][INFO] Epoch:[0/2](499500/4588595) loss:3.218 lr:0.0000100 epoch_Time:25946.0min: [2024-01-04 22:29:15,402][model8_pretrain.py][INFO] 
Epoch:[0/2](499500/4588595) loss:2.972 lr:0.0000100 epoch_Time:25946.0min: [2024-01-04 22:29:52,358][model8_pretrain.py][INFO] Epoch:[0/2](499600/4588595) loss:2.942 lr:0.0000100 epoch_Time:25945.0min: [2024-01-04 22:29:52,359][model8_pretrain.py][INFO] Epoch:[0/2](499600/4588595) loss:2.890 lr:0.0000100 epoch_Time:25945.0min: [2024-01-04 22:29:52,359][model8_pretrain.py][INFO] Epoch:[0/2](499600/4588595) loss:2.776 lr:0.0000100 epoch_Time:25945.0min: [2024-01-04 22:29:52,359][model8_pretrain.py][INFO] Epoch:[0/2](499600/4588595) loss:2.546 lr:0.0000100 epoch_Time:25945.0min: [2024-01-04 22:29:52,359][model8_pretrain.py][INFO] Epoch:[0/2](499600/4588595) loss:2.856 lr:0.0000100 epoch_Time:25945.0min: [2024-01-04 22:29:52,359][model8_pretrain.py][INFO] Epoch:[0/2](499600/4588595) loss:2.764 lr:0.0000100 epoch_Time:25945.0min: [2024-01-04 22:29:52,359][model8_pretrain.py][INFO] Epoch:[0/2](499600/4588595) loss:2.553 lr:0.0000100 epoch_Time:25945.0min: [2024-01-04 22:29:52,359][model8_pretrain.py][INFO] Epoch:[0/2](499600/4588595) loss:2.910 lr:0.0000100 epoch_Time:25945.0min: [2024-01-04 22:30:39,744][model8_pretrain.py][INFO] Epoch:[0/2](499700/4588595) loss:3.001 lr:0.0000100 epoch_Time:25947.0min: [2024-01-04 22:30:39,744][model8_pretrain.py][INFO] Epoch:[0/2](499700/4588595) loss:2.787 lr:0.0000100 epoch_Time:25947.0min: [2024-01-04 22:30:39,744][model8_pretrain.py][INFO] Epoch:[0/2](499700/4588595) loss:3.275 lr:0.0000100 epoch_Time:25947.0min: [2024-01-04 22:30:39,744][model8_pretrain.py][INFO] Epoch:[0/2](499700/4588595) loss:2.494 lr:0.0000100 epoch_Time:25947.0min: [2024-01-04 22:30:39,744][model8_pretrain.py][INFO] Epoch:[0/2](499700/4588595) loss:3.117 lr:0.0000100 epoch_Time:25947.0min: [2024-01-04 22:30:39,744][model8_pretrain.py][INFO] Epoch:[0/2](499700/4588595) loss:2.792 lr:0.0000100 epoch_Time:25947.0min: [2024-01-04 22:30:39,744][model8_pretrain.py][INFO] Epoch:[0/2](499700/4588595) loss:2.670 lr:0.0000100 epoch_Time:25947.0min: [2024-01-04 
22:30:39,744][model8_pretrain.py][INFO] Epoch:[0/2](499700/4588595) loss:2.749 lr:0.0000100 epoch_Time:25947.0min: [2024-01-04 22:31:16,676][model8_pretrain.py][INFO] Epoch:[0/2](499800/4588595) loss:2.876 lr:0.0000100 epoch_Time:25945.0min: [2024-01-04 22:31:16,676][model8_pretrain.py][INFO] Epoch:[0/2](499800/4588595) loss:2.709 lr:0.0000100 epoch_Time:25945.0min: [2024-01-04 22:31:16,676][model8_pretrain.py][INFO] Epoch:[0/2](499800/4588595) loss:2.104 lr:0.0000100 epoch_Time:25945.0min: [2024-01-04 22:31:16,676][model8_pretrain.py][INFO] Epoch:[0/2](499800/4588595) loss:2.282 lr:0.0000100 epoch_Time:25945.0min: [2024-01-04 22:31:16,676][model8_pretrain.py][INFO] Epoch:[0/2](499800/4588595) loss:2.919 lr:0.0000100 epoch_Time:25945.0min: [2024-01-04 22:31:16,676][model8_pretrain.py][INFO] Epoch:[0/2](499800/4588595) loss:3.015 lr:0.0000100 epoch_Time:25945.0min: [2024-01-04 22:31:16,676][model8_pretrain.py][INFO] Epoch:[0/2](499800/4588595) loss:2.499 lr:0.0000100 epoch_Time:25945.0min: [2024-01-04 22:31:16,676][model8_pretrain.py][INFO] Epoch:[0/2](499800/4588595) loss:2.762 lr:0.0000100 epoch_Time:25945.0min: [2024-01-04 22:31:53,626][model8_pretrain.py][INFO] Epoch:[0/2](499900/4588595) loss:2.371 lr:0.0000100 epoch_Time:25944.0min: [2024-01-04 22:31:53,626][model8_pretrain.py][INFO] Epoch:[0/2](499900/4588595) loss:2.868 lr:0.0000100 epoch_Time:25944.0min: [2024-01-04 22:31:53,626][model8_pretrain.py][INFO] Epoch:[0/2](499900/4588595) loss:2.902 lr:0.0000100 epoch_Time:25944.0min: [2024-01-04 22:31:53,626][model8_pretrain.py][INFO] Epoch:[0/2](499900/4588595) loss:2.481 lr:0.0000100 epoch_Time:25944.0min: [2024-01-04 22:31:53,626][model8_pretrain.py][INFO] Epoch:[0/2](499900/4588595) loss:3.110 lr:0.0000100 epoch_Time:25944.0min: [2024-01-04 22:31:53,626][model8_pretrain.py][INFO] Epoch:[0/2](499900/4588595) loss:2.676 lr:0.0000100 epoch_Time:25944.0min: [2024-01-04 22:31:53,626][model8_pretrain.py][INFO] Epoch:[0/2](499900/4588595) loss:1.705 lr:0.0000100 
epoch_Time:25944.0min: [2024-01-04 22:31:53,627][model8_pretrain.py][INFO] Epoch:[0/2](499900/4588595) loss:2.453 lr:0.0000100 epoch_Time:25944.0min: [2024-01-04 22:32:30,575][model8_pretrain.py][INFO] Epoch:[0/2](500000/4588595) loss:2.885 lr:0.0000100 epoch_Time:25944.0min: [2024-01-04 22:32:30,575][model8_pretrain.py][INFO] Epoch:[0/2](500000/4588595) loss:3.090 lr:0.0000100 epoch_Time:25944.0min: [2024-01-04 22:32:30,575][model8_pretrain.py][INFO] Epoch:[0/2](500000/4588595) loss:3.039 lr:0.0000100 epoch_Time:25944.0min: [2024-01-04 22:32:30,575][model8_pretrain.py][INFO] Epoch:[0/2](500000/4588595) loss:2.998 lr:0.0000100 epoch_Time:25944.0min: [2024-01-04 22:32:30,575][model8_pretrain.py][INFO] Epoch:[0/2](500000/4588595) loss:2.649 lr:0.0000100 epoch_Time:25944.0min: [2024-01-04 22:32:30,575][model8_pretrain.py][INFO] Epoch:[0/2](500000/4588595) loss:2.471 lr:0.0000100 epoch_Time:25944.0min: [2024-01-04 22:32:30,575][model8_pretrain.py][INFO] Epoch:[0/2](500000/4588595) loss:2.927 lr:0.0000100 epoch_Time:25944.0min: [2024-01-04 22:32:30,576][model8_pretrain.py][INFO] Epoch:[0/2](500000/4588595) loss:1.927 lr:0.0000100 epoch_Time:25944.0min: [2024-01-04 22:33:07,527][model8_pretrain.py][INFO] Epoch:[0/2](500100/4588595) loss:3.180 lr:0.0000100 epoch_Time:25943.0min: [2024-01-04 22:33:07,527][model8_pretrain.py][INFO] Epoch:[0/2](500100/4588595) loss:3.343 lr:0.0000100 epoch_Time:25943.0min: [2024-01-04 22:33:07,527][model8_pretrain.py][INFO] Epoch:[0/2](500100/4588595) loss:2.226 lr:0.0000100 epoch_Time:25943.0min: [2024-01-04 22:33:07,527][model8_pretrain.py][INFO] Epoch:[0/2](500100/4588595) loss:2.391 lr:0.0000100 epoch_Time:25943.0min: [2024-01-04 22:33:07,527][model8_pretrain.py][INFO] Epoch:[0/2](500100/4588595) loss:2.669 lr:0.0000100 epoch_Time:25943.0min: [2024-01-04 22:33:07,527][model8_pretrain.py][INFO] Epoch:[0/2](500100/4588595) loss:2.915 lr:0.0000100 epoch_Time:25943.0min: [2024-01-04 22:33:07,528][model8_pretrain.py][INFO] 
Epoch:[0/2](500100/4588595) loss:2.298 lr:0.0000100 epoch_Time:25943.0min: [2024-01-04 22:33:07,528][model8_pretrain.py][INFO] Epoch:[0/2](500100/4588595) loss:2.793 lr:0.0000100 epoch_Time:25943.0min: [2024-01-04 22:33:44,478][model8_pretrain.py][INFO] Epoch:[0/2](500200/4588595) loss:2.570 lr:0.0000100 epoch_Time:25943.0min: [2024-01-04 22:33:44,478][model8_pretrain.py][INFO] Epoch:[0/2](500200/4588595) loss:2.871 lr:0.0000100 epoch_Time:25943.0min: [2024-01-04 22:33:44,478][model8_pretrain.py][INFO] Epoch:[0/2](500200/4588595) loss:2.848 lr:0.0000100 epoch_Time:25943.0min: [2024-01-04 22:33:44,478][model8_pretrain.py][INFO] Epoch:[0/2](500200/4588595) loss:3.013 lr:0.0000100 epoch_Time:25943.0min: [2024-01-04 22:33:44,478][model8_pretrain.py][INFO] Epoch:[0/2](500200/4588595) loss:2.528 lr:0.0000100 epoch_Time:25943.0min: [2024-01-04 22:33:44,478][model8_pretrain.py][INFO] Epoch:[0/2](500200/4588595) loss:3.269 lr:0.0000100 epoch_Time:25943.0min: [2024-01-04 22:33:44,478][model8_pretrain.py][INFO] Epoch:[0/2](500200/4588595) loss:3.458 lr:0.0000100 epoch_Time:25943.0min: [2024-01-04 22:33:44,479][model8_pretrain.py][INFO] Epoch:[0/2](500200/4588595) loss:2.825 lr:0.0000100 epoch_Time:25943.0min: [2024-01-04 22:34:21,440][model8_pretrain.py][INFO] Epoch:[0/2](500300/4588595) loss:2.659 lr:0.0000100 epoch_Time:25942.0min: [2024-01-04 22:34:21,440][model8_pretrain.py][INFO] Epoch:[0/2](500300/4588595) loss:2.345 lr:0.0000100 epoch_Time:25942.0min: [2024-01-04 22:34:21,440][model8_pretrain.py][INFO] Epoch:[0/2](500300/4588595) loss:2.924 lr:0.0000100 epoch_Time:25942.0min: [2024-01-04 22:34:21,440][model8_pretrain.py][INFO] Epoch:[0/2](500300/4588595) loss:2.865 lr:0.0000100 epoch_Time:25942.0min: [2024-01-04 22:34:21,440][model8_pretrain.py][INFO] Epoch:[0/2](500300/4588595) loss:3.002 lr:0.0000100 epoch_Time:25942.0min: [2024-01-04 22:34:21,440][model8_pretrain.py][INFO] Epoch:[0/2](500300/4588595) loss:2.415 lr:0.0000100 epoch_Time:25942.0min: [2024-01-04 
22:34:21,440][model8_pretrain.py][INFO] Epoch:[0/2](500300/4588595) loss:3.284 lr:0.0000100 epoch_Time:25942.0min: [2024-01-04 22:34:21,440][model8_pretrain.py][INFO] Epoch:[0/2](500300/4588595) loss:2.374 lr:0.0000100 epoch_Time:25942.0min: [2024-01-04 22:34:58,393][model8_pretrain.py][INFO] Epoch:[0/2](500400/4588595) loss:3.189 lr:0.0000100 epoch_Time:25940.0min: [2024-01-04 22:34:58,393][model8_pretrain.py][INFO] Epoch:[0/2](500400/4588595) loss:2.360 lr:0.0000100 epoch_Time:25940.0min: [2024-01-04 22:34:58,393][model8_pretrain.py][INFO] Epoch:[0/2](500400/4588595) loss:2.691 lr:0.0000100 epoch_Time:25940.0min: [2024-01-04 22:34:58,393][model8_pretrain.py][INFO] Epoch:[0/2](500400/4588595) loss:3.132 lr:0.0000100 epoch_Time:25940.0min: [2024-01-04 22:34:58,393][model8_pretrain.py][INFO] Epoch:[0/2](500400/4588595) loss:2.604 lr:0.0000100 epoch_Time:25940.0min: [2024-01-04 22:34:58,393][model8_pretrain.py][INFO] Epoch:[0/2](500400/4588595) loss:2.502 lr:0.0000100 epoch_Time:25940.0min: [2024-01-04 22:34:58,393][model8_pretrain.py][INFO] Epoch:[0/2](500400/4588595) loss:3.155 lr:0.0000100 epoch_Time:25940.0min: [2024-01-04 22:34:58,393][model8_pretrain.py][INFO] Epoch:[0/2](500400/4588595) loss:2.868 lr:0.0000100 epoch_Time:25940.0min: [2024-01-04 22:35:45,846][model8_pretrain.py][INFO] Epoch:[0/2](500500/4588595) loss:2.895 lr:0.0000100 epoch_Time:25942.0min: [2024-01-04 22:35:45,846][model8_pretrain.py][INFO] Epoch:[0/2](500500/4588595) loss:2.882 lr:0.0000100 epoch_Time:25942.0min: [2024-01-04 22:35:45,846][model8_pretrain.py][INFO] Epoch:[0/2](500500/4588595) loss:2.495 lr:0.0000100 epoch_Time:25942.0min: [2024-01-04 22:35:45,846][model8_pretrain.py][INFO] Epoch:[0/2](500500/4588595) loss:2.998 lr:0.0000100 epoch_Time:25942.0min: [2024-01-04 22:35:45,846][model8_pretrain.py][INFO] Epoch:[0/2](500500/4588595) loss:2.751 lr:0.0000100 epoch_Time:25942.0min: [2024-01-04 22:35:45,846][model8_pretrain.py][INFO] Epoch:[0/2](500500/4588595) loss:2.637 lr:0.0000100 
epoch_Time:25942.0min: [2024-01-04 22:35:45,846][model8_pretrain.py][INFO] Epoch:[0/2](500500/4588595) loss:3.011 lr:0.0000100 epoch_Time:25942.0min: [2024-01-04 22:35:45,846][model8_pretrain.py][INFO] Epoch:[0/2](500500/4588595) loss:2.944 lr:0.0000100 epoch_Time:25942.0min: [2024-01-04 22:36:22,783][model8_pretrain.py][INFO] Epoch:[0/2](500600/4588595) loss:3.089 lr:0.0000100 epoch_Time:25941.0min: [2024-01-04 22:36:22,783][model8_pretrain.py][INFO] Epoch:[0/2](500600/4588595) loss:2.643 lr:0.0000100 epoch_Time:25941.0min: [2024-01-04 22:36:22,783][model8_pretrain.py][INFO] Epoch:[0/2](500600/4588595) loss:2.255 lr:0.0000100 epoch_Time:25941.0min: [2024-01-04 22:36:22,783][model8_pretrain.py][INFO] Epoch:[0/2](500600/4588595) loss:2.841 lr:0.0000100 epoch_Time:25941.0min: [2024-01-04 22:36:22,783][model8_pretrain.py][INFO] Epoch:[0/2](500600/4588595) loss:3.026 lr:0.0000100 epoch_Time:25941.0min: [2024-01-04 22:36:22,783][model8_pretrain.py][INFO] Epoch:[0/2](500600/4588595) loss:2.779 lr:0.0000100 epoch_Time:25941.0min: [2024-01-04 22:36:22,783][model8_pretrain.py][INFO] Epoch:[0/2](500600/4588595) loss:3.349 lr:0.0000100 epoch_Time:25941.0min: [2024-01-04 22:36:22,783][model8_pretrain.py][INFO] Epoch:[0/2](500600/4588595) loss:3.188 lr:0.0000100 epoch_Time:25941.0min: [2024-01-04 22:36:59,693][model8_pretrain.py][INFO] Epoch:[0/2](500700/4588595) loss:2.297 lr:0.0000100 epoch_Time:25939.0min: [2024-01-04 22:36:59,693][model8_pretrain.py][INFO] Epoch:[0/2](500700/4588595) loss:2.976 lr:0.0000100 epoch_Time:25939.0min: [2024-01-04 22:36:59,693][model8_pretrain.py][INFO] Epoch:[0/2](500700/4588595) loss:2.913 lr:0.0000100 epoch_Time:25939.0min: [2024-01-04 22:36:59,693][model8_pretrain.py][INFO] Epoch:[0/2](500700/4588595) loss:2.749 lr:0.0000100 epoch_Time:25939.0min: [2024-01-04 22:36:59,693][model8_pretrain.py][INFO] Epoch:[0/2](500700/4588595) loss:2.970 lr:0.0000100 epoch_Time:25939.0min: [2024-01-04 22:36:59,693][model8_pretrain.py][INFO] 
Epoch:[0/2](500700/4588595) loss:3.281 lr:0.0000100 epoch_Time:25939.0min: [2024-01-04 22:36:59,693][model8_pretrain.py][INFO] Epoch:[0/2](500700/4588595) loss:2.510 lr:0.0000100 epoch_Time:25939.0min: [2024-01-04 22:36:59,693][model8_pretrain.py][INFO] Epoch:[0/2](500700/4588595) loss:2.980 lr:0.0000100 epoch_Time:25939.0min: [2024-01-04 22:37:36,627][model8_pretrain.py][INFO] Epoch:[0/2](500800/4588595) loss:2.999 lr:0.0000100 epoch_Time:25939.0min: [2024-01-04 22:37:36,627][model8_pretrain.py][INFO] Epoch:[0/2](500800/4588595) loss:2.804 lr:0.0000100 epoch_Time:25939.0min: [2024-01-04 22:37:36,627][model8_pretrain.py][INFO] Epoch:[0/2](500800/4588595) loss:3.114 lr:0.0000100 epoch_Time:25939.0min: [2024-01-04 22:37:36,627][model8_pretrain.py][INFO] Epoch:[0/2](500800/4588595) loss:2.963 lr:0.0000100 epoch_Time:25939.0min: [2024-01-04 22:37:36,627][model8_pretrain.py][INFO] Epoch:[0/2](500800/4588595) loss:3.408 lr:0.0000100 epoch_Time:25939.0min: [2024-01-04 22:37:36,627][model8_pretrain.py][INFO] Epoch:[0/2](500800/4588595) loss:2.864 lr:0.0000100 epoch_Time:25939.0min: [2024-01-04 22:37:36,627][model8_pretrain.py][INFO] Epoch:[0/2](500800/4588595) loss:2.495 lr:0.0000100 epoch_Time:25939.0min: [2024-01-04 22:37:36,627][model8_pretrain.py][INFO] Epoch:[0/2](500800/4588595) loss:2.642 lr:0.0000100 epoch_Time:25939.0min: [2024-01-04 22:38:13,589][model8_pretrain.py][INFO] Epoch:[0/2](500900/4588595) loss:2.358 lr:0.0000100 epoch_Time:25938.0min: [2024-01-04 22:38:13,589][model8_pretrain.py][INFO] Epoch:[0/2](500900/4588595) loss:2.842 lr:0.0000100 epoch_Time:25938.0min: [2024-01-04 22:38:13,589][model8_pretrain.py][INFO] Epoch:[0/2](500900/4588595) loss:2.986 lr:0.0000100 epoch_Time:25938.0min: [2024-01-04 22:38:13,589][model8_pretrain.py][INFO] Epoch:[0/2](500900/4588595) loss:2.777 lr:0.0000100 epoch_Time:25938.0min: [2024-01-04 22:38:13,589][model8_pretrain.py][INFO] Epoch:[0/2](500900/4588595) loss:2.902 lr:0.0000100 epoch_Time:25938.0min: [2024-01-04 
22:38:13,590][model8_pretrain.py][INFO] Epoch:[0/2](500900/4588595) loss:2.750 lr:0.0000100 epoch_Time:25938.0min: [2024-01-04 22:38:13,589][model8_pretrain.py][INFO] Epoch:[0/2](500900/4588595) loss:2.939 lr:0.0000100 epoch_Time:25938.0min: [2024-01-04 22:38:13,590][model8_pretrain.py][INFO] Epoch:[0/2](500900/4588595) loss:2.745 lr:0.0000100 epoch_Time:25938.0min: [2024-01-04 22:38:50,526][model8_pretrain.py][INFO] Epoch:[0/2](501000/4588595) loss:2.545 lr:0.0000100 epoch_Time:25937.0min: [2024-01-04 22:38:50,526][model8_pretrain.py][INFO] Epoch:[0/2](501000/4588595) loss:2.806 lr:0.0000100 epoch_Time:25937.0min: [2024-01-04 22:38:50,526][model8_pretrain.py][INFO] Epoch:[0/2](501000/4588595) loss:2.746 lr:0.0000100 epoch_Time:25937.0min: [2024-01-04 22:38:50,526][model8_pretrain.py][INFO] Epoch:[0/2](501000/4588595) loss:3.219 lr:0.0000100 epoch_Time:25937.0min: [2024-01-04 22:38:50,526][model8_pretrain.py][INFO] Epoch:[0/2](501000/4588595) loss:3.154 lr:0.0000100 epoch_Time:25937.0min: [2024-01-04 22:38:50,526][model8_pretrain.py][INFO] Epoch:[0/2](501000/4588595) loss:2.623 lr:0.0000100 epoch_Time:25937.0min: [2024-01-04 22:38:50,526][model8_pretrain.py][INFO] Epoch:[0/2](501000/4588595) loss:3.157 lr:0.0000100 epoch_Time:25937.0min: [2024-01-04 22:38:50,526][model8_pretrain.py][INFO] Epoch:[0/2](501000/4588595) loss:2.803 lr:0.0000100 epoch_Time:25937.0min: [2024-01-04 22:39:27,472][model8_pretrain.py][INFO] Epoch:[0/2](501100/4588595) loss:2.446 lr:0.0000100 epoch_Time:25937.0min: [2024-01-04 22:39:27,472][model8_pretrain.py][INFO] Epoch:[0/2](501100/4588595) loss:2.894 lr:0.0000100 epoch_Time:25937.0min: [2024-01-04 22:39:27,472][model8_pretrain.py][INFO] Epoch:[0/2](501100/4588595) loss:2.977 lr:0.0000100 epoch_Time:25937.0min: [2024-01-04 22:39:27,472][model8_pretrain.py][INFO] Epoch:[0/2](501100/4588595) loss:2.720 lr:0.0000100 epoch_Time:25937.0min: [2024-01-04 22:39:27,472][model8_pretrain.py][INFO] Epoch:[0/2](501100/4588595) loss:2.621 lr:0.0000100 
epoch_Time:25937.0min: [2024-01-04 22:39:27,472][model8_pretrain.py][INFO] Epoch:[0/2](501100/4588595) loss:3.199 lr:0.0000100 epoch_Time:25937.0min: [2024-01-04 22:39:27,472][model8_pretrain.py][INFO] Epoch:[0/2](501100/4588595) loss:2.975 lr:0.0000100 epoch_Time:25937.0min: [2024-01-04 22:39:27,472][model8_pretrain.py][INFO] Epoch:[0/2](501100/4588595) loss:2.860 lr:0.0000100 epoch_Time:25937.0min: [2024-01-04 22:40:04,438][model8_pretrain.py][INFO] Epoch:[0/2](501200/4588595) loss:3.196 lr:0.0000100 epoch_Time:25936.0min: [2024-01-04 22:40:04,438][model8_pretrain.py][INFO] Epoch:[0/2](501200/4588595) loss:2.288 lr:0.0000100 epoch_Time:25936.0min: [2024-01-04 22:40:04,438][model8_pretrain.py][INFO] Epoch:[0/2](501200/4588595) loss:2.674 lr:0.0000100 epoch_Time:25936.0min: [2024-01-04 22:40:04,438][model8_pretrain.py][INFO] Epoch:[0/2](501200/4588595) loss:2.692 lr:0.0000100 epoch_Time:25936.0min: [2024-01-04 22:40:04,438][model8_pretrain.py][INFO] Epoch:[0/2](501200/4588595) loss:2.995 lr:0.0000100 epoch_Time:25936.0min: [2024-01-04 22:40:04,438][model8_pretrain.py][INFO] Epoch:[0/2](501200/4588595) loss:2.499 lr:0.0000100 epoch_Time:25936.0min: [2024-01-04 22:40:04,438][model8_pretrain.py][INFO] Epoch:[0/2](501200/4588595) loss:3.234 lr:0.0000100 epoch_Time:25936.0min: [2024-01-04 22:40:04,438][model8_pretrain.py][INFO] Epoch:[0/2](501200/4588595) loss:3.308 lr:0.0000100 epoch_Time:25936.0min: [2024-01-04 22:40:51,773][model8_pretrain.py][INFO] Epoch:[0/2](501300/4588595) loss:2.850 lr:0.0000100 epoch_Time:25936.0min: [2024-01-04 22:40:51,773][model8_pretrain.py][INFO] Epoch:[0/2](501300/4588595) loss:3.315 lr:0.0000100 epoch_Time:25936.0min: [2024-01-04 22:40:51,773][model8_pretrain.py][INFO] Epoch:[0/2](501300/4588595) loss:3.188 lr:0.0000100 epoch_Time:25936.0min: [2024-01-04 22:40:51,773][model8_pretrain.py][INFO] Epoch:[0/2](501300/4588595) loss:3.095 lr:0.0000100 epoch_Time:25936.0min: [2024-01-04 22:40:51,773][model8_pretrain.py][INFO] 
Epoch:[0/2](501300/4588595) loss:2.818 lr:0.0000100 epoch_Time:25936.0min: [2024-01-04 22:40:51,773][model8_pretrain.py][INFO] Epoch:[0/2](501300/4588595) loss:3.115 lr:0.0000100 epoch_Time:25936.0min: [2024-01-04 22:40:51,773][model8_pretrain.py][INFO] Epoch:[0/2](501300/4588595) loss:2.591 lr:0.0000100 epoch_Time:25936.0min: [2024-01-04 22:40:51,773][model8_pretrain.py][INFO] Epoch:[0/2](501300/4588595) loss:2.590 lr:0.0000100 epoch_Time:25936.0min: [2024-01-04 22:41:28,693][model8_pretrain.py][INFO] Epoch:[0/2](501400/4588595) loss:2.993 lr:0.0000100 epoch_Time:25936.0min: [2024-01-04 22:41:28,693][model8_pretrain.py][INFO] Epoch:[0/2](501400/4588595) loss:2.707 lr:0.0000100 epoch_Time:25936.0min: [2024-01-04 22:41:28,693][model8_pretrain.py][INFO] Epoch:[0/2](501400/4588595) loss:3.141 lr:0.0000100 epoch_Time:25936.0min: [2024-01-04 22:41:28,693][model8_pretrain.py][INFO] Epoch:[0/2](501400/4588595) loss:2.252 lr:0.0000100 epoch_Time:25936.0min: [2024-01-04 22:41:28,693][model8_pretrain.py][INFO] Epoch:[0/2](501400/4588595) loss:2.247 lr:0.0000100 epoch_Time:25936.0min: [2024-01-04 22:41:28,693][model8_pretrain.py][INFO] Epoch:[0/2](501400/4588595) loss:2.955 lr:0.0000100 epoch_Time:25936.0min: [2024-01-04 22:41:28,693][model8_pretrain.py][INFO] Epoch:[0/2](501400/4588595) loss:3.153 lr:0.0000100 epoch_Time:25936.0min: [2024-01-04 22:41:28,694][model8_pretrain.py][INFO] Epoch:[0/2](501400/4588595) loss:2.709 lr:0.0000100 epoch_Time:25936.0min: [2024-01-04 22:42:05,621][model8_pretrain.py][INFO] Epoch:[0/2](501500/4588595) loss:2.834 lr:0.0000100 epoch_Time:25935.0min: [2024-01-04 22:42:05,621][model8_pretrain.py][INFO] Epoch:[0/2](501500/4588595) loss:3.173 lr:0.0000100 epoch_Time:25935.0min: [2024-01-04 22:42:05,621][model8_pretrain.py][INFO] Epoch:[0/2](501500/4588595) loss:3.062 lr:0.0000100 epoch_Time:25935.0min: [2024-01-04 22:42:05,621][model8_pretrain.py][INFO] Epoch:[0/2](501500/4588595) loss:2.887 lr:0.0000100 epoch_Time:25935.0min: [2024-01-04 
22:42:05,621][model8_pretrain.py][INFO] Epoch:[0/2](501500/4588595) loss:2.680 lr:0.0000100 epoch_Time:25935.0min: [2024-01-04 22:42:05,621][model8_pretrain.py][INFO] Epoch:[0/2](501500/4588595) loss:2.741 lr:0.0000100 epoch_Time:25935.0min: [2024-01-04 22:42:05,621][model8_pretrain.py][INFO] Epoch:[0/2](501500/4588595) loss:2.048 lr:0.0000100 epoch_Time:25935.0min: [2024-01-04 22:42:05,621][model8_pretrain.py][INFO] Epoch:[0/2](501500/4588595) loss:3.005 lr:0.0000100 epoch_Time:25935.0min: [2024-01-04 22:42:42,556][model8_pretrain.py][INFO] Epoch:[0/2](501600/4588595) loss:2.614 lr:0.0000100 epoch_Time:25934.0min: [2024-01-04 22:42:42,556][model8_pretrain.py][INFO] Epoch:[0/2](501600/4588595) loss:3.340 lr:0.0000100 epoch_Time:25934.0min: [2024-01-04 22:42:42,556][model8_pretrain.py][INFO] Epoch:[0/2](501600/4588595) loss:3.167 lr:0.0000100 epoch_Time:25934.0min: [2024-01-04 22:42:42,556][model8_pretrain.py][INFO] Epoch:[0/2](501600/4588595) loss:2.912 lr:0.0000100 epoch_Time:25934.0min: [2024-01-04 22:42:42,556][model8_pretrain.py][INFO] Epoch:[0/2](501600/4588595) loss:3.219 lr:0.0000100 epoch_Time:25934.0min: [2024-01-04 22:42:42,556][model8_pretrain.py][INFO] Epoch:[0/2](501600/4588595) loss:2.814 lr:0.0000100 epoch_Time:25934.0min: [2024-01-04 22:42:42,556][model8_pretrain.py][INFO] Epoch:[0/2](501600/4588595) loss:2.974 lr:0.0000100 epoch_Time:25934.0min: [2024-01-04 22:42:42,557][model8_pretrain.py][INFO] Epoch:[0/2](501600/4588595) loss:3.115 lr:0.0000100 epoch_Time:25934.0min: [2024-01-04 22:43:19,499][model8_pretrain.py][INFO] Epoch:[0/2](501700/4588595) loss:2.664 lr:0.0000100 epoch_Time:25933.0min: [2024-01-04 22:43:19,499][model8_pretrain.py][INFO] Epoch:[0/2](501700/4588595) loss:2.850 lr:0.0000100 epoch_Time:25933.0min: [2024-01-04 22:43:19,499][model8_pretrain.py][INFO] Epoch:[0/2](501700/4588595) loss:2.831 lr:0.0000100 epoch_Time:25933.0min: [2024-01-04 22:43:19,499][model8_pretrain.py][INFO] Epoch:[0/2](501700/4588595) loss:3.282 lr:0.0000100 
epoch_Time:25933.0min: [2024-01-04 22:43:19,499][model8_pretrain.py][INFO] Epoch:[0/2](501700/4588595) loss:2.339 lr:0.0000100 epoch_Time:25933.0min: [2024-01-04 22:43:19,499][model8_pretrain.py][INFO] Epoch:[0/2](501700/4588595) loss:2.798 lr:0.0000100 epoch_Time:25933.0min: [2024-01-04 22:43:19,499][model8_pretrain.py][INFO] Epoch:[0/2](501700/4588595) loss:2.516 lr:0.0000100 epoch_Time:25933.0min: [2024-01-04 22:43:19,499][model8_pretrain.py][INFO] Epoch:[0/2](501700/4588595) loss:3.007 lr:0.0000100 epoch_Time:25933.0min: [2024-01-04 22:43:56,425][model8_pretrain.py][INFO] Epoch:[0/2](501800/4588595) loss:3.097 lr:0.0000100 epoch_Time:25932.0min: [2024-01-04 22:43:56,425][model8_pretrain.py][INFO] Epoch:[0/2](501800/4588595) loss:2.501 lr:0.0000100 epoch_Time:25932.0min: [2024-01-04 22:43:56,425][model8_pretrain.py][INFO] Epoch:[0/2](501800/4588595) loss:3.152 lr:0.0000100 epoch_Time:25932.0min: [2024-01-04 22:43:56,425][model8_pretrain.py][INFO] Epoch:[0/2](501800/4588595) loss:2.889 lr:0.0000100 epoch_Time:25932.0min: [2024-01-04 22:43:56,425][model8_pretrain.py][INFO] Epoch:[0/2](501800/4588595) loss:2.529 lr:0.0000100 epoch_Time:25932.0min: [2024-01-04 22:43:56,425][model8_pretrain.py][INFO] Epoch:[0/2](501800/4588595) loss:2.138 lr:0.0000100 epoch_Time:25932.0min: [2024-01-04 22:43:56,425][model8_pretrain.py][INFO] Epoch:[0/2](501800/4588595) loss:3.216 lr:0.0000100 epoch_Time:25932.0min: [2024-01-04 22:43:56,426][model8_pretrain.py][INFO] Epoch:[0/2](501800/4588595) loss:2.978 lr:0.0000100 epoch_Time:25932.0min: [2024-01-04 22:44:33,358][model8_pretrain.py][INFO] Epoch:[0/2](501900/4588595) loss:2.446 lr:0.0000100 epoch_Time:25932.0min: [2024-01-04 22:44:33,358][model8_pretrain.py][INFO] Epoch:[0/2](501900/4588595) loss:2.850 lr:0.0000100 epoch_Time:25932.0min: [2024-01-04 22:44:33,358][model8_pretrain.py][INFO] Epoch:[0/2](501900/4588595) loss:2.421 lr:0.0000100 epoch_Time:25932.0min: [2024-01-04 22:44:33,358][model8_pretrain.py][INFO] 
Epoch:[0/2](501900/4588595) loss:3.035 lr:0.0000100 epoch_Time:25932.0min: [2024-01-04 22:44:33,358][model8_pretrain.py][INFO] Epoch:[0/2](501900/4588595) loss:3.177 lr:0.0000100 epoch_Time:25932.0min: [2024-01-04 22:44:33,358][model8_pretrain.py][INFO] Epoch:[0/2](501900/4588595) loss:3.310 lr:0.0000100 epoch_Time:25932.0min: [2024-01-04 22:44:33,358][model8_pretrain.py][INFO] Epoch:[0/2](501900/4588595) loss:2.897 lr:0.0000100 epoch_Time:25932.0min: [2024-01-04 22:44:33,358][model8_pretrain.py][INFO] Epoch:[0/2](501900/4588595) loss:2.925 lr:0.0000100 epoch_Time:25932.0min: [2024-01-04 22:45:10,291][model8_pretrain.py][INFO] Epoch:[0/2](502000/4588595) loss:3.057 lr:0.0000100 epoch_Time:25931.0min: [2024-01-04 22:45:10,291][model8_pretrain.py][INFO] Epoch:[0/2](502000/4588595) loss:2.973 lr:0.0000100 epoch_Time:25931.0min: [2024-01-04 22:45:10,291][model8_pretrain.py][INFO] Epoch:[0/2](502000/4588595) loss:2.888 lr:0.0000100 epoch_Time:25931.0min: [2024-01-04 22:45:10,291][model8_pretrain.py][INFO] Epoch:[0/2](502000/4588595) loss:3.227 lr:0.0000100 epoch_Time:25931.0min: [2024-01-04 22:45:10,291][model8_pretrain.py][INFO] Epoch:[0/2](502000/4588595) loss:2.229 lr:0.0000100 epoch_Time:25931.0min: [2024-01-04 22:45:10,291][model8_pretrain.py][INFO] Epoch:[0/2](502000/4588595) loss:2.646 lr:0.0000100 epoch_Time:25931.0min: [2024-01-04 22:45:10,291][model8_pretrain.py][INFO] Epoch:[0/2](502000/4588595) loss:2.908 lr:0.0000100 epoch_Time:25931.0min: [2024-01-04 22:45:10,291][model8_pretrain.py][INFO] Epoch:[0/2](502000/4588595) loss:3.107 lr:0.0000100 epoch_Time:25931.0min: [2024-01-04 22:45:57,687][model8_pretrain.py][INFO] Epoch:[0/2](502100/4588595) loss:2.305 lr:0.0000100 epoch_Time:25931.0min: [2024-01-04 22:45:57,687][model8_pretrain.py][INFO] Epoch:[0/2](502100/4588595) loss:2.736 lr:0.0000100 epoch_Time:25931.0min: [2024-01-04 22:45:57,687][model8_pretrain.py][INFO] Epoch:[0/2](502100/4588595) loss:3.090 lr:0.0000100 epoch_Time:25931.0min: [2024-01-04 
22:45:57,687][model8_pretrain.py][INFO] Epoch:[0/2](502100/4588595) loss:2.330 lr:0.0000100 epoch_Time:25931.0min: [2024-01-04 22:45:57,687][model8_pretrain.py][INFO] Epoch:[0/2](502100/4588595) loss:2.216 lr:0.0000100 epoch_Time:25931.0min: [2024-01-04 22:45:57,687][model8_pretrain.py][INFO] Epoch:[0/2](502100/4588595) loss:3.130 lr:0.0000100 epoch_Time:25931.0min: [2024-01-04 22:45:57,687][model8_pretrain.py][INFO] Epoch:[0/2](502100/4588595) loss:2.230 lr:0.0000100 epoch_Time:25931.0min: [2024-01-04 22:45:57,687][model8_pretrain.py][INFO] Epoch:[0/2](502100/4588595) loss:2.778 lr:0.0000100 epoch_Time:25931.0min: [2024-01-04 22:46:34,610][model8_pretrain.py][INFO] Epoch:[0/2](502200/4588595) loss:3.066 lr:0.0000100 epoch_Time:25931.0min: [2024-01-04 22:46:34,610][model8_pretrain.py][INFO] Epoch:[0/2](502200/4588595) loss:2.615 lr:0.0000100 epoch_Time:25931.0min: [2024-01-04 22:46:34,610][model8_pretrain.py][INFO] Epoch:[0/2](502200/4588595) loss:3.021 lr:0.0000100 epoch_Time:25931.0min: [2024-01-04 22:46:34,610][model8_pretrain.py][INFO] Epoch:[0/2](502200/4588595) loss:3.003 lr:0.0000100 epoch_Time:25931.0min: [2024-01-04 22:46:34,610][model8_pretrain.py][INFO] Epoch:[0/2](502200/4588595) loss:3.228 lr:0.0000100 epoch_Time:25931.0min: [2024-01-04 22:46:34,610][model8_pretrain.py][INFO] Epoch:[0/2](502200/4588595) loss:2.442 lr:0.0000100 epoch_Time:25931.0min: [2024-01-04 22:46:34,611][model8_pretrain.py][INFO] Epoch:[0/2](502200/4588595) loss:2.768 lr:0.0000100 epoch_Time:25931.0min: [2024-01-04 22:46:34,611][model8_pretrain.py][INFO] Epoch:[0/2](502200/4588595) loss:2.948 lr:0.0000100 epoch_Time:25931.0min: [2024-01-04 22:47:11,544][model8_pretrain.py][INFO] Epoch:[0/2](502300/4588595) loss:2.478 lr:0.0000100 epoch_Time:25930.0min: [2024-01-04 22:47:11,544][model8_pretrain.py][INFO] Epoch:[0/2](502300/4588595) loss:3.184 lr:0.0000100 epoch_Time:25930.0min: [2024-01-04 22:47:11,544][model8_pretrain.py][INFO] Epoch:[0/2](502300/4588595) loss:3.073 lr:0.0000100 
epoch_Time:25930.0min: [2024-01-04 22:47:11,545][model8_pretrain.py][INFO] Epoch:[0/2](502300/4588595) loss:2.838 lr:0.0000100 epoch_Time:25930.0min: [2024-01-04 22:47:11,544][model8_pretrain.py][INFO] Epoch:[0/2](502300/4588595) loss:3.228 lr:0.0000100 epoch_Time:25930.0min: [2024-01-04 22:47:11,545][model8_pretrain.py][INFO] Epoch:[0/2](502300/4588595) loss:3.026 lr:0.0000100 epoch_Time:25930.0min: [2024-01-04 22:47:11,545][model8_pretrain.py][INFO] Epoch:[0/2](502300/4588595) loss:2.976 lr:0.0000100 epoch_Time:25930.0min: [2024-01-04 22:47:11,545][model8_pretrain.py][INFO] Epoch:[0/2](502300/4588595) loss:2.709 lr:0.0000100 epoch_Time:25930.0min: [2024-01-04 22:47:48,483][model8_pretrain.py][INFO] Epoch:[0/2](502400/4588595) loss:2.772 lr:0.0000100 epoch_Time:25929.0min: [2024-01-04 22:47:48,483][model8_pretrain.py][INFO] Epoch:[0/2](502400/4588595) loss:3.504 lr:0.0000100 epoch_Time:25929.0min: [2024-01-04 22:47:48,483][model8_pretrain.py][INFO] Epoch:[0/2](502400/4588595) loss:3.027 lr:0.0000100 epoch_Time:25929.0min: [2024-01-04 22:47:48,483][model8_pretrain.py][INFO] Epoch:[0/2](502400/4588595) loss:2.695 lr:0.0000100 epoch_Time:25929.0min: [2024-01-04 22:47:48,483][model8_pretrain.py][INFO] Epoch:[0/2](502400/4588595) loss:2.557 lr:0.0000100 epoch_Time:25929.0min: [2024-01-04 22:47:48,484][model8_pretrain.py][INFO] Epoch:[0/2](502400/4588595) loss:2.768 lr:0.0000100 epoch_Time:25929.0min: [2024-01-04 22:47:48,484][model8_pretrain.py][INFO] Epoch:[0/2](502400/4588595) loss:3.095 lr:0.0000100 epoch_Time:25929.0min: [2024-01-04 22:47:48,484][model8_pretrain.py][INFO] Epoch:[0/2](502400/4588595) loss:2.287 lr:0.0000100 epoch_Time:25929.0min: [2024-01-04 22:48:25,424][model8_pretrain.py][INFO] Epoch:[0/2](502500/4588595) loss:2.455 lr:0.0000100 epoch_Time:25929.0min: [2024-01-04 22:48:25,424][model8_pretrain.py][INFO] Epoch:[0/2](502500/4588595) loss:2.900 lr:0.0000100 epoch_Time:25929.0min: [2024-01-04 22:48:25,424][model8_pretrain.py][INFO] 
Epoch:[0/2](502500/4588595) loss:3.152 lr:0.0000100 epoch_Time:25929.0min: [2024-01-04 22:48:25,424][model8_pretrain.py][INFO] Epoch:[0/2](502500/4588595) loss:2.741 lr:0.0000100 epoch_Time:25929.0min: [2024-01-04 22:48:25,424][model8_pretrain.py][INFO] Epoch:[0/2](502500/4588595) loss:2.636 lr:0.0000100 epoch_Time:25929.0min: [2024-01-04 22:48:25,424][model8_pretrain.py][INFO] Epoch:[0/2](502500/4588595) loss:3.058 lr:0.0000100 epoch_Time:25929.0min: [2024-01-04 22:48:25,424][model8_pretrain.py][INFO] Epoch:[0/2](502500/4588595) loss:2.640 lr:0.0000100 epoch_Time:25929.0min: [2024-01-04 22:48:25,424][model8_pretrain.py][INFO] Epoch:[0/2](502500/4588595) loss:3.138 lr:0.0000100 epoch_Time:25929.0min: [2024-01-04 22:49:02,358][model8_pretrain.py][INFO] Epoch:[0/2](502600/4588595) loss:3.349 lr:0.0000100 epoch_Time:25927.0min: [2024-01-04 22:49:02,358][model8_pretrain.py][INFO] Epoch:[0/2](502600/4588595) loss:2.233 lr:0.0000100 epoch_Time:25927.0min: [2024-01-04 22:49:02,358][model8_pretrain.py][INFO] Epoch:[0/2](502600/4588595) loss:3.395 lr:0.0000100 epoch_Time:25927.0min: [2024-01-04 22:49:02,358][model8_pretrain.py][INFO] Epoch:[0/2](502600/4588595) loss:2.493 lr:0.0000100 epoch_Time:25927.0min: [2024-01-04 22:49:02,358][model8_pretrain.py][INFO] Epoch:[0/2](502600/4588595) loss:3.059 lr:0.0000100 epoch_Time:25927.0min: [2024-01-04 22:49:02,358][model8_pretrain.py][INFO] Epoch:[0/2](502600/4588595) loss:2.701 lr:0.0000100 epoch_Time:25927.0min: [2024-01-04 22:49:02,359][model8_pretrain.py][INFO] Epoch:[0/2](502600/4588595) loss:2.661 lr:0.0000100 epoch_Time:25927.0min: [2024-01-04 22:49:02,359][model8_pretrain.py][INFO] Epoch:[0/2](502600/4588595) loss:2.134 lr:0.0000100 epoch_Time:25927.0min: [2024-01-04 22:49:39,277][model8_pretrain.py][INFO] Epoch:[0/2](502700/4588595) loss:2.778 lr:0.0000100 epoch_Time:25927.0min: [2024-01-04 22:49:39,277][model8_pretrain.py][INFO] Epoch:[0/2](502700/4588595) loss:2.814 lr:0.0000100 epoch_Time:25927.0min: [2024-01-04 
22:49:39,277][model8_pretrain.py][INFO] Epoch:[0/2](502700/4588595) loss:2.886 lr:0.0000100 epoch_Time:25927.0min: [2024-01-04 22:49:39,277][model8_pretrain.py][INFO] Epoch:[0/2](502700/4588595) loss:3.067 lr:0.0000100 epoch_Time:25927.0min: [2024-01-04 22:49:39,277][model8_pretrain.py][INFO] Epoch:[0/2](502700/4588595) loss:2.559 lr:0.0000100 epoch_Time:25927.0min: [2024-01-04 22:49:39,277][model8_pretrain.py][INFO] Epoch:[0/2](502700/4588595) loss:3.157 lr:0.0000100 epoch_Time:25927.0min: [2024-01-04 22:49:39,277][model8_pretrain.py][INFO] Epoch:[0/2](502700/4588595) loss:2.511 lr:0.0000100 epoch_Time:25927.0min: [2024-01-04 22:49:39,277][model8_pretrain.py][INFO] Epoch:[0/2](502700/4588595) loss:2.971 lr:0.0000100 epoch_Time:25927.0min: [2024-01-04 22:50:16,200][model8_pretrain.py][INFO] Epoch:[0/2](502800/4588595) loss:2.712 lr:0.0000100 epoch_Time:25926.0min: [2024-01-04 22:50:16,200][model8_pretrain.py][INFO] Epoch:[0/2](502800/4588595) loss:3.012 lr:0.0000100 epoch_Time:25926.0min: [2024-01-04 22:50:16,200][model8_pretrain.py][INFO] Epoch:[0/2](502800/4588595) loss:2.807 lr:0.0000100 epoch_Time:25926.0min: [2024-01-04 22:50:16,200][model8_pretrain.py][INFO] Epoch:[0/2](502800/4588595) loss:2.913 lr:0.0000100 epoch_Time:25926.0min: [2024-01-04 22:50:16,200][model8_pretrain.py][INFO] Epoch:[0/2](502800/4588595) loss:2.733 lr:0.0000100 epoch_Time:25926.0min: [2024-01-04 22:50:16,200][model8_pretrain.py][INFO] Epoch:[0/2](502800/4588595) loss:3.201 lr:0.0000100 epoch_Time:25926.0min: [2024-01-04 22:50:16,200][model8_pretrain.py][INFO] Epoch:[0/2](502800/4588595) loss:2.571 lr:0.0000100 epoch_Time:25926.0min: [2024-01-04 22:50:16,200][model8_pretrain.py][INFO] Epoch:[0/2](502800/4588595) loss:2.879 lr:0.0000100 epoch_Time:25926.0min: [2024-01-04 22:51:03,582][model8_pretrain.py][INFO] Epoch:[0/2](502900/4588595) loss:3.053 lr:0.0000100 epoch_Time:25926.0min: [2024-01-04 22:51:03,582][model8_pretrain.py][INFO] Epoch:[0/2](502900/4588595) loss:2.690 lr:0.0000100 
epoch_Time:25926.0min: [2024-01-04 22:51:03,582][model8_pretrain.py][INFO] Epoch:[0/2](502900/4588595) loss:3.203 lr:0.0000100 epoch_Time:25926.0min: [2024-01-04 22:51:03,582][model8_pretrain.py][INFO] Epoch:[0/2](502900/4588595) loss:2.090 lr:0.0000100 epoch_Time:25926.0min: [2024-01-04 22:51:03,582][model8_pretrain.py][INFO] Epoch:[0/2](502900/4588595) loss:3.022 lr:0.0000100 epoch_Time:25926.0min: [2024-01-04 22:51:03,582][model8_pretrain.py][INFO] Epoch:[0/2](502900/4588595) loss:2.869 lr:0.0000100 epoch_Time:25926.0min: [2024-01-04 22:51:03,582][model8_pretrain.py][INFO] Epoch:[0/2](502900/4588595) loss:3.273 lr:0.0000100 epoch_Time:25926.0min: [2024-01-04 22:51:03,582][model8_pretrain.py][INFO] Epoch:[0/2](502900/4588595) loss:2.405 lr:0.0000100 epoch_Time:25926.0min: [2024-01-04 22:51:40,504][model8_pretrain.py][INFO] Epoch:[0/2](503000/4588595) loss:2.817 lr:0.0000100 epoch_Time:25926.0min: [2024-01-04 22:51:40,504][model8_pretrain.py][INFO] Epoch:[0/2](503000/4588595) loss:2.939 lr:0.0000100 epoch_Time:25926.0min: [2024-01-04 22:51:40,505][model8_pretrain.py][INFO] Epoch:[0/2](503000/4588595) loss:2.525 lr:0.0000100 epoch_Time:25926.0min: [2024-01-04 22:51:40,505][model8_pretrain.py][INFO] Epoch:[0/2](503000/4588595) loss:3.267 lr:0.0000100 epoch_Time:25926.0min: [2024-01-04 22:51:40,505][model8_pretrain.py][INFO] Epoch:[0/2](503000/4588595) loss:3.051 lr:0.0000100 epoch_Time:25926.0min: [2024-01-04 22:51:40,505][model8_pretrain.py][INFO] Epoch:[0/2](503000/4588595) loss:2.780 lr:0.0000100 epoch_Time:25926.0min: [2024-01-04 22:51:40,505][model8_pretrain.py][INFO] Epoch:[0/2](503000/4588595) loss:2.783 lr:0.0000100 epoch_Time:25926.0min: [2024-01-04 22:51:40,505][model8_pretrain.py][INFO] Epoch:[0/2](503000/4588595) loss:3.069 lr:0.0000100 epoch_Time:25926.0min: [2024-01-04 22:52:17,441][model8_pretrain.py][INFO] Epoch:[0/2](503100/4588595) loss:3.046 lr:0.0000100 epoch_Time:25925.0min: [2024-01-04 22:52:17,441][model8_pretrain.py][INFO] 
Epoch:[0/2](503100/4588595) loss:3.188 lr:0.0000100 epoch_Time:25925.0min: [2024-01-04 22:52:17,441][model8_pretrain.py][INFO] Epoch:[0/2](503100/4588595) loss:2.259 lr:0.0000100 epoch_Time:25925.0min: [2024-01-04 22:52:17,441][model8_pretrain.py][INFO] Epoch:[0/2](503100/4588595) loss:2.844 lr:0.0000100 epoch_Time:25925.0min: [2024-01-04 22:52:17,441][model8_pretrain.py][INFO] Epoch:[0/2](503100/4588595) loss:2.985 lr:0.0000100 epoch_Time:25925.0min: [2024-01-04 22:52:17,441][model8_pretrain.py][INFO] Epoch:[0/2](503100/4588595) loss:2.754 lr:0.0000100 epoch_Time:25925.0min: [2024-01-04 22:52:17,441][model8_pretrain.py][INFO] Epoch:[0/2](503100/4588595) loss:3.417 lr:0.0000100 epoch_Time:25925.0min: [2024-01-04 22:52:17,441][model8_pretrain.py][INFO] Epoch:[0/2](503100/4588595) loss:3.350 lr:0.0000100 epoch_Time:25925.0min: [2024-01-04 22:52:54,376][model8_pretrain.py][INFO] Epoch:[0/2](503200/4588595) loss:2.676 lr:0.0000100 epoch_Time:25924.0min: [2024-01-04 22:52:54,376][model8_pretrain.py][INFO] Epoch:[0/2](503200/4588595) loss:2.987 lr:0.0000100 epoch_Time:25924.0min: [2024-01-04 22:52:54,376][model8_pretrain.py][INFO] Epoch:[0/2](503200/4588595) loss:3.165 lr:0.0000100 epoch_Time:25924.0min: [2024-01-04 22:52:54,376][model8_pretrain.py][INFO] Epoch:[0/2](503200/4588595) loss:2.354 lr:0.0000100 epoch_Time:25924.0min: [2024-01-04 22:52:54,376][model8_pretrain.py][INFO] Epoch:[0/2](503200/4588595) loss:2.888 lr:0.0000100 epoch_Time:25924.0min: [2024-01-04 22:52:54,377][model8_pretrain.py][INFO] Epoch:[0/2](503200/4588595) loss:2.557 lr:0.0000100 epoch_Time:25924.0min: [2024-01-04 22:52:54,377][model8_pretrain.py][INFO] Epoch:[0/2](503200/4588595) loss:3.020 lr:0.0000100 epoch_Time:25924.0min: [2024-01-04 22:52:54,377][model8_pretrain.py][INFO] Epoch:[0/2](503200/4588595) loss:3.049 lr:0.0000100 epoch_Time:25924.0min: [2024-01-04 22:53:31,316][model8_pretrain.py][INFO] Epoch:[0/2](503300/4588595) loss:2.600 lr:0.0000100 epoch_Time:25924.0min: [2024-01-04 
22:53:31,316][model8_pretrain.py][INFO] Epoch:[0/2](503300/4588595) loss:3.103 lr:0.0000100 epoch_Time:25924.0min: [2024-01-04 22:53:31,316][model8_pretrain.py][INFO] Epoch:[0/2](503300/4588595) loss:2.927 lr:0.0000100 epoch_Time:25924.0min: [2024-01-04 22:53:31,316][model8_pretrain.py][INFO] Epoch:[0/2](503300/4588595) loss:2.570 lr:0.0000100 epoch_Time:25924.0min: [2024-01-04 22:53:31,316][model8_pretrain.py][INFO] Epoch:[0/2](503300/4588595) loss:2.737 lr:0.0000100 epoch_Time:25924.0min: [2024-01-04 22:53:31,316][model8_pretrain.py][INFO] Epoch:[0/2](503300/4588595) loss:2.165 lr:0.0000100 epoch_Time:25924.0min: [2024-01-04 22:53:31,316][model8_pretrain.py][INFO] Epoch:[0/2](503300/4588595) loss:3.107 lr:0.0000100 epoch_Time:25924.0min: [2024-01-04 22:53:31,316][model8_pretrain.py][INFO] Epoch:[0/2](503300/4588595) loss:2.901 lr:0.0000100 epoch_Time:25924.0min: [2024-01-04 22:54:08,246][model8_pretrain.py][INFO] Epoch:[0/2](503400/4588595) loss:3.272 lr:0.0000100 epoch_Time:25923.0min: [2024-01-04 22:54:08,246][model8_pretrain.py][INFO] Epoch:[0/2](503400/4588595) loss:3.098 lr:0.0000100 epoch_Time:25923.0min: [2024-01-04 22:54:08,246][model8_pretrain.py][INFO] Epoch:[0/2](503400/4588595) loss:2.600 lr:0.0000100 epoch_Time:25923.0min: [2024-01-04 22:54:08,246][model8_pretrain.py][INFO] Epoch:[0/2](503400/4588595) loss:2.807 lr:0.0000100 epoch_Time:25923.0min: [2024-01-04 22:54:08,246][model8_pretrain.py][INFO] Epoch:[0/2](503400/4588595) loss:3.490 lr:0.0000100 epoch_Time:25923.0min: [2024-01-04 22:54:08,246][model8_pretrain.py][INFO] Epoch:[0/2](503400/4588595) loss:3.157 lr:0.0000100 epoch_Time:25923.0min: [2024-01-04 22:54:08,246][model8_pretrain.py][INFO] Epoch:[0/2](503400/4588595) loss:2.494 lr:0.0000100 epoch_Time:25923.0min: [2024-01-04 22:54:08,246][model8_pretrain.py][INFO] Epoch:[0/2](503400/4588595) loss:3.169 lr:0.0000100 epoch_Time:25923.0min: [2024-01-04 22:54:45,185][model8_pretrain.py][INFO] Epoch:[0/2](503500/4588595) loss:3.261 lr:0.0000100 
epoch_Time:25922.0min: [2024-01-04 22:54:45,185][model8_pretrain.py][INFO] Epoch:[0/2](503500/4588595) loss:2.607 lr:0.0000100 epoch_Time:25922.0min: [2024-01-04 22:54:45,185][model8_pretrain.py][INFO] Epoch:[0/2](503500/4588595) loss:2.824 lr:0.0000100 epoch_Time:25922.0min: [2024-01-04 22:54:45,185][model8_pretrain.py][INFO] Epoch:[0/2](503500/4588595) loss:2.624 lr:0.0000100 epoch_Time:25922.0min: [2024-01-04 22:54:45,185][model8_pretrain.py][INFO] Epoch:[0/2](503500/4588595) loss:2.923 lr:0.0000100 epoch_Time:25922.0min: [2024-01-04 22:54:45,185][model8_pretrain.py][INFO] Epoch:[0/2](503500/4588595) loss:3.179 lr:0.0000100 epoch_Time:25922.0min: [2024-01-04 22:54:45,185][model8_pretrain.py][INFO] Epoch:[0/2](503500/4588595) loss:2.256 lr:0.0000100 epoch_Time:25922.0min: [2024-01-04 22:54:45,186][model8_pretrain.py][INFO] Epoch:[0/2](503500/4588595) loss:2.739 lr:0.0000100 epoch_Time:25922.0min: [2024-01-04 22:55:22,115][model8_pretrain.py][INFO] Epoch:[0/2](503600/4588595) loss:2.834 lr:0.0000100 epoch_Time:25921.0min: [2024-01-04 22:55:22,115][model8_pretrain.py][INFO] Epoch:[0/2](503600/4588595) loss:2.921 lr:0.0000100 epoch_Time:25921.0min: [2024-01-04 22:55:22,115][model8_pretrain.py][INFO] Epoch:[0/2](503600/4588595) loss:3.169 lr:0.0000100 epoch_Time:25921.0min: [2024-01-04 22:55:22,115][model8_pretrain.py][INFO] Epoch:[0/2](503600/4588595) loss:3.098 lr:0.0000100 epoch_Time:25921.0min: [2024-01-04 22:55:22,115][model8_pretrain.py][INFO] Epoch:[0/2](503600/4588595) loss:2.597 lr:0.0000100 epoch_Time:25921.0min: [2024-01-04 22:55:22,115][model8_pretrain.py][INFO] Epoch:[0/2](503600/4588595) loss:2.543 lr:0.0000100 epoch_Time:25921.0min: [2024-01-04 22:55:22,115][model8_pretrain.py][INFO] Epoch:[0/2](503600/4588595) loss:2.408 lr:0.0000100 epoch_Time:25921.0min: [2024-01-04 22:55:22,115][model8_pretrain.py][INFO] Epoch:[0/2](503600/4588595) loss:2.808 lr:0.0000100 epoch_Time:25921.0min: [2024-01-04 22:56:09,611][model8_pretrain.py][INFO] 
Epoch:[0/2](503700/4588595) loss:2.190 lr:0.0000100 epoch_Time:25922.0min: [2024-01-04 22:56:09,611][model8_pretrain.py][INFO] Epoch:[0/2](503700/4588595) loss:3.027 lr:0.0000100 epoch_Time:25922.0min: [2024-01-04 22:56:09,611][model8_pretrain.py][INFO] Epoch:[0/2](503700/4588595) loss:2.648 lr:0.0000100 epoch_Time:25922.0min: [2024-01-04 22:56:09,611][model8_pretrain.py][INFO] Epoch:[0/2](503700/4588595) loss:2.825 lr:0.0000100 epoch_Time:25922.0min: [2024-01-04 22:56:09,612][model8_pretrain.py][INFO] Epoch:[0/2](503700/4588595) loss:2.708 lr:0.0000100 epoch_Time:25922.0min: [2024-01-04 22:56:09,612][model8_pretrain.py][INFO] Epoch:[0/2](503700/4588595) loss:3.113 lr:0.0000100 epoch_Time:25922.0min: [2024-01-04 22:56:09,612][model8_pretrain.py][INFO] Epoch:[0/2](503700/4588595) loss:3.178 lr:0.0000100 epoch_Time:25922.0min: [2024-01-04 22:56:09,612][model8_pretrain.py][INFO] Epoch:[0/2](503700/4588595) loss:2.883 lr:0.0000100 epoch_Time:25922.0min: [2024-01-04 22:56:46,553][model8_pretrain.py][INFO] Epoch:[0/2](503800/4588595) loss:2.389 lr:0.0000100 epoch_Time:25921.0min: [2024-01-04 22:56:46,553][model8_pretrain.py][INFO] Epoch:[0/2](503800/4588595) loss:2.987 lr:0.0000100 epoch_Time:25921.0min: [2024-01-04 22:56:46,553][model8_pretrain.py][INFO] Epoch:[0/2](503800/4588595) loss:2.417 lr:0.0000100 epoch_Time:25921.0min: [2024-01-04 22:56:46,553][model8_pretrain.py][INFO] Epoch:[0/2](503800/4588595) loss:2.437 lr:0.0000100 epoch_Time:25921.0min: [2024-01-04 22:56:46,553][model8_pretrain.py][INFO] Epoch:[0/2](503800/4588595) loss:3.199 lr:0.0000100 epoch_Time:25921.0min: [2024-01-04 22:56:46,553][model8_pretrain.py][INFO] Epoch:[0/2](503800/4588595) loss:2.655 lr:0.0000100 epoch_Time:25921.0min: [2024-01-04 22:56:46,553][model8_pretrain.py][INFO] Epoch:[0/2](503800/4588595) loss:3.136 lr:0.0000100 epoch_Time:25921.0min: [2024-01-04 22:56:46,554][model8_pretrain.py][INFO] Epoch:[0/2](503800/4588595) loss:2.777 lr:0.0000100 epoch_Time:25921.0min: [2024-01-04 
22:57:23,512][model8_pretrain.py][INFO] Epoch:[0/2](503900/4588595) loss:2.847 lr:0.0000100 epoch_Time:25920.0min: [2024-01-04 22:57:23,512][model8_pretrain.py][INFO] Epoch:[0/2](503900/4588595) loss:2.685 lr:0.0000100 epoch_Time:25920.0min: [2024-01-04 22:57:23,512][model8_pretrain.py][INFO] Epoch:[0/2](503900/4588595) loss:3.112 lr:0.0000100 epoch_Time:25920.0min: [2024-01-04 22:57:23,512][model8_pretrain.py][INFO] Epoch:[0/2](503900/4588595) loss:2.784 lr:0.0000100 epoch_Time:25920.0min: [2024-01-04 22:57:23,512][model8_pretrain.py][INFO] Epoch:[0/2](503900/4588595) loss:2.912 lr:0.0000100 epoch_Time:25920.0min: [2024-01-04 22:57:23,512][model8_pretrain.py][INFO] Epoch:[0/2](503900/4588595) loss:2.645 lr:0.0000100 epoch_Time:25920.0min: [2024-01-04 22:57:23,512][model8_pretrain.py][INFO] Epoch:[0/2](503900/4588595) loss:2.769 lr:0.0000100 epoch_Time:25920.0min: [2024-01-04 22:57:23,512][model8_pretrain.py][INFO] Epoch:[0/2](503900/4588595) loss:2.948 lr:0.0000100 epoch_Time:25920.0min: [2024-01-04 22:58:00,479][model8_pretrain.py][INFO] Epoch:[0/2](504000/4588595) loss:2.367 lr:0.0000100 epoch_Time:25919.0min: [2024-01-04 22:58:00,479][model8_pretrain.py][INFO] Epoch:[0/2](504000/4588595) loss:2.817 lr:0.0000100 epoch_Time:25919.0min: [2024-01-04 22:58:00,479][model8_pretrain.py][INFO] Epoch:[0/2](504000/4588595) loss:3.103 lr:0.0000100 epoch_Time:25919.0min: [2024-01-04 22:58:00,479][model8_pretrain.py][INFO] Epoch:[0/2](504000/4588595) loss:2.629 lr:0.0000100 epoch_Time:25919.0min: [2024-01-04 22:58:00,479][model8_pretrain.py][INFO] Epoch:[0/2](504000/4588595) loss:3.232 lr:0.0000100 epoch_Time:25919.0min: [2024-01-04 22:58:00,479][model8_pretrain.py][INFO] Epoch:[0/2](504000/4588595) loss:3.030 lr:0.0000100 epoch_Time:25919.0min: [2024-01-04 22:58:00,479][model8_pretrain.py][INFO] Epoch:[0/2](504000/4588595) loss:3.046 lr:0.0000100 epoch_Time:25919.0min: [2024-01-04 22:58:00,480][model8_pretrain.py][INFO] Epoch:[0/2](504000/4588595) loss:2.709 lr:0.0000100 
epoch_Time:25919.0min: [2024-01-04 22:58:37,430][model8_pretrain.py][INFO] Epoch:[0/2](504100/4588595) loss:2.857 lr:0.0000100 epoch_Time:25919.0min: [2024-01-04 22:58:37,430][model8_pretrain.py][INFO] Epoch:[0/2](504100/4588595) loss:2.712 lr:0.0000100 epoch_Time:25919.0min: [2024-01-04 22:58:37,430][model8_pretrain.py][INFO] Epoch:[0/2](504100/4588595) loss:3.032 lr:0.0000100 epoch_Time:25919.0min: [2024-01-04 22:58:37,430][model8_pretrain.py][INFO] Epoch:[0/2](504100/4588595) loss:2.887 lr:0.0000100 epoch_Time:25919.0min: [2024-01-04 22:58:37,430][model8_pretrain.py][INFO] Epoch:[0/2](504100/4588595) loss:2.528 lr:0.0000100 epoch_Time:25919.0min: [2024-01-04 22:58:37,430][model8_pretrain.py][INFO] Epoch:[0/2](504100/4588595) loss:2.603 lr:0.0000100 epoch_Time:25919.0min: [2024-01-04 22:58:37,430][model8_pretrain.py][INFO] Epoch:[0/2](504100/4588595) loss:2.847 lr:0.0000100 epoch_Time:25919.0min: [2024-01-04 22:58:37,430][model8_pretrain.py][INFO] Epoch:[0/2](504100/4588595) loss:3.036 lr:0.0000100 epoch_Time:25919.0min: [2024-01-04 22:59:14,373][model8_pretrain.py][INFO] Epoch:[0/2](504200/4588595) loss:2.484 lr:0.0000100 epoch_Time:25918.0min: [2024-01-04 22:59:14,373][model8_pretrain.py][INFO] Epoch:[0/2](504200/4588595) loss:3.191 lr:0.0000100 epoch_Time:25918.0min: [2024-01-04 22:59:14,373][model8_pretrain.py][INFO] Epoch:[0/2](504200/4588595) loss:3.233 lr:0.0000100 epoch_Time:25918.0min: [2024-01-04 22:59:14,373][model8_pretrain.py][INFO] Epoch:[0/2](504200/4588595) loss:3.022 lr:0.0000100 epoch_Time:25918.0min: [2024-01-04 22:59:14,373][model8_pretrain.py][INFO] Epoch:[0/2](504200/4588595) loss:3.286 lr:0.0000100 epoch_Time:25918.0min: [2024-01-04 22:59:14,373][model8_pretrain.py][INFO] Epoch:[0/2](504200/4588595) loss:2.402 lr:0.0000100 epoch_Time:25918.0min: [2024-01-04 22:59:14,373][model8_pretrain.py][INFO] Epoch:[0/2](504200/4588595) loss:2.707 lr:0.0000100 epoch_Time:25918.0min: [2024-01-04 22:59:14,373][model8_pretrain.py][INFO] 
Epoch:[0/2](504200/4588595) loss:2.753 lr:0.0000100 epoch_Time:25918.0min: [2024-01-04 22:59:51,316][model8_pretrain.py][INFO] Epoch:[0/2](504300/4588595) loss:2.418 lr:0.0000100 epoch_Time:25917.0min: [2024-01-04 22:59:51,316][model8_pretrain.py][INFO] Epoch:[0/2](504300/4588595) loss:3.032 lr:0.0000100 epoch_Time:25917.0min: [2024-01-04 22:59:51,316][model8_pretrain.py][INFO] Epoch:[0/2](504300/4588595) loss:2.872 lr:0.0000100 epoch_Time:25917.0min: [2024-01-04 22:59:51,316][model8_pretrain.py][INFO] Epoch:[0/2](504300/4588595) loss:2.453 lr:0.0000100 epoch_Time:25917.0min: [2024-01-04 22:59:51,316][model8_pretrain.py][INFO] Epoch:[0/2](504300/4588595) loss:2.632 lr:0.0000100 epoch_Time:25917.0min: [2024-01-04 22:59:51,316][model8_pretrain.py][INFO] Epoch:[0/2](504300/4588595) loss:2.661 lr:0.0000100 epoch_Time:25917.0min: [2024-01-04 22:59:51,316][model8_pretrain.py][INFO] Epoch:[0/2](504300/4588595) loss:2.551 lr:0.0000100 epoch_Time:25917.0min: [2024-01-04 22:59:51,316][model8_pretrain.py][INFO] Epoch:[0/2](504300/4588595) loss:2.253 lr:0.0000100 epoch_Time:25917.0min: [2024-01-04 23:00:28,256][model8_pretrain.py][INFO] Epoch:[0/2](504400/4588595) loss:3.346 lr:0.0000100 epoch_Time:25916.0min: [2024-01-04 23:00:28,256][model8_pretrain.py][INFO] Epoch:[0/2](504400/4588595) loss:2.927 lr:0.0000100 epoch_Time:25916.0min: [2024-01-04 23:00:28,256][model8_pretrain.py][INFO] Epoch:[0/2](504400/4588595) loss:2.659 lr:0.0000100 epoch_Time:25916.0min: [2024-01-04 23:00:28,256][model8_pretrain.py][INFO] Epoch:[0/2](504400/4588595) loss:3.280 lr:0.0000100 epoch_Time:25916.0min: [2024-01-04 23:00:28,256][model8_pretrain.py][INFO] Epoch:[0/2](504400/4588595) loss:2.115 lr:0.0000100 epoch_Time:25916.0min: [2024-01-04 23:00:28,256][model8_pretrain.py][INFO] Epoch:[0/2](504400/4588595) loss:3.229 lr:0.0000100 epoch_Time:25916.0min: [2024-01-04 23:00:28,256][model8_pretrain.py][INFO] Epoch:[0/2](504400/4588595) loss:3.114 lr:0.0000100 epoch_Time:25916.0min: [2024-01-04 
23:00:28,256][model8_pretrain.py][INFO] Epoch:[0/2](504400/4588595) loss:3.155 lr:0.0000100 epoch_Time:25916.0min: [2024-01-04 23:01:15,645][model8_pretrain.py][INFO] Epoch:[0/2](504500/4588595) loss:2.860 lr:0.0000100 epoch_Time:25917.0min: [2024-01-04 23:01:15,645][model8_pretrain.py][INFO] Epoch:[0/2](504500/4588595) loss:3.389 lr:0.0000100 epoch_Time:25917.0min: [2024-01-04 23:01:15,645][model8_pretrain.py][INFO] Epoch:[0/2](504500/4588595) loss:2.316 lr:0.0000100 epoch_Time:25917.0min: [2024-01-04 23:01:15,645][model8_pretrain.py][INFO] Epoch:[0/2](504500/4588595) loss:3.088 lr:0.0000100 epoch_Time:25917.0min: [2024-01-04 23:01:15,645][model8_pretrain.py][INFO] Epoch:[0/2](504500/4588595) loss:2.809 lr:0.0000100 epoch_Time:25917.0min: [2024-01-04 23:01:15,645][model8_pretrain.py][INFO] Epoch:[0/2](504500/4588595) loss:3.259 lr:0.0000100 epoch_Time:25917.0min: [2024-01-04 23:01:15,645][model8_pretrain.py][INFO] Epoch:[0/2](504500/4588595) loss:2.842 lr:0.0000100 epoch_Time:25917.0min: [2024-01-04 23:01:15,645][model8_pretrain.py][INFO] Epoch:[0/2](504500/4588595) loss:3.005 lr:0.0000100 epoch_Time:25917.0min: [2024-01-04 23:01:52,546][model8_pretrain.py][INFO] Epoch:[0/2](504600/4588595) loss:2.684 lr:0.0000100 epoch_Time:25916.0min: [2024-01-04 23:01:52,546][model8_pretrain.py][INFO] Epoch:[0/2](504600/4588595) loss:3.082 lr:0.0000100 epoch_Time:25916.0min: [2024-01-04 23:01:52,546][model8_pretrain.py][INFO] Epoch:[0/2](504600/4588595) loss:2.605 lr:0.0000100 epoch_Time:25916.0min: [2024-01-04 23:01:52,546][model8_pretrain.py][INFO] Epoch:[0/2](504600/4588595) loss:3.013 lr:0.0000100 epoch_Time:25916.0min: [2024-01-04 23:01:52,546][model8_pretrain.py][INFO] Epoch:[0/2](504600/4588595) loss:2.920 lr:0.0000100 epoch_Time:25916.0min: [2024-01-04 23:01:52,546][model8_pretrain.py][INFO] Epoch:[0/2](504600/4588595) loss:2.822 lr:0.0000100 epoch_Time:25916.0min: [2024-01-04 23:01:52,546][model8_pretrain.py][INFO] Epoch:[0/2](504600/4588595) loss:2.692 lr:0.0000100 
epoch_Time:25916.0min: [2024-01-04 23:01:52,546][model8_pretrain.py][INFO] Epoch:[0/2](504600/4588595) loss:3.206 lr:0.0000100 epoch_Time:25916.0min: [2024-01-04 23:02:29,482][model8_pretrain.py][INFO] Epoch:[0/2](504700/4588595) loss:2.777 lr:0.0000100 epoch_Time:25916.0min: [2024-01-04 23:02:29,482][model8_pretrain.py][INFO] Epoch:[0/2](504700/4588595) loss:2.706 lr:0.0000100 epoch_Time:25915.0min: [2024-01-04 23:02:29,482][model8_pretrain.py][INFO] Epoch:[0/2](504700/4588595) loss:2.519 lr:0.0000100 epoch_Time:25916.0min: [2024-01-04 23:02:29,482][model8_pretrain.py][INFO] Epoch:[0/2](504700/4588595) loss:3.052 lr:0.0000100 epoch_Time:25916.0min: [2024-01-04 23:02:29,482][model8_pretrain.py][INFO] Epoch:[0/2](504700/4588595) loss:2.322 lr:0.0000100 epoch_Time:25916.0min: [2024-01-04 23:02:29,482][model8_pretrain.py][INFO] Epoch:[0/2](504700/4588595) loss:3.003 lr:0.0000100 epoch_Time:25915.0min: [2024-01-04 23:02:29,482][model8_pretrain.py][INFO] Epoch:[0/2](504700/4588595) loss:2.695 lr:0.0000100 epoch_Time:25916.0min: [2024-01-04 23:02:29,482][model8_pretrain.py][INFO] Epoch:[0/2](504700/4588595) loss:2.780 lr:0.0000100 epoch_Time:25916.0min: [2024-01-04 23:03:06,415][model8_pretrain.py][INFO] Epoch:[0/2](504800/4588595) loss:3.530 lr:0.0000100 epoch_Time:25914.0min: [2024-01-04 23:03:06,415][model8_pretrain.py][INFO] Epoch:[0/2](504800/4588595) loss:2.395 lr:0.0000100 epoch_Time:25914.0min: [2024-01-04 23:03:06,415][model8_pretrain.py][INFO] Epoch:[0/2](504800/4588595) loss:3.167 lr:0.0000100 epoch_Time:25914.0min: [2024-01-04 23:03:06,415][model8_pretrain.py][INFO] Epoch:[0/2](504800/4588595) loss:2.301 lr:0.0000100 epoch_Time:25914.0min: [2024-01-04 23:03:06,415][model8_pretrain.py][INFO] Epoch:[0/2](504800/4588595) loss:3.383 lr:0.0000100 epoch_Time:25914.0min: [2024-01-04 23:03:06,415][model8_pretrain.py][INFO] Epoch:[0/2](504800/4588595) loss:2.877 lr:0.0000100 epoch_Time:25914.0min: [2024-01-04 23:03:06,415][model8_pretrain.py][INFO] 
Epoch:[0/2](504800/4588595) loss:2.604 lr:0.0000100 epoch_Time:25914.0min: [2024-01-04 23:03:06,415][model8_pretrain.py][INFO] Epoch:[0/2](504800/4588595) loss:2.858 lr:0.0000100 epoch_Time:25914.0min: [2024-01-04 23:03:43,346][model8_pretrain.py][INFO] Epoch:[0/2](504900/4588595) loss:2.931 lr:0.0000100 epoch_Time:25914.0min: [2024-01-04 23:03:43,346][model8_pretrain.py][INFO] Epoch:[0/2](504900/4588595) loss:2.232 lr:0.0000100 epoch_Time:25914.0min: [2024-01-04 23:03:43,346][model8_pretrain.py][INFO] Epoch:[0/2](504900/4588595) loss:2.913 lr:0.0000100 epoch_Time:25914.0min: [2024-01-04 23:03:43,346][model8_pretrain.py][INFO] Epoch:[0/2](504900/4588595) loss:2.933 lr:0.0000100 epoch_Time:25914.0min: [2024-01-04 23:03:43,346][model8_pretrain.py][INFO] Epoch:[0/2](504900/4588595) loss:2.935 lr:0.0000100 epoch_Time:25914.0min: [2024-01-04 23:03:43,346][model8_pretrain.py][INFO] Epoch:[0/2](504900/4588595) loss:3.314 lr:0.0000100 epoch_Time:25914.0min: [2024-01-04 23:03:43,346][model8_pretrain.py][INFO] Epoch:[0/2](504900/4588595) loss:2.837 lr:0.0000100 epoch_Time:25914.0min: [2024-01-04 23:03:43,346][model8_pretrain.py][INFO] Epoch:[0/2](504900/4588595) loss:3.166 lr:0.0000100 epoch_Time:25914.0min: [2024-01-04 23:04:20,276][model8_pretrain.py][INFO] Epoch:[0/2](505000/4588595) loss:2.793 lr:0.0000100 epoch_Time:25913.0min: [2024-01-04 23:04:20,276][model8_pretrain.py][INFO] Epoch:[0/2](505000/4588595) loss:3.165 lr:0.0000100 epoch_Time:25913.0min: [2024-01-04 23:04:20,276][model8_pretrain.py][INFO] Epoch:[0/2](505000/4588595) loss:2.814 lr:0.0000100 epoch_Time:25913.0min: [2024-01-04 23:04:20,276][model8_pretrain.py][INFO] Epoch:[0/2](505000/4588595) loss:2.728 lr:0.0000100 epoch_Time:25913.0min: [2024-01-04 23:04:20,276][model8_pretrain.py][INFO] Epoch:[0/2](505000/4588595) loss:2.838 lr:0.0000100 epoch_Time:25913.0min: [2024-01-04 23:04:20,276][model8_pretrain.py][INFO] Epoch:[0/2](505000/4588595) loss:2.806 lr:0.0000100 epoch_Time:25913.0min: [2024-01-04 
23:04:20,276][model8_pretrain.py][INFO] Epoch:[0/2](505000/4588595) loss:2.976 lr:0.0000100 epoch_Time:25913.0min: [2024-01-04 23:04:20,276][model8_pretrain.py][INFO] Epoch:[0/2](505000/4588595) loss:3.071 lr:0.0000100 epoch_Time:25913.0min: [2024-01-04 23:04:57,198][model8_pretrain.py][INFO] Epoch:[0/2](505100/4588595) loss:3.422 lr:0.0000100 epoch_Time:25912.0min: [2024-01-04 23:04:57,198][model8_pretrain.py][INFO] Epoch:[0/2](505100/4588595) loss:3.390 lr:0.0000100 epoch_Time:25912.0min: [2024-01-04 23:04:57,198][model8_pretrain.py][INFO] Epoch:[0/2](505100/4588595) loss:2.798 lr:0.0000100 epoch_Time:25912.0min: [2024-01-04 23:04:57,198][model8_pretrain.py][INFO] Epoch:[0/2](505100/4588595) loss:2.748 lr:0.0000100 epoch_Time:25912.0min: [2024-01-04 23:04:57,198][model8_pretrain.py][INFO] Epoch:[0/2](505100/4588595) loss:2.186 lr:0.0000100 epoch_Time:25912.0min: [2024-01-04 23:04:57,198][model8_pretrain.py][INFO] Epoch:[0/2](505100/4588595) loss:2.734 lr:0.0000100 epoch_Time:25912.0min: [2024-01-04 23:04:57,198][model8_pretrain.py][INFO] Epoch:[0/2](505100/4588595) loss:2.494 lr:0.0000100 epoch_Time:25912.0min: [2024-01-04 23:04:57,198][model8_pretrain.py][INFO] Epoch:[0/2](505100/4588595) loss:2.814 lr:0.0000100 epoch_Time:25912.0min: [2024-01-04 23:05:34,120][model8_pretrain.py][INFO] Epoch:[0/2](505200/4588595) loss:2.698 lr:0.0000100 epoch_Time:25912.0min: [2024-01-04 23:05:34,120][model8_pretrain.py][INFO] Epoch:[0/2](505200/4588595) loss:2.931 lr:0.0000100 epoch_Time:25912.0min: [2024-01-04 23:05:34,120][model8_pretrain.py][INFO] Epoch:[0/2](505200/4588595) loss:2.924 lr:0.0000100 epoch_Time:25912.0min: [2024-01-04 23:05:34,120][model8_pretrain.py][INFO] Epoch:[0/2](505200/4588595) loss:2.543 lr:0.0000100 epoch_Time:25912.0min: [2024-01-04 23:05:34,120][model8_pretrain.py][INFO] Epoch:[0/2](505200/4588595) loss:2.985 lr:0.0000100 epoch_Time:25912.0min: [2024-01-04 23:05:34,120][model8_pretrain.py][INFO] Epoch:[0/2](505200/4588595) loss:3.156 lr:0.0000100 
epoch_Time:25912.0min: [2024-01-04 23:05:34,120][model8_pretrain.py][INFO] Epoch:[0/2](505200/4588595) loss:2.969 lr:0.0000100 epoch_Time:25912.0min: [2024-01-04 23:05:34,120][model8_pretrain.py][INFO] Epoch:[0/2](505200/4588595) loss:3.147 lr:0.0000100 epoch_Time:25912.0min: [2024-01-04 23:06:21,537][model8_pretrain.py][INFO] Epoch:[0/2](505300/4588595) loss:2.898 lr:0.0000100 epoch_Time:25912.0min: [2024-01-04 23:06:21,537][model8_pretrain.py][INFO] Epoch:[0/2](505300/4588595) loss:2.333 lr:0.0000100 epoch_Time:25912.0min: [2024-01-04 23:06:21,537][model8_pretrain.py][INFO] Epoch:[0/2](505300/4588595) loss:2.314 lr:0.0000100 epoch_Time:25912.0min: [2024-01-04 23:06:21,537][model8_pretrain.py][INFO] Epoch:[0/2](505300/4588595) loss:2.179 lr:0.0000100 epoch_Time:25912.0min: [2024-01-04 23:06:21,537][model8_pretrain.py][INFO] Epoch:[0/2](505300/4588595) loss:2.423 lr:0.0000100 epoch_Time:25912.0min: [2024-01-04 23:06:21,537][model8_pretrain.py][INFO] Epoch:[0/2](505300/4588595) loss:3.279 lr:0.0000100 epoch_Time:25912.0min: [2024-01-04 23:06:21,537][model8_pretrain.py][INFO] Epoch:[0/2](505300/4588595) loss:3.097 lr:0.0000100 epoch_Time:25912.0min: [2024-01-04 23:06:21,537][model8_pretrain.py][INFO] Epoch:[0/2](505300/4588595) loss:3.088 lr:0.0000100 epoch_Time:25912.0min: [2024-01-04 23:06:58,460][model8_pretrain.py][INFO] Epoch:[0/2](505400/4588595) loss:3.258 lr:0.0000100 epoch_Time:25911.0min: [2024-01-04 23:06:58,460][model8_pretrain.py][INFO] Epoch:[0/2](505400/4588595) loss:3.050 lr:0.0000100 epoch_Time:25911.0min: [2024-01-04 23:06:58,460][model8_pretrain.py][INFO] Epoch:[0/2](505400/4588595) loss:3.089 lr:0.0000100 epoch_Time:25911.0min: [2024-01-04 23:06:58,460][model8_pretrain.py][INFO] Epoch:[0/2](505400/4588595) loss:2.294 lr:0.0000100 epoch_Time:25911.0min: [2024-01-04 23:06:58,460][model8_pretrain.py][INFO] Epoch:[0/2](505400/4588595) loss:2.552 lr:0.0000100 epoch_Time:25911.0min: [2024-01-04 23:06:58,460][model8_pretrain.py][INFO] 
Epoch:[0/2](505400/4588595) loss:2.844 lr:0.0000100 epoch_Time:25911.0min: [2024-01-04 23:06:58,460][model8_pretrain.py][INFO] Epoch:[0/2](505400/4588595) loss:3.134 lr:0.0000100 epoch_Time:25911.0min: [2024-01-04 23:06:58,461][model8_pretrain.py][INFO] Epoch:[0/2](505400/4588595) loss:2.652 lr:0.0000100 epoch_Time:25911.0min: [2024-01-04 23:07:35,413][model8_pretrain.py][INFO] Epoch:[0/2](505500/4588595) loss:2.649 lr:0.0000100 epoch_Time:25911.0min: [2024-01-04 23:07:35,413][model8_pretrain.py][INFO] Epoch:[0/2](505500/4588595) loss:3.167 lr:0.0000100 epoch_Time:25911.0min: [2024-01-04 23:07:35,413][model8_pretrain.py][INFO] Epoch:[0/2](505500/4588595) loss:2.646 lr:0.0000100 epoch_Time:25911.0min: [2024-01-04 23:07:35,413][model8_pretrain.py][INFO] Epoch:[0/2](505500/4588595) loss:2.947 lr:0.0000100 epoch_Time:25911.0min: [2024-01-04 23:07:35,414][model8_pretrain.py][INFO] Epoch:[0/2](505500/4588595) loss:3.186 lr:0.0000100 epoch_Time:25911.0min: [2024-01-04 23:07:35,413][model8_pretrain.py][INFO] Epoch:[0/2](505500/4588595) loss:3.393 lr:0.0000100 epoch_Time:25911.0min: [2024-01-04 23:07:35,414][model8_pretrain.py][INFO] Epoch:[0/2](505500/4588595) loss:3.527 lr:0.0000100 epoch_Time:25911.0min: [2024-01-04 23:07:35,414][model8_pretrain.py][INFO] Epoch:[0/2](505500/4588595) loss:2.501 lr:0.0000100 epoch_Time:25911.0min: [2024-01-04 23:08:12,348][model8_pretrain.py][INFO] Epoch:[0/2](505600/4588595) loss:2.289 lr:0.0000100 epoch_Time:25910.0min: [2024-01-04 23:08:12,348][model8_pretrain.py][INFO] Epoch:[0/2](505600/4588595) loss:2.614 lr:0.0000100 epoch_Time:25910.0min: [2024-01-04 23:08:12,348][model8_pretrain.py][INFO] Epoch:[0/2](505600/4588595) loss:2.910 lr:0.0000100 epoch_Time:25910.0min: [2024-01-04 23:08:12,348][model8_pretrain.py][INFO] Epoch:[0/2](505600/4588595) loss:2.799 lr:0.0000100 epoch_Time:25910.0min: [2024-01-04 23:08:12,348][model8_pretrain.py][INFO] Epoch:[0/2](505600/4588595) loss:2.319 lr:0.0000100 epoch_Time:25910.0min: [2024-01-04 
23:08:12,348][model8_pretrain.py][INFO] Epoch:[0/2](505600/4588595) loss:3.076 lr:0.0000100 epoch_Time:25910.0min: [2024-01-04 23:08:12,349][model8_pretrain.py][INFO] Epoch:[0/2](505600/4588595) loss:3.170 lr:0.0000100 epoch_Time:25910.0min: [2024-01-04 23:08:12,349][model8_pretrain.py][INFO] Epoch:[0/2](505600/4588595) loss:2.855 lr:0.0000100 epoch_Time:25910.0min: [2024-01-04 23:08:49,279][model8_pretrain.py][INFO] Epoch:[0/2](505700/4588595) loss:2.857 lr:0.0000100 epoch_Time:25908.0min: [2024-01-04 23:08:49,279][model8_pretrain.py][INFO] Epoch:[0/2](505700/4588595) loss:2.748 lr:0.0000100 epoch_Time:25908.0min: [2024-01-04 23:08:49,279][model8_pretrain.py][INFO] Epoch:[0/2](505700/4588595) loss:3.312 lr:0.0000100 epoch_Time:25908.0min: [2024-01-04 23:08:49,279][model8_pretrain.py][INFO] Epoch:[0/2](505700/4588595) loss:2.999 lr:0.0000100 epoch_Time:25908.0min: [2024-01-04 23:08:49,279][model8_pretrain.py][INFO] Epoch:[0/2](505700/4588595) loss:2.644 lr:0.0000100 epoch_Time:25908.0min: [2024-01-04 23:08:49,279][model8_pretrain.py][INFO] Epoch:[0/2](505700/4588595) loss:3.090 lr:0.0000100 epoch_Time:25908.0min: [2024-01-04 23:08:49,279][model8_pretrain.py][INFO] Epoch:[0/2](505700/4588595) loss:2.675 lr:0.0000100 epoch_Time:25908.0min: [2024-01-04 23:08:49,279][model8_pretrain.py][INFO] Epoch:[0/2](505700/4588595) loss:2.691 lr:0.0000100 epoch_Time:25908.0min: [2024-01-04 23:09:26,212][model8_pretrain.py][INFO] Epoch:[0/2](505800/4588595) loss:3.088 lr:0.0000100 epoch_Time:25908.0min: [2024-01-04 23:09:26,212][model8_pretrain.py][INFO] Epoch:[0/2](505800/4588595) loss:3.216 lr:0.0000100 epoch_Time:25908.0min: [2024-01-04 23:09:26,212][model8_pretrain.py][INFO] Epoch:[0/2](505800/4588595) loss:2.705 lr:0.0000100 epoch_Time:25908.0min: [2024-01-04 23:09:26,212][model8_pretrain.py][INFO] Epoch:[0/2](505800/4588595) loss:3.470 lr:0.0000100 epoch_Time:25908.0min: [2024-01-04 23:09:26,212][model8_pretrain.py][INFO] Epoch:[0/2](505800/4588595) loss:2.702 lr:0.0000100 
epoch_Time:25908.0min: [2024-01-04 23:09:26,212][model8_pretrain.py][INFO] Epoch:[0/2](505800/4588595) loss:2.768 lr:0.0000100 epoch_Time:25908.0min: [2024-01-04 23:09:26,212][model8_pretrain.py][INFO] Epoch:[0/2](505800/4588595) loss:2.698 lr:0.0000100 epoch_Time:25908.0min: [2024-01-04 23:09:26,212][model8_pretrain.py][INFO] Epoch:[0/2](505800/4588595) loss:2.738 lr:0.0000100 epoch_Time:25908.0min: [2024-01-04 23:10:03,146][model8_pretrain.py][INFO] Epoch:[0/2](505900/4588595) loss:2.387 lr:0.0000100 epoch_Time:25907.0min: [2024-01-04 23:10:03,146][model8_pretrain.py][INFO] Epoch:[0/2](505900/4588595) loss:2.209 lr:0.0000100 epoch_Time:25907.0min: [2024-01-04 23:10:03,146][model8_pretrain.py][INFO] Epoch:[0/2](505900/4588595) loss:2.938 lr:0.0000100 epoch_Time:25907.0min: [2024-01-04 23:10:03,146][model8_pretrain.py][INFO] Epoch:[0/2](505900/4588595) loss:1.739 lr:0.0000100 epoch_Time:25907.0min: [2024-01-04 23:10:03,146][model8_pretrain.py][INFO] Epoch:[0/2](505900/4588595) loss:2.903 lr:0.0000100 epoch_Time:25907.0min: [2024-01-04 23:10:03,146][model8_pretrain.py][INFO] Epoch:[0/2](505900/4588595) loss:2.972 lr:0.0000100 epoch_Time:25907.0min: [2024-01-04 23:10:03,146][model8_pretrain.py][INFO] Epoch:[0/2](505900/4588595) loss:2.954 lr:0.0000100 epoch_Time:25907.0min: [2024-01-04 23:10:03,146][model8_pretrain.py][INFO] Epoch:[0/2](505900/4588595) loss:2.799 lr:0.0000100 epoch_Time:25907.0min: [2024-01-04 23:10:40,081][model8_pretrain.py][INFO] Epoch:[0/2](506000/4588595) loss:2.832 lr:0.0000100 epoch_Time:25907.0min: [2024-01-04 23:10:40,081][model8_pretrain.py][INFO] Epoch:[0/2](506000/4588595) loss:2.934 lr:0.0000100 epoch_Time:25907.0min: [2024-01-04 23:10:40,081][model8_pretrain.py][INFO] Epoch:[0/2](506000/4588595) loss:2.851 lr:0.0000100 epoch_Time:25907.0min: [2024-01-04 23:10:40,081][model8_pretrain.py][INFO] Epoch:[0/2](506000/4588595) loss:2.897 lr:0.0000100 epoch_Time:25907.0min: [2024-01-04 23:10:40,081][model8_pretrain.py][INFO] 
Epoch:[0/2](506000/4588595) loss:3.053 lr:0.0000100 epoch_Time:25907.0min: [2024-01-04 23:10:40,082][model8_pretrain.py][INFO] Epoch:[0/2](506000/4588595) loss:3.184 lr:0.0000100 epoch_Time:25907.0min: [2024-01-04 23:10:40,081][model8_pretrain.py][INFO] Epoch:[0/2](506000/4588595) loss:2.998 lr:0.0000100 epoch_Time:25907.0min: [2024-01-04 23:10:40,082][model8_pretrain.py][INFO] Epoch:[0/2](506000/4588595) loss:2.669 lr:0.0000100 epoch_Time:25907.0min: [2024-01-04 23:11:27,462][model8_pretrain.py][INFO] Epoch:[0/2](506100/4588595) loss:2.848 lr:0.0000100 epoch_Time:25907.0min: [2024-01-04 23:11:27,462][model8_pretrain.py][INFO] Epoch:[0/2](506100/4588595) loss:2.849 lr:0.0000100 epoch_Time:25907.0min: [2024-01-04 23:11:27,462][model8_pretrain.py][INFO] Epoch:[0/2](506100/4588595) loss:3.255 lr:0.0000100 epoch_Time:25907.0min: [2024-01-04 23:11:27,462][model8_pretrain.py][INFO] Epoch:[0/2](506100/4588595) loss:2.896 lr:0.0000100 epoch_Time:25907.0min: [2024-01-04 23:11:27,462][model8_pretrain.py][INFO] Epoch:[0/2](506100/4588595) loss:3.170 lr:0.0000100 epoch_Time:25907.0min: [2024-01-04 23:11:27,462][model8_pretrain.py][INFO] Epoch:[0/2](506100/4588595) loss:2.837 lr:0.0000100 epoch_Time:25907.0min: [2024-01-04 23:11:27,462][model8_pretrain.py][INFO] Epoch:[0/2](506100/4588595) loss:2.457 lr:0.0000100 epoch_Time:25907.0min: [2024-01-04 23:11:27,462][model8_pretrain.py][INFO] Epoch:[0/2](506100/4588595) loss:2.912 lr:0.0000100 epoch_Time:25907.0min: [2024-01-04 23:12:04,362][model8_pretrain.py][INFO] Epoch:[0/2](506200/4588595) loss:3.493 lr:0.0000100 epoch_Time:25906.0min: [2024-01-04 23:12:04,362][model8_pretrain.py][INFO] Epoch:[0/2](506200/4588595) loss:2.392 lr:0.0000100 epoch_Time:25906.0min: [2024-01-04 23:12:04,362][model8_pretrain.py][INFO] Epoch:[0/2](506200/4588595) loss:3.059 lr:0.0000100 epoch_Time:25906.0min: [2024-01-04 23:12:04,362][model8_pretrain.py][INFO] Epoch:[0/2](506200/4588595) loss:3.144 lr:0.0000100 epoch_Time:25906.0min: [2024-01-04 
23:12:04,362][model8_pretrain.py][INFO] Epoch:[0/2](506200/4588595) loss:2.996 lr:0.0000100 epoch_Time:25906.0min: [2024-01-04 23:12:04,362][model8_pretrain.py][INFO] Epoch:[0/2](506200/4588595) loss:2.933 lr:0.0000100 epoch_Time:25906.0min: [2024-01-04 23:12:04,362][model8_pretrain.py][INFO] Epoch:[0/2](506200/4588595) loss:2.770 lr:0.0000100 epoch_Time:25906.0min: [2024-01-04 23:12:04,362][model8_pretrain.py][INFO] Epoch:[0/2](506200/4588595) loss:2.674 lr:0.0000100 epoch_Time:25906.0min: [2024-01-04 23:12:41,299][model8_pretrain.py][INFO] Epoch:[0/2](506300/4588595) loss:2.651 lr:0.0000100 epoch_Time:25906.0min: [2024-01-04 23:12:41,299][model8_pretrain.py][INFO] Epoch:[0/2](506300/4588595) loss:3.230 lr:0.0000100 epoch_Time:25906.0min: [2024-01-04 23:12:41,299][model8_pretrain.py][INFO] Epoch:[0/2](506300/4588595) loss:2.693 lr:0.0000100 epoch_Time:25906.0min: [2024-01-04 23:12:41,299][model8_pretrain.py][INFO] Epoch:[0/2](506300/4588595) loss:2.852 lr:0.0000100 epoch_Time:25906.0min: [2024-01-04 23:12:41,299][model8_pretrain.py][INFO] Epoch:[0/2](506300/4588595) loss:2.542 lr:0.0000100 epoch_Time:25906.0min: [2024-01-04 23:12:41,299][model8_pretrain.py][INFO] Epoch:[0/2](506300/4588595) loss:2.876 lr:0.0000100 epoch_Time:25906.0min: [2024-01-04 23:12:41,299][model8_pretrain.py][INFO] Epoch:[0/2](506300/4588595) loss:3.132 lr:0.0000100 epoch_Time:25906.0min: [2024-01-04 23:12:41,299][model8_pretrain.py][INFO] Epoch:[0/2](506300/4588595) loss:2.464 lr:0.0000100 epoch_Time:25906.0min: [2024-01-04 23:13:18,234][model8_pretrain.py][INFO] Epoch:[0/2](506400/4588595) loss:2.519 lr:0.0000100 epoch_Time:25905.0min: [2024-01-04 23:13:18,234][model8_pretrain.py][INFO] Epoch:[0/2](506400/4588595) loss:2.749 lr:0.0000100 epoch_Time:25905.0min: [2024-01-04 23:13:18,234][model8_pretrain.py][INFO] Epoch:[0/2](506400/4588595) loss:2.509 lr:0.0000100 epoch_Time:25905.0min: [2024-01-04 23:13:18,234][model8_pretrain.py][INFO] Epoch:[0/2](506400/4588595) loss:2.761 lr:0.0000100 
epoch_Time:25905.0min: [2024-01-04 23:13:18,234][model8_pretrain.py][INFO] Epoch:[0/2](506400/4588595) loss:2.827 lr:0.0000100 epoch_Time:25905.0min: [2024-01-04 23:13:18,234][model8_pretrain.py][INFO] Epoch:[0/2](506400/4588595) loss:2.239 lr:0.0000100 epoch_Time:25905.0min: [2024-01-04 23:13:18,234][model8_pretrain.py][INFO] Epoch:[0/2](506400/4588595) loss:2.609 lr:0.0000100 epoch_Time:25905.0min: [2024-01-04 23:13:18,235][model8_pretrain.py][INFO] Epoch:[0/2](506400/4588595) loss:2.700 lr:0.0000100 epoch_Time:25905.0min: [2024-01-04 23:13:55,163][model8_pretrain.py][INFO] Epoch:[0/2](506500/4588595) loss:3.400 lr:0.0000100 epoch_Time:25904.0min: [2024-01-04 23:13:55,163][model8_pretrain.py][INFO] Epoch:[0/2](506500/4588595) loss:2.930 lr:0.0000100 epoch_Time:25904.0min: [2024-01-04 23:13:55,163][model8_pretrain.py][INFO] Epoch:[0/2](506500/4588595) loss:3.158 lr:0.0000100 epoch_Time:25904.0min: [2024-01-04 23:13:55,163][model8_pretrain.py][INFO] Epoch:[0/2](506500/4588595) loss:2.955 lr:0.0000100 epoch_Time:25904.0min: [2024-01-04 23:13:55,163][model8_pretrain.py][INFO] Epoch:[0/2](506500/4588595) loss:2.657 lr:0.0000100 epoch_Time:25904.0min: [2024-01-04 23:13:55,163][model8_pretrain.py][INFO] Epoch:[0/2](506500/4588595) loss:2.745 lr:0.0000100 epoch_Time:25904.0min: [2024-01-04 23:13:55,163][model8_pretrain.py][INFO] Epoch:[0/2](506500/4588595) loss:2.894 lr:0.0000100 epoch_Time:25904.0min: [2024-01-04 23:13:55,163][model8_pretrain.py][INFO] Epoch:[0/2](506500/4588595) loss:3.067 lr:0.0000100 epoch_Time:25904.0min: [2024-01-04 23:14:32,094][model8_pretrain.py][INFO] Epoch:[0/2](506600/4588595) loss:2.388 lr:0.0000100 epoch_Time:25903.0min: [2024-01-04 23:14:32,094][model8_pretrain.py][INFO] Epoch:[0/2](506600/4588595) loss:2.594 lr:0.0000100 epoch_Time:25903.0min: [2024-01-04 23:14:32,094][model8_pretrain.py][INFO] Epoch:[0/2](506600/4588595) loss:3.108 lr:0.0000100 epoch_Time:25903.0min: [2024-01-04 23:14:32,094][model8_pretrain.py][INFO] 
Epoch:[0/2](506600/4588595) loss:2.583 lr:0.0000100 epoch_Time:25903.0min: [2024-01-04 23:14:32,094][model8_pretrain.py][INFO] Epoch:[0/2](506600/4588595) loss:2.787 lr:0.0000100 epoch_Time:25903.0min: [2024-01-04 23:14:32,094][model8_pretrain.py][INFO] Epoch:[0/2](506600/4588595) loss:2.929 lr:0.0000100 epoch_Time:25903.0min: [2024-01-04 23:14:32,094][model8_pretrain.py][INFO] Epoch:[0/2](506600/4588595) loss:2.866 lr:0.0000100 epoch_Time:25903.0min: [2024-01-04 23:14:32,094][model8_pretrain.py][INFO] Epoch:[0/2](506600/4588595) loss:2.497 lr:0.0000100 epoch_Time:25903.0min: [2024-01-04 23:15:09,017][model8_pretrain.py][INFO] Epoch:[0/2](506700/4588595) loss:2.685 lr:0.0000100 epoch_Time:25902.0min: [2024-01-04 23:15:09,017][model8_pretrain.py][INFO] Epoch:[0/2](506700/4588595) loss:2.635 lr:0.0000100 epoch_Time:25902.0min: [2024-01-04 23:15:09,017][model8_pretrain.py][INFO] Epoch:[0/2](506700/4588595) loss:2.859 lr:0.0000100 epoch_Time:25902.0min: [2024-01-04 23:15:09,017][model8_pretrain.py][INFO] Epoch:[0/2](506700/4588595) loss:3.054 lr:0.0000100 epoch_Time:25902.0min: [2024-01-04 23:15:09,017][model8_pretrain.py][INFO] Epoch:[0/2](506700/4588595) loss:2.882 lr:0.0000100 epoch_Time:25902.0min: [2024-01-04 23:15:09,017][model8_pretrain.py][INFO] Epoch:[0/2](506700/4588595) loss:3.099 lr:0.0000100 epoch_Time:25902.0min: [2024-01-04 23:15:09,017][model8_pretrain.py][INFO] Epoch:[0/2](506700/4588595) loss:2.941 lr:0.0000100 epoch_Time:25902.0min: [2024-01-04 23:15:09,018][model8_pretrain.py][INFO] Epoch:[0/2](506700/4588595) loss:2.790 lr:0.0000100 epoch_Time:25902.0min: [2024-01-04 23:15:45,934][model8_pretrain.py][INFO] Epoch:[0/2](506800/4588595) loss:2.526 lr:0.0000100 epoch_Time:25902.0min: [2024-01-04 23:15:45,934][model8_pretrain.py][INFO] Epoch:[0/2](506800/4588595) loss:2.833 lr:0.0000100 epoch_Time:25902.0min: [2024-01-04 23:15:45,934][model8_pretrain.py][INFO] Epoch:[0/2](506800/4588595) loss:2.857 lr:0.0000100 epoch_Time:25902.0min: [2024-01-04 
23:15:45,934][model8_pretrain.py][INFO] Epoch:[0/2](506800/4588595) loss:2.605 lr:0.0000100 epoch_Time:25902.0min: [2024-01-04 23:15:45,934][model8_pretrain.py][INFO] Epoch:[0/2](506800/4588595) loss:2.809 lr:0.0000100 epoch_Time:25902.0min: [2024-01-04 23:15:45,934][model8_pretrain.py][INFO] Epoch:[0/2](506800/4588595) loss:2.537 lr:0.0000100 epoch_Time:25902.0min: [2024-01-04 23:15:45,934][model8_pretrain.py][INFO] Epoch:[0/2](506800/4588595) loss:3.183 lr:0.0000100 epoch_Time:25902.0min: [2024-01-04 23:15:45,934][model8_pretrain.py][INFO] Epoch:[0/2](506800/4588595) loss:2.214 lr:0.0000100 epoch_Time:25902.0min: [2024-01-04 23:16:33,385][model8_pretrain.py][INFO] Epoch:[0/2](506900/4588595) loss:2.667 lr:0.0000100 epoch_Time:25902.0min: [2024-01-04 23:16:33,385][model8_pretrain.py][INFO] Epoch:[0/2](506900/4588595) loss:2.949 lr:0.0000100 epoch_Time:25902.0min: [2024-01-04 23:16:33,385][model8_pretrain.py][INFO] Epoch:[0/2](506900/4588595) loss:2.704 lr:0.0000100 epoch_Time:25902.0min: [2024-01-04 23:16:33,385][model8_pretrain.py][INFO] Epoch:[0/2](506900/4588595) loss:2.996 lr:0.0000100 epoch_Time:25902.0min: [2024-01-04 23:16:33,385][model8_pretrain.py][INFO] Epoch:[0/2](506900/4588595) loss:3.351 lr:0.0000100 epoch_Time:25902.0min: [2024-01-04 23:16:33,385][model8_pretrain.py][INFO] Epoch:[0/2](506900/4588595) loss:2.634 lr:0.0000100 epoch_Time:25902.0min: [2024-01-04 23:16:33,385][model8_pretrain.py][INFO] Epoch:[0/2](506900/4588595) loss:2.477 lr:0.0000100 epoch_Time:25902.0min: [2024-01-04 23:16:33,385][model8_pretrain.py][INFO] Epoch:[0/2](506900/4588595) loss:2.454 lr:0.0000100 epoch_Time:25902.0min: [2024-01-04 23:17:10,308][model8_pretrain.py][INFO] Epoch:[0/2](507000/4588595) loss:2.544 lr:0.0000100 epoch_Time:25901.0min: [2024-01-04 23:17:10,308][model8_pretrain.py][INFO] Epoch:[0/2](507000/4588595) loss:3.029 lr:0.0000100 epoch_Time:25901.0min: [2024-01-04 23:17:10,308][model8_pretrain.py][INFO] Epoch:[0/2](507000/4588595) loss:2.892 lr:0.0000100 
epoch_Time:25901.0min: [2024-01-04 23:17:10,308][model8_pretrain.py][INFO] Epoch:[0/2](507000/4588595) loss:2.967 lr:0.0000100 epoch_Time:25901.0min: [2024-01-04 23:17:10,308][model8_pretrain.py][INFO] Epoch:[0/2](507000/4588595) loss:2.926 lr:0.0000100 epoch_Time:25901.0min: [2024-01-04 23:17:10,308][model8_pretrain.py][INFO] Epoch:[0/2](507000/4588595) loss:2.741 lr:0.0000100 epoch_Time:25901.0min: [2024-01-04 23:17:10,308][model8_pretrain.py][INFO] Epoch:[0/2](507000/4588595) loss:3.184 lr:0.0000100 epoch_Time:25901.0min: [2024-01-04 23:17:10,308][model8_pretrain.py][INFO] Epoch:[0/2](507000/4588595) loss:2.958 lr:0.0000100 epoch_Time:25901.0min: [2024-01-04 23:17:47,241][model8_pretrain.py][INFO] Epoch:[0/2](507100/4588595) loss:2.224 lr:0.0000100 epoch_Time:25901.0min: [2024-01-04 23:17:47,241][model8_pretrain.py][INFO] Epoch:[0/2](507100/4588595) loss:2.936 lr:0.0000100 epoch_Time:25901.0min: [2024-01-04 23:17:47,241][model8_pretrain.py][INFO] Epoch:[0/2](507100/4588595) loss:2.670 lr:0.0000100 epoch_Time:25901.0min: [2024-01-04 23:17:47,241][model8_pretrain.py][INFO] Epoch:[0/2](507100/4588595) loss:3.200 lr:0.0000100 epoch_Time:25901.0min: [2024-01-04 23:17:47,241][model8_pretrain.py][INFO] Epoch:[0/2](507100/4588595) loss:2.757 lr:0.0000100 epoch_Time:25901.0min: [2024-01-04 23:17:47,241][model8_pretrain.py][INFO] Epoch:[0/2](507100/4588595) loss:2.193 lr:0.0000100 epoch_Time:25901.0min: [2024-01-04 23:17:47,241][model8_pretrain.py][INFO] Epoch:[0/2](507100/4588595) loss:2.440 lr:0.0000100 epoch_Time:25901.0min: [2024-01-04 23:17:47,241][model8_pretrain.py][INFO] Epoch:[0/2](507100/4588595) loss:2.622 lr:0.0000100 epoch_Time:25901.0min: [2024-01-04 23:18:24,178][model8_pretrain.py][INFO] Epoch:[0/2](507200/4588595) loss:3.166 lr:0.0000100 epoch_Time:25900.0min: [2024-01-04 23:18:24,178][model8_pretrain.py][INFO] Epoch:[0/2](507200/4588595) loss:2.853 lr:0.0000100 epoch_Time:25900.0min: [2024-01-04 23:18:24,178][model8_pretrain.py][INFO] 
Epoch:[0/2](507200/4588595) loss:2.843 lr:0.0000100 epoch_Time:25900.0min: [2024-01-04 23:18:24,178][model8_pretrain.py][INFO] Epoch:[0/2](507200/4588595) loss:2.402 lr:0.0000100 epoch_Time:25900.0min: [2024-01-04 23:18:24,178][model8_pretrain.py][INFO] Epoch:[0/2](507200/4588595) loss:2.746 lr:0.0000100 epoch_Time:25900.0min: [2024-01-04 23:18:24,178][model8_pretrain.py][INFO] Epoch:[0/2](507200/4588595) loss:2.668 lr:0.0000100 epoch_Time:25900.0min: [2024-01-04 23:18:24,178][model8_pretrain.py][INFO] Epoch:[0/2](507200/4588595) loss:2.996 lr:0.0000100 epoch_Time:25900.0min: [2024-01-04 23:18:24,178][model8_pretrain.py][INFO] Epoch:[0/2](507200/4588595) loss:2.708 lr:0.0000100 epoch_Time:25900.0min: [2024-01-04 23:19:01,135][model8_pretrain.py][INFO] Epoch:[0/2](507300/4588595) loss:2.953 lr:0.0000100 epoch_Time:25899.0min: [2024-01-04 23:19:01,135][model8_pretrain.py][INFO] Epoch:[0/2](507300/4588595) loss:2.740 lr:0.0000100 epoch_Time:25899.0min: [2024-01-04 23:19:01,135][model8_pretrain.py][INFO] Epoch:[0/2](507300/4588595) loss:2.980 lr:0.0000100 epoch_Time:25899.0min: [2024-01-04 23:19:01,135][model8_pretrain.py][INFO] Epoch:[0/2](507300/4588595) loss:3.147 lr:0.0000100 epoch_Time:25899.0min: [2024-01-04 23:19:01,135][model8_pretrain.py][INFO] Epoch:[0/2](507300/4588595) loss:2.736 lr:0.0000100 epoch_Time:25899.0min: [2024-01-04 23:19:01,135][model8_pretrain.py][INFO] Epoch:[0/2](507300/4588595) loss:2.678 lr:0.0000100 epoch_Time:25899.0min: [2024-01-04 23:19:01,135][model8_pretrain.py][INFO] Epoch:[0/2](507300/4588595) loss:3.060 lr:0.0000100 epoch_Time:25899.0min: [2024-01-04 23:19:01,135][model8_pretrain.py][INFO] Epoch:[0/2](507300/4588595) loss:2.652 lr:0.0000100 epoch_Time:25899.0min: [2024-01-04 23:19:38,069][model8_pretrain.py][INFO] Epoch:[0/2](507400/4588595) loss:2.618 lr:0.0000100 epoch_Time:25899.0min: [2024-01-04 23:19:38,069][model8_pretrain.py][INFO] Epoch:[0/2](507400/4588595) loss:3.065 lr:0.0000100 epoch_Time:25899.0min: [2024-01-04 
23:19:38,069][model8_pretrain.py][INFO] Epoch:[0/2](507400/4588595) loss:3.023 lr:0.0000100 epoch_Time:25899.0min: [2024-01-04 23:19:38,069][model8_pretrain.py][INFO] Epoch:[0/2](507400/4588595) loss:2.878 lr:0.0000100 epoch_Time:25899.0min: [2024-01-04 23:19:38,069][model8_pretrain.py][INFO] Epoch:[0/2](507400/4588595) loss:2.497 lr:0.0000100 epoch_Time:25899.0min: [2024-01-04 23:19:38,069][model8_pretrain.py][INFO] Epoch:[0/2](507400/4588595) loss:2.658 lr:0.0000100 epoch_Time:25899.0min: [2024-01-04 23:19:38,070][model8_pretrain.py][INFO] Epoch:[0/2](507400/4588595) loss:3.174 lr:0.0000100 epoch_Time:25899.0min: [2024-01-04 23:19:38,070][model8_pretrain.py][INFO] Epoch:[0/2](507400/4588595) loss:2.776 lr:0.0000100 epoch_Time:25899.0min: [2024-01-04 23:20:14,994][model8_pretrain.py][INFO] Epoch:[0/2](507500/4588595) loss:3.146 lr:0.0000100 epoch_Time:25897.0min: [2024-01-04 23:20:14,994][model8_pretrain.py][INFO] Epoch:[0/2](507500/4588595) loss:2.997 lr:0.0000100 epoch_Time:25897.0min: [2024-01-04 23:20:14,994][model8_pretrain.py][INFO] Epoch:[0/2](507500/4588595) loss:2.484 lr:0.0000100 epoch_Time:25897.0min: [2024-01-04 23:20:14,994][model8_pretrain.py][INFO] Epoch:[0/2](507500/4588595) loss:3.012 lr:0.0000100 epoch_Time:25897.0min: [2024-01-04 23:20:14,994][model8_pretrain.py][INFO] Epoch:[0/2](507500/4588595) loss:3.028 lr:0.0000100 epoch_Time:25897.0min: [2024-01-04 23:20:14,994][model8_pretrain.py][INFO] Epoch:[0/2](507500/4588595) loss:2.966 lr:0.0000100 epoch_Time:25897.0min: [2024-01-04 23:20:14,994][model8_pretrain.py][INFO] Epoch:[0/2](507500/4588595) loss:2.974 lr:0.0000100 epoch_Time:25897.0min: [2024-01-04 23:20:14,994][model8_pretrain.py][INFO] Epoch:[0/2](507500/4588595) loss:2.818 lr:0.0000100 epoch_Time:25897.0min: [2024-01-04 23:20:51,923][model8_pretrain.py][INFO] Epoch:[0/2](507600/4588595) loss:2.961 lr:0.0000100 epoch_Time:25896.0min: [2024-01-04 23:20:51,923][model8_pretrain.py][INFO] Epoch:[0/2](507600/4588595) loss:2.097 lr:0.0000100 
epoch_Time:25896.0min: [2024-01-04 23:20:51,923][model8_pretrain.py][INFO] Epoch:[0/2](507600/4588595) loss:2.579 lr:0.0000100 epoch_Time:25896.0min: [2024-01-04 23:20:51,923][model8_pretrain.py][INFO] Epoch:[0/2](507600/4588595) loss:2.986 lr:0.0000100 epoch_Time:25896.0min: [2024-01-04 23:20:51,923][model8_pretrain.py][INFO] Epoch:[0/2](507600/4588595) loss:3.326 lr:0.0000100 epoch_Time:25896.0min: [2024-01-04 23:20:51,923][model8_pretrain.py][INFO] Epoch:[0/2](507600/4588595) loss:2.057 lr:0.0000100 epoch_Time:25896.0min: [2024-01-04 23:20:51,923][model8_pretrain.py][INFO] Epoch:[0/2](507600/4588595) loss:2.931 lr:0.0000100 epoch_Time:25896.0min: [2024-01-04 23:20:51,923][model8_pretrain.py][INFO] Epoch:[0/2](507600/4588595) loss:2.880 lr:0.0000100 epoch_Time:25896.0min: [2024-01-04 23:21:39,379][model8_pretrain.py][INFO] Epoch:[0/2](507700/4588595) loss:2.778 lr:0.0000100 epoch_Time:25898.0min: [2024-01-04 23:21:39,379][model8_pretrain.py][INFO] Epoch:[0/2](507700/4588595) loss:2.510 lr:0.0000100 epoch_Time:25898.0min: [2024-01-04 23:21:39,379][model8_pretrain.py][INFO] Epoch:[0/2](507700/4588595) loss:2.900 lr:0.0000100 epoch_Time:25898.0min: [2024-01-04 23:21:39,379][model8_pretrain.py][INFO] Epoch:[0/2](507700/4588595) loss:2.341 lr:0.0000100 epoch_Time:25898.0min: [2024-01-04 23:21:39,379][model8_pretrain.py][INFO] Epoch:[0/2](507700/4588595) loss:2.350 lr:0.0000100 epoch_Time:25898.0min: [2024-01-04 23:21:39,379][model8_pretrain.py][INFO] Epoch:[0/2](507700/4588595) loss:2.845 lr:0.0000100 epoch_Time:25898.0min: [2024-01-04 23:21:39,379][model8_pretrain.py][INFO] Epoch:[0/2](507700/4588595) loss:3.241 lr:0.0000100 epoch_Time:25898.0min: [2024-01-04 23:21:39,379][model8_pretrain.py][INFO] Epoch:[0/2](507700/4588595) loss:2.576 lr:0.0000100 epoch_Time:25898.0min: [2024-01-04 23:22:16,291][model8_pretrain.py][INFO] Epoch:[0/2](507800/4588595) loss:3.163 lr:0.0000100 epoch_Time:25896.0min: [2024-01-04 23:22:16,291][model8_pretrain.py][INFO] 
Epoch:[0/2](507800/4588595) loss:2.531 lr:0.0000100 epoch_Time:25896.0min: [2024-01-04 23:22:16,291][model8_pretrain.py][INFO] Epoch:[0/2](507800/4588595) loss:3.229 lr:0.0000100 epoch_Time:25896.0min: [2024-01-04 23:22:16,291][model8_pretrain.py][INFO] Epoch:[0/2](507800/4588595) loss:3.058 lr:0.0000100 epoch_Time:25896.0min: [2024-01-04 23:22:16,291][model8_pretrain.py][INFO] Epoch:[0/2](507800/4588595) loss:3.033 lr:0.0000100 epoch_Time:25896.0min: [2024-01-04 23:22:16,291][model8_pretrain.py][INFO] Epoch:[0/2](507800/4588595) loss:2.516 lr:0.0000100 epoch_Time:25896.0min: [2024-01-04 23:22:16,291][model8_pretrain.py][INFO] Epoch:[0/2](507800/4588595) loss:2.847 lr:0.0000100 epoch_Time:25896.0min: [2024-01-04 23:22:16,291][model8_pretrain.py][INFO] Epoch:[0/2](507800/4588595) loss:2.961 lr:0.0000100 epoch_Time:25896.0min: [2024-01-04 23:22:53,218][model8_pretrain.py][INFO] Epoch:[0/2](507900/4588595) loss:2.981 lr:0.0000100 epoch_Time:25895.0min: [2024-01-04 23:22:53,218][model8_pretrain.py][INFO] Epoch:[0/2](507900/4588595) loss:3.258 lr:0.0000100 epoch_Time:25895.0min: [2024-01-04 23:22:53,218][model8_pretrain.py][INFO] Epoch:[0/2](507900/4588595) loss:3.414 lr:0.0000100 epoch_Time:25895.0min: [2024-01-04 23:22:53,218][model8_pretrain.py][INFO] Epoch:[0/2](507900/4588595) loss:2.853 lr:0.0000100 epoch_Time:25895.0min: [2024-01-04 23:22:53,218][model8_pretrain.py][INFO] Epoch:[0/2](507900/4588595) loss:2.738 lr:0.0000100 epoch_Time:25895.0min: [2024-01-04 23:22:53,218][model8_pretrain.py][INFO] Epoch:[0/2](507900/4588595) loss:2.971 lr:0.0000100 epoch_Time:25895.0min: [2024-01-04 23:22:53,219][model8_pretrain.py][INFO] Epoch:[0/2](507900/4588595) loss:3.215 lr:0.0000100 epoch_Time:25895.0min: [2024-01-04 23:22:53,219][model8_pretrain.py][INFO] Epoch:[0/2](507900/4588595) loss:3.342 lr:0.0000100 epoch_Time:25895.0min: [2024-01-04 23:23:30,145][model8_pretrain.py][INFO] Epoch:[0/2](508000/4588595) loss:3.188 lr:0.0000100 epoch_Time:25895.0min: [2024-01-04 
23:23:30,145][model8_pretrain.py][INFO] Epoch:[0/2](508000/4588595) loss:2.956 lr:0.0000100 epoch_Time:25895.0min: [2024-01-04 23:23:30,145][model8_pretrain.py][INFO] Epoch:[0/2](508000/4588595) loss:2.923 lr:0.0000100 epoch_Time:25895.0min: [2024-01-04 23:23:30,145][model8_pretrain.py][INFO] Epoch:[0/2](508000/4588595) loss:3.141 lr:0.0000100 epoch_Time:25895.0min: [2024-01-04 23:23:30,145][model8_pretrain.py][INFO] Epoch:[0/2](508000/4588595) loss:2.685 lr:0.0000100 epoch_Time:25895.0min: [2024-01-04 23:23:30,145][model8_pretrain.py][INFO] Epoch:[0/2](508000/4588595) loss:2.901 lr:0.0000100 epoch_Time:25895.0min: [2024-01-04 23:23:30,145][model8_pretrain.py][INFO] Epoch:[0/2](508000/4588595) loss:3.146 lr:0.0000100 epoch_Time:25895.0min: [2024-01-04 23:23:30,145][model8_pretrain.py][INFO] Epoch:[0/2](508000/4588595) loss:2.799 lr:0.0000100 epoch_Time:25895.0min: [2024-01-04 23:24:07,080][model8_pretrain.py][INFO] Epoch:[0/2](508100/4588595) loss:3.101 lr:0.0000100 epoch_Time:25894.0min: [2024-01-04 23:24:07,080][model8_pretrain.py][INFO] Epoch:[0/2](508100/4588595) loss:3.040 lr:0.0000100 epoch_Time:25894.0min: [2024-01-04 23:24:07,080][model8_pretrain.py][INFO] Epoch:[0/2](508100/4588595) loss:2.180 lr:0.0000100 epoch_Time:25894.0min: [2024-01-04 23:24:07,080][model8_pretrain.py][INFO] Epoch:[0/2](508100/4588595) loss:2.657 lr:0.0000100 epoch_Time:25894.0min: [2024-01-04 23:24:07,080][model8_pretrain.py][INFO] Epoch:[0/2](508100/4588595) loss:3.404 lr:0.0000100 epoch_Time:25894.0min: [2024-01-04 23:24:07,080][model8_pretrain.py][INFO] Epoch:[0/2](508100/4588595) loss:2.530 lr:0.0000100 epoch_Time:25894.0min: [2024-01-04 23:24:07,080][model8_pretrain.py][INFO] Epoch:[0/2](508100/4588595) loss:2.448 lr:0.0000100 epoch_Time:25894.0min: [2024-01-04 23:24:07,080][model8_pretrain.py][INFO] Epoch:[0/2](508100/4588595) loss:2.905 lr:0.0000100 epoch_Time:25894.0min: [2024-01-04 23:24:44,007][model8_pretrain.py][INFO] Epoch:[0/2](508200/4588595) loss:2.925 lr:0.0000100 
epoch_Time:25894.0min: [2024-01-04 23:24:44,007][model8_pretrain.py][INFO] Epoch:[0/2](508200/4588595) loss:2.674 lr:0.0000100 epoch_Time:25894.0min: [2024-01-04 23:24:44,007][model8_pretrain.py][INFO] Epoch:[0/2](508200/4588595) loss:3.085 lr:0.0000100 epoch_Time:25894.0min: [2024-01-04 23:24:44,007][model8_pretrain.py][INFO] Epoch:[0/2](508200/4588595) loss:3.144 lr:0.0000100 epoch_Time:25894.0min: [2024-01-04 23:24:44,007][model8_pretrain.py][INFO] Epoch:[0/2](508200/4588595) loss:3.187 lr:0.0000100 epoch_Time:25894.0min: [2024-01-04 23:24:44,007][model8_pretrain.py][INFO] Epoch:[0/2](508200/4588595) loss:3.368 lr:0.0000100 epoch_Time:25894.0min: [2024-01-04 23:24:44,007][model8_pretrain.py][INFO] Epoch:[0/2](508200/4588595) loss:2.773 lr:0.0000100 epoch_Time:25894.0min: [2024-01-04 23:24:44,007][model8_pretrain.py][INFO] Epoch:[0/2](508200/4588595) loss:2.696 lr:0.0000100 epoch_Time:25894.0min: [2024-01-04 23:25:20,936][model8_pretrain.py][INFO] Epoch:[0/2](508300/4588595) loss:2.784 lr:0.0000100 epoch_Time:25893.0min: [2024-01-04 23:25:20,936][model8_pretrain.py][INFO] Epoch:[0/2](508300/4588595) loss:3.363 lr:0.0000100 epoch_Time:25893.0min: [2024-01-04 23:25:20,936][model8_pretrain.py][INFO] Epoch:[0/2](508300/4588595) loss:2.675 lr:0.0000100 epoch_Time:25893.0min: [2024-01-04 23:25:20,936][model8_pretrain.py][INFO] Epoch:[0/2](508300/4588595) loss:2.672 lr:0.0000100 epoch_Time:25893.0min: [2024-01-04 23:25:20,936][model8_pretrain.py][INFO] Epoch:[0/2](508300/4588595) loss:2.494 lr:0.0000100 epoch_Time:25893.0min: [2024-01-04 23:25:20,936][model8_pretrain.py][INFO] Epoch:[0/2](508300/4588595) loss:2.793 lr:0.0000100 epoch_Time:25893.0min: [2024-01-04 23:25:20,936][model8_pretrain.py][INFO] Epoch:[0/2](508300/4588595) loss:3.170 lr:0.0000100 epoch_Time:25893.0min: [2024-01-04 23:25:20,936][model8_pretrain.py][INFO] Epoch:[0/2](508300/4588595) loss:2.161 lr:0.0000100 epoch_Time:25893.0min: [2024-01-04 23:25:57,863][model8_pretrain.py][INFO] 
Epoch:[0/2](508400/4588595) loss:2.367 lr:0.0000100 epoch_Time:25891.0min: [2024-01-04 23:25:57,863][model8_pretrain.py][INFO] Epoch:[0/2](508400/4588595) loss:2.984 lr:0.0000100 epoch_Time:25891.0min: [2024-01-04 23:25:57,863][model8_pretrain.py][INFO] Epoch:[0/2](508400/4588595) loss:2.709 lr:0.0000100 epoch_Time:25891.0min: [2024-01-04 23:25:57,863][model8_pretrain.py][INFO] Epoch:[0/2](508400/4588595) loss:3.147 lr:0.0000100 epoch_Time:25891.0min: [2024-01-04 23:25:57,863][model8_pretrain.py][INFO] Epoch:[0/2](508400/4588595) loss:2.247 lr:0.0000100 epoch_Time:25891.0min: [2024-01-04 23:25:57,863][model8_pretrain.py][INFO] Epoch:[0/2](508400/4588595) loss:2.769 lr:0.0000100 epoch_Time:25891.0min: [2024-01-04 23:25:57,863][model8_pretrain.py][INFO] Epoch:[0/2](508400/4588595) loss:2.337 lr:0.0000100 epoch_Time:25891.0min: [2024-01-04 23:25:57,863][model8_pretrain.py][INFO] Epoch:[0/2](508400/4588595) loss:2.811 lr:0.0000100 epoch_Time:25891.0min: [2024-01-04 23:26:45,284][model8_pretrain.py][INFO] Epoch:[0/2](508500/4588595) loss:3.264 lr:0.0000100 epoch_Time:25893.0min: [2024-01-04 23:26:45,284][model8_pretrain.py][INFO] Epoch:[0/2](508500/4588595) loss:2.979 lr:0.0000100 epoch_Time:25893.0min: [2024-01-04 23:26:45,284][model8_pretrain.py][INFO] Epoch:[0/2](508500/4588595) loss:2.100 lr:0.0000100 epoch_Time:25893.0min: [2024-01-04 23:26:45,284][model8_pretrain.py][INFO] Epoch:[0/2](508500/4588595) loss:2.609 lr:0.0000100 epoch_Time:25893.0min: [2024-01-04 23:26:45,284][model8_pretrain.py][INFO] Epoch:[0/2](508500/4588595) loss:3.162 lr:0.0000100 epoch_Time:25893.0min: [2024-01-04 23:26:45,284][model8_pretrain.py][INFO] Epoch:[0/2](508500/4588595) loss:3.097 lr:0.0000100 epoch_Time:25893.0min: [2024-01-04 23:26:45,284][model8_pretrain.py][INFO] Epoch:[0/2](508500/4588595) loss:2.380 lr:0.0000100 epoch_Time:25893.0min: [2024-01-04 23:26:45,284][model8_pretrain.py][INFO] Epoch:[0/2](508500/4588595) loss:2.700 lr:0.0000100 epoch_Time:25893.0min: [2024-01-04 
23:27:22,193][model8_pretrain.py][INFO] Epoch:[0/2](508600/4588595) loss:2.813 lr:0.0000100 epoch_Time:25892.0min: [2024-01-04 23:27:22,193][model8_pretrain.py][INFO] Epoch:[0/2](508600/4588595) loss:2.609 lr:0.0000100 epoch_Time:25892.0min: [2024-01-04 23:27:22,193][model8_pretrain.py][INFO] Epoch:[0/2](508600/4588595) loss:2.588 lr:0.0000100 epoch_Time:25892.0min: [2024-01-04 23:27:22,193][model8_pretrain.py][INFO] Epoch:[0/2](508600/4588595) loss:2.821 lr:0.0000100 epoch_Time:25892.0min: [2024-01-04 23:27:22,193][model8_pretrain.py][INFO] Epoch:[0/2](508600/4588595) loss:3.316 lr:0.0000100 epoch_Time:25892.0min: [2024-01-04 23:27:22,193][model8_pretrain.py][INFO] Epoch:[0/2](508600/4588595) loss:3.022 lr:0.0000100 epoch_Time:25892.0min: [2024-01-04 23:27:22,193][model8_pretrain.py][INFO] Epoch:[0/2](508600/4588595) loss:2.416 lr:0.0000100 epoch_Time:25892.0min: [2024-01-04 23:27:22,193][model8_pretrain.py][INFO] Epoch:[0/2](508600/4588595) loss:3.005 lr:0.0000100 epoch_Time:25892.0min: [2024-01-04 23:27:59,133][model8_pretrain.py][INFO] Epoch:[0/2](508700/4588595) loss:2.899 lr:0.0000100 epoch_Time:25891.0min: [2024-01-04 23:27:59,134][model8_pretrain.py][INFO] Epoch:[0/2](508700/4588595) loss:2.595 lr:0.0000100 epoch_Time:25891.0min: [2024-01-04 23:27:59,134][model8_pretrain.py][INFO] Epoch:[0/2](508700/4588595) loss:2.361 lr:0.0000100 epoch_Time:25891.0min: [2024-01-04 23:27:59,134][model8_pretrain.py][INFO] Epoch:[0/2](508700/4588595) loss:2.613 lr:0.0000100 epoch_Time:25891.0min: [2024-01-04 23:27:59,134][model8_pretrain.py][INFO] Epoch:[0/2](508700/4588595) loss:3.008 lr:0.0000100 epoch_Time:25891.0min: [2024-01-04 23:27:59,134][model8_pretrain.py][INFO] Epoch:[0/2](508700/4588595) loss:2.379 lr:0.0000100 epoch_Time:25891.0min: [2024-01-04 23:27:59,134][model8_pretrain.py][INFO] Epoch:[0/2](508700/4588595) loss:3.179 lr:0.0000100 epoch_Time:25891.0min: [2024-01-04 23:27:59,134][model8_pretrain.py][INFO] Epoch:[0/2](508700/4588595) loss:2.583 lr:0.0000100 
epoch_Time:25891.0min: [2024-01-04 23:28:36,092][model8_pretrain.py][INFO] Epoch:[0/2](508800/4588595) loss:2.857 lr:0.0000100 epoch_Time:25890.0min: [2024-01-04 23:28:36,092][model8_pretrain.py][INFO] Epoch:[0/2](508800/4588595) loss:2.880 lr:0.0000100 epoch_Time:25890.0min: [2024-01-04 23:28:36,092][model8_pretrain.py][INFO] Epoch:[0/2](508800/4588595) loss:3.258 lr:0.0000100 epoch_Time:25890.0min: [2024-01-04 23:28:36,092][model8_pretrain.py][INFO] Epoch:[0/2](508800/4588595) loss:2.541 lr:0.0000100 epoch_Time:25890.0min: [2024-01-04 23:28:36,092][model8_pretrain.py][INFO] Epoch:[0/2](508800/4588595) loss:2.997 lr:0.0000100 epoch_Time:25890.0min: [2024-01-04 23:28:36,092][model8_pretrain.py][INFO] Epoch:[0/2](508800/4588595) loss:2.955 lr:0.0000100 epoch_Time:25890.0min: [2024-01-04 23:28:36,092][model8_pretrain.py][INFO] Epoch:[0/2](508800/4588595) loss:3.184 lr:0.0000100 epoch_Time:25890.0min: [2024-01-04 23:28:36,092][model8_pretrain.py][INFO] Epoch:[0/2](508800/4588595) loss:3.072 lr:0.0000100 epoch_Time:25890.0min: [2024-01-04 23:29:13,059][model8_pretrain.py][INFO] Epoch:[0/2](508900/4588595) loss:2.844 lr:0.0000100 epoch_Time:25889.0min: [2024-01-04 23:29:13,060][model8_pretrain.py][INFO] Epoch:[0/2](508900/4588595) loss:3.132 lr:0.0000100 epoch_Time:25889.0min: [2024-01-04 23:29:13,060][model8_pretrain.py][INFO] Epoch:[0/2](508900/4588595) loss:2.334 lr:0.0000100 epoch_Time:25889.0min: [2024-01-04 23:29:13,060][model8_pretrain.py][INFO] Epoch:[0/2](508900/4588595) loss:2.740 lr:0.0000100 epoch_Time:25889.0min: [2024-01-04 23:29:13,060][model8_pretrain.py][INFO] Epoch:[0/2](508900/4588595) loss:2.387 lr:0.0000100 epoch_Time:25889.0min: [2024-01-04 23:29:13,060][model8_pretrain.py][INFO] Epoch:[0/2](508900/4588595) loss:2.220 lr:0.0000100 epoch_Time:25889.0min: [2024-01-04 23:29:13,060][model8_pretrain.py][INFO] Epoch:[0/2](508900/4588595) loss:2.920 lr:0.0000100 epoch_Time:25889.0min: [2024-01-04 23:29:13,061][model8_pretrain.py][INFO] 
Epoch:[0/2](508900/4588595) loss:2.834 lr:0.0000100 epoch_Time:25889.0min: [2024-01-04 23:29:50,006][model8_pretrain.py][INFO] Epoch:[0/2](509000/4588595) loss:2.688 lr:0.0000100 epoch_Time:25888.0min: [2024-01-04 23:29:50,006][model8_pretrain.py][INFO] Epoch:[0/2](509000/4588595) loss:2.421 lr:0.0000100 epoch_Time:25888.0min: [2024-01-04 23:29:50,006][model8_pretrain.py][INFO] Epoch:[0/2](509000/4588595) loss:2.915 lr:0.0000100 epoch_Time:25888.0min: [2024-01-04 23:29:50,006][model8_pretrain.py][INFO] Epoch:[0/2](509000/4588595) loss:2.749 lr:0.0000100 epoch_Time:25888.0min: [2024-01-04 23:29:50,006][model8_pretrain.py][INFO] Epoch:[0/2](509000/4588595) loss:3.016 lr:0.0000100 epoch_Time:25888.0min: [2024-01-04 23:29:50,006][model8_pretrain.py][INFO] Epoch:[0/2](509000/4588595) loss:3.343 lr:0.0000100 epoch_Time:25888.0min: [2024-01-04 23:29:50,006][model8_pretrain.py][INFO] Epoch:[0/2](509000/4588595) loss:2.866 lr:0.0000100 epoch_Time:25888.0min: [2024-01-04 23:29:50,006][model8_pretrain.py][INFO] Epoch:[0/2](509000/4588595) loss:2.966 lr:0.0000100 epoch_Time:25888.0min: [2024-01-04 23:30:26,951][model8_pretrain.py][INFO] Epoch:[0/2](509100/4588595) loss:2.766 lr:0.0000100 epoch_Time:25888.0min: [2024-01-04 23:30:26,952][model8_pretrain.py][INFO] Epoch:[0/2](509100/4588595) loss:3.269 lr:0.0000100 epoch_Time:25888.0min: [2024-01-04 23:30:26,952][model8_pretrain.py][INFO] Epoch:[0/2](509100/4588595) loss:2.274 lr:0.0000100 epoch_Time:25888.0min: [2024-01-04 23:30:26,952][model8_pretrain.py][INFO] Epoch:[0/2](509100/4588595) loss:2.963 lr:0.0000100 epoch_Time:25888.0min: [2024-01-04 23:30:26,952][model8_pretrain.py][INFO] Epoch:[0/2](509100/4588595) loss:2.880 lr:0.0000100 epoch_Time:25888.0min: [2024-01-04 23:30:26,952][model8_pretrain.py][INFO] Epoch:[0/2](509100/4588595) loss:3.001 lr:0.0000100 epoch_Time:25888.0min: [2024-01-04 23:30:26,952][model8_pretrain.py][INFO] Epoch:[0/2](509100/4588595) loss:3.225 lr:0.0000100 epoch_Time:25888.0min: [2024-01-04 
23:30:26,952][model8_pretrain.py][INFO] Epoch:[0/2](509100/4588595) loss:3.021 lr:0.0000100 epoch_Time:25888.0min: [2024-01-04 23:31:03,870][model8_pretrain.py][INFO] Epoch:[0/2](509200/4588595) loss:3.486 lr:0.0000100 epoch_Time:25887.0min: [2024-01-04 23:31:03,870][model8_pretrain.py][INFO] Epoch:[0/2](509200/4588595) loss:2.558 lr:0.0000100 epoch_Time:25887.0min: [2024-01-04 23:31:03,870][model8_pretrain.py][INFO] Epoch:[0/2](509200/4588595) loss:3.159 lr:0.0000100 epoch_Time:25887.0min: [2024-01-04 23:31:03,870][model8_pretrain.py][INFO] Epoch:[0/2](509200/4588595) loss:2.873 lr:0.0000100 epoch_Time:25887.0min: [2024-01-04 23:31:03,870][model8_pretrain.py][INFO] Epoch:[0/2](509200/4588595) loss:2.976 lr:0.0000100 epoch_Time:25887.0min: [2024-01-04 23:31:03,870][model8_pretrain.py][INFO] Epoch:[0/2](509200/4588595) loss:2.652 lr:0.0000100 epoch_Time:25887.0min: [2024-01-04 23:31:03,870][model8_pretrain.py][INFO] Epoch:[0/2](509200/4588595) loss:2.282 lr:0.0000100 epoch_Time:25887.0min: [2024-01-04 23:31:03,870][model8_pretrain.py][INFO] Epoch:[0/2](509200/4588595) loss:3.154 lr:0.0000100 epoch_Time:25887.0min: [2024-01-04 23:31:49,350][model8_pretrain.py][INFO] Epoch:[0/2](509300/4588595) loss:3.075 lr:0.0000100 epoch_Time:25887.0min: [2024-01-04 23:31:49,350][model8_pretrain.py][INFO] Epoch:[0/2](509300/4588595) loss:2.791 lr:0.0000100 epoch_Time:25887.0min: [2024-01-04 23:31:49,350][model8_pretrain.py][INFO] Epoch:[0/2](509300/4588595) loss:2.814 lr:0.0000100 epoch_Time:25887.0min: [2024-01-04 23:31:49,350][model8_pretrain.py][INFO] Epoch:[0/2](509300/4588595) loss:2.757 lr:0.0000100 epoch_Time:25887.0min: [2024-01-04 23:31:49,350][model8_pretrain.py][INFO] Epoch:[0/2](509300/4588595) loss:2.441 lr:0.0000100 epoch_Time:25887.0min: [2024-01-04 23:31:49,350][model8_pretrain.py][INFO] Epoch:[0/2](509300/4588595) loss:2.984 lr:0.0000100 epoch_Time:25887.0min: [2024-01-04 23:31:49,350][model8_pretrain.py][INFO] Epoch:[0/2](509300/4588595) loss:2.476 lr:0.0000100 
epoch_Time:25887.0min: [2024-01-04 23:31:49,350][model8_pretrain.py][INFO] Epoch:[0/2](509300/4588595) loss:2.903 lr:0.0000100 epoch_Time:25887.0min: [2024-01-04 23:32:27,957][model8_pretrain.py][INFO] Epoch:[0/2](509400/4588595) loss:3.114 lr:0.0000100 epoch_Time:25887.0min: [2024-01-04 23:32:27,957][model8_pretrain.py][INFO] Epoch:[0/2](509400/4588595) loss:3.316 lr:0.0000100 epoch_Time:25887.0min: [2024-01-04 23:32:27,957][model8_pretrain.py][INFO] Epoch:[0/2](509400/4588595) loss:2.675 lr:0.0000100 epoch_Time:25887.0min: [2024-01-04 23:32:27,957][model8_pretrain.py][INFO] Epoch:[0/2](509400/4588595) loss:2.116 lr:0.0000100 epoch_Time:25887.0min: [2024-01-04 23:32:27,957][model8_pretrain.py][INFO] Epoch:[0/2](509400/4588595) loss:3.010 lr:0.0000100 epoch_Time:25887.0min: [2024-01-04 23:32:27,958][model8_pretrain.py][INFO] Epoch:[0/2](509400/4588595) loss:2.328 lr:0.0000100 epoch_Time:25887.0min: [2024-01-04 23:32:27,958][model8_pretrain.py][INFO] Epoch:[0/2](509400/4588595) loss:2.824 lr:0.0000100 epoch_Time:25887.0min: [2024-01-04 23:32:27,958][model8_pretrain.py][INFO] Epoch:[0/2](509400/4588595) loss:2.743 lr:0.0000100 epoch_Time:25887.0min: [2024-01-04 23:33:04,902][model8_pretrain.py][INFO] Epoch:[0/2](509500/4588595) loss:3.345 lr:0.0000100 epoch_Time:25886.0min: [2024-01-04 23:33:04,902][model8_pretrain.py][INFO] Epoch:[0/2](509500/4588595) loss:2.769 lr:0.0000100 epoch_Time:25886.0min: [2024-01-04 23:33:04,902][model8_pretrain.py][INFO] Epoch:[0/2](509500/4588595) loss:2.353 lr:0.0000100 epoch_Time:25886.0min: [2024-01-04 23:33:04,902][model8_pretrain.py][INFO] Epoch:[0/2](509500/4588595) loss:2.716 lr:0.0000100 epoch_Time:25886.0min: [2024-01-04 23:33:04,903][model8_pretrain.py][INFO] Epoch:[0/2](509500/4588595) loss:2.536 lr:0.0000100 epoch_Time:25886.0min: [2024-01-04 23:33:04,902][model8_pretrain.py][INFO] Epoch:[0/2](509500/4588595) loss:2.929 lr:0.0000100 epoch_Time:25886.0min: [2024-01-04 23:33:04,903][model8_pretrain.py][INFO] 
Epoch:[0/2](509500/4588595) loss:2.630 lr:0.0000100 epoch_Time:25886.0min: [2024-01-04 23:33:04,903][model8_pretrain.py][INFO] Epoch:[0/2](509500/4588595) loss:2.623 lr:0.0000100 epoch_Time:25886.0min: [2024-01-04 23:33:41,848][model8_pretrain.py][INFO] Epoch:[0/2](509600/4588595) loss:2.531 lr:0.0000100 epoch_Time:25886.0min: [2024-01-04 23:33:41,848][model8_pretrain.py][INFO] Epoch:[0/2](509600/4588595) loss:3.157 lr:0.0000100 epoch_Time:25886.0min: [2024-01-04 23:33:41,848][model8_pretrain.py][INFO] Epoch:[0/2](509600/4588595) loss:2.893 lr:0.0000100 epoch_Time:25886.0min: [2024-01-04 23:33:41,848][model8_pretrain.py][INFO] Epoch:[0/2](509600/4588595) loss:2.350 lr:0.0000100 epoch_Time:25886.0min: [2024-01-04 23:33:41,848][model8_pretrain.py][INFO] Epoch:[0/2](509600/4588595) loss:2.638 lr:0.0000100 epoch_Time:25886.0min: [2024-01-04 23:33:41,848][model8_pretrain.py][INFO] Epoch:[0/2](509600/4588595) loss:3.369 lr:0.0000100 epoch_Time:25886.0min: [2024-01-04 23:33:41,848][model8_pretrain.py][INFO] Epoch:[0/2](509600/4588595) loss:3.281 lr:0.0000100 epoch_Time:25886.0min: [2024-01-04 23:33:41,848][model8_pretrain.py][INFO] Epoch:[0/2](509600/4588595) loss:2.667 lr:0.0000100 epoch_Time:25886.0min: [2024-01-04 23:34:18,801][model8_pretrain.py][INFO] Epoch:[0/2](509700/4588595) loss:2.495 lr:0.0000100 epoch_Time:25884.0min: [2024-01-04 23:34:18,801][model8_pretrain.py][INFO] Epoch:[0/2](509700/4588595) loss:3.388 lr:0.0000100 epoch_Time:25884.0min: [2024-01-04 23:34:18,801][model8_pretrain.py][INFO] Epoch:[0/2](509700/4588595) loss:2.701 lr:0.0000100 epoch_Time:25884.0min: [2024-01-04 23:34:18,801][model8_pretrain.py][INFO] Epoch:[0/2](509700/4588595) loss:2.922 lr:0.0000100 epoch_Time:25884.0min: [2024-01-04 23:34:18,801][model8_pretrain.py][INFO] Epoch:[0/2](509700/4588595) loss:2.766 lr:0.0000100 epoch_Time:25884.0min: [2024-01-04 23:34:18,802][model8_pretrain.py][INFO] Epoch:[0/2](509700/4588595) loss:2.895 lr:0.0000100 epoch_Time:25884.0min: [2024-01-04 
23:34:18,802][model8_pretrain.py][INFO] Epoch:[0/2](509700/4588595) loss:3.137 lr:0.0000100 epoch_Time:25884.0min: [2024-01-04 23:34:18,802][model8_pretrain.py][INFO] Epoch:[0/2](509700/4588595) loss:3.077 lr:0.0000100 epoch_Time:25884.0min: [2024-01-04 23:34:55,737][model8_pretrain.py][INFO] Epoch:[0/2](509800/4588595) loss:2.686 lr:0.0000100 epoch_Time:25883.0min: [2024-01-04 23:34:55,737][model8_pretrain.py][INFO] Epoch:[0/2](509800/4588595) loss:2.474 lr:0.0000100 epoch_Time:25883.0min: [2024-01-04 23:34:55,737][model8_pretrain.py][INFO] Epoch:[0/2](509800/4588595) loss:2.482 lr:0.0000100 epoch_Time:25883.0min: [2024-01-04 23:34:55,737][model8_pretrain.py][INFO] Epoch:[0/2](509800/4588595) loss:3.252 lr:0.0000100 epoch_Time:25883.0min: [2024-01-04 23:34:55,737][model8_pretrain.py][INFO] Epoch:[0/2](509800/4588595) loss:3.101 lr:0.0000100 epoch_Time:25883.0min: [2024-01-04 23:34:55,737][model8_pretrain.py][INFO] Epoch:[0/2](509800/4588595) loss:3.019 lr:0.0000100 epoch_Time:25883.0min: [2024-01-04 23:34:55,737][model8_pretrain.py][INFO] Epoch:[0/2](509800/4588595) loss:3.014 lr:0.0000100 epoch_Time:25883.0min: [2024-01-04 23:34:55,737][model8_pretrain.py][INFO] Epoch:[0/2](509800/4588595) loss:3.010 lr:0.0000100 epoch_Time:25883.0min: [2024-01-04 23:35:32,684][model8_pretrain.py][INFO] Epoch:[0/2](509900/4588595) loss:2.579 lr:0.0000100 epoch_Time:25883.0min: [2024-01-04 23:35:32,684][model8_pretrain.py][INFO] Epoch:[0/2](509900/4588595) loss:2.300 lr:0.0000100 epoch_Time:25883.0min: [2024-01-04 23:35:32,684][model8_pretrain.py][INFO] Epoch:[0/2](509900/4588595) loss:2.891 lr:0.0000100 epoch_Time:25883.0min: [2024-01-04 23:35:32,684][model8_pretrain.py][INFO] Epoch:[0/2](509900/4588595) loss:3.466 lr:0.0000100 epoch_Time:25883.0min: [2024-01-04 23:35:32,684][model8_pretrain.py][INFO] Epoch:[0/2](509900/4588595) loss:3.102 lr:0.0000100 epoch_Time:25883.0min: [2024-01-04 23:35:32,684][model8_pretrain.py][INFO] Epoch:[0/2](509900/4588595) loss:2.692 lr:0.0000100 
epoch_Time:25883.0min: [2024-01-04 23:35:32,684][model8_pretrain.py][INFO] Epoch:[0/2](509900/4588595) loss:2.937 lr:0.0000100 epoch_Time:25883.0min: [2024-01-04 23:35:32,684][model8_pretrain.py][INFO] Epoch:[0/2](509900/4588595) loss:2.652 lr:0.0000100 epoch_Time:25883.0min: [2024-01-04 23:36:09,635][model8_pretrain.py][INFO] Epoch:[0/2](510000/4588595) loss:2.936 lr:0.0000100 epoch_Time:25882.0min: [2024-01-04 23:36:09,635][model8_pretrain.py][INFO] Epoch:[0/2](510000/4588595) loss:3.023 lr:0.0000100 epoch_Time:25882.0min: [2024-01-04 23:36:09,635][model8_pretrain.py][INFO] Epoch:[0/2](510000/4588595) loss:2.790 lr:0.0000100 epoch_Time:25882.0min: [2024-01-04 23:36:09,635][model8_pretrain.py][INFO] Epoch:[0/2](510000/4588595) loss:2.997 lr:0.0000100 epoch_Time:25882.0min: [2024-01-04 23:36:09,635][model8_pretrain.py][INFO] Epoch:[0/2](510000/4588595) loss:2.805 lr:0.0000100 epoch_Time:25882.0min: [2024-01-04 23:36:09,635][model8_pretrain.py][INFO] Epoch:[0/2](510000/4588595) loss:3.003 lr:0.0000100 epoch_Time:25882.0min: [2024-01-04 23:36:09,635][model8_pretrain.py][INFO] Epoch:[0/2](510000/4588595) loss:2.416 lr:0.0000100 epoch_Time:25882.0min: [2024-01-04 23:36:09,635][model8_pretrain.py][INFO] Epoch:[0/2](510000/4588595) loss:3.156 lr:0.0000100 epoch_Time:25882.0min: [2024-01-04 23:36:55,103][model8_pretrain.py][INFO] Epoch:[0/2](510100/4588595) loss:2.160 lr:0.0000100 epoch_Time:25882.0min: [2024-01-04 23:36:55,104][model8_pretrain.py][INFO] Epoch:[0/2](510100/4588595) loss:2.440 lr:0.0000100 epoch_Time:25882.0min: [2024-01-04 23:36:55,103][model8_pretrain.py][INFO] Epoch:[0/2](510100/4588595) loss:3.257 lr:0.0000100 epoch_Time:25882.0min: [2024-01-04 23:36:55,103][model8_pretrain.py][INFO] Epoch:[0/2](510100/4588595) loss:2.929 lr:0.0000100 epoch_Time:25882.0min: [2024-01-04 23:36:55,104][model8_pretrain.py][INFO] Epoch:[0/2](510100/4588595) loss:2.758 lr:0.0000100 epoch_Time:25882.0min: [2024-01-04 23:36:55,104][model8_pretrain.py][INFO] 
Epoch:[0/2](510100/4588595) loss:2.833 lr:0.0000100 epoch_Time:25882.0min: [2024-01-04 23:36:55,104][model8_pretrain.py][INFO] Epoch:[0/2](510100/4588595) loss:3.094 lr:0.0000100 epoch_Time:25882.0min: [2024-01-04 23:36:55,104][model8_pretrain.py][INFO] Epoch:[0/2](510100/4588595) loss:2.362 lr:0.0000100 epoch_Time:25882.0min: [2024-01-04 23:37:33,781][model8_pretrain.py][INFO] Epoch:[0/2](510200/4588595) loss:2.759 lr:0.0000100 epoch_Time:25882.0min: [2024-01-04 23:37:33,781][model8_pretrain.py][INFO] Epoch:[0/2](510200/4588595) loss:2.775 lr:0.0000100 epoch_Time:25882.0min: [2024-01-04 23:37:33,781][model8_pretrain.py][INFO] Epoch:[0/2](510200/4588595) loss:3.434 lr:0.0000100 epoch_Time:25882.0min: [2024-01-04 23:37:33,781][model8_pretrain.py][INFO] Epoch:[0/2](510200/4588595) loss:2.551 lr:0.0000100 epoch_Time:25882.0min: [2024-01-04 23:37:33,781][model8_pretrain.py][INFO] Epoch:[0/2](510200/4588595) loss:3.108 lr:0.0000100 epoch_Time:25882.0min: [2024-01-04 23:37:33,781][model8_pretrain.py][INFO] Epoch:[0/2](510200/4588595) loss:2.430 lr:0.0000100 epoch_Time:25882.0min: [2024-01-04 23:37:33,781][model8_pretrain.py][INFO] Epoch:[0/2](510200/4588595) loss:2.865 lr:0.0000100 epoch_Time:25882.0min: [2024-01-04 23:37:33,782][model8_pretrain.py][INFO] Epoch:[0/2](510200/4588595) loss:2.716 lr:0.0000100 epoch_Time:25882.0min: [2024-01-04 23:38:10,728][model8_pretrain.py][INFO] Epoch:[0/2](510300/4588595) loss:2.803 lr:0.0000100 epoch_Time:25881.0min: [2024-01-04 23:38:10,728][model8_pretrain.py][INFO] Epoch:[0/2](510300/4588595) loss:2.559 lr:0.0000100 epoch_Time:25881.0min: [2024-01-04 23:38:10,728][model8_pretrain.py][INFO] Epoch:[0/2](510300/4588595) loss:2.144 lr:0.0000100 epoch_Time:25881.0min: [2024-01-04 23:38:10,728][model8_pretrain.py][INFO] Epoch:[0/2](510300/4588595) loss:3.259 lr:0.0000100 epoch_Time:25881.0min: [2024-01-04 23:38:10,728][model8_pretrain.py][INFO] Epoch:[0/2](510300/4588595) loss:2.455 lr:0.0000100 epoch_Time:25881.0min: [2024-01-04 
23:38:10,728][model8_pretrain.py][INFO] Epoch:[0/2](510300/4588595) loss:2.704 lr:0.0000100 epoch_Time:25881.0min: [2024-01-04 23:38:10,728][model8_pretrain.py][INFO] Epoch:[0/2](510300/4588595) loss:2.818 lr:0.0000100 epoch_Time:25881.0min: [2024-01-04 23:38:10,728][model8_pretrain.py][INFO] Epoch:[0/2](510300/4588595) loss:2.825 lr:0.0000100 epoch_Time:25881.0min: [2024-01-04 23:38:47,669][model8_pretrain.py][INFO] Epoch:[0/2](510400/4588595) loss:3.101 lr:0.0000100 epoch_Time:25880.0min: [2024-01-04 23:38:47,669][model8_pretrain.py][INFO] Epoch:[0/2](510400/4588595) loss:2.101 lr:0.0000100 epoch_Time:25880.0min: [2024-01-04 23:38:47,669][model8_pretrain.py][INFO] Epoch:[0/2](510400/4588595) loss:2.730 lr:0.0000100 epoch_Time:25880.0min: [2024-01-04 23:38:47,669][model8_pretrain.py][INFO] Epoch:[0/2](510400/4588595) loss:2.869 lr:0.0000100 epoch_Time:25880.0min: [2024-01-04 23:38:47,669][model8_pretrain.py][INFO] Epoch:[0/2](510400/4588595) loss:2.953 lr:0.0000100 epoch_Time:25880.0min: [2024-01-04 23:38:47,670][model8_pretrain.py][INFO] Epoch:[0/2](510400/4588595) loss:3.008 lr:0.0000100 epoch_Time:25880.0min: [2024-01-04 23:38:47,670][model8_pretrain.py][INFO] Epoch:[0/2](510400/4588595) loss:2.893 lr:0.0000100 epoch_Time:25880.0min: [2024-01-04 23:38:47,670][model8_pretrain.py][INFO] Epoch:[0/2](510400/4588595) loss:2.804 lr:0.0000100 epoch_Time:25880.0min: [2024-01-04 23:39:24,610][model8_pretrain.py][INFO] Epoch:[0/2](510500/4588595) loss:3.067 lr:0.0000100 epoch_Time:25880.0min: [2024-01-04 23:39:24,610][model8_pretrain.py][INFO] Epoch:[0/2](510500/4588595) loss:3.136 lr:0.0000100 epoch_Time:25880.0min: [2024-01-04 23:39:24,610][model8_pretrain.py][INFO] Epoch:[0/2](510500/4588595) loss:3.013 lr:0.0000100 epoch_Time:25880.0min: [2024-01-04 23:39:24,610][model8_pretrain.py][INFO] Epoch:[0/2](510500/4588595) loss:2.762 lr:0.0000100 epoch_Time:25880.0min: [2024-01-04 23:39:24,610][model8_pretrain.py][INFO] Epoch:[0/2](510500/4588595) loss:2.857 lr:0.0000100 
epoch_Time:25880.0min: [2024-01-04 23:39:24,610][model8_pretrain.py][INFO] Epoch:[0/2](510500/4588595) loss:2.961 lr:0.0000100 epoch_Time:25880.0min: [2024-01-04 23:39:24,610][model8_pretrain.py][INFO] Epoch:[0/2](510500/4588595) loss:2.814 lr:0.0000100 epoch_Time:25880.0min: [2024-01-04 23:39:24,610][model8_pretrain.py][INFO] Epoch:[0/2](510500/4588595) loss:3.161 lr:0.0000100 epoch_Time:25880.0min: [2024-01-04 23:40:01,540][model8_pretrain.py][INFO] Epoch:[0/2](510600/4588595) loss:3.055 lr:0.0000100 epoch_Time:25878.0min: [2024-01-04 23:40:01,540][model8_pretrain.py][INFO] Epoch:[0/2](510600/4588595) loss:3.260 lr:0.0000100 epoch_Time:25878.0min: [2024-01-04 23:40:01,540][model8_pretrain.py][INFO] Epoch:[0/2](510600/4588595) loss:2.652 lr:0.0000100 epoch_Time:25878.0min: [2024-01-04 23:40:01,540][model8_pretrain.py][INFO] Epoch:[0/2](510600/4588595) loss:2.826 lr:0.0000100 epoch_Time:25878.0min: [2024-01-04 23:40:01,540][model8_pretrain.py][INFO] Epoch:[0/2](510600/4588595) loss:2.144 lr:0.0000100 epoch_Time:25878.0min: [2024-01-04 23:40:01,540][model8_pretrain.py][INFO] Epoch:[0/2](510600/4588595) loss:2.933 lr:0.0000100 epoch_Time:25878.0min: [2024-01-04 23:40:01,541][model8_pretrain.py][INFO] Epoch:[0/2](510600/4588595) loss:2.641 lr:0.0000100 epoch_Time:25878.0min: [2024-01-04 23:40:01,541][model8_pretrain.py][INFO] Epoch:[0/2](510600/4588595) loss:2.567 lr:0.0000100 epoch_Time:25878.0min: [2024-01-04 23:40:38,474][model8_pretrain.py][INFO] Epoch:[0/2](510700/4588595) loss:2.862 lr:0.0000100 epoch_Time:25878.0min: [2024-01-04 23:40:38,474][model8_pretrain.py][INFO] Epoch:[0/2](510700/4588595) loss:2.895 lr:0.0000100 epoch_Time:25878.0min: [2024-01-04 23:40:38,474][model8_pretrain.py][INFO] Epoch:[0/2](510700/4588595) loss:3.007 lr:0.0000100 epoch_Time:25878.0min: [2024-01-04 23:40:38,474][model8_pretrain.py][INFO] Epoch:[0/2](510700/4588595) loss:3.287 lr:0.0000100 epoch_Time:25878.0min: [2024-01-04 23:40:38,474][model8_pretrain.py][INFO] 
Epoch:[0/2](510700/4588595) loss:3.244 lr:0.0000100 epoch_Time:25878.0min: [2024-01-04 23:40:38,474][model8_pretrain.py][INFO] Epoch:[0/2](510700/4588595) loss:3.075 lr:0.0000100 epoch_Time:25878.0min: [2024-01-04 23:40:38,474][model8_pretrain.py][INFO] Epoch:[0/2](510700/4588595) loss:3.081 lr:0.0000100 epoch_Time:25878.0min: [2024-01-04 23:40:38,474][model8_pretrain.py][INFO] Epoch:[0/2](510700/4588595) loss:2.899 lr:0.0000100 epoch_Time:25878.0min: [2024-01-04 23:41:15,419][model8_pretrain.py][INFO] Epoch:[0/2](510800/4588595) loss:2.872 lr:0.0000100 epoch_Time:25877.0min: [2024-01-04 23:41:15,419][model8_pretrain.py][INFO] Epoch:[0/2](510800/4588595) loss:3.309 lr:0.0000100 epoch_Time:25877.0min: [2024-01-04 23:41:15,419][model8_pretrain.py][INFO] Epoch:[0/2](510800/4588595) loss:2.999 lr:0.0000100 epoch_Time:25877.0min: [2024-01-04 23:41:15,419][model8_pretrain.py][INFO] Epoch:[0/2](510800/4588595) loss:3.315 lr:0.0000100 epoch_Time:25877.0min: [2024-01-04 23:41:15,419][model8_pretrain.py][INFO] Epoch:[0/2](510800/4588595) loss:2.836 lr:0.0000100 epoch_Time:25877.0min: [2024-01-04 23:41:15,419][model8_pretrain.py][INFO] Epoch:[0/2](510800/4588595) loss:2.481 lr:0.0000100 epoch_Time:25877.0min: [2024-01-04 23:41:15,419][model8_pretrain.py][INFO] Epoch:[0/2](510800/4588595) loss:2.924 lr:0.0000100 epoch_Time:25877.0min: [2024-01-04 23:41:15,419][model8_pretrain.py][INFO] Epoch:[0/2](510800/4588595) loss:2.013 lr:0.0000100 epoch_Time:25877.0min: [2024-01-04 23:41:57,571][model8_pretrain.py][INFO] Epoch:[0/2](510900/4588595) loss:2.667 lr:0.0000100 epoch_Time:25877.0min: [2024-01-04 23:41:57,571][model8_pretrain.py][INFO] Epoch:[0/2](510900/4588595) loss:2.295 lr:0.0000100 epoch_Time:25877.0min: [2024-01-04 23:41:57,571][model8_pretrain.py][INFO] Epoch:[0/2](510900/4588595) loss:2.424 lr:0.0000100 epoch_Time:25877.0min: [2024-01-04 23:41:57,571][model8_pretrain.py][INFO] Epoch:[0/2](510900/4588595) loss:3.023 lr:0.0000100 epoch_Time:25877.0min: [2024-01-04 
23:41:57,571][model8_pretrain.py][INFO] Epoch:[0/2](510900/4588595) loss:2.749 lr:0.0000100 epoch_Time:25877.0min: [2024-01-04 23:41:57,571][model8_pretrain.py][INFO] Epoch:[0/2](510900/4588595) loss:3.486 lr:0.0000100 epoch_Time:25877.0min: [2024-01-04 23:41:57,571][model8_pretrain.py][INFO] Epoch:[0/2](510900/4588595) loss:2.763 lr:0.0000100 epoch_Time:25877.0min: [2024-01-04 23:41:57,571][model8_pretrain.py][INFO] Epoch:[0/2](510900/4588595) loss:2.408 lr:0.0000100 epoch_Time:25877.0min: [2024-01-04 23:42:39,513][model8_pretrain.py][INFO] Epoch:[0/2](511000/4588595) loss:2.551 lr:0.0000100 epoch_Time:25877.0min: [2024-01-04 23:42:39,513][model8_pretrain.py][INFO] Epoch:[0/2](511000/4588595) loss:2.986 lr:0.0000100 epoch_Time:25877.0min: [2024-01-04 23:42:39,513][model8_pretrain.py][INFO] Epoch:[0/2](511000/4588595) loss:2.972 lr:0.0000100 epoch_Time:25877.0min: [2024-01-04 23:42:39,513][model8_pretrain.py][INFO] Epoch:[0/2](511000/4588595) loss:2.733 lr:0.0000100 epoch_Time:25877.0min: [2024-01-04 23:42:39,513][model8_pretrain.py][INFO] Epoch:[0/2](511000/4588595) loss:2.673 lr:0.0000100 epoch_Time:25877.0min: [2024-01-04 23:42:39,513][model8_pretrain.py][INFO] Epoch:[0/2](511000/4588595) loss:2.266 lr:0.0000100 epoch_Time:25877.0min: [2024-01-04 23:42:39,513][model8_pretrain.py][INFO] Epoch:[0/2](511000/4588595) loss:3.086 lr:0.0000100 epoch_Time:25877.0min: [2024-01-04 23:42:39,514][model8_pretrain.py][INFO] Epoch:[0/2](511000/4588595) loss:2.298 lr:0.0000100 epoch_Time:25877.0min: [2024-01-04 23:43:16,441][model8_pretrain.py][INFO] Epoch:[0/2](511100/4588595) loss:2.828 lr:0.0000100 epoch_Time:25876.0min: [2024-01-04 23:43:16,442][model8_pretrain.py][INFO] Epoch:[0/2](511100/4588595) loss:3.057 lr:0.0000100 epoch_Time:25876.0min: [2024-01-04 23:43:16,442][model8_pretrain.py][INFO] Epoch:[0/2](511100/4588595) loss:2.775 lr:0.0000100 epoch_Time:25876.0min: [2024-01-04 23:43:16,442][model8_pretrain.py][INFO] Epoch:[0/2](511100/4588595) loss:2.948 lr:0.0000100 
epoch_Time:25876.0min: [2024-01-04 23:43:16,442][model8_pretrain.py][INFO] Epoch:[0/2](511100/4588595) loss:2.648 lr:0.0000100 epoch_Time:25876.0min: [2024-01-04 23:43:16,442][model8_pretrain.py][INFO] Epoch:[0/2](511100/4588595) loss:3.180 lr:0.0000100 epoch_Time:25876.0min: [2024-01-04 23:43:16,442][model8_pretrain.py][INFO] Epoch:[0/2](511100/4588595) loss:2.912 lr:0.0000100 epoch_Time:25876.0min: [2024-01-04 23:43:16,442][model8_pretrain.py][INFO] Epoch:[0/2](511100/4588595) loss:2.490 lr:0.0000100 epoch_Time:25876.0min: [2024-01-04 23:43:53,391][model8_pretrain.py][INFO] Epoch:[0/2](511200/4588595) loss:2.653 lr:0.0000100 epoch_Time:25875.0min: [2024-01-04 23:43:53,391][model8_pretrain.py][INFO] Epoch:[0/2](511200/4588595) loss:3.379 lr:0.0000100 epoch_Time:25875.0min: [2024-01-04 23:43:53,391][model8_pretrain.py][INFO] Epoch:[0/2](511200/4588595) loss:2.947 lr:0.0000100 epoch_Time:25875.0min: [2024-01-04 23:43:53,391][model8_pretrain.py][INFO] Epoch:[0/2](511200/4588595) loss:3.032 lr:0.0000100 epoch_Time:25875.0min: [2024-01-04 23:43:53,391][model8_pretrain.py][INFO] Epoch:[0/2](511200/4588595) loss:2.809 lr:0.0000100 epoch_Time:25875.0min: [2024-01-04 23:43:53,391][model8_pretrain.py][INFO] Epoch:[0/2](511200/4588595) loss:2.889 lr:0.0000100 epoch_Time:25875.0min: [2024-01-04 23:43:53,391][model8_pretrain.py][INFO] Epoch:[0/2](511200/4588595) loss:2.741 lr:0.0000100 epoch_Time:25875.0min: [2024-01-04 23:43:53,391][model8_pretrain.py][INFO] Epoch:[0/2](511200/4588595) loss:2.539 lr:0.0000100 epoch_Time:25875.0min: [2024-01-04 23:44:30,331][model8_pretrain.py][INFO] Epoch:[0/2](511300/4588595) loss:2.343 lr:0.0000100 epoch_Time:25875.0min: [2024-01-04 23:44:30,331][model8_pretrain.py][INFO] Epoch:[0/2](511300/4588595) loss:2.716 lr:0.0000100 epoch_Time:25875.0min: [2024-01-04 23:44:30,331][model8_pretrain.py][INFO] Epoch:[0/2](511300/4588595) loss:2.406 lr:0.0000100 epoch_Time:25875.0min: [2024-01-04 23:44:30,331][model8_pretrain.py][INFO] 
Epoch:[0/2](511300/4588595) loss:2.944 lr:0.0000100 epoch_Time:25875.0min: [2024-01-04 23:44:30,331][model8_pretrain.py][INFO] Epoch:[0/2](511300/4588595) loss:2.709 lr:0.0000100 epoch_Time:25875.0min: [2024-01-04 23:44:30,331][model8_pretrain.py][INFO] Epoch:[0/2](511300/4588595) loss:2.636 lr:0.0000100 epoch_Time:25875.0min: [2024-01-04 23:44:30,331][model8_pretrain.py][INFO] Epoch:[0/2](511300/4588595) loss:2.751 lr:0.0000100 epoch_Time:25875.0min: [2024-01-04 23:44:30,331][model8_pretrain.py][INFO] Epoch:[0/2](511300/4588595) loss:2.917 lr:0.0000100 epoch_Time:25875.0min: [2024-01-04 23:45:07,262][model8_pretrain.py][INFO] Epoch:[0/2](511400/4588595) loss:2.626 lr:0.0000100 epoch_Time:25874.0min: [2024-01-04 23:45:07,262][model8_pretrain.py][INFO] Epoch:[0/2](511400/4588595) loss:3.107 lr:0.0000100 epoch_Time:25874.0min: [2024-01-04 23:45:07,262][model8_pretrain.py][INFO] Epoch:[0/2](511400/4588595) loss:2.581 lr:0.0000100 epoch_Time:25874.0min: [2024-01-04 23:45:07,262][model8_pretrain.py][INFO] Epoch:[0/2](511400/4588595) loss:3.167 lr:0.0000100 epoch_Time:25874.0min: [2024-01-04 23:45:07,262][model8_pretrain.py][INFO] Epoch:[0/2](511400/4588595) loss:2.273 lr:0.0000100 epoch_Time:25874.0min: [2024-01-04 23:45:07,262][model8_pretrain.py][INFO] Epoch:[0/2](511400/4588595) loss:3.355 lr:0.0000100 epoch_Time:25874.0min: [2024-01-04 23:45:07,262][model8_pretrain.py][INFO] Epoch:[0/2](511400/4588595) loss:2.826 lr:0.0000100 epoch_Time:25874.0min: [2024-01-04 23:45:07,262][model8_pretrain.py][INFO] Epoch:[0/2](511400/4588595) loss:2.407 lr:0.0000100 epoch_Time:25874.0min: [2024-01-04 23:45:44,191][model8_pretrain.py][INFO] Epoch:[0/2](511500/4588595) loss:3.317 lr:0.0000100 epoch_Time:25873.0min: [2024-01-04 23:45:44,191][model8_pretrain.py][INFO] Epoch:[0/2](511500/4588595) loss:2.867 lr:0.0000100 epoch_Time:25873.0min: [2024-01-04 23:45:44,191][model8_pretrain.py][INFO] Epoch:[0/2](511500/4588595) loss:3.168 lr:0.0000100 epoch_Time:25873.0min: [2024-01-04 
23:45:44,191][model8_pretrain.py][INFO] Epoch:[0/2](511500/4588595) loss:2.670 lr:0.0000100 epoch_Time:25873.0min: [2024-01-04 23:45:44,191][model8_pretrain.py][INFO] Epoch:[0/2](511500/4588595) loss:2.721 lr:0.0000100 epoch_Time:25873.0min: [2024-01-04 23:45:44,191][model8_pretrain.py][INFO] Epoch:[0/2](511500/4588595) loss:2.706 lr:0.0000100 epoch_Time:25873.0min: [2024-01-04 23:45:44,191][model8_pretrain.py][INFO] Epoch:[0/2](511500/4588595) loss:1.914 lr:0.0000100 epoch_Time:25873.0min: [2024-01-04 23:45:44,191][model8_pretrain.py][INFO] Epoch:[0/2](511500/4588595) loss:3.117 lr:0.0000100 epoch_Time:25873.0min: [2024-01-04 23:46:21,141][model8_pretrain.py][INFO] Epoch:[0/2](511600/4588595) loss:2.806 lr:0.0000100 epoch_Time:25872.0min: [2024-01-04 23:46:21,141][model8_pretrain.py][INFO] Epoch:[0/2](511600/4588595) loss:2.969 lr:0.0000100 epoch_Time:25872.0min: [2024-01-04 23:46:21,141][model8_pretrain.py][INFO] Epoch:[0/2](511600/4588595) loss:2.934 lr:0.0000100 epoch_Time:25872.0min: [2024-01-04 23:46:21,141][model8_pretrain.py][INFO] Epoch:[0/2](511600/4588595) loss:2.795 lr:0.0000100 epoch_Time:25872.0min: [2024-01-04 23:46:21,141][model8_pretrain.py][INFO] Epoch:[0/2](511600/4588595) loss:2.959 lr:0.0000100 epoch_Time:25872.0min: [2024-01-04 23:46:21,141][model8_pretrain.py][INFO] Epoch:[0/2](511600/4588595) loss:2.873 lr:0.0000100 epoch_Time:25872.0min: [2024-01-04 23:46:21,142][model8_pretrain.py][INFO] Epoch:[0/2](511600/4588595) loss:2.409 lr:0.0000100 epoch_Time:25872.0min: [2024-01-04 23:46:21,142][model8_pretrain.py][INFO] Epoch:[0/2](511600/4588595) loss:2.906 lr:0.0000100 epoch_Time:25872.0min: [2024-01-04 23:47:03,229][model8_pretrain.py][INFO] Epoch:[0/2](511700/4588595) loss:2.552 lr:0.0000100 epoch_Time:25872.0min: [2024-01-04 23:47:03,229][model8_pretrain.py][INFO] Epoch:[0/2](511700/4588595) loss:2.590 lr:0.0000100 epoch_Time:25872.0min: [2024-01-04 23:47:03,229][model8_pretrain.py][INFO] Epoch:[0/2](511700/4588595) loss:3.100 lr:0.0000100 
epoch_Time:25872.0min: [2024-01-04 23:47:03,229][model8_pretrain.py][INFO] Epoch:[0/2](511700/4588595) loss:3.301 lr:0.0000100 epoch_Time:25872.0min: [2024-01-04 23:47:03,229][model8_pretrain.py][INFO] Epoch:[0/2](511700/4588595) loss:2.946 lr:0.0000100 epoch_Time:25872.0min: [2024-01-04 23:47:03,229][model8_pretrain.py][INFO] Epoch:[0/2](511700/4588595) loss:2.470 lr:0.0000100 epoch_Time:25872.0min: [2024-01-04 23:47:03,229][model8_pretrain.py][INFO] Epoch:[0/2](511700/4588595) loss:3.465 lr:0.0000100 epoch_Time:25872.0min: [2024-01-04 23:47:03,229][model8_pretrain.py][INFO] Epoch:[0/2](511700/4588595) loss:3.248 lr:0.0000100 epoch_Time:25872.0min: [2024-01-04 23:47:45,191][model8_pretrain.py][INFO] Epoch:[0/2](511800/4588595) loss:2.830 lr:0.0000100 epoch_Time:25872.0min: [2024-01-04 23:47:45,190][model8_pretrain.py][INFO] Epoch:[0/2](511800/4588595) loss:2.544 lr:0.0000100 epoch_Time:25872.0min: [2024-01-04 23:47:45,191][model8_pretrain.py][INFO] Epoch:[0/2](511800/4588595) loss:2.758 lr:0.0000100 epoch_Time:25872.0min: [2024-01-04 23:47:45,191][model8_pretrain.py][INFO] Epoch:[0/2](511800/4588595) loss:3.012 lr:0.0000100 epoch_Time:25872.0min: [2024-01-04 23:47:45,191][model8_pretrain.py][INFO] Epoch:[0/2](511800/4588595) loss:3.292 lr:0.0000100 epoch_Time:25872.0min: [2024-01-04 23:47:45,191][model8_pretrain.py][INFO] Epoch:[0/2](511800/4588595) loss:3.065 lr:0.0000100 epoch_Time:25872.0min: [2024-01-04 23:47:45,191][model8_pretrain.py][INFO] Epoch:[0/2](511800/4588595) loss:2.953 lr:0.0000100 epoch_Time:25872.0min: [2024-01-04 23:47:45,191][model8_pretrain.py][INFO] Epoch:[0/2](511800/4588595) loss:2.759 lr:0.0000100 epoch_Time:25872.0min: [2024-01-04 23:48:22,129][model8_pretrain.py][INFO] Epoch:[0/2](511900/4588595) loss:2.880 lr:0.0000100 epoch_Time:25871.0min: [2024-01-04 23:48:22,129][model8_pretrain.py][INFO] Epoch:[0/2](511900/4588595) loss:2.381 lr:0.0000100 epoch_Time:25871.0min: [2024-01-04 23:48:22,129][model8_pretrain.py][INFO] 
Epoch:[0/2](511900/4588595) loss:2.666 lr:0.0000100 epoch_Time:25871.0min: [2024-01-04 23:48:22,129][model8_pretrain.py][INFO] Epoch:[0/2](511900/4588595) loss:2.659 lr:0.0000100 epoch_Time:25871.0min: [2024-01-04 23:48:22,129][model8_pretrain.py][INFO] Epoch:[0/2](511900/4588595) loss:2.827 lr:0.0000100 epoch_Time:25871.0min: [2024-01-04 23:48:22,129][model8_pretrain.py][INFO] Epoch:[0/2](511900/4588595) loss:2.930 lr:0.0000100 epoch_Time:25871.0min: [2024-01-04 23:48:22,130][model8_pretrain.py][INFO] Epoch:[0/2](511900/4588595) loss:2.888 lr:0.0000100 epoch_Time:25871.0min: [2024-01-04 23:48:22,130][model8_pretrain.py][INFO] Epoch:[0/2](511900/4588595) loss:3.053 lr:0.0000100 epoch_Time:25871.0min: [2024-01-04 23:48:59,066][model8_pretrain.py][INFO] Epoch:[0/2](512000/4588595) loss:2.919 lr:0.0000100 epoch_Time:25870.0min: [2024-01-04 23:48:59,066][model8_pretrain.py][INFO] Epoch:[0/2](512000/4588595) loss:2.984 lr:0.0000100 epoch_Time:25870.0min: [2024-01-04 23:48:59,066][model8_pretrain.py][INFO] Epoch:[0/2](512000/4588595) loss:2.694 lr:0.0000100 epoch_Time:25870.0min: [2024-01-04 23:48:59,066][model8_pretrain.py][INFO] Epoch:[0/2](512000/4588595) loss:2.752 lr:0.0000100 epoch_Time:25870.0min: [2024-01-04 23:48:59,066][model8_pretrain.py][INFO] Epoch:[0/2](512000/4588595) loss:2.609 lr:0.0000100 epoch_Time:25870.0min: [2024-01-04 23:48:59,066][model8_pretrain.py][INFO] Epoch:[0/2](512000/4588595) loss:2.920 lr:0.0000100 epoch_Time:25870.0min: [2024-01-04 23:48:59,066][model8_pretrain.py][INFO] Epoch:[0/2](512000/4588595) loss:3.027 lr:0.0000100 epoch_Time:25870.0min: [2024-01-04 23:48:59,066][model8_pretrain.py][INFO] Epoch:[0/2](512000/4588595) loss:2.713 lr:0.0000100 epoch_Time:25870.0min: [2024-01-04 23:49:36,014][model8_pretrain.py][INFO] Epoch:[0/2](512100/4588595) loss:3.032 lr:0.0000100 epoch_Time:25870.0min: [2024-01-04 23:49:36,014][model8_pretrain.py][INFO] Epoch:[0/2](512100/4588595) loss:2.533 lr:0.0000100 epoch_Time:25870.0min: [2024-01-04 
23:49:36,014][model8_pretrain.py][INFO] Epoch:[0/2](512100/4588595) loss:2.874 lr:0.0000100 epoch_Time:25870.0min: [2024-01-04 23:49:36,014][model8_pretrain.py][INFO] Epoch:[0/2](512100/4588595) loss:2.975 lr:0.0000100 epoch_Time:25870.0min: [2024-01-04 23:49:36,014][model8_pretrain.py][INFO] Epoch:[0/2](512100/4588595) loss:2.666 lr:0.0000100 epoch_Time:25870.0min: [2024-01-04 23:49:36,014][model8_pretrain.py][INFO] Epoch:[0/2](512100/4588595) loss:3.198 lr:0.0000100 epoch_Time:25870.0min: [2024-01-04 23:49:36,014][model8_pretrain.py][INFO] Epoch:[0/2](512100/4588595) loss:2.791 lr:0.0000100 epoch_Time:25870.0min: [2024-01-04 23:49:36,014][model8_pretrain.py][INFO] Epoch:[0/2](512100/4588595) loss:2.775 lr:0.0000100 epoch_Time:25870.0min: [2024-01-04 23:50:12,956][model8_pretrain.py][INFO] Epoch:[0/2](512200/4588595) loss:2.715 lr:0.0000100 epoch_Time:25869.0min: [2024-01-04 23:50:12,956][model8_pretrain.py][INFO] Epoch:[0/2](512200/4588595) loss:2.612 lr:0.0000100 epoch_Time:25869.0min: [2024-01-04 23:50:12,956][model8_pretrain.py][INFO] Epoch:[0/2](512200/4588595) loss:3.363 lr:0.0000100 epoch_Time:25869.0min: [2024-01-04 23:50:12,956][model8_pretrain.py][INFO] Epoch:[0/2](512200/4588595) loss:3.121 lr:0.0000100 epoch_Time:25869.0min: [2024-01-04 23:50:12,956][model8_pretrain.py][INFO] Epoch:[0/2](512200/4588595) loss:3.061 lr:0.0000100 epoch_Time:25869.0min: [2024-01-04 23:50:12,956][model8_pretrain.py][INFO] Epoch:[0/2](512200/4588595) loss:3.132 lr:0.0000100 epoch_Time:25869.0min: [2024-01-04 23:50:12,956][model8_pretrain.py][INFO] Epoch:[0/2](512200/4588595) loss:2.779 lr:0.0000100 epoch_Time:25869.0min: [2024-01-04 23:50:12,956][model8_pretrain.py][INFO] Epoch:[0/2](512200/4588595) loss:2.878 lr:0.0000100 epoch_Time:25869.0min: [2024-01-04 23:50:49,889][model8_pretrain.py][INFO] Epoch:[0/2](512300/4588595) loss:3.316 lr:0.0000100 epoch_Time:25868.0min: [2024-01-04 23:50:49,889][model8_pretrain.py][INFO] Epoch:[0/2](512300/4588595) loss:2.863 lr:0.0000100 
epoch_Time:25868.0min: [2024-01-04 23:50:49,889][model8_pretrain.py][INFO] Epoch:[0/2](512300/4588595) loss:2.910 lr:0.0000100 epoch_Time:25868.0min: [2024-01-04 23:50:49,889][model8_pretrain.py][INFO] Epoch:[0/2](512300/4588595) loss:2.808 lr:0.0000100 epoch_Time:25868.0min: [2024-01-04 23:50:49,889][model8_pretrain.py][INFO] Epoch:[0/2](512300/4588595) loss:3.046 lr:0.0000100 epoch_Time:25868.0min: [2024-01-04 23:50:49,889][model8_pretrain.py][INFO] Epoch:[0/2](512300/4588595) loss:2.469 lr:0.0000100 epoch_Time:25868.0min: [2024-01-04 23:50:49,889][model8_pretrain.py][INFO] Epoch:[0/2](512300/4588595) loss:2.724 lr:0.0000100 epoch_Time:25868.0min: [2024-01-04 23:50:49,889][model8_pretrain.py][INFO] Epoch:[0/2](512300/4588595) loss:2.871 lr:0.0000100 epoch_Time:25868.0min: [2024-01-04 23:51:26,822][model8_pretrain.py][INFO] Epoch:[0/2](512400/4588595) loss:3.257 lr:0.0000100 epoch_Time:25867.0min: [2024-01-04 23:51:26,822][model8_pretrain.py][INFO] Epoch:[0/2](512400/4588595) loss:2.691 lr:0.0000100 epoch_Time:25867.0min: [2024-01-04 23:51:26,822][model8_pretrain.py][INFO] Epoch:[0/2](512400/4588595) loss:2.891 lr:0.0000100 epoch_Time:25867.0min: [2024-01-04 23:51:26,822][model8_pretrain.py][INFO] Epoch:[0/2](512400/4588595) loss:3.110 lr:0.0000100 epoch_Time:25867.0min: [2024-01-04 23:51:26,822][model8_pretrain.py][INFO] Epoch:[0/2](512400/4588595) loss:3.031 lr:0.0000100 epoch_Time:25867.0min: [2024-01-04 23:51:26,822][model8_pretrain.py][INFO] Epoch:[0/2](512400/4588595) loss:3.047 lr:0.0000100 epoch_Time:25867.0min: [2024-01-04 23:51:26,822][model8_pretrain.py][INFO] Epoch:[0/2](512400/4588595) loss:2.843 lr:0.0000100 epoch_Time:25867.0min: [2024-01-04 23:51:26,822][model8_pretrain.py][INFO] Epoch:[0/2](512400/4588595) loss:3.352 lr:0.0000100 epoch_Time:25867.0min: [2024-01-04 23:52:08,736][model8_pretrain.py][INFO] Epoch:[0/2](512500/4588595) loss:3.116 lr:0.0000100 epoch_Time:25867.0min: [2024-01-04 23:52:08,736][model8_pretrain.py][INFO] 
Epoch:[0/2](512500/4588595) loss:2.836 lr:0.0000100 epoch_Time:25867.0min: [2024-01-04 23:52:08,740][model8_pretrain.py][INFO] Epoch:[0/2](512500/4588595) loss:3.219 lr:0.0000100 epoch_Time:25867.0min: [2024-01-04 23:52:08,741][model8_pretrain.py][INFO] Epoch:[0/2](512500/4588595) loss:2.766 lr:0.0000100 epoch_Time:25867.0min: [2024-01-04 23:52:08,741][model8_pretrain.py][INFO] Epoch:[0/2](512500/4588595) loss:3.031 lr:0.0000100 epoch_Time:25867.0min: [2024-01-04 23:52:08,741][model8_pretrain.py][INFO] Epoch:[0/2](512500/4588595) loss:3.055 lr:0.0000100 epoch_Time:25867.0min: [2024-01-04 23:52:08,741][model8_pretrain.py][INFO] Epoch:[0/2](512500/4588595) loss:2.783 lr:0.0000100 epoch_Time:25867.0min: [2024-01-04 23:52:08,795][model8_pretrain.py][INFO] Epoch:[0/2](512500/4588595) loss:2.835 lr:0.0000100 epoch_Time:25867.0min: [2024-01-04 23:52:50,795][model8_pretrain.py][INFO] Epoch:[0/2](512600/4588595) loss:3.052 lr:0.0000100 epoch_Time:25867.0min: [2024-01-04 23:52:50,795][model8_pretrain.py][INFO] Epoch:[0/2](512600/4588595) loss:3.088 lr:0.0000100 epoch_Time:25867.0min: [2024-01-04 23:52:50,795][model8_pretrain.py][INFO] Epoch:[0/2](512600/4588595) loss:2.107 lr:0.0000100 epoch_Time:25867.0min: [2024-01-04 23:52:50,795][model8_pretrain.py][INFO] Epoch:[0/2](512600/4588595) loss:2.818 lr:0.0000100 epoch_Time:25867.0min: [2024-01-04 23:52:50,795][model8_pretrain.py][INFO] Epoch:[0/2](512600/4588595) loss:2.913 lr:0.0000100 epoch_Time:25867.0min: [2024-01-04 23:52:50,795][model8_pretrain.py][INFO] Epoch:[0/2](512600/4588595) loss:2.805 lr:0.0000100 epoch_Time:25867.0min: [2024-01-04 23:52:50,795][model8_pretrain.py][INFO] Epoch:[0/2](512600/4588595) loss:2.702 lr:0.0000100 epoch_Time:25867.0min: [2024-01-04 23:52:50,795][model8_pretrain.py][INFO] Epoch:[0/2](512600/4588595) loss:2.831 lr:0.0000100 epoch_Time:25867.0min: [2024-01-04 23:53:27,734][model8_pretrain.py][INFO] Epoch:[0/2](512700/4588595) loss:2.976 lr:0.0000100 epoch_Time:25866.0min: [2024-01-04 
23:53:27,734][model8_pretrain.py][INFO] Epoch:[0/2](512700/4588595) loss:2.368 lr:0.0000100 epoch_Time:25866.0min: [2024-01-04 23:53:27,734][model8_pretrain.py][INFO] Epoch:[0/2](512700/4588595) loss:2.526 lr:0.0000100 epoch_Time:25866.0min: [2024-01-04 23:53:27,734][model8_pretrain.py][INFO] Epoch:[0/2](512700/4588595) loss:2.925 lr:0.0000100 epoch_Time:25866.0min: [2024-01-04 23:53:27,734][model8_pretrain.py][INFO] Epoch:[0/2](512700/4588595) loss:2.984 lr:0.0000100 epoch_Time:25866.0min: [2024-01-04 23:53:27,734][model8_pretrain.py][INFO] Epoch:[0/2](512700/4588595) loss:2.726 lr:0.0000100 epoch_Time:25866.0min: [2024-01-04 23:53:27,734][model8_pretrain.py][INFO] Epoch:[0/2](512700/4588595) loss:3.347 lr:0.0000100 epoch_Time:25866.0min: [2024-01-04 23:53:27,735][model8_pretrain.py][INFO] Epoch:[0/2](512700/4588595) loss:2.825 lr:0.0000100 epoch_Time:25866.0min: [2024-01-04 23:54:04,684][model8_pretrain.py][INFO] Epoch:[0/2](512800/4588595) loss:3.129 lr:0.0000100 epoch_Time:25865.0min: [2024-01-04 23:54:04,684][model8_pretrain.py][INFO] Epoch:[0/2](512800/4588595) loss:2.647 lr:0.0000100 epoch_Time:25865.0min: [2024-01-04 23:54:04,684][model8_pretrain.py][INFO] Epoch:[0/2](512800/4588595) loss:2.789 lr:0.0000100 epoch_Time:25865.0min: [2024-01-04 23:54:04,684][model8_pretrain.py][INFO] Epoch:[0/2](512800/4588595) loss:3.174 lr:0.0000100 epoch_Time:25865.0min: [2024-01-04 23:54:04,684][model8_pretrain.py][INFO] Epoch:[0/2](512800/4588595) loss:2.897 lr:0.0000100 epoch_Time:25865.0min: [2024-01-04 23:54:04,684][model8_pretrain.py][INFO] Epoch:[0/2](512800/4588595) loss:3.175 lr:0.0000100 epoch_Time:25865.0min: [2024-01-04 23:54:04,684][model8_pretrain.py][INFO] Epoch:[0/2](512800/4588595) loss:3.032 lr:0.0000100 epoch_Time:25865.0min: [2024-01-04 23:54:04,684][model8_pretrain.py][INFO] Epoch:[0/2](512800/4588595) loss:2.440 lr:0.0000100 epoch_Time:25865.0min: [2024-01-04 23:54:41,632][model8_pretrain.py][INFO] Epoch:[0/2](512900/4588595) loss:2.807 lr:0.0000100 
epoch_Time:25865.0min: [2024-01-04 23:54:41,632][model8_pretrain.py][INFO] Epoch:[0/2](512900/4588595) loss:2.360 lr:0.0000100 epoch_Time:25865.0min: [2024-01-04 23:54:41,632][model8_pretrain.py][INFO] Epoch:[0/2](512900/4588595) loss:2.910 lr:0.0000100 epoch_Time:25865.0min: [2024-01-04 23:54:41,632][model8_pretrain.py][INFO] Epoch:[0/2](512900/4588595) loss:2.720 lr:0.0000100 epoch_Time:25865.0min: [2024-01-04 23:54:41,632][model8_pretrain.py][INFO] Epoch:[0/2](512900/4588595) loss:3.100 lr:0.0000100 epoch_Time:25865.0min: [2024-01-04 23:54:41,632][model8_pretrain.py][INFO] Epoch:[0/2](512900/4588595) loss:2.357 lr:0.0000100 epoch_Time:25865.0min: [2024-01-04 23:54:41,633][model8_pretrain.py][INFO] Epoch:[0/2](512900/4588595) loss:2.649 lr:0.0000100 epoch_Time:25865.0min: [2024-01-04 23:54:41,633][model8_pretrain.py][INFO] Epoch:[0/2](512900/4588595) loss:2.985 lr:0.0000100 epoch_Time:25865.0min: [2024-01-04 23:55:18,553][model8_pretrain.py][INFO] Epoch:[0/2](513000/4588595) loss:2.407 lr:0.0000100 epoch_Time:25864.0min: [2024-01-04 23:55:18,553][model8_pretrain.py][INFO] Epoch:[0/2](513000/4588595) loss:3.174 lr:0.0000100 epoch_Time:25864.0min: [2024-01-04 23:55:18,553][model8_pretrain.py][INFO] Epoch:[0/2](513000/4588595) loss:2.730 lr:0.0000100 epoch_Time:25864.0min: [2024-01-04 23:55:18,553][model8_pretrain.py][INFO] Epoch:[0/2](513000/4588595) loss:2.796 lr:0.0000100 epoch_Time:25864.0min: [2024-01-04 23:55:18,553][model8_pretrain.py][INFO] Epoch:[0/2](513000/4588595) loss:3.016 lr:0.0000100 epoch_Time:25864.0min: [2024-01-04 23:55:18,553][model8_pretrain.py][INFO] Epoch:[0/2](513000/4588595) loss:2.955 lr:0.0000100 epoch_Time:25864.0min: [2024-01-04 23:55:18,553][model8_pretrain.py][INFO] Epoch:[0/2](513000/4588595) loss:3.494 lr:0.0000100 epoch_Time:25864.0min: [2024-01-04 23:55:18,554][model8_pretrain.py][INFO] Epoch:[0/2](513000/4588595) loss:2.576 lr:0.0000100 epoch_Time:25864.0min: [2024-01-04 23:55:55,487][model8_pretrain.py][INFO] 
Epoch:[0/2](513100/4588595) loss:2.845 lr:0.0000100 epoch_Time:25863.0min: [2024-01-04 23:55:55,487][model8_pretrain.py][INFO] Epoch:[0/2](513100/4588595) loss:2.568 lr:0.0000100 epoch_Time:25863.0min: [2024-01-04 23:55:55,487][model8_pretrain.py][INFO] Epoch:[0/2](513100/4588595) loss:2.759 lr:0.0000100 epoch_Time:25863.0min: [2024-01-04 23:55:55,487][model8_pretrain.py][INFO] Epoch:[0/2](513100/4588595) loss:2.798 lr:0.0000100 epoch_Time:25863.0min: [2024-01-04 23:55:55,487][model8_pretrain.py][INFO] Epoch:[0/2](513100/4588595) loss:2.644 lr:0.0000100 epoch_Time:25863.0min: [2024-01-04 23:55:55,487][model8_pretrain.py][INFO] Epoch:[0/2](513100/4588595) loss:2.423 lr:0.0000100 epoch_Time:25863.0min: [2024-01-04 23:55:55,487][model8_pretrain.py][INFO] Epoch:[0/2](513100/4588595) loss:2.482 lr:0.0000100 epoch_Time:25863.0min: [2024-01-04 23:55:55,487][model8_pretrain.py][INFO] Epoch:[0/2](513100/4588595) loss:2.929 lr:0.0000100 epoch_Time:25863.0min: [2024-01-04 23:56:32,415][model8_pretrain.py][INFO] Epoch:[0/2](513200/4588595) loss:3.319 lr:0.0000100 epoch_Time:25862.0min: [2024-01-04 23:56:32,415][model8_pretrain.py][INFO] Epoch:[0/2](513200/4588595) loss:2.659 lr:0.0000100 epoch_Time:25862.0min: [2024-01-04 23:56:32,415][model8_pretrain.py][INFO] Epoch:[0/2](513200/4588595) loss:3.333 lr:0.0000100 epoch_Time:25862.0min: [2024-01-04 23:56:32,415][model8_pretrain.py][INFO] Epoch:[0/2](513200/4588595) loss:3.332 lr:0.0000100 epoch_Time:25862.0min: [2024-01-04 23:56:32,415][model8_pretrain.py][INFO] Epoch:[0/2](513200/4588595) loss:3.107 lr:0.0000100 epoch_Time:25862.0min: [2024-01-04 23:56:32,415][model8_pretrain.py][INFO] Epoch:[0/2](513200/4588595) loss:2.799 lr:0.0000100 epoch_Time:25862.0min: [2024-01-04 23:56:32,415][model8_pretrain.py][INFO] Epoch:[0/2](513200/4588595) loss:2.643 lr:0.0000100 epoch_Time:25862.0min: [2024-01-04 23:56:32,415][model8_pretrain.py][INFO] Epoch:[0/2](513200/4588595) loss:2.574 lr:0.0000100 epoch_Time:25862.0min: [2024-01-04 
23:57:11,086][model8_pretrain.py][INFO] Epoch:[0/2](513300/4588595) loss:3.002 lr:0.0000100 epoch_Time:25862.0min: [2024-01-04 23:57:11,086][model8_pretrain.py][INFO] Epoch:[0/2](513300/4588595) loss:2.264 lr:0.0000100 epoch_Time:25862.0min: [2024-01-04 23:57:11,086][model8_pretrain.py][INFO] Epoch:[0/2](513300/4588595) loss:2.746 lr:0.0000100 epoch_Time:25862.0min: [2024-01-04 23:57:11,086][model8_pretrain.py][INFO] Epoch:[0/2](513300/4588595) loss:2.934 lr:0.0000100 epoch_Time:25862.0min: [2024-01-04 23:57:11,086][model8_pretrain.py][INFO] Epoch:[0/2](513300/4588595) loss:3.280 lr:0.0000100 epoch_Time:25862.0min: [2024-01-04 23:57:11,086][model8_pretrain.py][INFO] Epoch:[0/2](513300/4588595) loss:2.245 lr:0.0000100 epoch_Time:25862.0min: [2024-01-04 23:57:11,086][model8_pretrain.py][INFO] Epoch:[0/2](513300/4588595) loss:2.618 lr:0.0000100 epoch_Time:25862.0min: [2024-01-04 23:57:11,086][model8_pretrain.py][INFO] Epoch:[0/2](513300/4588595) loss:3.112 lr:0.0000100 epoch_Time:25862.0min: [2024-01-04 23:57:56,375][model8_pretrain.py][INFO] Epoch:[0/2](513400/4588595) loss:3.272 lr:0.0000100 epoch_Time:25862.0min: [2024-01-04 23:57:56,375][model8_pretrain.py][INFO] Epoch:[0/2](513400/4588595) loss:2.489 lr:0.0000100 epoch_Time:25862.0min: [2024-01-04 23:57:56,375][model8_pretrain.py][INFO] Epoch:[0/2](513400/4588595) loss:2.766 lr:0.0000100 epoch_Time:25862.0min: [2024-01-04 23:57:56,375][model8_pretrain.py][INFO] Epoch:[0/2](513400/4588595) loss:2.748 lr:0.0000100 epoch_Time:25862.0min: [2024-01-04 23:57:56,375][model8_pretrain.py][INFO] Epoch:[0/2](513400/4588595) loss:2.501 lr:0.0000100 epoch_Time:25862.0min: [2024-01-04 23:57:56,375][model8_pretrain.py][INFO] Epoch:[0/2](513400/4588595) loss:2.950 lr:0.0000100 epoch_Time:25862.0min: [2024-01-04 23:57:56,375][model8_pretrain.py][INFO] Epoch:[0/2](513400/4588595) loss:2.851 lr:0.0000100 epoch_Time:25862.0min: [2024-01-04 23:57:56,375][model8_pretrain.py][INFO] Epoch:[0/2](513400/4588595) loss:2.738 lr:0.0000100 
epoch_Time:25862.0min: [2024-01-04 23:58:33,280][model8_pretrain.py][INFO] Epoch:[0/2](513500/4588595) loss:3.211 lr:0.0000100 epoch_Time:25861.0min: [2024-01-04 23:58:33,280][model8_pretrain.py][INFO] Epoch:[0/2](513500/4588595) loss:3.040 lr:0.0000100 epoch_Time:25861.0min: [2024-01-04 23:58:33,280][model8_pretrain.py][INFO] Epoch:[0/2](513500/4588595) loss:2.536 lr:0.0000100 epoch_Time:25861.0min: [2024-01-04 23:58:33,280][model8_pretrain.py][INFO] Epoch:[0/2](513500/4588595) loss:2.710 lr:0.0000100 epoch_Time:25861.0min: [2024-01-04 23:58:33,280][model8_pretrain.py][INFO] Epoch:[0/2](513500/4588595) loss:2.732 lr:0.0000100 epoch_Time:25861.0min: [2024-01-04 23:58:33,280][model8_pretrain.py][INFO] Epoch:[0/2](513500/4588595) loss:2.521 lr:0.0000100 epoch_Time:25861.0min: [2024-01-04 23:58:33,280][model8_pretrain.py][INFO] Epoch:[0/2](513500/4588595) loss:2.991 lr:0.0000100 epoch_Time:25861.0min: [2024-01-04 23:58:33,281][model8_pretrain.py][INFO] Epoch:[0/2](513500/4588595) loss:2.092 lr:0.0000100 epoch_Time:25861.0min: [2024-01-04 23:59:10,210][model8_pretrain.py][INFO] Epoch:[0/2](513600/4588595) loss:2.859 lr:0.0000100 epoch_Time:25860.0min: [2024-01-04 23:59:10,210][model8_pretrain.py][INFO] Epoch:[0/2](513600/4588595) loss:3.002 lr:0.0000100 epoch_Time:25860.0min: [2024-01-04 23:59:10,210][model8_pretrain.py][INFO] Epoch:[0/2](513600/4588595) loss:2.824 lr:0.0000100 epoch_Time:25860.0min: [2024-01-04 23:59:10,210][model8_pretrain.py][INFO] Epoch:[0/2](513600/4588595) loss:2.567 lr:0.0000100 epoch_Time:25860.0min: [2024-01-04 23:59:10,210][model8_pretrain.py][INFO] Epoch:[0/2](513600/4588595) loss:2.645 lr:0.0000100 epoch_Time:25860.0min: [2024-01-04 23:59:10,210][model8_pretrain.py][INFO] Epoch:[0/2](513600/4588595) loss:2.320 lr:0.0000100 epoch_Time:25860.0min: [2024-01-04 23:59:10,210][model8_pretrain.py][INFO] Epoch:[0/2](513600/4588595) loss:3.213 lr:0.0000100 epoch_Time:25860.0min: [2024-01-04 23:59:10,210][model8_pretrain.py][INFO] 
Epoch:[0/2](513600/4588595) loss:2.526 lr:0.0000100 epoch_Time:25860.0min: [2024-01-04 23:59:47,146][model8_pretrain.py][INFO] Epoch:[0/2](513700/4588595) loss:3.055 lr:0.0000100 epoch_Time:25860.0min: [2024-01-04 23:59:47,146][model8_pretrain.py][INFO] Epoch:[0/2](513700/4588595) loss:2.476 lr:0.0000100 epoch_Time:25860.0min: [2024-01-04 23:59:47,146][model8_pretrain.py][INFO] Epoch:[0/2](513700/4588595) loss:2.166 lr:0.0000100 epoch_Time:25860.0min: [2024-01-04 23:59:47,146][model8_pretrain.py][INFO] Epoch:[0/2](513700/4588595) loss:3.210 lr:0.0000100 epoch_Time:25860.0min: [2024-01-04 23:59:47,146][model8_pretrain.py][INFO] Epoch:[0/2](513700/4588595) loss:3.208 lr:0.0000100 epoch_Time:25860.0min: [2024-01-04 23:59:47,146][model8_pretrain.py][INFO] Epoch:[0/2](513700/4588595) loss:2.845 lr:0.0000100 epoch_Time:25860.0min: [2024-01-04 23:59:47,147][model8_pretrain.py][INFO] Epoch:[0/2](513700/4588595) loss:2.375 lr:0.0000100 epoch_Time:25860.0min: [2024-01-04 23:59:47,147][model8_pretrain.py][INFO] Epoch:[0/2](513700/4588595) loss:2.758 lr:0.0000100 epoch_Time:25860.0min: [2024-01-05 00:00:24,083][model8_pretrain.py][INFO] Epoch:[0/2](513800/4588595) loss:2.971 lr:0.0000100 epoch_Time:25859.0min: [2024-01-05 00:00:24,083][model8_pretrain.py][INFO] Epoch:[0/2](513800/4588595) loss:2.665 lr:0.0000100 epoch_Time:25859.0min: [2024-01-05 00:00:24,083][model8_pretrain.py][INFO] Epoch:[0/2](513800/4588595) loss:2.868 lr:0.0000100 epoch_Time:25859.0min: [2024-01-05 00:00:24,083][model8_pretrain.py][INFO] Epoch:[0/2](513800/4588595) loss:3.277 lr:0.0000100 epoch_Time:25859.0min: [2024-01-05 00:00:24,083][model8_pretrain.py][INFO] Epoch:[0/2](513800/4588595) loss:3.041 lr:0.0000100 epoch_Time:25859.0min: [2024-01-05 00:00:24,083][model8_pretrain.py][INFO] Epoch:[0/2](513800/4588595) loss:2.443 lr:0.0000100 epoch_Time:25859.0min: [2024-01-05 00:00:24,083][model8_pretrain.py][INFO] Epoch:[0/2](513800/4588595) loss:2.904 lr:0.0000100 epoch_Time:25859.0min: [2024-01-05 
00:00:24,083][model8_pretrain.py][INFO] Epoch:[0/2](513800/4588595) loss:3.153 lr:0.0000100 epoch_Time:25859.0min: [2024-01-05 00:01:00,985][model8_pretrain.py][INFO] Epoch:[0/2](513900/4588595) loss:3.181 lr:0.0000100 epoch_Time:25858.0min: [2024-01-05 00:01:00,985][model8_pretrain.py][INFO] Epoch:[0/2](513900/4588595) loss:2.957 lr:0.0000100 epoch_Time:25858.0min: [2024-01-05 00:01:00,985][model8_pretrain.py][INFO] Epoch:[0/2](513900/4588595) loss:2.721 lr:0.0000100 epoch_Time:25858.0min: [2024-01-05 00:01:00,985][model8_pretrain.py][INFO] Epoch:[0/2](513900/4588595) loss:3.033 lr:0.0000100 epoch_Time:25858.0min: [2024-01-05 00:01:00,985][model8_pretrain.py][INFO] Epoch:[0/2](513900/4588595) loss:2.181 lr:0.0000100 epoch_Time:25858.0min: [2024-01-05 00:01:00,985][model8_pretrain.py][INFO] Epoch:[0/2](513900/4588595) loss:2.260 lr:0.0000100 epoch_Time:25858.0min: [2024-01-05 00:01:00,985][model8_pretrain.py][INFO] Epoch:[0/2](513900/4588595) loss:2.189 lr:0.0000100 epoch_Time:25858.0min: [2024-01-05 00:01:00,985][model8_pretrain.py][INFO] Epoch:[0/2](513900/4588595) loss:3.424 lr:0.0000100 epoch_Time:25858.0min: [2024-01-05 00:01:37,906][model8_pretrain.py][INFO] Epoch:[0/2](514000/4588595) loss:3.619 lr:0.0000100 epoch_Time:25858.0min: [2024-01-05 00:01:37,906][model8_pretrain.py][INFO] Epoch:[0/2](514000/4588595) loss:2.483 lr:0.0000100 epoch_Time:25858.0min: [2024-01-05 00:01:37,906][model8_pretrain.py][INFO] Epoch:[0/2](514000/4588595) loss:2.794 lr:0.0000100 epoch_Time:25858.0min: [2024-01-05 00:01:37,906][model8_pretrain.py][INFO] Epoch:[0/2](514000/4588595) loss:2.769 lr:0.0000100 epoch_Time:25858.0min: [2024-01-05 00:01:37,906][model8_pretrain.py][INFO] Epoch:[0/2](514000/4588595) loss:2.919 lr:0.0000100 epoch_Time:25858.0min: [2024-01-05 00:01:37,906][model8_pretrain.py][INFO] Epoch:[0/2](514000/4588595) loss:2.654 lr:0.0000100 epoch_Time:25858.0min: [2024-01-05 00:01:37,906][model8_pretrain.py][INFO] Epoch:[0/2](514000/4588595) loss:2.500 lr:0.0000100 
epoch_Time:25858.0min: [2024-01-05 00:01:37,907][model8_pretrain.py][INFO] Epoch:[0/2](514000/4588595) loss:3.136 lr:0.0000100 epoch_Time:25858.0min: [2024-01-05 00:02:16,581][model8_pretrain.py][INFO] Epoch:[0/2](514100/4588595) loss:2.998 lr:0.0000100 epoch_Time:25857.0min: [2024-01-05 00:02:16,581][model8_pretrain.py][INFO] Epoch:[0/2](514100/4588595) loss:2.957 lr:0.0000100 epoch_Time:25857.0min: [2024-01-05 00:02:16,581][model8_pretrain.py][INFO] Epoch:[0/2](514100/4588595) loss:2.759 lr:0.0000100 epoch_Time:25857.0min: [2024-01-05 00:02:16,581][model8_pretrain.py][INFO] Epoch:[0/2](514100/4588595) loss:2.873 lr:0.0000100 epoch_Time:25857.0min: [2024-01-05 00:02:16,581][model8_pretrain.py][INFO] Epoch:[0/2](514100/4588595) loss:2.752 lr:0.0000100 epoch_Time:25857.0min: [2024-01-05 00:02:16,581][model8_pretrain.py][INFO] Epoch:[0/2](514100/4588595) loss:2.226 lr:0.0000100 epoch_Time:25857.0min: [2024-01-05 00:02:16,581][model8_pretrain.py][INFO] Epoch:[0/2](514100/4588595) loss:2.666 lr:0.0000100 epoch_Time:25857.0min: [2024-01-05 00:02:16,582][model8_pretrain.py][INFO] Epoch:[0/2](514100/4588595) loss:3.110 lr:0.0000100 epoch_Time:25857.0min: [2024-01-05 00:03:01,981][model8_pretrain.py][INFO] Epoch:[0/2](514200/4588595) loss:3.039 lr:0.0000100 epoch_Time:25857.0min: [2024-01-05 00:03:01,981][model8_pretrain.py][INFO] Epoch:[0/2](514200/4588595) loss:2.591 lr:0.0000100 epoch_Time:25857.0min: [2024-01-05 00:03:01,981][model8_pretrain.py][INFO] Epoch:[0/2](514200/4588595) loss:2.894 lr:0.0000100 epoch_Time:25857.0min: [2024-01-05 00:03:01,982][model8_pretrain.py][INFO] Epoch:[0/2](514200/4588595) loss:3.176 lr:0.0000100 epoch_Time:25857.0min: [2024-01-05 00:03:01,982][model8_pretrain.py][INFO] Epoch:[0/2](514200/4588595) loss:2.920 lr:0.0000100 epoch_Time:25857.0min: [2024-01-05 00:03:01,982][model8_pretrain.py][INFO] Epoch:[0/2](514200/4588595) loss:2.819 lr:0.0000100 epoch_Time:25857.0min: [2024-01-05 00:03:01,982][model8_pretrain.py][INFO] 
Epoch:[0/2](514200/4588595) loss:3.210 lr:0.0000100 epoch_Time:25857.0min: [2024-01-05 00:03:01,983][model8_pretrain.py][INFO] Epoch:[0/2](514200/4588595) loss:2.819 lr:0.0000100 epoch_Time:25857.0min: [2024-01-05 00:03:38,914][model8_pretrain.py][INFO] Epoch:[0/2](514300/4588595) loss:2.535 lr:0.0000100 epoch_Time:25857.0min: [2024-01-05 00:03:38,914][model8_pretrain.py][INFO] Epoch:[0/2](514300/4588595) loss:2.979 lr:0.0000100 epoch_Time:25857.0min: [2024-01-05 00:03:38,914][model8_pretrain.py][INFO] Epoch:[0/2](514300/4588595) loss:2.856 lr:0.0000100 epoch_Time:25857.0min: [2024-01-05 00:03:38,914][model8_pretrain.py][INFO] Epoch:[0/2](514300/4588595) loss:3.106 lr:0.0000100 epoch_Time:25857.0min: [2024-01-05 00:03:38,914][model8_pretrain.py][INFO] Epoch:[0/2](514300/4588595) loss:2.711 lr:0.0000100 epoch_Time:25857.0min: [2024-01-05 00:03:38,914][model8_pretrain.py][INFO] Epoch:[0/2](514300/4588595) loss:2.925 lr:0.0000100 epoch_Time:25857.0min: [2024-01-05 00:03:38,914][model8_pretrain.py][INFO] Epoch:[0/2](514300/4588595) loss:2.591 lr:0.0000100 epoch_Time:25857.0min: [2024-01-05 00:03:38,914][model8_pretrain.py][INFO] Epoch:[0/2](514300/4588595) loss:2.843 lr:0.0000100 epoch_Time:25857.0min: [2024-01-05 00:04:15,842][model8_pretrain.py][INFO] Epoch:[0/2](514400/4588595) loss:2.805 lr:0.0000100 epoch_Time:25855.0min: [2024-01-05 00:04:15,842][model8_pretrain.py][INFO] Epoch:[0/2](514400/4588595) loss:3.442 lr:0.0000100 epoch_Time:25855.0min: [2024-01-05 00:04:15,842][model8_pretrain.py][INFO] Epoch:[0/2](514400/4588595) loss:2.662 lr:0.0000100 epoch_Time:25855.0min: [2024-01-05 00:04:15,842][model8_pretrain.py][INFO] Epoch:[0/2](514400/4588595) loss:2.617 lr:0.0000100 epoch_Time:25855.0min: [2024-01-05 00:04:15,842][model8_pretrain.py][INFO] Epoch:[0/2](514400/4588595) loss:3.170 lr:0.0000100 epoch_Time:25855.0min: [2024-01-05 00:04:15,842][model8_pretrain.py][INFO] Epoch:[0/2](514400/4588595) loss:3.043 lr:0.0000100 epoch_Time:25855.0min: [2024-01-05 
00:04:15,842][model8_pretrain.py][INFO] Epoch:[0/2](514400/4588595) loss:3.068 lr:0.0000100 epoch_Time:25855.0min: [2024-01-05 00:04:15,842][model8_pretrain.py][INFO] Epoch:[0/2](514400/4588595) loss:3.096 lr:0.0000100 epoch_Time:25855.0min: [2024-01-05 00:04:52,775][model8_pretrain.py][INFO] Epoch:[0/2](514500/4588595) loss:2.788 lr:0.0000100 epoch_Time:25854.0min: [2024-01-05 00:04:52,775][model8_pretrain.py][INFO] Epoch:[0/2](514500/4588595) loss:2.366 lr:0.0000100 epoch_Time:25854.0min: [2024-01-05 00:04:52,775][model8_pretrain.py][INFO] Epoch:[0/2](514500/4588595) loss:3.260 lr:0.0000100 epoch_Time:25854.0min: [2024-01-05 00:04:52,775][model8_pretrain.py][INFO] Epoch:[0/2](514500/4588595) loss:2.080 lr:0.0000100 epoch_Time:25854.0min: [2024-01-05 00:04:52,775][model8_pretrain.py][INFO] Epoch:[0/2](514500/4588595) loss:2.519 lr:0.0000100 epoch_Time:25854.0min: [2024-01-05 00:04:52,775][model8_pretrain.py][INFO] Epoch:[0/2](514500/4588595) loss:2.479 lr:0.0000100 epoch_Time:25854.0min: [2024-01-05 00:04:52,775][model8_pretrain.py][INFO] Epoch:[0/2](514500/4588595) loss:2.866 lr:0.0000100 epoch_Time:25854.0min: [2024-01-05 00:04:52,775][model8_pretrain.py][INFO] Epoch:[0/2](514500/4588595) loss:2.338 lr:0.0000100 epoch_Time:25854.0min: [2024-01-05 00:05:29,701][model8_pretrain.py][INFO] Epoch:[0/2](514600/4588595) loss:3.281 lr:0.0000100 epoch_Time:25854.0min: [2024-01-05 00:05:29,701][model8_pretrain.py][INFO] Epoch:[0/2](514600/4588595) loss:3.145 lr:0.0000100 epoch_Time:25854.0min: [2024-01-05 00:05:29,701][model8_pretrain.py][INFO] Epoch:[0/2](514600/4588595) loss:2.344 lr:0.0000100 epoch_Time:25854.0min: [2024-01-05 00:05:29,701][model8_pretrain.py][INFO] Epoch:[0/2](514600/4588595) loss:2.899 lr:0.0000100 epoch_Time:25854.0min: [2024-01-05 00:05:29,701][model8_pretrain.py][INFO] Epoch:[0/2](514600/4588595) loss:2.999 lr:0.0000100 epoch_Time:25854.0min: [2024-01-05 00:05:29,701][model8_pretrain.py][INFO] Epoch:[0/2](514600/4588595) loss:2.738 lr:0.0000100 
epoch_Time:25854.0min: [2024-01-05 00:05:29,701][model8_pretrain.py][INFO] Epoch:[0/2](514600/4588595) loss:3.493 lr:0.0000100 epoch_Time:25854.0min: [2024-01-05 00:05:29,702][model8_pretrain.py][INFO] Epoch:[0/2](514600/4588595) loss:3.147 lr:0.0000100 epoch_Time:25854.0min: [2024-01-05 00:06:06,639][model8_pretrain.py][INFO] Epoch:[0/2](514700/4588595) loss:3.350 lr:0.0000100 epoch_Time:25853.0min: [2024-01-05 00:06:06,639][model8_pretrain.py][INFO] Epoch:[0/2](514700/4588595) loss:3.274 lr:0.0000100 epoch_Time:25853.0min: [2024-01-05 00:06:06,639][model8_pretrain.py][INFO] Epoch:[0/2](514700/4588595) loss:3.086 lr:0.0000100 epoch_Time:25853.0min: [2024-01-05 00:06:06,639][model8_pretrain.py][INFO] Epoch:[0/2](514700/4588595) loss:2.867 lr:0.0000100 epoch_Time:25853.0min: [2024-01-05 00:06:06,639][model8_pretrain.py][INFO] Epoch:[0/2](514700/4588595) loss:2.626 lr:0.0000100 epoch_Time:25853.0min: [2024-01-05 00:06:06,639][model8_pretrain.py][INFO] Epoch:[0/2](514700/4588595) loss:3.270 lr:0.0000100 epoch_Time:25853.0min: [2024-01-05 00:06:06,639][model8_pretrain.py][INFO] Epoch:[0/2](514700/4588595) loss:2.781 lr:0.0000100 epoch_Time:25853.0min: [2024-01-05 00:06:06,639][model8_pretrain.py][INFO] Epoch:[0/2](514700/4588595) loss:3.342 lr:0.0000100 epoch_Time:25853.0min: [2024-01-05 00:06:43,572][model8_pretrain.py][INFO] Epoch:[0/2](514800/4588595) loss:2.631 lr:0.0000100 epoch_Time:25853.0min: [2024-01-05 00:06:43,572][model8_pretrain.py][INFO] Epoch:[0/2](514800/4588595) loss:2.950 lr:0.0000100 epoch_Time:25853.0min: [2024-01-05 00:06:43,572][model8_pretrain.py][INFO] Epoch:[0/2](514800/4588595) loss:3.396 lr:0.0000100 epoch_Time:25853.0min: [2024-01-05 00:06:43,572][model8_pretrain.py][INFO] Epoch:[0/2](514800/4588595) loss:2.403 lr:0.0000100 epoch_Time:25853.0min: [2024-01-05 00:06:43,572][model8_pretrain.py][INFO] Epoch:[0/2](514800/4588595) loss:2.445 lr:0.0000100 epoch_Time:25853.0min: [2024-01-05 00:06:43,572][model8_pretrain.py][INFO] 
Epoch:[0/2](514800/4588595) loss:3.044 lr:0.0000100 epoch_Time:25853.0min: [2024-01-05 00:06:43,572][model8_pretrain.py][INFO] Epoch:[0/2](514800/4588595) loss:2.516 lr:0.0000100 epoch_Time:25853.0min: [2024-01-05 00:06:43,573][model8_pretrain.py][INFO] Epoch:[0/2](514800/4588595) loss:2.870 lr:0.0000100 epoch_Time:25853.0min: [2024-01-05 00:07:22,247][model8_pretrain.py][INFO] Epoch:[0/2](514900/4588595) loss:3.075 lr:0.0000100 epoch_Time:25852.0min: [2024-01-05 00:07:22,247][model8_pretrain.py][INFO] Epoch:[0/2](514900/4588595) loss:3.373 lr:0.0000100 epoch_Time:25852.0min: [2024-01-05 00:07:22,247][model8_pretrain.py][INFO] Epoch:[0/2](514900/4588595) loss:2.829 lr:0.0000100 epoch_Time:25852.0min: [2024-01-05 00:07:22,247][model8_pretrain.py][INFO] Epoch:[0/2](514900/4588595) loss:2.502 lr:0.0000100 epoch_Time:25852.0min: [2024-01-05 00:07:22,247][model8_pretrain.py][INFO] Epoch:[0/2](514900/4588595) loss:3.171 lr:0.0000100 epoch_Time:25852.0min: [2024-01-05 00:07:22,247][model8_pretrain.py][INFO] Epoch:[0/2](514900/4588595) loss:2.780 lr:0.0000100 epoch_Time:25852.0min: [2024-01-05 00:07:22,247][model8_pretrain.py][INFO] Epoch:[0/2](514900/4588595) loss:2.828 lr:0.0000100 epoch_Time:25852.0min: [2024-01-05 00:07:22,247][model8_pretrain.py][INFO] Epoch:[0/2](514900/4588595) loss:2.470 lr:0.0000100 epoch_Time:25852.0min: [2024-01-05 00:08:07,753][model8_pretrain.py][INFO] Epoch:[0/2](515000/4588595) loss:2.599 lr:0.0000100 epoch_Time:25852.0min: [2024-01-05 00:08:07,753][model8_pretrain.py][INFO] Epoch:[0/2](515000/4588595) loss:3.113 lr:0.0000100 epoch_Time:25852.0min: [2024-01-05 00:08:07,753][model8_pretrain.py][INFO] Epoch:[0/2](515000/4588595) loss:3.072 lr:0.0000100 epoch_Time:25852.0min: [2024-01-05 00:08:07,753][model8_pretrain.py][INFO] Epoch:[0/2](515000/4588595) loss:3.130 lr:0.0000100 epoch_Time:25852.0min: [2024-01-05 00:08:07,753][model8_pretrain.py][INFO] Epoch:[0/2](515000/4588595) loss:2.746 lr:0.0000100 epoch_Time:25852.0min: [2024-01-05 
00:08:07,753][model8_pretrain.py][INFO] Epoch:[0/2](515000/4588595) loss:2.905 lr:0.0000100 epoch_Time:25852.0min: [2024-01-05 00:08:07,753][model8_pretrain.py][INFO] Epoch:[0/2](515000/4588595) loss:3.031 lr:0.0000100 epoch_Time:25852.0min: [2024-01-05 00:08:07,753][model8_pretrain.py][INFO] Epoch:[0/2](515000/4588595) loss:3.021 lr:0.0000100 epoch_Time:25852.0min: [2024-01-05 00:08:44,695][model8_pretrain.py][INFO] Epoch:[0/2](515100/4588595) loss:2.378 lr:0.0000100 epoch_Time:25852.0min: [2024-01-05 00:08:44,695][model8_pretrain.py][INFO] Epoch:[0/2](515100/4588595) loss:2.498 lr:0.0000100 epoch_Time:25852.0min: [2024-01-05 00:08:44,695][model8_pretrain.py][INFO] Epoch:[0/2](515100/4588595) loss:2.280 lr:0.0000100 epoch_Time:25852.0min: [2024-01-05 00:08:44,695][model8_pretrain.py][INFO] Epoch:[0/2](515100/4588595) loss:2.908 lr:0.0000100 epoch_Time:25852.0min: [2024-01-05 00:08:44,695][model8_pretrain.py][INFO] Epoch:[0/2](515100/4588595) loss:2.932 lr:0.0000100 epoch_Time:25852.0min: [2024-01-05 00:08:44,695][model8_pretrain.py][INFO] Epoch:[0/2](515100/4588595) loss:2.896 lr:0.0000100 epoch_Time:25852.0min: [2024-01-05 00:08:44,695][model8_pretrain.py][INFO] Epoch:[0/2](515100/4588595) loss:3.126 lr:0.0000100 epoch_Time:25852.0min: [2024-01-05 00:08:44,696][model8_pretrain.py][INFO] Epoch:[0/2](515100/4588595) loss:2.954 lr:0.0000100 epoch_Time:25852.0min: [2024-01-05 00:09:21,639][model8_pretrain.py][INFO] Epoch:[0/2](515200/4588595) loss:2.658 lr:0.0000100 epoch_Time:25851.0min: [2024-01-05 00:09:21,639][model8_pretrain.py][INFO] Epoch:[0/2](515200/4588595) loss:2.809 lr:0.0000100 epoch_Time:25851.0min: [2024-01-05 00:09:21,639][model8_pretrain.py][INFO] Epoch:[0/2](515200/4588595) loss:3.208 lr:0.0000100 epoch_Time:25851.0min: [2024-01-05 00:09:21,639][model8_pretrain.py][INFO] Epoch:[0/2](515200/4588595) loss:2.906 lr:0.0000100 epoch_Time:25851.0min: [2024-01-05 00:09:21,639][model8_pretrain.py][INFO] Epoch:[0/2](515200/4588595) loss:3.314 lr:0.0000100 
epoch_Time:25851.0min: [2024-01-05 00:09:21,639][model8_pretrain.py][INFO] Epoch:[0/2](515200/4588595) loss:2.481 lr:0.0000100 epoch_Time:25851.0min: [2024-01-05 00:09:21,639][model8_pretrain.py][INFO] Epoch:[0/2](515200/4588595) loss:3.095 lr:0.0000100 epoch_Time:25851.0min: [2024-01-05 00:09:21,639][model8_pretrain.py][INFO] Epoch:[0/2](515200/4588595) loss:2.954 lr:0.0000100 epoch_Time:25851.0min: [2024-01-05 00:09:58,583][model8_pretrain.py][INFO] Epoch:[0/2](515300/4588595) loss:2.367 lr:0.0000100 epoch_Time:25849.0min: [2024-01-05 00:09:58,583][model8_pretrain.py][INFO] Epoch:[0/2](515300/4588595) loss:2.922 lr:0.0000100 epoch_Time:25849.0min: [2024-01-05 00:09:58,583][model8_pretrain.py][INFO] Epoch:[0/2](515300/4588595) loss:2.628 lr:0.0000100 epoch_Time:25849.0min: [2024-01-05 00:09:58,583][model8_pretrain.py][INFO] Epoch:[0/2](515300/4588595) loss:3.164 lr:0.0000100 epoch_Time:25849.0min: [2024-01-05 00:09:58,583][model8_pretrain.py][INFO] Epoch:[0/2](515300/4588595) loss:2.886 lr:0.0000100 epoch_Time:25849.0min: [2024-01-05 00:09:58,584][model8_pretrain.py][INFO] Epoch:[0/2](515300/4588595) loss:2.805 lr:0.0000100 epoch_Time:25849.0min: [2024-01-05 00:09:58,584][model8_pretrain.py][INFO] Epoch:[0/2](515300/4588595) loss:3.093 lr:0.0000100 epoch_Time:25849.0min: [2024-01-05 00:09:58,584][model8_pretrain.py][INFO] Epoch:[0/2](515300/4588595) loss:2.581 lr:0.0000100 epoch_Time:25849.0min: [2024-01-05 00:10:35,544][model8_pretrain.py][INFO] Epoch:[0/2](515400/4588595) loss:3.053 lr:0.0000100 epoch_Time:25849.0min: [2024-01-05 00:10:35,544][model8_pretrain.py][INFO] Epoch:[0/2](515400/4588595) loss:2.381 lr:0.0000100 epoch_Time:25849.0min: [2024-01-05 00:10:35,544][model8_pretrain.py][INFO] Epoch:[0/2](515400/4588595) loss:2.974 lr:0.0000100 epoch_Time:25849.0min: [2024-01-05 00:10:35,545][model8_pretrain.py][INFO] Epoch:[0/2](515400/4588595) loss:2.984 lr:0.0000100 epoch_Time:25849.0min: [2024-01-05 00:10:35,545][model8_pretrain.py][INFO] 
Epoch:[0/2](515400/4588595) loss:2.560 lr:0.0000100 epoch_Time:25849.0min: [2024-01-05 00:10:35,545][model8_pretrain.py][INFO] Epoch:[0/2](515400/4588595) loss:3.279 lr:0.0000100 epoch_Time:25849.0min: [2024-01-05 00:10:35,545][model8_pretrain.py][INFO] Epoch:[0/2](515400/4588595) loss:3.128 lr:0.0000100 epoch_Time:25849.0min: [2024-01-05 00:10:35,545][model8_pretrain.py][INFO] Epoch:[0/2](515400/4588595) loss:2.700 lr:0.0000100 epoch_Time:25849.0min: [2024-01-05 00:11:12,480][model8_pretrain.py][INFO] Epoch:[0/2](515500/4588595) loss:2.978 lr:0.0000100 epoch_Time:25848.0min: [2024-01-05 00:11:12,480][model8_pretrain.py][INFO] Epoch:[0/2](515500/4588595) loss:2.460 lr:0.0000100 epoch_Time:25848.0min: [2024-01-05 00:11:12,481][model8_pretrain.py][INFO] Epoch:[0/2](515500/4588595) loss:3.127 lr:0.0000100 epoch_Time:25848.0min: [2024-01-05 00:11:12,481][model8_pretrain.py][INFO] Epoch:[0/2](515500/4588595) loss:3.111 lr:0.0000100 epoch_Time:25848.0min: [2024-01-05 00:11:12,481][model8_pretrain.py][INFO] Epoch:[0/2](515500/4588595) loss:3.145 lr:0.0000100 epoch_Time:25848.0min: [2024-01-05 00:11:12,481][model8_pretrain.py][INFO] Epoch:[0/2](515500/4588595) loss:3.147 lr:0.0000100 epoch_Time:25848.0min: [2024-01-05 00:11:12,481][model8_pretrain.py][INFO] Epoch:[0/2](515500/4588595) loss:3.012 lr:0.0000100 epoch_Time:25848.0min: [2024-01-05 00:11:12,481][model8_pretrain.py][INFO] Epoch:[0/2](515500/4588595) loss:2.066 lr:0.0000100 epoch_Time:25848.0min: [2024-01-05 00:11:49,426][model8_pretrain.py][INFO] Epoch:[0/2](515600/4588595) loss:3.178 lr:0.0000100 epoch_Time:25847.0min: [2024-01-05 00:11:49,426][model8_pretrain.py][INFO] Epoch:[0/2](515600/4588595) loss:3.134 lr:0.0000100 epoch_Time:25847.0min: [2024-01-05 00:11:49,426][model8_pretrain.py][INFO] Epoch:[0/2](515600/4588595) loss:2.734 lr:0.0000100 epoch_Time:25847.0min: [2024-01-05 00:11:49,426][model8_pretrain.py][INFO] Epoch:[0/2](515600/4588595) loss:1.879 lr:0.0000100 epoch_Time:25847.0min: [2024-01-05 
00:11:49,426][model8_pretrain.py][INFO] Epoch:[0/2](515600/4588595) loss:3.301 lr:0.0000100 epoch_Time:25847.0min: [2024-01-05 00:11:49,426][model8_pretrain.py][INFO] Epoch:[0/2](515600/4588595) loss:2.537 lr:0.0000100 epoch_Time:25847.0min: [2024-01-05 00:11:49,426][model8_pretrain.py][INFO] Epoch:[0/2](515600/4588595) loss:2.570 lr:0.0000100 epoch_Time:25847.0min: [2024-01-05 00:11:49,426][model8_pretrain.py][INFO] Epoch:[0/2](515600/4588595) loss:2.605 lr:0.0000100 epoch_Time:25847.0min: [2024-01-05 00:12:26,363][model8_pretrain.py][INFO] Epoch:[0/2](515700/4588595) loss:3.473 lr:0.0000100 epoch_Time:25847.0min: [2024-01-05 00:12:26,363][model8_pretrain.py][INFO] Epoch:[0/2](515700/4588595) loss:3.089 lr:0.0000100 epoch_Time:25847.0min: [2024-01-05 00:12:26,363][model8_pretrain.py][INFO] Epoch:[0/2](515700/4588595) loss:3.072 lr:0.0000100 epoch_Time:25847.0min: [2024-01-05 00:12:26,363][model8_pretrain.py][INFO] Epoch:[0/2](515700/4588595) loss:2.413 lr:0.0000100 epoch_Time:25847.0min: [2024-01-05 00:12:26,363][model8_pretrain.py][INFO] Epoch:[0/2](515700/4588595) loss:3.030 lr:0.0000100 epoch_Time:25847.0min: [2024-01-05 00:12:26,363][model8_pretrain.py][INFO] Epoch:[0/2](515700/4588595) loss:2.711 lr:0.0000100 epoch_Time:25847.0min: [2024-01-05 00:12:26,363][model8_pretrain.py][INFO] Epoch:[0/2](515700/4588595) loss:3.054 lr:0.0000100 epoch_Time:25847.0min: [2024-01-05 00:12:26,364][model8_pretrain.py][INFO] Epoch:[0/2](515700/4588595) loss:2.641 lr:0.0000100 epoch_Time:25847.0min: [2024-01-05 00:13:13,796][model8_pretrain.py][INFO] Epoch:[0/2](515800/4588595) loss:1.979 lr:0.0000100 epoch_Time:25847.0min: [2024-01-05 00:13:13,796][model8_pretrain.py][INFO] Epoch:[0/2](515800/4588595) loss:3.132 lr:0.0000100 epoch_Time:25847.0min: [2024-01-05 00:13:13,796][model8_pretrain.py][INFO] Epoch:[0/2](515800/4588595) loss:2.612 lr:0.0000100 epoch_Time:25847.0min: [2024-01-05 00:13:13,796][model8_pretrain.py][INFO] Epoch:[0/2](515800/4588595) loss:3.312 lr:0.0000100 
epoch_Time:25847.0min: [2024-01-05 00:13:13,796][model8_pretrain.py][INFO] Epoch:[0/2](515800/4588595) loss:2.564 lr:0.0000100 epoch_Time:25847.0min: [2024-01-05 00:13:13,796][model8_pretrain.py][INFO] Epoch:[0/2](515800/4588595) loss:2.331 lr:0.0000100 epoch_Time:25847.0min: [2024-01-05 00:13:13,796][model8_pretrain.py][INFO] Epoch:[0/2](515800/4588595) loss:2.527 lr:0.0000100 epoch_Time:25847.0min: [2024-01-05 00:13:13,796][model8_pretrain.py][INFO] Epoch:[0/2](515800/4588595) loss:2.448 lr:0.0000100 epoch_Time:25847.0min: [2024-01-05 00:13:50,737][model8_pretrain.py][INFO] Epoch:[0/2](515900/4588595) loss:3.117 lr:0.0000100 epoch_Time:25846.0min: [2024-01-05 00:13:50,737][model8_pretrain.py][INFO] Epoch:[0/2](515900/4588595) loss:3.169 lr:0.0000100 epoch_Time:25846.0min: [2024-01-05 00:13:50,737][model8_pretrain.py][INFO] Epoch:[0/2](515900/4588595) loss:2.489 lr:0.0000100 epoch_Time:25846.0min: [2024-01-05 00:13:50,737][model8_pretrain.py][INFO] Epoch:[0/2](515900/4588595) loss:2.831 lr:0.0000100 epoch_Time:25846.0min: [2024-01-05 00:13:50,737][model8_pretrain.py][INFO] Epoch:[0/2](515900/4588595) loss:2.972 lr:0.0000100 epoch_Time:25846.0min: [2024-01-05 00:13:50,737][model8_pretrain.py][INFO] Epoch:[0/2](515900/4588595) loss:2.514 lr:0.0000100 epoch_Time:25846.0min: [2024-01-05 00:13:50,737][model8_pretrain.py][INFO] Epoch:[0/2](515900/4588595) loss:3.154 lr:0.0000100 epoch_Time:25846.0min: [2024-01-05 00:13:50,737][model8_pretrain.py][INFO] Epoch:[0/2](515900/4588595) loss:3.083 lr:0.0000100 epoch_Time:25846.0min: [2024-01-05 00:14:27,687][model8_pretrain.py][INFO] Epoch:[0/2](516000/4588595) loss:3.181 lr:0.0000100 epoch_Time:25846.0min: [2024-01-05 00:14:27,688][model8_pretrain.py][INFO] Epoch:[0/2](516000/4588595) loss:2.852 lr:0.0000100 epoch_Time:25846.0min: [2024-01-05 00:14:27,688][model8_pretrain.py][INFO] Epoch:[0/2](516000/4588595) loss:2.863 lr:0.0000100 epoch_Time:25846.0min: [2024-01-05 00:14:27,688][model8_pretrain.py][INFO] 
Epoch:[0/2](516000/4588595) loss:2.778 lr:0.0000100 epoch_Time:25846.0min: [2024-01-05 00:14:27,688][model8_pretrain.py][INFO] Epoch:[0/2](516000/4588595) loss:2.946 lr:0.0000100 epoch_Time:25846.0min: [2024-01-05 00:14:27,688][model8_pretrain.py][INFO] Epoch:[0/2](516000/4588595) loss:2.609 lr:0.0000100 epoch_Time:25846.0min: [2024-01-05 00:14:27,688][model8_pretrain.py][INFO] Epoch:[0/2](516000/4588595) loss:3.178 lr:0.0000100 epoch_Time:25846.0min: [2024-01-05 00:14:27,688][model8_pretrain.py][INFO] Epoch:[0/2](516000/4588595) loss:3.176 lr:0.0000100 epoch_Time:25846.0min: [2024-01-05 00:15:04,627][model8_pretrain.py][INFO] Epoch:[0/2](516100/4588595) loss:3.140 lr:0.0000100 epoch_Time:25845.0min: [2024-01-05 00:15:04,627][model8_pretrain.py][INFO] Epoch:[0/2](516100/4588595) loss:3.065 lr:0.0000100 epoch_Time:25845.0min: [2024-01-05 00:15:04,627][model8_pretrain.py][INFO] Epoch:[0/2](516100/4588595) loss:2.546 lr:0.0000100 epoch_Time:25845.0min: [2024-01-05 00:15:04,627][model8_pretrain.py][INFO] Epoch:[0/2](516100/4588595) loss:3.012 lr:0.0000100 epoch_Time:25845.0min: [2024-01-05 00:15:04,627][model8_pretrain.py][INFO] Epoch:[0/2](516100/4588595) loss:3.328 lr:0.0000100 epoch_Time:25845.0min: [2024-01-05 00:15:04,628][model8_pretrain.py][INFO] Epoch:[0/2](516100/4588595) loss:2.412 lr:0.0000100 epoch_Time:25845.0min: [2024-01-05 00:15:04,628][model8_pretrain.py][INFO] Epoch:[0/2](516100/4588595) loss:2.788 lr:0.0000100 epoch_Time:25845.0min: [2024-01-05 00:15:04,628][model8_pretrain.py][INFO] Epoch:[0/2](516100/4588595) loss:2.788 lr:0.0000100 epoch_Time:25845.0min: [2024-01-05 00:15:41,558][model8_pretrain.py][INFO] Epoch:[0/2](516200/4588595) loss:2.457 lr:0.0000100 epoch_Time:25845.0min: [2024-01-05 00:15:41,558][model8_pretrain.py][INFO] Epoch:[0/2](516200/4588595) loss:2.605 lr:0.0000100 epoch_Time:25845.0min: [2024-01-05 00:15:41,558][model8_pretrain.py][INFO] Epoch:[0/2](516200/4588595) loss:2.951 lr:0.0000100 epoch_Time:25845.0min: [2024-01-05 
00:15:41,558][model8_pretrain.py][INFO] Epoch:[0/2](516200/4588595) loss:2.946 lr:0.0000100 epoch_Time:25845.0min: [2024-01-05 00:15:41,558][model8_pretrain.py][INFO] Epoch:[0/2](516200/4588595) loss:2.899 lr:0.0000100 epoch_Time:25845.0min: [2024-01-05 00:15:41,558][model8_pretrain.py][INFO] Epoch:[0/2](516200/4588595) loss:3.035 lr:0.0000100 epoch_Time:25845.0min: [2024-01-05 00:15:41,558][model8_pretrain.py][INFO] Epoch:[0/2](516200/4588595) loss:3.288 lr:0.0000100 epoch_Time:25845.0min: [2024-01-05 00:15:41,579][model8_pretrain.py][INFO] Epoch:[0/2](516200/4588595) loss:2.295 lr:0.0000100 epoch_Time:25845.0min: [2024-01-05 00:16:18,516][model8_pretrain.py][INFO] Epoch:[0/2](516300/4588595) loss:3.315 lr:0.0000100 epoch_Time:25843.0min: [2024-01-05 00:16:18,516][model8_pretrain.py][INFO] Epoch:[0/2](516300/4588595) loss:2.875 lr:0.0000100 epoch_Time:25843.0min: [2024-01-05 00:16:18,516][model8_pretrain.py][INFO] Epoch:[0/2](516300/4588595) loss:2.961 lr:0.0000100 epoch_Time:25843.0min: [2024-01-05 00:16:18,516][model8_pretrain.py][INFO] Epoch:[0/2](516300/4588595) loss:3.451 lr:0.0000100 epoch_Time:25843.0min: [2024-01-05 00:16:18,516][model8_pretrain.py][INFO] Epoch:[0/2](516300/4588595) loss:3.183 lr:0.0000100 epoch_Time:25843.0min: [2024-01-05 00:16:18,516][model8_pretrain.py][INFO] Epoch:[0/2](516300/4588595) loss:2.896 lr:0.0000100 epoch_Time:25843.0min: [2024-01-05 00:16:18,516][model8_pretrain.py][INFO] Epoch:[0/2](516300/4588595) loss:2.530 lr:0.0000100 epoch_Time:25843.0min: [2024-01-05 00:16:18,516][model8_pretrain.py][INFO] Epoch:[0/2](516300/4588595) loss:2.969 lr:0.0000100 epoch_Time:25843.0min: [2024-01-05 00:16:55,456][model8_pretrain.py][INFO] Epoch:[0/2](516400/4588595) loss:2.835 lr:0.0000100 epoch_Time:25842.0min: [2024-01-05 00:16:55,457][model8_pretrain.py][INFO] Epoch:[0/2](516400/4588595) loss:2.507 lr:0.0000100 epoch_Time:25842.0min: [2024-01-05 00:16:55,457][model8_pretrain.py][INFO] Epoch:[0/2](516400/4588595) loss:2.999 lr:0.0000100 
epoch_Time:25842.0min: [2024-01-05 00:16:55,457][model8_pretrain.py][INFO] Epoch:[0/2](516400/4588595) loss:2.116 lr:0.0000100 epoch_Time:25842.0min: [2024-01-05 00:16:55,457][model8_pretrain.py][INFO] Epoch:[0/2](516400/4588595) loss:3.096 lr:0.0000100 epoch_Time:25842.0min: [2024-01-05 00:16:55,457][model8_pretrain.py][INFO] Epoch:[0/2](516400/4588595) loss:2.778 lr:0.0000100 epoch_Time:25842.0min: [2024-01-05 00:16:55,457][model8_pretrain.py][INFO] Epoch:[0/2](516400/4588595) loss:2.702 lr:0.0000100 epoch_Time:25842.0min: [2024-01-05 00:16:55,457][model8_pretrain.py][INFO] Epoch:[0/2](516400/4588595) loss:3.310 lr:0.0000100 epoch_Time:25842.0min: [2024-01-05 00:17:32,390][model8_pretrain.py][INFO] Epoch:[0/2](516500/4588595) loss:2.272 lr:0.0000100 epoch_Time:25842.0min: [2024-01-05 00:17:32,390][model8_pretrain.py][INFO] Epoch:[0/2](516500/4588595) loss:2.816 lr:0.0000100 epoch_Time:25842.0min: [2024-01-05 00:17:32,390][model8_pretrain.py][INFO] Epoch:[0/2](516500/4588595) loss:2.859 lr:0.0000100 epoch_Time:25842.0min: [2024-01-05 00:17:32,390][model8_pretrain.py][INFO] Epoch:[0/2](516500/4588595) loss:2.688 lr:0.0000100 epoch_Time:25842.0min: [2024-01-05 00:17:32,390][model8_pretrain.py][INFO] Epoch:[0/2](516500/4588595) loss:2.600 lr:0.0000100 epoch_Time:25842.0min: [2024-01-05 00:17:32,390][model8_pretrain.py][INFO] Epoch:[0/2](516500/4588595) loss:2.757 lr:0.0000100 epoch_Time:25842.0min: [2024-01-05 00:17:32,390][model8_pretrain.py][INFO] Epoch:[0/2](516500/4588595) loss:2.842 lr:0.0000100 epoch_Time:25842.0min: [2024-01-05 00:17:32,390][model8_pretrain.py][INFO] Epoch:[0/2](516500/4588595) loss:2.989 lr:0.0000100 epoch_Time:25842.0min: [2024-01-05 00:18:19,727][model8_pretrain.py][INFO] Epoch:[0/2](516600/4588595) loss:2.305 lr:0.0000100 epoch_Time:25842.0min: [2024-01-05 00:18:19,727][model8_pretrain.py][INFO] Epoch:[0/2](516600/4588595) loss:2.815 lr:0.0000100 epoch_Time:25842.0min: [2024-01-05 00:18:19,727][model8_pretrain.py][INFO] 
Epoch:[0/2](516600/4588595) loss:2.857 lr:0.0000100 epoch_Time:25842.0min: [2024-01-05 00:18:19,727][model8_pretrain.py][INFO] Epoch:[0/2](516600/4588595) loss:3.184 lr:0.0000100 epoch_Time:25842.0min: [2024-01-05 00:18:19,727][model8_pretrain.py][INFO] Epoch:[0/2](516600/4588595) loss:2.809 lr:0.0000100 epoch_Time:25842.0min: [2024-01-05 00:18:19,727][model8_pretrain.py][INFO] Epoch:[0/2](516600/4588595) loss:2.946 lr:0.0000100 epoch_Time:25842.0min: [2024-01-05 00:18:19,727][model8_pretrain.py][INFO] Epoch:[0/2](516600/4588595) loss:2.911 lr:0.0000100 epoch_Time:25842.0min: [2024-01-05 00:18:19,727][model8_pretrain.py][INFO] Epoch:[0/2](516600/4588595) loss:2.802 lr:0.0000100 epoch_Time:25842.0min: [2024-01-05 00:18:56,659][model8_pretrain.py][INFO] Epoch:[0/2](516700/4588595) loss:2.731 lr:0.0000100 epoch_Time:25841.0min: [2024-01-05 00:18:56,659][model8_pretrain.py][INFO] Epoch:[0/2](516700/4588595) loss:2.597 lr:0.0000100 epoch_Time:25841.0min: [2024-01-05 00:18:56,659][model8_pretrain.py][INFO] Epoch:[0/2](516700/4588595) loss:2.562 lr:0.0000100 epoch_Time:25841.0min: [2024-01-05 00:18:56,659][model8_pretrain.py][INFO] Epoch:[0/2](516700/4588595) loss:2.432 lr:0.0000100 epoch_Time:25841.0min: [2024-01-05 00:18:56,659][model8_pretrain.py][INFO] Epoch:[0/2](516700/4588595) loss:3.179 lr:0.0000100 epoch_Time:25841.0min: [2024-01-05 00:18:56,659][model8_pretrain.py][INFO] Epoch:[0/2](516700/4588595) loss:3.067 lr:0.0000100 epoch_Time:25841.0min: [2024-01-05 00:18:56,659][model8_pretrain.py][INFO] Epoch:[0/2](516700/4588595) loss:2.290 lr:0.0000100 epoch_Time:25841.0min: [2024-01-05 00:18:56,659][model8_pretrain.py][INFO] Epoch:[0/2](516700/4588595) loss:2.820 lr:0.0000100 epoch_Time:25841.0min: [2024-01-05 00:19:33,605][model8_pretrain.py][INFO] Epoch:[0/2](516800/4588595) loss:2.870 lr:0.0000100 epoch_Time:25841.0min: [2024-01-05 00:19:33,605][model8_pretrain.py][INFO] Epoch:[0/2](516800/4588595) loss:2.993 lr:0.0000100 epoch_Time:25841.0min: [2024-01-05 
00:19:33,605][model8_pretrain.py][INFO] Epoch:[0/2](516800/4588595) loss:3.202 lr:0.0000100 epoch_Time:25841.0min: [2024-01-05 00:19:33,605][model8_pretrain.py][INFO] Epoch:[0/2](516800/4588595) loss:3.104 lr:0.0000100 epoch_Time:25841.0min: [2024-01-05 00:19:33,605][model8_pretrain.py][INFO] Epoch:[0/2](516800/4588595) loss:3.020 lr:0.0000100 epoch_Time:25841.0min: [2024-01-05 00:19:33,605][model8_pretrain.py][INFO] Epoch:[0/2](516800/4588595) loss:3.023 lr:0.0000100 epoch_Time:25841.0min: [2024-01-05 00:19:33,605][model8_pretrain.py][INFO] Epoch:[0/2](516800/4588595) loss:2.883 lr:0.0000100 epoch_Time:25841.0min: [2024-01-05 00:19:33,605][model8_pretrain.py][INFO] Epoch:[0/2](516800/4588595) loss:3.125 lr:0.0000100 epoch_Time:25841.0min: [2024-01-05 00:20:10,545][model8_pretrain.py][INFO] Epoch:[0/2](516900/4588595) loss:3.091 lr:0.0000100 epoch_Time:25840.0min: [2024-01-05 00:20:10,545][model8_pretrain.py][INFO] Epoch:[0/2](516900/4588595) loss:1.955 lr:0.0000100 epoch_Time:25840.0min: [2024-01-05 00:20:10,545][model8_pretrain.py][INFO] Epoch:[0/2](516900/4588595) loss:2.870 lr:0.0000100 epoch_Time:25840.0min: [2024-01-05 00:20:10,545][model8_pretrain.py][INFO] Epoch:[0/2](516900/4588595) loss:3.171 lr:0.0000100 epoch_Time:25840.0min: [2024-01-05 00:20:10,545][model8_pretrain.py][INFO] Epoch:[0/2](516900/4588595) loss:2.962 lr:0.0000100 epoch_Time:25840.0min: [2024-01-05 00:20:10,545][model8_pretrain.py][INFO] Epoch:[0/2](516900/4588595) loss:3.083 lr:0.0000100 epoch_Time:25840.0min: [2024-01-05 00:20:10,545][model8_pretrain.py][INFO] Epoch:[0/2](516900/4588595) loss:3.083 lr:0.0000100 epoch_Time:25840.0min: [2024-01-05 00:20:10,545][model8_pretrain.py][INFO] Epoch:[0/2](516900/4588595) loss:3.089 lr:0.0000100 epoch_Time:25840.0min: [2024-01-05 00:20:47,482][model8_pretrain.py][INFO] Epoch:[0/2](517000/4588595) loss:2.728 lr:0.0000100 epoch_Time:25840.0min: [2024-01-05 00:20:47,482][model8_pretrain.py][INFO] Epoch:[0/2](517000/4588595) loss:2.957 lr:0.0000100 
epoch_Time:25840.0min: [2024-01-05 00:20:47,482][model8_pretrain.py][INFO] Epoch:[0/2](517000/4588595) loss:3.316 lr:0.0000100 epoch_Time:25840.0min: [2024-01-05 00:20:47,482][model8_pretrain.py][INFO] Epoch:[0/2](517000/4588595) loss:2.918 lr:0.0000100 epoch_Time:25840.0min: [2024-01-05 00:20:47,482][model8_pretrain.py][INFO] Epoch:[0/2](517000/4588595) loss:2.627 lr:0.0000100 epoch_Time:25840.0min: [2024-01-05 00:20:47,482][model8_pretrain.py][INFO] Epoch:[0/2](517000/4588595) loss:3.166 lr:0.0000100 epoch_Time:25840.0min: [2024-01-05 00:20:47,482][model8_pretrain.py][INFO] Epoch:[0/2](517000/4588595) loss:2.661 lr:0.0000100 epoch_Time:25840.0min: [2024-01-05 00:20:47,482][model8_pretrain.py][INFO] Epoch:[0/2](517000/4588595) loss:2.496 lr:0.0000100 epoch_Time:25840.0min: [2024-01-05 00:21:24,411][model8_pretrain.py][INFO] Epoch:[0/2](517100/4588595) loss:2.518 lr:0.0000100 epoch_Time:25839.0min: [2024-01-05 00:21:24,411][model8_pretrain.py][INFO] Epoch:[0/2](517100/4588595) loss:3.186 lr:0.0000100 epoch_Time:25839.0min: [2024-01-05 00:21:24,411][model8_pretrain.py][INFO] Epoch:[0/2](517100/4588595) loss:2.741 lr:0.0000100 epoch_Time:25839.0min: [2024-01-05 00:21:24,411][model8_pretrain.py][INFO] Epoch:[0/2](517100/4588595) loss:2.119 lr:0.0000100 epoch_Time:25839.0min: [2024-01-05 00:21:24,411][model8_pretrain.py][INFO] Epoch:[0/2](517100/4588595) loss:2.982 lr:0.0000100 epoch_Time:25839.0min: [2024-01-05 00:21:24,411][model8_pretrain.py][INFO] Epoch:[0/2](517100/4588595) loss:2.741 lr:0.0000100 epoch_Time:25839.0min: [2024-01-05 00:21:24,412][model8_pretrain.py][INFO] Epoch:[0/2](517100/4588595) loss:3.333 lr:0.0000100 epoch_Time:25839.0min: [2024-01-05 00:21:24,412][model8_pretrain.py][INFO] Epoch:[0/2](517100/4588595) loss:2.610 lr:0.0000100 epoch_Time:25839.0min: [2024-01-05 00:22:01,343][model8_pretrain.py][INFO] Epoch:[0/2](517200/4588595) loss:3.012 lr:0.0000100 epoch_Time:25837.0min: [2024-01-05 00:22:01,343][model8_pretrain.py][INFO] 
Epoch:[0/2](517200/4588595) loss:2.645 lr:0.0000100 epoch_Time:25837.0min: [2024-01-05 00:22:01,343][model8_pretrain.py][INFO] Epoch:[0/2](517200/4588595) loss:2.678 lr:0.0000100 epoch_Time:25837.0min: [2024-01-05 00:22:01,343][model8_pretrain.py][INFO] Epoch:[0/2](517200/4588595) loss:2.834 lr:0.0000100 epoch_Time:25837.0min: [2024-01-05 00:22:01,343][model8_pretrain.py][INFO] Epoch:[0/2](517200/4588595) loss:2.479 lr:0.0000100 epoch_Time:25837.0min: [2024-01-05 00:22:01,343][model8_pretrain.py][INFO] Epoch:[0/2](517200/4588595) loss:2.569 lr:0.0000100 epoch_Time:25837.0min: [2024-01-05 00:22:01,343][model8_pretrain.py][INFO] Epoch:[0/2](517200/4588595) loss:3.436 lr:0.0000100 epoch_Time:25837.0min: [2024-01-05 00:22:01,344][model8_pretrain.py][INFO] Epoch:[0/2](517200/4588595) loss:2.323 lr:0.0000100 epoch_Time:25837.0min: [2024-01-05 00:22:38,278][model8_pretrain.py][INFO] Epoch:[0/2](517300/4588595) loss:3.359 lr:0.0000100 epoch_Time:25837.0min: [2024-01-05 00:22:38,278][model8_pretrain.py][INFO] Epoch:[0/2](517300/4588595) loss:2.594 lr:0.0000100 epoch_Time:25837.0min: [2024-01-05 00:22:38,278][model8_pretrain.py][INFO] Epoch:[0/2](517300/4588595) loss:2.813 lr:0.0000100 epoch_Time:25837.0min: [2024-01-05 00:22:38,278][model8_pretrain.py][INFO] Epoch:[0/2](517300/4588595) loss:2.862 lr:0.0000100 epoch_Time:25837.0min: [2024-01-05 00:22:38,278][model8_pretrain.py][INFO] Epoch:[0/2](517300/4588595) loss:3.147 lr:0.0000100 epoch_Time:25837.0min: [2024-01-05 00:22:38,278][model8_pretrain.py][INFO] Epoch:[0/2](517300/4588595) loss:3.055 lr:0.0000100 epoch_Time:25837.0min: [2024-01-05 00:22:38,278][model8_pretrain.py][INFO] Epoch:[0/2](517300/4588595) loss:2.303 lr:0.0000100 epoch_Time:25837.0min: [2024-01-05 00:22:38,278][model8_pretrain.py][INFO] Epoch:[0/2](517300/4588595) loss:3.115 lr:0.0000100 epoch_Time:25837.0min: [2024-01-05 00:23:25,683][model8_pretrain.py][INFO] Epoch:[0/2](517400/4588595) loss:3.072 lr:0.0000100 epoch_Time:25838.0min: [2024-01-05 
00:23:25,683][model8_pretrain.py][INFO] Epoch:[0/2](517400/4588595) loss:2.383 lr:0.0000100 epoch_Time:25838.0min: [2024-01-05 00:23:25,683][model8_pretrain.py][INFO] Epoch:[0/2](517400/4588595) loss:3.156 lr:0.0000100 epoch_Time:25838.0min: [2024-01-05 00:23:25,683][model8_pretrain.py][INFO] Epoch:[0/2](517400/4588595) loss:2.767 lr:0.0000100 epoch_Time:25838.0min: [2024-01-05 00:23:25,683][model8_pretrain.py][INFO] Epoch:[0/2](517400/4588595) loss:2.516 lr:0.0000100 epoch_Time:25838.0min: [2024-01-05 00:23:25,683][model8_pretrain.py][INFO] Epoch:[0/2](517400/4588595) loss:3.117 lr:0.0000100 epoch_Time:25838.0min: [2024-01-05 00:23:25,683][model8_pretrain.py][INFO] Epoch:[0/2](517400/4588595) loss:2.972 lr:0.0000100 epoch_Time:25838.0min: [2024-01-05 00:23:25,683][model8_pretrain.py][INFO] Epoch:[0/2](517400/4588595) loss:2.994 lr:0.0000100 epoch_Time:25838.0min: [2024-01-05 00:24:02,616][model8_pretrain.py][INFO] Epoch:[0/2](517500/4588595) loss:2.884 lr:0.0000100 epoch_Time:25836.0min: [2024-01-05 00:24:02,616][model8_pretrain.py][INFO] Epoch:[0/2](517500/4588595) loss:2.669 lr:0.0000100 epoch_Time:25836.0min: [2024-01-05 00:24:02,616][model8_pretrain.py][INFO] Epoch:[0/2](517500/4588595) loss:2.961 lr:0.0000100 epoch_Time:25836.0min: [2024-01-05 00:24:02,616][model8_pretrain.py][INFO] Epoch:[0/2](517500/4588595) loss:2.711 lr:0.0000100 epoch_Time:25836.0min: [2024-01-05 00:24:02,616][model8_pretrain.py][INFO] Epoch:[0/2](517500/4588595) loss:2.844 lr:0.0000100 epoch_Time:25836.0min: [2024-01-05 00:24:02,616][model8_pretrain.py][INFO] Epoch:[0/2](517500/4588595) loss:2.657 lr:0.0000100 epoch_Time:25836.0min: [2024-01-05 00:24:02,616][model8_pretrain.py][INFO] Epoch:[0/2](517500/4588595) loss:2.452 lr:0.0000100 epoch_Time:25836.0min: [2024-01-05 00:24:02,616][model8_pretrain.py][INFO] Epoch:[0/2](517500/4588595) loss:3.228 lr:0.0000100 epoch_Time:25836.0min: [2024-01-05 00:24:39,545][model8_pretrain.py][INFO] Epoch:[0/2](517600/4588595) loss:2.876 lr:0.0000100 
epoch_Time:25836.0min: [2024-01-05 00:24:39,545][model8_pretrain.py][INFO] Epoch:[0/2](517600/4588595) loss:2.215 lr:0.0000100 epoch_Time:25836.0min: [2024-01-05 00:24:39,545][model8_pretrain.py][INFO] Epoch:[0/2](517600/4588595) loss:2.797 lr:0.0000100 epoch_Time:25836.0min: [2024-01-05 00:24:39,545][model8_pretrain.py][INFO] Epoch:[0/2](517600/4588595) loss:2.687 lr:0.0000100 epoch_Time:25836.0min: [2024-01-05 00:24:39,545][model8_pretrain.py][INFO] Epoch:[0/2](517600/4588595) loss:3.137 lr:0.0000100 epoch_Time:25836.0min: [2024-01-05 00:24:39,545][model8_pretrain.py][INFO] Epoch:[0/2](517600/4588595) loss:3.321 lr:0.0000100 epoch_Time:25836.0min: [2024-01-05 00:24:39,545][model8_pretrain.py][INFO] Epoch:[0/2](517600/4588595) loss:3.377 lr:0.0000100 epoch_Time:25836.0min: [2024-01-05 00:24:39,545][model8_pretrain.py][INFO] Epoch:[0/2](517600/4588595) loss:2.585 lr:0.0000100 epoch_Time:25836.0min: [2024-01-05 00:25:16,462][model8_pretrain.py][INFO] Epoch:[0/2](517700/4588595) loss:2.306 lr:0.0000100 epoch_Time:25835.0min: [2024-01-05 00:25:16,462][model8_pretrain.py][INFO] Epoch:[0/2](517700/4588595) loss:2.467 lr:0.0000100 epoch_Time:25835.0min: [2024-01-05 00:25:16,462][model8_pretrain.py][INFO] Epoch:[0/2](517700/4588595) loss:3.179 lr:0.0000100 epoch_Time:25835.0min: [2024-01-05 00:25:16,462][model8_pretrain.py][INFO] Epoch:[0/2](517700/4588595) loss:3.099 lr:0.0000100 epoch_Time:25835.0min: [2024-01-05 00:25:16,462][model8_pretrain.py][INFO] Epoch:[0/2](517700/4588595) loss:2.813 lr:0.0000100 epoch_Time:25835.0min: [2024-01-05 00:25:16,462][model8_pretrain.py][INFO] Epoch:[0/2](517700/4588595) loss:2.929 lr:0.0000100 epoch_Time:25835.0min: [2024-01-05 00:25:16,462][model8_pretrain.py][INFO] Epoch:[0/2](517700/4588595) loss:3.204 lr:0.0000100 epoch_Time:25835.0min: [2024-01-05 00:25:16,462][model8_pretrain.py][INFO] Epoch:[0/2](517700/4588595) loss:2.621 lr:0.0000100 epoch_Time:25835.0min: [2024-01-05 00:25:53,379][model8_pretrain.py][INFO] 
Epoch:[0/2](517800/4588595) loss:2.885 lr:0.0000100 epoch_Time:25834.0min: [2024-01-05 00:25:53,379][model8_pretrain.py][INFO] Epoch:[0/2](517800/4588595) loss:2.846 lr:0.0000100 epoch_Time:25834.0min: [2024-01-05 00:25:53,379][model8_pretrain.py][INFO] Epoch:[0/2](517800/4588595) loss:2.655 lr:0.0000100 epoch_Time:25834.0min: [2024-01-05 00:25:53,379][model8_pretrain.py][INFO] Epoch:[0/2](517800/4588595) loss:3.223 lr:0.0000100 epoch_Time:25834.0min: [2024-01-05 00:25:53,379][model8_pretrain.py][INFO] Epoch:[0/2](517800/4588595) loss:3.000 lr:0.0000100 epoch_Time:25834.0min: [2024-01-05 00:25:53,379][model8_pretrain.py][INFO] Epoch:[0/2](517800/4588595) loss:2.970 lr:0.0000100 epoch_Time:25834.0min: [2024-01-05 00:25:53,379][model8_pretrain.py][INFO] Epoch:[0/2](517800/4588595) loss:2.786 lr:0.0000100 epoch_Time:25834.0min: [2024-01-05 00:25:53,379][model8_pretrain.py][INFO] Epoch:[0/2](517800/4588595) loss:2.654 lr:0.0000100 epoch_Time:25834.0min: [2024-01-05 00:26:30,295][model8_pretrain.py][INFO] Epoch:[0/2](517900/4588595) loss:3.037 lr:0.0000100 epoch_Time:25834.0min: [2024-01-05 00:26:30,295][model8_pretrain.py][INFO] Epoch:[0/2](517900/4588595) loss:3.191 lr:0.0000100 epoch_Time:25834.0min: [2024-01-05 00:26:30,295][model8_pretrain.py][INFO] Epoch:[0/2](517900/4588595) loss:3.055 lr:0.0000100 epoch_Time:25834.0min: [2024-01-05 00:26:30,295][model8_pretrain.py][INFO] Epoch:[0/2](517900/4588595) loss:2.773 lr:0.0000100 epoch_Time:25834.0min: [2024-01-05 00:26:30,295][model8_pretrain.py][INFO] Epoch:[0/2](517900/4588595) loss:2.402 lr:0.0000100 epoch_Time:25834.0min: [2024-01-05 00:26:30,295][model8_pretrain.py][INFO] Epoch:[0/2](517900/4588595) loss:2.891 lr:0.0000100 epoch_Time:25834.0min: [2024-01-05 00:26:30,295][model8_pretrain.py][INFO] Epoch:[0/2](517900/4588595) loss:2.507 lr:0.0000100 epoch_Time:25834.0min: [2024-01-05 00:26:30,295][model8_pretrain.py][INFO] Epoch:[0/2](517900/4588595) loss:2.271 lr:0.0000100 epoch_Time:25834.0min: [2024-01-05 
00:27:07,206][model8_pretrain.py][INFO] Epoch:[0/2](518000/4588595) loss:3.067 lr:0.0000100 epoch_Time:25833.0min: [2024-01-05 00:27:07,206][model8_pretrain.py][INFO] Epoch:[0/2](518000/4588595) loss:2.793 lr:0.0000100 epoch_Time:25833.0min: [2024-01-05 00:27:07,206][model8_pretrain.py][INFO] Epoch:[0/2](518000/4588595) loss:2.938 lr:0.0000100 epoch_Time:25833.0min: [2024-01-05 00:27:07,206][model8_pretrain.py][INFO] Epoch:[0/2](518000/4588595) loss:2.346 lr:0.0000100 epoch_Time:25833.0min: [2024-01-05 00:27:07,206][model8_pretrain.py][INFO] Epoch:[0/2](518000/4588595) loss:3.050 lr:0.0000100 epoch_Time:25833.0min: [2024-01-05 00:27:07,206][model8_pretrain.py][INFO] Epoch:[0/2](518000/4588595) loss:3.038 lr:0.0000100 epoch_Time:25833.0min: [2024-01-05 00:27:07,206][model8_pretrain.py][INFO] Epoch:[0/2](518000/4588595) loss:3.048 lr:0.0000100 epoch_Time:25833.0min: [2024-01-05 00:27:07,206][model8_pretrain.py][INFO] Epoch:[0/2](518000/4588595) loss:2.590 lr:0.0000100 epoch_Time:25833.0min: [2024-01-05 00:27:44,125][model8_pretrain.py][INFO] Epoch:[0/2](518100/4588595) loss:2.619 lr:0.0000100 epoch_Time:25832.0min: [2024-01-05 00:27:44,125][model8_pretrain.py][INFO] Epoch:[0/2](518100/4588595) loss:2.984 lr:0.0000100 epoch_Time:25832.0min: [2024-01-05 00:27:44,125][model8_pretrain.py][INFO] Epoch:[0/2](518100/4588595) loss:2.849 lr:0.0000100 epoch_Time:25832.0min: [2024-01-05 00:27:44,125][model8_pretrain.py][INFO] Epoch:[0/2](518100/4588595) loss:2.859 lr:0.0000100 epoch_Time:25832.0min: [2024-01-05 00:27:44,125][model8_pretrain.py][INFO] Epoch:[0/2](518100/4588595) loss:2.791 lr:0.0000100 epoch_Time:25832.0min: [2024-01-05 00:27:44,126][model8_pretrain.py][INFO] Epoch:[0/2](518100/4588595) loss:3.084 lr:0.0000100 epoch_Time:25832.0min: [2024-01-05 00:27:44,126][model8_pretrain.py][INFO] Epoch:[0/2](518100/4588595) loss:2.526 lr:0.0000100 epoch_Time:25832.0min: [2024-01-05 00:27:44,127][model8_pretrain.py][INFO] Epoch:[0/2](518100/4588595) loss:3.038 lr:0.0000100 
epoch_Time:25832.0min: [2024-01-05 00:28:31,478][model8_pretrain.py][INFO] Epoch:[0/2](518200/4588595) loss:2.652 lr:0.0000100 epoch_Time:25833.0min: [2024-01-05 00:28:31,478][model8_pretrain.py][INFO] Epoch:[0/2](518200/4588595) loss:2.955 lr:0.0000100 epoch_Time:25833.0min: [2024-01-05 00:28:31,478][model8_pretrain.py][INFO] Epoch:[0/2](518200/4588595) loss:2.830 lr:0.0000100 epoch_Time:25833.0min: [2024-01-05 00:28:31,478][model8_pretrain.py][INFO] Epoch:[0/2](518200/4588595) loss:2.963 lr:0.0000100 epoch_Time:25833.0min: [2024-01-05 00:28:31,478][model8_pretrain.py][INFO] Epoch:[0/2](518200/4588595) loss:2.725 lr:0.0000100 epoch_Time:25833.0min: [2024-01-05 00:28:31,478][model8_pretrain.py][INFO] Epoch:[0/2](518200/4588595) loss:2.878 lr:0.0000100 epoch_Time:25833.0min: [2024-01-05 00:28:31,478][model8_pretrain.py][INFO] Epoch:[0/2](518200/4588595) loss:2.770 lr:0.0000100 epoch_Time:25833.0min: [2024-01-05 00:28:31,478][model8_pretrain.py][INFO] Epoch:[0/2](518200/4588595) loss:2.117 lr:0.0000100 epoch_Time:25833.0min: [2024-01-05 00:29:08,426][model8_pretrain.py][INFO] Epoch:[0/2](518300/4588595) loss:2.785 lr:0.0000100 epoch_Time:25832.0min: [2024-01-05 00:29:08,426][model8_pretrain.py][INFO] Epoch:[0/2](518300/4588595) loss:2.196 lr:0.0000100 epoch_Time:25832.0min: [2024-01-05 00:29:08,426][model8_pretrain.py][INFO] Epoch:[0/2](518300/4588595) loss:2.924 lr:0.0000100 epoch_Time:25832.0min: [2024-01-05 00:29:08,426][model8_pretrain.py][INFO] Epoch:[0/2](518300/4588595) loss:3.138 lr:0.0000100 epoch_Time:25832.0min: [2024-01-05 00:29:08,426][model8_pretrain.py][INFO] Epoch:[0/2](518300/4588595) loss:3.182 lr:0.0000100 epoch_Time:25832.0min: [2024-01-05 00:29:08,426][model8_pretrain.py][INFO] Epoch:[0/2](518300/4588595) loss:3.104 lr:0.0000100 epoch_Time:25832.0min: [2024-01-05 00:29:08,426][model8_pretrain.py][INFO] Epoch:[0/2](518300/4588595) loss:2.681 lr:0.0000100 epoch_Time:25832.0min: [2024-01-05 00:29:08,426][model8_pretrain.py][INFO] 
Epoch:[0/2](518300/4588595) loss:3.216 lr:0.0000100 epoch_Time:25832.0min: [2024-01-05 00:29:45,365][model8_pretrain.py][INFO] Epoch:[0/2](518400/4588595) loss:2.922 lr:0.0000100 epoch_Time:25831.0min: [2024-01-05 00:29:45,365][model8_pretrain.py][INFO] Epoch:[0/2](518400/4588595) loss:2.711 lr:0.0000100 epoch_Time:25831.0min: [2024-01-05 00:29:45,365][model8_pretrain.py][INFO] Epoch:[0/2](518400/4588595) loss:3.032 lr:0.0000100 epoch_Time:25831.0min: [2024-01-05 00:29:45,365][model8_pretrain.py][INFO] Epoch:[0/2](518400/4588595) loss:3.162 lr:0.0000100 epoch_Time:25831.0min: [2024-01-05 00:29:45,365][model8_pretrain.py][INFO] Epoch:[0/2](518400/4588595) loss:3.040 lr:0.0000100 epoch_Time:25831.0min: [2024-01-05 00:29:45,365][model8_pretrain.py][INFO] Epoch:[0/2](518400/4588595) loss:2.949 lr:0.0000100 epoch_Time:25831.0min: [2024-01-05 00:29:45,365][model8_pretrain.py][INFO] Epoch:[0/2](518400/4588595) loss:3.213 lr:0.0000100 epoch_Time:25831.0min: [2024-01-05 00:29:45,365][model8_pretrain.py][INFO] Epoch:[0/2](518400/4588595) loss:2.711 lr:0.0000100 epoch_Time:25831.0min: [2024-01-05 00:30:22,316][model8_pretrain.py][INFO] Epoch:[0/2](518500/4588595) loss:2.207 lr:0.0000100 epoch_Time:25830.0min: [2024-01-05 00:30:22,316][model8_pretrain.py][INFO] Epoch:[0/2](518500/4588595) loss:2.542 lr:0.0000100 epoch_Time:25830.0min: [2024-01-05 00:30:22,316][model8_pretrain.py][INFO] Epoch:[0/2](518500/4588595) loss:2.976 lr:0.0000100 epoch_Time:25830.0min: [2024-01-05 00:30:22,316][model8_pretrain.py][INFO] Epoch:[0/2](518500/4588595) loss:3.385 lr:0.0000100 epoch_Time:25830.0min: [2024-01-05 00:30:22,316][model8_pretrain.py][INFO] Epoch:[0/2](518500/4588595) loss:3.015 lr:0.0000100 epoch_Time:25830.0min: [2024-01-05 00:30:22,316][model8_pretrain.py][INFO] Epoch:[0/2](518500/4588595) loss:2.797 lr:0.0000100 epoch_Time:25830.0min: [2024-01-05 00:30:22,316][model8_pretrain.py][INFO] Epoch:[0/2](518500/4588595) loss:3.262 lr:0.0000100 epoch_Time:25830.0min: [2024-01-05 
00:30:22,316][model8_pretrain.py][INFO] Epoch:[0/2](518500/4588595) loss:2.812 lr:0.0000100 epoch_Time:25830.0min: [2024-01-05 00:30:59,249][model8_pretrain.py][INFO] Epoch:[0/2](518600/4588595) loss:3.039 lr:0.0000100 epoch_Time:25829.0min: [2024-01-05 00:30:59,249][model8_pretrain.py][INFO] Epoch:[0/2](518600/4588595) loss:2.576 lr:0.0000100 epoch_Time:25829.0min: [2024-01-05 00:30:59,249][model8_pretrain.py][INFO] Epoch:[0/2](518600/4588595) loss:2.672 lr:0.0000100 epoch_Time:25829.0min: [2024-01-05 00:30:59,249][model8_pretrain.py][INFO] Epoch:[0/2](518600/4588595) loss:2.840 lr:0.0000100 epoch_Time:25829.0min: [2024-01-05 00:30:59,249][model8_pretrain.py][INFO] Epoch:[0/2](518600/4588595) loss:2.701 lr:0.0000100 epoch_Time:25829.0min: [2024-01-05 00:30:59,249][model8_pretrain.py][INFO] Epoch:[0/2](518600/4588595) loss:2.620 lr:0.0000100 epoch_Time:25829.0min: [2024-01-05 00:30:59,249][model8_pretrain.py][INFO] Epoch:[0/2](518600/4588595) loss:3.105 lr:0.0000100 epoch_Time:25829.0min: [2024-01-05 00:30:59,250][model8_pretrain.py][INFO] Epoch:[0/2](518600/4588595) loss:2.983 lr:0.0000100 epoch_Time:25829.0min: [2024-01-05 00:31:36,188][model8_pretrain.py][INFO] Epoch:[0/2](518700/4588595) loss:2.779 lr:0.0000100 epoch_Time:25829.0min: [2024-01-05 00:31:36,188][model8_pretrain.py][INFO] Epoch:[0/2](518700/4588595) loss:2.094 lr:0.0000100 epoch_Time:25829.0min: [2024-01-05 00:31:36,188][model8_pretrain.py][INFO] Epoch:[0/2](518700/4588595) loss:2.899 lr:0.0000100 epoch_Time:25829.0min: [2024-01-05 00:31:36,189][model8_pretrain.py][INFO] Epoch:[0/2](518700/4588595) loss:2.692 lr:0.0000100 epoch_Time:25829.0min: [2024-01-05 00:31:36,188][model8_pretrain.py][INFO] Epoch:[0/2](518700/4588595) loss:2.056 lr:0.0000100 epoch_Time:25829.0min: [2024-01-05 00:31:36,188][model8_pretrain.py][INFO] Epoch:[0/2](518700/4588595) loss:2.302 lr:0.0000100 epoch_Time:25829.0min: [2024-01-05 00:31:36,188][model8_pretrain.py][INFO] Epoch:[0/2](518700/4588595) loss:2.678 lr:0.0000100 
epoch_Time:25829.0min: [2024-01-05 00:31:36,188][model8_pretrain.py][INFO] Epoch:[0/2](518700/4588595) loss:2.461 lr:0.0000100 epoch_Time:25829.0min: [2024-01-05 00:32:13,120][model8_pretrain.py][INFO] Epoch:[0/2](518800/4588595) loss:2.941 lr:0.0000100 epoch_Time:25828.0min: [2024-01-05 00:32:13,120][model8_pretrain.py][INFO] Epoch:[0/2](518800/4588595) loss:3.048 lr:0.0000100 epoch_Time:25828.0min: [2024-01-05 00:32:13,120][model8_pretrain.py][INFO] Epoch:[0/2](518800/4588595) loss:2.585 lr:0.0000100 epoch_Time:25828.0min: [2024-01-05 00:32:13,120][model8_pretrain.py][INFO] Epoch:[0/2](518800/4588595) loss:2.785 lr:0.0000100 epoch_Time:25828.0min: [2024-01-05 00:32:13,120][model8_pretrain.py][INFO] Epoch:[0/2](518800/4588595) loss:2.918 lr:0.0000100 epoch_Time:25828.0min: [2024-01-05 00:32:13,120][model8_pretrain.py][INFO] Epoch:[0/2](518800/4588595) loss:2.572 lr:0.0000100 epoch_Time:25828.0min: [2024-01-05 00:32:13,120][model8_pretrain.py][INFO] Epoch:[0/2](518800/4588595) loss:2.838 lr:0.0000100 epoch_Time:25828.0min: [2024-01-05 00:32:13,121][model8_pretrain.py][INFO] Epoch:[0/2](518800/4588595) loss:2.962 lr:0.0000100 epoch_Time:25828.0min: [2024-01-05 00:32:50,051][model8_pretrain.py][INFO] Epoch:[0/2](518900/4588595) loss:3.203 lr:0.0000100 epoch_Time:25827.0min: [2024-01-05 00:32:50,051][model8_pretrain.py][INFO] Epoch:[0/2](518900/4588595) loss:2.801 lr:0.0000100 epoch_Time:25827.0min: [2024-01-05 00:32:50,051][model8_pretrain.py][INFO] Epoch:[0/2](518900/4588595) loss:3.312 lr:0.0000100 epoch_Time:25827.0min: [2024-01-05 00:32:50,051][model8_pretrain.py][INFO] Epoch:[0/2](518900/4588595) loss:2.788 lr:0.0000100 epoch_Time:25827.0min: [2024-01-05 00:32:50,051][model8_pretrain.py][INFO] Epoch:[0/2](518900/4588595) loss:2.833 lr:0.0000100 epoch_Time:25827.0min: [2024-01-05 00:32:50,051][model8_pretrain.py][INFO] Epoch:[0/2](518900/4588595) loss:2.897 lr:0.0000100 epoch_Time:25827.0min: [2024-01-05 00:32:50,051][model8_pretrain.py][INFO] 
Epoch:[0/2](518900/4588595) loss:2.965 lr:0.0000100 epoch_Time:25827.0min: [2024-01-05 00:32:50,051][model8_pretrain.py][INFO] Epoch:[0/2](518900/4588595) loss:2.728 lr:0.0000100 epoch_Time:25827.0min: [2024-01-05 00:33:37,441][model8_pretrain.py][INFO] Epoch:[0/2](519000/4588595) loss:2.665 lr:0.0000100 epoch_Time:25828.0min: [2024-01-05 00:33:37,441][model8_pretrain.py][INFO] Epoch:[0/2](519000/4588595) loss:3.159 lr:0.0000100 epoch_Time:25828.0min: [2024-01-05 00:33:37,441][model8_pretrain.py][INFO] Epoch:[0/2](519000/4588595) loss:2.692 lr:0.0000100 epoch_Time:25828.0min: [2024-01-05 00:33:37,441][model8_pretrain.py][INFO] Epoch:[0/2](519000/4588595) loss:2.902 lr:0.0000100 epoch_Time:25828.0min: [2024-01-05 00:33:37,441][model8_pretrain.py][INFO] Epoch:[0/2](519000/4588595) loss:2.672 lr:0.0000100 epoch_Time:25828.0min: [2024-01-05 00:33:37,441][model8_pretrain.py][INFO] Epoch:[0/2](519000/4588595) loss:2.883 lr:0.0000100 epoch_Time:25828.0min: [2024-01-05 00:33:37,442][model8_pretrain.py][INFO] Epoch:[0/2](519000/4588595) loss:2.959 lr:0.0000100 epoch_Time:25828.0min: [2024-01-05 00:33:37,442][model8_pretrain.py][INFO] Epoch:[0/2](519000/4588595) loss:2.732 lr:0.0000100 epoch_Time:25828.0min: [2024-01-05 00:34:14,372][model8_pretrain.py][INFO] Epoch:[0/2](519100/4588595) loss:2.753 lr:0.0000100 epoch_Time:25827.0min: [2024-01-05 00:34:14,372][model8_pretrain.py][INFO] Epoch:[0/2](519100/4588595) loss:2.821 lr:0.0000100 epoch_Time:25827.0min: [2024-01-05 00:34:14,372][model8_pretrain.py][INFO] Epoch:[0/2](519100/4588595) loss:2.876 lr:0.0000100 epoch_Time:25827.0min: [2024-01-05 00:34:14,372][model8_pretrain.py][INFO] Epoch:[0/2](519100/4588595) loss:2.932 lr:0.0000100 epoch_Time:25827.0min: [2024-01-05 00:34:14,372][model8_pretrain.py][INFO] Epoch:[0/2](519100/4588595) loss:2.961 lr:0.0000100 epoch_Time:25827.0min: [2024-01-05 00:34:14,372][model8_pretrain.py][INFO] Epoch:[0/2](519100/4588595) loss:2.859 lr:0.0000100 epoch_Time:25827.0min: [2024-01-05 
00:34:14,372][model8_pretrain.py][INFO] Epoch:[0/2](519100/4588595) loss:2.402 lr:0.0000100 epoch_Time:25827.0min: [2024-01-05 00:34:14,372][model8_pretrain.py][INFO] Epoch:[0/2](519100/4588595) loss:2.420 lr:0.0000100 epoch_Time:25827.0min: [2024-01-05 00:34:51,304][model8_pretrain.py][INFO] Epoch:[0/2](519200/4588595) loss:3.448 lr:0.0000100 epoch_Time:25826.0min: [2024-01-05 00:34:51,304][model8_pretrain.py][INFO] Epoch:[0/2](519200/4588595) loss:2.262 lr:0.0000100 epoch_Time:25826.0min: [2024-01-05 00:34:51,304][model8_pretrain.py][INFO] Epoch:[0/2](519200/4588595) loss:2.648 lr:0.0000100 epoch_Time:25826.0min: [2024-01-05 00:34:51,304][model8_pretrain.py][INFO] Epoch:[0/2](519200/4588595) loss:3.045 lr:0.0000100 epoch_Time:25826.0min: [2024-01-05 00:34:51,304][model8_pretrain.py][INFO] Epoch:[0/2](519200/4588595) loss:2.167 lr:0.0000100 epoch_Time:25826.0min: [2024-01-05 00:34:51,304][model8_pretrain.py][INFO] Epoch:[0/2](519200/4588595) loss:2.494 lr:0.0000100 epoch_Time:25826.0min: [2024-01-05 00:34:51,305][model8_pretrain.py][INFO] Epoch:[0/2](519200/4588595) loss:2.853 lr:0.0000100 epoch_Time:25826.0min: [2024-01-05 00:34:51,305][model8_pretrain.py][INFO] Epoch:[0/2](519200/4588595) loss:2.956 lr:0.0000100 epoch_Time:25826.0min: [2024-01-05 00:35:28,234][model8_pretrain.py][INFO] Epoch:[0/2](519300/4588595) loss:3.300 lr:0.0000100 epoch_Time:25825.0min: [2024-01-05 00:35:28,234][model8_pretrain.py][INFO] Epoch:[0/2](519300/4588595) loss:2.347 lr:0.0000100 epoch_Time:25825.0min: [2024-01-05 00:35:28,234][model8_pretrain.py][INFO] Epoch:[0/2](519300/4588595) loss:2.487 lr:0.0000100 epoch_Time:25825.0min: [2024-01-05 00:35:28,234][model8_pretrain.py][INFO] Epoch:[0/2](519300/4588595) loss:2.601 lr:0.0000100 epoch_Time:25825.0min: [2024-01-05 00:35:28,234][model8_pretrain.py][INFO] Epoch:[0/2](519300/4588595) loss:3.009 lr:0.0000100 epoch_Time:25825.0min: [2024-01-05 00:35:28,234][model8_pretrain.py][INFO] Epoch:[0/2](519300/4588595) loss:2.533 lr:0.0000100 
epoch_Time:25825.0min: [2024-01-05 00:35:28,234][model8_pretrain.py][INFO] Epoch:[0/2](519300/4588595) loss:3.227 lr:0.0000100 epoch_Time:25825.0min: [2024-01-05 00:35:28,234][model8_pretrain.py][INFO] Epoch:[0/2](519300/4588595) loss:2.644 lr:0.0000100 epoch_Time:25825.0min: [2024-01-05 00:36:05,155][model8_pretrain.py][INFO] Epoch:[0/2](519400/4588595) loss:2.815 lr:0.0000100 epoch_Time:25824.0min: [2024-01-05 00:36:05,155][model8_pretrain.py][INFO] Epoch:[0/2](519400/4588595) loss:3.410 lr:0.0000100 epoch_Time:25824.0min: [2024-01-05 00:36:05,155][model8_pretrain.py][INFO] Epoch:[0/2](519400/4588595) loss:2.783 lr:0.0000100 epoch_Time:25824.0min: [2024-01-05 00:36:05,155][model8_pretrain.py][INFO] Epoch:[0/2](519400/4588595) loss:3.292 lr:0.0000100 epoch_Time:25824.0min: [2024-01-05 00:36:05,155][model8_pretrain.py][INFO] Epoch:[0/2](519400/4588595) loss:3.146 lr:0.0000100 epoch_Time:25824.0min: [2024-01-05 00:36:05,155][model8_pretrain.py][INFO] Epoch:[0/2](519400/4588595) loss:2.517 lr:0.0000100 epoch_Time:25824.0min: [2024-01-05 00:36:05,155][model8_pretrain.py][INFO] Epoch:[0/2](519400/4588595) loss:3.178 lr:0.0000100 epoch_Time:25824.0min: [2024-01-05 00:36:05,155][model8_pretrain.py][INFO] Epoch:[0/2](519400/4588595) loss:2.540 lr:0.0000100 epoch_Time:25824.0min: [2024-01-05 00:36:42,082][model8_pretrain.py][INFO] Epoch:[0/2](519500/4588595) loss:2.958 lr:0.0000100 epoch_Time:25824.0min: [2024-01-05 00:36:42,082][model8_pretrain.py][INFO] Epoch:[0/2](519500/4588595) loss:2.950 lr:0.0000100 epoch_Time:25824.0min: [2024-01-05 00:36:42,082][model8_pretrain.py][INFO] Epoch:[0/2](519500/4588595) loss:2.143 lr:0.0000100 epoch_Time:25824.0min: [2024-01-05 00:36:42,082][model8_pretrain.py][INFO] Epoch:[0/2](519500/4588595) loss:3.065 lr:0.0000100 epoch_Time:25824.0min: [2024-01-05 00:36:42,082][model8_pretrain.py][INFO] Epoch:[0/2](519500/4588595) loss:2.618 lr:0.0000100 epoch_Time:25824.0min: [2024-01-05 00:36:42,082][model8_pretrain.py][INFO] 
Epoch:[0/2](519500/4588595) loss:2.929 lr:0.0000100 epoch_Time:25824.0min: [2024-01-05 00:36:42,082][model8_pretrain.py][INFO] Epoch:[0/2](519500/4588595) loss:2.824 lr:0.0000100 epoch_Time:25824.0min: [2024-01-05 00:36:42,082][model8_pretrain.py][INFO] Epoch:[0/2](519500/4588595) loss:2.683 lr:0.0000100 epoch_Time:25824.0min: [2024-01-05 00:37:19,006][model8_pretrain.py][INFO] Epoch:[0/2](519600/4588595) loss:3.335 lr:0.0000100 epoch_Time:25823.0min: [2024-01-05 00:37:19,006][model8_pretrain.py][INFO] Epoch:[0/2](519600/4588595) loss:2.761 lr:0.0000100 epoch_Time:25823.0min: [2024-01-05 00:37:19,006][model8_pretrain.py][INFO] Epoch:[0/2](519600/4588595) loss:2.774 lr:0.0000100 epoch_Time:25823.0min: [2024-01-05 00:37:19,006][model8_pretrain.py][INFO] Epoch:[0/2](519600/4588595) loss:2.772 lr:0.0000100 epoch_Time:25823.0min: [2024-01-05 00:37:19,006][model8_pretrain.py][INFO] Epoch:[0/2](519600/4588595) loss:2.536 lr:0.0000100 epoch_Time:25823.0min: [2024-01-05 00:37:19,006][model8_pretrain.py][INFO] Epoch:[0/2](519600/4588595) loss:2.758 lr:0.0000100 epoch_Time:25823.0min: [2024-01-05 00:37:19,006][model8_pretrain.py][INFO] Epoch:[0/2](519600/4588595) loss:2.870 lr:0.0000100 epoch_Time:25823.0min: [2024-01-05 00:37:19,006][model8_pretrain.py][INFO] Epoch:[0/2](519600/4588595) loss:2.104 lr:0.0000100 epoch_Time:25823.0min: [2024-01-05 00:37:55,924][model8_pretrain.py][INFO] Epoch:[0/2](519700/4588595) loss:2.517 lr:0.0000100 epoch_Time:25822.0min: [2024-01-05 00:37:55,924][model8_pretrain.py][INFO] Epoch:[0/2](519700/4588595) loss:2.006 lr:0.0000100 epoch_Time:25822.0min: [2024-01-05 00:37:55,924][model8_pretrain.py][INFO] Epoch:[0/2](519700/4588595) loss:2.774 lr:0.0000100 epoch_Time:25822.0min: [2024-01-05 00:37:55,924][model8_pretrain.py][INFO] Epoch:[0/2](519700/4588595) loss:2.993 lr:0.0000100 epoch_Time:25822.0min: [2024-01-05 00:37:55,924][model8_pretrain.py][INFO] Epoch:[0/2](519700/4588595) loss:3.066 lr:0.0000100 epoch_Time:25822.0min: [2024-01-05 
00:37:55,924][model8_pretrain.py][INFO] Epoch:[0/2](519700/4588595) loss:2.806 lr:0.0000100 epoch_Time:25822.0min: [2024-01-05 00:37:55,924][model8_pretrain.py][INFO] Epoch:[0/2](519700/4588595) loss:3.218 lr:0.0000100 epoch_Time:25822.0min: [2024-01-05 00:37:55,924][model8_pretrain.py][INFO] Epoch:[0/2](519700/4588595) loss:3.113 lr:0.0000100 epoch_Time:25822.0min: [2024-01-05 00:38:42,930][model8_pretrain.py][INFO] Epoch:[0/2](519800/4588595) loss:3.117 lr:0.0000100 epoch_Time:25823.0min: [2024-01-05 00:38:42,930][model8_pretrain.py][INFO] Epoch:[0/2](519800/4588595) loss:2.910 lr:0.0000100 epoch_Time:25823.0min: [2024-01-05 00:38:42,930][model8_pretrain.py][INFO] Epoch:[0/2](519800/4588595) loss:2.936 lr:0.0000100 epoch_Time:25823.0min: [2024-01-05 00:38:42,930][model8_pretrain.py][INFO] Epoch:[0/2](519800/4588595) loss:3.041 lr:0.0000100 epoch_Time:25823.0min: [2024-01-05 00:38:42,930][model8_pretrain.py][INFO] Epoch:[0/2](519800/4588595) loss:2.748 lr:0.0000100 epoch_Time:25823.0min: [2024-01-05 00:38:42,930][model8_pretrain.py][INFO] Epoch:[0/2](519800/4588595) loss:2.708 lr:0.0000100 epoch_Time:25823.0min: [2024-01-05 00:38:42,930][model8_pretrain.py][INFO] Epoch:[0/2](519800/4588595) loss:3.037 lr:0.0000100 epoch_Time:25823.0min: [2024-01-05 00:38:42,930][model8_pretrain.py][INFO] Epoch:[0/2](519800/4588595) loss:2.731 lr:0.0000100 epoch_Time:25823.0min: [2024-01-05 00:39:19,863][model8_pretrain.py][INFO] Epoch:[0/2](519900/4588595) loss:2.664 lr:0.0000100 epoch_Time:25822.0min: [2024-01-05 00:39:19,863][model8_pretrain.py][INFO] Epoch:[0/2](519900/4588595) loss:2.563 lr:0.0000100 epoch_Time:25822.0min: [2024-01-05 00:39:19,863][model8_pretrain.py][INFO] Epoch:[0/2](519900/4588595) loss:2.762 lr:0.0000100 epoch_Time:25822.0min: [2024-01-05 00:39:19,863][model8_pretrain.py][INFO] Epoch:[0/2](519900/4588595) loss:2.761 lr:0.0000100 epoch_Time:25822.0min: [2024-01-05 00:39:19,863][model8_pretrain.py][INFO] Epoch:[0/2](519900/4588595) loss:3.306 lr:0.0000100 
epoch_Time:25822.0min: [2024-01-05 00:39:19,863][model8_pretrain.py][INFO] Epoch:[0/2](519900/4588595) loss:3.359 lr:0.0000100 epoch_Time:25822.0min: [2024-01-05 00:39:19,863][model8_pretrain.py][INFO] Epoch:[0/2](519900/4588595) loss:2.988 lr:0.0000100 epoch_Time:25822.0min: [2024-01-05 00:39:19,863][model8_pretrain.py][INFO] Epoch:[0/2](519900/4588595) loss:3.059 lr:0.0000100 epoch_Time:25822.0min: [2024-01-05 00:39:56,799][model8_pretrain.py][INFO] Epoch:[0/2](520000/4588595) loss:2.927 lr:0.0000100 epoch_Time:25821.0min: [2024-01-05 00:39:56,799][model8_pretrain.py][INFO] Epoch:[0/2](520000/4588595) loss:2.807 lr:0.0000100 epoch_Time:25821.0min: [2024-01-05 00:39:56,800][model8_pretrain.py][INFO] Epoch:[0/2](520000/4588595) loss:2.219 lr:0.0000100 epoch_Time:25821.0min: [2024-01-05 00:39:56,800][model8_pretrain.py][INFO] Epoch:[0/2](520000/4588595) loss:2.485 lr:0.0000100 epoch_Time:25821.0min: [2024-01-05 00:39:56,800][model8_pretrain.py][INFO] Epoch:[0/2](520000/4588595) loss:2.490 lr:0.0000100 epoch_Time:25821.0min: [2024-01-05 00:39:56,800][model8_pretrain.py][INFO] Epoch:[0/2](520000/4588595) loss:2.388 lr:0.0000100 epoch_Time:25821.0min: [2024-01-05 00:39:56,800][model8_pretrain.py][INFO] Epoch:[0/2](520000/4588595) loss:3.115 lr:0.0000100 epoch_Time:25821.0min: [2024-01-05 00:39:56,800][model8_pretrain.py][INFO] Epoch:[0/2](520000/4588595) loss:2.866 lr:0.0000100 epoch_Time:25821.0min: [2024-01-05 00:40:33,743][model8_pretrain.py][INFO] Epoch:[0/2](520100/4588595) loss:3.118 lr:0.0000100 epoch_Time:25821.0min: [2024-01-05 00:40:33,743][model8_pretrain.py][INFO] Epoch:[0/2](520100/4588595) loss:2.671 lr:0.0000100 epoch_Time:25821.0min: [2024-01-05 00:40:33,743][model8_pretrain.py][INFO] Epoch:[0/2](520100/4588595) loss:2.201 lr:0.0000100 epoch_Time:25821.0min: [2024-01-05 00:40:33,743][model8_pretrain.py][INFO] Epoch:[0/2](520100/4588595) loss:3.167 lr:0.0000100 epoch_Time:25821.0min: [2024-01-05 00:40:33,743][model8_pretrain.py][INFO] 
Epoch:[0/2](520100/4588595) loss:2.887 lr:0.0000100 epoch_Time:25821.0min: [2024-01-05 00:40:33,743][model8_pretrain.py][INFO] Epoch:[0/2](520100/4588595) loss:2.214 lr:0.0000100 epoch_Time:25821.0min: [2024-01-05 00:40:33,743][model8_pretrain.py][INFO] Epoch:[0/2](520100/4588595) loss:2.344 lr:0.0000100 epoch_Time:25821.0min: [2024-01-05 00:40:33,744][model8_pretrain.py][INFO] Epoch:[0/2](520100/4588595) loss:2.480 lr:0.0000100 epoch_Time:25821.0min: [2024-01-05 00:41:10,686][model8_pretrain.py][INFO] Epoch:[0/2](520200/4588595) loss:3.438 lr:0.0000100 epoch_Time:25819.0min: [2024-01-05 00:41:10,686][model8_pretrain.py][INFO] Epoch:[0/2](520200/4588595) loss:2.773 lr:0.0000100 epoch_Time:25819.0min: [2024-01-05 00:41:10,686][model8_pretrain.py][INFO] Epoch:[0/2](520200/4588595) loss:3.479 lr:0.0000100 epoch_Time:25819.0min: [2024-01-05 00:41:10,686][model8_pretrain.py][INFO] Epoch:[0/2](520200/4588595) loss:2.745 lr:0.0000100 epoch_Time:25819.0min: [2024-01-05 00:41:10,687][model8_pretrain.py][INFO] Epoch:[0/2](520200/4588595) loss:2.522 lr:0.0000100 epoch_Time:25819.0min: [2024-01-05 00:41:10,687][model8_pretrain.py][INFO] Epoch:[0/2](520200/4588595) loss:3.105 lr:0.0000100 epoch_Time:25819.0min: [2024-01-05 00:41:10,687][model8_pretrain.py][INFO] Epoch:[0/2](520200/4588595) loss:2.736 lr:0.0000100 epoch_Time:25819.0min: [2024-01-05 00:41:10,687][model8_pretrain.py][INFO] Epoch:[0/2](520200/4588595) loss:2.479 lr:0.0000100 epoch_Time:25819.0min: [2024-01-05 00:41:47,611][model8_pretrain.py][INFO] Epoch:[0/2](520300/4588595) loss:2.472 lr:0.0000100 epoch_Time:25818.0min: [2024-01-05 00:41:47,611][model8_pretrain.py][INFO] Epoch:[0/2](520300/4588595) loss:2.590 lr:0.0000100 epoch_Time:25819.0min: [2024-01-05 00:41:47,611][model8_pretrain.py][INFO] Epoch:[0/2](520300/4588595) loss:3.069 lr:0.0000100 epoch_Time:25818.0min: [2024-01-05 00:41:47,611][model8_pretrain.py][INFO] Epoch:[0/2](520300/4588595) loss:2.517 lr:0.0000100 epoch_Time:25818.0min: [2024-01-05 
00:41:47,611][model8_pretrain.py][INFO] Epoch:[0/2](520300/4588595) loss:2.664 lr:0.0000100 epoch_Time:25819.0min: [2024-01-05 00:41:47,611][model8_pretrain.py][INFO] Epoch:[0/2](520300/4588595) loss:2.757 lr:0.0000100 epoch_Time:25818.0min: [2024-01-05 00:41:47,611][model8_pretrain.py][INFO] Epoch:[0/2](520300/4588595) loss:2.562 lr:0.0000100 epoch_Time:25818.0min: [2024-01-05 00:41:47,611][model8_pretrain.py][INFO] Epoch:[0/2](520300/4588595) loss:3.107 lr:0.0000100 epoch_Time:25818.0min: [2024-01-05 00:42:24,543][model8_pretrain.py][INFO] Epoch:[0/2](520400/4588595) loss:3.377 lr:0.0000100 epoch_Time:25818.0min: [2024-01-05 00:42:24,543][model8_pretrain.py][INFO] Epoch:[0/2](520400/4588595) loss:2.743 lr:0.0000100 epoch_Time:25818.0min: [2024-01-05 00:42:24,543][model8_pretrain.py][INFO] Epoch:[0/2](520400/4588595) loss:2.898 lr:0.0000100 epoch_Time:25818.0min: [2024-01-05 00:42:24,543][model8_pretrain.py][INFO] Epoch:[0/2](520400/4588595) loss:2.725 lr:0.0000100 epoch_Time:25818.0min: [2024-01-05 00:42:24,543][model8_pretrain.py][INFO] Epoch:[0/2](520400/4588595) loss:2.822 lr:0.0000100 epoch_Time:25818.0min: [2024-01-05 00:42:24,543][model8_pretrain.py][INFO] Epoch:[0/2](520400/4588595) loss:3.072 lr:0.0000100 epoch_Time:25818.0min: [2024-01-05 00:42:24,543][model8_pretrain.py][INFO] Epoch:[0/2](520400/4588595) loss:3.010 lr:0.0000100 epoch_Time:25818.0min: [2024-01-05 00:42:24,543][model8_pretrain.py][INFO] Epoch:[0/2](520400/4588595) loss:2.490 lr:0.0000100 epoch_Time:25818.0min: [2024-01-05 00:43:01,470][model8_pretrain.py][INFO] Epoch:[0/2](520500/4588595) loss:2.705 lr:0.0000100 epoch_Time:25817.0min: [2024-01-05 00:43:01,470][model8_pretrain.py][INFO] Epoch:[0/2](520500/4588595) loss:2.823 lr:0.0000100 epoch_Time:25817.0min: [2024-01-05 00:43:01,470][model8_pretrain.py][INFO] Epoch:[0/2](520500/4588595) loss:2.525 lr:0.0000100 epoch_Time:25817.0min: [2024-01-05 00:43:01,470][model8_pretrain.py][INFO] Epoch:[0/2](520500/4588595) loss:2.890 lr:0.0000100 
epoch_Time:25817.0min: [2024-01-05 00:43:01,470][model8_pretrain.py][INFO] Epoch:[0/2](520500/4588595) loss:2.632 lr:0.0000100 epoch_Time:25817.0min: [2024-01-05 00:43:01,470][model8_pretrain.py][INFO] Epoch:[0/2](520500/4588595) loss:2.911 lr:0.0000100 epoch_Time:25817.0min: [2024-01-05 00:43:01,470][model8_pretrain.py][INFO] Epoch:[0/2](520500/4588595) loss:3.153 lr:0.0000100 epoch_Time:25817.0min: [2024-01-05 00:43:01,470][model8_pretrain.py][INFO] Epoch:[0/2](520500/4588595) loss:2.036 lr:0.0000100 epoch_Time:25817.0min: [2024-01-05 00:43:48,632][model8_pretrain.py][INFO] Epoch:[0/2](520600/4588595) loss:2.373 lr:0.0000100 epoch_Time:25817.0min: [2024-01-05 00:43:48,632][model8_pretrain.py][INFO] Epoch:[0/2](520600/4588595) loss:3.073 lr:0.0000100 epoch_Time:25817.0min: [2024-01-05 00:43:48,632][model8_pretrain.py][INFO] Epoch:[0/2](520600/4588595) loss:1.942 lr:0.0000100 epoch_Time:25817.0min: [2024-01-05 00:43:48,632][model8_pretrain.py][INFO] Epoch:[0/2](520600/4588595) loss:2.802 lr:0.0000100 epoch_Time:25817.0min: [2024-01-05 00:43:48,632][model8_pretrain.py][INFO] Epoch:[0/2](520600/4588595) loss:3.414 lr:0.0000100 epoch_Time:25817.0min: [2024-01-05 00:43:48,632][model8_pretrain.py][INFO] Epoch:[0/2](520600/4588595) loss:2.830 lr:0.0000100 epoch_Time:25817.0min: [2024-01-05 00:43:48,632][model8_pretrain.py][INFO] Epoch:[0/2](520600/4588595) loss:2.812 lr:0.0000100 epoch_Time:25817.0min: [2024-01-05 00:43:48,633][model8_pretrain.py][INFO] Epoch:[0/2](520600/4588595) loss:2.813 lr:0.0000100 epoch_Time:25817.0min: [2024-01-05 00:44:25,569][model8_pretrain.py][INFO] Epoch:[0/2](520700/4588595) loss:2.816 lr:0.0000100 epoch_Time:25817.0min: [2024-01-05 00:44:25,569][model8_pretrain.py][INFO] Epoch:[0/2](520700/4588595) loss:2.257 lr:0.0000100 epoch_Time:25817.0min: [2024-01-05 00:44:25,569][model8_pretrain.py][INFO] Epoch:[0/2](520700/4588595) loss:2.967 lr:0.0000100 epoch_Time:25817.0min: [2024-01-05 00:44:25,569][model8_pretrain.py][INFO] 
Epoch:[0/2](520700/4588595) loss:3.003 lr:0.0000100 epoch_Time:25817.0min: [2024-01-05 00:44:25,569][model8_pretrain.py][INFO] Epoch:[0/2](520700/4588595) loss:3.264 lr:0.0000100 epoch_Time:25817.0min: [2024-01-05 00:44:25,569][model8_pretrain.py][INFO] Epoch:[0/2](520700/4588595) loss:3.030 lr:0.0000100 epoch_Time:25817.0min: [2024-01-05 00:44:25,569][model8_pretrain.py][INFO] Epoch:[0/2](520700/4588595) loss:2.955 lr:0.0000100 epoch_Time:25817.0min: [2024-01-05 00:44:25,569][model8_pretrain.py][INFO] Epoch:[0/2](520700/4588595) loss:2.773 lr:0.0000100 epoch_Time:25817.0min: [2024-01-05 00:45:02,536][model8_pretrain.py][INFO] Epoch:[0/2](520800/4588595) loss:2.954 lr:0.0000100 epoch_Time:25816.0min: [2024-01-05 00:45:02,536][model8_pretrain.py][INFO] Epoch:[0/2](520800/4588595) loss:2.875 lr:0.0000100 epoch_Time:25816.0min: [2024-01-05 00:45:02,536][model8_pretrain.py][INFO] Epoch:[0/2](520800/4588595) loss:3.216 lr:0.0000100 epoch_Time:25816.0min: [2024-01-05 00:45:02,536][model8_pretrain.py][INFO] Epoch:[0/2](520800/4588595) loss:2.225 lr:0.0000100 epoch_Time:25816.0min: [2024-01-05 00:45:02,536][model8_pretrain.py][INFO] Epoch:[0/2](520800/4588595) loss:3.262 lr:0.0000100 epoch_Time:25816.0min: [2024-01-05 00:45:02,536][model8_pretrain.py][INFO] Epoch:[0/2](520800/4588595) loss:2.761 lr:0.0000100 epoch_Time:25816.0min: [2024-01-05 00:45:02,537][model8_pretrain.py][INFO] Epoch:[0/2](520800/4588595) loss:2.657 lr:0.0000100 epoch_Time:25816.0min: [2024-01-05 00:45:02,537][model8_pretrain.py][INFO] Epoch:[0/2](520800/4588595) loss:2.942 lr:0.0000100 epoch_Time:25816.0min: [2024-01-05 00:45:39,493][model8_pretrain.py][INFO] Epoch:[0/2](520900/4588595) loss:3.205 lr:0.0000100 epoch_Time:25816.0min: [2024-01-05 00:45:39,493][model8_pretrain.py][INFO] Epoch:[0/2](520900/4588595) loss:2.739 lr:0.0000100 epoch_Time:25816.0min: [2024-01-05 00:45:39,493][model8_pretrain.py][INFO] Epoch:[0/2](520900/4588595) loss:3.089 lr:0.0000100 epoch_Time:25816.0min: [2024-01-05 
00:45:39,493][model8_pretrain.py][INFO] Epoch:[0/2](520900/4588595) loss:2.428 lr:0.0000100 epoch_Time:25816.0min: [2024-01-05 00:45:39,493][model8_pretrain.py][INFO] Epoch:[0/2](520900/4588595) loss:3.364 lr:0.0000100 epoch_Time:25816.0min: [2024-01-05 00:45:39,493][model8_pretrain.py][INFO] Epoch:[0/2](520900/4588595) loss:2.363 lr:0.0000100 epoch_Time:25816.0min: [2024-01-05 00:45:39,493][model8_pretrain.py][INFO] Epoch:[0/2](520900/4588595) loss:2.624 lr:0.0000100 epoch_Time:25816.0min: [2024-01-05 00:45:39,493][model8_pretrain.py][INFO] Epoch:[0/2](520900/4588595) loss:2.891 lr:0.0000100 epoch_Time:25816.0min: [2024-01-05 00:46:16,452][model8_pretrain.py][INFO] Epoch:[0/2](521000/4588595) loss:2.849 lr:0.0000100 epoch_Time:25815.0min: [2024-01-05 00:46:16,452][model8_pretrain.py][INFO] Epoch:[0/2](521000/4588595) loss:2.778 lr:0.0000100 epoch_Time:25815.0min: [2024-01-05 00:46:16,452][model8_pretrain.py][INFO] Epoch:[0/2](521000/4588595) loss:2.634 lr:0.0000100 epoch_Time:25815.0min: [2024-01-05 00:46:16,452][model8_pretrain.py][INFO] Epoch:[0/2](521000/4588595) loss:2.749 lr:0.0000100 epoch_Time:25815.0min: [2024-01-05 00:46:16,452][model8_pretrain.py][INFO] Epoch:[0/2](521000/4588595) loss:2.766 lr:0.0000100 epoch_Time:25815.0min: [2024-01-05 00:46:16,452][model8_pretrain.py][INFO] Epoch:[0/2](521000/4588595) loss:3.018 lr:0.0000100 epoch_Time:25815.0min: [2024-01-05 00:46:16,452][model8_pretrain.py][INFO] Epoch:[0/2](521000/4588595) loss:3.029 lr:0.0000100 epoch_Time:25815.0min: [2024-01-05 00:46:16,452][model8_pretrain.py][INFO] Epoch:[0/2](521000/4588595) loss:2.658 lr:0.0000100 epoch_Time:25815.0min: [2024-01-05 00:46:53,404][model8_pretrain.py][INFO] Epoch:[0/2](521100/4588595) loss:3.375 lr:0.0000100 epoch_Time:25813.0min: [2024-01-05 00:46:53,404][model8_pretrain.py][INFO] Epoch:[0/2](521100/4588595) loss:2.965 lr:0.0000100 epoch_Time:25813.0min: [2024-01-05 00:46:53,404][model8_pretrain.py][INFO] Epoch:[0/2](521100/4588595) loss:2.388 lr:0.0000100 
epoch_Time:25813.0min: [2024-01-05 00:46:53,404][model8_pretrain.py][INFO] Epoch:[0/2](521100/4588595) loss:3.172 lr:0.0000100 epoch_Time:25813.0min: [2024-01-05 00:46:53,404][model8_pretrain.py][INFO] Epoch:[0/2](521100/4588595) loss:2.493 lr:0.0000100 epoch_Time:25813.0min: [2024-01-05 00:46:53,404][model8_pretrain.py][INFO] Epoch:[0/2](521100/4588595) loss:3.391 lr:0.0000100 epoch_Time:25813.0min: [2024-01-05 00:46:53,404][model8_pretrain.py][INFO] Epoch:[0/2](521100/4588595) loss:2.779 lr:0.0000100 epoch_Time:25813.0min: [2024-01-05 00:46:53,404][model8_pretrain.py][INFO] Epoch:[0/2](521100/4588595) loss:2.872 lr:0.0000100 epoch_Time:25813.0min: [2024-01-05 00:47:30,354][model8_pretrain.py][INFO] Epoch:[0/2](521200/4588595) loss:2.459 lr:0.0000100 epoch_Time:25813.0min: [2024-01-05 00:47:30,354][model8_pretrain.py][INFO] Epoch:[0/2](521200/4588595) loss:2.589 lr:0.0000100 epoch_Time:25813.0min: [2024-01-05 00:47:30,354][model8_pretrain.py][INFO] Epoch:[0/2](521200/4588595) loss:3.063 lr:0.0000100 epoch_Time:25813.0min: [2024-01-05 00:47:30,354][model8_pretrain.py][INFO] Epoch:[0/2](521200/4588595) loss:2.838 lr:0.0000100 epoch_Time:25813.0min: [2024-01-05 00:47:30,354][model8_pretrain.py][INFO] Epoch:[0/2](521200/4588595) loss:2.764 lr:0.0000100 epoch_Time:25813.0min: [2024-01-05 00:47:30,354][model8_pretrain.py][INFO] Epoch:[0/2](521200/4588595) loss:2.749 lr:0.0000100 epoch_Time:25813.0min: [2024-01-05 00:47:30,354][model8_pretrain.py][INFO] Epoch:[0/2](521200/4588595) loss:3.489 lr:0.0000100 epoch_Time:25813.0min: [2024-01-05 00:47:30,354][model8_pretrain.py][INFO] Epoch:[0/2](521200/4588595) loss:2.948 lr:0.0000100 epoch_Time:25813.0min: [2024-01-05 00:48:07,291][model8_pretrain.py][INFO] Epoch:[0/2](521300/4588595) loss:2.985 lr:0.0000100 epoch_Time:25812.0min: [2024-01-05 00:48:07,291][model8_pretrain.py][INFO] Epoch:[0/2](521300/4588595) loss:2.606 lr:0.0000100 epoch_Time:25812.0min: [2024-01-05 00:48:07,291][model8_pretrain.py][INFO] 
Epoch:[0/2](521300/4588595) loss:2.332 lr:0.0000100 epoch_Time:25812.0min: [2024-01-05 00:48:07,291][model8_pretrain.py][INFO] Epoch:[0/2](521300/4588595) loss:2.594 lr:0.0000100 epoch_Time:25812.0min: [2024-01-05 00:48:07,291][model8_pretrain.py][INFO] Epoch:[0/2](521300/4588595) loss:2.586 lr:0.0000100 epoch_Time:25812.0min: [2024-01-05 00:48:07,291][model8_pretrain.py][INFO] Epoch:[0/2](521300/4588595) loss:2.900 lr:0.0000100 epoch_Time:25812.0min: [2024-01-05 00:48:07,291][model8_pretrain.py][INFO] Epoch:[0/2](521300/4588595) loss:3.024 lr:0.0000100 epoch_Time:25812.0min: [2024-01-05 00:48:07,291][model8_pretrain.py][INFO] Epoch:[0/2](521300/4588595) loss:3.102 lr:0.0000100 epoch_Time:25812.0min: [2024-01-05 00:48:54,455][model8_pretrain.py][INFO] Epoch:[0/2](521400/4588595) loss:3.128 lr:0.0000100 epoch_Time:25812.0min: [2024-01-05 00:48:54,455][model8_pretrain.py][INFO] Epoch:[0/2](521400/4588595) loss:3.056 lr:0.0000100 epoch_Time:25812.0min: [2024-01-05 00:48:54,455][model8_pretrain.py][INFO] Epoch:[0/2](521400/4588595) loss:2.961 lr:0.0000100 epoch_Time:25812.0min: [2024-01-05 00:48:54,455][model8_pretrain.py][INFO] Epoch:[0/2](521400/4588595) loss:2.948 lr:0.0000100 epoch_Time:25812.0min: [2024-01-05 00:48:54,455][model8_pretrain.py][INFO] Epoch:[0/2](521400/4588595) loss:2.911 lr:0.0000100 epoch_Time:25812.0min: [2024-01-05 00:48:54,455][model8_pretrain.py][INFO] Epoch:[0/2](521400/4588595) loss:2.478 lr:0.0000100 epoch_Time:25812.0min: [2024-01-05 00:48:54,455][model8_pretrain.py][INFO] Epoch:[0/2](521400/4588595) loss:2.985 lr:0.0000100 epoch_Time:25812.0min: [2024-01-05 00:48:54,455][model8_pretrain.py][INFO] Epoch:[0/2](521400/4588595) loss:2.936 lr:0.0000100 epoch_Time:25812.0min: [2024-01-05 00:49:31,381][model8_pretrain.py][INFO] Epoch:[0/2](521500/4588595) loss:2.775 lr:0.0000100 epoch_Time:25812.0min: [2024-01-05 00:49:31,381][model8_pretrain.py][INFO] Epoch:[0/2](521500/4588595) loss:2.469 lr:0.0000100 epoch_Time:25812.0min: [2024-01-05 
00:49:31,381][model8_pretrain.py][INFO] Epoch:[0/2](521500/4588595) loss:2.892 lr:0.0000100 epoch_Time:25812.0min: [2024-01-05 00:49:31,381][model8_pretrain.py][INFO] Epoch:[0/2](521500/4588595) loss:3.116 lr:0.0000100 epoch_Time:25812.0min: [2024-01-05 00:49:31,381][model8_pretrain.py][INFO] Epoch:[0/2](521500/4588595) loss:3.157 lr:0.0000100 epoch_Time:25812.0min: [2024-01-05 00:49:31,381][model8_pretrain.py][INFO] Epoch:[0/2](521500/4588595) loss:2.981 lr:0.0000100 epoch_Time:25812.0min: [2024-01-05 00:49:31,381][model8_pretrain.py][INFO] Epoch:[0/2](521500/4588595) loss:2.558 lr:0.0000100 epoch_Time:25812.0min: [2024-01-05 00:49:31,381][model8_pretrain.py][INFO] Epoch:[0/2](521500/4588595) loss:2.450 lr:0.0000100 epoch_Time:25812.0min: [2024-01-05 00:50:08,313][model8_pretrain.py][INFO] Epoch:[0/2](521600/4588595) loss:2.556 lr:0.0000100 epoch_Time:25811.0min: [2024-01-05 00:50:08,313][model8_pretrain.py][INFO] Epoch:[0/2](521600/4588595) loss:2.986 lr:0.0000100 epoch_Time:25811.0min: [2024-01-05 00:50:08,313][model8_pretrain.py][INFO] Epoch:[0/2](521600/4588595) loss:2.688 lr:0.0000100 epoch_Time:25811.0min: [2024-01-05 00:50:08,313][model8_pretrain.py][INFO] Epoch:[0/2](521600/4588595) loss:2.720 lr:0.0000100 epoch_Time:25811.0min: [2024-01-05 00:50:08,313][model8_pretrain.py][INFO] Epoch:[0/2](521600/4588595) loss:2.046 lr:0.0000100 epoch_Time:25811.0min: [2024-01-05 00:50:08,313][model8_pretrain.py][INFO] Epoch:[0/2](521600/4588595) loss:2.947 lr:0.0000100 epoch_Time:25811.0min: [2024-01-05 00:50:08,313][model8_pretrain.py][INFO] Epoch:[0/2](521600/4588595) loss:3.237 lr:0.0000100 epoch_Time:25811.0min: [2024-01-05 00:50:08,313][model8_pretrain.py][INFO] Epoch:[0/2](521600/4588595) loss:2.660 lr:0.0000100 epoch_Time:25811.0min: [2024-01-05 00:50:45,240][model8_pretrain.py][INFO] Epoch:[0/2](521700/4588595) loss:2.799 lr:0.0000100 epoch_Time:25811.0min: [2024-01-05 00:50:45,240][model8_pretrain.py][INFO] Epoch:[0/2](521700/4588595) loss:2.420 lr:0.0000100 
epoch_Time:25811.0min: [2024-01-05 00:50:45,240][model8_pretrain.py][INFO] Epoch:[0/2](521700/4588595) loss:3.253 lr:0.0000100 epoch_Time:25811.0min: [2024-01-05 00:50:45,240][model8_pretrain.py][INFO] Epoch:[0/2](521700/4588595) loss:3.298 lr:0.0000100 epoch_Time:25811.0min: [2024-01-05 00:50:45,240][model8_pretrain.py][INFO] Epoch:[0/2](521700/4588595) loss:2.467 lr:0.0000100 epoch_Time:25811.0min: [2024-01-05 00:50:45,240][model8_pretrain.py][INFO] Epoch:[0/2](521700/4588595) loss:2.775 lr:0.0000100 epoch_Time:25811.0min: [2024-01-05 00:50:45,240][model8_pretrain.py][INFO] Epoch:[0/2](521700/4588595) loss:3.065 lr:0.0000100 epoch_Time:25811.0min: [2024-01-05 00:50:45,240][model8_pretrain.py][INFO] Epoch:[0/2](521700/4588595) loss:2.561 lr:0.0000100 epoch_Time:25811.0min: [2024-01-05 00:51:22,170][model8_pretrain.py][INFO] Epoch:[0/2](521800/4588595) loss:3.453 lr:0.0000100 epoch_Time:25810.0min: [2024-01-05 00:51:22,170][model8_pretrain.py][INFO] Epoch:[0/2](521800/4588595) loss:2.705 lr:0.0000100 epoch_Time:25810.0min: [2024-01-05 00:51:22,170][model8_pretrain.py][INFO] Epoch:[0/2](521800/4588595) loss:2.494 lr:0.0000100 epoch_Time:25810.0min: [2024-01-05 00:51:22,170][model8_pretrain.py][INFO] Epoch:[0/2](521800/4588595) loss:3.140 lr:0.0000100 epoch_Time:25810.0min: [2024-01-05 00:51:22,170][model8_pretrain.py][INFO] Epoch:[0/2](521800/4588595) loss:2.724 lr:0.0000100 epoch_Time:25810.0min: [2024-01-05 00:51:22,170][model8_pretrain.py][INFO] Epoch:[0/2](521800/4588595) loss:3.250 lr:0.0000100 epoch_Time:25810.0min: [2024-01-05 00:51:22,170][model8_pretrain.py][INFO] Epoch:[0/2](521800/4588595) loss:3.204 lr:0.0000100 epoch_Time:25810.0min: [2024-01-05 00:51:22,170][model8_pretrain.py][INFO] Epoch:[0/2](521800/4588595) loss:2.869 lr:0.0000100 epoch_Time:25810.0min: [2024-01-05 00:51:59,099][model8_pretrain.py][INFO] Epoch:[0/2](521900/4588595) loss:2.797 lr:0.0000100 epoch_Time:25809.0min: [2024-01-05 00:51:59,099][model8_pretrain.py][INFO] 
Epoch:[0/2](521900/4588595) loss:2.804 lr:0.0000100 epoch_Time:25809.0min: [2024-01-05 00:51:59,099][model8_pretrain.py][INFO] Epoch:[0/2](521900/4588595) loss:2.603 lr:0.0000100 epoch_Time:25809.0min: [2024-01-05 00:51:59,099][model8_pretrain.py][INFO] Epoch:[0/2](521900/4588595) loss:3.216 lr:0.0000100 epoch_Time:25809.0min: [2024-01-05 00:51:59,099][model8_pretrain.py][INFO] Epoch:[0/2](521900/4588595) loss:2.740 lr:0.0000100 epoch_Time:25809.0min: [2024-01-05 00:51:59,099][model8_pretrain.py][INFO] Epoch:[0/2](521900/4588595) loss:2.895 lr:0.0000100 epoch_Time:25809.0min: [2024-01-05 00:51:59,099][model8_pretrain.py][INFO] Epoch:[0/2](521900/4588595) loss:2.684 lr:0.0000100 epoch_Time:25809.0min: [2024-01-05 00:51:59,099][model8_pretrain.py][INFO] Epoch:[0/2](521900/4588595) loss:2.669 lr:0.0000100 epoch_Time:25809.0min: [2024-01-05 00:52:36,027][model8_pretrain.py][INFO] Epoch:[0/2](522000/4588595) loss:3.171 lr:0.0000100 epoch_Time:25808.0min: [2024-01-05 00:52:36,027][model8_pretrain.py][INFO] Epoch:[0/2](522000/4588595) loss:2.954 lr:0.0000100 epoch_Time:25808.0min: [2024-01-05 00:52:36,027][model8_pretrain.py][INFO] Epoch:[0/2](522000/4588595) loss:2.809 lr:0.0000100 epoch_Time:25808.0min: [2024-01-05 00:52:36,027][model8_pretrain.py][INFO] Epoch:[0/2](522000/4588595) loss:2.615 lr:0.0000100 epoch_Time:25808.0min: [2024-01-05 00:52:36,027][model8_pretrain.py][INFO] Epoch:[0/2](522000/4588595) loss:2.982 lr:0.0000100 epoch_Time:25808.0min: [2024-01-05 00:52:36,027][model8_pretrain.py][INFO] Epoch:[0/2](522000/4588595) loss:3.239 lr:0.0000100 epoch_Time:25808.0min: [2024-01-05 00:52:36,028][model8_pretrain.py][INFO] Epoch:[0/2](522000/4588595) loss:2.645 lr:0.0000100 epoch_Time:25808.0min: [2024-01-05 00:52:36,028][model8_pretrain.py][INFO] Epoch:[0/2](522000/4588595) loss:2.587 lr:0.0000100 epoch_Time:25808.0min: [2024-01-05 00:53:12,948][model8_pretrain.py][INFO] Epoch:[0/2](522100/4588595) loss:2.874 lr:0.0000100 epoch_Time:25807.0min: [2024-01-05 
00:53:12,948][model8_pretrain.py][INFO] Epoch:[0/2](522100/4588595) loss:3.376 lr:0.0000100 epoch_Time:25807.0min: [2024-01-05 00:53:12,949][model8_pretrain.py][INFO] Epoch:[0/2](522100/4588595) loss:2.654 lr:0.0000100 epoch_Time:25807.0min: [2024-01-05 00:53:12,949][model8_pretrain.py][INFO] Epoch:[0/2](522100/4588595) loss:2.248 lr:0.0000100 epoch_Time:25807.0min: [2024-01-05 00:53:12,949][model8_pretrain.py][INFO] Epoch:[0/2](522100/4588595) loss:2.957 lr:0.0000100 epoch_Time:25807.0min: [2024-01-05 00:53:12,949][model8_pretrain.py][INFO] Epoch:[0/2](522100/4588595) loss:3.263 lr:0.0000100 epoch_Time:25807.0min: [2024-01-05 00:53:12,949][model8_pretrain.py][INFO] Epoch:[0/2](522100/4588595) loss:2.710 lr:0.0000100 epoch_Time:25807.0min: [2024-01-05 00:53:12,949][model8_pretrain.py][INFO] Epoch:[0/2](522100/4588595) loss:3.416 lr:0.0000100 epoch_Time:25807.0min: [2024-01-05 00:54:00,077][model8_pretrain.py][INFO] Epoch:[0/2](522200/4588595) loss:2.808 lr:0.0000100 epoch_Time:25808.0min: [2024-01-05 00:54:00,077][model8_pretrain.py][INFO] Epoch:[0/2](522200/4588595) loss:2.597 lr:0.0000100 epoch_Time:25808.0min: [2024-01-05 00:54:00,077][model8_pretrain.py][INFO] Epoch:[0/2](522200/4588595) loss:2.990 lr:0.0000100 epoch_Time:25808.0min: [2024-01-05 00:54:00,077][model8_pretrain.py][INFO] Epoch:[0/2](522200/4588595) loss:3.107 lr:0.0000100 epoch_Time:25808.0min: [2024-01-05 00:54:00,077][model8_pretrain.py][INFO] Epoch:[0/2](522200/4588595) loss:3.089 lr:0.0000100 epoch_Time:25808.0min: [2024-01-05 00:54:00,077][model8_pretrain.py][INFO] Epoch:[0/2](522200/4588595) loss:2.628 lr:0.0000100 epoch_Time:25808.0min: [2024-01-05 00:54:00,077][model8_pretrain.py][INFO] Epoch:[0/2](522200/4588595) loss:2.728 lr:0.0000100 epoch_Time:25808.0min: [2024-01-05 00:54:00,077][model8_pretrain.py][INFO] Epoch:[0/2](522200/4588595) loss:3.141 lr:0.0000100 epoch_Time:25808.0min: [2024-01-05 00:54:37,010][model8_pretrain.py][INFO] Epoch:[0/2](522300/4588595) loss:2.698 lr:0.0000100 
epoch_Time:25807.0min: [2024-01-05 00:54:37,010][model8_pretrain.py][INFO] Epoch:[0/2](522300/4588595) loss:3.059 lr:0.0000100 epoch_Time:25807.0min: [2024-01-05 00:54:37,010][model8_pretrain.py][INFO] Epoch:[0/2](522300/4588595) loss:2.825 lr:0.0000100 epoch_Time:25807.0min: [2024-01-05 00:54:37,010][model8_pretrain.py][INFO] Epoch:[0/2](522300/4588595) loss:3.100 lr:0.0000100 epoch_Time:25807.0min: [2024-01-05 00:54:37,010][model8_pretrain.py][INFO] Epoch:[0/2](522300/4588595) loss:2.647 lr:0.0000100 epoch_Time:25807.0min: [2024-01-05 00:54:37,010][model8_pretrain.py][INFO] Epoch:[0/2](522300/4588595) loss:2.813 lr:0.0000100 epoch_Time:25807.0min: [2024-01-05 00:54:37,010][model8_pretrain.py][INFO] Epoch:[0/2](522300/4588595) loss:2.861 lr:0.0000100 epoch_Time:25807.0min: [2024-01-05 00:54:37,011][model8_pretrain.py][INFO] Epoch:[0/2](522300/4588595) loss:2.813 lr:0.0000100 epoch_Time:25807.0min: [2024-01-05 00:55:13,945][model8_pretrain.py][INFO] Epoch:[0/2](522400/4588595) loss:2.962 lr:0.0000100 epoch_Time:25806.0min: [2024-01-05 00:55:13,944][model8_pretrain.py][INFO] Epoch:[0/2](522400/4588595) loss:2.174 lr:0.0000100 epoch_Time:25806.0min: [2024-01-05 00:55:13,945][model8_pretrain.py][INFO] Epoch:[0/2](522400/4588595) loss:3.490 lr:0.0000100 epoch_Time:25806.0min: [2024-01-05 00:55:13,945][model8_pretrain.py][INFO] Epoch:[0/2](522400/4588595) loss:3.283 lr:0.0000100 epoch_Time:25806.0min: [2024-01-05 00:55:13,945][model8_pretrain.py][INFO] Epoch:[0/2](522400/4588595) loss:2.981 lr:0.0000100 epoch_Time:25806.0min: [2024-01-05 00:55:13,945][model8_pretrain.py][INFO] Epoch:[0/2](522400/4588595) loss:2.776 lr:0.0000100 epoch_Time:25806.0min: [2024-01-05 00:55:13,945][model8_pretrain.py][INFO] Epoch:[0/2](522400/4588595) loss:2.812 lr:0.0000100 epoch_Time:25806.0min: [2024-01-05 00:55:13,946][model8_pretrain.py][INFO] Epoch:[0/2](522400/4588595) loss:2.565 lr:0.0000100 epoch_Time:25806.0min: [2024-01-05 00:55:50,882][model8_pretrain.py][INFO] 
Epoch:[0/2](522500/4588595) loss:2.895 lr:0.0000100 epoch_Time:25805.0min: [2024-01-05 00:55:50,882][model8_pretrain.py][INFO] Epoch:[0/2](522500/4588595) loss:2.766 lr:0.0000100 epoch_Time:25805.0min: [2024-01-05 00:55:50,882][model8_pretrain.py][INFO] Epoch:[0/2](522500/4588595) loss:3.037 lr:0.0000100 epoch_Time:25805.0min: [2024-01-05 00:55:50,882][model8_pretrain.py][INFO] Epoch:[0/2](522500/4588595) loss:3.011 lr:0.0000100 epoch_Time:25805.0min: [2024-01-05 00:55:50,882][model8_pretrain.py][INFO] Epoch:[0/2](522500/4588595) loss:3.017 lr:0.0000100 epoch_Time:25805.0min: [2024-01-05 00:55:50,882][model8_pretrain.py][INFO] Epoch:[0/2](522500/4588595) loss:2.378 lr:0.0000100 epoch_Time:25805.0min: [2024-01-05 00:55:50,882][model8_pretrain.py][INFO] Epoch:[0/2](522500/4588595) loss:2.419 lr:0.0000100 epoch_Time:25805.0min: [2024-01-05 00:55:50,882][model8_pretrain.py][INFO] Epoch:[0/2](522500/4588595) loss:3.024 lr:0.0000100 epoch_Time:25805.0min: [2024-01-05 00:56:27,854][model8_pretrain.py][INFO] Epoch:[0/2](522600/4588595) loss:2.770 lr:0.0000100 epoch_Time:25805.0min: [2024-01-05 00:56:27,854][model8_pretrain.py][INFO] Epoch:[0/2](522600/4588595) loss:3.185 lr:0.0000100 epoch_Time:25805.0min: [2024-01-05 00:56:27,854][model8_pretrain.py][INFO] Epoch:[0/2](522600/4588595) loss:2.857 lr:0.0000100 epoch_Time:25805.0min: [2024-01-05 00:56:27,854][model8_pretrain.py][INFO] Epoch:[0/2](522600/4588595) loss:2.152 lr:0.0000100 epoch_Time:25805.0min: [2024-01-05 00:56:27,854][model8_pretrain.py][INFO] Epoch:[0/2](522600/4588595) loss:3.057 lr:0.0000100 epoch_Time:25805.0min: [2024-01-05 00:56:27,854][model8_pretrain.py][INFO] Epoch:[0/2](522600/4588595) loss:2.658 lr:0.0000100 epoch_Time:25805.0min: [2024-01-05 00:56:27,854][model8_pretrain.py][INFO] Epoch:[0/2](522600/4588595) loss:2.848 lr:0.0000100 epoch_Time:25805.0min: [2024-01-05 00:56:27,855][model8_pretrain.py][INFO] Epoch:[0/2](522600/4588595) loss:3.075 lr:0.0000100 epoch_Time:25805.0min: [2024-01-05 
00:57:04,812][model8_pretrain.py][INFO] Epoch:[0/2](522700/4588595) loss:3.115 lr:0.0000100 epoch_Time:25804.0min: [2024-01-05 00:57:04,812][model8_pretrain.py][INFO] Epoch:[0/2](522700/4588595) loss:2.154 lr:0.0000100 epoch_Time:25804.0min: [2024-01-05 00:57:04,812][model8_pretrain.py][INFO] Epoch:[0/2](522700/4588595) loss:3.335 lr:0.0000100 epoch_Time:25804.0min: [2024-01-05 00:57:04,812][model8_pretrain.py][INFO] Epoch:[0/2](522700/4588595) loss:2.581 lr:0.0000100 epoch_Time:25804.0min: [2024-01-05 00:57:04,812][model8_pretrain.py][INFO] Epoch:[0/2](522700/4588595) loss:2.456 lr:0.0000100 epoch_Time:25804.0min: [2024-01-05 00:57:04,812][model8_pretrain.py][INFO] Epoch:[0/2](522700/4588595) loss:2.923 lr:0.0000100 epoch_Time:25804.0min: [2024-01-05 00:57:04,812][model8_pretrain.py][INFO] Epoch:[0/2](522700/4588595) loss:2.976 lr:0.0000100 epoch_Time:25804.0min: [2024-01-05 00:57:04,812][model8_pretrain.py][INFO] Epoch:[0/2](522700/4588595) loss:2.401 lr:0.0000100 epoch_Time:25804.0min: [2024-01-05 00:57:41,771][model8_pretrain.py][INFO] Epoch:[0/2](522800/4588595) loss:2.483 lr:0.0000100 epoch_Time:25804.0min: [2024-01-05 00:57:41,771][model8_pretrain.py][INFO] Epoch:[0/2](522800/4588595) loss:2.814 lr:0.0000100 epoch_Time:25804.0min: [2024-01-05 00:57:41,771][model8_pretrain.py][INFO] Epoch:[0/2](522800/4588595) loss:2.698 lr:0.0000100 epoch_Time:25804.0min: [2024-01-05 00:57:41,771][model8_pretrain.py][INFO] Epoch:[0/2](522800/4588595) loss:3.225 lr:0.0000100 epoch_Time:25804.0min: [2024-01-05 00:57:41,771][model8_pretrain.py][INFO] Epoch:[0/2](522800/4588595) loss:3.196 lr:0.0000100 epoch_Time:25804.0min: [2024-01-05 00:57:41,771][model8_pretrain.py][INFO] Epoch:[0/2](522800/4588595) loss:2.946 lr:0.0000100 epoch_Time:25804.0min: [2024-01-05 00:57:41,771][model8_pretrain.py][INFO] Epoch:[0/2](522800/4588595) loss:2.887 lr:0.0000100 epoch_Time:25804.0min: [2024-01-05 00:57:41,771][model8_pretrain.py][INFO] Epoch:[0/2](522800/4588595) loss:3.010 lr:0.0000100 
epoch_Time:25804.0min: [2024-01-05 00:58:18,738][model8_pretrain.py][INFO] Epoch:[0/2](522900/4588595) loss:3.330 lr:0.0000100 epoch_Time:25802.0min: [2024-01-05 00:58:18,738][model8_pretrain.py][INFO] Epoch:[0/2](522900/4588595) loss:2.531 lr:0.0000100 epoch_Time:25802.0min: [2024-01-05 00:58:18,739][model8_pretrain.py][INFO] Epoch:[0/2](522900/4588595) loss:3.307 lr:0.0000100 epoch_Time:25802.0min: [2024-01-05 00:58:18,739][model8_pretrain.py][INFO] Epoch:[0/2](522900/4588595) loss:2.315 lr:0.0000100 epoch_Time:25802.0min: [2024-01-05 00:58:18,739][model8_pretrain.py][INFO] Epoch:[0/2](522900/4588595) loss:2.902 lr:0.0000100 epoch_Time:25802.0min: [2024-01-05 00:58:18,739][model8_pretrain.py][INFO] Epoch:[0/2](522900/4588595) loss:2.568 lr:0.0000100 epoch_Time:25802.0min: [2024-01-05 00:58:18,739][model8_pretrain.py][INFO] Epoch:[0/2](522900/4588595) loss:2.748 lr:0.0000100 epoch_Time:25802.0min: [2024-01-05 00:58:18,739][model8_pretrain.py][INFO] Epoch:[0/2](522900/4588595) loss:3.212 lr:0.0000100 epoch_Time:25802.0min: [2024-01-05 00:59:06,111][model8_pretrain.py][INFO] Epoch:[0/2](523000/4588595) loss:3.136 lr:0.0000100 epoch_Time:25803.0min: [2024-01-05 00:59:06,111][model8_pretrain.py][INFO] Epoch:[0/2](523000/4588595) loss:3.115 lr:0.0000100 epoch_Time:25803.0min: [2024-01-05 00:59:06,111][model8_pretrain.py][INFO] Epoch:[0/2](523000/4588595) loss:2.900 lr:0.0000100 epoch_Time:25803.0min: [2024-01-05 00:59:06,111][model8_pretrain.py][INFO] Epoch:[0/2](523000/4588595) loss:2.723 lr:0.0000100 epoch_Time:25803.0min: [2024-01-05 00:59:06,111][model8_pretrain.py][INFO] Epoch:[0/2](523000/4588595) loss:3.070 lr:0.0000100 epoch_Time:25803.0min: [2024-01-05 00:59:06,111][model8_pretrain.py][INFO] Epoch:[0/2](523000/4588595) loss:2.927 lr:0.0000100 epoch_Time:25803.0min: [2024-01-05 00:59:06,111][model8_pretrain.py][INFO] Epoch:[0/2](523000/4588595) loss:2.513 lr:0.0000100 epoch_Time:25803.0min: [2024-01-05 00:59:06,111][model8_pretrain.py][INFO] 
Epoch:[0/2](523000/4588595) loss:2.786 lr:0.0000100 epoch_Time:25803.0min: [2024-01-05 00:59:43,036][model8_pretrain.py][INFO] Epoch:[0/2](523100/4588595) loss:3.202 lr:0.0000100 epoch_Time:25803.0min: [2024-01-05 00:59:43,036][model8_pretrain.py][INFO] Epoch:[0/2](523100/4588595) loss:2.531 lr:0.0000100 epoch_Time:25803.0min: [2024-01-05 00:59:43,036][model8_pretrain.py][INFO] Epoch:[0/2](523100/4588595) loss:3.489 lr:0.0000100 epoch_Time:25803.0min: [2024-01-05 00:59:43,036][model8_pretrain.py][INFO] Epoch:[0/2](523100/4588595) loss:3.234 lr:0.0000100 epoch_Time:25803.0min: [2024-01-05 00:59:43,036][model8_pretrain.py][INFO] Epoch:[0/2](523100/4588595) loss:2.845 lr:0.0000100 epoch_Time:25803.0min: [2024-01-05 00:59:43,036][model8_pretrain.py][INFO] Epoch:[0/2](523100/4588595) loss:2.733 lr:0.0000100 epoch_Time:25803.0min: [2024-01-05 00:59:43,036][model8_pretrain.py][INFO] Epoch:[0/2](523100/4588595) loss:3.105 lr:0.0000100 epoch_Time:25803.0min: [2024-01-05 00:59:43,036][model8_pretrain.py][INFO] Epoch:[0/2](523100/4588595) loss:3.025 lr:0.0000100 epoch_Time:25803.0min: [2024-01-05 01:00:19,971][model8_pretrain.py][INFO] Epoch:[0/2](523200/4588595) loss:2.893 lr:0.0000100 epoch_Time:25801.0min: [2024-01-05 01:00:19,971][model8_pretrain.py][INFO] Epoch:[0/2](523200/4588595) loss:2.749 lr:0.0000100 epoch_Time:25801.0min: [2024-01-05 01:00:19,971][model8_pretrain.py][INFO] Epoch:[0/2](523200/4588595) loss:2.309 lr:0.0000100 epoch_Time:25801.0min: [2024-01-05 01:00:19,971][model8_pretrain.py][INFO] Epoch:[0/2](523200/4588595) loss:2.571 lr:0.0000100 epoch_Time:25801.0min: [2024-01-05 01:00:19,972][model8_pretrain.py][INFO] Epoch:[0/2](523200/4588595) loss:2.905 lr:0.0000100 epoch_Time:25801.0min: [2024-01-05 01:00:19,972][model8_pretrain.py][INFO] Epoch:[0/2](523200/4588595) loss:3.093 lr:0.0000100 epoch_Time:25801.0min: [2024-01-05 01:00:19,972][model8_pretrain.py][INFO] Epoch:[0/2](523200/4588595) loss:2.946 lr:0.0000100 epoch_Time:25801.0min: [2024-01-05 
01:00:19,972][model8_pretrain.py][INFO] Epoch:[0/2](523200/4588595) loss:2.158 lr:0.0000100 epoch_Time:25801.0min: [2024-01-05 01:00:56,924][model8_pretrain.py][INFO] Epoch:[0/2](523300/4588595) loss:2.880 lr:0.0000100 epoch_Time:25800.0min: [2024-01-05 01:00:56,924][model8_pretrain.py][INFO] Epoch:[0/2](523300/4588595) loss:2.711 lr:0.0000100 epoch_Time:25800.0min: [2024-01-05 01:00:56,924][model8_pretrain.py][INFO] Epoch:[0/2](523300/4588595) loss:2.771 lr:0.0000100 epoch_Time:25800.0min: [2024-01-05 01:00:56,924][model8_pretrain.py][INFO] Epoch:[0/2](523300/4588595) loss:3.188 lr:0.0000100 epoch_Time:25800.0min: [2024-01-05 01:00:56,924][model8_pretrain.py][INFO] Epoch:[0/2](523300/4588595) loss:2.766 lr:0.0000100 epoch_Time:25800.0min: [2024-01-05 01:00:56,925][model8_pretrain.py][INFO] Epoch:[0/2](523300/4588595) loss:1.830 lr:0.0000100 epoch_Time:25800.0min: [2024-01-05 01:00:56,925][model8_pretrain.py][INFO] Epoch:[0/2](523300/4588595) loss:2.855 lr:0.0000100 epoch_Time:25800.0min: [2024-01-05 01:00:56,925][model8_pretrain.py][INFO] Epoch:[0/2](523300/4588595) loss:3.103 lr:0.0000100 epoch_Time:25800.0min: [2024-01-05 01:01:33,896][model8_pretrain.py][INFO] Epoch:[0/2](523400/4588595) loss:3.043 lr:0.0000100 epoch_Time:25800.0min: [2024-01-05 01:01:33,896][model8_pretrain.py][INFO] Epoch:[0/2](523400/4588595) loss:2.346 lr:0.0000100 epoch_Time:25800.0min: [2024-01-05 01:01:33,896][model8_pretrain.py][INFO] Epoch:[0/2](523400/4588595) loss:2.403 lr:0.0000100 epoch_Time:25800.0min: [2024-01-05 01:01:33,896][model8_pretrain.py][INFO] Epoch:[0/2](523400/4588595) loss:2.693 lr:0.0000100 epoch_Time:25800.0min: [2024-01-05 01:01:33,896][model8_pretrain.py][INFO] Epoch:[0/2](523400/4588595) loss:2.793 lr:0.0000100 epoch_Time:25800.0min: [2024-01-05 01:01:33,896][model8_pretrain.py][INFO] Epoch:[0/2](523400/4588595) loss:2.946 lr:0.0000100 epoch_Time:25800.0min: [2024-01-05 01:01:33,896][model8_pretrain.py][INFO] Epoch:[0/2](523400/4588595) loss:2.308 lr:0.0000100 
epoch_Time:25800.0min: [2024-01-05 01:01:33,896][model8_pretrain.py][INFO] Epoch:[0/2](523400/4588595) loss:2.926 lr:0.0000100 epoch_Time:25800.0min: [2024-01-05 01:02:10,860][model8_pretrain.py][INFO] Epoch:[0/2](523500/4588595) loss:2.848 lr:0.0000100 epoch_Time:25799.0min: [2024-01-05 01:02:10,860][model8_pretrain.py][INFO] Epoch:[0/2](523500/4588595) loss:2.787 lr:0.0000100 epoch_Time:25799.0min: [2024-01-05 01:02:10,860][model8_pretrain.py][INFO] Epoch:[0/2](523500/4588595) loss:2.651 lr:0.0000100 epoch_Time:25799.0min: [2024-01-05 01:02:10,860][model8_pretrain.py][INFO] Epoch:[0/2](523500/4588595) loss:2.590 lr:0.0000100 epoch_Time:25799.0min: [2024-01-05 01:02:10,860][model8_pretrain.py][INFO] Epoch:[0/2](523500/4588595) loss:2.628 lr:0.0000100 epoch_Time:25799.0min: [2024-01-05 01:02:10,860][model8_pretrain.py][INFO] Epoch:[0/2](523500/4588595) loss:2.928 lr:0.0000100 epoch_Time:25799.0min: [2024-01-05 01:02:10,860][model8_pretrain.py][INFO] Epoch:[0/2](523500/4588595) loss:3.425 lr:0.0000100 epoch_Time:25799.0min: [2024-01-05 01:02:10,860][model8_pretrain.py][INFO] Epoch:[0/2](523500/4588595) loss:3.063 lr:0.0000100 epoch_Time:25799.0min: [2024-01-05 01:02:47,796][model8_pretrain.py][INFO] Epoch:[0/2](523600/4588595) loss:2.956 lr:0.0000100 epoch_Time:25798.0min: [2024-01-05 01:02:47,796][model8_pretrain.py][INFO] Epoch:[0/2](523600/4588595) loss:3.435 lr:0.0000100 epoch_Time:25798.0min: [2024-01-05 01:02:47,796][model8_pretrain.py][INFO] Epoch:[0/2](523600/4588595) loss:3.413 lr:0.0000100 epoch_Time:25798.0min: [2024-01-05 01:02:47,796][model8_pretrain.py][INFO] Epoch:[0/2](523600/4588595) loss:2.955 lr:0.0000100 epoch_Time:25798.0min: [2024-01-05 01:02:47,796][model8_pretrain.py][INFO] Epoch:[0/2](523600/4588595) loss:2.341 lr:0.0000100 epoch_Time:25798.0min: [2024-01-05 01:02:47,796][model8_pretrain.py][INFO] Epoch:[0/2](523600/4588595) loss:2.987 lr:0.0000100 epoch_Time:25798.0min: [2024-01-05 01:02:47,796][model8_pretrain.py][INFO] 
Epoch:[0/2](523600/4588595) loss:2.970 lr:0.0000100 epoch_Time:25798.0min: [2024-01-05 01:02:47,796][model8_pretrain.py][INFO] Epoch:[0/2](523600/4588595) loss:3.027 lr:0.0000100 epoch_Time:25798.0min: [2024-01-05 01:03:24,731][model8_pretrain.py][INFO] Epoch:[0/2](523700/4588595) loss:2.762 lr:0.0000100 epoch_Time:25798.0min: [2024-01-05 01:03:24,731][model8_pretrain.py][INFO] Epoch:[0/2](523700/4588595) loss:3.388 lr:0.0000100 epoch_Time:25798.0min: [2024-01-05 01:03:24,731][model8_pretrain.py][INFO] Epoch:[0/2](523700/4588595) loss:2.831 lr:0.0000100 epoch_Time:25798.0min: [2024-01-05 01:03:24,731][model8_pretrain.py][INFO] Epoch:[0/2](523700/4588595) loss:2.959 lr:0.0000100 epoch_Time:25798.0min: [2024-01-05 01:03:24,731][model8_pretrain.py][INFO] Epoch:[0/2](523700/4588595) loss:2.171 lr:0.0000100 epoch_Time:25798.0min: [2024-01-05 01:03:24,731][model8_pretrain.py][INFO] Epoch:[0/2](523700/4588595) loss:3.216 lr:0.0000100 epoch_Time:25798.0min: [2024-01-05 01:03:24,731][model8_pretrain.py][INFO] Epoch:[0/2](523700/4588595) loss:2.728 lr:0.0000100 epoch_Time:25798.0min: [2024-01-05 01:03:24,731][model8_pretrain.py][INFO] Epoch:[0/2](523700/4588595) loss:2.406 lr:0.0000100 epoch_Time:25798.0min: [2024-01-05 01:04:12,122][model8_pretrain.py][INFO] Epoch:[0/2](523800/4588595) loss:2.926 lr:0.0000100 epoch_Time:25798.0min: [2024-01-05 01:04:12,122][model8_pretrain.py][INFO] Epoch:[0/2](523800/4588595) loss:1.945 lr:0.0000100 epoch_Time:25798.0min: [2024-01-05 01:04:12,122][model8_pretrain.py][INFO] Epoch:[0/2](523800/4588595) loss:2.109 lr:0.0000100 epoch_Time:25798.0min: [2024-01-05 01:04:12,122][model8_pretrain.py][INFO] Epoch:[0/2](523800/4588595) loss:2.763 lr:0.0000100 epoch_Time:25798.0min: [2024-01-05 01:04:12,122][model8_pretrain.py][INFO] Epoch:[0/2](523800/4588595) loss:2.391 lr:0.0000100 epoch_Time:25798.0min: [2024-01-05 01:04:12,122][model8_pretrain.py][INFO] Epoch:[0/2](523800/4588595) loss:3.181 lr:0.0000100 epoch_Time:25798.0min: [2024-01-05 
01:04:12,122][model8_pretrain.py][INFO] Epoch:[0/2](523800/4588595) loss:3.148 lr:0.0000100 epoch_Time:25798.0min: [2024-01-05 01:04:12,122][model8_pretrain.py][INFO] Epoch:[0/2](523800/4588595) loss:2.479 lr:0.0000100 epoch_Time:25798.0min: [2024-01-05 01:04:49,057][model8_pretrain.py][INFO] Epoch:[0/2](523900/4588595) loss:3.307 lr:0.0000100 epoch_Time:25797.0min: [2024-01-05 01:04:49,057][model8_pretrain.py][INFO] Epoch:[0/2](523900/4588595) loss:2.659 lr:0.0000100 epoch_Time:25797.0min: [2024-01-05 01:04:49,057][model8_pretrain.py][INFO] Epoch:[0/2](523900/4588595) loss:2.903 lr:0.0000100 epoch_Time:25797.0min: [2024-01-05 01:04:49,057][model8_pretrain.py][INFO] Epoch:[0/2](523900/4588595) loss:3.257 lr:0.0000100 epoch_Time:25797.0min: [2024-01-05 01:04:49,057][model8_pretrain.py][INFO] Epoch:[0/2](523900/4588595) loss:2.505 lr:0.0000100 epoch_Time:25797.0min: [2024-01-05 01:04:49,057][model8_pretrain.py][INFO] Epoch:[0/2](523900/4588595) loss:3.038 lr:0.0000100 epoch_Time:25797.0min: [2024-01-05 01:04:49,057][model8_pretrain.py][INFO] Epoch:[0/2](523900/4588595) loss:3.254 lr:0.0000100 epoch_Time:25797.0min: [2024-01-05 01:04:49,057][model8_pretrain.py][INFO] Epoch:[0/2](523900/4588595) loss:2.745 lr:0.0000100 epoch_Time:25797.0min: [2024-01-05 01:05:26,001][model8_pretrain.py][INFO] Epoch:[0/2](524000/4588595) loss:2.885 lr:0.0000100 epoch_Time:25797.0min: [2024-01-05 01:05:26,001][model8_pretrain.py][INFO] Epoch:[0/2](524000/4588595) loss:2.739 lr:0.0000100 epoch_Time:25797.0min: [2024-01-05 01:05:26,001][model8_pretrain.py][INFO] Epoch:[0/2](524000/4588595) loss:2.584 lr:0.0000100 epoch_Time:25797.0min: [2024-01-05 01:05:26,001][model8_pretrain.py][INFO] Epoch:[0/2](524000/4588595) loss:2.849 lr:0.0000100 epoch_Time:25797.0min: [2024-01-05 01:05:26,001][model8_pretrain.py][INFO] Epoch:[0/2](524000/4588595) loss:2.977 lr:0.0000100 epoch_Time:25797.0min: [2024-01-05 01:05:26,001][model8_pretrain.py][INFO] Epoch:[0/2](524000/4588595) loss:2.675 lr:0.0000100 
epoch_Time:25797.0min: [2024-01-05 01:05:26,001][model8_pretrain.py][INFO] Epoch:[0/2](524000/4588595) loss:2.396 lr:0.0000100 epoch_Time:25797.0min: [2024-01-05 01:05:26,001][model8_pretrain.py][INFO] Epoch:[0/2](524000/4588595) loss:3.116 lr:0.0000100 epoch_Time:25797.0min: [2024-01-05 01:06:02,950][model8_pretrain.py][INFO] Epoch:[0/2](524100/4588595) loss:2.577 lr:0.0000100 epoch_Time:25795.0min: [2024-01-05 01:06:02,950][model8_pretrain.py][INFO] Epoch:[0/2](524100/4588595) loss:2.642 lr:0.0000100 epoch_Time:25795.0min: [2024-01-05 01:06:02,950][model8_pretrain.py][INFO] Epoch:[0/2](524100/4588595) loss:2.666 lr:0.0000100 epoch_Time:25795.0min: [2024-01-05 01:06:02,950][model8_pretrain.py][INFO] Epoch:[0/2](524100/4588595) loss:3.148 lr:0.0000100 epoch_Time:25795.0min: [2024-01-05 01:06:02,950][model8_pretrain.py][INFO] Epoch:[0/2](524100/4588595) loss:2.819 lr:0.0000100 epoch_Time:25795.0min: [2024-01-05 01:06:02,950][model8_pretrain.py][INFO] Epoch:[0/2](524100/4588595) loss:2.419 lr:0.0000100 epoch_Time:25795.0min: [2024-01-05 01:06:02,950][model8_pretrain.py][INFO] Epoch:[0/2](524100/4588595) loss:2.463 lr:0.0000100 epoch_Time:25795.0min: [2024-01-05 01:06:02,950][model8_pretrain.py][INFO] Epoch:[0/2](524100/4588595) loss:2.837 lr:0.0000100 epoch_Time:25795.0min: [2024-01-05 01:06:39,911][model8_pretrain.py][INFO] Epoch:[0/2](524200/4588595) loss:3.038 lr:0.0000100 epoch_Time:25795.0min: [2024-01-05 01:06:39,911][model8_pretrain.py][INFO] Epoch:[0/2](524200/4588595) loss:2.780 lr:0.0000100 epoch_Time:25795.0min: [2024-01-05 01:06:39,911][model8_pretrain.py][INFO] Epoch:[0/2](524200/4588595) loss:2.365 lr:0.0000100 epoch_Time:25795.0min: [2024-01-05 01:06:39,911][model8_pretrain.py][INFO] Epoch:[0/2](524200/4588595) loss:2.963 lr:0.0000100 epoch_Time:25795.0min: [2024-01-05 01:06:39,911][model8_pretrain.py][INFO] Epoch:[0/2](524200/4588595) loss:3.230 lr:0.0000100 epoch_Time:25795.0min: [2024-01-05 01:06:39,911][model8_pretrain.py][INFO] 
Epoch:[0/2](524200/4588595) loss:3.284 lr:0.0000100 epoch_Time:25795.0min: [2024-01-05 01:06:39,911][model8_pretrain.py][INFO] Epoch:[0/2](524200/4588595) loss:2.395 lr:0.0000100 epoch_Time:25795.0min: [2024-01-05 01:06:39,911][model8_pretrain.py][INFO] Epoch:[0/2](524200/4588595) loss:3.132 lr:0.0000100 epoch_Time:25795.0min: [2024-01-05 01:07:16,822][model8_pretrain.py][INFO] Epoch:[0/2](524300/4588595) loss:2.594 lr:0.0000100 epoch_Time:25794.0min: [2024-01-05 01:07:16,822][model8_pretrain.py][INFO] Epoch:[0/2](524300/4588595) loss:3.163 lr:0.0000100 epoch_Time:25794.0min: [2024-01-05 01:07:16,822][model8_pretrain.py][INFO] Epoch:[0/2](524300/4588595) loss:2.708 lr:0.0000100 epoch_Time:25794.0min: [2024-01-05 01:07:16,822][model8_pretrain.py][INFO] Epoch:[0/2](524300/4588595) loss:2.291 lr:0.0000100 epoch_Time:25794.0min: [2024-01-05 01:07:16,822][model8_pretrain.py][INFO] Epoch:[0/2](524300/4588595) loss:2.420 lr:0.0000100 epoch_Time:25794.0min: [2024-01-05 01:07:16,822][model8_pretrain.py][INFO] Epoch:[0/2](524300/4588595) loss:2.912 lr:0.0000100 epoch_Time:25794.0min: [2024-01-05 01:07:16,822][model8_pretrain.py][INFO] Epoch:[0/2](524300/4588595) loss:2.929 lr:0.0000100 epoch_Time:25794.0min: [2024-01-05 01:07:16,822][model8_pretrain.py][INFO] Epoch:[0/2](524300/4588595) loss:2.376 lr:0.0000100 epoch_Time:25794.0min: [2024-01-05 01:07:53,753][model8_pretrain.py][INFO] Epoch:[0/2](524400/4588595) loss:2.093 lr:0.0000100 epoch_Time:25793.0min: [2024-01-05 01:07:53,753][model8_pretrain.py][INFO] Epoch:[0/2](524400/4588595) loss:2.739 lr:0.0000100 epoch_Time:25793.0min: [2024-01-05 01:07:53,753][model8_pretrain.py][INFO] Epoch:[0/2](524400/4588595) loss:2.623 lr:0.0000100 epoch_Time:25793.0min: [2024-01-05 01:07:53,753][model8_pretrain.py][INFO] Epoch:[0/2](524400/4588595) loss:2.898 lr:0.0000100 epoch_Time:25793.0min: [2024-01-05 01:07:53,753][model8_pretrain.py][INFO] Epoch:[0/2](524400/4588595) loss:3.717 lr:0.0000100 epoch_Time:25793.0min: [2024-01-05 
01:07:53,753][model8_pretrain.py][INFO] Epoch:[0/2](524400/4588595) loss:2.874 lr:0.0000100 epoch_Time:25793.0min: [2024-01-05 01:07:53,753][model8_pretrain.py][INFO] Epoch:[0/2](524400/4588595) loss:3.063 lr:0.0000100 epoch_Time:25793.0min: [2024-01-05 01:07:53,753][model8_pretrain.py][INFO] Epoch:[0/2](524400/4588595) loss:2.862 lr:0.0000100 epoch_Time:25793.0min: [2024-01-05 01:08:30,689][model8_pretrain.py][INFO] Epoch:[0/2](524500/4588595) loss:2.728 lr:0.0000100 epoch_Time:25793.0min: [2024-01-05 01:08:30,689][model8_pretrain.py][INFO] Epoch:[0/2](524500/4588595) loss:2.718 lr:0.0000100 epoch_Time:25793.0min: [2024-01-05 01:08:30,689][model8_pretrain.py][INFO] Epoch:[0/2](524500/4588595) loss:2.319 lr:0.0000100 epoch_Time:25793.0min: [2024-01-05 01:08:30,689][model8_pretrain.py][INFO] Epoch:[0/2](524500/4588595) loss:2.950 lr:0.0000100 epoch_Time:25793.0min: [2024-01-05 01:08:30,689][model8_pretrain.py][INFO] Epoch:[0/2](524500/4588595) loss:3.292 lr:0.0000100 epoch_Time:25793.0min: [2024-01-05 01:08:30,689][model8_pretrain.py][INFO] Epoch:[0/2](524500/4588595) loss:2.724 lr:0.0000100 epoch_Time:25793.0min: [2024-01-05 01:08:30,689][model8_pretrain.py][INFO] Epoch:[0/2](524500/4588595) loss:2.669 lr:0.0000100 epoch_Time:25793.0min: [2024-01-05 01:08:30,689][model8_pretrain.py][INFO] Epoch:[0/2](524500/4588595) loss:3.018 lr:0.0000100 epoch_Time:25793.0min: [2024-01-05 01:09:18,063][model8_pretrain.py][INFO] Epoch:[0/2](524600/4588595) loss:3.061 lr:0.0000100 epoch_Time:25793.0min: [2024-01-05 01:09:18,063][model8_pretrain.py][INFO] Epoch:[0/2](524600/4588595) loss:2.322 lr:0.0000100 epoch_Time:25793.0min: [2024-01-05 01:09:18,063][model8_pretrain.py][INFO] Epoch:[0/2](524600/4588595) loss:2.812 lr:0.0000100 epoch_Time:25793.0min: [2024-01-05 01:09:18,063][model8_pretrain.py][INFO] Epoch:[0/2](524600/4588595) loss:2.808 lr:0.0000100 epoch_Time:25793.0min: [2024-01-05 01:09:18,063][model8_pretrain.py][INFO] Epoch:[0/2](524600/4588595) loss:3.120 lr:0.0000100 
epoch_Time:25793.0min: [2024-01-05 01:09:18,063][model8_pretrain.py][INFO] Epoch:[0/2](524600/4588595) loss:2.786 lr:0.0000100 epoch_Time:25793.0min: [2024-01-05 01:09:18,064][model8_pretrain.py][INFO] Epoch:[0/2](524600/4588595) loss:2.785 lr:0.0000100 epoch_Time:25793.0min: [2024-01-05 01:09:18,064][model8_pretrain.py][INFO] Epoch:[0/2](524600/4588595) loss:2.198 lr:0.0000100 epoch_Time:25793.0min: [2024-01-05 01:09:54,995][model8_pretrain.py][INFO] Epoch:[0/2](524700/4588595) loss:2.579 lr:0.0000100 epoch_Time:25792.0min: [2024-01-05 01:09:54,995][model8_pretrain.py][INFO] Epoch:[0/2](524700/4588595) loss:2.072 lr:0.0000100 epoch_Time:25792.0min: [2024-01-05 01:09:54,995][model8_pretrain.py][INFO] Epoch:[0/2](524700/4588595) loss:2.523 lr:0.0000100 epoch_Time:25792.0min: [2024-01-05 01:09:54,995][model8_pretrain.py][INFO] Epoch:[0/2](524700/4588595) loss:2.485 lr:0.0000100 epoch_Time:25792.0min: [2024-01-05 01:09:54,995][model8_pretrain.py][INFO] Epoch:[0/2](524700/4588595) loss:3.260 lr:0.0000100 epoch_Time:25792.0min: [2024-01-05 01:09:54,995][model8_pretrain.py][INFO] Epoch:[0/2](524700/4588595) loss:2.513 lr:0.0000100 epoch_Time:25792.0min: [2024-01-05 01:09:54,995][model8_pretrain.py][INFO] Epoch:[0/2](524700/4588595) loss:2.691 lr:0.0000100 epoch_Time:25792.0min: [2024-01-05 01:09:54,995][model8_pretrain.py][INFO] Epoch:[0/2](524700/4588595) loss:2.454 lr:0.0000100 epoch_Time:25792.0min: [2024-01-05 01:10:31,923][model8_pretrain.py][INFO] Epoch:[0/2](524800/4588595) loss:2.703 lr:0.0000100 epoch_Time:25792.0min: [2024-01-05 01:10:31,923][model8_pretrain.py][INFO] Epoch:[0/2](524800/4588595) loss:3.235 lr:0.0000100 epoch_Time:25792.0min: [2024-01-05 01:10:31,923][model8_pretrain.py][INFO] Epoch:[0/2](524800/4588595) loss:2.866 lr:0.0000100 epoch_Time:25792.0min: [2024-01-05 01:10:31,923][model8_pretrain.py][INFO] Epoch:[0/2](524800/4588595) loss:3.114 lr:0.0000100 epoch_Time:25792.0min: [2024-01-05 01:10:31,923][model8_pretrain.py][INFO] 
Epoch:[0/2](524800/4588595) loss:3.079 lr:0.0000100 epoch_Time:25792.0min: [2024-01-05 01:10:31,923][model8_pretrain.py][INFO] Epoch:[0/2](524800/4588595) loss:2.811 lr:0.0000100 epoch_Time:25792.0min: [2024-01-05 01:10:31,923][model8_pretrain.py][INFO] Epoch:[0/2](524800/4588595) loss:2.286 lr:0.0000100 epoch_Time:25792.0min: [2024-01-05 01:10:31,923][model8_pretrain.py][INFO] Epoch:[0/2](524800/4588595) loss:2.979 lr:0.0000100 epoch_Time:25792.0min: [2024-01-05 01:11:08,854][model8_pretrain.py][INFO] Epoch:[0/2](524900/4588595) loss:3.172 lr:0.0000100 epoch_Time:25791.0min: [2024-01-05 01:11:08,854][model8_pretrain.py][INFO] Epoch:[0/2](524900/4588595) loss:3.025 lr:0.0000100 epoch_Time:25791.0min: [2024-01-05 01:11:08,854][model8_pretrain.py][INFO] Epoch:[0/2](524900/4588595) loss:2.514 lr:0.0000100 epoch_Time:25791.0min: [2024-01-05 01:11:08,855][model8_pretrain.py][INFO] Epoch:[0/2](524900/4588595) loss:2.700 lr:0.0000100 epoch_Time:25791.0min: [2024-01-05 01:11:08,855][model8_pretrain.py][INFO] Epoch:[0/2](524900/4588595) loss:3.061 lr:0.0000100 epoch_Time:25791.0min: [2024-01-05 01:11:08,854][model8_pretrain.py][INFO] Epoch:[0/2](524900/4588595) loss:2.749 lr:0.0000100 epoch_Time:25791.0min: [2024-01-05 01:11:08,855][model8_pretrain.py][INFO] Epoch:[0/2](524900/4588595) loss:3.411 lr:0.0000100 epoch_Time:25791.0min: [2024-01-05 01:11:08,855][model8_pretrain.py][INFO] Epoch:[0/2](524900/4588595) loss:2.411 lr:0.0000100 epoch_Time:25791.0min: [2024-01-05 01:11:45,791][model8_pretrain.py][INFO] Epoch:[0/2](525000/4588595) loss:2.930 lr:0.0000100 epoch_Time:25790.0min: [2024-01-05 01:11:45,791][model8_pretrain.py][INFO] Epoch:[0/2](525000/4588595) loss:3.255 lr:0.0000100 epoch_Time:25790.0min: [2024-01-05 01:11:45,791][model8_pretrain.py][INFO] Epoch:[0/2](525000/4588595) loss:3.016 lr:0.0000100 epoch_Time:25790.0min: [2024-01-05 01:11:45,791][model8_pretrain.py][INFO] Epoch:[0/2](525000/4588595) loss:2.725 lr:0.0000100 epoch_Time:25790.0min: [2024-01-05 
01:11:45,791][model8_pretrain.py][INFO] Epoch:[0/2](525000/4588595) loss:3.014 lr:0.0000100 epoch_Time:25790.0min: [2024-01-05 01:11:45,791][model8_pretrain.py][INFO] Epoch:[0/2](525000/4588595) loss:3.225 lr:0.0000100 epoch_Time:25790.0min: [2024-01-05 01:11:45,791][model8_pretrain.py][INFO] Epoch:[0/2](525000/4588595) loss:3.174 lr:0.0000100 epoch_Time:25790.0min: [2024-01-05 01:11:45,791][model8_pretrain.py][INFO] Epoch:[0/2](525000/4588595) loss:2.510 lr:0.0000100 epoch_Time:25790.0min: [2024-01-05 01:12:22,726][model8_pretrain.py][INFO] Epoch:[0/2](525100/4588595) loss:2.459 lr:0.0000100 epoch_Time:25789.0min: [2024-01-05 01:12:22,726][model8_pretrain.py][INFO] Epoch:[0/2](525100/4588595) loss:3.217 lr:0.0000100 epoch_Time:25789.0min: [2024-01-05 01:12:22,726][model8_pretrain.py][INFO] Epoch:[0/2](525100/4588595) loss:3.067 lr:0.0000100 epoch_Time:25789.0min: [2024-01-05 01:12:22,726][model8_pretrain.py][INFO] Epoch:[0/2](525100/4588595) loss:2.736 lr:0.0000100 epoch_Time:25789.0min: [2024-01-05 01:12:22,726][model8_pretrain.py][INFO] Epoch:[0/2](525100/4588595) loss:2.947 lr:0.0000100 epoch_Time:25789.0min: [2024-01-05 01:12:22,726][model8_pretrain.py][INFO] Epoch:[0/2](525100/4588595) loss:2.714 lr:0.0000100 epoch_Time:25789.0min: [2024-01-05 01:12:22,726][model8_pretrain.py][INFO] Epoch:[0/2](525100/4588595) loss:3.062 lr:0.0000100 epoch_Time:25789.0min: [2024-01-05 01:12:22,726][model8_pretrain.py][INFO] Epoch:[0/2](525100/4588595) loss:2.596 lr:0.0000100 epoch_Time:25789.0min: [2024-01-05 01:12:59,652][model8_pretrain.py][INFO] Epoch:[0/2](525200/4588595) loss:2.721 lr:0.0000100 epoch_Time:25788.0min: [2024-01-05 01:12:59,652][model8_pretrain.py][INFO] Epoch:[0/2](525200/4588595) loss:3.113 lr:0.0000100 epoch_Time:25788.0min: [2024-01-05 01:12:59,652][model8_pretrain.py][INFO] Epoch:[0/2](525200/4588595) loss:2.971 lr:0.0000100 epoch_Time:25788.0min: [2024-01-05 01:12:59,652][model8_pretrain.py][INFO] Epoch:[0/2](525200/4588595) loss:2.306 lr:0.0000100 
epoch_Time:25788.0min: [2024-01-05 01:12:59,652][model8_pretrain.py][INFO] Epoch:[0/2](525200/4588595) loss:2.992 lr:0.0000100 epoch_Time:25788.0min: [2024-01-05 01:12:59,652][model8_pretrain.py][INFO] Epoch:[0/2](525200/4588595) loss:2.916 lr:0.0000100 epoch_Time:25788.0min: [2024-01-05 01:12:59,652][model8_pretrain.py][INFO] Epoch:[0/2](525200/4588595) loss:3.278 lr:0.0000100 epoch_Time:25788.0min: [2024-01-05 01:12:59,652][model8_pretrain.py][INFO] Epoch:[0/2](525200/4588595) loss:2.667 lr:0.0000100 epoch_Time:25788.0min: [2024-01-05 01:13:36,580][model8_pretrain.py][INFO] Epoch:[0/2](525300/4588595) loss:2.578 lr:0.0000100 epoch_Time:25788.0min: [2024-01-05 01:13:36,580][model8_pretrain.py][INFO] Epoch:[0/2](525300/4588595) loss:2.580 lr:0.0000100 epoch_Time:25788.0min: [2024-01-05 01:13:36,580][model8_pretrain.py][INFO] Epoch:[0/2](525300/4588595) loss:3.474 lr:0.0000100 epoch_Time:25788.0min: [2024-01-05 01:13:36,580][model8_pretrain.py][INFO] Epoch:[0/2](525300/4588595) loss:2.965 lr:0.0000100 epoch_Time:25788.0min: [2024-01-05 01:13:36,580][model8_pretrain.py][INFO] Epoch:[0/2](525300/4588595) loss:2.646 lr:0.0000100 epoch_Time:25788.0min: [2024-01-05 01:13:36,580][model8_pretrain.py][INFO] Epoch:[0/2](525300/4588595) loss:3.271 lr:0.0000100 epoch_Time:25788.0min: [2024-01-05 01:13:36,580][model8_pretrain.py][INFO] Epoch:[0/2](525300/4588595) loss:2.944 lr:0.0000100 epoch_Time:25788.0min: [2024-01-05 01:13:36,580][model8_pretrain.py][INFO] Epoch:[0/2](525300/4588595) loss:2.696 lr:0.0000100 epoch_Time:25788.0min: [2024-01-05 01:14:23,893][model8_pretrain.py][INFO] Epoch:[0/2](525400/4588595) loss:2.618 lr:0.0000100 epoch_Time:25788.0min: [2024-01-05 01:14:23,893][model8_pretrain.py][INFO] Epoch:[0/2](525400/4588595) loss:3.101 lr:0.0000100 epoch_Time:25788.0min: [2024-01-05 01:14:23,893][model8_pretrain.py][INFO] Epoch:[0/2](525400/4588595) loss:3.080 lr:0.0000100 epoch_Time:25788.0min: [2024-01-05 01:14:23,893][model8_pretrain.py][INFO] 
Epoch:[0/2](525400/4588595) loss:2.871 lr:0.0000100 epoch_Time:25788.0min: [2024-01-05 01:14:23,893][model8_pretrain.py][INFO] Epoch:[0/2](525400/4588595) loss:3.136 lr:0.0000100 epoch_Time:25788.0min: [2024-01-05 01:14:23,894][model8_pretrain.py][INFO] Epoch:[0/2](525400/4588595) loss:2.812 lr:0.0000100 epoch_Time:25788.0min: [2024-01-05 01:14:23,894][model8_pretrain.py][INFO] Epoch:[0/2](525400/4588595) loss:2.842 lr:0.0000100 epoch_Time:25788.0min: [2024-01-05 01:14:23,894][model8_pretrain.py][INFO] Epoch:[0/2](525400/4588595) loss:2.887 lr:0.0000100 epoch_Time:25788.0min: [2024-01-05 01:15:00,808][model8_pretrain.py][INFO] Epoch:[0/2](525500/4588595) loss:2.947 lr:0.0000100 epoch_Time:25787.0min: [2024-01-05 01:15:00,808][model8_pretrain.py][INFO] Epoch:[0/2](525500/4588595) loss:3.007 lr:0.0000100 epoch_Time:25787.0min: [2024-01-05 01:15:00,808][model8_pretrain.py][INFO] Epoch:[0/2](525500/4588595) loss:3.036 lr:0.0000100 epoch_Time:25787.0min: [2024-01-05 01:15:00,808][model8_pretrain.py][INFO] Epoch:[0/2](525500/4588595) loss:2.972 lr:0.0000100 epoch_Time:25787.0min: [2024-01-05 01:15:00,808][model8_pretrain.py][INFO] Epoch:[0/2](525500/4588595) loss:3.004 lr:0.0000100 epoch_Time:25787.0min: [2024-01-05 01:15:00,808][model8_pretrain.py][INFO] Epoch:[0/2](525500/4588595) loss:2.882 lr:0.0000100 epoch_Time:25787.0min: [2024-01-05 01:15:00,808][model8_pretrain.py][INFO] Epoch:[0/2](525500/4588595) loss:2.833 lr:0.0000100 epoch_Time:25787.0min: [2024-01-05 01:15:00,808][model8_pretrain.py][INFO] Epoch:[0/2](525500/4588595) loss:2.759 lr:0.0000100 epoch_Time:25787.0min: [2024-01-05 01:15:37,731][model8_pretrain.py][INFO] Epoch:[0/2](525600/4588595) loss:2.901 lr:0.0000100 epoch_Time:25787.0min: [2024-01-05 01:15:37,731][model8_pretrain.py][INFO] Epoch:[0/2](525600/4588595) loss:3.011 lr:0.0000100 epoch_Time:25787.0min: [2024-01-05 01:15:37,731][model8_pretrain.py][INFO] Epoch:[0/2](525600/4588595) loss:3.188 lr:0.0000100 epoch_Time:25787.0min: [2024-01-05 
01:15:37,731][model8_pretrain.py][INFO] Epoch:[0/2](525600/4588595) loss:2.669 lr:0.0000100 epoch_Time:25787.0min: [2024-01-05 01:15:37,731][model8_pretrain.py][INFO] Epoch:[0/2](525600/4588595) loss:3.037 lr:0.0000100 epoch_Time:25787.0min: [2024-01-05 01:15:37,731][model8_pretrain.py][INFO] Epoch:[0/2](525600/4588595) loss:2.778 lr:0.0000100 epoch_Time:25787.0min: [2024-01-05 01:15:37,731][model8_pretrain.py][INFO] Epoch:[0/2](525600/4588595) loss:2.928 lr:0.0000100 epoch_Time:25787.0min: [2024-01-05 01:15:37,731][model8_pretrain.py][INFO] Epoch:[0/2](525600/4588595) loss:2.655 lr:0.0000100 epoch_Time:25787.0min: [2024-01-05 01:16:14,653][model8_pretrain.py][INFO] Epoch:[0/2](525700/4588595) loss:3.394 lr:0.0000100 epoch_Time:25786.0min: [2024-01-05 01:16:14,653][model8_pretrain.py][INFO] Epoch:[0/2](525700/4588595) loss:3.301 lr:0.0000100 epoch_Time:25786.0min: [2024-01-05 01:16:14,653][model8_pretrain.py][INFO] Epoch:[0/2](525700/4588595) loss:3.105 lr:0.0000100 epoch_Time:25786.0min: [2024-01-05 01:16:14,653][model8_pretrain.py][INFO] Epoch:[0/2](525700/4588595) loss:2.194 lr:0.0000100 epoch_Time:25786.0min: [2024-01-05 01:16:14,653][model8_pretrain.py][INFO] Epoch:[0/2](525700/4588595) loss:2.724 lr:0.0000100 epoch_Time:25786.0min: [2024-01-05 01:16:14,653][model8_pretrain.py][INFO] Epoch:[0/2](525700/4588595) loss:2.568 lr:0.0000100 epoch_Time:25786.0min: [2024-01-05 01:16:14,653][model8_pretrain.py][INFO] Epoch:[0/2](525700/4588595) loss:2.776 lr:0.0000100 epoch_Time:25786.0min: [2024-01-05 01:16:14,653][model8_pretrain.py][INFO] Epoch:[0/2](525700/4588595) loss:3.185 lr:0.0000100 epoch_Time:25786.0min: [2024-01-05 01:16:51,583][model8_pretrain.py][INFO] Epoch:[0/2](525800/4588595) loss:3.345 lr:0.0000100 epoch_Time:25785.0min: [2024-01-05 01:16:51,583][model8_pretrain.py][INFO] Epoch:[0/2](525800/4588595) loss:3.141 lr:0.0000100 epoch_Time:25785.0min: [2024-01-05 01:16:51,583][model8_pretrain.py][INFO] Epoch:[0/2](525800/4588595) loss:2.945 lr:0.0000100 
epoch_Time:25785.0min: [2024-01-05 01:16:51,583][model8_pretrain.py][INFO] Epoch:[0/2](525800/4588595) loss:2.684 lr:0.0000100 epoch_Time:25785.0min: [2024-01-05 01:16:51,583][model8_pretrain.py][INFO] Epoch:[0/2](525800/4588595) loss:2.695 lr:0.0000100 epoch_Time:25785.0min: [2024-01-05 01:16:51,583][model8_pretrain.py][INFO] Epoch:[0/2](525800/4588595) loss:2.229 lr:0.0000100 epoch_Time:25785.0min: [2024-01-05 01:16:51,583][model8_pretrain.py][INFO] Epoch:[0/2](525800/4588595) loss:2.926 lr:0.0000100 epoch_Time:25785.0min: [2024-01-05 01:16:51,583][model8_pretrain.py][INFO] Epoch:[0/2](525800/4588595) loss:3.013 lr:0.0000100 epoch_Time:25785.0min: [2024-01-05 01:17:28,521][model8_pretrain.py][INFO] Epoch:[0/2](525900/4588595) loss:2.715 lr:0.0000100 epoch_Time:25784.0min: [2024-01-05 01:17:28,521][model8_pretrain.py][INFO] Epoch:[0/2](525900/4588595) loss:2.823 lr:0.0000100 epoch_Time:25784.0min: [2024-01-05 01:17:28,521][model8_pretrain.py][INFO] Epoch:[0/2](525900/4588595) loss:3.131 lr:0.0000100 epoch_Time:25784.0min: [2024-01-05 01:17:28,521][model8_pretrain.py][INFO] Epoch:[0/2](525900/4588595) loss:2.490 lr:0.0000100 epoch_Time:25784.0min: [2024-01-05 01:17:28,521][model8_pretrain.py][INFO] Epoch:[0/2](525900/4588595) loss:3.433 lr:0.0000100 epoch_Time:25784.0min: [2024-01-05 01:17:28,521][model8_pretrain.py][INFO] Epoch:[0/2](525900/4588595) loss:2.836 lr:0.0000100 epoch_Time:25784.0min: [2024-01-05 01:17:28,521][model8_pretrain.py][INFO] Epoch:[0/2](525900/4588595) loss:3.123 lr:0.0000100 epoch_Time:25784.0min: [2024-01-05 01:17:28,522][model8_pretrain.py][INFO] Epoch:[0/2](525900/4588595) loss:2.883 lr:0.0000100 epoch_Time:25784.0min: [2024-01-05 01:18:05,454][model8_pretrain.py][INFO] Epoch:[0/2](526000/4588595) loss:2.231 lr:0.0000100 epoch_Time:25783.0min: [2024-01-05 01:18:05,454][model8_pretrain.py][INFO] Epoch:[0/2](526000/4588595) loss:3.105 lr:0.0000100 epoch_Time:25783.0min: [2024-01-05 01:18:05,454][model8_pretrain.py][INFO] 
Epoch:[0/2](526000/4588595) loss:3.244 lr:0.0000100 epoch_Time:25783.0min: [2024-01-05 01:18:05,454][model8_pretrain.py][INFO] Epoch:[0/2](526000/4588595) loss:2.474 lr:0.0000100 epoch_Time:25783.0min: [2024-01-05 01:18:05,454][model8_pretrain.py][INFO] Epoch:[0/2](526000/4588595) loss:2.820 lr:0.0000100 epoch_Time:25783.0min: [2024-01-05 01:18:05,454][model8_pretrain.py][INFO] Epoch:[0/2](526000/4588595) loss:3.089 lr:0.0000100 epoch_Time:25783.0min: [2024-01-05 01:18:05,454][model8_pretrain.py][INFO] Epoch:[0/2](526000/4588595) loss:3.042 lr:0.0000100 epoch_Time:25783.0min: [2024-01-05 01:18:05,454][model8_pretrain.py][INFO] Epoch:[0/2](526000/4588595) loss:2.279 lr:0.0000100 epoch_Time:25783.0min: [2024-01-05 01:18:42,382][model8_pretrain.py][INFO] Epoch:[0/2](526100/4588595) loss:2.674 lr:0.0000100 epoch_Time:25783.0min: [2024-01-05 01:18:42,382][model8_pretrain.py][INFO] Epoch:[0/2](526100/4588595) loss:2.787 lr:0.0000100 epoch_Time:25783.0min: [2024-01-05 01:18:42,382][model8_pretrain.py][INFO] Epoch:[0/2](526100/4588595) loss:2.455 lr:0.0000100 epoch_Time:25783.0min: [2024-01-05 01:18:42,382][model8_pretrain.py][INFO] Epoch:[0/2](526100/4588595) loss:3.019 lr:0.0000100 epoch_Time:25783.0min: [2024-01-05 01:18:42,382][model8_pretrain.py][INFO] Epoch:[0/2](526100/4588595) loss:3.034 lr:0.0000100 epoch_Time:25783.0min: [2024-01-05 01:18:42,382][model8_pretrain.py][INFO] Epoch:[0/2](526100/4588595) loss:2.914 lr:0.0000100 epoch_Time:25783.0min: [2024-01-05 01:18:42,382][model8_pretrain.py][INFO] Epoch:[0/2](526100/4588595) loss:2.360 lr:0.0000100 epoch_Time:25783.0min: [2024-01-05 01:18:42,382][model8_pretrain.py][INFO] Epoch:[0/2](526100/4588595) loss:2.541 lr:0.0000100 epoch_Time:25783.0min: [2024-01-05 01:19:29,700][model8_pretrain.py][INFO] Epoch:[0/2](526200/4588595) loss:2.620 lr:0.0000100 epoch_Time:25783.0min: [2024-01-05 01:19:29,700][model8_pretrain.py][INFO] Epoch:[0/2](526200/4588595) loss:2.702 lr:0.0000100 epoch_Time:25783.0min: [2024-01-05 
01:19:29,700][model8_pretrain.py][INFO] Epoch:[0/2](526200/4588595) loss:2.419 lr:0.0000100 epoch_Time:25783.0min: [2024-01-05 01:19:29,700][model8_pretrain.py][INFO] Epoch:[0/2](526200/4588595) loss:3.301 lr:0.0000100 epoch_Time:25783.0min: [2024-01-05 01:19:29,700][model8_pretrain.py][INFO] Epoch:[0/2](526200/4588595) loss:3.154 lr:0.0000100 epoch_Time:25783.0min: [2024-01-05 01:19:29,700][model8_pretrain.py][INFO] Epoch:[0/2](526200/4588595) loss:2.997 lr:0.0000100 epoch_Time:25783.0min: [2024-01-05 01:19:29,700][model8_pretrain.py][INFO] Epoch:[0/2](526200/4588595) loss:3.008 lr:0.0000100 epoch_Time:25783.0min: [2024-01-05 01:19:29,700][model8_pretrain.py][INFO] Epoch:[0/2](526200/4588595) loss:3.135 lr:0.0000100 epoch_Time:25783.0min: [2024-01-05 01:20:06,625][model8_pretrain.py][INFO] Epoch:[0/2](526300/4588595) loss:2.509 lr:0.0000100 epoch_Time:25782.0min: [2024-01-05 01:20:06,625][model8_pretrain.py][INFO] Epoch:[0/2](526300/4588595) loss:3.064 lr:0.0000100 epoch_Time:25782.0min: [2024-01-05 01:20:06,625][model8_pretrain.py][INFO] Epoch:[0/2](526300/4588595) loss:2.479 lr:0.0000100 epoch_Time:25782.0min: [2024-01-05 01:20:06,625][model8_pretrain.py][INFO] Epoch:[0/2](526300/4588595) loss:2.278 lr:0.0000100 epoch_Time:25782.0min: [2024-01-05 01:20:06,625][model8_pretrain.py][INFO] Epoch:[0/2](526300/4588595) loss:2.494 lr:0.0000100 epoch_Time:25782.0min: [2024-01-05 01:20:06,625][model8_pretrain.py][INFO] Epoch:[0/2](526300/4588595) loss:2.287 lr:0.0000100 epoch_Time:25782.0min: [2024-01-05 01:20:06,625][model8_pretrain.py][INFO] Epoch:[0/2](526300/4588595) loss:2.765 lr:0.0000100 epoch_Time:25782.0min: [2024-01-05 01:20:06,625][model8_pretrain.py][INFO] Epoch:[0/2](526300/4588595) loss:2.549 lr:0.0000100 epoch_Time:25782.0min: [2024-01-05 01:20:43,554][model8_pretrain.py][INFO] Epoch:[0/2](526400/4588595) loss:2.875 lr:0.0000100 epoch_Time:25782.0min: [2024-01-05 01:20:43,554][model8_pretrain.py][INFO] Epoch:[0/2](526400/4588595) loss:2.730 lr:0.0000100 
epoch_Time:25782.0min: [2024-01-05 01:20:43,554][model8_pretrain.py][INFO] Epoch:[0/2](526400/4588595) loss:2.879 lr:0.0000100 epoch_Time:25782.0min: [2024-01-05 01:20:43,554][model8_pretrain.py][INFO] Epoch:[0/2](526400/4588595) loss:2.903 lr:0.0000100 epoch_Time:25782.0min: [2024-01-05 01:20:43,554][model8_pretrain.py][INFO] Epoch:[0/2](526400/4588595) loss:2.495 lr:0.0000100 epoch_Time:25782.0min: [2024-01-05 01:20:43,554][model8_pretrain.py][INFO] Epoch:[0/2](526400/4588595) loss:2.649 lr:0.0000100 epoch_Time:25782.0min: [2024-01-05 01:20:43,554][model8_pretrain.py][INFO] Epoch:[0/2](526400/4588595) loss:2.444 lr:0.0000100 epoch_Time:25782.0min: [2024-01-05 01:20:43,554][model8_pretrain.py][INFO] Epoch:[0/2](526400/4588595) loss:2.164 lr:0.0000100 epoch_Time:25782.0min: [2024-01-05 01:21:20,477][model8_pretrain.py][INFO] Epoch:[0/2](526500/4588595) loss:3.277 lr:0.0000100 epoch_Time:25781.0min: [2024-01-05 01:21:20,478][model8_pretrain.py][INFO] Epoch:[0/2](526500/4588595) loss:3.288 lr:0.0000100 epoch_Time:25781.0min: [2024-01-05 01:21:20,478][model8_pretrain.py][INFO] Epoch:[0/2](526500/4588595) loss:1.608 lr:0.0000100 epoch_Time:25781.0min: [2024-01-05 01:21:20,478][model8_pretrain.py][INFO] Epoch:[0/2](526500/4588595) loss:2.805 lr:0.0000100 epoch_Time:25781.0min: [2024-01-05 01:21:20,478][model8_pretrain.py][INFO] Epoch:[0/2](526500/4588595) loss:3.082 lr:0.0000100 epoch_Time:25781.0min: [2024-01-05 01:21:20,478][model8_pretrain.py][INFO] Epoch:[0/2](526500/4588595) loss:2.780 lr:0.0000100 epoch_Time:25781.0min: [2024-01-05 01:21:20,478][model8_pretrain.py][INFO] Epoch:[0/2](526500/4588595) loss:2.573 lr:0.0000100 epoch_Time:25781.0min: [2024-01-05 01:21:20,479][model8_pretrain.py][INFO] Epoch:[0/2](526500/4588595) loss:2.990 lr:0.0000100 epoch_Time:25781.0min: [2024-01-05 01:21:57,404][model8_pretrain.py][INFO] Epoch:[0/2](526600/4588595) loss:2.482 lr:0.0000100 epoch_Time:25780.0min: [2024-01-05 01:21:57,404][model8_pretrain.py][INFO] 
Epoch:[0/2](526600/4588595) loss:2.808 lr:0.0000100 epoch_Time:25780.0min: [2024-01-05 01:21:57,404][model8_pretrain.py][INFO] Epoch:[0/2](526600/4588595) loss:3.138 lr:0.0000100 epoch_Time:25780.0min: [2024-01-05 01:21:57,404][model8_pretrain.py][INFO] Epoch:[0/2](526600/4588595) loss:2.942 lr:0.0000100 epoch_Time:25780.0min: [2024-01-05 01:21:57,404][model8_pretrain.py][INFO] Epoch:[0/2](526600/4588595) loss:3.022 lr:0.0000100 epoch_Time:25780.0min: [2024-01-05 01:21:57,404][model8_pretrain.py][INFO] Epoch:[0/2](526600/4588595) loss:2.597 lr:0.0000100 epoch_Time:25780.0min: [2024-01-05 01:21:57,404][model8_pretrain.py][INFO] Epoch:[0/2](526600/4588595) loss:3.039 lr:0.0000100 epoch_Time:25780.0min: [2024-01-05 01:21:57,405][model8_pretrain.py][INFO] Epoch:[0/2](526600/4588595) loss:2.873 lr:0.0000100 epoch_Time:25780.0min: [2024-01-05 01:22:34,353][model8_pretrain.py][INFO] Epoch:[0/2](526700/4588595) loss:3.146 lr:0.0000100 epoch_Time:25780.0min: [2024-01-05 01:22:34,353][model8_pretrain.py][INFO] Epoch:[0/2](526700/4588595) loss:3.342 lr:0.0000100 epoch_Time:25780.0min: [2024-01-05 01:22:34,353][model8_pretrain.py][INFO] Epoch:[0/2](526700/4588595) loss:2.530 lr:0.0000100 epoch_Time:25780.0min: [2024-01-05 01:22:34,353][model8_pretrain.py][INFO] Epoch:[0/2](526700/4588595) loss:3.011 lr:0.0000100 epoch_Time:25780.0min: [2024-01-05 01:22:34,353][model8_pretrain.py][INFO] Epoch:[0/2](526700/4588595) loss:2.841 lr:0.0000100 epoch_Time:25780.0min: [2024-01-05 01:22:34,353][model8_pretrain.py][INFO] Epoch:[0/2](526700/4588595) loss:3.266 lr:0.0000100 epoch_Time:25780.0min: [2024-01-05 01:22:34,353][model8_pretrain.py][INFO] Epoch:[0/2](526700/4588595) loss:2.892 lr:0.0000100 epoch_Time:25780.0min: [2024-01-05 01:22:34,353][model8_pretrain.py][INFO] Epoch:[0/2](526700/4588595) loss:3.363 lr:0.0000100 epoch_Time:25780.0min: [2024-01-05 01:23:11,290][model8_pretrain.py][INFO] Epoch:[0/2](526800/4588595) loss:1.735 lr:0.0000100 epoch_Time:25778.0min: [2024-01-05 
01:23:11,290][model8_pretrain.py][INFO] Epoch:[0/2](526800/4588595) loss:2.747 lr:0.0000100 epoch_Time:25778.0min: [2024-01-05 01:23:11,290][model8_pretrain.py][INFO] Epoch:[0/2](526800/4588595) loss:3.378 lr:0.0000100 epoch_Time:25778.0min: [2024-01-05 01:23:11,290][model8_pretrain.py][INFO] Epoch:[0/2](526800/4588595) loss:2.789 lr:0.0000100 epoch_Time:25778.0min: [2024-01-05 01:23:11,290][model8_pretrain.py][INFO] Epoch:[0/2](526800/4588595) loss:2.572 lr:0.0000100 epoch_Time:25778.0min: [2024-01-05 01:23:11,290][model8_pretrain.py][INFO] Epoch:[0/2](526800/4588595) loss:2.776 lr:0.0000100 epoch_Time:25778.0min: [2024-01-05 01:23:11,290][model8_pretrain.py][INFO] Epoch:[0/2](526800/4588595) loss:2.902 lr:0.0000100 epoch_Time:25778.0min: [2024-01-05 01:23:11,290][model8_pretrain.py][INFO] Epoch:[0/2](526800/4588595) loss:2.775 lr:0.0000100 epoch_Time:25778.0min: [2024-01-05 01:23:48,219][model8_pretrain.py][INFO] Epoch:[0/2](526900/4588595) loss:2.565 lr:0.0000100 epoch_Time:25777.0min: [2024-01-05 01:23:48,220][model8_pretrain.py][INFO] Epoch:[0/2](526900/4588595) loss:2.968 lr:0.0000100 epoch_Time:25777.0min: [2024-01-05 01:23:48,220][model8_pretrain.py][INFO] Epoch:[0/2](526900/4588595) loss:2.937 lr:0.0000100 epoch_Time:25777.0min: [2024-01-05 01:23:48,220][model8_pretrain.py][INFO] Epoch:[0/2](526900/4588595) loss:2.277 lr:0.0000100 epoch_Time:25777.0min: [2024-01-05 01:23:48,220][model8_pretrain.py][INFO] Epoch:[0/2](526900/4588595) loss:2.744 lr:0.0000100 epoch_Time:25777.0min: [2024-01-05 01:23:48,220][model8_pretrain.py][INFO] Epoch:[0/2](526900/4588595) loss:2.520 lr:0.0000100 epoch_Time:25777.0min: [2024-01-05 01:23:48,220][model8_pretrain.py][INFO] Epoch:[0/2](526900/4588595) loss:2.643 lr:0.0000100 epoch_Time:25777.0min: [2024-01-05 01:23:48,220][model8_pretrain.py][INFO] Epoch:[0/2](526900/4588595) loss:2.497 lr:0.0000100 epoch_Time:25777.0min: [2024-01-05 01:24:35,474][model8_pretrain.py][INFO] Epoch:[0/2](527000/4588595) loss:2.758 lr:0.0000100 
epoch_Time:25779.0min: [2024-01-05 01:24:35,474][model8_pretrain.py][INFO] Epoch:[0/2](527000/4588595) loss:2.263 lr:0.0000100 epoch_Time:25779.0min: [2024-01-05 01:24:35,474][model8_pretrain.py][INFO] Epoch:[0/2](527000/4588595) loss:3.187 lr:0.0000100 epoch_Time:25779.0min: [2024-01-05 01:24:35,474][model8_pretrain.py][INFO] Epoch:[0/2](527000/4588595) loss:2.798 lr:0.0000100 epoch_Time:25779.0min: [2024-01-05 01:24:35,474][model8_pretrain.py][INFO] Epoch:[0/2](527000/4588595) loss:2.865 lr:0.0000100 epoch_Time:25779.0min: [2024-01-05 01:24:35,474][model8_pretrain.py][INFO] Epoch:[0/2](527000/4588595) loss:3.055 lr:0.0000100 epoch_Time:25779.0min: [2024-01-05 01:24:35,474][model8_pretrain.py][INFO] Epoch:[0/2](527000/4588595) loss:3.044 lr:0.0000100 epoch_Time:25779.0min: [2024-01-05 01:24:35,474][model8_pretrain.py][INFO] Epoch:[0/2](527000/4588595) loss:3.075 lr:0.0000100 epoch_Time:25779.0min: [2024-01-05 01:25:12,401][model8_pretrain.py][INFO] Epoch:[0/2](527100/4588595) loss:2.580 lr:0.0000100 epoch_Time:25777.0min: [2024-01-05 01:25:12,401][model8_pretrain.py][INFO] Epoch:[0/2](527100/4588595) loss:2.504 lr:0.0000100 epoch_Time:25777.0min: [2024-01-05 01:25:12,401][model8_pretrain.py][INFO] Epoch:[0/2](527100/4588595) loss:2.202 lr:0.0000100 epoch_Time:25777.0min: [2024-01-05 01:25:12,401][model8_pretrain.py][INFO] Epoch:[0/2](527100/4588595) loss:2.076 lr:0.0000100 epoch_Time:25777.0min: [2024-01-05 01:25:12,401][model8_pretrain.py][INFO] Epoch:[0/2](527100/4588595) loss:2.979 lr:0.0000100 epoch_Time:25777.0min: [2024-01-05 01:25:12,401][model8_pretrain.py][INFO] Epoch:[0/2](527100/4588595) loss:3.370 lr:0.0000100 epoch_Time:25777.0min: [2024-01-05 01:25:12,401][model8_pretrain.py][INFO] Epoch:[0/2](527100/4588595) loss:2.557 lr:0.0000100 epoch_Time:25777.0min: [2024-01-05 01:25:12,401][model8_pretrain.py][INFO] Epoch:[0/2](527100/4588595) loss:2.810 lr:0.0000100 epoch_Time:25777.0min: [2024-01-05 01:25:49,345][model8_pretrain.py][INFO] 
Epoch:[0/2](527200/4588595) loss:3.301 lr:0.0000100 epoch_Time:25776.0min: [2024-01-05 01:25:49,345][model8_pretrain.py][INFO] Epoch:[0/2](527200/4588595) loss:2.302 lr:0.0000100 epoch_Time:25776.0min: [2024-01-05 01:25:49,345][model8_pretrain.py][INFO] Epoch:[0/2](527200/4588595) loss:2.822 lr:0.0000100 epoch_Time:25776.0min: [2024-01-05 01:25:49,345][model8_pretrain.py][INFO] Epoch:[0/2](527200/4588595) loss:2.984 lr:0.0000100 epoch_Time:25776.0min: [2024-01-05 01:25:49,345][model8_pretrain.py][INFO] Epoch:[0/2](527200/4588595) loss:3.190 lr:0.0000100 epoch_Time:25776.0min: [2024-01-05 01:25:49,345][model8_pretrain.py][INFO] Epoch:[0/2](527200/4588595) loss:2.469 lr:0.0000100 epoch_Time:25776.0min: [2024-01-05 01:25:49,345][model8_pretrain.py][INFO] Epoch:[0/2](527200/4588595) loss:2.803 lr:0.0000100 epoch_Time:25776.0min: [2024-01-05 01:25:49,345][model8_pretrain.py][INFO] Epoch:[0/2](527200/4588595) loss:3.375 lr:0.0000100 epoch_Time:25776.0min: [2024-01-05 01:26:26,294][model8_pretrain.py][INFO] Epoch:[0/2](527300/4588595) loss:2.338 lr:0.0000100 epoch_Time:25776.0min: [2024-01-05 01:26:26,294][model8_pretrain.py][INFO] Epoch:[0/2](527300/4588595) loss:3.130 lr:0.0000100 epoch_Time:25776.0min: [2024-01-05 01:26:26,294][model8_pretrain.py][INFO] Epoch:[0/2](527300/4588595) loss:2.837 lr:0.0000100 epoch_Time:25776.0min: [2024-01-05 01:26:26,294][model8_pretrain.py][INFO] Epoch:[0/2](527300/4588595) loss:2.956 lr:0.0000100 epoch_Time:25776.0min: [2024-01-05 01:26:26,294][model8_pretrain.py][INFO] Epoch:[0/2](527300/4588595) loss:2.778 lr:0.0000100 epoch_Time:25776.0min: [2024-01-05 01:26:26,294][model8_pretrain.py][INFO] Epoch:[0/2](527300/4588595) loss:2.557 lr:0.0000100 epoch_Time:25776.0min: [2024-01-05 01:26:26,294][model8_pretrain.py][INFO] Epoch:[0/2](527300/4588595) loss:3.209 lr:0.0000100 epoch_Time:25776.0min: [2024-01-05 01:26:26,294][model8_pretrain.py][INFO] Epoch:[0/2](527300/4588595) loss:2.740 lr:0.0000100 epoch_Time:25776.0min: [2024-01-05 
01:27:03,246][model8_pretrain.py][INFO] Epoch:[0/2](527400/4588595) loss:2.835 lr:0.0000100 epoch_Time:25775.0min: [2024-01-05 01:27:03,246][model8_pretrain.py][INFO] Epoch:[0/2](527400/4588595) loss:2.253 lr:0.0000100 epoch_Time:25775.0min: [2024-01-05 01:27:03,246][model8_pretrain.py][INFO] Epoch:[0/2](527400/4588595) loss:2.835 lr:0.0000100 epoch_Time:25775.0min: [2024-01-05 01:27:03,246][model8_pretrain.py][INFO] Epoch:[0/2](527400/4588595) loss:3.213 lr:0.0000100 epoch_Time:25775.0min: [2024-01-05 01:27:03,246][model8_pretrain.py][INFO] Epoch:[0/2](527400/4588595) loss:2.953 lr:0.0000100 epoch_Time:25775.0min: [2024-01-05 01:27:03,246][model8_pretrain.py][INFO] Epoch:[0/2](527400/4588595) loss:3.355 lr:0.0000100 epoch_Time:25775.0min: [2024-01-05 01:27:03,246][model8_pretrain.py][INFO] Epoch:[0/2](527400/4588595) loss:2.555 lr:0.0000100 epoch_Time:25775.0min: [2024-01-05 01:27:03,246][model8_pretrain.py][INFO] Epoch:[0/2](527400/4588595) loss:2.419 lr:0.0000100 epoch_Time:25775.0min: [2024-01-05 01:27:40,193][model8_pretrain.py][INFO] Epoch:[0/2](527500/4588595) loss:2.634 lr:0.0000100 epoch_Time:25775.0min: [2024-01-05 01:27:40,193][model8_pretrain.py][INFO] Epoch:[0/2](527500/4588595) loss:3.008 lr:0.0000100 epoch_Time:25775.0min: [2024-01-05 01:27:40,193][model8_pretrain.py][INFO] Epoch:[0/2](527500/4588595) loss:2.483 lr:0.0000100 epoch_Time:25775.0min: [2024-01-05 01:27:40,193][model8_pretrain.py][INFO] Epoch:[0/2](527500/4588595) loss:2.791 lr:0.0000100 epoch_Time:25775.0min: [2024-01-05 01:27:40,193][model8_pretrain.py][INFO] Epoch:[0/2](527500/4588595) loss:2.865 lr:0.0000100 epoch_Time:25775.0min: [2024-01-05 01:27:40,193][model8_pretrain.py][INFO] Epoch:[0/2](527500/4588595) loss:2.693 lr:0.0000100 epoch_Time:25775.0min: [2024-01-05 01:27:40,194][model8_pretrain.py][INFO] Epoch:[0/2](527500/4588595) loss:2.916 lr:0.0000100 epoch_Time:25775.0min: [2024-01-05 01:27:40,194][model8_pretrain.py][INFO] Epoch:[0/2](527500/4588595) loss:3.379 lr:0.0000100 
epoch_Time:25775.0min: [2024-01-05 01:28:17,146][model8_pretrain.py][INFO] Epoch:[0/2](527600/4588595) loss:3.024 lr:0.0000100 epoch_Time:25774.0min: [2024-01-05 01:28:17,146][model8_pretrain.py][INFO] Epoch:[0/2](527600/4588595) loss:3.206 lr:0.0000100 epoch_Time:25774.0min: [2024-01-05 01:28:17,146][model8_pretrain.py][INFO] Epoch:[0/2](527600/4588595) loss:2.955 lr:0.0000100 epoch_Time:25774.0min: [2024-01-05 01:28:17,146][model8_pretrain.py][INFO] Epoch:[0/2](527600/4588595) loss:2.652 lr:0.0000100 epoch_Time:25774.0min: [2024-01-05 01:28:17,146][model8_pretrain.py][INFO] Epoch:[0/2](527600/4588595) loss:2.953 lr:0.0000100 epoch_Time:25774.0min: [2024-01-05 01:28:17,146][model8_pretrain.py][INFO] Epoch:[0/2](527600/4588595) loss:2.497 lr:0.0000100 epoch_Time:25774.0min: [2024-01-05 01:28:17,146][model8_pretrain.py][INFO] Epoch:[0/2](527600/4588595) loss:3.152 lr:0.0000100 epoch_Time:25774.0min: [2024-01-05 01:28:17,146][model8_pretrain.py][INFO] Epoch:[0/2](527600/4588595) loss:2.895 lr:0.0000100 epoch_Time:25774.0min: [2024-01-05 01:28:54,094][model8_pretrain.py][INFO] Epoch:[0/2](527700/4588595) loss:2.749 lr:0.0000100 epoch_Time:25772.0min: [2024-01-05 01:28:54,094][model8_pretrain.py][INFO] Epoch:[0/2](527700/4588595) loss:2.190 lr:0.0000100 epoch_Time:25772.0min: [2024-01-05 01:28:54,094][model8_pretrain.py][INFO] Epoch:[0/2](527700/4588595) loss:3.025 lr:0.0000100 epoch_Time:25772.0min: [2024-01-05 01:28:54,094][model8_pretrain.py][INFO] Epoch:[0/2](527700/4588595) loss:2.706 lr:0.0000100 epoch_Time:25772.0min: [2024-01-05 01:28:54,094][model8_pretrain.py][INFO] Epoch:[0/2](527700/4588595) loss:2.998 lr:0.0000100 epoch_Time:25772.0min: [2024-01-05 01:28:54,094][model8_pretrain.py][INFO] Epoch:[0/2](527700/4588595) loss:2.589 lr:0.0000100 epoch_Time:25772.0min: [2024-01-05 01:28:54,094][model8_pretrain.py][INFO] Epoch:[0/2](527700/4588595) loss:2.820 lr:0.0000100 epoch_Time:25772.0min: [2024-01-05 01:28:54,094][model8_pretrain.py][INFO] 
Epoch:[0/2](527700/4588595) loss:2.709 lr:0.0000100 epoch_Time:25772.0min: [2024-01-05 01:29:41,259][model8_pretrain.py][INFO] Epoch:[0/2](527800/4588595) loss:2.871 lr:0.0000100 epoch_Time:25774.0min: [2024-01-05 01:29:41,259][model8_pretrain.py][INFO] Epoch:[0/2](527800/4588595) loss:3.082 lr:0.0000100 epoch_Time:25774.0min: [2024-01-05 01:29:41,259][model8_pretrain.py][INFO] Epoch:[0/2](527800/4588595) loss:2.801 lr:0.0000100 epoch_Time:25774.0min: [2024-01-05 01:29:41,259][model8_pretrain.py][INFO] Epoch:[0/2](527800/4588595) loss:3.040 lr:0.0000100 epoch_Time:25774.0min: [2024-01-05 01:29:41,259][model8_pretrain.py][INFO] Epoch:[0/2](527800/4588595) loss:3.103 lr:0.0000100 epoch_Time:25774.0min: [2024-01-05 01:29:41,259][model8_pretrain.py][INFO] Epoch:[0/2](527800/4588595) loss:3.111 lr:0.0000100 epoch_Time:25774.0min: [2024-01-05 01:29:41,259][model8_pretrain.py][INFO] Epoch:[0/2](527800/4588595) loss:3.134 lr:0.0000100 epoch_Time:25774.0min: [2024-01-05 01:29:41,259][model8_pretrain.py][INFO] Epoch:[0/2](527800/4588595) loss:3.129 lr:0.0000100 epoch_Time:25774.0min: [2024-01-05 01:30:18,188][model8_pretrain.py][INFO] Epoch:[0/2](527900/4588595) loss:3.010 lr:0.0000100 epoch_Time:25773.0min: [2024-01-05 01:30:18,188][model8_pretrain.py][INFO] Epoch:[0/2](527900/4588595) loss:2.116 lr:0.0000100 epoch_Time:25773.0min: [2024-01-05 01:30:18,189][model8_pretrain.py][INFO] Epoch:[0/2](527900/4588595) loss:2.709 lr:0.0000100 epoch_Time:25773.0min: [2024-01-05 01:30:18,189][model8_pretrain.py][INFO] Epoch:[0/2](527900/4588595) loss:3.422 lr:0.0000100 epoch_Time:25773.0min: [2024-01-05 01:30:18,189][model8_pretrain.py][INFO] Epoch:[0/2](527900/4588595) loss:2.787 lr:0.0000100 epoch_Time:25773.0min: [2024-01-05 01:30:18,189][model8_pretrain.py][INFO] Epoch:[0/2](527900/4588595) loss:3.194 lr:0.0000100 epoch_Time:25773.0min: [2024-01-05 01:30:18,189][model8_pretrain.py][INFO] Epoch:[0/2](527900/4588595) loss:3.026 lr:0.0000100 epoch_Time:25773.0min: [2024-01-05 
01:30:18,189][model8_pretrain.py][INFO] Epoch:[0/2](527900/4588595) loss:2.988 lr:0.0000100 epoch_Time:25773.0min: [2024-01-05 01:30:55,132][model8_pretrain.py][INFO] Epoch:[0/2](528000/4588595) loss:2.228 lr:0.0000100 epoch_Time:25771.0min: [2024-01-05 01:30:55,132][model8_pretrain.py][INFO] Epoch:[0/2](528000/4588595) loss:2.456 lr:0.0000100 epoch_Time:25771.0min: [2024-01-05 01:30:55,132][model8_pretrain.py][INFO] Epoch:[0/2](528000/4588595) loss:2.715 lr:0.0000100 epoch_Time:25771.0min: [2024-01-05 01:30:55,132][model8_pretrain.py][INFO] Epoch:[0/2](528000/4588595) loss:2.775 lr:0.0000100 epoch_Time:25771.0min: [2024-01-05 01:30:55,132][model8_pretrain.py][INFO] Epoch:[0/2](528000/4588595) loss:2.495 lr:0.0000100 epoch_Time:25771.0min: [2024-01-05 01:30:55,132][model8_pretrain.py][INFO] Epoch:[0/2](528000/4588595) loss:3.094 lr:0.0000100 epoch_Time:25771.0min: [2024-01-05 01:30:55,132][model8_pretrain.py][INFO] Epoch:[0/2](528000/4588595) loss:2.604 lr:0.0000100 epoch_Time:25771.0min: [2024-01-05 01:30:55,132][model8_pretrain.py][INFO] Epoch:[0/2](528000/4588595) loss:2.854 lr:0.0000100 epoch_Time:25771.0min: [2024-01-05 01:31:32,068][model8_pretrain.py][INFO] Epoch:[0/2](528100/4588595) loss:2.873 lr:0.0000100 epoch_Time:25771.0min: [2024-01-05 01:31:32,069][model8_pretrain.py][INFO] Epoch:[0/2](528100/4588595) loss:2.326 lr:0.0000100 epoch_Time:25771.0min: [2024-01-05 01:31:32,069][model8_pretrain.py][INFO] Epoch:[0/2](528100/4588595) loss:2.801 lr:0.0000100 epoch_Time:25771.0min: [2024-01-05 01:31:32,068][model8_pretrain.py][INFO] Epoch:[0/2](528100/4588595) loss:2.856 lr:0.0000100 epoch_Time:25771.0min: [2024-01-05 01:31:32,069][model8_pretrain.py][INFO] Epoch:[0/2](528100/4588595) loss:2.889 lr:0.0000100 epoch_Time:25771.0min: [2024-01-05 01:31:32,069][model8_pretrain.py][INFO] Epoch:[0/2](528100/4588595) loss:2.916 lr:0.0000100 epoch_Time:25771.0min: [2024-01-05 01:31:32,069][model8_pretrain.py][INFO] Epoch:[0/2](528100/4588595) loss:3.592 lr:0.0000100 
epoch_Time:25771.0min: [2024-01-05 01:31:32,069][model8_pretrain.py][INFO] Epoch:[0/2](528100/4588595) loss:2.867 lr:0.0000100 epoch_Time:25771.0min: [2024-01-05 01:32:09,001][model8_pretrain.py][INFO] Epoch:[0/2](528200/4588595) loss:2.893 lr:0.0000100 epoch_Time:25770.0min: [2024-01-05 01:32:09,001][model8_pretrain.py][INFO] Epoch:[0/2](528200/4588595) loss:3.228 lr:0.0000100 epoch_Time:25770.0min: [2024-01-05 01:32:09,001][model8_pretrain.py][INFO] Epoch:[0/2](528200/4588595) loss:2.991 lr:0.0000100 epoch_Time:25770.0min: [2024-01-05 01:32:09,001][model8_pretrain.py][INFO] Epoch:[0/2](528200/4588595) loss:3.312 lr:0.0000100 epoch_Time:25770.0min: [2024-01-05 01:32:09,001][model8_pretrain.py][INFO] Epoch:[0/2](528200/4588595) loss:3.213 lr:0.0000100 epoch_Time:25770.0min: [2024-01-05 01:32:09,001][model8_pretrain.py][INFO] Epoch:[0/2](528200/4588595) loss:2.745 lr:0.0000100 epoch_Time:25770.0min: [2024-01-05 01:32:09,001][model8_pretrain.py][INFO] Epoch:[0/2](528200/4588595) loss:2.202 lr:0.0000100 epoch_Time:25770.0min: [2024-01-05 01:32:09,001][model8_pretrain.py][INFO] Epoch:[0/2](528200/4588595) loss:2.518 lr:0.0000100 epoch_Time:25770.0min: [2024-01-05 01:32:45,933][model8_pretrain.py][INFO] Epoch:[0/2](528300/4588595) loss:2.369 lr:0.0000100 epoch_Time:25770.0min: [2024-01-05 01:32:45,933][model8_pretrain.py][INFO] Epoch:[0/2](528300/4588595) loss:2.978 lr:0.0000100 epoch_Time:25770.0min: [2024-01-05 01:32:45,933][model8_pretrain.py][INFO] Epoch:[0/2](528300/4588595) loss:2.685 lr:0.0000100 epoch_Time:25770.0min: [2024-01-05 01:32:45,933][model8_pretrain.py][INFO] Epoch:[0/2](528300/4588595) loss:2.923 lr:0.0000100 epoch_Time:25770.0min: [2024-01-05 01:32:45,933][model8_pretrain.py][INFO] Epoch:[0/2](528300/4588595) loss:2.660 lr:0.0000100 epoch_Time:25770.0min: [2024-01-05 01:32:45,933][model8_pretrain.py][INFO] Epoch:[0/2](528300/4588595) loss:2.882 lr:0.0000100 epoch_Time:25770.0min: [2024-01-05 01:32:45,933][model8_pretrain.py][INFO] 
Epoch:[0/2](528300/4588595) loss:3.387 lr:0.0000100 epoch_Time:25770.0min: [2024-01-05 01:32:45,933][model8_pretrain.py][INFO] Epoch:[0/2](528300/4588595) loss:2.703 lr:0.0000100 epoch_Time:25770.0min: [2024-01-05 01:33:22,892][model8_pretrain.py][INFO] Epoch:[0/2](528400/4588595) loss:3.144 lr:0.0000100 epoch_Time:25769.0min: [2024-01-05 01:33:22,892][model8_pretrain.py][INFO] Epoch:[0/2](528400/4588595) loss:3.351 lr:0.0000100 epoch_Time:25769.0min: [2024-01-05 01:33:22,892][model8_pretrain.py][INFO] Epoch:[0/2](528400/4588595) loss:2.635 lr:0.0000100 epoch_Time:25769.0min: [2024-01-05 01:33:22,892][model8_pretrain.py][INFO] Epoch:[0/2](528400/4588595) loss:2.472 lr:0.0000100 epoch_Time:25769.0min: [2024-01-05 01:33:22,892][model8_pretrain.py][INFO] Epoch:[0/2](528400/4588595) loss:3.091 lr:0.0000100 epoch_Time:25769.0min: [2024-01-05 01:33:22,892][model8_pretrain.py][INFO] Epoch:[0/2](528400/4588595) loss:2.955 lr:0.0000100 epoch_Time:25769.0min: [2024-01-05 01:33:22,892][model8_pretrain.py][INFO] Epoch:[0/2](528400/4588595) loss:2.651 lr:0.0000100 epoch_Time:25769.0min: [2024-01-05 01:33:22,893][model8_pretrain.py][INFO] Epoch:[0/2](528400/4588595) loss:2.419 lr:0.0000100 epoch_Time:25769.0min: [2024-01-05 01:33:59,823][model8_pretrain.py][INFO] Epoch:[0/2](528500/4588595) loss:2.309 lr:0.0000100 epoch_Time:25768.0min: [2024-01-05 01:33:59,823][model8_pretrain.py][INFO] Epoch:[0/2](528500/4588595) loss:2.997 lr:0.0000100 epoch_Time:25768.0min: [2024-01-05 01:33:59,823][model8_pretrain.py][INFO] Epoch:[0/2](528500/4588595) loss:2.787 lr:0.0000100 epoch_Time:25768.0min: [2024-01-05 01:33:59,823][model8_pretrain.py][INFO] Epoch:[0/2](528500/4588595) loss:3.285 lr:0.0000100 epoch_Time:25768.0min: [2024-01-05 01:33:59,823][model8_pretrain.py][INFO] Epoch:[0/2](528500/4588595) loss:2.597 lr:0.0000100 epoch_Time:25768.0min: [2024-01-05 01:33:59,823][model8_pretrain.py][INFO] Epoch:[0/2](528500/4588595) loss:2.652 lr:0.0000100 epoch_Time:25768.0min: [2024-01-05 
01:33:59,823][model8_pretrain.py][INFO] Epoch:[0/2](528500/4588595) loss:2.923 lr:0.0000100 epoch_Time:25768.0min: [2024-01-05 01:33:59,823][model8_pretrain.py][INFO] Epoch:[0/2](528500/4588595) loss:2.169 lr:0.0000100 epoch_Time:25768.0min: [2024-01-05 01:34:46,970][model8_pretrain.py][INFO] Epoch:[0/2](528600/4588595) loss:3.498 lr:0.0000100 epoch_Time:25769.0min: [2024-01-05 01:34:46,970][model8_pretrain.py][INFO] Epoch:[0/2](528600/4588595) loss:2.649 lr:0.0000100 epoch_Time:25769.0min: [2024-01-05 01:34:46,970][model8_pretrain.py][INFO] Epoch:[0/2](528600/4588595) loss:2.504 lr:0.0000100 epoch_Time:25769.0min: [2024-01-05 01:34:46,970][model8_pretrain.py][INFO] Epoch:[0/2](528600/4588595) loss:2.448 lr:0.0000100 epoch_Time:25769.0min: [2024-01-05 01:34:46,970][model8_pretrain.py][INFO] Epoch:[0/2](528600/4588595) loss:3.166 lr:0.0000100 epoch_Time:25769.0min: [2024-01-05 01:34:46,970][model8_pretrain.py][INFO] Epoch:[0/2](528600/4588595) loss:3.337 lr:0.0000100 epoch_Time:25769.0min: [2024-01-05 01:34:46,970][model8_pretrain.py][INFO] Epoch:[0/2](528600/4588595) loss:3.165 lr:0.0000100 epoch_Time:25769.0min: [2024-01-05 01:34:46,970][model8_pretrain.py][INFO] Epoch:[0/2](528600/4588595) loss:2.797 lr:0.0000100 epoch_Time:25769.0min: [2024-01-05 01:35:23,895][model8_pretrain.py][INFO] Epoch:[0/2](528700/4588595) loss:2.555 lr:0.0000100 epoch_Time:25768.0min: [2024-01-05 01:35:23,895][model8_pretrain.py][INFO] Epoch:[0/2](528700/4588595) loss:2.820 lr:0.0000100 epoch_Time:25768.0min: [2024-01-05 01:35:23,895][model8_pretrain.py][INFO] Epoch:[0/2](528700/4588595) loss:2.909 lr:0.0000100 epoch_Time:25768.0min: [2024-01-05 01:35:23,895][model8_pretrain.py][INFO] Epoch:[0/2](528700/4588595) loss:3.232 lr:0.0000100 epoch_Time:25768.0min: [2024-01-05 01:35:23,895][model8_pretrain.py][INFO] Epoch:[0/2](528700/4588595) loss:2.521 lr:0.0000100 epoch_Time:25768.0min: [2024-01-05 01:35:23,895][model8_pretrain.py][INFO] Epoch:[0/2](528700/4588595) loss:2.419 lr:0.0000100 
epoch_Time:25768.0min: [2024-01-05 01:35:23,895][model8_pretrain.py][INFO] Epoch:[0/2](528700/4588595) loss:3.084 lr:0.0000100 epoch_Time:25768.0min: [2024-01-05 01:35:23,895][model8_pretrain.py][INFO] Epoch:[0/2](528700/4588595) loss:3.265 lr:0.0000100 epoch_Time:25768.0min: [2024-01-05 01:36:00,825][model8_pretrain.py][INFO] Epoch:[0/2](528800/4588595) loss:2.989 lr:0.0000100 epoch_Time:25767.0min: [2024-01-05 01:36:00,825][model8_pretrain.py][INFO] Epoch:[0/2](528800/4588595) loss:2.514 lr:0.0000100 epoch_Time:25767.0min: [2024-01-05 01:36:00,825][model8_pretrain.py][INFO] Epoch:[0/2](528800/4588595) loss:2.479 lr:0.0000100 epoch_Time:25767.0min: [2024-01-05 01:36:00,825][model8_pretrain.py][INFO] Epoch:[0/2](528800/4588595) loss:2.304 lr:0.0000100 epoch_Time:25767.0min: [2024-01-05 01:36:00,825][model8_pretrain.py][INFO] Epoch:[0/2](528800/4588595) loss:3.062 lr:0.0000100 epoch_Time:25767.0min: [2024-01-05 01:36:00,825][model8_pretrain.py][INFO] Epoch:[0/2](528800/4588595) loss:2.989 lr:0.0000100 epoch_Time:25767.0min: [2024-01-05 01:36:00,825][model8_pretrain.py][INFO] Epoch:[0/2](528800/4588595) loss:2.812 lr:0.0000100 epoch_Time:25767.0min: [2024-01-05 01:36:00,825][model8_pretrain.py][INFO] Epoch:[0/2](528800/4588595) loss:2.767 lr:0.0000100 epoch_Time:25767.0min: [2024-01-05 01:36:37,766][model8_pretrain.py][INFO] Epoch:[0/2](528900/4588595) loss:2.984 lr:0.0000100 epoch_Time:25766.0min: [2024-01-05 01:36:37,766][model8_pretrain.py][INFO] Epoch:[0/2](528900/4588595) loss:2.501 lr:0.0000100 epoch_Time:25766.0min: [2024-01-05 01:36:37,766][model8_pretrain.py][INFO] Epoch:[0/2](528900/4588595) loss:2.554 lr:0.0000100 epoch_Time:25766.0min: [2024-01-05 01:36:37,766][model8_pretrain.py][INFO] Epoch:[0/2](528900/4588595) loss:3.419 lr:0.0000100 epoch_Time:25766.0min: [2024-01-05 01:36:37,766][model8_pretrain.py][INFO] Epoch:[0/2](528900/4588595) loss:2.790 lr:0.0000100 epoch_Time:25766.0min: [2024-01-05 01:36:37,766][model8_pretrain.py][INFO] 
Epoch:[0/2](528900/4588595) loss:2.671 lr:0.0000100 epoch_Time:25766.0min: [2024-01-05 01:36:37,766][model8_pretrain.py][INFO] Epoch:[0/2](528900/4588595) loss:2.510 lr:0.0000100 epoch_Time:25766.0min: [2024-01-05 01:36:37,766][model8_pretrain.py][INFO] Epoch:[0/2](528900/4588595) loss:2.972 lr:0.0000100 epoch_Time:25766.0min: [2024-01-05 01:37:14,713][model8_pretrain.py][INFO] Epoch:[0/2](529000/4588595) loss:2.758 lr:0.0000100 epoch_Time:25765.0min: [2024-01-05 01:37:14,713][model8_pretrain.py][INFO] Epoch:[0/2](529000/4588595) loss:2.872 lr:0.0000100 epoch_Time:25765.0min: [2024-01-05 01:37:14,713][model8_pretrain.py][INFO] Epoch:[0/2](529000/4588595) loss:2.672 lr:0.0000100 epoch_Time:25765.0min: [2024-01-05 01:37:14,713][model8_pretrain.py][INFO] Epoch:[0/2](529000/4588595) loss:3.055 lr:0.0000100 epoch_Time:25765.0min: [2024-01-05 01:37:14,713][model8_pretrain.py][INFO] Epoch:[0/2](529000/4588595) loss:2.888 lr:0.0000100 epoch_Time:25765.0min: [2024-01-05 01:37:14,713][model8_pretrain.py][INFO] Epoch:[0/2](529000/4588595) loss:2.411 lr:0.0000100 epoch_Time:25765.0min: [2024-01-05 01:37:14,713][model8_pretrain.py][INFO] Epoch:[0/2](529000/4588595) loss:2.365 lr:0.0000100 epoch_Time:25765.0min: [2024-01-05 01:37:14,713][model8_pretrain.py][INFO] Epoch:[0/2](529000/4588595) loss:2.628 lr:0.0000100 epoch_Time:25765.0min: [2024-01-05 01:37:51,681][model8_pretrain.py][INFO] Epoch:[0/2](529100/4588595) loss:2.495 lr:0.0000100 epoch_Time:25764.0min: [2024-01-05 01:37:51,682][model8_pretrain.py][INFO] Epoch:[0/2](529100/4588595) loss:2.807 lr:0.0000100 epoch_Time:25764.0min: [2024-01-05 01:37:51,682][model8_pretrain.py][INFO] Epoch:[0/2](529100/4588595) loss:2.833 lr:0.0000100 epoch_Time:25764.0min: [2024-01-05 01:37:51,682][model8_pretrain.py][INFO] Epoch:[0/2](529100/4588595) loss:2.391 lr:0.0000100 epoch_Time:25764.0min: [2024-01-05 01:37:51,682][model8_pretrain.py][INFO] Epoch:[0/2](529100/4588595) loss:2.685 lr:0.0000100 epoch_Time:25764.0min: [2024-01-05 
01:37:51,682][model8_pretrain.py][INFO] Epoch:[0/2](529100/4588595) loss:2.412 lr:0.0000100 epoch_Time:25764.0min: [2024-01-05 01:37:51,682][model8_pretrain.py][INFO] Epoch:[0/2](529100/4588595) loss:2.943 lr:0.0000100 epoch_Time:25764.0min: [2024-01-05 01:37:51,682][model8_pretrain.py][INFO] Epoch:[0/2](529100/4588595) loss:2.168 lr:0.0000100 epoch_Time:25764.0min: [2024-01-05 01:38:28,624][model8_pretrain.py][INFO] Epoch:[0/2](529200/4588595) loss:3.174 lr:0.0000100 epoch_Time:25764.0min: [2024-01-05 01:38:28,624][model8_pretrain.py][INFO] Epoch:[0/2](529200/4588595) loss:2.759 lr:0.0000100 epoch_Time:25764.0min: [2024-01-05 01:38:28,624][model8_pretrain.py][INFO] Epoch:[0/2](529200/4588595) loss:3.282 lr:0.0000100 epoch_Time:25764.0min: [2024-01-05 01:38:28,624][model8_pretrain.py][INFO] Epoch:[0/2](529200/4588595) loss:2.681 lr:0.0000100 epoch_Time:25764.0min: [2024-01-05 01:38:28,624][model8_pretrain.py][INFO] Epoch:[0/2](529200/4588595) loss:2.712 lr:0.0000100 epoch_Time:25764.0min: [2024-01-05 01:38:28,624][model8_pretrain.py][INFO] Epoch:[0/2](529200/4588595) loss:3.375 lr:0.0000100 epoch_Time:25764.0min: [2024-01-05 01:38:28,624][model8_pretrain.py][INFO] Epoch:[0/2](529200/4588595) loss:2.122 lr:0.0000100 epoch_Time:25764.0min: [2024-01-05 01:38:28,624][model8_pretrain.py][INFO] Epoch:[0/2](529200/4588595) loss:3.011 lr:0.0000100 epoch_Time:25764.0min: [2024-01-05 01:39:05,566][model8_pretrain.py][INFO] Epoch:[0/2](529300/4588595) loss:2.934 lr:0.0000100 epoch_Time:25763.0min: [2024-01-05 01:39:05,566][model8_pretrain.py][INFO] Epoch:[0/2](529300/4588595) loss:3.026 lr:0.0000100 epoch_Time:25763.0min: [2024-01-05 01:39:05,566][model8_pretrain.py][INFO] Epoch:[0/2](529300/4588595) loss:2.505 lr:0.0000100 epoch_Time:25763.0min: [2024-01-05 01:39:05,566][model8_pretrain.py][INFO] Epoch:[0/2](529300/4588595) loss:2.901 lr:0.0000100 epoch_Time:25763.0min: [2024-01-05 01:39:05,566][model8_pretrain.py][INFO] Epoch:[0/2](529300/4588595) loss:2.869 lr:0.0000100 
epoch_Time:25763.0min: [2024-01-05 01:39:05,566][model8_pretrain.py][INFO] Epoch:[0/2](529300/4588595) loss:3.106 lr:0.0000100 epoch_Time:25763.0min: [2024-01-05 01:39:05,566][model8_pretrain.py][INFO] Epoch:[0/2](529300/4588595) loss:2.998 lr:0.0000100 epoch_Time:25763.0min: [2024-01-05 01:39:05,567][model8_pretrain.py][INFO] Epoch:[0/2](529300/4588595) loss:3.127 lr:0.0000100 epoch_Time:25763.0min: [2024-01-05 01:39:52,448][model8_pretrain.py][INFO] Epoch:[0/2](529400/4588595) loss:2.777 lr:0.0000100 epoch_Time:25763.0min: [2024-01-05 01:39:52,448][model8_pretrain.py][INFO] Epoch:[0/2](529400/4588595) loss:3.121 lr:0.0000100 epoch_Time:25763.0min: [2024-01-05 01:39:52,448][model8_pretrain.py][INFO] Epoch:[0/2](529400/4588595) loss:2.911 lr:0.0000100 epoch_Time:25763.0min: [2024-01-05 01:39:52,448][model8_pretrain.py][INFO] Epoch:[0/2](529400/4588595) loss:3.036 lr:0.0000100 epoch_Time:25763.0min: [2024-01-05 01:39:52,448][model8_pretrain.py][INFO] Epoch:[0/2](529400/4588595) loss:3.136 lr:0.0000100 epoch_Time:25763.0min: [2024-01-05 01:39:52,448][model8_pretrain.py][INFO] Epoch:[0/2](529400/4588595) loss:3.097 lr:0.0000100 epoch_Time:25763.0min: [2024-01-05 01:39:52,448][model8_pretrain.py][INFO] Epoch:[0/2](529400/4588595) loss:2.678 lr:0.0000100 epoch_Time:25763.0min: [2024-01-05 01:39:52,448][model8_pretrain.py][INFO] Epoch:[0/2](529400/4588595) loss:3.058 lr:0.0000100 epoch_Time:25763.0min: [2024-01-05 01:40:31,055][model8_pretrain.py][INFO] Epoch:[0/2](529500/4588595) loss:2.674 lr:0.0000100 epoch_Time:25763.0min: [2024-01-05 01:40:31,055][model8_pretrain.py][INFO] Epoch:[0/2](529500/4588595) loss:2.980 lr:0.0000100 epoch_Time:25763.0min: [2024-01-05 01:40:31,055][model8_pretrain.py][INFO] Epoch:[0/2](529500/4588595) loss:2.336 lr:0.0000100 epoch_Time:25763.0min: [2024-01-05 01:40:31,055][model8_pretrain.py][INFO] Epoch:[0/2](529500/4588595) loss:2.918 lr:0.0000100 epoch_Time:25763.0min: [2024-01-05 01:40:31,055][model8_pretrain.py][INFO] 
Epoch:[0/2](529500/4588595) loss:2.815 lr:0.0000100 epoch_Time:25763.0min: [2024-01-05 01:40:31,055][model8_pretrain.py][INFO] Epoch:[0/2](529500/4588595) loss:2.986 lr:0.0000100 epoch_Time:25763.0min: [2024-01-05 01:40:31,055][model8_pretrain.py][INFO] Epoch:[0/2](529500/4588595) loss:2.923 lr:0.0000100 epoch_Time:25763.0min: [2024-01-05 01:40:31,055][model8_pretrain.py][INFO] Epoch:[0/2](529500/4588595) loss:3.105 lr:0.0000100 epoch_Time:25763.0min: [2024-01-05 01:41:07,978][model8_pretrain.py][INFO] Epoch:[0/2](529600/4588595) loss:2.716 lr:0.0000100 epoch_Time:25762.0min: [2024-01-05 01:41:07,978][model8_pretrain.py][INFO] Epoch:[0/2](529600/4588595) loss:2.299 lr:0.0000100 epoch_Time:25762.0min: [2024-01-05 01:41:07,978][model8_pretrain.py][INFO] Epoch:[0/2](529600/4588595) loss:2.919 lr:0.0000100 epoch_Time:25762.0min: [2024-01-05 01:41:07,978][model8_pretrain.py][INFO] Epoch:[0/2](529600/4588595) loss:2.840 lr:0.0000100 epoch_Time:25762.0min: [2024-01-05 01:41:07,979][model8_pretrain.py][INFO] Epoch:[0/2](529600/4588595) loss:2.539 lr:0.0000100 epoch_Time:25762.0min: [2024-01-05 01:41:07,979][model8_pretrain.py][INFO] Epoch:[0/2](529600/4588595) loss:3.014 lr:0.0000100 epoch_Time:25762.0min: [2024-01-05 01:41:07,979][model8_pretrain.py][INFO] Epoch:[0/2](529600/4588595) loss:2.956 lr:0.0000100 epoch_Time:25762.0min: [2024-01-05 01:41:07,979][model8_pretrain.py][INFO] Epoch:[0/2](529600/4588595) loss:3.266 lr:0.0000100 epoch_Time:25762.0min: [2024-01-05 01:41:44,910][model8_pretrain.py][INFO] Epoch:[0/2](529700/4588595) loss:3.304 lr:0.0000100 epoch_Time:25762.0min: [2024-01-05 01:41:44,910][model8_pretrain.py][INFO] Epoch:[0/2](529700/4588595) loss:3.088 lr:0.0000100 epoch_Time:25762.0min: [2024-01-05 01:41:44,910][model8_pretrain.py][INFO] Epoch:[0/2](529700/4588595) loss:2.974 lr:0.0000100 epoch_Time:25762.0min: [2024-01-05 01:41:44,910][model8_pretrain.py][INFO] Epoch:[0/2](529700/4588595) loss:3.092 lr:0.0000100 epoch_Time:25762.0min: [2024-01-05 
01:41:44,911][model8_pretrain.py][INFO] Epoch:[0/2](529700/4588595) loss:2.832 lr:0.0000100 epoch_Time:25762.0min: [2024-01-05 01:41:44,910][model8_pretrain.py][INFO] Epoch:[0/2](529700/4588595) loss:2.890 lr:0.0000100 epoch_Time:25762.0min: [2024-01-05 01:41:44,911][model8_pretrain.py][INFO] Epoch:[0/2](529700/4588595) loss:3.267 lr:0.0000100 epoch_Time:25762.0min: [2024-01-05 01:41:44,910][model8_pretrain.py][INFO] Epoch:[0/2](529700/4588595) loss:2.860 lr:0.0000100 epoch_Time:25762.0min: [2024-01-05 01:42:21,862][model8_pretrain.py][INFO] Epoch:[0/2](529800/4588595) loss:3.219 lr:0.0000100 epoch_Time:25761.0min: [2024-01-05 01:42:21,862][model8_pretrain.py][INFO] Epoch:[0/2](529800/4588595) loss:2.946 lr:0.0000100 epoch_Time:25761.0min: [2024-01-05 01:42:21,862][model8_pretrain.py][INFO] Epoch:[0/2](529800/4588595) loss:2.508 lr:0.0000100 epoch_Time:25761.0min: [2024-01-05 01:42:21,862][model8_pretrain.py][INFO] Epoch:[0/2](529800/4588595) loss:2.940 lr:0.0000100 epoch_Time:25761.0min: [2024-01-05 01:42:21,862][model8_pretrain.py][INFO] Epoch:[0/2](529800/4588595) loss:2.977 lr:0.0000100 epoch_Time:25761.0min: [2024-01-05 01:42:21,862][model8_pretrain.py][INFO] Epoch:[0/2](529800/4588595) loss:3.040 lr:0.0000100 epoch_Time:25761.0min: [2024-01-05 01:42:21,862][model8_pretrain.py][INFO] Epoch:[0/2](529800/4588595) loss:2.692 lr:0.0000100 epoch_Time:25761.0min: [2024-01-05 01:42:21,862][model8_pretrain.py][INFO] Epoch:[0/2](529800/4588595) loss:3.097 lr:0.0000100 epoch_Time:25761.0min: [2024-01-05 01:42:58,797][model8_pretrain.py][INFO] Epoch:[0/2](529900/4588595) loss:2.923 lr:0.0000100 epoch_Time:25759.0min: [2024-01-05 01:42:58,797][model8_pretrain.py][INFO] Epoch:[0/2](529900/4588595) loss:3.019 lr:0.0000100 epoch_Time:25759.0min: [2024-01-05 01:42:58,797][model8_pretrain.py][INFO] Epoch:[0/2](529900/4588595) loss:2.490 lr:0.0000100 epoch_Time:25759.0min: [2024-01-05 01:42:58,797][model8_pretrain.py][INFO] Epoch:[0/2](529900/4588595) loss:2.260 lr:0.0000100 
epoch_Time:25759.0min: [2024-01-05 01:42:58,797][model8_pretrain.py][INFO] Epoch:[0/2](529900/4588595) loss:3.032 lr:0.0000100 epoch_Time:25759.0min: [2024-01-05 01:42:58,797][model8_pretrain.py][INFO] Epoch:[0/2](529900/4588595) loss:2.377 lr:0.0000100 epoch_Time:25759.0min: [2024-01-05 01:42:58,797][model8_pretrain.py][INFO] Epoch:[0/2](529900/4588595) loss:3.157 lr:0.0000100 epoch_Time:25759.0min: [2024-01-05 01:42:58,797][model8_pretrain.py][INFO] Epoch:[0/2](529900/4588595) loss:3.102 lr:0.0000100 epoch_Time:25759.0min: [2024-01-05 01:43:35,734][model8_pretrain.py][INFO] Epoch:[0/2](530000/4588595) loss:2.716 lr:0.0000100 epoch_Time:25759.0min: [2024-01-05 01:43:35,734][model8_pretrain.py][INFO] Epoch:[0/2](530000/4588595) loss:2.950 lr:0.0000100 epoch_Time:25759.0min: [2024-01-05 01:43:35,734][model8_pretrain.py][INFO] Epoch:[0/2](530000/4588595) loss:2.785 lr:0.0000100 epoch_Time:25759.0min: [2024-01-05 01:43:35,734][model8_pretrain.py][INFO] Epoch:[0/2](530000/4588595) loss:3.020 lr:0.0000100 epoch_Time:25759.0min: [2024-01-05 01:43:35,734][model8_pretrain.py][INFO] Epoch:[0/2](530000/4588595) loss:3.069 lr:0.0000100 epoch_Time:25759.0min: [2024-01-05 01:43:35,734][model8_pretrain.py][INFO] Epoch:[0/2](530000/4588595) loss:2.807 lr:0.0000100 epoch_Time:25759.0min: [2024-01-05 01:43:35,734][model8_pretrain.py][INFO] Epoch:[0/2](530000/4588595) loss:2.483 lr:0.0000100 epoch_Time:25759.0min: [2024-01-05 01:43:35,735][model8_pretrain.py][INFO] Epoch:[0/2](530000/4588595) loss:3.050 lr:0.0000100 epoch_Time:25759.0min: [2024-01-05 01:44:12,662][model8_pretrain.py][INFO] Epoch:[0/2](530100/4588595) loss:3.042 lr:0.0000100 epoch_Time:25758.0min: [2024-01-05 01:44:12,662][model8_pretrain.py][INFO] Epoch:[0/2](530100/4588595) loss:3.070 lr:0.0000100 epoch_Time:25758.0min: [2024-01-05 01:44:12,662][model8_pretrain.py][INFO] Epoch:[0/2](530100/4588595) loss:2.540 lr:0.0000100 epoch_Time:25758.0min: [2024-01-05 01:44:12,662][model8_pretrain.py][INFO] 
Epoch:[0/2](530100/4588595) loss:3.420 lr:0.0000100 epoch_Time:25758.0min: [2024-01-05 01:44:12,662][model8_pretrain.py][INFO] Epoch:[0/2](530100/4588595) loss:3.277 lr:0.0000100 epoch_Time:25758.0min: [2024-01-05 01:44:12,662][model8_pretrain.py][INFO] Epoch:[0/2](530100/4588595) loss:2.671 lr:0.0000100 epoch_Time:25758.0min: [2024-01-05 01:44:12,662][model8_pretrain.py][INFO] Epoch:[0/2](530100/4588595) loss:3.255 lr:0.0000100 epoch_Time:25758.0min: [2024-01-05 01:44:12,662][model8_pretrain.py][INFO] Epoch:[0/2](530100/4588595) loss:2.805 lr:0.0000100 epoch_Time:25758.0min: [2024-01-05 01:44:59,551][model8_pretrain.py][INFO] Epoch:[0/2](530200/4588595) loss:2.678 lr:0.0000100 epoch_Time:25758.0min: [2024-01-05 01:44:59,551][model8_pretrain.py][INFO] Epoch:[0/2](530200/4588595) loss:2.921 lr:0.0000100 epoch_Time:25758.0min: [2024-01-05 01:44:59,551][model8_pretrain.py][INFO] Epoch:[0/2](530200/4588595) loss:2.140 lr:0.0000100 epoch_Time:25758.0min: [2024-01-05 01:44:59,551][model8_pretrain.py][INFO] Epoch:[0/2](530200/4588595) loss:2.970 lr:0.0000100 epoch_Time:25758.0min: [2024-01-05 01:44:59,551][model8_pretrain.py][INFO] Epoch:[0/2](530200/4588595) loss:3.187 lr:0.0000100 epoch_Time:25758.0min: [2024-01-05 01:44:59,551][model8_pretrain.py][INFO] Epoch:[0/2](530200/4588595) loss:2.629 lr:0.0000100 epoch_Time:25758.0min: [2024-01-05 01:44:59,552][model8_pretrain.py][INFO] Epoch:[0/2](530200/4588595) loss:2.465 lr:0.0000100 epoch_Time:25758.0min: [2024-01-05 01:44:59,552][model8_pretrain.py][INFO] Epoch:[0/2](530200/4588595) loss:2.903 lr:0.0000100 epoch_Time:25758.0min: [2024-01-05 01:45:38,155][model8_pretrain.py][INFO] Epoch:[0/2](530300/4588595) loss:2.750 lr:0.0000100 epoch_Time:25759.0min: [2024-01-05 01:45:38,155][model8_pretrain.py][INFO] Epoch:[0/2](530300/4588595) loss:2.821 lr:0.0000100 epoch_Time:25759.0min: [2024-01-05 01:45:38,156][model8_pretrain.py][INFO] Epoch:[0/2](530300/4588595) loss:2.724 lr:0.0000100 epoch_Time:25759.0min: [2024-01-05 
01:45:38,157][model8_pretrain.py][INFO] Epoch:[0/2](530300/4588595) loss:2.268 lr:0.0000100 epoch_Time:25759.0min: [2024-01-05 01:45:38,157][model8_pretrain.py][INFO] Epoch:[0/2](530300/4588595) loss:2.748 lr:0.0000100 epoch_Time:25759.0min: [2024-01-05 01:45:38,157][model8_pretrain.py][INFO] Epoch:[0/2](530300/4588595) loss:2.162 lr:0.0000100 epoch_Time:25759.0min: [2024-01-05 01:45:38,157][model8_pretrain.py][INFO] Epoch:[0/2](530300/4588595) loss:2.619 lr:0.0000100 epoch_Time:25759.0min: [2024-01-05 01:45:38,157][model8_pretrain.py][INFO] Epoch:[0/2](530300/4588595) loss:2.703 lr:0.0000100 epoch_Time:25759.0min: [2024-01-05 01:46:15,091][model8_pretrain.py][INFO] Epoch:[0/2](530400/4588595) loss:2.599 lr:0.0000100 epoch_Time:25757.0min: [2024-01-05 01:46:15,091][model8_pretrain.py][INFO] Epoch:[0/2](530400/4588595) loss:3.036 lr:0.0000100 epoch_Time:25757.0min: [2024-01-05 01:46:15,091][model8_pretrain.py][INFO] Epoch:[0/2](530400/4588595) loss:2.891 lr:0.0000100 epoch_Time:25757.0min: [2024-01-05 01:46:15,091][model8_pretrain.py][INFO] Epoch:[0/2](530400/4588595) loss:2.931 lr:0.0000100 epoch_Time:25757.0min: [2024-01-05 01:46:15,091][model8_pretrain.py][INFO] Epoch:[0/2](530400/4588595) loss:2.936 lr:0.0000100 epoch_Time:25757.0min: [2024-01-05 01:46:15,091][model8_pretrain.py][INFO] Epoch:[0/2](530400/4588595) loss:3.070 lr:0.0000100 epoch_Time:25757.0min: [2024-01-05 01:46:15,091][model8_pretrain.py][INFO] Epoch:[0/2](530400/4588595) loss:2.782 lr:0.0000100 epoch_Time:25757.0min: [2024-01-05 01:46:15,091][model8_pretrain.py][INFO] Epoch:[0/2](530400/4588595) loss:3.286 lr:0.0000100 epoch_Time:25757.0min: [2024-01-05 01:46:52,021][model8_pretrain.py][INFO] Epoch:[0/2](530500/4588595) loss:3.526 lr:0.0000100 epoch_Time:25756.0min: [2024-01-05 01:46:52,021][model8_pretrain.py][INFO] Epoch:[0/2](530500/4588595) loss:2.673 lr:0.0000100 epoch_Time:25756.0min: [2024-01-05 01:46:52,021][model8_pretrain.py][INFO] Epoch:[0/2](530500/4588595) loss:2.813 lr:0.0000100 
epoch_Time:25756.0min: [2024-01-05 01:46:52,021][model8_pretrain.py][INFO] Epoch:[0/2](530500/4588595) loss:3.089 lr:0.0000100 epoch_Time:25756.0min: [2024-01-05 01:46:52,021][model8_pretrain.py][INFO] Epoch:[0/2](530500/4588595) loss:2.787 lr:0.0000100 epoch_Time:25756.0min: [2024-01-05 01:46:52,021][model8_pretrain.py][INFO] Epoch:[0/2](530500/4588595) loss:2.787 lr:0.0000100 epoch_Time:25756.0min: [2024-01-05 01:46:52,021][model8_pretrain.py][INFO] Epoch:[0/2](530500/4588595) loss:2.400 lr:0.0000100 epoch_Time:25756.0min: [2024-01-05 01:46:52,021][model8_pretrain.py][INFO] Epoch:[0/2](530500/4588595) loss:2.946 lr:0.0000100 epoch_Time:25756.0min: [2024-01-05 01:47:28,950][model8_pretrain.py][INFO] Epoch:[0/2](530600/4588595) loss:3.140 lr:0.0000100 epoch_Time:25756.0min: [2024-01-05 01:47:28,950][model8_pretrain.py][INFO] Epoch:[0/2](530600/4588595) loss:2.917 lr:0.0000100 epoch_Time:25756.0min: [2024-01-05 01:47:28,950][model8_pretrain.py][INFO] Epoch:[0/2](530600/4588595) loss:3.153 lr:0.0000100 epoch_Time:25756.0min: [2024-01-05 01:47:28,950][model8_pretrain.py][INFO] Epoch:[0/2](530600/4588595) loss:2.979 lr:0.0000100 epoch_Time:25756.0min: [2024-01-05 01:47:28,950][model8_pretrain.py][INFO] Epoch:[0/2](530600/4588595) loss:2.659 lr:0.0000100 epoch_Time:25756.0min: [2024-01-05 01:47:28,950][model8_pretrain.py][INFO] Epoch:[0/2](530600/4588595) loss:2.750 lr:0.0000100 epoch_Time:25756.0min: [2024-01-05 01:47:28,950][model8_pretrain.py][INFO] Epoch:[0/2](530600/4588595) loss:3.248 lr:0.0000100 epoch_Time:25756.0min: [2024-01-05 01:47:28,951][model8_pretrain.py][INFO] Epoch:[0/2](530600/4588595) loss:2.924 lr:0.0000100 epoch_Time:25756.0min: [2024-01-05 01:48:05,883][model8_pretrain.py][INFO] Epoch:[0/2](530700/4588595) loss:2.927 lr:0.0000100 epoch_Time:25755.0min: [2024-01-05 01:48:05,883][model8_pretrain.py][INFO] Epoch:[0/2](530700/4588595) loss:2.665 lr:0.0000100 epoch_Time:25755.0min: [2024-01-05 01:48:05,883][model8_pretrain.py][INFO] 
Epoch:[0/2](530700/4588595) loss:2.596 lr:0.0000100 epoch_Time:25755.0min: [2024-01-05 01:48:05,883][model8_pretrain.py][INFO] Epoch:[0/2](530700/4588595) loss:3.102 lr:0.0000100 epoch_Time:25755.0min: [2024-01-05 01:48:05,883][model8_pretrain.py][INFO] Epoch:[0/2](530700/4588595) loss:2.918 lr:0.0000100 epoch_Time:25755.0min: [2024-01-05 01:48:05,883][model8_pretrain.py][INFO] Epoch:[0/2](530700/4588595) loss:3.096 lr:0.0000100 epoch_Time:25755.0min: [2024-01-05 01:48:05,883][model8_pretrain.py][INFO] Epoch:[0/2](530700/4588595) loss:2.799 lr:0.0000100 epoch_Time:25755.0min: [2024-01-05 01:48:05,883][model8_pretrain.py][INFO] Epoch:[0/2](530700/4588595) loss:3.266 lr:0.0000100 epoch_Time:25755.0min: [2024-01-05 01:48:42,824][model8_pretrain.py][INFO] Epoch:[0/2](530800/4588595) loss:2.899 lr:0.0000100 epoch_Time:25755.0min: [2024-01-05 01:48:42,824][model8_pretrain.py][INFO] Epoch:[0/2](530800/4588595) loss:2.772 lr:0.0000100 epoch_Time:25755.0min: [2024-01-05 01:48:42,824][model8_pretrain.py][INFO] Epoch:[0/2](530800/4588595) loss:2.908 lr:0.0000100 epoch_Time:25755.0min: [2024-01-05 01:48:42,824][model8_pretrain.py][INFO] Epoch:[0/2](530800/4588595) loss:2.760 lr:0.0000100 epoch_Time:25755.0min: [2024-01-05 01:48:42,824][model8_pretrain.py][INFO] Epoch:[0/2](530800/4588595) loss:3.184 lr:0.0000100 epoch_Time:25755.0min: [2024-01-05 01:48:42,824][model8_pretrain.py][INFO] Epoch:[0/2](530800/4588595) loss:2.932 lr:0.0000100 epoch_Time:25755.0min: [2024-01-05 01:48:42,824][model8_pretrain.py][INFO] Epoch:[0/2](530800/4588595) loss:2.739 lr:0.0000100 epoch_Time:25755.0min: [2024-01-05 01:48:42,824][model8_pretrain.py][INFO] Epoch:[0/2](530800/4588595) loss:2.949 lr:0.0000100 epoch_Time:25755.0min: [2024-01-05 01:49:19,742][model8_pretrain.py][INFO] Epoch:[0/2](530900/4588595) loss:2.199 lr:0.0000100 epoch_Time:25754.0min: [2024-01-05 01:49:19,742][model8_pretrain.py][INFO] Epoch:[0/2](530900/4588595) loss:2.495 lr:0.0000100 epoch_Time:25754.0min: [2024-01-05 
01:49:19,742][model8_pretrain.py][INFO] Epoch:[0/2](530900/4588595) loss:2.952 lr:0.0000100 epoch_Time:25754.0min: [2024-01-05 01:49:19,742][model8_pretrain.py][INFO] Epoch:[0/2](530900/4588595) loss:2.289 lr:0.0000100 epoch_Time:25754.0min: [2024-01-05 01:49:19,742][model8_pretrain.py][INFO] Epoch:[0/2](530900/4588595) loss:2.588 lr:0.0000100 epoch_Time:25754.0min: [2024-01-05 01:49:19,742][model8_pretrain.py][INFO] Epoch:[0/2](530900/4588595) loss:1.839 lr:0.0000100 epoch_Time:25754.0min: [2024-01-05 01:49:19,742][model8_pretrain.py][INFO] Epoch:[0/2](530900/4588595) loss:2.994 lr:0.0000100 epoch_Time:25754.0min: [2024-01-05 01:49:19,742][model8_pretrain.py][INFO] Epoch:[0/2](530900/4588595) loss:3.021 lr:0.0000100 epoch_Time:25754.0min: [2024-01-05 01:50:03,254][model8_pretrain.py][INFO] Epoch:[0/2](531000/4588595) loss:2.928 lr:0.0000100 epoch_Time:25753.0min: [2024-01-05 01:50:03,254][model8_pretrain.py][INFO] Epoch:[0/2](531000/4588595) loss:2.760 lr:0.0000100 epoch_Time:25753.0min: [2024-01-05 01:50:03,254][model8_pretrain.py][INFO] Epoch:[0/2](531000/4588595) loss:3.678 lr:0.0000100 epoch_Time:25753.0min: [2024-01-05 01:50:03,254][model8_pretrain.py][INFO] Epoch:[0/2](531000/4588595) loss:2.860 lr:0.0000100 epoch_Time:25753.0min: [2024-01-05 01:50:03,254][model8_pretrain.py][INFO] Epoch:[0/2](531000/4588595) loss:3.214 lr:0.0000100 epoch_Time:25753.0min: [2024-01-05 01:50:03,254][model8_pretrain.py][INFO] Epoch:[0/2](531000/4588595) loss:2.996 lr:0.0000100 epoch_Time:25753.0min: [2024-01-05 01:50:03,254][model8_pretrain.py][INFO] Epoch:[0/2](531000/4588595) loss:2.845 lr:0.0000100 epoch_Time:25753.0min: [2024-01-05 01:50:03,254][model8_pretrain.py][INFO] Epoch:[0/2](531000/4588595) loss:3.213 lr:0.0000100 epoch_Time:25753.0min: [2024-01-05 01:50:45,335][model8_pretrain.py][INFO] Epoch:[0/2](531100/4588595) loss:3.215 lr:0.0000100 epoch_Time:25754.0min: [2024-01-05 01:50:45,335][model8_pretrain.py][INFO] Epoch:[0/2](531100/4588595) loss:3.164 lr:0.0000100 
epoch_Time:25754.0min: [2024-01-05 01:50:45,335][model8_pretrain.py][INFO] Epoch:[0/2](531100/4588595) loss:2.938 lr:0.0000100 epoch_Time:25754.0min: [2024-01-05 01:50:45,335][model8_pretrain.py][INFO] Epoch:[0/2](531100/4588595) loss:2.995 lr:0.0000100 epoch_Time:25754.0min: [2024-01-05 01:50:45,336][model8_pretrain.py][INFO] Epoch:[0/2](531100/4588595) loss:3.510 lr:0.0000100 epoch_Time:25754.0min: [2024-01-05 01:50:45,336][model8_pretrain.py][INFO] Epoch:[0/2](531100/4588595) loss:2.846 lr:0.0000100 epoch_Time:25754.0min: [2024-01-05 01:50:45,336][model8_pretrain.py][INFO] Epoch:[0/2](531100/4588595) loss:2.956 lr:0.0000100 epoch_Time:25754.0min: [2024-01-05 01:50:45,336][model8_pretrain.py][INFO] Epoch:[0/2](531100/4588595) loss:2.841 lr:0.0000100 epoch_Time:25754.0min: [2024-01-05 01:51:22,263][model8_pretrain.py][INFO] Epoch:[0/2](531200/4588595) loss:2.972 lr:0.0000100 epoch_Time:25753.0min: [2024-01-05 01:51:22,263][model8_pretrain.py][INFO] Epoch:[0/2](531200/4588595) loss:2.617 lr:0.0000100 epoch_Time:25753.0min: [2024-01-05 01:51:22,263][model8_pretrain.py][INFO] Epoch:[0/2](531200/4588595) loss:3.081 lr:0.0000100 epoch_Time:25753.0min: [2024-01-05 01:51:22,263][model8_pretrain.py][INFO] Epoch:[0/2](531200/4588595) loss:2.193 lr:0.0000100 epoch_Time:25753.0min: [2024-01-05 01:51:22,263][model8_pretrain.py][INFO] Epoch:[0/2](531200/4588595) loss:2.308 lr:0.0000100 epoch_Time:25753.0min: [2024-01-05 01:51:22,263][model8_pretrain.py][INFO] Epoch:[0/2](531200/4588595) loss:2.670 lr:0.0000100 epoch_Time:25753.0min: [2024-01-05 01:51:22,263][model8_pretrain.py][INFO] Epoch:[0/2](531200/4588595) loss:2.746 lr:0.0000100 epoch_Time:25753.0min: [2024-01-05 01:51:22,263][model8_pretrain.py][INFO] Epoch:[0/2](531200/4588595) loss:2.681 lr:0.0000100 epoch_Time:25753.0min: [2024-01-05 01:51:59,196][model8_pretrain.py][INFO] Epoch:[0/2](531300/4588595) loss:2.965 lr:0.0000100 epoch_Time:25752.0min: [2024-01-05 01:51:59,196][model8_pretrain.py][INFO] 
Epoch:[0/2](531300/4588595) loss:2.542 lr:0.0000100 epoch_Time:25752.0min: [2024-01-05 01:51:59,196][model8_pretrain.py][INFO] Epoch:[0/2](531300/4588595) loss:2.818 lr:0.0000100 epoch_Time:25752.0min: [2024-01-05 01:51:59,196][model8_pretrain.py][INFO] Epoch:[0/2](531300/4588595) loss:2.792 lr:0.0000100 epoch_Time:25752.0min: [2024-01-05 01:51:59,196][model8_pretrain.py][INFO] Epoch:[0/2](531300/4588595) loss:2.734 lr:0.0000100 epoch_Time:25752.0min: [2024-01-05 01:51:59,196][model8_pretrain.py][INFO] Epoch:[0/2](531300/4588595) loss:3.243 lr:0.0000100 epoch_Time:25752.0min: [2024-01-05 01:51:59,196][model8_pretrain.py][INFO] Epoch:[0/2](531300/4588595) loss:2.974 lr:0.0000100 epoch_Time:25752.0min: [2024-01-05 01:51:59,196][model8_pretrain.py][INFO] Epoch:[0/2](531300/4588595) loss:2.787 lr:0.0000100 epoch_Time:25752.0min: [2024-01-05 01:52:36,131][model8_pretrain.py][INFO] Epoch:[0/2](531400/4588595) loss:2.450 lr:0.0000100 epoch_Time:25751.0min: [2024-01-05 01:52:36,131][model8_pretrain.py][INFO] Epoch:[0/2](531400/4588595) loss:2.550 lr:0.0000100 epoch_Time:25751.0min: [2024-01-05 01:52:36,131][model8_pretrain.py][INFO] Epoch:[0/2](531400/4588595) loss:2.905 lr:0.0000100 epoch_Time:25751.0min: [2024-01-05 01:52:36,131][model8_pretrain.py][INFO] Epoch:[0/2](531400/4588595) loss:2.901 lr:0.0000100 epoch_Time:25751.0min: [2024-01-05 01:52:36,131][model8_pretrain.py][INFO] Epoch:[0/2](531400/4588595) loss:2.822 lr:0.0000100 epoch_Time:25751.0min: [2024-01-05 01:52:36,131][model8_pretrain.py][INFO] Epoch:[0/2](531400/4588595) loss:2.542 lr:0.0000100 epoch_Time:25751.0min: [2024-01-05 01:52:36,131][model8_pretrain.py][INFO] Epoch:[0/2](531400/4588595) loss:2.853 lr:0.0000100 epoch_Time:25751.0min: [2024-01-05 01:52:36,131][model8_pretrain.py][INFO] Epoch:[0/2](531400/4588595) loss:2.655 lr:0.0000100 epoch_Time:25751.0min: [2024-01-05 01:53:13,063][model8_pretrain.py][INFO] Epoch:[0/2](531500/4588595) loss:3.169 lr:0.0000100 epoch_Time:25750.0min: [2024-01-05 
01:53:13,063][model8_pretrain.py][INFO] Epoch:[0/2](531500/4588595) loss:2.740 lr:0.0000100 epoch_Time:25750.0min: [2024-01-05 01:53:13,063][model8_pretrain.py][INFO] Epoch:[0/2](531500/4588595) loss:2.874 lr:0.0000100 epoch_Time:25750.0min: [2024-01-05 01:53:13,063][model8_pretrain.py][INFO] Epoch:[0/2](531500/4588595) loss:2.973 lr:0.0000100 epoch_Time:25750.0min: [2024-01-05 01:53:13,063][model8_pretrain.py][INFO] Epoch:[0/2](531500/4588595) loss:2.421 lr:0.0000100 epoch_Time:25750.0min: [2024-01-05 01:53:13,063][model8_pretrain.py][INFO] Epoch:[0/2](531500/4588595) loss:2.726 lr:0.0000100 epoch_Time:25750.0min: [2024-01-05 01:53:13,063][model8_pretrain.py][INFO] Epoch:[0/2](531500/4588595) loss:2.532 lr:0.0000100 epoch_Time:25750.0min: [2024-01-05 01:53:13,063][model8_pretrain.py][INFO] Epoch:[0/2](531500/4588595) loss:2.395 lr:0.0000100 epoch_Time:25750.0min: [2024-01-05 01:53:49,999][model8_pretrain.py][INFO] Epoch:[0/2](531600/4588595) loss:2.495 lr:0.0000100 epoch_Time:25749.0min: [2024-01-05 01:53:49,999][model8_pretrain.py][INFO] Epoch:[0/2](531600/4588595) loss:2.594 lr:0.0000100 epoch_Time:25749.0min: [2024-01-05 01:53:49,999][model8_pretrain.py][INFO] Epoch:[0/2](531600/4588595) loss:3.199 lr:0.0000100 epoch_Time:25749.0min: [2024-01-05 01:53:49,999][model8_pretrain.py][INFO] Epoch:[0/2](531600/4588595) loss:2.773 lr:0.0000100 epoch_Time:25749.0min: [2024-01-05 01:53:49,999][model8_pretrain.py][INFO] Epoch:[0/2](531600/4588595) loss:2.411 lr:0.0000100 epoch_Time:25749.0min: [2024-01-05 01:53:49,999][model8_pretrain.py][INFO] Epoch:[0/2](531600/4588595) loss:2.673 lr:0.0000100 epoch_Time:25749.0min: [2024-01-05 01:53:49,999][model8_pretrain.py][INFO] Epoch:[0/2](531600/4588595) loss:3.297 lr:0.0000100 epoch_Time:25749.0min: [2024-01-05 01:53:49,999][model8_pretrain.py][INFO] Epoch:[0/2](531600/4588595) loss:3.184 lr:0.0000100 epoch_Time:25749.0min: [2024-01-05 01:54:26,950][model8_pretrain.py][INFO] Epoch:[0/2](531700/4588595) loss:3.347 lr:0.0000100 
epoch_Time:25749.0min: [2024-01-05 01:54:26,950][model8_pretrain.py][INFO] Epoch:[0/2](531700/4588595) loss:3.226 lr:0.0000100 epoch_Time:25749.0min: [2024-01-05 01:54:26,950][model8_pretrain.py][INFO] Epoch:[0/2](531700/4588595) loss:2.526 lr:0.0000100 epoch_Time:25749.0min: [2024-01-05 01:54:26,950][model8_pretrain.py][INFO] Epoch:[0/2](531700/4588595) loss:2.842 lr:0.0000100 epoch_Time:25749.0min: [2024-01-05 01:54:26,950][model8_pretrain.py][INFO] Epoch:[0/2](531700/4588595) loss:2.568 lr:0.0000100 epoch_Time:25749.0min: [2024-01-05 01:54:26,950][model8_pretrain.py][INFO] Epoch:[0/2](531700/4588595) loss:2.922 lr:0.0000100 epoch_Time:25749.0min: [2024-01-05 01:54:26,950][model8_pretrain.py][INFO] Epoch:[0/2](531700/4588595) loss:2.854 lr:0.0000100 epoch_Time:25749.0min: [2024-01-05 01:54:26,950][model8_pretrain.py][INFO] Epoch:[0/2](531700/4588595) loss:3.222 lr:0.0000100 epoch_Time:25749.0min: [2024-01-05 01:55:10,236][model8_pretrain.py][INFO] Epoch:[0/2](531800/4588595) loss:3.145 lr:0.0000100 epoch_Time:25749.0min: [2024-01-05 01:55:10,236][model8_pretrain.py][INFO] Epoch:[0/2](531800/4588595) loss:2.552 lr:0.0000100 epoch_Time:25749.0min: [2024-01-05 01:55:10,236][model8_pretrain.py][INFO] Epoch:[0/2](531800/4588595) loss:3.439 lr:0.0000100 epoch_Time:25749.0min: [2024-01-05 01:55:10,236][model8_pretrain.py][INFO] Epoch:[0/2](531800/4588595) loss:3.049 lr:0.0000100 epoch_Time:25749.0min: [2024-01-05 01:55:10,236][model8_pretrain.py][INFO] Epoch:[0/2](531800/4588595) loss:2.347 lr:0.0000100 epoch_Time:25749.0min: [2024-01-05 01:55:10,236][model8_pretrain.py][INFO] Epoch:[0/2](531800/4588595) loss:2.904 lr:0.0000100 epoch_Time:25749.0min: [2024-01-05 01:55:10,236][model8_pretrain.py][INFO] Epoch:[0/2](531800/4588595) loss:2.781 lr:0.0000100 epoch_Time:25749.0min: [2024-01-05 01:55:10,236][model8_pretrain.py][INFO] Epoch:[0/2](531800/4588595) loss:2.523 lr:0.0000100 epoch_Time:25749.0min: [2024-01-05 01:55:52,284][model8_pretrain.py][INFO] 
Epoch:[0/2](531900/4588595) loss:2.896 lr:0.0000100 epoch_Time:25748.0min: [2024-01-05 01:55:52,284][model8_pretrain.py][INFO] Epoch:[0/2](531900/4588595) loss:2.607 lr:0.0000100 epoch_Time:25748.0min: [2024-01-05 01:55:52,284][model8_pretrain.py][INFO] Epoch:[0/2](531900/4588595) loss:2.436 lr:0.0000100 epoch_Time:25748.0min: [2024-01-05 01:55:52,284][model8_pretrain.py][INFO] Epoch:[0/2](531900/4588595) loss:2.532 lr:0.0000100 epoch_Time:25748.0min: [2024-01-05 01:55:52,284][model8_pretrain.py][INFO] Epoch:[0/2](531900/4588595) loss:3.095 lr:0.0000100 epoch_Time:25748.0min: [2024-01-05 01:55:52,285][model8_pretrain.py][INFO] Epoch:[0/2](531900/4588595) loss:1.779 lr:0.0000100 epoch_Time:25748.0min: [2024-01-05 01:55:52,285][model8_pretrain.py][INFO] Epoch:[0/2](531900/4588595) loss:2.724 lr:0.0000100 epoch_Time:25748.0min: [2024-01-05 01:55:52,285][model8_pretrain.py][INFO] Epoch:[0/2](531900/4588595) loss:2.468 lr:0.0000100 epoch_Time:25748.0min: [2024-01-05 01:56:29,235][model8_pretrain.py][INFO] Epoch:[0/2](532000/4588595) loss:2.947 lr:0.0000100 epoch_Time:25748.0min: [2024-01-05 01:56:29,235][model8_pretrain.py][INFO] Epoch:[0/2](532000/4588595) loss:3.115 lr:0.0000100 epoch_Time:25748.0min: [2024-01-05 01:56:29,235][model8_pretrain.py][INFO] Epoch:[0/2](532000/4588595) loss:2.961 lr:0.0000100 epoch_Time:25748.0min: [2024-01-05 01:56:29,235][model8_pretrain.py][INFO] Epoch:[0/2](532000/4588595) loss:2.910 lr:0.0000100 epoch_Time:25748.0min: [2024-01-05 01:56:29,235][model8_pretrain.py][INFO] Epoch:[0/2](532000/4588595) loss:2.153 lr:0.0000100 epoch_Time:25748.0min: [2024-01-05 01:56:29,235][model8_pretrain.py][INFO] Epoch:[0/2](532000/4588595) loss:2.983 lr:0.0000100 epoch_Time:25748.0min: [2024-01-05 01:56:29,235][model8_pretrain.py][INFO] Epoch:[0/2](532000/4588595) loss:2.594 lr:0.0000100 epoch_Time:25748.0min: [2024-01-05 01:56:29,235][model8_pretrain.py][INFO] Epoch:[0/2](532000/4588595) loss:2.314 lr:0.0000100 epoch_Time:25748.0min: [2024-01-05 
01:57:06,184][model8_pretrain.py][INFO] Epoch:[0/2](532100/4588595) loss:2.710 lr:0.0000100 epoch_Time:25747.0min: [2024-01-05 01:57:06,184][model8_pretrain.py][INFO] Epoch:[0/2](532100/4588595) loss:2.588 lr:0.0000100 epoch_Time:25747.0min: [2024-01-05 01:57:06,184][model8_pretrain.py][INFO] Epoch:[0/2](532100/4588595) loss:2.432 lr:0.0000100 epoch_Time:25747.0min: [2024-01-05 01:57:06,184][model8_pretrain.py][INFO] Epoch:[0/2](532100/4588595) loss:2.658 lr:0.0000100 epoch_Time:25747.0min: [2024-01-05 01:57:06,184][model8_pretrain.py][INFO] Epoch:[0/2](532100/4588595) loss:2.876 lr:0.0000100 epoch_Time:25747.0min: [2024-01-05 01:57:06,184][model8_pretrain.py][INFO] Epoch:[0/2](532100/4588595) loss:2.744 lr:0.0000100 epoch_Time:25747.0min: [2024-01-05 01:57:06,184][model8_pretrain.py][INFO] Epoch:[0/2](532100/4588595) loss:2.977 lr:0.0000100 epoch_Time:25747.0min: [2024-01-05 01:57:06,184][model8_pretrain.py][INFO] Epoch:[0/2](532100/4588595) loss:2.544 lr:0.0000100 epoch_Time:25747.0min: [2024-01-05 01:57:43,130][model8_pretrain.py][INFO] Epoch:[0/2](532200/4588595) loss:2.541 lr:0.0000100 epoch_Time:25747.0min: [2024-01-05 01:57:43,130][model8_pretrain.py][INFO] Epoch:[0/2](532200/4588595) loss:2.804 lr:0.0000100 epoch_Time:25747.0min: [2024-01-05 01:57:43,130][model8_pretrain.py][INFO] Epoch:[0/2](532200/4588595) loss:2.620 lr:0.0000100 epoch_Time:25747.0min: [2024-01-05 01:57:43,130][model8_pretrain.py][INFO] Epoch:[0/2](532200/4588595) loss:2.551 lr:0.0000100 epoch_Time:25747.0min: [2024-01-05 01:57:43,130][model8_pretrain.py][INFO] Epoch:[0/2](532200/4588595) loss:2.533 lr:0.0000100 epoch_Time:25747.0min: [2024-01-05 01:57:43,130][model8_pretrain.py][INFO] Epoch:[0/2](532200/4588595) loss:2.867 lr:0.0000100 epoch_Time:25747.0min: [2024-01-05 01:57:43,130][model8_pretrain.py][INFO] Epoch:[0/2](532200/4588595) loss:2.894 lr:0.0000100 epoch_Time:25747.0min: [2024-01-05 01:57:43,130][model8_pretrain.py][INFO] Epoch:[0/2](532200/4588595) loss:3.008 lr:0.0000100 
epoch_Time:25747.0min: [2024-01-05 01:58:20,099][model8_pretrain.py][INFO] Epoch:[0/2](532300/4588595) loss:2.825 lr:0.0000100 epoch_Time:25746.0min: [2024-01-05 01:58:20,100][model8_pretrain.py][INFO] Epoch:[0/2](532300/4588595) loss:3.220 lr:0.0000100 epoch_Time:25746.0min: [2024-01-05 01:58:20,100][model8_pretrain.py][INFO] Epoch:[0/2](532300/4588595) loss:2.575 lr:0.0000100 epoch_Time:25746.0min: [2024-01-05 01:58:20,100][model8_pretrain.py][INFO] Epoch:[0/2](532300/4588595) loss:2.703 lr:0.0000100 epoch_Time:25746.0min: [2024-01-05 01:58:20,100][model8_pretrain.py][INFO] Epoch:[0/2](532300/4588595) loss:3.117 lr:0.0000100 epoch_Time:25746.0min: [2024-01-05 01:58:20,100][model8_pretrain.py][INFO] Epoch:[0/2](532300/4588595) loss:2.508 lr:0.0000100 epoch_Time:25746.0min: [2024-01-05 01:58:20,100][model8_pretrain.py][INFO] Epoch:[0/2](532300/4588595) loss:2.781 lr:0.0000100 epoch_Time:25746.0min: [2024-01-05 01:58:20,100][model8_pretrain.py][INFO] Epoch:[0/2](532300/4588595) loss:2.908 lr:0.0000100 epoch_Time:25746.0min: [2024-01-05 01:58:57,048][model8_pretrain.py][INFO] Epoch:[0/2](532400/4588595) loss:2.973 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 01:58:57,048][model8_pretrain.py][INFO] Epoch:[0/2](532400/4588595) loss:2.683 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 01:58:57,048][model8_pretrain.py][INFO] Epoch:[0/2](532400/4588595) loss:3.129 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 01:58:57,048][model8_pretrain.py][INFO] Epoch:[0/2](532400/4588595) loss:3.133 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 01:58:57,048][model8_pretrain.py][INFO] Epoch:[0/2](532400/4588595) loss:3.499 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 01:58:57,048][model8_pretrain.py][INFO] Epoch:[0/2](532400/4588595) loss:2.363 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 01:58:57,048][model8_pretrain.py][INFO] Epoch:[0/2](532400/4588595) loss:3.025 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 01:58:57,048][model8_pretrain.py][INFO] 
Epoch:[0/2](532400/4588595) loss:2.535 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 01:59:33,995][model8_pretrain.py][INFO] Epoch:[0/2](532500/4588595) loss:2.131 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 01:59:33,995][model8_pretrain.py][INFO] Epoch:[0/2](532500/4588595) loss:2.999 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 01:59:33,995][model8_pretrain.py][INFO] Epoch:[0/2](532500/4588595) loss:2.165 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 01:59:33,995][model8_pretrain.py][INFO] Epoch:[0/2](532500/4588595) loss:2.505 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 01:59:33,995][model8_pretrain.py][INFO] Epoch:[0/2](532500/4588595) loss:3.061 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 01:59:33,995][model8_pretrain.py][INFO] Epoch:[0/2](532500/4588595) loss:2.641 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 01:59:33,995][model8_pretrain.py][INFO] Epoch:[0/2](532500/4588595) loss:3.177 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 01:59:33,995][model8_pretrain.py][INFO] Epoch:[0/2](532500/4588595) loss:2.584 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 02:00:15,783][model8_pretrain.py][INFO] Epoch:[0/2](532600/4588595) loss:3.037 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 02:00:15,783][model8_pretrain.py][INFO] Epoch:[0/2](532600/4588595) loss:3.110 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 02:00:15,787][model8_pretrain.py][INFO] Epoch:[0/2](532600/4588595) loss:2.807 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 02:00:15,788][model8_pretrain.py][INFO] Epoch:[0/2](532600/4588595) loss:2.725 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 02:00:15,788][model8_pretrain.py][INFO] Epoch:[0/2](532600/4588595) loss:3.371 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 02:00:15,788][model8_pretrain.py][INFO] Epoch:[0/2](532600/4588595) loss:2.985 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 02:00:16,017][model8_pretrain.py][INFO] Epoch:[0/2](532600/4588595) loss:3.401 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 
02:00:17,337][model8_pretrain.py][INFO] Epoch:[0/2](532600/4588595) loss:2.873 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 02:00:59,482][model8_pretrain.py][INFO] Epoch:[0/2](532700/4588595) loss:2.953 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 02:00:59,482][model8_pretrain.py][INFO] Epoch:[0/2](532700/4588595) loss:3.008 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 02:00:59,482][model8_pretrain.py][INFO] Epoch:[0/2](532700/4588595) loss:3.318 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 02:00:59,482][model8_pretrain.py][INFO] Epoch:[0/2](532700/4588595) loss:2.834 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 02:00:59,482][model8_pretrain.py][INFO] Epoch:[0/2](532700/4588595) loss:2.523 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 02:00:59,482][model8_pretrain.py][INFO] Epoch:[0/2](532700/4588595) loss:2.801 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 02:00:59,482][model8_pretrain.py][INFO] Epoch:[0/2](532700/4588595) loss:2.589 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 02:00:59,482][model8_pretrain.py][INFO] Epoch:[0/2](532700/4588595) loss:2.154 lr:0.0000100 epoch_Time:25744.0min: [2024-01-05 02:01:36,424][model8_pretrain.py][INFO] Epoch:[0/2](532800/4588595) loss:3.296 lr:0.0000100 epoch_Time:25743.0min: [2024-01-05 02:01:36,424][model8_pretrain.py][INFO] Epoch:[0/2](532800/4588595) loss:2.328 lr:0.0000100 epoch_Time:25743.0min: [2024-01-05 02:01:36,424][model8_pretrain.py][INFO] Epoch:[0/2](532800/4588595) loss:2.285 lr:0.0000100 epoch_Time:25743.0min: [2024-01-05 02:01:36,424][model8_pretrain.py][INFO] Epoch:[0/2](532800/4588595) loss:2.238 lr:0.0000100 epoch_Time:25743.0min: [2024-01-05 02:01:36,424][model8_pretrain.py][INFO] Epoch:[0/2](532800/4588595) loss:3.122 lr:0.0000100 epoch_Time:25743.0min: [2024-01-05 02:01:36,424][model8_pretrain.py][INFO] Epoch:[0/2](532800/4588595) loss:3.385 lr:0.0000100 epoch_Time:25743.0min: [2024-01-05 02:01:36,424][model8_pretrain.py][INFO] Epoch:[0/2](532800/4588595) loss:2.388 lr:0.0000100 
epoch_Time:25743.0min: [2024-01-05 02:01:36,424][model8_pretrain.py][INFO] Epoch:[0/2](532800/4588595) loss:2.729 lr:0.0000100 epoch_Time:25743.0min: [2024-01-05 02:02:13,362][model8_pretrain.py][INFO] Epoch:[0/2](532900/4588595) loss:2.999 lr:0.0000100 epoch_Time:25742.0min: [2024-01-05 02:02:13,362][model8_pretrain.py][INFO] Epoch:[0/2](532900/4588595) loss:2.974 lr:0.0000100 epoch_Time:25742.0min: [2024-01-05 02:02:13,362][model8_pretrain.py][INFO] Epoch:[0/2](532900/4588595) loss:2.784 lr:0.0000100 epoch_Time:25742.0min: [2024-01-05 02:02:13,362][model8_pretrain.py][INFO] Epoch:[0/2](532900/4588595) loss:3.142 lr:0.0000100 epoch_Time:25742.0min: [2024-01-05 02:02:13,362][model8_pretrain.py][INFO] Epoch:[0/2](532900/4588595) loss:3.058 lr:0.0000100 epoch_Time:25742.0min: [2024-01-05 02:02:13,362][model8_pretrain.py][INFO] Epoch:[0/2](532900/4588595) loss:3.322 lr:0.0000100 epoch_Time:25742.0min: [2024-01-05 02:02:13,362][model8_pretrain.py][INFO] Epoch:[0/2](532900/4588595) loss:3.098 lr:0.0000100 epoch_Time:25742.0min: [2024-01-05 02:02:13,362][model8_pretrain.py][INFO] Epoch:[0/2](532900/4588595) loss:2.396 lr:0.0000100 epoch_Time:25742.0min: [2024-01-05 02:02:50,324][model8_pretrain.py][INFO] Epoch:[0/2](533000/4588595) loss:2.782 lr:0.0000100 epoch_Time:25741.0min: [2024-01-05 02:02:50,324][model8_pretrain.py][INFO] Epoch:[0/2](533000/4588595) loss:2.770 lr:0.0000100 epoch_Time:25741.0min: [2024-01-05 02:02:50,324][model8_pretrain.py][INFO] Epoch:[0/2](533000/4588595) loss:3.130 lr:0.0000100 epoch_Time:25741.0min: [2024-01-05 02:02:50,324][model8_pretrain.py][INFO] Epoch:[0/2](533000/4588595) loss:3.221 lr:0.0000100 epoch_Time:25741.0min: [2024-01-05 02:02:50,324][model8_pretrain.py][INFO] Epoch:[0/2](533000/4588595) loss:2.789 lr:0.0000100 epoch_Time:25741.0min: [2024-01-05 02:02:50,324][model8_pretrain.py][INFO] Epoch:[0/2](533000/4588595) loss:3.213 lr:0.0000100 epoch_Time:25741.0min: [2024-01-05 02:02:50,324][model8_pretrain.py][INFO] 
Epoch:[0/2](533000/4588595) loss:3.234 lr:0.0000100 epoch_Time:25741.0min: [2024-01-05 02:02:50,324][model8_pretrain.py][INFO] Epoch:[0/2](533000/4588595) loss:3.287 lr:0.0000100 epoch_Time:25741.0min: [2024-01-05 02:03:27,293][model8_pretrain.py][INFO] Epoch:[0/2](533100/4588595) loss:3.222 lr:0.0000100 epoch_Time:25741.0min: [2024-01-05 02:03:27,293][model8_pretrain.py][INFO] Epoch:[0/2](533100/4588595) loss:1.926 lr:0.0000100 epoch_Time:25741.0min: [2024-01-05 02:03:27,293][model8_pretrain.py][INFO] Epoch:[0/2](533100/4588595) loss:2.502 lr:0.0000100 epoch_Time:25741.0min: [2024-01-05 02:03:27,293][model8_pretrain.py][INFO] Epoch:[0/2](533100/4588595) loss:3.125 lr:0.0000100 epoch_Time:25741.0min: [2024-01-05 02:03:27,294][model8_pretrain.py][INFO] Epoch:[0/2](533100/4588595) loss:2.712 lr:0.0000100 epoch_Time:25741.0min: [2024-01-05 02:03:27,294][model8_pretrain.py][INFO] Epoch:[0/2](533100/4588595) loss:2.693 lr:0.0000100 epoch_Time:25741.0min: [2024-01-05 02:03:27,294][model8_pretrain.py][INFO] Epoch:[0/2](533100/4588595) loss:3.075 lr:0.0000100 epoch_Time:25741.0min: [2024-01-05 02:03:27,294][model8_pretrain.py][INFO] Epoch:[0/2](533100/4588595) loss:2.459 lr:0.0000100 epoch_Time:25741.0min: [2024-01-05 02:04:04,245][model8_pretrain.py][INFO] Epoch:[0/2](533200/4588595) loss:2.265 lr:0.0000100 epoch_Time:25740.0min: [2024-01-05 02:04:04,245][model8_pretrain.py][INFO] Epoch:[0/2](533200/4588595) loss:3.017 lr:0.0000100 epoch_Time:25740.0min: [2024-01-05 02:04:04,245][model8_pretrain.py][INFO] Epoch:[0/2](533200/4588595) loss:2.889 lr:0.0000100 epoch_Time:25740.0min: [2024-01-05 02:04:04,245][model8_pretrain.py][INFO] Epoch:[0/2](533200/4588595) loss:3.334 lr:0.0000100 epoch_Time:25740.0min: [2024-01-05 02:04:04,245][model8_pretrain.py][INFO] Epoch:[0/2](533200/4588595) loss:2.991 lr:0.0000100 epoch_Time:25740.0min: [2024-01-05 02:04:04,245][model8_pretrain.py][INFO] Epoch:[0/2](533200/4588595) loss:2.764 lr:0.0000100 epoch_Time:25740.0min: [2024-01-05 
02:04:04,245][model8_pretrain.py][INFO] Epoch:[0/2](533200/4588595) loss:2.433 lr:0.0000100 epoch_Time:25740.0min: [2024-01-05 02:04:04,245][model8_pretrain.py][INFO] Epoch:[0/2](533200/4588595) loss:2.357 lr:0.0000100 epoch_Time:25740.0min: [2024-01-05 02:04:41,184][model8_pretrain.py][INFO] Epoch:[0/2](533300/4588595) loss:2.715 lr:0.0000100 epoch_Time:25740.0min: [2024-01-05 02:04:41,184][model8_pretrain.py][INFO] Epoch:[0/2](533300/4588595) loss:2.238 lr:0.0000100 epoch_Time:25740.0min: [2024-01-05 02:04:41,184][model8_pretrain.py][INFO] Epoch:[0/2](533300/4588595) loss:2.508 lr:0.0000100 epoch_Time:25740.0min: [2024-01-05 02:04:41,184][model8_pretrain.py][INFO] Epoch:[0/2](533300/4588595) loss:2.667 lr:0.0000100 epoch_Time:25740.0min: [2024-01-05 02:04:41,184][model8_pretrain.py][INFO] Epoch:[0/2](533300/4588595) loss:2.517 lr:0.0000100 epoch_Time:25740.0min: [2024-01-05 02:04:41,184][model8_pretrain.py][INFO] Epoch:[0/2](533300/4588595) loss:2.319 lr:0.0000100 epoch_Time:25740.0min: [2024-01-05 02:04:41,184][model8_pretrain.py][INFO] Epoch:[0/2](533300/4588595) loss:2.905 lr:0.0000100 epoch_Time:25740.0min: [2024-01-05 02:04:41,185][model8_pretrain.py][INFO] Epoch:[0/2](533300/4588595) loss:2.941 lr:0.0000100 epoch_Time:25740.0min: [2024-01-05 02:05:19,851][model8_pretrain.py][INFO] Epoch:[0/2](533400/4588595) loss:3.029 lr:0.0000100 epoch_Time:25739.0min: [2024-01-05 02:05:19,851][model8_pretrain.py][INFO] Epoch:[0/2](533400/4588595) loss:2.548 lr:0.0000100 epoch_Time:25739.0min: [2024-01-05 02:05:19,852][model8_pretrain.py][INFO] Epoch:[0/2](533400/4588595) loss:2.824 lr:0.0000100 epoch_Time:25739.0min: [2024-01-05 02:05:19,852][model8_pretrain.py][INFO] Epoch:[0/2](533400/4588595) loss:2.991 lr:0.0000100 epoch_Time:25739.0min: [2024-01-05 02:05:19,852][model8_pretrain.py][INFO] Epoch:[0/2](533400/4588595) loss:2.711 lr:0.0000100 epoch_Time:25739.0min: [2024-01-05 02:05:19,852][model8_pretrain.py][INFO] Epoch:[0/2](533400/4588595) loss:2.971 lr:0.0000100 
epoch_Time:25739.0min: [2024-01-05 02:05:19,852][model8_pretrain.py][INFO] Epoch:[0/2](533400/4588595) loss:2.943 lr:0.0000100 epoch_Time:25739.0min: [2024-01-05 02:05:19,852][model8_pretrain.py][INFO] Epoch:[0/2](533400/4588595) loss:2.829 lr:0.0000100 epoch_Time:25739.0min: [2024-01-05 02:06:07,189][model8_pretrain.py][INFO] Epoch:[0/2](533500/4588595) loss:3.175 lr:0.0000100 epoch_Time:25739.0min: [2024-01-05 02:06:07,189][model8_pretrain.py][INFO] Epoch:[0/2](533500/4588595) loss:2.302 lr:0.0000100 epoch_Time:25739.0min: [2024-01-05 02:06:07,189][model8_pretrain.py][INFO] Epoch:[0/2](533500/4588595) loss:2.508 lr:0.0000100 epoch_Time:25739.0min: [2024-01-05 02:06:07,189][model8_pretrain.py][INFO] Epoch:[0/2](533500/4588595) loss:2.842 lr:0.0000100 epoch_Time:25739.0min: [2024-01-05 02:06:07,189][model8_pretrain.py][INFO] Epoch:[0/2](533500/4588595) loss:2.670 lr:0.0000100 epoch_Time:25739.0min: [2024-01-05 02:06:07,189][model8_pretrain.py][INFO] Epoch:[0/2](533500/4588595) loss:2.593 lr:0.0000100 epoch_Time:25739.0min: [2024-01-05 02:06:07,189][model8_pretrain.py][INFO] Epoch:[0/2](533500/4588595) loss:3.164 lr:0.0000100 epoch_Time:25739.0min: [2024-01-05 02:06:07,189][model8_pretrain.py][INFO] Epoch:[0/2](533500/4588595) loss:3.065 lr:0.0000100 epoch_Time:25739.0min: [2024-01-05 02:06:44,130][model8_pretrain.py][INFO] Epoch:[0/2](533600/4588595) loss:2.028 lr:0.0000100 epoch_Time:25739.0min: [2024-01-05 02:06:44,130][model8_pretrain.py][INFO] Epoch:[0/2](533600/4588595) loss:3.273 lr:0.0000100 epoch_Time:25739.0min: [2024-01-05 02:06:44,130][model8_pretrain.py][INFO] Epoch:[0/2](533600/4588595) loss:2.800 lr:0.0000100 epoch_Time:25739.0min: [2024-01-05 02:06:44,130][model8_pretrain.py][INFO] Epoch:[0/2](533600/4588595) loss:2.781 lr:0.0000100 epoch_Time:25739.0min: [2024-01-05 02:06:44,130][model8_pretrain.py][INFO] Epoch:[0/2](533600/4588595) loss:3.093 lr:0.0000100 epoch_Time:25739.0min: [2024-01-05 02:06:44,130][model8_pretrain.py][INFO] 
Epoch:[0/2](533600/4588595) loss:2.856 lr:0.0000100 epoch_Time:25739.0min: [2024-01-05 02:06:44,130][model8_pretrain.py][INFO] Epoch:[0/2](533600/4588595) loss:2.997 lr:0.0000100 epoch_Time:25739.0min: [2024-01-05 02:06:44,130][model8_pretrain.py][INFO] Epoch:[0/2](533600/4588595) loss:3.273 lr:0.0000100 epoch_Time:25739.0min: [2024-01-05 02:07:21,066][model8_pretrain.py][INFO] Epoch:[0/2](533700/4588595) loss:2.804 lr:0.0000100 epoch_Time:25738.0min: [2024-01-05 02:07:21,066][model8_pretrain.py][INFO] Epoch:[0/2](533700/4588595) loss:2.476 lr:0.0000100 epoch_Time:25738.0min: [2024-01-05 02:07:21,066][model8_pretrain.py][INFO] Epoch:[0/2](533700/4588595) loss:2.710 lr:0.0000100 epoch_Time:25738.0min: [2024-01-05 02:07:21,066][model8_pretrain.py][INFO] Epoch:[0/2](533700/4588595) loss:2.870 lr:0.0000100 epoch_Time:25738.0min: [2024-01-05 02:07:21,066][model8_pretrain.py][INFO] Epoch:[0/2](533700/4588595) loss:2.700 lr:0.0000100 epoch_Time:25738.0min: [2024-01-05 02:07:21,066][model8_pretrain.py][INFO] Epoch:[0/2](533700/4588595) loss:2.403 lr:0.0000100 epoch_Time:25738.0min: [2024-01-05 02:07:21,066][model8_pretrain.py][INFO] Epoch:[0/2](533700/4588595) loss:2.753 lr:0.0000100 epoch_Time:25738.0min: [2024-01-05 02:07:21,066][model8_pretrain.py][INFO] Epoch:[0/2](533700/4588595) loss:2.691 lr:0.0000100 epoch_Time:25738.0min: [2024-01-05 02:07:58,006][model8_pretrain.py][INFO] Epoch:[0/2](533800/4588595) loss:3.319 lr:0.0000100 epoch_Time:25737.0min: [2024-01-05 02:07:58,006][model8_pretrain.py][INFO] Epoch:[0/2](533800/4588595) loss:2.615 lr:0.0000100 epoch_Time:25737.0min: [2024-01-05 02:07:58,006][model8_pretrain.py][INFO] Epoch:[0/2](533800/4588595) loss:2.968 lr:0.0000100 epoch_Time:25737.0min: [2024-01-05 02:07:58,006][model8_pretrain.py][INFO] Epoch:[0/2](533800/4588595) loss:2.710 lr:0.0000100 epoch_Time:25737.0min: [2024-01-05 02:07:58,006][model8_pretrain.py][INFO] Epoch:[0/2](533800/4588595) loss:2.771 lr:0.0000100 epoch_Time:25737.0min: [2024-01-05 
02:07:58,006][model8_pretrain.py][INFO] Epoch:[0/2](533800/4588595) loss:2.787 lr:0.0000100 epoch_Time:25737.0min: [2024-01-05 02:07:58,006][model8_pretrain.py][INFO] Epoch:[0/2](533800/4588595) loss:2.846 lr:0.0000100 epoch_Time:25737.0min: [2024-01-05 02:07:58,006][model8_pretrain.py][INFO] Epoch:[0/2](533800/4588595) loss:2.749 lr:0.0000100 epoch_Time:25737.0min: [2024-01-05 02:08:34,928][model8_pretrain.py][INFO] Epoch:[0/2](533900/4588595) loss:3.121 lr:0.0000100 epoch_Time:25736.0min: [2024-01-05 02:08:34,928][model8_pretrain.py][INFO] Epoch:[0/2](533900/4588595) loss:2.829 lr:0.0000100 epoch_Time:25736.0min: [2024-01-05 02:08:34,928][model8_pretrain.py][INFO] Epoch:[0/2](533900/4588595) loss:3.075 lr:0.0000100 epoch_Time:25736.0min: [2024-01-05 02:08:34,928][model8_pretrain.py][INFO] Epoch:[0/2](533900/4588595) loss:3.132 lr:0.0000100 epoch_Time:25736.0min: [2024-01-05 02:08:34,928][model8_pretrain.py][INFO] Epoch:[0/2](533900/4588595) loss:3.003 lr:0.0000100 epoch_Time:25736.0min: [2024-01-05 02:08:34,928][model8_pretrain.py][INFO] Epoch:[0/2](533900/4588595) loss:3.019 lr:0.0000100 epoch_Time:25736.0min: [2024-01-05 02:08:34,928][model8_pretrain.py][INFO] Epoch:[0/2](533900/4588595) loss:2.894 lr:0.0000100 epoch_Time:25736.0min: [2024-01-05 02:08:34,928][model8_pretrain.py][INFO] Epoch:[0/2](533900/4588595) loss:3.014 lr:0.0000100 epoch_Time:25736.0min: [2024-01-05 02:09:11,857][model8_pretrain.py][INFO] Epoch:[0/2](534000/4588595) loss:3.171 lr:0.0000100 epoch_Time:25735.0min: [2024-01-05 02:09:11,857][model8_pretrain.py][INFO] Epoch:[0/2](534000/4588595) loss:2.357 lr:0.0000100 epoch_Time:25735.0min: [2024-01-05 02:09:11,857][model8_pretrain.py][INFO] Epoch:[0/2](534000/4588595) loss:3.401 lr:0.0000100 epoch_Time:25735.0min: [2024-01-05 02:09:11,857][model8_pretrain.py][INFO] Epoch:[0/2](534000/4588595) loss:2.787 lr:0.0000100 epoch_Time:25735.0min: [2024-01-05 02:09:11,857][model8_pretrain.py][INFO] Epoch:[0/2](534000/4588595) loss:2.739 lr:0.0000100 
epoch_Time:25735.0min: [2024-01-05 02:09:11,857][model8_pretrain.py][INFO] Epoch:[0/2](534000/4588595) loss:2.904 lr:0.0000100 epoch_Time:25735.0min: [2024-01-05 02:09:11,857][model8_pretrain.py][INFO] Epoch:[0/2](534000/4588595) loss:2.520 lr:0.0000100 epoch_Time:25735.0min: [2024-01-05 02:09:11,857][model8_pretrain.py][INFO] Epoch:[0/2](534000/4588595) loss:3.220 lr:0.0000100 epoch_Time:25735.0min: [2024-01-05 02:09:48,789][model8_pretrain.py][INFO] Epoch:[0/2](534100/4588595) loss:3.078 lr:0.0000100 epoch_Time:25734.0min: [2024-01-05 02:09:48,789][model8_pretrain.py][INFO] Epoch:[0/2](534100/4588595) loss:2.334 lr:0.0000100 epoch_Time:25734.0min: [2024-01-05 02:09:48,789][model8_pretrain.py][INFO] Epoch:[0/2](534100/4588595) loss:2.891 lr:0.0000100 epoch_Time:25734.0min: [2024-01-05 02:09:48,789][model8_pretrain.py][INFO] Epoch:[0/2](534100/4588595) loss:2.729 lr:0.0000100 epoch_Time:25734.0min: [2024-01-05 02:09:48,789][model8_pretrain.py][INFO] Epoch:[0/2](534100/4588595) loss:3.219 lr:0.0000100 epoch_Time:25734.0min: [2024-01-05 02:09:48,789][model8_pretrain.py][INFO] Epoch:[0/2](534100/4588595) loss:3.074 lr:0.0000100 epoch_Time:25734.0min: [2024-01-05 02:09:48,789][model8_pretrain.py][INFO] Epoch:[0/2](534100/4588595) loss:3.090 lr:0.0000100 epoch_Time:25734.0min: [2024-01-05 02:09:48,789][model8_pretrain.py][INFO] Epoch:[0/2](534100/4588595) loss:2.667 lr:0.0000100 epoch_Time:25734.0min: [2024-01-05 02:10:27,446][model8_pretrain.py][INFO] Epoch:[0/2](534200/4588595) loss:3.242 lr:0.0000100 epoch_Time:25734.0min: [2024-01-05 02:10:27,447][model8_pretrain.py][INFO] Epoch:[0/2](534200/4588595) loss:2.913 lr:0.0000100 epoch_Time:25734.0min: [2024-01-05 02:10:27,447][model8_pretrain.py][INFO] Epoch:[0/2](534200/4588595) loss:2.760 lr:0.0000100 epoch_Time:25734.0min: [2024-01-05 02:10:27,447][model8_pretrain.py][INFO] Epoch:[0/2](534200/4588595) loss:3.342 lr:0.0000100 epoch_Time:25734.0min: [2024-01-05 02:10:27,447][model8_pretrain.py][INFO] 
Epoch:[0/2](534200/4588595) loss:2.536 lr:0.0000100 epoch_Time:25734.0min: [2024-01-05 02:10:27,447][model8_pretrain.py][INFO] Epoch:[0/2](534200/4588595) loss:2.869 lr:0.0000100 epoch_Time:25734.0min: [2024-01-05 02:10:27,447][model8_pretrain.py][INFO] Epoch:[0/2](534200/4588595) loss:2.906 lr:0.0000100 epoch_Time:25734.0min: [2024-01-05 02:10:27,447][model8_pretrain.py][INFO] Epoch:[0/2](534200/4588595) loss:2.521 lr:0.0000100 epoch_Time:25734.0min: [2024-01-05 02:11:14,760][model8_pretrain.py][INFO] Epoch:[0/2](534300/4588595) loss:2.828 lr:0.0000100 epoch_Time:25734.0min: [2024-01-05 02:11:14,760][model8_pretrain.py][INFO] Epoch:[0/2](534300/4588595) loss:2.847 lr:0.0000100 epoch_Time:25734.0min: [2024-01-05 02:11:14,760][model8_pretrain.py][INFO] Epoch:[0/2](534300/4588595) loss:2.836 lr:0.0000100 epoch_Time:25734.0min: [2024-01-05 02:11:14,760][model8_pretrain.py][INFO] Epoch:[0/2](534300/4588595) loss:2.904 lr:0.0000100 epoch_Time:25734.0min: [2024-01-05 02:11:14,760][model8_pretrain.py][INFO] Epoch:[0/2](534300/4588595) loss:2.686 lr:0.0000100 epoch_Time:25734.0min: [2024-01-05 02:11:14,760][model8_pretrain.py][INFO] Epoch:[0/2](534300/4588595) loss:2.738 lr:0.0000100 epoch_Time:25734.0min: [2024-01-05 02:11:14,760][model8_pretrain.py][INFO] Epoch:[0/2](534300/4588595) loss:2.941 lr:0.0000100 epoch_Time:25734.0min: [2024-01-05 02:11:14,760][model8_pretrain.py][INFO] Epoch:[0/2](534300/4588595) loss:3.043 lr:0.0000100 epoch_Time:25734.0min: [2024-01-05 02:11:51,694][model8_pretrain.py][INFO] Epoch:[0/2](534400/4588595) loss:2.592 lr:0.0000100 epoch_Time:25733.0min: [2024-01-05 02:11:51,694][model8_pretrain.py][INFO] Epoch:[0/2](534400/4588595) loss:2.649 lr:0.0000100 epoch_Time:25733.0min: [2024-01-05 02:11:51,694][model8_pretrain.py][INFO] Epoch:[0/2](534400/4588595) loss:2.661 lr:0.0000100 epoch_Time:25733.0min: [2024-01-05 02:11:51,694][model8_pretrain.py][INFO] Epoch:[0/2](534400/4588595) loss:2.542 lr:0.0000100 epoch_Time:25733.0min: [2024-01-05 
02:11:51,695][model8_pretrain.py][INFO] Epoch:[0/2](534400/4588595) loss:3.057 lr:0.0000100 epoch_Time:25733.0min: [2024-01-05 02:11:51,695][model8_pretrain.py][INFO] Epoch:[0/2](534400/4588595) loss:2.481 lr:0.0000100 epoch_Time:25733.0min: [2024-01-05 02:11:51,695][model8_pretrain.py][INFO] Epoch:[0/2](534400/4588595) loss:2.966 lr:0.0000100 epoch_Time:25733.0min: [2024-01-05 02:11:51,695][model8_pretrain.py][INFO] Epoch:[0/2](534400/4588595) loss:2.256 lr:0.0000100 epoch_Time:25733.0min: [2024-01-05 02:12:28,632][model8_pretrain.py][INFO] Epoch:[0/2](534500/4588595) loss:2.853 lr:0.0000100 epoch_Time:25733.0min: [2024-01-05 02:12:28,632][model8_pretrain.py][INFO] Epoch:[0/2](534500/4588595) loss:3.143 lr:0.0000100 epoch_Time:25733.0min: [2024-01-05 02:12:28,632][model8_pretrain.py][INFO] Epoch:[0/2](534500/4588595) loss:3.336 lr:0.0000100 epoch_Time:25733.0min: [2024-01-05 02:12:28,632][model8_pretrain.py][INFO] Epoch:[0/2](534500/4588595) loss:2.700 lr:0.0000100 epoch_Time:25733.0min: [2024-01-05 02:12:28,632][model8_pretrain.py][INFO] Epoch:[0/2](534500/4588595) loss:2.934 lr:0.0000100 epoch_Time:25733.0min: [2024-01-05 02:12:28,632][model8_pretrain.py][INFO] Epoch:[0/2](534500/4588595) loss:3.500 lr:0.0000100 epoch_Time:25733.0min: [2024-01-05 02:12:28,632][model8_pretrain.py][INFO] Epoch:[0/2](534500/4588595) loss:2.677 lr:0.0000100 epoch_Time:25733.0min: [2024-01-05 02:12:28,632][model8_pretrain.py][INFO] Epoch:[0/2](534500/4588595) loss:2.278 lr:0.0000100 epoch_Time:25733.0min: [2024-01-05 02:13:05,567][model8_pretrain.py][INFO] Epoch:[0/2](534600/4588595) loss:2.940 lr:0.0000100 epoch_Time:25732.0min: [2024-01-05 02:13:05,567][model8_pretrain.py][INFO] Epoch:[0/2](534600/4588595) loss:2.615 lr:0.0000100 epoch_Time:25732.0min: [2024-01-05 02:13:05,567][model8_pretrain.py][INFO] Epoch:[0/2](534600/4588595) loss:2.872 lr:0.0000100 epoch_Time:25732.0min: [2024-01-05 02:13:05,567][model8_pretrain.py][INFO] Epoch:[0/2](534600/4588595) loss:3.067 lr:0.0000100 
epoch_Time:25732.0min: [2024-01-05 02:13:05,567][model8_pretrain.py][INFO] Epoch:[0/2](534600/4588595) loss:2.890 lr:0.0000100 epoch_Time:25732.0min: [2024-01-05 02:13:05,567][model8_pretrain.py][INFO] Epoch:[0/2](534600/4588595) loss:3.309 lr:0.0000100 epoch_Time:25732.0min: [2024-01-05 02:13:05,567][model8_pretrain.py][INFO] Epoch:[0/2](534600/4588595) loss:3.178 lr:0.0000100 epoch_Time:25732.0min: [2024-01-05 02:13:05,567][model8_pretrain.py][INFO] Epoch:[0/2](534600/4588595) loss:2.964 lr:0.0000100 epoch_Time:25732.0min: [2024-01-05 02:13:42,501][model8_pretrain.py][INFO] Epoch:[0/2](534700/4588595) loss:2.954 lr:0.0000100 epoch_Time:25732.0min: [2024-01-05 02:13:42,501][model8_pretrain.py][INFO] Epoch:[0/2](534700/4588595) loss:2.819 lr:0.0000100 epoch_Time:25732.0min: [2024-01-05 02:13:42,501][model8_pretrain.py][INFO] Epoch:[0/2](534700/4588595) loss:3.306 lr:0.0000100 epoch_Time:25732.0min: [2024-01-05 02:13:42,501][model8_pretrain.py][INFO] Epoch:[0/2](534700/4588595) loss:3.121 lr:0.0000100 epoch_Time:25732.0min: [2024-01-05 02:13:42,501][model8_pretrain.py][INFO] Epoch:[0/2](534700/4588595) loss:3.286 lr:0.0000100 epoch_Time:25732.0min: [2024-01-05 02:13:42,501][model8_pretrain.py][INFO] Epoch:[0/2](534700/4588595) loss:2.894 lr:0.0000100 epoch_Time:25732.0min: [2024-01-05 02:13:42,501][model8_pretrain.py][INFO] Epoch:[0/2](534700/4588595) loss:2.320 lr:0.0000100 epoch_Time:25732.0min: [2024-01-05 02:13:42,501][model8_pretrain.py][INFO] Epoch:[0/2](534700/4588595) loss:2.957 lr:0.0000100 epoch_Time:25732.0min: [2024-01-05 02:14:19,438][model8_pretrain.py][INFO] Epoch:[0/2](534800/4588595) loss:2.617 lr:0.0000100 epoch_Time:25731.0min: [2024-01-05 02:14:19,438][model8_pretrain.py][INFO] Epoch:[0/2](534800/4588595) loss:2.696 lr:0.0000100 epoch_Time:25731.0min: [2024-01-05 02:14:19,438][model8_pretrain.py][INFO] Epoch:[0/2](534800/4588595) loss:3.188 lr:0.0000100 epoch_Time:25731.0min: [2024-01-05 02:14:19,438][model8_pretrain.py][INFO] 
Epoch:[0/2](534800/4588595) loss:2.722 lr:0.0000100 epoch_Time:25731.0min: [2024-01-05 02:14:19,438][model8_pretrain.py][INFO] Epoch:[0/2](534800/4588595) loss:2.764 lr:0.0000100 epoch_Time:25731.0min: [2024-01-05 02:14:19,438][model8_pretrain.py][INFO] Epoch:[0/2](534800/4588595) loss:2.212 lr:0.0000100 epoch_Time:25731.0min: [2024-01-05 02:14:19,438][model8_pretrain.py][INFO] Epoch:[0/2](534800/4588595) loss:3.017 lr:0.0000100 epoch_Time:25731.0min: [2024-01-05 02:14:19,438][model8_pretrain.py][INFO] Epoch:[0/2](534800/4588595) loss:2.638 lr:0.0000100 epoch_Time:25731.0min: [2024-01-05 02:14:56,367][model8_pretrain.py][INFO] Epoch:[0/2](534900/4588595) loss:2.625 lr:0.0000100 epoch_Time:25729.0min: [2024-01-05 02:14:56,367][model8_pretrain.py][INFO] Epoch:[0/2](534900/4588595) loss:2.974 lr:0.0000100 epoch_Time:25729.0min: [2024-01-05 02:14:56,367][model8_pretrain.py][INFO] Epoch:[0/2](534900/4588595) loss:2.452 lr:0.0000100 epoch_Time:25729.0min: [2024-01-05 02:14:56,367][model8_pretrain.py][INFO] Epoch:[0/2](534900/4588595) loss:2.226 lr:0.0000100 epoch_Time:25729.0min: [2024-01-05 02:14:56,367][model8_pretrain.py][INFO] Epoch:[0/2](534900/4588595) loss:3.047 lr:0.0000100 epoch_Time:25729.0min: [2024-01-05 02:14:56,367][model8_pretrain.py][INFO] Epoch:[0/2](534900/4588595) loss:2.932 lr:0.0000100 epoch_Time:25729.0min: [2024-01-05 02:14:56,367][model8_pretrain.py][INFO] Epoch:[0/2](534900/4588595) loss:2.317 lr:0.0000100 epoch_Time:25729.0min: [2024-01-05 02:14:56,367][model8_pretrain.py][INFO] Epoch:[0/2](534900/4588595) loss:2.636 lr:0.0000100 epoch_Time:25729.0min: [2024-01-05 02:15:35,067][model8_pretrain.py][INFO] Epoch:[0/2](535000/4588595) loss:2.660 lr:0.0000100 epoch_Time:25730.0min: [2024-01-05 02:15:35,067][model8_pretrain.py][INFO] Epoch:[0/2](535000/4588595) loss:2.838 lr:0.0000100 epoch_Time:25730.0min: [2024-01-05 02:15:35,067][model8_pretrain.py][INFO] Epoch:[0/2](535000/4588595) loss:3.162 lr:0.0000100 epoch_Time:25730.0min: [2024-01-05 
02:15:35,067][model8_pretrain.py][INFO] Epoch:[0/2](535000/4588595) loss:2.835 lr:0.0000100 epoch_Time:25730.0min: [2024-01-05 02:15:35,067][model8_pretrain.py][INFO] Epoch:[0/2](535000/4588595) loss:2.153 lr:0.0000100 epoch_Time:25730.0min: [2024-01-05 02:15:35,067][model8_pretrain.py][INFO] Epoch:[0/2](535000/4588595) loss:2.982 lr:0.0000100 epoch_Time:25730.0min: [2024-01-05 02:15:35,067][model8_pretrain.py][INFO] Epoch:[0/2](535000/4588595) loss:2.657 lr:0.0000100 epoch_Time:25730.0min: [2024-01-05 02:15:35,067][model8_pretrain.py][INFO] Epoch:[0/2](535000/4588595) loss:2.605 lr:0.0000100 epoch_Time:25730.0min: [2024-01-05 02:16:22,433][model8_pretrain.py][INFO] Epoch:[0/2](535100/4588595) loss:3.127 lr:0.0000100 epoch_Time:25730.0min: [2024-01-05 02:16:22,433][model8_pretrain.py][INFO] Epoch:[0/2](535100/4588595) loss:2.387 lr:0.0000100 epoch_Time:25730.0min: [2024-01-05 02:16:22,433][model8_pretrain.py][INFO] Epoch:[0/2](535100/4588595) loss:2.837 lr:0.0000100 epoch_Time:25730.0min: [2024-01-05 02:16:22,433][model8_pretrain.py][INFO] Epoch:[0/2](535100/4588595) loss:2.219 lr:0.0000100 epoch_Time:25730.0min: [2024-01-05 02:16:22,433][model8_pretrain.py][INFO] Epoch:[0/2](535100/4588595) loss:2.941 lr:0.0000100 epoch_Time:25730.0min: [2024-01-05 02:16:22,433][model8_pretrain.py][INFO] Epoch:[0/2](535100/4588595) loss:2.952 lr:0.0000100 epoch_Time:25730.0min: [2024-01-05 02:16:22,433][model8_pretrain.py][INFO] Epoch:[0/2](535100/4588595) loss:2.349 lr:0.0000100 epoch_Time:25730.0min: [2024-01-05 02:16:22,433][model8_pretrain.py][INFO] Epoch:[0/2](535100/4588595) loss:2.597 lr:0.0000100 epoch_Time:25730.0min: [2024-01-05 02:16:59,364][model8_pretrain.py][INFO] Epoch:[0/2](535200/4588595) loss:2.802 lr:0.0000100 epoch_Time:25729.0min: [2024-01-05 02:16:59,364][model8_pretrain.py][INFO] Epoch:[0/2](535200/4588595) loss:2.924 lr:0.0000100 epoch_Time:25729.0min: [2024-01-05 02:16:59,364][model8_pretrain.py][INFO] Epoch:[0/2](535200/4588595) loss:2.821 lr:0.0000100 
epoch_Time:25729.0min: [2024-01-05 02:16:59,364][model8_pretrain.py][INFO] Epoch:[0/2](535200/4588595) loss:3.283 lr:0.0000100 epoch_Time:25729.0min: [2024-01-05 02:16:59,364][model8_pretrain.py][INFO] Epoch:[0/2](535200/4588595) loss:2.602 lr:0.0000100 epoch_Time:25729.0min: [2024-01-05 02:16:59,365][model8_pretrain.py][INFO] Epoch:[0/2](535200/4588595) loss:2.543 lr:0.0000100 epoch_Time:25729.0min: [2024-01-05 02:16:59,365][model8_pretrain.py][INFO] Epoch:[0/2](535200/4588595) loss:2.552 lr:0.0000100 epoch_Time:25729.0min: [2024-01-05 02:16:59,365][model8_pretrain.py][INFO] Epoch:[0/2](535200/4588595) loss:2.754 lr:0.0000100 epoch_Time:25729.0min: [2024-01-05 02:17:36,306][model8_pretrain.py][INFO] Epoch:[0/2](535300/4588595) loss:3.229 lr:0.0000100 epoch_Time:25729.0min: [2024-01-05 02:17:36,306][model8_pretrain.py][INFO] Epoch:[0/2](535300/4588595) loss:2.492 lr:0.0000100 epoch_Time:25729.0min: [2024-01-05 02:17:36,306][model8_pretrain.py][INFO] Epoch:[0/2](535300/4588595) loss:2.862 lr:0.0000100 epoch_Time:25729.0min: [2024-01-05 02:17:36,306][model8_pretrain.py][INFO] Epoch:[0/2](535300/4588595) loss:2.708 lr:0.0000100 epoch_Time:25729.0min: [2024-01-05 02:17:36,306][model8_pretrain.py][INFO] Epoch:[0/2](535300/4588595) loss:3.047 lr:0.0000100 epoch_Time:25729.0min: [2024-01-05 02:17:36,306][model8_pretrain.py][INFO] Epoch:[0/2](535300/4588595) loss:2.565 lr:0.0000100 epoch_Time:25729.0min: [2024-01-05 02:17:36,306][model8_pretrain.py][INFO] Epoch:[0/2](535300/4588595) loss:3.015 lr:0.0000100 epoch_Time:25729.0min: [2024-01-05 02:17:36,306][model8_pretrain.py][INFO] Epoch:[0/2](535300/4588595) loss:3.140 lr:0.0000100 epoch_Time:25729.0min: [2024-01-05 02:18:13,253][model8_pretrain.py][INFO] Epoch:[0/2](535400/4588595) loss:2.902 lr:0.0000100 epoch_Time:25727.0min: [2024-01-05 02:18:13,253][model8_pretrain.py][INFO] Epoch:[0/2](535400/4588595) loss:2.569 lr:0.0000100 epoch_Time:25727.0min: [2024-01-05 02:18:13,253][model8_pretrain.py][INFO] 
Epoch:[0/2](535400/4588595) loss:3.154 lr:0.0000100 epoch_Time:25727.0min: [2024-01-05 02:18:13,253][model8_pretrain.py][INFO] Epoch:[0/2](535400/4588595) loss:2.935 lr:0.0000100 epoch_Time:25727.0min: [2024-01-05 02:18:13,253][model8_pretrain.py][INFO] Epoch:[0/2](535400/4588595) loss:2.913 lr:0.0000100 epoch_Time:25727.0min: [2024-01-05 02:18:13,253][model8_pretrain.py][INFO] Epoch:[0/2](535400/4588595) loss:2.942 lr:0.0000100 epoch_Time:25727.0min: [2024-01-05 02:18:13,253][model8_pretrain.py][INFO] Epoch:[0/2](535400/4588595) loss:2.941 lr:0.0000100 epoch_Time:25727.0min: [2024-01-05 02:18:13,253][model8_pretrain.py][INFO] Epoch:[0/2](535400/4588595) loss:2.727 lr:0.0000100 epoch_Time:25727.0min: [2024-01-05 02:18:50,189][model8_pretrain.py][INFO] Epoch:[0/2](535500/4588595) loss:3.266 lr:0.0000100 epoch_Time:25726.0min: [2024-01-05 02:18:50,189][model8_pretrain.py][INFO] Epoch:[0/2](535500/4588595) loss:2.697 lr:0.0000100 epoch_Time:25726.0min: [2024-01-05 02:18:50,189][model8_pretrain.py][INFO] Epoch:[0/2](535500/4588595) loss:2.625 lr:0.0000100 epoch_Time:25726.0min: [2024-01-05 02:18:50,189][model8_pretrain.py][INFO] Epoch:[0/2](535500/4588595) loss:3.149 lr:0.0000100 epoch_Time:25726.0min: [2024-01-05 02:18:50,189][model8_pretrain.py][INFO] Epoch:[0/2](535500/4588595) loss:3.025 lr:0.0000100 epoch_Time:25726.0min: [2024-01-05 02:18:50,189][model8_pretrain.py][INFO] Epoch:[0/2](535500/4588595) loss:3.446 lr:0.0000100 epoch_Time:25726.0min: [2024-01-05 02:18:50,189][model8_pretrain.py][INFO] Epoch:[0/2](535500/4588595) loss:3.109 lr:0.0000100 epoch_Time:25726.0min: [2024-01-05 02:18:50,189][model8_pretrain.py][INFO] Epoch:[0/2](535500/4588595) loss:3.118 lr:0.0000100 epoch_Time:25726.0min: [2024-01-05 02:19:27,121][model8_pretrain.py][INFO] Epoch:[0/2](535600/4588595) loss:2.619 lr:0.0000100 epoch_Time:25726.0min: [2024-01-05 02:19:27,121][model8_pretrain.py][INFO] Epoch:[0/2](535600/4588595) loss:3.069 lr:0.0000100 epoch_Time:25726.0min: [2024-01-05 
02:19:27,121][model8_pretrain.py][INFO] Epoch:[0/2](535600/4588595) loss:2.420 lr:0.0000100 epoch_Time:25726.0min: [2024-01-05 02:19:27,121][model8_pretrain.py][INFO] Epoch:[0/2](535600/4588595) loss:2.990 lr:0.0000100 epoch_Time:25726.0min: [2024-01-05 02:19:27,121][model8_pretrain.py][INFO] Epoch:[0/2](535600/4588595) loss:2.826 lr:0.0000100 epoch_Time:25726.0min: [2024-01-05 02:19:27,121][model8_pretrain.py][INFO] Epoch:[0/2](535600/4588595) loss:2.956 lr:0.0000100 epoch_Time:25726.0min: [2024-01-05 02:19:27,121][model8_pretrain.py][INFO] Epoch:[0/2](535600/4588595) loss:2.958 lr:0.0000100 epoch_Time:25726.0min: [2024-01-05 02:19:27,122][model8_pretrain.py][INFO] Epoch:[0/2](535600/4588595) loss:2.846 lr:0.0000100 epoch_Time:25726.0min: [2024-01-05 02:20:04,068][model8_pretrain.py][INFO] Epoch:[0/2](535700/4588595) loss:2.804 lr:0.0000100 epoch_Time:25725.0min: [2024-01-05 02:20:04,068][model8_pretrain.py][INFO] Epoch:[0/2](535700/4588595) loss:2.314 lr:0.0000100 epoch_Time:25725.0min: [2024-01-05 02:20:04,068][model8_pretrain.py][INFO] Epoch:[0/2](535700/4588595) loss:2.890 lr:0.0000100 epoch_Time:25725.0min: [2024-01-05 02:20:04,068][model8_pretrain.py][INFO] Epoch:[0/2](535700/4588595) loss:2.942 lr:0.0000100 epoch_Time:25725.0min: [2024-01-05 02:20:04,068][model8_pretrain.py][INFO] Epoch:[0/2](535700/4588595) loss:2.515 lr:0.0000100 epoch_Time:25725.0min: [2024-01-05 02:20:04,068][model8_pretrain.py][INFO] Epoch:[0/2](535700/4588595) loss:3.094 lr:0.0000100 epoch_Time:25725.0min: [2024-01-05 02:20:04,068][model8_pretrain.py][INFO] Epoch:[0/2](535700/4588595) loss:3.047 lr:0.0000100 epoch_Time:25725.0min: [2024-01-05 02:20:04,068][model8_pretrain.py][INFO] Epoch:[0/2](535700/4588595) loss:3.543 lr:0.0000100 epoch_Time:25725.0min: [2024-01-05 02:20:40,999][model8_pretrain.py][INFO] Epoch:[0/2](535800/4588595) loss:3.417 lr:0.0000100 epoch_Time:25725.0min: [2024-01-05 02:20:40,999][model8_pretrain.py][INFO] Epoch:[0/2](535800/4588595) loss:3.117 lr:0.0000100 
epoch_Time:25725.0min: [2024-01-05 02:20:40,999][model8_pretrain.py][INFO] Epoch:[0/2](535800/4588595) loss:3.190 lr:0.0000100 epoch_Time:25725.0min: [2024-01-05 02:20:40,999][model8_pretrain.py][INFO] Epoch:[0/2](535800/4588595) loss:3.391 lr:0.0000100 epoch_Time:25725.0min: [2024-01-05 02:20:40,999][model8_pretrain.py][INFO] Epoch:[0/2](535800/4588595) loss:2.902 lr:0.0000100 epoch_Time:25725.0min: [2024-01-05 02:20:40,999][model8_pretrain.py][INFO] Epoch:[0/2](535800/4588595) loss:3.004 lr:0.0000100 epoch_Time:25725.0min: [2024-01-05 02:20:40,999][model8_pretrain.py][INFO] Epoch:[0/2](535800/4588595) loss:2.427 lr:0.0000100 epoch_Time:25725.0min: [2024-01-05 02:20:40,999][model8_pretrain.py][INFO] Epoch:[0/2](535800/4588595) loss:2.327 lr:0.0000100 epoch_Time:25725.0min: [2024-01-05 02:21:30,074][model8_pretrain.py][INFO] Epoch:[0/2](535900/4588595) loss:2.889 lr:0.0000100 epoch_Time:25725.0min: [2024-01-05 02:21:30,074][model8_pretrain.py][INFO] Epoch:[0/2](535900/4588595) loss:3.241 lr:0.0000100 epoch_Time:25725.0min: [2024-01-05 02:21:30,074][model8_pretrain.py][INFO] Epoch:[0/2](535900/4588595) loss:3.153 lr:0.0000100 epoch_Time:25725.0min: [2024-01-05 02:21:30,074][model8_pretrain.py][INFO] Epoch:[0/2](535900/4588595) loss:2.639 lr:0.0000100 epoch_Time:25725.0min: [2024-01-05 02:21:30,074][model8_pretrain.py][INFO] Epoch:[0/2](535900/4588595) loss:2.509 lr:0.0000100 epoch_Time:25725.0min: [2024-01-05 02:21:30,074][model8_pretrain.py][INFO] Epoch:[0/2](535900/4588595) loss:2.672 lr:0.0000100 epoch_Time:25725.0min: [2024-01-05 02:21:30,075][model8_pretrain.py][INFO] Epoch:[0/2](535900/4588595) loss:2.309 lr:0.0000100 epoch_Time:25725.0min: [2024-01-05 02:21:30,075][model8_pretrain.py][INFO] Epoch:[0/2](535900/4588595) loss:3.002 lr:0.0000100 epoch_Time:25725.0min: [2024-01-05 02:22:07,016][model8_pretrain.py][INFO] Epoch:[0/2](536000/4588595) loss:2.967 lr:0.0000100 epoch_Time:25724.0min: [2024-01-05 02:22:07,016][model8_pretrain.py][INFO] 
Epoch:[0/2](536000/4588595) loss:2.806 lr:0.0000100 epoch_Time:25724.0min: [2024-01-05 02:22:07,016][model8_pretrain.py][INFO] Epoch:[0/2](536000/4588595) loss:2.882 lr:0.0000100 epoch_Time:25724.0min: [2024-01-05 02:22:07,016][model8_pretrain.py][INFO] Epoch:[0/2](536000/4588595) loss:3.123 lr:0.0000100 epoch_Time:25724.0min: [2024-01-05 02:22:07,016][model8_pretrain.py][INFO] Epoch:[0/2](536000/4588595) loss:2.445 lr:0.0000100 epoch_Time:25724.0min: [2024-01-05 02:22:07,017][model8_pretrain.py][INFO] Epoch:[0/2](536000/4588595) loss:3.021 lr:0.0000100 epoch_Time:25724.0min: [2024-01-05 02:22:07,017][model8_pretrain.py][INFO] Epoch:[0/2](536000/4588595) loss:2.413 lr:0.0000100 epoch_Time:25724.0min: [2024-01-05 02:22:07,017][model8_pretrain.py][INFO] Epoch:[0/2](536000/4588595) loss:2.257 lr:0.0000100 epoch_Time:25724.0min: [2024-01-05 02:22:43,975][model8_pretrain.py][INFO] Epoch:[0/2](536100/4588595) loss:2.570 lr:0.0000100 epoch_Time:25724.0min: [2024-01-05 02:22:43,975][model8_pretrain.py][INFO] Epoch:[0/2](536100/4588595) loss:3.370 lr:0.0000100 epoch_Time:25724.0min: [2024-01-05 02:22:43,975][model8_pretrain.py][INFO] Epoch:[0/2](536100/4588595) loss:2.906 lr:0.0000100 epoch_Time:25724.0min: [2024-01-05 02:22:43,975][model8_pretrain.py][INFO] Epoch:[0/2](536100/4588595) loss:3.032 lr:0.0000100 epoch_Time:25724.0min: [2024-01-05 02:22:43,975][model8_pretrain.py][INFO] Epoch:[0/2](536100/4588595) loss:2.997 lr:0.0000100 epoch_Time:25724.0min: [2024-01-05 02:22:43,976][model8_pretrain.py][INFO] Epoch:[0/2](536100/4588595) loss:3.200 lr:0.0000100 epoch_Time:25724.0min: [2024-01-05 02:22:43,976][model8_pretrain.py][INFO] Epoch:[0/2](536100/4588595) loss:2.802 lr:0.0000100 epoch_Time:25724.0min: [2024-01-05 02:22:43,977][model8_pretrain.py][INFO] Epoch:[0/2](536100/4588595) loss:2.752 lr:0.0000100 epoch_Time:25724.0min: [2024-01-05 02:23:20,915][model8_pretrain.py][INFO] Epoch:[0/2](536200/4588595) loss:2.930 lr:0.0000100 epoch_Time:25723.0min: [2024-01-05 
02:23:20,915][model8_pretrain.py][INFO] Epoch:[0/2](536200/4588595) loss:2.924 lr:0.0000100 epoch_Time:25723.0min: [2024-01-05 02:23:20,915][model8_pretrain.py][INFO] Epoch:[0/2](536200/4588595) loss:3.082 lr:0.0000100 epoch_Time:25723.0min: [2024-01-05 02:23:20,915][model8_pretrain.py][INFO] Epoch:[0/2](536200/4588595) loss:2.920 lr:0.0000100 epoch_Time:25723.0min: [2024-01-05 02:23:20,915][model8_pretrain.py][INFO] Epoch:[0/2](536200/4588595) loss:2.773 lr:0.0000100 epoch_Time:25723.0min: [2024-01-05 02:23:20,915][model8_pretrain.py][INFO] Epoch:[0/2](536200/4588595) loss:3.084 lr:0.0000100 epoch_Time:25723.0min: [2024-01-05 02:23:20,915][model8_pretrain.py][INFO] Epoch:[0/2](536200/4588595) loss:3.165 lr:0.0000100 epoch_Time:25723.0min: [2024-01-05 02:23:20,915][model8_pretrain.py][INFO] Epoch:[0/2](536200/4588595) loss:2.557 lr:0.0000100 epoch_Time:25723.0min: [2024-01-05 02:23:57,843][model8_pretrain.py][INFO] Epoch:[0/2](536300/4588595) loss:2.936 lr:0.0000100 epoch_Time:25722.0min: [2024-01-05 02:23:57,843][model8_pretrain.py][INFO] Epoch:[0/2](536300/4588595) loss:2.761 lr:0.0000100 epoch_Time:25722.0min: [2024-01-05 02:23:57,843][model8_pretrain.py][INFO] Epoch:[0/2](536300/4588595) loss:2.975 lr:0.0000100 epoch_Time:25722.0min: [2024-01-05 02:23:57,843][model8_pretrain.py][INFO] Epoch:[0/2](536300/4588595) loss:2.732 lr:0.0000100 epoch_Time:25722.0min: [2024-01-05 02:23:57,843][model8_pretrain.py][INFO] Epoch:[0/2](536300/4588595) loss:3.240 lr:0.0000100 epoch_Time:25722.0min: [2024-01-05 02:23:57,843][model8_pretrain.py][INFO] Epoch:[0/2](536300/4588595) loss:2.148 lr:0.0000100 epoch_Time:25722.0min: [2024-01-05 02:23:57,843][model8_pretrain.py][INFO] Epoch:[0/2](536300/4588595) loss:2.628 lr:0.0000100 epoch_Time:25722.0min: [2024-01-05 02:23:57,843][model8_pretrain.py][INFO] Epoch:[0/2](536300/4588595) loss:2.775 lr:0.0000100 epoch_Time:25722.0min: [2024-01-05 02:24:34,782][model8_pretrain.py][INFO] Epoch:[0/2](536400/4588595) loss:2.910 lr:0.0000100 
epoch_Time:25721.0min: [2024-01-05 02:24:34,782][model8_pretrain.py][INFO] Epoch:[0/2](536400/4588595) loss:3.066 lr:0.0000100 epoch_Time:25721.0min: [2024-01-05 02:24:34,782][model8_pretrain.py][INFO] Epoch:[0/2](536400/4588595) loss:3.118 lr:0.0000100 epoch_Time:25721.0min: [2024-01-05 02:24:34,782][model8_pretrain.py][INFO] Epoch:[0/2](536400/4588595) loss:2.561 lr:0.0000100 epoch_Time:25721.0min: [2024-01-05 02:24:34,782][model8_pretrain.py][INFO] Epoch:[0/2](536400/4588595) loss:2.930 lr:0.0000100 epoch_Time:25721.0min: [2024-01-05 02:24:34,782][model8_pretrain.py][INFO] Epoch:[0/2](536400/4588595) loss:2.905 lr:0.0000100 epoch_Time:25721.0min: [2024-01-05 02:24:34,782][model8_pretrain.py][INFO] Epoch:[0/2](536400/4588595) loss:2.808 lr:0.0000100 epoch_Time:25721.0min: [2024-01-05 02:24:34,783][model8_pretrain.py][INFO] Epoch:[0/2](536400/4588595) loss:2.989 lr:0.0000100 epoch_Time:25721.0min: [2024-01-05 02:25:11,723][model8_pretrain.py][INFO] Epoch:[0/2](536500/4588595) loss:2.861 lr:0.0000100 epoch_Time:25720.0min: [2024-01-05 02:25:11,723][model8_pretrain.py][INFO] Epoch:[0/2](536500/4588595) loss:2.811 lr:0.0000100 epoch_Time:25720.0min: [2024-01-05 02:25:11,723][model8_pretrain.py][INFO] Epoch:[0/2](536500/4588595) loss:2.596 lr:0.0000100 epoch_Time:25720.0min: [2024-01-05 02:25:11,723][model8_pretrain.py][INFO] Epoch:[0/2](536500/4588595) loss:3.074 lr:0.0000100 epoch_Time:25720.0min: [2024-01-05 02:25:11,723][model8_pretrain.py][INFO] Epoch:[0/2](536500/4588595) loss:2.677 lr:0.0000100 epoch_Time:25720.0min: [2024-01-05 02:25:11,723][model8_pretrain.py][INFO] Epoch:[0/2](536500/4588595) loss:2.780 lr:0.0000100 epoch_Time:25720.0min: [2024-01-05 02:25:11,723][model8_pretrain.py][INFO] Epoch:[0/2](536500/4588595) loss:2.858 lr:0.0000100 epoch_Time:25720.0min: [2024-01-05 02:25:11,723][model8_pretrain.py][INFO] Epoch:[0/2](536500/4588595) loss:3.224 lr:0.0000100 epoch_Time:25720.0min: [2024-01-05 02:25:48,651][model8_pretrain.py][INFO] 
Epoch:[0/2](536600/4588595) loss:2.868 lr:0.0000100 epoch_Time:25719.0min: [2024-01-05 02:25:48,651][model8_pretrain.py][INFO] Epoch:[0/2](536600/4588595) loss:2.394 lr:0.0000100 epoch_Time:25719.0min: [2024-01-05 02:25:48,651][model8_pretrain.py][INFO] Epoch:[0/2](536600/4588595) loss:2.908 lr:0.0000100 epoch_Time:25719.0min: [2024-01-05 02:25:48,651][model8_pretrain.py][INFO] Epoch:[0/2](536600/4588595) loss:3.005 lr:0.0000100 epoch_Time:25719.0min: [2024-01-05 02:25:48,651][model8_pretrain.py][INFO] Epoch:[0/2](536600/4588595) loss:2.911 lr:0.0000100 epoch_Time:25719.0min: [2024-01-05 02:25:48,652][model8_pretrain.py][INFO] Epoch:[0/2](536600/4588595) loss:2.342 lr:0.0000100 epoch_Time:25719.0min: [2024-01-05 02:25:48,652][model8_pretrain.py][INFO] Epoch:[0/2](536600/4588595) loss:2.430 lr:0.0000100 epoch_Time:25719.0min: [2024-01-05 02:25:48,652][model8_pretrain.py][INFO] Epoch:[0/2](536600/4588595) loss:2.304 lr:0.0000100 epoch_Time:25719.0min: [2024-01-05 02:26:37,714][model8_pretrain.py][INFO] Epoch:[0/2](536700/4588595) loss:2.680 lr:0.0000100 epoch_Time:25721.0min: [2024-01-05 02:26:37,714][model8_pretrain.py][INFO] Epoch:[0/2](536700/4588595) loss:2.879 lr:0.0000100 epoch_Time:25721.0min: [2024-01-05 02:26:37,714][model8_pretrain.py][INFO] Epoch:[0/2](536700/4588595) loss:3.430 lr:0.0000100 epoch_Time:25721.0min: [2024-01-05 02:26:37,714][model8_pretrain.py][INFO] Epoch:[0/2](536700/4588595) loss:3.079 lr:0.0000100 epoch_Time:25721.0min: [2024-01-05 02:26:37,714][model8_pretrain.py][INFO] Epoch:[0/2](536700/4588595) loss:3.560 lr:0.0000100 epoch_Time:25721.0min: [2024-01-05 02:26:37,714][model8_pretrain.py][INFO] Epoch:[0/2](536700/4588595) loss:2.825 lr:0.0000100 epoch_Time:25721.0min: [2024-01-05 02:26:37,714][model8_pretrain.py][INFO] Epoch:[0/2](536700/4588595) loss:2.937 lr:0.0000100 epoch_Time:25721.0min: [2024-01-05 02:26:37,714][model8_pretrain.py][INFO] Epoch:[0/2](536700/4588595) loss:2.802 lr:0.0000100 epoch_Time:25721.0min: [2024-01-05 
02:27:14,649][model8_pretrain.py][INFO] Epoch:[0/2](536800/4588595) loss:2.463 lr:0.0000100 epoch_Time:25720.0min: [2024-01-05 02:27:14,649][model8_pretrain.py][INFO] Epoch:[0/2](536800/4588595) loss:2.887 lr:0.0000100 epoch_Time:25720.0min: [2024-01-05 02:27:14,649][model8_pretrain.py][INFO] Epoch:[0/2](536800/4588595) loss:2.545 lr:0.0000100 epoch_Time:25720.0min: [2024-01-05 02:27:14,649][model8_pretrain.py][INFO] Epoch:[0/2](536800/4588595) loss:3.025 lr:0.0000100 epoch_Time:25720.0min: [2024-01-05 02:27:14,649][model8_pretrain.py][INFO] Epoch:[0/2](536800/4588595) loss:2.663 lr:0.0000100 epoch_Time:25720.0min: [2024-01-05 02:27:14,649][model8_pretrain.py][INFO] Epoch:[0/2](536800/4588595) loss:3.041 lr:0.0000100 epoch_Time:25720.0min: [2024-01-05 02:27:14,649][model8_pretrain.py][INFO] Epoch:[0/2](536800/4588595) loss:2.941 lr:0.0000100 epoch_Time:25720.0min: [2024-01-05 02:27:14,649][model8_pretrain.py][INFO] Epoch:[0/2](536800/4588595) loss:3.119 lr:0.0000100 epoch_Time:25720.0min: [2024-01-05 02:27:51,589][model8_pretrain.py][INFO] Epoch:[0/2](536900/4588595) loss:2.910 lr:0.0000100 epoch_Time:25718.0min: [2024-01-05 02:27:51,589][model8_pretrain.py][INFO] Epoch:[0/2](536900/4588595) loss:2.990 lr:0.0000100 epoch_Time:25718.0min: [2024-01-05 02:27:51,589][model8_pretrain.py][INFO] Epoch:[0/2](536900/4588595) loss:2.913 lr:0.0000100 epoch_Time:25718.0min: [2024-01-05 02:27:51,589][model8_pretrain.py][INFO] Epoch:[0/2](536900/4588595) loss:2.296 lr:0.0000100 epoch_Time:25718.0min: [2024-01-05 02:27:51,589][model8_pretrain.py][INFO] Epoch:[0/2](536900/4588595) loss:2.870 lr:0.0000100 epoch_Time:25718.0min: [2024-01-05 02:27:51,589][model8_pretrain.py][INFO] Epoch:[0/2](536900/4588595) loss:2.729 lr:0.0000100 epoch_Time:25718.0min: [2024-01-05 02:27:51,589][model8_pretrain.py][INFO] Epoch:[0/2](536900/4588595) loss:3.169 lr:0.0000100 epoch_Time:25718.0min: [2024-01-05 02:27:51,589][model8_pretrain.py][INFO] Epoch:[0/2](536900/4588595) loss:2.781 lr:0.0000100 
epoch_Time:25718.0min: [2024-01-05 02:28:28,534][model8_pretrain.py][INFO] Epoch:[0/2](537000/4588595) loss:2.769 lr:0.0000100 epoch_Time:25718.0min: [2024-01-05 02:28:28,534][model8_pretrain.py][INFO] Epoch:[0/2](537000/4588595) loss:3.269 lr:0.0000100 epoch_Time:25718.0min: [2024-01-05 02:28:28,534][model8_pretrain.py][INFO] Epoch:[0/2](537000/4588595) loss:2.222 lr:0.0000100 epoch_Time:25718.0min: [2024-01-05 02:28:28,534][model8_pretrain.py][INFO] Epoch:[0/2](537000/4588595) loss:3.261 lr:0.0000100 epoch_Time:25718.0min: [2024-01-05 02:28:28,534][model8_pretrain.py][INFO] Epoch:[0/2](537000/4588595) loss:3.006 lr:0.0000100 epoch_Time:25718.0min: [2024-01-05 02:28:28,534][model8_pretrain.py][INFO] Epoch:[0/2](537000/4588595) loss:2.331 lr:0.0000100 epoch_Time:25718.0min: [2024-01-05 02:28:28,534][model8_pretrain.py][INFO] Epoch:[0/2](537000/4588595) loss:2.811 lr:0.0000100 epoch_Time:25718.0min: [2024-01-05 02:28:28,534][model8_pretrain.py][INFO] Epoch:[0/2](537000/4588595) loss:3.157 lr:0.0000100 epoch_Time:25718.0min: [2024-01-05 02:29:05,474][model8_pretrain.py][INFO] Epoch:[0/2](537100/4588595) loss:2.611 lr:0.0000100 epoch_Time:25717.0min: [2024-01-05 02:29:05,474][model8_pretrain.py][INFO] Epoch:[0/2](537100/4588595) loss:2.921 lr:0.0000100 epoch_Time:25717.0min: [2024-01-05 02:29:05,474][model8_pretrain.py][INFO] Epoch:[0/2](537100/4588595) loss:3.035 lr:0.0000100 epoch_Time:25717.0min: [2024-01-05 02:29:05,474][model8_pretrain.py][INFO] Epoch:[0/2](537100/4588595) loss:2.769 lr:0.0000100 epoch_Time:25717.0min: [2024-01-05 02:29:05,474][model8_pretrain.py][INFO] Epoch:[0/2](537100/4588595) loss:2.001 lr:0.0000100 epoch_Time:25717.0min: [2024-01-05 02:29:05,474][model8_pretrain.py][INFO] Epoch:[0/2](537100/4588595) loss:2.607 lr:0.0000100 epoch_Time:25717.0min: [2024-01-05 02:29:05,475][model8_pretrain.py][INFO] Epoch:[0/2](537100/4588595) loss:2.724 lr:0.0000100 epoch_Time:25717.0min: [2024-01-05 02:29:05,475][model8_pretrain.py][INFO] 
Epoch:[0/2](537100/4588595) loss:2.890 lr:0.0000100 epoch_Time:25717.0min: [2024-01-05 02:29:42,412][model8_pretrain.py][INFO] Epoch:[0/2](537200/4588595) loss:2.936 lr:0.0000100 epoch_Time:25717.0min: [2024-01-05 02:29:42,412][model8_pretrain.py][INFO] Epoch:[0/2](537200/4588595) loss:3.321 lr:0.0000100 epoch_Time:25717.0min: [2024-01-05 02:29:42,412][model8_pretrain.py][INFO] Epoch:[0/2](537200/4588595) loss:2.945 lr:0.0000100 epoch_Time:25717.0min: [2024-01-05 02:29:42,412][model8_pretrain.py][INFO] Epoch:[0/2](537200/4588595) loss:2.828 lr:0.0000100 epoch_Time:25717.0min: [2024-01-05 02:29:42,412][model8_pretrain.py][INFO] Epoch:[0/2](537200/4588595) loss:2.917 lr:0.0000100 epoch_Time:25717.0min: [2024-01-05 02:29:42,412][model8_pretrain.py][INFO] Epoch:[0/2](537200/4588595) loss:2.886 lr:0.0000100 epoch_Time:25717.0min: [2024-01-05 02:29:42,412][model8_pretrain.py][INFO] Epoch:[0/2](537200/4588595) loss:2.533 lr:0.0000100 epoch_Time:25717.0min: [2024-01-05 02:29:42,412][model8_pretrain.py][INFO] Epoch:[0/2](537200/4588595) loss:2.762 lr:0.0000100 epoch_Time:25717.0min: [2024-01-05 02:30:19,317][model8_pretrain.py][INFO] Epoch:[0/2](537300/4588595) loss:3.251 lr:0.0000100 epoch_Time:25716.0min: [2024-01-05 02:30:19,317][model8_pretrain.py][INFO] Epoch:[0/2](537300/4588595) loss:2.677 lr:0.0000100 epoch_Time:25716.0min: [2024-01-05 02:30:19,317][model8_pretrain.py][INFO] Epoch:[0/2](537300/4588595) loss:2.957 lr:0.0000100 epoch_Time:25716.0min: [2024-01-05 02:30:19,317][model8_pretrain.py][INFO] Epoch:[0/2](537300/4588595) loss:2.741 lr:0.0000100 epoch_Time:25716.0min: [2024-01-05 02:30:19,317][model8_pretrain.py][INFO] Epoch:[0/2](537300/4588595) loss:3.406 lr:0.0000100 epoch_Time:25716.0min: [2024-01-05 02:30:19,317][model8_pretrain.py][INFO] Epoch:[0/2](537300/4588595) loss:2.651 lr:0.0000100 epoch_Time:25716.0min: [2024-01-05 02:30:19,317][model8_pretrain.py][INFO] Epoch:[0/2](537300/4588595) loss:2.778 lr:0.0000100 epoch_Time:25716.0min: [2024-01-05 
02:30:19,317][model8_pretrain.py][INFO] Epoch:[0/2](537300/4588595) loss:3.199 lr:0.0000100 epoch_Time:25716.0min: [2024-01-05 02:30:56,252][model8_pretrain.py][INFO] Epoch:[0/2](537400/4588595) loss:2.789 lr:0.0000100 epoch_Time:25715.0min: [2024-01-05 02:30:56,252][model8_pretrain.py][INFO] Epoch:[0/2](537400/4588595) loss:3.085 lr:0.0000100 epoch_Time:25715.0min: [2024-01-05 02:30:56,252][model8_pretrain.py][INFO] Epoch:[0/2](537400/4588595) loss:2.560 lr:0.0000100 epoch_Time:25715.0min: [2024-01-05 02:30:56,252][model8_pretrain.py][INFO] Epoch:[0/2](537400/4588595) loss:2.690 lr:0.0000100 epoch_Time:25715.0min: [2024-01-05 02:30:56,252][model8_pretrain.py][INFO] Epoch:[0/2](537400/4588595) loss:2.931 lr:0.0000100 epoch_Time:25715.0min: [2024-01-05 02:30:56,252][model8_pretrain.py][INFO] Epoch:[0/2](537400/4588595) loss:2.788 lr:0.0000100 epoch_Time:25715.0min: [2024-01-05 02:30:56,252][model8_pretrain.py][INFO] Epoch:[0/2](537400/4588595) loss:2.865 lr:0.0000100 epoch_Time:25715.0min: [2024-01-05 02:30:56,252][model8_pretrain.py][INFO] Epoch:[0/2](537400/4588595) loss:2.862 lr:0.0000100 epoch_Time:25715.0min: [2024-01-05 02:31:45,354][model8_pretrain.py][INFO] Epoch:[0/2](537500/4588595) loss:3.098 lr:0.0000100 epoch_Time:25716.0min: [2024-01-05 02:31:45,354][model8_pretrain.py][INFO] Epoch:[0/2](537500/4588595) loss:2.770 lr:0.0000100 epoch_Time:25716.0min: [2024-01-05 02:31:45,354][model8_pretrain.py][INFO] Epoch:[0/2](537500/4588595) loss:2.952 lr:0.0000100 epoch_Time:25716.0min: [2024-01-05 02:31:45,354][model8_pretrain.py][INFO] Epoch:[0/2](537500/4588595) loss:3.163 lr:0.0000100 epoch_Time:25716.0min: [2024-01-05 02:31:45,354][model8_pretrain.py][INFO] Epoch:[0/2](537500/4588595) loss:2.934 lr:0.0000100 epoch_Time:25716.0min: [2024-01-05 02:31:45,354][model8_pretrain.py][INFO] Epoch:[0/2](537500/4588595) loss:2.864 lr:0.0000100 epoch_Time:25716.0min: [2024-01-05 02:31:45,354][model8_pretrain.py][INFO] Epoch:[0/2](537500/4588595) loss:2.729 lr:0.0000100 
epoch_Time:25716.0min: [2024-01-05 02:31:45,354][model8_pretrain.py][INFO] Epoch:[0/2](537500/4588595) loss:2.870 lr:0.0000100 epoch_Time:25716.0min: [2024-01-05 02:32:22,285][model8_pretrain.py][INFO] Epoch:[0/2](537600/4588595) loss:3.107 lr:0.0000100 epoch_Time:25715.0min: [2024-01-05 02:32:22,285][model8_pretrain.py][INFO] Epoch:[0/2](537600/4588595) loss:2.258 lr:0.0000100 epoch_Time:25715.0min: [2024-01-05 02:32:22,285][model8_pretrain.py][INFO] Epoch:[0/2](537600/4588595) loss:2.180 lr:0.0000100 epoch_Time:25715.0min: [2024-01-05 02:32:22,285][model8_pretrain.py][INFO] Epoch:[0/2](537600/4588595) loss:3.182 lr:0.0000100 epoch_Time:25715.0min: [2024-01-05 02:32:22,285][model8_pretrain.py][INFO] Epoch:[0/2](537600/4588595) loss:3.351 lr:0.0000100 epoch_Time:25715.0min: [2024-01-05 02:32:22,285][model8_pretrain.py][INFO] Epoch:[0/2](537600/4588595) loss:3.144 lr:0.0000100 epoch_Time:25715.0min: [2024-01-05 02:32:22,285][model8_pretrain.py][INFO] Epoch:[0/2](537600/4588595) loss:3.062 lr:0.0000100 epoch_Time:25715.0min: [2024-01-05 02:32:22,286][model8_pretrain.py][INFO] Epoch:[0/2](537600/4588595) loss:2.041 lr:0.0000100 epoch_Time:25715.0min: [2024-01-05 02:32:59,226][model8_pretrain.py][INFO] Epoch:[0/2](537700/4588595) loss:2.877 lr:0.0000100 epoch_Time:25714.0min: [2024-01-05 02:32:59,226][model8_pretrain.py][INFO] Epoch:[0/2](537700/4588595) loss:2.795 lr:0.0000100 epoch_Time:25714.0min: [2024-01-05 02:32:59,226][model8_pretrain.py][INFO] Epoch:[0/2](537700/4588595) loss:3.090 lr:0.0000100 epoch_Time:25714.0min: [2024-01-05 02:32:59,226][model8_pretrain.py][INFO] Epoch:[0/2](537700/4588595) loss:2.877 lr:0.0000100 epoch_Time:25714.0min: [2024-01-05 02:32:59,226][model8_pretrain.py][INFO] Epoch:[0/2](537700/4588595) loss:2.642 lr:0.0000100 epoch_Time:25714.0min: [2024-01-05 02:32:59,226][model8_pretrain.py][INFO] Epoch:[0/2](537700/4588595) loss:2.903 lr:0.0000100 epoch_Time:25714.0min: [2024-01-05 02:32:59,226][model8_pretrain.py][INFO] 
Epoch:[0/2](537700/4588595) loss:3.101 lr:0.0000100 epoch_Time:25714.0min: [2024-01-05 02:32:59,226][model8_pretrain.py][INFO] Epoch:[0/2](537700/4588595) loss:2.951 lr:0.0000100 epoch_Time:25714.0min: [2024-01-05 02:33:36,170][model8_pretrain.py][INFO] Epoch:[0/2](537800/4588595) loss:3.472 lr:0.0000100 epoch_Time:25714.0min: [2024-01-05 02:33:36,170][model8_pretrain.py][INFO] Epoch:[0/2](537800/4588595) loss:2.807 lr:0.0000100 epoch_Time:25714.0min: [2024-01-05 02:33:36,170][model8_pretrain.py][INFO] Epoch:[0/2](537800/4588595) loss:2.968 lr:0.0000100 epoch_Time:25714.0min: [2024-01-05 02:33:36,170][model8_pretrain.py][INFO] Epoch:[0/2](537800/4588595) loss:2.303 lr:0.0000100 epoch_Time:25714.0min: [2024-01-05 02:33:36,170][model8_pretrain.py][INFO] Epoch:[0/2](537800/4588595) loss:2.066 lr:0.0000100 epoch_Time:25714.0min: [2024-01-05 02:33:36,170][model8_pretrain.py][INFO] Epoch:[0/2](537800/4588595) loss:2.808 lr:0.0000100 epoch_Time:25714.0min: [2024-01-05 02:33:36,170][model8_pretrain.py][INFO] Epoch:[0/2](537800/4588595) loss:3.221 lr:0.0000100 epoch_Time:25714.0min: [2024-01-05 02:33:36,170][model8_pretrain.py][INFO] Epoch:[0/2](537800/4588595) loss:3.026 lr:0.0000100 epoch_Time:25714.0min: [2024-01-05 02:34:13,116][model8_pretrain.py][INFO] Epoch:[0/2](537900/4588595) loss:2.654 lr:0.0000100 epoch_Time:25712.0min: [2024-01-05 02:34:13,116][model8_pretrain.py][INFO] Epoch:[0/2](537900/4588595) loss:2.076 lr:0.0000100 epoch_Time:25712.0min: [2024-01-05 02:34:13,116][model8_pretrain.py][INFO] Epoch:[0/2](537900/4588595) loss:2.527 lr:0.0000100 epoch_Time:25712.0min: [2024-01-05 02:34:13,117][model8_pretrain.py][INFO] Epoch:[0/2](537900/4588595) loss:2.570 lr:0.0000100 epoch_Time:25712.0min: [2024-01-05 02:34:13,117][model8_pretrain.py][INFO] Epoch:[0/2](537900/4588595) loss:2.762 lr:0.0000100 epoch_Time:25712.0min: [2024-01-05 02:34:13,118][model8_pretrain.py][INFO] Epoch:[0/2](537900/4588595) loss:2.596 lr:0.0000100 epoch_Time:25712.0min: [2024-01-05 
02:34:13,118][model8_pretrain.py][INFO] Epoch:[0/2](537900/4588595) loss:3.083 lr:0.0000100 epoch_Time:25712.0min: [2024-01-05 02:34:13,118][model8_pretrain.py][INFO] Epoch:[0/2](537900/4588595) loss:2.839 lr:0.0000100 epoch_Time:25712.0min: [2024-01-05 02:34:50,058][model8_pretrain.py][INFO] Epoch:[0/2](538000/4588595) loss:2.826 lr:0.0000100 epoch_Time:25711.0min: [2024-01-05 02:34:50,058][model8_pretrain.py][INFO] Epoch:[0/2](538000/4588595) loss:2.880 lr:0.0000100 epoch_Time:25711.0min: [2024-01-05 02:34:50,059][model8_pretrain.py][INFO] Epoch:[0/2](538000/4588595) loss:2.411 lr:0.0000100 epoch_Time:25711.0min: [2024-01-05 02:34:50,059][model8_pretrain.py][INFO] Epoch:[0/2](538000/4588595) loss:3.001 lr:0.0000100 epoch_Time:25711.0min: [2024-01-05 02:34:50,059][model8_pretrain.py][INFO] Epoch:[0/2](538000/4588595) loss:3.136 lr:0.0000100 epoch_Time:25711.0min: [2024-01-05 02:34:50,059][model8_pretrain.py][INFO] Epoch:[0/2](538000/4588595) loss:2.466 lr:0.0000100 epoch_Time:25711.0min: [2024-01-05 02:34:50,059][model8_pretrain.py][INFO] Epoch:[0/2](538000/4588595) loss:2.925 lr:0.0000100 epoch_Time:25711.0min: [2024-01-05 02:34:50,059][model8_pretrain.py][INFO] Epoch:[0/2](538000/4588595) loss:2.828 lr:0.0000100 epoch_Time:25711.0min: [2024-01-05 02:35:26,990][model8_pretrain.py][INFO] Epoch:[0/2](538100/4588595) loss:2.853 lr:0.0000100 epoch_Time:25711.0min: [2024-01-05 02:35:26,990][model8_pretrain.py][INFO] Epoch:[0/2](538100/4588595) loss:3.255 lr:0.0000100 epoch_Time:25711.0min: [2024-01-05 02:35:26,991][model8_pretrain.py][INFO] Epoch:[0/2](538100/4588595) loss:2.758 lr:0.0000100 epoch_Time:25711.0min: [2024-01-05 02:35:26,991][model8_pretrain.py][INFO] Epoch:[0/2](538100/4588595) loss:2.475 lr:0.0000100 epoch_Time:25711.0min: [2024-01-05 02:35:26,991][model8_pretrain.py][INFO] Epoch:[0/2](538100/4588595) loss:2.562 lr:0.0000100 epoch_Time:25711.0min: [2024-01-05 02:35:26,991][model8_pretrain.py][INFO] Epoch:[0/2](538100/4588595) loss:3.096 lr:0.0000100 
epoch_Time:25711.0min: [2024-01-05 02:35:26,991][model8_pretrain.py][INFO] Epoch:[0/2](538100/4588595) loss:2.935 lr:0.0000100 epoch_Time:25711.0min: [2024-01-05 02:35:26,991][model8_pretrain.py][INFO] Epoch:[0/2](538100/4588595) loss:2.726 lr:0.0000100 epoch_Time:25711.0min: [2024-01-05 02:36:03,894][model8_pretrain.py][INFO] Epoch:[0/2](538200/4588595) loss:2.920 lr:0.0000100 epoch_Time:25710.0min: [2024-01-05 02:36:03,894][model8_pretrain.py][INFO] Epoch:[0/2](538200/4588595) loss:2.858 lr:0.0000100 epoch_Time:25710.0min: [2024-01-05 02:36:03,894][model8_pretrain.py][INFO] Epoch:[0/2](538200/4588595) loss:2.875 lr:0.0000100 epoch_Time:25710.0min: [2024-01-05 02:36:03,894][model8_pretrain.py][INFO] Epoch:[0/2](538200/4588595) loss:3.008 lr:0.0000100 epoch_Time:25710.0min: [2024-01-05 02:36:03,894][model8_pretrain.py][INFO] Epoch:[0/2](538200/4588595) loss:2.841 lr:0.0000100 epoch_Time:25710.0min: [2024-01-05 02:36:03,894][model8_pretrain.py][INFO] Epoch:[0/2](538200/4588595) loss:2.397 lr:0.0000100 epoch_Time:25710.0min: [2024-01-05 02:36:03,894][model8_pretrain.py][INFO] Epoch:[0/2](538200/4588595) loss:2.941 lr:0.0000100 epoch_Time:25710.0min: [2024-01-05 02:36:03,894][model8_pretrain.py][INFO] Epoch:[0/2](538200/4588595) loss:2.871 lr:0.0000100 epoch_Time:25710.0min: [2024-01-05 02:36:52,903][model8_pretrain.py][INFO] Epoch:[0/2](538300/4588595) loss:2.314 lr:0.0000100 epoch_Time:25711.0min: [2024-01-05 02:36:52,903][model8_pretrain.py][INFO] Epoch:[0/2](538300/4588595) loss:2.953 lr:0.0000100 epoch_Time:25711.0min: [2024-01-05 02:36:52,903][model8_pretrain.py][INFO] Epoch:[0/2](538300/4588595) loss:2.467 lr:0.0000100 epoch_Time:25711.0min: [2024-01-05 02:36:52,903][model8_pretrain.py][INFO] Epoch:[0/2](538300/4588595) loss:2.702 lr:0.0000100 epoch_Time:25711.0min: [2024-01-05 02:36:52,903][model8_pretrain.py][INFO] Epoch:[0/2](538300/4588595) loss:3.329 lr:0.0000100 epoch_Time:25711.0min: [2024-01-05 02:36:52,903][model8_pretrain.py][INFO] 
Epoch:[0/2](538300/4588595) loss:2.856 lr:0.0000100 epoch_Time:25711.0min: [2024-01-05 02:36:52,903][model8_pretrain.py][INFO] Epoch:[0/2](538300/4588595) loss:2.724 lr:0.0000100 epoch_Time:25711.0min: [2024-01-05 02:36:52,904][model8_pretrain.py][INFO] Epoch:[0/2](538300/4588595) loss:3.291 lr:0.0000100 epoch_Time:25711.0min: [2024-01-05 02:37:29,844][model8_pretrain.py][INFO] Epoch:[0/2](538400/4588595) loss:3.075 lr:0.0000100 epoch_Time:25710.0min: [2024-01-05 02:37:29,845][model8_pretrain.py][INFO] Epoch:[0/2](538400/4588595) loss:2.876 lr:0.0000100 epoch_Time:25710.0min: [2024-01-05 02:37:29,845][model8_pretrain.py][INFO] Epoch:[0/2](538400/4588595) loss:2.089 lr:0.0000100 epoch_Time:25710.0min: [2024-01-05 02:37:29,845][model8_pretrain.py][INFO] Epoch:[0/2](538400/4588595) loss:3.077 lr:0.0000100 epoch_Time:25710.0min: [2024-01-05 02:37:29,845][model8_pretrain.py][INFO] Epoch:[0/2](538400/4588595) loss:2.718 lr:0.0000100 epoch_Time:25710.0min: [2024-01-05 02:37:29,845][model8_pretrain.py][INFO] Epoch:[0/2](538400/4588595) loss:3.137 lr:0.0000100 epoch_Time:25710.0min: [2024-01-05 02:37:29,845][model8_pretrain.py][INFO] Epoch:[0/2](538400/4588595) loss:3.233 lr:0.0000100 epoch_Time:25710.0min: [2024-01-05 02:37:29,845][model8_pretrain.py][INFO] Epoch:[0/2](538400/4588595) loss:3.519 lr:0.0000100 epoch_Time:25710.0min: [2024-01-05 02:38:06,789][model8_pretrain.py][INFO] Epoch:[0/2](538500/4588595) loss:2.900 lr:0.0000100 epoch_Time:25709.0min: [2024-01-05 02:38:06,789][model8_pretrain.py][INFO] Epoch:[0/2](538500/4588595) loss:2.686 lr:0.0000100 epoch_Time:25709.0min: [2024-01-05 02:38:06,789][model8_pretrain.py][INFO] Epoch:[0/2](538500/4588595) loss:3.099 lr:0.0000100 epoch_Time:25709.0min: [2024-01-05 02:38:06,789][model8_pretrain.py][INFO] Epoch:[0/2](538500/4588595) loss:3.168 lr:0.0000100 epoch_Time:25709.0min: [2024-01-05 02:38:06,789][model8_pretrain.py][INFO] Epoch:[0/2](538500/4588595) loss:2.575 lr:0.0000100 epoch_Time:25709.0min: [2024-01-05 
02:38:06,789][model8_pretrain.py][INFO] Epoch:[0/2](538500/4588595) loss:3.197 lr:0.0000100 epoch_Time:25709.0min: [2024-01-05 02:38:06,789][model8_pretrain.py][INFO] Epoch:[0/2](538500/4588595) loss:3.241 lr:0.0000100 epoch_Time:25709.0min: [2024-01-05 02:38:06,789][model8_pretrain.py][INFO] Epoch:[0/2](538500/4588595) loss:2.977 lr:0.0000100 epoch_Time:25709.0min: [2024-01-05 02:38:43,746][model8_pretrain.py][INFO] Epoch:[0/2](538600/4588595) loss:2.970 lr:0.0000100 epoch_Time:25709.0min: [2024-01-05 02:38:43,746][model8_pretrain.py][INFO] Epoch:[0/2](538600/4588595) loss:2.606 lr:0.0000100 epoch_Time:25709.0min: [2024-01-05 02:38:43,746][model8_pretrain.py][INFO] Epoch:[0/2](538600/4588595) loss:3.220 lr:0.0000100 epoch_Time:25709.0min: [2024-01-05 02:38:43,746][model8_pretrain.py][INFO] Epoch:[0/2](538600/4588595) loss:2.912 lr:0.0000100 epoch_Time:25709.0min: [2024-01-05 02:38:43,746][model8_pretrain.py][INFO] Epoch:[0/2](538600/4588595) loss:2.952 lr:0.0000100 epoch_Time:25709.0min: [2024-01-05 02:38:43,746][model8_pretrain.py][INFO] Epoch:[0/2](538600/4588595) loss:3.266 lr:0.0000100 epoch_Time:25709.0min: [2024-01-05 02:38:43,746][model8_pretrain.py][INFO] Epoch:[0/2](538600/4588595) loss:2.996 lr:0.0000100 epoch_Time:25709.0min: [2024-01-05 02:38:43,746][model8_pretrain.py][INFO] Epoch:[0/2](538600/4588595) loss:3.001 lr:0.0000100 epoch_Time:25709.0min: [2024-01-05 02:39:20,695][model8_pretrain.py][INFO] Epoch:[0/2](538700/4588595) loss:2.466 lr:0.0000100 epoch_Time:25708.0min: [2024-01-05 02:39:20,695][model8_pretrain.py][INFO] Epoch:[0/2](538700/4588595) loss:3.143 lr:0.0000100 epoch_Time:25708.0min: [2024-01-05 02:39:20,695][model8_pretrain.py][INFO] Epoch:[0/2](538700/4588595) loss:2.600 lr:0.0000100 epoch_Time:25708.0min: [2024-01-05 02:39:20,695][model8_pretrain.py][INFO] Epoch:[0/2](538700/4588595) loss:2.631 lr:0.0000100 epoch_Time:25708.0min: [2024-01-05 02:39:20,695][model8_pretrain.py][INFO] Epoch:[0/2](538700/4588595) loss:2.840 lr:0.0000100 
epoch_Time:25708.0min: [2024-01-05 02:39:20,695][model8_pretrain.py][INFO] Epoch:[0/2](538700/4588595) loss:2.897 lr:0.0000100 epoch_Time:25708.0min: [2024-01-05 02:39:20,695][model8_pretrain.py][INFO] Epoch:[0/2](538700/4588595) loss:2.894 lr:0.0000100 epoch_Time:25708.0min: [2024-01-05 02:39:20,695][model8_pretrain.py][INFO] Epoch:[0/2](538700/4588595) loss:3.110 lr:0.0000100 epoch_Time:25708.0min: [2024-01-05 02:39:57,637][model8_pretrain.py][INFO] Epoch:[0/2](538800/4588595) loss:2.810 lr:0.0000100 epoch_Time:25707.0min: [2024-01-05 02:39:57,637][model8_pretrain.py][INFO] Epoch:[0/2](538800/4588595) loss:2.890 lr:0.0000100 epoch_Time:25707.0min: [2024-01-05 02:39:57,637][model8_pretrain.py][INFO] Epoch:[0/2](538800/4588595) loss:2.922 lr:0.0000100 epoch_Time:25707.0min: [2024-01-05 02:39:57,637][model8_pretrain.py][INFO] Epoch:[0/2](538800/4588595) loss:3.368 lr:0.0000100 epoch_Time:25707.0min: [2024-01-05 02:39:57,638][model8_pretrain.py][INFO] Epoch:[0/2](538800/4588595) loss:2.605 lr:0.0000100 epoch_Time:25707.0min: [2024-01-05 02:39:57,638][model8_pretrain.py][INFO] Epoch:[0/2](538800/4588595) loss:2.786 lr:0.0000100 epoch_Time:25707.0min: [2024-01-05 02:39:57,638][model8_pretrain.py][INFO] Epoch:[0/2](538800/4588595) loss:2.614 lr:0.0000100 epoch_Time:25707.0min: [2024-01-05 02:39:57,638][model8_pretrain.py][INFO] Epoch:[0/2](538800/4588595) loss:3.123 lr:0.0000100 epoch_Time:25707.0min: [2024-01-05 02:40:34,587][model8_pretrain.py][INFO] Epoch:[0/2](538900/4588595) loss:2.530 lr:0.0000100 epoch_Time:25707.0min: [2024-01-05 02:40:34,587][model8_pretrain.py][INFO] Epoch:[0/2](538900/4588595) loss:3.344 lr:0.0000100 epoch_Time:25707.0min: [2024-01-05 02:40:34,587][model8_pretrain.py][INFO] Epoch:[0/2](538900/4588595) loss:2.839 lr:0.0000100 epoch_Time:25707.0min: [2024-01-05 02:40:34,587][model8_pretrain.py][INFO] Epoch:[0/2](538900/4588595) loss:3.044 lr:0.0000100 epoch_Time:25707.0min: [2024-01-05 02:40:34,587][model8_pretrain.py][INFO] 
Epoch:[0/2](538900/4588595) loss:3.037 lr:0.0000100 epoch_Time:25707.0min: [2024-01-05 02:40:34,587][model8_pretrain.py][INFO] Epoch:[0/2](538900/4588595) loss:2.817 lr:0.0000100 epoch_Time:25707.0min: [2024-01-05 02:40:34,587][model8_pretrain.py][INFO] Epoch:[0/2](538900/4588595) loss:2.826 lr:0.0000100 epoch_Time:25707.0min: [2024-01-05 02:40:34,587][model8_pretrain.py][INFO] Epoch:[0/2](538900/4588595) loss:3.150 lr:0.0000100 epoch_Time:25707.0min: [2024-01-05 02:41:11,527][model8_pretrain.py][INFO] Epoch:[0/2](539000/4588595) loss:2.794 lr:0.0000100 epoch_Time:25705.0min: [2024-01-05 02:41:11,527][model8_pretrain.py][INFO] Epoch:[0/2](539000/4588595) loss:2.807 lr:0.0000100 epoch_Time:25705.0min: [2024-01-05 02:41:11,527][model8_pretrain.py][INFO] Epoch:[0/2](539000/4588595) loss:2.754 lr:0.0000100 epoch_Time:25705.0min: [2024-01-05 02:41:11,527][model8_pretrain.py][INFO] Epoch:[0/2](539000/4588595) loss:3.003 lr:0.0000100 epoch_Time:25705.0min: [2024-01-05 02:41:11,527][model8_pretrain.py][INFO] Epoch:[0/2](539000/4588595) loss:2.807 lr:0.0000100 epoch_Time:25705.0min: [2024-01-05 02:41:11,527][model8_pretrain.py][INFO] Epoch:[0/2](539000/4588595) loss:2.331 lr:0.0000100 epoch_Time:25705.0min: [2024-01-05 02:41:11,527][model8_pretrain.py][INFO] Epoch:[0/2](539000/4588595) loss:2.748 lr:0.0000100 epoch_Time:25705.0min: [2024-01-05 02:41:11,527][model8_pretrain.py][INFO] Epoch:[0/2](539000/4588595) loss:2.745 lr:0.0000100 epoch_Time:25705.0min: [2024-01-05 02:42:00,475][model8_pretrain.py][INFO] Epoch:[0/2](539100/4588595) loss:2.257 lr:0.0000100 epoch_Time:25706.0min: [2024-01-05 02:42:00,475][model8_pretrain.py][INFO] Epoch:[0/2](539100/4588595) loss:2.388 lr:0.0000100 epoch_Time:25706.0min: [2024-01-05 02:42:00,475][model8_pretrain.py][INFO] Epoch:[0/2](539100/4588595) loss:2.944 lr:0.0000100 epoch_Time:25706.0min: [2024-01-05 02:42:00,475][model8_pretrain.py][INFO] Epoch:[0/2](539100/4588595) loss:2.733 lr:0.0000100 epoch_Time:25706.0min: [2024-01-05 
02:42:00,475][model8_pretrain.py][INFO] Epoch:[0/2](539100/4588595) loss:2.927 lr:0.0000100 epoch_Time:25706.0min: [2024-01-05 02:42:00,475][model8_pretrain.py][INFO] Epoch:[0/2](539100/4588595) loss:3.137 lr:0.0000100 epoch_Time:25706.0min: [2024-01-05 02:42:00,475][model8_pretrain.py][INFO] Epoch:[0/2](539100/4588595) loss:3.495 lr:0.0000100 epoch_Time:25706.0min: [2024-01-05 02:42:00,475][model8_pretrain.py][INFO] Epoch:[0/2](539100/4588595) loss:2.270 lr:0.0000100 epoch_Time:25706.0min: [2024-01-05 02:42:37,407][model8_pretrain.py][INFO] Epoch:[0/2](539200/4588595) loss:2.371 lr:0.0000100 epoch_Time:25706.0min: [2024-01-05 02:42:37,407][model8_pretrain.py][INFO] Epoch:[0/2](539200/4588595) loss:2.555 lr:0.0000100 epoch_Time:25706.0min: [2024-01-05 02:42:37,407][model8_pretrain.py][INFO] Epoch:[0/2](539200/4588595) loss:2.805 lr:0.0000100 epoch_Time:25706.0min: [2024-01-05 02:42:37,407][model8_pretrain.py][INFO] Epoch:[0/2](539200/4588595) loss:3.431 lr:0.0000100 epoch_Time:25706.0min: [2024-01-05 02:42:37,407][model8_pretrain.py][INFO] Epoch:[0/2](539200/4588595) loss:2.807 lr:0.0000100 epoch_Time:25706.0min: [2024-01-05 02:42:37,407][model8_pretrain.py][INFO] Epoch:[0/2](539200/4588595) loss:3.684 lr:0.0000100 epoch_Time:25706.0min: [2024-01-05 02:42:37,407][model8_pretrain.py][INFO] Epoch:[0/2](539200/4588595) loss:3.097 lr:0.0000100 epoch_Time:25706.0min: [2024-01-05 02:42:37,408][model8_pretrain.py][INFO] Epoch:[0/2](539200/4588595) loss:3.089 lr:0.0000100 epoch_Time:25706.0min: [2024-01-05 02:43:14,348][model8_pretrain.py][INFO] Epoch:[0/2](539300/4588595) loss:2.369 lr:0.0000100 epoch_Time:25705.0min: [2024-01-05 02:43:14,348][model8_pretrain.py][INFO] Epoch:[0/2](539300/4588595) loss:3.321 lr:0.0000100 epoch_Time:25705.0min: [2024-01-05 02:43:14,348][model8_pretrain.py][INFO] Epoch:[0/2](539300/4588595) loss:2.239 lr:0.0000100 epoch_Time:25705.0min: [2024-01-05 02:43:14,348][model8_pretrain.py][INFO] Epoch:[0/2](539300/4588595) loss:3.090 lr:0.0000100 
epoch_Time:25705.0min: [2024-01-05 02:43:14,348][model8_pretrain.py][INFO] Epoch:[0/2](539300/4588595) loss:2.990 lr:0.0000100 epoch_Time:25705.0min: [2024-01-05 02:43:14,348][model8_pretrain.py][INFO] Epoch:[0/2](539300/4588595) loss:3.042 lr:0.0000100 epoch_Time:25705.0min: [2024-01-05 02:43:14,348][model8_pretrain.py][INFO] Epoch:[0/2](539300/4588595) loss:2.681 lr:0.0000100 epoch_Time:25705.0min: [2024-01-05 02:43:14,348][model8_pretrain.py][INFO] Epoch:[0/2](539300/4588595) loss:3.265 lr:0.0000100 epoch_Time:25705.0min: [2024-01-05 02:43:51,286][model8_pretrain.py][INFO] Epoch:[0/2](539400/4588595) loss:2.452 lr:0.0000100 epoch_Time:25703.0min: [2024-01-05 02:43:51,286][model8_pretrain.py][INFO] Epoch:[0/2](539400/4588595) loss:3.176 lr:0.0000100 epoch_Time:25703.0min: [2024-01-05 02:43:51,286][model8_pretrain.py][INFO] Epoch:[0/2](539400/4588595) loss:2.793 lr:0.0000100 epoch_Time:25703.0min: [2024-01-05 02:43:51,286][model8_pretrain.py][INFO] Epoch:[0/2](539400/4588595) loss:2.876 lr:0.0000100 epoch_Time:25703.0min: [2024-01-05 02:43:51,286][model8_pretrain.py][INFO] Epoch:[0/2](539400/4588595) loss:2.904 lr:0.0000100 epoch_Time:25703.0min: [2024-01-05 02:43:51,286][model8_pretrain.py][INFO] Epoch:[0/2](539400/4588595) loss:2.618 lr:0.0000100 epoch_Time:25703.0min: [2024-01-05 02:43:51,286][model8_pretrain.py][INFO] Epoch:[0/2](539400/4588595) loss:2.651 lr:0.0000100 epoch_Time:25703.0min: [2024-01-05 02:43:51,286][model8_pretrain.py][INFO] Epoch:[0/2](539400/4588595) loss:3.133 lr:0.0000100 epoch_Time:25703.0min: [2024-01-05 02:44:28,219][model8_pretrain.py][INFO] Epoch:[0/2](539500/4588595) loss:2.853 lr:0.0000100 epoch_Time:25703.0min: [2024-01-05 02:44:28,219][model8_pretrain.py][INFO] Epoch:[0/2](539500/4588595) loss:2.775 lr:0.0000100 epoch_Time:25703.0min: [2024-01-05 02:44:28,220][model8_pretrain.py][INFO] Epoch:[0/2](539500/4588595) loss:2.572 lr:0.0000100 epoch_Time:25703.0min: [2024-01-05 02:44:28,220][model8_pretrain.py][INFO] 
Epoch:[0/2](539500/4588595) loss:2.059 lr:0.0000100 epoch_Time:25703.0min: [2024-01-05 02:44:28,220][model8_pretrain.py][INFO] Epoch:[0/2](539500/4588595) loss:2.754 lr:0.0000100 epoch_Time:25703.0min: [2024-01-05 02:44:28,220][model8_pretrain.py][INFO] Epoch:[0/2](539500/4588595) loss:2.628 lr:0.0000100 epoch_Time:25703.0min: [2024-01-05 02:44:28,220][model8_pretrain.py][INFO] Epoch:[0/2](539500/4588595) loss:3.055 lr:0.0000100 epoch_Time:25703.0min: [2024-01-05 02:44:28,220][model8_pretrain.py][INFO] Epoch:[0/2](539500/4588595) loss:2.743 lr:0.0000100 epoch_Time:25703.0min: [2024-01-05 02:45:05,149][model8_pretrain.py][INFO] Epoch:[0/2](539600/4588595) loss:2.800 lr:0.0000100 epoch_Time:25702.0min: [2024-01-05 02:45:05,149][model8_pretrain.py][INFO] Epoch:[0/2](539600/4588595) loss:2.451 lr:0.0000100 epoch_Time:25702.0min: [2024-01-05 02:45:05,149][model8_pretrain.py][INFO] Epoch:[0/2](539600/4588595) loss:2.706 lr:0.0000100 epoch_Time:25702.0min: [2024-01-05 02:45:05,149][model8_pretrain.py][INFO] Epoch:[0/2](539600/4588595) loss:2.750 lr:0.0000100 epoch_Time:25702.0min: [2024-01-05 02:45:05,149][model8_pretrain.py][INFO] Epoch:[0/2](539600/4588595) loss:2.720 lr:0.0000100 epoch_Time:25702.0min: [2024-01-05 02:45:05,149][model8_pretrain.py][INFO] Epoch:[0/2](539600/4588595) loss:3.035 lr:0.0000100 epoch_Time:25702.0min: [2024-01-05 02:45:05,149][model8_pretrain.py][INFO] Epoch:[0/2](539600/4588595) loss:2.976 lr:0.0000100 epoch_Time:25702.0min: [2024-01-05 02:45:05,150][model8_pretrain.py][INFO] Epoch:[0/2](539600/4588595) loss:3.645 lr:0.0000100 epoch_Time:25702.0min: [2024-01-05 02:45:42,081][model8_pretrain.py][INFO] Epoch:[0/2](539700/4588595) loss:2.578 lr:0.0000100 epoch_Time:25702.0min: [2024-01-05 02:45:42,081][model8_pretrain.py][INFO] Epoch:[0/2](539700/4588595) loss:2.399 lr:0.0000100 epoch_Time:25702.0min: [2024-01-05 02:45:42,081][model8_pretrain.py][INFO] Epoch:[0/2](539700/4588595) loss:2.891 lr:0.0000100 epoch_Time:25702.0min: [2024-01-05 
02:45:42,081][model8_pretrain.py][INFO] Epoch:[0/2](539700/4588595) loss:3.306 lr:0.0000100 epoch_Time:25702.0min: [2024-01-05 02:45:42,081][model8_pretrain.py][INFO] Epoch:[0/2](539700/4588595) loss:2.739 lr:0.0000100 epoch_Time:25702.0min: [2024-01-05 02:45:42,081][model8_pretrain.py][INFO] Epoch:[0/2](539700/4588595) loss:2.362 lr:0.0000100 epoch_Time:25702.0min: [2024-01-05 02:45:42,081][model8_pretrain.py][INFO] Epoch:[0/2](539700/4588595) loss:2.345 lr:0.0000100 epoch_Time:25702.0min: [2024-01-05 02:45:42,081][model8_pretrain.py][INFO] Epoch:[0/2](539700/4588595) loss:2.050 lr:0.0000100 epoch_Time:25702.0min: [2024-01-05 02:46:19,015][model8_pretrain.py][INFO] Epoch:[0/2](539800/4588595) loss:2.826 lr:0.0000100 epoch_Time:25701.0min: [2024-01-05 02:46:19,015][model8_pretrain.py][INFO] Epoch:[0/2](539800/4588595) loss:2.421 lr:0.0000100 epoch_Time:25701.0min: [2024-01-05 02:46:19,015][model8_pretrain.py][INFO] Epoch:[0/2](539800/4588595) loss:2.750 lr:0.0000100 epoch_Time:25701.0min: [2024-01-05 02:46:19,015][model8_pretrain.py][INFO] Epoch:[0/2](539800/4588595) loss:2.851 lr:0.0000100 epoch_Time:25701.0min: [2024-01-05 02:46:19,015][model8_pretrain.py][INFO] Epoch:[0/2](539800/4588595) loss:2.815 lr:0.0000100 epoch_Time:25701.0min: [2024-01-05 02:46:19,015][model8_pretrain.py][INFO] Epoch:[0/2](539800/4588595) loss:2.885 lr:0.0000100 epoch_Time:25701.0min: [2024-01-05 02:46:19,015][model8_pretrain.py][INFO] Epoch:[0/2](539800/4588595) loss:2.850 lr:0.0000100 epoch_Time:25701.0min: [2024-01-05 02:46:19,015][model8_pretrain.py][INFO] Epoch:[0/2](539800/4588595) loss:2.605 lr:0.0000100 epoch_Time:25701.0min: [2024-01-05 02:47:07,865][model8_pretrain.py][INFO] Epoch:[0/2](539900/4588595) loss:3.236 lr:0.0000100 epoch_Time:25701.0min: [2024-01-05 02:47:07,865][model8_pretrain.py][INFO] Epoch:[0/2](539900/4588595) loss:2.822 lr:0.0000100 epoch_Time:25701.0min: [2024-01-05 02:47:07,865][model8_pretrain.py][INFO] Epoch:[0/2](539900/4588595) loss:1.785 lr:0.0000100 
epoch_Time:25701.0min: [2024-01-05 02:47:07,865][model8_pretrain.py][INFO] Epoch:[0/2](539900/4588595) loss:2.277 lr:0.0000100 epoch_Time:25701.0min: [2024-01-05 02:47:07,865][model8_pretrain.py][INFO] Epoch:[0/2](539900/4588595) loss:2.812 lr:0.0000100 epoch_Time:25701.0min: [2024-01-05 02:47:07,865][model8_pretrain.py][INFO] Epoch:[0/2](539900/4588595) loss:2.823 lr:0.0000100 epoch_Time:25701.0min: [2024-01-05 02:47:07,865][model8_pretrain.py][INFO] Epoch:[0/2](539900/4588595) loss:2.229 lr:0.0000100 epoch_Time:25701.0min: [2024-01-05 02:47:07,865][model8_pretrain.py][INFO] Epoch:[0/2](539900/4588595) loss:2.771 lr:0.0000100 epoch_Time:25701.0min: [2024-01-05 02:47:44,790][model8_pretrain.py][INFO] Epoch:[0/2](540000/4588595) loss:2.478 lr:0.0000100 epoch_Time:25701.0min: [2024-01-05 02:47:44,790][model8_pretrain.py][INFO] Epoch:[0/2](540000/4588595) loss:3.052 lr:0.0000100 epoch_Time:25701.0min: [2024-01-05 02:47:44,790][model8_pretrain.py][INFO] Epoch:[0/2](540000/4588595) loss:2.519 lr:0.0000100 epoch_Time:25701.0min: [2024-01-05 02:47:44,790][model8_pretrain.py][INFO] Epoch:[0/2](540000/4588595) loss:3.152 lr:0.0000100 epoch_Time:25701.0min: [2024-01-05 02:47:44,790][model8_pretrain.py][INFO] Epoch:[0/2](540000/4588595) loss:2.970 lr:0.0000100 epoch_Time:25701.0min: [2024-01-05 02:47:44,790][model8_pretrain.py][INFO] Epoch:[0/2](540000/4588595) loss:2.671 lr:0.0000100 epoch_Time:25701.0min: [2024-01-05 02:47:44,790][model8_pretrain.py][INFO] Epoch:[0/2](540000/4588595) loss:3.131 lr:0.0000100 epoch_Time:25701.0min: [2024-01-05 02:47:44,790][model8_pretrain.py][INFO] Epoch:[0/2](540000/4588595) loss:2.571 lr:0.0000100 epoch_Time:25701.0min: [2024-01-05 02:48:21,721][model8_pretrain.py][INFO] Epoch:[0/2](540100/4588595) loss:2.985 lr:0.0000100 epoch_Time:25700.0min: [2024-01-05 02:48:21,721][model8_pretrain.py][INFO] Epoch:[0/2](540100/4588595) loss:2.800 lr:0.0000100 epoch_Time:25700.0min: [2024-01-05 02:48:21,721][model8_pretrain.py][INFO] 
Epoch:[0/2](540100/4588595) loss:2.809 lr:0.0000100 epoch_Time:25700.0min: [2024-01-05 02:48:21,721][model8_pretrain.py][INFO] Epoch:[0/2](540100/4588595) loss:2.585 lr:0.0000100 epoch_Time:25700.0min: [2024-01-05 02:48:21,721][model8_pretrain.py][INFO] Epoch:[0/2](540100/4588595) loss:2.860 lr:0.0000100 epoch_Time:25700.0min: [2024-01-05 02:48:21,721][model8_pretrain.py][INFO] Epoch:[0/2](540100/4588595) loss:3.344 lr:0.0000100 epoch_Time:25700.0min: [2024-01-05 02:48:21,721][model8_pretrain.py][INFO] Epoch:[0/2](540100/4588595) loss:2.408 lr:0.0000100 epoch_Time:25700.0min: [2024-01-05 02:48:21,721][model8_pretrain.py][INFO] Epoch:[0/2](540100/4588595) loss:2.881 lr:0.0000100 epoch_Time:25700.0min: [2024-01-05 02:48:58,658][model8_pretrain.py][INFO] Epoch:[0/2](540200/4588595) loss:3.171 lr:0.0000100 epoch_Time:25699.0min: [2024-01-05 02:48:58,658][model8_pretrain.py][INFO] Epoch:[0/2](540200/4588595) loss:2.557 lr:0.0000100 epoch_Time:25699.0min: [2024-01-05 02:48:58,658][model8_pretrain.py][INFO] Epoch:[0/2](540200/4588595) loss:2.499 lr:0.0000100 epoch_Time:25699.0min: [2024-01-05 02:48:58,658][model8_pretrain.py][INFO] Epoch:[0/2](540200/4588595) loss:3.129 lr:0.0000100 epoch_Time:25699.0min: [2024-01-05 02:48:58,658][model8_pretrain.py][INFO] Epoch:[0/2](540200/4588595) loss:2.751 lr:0.0000100 epoch_Time:25699.0min: [2024-01-05 02:48:58,659][model8_pretrain.py][INFO] Epoch:[0/2](540200/4588595) loss:2.508 lr:0.0000100 epoch_Time:25699.0min: [2024-01-05 02:48:58,659][model8_pretrain.py][INFO] Epoch:[0/2](540200/4588595) loss:2.677 lr:0.0000100 epoch_Time:25699.0min: [2024-01-05 02:48:58,659][model8_pretrain.py][INFO] Epoch:[0/2](540200/4588595) loss:3.494 lr:0.0000100 epoch_Time:25699.0min: [2024-01-05 02:49:35,591][model8_pretrain.py][INFO] Epoch:[0/2](540300/4588595) loss:2.326 lr:0.0000100 epoch_Time:25699.0min: [2024-01-05 02:49:35,591][model8_pretrain.py][INFO] Epoch:[0/2](540300/4588595) loss:2.603 lr:0.0000100 epoch_Time:25699.0min: [2024-01-05 
02:49:35,591][model8_pretrain.py][INFO] Epoch:[0/2](540300/4588595) loss:3.194 lr:0.0000100 epoch_Time:25699.0min: [2024-01-05 02:49:35,591][model8_pretrain.py][INFO] Epoch:[0/2](540300/4588595) loss:2.770 lr:0.0000100 epoch_Time:25699.0min: [2024-01-05 02:49:35,591][model8_pretrain.py][INFO] Epoch:[0/2](540300/4588595) loss:2.978 lr:0.0000100 epoch_Time:25699.0min: [2024-01-05 02:49:35,591][model8_pretrain.py][INFO] Epoch:[0/2](540300/4588595) loss:2.021 lr:0.0000100 epoch_Time:25699.0min: [2024-01-05 02:49:35,591][model8_pretrain.py][INFO] Epoch:[0/2](540300/4588595) loss:3.022 lr:0.0000100 epoch_Time:25699.0min: [2024-01-05 02:49:35,591][model8_pretrain.py][INFO] Epoch:[0/2](540300/4588595) loss:3.069 lr:0.0000100 epoch_Time:25699.0min: [2024-01-05 02:50:12,530][model8_pretrain.py][INFO] Epoch:[0/2](540400/4588595) loss:2.806 lr:0.0000100 epoch_Time:25697.0min: [2024-01-05 02:50:12,530][model8_pretrain.py][INFO] Epoch:[0/2](540400/4588595) loss:2.879 lr:0.0000100 epoch_Time:25697.0min: [2024-01-05 02:50:12,530][model8_pretrain.py][INFO] Epoch:[0/2](540400/4588595) loss:2.378 lr:0.0000100 epoch_Time:25697.0min: [2024-01-05 02:50:12,530][model8_pretrain.py][INFO] Epoch:[0/2](540400/4588595) loss:3.013 lr:0.0000100 epoch_Time:25697.0min: [2024-01-05 02:50:12,530][model8_pretrain.py][INFO] Epoch:[0/2](540400/4588595) loss:2.563 lr:0.0000100 epoch_Time:25697.0min: [2024-01-05 02:50:12,530][model8_pretrain.py][INFO] Epoch:[0/2](540400/4588595) loss:3.111 lr:0.0000100 epoch_Time:25697.0min: [2024-01-05 02:50:12,530][model8_pretrain.py][INFO] Epoch:[0/2](540400/4588595) loss:2.712 lr:0.0000100 epoch_Time:25697.0min: [2024-01-05 02:50:12,530][model8_pretrain.py][INFO] Epoch:[0/2](540400/4588595) loss:2.715 lr:0.0000100 epoch_Time:25697.0min: [2024-01-05 02:50:49,488][model8_pretrain.py][INFO] Epoch:[0/2](540500/4588595) loss:2.436 lr:0.0000100 epoch_Time:25696.0min: [2024-01-05 02:50:49,488][model8_pretrain.py][INFO] Epoch:[0/2](540500/4588595) loss:3.156 lr:0.0000100 
epoch_Time:25696.0min: [2024-01-05 02:50:49,488][model8_pretrain.py][INFO] Epoch:[0/2](540500/4588595) loss:3.008 lr:0.0000100 epoch_Time:25696.0min: [2024-01-05 02:50:49,488][model8_pretrain.py][INFO] Epoch:[0/2](540500/4588595) loss:2.564 lr:0.0000100 epoch_Time:25696.0min: [2024-01-05 02:50:49,488][model8_pretrain.py][INFO] Epoch:[0/2](540500/4588595) loss:2.418 lr:0.0000100 epoch_Time:25696.0min: [2024-01-05 02:50:49,489][model8_pretrain.py][INFO] Epoch:[0/2](540500/4588595) loss:2.460 lr:0.0000100 epoch_Time:25696.0min: [2024-01-05 02:50:49,489][model8_pretrain.py][INFO] Epoch:[0/2](540500/4588595) loss:3.139 lr:0.0000100 epoch_Time:25696.0min: [2024-01-05 02:50:49,489][model8_pretrain.py][INFO] Epoch:[0/2](540500/4588595) loss:2.837 lr:0.0000100 epoch_Time:25696.0min: [2024-01-05 02:51:26,435][model8_pretrain.py][INFO] Epoch:[0/2](540600/4588595) loss:2.762 lr:0.0000100 epoch_Time:25696.0min: [2024-01-05 02:51:26,435][model8_pretrain.py][INFO] Epoch:[0/2](540600/4588595) loss:2.935 lr:0.0000100 epoch_Time:25696.0min: [2024-01-05 02:51:26,435][model8_pretrain.py][INFO] Epoch:[0/2](540600/4588595) loss:2.780 lr:0.0000100 epoch_Time:25696.0min: [2024-01-05 02:51:26,435][model8_pretrain.py][INFO] Epoch:[0/2](540600/4588595) loss:2.385 lr:0.0000100 epoch_Time:25696.0min: [2024-01-05 02:51:26,435][model8_pretrain.py][INFO] Epoch:[0/2](540600/4588595) loss:3.396 lr:0.0000100 epoch_Time:25696.0min: [2024-01-05 02:51:26,435][model8_pretrain.py][INFO] Epoch:[0/2](540600/4588595) loss:2.944 lr:0.0000100 epoch_Time:25696.0min: [2024-01-05 02:51:26,435][model8_pretrain.py][INFO] Epoch:[0/2](540600/4588595) loss:2.953 lr:0.0000100 epoch_Time:25696.0min: [2024-01-05 02:51:26,435][model8_pretrain.py][INFO] Epoch:[0/2](540600/4588595) loss:2.556 lr:0.0000100 epoch_Time:25696.0min: [2024-01-05 02:52:15,530][model8_pretrain.py][INFO] Epoch:[0/2](540700/4588595) loss:3.094 lr:0.0000100 epoch_Time:25697.0min: [2024-01-05 02:52:15,530][model8_pretrain.py][INFO] 
Epoch:[0/2](540700/4588595) loss:2.538 lr:0.0000100 epoch_Time:25697.0min: [2024-01-05 02:52:15,530][model8_pretrain.py][INFO] Epoch:[0/2](540700/4588595) loss:2.974 lr:0.0000100 epoch_Time:25697.0min: [2024-01-05 02:52:15,530][model8_pretrain.py][INFO] Epoch:[0/2](540700/4588595) loss:3.073 lr:0.0000100 epoch_Time:25697.0min: [2024-01-05 02:52:15,530][model8_pretrain.py][INFO] Epoch:[0/2](540700/4588595) loss:2.512 lr:0.0000100 epoch_Time:25697.0min: [2024-01-05 02:52:15,530][model8_pretrain.py][INFO] Epoch:[0/2](540700/4588595) loss:2.665 lr:0.0000100 epoch_Time:25697.0min: [2024-01-05 02:52:15,530][model8_pretrain.py][INFO] Epoch:[0/2](540700/4588595) loss:2.915 lr:0.0000100 epoch_Time:25697.0min: [2024-01-05 02:52:15,531][model8_pretrain.py][INFO] Epoch:[0/2](540700/4588595) loss:3.339 lr:0.0000100 epoch_Time:25697.0min: [2024-01-05 02:52:52,466][model8_pretrain.py][INFO] Epoch:[0/2](540800/4588595) loss:2.384 lr:0.0000100 epoch_Time:25696.0min: [2024-01-05 02:52:52,466][model8_pretrain.py][INFO] Epoch:[0/2](540800/4588595) loss:2.788 lr:0.0000100 epoch_Time:25696.0min: [2024-01-05 02:52:52,466][model8_pretrain.py][INFO] Epoch:[0/2](540800/4588595) loss:2.596 lr:0.0000100 epoch_Time:25696.0min: [2024-01-05 02:52:52,466][model8_pretrain.py][INFO] Epoch:[0/2](540800/4588595) loss:2.580 lr:0.0000100 epoch_Time:25696.0min: [2024-01-05 02:52:52,466][model8_pretrain.py][INFO] Epoch:[0/2](540800/4588595) loss:3.194 lr:0.0000100 epoch_Time:25696.0min: [2024-01-05 02:52:52,466][model8_pretrain.py][INFO] Epoch:[0/2](540800/4588595) loss:2.838 lr:0.0000100 epoch_Time:25696.0min: [2024-01-05 02:52:52,466][model8_pretrain.py][INFO] Epoch:[0/2](540800/4588595) loss:3.039 lr:0.0000100 epoch_Time:25696.0min: [2024-01-05 02:52:52,466][model8_pretrain.py][INFO] Epoch:[0/2](540800/4588595) loss:2.466 lr:0.0000100 epoch_Time:25696.0min: [2024-01-05 02:53:29,404][model8_pretrain.py][INFO] Epoch:[0/2](540900/4588595) loss:2.940 lr:0.0000100 epoch_Time:25695.0min: [2024-01-05 
02:53:29,404][model8_pretrain.py][INFO] Epoch:[0/2](540900/4588595) loss:2.878 lr:0.0000100 epoch_Time:25695.0min: [2024-01-05 02:53:29,404][model8_pretrain.py][INFO] Epoch:[0/2](540900/4588595) loss:2.416 lr:0.0000100 epoch_Time:25695.0min: [2024-01-05 02:53:29,404][model8_pretrain.py][INFO] Epoch:[0/2](540900/4588595) loss:3.235 lr:0.0000100 epoch_Time:25695.0min: [2024-01-05 02:53:29,404][model8_pretrain.py][INFO] Epoch:[0/2](540900/4588595) loss:3.127 lr:0.0000100 epoch_Time:25695.0min: [2024-01-05 02:53:29,404][model8_pretrain.py][INFO] Epoch:[0/2](540900/4588595) loss:2.757 lr:0.0000100 epoch_Time:25695.0min: [2024-01-05 02:53:29,404][model8_pretrain.py][INFO] Epoch:[0/2](540900/4588595) loss:2.324 lr:0.0000100 epoch_Time:25695.0min: [2024-01-05 02:53:29,404][model8_pretrain.py][INFO] Epoch:[0/2](540900/4588595) loss:2.651 lr:0.0000100 epoch_Time:25695.0min: [2024-01-05 02:54:06,344][model8_pretrain.py][INFO] Epoch:[0/2](541000/4588595) loss:3.049 lr:0.0000100 epoch_Time:25694.0min: [2024-01-05 02:54:06,344][model8_pretrain.py][INFO] Epoch:[0/2](541000/4588595) loss:3.074 lr:0.0000100 epoch_Time:25694.0min: [2024-01-05 02:54:06,344][model8_pretrain.py][INFO] Epoch:[0/2](541000/4588595) loss:2.964 lr:0.0000100 epoch_Time:25694.0min: [2024-01-05 02:54:06,344][model8_pretrain.py][INFO] Epoch:[0/2](541000/4588595) loss:3.110 lr:0.0000100 epoch_Time:25694.0min: [2024-01-05 02:54:06,344][model8_pretrain.py][INFO] Epoch:[0/2](541000/4588595) loss:3.006 lr:0.0000100 epoch_Time:25694.0min: [2024-01-05 02:54:06,344][model8_pretrain.py][INFO] Epoch:[0/2](541000/4588595) loss:3.021 lr:0.0000100 epoch_Time:25694.0min: [2024-01-05 02:54:06,345][model8_pretrain.py][INFO] Epoch:[0/2](541000/4588595) loss:2.768 lr:0.0000100 epoch_Time:25694.0min: [2024-01-05 02:54:06,345][model8_pretrain.py][INFO] Epoch:[0/2](541000/4588595) loss:3.115 lr:0.0000100 epoch_Time:25694.0min: [2024-01-05 02:54:43,279][model8_pretrain.py][INFO] Epoch:[0/2](541100/4588595) loss:3.464 lr:0.0000100 
epoch_Time:25694.0min: [2024-01-05 02:54:43,279][model8_pretrain.py][INFO] Epoch:[0/2](541100/4588595) loss:3.090 lr:0.0000100 epoch_Time:25694.0min: [2024-01-05 02:54:43,279][model8_pretrain.py][INFO] Epoch:[0/2](541100/4588595) loss:2.688 lr:0.0000100 epoch_Time:25694.0min: [2024-01-05 02:54:43,279][model8_pretrain.py][INFO] Epoch:[0/2](541100/4588595) loss:3.231 lr:0.0000100 epoch_Time:25694.0min: [2024-01-05 02:54:43,279][model8_pretrain.py][INFO] Epoch:[0/2](541100/4588595) loss:2.933 lr:0.0000100 epoch_Time:25694.0min: [2024-01-05 02:54:43,279][model8_pretrain.py][INFO] Epoch:[0/2](541100/4588595) loss:2.537 lr:0.0000100 epoch_Time:25694.0min: [2024-01-05 02:54:43,279][model8_pretrain.py][INFO] Epoch:[0/2](541100/4588595) loss:2.935 lr:0.0000100 epoch_Time:25694.0min: [2024-01-05 02:54:43,279][model8_pretrain.py][INFO] Epoch:[0/2](541100/4588595) loss:2.733 lr:0.0000100 epoch_Time:25694.0min: [2024-01-05 02:55:20,257][model8_pretrain.py][INFO] Epoch:[0/2](541200/4588595) loss:3.402 lr:0.0000100 epoch_Time:25693.0min: [2024-01-05 02:55:20,257][model8_pretrain.py][INFO] Epoch:[0/2](541200/4588595) loss:2.776 lr:0.0000100 epoch_Time:25693.0min: [2024-01-05 02:55:20,257][model8_pretrain.py][INFO] Epoch:[0/2](541200/4588595) loss:2.663 lr:0.0000100 epoch_Time:25693.0min: [2024-01-05 02:55:20,257][model8_pretrain.py][INFO] Epoch:[0/2](541200/4588595) loss:2.947 lr:0.0000100 epoch_Time:25693.0min: [2024-01-05 02:55:20,257][model8_pretrain.py][INFO] Epoch:[0/2](541200/4588595) loss:2.859 lr:0.0000100 epoch_Time:25693.0min: [2024-01-05 02:55:20,257][model8_pretrain.py][INFO] Epoch:[0/2](541200/4588595) loss:2.647 lr:0.0000100 epoch_Time:25693.0min: [2024-01-05 02:55:20,258][model8_pretrain.py][INFO] Epoch:[0/2](541200/4588595) loss:3.056 lr:0.0000100 epoch_Time:25693.0min: [2024-01-05 02:55:20,258][model8_pretrain.py][INFO] Epoch:[0/2](541200/4588595) loss:3.329 lr:0.0000100 epoch_Time:25693.0min: [2024-01-05 02:55:57,184][model8_pretrain.py][INFO] 
Epoch:[0/2](541300/4588595) loss:2.723 lr:0.0000100 epoch_Time:25692.0min: [2024-01-05 02:55:57,184][model8_pretrain.py][INFO] Epoch:[0/2](541300/4588595) loss:3.037 lr:0.0000100 epoch_Time:25692.0min: [2024-01-05 02:55:57,184][model8_pretrain.py][INFO] Epoch:[0/2](541300/4588595) loss:2.967 lr:0.0000100 epoch_Time:25692.0min: [2024-01-05 02:55:57,184][model8_pretrain.py][INFO] Epoch:[0/2](541300/4588595) loss:3.006 lr:0.0000100 epoch_Time:25692.0min: [2024-01-05 02:55:57,184][model8_pretrain.py][INFO] Epoch:[0/2](541300/4588595) loss:3.136 lr:0.0000100 epoch_Time:25692.0min: [2024-01-05 02:55:57,184][model8_pretrain.py][INFO] Epoch:[0/2](541300/4588595) loss:2.962 lr:0.0000100 epoch_Time:25692.0min: [2024-01-05 02:55:57,184][model8_pretrain.py][INFO] Epoch:[0/2](541300/4588595) loss:2.295 lr:0.0000100 epoch_Time:25692.0min: [2024-01-05 02:55:57,184][model8_pretrain.py][INFO] Epoch:[0/2](541300/4588595) loss:2.640 lr:0.0000100 epoch_Time:25692.0min: [2024-01-05 02:56:34,105][model8_pretrain.py][INFO] Epoch:[0/2](541400/4588595) loss:2.860 lr:0.0000100 epoch_Time:25692.0min: [2024-01-05 02:56:34,105][model8_pretrain.py][INFO] Epoch:[0/2](541400/4588595) loss:2.931 lr:0.0000100 epoch_Time:25692.0min: [2024-01-05 02:56:34,105][model8_pretrain.py][INFO] Epoch:[0/2](541400/4588595) loss:2.789 lr:0.0000100 epoch_Time:25692.0min: [2024-01-05 02:56:34,105][model8_pretrain.py][INFO] Epoch:[0/2](541400/4588595) loss:2.532 lr:0.0000100 epoch_Time:25692.0min: [2024-01-05 02:56:34,105][model8_pretrain.py][INFO] Epoch:[0/2](541400/4588595) loss:3.221 lr:0.0000100 epoch_Time:25692.0min: [2024-01-05 02:56:34,105][model8_pretrain.py][INFO] Epoch:[0/2](541400/4588595) loss:3.232 lr:0.0000100 epoch_Time:25692.0min: [2024-01-05 02:56:34,105][model8_pretrain.py][INFO] Epoch:[0/2](541400/4588595) loss:2.769 lr:0.0000100 epoch_Time:25692.0min: [2024-01-05 02:56:34,105][model8_pretrain.py][INFO] Epoch:[0/2](541400/4588595) loss:2.533 lr:0.0000100 epoch_Time:25692.0min: [2024-01-05 
02:57:22,883][model8_pretrain.py][INFO] Epoch:[0/2](541500/4588595) loss:3.216 lr:0.0000100 epoch_Time:25692.0min: [2024-01-05 02:57:22,883][model8_pretrain.py][INFO] Epoch:[0/2](541500/4588595) loss:2.362 lr:0.0000100 epoch_Time:25692.0min: [2024-01-05 02:57:22,883][model8_pretrain.py][INFO] Epoch:[0/2](541500/4588595) loss:2.830 lr:0.0000100 epoch_Time:25692.0min: [2024-01-05 02:57:22,883][model8_pretrain.py][INFO] Epoch:[0/2](541500/4588595) loss:2.682 lr:0.0000100 epoch_Time:25692.0min: [2024-01-05 02:57:22,883][model8_pretrain.py][INFO] Epoch:[0/2](541500/4588595) loss:3.008 lr:0.0000100 epoch_Time:25692.0min: [2024-01-05 02:57:22,883][model8_pretrain.py][INFO] Epoch:[0/2](541500/4588595) loss:2.836 lr:0.0000100 epoch_Time:25692.0min: [2024-01-05 02:57:22,883][model8_pretrain.py][INFO] Epoch:[0/2](541500/4588595) loss:2.961 lr:0.0000100 epoch_Time:25692.0min: [2024-01-05 02:57:22,883][model8_pretrain.py][INFO] Epoch:[0/2](541500/4588595) loss:2.624 lr:0.0000100 epoch_Time:25692.0min: [2024-01-05 02:57:59,810][model8_pretrain.py][INFO] Epoch:[0/2](541600/4588595) loss:3.072 lr:0.0000100 epoch_Time:25691.0min: [2024-01-05 02:57:59,810][model8_pretrain.py][INFO] Epoch:[0/2](541600/4588595) loss:3.290 lr:0.0000100 epoch_Time:25691.0min: [2024-01-05 02:57:59,811][model8_pretrain.py][INFO] Epoch:[0/2](541600/4588595) loss:3.023 lr:0.0000100 epoch_Time:25691.0min: [2024-01-05 02:57:59,811][model8_pretrain.py][INFO] Epoch:[0/2](541600/4588595) loss:2.959 lr:0.0000100 epoch_Time:25691.0min: [2024-01-05 02:57:59,811][model8_pretrain.py][INFO] Epoch:[0/2](541600/4588595) loss:2.475 lr:0.0000100 epoch_Time:25691.0min: [2024-01-05 02:57:59,811][model8_pretrain.py][INFO] Epoch:[0/2](541600/4588595) loss:2.568 lr:0.0000100 epoch_Time:25691.0min: [2024-01-05 02:57:59,811][model8_pretrain.py][INFO] Epoch:[0/2](541600/4588595) loss:3.356 lr:0.0000100 epoch_Time:25691.0min: [2024-01-05 02:57:59,811][model8_pretrain.py][INFO] Epoch:[0/2](541600/4588595) loss:2.771 lr:0.0000100 
epoch_Time:25691.0min: [2024-01-05 02:58:36,743][model8_pretrain.py][INFO] Epoch:[0/2](541700/4588595) loss:2.539 lr:0.0000100 epoch_Time:25691.0min: [2024-01-05 02:58:36,743][model8_pretrain.py][INFO] Epoch:[0/2](541700/4588595) loss:3.082 lr:0.0000100 epoch_Time:25691.0min: [2024-01-05 02:58:36,743][model8_pretrain.py][INFO] Epoch:[0/2](541700/4588595) loss:2.720 lr:0.0000100 epoch_Time:25691.0min: [2024-01-05 02:58:36,743][model8_pretrain.py][INFO] Epoch:[0/2](541700/4588595) loss:3.233 lr:0.0000100 epoch_Time:25691.0min: [2024-01-05 02:58:36,743][model8_pretrain.py][INFO] Epoch:[0/2](541700/4588595) loss:3.078 lr:0.0000100 epoch_Time:25691.0min: [2024-01-05 02:58:36,743][model8_pretrain.py][INFO] Epoch:[0/2](541700/4588595) loss:2.786 lr:0.0000100 epoch_Time:25691.0min: [2024-01-05 02:58:36,743][model8_pretrain.py][INFO] Epoch:[0/2](541700/4588595) loss:3.346 lr:0.0000100 epoch_Time:25691.0min: [2024-01-05 02:58:36,743][model8_pretrain.py][INFO] Epoch:[0/2](541700/4588595) loss:3.039 lr:0.0000100 epoch_Time:25691.0min: [2024-01-05 02:59:13,682][model8_pretrain.py][INFO] Epoch:[0/2](541800/4588595) loss:3.302 lr:0.0000100 epoch_Time:25690.0min: [2024-01-05 02:59:13,682][model8_pretrain.py][INFO] Epoch:[0/2](541800/4588595) loss:2.905 lr:0.0000100 epoch_Time:25690.0min: [2024-01-05 02:59:13,682][model8_pretrain.py][INFO] Epoch:[0/2](541800/4588595) loss:3.046 lr:0.0000100 epoch_Time:25690.0min: [2024-01-05 02:59:13,682][model8_pretrain.py][INFO] Epoch:[0/2](541800/4588595) loss:3.074 lr:0.0000100 epoch_Time:25690.0min: [2024-01-05 02:59:13,682][model8_pretrain.py][INFO] Epoch:[0/2](541800/4588595) loss:2.860 lr:0.0000100 epoch_Time:25690.0min: [2024-01-05 02:59:13,682][model8_pretrain.py][INFO] Epoch:[0/2](541800/4588595) loss:2.274 lr:0.0000100 epoch_Time:25690.0min: [2024-01-05 02:59:13,682][model8_pretrain.py][INFO] Epoch:[0/2](541800/4588595) loss:3.038 lr:0.0000100 epoch_Time:25690.0min: [2024-01-05 02:59:13,682][model8_pretrain.py][INFO] 
Epoch:[0/2](541800/4588595) loss:2.877 lr:0.0000100 epoch_Time:25690.0min: [2024-01-05 02:59:50,629][model8_pretrain.py][INFO] Epoch:[0/2](541900/4588595) loss:3.128 lr:0.0000100 epoch_Time:25688.0min: [2024-01-05 02:59:50,629][model8_pretrain.py][INFO] Epoch:[0/2](541900/4588595) loss:3.164 lr:0.0000100 epoch_Time:25688.0min: [2024-01-05 02:59:50,629][model8_pretrain.py][INFO] Epoch:[0/2](541900/4588595) loss:2.808 lr:0.0000100 epoch_Time:25688.0min: [2024-01-05 02:59:50,629][model8_pretrain.py][INFO] Epoch:[0/2](541900/4588595) loss:3.039 lr:0.0000100 epoch_Time:25688.0min: [2024-01-05 02:59:50,629][model8_pretrain.py][INFO] Epoch:[0/2](541900/4588595) loss:3.151 lr:0.0000100 epoch_Time:25688.0min: [2024-01-05 02:59:50,629][model8_pretrain.py][INFO] Epoch:[0/2](541900/4588595) loss:2.953 lr:0.0000100 epoch_Time:25688.0min: [2024-01-05 02:59:50,629][model8_pretrain.py][INFO] Epoch:[0/2](541900/4588595) loss:2.888 lr:0.0000100 epoch_Time:25688.0min: [2024-01-05 02:59:50,630][model8_pretrain.py][INFO] Epoch:[0/2](541900/4588595) loss:2.615 lr:0.0000100 epoch_Time:25688.0min: [2024-01-05 03:00:27,555][model8_pretrain.py][INFO] Epoch:[0/2](542000/4588595) loss:3.027 lr:0.0000100 epoch_Time:25688.0min: [2024-01-05 03:00:27,555][model8_pretrain.py][INFO] Epoch:[0/2](542000/4588595) loss:2.804 lr:0.0000100 epoch_Time:25688.0min: [2024-01-05 03:00:27,555][model8_pretrain.py][INFO] Epoch:[0/2](542000/4588595) loss:2.881 lr:0.0000100 epoch_Time:25688.0min: [2024-01-05 03:00:27,555][model8_pretrain.py][INFO] Epoch:[0/2](542000/4588595) loss:3.274 lr:0.0000100 epoch_Time:25688.0min: [2024-01-05 03:00:27,555][model8_pretrain.py][INFO] Epoch:[0/2](542000/4588595) loss:3.108 lr:0.0000100 epoch_Time:25688.0min: [2024-01-05 03:00:27,555][model8_pretrain.py][INFO] Epoch:[0/2](542000/4588595) loss:3.032 lr:0.0000100 epoch_Time:25688.0min: [2024-01-05 03:00:27,556][model8_pretrain.py][INFO] Epoch:[0/2](542000/4588595) loss:2.925 lr:0.0000100 epoch_Time:25688.0min: [2024-01-05 
03:00:27,556][model8_pretrain.py][INFO] Epoch:[0/2](542000/4588595) loss:2.744 lr:0.0000100 epoch_Time:25688.0min: [2024-01-05 03:01:04,483][model8_pretrain.py][INFO] Epoch:[0/2](542100/4588595) loss:3.048 lr:0.0000100 epoch_Time:25687.0min: [2024-01-05 03:01:04,483][model8_pretrain.py][INFO] Epoch:[0/2](542100/4588595) loss:2.666 lr:0.0000100 epoch_Time:25687.0min: [2024-01-05 03:01:04,483][model8_pretrain.py][INFO] Epoch:[0/2](542100/4588595) loss:2.844 lr:0.0000100 epoch_Time:25687.0min: [2024-01-05 03:01:04,483][model8_pretrain.py][INFO] Epoch:[0/2](542100/4588595) loss:2.395 lr:0.0000100 epoch_Time:25687.0min: [2024-01-05 03:01:04,483][model8_pretrain.py][INFO] Epoch:[0/2](542100/4588595) loss:3.179 lr:0.0000100 epoch_Time:25687.0min: [2024-01-05 03:01:04,483][model8_pretrain.py][INFO] Epoch:[0/2](542100/4588595) loss:2.240 lr:0.0000100 epoch_Time:25687.0min: [2024-01-05 03:01:04,483][model8_pretrain.py][INFO] Epoch:[0/2](542100/4588595) loss:3.058 lr:0.0000100 epoch_Time:25687.0min: [2024-01-05 03:01:04,483][model8_pretrain.py][INFO] Epoch:[0/2](542100/4588595) loss:3.382 lr:0.0000100 epoch_Time:25687.0min: [2024-01-05 03:01:41,409][model8_pretrain.py][INFO] Epoch:[0/2](542200/4588595) loss:3.224 lr:0.0000100 epoch_Time:25687.0min: [2024-01-05 03:01:41,409][model8_pretrain.py][INFO] Epoch:[0/2](542200/4588595) loss:2.768 lr:0.0000100 epoch_Time:25687.0min: [2024-01-05 03:01:41,409][model8_pretrain.py][INFO] Epoch:[0/2](542200/4588595) loss:3.169 lr:0.0000100 epoch_Time:25687.0min: [2024-01-05 03:01:41,409][model8_pretrain.py][INFO] Epoch:[0/2](542200/4588595) loss:2.735 lr:0.0000100 epoch_Time:25687.0min: [2024-01-05 03:01:41,409][model8_pretrain.py][INFO] Epoch:[0/2](542200/4588595) loss:2.893 lr:0.0000100 epoch_Time:25687.0min: [2024-01-05 03:01:41,409][model8_pretrain.py][INFO] Epoch:[0/2](542200/4588595) loss:3.135 lr:0.0000100 epoch_Time:25687.0min: [2024-01-05 03:01:41,409][model8_pretrain.py][INFO] Epoch:[0/2](542200/4588595) loss:2.945 lr:0.0000100 
epoch_Time:25687.0min: [2024-01-05 03:01:41,409][model8_pretrain.py][INFO] Epoch:[0/2](542200/4588595) loss:2.756 lr:0.0000100 epoch_Time:25687.0min: [2024-01-05 03:02:30,422][model8_pretrain.py][INFO] Epoch:[0/2](542300/4588595) loss:2.875 lr:0.0000100 epoch_Time:25687.0min: [2024-01-05 03:02:30,422][model8_pretrain.py][INFO] Epoch:[0/2](542300/4588595) loss:3.090 lr:0.0000100 epoch_Time:25687.0min: [2024-01-05 03:02:30,423][model8_pretrain.py][INFO] Epoch:[0/2](542300/4588595) loss:3.193 lr:0.0000100 epoch_Time:25687.0min: [2024-01-05 03:02:30,423][model8_pretrain.py][INFO] Epoch:[0/2](542300/4588595) loss:1.838 lr:0.0000100 epoch_Time:25687.0min: [2024-01-05 03:02:30,423][model8_pretrain.py][INFO] Epoch:[0/2](542300/4588595) loss:2.953 lr:0.0000100 epoch_Time:25687.0min: [2024-01-05 03:02:30,423][model8_pretrain.py][INFO] Epoch:[0/2](542300/4588595) loss:2.608 lr:0.0000100 epoch_Time:25687.0min: [2024-01-05 03:02:30,423][model8_pretrain.py][INFO] Epoch:[0/2](542300/4588595) loss:3.072 lr:0.0000100 epoch_Time:25687.0min: [2024-01-05 03:02:30,423][model8_pretrain.py][INFO] Epoch:[0/2](542300/4588595) loss:3.199 lr:0.0000100 epoch_Time:25687.0min: [2024-01-05 03:03:07,360][model8_pretrain.py][INFO] Epoch:[0/2](542400/4588595) loss:3.178 lr:0.0000100 epoch_Time:25686.0min: [2024-01-05 03:03:07,360][model8_pretrain.py][INFO] Epoch:[0/2](542400/4588595) loss:2.511 lr:0.0000100 epoch_Time:25686.0min: [2024-01-05 03:03:07,360][model8_pretrain.py][INFO] Epoch:[0/2](542400/4588595) loss:2.197 lr:0.0000100 epoch_Time:25686.0min: [2024-01-05 03:03:07,360][model8_pretrain.py][INFO] Epoch:[0/2](542400/4588595) loss:3.037 lr:0.0000100 epoch_Time:25686.0min: [2024-01-05 03:03:07,360][model8_pretrain.py][INFO] Epoch:[0/2](542400/4588595) loss:2.187 lr:0.0000100 epoch_Time:25686.0min: [2024-01-05 03:03:07,360][model8_pretrain.py][INFO] Epoch:[0/2](542400/4588595) loss:3.147 lr:0.0000100 epoch_Time:25686.0min: [2024-01-05 03:03:07,360][model8_pretrain.py][INFO] 
Epoch:[0/2](542400/4588595) loss:2.449 lr:0.0000100 epoch_Time:25686.0min: [2024-01-05 03:03:07,360][model8_pretrain.py][INFO] Epoch:[0/2](542400/4588595) loss:3.248 lr:0.0000100 epoch_Time:25686.0min: [2024-01-05 03:03:44,311][model8_pretrain.py][INFO] Epoch:[0/2](542500/4588595) loss:2.980 lr:0.0000100 epoch_Time:25686.0min: [2024-01-05 03:03:44,311][model8_pretrain.py][INFO] Epoch:[0/2](542500/4588595) loss:2.530 lr:0.0000100 epoch_Time:25686.0min: [2024-01-05 03:03:44,311][model8_pretrain.py][INFO] Epoch:[0/2](542500/4588595) loss:3.165 lr:0.0000100 epoch_Time:25686.0min: [2024-01-05 03:03:44,311][model8_pretrain.py][INFO] Epoch:[0/2](542500/4588595) loss:2.726 lr:0.0000100 epoch_Time:25686.0min: [2024-01-05 03:03:44,311][model8_pretrain.py][INFO] Epoch:[0/2](542500/4588595) loss:3.122 lr:0.0000100 epoch_Time:25686.0min: [2024-01-05 03:03:44,311][model8_pretrain.py][INFO] Epoch:[0/2](542500/4588595) loss:2.890 lr:0.0000100 epoch_Time:25686.0min: [2024-01-05 03:03:44,311][model8_pretrain.py][INFO] Epoch:[0/2](542500/4588595) loss:3.005 lr:0.0000100 epoch_Time:25686.0min: [2024-01-05 03:03:44,311][model8_pretrain.py][INFO] Epoch:[0/2](542500/4588595) loss:2.473 lr:0.0000100 epoch_Time:25686.0min: [2024-01-05 03:04:21,228][model8_pretrain.py][INFO] Epoch:[0/2](542600/4588595) loss:2.727 lr:0.0000100 epoch_Time:25685.0min: [2024-01-05 03:04:21,228][model8_pretrain.py][INFO] Epoch:[0/2](542600/4588595) loss:3.112 lr:0.0000100 epoch_Time:25685.0min: [2024-01-05 03:04:21,228][model8_pretrain.py][INFO] Epoch:[0/2](542600/4588595) loss:2.405 lr:0.0000100 epoch_Time:25685.0min: [2024-01-05 03:04:21,228][model8_pretrain.py][INFO] Epoch:[0/2](542600/4588595) loss:3.131 lr:0.0000100 epoch_Time:25685.0min: [2024-01-05 03:04:21,228][model8_pretrain.py][INFO] Epoch:[0/2](542600/4588595) loss:2.502 lr:0.0000100 epoch_Time:25685.0min: [2024-01-05 03:04:21,228][model8_pretrain.py][INFO] Epoch:[0/2](542600/4588595) loss:2.539 lr:0.0000100 epoch_Time:25685.0min: [2024-01-05 
03:04:21,228][model8_pretrain.py][INFO] Epoch:[0/2](542600/4588595) loss:2.974 lr:0.0000100 epoch_Time:25685.0min: [2024-01-05 03:04:21,228][model8_pretrain.py][INFO] Epoch:[0/2](542600/4588595) loss:2.879 lr:0.0000100 epoch_Time:25685.0min: [2024-01-05 03:04:58,172][model8_pretrain.py][INFO] Epoch:[0/2](542700/4588595) loss:2.963 lr:0.0000100 epoch_Time:25684.0min: [2024-01-05 03:04:58,172][model8_pretrain.py][INFO] Epoch:[0/2](542700/4588595) loss:2.397 lr:0.0000100 epoch_Time:25684.0min: [2024-01-05 03:04:58,172][model8_pretrain.py][INFO] Epoch:[0/2](542700/4588595) loss:2.785 lr:0.0000100 epoch_Time:25684.0min: [2024-01-05 03:04:58,172][model8_pretrain.py][INFO] Epoch:[0/2](542700/4588595) loss:3.008 lr:0.0000100 epoch_Time:25684.0min: [2024-01-05 03:04:58,172][model8_pretrain.py][INFO] Epoch:[0/2](542700/4588595) loss:2.933 lr:0.0000100 epoch_Time:25684.0min: [2024-01-05 03:04:58,172][model8_pretrain.py][INFO] Epoch:[0/2](542700/4588595) loss:2.790 lr:0.0000100 epoch_Time:25684.0min: [2024-01-05 03:04:58,173][model8_pretrain.py][INFO] Epoch:[0/2](542700/4588595) loss:2.783 lr:0.0000100 epoch_Time:25684.0min: [2024-01-05 03:04:58,173][model8_pretrain.py][INFO] Epoch:[0/2](542700/4588595) loss:2.927 lr:0.0000100 epoch_Time:25684.0min: [2024-01-05 03:05:35,111][model8_pretrain.py][INFO] Epoch:[0/2](542800/4588595) loss:2.879 lr:0.0000100 epoch_Time:25684.0min: [2024-01-05 03:05:35,111][model8_pretrain.py][INFO] Epoch:[0/2](542800/4588595) loss:2.950 lr:0.0000100 epoch_Time:25684.0min: [2024-01-05 03:05:35,111][model8_pretrain.py][INFO] Epoch:[0/2](542800/4588595) loss:3.039 lr:0.0000100 epoch_Time:25684.0min: [2024-01-05 03:05:35,111][model8_pretrain.py][INFO] Epoch:[0/2](542800/4588595) loss:2.959 lr:0.0000100 epoch_Time:25684.0min: [2024-01-05 03:05:35,111][model8_pretrain.py][INFO] Epoch:[0/2](542800/4588595) loss:3.173 lr:0.0000100 epoch_Time:25684.0min: [2024-01-05 03:05:35,111][model8_pretrain.py][INFO] Epoch:[0/2](542800/4588595) loss:2.806 lr:0.0000100 
epoch_Time:25684.0min: [2024-01-05 03:05:35,111][model8_pretrain.py][INFO] Epoch:[0/2](542800/4588595) loss:3.138 lr:0.0000100 epoch_Time:25684.0min: [2024-01-05 03:05:35,111][model8_pretrain.py][INFO] Epoch:[0/2](542800/4588595) loss:3.396 lr:0.0000100 epoch_Time:25684.0min: [2024-01-05 03:06:12,048][model8_pretrain.py][INFO] Epoch:[0/2](542900/4588595) loss:2.847 lr:0.0000100 epoch_Time:25683.0min: [2024-01-05 03:06:12,048][model8_pretrain.py][INFO] Epoch:[0/2](542900/4588595) loss:2.889 lr:0.0000100 epoch_Time:25683.0min: [2024-01-05 03:06:12,048][model8_pretrain.py][INFO] Epoch:[0/2](542900/4588595) loss:3.176 lr:0.0000100 epoch_Time:25683.0min: [2024-01-05 03:06:12,048][model8_pretrain.py][INFO] Epoch:[0/2](542900/4588595) loss:2.961 lr:0.0000100 epoch_Time:25683.0min: [2024-01-05 03:06:12,048][model8_pretrain.py][INFO] Epoch:[0/2](542900/4588595) loss:2.872 lr:0.0000100 epoch_Time:25683.0min: [2024-01-05 03:06:12,048][model8_pretrain.py][INFO] Epoch:[0/2](542900/4588595) loss:3.309 lr:0.0000100 epoch_Time:25683.0min: [2024-01-05 03:06:12,048][model8_pretrain.py][INFO] Epoch:[0/2](542900/4588595) loss:2.834 lr:0.0000100 epoch_Time:25683.0min: [2024-01-05 03:06:12,048][model8_pretrain.py][INFO] Epoch:[0/2](542900/4588595) loss:2.861 lr:0.0000100 epoch_Time:25683.0min: [2024-01-05 03:06:48,970][model8_pretrain.py][INFO] Epoch:[0/2](543000/4588595) loss:2.376 lr:0.0000100 epoch_Time:25681.0min: [2024-01-05 03:06:48,970][model8_pretrain.py][INFO] Epoch:[0/2](543000/4588595) loss:2.715 lr:0.0000100 epoch_Time:25681.0min: [2024-01-05 03:06:48,970][model8_pretrain.py][INFO] Epoch:[0/2](543000/4588595) loss:2.562 lr:0.0000100 epoch_Time:25681.0min: [2024-01-05 03:06:48,970][model8_pretrain.py][INFO] Epoch:[0/2](543000/4588595) loss:2.908 lr:0.0000100 epoch_Time:25681.0min: [2024-01-05 03:06:48,970][model8_pretrain.py][INFO] Epoch:[0/2](543000/4588595) loss:3.225 lr:0.0000100 epoch_Time:25681.0min: [2024-01-05 03:06:48,970][model8_pretrain.py][INFO] 
Epoch:[0/2](543000/4588595) loss:2.535 lr:0.0000100 epoch_Time:25681.0min: [2024-01-05 03:06:48,970][model8_pretrain.py][INFO] Epoch:[0/2](543000/4588595) loss:2.953 lr:0.0000100 epoch_Time:25681.0min: [2024-01-05 03:06:48,970][model8_pretrain.py][INFO] Epoch:[0/2](543000/4588595) loss:2.861 lr:0.0000100 epoch_Time:25681.0min: [2024-01-05 03:07:38,134][model8_pretrain.py][INFO] Epoch:[0/2](543100/4588595) loss:2.547 lr:0.0000100 epoch_Time:25683.0min: [2024-01-05 03:07:38,134][model8_pretrain.py][INFO] Epoch:[0/2](543100/4588595) loss:3.158 lr:0.0000100 epoch_Time:25683.0min: [2024-01-05 03:07:38,135][model8_pretrain.py][INFO] Epoch:[0/2](543100/4588595) loss:3.232 lr:0.0000100 epoch_Time:25683.0min: [2024-01-05 03:07:38,135][model8_pretrain.py][INFO] Epoch:[0/2](543100/4588595) loss:2.804 lr:0.0000100 epoch_Time:25683.0min: [2024-01-05 03:07:38,135][model8_pretrain.py][INFO] Epoch:[0/2](543100/4588595) loss:2.556 lr:0.0000100 epoch_Time:25683.0min: [2024-01-05 03:07:38,135][model8_pretrain.py][INFO] Epoch:[0/2](543100/4588595) loss:3.066 lr:0.0000100 epoch_Time:25683.0min: [2024-01-05 03:07:38,135][model8_pretrain.py][INFO] Epoch:[0/2](543100/4588595) loss:2.947 lr:0.0000100 epoch_Time:25683.0min: [2024-01-05 03:07:38,135][model8_pretrain.py][INFO] Epoch:[0/2](543100/4588595) loss:2.554 lr:0.0000100 epoch_Time:25683.0min: [2024-01-05 03:08:15,064][model8_pretrain.py][INFO] Epoch:[0/2](543200/4588595) loss:2.397 lr:0.0000100 epoch_Time:25682.0min: [2024-01-05 03:08:15,064][model8_pretrain.py][INFO] Epoch:[0/2](543200/4588595) loss:2.512 lr:0.0000100 epoch_Time:25682.0min: [2024-01-05 03:08:15,064][model8_pretrain.py][INFO] Epoch:[0/2](543200/4588595) loss:3.196 lr:0.0000100 epoch_Time:25682.0min: [2024-01-05 03:08:15,064][model8_pretrain.py][INFO] Epoch:[0/2](543200/4588595) loss:2.810 lr:0.0000100 epoch_Time:25682.0min: [2024-01-05 03:08:15,064][model8_pretrain.py][INFO] Epoch:[0/2](543200/4588595) loss:2.898 lr:0.0000100 epoch_Time:25682.0min: [2024-01-05 
03:08:15,064][model8_pretrain.py][INFO] Epoch:[0/2](543200/4588595) loss:2.955 lr:0.0000100 epoch_Time:25682.0min: [2024-01-05 03:08:15,064][model8_pretrain.py][INFO] Epoch:[0/2](543200/4588595) loss:2.761 lr:0.0000100 epoch_Time:25682.0min: [2024-01-05 03:08:15,064][model8_pretrain.py][INFO] Epoch:[0/2](543200/4588595) loss:2.851 lr:0.0000100 epoch_Time:25682.0min: [2024-01-05 03:08:51,998][model8_pretrain.py][INFO] Epoch:[0/2](543300/4588595) loss:2.800 lr:0.0000100 epoch_Time:25681.0min: [2024-01-05 03:08:51,998][model8_pretrain.py][INFO] Epoch:[0/2](543300/4588595) loss:3.006 lr:0.0000100 epoch_Time:25681.0min: [2024-01-05 03:08:51,998][model8_pretrain.py][INFO] Epoch:[0/2](543300/4588595) loss:2.740 lr:0.0000100 epoch_Time:25681.0min: [2024-01-05 03:08:51,998][model8_pretrain.py][INFO] Epoch:[0/2](543300/4588595) loss:2.934 lr:0.0000100 epoch_Time:25681.0min: [2024-01-05 03:08:51,998][model8_pretrain.py][INFO] Epoch:[0/2](543300/4588595) loss:3.128 lr:0.0000100 epoch_Time:25681.0min: [2024-01-05 03:08:51,998][model8_pretrain.py][INFO] Epoch:[0/2](543300/4588595) loss:2.628 lr:0.0000100 epoch_Time:25681.0min: [2024-01-05 03:08:51,998][model8_pretrain.py][INFO] Epoch:[0/2](543300/4588595) loss:2.746 lr:0.0000100 epoch_Time:25681.0min: [2024-01-05 03:08:51,999][model8_pretrain.py][INFO] Epoch:[0/2](543300/4588595) loss:2.845 lr:0.0000100 epoch_Time:25681.0min: [2024-01-05 03:09:28,937][model8_pretrain.py][INFO] Epoch:[0/2](543400/4588595) loss:2.551 lr:0.0000100 epoch_Time:25680.0min: [2024-01-05 03:09:28,937][model8_pretrain.py][INFO] Epoch:[0/2](543400/4588595) loss:2.701 lr:0.0000100 epoch_Time:25680.0min: [2024-01-05 03:09:28,937][model8_pretrain.py][INFO] Epoch:[0/2](543400/4588595) loss:3.026 lr:0.0000100 epoch_Time:25680.0min: [2024-01-05 03:09:28,937][model8_pretrain.py][INFO] Epoch:[0/2](543400/4588595) loss:2.808 lr:0.0000100 epoch_Time:25680.0min: [2024-01-05 03:09:28,937][model8_pretrain.py][INFO] Epoch:[0/2](543400/4588595) loss:2.731 lr:0.0000100 
epoch_Time:25680.0min: [2024-01-05 03:09:28,937][model8_pretrain.py][INFO] Epoch:[0/2](543400/4588595) loss:3.027 lr:0.0000100 epoch_Time:25680.0min: [2024-01-05 03:09:28,937][model8_pretrain.py][INFO] Epoch:[0/2](543400/4588595) loss:2.506 lr:0.0000100 epoch_Time:25680.0min: [2024-01-05 03:09:28,937][model8_pretrain.py][INFO] Epoch:[0/2](543400/4588595) loss:2.896 lr:0.0000100 epoch_Time:25680.0min: [2024-01-05 03:10:05,874][model8_pretrain.py][INFO] Epoch:[0/2](543500/4588595) loss:2.834 lr:0.0000100 epoch_Time:25679.0min: [2024-01-05 03:10:05,874][model8_pretrain.py][INFO] Epoch:[0/2](543500/4588595) loss:2.297 lr:0.0000100 epoch_Time:25679.0min: [2024-01-05 03:10:05,874][model8_pretrain.py][INFO] Epoch:[0/2](543500/4588595) loss:3.509 lr:0.0000100 epoch_Time:25679.0min: [2024-01-05 03:10:05,874][model8_pretrain.py][INFO] Epoch:[0/2](543500/4588595) loss:2.258 lr:0.0000100 epoch_Time:25679.0min: [2024-01-05 03:10:05,874][model8_pretrain.py][INFO] Epoch:[0/2](543500/4588595) loss:2.737 lr:0.0000100 epoch_Time:25679.0min: [2024-01-05 03:10:05,874][model8_pretrain.py][INFO] Epoch:[0/2](543500/4588595) loss:2.888 lr:0.0000100 epoch_Time:25679.0min: [2024-01-05 03:10:05,874][model8_pretrain.py][INFO] Epoch:[0/2](543500/4588595) loss:2.732 lr:0.0000100 epoch_Time:25679.0min: [2024-01-05 03:10:05,874][model8_pretrain.py][INFO] Epoch:[0/2](543500/4588595) loss:2.259 lr:0.0000100 epoch_Time:25679.0min: [2024-01-05 03:10:42,806][model8_pretrain.py][INFO] Epoch:[0/2](543600/4588595) loss:3.242 lr:0.0000100 epoch_Time:25679.0min: [2024-01-05 03:10:42,806][model8_pretrain.py][INFO] Epoch:[0/2](543600/4588595) loss:2.933 lr:0.0000100 epoch_Time:25679.0min: [2024-01-05 03:10:42,806][model8_pretrain.py][INFO] Epoch:[0/2](543600/4588595) loss:2.904 lr:0.0000100 epoch_Time:25679.0min: [2024-01-05 03:10:42,806][model8_pretrain.py][INFO] Epoch:[0/2](543600/4588595) loss:2.707 lr:0.0000100 epoch_Time:25679.0min: [2024-01-05 03:10:42,806][model8_pretrain.py][INFO] 
Epoch:[0/2](543600/4588595) loss:2.328 lr:0.0000100 epoch_Time:25679.0min: [2024-01-05 03:10:42,806][model8_pretrain.py][INFO] Epoch:[0/2](543600/4588595) loss:3.153 lr:0.0000100 epoch_Time:25679.0min: [2024-01-05 03:10:42,806][model8_pretrain.py][INFO] Epoch:[0/2](543600/4588595) loss:2.867 lr:0.0000100 epoch_Time:25679.0min: [2024-01-05 03:10:42,806][model8_pretrain.py][INFO] Epoch:[0/2](543600/4588595) loss:2.868 lr:0.0000100 epoch_Time:25679.0min: [2024-01-05 03:11:19,735][model8_pretrain.py][INFO] Epoch:[0/2](543700/4588595) loss:2.643 lr:0.0000100 epoch_Time:25678.0min: [2024-01-05 03:11:19,735][model8_pretrain.py][INFO] Epoch:[0/2](543700/4588595) loss:2.757 lr:0.0000100 epoch_Time:25678.0min: [2024-01-05 03:11:19,735][model8_pretrain.py][INFO] Epoch:[0/2](543700/4588595) loss:2.479 lr:0.0000100 epoch_Time:25678.0min: [2024-01-05 03:11:19,735][model8_pretrain.py][INFO] Epoch:[0/2](543700/4588595) loss:3.117 lr:0.0000100 epoch_Time:25678.0min: [2024-01-05 03:11:19,735][model8_pretrain.py][INFO] Epoch:[0/2](543700/4588595) loss:2.903 lr:0.0000100 epoch_Time:25678.0min: [2024-01-05 03:11:19,735][model8_pretrain.py][INFO] Epoch:[0/2](543700/4588595) loss:2.551 lr:0.0000100 epoch_Time:25678.0min: [2024-01-05 03:11:19,735][model8_pretrain.py][INFO] Epoch:[0/2](543700/4588595) loss:2.859 lr:0.0000100 epoch_Time:25678.0min: [2024-01-05 03:11:19,736][model8_pretrain.py][INFO] Epoch:[0/2](543700/4588595) loss:2.959 lr:0.0000100 epoch_Time:25678.0min: [2024-01-05 03:11:56,666][model8_pretrain.py][INFO] Epoch:[0/2](543800/4588595) loss:2.820 lr:0.0000100 epoch_Time:25677.0min: [2024-01-05 03:11:56,666][model8_pretrain.py][INFO] Epoch:[0/2](543800/4588595) loss:2.734 lr:0.0000100 epoch_Time:25677.0min: [2024-01-05 03:11:56,666][model8_pretrain.py][INFO] Epoch:[0/2](543800/4588595) loss:3.196 lr:0.0000100 epoch_Time:25677.0min: [2024-01-05 03:11:56,666][model8_pretrain.py][INFO] Epoch:[0/2](543800/4588595) loss:2.805 lr:0.0000100 epoch_Time:25677.0min: [2024-01-05 
03:11:56,666][model8_pretrain.py][INFO] Epoch:[0/2](543800/4588595) loss:3.015 lr:0.0000100 epoch_Time:25677.0min: [2024-01-05 03:11:56,666][model8_pretrain.py][INFO] Epoch:[0/2](543800/4588595) loss:3.331 lr:0.0000100 epoch_Time:25677.0min: [2024-01-05 03:11:56,666][model8_pretrain.py][INFO] Epoch:[0/2](543800/4588595) loss:2.936 lr:0.0000100 epoch_Time:25677.0min: [2024-01-05 03:11:56,666][model8_pretrain.py][INFO] Epoch:[0/2](543800/4588595) loss:3.213 lr:0.0000100 epoch_Time:25677.0min: [2024-01-05 03:12:45,776][model8_pretrain.py][INFO] Epoch:[0/2](543900/4588595) loss:2.750 lr:0.0000100 epoch_Time:25678.0min: [2024-01-05 03:12:45,776][model8_pretrain.py][INFO] Epoch:[0/2](543900/4588595) loss:2.851 lr:0.0000100 epoch_Time:25678.0min: [2024-01-05 03:12:45,776][model8_pretrain.py][INFO] Epoch:[0/2](543900/4588595) loss:3.079 lr:0.0000100 epoch_Time:25678.0min: [2024-01-05 03:12:45,776][model8_pretrain.py][INFO] Epoch:[0/2](543900/4588595) loss:2.285 lr:0.0000100 epoch_Time:25678.0min: [2024-01-05 03:12:45,776][model8_pretrain.py][INFO] Epoch:[0/2](543900/4588595) loss:2.568 lr:0.0000100 epoch_Time:25678.0min: [2024-01-05 03:12:45,776][model8_pretrain.py][INFO] Epoch:[0/2](543900/4588595) loss:3.052 lr:0.0000100 epoch_Time:25678.0min: [2024-01-05 03:12:45,776][model8_pretrain.py][INFO] Epoch:[0/2](543900/4588595) loss:3.302 lr:0.0000100 epoch_Time:25678.0min: [2024-01-05 03:12:45,776][model8_pretrain.py][INFO] Epoch:[0/2](543900/4588595) loss:2.717 lr:0.0000100 epoch_Time:25678.0min: [2024-01-05 03:13:22,702][model8_pretrain.py][INFO] Epoch:[0/2](544000/4588595) loss:2.858 lr:0.0000100 epoch_Time:25677.0min: [2024-01-05 03:13:22,702][model8_pretrain.py][INFO] Epoch:[0/2](544000/4588595) loss:3.350 lr:0.0000100 epoch_Time:25677.0min: [2024-01-05 03:13:22,702][model8_pretrain.py][INFO] Epoch:[0/2](544000/4588595) loss:2.826 lr:0.0000100 epoch_Time:25677.0min: [2024-01-05 03:13:22,702][model8_pretrain.py][INFO] Epoch:[0/2](544000/4588595) loss:2.579 lr:0.0000100 
epoch_Time:25677.0min: [2024-01-05 03:13:22,702][model8_pretrain.py][INFO] Epoch:[0/2](544000/4588595) loss:3.102 lr:0.0000100 epoch_Time:25677.0min: [2024-01-05 03:13:22,702][model8_pretrain.py][INFO] Epoch:[0/2](544000/4588595) loss:3.280 lr:0.0000100 epoch_Time:25677.0min: [2024-01-05 03:13:22,702][model8_pretrain.py][INFO] Epoch:[0/2](544000/4588595) loss:3.038 lr:0.0000100 epoch_Time:25677.0min: [2024-01-05 03:13:22,702][model8_pretrain.py][INFO] Epoch:[0/2](544000/4588595) loss:2.945 lr:0.0000100 epoch_Time:25677.0min: [2024-01-05 03:13:59,638][model8_pretrain.py][INFO] Epoch:[0/2](544100/4588595) loss:2.690 lr:0.0000100 epoch_Time:25676.0min: [2024-01-05 03:13:59,638][model8_pretrain.py][INFO] Epoch:[0/2](544100/4588595) loss:2.354 lr:0.0000100 epoch_Time:25676.0min: [2024-01-05 03:13:59,638][model8_pretrain.py][INFO] Epoch:[0/2](544100/4588595) loss:3.236 lr:0.0000100 epoch_Time:25676.0min: [2024-01-05 03:13:59,638][model8_pretrain.py][INFO] Epoch:[0/2](544100/4588595) loss:3.016 lr:0.0000100 epoch_Time:25676.0min: [2024-01-05 03:13:59,638][model8_pretrain.py][INFO] Epoch:[0/2](544100/4588595) loss:2.373 lr:0.0000100 epoch_Time:25676.0min: [2024-01-05 03:13:59,638][model8_pretrain.py][INFO] Epoch:[0/2](544100/4588595) loss:2.732 lr:0.0000100 epoch_Time:25676.0min: [2024-01-05 03:13:59,638][model8_pretrain.py][INFO] Epoch:[0/2](544100/4588595) loss:2.669 lr:0.0000100 epoch_Time:25676.0min: [2024-01-05 03:13:59,638][model8_pretrain.py][INFO] Epoch:[0/2](544100/4588595) loss:3.502 lr:0.0000100 epoch_Time:25676.0min: [2024-01-05 03:14:36,585][model8_pretrain.py][INFO] Epoch:[0/2](544200/4588595) loss:2.725 lr:0.0000100 epoch_Time:25676.0min: [2024-01-05 03:14:36,585][model8_pretrain.py][INFO] Epoch:[0/2](544200/4588595) loss:2.967 lr:0.0000100 epoch_Time:25676.0min: [2024-01-05 03:14:36,585][model8_pretrain.py][INFO] Epoch:[0/2](544200/4588595) loss:3.079 lr:0.0000100 epoch_Time:25676.0min: [2024-01-05 03:14:36,585][model8_pretrain.py][INFO] 
Epoch:[0/2](544200/4588595) loss:2.793 lr:0.0000100 epoch_Time:25676.0min: [2024-01-05 03:14:36,585][model8_pretrain.py][INFO] Epoch:[0/2](544200/4588595) loss:2.443 lr:0.0000100 epoch_Time:25676.0min: [2024-01-05 03:14:36,585][model8_pretrain.py][INFO] Epoch:[0/2](544200/4588595) loss:2.965 lr:0.0000100 epoch_Time:25676.0min: [2024-01-05 03:14:36,585][model8_pretrain.py][INFO] Epoch:[0/2](544200/4588595) loss:3.080 lr:0.0000100 epoch_Time:25676.0min: [2024-01-05 03:14:36,585][model8_pretrain.py][INFO] Epoch:[0/2](544200/4588595) loss:2.787 lr:0.0000100 epoch_Time:25676.0min: [2024-01-05 03:15:13,516][model8_pretrain.py][INFO] Epoch:[0/2](544300/4588595) loss:3.026 lr:0.0000100 epoch_Time:25675.0min: [2024-01-05 03:15:13,516][model8_pretrain.py][INFO] Epoch:[0/2](544300/4588595) loss:3.065 lr:0.0000100 epoch_Time:25675.0min: [2024-01-05 03:15:13,516][model8_pretrain.py][INFO] Epoch:[0/2](544300/4588595) loss:2.983 lr:0.0000100 epoch_Time:25675.0min: [2024-01-05 03:15:13,517][model8_pretrain.py][INFO] Epoch:[0/2](544300/4588595) loss:2.778 lr:0.0000100 epoch_Time:25675.0min: [2024-01-05 03:15:13,516][model8_pretrain.py][INFO] Epoch:[0/2](544300/4588595) loss:2.963 lr:0.0000100 epoch_Time:25675.0min: [2024-01-05 03:15:13,516][model8_pretrain.py][INFO] Epoch:[0/2](544300/4588595) loss:2.775 lr:0.0000100 epoch_Time:25675.0min: [2024-01-05 03:15:13,516][model8_pretrain.py][INFO] Epoch:[0/2](544300/4588595) loss:2.727 lr:0.0000100 epoch_Time:25675.0min: [2024-01-05 03:15:13,517][model8_pretrain.py][INFO] Epoch:[0/2](544300/4588595) loss:2.909 lr:0.0000100 epoch_Time:25675.0min: [2024-01-05 03:15:50,443][model8_pretrain.py][INFO] Epoch:[0/2](544400/4588595) loss:3.088 lr:0.0000100 epoch_Time:25673.0min: [2024-01-05 03:15:50,443][model8_pretrain.py][INFO] Epoch:[0/2](544400/4588595) loss:2.534 lr:0.0000100 epoch_Time:25673.0min: [2024-01-05 03:15:50,443][model8_pretrain.py][INFO] Epoch:[0/2](544400/4588595) loss:3.290 lr:0.0000100 epoch_Time:25674.0min: [2024-01-05 
03:15:50,443][model8_pretrain.py][INFO] Epoch:[0/2](544400/4588595) loss:2.708 lr:0.0000100 epoch_Time:25674.0min: [2024-01-05 03:15:50,443][model8_pretrain.py][INFO] Epoch:[0/2](544400/4588595) loss:3.108 lr:0.0000100 epoch_Time:25674.0min: [2024-01-05 03:15:50,443][model8_pretrain.py][INFO] Epoch:[0/2](544400/4588595) loss:3.351 lr:0.0000100 epoch_Time:25674.0min: [2024-01-05 03:15:50,443][model8_pretrain.py][INFO] Epoch:[0/2](544400/4588595) loss:3.285 lr:0.0000100 epoch_Time:25674.0min: [2024-01-05 03:15:50,443][model8_pretrain.py][INFO] Epoch:[0/2](544400/4588595) loss:3.477 lr:0.0000100 epoch_Time:25674.0min: [2024-01-05 03:16:27,402][model8_pretrain.py][INFO] Epoch:[0/2](544500/4588595) loss:3.016 lr:0.0000100 epoch_Time:25673.0min: [2024-01-05 03:16:27,402][model8_pretrain.py][INFO] Epoch:[0/2](544500/4588595) loss:2.227 lr:0.0000100 epoch_Time:25673.0min: [2024-01-05 03:16:27,402][model8_pretrain.py][INFO] Epoch:[0/2](544500/4588595) loss:3.052 lr:0.0000100 epoch_Time:25673.0min: [2024-01-05 03:16:27,402][model8_pretrain.py][INFO] Epoch:[0/2](544500/4588595) loss:2.913 lr:0.0000100 epoch_Time:25673.0min: [2024-01-05 03:16:27,402][model8_pretrain.py][INFO] Epoch:[0/2](544500/4588595) loss:2.636 lr:0.0000100 epoch_Time:25673.0min: [2024-01-05 03:16:27,402][model8_pretrain.py][INFO] Epoch:[0/2](544500/4588595) loss:2.745 lr:0.0000100 epoch_Time:25673.0min: [2024-01-05 03:16:27,402][model8_pretrain.py][INFO] Epoch:[0/2](544500/4588595) loss:2.370 lr:0.0000100 epoch_Time:25673.0min: [2024-01-05 03:16:27,402][model8_pretrain.py][INFO] Epoch:[0/2](544500/4588595) loss:3.488 lr:0.0000100 epoch_Time:25673.0min: [2024-01-05 03:17:04,375][model8_pretrain.py][INFO] Epoch:[0/2](544600/4588595) loss:3.454 lr:0.0000100 epoch_Time:25672.0min: [2024-01-05 03:17:04,375][model8_pretrain.py][INFO] Epoch:[0/2](544600/4588595) loss:2.614 lr:0.0000100 epoch_Time:25672.0min: [2024-01-05 03:17:04,375][model8_pretrain.py][INFO] Epoch:[0/2](544600/4588595) loss:2.256 lr:0.0000100 
epoch_Time:25672.0min: [2024-01-05 03:17:04,375][model8_pretrain.py][INFO] Epoch:[0/2](544600/4588595) loss:2.989 lr:0.0000100 epoch_Time:25672.0min: [2024-01-05 03:17:04,375][model8_pretrain.py][INFO] Epoch:[0/2](544600/4588595) loss:3.359 lr:0.0000100 epoch_Time:25672.0min: [2024-01-05 03:17:04,375][model8_pretrain.py][INFO] Epoch:[0/2](544600/4588595) loss:2.947 lr:0.0000100 epoch_Time:25672.0min: [2024-01-05 03:17:04,375][model8_pretrain.py][INFO] Epoch:[0/2](544600/4588595) loss:3.068 lr:0.0000100 epoch_Time:25672.0min: [2024-01-05 03:17:04,375][model8_pretrain.py][INFO] Epoch:[0/2](544600/4588595) loss:3.079 lr:0.0000100 epoch_Time:25672.0min: [2024-01-05 03:17:53,843][model8_pretrain.py][INFO] Epoch:[0/2](544700/4588595) loss:3.214 lr:0.0000100 epoch_Time:25673.0min: [2024-01-05 03:17:53,843][model8_pretrain.py][INFO] Epoch:[0/2](544700/4588595) loss:3.128 lr:0.0000100 epoch_Time:25673.0min: [2024-01-05 03:17:53,843][model8_pretrain.py][INFO] Epoch:[0/2](544700/4588595) loss:2.900 lr:0.0000100 epoch_Time:25673.0min: [2024-01-05 03:17:53,843][model8_pretrain.py][INFO] Epoch:[0/2](544700/4588595) loss:2.867 lr:0.0000100 epoch_Time:25673.0min: [2024-01-05 03:17:53,843][model8_pretrain.py][INFO] Epoch:[0/2](544700/4588595) loss:2.669 lr:0.0000100 epoch_Time:25673.0min: [2024-01-05 03:17:53,843][model8_pretrain.py][INFO] Epoch:[0/2](544700/4588595) loss:3.023 lr:0.0000100 epoch_Time:25673.0min: [2024-01-05 03:17:53,843][model8_pretrain.py][INFO] Epoch:[0/2](544700/4588595) loss:3.158 lr:0.0000100 epoch_Time:25673.0min: [2024-01-05 03:17:53,843][model8_pretrain.py][INFO] Epoch:[0/2](544700/4588595) loss:3.000 lr:0.0000100 epoch_Time:25673.0min: [2024-01-05 03:18:30,783][model8_pretrain.py][INFO] Epoch:[0/2](544800/4588595) loss:3.707 lr:0.0000100 epoch_Time:25673.0min: [2024-01-05 03:18:30,783][model8_pretrain.py][INFO] Epoch:[0/2](544800/4588595) loss:2.821 lr:0.0000100 epoch_Time:25673.0min: [2024-01-05 03:18:30,783][model8_pretrain.py][INFO] 
Epoch:[0/2](544800/4588595) loss:2.963 lr:0.0000100 epoch_Time:25673.0min: [2024-01-05 03:18:30,783][model8_pretrain.py][INFO] Epoch:[0/2](544800/4588595) loss:2.670 lr:0.0000100 epoch_Time:25673.0min: [2024-01-05 03:18:30,783][model8_pretrain.py][INFO] Epoch:[0/2](544800/4588595) loss:3.121 lr:0.0000100 epoch_Time:25673.0min: [2024-01-05 03:18:30,783][model8_pretrain.py][INFO] Epoch:[0/2](544800/4588595) loss:2.064 lr:0.0000100 epoch_Time:25673.0min: [2024-01-05 03:18:30,783][model8_pretrain.py][INFO] Epoch:[0/2](544800/4588595) loss:3.247 lr:0.0000100 epoch_Time:25673.0min: [2024-01-05 03:18:30,783][model8_pretrain.py][INFO] Epoch:[0/2](544800/4588595) loss:3.238 lr:0.0000100 epoch_Time:25673.0min: [2024-01-05 03:19:07,714][model8_pretrain.py][INFO] Epoch:[0/2](544900/4588595) loss:2.687 lr:0.0000100 epoch_Time:25671.0min: [2024-01-05 03:19:07,714][model8_pretrain.py][INFO] Epoch:[0/2](544900/4588595) loss:2.631 lr:0.0000100 epoch_Time:25671.0min: [2024-01-05 03:19:07,714][model8_pretrain.py][INFO] Epoch:[0/2](544900/4588595) loss:3.315 lr:0.0000100 epoch_Time:25671.0min: [2024-01-05 03:19:07,714][model8_pretrain.py][INFO] Epoch:[0/2](544900/4588595) loss:2.685 lr:0.0000100 epoch_Time:25671.0min: [2024-01-05 03:19:07,714][model8_pretrain.py][INFO] Epoch:[0/2](544900/4588595) loss:2.868 lr:0.0000100 epoch_Time:25671.0min: [2024-01-05 03:19:07,714][model8_pretrain.py][INFO] Epoch:[0/2](544900/4588595) loss:2.912 lr:0.0000100 epoch_Time:25671.0min: [2024-01-05 03:19:07,714][model8_pretrain.py][INFO] Epoch:[0/2](544900/4588595) loss:1.988 lr:0.0000100 epoch_Time:25671.0min: [2024-01-05 03:19:07,714][model8_pretrain.py][INFO] Epoch:[0/2](544900/4588595) loss:3.054 lr:0.0000100 epoch_Time:25671.0min: [2024-01-05 03:19:44,657][model8_pretrain.py][INFO] Epoch:[0/2](545000/4588595) loss:3.309 lr:0.0000100 epoch_Time:25671.0min: [2024-01-05 03:19:44,657][model8_pretrain.py][INFO] Epoch:[0/2](545000/4588595) loss:3.410 lr:0.0000100 epoch_Time:25671.0min: [2024-01-05 
03:19:44,657][model8_pretrain.py][INFO] Epoch:[0/2](545000/4588595) loss:2.842 lr:0.0000100 epoch_Time:25671.0min: [2024-01-05 03:19:44,657][model8_pretrain.py][INFO] Epoch:[0/2](545000/4588595) loss:3.287 lr:0.0000100 epoch_Time:25671.0min: [2024-01-05 03:19:44,657][model8_pretrain.py][INFO] Epoch:[0/2](545000/4588595) loss:2.659 lr:0.0000100 epoch_Time:25671.0min: [2024-01-05 03:19:44,657][model8_pretrain.py][INFO] Epoch:[0/2](545000/4588595) loss:3.395 lr:0.0000100 epoch_Time:25671.0min: [2024-01-05 03:19:44,657][model8_pretrain.py][INFO] Epoch:[0/2](545000/4588595) loss:3.168 lr:0.0000100 epoch_Time:25671.0min: [2024-01-05 03:19:44,657][model8_pretrain.py][INFO] Epoch:[0/2](545000/4588595) loss:2.704 lr:0.0000100 epoch_Time:25671.0min: [2024-01-05 03:20:21,613][model8_pretrain.py][INFO] Epoch:[0/2](545100/4588595) loss:3.586 lr:0.0000100 epoch_Time:25670.0min: [2024-01-05 03:20:21,613][model8_pretrain.py][INFO] Epoch:[0/2](545100/4588595) loss:2.583 lr:0.0000100 epoch_Time:25670.0min: [2024-01-05 03:20:21,613][model8_pretrain.py][INFO] Epoch:[0/2](545100/4588595) loss:2.334 lr:0.0000100 epoch_Time:25670.0min: [2024-01-05 03:20:21,613][model8_pretrain.py][INFO] Epoch:[0/2](545100/4588595) loss:2.490 lr:0.0000100 epoch_Time:25670.0min: [2024-01-05 03:20:21,613][model8_pretrain.py][INFO] Epoch:[0/2](545100/4588595) loss:2.944 lr:0.0000100 epoch_Time:25670.0min: [2024-01-05 03:20:21,613][model8_pretrain.py][INFO] Epoch:[0/2](545100/4588595) loss:3.177 lr:0.0000100 epoch_Time:25670.0min: [2024-01-05 03:20:21,613][model8_pretrain.py][INFO] Epoch:[0/2](545100/4588595) loss:2.627 lr:0.0000100 epoch_Time:25670.0min: [2024-01-05 03:20:21,613][model8_pretrain.py][INFO] Epoch:[0/2](545100/4588595) loss:2.812 lr:0.0000100 epoch_Time:25670.0min: [2024-01-05 03:20:58,546][model8_pretrain.py][INFO] Epoch:[0/2](545200/4588595) loss:2.328 lr:0.0000100 epoch_Time:25669.0min: [2024-01-05 03:20:58,546][model8_pretrain.py][INFO] Epoch:[0/2](545200/4588595) loss:2.679 lr:0.0000100 
epoch_Time:25669.0min: [2024-01-05 03:20:58,546][model8_pretrain.py][INFO] Epoch:[0/2](545200/4588595) loss:2.909 lr:0.0000100 epoch_Time:25669.0min: [2024-01-05 03:20:58,546][model8_pretrain.py][INFO] Epoch:[0/2](545200/4588595) loss:2.955 lr:0.0000100 epoch_Time:25669.0min: [2024-01-05 03:20:58,546][model8_pretrain.py][INFO] Epoch:[0/2](545200/4588595) loss:2.341 lr:0.0000100 epoch_Time:25669.0min: [2024-01-05 03:20:58,546][model8_pretrain.py][INFO] Epoch:[0/2](545200/4588595) loss:3.127 lr:0.0000100 epoch_Time:25669.0min: [2024-01-05 03:20:58,546][model8_pretrain.py][INFO] Epoch:[0/2](545200/4588595) loss:3.055 lr:0.0000100 epoch_Time:25669.0min: [2024-01-05 03:20:58,546][model8_pretrain.py][INFO] Epoch:[0/2](545200/4588595) loss:2.834 lr:0.0000100 epoch_Time:25669.0min: [2024-01-05 03:21:35,482][model8_pretrain.py][INFO] Epoch:[0/2](545300/4588595) loss:2.717 lr:0.0000100 epoch_Time:25669.0min: [2024-01-05 03:21:35,482][model8_pretrain.py][INFO] Epoch:[0/2](545300/4588595) loss:2.982 lr:0.0000100 epoch_Time:25669.0min: [2024-01-05 03:21:35,482][model8_pretrain.py][INFO] Epoch:[0/2](545300/4588595) loss:3.039 lr:0.0000100 epoch_Time:25669.0min: [2024-01-05 03:21:35,482][model8_pretrain.py][INFO] Epoch:[0/2](545300/4588595) loss:2.870 lr:0.0000100 epoch_Time:25669.0min: [2024-01-05 03:21:35,482][model8_pretrain.py][INFO] Epoch:[0/2](545300/4588595) loss:2.945 lr:0.0000100 epoch_Time:25669.0min: [2024-01-05 03:21:35,482][model8_pretrain.py][INFO] Epoch:[0/2](545300/4588595) loss:2.807 lr:0.0000100 epoch_Time:25669.0min: [2024-01-05 03:21:35,482][model8_pretrain.py][INFO] Epoch:[0/2](545300/4588595) loss:2.975 lr:0.0000100 epoch_Time:25669.0min: [2024-01-05 03:21:35,482][model8_pretrain.py][INFO] Epoch:[0/2](545300/4588595) loss:3.067 lr:0.0000100 epoch_Time:25669.0min: [2024-01-05 03:22:12,408][model8_pretrain.py][INFO] Epoch:[0/2](545400/4588595) loss:2.193 lr:0.0000100 epoch_Time:25668.0min: [2024-01-05 03:22:12,408][model8_pretrain.py][INFO] 
Epoch:[0/2](545400/4588595) loss:2.687 lr:0.0000100 epoch_Time:25668.0min: [2024-01-05 03:22:12,408][model8_pretrain.py][INFO] Epoch:[0/2](545400/4588595) loss:1.862 lr:0.0000100 epoch_Time:25668.0min: [2024-01-05 03:22:12,408][model8_pretrain.py][INFO] Epoch:[0/2](545400/4588595) loss:2.721 lr:0.0000100 epoch_Time:25668.0min: [2024-01-05 03:22:12,408][model8_pretrain.py][INFO] Epoch:[0/2](545400/4588595) loss:3.148 lr:0.0000100 epoch_Time:25668.0min: [2024-01-05 03:22:12,408][model8_pretrain.py][INFO] Epoch:[0/2](545400/4588595) loss:3.269 lr:0.0000100 epoch_Time:25668.0min: [2024-01-05 03:22:12,408][model8_pretrain.py][INFO] Epoch:[0/2](545400/4588595) loss:2.730 lr:0.0000100 epoch_Time:25668.0min: [2024-01-05 03:22:12,409][model8_pretrain.py][INFO] Epoch:[0/2](545400/4588595) loss:2.517 lr:0.0000100 epoch_Time:25668.0min: [2024-01-05 03:23:01,557][model8_pretrain.py][INFO] Epoch:[0/2](545500/4588595) loss:2.897 lr:0.0000100 epoch_Time:25668.0min: [2024-01-05 03:23:01,557][model8_pretrain.py][INFO] Epoch:[0/2](545500/4588595) loss:2.916 lr:0.0000100 epoch_Time:25668.0min: [2024-01-05 03:23:01,557][model8_pretrain.py][INFO] Epoch:[0/2](545500/4588595) loss:2.244 lr:0.0000100 epoch_Time:25668.0min: [2024-01-05 03:23:01,557][model8_pretrain.py][INFO] Epoch:[0/2](545500/4588595) loss:2.573 lr:0.0000100 epoch_Time:25668.0min: [2024-01-05 03:23:01,557][model8_pretrain.py][INFO] Epoch:[0/2](545500/4588595) loss:3.255 lr:0.0000100 epoch_Time:25668.0min: [2024-01-05 03:23:01,557][model8_pretrain.py][INFO] Epoch:[0/2](545500/4588595) loss:2.536 lr:0.0000100 epoch_Time:25668.0min: [2024-01-05 03:23:01,557][model8_pretrain.py][INFO] Epoch:[0/2](545500/4588595) loss:3.114 lr:0.0000100 epoch_Time:25668.0min: [2024-01-05 03:23:01,557][model8_pretrain.py][INFO] Epoch:[0/2](545500/4588595) loss:3.442 lr:0.0000100 epoch_Time:25668.0min: [2024-01-05 03:23:38,484][model8_pretrain.py][INFO] Epoch:[0/2](545600/4588595) loss:2.739 lr:0.0000100 epoch_Time:25668.0min: [2024-01-05 
03:23:38,484][model8_pretrain.py][INFO] Epoch:[0/2](545600/4588595) loss:2.583 lr:0.0000100 epoch_Time:25668.0min: [2024-01-05 03:23:38,484][model8_pretrain.py][INFO] Epoch:[0/2](545600/4588595) loss:3.203 lr:0.0000100 epoch_Time:25668.0min: [2024-01-05 03:23:38,484][model8_pretrain.py][INFO] Epoch:[0/2](545600/4588595) loss:3.065 lr:0.0000100 epoch_Time:25668.0min: [2024-01-05 03:23:38,484][model8_pretrain.py][INFO] Epoch:[0/2](545600/4588595) loss:2.917 lr:0.0000100 epoch_Time:25668.0min: [2024-01-05 03:23:38,484][model8_pretrain.py][INFO] Epoch:[0/2](545600/4588595) loss:2.639 lr:0.0000100 epoch_Time:25668.0min: [2024-01-05 03:23:38,484][model8_pretrain.py][INFO] Epoch:[0/2](545600/4588595) loss:2.864 lr:0.0000100 epoch_Time:25668.0min: [2024-01-05 03:23:38,485][model8_pretrain.py][INFO] Epoch:[0/2](545600/4588595) loss:3.294 lr:0.0000100 epoch_Time:25668.0min: [2024-01-05 03:24:15,423][model8_pretrain.py][INFO] Epoch:[0/2](545700/4588595) loss:2.389 lr:0.0000100 epoch_Time:25667.0min: [2024-01-05 03:24:15,423][model8_pretrain.py][INFO] Epoch:[0/2](545700/4588595) loss:2.767 lr:0.0000100 epoch_Time:25667.0min: [2024-01-05 03:24:15,423][model8_pretrain.py][INFO] Epoch:[0/2](545700/4588595) loss:2.250 lr:0.0000100 epoch_Time:25667.0min: [2024-01-05 03:24:15,423][model8_pretrain.py][INFO] Epoch:[0/2](545700/4588595) loss:3.186 lr:0.0000100 epoch_Time:25667.0min: [2024-01-05 03:24:15,423][model8_pretrain.py][INFO] Epoch:[0/2](545700/4588595) loss:2.706 lr:0.0000100 epoch_Time:25667.0min: [2024-01-05 03:24:15,423][model8_pretrain.py][INFO] Epoch:[0/2](545700/4588595) loss:3.169 lr:0.0000100 epoch_Time:25667.0min: [2024-01-05 03:24:15,424][model8_pretrain.py][INFO] Epoch:[0/2](545700/4588595) loss:3.006 lr:0.0000100 epoch_Time:25667.0min: [2024-01-05 03:24:15,424][model8_pretrain.py][INFO] Epoch:[0/2](545700/4588595) loss:3.024 lr:0.0000100 epoch_Time:25667.0min: [2024-01-05 03:24:52,365][model8_pretrain.py][INFO] Epoch:[0/2](545800/4588595) loss:3.246 lr:0.0000100 
epoch_Time:25666.0min: [2024-01-05 03:24:52,365][model8_pretrain.py][INFO] Epoch:[0/2](545800/4588595) loss:2.707 lr:0.0000100 epoch_Time:25666.0min: [2024-01-05 03:24:52,365][model8_pretrain.py][INFO] Epoch:[0/2](545800/4588595) loss:2.749 lr:0.0000100 epoch_Time:25666.0min: [2024-01-05 03:24:52,365][model8_pretrain.py][INFO] Epoch:[0/2](545800/4588595) loss:3.081 lr:0.0000100 epoch_Time:25666.0min: [2024-01-05 03:24:52,365][model8_pretrain.py][INFO] Epoch:[0/2](545800/4588595) loss:3.007 lr:0.0000100 epoch_Time:25666.0min: [2024-01-05 03:24:52,365][model8_pretrain.py][INFO] Epoch:[0/2](545800/4588595) loss:3.160 lr:0.0000100 epoch_Time:25666.0min: [2024-01-05 03:24:52,366][model8_pretrain.py][INFO] Epoch:[0/2](545800/4588595) loss:3.169 lr:0.0000100 epoch_Time:25666.0min: [2024-01-05 03:24:52,366][model8_pretrain.py][INFO] Epoch:[0/2](545800/4588595) loss:2.775 lr:0.0000100 epoch_Time:25666.0min: [2024-01-05 03:25:29,309][model8_pretrain.py][INFO] Epoch:[0/2](545900/4588595) loss:2.606 lr:0.0000100 epoch_Time:25666.0min: [2024-01-05 03:25:29,309][model8_pretrain.py][INFO] Epoch:[0/2](545900/4588595) loss:3.067 lr:0.0000100 epoch_Time:25666.0min: [2024-01-05 03:25:29,309][model8_pretrain.py][INFO] Epoch:[0/2](545900/4588595) loss:2.526 lr:0.0000100 epoch_Time:25666.0min: [2024-01-05 03:25:29,309][model8_pretrain.py][INFO] Epoch:[0/2](545900/4588595) loss:2.840 lr:0.0000100 epoch_Time:25666.0min: [2024-01-05 03:25:29,309][model8_pretrain.py][INFO] Epoch:[0/2](545900/4588595) loss:3.180 lr:0.0000100 epoch_Time:25666.0min: [2024-01-05 03:25:29,309][model8_pretrain.py][INFO] Epoch:[0/2](545900/4588595) loss:2.549 lr:0.0000100 epoch_Time:25666.0min: [2024-01-05 03:25:29,309][model8_pretrain.py][INFO] Epoch:[0/2](545900/4588595) loss:2.435 lr:0.0000100 epoch_Time:25666.0min: [2024-01-05 03:25:29,309][model8_pretrain.py][INFO] Epoch:[0/2](545900/4588595) loss:2.803 lr:0.0000100 epoch_Time:25666.0min: [2024-01-05 03:26:06,207][model8_pretrain.py][INFO] 
Epoch:[0/2](546000/4588595) loss:2.366 lr:0.0000100 epoch_Time:25664.0min: [2024-01-05 03:26:06,207][model8_pretrain.py][INFO] Epoch:[0/2](546000/4588595) loss:2.621 lr:0.0000100 epoch_Time:25664.0min: [2024-01-05 03:26:06,207][model8_pretrain.py][INFO] Epoch:[0/2](546000/4588595) loss:3.439 lr:0.0000100 epoch_Time:25664.0min: [2024-01-05 03:26:06,207][model8_pretrain.py][INFO] Epoch:[0/2](546000/4588595) loss:2.777 lr:0.0000100 epoch_Time:25664.0min: [2024-01-05 03:26:06,207][model8_pretrain.py][INFO] Epoch:[0/2](546000/4588595) loss:3.225 lr:0.0000100 epoch_Time:25664.0min: [2024-01-05 03:26:06,207][model8_pretrain.py][INFO] Epoch:[0/2](546000/4588595) loss:3.364 lr:0.0000100 epoch_Time:25664.0min: [2024-01-05 03:26:06,207][model8_pretrain.py][INFO] Epoch:[0/2](546000/4588595) loss:3.205 lr:0.0000100 epoch_Time:25664.0min: [2024-01-05 03:26:06,207][model8_pretrain.py][INFO] Epoch:[0/2](546000/4588595) loss:2.771 lr:0.0000100 epoch_Time:25664.0min: [2024-01-05 03:26:43,135][model8_pretrain.py][INFO] Epoch:[0/2](546100/4588595) loss:2.941 lr:0.0000100 epoch_Time:25664.0min: [2024-01-05 03:26:43,135][model8_pretrain.py][INFO] Epoch:[0/2](546100/4588595) loss:2.568 lr:0.0000100 epoch_Time:25664.0min: [2024-01-05 03:26:43,135][model8_pretrain.py][INFO] Epoch:[0/2](546100/4588595) loss:2.528 lr:0.0000100 epoch_Time:25664.0min: [2024-01-05 03:26:43,136][model8_pretrain.py][INFO] Epoch:[0/2](546100/4588595) loss:2.443 lr:0.0000100 epoch_Time:25664.0min: [2024-01-05 03:26:43,135][model8_pretrain.py][INFO] Epoch:[0/2](546100/4588595) loss:2.834 lr:0.0000100 epoch_Time:25664.0min: [2024-01-05 03:26:43,136][model8_pretrain.py][INFO] Epoch:[0/2](546100/4588595) loss:2.660 lr:0.0000100 epoch_Time:25664.0min: [2024-01-05 03:26:43,136][model8_pretrain.py][INFO] Epoch:[0/2](546100/4588595) loss:2.814 lr:0.0000100 epoch_Time:25664.0min: [2024-01-05 03:26:43,136][model8_pretrain.py][INFO] Epoch:[0/2](546100/4588595) loss:2.754 lr:0.0000100 epoch_Time:25664.0min: [2024-01-05 
03:27:20,061][model8_pretrain.py][INFO] Epoch:[0/2](546200/4588595) loss:2.872 lr:0.0000100 epoch_Time:25663.0min: [2024-01-05 03:27:20,061][model8_pretrain.py][INFO] Epoch:[0/2](546200/4588595) loss:3.221 lr:0.0000100 epoch_Time:25663.0min: [2024-01-05 03:27:20,061][model8_pretrain.py][INFO] Epoch:[0/2](546200/4588595) loss:2.755 lr:0.0000100 epoch_Time:25663.0min: [2024-01-05 03:27:20,061][model8_pretrain.py][INFO] Epoch:[0/2](546200/4588595) loss:2.827 lr:0.0000100 epoch_Time:25663.0min: [2024-01-05 03:27:20,061][model8_pretrain.py][INFO] Epoch:[0/2](546200/4588595) loss:2.572 lr:0.0000100 epoch_Time:25663.0min: [2024-01-05 03:27:20,061][model8_pretrain.py][INFO] Epoch:[0/2](546200/4588595) loss:2.276 lr:0.0000100 epoch_Time:25663.0min: [2024-01-05 03:27:20,061][model8_pretrain.py][INFO] Epoch:[0/2](546200/4588595) loss:3.149 lr:0.0000100 epoch_Time:25663.0min: [2024-01-05 03:27:20,061][model8_pretrain.py][INFO] Epoch:[0/2](546200/4588595) loss:3.046 lr:0.0000100 epoch_Time:25663.0min: [2024-01-05 03:28:09,157][model8_pretrain.py][INFO] Epoch:[0/2](546300/4588595) loss:3.150 lr:0.0000100 epoch_Time:25664.0min: [2024-01-05 03:28:09,157][model8_pretrain.py][INFO] Epoch:[0/2](546300/4588595) loss:2.487 lr:0.0000100 epoch_Time:25664.0min: [2024-01-05 03:28:09,157][model8_pretrain.py][INFO] Epoch:[0/2](546300/4588595) loss:2.810 lr:0.0000100 epoch_Time:25664.0min: [2024-01-05 03:28:09,157][model8_pretrain.py][INFO] Epoch:[0/2](546300/4588595) loss:2.901 lr:0.0000100 epoch_Time:25664.0min: [2024-01-05 03:28:09,157][model8_pretrain.py][INFO] Epoch:[0/2](546300/4588595) loss:3.180 lr:0.0000100 epoch_Time:25664.0min: [2024-01-05 03:28:09,157][model8_pretrain.py][INFO] Epoch:[0/2](546300/4588595) loss:2.539 lr:0.0000100 epoch_Time:25664.0min: [2024-01-05 03:28:09,157][model8_pretrain.py][INFO] Epoch:[0/2](546300/4588595) loss:2.217 lr:0.0000100 epoch_Time:25664.0min: [2024-01-05 03:28:09,157][model8_pretrain.py][INFO] Epoch:[0/2](546300/4588595) loss:2.876 lr:0.0000100 
epoch_Time:25664.0min: [2024-01-05 03:28:46,077][model8_pretrain.py][INFO] Epoch:[0/2](546400/4588595) loss:2.546 lr:0.0000100 epoch_Time:25663.0min: [2024-01-05 03:28:46,077][model8_pretrain.py][INFO] Epoch:[0/2](546400/4588595) loss:3.086 lr:0.0000100 epoch_Time:25663.0min: [2024-01-05 03:28:46,077][model8_pretrain.py][INFO] Epoch:[0/2](546400/4588595) loss:2.568 lr:0.0000100 epoch_Time:25663.0min: [2024-01-05 03:28:46,077][model8_pretrain.py][INFO] Epoch:[0/2](546400/4588595) loss:3.175 lr:0.0000100 epoch_Time:25663.0min: [2024-01-05 03:28:46,077][model8_pretrain.py][INFO] Epoch:[0/2](546400/4588595) loss:2.513 lr:0.0000100 epoch_Time:25663.0min: [2024-01-05 03:28:46,077][model8_pretrain.py][INFO] Epoch:[0/2](546400/4588595) loss:2.849 lr:0.0000100 epoch_Time:25663.0min: [2024-01-05 03:28:46,077][model8_pretrain.py][INFO] Epoch:[0/2](546400/4588595) loss:2.469 lr:0.0000100 epoch_Time:25663.0min: [2024-01-05 03:28:46,077][model8_pretrain.py][INFO] Epoch:[0/2](546400/4588595) loss:3.072 lr:0.0000100 epoch_Time:25663.0min: [2024-01-05 03:29:23,022][model8_pretrain.py][INFO] Epoch:[0/2](546500/4588595) loss:2.618 lr:0.0000100 epoch_Time:25662.0min: [2024-01-05 03:29:23,022][model8_pretrain.py][INFO] Epoch:[0/2](546500/4588595) loss:2.890 lr:0.0000100 epoch_Time:25662.0min: [2024-01-05 03:29:23,022][model8_pretrain.py][INFO] Epoch:[0/2](546500/4588595) loss:2.550 lr:0.0000100 epoch_Time:25662.0min: [2024-01-05 03:29:23,022][model8_pretrain.py][INFO] Epoch:[0/2](546500/4588595) loss:2.864 lr:0.0000100 epoch_Time:25662.0min: [2024-01-05 03:29:23,022][model8_pretrain.py][INFO] Epoch:[0/2](546500/4588595) loss:2.793 lr:0.0000100 epoch_Time:25662.0min: [2024-01-05 03:29:23,022][model8_pretrain.py][INFO] Epoch:[0/2](546500/4588595) loss:2.847 lr:0.0000100 epoch_Time:25662.0min: [2024-01-05 03:29:23,022][model8_pretrain.py][INFO] Epoch:[0/2](546500/4588595) loss:2.941 lr:0.0000100 epoch_Time:25662.0min: [2024-01-05 03:29:23,022][model8_pretrain.py][INFO] 
Epoch:[0/2](546500/4588595) loss:2.451 lr:0.0000100 epoch_Time:25662.0min: [2024-01-05 03:29:59,965][model8_pretrain.py][INFO] Epoch:[0/2](546600/4588595) loss:2.633 lr:0.0000100 epoch_Time:25661.0min: [2024-01-05 03:29:59,965][model8_pretrain.py][INFO] Epoch:[0/2](546600/4588595) loss:3.045 lr:0.0000100 epoch_Time:25661.0min: [2024-01-05 03:29:59,965][model8_pretrain.py][INFO] Epoch:[0/2](546600/4588595) loss:2.975 lr:0.0000100 epoch_Time:25661.0min: [2024-01-05 03:29:59,965][model8_pretrain.py][INFO] Epoch:[0/2](546600/4588595) loss:2.228 lr:0.0000100 epoch_Time:25661.0min: [2024-01-05 03:29:59,965][model8_pretrain.py][INFO] Epoch:[0/2](546600/4588595) loss:2.652 lr:0.0000100 epoch_Time:25661.0min: [2024-01-05 03:29:59,965][model8_pretrain.py][INFO] Epoch:[0/2](546600/4588595) loss:3.162 lr:0.0000100 epoch_Time:25661.0min: [2024-01-05 03:29:59,965][model8_pretrain.py][INFO] Epoch:[0/2](546600/4588595) loss:3.104 lr:0.0000100 epoch_Time:25661.0min: [2024-01-05 03:29:59,965][model8_pretrain.py][INFO] Epoch:[0/2](546600/4588595) loss:3.207 lr:0.0000100 epoch_Time:25661.0min: [2024-01-05 03:30:36,903][model8_pretrain.py][INFO] Epoch:[0/2](546700/4588595) loss:3.090 lr:0.0000100 epoch_Time:25661.0min: [2024-01-05 03:30:36,903][model8_pretrain.py][INFO] Epoch:[0/2](546700/4588595) loss:3.118 lr:0.0000100 epoch_Time:25661.0min: [2024-01-05 03:30:36,904][model8_pretrain.py][INFO] Epoch:[0/2](546700/4588595) loss:2.598 lr:0.0000100 epoch_Time:25661.0min: [2024-01-05 03:30:36,904][model8_pretrain.py][INFO] Epoch:[0/2](546700/4588595) loss:2.693 lr:0.0000100 epoch_Time:25661.0min: [2024-01-05 03:30:36,904][model8_pretrain.py][INFO] Epoch:[0/2](546700/4588595) loss:3.049 lr:0.0000100 epoch_Time:25661.0min: [2024-01-05 03:30:36,904][model8_pretrain.py][INFO] Epoch:[0/2](546700/4588595) loss:2.367 lr:0.0000100 epoch_Time:25661.0min: [2024-01-05 03:30:36,904][model8_pretrain.py][INFO] Epoch:[0/2](546700/4588595) loss:3.011 lr:0.0000100 epoch_Time:25661.0min: [2024-01-05 
03:30:36,904][model8_pretrain.py][INFO] Epoch:[0/2](546700/4588595) loss:2.353 lr:0.0000100 epoch_Time:25661.0min: [2024-01-05 03:31:13,831][model8_pretrain.py][INFO] Epoch:[0/2](546800/4588595) loss:3.200 lr:0.0000100 epoch_Time:25660.0min: [2024-01-05 03:31:13,831][model8_pretrain.py][INFO] Epoch:[0/2](546800/4588595) loss:3.114 lr:0.0000100 epoch_Time:25660.0min: [2024-01-05 03:31:13,831][model8_pretrain.py][INFO] Epoch:[0/2](546800/4588595) loss:2.775 lr:0.0000100 epoch_Time:25660.0min: [2024-01-05 03:31:13,831][model8_pretrain.py][INFO] Epoch:[0/2](546800/4588595) loss:3.136 lr:0.0000100 epoch_Time:25660.0min: [2024-01-05 03:31:13,831][model8_pretrain.py][INFO] Epoch:[0/2](546800/4588595) loss:3.044 lr:0.0000100 epoch_Time:25660.0min: [2024-01-05 03:31:13,831][model8_pretrain.py][INFO] Epoch:[0/2](546800/4588595) loss:3.323 lr:0.0000100 epoch_Time:25660.0min: [2024-01-05 03:31:13,831][model8_pretrain.py][INFO] Epoch:[0/2](546800/4588595) loss:2.406 lr:0.0000100 epoch_Time:25660.0min: [2024-01-05 03:31:13,831][model8_pretrain.py][INFO] Epoch:[0/2](546800/4588595) loss:3.000 lr:0.0000100 epoch_Time:25660.0min: [2024-01-05 03:31:50,763][model8_pretrain.py][INFO] Epoch:[0/2](546900/4588595) loss:3.447 lr:0.0000100 epoch_Time:25659.0min: [2024-01-05 03:31:50,763][model8_pretrain.py][INFO] Epoch:[0/2](546900/4588595) loss:3.245 lr:0.0000100 epoch_Time:25659.0min: [2024-01-05 03:31:50,763][model8_pretrain.py][INFO] Epoch:[0/2](546900/4588595) loss:2.821 lr:0.0000100 epoch_Time:25659.0min: [2024-01-05 03:31:50,763][model8_pretrain.py][INFO] Epoch:[0/2](546900/4588595) loss:2.715 lr:0.0000100 epoch_Time:25659.0min: [2024-01-05 03:31:50,763][model8_pretrain.py][INFO] Epoch:[0/2](546900/4588595) loss:2.927 lr:0.0000100 epoch_Time:25659.0min: [2024-01-05 03:31:50,763][model8_pretrain.py][INFO] Epoch:[0/2](546900/4588595) loss:2.198 lr:0.0000100 epoch_Time:25659.0min: [2024-01-05 03:31:50,763][model8_pretrain.py][INFO] Epoch:[0/2](546900/4588595) loss:2.407 lr:0.0000100 
epoch_Time:25659.0min: [2024-01-05 03:31:50,763][model8_pretrain.py][INFO] Epoch:[0/2](546900/4588595) loss:2.908 lr:0.0000100 epoch_Time:25659.0min: [2024-01-05 03:32:27,693][model8_pretrain.py][INFO] Epoch:[0/2](547000/4588595) loss:2.979 lr:0.0000100 epoch_Time:25658.0min: [2024-01-05 03:32:27,693][model8_pretrain.py][INFO] Epoch:[0/2](547000/4588595) loss:3.196 lr:0.0000100 epoch_Time:25658.0min: [2024-01-05 03:32:27,693][model8_pretrain.py][INFO] Epoch:[0/2](547000/4588595) loss:2.763 lr:0.0000100 epoch_Time:25658.0min: [2024-01-05 03:32:27,693][model8_pretrain.py][INFO] Epoch:[0/2](547000/4588595) loss:2.648 lr:0.0000100 epoch_Time:25658.0min: [2024-01-05 03:32:27,693][model8_pretrain.py][INFO] Epoch:[0/2](547000/4588595) loss:2.882 lr:0.0000100 epoch_Time:25658.0min: [2024-01-05 03:32:27,693][model8_pretrain.py][INFO] Epoch:[0/2](547000/4588595) loss:2.909 lr:0.0000100 epoch_Time:25658.0min: [2024-01-05 03:32:27,694][model8_pretrain.py][INFO] Epoch:[0/2](547000/4588595) loss:2.430 lr:0.0000100 epoch_Time:25658.0min: [2024-01-05 03:32:27,694][model8_pretrain.py][INFO] Epoch:[0/2](547000/4588595) loss:2.641 lr:0.0000100 epoch_Time:25658.0min: [2024-01-05 03:33:16,665][model8_pretrain.py][INFO] Epoch:[0/2](547100/4588595) loss:2.199 lr:0.0000100 epoch_Time:25659.0min: [2024-01-05 03:33:16,665][model8_pretrain.py][INFO] Epoch:[0/2](547100/4588595) loss:2.436 lr:0.0000100 epoch_Time:25659.0min: [2024-01-05 03:33:16,665][model8_pretrain.py][INFO] Epoch:[0/2](547100/4588595) loss:3.030 lr:0.0000100 epoch_Time:25659.0min: [2024-01-05 03:33:16,665][model8_pretrain.py][INFO] Epoch:[0/2](547100/4588595) loss:3.095 lr:0.0000100 epoch_Time:25659.0min: [2024-01-05 03:33:16,665][model8_pretrain.py][INFO] Epoch:[0/2](547100/4588595) loss:3.172 lr:0.0000100 epoch_Time:25659.0min: [2024-01-05 03:33:16,665][model8_pretrain.py][INFO] Epoch:[0/2](547100/4588595) loss:2.255 lr:0.0000100 epoch_Time:25659.0min: [2024-01-05 03:33:16,665][model8_pretrain.py][INFO] 
Epoch:[0/2](547100/4588595) loss:3.167 lr:0.0000100 epoch_Time:25659.0min: [2024-01-05 03:33:16,665][model8_pretrain.py][INFO] Epoch:[0/2](547100/4588595) loss:2.692 lr:0.0000100 epoch_Time:25659.0min: [2024-01-05 03:33:53,595][model8_pretrain.py][INFO] Epoch:[0/2](547200/4588595) loss:2.520 lr:0.0000100 epoch_Time:25658.0min: [2024-01-05 03:33:53,595][model8_pretrain.py][INFO] Epoch:[0/2](547200/4588595) loss:2.398 lr:0.0000100 epoch_Time:25658.0min: [2024-01-05 03:33:53,595][model8_pretrain.py][INFO] Epoch:[0/2](547200/4588595) loss:2.773 lr:0.0000100 epoch_Time:25658.0min: [2024-01-05 03:33:53,595][model8_pretrain.py][INFO] Epoch:[0/2](547200/4588595) loss:2.750 lr:0.0000100 epoch_Time:25658.0min: [2024-01-05 03:33:53,595][model8_pretrain.py][INFO] Epoch:[0/2](547200/4588595) loss:2.839 lr:0.0000100 epoch_Time:25658.0min: [2024-01-05 03:33:53,596][model8_pretrain.py][INFO] Epoch:[0/2](547200/4588595) loss:2.786 lr:0.0000100 epoch_Time:25658.0min: [2024-01-05 03:33:53,596][model8_pretrain.py][INFO] Epoch:[0/2](547200/4588595) loss:2.701 lr:0.0000100 epoch_Time:25658.0min: [2024-01-05 03:33:53,595][model8_pretrain.py][INFO] Epoch:[0/2](547200/4588595) loss:2.590 lr:0.0000100 epoch_Time:25658.0min: [2024-01-05 03:34:30,549][model8_pretrain.py][INFO] Epoch:[0/2](547300/4588595) loss:2.916 lr:0.0000100 epoch_Time:25658.0min: [2024-01-05 03:34:30,549][model8_pretrain.py][INFO] Epoch:[0/2](547300/4588595) loss:3.311 lr:0.0000100 epoch_Time:25658.0min: [2024-01-05 03:34:30,549][model8_pretrain.py][INFO] Epoch:[0/2](547300/4588595) loss:2.552 lr:0.0000100 epoch_Time:25658.0min: [2024-01-05 03:34:30,549][model8_pretrain.py][INFO] Epoch:[0/2](547300/4588595) loss:3.006 lr:0.0000100 epoch_Time:25658.0min: [2024-01-05 03:34:30,549][model8_pretrain.py][INFO] Epoch:[0/2](547300/4588595) loss:2.765 lr:0.0000100 epoch_Time:25658.0min: [2024-01-05 03:34:30,549][model8_pretrain.py][INFO] Epoch:[0/2](547300/4588595) loss:2.247 lr:0.0000100 epoch_Time:25658.0min: [2024-01-05 
03:34:30,549][model8_pretrain.py][INFO] Epoch:[0/2](547300/4588595) loss:2.611 lr:0.0000100 epoch_Time:25658.0min: [2024-01-05 03:34:30,549][model8_pretrain.py][INFO] Epoch:[0/2](547300/4588595) loss:3.298 lr:0.0000100 epoch_Time:25658.0min: [2024-01-05 03:35:07,502][model8_pretrain.py][INFO] Epoch:[0/2](547400/4588595) loss:2.080 lr:0.0000100 epoch_Time:25657.0min: [2024-01-05 03:35:07,502][model8_pretrain.py][INFO] Epoch:[0/2](547400/4588595) loss:3.336 lr:0.0000100 epoch_Time:25657.0min: [2024-01-05 03:35:07,502][model8_pretrain.py][INFO] Epoch:[0/2](547400/4588595) loss:3.028 lr:0.0000100 epoch_Time:25657.0min: [2024-01-05 03:35:07,502][model8_pretrain.py][INFO] Epoch:[0/2](547400/4588595) loss:2.874 lr:0.0000100 epoch_Time:25657.0min: [2024-01-05 03:35:07,502][model8_pretrain.py][INFO] Epoch:[0/2](547400/4588595) loss:2.749 lr:0.0000100 epoch_Time:25657.0min: [2024-01-05 03:35:07,502][model8_pretrain.py][INFO] Epoch:[0/2](547400/4588595) loss:3.202 lr:0.0000100 epoch_Time:25657.0min: [2024-01-05 03:35:07,502][model8_pretrain.py][INFO] Epoch:[0/2](547400/4588595) loss:3.120 lr:0.0000100 epoch_Time:25657.0min: [2024-01-05 03:35:07,502][model8_pretrain.py][INFO] Epoch:[0/2](547400/4588595) loss:2.814 lr:0.0000100 epoch_Time:25657.0min: [2024-01-05 03:35:44,454][model8_pretrain.py][INFO] Epoch:[0/2](547500/4588595) loss:2.918 lr:0.0000100 epoch_Time:25656.0min: [2024-01-05 03:35:44,455][model8_pretrain.py][INFO] Epoch:[0/2](547500/4588595) loss:3.063 lr:0.0000100 epoch_Time:25656.0min: [2024-01-05 03:35:44,454][model8_pretrain.py][INFO] Epoch:[0/2](547500/4588595) loss:3.265 lr:0.0000100 epoch_Time:25656.0min: [2024-01-05 03:35:44,455][model8_pretrain.py][INFO] Epoch:[0/2](547500/4588595) loss:2.205 lr:0.0000100 epoch_Time:25656.0min: [2024-01-05 03:35:44,455][model8_pretrain.py][INFO] Epoch:[0/2](547500/4588595) loss:2.985 lr:0.0000100 epoch_Time:25656.0min: [2024-01-05 03:35:44,455][model8_pretrain.py][INFO] Epoch:[0/2](547500/4588595) loss:2.148 lr:0.0000100 
epoch_Time:25656.0min: [2024-01-05 03:35:44,455][model8_pretrain.py][INFO] Epoch:[0/2](547500/4588595) loss:3.104 lr:0.0000100 epoch_Time:25656.0min: [2024-01-05 03:35:44,455][model8_pretrain.py][INFO] Epoch:[0/2](547500/4588595) loss:3.029 lr:0.0000100 epoch_Time:25656.0min: [2024-01-05 03:36:21,397][model8_pretrain.py][INFO] Epoch:[0/2](547600/4588595) loss:2.278 lr:0.0000100 epoch_Time:25655.0min: [2024-01-05 03:36:21,397][model8_pretrain.py][INFO] Epoch:[0/2](547600/4588595) loss:2.928 lr:0.0000100 epoch_Time:25655.0min: [2024-01-05 03:36:21,397][model8_pretrain.py][INFO] Epoch:[0/2](547600/4588595) loss:2.503 lr:0.0000100 epoch_Time:25655.0min: [2024-01-05 03:36:21,397][model8_pretrain.py][INFO] Epoch:[0/2](547600/4588595) loss:2.807 lr:0.0000100 epoch_Time:25655.0min: [2024-01-05 03:36:21,397][model8_pretrain.py][INFO] Epoch:[0/2](547600/4588595) loss:2.841 lr:0.0000100 epoch_Time:25655.0min: [2024-01-05 03:36:21,398][model8_pretrain.py][INFO] Epoch:[0/2](547600/4588595) loss:3.017 lr:0.0000100 epoch_Time:25655.0min: [2024-01-05 03:36:21,398][model8_pretrain.py][INFO] Epoch:[0/2](547600/4588595) loss:2.656 lr:0.0000100 epoch_Time:25655.0min: [2024-01-05 03:36:21,399][model8_pretrain.py][INFO] Epoch:[0/2](547600/4588595) loss:2.423 lr:0.0000100 epoch_Time:25655.0min: [2024-01-05 03:36:58,340][model8_pretrain.py][INFO] Epoch:[0/2](547700/4588595) loss:3.014 lr:0.0000100 epoch_Time:25654.0min: [2024-01-05 03:36:58,340][model8_pretrain.py][INFO] Epoch:[0/2](547700/4588595) loss:2.656 lr:0.0000100 epoch_Time:25654.0min: [2024-01-05 03:36:58,340][model8_pretrain.py][INFO] Epoch:[0/2](547700/4588595) loss:2.707 lr:0.0000100 epoch_Time:25654.0min: [2024-01-05 03:36:58,340][model8_pretrain.py][INFO] Epoch:[0/2](547700/4588595) loss:2.877 lr:0.0000100 epoch_Time:25654.0min: [2024-01-05 03:36:58,340][model8_pretrain.py][INFO] Epoch:[0/2](547700/4588595) loss:3.361 lr:0.0000100 epoch_Time:25654.0min: [2024-01-05 03:36:58,340][model8_pretrain.py][INFO] 
Epoch:[0/2](547700/4588595) loss:2.990 lr:0.0000100 epoch_Time:25654.0min: [2024-01-05 03:36:58,341][model8_pretrain.py][INFO] Epoch:[0/2](547700/4588595) loss:2.725 lr:0.0000100 epoch_Time:25654.0min: [2024-01-05 03:36:58,341][model8_pretrain.py][INFO] Epoch:[0/2](547700/4588595) loss:3.151 lr:0.0000100 epoch_Time:25654.0min: [2024-01-05 03:37:35,280][model8_pretrain.py][INFO] Epoch:[0/2](547800/4588595) loss:3.397 lr:0.0000100 epoch_Time:25654.0min: [2024-01-05 03:37:35,280][model8_pretrain.py][INFO] Epoch:[0/2](547800/4588595) loss:3.301 lr:0.0000100 epoch_Time:25654.0min: [2024-01-05 03:37:35,280][model8_pretrain.py][INFO] Epoch:[0/2](547800/4588595) loss:2.874 lr:0.0000100 epoch_Time:25654.0min: [2024-01-05 03:37:35,280][model8_pretrain.py][INFO] Epoch:[0/2](547800/4588595) loss:2.583 lr:0.0000100 epoch_Time:25654.0min: [2024-01-05 03:37:35,280][model8_pretrain.py][INFO] Epoch:[0/2](547800/4588595) loss:2.709 lr:0.0000100 epoch_Time:25654.0min: [2024-01-05 03:37:35,280][model8_pretrain.py][INFO] Epoch:[0/2](547800/4588595) loss:2.302 lr:0.0000100 epoch_Time:25654.0min: [2024-01-05 03:37:35,280][model8_pretrain.py][INFO] Epoch:[0/2](547800/4588595) loss:2.927 lr:0.0000100 epoch_Time:25654.0min: [2024-01-05 03:37:35,280][model8_pretrain.py][INFO] Epoch:[0/2](547800/4588595) loss:3.286 lr:0.0000100 epoch_Time:25654.0min: [2024-01-05 03:38:25,300][model8_pretrain.py][INFO] Epoch:[0/2](547900/4588595) loss:3.022 lr:0.0000100 epoch_Time:25655.0min: [2024-01-05 03:38:25,300][model8_pretrain.py][INFO] Epoch:[0/2](547900/4588595) loss:2.477 lr:0.0000100 epoch_Time:25655.0min: [2024-01-05 03:38:25,300][model8_pretrain.py][INFO] Epoch:[0/2](547900/4588595) loss:2.757 lr:0.0000100 epoch_Time:25655.0min: [2024-01-05 03:38:25,300][model8_pretrain.py][INFO] Epoch:[0/2](547900/4588595) loss:2.752 lr:0.0000100 epoch_Time:25655.0min: [2024-01-05 03:38:25,300][model8_pretrain.py][INFO] Epoch:[0/2](547900/4588595) loss:2.561 lr:0.0000100 epoch_Time:25655.0min: [2024-01-05 
03:38:25,300][model8_pretrain.py][INFO] Epoch:[0/2](547900/4588595) loss:3.335 lr:0.0000100 epoch_Time:25655.0min: [2024-01-05 03:38:25,300][model8_pretrain.py][INFO] Epoch:[0/2](547900/4588595) loss:3.258 lr:0.0000100 epoch_Time:25655.0min: [2024-01-05 03:38:25,300][model8_pretrain.py][INFO] Epoch:[0/2](547900/4588595) loss:2.832 lr:0.0000100 epoch_Time:25655.0min: [2024-01-05 03:39:02,225][model8_pretrain.py][INFO] Epoch:[0/2](548000/4588595) loss:2.465 lr:0.0000100 epoch_Time:25653.0min: [2024-01-05 03:39:02,225][model8_pretrain.py][INFO] Epoch:[0/2](548000/4588595) loss:2.638 lr:0.0000100 epoch_Time:25653.0min: [2024-01-05 03:39:02,225][model8_pretrain.py][INFO] Epoch:[0/2](548000/4588595) loss:3.139 lr:0.0000100 epoch_Time:25653.0min: [2024-01-05 03:39:02,225][model8_pretrain.py][INFO] Epoch:[0/2](548000/4588595) loss:2.752 lr:0.0000100 epoch_Time:25653.0min: [2024-01-05 03:39:02,225][model8_pretrain.py][INFO] Epoch:[0/2](548000/4588595) loss:3.436 lr:0.0000100 epoch_Time:25653.0min: [2024-01-05 03:39:02,225][model8_pretrain.py][INFO] Epoch:[0/2](548000/4588595) loss:2.624 lr:0.0000100 epoch_Time:25653.0min: [2024-01-05 03:39:02,225][model8_pretrain.py][INFO] Epoch:[0/2](548000/4588595) loss:3.165 lr:0.0000100 epoch_Time:25653.0min: [2024-01-05 03:39:02,225][model8_pretrain.py][INFO] Epoch:[0/2](548000/4588595) loss:2.924 lr:0.0000100 epoch_Time:25653.0min: [2024-01-05 03:39:39,157][model8_pretrain.py][INFO] Epoch:[0/2](548100/4588595) loss:3.091 lr:0.0000100 epoch_Time:25653.0min: [2024-01-05 03:39:39,157][model8_pretrain.py][INFO] Epoch:[0/2](548100/4588595) loss:3.174 lr:0.0000100 epoch_Time:25653.0min: [2024-01-05 03:39:39,157][model8_pretrain.py][INFO] Epoch:[0/2](548100/4588595) loss:2.807 lr:0.0000100 epoch_Time:25653.0min: [2024-01-05 03:39:39,157][model8_pretrain.py][INFO] Epoch:[0/2](548100/4588595) loss:2.836 lr:0.0000100 epoch_Time:25653.0min: [2024-01-05 03:39:39,157][model8_pretrain.py][INFO] Epoch:[0/2](548100/4588595) loss:3.049 lr:0.0000100 
epoch_Time:25653.0min: [2024-01-05 03:39:39,157][model8_pretrain.py][INFO] Epoch:[0/2](548100/4588595) loss:2.708 lr:0.0000100 epoch_Time:25653.0min: [2024-01-05 03:39:39,157][model8_pretrain.py][INFO] Epoch:[0/2](548100/4588595) loss:3.152 lr:0.0000100 epoch_Time:25653.0min: [2024-01-05 03:39:39,157][model8_pretrain.py][INFO] Epoch:[0/2](548100/4588595) loss:2.875 lr:0.0000100 epoch_Time:25653.0min: [2024-01-05 03:40:16,096][model8_pretrain.py][INFO] Epoch:[0/2](548200/4588595) loss:2.130 lr:0.0000100 epoch_Time:25652.0min: [2024-01-05 03:40:16,096][model8_pretrain.py][INFO] Epoch:[0/2](548200/4588595) loss:2.739 lr:0.0000100 epoch_Time:25652.0min: [2024-01-05 03:40:16,096][model8_pretrain.py][INFO] Epoch:[0/2](548200/4588595) loss:3.385 lr:0.0000100 epoch_Time:25652.0min: [2024-01-05 03:40:16,096][model8_pretrain.py][INFO] Epoch:[0/2](548200/4588595) loss:2.984 lr:0.0000100 epoch_Time:25652.0min: [2024-01-05 03:40:16,096][model8_pretrain.py][INFO] Epoch:[0/2](548200/4588595) loss:3.103 lr:0.0000100 epoch_Time:25652.0min: [2024-01-05 03:40:16,096][model8_pretrain.py][INFO] Epoch:[0/2](548200/4588595) loss:2.955 lr:0.0000100 epoch_Time:25652.0min: [2024-01-05 03:40:16,096][model8_pretrain.py][INFO] Epoch:[0/2](548200/4588595) loss:2.876 lr:0.0000100 epoch_Time:25652.0min: [2024-01-05 03:40:16,096][model8_pretrain.py][INFO] Epoch:[0/2](548200/4588595) loss:2.843 lr:0.0000100 epoch_Time:25652.0min: [2024-01-05 03:40:53,031][model8_pretrain.py][INFO] Epoch:[0/2](548300/4588595) loss:2.651 lr:0.0000100 epoch_Time:25651.0min: [2024-01-05 03:40:53,031][model8_pretrain.py][INFO] Epoch:[0/2](548300/4588595) loss:3.042 lr:0.0000100 epoch_Time:25651.0min: [2024-01-05 03:40:53,031][model8_pretrain.py][INFO] Epoch:[0/2](548300/4588595) loss:3.010 lr:0.0000100 epoch_Time:25651.0min: [2024-01-05 03:40:53,031][model8_pretrain.py][INFO] Epoch:[0/2](548300/4588595) loss:2.327 lr:0.0000100 epoch_Time:25651.0min: [2024-01-05 03:40:53,031][model8_pretrain.py][INFO] 
Epoch:[0/2](548300/4588595) loss:2.452 lr:0.0000100 epoch_Time:25651.0min: [2024-01-05 03:40:53,031][model8_pretrain.py][INFO] Epoch:[0/2](548300/4588595) loss:2.680 lr:0.0000100 epoch_Time:25651.0min: [2024-01-05 03:40:53,032][model8_pretrain.py][INFO] Epoch:[0/2](548300/4588595) loss:2.798 lr:0.0000100 epoch_Time:25651.0min: [2024-01-05 03:40:53,032][model8_pretrain.py][INFO] Epoch:[0/2](548300/4588595) loss:3.226 lr:0.0000100 epoch_Time:25651.0min: [2024-01-05 03:41:29,966][model8_pretrain.py][INFO] Epoch:[0/2](548400/4588595) loss:2.917 lr:0.0000100 epoch_Time:25651.0min: [2024-01-05 03:41:29,966][model8_pretrain.py][INFO] Epoch:[0/2](548400/4588595) loss:2.592 lr:0.0000100 epoch_Time:25651.0min: [2024-01-05 03:41:29,966][model8_pretrain.py][INFO] Epoch:[0/2](548400/4588595) loss:2.547 lr:0.0000100 epoch_Time:25651.0min: [2024-01-05 03:41:29,966][model8_pretrain.py][INFO] Epoch:[0/2](548400/4588595) loss:2.960 lr:0.0000100 epoch_Time:25651.0min: [2024-01-05 03:41:29,966][model8_pretrain.py][INFO] Epoch:[0/2](548400/4588595) loss:2.319 lr:0.0000100 epoch_Time:25651.0min: [2024-01-05 03:41:29,966][model8_pretrain.py][INFO] Epoch:[0/2](548400/4588595) loss:2.457 lr:0.0000100 epoch_Time:25651.0min: [2024-01-05 03:41:29,966][model8_pretrain.py][INFO] Epoch:[0/2](548400/4588595) loss:2.874 lr:0.0000100 epoch_Time:25651.0min: [2024-01-05 03:41:29,966][model8_pretrain.py][INFO] Epoch:[0/2](548400/4588595) loss:2.887 lr:0.0000100 epoch_Time:25651.0min: [2024-01-05 03:42:06,914][model8_pretrain.py][INFO] Epoch:[0/2](548500/4588595) loss:2.937 lr:0.0000100 epoch_Time:25650.0min: [2024-01-05 03:42:06,914][model8_pretrain.py][INFO] Epoch:[0/2](548500/4588595) loss:2.902 lr:0.0000100 epoch_Time:25650.0min: [2024-01-05 03:42:06,914][model8_pretrain.py][INFO] Epoch:[0/2](548500/4588595) loss:3.278 lr:0.0000100 epoch_Time:25650.0min: [2024-01-05 03:42:06,914][model8_pretrain.py][INFO] Epoch:[0/2](548500/4588595) loss:3.018 lr:0.0000100 epoch_Time:25650.0min: [2024-01-05 
03:42:06,914][model8_pretrain.py][INFO] Epoch:[0/2](548500/4588595) loss:2.956 lr:0.0000100 epoch_Time:25650.0min: [2024-01-05 03:42:06,914][model8_pretrain.py][INFO] Epoch:[0/2](548500/4588595) loss:2.509 lr:0.0000100 epoch_Time:25650.0min: [2024-01-05 03:42:06,914][model8_pretrain.py][INFO] Epoch:[0/2](548500/4588595) loss:3.170 lr:0.0000100 epoch_Time:25650.0min: [2024-01-05 03:42:06,914][model8_pretrain.py][INFO] Epoch:[0/2](548500/4588595) loss:3.418 lr:0.0000100 epoch_Time:25650.0min: [2024-01-05 03:42:43,850][model8_pretrain.py][INFO] Epoch:[0/2](548600/4588595) loss:3.121 lr:0.0000100 epoch_Time:25649.0min: [2024-01-05 03:42:43,850][model8_pretrain.py][INFO] Epoch:[0/2](548600/4588595) loss:3.110 lr:0.0000100 epoch_Time:25649.0min: [2024-01-05 03:42:43,850][model8_pretrain.py][INFO] Epoch:[0/2](548600/4588595) loss:2.056 lr:0.0000100 epoch_Time:25649.0min: [2024-01-05 03:42:43,850][model8_pretrain.py][INFO] Epoch:[0/2](548600/4588595) loss:3.057 lr:0.0000100 epoch_Time:25649.0min: [2024-01-05 03:42:43,851][model8_pretrain.py][INFO] Epoch:[0/2](548600/4588595) loss:2.776 lr:0.0000100 epoch_Time:25649.0min: [2024-01-05 03:42:43,851][model8_pretrain.py][INFO] Epoch:[0/2](548600/4588595) loss:3.090 lr:0.0000100 epoch_Time:25649.0min: [2024-01-05 03:42:43,851][model8_pretrain.py][INFO] Epoch:[0/2](548600/4588595) loss:2.796 lr:0.0000100 epoch_Time:25649.0min: [2024-01-05 03:42:43,851][model8_pretrain.py][INFO] Epoch:[0/2](548600/4588595) loss:2.028 lr:0.0000100 epoch_Time:25649.0min: [2024-01-05 03:43:32,604][model8_pretrain.py][INFO] Epoch:[0/2](548700/4588595) loss:2.761 lr:0.0000100 epoch_Time:25650.0min: [2024-01-05 03:43:32,604][model8_pretrain.py][INFO] Epoch:[0/2](548700/4588595) loss:2.451 lr:0.0000100 epoch_Time:25650.0min: [2024-01-05 03:43:32,604][model8_pretrain.py][INFO] Epoch:[0/2](548700/4588595) loss:2.514 lr:0.0000100 epoch_Time:25650.0min: [2024-01-05 03:43:32,604][model8_pretrain.py][INFO] Epoch:[0/2](548700/4588595) loss:2.286 lr:0.0000100 
epoch_Time:25650.0min: [2024-01-05 03:43:32,604][model8_pretrain.py][INFO] Epoch:[0/2](548700/4588595) loss:3.510 lr:0.0000100 epoch_Time:25650.0min: [2024-01-05 03:43:32,604][model8_pretrain.py][INFO] Epoch:[0/2](548700/4588595) loss:2.782 lr:0.0000100 epoch_Time:25650.0min: [2024-01-05 03:43:32,604][model8_pretrain.py][INFO] Epoch:[0/2](548700/4588595) loss:2.785 lr:0.0000100 epoch_Time:25650.0min: [2024-01-05 03:43:32,605][model8_pretrain.py][INFO] Epoch:[0/2](548700/4588595) loss:3.019 lr:0.0000100 epoch_Time:25650.0min: [2024-01-05 03:44:09,522][model8_pretrain.py][INFO] Epoch:[0/2](548800/4588595) loss:2.977 lr:0.0000100 epoch_Time:25649.0min: [2024-01-05 03:44:09,522][model8_pretrain.py][INFO] Epoch:[0/2](548800/4588595) loss:2.449 lr:0.0000100 epoch_Time:25649.0min: [2024-01-05 03:44:09,522][model8_pretrain.py][INFO] Epoch:[0/2](548800/4588595) loss:2.751 lr:0.0000100 epoch_Time:25649.0min: [2024-01-05 03:44:09,522][model8_pretrain.py][INFO] Epoch:[0/2](548800/4588595) loss:3.049 lr:0.0000100 epoch_Time:25649.0min: [2024-01-05 03:44:09,522][model8_pretrain.py][INFO] Epoch:[0/2](548800/4588595) loss:2.969 lr:0.0000100 epoch_Time:25649.0min: [2024-01-05 03:44:09,522][model8_pretrain.py][INFO] Epoch:[0/2](548800/4588595) loss:3.036 lr:0.0000100 epoch_Time:25649.0min: [2024-01-05 03:44:09,522][model8_pretrain.py][INFO] Epoch:[0/2](548800/4588595) loss:2.952 lr:0.0000100 epoch_Time:25649.0min: [2024-01-05 03:44:09,522][model8_pretrain.py][INFO] Epoch:[0/2](548800/4588595) loss:2.955 lr:0.0000100 epoch_Time:25649.0min: [2024-01-05 03:44:46,467][model8_pretrain.py][INFO] Epoch:[0/2](548900/4588595) loss:2.952 lr:0.0000100 epoch_Time:25649.0min: [2024-01-05 03:44:46,467][model8_pretrain.py][INFO] Epoch:[0/2](548900/4588595) loss:2.429 lr:0.0000100 epoch_Time:25649.0min: [2024-01-05 03:44:46,467][model8_pretrain.py][INFO] Epoch:[0/2](548900/4588595) loss:3.024 lr:0.0000100 epoch_Time:25649.0min: [2024-01-05 03:44:46,467][model8_pretrain.py][INFO] 
Epoch:[0/2](548900/4588595) loss:2.505 lr:0.0000100 epoch_Time:25649.0min: [2024-01-05 03:44:46,467][model8_pretrain.py][INFO] Epoch:[0/2](548900/4588595) loss:2.762 lr:0.0000100 epoch_Time:25649.0min: [2024-01-05 03:44:46,467][model8_pretrain.py][INFO] Epoch:[0/2](548900/4588595) loss:2.908 lr:0.0000100 epoch_Time:25649.0min: [2024-01-05 03:44:46,467][model8_pretrain.py][INFO] Epoch:[0/2](548900/4588595) loss:3.064 lr:0.0000100 epoch_Time:25649.0min: [2024-01-05 03:44:46,467][model8_pretrain.py][INFO] Epoch:[0/2](548900/4588595) loss:2.704 lr:0.0000100 epoch_Time:25649.0min: [2024-01-05 03:45:23,407][model8_pretrain.py][INFO] Epoch:[0/2](549000/4588595) loss:2.323 lr:0.0000100 epoch_Time:25647.0min: [2024-01-05 03:45:23,407][model8_pretrain.py][INFO] Epoch:[0/2](549000/4588595) loss:2.700 lr:0.0000100 epoch_Time:25647.0min: [2024-01-05 03:45:23,407][model8_pretrain.py][INFO] Epoch:[0/2](549000/4588595) loss:2.954 lr:0.0000100 epoch_Time:25647.0min: [2024-01-05 03:45:23,407][model8_pretrain.py][INFO] Epoch:[0/2](549000/4588595) loss:3.229 lr:0.0000100 epoch_Time:25647.0min: [2024-01-05 03:45:23,407][model8_pretrain.py][INFO] Epoch:[0/2](549000/4588595) loss:2.746 lr:0.0000100 epoch_Time:25647.0min: [2024-01-05 03:45:23,407][model8_pretrain.py][INFO] Epoch:[0/2](549000/4588595) loss:2.821 lr:0.0000100 epoch_Time:25647.0min: [2024-01-05 03:45:23,407][model8_pretrain.py][INFO] Epoch:[0/2](549000/4588595) loss:2.994 lr:0.0000100 epoch_Time:25647.0min: [2024-01-05 03:45:23,407][model8_pretrain.py][INFO] Epoch:[0/2](549000/4588595) loss:2.479 lr:0.0000100 epoch_Time:25647.0min: [2024-01-05 03:46:00,342][model8_pretrain.py][INFO] Epoch:[0/2](549100/4588595) loss:2.494 lr:0.0000100 epoch_Time:25646.0min: [2024-01-05 03:46:00,342][model8_pretrain.py][INFO] Epoch:[0/2](549100/4588595) loss:3.022 lr:0.0000100 epoch_Time:25646.0min: [2024-01-05 03:46:00,342][model8_pretrain.py][INFO] Epoch:[0/2](549100/4588595) loss:3.171 lr:0.0000100 epoch_Time:25646.0min: [2024-01-05 
03:46:00,342][model8_pretrain.py][INFO] Epoch:[0/2](549100/4588595) loss:2.794 lr:0.0000100 epoch_Time:25646.0min: [2024-01-05 03:46:00,342][model8_pretrain.py][INFO] Epoch:[0/2](549100/4588595) loss:2.246 lr:0.0000100 epoch_Time:25646.0min: [2024-01-05 03:46:00,342][model8_pretrain.py][INFO] Epoch:[0/2](549100/4588595) loss:3.237 lr:0.0000100 epoch_Time:25646.0min: [2024-01-05 03:46:00,342][model8_pretrain.py][INFO] Epoch:[0/2](549100/4588595) loss:2.786 lr:0.0000100 epoch_Time:25646.0min: [2024-01-05 03:46:00,342][model8_pretrain.py][INFO] Epoch:[0/2](549100/4588595) loss:2.862 lr:0.0000100 epoch_Time:25646.0min: [2024-01-05 03:46:37,269][model8_pretrain.py][INFO] Epoch:[0/2](549200/4588595) loss:2.686 lr:0.0000100 epoch_Time:25646.0min: [2024-01-05 03:46:37,269][model8_pretrain.py][INFO] Epoch:[0/2](549200/4588595) loss:2.674 lr:0.0000100 epoch_Time:25646.0min: [2024-01-05 03:46:37,269][model8_pretrain.py][INFO] Epoch:[0/2](549200/4588595) loss:2.754 lr:0.0000100 epoch_Time:25646.0min: [2024-01-05 03:46:37,269][model8_pretrain.py][INFO] Epoch:[0/2](549200/4588595) loss:3.013 lr:0.0000100 epoch_Time:25646.0min: [2024-01-05 03:46:37,269][model8_pretrain.py][INFO] Epoch:[0/2](549200/4588595) loss:2.823 lr:0.0000100 epoch_Time:25646.0min: [2024-01-05 03:46:37,269][model8_pretrain.py][INFO] Epoch:[0/2](549200/4588595) loss:2.800 lr:0.0000100 epoch_Time:25646.0min: [2024-01-05 03:46:37,269][model8_pretrain.py][INFO] Epoch:[0/2](549200/4588595) loss:2.757 lr:0.0000100 epoch_Time:25646.0min: [2024-01-05 03:46:37,270][model8_pretrain.py][INFO] Epoch:[0/2](549200/4588595) loss:2.390 lr:0.0000100 epoch_Time:25646.0min: [2024-01-05 03:47:14,189][model8_pretrain.py][INFO] Epoch:[0/2](549300/4588595) loss:2.430 lr:0.0000100 epoch_Time:25645.0min: [2024-01-05 03:47:14,189][model8_pretrain.py][INFO] Epoch:[0/2](549300/4588595) loss:2.989 lr:0.0000100 epoch_Time:25645.0min: [2024-01-05 03:47:14,189][model8_pretrain.py][INFO] Epoch:[0/2](549300/4588595) loss:3.317 lr:0.0000100 
epoch_Time:25645.0min: [2024-01-05 03:47:14,190][model8_pretrain.py][INFO] Epoch:[0/2](549300/4588595) loss:2.874 lr:0.0000100 epoch_Time:25645.0min: [2024-01-05 03:47:14,190][model8_pretrain.py][INFO] Epoch:[0/2](549300/4588595) loss:2.733 lr:0.0000100 epoch_Time:25645.0min: [2024-01-05 03:47:14,190][model8_pretrain.py][INFO] Epoch:[0/2](549300/4588595) loss:3.330 lr:0.0000100 epoch_Time:25645.0min: [2024-01-05 03:47:14,190][model8_pretrain.py][INFO] Epoch:[0/2](549300/4588595) loss:1.936 lr:0.0000100 epoch_Time:25645.0min: [2024-01-05 03:47:14,190][model8_pretrain.py][INFO] Epoch:[0/2](549300/4588595) loss:2.994 lr:0.0000100 epoch_Time:25645.0min: [2024-01-05 03:47:51,113][model8_pretrain.py][INFO] Epoch:[0/2](549400/4588595) loss:3.132 lr:0.0000100 epoch_Time:25644.0min: [2024-01-05 03:47:51,113][model8_pretrain.py][INFO] Epoch:[0/2](549400/4588595) loss:2.824 lr:0.0000100 epoch_Time:25644.0min: [2024-01-05 03:47:51,113][model8_pretrain.py][INFO] Epoch:[0/2](549400/4588595) loss:3.289 lr:0.0000100 epoch_Time:25644.0min: [2024-01-05 03:47:51,113][model8_pretrain.py][INFO] Epoch:[0/2](549400/4588595) loss:3.165 lr:0.0000100 epoch_Time:25644.0min: [2024-01-05 03:47:51,113][model8_pretrain.py][INFO] Epoch:[0/2](549400/4588595) loss:2.540 lr:0.0000100 epoch_Time:25644.0min: [2024-01-05 03:47:51,113][model8_pretrain.py][INFO] Epoch:[0/2](549400/4588595) loss:3.319 lr:0.0000100 epoch_Time:25644.0min: [2024-01-05 03:47:51,113][model8_pretrain.py][INFO] Epoch:[0/2](549400/4588595) loss:2.648 lr:0.0000100 epoch_Time:25644.0min: [2024-01-05 03:47:51,114][model8_pretrain.py][INFO] Epoch:[0/2](549400/4588595) loss:2.378 lr:0.0000100 epoch_Time:25644.0min: [2024-01-05 03:48:38,320][model8_pretrain.py][INFO] Epoch:[0/2](549500/4588595) loss:2.890 lr:0.0000100 epoch_Time:25645.0min: [2024-01-05 03:48:38,320][model8_pretrain.py][INFO] Epoch:[0/2](549500/4588595) loss:3.288 lr:0.0000100 epoch_Time:25645.0min: [2024-01-05 03:48:38,320][model8_pretrain.py][INFO] 
Epoch:[0/2](549500/4588595) loss:2.865 lr:0.0000100 epoch_Time:25645.0min: [2024-01-05 03:48:38,320][model8_pretrain.py][INFO] Epoch:[0/2](549500/4588595) loss:2.350 lr:0.0000100 epoch_Time:25645.0min: [2024-01-05 03:48:38,320][model8_pretrain.py][INFO] Epoch:[0/2](549500/4588595) loss:2.733 lr:0.0000100 epoch_Time:25645.0min: [2024-01-05 03:48:38,320][model8_pretrain.py][INFO] Epoch:[0/2](549500/4588595) loss:2.861 lr:0.0000100 epoch_Time:25645.0min: [2024-01-05 03:48:38,320][model8_pretrain.py][INFO] Epoch:[0/2](549500/4588595) loss:2.458 lr:0.0000100 epoch_Time:25645.0min: [2024-01-05 03:48:38,320][model8_pretrain.py][INFO] Epoch:[0/2](549500/4588595) loss:3.093 lr:0.0000100 epoch_Time:25645.0min: [2024-01-05 03:49:16,932][model8_pretrain.py][INFO] Epoch:[0/2](549600/4588595) loss:2.528 lr:0.0000100 epoch_Time:25644.0min: [2024-01-05 03:49:16,932][model8_pretrain.py][INFO] Epoch:[0/2](549600/4588595) loss:3.236 lr:0.0000100 epoch_Time:25644.0min: [2024-01-05 03:49:16,932][model8_pretrain.py][INFO] Epoch:[0/2](549600/4588595) loss:3.004 lr:0.0000100 epoch_Time:25644.0min: [2024-01-05 03:49:16,932][model8_pretrain.py][INFO] Epoch:[0/2](549600/4588595) loss:2.312 lr:0.0000100 epoch_Time:25644.0min: [2024-01-05 03:49:16,932][model8_pretrain.py][INFO] Epoch:[0/2](549600/4588595) loss:2.697 lr:0.0000100 epoch_Time:25644.0min: [2024-01-05 03:49:16,932][model8_pretrain.py][INFO] Epoch:[0/2](549600/4588595) loss:2.736 lr:0.0000100 epoch_Time:25644.0min: [2024-01-05 03:49:16,932][model8_pretrain.py][INFO] Epoch:[0/2](549600/4588595) loss:2.604 lr:0.0000100 epoch_Time:25644.0min: [2024-01-05 03:49:16,932][model8_pretrain.py][INFO] Epoch:[0/2](549600/4588595) loss:2.864 lr:0.0000100 epoch_Time:25644.0min: [2024-01-05 03:49:53,873][model8_pretrain.py][INFO] Epoch:[0/2](549700/4588595) loss:2.898 lr:0.0000100 epoch_Time:25643.0min: [2024-01-05 03:49:53,873][model8_pretrain.py][INFO] Epoch:[0/2](549700/4588595) loss:3.129 lr:0.0000100 epoch_Time:25643.0min: [2024-01-05 
03:49:53,873][model8_pretrain.py][INFO] Epoch:[0/2](549700/4588595) loss:3.432 lr:0.0000100 epoch_Time:25643.0min: [2024-01-05 03:49:53,873][model8_pretrain.py][INFO] Epoch:[0/2](549700/4588595) loss:2.409 lr:0.0000100 epoch_Time:25643.0min: [2024-01-05 03:49:53,873][model8_pretrain.py][INFO] Epoch:[0/2](549700/4588595) loss:2.858 lr:0.0000100 epoch_Time:25643.0min: [2024-01-05 03:49:53,873][model8_pretrain.py][INFO] Epoch:[0/2](549700/4588595) loss:2.845 lr:0.0000100 epoch_Time:25643.0min: [2024-01-05 03:49:53,873][model8_pretrain.py][INFO] Epoch:[0/2](549700/4588595) loss:2.634 lr:0.0000100 epoch_Time:25643.0min: [2024-01-05 03:49:53,874][model8_pretrain.py][INFO] Epoch:[0/2](549700/4588595) loss:2.437 lr:0.0000100 epoch_Time:25643.0min: [2024-01-05 03:50:30,837][model8_pretrain.py][INFO] Epoch:[0/2](549800/4588595) loss:2.329 lr:0.0000100 epoch_Time:25643.0min: [2024-01-05 03:50:30,837][model8_pretrain.py][INFO] Epoch:[0/2](549800/4588595) loss:2.787 lr:0.0000100 epoch_Time:25643.0min: [2024-01-05 03:50:30,837][model8_pretrain.py][INFO] Epoch:[0/2](549800/4588595) loss:2.127 lr:0.0000100 epoch_Time:25643.0min: [2024-01-05 03:50:30,837][model8_pretrain.py][INFO] Epoch:[0/2](549800/4588595) loss:3.070 lr:0.0000100 epoch_Time:25643.0min: [2024-01-05 03:50:30,837][model8_pretrain.py][INFO] Epoch:[0/2](549800/4588595) loss:3.503 lr:0.0000100 epoch_Time:25643.0min: [2024-01-05 03:50:30,837][model8_pretrain.py][INFO] Epoch:[0/2](549800/4588595) loss:2.750 lr:0.0000100 epoch_Time:25643.0min: [2024-01-05 03:50:30,837][model8_pretrain.py][INFO] Epoch:[0/2](549800/4588595) loss:2.686 lr:0.0000100 epoch_Time:25643.0min: [2024-01-05 03:50:30,837][model8_pretrain.py][INFO] Epoch:[0/2](549800/4588595) loss:2.735 lr:0.0000100 epoch_Time:25643.0min: [2024-01-05 03:51:07,786][model8_pretrain.py][INFO] Epoch:[0/2](549900/4588595) loss:2.674 lr:0.0000100 epoch_Time:25642.0min: [2024-01-05 03:51:07,786][model8_pretrain.py][INFO] Epoch:[0/2](549900/4588595) loss:2.643 lr:0.0000100 
epoch_Time:25642.0min: [2024-01-05 03:51:07,786][model8_pretrain.py][INFO] Epoch:[0/2](549900/4588595) loss:2.650 lr:0.0000100 epoch_Time:25642.0min: [2024-01-05 03:51:07,786][model8_pretrain.py][INFO] Epoch:[0/2](549900/4588595) loss:2.488 lr:0.0000100 epoch_Time:25642.0min: [2024-01-05 03:51:07,787][model8_pretrain.py][INFO] Epoch:[0/2](549900/4588595) loss:2.794 lr:0.0000100 epoch_Time:25642.0min: [2024-01-05 03:51:07,787][model8_pretrain.py][INFO] Epoch:[0/2](549900/4588595) loss:2.587 lr:0.0000100 epoch_Time:25642.0min: [2024-01-05 03:51:07,787][model8_pretrain.py][INFO] Epoch:[0/2](549900/4588595) loss:2.792 lr:0.0000100 epoch_Time:25642.0min: [2024-01-05 03:51:07,787][model8_pretrain.py][INFO] Epoch:[0/2](549900/4588595) loss:3.040 lr:0.0000100 epoch_Time:25642.0min: [2024-01-05 03:51:44,733][model8_pretrain.py][INFO] Epoch:[0/2](550000/4588595) loss:3.018 lr:0.0000100 epoch_Time:25641.0min: [2024-01-05 03:51:44,733][model8_pretrain.py][INFO] Epoch:[0/2](550000/4588595) loss:2.133 lr:0.0000100 epoch_Time:25641.0min: [2024-01-05 03:51:44,733][model8_pretrain.py][INFO] Epoch:[0/2](550000/4588595) loss:2.513 lr:0.0000100 epoch_Time:25641.0min: [2024-01-05 03:51:44,733][model8_pretrain.py][INFO] Epoch:[0/2](550000/4588595) loss:2.690 lr:0.0000100 epoch_Time:25641.0min: [2024-01-05 03:51:44,733][model8_pretrain.py][INFO] Epoch:[0/2](550000/4588595) loss:2.880 lr:0.0000100 epoch_Time:25641.0min: [2024-01-05 03:51:44,733][model8_pretrain.py][INFO] Epoch:[0/2](550000/4588595) loss:2.701 lr:0.0000100 epoch_Time:25641.0min: [2024-01-05 03:51:44,733][model8_pretrain.py][INFO] Epoch:[0/2](550000/4588595) loss:2.974 lr:0.0000100 epoch_Time:25641.0min: [2024-01-05 03:51:44,733][model8_pretrain.py][INFO] Epoch:[0/2](550000/4588595) loss:2.417 lr:0.0000100 epoch_Time:25641.0min: [2024-01-05 03:52:21,674][model8_pretrain.py][INFO] Epoch:[0/2](550100/4588595) loss:2.575 lr:0.0000100 epoch_Time:25640.0min: [2024-01-05 03:52:21,674][model8_pretrain.py][INFO] 
Epoch:[0/2](550100/4588595) loss:2.631 lr:0.0000100 epoch_Time:25640.0min: [2024-01-05 03:52:21,674][model8_pretrain.py][INFO] Epoch:[0/2](550100/4588595) loss:2.820 lr:0.0000100 epoch_Time:25640.0min: [2024-01-05 03:52:21,674][model8_pretrain.py][INFO] Epoch:[0/2](550100/4588595) loss:2.787 lr:0.0000100 epoch_Time:25640.0min: [2024-01-05 03:52:21,674][model8_pretrain.py][INFO] Epoch:[0/2](550100/4588595) loss:3.184 lr:0.0000100 epoch_Time:25640.0min: [2024-01-05 03:52:21,674][model8_pretrain.py][INFO] Epoch:[0/2](550100/4588595) loss:2.868 lr:0.0000100 epoch_Time:25640.0min: [2024-01-05 03:52:21,674][model8_pretrain.py][INFO] Epoch:[0/2](550100/4588595) loss:2.989 lr:0.0000100 epoch_Time:25640.0min: [2024-01-05 03:52:21,674][model8_pretrain.py][INFO] Epoch:[0/2](550100/4588595) loss:3.122 lr:0.0000100 epoch_Time:25640.0min: [2024-01-05 03:52:58,618][model8_pretrain.py][INFO] Epoch:[0/2](550200/4588595) loss:2.729 lr:0.0000100 epoch_Time:25639.0min: [2024-01-05 03:52:58,618][model8_pretrain.py][INFO] Epoch:[0/2](550200/4588595) loss:2.412 lr:0.0000100 epoch_Time:25639.0min: [2024-01-05 03:52:58,618][model8_pretrain.py][INFO] Epoch:[0/2](550200/4588595) loss:2.916 lr:0.0000100 epoch_Time:25639.0min: [2024-01-05 03:52:58,618][model8_pretrain.py][INFO] Epoch:[0/2](550200/4588595) loss:2.707 lr:0.0000100 epoch_Time:25639.0min: [2024-01-05 03:52:58,618][model8_pretrain.py][INFO] Epoch:[0/2](550200/4588595) loss:3.103 lr:0.0000100 epoch_Time:25639.0min: [2024-01-05 03:52:58,618][model8_pretrain.py][INFO] Epoch:[0/2](550200/4588595) loss:2.855 lr:0.0000100 epoch_Time:25639.0min: [2024-01-05 03:52:58,618][model8_pretrain.py][INFO] Epoch:[0/2](550200/4588595) loss:2.653 lr:0.0000100 epoch_Time:25639.0min: [2024-01-05 03:52:58,618][model8_pretrain.py][INFO] Epoch:[0/2](550200/4588595) loss:2.866 lr:0.0000100 epoch_Time:25639.0min: [2024-01-05 03:53:45,597][model8_pretrain.py][INFO] Epoch:[0/2](550300/4588595) loss:3.218 lr:0.0000100 epoch_Time:25640.0min: [2024-01-05 
03:53:45,597][model8_pretrain.py][INFO] Epoch:[0/2](550300/4588595) loss:2.319 lr:0.0000100 epoch_Time:25640.0min: [2024-01-05 03:53:45,597][model8_pretrain.py][INFO] Epoch:[0/2](550300/4588595) loss:2.687 lr:0.0000100 epoch_Time:25640.0min: [2024-01-05 03:53:45,597][model8_pretrain.py][INFO] Epoch:[0/2](550300/4588595) loss:2.594 lr:0.0000100 epoch_Time:25640.0min: [2024-01-05 03:53:45,597][model8_pretrain.py][INFO] Epoch:[0/2](550300/4588595) loss:2.989 lr:0.0000100 epoch_Time:25640.0min: [2024-01-05 03:53:45,597][model8_pretrain.py][INFO] Epoch:[0/2](550300/4588595) loss:2.708 lr:0.0000100 epoch_Time:25640.0min: [2024-01-05 03:53:45,597][model8_pretrain.py][INFO] Epoch:[0/2](550300/4588595) loss:3.059 lr:0.0000100 epoch_Time:25640.0min: [2024-01-05 03:53:45,597][model8_pretrain.py][INFO] Epoch:[0/2](550300/4588595) loss:2.477 lr:0.0000100 epoch_Time:25640.0min: [2024-01-05 03:54:24,301][model8_pretrain.py][INFO] Epoch:[0/2](550400/4588595) loss:2.754 lr:0.0000100 epoch_Time:25639.0min: [2024-01-05 03:54:24,301][model8_pretrain.py][INFO] Epoch:[0/2](550400/4588595) loss:2.190 lr:0.0000100 epoch_Time:25639.0min: [2024-01-05 03:54:24,301][model8_pretrain.py][INFO] Epoch:[0/2](550400/4588595) loss:3.170 lr:0.0000100 epoch_Time:25639.0min: [2024-01-05 03:54:24,301][model8_pretrain.py][INFO] Epoch:[0/2](550400/4588595) loss:2.658 lr:0.0000100 epoch_Time:25639.0min: [2024-01-05 03:54:24,301][model8_pretrain.py][INFO] Epoch:[0/2](550400/4588595) loss:3.187 lr:0.0000100 epoch_Time:25639.0min: [2024-01-05 03:54:24,301][model8_pretrain.py][INFO] Epoch:[0/2](550400/4588595) loss:2.173 lr:0.0000100 epoch_Time:25639.0min: [2024-01-05 03:54:24,301][model8_pretrain.py][INFO] Epoch:[0/2](550400/4588595) loss:2.116 lr:0.0000100 epoch_Time:25639.0min: [2024-01-05 03:54:24,301][model8_pretrain.py][INFO] Epoch:[0/2](550400/4588595) loss:3.211 lr:0.0000100 epoch_Time:25639.0min: [2024-01-05 03:55:01,246][model8_pretrain.py][INFO] Epoch:[0/2](550500/4588595) loss:2.491 lr:0.0000100 
epoch_Time:25638.0min: [2024-01-05 03:55:01,246][model8_pretrain.py][INFO] Epoch:[0/2](550500/4588595) loss:2.537 lr:0.0000100 epoch_Time:25638.0min: [2024-01-05 03:55:01,246][model8_pretrain.py][INFO] Epoch:[0/2](550500/4588595) loss:2.805 lr:0.0000100 epoch_Time:25638.0min: [2024-01-05 03:55:01,246][model8_pretrain.py][INFO] Epoch:[0/2](550500/4588595) loss:3.157 lr:0.0000100 epoch_Time:25638.0min: [2024-01-05 03:55:01,246][model8_pretrain.py][INFO] Epoch:[0/2](550500/4588595) loss:2.647 lr:0.0000100 epoch_Time:25638.0min: [2024-01-05 03:55:01,246][model8_pretrain.py][INFO] Epoch:[0/2](550500/4588595) loss:2.538 lr:0.0000100 epoch_Time:25638.0min: [2024-01-05 03:55:01,246][model8_pretrain.py][INFO] Epoch:[0/2](550500/4588595) loss:3.171 lr:0.0000100 epoch_Time:25638.0min: [2024-01-05 03:55:01,246][model8_pretrain.py][INFO] Epoch:[0/2](550500/4588595) loss:2.663 lr:0.0000100 epoch_Time:25638.0min: [2024-01-05 03:55:38,190][model8_pretrain.py][INFO] Epoch:[0/2](550600/4588595) loss:2.123 lr:0.0000100 epoch_Time:25638.0min: [2024-01-05 03:55:38,190][model8_pretrain.py][INFO] Epoch:[0/2](550600/4588595) loss:2.874 lr:0.0000100 epoch_Time:25638.0min: [2024-01-05 03:55:38,190][model8_pretrain.py][INFO] Epoch:[0/2](550600/4588595) loss:2.643 lr:0.0000100 epoch_Time:25638.0min: [2024-01-05 03:55:38,190][model8_pretrain.py][INFO] Epoch:[0/2](550600/4588595) loss:3.013 lr:0.0000100 epoch_Time:25638.0min: [2024-01-05 03:55:38,190][model8_pretrain.py][INFO] Epoch:[0/2](550600/4588595) loss:2.663 lr:0.0000100 epoch_Time:25638.0min: [2024-01-05 03:55:38,190][model8_pretrain.py][INFO] Epoch:[0/2](550600/4588595) loss:3.332 lr:0.0000100 epoch_Time:25638.0min: [2024-01-05 03:55:38,190][model8_pretrain.py][INFO] Epoch:[0/2](550600/4588595) loss:2.319 lr:0.0000100 epoch_Time:25638.0min: [2024-01-05 03:55:38,190][model8_pretrain.py][INFO] Epoch:[0/2](550600/4588595) loss:3.422 lr:0.0000100 epoch_Time:25638.0min: [2024-01-05 03:56:15,155][model8_pretrain.py][INFO] 
Epoch:[0/2](550700/4588595) loss:2.558 lr:0.0000100 epoch_Time:25637.0min: [2024-01-05 03:56:15,155][model8_pretrain.py][INFO] Epoch:[0/2](550700/4588595) loss:2.954 lr:0.0000100 epoch_Time:25637.0min: [2024-01-05 03:56:15,155][model8_pretrain.py][INFO] Epoch:[0/2](550700/4588595) loss:2.754 lr:0.0000100 epoch_Time:25637.0min: [2024-01-05 03:56:15,155][model8_pretrain.py][INFO] Epoch:[0/2](550700/4588595) loss:2.823 lr:0.0000100 epoch_Time:25637.0min: [2024-01-05 03:56:15,155][model8_pretrain.py][INFO] Epoch:[0/2](550700/4588595) loss:2.965 lr:0.0000100 epoch_Time:25637.0min: [2024-01-05 03:56:15,155][model8_pretrain.py][INFO] Epoch:[0/2](550700/4588595) loss:2.122 lr:0.0000100 epoch_Time:25637.0min: [2024-01-05 03:56:15,155][model8_pretrain.py][INFO] Epoch:[0/2](550700/4588595) loss:2.780 lr:0.0000100 epoch_Time:25637.0min: [2024-01-05 03:56:15,155][model8_pretrain.py][INFO] Epoch:[0/2](550700/4588595) loss:2.829 lr:0.0000100 epoch_Time:25637.0min: [2024-01-05 03:56:52,092][model8_pretrain.py][INFO] Epoch:[0/2](550800/4588595) loss:2.773 lr:0.0000100 epoch_Time:25636.0min: [2024-01-05 03:56:52,092][model8_pretrain.py][INFO] Epoch:[0/2](550800/4588595) loss:3.367 lr:0.0000100 epoch_Time:25636.0min: [2024-01-05 03:56:52,092][model8_pretrain.py][INFO] Epoch:[0/2](550800/4588595) loss:2.719 lr:0.0000100 epoch_Time:25636.0min: [2024-01-05 03:56:52,092][model8_pretrain.py][INFO] Epoch:[0/2](550800/4588595) loss:3.049 lr:0.0000100 epoch_Time:25636.0min: [2024-01-05 03:56:52,092][model8_pretrain.py][INFO] Epoch:[0/2](550800/4588595) loss:3.398 lr:0.0000100 epoch_Time:25636.0min: [2024-01-05 03:56:52,092][model8_pretrain.py][INFO] Epoch:[0/2](550800/4588595) loss:2.810 lr:0.0000100 epoch_Time:25636.0min: [2024-01-05 03:56:52,092][model8_pretrain.py][INFO] Epoch:[0/2](550800/4588595) loss:2.714 lr:0.0000100 epoch_Time:25636.0min: [2024-01-05 03:56:52,092][model8_pretrain.py][INFO] Epoch:[0/2](550800/4588595) loss:2.379 lr:0.0000100 epoch_Time:25636.0min: [2024-01-05 
03:57:29,019][model8_pretrain.py][INFO] Epoch:[0/2](550900/4588595) loss:2.670 lr:0.0000100 epoch_Time:25636.0min: [2024-01-05 03:57:29,019][model8_pretrain.py][INFO] Epoch:[0/2](550900/4588595) loss:3.363 lr:0.0000100 epoch_Time:25636.0min: [2024-01-05 03:57:29,019][model8_pretrain.py][INFO] Epoch:[0/2](550900/4588595) loss:2.111 lr:0.0000100 epoch_Time:25636.0min: [2024-01-05 03:57:29,019][model8_pretrain.py][INFO] Epoch:[0/2](550900/4588595) loss:3.093 lr:0.0000100 epoch_Time:25636.0min: [2024-01-05 03:57:29,019][model8_pretrain.py][INFO] Epoch:[0/2](550900/4588595) loss:2.660 lr:0.0000100 epoch_Time:25636.0min: [2024-01-05 03:57:29,019][model8_pretrain.py][INFO] Epoch:[0/2](550900/4588595) loss:2.464 lr:0.0000100 epoch_Time:25636.0min: [2024-01-05 03:57:29,019][model8_pretrain.py][INFO] Epoch:[0/2](550900/4588595) loss:2.919 lr:0.0000100 epoch_Time:25636.0min: [2024-01-05 03:57:29,019][model8_pretrain.py][INFO] Epoch:[0/2](550900/4588595) loss:3.196 lr:0.0000100 epoch_Time:25636.0min: [2024-01-05 03:58:05,953][model8_pretrain.py][INFO] Epoch:[0/2](551000/4588595) loss:3.091 lr:0.0000100 epoch_Time:25634.0min: [2024-01-05 03:58:05,953][model8_pretrain.py][INFO] Epoch:[0/2](551000/4588595) loss:2.857 lr:0.0000100 epoch_Time:25634.0min: [2024-01-05 03:58:05,953][model8_pretrain.py][INFO] Epoch:[0/2](551000/4588595) loss:2.372 lr:0.0000100 epoch_Time:25634.0min: [2024-01-05 03:58:05,953][model8_pretrain.py][INFO] Epoch:[0/2](551000/4588595) loss:3.050 lr:0.0000100 epoch_Time:25634.0min: [2024-01-05 03:58:05,953][model8_pretrain.py][INFO] Epoch:[0/2](551000/4588595) loss:2.888 lr:0.0000100 epoch_Time:25634.0min: [2024-01-05 03:58:05,953][model8_pretrain.py][INFO] Epoch:[0/2](551000/4588595) loss:2.684 lr:0.0000100 epoch_Time:25634.0min: [2024-01-05 03:58:05,953][model8_pretrain.py][INFO] Epoch:[0/2](551000/4588595) loss:2.643 lr:0.0000100 epoch_Time:25634.0min: [2024-01-05 03:58:05,953][model8_pretrain.py][INFO] Epoch:[0/2](551000/4588595) loss:2.998 lr:0.0000100 
epoch_Time:25634.0min: [2024-01-05 03:58:49,615][model8_pretrain.py][INFO] Epoch:[0/2](551100/4588595) loss:2.848 lr:0.0000100 epoch_Time:25634.0min: [2024-01-05 03:58:49,615][model8_pretrain.py][INFO] Epoch:[0/2](551100/4588595) loss:2.858 lr:0.0000100 epoch_Time:25634.0min: [2024-01-05 03:58:49,615][model8_pretrain.py][INFO] Epoch:[0/2](551100/4588595) loss:3.082 lr:0.0000100 epoch_Time:25634.0min: [2024-01-05 03:58:49,615][model8_pretrain.py][INFO] Epoch:[0/2](551100/4588595) loss:3.128 lr:0.0000100 epoch_Time:25634.0min: [2024-01-05 03:58:49,615][model8_pretrain.py][INFO] Epoch:[0/2](551100/4588595) loss:2.241 lr:0.0000100 epoch_Time:25634.0min: [2024-01-05 03:58:49,615][model8_pretrain.py][INFO] Epoch:[0/2](551100/4588595) loss:3.080 lr:0.0000100 epoch_Time:25634.0min: [2024-01-05 03:58:49,615][model8_pretrain.py][INFO] Epoch:[0/2](551100/4588595) loss:2.877 lr:0.0000100 epoch_Time:25634.0min: [2024-01-05 03:58:49,615][model8_pretrain.py][INFO] Epoch:[0/2](551100/4588595) loss:2.622 lr:0.0000100 epoch_Time:25634.0min: [2024-01-05 03:59:31,663][model8_pretrain.py][INFO] Epoch:[0/2](551200/4588595) loss:2.815 lr:0.0000100 epoch_Time:25635.0min: [2024-01-05 03:59:31,663][model8_pretrain.py][INFO] Epoch:[0/2](551200/4588595) loss:2.985 lr:0.0000100 epoch_Time:25635.0min: [2024-01-05 03:59:31,663][model8_pretrain.py][INFO] Epoch:[0/2](551200/4588595) loss:2.979 lr:0.0000100 epoch_Time:25635.0min: [2024-01-05 03:59:31,663][model8_pretrain.py][INFO] Epoch:[0/2](551200/4588595) loss:2.599 lr:0.0000100 epoch_Time:25635.0min: [2024-01-05 03:59:31,663][model8_pretrain.py][INFO] Epoch:[0/2](551200/4588595) loss:3.029 lr:0.0000100 epoch_Time:25635.0min: [2024-01-05 03:59:31,663][model8_pretrain.py][INFO] Epoch:[0/2](551200/4588595) loss:2.119 lr:0.0000100 epoch_Time:25635.0min: [2024-01-05 03:59:31,663][model8_pretrain.py][INFO] Epoch:[0/2](551200/4588595) loss:3.400 lr:0.0000100 epoch_Time:25635.0min: [2024-01-05 03:59:31,663][model8_pretrain.py][INFO] 
Epoch:[0/2](551200/4588595) loss:2.157 lr:0.0000100 epoch_Time:25635.0min: [2024-01-05 04:00:08,609][model8_pretrain.py][INFO] Epoch:[0/2](551300/4588595) loss:3.022 lr:0.0000100 epoch_Time:25634.0min: [2024-01-05 04:00:08,609][model8_pretrain.py][INFO] Epoch:[0/2](551300/4588595) loss:2.865 lr:0.0000100 epoch_Time:25634.0min: [2024-01-05 04:00:08,609][model8_pretrain.py][INFO] Epoch:[0/2](551300/4588595) loss:2.685 lr:0.0000100 epoch_Time:25634.0min: [2024-01-05 04:00:08,609][model8_pretrain.py][INFO] Epoch:[0/2](551300/4588595) loss:2.919 lr:0.0000100 epoch_Time:25634.0min: [2024-01-05 04:00:08,609][model8_pretrain.py][INFO] Epoch:[0/2](551300/4588595) loss:3.309 lr:0.0000100 epoch_Time:25634.0min: [2024-01-05 04:00:08,609][model8_pretrain.py][INFO] Epoch:[0/2](551300/4588595) loss:2.806 lr:0.0000100 epoch_Time:25634.0min: [2024-01-05 04:00:08,609][model8_pretrain.py][INFO] Epoch:[0/2](551300/4588595) loss:2.308 lr:0.0000100 epoch_Time:25634.0min: [2024-01-05 04:00:08,609][model8_pretrain.py][INFO] Epoch:[0/2](551300/4588595) loss:3.061 lr:0.0000100 epoch_Time:25634.0min: [2024-01-05 04:00:45,550][model8_pretrain.py][INFO] Epoch:[0/2](551400/4588595) loss:2.583 lr:0.0000100 epoch_Time:25633.0min: [2024-01-05 04:00:45,549][model8_pretrain.py][INFO] Epoch:[0/2](551400/4588595) loss:3.306 lr:0.0000100 epoch_Time:25633.0min: [2024-01-05 04:00:45,550][model8_pretrain.py][INFO] Epoch:[0/2](551400/4588595) loss:2.781 lr:0.0000100 epoch_Time:25633.0min: [2024-01-05 04:00:45,550][model8_pretrain.py][INFO] Epoch:[0/2](551400/4588595) loss:3.147 lr:0.0000100 epoch_Time:25633.0min: [2024-01-05 04:00:45,550][model8_pretrain.py][INFO] Epoch:[0/2](551400/4588595) loss:2.802 lr:0.0000100 epoch_Time:25634.0min: [2024-01-05 04:00:45,550][model8_pretrain.py][INFO] Epoch:[0/2](551400/4588595) loss:2.794 lr:0.0000100 epoch_Time:25633.0min: [2024-01-05 04:00:45,550][model8_pretrain.py][INFO] Epoch:[0/2](551400/4588595) loss:3.174 lr:0.0000100 epoch_Time:25633.0min: [2024-01-05 
04:00:45,550][model8_pretrain.py][INFO] Epoch:[0/2](551400/4588595) loss:3.233 lr:0.0000100 epoch_Time:25633.0min: [2024-01-05 04:01:22,501][model8_pretrain.py][INFO] Epoch:[0/2](551500/4588595) loss:2.858 lr:0.0000100 epoch_Time:25632.0min: [2024-01-05 04:01:22,501][model8_pretrain.py][INFO] Epoch:[0/2](551500/4588595) loss:2.741 lr:0.0000100 epoch_Time:25632.0min: [2024-01-05 04:01:22,501][model8_pretrain.py][INFO] Epoch:[0/2](551500/4588595) loss:3.073 lr:0.0000100 epoch_Time:25632.0min: [2024-01-05 04:01:22,501][model8_pretrain.py][INFO] Epoch:[0/2](551500/4588595) loss:3.101 lr:0.0000100 epoch_Time:25632.0min: [2024-01-05 04:01:22,501][model8_pretrain.py][INFO] Epoch:[0/2](551500/4588595) loss:2.445 lr:0.0000100 epoch_Time:25632.0min: [2024-01-05 04:01:22,501][model8_pretrain.py][INFO] Epoch:[0/2](551500/4588595) loss:2.191 lr:0.0000100 epoch_Time:25632.0min: [2024-01-05 04:01:22,501][model8_pretrain.py][INFO] Epoch:[0/2](551500/4588595) loss:3.152 lr:0.0000100 epoch_Time:25632.0min: [2024-01-05 04:01:22,502][model8_pretrain.py][INFO] Epoch:[0/2](551500/4588595) loss:3.132 lr:0.0000100 epoch_Time:25632.0min: [2024-01-05 04:01:59,442][model8_pretrain.py][INFO] Epoch:[0/2](551600/4588595) loss:2.956 lr:0.0000100 epoch_Time:25631.0min: [2024-01-05 04:01:59,442][model8_pretrain.py][INFO] Epoch:[0/2](551600/4588595) loss:2.345 lr:0.0000100 epoch_Time:25631.0min: [2024-01-05 04:01:59,442][model8_pretrain.py][INFO] Epoch:[0/2](551600/4588595) loss:2.553 lr:0.0000100 epoch_Time:25631.0min: [2024-01-05 04:01:59,442][model8_pretrain.py][INFO] Epoch:[0/2](551600/4588595) loss:3.035 lr:0.0000100 epoch_Time:25631.0min: [2024-01-05 04:01:59,442][model8_pretrain.py][INFO] Epoch:[0/2](551600/4588595) loss:2.910 lr:0.0000100 epoch_Time:25631.0min: [2024-01-05 04:01:59,442][model8_pretrain.py][INFO] Epoch:[0/2](551600/4588595) loss:2.842 lr:0.0000100 epoch_Time:25631.0min: [2024-01-05 04:01:59,443][model8_pretrain.py][INFO] Epoch:[0/2](551600/4588595) loss:2.977 lr:0.0000100 
epoch_Time:25631.0min: [2024-01-05 04:01:59,443][model8_pretrain.py][INFO] Epoch:[0/2](551600/4588595) loss:3.328 lr:0.0000100 epoch_Time:25631.0min: [2024-01-05 04:02:36,380][model8_pretrain.py][INFO] Epoch:[0/2](551700/4588595) loss:3.396 lr:0.0000100 epoch_Time:25631.0min: [2024-01-05 04:02:36,380][model8_pretrain.py][INFO] Epoch:[0/2](551700/4588595) loss:3.040 lr:0.0000100 epoch_Time:25631.0min: [2024-01-05 04:02:36,380][model8_pretrain.py][INFO] Epoch:[0/2](551700/4588595) loss:3.150 lr:0.0000100 epoch_Time:25631.0min: [2024-01-05 04:02:36,380][model8_pretrain.py][INFO] Epoch:[0/2](551700/4588595) loss:3.250 lr:0.0000100 epoch_Time:25631.0min: [2024-01-05 04:02:36,381][model8_pretrain.py][INFO] Epoch:[0/2](551700/4588595) loss:3.405 lr:0.0000100 epoch_Time:25631.0min: [2024-01-05 04:02:36,381][model8_pretrain.py][INFO] Epoch:[0/2](551700/4588595) loss:2.146 lr:0.0000100 epoch_Time:25631.0min: [2024-01-05 04:02:36,381][model8_pretrain.py][INFO] Epoch:[0/2](551700/4588595) loss:2.950 lr:0.0000100 epoch_Time:25631.0min: [2024-01-05 04:02:36,382][model8_pretrain.py][INFO] Epoch:[0/2](551700/4588595) loss:2.582 lr:0.0000100 epoch_Time:25631.0min: [2024-01-05 04:03:13,316][model8_pretrain.py][INFO] Epoch:[0/2](551800/4588595) loss:2.160 lr:0.0000100 epoch_Time:25630.0min: [2024-01-05 04:03:13,316][model8_pretrain.py][INFO] Epoch:[0/2](551800/4588595) loss:2.602 lr:0.0000100 epoch_Time:25630.0min: [2024-01-05 04:03:13,316][model8_pretrain.py][INFO] Epoch:[0/2](551800/4588595) loss:2.302 lr:0.0000100 epoch_Time:25630.0min: [2024-01-05 04:03:13,316][model8_pretrain.py][INFO] Epoch:[0/2](551800/4588595) loss:3.155 lr:0.0000100 epoch_Time:25630.0min: [2024-01-05 04:03:13,316][model8_pretrain.py][INFO] Epoch:[0/2](551800/4588595) loss:2.832 lr:0.0000100 epoch_Time:25630.0min: [2024-01-05 04:03:13,316][model8_pretrain.py][INFO] Epoch:[0/2](551800/4588595) loss:2.685 lr:0.0000100 epoch_Time:25630.0min: [2024-01-05 04:03:13,316][model8_pretrain.py][INFO] 
Epoch:[0/2](551800/4588595) loss:3.032 lr:0.0000100 epoch_Time:25630.0min: [2024-01-05 04:03:13,317][model8_pretrain.py][INFO] Epoch:[0/2](551800/4588595) loss:2.813 lr:0.0000100 epoch_Time:25630.0min: [2024-01-05 04:03:56,998][model8_pretrain.py][INFO] Epoch:[0/2](551900/4588595) loss:3.087 lr:0.0000100 epoch_Time:25630.0min: [2024-01-05 04:03:56,998][model8_pretrain.py][INFO] Epoch:[0/2](551900/4588595) loss:2.819 lr:0.0000100 epoch_Time:25630.0min: [2024-01-05 04:03:56,999][model8_pretrain.py][INFO] Epoch:[0/2](551900/4588595) loss:3.170 lr:0.0000100 epoch_Time:25630.0min: [2024-01-05 04:03:56,999][model8_pretrain.py][INFO] Epoch:[0/2](551900/4588595) loss:2.920 lr:0.0000100 epoch_Time:25630.0min: [2024-01-05 04:03:56,999][model8_pretrain.py][INFO] Epoch:[0/2](551900/4588595) loss:2.523 lr:0.0000100 epoch_Time:25630.0min: [2024-01-05 04:03:56,999][model8_pretrain.py][INFO] Epoch:[0/2](551900/4588595) loss:2.843 lr:0.0000100 epoch_Time:25630.0min: [2024-01-05 04:03:56,999][model8_pretrain.py][INFO] Epoch:[0/2](551900/4588595) loss:3.038 lr:0.0000100 epoch_Time:25630.0min: [2024-01-05 04:03:56,999][model8_pretrain.py][INFO] Epoch:[0/2](551900/4588595) loss:2.446 lr:0.0000100 epoch_Time:25630.0min: [2024-01-05 04:04:39,101][model8_pretrain.py][INFO] Epoch:[0/2](552000/4588595) loss:2.613 lr:0.0000100 epoch_Time:25630.0min: [2024-01-05 04:04:39,101][model8_pretrain.py][INFO] Epoch:[0/2](552000/4588595) loss:2.876 lr:0.0000100 epoch_Time:25630.0min: [2024-01-05 04:04:39,101][model8_pretrain.py][INFO] Epoch:[0/2](552000/4588595) loss:2.322 lr:0.0000100 epoch_Time:25630.0min: [2024-01-05 04:04:39,101][model8_pretrain.py][INFO] Epoch:[0/2](552000/4588595) loss:2.646 lr:0.0000100 epoch_Time:25630.0min: [2024-01-05 04:04:39,101][model8_pretrain.py][INFO] Epoch:[0/2](552000/4588595) loss:3.270 lr:0.0000100 epoch_Time:25630.0min: [2024-01-05 04:04:39,101][model8_pretrain.py][INFO] Epoch:[0/2](552000/4588595) loss:2.804 lr:0.0000100 epoch_Time:25630.0min: [2024-01-05 
04:04:39,101][model8_pretrain.py][INFO] Epoch:[0/2](552000/4588595) loss:3.226 lr:0.0000100 epoch_Time:25630.0min: [2024-01-05 04:04:39,101][model8_pretrain.py][INFO] Epoch:[0/2](552000/4588595) loss:2.961 lr:0.0000100 epoch_Time:25630.0min: [2024-01-05 04:05:16,029][model8_pretrain.py][INFO] Epoch:[0/2](552100/4588595) loss:2.303 lr:0.0000100 epoch_Time:25629.0min: [2024-01-05 04:05:16,029][model8_pretrain.py][INFO] Epoch:[0/2](552100/4588595) loss:2.239 lr:0.0000100 epoch_Time:25629.0min: [2024-01-05 04:05:16,029][model8_pretrain.py][INFO] Epoch:[0/2](552100/4588595) loss:2.992 lr:0.0000100 epoch_Time:25629.0min: [2024-01-05 04:05:16,029][model8_pretrain.py][INFO] Epoch:[0/2](552100/4588595) loss:2.804 lr:0.0000100 epoch_Time:25629.0min: [2024-01-05 04:05:16,029][model8_pretrain.py][INFO] Epoch:[0/2](552100/4588595) loss:3.034 lr:0.0000100 epoch_Time:25629.0min: [2024-01-05 04:05:16,029][model8_pretrain.py][INFO] Epoch:[0/2](552100/4588595) loss:3.222 lr:0.0000100 epoch_Time:25629.0min: [2024-01-05 04:05:16,029][model8_pretrain.py][INFO] Epoch:[0/2](552100/4588595) loss:3.023 lr:0.0000100 epoch_Time:25629.0min: [2024-01-05 04:05:16,029][model8_pretrain.py][INFO] Epoch:[0/2](552100/4588595) loss:3.204 lr:0.0000100 epoch_Time:25629.0min: [2024-01-05 04:05:52,969][model8_pretrain.py][INFO] Epoch:[0/2](552200/4588595) loss:2.781 lr:0.0000100 epoch_Time:25628.0min: [2024-01-05 04:05:52,969][model8_pretrain.py][INFO] Epoch:[0/2](552200/4588595) loss:3.064 lr:0.0000100 epoch_Time:25628.0min: [2024-01-05 04:05:52,969][model8_pretrain.py][INFO] Epoch:[0/2](552200/4588595) loss:2.802 lr:0.0000100 epoch_Time:25628.0min: [2024-01-05 04:05:52,969][model8_pretrain.py][INFO] Epoch:[0/2](552200/4588595) loss:2.654 lr:0.0000100 epoch_Time:25628.0min: [2024-01-05 04:05:52,969][model8_pretrain.py][INFO] Epoch:[0/2](552200/4588595) loss:2.929 lr:0.0000100 epoch_Time:25628.0min: [2024-01-05 04:05:52,969][model8_pretrain.py][INFO] Epoch:[0/2](552200/4588595) loss:2.369 lr:0.0000100 
epoch_Time:25628.0min: [2024-01-05 04:05:52,970][model8_pretrain.py][INFO] Epoch:[0/2](552200/4588595) loss:2.938 lr:0.0000100 epoch_Time:25628.0min: [2024-01-05 04:05:52,970][model8_pretrain.py][INFO] Epoch:[0/2](552200/4588595) loss:3.064 lr:0.0000100 epoch_Time:25628.0min: [2024-01-05 04:06:29,913][model8_pretrain.py][INFO] Epoch:[0/2](552300/4588595) loss:3.244 lr:0.0000100 epoch_Time:25628.0min: [2024-01-05 04:06:29,913][model8_pretrain.py][INFO] Epoch:[0/2](552300/4588595) loss:2.464 lr:0.0000100 epoch_Time:25628.0min: [2024-01-05 04:06:29,913][model8_pretrain.py][INFO] Epoch:[0/2](552300/4588595) loss:2.719 lr:0.0000100 epoch_Time:25628.0min: [2024-01-05 04:06:29,913][model8_pretrain.py][INFO] Epoch:[0/2](552300/4588595) loss:2.880 lr:0.0000100 epoch_Time:25628.0min: [2024-01-05 04:06:29,913][model8_pretrain.py][INFO] Epoch:[0/2](552300/4588595) loss:2.730 lr:0.0000100 epoch_Time:25628.0min: [2024-01-05 04:06:29,913][model8_pretrain.py][INFO] Epoch:[0/2](552300/4588595) loss:3.006 lr:0.0000100 epoch_Time:25628.0min: [2024-01-05 04:06:29,913][model8_pretrain.py][INFO] Epoch:[0/2](552300/4588595) loss:2.940 lr:0.0000100 epoch_Time:25628.0min: [2024-01-05 04:06:29,913][model8_pretrain.py][INFO] Epoch:[0/2](552300/4588595) loss:3.274 lr:0.0000100 epoch_Time:25628.0min: [2024-01-05 04:07:06,864][model8_pretrain.py][INFO] Epoch:[0/2](552400/4588595) loss:3.237 lr:0.0000100 epoch_Time:25627.0min: [2024-01-05 04:07:06,864][model8_pretrain.py][INFO] Epoch:[0/2](552400/4588595) loss:3.322 lr:0.0000100 epoch_Time:25627.0min: [2024-01-05 04:07:06,864][model8_pretrain.py][INFO] Epoch:[0/2](552400/4588595) loss:2.534 lr:0.0000100 epoch_Time:25627.0min: [2024-01-05 04:07:06,864][model8_pretrain.py][INFO] Epoch:[0/2](552400/4588595) loss:3.098 lr:0.0000100 epoch_Time:25627.0min: [2024-01-05 04:07:06,864][model8_pretrain.py][INFO] Epoch:[0/2](552400/4588595) loss:3.310 lr:0.0000100 epoch_Time:25627.0min: [2024-01-05 04:07:06,864][model8_pretrain.py][INFO] 
Epoch:[0/2](552400/4588595) loss:2.939 lr:0.0000100 epoch_Time:25627.0min: [2024-01-05 04:07:06,864][model8_pretrain.py][INFO] Epoch:[0/2](552400/4588595) loss:2.919 lr:0.0000100 epoch_Time:25627.0min: [2024-01-05 04:07:06,864][model8_pretrain.py][INFO] Epoch:[0/2](552400/4588595) loss:2.740 lr:0.0000100 epoch_Time:25627.0min: [2024-01-05 04:07:43,801][model8_pretrain.py][INFO] Epoch:[0/2](552500/4588595) loss:3.261 lr:0.0000100 epoch_Time:25626.0min: [2024-01-05 04:07:43,801][model8_pretrain.py][INFO] Epoch:[0/2](552500/4588595) loss:3.091 lr:0.0000100 epoch_Time:25626.0min: [2024-01-05 04:07:43,801][model8_pretrain.py][INFO] Epoch:[0/2](552500/4588595) loss:3.075 lr:0.0000100 epoch_Time:25626.0min: [2024-01-05 04:07:43,801][model8_pretrain.py][INFO] Epoch:[0/2](552500/4588595) loss:3.157 lr:0.0000100 epoch_Time:25626.0min: [2024-01-05 04:07:43,801][model8_pretrain.py][INFO] Epoch:[0/2](552500/4588595) loss:2.916 lr:0.0000100 epoch_Time:25626.0min: [2024-01-05 04:07:43,801][model8_pretrain.py][INFO] Epoch:[0/2](552500/4588595) loss:3.091 lr:0.0000100 epoch_Time:25626.0min: [2024-01-05 04:07:43,801][model8_pretrain.py][INFO] Epoch:[0/2](552500/4588595) loss:2.622 lr:0.0000100 epoch_Time:25626.0min: [2024-01-05 04:07:43,801][model8_pretrain.py][INFO] Epoch:[0/2](552500/4588595) loss:3.092 lr:0.0000100 epoch_Time:25626.0min: [2024-01-05 04:08:20,741][model8_pretrain.py][INFO] Epoch:[0/2](552600/4588595) loss:2.532 lr:0.0000100 epoch_Time:25625.0min: [2024-01-05 04:08:20,741][model8_pretrain.py][INFO] Epoch:[0/2](552600/4588595) loss:2.720 lr:0.0000100 epoch_Time:25625.0min: [2024-01-05 04:08:20,741][model8_pretrain.py][INFO] Epoch:[0/2](552600/4588595) loss:2.959 lr:0.0000100 epoch_Time:25625.0min: [2024-01-05 04:08:20,741][model8_pretrain.py][INFO] Epoch:[0/2](552600/4588595) loss:2.678 lr:0.0000100 epoch_Time:25625.0min: [2024-01-05 04:08:20,741][model8_pretrain.py][INFO] Epoch:[0/2](552600/4588595) loss:2.413 lr:0.0000100 epoch_Time:25625.0min: [2024-01-05 
04:08:20,741][model8_pretrain.py][INFO] Epoch:[0/2](552600/4588595) loss:2.669 lr:0.0000100 epoch_Time:25625.0min: [2024-01-05 04:08:20,741][model8_pretrain.py][INFO] Epoch:[0/2](552600/4588595) loss:2.512 lr:0.0000100 epoch_Time:25625.0min: [2024-01-05 04:08:20,742][model8_pretrain.py][INFO] Epoch:[0/2](552600/4588595) loss:2.691 lr:0.0000100 epoch_Time:25625.0min: [2024-01-05 04:09:02,757][model8_pretrain.py][INFO] Epoch:[0/2](552700/4588595) loss:3.271 lr:0.0000100 epoch_Time:25625.0min: [2024-01-05 04:09:02,757][model8_pretrain.py][INFO] Epoch:[0/2](552700/4588595) loss:2.859 lr:0.0000100 epoch_Time:25625.0min: [2024-01-05 04:09:02,757][model8_pretrain.py][INFO] Epoch:[0/2](552700/4588595) loss:3.057 lr:0.0000100 epoch_Time:25625.0min: [2024-01-05 04:09:02,761][model8_pretrain.py][INFO] Epoch:[0/2](552700/4588595) loss:2.462 lr:0.0000100 epoch_Time:25625.0min: [2024-01-05 04:09:02,761][model8_pretrain.py][INFO] Epoch:[0/2](552700/4588595) loss:2.887 lr:0.0000100 epoch_Time:25625.0min: [2024-01-05 04:09:02,761][model8_pretrain.py][INFO] Epoch:[0/2](552700/4588595) loss:2.981 lr:0.0000100 epoch_Time:25625.0min: [2024-01-05 04:09:02,761][model8_pretrain.py][INFO] Epoch:[0/2](552700/4588595) loss:2.859 lr:0.0000100 epoch_Time:25625.0min: [2024-01-05 04:09:02,761][model8_pretrain.py][INFO] Epoch:[0/2](552700/4588595) loss:2.608 lr:0.0000100 epoch_Time:25625.0min: [2024-01-05 04:09:46,634][model8_pretrain.py][INFO] Epoch:[0/2](552800/4588595) loss:2.215 lr:0.0000100 epoch_Time:25626.0min: [2024-01-05 04:09:46,634][model8_pretrain.py][INFO] Epoch:[0/2](552800/4588595) loss:2.851 lr:0.0000100 epoch_Time:25626.0min: [2024-01-05 04:09:46,634][model8_pretrain.py][INFO] Epoch:[0/2](552800/4588595) loss:2.715 lr:0.0000100 epoch_Time:25626.0min: [2024-01-05 04:09:46,634][model8_pretrain.py][INFO] Epoch:[0/2](552800/4588595) loss:2.464 lr:0.0000100 epoch_Time:25626.0min: [2024-01-05 04:09:46,635][model8_pretrain.py][INFO] Epoch:[0/2](552800/4588595) loss:3.026 lr:0.0000100 
epoch_Time:25626.0min: [2024-01-05 04:09:46,635][model8_pretrain.py][INFO] Epoch:[0/2](552800/4588595) loss:2.876 lr:0.0000100 epoch_Time:25626.0min: [2024-01-05 04:09:46,635][model8_pretrain.py][INFO] Epoch:[0/2](552800/4588595) loss:2.953 lr:0.0000100 epoch_Time:25626.0min: [2024-01-05 04:09:46,635][model8_pretrain.py][INFO] Epoch:[0/2](552800/4588595) loss:2.517 lr:0.0000100 epoch_Time:25626.0min: [2024-01-05 04:10:23,567][model8_pretrain.py][INFO] Epoch:[0/2](552900/4588595) loss:3.218 lr:0.0000100 epoch_Time:25624.0min: [2024-01-05 04:10:23,567][model8_pretrain.py][INFO] Epoch:[0/2](552900/4588595) loss:2.653 lr:0.0000100 epoch_Time:25624.0min: [2024-01-05 04:10:23,567][model8_pretrain.py][INFO] Epoch:[0/2](552900/4588595) loss:2.929 lr:0.0000100 epoch_Time:25624.0min: [2024-01-05 04:10:23,567][model8_pretrain.py][INFO] Epoch:[0/2](552900/4588595) loss:2.675 lr:0.0000100 epoch_Time:25624.0min: [2024-01-05 04:10:23,567][model8_pretrain.py][INFO] Epoch:[0/2](552900/4588595) loss:3.271 lr:0.0000100 epoch_Time:25624.0min: [2024-01-05 04:10:23,567][model8_pretrain.py][INFO] Epoch:[0/2](552900/4588595) loss:2.517 lr:0.0000100 epoch_Time:25624.0min: [2024-01-05 04:10:23,567][model8_pretrain.py][INFO] Epoch:[0/2](552900/4588595) loss:2.889 lr:0.0000100 epoch_Time:25624.0min: [2024-01-05 04:10:23,568][model8_pretrain.py][INFO] Epoch:[0/2](552900/4588595) loss:2.726 lr:0.0000100 epoch_Time:25624.0min: [2024-01-05 04:11:00,505][model8_pretrain.py][INFO] Epoch:[0/2](553000/4588595) loss:3.100 lr:0.0000100 epoch_Time:25623.0min: [2024-01-05 04:11:00,506][model8_pretrain.py][INFO] Epoch:[0/2](553000/4588595) loss:2.745 lr:0.0000100 epoch_Time:25623.0min: [2024-01-05 04:11:00,506][model8_pretrain.py][INFO] Epoch:[0/2](553000/4588595) loss:2.570 lr:0.0000100 epoch_Time:25623.0min: [2024-01-05 04:11:00,506][model8_pretrain.py][INFO] Epoch:[0/2](553000/4588595) loss:2.953 lr:0.0000100 epoch_Time:25623.0min: [2024-01-05 04:11:00,506][model8_pretrain.py][INFO] 
Epoch:[0/2](553000/4588595) loss:2.196 lr:0.0000100 epoch_Time:25623.0min: [2024-01-05 04:11:00,506][model8_pretrain.py][INFO] Epoch:[0/2](553000/4588595) loss:2.185 lr:0.0000100 epoch_Time:25623.0min: [2024-01-05 04:11:00,506][model8_pretrain.py][INFO] Epoch:[0/2](553000/4588595) loss:3.107 lr:0.0000100 epoch_Time:25623.0min: [2024-01-05 04:11:00,506][model8_pretrain.py][INFO] Epoch:[0/2](553000/4588595) loss:2.384 lr:0.0000100 epoch_Time:25623.0min: [2024-01-05 04:11:37,452][model8_pretrain.py][INFO] Epoch:[0/2](553100/4588595) loss:3.360 lr:0.0000100 epoch_Time:25623.0min: [2024-01-05 04:11:37,452][model8_pretrain.py][INFO] Epoch:[0/2](553100/4588595) loss:2.696 lr:0.0000100 epoch_Time:25623.0min: [2024-01-05 04:11:37,452][model8_pretrain.py][INFO] Epoch:[0/2](553100/4588595) loss:2.423 lr:0.0000100 epoch_Time:25623.0min: [2024-01-05 04:11:37,452][model8_pretrain.py][INFO] Epoch:[0/2](553100/4588595) loss:2.000 lr:0.0000100 epoch_Time:25623.0min: [2024-01-05 04:11:37,452][model8_pretrain.py][INFO] Epoch:[0/2](553100/4588595) loss:2.863 lr:0.0000100 epoch_Time:25623.0min: [2024-01-05 04:11:37,452][model8_pretrain.py][INFO] Epoch:[0/2](553100/4588595) loss:2.774 lr:0.0000100 epoch_Time:25623.0min: [2024-01-05 04:11:37,452][model8_pretrain.py][INFO] Epoch:[0/2](553100/4588595) loss:2.899 lr:0.0000100 epoch_Time:25623.0min: [2024-01-05 04:11:37,452][model8_pretrain.py][INFO] Epoch:[0/2](553100/4588595) loss:3.041 lr:0.0000100 epoch_Time:25623.0min: [2024-01-05 04:12:14,391][model8_pretrain.py][INFO] Epoch:[0/2](553200/4588595) loss:2.590 lr:0.0000100 epoch_Time:25622.0min: [2024-01-05 04:12:14,391][model8_pretrain.py][INFO] Epoch:[0/2](553200/4588595) loss:2.854 lr:0.0000100 epoch_Time:25622.0min: [2024-01-05 04:12:14,391][model8_pretrain.py][INFO] Epoch:[0/2](553200/4588595) loss:2.251 lr:0.0000100 epoch_Time:25622.0min: [2024-01-05 04:12:14,391][model8_pretrain.py][INFO] Epoch:[0/2](553200/4588595) loss:3.150 lr:0.0000100 epoch_Time:25622.0min: [2024-01-05 
04:12:14,391][model8_pretrain.py][INFO] Epoch:[0/2](553200/4588595) loss:3.168 lr:0.0000100 epoch_Time:25622.0min: [2024-01-05 04:12:14,391][model8_pretrain.py][INFO] Epoch:[0/2](553200/4588595) loss:2.683 lr:0.0000100 epoch_Time:25622.0min: [2024-01-05 04:12:14,391][model8_pretrain.py][INFO] Epoch:[0/2](553200/4588595) loss:2.779 lr:0.0000100 epoch_Time:25622.0min: [2024-01-05 04:12:14,391][model8_pretrain.py][INFO] Epoch:[0/2](553200/4588595) loss:2.657 lr:0.0000100 epoch_Time:25622.0min: [2024-01-05 04:12:51,329][model8_pretrain.py][INFO] Epoch:[0/2](553300/4588595) loss:1.960 lr:0.0000100 epoch_Time:25621.0min: [2024-01-05 04:12:51,329][model8_pretrain.py][INFO] Epoch:[0/2](553300/4588595) loss:3.159 lr:0.0000100 epoch_Time:25621.0min: [2024-01-05 04:12:51,329][model8_pretrain.py][INFO] Epoch:[0/2](553300/4588595) loss:2.265 lr:0.0000100 epoch_Time:25621.0min: [2024-01-05 04:12:51,329][model8_pretrain.py][INFO] Epoch:[0/2](553300/4588595) loss:3.301 lr:0.0000100 epoch_Time:25621.0min: [2024-01-05 04:12:51,329][model8_pretrain.py][INFO] Epoch:[0/2](553300/4588595) loss:3.112 lr:0.0000100 epoch_Time:25621.0min: [2024-01-05 04:12:51,330][model8_pretrain.py][INFO] Epoch:[0/2](553300/4588595) loss:2.333 lr:0.0000100 epoch_Time:25621.0min: [2024-01-05 04:12:51,330][model8_pretrain.py][INFO] Epoch:[0/2](553300/4588595) loss:2.962 lr:0.0000100 epoch_Time:25621.0min: [2024-01-05 04:12:51,330][model8_pretrain.py][INFO] Epoch:[0/2](553300/4588595) loss:2.324 lr:0.0000100 epoch_Time:25621.0min: [2024-01-05 04:13:28,277][model8_pretrain.py][INFO] Epoch:[0/2](553400/4588595) loss:2.786 lr:0.0000100 epoch_Time:25621.0min: [2024-01-05 04:13:28,277][model8_pretrain.py][INFO] Epoch:[0/2](553400/4588595) loss:3.061 lr:0.0000100 epoch_Time:25621.0min: [2024-01-05 04:13:28,277][model8_pretrain.py][INFO] Epoch:[0/2](553400/4588595) loss:3.046 lr:0.0000100 epoch_Time:25621.0min: [2024-01-05 04:13:28,277][model8_pretrain.py][INFO] Epoch:[0/2](553400/4588595) loss:2.642 lr:0.0000100 
epoch_Time:25621.0min: [2024-01-05 04:13:28,277][model8_pretrain.py][INFO] Epoch:[0/2](553400/4588595) loss:2.964 lr:0.0000100 epoch_Time:25621.0min: [2024-01-05 04:13:28,277][model8_pretrain.py][INFO] Epoch:[0/2](553400/4588595) loss:3.003 lr:0.0000100 epoch_Time:25621.0min: [2024-01-05 04:13:28,277][model8_pretrain.py][INFO] Epoch:[0/2](553400/4588595) loss:2.879 lr:0.0000100 epoch_Time:25621.0min: [2024-01-05 04:13:28,277][model8_pretrain.py][INFO] Epoch:[0/2](553400/4588595) loss:3.147 lr:0.0000100 epoch_Time:25621.0min: [2024-01-05 04:14:06,918][model8_pretrain.py][INFO] Epoch:[0/2](553500/4588595) loss:2.551 lr:0.0000100 epoch_Time:25620.0min: [2024-01-05 04:14:06,918][model8_pretrain.py][INFO] Epoch:[0/2](553500/4588595) loss:3.309 lr:0.0000100 epoch_Time:25620.0min: [2024-01-05 04:14:06,918][model8_pretrain.py][INFO] Epoch:[0/2](553500/4588595) loss:2.927 lr:0.0000100 epoch_Time:25620.0min: [2024-01-05 04:14:06,918][model8_pretrain.py][INFO] Epoch:[0/2](553500/4588595) loss:2.784 lr:0.0000100 epoch_Time:25620.0min: [2024-01-05 04:14:06,918][model8_pretrain.py][INFO] Epoch:[0/2](553500/4588595) loss:3.020 lr:0.0000100 epoch_Time:25620.0min: [2024-01-05 04:14:06,918][model8_pretrain.py][INFO] Epoch:[0/2](553500/4588595) loss:3.135 lr:0.0000100 epoch_Time:25620.0min: [2024-01-05 04:14:06,918][model8_pretrain.py][INFO] Epoch:[0/2](553500/4588595) loss:2.511 lr:0.0000100 epoch_Time:25620.0min: [2024-01-05 04:14:06,919][model8_pretrain.py][INFO] Epoch:[0/2](553500/4588595) loss:3.360 lr:0.0000100 epoch_Time:25620.0min: [2024-01-05 04:14:54,172][model8_pretrain.py][INFO] Epoch:[0/2](553600/4588595) loss:3.439 lr:0.0000100 epoch_Time:25620.0min: [2024-01-05 04:14:54,173][model8_pretrain.py][INFO] Epoch:[0/2](553600/4588595) loss:2.999 lr:0.0000100 epoch_Time:25620.0min: [2024-01-05 04:14:54,173][model8_pretrain.py][INFO] Epoch:[0/2](553600/4588595) loss:3.074 lr:0.0000100 epoch_Time:25620.0min: [2024-01-05 04:14:54,173][model8_pretrain.py][INFO] 
Epoch:[0/2](553600/4588595) loss:2.184 lr:0.0000100 epoch_Time:25620.0min: [2024-01-05 04:14:54,173][model8_pretrain.py][INFO] Epoch:[0/2](553600/4588595) loss:2.782 lr:0.0000100 epoch_Time:25620.0min: [2024-01-05 04:14:54,173][model8_pretrain.py][INFO] Epoch:[0/2](553600/4588595) loss:2.374 lr:0.0000100 epoch_Time:25620.0min: [2024-01-05 04:14:54,173][model8_pretrain.py][INFO] Epoch:[0/2](553600/4588595) loss:2.927 lr:0.0000100 epoch_Time:25620.0min: [2024-01-05 04:14:54,173][model8_pretrain.py][INFO] Epoch:[0/2](553600/4588595) loss:2.860 lr:0.0000100 epoch_Time:25620.0min: [2024-01-05 04:15:31,104][model8_pretrain.py][INFO] Epoch:[0/2](553700/4588595) loss:2.411 lr:0.0000100 epoch_Time:25620.0min: [2024-01-05 04:15:31,104][model8_pretrain.py][INFO] Epoch:[0/2](553700/4588595) loss:2.461 lr:0.0000100 epoch_Time:25620.0min: [2024-01-05 04:15:31,105][model8_pretrain.py][INFO] Epoch:[0/2](553700/4588595) loss:2.888 lr:0.0000100 epoch_Time:25620.0min: [2024-01-05 04:15:31,105][model8_pretrain.py][INFO] Epoch:[0/2](553700/4588595) loss:3.174 lr:0.0000100 epoch_Time:25620.0min: [2024-01-05 04:15:31,105][model8_pretrain.py][INFO] Epoch:[0/2](553700/4588595) loss:2.317 lr:0.0000100 epoch_Time:25620.0min: [2024-01-05 04:15:31,105][model8_pretrain.py][INFO] Epoch:[0/2](553700/4588595) loss:2.282 lr:0.0000100 epoch_Time:25620.0min: [2024-01-05 04:15:31,105][model8_pretrain.py][INFO] Epoch:[0/2](553700/4588595) loss:2.845 lr:0.0000100 epoch_Time:25620.0min: [2024-01-05 04:15:31,105][model8_pretrain.py][INFO] Epoch:[0/2](553700/4588595) loss:2.936 lr:0.0000100 epoch_Time:25620.0min: [2024-01-05 04:16:08,041][model8_pretrain.py][INFO] Epoch:[0/2](553800/4588595) loss:2.756 lr:0.0000100 epoch_Time:25619.0min: [2024-01-05 04:16:08,041][model8_pretrain.py][INFO] Epoch:[0/2](553800/4588595) loss:2.974 lr:0.0000100 epoch_Time:25619.0min: [2024-01-05 04:16:08,041][model8_pretrain.py][INFO] Epoch:[0/2](553800/4588595) loss:2.713 lr:0.0000100 epoch_Time:25619.0min: [2024-01-05 
04:16:08,041][model8_pretrain.py][INFO] Epoch:[0/2](553800/4588595) loss:3.008 lr:0.0000100 epoch_Time:25619.0min: [2024-01-05 04:16:08,041][model8_pretrain.py][INFO] Epoch:[0/2](553800/4588595) loss:2.484 lr:0.0000100 epoch_Time:25619.0min: [2024-01-05 04:16:08,041][model8_pretrain.py][INFO] Epoch:[0/2](553800/4588595) loss:3.341 lr:0.0000100 epoch_Time:25619.0min: [2024-01-05 04:16:08,041][model8_pretrain.py][INFO] Epoch:[0/2](553800/4588595) loss:2.917 lr:0.0000100 epoch_Time:25619.0min: [2024-01-05 04:16:08,041][model8_pretrain.py][INFO] Epoch:[0/2](553800/4588595) loss:2.740 lr:0.0000100 epoch_Time:25619.0min: [2024-01-05 04:16:44,945][model8_pretrain.py][INFO] Epoch:[0/2](553900/4588595) loss:2.384 lr:0.0000100 epoch_Time:25618.0min: [2024-01-05 04:16:44,945][model8_pretrain.py][INFO] Epoch:[0/2](553900/4588595) loss:2.693 lr:0.0000100 epoch_Time:25618.0min: [2024-01-05 04:16:44,945][model8_pretrain.py][INFO] Epoch:[0/2](553900/4588595) loss:3.187 lr:0.0000100 epoch_Time:25618.0min: [2024-01-05 04:16:44,945][model8_pretrain.py][INFO] Epoch:[0/2](553900/4588595) loss:3.221 lr:0.0000100 epoch_Time:25618.0min: [2024-01-05 04:16:44,945][model8_pretrain.py][INFO] Epoch:[0/2](553900/4588595) loss:2.765 lr:0.0000100 epoch_Time:25618.0min: [2024-01-05 04:16:44,946][model8_pretrain.py][INFO] Epoch:[0/2](553900/4588595) loss:2.822 lr:0.0000100 epoch_Time:25618.0min: [2024-01-05 04:16:44,946][model8_pretrain.py][INFO] Epoch:[0/2](553900/4588595) loss:2.694 lr:0.0000100 epoch_Time:25618.0min: [2024-01-05 04:16:44,946][model8_pretrain.py][INFO] Epoch:[0/2](553900/4588595) loss:2.978 lr:0.0000100 epoch_Time:25618.0min: [2024-01-05 04:17:21,873][model8_pretrain.py][INFO] Epoch:[0/2](554000/4588595) loss:3.025 lr:0.0000100 epoch_Time:25617.0min: [2024-01-05 04:17:21,873][model8_pretrain.py][INFO] Epoch:[0/2](554000/4588595) loss:2.726 lr:0.0000100 epoch_Time:25617.0min: [2024-01-05 04:17:21,873][model8_pretrain.py][INFO] Epoch:[0/2](554000/4588595) loss:3.065 lr:0.0000100 
epoch_Time:25617.0min: [2024-01-05 04:17:21,873][model8_pretrain.py][INFO] Epoch:[0/2](554000/4588595) loss:2.717 lr:0.0000100 epoch_Time:25617.0min: [2024-01-05 04:17:21,873][model8_pretrain.py][INFO] Epoch:[0/2](554000/4588595) loss:2.843 lr:0.0000100 epoch_Time:25617.0min: [2024-01-05 04:17:21,873][model8_pretrain.py][INFO] Epoch:[0/2](554000/4588595) loss:2.073 lr:0.0000100 epoch_Time:25617.0min: [2024-01-05 04:17:21,873][model8_pretrain.py][INFO] Epoch:[0/2](554000/4588595) loss:2.872 lr:0.0000100 epoch_Time:25617.0min: [2024-01-05 04:17:21,874][model8_pretrain.py][INFO] Epoch:[0/2](554000/4588595) loss:2.141 lr:0.0000100 epoch_Time:25617.0min: [2024-01-05 04:17:58,796][model8_pretrain.py][INFO] Epoch:[0/2](554100/4588595) loss:2.622 lr:0.0000100 epoch_Time:25616.0min: [2024-01-05 04:17:58,796][model8_pretrain.py][INFO] Epoch:[0/2](554100/4588595) loss:2.839 lr:0.0000100 epoch_Time:25616.0min: [2024-01-05 04:17:58,796][model8_pretrain.py][INFO] Epoch:[0/2](554100/4588595) loss:3.298 lr:0.0000100 epoch_Time:25616.0min: [2024-01-05 04:17:58,796][model8_pretrain.py][INFO] Epoch:[0/2](554100/4588595) loss:2.651 lr:0.0000100 epoch_Time:25616.0min: [2024-01-05 04:17:58,796][model8_pretrain.py][INFO] Epoch:[0/2](554100/4588595) loss:3.055 lr:0.0000100 epoch_Time:25616.0min: [2024-01-05 04:17:58,796][model8_pretrain.py][INFO] Epoch:[0/2](554100/4588595) loss:2.819 lr:0.0000100 epoch_Time:25616.0min: [2024-01-05 04:17:58,796][model8_pretrain.py][INFO] Epoch:[0/2](554100/4588595) loss:3.051 lr:0.0000100 epoch_Time:25616.0min: [2024-01-05 04:17:58,796][model8_pretrain.py][INFO] Epoch:[0/2](554100/4588595) loss:3.054 lr:0.0000100 epoch_Time:25616.0min: [2024-01-05 04:18:35,722][model8_pretrain.py][INFO] Epoch:[0/2](554200/4588595) loss:3.374 lr:0.0000100 epoch_Time:25616.0min: [2024-01-05 04:18:35,722][model8_pretrain.py][INFO] Epoch:[0/2](554200/4588595) loss:2.612 lr:0.0000100 epoch_Time:25616.0min: [2024-01-05 04:18:35,722][model8_pretrain.py][INFO] 
Epoch:[0/2](554200/4588595) loss:2.992 lr:0.0000100 epoch_Time:25616.0min: [2024-01-05 04:18:35,722][model8_pretrain.py][INFO] Epoch:[0/2](554200/4588595) loss:3.305 lr:0.0000100 epoch_Time:25616.0min: [2024-01-05 04:18:35,722][model8_pretrain.py][INFO] Epoch:[0/2](554200/4588595) loss:2.530 lr:0.0000100 epoch_Time:25616.0min: [2024-01-05 04:18:35,722][model8_pretrain.py][INFO] Epoch:[0/2](554200/4588595) loss:2.335 lr:0.0000100 epoch_Time:25616.0min: [2024-01-05 04:18:35,722][model8_pretrain.py][INFO] Epoch:[0/2](554200/4588595) loss:2.942 lr:0.0000100 epoch_Time:25616.0min: [2024-01-05 04:18:35,722][model8_pretrain.py][INFO] Epoch:[0/2](554200/4588595) loss:3.109 lr:0.0000100 epoch_Time:25616.0min: [2024-01-05 04:19:14,441][model8_pretrain.py][INFO] Epoch:[0/2](554300/4588595) loss:3.373 lr:0.0000100 epoch_Time:25615.0min: [2024-01-05 04:19:14,441][model8_pretrain.py][INFO] Epoch:[0/2](554300/4588595) loss:2.968 lr:0.0000100 epoch_Time:25615.0min: [2024-01-05 04:19:14,441][model8_pretrain.py][INFO] Epoch:[0/2](554300/4588595) loss:3.181 lr:0.0000100 epoch_Time:25615.0min: [2024-01-05 04:19:14,441][model8_pretrain.py][INFO] Epoch:[0/2](554300/4588595) loss:2.968 lr:0.0000100 epoch_Time:25615.0min: [2024-01-05 04:19:14,441][model8_pretrain.py][INFO] Epoch:[0/2](554300/4588595) loss:2.903 lr:0.0000100 epoch_Time:25615.0min: [2024-01-05 04:19:14,441][model8_pretrain.py][INFO] Epoch:[0/2](554300/4588595) loss:2.912 lr:0.0000100 epoch_Time:25615.0min: [2024-01-05 04:19:14,441][model8_pretrain.py][INFO] Epoch:[0/2](554300/4588595) loss:3.467 lr:0.0000100 epoch_Time:25615.0min: [2024-01-05 04:19:14,441][model8_pretrain.py][INFO] Epoch:[0/2](554300/4588595) loss:3.075 lr:0.0000100 epoch_Time:25615.0min: [2024-01-05 04:20:01,679][model8_pretrain.py][INFO] Epoch:[0/2](554400/4588595) loss:2.823 lr:0.0000100 epoch_Time:25615.0min: [2024-01-05 04:20:01,679][model8_pretrain.py][INFO] Epoch:[0/2](554400/4588595) loss:2.693 lr:0.0000100 epoch_Time:25615.0min: [2024-01-05 
04:20:01,679][model8_pretrain.py][INFO] Epoch:[0/2](554400/4588595) loss:2.686 lr:0.0000100 epoch_Time:25615.0min: [2024-01-05 04:20:01,679][model8_pretrain.py][INFO] Epoch:[0/2](554400/4588595) loss:3.124 lr:0.0000100 epoch_Time:25615.0min: [2024-01-05 04:20:01,679][model8_pretrain.py][INFO] Epoch:[0/2](554400/4588595) loss:2.324 lr:0.0000100 epoch_Time:25615.0min: [2024-01-05 04:20:01,679][model8_pretrain.py][INFO] Epoch:[0/2](554400/4588595) loss:2.883 lr:0.0000100 epoch_Time:25615.0min: [2024-01-05 04:20:01,679][model8_pretrain.py][INFO] Epoch:[0/2](554400/4588595) loss:3.193 lr:0.0000100 epoch_Time:25615.0min: [2024-01-05 04:20:01,679][model8_pretrain.py][INFO] Epoch:[0/2](554400/4588595) loss:3.321 lr:0.0000100 epoch_Time:25615.0min: [2024-01-05 04:20:38,613][model8_pretrain.py][INFO] Epoch:[0/2](554500/4588595) loss:3.388 lr:0.0000100 epoch_Time:25615.0min: [2024-01-05 04:20:38,613][model8_pretrain.py][INFO] Epoch:[0/2](554500/4588595) loss:3.027 lr:0.0000100 epoch_Time:25615.0min: [2024-01-05 04:20:38,613][model8_pretrain.py][INFO] Epoch:[0/2](554500/4588595) loss:3.058 lr:0.0000100 epoch_Time:25615.0min: [2024-01-05 04:20:38,613][model8_pretrain.py][INFO] Epoch:[0/2](554500/4588595) loss:2.654 lr:0.0000100 epoch_Time:25615.0min: [2024-01-05 04:20:38,613][model8_pretrain.py][INFO] Epoch:[0/2](554500/4588595) loss:2.790 lr:0.0000100 epoch_Time:25615.0min: [2024-01-05 04:20:38,613][model8_pretrain.py][INFO] Epoch:[0/2](554500/4588595) loss:3.204 lr:0.0000100 epoch_Time:25615.0min: [2024-01-05 04:20:38,613][model8_pretrain.py][INFO] Epoch:[0/2](554500/4588595) loss:2.686 lr:0.0000100 epoch_Time:25615.0min: [2024-01-05 04:20:38,613][model8_pretrain.py][INFO] Epoch:[0/2](554500/4588595) loss:2.878 lr:0.0000100 epoch_Time:25615.0min: [2024-01-05 04:21:15,552][model8_pretrain.py][INFO] Epoch:[0/2](554600/4588595) loss:3.259 lr:0.0000100 epoch_Time:25614.0min: [2024-01-05 04:21:15,552][model8_pretrain.py][INFO] Epoch:[0/2](554600/4588595) loss:3.026 lr:0.0000100 
epoch_Time:25614.0min: [2024-01-05 04:21:15,552][model8_pretrain.py][INFO] Epoch:[0/2](554600/4588595) loss:2.978 lr:0.0000100 epoch_Time:25614.0min: [2024-01-05 04:21:15,552][model8_pretrain.py][INFO] Epoch:[0/2](554600/4588595) loss:2.816 lr:0.0000100 epoch_Time:25614.0min: [2024-01-05 04:21:15,552][model8_pretrain.py][INFO] Epoch:[0/2](554600/4588595) loss:3.194 lr:0.0000100 epoch_Time:25614.0min: [2024-01-05 04:21:15,552][model8_pretrain.py][INFO] Epoch:[0/2](554600/4588595) loss:2.766 lr:0.0000100 epoch_Time:25614.0min: [2024-01-05 04:21:15,552][model8_pretrain.py][INFO] Epoch:[0/2](554600/4588595) loss:2.688 lr:0.0000100 epoch_Time:25614.0min: [2024-01-05 04:21:15,553][model8_pretrain.py][INFO] Epoch:[0/2](554600/4588595) loss:1.778 lr:0.0000100 epoch_Time:25614.0min: [2024-01-05 04:21:52,487][model8_pretrain.py][INFO] Epoch:[0/2](554700/4588595) loss:2.817 lr:0.0000100 epoch_Time:25613.0min: [2024-01-05 04:21:52,487][model8_pretrain.py][INFO] Epoch:[0/2](554700/4588595) loss:1.835 lr:0.0000100 epoch_Time:25613.0min: [2024-01-05 04:21:52,487][model8_pretrain.py][INFO] Epoch:[0/2](554700/4588595) loss:2.189 lr:0.0000100 epoch_Time:25613.0min: [2024-01-05 04:21:52,487][model8_pretrain.py][INFO] Epoch:[0/2](554700/4588595) loss:2.945 lr:0.0000100 epoch_Time:25613.0min: [2024-01-05 04:21:52,487][model8_pretrain.py][INFO] Epoch:[0/2](554700/4588595) loss:2.627 lr:0.0000100 epoch_Time:25613.0min: [2024-01-05 04:21:52,487][model8_pretrain.py][INFO] Epoch:[0/2](554700/4588595) loss:3.140 lr:0.0000100 epoch_Time:25613.0min: [2024-01-05 04:21:52,487][model8_pretrain.py][INFO] Epoch:[0/2](554700/4588595) loss:2.283 lr:0.0000100 epoch_Time:25613.0min: [2024-01-05 04:21:52,487][model8_pretrain.py][INFO] Epoch:[0/2](554700/4588595) loss:2.923 lr:0.0000100 epoch_Time:25613.0min: [2024-01-05 04:22:29,424][model8_pretrain.py][INFO] Epoch:[0/2](554800/4588595) loss:3.257 lr:0.0000100 epoch_Time:25613.0min: [2024-01-05 04:22:29,424][model8_pretrain.py][INFO] 
Epoch:[0/2](554800/4588595) loss:2.725 lr:0.0000100 epoch_Time:25613.0min: [2024-01-05 04:22:29,424][model8_pretrain.py][INFO] Epoch:[0/2](554800/4588595) loss:3.102 lr:0.0000100 epoch_Time:25613.0min: [2024-01-05 04:22:29,424][model8_pretrain.py][INFO] Epoch:[0/2](554800/4588595) loss:2.478 lr:0.0000100 epoch_Time:25613.0min: [2024-01-05 04:22:29,424][model8_pretrain.py][INFO] Epoch:[0/2](554800/4588595) loss:2.677 lr:0.0000100 epoch_Time:25613.0min: [2024-01-05 04:22:29,424][model8_pretrain.py][INFO] Epoch:[0/2](554800/4588595) loss:3.152 lr:0.0000100 epoch_Time:25613.0min: [2024-01-05 04:22:29,424][model8_pretrain.py][INFO] Epoch:[0/2](554800/4588595) loss:2.867 lr:0.0000100 epoch_Time:25613.0min: [2024-01-05 04:22:29,424][model8_pretrain.py][INFO] Epoch:[0/2](554800/4588595) loss:3.079 lr:0.0000100 epoch_Time:25613.0min: [2024-01-05 04:23:06,362][model8_pretrain.py][INFO] Epoch:[0/2](554900/4588595) loss:2.570 lr:0.0000100 epoch_Time:25612.0min: [2024-01-05 04:23:06,362][model8_pretrain.py][INFO] Epoch:[0/2](554900/4588595) loss:2.690 lr:0.0000100 epoch_Time:25612.0min: [2024-01-05 04:23:06,362][model8_pretrain.py][INFO] Epoch:[0/2](554900/4588595) loss:2.108 lr:0.0000100 epoch_Time:25612.0min: [2024-01-05 04:23:06,362][model8_pretrain.py][INFO] Epoch:[0/2](554900/4588595) loss:3.368 lr:0.0000100 epoch_Time:25612.0min: [2024-01-05 04:23:06,362][model8_pretrain.py][INFO] Epoch:[0/2](554900/4588595) loss:2.656 lr:0.0000100 epoch_Time:25612.0min: [2024-01-05 04:23:06,362][model8_pretrain.py][INFO] Epoch:[0/2](554900/4588595) loss:2.938 lr:0.0000100 epoch_Time:25612.0min: [2024-01-05 04:23:06,362][model8_pretrain.py][INFO] Epoch:[0/2](554900/4588595) loss:2.610 lr:0.0000100 epoch_Time:25612.0min: [2024-01-05 04:23:06,362][model8_pretrain.py][INFO] Epoch:[0/2](554900/4588595) loss:2.447 lr:0.0000100 epoch_Time:25612.0min: [2024-01-05 04:23:43,290][model8_pretrain.py][INFO] Epoch:[0/2](555000/4588595) loss:3.164 lr:0.0000100 epoch_Time:25611.0min: [2024-01-05 
04:23:43,290][model8_pretrain.py][INFO] Epoch:[0/2](555000/4588595) loss:2.598 lr:0.0000100 epoch_Time:25611.0min: [2024-01-05 04:23:43,290][model8_pretrain.py][INFO] Epoch:[0/2](555000/4588595) loss:3.007 lr:0.0000100 epoch_Time:25611.0min: [2024-01-05 04:23:43,290][model8_pretrain.py][INFO] Epoch:[0/2](555000/4588595) loss:2.806 lr:0.0000100 epoch_Time:25611.0min: [2024-01-05 04:23:43,290][model8_pretrain.py][INFO] Epoch:[0/2](555000/4588595) loss:2.701 lr:0.0000100 epoch_Time:25611.0min: [2024-01-05 04:23:43,290][model8_pretrain.py][INFO] Epoch:[0/2](555000/4588595) loss:2.620 lr:0.0000100 epoch_Time:25611.0min: [2024-01-05 04:23:43,290][model8_pretrain.py][INFO] Epoch:[0/2](555000/4588595) loss:3.127 lr:0.0000100 epoch_Time:25611.0min: [2024-01-05 04:23:43,290][model8_pretrain.py][INFO] Epoch:[0/2](555000/4588595) loss:3.056 lr:0.0000100 epoch_Time:25611.0min: [2024-01-05 04:24:21,997][model8_pretrain.py][INFO] Epoch:[0/2](555100/4588595) loss:2.971 lr:0.0000100 epoch_Time:25610.0min: [2024-01-05 04:24:21,997][model8_pretrain.py][INFO] Epoch:[0/2](555100/4588595) loss:2.770 lr:0.0000100 epoch_Time:25610.0min: [2024-01-05 04:24:21,997][model8_pretrain.py][INFO] Epoch:[0/2](555100/4588595) loss:2.730 lr:0.0000100 epoch_Time:25610.0min: [2024-01-05 04:24:21,997][model8_pretrain.py][INFO] Epoch:[0/2](555100/4588595) loss:2.635 lr:0.0000100 epoch_Time:25610.0min: [2024-01-05 04:24:21,997][model8_pretrain.py][INFO] Epoch:[0/2](555100/4588595) loss:2.691 lr:0.0000100 epoch_Time:25610.0min: [2024-01-05 04:24:21,997][model8_pretrain.py][INFO] Epoch:[0/2](555100/4588595) loss:3.216 lr:0.0000100 epoch_Time:25610.0min: [2024-01-05 04:24:21,998][model8_pretrain.py][INFO] Epoch:[0/2](555100/4588595) loss:2.521 lr:0.0000100 epoch_Time:25610.0min: [2024-01-05 04:24:21,998][model8_pretrain.py][INFO] Epoch:[0/2](555100/4588595) loss:2.834 lr:0.0000100 epoch_Time:25610.0min: [2024-01-05 04:25:09,056][model8_pretrain.py][INFO] Epoch:[0/2](555200/4588595) loss:2.713 lr:0.0000100 
epoch_Time:25611.0min: [2024-01-05 04:25:09,056][model8_pretrain.py][INFO] Epoch:[0/2](555200/4588595) loss:2.877 lr:0.0000100 epoch_Time:25611.0min: [2024-01-05 04:25:09,056][model8_pretrain.py][INFO] Epoch:[0/2](555200/4588595) loss:2.979 lr:0.0000100 epoch_Time:25611.0min: [2024-01-05 04:25:09,056][model8_pretrain.py][INFO] Epoch:[0/2](555200/4588595) loss:3.164 lr:0.0000100 epoch_Time:25611.0min: [2024-01-05 04:25:09,056][model8_pretrain.py][INFO] Epoch:[0/2](555200/4588595) loss:2.564 lr:0.0000100 epoch_Time:25611.0min: [2024-01-05 04:25:09,056][model8_pretrain.py][INFO] Epoch:[0/2](555200/4588595) loss:3.204 lr:0.0000100 epoch_Time:25611.0min: [2024-01-05 04:25:09,056][model8_pretrain.py][INFO] Epoch:[0/2](555200/4588595) loss:2.835 lr:0.0000100 epoch_Time:25611.0min: [2024-01-05 04:25:09,057][model8_pretrain.py][INFO] Epoch:[0/2](555200/4588595) loss:2.942 lr:0.0000100 epoch_Time:25611.0min: [2024-01-05 04:25:45,996][model8_pretrain.py][INFO] Epoch:[0/2](555300/4588595) loss:1.839 lr:0.0000100 epoch_Time:25611.0min: [2024-01-05 04:25:45,996][model8_pretrain.py][INFO] Epoch:[0/2](555300/4588595) loss:2.129 lr:0.0000100 epoch_Time:25611.0min: [2024-01-05 04:25:45,996][model8_pretrain.py][INFO] Epoch:[0/2](555300/4588595) loss:2.979 lr:0.0000100 epoch_Time:25611.0min: [2024-01-05 04:25:45,997][model8_pretrain.py][INFO] Epoch:[0/2](555300/4588595) loss:2.900 lr:0.0000100 epoch_Time:25611.0min: [2024-01-05 04:25:45,997][model8_pretrain.py][INFO] Epoch:[0/2](555300/4588595) loss:1.981 lr:0.0000100 epoch_Time:25611.0min: [2024-01-05 04:25:45,997][model8_pretrain.py][INFO] Epoch:[0/2](555300/4588595) loss:2.872 lr:0.0000100 epoch_Time:25611.0min: [2024-01-05 04:25:45,997][model8_pretrain.py][INFO] Epoch:[0/2](555300/4588595) loss:2.341 lr:0.0000100 epoch_Time:25611.0min: [2024-01-05 04:25:45,997][model8_pretrain.py][INFO] Epoch:[0/2](555300/4588595) loss:3.235 lr:0.0000100 epoch_Time:25611.0min: [2024-01-05 04:26:22,951][model8_pretrain.py][INFO] 
Epoch:[0/2](555400/4588595) loss:2.747 lr:0.0000100 epoch_Time:25609.0min: [2024-01-05 04:26:22,951][model8_pretrain.py][INFO] Epoch:[0/2](555400/4588595) loss:2.969 lr:0.0000100 epoch_Time:25609.0min: [2024-01-05 04:26:22,951][model8_pretrain.py][INFO] Epoch:[0/2](555400/4588595) loss:3.345 lr:0.0000100 epoch_Time:25609.0min: [2024-01-05 04:26:22,951][model8_pretrain.py][INFO] Epoch:[0/2](555400/4588595) loss:2.926 lr:0.0000100 epoch_Time:25609.0min: [2024-01-05 04:26:22,951][model8_pretrain.py][INFO] Epoch:[0/2](555400/4588595) loss:3.231 lr:0.0000100 epoch_Time:25609.0min: [2024-01-05 04:26:22,951][model8_pretrain.py][INFO] Epoch:[0/2](555400/4588595) loss:2.971 lr:0.0000100 epoch_Time:25609.0min: [2024-01-05 04:26:22,951][model8_pretrain.py][INFO] Epoch:[0/2](555400/4588595) loss:2.785 lr:0.0000100 epoch_Time:25609.0min: [2024-01-05 04:26:22,951][model8_pretrain.py][INFO] Epoch:[0/2](555400/4588595) loss:2.755 lr:0.0000100 epoch_Time:25609.0min: [2024-01-05 04:26:59,908][model8_pretrain.py][INFO] Epoch:[0/2](555500/4588595) loss:2.517 lr:0.0000100 epoch_Time:25608.0min: [2024-01-05 04:26:59,908][model8_pretrain.py][INFO] Epoch:[0/2](555500/4588595) loss:3.093 lr:0.0000100 epoch_Time:25608.0min: [2024-01-05 04:26:59,908][model8_pretrain.py][INFO] Epoch:[0/2](555500/4588595) loss:2.894 lr:0.0000100 epoch_Time:25608.0min: [2024-01-05 04:26:59,908][model8_pretrain.py][INFO] Epoch:[0/2](555500/4588595) loss:2.531 lr:0.0000100 epoch_Time:25608.0min: [2024-01-05 04:26:59,908][model8_pretrain.py][INFO] Epoch:[0/2](555500/4588595) loss:3.055 lr:0.0000100 epoch_Time:25608.0min: [2024-01-05 04:26:59,909][model8_pretrain.py][INFO] Epoch:[0/2](555500/4588595) loss:2.923 lr:0.0000100 epoch_Time:25608.0min: [2024-01-05 04:26:59,909][model8_pretrain.py][INFO] Epoch:[0/2](555500/4588595) loss:3.152 lr:0.0000100 epoch_Time:25608.0min: [2024-01-05 04:26:59,909][model8_pretrain.py][INFO] Epoch:[0/2](555500/4588595) loss:2.525 lr:0.0000100 epoch_Time:25608.0min: [2024-01-05 
04:27:36,855][model8_pretrain.py][INFO] Epoch:[0/2](555600/4588595) loss:3.122 lr:0.0000100 epoch_Time:25608.0min: [2024-01-05 04:27:36,855][model8_pretrain.py][INFO] Epoch:[0/2](555600/4588595) loss:2.807 lr:0.0000100 epoch_Time:25608.0min: [2024-01-05 04:27:36,855][model8_pretrain.py][INFO] Epoch:[0/2](555600/4588595) loss:2.569 lr:0.0000100 epoch_Time:25608.0min: [2024-01-05 04:27:36,855][model8_pretrain.py][INFO] Epoch:[0/2](555600/4588595) loss:2.347 lr:0.0000100 epoch_Time:25608.0min: [2024-01-05 04:27:36,856][model8_pretrain.py][INFO] Epoch:[0/2](555600/4588595) loss:2.627 lr:0.0000100 epoch_Time:25608.0min: [2024-01-05 04:27:36,856][model8_pretrain.py][INFO] Epoch:[0/2](555600/4588595) loss:2.881 lr:0.0000100 epoch_Time:25608.0min: [2024-01-05 04:27:36,856][model8_pretrain.py][INFO] Epoch:[0/2](555600/4588595) loss:2.404 lr:0.0000100 epoch_Time:25608.0min: [2024-01-05 04:27:36,856][model8_pretrain.py][INFO] Epoch:[0/2](555600/4588595) loss:3.288 lr:0.0000100 epoch_Time:25608.0min: [2024-01-05 04:28:13,797][model8_pretrain.py][INFO] Epoch:[0/2](555700/4588595) loss:2.714 lr:0.0000100 epoch_Time:25607.0min: [2024-01-05 04:28:13,797][model8_pretrain.py][INFO] Epoch:[0/2](555700/4588595) loss:2.636 lr:0.0000100 epoch_Time:25607.0min: [2024-01-05 04:28:13,797][model8_pretrain.py][INFO] Epoch:[0/2](555700/4588595) loss:3.139 lr:0.0000100 epoch_Time:25607.0min: [2024-01-05 04:28:13,797][model8_pretrain.py][INFO] Epoch:[0/2](555700/4588595) loss:2.930 lr:0.0000100 epoch_Time:25607.0min: [2024-01-05 04:28:13,797][model8_pretrain.py][INFO] Epoch:[0/2](555700/4588595) loss:2.798 lr:0.0000100 epoch_Time:25607.0min: [2024-01-05 04:28:13,797][model8_pretrain.py][INFO] Epoch:[0/2](555700/4588595) loss:3.052 lr:0.0000100 epoch_Time:25607.0min: [2024-01-05 04:28:13,797][model8_pretrain.py][INFO] Epoch:[0/2](555700/4588595) loss:2.744 lr:0.0000100 epoch_Time:25607.0min: [2024-01-05 04:28:13,797][model8_pretrain.py][INFO] Epoch:[0/2](555700/4588595) loss:2.628 lr:0.0000100 
epoch_Time:25607.0min: [2024-01-05 04:28:50,748][model8_pretrain.py][INFO] Epoch:[0/2](555800/4588595) loss:2.869 lr:0.0000100 epoch_Time:25606.0min: [2024-01-05 04:28:50,748][model8_pretrain.py][INFO] Epoch:[0/2](555800/4588595) loss:2.875 lr:0.0000100 epoch_Time:25606.0min: [2024-01-05 04:28:50,748][model8_pretrain.py][INFO] Epoch:[0/2](555800/4588595) loss:2.800 lr:0.0000100 epoch_Time:25606.0min: [2024-01-05 04:28:50,748][model8_pretrain.py][INFO] Epoch:[0/2](555800/4588595) loss:2.888 lr:0.0000100 epoch_Time:25606.0min: [2024-01-05 04:28:50,748][model8_pretrain.py][INFO] Epoch:[0/2](555800/4588595) loss:2.506 lr:0.0000100 epoch_Time:25606.0min: [2024-01-05 04:28:50,748][model8_pretrain.py][INFO] Epoch:[0/2](555800/4588595) loss:2.750 lr:0.0000100 epoch_Time:25606.0min: [2024-01-05 04:28:50,748][model8_pretrain.py][INFO] Epoch:[0/2](555800/4588595) loss:2.863 lr:0.0000100 epoch_Time:25606.0min: [2024-01-05 04:28:50,748][model8_pretrain.py][INFO] Epoch:[0/2](555800/4588595) loss:3.100 lr:0.0000100 epoch_Time:25606.0min: [2024-01-05 04:29:27,706][model8_pretrain.py][INFO] Epoch:[0/2](555900/4588595) loss:2.423 lr:0.0000100 epoch_Time:25606.0min: [2024-01-05 04:29:27,706][model8_pretrain.py][INFO] Epoch:[0/2](555900/4588595) loss:2.661 lr:0.0000100 epoch_Time:25606.0min: [2024-01-05 04:29:27,706][model8_pretrain.py][INFO] Epoch:[0/2](555900/4588595) loss:3.153 lr:0.0000100 epoch_Time:25606.0min: [2024-01-05 04:29:27,706][model8_pretrain.py][INFO] Epoch:[0/2](555900/4588595) loss:2.606 lr:0.0000100 epoch_Time:25606.0min: [2024-01-05 04:29:27,706][model8_pretrain.py][INFO] Epoch:[0/2](555900/4588595) loss:3.175 lr:0.0000100 epoch_Time:25606.0min: [2024-01-05 04:29:27,706][model8_pretrain.py][INFO] Epoch:[0/2](555900/4588595) loss:2.853 lr:0.0000100 epoch_Time:25606.0min: [2024-01-05 04:29:27,706][model8_pretrain.py][INFO] Epoch:[0/2](555900/4588595) loss:2.989 lr:0.0000100 epoch_Time:25606.0min: [2024-01-05 04:29:27,706][model8_pretrain.py][INFO] 
Epoch:[0/2](555900/4588595) loss:2.916 lr:0.0000100 epoch_Time:25606.0min: [2024-01-05 04:30:16,461][model8_pretrain.py][INFO] Epoch:[0/2](556000/4588595) loss:2.771 lr:0.0000100 epoch_Time:25606.0min: [2024-01-05 04:30:16,461][model8_pretrain.py][INFO] Epoch:[0/2](556000/4588595) loss:2.716 lr:0.0000100 epoch_Time:25606.0min: [2024-01-05 04:30:16,461][model8_pretrain.py][INFO] Epoch:[0/2](556000/4588595) loss:2.893 lr:0.0000100 epoch_Time:25606.0min: [2024-01-05 04:30:16,461][model8_pretrain.py][INFO] Epoch:[0/2](556000/4588595) loss:2.739 lr:0.0000100 epoch_Time:25606.0min: [2024-01-05 04:30:16,461][model8_pretrain.py][INFO] Epoch:[0/2](556000/4588595) loss:2.284 lr:0.0000100 epoch_Time:25606.0min: [2024-01-05 04:30:16,462][model8_pretrain.py][INFO] Epoch:[0/2](556000/4588595) loss:2.431 lr:0.0000100 epoch_Time:25606.0min: [2024-01-05 04:30:16,462][model8_pretrain.py][INFO] Epoch:[0/2](556000/4588595) loss:3.286 lr:0.0000100 epoch_Time:25606.0min: [2024-01-05 04:30:16,462][model8_pretrain.py][INFO] Epoch:[0/2](556000/4588595) loss:2.265 lr:0.0000100 epoch_Time:25606.0min: [2024-01-05 04:30:53,419][model8_pretrain.py][INFO] Epoch:[0/2](556100/4588595) loss:2.383 lr:0.0000100 epoch_Time:25605.0min: [2024-01-05 04:30:53,420][model8_pretrain.py][INFO] Epoch:[0/2](556100/4588595) loss:2.722 lr:0.0000100 epoch_Time:25605.0min: [2024-01-05 04:30:53,420][model8_pretrain.py][INFO] Epoch:[0/2](556100/4588595) loss:2.446 lr:0.0000100 epoch_Time:25605.0min: [2024-01-05 04:30:53,420][model8_pretrain.py][INFO] Epoch:[0/2](556100/4588595) loss:3.023 lr:0.0000100 epoch_Time:25605.0min: [2024-01-05 04:30:53,420][model8_pretrain.py][INFO] Epoch:[0/2](556100/4588595) loss:2.552 lr:0.0000100 epoch_Time:25605.0min: [2024-01-05 04:30:53,420][model8_pretrain.py][INFO] Epoch:[0/2](556100/4588595) loss:3.039 lr:0.0000100 epoch_Time:25605.0min: [2024-01-05 04:30:53,420][model8_pretrain.py][INFO] Epoch:[0/2](556100/4588595) loss:2.788 lr:0.0000100 epoch_Time:25605.0min: [2024-01-05 
04:30:53,420][model8_pretrain.py][INFO] Epoch:[0/2](556100/4588595) loss:2.822 lr:0.0000100 epoch_Time:25605.0min: [2024-01-05 04:31:30,385][model8_pretrain.py][INFO] Epoch:[0/2](556200/4588595) loss:2.363 lr:0.0000100 epoch_Time:25605.0min: [2024-01-05 04:31:30,385][model8_pretrain.py][INFO] Epoch:[0/2](556200/4588595) loss:2.559 lr:0.0000100 epoch_Time:25605.0min: [2024-01-05 04:31:30,385][model8_pretrain.py][INFO] Epoch:[0/2](556200/4588595) loss:2.716 lr:0.0000100 epoch_Time:25605.0min: [2024-01-05 04:31:30,385][model8_pretrain.py][INFO] Epoch:[0/2](556200/4588595) loss:3.022 lr:0.0000100 epoch_Time:25605.0min: [2024-01-05 04:31:30,385][model8_pretrain.py][INFO] Epoch:[0/2](556200/4588595) loss:2.963 lr:0.0000100 epoch_Time:25605.0min: [2024-01-05 04:31:30,385][model8_pretrain.py][INFO] Epoch:[0/2](556200/4588595) loss:2.992 lr:0.0000100 epoch_Time:25605.0min: [2024-01-05 04:31:30,385][model8_pretrain.py][INFO] Epoch:[0/2](556200/4588595) loss:2.830 lr:0.0000100 epoch_Time:25605.0min: [2024-01-05 04:31:30,385][model8_pretrain.py][INFO] Epoch:[0/2](556200/4588595) loss:2.979 lr:0.0000100 epoch_Time:25605.0min: [2024-01-05 04:32:07,347][model8_pretrain.py][INFO] Epoch:[0/2](556300/4588595) loss:2.681 lr:0.0000100 epoch_Time:25604.0min: [2024-01-05 04:32:07,348][model8_pretrain.py][INFO] Epoch:[0/2](556300/4588595) loss:2.675 lr:0.0000100 epoch_Time:25604.0min: [2024-01-05 04:32:07,348][model8_pretrain.py][INFO] Epoch:[0/2](556300/4588595) loss:2.496 lr:0.0000100 epoch_Time:25604.0min: [2024-01-05 04:32:07,348][model8_pretrain.py][INFO] Epoch:[0/2](556300/4588595) loss:3.083 lr:0.0000100 epoch_Time:25604.0min: [2024-01-05 04:32:07,347][model8_pretrain.py][INFO] Epoch:[0/2](556300/4588595) loss:2.958 lr:0.0000100 epoch_Time:25604.0min: [2024-01-05 04:32:07,348][model8_pretrain.py][INFO] Epoch:[0/2](556300/4588595) loss:3.052 lr:0.0000100 epoch_Time:25604.0min: [2024-01-05 04:32:07,348][model8_pretrain.py][INFO] Epoch:[0/2](556300/4588595) loss:2.623 lr:0.0000100 
epoch_Time:25604.0min: [2024-01-05 04:32:07,348][model8_pretrain.py][INFO] Epoch:[0/2](556300/4588595) loss:3.325 lr:0.0000100 epoch_Time:25604.0min: [2024-01-05 04:32:44,301][model8_pretrain.py][INFO] Epoch:[0/2](556400/4588595) loss:2.972 lr:0.0000100 epoch_Time:25603.0min: [2024-01-05 04:32:44,301][model8_pretrain.py][INFO] Epoch:[0/2](556400/4588595) loss:3.120 lr:0.0000100 epoch_Time:25603.0min: [2024-01-05 04:32:44,301][model8_pretrain.py][INFO] Epoch:[0/2](556400/4588595) loss:3.029 lr:0.0000100 epoch_Time:25603.0min: [2024-01-05 04:32:44,301][model8_pretrain.py][INFO] Epoch:[0/2](556400/4588595) loss:2.947 lr:0.0000100 epoch_Time:25603.0min: [2024-01-05 04:32:44,301][model8_pretrain.py][INFO] Epoch:[0/2](556400/4588595) loss:2.921 lr:0.0000100 epoch_Time:25603.0min: [2024-01-05 04:32:44,301][model8_pretrain.py][INFO] Epoch:[0/2](556400/4588595) loss:2.506 lr:0.0000100 epoch_Time:25603.0min: [2024-01-05 04:32:44,302][model8_pretrain.py][INFO] Epoch:[0/2](556400/4588595) loss:2.646 lr:0.0000100 epoch_Time:25603.0min: [2024-01-05 04:32:44,302][model8_pretrain.py][INFO] Epoch:[0/2](556400/4588595) loss:2.861 lr:0.0000100 epoch_Time:25603.0min: [2024-01-05 04:33:21,251][model8_pretrain.py][INFO] Epoch:[0/2](556500/4588595) loss:2.875 lr:0.0000100 epoch_Time:25602.0min: [2024-01-05 04:33:21,251][model8_pretrain.py][INFO] Epoch:[0/2](556500/4588595) loss:2.804 lr:0.0000100 epoch_Time:25602.0min: [2024-01-05 04:33:21,251][model8_pretrain.py][INFO] Epoch:[0/2](556500/4588595) loss:2.555 lr:0.0000100 epoch_Time:25602.0min: [2024-01-05 04:33:21,251][model8_pretrain.py][INFO] Epoch:[0/2](556500/4588595) loss:2.674 lr:0.0000100 epoch_Time:25602.0min: [2024-01-05 04:33:21,251][model8_pretrain.py][INFO] Epoch:[0/2](556500/4588595) loss:2.738 lr:0.0000100 epoch_Time:25602.0min: [2024-01-05 04:33:21,251][model8_pretrain.py][INFO] Epoch:[0/2](556500/4588595) loss:2.456 lr:0.0000100 epoch_Time:25602.0min: [2024-01-05 04:33:21,251][model8_pretrain.py][INFO] 
Epoch:[0/2](556500/4588595) loss:2.251 lr:0.0000100 epoch_Time:25602.0min: [2024-01-05 04:33:21,251][model8_pretrain.py][INFO] Epoch:[0/2](556500/4588595) loss:2.647 lr:0.0000100 epoch_Time:25602.0min: [2024-01-05 04:33:58,199][model8_pretrain.py][INFO] Epoch:[0/2](556600/4588595) loss:3.139 lr:0.0000100 epoch_Time:25601.0min: [2024-01-05 04:33:58,200][model8_pretrain.py][INFO] Epoch:[0/2](556600/4588595) loss:3.121 lr:0.0000100 epoch_Time:25601.0min: [2024-01-05 04:33:58,200][model8_pretrain.py][INFO] Epoch:[0/2](556600/4588595) loss:2.800 lr:0.0000100 epoch_Time:25601.0min: [2024-01-05 04:33:58,200][model8_pretrain.py][INFO] Epoch:[0/2](556600/4588595) loss:2.841 lr:0.0000100 epoch_Time:25601.0min: [2024-01-05 04:33:58,200][model8_pretrain.py][INFO] Epoch:[0/2](556600/4588595) loss:2.836 lr:0.0000100 epoch_Time:25601.0min: [2024-01-05 04:33:58,200][model8_pretrain.py][INFO] Epoch:[0/2](556600/4588595) loss:3.106 lr:0.0000100 epoch_Time:25601.0min: [2024-01-05 04:33:58,200][model8_pretrain.py][INFO] Epoch:[0/2](556600/4588595) loss:2.645 lr:0.0000100 epoch_Time:25601.0min: [2024-01-05 04:33:58,200][model8_pretrain.py][INFO] Epoch:[0/2](556600/4588595) loss:3.181 lr:0.0000100 epoch_Time:25601.0min: [2024-01-05 04:34:35,143][model8_pretrain.py][INFO] Epoch:[0/2](556700/4588595) loss:2.937 lr:0.0000100 epoch_Time:25601.0min: [2024-01-05 04:34:35,143][model8_pretrain.py][INFO] Epoch:[0/2](556700/4588595) loss:2.596 lr:0.0000100 epoch_Time:25601.0min: [2024-01-05 04:34:35,143][model8_pretrain.py][INFO] Epoch:[0/2](556700/4588595) loss:3.176 lr:0.0000100 epoch_Time:25601.0min: [2024-01-05 04:34:35,143][model8_pretrain.py][INFO] Epoch:[0/2](556700/4588595) loss:3.359 lr:0.0000100 epoch_Time:25601.0min: [2024-01-05 04:34:35,143][model8_pretrain.py][INFO] Epoch:[0/2](556700/4588595) loss:2.757 lr:0.0000100 epoch_Time:25601.0min: [2024-01-05 04:34:35,143][model8_pretrain.py][INFO] Epoch:[0/2](556700/4588595) loss:2.669 lr:0.0000100 epoch_Time:25601.0min: [2024-01-05 
04:34:35,143][model8_pretrain.py][INFO] Epoch:[0/2](556700/4588595) loss:3.268 lr:0.0000100 epoch_Time:25601.0min: [2024-01-05 04:34:35,143][model8_pretrain.py][INFO] Epoch:[0/2](556700/4588595) loss:3.354 lr:0.0000100 epoch_Time:25601.0min: [2024-01-05 04:35:24,124][model8_pretrain.py][INFO] Epoch:[0/2](556800/4588595) loss:2.607 lr:0.0000100 epoch_Time:25601.0min: [2024-01-05 04:35:24,124][model8_pretrain.py][INFO] Epoch:[0/2](556800/4588595) loss:2.632 lr:0.0000100 epoch_Time:25601.0min: [2024-01-05 04:35:24,124][model8_pretrain.py][INFO] Epoch:[0/2](556800/4588595) loss:3.330 lr:0.0000100 epoch_Time:25601.0min: [2024-01-05 04:35:24,124][model8_pretrain.py][INFO] Epoch:[0/2](556800/4588595) loss:3.311 lr:0.0000100 epoch_Time:25601.0min: [2024-01-05 04:35:24,124][model8_pretrain.py][INFO] Epoch:[0/2](556800/4588595) loss:2.868 lr:0.0000100 epoch_Time:25601.0min: [2024-01-05 04:35:24,124][model8_pretrain.py][INFO] Epoch:[0/2](556800/4588595) loss:2.914 lr:0.0000100 epoch_Time:25601.0min: [2024-01-05 04:35:24,124][model8_pretrain.py][INFO] Epoch:[0/2](556800/4588595) loss:3.245 lr:0.0000100 epoch_Time:25601.0min: [2024-01-05 04:35:24,124][model8_pretrain.py][INFO] Epoch:[0/2](556800/4588595) loss:3.168 lr:0.0000100 epoch_Time:25601.0min: [2024-01-05 04:36:01,047][model8_pretrain.py][INFO] Epoch:[0/2](556900/4588595) loss:2.844 lr:0.0000100 epoch_Time:25600.0min: [2024-01-05 04:36:01,047][model8_pretrain.py][INFO] Epoch:[0/2](556900/4588595) loss:2.723 lr:0.0000100 epoch_Time:25600.0min: [2024-01-05 04:36:01,047][model8_pretrain.py][INFO] Epoch:[0/2](556900/4588595) loss:3.036 lr:0.0000100 epoch_Time:25600.0min: [2024-01-05 04:36:01,047][model8_pretrain.py][INFO] Epoch:[0/2](556900/4588595) loss:2.605 lr:0.0000100 epoch_Time:25600.0min: [2024-01-05 04:36:01,047][model8_pretrain.py][INFO] Epoch:[0/2](556900/4588595) loss:2.398 lr:0.0000100 epoch_Time:25600.0min: [2024-01-05 04:36:01,047][model8_pretrain.py][INFO] Epoch:[0/2](556900/4588595) loss:2.957 lr:0.0000100 
epoch_Time:25600.0min: [2024-01-05 04:36:01,047][model8_pretrain.py][INFO] Epoch:[0/2](556900/4588595) loss:2.750 lr:0.0000100 epoch_Time:25600.0min: [2024-01-05 04:36:01,047][model8_pretrain.py][INFO] Epoch:[0/2](556900/4588595) loss:3.112 lr:0.0000100 epoch_Time:25600.0min: [2024-01-05 04:36:37,989][model8_pretrain.py][INFO] Epoch:[0/2](557000/4588595) loss:2.995 lr:0.0000100 epoch_Time:25600.0min: [2024-01-05 04:36:37,989][model8_pretrain.py][INFO] Epoch:[0/2](557000/4588595) loss:3.072 lr:0.0000100 epoch_Time:25600.0min: [2024-01-05 04:36:37,989][model8_pretrain.py][INFO] Epoch:[0/2](557000/4588595) loss:2.695 lr:0.0000100 epoch_Time:25600.0min: [2024-01-05 04:36:37,989][model8_pretrain.py][INFO] Epoch:[0/2](557000/4588595) loss:2.313 lr:0.0000100 epoch_Time:25600.0min: [2024-01-05 04:36:37,989][model8_pretrain.py][INFO] Epoch:[0/2](557000/4588595) loss:2.801 lr:0.0000100 epoch_Time:25600.0min: [2024-01-05 04:36:37,989][model8_pretrain.py][INFO] Epoch:[0/2](557000/4588595) loss:2.603 lr:0.0000100 epoch_Time:25600.0min: [2024-01-05 04:36:37,989][model8_pretrain.py][INFO] Epoch:[0/2](557000/4588595) loss:2.833 lr:0.0000100 epoch_Time:25600.0min: [2024-01-05 04:36:37,989][model8_pretrain.py][INFO] Epoch:[0/2](557000/4588595) loss:2.823 lr:0.0000100 epoch_Time:25600.0min: [2024-01-05 04:37:14,923][model8_pretrain.py][INFO] Epoch:[0/2](557100/4588595) loss:3.108 lr:0.0000100 epoch_Time:25599.0min: [2024-01-05 04:37:14,923][model8_pretrain.py][INFO] Epoch:[0/2](557100/4588595) loss:2.431 lr:0.0000100 epoch_Time:25599.0min: [2024-01-05 04:37:14,923][model8_pretrain.py][INFO] Epoch:[0/2](557100/4588595) loss:2.813 lr:0.0000100 epoch_Time:25599.0min: [2024-01-05 04:37:14,923][model8_pretrain.py][INFO] Epoch:[0/2](557100/4588595) loss:2.624 lr:0.0000100 epoch_Time:25599.0min: [2024-01-05 04:37:14,923][model8_pretrain.py][INFO] Epoch:[0/2](557100/4588595) loss:3.149 lr:0.0000100 epoch_Time:25599.0min: [2024-01-05 04:37:14,923][model8_pretrain.py][INFO] 
Epoch:[0/2](557100/4588595) loss:2.798 lr:0.0000100 epoch_Time:25599.0min: [2024-01-05 04:37:14,923][model8_pretrain.py][INFO] Epoch:[0/2](557100/4588595) loss:3.424 lr:0.0000100 epoch_Time:25599.0min: [2024-01-05 04:37:14,923][model8_pretrain.py][INFO] Epoch:[0/2](557100/4588595) loss:2.936 lr:0.0000100 epoch_Time:25599.0min: [2024-01-05 04:37:51,850][model8_pretrain.py][INFO] Epoch:[0/2](557200/4588595) loss:2.229 lr:0.0000100 epoch_Time:25598.0min: [2024-01-05 04:37:51,850][model8_pretrain.py][INFO] Epoch:[0/2](557200/4588595) loss:2.525 lr:0.0000100 epoch_Time:25598.0min: [2024-01-05 04:37:51,850][model8_pretrain.py][INFO] Epoch:[0/2](557200/4588595) loss:3.080 lr:0.0000100 epoch_Time:25598.0min: [2024-01-05 04:37:51,850][model8_pretrain.py][INFO] Epoch:[0/2](557200/4588595) loss:3.085 lr:0.0000100 epoch_Time:25598.0min: [2024-01-05 04:37:51,850][model8_pretrain.py][INFO] Epoch:[0/2](557200/4588595) loss:3.323 lr:0.0000100 epoch_Time:25598.0min: [2024-01-05 04:37:51,850][model8_pretrain.py][INFO] Epoch:[0/2](557200/4588595) loss:2.499 lr:0.0000100 epoch_Time:25598.0min: [2024-01-05 04:37:51,850][model8_pretrain.py][INFO] Epoch:[0/2](557200/4588595) loss:2.292 lr:0.0000100 epoch_Time:25598.0min: [2024-01-05 04:37:51,850][model8_pretrain.py][INFO] Epoch:[0/2](557200/4588595) loss:2.823 lr:0.0000100 epoch_Time:25598.0min: [2024-01-05 04:38:28,797][model8_pretrain.py][INFO] Epoch:[0/2](557300/4588595) loss:2.726 lr:0.0000100 epoch_Time:25598.0min: [2024-01-05 04:38:28,797][model8_pretrain.py][INFO] Epoch:[0/2](557300/4588595) loss:2.716 lr:0.0000100 epoch_Time:25598.0min: [2024-01-05 04:38:28,797][model8_pretrain.py][INFO] Epoch:[0/2](557300/4588595) loss:2.677 lr:0.0000100 epoch_Time:25598.0min: [2024-01-05 04:38:28,797][model8_pretrain.py][INFO] Epoch:[0/2](557300/4588595) loss:2.521 lr:0.0000100 epoch_Time:25598.0min: [2024-01-05 04:38:28,797][model8_pretrain.py][INFO] Epoch:[0/2](557300/4588595) loss:2.416 lr:0.0000100 epoch_Time:25598.0min: [2024-01-05 
04:38:28,797][model8_pretrain.py][INFO] Epoch:[0/2](557300/4588595) loss:3.286 lr:0.0000100 epoch_Time:25598.0min: [2024-01-05 04:38:28,798][model8_pretrain.py][INFO] Epoch:[0/2](557300/4588595) loss:1.977 lr:0.0000100 epoch_Time:25598.0min: [2024-01-05 04:38:28,798][model8_pretrain.py][INFO] Epoch:[0/2](557300/4588595) loss:2.932 lr:0.0000100 epoch_Time:25598.0min: [2024-01-05 04:39:05,723][model8_pretrain.py][INFO] Epoch:[0/2](557400/4588595) loss:2.490 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:39:05,723][model8_pretrain.py][INFO] Epoch:[0/2](557400/4588595) loss:2.896 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:39:05,723][model8_pretrain.py][INFO] Epoch:[0/2](557400/4588595) loss:2.625 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:39:05,723][model8_pretrain.py][INFO] Epoch:[0/2](557400/4588595) loss:2.020 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:39:05,723][model8_pretrain.py][INFO] Epoch:[0/2](557400/4588595) loss:3.185 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:39:05,723][model8_pretrain.py][INFO] Epoch:[0/2](557400/4588595) loss:2.671 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:39:05,723][model8_pretrain.py][INFO] Epoch:[0/2](557400/4588595) loss:2.655 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:39:05,723][model8_pretrain.py][INFO] Epoch:[0/2](557400/4588595) loss:2.892 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:39:42,650][model8_pretrain.py][INFO] Epoch:[0/2](557500/4588595) loss:3.141 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:39:42,650][model8_pretrain.py][INFO] Epoch:[0/2](557500/4588595) loss:3.173 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:39:42,650][model8_pretrain.py][INFO] Epoch:[0/2](557500/4588595) loss:3.057 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:39:42,650][model8_pretrain.py][INFO] Epoch:[0/2](557500/4588595) loss:2.493 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:39:42,650][model8_pretrain.py][INFO] Epoch:[0/2](557500/4588595) loss:3.151 lr:0.0000100 
epoch_Time:25596.0min: [2024-01-05 04:39:42,650][model8_pretrain.py][INFO] Epoch:[0/2](557500/4588595) loss:2.314 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:39:42,651][model8_pretrain.py][INFO] Epoch:[0/2](557500/4588595) loss:3.140 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:39:42,651][model8_pretrain.py][INFO] Epoch:[0/2](557500/4588595) loss:2.695 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:40:32,523][model8_pretrain.py][INFO] Epoch:[0/2](557600/4588595) loss:2.682 lr:0.0000100 epoch_Time:25597.0min: [2024-01-05 04:40:32,523][model8_pretrain.py][INFO] Epoch:[0/2](557600/4588595) loss:3.172 lr:0.0000100 epoch_Time:25597.0min: [2024-01-05 04:40:32,523][model8_pretrain.py][INFO] Epoch:[0/2](557600/4588595) loss:2.410 lr:0.0000100 epoch_Time:25597.0min: [2024-01-05 04:40:32,523][model8_pretrain.py][INFO] Epoch:[0/2](557600/4588595) loss:3.138 lr:0.0000100 epoch_Time:25597.0min: [2024-01-05 04:40:32,523][model8_pretrain.py][INFO] Epoch:[0/2](557600/4588595) loss:2.680 lr:0.0000100 epoch_Time:25597.0min: [2024-01-05 04:40:32,523][model8_pretrain.py][INFO] Epoch:[0/2](557600/4588595) loss:3.031 lr:0.0000100 epoch_Time:25597.0min: [2024-01-05 04:40:32,523][model8_pretrain.py][INFO] Epoch:[0/2](557600/4588595) loss:2.872 lr:0.0000100 epoch_Time:25597.0min: [2024-01-05 04:40:32,523][model8_pretrain.py][INFO] Epoch:[0/2](557600/4588595) loss:2.948 lr:0.0000100 epoch_Time:25597.0min: [2024-01-05 04:41:09,453][model8_pretrain.py][INFO] Epoch:[0/2](557700/4588595) loss:3.285 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:41:09,453][model8_pretrain.py][INFO] Epoch:[0/2](557700/4588595) loss:2.912 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:41:09,453][model8_pretrain.py][INFO] Epoch:[0/2](557700/4588595) loss:3.051 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:41:09,453][model8_pretrain.py][INFO] Epoch:[0/2](557700/4588595) loss:2.195 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:41:09,453][model8_pretrain.py][INFO] 
Epoch:[0/2](557700/4588595) loss:2.638 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:41:09,453][model8_pretrain.py][INFO] Epoch:[0/2](557700/4588595) loss:2.415 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:41:09,453][model8_pretrain.py][INFO] Epoch:[0/2](557700/4588595) loss:2.680 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:41:09,453][model8_pretrain.py][INFO] Epoch:[0/2](557700/4588595) loss:3.060 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:41:46,388][model8_pretrain.py][INFO] Epoch:[0/2](557800/4588595) loss:3.177 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:41:46,388][model8_pretrain.py][INFO] Epoch:[0/2](557800/4588595) loss:2.794 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:41:46,388][model8_pretrain.py][INFO] Epoch:[0/2](557800/4588595) loss:3.174 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:41:46,388][model8_pretrain.py][INFO] Epoch:[0/2](557800/4588595) loss:2.333 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:41:46,389][model8_pretrain.py][INFO] Epoch:[0/2](557800/4588595) loss:2.538 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:41:46,389][model8_pretrain.py][INFO] Epoch:[0/2](557800/4588595) loss:2.714 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:41:46,389][model8_pretrain.py][INFO] Epoch:[0/2](557800/4588595) loss:2.724 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:41:46,389][model8_pretrain.py][INFO] Epoch:[0/2](557800/4588595) loss:2.758 lr:0.0000100 epoch_Time:25596.0min: [2024-01-05 04:42:23,329][model8_pretrain.py][INFO] Epoch:[0/2](557900/4588595) loss:3.213 lr:0.0000100 epoch_Time:25594.0min: [2024-01-05 04:42:23,329][model8_pretrain.py][INFO] Epoch:[0/2](557900/4588595) loss:3.099 lr:0.0000100 epoch_Time:25594.0min: [2024-01-05 04:42:23,329][model8_pretrain.py][INFO] Epoch:[0/2](557900/4588595) loss:2.563 lr:0.0000100 epoch_Time:25594.0min: [2024-01-05 04:42:23,329][model8_pretrain.py][INFO] Epoch:[0/2](557900/4588595) loss:2.839 lr:0.0000100 epoch_Time:25594.0min: [2024-01-05 
04:42:23,330][model8_pretrain.py][INFO] Epoch:[0/2](557900/4588595) loss:3.014 lr:0.0000100 epoch_Time:25594.0min: [2024-01-05 04:42:23,330][model8_pretrain.py][INFO] Epoch:[0/2](557900/4588595) loss:2.824 lr:0.0000100 epoch_Time:25594.0min: [2024-01-05 04:42:23,330][model8_pretrain.py][INFO] Epoch:[0/2](557900/4588595) loss:2.609 lr:0.0000100 epoch_Time:25594.0min: [2024-01-05 04:42:23,331][model8_pretrain.py][INFO] Epoch:[0/2](557900/4588595) loss:2.541 lr:0.0000100 epoch_Time:25594.0min: [2024-01-05 04:43:00,264][model8_pretrain.py][INFO] Epoch:[0/2](558000/4588595) loss:2.983 lr:0.0000100 epoch_Time:25593.0min: [2024-01-05 04:43:00,264][model8_pretrain.py][INFO] Epoch:[0/2](558000/4588595) loss:3.388 lr:0.0000100 epoch_Time:25593.0min: [2024-01-05 04:43:00,264][model8_pretrain.py][INFO] Epoch:[0/2](558000/4588595) loss:2.583 lr:0.0000100 epoch_Time:25593.0min: [2024-01-05 04:43:00,264][model8_pretrain.py][INFO] Epoch:[0/2](558000/4588595) loss:2.967 lr:0.0000100 epoch_Time:25593.0min: [2024-01-05 04:43:00,264][model8_pretrain.py][INFO] Epoch:[0/2](558000/4588595) loss:2.653 lr:0.0000100 epoch_Time:25593.0min: [2024-01-05 04:43:00,264][model8_pretrain.py][INFO] Epoch:[0/2](558000/4588595) loss:2.594 lr:0.0000100 epoch_Time:25593.0min: [2024-01-05 04:43:00,264][model8_pretrain.py][INFO] Epoch:[0/2](558000/4588595) loss:2.790 lr:0.0000100 epoch_Time:25593.0min: [2024-01-05 04:43:00,264][model8_pretrain.py][INFO] Epoch:[0/2](558000/4588595) loss:2.859 lr:0.0000100 epoch_Time:25593.0min: [2024-01-05 04:43:37,201][model8_pretrain.py][INFO] Epoch:[0/2](558100/4588595) loss:2.618 lr:0.0000100 epoch_Time:25593.0min: [2024-01-05 04:43:37,201][model8_pretrain.py][INFO] Epoch:[0/2](558100/4588595) loss:2.833 lr:0.0000100 epoch_Time:25593.0min: [2024-01-05 04:43:37,201][model8_pretrain.py][INFO] Epoch:[0/2](558100/4588595) loss:3.057 lr:0.0000100 epoch_Time:25593.0min: [2024-01-05 04:43:37,201][model8_pretrain.py][INFO] Epoch:[0/2](558100/4588595) loss:2.259 lr:0.0000100 
epoch_Time:25593.0min: [2024-01-05 04:43:37,201][model8_pretrain.py][INFO] Epoch:[0/2](558100/4588595) loss:3.157 lr:0.0000100 epoch_Time:25593.0min: [2024-01-05 04:43:37,201][model8_pretrain.py][INFO] Epoch:[0/2](558100/4588595) loss:2.350 lr:0.0000100 epoch_Time:25593.0min: [2024-01-05 04:43:37,201][model8_pretrain.py][INFO] Epoch:[0/2](558100/4588595) loss:2.390 lr:0.0000100 epoch_Time:25593.0min: [2024-01-05 04:43:37,201][model8_pretrain.py][INFO] Epoch:[0/2](558100/4588595) loss:2.426 lr:0.0000100 epoch_Time:25593.0min: [2024-01-05 04:44:14,094][model8_pretrain.py][INFO] Epoch:[0/2](558200/4588595) loss:2.773 lr:0.0000100 epoch_Time:25592.0min: [2024-01-05 04:44:14,094][model8_pretrain.py][INFO] Epoch:[0/2](558200/4588595) loss:3.438 lr:0.0000100 epoch_Time:25592.0min: [2024-01-05 04:44:14,094][model8_pretrain.py][INFO] Epoch:[0/2](558200/4588595) loss:2.993 lr:0.0000100 epoch_Time:25592.0min: [2024-01-05 04:44:14,094][model8_pretrain.py][INFO] Epoch:[0/2](558200/4588595) loss:2.733 lr:0.0000100 epoch_Time:25592.0min: [2024-01-05 04:44:14,094][model8_pretrain.py][INFO] Epoch:[0/2](558200/4588595) loss:3.021 lr:0.0000100 epoch_Time:25592.0min: [2024-01-05 04:44:14,094][model8_pretrain.py][INFO] Epoch:[0/2](558200/4588595) loss:3.193 lr:0.0000100 epoch_Time:25592.0min: [2024-01-05 04:44:14,094][model8_pretrain.py][INFO] Epoch:[0/2](558200/4588595) loss:2.776 lr:0.0000100 epoch_Time:25592.0min: [2024-01-05 04:44:14,094][model8_pretrain.py][INFO] Epoch:[0/2](558200/4588595) loss:3.037 lr:0.0000100 epoch_Time:25592.0min: [2024-01-05 04:44:51,027][model8_pretrain.py][INFO] Epoch:[0/2](558300/4588595) loss:2.788 lr:0.0000100 epoch_Time:25591.0min: [2024-01-05 04:44:51,027][model8_pretrain.py][INFO] Epoch:[0/2](558300/4588595) loss:2.738 lr:0.0000100 epoch_Time:25591.0min: [2024-01-05 04:44:51,027][model8_pretrain.py][INFO] Epoch:[0/2](558300/4588595) loss:2.807 lr:0.0000100 epoch_Time:25591.0min: [2024-01-05 04:44:51,027][model8_pretrain.py][INFO] 
Epoch:[0/2](558300/4588595) loss:3.027 lr:0.0000100 epoch_Time:25591.0min: [2024-01-05 04:44:51,027][model8_pretrain.py][INFO] Epoch:[0/2](558300/4588595) loss:2.762 lr:0.0000100 epoch_Time:25591.0min: [2024-01-05 04:44:51,027][model8_pretrain.py][INFO] Epoch:[0/2](558300/4588595) loss:3.186 lr:0.0000100 epoch_Time:25591.0min: [2024-01-05 04:44:51,027][model8_pretrain.py][INFO] Epoch:[0/2](558300/4588595) loss:3.080 lr:0.0000100 epoch_Time:25591.0min: [2024-01-05 04:44:51,027][model8_pretrain.py][INFO] Epoch:[0/2](558300/4588595) loss:2.664 lr:0.0000100 epoch_Time:25591.0min: [2024-01-05 04:45:40,112][model8_pretrain.py][INFO] Epoch:[0/2](558400/4588595) loss:2.527 lr:0.0000100 epoch_Time:25592.0min: [2024-01-05 04:45:40,112][model8_pretrain.py][INFO] Epoch:[0/2](558400/4588595) loss:2.436 lr:0.0000100 epoch_Time:25592.0min: [2024-01-05 04:45:40,112][model8_pretrain.py][INFO] Epoch:[0/2](558400/4588595) loss:2.055 lr:0.0000100 epoch_Time:25592.0min: [2024-01-05 04:45:40,112][model8_pretrain.py][INFO] Epoch:[0/2](558400/4588595) loss:3.201 lr:0.0000100 epoch_Time:25592.0min: [2024-01-05 04:45:40,112][model8_pretrain.py][INFO] Epoch:[0/2](558400/4588595) loss:2.604 lr:0.0000100 epoch_Time:25592.0min: [2024-01-05 04:45:40,112][model8_pretrain.py][INFO] Epoch:[0/2](558400/4588595) loss:2.645 lr:0.0000100 epoch_Time:25592.0min: [2024-01-05 04:45:40,112][model8_pretrain.py][INFO] Epoch:[0/2](558400/4588595) loss:2.666 lr:0.0000100 epoch_Time:25592.0min: [2024-01-05 04:45:40,112][model8_pretrain.py][INFO] Epoch:[0/2](558400/4588595) loss:2.886 lr:0.0000100 epoch_Time:25592.0min: [2024-01-05 04:46:17,077][model8_pretrain.py][INFO] Epoch:[0/2](558500/4588595) loss:2.533 lr:0.0000100 epoch_Time:25591.0min: [2024-01-05 04:46:17,077][model8_pretrain.py][INFO] Epoch:[0/2](558500/4588595) loss:2.995 lr:0.0000100 epoch_Time:25591.0min: [2024-01-05 04:46:17,077][model8_pretrain.py][INFO] Epoch:[0/2](558500/4588595) loss:2.657 lr:0.0000100 epoch_Time:25591.0min: [2024-01-05 
04:46:17,077][model8_pretrain.py][INFO] Epoch:[0/2](558500/4588595) loss:2.405 lr:0.0000100 epoch_Time:25591.0min: [2024-01-05 04:46:17,077][model8_pretrain.py][INFO] Epoch:[0/2](558500/4588595) loss:3.397 lr:0.0000100 epoch_Time:25591.0min: [2024-01-05 04:46:17,077][model8_pretrain.py][INFO] Epoch:[0/2](558500/4588595) loss:3.001 lr:0.0000100 epoch_Time:25591.0min: [2024-01-05 04:46:17,077][model8_pretrain.py][INFO] Epoch:[0/2](558500/4588595) loss:2.854 lr:0.0000100 epoch_Time:25591.0min: [2024-01-05 04:46:17,077][model8_pretrain.py][INFO] Epoch:[0/2](558500/4588595) loss:2.490 lr:0.0000100 epoch_Time:25591.0min: [2024-01-05 04:46:54,046][model8_pretrain.py][INFO] Epoch:[0/2](558600/4588595) loss:2.290 lr:0.0000100 epoch_Time:25590.0min: [2024-01-05 04:46:54,046][model8_pretrain.py][INFO] Epoch:[0/2](558600/4588595) loss:2.653 lr:0.0000100 epoch_Time:25590.0min: [2024-01-05 04:46:54,046][model8_pretrain.py][INFO] Epoch:[0/2](558600/4588595) loss:2.823 lr:0.0000100 epoch_Time:25590.0min: [2024-01-05 04:46:54,046][model8_pretrain.py][INFO] Epoch:[0/2](558600/4588595) loss:2.884 lr:0.0000100 epoch_Time:25590.0min: [2024-01-05 04:46:54,046][model8_pretrain.py][INFO] Epoch:[0/2](558600/4588595) loss:2.908 lr:0.0000100 epoch_Time:25590.0min: [2024-01-05 04:46:54,046][model8_pretrain.py][INFO] Epoch:[0/2](558600/4588595) loss:2.840 lr:0.0000100 epoch_Time:25590.0min: [2024-01-05 04:46:54,046][model8_pretrain.py][INFO] Epoch:[0/2](558600/4588595) loss:3.070 lr:0.0000100 epoch_Time:25590.0min: [2024-01-05 04:46:54,046][model8_pretrain.py][INFO] Epoch:[0/2](558600/4588595) loss:2.691 lr:0.0000100 epoch_Time:25590.0min: [2024-01-05 04:47:31,010][model8_pretrain.py][INFO] Epoch:[0/2](558700/4588595) loss:3.185 lr:0.0000100 epoch_Time:25590.0min: [2024-01-05 04:47:31,010][model8_pretrain.py][INFO] Epoch:[0/2](558700/4588595) loss:2.766 lr:0.0000100 epoch_Time:25590.0min: [2024-01-05 04:47:31,010][model8_pretrain.py][INFO] Epoch:[0/2](558700/4588595) loss:2.473 lr:0.0000100 
epoch_Time:25590.0min: [2024-01-05 04:47:31,010][model8_pretrain.py][INFO] Epoch:[0/2](558700/4588595) loss:2.904 lr:0.0000100 epoch_Time:25590.0min: [2024-01-05 04:47:31,010][model8_pretrain.py][INFO] Epoch:[0/2](558700/4588595) loss:2.941 lr:0.0000100 epoch_Time:25590.0min: [2024-01-05 04:47:31,010][model8_pretrain.py][INFO] Epoch:[0/2](558700/4588595) loss:3.240 lr:0.0000100 epoch_Time:25590.0min: [2024-01-05 04:47:31,010][model8_pretrain.py][INFO] Epoch:[0/2](558700/4588595) loss:2.970 lr:0.0000100 epoch_Time:25590.0min: [2024-01-05 04:47:31,010][model8_pretrain.py][INFO] Epoch:[0/2](558700/4588595) loss:2.330 lr:0.0000100 epoch_Time:25590.0min: [2024-01-05 04:48:07,973][model8_pretrain.py][INFO] Epoch:[0/2](558800/4588595) loss:2.278 lr:0.0000100 epoch_Time:25589.0min: [2024-01-05 04:48:07,973][model8_pretrain.py][INFO] Epoch:[0/2](558800/4588595) loss:2.792 lr:0.0000100 epoch_Time:25589.0min: [2024-01-05 04:48:07,973][model8_pretrain.py][INFO] Epoch:[0/2](558800/4588595) loss:2.684 lr:0.0000100 epoch_Time:25589.0min: [2024-01-05 04:48:07,973][model8_pretrain.py][INFO] Epoch:[0/2](558800/4588595) loss:2.546 lr:0.0000100 epoch_Time:25589.0min: [2024-01-05 04:48:07,973][model8_pretrain.py][INFO] Epoch:[0/2](558800/4588595) loss:3.284 lr:0.0000100 epoch_Time:25589.0min: [2024-01-05 04:48:07,973][model8_pretrain.py][INFO] Epoch:[0/2](558800/4588595) loss:2.986 lr:0.0000100 epoch_Time:25589.0min: [2024-01-05 04:48:07,974][model8_pretrain.py][INFO] Epoch:[0/2](558800/4588595) loss:3.119 lr:0.0000100 epoch_Time:25589.0min: [2024-01-05 04:48:07,974][model8_pretrain.py][INFO] Epoch:[0/2](558800/4588595) loss:2.758 lr:0.0000100 epoch_Time:25589.0min: [2024-01-05 04:48:44,918][model8_pretrain.py][INFO] Epoch:[0/2](558900/4588595) loss:2.745 lr:0.0000100 epoch_Time:25589.0min: [2024-01-05 04:48:44,918][model8_pretrain.py][INFO] Epoch:[0/2](558900/4588595) loss:3.011 lr:0.0000100 epoch_Time:25589.0min: [2024-01-05 04:48:44,918][model8_pretrain.py][INFO] 
Epoch:[0/2](558900/4588595) loss:2.278 lr:0.0000100 epoch_Time:25589.0min: [2024-01-05 04:48:44,918][model8_pretrain.py][INFO] Epoch:[0/2](558900/4588595) loss:2.657 lr:0.0000100 epoch_Time:25589.0min: [2024-01-05 04:48:44,918][model8_pretrain.py][INFO] Epoch:[0/2](558900/4588595) loss:2.939 lr:0.0000100 epoch_Time:25589.0min: [2024-01-05 04:48:44,918][model8_pretrain.py][INFO] Epoch:[0/2](558900/4588595) loss:2.865 lr:0.0000100 epoch_Time:25589.0min: [2024-01-05 04:48:44,918][model8_pretrain.py][INFO] Epoch:[0/2](558900/4588595) loss:3.276 lr:0.0000100 epoch_Time:25589.0min: [2024-01-05 04:48:44,919][model8_pretrain.py][INFO] Epoch:[0/2](558900/4588595) loss:2.502 lr:0.0000100 epoch_Time:25589.0min: [2024-01-05 04:49:21,871][model8_pretrain.py][INFO] Epoch:[0/2](559000/4588595) loss:2.395 lr:0.0000100 epoch_Time:25587.0min: [2024-01-05 04:49:21,871][model8_pretrain.py][INFO] Epoch:[0/2](559000/4588595) loss:3.177 lr:0.0000100 epoch_Time:25587.0min: [2024-01-05 04:49:21,871][model8_pretrain.py][INFO] Epoch:[0/2](559000/4588595) loss:2.684 lr:0.0000100 epoch_Time:25587.0min: [2024-01-05 04:49:21,871][model8_pretrain.py][INFO] Epoch:[0/2](559000/4588595) loss:3.199 lr:0.0000100 epoch_Time:25587.0min: [2024-01-05 04:49:21,871][model8_pretrain.py][INFO] Epoch:[0/2](559000/4588595) loss:2.819 lr:0.0000100 epoch_Time:25587.0min: [2024-01-05 04:49:21,871][model8_pretrain.py][INFO] Epoch:[0/2](559000/4588595) loss:2.547 lr:0.0000100 epoch_Time:25587.0min: [2024-01-05 04:49:21,871][model8_pretrain.py][INFO] Epoch:[0/2](559000/4588595) loss:3.004 lr:0.0000100 epoch_Time:25587.0min: [2024-01-05 04:49:21,871][model8_pretrain.py][INFO] Epoch:[0/2](559000/4588595) loss:3.083 lr:0.0000100 epoch_Time:25587.0min: [2024-01-05 04:49:58,825][model8_pretrain.py][INFO] Epoch:[0/2](559100/4588595) loss:3.246 lr:0.0000100 epoch_Time:25586.0min: [2024-01-05 04:49:58,825][model8_pretrain.py][INFO] Epoch:[0/2](559100/4588595) loss:2.774 lr:0.0000100 epoch_Time:25586.0min: [2024-01-05 
04:49:58,825][model8_pretrain.py][INFO] Epoch:[0/2](559100/4588595) loss:3.068 lr:0.0000100 epoch_Time:25586.0min: [2024-01-05 04:49:58,825][model8_pretrain.py][INFO] Epoch:[0/2](559100/4588595) loss:2.862 lr:0.0000100 epoch_Time:25586.0min: [2024-01-05 04:49:58,825][model8_pretrain.py][INFO] Epoch:[0/2](559100/4588595) loss:2.288 lr:0.0000100 epoch_Time:25586.0min: [2024-01-05 04:49:58,825][model8_pretrain.py][INFO] Epoch:[0/2](559100/4588595) loss:3.248 lr:0.0000100 epoch_Time:25586.0min: [2024-01-05 04:49:58,825][model8_pretrain.py][INFO] Epoch:[0/2](559100/4588595) loss:3.107 lr:0.0000100 epoch_Time:25586.0min: [2024-01-05 04:49:58,825][model8_pretrain.py][INFO] Epoch:[0/2](559100/4588595) loss:2.903 lr:0.0000100 epoch_Time:25586.0min: [2024-01-05 04:50:47,686][model8_pretrain.py][INFO] Epoch:[0/2](559200/4588595) loss:3.126 lr:0.0000100 epoch_Time:25587.0min: [2024-01-05 04:50:47,686][model8_pretrain.py][INFO] Epoch:[0/2](559200/4588595) loss:3.040 lr:0.0000100 epoch_Time:25587.0min: [2024-01-05 04:50:47,686][model8_pretrain.py][INFO] Epoch:[0/2](559200/4588595) loss:2.989 lr:0.0000100 epoch_Time:25587.0min: [2024-01-05 04:50:47,686][model8_pretrain.py][INFO] Epoch:[0/2](559200/4588595) loss:2.671 lr:0.0000100 epoch_Time:25587.0min: [2024-01-05 04:50:47,686][model8_pretrain.py][INFO] Epoch:[0/2](559200/4588595) loss:3.517 lr:0.0000100 epoch_Time:25587.0min: [2024-01-05 04:50:47,686][model8_pretrain.py][INFO] Epoch:[0/2](559200/4588595) loss:3.093 lr:0.0000100 epoch_Time:25587.0min: [2024-01-05 04:50:47,686][model8_pretrain.py][INFO] Epoch:[0/2](559200/4588595) loss:3.070 lr:0.0000100 epoch_Time:25587.0min: [2024-01-05 04:50:47,686][model8_pretrain.py][INFO] Epoch:[0/2](559200/4588595) loss:2.644 lr:0.0000100 epoch_Time:25587.0min: [2024-01-05 04:51:24,619][model8_pretrain.py][INFO] Epoch:[0/2](559300/4588595) loss:2.655 lr:0.0000100 epoch_Time:25587.0min: [2024-01-05 04:51:24,619][model8_pretrain.py][INFO] Epoch:[0/2](559300/4588595) loss:2.111 lr:0.0000100 
epoch_Time:25587.0min: [2024-01-05 04:51:24,619][model8_pretrain.py][INFO] Epoch:[0/2](559300/4588595) loss:2.570 lr:0.0000100 epoch_Time:25587.0min: [2024-01-05 04:51:24,619][model8_pretrain.py][INFO] Epoch:[0/2](559300/4588595) loss:2.700 lr:0.0000100 epoch_Time:25587.0min: [2024-01-05 04:51:24,619][model8_pretrain.py][INFO] Epoch:[0/2](559300/4588595) loss:2.591 lr:0.0000100 epoch_Time:25587.0min: [2024-01-05 04:51:24,619][model8_pretrain.py][INFO] Epoch:[0/2](559300/4588595) loss:3.029 lr:0.0000100 epoch_Time:25587.0min: [2024-01-05 04:51:24,619][model8_pretrain.py][INFO] Epoch:[0/2](559300/4588595) loss:2.892 lr:0.0000100 epoch_Time:25587.0min: [2024-01-05 04:51:24,619][model8_pretrain.py][INFO] Epoch:[0/2](559300/4588595) loss:2.646 lr:0.0000100 epoch_Time:25587.0min: [2024-01-05 04:52:01,548][model8_pretrain.py][INFO] Epoch:[0/2](559400/4588595) loss:2.926 lr:0.0000100 epoch_Time:25585.0min: [2024-01-05 04:52:01,548][model8_pretrain.py][INFO] Epoch:[0/2](559400/4588595) loss:3.334 lr:0.0000100 epoch_Time:25585.0min: [2024-01-05 04:52:01,548][model8_pretrain.py][INFO] Epoch:[0/2](559400/4588595) loss:2.862 lr:0.0000100 epoch_Time:25585.0min: [2024-01-05 04:52:01,548][model8_pretrain.py][INFO] Epoch:[0/2](559400/4588595) loss:2.944 lr:0.0000100 epoch_Time:25585.0min: [2024-01-05 04:52:01,549][model8_pretrain.py][INFO] Epoch:[0/2](559400/4588595) loss:3.224 lr:0.0000100 epoch_Time:25585.0min: [2024-01-05 04:52:01,549][model8_pretrain.py][INFO] Epoch:[0/2](559400/4588595) loss:2.948 lr:0.0000100 epoch_Time:25585.0min: [2024-01-05 04:52:01,549][model8_pretrain.py][INFO] Epoch:[0/2](559400/4588595) loss:2.987 lr:0.0000100 epoch_Time:25585.0min: [2024-01-05 04:52:01,549][model8_pretrain.py][INFO] Epoch:[0/2](559400/4588595) loss:3.116 lr:0.0000100 epoch_Time:25585.0min: [2024-01-05 04:52:38,472][model8_pretrain.py][INFO] Epoch:[0/2](559500/4588595) loss:2.907 lr:0.0000100 epoch_Time:25585.0min: [2024-01-05 04:52:38,472][model8_pretrain.py][INFO] 
Epoch:[0/2](559500/4588595) loss:2.907 lr:0.0000100 epoch_Time:25585.0min: [2024-01-05 04:52:38,472][model8_pretrain.py][INFO] Epoch:[0/2](559500/4588595) loss:2.991 lr:0.0000100 epoch_Time:25585.0min: [2024-01-05 04:52:38,472][model8_pretrain.py][INFO] Epoch:[0/2](559500/4588595) loss:3.176 lr:0.0000100 epoch_Time:25585.0min: [2024-01-05 04:52:38,472][model8_pretrain.py][INFO] Epoch:[0/2](559500/4588595) loss:2.903 lr:0.0000100 epoch_Time:25585.0min: [2024-01-05 04:52:38,472][model8_pretrain.py][INFO] Epoch:[0/2](559500/4588595) loss:2.727 lr:0.0000100 epoch_Time:25585.0min: [2024-01-05 04:52:38,472][model8_pretrain.py][INFO] Epoch:[0/2](559500/4588595) loss:3.051 lr:0.0000100 epoch_Time:25585.0min: [2024-01-05 04:52:38,472][model8_pretrain.py][INFO] Epoch:[0/2](559500/4588595) loss:3.193 lr:0.0000100 epoch_Time:25585.0min: [2024-01-05 04:53:15,409][model8_pretrain.py][INFO] Epoch:[0/2](559600/4588595) loss:2.930 lr:0.0000100 epoch_Time:25584.0min: [2024-01-05 04:53:15,409][model8_pretrain.py][INFO] Epoch:[0/2](559600/4588595) loss:2.360 lr:0.0000100 epoch_Time:25584.0min: [2024-01-05 04:53:15,409][model8_pretrain.py][INFO] Epoch:[0/2](559600/4588595) loss:3.187 lr:0.0000100 epoch_Time:25584.0min: [2024-01-05 04:53:15,410][model8_pretrain.py][INFO] Epoch:[0/2](559600/4588595) loss:3.226 lr:0.0000100 epoch_Time:25584.0min: [2024-01-05 04:53:15,410][model8_pretrain.py][INFO] Epoch:[0/2](559600/4588595) loss:3.243 lr:0.0000100 epoch_Time:25584.0min: [2024-01-05 04:53:15,410][model8_pretrain.py][INFO] Epoch:[0/2](559600/4588595) loss:3.118 lr:0.0000100 epoch_Time:25584.0min: [2024-01-05 04:53:15,410][model8_pretrain.py][INFO] Epoch:[0/2](559600/4588595) loss:2.959 lr:0.0000100 epoch_Time:25584.0min: [2024-01-05 04:53:15,410][model8_pretrain.py][INFO] Epoch:[0/2](559600/4588595) loss:2.794 lr:0.0000100 epoch_Time:25584.0min: [2024-01-05 04:53:52,431][model8_pretrain.py][INFO] Epoch:[0/2](559700/4588595) loss:3.463 lr:0.0000100 epoch_Time:25583.0min: [2024-01-05 
04:53:52,431][model8_pretrain.py][INFO] Epoch:[0/2](559700/4588595) loss:2.640 lr:0.0000100 epoch_Time:25583.0min: [2024-01-05 04:53:52,431][model8_pretrain.py][INFO] Epoch:[0/2](559700/4588595) loss:2.758 lr:0.0000100 epoch_Time:25583.0min: [2024-01-05 04:53:52,431][model8_pretrain.py][INFO] Epoch:[0/2](559700/4588595) loss:2.629 lr:0.0000100 epoch_Time:25583.0min: [2024-01-05 04:53:52,431][model8_pretrain.py][INFO] Epoch:[0/2](559700/4588595) loss:3.060 lr:0.0000100 epoch_Time:25583.0min: [2024-01-05 04:53:52,431][model8_pretrain.py][INFO] Epoch:[0/2](559700/4588595) loss:2.990 lr:0.0000100 epoch_Time:25583.0min: [2024-01-05 04:53:52,431][model8_pretrain.py][INFO] Epoch:[0/2](559700/4588595) loss:2.964 lr:0.0000100 epoch_Time:25583.0min: [2024-01-05 04:53:52,431][model8_pretrain.py][INFO] Epoch:[0/2](559700/4588595) loss:2.719 lr:0.0000100 epoch_Time:25583.0min: [2024-01-05 04:54:29,471][model8_pretrain.py][INFO] Epoch:[0/2](559800/4588595) loss:2.562 lr:0.0000100 epoch_Time:25583.0min: [2024-01-05 04:54:29,471][model8_pretrain.py][INFO] Epoch:[0/2](559800/4588595) loss:2.059 lr:0.0000100 epoch_Time:25583.0min: [2024-01-05 04:54:29,471][model8_pretrain.py][INFO] Epoch:[0/2](559800/4588595) loss:2.633 lr:0.0000100 epoch_Time:25583.0min: [2024-01-05 04:54:29,471][model8_pretrain.py][INFO] Epoch:[0/2](559800/4588595) loss:2.736 lr:0.0000100 epoch_Time:25583.0min: [2024-01-05 04:54:29,471][model8_pretrain.py][INFO] Epoch:[0/2](559800/4588595) loss:2.206 lr:0.0000100 epoch_Time:25583.0min: [2024-01-05 04:54:29,471][model8_pretrain.py][INFO] Epoch:[0/2](559800/4588595) loss:2.537 lr:0.0000100 epoch_Time:25583.0min: [2024-01-05 04:54:29,471][model8_pretrain.py][INFO] Epoch:[0/2](559800/4588595) loss:2.667 lr:0.0000100 epoch_Time:25583.0min: [2024-01-05 04:54:29,471][model8_pretrain.py][INFO] Epoch:[0/2](559800/4588595) loss:3.344 lr:0.0000100 epoch_Time:25583.0min: [2024-01-05 04:55:06,428][model8_pretrain.py][INFO] Epoch:[0/2](559900/4588595) loss:2.727 lr:0.0000100 
epoch_Time:25582.0min: [2024-01-05 04:55:06,428][model8_pretrain.py][INFO] Epoch:[0/2](559900/4588595) loss:2.404 lr:0.0000100 epoch_Time:25582.0min: [2024-01-05 04:55:06,428][model8_pretrain.py][INFO] Epoch:[0/2](559900/4588595) loss:2.134 lr:0.0000100 epoch_Time:25582.0min: [2024-01-05 04:55:06,428][model8_pretrain.py][INFO] Epoch:[0/2](559900/4588595) loss:3.067 lr:0.0000100 epoch_Time:25582.0min: [2024-01-05 04:55:06,428][model8_pretrain.py][INFO] Epoch:[0/2](559900/4588595) loss:2.271 lr:0.0000100 epoch_Time:25582.0min: [2024-01-05 04:55:06,428][model8_pretrain.py][INFO] Epoch:[0/2](559900/4588595) loss:2.485 lr:0.0000100 epoch_Time:25582.0min: [2024-01-05 04:55:06,429][model8_pretrain.py][INFO] Epoch:[0/2](559900/4588595) loss:2.991 lr:0.0000100 epoch_Time:25582.0min: [2024-01-05 04:55:06,429][model8_pretrain.py][INFO] Epoch:[0/2](559900/4588595) loss:2.519 lr:0.0000100 epoch_Time:25582.0min: [2024-01-05 04:55:55,242][model8_pretrain.py][INFO] Epoch:[0/2](560000/4588595) loss:2.186 lr:0.0000100 epoch_Time:25582.0min: [2024-01-05 04:55:55,242][model8_pretrain.py][INFO] Epoch:[0/2](560000/4588595) loss:3.083 lr:0.0000100 epoch_Time:25582.0min: [2024-01-05 04:55:55,242][model8_pretrain.py][INFO] Epoch:[0/2](560000/4588595) loss:2.933 lr:0.0000100 epoch_Time:25582.0min: [2024-01-05 04:55:55,242][model8_pretrain.py][INFO] Epoch:[0/2](560000/4588595) loss:2.461 lr:0.0000100 epoch_Time:25582.0min: [2024-01-05 04:55:55,242][model8_pretrain.py][INFO] Epoch:[0/2](560000/4588595) loss:2.680 lr:0.0000100 epoch_Time:25582.0min: [2024-01-05 04:55:55,242][model8_pretrain.py][INFO] Epoch:[0/2](560000/4588595) loss:2.472 lr:0.0000100 epoch_Time:25582.0min: [2024-01-05 04:55:55,242][model8_pretrain.py][INFO] Epoch:[0/2](560000/4588595) loss:2.563 lr:0.0000100 epoch_Time:25582.0min: [2024-01-05 04:55:55,242][model8_pretrain.py][INFO] Epoch:[0/2](560000/4588595) loss:2.877 lr:0.0000100 epoch_Time:25582.0min: [2024-01-05 04:56:32,161][model8_pretrain.py][INFO] 
Epoch:[0/2](560100/4588595) loss:2.925 lr:0.0000100 epoch_Time:25582.0min: [2024-01-05 04:56:32,161][model8_pretrain.py][INFO] Epoch:[0/2](560100/4588595) loss:2.675 lr:0.0000100 epoch_Time:25582.0min: [2024-01-05 04:56:32,161][model8_pretrain.py][INFO] Epoch:[0/2](560100/4588595) loss:2.530 lr:0.0000100 epoch_Time:25582.0min: [2024-01-05 04:56:32,161][model8_pretrain.py][INFO] Epoch:[0/2](560100/4588595) loss:2.421 lr:0.0000100 epoch_Time:25582.0min: [2024-01-05 04:56:32,161][model8_pretrain.py][INFO] Epoch:[0/2](560100/4588595) loss:2.675 lr:0.0000100 epoch_Time:25582.0min: [2024-01-05 04:56:32,161][model8_pretrain.py][INFO] Epoch:[0/2](560100/4588595) loss:3.005 lr:0.0000100 epoch_Time:25582.0min: [2024-01-05 04:56:32,161][model8_pretrain.py][INFO] Epoch:[0/2](560100/4588595) loss:3.133 lr:0.0000100 epoch_Time:25582.0min: [2024-01-05 04:56:32,161][model8_pretrain.py][INFO] Epoch:[0/2](560100/4588595) loss:2.987 lr:0.0000100 epoch_Time:25582.0min: [2024-01-05 04:57:09,093][model8_pretrain.py][INFO] Epoch:[0/2](560200/4588595) loss:2.563 lr:0.0000100 epoch_Time:25581.0min: [2024-01-05 04:57:09,093][model8_pretrain.py][INFO] Epoch:[0/2](560200/4588595) loss:2.813 lr:0.0000100 epoch_Time:25581.0min: [2024-01-05 04:57:09,093][model8_pretrain.py][INFO] Epoch:[0/2](560200/4588595) loss:2.930 lr:0.0000100 epoch_Time:25581.0min: [2024-01-05 04:57:09,093][model8_pretrain.py][INFO] Epoch:[0/2](560200/4588595) loss:2.644 lr:0.0000100 epoch_Time:25581.0min: [2024-01-05 04:57:09,093][model8_pretrain.py][INFO] Epoch:[0/2](560200/4588595) loss:2.489 lr:0.0000100 epoch_Time:25581.0min: [2024-01-05 04:57:09,093][model8_pretrain.py][INFO] Epoch:[0/2](560200/4588595) loss:2.680 lr:0.0000100 epoch_Time:25581.0min: [2024-01-05 04:57:09,093][model8_pretrain.py][INFO] Epoch:[0/2](560200/4588595) loss:2.689 lr:0.0000100 epoch_Time:25581.0min: [2024-01-05 04:57:09,093][model8_pretrain.py][INFO] Epoch:[0/2](560200/4588595) loss:1.831 lr:0.0000100 epoch_Time:25581.0min: [2024-01-05 
04:57:46,011][model8_pretrain.py][INFO] Epoch:[0/2](560300/4588595) loss:2.642 lr:0.0000100 epoch_Time:25581.0min: [2024-01-05 04:57:46,011][model8_pretrain.py][INFO] Epoch:[0/2](560300/4588595) loss:3.017 lr:0.0000100 epoch_Time:25581.0min: [2024-01-05 04:57:46,011][model8_pretrain.py][INFO] Epoch:[0/2](560300/4588595) loss:2.674 lr:0.0000100 epoch_Time:25581.0min: [2024-01-05 04:57:46,011][model8_pretrain.py][INFO] Epoch:[0/2](560300/4588595) loss:2.869 lr:0.0000100 epoch_Time:25581.0min: [2024-01-05 04:57:46,011][model8_pretrain.py][INFO] Epoch:[0/2](560300/4588595) loss:2.903 lr:0.0000100 epoch_Time:25581.0min: [2024-01-05 04:57:46,011][model8_pretrain.py][INFO] Epoch:[0/2](560300/4588595) loss:3.370 lr:0.0000100 epoch_Time:25581.0min: [2024-01-05 04:57:46,011][model8_pretrain.py][INFO] Epoch:[0/2](560300/4588595) loss:2.865 lr:0.0000100 epoch_Time:25581.0min: [2024-01-05 04:57:46,011][model8_pretrain.py][INFO] Epoch:[0/2](560300/4588595) loss:2.416 lr:0.0000100 epoch_Time:25581.0min: [2024-01-05 04:58:22,937][model8_pretrain.py][INFO] Epoch:[0/2](560400/4588595) loss:2.906 lr:0.0000100 epoch_Time:25579.0min: [2024-01-05 04:58:22,937][model8_pretrain.py][INFO] Epoch:[0/2](560400/4588595) loss:2.467 lr:0.0000100 epoch_Time:25579.0min: [2024-01-05 04:58:22,937][model8_pretrain.py][INFO] Epoch:[0/2](560400/4588595) loss:2.936 lr:0.0000100 epoch_Time:25579.0min: [2024-01-05 04:58:22,937][model8_pretrain.py][INFO] Epoch:[0/2](560400/4588595) loss:2.304 lr:0.0000100 epoch_Time:25579.0min: [2024-01-05 04:58:22,937][model8_pretrain.py][INFO] Epoch:[0/2](560400/4588595) loss:3.174 lr:0.0000100 epoch_Time:25579.0min: [2024-01-05 04:58:22,937][model8_pretrain.py][INFO] Epoch:[0/2](560400/4588595) loss:3.093 lr:0.0000100 epoch_Time:25579.0min: [2024-01-05 04:58:22,937][model8_pretrain.py][INFO] Epoch:[0/2](560400/4588595) loss:3.104 lr:0.0000100 epoch_Time:25579.0min: [2024-01-05 04:58:22,938][model8_pretrain.py][INFO] Epoch:[0/2](560400/4588595) loss:2.466 lr:0.0000100 
epoch_Time:25579.0min: [2024-01-05 04:58:59,857][model8_pretrain.py][INFO] Epoch:[0/2](560500/4588595) loss:2.761 lr:0.0000100 epoch_Time:25578.0min: [2024-01-05 04:58:59,857][model8_pretrain.py][INFO] Epoch:[0/2](560500/4588595) loss:3.081 lr:0.0000100 epoch_Time:25578.0min: [2024-01-05 04:58:59,857][model8_pretrain.py][INFO] Epoch:[0/2](560500/4588595) loss:3.377 lr:0.0000100 epoch_Time:25578.0min: [2024-01-05 04:58:59,857][model8_pretrain.py][INFO] Epoch:[0/2](560500/4588595) loss:2.670 lr:0.0000100 epoch_Time:25578.0min: [2024-01-05 04:58:59,857][model8_pretrain.py][INFO] Epoch:[0/2](560500/4588595) loss:2.821 lr:0.0000100 epoch_Time:25578.0min: [2024-01-05 04:58:59,857][model8_pretrain.py][INFO] Epoch:[0/2](560500/4588595) loss:2.932 lr:0.0000100 epoch_Time:25578.0min: [2024-01-05 04:58:59,857][model8_pretrain.py][INFO] Epoch:[0/2](560500/4588595) loss:3.407 lr:0.0000100 epoch_Time:25578.0min: [2024-01-05 04:58:59,857][model8_pretrain.py][INFO] Epoch:[0/2](560500/4588595) loss:2.903 lr:0.0000100 epoch_Time:25578.0min: [2024-01-05 04:59:36,783][model8_pretrain.py][INFO] Epoch:[0/2](560600/4588595) loss:3.290 lr:0.0000100 epoch_Time:25578.0min: [2024-01-05 04:59:36,783][model8_pretrain.py][INFO] Epoch:[0/2](560600/4588595) loss:2.543 lr:0.0000100 epoch_Time:25578.0min: [2024-01-05 04:59:36,783][model8_pretrain.py][INFO] Epoch:[0/2](560600/4588595) loss:2.841 lr:0.0000100 epoch_Time:25578.0min: [2024-01-05 04:59:36,783][model8_pretrain.py][INFO] Epoch:[0/2](560600/4588595) loss:2.185 lr:0.0000100 epoch_Time:25578.0min: [2024-01-05 04:59:36,783][model8_pretrain.py][INFO] Epoch:[0/2](560600/4588595) loss:2.792 lr:0.0000100 epoch_Time:25578.0min: [2024-01-05 04:59:36,783][model8_pretrain.py][INFO] Epoch:[0/2](560600/4588595) loss:2.867 lr:0.0000100 epoch_Time:25578.0min: [2024-01-05 04:59:36,783][model8_pretrain.py][INFO] Epoch:[0/2](560600/4588595) loss:2.825 lr:0.0000100 epoch_Time:25578.0min: [2024-01-05 04:59:36,783][model8_pretrain.py][INFO] 
Epoch:[0/2](560600/4588595) loss:2.592 lr:0.0000100 epoch_Time:25578.0min: [2024-01-05 05:00:13,720][model8_pretrain.py][INFO] Epoch:[0/2](560700/4588595) loss:2.245 lr:0.0000100 epoch_Time:25577.0min: [2024-01-05 05:00:13,720][model8_pretrain.py][INFO] Epoch:[0/2](560700/4588595) loss:2.707 lr:0.0000100 epoch_Time:25577.0min: [2024-01-05 05:00:13,720][model8_pretrain.py][INFO] Epoch:[0/2](560700/4588595) loss:2.785 lr:0.0000100 epoch_Time:25577.0min: [2024-01-05 05:00:13,720][model8_pretrain.py][INFO] Epoch:[0/2](560700/4588595) loss:3.286 lr:0.0000100 epoch_Time:25577.0min: [2024-01-05 05:00:13,720][model8_pretrain.py][INFO] Epoch:[0/2](560700/4588595) loss:2.475 lr:0.0000100 epoch_Time:25577.0min: [2024-01-05 05:00:13,720][model8_pretrain.py][INFO] Epoch:[0/2](560700/4588595) loss:2.857 lr:0.0000100 epoch_Time:25577.0min: [2024-01-05 05:00:13,720][model8_pretrain.py][INFO] Epoch:[0/2](560700/4588595) loss:2.987 lr:0.0000100 epoch_Time:25577.0min: [2024-01-05 05:00:13,720][model8_pretrain.py][INFO] Epoch:[0/2](560700/4588595) loss:2.878 lr:0.0000100 epoch_Time:25577.0min: [2024-01-05 05:01:02,408][model8_pretrain.py][INFO] Epoch:[0/2](560800/4588595) loss:3.363 lr:0.0000100 epoch_Time:25577.0min: [2024-01-05 05:01:02,408][model8_pretrain.py][INFO] Epoch:[0/2](560800/4588595) loss:2.878 lr:0.0000100 epoch_Time:25577.0min: [2024-01-05 05:01:02,408][model8_pretrain.py][INFO] Epoch:[0/2](560800/4588595) loss:2.670 lr:0.0000100 epoch_Time:25577.0min: [2024-01-05 05:01:02,408][model8_pretrain.py][INFO] Epoch:[0/2](560800/4588595) loss:3.644 lr:0.0000100 epoch_Time:25577.0min: [2024-01-05 05:01:02,408][model8_pretrain.py][INFO] Epoch:[0/2](560800/4588595) loss:2.667 lr:0.0000100 epoch_Time:25577.0min: [2024-01-05 05:01:02,408][model8_pretrain.py][INFO] Epoch:[0/2](560800/4588595) loss:3.258 lr:0.0000100 epoch_Time:25577.0min: [2024-01-05 05:01:02,408][model8_pretrain.py][INFO] Epoch:[0/2](560800/4588595) loss:2.864 lr:0.0000100 epoch_Time:25577.0min: [2024-01-05 
05:01:02,408][model8_pretrain.py][INFO] Epoch:[0/2](560800/4588595) loss:3.301 lr:0.0000100 epoch_Time:25577.0min: [2024-01-05 05:01:39,346][model8_pretrain.py][INFO] Epoch:[0/2](560900/4588595) loss:3.166 lr:0.0000100 epoch_Time:25577.0min: [2024-01-05 05:01:39,346][model8_pretrain.py][INFO] Epoch:[0/2](560900/4588595) loss:2.598 lr:0.0000100 epoch_Time:25577.0min: [2024-01-05 05:01:39,346][model8_pretrain.py][INFO] Epoch:[0/2](560900/4588595) loss:1.988 lr:0.0000100 epoch_Time:25577.0min: [2024-01-05 05:01:39,346][model8_pretrain.py][INFO] Epoch:[0/2](560900/4588595) loss:3.237 lr:0.0000100 epoch_Time:25577.0min: [2024-01-05 05:01:39,346][model8_pretrain.py][INFO] Epoch:[0/2](560900/4588595) loss:2.997 lr:0.0000100 epoch_Time:25577.0min: [2024-01-05 05:01:39,346][model8_pretrain.py][INFO] Epoch:[0/2](560900/4588595) loss:3.279 lr:0.0000100 epoch_Time:25577.0min: [2024-01-05 05:01:39,346][model8_pretrain.py][INFO] Epoch:[0/2](560900/4588595) loss:2.951 lr:0.0000100 epoch_Time:25577.0min: [2024-01-05 05:01:39,346][model8_pretrain.py][INFO] Epoch:[0/2](560900/4588595) loss:2.613 lr:0.0000100 epoch_Time:25577.0min: [2024-01-05 05:02:16,284][model8_pretrain.py][INFO] Epoch:[0/2](561000/4588595) loss:3.189 lr:0.0000100 epoch_Time:25576.0min: [2024-01-05 05:02:16,284][model8_pretrain.py][INFO] Epoch:[0/2](561000/4588595) loss:3.002 lr:0.0000100 epoch_Time:25576.0min: [2024-01-05 05:02:16,284][model8_pretrain.py][INFO] Epoch:[0/2](561000/4588595) loss:2.877 lr:0.0000100 epoch_Time:25576.0min: [2024-01-05 05:02:16,284][model8_pretrain.py][INFO] Epoch:[0/2](561000/4588595) loss:3.200 lr:0.0000100 epoch_Time:25576.0min: [2024-01-05 05:02:16,284][model8_pretrain.py][INFO] Epoch:[0/2](561000/4588595) loss:2.872 lr:0.0000100 epoch_Time:25576.0min: [2024-01-05 05:02:16,284][model8_pretrain.py][INFO] Epoch:[0/2](561000/4588595) loss:3.188 lr:0.0000100 epoch_Time:25576.0min: [2024-01-05 05:02:16,284][model8_pretrain.py][INFO] Epoch:[0/2](561000/4588595) loss:3.193 lr:0.0000100 
epoch_Time:25576.0min: [2024-01-05 05:02:16,284][model8_pretrain.py][INFO] Epoch:[0/2](561000/4588595) loss:3.040 lr:0.0000100 epoch_Time:25576.0min: [2024-01-05 05:02:53,218][model8_pretrain.py][INFO] Epoch:[0/2](561100/4588595) loss:2.714 lr:0.0000100 epoch_Time:25575.0min: [2024-01-05 05:02:53,218][model8_pretrain.py][INFO] Epoch:[0/2](561100/4588595) loss:3.510 lr:0.0000100 epoch_Time:25575.0min: [2024-01-05 05:02:53,218][model8_pretrain.py][INFO] Epoch:[0/2](561100/4588595) loss:2.729 lr:0.0000100 epoch_Time:25575.0min: [2024-01-05 05:02:53,218][model8_pretrain.py][INFO] Epoch:[0/2](561100/4588595) loss:3.221 lr:0.0000100 epoch_Time:25575.0min: [2024-01-05 05:02:53,218][model8_pretrain.py][INFO] Epoch:[0/2](561100/4588595) loss:2.737 lr:0.0000100 epoch_Time:25575.0min: [2024-01-05 05:02:53,218][model8_pretrain.py][INFO] Epoch:[0/2](561100/4588595) loss:2.709 lr:0.0000100 epoch_Time:25575.0min: [2024-01-05 05:02:53,218][model8_pretrain.py][INFO] Epoch:[0/2](561100/4588595) loss:3.271 lr:0.0000100 epoch_Time:25575.0min: [2024-01-05 05:02:53,218][model8_pretrain.py][INFO] Epoch:[0/2](561100/4588595) loss:3.194 lr:0.0000100 epoch_Time:25575.0min: [2024-01-05 05:03:30,145][model8_pretrain.py][INFO] Epoch:[0/2](561200/4588595) loss:2.878 lr:0.0000100 epoch_Time:25575.0min: [2024-01-05 05:03:30,145][model8_pretrain.py][INFO] Epoch:[0/2](561200/4588595) loss:3.040 lr:0.0000100 epoch_Time:25575.0min: [2024-01-05 05:03:30,145][model8_pretrain.py][INFO] Epoch:[0/2](561200/4588595) loss:3.006 lr:0.0000100 epoch_Time:25575.0min: [2024-01-05 05:03:30,145][model8_pretrain.py][INFO] Epoch:[0/2](561200/4588595) loss:2.689 lr:0.0000100 epoch_Time:25575.0min: [2024-01-05 05:03:30,145][model8_pretrain.py][INFO] Epoch:[0/2](561200/4588595) loss:2.904 lr:0.0000100 epoch_Time:25575.0min: [2024-01-05 05:03:30,145][model8_pretrain.py][INFO] Epoch:[0/2](561200/4588595) loss:3.416 lr:0.0000100 epoch_Time:25575.0min: [2024-01-05 05:03:30,145][model8_pretrain.py][INFO] 
Epoch:[0/2](561200/4588595) loss:2.623 lr:0.0000100 epoch_Time:25575.0min: [2024-01-05 05:03:30,145][model8_pretrain.py][INFO] Epoch:[0/2](561200/4588595) loss:3.028 lr:0.0000100 epoch_Time:25575.0min: [2024-01-05 05:04:07,078][model8_pretrain.py][INFO] Epoch:[0/2](561300/4588595) loss:2.768 lr:0.0000100 epoch_Time:25574.0min: [2024-01-05 05:04:07,078][model8_pretrain.py][INFO] Epoch:[0/2](561300/4588595) loss:2.834 lr:0.0000100 epoch_Time:25574.0min: [2024-01-05 05:04:07,078][model8_pretrain.py][INFO] Epoch:[0/2](561300/4588595) loss:2.801 lr:0.0000100 epoch_Time:25574.0min: [2024-01-05 05:04:07,078][model8_pretrain.py][INFO] Epoch:[0/2](561300/4588595) loss:3.011 lr:0.0000100 epoch_Time:25574.0min: [2024-01-05 05:04:07,078][model8_pretrain.py][INFO] Epoch:[0/2](561300/4588595) loss:2.454 lr:0.0000100 epoch_Time:25574.0min: [2024-01-05 05:04:07,078][model8_pretrain.py][INFO] Epoch:[0/2](561300/4588595) loss:2.915 lr:0.0000100 epoch_Time:25574.0min: [2024-01-05 05:04:07,078][model8_pretrain.py][INFO] Epoch:[0/2](561300/4588595) loss:2.572 lr:0.0000100 epoch_Time:25574.0min: [2024-01-05 05:04:07,078][model8_pretrain.py][INFO] Epoch:[0/2](561300/4588595) loss:3.028 lr:0.0000100 epoch_Time:25574.0min: [2024-01-05 05:04:43,983][model8_pretrain.py][INFO] Epoch:[0/2](561400/4588595) loss:2.795 lr:0.0000100 epoch_Time:25573.0min: [2024-01-05 05:04:43,983][model8_pretrain.py][INFO] Epoch:[0/2](561400/4588595) loss:2.815 lr:0.0000100 epoch_Time:25573.0min: [2024-01-05 05:04:43,983][model8_pretrain.py][INFO] Epoch:[0/2](561400/4588595) loss:2.652 lr:0.0000100 epoch_Time:25573.0min: [2024-01-05 05:04:43,983][model8_pretrain.py][INFO] Epoch:[0/2](561400/4588595) loss:3.063 lr:0.0000100 epoch_Time:25573.0min: [2024-01-05 05:04:43,983][model8_pretrain.py][INFO] Epoch:[0/2](561400/4588595) loss:2.828 lr:0.0000100 epoch_Time:25573.0min: [2024-01-05 05:04:43,983][model8_pretrain.py][INFO] Epoch:[0/2](561400/4588595) loss:2.615 lr:0.0000100 epoch_Time:25573.0min: [2024-01-05 
05:04:43,983][model8_pretrain.py][INFO] Epoch:[0/2](561400/4588595) loss:2.591 lr:0.0000100 epoch_Time:25573.0min: [2024-01-05 05:04:43,984][model8_pretrain.py][INFO] Epoch:[0/2](561400/4588595) loss:2.966 lr:0.0000100 epoch_Time:25573.0min: [2024-01-05 05:05:20,916][model8_pretrain.py][INFO] Epoch:[0/2](561500/4588595) loss:3.135 lr:0.0000100 epoch_Time:25572.0min: [2024-01-05 05:05:20,916][model8_pretrain.py][INFO] Epoch:[0/2](561500/4588595) loss:2.907 lr:0.0000100 epoch_Time:25572.0min: [2024-01-05 05:05:20,916][model8_pretrain.py][INFO] Epoch:[0/2](561500/4588595) loss:2.772 lr:0.0000100 epoch_Time:25572.0min: [2024-01-05 05:05:20,916][model8_pretrain.py][INFO] Epoch:[0/2](561500/4588595) loss:3.134 lr:0.0000100 epoch_Time:25572.0min: [2024-01-05 05:05:20,916][model8_pretrain.py][INFO] Epoch:[0/2](561500/4588595) loss:3.254 lr:0.0000100 epoch_Time:25572.0min: [2024-01-05 05:05:20,916][model8_pretrain.py][INFO] Epoch:[0/2](561500/4588595) loss:2.565 lr:0.0000100 epoch_Time:25572.0min: [2024-01-05 05:05:20,916][model8_pretrain.py][INFO] Epoch:[0/2](561500/4588595) loss:2.754 lr:0.0000100 epoch_Time:25572.0min: [2024-01-05 05:05:20,917][model8_pretrain.py][INFO] Epoch:[0/2](561500/4588595) loss:2.777 lr:0.0000100 epoch_Time:25572.0min: [2024-01-05 05:06:09,626][model8_pretrain.py][INFO] Epoch:[0/2](561600/4588595) loss:2.590 lr:0.0000100 epoch_Time:25573.0min: [2024-01-05 05:06:09,626][model8_pretrain.py][INFO] Epoch:[0/2](561600/4588595) loss:2.222 lr:0.0000100 epoch_Time:25573.0min: [2024-01-05 05:06:09,626][model8_pretrain.py][INFO] Epoch:[0/2](561600/4588595) loss:3.231 lr:0.0000100 epoch_Time:25573.0min: [2024-01-05 05:06:09,626][model8_pretrain.py][INFO] Epoch:[0/2](561600/4588595) loss:2.879 lr:0.0000100 epoch_Time:25573.0min: [2024-01-05 05:06:09,626][model8_pretrain.py][INFO] Epoch:[0/2](561600/4588595) loss:2.617 lr:0.0000100 epoch_Time:25573.0min: [2024-01-05 05:06:09,626][model8_pretrain.py][INFO] Epoch:[0/2](561600/4588595) loss:3.068 lr:0.0000100 
epoch_Time:25573.0min: [2024-01-05 05:06:09,626][model8_pretrain.py][INFO] Epoch:[0/2](561600/4588595) loss:3.339 lr:0.0000100 epoch_Time:25573.0min: [2024-01-05 05:06:09,627][model8_pretrain.py][INFO] Epoch:[0/2](561600/4588595) loss:2.787 lr:0.0000100 epoch_Time:25573.0min: [2024-01-05 05:06:46,554][model8_pretrain.py][INFO] Epoch:[0/2](561700/4588595) loss:2.431 lr:0.0000100 epoch_Time:25573.0min: [2024-01-05 05:06:46,554][model8_pretrain.py][INFO] Epoch:[0/2](561700/4588595) loss:2.900 lr:0.0000100 epoch_Time:25573.0min: [2024-01-05 05:06:46,554][model8_pretrain.py][INFO] Epoch:[0/2](561700/4588595) loss:2.928 lr:0.0000100 epoch_Time:25573.0min: [2024-01-05 05:06:46,554][model8_pretrain.py][INFO] Epoch:[0/2](561700/4588595) loss:2.718 lr:0.0000100 epoch_Time:25573.0min: [2024-01-05 05:06:46,554][model8_pretrain.py][INFO] Epoch:[0/2](561700/4588595) loss:2.639 lr:0.0000100 epoch_Time:25573.0min: [2024-01-05 05:06:46,554][model8_pretrain.py][INFO] Epoch:[0/2](561700/4588595) loss:2.919 lr:0.0000100 epoch_Time:25573.0min: [2024-01-05 05:06:46,554][model8_pretrain.py][INFO] Epoch:[0/2](561700/4588595) loss:2.136 lr:0.0000100 epoch_Time:25573.0min: [2024-01-05 05:06:46,554][model8_pretrain.py][INFO] Epoch:[0/2](561700/4588595) loss:2.881 lr:0.0000100 epoch_Time:25573.0min: [2024-01-05 05:07:23,497][model8_pretrain.py][INFO] Epoch:[0/2](561800/4588595) loss:3.046 lr:0.0000100 epoch_Time:25571.0min: [2024-01-05 05:07:23,497][model8_pretrain.py][INFO] Epoch:[0/2](561800/4588595) loss:2.794 lr:0.0000100 epoch_Time:25571.0min: [2024-01-05 05:07:23,497][model8_pretrain.py][INFO] Epoch:[0/2](561800/4588595) loss:3.075 lr:0.0000100 epoch_Time:25571.0min: [2024-01-05 05:07:23,497][model8_pretrain.py][INFO] Epoch:[0/2](561800/4588595) loss:3.193 lr:0.0000100 epoch_Time:25571.0min: [2024-01-05 05:07:23,497][model8_pretrain.py][INFO] Epoch:[0/2](561800/4588595) loss:2.669 lr:0.0000100 epoch_Time:25571.0min: [2024-01-05 05:07:23,497][model8_pretrain.py][INFO] 
Epoch:[0/2](561800/4588595) loss:3.001 lr:0.0000100 epoch_Time:25571.0min: [2024-01-05 05:07:23,497][model8_pretrain.py][INFO] Epoch:[0/2](561800/4588595) loss:2.824 lr:0.0000100 epoch_Time:25571.0min: [2024-01-05 05:07:23,497][model8_pretrain.py][INFO] Epoch:[0/2](561800/4588595) loss:2.742 lr:0.0000100 epoch_Time:25571.0min: [2024-01-05 05:08:00,436][model8_pretrain.py][INFO] Epoch:[0/2](561900/4588595) loss:1.993 lr:0.0000100 epoch_Time:25570.0min: [2024-01-05 05:08:00,436][model8_pretrain.py][INFO] Epoch:[0/2](561900/4588595) loss:2.732 lr:0.0000100 epoch_Time:25570.0min: [2024-01-05 05:08:00,436][model8_pretrain.py][INFO] Epoch:[0/2](561900/4588595) loss:2.537 lr:0.0000100 epoch_Time:25570.0min: [2024-01-05 05:08:00,436][model8_pretrain.py][INFO] Epoch:[0/2](561900/4588595) loss:2.958 lr:0.0000100 epoch_Time:25570.0min: [2024-01-05 05:08:00,436][model8_pretrain.py][INFO] Epoch:[0/2](561900/4588595) loss:2.555 lr:0.0000100 epoch_Time:25570.0min: [2024-01-05 05:08:00,436][model8_pretrain.py][INFO] Epoch:[0/2](561900/4588595) loss:3.009 lr:0.0000100 epoch_Time:25570.0min: [2024-01-05 05:08:00,436][model8_pretrain.py][INFO] Epoch:[0/2](561900/4588595) loss:3.163 lr:0.0000100 epoch_Time:25570.0min: [2024-01-05 05:08:00,436][model8_pretrain.py][INFO] Epoch:[0/2](561900/4588595) loss:3.092 lr:0.0000100 epoch_Time:25570.0min: [2024-01-05 05:08:37,367][model8_pretrain.py][INFO] Epoch:[0/2](562000/4588595) loss:2.782 lr:0.0000100 epoch_Time:25570.0min: [2024-01-05 05:08:37,367][model8_pretrain.py][INFO] Epoch:[0/2](562000/4588595) loss:2.851 lr:0.0000100 epoch_Time:25570.0min: [2024-01-05 05:08:37,367][model8_pretrain.py][INFO] Epoch:[0/2](562000/4588595) loss:3.104 lr:0.0000100 epoch_Time:25570.0min: [2024-01-05 05:08:37,367][model8_pretrain.py][INFO] Epoch:[0/2](562000/4588595) loss:3.154 lr:0.0000100 epoch_Time:25570.0min: [2024-01-05 05:08:37,367][model8_pretrain.py][INFO] Epoch:[0/2](562000/4588595) loss:3.107 lr:0.0000100 epoch_Time:25570.0min: [2024-01-05 
05:08:37,367][model8_pretrain.py][INFO] Epoch:[0/2](562000/4588595) loss:2.612 lr:0.0000100 epoch_Time:25570.0min: [2024-01-05 05:08:37,367][model8_pretrain.py][INFO] Epoch:[0/2](562000/4588595) loss:2.456 lr:0.0000100 epoch_Time:25570.0min: [2024-01-05 05:08:37,368][model8_pretrain.py][INFO] Epoch:[0/2](562000/4588595) loss:2.924 lr:0.0000100 epoch_Time:25570.0min: [2024-01-05 05:09:14,304][model8_pretrain.py][INFO] Epoch:[0/2](562100/4588595) loss:3.096 lr:0.0000100 epoch_Time:25569.0min: [2024-01-05 05:09:14,304][model8_pretrain.py][INFO] Epoch:[0/2](562100/4588595) loss:3.109 lr:0.0000100 epoch_Time:25569.0min: [2024-01-05 05:09:14,304][model8_pretrain.py][INFO] Epoch:[0/2](562100/4588595) loss:2.106 lr:0.0000100 epoch_Time:25569.0min: [2024-01-05 05:09:14,304][model8_pretrain.py][INFO] Epoch:[0/2](562100/4588595) loss:2.680 lr:0.0000100 epoch_Time:25569.0min: [2024-01-05 05:09:14,304][model8_pretrain.py][INFO] Epoch:[0/2](562100/4588595) loss:3.448 lr:0.0000100 epoch_Time:25569.0min: [2024-01-05 05:09:14,304][model8_pretrain.py][INFO] Epoch:[0/2](562100/4588595) loss:2.068 lr:0.0000100 epoch_Time:25569.0min: [2024-01-05 05:09:14,304][model8_pretrain.py][INFO] Epoch:[0/2](562100/4588595) loss:2.913 lr:0.0000100 epoch_Time:25569.0min: [2024-01-05 05:09:14,304][model8_pretrain.py][INFO] Epoch:[0/2](562100/4588595) loss:2.445 lr:0.0000100 epoch_Time:25569.0min: [2024-01-05 05:09:51,237][model8_pretrain.py][INFO] Epoch:[0/2](562200/4588595) loss:2.287 lr:0.0000100 epoch_Time:25568.0min: [2024-01-05 05:09:51,237][model8_pretrain.py][INFO] Epoch:[0/2](562200/4588595) loss:3.157 lr:0.0000100 epoch_Time:25568.0min: [2024-01-05 05:09:51,237][model8_pretrain.py][INFO] Epoch:[0/2](562200/4588595) loss:2.997 lr:0.0000100 epoch_Time:25568.0min: [2024-01-05 05:09:51,237][model8_pretrain.py][INFO] Epoch:[0/2](562200/4588595) loss:3.051 lr:0.0000100 epoch_Time:25568.0min: [2024-01-05 05:09:51,238][model8_pretrain.py][INFO] Epoch:[0/2](562200/4588595) loss:3.208 lr:0.0000100 
epoch_Time:25568.0min: [2024-01-05 05:09:51,238][model8_pretrain.py][INFO] Epoch:[0/2](562200/4588595) loss:3.125 lr:0.0000100 epoch_Time:25568.0min: [2024-01-05 05:09:51,238][model8_pretrain.py][INFO] Epoch:[0/2](562200/4588595) loss:3.205 lr:0.0000100 epoch_Time:25568.0min: [2024-01-05 05:09:51,238][model8_pretrain.py][INFO] Epoch:[0/2](562200/4588595) loss:2.763 lr:0.0000100 epoch_Time:25568.0min: [2024-01-05 05:10:28,140][model8_pretrain.py][INFO] Epoch:[0/2](562300/4588595) loss:3.442 lr:0.0000100 epoch_Time:25568.0min: [2024-01-05 05:10:28,141][model8_pretrain.py][INFO] Epoch:[0/2](562300/4588595) loss:2.401 lr:0.0000100 epoch_Time:25568.0min: [2024-01-05 05:10:28,141][model8_pretrain.py][INFO] Epoch:[0/2](562300/4588595) loss:2.576 lr:0.0000100 epoch_Time:25568.0min: [2024-01-05 05:10:28,141][model8_pretrain.py][INFO] Epoch:[0/2](562300/4588595) loss:2.900 lr:0.0000100 epoch_Time:25568.0min: [2024-01-05 05:10:28,141][model8_pretrain.py][INFO] Epoch:[0/2](562300/4588595) loss:2.773 lr:0.0000100 epoch_Time:25568.0min: [2024-01-05 05:10:28,141][model8_pretrain.py][INFO] Epoch:[0/2](562300/4588595) loss:2.520 lr:0.0000100 epoch_Time:25568.0min: [2024-01-05 05:10:28,141][model8_pretrain.py][INFO] Epoch:[0/2](562300/4588595) loss:2.987 lr:0.0000100 epoch_Time:25568.0min: [2024-01-05 05:10:28,141][model8_pretrain.py][INFO] Epoch:[0/2](562300/4588595) loss:2.786 lr:0.0000100 epoch_Time:25568.0min: [2024-01-05 05:11:16,949][model8_pretrain.py][INFO] Epoch:[0/2](562400/4588595) loss:2.800 lr:0.0000100 epoch_Time:25568.0min: [2024-01-05 05:11:16,949][model8_pretrain.py][INFO] Epoch:[0/2](562400/4588595) loss:2.829 lr:0.0000100 epoch_Time:25568.0min: [2024-01-05 05:11:16,949][model8_pretrain.py][INFO] Epoch:[0/2](562400/4588595) loss:3.057 lr:0.0000100 epoch_Time:25568.0min: [2024-01-05 05:11:16,949][model8_pretrain.py][INFO] Epoch:[0/2](562400/4588595) loss:3.109 lr:0.0000100 epoch_Time:25568.0min: [2024-01-05 05:11:16,949][model8_pretrain.py][INFO] 
Epoch:[0/2](562400/4588595) loss:2.904 lr:0.0000100 epoch_Time:25568.0min: [2024-01-05 05:11:16,949][model8_pretrain.py][INFO] Epoch:[0/2](562400/4588595) loss:3.120 lr:0.0000100 epoch_Time:25568.0min: [2024-01-05 05:11:16,949][model8_pretrain.py][INFO] Epoch:[0/2](562400/4588595) loss:2.638 lr:0.0000100 epoch_Time:25568.0min: [2024-01-05 05:11:16,949][model8_pretrain.py][INFO] Epoch:[0/2](562400/4588595) loss:2.976 lr:0.0000100 epoch_Time:25568.0min: [2024-01-05 05:11:53,876][model8_pretrain.py][INFO] Epoch:[0/2](562500/4588595) loss:2.659 lr:0.0000100 epoch_Time:25567.0min: [2024-01-05 05:11:53,876][model8_pretrain.py][INFO] Epoch:[0/2](562500/4588595) loss:2.812 lr:0.0000100 epoch_Time:25567.0min: [2024-01-05 05:11:53,876][model8_pretrain.py][INFO] Epoch:[0/2](562500/4588595) loss:3.131 lr:0.0000100 epoch_Time:25567.0min: [2024-01-05 05:11:53,876][model8_pretrain.py][INFO] Epoch:[0/2](562500/4588595) loss:3.052 lr:0.0000100 epoch_Time:25567.0min: [2024-01-05 05:11:53,877][model8_pretrain.py][INFO] Epoch:[0/2](562500/4588595) loss:2.796 lr:0.0000100 epoch_Time:25567.0min: [2024-01-05 05:11:53,877][model8_pretrain.py][INFO] Epoch:[0/2](562500/4588595) loss:2.174 lr:0.0000100 epoch_Time:25567.0min: [2024-01-05 05:11:53,877][model8_pretrain.py][INFO] Epoch:[0/2](562500/4588595) loss:2.955 lr:0.0000100 epoch_Time:25567.0min: [2024-01-05 05:11:53,877][model8_pretrain.py][INFO] Epoch:[0/2](562500/4588595) loss:2.985 lr:0.0000100 epoch_Time:25567.0min: [2024-01-05 05:12:30,813][model8_pretrain.py][INFO] Epoch:[0/2](562600/4588595) loss:2.732 lr:0.0000100 epoch_Time:25567.0min: [2024-01-05 05:12:30,813][model8_pretrain.py][INFO] Epoch:[0/2](562600/4588595) loss:3.228 lr:0.0000100 epoch_Time:25567.0min: [2024-01-05 05:12:30,813][model8_pretrain.py][INFO] Epoch:[0/2](562600/4588595) loss:2.370 lr:0.0000100 epoch_Time:25567.0min: [2024-01-05 05:12:30,814][model8_pretrain.py][INFO] Epoch:[0/2](562600/4588595) loss:3.043 lr:0.0000100 epoch_Time:25567.0min: [2024-01-05 
05:12:30,814][model8_pretrain.py][INFO] Epoch:[0/2](562600/4588595) loss:3.156 lr:0.0000100 epoch_Time:25567.0min: [2024-01-05 05:12:30,814][model8_pretrain.py][INFO] Epoch:[0/2](562600/4588595) loss:2.863 lr:0.0000100 epoch_Time:25567.0min: [2024-01-05 05:12:30,814][model8_pretrain.py][INFO] Epoch:[0/2](562600/4588595) loss:2.796 lr:0.0000100 epoch_Time:25567.0min: [2024-01-05 05:12:30,814][model8_pretrain.py][INFO] Epoch:[0/2](562600/4588595) loss:2.474 lr:0.0000100 epoch_Time:25567.0min: [2024-01-05 05:13:07,735][model8_pretrain.py][INFO] Epoch:[0/2](562700/4588595) loss:2.428 lr:0.0000100 epoch_Time:25566.0min: [2024-01-05 05:13:07,735][model8_pretrain.py][INFO] Epoch:[0/2](562700/4588595) loss:2.632 lr:0.0000100 epoch_Time:25566.0min: [2024-01-05 05:13:07,735][model8_pretrain.py][INFO] Epoch:[0/2](562700/4588595) loss:2.963 lr:0.0000100 epoch_Time:25566.0min: [2024-01-05 05:13:07,735][model8_pretrain.py][INFO] Epoch:[0/2](562700/4588595) loss:2.562 lr:0.0000100 epoch_Time:25566.0min: [2024-01-05 05:13:07,735][model8_pretrain.py][INFO] Epoch:[0/2](562700/4588595) loss:2.919 lr:0.0000100 epoch_Time:25566.0min: [2024-01-05 05:13:07,735][model8_pretrain.py][INFO] Epoch:[0/2](562700/4588595) loss:3.070 lr:0.0000100 epoch_Time:25566.0min: [2024-01-05 05:13:07,735][model8_pretrain.py][INFO] Epoch:[0/2](562700/4588595) loss:3.045 lr:0.0000100 epoch_Time:25566.0min: [2024-01-05 05:13:07,735][model8_pretrain.py][INFO] Epoch:[0/2](562700/4588595) loss:2.795 lr:0.0000100 epoch_Time:25566.0min: [2024-01-05 05:13:44,658][model8_pretrain.py][INFO] Epoch:[0/2](562800/4588595) loss:3.213 lr:0.0000100 epoch_Time:25565.0min: [2024-01-05 05:13:44,659][model8_pretrain.py][INFO] Epoch:[0/2](562800/4588595) loss:2.836 lr:0.0000100 epoch_Time:25565.0min: [2024-01-05 05:13:44,659][model8_pretrain.py][INFO] Epoch:[0/2](562800/4588595) loss:3.119 lr:0.0000100 epoch_Time:25565.0min: [2024-01-05 05:13:44,659][model8_pretrain.py][INFO] Epoch:[0/2](562800/4588595) loss:2.372 lr:0.0000100 
epoch_Time:25565.0min: [2024-01-05 05:13:44,659][model8_pretrain.py][INFO] Epoch:[0/2](562800/4588595) loss:2.985 lr:0.0000100 epoch_Time:25565.0min: [2024-01-05 05:13:44,659][model8_pretrain.py][INFO] Epoch:[0/2](562800/4588595) loss:3.493 lr:0.0000100 epoch_Time:25565.0min: [2024-01-05 05:13:44,659][model8_pretrain.py][INFO] Epoch:[0/2](562800/4588595) loss:2.825 lr:0.0000100 epoch_Time:25565.0min: [2024-01-05 05:13:44,659][model8_pretrain.py][INFO] Epoch:[0/2](562800/4588595) loss:2.777 lr:0.0000100 epoch_Time:25565.0min: [2024-01-05 05:14:21,590][model8_pretrain.py][INFO] Epoch:[0/2](562900/4588595) loss:3.019 lr:0.0000100 epoch_Time:25564.0min: [2024-01-05 05:14:21,590][model8_pretrain.py][INFO] Epoch:[0/2](562900/4588595) loss:2.928 lr:0.0000100 epoch_Time:25564.0min: [2024-01-05 05:14:21,590][model8_pretrain.py][INFO] Epoch:[0/2](562900/4588595) loss:2.712 lr:0.0000100 epoch_Time:25564.0min: [2024-01-05 05:14:21,590][model8_pretrain.py][INFO] Epoch:[0/2](562900/4588595) loss:2.671 lr:0.0000100 epoch_Time:25564.0min: [2024-01-05 05:14:21,590][model8_pretrain.py][INFO] Epoch:[0/2](562900/4588595) loss:2.776 lr:0.0000100 epoch_Time:25564.0min: [2024-01-05 05:14:21,590][model8_pretrain.py][INFO] Epoch:[0/2](562900/4588595) loss:3.294 lr:0.0000100 epoch_Time:25564.0min: [2024-01-05 05:14:21,590][model8_pretrain.py][INFO] Epoch:[0/2](562900/4588595) loss:2.901 lr:0.0000100 epoch_Time:25564.0min: [2024-01-05 05:14:21,591][model8_pretrain.py][INFO] Epoch:[0/2](562900/4588595) loss:3.040 lr:0.0000100 epoch_Time:25564.0min: [2024-01-05 05:14:58,515][model8_pretrain.py][INFO] Epoch:[0/2](563000/4588595) loss:2.479 lr:0.0000100 epoch_Time:25563.0min: [2024-01-05 05:14:58,515][model8_pretrain.py][INFO] Epoch:[0/2](563000/4588595) loss:2.863 lr:0.0000100 epoch_Time:25563.0min: [2024-01-05 05:14:58,515][model8_pretrain.py][INFO] Epoch:[0/2](563000/4588595) loss:2.879 lr:0.0000100 epoch_Time:25563.0min: [2024-01-05 05:14:58,515][model8_pretrain.py][INFO] 
Epoch:[0/2](563000/4588595) loss:2.723 lr:0.0000100 epoch_Time:25563.0min: [2024-01-05 05:14:58,515][model8_pretrain.py][INFO] Epoch:[0/2](563000/4588595) loss:2.735 lr:0.0000100 epoch_Time:25563.0min: [2024-01-05 05:14:58,515][model8_pretrain.py][INFO] Epoch:[0/2](563000/4588595) loss:2.946 lr:0.0000100 epoch_Time:25563.0min: [2024-01-05 05:14:58,515][model8_pretrain.py][INFO] Epoch:[0/2](563000/4588595) loss:2.868 lr:0.0000100 epoch_Time:25563.0min: [2024-01-05 05:14:58,515][model8_pretrain.py][INFO] Epoch:[0/2](563000/4588595) loss:2.606 lr:0.0000100 epoch_Time:25563.0min: [2024-01-05 05:15:35,487][model8_pretrain.py][INFO] Epoch:[0/2](563100/4588595) loss:2.198 lr:0.0000100 epoch_Time:25563.0min: [2024-01-05 05:15:35,487][model8_pretrain.py][INFO] Epoch:[0/2](563100/4588595) loss:3.150 lr:0.0000100 epoch_Time:25563.0min: [2024-01-05 05:15:35,487][model8_pretrain.py][INFO] Epoch:[0/2](563100/4588595) loss:3.080 lr:0.0000100 epoch_Time:25563.0min: [2024-01-05 05:15:35,487][model8_pretrain.py][INFO] Epoch:[0/2](563100/4588595) loss:3.075 lr:0.0000100 epoch_Time:25563.0min: [2024-01-05 05:15:35,487][model8_pretrain.py][INFO] Epoch:[0/2](563100/4588595) loss:3.047 lr:0.0000100 epoch_Time:25563.0min: [2024-01-05 05:15:35,487][model8_pretrain.py][INFO] Epoch:[0/2](563100/4588595) loss:2.819 lr:0.0000100 epoch_Time:25563.0min: [2024-01-05 05:15:35,488][model8_pretrain.py][INFO] Epoch:[0/2](563100/4588595) loss:3.008 lr:0.0000100 epoch_Time:25563.0min: [2024-01-05 05:15:35,488][model8_pretrain.py][INFO] Epoch:[0/2](563100/4588595) loss:2.833 lr:0.0000100 epoch_Time:25563.0min: [2024-01-05 05:16:24,351][model8_pretrain.py][INFO] Epoch:[0/2](563200/4588595) loss:2.532 lr:0.0000100 epoch_Time:25563.0min: [2024-01-05 05:16:24,351][model8_pretrain.py][INFO] Epoch:[0/2](563200/4588595) loss:3.029 lr:0.0000100 epoch_Time:25563.0min: [2024-01-05 05:16:24,351][model8_pretrain.py][INFO] Epoch:[0/2](563200/4588595) loss:2.865 lr:0.0000100 epoch_Time:25563.0min: [2024-01-05 
05:16:24,351][model8_pretrain.py][INFO] Epoch:[0/2](563200/4588595) loss:2.926 lr:0.0000100 epoch_Time:25563.0min: [2024-01-05 05:16:24,351][model8_pretrain.py][INFO] Epoch:[0/2](563200/4588595) loss:2.888 lr:0.0000100 epoch_Time:25563.0min: [2024-01-05 05:16:24,351][model8_pretrain.py][INFO] Epoch:[0/2](563200/4588595) loss:2.888 lr:0.0000100 epoch_Time:25563.0min: [2024-01-05 05:16:24,351][model8_pretrain.py][INFO] Epoch:[0/2](563200/4588595) loss:3.104 lr:0.0000100 epoch_Time:25563.0min: [2024-01-05 05:16:24,351][model8_pretrain.py][INFO] Epoch:[0/2](563200/4588595) loss:2.754 lr:0.0000100 epoch_Time:25563.0min: [2024-01-05 05:17:01,281][model8_pretrain.py][INFO] Epoch:[0/2](563300/4588595) loss:3.002 lr:0.0000100 epoch_Time:25562.0min: [2024-01-05 05:17:01,281][model8_pretrain.py][INFO] Epoch:[0/2](563300/4588595) loss:2.788 lr:0.0000100 epoch_Time:25562.0min: [2024-01-05 05:17:01,281][model8_pretrain.py][INFO] Epoch:[0/2](563300/4588595) loss:2.914 lr:0.0000100 epoch_Time:25562.0min: [2024-01-05 05:17:01,281][model8_pretrain.py][INFO] Epoch:[0/2](563300/4588595) loss:2.917 lr:0.0000100 epoch_Time:25562.0min: [2024-01-05 05:17:01,281][model8_pretrain.py][INFO] Epoch:[0/2](563300/4588595) loss:2.971 lr:0.0000100 epoch_Time:25562.0min: [2024-01-05 05:17:01,281][model8_pretrain.py][INFO] Epoch:[0/2](563300/4588595) loss:3.115 lr:0.0000100 epoch_Time:25562.0min: [2024-01-05 05:17:01,281][model8_pretrain.py][INFO] Epoch:[0/2](563300/4588595) loss:2.878 lr:0.0000100 epoch_Time:25562.0min: [2024-01-05 05:17:01,281][model8_pretrain.py][INFO] Epoch:[0/2](563300/4588595) loss:2.804 lr:0.0000100 epoch_Time:25562.0min: [2024-01-05 05:17:38,219][model8_pretrain.py][INFO] Epoch:[0/2](563400/4588595) loss:3.130 lr:0.0000100 epoch_Time:25562.0min: [2024-01-05 05:17:38,219][model8_pretrain.py][INFO] Epoch:[0/2](563400/4588595) loss:2.985 lr:0.0000100 epoch_Time:25562.0min: [2024-01-05 05:17:38,219][model8_pretrain.py][INFO] Epoch:[0/2](563400/4588595) loss:2.872 lr:0.0000100 
epoch_Time:25562.0min: [2024-01-05 05:17:38,219][model8_pretrain.py][INFO] Epoch:[0/2](563400/4588595) loss:2.653 lr:0.0000100 epoch_Time:25562.0min: [2024-01-05 05:17:38,219][model8_pretrain.py][INFO] Epoch:[0/2](563400/4588595) loss:2.723 lr:0.0000100 epoch_Time:25562.0min: [2024-01-05 05:17:38,220][model8_pretrain.py][INFO] Epoch:[0/2](563400/4588595) loss:2.727 lr:0.0000100 epoch_Time:25562.0min: [2024-01-05 05:17:38,219][model8_pretrain.py][INFO] Epoch:[0/2](563400/4588595) loss:2.681 lr:0.0000100 epoch_Time:25562.0min: [2024-01-05 05:17:38,219][model8_pretrain.py][INFO] Epoch:[0/2](563400/4588595) loss:2.749 lr:0.0000100 epoch_Time:25562.0min: [2024-01-05 05:18:15,159][model8_pretrain.py][INFO] Epoch:[0/2](563500/4588595) loss:2.624 lr:0.0000100 epoch_Time:25561.0min: [2024-01-05 05:18:15,159][model8_pretrain.py][INFO] Epoch:[0/2](563500/4588595) loss:3.200 lr:0.0000100 epoch_Time:25561.0min: [2024-01-05 05:18:15,159][model8_pretrain.py][INFO] Epoch:[0/2](563500/4588595) loss:2.731 lr:0.0000100 epoch_Time:25561.0min: [2024-01-05 05:18:15,159][model8_pretrain.py][INFO] Epoch:[0/2](563500/4588595) loss:2.433 lr:0.0000100 epoch_Time:25561.0min: [2024-01-05 05:18:15,159][model8_pretrain.py][INFO] Epoch:[0/2](563500/4588595) loss:2.808 lr:0.0000100 epoch_Time:25561.0min: [2024-01-05 05:18:15,159][model8_pretrain.py][INFO] Epoch:[0/2](563500/4588595) loss:2.836 lr:0.0000100 epoch_Time:25561.0min: [2024-01-05 05:18:15,159][model8_pretrain.py][INFO] Epoch:[0/2](563500/4588595) loss:2.513 lr:0.0000100 epoch_Time:25561.0min: [2024-01-05 05:18:15,159][model8_pretrain.py][INFO] Epoch:[0/2](563500/4588595) loss:2.825 lr:0.0000100 epoch_Time:25561.0min: [2024-01-05 05:18:52,103][model8_pretrain.py][INFO] Epoch:[0/2](563600/4588595) loss:2.748 lr:0.0000100 epoch_Time:25560.0min: [2024-01-05 05:18:52,104][model8_pretrain.py][INFO] Epoch:[0/2](563600/4588595) loss:2.504 lr:0.0000100 epoch_Time:25560.0min: [2024-01-05 05:18:52,104][model8_pretrain.py][INFO] 
Epoch:[0/2](563600/4588595) loss:3.314 lr:0.0000100 epoch_Time:25560.0min: [2024-01-05 05:18:52,104][model8_pretrain.py][INFO] Epoch:[0/2](563600/4588595) loss:3.343 lr:0.0000100 epoch_Time:25560.0min: [2024-01-05 05:18:52,104][model8_pretrain.py][INFO] Epoch:[0/2](563600/4588595) loss:2.767 lr:0.0000100 epoch_Time:25560.0min: [2024-01-05 05:18:52,104][model8_pretrain.py][INFO] Epoch:[0/2](563600/4588595) loss:2.809 lr:0.0000100 epoch_Time:25560.0min: [2024-01-05 05:18:52,104][model8_pretrain.py][INFO] Epoch:[0/2](563600/4588595) loss:2.730 lr:0.0000100 epoch_Time:25560.0min: [2024-01-05 05:18:52,104][model8_pretrain.py][INFO] Epoch:[0/2](563600/4588595) loss:2.595 lr:0.0000100 epoch_Time:25560.0min: [2024-01-05 05:19:29,034][model8_pretrain.py][INFO] Epoch:[0/2](563700/4588595) loss:2.805 lr:0.0000100 epoch_Time:25560.0min: [2024-01-05 05:19:29,034][model8_pretrain.py][INFO] Epoch:[0/2](563700/4588595) loss:2.635 lr:0.0000100 epoch_Time:25560.0min: [2024-01-05 05:19:29,034][model8_pretrain.py][INFO] Epoch:[0/2](563700/4588595) loss:2.883 lr:0.0000100 epoch_Time:25560.0min: [2024-01-05 05:19:29,034][model8_pretrain.py][INFO] Epoch:[0/2](563700/4588595) loss:3.105 lr:0.0000100 epoch_Time:25560.0min: [2024-01-05 05:19:29,034][model8_pretrain.py][INFO] Epoch:[0/2](563700/4588595) loss:2.731 lr:0.0000100 epoch_Time:25560.0min: [2024-01-05 05:19:29,034][model8_pretrain.py][INFO] Epoch:[0/2](563700/4588595) loss:2.786 lr:0.0000100 epoch_Time:25560.0min: [2024-01-05 05:19:29,034][model8_pretrain.py][INFO] Epoch:[0/2](563700/4588595) loss:2.950 lr:0.0000100 epoch_Time:25560.0min: [2024-01-05 05:19:29,034][model8_pretrain.py][INFO] Epoch:[0/2](563700/4588595) loss:3.119 lr:0.0000100 epoch_Time:25560.0min: [2024-01-05 05:20:05,967][model8_pretrain.py][INFO] Epoch:[0/2](563800/4588595) loss:2.776 lr:0.0000100 epoch_Time:25558.0min: [2024-01-05 05:20:05,967][model8_pretrain.py][INFO] Epoch:[0/2](563800/4588595) loss:2.513 lr:0.0000100 epoch_Time:25558.0min: [2024-01-05 
05:20:05,967][model8_pretrain.py][INFO] Epoch:[0/2](563800/4588595) loss:3.025 lr:0.0000100 epoch_Time:25558.0min: [2024-01-05 05:20:05,967][model8_pretrain.py][INFO] Epoch:[0/2](563800/4588595) loss:2.773 lr:0.0000100 epoch_Time:25558.0min: [2024-01-05 05:20:05,967][model8_pretrain.py][INFO] Epoch:[0/2](563800/4588595) loss:3.143 lr:0.0000100 epoch_Time:25558.0min: [2024-01-05 05:20:05,967][model8_pretrain.py][INFO] Epoch:[0/2](563800/4588595) loss:3.083 lr:0.0000100 epoch_Time:25558.0min: [2024-01-05 05:20:05,967][model8_pretrain.py][INFO] Epoch:[0/2](563800/4588595) loss:2.319 lr:0.0000100 epoch_Time:25558.0min: [2024-01-05 05:20:05,967][model8_pretrain.py][INFO] Epoch:[0/2](563800/4588595) loss:3.107 lr:0.0000100 epoch_Time:25558.0min: [2024-01-05 05:20:42,910][model8_pretrain.py][INFO] Epoch:[0/2](563900/4588595) loss:2.508 lr:0.0000100 epoch_Time:25558.0min: [2024-01-05 05:20:42,910][model8_pretrain.py][INFO] Epoch:[0/2](563900/4588595) loss:2.816 lr:0.0000100 epoch_Time:25558.0min: [2024-01-05 05:20:42,910][model8_pretrain.py][INFO] Epoch:[0/2](563900/4588595) loss:2.892 lr:0.0000100 epoch_Time:25558.0min: [2024-01-05 05:20:42,910][model8_pretrain.py][INFO] Epoch:[0/2](563900/4588595) loss:2.910 lr:0.0000100 epoch_Time:25558.0min: [2024-01-05 05:20:42,910][model8_pretrain.py][INFO] Epoch:[0/2](563900/4588595) loss:2.546 lr:0.0000100 epoch_Time:25558.0min: [2024-01-05 05:20:42,910][model8_pretrain.py][INFO] Epoch:[0/2](563900/4588595) loss:2.794 lr:0.0000100 epoch_Time:25558.0min: [2024-01-05 05:20:42,910][model8_pretrain.py][INFO] Epoch:[0/2](563900/4588595) loss:3.002 lr:0.0000100 epoch_Time:25558.0min: [2024-01-05 05:20:42,911][model8_pretrain.py][INFO] Epoch:[0/2](563900/4588595) loss:2.380 lr:0.0000100 epoch_Time:25558.0min: [2024-01-05 05:21:31,773][model8_pretrain.py][INFO] Epoch:[0/2](564000/4588595) loss:2.703 lr:0.0000100 epoch_Time:25559.0min: [2024-01-05 05:21:31,773][model8_pretrain.py][INFO] Epoch:[0/2](564000/4588595) loss:3.112 lr:0.0000100 
epoch_Time:25559.0min: [2024-01-05 05:21:31,773][model8_pretrain.py][INFO] Epoch:[0/2](564000/4588595) loss:3.159 lr:0.0000100 epoch_Time:25559.0min: [2024-01-05 05:21:31,773][model8_pretrain.py][INFO] Epoch:[0/2](564000/4588595) loss:3.115 lr:0.0000100 epoch_Time:25559.0min: [2024-01-05 05:21:31,773][model8_pretrain.py][INFO] Epoch:[0/2](564000/4588595) loss:3.516 lr:0.0000100 epoch_Time:25559.0min: [2024-01-05 05:21:31,773][model8_pretrain.py][INFO] Epoch:[0/2](564000/4588595) loss:2.828 lr:0.0000100 epoch_Time:25559.0min: [2024-01-05 05:21:31,773][model8_pretrain.py][INFO] Epoch:[0/2](564000/4588595) loss:2.905 lr:0.0000100 epoch_Time:25559.0min: [2024-01-05 05:21:31,773][model8_pretrain.py][INFO] Epoch:[0/2](564000/4588595) loss:2.346 lr:0.0000100 epoch_Time:25559.0min: [2024-01-05 05:22:08,696][model8_pretrain.py][INFO] Epoch:[0/2](564100/4588595) loss:2.704 lr:0.0000100 epoch_Time:25558.0min: [2024-01-05 05:22:08,696][model8_pretrain.py][INFO] Epoch:[0/2](564100/4588595) loss:3.043 lr:0.0000100 epoch_Time:25558.0min: [2024-01-05 05:22:08,696][model8_pretrain.py][INFO] Epoch:[0/2](564100/4588595) loss:3.370 lr:0.0000100 epoch_Time:25558.0min: [2024-01-05 05:22:08,696][model8_pretrain.py][INFO] Epoch:[0/2](564100/4588595) loss:2.745 lr:0.0000100 epoch_Time:25558.0min: [2024-01-05 05:22:08,696][model8_pretrain.py][INFO] Epoch:[0/2](564100/4588595) loss:2.856 lr:0.0000100 epoch_Time:25558.0min: [2024-01-05 05:22:08,696][model8_pretrain.py][INFO] Epoch:[0/2](564100/4588595) loss:2.748 lr:0.0000100 epoch_Time:25558.0min: [2024-01-05 05:22:08,696][model8_pretrain.py][INFO] Epoch:[0/2](564100/4588595) loss:2.698 lr:0.0000100 epoch_Time:25558.0min: [2024-01-05 05:22:08,696][model8_pretrain.py][INFO] Epoch:[0/2](564100/4588595) loss:2.960 lr:0.0000100 epoch_Time:25558.0min: [2024-01-05 05:22:45,628][model8_pretrain.py][INFO] Epoch:[0/2](564200/4588595) loss:2.584 lr:0.0000100 epoch_Time:25557.0min: [2024-01-05 05:22:45,628][model8_pretrain.py][INFO] 
Epoch:[0/2](564200/4588595) loss:3.085 lr:0.0000100 epoch_Time:25557.0min: [2024-01-05 05:22:45,628][model8_pretrain.py][INFO] Epoch:[0/2](564200/4588595) loss:3.025 lr:0.0000100 epoch_Time:25557.0min: [2024-01-05 05:22:45,628][model8_pretrain.py][INFO] Epoch:[0/2](564200/4588595) loss:2.916 lr:0.0000100 epoch_Time:25557.0min: [2024-01-05 05:22:45,628][model8_pretrain.py][INFO] Epoch:[0/2](564200/4588595) loss:3.528 lr:0.0000100 epoch_Time:25557.0min: [2024-01-05 05:22:45,628][model8_pretrain.py][INFO] Epoch:[0/2](564200/4588595) loss:3.046 lr:0.0000100 epoch_Time:25557.0min: [2024-01-05 05:22:45,628][model8_pretrain.py][INFO] Epoch:[0/2](564200/4588595) loss:3.134 lr:0.0000100 epoch_Time:25557.0min: [2024-01-05 05:22:45,628][model8_pretrain.py][INFO] Epoch:[0/2](564200/4588595) loss:3.055 lr:0.0000100 epoch_Time:25557.0min: [2024-01-05 05:23:22,564][model8_pretrain.py][INFO] Epoch:[0/2](564300/4588595) loss:2.959 lr:0.0000100 epoch_Time:25556.0min: [2024-01-05 05:23:22,564][model8_pretrain.py][INFO] Epoch:[0/2](564300/4588595) loss:3.083 lr:0.0000100 epoch_Time:25556.0min: [2024-01-05 05:23:22,564][model8_pretrain.py][INFO] Epoch:[0/2](564300/4588595) loss:2.589 lr:0.0000100 epoch_Time:25556.0min: [2024-01-05 05:23:22,564][model8_pretrain.py][INFO] Epoch:[0/2](564300/4588595) loss:3.294 lr:0.0000100 epoch_Time:25556.0min: [2024-01-05 05:23:22,564][model8_pretrain.py][INFO] Epoch:[0/2](564300/4588595) loss:3.005 lr:0.0000100 epoch_Time:25556.0min: [2024-01-05 05:23:22,564][model8_pretrain.py][INFO] Epoch:[0/2](564300/4588595) loss:2.442 lr:0.0000100 epoch_Time:25556.0min: [2024-01-05 05:23:22,564][model8_pretrain.py][INFO] Epoch:[0/2](564300/4588595) loss:2.958 lr:0.0000100 epoch_Time:25556.0min: [2024-01-05 05:23:22,564][model8_pretrain.py][INFO] Epoch:[0/2](564300/4588595) loss:2.300 lr:0.0000100 epoch_Time:25556.0min: [2024-01-05 05:23:59,493][model8_pretrain.py][INFO] Epoch:[0/2](564400/4588595) loss:2.650 lr:0.0000100 epoch_Time:25555.0min: [2024-01-05 
05:23:59,493][model8_pretrain.py][INFO] Epoch:[0/2](564400/4588595) loss:2.224 lr:0.0000100 epoch_Time:25555.0min: [2024-01-05 05:23:59,493][model8_pretrain.py][INFO] Epoch:[0/2](564400/4588595) loss:2.928 lr:0.0000100 epoch_Time:25555.0min: [2024-01-05 05:23:59,493][model8_pretrain.py][INFO] Epoch:[0/2](564400/4588595) loss:2.980 lr:0.0000100 epoch_Time:25555.0min: [2024-01-05 05:23:59,493][model8_pretrain.py][INFO] Epoch:[0/2](564400/4588595) loss:3.035 lr:0.0000100 epoch_Time:25555.0min: [2024-01-05 05:23:59,493][model8_pretrain.py][INFO] Epoch:[0/2](564400/4588595) loss:3.003 lr:0.0000100 epoch_Time:25555.0min: [2024-01-05 05:23:59,493][model8_pretrain.py][INFO] Epoch:[0/2](564400/4588595) loss:2.424 lr:0.0000100 epoch_Time:25555.0min: [2024-01-05 05:23:59,493][model8_pretrain.py][INFO] Epoch:[0/2](564400/4588595) loss:2.632 lr:0.0000100 epoch_Time:25555.0min: [2024-01-05 05:24:36,433][model8_pretrain.py][INFO] Epoch:[0/2](564500/4588595) loss:3.079 lr:0.0000100 epoch_Time:25555.0min: [2024-01-05 05:24:36,433][model8_pretrain.py][INFO] Epoch:[0/2](564500/4588595) loss:3.212 lr:0.0000100 epoch_Time:25555.0min: [2024-01-05 05:24:36,433][model8_pretrain.py][INFO] Epoch:[0/2](564500/4588595) loss:2.485 lr:0.0000100 epoch_Time:25555.0min: [2024-01-05 05:24:36,433][model8_pretrain.py][INFO] Epoch:[0/2](564500/4588595) loss:2.988 lr:0.0000100 epoch_Time:25555.0min: [2024-01-05 05:24:36,433][model8_pretrain.py][INFO] Epoch:[0/2](564500/4588595) loss:2.684 lr:0.0000100 epoch_Time:25555.0min: [2024-01-05 05:24:36,433][model8_pretrain.py][INFO] Epoch:[0/2](564500/4588595) loss:2.386 lr:0.0000100 epoch_Time:25555.0min: [2024-01-05 05:24:36,433][model8_pretrain.py][INFO] Epoch:[0/2](564500/4588595) loss:2.810 lr:0.0000100 epoch_Time:25555.0min: [2024-01-05 05:24:36,433][model8_pretrain.py][INFO] Epoch:[0/2](564500/4588595) loss:2.407 lr:0.0000100 epoch_Time:25555.0min: [2024-01-05 05:25:13,361][model8_pretrain.py][INFO] Epoch:[0/2](564600/4588595) loss:3.410 lr:0.0000100 
epoch_Time:25554.0min: [2024-01-05 05:25:13,361][model8_pretrain.py][INFO] Epoch:[0/2](564600/4588595) loss:2.975 lr:0.0000100 epoch_Time:25554.0min: [2024-01-05 05:25:13,361][model8_pretrain.py][INFO] Epoch:[0/2](564600/4588595) loss:2.664 lr:0.0000100 epoch_Time:25554.0min: [2024-01-05 05:25:13,361][model8_pretrain.py][INFO] Epoch:[0/2](564600/4588595) loss:2.675 lr:0.0000100 epoch_Time:25554.0min: [2024-01-05 05:25:13,361][model8_pretrain.py][INFO] Epoch:[0/2](564600/4588595) loss:2.863 lr:0.0000100 epoch_Time:25554.0min: [2024-01-05 05:25:13,361][model8_pretrain.py][INFO] Epoch:[0/2](564600/4588595) loss:3.334 lr:0.0000100 epoch_Time:25554.0min: [2024-01-05 05:25:13,361][model8_pretrain.py][INFO] Epoch:[0/2](564600/4588595) loss:3.255 lr:0.0000100 epoch_Time:25554.0min: [2024-01-05 05:25:13,361][model8_pretrain.py][INFO] Epoch:[0/2](564600/4588595) loss:2.810 lr:0.0000100 epoch_Time:25554.0min: [2024-01-05 05:25:50,294][model8_pretrain.py][INFO] Epoch:[0/2](564700/4588595) loss:3.324 lr:0.0000100 epoch_Time:25553.0min: [2024-01-05 05:25:50,294][model8_pretrain.py][INFO] Epoch:[0/2](564700/4588595) loss:2.646 lr:0.0000100 epoch_Time:25553.0min: [2024-01-05 05:25:50,294][model8_pretrain.py][INFO] Epoch:[0/2](564700/4588595) loss:2.677 lr:0.0000100 epoch_Time:25553.0min: [2024-01-05 05:25:50,295][model8_pretrain.py][INFO] Epoch:[0/2](564700/4588595) loss:2.741 lr:0.0000100 epoch_Time:25553.0min: [2024-01-05 05:25:50,295][model8_pretrain.py][INFO] Epoch:[0/2](564700/4588595) loss:2.694 lr:0.0000100 epoch_Time:25553.0min: [2024-01-05 05:25:50,295][model8_pretrain.py][INFO] Epoch:[0/2](564700/4588595) loss:3.003 lr:0.0000100 epoch_Time:25553.0min: [2024-01-05 05:25:50,295][model8_pretrain.py][INFO] Epoch:[0/2](564700/4588595) loss:1.935 lr:0.0000100 epoch_Time:25553.0min: [2024-01-05 05:25:50,295][model8_pretrain.py][INFO] Epoch:[0/2](564700/4588595) loss:2.760 lr:0.0000100 epoch_Time:25553.0min: [2024-01-05 05:26:39,404][model8_pretrain.py][INFO] 
Epoch:[0/2](564800/4588595) loss:2.448 lr:0.0000100 epoch_Time:25554.0min: [2024-01-05 05:26:39,404][model8_pretrain.py][INFO] Epoch:[0/2](564800/4588595) loss:2.712 lr:0.0000100 epoch_Time:25554.0min: [2024-01-05 05:26:39,404][model8_pretrain.py][INFO] Epoch:[0/2](564800/4588595) loss:2.512 lr:0.0000100 epoch_Time:25554.0min: [2024-01-05 05:26:39,404][model8_pretrain.py][INFO] Epoch:[0/2](564800/4588595) loss:2.450 lr:0.0000100 epoch_Time:25554.0min: [2024-01-05 05:26:39,405][model8_pretrain.py][INFO] Epoch:[0/2](564800/4588595) loss:2.615 lr:0.0000100 epoch_Time:25554.0min: [2024-01-05 05:26:39,404][model8_pretrain.py][INFO] Epoch:[0/2](564800/4588595) loss:2.983 lr:0.0000100 epoch_Time:25554.0min: [2024-01-05 05:26:39,405][model8_pretrain.py][INFO] Epoch:[0/2](564800/4588595) loss:2.331 lr:0.0000100 epoch_Time:25554.0min: [2024-01-05 05:26:39,405][model8_pretrain.py][INFO] Epoch:[0/2](564800/4588595) loss:2.984 lr:0.0000100 epoch_Time:25554.0min: [2024-01-05 05:27:16,337][model8_pretrain.py][INFO] Epoch:[0/2](564900/4588595) loss:3.037 lr:0.0000100 epoch_Time:25553.0min: [2024-01-05 05:27:16,337][model8_pretrain.py][INFO] Epoch:[0/2](564900/4588595) loss:2.244 lr:0.0000100 epoch_Time:25553.0min: [2024-01-05 05:27:16,337][model8_pretrain.py][INFO] Epoch:[0/2](564900/4588595) loss:3.050 lr:0.0000100 epoch_Time:25553.0min: [2024-01-05 05:27:16,337][model8_pretrain.py][INFO] Epoch:[0/2](564900/4588595) loss:2.411 lr:0.0000100 epoch_Time:25553.0min: [2024-01-05 05:27:16,337][model8_pretrain.py][INFO] Epoch:[0/2](564900/4588595) loss:2.818 lr:0.0000100 epoch_Time:25553.0min: [2024-01-05 05:27:16,337][model8_pretrain.py][INFO] Epoch:[0/2](564900/4588595) loss:2.958 lr:0.0000100 epoch_Time:25553.0min: [2024-01-05 05:27:16,337][model8_pretrain.py][INFO] Epoch:[0/2](564900/4588595) loss:2.526 lr:0.0000100 epoch_Time:25553.0min: [2024-01-05 05:27:16,337][model8_pretrain.py][INFO] Epoch:[0/2](564900/4588595) loss:2.798 lr:0.0000100 epoch_Time:25553.0min: [2024-01-05 
05:27:53,280][model8_pretrain.py][INFO] Epoch:[0/2](565000/4588595) loss:2.882 lr:0.0000100 epoch_Time:25552.0min: [2024-01-05 05:27:53,280][model8_pretrain.py][INFO] Epoch:[0/2](565000/4588595) loss:2.873 lr:0.0000100 epoch_Time:25552.0min: [2024-01-05 05:27:53,280][model8_pretrain.py][INFO] Epoch:[0/2](565000/4588595) loss:2.245 lr:0.0000100 epoch_Time:25552.0min: [2024-01-05 05:27:53,280][model8_pretrain.py][INFO] Epoch:[0/2](565000/4588595) loss:2.969 lr:0.0000100 epoch_Time:25552.0min: [2024-01-05 05:27:53,281][model8_pretrain.py][INFO] Epoch:[0/2](565000/4588595) loss:2.913 lr:0.0000100 epoch_Time:25552.0min: [2024-01-05 05:27:53,281][model8_pretrain.py][INFO] Epoch:[0/2](565000/4588595) loss:2.958 lr:0.0000100 epoch_Time:25552.0min: [2024-01-05 05:27:53,281][model8_pretrain.py][INFO] Epoch:[0/2](565000/4588595) loss:2.863 lr:0.0000100 epoch_Time:25552.0min: [2024-01-05 05:27:53,281][model8_pretrain.py][INFO] Epoch:[0/2](565000/4588595) loss:2.850 lr:0.0000100 epoch_Time:25552.0min: [2024-01-05 05:28:30,230][model8_pretrain.py][INFO] Epoch:[0/2](565100/4588595) loss:2.898 lr:0.0000100 epoch_Time:25552.0min: [2024-01-05 05:28:30,230][model8_pretrain.py][INFO] Epoch:[0/2](565100/4588595) loss:3.117 lr:0.0000100 epoch_Time:25552.0min: [2024-01-05 05:28:30,230][model8_pretrain.py][INFO] Epoch:[0/2](565100/4588595) loss:2.266 lr:0.0000100 epoch_Time:25552.0min: [2024-01-05 05:28:30,230][model8_pretrain.py][INFO] Epoch:[0/2](565100/4588595) loss:2.556 lr:0.0000100 epoch_Time:25552.0min: [2024-01-05 05:28:30,230][model8_pretrain.py][INFO] Epoch:[0/2](565100/4588595) loss:3.274 lr:0.0000100 epoch_Time:25552.0min: [2024-01-05 05:28:30,230][model8_pretrain.py][INFO] Epoch:[0/2](565100/4588595) loss:2.850 lr:0.0000100 epoch_Time:25552.0min: [2024-01-05 05:28:30,230][model8_pretrain.py][INFO] Epoch:[0/2](565100/4588595) loss:3.318 lr:0.0000100 epoch_Time:25552.0min: [2024-01-05 05:28:30,230][model8_pretrain.py][INFO] Epoch:[0/2](565100/4588595) loss:3.024 lr:0.0000100 
epoch_Time:25552.0min: [2024-01-05 05:29:07,167][model8_pretrain.py][INFO] Epoch:[0/2](565200/4588595) loss:2.805 lr:0.0000100 epoch_Time:25551.0min: [2024-01-05 05:29:07,167][model8_pretrain.py][INFO] Epoch:[0/2](565200/4588595) loss:3.072 lr:0.0000100 epoch_Time:25551.0min: [2024-01-05 05:29:07,167][model8_pretrain.py][INFO] Epoch:[0/2](565200/4588595) loss:2.784 lr:0.0000100 epoch_Time:25551.0min: [2024-01-05 05:29:07,167][model8_pretrain.py][INFO] Epoch:[0/2](565200/4588595) loss:2.829 lr:0.0000100 epoch_Time:25551.0min: [2024-01-05 05:29:07,167][model8_pretrain.py][INFO] Epoch:[0/2](565200/4588595) loss:2.803 lr:0.0000100 epoch_Time:25551.0min: [2024-01-05 05:29:07,167][model8_pretrain.py][INFO] Epoch:[0/2](565200/4588595) loss:2.795 lr:0.0000100 epoch_Time:25551.0min: [2024-01-05 05:29:07,167][model8_pretrain.py][INFO] Epoch:[0/2](565200/4588595) loss:2.908 lr:0.0000100 epoch_Time:25551.0min: [2024-01-05 05:29:07,167][model8_pretrain.py][INFO] Epoch:[0/2](565200/4588595) loss:2.545 lr:0.0000100 epoch_Time:25551.0min: [2024-01-05 05:29:44,098][model8_pretrain.py][INFO] Epoch:[0/2](565300/4588595) loss:2.764 lr:0.0000100 epoch_Time:25550.0min: [2024-01-05 05:29:44,098][model8_pretrain.py][INFO] Epoch:[0/2](565300/4588595) loss:3.373 lr:0.0000100 epoch_Time:25550.0min: [2024-01-05 05:29:44,098][model8_pretrain.py][INFO] Epoch:[0/2](565300/4588595) loss:2.699 lr:0.0000100 epoch_Time:25550.0min: [2024-01-05 05:29:44,098][model8_pretrain.py][INFO] Epoch:[0/2](565300/4588595) loss:3.025 lr:0.0000100 epoch_Time:25550.0min: [2024-01-05 05:29:44,098][model8_pretrain.py][INFO] Epoch:[0/2](565300/4588595) loss:2.938 lr:0.0000100 epoch_Time:25550.0min: [2024-01-05 05:29:44,098][model8_pretrain.py][INFO] Epoch:[0/2](565300/4588595) loss:2.360 lr:0.0000100 epoch_Time:25550.0min: [2024-01-05 05:29:44,098][model8_pretrain.py][INFO] Epoch:[0/2](565300/4588595) loss:2.808 lr:0.0000100 epoch_Time:25550.0min: [2024-01-05 05:29:44,098][model8_pretrain.py][INFO] 
Epoch:[0/2](565300/4588595) loss:2.796 lr:0.0000100 epoch_Time:25550.0min: [2024-01-05 05:30:21,045][model8_pretrain.py][INFO] Epoch:[0/2](565400/4588595) loss:2.865 lr:0.0000100 epoch_Time:25549.0min: [2024-01-05 05:30:21,045][model8_pretrain.py][INFO] Epoch:[0/2](565400/4588595) loss:3.329 lr:0.0000100 epoch_Time:25549.0min: [2024-01-05 05:30:21,045][model8_pretrain.py][INFO] Epoch:[0/2](565400/4588595) loss:2.512 lr:0.0000100 epoch_Time:25549.0min: [2024-01-05 05:30:21,045][model8_pretrain.py][INFO] Epoch:[0/2](565400/4588595) loss:3.242 lr:0.0000100 epoch_Time:25549.0min: [2024-01-05 05:30:21,045][model8_pretrain.py][INFO] Epoch:[0/2](565400/4588595) loss:2.871 lr:0.0000100 epoch_Time:25549.0min: [2024-01-05 05:30:21,045][model8_pretrain.py][INFO] Epoch:[0/2](565400/4588595) loss:3.154 lr:0.0000100 epoch_Time:25549.0min: [2024-01-05 05:30:21,045][model8_pretrain.py][INFO] Epoch:[0/2](565400/4588595) loss:3.019 lr:0.0000100 epoch_Time:25549.0min: [2024-01-05 05:30:21,045][model8_pretrain.py][INFO] Epoch:[0/2](565400/4588595) loss:2.973 lr:0.0000100 epoch_Time:25549.0min: [2024-01-05 05:30:57,978][model8_pretrain.py][INFO] Epoch:[0/2](565500/4588595) loss:2.910 lr:0.0000100 epoch_Time:25548.0min: [2024-01-05 05:30:57,978][model8_pretrain.py][INFO] Epoch:[0/2](565500/4588595) loss:3.590 lr:0.0000100 epoch_Time:25548.0min: [2024-01-05 05:30:57,978][model8_pretrain.py][INFO] Epoch:[0/2](565500/4588595) loss:2.694 lr:0.0000100 epoch_Time:25548.0min: [2024-01-05 05:30:57,978][model8_pretrain.py][INFO] Epoch:[0/2](565500/4588595) loss:2.805 lr:0.0000100 epoch_Time:25548.0min: [2024-01-05 05:30:57,978][model8_pretrain.py][INFO] Epoch:[0/2](565500/4588595) loss:3.049 lr:0.0000100 epoch_Time:25548.0min: [2024-01-05 05:30:57,978][model8_pretrain.py][INFO] Epoch:[0/2](565500/4588595) loss:3.035 lr:0.0000100 epoch_Time:25548.0min: [2024-01-05 05:30:57,978][model8_pretrain.py][INFO] Epoch:[0/2](565500/4588595) loss:2.169 lr:0.0000100 epoch_Time:25548.0min: [2024-01-05 
05:30:57,978][model8_pretrain.py][INFO] Epoch:[0/2](565500/4588595) loss:2.653 lr:0.0000100 epoch_Time:25548.0min: [2024-01-05 05:31:47,164][model8_pretrain.py][INFO] Epoch:[0/2](565600/4588595) loss:3.010 lr:0.0000100 epoch_Time:25550.0min: [2024-01-05 05:31:47,164][model8_pretrain.py][INFO] Epoch:[0/2](565600/4588595) loss:3.046 lr:0.0000100 epoch_Time:25550.0min: [2024-01-05 05:31:47,164][model8_pretrain.py][INFO] Epoch:[0/2](565600/4588595) loss:3.377 lr:0.0000100 epoch_Time:25550.0min: [2024-01-05 05:31:47,164][model8_pretrain.py][INFO] Epoch:[0/2](565600/4588595) loss:2.630 lr:0.0000100 epoch_Time:25550.0min: [2024-01-05 05:31:47,164][model8_pretrain.py][INFO] Epoch:[0/2](565600/4588595) loss:2.848 lr:0.0000100 epoch_Time:25550.0min: [2024-01-05 05:31:47,164][model8_pretrain.py][INFO] Epoch:[0/2](565600/4588595) loss:3.082 lr:0.0000100 epoch_Time:25550.0min: [2024-01-05 05:31:47,164][model8_pretrain.py][INFO] Epoch:[0/2](565600/4588595) loss:3.029 lr:0.0000100 epoch_Time:25550.0min: [2024-01-05 05:31:47,165][model8_pretrain.py][INFO] Epoch:[0/2](565600/4588595) loss:2.846 lr:0.0000100 epoch_Time:25550.0min: [2024-01-05 05:32:24,087][model8_pretrain.py][INFO] Epoch:[0/2](565700/4588595) loss:3.163 lr:0.0000100 epoch_Time:25548.0min: [2024-01-05 05:32:24,087][model8_pretrain.py][INFO] Epoch:[0/2](565700/4588595) loss:2.869 lr:0.0000100 epoch_Time:25548.0min: [2024-01-05 05:32:24,087][model8_pretrain.py][INFO] Epoch:[0/2](565700/4588595) loss:2.957 lr:0.0000100 epoch_Time:25548.0min: [2024-01-05 05:32:24,087][model8_pretrain.py][INFO] Epoch:[0/2](565700/4588595) loss:3.076 lr:0.0000100 epoch_Time:25548.0min: [2024-01-05 05:32:24,087][model8_pretrain.py][INFO] Epoch:[0/2](565700/4588595) loss:3.295 lr:0.0000100 epoch_Time:25548.0min: [2024-01-05 05:32:24,087][model8_pretrain.py][INFO] Epoch:[0/2](565700/4588595) loss:2.785 lr:0.0000100 epoch_Time:25548.0min: [2024-01-05 05:32:24,087][model8_pretrain.py][INFO] Epoch:[0/2](565700/4588595) loss:3.017 lr:0.0000100 
epoch_Time:25548.0min: [2024-01-05 05:32:24,087][model8_pretrain.py][INFO] Epoch:[0/2](565700/4588595) loss:3.446 lr:0.0000100 epoch_Time:25548.0min: [2024-01-05 05:33:01,018][model8_pretrain.py][INFO] Epoch:[0/2](565800/4588595) loss:2.930 lr:0.0000100 epoch_Time:25547.0min: [2024-01-05 05:33:01,018][model8_pretrain.py][INFO] Epoch:[0/2](565800/4588595) loss:2.982 lr:0.0000100 epoch_Time:25547.0min: [2024-01-05 05:33:01,018][model8_pretrain.py][INFO] Epoch:[0/2](565800/4588595) loss:2.846 lr:0.0000100 epoch_Time:25547.0min: [2024-01-05 05:33:01,018][model8_pretrain.py][INFO] Epoch:[0/2](565800/4588595) loss:2.835 lr:0.0000100 epoch_Time:25547.0min: [2024-01-05 05:33:01,018][model8_pretrain.py][INFO] Epoch:[0/2](565800/4588595) loss:3.064 lr:0.0000100 epoch_Time:25547.0min: [2024-01-05 05:33:01,018][model8_pretrain.py][INFO] Epoch:[0/2](565800/4588595) loss:2.864 lr:0.0000100 epoch_Time:25547.0min: [2024-01-05 05:33:01,018][model8_pretrain.py][INFO] Epoch:[0/2](565800/4588595) loss:2.570 lr:0.0000100 epoch_Time:25547.0min: [2024-01-05 05:33:01,018][model8_pretrain.py][INFO] Epoch:[0/2](565800/4588595) loss:3.183 lr:0.0000100 epoch_Time:25547.0min: [2024-01-05 05:33:37,940][model8_pretrain.py][INFO] Epoch:[0/2](565900/4588595) loss:2.937 lr:0.0000100 epoch_Time:25547.0min: [2024-01-05 05:33:37,940][model8_pretrain.py][INFO] Epoch:[0/2](565900/4588595) loss:3.337 lr:0.0000100 epoch_Time:25547.0min: [2024-01-05 05:33:37,940][model8_pretrain.py][INFO] Epoch:[0/2](565900/4588595) loss:2.851 lr:0.0000100 epoch_Time:25547.0min: [2024-01-05 05:33:37,940][model8_pretrain.py][INFO] Epoch:[0/2](565900/4588595) loss:2.780 lr:0.0000100 epoch_Time:25547.0min: [2024-01-05 05:33:37,940][model8_pretrain.py][INFO] Epoch:[0/2](565900/4588595) loss:2.791 lr:0.0000100 epoch_Time:25547.0min: [2024-01-05 05:33:37,940][model8_pretrain.py][INFO] Epoch:[0/2](565900/4588595) loss:2.735 lr:0.0000100 epoch_Time:25547.0min: [2024-01-05 05:33:37,940][model8_pretrain.py][INFO] 
Epoch:[0/2](565900/4588595) loss:2.880 lr:0.0000100 epoch_Time:25547.0min: [2024-01-05 05:33:37,940][model8_pretrain.py][INFO] Epoch:[0/2](565900/4588595) loss:2.848 lr:0.0000100 epoch_Time:25547.0min: [2024-01-05 05:34:14,869][model8_pretrain.py][INFO] Epoch:[0/2](566000/4588595) loss:2.766 lr:0.0000100 epoch_Time:25546.0min: [2024-01-05 05:34:14,870][model8_pretrain.py][INFO] Epoch:[0/2](566000/4588595) loss:2.502 lr:0.0000100 epoch_Time:25546.0min: [2024-01-05 05:34:14,870][model8_pretrain.py][INFO] Epoch:[0/2](566000/4588595) loss:2.644 lr:0.0000100 epoch_Time:25546.0min: [2024-01-05 05:34:14,870][model8_pretrain.py][INFO] Epoch:[0/2](566000/4588595) loss:2.323 lr:0.0000100 epoch_Time:25546.0min: [2024-01-05 05:34:14,870][model8_pretrain.py][INFO] Epoch:[0/2](566000/4588595) loss:2.974 lr:0.0000100 epoch_Time:25546.0min: [2024-01-05 05:34:14,870][model8_pretrain.py][INFO] Epoch:[0/2](566000/4588595) loss:2.952 lr:0.0000100 epoch_Time:25546.0min: [2024-01-05 05:34:14,870][model8_pretrain.py][INFO] Epoch:[0/2](566000/4588595) loss:3.074 lr:0.0000100 epoch_Time:25546.0min: [2024-01-05 05:34:14,870][model8_pretrain.py][INFO] Epoch:[0/2](566000/4588595) loss:2.883 lr:0.0000100 epoch_Time:25546.0min: [2024-01-05 05:34:51,796][model8_pretrain.py][INFO] Epoch:[0/2](566100/4588595) loss:2.310 lr:0.0000100 epoch_Time:25545.0min: [2024-01-05 05:34:51,796][model8_pretrain.py][INFO] Epoch:[0/2](566100/4588595) loss:2.548 lr:0.0000100 epoch_Time:25545.0min: [2024-01-05 05:34:51,796][model8_pretrain.py][INFO] Epoch:[0/2](566100/4588595) loss:3.206 lr:0.0000100 epoch_Time:25545.0min: [2024-01-05 05:34:51,796][model8_pretrain.py][INFO] Epoch:[0/2](566100/4588595) loss:2.603 lr:0.0000100 epoch_Time:25545.0min: [2024-01-05 05:34:51,796][model8_pretrain.py][INFO] Epoch:[0/2](566100/4588595) loss:3.053 lr:0.0000100 epoch_Time:25545.0min: [2024-01-05 05:34:51,796][model8_pretrain.py][INFO] Epoch:[0/2](566100/4588595) loss:3.367 lr:0.0000100 epoch_Time:25545.0min: [2024-01-05 
05:34:51,796][model8_pretrain.py][INFO] Epoch:[0/2](566100/4588595) loss:2.932 lr:0.0000100 epoch_Time:25545.0min: [2024-01-05 05:34:51,796][model8_pretrain.py][INFO] Epoch:[0/2](566100/4588595) loss:3.063 lr:0.0000100 epoch_Time:25545.0min: [2024-01-05 05:35:28,737][model8_pretrain.py][INFO] Epoch:[0/2](566200/4588595) loss:1.566 lr:0.0000100 epoch_Time:25545.0min: [2024-01-05 05:35:28,737][model8_pretrain.py][INFO] Epoch:[0/2](566200/4588595) loss:2.507 lr:0.0000100 epoch_Time:25545.0min: [2024-01-05 05:35:28,737][model8_pretrain.py][INFO] Epoch:[0/2](566200/4588595) loss:2.834 lr:0.0000100 epoch_Time:25545.0min: [2024-01-05 05:35:28,737][model8_pretrain.py][INFO] Epoch:[0/2](566200/4588595) loss:3.164 lr:0.0000100 epoch_Time:25545.0min: [2024-01-05 05:35:28,737][model8_pretrain.py][INFO] Epoch:[0/2](566200/4588595) loss:2.976 lr:0.0000100 epoch_Time:25545.0min: [2024-01-05 05:35:28,737][model8_pretrain.py][INFO] Epoch:[0/2](566200/4588595) loss:2.888 lr:0.0000100 epoch_Time:25545.0min: [2024-01-05 05:35:28,737][model8_pretrain.py][INFO] Epoch:[0/2](566200/4588595) loss:2.620 lr:0.0000100 epoch_Time:25545.0min: [2024-01-05 05:35:28,737][model8_pretrain.py][INFO] Epoch:[0/2](566200/4588595) loss:2.868 lr:0.0000100 epoch_Time:25545.0min: [2024-01-05 05:36:05,661][model8_pretrain.py][INFO] Epoch:[0/2](566300/4588595) loss:2.970 lr:0.0000100 epoch_Time:25543.0min: [2024-01-05 05:36:05,661][model8_pretrain.py][INFO] Epoch:[0/2](566300/4588595) loss:2.869 lr:0.0000100 epoch_Time:25543.0min: [2024-01-05 05:36:05,661][model8_pretrain.py][INFO] Epoch:[0/2](566300/4588595) loss:2.454 lr:0.0000100 epoch_Time:25543.0min: [2024-01-05 05:36:05,661][model8_pretrain.py][INFO] Epoch:[0/2](566300/4588595) loss:2.581 lr:0.0000100 epoch_Time:25543.0min: [2024-01-05 05:36:05,661][model8_pretrain.py][INFO] Epoch:[0/2](566300/4588595) loss:3.002 lr:0.0000100 epoch_Time:25543.0min: [2024-01-05 05:36:05,661][model8_pretrain.py][INFO] Epoch:[0/2](566300/4588595) loss:3.144 lr:0.0000100 
epoch_Time:25543.0min: [2024-01-05 05:36:05,661][model8_pretrain.py][INFO] Epoch:[0/2](566300/4588595) loss:2.911 lr:0.0000100 epoch_Time:25543.0min: [2024-01-05 05:36:05,662][model8_pretrain.py][INFO] Epoch:[0/2](566300/4588595) loss:2.444 lr:0.0000100 epoch_Time:25543.0min: [2024-01-05 05:36:54,760][model8_pretrain.py][INFO] Epoch:[0/2](566400/4588595) loss:2.322 lr:0.0000100 epoch_Time:25544.0min: [2024-01-05 05:36:54,760][model8_pretrain.py][INFO] Epoch:[0/2](566400/4588595) loss:3.256 lr:0.0000100 epoch_Time:25544.0min: [2024-01-05 05:36:54,760][model8_pretrain.py][INFO] Epoch:[0/2](566400/4588595) loss:2.704 lr:0.0000100 epoch_Time:25544.0min: [2024-01-05 05:36:54,760][model8_pretrain.py][INFO] Epoch:[0/2](566400/4588595) loss:2.635 lr:0.0000100 epoch_Time:25544.0min: [2024-01-05 05:36:54,760][model8_pretrain.py][INFO] Epoch:[0/2](566400/4588595) loss:2.499 lr:0.0000100 epoch_Time:25544.0min: [2024-01-05 05:36:54,760][model8_pretrain.py][INFO] Epoch:[0/2](566400/4588595) loss:2.633 lr:0.0000100 epoch_Time:25544.0min: [2024-01-05 05:36:54,760][model8_pretrain.py][INFO] Epoch:[0/2](566400/4588595) loss:2.764 lr:0.0000100 epoch_Time:25544.0min: [2024-01-05 05:36:54,760][model8_pretrain.py][INFO] Epoch:[0/2](566400/4588595) loss:2.522 lr:0.0000100 epoch_Time:25544.0min: [2024-01-05 05:37:31,696][model8_pretrain.py][INFO] Epoch:[0/2](566500/4588595) loss:2.170 lr:0.0000100 epoch_Time:25544.0min: [2024-01-05 05:37:31,696][model8_pretrain.py][INFO] Epoch:[0/2](566500/4588595) loss:2.648 lr:0.0000100 epoch_Time:25544.0min: [2024-01-05 05:37:31,696][model8_pretrain.py][INFO] Epoch:[0/2](566500/4588595) loss:2.760 lr:0.0000100 epoch_Time:25544.0min: [2024-01-05 05:37:31,696][model8_pretrain.py][INFO] Epoch:[0/2](566500/4588595) loss:2.885 lr:0.0000100 epoch_Time:25544.0min: [2024-01-05 05:37:31,696][model8_pretrain.py][INFO] Epoch:[0/2](566500/4588595) loss:2.649 lr:0.0000100 epoch_Time:25544.0min: [2024-01-05 05:37:31,696][model8_pretrain.py][INFO] 
Epoch:[0/2](566500/4588595) loss:2.565 lr:0.0000100 epoch_Time:25544.0min: [2024-01-05 05:37:31,696][model8_pretrain.py][INFO] Epoch:[0/2](566500/4588595) loss:3.179 lr:0.0000100 epoch_Time:25544.0min: [2024-01-05 05:37:31,697][model8_pretrain.py][INFO] Epoch:[0/2](566500/4588595) loss:2.757 lr:0.0000100 epoch_Time:25544.0min: [2024-01-05 05:38:08,635][model8_pretrain.py][INFO] Epoch:[0/2](566600/4588595) loss:3.130 lr:0.0000100 epoch_Time:25543.0min: [2024-01-05 05:38:08,635][model8_pretrain.py][INFO] Epoch:[0/2](566600/4588595) loss:2.900 lr:0.0000100 epoch_Time:25543.0min: [2024-01-05 05:38:08,636][model8_pretrain.py][INFO] Epoch:[0/2](566600/4588595) loss:2.631 lr:0.0000100 epoch_Time:25543.0min: [2024-01-05 05:38:08,636][model8_pretrain.py][INFO] Epoch:[0/2](566600/4588595) loss:3.151 lr:0.0000100 epoch_Time:25543.0min: [2024-01-05 05:38:08,636][model8_pretrain.py][INFO] Epoch:[0/2](566600/4588595) loss:3.306 lr:0.0000100 epoch_Time:25543.0min: [2024-01-05 05:38:08,636][model8_pretrain.py][INFO] Epoch:[0/2](566600/4588595) loss:2.285 lr:0.0000100 epoch_Time:25543.0min: [2024-01-05 05:38:08,636][model8_pretrain.py][INFO] Epoch:[0/2](566600/4588595) loss:2.714 lr:0.0000100 epoch_Time:25543.0min: [2024-01-05 05:38:08,636][model8_pretrain.py][INFO] Epoch:[0/2](566600/4588595) loss:2.997 lr:0.0000100 epoch_Time:25543.0min: [2024-01-05 05:38:45,572][model8_pretrain.py][INFO] Epoch:[0/2](566700/4588595) loss:2.859 lr:0.0000100 epoch_Time:25542.0min: [2024-01-05 05:38:45,572][model8_pretrain.py][INFO] Epoch:[0/2](566700/4588595) loss:2.649 lr:0.0000100 epoch_Time:25542.0min: [2024-01-05 05:38:45,572][model8_pretrain.py][INFO] Epoch:[0/2](566700/4588595) loss:2.622 lr:0.0000100 epoch_Time:25542.0min: [2024-01-05 05:38:45,573][model8_pretrain.py][INFO] Epoch:[0/2](566700/4588595) loss:2.638 lr:0.0000100 epoch_Time:25542.0min: [2024-01-05 05:38:45,573][model8_pretrain.py][INFO] Epoch:[0/2](566700/4588595) loss:2.708 lr:0.0000100 epoch_Time:25542.0min: [2024-01-05 
05:38:45,573][model8_pretrain.py][INFO] Epoch:[0/2](566700/4588595) loss:2.233 lr:0.0000100 epoch_Time:25542.0min: [2024-01-05 05:38:45,573][model8_pretrain.py][INFO] Epoch:[0/2](566700/4588595) loss:3.565 lr:0.0000100 epoch_Time:25542.0min: [2024-01-05 05:38:45,573][model8_pretrain.py][INFO] Epoch:[0/2](566700/4588595) loss:3.046 lr:0.0000100 epoch_Time:25542.0min: [2024-01-05 05:39:22,518][model8_pretrain.py][INFO] Epoch:[0/2](566800/4588595) loss:2.489 lr:0.0000100 epoch_Time:25541.0min: [2024-01-05 05:39:22,518][model8_pretrain.py][INFO] Epoch:[0/2](566800/4588595) loss:3.169 lr:0.0000100 epoch_Time:25541.0min: [2024-01-05 05:39:22,518][model8_pretrain.py][INFO] Epoch:[0/2](566800/4588595) loss:2.350 lr:0.0000100 epoch_Time:25541.0min: [2024-01-05 05:39:22,518][model8_pretrain.py][INFO] Epoch:[0/2](566800/4588595) loss:3.022 lr:0.0000100 epoch_Time:25541.0min: [2024-01-05 05:39:22,518][model8_pretrain.py][INFO] Epoch:[0/2](566800/4588595) loss:2.616 lr:0.0000100 epoch_Time:25541.0min: [2024-01-05 05:39:22,518][model8_pretrain.py][INFO] Epoch:[0/2](566800/4588595) loss:3.045 lr:0.0000100 epoch_Time:25541.0min: [2024-01-05 05:39:22,518][model8_pretrain.py][INFO] Epoch:[0/2](566800/4588595) loss:3.235 lr:0.0000100 epoch_Time:25541.0min: [2024-01-05 05:39:22,519][model8_pretrain.py][INFO] Epoch:[0/2](566800/4588595) loss:2.694 lr:0.0000100 epoch_Time:25541.0min: [2024-01-05 05:39:59,455][model8_pretrain.py][INFO] Epoch:[0/2](566900/4588595) loss:2.231 lr:0.0000100 epoch_Time:25540.0min: [2024-01-05 05:39:59,455][model8_pretrain.py][INFO] Epoch:[0/2](566900/4588595) loss:2.391 lr:0.0000100 epoch_Time:25540.0min: [2024-01-05 05:39:59,455][model8_pretrain.py][INFO] Epoch:[0/2](566900/4588595) loss:2.361 lr:0.0000100 epoch_Time:25540.0min: [2024-01-05 05:39:59,455][model8_pretrain.py][INFO] Epoch:[0/2](566900/4588595) loss:2.890 lr:0.0000100 epoch_Time:25540.0min: [2024-01-05 05:39:59,455][model8_pretrain.py][INFO] Epoch:[0/2](566900/4588595) loss:2.774 lr:0.0000100 
epoch_Time:25540.0min: [2024-01-05 05:39:59,455][model8_pretrain.py][INFO] Epoch:[0/2](566900/4588595) loss:2.386 lr:0.0000100 epoch_Time:25540.0min: [2024-01-05 05:39:59,456][model8_pretrain.py][INFO] Epoch:[0/2](566900/4588595) loss:2.719 lr:0.0000100 epoch_Time:25540.0min: [2024-01-05 05:39:59,456][model8_pretrain.py][INFO] Epoch:[0/2](566900/4588595) loss:3.022 lr:0.0000100 epoch_Time:25540.0min: [2024-01-05 05:40:36,393][model8_pretrain.py][INFO] Epoch:[0/2](567000/4588595) loss:3.522 lr:0.0000100 epoch_Time:25540.0min: [2024-01-05 05:40:36,393][model8_pretrain.py][INFO] Epoch:[0/2](567000/4588595) loss:3.135 lr:0.0000100 epoch_Time:25540.0min: [2024-01-05 05:40:36,393][model8_pretrain.py][INFO] Epoch:[0/2](567000/4588595) loss:2.233 lr:0.0000100 epoch_Time:25540.0min: [2024-01-05 05:40:36,393][model8_pretrain.py][INFO] Epoch:[0/2](567000/4588595) loss:2.659 lr:0.0000100 epoch_Time:25540.0min: [2024-01-05 05:40:36,393][model8_pretrain.py][INFO] Epoch:[0/2](567000/4588595) loss:2.675 lr:0.0000100 epoch_Time:25540.0min: [2024-01-05 05:40:36,393][model8_pretrain.py][INFO] Epoch:[0/2](567000/4588595) loss:2.461 lr:0.0000100 epoch_Time:25540.0min: [2024-01-05 05:40:36,393][model8_pretrain.py][INFO] Epoch:[0/2](567000/4588595) loss:2.665 lr:0.0000100 epoch_Time:25540.0min: [2024-01-05 05:40:36,394][model8_pretrain.py][INFO] Epoch:[0/2](567000/4588595) loss:2.977 lr:0.0000100 epoch_Time:25540.0min: [2024-01-05 05:41:13,300][model8_pretrain.py][INFO] Epoch:[0/2](567100/4588595) loss:2.970 lr:0.0000100 epoch_Time:25539.0min: [2024-01-05 05:41:13,300][model8_pretrain.py][INFO] Epoch:[0/2](567100/4588595) loss:2.687 lr:0.0000100 epoch_Time:25539.0min: [2024-01-05 05:41:13,300][model8_pretrain.py][INFO] Epoch:[0/2](567100/4588595) loss:2.995 lr:0.0000100 epoch_Time:25539.0min: [2024-01-05 05:41:13,300][model8_pretrain.py][INFO] Epoch:[0/2](567100/4588595) loss:2.663 lr:0.0000100 epoch_Time:25539.0min: [2024-01-05 05:41:13,300][model8_pretrain.py][INFO] 
Epoch:[0/2](567100/4588595) loss:2.967 lr:0.0000100 epoch_Time:25539.0min: [2024-01-05 05:41:13,300][model8_pretrain.py][INFO] Epoch:[0/2](567100/4588595) loss:3.403 lr:0.0000100 epoch_Time:25539.0min: [2024-01-05 05:41:13,300][model8_pretrain.py][INFO] Epoch:[0/2](567100/4588595) loss:3.036 lr:0.0000100 epoch_Time:25539.0min: [2024-01-05 05:41:13,300][model8_pretrain.py][INFO] Epoch:[0/2](567100/4588595) loss:3.150 lr:0.0000100 epoch_Time:25539.0min: [2024-01-05 05:42:02,206][model8_pretrain.py][INFO] Epoch:[0/2](567200/4588595) loss:2.471 lr:0.0000100 epoch_Time:25539.0min: [2024-01-05 05:42:02,206][model8_pretrain.py][INFO] Epoch:[0/2](567200/4588595) loss:2.862 lr:0.0000100 epoch_Time:25539.0min: [2024-01-05 05:42:02,206][model8_pretrain.py][INFO] Epoch:[0/2](567200/4588595) loss:2.617 lr:0.0000100 epoch_Time:25539.0min: [2024-01-05 05:42:02,206][model8_pretrain.py][INFO] Epoch:[0/2](567200/4588595) loss:3.058 lr:0.0000100 epoch_Time:25539.0min: [2024-01-05 05:42:02,206][model8_pretrain.py][INFO] Epoch:[0/2](567200/4588595) loss:2.529 lr:0.0000100 epoch_Time:25539.0min: [2024-01-05 05:42:02,206][model8_pretrain.py][INFO] Epoch:[0/2](567200/4588595) loss:3.185 lr:0.0000100 epoch_Time:25539.0min: [2024-01-05 05:42:02,206][model8_pretrain.py][INFO] Epoch:[0/2](567200/4588595) loss:2.874 lr:0.0000100 epoch_Time:25539.0min: [2024-01-05 05:42:02,206][model8_pretrain.py][INFO] Epoch:[0/2](567200/4588595) loss:2.765 lr:0.0000100 epoch_Time:25539.0min: [2024-01-05 05:42:39,140][model8_pretrain.py][INFO] Epoch:[0/2](567300/4588595) loss:2.031 lr:0.0000100 epoch_Time:25539.0min: [2024-01-05 05:42:39,141][model8_pretrain.py][INFO] Epoch:[0/2](567300/4588595) loss:3.116 lr:0.0000100 epoch_Time:25539.0min: [2024-01-05 05:42:39,140][model8_pretrain.py][INFO] Epoch:[0/2](567300/4588595) loss:2.830 lr:0.0000100 epoch_Time:25539.0min: [2024-01-05 05:42:39,141][model8_pretrain.py][INFO] Epoch:[0/2](567300/4588595) loss:3.022 lr:0.0000100 epoch_Time:25539.0min: [2024-01-05 
05:42:39,141][model8_pretrain.py][INFO] Epoch:[0/2](567300/4588595) loss:3.336 lr:0.0000100 epoch_Time:25539.0min: [2024-01-05 05:42:39,141][model8_pretrain.py][INFO] Epoch:[0/2](567300/4588595) loss:2.826 lr:0.0000100 epoch_Time:25539.0min: [2024-01-05 05:42:39,141][model8_pretrain.py][INFO] Epoch:[0/2](567300/4588595) loss:2.866 lr:0.0000100 epoch_Time:25539.0min: [2024-01-05 05:42:39,142][model8_pretrain.py][INFO] Epoch:[0/2](567300/4588595) loss:3.195 lr:0.0000100 epoch_Time:25539.0min: [2024-01-05 05:43:16,084][model8_pretrain.py][INFO] Epoch:[0/2](567400/4588595) loss:2.975 lr:0.0000100 epoch_Time:25538.0min: [2024-01-05 05:43:16,084][model8_pretrain.py][INFO] Epoch:[0/2](567400/4588595) loss:2.701 lr:0.0000100 epoch_Time:25538.0min: [2024-01-05 05:43:16,084][model8_pretrain.py][INFO] Epoch:[0/2](567400/4588595) loss:3.052 lr:0.0000100 epoch_Time:25538.0min: [2024-01-05 05:43:16,084][model8_pretrain.py][INFO] Epoch:[0/2](567400/4588595) loss:2.489 lr:0.0000100 epoch_Time:25538.0min: [2024-01-05 05:43:16,084][model8_pretrain.py][INFO] Epoch:[0/2](567400/4588595) loss:2.998 lr:0.0000100 epoch_Time:25538.0min: [2024-01-05 05:43:16,084][model8_pretrain.py][INFO] Epoch:[0/2](567400/4588595) loss:2.497 lr:0.0000100 epoch_Time:25538.0min: [2024-01-05 05:43:16,084][model8_pretrain.py][INFO] Epoch:[0/2](567400/4588595) loss:3.117 lr:0.0000100 epoch_Time:25538.0min: [2024-01-05 05:43:16,085][model8_pretrain.py][INFO] Epoch:[0/2](567400/4588595) loss:3.069 lr:0.0000100 epoch_Time:25538.0min: [2024-01-05 05:43:53,023][model8_pretrain.py][INFO] Epoch:[0/2](567500/4588595) loss:2.854 lr:0.0000100 epoch_Time:25537.0min: [2024-01-05 05:43:53,023][model8_pretrain.py][INFO] Epoch:[0/2](567500/4588595) loss:2.742 lr:0.0000100 epoch_Time:25537.0min: [2024-01-05 05:43:53,023][model8_pretrain.py][INFO] Epoch:[0/2](567500/4588595) loss:2.865 lr:0.0000100 epoch_Time:25537.0min: [2024-01-05 05:43:53,023][model8_pretrain.py][INFO] Epoch:[0/2](567500/4588595) loss:2.860 lr:0.0000100 
epoch_Time:25537.0min: [2024-01-05 05:43:53,023][model8_pretrain.py][INFO] Epoch:[0/2](567500/4588595) loss:2.760 lr:0.0000100 epoch_Time:25537.0min: [2024-01-05 05:43:53,023][model8_pretrain.py][INFO] Epoch:[0/2](567500/4588595) loss:3.333 lr:0.0000100 epoch_Time:25537.0min: [2024-01-05 05:43:53,023][model8_pretrain.py][INFO] Epoch:[0/2](567500/4588595) loss:3.044 lr:0.0000100 epoch_Time:25537.0min: [2024-01-05 05:43:53,024][model8_pretrain.py][INFO] Epoch:[0/2](567500/4588595) loss:2.731 lr:0.0000100 epoch_Time:25537.0min: [2024-01-05 05:44:29,973][model8_pretrain.py][INFO] Epoch:[0/2](567600/4588595) loss:3.041 lr:0.0000100 epoch_Time:25537.0min: [2024-01-05 05:44:29,973][model8_pretrain.py][INFO] Epoch:[0/2](567600/4588595) loss:3.466 lr:0.0000100 epoch_Time:25537.0min: [2024-01-05 05:44:29,973][model8_pretrain.py][INFO] Epoch:[0/2](567600/4588595) loss:2.215 lr:0.0000100 epoch_Time:25537.0min: [2024-01-05 05:44:29,973][model8_pretrain.py][INFO] Epoch:[0/2](567600/4588595) loss:2.981 lr:0.0000100 epoch_Time:25537.0min: [2024-01-05 05:44:29,973][model8_pretrain.py][INFO] Epoch:[0/2](567600/4588595) loss:3.300 lr:0.0000100 epoch_Time:25537.0min: [2024-01-05 05:44:29,973][model8_pretrain.py][INFO] Epoch:[0/2](567600/4588595) loss:3.200 lr:0.0000100 epoch_Time:25537.0min: [2024-01-05 05:44:29,973][model8_pretrain.py][INFO] Epoch:[0/2](567600/4588595) loss:3.047 lr:0.0000100 epoch_Time:25537.0min: [2024-01-05 05:44:29,973][model8_pretrain.py][INFO] Epoch:[0/2](567600/4588595) loss:2.488 lr:0.0000100 epoch_Time:25537.0min: [2024-01-05 05:45:06,909][model8_pretrain.py][INFO] Epoch:[0/2](567700/4588595) loss:2.235 lr:0.0000100 epoch_Time:25535.0min: [2024-01-05 05:45:06,909][model8_pretrain.py][INFO] Epoch:[0/2](567700/4588595) loss:2.306 lr:0.0000100 epoch_Time:25535.0min: [2024-01-05 05:45:06,909][model8_pretrain.py][INFO] Epoch:[0/2](567700/4588595) loss:2.817 lr:0.0000100 epoch_Time:25535.0min: [2024-01-05 05:45:06,909][model8_pretrain.py][INFO] 
Epoch:[0/2](567700/4588595) loss:2.381 lr:0.0000100 epoch_Time:25535.0min: [2024-01-05 05:45:06,909][model8_pretrain.py][INFO] Epoch:[0/2](567700/4588595) loss:2.805 lr:0.0000100 epoch_Time:25535.0min: [2024-01-05 05:45:06,909][model8_pretrain.py][INFO] Epoch:[0/2](567700/4588595) loss:3.165 lr:0.0000100 epoch_Time:25535.0min: [2024-01-05 05:45:06,909][model8_pretrain.py][INFO] Epoch:[0/2](567700/4588595) loss:2.501 lr:0.0000100 epoch_Time:25535.0min: [2024-01-05 05:45:06,909][model8_pretrain.py][INFO] Epoch:[0/2](567700/4588595) loss:2.794 lr:0.0000100 epoch_Time:25535.0min: [2024-01-05 05:45:43,839][model8_pretrain.py][INFO] Epoch:[0/2](567800/4588595) loss:3.027 lr:0.0000100 epoch_Time:25535.0min: [2024-01-05 05:45:43,839][model8_pretrain.py][INFO] Epoch:[0/2](567800/4588595) loss:3.100 lr:0.0000100 epoch_Time:25535.0min: [2024-01-05 05:45:43,839][model8_pretrain.py][INFO] Epoch:[0/2](567800/4588595) loss:2.633 lr:0.0000100 epoch_Time:25535.0min: [2024-01-05 05:45:43,839][model8_pretrain.py][INFO] Epoch:[0/2](567800/4588595) loss:2.793 lr:0.0000100 epoch_Time:25535.0min: [2024-01-05 05:45:43,839][model8_pretrain.py][INFO] Epoch:[0/2](567800/4588595) loss:2.841 lr:0.0000100 epoch_Time:25535.0min: [2024-01-05 05:45:43,839][model8_pretrain.py][INFO] Epoch:[0/2](567800/4588595) loss:2.727 lr:0.0000100 epoch_Time:25535.0min: [2024-01-05 05:45:43,839][model8_pretrain.py][INFO] Epoch:[0/2](567800/4588595) loss:2.704 lr:0.0000100 epoch_Time:25535.0min: [2024-01-05 05:45:43,839][model8_pretrain.py][INFO] Epoch:[0/2](567800/4588595) loss:2.635 lr:0.0000100 epoch_Time:25535.0min: [2024-01-05 05:46:20,775][model8_pretrain.py][INFO] Epoch:[0/2](567900/4588595) loss:2.643 lr:0.0000100 epoch_Time:25534.0min: [2024-01-05 05:46:20,775][model8_pretrain.py][INFO] Epoch:[0/2](567900/4588595) loss:2.330 lr:0.0000100 epoch_Time:25534.0min: [2024-01-05 05:46:20,776][model8_pretrain.py][INFO] Epoch:[0/2](567900/4588595) loss:2.980 lr:0.0000100 epoch_Time:25534.0min: [2024-01-05 
05:46:20,776][model8_pretrain.py][INFO] Epoch:[0/2](567900/4588595) loss:2.516 lr:0.0000100 epoch_Time:25534.0min: [2024-01-05 05:46:20,776][model8_pretrain.py][INFO] Epoch:[0/2](567900/4588595) loss:2.764 lr:0.0000100 epoch_Time:25534.0min: [2024-01-05 05:46:20,776][model8_pretrain.py][INFO] Epoch:[0/2](567900/4588595) loss:2.916 lr:0.0000100 epoch_Time:25534.0min: [2024-01-05 05:46:20,776][model8_pretrain.py][INFO] Epoch:[0/2](567900/4588595) loss:2.485 lr:0.0000100 epoch_Time:25534.0min: [2024-01-05 05:46:20,776][model8_pretrain.py][INFO] Epoch:[0/2](567900/4588595) loss:2.370 lr:0.0000100 epoch_Time:25534.0min: [2024-01-05 05:47:09,685][model8_pretrain.py][INFO] Epoch:[0/2](568000/4588595) loss:3.105 lr:0.0000100 epoch_Time:25535.0min: [2024-01-05 05:47:09,685][model8_pretrain.py][INFO] Epoch:[0/2](568000/4588595) loss:2.815 lr:0.0000100 epoch_Time:25535.0min: [2024-01-05 05:47:09,686][model8_pretrain.py][INFO] Epoch:[0/2](568000/4588595) loss:2.749 lr:0.0000100 epoch_Time:25535.0min: [2024-01-05 05:47:09,686][model8_pretrain.py][INFO] Epoch:[0/2](568000/4588595) loss:2.727 lr:0.0000100 epoch_Time:25535.0min: [2024-01-05 05:47:09,686][model8_pretrain.py][INFO] Epoch:[0/2](568000/4588595) loss:3.097 lr:0.0000100 epoch_Time:25535.0min: [2024-01-05 05:47:09,686][model8_pretrain.py][INFO] Epoch:[0/2](568000/4588595) loss:2.258 lr:0.0000100 epoch_Time:25535.0min: [2024-01-05 05:47:09,686][model8_pretrain.py][INFO] Epoch:[0/2](568000/4588595) loss:3.010 lr:0.0000100 epoch_Time:25535.0min: [2024-01-05 05:47:09,686][model8_pretrain.py][INFO] Epoch:[0/2](568000/4588595) loss:3.187 lr:0.0000100 epoch_Time:25535.0min: [2024-01-05 05:47:46,601][model8_pretrain.py][INFO] Epoch:[0/2](568100/4588595) loss:2.784 lr:0.0000100 epoch_Time:25534.0min: [2024-01-05 05:47:46,601][model8_pretrain.py][INFO] Epoch:[0/2](568100/4588595) loss:2.296 lr:0.0000100 epoch_Time:25534.0min: [2024-01-05 05:47:46,601][model8_pretrain.py][INFO] Epoch:[0/2](568100/4588595) loss:2.759 lr:0.0000100 
epoch_Time:25534.0min: [2024-01-05 05:47:46,601][model8_pretrain.py][INFO] Epoch:[0/2](568100/4588595) loss:2.806 lr:0.0000100 epoch_Time:25534.0min: [2024-01-05 05:47:46,601][model8_pretrain.py][INFO] Epoch:[0/2](568100/4588595) loss:2.839 lr:0.0000100 epoch_Time:25534.0min: [2024-01-05 05:47:46,601][model8_pretrain.py][INFO] Epoch:[0/2](568100/4588595) loss:2.341 lr:0.0000100 epoch_Time:25534.0min: [2024-01-05 05:47:46,601][model8_pretrain.py][INFO] Epoch:[0/2](568100/4588595) loss:3.103 lr:0.0000100 epoch_Time:25534.0min: [2024-01-05 05:47:46,601][model8_pretrain.py][INFO] Epoch:[0/2](568100/4588595) loss:2.792 lr:0.0000100 epoch_Time:25534.0min: [2024-01-05 05:48:23,530][model8_pretrain.py][INFO] Epoch:[0/2](568200/4588595) loss:2.059 lr:0.0000100 epoch_Time:25533.0min: [2024-01-05 05:48:23,530][model8_pretrain.py][INFO] Epoch:[0/2](568200/4588595) loss:3.229 lr:0.0000100 epoch_Time:25533.0min: [2024-01-05 05:48:23,530][model8_pretrain.py][INFO] Epoch:[0/2](568200/4588595) loss:2.384 lr:0.0000100 epoch_Time:25533.0min: [2024-01-05 05:48:23,530][model8_pretrain.py][INFO] Epoch:[0/2](568200/4588595) loss:2.814 lr:0.0000100 epoch_Time:25533.0min: [2024-01-05 05:48:23,530][model8_pretrain.py][INFO] Epoch:[0/2](568200/4588595) loss:3.017 lr:0.0000100 epoch_Time:25533.0min: [2024-01-05 05:48:23,530][model8_pretrain.py][INFO] Epoch:[0/2](568200/4588595) loss:2.365 lr:0.0000100 epoch_Time:25533.0min: [2024-01-05 05:48:23,530][model8_pretrain.py][INFO] Epoch:[0/2](568200/4588595) loss:2.608 lr:0.0000100 epoch_Time:25533.0min: [2024-01-05 05:48:23,530][model8_pretrain.py][INFO] Epoch:[0/2](568200/4588595) loss:2.886 lr:0.0000100 epoch_Time:25533.0min: [2024-01-05 05:49:00,461][model8_pretrain.py][INFO] Epoch:[0/2](568300/4588595) loss:3.117 lr:0.0000100 epoch_Time:25532.0min: [2024-01-05 05:49:00,461][model8_pretrain.py][INFO] Epoch:[0/2](568300/4588595) loss:2.263 lr:0.0000100 epoch_Time:25532.0min: [2024-01-05 05:49:00,461][model8_pretrain.py][INFO] 
Epoch:[0/2](568300/4588595) loss:2.436 lr:0.0000100 epoch_Time:25532.0min: [2024-01-05 05:49:00,461][model8_pretrain.py][INFO] Epoch:[0/2](568300/4588595) loss:2.115 lr:0.0000100 epoch_Time:25532.0min: [2024-01-05 05:49:00,461][model8_pretrain.py][INFO] Epoch:[0/2](568300/4588595) loss:2.478 lr:0.0000100 epoch_Time:25532.0min: [2024-01-05 05:49:00,461][model8_pretrain.py][INFO] Epoch:[0/2](568300/4588595) loss:2.673 lr:0.0000100 epoch_Time:25532.0min: [2024-01-05 05:49:00,461][model8_pretrain.py][INFO] Epoch:[0/2](568300/4588595) loss:2.992 lr:0.0000100 epoch_Time:25532.0min: [2024-01-05 05:49:00,461][model8_pretrain.py][INFO] Epoch:[0/2](568300/4588595) loss:2.799 lr:0.0000100 epoch_Time:25532.0min: [2024-01-05 05:49:37,386][model8_pretrain.py][INFO] Epoch:[0/2](568400/4588595) loss:2.691 lr:0.0000100 epoch_Time:25532.0min: [2024-01-05 05:49:37,386][model8_pretrain.py][INFO] Epoch:[0/2](568400/4588595) loss:2.258 lr:0.0000100 epoch_Time:25532.0min: [2024-01-05 05:49:37,386][model8_pretrain.py][INFO] Epoch:[0/2](568400/4588595) loss:2.444 lr:0.0000100 epoch_Time:25532.0min: [2024-01-05 05:49:37,386][model8_pretrain.py][INFO] Epoch:[0/2](568400/4588595) loss:3.252 lr:0.0000100 epoch_Time:25532.0min: [2024-01-05 05:49:37,386][model8_pretrain.py][INFO] Epoch:[0/2](568400/4588595) loss:3.186 lr:0.0000100 epoch_Time:25532.0min: [2024-01-05 05:49:37,386][model8_pretrain.py][INFO] Epoch:[0/2](568400/4588595) loss:2.443 lr:0.0000100 epoch_Time:25532.0min: [2024-01-05 05:49:37,386][model8_pretrain.py][INFO] Epoch:[0/2](568400/4588595) loss:2.857 lr:0.0000100 epoch_Time:25532.0min: [2024-01-05 05:49:37,386][model8_pretrain.py][INFO] Epoch:[0/2](568400/4588595) loss:2.479 lr:0.0000100 epoch_Time:25532.0min: [2024-01-05 05:50:14,323][model8_pretrain.py][INFO] Epoch:[0/2](568500/4588595) loss:3.340 lr:0.0000100 epoch_Time:25531.0min: [2024-01-05 05:50:14,323][model8_pretrain.py][INFO] Epoch:[0/2](568500/4588595) loss:2.783 lr:0.0000100 epoch_Time:25531.0min: [2024-01-05 
05:50:14,323][model8_pretrain.py][INFO] Epoch:[0/2](568500/4588595) loss:2.548 lr:0.0000100 epoch_Time:25531.0min: [2024-01-05 05:50:14,323][model8_pretrain.py][INFO] Epoch:[0/2](568500/4588595) loss:2.678 lr:0.0000100 epoch_Time:25531.0min: [2024-01-05 05:50:14,323][model8_pretrain.py][INFO] Epoch:[0/2](568500/4588595) loss:3.323 lr:0.0000100 epoch_Time:25531.0min: [2024-01-05 05:50:14,323][model8_pretrain.py][INFO] Epoch:[0/2](568500/4588595) loss:2.311 lr:0.0000100 epoch_Time:25531.0min: [2024-01-05 05:50:14,323][model8_pretrain.py][INFO] Epoch:[0/2](568500/4588595) loss:2.878 lr:0.0000100 epoch_Time:25531.0min: [2024-01-05 05:50:14,323][model8_pretrain.py][INFO] Epoch:[0/2](568500/4588595) loss:2.708 lr:0.0000100 epoch_Time:25531.0min: [2024-01-05 05:50:51,244][model8_pretrain.py][INFO] Epoch:[0/2](568600/4588595) loss:3.022 lr:0.0000100 epoch_Time:25530.0min: [2024-01-05 05:50:51,244][model8_pretrain.py][INFO] Epoch:[0/2](568600/4588595) loss:2.893 lr:0.0000100 epoch_Time:25530.0min: [2024-01-05 05:50:51,244][model8_pretrain.py][INFO] Epoch:[0/2](568600/4588595) loss:3.035 lr:0.0000100 epoch_Time:25530.0min: [2024-01-05 05:50:51,244][model8_pretrain.py][INFO] Epoch:[0/2](568600/4588595) loss:3.169 lr:0.0000100 epoch_Time:25530.0min: [2024-01-05 05:50:51,244][model8_pretrain.py][INFO] Epoch:[0/2](568600/4588595) loss:2.937 lr:0.0000100 epoch_Time:25530.0min: [2024-01-05 05:50:51,244][model8_pretrain.py][INFO] Epoch:[0/2](568600/4588595) loss:2.937 lr:0.0000100 epoch_Time:25530.0min: [2024-01-05 05:50:51,244][model8_pretrain.py][INFO] Epoch:[0/2](568600/4588595) loss:2.163 lr:0.0000100 epoch_Time:25530.0min: [2024-01-05 05:50:51,244][model8_pretrain.py][INFO] Epoch:[0/2](568600/4588595) loss:3.594 lr:0.0000100 epoch_Time:25530.0min: [2024-01-05 05:51:28,191][model8_pretrain.py][INFO] Epoch:[0/2](568700/4588595) loss:3.042 lr:0.0000100 epoch_Time:25530.0min: [2024-01-05 05:51:28,192][model8_pretrain.py][INFO] Epoch:[0/2](568700/4588595) loss:2.959 lr:0.0000100 
epoch_Time:25530.0min: [2024-01-05 05:51:28,192][model8_pretrain.py][INFO] Epoch:[0/2](568700/4588595) loss:3.077 lr:0.0000100 epoch_Time:25530.0min: [2024-01-05 05:51:28,192][model8_pretrain.py][INFO] Epoch:[0/2](568700/4588595) loss:2.639 lr:0.0000100 epoch_Time:25530.0min: [2024-01-05 05:51:28,192][model8_pretrain.py][INFO] Epoch:[0/2](568700/4588595) loss:2.752 lr:0.0000100 epoch_Time:25530.0min: [2024-01-05 05:51:28,192][model8_pretrain.py][INFO] Epoch:[0/2](568700/4588595) loss:2.430 lr:0.0000100 epoch_Time:25530.0min: [2024-01-05 05:51:28,192][model8_pretrain.py][INFO] Epoch:[0/2](568700/4588595) loss:3.185 lr:0.0000100 epoch_Time:25530.0min: [2024-01-05 05:51:28,192][model8_pretrain.py][INFO] Epoch:[0/2](568700/4588595) loss:2.940 lr:0.0000100 epoch_Time:25530.0min: [2024-01-05 05:52:16,932][model8_pretrain.py][INFO] Epoch:[0/2](568800/4588595) loss:3.065 lr:0.0000100 epoch_Time:25530.0min: [2024-01-05 05:52:16,932][model8_pretrain.py][INFO] Epoch:[0/2](568800/4588595) loss:3.025 lr:0.0000100 epoch_Time:25530.0min: [2024-01-05 05:52:16,932][model8_pretrain.py][INFO] Epoch:[0/2](568800/4588595) loss:2.457 lr:0.0000100 epoch_Time:25530.0min: [2024-01-05 05:52:16,932][model8_pretrain.py][INFO] Epoch:[0/2](568800/4588595) loss:2.912 lr:0.0000100 epoch_Time:25530.0min: [2024-01-05 05:52:16,932][model8_pretrain.py][INFO] Epoch:[0/2](568800/4588595) loss:2.733 lr:0.0000100 epoch_Time:25530.0min: [2024-01-05 05:52:16,932][model8_pretrain.py][INFO] Epoch:[0/2](568800/4588595) loss:2.789 lr:0.0000100 epoch_Time:25530.0min: [2024-01-05 05:52:16,932][model8_pretrain.py][INFO] Epoch:[0/2](568800/4588595) loss:2.245 lr:0.0000100 epoch_Time:25530.0min: [2024-01-05 05:52:16,932][model8_pretrain.py][INFO] Epoch:[0/2](568800/4588595) loss:3.313 lr:0.0000100 epoch_Time:25530.0min: [2024-01-05 05:52:53,852][model8_pretrain.py][INFO] Epoch:[0/2](568900/4588595) loss:3.247 lr:0.0000100 epoch_Time:25529.0min: [2024-01-05 05:52:53,852][model8_pretrain.py][INFO] 
Epoch:[0/2](568900/4588595) loss:2.812 lr:0.0000100 epoch_Time:25529.0min: [2024-01-05 05:52:53,852][model8_pretrain.py][INFO] Epoch:[0/2](568900/4588595) loss:2.550 lr:0.0000100 epoch_Time:25529.0min: [2024-01-05 05:52:53,852][model8_pretrain.py][INFO] Epoch:[0/2](568900/4588595) loss:2.706 lr:0.0000100 epoch_Time:25529.0min: [2024-01-05 05:52:53,852][model8_pretrain.py][INFO] Epoch:[0/2](568900/4588595) loss:2.927 lr:0.0000100 epoch_Time:25529.0min: [2024-01-05 05:52:53,852][model8_pretrain.py][INFO] Epoch:[0/2](568900/4588595) loss:3.065 lr:0.0000100 epoch_Time:25529.0min: [2024-01-05 05:52:53,853][model8_pretrain.py][INFO] Epoch:[0/2](568900/4588595) loss:2.367 lr:0.0000100 epoch_Time:25529.0min: [2024-01-05 05:52:53,855][model8_pretrain.py][INFO] Epoch:[0/2](568900/4588595) loss:2.584 lr:0.0000100 epoch_Time:25529.0min: [2024-01-05 05:53:30,784][model8_pretrain.py][INFO] Epoch:[0/2](569000/4588595) loss:2.856 lr:0.0000100 epoch_Time:25529.0min: [2024-01-05 05:53:30,784][model8_pretrain.py][INFO] Epoch:[0/2](569000/4588595) loss:3.276 lr:0.0000100 epoch_Time:25529.0min: [2024-01-05 05:53:30,784][model8_pretrain.py][INFO] Epoch:[0/2](569000/4588595) loss:2.421 lr:0.0000100 epoch_Time:25529.0min: [2024-01-05 05:53:30,784][model8_pretrain.py][INFO] Epoch:[0/2](569000/4588595) loss:2.699 lr:0.0000100 epoch_Time:25529.0min: [2024-01-05 05:53:30,784][model8_pretrain.py][INFO] Epoch:[0/2](569000/4588595) loss:2.195 lr:0.0000100 epoch_Time:25529.0min: [2024-01-05 05:53:30,784][model8_pretrain.py][INFO] Epoch:[0/2](569000/4588595) loss:3.029 lr:0.0000100 epoch_Time:25529.0min: [2024-01-05 05:53:30,784][model8_pretrain.py][INFO] Epoch:[0/2](569000/4588595) loss:2.799 lr:0.0000100 epoch_Time:25529.0min: [2024-01-05 05:53:30,784][model8_pretrain.py][INFO] Epoch:[0/2](569000/4588595) loss:3.300 lr:0.0000100 epoch_Time:25529.0min: [2024-01-05 05:54:07,717][model8_pretrain.py][INFO] Epoch:[0/2](569100/4588595) loss:3.220 lr:0.0000100 epoch_Time:25527.0min: [2024-01-05 
05:54:07,717][model8_pretrain.py][INFO] Epoch:[0/2](569100/4588595) loss:3.170 lr:0.0000100 epoch_Time:25527.0min: [2024-01-05 05:54:07,717][model8_pretrain.py][INFO] Epoch:[0/2](569100/4588595) loss:3.052 lr:0.0000100 epoch_Time:25527.0min: [2024-01-05 05:54:07,717][model8_pretrain.py][INFO] Epoch:[0/2](569100/4588595) loss:3.137 lr:0.0000100 epoch_Time:25527.0min: [2024-01-05 05:54:07,717][model8_pretrain.py][INFO] Epoch:[0/2](569100/4588595) loss:2.585 lr:0.0000100 epoch_Time:25527.0min: [2024-01-05 05:54:07,717][model8_pretrain.py][INFO] Epoch:[0/2](569100/4588595) loss:2.464 lr:0.0000100 epoch_Time:25527.0min: [2024-01-05 05:54:07,717][model8_pretrain.py][INFO] Epoch:[0/2](569100/4588595) loss:3.042 lr:0.0000100 epoch_Time:25527.0min: [2024-01-05 05:54:07,717][model8_pretrain.py][INFO] Epoch:[0/2](569100/4588595) loss:3.091 lr:0.0000100 epoch_Time:25527.0min: [2024-01-05 05:54:44,655][model8_pretrain.py][INFO] Epoch:[0/2](569200/4588595) loss:2.789 lr:0.0000100 epoch_Time:25527.0min: [2024-01-05 05:54:44,655][model8_pretrain.py][INFO] Epoch:[0/2](569200/4588595) loss:2.795 lr:0.0000100 epoch_Time:25527.0min: [2024-01-05 05:54:44,655][model8_pretrain.py][INFO] Epoch:[0/2](569200/4588595) loss:2.809 lr:0.0000100 epoch_Time:25527.0min: [2024-01-05 05:54:44,655][model8_pretrain.py][INFO] Epoch:[0/2](569200/4588595) loss:2.968 lr:0.0000100 epoch_Time:25527.0min: [2024-01-05 05:54:44,655][model8_pretrain.py][INFO] Epoch:[0/2](569200/4588595) loss:2.969 lr:0.0000100 epoch_Time:25527.0min: [2024-01-05 05:54:44,655][model8_pretrain.py][INFO] Epoch:[0/2](569200/4588595) loss:3.006 lr:0.0000100 epoch_Time:25527.0min: [2024-01-05 05:54:44,655][model8_pretrain.py][INFO] Epoch:[0/2](569200/4588595) loss:2.651 lr:0.0000100 epoch_Time:25527.0min: [2024-01-05 05:54:44,655][model8_pretrain.py][INFO] Epoch:[0/2](569200/4588595) loss:3.113 lr:0.0000100 epoch_Time:25527.0min: [2024-01-05 05:55:21,585][model8_pretrain.py][INFO] Epoch:[0/2](569300/4588595) loss:3.348 lr:0.0000100 
epoch_Time:25526.0min: [2024-01-05 05:55:21,585][model8_pretrain.py][INFO] Epoch:[0/2](569300/4588595) loss:2.772 lr:0.0000100 epoch_Time:25526.0min: [2024-01-05 05:55:21,585][model8_pretrain.py][INFO] Epoch:[0/2](569300/4588595) loss:2.762 lr:0.0000100 epoch_Time:25526.0min: [2024-01-05 05:55:21,585][model8_pretrain.py][INFO] Epoch:[0/2](569300/4588595) loss:3.170 lr:0.0000100 epoch_Time:25526.0min: [2024-01-05 05:55:21,585][model8_pretrain.py][INFO] Epoch:[0/2](569300/4588595) loss:3.071 lr:0.0000100 epoch_Time:25526.0min: [2024-01-05 05:55:21,585][model8_pretrain.py][INFO] Epoch:[0/2](569300/4588595) loss:2.838 lr:0.0000100 epoch_Time:25526.0min: [2024-01-05 05:55:21,585][model8_pretrain.py][INFO] Epoch:[0/2](569300/4588595) loss:1.975 lr:0.0000100 epoch_Time:25526.0min: [2024-01-05 05:55:21,585][model8_pretrain.py][INFO] Epoch:[0/2](569300/4588595) loss:2.819 lr:0.0000100 epoch_Time:25526.0min: [2024-01-05 05:55:58,511][model8_pretrain.py][INFO] Epoch:[0/2](569400/4588595) loss:2.743 lr:0.0000100 epoch_Time:25525.0min: [2024-01-05 05:55:58,511][model8_pretrain.py][INFO] Epoch:[0/2](569400/4588595) loss:2.558 lr:0.0000100 epoch_Time:25525.0min: [2024-01-05 05:55:58,511][model8_pretrain.py][INFO] Epoch:[0/2](569400/4588595) loss:2.855 lr:0.0000100 epoch_Time:25525.0min: [2024-01-05 05:55:58,511][model8_pretrain.py][INFO] Epoch:[0/2](569400/4588595) loss:2.485 lr:0.0000100 epoch_Time:25525.0min: [2024-01-05 05:55:58,511][model8_pretrain.py][INFO] Epoch:[0/2](569400/4588595) loss:3.220 lr:0.0000100 epoch_Time:25525.0min: [2024-01-05 05:55:58,511][model8_pretrain.py][INFO] Epoch:[0/2](569400/4588595) loss:2.740 lr:0.0000100 epoch_Time:25525.0min: [2024-01-05 05:55:58,511][model8_pretrain.py][INFO] Epoch:[0/2](569400/4588595) loss:2.794 lr:0.0000100 epoch_Time:25525.0min: [2024-01-05 05:55:58,511][model8_pretrain.py][INFO] Epoch:[0/2](569400/4588595) loss:2.799 lr:0.0000100 epoch_Time:25525.0min: [2024-01-05 05:56:35,439][model8_pretrain.py][INFO] 
Epoch:[0/2](569500/4588595) loss:2.889 lr:0.0000100 epoch_Time:25525.0min: [2024-01-05 05:56:35,439][model8_pretrain.py][INFO] Epoch:[0/2](569500/4588595) loss:2.956 lr:0.0000100 epoch_Time:25525.0min: [2024-01-05 05:56:35,439][model8_pretrain.py][INFO] Epoch:[0/2](569500/4588595) loss:2.767 lr:0.0000100 epoch_Time:25525.0min: [2024-01-05 05:56:35,439][model8_pretrain.py][INFO] Epoch:[0/2](569500/4588595) loss:2.599 lr:0.0000100 epoch_Time:25525.0min: [2024-01-05 05:56:35,439][model8_pretrain.py][INFO] Epoch:[0/2](569500/4588595) loss:2.882 lr:0.0000100 epoch_Time:25525.0min: [2024-01-05 05:56:35,440][model8_pretrain.py][INFO] Epoch:[0/2](569500/4588595) loss:2.921 lr:0.0000100 epoch_Time:25525.0min: [2024-01-05 05:56:35,440][model8_pretrain.py][INFO] Epoch:[0/2](569500/4588595) loss:3.442 lr:0.0000100 epoch_Time:25525.0min: [2024-01-05 05:56:35,440][model8_pretrain.py][INFO] Epoch:[0/2](569500/4588595) loss:3.103 lr:0.0000100 epoch_Time:25525.0min: [2024-01-05 05:57:22,610][model8_pretrain.py][INFO] Epoch:[0/2](569600/4588595) loss:2.768 lr:0.0000100 epoch_Time:25525.0min: [2024-01-05 05:57:22,610][model8_pretrain.py][INFO] Epoch:[0/2](569600/4588595) loss:2.947 lr:0.0000100 epoch_Time:25525.0min: [2024-01-05 05:57:22,610][model8_pretrain.py][INFO] Epoch:[0/2](569600/4588595) loss:2.734 lr:0.0000100 epoch_Time:25525.0min: [2024-01-05 05:57:22,610][model8_pretrain.py][INFO] Epoch:[0/2](569600/4588595) loss:3.212 lr:0.0000100 epoch_Time:25525.0min: [2024-01-05 05:57:22,610][model8_pretrain.py][INFO] Epoch:[0/2](569600/4588595) loss:2.594 lr:0.0000100 epoch_Time:25525.0min: [2024-01-05 05:57:22,610][model8_pretrain.py][INFO] Epoch:[0/2](569600/4588595) loss:3.123 lr:0.0000100 epoch_Time:25525.0min: [2024-01-05 05:57:22,610][model8_pretrain.py][INFO] Epoch:[0/2](569600/4588595) loss:2.383 lr:0.0000100 epoch_Time:25525.0min: [2024-01-05 05:57:22,610][model8_pretrain.py][INFO] Epoch:[0/2](569600/4588595) loss:2.713 lr:0.0000100 epoch_Time:25525.0min: [2024-01-05 
05:58:01,198][model8_pretrain.py][INFO] Epoch:[0/2](569700/4588595) loss:3.050 lr:0.0000100 epoch_Time:25524.0min: [2024-01-05 05:58:01,198][model8_pretrain.py][INFO] Epoch:[0/2](569700/4588595) loss:2.286 lr:0.0000100 epoch_Time:25524.0min: [2024-01-05 05:58:01,198][model8_pretrain.py][INFO] Epoch:[0/2](569700/4588595) loss:2.680 lr:0.0000100 epoch_Time:25524.0min: [2024-01-05 05:58:01,198][model8_pretrain.py][INFO] Epoch:[0/2](569700/4588595) loss:2.282 lr:0.0000100 epoch_Time:25524.0min: [2024-01-05 05:58:01,198][model8_pretrain.py][INFO] Epoch:[0/2](569700/4588595) loss:2.754 lr:0.0000100 epoch_Time:25524.0min: [2024-01-05 05:58:01,198][model8_pretrain.py][INFO] Epoch:[0/2](569700/4588595) loss:3.164 lr:0.0000100 epoch_Time:25524.0min: [2024-01-05 05:58:01,198][model8_pretrain.py][INFO] Epoch:[0/2](569700/4588595) loss:2.687 lr:0.0000100 epoch_Time:25524.0min: [2024-01-05 05:58:01,198][model8_pretrain.py][INFO] Epoch:[0/2](569700/4588595) loss:3.143 lr:0.0000100 epoch_Time:25524.0min: [2024-01-05 05:58:38,137][model8_pretrain.py][INFO] Epoch:[0/2](569800/4588595) loss:2.981 lr:0.0000100 epoch_Time:25524.0min: [2024-01-05 05:58:38,137][model8_pretrain.py][INFO] Epoch:[0/2](569800/4588595) loss:3.126 lr:0.0000100 epoch_Time:25524.0min: [2024-01-05 05:58:38,137][model8_pretrain.py][INFO] Epoch:[0/2](569800/4588595) loss:3.071 lr:0.0000100 epoch_Time:25524.0min: [2024-01-05 05:58:38,137][model8_pretrain.py][INFO] Epoch:[0/2](569800/4588595) loss:3.104 lr:0.0000100 epoch_Time:25524.0min: [2024-01-05 05:58:38,137][model8_pretrain.py][INFO] Epoch:[0/2](569800/4588595) loss:2.618 lr:0.0000100 epoch_Time:25524.0min: [2024-01-05 05:58:38,137][model8_pretrain.py][INFO] Epoch:[0/2](569800/4588595) loss:3.566 lr:0.0000100 epoch_Time:25524.0min: [2024-01-05 05:58:38,137][model8_pretrain.py][INFO] Epoch:[0/2](569800/4588595) loss:2.941 lr:0.0000100 epoch_Time:25524.0min: [2024-01-05 05:58:38,137][model8_pretrain.py][INFO] Epoch:[0/2](569800/4588595) loss:3.172 lr:0.0000100 
epoch_Time:25524.0min: [2024-01-05 05:59:15,064][model8_pretrain.py][INFO] Epoch:[0/2](569900/4588595) loss:2.651 lr:0.0000100 epoch_Time:25523.0min: [2024-01-05 05:59:15,064][model8_pretrain.py][INFO] Epoch:[0/2](569900/4588595) loss:3.046 lr:0.0000100 epoch_Time:25523.0min: [2024-01-05 05:59:15,064][model8_pretrain.py][INFO] Epoch:[0/2](569900/4588595) loss:2.451 lr:0.0000100 epoch_Time:25523.0min: [2024-01-05 05:59:15,064][model8_pretrain.py][INFO] Epoch:[0/2](569900/4588595) loss:2.710 lr:0.0000100 epoch_Time:25523.0min: [2024-01-05 05:59:15,064][model8_pretrain.py][INFO] Epoch:[0/2](569900/4588595) loss:2.850 lr:0.0000100 epoch_Time:25523.0min: [2024-01-05 05:59:15,064][model8_pretrain.py][INFO] Epoch:[0/2](569900/4588595) loss:2.403 lr:0.0000100 epoch_Time:25523.0min: [2024-01-05 05:59:15,064][model8_pretrain.py][INFO] Epoch:[0/2](569900/4588595) loss:2.933 lr:0.0000100 epoch_Time:25523.0min: [2024-01-05 05:59:15,064][model8_pretrain.py][INFO] Epoch:[0/2](569900/4588595) loss:2.903 lr:0.0000100 epoch_Time:25523.0min: [2024-01-05 05:59:52,004][model8_pretrain.py][INFO] Epoch:[0/2](570000/4588595) loss:2.759 lr:0.0000100 epoch_Time:25522.0min: [2024-01-05 05:59:52,004][model8_pretrain.py][INFO] Epoch:[0/2](570000/4588595) loss:2.782 lr:0.0000100 epoch_Time:25522.0min: [2024-01-05 05:59:52,004][model8_pretrain.py][INFO] Epoch:[0/2](570000/4588595) loss:2.277 lr:0.0000100 epoch_Time:25522.0min: [2024-01-05 05:59:52,004][model8_pretrain.py][INFO] Epoch:[0/2](570000/4588595) loss:2.509 lr:0.0000100 epoch_Time:25522.0min: [2024-01-05 05:59:52,004][model8_pretrain.py][INFO] Epoch:[0/2](570000/4588595) loss:2.716 lr:0.0000100 epoch_Time:25522.0min: [2024-01-05 05:59:52,004][model8_pretrain.py][INFO] Epoch:[0/2](570000/4588595) loss:3.038 lr:0.0000100 epoch_Time:25522.0min: [2024-01-05 05:59:52,004][model8_pretrain.py][INFO] Epoch:[0/2](570000/4588595) loss:2.904 lr:0.0000100 epoch_Time:25522.0min: [2024-01-05 05:59:52,004][model8_pretrain.py][INFO] 
Epoch:[0/2](570000/4588595) loss:3.004 lr:0.0000100 epoch_Time:25522.0min: [2024-01-05 06:00:28,931][model8_pretrain.py][INFO] Epoch:[0/2](570100/4588595) loss:2.934 lr:0.0000100 epoch_Time:25521.0min: [2024-01-05 06:00:28,931][model8_pretrain.py][INFO] Epoch:[0/2](570100/4588595) loss:3.015 lr:0.0000100 epoch_Time:25521.0min: [2024-01-05 06:00:28,931][model8_pretrain.py][INFO] Epoch:[0/2](570100/4588595) loss:2.845 lr:0.0000100 epoch_Time:25521.0min: [2024-01-05 06:00:28,931][model8_pretrain.py][INFO] Epoch:[0/2](570100/4588595) loss:2.663 lr:0.0000100 epoch_Time:25521.0min: [2024-01-05 06:00:28,931][model8_pretrain.py][INFO] Epoch:[0/2](570100/4588595) loss:2.756 lr:0.0000100 epoch_Time:25521.0min: [2024-01-05 06:00:28,932][model8_pretrain.py][INFO] Epoch:[0/2](570100/4588595) loss:2.673 lr:0.0000100 epoch_Time:25521.0min: [2024-01-05 06:00:28,932][model8_pretrain.py][INFO] Epoch:[0/2](570100/4588595) loss:3.106 lr:0.0000100 epoch_Time:25521.0min: [2024-01-05 06:00:28,932][model8_pretrain.py][INFO] Epoch:[0/2](570100/4588595) loss:2.447 lr:0.0000100 epoch_Time:25521.0min: [2024-01-05 06:01:05,864][model8_pretrain.py][INFO] Epoch:[0/2](570200/4588595) loss:3.076 lr:0.0000100 epoch_Time:25520.0min: [2024-01-05 06:01:05,864][model8_pretrain.py][INFO] Epoch:[0/2](570200/4588595) loss:2.801 lr:0.0000100 epoch_Time:25520.0min: [2024-01-05 06:01:05,864][model8_pretrain.py][INFO] Epoch:[0/2](570200/4588595) loss:3.131 lr:0.0000100 epoch_Time:25520.0min: [2024-01-05 06:01:05,864][model8_pretrain.py][INFO] Epoch:[0/2](570200/4588595) loss:2.948 lr:0.0000100 epoch_Time:25520.0min: [2024-01-05 06:01:05,864][model8_pretrain.py][INFO] Epoch:[0/2](570200/4588595) loss:3.028 lr:0.0000100 epoch_Time:25520.0min: [2024-01-05 06:01:05,864][model8_pretrain.py][INFO] Epoch:[0/2](570200/4588595) loss:2.334 lr:0.0000100 epoch_Time:25520.0min: [2024-01-05 06:01:05,864][model8_pretrain.py][INFO] Epoch:[0/2](570200/4588595) loss:3.004 lr:0.0000100 epoch_Time:25520.0min: [2024-01-05 
06:01:05,864][model8_pretrain.py][INFO] Epoch:[0/2](570200/4588595) loss:3.072 lr:0.0000100 epoch_Time:25520.0min: [2024-01-05 06:01:42,807][model8_pretrain.py][INFO] Epoch:[0/2](570300/4588595) loss:2.761 lr:0.0000100 epoch_Time:25520.0min: [2024-01-05 06:01:42,807][model8_pretrain.py][INFO] Epoch:[0/2](570300/4588595) loss:2.991 lr:0.0000100 epoch_Time:25520.0min: [2024-01-05 06:01:42,807][model8_pretrain.py][INFO] Epoch:[0/2](570300/4588595) loss:2.898 lr:0.0000100 epoch_Time:25520.0min: [2024-01-05 06:01:42,807][model8_pretrain.py][INFO] Epoch:[0/2](570300/4588595) loss:3.033 lr:0.0000100 epoch_Time:25520.0min: [2024-01-05 06:01:42,807][model8_pretrain.py][INFO] Epoch:[0/2](570300/4588595) loss:3.176 lr:0.0000100 epoch_Time:25520.0min: [2024-01-05 06:01:42,807][model8_pretrain.py][INFO] Epoch:[0/2](570300/4588595) loss:2.762 lr:0.0000100 epoch_Time:25520.0min: [2024-01-05 06:01:42,807][model8_pretrain.py][INFO] Epoch:[0/2](570300/4588595) loss:2.851 lr:0.0000100 epoch_Time:25520.0min: [2024-01-05 06:01:42,807][model8_pretrain.py][INFO] Epoch:[0/2](570300/4588595) loss:3.025 lr:0.0000100 epoch_Time:25520.0min: [2024-01-05 06:02:29,775][model8_pretrain.py][INFO] Epoch:[0/2](570400/4588595) loss:2.821 lr:0.0000100 epoch_Time:25520.0min: [2024-01-05 06:02:29,775][model8_pretrain.py][INFO] Epoch:[0/2](570400/4588595) loss:2.257 lr:0.0000100 epoch_Time:25520.0min: [2024-01-05 06:02:29,775][model8_pretrain.py][INFO] Epoch:[0/2](570400/4588595) loss:3.251 lr:0.0000100 epoch_Time:25520.0min: [2024-01-05 06:02:29,775][model8_pretrain.py][INFO] Epoch:[0/2](570400/4588595) loss:2.766 lr:0.0000100 epoch_Time:25520.0min: [2024-01-05 06:02:29,775][model8_pretrain.py][INFO] Epoch:[0/2](570400/4588595) loss:2.735 lr:0.0000100 epoch_Time:25520.0min: [2024-01-05 06:02:29,775][model8_pretrain.py][INFO] Epoch:[0/2](570400/4588595) loss:2.672 lr:0.0000100 epoch_Time:25520.0min: [2024-01-05 06:02:29,775][model8_pretrain.py][INFO] Epoch:[0/2](570400/4588595) loss:3.020 lr:0.0000100 
epoch_Time:25520.0min: [2024-01-05 06:02:29,775][model8_pretrain.py][INFO] Epoch:[0/2](570400/4588595) loss:2.697 lr:0.0000100 epoch_Time:25520.0min: [2024-01-05 06:03:08,396][model8_pretrain.py][INFO] Epoch:[0/2](570500/4588595) loss:3.443 lr:0.0000100 epoch_Time:25519.0min: [2024-01-05 06:03:08,396][model8_pretrain.py][INFO] Epoch:[0/2](570500/4588595) loss:3.214 lr:0.0000100 epoch_Time:25519.0min: [2024-01-05 06:03:08,396][model8_pretrain.py][INFO] Epoch:[0/2](570500/4588595) loss:2.814 lr:0.0000100 epoch_Time:25519.0min: [2024-01-05 06:03:08,396][model8_pretrain.py][INFO] Epoch:[0/2](570500/4588595) loss:2.947 lr:0.0000100 epoch_Time:25519.0min: [2024-01-05 06:03:08,396][model8_pretrain.py][INFO] Epoch:[0/2](570500/4588595) loss:2.758 lr:0.0000100 epoch_Time:25519.0min: [2024-01-05 06:03:08,396][model8_pretrain.py][INFO] Epoch:[0/2](570500/4588595) loss:2.839 lr:0.0000100 epoch_Time:25519.0min: [2024-01-05 06:03:08,397][model8_pretrain.py][INFO] Epoch:[0/2](570500/4588595) loss:2.777 lr:0.0000100 epoch_Time:25519.0min: [2024-01-05 06:03:08,397][model8_pretrain.py][INFO] Epoch:[0/2](570500/4588595) loss:2.632 lr:0.0000100 epoch_Time:25519.0min: [2024-01-05 06:03:45,362][model8_pretrain.py][INFO] Epoch:[0/2](570600/4588595) loss:3.126 lr:0.0000100 epoch_Time:25519.0min: [2024-01-05 06:03:45,362][model8_pretrain.py][INFO] Epoch:[0/2](570600/4588595) loss:2.698 lr:0.0000100 epoch_Time:25519.0min: [2024-01-05 06:03:45,362][model8_pretrain.py][INFO] Epoch:[0/2](570600/4588595) loss:3.090 lr:0.0000100 epoch_Time:25519.0min: [2024-01-05 06:03:45,362][model8_pretrain.py][INFO] Epoch:[0/2](570600/4588595) loss:3.026 lr:0.0000100 epoch_Time:25519.0min: [2024-01-05 06:03:45,362][model8_pretrain.py][INFO] Epoch:[0/2](570600/4588595) loss:2.938 lr:0.0000100 epoch_Time:25519.0min: [2024-01-05 06:03:45,362][model8_pretrain.py][INFO] Epoch:[0/2](570600/4588595) loss:2.978 lr:0.0000100 epoch_Time:25519.0min: [2024-01-05 06:03:45,362][model8_pretrain.py][INFO] 
Epoch:[0/2](570600/4588595) loss:3.043 lr:0.0000100 epoch_Time:25519.0min: [2024-01-05 06:03:45,363][model8_pretrain.py][INFO] Epoch:[0/2](570600/4588595) loss:2.600 lr:0.0000100 epoch_Time:25519.0min: [2024-01-05 06:04:22,306][model8_pretrain.py][INFO] Epoch:[0/2](570700/4588595) loss:2.829 lr:0.0000100 epoch_Time:25518.0min: [2024-01-05 06:04:22,306][model8_pretrain.py][INFO] Epoch:[0/2](570700/4588595) loss:3.090 lr:0.0000100 epoch_Time:25518.0min: [2024-01-05 06:04:22,306][model8_pretrain.py][INFO] Epoch:[0/2](570700/4588595) loss:2.909 lr:0.0000100 epoch_Time:25518.0min: [2024-01-05 06:04:22,306][model8_pretrain.py][INFO] Epoch:[0/2](570700/4588595) loss:3.143 lr:0.0000100 epoch_Time:25518.0min: [2024-01-05 06:04:22,306][model8_pretrain.py][INFO] Epoch:[0/2](570700/4588595) loss:2.871 lr:0.0000100 epoch_Time:25518.0min: [2024-01-05 06:04:22,306][model8_pretrain.py][INFO] Epoch:[0/2](570700/4588595) loss:2.667 lr:0.0000100 epoch_Time:25518.0min: [2024-01-05 06:04:22,306][model8_pretrain.py][INFO] Epoch:[0/2](570700/4588595) loss:3.496 lr:0.0000100 epoch_Time:25518.0min: [2024-01-05 06:04:22,306][model8_pretrain.py][INFO] Epoch:[0/2](570700/4588595) loss:2.353 lr:0.0000100 epoch_Time:25518.0min: [2024-01-05 06:04:59,256][model8_pretrain.py][INFO] Epoch:[0/2](570800/4588595) loss:2.635 lr:0.0000100 epoch_Time:25517.0min: [2024-01-05 06:04:59,256][model8_pretrain.py][INFO] Epoch:[0/2](570800/4588595) loss:2.442 lr:0.0000100 epoch_Time:25517.0min: [2024-01-05 06:04:59,256][model8_pretrain.py][INFO] Epoch:[0/2](570800/4588595) loss:3.148 lr:0.0000100 epoch_Time:25517.0min: [2024-01-05 06:04:59,256][model8_pretrain.py][INFO] Epoch:[0/2](570800/4588595) loss:3.130 lr:0.0000100 epoch_Time:25517.0min: [2024-01-05 06:04:59,256][model8_pretrain.py][INFO] Epoch:[0/2](570800/4588595) loss:2.759 lr:0.0000100 epoch_Time:25517.0min: [2024-01-05 06:04:59,256][model8_pretrain.py][INFO] Epoch:[0/2](570800/4588595) loss:2.430 lr:0.0000100 epoch_Time:25517.0min: [2024-01-05 
06:04:59,256][model8_pretrain.py][INFO] Epoch:[0/2](570800/4588595) loss:2.371 lr:0.0000100 epoch_Time:25517.0min: [2024-01-05 06:04:59,256][model8_pretrain.py][INFO] Epoch:[0/2](570800/4588595) loss:2.675 lr:0.0000100 epoch_Time:25517.0min: [2024-01-05 06:05:36,203][model8_pretrain.py][INFO] Epoch:[0/2](570900/4588595) loss:3.314 lr:0.0000100 epoch_Time:25517.0min: [2024-01-05 06:05:36,203][model8_pretrain.py][INFO] Epoch:[0/2](570900/4588595) loss:2.968 lr:0.0000100 epoch_Time:25517.0min: [2024-01-05 06:05:36,203][model8_pretrain.py][INFO] Epoch:[0/2](570900/4588595) loss:3.226 lr:0.0000100 epoch_Time:25517.0min: [2024-01-05 06:05:36,203][model8_pretrain.py][INFO] Epoch:[0/2](570900/4588595) loss:2.941 lr:0.0000100 epoch_Time:25517.0min: [2024-01-05 06:05:36,203][model8_pretrain.py][INFO] Epoch:[0/2](570900/4588595) loss:2.772 lr:0.0000100 epoch_Time:25517.0min: [2024-01-05 06:05:36,203][model8_pretrain.py][INFO] Epoch:[0/2](570900/4588595) loss:3.208 lr:0.0000100 epoch_Time:25517.0min: [2024-01-05 06:05:36,203][model8_pretrain.py][INFO] Epoch:[0/2](570900/4588595) loss:2.102 lr:0.0000100 epoch_Time:25517.0min: [2024-01-05 06:05:36,203][model8_pretrain.py][INFO] Epoch:[0/2](570900/4588595) loss:3.388 lr:0.0000100 epoch_Time:25517.0min: [2024-01-05 06:06:13,151][model8_pretrain.py][INFO] Epoch:[0/2](571000/4588595) loss:2.923 lr:0.0000100 epoch_Time:25516.0min: [2024-01-05 06:06:13,151][model8_pretrain.py][INFO] Epoch:[0/2](571000/4588595) loss:2.459 lr:0.0000100 epoch_Time:25516.0min: [2024-01-05 06:06:13,151][model8_pretrain.py][INFO] Epoch:[0/2](571000/4588595) loss:3.153 lr:0.0000100 epoch_Time:25516.0min: [2024-01-05 06:06:13,151][model8_pretrain.py][INFO] Epoch:[0/2](571000/4588595) loss:2.963 lr:0.0000100 epoch_Time:25516.0min: [2024-01-05 06:06:13,151][model8_pretrain.py][INFO] Epoch:[0/2](571000/4588595) loss:2.709 lr:0.0000100 epoch_Time:25516.0min: [2024-01-05 06:06:13,151][model8_pretrain.py][INFO] Epoch:[0/2](571000/4588595) loss:2.747 lr:0.0000100 
epoch_Time:25516.0min: [2024-01-05 06:06:13,151][model8_pretrain.py][INFO] Epoch:[0/2](571000/4588595) loss:2.550 lr:0.0000100 epoch_Time:25516.0min: [2024-01-05 06:06:13,151][model8_pretrain.py][INFO] Epoch:[0/2](571000/4588595) loss:1.950 lr:0.0000100 epoch_Time:25516.0min: [2024-01-05 06:06:50,096][model8_pretrain.py][INFO] Epoch:[0/2](571100/4588595) loss:2.776 lr:0.0000100 epoch_Time:25514.0min: [2024-01-05 06:06:50,096][model8_pretrain.py][INFO] Epoch:[0/2](571100/4588595) loss:2.487 lr:0.0000100 epoch_Time:25515.0min: [2024-01-05 06:06:50,096][model8_pretrain.py][INFO] Epoch:[0/2](571100/4588595) loss:3.307 lr:0.0000100 epoch_Time:25514.0min: [2024-01-05 06:06:50,096][model8_pretrain.py][INFO] Epoch:[0/2](571100/4588595) loss:3.020 lr:0.0000100 epoch_Time:25514.0min: [2024-01-05 06:06:50,096][model8_pretrain.py][INFO] Epoch:[0/2](571100/4588595) loss:2.521 lr:0.0000100 epoch_Time:25514.0min: [2024-01-05 06:06:50,096][model8_pretrain.py][INFO] Epoch:[0/2](571100/4588595) loss:2.657 lr:0.0000100 epoch_Time:25514.0min: [2024-01-05 06:06:50,097][model8_pretrain.py][INFO] Epoch:[0/2](571100/4588595) loss:2.941 lr:0.0000100 epoch_Time:25514.0min: [2024-01-05 06:06:50,096][model8_pretrain.py][INFO] Epoch:[0/2](571100/4588595) loss:2.913 lr:0.0000100 epoch_Time:25515.0min: [2024-01-05 06:07:33,897][model8_pretrain.py][INFO] Epoch:[0/2](571200/4588595) loss:2.917 lr:0.0000100 epoch_Time:25515.0min: [2024-01-05 06:07:33,897][model8_pretrain.py][INFO] Epoch:[0/2](571200/4588595) loss:2.943 lr:0.0000100 epoch_Time:25515.0min: [2024-01-05 06:07:33,897][model8_pretrain.py][INFO] Epoch:[0/2](571200/4588595) loss:2.729 lr:0.0000100 epoch_Time:25515.0min: [2024-01-05 06:07:33,897][model8_pretrain.py][INFO] Epoch:[0/2](571200/4588595) loss:2.946 lr:0.0000100 epoch_Time:25515.0min: [2024-01-05 06:07:33,897][model8_pretrain.py][INFO] Epoch:[0/2](571200/4588595) loss:2.422 lr:0.0000100 epoch_Time:25515.0min: [2024-01-05 06:07:33,897][model8_pretrain.py][INFO] 
Epoch:[0/2](571200/4588595) loss:2.616 lr:0.0000100 epoch_Time:25515.0min: [2024-01-05 06:07:33,897][model8_pretrain.py][INFO] Epoch:[0/2](571200/4588595) loss:2.840 lr:0.0000100 epoch_Time:25515.0min: [2024-01-05 06:07:33,897][model8_pretrain.py][INFO] Epoch:[0/2](571200/4588595) loss:2.704 lr:0.0000100 epoch_Time:25515.0min: [2024-01-05 06:08:15,935][model8_pretrain.py][INFO] Epoch:[0/2](571300/4588595) loss:2.708 lr:0.0000100 epoch_Time:25515.0min: [2024-01-05 06:08:15,935][model8_pretrain.py][INFO] Epoch:[0/2](571300/4588595) loss:2.647 lr:0.0000100 epoch_Time:25515.0min: [2024-01-05 06:08:15,935][model8_pretrain.py][INFO] Epoch:[0/2](571300/4588595) loss:2.862 lr:0.0000100 epoch_Time:25515.0min: [2024-01-05 06:08:15,935][model8_pretrain.py][INFO] Epoch:[0/2](571300/4588595) loss:2.362 lr:0.0000100 epoch_Time:25515.0min: [2024-01-05 06:08:15,935][model8_pretrain.py][INFO] Epoch:[0/2](571300/4588595) loss:2.025 lr:0.0000100 epoch_Time:25515.0min: [2024-01-05 06:08:15,935][model8_pretrain.py][INFO] Epoch:[0/2](571300/4588595) loss:3.079 lr:0.0000100 epoch_Time:25515.0min: [2024-01-05 06:08:15,935][model8_pretrain.py][INFO] Epoch:[0/2](571300/4588595) loss:1.849 lr:0.0000100 epoch_Time:25515.0min: [2024-01-05 06:08:15,935][model8_pretrain.py][INFO] Epoch:[0/2](571300/4588595) loss:2.636 lr:0.0000100 epoch_Time:25515.0min: [2024-01-05 06:08:52,876][model8_pretrain.py][INFO] Epoch:[0/2](571400/4588595) loss:3.046 lr:0.0000100 epoch_Time:25514.0min: [2024-01-05 06:08:52,876][model8_pretrain.py][INFO] Epoch:[0/2](571400/4588595) loss:3.071 lr:0.0000100 epoch_Time:25514.0min: [2024-01-05 06:08:52,876][model8_pretrain.py][INFO] Epoch:[0/2](571400/4588595) loss:2.748 lr:0.0000100 epoch_Time:25514.0min: [2024-01-05 06:08:52,876][model8_pretrain.py][INFO] Epoch:[0/2](571400/4588595) loss:2.775 lr:0.0000100 epoch_Time:25514.0min: [2024-01-05 06:08:52,876][model8_pretrain.py][INFO] Epoch:[0/2](571400/4588595) loss:2.725 lr:0.0000100 epoch_Time:25514.0min: [2024-01-05 
06:08:52,876][model8_pretrain.py][INFO] Epoch:[0/2](571400/4588595) loss:3.083 lr:0.0000100 epoch_Time:25514.0min: [2024-01-05 06:08:52,876][model8_pretrain.py][INFO] Epoch:[0/2](571400/4588595) loss:3.295 lr:0.0000100 epoch_Time:25514.0min: [2024-01-05 06:08:52,876][model8_pretrain.py][INFO] Epoch:[0/2](571400/4588595) loss:3.172 lr:0.0000100 epoch_Time:25514.0min: [2024-01-05 06:09:29,814][model8_pretrain.py][INFO] Epoch:[0/2](571500/4588595) loss:2.683 lr:0.0000100 epoch_Time:25513.0min: [2024-01-05 06:09:29,814][model8_pretrain.py][INFO] Epoch:[0/2](571500/4588595) loss:2.405 lr:0.0000100 epoch_Time:25513.0min: [2024-01-05 06:09:29,814][model8_pretrain.py][INFO] Epoch:[0/2](571500/4588595) loss:2.844 lr:0.0000100 epoch_Time:25513.0min: [2024-01-05 06:09:29,814][model8_pretrain.py][INFO] Epoch:[0/2](571500/4588595) loss:2.495 lr:0.0000100 epoch_Time:25513.0min: [2024-01-05 06:09:29,814][model8_pretrain.py][INFO] Epoch:[0/2](571500/4588595) loss:2.940 lr:0.0000100 epoch_Time:25513.0min: [2024-01-05 06:09:29,814][model8_pretrain.py][INFO] Epoch:[0/2](571500/4588595) loss:2.563 lr:0.0000100 epoch_Time:25513.0min: [2024-01-05 06:09:29,814][model8_pretrain.py][INFO] Epoch:[0/2](571500/4588595) loss:3.043 lr:0.0000100 epoch_Time:25513.0min: [2024-01-05 06:09:29,814][model8_pretrain.py][INFO] Epoch:[0/2](571500/4588595) loss:2.825 lr:0.0000100 epoch_Time:25513.0min: [2024-01-05 06:10:06,748][model8_pretrain.py][INFO] Epoch:[0/2](571600/4588595) loss:2.850 lr:0.0000100 epoch_Time:25512.0min: [2024-01-05 06:10:06,748][model8_pretrain.py][INFO] Epoch:[0/2](571600/4588595) loss:3.268 lr:0.0000100 epoch_Time:25512.0min: [2024-01-05 06:10:06,748][model8_pretrain.py][INFO] Epoch:[0/2](571600/4588595) loss:2.469 lr:0.0000100 epoch_Time:25512.0min: [2024-01-05 06:10:06,748][model8_pretrain.py][INFO] Epoch:[0/2](571600/4588595) loss:2.866 lr:0.0000100 epoch_Time:25512.0min: [2024-01-05 06:10:06,748][model8_pretrain.py][INFO] Epoch:[0/2](571600/4588595) loss:2.789 lr:0.0000100 
epoch_Time:25512.0min: [2024-01-05 06:10:06,748][model8_pretrain.py][INFO] Epoch:[0/2](571600/4588595) loss:3.023 lr:0.0000100 epoch_Time:25512.0min: [2024-01-05 06:10:06,748][model8_pretrain.py][INFO] Epoch:[0/2](571600/4588595) loss:2.962 lr:0.0000100 epoch_Time:25512.0min: [2024-01-05 06:10:06,749][model8_pretrain.py][INFO] Epoch:[0/2](571600/4588595) loss:2.173 lr:0.0000100 epoch_Time:25512.0min: [2024-01-05 06:10:43,691][model8_pretrain.py][INFO] Epoch:[0/2](571700/4588595) loss:2.993 lr:0.0000100 epoch_Time:25512.0min: [2024-01-05 06:10:43,691][model8_pretrain.py][INFO] Epoch:[0/2](571700/4588595) loss:2.889 lr:0.0000100 epoch_Time:25512.0min: [2024-01-05 06:10:43,691][model8_pretrain.py][INFO] Epoch:[0/2](571700/4588595) loss:2.753 lr:0.0000100 epoch_Time:25512.0min: [2024-01-05 06:10:43,691][model8_pretrain.py][INFO] Epoch:[0/2](571700/4588595) loss:3.224 lr:0.0000100 epoch_Time:25512.0min: [2024-01-05 06:10:43,691][model8_pretrain.py][INFO] Epoch:[0/2](571700/4588595) loss:3.063 lr:0.0000100 epoch_Time:25512.0min: [2024-01-05 06:10:43,691][model8_pretrain.py][INFO] Epoch:[0/2](571700/4588595) loss:2.966 lr:0.0000100 epoch_Time:25512.0min: [2024-01-05 06:10:43,691][model8_pretrain.py][INFO] Epoch:[0/2](571700/4588595) loss:2.996 lr:0.0000100 epoch_Time:25512.0min: [2024-01-05 06:10:43,691][model8_pretrain.py][INFO] Epoch:[0/2](571700/4588595) loss:2.518 lr:0.0000100 epoch_Time:25512.0min: [2024-01-05 06:11:20,627][model8_pretrain.py][INFO] Epoch:[0/2](571800/4588595) loss:3.037 lr:0.0000100 epoch_Time:25511.0min: [2024-01-05 06:11:20,627][model8_pretrain.py][INFO] Epoch:[0/2](571800/4588595) loss:3.172 lr:0.0000100 epoch_Time:25511.0min: [2024-01-05 06:11:20,627][model8_pretrain.py][INFO] Epoch:[0/2](571800/4588595) loss:2.417 lr:0.0000100 epoch_Time:25511.0min: [2024-01-05 06:11:20,627][model8_pretrain.py][INFO] Epoch:[0/2](571800/4588595) loss:2.960 lr:0.0000100 epoch_Time:25511.0min: [2024-01-05 06:11:20,627][model8_pretrain.py][INFO] 
Epoch:[0/2](571800/4588595) loss:3.068 lr:0.0000100 epoch_Time:25511.0min: [2024-01-05 06:11:20,627][model8_pretrain.py][INFO] Epoch:[0/2](571800/4588595) loss:3.070 lr:0.0000100 epoch_Time:25511.0min: [2024-01-05 06:11:20,627][model8_pretrain.py][INFO] Epoch:[0/2](571800/4588595) loss:2.742 lr:0.0000100 epoch_Time:25511.0min: [2024-01-05 06:11:20,627][model8_pretrain.py][INFO] Epoch:[0/2](571800/4588595) loss:3.380 lr:0.0000100 epoch_Time:25511.0min: [2024-01-05 06:11:57,554][model8_pretrain.py][INFO] Epoch:[0/2](571900/4588595) loss:2.597 lr:0.0000100 epoch_Time:25510.0min: [2024-01-05 06:11:57,554][model8_pretrain.py][INFO] Epoch:[0/2](571900/4588595) loss:2.389 lr:0.0000100 epoch_Time:25510.0min: [2024-01-05 06:11:57,554][model8_pretrain.py][INFO] Epoch:[0/2](571900/4588595) loss:3.066 lr:0.0000100 epoch_Time:25510.0min: [2024-01-05 06:11:57,554][model8_pretrain.py][INFO] Epoch:[0/2](571900/4588595) loss:2.380 lr:0.0000100 epoch_Time:25510.0min: [2024-01-05 06:11:57,554][model8_pretrain.py][INFO] Epoch:[0/2](571900/4588595) loss:2.907 lr:0.0000100 epoch_Time:25510.0min: [2024-01-05 06:11:57,554][model8_pretrain.py][INFO] Epoch:[0/2](571900/4588595) loss:2.608 lr:0.0000100 epoch_Time:25510.0min: [2024-01-05 06:11:57,554][model8_pretrain.py][INFO] Epoch:[0/2](571900/4588595) loss:3.172 lr:0.0000100 epoch_Time:25510.0min: [2024-01-05 06:11:57,554][model8_pretrain.py][INFO] Epoch:[0/2](571900/4588595) loss:2.424 lr:0.0000100 epoch_Time:25510.0min: [2024-01-05 06:12:41,231][model8_pretrain.py][INFO] Epoch:[0/2](572000/4588595) loss:2.887 lr:0.0000100 epoch_Time:25511.0min: [2024-01-05 06:12:41,231][model8_pretrain.py][INFO] Epoch:[0/2](572000/4588595) loss:2.811 lr:0.0000100 epoch_Time:25511.0min: [2024-01-05 06:12:41,231][model8_pretrain.py][INFO] Epoch:[0/2](572000/4588595) loss:2.738 lr:0.0000100 epoch_Time:25511.0min: [2024-01-05 06:12:41,231][model8_pretrain.py][INFO] Epoch:[0/2](572000/4588595) loss:3.424 lr:0.0000100 epoch_Time:25511.0min: [2024-01-05 
06:12:41,231][model8_pretrain.py][INFO] Epoch:[0/2](572000/4588595) loss:2.624 lr:0.0000100 epoch_Time:25511.0min: [2024-01-05 06:12:41,231][model8_pretrain.py][INFO] Epoch:[0/2](572000/4588595) loss:3.268 lr:0.0000100 epoch_Time:25511.0min: [2024-01-05 06:12:41,231][model8_pretrain.py][INFO] Epoch:[0/2](572000/4588595) loss:2.782 lr:0.0000100 epoch_Time:25511.0min: [2024-01-05 06:12:41,231][model8_pretrain.py][INFO] Epoch:[0/2](572000/4588595) loss:2.948 lr:0.0000100 epoch_Time:25511.0min: [2024-01-05 06:13:23,287][model8_pretrain.py][INFO] Epoch:[0/2](572100/4588595) loss:2.777 lr:0.0000100 epoch_Time:25510.0min: [2024-01-05 06:13:23,287][model8_pretrain.py][INFO] Epoch:[0/2](572100/4588595) loss:3.063 lr:0.0000100 epoch_Time:25510.0min: [2024-01-05 06:13:23,287][model8_pretrain.py][INFO] Epoch:[0/2](572100/4588595) loss:2.663 lr:0.0000100 epoch_Time:25510.0min: [2024-01-05 06:13:23,287][model8_pretrain.py][INFO] Epoch:[0/2](572100/4588595) loss:2.720 lr:0.0000100 epoch_Time:25510.0min: [2024-01-05 06:13:23,287][model8_pretrain.py][INFO] Epoch:[0/2](572100/4588595) loss:3.627 lr:0.0000100 epoch_Time:25510.0min: [2024-01-05 06:13:23,287][model8_pretrain.py][INFO] Epoch:[0/2](572100/4588595) loss:2.660 lr:0.0000100 epoch_Time:25510.0min: [2024-01-05 06:13:23,287][model8_pretrain.py][INFO] Epoch:[0/2](572100/4588595) loss:2.880 lr:0.0000100 epoch_Time:25510.0min: [2024-01-05 06:13:23,287][model8_pretrain.py][INFO] Epoch:[0/2](572100/4588595) loss:2.929 lr:0.0000100 epoch_Time:25510.0min: [2024-01-05 06:14:00,216][model8_pretrain.py][INFO] Epoch:[0/2](572200/4588595) loss:3.260 lr:0.0000100 epoch_Time:25509.0min: [2024-01-05 06:14:00,216][model8_pretrain.py][INFO] Epoch:[0/2](572200/4588595) loss:2.978 lr:0.0000100 epoch_Time:25509.0min: [2024-01-05 06:14:00,216][model8_pretrain.py][INFO] Epoch:[0/2](572200/4588595) loss:3.023 lr:0.0000100 epoch_Time:25509.0min: [2024-01-05 06:14:00,216][model8_pretrain.py][INFO] Epoch:[0/2](572200/4588595) loss:2.671 lr:0.0000100 
epoch_Time:25509.0min: [2024-01-05 06:14:00,216][model8_pretrain.py][INFO] Epoch:[0/2](572200/4588595) loss:3.530 lr:0.0000100 epoch_Time:25509.0min: [2024-01-05 06:14:00,216][model8_pretrain.py][INFO] Epoch:[0/2](572200/4588595) loss:2.907 lr:0.0000100 epoch_Time:25509.0min: [2024-01-05 06:14:00,216][model8_pretrain.py][INFO] Epoch:[0/2](572200/4588595) loss:3.194 lr:0.0000100 epoch_Time:25509.0min: [2024-01-05 06:14:00,216][model8_pretrain.py][INFO] Epoch:[0/2](572200/4588595) loss:2.368 lr:0.0000100 epoch_Time:25509.0min: [2024-01-05 06:14:37,153][model8_pretrain.py][INFO] Epoch:[0/2](572300/4588595) loss:3.011 lr:0.0000100 epoch_Time:25509.0min: [2024-01-05 06:14:37,153][model8_pretrain.py][INFO] Epoch:[0/2](572300/4588595) loss:2.916 lr:0.0000100 epoch_Time:25509.0min: [2024-01-05 06:14:37,153][model8_pretrain.py][INFO] Epoch:[0/2](572300/4588595) loss:2.322 lr:0.0000100 epoch_Time:25509.0min: [2024-01-05 06:14:37,153][model8_pretrain.py][INFO] Epoch:[0/2](572300/4588595) loss:2.819 lr:0.0000100 epoch_Time:25509.0min: [2024-01-05 06:14:37,153][model8_pretrain.py][INFO] Epoch:[0/2](572300/4588595) loss:3.444 lr:0.0000100 epoch_Time:25509.0min: [2024-01-05 06:14:37,153][model8_pretrain.py][INFO] Epoch:[0/2](572300/4588595) loss:3.134 lr:0.0000100 epoch_Time:25509.0min: [2024-01-05 06:14:37,153][model8_pretrain.py][INFO] Epoch:[0/2](572300/4588595) loss:3.032 lr:0.0000100 epoch_Time:25509.0min: [2024-01-05 06:14:37,153][model8_pretrain.py][INFO] Epoch:[0/2](572300/4588595) loss:3.271 lr:0.0000100 epoch_Time:25509.0min: [2024-01-05 06:15:14,093][model8_pretrain.py][INFO] Epoch:[0/2](572400/4588595) loss:2.842 lr:0.0000100 epoch_Time:25508.0min: [2024-01-05 06:15:14,093][model8_pretrain.py][INFO] Epoch:[0/2](572400/4588595) loss:2.968 lr:0.0000100 epoch_Time:25508.0min: [2024-01-05 06:15:14,093][model8_pretrain.py][INFO] Epoch:[0/2](572400/4588595) loss:2.979 lr:0.0000100 epoch_Time:25508.0min: [2024-01-05 06:15:14,093][model8_pretrain.py][INFO] 
Epoch:[0/2](572400/4588595) loss:2.655 lr:0.0000100 epoch_Time:25508.0min: [2024-01-05 06:15:14,093][model8_pretrain.py][INFO] Epoch:[0/2](572400/4588595) loss:2.621 lr:0.0000100 epoch_Time:25508.0min: [2024-01-05 06:15:14,093][model8_pretrain.py][INFO] Epoch:[0/2](572400/4588595) loss:2.121 lr:0.0000100 epoch_Time:25508.0min: [2024-01-05 06:15:14,093][model8_pretrain.py][INFO] Epoch:[0/2](572400/4588595) loss:3.245 lr:0.0000100 epoch_Time:25508.0min: [2024-01-05 06:15:14,093][model8_pretrain.py][INFO] Epoch:[0/2](572400/4588595) loss:3.026 lr:0.0000100 epoch_Time:25508.0min: [2024-01-05 06:15:51,030][model8_pretrain.py][INFO] Epoch:[0/2](572500/4588595) loss:3.227 lr:0.0000100 epoch_Time:25506.0min: [2024-01-05 06:15:51,030][model8_pretrain.py][INFO] Epoch:[0/2](572500/4588595) loss:3.082 lr:0.0000100 epoch_Time:25506.0min: [2024-01-05 06:15:51,030][model8_pretrain.py][INFO] Epoch:[0/2](572500/4588595) loss:2.839 lr:0.0000100 epoch_Time:25506.0min: [2024-01-05 06:15:51,030][model8_pretrain.py][INFO] Epoch:[0/2](572500/4588595) loss:2.775 lr:0.0000100 epoch_Time:25506.0min: [2024-01-05 06:15:51,031][model8_pretrain.py][INFO] Epoch:[0/2](572500/4588595) loss:2.464 lr:0.0000100 epoch_Time:25506.0min: [2024-01-05 06:15:51,031][model8_pretrain.py][INFO] Epoch:[0/2](572500/4588595) loss:2.651 lr:0.0000100 epoch_Time:25506.0min: [2024-01-05 06:15:51,031][model8_pretrain.py][INFO] Epoch:[0/2](572500/4588595) loss:2.501 lr:0.0000100 epoch_Time:25506.0min: [2024-01-05 06:15:51,031][model8_pretrain.py][INFO] Epoch:[0/2](572500/4588595) loss:2.848 lr:0.0000100 epoch_Time:25506.0min: [2024-01-05 06:16:27,979][model8_pretrain.py][INFO] Epoch:[0/2](572600/4588595) loss:2.476 lr:0.0000100 epoch_Time:25506.0min: [2024-01-05 06:16:27,979][model8_pretrain.py][INFO] Epoch:[0/2](572600/4588595) loss:2.535 lr:0.0000100 epoch_Time:25506.0min: [2024-01-05 06:16:27,979][model8_pretrain.py][INFO] Epoch:[0/2](572600/4588595) loss:2.988 lr:0.0000100 epoch_Time:25506.0min: [2024-01-05 
06:16:27,979][model8_pretrain.py][INFO] Epoch:[0/2](572600/4588595) loss:2.511 lr:0.0000100 epoch_Time:25506.0min: [2024-01-05 06:16:27,979][model8_pretrain.py][INFO] Epoch:[0/2](572600/4588595) loss:2.610 lr:0.0000100 epoch_Time:25506.0min: [2024-01-05 06:16:27,979][model8_pretrain.py][INFO] Epoch:[0/2](572600/4588595) loss:3.273 lr:0.0000100 epoch_Time:25506.0min: [2024-01-05 06:16:27,979][model8_pretrain.py][INFO] Epoch:[0/2](572600/4588595) loss:2.586 lr:0.0000100 epoch_Time:25506.0min: [2024-01-05 06:16:27,979][model8_pretrain.py][INFO] Epoch:[0/2](572600/4588595) loss:2.570 lr:0.0000100 epoch_Time:25506.0min: [2024-01-05 06:17:04,951][model8_pretrain.py][INFO] Epoch:[0/2](572700/4588595) loss:2.596 lr:0.0000100 epoch_Time:25505.0min: [2024-01-05 06:17:04,951][model8_pretrain.py][INFO] Epoch:[0/2](572700/4588595) loss:2.594 lr:0.0000100 epoch_Time:25505.0min: [2024-01-05 06:17:04,951][model8_pretrain.py][INFO] Epoch:[0/2](572700/4588595) loss:3.120 lr:0.0000100 epoch_Time:25505.0min: [2024-01-05 06:17:04,951][model8_pretrain.py][INFO] Epoch:[0/2](572700/4588595) loss:2.928 lr:0.0000100 epoch_Time:25505.0min: [2024-01-05 06:17:04,951][model8_pretrain.py][INFO] Epoch:[0/2](572700/4588595) loss:2.609 lr:0.0000100 epoch_Time:25505.0min: [2024-01-05 06:17:04,951][model8_pretrain.py][INFO] Epoch:[0/2](572700/4588595) loss:3.028 lr:0.0000100 epoch_Time:25505.0min: [2024-01-05 06:17:04,951][model8_pretrain.py][INFO] Epoch:[0/2](572700/4588595) loss:2.838 lr:0.0000100 epoch_Time:25505.0min: [2024-01-05 06:17:04,951][model8_pretrain.py][INFO] Epoch:[0/2](572700/4588595) loss:2.824 lr:0.0000100 epoch_Time:25505.0min: [2024-01-05 06:17:47,130][model8_pretrain.py][INFO] Epoch:[0/2](572800/4588595) loss:2.934 lr:0.0000100 epoch_Time:25506.0min: [2024-01-05 06:17:47,130][model8_pretrain.py][INFO] Epoch:[0/2](572800/4588595) loss:2.976 lr:0.0000100 epoch_Time:25506.0min: [2024-01-05 06:17:47,130][model8_pretrain.py][INFO] Epoch:[0/2](572800/4588595) loss:3.242 lr:0.0000100 
epoch_Time:25506.0min: [2024-01-05 06:17:47,130][model8_pretrain.py][INFO] Epoch:[0/2](572800/4588595) loss:3.111 lr:0.0000100 epoch_Time:25506.0min: [2024-01-05 06:17:47,134][model8_pretrain.py][INFO] Epoch:[0/2](572800/4588595) loss:2.961 lr:0.0000100 epoch_Time:25506.0min: [2024-01-05 06:17:47,134][model8_pretrain.py][INFO] Epoch:[0/2](572800/4588595) loss:2.445 lr:0.0000100 epoch_Time:25506.0min: [2024-01-05 06:17:47,135][model8_pretrain.py][INFO] Epoch:[0/2](572800/4588595) loss:2.819 lr:0.0000100 epoch_Time:25506.0min: [2024-01-05 06:17:47,135][model8_pretrain.py][INFO] Epoch:[0/2](572800/4588595) loss:2.837 lr:0.0000100 epoch_Time:25506.0min: [2024-01-05 06:18:30,974][model8_pretrain.py][INFO] Epoch:[0/2](572900/4588595) loss:3.253 lr:0.0000100 epoch_Time:25505.0min: [2024-01-05 06:18:30,974][model8_pretrain.py][INFO] Epoch:[0/2](572900/4588595) loss:2.926 lr:0.0000100 epoch_Time:25505.0min: [2024-01-05 06:18:30,974][model8_pretrain.py][INFO] Epoch:[0/2](572900/4588595) loss:2.182 lr:0.0000100 epoch_Time:25505.0min: [2024-01-05 06:18:30,974][model8_pretrain.py][INFO] Epoch:[0/2](572900/4588595) loss:2.924 lr:0.0000100 epoch_Time:25505.0min: [2024-01-05 06:18:30,974][model8_pretrain.py][INFO] Epoch:[0/2](572900/4588595) loss:2.740 lr:0.0000100 epoch_Time:25506.0min: [2024-01-05 06:18:30,974][model8_pretrain.py][INFO] Epoch:[0/2](572900/4588595) loss:2.889 lr:0.0000100 epoch_Time:25506.0min: [2024-01-05 06:18:30,974][model8_pretrain.py][INFO] Epoch:[0/2](572900/4588595) loss:3.017 lr:0.0000100 epoch_Time:25505.0min: [2024-01-05 06:18:30,975][model8_pretrain.py][INFO] Epoch:[0/2](572900/4588595) loss:2.600 lr:0.0000100 epoch_Time:25505.0min: [2024-01-05 06:19:07,965][model8_pretrain.py][INFO] Epoch:[0/2](573000/4588595) loss:2.733 lr:0.0000100 epoch_Time:25504.0min: [2024-01-05 06:19:07,965][model8_pretrain.py][INFO] Epoch:[0/2](573000/4588595) loss:3.364 lr:0.0000100 epoch_Time:25504.0min: [2024-01-05 06:19:07,965][model8_pretrain.py][INFO] 
Epoch:[0/2](573000/4588595) loss:2.336 lr:0.0000100 epoch_Time:25504.0min: [2024-01-05 06:19:07,965][model8_pretrain.py][INFO] Epoch:[0/2](573000/4588595) loss:2.762 lr:0.0000100 epoch_Time:25504.0min: [2024-01-05 06:19:07,965][model8_pretrain.py][INFO] Epoch:[0/2](573000/4588595) loss:2.405 lr:0.0000100 epoch_Time:25504.0min: [2024-01-05 06:19:07,965][model8_pretrain.py][INFO] Epoch:[0/2](573000/4588595) loss:2.762 lr:0.0000100 epoch_Time:25504.0min: [2024-01-05 06:19:07,965][model8_pretrain.py][INFO] Epoch:[0/2](573000/4588595) loss:2.958 lr:0.0000100 epoch_Time:25504.0min: [2024-01-05 06:19:07,965][model8_pretrain.py][INFO] Epoch:[0/2](573000/4588595) loss:2.723 lr:0.0000100 epoch_Time:25504.0min: [2024-01-05 06:19:44,935][model8_pretrain.py][INFO] Epoch:[0/2](573100/4588595) loss:2.760 lr:0.0000100 epoch_Time:25504.0min: [2024-01-05 06:19:44,935][model8_pretrain.py][INFO] Epoch:[0/2](573100/4588595) loss:2.406 lr:0.0000100 epoch_Time:25504.0min: [2024-01-05 06:19:44,935][model8_pretrain.py][INFO] Epoch:[0/2](573100/4588595) loss:2.610 lr:0.0000100 epoch_Time:25504.0min: [2024-01-05 06:19:44,935][model8_pretrain.py][INFO] Epoch:[0/2](573100/4588595) loss:2.809 lr:0.0000100 epoch_Time:25504.0min: [2024-01-05 06:19:44,935][model8_pretrain.py][INFO] Epoch:[0/2](573100/4588595) loss:2.791 lr:0.0000100 epoch_Time:25504.0min: [2024-01-05 06:19:44,935][model8_pretrain.py][INFO] Epoch:[0/2](573100/4588595) loss:2.424 lr:0.0000100 epoch_Time:25504.0min: [2024-01-05 06:19:44,935][model8_pretrain.py][INFO] Epoch:[0/2](573100/4588595) loss:2.641 lr:0.0000100 epoch_Time:25504.0min: [2024-01-05 06:19:44,935][model8_pretrain.py][INFO] Epoch:[0/2](573100/4588595) loss:2.773 lr:0.0000100 epoch_Time:25504.0min: [2024-01-05 06:20:21,869][model8_pretrain.py][INFO] Epoch:[0/2](573200/4588595) loss:3.005 lr:0.0000100 epoch_Time:25503.0min: [2024-01-05 06:20:21,869][model8_pretrain.py][INFO] Epoch:[0/2](573200/4588595) loss:2.619 lr:0.0000100 epoch_Time:25503.0min: [2024-01-05 
06:20:21,869][model8_pretrain.py][INFO] Epoch:[0/2](573200/4588595) loss:2.439 lr:0.0000100 epoch_Time:25503.0min: [2024-01-05 06:20:21,869][model8_pretrain.py][INFO] Epoch:[0/2](573200/4588595) loss:2.781 lr:0.0000100 epoch_Time:25503.0min: [2024-01-05 06:20:21,869][model8_pretrain.py][INFO] Epoch:[0/2](573200/4588595) loss:3.215 lr:0.0000100 epoch_Time:25503.0min: [2024-01-05 06:20:21,869][model8_pretrain.py][INFO] Epoch:[0/2](573200/4588595) loss:2.084 lr:0.0000100 epoch_Time:25503.0min: [2024-01-05 06:20:21,869][model8_pretrain.py][INFO] Epoch:[0/2](573200/4588595) loss:3.388 lr:0.0000100 epoch_Time:25503.0min: [2024-01-05 06:20:21,869][model8_pretrain.py][INFO] Epoch:[0/2](573200/4588595) loss:2.686 lr:0.0000100 epoch_Time:25503.0min: [2024-01-05 06:20:58,828][model8_pretrain.py][INFO] Epoch:[0/2](573300/4588595) loss:3.038 lr:0.0000100 epoch_Time:25502.0min: [2024-01-05 06:20:58,828][model8_pretrain.py][INFO] Epoch:[0/2](573300/4588595) loss:2.847 lr:0.0000100 epoch_Time:25502.0min: [2024-01-05 06:20:58,828][model8_pretrain.py][INFO] Epoch:[0/2](573300/4588595) loss:2.621 lr:0.0000100 epoch_Time:25502.0min: [2024-01-05 06:20:58,828][model8_pretrain.py][INFO] Epoch:[0/2](573300/4588595) loss:2.784 lr:0.0000100 epoch_Time:25502.0min: [2024-01-05 06:20:58,828][model8_pretrain.py][INFO] Epoch:[0/2](573300/4588595) loss:2.304 lr:0.0000100 epoch_Time:25502.0min: [2024-01-05 06:20:58,828][model8_pretrain.py][INFO] Epoch:[0/2](573300/4588595) loss:3.093 lr:0.0000100 epoch_Time:25502.0min: [2024-01-05 06:20:58,828][model8_pretrain.py][INFO] Epoch:[0/2](573300/4588595) loss:2.815 lr:0.0000100 epoch_Time:25502.0min: [2024-01-05 06:20:58,828][model8_pretrain.py][INFO] Epoch:[0/2](573300/4588595) loss:2.504 lr:0.0000100 epoch_Time:25502.0min: [2024-01-05 06:21:35,761][model8_pretrain.py][INFO] Epoch:[0/2](573400/4588595) loss:3.196 lr:0.0000100 epoch_Time:25502.0min: [2024-01-05 06:21:35,761][model8_pretrain.py][INFO] Epoch:[0/2](573400/4588595) loss:2.951 lr:0.0000100 
epoch_Time:25502.0min: [2024-01-05 06:21:35,761][model8_pretrain.py][INFO] Epoch:[0/2](573400/4588595) loss:2.199 lr:0.0000100 epoch_Time:25502.0min: [2024-01-05 06:21:35,761][model8_pretrain.py][INFO] Epoch:[0/2](573400/4588595) loss:2.974 lr:0.0000100 epoch_Time:25502.0min: [2024-01-05 06:21:35,761][model8_pretrain.py][INFO] Epoch:[0/2](573400/4588595) loss:3.223 lr:0.0000100 epoch_Time:25502.0min: [2024-01-05 06:21:35,761][model8_pretrain.py][INFO] Epoch:[0/2](573400/4588595) loss:3.196 lr:0.0000100 epoch_Time:25502.0min: [2024-01-05 06:21:35,761][model8_pretrain.py][INFO] Epoch:[0/2](573400/4588595) loss:2.919 lr:0.0000100 epoch_Time:25502.0min: [2024-01-05 06:21:35,761][model8_pretrain.py][INFO] Epoch:[0/2](573400/4588595) loss:3.499 lr:0.0000100 epoch_Time:25502.0min: [2024-01-05 06:22:12,698][model8_pretrain.py][INFO] Epoch:[0/2](573500/4588595) loss:3.061 lr:0.0000100 epoch_Time:25501.0min: [2024-01-05 06:22:12,698][model8_pretrain.py][INFO] Epoch:[0/2](573500/4588595) loss:2.908 lr:0.0000100 epoch_Time:25501.0min: [2024-01-05 06:22:12,698][model8_pretrain.py][INFO] Epoch:[0/2](573500/4588595) loss:3.584 lr:0.0000100 epoch_Time:25501.0min: [2024-01-05 06:22:12,698][model8_pretrain.py][INFO] Epoch:[0/2](573500/4588595) loss:3.100 lr:0.0000100 epoch_Time:25501.0min: [2024-01-05 06:22:12,698][model8_pretrain.py][INFO] Epoch:[0/2](573500/4588595) loss:2.645 lr:0.0000100 epoch_Time:25501.0min: [2024-01-05 06:22:12,699][model8_pretrain.py][INFO] Epoch:[0/2](573500/4588595) loss:2.817 lr:0.0000100 epoch_Time:25501.0min: [2024-01-05 06:22:12,699][model8_pretrain.py][INFO] Epoch:[0/2](573500/4588595) loss:3.043 lr:0.0000100 epoch_Time:25501.0min: [2024-01-05 06:22:12,700][model8_pretrain.py][INFO] Epoch:[0/2](573500/4588595) loss:2.487 lr:0.0000100 epoch_Time:25501.0min: [2024-01-05 06:22:51,399][model8_pretrain.py][INFO] Epoch:[0/2](573600/4588595) loss:3.260 lr:0.0000100 epoch_Time:25500.0min: [2024-01-05 06:22:51,399][model8_pretrain.py][INFO] 
Epoch:[0/2](573600/4588595) loss:3.074 lr:0.0000100 epoch_Time:25500.0min: [2024-01-05 06:22:51,399][model8_pretrain.py][INFO] Epoch:[0/2](573600/4588595) loss:2.806 lr:0.0000100 epoch_Time:25500.0min: [2024-01-05 06:22:51,399][model8_pretrain.py][INFO] Epoch:[0/2](573600/4588595) loss:3.079 lr:0.0000100 epoch_Time:25500.0min: [2024-01-05 06:22:51,399][model8_pretrain.py][INFO] Epoch:[0/2](573600/4588595) loss:2.950 lr:0.0000100 epoch_Time:25500.0min: [2024-01-05 06:22:51,400][model8_pretrain.py][INFO] Epoch:[0/2](573600/4588595) loss:2.997 lr:0.0000100 epoch_Time:25500.0min: [2024-01-05 06:22:51,399][model8_pretrain.py][INFO] Epoch:[0/2](573600/4588595) loss:2.678 lr:0.0000100 epoch_Time:25500.0min: [2024-01-05 06:22:51,399][model8_pretrain.py][INFO] Epoch:[0/2](573600/4588595) loss:3.381 lr:0.0000100 epoch_Time:25500.0min: [2024-01-05 06:23:37,096][model8_pretrain.py][INFO] Epoch:[0/2](573700/4588595) loss:2.553 lr:0.0000100 epoch_Time:25501.0min: [2024-01-05 06:23:37,096][model8_pretrain.py][INFO] Epoch:[0/2](573700/4588595) loss:2.616 lr:0.0000100 epoch_Time:25501.0min: [2024-01-05 06:23:37,096][model8_pretrain.py][INFO] Epoch:[0/2](573700/4588595) loss:2.945 lr:0.0000100 epoch_Time:25501.0min: [2024-01-05 06:23:37,096][model8_pretrain.py][INFO] Epoch:[0/2](573700/4588595) loss:3.238 lr:0.0000100 epoch_Time:25501.0min: [2024-01-05 06:23:37,096][model8_pretrain.py][INFO] Epoch:[0/2](573700/4588595) loss:2.778 lr:0.0000100 epoch_Time:25501.0min: [2024-01-05 06:23:37,096][model8_pretrain.py][INFO] Epoch:[0/2](573700/4588595) loss:3.531 lr:0.0000100 epoch_Time:25501.0min: [2024-01-05 06:23:37,096][model8_pretrain.py][INFO] Epoch:[0/2](573700/4588595) loss:3.110 lr:0.0000100 epoch_Time:25501.0min: [2024-01-05 06:23:37,096][model8_pretrain.py][INFO] Epoch:[0/2](573700/4588595) loss:3.312 lr:0.0000100 epoch_Time:25501.0min: [2024-01-05 06:24:14,032][model8_pretrain.py][INFO] Epoch:[0/2](573800/4588595) loss:3.064 lr:0.0000100 epoch_Time:25500.0min: [2024-01-05 
06:24:14,032][model8_pretrain.py][INFO] Epoch:[0/2](573800/4588595) loss:2.690 lr:0.0000100 epoch_Time:25500.0min: [2024-01-05 06:24:14,032][model8_pretrain.py][INFO] Epoch:[0/2](573800/4588595) loss:3.053 lr:0.0000100 epoch_Time:25500.0min: [2024-01-05 06:24:14,032][model8_pretrain.py][INFO] Epoch:[0/2](573800/4588595) loss:2.426 lr:0.0000100 epoch_Time:25500.0min: [2024-01-05 06:24:14,032][model8_pretrain.py][INFO] Epoch:[0/2](573800/4588595) loss:3.312 lr:0.0000100 epoch_Time:25500.0min: [2024-01-05 06:24:14,032][model8_pretrain.py][INFO] Epoch:[0/2](573800/4588595) loss:3.067 lr:0.0000100 epoch_Time:25500.0min: [2024-01-05 06:24:14,033][model8_pretrain.py][INFO] Epoch:[0/2](573800/4588595) loss:2.726 lr:0.0000100 epoch_Time:25500.0min: [2024-01-05 06:24:14,033][model8_pretrain.py][INFO] Epoch:[0/2](573800/4588595) loss:3.020 lr:0.0000100 epoch_Time:25500.0min: [2024-01-05 06:24:50,988][model8_pretrain.py][INFO] Epoch:[0/2](573900/4588595) loss:2.700 lr:0.0000100 epoch_Time:25498.0min: [2024-01-05 06:24:50,988][model8_pretrain.py][INFO] Epoch:[0/2](573900/4588595) loss:2.682 lr:0.0000100 epoch_Time:25498.0min: [2024-01-05 06:24:50,988][model8_pretrain.py][INFO] Epoch:[0/2](573900/4588595) loss:3.357 lr:0.0000100 epoch_Time:25498.0min: [2024-01-05 06:24:50,988][model8_pretrain.py][INFO] Epoch:[0/2](573900/4588595) loss:2.724 lr:0.0000100 epoch_Time:25498.0min: [2024-01-05 06:24:50,988][model8_pretrain.py][INFO] Epoch:[0/2](573900/4588595) loss:2.322 lr:0.0000100 epoch_Time:25498.0min: [2024-01-05 06:24:50,988][model8_pretrain.py][INFO] Epoch:[0/2](573900/4588595) loss:3.029 lr:0.0000100 epoch_Time:25498.0min: [2024-01-05 06:24:50,989][model8_pretrain.py][INFO] Epoch:[0/2](573900/4588595) loss:3.056 lr:0.0000100 epoch_Time:25498.0min: [2024-01-05 06:24:50,989][model8_pretrain.py][INFO] Epoch:[0/2](573900/4588595) loss:2.442 lr:0.0000100 epoch_Time:25498.0min: [2024-01-05 06:25:27,932][model8_pretrain.py][INFO] Epoch:[0/2](574000/4588595) loss:2.878 lr:0.0000100 
epoch_Time:25498.0min: [2024-01-05 06:25:27,932][model8_pretrain.py][INFO] Epoch:[0/2](574000/4588595) loss:2.963 lr:0.0000100 epoch_Time:25498.0min: [2024-01-05 06:25:27,932][model8_pretrain.py][INFO] Epoch:[0/2](574000/4588595) loss:2.744 lr:0.0000100 epoch_Time:25498.0min: [2024-01-05 06:25:27,932][model8_pretrain.py][INFO] Epoch:[0/2](574000/4588595) loss:3.235 lr:0.0000100 epoch_Time:25498.0min: [2024-01-05 06:25:27,932][model8_pretrain.py][INFO] Epoch:[0/2](574000/4588595) loss:2.556 lr:0.0000100 epoch_Time:25498.0min: [2024-01-05 06:25:27,933][model8_pretrain.py][INFO] Epoch:[0/2](574000/4588595) loss:2.585 lr:0.0000100 epoch_Time:25498.0min: [2024-01-05 06:25:27,933][model8_pretrain.py][INFO] Epoch:[0/2](574000/4588595) loss:3.051 lr:0.0000100 epoch_Time:25498.0min: [2024-01-05 06:25:27,933][model8_pretrain.py][INFO] Epoch:[0/2](574000/4588595) loss:3.063 lr:0.0000100 epoch_Time:25498.0min: [2024-01-05 06:26:04,877][model8_pretrain.py][INFO] Epoch:[0/2](574100/4588595) loss:3.306 lr:0.0000100 epoch_Time:25497.0min: [2024-01-05 06:26:04,877][model8_pretrain.py][INFO] Epoch:[0/2](574100/4588595) loss:3.047 lr:0.0000100 epoch_Time:25497.0min: [2024-01-05 06:26:04,877][model8_pretrain.py][INFO] Epoch:[0/2](574100/4588595) loss:2.609 lr:0.0000100 epoch_Time:25497.0min: [2024-01-05 06:26:04,877][model8_pretrain.py][INFO] Epoch:[0/2](574100/4588595) loss:2.765 lr:0.0000100 epoch_Time:25497.0min: [2024-01-05 06:26:04,877][model8_pretrain.py][INFO] Epoch:[0/2](574100/4588595) loss:2.548 lr:0.0000100 epoch_Time:25497.0min: [2024-01-05 06:26:04,877][model8_pretrain.py][INFO] Epoch:[0/2](574100/4588595) loss:3.046 lr:0.0000100 epoch_Time:25497.0min: [2024-01-05 06:26:04,877][model8_pretrain.py][INFO] Epoch:[0/2](574100/4588595) loss:3.132 lr:0.0000100 epoch_Time:25497.0min: [2024-01-05 06:26:04,877][model8_pretrain.py][INFO] Epoch:[0/2](574100/4588595) loss:3.013 lr:0.0000100 epoch_Time:25497.0min: [2024-01-05 06:26:41,812][model8_pretrain.py][INFO] 
Epoch:[0/2](574200/4588595) loss:3.080 lr:0.0000100 epoch_Time:25497.0min: [2024-01-05 06:26:41,812][model8_pretrain.py][INFO] Epoch:[0/2](574200/4588595) loss:2.623 lr:0.0000100 epoch_Time:25497.0min: [2024-01-05 06:26:41,812][model8_pretrain.py][INFO] Epoch:[0/2](574200/4588595) loss:2.046 lr:0.0000100 epoch_Time:25497.0min: [2024-01-05 06:26:41,812][model8_pretrain.py][INFO] Epoch:[0/2](574200/4588595) loss:2.881 lr:0.0000100 epoch_Time:25497.0min: [2024-01-05 06:26:41,812][model8_pretrain.py][INFO] Epoch:[0/2](574200/4588595) loss:2.555 lr:0.0000100 epoch_Time:25497.0min: [2024-01-05 06:26:41,812][model8_pretrain.py][INFO] Epoch:[0/2](574200/4588595) loss:2.461 lr:0.0000100 epoch_Time:25497.0min: [2024-01-05 06:26:41,812][model8_pretrain.py][INFO] Epoch:[0/2](574200/4588595) loss:2.843 lr:0.0000100 epoch_Time:25497.0min: [2024-01-05 06:26:41,812][model8_pretrain.py][INFO] Epoch:[0/2](574200/4588595) loss:2.583 lr:0.0000100 epoch_Time:25497.0min: [2024-01-05 06:27:18,786][model8_pretrain.py][INFO] Epoch:[0/2](574300/4588595) loss:2.642 lr:0.0000100 epoch_Time:25496.0min: [2024-01-05 06:27:18,786][model8_pretrain.py][INFO] Epoch:[0/2](574300/4588595) loss:2.799 lr:0.0000100 epoch_Time:25496.0min: [2024-01-05 06:27:18,786][model8_pretrain.py][INFO] Epoch:[0/2](574300/4588595) loss:3.172 lr:0.0000100 epoch_Time:25496.0min: [2024-01-05 06:27:18,786][model8_pretrain.py][INFO] Epoch:[0/2](574300/4588595) loss:2.883 lr:0.0000100 epoch_Time:25496.0min: [2024-01-05 06:27:18,786][model8_pretrain.py][INFO] Epoch:[0/2](574300/4588595) loss:2.694 lr:0.0000100 epoch_Time:25496.0min: [2024-01-05 06:27:18,786][model8_pretrain.py][INFO] Epoch:[0/2](574300/4588595) loss:3.100 lr:0.0000100 epoch_Time:25496.0min: [2024-01-05 06:27:18,787][model8_pretrain.py][INFO] Epoch:[0/2](574300/4588595) loss:2.270 lr:0.0000100 epoch_Time:25496.0min: [2024-01-05 06:27:18,787][model8_pretrain.py][INFO] Epoch:[0/2](574300/4588595) loss:2.473 lr:0.0000100 epoch_Time:25496.0min: [2024-01-05 
06:27:57,447][model8_pretrain.py][INFO] Epoch:[0/2](574400/4588595) loss:2.924 lr:0.0000100 epoch_Time:25495.0min: [2024-01-05 06:27:57,447][model8_pretrain.py][INFO] Epoch:[0/2](574400/4588595) loss:3.093 lr:0.0000100 epoch_Time:25495.0min: [2024-01-05 06:27:57,447][model8_pretrain.py][INFO] Epoch:[0/2](574400/4588595) loss:2.724 lr:0.0000100 epoch_Time:25495.0min: [2024-01-05 06:27:57,447][model8_pretrain.py][INFO] Epoch:[0/2](574400/4588595) loss:2.797 lr:0.0000100 epoch_Time:25495.0min: [2024-01-05 06:27:57,447][model8_pretrain.py][INFO] Epoch:[0/2](574400/4588595) loss:2.991 lr:0.0000100 epoch_Time:25495.0min: [2024-01-05 06:27:57,447][model8_pretrain.py][INFO] Epoch:[0/2](574400/4588595) loss:3.179 lr:0.0000100 epoch_Time:25495.0min: [2024-01-05 06:27:57,447][model8_pretrain.py][INFO] Epoch:[0/2](574400/4588595) loss:3.022 lr:0.0000100 epoch_Time:25495.0min: [2024-01-05 06:27:57,447][model8_pretrain.py][INFO] Epoch:[0/2](574400/4588595) loss:2.772 lr:0.0000100 epoch_Time:25495.0min: [2024-01-05 06:28:43,075][model8_pretrain.py][INFO] Epoch:[0/2](574500/4588595) loss:2.257 lr:0.0000100 epoch_Time:25496.0min: [2024-01-05 06:28:43,075][model8_pretrain.py][INFO] Epoch:[0/2](574500/4588595) loss:2.551 lr:0.0000100 epoch_Time:25496.0min: [2024-01-05 06:28:43,075][model8_pretrain.py][INFO] Epoch:[0/2](574500/4588595) loss:3.311 lr:0.0000100 epoch_Time:25496.0min: [2024-01-05 06:28:43,075][model8_pretrain.py][INFO] Epoch:[0/2](574500/4588595) loss:3.042 lr:0.0000100 epoch_Time:25496.0min: [2024-01-05 06:28:43,075][model8_pretrain.py][INFO] Epoch:[0/2](574500/4588595) loss:2.390 lr:0.0000100 epoch_Time:25496.0min: [2024-01-05 06:28:43,075][model8_pretrain.py][INFO] Epoch:[0/2](574500/4588595) loss:2.774 lr:0.0000100 epoch_Time:25496.0min: [2024-01-05 06:28:43,075][model8_pretrain.py][INFO] Epoch:[0/2](574500/4588595) loss:2.997 lr:0.0000100 epoch_Time:25496.0min: [2024-01-05 06:28:43,075][model8_pretrain.py][INFO] Epoch:[0/2](574500/4588595) loss:2.890 lr:0.0000100 
epoch_Time:25496.0min: [2024-01-05 06:29:20,012][model8_pretrain.py][INFO] Epoch:[0/2](574600/4588595) loss:2.727 lr:0.0000100 epoch_Time:25495.0min: [2024-01-05 06:29:20,012][model8_pretrain.py][INFO] Epoch:[0/2](574600/4588595) loss:3.120 lr:0.0000100 epoch_Time:25495.0min: [2024-01-05 06:29:20,012][model8_pretrain.py][INFO] Epoch:[0/2](574600/4588595) loss:2.632 lr:0.0000100 epoch_Time:25495.0min: [2024-01-05 06:29:20,012][model8_pretrain.py][INFO] Epoch:[0/2](574600/4588595) loss:2.565 lr:0.0000100 epoch_Time:25495.0min: [2024-01-05 06:29:20,012][model8_pretrain.py][INFO] Epoch:[0/2](574600/4588595) loss:2.781 lr:0.0000100 epoch_Time:25495.0min: [2024-01-05 06:29:20,012][model8_pretrain.py][INFO] Epoch:[0/2](574600/4588595) loss:2.607 lr:0.0000100 epoch_Time:25495.0min: [2024-01-05 06:29:20,012][model8_pretrain.py][INFO] Epoch:[0/2](574600/4588595) loss:3.015 lr:0.0000100 epoch_Time:25495.0min: [2024-01-05 06:29:20,012][model8_pretrain.py][INFO] Epoch:[0/2](574600/4588595) loss:2.958 lr:0.0000100 epoch_Time:25495.0min: [2024-01-05 06:29:56,941][model8_pretrain.py][INFO] Epoch:[0/2](574700/4588595) loss:2.751 lr:0.0000100 epoch_Time:25494.0min: [2024-01-05 06:29:56,941][model8_pretrain.py][INFO] Epoch:[0/2](574700/4588595) loss:2.875 lr:0.0000100 epoch_Time:25494.0min: [2024-01-05 06:29:56,941][model8_pretrain.py][INFO] Epoch:[0/2](574700/4588595) loss:2.541 lr:0.0000100 epoch_Time:25493.0min: [2024-01-05 06:29:56,941][model8_pretrain.py][INFO] Epoch:[0/2](574700/4588595) loss:2.974 lr:0.0000100 epoch_Time:25494.0min: [2024-01-05 06:29:56,941][model8_pretrain.py][INFO] Epoch:[0/2](574700/4588595) loss:2.780 lr:0.0000100 epoch_Time:25493.0min: [2024-01-05 06:29:56,941][model8_pretrain.py][INFO] Epoch:[0/2](574700/4588595) loss:2.387 lr:0.0000100 epoch_Time:25494.0min: [2024-01-05 06:29:56,941][model8_pretrain.py][INFO] Epoch:[0/2](574700/4588595) loss:2.547 lr:0.0000100 epoch_Time:25494.0min: [2024-01-05 06:29:56,941][model8_pretrain.py][INFO] 
Epoch:[0/2](574700/4588595) loss:2.747 lr:0.0000100 epoch_Time:25494.0min: [2024-01-05 06:30:33,886][model8_pretrain.py][INFO] Epoch:[0/2](574800/4588595) loss:3.150 lr:0.0000100 epoch_Time:25493.0min: [2024-01-05 06:30:33,887][model8_pretrain.py][INFO] Epoch:[0/2](574800/4588595) loss:2.836 lr:0.0000100 epoch_Time:25493.0min: [2024-01-05 06:30:33,887][model8_pretrain.py][INFO] Epoch:[0/2](574800/4588595) loss:2.784 lr:0.0000100 epoch_Time:25493.0min: [2024-01-05 06:30:33,887][model8_pretrain.py][INFO] Epoch:[0/2](574800/4588595) loss:3.038 lr:0.0000100 epoch_Time:25493.0min: [2024-01-05 06:30:33,887][model8_pretrain.py][INFO] Epoch:[0/2](574800/4588595) loss:2.525 lr:0.0000100 epoch_Time:25493.0min: [2024-01-05 06:30:33,887][model8_pretrain.py][INFO] Epoch:[0/2](574800/4588595) loss:3.160 lr:0.0000100 epoch_Time:25493.0min: [2024-01-05 06:30:33,887][model8_pretrain.py][INFO] Epoch:[0/2](574800/4588595) loss:3.272 lr:0.0000100 epoch_Time:25493.0min: [2024-01-05 06:30:33,887][model8_pretrain.py][INFO] Epoch:[0/2](574800/4588595) loss:3.103 lr:0.0000100 epoch_Time:25493.0min: [2024-01-05 06:31:10,814][model8_pretrain.py][INFO] Epoch:[0/2](574900/4588595) loss:3.352 lr:0.0000100 epoch_Time:25492.0min: [2024-01-05 06:31:10,814][model8_pretrain.py][INFO] Epoch:[0/2](574900/4588595) loss:2.753 lr:0.0000100 epoch_Time:25492.0min: [2024-01-05 06:31:10,814][model8_pretrain.py][INFO] Epoch:[0/2](574900/4588595) loss:2.680 lr:0.0000100 epoch_Time:25492.0min: [2024-01-05 06:31:10,814][model8_pretrain.py][INFO] Epoch:[0/2](574900/4588595) loss:3.218 lr:0.0000100 epoch_Time:25492.0min: [2024-01-05 06:31:10,814][model8_pretrain.py][INFO] Epoch:[0/2](574900/4588595) loss:2.657 lr:0.0000100 epoch_Time:25492.0min: [2024-01-05 06:31:10,814][model8_pretrain.py][INFO] Epoch:[0/2](574900/4588595) loss:2.920 lr:0.0000100 epoch_Time:25492.0min: [2024-01-05 06:31:10,814][model8_pretrain.py][INFO] Epoch:[0/2](574900/4588595) loss:2.602 lr:0.0000100 epoch_Time:25492.0min: [2024-01-05 
06:31:10,814][model8_pretrain.py][INFO] Epoch:[0/2](574900/4588595) loss:2.903 lr:0.0000100 epoch_Time:25492.0min: [2024-01-05 06:31:47,753][model8_pretrain.py][INFO] Epoch:[0/2](575000/4588595) loss:3.117 lr:0.0000100 epoch_Time:25491.0min: [2024-01-05 06:31:47,753][model8_pretrain.py][INFO] Epoch:[0/2](575000/4588595) loss:2.199 lr:0.0000100 epoch_Time:25491.0min: [2024-01-05 06:31:47,753][model8_pretrain.py][INFO] Epoch:[0/2](575000/4588595) loss:3.106 lr:0.0000100 epoch_Time:25491.0min: [2024-01-05 06:31:47,753][model8_pretrain.py][INFO] Epoch:[0/2](575000/4588595) loss:2.461 lr:0.0000100 epoch_Time:25491.0min: [2024-01-05 06:31:47,753][model8_pretrain.py][INFO] Epoch:[0/2](575000/4588595) loss:2.695 lr:0.0000100 epoch_Time:25491.0min: [2024-01-05 06:31:47,753][model8_pretrain.py][INFO] Epoch:[0/2](575000/4588595) loss:3.196 lr:0.0000100 epoch_Time:25491.0min: [2024-01-05 06:31:47,753][model8_pretrain.py][INFO] Epoch:[0/2](575000/4588595) loss:3.294 lr:0.0000100 epoch_Time:25491.0min: [2024-01-05 06:31:47,753][model8_pretrain.py][INFO] Epoch:[0/2](575000/4588595) loss:2.415 lr:0.0000100 epoch_Time:25491.0min: [2024-01-05 06:32:24,689][model8_pretrain.py][INFO] Epoch:[0/2](575100/4588595) loss:2.242 lr:0.0000100 epoch_Time:25491.0min: [2024-01-05 06:32:24,689][model8_pretrain.py][INFO] Epoch:[0/2](575100/4588595) loss:2.867 lr:0.0000100 epoch_Time:25491.0min: [2024-01-05 06:32:24,689][model8_pretrain.py][INFO] Epoch:[0/2](575100/4588595) loss:2.495 lr:0.0000100 epoch_Time:25491.0min: [2024-01-05 06:32:24,689][model8_pretrain.py][INFO] Epoch:[0/2](575100/4588595) loss:3.090 lr:0.0000100 epoch_Time:25491.0min: [2024-01-05 06:32:24,689][model8_pretrain.py][INFO] Epoch:[0/2](575100/4588595) loss:2.478 lr:0.0000100 epoch_Time:25491.0min: [2024-01-05 06:32:24,689][model8_pretrain.py][INFO] Epoch:[0/2](575100/4588595) loss:3.064 lr:0.0000100 epoch_Time:25491.0min: [2024-01-05 06:32:24,689][model8_pretrain.py][INFO] Epoch:[0/2](575100/4588595) loss:2.595 lr:0.0000100 
epoch_Time:25491.0min: [2024-01-05 06:32:24,689][model8_pretrain.py][INFO] Epoch:[0/2](575100/4588595) loss:2.598 lr:0.0000100 epoch_Time:25491.0min: [2024-01-05 06:33:03,395][model8_pretrain.py][INFO] Epoch:[0/2](575200/4588595) loss:2.985 lr:0.0000100 epoch_Time:25490.0min: [2024-01-05 06:33:03,396][model8_pretrain.py][INFO] Epoch:[0/2](575200/4588595) loss:2.943 lr:0.0000100 epoch_Time:25490.0min: [2024-01-05 06:33:03,395][model8_pretrain.py][INFO] Epoch:[0/2](575200/4588595) loss:3.273 lr:0.0000100 epoch_Time:25490.0min: [2024-01-05 06:33:03,396][model8_pretrain.py][INFO] Epoch:[0/2](575200/4588595) loss:2.907 lr:0.0000100 epoch_Time:25490.0min: [2024-01-05 06:33:03,396][model8_pretrain.py][INFO] Epoch:[0/2](575200/4588595) loss:3.135 lr:0.0000100 epoch_Time:25490.0min: [2024-01-05 06:33:03,396][model8_pretrain.py][INFO] Epoch:[0/2](575200/4588595) loss:2.283 lr:0.0000100 epoch_Time:25490.0min: [2024-01-05 06:33:03,396][model8_pretrain.py][INFO] Epoch:[0/2](575200/4588595) loss:2.892 lr:0.0000100 epoch_Time:25490.0min: [2024-01-05 06:33:03,396][model8_pretrain.py][INFO] Epoch:[0/2](575200/4588595) loss:2.442 lr:0.0000100 epoch_Time:25490.0min: [2024-01-05 06:33:49,129][model8_pretrain.py][INFO] Epoch:[0/2](575300/4588595) loss:2.672 lr:0.0000100 epoch_Time:25490.0min: [2024-01-05 06:33:49,129][model8_pretrain.py][INFO] Epoch:[0/2](575300/4588595) loss:2.890 lr:0.0000100 epoch_Time:25490.0min: [2024-01-05 06:33:49,129][model8_pretrain.py][INFO] Epoch:[0/2](575300/4588595) loss:2.801 lr:0.0000100 epoch_Time:25490.0min: [2024-01-05 06:33:49,129][model8_pretrain.py][INFO] Epoch:[0/2](575300/4588595) loss:3.138 lr:0.0000100 epoch_Time:25490.0min: [2024-01-05 06:33:49,129][model8_pretrain.py][INFO] Epoch:[0/2](575300/4588595) loss:3.020 lr:0.0000100 epoch_Time:25490.0min: [2024-01-05 06:33:49,129][model8_pretrain.py][INFO] Epoch:[0/2](575300/4588595) loss:2.441 lr:0.0000100 epoch_Time:25490.0min: [2024-01-05 06:33:49,129][model8_pretrain.py][INFO] 
Epoch:[0/2](575300/4588595) loss:2.643 lr:0.0000100 epoch_Time:25490.0min: [2024-01-05 06:33:49,129][model8_pretrain.py][INFO] Epoch:[0/2](575300/4588595) loss:2.611 lr:0.0000100 epoch_Time:25490.0min: [2024-01-05 06:34:26,059][model8_pretrain.py][INFO] Epoch:[0/2](575400/4588595) loss:3.079 lr:0.0000100 epoch_Time:25490.0min: [2024-01-05 06:34:26,059][model8_pretrain.py][INFO] Epoch:[0/2](575400/4588595) loss:2.438 lr:0.0000100 epoch_Time:25490.0min: [2024-01-05 06:34:26,059][model8_pretrain.py][INFO] Epoch:[0/2](575400/4588595) loss:2.436 lr:0.0000100 epoch_Time:25490.0min: [2024-01-05 06:34:26,059][model8_pretrain.py][INFO] Epoch:[0/2](575400/4588595) loss:3.269 lr:0.0000100 epoch_Time:25490.0min: [2024-01-05 06:34:26,059][model8_pretrain.py][INFO] Epoch:[0/2](575400/4588595) loss:3.134 lr:0.0000100 epoch_Time:25490.0min: [2024-01-05 06:34:26,059][model8_pretrain.py][INFO] Epoch:[0/2](575400/4588595) loss:2.661 lr:0.0000100 epoch_Time:25490.0min: [2024-01-05 06:34:26,059][model8_pretrain.py][INFO] Epoch:[0/2](575400/4588595) loss:2.766 lr:0.0000100 epoch_Time:25490.0min: [2024-01-05 06:34:26,059][model8_pretrain.py][INFO] Epoch:[0/2](575400/4588595) loss:3.103 lr:0.0000100 epoch_Time:25490.0min: [2024-01-05 06:35:03,000][model8_pretrain.py][INFO] Epoch:[0/2](575500/4588595) loss:2.885 lr:0.0000100 epoch_Time:25489.0min: [2024-01-05 06:35:03,000][model8_pretrain.py][INFO] Epoch:[0/2](575500/4588595) loss:2.876 lr:0.0000100 epoch_Time:25489.0min: [2024-01-05 06:35:03,000][model8_pretrain.py][INFO] Epoch:[0/2](575500/4588595) loss:3.119 lr:0.0000100 epoch_Time:25489.0min: [2024-01-05 06:35:03,000][model8_pretrain.py][INFO] Epoch:[0/2](575500/4588595) loss:2.814 lr:0.0000100 epoch_Time:25489.0min: [2024-01-05 06:35:03,000][model8_pretrain.py][INFO] Epoch:[0/2](575500/4588595) loss:2.317 lr:0.0000100 epoch_Time:25489.0min: [2024-01-05 06:35:03,000][model8_pretrain.py][INFO] Epoch:[0/2](575500/4588595) loss:2.637 lr:0.0000100 epoch_Time:25489.0min: [2024-01-05 
06:35:03,000][model8_pretrain.py][INFO] Epoch:[0/2](575500/4588595) loss:3.096 lr:0.0000100 epoch_Time:25489.0min: [2024-01-05 06:35:03,000][model8_pretrain.py][INFO] Epoch:[0/2](575500/4588595) loss:2.906 lr:0.0000100 epoch_Time:25489.0min: [2024-01-05 06:35:39,942][model8_pretrain.py][INFO] Epoch:[0/2](575600/4588595) loss:1.936 lr:0.0000100 epoch_Time:25489.0min: [2024-01-05 06:35:39,942][model8_pretrain.py][INFO] Epoch:[0/2](575600/4588595) loss:2.354 lr:0.0000100 epoch_Time:25489.0min: [2024-01-05 06:35:39,942][model8_pretrain.py][INFO] Epoch:[0/2](575600/4588595) loss:2.839 lr:0.0000100 epoch_Time:25489.0min: [2024-01-05 06:35:39,942][model8_pretrain.py][INFO] Epoch:[0/2](575600/4588595) loss:3.279 lr:0.0000100 epoch_Time:25489.0min: [2024-01-05 06:35:39,942][model8_pretrain.py][INFO] Epoch:[0/2](575600/4588595) loss:2.880 lr:0.0000100 epoch_Time:25489.0min: [2024-01-05 06:35:39,942][model8_pretrain.py][INFO] Epoch:[0/2](575600/4588595) loss:2.414 lr:0.0000100 epoch_Time:25489.0min: [2024-01-05 06:35:39,942][model8_pretrain.py][INFO] Epoch:[0/2](575600/4588595) loss:2.676 lr:0.0000100 epoch_Time:25489.0min: [2024-01-05 06:35:39,942][model8_pretrain.py][INFO] Epoch:[0/2](575600/4588595) loss:3.106 lr:0.0000100 epoch_Time:25489.0min: [2024-01-05 06:36:16,890][model8_pretrain.py][INFO] Epoch:[0/2](575700/4588595) loss:2.811 lr:0.0000100 epoch_Time:25487.0min: [2024-01-05 06:36:16,891][model8_pretrain.py][INFO] Epoch:[0/2](575700/4588595) loss:2.873 lr:0.0000100 epoch_Time:25487.0min: [2024-01-05 06:36:16,891][model8_pretrain.py][INFO] Epoch:[0/2](575700/4588595) loss:3.098 lr:0.0000100 epoch_Time:25487.0min: [2024-01-05 06:36:16,891][model8_pretrain.py][INFO] Epoch:[0/2](575700/4588595) loss:2.174 lr:0.0000100 epoch_Time:25487.0min: [2024-01-05 06:36:16,891][model8_pretrain.py][INFO] Epoch:[0/2](575700/4588595) loss:3.475 lr:0.0000100 epoch_Time:25487.0min: [2024-01-05 06:36:16,891][model8_pretrain.py][INFO] Epoch:[0/2](575700/4588595) loss:2.279 lr:0.0000100 
epoch_Time:25487.0min: [2024-01-05 06:36:16,891][model8_pretrain.py][INFO] Epoch:[0/2](575700/4588595) loss:2.960 lr:0.0000100 epoch_Time:25487.0min: [2024-01-05 06:36:16,891][model8_pretrain.py][INFO] Epoch:[0/2](575700/4588595) loss:2.402 lr:0.0000100 epoch_Time:25487.0min: [2024-01-05 06:36:53,816][model8_pretrain.py][INFO] Epoch:[0/2](575800/4588595) loss:2.946 lr:0.0000100 epoch_Time:25486.0min: [2024-01-05 06:36:53,817][model8_pretrain.py][INFO] Epoch:[0/2](575800/4588595) loss:2.901 lr:0.0000100 epoch_Time:25486.0min: [2024-01-05 06:36:53,817][model8_pretrain.py][INFO] Epoch:[0/2](575800/4588595) loss:2.714 lr:0.0000100 epoch_Time:25486.0min: [2024-01-05 06:36:53,817][model8_pretrain.py][INFO] Epoch:[0/2](575800/4588595) loss:2.527 lr:0.0000100 epoch_Time:25486.0min: [2024-01-05 06:36:53,817][model8_pretrain.py][INFO] Epoch:[0/2](575800/4588595) loss:2.584 lr:0.0000100 epoch_Time:25486.0min: [2024-01-05 06:36:53,817][model8_pretrain.py][INFO] Epoch:[0/2](575800/4588595) loss:2.818 lr:0.0000100 epoch_Time:25486.0min: [2024-01-05 06:36:53,817][model8_pretrain.py][INFO] Epoch:[0/2](575800/4588595) loss:3.383 lr:0.0000100 epoch_Time:25486.0min: [2024-01-05 06:36:53,817][model8_pretrain.py][INFO] Epoch:[0/2](575800/4588595) loss:3.149 lr:0.0000100 epoch_Time:25486.0min: [2024-01-05 06:37:30,751][model8_pretrain.py][INFO] Epoch:[0/2](575900/4588595) loss:3.060 lr:0.0000100 epoch_Time:25486.0min: [2024-01-05 06:37:30,751][model8_pretrain.py][INFO] Epoch:[0/2](575900/4588595) loss:3.074 lr:0.0000100 epoch_Time:25486.0min: [2024-01-05 06:37:30,751][model8_pretrain.py][INFO] Epoch:[0/2](575900/4588595) loss:2.708 lr:0.0000100 epoch_Time:25486.0min: [2024-01-05 06:37:30,751][model8_pretrain.py][INFO] Epoch:[0/2](575900/4588595) loss:3.172 lr:0.0000100 epoch_Time:25486.0min: [2024-01-05 06:37:30,751][model8_pretrain.py][INFO] Epoch:[0/2](575900/4588595) loss:2.627 lr:0.0000100 epoch_Time:25486.0min: [2024-01-05 06:37:30,751][model8_pretrain.py][INFO] 
Epoch:[0/2](575900/4588595) loss:2.359 lr:0.0000100 epoch_Time:25486.0min: [2024-01-05 06:37:30,751][model8_pretrain.py][INFO] Epoch:[0/2](575900/4588595) loss:2.383 lr:0.0000100 epoch_Time:25486.0min: [2024-01-05 06:37:30,751][model8_pretrain.py][INFO] Epoch:[0/2](575900/4588595) loss:2.661 lr:0.0000100 epoch_Time:25486.0min: [2024-01-05 06:38:07,679][model8_pretrain.py][INFO] Epoch:[0/2](576000/4588595) loss:2.721 lr:0.0000100 epoch_Time:25485.0min: [2024-01-05 06:38:07,679][model8_pretrain.py][INFO] Epoch:[0/2](576000/4588595) loss:2.762 lr:0.0000100 epoch_Time:25485.0min: [2024-01-05 06:38:07,679][model8_pretrain.py][INFO] Epoch:[0/2](576000/4588595) loss:2.661 lr:0.0000100 epoch_Time:25485.0min: [2024-01-05 06:38:07,679][model8_pretrain.py][INFO] Epoch:[0/2](576000/4588595) loss:2.935 lr:0.0000100 epoch_Time:25485.0min: [2024-01-05 06:38:07,679][model8_pretrain.py][INFO] Epoch:[0/2](576000/4588595) loss:2.406 lr:0.0000100 epoch_Time:25485.0min: [2024-01-05 06:38:07,679][model8_pretrain.py][INFO] Epoch:[0/2](576000/4588595) loss:2.343 lr:0.0000100 epoch_Time:25485.0min: [2024-01-05 06:38:07,679][model8_pretrain.py][INFO] Epoch:[0/2](576000/4588595) loss:2.760 lr:0.0000100 epoch_Time:25485.0min: [2024-01-05 06:38:07,680][model8_pretrain.py][INFO] Epoch:[0/2](576000/4588595) loss:3.121 lr:0.0000100 epoch_Time:25485.0min: [2024-01-05 06:38:55,139][model8_pretrain.py][INFO] Epoch:[0/2](576100/4588595) loss:2.592 lr:0.0000100 epoch_Time:25485.0min: [2024-01-05 06:38:55,139][model8_pretrain.py][INFO] Epoch:[0/2](576100/4588595) loss:2.964 lr:0.0000100 epoch_Time:25485.0min: [2024-01-05 06:38:55,139][model8_pretrain.py][INFO] Epoch:[0/2](576100/4588595) loss:2.793 lr:0.0000100 epoch_Time:25485.0min: [2024-01-05 06:38:55,139][model8_pretrain.py][INFO] Epoch:[0/2](576100/4588595) loss:3.018 lr:0.0000100 epoch_Time:25485.0min: [2024-01-05 06:38:55,139][model8_pretrain.py][INFO] Epoch:[0/2](576100/4588595) loss:2.362 lr:0.0000100 epoch_Time:25485.0min: [2024-01-05 
06:38:55,139][model8_pretrain.py][INFO] Epoch:[0/2](576100/4588595) loss:2.780 lr:0.0000100 epoch_Time:25485.0min: [2024-01-05 06:38:55,139][model8_pretrain.py][INFO] Epoch:[0/2](576100/4588595) loss:2.758 lr:0.0000100 epoch_Time:25485.0min: [2024-01-05 06:38:55,139][model8_pretrain.py][INFO] Epoch:[0/2](576100/4588595) loss:2.875 lr:0.0000100 epoch_Time:25485.0min: [2024-01-05 06:39:32,052][model8_pretrain.py][INFO] Epoch:[0/2](576200/4588595) loss:3.153 lr:0.0000100 epoch_Time:25485.0min: [2024-01-05 06:39:32,052][model8_pretrain.py][INFO] Epoch:[0/2](576200/4588595) loss:3.161 lr:0.0000100 epoch_Time:25485.0min: [2024-01-05 06:39:32,052][model8_pretrain.py][INFO] Epoch:[0/2](576200/4588595) loss:2.358 lr:0.0000100 epoch_Time:25485.0min: [2024-01-05 06:39:32,052][model8_pretrain.py][INFO] Epoch:[0/2](576200/4588595) loss:2.905 lr:0.0000100 epoch_Time:25485.0min: [2024-01-05 06:39:32,052][model8_pretrain.py][INFO] Epoch:[0/2](576200/4588595) loss:2.793 lr:0.0000100 epoch_Time:25485.0min: [2024-01-05 06:39:32,052][model8_pretrain.py][INFO] Epoch:[0/2](576200/4588595) loss:2.423 lr:0.0000100 epoch_Time:25485.0min: [2024-01-05 06:39:32,052][model8_pretrain.py][INFO] Epoch:[0/2](576200/4588595) loss:3.193 lr:0.0000100 epoch_Time:25485.0min: [2024-01-05 06:39:32,052][model8_pretrain.py][INFO] Epoch:[0/2](576200/4588595) loss:2.531 lr:0.0000100 epoch_Time:25485.0min: [2024-01-05 06:40:08,995][model8_pretrain.py][INFO] Epoch:[0/2](576300/4588595) loss:3.382 lr:0.0000100 epoch_Time:25484.0min: [2024-01-05 06:40:08,995][model8_pretrain.py][INFO] Epoch:[0/2](576300/4588595) loss:2.792 lr:0.0000100 epoch_Time:25484.0min: [2024-01-05 06:40:08,995][model8_pretrain.py][INFO] Epoch:[0/2](576300/4588595) loss:2.786 lr:0.0000100 epoch_Time:25484.0min: [2024-01-05 06:40:08,995][model8_pretrain.py][INFO] Epoch:[0/2](576300/4588595) loss:3.020 lr:0.0000100 epoch_Time:25484.0min: [2024-01-05 06:40:08,995][model8_pretrain.py][INFO] Epoch:[0/2](576300/4588595) loss:3.086 lr:0.0000100 
epoch_Time:25484.0min: [2024-01-05 06:40:08,995][model8_pretrain.py][INFO] Epoch:[0/2](576300/4588595) loss:2.451 lr:0.0000100 epoch_Time:25484.0min: [2024-01-05 06:40:08,995][model8_pretrain.py][INFO] Epoch:[0/2](576300/4588595) loss:2.359 lr:0.0000100 epoch_Time:25484.0min: [2024-01-05 06:40:08,995][model8_pretrain.py][INFO] Epoch:[0/2](576300/4588595) loss:2.923 lr:0.0000100 epoch_Time:25484.0min: [2024-01-05 06:40:45,929][model8_pretrain.py][INFO] Epoch:[0/2](576400/4588595) loss:2.870 lr:0.0000100 epoch_Time:25484.0min: [2024-01-05 06:40:45,929][model8_pretrain.py][INFO] Epoch:[0/2](576400/4588595) loss:2.774 lr:0.0000100 epoch_Time:25484.0min: [2024-01-05 06:40:45,929][model8_pretrain.py][INFO] Epoch:[0/2](576400/4588595) loss:2.977 lr:0.0000100 epoch_Time:25484.0min: [2024-01-05 06:40:45,929][model8_pretrain.py][INFO] Epoch:[0/2](576400/4588595) loss:2.635 lr:0.0000100 epoch_Time:25484.0min: [2024-01-05 06:40:45,929][model8_pretrain.py][INFO] Epoch:[0/2](576400/4588595) loss:3.113 lr:0.0000100 epoch_Time:25484.0min: [2024-01-05 06:40:45,929][model8_pretrain.py][INFO] Epoch:[0/2](576400/4588595) loss:3.149 lr:0.0000100 epoch_Time:25484.0min: [2024-01-05 06:40:45,929][model8_pretrain.py][INFO] Epoch:[0/2](576400/4588595) loss:3.338 lr:0.0000100 epoch_Time:25484.0min: [2024-01-05 06:40:45,929][model8_pretrain.py][INFO] Epoch:[0/2](576400/4588595) loss:2.650 lr:0.0000100 epoch_Time:25484.0min: [2024-01-05 06:41:22,882][model8_pretrain.py][INFO] Epoch:[0/2](576500/4588595) loss:3.230 lr:0.0000100 epoch_Time:25483.0min: [2024-01-05 06:41:22,882][model8_pretrain.py][INFO] Epoch:[0/2](576500/4588595) loss:2.515 lr:0.0000100 epoch_Time:25482.0min: [2024-01-05 06:41:22,882][model8_pretrain.py][INFO] Epoch:[0/2](576500/4588595) loss:2.035 lr:0.0000100 epoch_Time:25482.0min: [2024-01-05 06:41:22,882][model8_pretrain.py][INFO] Epoch:[0/2](576500/4588595) loss:2.947 lr:0.0000100 epoch_Time:25483.0min: [2024-01-05 06:41:22,882][model8_pretrain.py][INFO] 
Epoch:[0/2](576500/4588595) loss:3.111 lr:0.0000100 epoch_Time:25482.0min: [2024-01-05 06:41:22,882][model8_pretrain.py][INFO] Epoch:[0/2](576500/4588595) loss:2.769 lr:0.0000100 epoch_Time:25482.0min: [2024-01-05 06:41:22,882][model8_pretrain.py][INFO] Epoch:[0/2](576500/4588595) loss:2.464 lr:0.0000100 epoch_Time:25482.0min: [2024-01-05 06:41:22,882][model8_pretrain.py][INFO] Epoch:[0/2](576500/4588595) loss:2.204 lr:0.0000100 epoch_Time:25482.0min: [2024-01-05 06:41:59,809][model8_pretrain.py][INFO] Epoch:[0/2](576600/4588595) loss:2.709 lr:0.0000100 epoch_Time:25481.0min: [2024-01-05 06:41:59,809][model8_pretrain.py][INFO] Epoch:[0/2](576600/4588595) loss:2.502 lr:0.0000100 epoch_Time:25481.0min: [2024-01-05 06:41:59,809][model8_pretrain.py][INFO] Epoch:[0/2](576600/4588595) loss:3.339 lr:0.0000100 epoch_Time:25481.0min: [2024-01-05 06:41:59,809][model8_pretrain.py][INFO] Epoch:[0/2](576600/4588595) loss:3.267 lr:0.0000100 epoch_Time:25481.0min: [2024-01-05 06:41:59,809][model8_pretrain.py][INFO] Epoch:[0/2](576600/4588595) loss:2.560 lr:0.0000100 epoch_Time:25481.0min: [2024-01-05 06:41:59,809][model8_pretrain.py][INFO] Epoch:[0/2](576600/4588595) loss:3.394 lr:0.0000100 epoch_Time:25481.0min: [2024-01-05 06:41:59,809][model8_pretrain.py][INFO] Epoch:[0/2](576600/4588595) loss:2.452 lr:0.0000100 epoch_Time:25481.0min: [2024-01-05 06:41:59,809][model8_pretrain.py][INFO] Epoch:[0/2](576600/4588595) loss:2.636 lr:0.0000100 epoch_Time:25481.0min: [2024-01-05 06:42:36,741][model8_pretrain.py][INFO] Epoch:[0/2](576700/4588595) loss:2.903 lr:0.0000100 epoch_Time:25481.0min: [2024-01-05 06:42:36,741][model8_pretrain.py][INFO] Epoch:[0/2](576700/4588595) loss:2.873 lr:0.0000100 epoch_Time:25481.0min: [2024-01-05 06:42:36,741][model8_pretrain.py][INFO] Epoch:[0/2](576700/4588595) loss:2.332 lr:0.0000100 epoch_Time:25481.0min: [2024-01-05 06:42:36,741][model8_pretrain.py][INFO] Epoch:[0/2](576700/4588595) loss:3.166 lr:0.0000100 epoch_Time:25481.0min: [2024-01-05 
06:42:36,741][model8_pretrain.py][INFO] Epoch:[0/2](576700/4588595) loss:2.937 lr:0.0000100 epoch_Time:25481.0min: [2024-01-05 06:42:36,741][model8_pretrain.py][INFO] Epoch:[0/2](576700/4588595) loss:3.054 lr:0.0000100 epoch_Time:25481.0min: [2024-01-05 06:42:36,741][model8_pretrain.py][INFO] Epoch:[0/2](576700/4588595) loss:3.117 lr:0.0000100 epoch_Time:25481.0min: [2024-01-05 06:42:36,741][model8_pretrain.py][INFO] Epoch:[0/2](576700/4588595) loss:3.240 lr:0.0000100 epoch_Time:25481.0min: [2024-01-05 06:43:13,674][model8_pretrain.py][INFO] Epoch:[0/2](576800/4588595) loss:3.149 lr:0.0000100 epoch_Time:25480.0min: [2024-01-05 06:43:13,674][model8_pretrain.py][INFO] Epoch:[0/2](576800/4588595) loss:2.877 lr:0.0000100 epoch_Time:25480.0min: [2024-01-05 06:43:13,674][model8_pretrain.py][INFO] Epoch:[0/2](576800/4588595) loss:2.650 lr:0.0000100 epoch_Time:25480.0min: [2024-01-05 06:43:13,674][model8_pretrain.py][INFO] Epoch:[0/2](576800/4588595) loss:2.970 lr:0.0000100 epoch_Time:25480.0min: [2024-01-05 06:43:13,674][model8_pretrain.py][INFO] Epoch:[0/2](576800/4588595) loss:2.956 lr:0.0000100 epoch_Time:25480.0min: [2024-01-05 06:43:13,674][model8_pretrain.py][INFO] Epoch:[0/2](576800/4588595) loss:2.942 lr:0.0000100 epoch_Time:25480.0min: [2024-01-05 06:43:13,674][model8_pretrain.py][INFO] Epoch:[0/2](576800/4588595) loss:3.115 lr:0.0000100 epoch_Time:25480.0min: [2024-01-05 06:43:13,674][model8_pretrain.py][INFO] Epoch:[0/2](576800/4588595) loss:3.132 lr:0.0000100 epoch_Time:25480.0min: [2024-01-05 06:44:01,062][model8_pretrain.py][INFO] Epoch:[0/2](576900/4588595) loss:2.121 lr:0.0000100 epoch_Time:25480.0min: [2024-01-05 06:44:01,062][model8_pretrain.py][INFO] Epoch:[0/2](576900/4588595) loss:3.067 lr:0.0000100 epoch_Time:25480.0min: [2024-01-05 06:44:01,062][model8_pretrain.py][INFO] Epoch:[0/2](576900/4588595) loss:3.063 lr:0.0000100 epoch_Time:25480.0min: [2024-01-05 06:44:01,062][model8_pretrain.py][INFO] Epoch:[0/2](576900/4588595) loss:2.939 lr:0.0000100 
epoch_Time:25480.0min: [2024-01-05 06:44:01,062][model8_pretrain.py][INFO] Epoch:[0/2](576900/4588595) loss:3.009 lr:0.0000100 epoch_Time:25480.0min: [2024-01-05 06:44:01,062][model8_pretrain.py][INFO] Epoch:[0/2](576900/4588595) loss:2.272 lr:0.0000100 epoch_Time:25480.0min: [2024-01-05 06:44:01,062][model8_pretrain.py][INFO] Epoch:[0/2](576900/4588595) loss:3.126 lr:0.0000100 epoch_Time:25480.0min: [2024-01-05 06:44:01,062][model8_pretrain.py][INFO] Epoch:[0/2](576900/4588595) loss:3.506 lr:0.0000100 epoch_Time:25480.0min: [2024-01-05 06:44:38,015][model8_pretrain.py][INFO] Epoch:[0/2](577000/4588595) loss:2.551 lr:0.0000100 epoch_Time:25480.0min: [2024-01-05 06:44:38,015][model8_pretrain.py][INFO] Epoch:[0/2](577000/4588595) loss:2.420 lr:0.0000100 epoch_Time:25480.0min: [2024-01-05 06:44:38,015][model8_pretrain.py][INFO] Epoch:[0/2](577000/4588595) loss:3.268 lr:0.0000100 epoch_Time:25480.0min: [2024-01-05 06:44:38,015][model8_pretrain.py][INFO] Epoch:[0/2](577000/4588595) loss:2.316 lr:0.0000100 epoch_Time:25480.0min: [2024-01-05 06:44:38,015][model8_pretrain.py][INFO] Epoch:[0/2](577000/4588595) loss:2.694 lr:0.0000100 epoch_Time:25480.0min: [2024-01-05 06:44:38,015][model8_pretrain.py][INFO] Epoch:[0/2](577000/4588595) loss:2.950 lr:0.0000100 epoch_Time:25480.0min: [2024-01-05 06:44:38,015][model8_pretrain.py][INFO] Epoch:[0/2](577000/4588595) loss:2.290 lr:0.0000100 epoch_Time:25480.0min: [2024-01-05 06:44:38,015][model8_pretrain.py][INFO] Epoch:[0/2](577000/4588595) loss:2.625 lr:0.0000100 epoch_Time:25480.0min: [2024-01-05 06:45:14,975][model8_pretrain.py][INFO] Epoch:[0/2](577100/4588595) loss:2.699 lr:0.0000100 epoch_Time:25479.0min: [2024-01-05 06:45:14,975][model8_pretrain.py][INFO] Epoch:[0/2](577100/4588595) loss:2.987 lr:0.0000100 epoch_Time:25479.0min: [2024-01-05 06:45:14,975][model8_pretrain.py][INFO] Epoch:[0/2](577100/4588595) loss:2.610 lr:0.0000100 epoch_Time:25479.0min: [2024-01-05 06:45:14,975][model8_pretrain.py][INFO] 
Epoch:[0/2](577100/4588595) loss:2.653 lr:0.0000100 epoch_Time:25479.0min: [2024-01-05 06:45:14,975][model8_pretrain.py][INFO] Epoch:[0/2](577100/4588595) loss:3.000 lr:0.0000100 epoch_Time:25479.0min: [2024-01-05 06:45:14,975][model8_pretrain.py][INFO] Epoch:[0/2](577100/4588595) loss:2.970 lr:0.0000100 epoch_Time:25479.0min: [2024-01-05 06:45:14,975][model8_pretrain.py][INFO] Epoch:[0/2](577100/4588595) loss:2.825 lr:0.0000100 epoch_Time:25479.0min: [2024-01-05 06:45:14,976][model8_pretrain.py][INFO] Epoch:[0/2](577100/4588595) loss:2.522 lr:0.0000100 epoch_Time:25479.0min: [2024-01-05 06:45:51,942][model8_pretrain.py][INFO] Epoch:[0/2](577200/4588595) loss:3.093 lr:0.0000100 epoch_Time:25478.0min: [2024-01-05 06:45:51,942][model8_pretrain.py][INFO] Epoch:[0/2](577200/4588595) loss:2.681 lr:0.0000100 epoch_Time:25478.0min: [2024-01-05 06:45:51,942][model8_pretrain.py][INFO] Epoch:[0/2](577200/4588595) loss:3.219 lr:0.0000100 epoch_Time:25478.0min: [2024-01-05 06:45:51,942][model8_pretrain.py][INFO] Epoch:[0/2](577200/4588595) loss:2.888 lr:0.0000100 epoch_Time:25478.0min: [2024-01-05 06:45:51,942][model8_pretrain.py][INFO] Epoch:[0/2](577200/4588595) loss:3.266 lr:0.0000100 epoch_Time:25478.0min: [2024-01-05 06:45:51,942][model8_pretrain.py][INFO] Epoch:[0/2](577200/4588595) loss:2.746 lr:0.0000100 epoch_Time:25478.0min: [2024-01-05 06:45:51,942][model8_pretrain.py][INFO] Epoch:[0/2](577200/4588595) loss:2.660 lr:0.0000100 epoch_Time:25478.0min: [2024-01-05 06:45:51,943][model8_pretrain.py][INFO] Epoch:[0/2](577200/4588595) loss:3.354 lr:0.0000100 epoch_Time:25478.0min: [2024-01-05 06:46:28,909][model8_pretrain.py][INFO] Epoch:[0/2](577300/4588595) loss:2.782 lr:0.0000100 epoch_Time:25478.0min: [2024-01-05 06:46:28,909][model8_pretrain.py][INFO] Epoch:[0/2](577300/4588595) loss:2.982 lr:0.0000100 epoch_Time:25478.0min: [2024-01-05 06:46:28,909][model8_pretrain.py][INFO] Epoch:[0/2](577300/4588595) loss:2.869 lr:0.0000100 epoch_Time:25478.0min: [2024-01-05 
06:46:28,909][model8_pretrain.py][INFO] Epoch:[0/2](577300/4588595) loss:2.241 lr:0.0000100 epoch_Time:25478.0min: [2024-01-05 06:46:28,909][model8_pretrain.py][INFO] Epoch:[0/2](577300/4588595) loss:3.153 lr:0.0000100 epoch_Time:25478.0min: [2024-01-05 06:46:28,909][model8_pretrain.py][INFO] Epoch:[0/2](577300/4588595) loss:2.813 lr:0.0000100 epoch_Time:25478.0min: [2024-01-05 06:46:28,909][model8_pretrain.py][INFO] Epoch:[0/2](577300/4588595) loss:2.840 lr:0.0000100 epoch_Time:25478.0min: [2024-01-05 06:46:28,910][model8_pretrain.py][INFO] Epoch:[0/2](577300/4588595) loss:2.732 lr:0.0000100 epoch_Time:25478.0min: [2024-01-05 06:47:05,854][model8_pretrain.py][INFO] Epoch:[0/2](577400/4588595) loss:2.817 lr:0.0000100 epoch_Time:25476.0min: [2024-01-05 06:47:05,854][model8_pretrain.py][INFO] Epoch:[0/2](577400/4588595) loss:2.718 lr:0.0000100 epoch_Time:25476.0min: [2024-01-05 06:47:05,854][model8_pretrain.py][INFO] Epoch:[0/2](577400/4588595) loss:2.684 lr:0.0000100 epoch_Time:25476.0min: [2024-01-05 06:47:05,854][model8_pretrain.py][INFO] Epoch:[0/2](577400/4588595) loss:2.854 lr:0.0000100 epoch_Time:25477.0min: [2024-01-05 06:47:05,854][model8_pretrain.py][INFO] Epoch:[0/2](577400/4588595) loss:3.349 lr:0.0000100 epoch_Time:25476.0min: [2024-01-05 06:47:05,854][model8_pretrain.py][INFO] Epoch:[0/2](577400/4588595) loss:2.603 lr:0.0000100 epoch_Time:25477.0min: [2024-01-05 06:47:05,854][model8_pretrain.py][INFO] Epoch:[0/2](577400/4588595) loss:2.349 lr:0.0000100 epoch_Time:25476.0min: [2024-01-05 06:47:05,854][model8_pretrain.py][INFO] Epoch:[0/2](577400/4588595) loss:3.236 lr:0.0000100 epoch_Time:25476.0min: [2024-01-05 06:47:42,804][model8_pretrain.py][INFO] Epoch:[0/2](577500/4588595) loss:2.888 lr:0.0000100 epoch_Time:25476.0min: [2024-01-05 06:47:42,804][model8_pretrain.py][INFO] Epoch:[0/2](577500/4588595) loss:2.513 lr:0.0000100 epoch_Time:25476.0min: [2024-01-05 06:47:42,804][model8_pretrain.py][INFO] Epoch:[0/2](577500/4588595) loss:1.522 lr:0.0000100 
epoch_Time:25476.0min: [2024-01-05 06:47:42,804][model8_pretrain.py][INFO] Epoch:[0/2](577500/4588595) loss:3.116 lr:0.0000100 epoch_Time:25476.0min: [2024-01-05 06:47:42,804][model8_pretrain.py][INFO] Epoch:[0/2](577500/4588595) loss:2.991 lr:0.0000100 epoch_Time:25476.0min: [2024-01-05 06:47:42,804][model8_pretrain.py][INFO] Epoch:[0/2](577500/4588595) loss:2.915 lr:0.0000100 epoch_Time:25476.0min: [2024-01-05 06:47:42,804][model8_pretrain.py][INFO] Epoch:[0/2](577500/4588595) loss:2.889 lr:0.0000100 epoch_Time:25476.0min: [2024-01-05 06:47:42,805][model8_pretrain.py][INFO] Epoch:[0/2](577500/4588595) loss:2.689 lr:0.0000100 epoch_Time:25476.0min: [2024-01-05 06:48:19,751][model8_pretrain.py][INFO] Epoch:[0/2](577600/4588595) loss:3.099 lr:0.0000100 epoch_Time:25475.0min: [2024-01-05 06:48:19,751][model8_pretrain.py][INFO] Epoch:[0/2](577600/4588595) loss:2.809 lr:0.0000100 epoch_Time:25475.0min: [2024-01-05 06:48:19,751][model8_pretrain.py][INFO] Epoch:[0/2](577600/4588595) loss:3.065 lr:0.0000100 epoch_Time:25475.0min: [2024-01-05 06:48:19,751][model8_pretrain.py][INFO] Epoch:[0/2](577600/4588595) loss:2.967 lr:0.0000100 epoch_Time:25475.0min: [2024-01-05 06:48:19,751][model8_pretrain.py][INFO] Epoch:[0/2](577600/4588595) loss:2.744 lr:0.0000100 epoch_Time:25475.0min: [2024-01-05 06:48:19,751][model8_pretrain.py][INFO] Epoch:[0/2](577600/4588595) loss:2.987 lr:0.0000100 epoch_Time:25475.0min: [2024-01-05 06:48:19,751][model8_pretrain.py][INFO] Epoch:[0/2](577600/4588595) loss:2.633 lr:0.0000100 epoch_Time:25475.0min: [2024-01-05 06:48:19,751][model8_pretrain.py][INFO] Epoch:[0/2](577600/4588595) loss:2.786 lr:0.0000100 epoch_Time:25475.0min: [2024-01-05 06:49:07,063][model8_pretrain.py][INFO] Epoch:[0/2](577700/4588595) loss:2.407 lr:0.0000100 epoch_Time:25475.0min: [2024-01-05 06:49:07,063][model8_pretrain.py][INFO] Epoch:[0/2](577700/4588595) loss:2.923 lr:0.0000100 epoch_Time:25475.0min: [2024-01-05 06:49:07,063][model8_pretrain.py][INFO] 
Epoch:[0/2](577700/4588595) loss:2.740 lr:0.0000100 epoch_Time:25475.0min: [2024-01-05 06:49:07,063][model8_pretrain.py][INFO] Epoch:[0/2](577700/4588595) loss:3.298 lr:0.0000100 epoch_Time:25475.0min: [2024-01-05 06:49:07,063][model8_pretrain.py][INFO] Epoch:[0/2](577700/4588595) loss:2.485 lr:0.0000100 epoch_Time:25475.0min: [2024-01-05 06:49:07,063][model8_pretrain.py][INFO] Epoch:[0/2](577700/4588595) loss:3.230 lr:0.0000100 epoch_Time:25475.0min: [2024-01-05 06:49:07,063][model8_pretrain.py][INFO] Epoch:[0/2](577700/4588595) loss:2.323 lr:0.0000100 epoch_Time:25475.0min: [2024-01-05 06:49:07,063][model8_pretrain.py][INFO] Epoch:[0/2](577700/4588595) loss:3.208 lr:0.0000100 epoch_Time:25475.0min: [2024-01-05 06:49:44,024][model8_pretrain.py][INFO] Epoch:[0/2](577800/4588595) loss:2.927 lr:0.0000100 epoch_Time:25475.0min: [2024-01-05 06:49:44,024][model8_pretrain.py][INFO] Epoch:[0/2](577800/4588595) loss:2.408 lr:0.0000100 epoch_Time:25475.0min: [2024-01-05 06:49:44,024][model8_pretrain.py][INFO] Epoch:[0/2](577800/4588595) loss:2.698 lr:0.0000100 epoch_Time:25475.0min: [2024-01-05 06:49:44,024][model8_pretrain.py][INFO] Epoch:[0/2](577800/4588595) loss:2.695 lr:0.0000100 epoch_Time:25475.0min: [2024-01-05 06:49:44,024][model8_pretrain.py][INFO] Epoch:[0/2](577800/4588595) loss:2.890 lr:0.0000100 epoch_Time:25475.0min: [2024-01-05 06:49:44,024][model8_pretrain.py][INFO] Epoch:[0/2](577800/4588595) loss:3.180 lr:0.0000100 epoch_Time:25475.0min: [2024-01-05 06:49:44,024][model8_pretrain.py][INFO] Epoch:[0/2](577800/4588595) loss:3.098 lr:0.0000100 epoch_Time:25475.0min: [2024-01-05 06:49:44,024][model8_pretrain.py][INFO] Epoch:[0/2](577800/4588595) loss:3.195 lr:0.0000100 epoch_Time:25475.0min: [2024-01-05 06:50:20,990][model8_pretrain.py][INFO] Epoch:[0/2](577900/4588595) loss:3.122 lr:0.0000100 epoch_Time:25474.0min: [2024-01-05 06:50:20,990][model8_pretrain.py][INFO] Epoch:[0/2](577900/4588595) loss:3.380 lr:0.0000100 epoch_Time:25474.0min: [2024-01-05 
06:50:20,990][model8_pretrain.py][INFO] Epoch:[0/2](577900/4588595) loss:2.117 lr:0.0000100 epoch_Time:25474.0min: [2024-01-05 06:50:20,990][model8_pretrain.py][INFO] Epoch:[0/2](577900/4588595) loss:2.928 lr:0.0000100 epoch_Time:25474.0min: [2024-01-05 06:50:20,990][model8_pretrain.py][INFO] Epoch:[0/2](577900/4588595) loss:3.057 lr:0.0000100 epoch_Time:25474.0min: [2024-01-05 06:50:20,990][model8_pretrain.py][INFO] Epoch:[0/2](577900/4588595) loss:2.784 lr:0.0000100 epoch_Time:25474.0min: [2024-01-05 06:50:20,990][model8_pretrain.py][INFO] Epoch:[0/2](577900/4588595) loss:2.581 lr:0.0000100 epoch_Time:25474.0min: [2024-01-05 06:50:20,991][model8_pretrain.py][INFO] Epoch:[0/2](577900/4588595) loss:3.049 lr:0.0000100 epoch_Time:25474.0min: [2024-01-05 06:50:57,944][model8_pretrain.py][INFO] Epoch:[0/2](578000/4588595) loss:2.688 lr:0.0000100 epoch_Time:25473.0min: [2024-01-05 06:50:57,944][model8_pretrain.py][INFO] Epoch:[0/2](578000/4588595) loss:3.332 lr:0.0000100 epoch_Time:25473.0min: [2024-01-05 06:50:57,944][model8_pretrain.py][INFO] Epoch:[0/2](578000/4588595) loss:2.894 lr:0.0000100 epoch_Time:25473.0min: [2024-01-05 06:50:57,944][model8_pretrain.py][INFO] Epoch:[0/2](578000/4588595) loss:2.962 lr:0.0000100 epoch_Time:25473.0min: [2024-01-05 06:50:57,944][model8_pretrain.py][INFO] Epoch:[0/2](578000/4588595) loss:2.843 lr:0.0000100 epoch_Time:25473.0min: [2024-01-05 06:50:57,944][model8_pretrain.py][INFO] Epoch:[0/2](578000/4588595) loss:2.641 lr:0.0000100 epoch_Time:25473.0min: [2024-01-05 06:50:57,944][model8_pretrain.py][INFO] Epoch:[0/2](578000/4588595) loss:3.153 lr:0.0000100 epoch_Time:25473.0min: [2024-01-05 06:50:57,944][model8_pretrain.py][INFO] Epoch:[0/2](578000/4588595) loss:2.376 lr:0.0000100 epoch_Time:25473.0min: [2024-01-05 06:51:34,896][model8_pretrain.py][INFO] Epoch:[0/2](578100/4588595) loss:3.062 lr:0.0000100 epoch_Time:25473.0min: [2024-01-05 06:51:34,896][model8_pretrain.py][INFO] Epoch:[0/2](578100/4588595) loss:2.562 lr:0.0000100 
epoch_Time:25473.0min: [2024-01-05 06:51:34,896][model8_pretrain.py][INFO] Epoch:[0/2](578100/4588595) loss:2.999 lr:0.0000100 epoch_Time:25473.0min: [2024-01-05 06:51:34,896][model8_pretrain.py][INFO] Epoch:[0/2](578100/4588595) loss:2.880 lr:0.0000100 epoch_Time:25473.0min: [2024-01-05 06:51:34,896][model8_pretrain.py][INFO] Epoch:[0/2](578100/4588595) loss:2.733 lr:0.0000100 epoch_Time:25473.0min: [2024-01-05 06:51:34,896][model8_pretrain.py][INFO] Epoch:[0/2](578100/4588595) loss:2.876 lr:0.0000100 epoch_Time:25473.0min: [2024-01-05 06:51:34,896][model8_pretrain.py][INFO] Epoch:[0/2](578100/4588595) loss:3.124 lr:0.0000100 epoch_Time:25473.0min: [2024-01-05 06:51:34,897][model8_pretrain.py][INFO] Epoch:[0/2](578100/4588595) loss:2.592 lr:0.0000100 epoch_Time:25473.0min: [2024-01-05 06:52:11,850][model8_pretrain.py][INFO] Epoch:[0/2](578200/4588595) loss:2.689 lr:0.0000100 epoch_Time:25472.0min: [2024-01-05 06:52:11,850][model8_pretrain.py][INFO] Epoch:[0/2](578200/4588595) loss:2.939 lr:0.0000100 epoch_Time:25472.0min: [2024-01-05 06:52:11,850][model8_pretrain.py][INFO] Epoch:[0/2](578200/4588595) loss:2.385 lr:0.0000100 epoch_Time:25472.0min: [2024-01-05 06:52:11,850][model8_pretrain.py][INFO] Epoch:[0/2](578200/4588595) loss:2.899 lr:0.0000100 epoch_Time:25472.0min: [2024-01-05 06:52:11,850][model8_pretrain.py][INFO] Epoch:[0/2](578200/4588595) loss:2.750 lr:0.0000100 epoch_Time:25472.0min: [2024-01-05 06:52:11,850][model8_pretrain.py][INFO] Epoch:[0/2](578200/4588595) loss:2.764 lr:0.0000100 epoch_Time:25472.0min: [2024-01-05 06:52:11,850][model8_pretrain.py][INFO] Epoch:[0/2](578200/4588595) loss:2.524 lr:0.0000100 epoch_Time:25472.0min: [2024-01-05 06:52:11,850][model8_pretrain.py][INFO] Epoch:[0/2](578200/4588595) loss:3.081 lr:0.0000100 epoch_Time:25472.0min: [2024-01-05 06:52:48,808][model8_pretrain.py][INFO] Epoch:[0/2](578300/4588595) loss:2.241 lr:0.0000100 epoch_Time:25470.0min: [2024-01-05 06:52:48,808][model8_pretrain.py][INFO] 
Epoch:[0/2](578300/4588595) loss:3.409 lr:0.0000100 epoch_Time:25470.0min: [2024-01-05 06:52:48,808][model8_pretrain.py][INFO] Epoch:[0/2](578300/4588595) loss:3.029 lr:0.0000100 epoch_Time:25470.0min: [2024-01-05 06:52:48,808][model8_pretrain.py][INFO] Epoch:[0/2](578300/4588595) loss:2.257 lr:0.0000100 epoch_Time:25470.0min: [2024-01-05 06:52:48,808][model8_pretrain.py][INFO] Epoch:[0/2](578300/4588595) loss:3.140 lr:0.0000100 epoch_Time:25471.0min: [2024-01-05 06:52:48,808][model8_pretrain.py][INFO] Epoch:[0/2](578300/4588595) loss:2.920 lr:0.0000100 epoch_Time:25470.0min: [2024-01-05 06:52:48,808][model8_pretrain.py][INFO] Epoch:[0/2](578300/4588595) loss:2.512 lr:0.0000100 epoch_Time:25470.0min: [2024-01-05 06:52:48,809][model8_pretrain.py][INFO] Epoch:[0/2](578300/4588595) loss:3.055 lr:0.0000100 epoch_Time:25470.0min: [2024-01-05 06:53:25,763][model8_pretrain.py][INFO] Epoch:[0/2](578400/4588595) loss:3.073 lr:0.0000100 epoch_Time:25470.0min: [2024-01-05 06:53:25,763][model8_pretrain.py][INFO] Epoch:[0/2](578400/4588595) loss:3.076 lr:0.0000100 epoch_Time:25470.0min: [2024-01-05 06:53:25,763][model8_pretrain.py][INFO] Epoch:[0/2](578400/4588595) loss:2.584 lr:0.0000100 epoch_Time:25470.0min: [2024-01-05 06:53:25,763][model8_pretrain.py][INFO] Epoch:[0/2](578400/4588595) loss:2.824 lr:0.0000100 epoch_Time:25470.0min: [2024-01-05 06:53:25,763][model8_pretrain.py][INFO] Epoch:[0/2](578400/4588595) loss:2.737 lr:0.0000100 epoch_Time:25470.0min: [2024-01-05 06:53:25,763][model8_pretrain.py][INFO] Epoch:[0/2](578400/4588595) loss:2.321 lr:0.0000100 epoch_Time:25470.0min: [2024-01-05 06:53:25,763][model8_pretrain.py][INFO] Epoch:[0/2](578400/4588595) loss:2.801 lr:0.0000100 epoch_Time:25470.0min: [2024-01-05 06:53:25,764][model8_pretrain.py][INFO] Epoch:[0/2](578400/4588595) loss:2.354 lr:0.0000100 epoch_Time:25470.0min: [2024-01-05 06:54:13,006][model8_pretrain.py][INFO] Epoch:[0/2](578500/4588595) loss:2.700 lr:0.0000100 epoch_Time:25471.0min: [2024-01-05 
06:54:13,006][model8_pretrain.py][INFO] Epoch:[0/2](578500/4588595) loss:3.313 lr:0.0000100 epoch_Time:25471.0min: [2024-01-05 06:54:13,006][model8_pretrain.py][INFO] Epoch:[0/2](578500/4588595) loss:2.721 lr:0.0000100 epoch_Time:25471.0min: [2024-01-05 06:54:13,006][model8_pretrain.py][INFO] Epoch:[0/2](578500/4588595) loss:2.921 lr:0.0000100 epoch_Time:25471.0min: [2024-01-05 06:54:13,006][model8_pretrain.py][INFO] Epoch:[0/2](578500/4588595) loss:2.802 lr:0.0000100 epoch_Time:25471.0min: [2024-01-05 06:54:13,006][model8_pretrain.py][INFO] Epoch:[0/2](578500/4588595) loss:2.549 lr:0.0000100 epoch_Time:25471.0min: [2024-01-05 06:54:13,006][model8_pretrain.py][INFO] Epoch:[0/2](578500/4588595) loss:2.408 lr:0.0000100 epoch_Time:25471.0min: [2024-01-05 06:54:13,006][model8_pretrain.py][INFO] Epoch:[0/2](578500/4588595) loss:3.122 lr:0.0000100 epoch_Time:25471.0min: [2024-01-05 06:54:49,955][model8_pretrain.py][INFO] Epoch:[0/2](578600/4588595) loss:3.051 lr:0.0000100 epoch_Time:25469.0min: [2024-01-05 06:54:49,955][model8_pretrain.py][INFO] Epoch:[0/2](578600/4588595) loss:3.067 lr:0.0000100 epoch_Time:25469.0min: [2024-01-05 06:54:49,955][model8_pretrain.py][INFO] Epoch:[0/2](578600/4588595) loss:2.982 lr:0.0000100 epoch_Time:25469.0min: [2024-01-05 06:54:49,955][model8_pretrain.py][INFO] Epoch:[0/2](578600/4588595) loss:2.369 lr:0.0000100 epoch_Time:25469.0min: [2024-01-05 06:54:49,955][model8_pretrain.py][INFO] Epoch:[0/2](578600/4588595) loss:2.629 lr:0.0000100 epoch_Time:25469.0min: [2024-01-05 06:54:49,955][model8_pretrain.py][INFO] Epoch:[0/2](578600/4588595) loss:2.970 lr:0.0000100 epoch_Time:25469.0min: [2024-01-05 06:54:49,955][model8_pretrain.py][INFO] Epoch:[0/2](578600/4588595) loss:2.998 lr:0.0000100 epoch_Time:25469.0min: [2024-01-05 06:54:49,955][model8_pretrain.py][INFO] Epoch:[0/2](578600/4588595) loss:3.242 lr:0.0000100 epoch_Time:25469.0min: [2024-01-05 06:55:26,904][model8_pretrain.py][INFO] Epoch:[0/2](578700/4588595) loss:2.762 lr:0.0000100 
epoch_Time:25469.0min: [2024-01-05 06:55:26,904][model8_pretrain.py][INFO] Epoch:[0/2](578700/4588595) loss:2.731 lr:0.0000100 epoch_Time:25469.0min: [2024-01-05 06:55:26,904][model8_pretrain.py][INFO] Epoch:[0/2](578700/4588595) loss:2.432 lr:0.0000100 epoch_Time:25469.0min: [2024-01-05 06:55:26,904][model8_pretrain.py][INFO] Epoch:[0/2](578700/4588595) loss:2.955 lr:0.0000100 epoch_Time:25469.0min: [2024-01-05 06:55:26,905][model8_pretrain.py][INFO] Epoch:[0/2](578700/4588595) loss:2.762 lr:0.0000100 epoch_Time:25469.0min: [2024-01-05 06:55:26,905][model8_pretrain.py][INFO] Epoch:[0/2](578700/4588595) loss:3.213 lr:0.0000100 epoch_Time:25469.0min: [2024-01-05 06:55:26,905][model8_pretrain.py][INFO] Epoch:[0/2](578700/4588595) loss:2.833 lr:0.0000100 epoch_Time:25469.0min: [2024-01-05 06:55:26,905][model8_pretrain.py][INFO] Epoch:[0/2](578700/4588595) loss:2.417 lr:0.0000100 epoch_Time:25469.0min: [2024-01-05 06:56:03,846][model8_pretrain.py][INFO] Epoch:[0/2](578800/4588595) loss:2.808 lr:0.0000100 epoch_Time:25468.0min: [2024-01-05 06:56:03,846][model8_pretrain.py][INFO] Epoch:[0/2](578800/4588595) loss:2.848 lr:0.0000100 epoch_Time:25468.0min: [2024-01-05 06:56:03,846][model8_pretrain.py][INFO] Epoch:[0/2](578800/4588595) loss:2.629 lr:0.0000100 epoch_Time:25468.0min: [2024-01-05 06:56:03,846][model8_pretrain.py][INFO] Epoch:[0/2](578800/4588595) loss:2.556 lr:0.0000100 epoch_Time:25468.0min: [2024-01-05 06:56:03,846][model8_pretrain.py][INFO] Epoch:[0/2](578800/4588595) loss:2.829 lr:0.0000100 epoch_Time:25468.0min: [2024-01-05 06:56:03,846][model8_pretrain.py][INFO] Epoch:[0/2](578800/4588595) loss:2.891 lr:0.0000100 epoch_Time:25468.0min: [2024-01-05 06:56:03,846][model8_pretrain.py][INFO] Epoch:[0/2](578800/4588595) loss:3.021 lr:0.0000100 epoch_Time:25468.0min: [2024-01-05 06:56:03,846][model8_pretrain.py][INFO] Epoch:[0/2](578800/4588595) loss:2.932 lr:0.0000100 epoch_Time:25468.0min: [2024-01-05 06:56:40,783][model8_pretrain.py][INFO] 
Epoch:[0/2](578900/4588595) loss:3.095 lr:0.0000100 epoch_Time:25468.0min: [2024-01-05 06:56:40,783][model8_pretrain.py][INFO] Epoch:[0/2](578900/4588595) loss:2.414 lr:0.0000100 epoch_Time:25468.0min: [2024-01-05 06:56:40,783][model8_pretrain.py][INFO] Epoch:[0/2](578900/4588595) loss:3.028 lr:0.0000100 epoch_Time:25468.0min: [2024-01-05 06:56:40,783][model8_pretrain.py][INFO] Epoch:[0/2](578900/4588595) loss:2.602 lr:0.0000100 epoch_Time:25468.0min: [2024-01-05 06:56:40,783][model8_pretrain.py][INFO] Epoch:[0/2](578900/4588595) loss:2.918 lr:0.0000100 epoch_Time:25468.0min: [2024-01-05 06:56:40,784][model8_pretrain.py][INFO] Epoch:[0/2](578900/4588595) loss:3.157 lr:0.0000100 epoch_Time:25468.0min: [2024-01-05 06:56:40,784][model8_pretrain.py][INFO] Epoch:[0/2](578900/4588595) loss:2.476 lr:0.0000100 epoch_Time:25468.0min: [2024-01-05 06:56:40,783][model8_pretrain.py][INFO] Epoch:[0/2](578900/4588595) loss:2.915 lr:0.0000100 epoch_Time:25468.0min: [2024-01-05 06:57:17,728][model8_pretrain.py][INFO] Epoch:[0/2](579000/4588595) loss:2.739 lr:0.0000100 epoch_Time:25467.0min: [2024-01-05 06:57:17,728][model8_pretrain.py][INFO] Epoch:[0/2](579000/4588595) loss:2.658 lr:0.0000100 epoch_Time:25467.0min: [2024-01-05 06:57:17,728][model8_pretrain.py][INFO] Epoch:[0/2](579000/4588595) loss:2.675 lr:0.0000100 epoch_Time:25467.0min: [2024-01-05 06:57:17,728][model8_pretrain.py][INFO] Epoch:[0/2](579000/4588595) loss:3.087 lr:0.0000100 epoch_Time:25467.0min: [2024-01-05 06:57:17,728][model8_pretrain.py][INFO] Epoch:[0/2](579000/4588595) loss:3.146 lr:0.0000100 epoch_Time:25467.0min: [2024-01-05 06:57:17,728][model8_pretrain.py][INFO] Epoch:[0/2](579000/4588595) loss:2.418 lr:0.0000100 epoch_Time:25467.0min: [2024-01-05 06:57:17,728][model8_pretrain.py][INFO] Epoch:[0/2](579000/4588595) loss:3.260 lr:0.0000100 epoch_Time:25467.0min: [2024-01-05 06:57:17,728][model8_pretrain.py][INFO] Epoch:[0/2](579000/4588595) loss:3.359 lr:0.0000100 epoch_Time:25467.0min: [2024-01-05 
06:57:54,662][model8_pretrain.py][INFO] Epoch:[0/2](579100/4588595) loss:2.453 lr:0.0000100 epoch_Time:25466.0min: [2024-01-05 06:57:54,662][model8_pretrain.py][INFO] Epoch:[0/2](579100/4588595) loss:2.553 lr:0.0000100 epoch_Time:25466.0min: [2024-01-05 06:57:54,662][model8_pretrain.py][INFO] Epoch:[0/2](579100/4588595) loss:2.806 lr:0.0000100 epoch_Time:25466.0min: [2024-01-05 06:57:54,662][model8_pretrain.py][INFO] Epoch:[0/2](579100/4588595) loss:2.653 lr:0.0000100 epoch_Time:25466.0min: [2024-01-05 06:57:54,662][model8_pretrain.py][INFO] Epoch:[0/2](579100/4588595) loss:2.563 lr:0.0000100 epoch_Time:25466.0min: [2024-01-05 06:57:54,662][model8_pretrain.py][INFO] Epoch:[0/2](579100/4588595) loss:2.605 lr:0.0000100 epoch_Time:25466.0min: [2024-01-05 06:57:54,662][model8_pretrain.py][INFO] Epoch:[0/2](579100/4588595) loss:3.519 lr:0.0000100 epoch_Time:25466.0min: [2024-01-05 06:57:54,662][model8_pretrain.py][INFO] Epoch:[0/2](579100/4588595) loss:2.789 lr:0.0000100 epoch_Time:25466.0min: [2024-01-05 06:58:31,620][model8_pretrain.py][INFO] Epoch:[0/2](579200/4588595) loss:2.812 lr:0.0000100 epoch_Time:25465.0min: [2024-01-05 06:58:31,620][model8_pretrain.py][INFO] Epoch:[0/2](579200/4588595) loss:2.952 lr:0.0000100 epoch_Time:25465.0min: [2024-01-05 06:58:31,620][model8_pretrain.py][INFO] Epoch:[0/2](579200/4588595) loss:2.875 lr:0.0000100 epoch_Time:25465.0min: [2024-01-05 06:58:31,620][model8_pretrain.py][INFO] Epoch:[0/2](579200/4588595) loss:3.066 lr:0.0000100 epoch_Time:25465.0min: [2024-01-05 06:58:31,620][model8_pretrain.py][INFO] Epoch:[0/2](579200/4588595) loss:3.245 lr:0.0000100 epoch_Time:25465.0min: [2024-01-05 06:58:31,620][model8_pretrain.py][INFO] Epoch:[0/2](579200/4588595) loss:2.518 lr:0.0000100 epoch_Time:25465.0min: [2024-01-05 06:58:31,620][model8_pretrain.py][INFO] Epoch:[0/2](579200/4588595) loss:2.707 lr:0.0000100 epoch_Time:25465.0min: [2024-01-05 06:58:31,620][model8_pretrain.py][INFO] Epoch:[0/2](579200/4588595) loss:3.530 lr:0.0000100 
epoch_Time:25465.0min: [2024-01-05 06:59:18,803][model8_pretrain.py][INFO] Epoch:[0/2](579300/4588595) loss:2.722 lr:0.0000100 epoch_Time:25466.0min: [2024-01-05 06:59:18,803][model8_pretrain.py][INFO] Epoch:[0/2](579300/4588595) loss:2.682 lr:0.0000100 epoch_Time:25466.0min: [2024-01-05 06:59:18,803][model8_pretrain.py][INFO] Epoch:[0/2](579300/4588595) loss:3.005 lr:0.0000100 epoch_Time:25466.0min: [2024-01-05 06:59:18,803][model8_pretrain.py][INFO] Epoch:[0/2](579300/4588595) loss:2.991 lr:0.0000100 epoch_Time:25466.0min: [2024-01-05 06:59:18,803][model8_pretrain.py][INFO] Epoch:[0/2](579300/4588595) loss:2.732 lr:0.0000100 epoch_Time:25466.0min: [2024-01-05 06:59:18,803][model8_pretrain.py][INFO] Epoch:[0/2](579300/4588595) loss:2.600 lr:0.0000100 epoch_Time:25466.0min: [2024-01-05 06:59:18,803][model8_pretrain.py][INFO] Epoch:[0/2](579300/4588595) loss:3.093 lr:0.0000100 epoch_Time:25466.0min: [2024-01-05 06:59:18,803][model8_pretrain.py][INFO] Epoch:[0/2](579300/4588595) loss:2.997 lr:0.0000100 epoch_Time:25466.0min: [2024-01-05 06:59:55,719][model8_pretrain.py][INFO] Epoch:[0/2](579400/4588595) loss:2.316 lr:0.0000100 epoch_Time:25465.0min: [2024-01-05 06:59:55,719][model8_pretrain.py][INFO] Epoch:[0/2](579400/4588595) loss:2.878 lr:0.0000100 epoch_Time:25465.0min: [2024-01-05 06:59:55,719][model8_pretrain.py][INFO] Epoch:[0/2](579400/4588595) loss:3.029 lr:0.0000100 epoch_Time:25465.0min: [2024-01-05 06:59:55,719][model8_pretrain.py][INFO] Epoch:[0/2](579400/4588595) loss:3.282 lr:0.0000100 epoch_Time:25465.0min: [2024-01-05 06:59:55,719][model8_pretrain.py][INFO] Epoch:[0/2](579400/4588595) loss:2.339 lr:0.0000100 epoch_Time:25465.0min: [2024-01-05 06:59:55,719][model8_pretrain.py][INFO] Epoch:[0/2](579400/4588595) loss:2.917 lr:0.0000100 epoch_Time:25465.0min: [2024-01-05 06:59:55,719][model8_pretrain.py][INFO] Epoch:[0/2](579400/4588595) loss:2.934 lr:0.0000100 epoch_Time:25465.0min: [2024-01-05 06:59:55,719][model8_pretrain.py][INFO] 
Epoch:[0/2](579400/4588595) loss:3.196 lr:0.0000100 epoch_Time:25465.0min: [2024-01-05 07:00:32,660][model8_pretrain.py][INFO] Epoch:[0/2](579500/4588595) loss:2.903 lr:0.0000100 epoch_Time:25464.0min: [2024-01-05 07:00:32,660][model8_pretrain.py][INFO] Epoch:[0/2](579500/4588595) loss:2.955 lr:0.0000100 epoch_Time:25464.0min: [2024-01-05 07:00:32,660][model8_pretrain.py][INFO] Epoch:[0/2](579500/4588595) loss:2.386 lr:0.0000100 epoch_Time:25464.0min: [2024-01-05 07:00:32,660][model8_pretrain.py][INFO] Epoch:[0/2](579500/4588595) loss:2.811 lr:0.0000100 epoch_Time:25464.0min: [2024-01-05 07:00:32,660][model8_pretrain.py][INFO] Epoch:[0/2](579500/4588595) loss:1.938 lr:0.0000100 epoch_Time:25464.0min: [2024-01-05 07:00:32,660][model8_pretrain.py][INFO] Epoch:[0/2](579500/4588595) loss:3.153 lr:0.0000100 epoch_Time:25464.0min: [2024-01-05 07:00:32,660][model8_pretrain.py][INFO] Epoch:[0/2](579500/4588595) loss:2.800 lr:0.0000100 epoch_Time:25464.0min: [2024-01-05 07:00:32,660][model8_pretrain.py][INFO] Epoch:[0/2](579500/4588595) loss:2.815 lr:0.0000100 epoch_Time:25464.0min: [2024-01-05 07:01:09,588][model8_pretrain.py][INFO] Epoch:[0/2](579600/4588595) loss:3.032 lr:0.0000100 epoch_Time:25463.0min: [2024-01-05 07:01:09,588][model8_pretrain.py][INFO] Epoch:[0/2](579600/4588595) loss:2.166 lr:0.0000100 epoch_Time:25463.0min: [2024-01-05 07:01:09,588][model8_pretrain.py][INFO] Epoch:[0/2](579600/4588595) loss:2.074 lr:0.0000100 epoch_Time:25463.0min: [2024-01-05 07:01:09,588][model8_pretrain.py][INFO] Epoch:[0/2](579600/4588595) loss:3.034 lr:0.0000100 epoch_Time:25463.0min: [2024-01-05 07:01:09,588][model8_pretrain.py][INFO] Epoch:[0/2](579600/4588595) loss:2.714 lr:0.0000100 epoch_Time:25463.0min: [2024-01-05 07:01:09,588][model8_pretrain.py][INFO] Epoch:[0/2](579600/4588595) loss:3.189 lr:0.0000100 epoch_Time:25463.0min: [2024-01-05 07:01:09,588][model8_pretrain.py][INFO] Epoch:[0/2](579600/4588595) loss:2.233 lr:0.0000100 epoch_Time:25463.0min: [2024-01-05 
07:01:09,588][model8_pretrain.py][INFO] Epoch:[0/2](579600/4588595) loss:2.606 lr:0.0000100 epoch_Time:25463.0min: [2024-01-05 07:01:46,484][model8_pretrain.py][INFO] Epoch:[0/2](579700/4588595) loss:2.535 lr:0.0000100 epoch_Time:25463.0min: [2024-01-05 07:01:46,484][model8_pretrain.py][INFO] Epoch:[0/2](579700/4588595) loss:3.069 lr:0.0000100 epoch_Time:25463.0min: [2024-01-05 07:01:46,484][model8_pretrain.py][INFO] Epoch:[0/2](579700/4588595) loss:2.396 lr:0.0000100 epoch_Time:25463.0min: [2024-01-05 07:01:46,484][model8_pretrain.py][INFO] Epoch:[0/2](579700/4588595) loss:2.499 lr:0.0000100 epoch_Time:25463.0min: [2024-01-05 07:01:46,484][model8_pretrain.py][INFO] Epoch:[0/2](579700/4588595) loss:3.261 lr:0.0000100 epoch_Time:25463.0min: [2024-01-05 07:01:46,484][model8_pretrain.py][INFO] Epoch:[0/2](579700/4588595) loss:2.784 lr:0.0000100 epoch_Time:25463.0min: [2024-01-05 07:01:46,484][model8_pretrain.py][INFO] Epoch:[0/2](579700/4588595) loss:2.752 lr:0.0000100 epoch_Time:25463.0min: [2024-01-05 07:01:46,485][model8_pretrain.py][INFO] Epoch:[0/2](579700/4588595) loss:3.205 lr:0.0000100 epoch_Time:25463.0min: [2024-01-05 07:02:23,403][model8_pretrain.py][INFO] Epoch:[0/2](579800/4588595) loss:3.280 lr:0.0000100 epoch_Time:25462.0min: [2024-01-05 07:02:23,403][model8_pretrain.py][INFO] Epoch:[0/2](579800/4588595) loss:2.754 lr:0.0000100 epoch_Time:25462.0min: [2024-01-05 07:02:23,403][model8_pretrain.py][INFO] Epoch:[0/2](579800/4588595) loss:2.685 lr:0.0000100 epoch_Time:25462.0min: [2024-01-05 07:02:23,403][model8_pretrain.py][INFO] Epoch:[0/2](579800/4588595) loss:2.907 lr:0.0000100 epoch_Time:25462.0min: [2024-01-05 07:02:23,403][model8_pretrain.py][INFO] Epoch:[0/2](579800/4588595) loss:3.119 lr:0.0000100 epoch_Time:25462.0min: [2024-01-05 07:02:23,403][model8_pretrain.py][INFO] Epoch:[0/2](579800/4588595) loss:2.781 lr:0.0000100 epoch_Time:25462.0min: [2024-01-05 07:02:23,403][model8_pretrain.py][INFO] Epoch:[0/2](579800/4588595) loss:2.933 lr:0.0000100 
epoch_Time:25462.0min: [2024-01-05 07:02:23,403][model8_pretrain.py][INFO] Epoch:[0/2](579800/4588595) loss:2.663 lr:0.0000100 epoch_Time:25462.0min: [2024-01-05 07:03:00,326][model8_pretrain.py][INFO] Epoch:[0/2](579900/4588595) loss:2.772 lr:0.0000100 epoch_Time:25461.0min: [2024-01-05 07:03:00,326][model8_pretrain.py][INFO] Epoch:[0/2](579900/4588595) loss:2.587 lr:0.0000100 epoch_Time:25461.0min: [2024-01-05 07:03:00,326][model8_pretrain.py][INFO] Epoch:[0/2](579900/4588595) loss:2.944 lr:0.0000100 epoch_Time:25461.0min: [2024-01-05 07:03:00,326][model8_pretrain.py][INFO] Epoch:[0/2](579900/4588595) loss:3.095 lr:0.0000100 epoch_Time:25461.0min: [2024-01-05 07:03:00,326][model8_pretrain.py][INFO] Epoch:[0/2](579900/4588595) loss:2.413 lr:0.0000100 epoch_Time:25461.0min: [2024-01-05 07:03:00,326][model8_pretrain.py][INFO] Epoch:[0/2](579900/4588595) loss:3.275 lr:0.0000100 epoch_Time:25461.0min: [2024-01-05 07:03:00,326][model8_pretrain.py][INFO] Epoch:[0/2](579900/4588595) loss:2.618 lr:0.0000100 epoch_Time:25461.0min: [2024-01-05 07:03:00,326][model8_pretrain.py][INFO] Epoch:[0/2](579900/4588595) loss:3.308 lr:0.0000100 epoch_Time:25461.0min: [2024-01-05 07:03:37,240][model8_pretrain.py][INFO] Epoch:[0/2](580000/4588595) loss:2.879 lr:0.0000100 epoch_Time:25461.0min: [2024-01-05 07:03:37,240][model8_pretrain.py][INFO] Epoch:[0/2](580000/4588595) loss:3.057 lr:0.0000100 epoch_Time:25461.0min: [2024-01-05 07:03:37,240][model8_pretrain.py][INFO] Epoch:[0/2](580000/4588595) loss:2.833 lr:0.0000100 epoch_Time:25461.0min: [2024-01-05 07:03:37,240][model8_pretrain.py][INFO] Epoch:[0/2](580000/4588595) loss:2.365 lr:0.0000100 epoch_Time:25461.0min: [2024-01-05 07:03:37,240][model8_pretrain.py][INFO] Epoch:[0/2](580000/4588595) loss:3.242 lr:0.0000100 epoch_Time:25461.0min: [2024-01-05 07:03:37,240][model8_pretrain.py][INFO] Epoch:[0/2](580000/4588595) loss:3.073 lr:0.0000100 epoch_Time:25461.0min: [2024-01-05 07:03:37,240][model8_pretrain.py][INFO] 
Epoch:[0/2](580000/4588595) loss:3.385 lr:0.0000100 epoch_Time:25461.0min: [2024-01-05 07:03:37,240][model8_pretrain.py][INFO] Epoch:[0/2](580000/4588595) loss:3.134 lr:0.0000100 epoch_Time:25461.0min: [2024-01-05 07:04:24,160][model8_pretrain.py][INFO] Epoch:[0/2](580100/4588595) loss:2.487 lr:0.0000100 epoch_Time:25461.0min: [2024-01-05 07:04:24,160][model8_pretrain.py][INFO] Epoch:[0/2](580100/4588595) loss:3.483 lr:0.0000100 epoch_Time:25461.0min: [2024-01-05 07:04:24,160][model8_pretrain.py][INFO] Epoch:[0/2](580100/4588595) loss:3.059 lr:0.0000100 epoch_Time:25461.0min: [2024-01-05 07:04:24,160][model8_pretrain.py][INFO] Epoch:[0/2](580100/4588595) loss:3.377 lr:0.0000100 epoch_Time:25461.0min: [2024-01-05 07:04:24,160][model8_pretrain.py][INFO] Epoch:[0/2](580100/4588595) loss:3.322 lr:0.0000100 epoch_Time:25461.0min: [2024-01-05 07:04:24,160][model8_pretrain.py][INFO] Epoch:[0/2](580100/4588595) loss:2.666 lr:0.0000100 epoch_Time:25461.0min: [2024-01-05 07:04:24,160][model8_pretrain.py][INFO] Epoch:[0/2](580100/4588595) loss:3.157 lr:0.0000100 epoch_Time:25461.0min: [2024-01-05 07:04:24,160][model8_pretrain.py][INFO] Epoch:[0/2](580100/4588595) loss:2.553 lr:0.0000100 epoch_Time:25461.0min: [2024-01-05 07:05:01,107][model8_pretrain.py][INFO] Epoch:[0/2](580200/4588595) loss:3.252 lr:0.0000100 epoch_Time:25460.0min: [2024-01-05 07:05:01,107][model8_pretrain.py][INFO] Epoch:[0/2](580200/4588595) loss:2.104 lr:0.0000100 epoch_Time:25460.0min: [2024-01-05 07:05:01,107][model8_pretrain.py][INFO] Epoch:[0/2](580200/4588595) loss:2.669 lr:0.0000100 epoch_Time:25460.0min: [2024-01-05 07:05:01,107][model8_pretrain.py][INFO] Epoch:[0/2](580200/4588595) loss:2.478 lr:0.0000100 epoch_Time:25460.0min: [2024-01-05 07:05:01,107][model8_pretrain.py][INFO] Epoch:[0/2](580200/4588595) loss:2.345 lr:0.0000100 epoch_Time:25460.0min: [2024-01-05 07:05:01,107][model8_pretrain.py][INFO] Epoch:[0/2](580200/4588595) loss:3.054 lr:0.0000100 epoch_Time:25460.0min: [2024-01-05 
07:05:01,107][model8_pretrain.py][INFO] Epoch:[0/2](580200/4588595) loss:3.249 lr:0.0000100 epoch_Time:25460.0min: [2024-01-05 07:05:01,107][model8_pretrain.py][INFO] Epoch:[0/2](580200/4588595) loss:2.138 lr:0.0000100 epoch_Time:25460.0min: [2024-01-05 07:05:38,075][model8_pretrain.py][INFO] Epoch:[0/2](580300/4588595) loss:2.228 lr:0.0000100 epoch_Time:25459.0min: [2024-01-05 07:05:38,075][model8_pretrain.py][INFO] Epoch:[0/2](580300/4588595) loss:2.089 lr:0.0000100 epoch_Time:25459.0min: [2024-01-05 07:05:38,075][model8_pretrain.py][INFO] Epoch:[0/2](580300/4588595) loss:2.698 lr:0.0000100 epoch_Time:25459.0min: [2024-01-05 07:05:38,075][model8_pretrain.py][INFO] Epoch:[0/2](580300/4588595) loss:3.217 lr:0.0000100 epoch_Time:25459.0min: [2024-01-05 07:05:38,075][model8_pretrain.py][INFO] Epoch:[0/2](580300/4588595) loss:3.180 lr:0.0000100 epoch_Time:25459.0min: [2024-01-05 07:05:38,075][model8_pretrain.py][INFO] Epoch:[0/2](580300/4588595) loss:3.177 lr:0.0000100 epoch_Time:25459.0min: [2024-01-05 07:05:38,076][model8_pretrain.py][INFO] Epoch:[0/2](580300/4588595) loss:2.891 lr:0.0000100 epoch_Time:25459.0min: [2024-01-05 07:05:38,076][model8_pretrain.py][INFO] Epoch:[0/2](580300/4588595) loss:2.983 lr:0.0000100 epoch_Time:25459.0min: [2024-01-05 07:06:15,042][model8_pretrain.py][INFO] Epoch:[0/2](580400/4588595) loss:3.082 lr:0.0000100 epoch_Time:25458.0min: [2024-01-05 07:06:15,042][model8_pretrain.py][INFO] Epoch:[0/2](580400/4588595) loss:2.970 lr:0.0000100 epoch_Time:25458.0min: [2024-01-05 07:06:15,042][model8_pretrain.py][INFO] Epoch:[0/2](580400/4588595) loss:2.249 lr:0.0000100 epoch_Time:25458.0min: [2024-01-05 07:06:15,042][model8_pretrain.py][INFO] Epoch:[0/2](580400/4588595) loss:2.837 lr:0.0000100 epoch_Time:25458.0min: [2024-01-05 07:06:15,042][model8_pretrain.py][INFO] Epoch:[0/2](580400/4588595) loss:2.940 lr:0.0000100 epoch_Time:25458.0min: [2024-01-05 07:06:15,042][model8_pretrain.py][INFO] Epoch:[0/2](580400/4588595) loss:3.185 lr:0.0000100 
epoch_Time:25458.0min: [2024-01-05 07:06:15,042][model8_pretrain.py][INFO] Epoch:[0/2](580400/4588595) loss:2.803 lr:0.0000100 epoch_Time:25458.0min: [2024-01-05 07:06:15,042][model8_pretrain.py][INFO] Epoch:[0/2](580400/4588595) loss:2.773 lr:0.0000100 epoch_Time:25458.0min: [2024-01-05 07:06:52,017][model8_pretrain.py][INFO] Epoch:[0/2](580500/4588595) loss:2.600 lr:0.0000100 epoch_Time:25457.0min: [2024-01-05 07:06:52,017][model8_pretrain.py][INFO] Epoch:[0/2](580500/4588595) loss:2.225 lr:0.0000100 epoch_Time:25457.0min: [2024-01-05 07:06:52,017][model8_pretrain.py][INFO] Epoch:[0/2](580500/4588595) loss:3.118 lr:0.0000100 epoch_Time:25457.0min: [2024-01-05 07:06:52,017][model8_pretrain.py][INFO] Epoch:[0/2](580500/4588595) loss:2.654 lr:0.0000100 epoch_Time:25457.0min: [2024-01-05 07:06:52,017][model8_pretrain.py][INFO] Epoch:[0/2](580500/4588595) loss:2.234 lr:0.0000100 epoch_Time:25457.0min: [2024-01-05 07:06:52,017][model8_pretrain.py][INFO] Epoch:[0/2](580500/4588595) loss:3.036 lr:0.0000100 epoch_Time:25457.0min: [2024-01-05 07:06:52,017][model8_pretrain.py][INFO] Epoch:[0/2](580500/4588595) loss:3.183 lr:0.0000100 epoch_Time:25457.0min: [2024-01-05 07:06:52,017][model8_pretrain.py][INFO] Epoch:[0/2](580500/4588595) loss:2.848 lr:0.0000100 epoch_Time:25457.0min: [2024-01-05 07:07:28,952][model8_pretrain.py][INFO] Epoch:[0/2](580600/4588595) loss:3.276 lr:0.0000100 epoch_Time:25457.0min: [2024-01-05 07:07:28,952][model8_pretrain.py][INFO] Epoch:[0/2](580600/4588595) loss:2.781 lr:0.0000100 epoch_Time:25457.0min: [2024-01-05 07:07:28,952][model8_pretrain.py][INFO] Epoch:[0/2](580600/4588595) loss:3.225 lr:0.0000100 epoch_Time:25457.0min: [2024-01-05 07:07:28,952][model8_pretrain.py][INFO] Epoch:[0/2](580600/4588595) loss:2.179 lr:0.0000100 epoch_Time:25457.0min: [2024-01-05 07:07:28,952][model8_pretrain.py][INFO] Epoch:[0/2](580600/4588595) loss:3.178 lr:0.0000100 epoch_Time:25457.0min: [2024-01-05 07:07:28,952][model8_pretrain.py][INFO] 
Epoch:[0/2](580600/4588595) loss:2.953 lr:0.0000100 epoch_Time:25457.0min: [2024-01-05 07:07:28,952][model8_pretrain.py][INFO] Epoch:[0/2](580600/4588595) loss:2.820 lr:0.0000100 epoch_Time:25457.0min: [2024-01-05 07:07:28,952][model8_pretrain.py][INFO] Epoch:[0/2](580600/4588595) loss:3.022 lr:0.0000100 epoch_Time:25457.0min: [2024-01-05 07:08:05,882][model8_pretrain.py][INFO] Epoch:[0/2](580700/4588595) loss:2.961 lr:0.0000100 epoch_Time:25456.0min: [2024-01-05 07:08:05,882][model8_pretrain.py][INFO] Epoch:[0/2](580700/4588595) loss:2.659 lr:0.0000100 epoch_Time:25456.0min: [2024-01-05 07:08:05,882][model8_pretrain.py][INFO] Epoch:[0/2](580700/4588595) loss:2.317 lr:0.0000100 epoch_Time:25456.0min: [2024-01-05 07:08:05,882][model8_pretrain.py][INFO] Epoch:[0/2](580700/4588595) loss:2.813 lr:0.0000100 epoch_Time:25456.0min: [2024-01-05 07:08:05,882][model8_pretrain.py][INFO] Epoch:[0/2](580700/4588595) loss:3.234 lr:0.0000100 epoch_Time:25456.0min: [2024-01-05 07:08:05,882][model8_pretrain.py][INFO] Epoch:[0/2](580700/4588595) loss:2.715 lr:0.0000100 epoch_Time:25456.0min: [2024-01-05 07:08:05,882][model8_pretrain.py][INFO] Epoch:[0/2](580700/4588595) loss:2.141 lr:0.0000100 epoch_Time:25456.0min: [2024-01-05 07:08:05,883][model8_pretrain.py][INFO] Epoch:[0/2](580700/4588595) loss:3.160 lr:0.0000100 epoch_Time:25456.0min: [2024-01-05 07:08:42,820][model8_pretrain.py][INFO] Epoch:[0/2](580800/4588595) loss:2.822 lr:0.0000100 epoch_Time:25456.0min: [2024-01-05 07:08:42,820][model8_pretrain.py][INFO] Epoch:[0/2](580800/4588595) loss:2.397 lr:0.0000100 epoch_Time:25456.0min: [2024-01-05 07:08:42,820][model8_pretrain.py][INFO] Epoch:[0/2](580800/4588595) loss:2.672 lr:0.0000100 epoch_Time:25456.0min: [2024-01-05 07:08:42,820][model8_pretrain.py][INFO] Epoch:[0/2](580800/4588595) loss:2.799 lr:0.0000100 epoch_Time:25456.0min: [2024-01-05 07:08:42,820][model8_pretrain.py][INFO] Epoch:[0/2](580800/4588595) loss:2.786 lr:0.0000100 epoch_Time:25456.0min: [2024-01-05 
07:08:42,820][model8_pretrain.py][INFO] Epoch:[0/2](580800/4588595) loss:2.754 lr:0.0000100 epoch_Time:25456.0min: [2024-01-05 07:08:42,820][model8_pretrain.py][INFO] Epoch:[0/2](580800/4588595) loss:3.338 lr:0.0000100 epoch_Time:25456.0min: [2024-01-05 07:08:42,820][model8_pretrain.py][INFO] Epoch:[0/2](580800/4588595) loss:3.073 lr:0.0000100 epoch_Time:25456.0min: [2024-01-05 07:09:29,664][model8_pretrain.py][INFO] Epoch:[0/2](580900/4588595) loss:2.874 lr:0.0000100 epoch_Time:25456.0min: [2024-01-05 07:09:29,664][model8_pretrain.py][INFO] Epoch:[0/2](580900/4588595) loss:2.832 lr:0.0000100 epoch_Time:25456.0min: [2024-01-05 07:09:29,664][model8_pretrain.py][INFO] Epoch:[0/2](580900/4588595) loss:2.664 lr:0.0000100 epoch_Time:25456.0min: [2024-01-05 07:09:29,664][model8_pretrain.py][INFO] Epoch:[0/2](580900/4588595) loss:2.353 lr:0.0000100 epoch_Time:25456.0min: [2024-01-05 07:09:29,664][model8_pretrain.py][INFO] Epoch:[0/2](580900/4588595) loss:2.915 lr:0.0000100 epoch_Time:25456.0min: [2024-01-05 07:09:29,664][model8_pretrain.py][INFO] Epoch:[0/2](580900/4588595) loss:2.721 lr:0.0000100 epoch_Time:25456.0min: [2024-01-05 07:09:29,664][model8_pretrain.py][INFO] Epoch:[0/2](580900/4588595) loss:3.169 lr:0.0000100 epoch_Time:25456.0min: [2024-01-05 07:09:29,664][model8_pretrain.py][INFO] Epoch:[0/2](580900/4588595) loss:3.137 lr:0.0000100 epoch_Time:25456.0min: [2024-01-05 07:10:06,599][model8_pretrain.py][INFO] Epoch:[0/2](581000/4588595) loss:2.775 lr:0.0000100 epoch_Time:25455.0min: [2024-01-05 07:10:06,599][model8_pretrain.py][INFO] Epoch:[0/2](581000/4588595) loss:2.642 lr:0.0000100 epoch_Time:25455.0min: [2024-01-05 07:10:06,599][model8_pretrain.py][INFO] Epoch:[0/2](581000/4588595) loss:2.696 lr:0.0000100 epoch_Time:25455.0min: [2024-01-05 07:10:06,599][model8_pretrain.py][INFO] Epoch:[0/2](581000/4588595) loss:1.906 lr:0.0000100 epoch_Time:25455.0min: [2024-01-05 07:10:06,599][model8_pretrain.py][INFO] Epoch:[0/2](581000/4588595) loss:3.135 lr:0.0000100 
epoch_Time:25455.0min: [2024-01-05 07:10:06,599][model8_pretrain.py][INFO] Epoch:[0/2](581000/4588595) loss:3.497 lr:0.0000100 epoch_Time:25455.0min: [2024-01-05 07:10:06,599][model8_pretrain.py][INFO] Epoch:[0/2](581000/4588595) loss:3.020 lr:0.0000100 epoch_Time:25455.0min: [2024-01-05 07:10:06,599][model8_pretrain.py][INFO] Epoch:[0/2](581000/4588595) loss:2.615 lr:0.0000100 epoch_Time:25455.0min: [2024-01-05 07:10:43,532][model8_pretrain.py][INFO] Epoch:[0/2](581100/4588595) loss:2.469 lr:0.0000100 epoch_Time:25455.0min: [2024-01-05 07:10:43,532][model8_pretrain.py][INFO] Epoch:[0/2](581100/4588595) loss:3.165 lr:0.0000100 epoch_Time:25455.0min: [2024-01-05 07:10:43,532][model8_pretrain.py][INFO] Epoch:[0/2](581100/4588595) loss:3.128 lr:0.0000100 epoch_Time:25455.0min: [2024-01-05 07:10:43,532][model8_pretrain.py][INFO] Epoch:[0/2](581100/4588595) loss:2.626 lr:0.0000100 epoch_Time:25455.0min: [2024-01-05 07:10:43,532][model8_pretrain.py][INFO] Epoch:[0/2](581100/4588595) loss:2.753 lr:0.0000100 epoch_Time:25455.0min: [2024-01-05 07:10:43,532][model8_pretrain.py][INFO] Epoch:[0/2](581100/4588595) loss:2.910 lr:0.0000100 epoch_Time:25455.0min: [2024-01-05 07:10:43,532][model8_pretrain.py][INFO] Epoch:[0/2](581100/4588595) loss:2.278 lr:0.0000100 epoch_Time:25455.0min: [2024-01-05 07:10:43,532][model8_pretrain.py][INFO] Epoch:[0/2](581100/4588595) loss:3.205 lr:0.0000100 epoch_Time:25455.0min: [2024-01-05 07:11:20,470][model8_pretrain.py][INFO] Epoch:[0/2](581200/4588595) loss:3.033 lr:0.0000100 epoch_Time:25453.0min: [2024-01-05 07:11:20,470][model8_pretrain.py][INFO] Epoch:[0/2](581200/4588595) loss:2.825 lr:0.0000100 epoch_Time:25453.0min: [2024-01-05 07:11:20,470][model8_pretrain.py][INFO] Epoch:[0/2](581200/4588595) loss:3.207 lr:0.0000100 epoch_Time:25453.0min: [2024-01-05 07:11:20,470][model8_pretrain.py][INFO] Epoch:[0/2](581200/4588595) loss:2.602 lr:0.0000100 epoch_Time:25453.0min: [2024-01-05 07:11:20,470][model8_pretrain.py][INFO] 
Epoch:[0/2](581200/4588595) loss:2.785 lr:0.0000100 epoch_Time:25453.0min: [2024-01-05 07:11:20,470][model8_pretrain.py][INFO] Epoch:[0/2](581200/4588595) loss:2.895 lr:0.0000100 epoch_Time:25453.0min: [2024-01-05 07:11:20,470][model8_pretrain.py][INFO] Epoch:[0/2](581200/4588595) loss:2.446 lr:0.0000100 epoch_Time:25453.0min: [2024-01-05 07:11:20,470][model8_pretrain.py][INFO] Epoch:[0/2](581200/4588595) loss:2.785 lr:0.0000100 epoch_Time:25453.0min: [2024-01-05 07:11:57,409][model8_pretrain.py][INFO] Epoch:[0/2](581300/4588595) loss:2.585 lr:0.0000100 epoch_Time:25452.0min: [2024-01-05 07:11:57,409][model8_pretrain.py][INFO] Epoch:[0/2](581300/4588595) loss:2.729 lr:0.0000100 epoch_Time:25452.0min: [2024-01-05 07:11:57,409][model8_pretrain.py][INFO] Epoch:[0/2](581300/4588595) loss:2.773 lr:0.0000100 epoch_Time:25452.0min: [2024-01-05 07:11:57,409][model8_pretrain.py][INFO] Epoch:[0/2](581300/4588595) loss:3.049 lr:0.0000100 epoch_Time:25452.0min: [2024-01-05 07:11:57,409][model8_pretrain.py][INFO] Epoch:[0/2](581300/4588595) loss:2.643 lr:0.0000100 epoch_Time:25452.0min: [2024-01-05 07:11:57,409][model8_pretrain.py][INFO] Epoch:[0/2](581300/4588595) loss:3.077 lr:0.0000100 epoch_Time:25452.0min: [2024-01-05 07:11:57,409][model8_pretrain.py][INFO] Epoch:[0/2](581300/4588595) loss:2.957 lr:0.0000100 epoch_Time:25452.0min: [2024-01-05 07:11:57,409][model8_pretrain.py][INFO] Epoch:[0/2](581300/4588595) loss:2.888 lr:0.0000100 epoch_Time:25452.0min: [2024-01-05 07:12:34,346][model8_pretrain.py][INFO] Epoch:[0/2](581400/4588595) loss:3.341 lr:0.0000100 epoch_Time:25452.0min: [2024-01-05 07:12:34,346][model8_pretrain.py][INFO] Epoch:[0/2](581400/4588595) loss:2.696 lr:0.0000100 epoch_Time:25452.0min: [2024-01-05 07:12:34,346][model8_pretrain.py][INFO] Epoch:[0/2](581400/4588595) loss:2.927 lr:0.0000100 epoch_Time:25452.0min: [2024-01-05 07:12:34,346][model8_pretrain.py][INFO] Epoch:[0/2](581400/4588595) loss:2.790 lr:0.0000100 epoch_Time:25452.0min: [2024-01-05 
07:12:34,346][model8_pretrain.py][INFO] Epoch:[0/2](581400/4588595) loss:2.858 lr:0.0000100 epoch_Time:25452.0min: [2024-01-05 07:12:34,346][model8_pretrain.py][INFO] Epoch:[0/2](581400/4588595) loss:2.746 lr:0.0000100 epoch_Time:25452.0min: [2024-01-05 07:12:34,346][model8_pretrain.py][INFO] Epoch:[0/2](581400/4588595) loss:2.382 lr:0.0000100 epoch_Time:25452.0min: [2024-01-05 07:12:34,347][model8_pretrain.py][INFO] Epoch:[0/2](581400/4588595) loss:2.699 lr:0.0000100 epoch_Time:25452.0min: [2024-01-05 07:13:11,271][model8_pretrain.py][INFO] Epoch:[0/2](581500/4588595) loss:2.883 lr:0.0000100 epoch_Time:25451.0min: [2024-01-05 07:13:11,271][model8_pretrain.py][INFO] Epoch:[0/2](581500/4588595) loss:2.611 lr:0.0000100 epoch_Time:25451.0min: [2024-01-05 07:13:11,271][model8_pretrain.py][INFO] Epoch:[0/2](581500/4588595) loss:2.962 lr:0.0000100 epoch_Time:25451.0min: [2024-01-05 07:13:11,271][model8_pretrain.py][INFO] Epoch:[0/2](581500/4588595) loss:3.063 lr:0.0000100 epoch_Time:25451.0min: [2024-01-05 07:13:11,271][model8_pretrain.py][INFO] Epoch:[0/2](581500/4588595) loss:2.814 lr:0.0000100 epoch_Time:25451.0min: [2024-01-05 07:13:11,271][model8_pretrain.py][INFO] Epoch:[0/2](581500/4588595) loss:2.632 lr:0.0000100 epoch_Time:25451.0min: [2024-01-05 07:13:11,271][model8_pretrain.py][INFO] Epoch:[0/2](581500/4588595) loss:2.895 lr:0.0000100 epoch_Time:25451.0min: [2024-01-05 07:13:11,271][model8_pretrain.py][INFO] Epoch:[0/2](581500/4588595) loss:2.432 lr:0.0000100 epoch_Time:25451.0min: [2024-01-05 07:13:48,189][model8_pretrain.py][INFO] Epoch:[0/2](581600/4588595) loss:2.837 lr:0.0000100 epoch_Time:25450.0min: [2024-01-05 07:13:48,190][model8_pretrain.py][INFO] Epoch:[0/2](581600/4588595) loss:3.011 lr:0.0000100 epoch_Time:25450.0min: [2024-01-05 07:13:48,190][model8_pretrain.py][INFO] Epoch:[0/2](581600/4588595) loss:2.645 lr:0.0000100 epoch_Time:25450.0min: [2024-01-05 07:13:48,190][model8_pretrain.py][INFO] Epoch:[0/2](581600/4588595) loss:3.354 lr:0.0000100 
epoch_Time:25450.0min: [2024-01-05 07:13:48,190][model8_pretrain.py][INFO] Epoch:[0/2](581600/4588595) loss:2.302 lr:0.0000100 epoch_Time:25450.0min: [2024-01-05 07:13:48,190][model8_pretrain.py][INFO] Epoch:[0/2](581600/4588595) loss:3.073 lr:0.0000100 epoch_Time:25450.0min: [2024-01-05 07:13:48,190][model8_pretrain.py][INFO] Epoch:[0/2](581600/4588595) loss:2.646 lr:0.0000100 epoch_Time:25450.0min: [2024-01-05 07:13:48,190][model8_pretrain.py][INFO] Epoch:[0/2](581600/4588595) loss:3.105 lr:0.0000100 epoch_Time:25450.0min: [2024-01-05 07:14:35,305][model8_pretrain.py][INFO] Epoch:[0/2](581700/4588595) loss:2.961 lr:0.0000100 epoch_Time:25451.0min: [2024-01-05 07:14:35,305][model8_pretrain.py][INFO] Epoch:[0/2](581700/4588595) loss:2.717 lr:0.0000100 epoch_Time:25451.0min: [2024-01-05 07:14:35,305][model8_pretrain.py][INFO] Epoch:[0/2](581700/4588595) loss:2.914 lr:0.0000100 epoch_Time:25451.0min: [2024-01-05 07:14:35,305][model8_pretrain.py][INFO] Epoch:[0/2](581700/4588595) loss:2.711 lr:0.0000100 epoch_Time:25451.0min: [2024-01-05 07:14:35,305][model8_pretrain.py][INFO] Epoch:[0/2](581700/4588595) loss:3.030 lr:0.0000100 epoch_Time:25451.0min: [2024-01-05 07:14:35,305][model8_pretrain.py][INFO] Epoch:[0/2](581700/4588595) loss:2.896 lr:0.0000100 epoch_Time:25451.0min: [2024-01-05 07:14:35,305][model8_pretrain.py][INFO] Epoch:[0/2](581700/4588595) loss:3.447 lr:0.0000100 epoch_Time:25451.0min: [2024-01-05 07:14:35,306][model8_pretrain.py][INFO] Epoch:[0/2](581700/4588595) loss:2.473 lr:0.0000100 epoch_Time:25451.0min: [2024-01-05 07:15:12,232][model8_pretrain.py][INFO] Epoch:[0/2](581800/4588595) loss:3.369 lr:0.0000100 epoch_Time:25450.0min: [2024-01-05 07:15:12,232][model8_pretrain.py][INFO] Epoch:[0/2](581800/4588595) loss:2.622 lr:0.0000100 epoch_Time:25450.0min: [2024-01-05 07:15:12,232][model8_pretrain.py][INFO] Epoch:[0/2](581800/4588595) loss:3.227 lr:0.0000100 epoch_Time:25450.0min: [2024-01-05 07:15:12,233][model8_pretrain.py][INFO] 
Epoch:[0/2](581800/4588595) loss:2.463 lr:0.0000100 epoch_Time:25450.0min: [2024-01-05 07:15:12,233][model8_pretrain.py][INFO] Epoch:[0/2](581800/4588595) loss:2.695 lr:0.0000100 epoch_Time:25450.0min: [2024-01-05 07:15:12,233][model8_pretrain.py][INFO] Epoch:[0/2](581800/4588595) loss:3.200 lr:0.0000100 epoch_Time:25450.0min: [2024-01-05 07:15:12,233][model8_pretrain.py][INFO] Epoch:[0/2](581800/4588595) loss:2.833 lr:0.0000100 epoch_Time:25450.0min: [2024-01-05 07:15:12,234][model8_pretrain.py][INFO] Epoch:[0/2](581800/4588595) loss:3.174 lr:0.0000100 epoch_Time:25450.0min: [2024-01-05 07:15:49,173][model8_pretrain.py][INFO] Epoch:[0/2](581900/4588595) loss:2.537 lr:0.0000100 epoch_Time:25449.0min: [2024-01-05 07:15:49,173][model8_pretrain.py][INFO] Epoch:[0/2](581900/4588595) loss:2.787 lr:0.0000100 epoch_Time:25449.0min: [2024-01-05 07:15:49,173][model8_pretrain.py][INFO] Epoch:[0/2](581900/4588595) loss:2.739 lr:0.0000100 epoch_Time:25449.0min: [2024-01-05 07:15:49,173][model8_pretrain.py][INFO] Epoch:[0/2](581900/4588595) loss:2.774 lr:0.0000100 epoch_Time:25449.0min: [2024-01-05 07:15:49,173][model8_pretrain.py][INFO] Epoch:[0/2](581900/4588595) loss:2.351 lr:0.0000100 epoch_Time:25449.0min: [2024-01-05 07:15:49,173][model8_pretrain.py][INFO] Epoch:[0/2](581900/4588595) loss:2.412 lr:0.0000100 epoch_Time:25449.0min: [2024-01-05 07:15:49,173][model8_pretrain.py][INFO] Epoch:[0/2](581900/4588595) loss:2.727 lr:0.0000100 epoch_Time:25449.0min: [2024-01-05 07:15:49,174][model8_pretrain.py][INFO] Epoch:[0/2](581900/4588595) loss:3.272 lr:0.0000100 epoch_Time:25449.0min: [2024-01-05 07:16:26,093][model8_pretrain.py][INFO] Epoch:[0/2](582000/4588595) loss:2.772 lr:0.0000100 epoch_Time:25448.0min: [2024-01-05 07:16:26,093][model8_pretrain.py][INFO] Epoch:[0/2](582000/4588595) loss:3.035 lr:0.0000100 epoch_Time:25448.0min: [2024-01-05 07:16:26,093][model8_pretrain.py][INFO] Epoch:[0/2](582000/4588595) loss:2.886 lr:0.0000100 epoch_Time:25448.0min: [2024-01-05 
07:16:26,093][model8_pretrain.py][INFO] Epoch:[0/2](582000/4588595) loss:3.311 lr:0.0000100 epoch_Time:25448.0min: [2024-01-05 07:16:26,093][model8_pretrain.py][INFO] Epoch:[0/2](582000/4588595) loss:2.706 lr:0.0000100 epoch_Time:25448.0min: [2024-01-05 07:16:26,093][model8_pretrain.py][INFO] Epoch:[0/2](582000/4588595) loss:2.795 lr:0.0000100 epoch_Time:25448.0min: [2024-01-05 07:16:26,093][model8_pretrain.py][INFO] Epoch:[0/2](582000/4588595) loss:2.960 lr:0.0000100 epoch_Time:25448.0min: [2024-01-05 07:16:26,093][model8_pretrain.py][INFO] Epoch:[0/2](582000/4588595) loss:2.352 lr:0.0000100 epoch_Time:25448.0min: [2024-01-05 07:17:03,012][model8_pretrain.py][INFO] Epoch:[0/2](582100/4588595) loss:3.194 lr:0.0000100 epoch_Time:25447.0min: [2024-01-05 07:17:03,012][model8_pretrain.py][INFO] Epoch:[0/2](582100/4588595) loss:2.632 lr:0.0000100 epoch_Time:25447.0min: [2024-01-05 07:17:03,013][model8_pretrain.py][INFO] Epoch:[0/2](582100/4588595) loss:3.240 lr:0.0000100 epoch_Time:25447.0min: [2024-01-05 07:17:03,013][model8_pretrain.py][INFO] Epoch:[0/2](582100/4588595) loss:2.830 lr:0.0000100 epoch_Time:25447.0min: [2024-01-05 07:17:03,013][model8_pretrain.py][INFO] Epoch:[0/2](582100/4588595) loss:2.797 lr:0.0000100 epoch_Time:25447.0min: [2024-01-05 07:17:03,013][model8_pretrain.py][INFO] Epoch:[0/2](582100/4588595) loss:2.977 lr:0.0000100 epoch_Time:25447.0min: [2024-01-05 07:17:03,013][model8_pretrain.py][INFO] Epoch:[0/2](582100/4588595) loss:2.512 lr:0.0000100 epoch_Time:25447.0min: [2024-01-05 07:17:03,013][model8_pretrain.py][INFO] Epoch:[0/2](582100/4588595) loss:3.120 lr:0.0000100 epoch_Time:25447.0min: [2024-01-05 07:17:39,932][model8_pretrain.py][INFO] Epoch:[0/2](582200/4588595) loss:2.765 lr:0.0000100 epoch_Time:25447.0min: [2024-01-05 07:17:39,932][model8_pretrain.py][INFO] Epoch:[0/2](582200/4588595) loss:3.112 lr:0.0000100 epoch_Time:25447.0min: [2024-01-05 07:17:39,932][model8_pretrain.py][INFO] Epoch:[0/2](582200/4588595) loss:2.966 lr:0.0000100 
epoch_Time:25447.0min: [2024-01-05 07:17:39,932][model8_pretrain.py][INFO] Epoch:[0/2](582200/4588595) loss:3.043 lr:0.0000100 epoch_Time:25447.0min: [2024-01-05 07:17:39,932][model8_pretrain.py][INFO] Epoch:[0/2](582200/4588595) loss:3.222 lr:0.0000100 epoch_Time:25447.0min: [2024-01-05 07:17:39,932][model8_pretrain.py][INFO] Epoch:[0/2](582200/4588595) loss:3.165 lr:0.0000100 epoch_Time:25447.0min: [2024-01-05 07:17:39,932][model8_pretrain.py][INFO] Epoch:[0/2](582200/4588595) loss:2.926 lr:0.0000100 epoch_Time:25447.0min: [2024-01-05 07:17:39,932][model8_pretrain.py][INFO] Epoch:[0/2](582200/4588595) loss:2.779 lr:0.0000100 epoch_Time:25447.0min: [2024-01-05 07:18:16,859][model8_pretrain.py][INFO] Epoch:[0/2](582300/4588595) loss:2.753 lr:0.0000100 epoch_Time:25446.0min: [2024-01-05 07:18:16,859][model8_pretrain.py][INFO] Epoch:[0/2](582300/4588595) loss:2.280 lr:0.0000100 epoch_Time:25446.0min: [2024-01-05 07:18:16,859][model8_pretrain.py][INFO] Epoch:[0/2](582300/4588595) loss:3.077 lr:0.0000100 epoch_Time:25446.0min: [2024-01-05 07:18:16,859][model8_pretrain.py][INFO] Epoch:[0/2](582300/4588595) loss:2.316 lr:0.0000100 epoch_Time:25446.0min: [2024-01-05 07:18:16,859][model8_pretrain.py][INFO] Epoch:[0/2](582300/4588595) loss:2.494 lr:0.0000100 epoch_Time:25446.0min: [2024-01-05 07:18:16,859][model8_pretrain.py][INFO] Epoch:[0/2](582300/4588595) loss:2.725 lr:0.0000100 epoch_Time:25446.0min: [2024-01-05 07:18:16,859][model8_pretrain.py][INFO] Epoch:[0/2](582300/4588595) loss:3.195 lr:0.0000100 epoch_Time:25446.0min: [2024-01-05 07:18:16,859][model8_pretrain.py][INFO] Epoch:[0/2](582300/4588595) loss:2.986 lr:0.0000100 epoch_Time:25446.0min: [2024-01-05 07:18:53,777][model8_pretrain.py][INFO] Epoch:[0/2](582400/4588595) loss:2.946 lr:0.0000100 epoch_Time:25445.0min: [2024-01-05 07:18:53,777][model8_pretrain.py][INFO] Epoch:[0/2](582400/4588595) loss:2.468 lr:0.0000100 epoch_Time:25445.0min: [2024-01-05 07:18:53,777][model8_pretrain.py][INFO] 
Epoch:[0/2](582400/4588595) loss:2.764 lr:0.0000100 epoch_Time:25445.0min: [2024-01-05 07:18:53,777][model8_pretrain.py][INFO] Epoch:[0/2](582400/4588595) loss:2.584 lr:0.0000100 epoch_Time:25445.0min: [2024-01-05 07:18:53,777][model8_pretrain.py][INFO] Epoch:[0/2](582400/4588595) loss:3.153 lr:0.0000100 epoch_Time:25445.0min: [2024-01-05 07:18:53,777][model8_pretrain.py][INFO] Epoch:[0/2](582400/4588595) loss:2.508 lr:0.0000100 epoch_Time:25445.0min: [2024-01-05 07:18:53,777][model8_pretrain.py][INFO] Epoch:[0/2](582400/4588595) loss:2.553 lr:0.0000100 epoch_Time:25445.0min: [2024-01-05 07:18:53,777][model8_pretrain.py][INFO] Epoch:[0/2](582400/4588595) loss:2.917 lr:0.0000100 epoch_Time:25445.0min: [2024-01-05 07:19:41,050][model8_pretrain.py][INFO] Epoch:[0/2](582500/4588595) loss:2.320 lr:0.0000100 epoch_Time:25446.0min: [2024-01-05 07:19:41,050][model8_pretrain.py][INFO] Epoch:[0/2](582500/4588595) loss:2.694 lr:0.0000100 epoch_Time:25446.0min: [2024-01-05 07:19:41,050][model8_pretrain.py][INFO] Epoch:[0/2](582500/4588595) loss:2.571 lr:0.0000100 epoch_Time:25446.0min: [2024-01-05 07:19:41,050][model8_pretrain.py][INFO] Epoch:[0/2](582500/4588595) loss:3.101 lr:0.0000100 epoch_Time:25446.0min: [2024-01-05 07:19:41,050][model8_pretrain.py][INFO] Epoch:[0/2](582500/4588595) loss:3.048 lr:0.0000100 epoch_Time:25446.0min: [2024-01-05 07:19:41,050][model8_pretrain.py][INFO] Epoch:[0/2](582500/4588595) loss:2.912 lr:0.0000100 epoch_Time:25446.0min: [2024-01-05 07:19:41,050][model8_pretrain.py][INFO] Epoch:[0/2](582500/4588595) loss:3.101 lr:0.0000100 epoch_Time:25446.0min: [2024-01-05 07:19:41,050][model8_pretrain.py][INFO] Epoch:[0/2](582500/4588595) loss:2.470 lr:0.0000100 epoch_Time:25446.0min: [2024-01-05 07:20:17,973][model8_pretrain.py][INFO] Epoch:[0/2](582600/4588595) loss:3.212 lr:0.0000100 epoch_Time:25445.0min: [2024-01-05 07:20:17,973][model8_pretrain.py][INFO] Epoch:[0/2](582600/4588595) loss:2.960 lr:0.0000100 epoch_Time:25445.0min: [2024-01-05 
07:20:17,973][model8_pretrain.py][INFO] Epoch:[0/2](582600/4588595) loss:3.077 lr:0.0000100 epoch_Time:25445.0min: [2024-01-05 07:20:17,973][model8_pretrain.py][INFO] Epoch:[0/2](582600/4588595) loss:2.579 lr:0.0000100 epoch_Time:25445.0min: [2024-01-05 07:20:17,973][model8_pretrain.py][INFO] Epoch:[0/2](582600/4588595) loss:3.290 lr:0.0000100 epoch_Time:25445.0min: [2024-01-05 07:20:17,973][model8_pretrain.py][INFO] Epoch:[0/2](582600/4588595) loss:2.800 lr:0.0000100 epoch_Time:25445.0min: [2024-01-05 07:20:17,973][model8_pretrain.py][INFO] Epoch:[0/2](582600/4588595) loss:3.276 lr:0.0000100 epoch_Time:25445.0min: [2024-01-05 07:20:17,973][model8_pretrain.py][INFO] Epoch:[0/2](582600/4588595) loss:1.892 lr:0.0000100 epoch_Time:25445.0min: [2024-01-05 07:20:54,897][model8_pretrain.py][INFO] Epoch:[0/2](582700/4588595) loss:3.000 lr:0.0000100 epoch_Time:25444.0min: [2024-01-05 07:20:54,897][model8_pretrain.py][INFO] Epoch:[0/2](582700/4588595) loss:3.238 lr:0.0000100 epoch_Time:25444.0min: [2024-01-05 07:20:54,897][model8_pretrain.py][INFO] Epoch:[0/2](582700/4588595) loss:2.875 lr:0.0000100 epoch_Time:25444.0min: [2024-01-05 07:20:54,897][model8_pretrain.py][INFO] Epoch:[0/2](582700/4588595) loss:2.398 lr:0.0000100 epoch_Time:25444.0min: [2024-01-05 07:20:54,897][model8_pretrain.py][INFO] Epoch:[0/2](582700/4588595) loss:2.922 lr:0.0000100 epoch_Time:25444.0min: [2024-01-05 07:20:54,897][model8_pretrain.py][INFO] Epoch:[0/2](582700/4588595) loss:2.600 lr:0.0000100 epoch_Time:25444.0min: [2024-01-05 07:20:54,897][model8_pretrain.py][INFO] Epoch:[0/2](582700/4588595) loss:2.967 lr:0.0000100 epoch_Time:25444.0min: [2024-01-05 07:20:54,897][model8_pretrain.py][INFO] Epoch:[0/2](582700/4588595) loss:3.177 lr:0.0000100 epoch_Time:25444.0min: [2024-01-05 07:21:31,821][model8_pretrain.py][INFO] Epoch:[0/2](582800/4588595) loss:2.409 lr:0.0000100 epoch_Time:25444.0min: [2024-01-05 07:21:31,821][model8_pretrain.py][INFO] Epoch:[0/2](582800/4588595) loss:3.033 lr:0.0000100 
epoch_Time:25444.0min: [2024-01-05 07:21:31,821][model8_pretrain.py][INFO] Epoch:[0/2](582800/4588595) loss:2.455 lr:0.0000100 epoch_Time:25444.0min: [2024-01-05 07:21:31,821][model8_pretrain.py][INFO] Epoch:[0/2](582800/4588595) loss:2.787 lr:0.0000100 epoch_Time:25444.0min: [2024-01-05 07:21:31,821][model8_pretrain.py][INFO] Epoch:[0/2](582800/4588595) loss:3.345 lr:0.0000100 epoch_Time:25444.0min: [2024-01-05 07:21:31,821][model8_pretrain.py][INFO] Epoch:[0/2](582800/4588595) loss:3.179 lr:0.0000100 epoch_Time:25444.0min: [2024-01-05 07:21:31,821][model8_pretrain.py][INFO] Epoch:[0/2](582800/4588595) loss:2.657 lr:0.0000100 epoch_Time:25444.0min: [2024-01-05 07:21:31,821][model8_pretrain.py][INFO] Epoch:[0/2](582800/4588595) loss:2.678 lr:0.0000100 epoch_Time:25444.0min: [2024-01-05 07:22:08,750][model8_pretrain.py][INFO] Epoch:[0/2](582900/4588595) loss:3.231 lr:0.0000100 epoch_Time:25442.0min: [2024-01-05 07:22:08,750][model8_pretrain.py][INFO] Epoch:[0/2](582900/4588595) loss:2.687 lr:0.0000100 epoch_Time:25442.0min: [2024-01-05 07:22:08,750][model8_pretrain.py][INFO] Epoch:[0/2](582900/4588595) loss:2.767 lr:0.0000100 epoch_Time:25442.0min: [2024-01-05 07:22:08,750][model8_pretrain.py][INFO] Epoch:[0/2](582900/4588595) loss:2.648 lr:0.0000100 epoch_Time:25442.0min: [2024-01-05 07:22:08,750][model8_pretrain.py][INFO] Epoch:[0/2](582900/4588595) loss:2.889 lr:0.0000100 epoch_Time:25442.0min: [2024-01-05 07:22:08,750][model8_pretrain.py][INFO] Epoch:[0/2](582900/4588595) loss:2.530 lr:0.0000100 epoch_Time:25442.0min: [2024-01-05 07:22:08,750][model8_pretrain.py][INFO] Epoch:[0/2](582900/4588595) loss:2.898 lr:0.0000100 epoch_Time:25442.0min: [2024-01-05 07:22:08,750][model8_pretrain.py][INFO] Epoch:[0/2](582900/4588595) loss:2.876 lr:0.0000100 epoch_Time:25442.0min: [2024-01-05 07:22:45,670][model8_pretrain.py][INFO] Epoch:[0/2](583000/4588595) loss:3.053 lr:0.0000100 epoch_Time:25442.0min: [2024-01-05 07:22:45,670][model8_pretrain.py][INFO] 
Epoch:[0/2](583000/4588595) loss:2.700 lr:0.0000100 epoch_Time:25442.0min: [2024-01-05 07:22:45,670][model8_pretrain.py][INFO] Epoch:[0/2](583000/4588595) loss:2.688 lr:0.0000100 epoch_Time:25442.0min: [2024-01-05 07:22:45,670][model8_pretrain.py][INFO] Epoch:[0/2](583000/4588595) loss:2.675 lr:0.0000100 epoch_Time:25442.0min: [2024-01-05 07:22:45,670][model8_pretrain.py][INFO] Epoch:[0/2](583000/4588595) loss:3.176 lr:0.0000100 epoch_Time:25442.0min: [2024-01-05 07:22:45,670][model8_pretrain.py][INFO] Epoch:[0/2](583000/4588595) loss:2.388 lr:0.0000100 epoch_Time:25442.0min: [2024-01-05 07:22:45,670][model8_pretrain.py][INFO] Epoch:[0/2](583000/4588595) loss:2.302 lr:0.0000100 epoch_Time:25442.0min: [2024-01-05 07:22:45,670][model8_pretrain.py][INFO] Epoch:[0/2](583000/4588595) loss:3.036 lr:0.0000100 epoch_Time:25442.0min: [2024-01-05 07:23:22,602][model8_pretrain.py][INFO] Epoch:[0/2](583100/4588595) loss:2.775 lr:0.0000100 epoch_Time:25441.0min: [2024-01-05 07:23:22,602][model8_pretrain.py][INFO] Epoch:[0/2](583100/4588595) loss:2.811 lr:0.0000100 epoch_Time:25441.0min: [2024-01-05 07:23:22,602][model8_pretrain.py][INFO] Epoch:[0/2](583100/4588595) loss:2.860 lr:0.0000100 epoch_Time:25441.0min: [2024-01-05 07:23:22,602][model8_pretrain.py][INFO] Epoch:[0/2](583100/4588595) loss:2.407 lr:0.0000100 epoch_Time:25441.0min: [2024-01-05 07:23:22,602][model8_pretrain.py][INFO] Epoch:[0/2](583100/4588595) loss:3.221 lr:0.0000100 epoch_Time:25441.0min: [2024-01-05 07:23:22,602][model8_pretrain.py][INFO] Epoch:[0/2](583100/4588595) loss:2.724 lr:0.0000100 epoch_Time:25441.0min: [2024-01-05 07:23:22,602][model8_pretrain.py][INFO] Epoch:[0/2](583100/4588595) loss:2.888 lr:0.0000100 epoch_Time:25441.0min: [2024-01-05 07:23:22,602][model8_pretrain.py][INFO] Epoch:[0/2](583100/4588595) loss:3.263 lr:0.0000100 epoch_Time:25441.0min: [2024-01-05 07:23:59,519][model8_pretrain.py][INFO] Epoch:[0/2](583200/4588595) loss:3.196 lr:0.0000100 epoch_Time:25440.0min: [2024-01-05 
07:23:59,519][model8_pretrain.py][INFO] Epoch:[0/2](583200/4588595) loss:3.179 lr:0.0000100 epoch_Time:25440.0min: [2024-01-05 07:23:59,519][model8_pretrain.py][INFO] Epoch:[0/2](583200/4588595) loss:2.984 lr:0.0000100 epoch_Time:25440.0min: [2024-01-05 07:23:59,519][model8_pretrain.py][INFO] Epoch:[0/2](583200/4588595) loss:2.612 lr:0.0000100 epoch_Time:25440.0min: [2024-01-05 07:23:59,519][model8_pretrain.py][INFO] Epoch:[0/2](583200/4588595) loss:2.951 lr:0.0000100 epoch_Time:25440.0min: [2024-01-05 07:23:59,519][model8_pretrain.py][INFO] Epoch:[0/2](583200/4588595) loss:2.279 lr:0.0000100 epoch_Time:25440.0min: [2024-01-05 07:23:59,519][model8_pretrain.py][INFO] Epoch:[0/2](583200/4588595) loss:3.053 lr:0.0000100 epoch_Time:25440.0min: [2024-01-05 07:23:59,520][model8_pretrain.py][INFO] Epoch:[0/2](583200/4588595) loss:2.560 lr:0.0000100 epoch_Time:25440.0min: [2024-01-05 07:24:46,806][model8_pretrain.py][INFO] Epoch:[0/2](583300/4588595) loss:3.114 lr:0.0000100 epoch_Time:25441.0min: [2024-01-05 07:24:46,806][model8_pretrain.py][INFO] Epoch:[0/2](583300/4588595) loss:2.680 lr:0.0000100 epoch_Time:25441.0min: [2024-01-05 07:24:46,806][model8_pretrain.py][INFO] Epoch:[0/2](583300/4588595) loss:2.749 lr:0.0000100 epoch_Time:25441.0min: [2024-01-05 07:24:46,806][model8_pretrain.py][INFO] Epoch:[0/2](583300/4588595) loss:3.012 lr:0.0000100 epoch_Time:25441.0min: [2024-01-05 07:24:46,806][model8_pretrain.py][INFO] Epoch:[0/2](583300/4588595) loss:3.136 lr:0.0000100 epoch_Time:25441.0min: [2024-01-05 07:24:46,806][model8_pretrain.py][INFO] Epoch:[0/2](583300/4588595) loss:2.716 lr:0.0000100 epoch_Time:25441.0min: [2024-01-05 07:24:46,806][model8_pretrain.py][INFO] Epoch:[0/2](583300/4588595) loss:2.300 lr:0.0000100 epoch_Time:25441.0min: [2024-01-05 07:24:46,806][model8_pretrain.py][INFO] Epoch:[0/2](583300/4588595) loss:2.869 lr:0.0000100 epoch_Time:25441.0min: [2024-01-05 07:25:23,732][model8_pretrain.py][INFO] Epoch:[0/2](583400/4588595) loss:2.660 lr:0.0000100 
epoch_Time:25440.0min: [2024-01-05 07:25:23,732][model8_pretrain.py][INFO] Epoch:[0/2](583400/4588595) loss:2.888 lr:0.0000100 epoch_Time:25440.0min: [2024-01-05 07:25:23,732][model8_pretrain.py][INFO] Epoch:[0/2](583400/4588595) loss:2.858 lr:0.0000100 epoch_Time:25440.0min: [2024-01-05 07:25:23,732][model8_pretrain.py][INFO] Epoch:[0/2](583400/4588595) loss:3.310 lr:0.0000100 epoch_Time:25440.0min: [2024-01-05 07:25:23,732][model8_pretrain.py][INFO] Epoch:[0/2](583400/4588595) loss:2.736 lr:0.0000100 epoch_Time:25440.0min: [2024-01-05 07:25:23,732][model8_pretrain.py][INFO] Epoch:[0/2](583400/4588595) loss:2.792 lr:0.0000100 epoch_Time:25440.0min: [2024-01-05 07:25:23,732][model8_pretrain.py][INFO] Epoch:[0/2](583400/4588595) loss:2.764 lr:0.0000100 epoch_Time:25440.0min: [2024-01-05 07:25:23,732][model8_pretrain.py][INFO] Epoch:[0/2](583400/4588595) loss:3.065 lr:0.0000100 epoch_Time:25440.0min: [2024-01-05 07:26:00,677][model8_pretrain.py][INFO] Epoch:[0/2](583500/4588595) loss:3.263 lr:0.0000100 epoch_Time:25439.0min: [2024-01-05 07:26:00,677][model8_pretrain.py][INFO] Epoch:[0/2](583500/4588595) loss:2.917 lr:0.0000100 epoch_Time:25439.0min: [2024-01-05 07:26:00,677][model8_pretrain.py][INFO] Epoch:[0/2](583500/4588595) loss:3.075 lr:0.0000100 epoch_Time:25439.0min: [2024-01-05 07:26:00,677][model8_pretrain.py][INFO] Epoch:[0/2](583500/4588595) loss:2.766 lr:0.0000100 epoch_Time:25439.0min: [2024-01-05 07:26:00,677][model8_pretrain.py][INFO] Epoch:[0/2](583500/4588595) loss:3.294 lr:0.0000100 epoch_Time:25439.0min: [2024-01-05 07:26:00,677][model8_pretrain.py][INFO] Epoch:[0/2](583500/4588595) loss:2.857 lr:0.0000100 epoch_Time:25439.0min: [2024-01-05 07:26:00,677][model8_pretrain.py][INFO] Epoch:[0/2](583500/4588595) loss:2.789 lr:0.0000100 epoch_Time:25439.0min: [2024-01-05 07:26:00,677][model8_pretrain.py][INFO] Epoch:[0/2](583500/4588595) loss:2.760 lr:0.0000100 epoch_Time:25439.0min: [2024-01-05 07:26:37,628][model8_pretrain.py][INFO] 
Epoch:[0/2](583600/4588595) loss:2.325 lr:0.0000100 epoch_Time:25439.0min: [2024-01-05 07:26:37,628][model8_pretrain.py][INFO] Epoch:[0/2](583600/4588595) loss:2.759 lr:0.0000100 epoch_Time:25439.0min: [2024-01-05 07:26:37,628][model8_pretrain.py][INFO] Epoch:[0/2](583600/4588595) loss:3.072 lr:0.0000100 epoch_Time:25439.0min: [2024-01-05 07:26:37,628][model8_pretrain.py][INFO] Epoch:[0/2](583600/4588595) loss:2.917 lr:0.0000100 epoch_Time:25439.0min: [2024-01-05 07:26:37,628][model8_pretrain.py][INFO] Epoch:[0/2](583600/4588595) loss:2.635 lr:0.0000100 epoch_Time:25439.0min: [2024-01-05 07:26:37,628][model8_pretrain.py][INFO] Epoch:[0/2](583600/4588595) loss:2.931 lr:0.0000100 epoch_Time:25439.0min: [2024-01-05 07:26:37,628][model8_pretrain.py][INFO] Epoch:[0/2](583600/4588595) loss:2.341 lr:0.0000100 epoch_Time:25439.0min: [2024-01-05 07:26:37,628][model8_pretrain.py][INFO] Epoch:[0/2](583600/4588595) loss:3.005 lr:0.0000100 epoch_Time:25439.0min: [2024-01-05 07:27:14,571][model8_pretrain.py][INFO] Epoch:[0/2](583700/4588595) loss:3.393 lr:0.0000100 epoch_Time:25438.0min: [2024-01-05 07:27:14,572][model8_pretrain.py][INFO] Epoch:[0/2](583700/4588595) loss:2.677 lr:0.0000100 epoch_Time:25438.0min: [2024-01-05 07:27:14,572][model8_pretrain.py][INFO] Epoch:[0/2](583700/4588595) loss:2.788 lr:0.0000100 epoch_Time:25438.0min: [2024-01-05 07:27:14,572][model8_pretrain.py][INFO] Epoch:[0/2](583700/4588595) loss:3.075 lr:0.0000100 epoch_Time:25438.0min: [2024-01-05 07:27:14,572][model8_pretrain.py][INFO] Epoch:[0/2](583700/4588595) loss:3.107 lr:0.0000100 epoch_Time:25438.0min: [2024-01-05 07:27:14,572][model8_pretrain.py][INFO] Epoch:[0/2](583700/4588595) loss:2.916 lr:0.0000100 epoch_Time:25438.0min: [2024-01-05 07:27:14,572][model8_pretrain.py][INFO] Epoch:[0/2](583700/4588595) loss:2.913 lr:0.0000100 epoch_Time:25438.0min: [2024-01-05 07:27:14,572][model8_pretrain.py][INFO] Epoch:[0/2](583700/4588595) loss:3.185 lr:0.0000100 epoch_Time:25438.0min: [2024-01-05 
07:27:51,505][model8_pretrain.py][INFO] Epoch:[0/2](583800/4588595) loss:3.157 lr:0.0000100 epoch_Time:25436.0min: [2024-01-05 07:27:51,505][model8_pretrain.py][INFO] Epoch:[0/2](583800/4588595) loss:2.881 lr:0.0000100 epoch_Time:25436.0min: [2024-01-05 07:27:51,505][model8_pretrain.py][INFO] Epoch:[0/2](583800/4588595) loss:3.072 lr:0.0000100 epoch_Time:25436.0min: [2024-01-05 07:27:51,505][model8_pretrain.py][INFO] Epoch:[0/2](583800/4588595) loss:2.228 lr:0.0000100 epoch_Time:25436.0min: [2024-01-05 07:27:51,505][model8_pretrain.py][INFO] Epoch:[0/2](583800/4588595) loss:2.976 lr:0.0000100 epoch_Time:25436.0min: [2024-01-05 07:27:51,506][model8_pretrain.py][INFO] Epoch:[0/2](583800/4588595) loss:2.993 lr:0.0000100 epoch_Time:25436.0min: [2024-01-05 07:27:51,506][model8_pretrain.py][INFO] Epoch:[0/2](583800/4588595) loss:3.064 lr:0.0000100 epoch_Time:25436.0min: [2024-01-05 07:27:51,506][model8_pretrain.py][INFO] Epoch:[0/2](583800/4588595) loss:2.819 lr:0.0000100 epoch_Time:25436.0min: [2024-01-05 07:28:28,452][model8_pretrain.py][INFO] Epoch:[0/2](583900/4588595) loss:3.184 lr:0.0000100 epoch_Time:25436.0min: [2024-01-05 07:28:28,452][model8_pretrain.py][INFO] Epoch:[0/2](583900/4588595) loss:2.821 lr:0.0000100 epoch_Time:25436.0min: [2024-01-05 07:28:28,452][model8_pretrain.py][INFO] Epoch:[0/2](583900/4588595) loss:2.511 lr:0.0000100 epoch_Time:25436.0min: [2024-01-05 07:28:28,452][model8_pretrain.py][INFO] Epoch:[0/2](583900/4588595) loss:3.081 lr:0.0000100 epoch_Time:25436.0min: [2024-01-05 07:28:28,452][model8_pretrain.py][INFO] Epoch:[0/2](583900/4588595) loss:3.652 lr:0.0000100 epoch_Time:25436.0min: [2024-01-05 07:28:28,452][model8_pretrain.py][INFO] Epoch:[0/2](583900/4588595) loss:3.295 lr:0.0000100 epoch_Time:25436.0min: [2024-01-05 07:28:28,452][model8_pretrain.py][INFO] Epoch:[0/2](583900/4588595) loss:3.122 lr:0.0000100 epoch_Time:25436.0min: [2024-01-05 07:28:28,452][model8_pretrain.py][INFO] Epoch:[0/2](583900/4588595) loss:3.183 lr:0.0000100 
epoch_Time:25436.0min: [2024-01-05 07:29:05,394][model8_pretrain.py][INFO] Epoch:[0/2](584000/4588595) loss:2.433 lr:0.0000100 epoch_Time:25435.0min: [2024-01-05 07:29:05,394][model8_pretrain.py][INFO] Epoch:[0/2](584000/4588595) loss:3.101 lr:0.0000100 epoch_Time:25435.0min: [2024-01-05 07:29:05,394][model8_pretrain.py][INFO] Epoch:[0/2](584000/4588595) loss:2.501 lr:0.0000100 epoch_Time:25435.0min: [2024-01-05 07:29:05,395][model8_pretrain.py][INFO] Epoch:[0/2](584000/4588595) loss:3.076 lr:0.0000100 epoch_Time:25435.0min: [2024-01-05 07:29:05,395][model8_pretrain.py][INFO] Epoch:[0/2](584000/4588595) loss:2.287 lr:0.0000100 epoch_Time:25435.0min: [2024-01-05 07:29:05,395][model8_pretrain.py][INFO] Epoch:[0/2](584000/4588595) loss:3.463 lr:0.0000100 epoch_Time:25435.0min: [2024-01-05 07:29:05,395][model8_pretrain.py][INFO] Epoch:[0/2](584000/4588595) loss:3.386 lr:0.0000100 epoch_Time:25435.0min: [2024-01-05 07:29:05,395][model8_pretrain.py][INFO] Epoch:[0/2](584000/4588595) loss:3.021 lr:0.0000100 epoch_Time:25435.0min: [2024-01-05 07:29:52,884][model8_pretrain.py][INFO] Epoch:[0/2](584100/4588595) loss:2.502 lr:0.0000100 epoch_Time:25435.0min: [2024-01-05 07:29:52,884][model8_pretrain.py][INFO] Epoch:[0/2](584100/4588595) loss:3.047 lr:0.0000100 epoch_Time:25435.0min: [2024-01-05 07:29:52,885][model8_pretrain.py][INFO] Epoch:[0/2](584100/4588595) loss:3.057 lr:0.0000100 epoch_Time:25435.0min: [2024-01-05 07:29:52,885][model8_pretrain.py][INFO] Epoch:[0/2](584100/4588595) loss:2.843 lr:0.0000100 epoch_Time:25435.0min: [2024-01-05 07:29:52,885][model8_pretrain.py][INFO] Epoch:[0/2](584100/4588595) loss:3.090 lr:0.0000100 epoch_Time:25435.0min: [2024-01-05 07:29:52,886][model8_pretrain.py][INFO] Epoch:[0/2](584100/4588595) loss:1.941 lr:0.0000100 epoch_Time:25435.0min: [2024-01-05 07:29:52,886][model8_pretrain.py][INFO] Epoch:[0/2](584100/4588595) loss:3.029 lr:0.0000100 epoch_Time:25435.0min: [2024-01-05 07:29:52,886][model8_pretrain.py][INFO] 
Epoch:[0/2](584100/4588595) loss:3.420 lr:0.0000100 epoch_Time:25435.0min: [2024-01-05 07:30:29,814][model8_pretrain.py][INFO] Epoch:[0/2](584200/4588595) loss:2.986 lr:0.0000100 epoch_Time:25435.0min: [2024-01-05 07:30:29,814][model8_pretrain.py][INFO] Epoch:[0/2](584200/4588595) loss:2.877 lr:0.0000100 epoch_Time:25435.0min: [2024-01-05 07:30:29,814][model8_pretrain.py][INFO] Epoch:[0/2](584200/4588595) loss:3.108 lr:0.0000100 epoch_Time:25435.0min: [2024-01-05 07:30:29,814][model8_pretrain.py][INFO] Epoch:[0/2](584200/4588595) loss:2.565 lr:0.0000100 epoch_Time:25435.0min: [2024-01-05 07:30:29,814][model8_pretrain.py][INFO] Epoch:[0/2](584200/4588595) loss:2.640 lr:0.0000100 epoch_Time:25435.0min: [2024-01-05 07:30:29,814][model8_pretrain.py][INFO] Epoch:[0/2](584200/4588595) loss:3.054 lr:0.0000100 epoch_Time:25435.0min: [2024-01-05 07:30:29,814][model8_pretrain.py][INFO] Epoch:[0/2](584200/4588595) loss:2.867 lr:0.0000100 epoch_Time:25435.0min: [2024-01-05 07:30:29,814][model8_pretrain.py][INFO] Epoch:[0/2](584200/4588595) loss:2.949 lr:0.0000100 epoch_Time:25435.0min: [2024-01-05 07:31:06,752][model8_pretrain.py][INFO] Epoch:[0/2](584300/4588595) loss:2.732 lr:0.0000100 epoch_Time:25434.0min: [2024-01-05 07:31:06,752][model8_pretrain.py][INFO] Epoch:[0/2](584300/4588595) loss:2.667 lr:0.0000100 epoch_Time:25434.0min: [2024-01-05 07:31:06,752][model8_pretrain.py][INFO] Epoch:[0/2](584300/4588595) loss:2.826 lr:0.0000100 epoch_Time:25434.0min: [2024-01-05 07:31:06,752][model8_pretrain.py][INFO] Epoch:[0/2](584300/4588595) loss:2.795 lr:0.0000100 epoch_Time:25434.0min: [2024-01-05 07:31:06,752][model8_pretrain.py][INFO] Epoch:[0/2](584300/4588595) loss:3.067 lr:0.0000100 epoch_Time:25434.0min: [2024-01-05 07:31:06,752][model8_pretrain.py][INFO] Epoch:[0/2](584300/4588595) loss:2.790 lr:0.0000100 epoch_Time:25434.0min: [2024-01-05 07:31:06,752][model8_pretrain.py][INFO] Epoch:[0/2](584300/4588595) loss:2.736 lr:0.0000100 epoch_Time:25434.0min: [2024-01-05 
07:31:06,752][model8_pretrain.py][INFO] Epoch:[0/2](584300/4588595) loss:2.971 lr:0.0000100 epoch_Time:25434.0min: [2024-01-05 07:31:43,717][model8_pretrain.py][INFO] Epoch:[0/2](584400/4588595) loss:2.947 lr:0.0000100 epoch_Time:25434.0min: [2024-01-05 07:31:43,717][model8_pretrain.py][INFO] Epoch:[0/2](584400/4588595) loss:2.581 lr:0.0000100 epoch_Time:25434.0min: [2024-01-05 07:31:43,717][model8_pretrain.py][INFO] Epoch:[0/2](584400/4588595) loss:3.256 lr:0.0000100 epoch_Time:25434.0min: [2024-01-05 07:31:43,717][model8_pretrain.py][INFO] Epoch:[0/2](584400/4588595) loss:2.871 lr:0.0000100 epoch_Time:25434.0min: [2024-01-05 07:31:43,717][model8_pretrain.py][INFO] Epoch:[0/2](584400/4588595) loss:2.845 lr:0.0000100 epoch_Time:25434.0min: [2024-01-05 07:31:43,717][model8_pretrain.py][INFO] Epoch:[0/2](584400/4588595) loss:2.345 lr:0.0000100 epoch_Time:25434.0min: [2024-01-05 07:31:43,717][model8_pretrain.py][INFO] Epoch:[0/2](584400/4588595) loss:2.565 lr:0.0000100 epoch_Time:25434.0min: [2024-01-05 07:31:43,717][model8_pretrain.py][INFO] Epoch:[0/2](584400/4588595) loss:2.727 lr:0.0000100 epoch_Time:25434.0min: [2024-01-05 07:32:20,657][model8_pretrain.py][INFO] Epoch:[0/2](584500/4588595) loss:3.332 lr:0.0000100 epoch_Time:25433.0min: [2024-01-05 07:32:20,657][model8_pretrain.py][INFO] Epoch:[0/2](584500/4588595) loss:2.252 lr:0.0000100 epoch_Time:25433.0min: [2024-01-05 07:32:20,657][model8_pretrain.py][INFO] Epoch:[0/2](584500/4588595) loss:3.091 lr:0.0000100 epoch_Time:25433.0min: [2024-01-05 07:32:20,657][model8_pretrain.py][INFO] Epoch:[0/2](584500/4588595) loss:2.090 lr:0.0000100 epoch_Time:25433.0min: [2024-01-05 07:32:20,657][model8_pretrain.py][INFO] Epoch:[0/2](584500/4588595) loss:1.914 lr:0.0000100 epoch_Time:25433.0min: [2024-01-05 07:32:20,657][model8_pretrain.py][INFO] Epoch:[0/2](584500/4588595) loss:2.737 lr:0.0000100 epoch_Time:25433.0min: [2024-01-05 07:32:20,657][model8_pretrain.py][INFO] Epoch:[0/2](584500/4588595) loss:2.765 lr:0.0000100 
epoch_Time:25433.0min: [2024-01-05 07:32:20,658][model8_pretrain.py][INFO] Epoch:[0/2](584500/4588595) loss:2.952 lr:0.0000100 epoch_Time:25433.0min: [2024-01-05 07:32:57,609][model8_pretrain.py][INFO] Epoch:[0/2](584600/4588595) loss:2.974 lr:0.0000100 epoch_Time:25432.0min: [2024-01-05 07:32:57,609][model8_pretrain.py][INFO] Epoch:[0/2](584600/4588595) loss:2.507 lr:0.0000100 epoch_Time:25432.0min: [2024-01-05 07:32:57,609][model8_pretrain.py][INFO] Epoch:[0/2](584600/4588595) loss:2.915 lr:0.0000100 epoch_Time:25432.0min: [2024-01-05 07:32:57,609][model8_pretrain.py][INFO] Epoch:[0/2](584600/4588595) loss:2.439 lr:0.0000100 epoch_Time:25432.0min: [2024-01-05 07:32:57,609][model8_pretrain.py][INFO] Epoch:[0/2](584600/4588595) loss:2.719 lr:0.0000100 epoch_Time:25432.0min: [2024-01-05 07:32:57,609][model8_pretrain.py][INFO] Epoch:[0/2](584600/4588595) loss:3.018 lr:0.0000100 epoch_Time:25432.0min: [2024-01-05 07:32:57,609][model8_pretrain.py][INFO] Epoch:[0/2](584600/4588595) loss:2.925 lr:0.0000100 epoch_Time:25432.0min: [2024-01-05 07:32:57,609][model8_pretrain.py][INFO] Epoch:[0/2](584600/4588595) loss:2.565 lr:0.0000100 epoch_Time:25432.0min: [2024-01-05 07:33:34,548][model8_pretrain.py][INFO] Epoch:[0/2](584700/4588595) loss:2.676 lr:0.0000100 epoch_Time:25431.0min: [2024-01-05 07:33:34,548][model8_pretrain.py][INFO] Epoch:[0/2](584700/4588595) loss:2.961 lr:0.0000100 epoch_Time:25431.0min: [2024-01-05 07:33:34,548][model8_pretrain.py][INFO] Epoch:[0/2](584700/4588595) loss:2.380 lr:0.0000100 epoch_Time:25431.0min: [2024-01-05 07:33:34,548][model8_pretrain.py][INFO] Epoch:[0/2](584700/4588595) loss:2.603 lr:0.0000100 epoch_Time:25431.0min: [2024-01-05 07:33:34,548][model8_pretrain.py][INFO] Epoch:[0/2](584700/4588595) loss:2.762 lr:0.0000100 epoch_Time:25431.0min: [2024-01-05 07:33:34,548][model8_pretrain.py][INFO] Epoch:[0/2](584700/4588595) loss:3.148 lr:0.0000100 epoch_Time:25431.0min: [2024-01-05 07:33:34,548][model8_pretrain.py][INFO] 
Epoch:[0/2](584700/4588595) loss:2.920 lr:0.0000100 epoch_Time:25431.0min: [2024-01-05 07:33:34,548][model8_pretrain.py][INFO] Epoch:[0/2](584700/4588595) loss:2.937 lr:0.0000100 epoch_Time:25431.0min: [2024-01-05 07:34:11,498][model8_pretrain.py][INFO] Epoch:[0/2](584800/4588595) loss:2.770 lr:0.0000100 epoch_Time:25430.0min: [2024-01-05 07:34:11,498][model8_pretrain.py][INFO] Epoch:[0/2](584800/4588595) loss:2.701 lr:0.0000100 epoch_Time:25430.0min: [2024-01-05 07:34:11,498][model8_pretrain.py][INFO] Epoch:[0/2](584800/4588595) loss:2.505 lr:0.0000100 epoch_Time:25430.0min: [2024-01-05 07:34:11,498][model8_pretrain.py][INFO] Epoch:[0/2](584800/4588595) loss:2.397 lr:0.0000100 epoch_Time:25430.0min: [2024-01-05 07:34:11,498][model8_pretrain.py][INFO] Epoch:[0/2](584800/4588595) loss:2.589 lr:0.0000100 epoch_Time:25430.0min: [2024-01-05 07:34:11,498][model8_pretrain.py][INFO] Epoch:[0/2](584800/4588595) loss:2.830 lr:0.0000100 epoch_Time:25430.0min: [2024-01-05 07:34:11,498][model8_pretrain.py][INFO] Epoch:[0/2](584800/4588595) loss:3.142 lr:0.0000100 epoch_Time:25430.0min: [2024-01-05 07:34:11,498][model8_pretrain.py][INFO] Epoch:[0/2](584800/4588595) loss:2.718 lr:0.0000100 epoch_Time:25430.0min: [2024-01-05 07:34:58,884][model8_pretrain.py][INFO] Epoch:[0/2](584900/4588595) loss:2.878 lr:0.0000100 epoch_Time:25430.0min: [2024-01-05 07:34:58,883][model8_pretrain.py][INFO] Epoch:[0/2](584900/4588595) loss:3.254 lr:0.0000100 epoch_Time:25430.0min: [2024-01-05 07:34:58,884][model8_pretrain.py][INFO] Epoch:[0/2](584900/4588595) loss:3.119 lr:0.0000100 epoch_Time:25430.0min: [2024-01-05 07:34:58,884][model8_pretrain.py][INFO] Epoch:[0/2](584900/4588595) loss:2.577 lr:0.0000100 epoch_Time:25430.0min: [2024-01-05 07:34:58,884][model8_pretrain.py][INFO] Epoch:[0/2](584900/4588595) loss:3.174 lr:0.0000100 epoch_Time:25430.0min: [2024-01-05 07:34:58,884][model8_pretrain.py][INFO] Epoch:[0/2](584900/4588595) loss:3.154 lr:0.0000100 epoch_Time:25430.0min: [2024-01-05 
07:34:58,884][model8_pretrain.py][INFO] Epoch:[0/2](584900/4588595) loss:2.301 lr:0.0000100 epoch_Time:25430.0min: [2024-01-05 07:34:58,884][model8_pretrain.py][INFO] Epoch:[0/2](584900/4588595) loss:2.858 lr:0.0000100 epoch_Time:25430.0min: [2024-01-05 07:35:35,801][model8_pretrain.py][INFO] Epoch:[0/2](585000/4588595) loss:2.798 lr:0.0000100 epoch_Time:25430.0min: [2024-01-05 07:35:35,801][model8_pretrain.py][INFO] Epoch:[0/2](585000/4588595) loss:2.892 lr:0.0000100 epoch_Time:25430.0min: [2024-01-05 07:35:35,801][model8_pretrain.py][INFO] Epoch:[0/2](585000/4588595) loss:3.057 lr:0.0000100 epoch_Time:25430.0min: [2024-01-05 07:35:35,801][model8_pretrain.py][INFO] Epoch:[0/2](585000/4588595) loss:2.724 lr:0.0000100 epoch_Time:25430.0min: [2024-01-05 07:35:35,801][model8_pretrain.py][INFO] Epoch:[0/2](585000/4588595) loss:2.662 lr:0.0000100 epoch_Time:25430.0min: [2024-01-05 07:35:35,801][model8_pretrain.py][INFO] Epoch:[0/2](585000/4588595) loss:2.799 lr:0.0000100 epoch_Time:25430.0min: [2024-01-05 07:35:35,801][model8_pretrain.py][INFO] Epoch:[0/2](585000/4588595) loss:2.482 lr:0.0000100 epoch_Time:25430.0min: [2024-01-05 07:35:35,801][model8_pretrain.py][INFO] Epoch:[0/2](585000/4588595) loss:2.776 lr:0.0000100 epoch_Time:25430.0min: [2024-01-05 07:36:12,745][model8_pretrain.py][INFO] Epoch:[0/2](585100/4588595) loss:2.942 lr:0.0000100 epoch_Time:25429.0min: [2024-01-05 07:36:12,745][model8_pretrain.py][INFO] Epoch:[0/2](585100/4588595) loss:2.780 lr:0.0000100 epoch_Time:25429.0min: [2024-01-05 07:36:12,745][model8_pretrain.py][INFO] Epoch:[0/2](585100/4588595) loss:2.403 lr:0.0000100 epoch_Time:25429.0min: [2024-01-05 07:36:12,745][model8_pretrain.py][INFO] Epoch:[0/2](585100/4588595) loss:2.777 lr:0.0000100 epoch_Time:25429.0min: [2024-01-05 07:36:12,745][model8_pretrain.py][INFO] Epoch:[0/2](585100/4588595) loss:2.533 lr:0.0000100 epoch_Time:25429.0min: [2024-01-05 07:36:12,745][model8_pretrain.py][INFO] Epoch:[0/2](585100/4588595) loss:2.743 lr:0.0000100 
epoch_Time:25429.0min: [2024-01-05 07:36:12,745][model8_pretrain.py][INFO] Epoch:[0/2](585100/4588595) loss:2.918 lr:0.0000100 epoch_Time:25429.0min: [2024-01-05 07:36:12,745][model8_pretrain.py][INFO] Epoch:[0/2](585100/4588595) loss:2.920 lr:0.0000100 epoch_Time:25429.0min: [2024-01-05 07:36:49,685][model8_pretrain.py][INFO] Epoch:[0/2](585200/4588595) loss:2.915 lr:0.0000100 epoch_Time:25428.0min: [2024-01-05 07:36:49,685][model8_pretrain.py][INFO] Epoch:[0/2](585200/4588595) loss:2.589 lr:0.0000100 epoch_Time:25428.0min: [2024-01-05 07:36:49,685][model8_pretrain.py][INFO] Epoch:[0/2](585200/4588595) loss:2.594 lr:0.0000100 epoch_Time:25428.0min: [2024-01-05 07:36:49,685][model8_pretrain.py][INFO] Epoch:[0/2](585200/4588595) loss:3.008 lr:0.0000100 epoch_Time:25428.0min: [2024-01-05 07:36:49,685][model8_pretrain.py][INFO] Epoch:[0/2](585200/4588595) loss:2.837 lr:0.0000100 epoch_Time:25428.0min: [2024-01-05 07:36:49,685][model8_pretrain.py][INFO] Epoch:[0/2](585200/4588595) loss:3.046 lr:0.0000100 epoch_Time:25428.0min: [2024-01-05 07:36:49,685][model8_pretrain.py][INFO] Epoch:[0/2](585200/4588595) loss:2.834 lr:0.0000100 epoch_Time:25428.0min: [2024-01-05 07:36:49,685][model8_pretrain.py][INFO] Epoch:[0/2](585200/4588595) loss:3.073 lr:0.0000100 epoch_Time:25428.0min: [2024-01-05 07:37:26,683][model8_pretrain.py][INFO] Epoch:[0/2](585300/4588595) loss:2.828 lr:0.0000100 epoch_Time:25428.0min: [2024-01-05 07:37:26,683][model8_pretrain.py][INFO] Epoch:[0/2](585300/4588595) loss:2.610 lr:0.0000100 epoch_Time:25428.0min: [2024-01-05 07:37:26,683][model8_pretrain.py][INFO] Epoch:[0/2](585300/4588595) loss:2.656 lr:0.0000100 epoch_Time:25428.0min: [2024-01-05 07:37:26,683][model8_pretrain.py][INFO] Epoch:[0/2](585300/4588595) loss:2.759 lr:0.0000100 epoch_Time:25428.0min: [2024-01-05 07:37:26,683][model8_pretrain.py][INFO] Epoch:[0/2](585300/4588595) loss:3.084 lr:0.0000100 epoch_Time:25428.0min: [2024-01-05 07:37:26,683][model8_pretrain.py][INFO] 
Epoch:[0/2](585300/4588595) loss:3.068 lr:0.0000100 epoch_Time:25428.0min: [2024-01-05 07:37:26,683][model8_pretrain.py][INFO] Epoch:[0/2](585300/4588595) loss:3.186 lr:0.0000100 epoch_Time:25428.0min: [2024-01-05 07:37:26,683][model8_pretrain.py][INFO] Epoch:[0/2](585300/4588595) loss:2.926 lr:0.0000100 epoch_Time:25428.0min: [2024-01-05 07:38:03,635][model8_pretrain.py][INFO] Epoch:[0/2](585400/4588595) loss:2.584 lr:0.0000100 epoch_Time:25427.0min: [2024-01-05 07:38:03,635][model8_pretrain.py][INFO] Epoch:[0/2](585400/4588595) loss:3.032 lr:0.0000100 epoch_Time:25427.0min: [2024-01-05 07:38:03,635][model8_pretrain.py][INFO] Epoch:[0/2](585400/4588595) loss:2.905 lr:0.0000100 epoch_Time:25427.0min: [2024-01-05 07:38:03,635][model8_pretrain.py][INFO] Epoch:[0/2](585400/4588595) loss:3.390 lr:0.0000100 epoch_Time:25427.0min: [2024-01-05 07:38:03,635][model8_pretrain.py][INFO] Epoch:[0/2](585400/4588595) loss:2.678 lr:0.0000100 epoch_Time:25427.0min: [2024-01-05 07:38:03,635][model8_pretrain.py][INFO] Epoch:[0/2](585400/4588595) loss:2.486 lr:0.0000100 epoch_Time:25427.0min: [2024-01-05 07:38:03,635][model8_pretrain.py][INFO] Epoch:[0/2](585400/4588595) loss:2.665 lr:0.0000100 epoch_Time:25427.0min: [2024-01-05 07:38:03,636][model8_pretrain.py][INFO] Epoch:[0/2](585400/4588595) loss:2.790 lr:0.0000100 epoch_Time:25427.0min: [2024-01-05 07:38:40,602][model8_pretrain.py][INFO] Epoch:[0/2](585500/4588595) loss:2.824 lr:0.0000100 epoch_Time:25427.0min: [2024-01-05 07:38:40,602][model8_pretrain.py][INFO] Epoch:[0/2](585500/4588595) loss:2.875 lr:0.0000100 epoch_Time:25427.0min: [2024-01-05 07:38:40,602][model8_pretrain.py][INFO] Epoch:[0/2](585500/4588595) loss:3.303 lr:0.0000100 epoch_Time:25427.0min: [2024-01-05 07:38:40,602][model8_pretrain.py][INFO] Epoch:[0/2](585500/4588595) loss:3.178 lr:0.0000100 epoch_Time:25427.0min: [2024-01-05 07:38:40,602][model8_pretrain.py][INFO] Epoch:[0/2](585500/4588595) loss:2.756 lr:0.0000100 epoch_Time:25427.0min: [2024-01-05 
07:38:40,602][model8_pretrain.py][INFO] Epoch:[0/2](585500/4588595) loss:2.745 lr:0.0000100 epoch_Time:25427.0min: [2024-01-05 07:38:40,602][model8_pretrain.py][INFO] Epoch:[0/2](585500/4588595) loss:2.779 lr:0.0000100 epoch_Time:25427.0min: [2024-01-05 07:38:40,603][model8_pretrain.py][INFO] Epoch:[0/2](585500/4588595) loss:2.664 lr:0.0000100 epoch_Time:25427.0min: [2024-01-05 07:39:17,585][model8_pretrain.py][INFO] Epoch:[0/2](585600/4588595) loss:2.818 lr:0.0000100 epoch_Time:25425.0min: [2024-01-05 07:39:17,585][model8_pretrain.py][INFO] Epoch:[0/2](585600/4588595) loss:2.863 lr:0.0000100 epoch_Time:25425.0min: [2024-01-05 07:39:17,585][model8_pretrain.py][INFO] Epoch:[0/2](585600/4588595) loss:2.423 lr:0.0000100 epoch_Time:25425.0min: [2024-01-05 07:39:17,585][model8_pretrain.py][INFO] Epoch:[0/2](585600/4588595) loss:2.710 lr:0.0000100 epoch_Time:25425.0min: [2024-01-05 07:39:17,585][model8_pretrain.py][INFO] Epoch:[0/2](585600/4588595) loss:2.779 lr:0.0000100 epoch_Time:25425.0min: [2024-01-05 07:39:17,585][model8_pretrain.py][INFO] Epoch:[0/2](585600/4588595) loss:2.651 lr:0.0000100 epoch_Time:25425.0min: [2024-01-05 07:39:17,585][model8_pretrain.py][INFO] Epoch:[0/2](585600/4588595) loss:2.636 lr:0.0000100 epoch_Time:25425.0min: [2024-01-05 07:39:17,585][model8_pretrain.py][INFO] Epoch:[0/2](585600/4588595) loss:2.611 lr:0.0000100 epoch_Time:25425.0min: [2024-01-05 07:40:05,092][model8_pretrain.py][INFO] Epoch:[0/2](585700/4588595) loss:2.497 lr:0.0000100 epoch_Time:25426.0min: [2024-01-05 07:40:05,092][model8_pretrain.py][INFO] Epoch:[0/2](585700/4588595) loss:2.903 lr:0.0000100 epoch_Time:25426.0min: [2024-01-05 07:40:05,092][model8_pretrain.py][INFO] Epoch:[0/2](585700/4588595) loss:3.125 lr:0.0000100 epoch_Time:25426.0min: [2024-01-05 07:40:05,092][model8_pretrain.py][INFO] Epoch:[0/2](585700/4588595) loss:2.851 lr:0.0000100 epoch_Time:25426.0min: [2024-01-05 07:40:05,092][model8_pretrain.py][INFO] Epoch:[0/2](585700/4588595) loss:2.268 lr:0.0000100 
epoch_Time:25426.0min: [2024-01-05 07:40:05,092][model8_pretrain.py][INFO] Epoch:[0/2](585700/4588595) loss:2.788 lr:0.0000100 epoch_Time:25426.0min: [2024-01-05 07:40:05,092][model8_pretrain.py][INFO] Epoch:[0/2](585700/4588595) loss:2.627 lr:0.0000100 epoch_Time:25426.0min: [2024-01-05 07:40:05,093][model8_pretrain.py][INFO] Epoch:[0/2](585700/4588595) loss:2.690 lr:0.0000100 epoch_Time:25426.0min: [2024-01-05 07:40:41,993][model8_pretrain.py][INFO] Epoch:[0/2](585800/4588595) loss:3.283 lr:0.0000100 epoch_Time:25425.0min: [2024-01-05 07:40:41,993][model8_pretrain.py][INFO] Epoch:[0/2](585800/4588595) loss:3.165 lr:0.0000100 epoch_Time:25425.0min: [2024-01-05 07:40:41,993][model8_pretrain.py][INFO] Epoch:[0/2](585800/4588595) loss:2.885 lr:0.0000100 epoch_Time:25425.0min: [2024-01-05 07:40:41,993][model8_pretrain.py][INFO] Epoch:[0/2](585800/4588595) loss:2.649 lr:0.0000100 epoch_Time:25425.0min: [2024-01-05 07:40:41,993][model8_pretrain.py][INFO] Epoch:[0/2](585800/4588595) loss:3.144 lr:0.0000100 epoch_Time:25425.0min: [2024-01-05 07:40:41,993][model8_pretrain.py][INFO] Epoch:[0/2](585800/4588595) loss:2.999 lr:0.0000100 epoch_Time:25425.0min: [2024-01-05 07:40:41,994][model8_pretrain.py][INFO] Epoch:[0/2](585800/4588595) loss:3.070 lr:0.0000100 epoch_Time:25425.0min: [2024-01-05 07:40:41,994][model8_pretrain.py][INFO] Epoch:[0/2](585800/4588595) loss:2.716 lr:0.0000100 epoch_Time:25425.0min: [2024-01-05 07:41:18,926][model8_pretrain.py][INFO] Epoch:[0/2](585900/4588595) loss:2.984 lr:0.0000100 epoch_Time:25424.0min: [2024-01-05 07:41:18,926][model8_pretrain.py][INFO] Epoch:[0/2](585900/4588595) loss:2.725 lr:0.0000100 epoch_Time:25424.0min: [2024-01-05 07:41:18,926][model8_pretrain.py][INFO] Epoch:[0/2](585900/4588595) loss:2.863 lr:0.0000100 epoch_Time:25424.0min: [2024-01-05 07:41:18,926][model8_pretrain.py][INFO] Epoch:[0/2](585900/4588595) loss:3.395 lr:0.0000100 epoch_Time:25424.0min: [2024-01-05 07:41:18,926][model8_pretrain.py][INFO] 
Epoch:[0/2](585900/4588595) loss:3.131 lr:0.0000100 epoch_Time:25424.0min: [2024-01-05 07:41:18,926][model8_pretrain.py][INFO] Epoch:[0/2](585900/4588595) loss:2.539 lr:0.0000100 epoch_Time:25424.0min: [2024-01-05 07:41:18,926][model8_pretrain.py][INFO] Epoch:[0/2](585900/4588595) loss:3.131 lr:0.0000100 epoch_Time:25424.0min: [2024-01-05 07:41:18,926][model8_pretrain.py][INFO] Epoch:[0/2](585900/4588595) loss:3.313 lr:0.0000100 epoch_Time:25424.0min: [2024-01-05 07:41:55,859][model8_pretrain.py][INFO] Epoch:[0/2](586000/4588595) loss:2.966 lr:0.0000100 epoch_Time:25423.0min: [2024-01-05 07:41:55,859][model8_pretrain.py][INFO] Epoch:[0/2](586000/4588595) loss:2.435 lr:0.0000100 epoch_Time:25423.0min: [2024-01-05 07:41:55,860][model8_pretrain.py][INFO] Epoch:[0/2](586000/4588595) loss:3.300 lr:0.0000100 epoch_Time:25423.0min: [2024-01-05 07:41:55,860][model8_pretrain.py][INFO] Epoch:[0/2](586000/4588595) loss:2.418 lr:0.0000100 epoch_Time:25423.0min: [2024-01-05 07:41:55,860][model8_pretrain.py][INFO] Epoch:[0/2](586000/4588595) loss:3.066 lr:0.0000100 epoch_Time:25423.0min: [2024-01-05 07:41:55,860][model8_pretrain.py][INFO] Epoch:[0/2](586000/4588595) loss:3.164 lr:0.0000100 epoch_Time:25423.0min: [2024-01-05 07:41:55,860][model8_pretrain.py][INFO] Epoch:[0/2](586000/4588595) loss:2.621 lr:0.0000100 epoch_Time:25423.0min: [2024-01-05 07:41:55,860][model8_pretrain.py][INFO] Epoch:[0/2](586000/4588595) loss:3.040 lr:0.0000100 epoch_Time:25423.0min: [2024-01-05 07:42:32,797][model8_pretrain.py][INFO] Epoch:[0/2](586100/4588595) loss:2.376 lr:0.0000100 epoch_Time:25423.0min: [2024-01-05 07:42:32,797][model8_pretrain.py][INFO] Epoch:[0/2](586100/4588595) loss:2.697 lr:0.0000100 epoch_Time:25423.0min: [2024-01-05 07:42:32,797][model8_pretrain.py][INFO] Epoch:[0/2](586100/4588595) loss:2.385 lr:0.0000100 epoch_Time:25423.0min: [2024-01-05 07:42:32,797][model8_pretrain.py][INFO] Epoch:[0/2](586100/4588595) loss:3.127 lr:0.0000100 epoch_Time:25423.0min: [2024-01-05 
07:42:32,797][model8_pretrain.py][INFO] Epoch:[0/2](586100/4588595) loss:3.229 lr:0.0000100 epoch_Time:25423.0min: [2024-01-05 07:42:32,797][model8_pretrain.py][INFO] Epoch:[0/2](586100/4588595) loss:2.806 lr:0.0000100 epoch_Time:25423.0min: [2024-01-05 07:42:32,797][model8_pretrain.py][INFO] Epoch:[0/2](586100/4588595) loss:3.130 lr:0.0000100 epoch_Time:25423.0min: [2024-01-05 07:42:32,797][model8_pretrain.py][INFO] Epoch:[0/2](586100/4588595) loss:2.606 lr:0.0000100 epoch_Time:25423.0min: [2024-01-05 07:43:09,738][model8_pretrain.py][INFO] Epoch:[0/2](586200/4588595) loss:2.834 lr:0.0000100 epoch_Time:25422.0min: [2024-01-05 07:43:09,738][model8_pretrain.py][INFO] Epoch:[0/2](586200/4588595) loss:3.343 lr:0.0000100 epoch_Time:25422.0min: [2024-01-05 07:43:09,738][model8_pretrain.py][INFO] Epoch:[0/2](586200/4588595) loss:3.186 lr:0.0000100 epoch_Time:25422.0min: [2024-01-05 07:43:09,738][model8_pretrain.py][INFO] Epoch:[0/2](586200/4588595) loss:2.959 lr:0.0000100 epoch_Time:25422.0min: [2024-01-05 07:43:09,738][model8_pretrain.py][INFO] Epoch:[0/2](586200/4588595) loss:2.890 lr:0.0000100 epoch_Time:25422.0min: [2024-01-05 07:43:09,738][model8_pretrain.py][INFO] Epoch:[0/2](586200/4588595) loss:2.682 lr:0.0000100 epoch_Time:25422.0min: [2024-01-05 07:43:09,738][model8_pretrain.py][INFO] Epoch:[0/2](586200/4588595) loss:2.791 lr:0.0000100 epoch_Time:25422.0min: [2024-01-05 07:43:09,738][model8_pretrain.py][INFO] Epoch:[0/2](586200/4588595) loss:3.313 lr:0.0000100 epoch_Time:25422.0min: [2024-01-05 07:43:46,654][model8_pretrain.py][INFO] Epoch:[0/2](586300/4588595) loss:2.815 lr:0.0000100 epoch_Time:25422.0min: [2024-01-05 07:43:46,654][model8_pretrain.py][INFO] Epoch:[0/2](586300/4588595) loss:3.311 lr:0.0000100 epoch_Time:25422.0min: [2024-01-05 07:43:46,654][model8_pretrain.py][INFO] Epoch:[0/2](586300/4588595) loss:2.787 lr:0.0000100 epoch_Time:25422.0min: [2024-01-05 07:43:46,654][model8_pretrain.py][INFO] Epoch:[0/2](586300/4588595) loss:2.739 lr:0.0000100 
epoch_Time:25422.0min: [2024-01-05 07:43:46,654][model8_pretrain.py][INFO] Epoch:[0/2](586300/4588595) loss:3.237 lr:0.0000100 epoch_Time:25422.0min: [2024-01-05 07:43:46,654][model8_pretrain.py][INFO] Epoch:[0/2](586300/4588595) loss:2.730 lr:0.0000100 epoch_Time:25422.0min: [2024-01-05 07:43:46,654][model8_pretrain.py][INFO] Epoch:[0/2](586300/4588595) loss:2.806 lr:0.0000100 epoch_Time:25422.0min: [2024-01-05 07:43:46,654][model8_pretrain.py][INFO] Epoch:[0/2](586300/4588595) loss:3.280 lr:0.0000100 epoch_Time:25422.0min: [2024-01-05 07:44:23,590][model8_pretrain.py][INFO] Epoch:[0/2](586400/4588595) loss:2.694 lr:0.0000100 epoch_Time:25421.0min: [2024-01-05 07:44:23,590][model8_pretrain.py][INFO] Epoch:[0/2](586400/4588595) loss:2.908 lr:0.0000100 epoch_Time:25421.0min: [2024-01-05 07:44:23,590][model8_pretrain.py][INFO] Epoch:[0/2](586400/4588595) loss:2.954 lr:0.0000100 epoch_Time:25421.0min: [2024-01-05 07:44:23,590][model8_pretrain.py][INFO] Epoch:[0/2](586400/4588595) loss:3.250 lr:0.0000100 epoch_Time:25421.0min: [2024-01-05 07:44:23,590][model8_pretrain.py][INFO] Epoch:[0/2](586400/4588595) loss:2.778 lr:0.0000100 epoch_Time:25421.0min: [2024-01-05 07:44:23,590][model8_pretrain.py][INFO] Epoch:[0/2](586400/4588595) loss:2.901 lr:0.0000100 epoch_Time:25421.0min: [2024-01-05 07:44:23,590][model8_pretrain.py][INFO] Epoch:[0/2](586400/4588595) loss:2.709 lr:0.0000100 epoch_Time:25421.0min: [2024-01-05 07:44:23,590][model8_pretrain.py][INFO] Epoch:[0/2](586400/4588595) loss:3.027 lr:0.0000100 epoch_Time:25421.0min: [2024-01-05 07:45:12,261][model8_pretrain.py][INFO] Epoch:[0/2](586500/4588595) loss:2.761 lr:0.0000100 epoch_Time:25421.0min: [2024-01-05 07:45:12,261][model8_pretrain.py][INFO] Epoch:[0/2](586500/4588595) loss:2.970 lr:0.0000100 epoch_Time:25421.0min: [2024-01-05 07:45:12,261][model8_pretrain.py][INFO] Epoch:[0/2](586500/4588595) loss:3.320 lr:0.0000100 epoch_Time:25421.0min: [2024-01-05 07:45:12,261][model8_pretrain.py][INFO] 
Epoch:[0/2](586500/4588595) loss:2.378 lr:0.0000100 epoch_Time:25421.0min: [2024-01-05 07:45:12,261][model8_pretrain.py][INFO] Epoch:[0/2](586500/4588595) loss:3.145 lr:0.0000100 epoch_Time:25421.0min: [2024-01-05 07:45:12,261][model8_pretrain.py][INFO] Epoch:[0/2](586500/4588595) loss:3.050 lr:0.0000100 epoch_Time:25421.0min: [2024-01-05 07:45:12,262][model8_pretrain.py][INFO] Epoch:[0/2](586500/4588595) loss:2.765 lr:0.0000100 epoch_Time:25421.0min: [2024-01-05 07:45:12,262][model8_pretrain.py][INFO] Epoch:[0/2](586500/4588595) loss:2.633 lr:0.0000100 epoch_Time:25421.0min: [2024-01-05 07:45:49,192][model8_pretrain.py][INFO] Epoch:[0/2](586600/4588595) loss:2.992 lr:0.0000100 epoch_Time:25420.0min: [2024-01-05 07:45:49,192][model8_pretrain.py][INFO] Epoch:[0/2](586600/4588595) loss:3.136 lr:0.0000100 epoch_Time:25420.0min: [2024-01-05 07:45:49,192][model8_pretrain.py][INFO] Epoch:[0/2](586600/4588595) loss:2.730 lr:0.0000100 epoch_Time:25420.0min: [2024-01-05 07:45:49,192][model8_pretrain.py][INFO] Epoch:[0/2](586600/4588595) loss:2.748 lr:0.0000100 epoch_Time:25420.0min: [2024-01-05 07:45:49,192][model8_pretrain.py][INFO] Epoch:[0/2](586600/4588595) loss:2.444 lr:0.0000100 epoch_Time:25420.0min: [2024-01-05 07:45:49,192][model8_pretrain.py][INFO] Epoch:[0/2](586600/4588595) loss:2.621 lr:0.0000100 epoch_Time:25420.0min: [2024-01-05 07:45:49,192][model8_pretrain.py][INFO] Epoch:[0/2](586600/4588595) loss:3.073 lr:0.0000100 epoch_Time:25420.0min: [2024-01-05 07:45:49,192][model8_pretrain.py][INFO] Epoch:[0/2](586600/4588595) loss:3.025 lr:0.0000100 epoch_Time:25420.0min: [2024-01-05 07:46:26,140][model8_pretrain.py][INFO] Epoch:[0/2](586700/4588595) loss:2.630 lr:0.0000100 epoch_Time:25420.0min: [2024-01-05 07:46:26,140][model8_pretrain.py][INFO] Epoch:[0/2](586700/4588595) loss:2.767 lr:0.0000100 epoch_Time:25420.0min: [2024-01-05 07:46:26,140][model8_pretrain.py][INFO] Epoch:[0/2](586700/4588595) loss:3.092 lr:0.0000100 epoch_Time:25420.0min: [2024-01-05 
07:46:26,140][model8_pretrain.py][INFO] Epoch:[0/2](586700/4588595) loss:2.918 lr:0.0000100 epoch_Time:25420.0min: [2024-01-05 07:46:26,140][model8_pretrain.py][INFO] Epoch:[0/2](586700/4588595) loss:3.212 lr:0.0000100 epoch_Time:25420.0min: [2024-01-05 07:46:26,140][model8_pretrain.py][INFO] Epoch:[0/2](586700/4588595) loss:2.689 lr:0.0000100 epoch_Time:25420.0min: [2024-01-05 07:46:26,140][model8_pretrain.py][INFO] Epoch:[0/2](586700/4588595) loss:2.025 lr:0.0000100 epoch_Time:25420.0min: [2024-01-05 07:46:26,140][model8_pretrain.py][INFO] Epoch:[0/2](586700/4588595) loss:2.298 lr:0.0000100 epoch_Time:25420.0min: [2024-01-05 07:47:03,083][model8_pretrain.py][INFO] Epoch:[0/2](586800/4588595) loss:2.852 lr:0.0000100 epoch_Time:25418.0min: [2024-01-05 07:47:03,083][model8_pretrain.py][INFO] Epoch:[0/2](586800/4588595) loss:2.689 lr:0.0000100 epoch_Time:25418.0min: [2024-01-05 07:47:03,083][model8_pretrain.py][INFO] Epoch:[0/2](586800/4588595) loss:2.895 lr:0.0000100 epoch_Time:25418.0min: [2024-01-05 07:47:03,083][model8_pretrain.py][INFO] Epoch:[0/2](586800/4588595) loss:3.130 lr:0.0000100 epoch_Time:25418.0min: [2024-01-05 07:47:03,083][model8_pretrain.py][INFO] Epoch:[0/2](586800/4588595) loss:2.314 lr:0.0000100 epoch_Time:25418.0min: [2024-01-05 07:47:03,083][model8_pretrain.py][INFO] Epoch:[0/2](586800/4588595) loss:2.675 lr:0.0000100 epoch_Time:25418.0min: [2024-01-05 07:47:03,083][model8_pretrain.py][INFO] Epoch:[0/2](586800/4588595) loss:2.968 lr:0.0000100 epoch_Time:25418.0min: [2024-01-05 07:47:03,083][model8_pretrain.py][INFO] Epoch:[0/2](586800/4588595) loss:2.887 lr:0.0000100 epoch_Time:25418.0min: [2024-01-05 07:47:40,019][model8_pretrain.py][INFO] Epoch:[0/2](586900/4588595) loss:2.825 lr:0.0000100 epoch_Time:25418.0min: [2024-01-05 07:47:40,019][model8_pretrain.py][INFO] Epoch:[0/2](586900/4588595) loss:2.926 lr:0.0000100 epoch_Time:25418.0min: [2024-01-05 07:47:40,019][model8_pretrain.py][INFO] Epoch:[0/2](586900/4588595) loss:3.133 lr:0.0000100 
epoch_Time:25418.0min: [2024-01-05 07:47:40,019][model8_pretrain.py][INFO] Epoch:[0/2](586900/4588595) loss:2.232 lr:0.0000100 epoch_Time:25418.0min: [2024-01-05 07:47:40,019][model8_pretrain.py][INFO] Epoch:[0/2](586900/4588595) loss:2.816 lr:0.0000100 epoch_Time:25418.0min: [2024-01-05 07:47:40,019][model8_pretrain.py][INFO] Epoch:[0/2](586900/4588595) loss:2.757 lr:0.0000100 epoch_Time:25418.0min: [2024-01-05 07:47:40,019][model8_pretrain.py][INFO] Epoch:[0/2](586900/4588595) loss:2.886 lr:0.0000100 epoch_Time:25418.0min: [2024-01-05 07:47:40,019][model8_pretrain.py][INFO] Epoch:[0/2](586900/4588595) loss:3.387 lr:0.0000100 epoch_Time:25418.0min: [2024-01-05 07:48:16,968][model8_pretrain.py][INFO] Epoch:[0/2](587000/4588595) loss:2.725 lr:0.0000100 epoch_Time:25417.0min: [2024-01-05 07:48:16,968][model8_pretrain.py][INFO] Epoch:[0/2](587000/4588595) loss:2.776 lr:0.0000100 epoch_Time:25417.0min: [2024-01-05 07:48:16,968][model8_pretrain.py][INFO] Epoch:[0/2](587000/4588595) loss:2.918 lr:0.0000100 epoch_Time:25417.0min: [2024-01-05 07:48:16,968][model8_pretrain.py][INFO] Epoch:[0/2](587000/4588595) loss:2.897 lr:0.0000100 epoch_Time:25417.0min: [2024-01-05 07:48:16,968][model8_pretrain.py][INFO] Epoch:[0/2](587000/4588595) loss:2.740 lr:0.0000100 epoch_Time:25417.0min: [2024-01-05 07:48:16,968][model8_pretrain.py][INFO] Epoch:[0/2](587000/4588595) loss:2.612 lr:0.0000100 epoch_Time:25417.0min: [2024-01-05 07:48:16,968][model8_pretrain.py][INFO] Epoch:[0/2](587000/4588595) loss:2.558 lr:0.0000100 epoch_Time:25417.0min: [2024-01-05 07:48:16,968][model8_pretrain.py][INFO] Epoch:[0/2](587000/4588595) loss:3.181 lr:0.0000100 epoch_Time:25417.0min: [2024-01-05 07:48:53,900][model8_pretrain.py][INFO] Epoch:[0/2](587100/4588595) loss:2.403 lr:0.0000100 epoch_Time:25416.0min: [2024-01-05 07:48:53,900][model8_pretrain.py][INFO] Epoch:[0/2](587100/4588595) loss:2.435 lr:0.0000100 epoch_Time:25416.0min: [2024-01-05 07:48:53,900][model8_pretrain.py][INFO] 
Epoch:[0/2](587100/4588595) loss:2.813 lr:0.0000100 epoch_Time:25416.0min: [2024-01-05 07:48:53,900][model8_pretrain.py][INFO] Epoch:[0/2](587100/4588595) loss:3.144 lr:0.0000100 epoch_Time:25416.0min: [2024-01-05 07:48:53,900][model8_pretrain.py][INFO] Epoch:[0/2](587100/4588595) loss:3.182 lr:0.0000100 epoch_Time:25416.0min: [2024-01-05 07:48:53,901][model8_pretrain.py][INFO] Epoch:[0/2](587100/4588595) loss:2.984 lr:0.0000100 epoch_Time:25416.0min: [2024-01-05 07:48:53,901][model8_pretrain.py][INFO] Epoch:[0/2](587100/4588595) loss:3.307 lr:0.0000100 epoch_Time:25416.0min: [2024-01-05 07:48:53,901][model8_pretrain.py][INFO] Epoch:[0/2](587100/4588595) loss:2.621 lr:0.0000100 epoch_Time:25416.0min: [2024-01-05 07:49:30,823][model8_pretrain.py][INFO] Epoch:[0/2](587200/4588595) loss:2.551 lr:0.0000100 epoch_Time:25416.0min: [2024-01-05 07:49:30,823][model8_pretrain.py][INFO] Epoch:[0/2](587200/4588595) loss:2.481 lr:0.0000100 epoch_Time:25416.0min: [2024-01-05 07:49:30,823][model8_pretrain.py][INFO] Epoch:[0/2](587200/4588595) loss:3.004 lr:0.0000100 epoch_Time:25416.0min: [2024-01-05 07:49:30,823][model8_pretrain.py][INFO] Epoch:[0/2](587200/4588595) loss:2.597 lr:0.0000100 epoch_Time:25416.0min: [2024-01-05 07:49:30,823][model8_pretrain.py][INFO] Epoch:[0/2](587200/4588595) loss:3.298 lr:0.0000100 epoch_Time:25416.0min: [2024-01-05 07:49:30,823][model8_pretrain.py][INFO] Epoch:[0/2](587200/4588595) loss:2.855 lr:0.0000100 epoch_Time:25416.0min: [2024-01-05 07:49:30,823][model8_pretrain.py][INFO] Epoch:[0/2](587200/4588595) loss:2.498 lr:0.0000100 epoch_Time:25416.0min: [2024-01-05 07:49:30,824][model8_pretrain.py][INFO] Epoch:[0/2](587200/4588595) loss:2.687 lr:0.0000100 epoch_Time:25416.0min: [2024-01-05 07:50:19,466][model8_pretrain.py][INFO] Epoch:[0/2](587300/4588595) loss:2.845 lr:0.0000100 epoch_Time:25416.0min: [2024-01-05 07:50:19,466][model8_pretrain.py][INFO] Epoch:[0/2](587300/4588595) loss:2.960 lr:0.0000100 epoch_Time:25416.0min: [2024-01-05 
07:50:19,466][model8_pretrain.py][INFO] Epoch:[0/2](587300/4588595) loss:3.067 lr:0.0000100 epoch_Time:25416.0min: [2024-01-05 07:50:19,466][model8_pretrain.py][INFO] Epoch:[0/2](587300/4588595) loss:3.122 lr:0.0000100 epoch_Time:25416.0min: [2024-01-05 07:50:19,466][model8_pretrain.py][INFO] Epoch:[0/2](587300/4588595) loss:2.600 lr:0.0000100 epoch_Time:25416.0min: [2024-01-05 07:50:19,466][model8_pretrain.py][INFO] Epoch:[0/2](587300/4588595) loss:2.608 lr:0.0000100 epoch_Time:25416.0min: [2024-01-05 07:50:19,466][model8_pretrain.py][INFO] Epoch:[0/2](587300/4588595) loss:2.888 lr:0.0000100 epoch_Time:25416.0min: [2024-01-05 07:50:19,466][model8_pretrain.py][INFO] Epoch:[0/2](587300/4588595) loss:3.044 lr:0.0000100 epoch_Time:25416.0min: [2024-01-05 07:50:56,379][model8_pretrain.py][INFO] Epoch:[0/2](587400/4588595) loss:3.046 lr:0.0000100 epoch_Time:25415.0min: [2024-01-05 07:50:56,379][model8_pretrain.py][INFO] Epoch:[0/2](587400/4588595) loss:2.423 lr:0.0000100 epoch_Time:25415.0min: [2024-01-05 07:50:56,379][model8_pretrain.py][INFO] Epoch:[0/2](587400/4588595) loss:2.709 lr:0.0000100 epoch_Time:25415.0min: [2024-01-05 07:50:56,379][model8_pretrain.py][INFO] Epoch:[0/2](587400/4588595) loss:2.450 lr:0.0000100 epoch_Time:25415.0min: [2024-01-05 07:50:56,379][model8_pretrain.py][INFO] Epoch:[0/2](587400/4588595) loss:2.953 lr:0.0000100 epoch_Time:25415.0min: [2024-01-05 07:50:56,380][model8_pretrain.py][INFO] Epoch:[0/2](587400/4588595) loss:3.308 lr:0.0000100 epoch_Time:25415.0min: [2024-01-05 07:50:56,380][model8_pretrain.py][INFO] Epoch:[0/2](587400/4588595) loss:2.317 lr:0.0000100 epoch_Time:25415.0min: [2024-01-05 07:50:56,380][model8_pretrain.py][INFO] Epoch:[0/2](587400/4588595) loss:3.132 lr:0.0000100 epoch_Time:25415.0min: [2024-01-05 07:51:33,329][model8_pretrain.py][INFO] Epoch:[0/2](587500/4588595) loss:3.154 lr:0.0000100 epoch_Time:25415.0min: [2024-01-05 07:51:33,329][model8_pretrain.py][INFO] Epoch:[0/2](587500/4588595) loss:3.156 lr:0.0000100 
epoch_Time:25415.0min: [2024-01-05 07:51:33,329][model8_pretrain.py][INFO] Epoch:[0/2](587500/4588595) loss:3.365 lr:0.0000100 epoch_Time:25415.0min: [2024-01-05 07:51:33,329][model8_pretrain.py][INFO] Epoch:[0/2](587500/4588595) loss:2.994 lr:0.0000100 epoch_Time:25415.0min: [2024-01-05 07:51:33,329][model8_pretrain.py][INFO] Epoch:[0/2](587500/4588595) loss:2.929 lr:0.0000100 epoch_Time:25415.0min: [2024-01-05 07:51:33,329][model8_pretrain.py][INFO] Epoch:[0/2](587500/4588595) loss:2.757 lr:0.0000100 epoch_Time:25415.0min: [2024-01-05 07:51:33,329][model8_pretrain.py][INFO] Epoch:[0/2](587500/4588595) loss:2.163 lr:0.0000100 epoch_Time:25415.0min: [2024-01-05 07:51:33,330][model8_pretrain.py][INFO] Epoch:[0/2](587500/4588595) loss:3.218 lr:0.0000100 epoch_Time:25415.0min: [2024-01-05 07:52:10,276][model8_pretrain.py][INFO] Epoch:[0/2](587600/4588595) loss:2.281 lr:0.0000100 epoch_Time:25414.0min: [2024-01-05 07:52:10,276][model8_pretrain.py][INFO] Epoch:[0/2](587600/4588595) loss:2.591 lr:0.0000100 epoch_Time:25414.0min: [2024-01-05 07:52:10,276][model8_pretrain.py][INFO] Epoch:[0/2](587600/4588595) loss:3.147 lr:0.0000100 epoch_Time:25414.0min: [2024-01-05 07:52:10,276][model8_pretrain.py][INFO] Epoch:[0/2](587600/4588595) loss:2.824 lr:0.0000100 epoch_Time:25414.0min: [2024-01-05 07:52:10,276][model8_pretrain.py][INFO] Epoch:[0/2](587600/4588595) loss:2.790 lr:0.0000100 epoch_Time:25414.0min: [2024-01-05 07:52:10,276][model8_pretrain.py][INFO] Epoch:[0/2](587600/4588595) loss:2.812 lr:0.0000100 epoch_Time:25414.0min: [2024-01-05 07:52:10,276][model8_pretrain.py][INFO] Epoch:[0/2](587600/4588595) loss:2.379 lr:0.0000100 epoch_Time:25414.0min: [2024-01-05 07:52:10,276][model8_pretrain.py][INFO] Epoch:[0/2](587600/4588595) loss:2.755 lr:0.0000100 epoch_Time:25414.0min: [2024-01-05 07:52:47,227][model8_pretrain.py][INFO] Epoch:[0/2](587700/4588595) loss:2.874 lr:0.0000100 epoch_Time:25414.0min: [2024-01-05 07:52:47,227][model8_pretrain.py][INFO] 
Epoch:[0/2](587700/4588595) loss:3.156 lr:0.0000100 epoch_Time:25414.0min: [2024-01-05 07:52:47,227][model8_pretrain.py][INFO] Epoch:[0/2](587700/4588595) loss:2.834 lr:0.0000100 epoch_Time:25414.0min: [2024-01-05 07:52:47,227][model8_pretrain.py][INFO] Epoch:[0/2](587700/4588595) loss:2.575 lr:0.0000100 epoch_Time:25414.0min: [2024-01-05 07:52:47,227][model8_pretrain.py][INFO] Epoch:[0/2](587700/4588595) loss:2.487 lr:0.0000100 epoch_Time:25414.0min: [2024-01-05 07:52:47,227][model8_pretrain.py][INFO] Epoch:[0/2](587700/4588595) loss:2.381 lr:0.0000100 epoch_Time:25414.0min: [2024-01-05 07:52:47,227][model8_pretrain.py][INFO] Epoch:[0/2](587700/4588595) loss:3.254 lr:0.0000100 epoch_Time:25414.0min: [2024-01-05 07:52:47,228][model8_pretrain.py][INFO] Epoch:[0/2](587700/4588595) loss:2.995 lr:0.0000100 epoch_Time:25414.0min: [2024-01-05 07:53:24,162][model8_pretrain.py][INFO] Epoch:[0/2](587800/4588595) loss:2.790 lr:0.0000100 epoch_Time:25412.0min: [2024-01-05 07:53:24,162][model8_pretrain.py][INFO] Epoch:[0/2](587800/4588595) loss:2.871 lr:0.0000100 epoch_Time:25412.0min: [2024-01-05 07:53:24,162][model8_pretrain.py][INFO] Epoch:[0/2](587800/4588595) loss:3.041 lr:0.0000100 epoch_Time:25412.0min: [2024-01-05 07:53:24,162][model8_pretrain.py][INFO] Epoch:[0/2](587800/4588595) loss:2.979 lr:0.0000100 epoch_Time:25412.0min: [2024-01-05 07:53:24,162][model8_pretrain.py][INFO] Epoch:[0/2](587800/4588595) loss:2.990 lr:0.0000100 epoch_Time:25412.0min: [2024-01-05 07:53:24,162][model8_pretrain.py][INFO] Epoch:[0/2](587800/4588595) loss:3.003 lr:0.0000100 epoch_Time:25412.0min: [2024-01-05 07:53:24,162][model8_pretrain.py][INFO] Epoch:[0/2](587800/4588595) loss:2.821 lr:0.0000100 epoch_Time:25412.0min: [2024-01-05 07:53:24,162][model8_pretrain.py][INFO] Epoch:[0/2](587800/4588595) loss:2.926 lr:0.0000100 epoch_Time:25412.0min: [2024-01-05 07:54:01,091][model8_pretrain.py][INFO] Epoch:[0/2](587900/4588595) loss:2.389 lr:0.0000100 epoch_Time:25411.0min: [2024-01-05 
07:54:01,091][model8_pretrain.py][INFO] Epoch:[0/2](587900/4588595) loss:2.608 lr:0.0000100 epoch_Time:25411.0min: [2024-01-05 07:54:01,091][model8_pretrain.py][INFO] Epoch:[0/2](587900/4588595) loss:2.337 lr:0.0000100 epoch_Time:25411.0min: [2024-01-05 07:54:01,091][model8_pretrain.py][INFO] Epoch:[0/2](587900/4588595) loss:3.136 lr:0.0000100 epoch_Time:25411.0min: [2024-01-05 07:54:01,091][model8_pretrain.py][INFO] Epoch:[0/2](587900/4588595) loss:2.774 lr:0.0000100 epoch_Time:25411.0min: [2024-01-05 07:54:01,091][model8_pretrain.py][INFO] Epoch:[0/2](587900/4588595) loss:3.166 lr:0.0000100 epoch_Time:25411.0min: [2024-01-05 07:54:01,091][model8_pretrain.py][INFO] Epoch:[0/2](587900/4588595) loss:3.118 lr:0.0000100 epoch_Time:25411.0min: [2024-01-05 07:54:01,091][model8_pretrain.py][INFO] Epoch:[0/2](587900/4588595) loss:2.340 lr:0.0000100 epoch_Time:25411.0min: [2024-01-05 07:54:38,039][model8_pretrain.py][INFO] Epoch:[0/2](588000/4588595) loss:2.681 lr:0.0000100 epoch_Time:25411.0min: [2024-01-05 07:54:38,039][model8_pretrain.py][INFO] Epoch:[0/2](588000/4588595) loss:3.320 lr:0.0000100 epoch_Time:25411.0min: [2024-01-05 07:54:38,039][model8_pretrain.py][INFO] Epoch:[0/2](588000/4588595) loss:2.846 lr:0.0000100 epoch_Time:25411.0min: [2024-01-05 07:54:38,039][model8_pretrain.py][INFO] Epoch:[0/2](588000/4588595) loss:3.050 lr:0.0000100 epoch_Time:25411.0min: [2024-01-05 07:54:38,039][model8_pretrain.py][INFO] Epoch:[0/2](588000/4588595) loss:2.414 lr:0.0000100 epoch_Time:25411.0min: [2024-01-05 07:54:38,039][model8_pretrain.py][INFO] Epoch:[0/2](588000/4588595) loss:2.940 lr:0.0000100 epoch_Time:25411.0min: [2024-01-05 07:54:38,039][model8_pretrain.py][INFO] Epoch:[0/2](588000/4588595) loss:1.920 lr:0.0000100 epoch_Time:25411.0min: [2024-01-05 07:54:38,039][model8_pretrain.py][INFO] Epoch:[0/2](588000/4588595) loss:2.755 lr:0.0000100 epoch_Time:25411.0min: [2024-01-05 07:55:26,664][model8_pretrain.py][INFO] Epoch:[0/2](588100/4588595) loss:3.100 lr:0.0000100 
epoch_Time:25412.0min: [2024-01-05 07:55:26,664][model8_pretrain.py][INFO] Epoch:[0/2](588100/4588595) loss:2.646 lr:0.0000100 epoch_Time:25412.0min: [2024-01-05 07:55:26,664][model8_pretrain.py][INFO] Epoch:[0/2](588100/4588595) loss:2.820 lr:0.0000100 epoch_Time:25412.0min: [2024-01-05 07:55:26,664][model8_pretrain.py][INFO] Epoch:[0/2](588100/4588595) loss:3.106 lr:0.0000100 epoch_Time:25412.0min: [2024-01-05 07:55:26,664][model8_pretrain.py][INFO] Epoch:[0/2](588100/4588595) loss:3.064 lr:0.0000100 epoch_Time:25412.0min: [2024-01-05 07:55:26,664][model8_pretrain.py][INFO] Epoch:[0/2](588100/4588595) loss:2.455 lr:0.0000100 epoch_Time:25412.0min: [2024-01-05 07:55:26,664][model8_pretrain.py][INFO] Epoch:[0/2](588100/4588595) loss:2.824 lr:0.0000100 epoch_Time:25412.0min: [2024-01-05 07:55:26,664][model8_pretrain.py][INFO] Epoch:[0/2](588100/4588595) loss:3.244 lr:0.0000100 epoch_Time:25412.0min: [2024-01-05 07:56:03,585][model8_pretrain.py][INFO] Epoch:[0/2](588200/4588595) loss:2.830 lr:0.0000100 epoch_Time:25410.0min: [2024-01-05 07:56:03,585][model8_pretrain.py][INFO] Epoch:[0/2](588200/4588595) loss:2.986 lr:0.0000100 epoch_Time:25410.0min: [2024-01-05 07:56:03,585][model8_pretrain.py][INFO] Epoch:[0/2](588200/4588595) loss:3.008 lr:0.0000100 epoch_Time:25410.0min: [2024-01-05 07:56:03,585][model8_pretrain.py][INFO] Epoch:[0/2](588200/4588595) loss:2.576 lr:0.0000100 epoch_Time:25410.0min: [2024-01-05 07:56:03,585][model8_pretrain.py][INFO] Epoch:[0/2](588200/4588595) loss:2.544 lr:0.0000100 epoch_Time:25410.0min: [2024-01-05 07:56:03,585][model8_pretrain.py][INFO] Epoch:[0/2](588200/4588595) loss:2.569 lr:0.0000100 epoch_Time:25410.0min: [2024-01-05 07:56:03,585][model8_pretrain.py][INFO] Epoch:[0/2](588200/4588595) loss:2.638 lr:0.0000100 epoch_Time:25410.0min: [2024-01-05 07:56:03,586][model8_pretrain.py][INFO] Epoch:[0/2](588200/4588595) loss:2.102 lr:0.0000100 epoch_Time:25410.0min: [2024-01-05 07:56:40,519][model8_pretrain.py][INFO] 
Epoch:[0/2](588300/4588595) loss:3.085 lr:0.0000100 epoch_Time:25410.0min: [2024-01-05 07:56:40,519][model8_pretrain.py][INFO] Epoch:[0/2](588300/4588595) loss:2.675 lr:0.0000100 epoch_Time:25410.0min: [2024-01-05 07:56:40,519][model8_pretrain.py][INFO] Epoch:[0/2](588300/4588595) loss:2.605 lr:0.0000100 epoch_Time:25410.0min: [2024-01-05 07:56:40,519][model8_pretrain.py][INFO] Epoch:[0/2](588300/4588595) loss:2.910 lr:0.0000100 epoch_Time:25410.0min: [2024-01-05 07:56:40,519][model8_pretrain.py][INFO] Epoch:[0/2](588300/4588595) loss:3.147 lr:0.0000100 epoch_Time:25410.0min: [2024-01-05 07:56:40,519][model8_pretrain.py][INFO] Epoch:[0/2](588300/4588595) loss:2.955 lr:0.0000100 epoch_Time:25410.0min: [2024-01-05 07:56:40,520][model8_pretrain.py][INFO] Epoch:[0/2](588300/4588595) loss:3.333 lr:0.0000100 epoch_Time:25410.0min: [2024-01-05 07:56:40,520][model8_pretrain.py][INFO] Epoch:[0/2](588300/4588595) loss:2.152 lr:0.0000100 epoch_Time:25410.0min: [2024-01-05 07:57:17,462][model8_pretrain.py][INFO] Epoch:[0/2](588400/4588595) loss:3.029 lr:0.0000100 epoch_Time:25409.0min: [2024-01-05 07:57:17,462][model8_pretrain.py][INFO] Epoch:[0/2](588400/4588595) loss:2.696 lr:0.0000100 epoch_Time:25409.0min: [2024-01-05 07:57:17,462][model8_pretrain.py][INFO] Epoch:[0/2](588400/4588595) loss:3.280 lr:0.0000100 epoch_Time:25409.0min: [2024-01-05 07:57:17,462][model8_pretrain.py][INFO] Epoch:[0/2](588400/4588595) loss:2.667 lr:0.0000100 epoch_Time:25409.0min: [2024-01-05 07:57:17,462][model8_pretrain.py][INFO] Epoch:[0/2](588400/4588595) loss:2.832 lr:0.0000100 epoch_Time:25409.0min: [2024-01-05 07:57:17,462][model8_pretrain.py][INFO] Epoch:[0/2](588400/4588595) loss:2.272 lr:0.0000100 epoch_Time:25409.0min: [2024-01-05 07:57:17,462][model8_pretrain.py][INFO] Epoch:[0/2](588400/4588595) loss:2.864 lr:0.0000100 epoch_Time:25409.0min: [2024-01-05 07:57:17,462][model8_pretrain.py][INFO] Epoch:[0/2](588400/4588595) loss:2.963 lr:0.0000100 epoch_Time:25409.0min: [2024-01-05 
07:57:54,398][model8_pretrain.py][INFO] Epoch:[0/2](588500/4588595) loss:2.861 lr:0.0000100 epoch_Time:25408.0min: [2024-01-05 07:57:54,398][model8_pretrain.py][INFO] Epoch:[0/2](588500/4588595) loss:2.392 lr:0.0000100 epoch_Time:25408.0min: [2024-01-05 07:57:54,398][model8_pretrain.py][INFO] Epoch:[0/2](588500/4588595) loss:2.199 lr:0.0000100 epoch_Time:25408.0min: [2024-01-05 07:57:54,398][model8_pretrain.py][INFO] Epoch:[0/2](588500/4588595) loss:2.272 lr:0.0000100 epoch_Time:25408.0min: [2024-01-05 07:57:54,398][model8_pretrain.py][INFO] Epoch:[0/2](588500/4588595) loss:2.458 lr:0.0000100 epoch_Time:25408.0min: [2024-01-05 07:57:54,398][model8_pretrain.py][INFO] Epoch:[0/2](588500/4588595) loss:2.508 lr:0.0000100 epoch_Time:25408.0min: [2024-01-05 07:57:54,398][model8_pretrain.py][INFO] Epoch:[0/2](588500/4588595) loss:2.869 lr:0.0000100 epoch_Time:25408.0min: [2024-01-05 07:57:54,398][model8_pretrain.py][INFO] Epoch:[0/2](588500/4588595) loss:2.888 lr:0.0000100 epoch_Time:25408.0min: [2024-01-05 07:58:31,340][model8_pretrain.py][INFO] Epoch:[0/2](588600/4588595) loss:3.124 lr:0.0000100 epoch_Time:25408.0min: [2024-01-05 07:58:31,340][model8_pretrain.py][INFO] Epoch:[0/2](588600/4588595) loss:2.493 lr:0.0000100 epoch_Time:25408.0min: [2024-01-05 07:58:31,340][model8_pretrain.py][INFO] Epoch:[0/2](588600/4588595) loss:3.232 lr:0.0000100 epoch_Time:25408.0min: [2024-01-05 07:58:31,340][model8_pretrain.py][INFO] Epoch:[0/2](588600/4588595) loss:2.698 lr:0.0000100 epoch_Time:25408.0min: [2024-01-05 07:58:31,341][model8_pretrain.py][INFO] Epoch:[0/2](588600/4588595) loss:2.446 lr:0.0000100 epoch_Time:25408.0min: [2024-01-05 07:58:31,341][model8_pretrain.py][INFO] Epoch:[0/2](588600/4588595) loss:2.549 lr:0.0000100 epoch_Time:25408.0min: [2024-01-05 07:58:31,341][model8_pretrain.py][INFO] Epoch:[0/2](588600/4588595) loss:2.354 lr:0.0000100 epoch_Time:25408.0min: [2024-01-05 07:58:31,341][model8_pretrain.py][INFO] Epoch:[0/2](588600/4588595) loss:2.910 lr:0.0000100 
epoch_Time:25408.0min: [2024-01-05 07:59:08,260][model8_pretrain.py][INFO] Epoch:[0/2](588700/4588595) loss:2.714 lr:0.0000100 epoch_Time:25407.0min: [2024-01-05 07:59:08,260][model8_pretrain.py][INFO] Epoch:[0/2](588700/4588595) loss:2.642 lr:0.0000100 epoch_Time:25407.0min: [2024-01-05 07:59:08,260][model8_pretrain.py][INFO] Epoch:[0/2](588700/4588595) loss:2.869 lr:0.0000100 epoch_Time:25407.0min: [2024-01-05 07:59:08,260][model8_pretrain.py][INFO] Epoch:[0/2](588700/4588595) loss:2.782 lr:0.0000100 epoch_Time:25407.0min: [2024-01-05 07:59:08,260][model8_pretrain.py][INFO] Epoch:[0/2](588700/4588595) loss:2.411 lr:0.0000100 epoch_Time:25407.0min: [2024-01-05 07:59:08,260][model8_pretrain.py][INFO] Epoch:[0/2](588700/4588595) loss:2.765 lr:0.0000100 epoch_Time:25407.0min: [2024-01-05 07:59:08,261][model8_pretrain.py][INFO] Epoch:[0/2](588700/4588595) loss:3.057 lr:0.0000100 epoch_Time:25407.0min: [2024-01-05 07:59:08,261][model8_pretrain.py][INFO] Epoch:[0/2](588700/4588595) loss:2.858 lr:0.0000100 epoch_Time:25407.0min: [2024-01-05 07:59:45,205][model8_pretrain.py][INFO] Epoch:[0/2](588800/4588595) loss:2.582 lr:0.0000100 epoch_Time:25406.0min: [2024-01-05 07:59:45,205][model8_pretrain.py][INFO] Epoch:[0/2](588800/4588595) loss:2.847 lr:0.0000100 epoch_Time:25406.0min: [2024-01-05 07:59:45,205][model8_pretrain.py][INFO] Epoch:[0/2](588800/4588595) loss:2.924 lr:0.0000100 epoch_Time:25406.0min: [2024-01-05 07:59:45,205][model8_pretrain.py][INFO] Epoch:[0/2](588800/4588595) loss:2.463 lr:0.0000100 epoch_Time:25406.0min: [2024-01-05 07:59:45,205][model8_pretrain.py][INFO] Epoch:[0/2](588800/4588595) loss:2.827 lr:0.0000100 epoch_Time:25406.0min: [2024-01-05 07:59:45,205][model8_pretrain.py][INFO] Epoch:[0/2](588800/4588595) loss:3.471 lr:0.0000100 epoch_Time:25406.0min: [2024-01-05 07:59:45,205][model8_pretrain.py][INFO] Epoch:[0/2](588800/4588595) loss:2.557 lr:0.0000100 epoch_Time:25406.0min: [2024-01-05 07:59:45,205][model8_pretrain.py][INFO] 
Epoch:[0/2](588800/4588595) loss:2.609 lr:0.0000100 epoch_Time:25406.0min: [2024-01-05 08:00:33,878][model8_pretrain.py][INFO] Epoch:[0/2](588900/4588595) loss:2.863 lr:0.0000100 epoch_Time:25407.0min: [2024-01-05 08:00:33,878][model8_pretrain.py][INFO] Epoch:[0/2](588900/4588595) loss:2.490 lr:0.0000100 epoch_Time:25407.0min: [2024-01-05 08:00:33,878][model8_pretrain.py][INFO] Epoch:[0/2](588900/4588595) loss:3.110 lr:0.0000100 epoch_Time:25407.0min: [2024-01-05 08:00:33,878][model8_pretrain.py][INFO] Epoch:[0/2](588900/4588595) loss:2.774 lr:0.0000100 epoch_Time:25407.0min: [2024-01-05 08:00:33,878][model8_pretrain.py][INFO] Epoch:[0/2](588900/4588595) loss:2.236 lr:0.0000100 epoch_Time:25407.0min: [2024-01-05 08:00:33,878][model8_pretrain.py][INFO] Epoch:[0/2](588900/4588595) loss:2.359 lr:0.0000100 epoch_Time:25407.0min: [2024-01-05 08:00:33,878][model8_pretrain.py][INFO] Epoch:[0/2](588900/4588595) loss:2.825 lr:0.0000100 epoch_Time:25407.0min: [2024-01-05 08:00:33,878][model8_pretrain.py][INFO] Epoch:[0/2](588900/4588595) loss:2.792 lr:0.0000100 epoch_Time:25407.0min: [2024-01-05 08:01:10,790][model8_pretrain.py][INFO] Epoch:[0/2](589000/4588595) loss:3.074 lr:0.0000100 epoch_Time:25406.0min: [2024-01-05 08:01:10,790][model8_pretrain.py][INFO] Epoch:[0/2](589000/4588595) loss:2.839 lr:0.0000100 epoch_Time:25406.0min: [2024-01-05 08:01:10,790][model8_pretrain.py][INFO] Epoch:[0/2](589000/4588595) loss:2.572 lr:0.0000100 epoch_Time:25406.0min: [2024-01-05 08:01:10,790][model8_pretrain.py][INFO] Epoch:[0/2](589000/4588595) loss:2.577 lr:0.0000100 epoch_Time:25406.0min: [2024-01-05 08:01:10,790][model8_pretrain.py][INFO] Epoch:[0/2](589000/4588595) loss:2.672 lr:0.0000100 epoch_Time:25406.0min: [2024-01-05 08:01:10,790][model8_pretrain.py][INFO] Epoch:[0/2](589000/4588595) loss:3.045 lr:0.0000100 epoch_Time:25406.0min: [2024-01-05 08:01:10,792][model8_pretrain.py][INFO] Epoch:[0/2](589000/4588595) loss:3.097 lr:0.0000100 epoch_Time:25406.0min: [2024-01-05 
08:01:10,792][model8_pretrain.py][INFO] Epoch:[0/2](589000/4588595) loss:2.240 lr:0.0000100 epoch_Time:25406.0min: [2024-01-05 08:01:47,723][model8_pretrain.py][INFO] Epoch:[0/2](589100/4588595) loss:3.055 lr:0.0000100 epoch_Time:25405.0min: [2024-01-05 08:01:47,723][model8_pretrain.py][INFO] Epoch:[0/2](589100/4588595) loss:2.927 lr:0.0000100 epoch_Time:25405.0min: [2024-01-05 08:01:47,723][model8_pretrain.py][INFO] Epoch:[0/2](589100/4588595) loss:2.950 lr:0.0000100 epoch_Time:25405.0min: [2024-01-05 08:01:47,723][model8_pretrain.py][INFO] Epoch:[0/2](589100/4588595) loss:2.261 lr:0.0000100 epoch_Time:25405.0min: [2024-01-05 08:01:47,723][model8_pretrain.py][INFO] Epoch:[0/2](589100/4588595) loss:3.154 lr:0.0000100 epoch_Time:25405.0min: [2024-01-05 08:01:47,723][model8_pretrain.py][INFO] Epoch:[0/2](589100/4588595) loss:2.797 lr:0.0000100 epoch_Time:25405.0min: [2024-01-05 08:01:47,723][model8_pretrain.py][INFO] Epoch:[0/2](589100/4588595) loss:3.097 lr:0.0000100 epoch_Time:25405.0min: [2024-01-05 08:01:47,723][model8_pretrain.py][INFO] Epoch:[0/2](589100/4588595) loss:2.870 lr:0.0000100 epoch_Time:25405.0min: [2024-01-05 08:02:24,667][model8_pretrain.py][INFO] Epoch:[0/2](589200/4588595) loss:2.737 lr:0.0000100 epoch_Time:25404.0min: [2024-01-05 08:02:24,667][model8_pretrain.py][INFO] Epoch:[0/2](589200/4588595) loss:2.295 lr:0.0000100 epoch_Time:25404.0min: [2024-01-05 08:02:24,667][model8_pretrain.py][INFO] Epoch:[0/2](589200/4588595) loss:3.096 lr:0.0000100 epoch_Time:25404.0min: [2024-01-05 08:02:24,667][model8_pretrain.py][INFO] Epoch:[0/2](589200/4588595) loss:2.443 lr:0.0000100 epoch_Time:25404.0min: [2024-01-05 08:02:24,667][model8_pretrain.py][INFO] Epoch:[0/2](589200/4588595) loss:2.889 lr:0.0000100 epoch_Time:25404.0min: [2024-01-05 08:02:24,667][model8_pretrain.py][INFO] Epoch:[0/2](589200/4588595) loss:2.865 lr:0.0000100 epoch_Time:25404.0min: [2024-01-05 08:02:24,667][model8_pretrain.py][INFO] Epoch:[0/2](589200/4588595) loss:3.095 lr:0.0000100 
epoch_Time:25404.0min: [2024-01-05 08:02:24,668][model8_pretrain.py][INFO] Epoch:[0/2](589200/4588595) loss:3.246 lr:0.0000100 epoch_Time:25404.0min: [2024-01-05 08:03:01,602][model8_pretrain.py][INFO] Epoch:[0/2](589300/4588595) loss:2.440 lr:0.0000100 epoch_Time:25403.0min: [2024-01-05 08:03:01,602][model8_pretrain.py][INFO] Epoch:[0/2](589300/4588595) loss:2.694 lr:0.0000100 epoch_Time:25403.0min: [2024-01-05 08:03:01,602][model8_pretrain.py][INFO] Epoch:[0/2](589300/4588595) loss:2.916 lr:0.0000100 epoch_Time:25403.0min: [2024-01-05 08:03:01,602][model8_pretrain.py][INFO] Epoch:[0/2](589300/4588595) loss:3.460 lr:0.0000100 epoch_Time:25403.0min: [2024-01-05 08:03:01,602][model8_pretrain.py][INFO] Epoch:[0/2](589300/4588595) loss:3.120 lr:0.0000100 epoch_Time:25403.0min: [2024-01-05 08:03:01,602][model8_pretrain.py][INFO] Epoch:[0/2](589300/4588595) loss:1.931 lr:0.0000100 epoch_Time:25403.0min: [2024-01-05 08:03:01,602][model8_pretrain.py][INFO] Epoch:[0/2](589300/4588595) loss:2.455 lr:0.0000100 epoch_Time:25403.0min: [2024-01-05 08:03:01,602][model8_pretrain.py][INFO] Epoch:[0/2](589300/4588595) loss:3.107 lr:0.0000100 epoch_Time:25403.0min: [2024-01-05 08:03:38,537][model8_pretrain.py][INFO] Epoch:[0/2](589400/4588595) loss:3.292 lr:0.0000100 epoch_Time:25403.0min: [2024-01-05 08:03:38,537][model8_pretrain.py][INFO] Epoch:[0/2](589400/4588595) loss:2.885 lr:0.0000100 epoch_Time:25403.0min: [2024-01-05 08:03:38,537][model8_pretrain.py][INFO] Epoch:[0/2](589400/4588595) loss:3.114 lr:0.0000100 epoch_Time:25403.0min: [2024-01-05 08:03:38,537][model8_pretrain.py][INFO] Epoch:[0/2](589400/4588595) loss:2.878 lr:0.0000100 epoch_Time:25403.0min: [2024-01-05 08:03:38,537][model8_pretrain.py][INFO] Epoch:[0/2](589400/4588595) loss:3.028 lr:0.0000100 epoch_Time:25403.0min: [2024-01-05 08:03:38,537][model8_pretrain.py][INFO] Epoch:[0/2](589400/4588595) loss:3.204 lr:0.0000100 epoch_Time:25403.0min: [2024-01-05 08:03:38,537][model8_pretrain.py][INFO] 
Epoch:[0/2](589400/4588595) loss:2.972 lr:0.0000100 epoch_Time:25403.0min: [2024-01-05 08:03:38,537][model8_pretrain.py][INFO] Epoch:[0/2](589400/4588595) loss:3.001 lr:0.0000100 epoch_Time:25403.0min: [2024-01-05 08:04:15,472][model8_pretrain.py][INFO] Epoch:[0/2](589500/4588595) loss:2.981 lr:0.0000100 epoch_Time:25402.0min: [2024-01-05 08:04:15,472][model8_pretrain.py][INFO] Epoch:[0/2](589500/4588595) loss:3.049 lr:0.0000100 epoch_Time:25402.0min: [2024-01-05 08:04:15,472][model8_pretrain.py][INFO] Epoch:[0/2](589500/4588595) loss:2.933 lr:0.0000100 epoch_Time:25402.0min: [2024-01-05 08:04:15,472][model8_pretrain.py][INFO] Epoch:[0/2](589500/4588595) loss:3.063 lr:0.0000100 epoch_Time:25402.0min: [2024-01-05 08:04:15,472][model8_pretrain.py][INFO] Epoch:[0/2](589500/4588595) loss:2.605 lr:0.0000100 epoch_Time:25402.0min: [2024-01-05 08:04:15,472][model8_pretrain.py][INFO] Epoch:[0/2](589500/4588595) loss:3.023 lr:0.0000100 epoch_Time:25402.0min: [2024-01-05 08:04:15,472][model8_pretrain.py][INFO] Epoch:[0/2](589500/4588595) loss:3.048 lr:0.0000100 epoch_Time:25402.0min: [2024-01-05 08:04:15,472][model8_pretrain.py][INFO] Epoch:[0/2](589500/4588595) loss:2.274 lr:0.0000100 epoch_Time:25402.0min: [2024-01-05 08:04:52,397][model8_pretrain.py][INFO] Epoch:[0/2](589600/4588595) loss:2.630 lr:0.0000100 epoch_Time:25401.0min: [2024-01-05 08:04:52,397][model8_pretrain.py][INFO] Epoch:[0/2](589600/4588595) loss:2.396 lr:0.0000100 epoch_Time:25401.0min: [2024-01-05 08:04:52,397][model8_pretrain.py][INFO] Epoch:[0/2](589600/4588595) loss:2.591 lr:0.0000100 epoch_Time:25401.0min: [2024-01-05 08:04:52,397][model8_pretrain.py][INFO] Epoch:[0/2](589600/4588595) loss:2.298 lr:0.0000100 epoch_Time:25401.0min: [2024-01-05 08:04:52,397][model8_pretrain.py][INFO] Epoch:[0/2](589600/4588595) loss:2.840 lr:0.0000100 epoch_Time:25401.0min: [2024-01-05 08:04:52,397][model8_pretrain.py][INFO] Epoch:[0/2](589600/4588595) loss:2.796 lr:0.0000100 epoch_Time:25401.0min: [2024-01-05 
08:04:52,397][model8_pretrain.py][INFO] Epoch:[0/2](589600/4588595) loss:3.333 lr:0.0000100 epoch_Time:25401.0min: [2024-01-05 08:04:52,397][model8_pretrain.py][INFO] Epoch:[0/2](589600/4588595) loss:2.900 lr:0.0000100 epoch_Time:25401.0min: [2024-01-05 08:05:39,307][model8_pretrain.py][INFO] Epoch:[0/2](589700/4588595) loss:3.299 lr:0.0000100 epoch_Time:25402.0min: [2024-01-05 08:05:39,307][model8_pretrain.py][INFO] Epoch:[0/2](589700/4588595) loss:2.670 lr:0.0000100 epoch_Time:25402.0min: [2024-01-05 08:05:39,307][model8_pretrain.py][INFO] Epoch:[0/2](589700/4588595) loss:2.615 lr:0.0000100 epoch_Time:25402.0min: [2024-01-05 08:05:39,307][model8_pretrain.py][INFO] Epoch:[0/2](589700/4588595) loss:3.322 lr:0.0000100 epoch_Time:25402.0min: [2024-01-05 08:05:39,307][model8_pretrain.py][INFO] Epoch:[0/2](589700/4588595) loss:2.925 lr:0.0000100 epoch_Time:25402.0min: [2024-01-05 08:05:39,307][model8_pretrain.py][INFO] Epoch:[0/2](589700/4588595) loss:3.028 lr:0.0000100 epoch_Time:25402.0min: [2024-01-05 08:05:39,307][model8_pretrain.py][INFO] Epoch:[0/2](589700/4588595) loss:3.360 lr:0.0000100 epoch_Time:25402.0min: [2024-01-05 08:05:39,307][model8_pretrain.py][INFO] Epoch:[0/2](589700/4588595) loss:2.829 lr:0.0000100 epoch_Time:25402.0min: [2024-01-05 08:06:17,903][model8_pretrain.py][INFO] Epoch:[0/2](589800/4588595) loss:3.063 lr:0.0000100 epoch_Time:25401.0min: [2024-01-05 08:06:17,903][model8_pretrain.py][INFO] Epoch:[0/2](589800/4588595) loss:2.915 lr:0.0000100 epoch_Time:25401.0min: [2024-01-05 08:06:17,903][model8_pretrain.py][INFO] Epoch:[0/2](589800/4588595) loss:2.386 lr:0.0000100 epoch_Time:25401.0min: [2024-01-05 08:06:17,903][model8_pretrain.py][INFO] Epoch:[0/2](589800/4588595) loss:2.757 lr:0.0000100 epoch_Time:25401.0min: [2024-01-05 08:06:17,903][model8_pretrain.py][INFO] Epoch:[0/2](589800/4588595) loss:2.746 lr:0.0000100 epoch_Time:25401.0min: [2024-01-05 08:06:17,903][model8_pretrain.py][INFO] Epoch:[0/2](589800/4588595) loss:2.274 lr:0.0000100 
epoch_Time:25401.0min: [2024-01-05 08:06:17,903][model8_pretrain.py][INFO] Epoch:[0/2](589800/4588595) loss:3.120 lr:0.0000100 epoch_Time:25401.0min: [2024-01-05 08:06:17,903][model8_pretrain.py][INFO] Epoch:[0/2](589800/4588595) loss:3.284 lr:0.0000100 epoch_Time:25401.0min: [2024-01-05 08:06:54,834][model8_pretrain.py][INFO] Epoch:[0/2](589900/4588595) loss:2.787 lr:0.0000100 epoch_Time:25400.0min: [2024-01-05 08:06:54,834][model8_pretrain.py][INFO] Epoch:[0/2](589900/4588595) loss:2.588 lr:0.0000100 epoch_Time:25400.0min: [2024-01-05 08:06:54,834][model8_pretrain.py][INFO] Epoch:[0/2](589900/4588595) loss:3.407 lr:0.0000100 epoch_Time:25400.0min: [2024-01-05 08:06:54,834][model8_pretrain.py][INFO] Epoch:[0/2](589900/4588595) loss:2.503 lr:0.0000100 epoch_Time:25400.0min: [2024-01-05 08:06:54,834][model8_pretrain.py][INFO] Epoch:[0/2](589900/4588595) loss:2.899 lr:0.0000100 epoch_Time:25400.0min: [2024-01-05 08:06:54,834][model8_pretrain.py][INFO] Epoch:[0/2](589900/4588595) loss:2.368 lr:0.0000100 epoch_Time:25400.0min: [2024-01-05 08:06:54,835][model8_pretrain.py][INFO] Epoch:[0/2](589900/4588595) loss:2.396 lr:0.0000100 epoch_Time:25400.0min: [2024-01-05 08:06:54,836][model8_pretrain.py][INFO] Epoch:[0/2](589900/4588595) loss:3.425 lr:0.0000100 epoch_Time:25400.0min: [2024-01-05 08:07:31,773][model8_pretrain.py][INFO] Epoch:[0/2](590000/4588595) loss:2.074 lr:0.0000100 epoch_Time:25400.0min: [2024-01-05 08:07:31,773][model8_pretrain.py][INFO] Epoch:[0/2](590000/4588595) loss:2.499 lr:0.0000100 epoch_Time:25400.0min: [2024-01-05 08:07:31,773][model8_pretrain.py][INFO] Epoch:[0/2](590000/4588595) loss:2.960 lr:0.0000100 epoch_Time:25400.0min: [2024-01-05 08:07:31,773][model8_pretrain.py][INFO] Epoch:[0/2](590000/4588595) loss:2.427 lr:0.0000100 epoch_Time:25400.0min: [2024-01-05 08:07:31,773][model8_pretrain.py][INFO] Epoch:[0/2](590000/4588595) loss:2.728 lr:0.0000100 epoch_Time:25400.0min: [2024-01-05 08:07:31,773][model8_pretrain.py][INFO] 
Epoch:[0/2](590000/4588595) loss:1.932 lr:0.0000100 epoch_Time:25400.0min: [2024-01-05 08:07:31,773][model8_pretrain.py][INFO] Epoch:[0/2](590000/4588595) loss:3.021 lr:0.0000100 epoch_Time:25400.0min: [2024-01-05 08:07:31,773][model8_pretrain.py][INFO] Epoch:[0/2](590000/4588595) loss:2.697 lr:0.0000100 epoch_Time:25400.0min: [2024-01-05 08:08:08,718][model8_pretrain.py][INFO] Epoch:[0/2](590100/4588595) loss:2.798 lr:0.0000100 epoch_Time:25398.0min: [2024-01-05 08:08:08,718][model8_pretrain.py][INFO] Epoch:[0/2](590100/4588595) loss:3.051 lr:0.0000100 epoch_Time:25398.0min: [2024-01-05 08:08:08,718][model8_pretrain.py][INFO] Epoch:[0/2](590100/4588595) loss:3.286 lr:0.0000100 epoch_Time:25398.0min: [2024-01-05 08:08:08,718][model8_pretrain.py][INFO] Epoch:[0/2](590100/4588595) loss:3.298 lr:0.0000100 epoch_Time:25398.0min: [2024-01-05 08:08:08,718][model8_pretrain.py][INFO] Epoch:[0/2](590100/4588595) loss:3.086 lr:0.0000100 epoch_Time:25398.0min: [2024-01-05 08:08:08,718][model8_pretrain.py][INFO] Epoch:[0/2](590100/4588595) loss:3.011 lr:0.0000100 epoch_Time:25398.0min: [2024-01-05 08:08:08,718][model8_pretrain.py][INFO] Epoch:[0/2](590100/4588595) loss:2.794 lr:0.0000100 epoch_Time:25398.0min: [2024-01-05 08:08:08,718][model8_pretrain.py][INFO] Epoch:[0/2](590100/4588595) loss:2.774 lr:0.0000100 epoch_Time:25398.0min: [2024-01-05 08:08:45,648][model8_pretrain.py][INFO] Epoch:[0/2](590200/4588595) loss:2.989 lr:0.0000100 epoch_Time:25398.0min: [2024-01-05 08:08:45,648][model8_pretrain.py][INFO] Epoch:[0/2](590200/4588595) loss:2.615 lr:0.0000100 epoch_Time:25398.0min: [2024-01-05 08:08:45,648][model8_pretrain.py][INFO] Epoch:[0/2](590200/4588595) loss:2.743 lr:0.0000100 epoch_Time:25398.0min: [2024-01-05 08:08:45,648][model8_pretrain.py][INFO] Epoch:[0/2](590200/4588595) loss:2.810 lr:0.0000100 epoch_Time:25398.0min: [2024-01-05 08:08:45,648][model8_pretrain.py][INFO] Epoch:[0/2](590200/4588595) loss:2.532 lr:0.0000100 epoch_Time:25398.0min: [2024-01-05 
08:08:45,648][model8_pretrain.py][INFO] Epoch:[0/2](590200/4588595) loss:2.599 lr:0.0000100 epoch_Time:25398.0min: [2024-01-05 08:08:45,648][model8_pretrain.py][INFO] Epoch:[0/2](590200/4588595) loss:2.705 lr:0.0000100 epoch_Time:25398.0min: [2024-01-05 08:08:45,648][model8_pretrain.py][INFO] Epoch:[0/2](590200/4588595) loss:2.751 lr:0.0000100 epoch_Time:25398.0min: [2024-01-05 08:09:22,604][model8_pretrain.py][INFO] Epoch:[0/2](590300/4588595) loss:2.887 lr:0.0000100 epoch_Time:25397.0min: [2024-01-05 08:09:22,604][model8_pretrain.py][INFO] Epoch:[0/2](590300/4588595) loss:3.202 lr:0.0000100 epoch_Time:25397.0min: [2024-01-05 08:09:22,604][model8_pretrain.py][INFO] Epoch:[0/2](590300/4588595) loss:2.510 lr:0.0000100 epoch_Time:25397.0min: [2024-01-05 08:09:22,604][model8_pretrain.py][INFO] Epoch:[0/2](590300/4588595) loss:2.152 lr:0.0000100 epoch_Time:25397.0min: [2024-01-05 08:09:22,604][model8_pretrain.py][INFO] Epoch:[0/2](590300/4588595) loss:2.888 lr:0.0000100 epoch_Time:25397.0min: [2024-01-05 08:09:22,604][model8_pretrain.py][INFO] Epoch:[0/2](590300/4588595) loss:3.058 lr:0.0000100 epoch_Time:25397.0min: [2024-01-05 08:09:22,604][model8_pretrain.py][INFO] Epoch:[0/2](590300/4588595) loss:2.931 lr:0.0000100 epoch_Time:25397.0min: [2024-01-05 08:09:22,604][model8_pretrain.py][INFO] Epoch:[0/2](590300/4588595) loss:2.951 lr:0.0000100 epoch_Time:25397.0min: [2024-01-05 08:09:59,534][model8_pretrain.py][INFO] Epoch:[0/2](590400/4588595) loss:2.874 lr:0.0000100 epoch_Time:25396.0min: [2024-01-05 08:09:59,534][model8_pretrain.py][INFO] Epoch:[0/2](590400/4588595) loss:2.991 lr:0.0000100 epoch_Time:25396.0min: [2024-01-05 08:09:59,534][model8_pretrain.py][INFO] Epoch:[0/2](590400/4588595) loss:3.038 lr:0.0000100 epoch_Time:25396.0min: [2024-01-05 08:09:59,534][model8_pretrain.py][INFO] Epoch:[0/2](590400/4588595) loss:2.590 lr:0.0000100 epoch_Time:25396.0min: [2024-01-05 08:09:59,534][model8_pretrain.py][INFO] Epoch:[0/2](590400/4588595) loss:2.965 lr:0.0000100 
epoch_Time:25396.0min: [2024-01-05 08:09:59,534][model8_pretrain.py][INFO] Epoch:[0/2](590400/4588595) loss:2.684 lr:0.0000100 epoch_Time:25396.0min: [2024-01-05 08:09:59,534][model8_pretrain.py][INFO] Epoch:[0/2](590400/4588595) loss:2.629 lr:0.0000100 epoch_Time:25396.0min: [2024-01-05 08:09:59,534][model8_pretrain.py][INFO] Epoch:[0/2](590400/4588595) loss:3.094 lr:0.0000100 epoch_Time:25396.0min: [2024-01-05 08:10:46,601][model8_pretrain.py][INFO] Epoch:[0/2](590500/4588595) loss:2.879 lr:0.0000100 epoch_Time:25397.0min: [2024-01-05 08:10:46,601][model8_pretrain.py][INFO] Epoch:[0/2](590500/4588595) loss:2.786 lr:0.0000100 epoch_Time:25397.0min: [2024-01-05 08:10:46,601][model8_pretrain.py][INFO] Epoch:[0/2](590500/4588595) loss:2.998 lr:0.0000100 epoch_Time:25397.0min: [2024-01-05 08:10:46,601][model8_pretrain.py][INFO] Epoch:[0/2](590500/4588595) loss:3.291 lr:0.0000100 epoch_Time:25397.0min: [2024-01-05 08:10:46,601][model8_pretrain.py][INFO] Epoch:[0/2](590500/4588595) loss:3.008 lr:0.0000100 epoch_Time:25397.0min: [2024-01-05 08:10:46,601][model8_pretrain.py][INFO] Epoch:[0/2](590500/4588595) loss:2.859 lr:0.0000100 epoch_Time:25397.0min: [2024-01-05 08:10:46,601][model8_pretrain.py][INFO] Epoch:[0/2](590500/4588595) loss:2.939 lr:0.0000100 epoch_Time:25397.0min: [2024-01-05 08:10:46,601][model8_pretrain.py][INFO] Epoch:[0/2](590500/4588595) loss:3.103 lr:0.0000100 epoch_Time:25397.0min: [2024-01-05 08:11:25,201][model8_pretrain.py][INFO] Epoch:[0/2](590600/4588595) loss:2.508 lr:0.0000100 epoch_Time:25396.0min: [2024-01-05 08:11:25,201][model8_pretrain.py][INFO] Epoch:[0/2](590600/4588595) loss:2.404 lr:0.0000100 epoch_Time:25396.0min: [2024-01-05 08:11:25,201][model8_pretrain.py][INFO] Epoch:[0/2](590600/4588595) loss:2.887 lr:0.0000100 epoch_Time:25396.0min: [2024-01-05 08:11:25,201][model8_pretrain.py][INFO] Epoch:[0/2](590600/4588595) loss:3.135 lr:0.0000100 epoch_Time:25396.0min: [2024-01-05 08:11:25,201][model8_pretrain.py][INFO] 
Epoch:[0/2](590600/4588595) loss:3.269 lr:0.0000100 epoch_Time:25396.0min: [2024-01-05 08:11:25,201][model8_pretrain.py][INFO] Epoch:[0/2](590600/4588595) loss:2.903 lr:0.0000100 epoch_Time:25396.0min: [2024-01-05 08:11:25,201][model8_pretrain.py][INFO] Epoch:[0/2](590600/4588595) loss:2.355 lr:0.0000100 epoch_Time:25396.0min: [2024-01-05 08:11:25,201][model8_pretrain.py][INFO] Epoch:[0/2](590600/4588595) loss:2.918 lr:0.0000100 epoch_Time:25396.0min: [2024-01-05 08:12:02,137][model8_pretrain.py][INFO] Epoch:[0/2](590700/4588595) loss:2.614 lr:0.0000100 epoch_Time:25395.0min: [2024-01-05 08:12:02,137][model8_pretrain.py][INFO] Epoch:[0/2](590700/4588595) loss:2.980 lr:0.0000100 epoch_Time:25395.0min: [2024-01-05 08:12:02,137][model8_pretrain.py][INFO] Epoch:[0/2](590700/4588595) loss:2.810 lr:0.0000100 epoch_Time:25395.0min: [2024-01-05 08:12:02,137][model8_pretrain.py][INFO] Epoch:[0/2](590700/4588595) loss:2.785 lr:0.0000100 epoch_Time:25395.0min: [2024-01-05 08:12:02,137][model8_pretrain.py][INFO] Epoch:[0/2](590700/4588595) loss:2.938 lr:0.0000100 epoch_Time:25395.0min: [2024-01-05 08:12:02,137][model8_pretrain.py][INFO] Epoch:[0/2](590700/4588595) loss:2.869 lr:0.0000100 epoch_Time:25395.0min: [2024-01-05 08:12:02,137][model8_pretrain.py][INFO] Epoch:[0/2](590700/4588595) loss:3.054 lr:0.0000100 epoch_Time:25395.0min: [2024-01-05 08:12:02,138][model8_pretrain.py][INFO] Epoch:[0/2](590700/4588595) loss:2.747 lr:0.0000100 epoch_Time:25395.0min: [2024-01-05 08:12:39,084][model8_pretrain.py][INFO] Epoch:[0/2](590800/4588595) loss:2.913 lr:0.0000100 epoch_Time:25395.0min: [2024-01-05 08:12:39,084][model8_pretrain.py][INFO] Epoch:[0/2](590800/4588595) loss:2.654 lr:0.0000100 epoch_Time:25395.0min: [2024-01-05 08:12:39,084][model8_pretrain.py][INFO] Epoch:[0/2](590800/4588595) loss:2.680 lr:0.0000100 epoch_Time:25395.0min: [2024-01-05 08:12:39,084][model8_pretrain.py][INFO] Epoch:[0/2](590800/4588595) loss:2.619 lr:0.0000100 epoch_Time:25395.0min: [2024-01-05 
08:12:39,084][model8_pretrain.py][INFO] Epoch:[0/2](590800/4588595) loss:3.018 lr:0.0000100 epoch_Time:25395.0min: [2024-01-05 08:12:39,084][model8_pretrain.py][INFO] Epoch:[0/2](590800/4588595) loss:2.959 lr:0.0000100 epoch_Time:25395.0min: [2024-01-05 08:12:39,084][model8_pretrain.py][INFO] Epoch:[0/2](590800/4588595) loss:2.775 lr:0.0000100 epoch_Time:25395.0min: [2024-01-05 08:12:39,084][model8_pretrain.py][INFO] Epoch:[0/2](590800/4588595) loss:3.204 lr:0.0000100 epoch_Time:25395.0min: [2024-01-05 08:13:16,014][model8_pretrain.py][INFO] Epoch:[0/2](590900/4588595) loss:2.201 lr:0.0000100 epoch_Time:25394.0min: [2024-01-05 08:13:16,014][model8_pretrain.py][INFO] Epoch:[0/2](590900/4588595) loss:2.346 lr:0.0000100 epoch_Time:25394.0min: [2024-01-05 08:13:16,014][model8_pretrain.py][INFO] Epoch:[0/2](590900/4588595) loss:3.221 lr:0.0000100 epoch_Time:25394.0min: [2024-01-05 08:13:16,014][model8_pretrain.py][INFO] Epoch:[0/2](590900/4588595) loss:3.205 lr:0.0000100 epoch_Time:25394.0min: [2024-01-05 08:13:16,014][model8_pretrain.py][INFO] Epoch:[0/2](590900/4588595) loss:2.925 lr:0.0000100 epoch_Time:25394.0min: [2024-01-05 08:13:16,014][model8_pretrain.py][INFO] Epoch:[0/2](590900/4588595) loss:3.108 lr:0.0000100 epoch_Time:25394.0min: [2024-01-05 08:13:16,015][model8_pretrain.py][INFO] Epoch:[0/2](590900/4588595) loss:2.937 lr:0.0000100 epoch_Time:25394.0min: [2024-01-05 08:13:16,015][model8_pretrain.py][INFO] Epoch:[0/2](590900/4588595) loss:2.649 lr:0.0000100 epoch_Time:25394.0min: [2024-01-05 08:13:52,950][model8_pretrain.py][INFO] Epoch:[0/2](591000/4588595) loss:2.217 lr:0.0000100 epoch_Time:25393.0min: [2024-01-05 08:13:52,950][model8_pretrain.py][INFO] Epoch:[0/2](591000/4588595) loss:3.032 lr:0.0000100 epoch_Time:25393.0min: [2024-01-05 08:13:52,950][model8_pretrain.py][INFO] Epoch:[0/2](591000/4588595) loss:2.915 lr:0.0000100 epoch_Time:25393.0min: [2024-01-05 08:13:52,950][model8_pretrain.py][INFO] Epoch:[0/2](591000/4588595) loss:2.774 lr:0.0000100 
epoch_Time:25393.0min: [2024-01-05 08:13:52,950][model8_pretrain.py][INFO] Epoch:[0/2](591000/4588595) loss:2.656 lr:0.0000100 epoch_Time:25393.0min: [2024-01-05 08:13:52,950][model8_pretrain.py][INFO] Epoch:[0/2](591000/4588595) loss:2.312 lr:0.0000100 epoch_Time:25393.0min: [2024-01-05 08:13:52,950][model8_pretrain.py][INFO] Epoch:[0/2](591000/4588595) loss:2.575 lr:0.0000100 epoch_Time:25393.0min: [2024-01-05 08:13:52,950][model8_pretrain.py][INFO] Epoch:[0/2](591000/4588595) loss:2.385 lr:0.0000100 epoch_Time:25393.0min: [2024-01-05 08:14:29,888][model8_pretrain.py][INFO] Epoch:[0/2](591100/4588595) loss:2.669 lr:0.0000100 epoch_Time:25393.0min: [2024-01-05 08:14:29,888][model8_pretrain.py][INFO] Epoch:[0/2](591100/4588595) loss:2.605 lr:0.0000100 epoch_Time:25392.0min: [2024-01-05 08:14:29,888][model8_pretrain.py][INFO] Epoch:[0/2](591100/4588595) loss:3.341 lr:0.0000100 epoch_Time:25392.0min: [2024-01-05 08:14:29,888][model8_pretrain.py][INFO] Epoch:[0/2](591100/4588595) loss:3.010 lr:0.0000100 epoch_Time:25392.0min: [2024-01-05 08:14:29,888][model8_pretrain.py][INFO] Epoch:[0/2](591100/4588595) loss:2.534 lr:0.0000100 epoch_Time:25392.0min: [2024-01-05 08:14:29,888][model8_pretrain.py][INFO] Epoch:[0/2](591100/4588595) loss:3.069 lr:0.0000100 epoch_Time:25392.0min: [2024-01-05 08:14:29,888][model8_pretrain.py][INFO] Epoch:[0/2](591100/4588595) loss:3.242 lr:0.0000100 epoch_Time:25392.0min: [2024-01-05 08:14:29,888][model8_pretrain.py][INFO] Epoch:[0/2](591100/4588595) loss:2.878 lr:0.0000100 epoch_Time:25393.0min: [2024-01-05 08:15:06,818][model8_pretrain.py][INFO] Epoch:[0/2](591200/4588595) loss:2.766 lr:0.0000100 epoch_Time:25391.0min: [2024-01-05 08:15:06,818][model8_pretrain.py][INFO] Epoch:[0/2](591200/4588595) loss:2.787 lr:0.0000100 epoch_Time:25391.0min: [2024-01-05 08:15:06,818][model8_pretrain.py][INFO] Epoch:[0/2](591200/4588595) loss:3.296 lr:0.0000100 epoch_Time:25391.0min: [2024-01-05 08:15:06,818][model8_pretrain.py][INFO] 
Epoch:[0/2](591200/4588595) loss:2.819 lr:0.0000100 epoch_Time:25391.0min: [2024-01-05 08:15:06,818][model8_pretrain.py][INFO] Epoch:[0/2](591200/4588595) loss:3.061 lr:0.0000100 epoch_Time:25391.0min: [2024-01-05 08:15:06,819][model8_pretrain.py][INFO] Epoch:[0/2](591200/4588595) loss:3.392 lr:0.0000100 epoch_Time:25391.0min: [2024-01-05 08:15:06,819][model8_pretrain.py][INFO] Epoch:[0/2](591200/4588595) loss:2.625 lr:0.0000100 epoch_Time:25391.0min: [2024-01-05 08:15:06,819][model8_pretrain.py][INFO] Epoch:[0/2](591200/4588595) loss:3.208 lr:0.0000100 epoch_Time:25391.0min: [2024-01-05 08:15:52,246][model8_pretrain.py][INFO] Epoch:[0/2](591300/4588595) loss:3.358 lr:0.0000100 epoch_Time:25391.0min: [2024-01-05 08:15:52,246][model8_pretrain.py][INFO] Epoch:[0/2](591300/4588595) loss:2.436 lr:0.0000100 epoch_Time:25391.0min: [2024-01-05 08:15:52,246][model8_pretrain.py][INFO] Epoch:[0/2](591300/4588595) loss:3.290 lr:0.0000100 epoch_Time:25391.0min: [2024-01-05 08:15:52,246][model8_pretrain.py][INFO] Epoch:[0/2](591300/4588595) loss:2.653 lr:0.0000100 epoch_Time:25391.0min: [2024-01-05 08:15:52,246][model8_pretrain.py][INFO] Epoch:[0/2](591300/4588595) loss:3.373 lr:0.0000100 epoch_Time:25391.0min: [2024-01-05 08:15:52,246][model8_pretrain.py][INFO] Epoch:[0/2](591300/4588595) loss:2.327 lr:0.0000100 epoch_Time:25391.0min: [2024-01-05 08:15:52,246][model8_pretrain.py][INFO] Epoch:[0/2](591300/4588595) loss:3.151 lr:0.0000100 epoch_Time:25391.0min: [2024-01-05 08:15:52,246][model8_pretrain.py][INFO] Epoch:[0/2](591300/4588595) loss:2.994 lr:0.0000100 epoch_Time:25391.0min: [2024-01-05 08:16:32,686][model8_pretrain.py][INFO] Epoch:[0/2](591400/4588595) loss:2.460 lr:0.0000100 epoch_Time:25392.0min: [2024-01-05 08:16:32,686][model8_pretrain.py][INFO] Epoch:[0/2](591400/4588595) loss:2.460 lr:0.0000100 epoch_Time:25392.0min: [2024-01-05 08:16:32,686][model8_pretrain.py][INFO] Epoch:[0/2](591400/4588595) loss:3.540 lr:0.0000100 epoch_Time:25392.0min: [2024-01-05 
08:16:32,686][model8_pretrain.py][INFO] Epoch:[0/2](591400/4588595) loss:2.927 lr:0.0000100 epoch_Time:25392.0min: [2024-01-05 08:16:32,686][model8_pretrain.py][INFO] Epoch:[0/2](591400/4588595) loss:2.841 lr:0.0000100 epoch_Time:25392.0min: [2024-01-05 08:16:32,686][model8_pretrain.py][INFO] Epoch:[0/2](591400/4588595) loss:3.078 lr:0.0000100 epoch_Time:25392.0min: [2024-01-05 08:16:32,687][model8_pretrain.py][INFO] Epoch:[0/2](591400/4588595) loss:2.499 lr:0.0000100 epoch_Time:25392.0min: [2024-01-05 08:16:32,687][model8_pretrain.py][INFO] Epoch:[0/2](591400/4588595) loss:2.833 lr:0.0000100 epoch_Time:25392.0min: [2024-01-05 08:17:09,627][model8_pretrain.py][INFO] Epoch:[0/2](591500/4588595) loss:2.184 lr:0.0000100 epoch_Time:25390.0min: [2024-01-05 08:17:09,627][model8_pretrain.py][INFO] Epoch:[0/2](591500/4588595) loss:2.830 lr:0.0000100 epoch_Time:25390.0min: [2024-01-05 08:17:09,627][model8_pretrain.py][INFO] Epoch:[0/2](591500/4588595) loss:2.787 lr:0.0000100 epoch_Time:25390.0min: [2024-01-05 08:17:09,627][model8_pretrain.py][INFO] Epoch:[0/2](591500/4588595) loss:2.281 lr:0.0000100 epoch_Time:25390.0min: [2024-01-05 08:17:09,627][model8_pretrain.py][INFO] Epoch:[0/2](591500/4588595) loss:2.732 lr:0.0000100 epoch_Time:25390.0min: [2024-01-05 08:17:09,627][model8_pretrain.py][INFO] Epoch:[0/2](591500/4588595) loss:2.734 lr:0.0000100 epoch_Time:25390.0min: [2024-01-05 08:17:09,627][model8_pretrain.py][INFO] Epoch:[0/2](591500/4588595) loss:2.907 lr:0.0000100 epoch_Time:25390.0min: [2024-01-05 08:17:09,627][model8_pretrain.py][INFO] Epoch:[0/2](591500/4588595) loss:2.625 lr:0.0000100 epoch_Time:25390.0min: [2024-01-05 08:17:46,571][model8_pretrain.py][INFO] Epoch:[0/2](591600/4588595) loss:2.898 lr:0.0000100 epoch_Time:25390.0min: [2024-01-05 08:17:46,571][model8_pretrain.py][INFO] Epoch:[0/2](591600/4588595) loss:2.858 lr:0.0000100 epoch_Time:25390.0min: [2024-01-05 08:17:46,571][model8_pretrain.py][INFO] Epoch:[0/2](591600/4588595) loss:2.978 lr:0.0000100 
epoch_Time:25390.0min: [2024-01-05 08:17:46,571][model8_pretrain.py][INFO] Epoch:[0/2](591600/4588595) loss:3.261 lr:0.0000100 epoch_Time:25390.0min: [2024-01-05 08:17:46,571][model8_pretrain.py][INFO] Epoch:[0/2](591600/4588595) loss:2.704 lr:0.0000100 epoch_Time:25390.0min: [2024-01-05 08:17:46,571][model8_pretrain.py][INFO] Epoch:[0/2](591600/4588595) loss:2.973 lr:0.0000100 epoch_Time:25390.0min: [2024-01-05 08:17:46,571][model8_pretrain.py][INFO] Epoch:[0/2](591600/4588595) loss:2.671 lr:0.0000100 epoch_Time:25390.0min: [2024-01-05 08:17:46,571][model8_pretrain.py][INFO] Epoch:[0/2](591600/4588595) loss:2.991 lr:0.0000100 epoch_Time:25390.0min: [2024-01-05 08:18:23,520][model8_pretrain.py][INFO] Epoch:[0/2](591700/4588595) loss:2.847 lr:0.0000100 epoch_Time:25389.0min: [2024-01-05 08:18:23,520][model8_pretrain.py][INFO] Epoch:[0/2](591700/4588595) loss:2.138 lr:0.0000100 epoch_Time:25389.0min: [2024-01-05 08:18:23,520][model8_pretrain.py][INFO] Epoch:[0/2](591700/4588595) loss:3.464 lr:0.0000100 epoch_Time:25389.0min: [2024-01-05 08:18:23,520][model8_pretrain.py][INFO] Epoch:[0/2](591700/4588595) loss:3.099 lr:0.0000100 epoch_Time:25389.0min: [2024-01-05 08:18:23,520][model8_pretrain.py][INFO] Epoch:[0/2](591700/4588595) loss:2.541 lr:0.0000100 epoch_Time:25389.0min: [2024-01-05 08:18:23,520][model8_pretrain.py][INFO] Epoch:[0/2](591700/4588595) loss:2.696 lr:0.0000100 epoch_Time:25389.0min: [2024-01-05 08:18:23,520][model8_pretrain.py][INFO] Epoch:[0/2](591700/4588595) loss:2.989 lr:0.0000100 epoch_Time:25389.0min: [2024-01-05 08:18:23,520][model8_pretrain.py][INFO] Epoch:[0/2](591700/4588595) loss:2.504 lr:0.0000100 epoch_Time:25389.0min: [2024-01-05 08:19:00,458][model8_pretrain.py][INFO] Epoch:[0/2](591800/4588595) loss:3.116 lr:0.0000100 epoch_Time:25388.0min: [2024-01-05 08:19:00,458][model8_pretrain.py][INFO] Epoch:[0/2](591800/4588595) loss:2.752 lr:0.0000100 epoch_Time:25388.0min: [2024-01-05 08:19:00,458][model8_pretrain.py][INFO] 
Epoch:[0/2](591800/4588595) loss:2.420 lr:0.0000100 epoch_Time:25388.0min: [2024-01-05 08:19:00,458][model8_pretrain.py][INFO] Epoch:[0/2](591800/4588595) loss:2.993 lr:0.0000100 epoch_Time:25388.0min: [2024-01-05 08:19:00,458][model8_pretrain.py][INFO] Epoch:[0/2](591800/4588595) loss:3.045 lr:0.0000100 epoch_Time:25388.0min: [2024-01-05 08:19:00,458][model8_pretrain.py][INFO] Epoch:[0/2](591800/4588595) loss:3.303 lr:0.0000100 epoch_Time:25388.0min: [2024-01-05 08:19:00,458][model8_pretrain.py][INFO] Epoch:[0/2](591800/4588595) loss:3.077 lr:0.0000100 epoch_Time:25388.0min: [2024-01-05 08:19:00,458][model8_pretrain.py][INFO] Epoch:[0/2](591800/4588595) loss:2.140 lr:0.0000100 epoch_Time:25388.0min: [2024-01-05 08:19:37,387][model8_pretrain.py][INFO] Epoch:[0/2](591900/4588595) loss:2.843 lr:0.0000100 epoch_Time:25388.0min: [2024-01-05 08:19:37,387][model8_pretrain.py][INFO] Epoch:[0/2](591900/4588595) loss:1.624 lr:0.0000100 epoch_Time:25388.0min: [2024-01-05 08:19:37,388][model8_pretrain.py][INFO] Epoch:[0/2](591900/4588595) loss:2.892 lr:0.0000100 epoch_Time:25388.0min: [2024-01-05 08:19:37,388][model8_pretrain.py][INFO] Epoch:[0/2](591900/4588595) loss:3.153 lr:0.0000100 epoch_Time:25388.0min: [2024-01-05 08:19:37,388][model8_pretrain.py][INFO] Epoch:[0/2](591900/4588595) loss:3.089 lr:0.0000100 epoch_Time:25388.0min: [2024-01-05 08:19:37,388][model8_pretrain.py][INFO] Epoch:[0/2](591900/4588595) loss:2.186 lr:0.0000100 epoch_Time:25388.0min: [2024-01-05 08:19:37,388][model8_pretrain.py][INFO] Epoch:[0/2](591900/4588595) loss:3.082 lr:0.0000100 epoch_Time:25388.0min: [2024-01-05 08:19:37,388][model8_pretrain.py][INFO] Epoch:[0/2](591900/4588595) loss:2.736 lr:0.0000100 epoch_Time:25388.0min: [2024-01-05 08:20:14,302][model8_pretrain.py][INFO] Epoch:[0/2](592000/4588595) loss:2.496 lr:0.0000100 epoch_Time:25387.0min: [2024-01-05 08:20:14,302][model8_pretrain.py][INFO] Epoch:[0/2](592000/4588595) loss:2.801 lr:0.0000100 epoch_Time:25387.0min: [2024-01-05 
08:20:14,302][model8_pretrain.py][INFO] Epoch:[0/2](592000/4588595) loss:3.156 lr:0.0000100 epoch_Time:25387.0min: [2024-01-05 08:20:14,302][model8_pretrain.py][INFO] Epoch:[0/2](592000/4588595) loss:3.058 lr:0.0000100 epoch_Time:25387.0min: [2024-01-05 08:20:14,302][model8_pretrain.py][INFO] Epoch:[0/2](592000/4588595) loss:2.226 lr:0.0000100 epoch_Time:25387.0min: [2024-01-05 08:20:14,302][model8_pretrain.py][INFO] Epoch:[0/2](592000/4588595) loss:3.131 lr:0.0000100 epoch_Time:25387.0min: [2024-01-05 08:20:14,302][model8_pretrain.py][INFO] Epoch:[0/2](592000/4588595) loss:3.064 lr:0.0000100 epoch_Time:25387.0min: [2024-01-05 08:20:14,302][model8_pretrain.py][INFO] Epoch:[0/2](592000/4588595) loss:3.008 lr:0.0000100 epoch_Time:25387.0min: [2024-01-05 08:20:59,700][model8_pretrain.py][INFO] Epoch:[0/2](592100/4588595) loss:3.074 lr:0.0000100 epoch_Time:25387.0min: [2024-01-05 08:20:59,700][model8_pretrain.py][INFO] Epoch:[0/2](592100/4588595) loss:2.338 lr:0.0000100 epoch_Time:25387.0min: [2024-01-05 08:20:59,700][model8_pretrain.py][INFO] Epoch:[0/2](592100/4588595) loss:3.053 lr:0.0000100 epoch_Time:25387.0min: [2024-01-05 08:20:59,700][model8_pretrain.py][INFO] Epoch:[0/2](592100/4588595) loss:2.199 lr:0.0000100 epoch_Time:25387.0min: [2024-01-05 08:20:59,700][model8_pretrain.py][INFO] Epoch:[0/2](592100/4588595) loss:2.241 lr:0.0000100 epoch_Time:25387.0min: [2024-01-05 08:20:59,700][model8_pretrain.py][INFO] Epoch:[0/2](592100/4588595) loss:3.224 lr:0.0000100 epoch_Time:25387.0min: [2024-01-05 08:20:59,700][model8_pretrain.py][INFO] Epoch:[0/2](592100/4588595) loss:2.449 lr:0.0000100 epoch_Time:25387.0min: [2024-01-05 08:20:59,700][model8_pretrain.py][INFO] Epoch:[0/2](592100/4588595) loss:2.183 lr:0.0000100 epoch_Time:25387.0min: [2024-01-05 08:21:40,137][model8_pretrain.py][INFO] Epoch:[0/2](592200/4588595) loss:2.801 lr:0.0000100 epoch_Time:25387.0min: [2024-01-05 08:21:40,137][model8_pretrain.py][INFO] Epoch:[0/2](592200/4588595) loss:2.633 lr:0.0000100 
epoch_Time:25387.0min: [2024-01-05 08:21:40,137][model8_pretrain.py][INFO] Epoch:[0/2](592200/4588595) loss:2.047 lr:0.0000100 epoch_Time:25387.0min: [2024-01-05 08:21:40,137][model8_pretrain.py][INFO] Epoch:[0/2](592200/4588595) loss:2.978 lr:0.0000100 epoch_Time:25387.0min: [2024-01-05 08:21:40,137][model8_pretrain.py][INFO] Epoch:[0/2](592200/4588595) loss:2.669 lr:0.0000100 epoch_Time:25387.0min: [2024-01-05 08:21:40,137][model8_pretrain.py][INFO] Epoch:[0/2](592200/4588595) loss:2.314 lr:0.0000100 epoch_Time:25387.0min: [2024-01-05 08:21:40,137][model8_pretrain.py][INFO] Epoch:[0/2](592200/4588595) loss:3.439 lr:0.0000100 epoch_Time:25387.0min: [2024-01-05 08:21:40,137][model8_pretrain.py][INFO] Epoch:[0/2](592200/4588595) loss:2.713 lr:0.0000100 epoch_Time:25387.0min: [2024-01-05 08:22:17,087][model8_pretrain.py][INFO] Epoch:[0/2](592300/4588595) loss:2.965 lr:0.0000100 epoch_Time:25386.0min: [2024-01-05 08:22:17,087][model8_pretrain.py][INFO] Epoch:[0/2](592300/4588595) loss:2.876 lr:0.0000100 epoch_Time:25386.0min: [2024-01-05 08:22:17,087][model8_pretrain.py][INFO] Epoch:[0/2](592300/4588595) loss:2.822 lr:0.0000100 epoch_Time:25386.0min: [2024-01-05 08:22:17,087][model8_pretrain.py][INFO] Epoch:[0/2](592300/4588595) loss:2.995 lr:0.0000100 epoch_Time:25386.0min: [2024-01-05 08:22:17,087][model8_pretrain.py][INFO] Epoch:[0/2](592300/4588595) loss:2.465 lr:0.0000100 epoch_Time:25386.0min: [2024-01-05 08:22:17,087][model8_pretrain.py][INFO] Epoch:[0/2](592300/4588595) loss:3.017 lr:0.0000100 epoch_Time:25386.0min: [2024-01-05 08:22:17,087][model8_pretrain.py][INFO] Epoch:[0/2](592300/4588595) loss:3.130 lr:0.0000100 epoch_Time:25386.0min: [2024-01-05 08:22:17,087][model8_pretrain.py][INFO] Epoch:[0/2](592300/4588595) loss:2.780 lr:0.0000100 epoch_Time:25386.0min: [2024-01-05 08:22:54,030][model8_pretrain.py][INFO] Epoch:[0/2](592400/4588595) loss:2.907 lr:0.0000100 epoch_Time:25385.0min: [2024-01-05 08:22:54,030][model8_pretrain.py][INFO] 
Epoch:[0/2](592400/4588595) loss:2.574 lr:0.0000100 epoch_Time:25385.0min: [2024-01-05 08:22:54,030][model8_pretrain.py][INFO] Epoch:[0/2](592400/4588595) loss:2.866 lr:0.0000100 epoch_Time:25385.0min: [2024-01-05 08:22:54,030][model8_pretrain.py][INFO] Epoch:[0/2](592400/4588595) loss:2.871 lr:0.0000100 epoch_Time:25385.0min: [2024-01-05 08:22:54,030][model8_pretrain.py][INFO] Epoch:[0/2](592400/4588595) loss:2.673 lr:0.0000100 epoch_Time:25385.0min: [2024-01-05 08:22:54,030][model8_pretrain.py][INFO] Epoch:[0/2](592400/4588595) loss:2.275 lr:0.0000100 epoch_Time:25385.0min: [2024-01-05 08:22:54,030][model8_pretrain.py][INFO] Epoch:[0/2](592400/4588595) loss:2.736 lr:0.0000100 epoch_Time:25385.0min: [2024-01-05 08:22:54,030][model8_pretrain.py][INFO] Epoch:[0/2](592400/4588595) loss:2.661 lr:0.0000100 epoch_Time:25385.0min: [2024-01-05 08:23:30,985][model8_pretrain.py][INFO] Epoch:[0/2](592500/4588595) loss:2.647 lr:0.0000100 epoch_Time:25384.0min: [2024-01-05 08:23:30,985][model8_pretrain.py][INFO] Epoch:[0/2](592500/4588595) loss:2.872 lr:0.0000100 epoch_Time:25384.0min: [2024-01-05 08:23:30,985][model8_pretrain.py][INFO] Epoch:[0/2](592500/4588595) loss:3.273 lr:0.0000100 epoch_Time:25384.0min: [2024-01-05 08:23:30,985][model8_pretrain.py][INFO] Epoch:[0/2](592500/4588595) loss:3.044 lr:0.0000100 epoch_Time:25384.0min: [2024-01-05 08:23:30,985][model8_pretrain.py][INFO] Epoch:[0/2](592500/4588595) loss:2.626 lr:0.0000100 epoch_Time:25384.0min: [2024-01-05 08:23:30,985][model8_pretrain.py][INFO] Epoch:[0/2](592500/4588595) loss:2.711 lr:0.0000100 epoch_Time:25384.0min: [2024-01-05 08:23:30,985][model8_pretrain.py][INFO] Epoch:[0/2](592500/4588595) loss:2.804 lr:0.0000100 epoch_Time:25384.0min: [2024-01-05 08:23:30,985][model8_pretrain.py][INFO] Epoch:[0/2](592500/4588595) loss:2.152 lr:0.0000100 epoch_Time:25384.0min: [2024-01-05 08:24:07,920][model8_pretrain.py][INFO] Epoch:[0/2](592600/4588595) loss:2.989 lr:0.0000100 epoch_Time:25383.0min: [2024-01-05 
08:24:07,920][model8_pretrain.py][INFO] Epoch:[0/2](592600/4588595) loss:2.998 lr:0.0000100 epoch_Time:25383.0min: [2024-01-05 08:24:07,920][model8_pretrain.py][INFO] Epoch:[0/2](592600/4588595) loss:2.953 lr:0.0000100 epoch_Time:25383.0min: [2024-01-05 08:24:07,920][model8_pretrain.py][INFO] Epoch:[0/2](592600/4588595) loss:2.683 lr:0.0000100 epoch_Time:25383.0min: [2024-01-05 08:24:07,920][model8_pretrain.py][INFO] Epoch:[0/2](592600/4588595) loss:2.432 lr:0.0000100 epoch_Time:25383.0min: [2024-01-05 08:24:07,920][model8_pretrain.py][INFO] Epoch:[0/2](592600/4588595) loss:2.856 lr:0.0000100 epoch_Time:25383.0min: [2024-01-05 08:24:07,920][model8_pretrain.py][INFO] Epoch:[0/2](592600/4588595) loss:3.158 lr:0.0000100 epoch_Time:25383.0min: [2024-01-05 08:24:07,920][model8_pretrain.py][INFO] Epoch:[0/2](592600/4588595) loss:2.869 lr:0.0000100 epoch_Time:25383.0min: [2024-01-05 08:24:44,850][model8_pretrain.py][INFO] Epoch:[0/2](592700/4588595) loss:2.305 lr:0.0000100 epoch_Time:25383.0min: [2024-01-05 08:24:44,850][model8_pretrain.py][INFO] Epoch:[0/2](592700/4588595) loss:2.804 lr:0.0000100 epoch_Time:25383.0min: [2024-01-05 08:24:44,850][model8_pretrain.py][INFO] Epoch:[0/2](592700/4588595) loss:3.363 lr:0.0000100 epoch_Time:25383.0min: [2024-01-05 08:24:44,850][model8_pretrain.py][INFO] Epoch:[0/2](592700/4588595) loss:2.676 lr:0.0000100 epoch_Time:25383.0min: [2024-01-05 08:24:44,850][model8_pretrain.py][INFO] Epoch:[0/2](592700/4588595) loss:2.895 lr:0.0000100 epoch_Time:25383.0min: [2024-01-05 08:24:44,850][model8_pretrain.py][INFO] Epoch:[0/2](592700/4588595) loss:2.677 lr:0.0000100 epoch_Time:25383.0min: [2024-01-05 08:24:44,850][model8_pretrain.py][INFO] Epoch:[0/2](592700/4588595) loss:3.240 lr:0.0000100 epoch_Time:25383.0min: [2024-01-05 08:24:44,850][model8_pretrain.py][INFO] Epoch:[0/2](592700/4588595) loss:2.483 lr:0.0000100 epoch_Time:25383.0min: [2024-01-05 08:25:21,780][model8_pretrain.py][INFO] Epoch:[0/2](592800/4588595) loss:2.864 lr:0.0000100 
epoch_Time:25382.0min: [2024-01-05 08:25:21,780][model8_pretrain.py][INFO] Epoch:[0/2](592800/4588595) loss:3.114 lr:0.0000100 epoch_Time:25382.0min: [2024-01-05 08:25:21,780][model8_pretrain.py][INFO] Epoch:[0/2](592800/4588595) loss:2.833 lr:0.0000100 epoch_Time:25382.0min: [2024-01-05 08:25:21,780][model8_pretrain.py][INFO] Epoch:[0/2](592800/4588595) loss:2.989 lr:0.0000100 epoch_Time:25382.0min: [2024-01-05 08:25:21,780][model8_pretrain.py][INFO] Epoch:[0/2](592800/4588595) loss:2.678 lr:0.0000100 epoch_Time:25382.0min: [2024-01-05 08:25:21,780][model8_pretrain.py][INFO] Epoch:[0/2](592800/4588595) loss:2.980 lr:0.0000100 epoch_Time:25382.0min: [2024-01-05 08:25:21,780][model8_pretrain.py][INFO] Epoch:[0/2](592800/4588595) loss:3.114 lr:0.0000100 epoch_Time:25382.0min: [2024-01-05 08:25:21,780][model8_pretrain.py][INFO] Epoch:[0/2](592800/4588595) loss:2.982 lr:0.0000100 epoch_Time:25382.0min: [2024-01-05 08:26:03,988][model8_pretrain.py][INFO] Epoch:[0/2](592900/4588595) loss:2.434 lr:0.0000100 epoch_Time:25382.0min: [2024-01-05 08:26:03,988][model8_pretrain.py][INFO] Epoch:[0/2](592900/4588595) loss:3.046 lr:0.0000100 epoch_Time:25382.0min: [2024-01-05 08:26:03,992][model8_pretrain.py][INFO] Epoch:[0/2](592900/4588595) loss:2.350 lr:0.0000100 epoch_Time:25382.0min: [2024-01-05 08:26:03,993][model8_pretrain.py][INFO] Epoch:[0/2](592900/4588595) loss:2.780 lr:0.0000100 epoch_Time:25382.0min: [2024-01-05 08:26:03,993][model8_pretrain.py][INFO] Epoch:[0/2](592900/4588595) loss:3.063 lr:0.0000100 epoch_Time:25382.0min: [2024-01-05 08:26:03,993][model8_pretrain.py][INFO] Epoch:[0/2](592900/4588595) loss:3.388 lr:0.0000100 epoch_Time:25382.0min: [2024-01-05 08:26:03,993][model8_pretrain.py][INFO] Epoch:[0/2](592900/4588595) loss:2.772 lr:0.0000100 epoch_Time:25382.0min: [2024-01-05 08:26:05,640][model8_pretrain.py][INFO] Epoch:[0/2](592900/4588595) loss:2.854 lr:0.0000100 epoch_Time:25382.0min: [2024-01-05 08:26:47,787][model8_pretrain.py][INFO] 
Epoch:[0/2](593000/4588595) loss:3.056 lr:0.0000100 epoch_Time:25381.0min: [2024-01-05 08:26:47,787][model8_pretrain.py][INFO] Epoch:[0/2](593000/4588595) loss:2.765 lr:0.0000100 epoch_Time:25381.0min: [2024-01-05 08:26:47,787][model8_pretrain.py][INFO] Epoch:[0/2](593000/4588595) loss:2.846 lr:0.0000100 epoch_Time:25381.0min: [2024-01-05 08:26:47,788][model8_pretrain.py][INFO] Epoch:[0/2](593000/4588595) loss:2.675 lr:0.0000100 epoch_Time:25381.0min: [2024-01-05 08:26:47,788][model8_pretrain.py][INFO] Epoch:[0/2](593000/4588595) loss:2.844 lr:0.0000100 epoch_Time:25381.0min: [2024-01-05 08:26:47,788][model8_pretrain.py][INFO] Epoch:[0/2](593000/4588595) loss:3.056 lr:0.0000100 epoch_Time:25381.0min: [2024-01-05 08:26:47,788][model8_pretrain.py][INFO] Epoch:[0/2](593000/4588595) loss:2.804 lr:0.0000100 epoch_Time:25381.0min: [2024-01-05 08:26:47,788][model8_pretrain.py][INFO] Epoch:[0/2](593000/4588595) loss:2.799 lr:0.0000100 epoch_Time:25381.0min: [2024-01-05 08:27:24,742][model8_pretrain.py][INFO] Epoch:[0/2](593100/4588595) loss:2.913 lr:0.0000100 epoch_Time:25381.0min: [2024-01-05 08:27:24,742][model8_pretrain.py][INFO] Epoch:[0/2](593100/4588595) loss:3.045 lr:0.0000100 epoch_Time:25381.0min: [2024-01-05 08:27:24,742][model8_pretrain.py][INFO] Epoch:[0/2](593100/4588595) loss:2.602 lr:0.0000100 epoch_Time:25381.0min: [2024-01-05 08:27:24,742][model8_pretrain.py][INFO] Epoch:[0/2](593100/4588595) loss:3.232 lr:0.0000100 epoch_Time:25381.0min: [2024-01-05 08:27:24,742][model8_pretrain.py][INFO] Epoch:[0/2](593100/4588595) loss:2.758 lr:0.0000100 epoch_Time:25381.0min: [2024-01-05 08:27:24,742][model8_pretrain.py][INFO] Epoch:[0/2](593100/4588595) loss:3.002 lr:0.0000100 epoch_Time:25381.0min: [2024-01-05 08:27:24,742][model8_pretrain.py][INFO] Epoch:[0/2](593100/4588595) loss:3.056 lr:0.0000100 epoch_Time:25381.0min: [2024-01-05 08:27:24,742][model8_pretrain.py][INFO] Epoch:[0/2](593100/4588595) loss:3.085 lr:0.0000100 epoch_Time:25381.0min: [2024-01-05 
08:28:01,692][model8_pretrain.py][INFO] Epoch:[0/2](593200/4588595) loss:3.132 lr:0.0000100 epoch_Time:25380.0min: [2024-01-05 08:28:01,693][model8_pretrain.py][INFO] Epoch:[0/2](593200/4588595) loss:2.666 lr:0.0000100 epoch_Time:25380.0min: [2024-01-05 08:28:01,692][model8_pretrain.py][INFO] Epoch:[0/2](593200/4588595) loss:3.180 lr:0.0000100 epoch_Time:25380.0min: [2024-01-05 08:28:01,693][model8_pretrain.py][INFO] Epoch:[0/2](593200/4588595) loss:2.418 lr:0.0000100 epoch_Time:25380.0min: [2024-01-05 08:28:01,693][model8_pretrain.py][INFO] Epoch:[0/2](593200/4588595) loss:2.789 lr:0.0000100 epoch_Time:25380.0min: [2024-01-05 08:28:01,693][model8_pretrain.py][INFO] Epoch:[0/2](593200/4588595) loss:3.045 lr:0.0000100 epoch_Time:25380.0min: [2024-01-05 08:28:01,693][model8_pretrain.py][INFO] Epoch:[0/2](593200/4588595) loss:3.132 lr:0.0000100 epoch_Time:25380.0min: [2024-01-05 08:28:01,693][model8_pretrain.py][INFO] Epoch:[0/2](593200/4588595) loss:2.809 lr:0.0000100 epoch_Time:25380.0min: [2024-01-05 08:28:38,650][model8_pretrain.py][INFO] Epoch:[0/2](593300/4588595) loss:2.894 lr:0.0000100 epoch_Time:25380.0min: [2024-01-05 08:28:38,650][model8_pretrain.py][INFO] Epoch:[0/2](593300/4588595) loss:3.331 lr:0.0000100 epoch_Time:25380.0min: [2024-01-05 08:28:38,650][model8_pretrain.py][INFO] Epoch:[0/2](593300/4588595) loss:3.352 lr:0.0000100 epoch_Time:25380.0min: [2024-01-05 08:28:38,650][model8_pretrain.py][INFO] Epoch:[0/2](593300/4588595) loss:2.887 lr:0.0000100 epoch_Time:25380.0min: [2024-01-05 08:28:38,650][model8_pretrain.py][INFO] Epoch:[0/2](593300/4588595) loss:2.843 lr:0.0000100 epoch_Time:25380.0min: [2024-01-05 08:28:38,650][model8_pretrain.py][INFO] Epoch:[0/2](593300/4588595) loss:2.722 lr:0.0000100 epoch_Time:25380.0min: [2024-01-05 08:28:38,651][model8_pretrain.py][INFO] Epoch:[0/2](593300/4588595) loss:3.003 lr:0.0000100 epoch_Time:25380.0min: [2024-01-05 08:28:38,651][model8_pretrain.py][INFO] Epoch:[0/2](593300/4588595) loss:2.451 lr:0.0000100 
epoch_Time:25380.0min: [2024-01-05 08:29:15,600][model8_pretrain.py][INFO] Epoch:[0/2](593400/4588595) loss:3.183 lr:0.0000100 epoch_Time:25379.0min: [2024-01-05 08:29:15,600][model8_pretrain.py][INFO] Epoch:[0/2](593400/4588595) loss:2.790 lr:0.0000100 epoch_Time:25379.0min: [2024-01-05 08:29:15,600][model8_pretrain.py][INFO] Epoch:[0/2](593400/4588595) loss:2.858 lr:0.0000100 epoch_Time:25379.0min: [2024-01-05 08:29:15,600][model8_pretrain.py][INFO] Epoch:[0/2](593400/4588595) loss:2.970 lr:0.0000100 epoch_Time:25379.0min: [2024-01-05 08:29:15,600][model8_pretrain.py][INFO] Epoch:[0/2](593400/4588595) loss:3.089 lr:0.0000100 epoch_Time:25379.0min: [2024-01-05 08:29:15,600][model8_pretrain.py][INFO] Epoch:[0/2](593400/4588595) loss:2.860 lr:0.0000100 epoch_Time:25379.0min: [2024-01-05 08:29:15,600][model8_pretrain.py][INFO] Epoch:[0/2](593400/4588595) loss:2.903 lr:0.0000100 epoch_Time:25379.0min: [2024-01-05 08:29:15,600][model8_pretrain.py][INFO] Epoch:[0/2](593400/4588595) loss:2.532 lr:0.0000100 epoch_Time:25379.0min: [2024-01-05 08:29:52,556][model8_pretrain.py][INFO] Epoch:[0/2](593500/4588595) loss:3.165 lr:0.0000100 epoch_Time:25378.0min: [2024-01-05 08:29:52,556][model8_pretrain.py][INFO] Epoch:[0/2](593500/4588595) loss:2.959 lr:0.0000100 epoch_Time:25378.0min: [2024-01-05 08:29:52,556][model8_pretrain.py][INFO] Epoch:[0/2](593500/4588595) loss:3.241 lr:0.0000100 epoch_Time:25378.0min: [2024-01-05 08:29:52,556][model8_pretrain.py][INFO] Epoch:[0/2](593500/4588595) loss:2.637 lr:0.0000100 epoch_Time:25378.0min: [2024-01-05 08:29:52,556][model8_pretrain.py][INFO] Epoch:[0/2](593500/4588595) loss:2.945 lr:0.0000100 epoch_Time:25378.0min: [2024-01-05 08:29:52,556][model8_pretrain.py][INFO] Epoch:[0/2](593500/4588595) loss:2.943 lr:0.0000100 epoch_Time:25378.0min: [2024-01-05 08:29:52,556][model8_pretrain.py][INFO] Epoch:[0/2](593500/4588595) loss:3.207 lr:0.0000100 epoch_Time:25378.0min: [2024-01-05 08:29:52,557][model8_pretrain.py][INFO] 
Epoch:[0/2](593500/4588595) loss:3.038 lr:0.0000100 epoch_Time:25378.0min: [2024-01-05 08:30:29,531][model8_pretrain.py][INFO] Epoch:[0/2](593600/4588595) loss:3.084 lr:0.0000100 epoch_Time:25377.0min: [2024-01-05 08:30:29,531][model8_pretrain.py][INFO] Epoch:[0/2](593600/4588595) loss:2.972 lr:0.0000100 epoch_Time:25377.0min: [2024-01-05 08:30:29,532][model8_pretrain.py][INFO] Epoch:[0/2](593600/4588595) loss:2.392 lr:0.0000100 epoch_Time:25377.0min: [2024-01-05 08:30:29,532][model8_pretrain.py][INFO] Epoch:[0/2](593600/4588595) loss:2.732 lr:0.0000100 epoch_Time:25377.0min: [2024-01-05 08:30:29,532][model8_pretrain.py][INFO] Epoch:[0/2](593600/4588595) loss:2.908 lr:0.0000100 epoch_Time:25377.0min: [2024-01-05 08:30:29,532][model8_pretrain.py][INFO] Epoch:[0/2](593600/4588595) loss:2.910 lr:0.0000100 epoch_Time:25377.0min: [2024-01-05 08:30:29,532][model8_pretrain.py][INFO] Epoch:[0/2](593600/4588595) loss:3.042 lr:0.0000100 epoch_Time:25377.0min: [2024-01-05 08:30:29,532][model8_pretrain.py][INFO] Epoch:[0/2](593600/4588595) loss:2.840 lr:0.0000100 epoch_Time:25377.0min: [2024-01-05 08:31:08,251][model8_pretrain.py][INFO] Epoch:[0/2](593700/4588595) loss:2.824 lr:0.0000100 epoch_Time:25376.0min: [2024-01-05 08:31:08,251][model8_pretrain.py][INFO] Epoch:[0/2](593700/4588595) loss:2.748 lr:0.0000100 epoch_Time:25376.0min: [2024-01-05 08:31:08,251][model8_pretrain.py][INFO] Epoch:[0/2](593700/4588595) loss:2.949 lr:0.0000100 epoch_Time:25376.0min: [2024-01-05 08:31:08,251][model8_pretrain.py][INFO] Epoch:[0/2](593700/4588595) loss:2.384 lr:0.0000100 epoch_Time:25376.0min: [2024-01-05 08:31:08,251][model8_pretrain.py][INFO] Epoch:[0/2](593700/4588595) loss:2.648 lr:0.0000100 epoch_Time:25376.0min: [2024-01-05 08:31:08,251][model8_pretrain.py][INFO] Epoch:[0/2](593700/4588595) loss:3.001 lr:0.0000100 epoch_Time:25376.0min: [2024-01-05 08:31:08,251][model8_pretrain.py][INFO] Epoch:[0/2](593700/4588595) loss:3.569 lr:0.0000100 epoch_Time:25376.0min: [2024-01-05 
08:31:08,251][model8_pretrain.py][INFO] Epoch:[0/2](593700/4588595) loss:2.710 lr:0.0000100 epoch_Time:25376.0min: [2024-01-05 08:31:55,424][model8_pretrain.py][INFO] Epoch:[0/2](593800/4588595) loss:2.326 lr:0.0000100 epoch_Time:25377.0min: [2024-01-05 08:31:55,424][model8_pretrain.py][INFO] Epoch:[0/2](593800/4588595) loss:3.165 lr:0.0000100 epoch_Time:25377.0min: [2024-01-05 08:31:55,424][model8_pretrain.py][INFO] Epoch:[0/2](593800/4588595) loss:2.225 lr:0.0000100 epoch_Time:25377.0min: [2024-01-05 08:31:55,424][model8_pretrain.py][INFO] Epoch:[0/2](593800/4588595) loss:2.841 lr:0.0000100 epoch_Time:25377.0min: [2024-01-05 08:31:55,424][model8_pretrain.py][INFO] Epoch:[0/2](593800/4588595) loss:3.117 lr:0.0000100 epoch_Time:25377.0min: [2024-01-05 08:31:55,424][model8_pretrain.py][INFO] Epoch:[0/2](593800/4588595) loss:2.511 lr:0.0000100 epoch_Time:25377.0min: [2024-01-05 08:31:55,424][model8_pretrain.py][INFO] Epoch:[0/2](593800/4588595) loss:2.874 lr:0.0000100 epoch_Time:25377.0min: [2024-01-05 08:31:55,424][model8_pretrain.py][INFO] Epoch:[0/2](593800/4588595) loss:3.224 lr:0.0000100 epoch_Time:25377.0min: [2024-01-05 08:32:32,323][model8_pretrain.py][INFO] Epoch:[0/2](593900/4588595) loss:3.064 lr:0.0000100 epoch_Time:25376.0min: [2024-01-05 08:32:32,323][model8_pretrain.py][INFO] Epoch:[0/2](593900/4588595) loss:3.083 lr:0.0000100 epoch_Time:25376.0min: [2024-01-05 08:32:32,323][model8_pretrain.py][INFO] Epoch:[0/2](593900/4588595) loss:2.930 lr:0.0000100 epoch_Time:25376.0min: [2024-01-05 08:32:32,323][model8_pretrain.py][INFO] Epoch:[0/2](593900/4588595) loss:2.674 lr:0.0000100 epoch_Time:25376.0min: [2024-01-05 08:32:32,323][model8_pretrain.py][INFO] Epoch:[0/2](593900/4588595) loss:2.600 lr:0.0000100 epoch_Time:25376.0min: [2024-01-05 08:32:32,323][model8_pretrain.py][INFO] Epoch:[0/2](593900/4588595) loss:2.542 lr:0.0000100 epoch_Time:25376.0min: [2024-01-05 08:32:32,323][model8_pretrain.py][INFO] Epoch:[0/2](593900/4588595) loss:2.318 lr:0.0000100 
epoch_Time:25376.0min: [2024-01-05 08:32:32,323][model8_pretrain.py][INFO] Epoch:[0/2](593900/4588595) loss:3.072 lr:0.0000100 epoch_Time:25376.0min: [2024-01-05 08:33:09,264][model8_pretrain.py][INFO] Epoch:[0/2](594000/4588595) loss:2.714 lr:0.0000100 epoch_Time:25375.0min: [2024-01-05 08:33:09,264][model8_pretrain.py][INFO] Epoch:[0/2](594000/4588595) loss:2.394 lr:0.0000100 epoch_Time:25375.0min: [2024-01-05 08:33:09,264][model8_pretrain.py][INFO] Epoch:[0/2](594000/4588595) loss:3.141 lr:0.0000100 epoch_Time:25375.0min: [2024-01-05 08:33:09,264][model8_pretrain.py][INFO] Epoch:[0/2](594000/4588595) loss:2.892 lr:0.0000100 epoch_Time:25375.0min: [2024-01-05 08:33:09,264][model8_pretrain.py][INFO] Epoch:[0/2](594000/4588595) loss:3.286 lr:0.0000100 epoch_Time:25375.0min: [2024-01-05 08:33:09,264][model8_pretrain.py][INFO] Epoch:[0/2](594000/4588595) loss:2.771 lr:0.0000100 epoch_Time:25375.0min: [2024-01-05 08:33:09,264][model8_pretrain.py][INFO] Epoch:[0/2](594000/4588595) loss:2.869 lr:0.0000100 epoch_Time:25375.0min: [2024-01-05 08:33:09,264][model8_pretrain.py][INFO] Epoch:[0/2](594000/4588595) loss:2.351 lr:0.0000100 epoch_Time:25375.0min: [2024-01-05 08:33:46,213][model8_pretrain.py][INFO] Epoch:[0/2](594100/4588595) loss:2.397 lr:0.0000100 epoch_Time:25375.0min: [2024-01-05 08:33:46,213][model8_pretrain.py][INFO] Epoch:[0/2](594100/4588595) loss:2.715 lr:0.0000100 epoch_Time:25375.0min: [2024-01-05 08:33:46,213][model8_pretrain.py][INFO] Epoch:[0/2](594100/4588595) loss:3.158 lr:0.0000100 epoch_Time:25375.0min: [2024-01-05 08:33:46,213][model8_pretrain.py][INFO] Epoch:[0/2](594100/4588595) loss:2.674 lr:0.0000100 epoch_Time:25375.0min: [2024-01-05 08:33:46,213][model8_pretrain.py][INFO] Epoch:[0/2](594100/4588595) loss:2.881 lr:0.0000100 epoch_Time:25375.0min: [2024-01-05 08:33:46,213][model8_pretrain.py][INFO] Epoch:[0/2](594100/4588595) loss:2.239 lr:0.0000100 epoch_Time:25375.0min: [2024-01-05 08:33:46,213][model8_pretrain.py][INFO] 
Epoch:[0/2](594100/4588595) loss:3.051 lr:0.0000100 epoch_Time:25375.0min: [2024-01-05 08:33:46,214][model8_pretrain.py][INFO] Epoch:[0/2](594100/4588595) loss:2.979 lr:0.0000100 epoch_Time:25375.0min: [2024-01-05 08:34:23,158][model8_pretrain.py][INFO] Epoch:[0/2](594200/4588595) loss:2.462 lr:0.0000100 epoch_Time:25374.0min: [2024-01-05 08:34:23,158][model8_pretrain.py][INFO] Epoch:[0/2](594200/4588595) loss:3.153 lr:0.0000100 epoch_Time:25374.0min: [2024-01-05 08:34:23,158][model8_pretrain.py][INFO] Epoch:[0/2](594200/4588595) loss:2.928 lr:0.0000100 epoch_Time:25374.0min: [2024-01-05 08:34:23,158][model8_pretrain.py][INFO] Epoch:[0/2](594200/4588595) loss:2.981 lr:0.0000100 epoch_Time:25374.0min: [2024-01-05 08:34:23,158][model8_pretrain.py][INFO] Epoch:[0/2](594200/4588595) loss:2.743 lr:0.0000100 epoch_Time:25374.0min: [2024-01-05 08:34:23,158][model8_pretrain.py][INFO] Epoch:[0/2](594200/4588595) loss:2.103 lr:0.0000100 epoch_Time:25374.0min: [2024-01-05 08:34:23,158][model8_pretrain.py][INFO] Epoch:[0/2](594200/4588595) loss:1.294 lr:0.0000100 epoch_Time:25374.0min: [2024-01-05 08:34:23,159][model8_pretrain.py][INFO] Epoch:[0/2](594200/4588595) loss:3.136 lr:0.0000100 epoch_Time:25374.0min: [2024-01-05 08:35:00,095][model8_pretrain.py][INFO] Epoch:[0/2](594300/4588595) loss:2.584 lr:0.0000100 epoch_Time:25373.0min: [2024-01-05 08:35:00,095][model8_pretrain.py][INFO] Epoch:[0/2](594300/4588595) loss:2.633 lr:0.0000100 epoch_Time:25373.0min: [2024-01-05 08:35:00,095][model8_pretrain.py][INFO] Epoch:[0/2](594300/4588595) loss:3.277 lr:0.0000100 epoch_Time:25373.0min: [2024-01-05 08:35:00,095][model8_pretrain.py][INFO] Epoch:[0/2](594300/4588595) loss:2.852 lr:0.0000100 epoch_Time:25373.0min: [2024-01-05 08:35:00,095][model8_pretrain.py][INFO] Epoch:[0/2](594300/4588595) loss:3.081 lr:0.0000100 epoch_Time:25373.0min: [2024-01-05 08:35:00,095][model8_pretrain.py][INFO] Epoch:[0/2](594300/4588595) loss:2.627 lr:0.0000100 epoch_Time:25373.0min: [2024-01-05 
08:35:00,095][model8_pretrain.py][INFO] Epoch:[0/2](594300/4588595) loss:3.134 lr:0.0000100 epoch_Time:25373.0min: [2024-01-05 08:35:00,095][model8_pretrain.py][INFO] Epoch:[0/2](594300/4588595) loss:2.364 lr:0.0000100 epoch_Time:25373.0min: [2024-01-05 08:35:37,038][model8_pretrain.py][INFO] Epoch:[0/2](594400/4588595) loss:2.139 lr:0.0000100 epoch_Time:25373.0min: [2024-01-05 08:35:37,038][model8_pretrain.py][INFO] Epoch:[0/2](594400/4588595) loss:2.885 lr:0.0000100 epoch_Time:25373.0min: [2024-01-05 08:35:37,038][model8_pretrain.py][INFO] Epoch:[0/2](594400/4588595) loss:2.333 lr:0.0000100 epoch_Time:25373.0min: [2024-01-05 08:35:37,038][model8_pretrain.py][INFO] Epoch:[0/2](594400/4588595) loss:3.051 lr:0.0000100 epoch_Time:25373.0min: [2024-01-05 08:35:37,038][model8_pretrain.py][INFO] Epoch:[0/2](594400/4588595) loss:3.336 lr:0.0000100 epoch_Time:25373.0min: [2024-01-05 08:35:37,038][model8_pretrain.py][INFO] Epoch:[0/2](594400/4588595) loss:2.923 lr:0.0000100 epoch_Time:25373.0min: [2024-01-05 08:35:37,038][model8_pretrain.py][INFO] Epoch:[0/2](594400/4588595) loss:3.052 lr:0.0000100 epoch_Time:25373.0min: [2024-01-05 08:35:37,038][model8_pretrain.py][INFO] Epoch:[0/2](594400/4588595) loss:1.971 lr:0.0000100 epoch_Time:25373.0min: [2024-01-05 08:36:15,720][model8_pretrain.py][INFO] Epoch:[0/2](594500/4588595) loss:3.327 lr:0.0000100 epoch_Time:25372.0min: [2024-01-05 08:36:15,720][model8_pretrain.py][INFO] Epoch:[0/2](594500/4588595) loss:2.945 lr:0.0000100 epoch_Time:25372.0min: [2024-01-05 08:36:15,720][model8_pretrain.py][INFO] Epoch:[0/2](594500/4588595) loss:2.884 lr:0.0000100 epoch_Time:25372.0min: [2024-01-05 08:36:15,720][model8_pretrain.py][INFO] Epoch:[0/2](594500/4588595) loss:2.707 lr:0.0000100 epoch_Time:25372.0min: [2024-01-05 08:36:15,720][model8_pretrain.py][INFO] Epoch:[0/2](594500/4588595) loss:2.504 lr:0.0000100 epoch_Time:25372.0min: [2024-01-05 08:36:15,720][model8_pretrain.py][INFO] Epoch:[0/2](594500/4588595) loss:2.566 lr:0.0000100 
epoch_Time:25372.0min: [2024-01-05 08:36:15,720][model8_pretrain.py][INFO] Epoch:[0/2](594500/4588595) loss:2.860 lr:0.0000100 epoch_Time:25372.0min: [2024-01-05 08:36:15,720][model8_pretrain.py][INFO] Epoch:[0/2](594500/4588595) loss:3.007 lr:0.0000100 epoch_Time:25372.0min: [2024-01-05 08:37:03,031][model8_pretrain.py][INFO] Epoch:[0/2](594600/4588595) loss:3.113 lr:0.0000100 epoch_Time:25372.0min: [2024-01-05 08:37:03,031][model8_pretrain.py][INFO] Epoch:[0/2](594600/4588595) loss:3.153 lr:0.0000100 epoch_Time:25372.0min: [2024-01-05 08:37:03,031][model8_pretrain.py][INFO] Epoch:[0/2](594600/4588595) loss:2.950 lr:0.0000100 epoch_Time:25372.0min: [2024-01-05 08:37:03,031][model8_pretrain.py][INFO] Epoch:[0/2](594600/4588595) loss:3.230 lr:0.0000100 epoch_Time:25372.0min: [2024-01-05 08:37:03,031][model8_pretrain.py][INFO] Epoch:[0/2](594600/4588595) loss:2.656 lr:0.0000100 epoch_Time:25372.0min: [2024-01-05 08:37:03,031][model8_pretrain.py][INFO] Epoch:[0/2](594600/4588595) loss:3.005 lr:0.0000100 epoch_Time:25372.0min: [2024-01-05 08:37:03,031][model8_pretrain.py][INFO] Epoch:[0/2](594600/4588595) loss:2.529 lr:0.0000100 epoch_Time:25372.0min: [2024-01-05 08:37:03,031][model8_pretrain.py][INFO] Epoch:[0/2](594600/4588595) loss:3.397 lr:0.0000100 epoch_Time:25372.0min: [2024-01-05 08:37:39,963][model8_pretrain.py][INFO] Epoch:[0/2](594700/4588595) loss:3.207 lr:0.0000100 epoch_Time:25372.0min: [2024-01-05 08:37:39,963][model8_pretrain.py][INFO] Epoch:[0/2](594700/4588595) loss:3.003 lr:0.0000100 epoch_Time:25372.0min: [2024-01-05 08:37:39,963][model8_pretrain.py][INFO] Epoch:[0/2](594700/4588595) loss:2.881 lr:0.0000100 epoch_Time:25372.0min: [2024-01-05 08:37:39,963][model8_pretrain.py][INFO] Epoch:[0/2](594700/4588595) loss:2.973 lr:0.0000100 epoch_Time:25372.0min: [2024-01-05 08:37:39,963][model8_pretrain.py][INFO] Epoch:[0/2](594700/4588595) loss:3.095 lr:0.0000100 epoch_Time:25372.0min: [2024-01-05 08:37:39,963][model8_pretrain.py][INFO] 
Epoch:[0/2](594700/4588595) loss:2.709 lr:0.0000100 epoch_Time:25372.0min: [2024-01-05 08:37:39,963][model8_pretrain.py][INFO] Epoch:[0/2](594700/4588595) loss:3.094 lr:0.0000100 epoch_Time:25372.0min: [2024-01-05 08:37:39,963][model8_pretrain.py][INFO] Epoch:[0/2](594700/4588595) loss:2.507 lr:0.0000100 epoch_Time:25372.0min: [2024-01-05 08:38:16,901][model8_pretrain.py][INFO] Epoch:[0/2](594800/4588595) loss:3.385 lr:0.0000100 epoch_Time:25371.0min: [2024-01-05 08:38:16,901][model8_pretrain.py][INFO] Epoch:[0/2](594800/4588595) loss:3.032 lr:0.0000100 epoch_Time:25371.0min: [2024-01-05 08:38:16,901][model8_pretrain.py][INFO] Epoch:[0/2](594800/4588595) loss:2.676 lr:0.0000100 epoch_Time:25371.0min: [2024-01-05 08:38:16,901][model8_pretrain.py][INFO] Epoch:[0/2](594800/4588595) loss:2.486 lr:0.0000100 epoch_Time:25371.0min: [2024-01-05 08:38:16,901][model8_pretrain.py][INFO] Epoch:[0/2](594800/4588595) loss:3.286 lr:0.0000100 epoch_Time:25371.0min: [2024-01-05 08:38:16,901][model8_pretrain.py][INFO] Epoch:[0/2](594800/4588595) loss:3.138 lr:0.0000100 epoch_Time:25371.0min: [2024-01-05 08:38:16,901][model8_pretrain.py][INFO] Epoch:[0/2](594800/4588595) loss:3.320 lr:0.0000100 epoch_Time:25371.0min: [2024-01-05 08:38:16,902][model8_pretrain.py][INFO] Epoch:[0/2](594800/4588595) loss:2.944 lr:0.0000100 epoch_Time:25371.0min: [2024-01-05 08:38:53,844][model8_pretrain.py][INFO] Epoch:[0/2](594900/4588595) loss:2.863 lr:0.0000100 epoch_Time:25370.0min: [2024-01-05 08:38:53,844][model8_pretrain.py][INFO] Epoch:[0/2](594900/4588595) loss:2.794 lr:0.0000100 epoch_Time:25370.0min: [2024-01-05 08:38:53,844][model8_pretrain.py][INFO] Epoch:[0/2](594900/4588595) loss:3.390 lr:0.0000100 epoch_Time:25370.0min: [2024-01-05 08:38:53,844][model8_pretrain.py][INFO] Epoch:[0/2](594900/4588595) loss:2.627 lr:0.0000100 epoch_Time:25370.0min: [2024-01-05 08:38:53,844][model8_pretrain.py][INFO] Epoch:[0/2](594900/4588595) loss:2.057 lr:0.0000100 epoch_Time:25370.0min: [2024-01-05 
08:38:53,844][model8_pretrain.py][INFO] Epoch:[0/2](594900/4588595) loss:2.543 lr:0.0000100 epoch_Time:25370.0min: [2024-01-05 08:38:53,844][model8_pretrain.py][INFO] Epoch:[0/2](594900/4588595) loss:2.210 lr:0.0000100 epoch_Time:25370.0min: [2024-01-05 08:38:53,844][model8_pretrain.py][INFO] Epoch:[0/2](594900/4588595) loss:3.159 lr:0.0000100 epoch_Time:25370.0min: [2024-01-05 08:39:30,779][model8_pretrain.py][INFO] Epoch:[0/2](595000/4588595) loss:2.551 lr:0.0000100 epoch_Time:25369.0min: [2024-01-05 08:39:30,779][model8_pretrain.py][INFO] Epoch:[0/2](595000/4588595) loss:2.953 lr:0.0000100 epoch_Time:25369.0min: [2024-01-05 08:39:30,779][model8_pretrain.py][INFO] Epoch:[0/2](595000/4588595) loss:3.371 lr:0.0000100 epoch_Time:25369.0min: [2024-01-05 08:39:30,779][model8_pretrain.py][INFO] Epoch:[0/2](595000/4588595) loss:3.165 lr:0.0000100 epoch_Time:25369.0min: [2024-01-05 08:39:30,779][model8_pretrain.py][INFO] Epoch:[0/2](595000/4588595) loss:2.960 lr:0.0000100 epoch_Time:25369.0min: [2024-01-05 08:39:30,779][model8_pretrain.py][INFO] Epoch:[0/2](595000/4588595) loss:3.121 lr:0.0000100 epoch_Time:25369.0min: [2024-01-05 08:39:30,780][model8_pretrain.py][INFO] Epoch:[0/2](595000/4588595) loss:3.119 lr:0.0000100 epoch_Time:25369.0min: [2024-01-05 08:39:30,780][model8_pretrain.py][INFO] Epoch:[0/2](595000/4588595) loss:3.556 lr:0.0000100 epoch_Time:25369.0min: [2024-01-05 08:40:07,716][model8_pretrain.py][INFO] Epoch:[0/2](595100/4588595) loss:2.334 lr:0.0000100 epoch_Time:25368.0min: [2024-01-05 08:40:07,716][model8_pretrain.py][INFO] Epoch:[0/2](595100/4588595) loss:3.230 lr:0.0000100 epoch_Time:25368.0min: [2024-01-05 08:40:07,716][model8_pretrain.py][INFO] Epoch:[0/2](595100/4588595) loss:3.283 lr:0.0000100 epoch_Time:25368.0min: [2024-01-05 08:40:07,716][model8_pretrain.py][INFO] Epoch:[0/2](595100/4588595) loss:3.120 lr:0.0000100 epoch_Time:25368.0min: [2024-01-05 08:40:07,716][model8_pretrain.py][INFO] Epoch:[0/2](595100/4588595) loss:2.654 lr:0.0000100 
epoch_Time:25368.0min: [2024-01-05 08:40:07,716][model8_pretrain.py][INFO] Epoch:[0/2](595100/4588595) loss:2.875 lr:0.0000100 epoch_Time:25368.0min: [2024-01-05 08:40:07,716][model8_pretrain.py][INFO] Epoch:[0/2](595100/4588595) loss:2.516 lr:0.0000100 epoch_Time:25368.0min: [2024-01-05 08:40:07,716][model8_pretrain.py][INFO] Epoch:[0/2](595100/4588595) loss:3.015 lr:0.0000100 epoch_Time:25368.0min: [2024-01-05 08:40:44,649][model8_pretrain.py][INFO] Epoch:[0/2](595200/4588595) loss:3.067 lr:0.0000100 epoch_Time:25368.0min: [2024-01-05 08:40:44,649][model8_pretrain.py][INFO] Epoch:[0/2](595200/4588595) loss:3.139 lr:0.0000100 epoch_Time:25368.0min: [2024-01-05 08:40:44,649][model8_pretrain.py][INFO] Epoch:[0/2](595200/4588595) loss:3.046 lr:0.0000100 epoch_Time:25368.0min: [2024-01-05 08:40:44,649][model8_pretrain.py][INFO] Epoch:[0/2](595200/4588595) loss:2.629 lr:0.0000100 epoch_Time:25368.0min: [2024-01-05 08:40:44,649][model8_pretrain.py][INFO] Epoch:[0/2](595200/4588595) loss:3.023 lr:0.0000100 epoch_Time:25368.0min: [2024-01-05 08:40:44,649][model8_pretrain.py][INFO] Epoch:[0/2](595200/4588595) loss:3.368 lr:0.0000100 epoch_Time:25368.0min: [2024-01-05 08:40:44,649][model8_pretrain.py][INFO] Epoch:[0/2](595200/4588595) loss:2.548 lr:0.0000100 epoch_Time:25368.0min: [2024-01-05 08:40:44,649][model8_pretrain.py][INFO] Epoch:[0/2](595200/4588595) loss:2.476 lr:0.0000100 epoch_Time:25368.0min: [2024-01-05 08:41:23,309][model8_pretrain.py][INFO] Epoch:[0/2](595300/4588595) loss:3.176 lr:0.0000100 epoch_Time:25367.0min: [2024-01-05 08:41:23,309][model8_pretrain.py][INFO] Epoch:[0/2](595300/4588595) loss:3.043 lr:0.0000100 epoch_Time:25367.0min: [2024-01-05 08:41:23,309][model8_pretrain.py][INFO] Epoch:[0/2](595300/4588595) loss:3.287 lr:0.0000100 epoch_Time:25367.0min: [2024-01-05 08:41:23,309][model8_pretrain.py][INFO] Epoch:[0/2](595300/4588595) loss:3.082 lr:0.0000100 epoch_Time:25367.0min: [2024-01-05 08:41:23,309][model8_pretrain.py][INFO] 
Epoch:[0/2](595300/4588595) loss:2.679 lr:0.0000100 epoch_Time:25367.0min: [2024-01-05 08:41:23,309][model8_pretrain.py][INFO] Epoch:[0/2](595300/4588595) loss:3.035 lr:0.0000100 epoch_Time:25367.0min: [2024-01-05 08:41:23,309][model8_pretrain.py][INFO] Epoch:[0/2](595300/4588595) loss:2.687 lr:0.0000100 epoch_Time:25367.0min: [2024-01-05 08:41:23,309][model8_pretrain.py][INFO] Epoch:[0/2](595300/4588595) loss:3.093 lr:0.0000100 epoch_Time:25367.0min: [2024-01-05 08:42:10,437][model8_pretrain.py][INFO] Epoch:[0/2](595400/4588595) loss:2.918 lr:0.0000100 epoch_Time:25367.0min: [2024-01-05 08:42:10,437][model8_pretrain.py][INFO] Epoch:[0/2](595400/4588595) loss:3.236 lr:0.0000100 epoch_Time:25367.0min: [2024-01-05 08:42:10,437][model8_pretrain.py][INFO] Epoch:[0/2](595400/4588595) loss:2.721 lr:0.0000100 epoch_Time:25367.0min: [2024-01-05 08:42:10,437][model8_pretrain.py][INFO] Epoch:[0/2](595400/4588595) loss:2.698 lr:0.0000100 epoch_Time:25367.0min: [2024-01-05 08:42:10,437][model8_pretrain.py][INFO] Epoch:[0/2](595400/4588595) loss:2.421 lr:0.0000100 epoch_Time:25367.0min: [2024-01-05 08:42:10,437][model8_pretrain.py][INFO] Epoch:[0/2](595400/4588595) loss:3.051 lr:0.0000100 epoch_Time:25367.0min: [2024-01-05 08:42:10,437][model8_pretrain.py][INFO] Epoch:[0/2](595400/4588595) loss:2.046 lr:0.0000100 epoch_Time:25367.0min: [2024-01-05 08:42:10,437][model8_pretrain.py][INFO] Epoch:[0/2](595400/4588595) loss:2.370 lr:0.0000100 epoch_Time:25367.0min: [2024-01-05 08:42:47,364][model8_pretrain.py][INFO] Epoch:[0/2](595500/4588595) loss:3.018 lr:0.0000100 epoch_Time:25367.0min: [2024-01-05 08:42:47,364][model8_pretrain.py][INFO] Epoch:[0/2](595500/4588595) loss:3.068 lr:0.0000100 epoch_Time:25367.0min: [2024-01-05 08:42:47,364][model8_pretrain.py][INFO] Epoch:[0/2](595500/4588595) loss:3.527 lr:0.0000100 epoch_Time:25367.0min: [2024-01-05 08:42:47,364][model8_pretrain.py][INFO] Epoch:[0/2](595500/4588595) loss:2.832 lr:0.0000100 epoch_Time:25367.0min: [2024-01-05 
08:42:47,365][model8_pretrain.py][INFO] Epoch:[0/2](595500/4588595) loss:2.925 lr:0.0000100 epoch_Time:25367.0min: [2024-01-05 08:42:47,365][model8_pretrain.py][INFO] Epoch:[0/2](595500/4588595) loss:3.006 lr:0.0000100 epoch_Time:25367.0min: [2024-01-05 08:42:47,365][model8_pretrain.py][INFO] Epoch:[0/2](595500/4588595) loss:3.131 lr:0.0000100 epoch_Time:25367.0min: [2024-01-05 08:42:47,365][model8_pretrain.py][INFO] Epoch:[0/2](595500/4588595) loss:2.839 lr:0.0000100 epoch_Time:25367.0min: [2024-01-05 08:43:24,306][model8_pretrain.py][INFO] Epoch:[0/2](595600/4588595) loss:3.268 lr:0.0000100 epoch_Time:25366.0min: [2024-01-05 08:43:24,307][model8_pretrain.py][INFO] Epoch:[0/2](595600/4588595) loss:2.774 lr:0.0000100 epoch_Time:25366.0min: [2024-01-05 08:43:24,307][model8_pretrain.py][INFO] Epoch:[0/2](595600/4588595) loss:2.314 lr:0.0000100 epoch_Time:25366.0min: [2024-01-05 08:43:24,306][model8_pretrain.py][INFO] Epoch:[0/2](595600/4588595) loss:2.544 lr:0.0000100 epoch_Time:25366.0min: [2024-01-05 08:43:24,307][model8_pretrain.py][INFO] Epoch:[0/2](595600/4588595) loss:2.801 lr:0.0000100 epoch_Time:25366.0min: [2024-01-05 08:43:24,307][model8_pretrain.py][INFO] Epoch:[0/2](595600/4588595) loss:2.599 lr:0.0000100 epoch_Time:25366.0min: [2024-01-05 08:43:24,307][model8_pretrain.py][INFO] Epoch:[0/2](595600/4588595) loss:3.267 lr:0.0000100 epoch_Time:25366.0min: [2024-01-05 08:43:24,307][model8_pretrain.py][INFO] Epoch:[0/2](595600/4588595) loss:2.568 lr:0.0000100 epoch_Time:25366.0min: [2024-01-05 08:44:01,243][model8_pretrain.py][INFO] Epoch:[0/2](595700/4588595) loss:3.025 lr:0.0000100 epoch_Time:25365.0min: [2024-01-05 08:44:01,243][model8_pretrain.py][INFO] Epoch:[0/2](595700/4588595) loss:2.692 lr:0.0000100 epoch_Time:25365.0min: [2024-01-05 08:44:01,243][model8_pretrain.py][INFO] Epoch:[0/2](595700/4588595) loss:2.527 lr:0.0000100 epoch_Time:25365.0min: [2024-01-05 08:44:01,243][model8_pretrain.py][INFO] Epoch:[0/2](595700/4588595) loss:2.880 lr:0.0000100 
epoch_Time:25365.0min: [2024-01-05 08:44:01,243][model8_pretrain.py][INFO] Epoch:[0/2](595700/4588595) loss:2.610 lr:0.0000100 epoch_Time:25365.0min: [2024-01-05 08:44:01,243][model8_pretrain.py][INFO] Epoch:[0/2](595700/4588595) loss:2.895 lr:0.0000100 epoch_Time:25365.0min: [2024-01-05 08:44:01,243][model8_pretrain.py][INFO] Epoch:[0/2](595700/4588595) loss:2.414 lr:0.0000100 epoch_Time:25365.0min: [2024-01-05 08:44:01,243][model8_pretrain.py][INFO] Epoch:[0/2](595700/4588595) loss:2.883 lr:0.0000100 epoch_Time:25365.0min: [2024-01-05 08:44:38,181][model8_pretrain.py][INFO] Epoch:[0/2](595800/4588595) loss:3.668 lr:0.0000100 epoch_Time:25365.0min: [2024-01-05 08:44:38,181][model8_pretrain.py][INFO] Epoch:[0/2](595800/4588595) loss:2.645 lr:0.0000100 epoch_Time:25365.0min: [2024-01-05 08:44:38,182][model8_pretrain.py][INFO] Epoch:[0/2](595800/4588595) loss:2.690 lr:0.0000100 epoch_Time:25365.0min: [2024-01-05 08:44:38,182][model8_pretrain.py][INFO] Epoch:[0/2](595800/4588595) loss:2.664 lr:0.0000100 epoch_Time:25365.0min: [2024-01-05 08:44:38,182][model8_pretrain.py][INFO] Epoch:[0/2](595800/4588595) loss:2.955 lr:0.0000100 epoch_Time:25365.0min: [2024-01-05 08:44:38,182][model8_pretrain.py][INFO] Epoch:[0/2](595800/4588595) loss:2.722 lr:0.0000100 epoch_Time:25365.0min: [2024-01-05 08:44:38,182][model8_pretrain.py][INFO] Epoch:[0/2](595800/4588595) loss:3.217 lr:0.0000100 epoch_Time:25365.0min: [2024-01-05 08:44:38,182][model8_pretrain.py][INFO] Epoch:[0/2](595800/4588595) loss:3.044 lr:0.0000100 epoch_Time:25365.0min: [2024-01-05 08:45:15,121][model8_pretrain.py][INFO] Epoch:[0/2](595900/4588595) loss:2.669 lr:0.0000100 epoch_Time:25364.0min: [2024-01-05 08:45:15,121][model8_pretrain.py][INFO] Epoch:[0/2](595900/4588595) loss:2.936 lr:0.0000100 epoch_Time:25364.0min: [2024-01-05 08:45:15,121][model8_pretrain.py][INFO] Epoch:[0/2](595900/4588595) loss:3.265 lr:0.0000100 epoch_Time:25364.0min: [2024-01-05 08:45:15,121][model8_pretrain.py][INFO] 
Epoch:[0/2](595900/4588595) loss:2.801 lr:0.0000100 epoch_Time:25364.0min: [2024-01-05 08:45:15,122][model8_pretrain.py][INFO] Epoch:[0/2](595900/4588595) loss:2.989 lr:0.0000100 epoch_Time:25364.0min: [2024-01-05 08:45:15,122][model8_pretrain.py][INFO] Epoch:[0/2](595900/4588595) loss:2.305 lr:0.0000100 epoch_Time:25364.0min: [2024-01-05 08:45:15,122][model8_pretrain.py][INFO] Epoch:[0/2](595900/4588595) loss:3.096 lr:0.0000100 epoch_Time:25364.0min: [2024-01-05 08:45:15,122][model8_pretrain.py][INFO] Epoch:[0/2](595900/4588595) loss:2.867 lr:0.0000100 epoch_Time:25364.0min: [2024-01-05 08:45:52,060][model8_pretrain.py][INFO] Epoch:[0/2](596000/4588595) loss:2.763 lr:0.0000100 epoch_Time:25362.0min: [2024-01-05 08:45:52,060][model8_pretrain.py][INFO] Epoch:[0/2](596000/4588595) loss:2.610 lr:0.0000100 epoch_Time:25362.0min: [2024-01-05 08:45:52,060][model8_pretrain.py][INFO] Epoch:[0/2](596000/4588595) loss:3.070 lr:0.0000100 epoch_Time:25362.0min: [2024-01-05 08:45:52,060][model8_pretrain.py][INFO] Epoch:[0/2](596000/4588595) loss:3.221 lr:0.0000100 epoch_Time:25362.0min: [2024-01-05 08:45:52,060][model8_pretrain.py][INFO] Epoch:[0/2](596000/4588595) loss:3.307 lr:0.0000100 epoch_Time:25362.0min: [2024-01-05 08:45:52,060][model8_pretrain.py][INFO] Epoch:[0/2](596000/4588595) loss:2.752 lr:0.0000100 epoch_Time:25362.0min: [2024-01-05 08:45:52,060][model8_pretrain.py][INFO] Epoch:[0/2](596000/4588595) loss:2.470 lr:0.0000100 epoch_Time:25362.0min: [2024-01-05 08:45:52,060][model8_pretrain.py][INFO] Epoch:[0/2](596000/4588595) loss:2.724 lr:0.0000100 epoch_Time:25362.0min: [2024-01-05 08:46:29,008][model8_pretrain.py][INFO] Epoch:[0/2](596100/4588595) loss:3.004 lr:0.0000100 epoch_Time:25362.0min: [2024-01-05 08:46:29,008][model8_pretrain.py][INFO] Epoch:[0/2](596100/4588595) loss:3.038 lr:0.0000100 epoch_Time:25362.0min: [2024-01-05 08:46:29,008][model8_pretrain.py][INFO] Epoch:[0/2](596100/4588595) loss:2.922 lr:0.0000100 epoch_Time:25362.0min: [2024-01-05 
08:46:29,008][model8_pretrain.py][INFO] Epoch:[0/2](596100/4588595) loss:2.848 lr:0.0000100 epoch_Time:25362.0min: [2024-01-05 08:46:29,008][model8_pretrain.py][INFO] Epoch:[0/2](596100/4588595) loss:3.215 lr:0.0000100 epoch_Time:25362.0min: [2024-01-05 08:46:29,008][model8_pretrain.py][INFO] Epoch:[0/2](596100/4588595) loss:3.061 lr:0.0000100 epoch_Time:25362.0min: [2024-01-05 08:46:29,008][model8_pretrain.py][INFO] Epoch:[0/2](596100/4588595) loss:1.743 lr:0.0000100 epoch_Time:25362.0min: [2024-01-05 08:46:29,008][model8_pretrain.py][INFO] Epoch:[0/2](596100/4588595) loss:3.003 lr:0.0000100 epoch_Time:25362.0min: [2024-01-05 08:47:17,866][model8_pretrain.py][INFO] Epoch:[0/2](596200/4588595) loss:2.485 lr:0.0000100 epoch_Time:25363.0min: [2024-01-05 08:47:17,866][model8_pretrain.py][INFO] Epoch:[0/2](596200/4588595) loss:3.070 lr:0.0000100 epoch_Time:25363.0min: [2024-01-05 08:47:17,866][model8_pretrain.py][INFO] Epoch:[0/2](596200/4588595) loss:3.008 lr:0.0000100 epoch_Time:25363.0min: [2024-01-05 08:47:17,866][model8_pretrain.py][INFO] Epoch:[0/2](596200/4588595) loss:3.073 lr:0.0000100 epoch_Time:25363.0min: [2024-01-05 08:47:17,866][model8_pretrain.py][INFO] Epoch:[0/2](596200/4588595) loss:2.730 lr:0.0000100 epoch_Time:25363.0min: [2024-01-05 08:47:17,866][model8_pretrain.py][INFO] Epoch:[0/2](596200/4588595) loss:2.260 lr:0.0000100 epoch_Time:25363.0min: [2024-01-05 08:47:17,866][model8_pretrain.py][INFO] Epoch:[0/2](596200/4588595) loss:2.611 lr:0.0000100 epoch_Time:25363.0min: [2024-01-05 08:47:17,866][model8_pretrain.py][INFO] Epoch:[0/2](596200/4588595) loss:2.858 lr:0.0000100 epoch_Time:25363.0min: [2024-01-05 08:47:54,795][model8_pretrain.py][INFO] Epoch:[0/2](596300/4588595) loss:2.922 lr:0.0000100 epoch_Time:25361.0min: [2024-01-05 08:47:54,796][model8_pretrain.py][INFO] Epoch:[0/2](596300/4588595) loss:2.672 lr:0.0000100 epoch_Time:25361.0min: [2024-01-05 08:47:54,796][model8_pretrain.py][INFO] Epoch:[0/2](596300/4588595) loss:3.144 lr:0.0000100 
epoch_Time:25361.0min: [2024-01-05 08:47:54,796][model8_pretrain.py][INFO] Epoch:[0/2](596300/4588595) loss:2.655 lr:0.0000100 epoch_Time:25361.0min: [2024-01-05 08:47:54,796][model8_pretrain.py][INFO] Epoch:[0/2](596300/4588595) loss:3.084 lr:0.0000100 epoch_Time:25361.0min: [2024-01-05 08:47:54,796][model8_pretrain.py][INFO] Epoch:[0/2](596300/4588595) loss:2.982 lr:0.0000100 epoch_Time:25361.0min: [2024-01-05 08:47:54,796][model8_pretrain.py][INFO] Epoch:[0/2](596300/4588595) loss:2.592 lr:0.0000100 epoch_Time:25361.0min: [2024-01-05 08:47:54,796][model8_pretrain.py][INFO] Epoch:[0/2](596300/4588595) loss:3.013 lr:0.0000100 epoch_Time:25361.0min: [2024-01-05 08:48:31,734][model8_pretrain.py][INFO] Epoch:[0/2](596400/4588595) loss:3.285 lr:0.0000100 epoch_Time:25361.0min: [2024-01-05 08:48:31,734][model8_pretrain.py][INFO] Epoch:[0/2](596400/4588595) loss:3.220 lr:0.0000100 epoch_Time:25361.0min: [2024-01-05 08:48:31,734][model8_pretrain.py][INFO] Epoch:[0/2](596400/4588595) loss:2.941 lr:0.0000100 epoch_Time:25361.0min: [2024-01-05 08:48:31,734][model8_pretrain.py][INFO] Epoch:[0/2](596400/4588595) loss:2.805 lr:0.0000100 epoch_Time:25361.0min: [2024-01-05 08:48:31,734][model8_pretrain.py][INFO] Epoch:[0/2](596400/4588595) loss:3.098 lr:0.0000100 epoch_Time:25361.0min: [2024-01-05 08:48:31,734][model8_pretrain.py][INFO] Epoch:[0/2](596400/4588595) loss:2.577 lr:0.0000100 epoch_Time:25361.0min: [2024-01-05 08:48:31,734][model8_pretrain.py][INFO] Epoch:[0/2](596400/4588595) loss:2.639 lr:0.0000100 epoch_Time:25361.0min: [2024-01-05 08:48:31,734][model8_pretrain.py][INFO] Epoch:[0/2](596400/4588595) loss:2.947 lr:0.0000100 epoch_Time:25361.0min: [2024-01-05 08:49:08,664][model8_pretrain.py][INFO] Epoch:[0/2](596500/4588595) loss:2.727 lr:0.0000100 epoch_Time:25360.0min: [2024-01-05 08:49:08,664][model8_pretrain.py][INFO] Epoch:[0/2](596500/4588595) loss:2.804 lr:0.0000100 epoch_Time:25360.0min: [2024-01-05 08:49:08,665][model8_pretrain.py][INFO] 
Epoch:[0/2](596500/4588595) loss:2.632 lr:0.0000100 epoch_Time:25360.0min: [2024-01-05 08:49:08,665][model8_pretrain.py][INFO] Epoch:[0/2](596500/4588595) loss:2.917 lr:0.0000100 epoch_Time:25360.0min: [2024-01-05 08:49:08,665][model8_pretrain.py][INFO] Epoch:[0/2](596500/4588595) loss:2.771 lr:0.0000100 epoch_Time:25360.0min: [2024-01-05 08:49:08,665][model8_pretrain.py][INFO] Epoch:[0/2](596500/4588595) loss:3.097 lr:0.0000100 epoch_Time:25360.0min: [2024-01-05 08:49:08,665][model8_pretrain.py][INFO] Epoch:[0/2](596500/4588595) loss:3.070 lr:0.0000100 epoch_Time:25360.0min: [2024-01-05 08:49:08,665][model8_pretrain.py][INFO] Epoch:[0/2](596500/4588595) loss:3.119 lr:0.0000100 epoch_Time:25360.0min: [2024-01-05 08:49:45,604][model8_pretrain.py][INFO] Epoch:[0/2](596600/4588595) loss:2.882 lr:0.0000100 epoch_Time:25360.0min: [2024-01-05 08:49:45,604][model8_pretrain.py][INFO] Epoch:[0/2](596600/4588595) loss:2.994 lr:0.0000100 epoch_Time:25360.0min: [2024-01-05 08:49:45,604][model8_pretrain.py][INFO] Epoch:[0/2](596600/4588595) loss:2.524 lr:0.0000100 epoch_Time:25360.0min: [2024-01-05 08:49:45,604][model8_pretrain.py][INFO] Epoch:[0/2](596600/4588595) loss:3.016 lr:0.0000100 epoch_Time:25360.0min: [2024-01-05 08:49:45,604][model8_pretrain.py][INFO] Epoch:[0/2](596600/4588595) loss:2.885 lr:0.0000100 epoch_Time:25360.0min: [2024-01-05 08:49:45,604][model8_pretrain.py][INFO] Epoch:[0/2](596600/4588595) loss:3.162 lr:0.0000100 epoch_Time:25360.0min: [2024-01-05 08:49:45,604][model8_pretrain.py][INFO] Epoch:[0/2](596600/4588595) loss:3.220 lr:0.0000100 epoch_Time:25360.0min: [2024-01-05 08:49:45,604][model8_pretrain.py][INFO] Epoch:[0/2](596600/4588595) loss:2.974 lr:0.0000100 epoch_Time:25360.0min: [2024-01-05 08:50:22,542][model8_pretrain.py][INFO] Epoch:[0/2](596700/4588595) loss:2.833 lr:0.0000100 epoch_Time:25359.0min: [2024-01-05 08:50:22,542][model8_pretrain.py][INFO] Epoch:[0/2](596700/4588595) loss:3.038 lr:0.0000100 epoch_Time:25359.0min: [2024-01-05 
08:50:22,543][model8_pretrain.py][INFO] Epoch:[0/2](596700/4588595) loss:3.131 lr:0.0000100 epoch_Time:25359.0min: [2024-01-05 08:50:22,543][model8_pretrain.py][INFO] Epoch:[0/2](596700/4588595) loss:2.870 lr:0.0000100 epoch_Time:25359.0min: [2024-01-05 08:50:22,543][model8_pretrain.py][INFO] Epoch:[0/2](596700/4588595) loss:2.673 lr:0.0000100 epoch_Time:25359.0min: [2024-01-05 08:50:22,543][model8_pretrain.py][INFO] Epoch:[0/2](596700/4588595) loss:3.115 lr:0.0000100 epoch_Time:25359.0min: [2024-01-05 08:50:22,543][model8_pretrain.py][INFO] Epoch:[0/2](596700/4588595) loss:3.462 lr:0.0000100 epoch_Time:25359.0min: [2024-01-05 08:50:22,543][model8_pretrain.py][INFO] Epoch:[0/2](596700/4588595) loss:3.237 lr:0.0000100 epoch_Time:25359.0min: [2024-01-05 08:50:59,484][model8_pretrain.py][INFO] Epoch:[0/2](596800/4588595) loss:3.233 lr:0.0000100 epoch_Time:25358.0min: [2024-01-05 08:50:59,484][model8_pretrain.py][INFO] Epoch:[0/2](596800/4588595) loss:2.827 lr:0.0000100 epoch_Time:25358.0min: [2024-01-05 08:50:59,484][model8_pretrain.py][INFO] Epoch:[0/2](596800/4588595) loss:3.212 lr:0.0000100 epoch_Time:25358.0min: [2024-01-05 08:50:59,484][model8_pretrain.py][INFO] Epoch:[0/2](596800/4588595) loss:2.375 lr:0.0000100 epoch_Time:25358.0min: [2024-01-05 08:50:59,484][model8_pretrain.py][INFO] Epoch:[0/2](596800/4588595) loss:2.578 lr:0.0000100 epoch_Time:25358.0min: [2024-01-05 08:50:59,484][model8_pretrain.py][INFO] Epoch:[0/2](596800/4588595) loss:3.079 lr:0.0000100 epoch_Time:25358.0min: [2024-01-05 08:50:59,484][model8_pretrain.py][INFO] Epoch:[0/2](596800/4588595) loss:2.663 lr:0.0000100 epoch_Time:25358.0min: [2024-01-05 08:50:59,485][model8_pretrain.py][INFO] Epoch:[0/2](596800/4588595) loss:3.056 lr:0.0000100 epoch_Time:25358.0min: [2024-01-05 08:51:36,446][model8_pretrain.py][INFO] Epoch:[0/2](596900/4588595) loss:2.717 lr:0.0000100 epoch_Time:25358.0min: [2024-01-05 08:51:36,446][model8_pretrain.py][INFO] Epoch:[0/2](596900/4588595) loss:2.918 lr:0.0000100 
epoch_Time:25358.0min: [2024-01-05 08:51:36,446][model8_pretrain.py][INFO] Epoch:[0/2](596900/4588595) loss:3.242 lr:0.0000100 epoch_Time:25358.0min: [2024-01-05 08:51:36,446][model8_pretrain.py][INFO] Epoch:[0/2](596900/4588595) loss:2.377 lr:0.0000100 epoch_Time:25358.0min: [2024-01-05 08:51:36,446][model8_pretrain.py][INFO] Epoch:[0/2](596900/4588595) loss:3.262 lr:0.0000100 epoch_Time:25358.0min: [2024-01-05 08:51:36,446][model8_pretrain.py][INFO] Epoch:[0/2](596900/4588595) loss:2.626 lr:0.0000100 epoch_Time:25358.0min: [2024-01-05 08:51:36,446][model8_pretrain.py][INFO] Epoch:[0/2](596900/4588595) loss:2.850 lr:0.0000100 epoch_Time:25358.0min: [2024-01-05 08:51:36,447][model8_pretrain.py][INFO] Epoch:[0/2](596900/4588595) loss:2.765 lr:0.0000100 epoch_Time:25358.0min: [2024-01-05 08:52:25,226][model8_pretrain.py][INFO] Epoch:[0/2](597000/4588595) loss:2.841 lr:0.0000100 epoch_Time:25358.0min: [2024-01-05 08:52:25,226][model8_pretrain.py][INFO] Epoch:[0/2](597000/4588595) loss:2.824 lr:0.0000100 epoch_Time:25358.0min: [2024-01-05 08:52:25,226][model8_pretrain.py][INFO] Epoch:[0/2](597000/4588595) loss:2.870 lr:0.0000100 epoch_Time:25358.0min: [2024-01-05 08:52:25,226][model8_pretrain.py][INFO] Epoch:[0/2](597000/4588595) loss:2.845 lr:0.0000100 epoch_Time:25358.0min: [2024-01-05 08:52:25,226][model8_pretrain.py][INFO] Epoch:[0/2](597000/4588595) loss:2.864 lr:0.0000100 epoch_Time:25358.0min: [2024-01-05 08:52:25,226][model8_pretrain.py][INFO] Epoch:[0/2](597000/4588595) loss:2.918 lr:0.0000100 epoch_Time:25358.0min: [2024-01-05 08:52:25,226][model8_pretrain.py][INFO] Epoch:[0/2](597000/4588595) loss:3.115 lr:0.0000100 epoch_Time:25358.0min: [2024-01-05 08:52:25,226][model8_pretrain.py][INFO] Epoch:[0/2](597000/4588595) loss:2.595 lr:0.0000100 epoch_Time:25358.0min: [2024-01-05 08:53:02,157][model8_pretrain.py][INFO] Epoch:[0/2](597100/4588595) loss:3.269 lr:0.0000100 epoch_Time:25357.0min: [2024-01-05 08:53:02,157][model8_pretrain.py][INFO] 
Epoch:[0/2](597100/4588595) loss:2.372 lr:0.0000100 epoch_Time:25357.0min: [2024-01-05 08:53:02,157][model8_pretrain.py][INFO] Epoch:[0/2](597100/4588595) loss:2.732 lr:0.0000100 epoch_Time:25357.0min: [2024-01-05 08:53:02,157][model8_pretrain.py][INFO] Epoch:[0/2](597100/4588595) loss:2.549 lr:0.0000100 epoch_Time:25357.0min: [2024-01-05 08:53:02,157][model8_pretrain.py][INFO] Epoch:[0/2](597100/4588595) loss:3.327 lr:0.0000100 epoch_Time:25357.0min: [2024-01-05 08:53:02,157][model8_pretrain.py][INFO] Epoch:[0/2](597100/4588595) loss:3.050 lr:0.0000100 epoch_Time:25357.0min: [2024-01-05 08:53:02,157][model8_pretrain.py][INFO] Epoch:[0/2](597100/4588595) loss:3.290 lr:0.0000100 epoch_Time:25357.0min: [2024-01-05 08:53:02,157][model8_pretrain.py][INFO] Epoch:[0/2](597100/4588595) loss:2.906 lr:0.0000100 epoch_Time:25357.0min: [2024-01-05 08:53:39,098][model8_pretrain.py][INFO] Epoch:[0/2](597200/4588595) loss:2.621 lr:0.0000100 epoch_Time:25357.0min: [2024-01-05 08:53:39,098][model8_pretrain.py][INFO] Epoch:[0/2](597200/4588595) loss:2.684 lr:0.0000100 epoch_Time:25357.0min: [2024-01-05 08:53:39,098][model8_pretrain.py][INFO] Epoch:[0/2](597200/4588595) loss:2.325 lr:0.0000100 epoch_Time:25357.0min: [2024-01-05 08:53:39,098][model8_pretrain.py][INFO] Epoch:[0/2](597200/4588595) loss:2.890 lr:0.0000100 epoch_Time:25357.0min: [2024-01-05 08:53:39,098][model8_pretrain.py][INFO] Epoch:[0/2](597200/4588595) loss:3.189 lr:0.0000100 epoch_Time:25357.0min: [2024-01-05 08:53:39,098][model8_pretrain.py][INFO] Epoch:[0/2](597200/4588595) loss:2.898 lr:0.0000100 epoch_Time:25357.0min: [2024-01-05 08:53:39,098][model8_pretrain.py][INFO] Epoch:[0/2](597200/4588595) loss:3.244 lr:0.0000100 epoch_Time:25357.0min: [2024-01-05 08:53:39,098][model8_pretrain.py][INFO] Epoch:[0/2](597200/4588595) loss:3.024 lr:0.0000100 epoch_Time:25357.0min: [2024-01-05 08:54:16,034][model8_pretrain.py][INFO] Epoch:[0/2](597300/4588595) loss:2.999 lr:0.0000100 epoch_Time:25355.0min: [2024-01-05 
08:54:16,034][model8_pretrain.py][INFO] Epoch:[0/2](597300/4588595) loss:2.698 lr:0.0000100 epoch_Time:25355.0min: [2024-01-05 08:54:16,034][model8_pretrain.py][INFO] Epoch:[0/2](597300/4588595) loss:2.679 lr:0.0000100 epoch_Time:25355.0min: [2024-01-05 08:54:16,034][model8_pretrain.py][INFO] Epoch:[0/2](597300/4588595) loss:2.310 lr:0.0000100 epoch_Time:25355.0min: [2024-01-05 08:54:16,034][model8_pretrain.py][INFO] Epoch:[0/2](597300/4588595) loss:3.056 lr:0.0000100 epoch_Time:25355.0min: [2024-01-05 08:54:16,034][model8_pretrain.py][INFO] Epoch:[0/2](597300/4588595) loss:3.100 lr:0.0000100 epoch_Time:25355.0min: [2024-01-05 08:54:16,034][model8_pretrain.py][INFO] Epoch:[0/2](597300/4588595) loss:2.617 lr:0.0000100 epoch_Time:25355.0min: [2024-01-05 08:54:16,034][model8_pretrain.py][INFO] Epoch:[0/2](597300/4588595) loss:2.996 lr:0.0000100 epoch_Time:25355.0min: [2024-01-05 08:54:52,974][model8_pretrain.py][INFO] Epoch:[0/2](597400/4588595) loss:2.856 lr:0.0000100 epoch_Time:25354.0min: [2024-01-05 08:54:52,974][model8_pretrain.py][INFO] Epoch:[0/2](597400/4588595) loss:2.036 lr:0.0000100 epoch_Time:25354.0min: [2024-01-05 08:54:52,974][model8_pretrain.py][INFO] Epoch:[0/2](597400/4588595) loss:2.683 lr:0.0000100 epoch_Time:25354.0min: [2024-01-05 08:54:52,974][model8_pretrain.py][INFO] Epoch:[0/2](597400/4588595) loss:3.099 lr:0.0000100 epoch_Time:25354.0min: [2024-01-05 08:54:52,974][model8_pretrain.py][INFO] Epoch:[0/2](597400/4588595) loss:2.719 lr:0.0000100 epoch_Time:25354.0min: [2024-01-05 08:54:52,974][model8_pretrain.py][INFO] Epoch:[0/2](597400/4588595) loss:2.822 lr:0.0000100 epoch_Time:25354.0min: [2024-01-05 08:54:52,974][model8_pretrain.py][INFO] Epoch:[0/2](597400/4588595) loss:2.082 lr:0.0000100 epoch_Time:25354.0min: [2024-01-05 08:54:52,974][model8_pretrain.py][INFO] Epoch:[0/2](597400/4588595) loss:2.398 lr:0.0000100 epoch_Time:25354.0min: [2024-01-05 08:55:29,913][model8_pretrain.py][INFO] Epoch:[0/2](597500/4588595) loss:2.879 lr:0.0000100 
epoch_Time:25354.0min: [2024-01-05 08:55:29,913][model8_pretrain.py][INFO] Epoch:[0/2](597500/4588595) loss:3.038 lr:0.0000100 epoch_Time:25354.0min: [2024-01-05 08:55:29,913][model8_pretrain.py][INFO] Epoch:[0/2](597500/4588595) loss:2.543 lr:0.0000100 epoch_Time:25354.0min: [2024-01-05 08:55:29,913][model8_pretrain.py][INFO] Epoch:[0/2](597500/4588595) loss:2.480 lr:0.0000100 epoch_Time:25354.0min: [2024-01-05 08:55:29,913][model8_pretrain.py][INFO] Epoch:[0/2](597500/4588595) loss:3.189 lr:0.0000100 epoch_Time:25354.0min: [2024-01-05 08:55:29,914][model8_pretrain.py][INFO] Epoch:[0/2](597500/4588595) loss:2.512 lr:0.0000100 epoch_Time:25354.0min: [2024-01-05 08:55:29,914][model8_pretrain.py][INFO] Epoch:[0/2](597500/4588595) loss:2.569 lr:0.0000100 epoch_Time:25354.0min: [2024-01-05 08:55:29,914][model8_pretrain.py][INFO] Epoch:[0/2](597500/4588595) loss:2.455 lr:0.0000100 epoch_Time:25354.0min: [2024-01-05 08:56:06,871][model8_pretrain.py][INFO] Epoch:[0/2](597600/4588595) loss:3.131 lr:0.0000100 epoch_Time:25353.0min: [2024-01-05 08:56:06,871][model8_pretrain.py][INFO] Epoch:[0/2](597600/4588595) loss:3.108 lr:0.0000100 epoch_Time:25353.0min: [2024-01-05 08:56:06,871][model8_pretrain.py][INFO] Epoch:[0/2](597600/4588595) loss:2.880 lr:0.0000100 epoch_Time:25353.0min: [2024-01-05 08:56:06,871][model8_pretrain.py][INFO] Epoch:[0/2](597600/4588595) loss:2.676 lr:0.0000100 epoch_Time:25353.0min: [2024-01-05 08:56:06,871][model8_pretrain.py][INFO] Epoch:[0/2](597600/4588595) loss:2.478 lr:0.0000100 epoch_Time:25353.0min: [2024-01-05 08:56:06,871][model8_pretrain.py][INFO] Epoch:[0/2](597600/4588595) loss:2.749 lr:0.0000100 epoch_Time:25353.0min: [2024-01-05 08:56:06,871][model8_pretrain.py][INFO] Epoch:[0/2](597600/4588595) loss:2.300 lr:0.0000100 epoch_Time:25353.0min: [2024-01-05 08:56:06,871][model8_pretrain.py][INFO] Epoch:[0/2](597600/4588595) loss:2.675 lr:0.0000100 epoch_Time:25353.0min: [2024-01-05 08:56:43,810][model8_pretrain.py][INFO] 
Epoch:[0/2](597700/4588595) loss:2.572 lr:0.0000100 epoch_Time:25353.0min: [2024-01-05 08:56:43,810][model8_pretrain.py][INFO] Epoch:[0/2](597700/4588595) loss:3.184 lr:0.0000100 epoch_Time:25353.0min: [2024-01-05 08:56:43,810][model8_pretrain.py][INFO] Epoch:[0/2](597700/4588595) loss:3.001 lr:0.0000100 epoch_Time:25353.0min: [2024-01-05 08:56:43,810][model8_pretrain.py][INFO] Epoch:[0/2](597700/4588595) loss:2.739 lr:0.0000100 epoch_Time:25353.0min: [2024-01-05 08:56:43,810][model8_pretrain.py][INFO] Epoch:[0/2](597700/4588595) loss:2.849 lr:0.0000100 epoch_Time:25353.0min: [2024-01-05 08:56:43,810][model8_pretrain.py][INFO] Epoch:[0/2](597700/4588595) loss:2.688 lr:0.0000100 epoch_Time:25353.0min: [2024-01-05 08:56:43,810][model8_pretrain.py][INFO] Epoch:[0/2](597700/4588595) loss:2.838 lr:0.0000100 epoch_Time:25353.0min: [2024-01-05 08:56:43,810][model8_pretrain.py][INFO] Epoch:[0/2](597700/4588595) loss:3.106 lr:0.0000100 epoch_Time:25353.0min: [2024-01-05 08:57:32,452][model8_pretrain.py][INFO] Epoch:[0/2](597800/4588595) loss:3.298 lr:0.0000100 epoch_Time:25353.0min: [2024-01-05 08:57:32,452][model8_pretrain.py][INFO] Epoch:[0/2](597800/4588595) loss:2.910 lr:0.0000100 epoch_Time:25353.0min: [2024-01-05 08:57:32,452][model8_pretrain.py][INFO] Epoch:[0/2](597800/4588595) loss:2.506 lr:0.0000100 epoch_Time:25353.0min: [2024-01-05 08:57:32,452][model8_pretrain.py][INFO] Epoch:[0/2](597800/4588595) loss:2.720 lr:0.0000100 epoch_Time:25353.0min: [2024-01-05 08:57:32,452][model8_pretrain.py][INFO] Epoch:[0/2](597800/4588595) loss:2.731 lr:0.0000100 epoch_Time:25353.0min: [2024-01-05 08:57:32,452][model8_pretrain.py][INFO] Epoch:[0/2](597800/4588595) loss:3.008 lr:0.0000100 epoch_Time:25353.0min: [2024-01-05 08:57:32,452][model8_pretrain.py][INFO] Epoch:[0/2](597800/4588595) loss:2.866 lr:0.0000100 epoch_Time:25353.0min: [2024-01-05 08:57:32,452][model8_pretrain.py][INFO] Epoch:[0/2](597800/4588595) loss:3.069 lr:0.0000100 epoch_Time:25353.0min: [2024-01-05 
08:58:09,399][model8_pretrain.py][INFO] Epoch:[0/2](597900/4588595) loss:3.128 lr:0.0000100 epoch_Time:25352.0min: [2024-01-05 08:58:09,399][model8_pretrain.py][INFO] Epoch:[0/2](597900/4588595) loss:2.662 lr:0.0000100 epoch_Time:25352.0min: [2024-01-05 08:58:09,399][model8_pretrain.py][INFO] Epoch:[0/2](597900/4588595) loss:2.363 lr:0.0000100 epoch_Time:25352.0min: [2024-01-05 08:58:09,399][model8_pretrain.py][INFO] Epoch:[0/2](597900/4588595) loss:2.431 lr:0.0000100 epoch_Time:25352.0min: [2024-01-05 08:58:09,399][model8_pretrain.py][INFO] Epoch:[0/2](597900/4588595) loss:2.351 lr:0.0000100 epoch_Time:25352.0min: [2024-01-05 08:58:09,400][model8_pretrain.py][INFO] Epoch:[0/2](597900/4588595) loss:1.926 lr:0.0000100 epoch_Time:25352.0min: [2024-01-05 08:58:09,400][model8_pretrain.py][INFO] Epoch:[0/2](597900/4588595) loss:2.980 lr:0.0000100 epoch_Time:25352.0min: [2024-01-05 08:58:09,400][model8_pretrain.py][INFO] Epoch:[0/2](597900/4588595) loss:2.274 lr:0.0000100 epoch_Time:25352.0min: [2024-01-05 08:58:46,360][model8_pretrain.py][INFO] Epoch:[0/2](598000/4588595) loss:3.625 lr:0.0000100 epoch_Time:25352.0min: [2024-01-05 08:58:46,360][model8_pretrain.py][INFO] Epoch:[0/2](598000/4588595) loss:1.961 lr:0.0000100 epoch_Time:25352.0min: [2024-01-05 08:58:46,360][model8_pretrain.py][INFO] Epoch:[0/2](598000/4588595) loss:3.060 lr:0.0000100 epoch_Time:25352.0min: [2024-01-05 08:58:46,360][model8_pretrain.py][INFO] Epoch:[0/2](598000/4588595) loss:2.777 lr:0.0000100 epoch_Time:25352.0min: [2024-01-05 08:58:46,360][model8_pretrain.py][INFO] Epoch:[0/2](598000/4588595) loss:2.969 lr:0.0000100 epoch_Time:25352.0min: [2024-01-05 08:58:46,360][model8_pretrain.py][INFO] Epoch:[0/2](598000/4588595) loss:3.323 lr:0.0000100 epoch_Time:25352.0min: [2024-01-05 08:58:46,360][model8_pretrain.py][INFO] Epoch:[0/2](598000/4588595) loss:2.742 lr:0.0000100 epoch_Time:25352.0min: [2024-01-05 08:58:46,360][model8_pretrain.py][INFO] Epoch:[0/2](598000/4588595) loss:2.944 lr:0.0000100 
epoch_Time:25352.0min: [2024-01-05 08:59:23,318][model8_pretrain.py][INFO] Epoch:[0/2](598100/4588595) loss:2.892 lr:0.0000100 epoch_Time:25351.0min: [2024-01-05 08:59:23,318][model8_pretrain.py][INFO] Epoch:[0/2](598100/4588595) loss:2.893 lr:0.0000100 epoch_Time:25351.0min: [2024-01-05 08:59:23,318][model8_pretrain.py][INFO] Epoch:[0/2](598100/4588595) loss:2.944 lr:0.0000100 epoch_Time:25351.0min: [2024-01-05 08:59:23,318][model8_pretrain.py][INFO] Epoch:[0/2](598100/4588595) loss:3.021 lr:0.0000100 epoch_Time:25351.0min: [2024-01-05 08:59:23,318][model8_pretrain.py][INFO] Epoch:[0/2](598100/4588595) loss:3.023 lr:0.0000100 epoch_Time:25351.0min: [2024-01-05 08:59:23,318][model8_pretrain.py][INFO] Epoch:[0/2](598100/4588595) loss:3.183 lr:0.0000100 epoch_Time:25351.0min: [2024-01-05 08:59:23,318][model8_pretrain.py][INFO] Epoch:[0/2](598100/4588595) loss:2.798 lr:0.0000100 epoch_Time:25351.0min: [2024-01-05 08:59:23,318][model8_pretrain.py][INFO] Epoch:[0/2](598100/4588595) loss:2.639 lr:0.0000100 epoch_Time:25351.0min: [2024-01-05 09:00:00,284][model8_pretrain.py][INFO] Epoch:[0/2](598200/4588595) loss:2.931 lr:0.0000100 epoch_Time:25350.0min: [2024-01-05 09:00:00,284][model8_pretrain.py][INFO] Epoch:[0/2](598200/4588595) loss:2.664 lr:0.0000100 epoch_Time:25350.0min: [2024-01-05 09:00:00,284][model8_pretrain.py][INFO] Epoch:[0/2](598200/4588595) loss:3.488 lr:0.0000100 epoch_Time:25350.0min: [2024-01-05 09:00:00,285][model8_pretrain.py][INFO] Epoch:[0/2](598200/4588595) loss:2.939 lr:0.0000100 epoch_Time:25350.0min: [2024-01-05 09:00:00,284][model8_pretrain.py][INFO] Epoch:[0/2](598200/4588595) loss:3.096 lr:0.0000100 epoch_Time:25350.0min: [2024-01-05 09:00:00,285][model8_pretrain.py][INFO] Epoch:[0/2](598200/4588595) loss:3.032 lr:0.0000100 epoch_Time:25350.0min: [2024-01-05 09:00:00,285][model8_pretrain.py][INFO] Epoch:[0/2](598200/4588595) loss:2.722 lr:0.0000100 epoch_Time:25350.0min: [2024-01-05 09:00:00,285][model8_pretrain.py][INFO] 
Epoch:[0/2](598200/4588595) loss:2.372 lr:0.0000100 epoch_Time:25350.0min: [2024-01-05 09:00:37,250][model8_pretrain.py][INFO] Epoch:[0/2](598300/4588595) loss:2.873 lr:0.0000100 epoch_Time:25349.0min: [2024-01-05 09:00:37,250][model8_pretrain.py][INFO] Epoch:[0/2](598300/4588595) loss:2.989 lr:0.0000100 epoch_Time:25349.0min: [2024-01-05 09:00:37,250][model8_pretrain.py][INFO] Epoch:[0/2](598300/4588595) loss:2.982 lr:0.0000100 epoch_Time:25349.0min: [2024-01-05 09:00:37,250][model8_pretrain.py][INFO] Epoch:[0/2](598300/4588595) loss:3.051 lr:0.0000100 epoch_Time:25349.0min: [2024-01-05 09:00:37,250][model8_pretrain.py][INFO] Epoch:[0/2](598300/4588595) loss:2.312 lr:0.0000100 epoch_Time:25349.0min: [2024-01-05 09:00:37,250][model8_pretrain.py][INFO] Epoch:[0/2](598300/4588595) loss:3.090 lr:0.0000100 epoch_Time:25349.0min: [2024-01-05 09:00:37,250][model8_pretrain.py][INFO] Epoch:[0/2](598300/4588595) loss:3.142 lr:0.0000100 epoch_Time:25349.0min: [2024-01-05 09:00:37,250][model8_pretrain.py][INFO] Epoch:[0/2](598300/4588595) loss:2.685 lr:0.0000100 epoch_Time:25349.0min: [2024-01-05 09:01:14,208][model8_pretrain.py][INFO] Epoch:[0/2](598400/4588595) loss:3.126 lr:0.0000100 epoch_Time:25348.0min: [2024-01-05 09:01:14,208][model8_pretrain.py][INFO] Epoch:[0/2](598400/4588595) loss:2.935 lr:0.0000100 epoch_Time:25348.0min: [2024-01-05 09:01:14,208][model8_pretrain.py][INFO] Epoch:[0/2](598400/4588595) loss:3.142 lr:0.0000100 epoch_Time:25348.0min: [2024-01-05 09:01:14,208][model8_pretrain.py][INFO] Epoch:[0/2](598400/4588595) loss:2.702 lr:0.0000100 epoch_Time:25348.0min: [2024-01-05 09:01:14,208][model8_pretrain.py][INFO] Epoch:[0/2](598400/4588595) loss:2.840 lr:0.0000100 epoch_Time:25348.0min: [2024-01-05 09:01:14,208][model8_pretrain.py][INFO] Epoch:[0/2](598400/4588595) loss:3.207 lr:0.0000100 epoch_Time:25348.0min: [2024-01-05 09:01:14,208][model8_pretrain.py][INFO] Epoch:[0/2](598400/4588595) loss:3.041 lr:0.0000100 epoch_Time:25348.0min: [2024-01-05 
09:01:14,208][model8_pretrain.py][INFO] Epoch:[0/2](598400/4588595) loss:2.987 lr:0.0000100 epoch_Time:25348.0min: [2024-01-05 09:01:51,171][model8_pretrain.py][INFO] Epoch:[0/2](598500/4588595) loss:3.231 lr:0.0000100 epoch_Time:25347.0min: [2024-01-05 09:01:51,171][model8_pretrain.py][INFO] Epoch:[0/2](598500/4588595) loss:3.459 lr:0.0000100 epoch_Time:25347.0min: [2024-01-05 09:01:51,171][model8_pretrain.py][INFO] Epoch:[0/2](598500/4588595) loss:2.655 lr:0.0000100 epoch_Time:25347.0min: [2024-01-05 09:01:51,171][model8_pretrain.py][INFO] Epoch:[0/2](598500/4588595) loss:3.351 lr:0.0000100 epoch_Time:25347.0min: [2024-01-05 09:01:51,171][model8_pretrain.py][INFO] Epoch:[0/2](598500/4588595) loss:3.032 lr:0.0000100 epoch_Time:25347.0min: [2024-01-05 09:01:51,171][model8_pretrain.py][INFO] Epoch:[0/2](598500/4588595) loss:2.868 lr:0.0000100 epoch_Time:25347.0min: [2024-01-05 09:01:51,171][model8_pretrain.py][INFO] Epoch:[0/2](598500/4588595) loss:2.534 lr:0.0000100 epoch_Time:25347.0min: [2024-01-05 09:01:51,171][model8_pretrain.py][INFO] Epoch:[0/2](598500/4588595) loss:2.353 lr:0.0000100 epoch_Time:25347.0min: [2024-01-05 09:02:39,996][model8_pretrain.py][INFO] Epoch:[0/2](598600/4588595) loss:2.788 lr:0.0000100 epoch_Time:25349.0min: [2024-01-05 09:02:39,996][model8_pretrain.py][INFO] Epoch:[0/2](598600/4588595) loss:3.075 lr:0.0000100 epoch_Time:25349.0min: [2024-01-05 09:02:39,996][model8_pretrain.py][INFO] Epoch:[0/2](598600/4588595) loss:2.846 lr:0.0000100 epoch_Time:25349.0min: [2024-01-05 09:02:39,996][model8_pretrain.py][INFO] Epoch:[0/2](598600/4588595) loss:2.073 lr:0.0000100 epoch_Time:25349.0min: [2024-01-05 09:02:39,996][model8_pretrain.py][INFO] Epoch:[0/2](598600/4588595) loss:3.420 lr:0.0000100 epoch_Time:25349.0min: [2024-01-05 09:02:39,996][model8_pretrain.py][INFO] Epoch:[0/2](598600/4588595) loss:2.582 lr:0.0000100 epoch_Time:25349.0min: [2024-01-05 09:02:39,996][model8_pretrain.py][INFO] Epoch:[0/2](598600/4588595) loss:2.561 lr:0.0000100 
epoch_Time:25349.0min: [2024-01-05 09:02:39,997][model8_pretrain.py][INFO] Epoch:[0/2](598600/4588595) loss:3.103 lr:0.0000100 epoch_Time:25349.0min: [2024-01-05 09:03:16,934][model8_pretrain.py][INFO] Epoch:[0/2](598700/4588595) loss:2.664 lr:0.0000100 epoch_Time:25347.0min: [2024-01-05 09:03:16,934][model8_pretrain.py][INFO] Epoch:[0/2](598700/4588595) loss:3.227 lr:0.0000100 epoch_Time:25347.0min: [2024-01-05 09:03:16,934][model8_pretrain.py][INFO] Epoch:[0/2](598700/4588595) loss:2.664 lr:0.0000100 epoch_Time:25347.0min: [2024-01-05 09:03:16,935][model8_pretrain.py][INFO] Epoch:[0/2](598700/4588595) loss:2.548 lr:0.0000100 epoch_Time:25347.0min: [2024-01-05 09:03:16,935][model8_pretrain.py][INFO] Epoch:[0/2](598700/4588595) loss:2.704 lr:0.0000100 epoch_Time:25347.0min: [2024-01-05 09:03:16,935][model8_pretrain.py][INFO] Epoch:[0/2](598700/4588595) loss:2.623 lr:0.0000100 epoch_Time:25347.0min: [2024-01-05 09:03:16,935][model8_pretrain.py][INFO] Epoch:[0/2](598700/4588595) loss:3.196 lr:0.0000100 epoch_Time:25347.0min: [2024-01-05 09:03:16,935][model8_pretrain.py][INFO] Epoch:[0/2](598700/4588595) loss:3.159 lr:0.0000100 epoch_Time:25347.0min: [2024-01-05 09:03:53,839][model8_pretrain.py][INFO] Epoch:[0/2](598800/4588595) loss:2.513 lr:0.0000100 epoch_Time:25346.0min: [2024-01-05 09:03:53,839][model8_pretrain.py][INFO] Epoch:[0/2](598800/4588595) loss:2.648 lr:0.0000100 epoch_Time:25346.0min: [2024-01-05 09:03:53,839][model8_pretrain.py][INFO] Epoch:[0/2](598800/4588595) loss:2.572 lr:0.0000100 epoch_Time:25346.0min: [2024-01-05 09:03:53,840][model8_pretrain.py][INFO] Epoch:[0/2](598800/4588595) loss:2.868 lr:0.0000100 epoch_Time:25346.0min: [2024-01-05 09:03:53,840][model8_pretrain.py][INFO] Epoch:[0/2](598800/4588595) loss:3.112 lr:0.0000100 epoch_Time:25346.0min: [2024-01-05 09:03:53,840][model8_pretrain.py][INFO] Epoch:[0/2](598800/4588595) loss:2.754 lr:0.0000100 epoch_Time:25346.0min: [2024-01-05 09:03:53,840][model8_pretrain.py][INFO] 
Epoch:[0/2](598800/4588595) loss:2.906 lr:0.0000100 epoch_Time:25346.0min: [2024-01-05 09:03:53,844][model8_pretrain.py][INFO] Epoch:[0/2](598800/4588595) loss:2.850 lr:0.0000100 epoch_Time:25346.0min: [2024-01-05 09:04:30,779][model8_pretrain.py][INFO] Epoch:[0/2](598900/4588595) loss:3.038 lr:0.0000100 epoch_Time:25346.0min: [2024-01-05 09:04:30,779][model8_pretrain.py][INFO] Epoch:[0/2](598900/4588595) loss:3.223 lr:0.0000100 epoch_Time:25346.0min: [2024-01-05 09:04:30,779][model8_pretrain.py][INFO] Epoch:[0/2](598900/4588595) loss:3.261 lr:0.0000100 epoch_Time:25346.0min: [2024-01-05 09:04:30,779][model8_pretrain.py][INFO] Epoch:[0/2](598900/4588595) loss:3.288 lr:0.0000100 epoch_Time:25346.0min: [2024-01-05 09:04:30,779][model8_pretrain.py][INFO] Epoch:[0/2](598900/4588595) loss:2.039 lr:0.0000100 epoch_Time:25346.0min: [2024-01-05 09:04:30,779][model8_pretrain.py][INFO] Epoch:[0/2](598900/4588595) loss:3.166 lr:0.0000100 epoch_Time:25346.0min: [2024-01-05 09:04:30,780][model8_pretrain.py][INFO] Epoch:[0/2](598900/4588595) loss:2.917 lr:0.0000100 epoch_Time:25346.0min: [2024-01-05 09:04:30,780][model8_pretrain.py][INFO] Epoch:[0/2](598900/4588595) loss:2.871 lr:0.0000100 epoch_Time:25346.0min: [2024-01-05 09:05:07,709][model8_pretrain.py][INFO] Epoch:[0/2](599000/4588595) loss:2.839 lr:0.0000100 epoch_Time:25345.0min: [2024-01-05 09:05:07,709][model8_pretrain.py][INFO] Epoch:[0/2](599000/4588595) loss:2.841 lr:0.0000100 epoch_Time:25345.0min: [2024-01-05 09:05:07,709][model8_pretrain.py][INFO] Epoch:[0/2](599000/4588595) loss:2.989 lr:0.0000100 epoch_Time:25345.0min: [2024-01-05 09:05:07,709][model8_pretrain.py][INFO] Epoch:[0/2](599000/4588595) loss:3.061 lr:0.0000100 epoch_Time:25345.0min: [2024-01-05 09:05:07,709][model8_pretrain.py][INFO] Epoch:[0/2](599000/4588595) loss:3.020 lr:0.0000100 epoch_Time:25345.0min: [2024-01-05 09:05:07,709][model8_pretrain.py][INFO] Epoch:[0/2](599000/4588595) loss:2.715 lr:0.0000100 epoch_Time:25345.0min: [2024-01-05 
09:05:07,709][model8_pretrain.py][INFO] Epoch:[0/2](599000/4588595) loss:2.656 lr:0.0000100 epoch_Time:25345.0min: [2024-01-05 09:05:07,711][model8_pretrain.py][INFO] Epoch:[0/2](599000/4588595) loss:3.079 lr:0.0000100 epoch_Time:25345.0min: [2024-01-05 09:05:44,645][model8_pretrain.py][INFO] Epoch:[0/2](599100/4588595) loss:2.367 lr:0.0000100 epoch_Time:25345.0min: [2024-01-05 09:05:44,645][model8_pretrain.py][INFO] Epoch:[0/2](599100/4588595) loss:2.343 lr:0.0000100 epoch_Time:25345.0min: [2024-01-05 09:05:44,645][model8_pretrain.py][INFO] Epoch:[0/2](599100/4588595) loss:2.812 lr:0.0000100 epoch_Time:25345.0min: [2024-01-05 09:05:44,645][model8_pretrain.py][INFO] Epoch:[0/2](599100/4588595) loss:2.770 lr:0.0000100 epoch_Time:25345.0min: [2024-01-05 09:05:44,645][model8_pretrain.py][INFO] Epoch:[0/2](599100/4588595) loss:3.123 lr:0.0000100 epoch_Time:25345.0min: [2024-01-05 09:05:44,645][model8_pretrain.py][INFO] Epoch:[0/2](599100/4588595) loss:2.498 lr:0.0000100 epoch_Time:25345.0min: [2024-01-05 09:05:44,645][model8_pretrain.py][INFO] Epoch:[0/2](599100/4588595) loss:3.202 lr:0.0000100 epoch_Time:25345.0min: [2024-01-05 09:05:44,645][model8_pretrain.py][INFO] Epoch:[0/2](599100/4588595) loss:2.286 lr:0.0000100 epoch_Time:25345.0min: [2024-01-05 09:06:21,589][model8_pretrain.py][INFO] Epoch:[0/2](599200/4588595) loss:2.830 lr:0.0000100 epoch_Time:25344.0min: [2024-01-05 09:06:21,589][model8_pretrain.py][INFO] Epoch:[0/2](599200/4588595) loss:2.704 lr:0.0000100 epoch_Time:25344.0min: [2024-01-05 09:06:21,589][model8_pretrain.py][INFO] Epoch:[0/2](599200/4588595) loss:3.213 lr:0.0000100 epoch_Time:25344.0min: [2024-01-05 09:06:21,589][model8_pretrain.py][INFO] Epoch:[0/2](599200/4588595) loss:3.285 lr:0.0000100 epoch_Time:25344.0min: [2024-01-05 09:06:21,589][model8_pretrain.py][INFO] Epoch:[0/2](599200/4588595) loss:2.935 lr:0.0000100 epoch_Time:25344.0min: [2024-01-05 09:06:21,589][model8_pretrain.py][INFO] Epoch:[0/2](599200/4588595) loss:2.811 lr:0.0000100 
epoch_Time:25344.0min: [2024-01-05 09:06:21,590][model8_pretrain.py][INFO] Epoch:[0/2](599200/4588595) loss:3.105 lr:0.0000100 epoch_Time:25344.0min: [2024-01-05 09:06:21,590][model8_pretrain.py][INFO] Epoch:[0/2](599200/4588595) loss:2.443 lr:0.0000100 epoch_Time:25344.0min: [2024-01-05 09:06:58,528][model8_pretrain.py][INFO] Epoch:[0/2](599300/4588595) loss:3.005 lr:0.0000100 epoch_Time:25342.0min: [2024-01-05 09:06:58,528][model8_pretrain.py][INFO] Epoch:[0/2](599300/4588595) loss:2.325 lr:0.0000100 epoch_Time:25342.0min: [2024-01-05 09:06:58,528][model8_pretrain.py][INFO] Epoch:[0/2](599300/4588595) loss:2.968 lr:0.0000100 epoch_Time:25342.0min: [2024-01-05 09:06:58,528][model8_pretrain.py][INFO] Epoch:[0/2](599300/4588595) loss:2.161 lr:0.0000100 epoch_Time:25342.0min: [2024-01-05 09:06:58,528][model8_pretrain.py][INFO] Epoch:[0/2](599300/4588595) loss:2.869 lr:0.0000100 epoch_Time:25342.0min: [2024-01-05 09:06:58,528][model8_pretrain.py][INFO] Epoch:[0/2](599300/4588595) loss:2.899 lr:0.0000100 epoch_Time:25342.0min: [2024-01-05 09:06:58,528][model8_pretrain.py][INFO] Epoch:[0/2](599300/4588595) loss:2.769 lr:0.0000100 epoch_Time:25342.0min: [2024-01-05 09:06:58,528][model8_pretrain.py][INFO] Epoch:[0/2](599300/4588595) loss:2.689 lr:0.0000100 epoch_Time:25342.0min: [2024-01-05 09:07:47,440][model8_pretrain.py][INFO] Epoch:[0/2](599400/4588595) loss:2.692 lr:0.0000100 epoch_Time:25344.0min: [2024-01-05 09:07:47,440][model8_pretrain.py][INFO] Epoch:[0/2](599400/4588595) loss:3.125 lr:0.0000100 epoch_Time:25344.0min: [2024-01-05 09:07:47,440][model8_pretrain.py][INFO] Epoch:[0/2](599400/4588595) loss:2.191 lr:0.0000100 epoch_Time:25344.0min: [2024-01-05 09:07:47,440][model8_pretrain.py][INFO] Epoch:[0/2](599400/4588595) loss:2.341 lr:0.0000100 epoch_Time:25344.0min: [2024-01-05 09:07:47,440][model8_pretrain.py][INFO] Epoch:[0/2](599400/4588595) loss:3.034 lr:0.0000100 epoch_Time:25344.0min: [2024-01-05 09:07:47,440][model8_pretrain.py][INFO] 
Epoch:[0/2](599400/4588595) loss:2.889 lr:0.0000100 epoch_Time:25344.0min: [2024-01-05 09:07:47,440][model8_pretrain.py][INFO] Epoch:[0/2](599400/4588595) loss:2.690 lr:0.0000100 epoch_Time:25344.0min: [2024-01-05 09:07:47,441][model8_pretrain.py][INFO] Epoch:[0/2](599400/4588595) loss:2.781 lr:0.0000100 epoch_Time:25344.0min: [2024-01-05 09:08:24,377][model8_pretrain.py][INFO] Epoch:[0/2](599500/4588595) loss:3.080 lr:0.0000100 epoch_Time:25343.0min: [2024-01-05 09:08:24,377][model8_pretrain.py][INFO] Epoch:[0/2](599500/4588595) loss:2.771 lr:0.0000100 epoch_Time:25343.0min: [2024-01-05 09:08:24,377][model8_pretrain.py][INFO] Epoch:[0/2](599500/4588595) loss:2.581 lr:0.0000100 epoch_Time:25343.0min: [2024-01-05 09:08:24,377][model8_pretrain.py][INFO] Epoch:[0/2](599500/4588595) loss:2.850 lr:0.0000100 epoch_Time:25343.0min: [2024-01-05 09:08:24,377][model8_pretrain.py][INFO] Epoch:[0/2](599500/4588595) loss:2.947 lr:0.0000100 epoch_Time:25343.0min: [2024-01-05 09:08:24,378][model8_pretrain.py][INFO] Epoch:[0/2](599500/4588595) loss:2.748 lr:0.0000100 epoch_Time:25343.0min: [2024-01-05 09:08:24,378][model8_pretrain.py][INFO] Epoch:[0/2](599500/4588595) loss:3.463 lr:0.0000100 epoch_Time:25343.0min: [2024-01-05 09:08:24,378][model8_pretrain.py][INFO] Epoch:[0/2](599500/4588595) loss:2.770 lr:0.0000100 epoch_Time:25343.0min: [2024-01-05 09:09:01,315][model8_pretrain.py][INFO] Epoch:[0/2](599600/4588595) loss:3.160 lr:0.0000100 epoch_Time:25342.0min: [2024-01-05 09:09:01,315][model8_pretrain.py][INFO] Epoch:[0/2](599600/4588595) loss:2.859 lr:0.0000100 epoch_Time:25342.0min: [2024-01-05 09:09:01,315][model8_pretrain.py][INFO] Epoch:[0/2](599600/4588595) loss:3.271 lr:0.0000100 epoch_Time:25342.0min: [2024-01-05 09:09:01,315][model8_pretrain.py][INFO] Epoch:[0/2](599600/4588595) loss:3.102 lr:0.0000100 epoch_Time:25342.0min: [2024-01-05 09:09:01,315][model8_pretrain.py][INFO] Epoch:[0/2](599600/4588595) loss:3.049 lr:0.0000100 epoch_Time:25342.0min: [2024-01-05 
09:09:01,316][model8_pretrain.py][INFO] Epoch:[0/2](599600/4588595) loss:2.481 lr:0.0000100 epoch_Time:25342.0min: [2024-01-05 09:09:01,315][model8_pretrain.py][INFO] Epoch:[0/2](599600/4588595) loss:3.253 lr:0.0000100 epoch_Time:25342.0min: [2024-01-05 09:09:01,316][model8_pretrain.py][INFO] Epoch:[0/2](599600/4588595) loss:2.613 lr:0.0000100 epoch_Time:25342.0min: [2024-01-05 09:09:38,256][model8_pretrain.py][INFO] Epoch:[0/2](599700/4588595) loss:2.511 lr:0.0000100 epoch_Time:25341.0min: [2024-01-05 09:09:38,256][model8_pretrain.py][INFO] Epoch:[0/2](599700/4588595) loss:3.061 lr:0.0000100 epoch_Time:25341.0min: [2024-01-05 09:09:38,256][model8_pretrain.py][INFO] Epoch:[0/2](599700/4588595) loss:2.844 lr:0.0000100 epoch_Time:25341.0min: [2024-01-05 09:09:38,256][model8_pretrain.py][INFO] Epoch:[0/2](599700/4588595) loss:2.754 lr:0.0000100 epoch_Time:25341.0min: [2024-01-05 09:09:38,256][model8_pretrain.py][INFO] Epoch:[0/2](599700/4588595) loss:2.804 lr:0.0000100 epoch_Time:25341.0min: [2024-01-05 09:09:38,256][model8_pretrain.py][INFO] Epoch:[0/2](599700/4588595) loss:3.404 lr:0.0000100 epoch_Time:25341.0min: [2024-01-05 09:09:38,256][model8_pretrain.py][INFO] Epoch:[0/2](599700/4588595) loss:2.370 lr:0.0000100 epoch_Time:25341.0min: [2024-01-05 09:09:38,256][model8_pretrain.py][INFO] Epoch:[0/2](599700/4588595) loss:2.490 lr:0.0000100 epoch_Time:25341.0min: [2024-01-05 09:10:15,201][model8_pretrain.py][INFO] Epoch:[0/2](599800/4588595) loss:2.993 lr:0.0000100 epoch_Time:25340.0min: [2024-01-05 09:10:15,201][model8_pretrain.py][INFO] Epoch:[0/2](599800/4588595) loss:2.950 lr:0.0000100 epoch_Time:25340.0min: [2024-01-05 09:10:15,201][model8_pretrain.py][INFO] Epoch:[0/2](599800/4588595) loss:2.744 lr:0.0000100 epoch_Time:25340.0min: [2024-01-05 09:10:15,201][model8_pretrain.py][INFO] Epoch:[0/2](599800/4588595) loss:2.532 lr:0.0000100 epoch_Time:25340.0min: [2024-01-05 09:10:15,201][model8_pretrain.py][INFO] Epoch:[0/2](599800/4588595) loss:2.930 lr:0.0000100 
epoch_Time:25340.0min: [2024-01-05 09:10:15,201][model8_pretrain.py][INFO] Epoch:[0/2](599800/4588595) loss:3.039 lr:0.0000100 epoch_Time:25340.0min: [2024-01-05 09:10:15,201][model8_pretrain.py][INFO] Epoch:[0/2](599800/4588595) loss:3.032 lr:0.0000100 epoch_Time:25340.0min: [2024-01-05 09:10:15,201][model8_pretrain.py][INFO] Epoch:[0/2](599800/4588595) loss:2.536 lr:0.0000100 epoch_Time:25340.0min: [2024-01-05 09:10:52,152][model8_pretrain.py][INFO] Epoch:[0/2](599900/4588595) loss:3.013 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:10:52,152][model8_pretrain.py][INFO] Epoch:[0/2](599900/4588595) loss:2.854 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:10:52,152][model8_pretrain.py][INFO] Epoch:[0/2](599900/4588595) loss:2.573 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:10:52,152][model8_pretrain.py][INFO] Epoch:[0/2](599900/4588595) loss:2.495 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:10:52,152][model8_pretrain.py][INFO] Epoch:[0/2](599900/4588595) loss:2.493 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:10:52,152][model8_pretrain.py][INFO] Epoch:[0/2](599900/4588595) loss:2.744 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:10:52,153][model8_pretrain.py][INFO] Epoch:[0/2](599900/4588595) loss:2.775 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:10:52,153][model8_pretrain.py][INFO] Epoch:[0/2](599900/4588595) loss:3.138 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:11:29,102][model8_pretrain.py][INFO] Epoch:[0/2](600000/4588595) loss:2.787 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:11:29,102][model8_pretrain.py][INFO] Epoch:[0/2](600000/4588595) loss:3.084 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:11:29,102][model8_pretrain.py][INFO] Epoch:[0/2](600000/4588595) loss:3.069 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:11:29,102][model8_pretrain.py][INFO] Epoch:[0/2](600000/4588595) loss:2.347 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:11:29,102][model8_pretrain.py][INFO] 
Epoch:[0/2](600000/4588595) loss:2.978 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:11:29,102][model8_pretrain.py][INFO] Epoch:[0/2](600000/4588595) loss:2.280 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:11:29,102][model8_pretrain.py][INFO] Epoch:[0/2](600000/4588595) loss:2.476 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:11:29,102][model8_pretrain.py][INFO] Epoch:[0/2](600000/4588595) loss:3.238 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:11:46,239][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_600000.pth [2024-01-05 09:11:46,239][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_600000.pth [2024-01-05 09:11:46,239][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_600000.pth [2024-01-05 09:11:46,239][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_600000.pth [2024-01-05 09:11:46,239][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_600000.pth [2024-01-05 09:11:46,240][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_600000.pth [2024-01-05 09:11:46,240][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_600000.pth [2024-01-05 09:11:46,241][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_600000.pth [2024-01-05 09:12:23,206][model8_pretrain.py][INFO] Epoch:[0/2](600100/4588595) loss:2.997 lr:0.0000100 epoch_Time:25340.0min: [2024-01-05 09:12:23,206][model8_pretrain.py][INFO] Epoch:[0/2](600100/4588595) loss:2.917 lr:0.0000100 epoch_Time:25340.0min: [2024-01-05 09:12:23,206][model8_pretrain.py][INFO] Epoch:[0/2](600100/4588595) loss:2.643 lr:0.0000100 epoch_Time:25340.0min: [2024-01-05 09:12:23,206][model8_pretrain.py][INFO] Epoch:[0/2](600100/4588595) loss:3.114 lr:0.0000100 epoch_Time:25340.0min: [2024-01-05 09:12:23,206][model8_pretrain.py][INFO] Epoch:[0/2](600100/4588595) loss:2.920 lr:0.0000100 epoch_Time:25340.0min: [2024-01-05 
09:12:23,206][model8_pretrain.py][INFO] Epoch:[0/2](600100/4588595) loss:3.219 lr:0.0000100 epoch_Time:25340.0min: [2024-01-05 09:12:23,206][model8_pretrain.py][INFO] Epoch:[0/2](600100/4588595) loss:2.971 lr:0.0000100 epoch_Time:25340.0min: [2024-01-05 09:12:23,206][model8_pretrain.py][INFO] Epoch:[0/2](600100/4588595) loss:2.352 lr:0.0000100 epoch_Time:25340.0min: [2024-01-05 09:13:12,193][model8_pretrain.py][INFO] Epoch:[0/2](600200/4588595) loss:2.540 lr:0.0000100 epoch_Time:25340.0min: [2024-01-05 09:13:12,193][model8_pretrain.py][INFO] Epoch:[0/2](600200/4588595) loss:2.164 lr:0.0000100 epoch_Time:25340.0min: [2024-01-05 09:13:12,193][model8_pretrain.py][INFO] Epoch:[0/2](600200/4588595) loss:3.432 lr:0.0000100 epoch_Time:25340.0min: [2024-01-05 09:13:12,193][model8_pretrain.py][INFO] Epoch:[0/2](600200/4588595) loss:3.452 lr:0.0000100 epoch_Time:25340.0min: [2024-01-05 09:13:12,193][model8_pretrain.py][INFO] Epoch:[0/2](600200/4588595) loss:2.792 lr:0.0000100 epoch_Time:25340.0min: [2024-01-05 09:13:12,193][model8_pretrain.py][INFO] Epoch:[0/2](600200/4588595) loss:3.240 lr:0.0000100 epoch_Time:25340.0min: [2024-01-05 09:13:12,193][model8_pretrain.py][INFO] Epoch:[0/2](600200/4588595) loss:2.793 lr:0.0000100 epoch_Time:25340.0min: [2024-01-05 09:13:12,193][model8_pretrain.py][INFO] Epoch:[0/2](600200/4588595) loss:2.791 lr:0.0000100 epoch_Time:25340.0min: [2024-01-05 09:13:49,130][model8_pretrain.py][INFO] Epoch:[0/2](600300/4588595) loss:3.185 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:13:49,130][model8_pretrain.py][INFO] Epoch:[0/2](600300/4588595) loss:2.523 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:13:49,130][model8_pretrain.py][INFO] Epoch:[0/2](600300/4588595) loss:2.753 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:13:49,130][model8_pretrain.py][INFO] Epoch:[0/2](600300/4588595) loss:2.851 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:13:49,130][model8_pretrain.py][INFO] Epoch:[0/2](600300/4588595) loss:3.051 lr:0.0000100 
epoch_Time:25339.0min: [2024-01-05 09:13:49,130][model8_pretrain.py][INFO] Epoch:[0/2](600300/4588595) loss:3.062 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:13:49,130][model8_pretrain.py][INFO] Epoch:[0/2](600300/4588595) loss:3.163 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:13:49,130][model8_pretrain.py][INFO] Epoch:[0/2](600300/4588595) loss:2.820 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:14:26,065][model8_pretrain.py][INFO] Epoch:[0/2](600400/4588595) loss:2.647 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:14:26,065][model8_pretrain.py][INFO] Epoch:[0/2](600400/4588595) loss:3.106 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:14:26,065][model8_pretrain.py][INFO] Epoch:[0/2](600400/4588595) loss:2.951 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:14:26,065][model8_pretrain.py][INFO] Epoch:[0/2](600400/4588595) loss:3.262 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:14:26,065][model8_pretrain.py][INFO] Epoch:[0/2](600400/4588595) loss:2.584 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:14:26,065][model8_pretrain.py][INFO] Epoch:[0/2](600400/4588595) loss:2.159 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:14:26,065][model8_pretrain.py][INFO] Epoch:[0/2](600400/4588595) loss:2.474 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:14:26,065][model8_pretrain.py][INFO] Epoch:[0/2](600400/4588595) loss:2.581 lr:0.0000100 epoch_Time:25339.0min: [2024-01-05 09:15:03,020][model8_pretrain.py][INFO] Epoch:[0/2](600500/4588595) loss:2.700 lr:0.0000100 epoch_Time:25338.0min: [2024-01-05 09:15:03,020][model8_pretrain.py][INFO] Epoch:[0/2](600500/4588595) loss:3.053 lr:0.0000100 epoch_Time:25338.0min: [2024-01-05 09:15:03,020][model8_pretrain.py][INFO] Epoch:[0/2](600500/4588595) loss:2.209 lr:0.0000100 epoch_Time:25338.0min: [2024-01-05 09:15:03,020][model8_pretrain.py][INFO] Epoch:[0/2](600500/4588595) loss:2.444 lr:0.0000100 epoch_Time:25338.0min: [2024-01-05 09:15:03,020][model8_pretrain.py][INFO] 
Epoch:[0/2](600500/4588595) loss:2.480 lr:0.0000100 epoch_Time:25338.0min: [2024-01-05 09:15:03,020][model8_pretrain.py][INFO] Epoch:[0/2](600500/4588595) loss:3.234 lr:0.0000100 epoch_Time:25338.0min: [2024-01-05 09:15:03,020][model8_pretrain.py][INFO] Epoch:[0/2](600500/4588595) loss:2.982 lr:0.0000100 epoch_Time:25338.0min: [2024-01-05 09:15:03,020][model8_pretrain.py][INFO] Epoch:[0/2](600500/4588595) loss:3.001 lr:0.0000100 epoch_Time:25338.0min: [2024-01-05 09:15:39,982][model8_pretrain.py][INFO] Epoch:[0/2](600600/4588595) loss:3.270 lr:0.0000100 epoch_Time:25338.0min: [2024-01-05 09:15:39,982][model8_pretrain.py][INFO] Epoch:[0/2](600600/4588595) loss:2.589 lr:0.0000100 epoch_Time:25338.0min: [2024-01-05 09:15:39,982][model8_pretrain.py][INFO] Epoch:[0/2](600600/4588595) loss:3.286 lr:0.0000100 epoch_Time:25338.0min: [2024-01-05 09:15:39,982][model8_pretrain.py][INFO] Epoch:[0/2](600600/4588595) loss:3.154 lr:0.0000100 epoch_Time:25338.0min: [2024-01-05 09:15:39,982][model8_pretrain.py][INFO] Epoch:[0/2](600600/4588595) loss:3.402 lr:0.0000100 epoch_Time:25338.0min: [2024-01-05 09:15:39,982][model8_pretrain.py][INFO] Epoch:[0/2](600600/4588595) loss:2.919 lr:0.0000100 epoch_Time:25338.0min: [2024-01-05 09:15:39,982][model8_pretrain.py][INFO] Epoch:[0/2](600600/4588595) loss:3.024 lr:0.0000100 epoch_Time:25338.0min: [2024-01-05 09:15:39,982][model8_pretrain.py][INFO] Epoch:[0/2](600600/4588595) loss:2.494 lr:0.0000100 epoch_Time:25338.0min: [2024-01-05 09:16:16,904][model8_pretrain.py][INFO] Epoch:[0/2](600700/4588595) loss:3.181 lr:0.0000100 epoch_Time:25337.0min: [2024-01-05 09:16:16,904][model8_pretrain.py][INFO] Epoch:[0/2](600700/4588595) loss:2.802 lr:0.0000100 epoch_Time:25337.0min: [2024-01-05 09:16:16,904][model8_pretrain.py][INFO] Epoch:[0/2](600700/4588595) loss:2.424 lr:0.0000100 epoch_Time:25337.0min: [2024-01-05 09:16:16,904][model8_pretrain.py][INFO] Epoch:[0/2](600700/4588595) loss:3.043 lr:0.0000100 epoch_Time:25337.0min: [2024-01-05 
09:16:16,904][model8_pretrain.py][INFO] Epoch:[0/2](600700/4588595) loss:2.266 lr:0.0000100 epoch_Time:25337.0min: [2024-01-05 09:16:16,904][model8_pretrain.py][INFO] Epoch:[0/2](600700/4588595) loss:2.515 lr:0.0000100 epoch_Time:25337.0min: [2024-01-05 09:16:16,904][model8_pretrain.py][INFO] Epoch:[0/2](600700/4588595) loss:2.691 lr:0.0000100 epoch_Time:25337.0min: [2024-01-05 09:16:16,904][model8_pretrain.py][INFO] Epoch:[0/2](600700/4588595) loss:2.548 lr:0.0000100 epoch_Time:25337.0min: [2024-01-05 09:16:53,844][model8_pretrain.py][INFO] Epoch:[0/2](600800/4588595) loss:3.078 lr:0.0000100 epoch_Time:25335.0min: [2024-01-05 09:16:53,844][model8_pretrain.py][INFO] Epoch:[0/2](600800/4588595) loss:2.111 lr:0.0000100 epoch_Time:25335.0min: [2024-01-05 09:16:53,844][model8_pretrain.py][INFO] Epoch:[0/2](600800/4588595) loss:2.930 lr:0.0000100 epoch_Time:25335.0min: [2024-01-05 09:16:53,844][model8_pretrain.py][INFO] Epoch:[0/2](600800/4588595) loss:2.876 lr:0.0000100 epoch_Time:25335.0min: [2024-01-05 09:16:53,844][model8_pretrain.py][INFO] Epoch:[0/2](600800/4588595) loss:2.975 lr:0.0000100 epoch_Time:25335.0min: [2024-01-05 09:16:53,844][model8_pretrain.py][INFO] Epoch:[0/2](600800/4588595) loss:2.902 lr:0.0000100 epoch_Time:25335.0min: [2024-01-05 09:16:53,844][model8_pretrain.py][INFO] Epoch:[0/2](600800/4588595) loss:3.060 lr:0.0000100 epoch_Time:25335.0min: [2024-01-05 09:16:53,845][model8_pretrain.py][INFO] Epoch:[0/2](600800/4588595) loss:3.018 lr:0.0000100 epoch_Time:25335.0min: [2024-01-05 09:17:30,777][model8_pretrain.py][INFO] Epoch:[0/2](600900/4588595) loss:2.911 lr:0.0000100 epoch_Time:25335.0min: [2024-01-05 09:17:30,777][model8_pretrain.py][INFO] Epoch:[0/2](600900/4588595) loss:2.696 lr:0.0000100 epoch_Time:25335.0min: [2024-01-05 09:17:30,777][model8_pretrain.py][INFO] Epoch:[0/2](600900/4588595) loss:3.197 lr:0.0000100 epoch_Time:25335.0min: [2024-01-05 09:17:30,777][model8_pretrain.py][INFO] Epoch:[0/2](600900/4588595) loss:3.201 lr:0.0000100 
epoch_Time:25335.0min: [2024-01-05 09:17:30,777][model8_pretrain.py][INFO] Epoch:[0/2](600900/4588595) loss:2.949 lr:0.0000100 epoch_Time:25335.0min: [2024-01-05 09:17:30,777][model8_pretrain.py][INFO] Epoch:[0/2](600900/4588595) loss:2.677 lr:0.0000100 epoch_Time:25335.0min: [2024-01-05 09:17:30,777][model8_pretrain.py][INFO] Epoch:[0/2](600900/4588595) loss:2.746 lr:0.0000100 epoch_Time:25335.0min: [2024-01-05 09:17:30,778][model8_pretrain.py][INFO] Epoch:[0/2](600900/4588595) loss:3.255 lr:0.0000100 epoch_Time:25335.0min: [2024-01-05 09:18:19,733][model8_pretrain.py][INFO] Epoch:[0/2](601000/4588595) loss:2.987 lr:0.0000100 epoch_Time:25336.0min: [2024-01-05 09:18:19,733][model8_pretrain.py][INFO] Epoch:[0/2](601000/4588595) loss:3.253 lr:0.0000100 epoch_Time:25336.0min: [2024-01-05 09:18:19,733][model8_pretrain.py][INFO] Epoch:[0/2](601000/4588595) loss:2.896 lr:0.0000100 epoch_Time:25336.0min: [2024-01-05 09:18:19,733][model8_pretrain.py][INFO] Epoch:[0/2](601000/4588595) loss:2.821 lr:0.0000100 epoch_Time:25336.0min: [2024-01-05 09:18:19,733][model8_pretrain.py][INFO] Epoch:[0/2](601000/4588595) loss:3.077 lr:0.0000100 epoch_Time:25336.0min: [2024-01-05 09:18:19,733][model8_pretrain.py][INFO] Epoch:[0/2](601000/4588595) loss:3.217 lr:0.0000100 epoch_Time:25336.0min: [2024-01-05 09:18:19,733][model8_pretrain.py][INFO] Epoch:[0/2](601000/4588595) loss:3.025 lr:0.0000100 epoch_Time:25336.0min: [2024-01-05 09:18:19,733][model8_pretrain.py][INFO] Epoch:[0/2](601000/4588595) loss:2.668 lr:0.0000100 epoch_Time:25336.0min: [2024-01-05 09:18:56,663][model8_pretrain.py][INFO] Epoch:[0/2](601100/4588595) loss:2.374 lr:0.0000100 epoch_Time:25335.0min: [2024-01-05 09:18:56,663][model8_pretrain.py][INFO] Epoch:[0/2](601100/4588595) loss:3.053 lr:0.0000100 epoch_Time:25335.0min: [2024-01-05 09:18:56,663][model8_pretrain.py][INFO] Epoch:[0/2](601100/4588595) loss:3.124 lr:0.0000100 epoch_Time:25335.0min: [2024-01-05 09:18:56,663][model8_pretrain.py][INFO] 
Epoch:[0/2](601100/4588595) loss:2.969 lr:0.0000100 epoch_Time:25335.0min: [2024-01-05 09:18:56,663][model8_pretrain.py][INFO] Epoch:[0/2](601100/4588595) loss:2.926 lr:0.0000100 epoch_Time:25335.0min: [2024-01-05 09:18:56,663][model8_pretrain.py][INFO] Epoch:[0/2](601100/4588595) loss:2.708 lr:0.0000100 epoch_Time:25335.0min: [2024-01-05 09:18:56,663][model8_pretrain.py][INFO] Epoch:[0/2](601100/4588595) loss:3.099 lr:0.0000100 epoch_Time:25335.0min: [2024-01-05 09:18:56,663][model8_pretrain.py][INFO] Epoch:[0/2](601100/4588595) loss:2.426 lr:0.0000100 epoch_Time:25335.0min: [2024-01-05 09:19:33,600][model8_pretrain.py][INFO] Epoch:[0/2](601200/4588595) loss:2.713 lr:0.0000100 epoch_Time:25334.0min: [2024-01-05 09:19:33,600][model8_pretrain.py][INFO] Epoch:[0/2](601200/4588595) loss:2.858 lr:0.0000100 epoch_Time:25334.0min: [2024-01-05 09:19:33,600][model8_pretrain.py][INFO] Epoch:[0/2](601200/4588595) loss:2.930 lr:0.0000100 epoch_Time:25334.0min: [2024-01-05 09:19:33,600][model8_pretrain.py][INFO] Epoch:[0/2](601200/4588595) loss:3.303 lr:0.0000100 epoch_Time:25334.0min: [2024-01-05 09:19:33,600][model8_pretrain.py][INFO] Epoch:[0/2](601200/4588595) loss:3.133 lr:0.0000100 epoch_Time:25334.0min: [2024-01-05 09:19:33,600][model8_pretrain.py][INFO] Epoch:[0/2](601200/4588595) loss:2.933 lr:0.0000100 epoch_Time:25334.0min: [2024-01-05 09:19:33,600][model8_pretrain.py][INFO] Epoch:[0/2](601200/4588595) loss:2.509 lr:0.0000100 epoch_Time:25334.0min: [2024-01-05 09:19:33,600][model8_pretrain.py][INFO] Epoch:[0/2](601200/4588595) loss:3.162 lr:0.0000100 epoch_Time:25334.0min: [2024-01-05 09:20:10,543][model8_pretrain.py][INFO] Epoch:[0/2](601300/4588595) loss:2.864 lr:0.0000100 epoch_Time:25333.0min: [2024-01-05 09:20:10,543][model8_pretrain.py][INFO] Epoch:[0/2](601300/4588595) loss:3.028 lr:0.0000100 epoch_Time:25333.0min: [2024-01-05 09:20:10,543][model8_pretrain.py][INFO] Epoch:[0/2](601300/4588595) loss:2.882 lr:0.0000100 epoch_Time:25333.0min: [2024-01-05 
09:20:10,543][model8_pretrain.py][INFO] Epoch:[0/2](601300/4588595) loss:2.688 lr:0.0000100 epoch_Time:25333.0min: [2024-01-05 09:20:10,543][model8_pretrain.py][INFO] Epoch:[0/2](601300/4588595) loss:2.335 lr:0.0000100 epoch_Time:25333.0min: [2024-01-05 09:20:10,543][model8_pretrain.py][INFO] Epoch:[0/2](601300/4588595) loss:2.576 lr:0.0000100 epoch_Time:25333.0min: [2024-01-05 09:20:10,543][model8_pretrain.py][INFO] Epoch:[0/2](601300/4588595) loss:3.241 lr:0.0000100 epoch_Time:25333.0min: [2024-01-05 09:20:10,543][model8_pretrain.py][INFO] Epoch:[0/2](601300/4588595) loss:2.404 lr:0.0000100 epoch_Time:25333.0min: [2024-01-05 09:20:47,478][model8_pretrain.py][INFO] Epoch:[0/2](601400/4588595) loss:2.997 lr:0.0000100 epoch_Time:25333.0min: [2024-01-05 09:20:47,478][model8_pretrain.py][INFO] Epoch:[0/2](601400/4588595) loss:2.837 lr:0.0000100 epoch_Time:25333.0min: [2024-01-05 09:20:47,478][model8_pretrain.py][INFO] Epoch:[0/2](601400/4588595) loss:2.442 lr:0.0000100 epoch_Time:25333.0min: [2024-01-05 09:20:47,478][model8_pretrain.py][INFO] Epoch:[0/2](601400/4588595) loss:2.661 lr:0.0000100 epoch_Time:25333.0min: [2024-01-05 09:20:47,478][model8_pretrain.py][INFO] Epoch:[0/2](601400/4588595) loss:2.927 lr:0.0000100 epoch_Time:25333.0min: [2024-01-05 09:20:47,478][model8_pretrain.py][INFO] Epoch:[0/2](601400/4588595) loss:2.847 lr:0.0000100 epoch_Time:25333.0min: [2024-01-05 09:20:47,478][model8_pretrain.py][INFO] Epoch:[0/2](601400/4588595) loss:3.108 lr:0.0000100 epoch_Time:25333.0min: [2024-01-05 09:20:47,479][model8_pretrain.py][INFO] Epoch:[0/2](601400/4588595) loss:2.524 lr:0.0000100 epoch_Time:25333.0min: [2024-01-05 09:21:24,451][model8_pretrain.py][INFO] Epoch:[0/2](601500/4588595) loss:2.014 lr:0.0000100 epoch_Time:25332.0min: [2024-01-05 09:21:24,452][model8_pretrain.py][INFO] Epoch:[0/2](601500/4588595) loss:3.201 lr:0.0000100 epoch_Time:25332.0min: [2024-01-05 09:21:24,452][model8_pretrain.py][INFO] Epoch:[0/2](601500/4588595) loss:3.329 lr:0.0000100 
epoch_Time:25332.0min: [2024-01-05 09:21:24,452][model8_pretrain.py][INFO] Epoch:[0/2](601500/4588595) loss:2.722 lr:0.0000100 epoch_Time:25332.0min: [2024-01-05 09:21:24,452][model8_pretrain.py][INFO] Epoch:[0/2](601500/4588595) loss:3.297 lr:0.0000100 epoch_Time:25332.0min: [2024-01-05 09:21:24,452][model8_pretrain.py][INFO] Epoch:[0/2](601500/4588595) loss:2.204 lr:0.0000100 epoch_Time:25332.0min: [2024-01-05 09:21:24,452][model8_pretrain.py][INFO] Epoch:[0/2](601500/4588595) loss:2.847 lr:0.0000100 epoch_Time:25332.0min: [2024-01-05 09:21:24,452][model8_pretrain.py][INFO] Epoch:[0/2](601500/4588595) loss:2.542 lr:0.0000100 epoch_Time:25332.0min: [2024-01-05 09:22:01,401][model8_pretrain.py][INFO] Epoch:[0/2](601600/4588595) loss:3.027 lr:0.0000100 epoch_Time:25331.0min: [2024-01-05 09:22:01,401][model8_pretrain.py][INFO] Epoch:[0/2](601600/4588595) loss:2.682 lr:0.0000100 epoch_Time:25331.0min: [2024-01-05 09:22:01,401][model8_pretrain.py][INFO] Epoch:[0/2](601600/4588595) loss:2.393 lr:0.0000100 epoch_Time:25331.0min: [2024-01-05 09:22:01,401][model8_pretrain.py][INFO] Epoch:[0/2](601600/4588595) loss:2.982 lr:0.0000100 epoch_Time:25331.0min: [2024-01-05 09:22:01,401][model8_pretrain.py][INFO] Epoch:[0/2](601600/4588595) loss:2.915 lr:0.0000100 epoch_Time:25331.0min: [2024-01-05 09:22:01,401][model8_pretrain.py][INFO] Epoch:[0/2](601600/4588595) loss:3.201 lr:0.0000100 epoch_Time:25331.0min: [2024-01-05 09:22:01,401][model8_pretrain.py][INFO] Epoch:[0/2](601600/4588595) loss:3.260 lr:0.0000100 epoch_Time:25331.0min: [2024-01-05 09:22:01,401][model8_pretrain.py][INFO] Epoch:[0/2](601600/4588595) loss:2.932 lr:0.0000100 epoch_Time:25331.0min: [2024-01-05 09:22:38,340][model8_pretrain.py][INFO] Epoch:[0/2](601700/4588595) loss:2.505 lr:0.0000100 epoch_Time:25331.0min: [2024-01-05 09:22:38,340][model8_pretrain.py][INFO] Epoch:[0/2](601700/4588595) loss:2.943 lr:0.0000100 epoch_Time:25331.0min: [2024-01-05 09:22:38,340][model8_pretrain.py][INFO] 
Epoch:[0/2](601700/4588595) loss:2.616 lr:0.0000100 epoch_Time:25331.0min: [2024-01-05 09:22:38,340][model8_pretrain.py][INFO] Epoch:[0/2](601700/4588595) loss:2.294 lr:0.0000100 epoch_Time:25331.0min: [2024-01-05 09:22:38,340][model8_pretrain.py][INFO] Epoch:[0/2](601700/4588595) loss:3.264 lr:0.0000100 epoch_Time:25331.0min: [2024-01-05 09:22:38,340][model8_pretrain.py][INFO] Epoch:[0/2](601700/4588595) loss:3.174 lr:0.0000100 epoch_Time:25331.0min: [2024-01-05 09:22:38,340][model8_pretrain.py][INFO] Epoch:[0/2](601700/4588595) loss:2.945 lr:0.0000100 epoch_Time:25331.0min: [2024-01-05 09:22:38,340][model8_pretrain.py][INFO] Epoch:[0/2](601700/4588595) loss:3.043 lr:0.0000100 epoch_Time:25331.0min: [2024-01-05 09:23:27,415][model8_pretrain.py][INFO] Epoch:[0/2](601800/4588595) loss:2.766 lr:0.0000100 epoch_Time:25331.0min: [2024-01-05 09:23:27,415][model8_pretrain.py][INFO] Epoch:[0/2](601800/4588595) loss:3.125 lr:0.0000100 epoch_Time:25331.0min: [2024-01-05 09:23:27,415][model8_pretrain.py][INFO] Epoch:[0/2](601800/4588595) loss:1.661 lr:0.0000100 epoch_Time:25331.0min: [2024-01-05 09:23:27,415][model8_pretrain.py][INFO] Epoch:[0/2](601800/4588595) loss:2.960 lr:0.0000100 epoch_Time:25331.0min: [2024-01-05 09:23:27,415][model8_pretrain.py][INFO] Epoch:[0/2](601800/4588595) loss:2.843 lr:0.0000100 epoch_Time:25331.0min: [2024-01-05 09:23:27,415][model8_pretrain.py][INFO] Epoch:[0/2](601800/4588595) loss:3.350 lr:0.0000100 epoch_Time:25331.0min: [2024-01-05 09:23:27,415][model8_pretrain.py][INFO] Epoch:[0/2](601800/4588595) loss:2.727 lr:0.0000100 epoch_Time:25331.0min: [2024-01-05 09:23:27,416][model8_pretrain.py][INFO] Epoch:[0/2](601800/4588595) loss:3.050 lr:0.0000100 epoch_Time:25331.0min: [2024-01-05 09:24:04,358][model8_pretrain.py][INFO] Epoch:[0/2](601900/4588595) loss:2.634 lr:0.0000100 epoch_Time:25330.0min: [2024-01-05 09:24:04,358][model8_pretrain.py][INFO] Epoch:[0/2](601900/4588595) loss:2.779 lr:0.0000100 epoch_Time:25330.0min: [2024-01-05 
09:24:04,358][model8_pretrain.py][INFO] Epoch:[0/2](601900/4588595) loss:3.270 lr:0.0000100 epoch_Time:25330.0min: [2024-01-05 09:24:04,358][model8_pretrain.py][INFO] Epoch:[0/2](601900/4588595) loss:2.544 lr:0.0000100 epoch_Time:25330.0min: [2024-01-05 09:24:04,358][model8_pretrain.py][INFO] Epoch:[0/2](601900/4588595) loss:2.937 lr:0.0000100 epoch_Time:25330.0min: [2024-01-05 09:24:04,358][model8_pretrain.py][INFO] Epoch:[0/2](601900/4588595) loss:3.110 lr:0.0000100 epoch_Time:25330.0min: [2024-01-05 09:24:04,358][model8_pretrain.py][INFO] Epoch:[0/2](601900/4588595) loss:3.101 lr:0.0000100 epoch_Time:25330.0min: [2024-01-05 09:24:04,359][model8_pretrain.py][INFO] Epoch:[0/2](601900/4588595) loss:2.642 lr:0.0000100 epoch_Time:25330.0min: [2024-01-05 09:24:41,297][model8_pretrain.py][INFO] Epoch:[0/2](602000/4588595) loss:2.601 lr:0.0000100 epoch_Time:25330.0min: [2024-01-05 09:24:41,297][model8_pretrain.py][INFO] Epoch:[0/2](602000/4588595) loss:2.845 lr:0.0000100 epoch_Time:25330.0min: [2024-01-05 09:24:41,297][model8_pretrain.py][INFO] Epoch:[0/2](602000/4588595) loss:2.506 lr:0.0000100 epoch_Time:25330.0min: [2024-01-05 09:24:41,297][model8_pretrain.py][INFO] Epoch:[0/2](602000/4588595) loss:2.918 lr:0.0000100 epoch_Time:25330.0min: [2024-01-05 09:24:41,297][model8_pretrain.py][INFO] Epoch:[0/2](602000/4588595) loss:3.389 lr:0.0000100 epoch_Time:25330.0min: [2024-01-05 09:24:41,297][model8_pretrain.py][INFO] Epoch:[0/2](602000/4588595) loss:2.781 lr:0.0000100 epoch_Time:25330.0min: [2024-01-05 09:24:41,297][model8_pretrain.py][INFO] Epoch:[0/2](602000/4588595) loss:2.202 lr:0.0000100 epoch_Time:25330.0min: [2024-01-05 09:24:41,297][model8_pretrain.py][INFO] Epoch:[0/2](602000/4588595) loss:2.826 lr:0.0000100 epoch_Time:25330.0min: [2024-01-05 09:25:18,237][model8_pretrain.py][INFO] Epoch:[0/2](602100/4588595) loss:2.688 lr:0.0000100 epoch_Time:25329.0min: [2024-01-05 09:25:18,237][model8_pretrain.py][INFO] Epoch:[0/2](602100/4588595) loss:3.174 lr:0.0000100 
epoch_Time:25329.0min: [2024-01-05 09:25:18,237][model8_pretrain.py][INFO] Epoch:[0/2](602100/4588595) loss:3.230 lr:0.0000100 epoch_Time:25329.0min: [2024-01-05 09:25:18,237][model8_pretrain.py][INFO] Epoch:[0/2](602100/4588595) loss:3.219 lr:0.0000100 epoch_Time:25329.0min: [2024-01-05 09:25:18,237][model8_pretrain.py][INFO] Epoch:[0/2](602100/4588595) loss:3.210 lr:0.0000100 epoch_Time:25329.0min: [2024-01-05 09:25:18,237][model8_pretrain.py][INFO] Epoch:[0/2](602100/4588595) loss:2.891 lr:0.0000100 epoch_Time:25329.0min: [2024-01-05 09:25:18,237][model8_pretrain.py][INFO] Epoch:[0/2](602100/4588595) loss:2.939 lr:0.0000100 epoch_Time:25329.0min: [2024-01-05 09:25:18,237][model8_pretrain.py][INFO] Epoch:[0/2](602100/4588595) loss:2.718 lr:0.0000100 epoch_Time:25329.0min: [2024-01-05 09:25:55,174][model8_pretrain.py][INFO] Epoch:[0/2](602200/4588595) loss:2.877 lr:0.0000100 epoch_Time:25327.0min: [2024-01-05 09:25:55,174][model8_pretrain.py][INFO] Epoch:[0/2](602200/4588595) loss:2.899 lr:0.0000100 epoch_Time:25327.0min: [2024-01-05 09:25:55,174][model8_pretrain.py][INFO] Epoch:[0/2](602200/4588595) loss:2.965 lr:0.0000100 epoch_Time:25327.0min: [2024-01-05 09:25:55,174][model8_pretrain.py][INFO] Epoch:[0/2](602200/4588595) loss:2.492 lr:0.0000100 epoch_Time:25327.0min: [2024-01-05 09:25:55,175][model8_pretrain.py][INFO] Epoch:[0/2](602200/4588595) loss:2.757 lr:0.0000100 epoch_Time:25327.0min: [2024-01-05 09:25:55,175][model8_pretrain.py][INFO] Epoch:[0/2](602200/4588595) loss:2.974 lr:0.0000100 epoch_Time:25327.0min: [2024-01-05 09:25:55,175][model8_pretrain.py][INFO] Epoch:[0/2](602200/4588595) loss:2.686 lr:0.0000100 epoch_Time:25327.0min: [2024-01-05 09:25:55,175][model8_pretrain.py][INFO] Epoch:[0/2](602200/4588595) loss:2.970 lr:0.0000100 epoch_Time:25327.0min: [2024-01-05 09:26:32,124][model8_pretrain.py][INFO] Epoch:[0/2](602300/4588595) loss:2.655 lr:0.0000100 epoch_Time:25327.0min: [2024-01-05 09:26:32,124][model8_pretrain.py][INFO] 
Epoch:[0/2](602300/4588595) loss:3.371 lr:0.0000100 epoch_Time:25327.0min: [2024-01-05 09:26:32,124][model8_pretrain.py][INFO] Epoch:[0/2](602300/4588595) loss:2.844 lr:0.0000100 epoch_Time:25327.0min: [2024-01-05 09:26:32,124][model8_pretrain.py][INFO] Epoch:[0/2](602300/4588595) loss:3.043 lr:0.0000100 epoch_Time:25327.0min: [2024-01-05 09:26:32,124][model8_pretrain.py][INFO] Epoch:[0/2](602300/4588595) loss:3.085 lr:0.0000100 epoch_Time:25327.0min: [2024-01-05 09:26:32,124][model8_pretrain.py][INFO] Epoch:[0/2](602300/4588595) loss:2.870 lr:0.0000100 epoch_Time:25327.0min: [2024-01-05 09:26:32,124][model8_pretrain.py][INFO] Epoch:[0/2](602300/4588595) loss:3.097 lr:0.0000100 epoch_Time:25327.0min: [2024-01-05 09:26:32,124][model8_pretrain.py][INFO] Epoch:[0/2](602300/4588595) loss:2.762 lr:0.0000100 epoch_Time:25327.0min: [2024-01-05 09:27:09,076][model8_pretrain.py][INFO] Epoch:[0/2](602400/4588595) loss:2.941 lr:0.0000100 epoch_Time:25326.0min: [2024-01-05 09:27:09,076][model8_pretrain.py][INFO] Epoch:[0/2](602400/4588595) loss:2.559 lr:0.0000100 epoch_Time:25326.0min: [2024-01-05 09:27:09,076][model8_pretrain.py][INFO] Epoch:[0/2](602400/4588595) loss:2.861 lr:0.0000100 epoch_Time:25326.0min: [2024-01-05 09:27:09,076][model8_pretrain.py][INFO] Epoch:[0/2](602400/4588595) loss:2.641 lr:0.0000100 epoch_Time:25326.0min: [2024-01-05 09:27:09,076][model8_pretrain.py][INFO] Epoch:[0/2](602400/4588595) loss:3.010 lr:0.0000100 epoch_Time:25326.0min: [2024-01-05 09:27:09,076][model8_pretrain.py][INFO] Epoch:[0/2](602400/4588595) loss:2.639 lr:0.0000100 epoch_Time:25326.0min: [2024-01-05 09:27:09,076][model8_pretrain.py][INFO] Epoch:[0/2](602400/4588595) loss:2.845 lr:0.0000100 epoch_Time:25326.0min: [2024-01-05 09:27:09,076][model8_pretrain.py][INFO] Epoch:[0/2](602400/4588595) loss:2.530 lr:0.0000100 epoch_Time:25326.0min: [2024-01-05 09:27:46,012][model8_pretrain.py][INFO] Epoch:[0/2](602500/4588595) loss:2.717 lr:0.0000100 epoch_Time:25326.0min: [2024-01-05 
09:27:46,012][model8_pretrain.py][INFO] Epoch:[0/2](602500/4588595) loss:2.772 lr:0.0000100 epoch_Time:25326.0min: [2024-01-05 09:27:46,012][model8_pretrain.py][INFO] Epoch:[0/2](602500/4588595) loss:3.702 lr:0.0000100 epoch_Time:25326.0min: [2024-01-05 09:27:46,012][model8_pretrain.py][INFO] Epoch:[0/2](602500/4588595) loss:3.174 lr:0.0000100 epoch_Time:25326.0min: [2024-01-05 09:27:46,012][model8_pretrain.py][INFO] Epoch:[0/2](602500/4588595) loss:2.336 lr:0.0000100 epoch_Time:25326.0min: [2024-01-05 09:27:46,012][model8_pretrain.py][INFO] Epoch:[0/2](602500/4588595) loss:2.847 lr:0.0000100 epoch_Time:25326.0min: [2024-01-05 09:27:46,013][model8_pretrain.py][INFO] Epoch:[0/2](602500/4588595) loss:3.012 lr:0.0000100 epoch_Time:25326.0min: [2024-01-05 09:27:46,013][model8_pretrain.py][INFO] Epoch:[0/2](602500/4588595) loss:3.289 lr:0.0000100 epoch_Time:25326.0min: [2024-01-05 09:28:35,118][model8_pretrain.py][INFO] Epoch:[0/2](602600/4588595) loss:2.848 lr:0.0000100 epoch_Time:25326.0min: [2024-01-05 09:28:35,118][model8_pretrain.py][INFO] Epoch:[0/2](602600/4588595) loss:3.145 lr:0.0000100 epoch_Time:25326.0min: [2024-01-05 09:28:35,118][model8_pretrain.py][INFO] Epoch:[0/2](602600/4588595) loss:2.602 lr:0.0000100 epoch_Time:25326.0min: [2024-01-05 09:28:35,119][model8_pretrain.py][INFO] Epoch:[0/2](602600/4588595) loss:3.108 lr:0.0000100 epoch_Time:25326.0min: [2024-01-05 09:28:35,118][model8_pretrain.py][INFO] Epoch:[0/2](602600/4588595) loss:2.927 lr:0.0000100 epoch_Time:25326.0min: [2024-01-05 09:28:35,118][model8_pretrain.py][INFO] Epoch:[0/2](602600/4588595) loss:2.751 lr:0.0000100 epoch_Time:25326.0min: [2024-01-05 09:28:35,119][model8_pretrain.py][INFO] Epoch:[0/2](602600/4588595) loss:3.214 lr:0.0000100 epoch_Time:25326.0min: [2024-01-05 09:28:35,118][model8_pretrain.py][INFO] Epoch:[0/2](602600/4588595) loss:3.180 lr:0.0000100 epoch_Time:25326.0min: [2024-01-05 09:29:12,050][model8_pretrain.py][INFO] Epoch:[0/2](602700/4588595) loss:3.598 lr:0.0000100 
epoch_Time:25325.0min: [2024-01-05 09:29:12,050][model8_pretrain.py][INFO] Epoch:[0/2](602700/4588595) loss:2.992 lr:0.0000100 epoch_Time:25325.0min: [2024-01-05 09:29:12,050][model8_pretrain.py][INFO] Epoch:[0/2](602700/4588595) loss:2.891 lr:0.0000100 epoch_Time:25325.0min: [2024-01-05 09:29:12,050][model8_pretrain.py][INFO] Epoch:[0/2](602700/4588595) loss:2.827 lr:0.0000100 epoch_Time:25325.0min: [2024-01-05 09:29:12,050][model8_pretrain.py][INFO] Epoch:[0/2](602700/4588595) loss:3.498 lr:0.0000100 epoch_Time:25325.0min: [2024-01-05 09:29:12,050][model8_pretrain.py][INFO] Epoch:[0/2](602700/4588595) loss:3.297 lr:0.0000100 epoch_Time:25325.0min: [2024-01-05 09:29:12,050][model8_pretrain.py][INFO] Epoch:[0/2](602700/4588595) loss:3.166 lr:0.0000100 epoch_Time:25325.0min: [2024-01-05 09:29:12,051][model8_pretrain.py][INFO] Epoch:[0/2](602700/4588595) loss:3.399 lr:0.0000100 epoch_Time:25325.0min: [2024-01-05 09:29:48,991][model8_pretrain.py][INFO] Epoch:[0/2](602800/4588595) loss:2.900 lr:0.0000100 epoch_Time:25324.0min: [2024-01-05 09:29:48,991][model8_pretrain.py][INFO] Epoch:[0/2](602800/4588595) loss:2.489 lr:0.0000100 epoch_Time:25324.0min: [2024-01-05 09:29:48,991][model8_pretrain.py][INFO] Epoch:[0/2](602800/4588595) loss:2.982 lr:0.0000100 epoch_Time:25324.0min: [2024-01-05 09:29:48,991][model8_pretrain.py][INFO] Epoch:[0/2](602800/4588595) loss:2.453 lr:0.0000100 epoch_Time:25324.0min: [2024-01-05 09:29:48,991][model8_pretrain.py][INFO] Epoch:[0/2](602800/4588595) loss:2.904 lr:0.0000100 epoch_Time:25324.0min: [2024-01-05 09:29:48,991][model8_pretrain.py][INFO] Epoch:[0/2](602800/4588595) loss:3.031 lr:0.0000100 epoch_Time:25324.0min: [2024-01-05 09:29:48,991][model8_pretrain.py][INFO] Epoch:[0/2](602800/4588595) loss:2.871 lr:0.0000100 epoch_Time:25324.0min: [2024-01-05 09:29:48,991][model8_pretrain.py][INFO] Epoch:[0/2](602800/4588595) loss:2.997 lr:0.0000100 epoch_Time:25324.0min: [2024-01-05 09:30:25,931][model8_pretrain.py][INFO] 
Epoch:[0/2](602900/4588595) loss:2.862 lr:0.0000100 epoch_Time:25324.0min: [2024-01-05 09:30:25,931][model8_pretrain.py][INFO] Epoch:[0/2](602900/4588595) loss:1.851 lr:0.0000100 epoch_Time:25324.0min: [2024-01-05 09:30:25,931][model8_pretrain.py][INFO] Epoch:[0/2](602900/4588595) loss:3.025 lr:0.0000100 epoch_Time:25324.0min: [2024-01-05 09:30:25,931][model8_pretrain.py][INFO] Epoch:[0/2](602900/4588595) loss:3.016 lr:0.0000100 epoch_Time:25324.0min: [2024-01-05 09:30:25,931][model8_pretrain.py][INFO] Epoch:[0/2](602900/4588595) loss:3.135 lr:0.0000100 epoch_Time:25324.0min: [2024-01-05 09:30:25,931][model8_pretrain.py][INFO] Epoch:[0/2](602900/4588595) loss:3.289 lr:0.0000100 epoch_Time:25324.0min: [2024-01-05 09:30:25,931][model8_pretrain.py][INFO] Epoch:[0/2](602900/4588595) loss:3.187 lr:0.0000100 epoch_Time:25324.0min: [2024-01-05 09:30:25,932][model8_pretrain.py][INFO] Epoch:[0/2](602900/4588595) loss:3.551 lr:0.0000100 epoch_Time:25324.0min: [2024-01-05 09:31:02,877][model8_pretrain.py][INFO] Epoch:[0/2](603000/4588595) loss:2.746 lr:0.0000100 epoch_Time:25323.0min: [2024-01-05 09:31:02,877][model8_pretrain.py][INFO] Epoch:[0/2](603000/4588595) loss:2.677 lr:0.0000100 epoch_Time:25323.0min: [2024-01-05 09:31:02,877][model8_pretrain.py][INFO] Epoch:[0/2](603000/4588595) loss:2.849 lr:0.0000100 epoch_Time:25323.0min: [2024-01-05 09:31:02,877][model8_pretrain.py][INFO] Epoch:[0/2](603000/4588595) loss:3.074 lr:0.0000100 epoch_Time:25323.0min: [2024-01-05 09:31:02,877][model8_pretrain.py][INFO] Epoch:[0/2](603000/4588595) loss:2.422 lr:0.0000100 epoch_Time:25323.0min: [2024-01-05 09:31:02,877][model8_pretrain.py][INFO] Epoch:[0/2](603000/4588595) loss:2.763 lr:0.0000100 epoch_Time:25323.0min: [2024-01-05 09:31:02,877][model8_pretrain.py][INFO] Epoch:[0/2](603000/4588595) loss:2.692 lr:0.0000100 epoch_Time:25323.0min: [2024-01-05 09:31:02,877][model8_pretrain.py][INFO] Epoch:[0/2](603000/4588595) loss:2.732 lr:0.0000100 epoch_Time:25323.0min: [2024-01-05 
09:31:39,822][model8_pretrain.py][INFO] Epoch:[0/2](603100/4588595) loss:3.070 lr:0.0000100 epoch_Time:25323.0min: [2024-01-05 09:31:39,822][model8_pretrain.py][INFO] Epoch:[0/2](603100/4588595) loss:2.848 lr:0.0000100 epoch_Time:25323.0min: [2024-01-05 09:31:39,822][model8_pretrain.py][INFO] Epoch:[0/2](603100/4588595) loss:2.722 lr:0.0000100 epoch_Time:25323.0min: [2024-01-05 09:31:39,822][model8_pretrain.py][INFO] Epoch:[0/2](603100/4588595) loss:3.041 lr:0.0000100 epoch_Time:25323.0min: [2024-01-05 09:31:39,822][model8_pretrain.py][INFO] Epoch:[0/2](603100/4588595) loss:3.396 lr:0.0000100 epoch_Time:25323.0min: [2024-01-05 09:31:39,822][model8_pretrain.py][INFO] Epoch:[0/2](603100/4588595) loss:1.896 lr:0.0000100 epoch_Time:25323.0min: [2024-01-05 09:31:39,822][model8_pretrain.py][INFO] Epoch:[0/2](603100/4588595) loss:3.278 lr:0.0000100 epoch_Time:25323.0min: [2024-01-05 09:31:39,823][model8_pretrain.py][INFO] Epoch:[0/2](603100/4588595) loss:2.645 lr:0.0000100 epoch_Time:25323.0min: [2024-01-05 09:32:16,769][model8_pretrain.py][INFO] Epoch:[0/2](603200/4588595) loss:2.761 lr:0.0000100 epoch_Time:25322.0min: [2024-01-05 09:32:16,769][model8_pretrain.py][INFO] Epoch:[0/2](603200/4588595) loss:2.667 lr:0.0000100 epoch_Time:25322.0min: [2024-01-05 09:32:16,769][model8_pretrain.py][INFO] Epoch:[0/2](603200/4588595) loss:2.995 lr:0.0000100 epoch_Time:25322.0min: [2024-01-05 09:32:16,769][model8_pretrain.py][INFO] Epoch:[0/2](603200/4588595) loss:2.858 lr:0.0000100 epoch_Time:25322.0min: [2024-01-05 09:32:16,769][model8_pretrain.py][INFO] Epoch:[0/2](603200/4588595) loss:3.020 lr:0.0000100 epoch_Time:25322.0min: [2024-01-05 09:32:16,769][model8_pretrain.py][INFO] Epoch:[0/2](603200/4588595) loss:3.088 lr:0.0000100 epoch_Time:25322.0min: [2024-01-05 09:32:16,769][model8_pretrain.py][INFO] Epoch:[0/2](603200/4588595) loss:2.708 lr:0.0000100 epoch_Time:25322.0min: [2024-01-05 09:32:16,769][model8_pretrain.py][INFO] Epoch:[0/2](603200/4588595) loss:3.270 lr:0.0000100 
epoch_Time:25322.0min: [2024-01-05 09:32:53,680][model8_pretrain.py][INFO] Epoch:[0/2](603300/4588595) loss:2.753 lr:0.0000100 epoch_Time:25320.0min: [2024-01-05 09:32:53,680][model8_pretrain.py][INFO] Epoch:[0/2](603300/4588595) loss:2.890 lr:0.0000100 epoch_Time:25320.0min: [2024-01-05 09:32:53,680][model8_pretrain.py][INFO] Epoch:[0/2](603300/4588595) loss:3.026 lr:0.0000100 epoch_Time:25320.0min: [2024-01-05 09:32:53,680][model8_pretrain.py][INFO] Epoch:[0/2](603300/4588595) loss:2.766 lr:0.0000100 epoch_Time:25320.0min: [2024-01-05 09:32:53,680][model8_pretrain.py][INFO] Epoch:[0/2](603300/4588595) loss:3.078 lr:0.0000100 epoch_Time:25320.0min: [2024-01-05 09:32:53,680][model8_pretrain.py][INFO] Epoch:[0/2](603300/4588595) loss:3.516 lr:0.0000100 epoch_Time:25320.0min: [2024-01-05 09:32:53,680][model8_pretrain.py][INFO] Epoch:[0/2](603300/4588595) loss:2.633 lr:0.0000100 epoch_Time:25320.0min: [2024-01-05 09:32:53,680][model8_pretrain.py][INFO] Epoch:[0/2](603300/4588595) loss:2.941 lr:0.0000100 epoch_Time:25320.0min: [2024-01-05 09:33:42,722][model8_pretrain.py][INFO] Epoch:[0/2](603400/4588595) loss:3.233 lr:0.0000100 epoch_Time:25322.0min: [2024-01-05 09:33:42,722][model8_pretrain.py][INFO] Epoch:[0/2](603400/4588595) loss:3.038 lr:0.0000100 epoch_Time:25322.0min: [2024-01-05 09:33:42,722][model8_pretrain.py][INFO] Epoch:[0/2](603400/4588595) loss:2.698 lr:0.0000100 epoch_Time:25322.0min: [2024-01-05 09:33:42,722][model8_pretrain.py][INFO] Epoch:[0/2](603400/4588595) loss:2.661 lr:0.0000100 epoch_Time:25322.0min: [2024-01-05 09:33:42,722][model8_pretrain.py][INFO] Epoch:[0/2](603400/4588595) loss:2.395 lr:0.0000100 epoch_Time:25322.0min: [2024-01-05 09:33:42,722][model8_pretrain.py][INFO] Epoch:[0/2](603400/4588595) loss:2.270 lr:0.0000100 epoch_Time:25322.0min: [2024-01-05 09:33:42,722][model8_pretrain.py][INFO] Epoch:[0/2](603400/4588595) loss:2.885 lr:0.0000100 epoch_Time:25322.0min: [2024-01-05 09:33:42,722][model8_pretrain.py][INFO] 
Epoch:[0/2](603400/4588595) loss:2.784 lr:0.0000100 epoch_Time:25322.0min: [2024-01-05 09:34:19,657][model8_pretrain.py][INFO] Epoch:[0/2](603500/4588595) loss:3.145 lr:0.0000100 epoch_Time:25321.0min: [2024-01-05 09:34:19,657][model8_pretrain.py][INFO] Epoch:[0/2](603500/4588595) loss:2.326 lr:0.0000100 epoch_Time:25321.0min: [2024-01-05 09:34:19,657][model8_pretrain.py][INFO] Epoch:[0/2](603500/4588595) loss:3.202 lr:0.0000100 epoch_Time:25321.0min: [2024-01-05 09:34:19,657][model8_pretrain.py][INFO] Epoch:[0/2](603500/4588595) loss:2.701 lr:0.0000100 epoch_Time:25321.0min: [2024-01-05 09:34:19,657][model8_pretrain.py][INFO] Epoch:[0/2](603500/4588595) loss:3.026 lr:0.0000100 epoch_Time:25321.0min: [2024-01-05 09:34:19,657][model8_pretrain.py][INFO] Epoch:[0/2](603500/4588595) loss:2.832 lr:0.0000100 epoch_Time:25321.0min: [2024-01-05 09:34:19,657][model8_pretrain.py][INFO] Epoch:[0/2](603500/4588595) loss:2.818 lr:0.0000100 epoch_Time:25321.0min: [2024-01-05 09:34:19,657][model8_pretrain.py][INFO] Epoch:[0/2](603500/4588595) loss:3.523 lr:0.0000100 epoch_Time:25321.0min: [2024-01-05 09:34:56,611][model8_pretrain.py][INFO] Epoch:[0/2](603600/4588595) loss:2.536 lr:0.0000100 epoch_Time:25319.0min: [2024-01-05 09:34:56,611][model8_pretrain.py][INFO] Epoch:[0/2](603600/4588595) loss:2.633 lr:0.0000100 epoch_Time:25319.0min: [2024-01-05 09:34:56,611][model8_pretrain.py][INFO] Epoch:[0/2](603600/4588595) loss:2.843 lr:0.0000100 epoch_Time:25319.0min: [2024-01-05 09:34:56,611][model8_pretrain.py][INFO] Epoch:[0/2](603600/4588595) loss:2.908 lr:0.0000100 epoch_Time:25319.0min: [2024-01-05 09:34:56,611][model8_pretrain.py][INFO] Epoch:[0/2](603600/4588595) loss:2.610 lr:0.0000100 epoch_Time:25319.0min: [2024-01-05 09:34:56,611][model8_pretrain.py][INFO] Epoch:[0/2](603600/4588595) loss:2.681 lr:0.0000100 epoch_Time:25319.0min: [2024-01-05 09:34:56,611][model8_pretrain.py][INFO] Epoch:[0/2](603600/4588595) loss:3.379 lr:0.0000100 epoch_Time:25319.0min: [2024-01-05 
09:34:56,612][model8_pretrain.py][INFO] Epoch:[0/2](603600/4588595) loss:3.016 lr:0.0000100 epoch_Time:25319.0min: [2024-01-05 09:35:33,573][model8_pretrain.py][INFO] Epoch:[0/2](603700/4588595) loss:3.121 lr:0.0000100 epoch_Time:25319.0min: [2024-01-05 09:35:33,573][model8_pretrain.py][INFO] Epoch:[0/2](603700/4588595) loss:2.671 lr:0.0000100 epoch_Time:25319.0min: [2024-01-05 09:35:33,573][model8_pretrain.py][INFO] Epoch:[0/2](603700/4588595) loss:3.122 lr:0.0000100 epoch_Time:25319.0min: [2024-01-05 09:35:33,573][model8_pretrain.py][INFO] Epoch:[0/2](603700/4588595) loss:2.592 lr:0.0000100 epoch_Time:25319.0min: [2024-01-05 09:35:33,573][model8_pretrain.py][INFO] Epoch:[0/2](603700/4588595) loss:3.222 lr:0.0000100 epoch_Time:25319.0min: [2024-01-05 09:35:33,573][model8_pretrain.py][INFO] Epoch:[0/2](603700/4588595) loss:2.774 lr:0.0000100 epoch_Time:25319.0min: [2024-01-05 09:35:33,573][model8_pretrain.py][INFO] Epoch:[0/2](603700/4588595) loss:3.250 lr:0.0000100 epoch_Time:25319.0min: [2024-01-05 09:35:33,573][model8_pretrain.py][INFO] Epoch:[0/2](603700/4588595) loss:2.597 lr:0.0000100 epoch_Time:25319.0min: [2024-01-05 09:36:10,540][model8_pretrain.py][INFO] Epoch:[0/2](603800/4588595) loss:3.312 lr:0.0000100 epoch_Time:25318.0min: [2024-01-05 09:36:10,540][model8_pretrain.py][INFO] Epoch:[0/2](603800/4588595) loss:3.120 lr:0.0000100 epoch_Time:25318.0min: [2024-01-05 09:36:10,540][model8_pretrain.py][INFO] Epoch:[0/2](603800/4588595) loss:2.667 lr:0.0000100 epoch_Time:25318.0min: [2024-01-05 09:36:10,540][model8_pretrain.py][INFO] Epoch:[0/2](603800/4588595) loss:2.739 lr:0.0000100 epoch_Time:25318.0min: [2024-01-05 09:36:10,540][model8_pretrain.py][INFO] Epoch:[0/2](603800/4588595) loss:3.013 lr:0.0000100 epoch_Time:25318.0min: [2024-01-05 09:36:10,541][model8_pretrain.py][INFO] Epoch:[0/2](603800/4588595) loss:2.353 lr:0.0000100 epoch_Time:25318.0min: [2024-01-05 09:36:10,541][model8_pretrain.py][INFO] Epoch:[0/2](603800/4588595) loss:2.529 lr:0.0000100 
epoch_Time:25318.0min: [2024-01-05 09:36:10,547][model8_pretrain.py][INFO] Epoch:[0/2](603800/4588595) loss:3.075 lr:0.0000100 epoch_Time:25318.0min: [2024-01-05 09:36:47,501][model8_pretrain.py][INFO] Epoch:[0/2](603900/4588595) loss:2.490 lr:0.0000100 epoch_Time:25318.0min: [2024-01-05 09:36:47,501][model8_pretrain.py][INFO] Epoch:[0/2](603900/4588595) loss:2.713 lr:0.0000100 epoch_Time:25318.0min: [2024-01-05 09:36:47,501][model8_pretrain.py][INFO] Epoch:[0/2](603900/4588595) loss:2.885 lr:0.0000100 epoch_Time:25318.0min: [2024-01-05 09:36:47,501][model8_pretrain.py][INFO] Epoch:[0/2](603900/4588595) loss:2.856 lr:0.0000100 epoch_Time:25318.0min: [2024-01-05 09:36:47,501][model8_pretrain.py][INFO] Epoch:[0/2](603900/4588595) loss:3.120 lr:0.0000100 epoch_Time:25318.0min: [2024-01-05 09:36:47,501][model8_pretrain.py][INFO] Epoch:[0/2](603900/4588595) loss:3.069 lr:0.0000100 epoch_Time:25318.0min: [2024-01-05 09:36:47,501][model8_pretrain.py][INFO] Epoch:[0/2](603900/4588595) loss:2.865 lr:0.0000100 epoch_Time:25318.0min: [2024-01-05 09:36:47,501][model8_pretrain.py][INFO] Epoch:[0/2](603900/4588595) loss:2.980 lr:0.0000100 epoch_Time:25318.0min: [2024-01-05 09:37:24,457][model8_pretrain.py][INFO] Epoch:[0/2](604000/4588595) loss:2.667 lr:0.0000100 epoch_Time:25317.0min: [2024-01-05 09:37:24,457][model8_pretrain.py][INFO] Epoch:[0/2](604000/4588595) loss:2.571 lr:0.0000100 epoch_Time:25317.0min: [2024-01-05 09:37:24,457][model8_pretrain.py][INFO] Epoch:[0/2](604000/4588595) loss:2.191 lr:0.0000100 epoch_Time:25317.0min: [2024-01-05 09:37:24,457][model8_pretrain.py][INFO] Epoch:[0/2](604000/4588595) loss:3.284 lr:0.0000100 epoch_Time:25317.0min: [2024-01-05 09:37:24,457][model8_pretrain.py][INFO] Epoch:[0/2](604000/4588595) loss:3.138 lr:0.0000100 epoch_Time:25317.0min: [2024-01-05 09:37:24,457][model8_pretrain.py][INFO] Epoch:[0/2](604000/4588595) loss:2.891 lr:0.0000100 epoch_Time:25317.0min: [2024-01-05 09:37:24,457][model8_pretrain.py][INFO] 
Epoch:[0/2](604000/4588595) loss:2.724 lr:0.0000100 epoch_Time:25317.0min: [2024-01-05 09:37:24,457][model8_pretrain.py][INFO] Epoch:[0/2](604000/4588595) loss:2.822 lr:0.0000100 epoch_Time:25317.0min: [2024-01-05 09:38:01,418][model8_pretrain.py][INFO] Epoch:[0/2](604100/4588595) loss:3.235 lr:0.0000100 epoch_Time:25316.0min: [2024-01-05 09:38:01,419][model8_pretrain.py][INFO] Epoch:[0/2](604100/4588595) loss:2.449 lr:0.0000100 epoch_Time:25316.0min: [2024-01-05 09:38:01,419][model8_pretrain.py][INFO] Epoch:[0/2](604100/4588595) loss:2.640 lr:0.0000100 epoch_Time:25316.0min: [2024-01-05 09:38:01,419][model8_pretrain.py][INFO] Epoch:[0/2](604100/4588595) loss:2.708 lr:0.0000100 epoch_Time:25316.0min: [2024-01-05 09:38:01,419][model8_pretrain.py][INFO] Epoch:[0/2](604100/4588595) loss:2.788 lr:0.0000100 epoch_Time:25316.0min: [2024-01-05 09:38:01,419][model8_pretrain.py][INFO] Epoch:[0/2](604100/4588595) loss:3.028 lr:0.0000100 epoch_Time:25316.0min: [2024-01-05 09:38:01,419][model8_pretrain.py][INFO] Epoch:[0/2](604100/4588595) loss:3.072 lr:0.0000100 epoch_Time:25316.0min: [2024-01-05 09:38:01,419][model8_pretrain.py][INFO] Epoch:[0/2](604100/4588595) loss:2.790 lr:0.0000100 epoch_Time:25316.0min: [2024-01-05 09:38:50,464][model8_pretrain.py][INFO] Epoch:[0/2](604200/4588595) loss:2.656 lr:0.0000100 epoch_Time:25316.0min: [2024-01-05 09:38:50,464][model8_pretrain.py][INFO] Epoch:[0/2](604200/4588595) loss:2.756 lr:0.0000100 epoch_Time:25316.0min: [2024-01-05 09:38:50,464][model8_pretrain.py][INFO] Epoch:[0/2](604200/4588595) loss:2.804 lr:0.0000100 epoch_Time:25316.0min: [2024-01-05 09:38:50,464][model8_pretrain.py][INFO] Epoch:[0/2](604200/4588595) loss:2.861 lr:0.0000100 epoch_Time:25316.0min: [2024-01-05 09:38:50,465][model8_pretrain.py][INFO] Epoch:[0/2](604200/4588595) loss:2.052 lr:0.0000100 epoch_Time:25316.0min: [2024-01-05 09:38:50,464][model8_pretrain.py][INFO] Epoch:[0/2](604200/4588595) loss:2.831 lr:0.0000100 epoch_Time:25316.0min: [2024-01-05 
09:38:50,465][model8_pretrain.py][INFO] Epoch:[0/2](604200/4588595) loss:2.143 lr:0.0000100 epoch_Time:25316.0min: [2024-01-05 09:38:50,465][model8_pretrain.py][INFO] Epoch:[0/2](604200/4588595) loss:3.005 lr:0.0000100 epoch_Time:25316.0min: [2024-01-05 09:39:27,393][model8_pretrain.py][INFO] Epoch:[0/2](604300/4588595) loss:2.872 lr:0.0000100 epoch_Time:25316.0min: [2024-01-05 09:39:27,393][model8_pretrain.py][INFO] Epoch:[0/2](604300/4588595) loss:2.931 lr:0.0000100 epoch_Time:25316.0min: [2024-01-05 09:39:27,393][model8_pretrain.py][INFO] Epoch:[0/2](604300/4588595) loss:2.863 lr:0.0000100 epoch_Time:25316.0min: [2024-01-05 09:39:27,393][model8_pretrain.py][INFO] Epoch:[0/2](604300/4588595) loss:2.809 lr:0.0000100 epoch_Time:25316.0min: [2024-01-05 09:39:27,393][model8_pretrain.py][INFO] Epoch:[0/2](604300/4588595) loss:3.116 lr:0.0000100 epoch_Time:25316.0min: [2024-01-05 09:39:27,393][model8_pretrain.py][INFO] Epoch:[0/2](604300/4588595) loss:3.114 lr:0.0000100 epoch_Time:25316.0min: [2024-01-05 09:39:27,393][model8_pretrain.py][INFO] Epoch:[0/2](604300/4588595) loss:2.742 lr:0.0000100 epoch_Time:25316.0min: [2024-01-05 09:39:27,393][model8_pretrain.py][INFO] Epoch:[0/2](604300/4588595) loss:3.087 lr:0.0000100 epoch_Time:25316.0min: [2024-01-05 09:40:04,327][model8_pretrain.py][INFO] Epoch:[0/2](604400/4588595) loss:2.421 lr:0.0000100 epoch_Time:25315.0min: [2024-01-05 09:40:04,327][model8_pretrain.py][INFO] Epoch:[0/2](604400/4588595) loss:2.928 lr:0.0000100 epoch_Time:25315.0min: [2024-01-05 09:40:04,327][model8_pretrain.py][INFO] Epoch:[0/2](604400/4588595) loss:2.805 lr:0.0000100 epoch_Time:25315.0min: [2024-01-05 09:40:04,327][model8_pretrain.py][INFO] Epoch:[0/2](604400/4588595) loss:2.593 lr:0.0000100 epoch_Time:25315.0min: [2024-01-05 09:40:04,327][model8_pretrain.py][INFO] Epoch:[0/2](604400/4588595) loss:2.811 lr:0.0000100 epoch_Time:25315.0min: [2024-01-05 09:40:04,327][model8_pretrain.py][INFO] Epoch:[0/2](604400/4588595) loss:2.714 lr:0.0000100 
epoch_Time:25315.0min: [2024-01-05 09:40:04,327][model8_pretrain.py][INFO] Epoch:[0/2](604400/4588595) loss:3.611 lr:0.0000100 epoch_Time:25315.0min: [2024-01-05 09:40:04,327][model8_pretrain.py][INFO] Epoch:[0/2](604400/4588595) loss:3.120 lr:0.0000100 epoch_Time:25315.0min: [2024-01-05 09:40:41,257][model8_pretrain.py][INFO] Epoch:[0/2](604500/4588595) loss:2.707 lr:0.0000100 epoch_Time:25315.0min: [2024-01-05 09:40:41,257][model8_pretrain.py][INFO] Epoch:[0/2](604500/4588595) loss:2.625 lr:0.0000100 epoch_Time:25315.0min: [2024-01-05 09:40:41,257][model8_pretrain.py][INFO] Epoch:[0/2](604500/4588595) loss:2.502 lr:0.0000100 epoch_Time:25315.0min: [2024-01-05 09:40:41,257][model8_pretrain.py][INFO] Epoch:[0/2](604500/4588595) loss:3.115 lr:0.0000100 epoch_Time:25315.0min: [2024-01-05 09:40:41,257][model8_pretrain.py][INFO] Epoch:[0/2](604500/4588595) loss:2.689 lr:0.0000100 epoch_Time:25315.0min: [2024-01-05 09:40:41,257][model8_pretrain.py][INFO] Epoch:[0/2](604500/4588595) loss:2.871 lr:0.0000100 epoch_Time:25315.0min: [2024-01-05 09:40:41,257][model8_pretrain.py][INFO] Epoch:[0/2](604500/4588595) loss:3.018 lr:0.0000100 epoch_Time:25315.0min: [2024-01-05 09:40:41,258][model8_pretrain.py][INFO] Epoch:[0/2](604500/4588595) loss:3.323 lr:0.0000100 epoch_Time:25315.0min: [2024-01-05 09:41:18,197][model8_pretrain.py][INFO] Epoch:[0/2](604600/4588595) loss:2.330 lr:0.0000100 epoch_Time:25313.0min: [2024-01-05 09:41:18,197][model8_pretrain.py][INFO] Epoch:[0/2](604600/4588595) loss:3.175 lr:0.0000100 epoch_Time:25313.0min: [2024-01-05 09:41:18,197][model8_pretrain.py][INFO] Epoch:[0/2](604600/4588595) loss:2.233 lr:0.0000100 epoch_Time:25313.0min: [2024-01-05 09:41:18,197][model8_pretrain.py][INFO] Epoch:[0/2](604600/4588595) loss:3.225 lr:0.0000100 epoch_Time:25313.0min: [2024-01-05 09:41:18,197][model8_pretrain.py][INFO] Epoch:[0/2](604600/4588595) loss:2.895 lr:0.0000100 epoch_Time:25313.0min: [2024-01-05 09:41:18,197][model8_pretrain.py][INFO] 
Epoch:[0/2](604600/4588595) loss:2.561 lr:0.0000100 epoch_Time:25313.0min: [2024-01-05 09:41:18,197][model8_pretrain.py][INFO] Epoch:[0/2](604600/4588595) loss:2.585 lr:0.0000100 epoch_Time:25313.0min: [2024-01-05 09:41:18,198][model8_pretrain.py][INFO] Epoch:[0/2](604600/4588595) loss:2.980 lr:0.0000100 epoch_Time:25313.0min: [2024-01-05 09:41:55,140][model8_pretrain.py][INFO] Epoch:[0/2](604700/4588595) loss:2.912 lr:0.0000100 epoch_Time:25312.0min: [2024-01-05 09:41:55,140][model8_pretrain.py][INFO] Epoch:[0/2](604700/4588595) loss:2.835 lr:0.0000100 epoch_Time:25312.0min: [2024-01-05 09:41:55,140][model8_pretrain.py][INFO] Epoch:[0/2](604700/4588595) loss:2.683 lr:0.0000100 epoch_Time:25312.0min: [2024-01-05 09:41:55,140][model8_pretrain.py][INFO] Epoch:[0/2](604700/4588595) loss:3.047 lr:0.0000100 epoch_Time:25312.0min: [2024-01-05 09:41:55,140][model8_pretrain.py][INFO] Epoch:[0/2](604700/4588595) loss:3.006 lr:0.0000100 epoch_Time:25312.0min: [2024-01-05 09:41:55,140][model8_pretrain.py][INFO] Epoch:[0/2](604700/4588595) loss:2.712 lr:0.0000100 epoch_Time:25312.0min: [2024-01-05 09:41:55,140][model8_pretrain.py][INFO] Epoch:[0/2](604700/4588595) loss:2.564 lr:0.0000100 epoch_Time:25312.0min: [2024-01-05 09:41:55,140][model8_pretrain.py][INFO] Epoch:[0/2](604700/4588595) loss:3.011 lr:0.0000100 epoch_Time:25312.0min: [2024-01-05 09:42:32,092][model8_pretrain.py][INFO] Epoch:[0/2](604800/4588595) loss:2.926 lr:0.0000100 epoch_Time:25312.0min: [2024-01-05 09:42:32,092][model8_pretrain.py][INFO] Epoch:[0/2](604800/4588595) loss:3.045 lr:0.0000100 epoch_Time:25312.0min: [2024-01-05 09:42:32,092][model8_pretrain.py][INFO] Epoch:[0/2](604800/4588595) loss:2.722 lr:0.0000100 epoch_Time:25312.0min: [2024-01-05 09:42:32,092][model8_pretrain.py][INFO] Epoch:[0/2](604800/4588595) loss:2.590 lr:0.0000100 epoch_Time:25312.0min: [2024-01-05 09:42:32,092][model8_pretrain.py][INFO] Epoch:[0/2](604800/4588595) loss:2.822 lr:0.0000100 epoch_Time:25312.0min: [2024-01-05 
09:42:32,092][model8_pretrain.py][INFO] Epoch:[0/2](604800/4588595) loss:2.714 lr:0.0000100 epoch_Time:25312.0min: [2024-01-05 09:42:32,092][model8_pretrain.py][INFO] Epoch:[0/2](604800/4588595) loss:3.250 lr:0.0000100 epoch_Time:25312.0min: [2024-01-05 09:42:32,092][model8_pretrain.py][INFO] Epoch:[0/2](604800/4588595) loss:3.151 lr:0.0000100 epoch_Time:25312.0min: [2024-01-05 09:43:09,023][model8_pretrain.py][INFO] Epoch:[0/2](604900/4588595) loss:2.680 lr:0.0000100 epoch_Time:25311.0min: [2024-01-05 09:43:09,023][model8_pretrain.py][INFO] Epoch:[0/2](604900/4588595) loss:2.566 lr:0.0000100 epoch_Time:25311.0min: [2024-01-05 09:43:09,023][model8_pretrain.py][INFO] Epoch:[0/2](604900/4588595) loss:3.247 lr:0.0000100 epoch_Time:25311.0min: [2024-01-05 09:43:09,023][model8_pretrain.py][INFO] Epoch:[0/2](604900/4588595) loss:3.153 lr:0.0000100 epoch_Time:25311.0min: [2024-01-05 09:43:09,023][model8_pretrain.py][INFO] Epoch:[0/2](604900/4588595) loss:2.626 lr:0.0000100 epoch_Time:25311.0min: [2024-01-05 09:43:09,023][model8_pretrain.py][INFO] Epoch:[0/2](604900/4588595) loss:2.569 lr:0.0000100 epoch_Time:25311.0min: [2024-01-05 09:43:09,023][model8_pretrain.py][INFO] Epoch:[0/2](604900/4588595) loss:3.264 lr:0.0000100 epoch_Time:25311.0min: [2024-01-05 09:43:09,024][model8_pretrain.py][INFO] Epoch:[0/2](604900/4588595) loss:2.811 lr:0.0000100 epoch_Time:25311.0min: [2024-01-05 09:43:58,174][model8_pretrain.py][INFO] Epoch:[0/2](605000/4588595) loss:3.182 lr:0.0000100 epoch_Time:25311.0min: [2024-01-05 09:43:58,174][model8_pretrain.py][INFO] Epoch:[0/2](605000/4588595) loss:2.237 lr:0.0000100 epoch_Time:25311.0min: [2024-01-05 09:43:58,174][model8_pretrain.py][INFO] Epoch:[0/2](605000/4588595) loss:2.972 lr:0.0000100 epoch_Time:25311.0min: [2024-01-05 09:43:58,174][model8_pretrain.py][INFO] Epoch:[0/2](605000/4588595) loss:2.908 lr:0.0000100 epoch_Time:25311.0min: [2024-01-05 09:43:58,174][model8_pretrain.py][INFO] Epoch:[0/2](605000/4588595) loss:3.431 lr:0.0000100 
epoch_Time:25311.0min: [2024-01-05 09:43:58,175][model8_pretrain.py][INFO] Epoch:[0/2](605000/4588595) loss:2.334 lr:0.0000100 epoch_Time:25311.0min: [2024-01-05 09:43:58,175][model8_pretrain.py][INFO] Epoch:[0/2](605000/4588595) loss:2.930 lr:0.0000100 epoch_Time:25311.0min: [2024-01-05 09:43:58,175][model8_pretrain.py][INFO] Epoch:[0/2](605000/4588595) loss:3.020 lr:0.0000100 epoch_Time:25311.0min: [2024-01-05 09:44:35,096][model8_pretrain.py][INFO] Epoch:[0/2](605100/4588595) loss:3.179 lr:0.0000100 epoch_Time:25311.0min: [2024-01-05 09:44:35,096][model8_pretrain.py][INFO] Epoch:[0/2](605100/4588595) loss:2.728 lr:0.0000100 epoch_Time:25311.0min: [2024-01-05 09:44:35,096][model8_pretrain.py][INFO] Epoch:[0/2](605100/4588595) loss:3.084 lr:0.0000100 epoch_Time:25311.0min: [2024-01-05 09:44:35,096][model8_pretrain.py][INFO] Epoch:[0/2](605100/4588595) loss:3.511 lr:0.0000100 epoch_Time:25311.0min: [2024-01-05 09:44:35,096][model8_pretrain.py][INFO] Epoch:[0/2](605100/4588595) loss:3.067 lr:0.0000100 epoch_Time:25311.0min: [2024-01-05 09:44:35,096][model8_pretrain.py][INFO] Epoch:[0/2](605100/4588595) loss:3.261 lr:0.0000100 epoch_Time:25311.0min: [2024-01-05 09:44:35,096][model8_pretrain.py][INFO] Epoch:[0/2](605100/4588595) loss:2.992 lr:0.0000100 epoch_Time:25311.0min: [2024-01-05 09:44:35,096][model8_pretrain.py][INFO] Epoch:[0/2](605100/4588595) loss:2.764 lr:0.0000100 epoch_Time:25311.0min: [2024-01-05 09:45:12,037][model8_pretrain.py][INFO] Epoch:[0/2](605200/4588595) loss:2.939 lr:0.0000100 epoch_Time:25310.0min: [2024-01-05 09:45:12,037][model8_pretrain.py][INFO] Epoch:[0/2](605200/4588595) loss:2.693 lr:0.0000100 epoch_Time:25310.0min: [2024-01-05 09:45:12,037][model8_pretrain.py][INFO] Epoch:[0/2](605200/4588595) loss:3.059 lr:0.0000100 epoch_Time:25310.0min: [2024-01-05 09:45:12,037][model8_pretrain.py][INFO] Epoch:[0/2](605200/4588595) loss:3.070 lr:0.0000100 epoch_Time:25310.0min: [2024-01-05 09:45:12,037][model8_pretrain.py][INFO] 
Epoch:[0/2](605200/4588595) loss:2.358 lr:0.0000100 epoch_Time:25310.0min: [2024-01-05 09:45:12,037][model8_pretrain.py][INFO] Epoch:[0/2](605200/4588595) loss:2.379 lr:0.0000100 epoch_Time:25310.0min: [2024-01-05 09:45:12,038][model8_pretrain.py][INFO] Epoch:[0/2](605200/4588595) loss:3.052 lr:0.0000100 epoch_Time:25310.0min: [2024-01-05 09:45:12,038][model8_pretrain.py][INFO] Epoch:[0/2](605200/4588595) loss:2.997 lr:0.0000100 epoch_Time:25310.0min: [2024-01-05 09:45:48,976][model8_pretrain.py][INFO] Epoch:[0/2](605300/4588595) loss:3.061 lr:0.0000100 epoch_Time:25309.0min: [2024-01-05 09:45:48,976][model8_pretrain.py][INFO] Epoch:[0/2](605300/4588595) loss:3.251 lr:0.0000100 epoch_Time:25309.0min: [2024-01-05 09:45:48,976][model8_pretrain.py][INFO] Epoch:[0/2](605300/4588595) loss:3.025 lr:0.0000100 epoch_Time:25309.0min: [2024-01-05 09:45:48,976][model8_pretrain.py][INFO] Epoch:[0/2](605300/4588595) loss:2.653 lr:0.0000100 epoch_Time:25309.0min: [2024-01-05 09:45:48,976][model8_pretrain.py][INFO] Epoch:[0/2](605300/4588595) loss:2.399 lr:0.0000100 epoch_Time:25309.0min: [2024-01-05 09:45:48,976][model8_pretrain.py][INFO] Epoch:[0/2](605300/4588595) loss:3.212 lr:0.0000100 epoch_Time:25309.0min: [2024-01-05 09:45:48,976][model8_pretrain.py][INFO] Epoch:[0/2](605300/4588595) loss:3.181 lr:0.0000100 epoch_Time:25309.0min: [2024-01-05 09:45:48,976][model8_pretrain.py][INFO] Epoch:[0/2](605300/4588595) loss:2.869 lr:0.0000100 epoch_Time:25309.0min: [2024-01-05 09:46:25,916][model8_pretrain.py][INFO] Epoch:[0/2](605400/4588595) loss:2.930 lr:0.0000100 epoch_Time:25309.0min: [2024-01-05 09:46:25,916][model8_pretrain.py][INFO] Epoch:[0/2](605400/4588595) loss:3.347 lr:0.0000100 epoch_Time:25309.0min: [2024-01-05 09:46:25,916][model8_pretrain.py][INFO] Epoch:[0/2](605400/4588595) loss:2.355 lr:0.0000100 epoch_Time:25309.0min: [2024-01-05 09:46:25,916][model8_pretrain.py][INFO] Epoch:[0/2](605400/4588595) loss:3.314 lr:0.0000100 epoch_Time:25309.0min: [2024-01-05 
09:46:25,916][model8_pretrain.py][INFO] Epoch:[0/2](605400/4588595) loss:2.826 lr:0.0000100 epoch_Time:25309.0min: [2024-01-05 09:46:25,916][model8_pretrain.py][INFO] Epoch:[0/2](605400/4588595) loss:3.158 lr:0.0000100 epoch_Time:25309.0min: [2024-01-05 09:46:25,916][model8_pretrain.py][INFO] Epoch:[0/2](605400/4588595) loss:2.966 lr:0.0000100 epoch_Time:25309.0min: [2024-01-05 09:46:25,916][model8_pretrain.py][INFO] Epoch:[0/2](605400/4588595) loss:2.731 lr:0.0000100 epoch_Time:25309.0min: [2024-01-05 09:47:02,859][model8_pretrain.py][INFO] Epoch:[0/2](605500/4588595) loss:3.157 lr:0.0000100 epoch_Time:25308.0min: [2024-01-05 09:47:02,859][model8_pretrain.py][INFO] Epoch:[0/2](605500/4588595) loss:2.360 lr:0.0000100 epoch_Time:25308.0min: [2024-01-05 09:47:02,859][model8_pretrain.py][INFO] Epoch:[0/2](605500/4588595) loss:2.910 lr:0.0000100 epoch_Time:25308.0min: [2024-01-05 09:47:02,859][model8_pretrain.py][INFO] Epoch:[0/2](605500/4588595) loss:3.012 lr:0.0000100 epoch_Time:25308.0min: [2024-01-05 09:47:02,859][model8_pretrain.py][INFO] Epoch:[0/2](605500/4588595) loss:3.047 lr:0.0000100 epoch_Time:25308.0min: [2024-01-05 09:47:02,859][model8_pretrain.py][INFO] Epoch:[0/2](605500/4588595) loss:2.965 lr:0.0000100 epoch_Time:25308.0min: [2024-01-05 09:47:02,859][model8_pretrain.py][INFO] Epoch:[0/2](605500/4588595) loss:2.582 lr:0.0000100 epoch_Time:25308.0min: [2024-01-05 09:47:02,860][model8_pretrain.py][INFO] Epoch:[0/2](605500/4588595) loss:2.666 lr:0.0000100 epoch_Time:25308.0min: [2024-01-05 09:47:39,793][model8_pretrain.py][INFO] Epoch:[0/2](605600/4588595) loss:2.933 lr:0.0000100 epoch_Time:25308.0min: [2024-01-05 09:47:39,793][model8_pretrain.py][INFO] Epoch:[0/2](605600/4588595) loss:2.058 lr:0.0000100 epoch_Time:25308.0min: [2024-01-05 09:47:39,793][model8_pretrain.py][INFO] Epoch:[0/2](605600/4588595) loss:3.193 lr:0.0000100 epoch_Time:25308.0min: [2024-01-05 09:47:39,793][model8_pretrain.py][INFO] Epoch:[0/2](605600/4588595) loss:2.619 lr:0.0000100 
epoch_Time:25308.0min: [2024-01-05 09:47:39,793][model8_pretrain.py][INFO] Epoch:[0/2](605600/4588595) loss:2.706 lr:0.0000100 epoch_Time:25308.0min: [2024-01-05 09:47:39,793][model8_pretrain.py][INFO] Epoch:[0/2](605600/4588595) loss:2.777 lr:0.0000100 epoch_Time:25308.0min: [2024-01-05 09:47:39,793][model8_pretrain.py][INFO] Epoch:[0/2](605600/4588595) loss:2.543 lr:0.0000100 epoch_Time:25308.0min: [2024-01-05 09:47:39,793][model8_pretrain.py][INFO] Epoch:[0/2](605600/4588595) loss:2.475 lr:0.0000100 epoch_Time:25308.0min: [2024-01-05 09:48:16,719][model8_pretrain.py][INFO] Epoch:[0/2](605700/4588595) loss:2.805 lr:0.0000100 epoch_Time:25306.0min: [2024-01-05 09:48:16,719][model8_pretrain.py][INFO] Epoch:[0/2](605700/4588595) loss:2.722 lr:0.0000100 epoch_Time:25306.0min: [2024-01-05 09:48:16,720][model8_pretrain.py][INFO] Epoch:[0/2](605700/4588595) loss:3.499 lr:0.0000100 epoch_Time:25306.0min: [2024-01-05 09:48:16,720][model8_pretrain.py][INFO] Epoch:[0/2](605700/4588595) loss:3.004 lr:0.0000100 epoch_Time:25306.0min: [2024-01-05 09:48:16,720][model8_pretrain.py][INFO] Epoch:[0/2](605700/4588595) loss:2.633 lr:0.0000100 epoch_Time:25306.0min: [2024-01-05 09:48:16,720][model8_pretrain.py][INFO] Epoch:[0/2](605700/4588595) loss:3.188 lr:0.0000100 epoch_Time:25306.0min: [2024-01-05 09:48:16,720][model8_pretrain.py][INFO] Epoch:[0/2](605700/4588595) loss:2.922 lr:0.0000100 epoch_Time:25306.0min: [2024-01-05 09:48:16,720][model8_pretrain.py][INFO] Epoch:[0/2](605700/4588595) loss:2.788 lr:0.0000100 epoch_Time:25306.0min: [2024-01-05 09:49:05,772][model8_pretrain.py][INFO] Epoch:[0/2](605800/4588595) loss:2.677 lr:0.0000100 epoch_Time:25307.0min: [2024-01-05 09:49:05,772][model8_pretrain.py][INFO] Epoch:[0/2](605800/4588595) loss:3.034 lr:0.0000100 epoch_Time:25307.0min: [2024-01-05 09:49:05,772][model8_pretrain.py][INFO] Epoch:[0/2](605800/4588595) loss:2.477 lr:0.0000100 epoch_Time:25307.0min: [2024-01-05 09:49:05,772][model8_pretrain.py][INFO] 
Epoch:[0/2](605800/4588595) loss:2.877 lr:0.0000100 epoch_Time:25307.0min: [2024-01-05 09:49:05,772][model8_pretrain.py][INFO] Epoch:[0/2](605800/4588595) loss:3.262 lr:0.0000100 epoch_Time:25307.0min: [2024-01-05 09:49:05,772][model8_pretrain.py][INFO] Epoch:[0/2](605800/4588595) loss:2.943 lr:0.0000100 epoch_Time:25307.0min: [2024-01-05 09:49:05,772][model8_pretrain.py][INFO] Epoch:[0/2](605800/4588595) loss:2.557 lr:0.0000100 epoch_Time:25307.0min: [2024-01-05 09:49:05,772][model8_pretrain.py][INFO] Epoch:[0/2](605800/4588595) loss:3.530 lr:0.0000100 epoch_Time:25307.0min: [2024-01-05 09:49:42,691][model8_pretrain.py][INFO] Epoch:[0/2](605900/4588595) loss:2.582 lr:0.0000100 epoch_Time:25307.0min: [2024-01-05 09:49:42,691][model8_pretrain.py][INFO] Epoch:[0/2](605900/4588595) loss:2.115 lr:0.0000100 epoch_Time:25307.0min: [2024-01-05 09:49:42,691][model8_pretrain.py][INFO] Epoch:[0/2](605900/4588595) loss:3.086 lr:0.0000100 epoch_Time:25307.0min: [2024-01-05 09:49:42,691][model8_pretrain.py][INFO] Epoch:[0/2](605900/4588595) loss:2.570 lr:0.0000100 epoch_Time:25307.0min: [2024-01-05 09:49:42,691][model8_pretrain.py][INFO] Epoch:[0/2](605900/4588595) loss:2.808 lr:0.0000100 epoch_Time:25307.0min: [2024-01-05 09:49:42,691][model8_pretrain.py][INFO] Epoch:[0/2](605900/4588595) loss:2.895 lr:0.0000100 epoch_Time:25307.0min: [2024-01-05 09:49:42,691][model8_pretrain.py][INFO] Epoch:[0/2](605900/4588595) loss:2.332 lr:0.0000100 epoch_Time:25307.0min: [2024-01-05 09:49:42,692][model8_pretrain.py][INFO] Epoch:[0/2](605900/4588595) loss:2.952 lr:0.0000100 epoch_Time:25307.0min: [2024-01-05 09:50:19,629][model8_pretrain.py][INFO] Epoch:[0/2](606000/4588595) loss:3.089 lr:0.0000100 epoch_Time:25305.0min: [2024-01-05 09:50:19,629][model8_pretrain.py][INFO] Epoch:[0/2](606000/4588595) loss:3.027 lr:0.0000100 epoch_Time:25305.0min: [2024-01-05 09:50:19,629][model8_pretrain.py][INFO] Epoch:[0/2](606000/4588595) loss:2.935 lr:0.0000100 epoch_Time:25305.0min: [2024-01-05 
09:50:19,629][model8_pretrain.py][INFO] Epoch:[0/2](606000/4588595) loss:2.389 lr:0.0000100 epoch_Time:25305.0min: [2024-01-05 09:50:19,629][model8_pretrain.py][INFO] Epoch:[0/2](606000/4588595) loss:2.602 lr:0.0000100 epoch_Time:25305.0min: [2024-01-05 09:50:19,629][model8_pretrain.py][INFO] Epoch:[0/2](606000/4588595) loss:3.301 lr:0.0000100 epoch_Time:25305.0min: [2024-01-05 09:50:19,630][model8_pretrain.py][INFO] Epoch:[0/2](606000/4588595) loss:2.334 lr:0.0000100 epoch_Time:25305.0min: [2024-01-05 09:50:19,630][model8_pretrain.py][INFO] Epoch:[0/2](606000/4588595) loss:1.878 lr:0.0000100 epoch_Time:25305.0min: [2024-01-05 09:50:56,569][model8_pretrain.py][INFO] Epoch:[0/2](606100/4588595) loss:3.190 lr:0.0000100 epoch_Time:25304.0min: [2024-01-05 09:50:56,569][model8_pretrain.py][INFO] Epoch:[0/2](606100/4588595) loss:3.041 lr:0.0000100 epoch_Time:25304.0min: [2024-01-05 09:50:56,569][model8_pretrain.py][INFO] Epoch:[0/2](606100/4588595) loss:2.752 lr:0.0000100 epoch_Time:25304.0min: [2024-01-05 09:50:56,569][model8_pretrain.py][INFO] Epoch:[0/2](606100/4588595) loss:2.810 lr:0.0000100 epoch_Time:25304.0min: [2024-01-05 09:50:56,569][model8_pretrain.py][INFO] Epoch:[0/2](606100/4588595) loss:3.119 lr:0.0000100 epoch_Time:25304.0min: [2024-01-05 09:50:56,569][model8_pretrain.py][INFO] Epoch:[0/2](606100/4588595) loss:3.317 lr:0.0000100 epoch_Time:25304.0min: [2024-01-05 09:50:56,569][model8_pretrain.py][INFO] Epoch:[0/2](606100/4588595) loss:2.438 lr:0.0000100 epoch_Time:25304.0min: [2024-01-05 09:50:56,569][model8_pretrain.py][INFO] Epoch:[0/2](606100/4588595) loss:2.876 lr:0.0000100 epoch_Time:25304.0min: [2024-01-05 09:51:33,510][model8_pretrain.py][INFO] Epoch:[0/2](606200/4588595) loss:2.920 lr:0.0000100 epoch_Time:25304.0min: [2024-01-05 09:51:33,510][model8_pretrain.py][INFO] Epoch:[0/2](606200/4588595) loss:2.638 lr:0.0000100 epoch_Time:25304.0min: [2024-01-05 09:51:33,510][model8_pretrain.py][INFO] Epoch:[0/2](606200/4588595) loss:2.618 lr:0.0000100 
epoch_Time:25304.0min: [2024-01-05 09:51:33,510][model8_pretrain.py][INFO] Epoch:[0/2](606200/4588595) loss:2.912 lr:0.0000100 epoch_Time:25304.0min: [2024-01-05 09:51:33,510][model8_pretrain.py][INFO] Epoch:[0/2](606200/4588595) loss:2.229 lr:0.0000100 epoch_Time:25304.0min: [2024-01-05 09:51:33,510][model8_pretrain.py][INFO] Epoch:[0/2](606200/4588595) loss:2.644 lr:0.0000100 epoch_Time:25304.0min: [2024-01-05 09:51:33,510][model8_pretrain.py][INFO] Epoch:[0/2](606200/4588595) loss:2.859 lr:0.0000100 epoch_Time:25304.0min: [2024-01-05 09:51:33,510][model8_pretrain.py][INFO] Epoch:[0/2](606200/4588595) loss:2.931 lr:0.0000100 epoch_Time:25304.0min: [2024-01-05 09:52:10,452][model8_pretrain.py][INFO] Epoch:[0/2](606300/4588595) loss:3.162 lr:0.0000100 epoch_Time:25303.0min: [2024-01-05 09:52:10,452][model8_pretrain.py][INFO] Epoch:[0/2](606300/4588595) loss:3.197 lr:0.0000100 epoch_Time:25303.0min: [2024-01-05 09:52:10,452][model8_pretrain.py][INFO] Epoch:[0/2](606300/4588595) loss:3.115 lr:0.0000100 epoch_Time:25303.0min: [2024-01-05 09:52:10,452][model8_pretrain.py][INFO] Epoch:[0/2](606300/4588595) loss:2.530 lr:0.0000100 epoch_Time:25303.0min: [2024-01-05 09:52:10,452][model8_pretrain.py][INFO] Epoch:[0/2](606300/4588595) loss:3.091 lr:0.0000100 epoch_Time:25303.0min: [2024-01-05 09:52:10,452][model8_pretrain.py][INFO] Epoch:[0/2](606300/4588595) loss:3.258 lr:0.0000100 epoch_Time:25303.0min: [2024-01-05 09:52:10,452][model8_pretrain.py][INFO] Epoch:[0/2](606300/4588595) loss:2.419 lr:0.0000100 epoch_Time:25303.0min: [2024-01-05 09:52:10,452][model8_pretrain.py][INFO] Epoch:[0/2](606300/4588595) loss:2.827 lr:0.0000100 epoch_Time:25303.0min: [2024-01-05 09:52:47,392][model8_pretrain.py][INFO] Epoch:[0/2](606400/4588595) loss:2.680 lr:0.0000100 epoch_Time:25303.0min: [2024-01-05 09:52:47,392][model8_pretrain.py][INFO] Epoch:[0/2](606400/4588595) loss:3.046 lr:0.0000100 epoch_Time:25303.0min: [2024-01-05 09:52:47,392][model8_pretrain.py][INFO] 
Epoch:[0/2](606400/4588595) loss:2.826 lr:0.0000100 epoch_Time:25303.0min: [2024-01-05 09:52:47,392][model8_pretrain.py][INFO] Epoch:[0/2](606400/4588595) loss:3.231 lr:0.0000100 epoch_Time:25303.0min: [2024-01-05 09:52:47,392][model8_pretrain.py][INFO] Epoch:[0/2](606400/4588595) loss:2.982 lr:0.0000100 epoch_Time:25303.0min: [2024-01-05 09:52:47,392][model8_pretrain.py][INFO] Epoch:[0/2](606400/4588595) loss:2.532 lr:0.0000100 epoch_Time:25303.0min: [2024-01-05 09:52:47,392][model8_pretrain.py][INFO] Epoch:[0/2](606400/4588595) loss:3.028 lr:0.0000100 epoch_Time:25303.0min: [2024-01-05 09:52:47,392][model8_pretrain.py][INFO] Epoch:[0/2](606400/4588595) loss:2.659 lr:0.0000100 epoch_Time:25303.0min: [2024-01-05 09:53:24,306][model8_pretrain.py][INFO] Epoch:[0/2](606500/4588595) loss:3.153 lr:0.0000100 epoch_Time:25302.0min: [2024-01-05 09:53:24,306][model8_pretrain.py][INFO] Epoch:[0/2](606500/4588595) loss:3.156 lr:0.0000100 epoch_Time:25302.0min: [2024-01-05 09:53:24,306][model8_pretrain.py][INFO] Epoch:[0/2](606500/4588595) loss:3.324 lr:0.0000100 epoch_Time:25302.0min: [2024-01-05 09:53:24,306][model8_pretrain.py][INFO] Epoch:[0/2](606500/4588595) loss:3.069 lr:0.0000100 epoch_Time:25302.0min: [2024-01-05 09:53:24,306][model8_pretrain.py][INFO] Epoch:[0/2](606500/4588595) loss:2.486 lr:0.0000100 epoch_Time:25302.0min: [2024-01-05 09:53:24,306][model8_pretrain.py][INFO] Epoch:[0/2](606500/4588595) loss:3.011 lr:0.0000100 epoch_Time:25302.0min: [2024-01-05 09:53:24,306][model8_pretrain.py][INFO] Epoch:[0/2](606500/4588595) loss:3.035 lr:0.0000100 epoch_Time:25302.0min: [2024-01-05 09:53:24,306][model8_pretrain.py][INFO] Epoch:[0/2](606500/4588595) loss:2.934 lr:0.0000100 epoch_Time:25302.0min: [2024-01-05 09:54:13,312][model8_pretrain.py][INFO] Epoch:[0/2](606600/4588595) loss:2.514 lr:0.0000100 epoch_Time:25302.0min: [2024-01-05 09:54:13,312][model8_pretrain.py][INFO] Epoch:[0/2](606600/4588595) loss:2.708 lr:0.0000100 epoch_Time:25302.0min: [2024-01-05 
09:54:13,312][model8_pretrain.py][INFO] Epoch:[0/2](606600/4588595) loss:2.912 lr:0.0000100 epoch_Time:25302.0min: [2024-01-05 09:54:13,312][model8_pretrain.py][INFO] Epoch:[0/2](606600/4588595) loss:2.476 lr:0.0000100 epoch_Time:25302.0min: [2024-01-05 09:54:13,312][model8_pretrain.py][INFO] Epoch:[0/2](606600/4588595) loss:2.497 lr:0.0000100 epoch_Time:25302.0min: [2024-01-05 09:54:13,312][model8_pretrain.py][INFO] Epoch:[0/2](606600/4588595) loss:2.403 lr:0.0000100 epoch_Time:25302.0min: [2024-01-05 09:54:13,312][model8_pretrain.py][INFO] Epoch:[0/2](606600/4588595) loss:2.705 lr:0.0000100 epoch_Time:25302.0min: [2024-01-05 09:54:13,312][model8_pretrain.py][INFO] Epoch:[0/2](606600/4588595) loss:2.535 lr:0.0000100 epoch_Time:25302.0min: [2024-01-05 09:54:50,231][model8_pretrain.py][INFO] Epoch:[0/2](606700/4588595) loss:2.197 lr:0.0000100 epoch_Time:25301.0min: [2024-01-05 09:54:50,231][model8_pretrain.py][INFO] Epoch:[0/2](606700/4588595) loss:3.247 lr:0.0000100 epoch_Time:25301.0min: [2024-01-05 09:54:50,232][model8_pretrain.py][INFO] Epoch:[0/2](606700/4588595) loss:2.856 lr:0.0000100 epoch_Time:25301.0min: [2024-01-05 09:54:50,232][model8_pretrain.py][INFO] Epoch:[0/2](606700/4588595) loss:2.705 lr:0.0000100 epoch_Time:25301.0min: [2024-01-05 09:54:50,232][model8_pretrain.py][INFO] Epoch:[0/2](606700/4588595) loss:2.815 lr:0.0000100 epoch_Time:25301.0min: [2024-01-05 09:54:50,232][model8_pretrain.py][INFO] Epoch:[0/2](606700/4588595) loss:2.421 lr:0.0000100 epoch_Time:25301.0min: [2024-01-05 09:54:50,232][model8_pretrain.py][INFO] Epoch:[0/2](606700/4588595) loss:2.775 lr:0.0000100 epoch_Time:25301.0min: [2024-01-05 09:54:50,232][model8_pretrain.py][INFO] Epoch:[0/2](606700/4588595) loss:3.295 lr:0.0000100 epoch_Time:25301.0min: [2024-01-05 09:55:27,151][model8_pretrain.py][INFO] Epoch:[0/2](606800/4588595) loss:2.704 lr:0.0000100 epoch_Time:25301.0min: [2024-01-05 09:55:27,151][model8_pretrain.py][INFO] Epoch:[0/2](606800/4588595) loss:2.976 lr:0.0000100 
epoch_Time:25301.0min: [2024-01-05 09:55:27,151][model8_pretrain.py][INFO] Epoch:[0/2](606800/4588595) loss:2.923 lr:0.0000100 epoch_Time:25301.0min: [2024-01-05 09:55:27,151][model8_pretrain.py][INFO] Epoch:[0/2](606800/4588595) loss:2.815 lr:0.0000100 epoch_Time:25301.0min: [2024-01-05 09:55:27,151][model8_pretrain.py][INFO] Epoch:[0/2](606800/4588595) loss:2.736 lr:0.0000100 epoch_Time:25301.0min: [2024-01-05 09:55:27,151][model8_pretrain.py][INFO] Epoch:[0/2](606800/4588595) loss:2.908 lr:0.0000100 epoch_Time:25301.0min: [2024-01-05 09:55:27,151][model8_pretrain.py][INFO] Epoch:[0/2](606800/4588595) loss:3.158 lr:0.0000100 epoch_Time:25301.0min: [2024-01-05 09:55:27,151][model8_pretrain.py][INFO] Epoch:[0/2](606800/4588595) loss:2.976 lr:0.0000100 epoch_Time:25301.0min: [2024-01-05 09:56:04,085][model8_pretrain.py][INFO] Epoch:[0/2](606900/4588595) loss:2.660 lr:0.0000100 epoch_Time:25300.0min: [2024-01-05 09:56:04,085][model8_pretrain.py][INFO] Epoch:[0/2](606900/4588595) loss:2.440 lr:0.0000100 epoch_Time:25300.0min: [2024-01-05 09:56:04,085][model8_pretrain.py][INFO] Epoch:[0/2](606900/4588595) loss:2.550 lr:0.0000100 epoch_Time:25300.0min: [2024-01-05 09:56:04,085][model8_pretrain.py][INFO] Epoch:[0/2](606900/4588595) loss:3.400 lr:0.0000100 epoch_Time:25300.0min: [2024-01-05 09:56:04,085][model8_pretrain.py][INFO] Epoch:[0/2](606900/4588595) loss:2.352 lr:0.0000100 epoch_Time:25300.0min: [2024-01-05 09:56:04,085][model8_pretrain.py][INFO] Epoch:[0/2](606900/4588595) loss:2.689 lr:0.0000100 epoch_Time:25300.0min: [2024-01-05 09:56:04,085][model8_pretrain.py][INFO] Epoch:[0/2](606900/4588595) loss:2.699 lr:0.0000100 epoch_Time:25300.0min: [2024-01-05 09:56:04,085][model8_pretrain.py][INFO] Epoch:[0/2](606900/4588595) loss:3.242 lr:0.0000100 epoch_Time:25300.0min: [2024-01-05 09:56:41,034][model8_pretrain.py][INFO] Epoch:[0/2](607000/4588595) loss:3.257 lr:0.0000100 epoch_Time:25299.0min: [2024-01-05 09:56:41,034][model8_pretrain.py][INFO] 
Epoch:[0/2](607000/4588595) loss:2.241 lr:0.0000100 epoch_Time:25299.0min: [2024-01-05 09:56:41,034][model8_pretrain.py][INFO] Epoch:[0/2](607000/4588595) loss:2.499 lr:0.0000100 epoch_Time:25299.0min: [2024-01-05 09:56:41,035][model8_pretrain.py][INFO] Epoch:[0/2](607000/4588595) loss:3.273 lr:0.0000100 epoch_Time:25299.0min: [2024-01-05 09:56:41,035][model8_pretrain.py][INFO] Epoch:[0/2](607000/4588595) loss:2.187 lr:0.0000100 epoch_Time:25299.0min: [2024-01-05 09:56:41,035][model8_pretrain.py][INFO] Epoch:[0/2](607000/4588595) loss:3.048 lr:0.0000100 epoch_Time:25299.0min: [2024-01-05 09:56:41,035][model8_pretrain.py][INFO] Epoch:[0/2](607000/4588595) loss:2.911 lr:0.0000100 epoch_Time:25299.0min: [2024-01-05 09:56:41,035][model8_pretrain.py][INFO] Epoch:[0/2](607000/4588595) loss:3.053 lr:0.0000100 epoch_Time:25299.0min: [2024-01-05 09:57:17,971][model8_pretrain.py][INFO] Epoch:[0/2](607100/4588595) loss:2.265 lr:0.0000100 epoch_Time:25298.0min: [2024-01-05 09:57:17,971][model8_pretrain.py][INFO] Epoch:[0/2](607100/4588595) loss:2.891 lr:0.0000100 epoch_Time:25298.0min: [2024-01-05 09:57:17,971][model8_pretrain.py][INFO] Epoch:[0/2](607100/4588595) loss:3.165 lr:0.0000100 epoch_Time:25298.0min: [2024-01-05 09:57:17,971][model8_pretrain.py][INFO] Epoch:[0/2](607100/4588595) loss:3.017 lr:0.0000100 epoch_Time:25298.0min: [2024-01-05 09:57:17,971][model8_pretrain.py][INFO] Epoch:[0/2](607100/4588595) loss:2.392 lr:0.0000100 epoch_Time:25298.0min: [2024-01-05 09:57:17,971][model8_pretrain.py][INFO] Epoch:[0/2](607100/4588595) loss:2.534 lr:0.0000100 epoch_Time:25298.0min: [2024-01-05 09:57:17,971][model8_pretrain.py][INFO] Epoch:[0/2](607100/4588595) loss:2.417 lr:0.0000100 epoch_Time:25298.0min: [2024-01-05 09:57:17,971][model8_pretrain.py][INFO] Epoch:[0/2](607100/4588595) loss:2.957 lr:0.0000100 epoch_Time:25298.0min: [2024-01-05 09:57:54,906][model8_pretrain.py][INFO] Epoch:[0/2](607200/4588595) loss:2.716 lr:0.0000100 epoch_Time:25297.0min: [2024-01-05 
09:57:54,906][model8_pretrain.py][INFO] Epoch:[0/2](607200/4588595) loss:3.204 lr:0.0000100 epoch_Time:25297.0min: [2024-01-05 09:57:54,906][model8_pretrain.py][INFO] Epoch:[0/2](607200/4588595) loss:2.833 lr:0.0000100 epoch_Time:25297.0min: [2024-01-05 09:57:54,906][model8_pretrain.py][INFO] Epoch:[0/2](607200/4588595) loss:2.909 lr:0.0000100 epoch_Time:25297.0min: [2024-01-05 09:57:54,906][model8_pretrain.py][INFO] Epoch:[0/2](607200/4588595) loss:2.629 lr:0.0000100 epoch_Time:25297.0min: [2024-01-05 09:57:54,906][model8_pretrain.py][INFO] Epoch:[0/2](607200/4588595) loss:2.681 lr:0.0000100 epoch_Time:25297.0min: [2024-01-05 09:57:54,906][model8_pretrain.py][INFO] Epoch:[0/2](607200/4588595) loss:2.778 lr:0.0000100 epoch_Time:25297.0min: [2024-01-05 09:57:54,906][model8_pretrain.py][INFO] Epoch:[0/2](607200/4588595) loss:3.005 lr:0.0000100 epoch_Time:25297.0min: [2024-01-05 09:58:31,843][model8_pretrain.py][INFO] Epoch:[0/2](607300/4588595) loss:2.881 lr:0.0000100 epoch_Time:25297.0min: [2024-01-05 09:58:31,843][model8_pretrain.py][INFO] Epoch:[0/2](607300/4588595) loss:2.486 lr:0.0000100 epoch_Time:25297.0min: [2024-01-05 09:58:31,843][model8_pretrain.py][INFO] Epoch:[0/2](607300/4588595) loss:2.803 lr:0.0000100 epoch_Time:25297.0min: [2024-01-05 09:58:31,843][model8_pretrain.py][INFO] Epoch:[0/2](607300/4588595) loss:2.748 lr:0.0000100 epoch_Time:25297.0min: [2024-01-05 09:58:31,843][model8_pretrain.py][INFO] Epoch:[0/2](607300/4588595) loss:2.787 lr:0.0000100 epoch_Time:25297.0min: [2024-01-05 09:58:31,843][model8_pretrain.py][INFO] Epoch:[0/2](607300/4588595) loss:2.310 lr:0.0000100 epoch_Time:25297.0min: [2024-01-05 09:58:31,843][model8_pretrain.py][INFO] Epoch:[0/2](607300/4588595) loss:3.204 lr:0.0000100 epoch_Time:25297.0min: [2024-01-05 09:58:31,843][model8_pretrain.py][INFO] Epoch:[0/2](607300/4588595) loss:2.545 lr:0.0000100 epoch_Time:25297.0min: [2024-01-05 09:59:20,769][model8_pretrain.py][INFO] Epoch:[0/2](607400/4588595) loss:2.527 lr:0.0000100 
epoch_Time:25297.0min: [2024-01-05 09:59:20,769][model8_pretrain.py][INFO] Epoch:[0/2](607400/4588595) loss:3.154 lr:0.0000100 epoch_Time:25297.0min: [2024-01-05 09:59:20,769][model8_pretrain.py][INFO] Epoch:[0/2](607400/4588595) loss:3.128 lr:0.0000100 epoch_Time:25297.0min: [2024-01-05 09:59:20,769][model8_pretrain.py][INFO] Epoch:[0/2](607400/4588595) loss:2.266 lr:0.0000100 epoch_Time:25297.0min: [2024-01-05 09:59:20,769][model8_pretrain.py][INFO] Epoch:[0/2](607400/4588595) loss:3.222 lr:0.0000100 epoch_Time:25297.0min: [2024-01-05 09:59:20,769][model8_pretrain.py][INFO] Epoch:[0/2](607400/4588595) loss:3.066 lr:0.0000100 epoch_Time:25297.0min: [2024-01-05 09:59:20,769][model8_pretrain.py][INFO] Epoch:[0/2](607400/4588595) loss:3.186 lr:0.0000100 epoch_Time:25297.0min: [2024-01-05 09:59:20,769][model8_pretrain.py][INFO] Epoch:[0/2](607400/4588595) loss:3.008 lr:0.0000100 epoch_Time:25297.0min: [2024-01-05 09:59:57,694][model8_pretrain.py][INFO] Epoch:[0/2](607500/4588595) loss:2.926 lr:0.0000100 epoch_Time:25296.0min: [2024-01-05 09:59:57,694][model8_pretrain.py][INFO] Epoch:[0/2](607500/4588595) loss:2.707 lr:0.0000100 epoch_Time:25296.0min: [2024-01-05 09:59:57,694][model8_pretrain.py][INFO] Epoch:[0/2](607500/4588595) loss:2.986 lr:0.0000100 epoch_Time:25296.0min: [2024-01-05 09:59:57,694][model8_pretrain.py][INFO] Epoch:[0/2](607500/4588595) loss:3.134 lr:0.0000100 epoch_Time:25296.0min: [2024-01-05 09:59:57,694][model8_pretrain.py][INFO] Epoch:[0/2](607500/4588595) loss:2.170 lr:0.0000100 epoch_Time:25296.0min: [2024-01-05 09:59:57,694][model8_pretrain.py][INFO] Epoch:[0/2](607500/4588595) loss:2.586 lr:0.0000100 epoch_Time:25296.0min: [2024-01-05 09:59:57,694][model8_pretrain.py][INFO] Epoch:[0/2](607500/4588595) loss:3.036 lr:0.0000100 epoch_Time:25296.0min: [2024-01-05 09:59:57,694][model8_pretrain.py][INFO] Epoch:[0/2](607500/4588595) loss:2.684 lr:0.0000100 epoch_Time:25296.0min: [2024-01-05 10:00:34,629][model8_pretrain.py][INFO] 
Epoch:[0/2](607600/4588595) loss:3.381 lr:0.0000100 epoch_Time:25296.0min: [2024-01-05 10:00:34,629][model8_pretrain.py][INFO] Epoch:[0/2](607600/4588595) loss:2.747 lr:0.0000100 epoch_Time:25296.0min: [2024-01-05 10:00:34,629][model8_pretrain.py][INFO] Epoch:[0/2](607600/4588595) loss:3.068 lr:0.0000100 epoch_Time:25296.0min: [2024-01-05 10:00:34,629][model8_pretrain.py][INFO] Epoch:[0/2](607600/4588595) loss:2.647 lr:0.0000100 epoch_Time:25296.0min: [2024-01-05 10:00:34,629][model8_pretrain.py][INFO] Epoch:[0/2](607600/4588595) loss:3.249 lr:0.0000100 epoch_Time:25296.0min: [2024-01-05 10:00:34,629][model8_pretrain.py][INFO] Epoch:[0/2](607600/4588595) loss:2.456 lr:0.0000100 epoch_Time:25296.0min: [2024-01-05 10:00:34,629][model8_pretrain.py][INFO] Epoch:[0/2](607600/4588595) loss:2.702 lr:0.0000100 epoch_Time:25296.0min: [2024-01-05 10:00:34,629][model8_pretrain.py][INFO] Epoch:[0/2](607600/4588595) loss:2.912 lr:0.0000100 epoch_Time:25296.0min: [2024-01-05 10:01:11,564][model8_pretrain.py][INFO] Epoch:[0/2](607700/4588595) loss:2.363 lr:0.0000100 epoch_Time:25295.0min: [2024-01-05 10:01:11,564][model8_pretrain.py][INFO] Epoch:[0/2](607700/4588595) loss:2.925 lr:0.0000100 epoch_Time:25295.0min: [2024-01-05 10:01:11,564][model8_pretrain.py][INFO] Epoch:[0/2](607700/4588595) loss:2.691 lr:0.0000100 epoch_Time:25295.0min: [2024-01-05 10:01:11,564][model8_pretrain.py][INFO] Epoch:[0/2](607700/4588595) loss:2.789 lr:0.0000100 epoch_Time:25295.0min: [2024-01-05 10:01:11,564][model8_pretrain.py][INFO] Epoch:[0/2](607700/4588595) loss:2.269 lr:0.0000100 epoch_Time:25295.0min: [2024-01-05 10:01:11,564][model8_pretrain.py][INFO] Epoch:[0/2](607700/4588595) loss:3.106 lr:0.0000100 epoch_Time:25295.0min: [2024-01-05 10:01:11,564][model8_pretrain.py][INFO] Epoch:[0/2](607700/4588595) loss:2.760 lr:0.0000100 epoch_Time:25295.0min: [2024-01-05 10:01:11,565][model8_pretrain.py][INFO] Epoch:[0/2](607700/4588595) loss:3.240 lr:0.0000100 epoch_Time:25295.0min: [2024-01-05 
10:01:48,515][model8_pretrain.py][INFO] Epoch:[0/2](607800/4588595) loss:2.227 lr:0.0000100 epoch_Time:25294.0min: [2024-01-05 10:01:48,516][model8_pretrain.py][INFO] Epoch:[0/2](607800/4588595) loss:3.140 lr:0.0000100 epoch_Time:25294.0min: [2024-01-05 10:01:48,516][model8_pretrain.py][INFO] Epoch:[0/2](607800/4588595) loss:2.692 lr:0.0000100 epoch_Time:25294.0min: [2024-01-05 10:01:48,516][model8_pretrain.py][INFO] Epoch:[0/2](607800/4588595) loss:3.229 lr:0.0000100 epoch_Time:25294.0min: [2024-01-05 10:01:48,516][model8_pretrain.py][INFO] Epoch:[0/2](607800/4588595) loss:3.393 lr:0.0000100 epoch_Time:25294.0min: [2024-01-05 10:01:48,516][model8_pretrain.py][INFO] Epoch:[0/2](607800/4588595) loss:2.839 lr:0.0000100 epoch_Time:25294.0min: [2024-01-05 10:01:48,516][model8_pretrain.py][INFO] Epoch:[0/2](607800/4588595) loss:2.654 lr:0.0000100 epoch_Time:25294.0min: [2024-01-05 10:01:48,516][model8_pretrain.py][INFO] Epoch:[0/2](607800/4588595) loss:3.152 lr:0.0000100 epoch_Time:25294.0min: [2024-01-05 10:02:25,471][model8_pretrain.py][INFO] Epoch:[0/2](607900/4588595) loss:2.864 lr:0.0000100 epoch_Time:25294.0min: [2024-01-05 10:02:25,471][model8_pretrain.py][INFO] Epoch:[0/2](607900/4588595) loss:3.085 lr:0.0000100 epoch_Time:25294.0min: [2024-01-05 10:02:25,471][model8_pretrain.py][INFO] Epoch:[0/2](607900/4588595) loss:2.998 lr:0.0000100 epoch_Time:25294.0min: [2024-01-05 10:02:25,471][model8_pretrain.py][INFO] Epoch:[0/2](607900/4588595) loss:3.263 lr:0.0000100 epoch_Time:25294.0min: [2024-01-05 10:02:25,471][model8_pretrain.py][INFO] Epoch:[0/2](607900/4588595) loss:2.909 lr:0.0000100 epoch_Time:25294.0min: [2024-01-05 10:02:25,471][model8_pretrain.py][INFO] Epoch:[0/2](607900/4588595) loss:3.159 lr:0.0000100 epoch_Time:25294.0min: [2024-01-05 10:02:25,471][model8_pretrain.py][INFO] Epoch:[0/2](607900/4588595) loss:3.104 lr:0.0000100 epoch_Time:25294.0min: [2024-01-05 10:02:25,471][model8_pretrain.py][INFO] Epoch:[0/2](607900/4588595) loss:2.878 lr:0.0000100 
epoch_Time:25294.0min: [2024-01-05 10:03:02,422][model8_pretrain.py][INFO] Epoch:[0/2](608000/4588595) loss:2.375 lr:0.0000100 epoch_Time:25293.0min: [2024-01-05 10:03:02,422][model8_pretrain.py][INFO] Epoch:[0/2](608000/4588595) loss:3.000 lr:0.0000100 epoch_Time:25292.0min: [2024-01-05 10:03:02,422][model8_pretrain.py][INFO] Epoch:[0/2](608000/4588595) loss:2.698 lr:0.0000100 epoch_Time:25292.0min: [2024-01-05 10:03:02,422][model8_pretrain.py][INFO] Epoch:[0/2](608000/4588595) loss:2.845 lr:0.0000100 epoch_Time:25292.0min: [2024-01-05 10:03:02,422][model8_pretrain.py][INFO] Epoch:[0/2](608000/4588595) loss:2.908 lr:0.0000100 epoch_Time:25293.0min: [2024-01-05 10:03:02,422][model8_pretrain.py][INFO] Epoch:[0/2](608000/4588595) loss:2.752 lr:0.0000100 epoch_Time:25292.0min: [2024-01-05 10:03:02,422][model8_pretrain.py][INFO] Epoch:[0/2](608000/4588595) loss:3.084 lr:0.0000100 epoch_Time:25292.0min: [2024-01-05 10:03:02,422][model8_pretrain.py][INFO] Epoch:[0/2](608000/4588595) loss:2.696 lr:0.0000100 epoch_Time:25292.0min: [2024-01-05 10:03:39,366][model8_pretrain.py][INFO] Epoch:[0/2](608100/4588595) loss:2.899 lr:0.0000100 epoch_Time:25292.0min: [2024-01-05 10:03:39,366][model8_pretrain.py][INFO] Epoch:[0/2](608100/4588595) loss:2.575 lr:0.0000100 epoch_Time:25292.0min: [2024-01-05 10:03:39,366][model8_pretrain.py][INFO] Epoch:[0/2](608100/4588595) loss:2.606 lr:0.0000100 epoch_Time:25292.0min: [2024-01-05 10:03:39,366][model8_pretrain.py][INFO] Epoch:[0/2](608100/4588595) loss:3.040 lr:0.0000100 epoch_Time:25292.0min: [2024-01-05 10:03:39,366][model8_pretrain.py][INFO] Epoch:[0/2](608100/4588595) loss:2.491 lr:0.0000100 epoch_Time:25292.0min: [2024-01-05 10:03:39,366][model8_pretrain.py][INFO] Epoch:[0/2](608100/4588595) loss:2.800 lr:0.0000100 epoch_Time:25292.0min: [2024-01-05 10:03:39,366][model8_pretrain.py][INFO] Epoch:[0/2](608100/4588595) loss:2.973 lr:0.0000100 epoch_Time:25292.0min: [2024-01-05 10:03:39,366][model8_pretrain.py][INFO] 
Epoch:[0/2](608100/4588595) loss:2.965 lr:0.0000100 epoch_Time:25292.0min: [2024-01-05 10:04:28,108][model8_pretrain.py][INFO] Epoch:[0/2](608200/4588595) loss:2.951 lr:0.0000100 epoch_Time:25293.0min: [2024-01-05 10:04:28,108][model8_pretrain.py][INFO] Epoch:[0/2](608200/4588595) loss:2.989 lr:0.0000100 epoch_Time:25293.0min: [2024-01-05 10:04:28,108][model8_pretrain.py][INFO] Epoch:[0/2](608200/4588595) loss:2.373 lr:0.0000100 epoch_Time:25293.0min: [2024-01-05 10:04:28,108][model8_pretrain.py][INFO] Epoch:[0/2](608200/4588595) loss:2.693 lr:0.0000100 epoch_Time:25293.0min: [2024-01-05 10:04:28,108][model8_pretrain.py][INFO] Epoch:[0/2](608200/4588595) loss:2.865 lr:0.0000100 epoch_Time:25293.0min: [2024-01-05 10:04:28,108][model8_pretrain.py][INFO] Epoch:[0/2](608200/4588595) loss:2.732 lr:0.0000100 epoch_Time:25293.0min: [2024-01-05 10:04:28,108][model8_pretrain.py][INFO] Epoch:[0/2](608200/4588595) loss:2.626 lr:0.0000100 epoch_Time:25293.0min: [2024-01-05 10:04:28,108][model8_pretrain.py][INFO] Epoch:[0/2](608200/4588595) loss:3.134 lr:0.0000100 epoch_Time:25293.0min: [2024-01-05 10:05:05,047][model8_pretrain.py][INFO] Epoch:[0/2](608300/4588595) loss:3.376 lr:0.0000100 epoch_Time:25292.0min: [2024-01-05 10:05:05,047][model8_pretrain.py][INFO] Epoch:[0/2](608300/4588595) loss:2.848 lr:0.0000100 epoch_Time:25292.0min: [2024-01-05 10:05:05,047][model8_pretrain.py][INFO] Epoch:[0/2](608300/4588595) loss:2.533 lr:0.0000100 epoch_Time:25292.0min: [2024-01-05 10:05:05,048][model8_pretrain.py][INFO] Epoch:[0/2](608300/4588595) loss:2.988 lr:0.0000100 epoch_Time:25292.0min: [2024-01-05 10:05:05,048][model8_pretrain.py][INFO] Epoch:[0/2](608300/4588595) loss:2.066 lr:0.0000100 epoch_Time:25292.0min: [2024-01-05 10:05:05,048][model8_pretrain.py][INFO] Epoch:[0/2](608300/4588595) loss:2.691 lr:0.0000100 epoch_Time:25292.0min: [2024-01-05 10:05:05,048][model8_pretrain.py][INFO] Epoch:[0/2](608300/4588595) loss:2.597 lr:0.0000100 epoch_Time:25292.0min: [2024-01-05 
10:05:05,048][model8_pretrain.py][INFO] Epoch:[0/2](608300/4588595) loss:3.128 lr:0.0000100 epoch_Time:25292.0min: [2024-01-05 10:05:41,997][model8_pretrain.py][INFO] Epoch:[0/2](608400/4588595) loss:2.505 lr:0.0000100 epoch_Time:25291.0min: [2024-01-05 10:05:41,998][model8_pretrain.py][INFO] Epoch:[0/2](608400/4588595) loss:2.643 lr:0.0000100 epoch_Time:25291.0min: [2024-01-05 10:05:41,998][model8_pretrain.py][INFO] Epoch:[0/2](608400/4588595) loss:2.975 lr:0.0000100 epoch_Time:25291.0min: [2024-01-05 10:05:41,998][model8_pretrain.py][INFO] Epoch:[0/2](608400/4588595) loss:2.752 lr:0.0000100 epoch_Time:25291.0min: [2024-01-05 10:05:41,998][model8_pretrain.py][INFO] Epoch:[0/2](608400/4588595) loss:2.616 lr:0.0000100 epoch_Time:25291.0min: [2024-01-05 10:05:41,998][model8_pretrain.py][INFO] Epoch:[0/2](608400/4588595) loss:2.587 lr:0.0000100 epoch_Time:25291.0min: [2024-01-05 10:05:41,998][model8_pretrain.py][INFO] Epoch:[0/2](608400/4588595) loss:2.447 lr:0.0000100 epoch_Time:25291.0min: [2024-01-05 10:05:41,998][model8_pretrain.py][INFO] Epoch:[0/2](608400/4588595) loss:3.076 lr:0.0000100 epoch_Time:25291.0min: [2024-01-05 10:06:18,958][model8_pretrain.py][INFO] Epoch:[0/2](608500/4588595) loss:3.291 lr:0.0000100 epoch_Time:25290.0min: [2024-01-05 10:06:18,958][model8_pretrain.py][INFO] Epoch:[0/2](608500/4588595) loss:3.161 lr:0.0000100 epoch_Time:25290.0min: [2024-01-05 10:06:18,958][model8_pretrain.py][INFO] Epoch:[0/2](608500/4588595) loss:2.915 lr:0.0000100 epoch_Time:25290.0min: [2024-01-05 10:06:18,958][model8_pretrain.py][INFO] Epoch:[0/2](608500/4588595) loss:2.927 lr:0.0000100 epoch_Time:25290.0min: [2024-01-05 10:06:18,958][model8_pretrain.py][INFO] Epoch:[0/2](608500/4588595) loss:3.464 lr:0.0000100 epoch_Time:25290.0min: [2024-01-05 10:06:18,958][model8_pretrain.py][INFO] Epoch:[0/2](608500/4588595) loss:2.790 lr:0.0000100 epoch_Time:25290.0min: [2024-01-05 10:06:18,958][model8_pretrain.py][INFO] Epoch:[0/2](608500/4588595) loss:2.594 lr:0.0000100 
epoch_Time:25290.0min: [2024-01-05 10:06:18,958][model8_pretrain.py][INFO] Epoch:[0/2](608500/4588595) loss:3.120 lr:0.0000100 epoch_Time:25290.0min: [2024-01-05 10:06:55,913][model8_pretrain.py][INFO] Epoch:[0/2](608600/4588595) loss:2.861 lr:0.0000100 epoch_Time:25289.0min: [2024-01-05 10:06:55,913][model8_pretrain.py][INFO] Epoch:[0/2](608600/4588595) loss:3.314 lr:0.0000100 epoch_Time:25289.0min: [2024-01-05 10:06:55,913][model8_pretrain.py][INFO] Epoch:[0/2](608600/4588595) loss:2.572 lr:0.0000100 epoch_Time:25289.0min: [2024-01-05 10:06:55,913][model8_pretrain.py][INFO] Epoch:[0/2](608600/4588595) loss:2.837 lr:0.0000100 epoch_Time:25289.0min: [2024-01-05 10:06:55,913][model8_pretrain.py][INFO] Epoch:[0/2](608600/4588595) loss:2.564 lr:0.0000100 epoch_Time:25289.0min: [2024-01-05 10:06:55,913][model8_pretrain.py][INFO] Epoch:[0/2](608600/4588595) loss:2.958 lr:0.0000100 epoch_Time:25289.0min: [2024-01-05 10:06:55,913][model8_pretrain.py][INFO] Epoch:[0/2](608600/4588595) loss:2.901 lr:0.0000100 epoch_Time:25289.0min: [2024-01-05 10:06:55,913][model8_pretrain.py][INFO] Epoch:[0/2](608600/4588595) loss:3.018 lr:0.0000100 epoch_Time:25289.0min: [2024-01-05 10:07:32,884][model8_pretrain.py][INFO] Epoch:[0/2](608700/4588595) loss:3.001 lr:0.0000100 epoch_Time:25289.0min: [2024-01-05 10:07:32,884][model8_pretrain.py][INFO] Epoch:[0/2](608700/4588595) loss:2.589 lr:0.0000100 epoch_Time:25289.0min: [2024-01-05 10:07:32,884][model8_pretrain.py][INFO] Epoch:[0/2](608700/4588595) loss:2.327 lr:0.0000100 epoch_Time:25289.0min: [2024-01-05 10:07:32,884][model8_pretrain.py][INFO] Epoch:[0/2](608700/4588595) loss:2.392 lr:0.0000100 epoch_Time:25289.0min: [2024-01-05 10:07:32,884][model8_pretrain.py][INFO] Epoch:[0/2](608700/4588595) loss:2.716 lr:0.0000100 epoch_Time:25289.0min: [2024-01-05 10:07:32,884][model8_pretrain.py][INFO] Epoch:[0/2](608700/4588595) loss:2.519 lr:0.0000100 epoch_Time:25289.0min: [2024-01-05 10:07:32,884][model8_pretrain.py][INFO] 
Epoch:[0/2](608700/4588595) loss:2.611 lr:0.0000100 epoch_Time:25289.0min: [2024-01-05 10:07:32,884][model8_pretrain.py][INFO] Epoch:[0/2](608700/4588595) loss:3.141 lr:0.0000100 epoch_Time:25289.0min: [2024-01-05 10:08:09,853][model8_pretrain.py][INFO] Epoch:[0/2](608800/4588595) loss:2.788 lr:0.0000100 epoch_Time:25288.0min: [2024-01-05 10:08:09,853][model8_pretrain.py][INFO] Epoch:[0/2](608800/4588595) loss:2.414 lr:0.0000100 epoch_Time:25288.0min: [2024-01-05 10:08:09,853][model8_pretrain.py][INFO] Epoch:[0/2](608800/4588595) loss:3.299 lr:0.0000100 epoch_Time:25288.0min: [2024-01-05 10:08:09,853][model8_pretrain.py][INFO] Epoch:[0/2](608800/4588595) loss:2.608 lr:0.0000100 epoch_Time:25288.0min: [2024-01-05 10:08:09,853][model8_pretrain.py][INFO] Epoch:[0/2](608800/4588595) loss:2.623 lr:0.0000100 epoch_Time:25288.0min: [2024-01-05 10:08:09,853][model8_pretrain.py][INFO] Epoch:[0/2](608800/4588595) loss:3.318 lr:0.0000100 epoch_Time:25288.0min: [2024-01-05 10:08:09,854][model8_pretrain.py][INFO] Epoch:[0/2](608800/4588595) loss:3.060 lr:0.0000100 epoch_Time:25288.0min: [2024-01-05 10:08:09,854][model8_pretrain.py][INFO] Epoch:[0/2](608800/4588595) loss:3.084 lr:0.0000100 epoch_Time:25288.0min: [2024-01-05 10:08:46,800][model8_pretrain.py][INFO] Epoch:[0/2](608900/4588595) loss:3.014 lr:0.0000100 epoch_Time:25288.0min: [2024-01-05 10:08:46,800][model8_pretrain.py][INFO] Epoch:[0/2](608900/4588595) loss:2.900 lr:0.0000100 epoch_Time:25288.0min: [2024-01-05 10:08:46,800][model8_pretrain.py][INFO] Epoch:[0/2](608900/4588595) loss:2.479 lr:0.0000100 epoch_Time:25288.0min: [2024-01-05 10:08:46,800][model8_pretrain.py][INFO] Epoch:[0/2](608900/4588595) loss:2.737 lr:0.0000100 epoch_Time:25288.0min: [2024-01-05 10:08:46,800][model8_pretrain.py][INFO] Epoch:[0/2](608900/4588595) loss:2.544 lr:0.0000100 epoch_Time:25288.0min: [2024-01-05 10:08:46,800][model8_pretrain.py][INFO] Epoch:[0/2](608900/4588595) loss:3.162 lr:0.0000100 epoch_Time:25288.0min: [2024-01-05 
10:08:46,800][model8_pretrain.py][INFO] Epoch:[0/2](608900/4588595) loss:2.874 lr:0.0000100 epoch_Time:25288.0min: [2024-01-05 10:08:46,800][model8_pretrain.py][INFO] Epoch:[0/2](608900/4588595) loss:2.975 lr:0.0000100 epoch_Time:25288.0min: [2024-01-05 10:09:35,592][model8_pretrain.py][INFO] Epoch:[0/2](609000/4588595) loss:3.162 lr:0.0000100 epoch_Time:25288.0min: [2024-01-05 10:09:35,592][model8_pretrain.py][INFO] Epoch:[0/2](609000/4588595) loss:3.146 lr:0.0000100 epoch_Time:25288.0min: [2024-01-05 10:09:35,592][model8_pretrain.py][INFO] Epoch:[0/2](609000/4588595) loss:2.833 lr:0.0000100 epoch_Time:25288.0min: [2024-01-05 10:09:35,592][model8_pretrain.py][INFO] Epoch:[0/2](609000/4588595) loss:2.997 lr:0.0000100 epoch_Time:25288.0min: [2024-01-05 10:09:35,592][model8_pretrain.py][INFO] Epoch:[0/2](609000/4588595) loss:2.646 lr:0.0000100 epoch_Time:25288.0min: [2024-01-05 10:09:35,592][model8_pretrain.py][INFO] Epoch:[0/2](609000/4588595) loss:2.928 lr:0.0000100 epoch_Time:25288.0min: [2024-01-05 10:09:35,592][model8_pretrain.py][INFO] Epoch:[0/2](609000/4588595) loss:2.351 lr:0.0000100 epoch_Time:25288.0min: [2024-01-05 10:09:35,592][model8_pretrain.py][INFO] Epoch:[0/2](609000/4588595) loss:3.099 lr:0.0000100 epoch_Time:25288.0min: [2024-01-05 10:10:12,529][model8_pretrain.py][INFO] Epoch:[0/2](609100/4588595) loss:2.891 lr:0.0000100 epoch_Time:25287.0min: [2024-01-05 10:10:12,529][model8_pretrain.py][INFO] Epoch:[0/2](609100/4588595) loss:3.107 lr:0.0000100 epoch_Time:25287.0min: [2024-01-05 10:10:12,529][model8_pretrain.py][INFO] Epoch:[0/2](609100/4588595) loss:2.678 lr:0.0000100 epoch_Time:25287.0min: [2024-01-05 10:10:12,529][model8_pretrain.py][INFO] Epoch:[0/2](609100/4588595) loss:2.875 lr:0.0000100 epoch_Time:25287.0min: [2024-01-05 10:10:12,529][model8_pretrain.py][INFO] Epoch:[0/2](609100/4588595) loss:3.103 lr:0.0000100 epoch_Time:25287.0min: [2024-01-05 10:10:12,529][model8_pretrain.py][INFO] Epoch:[0/2](609100/4588595) loss:2.705 lr:0.0000100 
epoch_Time:25287.0min: [2024-01-05 10:10:12,529][model8_pretrain.py][INFO] Epoch:[0/2](609100/4588595) loss:2.648 lr:0.0000100 epoch_Time:25287.0min: [2024-01-05 10:10:12,529][model8_pretrain.py][INFO] Epoch:[0/2](609100/4588595) loss:3.013 lr:0.0000100 epoch_Time:25287.0min: [2024-01-05 10:10:49,468][model8_pretrain.py][INFO] Epoch:[0/2](609200/4588595) loss:3.144 lr:0.0000100 epoch_Time:25286.0min: [2024-01-05 10:10:49,468][model8_pretrain.py][INFO] Epoch:[0/2](609200/4588595) loss:3.274 lr:0.0000100 epoch_Time:25286.0min: [2024-01-05 10:10:49,468][model8_pretrain.py][INFO] Epoch:[0/2](609200/4588595) loss:2.979 lr:0.0000100 epoch_Time:25286.0min: [2024-01-05 10:10:49,468][model8_pretrain.py][INFO] Epoch:[0/2](609200/4588595) loss:2.099 lr:0.0000100 epoch_Time:25286.0min: [2024-01-05 10:10:49,468][model8_pretrain.py][INFO] Epoch:[0/2](609200/4588595) loss:2.835 lr:0.0000100 epoch_Time:25286.0min: [2024-01-05 10:10:49,468][model8_pretrain.py][INFO] Epoch:[0/2](609200/4588595) loss:2.863 lr:0.0000100 epoch_Time:25286.0min: [2024-01-05 10:10:49,468][model8_pretrain.py][INFO] Epoch:[0/2](609200/4588595) loss:2.951 lr:0.0000100 epoch_Time:25286.0min: [2024-01-05 10:10:49,468][model8_pretrain.py][INFO] Epoch:[0/2](609200/4588595) loss:2.598 lr:0.0000100 epoch_Time:25286.0min: [2024-01-05 10:11:26,406][model8_pretrain.py][INFO] Epoch:[0/2](609300/4588595) loss:2.891 lr:0.0000100 epoch_Time:25286.0min: [2024-01-05 10:11:26,406][model8_pretrain.py][INFO] Epoch:[0/2](609300/4588595) loss:3.071 lr:0.0000100 epoch_Time:25286.0min: [2024-01-05 10:11:26,406][model8_pretrain.py][INFO] Epoch:[0/2](609300/4588595) loss:3.450 lr:0.0000100 epoch_Time:25286.0min: [2024-01-05 10:11:26,406][model8_pretrain.py][INFO] Epoch:[0/2](609300/4588595) loss:2.560 lr:0.0000100 epoch_Time:25286.0min: [2024-01-05 10:11:26,406][model8_pretrain.py][INFO] Epoch:[0/2](609300/4588595) loss:2.419 lr:0.0000100 epoch_Time:25286.0min: [2024-01-05 10:11:26,406][model8_pretrain.py][INFO] 
Epoch:[0/2](609300/4588595) loss:3.043 lr:0.0000100 epoch_Time:25286.0min: [2024-01-05 10:11:26,406][model8_pretrain.py][INFO] Epoch:[0/2](609300/4588595) loss:2.175 lr:0.0000100 epoch_Time:25286.0min: [2024-01-05 10:11:26,407][model8_pretrain.py][INFO] Epoch:[0/2](609300/4588595) loss:3.157 lr:0.0000100 epoch_Time:25286.0min: [2024-01-05 10:12:03,354][model8_pretrain.py][INFO] Epoch:[0/2](609400/4588595) loss:3.000 lr:0.0000100 epoch_Time:25284.0min: [2024-01-05 10:12:03,354][model8_pretrain.py][INFO] Epoch:[0/2](609400/4588595) loss:2.818 lr:0.0000100 epoch_Time:25284.0min: [2024-01-05 10:12:03,354][model8_pretrain.py][INFO] Epoch:[0/2](609400/4588595) loss:2.546 lr:0.0000100 epoch_Time:25284.0min: [2024-01-05 10:12:03,354][model8_pretrain.py][INFO] Epoch:[0/2](609400/4588595) loss:2.596 lr:0.0000100 epoch_Time:25284.0min: [2024-01-05 10:12:03,354][model8_pretrain.py][INFO] Epoch:[0/2](609400/4588595) loss:2.926 lr:0.0000100 epoch_Time:25284.0min: [2024-01-05 10:12:03,354][model8_pretrain.py][INFO] Epoch:[0/2](609400/4588595) loss:2.340 lr:0.0000100 epoch_Time:25284.0min: [2024-01-05 10:12:03,354][model8_pretrain.py][INFO] Epoch:[0/2](609400/4588595) loss:2.822 lr:0.0000100 epoch_Time:25284.0min: [2024-01-05 10:12:03,354][model8_pretrain.py][INFO] Epoch:[0/2](609400/4588595) loss:3.175 lr:0.0000100 epoch_Time:25284.0min: [2024-01-05 10:12:40,282][model8_pretrain.py][INFO] Epoch:[0/2](609500/4588595) loss:3.079 lr:0.0000100 epoch_Time:25284.0min: [2024-01-05 10:12:40,282][model8_pretrain.py][INFO] Epoch:[0/2](609500/4588595) loss:2.675 lr:0.0000100 epoch_Time:25284.0min: [2024-01-05 10:12:40,282][model8_pretrain.py][INFO] Epoch:[0/2](609500/4588595) loss:1.933 lr:0.0000100 epoch_Time:25284.0min: [2024-01-05 10:12:40,282][model8_pretrain.py][INFO] Epoch:[0/2](609500/4588595) loss:2.965 lr:0.0000100 epoch_Time:25284.0min: [2024-01-05 10:12:40,282][model8_pretrain.py][INFO] Epoch:[0/2](609500/4588595) loss:3.104 lr:0.0000100 epoch_Time:25284.0min: [2024-01-05 
10:12:40,282][model8_pretrain.py][INFO] Epoch:[0/2](609500/4588595) loss:2.845 lr:0.0000100 epoch_Time:25284.0min: [2024-01-05 10:12:40,283][model8_pretrain.py][INFO] Epoch:[0/2](609500/4588595) loss:2.755 lr:0.0000100 epoch_Time:25284.0min: [2024-01-05 10:12:40,283][model8_pretrain.py][INFO] Epoch:[0/2](609500/4588595) loss:2.704 lr:0.0000100 epoch_Time:25284.0min: [2024-01-05 10:13:17,224][model8_pretrain.py][INFO] Epoch:[0/2](609600/4588595) loss:2.821 lr:0.0000100 epoch_Time:25283.0min: [2024-01-05 10:13:17,224][model8_pretrain.py][INFO] Epoch:[0/2](609600/4588595) loss:3.096 lr:0.0000100 epoch_Time:25283.0min: [2024-01-05 10:13:17,224][model8_pretrain.py][INFO] Epoch:[0/2](609600/4588595) loss:2.675 lr:0.0000100 epoch_Time:25283.0min: [2024-01-05 10:13:17,224][model8_pretrain.py][INFO] Epoch:[0/2](609600/4588595) loss:2.764 lr:0.0000100 epoch_Time:25283.0min: [2024-01-05 10:13:17,224][model8_pretrain.py][INFO] Epoch:[0/2](609600/4588595) loss:2.994 lr:0.0000100 epoch_Time:25283.0min: [2024-01-05 10:13:17,224][model8_pretrain.py][INFO] Epoch:[0/2](609600/4588595) loss:2.763 lr:0.0000100 epoch_Time:25283.0min: [2024-01-05 10:13:17,224][model8_pretrain.py][INFO] Epoch:[0/2](609600/4588595) loss:2.618 lr:0.0000100 epoch_Time:25283.0min: [2024-01-05 10:13:17,224][model8_pretrain.py][INFO] Epoch:[0/2](609600/4588595) loss:3.354 lr:0.0000100 epoch_Time:25283.0min: [2024-01-05 10:13:54,172][model8_pretrain.py][INFO] Epoch:[0/2](609700/4588595) loss:2.733 lr:0.0000100 epoch_Time:25282.0min: [2024-01-05 10:13:54,172][model8_pretrain.py][INFO] Epoch:[0/2](609700/4588595) loss:3.316 lr:0.0000100 epoch_Time:25282.0min: [2024-01-05 10:13:54,172][model8_pretrain.py][INFO] Epoch:[0/2](609700/4588595) loss:2.634 lr:0.0000100 epoch_Time:25282.0min: [2024-01-05 10:13:54,172][model8_pretrain.py][INFO] Epoch:[0/2](609700/4588595) loss:3.322 lr:0.0000100 epoch_Time:25282.0min: [2024-01-05 10:13:54,172][model8_pretrain.py][INFO] Epoch:[0/2](609700/4588595) loss:3.003 lr:0.0000100 
epoch_Time:25282.0min: [2024-01-05 10:13:54,172][model8_pretrain.py][INFO] Epoch:[0/2](609700/4588595) loss:2.664 lr:0.0000100 epoch_Time:25282.0min: [2024-01-05 10:13:54,172][model8_pretrain.py][INFO] Epoch:[0/2](609700/4588595) loss:2.922 lr:0.0000100 epoch_Time:25282.0min: [2024-01-05 10:13:54,172][model8_pretrain.py][INFO] Epoch:[0/2](609700/4588595) loss:3.202 lr:0.0000100 epoch_Time:25282.0min: [2024-01-05 10:14:42,917][model8_pretrain.py][INFO] Epoch:[0/2](609800/4588595) loss:2.926 lr:0.0000100 epoch_Time:25283.0min: [2024-01-05 10:14:42,917][model8_pretrain.py][INFO] Epoch:[0/2](609800/4588595) loss:2.813 lr:0.0000100 epoch_Time:25283.0min: [2024-01-05 10:14:42,917][model8_pretrain.py][INFO] Epoch:[0/2](609800/4588595) loss:2.732 lr:0.0000100 epoch_Time:25283.0min: [2024-01-05 10:14:42,917][model8_pretrain.py][INFO] Epoch:[0/2](609800/4588595) loss:2.691 lr:0.0000100 epoch_Time:25283.0min: [2024-01-05 10:14:42,917][model8_pretrain.py][INFO] Epoch:[0/2](609800/4588595) loss:2.254 lr:0.0000100 epoch_Time:25283.0min: [2024-01-05 10:14:42,917][model8_pretrain.py][INFO] Epoch:[0/2](609800/4588595) loss:2.760 lr:0.0000100 epoch_Time:25283.0min: [2024-01-05 10:14:42,917][model8_pretrain.py][INFO] Epoch:[0/2](609800/4588595) loss:2.362 lr:0.0000100 epoch_Time:25283.0min: [2024-01-05 10:14:42,917][model8_pretrain.py][INFO] Epoch:[0/2](609800/4588595) loss:2.694 lr:0.0000100 epoch_Time:25283.0min: [2024-01-05 10:15:19,857][model8_pretrain.py][INFO] Epoch:[0/2](609900/4588595) loss:2.899 lr:0.0000100 epoch_Time:25282.0min: [2024-01-05 10:15:19,857][model8_pretrain.py][INFO] Epoch:[0/2](609900/4588595) loss:2.710 lr:0.0000100 epoch_Time:25282.0min: [2024-01-05 10:15:19,857][model8_pretrain.py][INFO] Epoch:[0/2](609900/4588595) loss:2.591 lr:0.0000100 epoch_Time:25282.0min: [2024-01-05 10:15:19,857][model8_pretrain.py][INFO] Epoch:[0/2](609900/4588595) loss:2.979 lr:0.0000100 epoch_Time:25282.0min: [2024-01-05 10:15:19,857][model8_pretrain.py][INFO] 
Epoch:[0/2](609900/4588595) loss:2.605 lr:0.0000100 epoch_Time:25282.0min: [2024-01-05 10:15:19,857][model8_pretrain.py][INFO] Epoch:[0/2](609900/4588595) loss:2.661 lr:0.0000100 epoch_Time:25282.0min: [2024-01-05 10:15:19,857][model8_pretrain.py][INFO] Epoch:[0/2](609900/4588595) loss:2.697 lr:0.0000100 epoch_Time:25282.0min: [2024-01-05 10:15:19,857][model8_pretrain.py][INFO] Epoch:[0/2](609900/4588595) loss:2.829 lr:0.0000100 epoch_Time:25282.0min: [2024-01-05 10:15:56,794][model8_pretrain.py][INFO] Epoch:[0/2](610000/4588595) loss:2.865 lr:0.0000100 epoch_Time:25281.0min: [2024-01-05 10:15:56,794][model8_pretrain.py][INFO] Epoch:[0/2](610000/4588595) loss:3.027 lr:0.0000100 epoch_Time:25281.0min: [2024-01-05 10:15:56,794][model8_pretrain.py][INFO] Epoch:[0/2](610000/4588595) loss:2.771 lr:0.0000100 epoch_Time:25281.0min: [2024-01-05 10:15:56,794][model8_pretrain.py][INFO] Epoch:[0/2](610000/4588595) loss:2.980 lr:0.0000100 epoch_Time:25281.0min: [2024-01-05 10:15:56,794][model8_pretrain.py][INFO] Epoch:[0/2](610000/4588595) loss:3.098 lr:0.0000100 epoch_Time:25281.0min: [2024-01-05 10:15:56,794][model8_pretrain.py][INFO] Epoch:[0/2](610000/4588595) loss:3.025 lr:0.0000100 epoch_Time:25281.0min: [2024-01-05 10:15:56,794][model8_pretrain.py][INFO] Epoch:[0/2](610000/4588595) loss:2.770 lr:0.0000100 epoch_Time:25281.0min: [2024-01-05 10:15:56,795][model8_pretrain.py][INFO] Epoch:[0/2](610000/4588595) loss:2.455 lr:0.0000100 epoch_Time:25281.0min: [2024-01-05 10:16:33,740][model8_pretrain.py][INFO] Epoch:[0/2](610100/4588595) loss:2.727 lr:0.0000100 epoch_Time:25281.0min: [2024-01-05 10:16:33,740][model8_pretrain.py][INFO] Epoch:[0/2](610100/4588595) loss:2.474 lr:0.0000100 epoch_Time:25281.0min: [2024-01-05 10:16:33,740][model8_pretrain.py][INFO] Epoch:[0/2](610100/4588595) loss:3.253 lr:0.0000100 epoch_Time:25281.0min: [2024-01-05 10:16:33,740][model8_pretrain.py][INFO] Epoch:[0/2](610100/4588595) loss:2.363 lr:0.0000100 epoch_Time:25281.0min: [2024-01-05 
10:16:33,740][model8_pretrain.py][INFO] Epoch:[0/2](610100/4588595) loss:2.889 lr:0.0000100 epoch_Time:25281.0min: [2024-01-05 10:16:33,740][model8_pretrain.py][INFO] Epoch:[0/2](610100/4588595) loss:2.378 lr:0.0000100 epoch_Time:25281.0min: [2024-01-05 10:16:33,740][model8_pretrain.py][INFO] Epoch:[0/2](610100/4588595) loss:3.042 lr:0.0000100 epoch_Time:25281.0min: [2024-01-05 10:16:33,740][model8_pretrain.py][INFO] Epoch:[0/2](610100/4588595) loss:2.761 lr:0.0000100 epoch_Time:25281.0min: [2024-01-05 10:17:10,689][model8_pretrain.py][INFO] Epoch:[0/2](610200/4588595) loss:2.840 lr:0.0000100 epoch_Time:25280.0min: [2024-01-05 10:17:10,689][model8_pretrain.py][INFO] Epoch:[0/2](610200/4588595) loss:2.254 lr:0.0000100 epoch_Time:25280.0min: [2024-01-05 10:17:10,689][model8_pretrain.py][INFO] Epoch:[0/2](610200/4588595) loss:2.963 lr:0.0000100 epoch_Time:25280.0min: [2024-01-05 10:17:10,689][model8_pretrain.py][INFO] Epoch:[0/2](610200/4588595) loss:3.075 lr:0.0000100 epoch_Time:25280.0min: [2024-01-05 10:17:10,689][model8_pretrain.py][INFO] Epoch:[0/2](610200/4588595) loss:2.918 lr:0.0000100 epoch_Time:25280.0min: [2024-01-05 10:17:10,689][model8_pretrain.py][INFO] Epoch:[0/2](610200/4588595) loss:2.870 lr:0.0000100 epoch_Time:25280.0min: [2024-01-05 10:17:10,689][model8_pretrain.py][INFO] Epoch:[0/2](610200/4588595) loss:2.856 lr:0.0000100 epoch_Time:25280.0min: [2024-01-05 10:17:10,689][model8_pretrain.py][INFO] Epoch:[0/2](610200/4588595) loss:3.010 lr:0.0000100 epoch_Time:25280.0min: [2024-01-05 10:17:47,638][model8_pretrain.py][INFO] Epoch:[0/2](610300/4588595) loss:3.293 lr:0.0000100 epoch_Time:25279.0min: [2024-01-05 10:17:47,638][model8_pretrain.py][INFO] Epoch:[0/2](610300/4588595) loss:3.048 lr:0.0000100 epoch_Time:25279.0min: [2024-01-05 10:17:47,638][model8_pretrain.py][INFO] Epoch:[0/2](610300/4588595) loss:2.668 lr:0.0000100 epoch_Time:25279.0min: [2024-01-05 10:17:47,638][model8_pretrain.py][INFO] Epoch:[0/2](610300/4588595) loss:3.273 lr:0.0000100 
epoch_Time:25279.0min: [2024-01-05 10:17:47,638][model8_pretrain.py][INFO] Epoch:[0/2](610300/4588595) loss:2.632 lr:0.0000100 epoch_Time:25279.0min: [2024-01-05 10:17:47,638][model8_pretrain.py][INFO] Epoch:[0/2](610300/4588595) loss:3.004 lr:0.0000100 epoch_Time:25279.0min: [2024-01-05 10:17:47,638][model8_pretrain.py][INFO] Epoch:[0/2](610300/4588595) loss:2.696 lr:0.0000100 epoch_Time:25279.0min: [2024-01-05 10:17:47,638][model8_pretrain.py][INFO] Epoch:[0/2](610300/4588595) loss:2.734 lr:0.0000100 epoch_Time:25279.0min: [2024-01-05 10:18:24,579][model8_pretrain.py][INFO] Epoch:[0/2](610400/4588595) loss:3.139 lr:0.0000100 epoch_Time:25278.0min: [2024-01-05 10:18:24,579][model8_pretrain.py][INFO] Epoch:[0/2](610400/4588595) loss:3.005 lr:0.0000100 epoch_Time:25278.0min: [2024-01-05 10:18:24,579][model8_pretrain.py][INFO] Epoch:[0/2](610400/4588595) loss:2.561 lr:0.0000100 epoch_Time:25278.0min: [2024-01-05 10:18:24,579][model8_pretrain.py][INFO] Epoch:[0/2](610400/4588595) loss:2.701 lr:0.0000100 epoch_Time:25278.0min: [2024-01-05 10:18:24,579][model8_pretrain.py][INFO] Epoch:[0/2](610400/4588595) loss:2.819 lr:0.0000100 epoch_Time:25278.0min: [2024-01-05 10:18:24,579][model8_pretrain.py][INFO] Epoch:[0/2](610400/4588595) loss:3.102 lr:0.0000100 epoch_Time:25278.0min: [2024-01-05 10:18:24,580][model8_pretrain.py][INFO] Epoch:[0/2](610400/4588595) loss:2.698 lr:0.0000100 epoch_Time:25278.0min: [2024-01-05 10:18:24,580][model8_pretrain.py][INFO] Epoch:[0/2](610400/4588595) loss:2.944 lr:0.0000100 epoch_Time:25278.0min: [2024-01-05 10:19:01,528][model8_pretrain.py][INFO] Epoch:[0/2](610500/4588595) loss:3.257 lr:0.0000100 epoch_Time:25277.0min: [2024-01-05 10:19:01,528][model8_pretrain.py][INFO] Epoch:[0/2](610500/4588595) loss:2.344 lr:0.0000100 epoch_Time:25277.0min: [2024-01-05 10:19:01,528][model8_pretrain.py][INFO] Epoch:[0/2](610500/4588595) loss:2.445 lr:0.0000100 epoch_Time:25277.0min: [2024-01-05 10:19:01,528][model8_pretrain.py][INFO] 
Epoch:[0/2](610500/4588595) loss:3.255 lr:0.0000100 epoch_Time:25277.0min: [2024-01-05 10:19:01,528][model8_pretrain.py][INFO] Epoch:[0/2](610500/4588595) loss:2.941 lr:0.0000100 epoch_Time:25277.0min: [2024-01-05 10:19:01,528][model8_pretrain.py][INFO] Epoch:[0/2](610500/4588595) loss:2.912 lr:0.0000100 epoch_Time:25277.0min: [2024-01-05 10:19:01,528][model8_pretrain.py][INFO] Epoch:[0/2](610500/4588595) loss:2.521 lr:0.0000100 epoch_Time:25277.0min: [2024-01-05 10:19:01,528][model8_pretrain.py][INFO] Epoch:[0/2](610500/4588595) loss:2.900 lr:0.0000100 epoch_Time:25277.0min: [2024-01-05 10:19:48,770][model8_pretrain.py][INFO] Epoch:[0/2](610600/4588595) loss:2.538 lr:0.0000100 epoch_Time:25277.0min: [2024-01-05 10:19:48,770][model8_pretrain.py][INFO] Epoch:[0/2](610600/4588595) loss:3.011 lr:0.0000100 epoch_Time:25277.0min: [2024-01-05 10:19:48,770][model8_pretrain.py][INFO] Epoch:[0/2](610600/4588595) loss:2.710 lr:0.0000100 epoch_Time:25277.0min: [2024-01-05 10:19:48,770][model8_pretrain.py][INFO] Epoch:[0/2](610600/4588595) loss:2.691 lr:0.0000100 epoch_Time:25277.0min: [2024-01-05 10:19:48,770][model8_pretrain.py][INFO] Epoch:[0/2](610600/4588595) loss:3.133 lr:0.0000100 epoch_Time:25277.0min: [2024-01-05 10:19:48,770][model8_pretrain.py][INFO] Epoch:[0/2](610600/4588595) loss:2.840 lr:0.0000100 epoch_Time:25277.0min: [2024-01-05 10:19:48,770][model8_pretrain.py][INFO] Epoch:[0/2](610600/4588595) loss:2.915 lr:0.0000100 epoch_Time:25277.0min: [2024-01-05 10:19:48,771][model8_pretrain.py][INFO] Epoch:[0/2](610600/4588595) loss:2.545 lr:0.0000100 epoch_Time:25277.0min: [2024-01-05 10:20:27,367][model8_pretrain.py][INFO] Epoch:[0/2](610700/4588595) loss:2.546 lr:0.0000100 epoch_Time:25277.0min: [2024-01-05 10:20:27,367][model8_pretrain.py][INFO] Epoch:[0/2](610700/4588595) loss:2.959 lr:0.0000100 epoch_Time:25277.0min: [2024-01-05 10:20:27,367][model8_pretrain.py][INFO] Epoch:[0/2](610700/4588595) loss:2.560 lr:0.0000100 epoch_Time:25277.0min: [2024-01-05 
10:20:27,367][model8_pretrain.py][INFO] Epoch:[0/2](610700/4588595) loss:2.633 lr:0.0000100 epoch_Time:25277.0min: [2024-01-05 10:20:27,367][model8_pretrain.py][INFO] Epoch:[0/2](610700/4588595) loss:2.845 lr:0.0000100 epoch_Time:25277.0min: [2024-01-05 10:20:27,367][model8_pretrain.py][INFO] Epoch:[0/2](610700/4588595) loss:2.781 lr:0.0000100 epoch_Time:25277.0min: [2024-01-05 10:20:27,367][model8_pretrain.py][INFO] Epoch:[0/2](610700/4588595) loss:2.642 lr:0.0000100 epoch_Time:25277.0min: [2024-01-05 10:20:27,367][model8_pretrain.py][INFO] Epoch:[0/2](610700/4588595) loss:3.235 lr:0.0000100 epoch_Time:25277.0min: [2024-01-05 10:21:04,331][model8_pretrain.py][INFO] Epoch:[0/2](610800/4588595) loss:2.720 lr:0.0000100 epoch_Time:25276.0min: [2024-01-05 10:21:04,331][model8_pretrain.py][INFO] Epoch:[0/2](610800/4588595) loss:2.432 lr:0.0000100 epoch_Time:25276.0min: [2024-01-05 10:21:04,331][model8_pretrain.py][INFO] Epoch:[0/2](610800/4588595) loss:2.617 lr:0.0000100 epoch_Time:25276.0min: [2024-01-05 10:21:04,331][model8_pretrain.py][INFO] Epoch:[0/2](610800/4588595) loss:2.928 lr:0.0000100 epoch_Time:25276.0min: [2024-01-05 10:21:04,331][model8_pretrain.py][INFO] Epoch:[0/2](610800/4588595) loss:2.863 lr:0.0000100 epoch_Time:25276.0min: [2024-01-05 10:21:04,331][model8_pretrain.py][INFO] Epoch:[0/2](610800/4588595) loss:2.677 lr:0.0000100 epoch_Time:25276.0min: [2024-01-05 10:21:04,331][model8_pretrain.py][INFO] Epoch:[0/2](610800/4588595) loss:2.938 lr:0.0000100 epoch_Time:25276.0min: [2024-01-05 10:21:04,331][model8_pretrain.py][INFO] Epoch:[0/2](610800/4588595) loss:3.233 lr:0.0000100 epoch_Time:25276.0min: [2024-01-05 10:21:41,282][model8_pretrain.py][INFO] Epoch:[0/2](610900/4588595) loss:2.848 lr:0.0000100 epoch_Time:25276.0min: [2024-01-05 10:21:41,282][model8_pretrain.py][INFO] Epoch:[0/2](610900/4588595) loss:2.740 lr:0.0000100 epoch_Time:25276.0min: [2024-01-05 10:21:41,282][model8_pretrain.py][INFO] Epoch:[0/2](610900/4588595) loss:3.047 lr:0.0000100 
epoch_Time:25276.0min: [2024-01-05 10:21:41,282][model8_pretrain.py][INFO] Epoch:[0/2](610900/4588595) loss:2.878 lr:0.0000100 epoch_Time:25276.0min: [2024-01-05 10:21:41,282][model8_pretrain.py][INFO] Epoch:[0/2](610900/4588595) loss:3.193 lr:0.0000100 epoch_Time:25276.0min: [2024-01-05 10:21:41,282][model8_pretrain.py][INFO] Epoch:[0/2](610900/4588595) loss:2.658 lr:0.0000100 epoch_Time:25276.0min: [2024-01-05 10:21:41,282][model8_pretrain.py][INFO] Epoch:[0/2](610900/4588595) loss:2.591 lr:0.0000100 epoch_Time:25276.0min: [2024-01-05 10:21:41,282][model8_pretrain.py][INFO] Epoch:[0/2](610900/4588595) loss:2.983 lr:0.0000100 epoch_Time:25276.0min: [2024-01-05 10:22:18,241][model8_pretrain.py][INFO] Epoch:[0/2](611000/4588595) loss:3.113 lr:0.0000100 epoch_Time:25275.0min: [2024-01-05 10:22:18,241][model8_pretrain.py][INFO] Epoch:[0/2](611000/4588595) loss:2.836 lr:0.0000100 epoch_Time:25275.0min: [2024-01-05 10:22:18,241][model8_pretrain.py][INFO] Epoch:[0/2](611000/4588595) loss:2.623 lr:0.0000100 epoch_Time:25275.0min: [2024-01-05 10:22:18,241][model8_pretrain.py][INFO] Epoch:[0/2](611000/4588595) loss:2.323 lr:0.0000100 epoch_Time:25275.0min: [2024-01-05 10:22:18,241][model8_pretrain.py][INFO] Epoch:[0/2](611000/4588595) loss:2.795 lr:0.0000100 epoch_Time:25275.0min: [2024-01-05 10:22:18,241][model8_pretrain.py][INFO] Epoch:[0/2](611000/4588595) loss:2.898 lr:0.0000100 epoch_Time:25275.0min: [2024-01-05 10:22:18,242][model8_pretrain.py][INFO] Epoch:[0/2](611000/4588595) loss:3.404 lr:0.0000100 epoch_Time:25275.0min: [2024-01-05 10:22:18,242][model8_pretrain.py][INFO] Epoch:[0/2](611000/4588595) loss:2.948 lr:0.0000100 epoch_Time:25275.0min: [2024-01-05 10:22:55,191][model8_pretrain.py][INFO] Epoch:[0/2](611100/4588595) loss:3.500 lr:0.0000100 epoch_Time:25274.0min: [2024-01-05 10:22:55,191][model8_pretrain.py][INFO] Epoch:[0/2](611100/4588595) loss:2.463 lr:0.0000100 epoch_Time:25274.0min: [2024-01-05 10:22:55,191][model8_pretrain.py][INFO] 
Epoch:[0/2](611100/4588595) loss:3.222 lr:0.0000100 epoch_Time:25274.0min: [2024-01-05 10:22:55,191][model8_pretrain.py][INFO] Epoch:[0/2](611100/4588595) loss:2.667 lr:0.0000100 epoch_Time:25274.0min: [2024-01-05 10:22:55,191][model8_pretrain.py][INFO] Epoch:[0/2](611100/4588595) loss:2.933 lr:0.0000100 epoch_Time:25274.0min: [2024-01-05 10:22:55,191][model8_pretrain.py][INFO] Epoch:[0/2](611100/4588595) loss:2.588 lr:0.0000100 epoch_Time:25274.0min: [2024-01-05 10:22:55,191][model8_pretrain.py][INFO] Epoch:[0/2](611100/4588595) loss:3.189 lr:0.0000100 epoch_Time:25274.0min: [2024-01-05 10:22:55,191][model8_pretrain.py][INFO] Epoch:[0/2](611100/4588595) loss:3.102 lr:0.0000100 epoch_Time:25274.0min: [2024-01-05 10:23:32,147][model8_pretrain.py][INFO] Epoch:[0/2](611200/4588595) loss:3.282 lr:0.0000100 epoch_Time:25274.0min: [2024-01-05 10:23:32,147][model8_pretrain.py][INFO] Epoch:[0/2](611200/4588595) loss:3.016 lr:0.0000100 epoch_Time:25274.0min: [2024-01-05 10:23:32,147][model8_pretrain.py][INFO] Epoch:[0/2](611200/4588595) loss:2.841 lr:0.0000100 epoch_Time:25274.0min: [2024-01-05 10:23:32,147][model8_pretrain.py][INFO] Epoch:[0/2](611200/4588595) loss:2.973 lr:0.0000100 epoch_Time:25274.0min: [2024-01-05 10:23:32,147][model8_pretrain.py][INFO] Epoch:[0/2](611200/4588595) loss:3.063 lr:0.0000100 epoch_Time:25274.0min: [2024-01-05 10:23:32,147][model8_pretrain.py][INFO] Epoch:[0/2](611200/4588595) loss:2.709 lr:0.0000100 epoch_Time:25274.0min: [2024-01-05 10:23:32,147][model8_pretrain.py][INFO] Epoch:[0/2](611200/4588595) loss:3.018 lr:0.0000100 epoch_Time:25274.0min: [2024-01-05 10:23:32,148][model8_pretrain.py][INFO] Epoch:[0/2](611200/4588595) loss:3.392 lr:0.0000100 epoch_Time:25274.0min: [2024-01-05 10:24:09,095][model8_pretrain.py][INFO] Epoch:[0/2](611300/4588595) loss:2.665 lr:0.0000100 epoch_Time:25273.0min: [2024-01-05 10:24:09,095][model8_pretrain.py][INFO] Epoch:[0/2](611300/4588595) loss:2.713 lr:0.0000100 epoch_Time:25273.0min: [2024-01-05 
10:24:09,095][model8_pretrain.py][INFO] Epoch:[0/2](611300/4588595) loss:3.050 lr:0.0000100 epoch_Time:25273.0min: [2024-01-05 10:24:09,095][model8_pretrain.py][INFO] Epoch:[0/2](611300/4588595) loss:2.751 lr:0.0000100 epoch_Time:25273.0min: [2024-01-05 10:24:09,095][model8_pretrain.py][INFO] Epoch:[0/2](611300/4588595) loss:3.278 lr:0.0000100 epoch_Time:25273.0min: [2024-01-05 10:24:09,095][model8_pretrain.py][INFO] Epoch:[0/2](611300/4588595) loss:3.035 lr:0.0000100 epoch_Time:25273.0min: [2024-01-05 10:24:09,095][model8_pretrain.py][INFO] Epoch:[0/2](611300/4588595) loss:2.446 lr:0.0000100 epoch_Time:25273.0min: [2024-01-05 10:24:09,095][model8_pretrain.py][INFO] Epoch:[0/2](611300/4588595) loss:2.695 lr:0.0000100 epoch_Time:25273.0min: [2024-01-05 10:24:56,216][model8_pretrain.py][INFO] Epoch:[0/2](611400/4588595) loss:2.385 lr:0.0000100 epoch_Time:25273.0min: [2024-01-05 10:24:56,216][model8_pretrain.py][INFO] Epoch:[0/2](611400/4588595) loss:3.139 lr:0.0000100 epoch_Time:25273.0min: [2024-01-05 10:24:56,216][model8_pretrain.py][INFO] Epoch:[0/2](611400/4588595) loss:3.306 lr:0.0000100 epoch_Time:25273.0min: [2024-01-05 10:24:56,217][model8_pretrain.py][INFO] Epoch:[0/2](611400/4588595) loss:2.813 lr:0.0000100 epoch_Time:25273.0min: [2024-01-05 10:24:56,216][model8_pretrain.py][INFO] Epoch:[0/2](611400/4588595) loss:2.604 lr:0.0000100 epoch_Time:25273.0min: [2024-01-05 10:24:56,217][model8_pretrain.py][INFO] Epoch:[0/2](611400/4588595) loss:3.032 lr:0.0000100 epoch_Time:25273.0min: [2024-01-05 10:24:56,217][model8_pretrain.py][INFO] Epoch:[0/2](611400/4588595) loss:3.434 lr:0.0000100 epoch_Time:25273.0min: [2024-01-05 10:24:56,217][model8_pretrain.py][INFO] Epoch:[0/2](611400/4588595) loss:1.987 lr:0.0000100 epoch_Time:25273.0min: [2024-01-05 10:25:34,820][model8_pretrain.py][INFO] Epoch:[0/2](611500/4588595) loss:3.143 lr:0.0000100 epoch_Time:25273.0min: [2024-01-05 10:25:34,820][model8_pretrain.py][INFO] Epoch:[0/2](611500/4588595) loss:3.187 lr:0.0000100 
epoch_Time:25273.0min: [2024-01-05 10:25:34,820][model8_pretrain.py][INFO] Epoch:[0/2](611500/4588595) loss:2.704 lr:0.0000100 epoch_Time:25273.0min: [2024-01-05 10:25:34,820][model8_pretrain.py][INFO] Epoch:[0/2](611500/4588595) loss:3.040 lr:0.0000100 epoch_Time:25273.0min: [2024-01-05 10:25:34,821][model8_pretrain.py][INFO] Epoch:[0/2](611500/4588595) loss:2.546 lr:0.0000100 epoch_Time:25273.0min: [2024-01-05 10:25:34,821][model8_pretrain.py][INFO] Epoch:[0/2](611500/4588595) loss:2.617 lr:0.0000100 epoch_Time:25273.0min: [2024-01-05 10:25:34,821][model8_pretrain.py][INFO] Epoch:[0/2](611500/4588595) loss:2.630 lr:0.0000100 epoch_Time:25273.0min: [2024-01-05 10:25:34,821][model8_pretrain.py][INFO] Epoch:[0/2](611500/4588595) loss:3.020 lr:0.0000100 epoch_Time:25273.0min: [2024-01-05 10:26:11,774][model8_pretrain.py][INFO] Epoch:[0/2](611600/4588595) loss:2.342 lr:0.0000100 epoch_Time:25272.0min: [2024-01-05 10:26:11,774][model8_pretrain.py][INFO] Epoch:[0/2](611600/4588595) loss:3.057 lr:0.0000100 epoch_Time:25272.0min: [2024-01-05 10:26:11,774][model8_pretrain.py][INFO] Epoch:[0/2](611600/4588595) loss:3.052 lr:0.0000100 epoch_Time:25272.0min: [2024-01-05 10:26:11,774][model8_pretrain.py][INFO] Epoch:[0/2](611600/4588595) loss:2.578 lr:0.0000100 epoch_Time:25272.0min: [2024-01-05 10:26:11,774][model8_pretrain.py][INFO] Epoch:[0/2](611600/4588595) loss:3.186 lr:0.0000100 epoch_Time:25272.0min: [2024-01-05 10:26:11,774][model8_pretrain.py][INFO] Epoch:[0/2](611600/4588595) loss:2.732 lr:0.0000100 epoch_Time:25272.0min: [2024-01-05 10:26:11,774][model8_pretrain.py][INFO] Epoch:[0/2](611600/4588595) loss:2.721 lr:0.0000100 epoch_Time:25272.0min: [2024-01-05 10:26:11,774][model8_pretrain.py][INFO] Epoch:[0/2](611600/4588595) loss:2.802 lr:0.0000100 epoch_Time:25272.0min: [2024-01-05 10:26:48,728][model8_pretrain.py][INFO] Epoch:[0/2](611700/4588595) loss:2.885 lr:0.0000100 epoch_Time:25270.0min: [2024-01-05 10:26:48,728][model8_pretrain.py][INFO] 
Epoch:[0/2](611700/4588595) loss:2.044 lr:0.0000100 epoch_Time:25270.0min: [2024-01-05 10:26:48,728][model8_pretrain.py][INFO] Epoch:[0/2](611700/4588595) loss:3.121 lr:0.0000100 epoch_Time:25270.0min: [2024-01-05 10:26:48,728][model8_pretrain.py][INFO] Epoch:[0/2](611700/4588595) loss:2.793 lr:0.0000100 epoch_Time:25270.0min: [2024-01-05 10:26:48,728][model8_pretrain.py][INFO] Epoch:[0/2](611700/4588595) loss:2.349 lr:0.0000100 epoch_Time:25270.0min: [2024-01-05 10:26:48,728][model8_pretrain.py][INFO] Epoch:[0/2](611700/4588595) loss:2.981 lr:0.0000100 epoch_Time:25270.0min: [2024-01-05 10:26:48,728][model8_pretrain.py][INFO] Epoch:[0/2](611700/4588595) loss:3.413 lr:0.0000100 epoch_Time:25270.0min: [2024-01-05 10:26:48,728][model8_pretrain.py][INFO] Epoch:[0/2](611700/4588595) loss:2.764 lr:0.0000100 epoch_Time:25270.0min: [2024-01-05 10:27:25,665][model8_pretrain.py][INFO] Epoch:[0/2](611800/4588595) loss:2.692 lr:0.0000100 epoch_Time:25270.0min: [2024-01-05 10:27:25,665][model8_pretrain.py][INFO] Epoch:[0/2](611800/4588595) loss:3.115 lr:0.0000100 epoch_Time:25270.0min: [2024-01-05 10:27:25,665][model8_pretrain.py][INFO] Epoch:[0/2](611800/4588595) loss:2.646 lr:0.0000100 epoch_Time:25270.0min: [2024-01-05 10:27:25,665][model8_pretrain.py][INFO] Epoch:[0/2](611800/4588595) loss:2.401 lr:0.0000100 epoch_Time:25270.0min: [2024-01-05 10:27:25,665][model8_pretrain.py][INFO] Epoch:[0/2](611800/4588595) loss:2.617 lr:0.0000100 epoch_Time:25270.0min: [2024-01-05 10:27:25,666][model8_pretrain.py][INFO] Epoch:[0/2](611800/4588595) loss:2.643 lr:0.0000100 epoch_Time:25270.0min: [2024-01-05 10:27:25,666][model8_pretrain.py][INFO] Epoch:[0/2](611800/4588595) loss:3.107 lr:0.0000100 epoch_Time:25270.0min: [2024-01-05 10:27:25,666][model8_pretrain.py][INFO] Epoch:[0/2](611800/4588595) loss:2.341 lr:0.0000100 epoch_Time:25270.0min: [2024-01-05 10:28:02,615][model8_pretrain.py][INFO] Epoch:[0/2](611900/4588595) loss:2.386 lr:0.0000100 epoch_Time:25269.0min: [2024-01-05 
10:28:02,615][model8_pretrain.py][INFO] Epoch:[0/2](611900/4588595) loss:2.862 lr:0.0000100 epoch_Time:25269.0min: [2024-01-05 10:28:02,615][model8_pretrain.py][INFO] Epoch:[0/2](611900/4588595) loss:3.085 lr:0.0000100 epoch_Time:25269.0min: [2024-01-05 10:28:02,615][model8_pretrain.py][INFO] Epoch:[0/2](611900/4588595) loss:3.146 lr:0.0000100 epoch_Time:25269.0min: [2024-01-05 10:28:02,615][model8_pretrain.py][INFO] Epoch:[0/2](611900/4588595) loss:2.973 lr:0.0000100 epoch_Time:25269.0min: [2024-01-05 10:28:02,615][model8_pretrain.py][INFO] Epoch:[0/2](611900/4588595) loss:2.959 lr:0.0000100 epoch_Time:25269.0min: [2024-01-05 10:28:02,615][model8_pretrain.py][INFO] Epoch:[0/2](611900/4588595) loss:2.296 lr:0.0000100 epoch_Time:25269.0min: [2024-01-05 10:28:02,615][model8_pretrain.py][INFO] Epoch:[0/2](611900/4588595) loss:2.980 lr:0.0000100 epoch_Time:25269.0min: [2024-01-05 10:28:39,565][model8_pretrain.py][INFO] Epoch:[0/2](612000/4588595) loss:2.822 lr:0.0000100 epoch_Time:25269.0min: [2024-01-05 10:28:39,565][model8_pretrain.py][INFO] Epoch:[0/2](612000/4588595) loss:2.373 lr:0.0000100 epoch_Time:25269.0min: [2024-01-05 10:28:39,565][model8_pretrain.py][INFO] Epoch:[0/2](612000/4588595) loss:2.509 lr:0.0000100 epoch_Time:25269.0min: [2024-01-05 10:28:39,565][model8_pretrain.py][INFO] Epoch:[0/2](612000/4588595) loss:2.848 lr:0.0000100 epoch_Time:25269.0min: [2024-01-05 10:28:39,565][model8_pretrain.py][INFO] Epoch:[0/2](612000/4588595) loss:2.725 lr:0.0000100 epoch_Time:25269.0min: [2024-01-05 10:28:39,565][model8_pretrain.py][INFO] Epoch:[0/2](612000/4588595) loss:2.931 lr:0.0000100 epoch_Time:25269.0min: [2024-01-05 10:28:39,565][model8_pretrain.py][INFO] Epoch:[0/2](612000/4588595) loss:3.055 lr:0.0000100 epoch_Time:25269.0min: [2024-01-05 10:28:39,565][model8_pretrain.py][INFO] Epoch:[0/2](612000/4588595) loss:2.723 lr:0.0000100 epoch_Time:25269.0min: [2024-01-05 10:29:16,511][model8_pretrain.py][INFO] Epoch:[0/2](612100/4588595) loss:2.554 lr:0.0000100 
epoch_Time:25268.0min: [2024-01-05 10:29:16,511][model8_pretrain.py][INFO] Epoch:[0/2](612100/4588595) loss:2.764 lr:0.0000100 epoch_Time:25268.0min: [2024-01-05 10:29:16,511][model8_pretrain.py][INFO] Epoch:[0/2](612100/4588595) loss:2.656 lr:0.0000100 epoch_Time:25268.0min: [2024-01-05 10:29:16,511][model8_pretrain.py][INFO] Epoch:[0/2](612100/4588595) loss:2.598 lr:0.0000100 epoch_Time:25268.0min: [2024-01-05 10:29:16,511][model8_pretrain.py][INFO] Epoch:[0/2](612100/4588595) loss:2.657 lr:0.0000100 epoch_Time:25268.0min: [2024-01-05 10:29:16,511][model8_pretrain.py][INFO] Epoch:[0/2](612100/4588595) loss:2.678 lr:0.0000100 epoch_Time:25268.0min: [2024-01-05 10:29:16,511][model8_pretrain.py][INFO] Epoch:[0/2](612100/4588595) loss:2.875 lr:0.0000100 epoch_Time:25268.0min: [2024-01-05 10:29:16,511][model8_pretrain.py][INFO] Epoch:[0/2](612100/4588595) loss:2.841 lr:0.0000100 epoch_Time:25268.0min: [2024-01-05 10:30:01,988][model8_pretrain.py][INFO] Epoch:[0/2](612200/4588595) loss:3.060 lr:0.0000100 epoch_Time:25268.0min: [2024-01-05 10:30:01,988][model8_pretrain.py][INFO] Epoch:[0/2](612200/4588595) loss:2.417 lr:0.0000100 epoch_Time:25268.0min: [2024-01-05 10:30:01,988][model8_pretrain.py][INFO] Epoch:[0/2](612200/4588595) loss:2.904 lr:0.0000100 epoch_Time:25268.0min: [2024-01-05 10:30:01,988][model8_pretrain.py][INFO] Epoch:[0/2](612200/4588595) loss:3.031 lr:0.0000100 epoch_Time:25268.0min: [2024-01-05 10:30:01,988][model8_pretrain.py][INFO] Epoch:[0/2](612200/4588595) loss:2.494 lr:0.0000100 epoch_Time:25268.0min: [2024-01-05 10:30:01,988][model8_pretrain.py][INFO] Epoch:[0/2](612200/4588595) loss:2.983 lr:0.0000100 epoch_Time:25268.0min: [2024-01-05 10:30:01,988][model8_pretrain.py][INFO] Epoch:[0/2](612200/4588595) loss:2.868 lr:0.0000100 epoch_Time:25268.0min: [2024-01-05 10:30:01,989][model8_pretrain.py][INFO] Epoch:[0/2](612200/4588595) loss:3.349 lr:0.0000100 epoch_Time:25268.0min: [2024-01-05 10:30:42,472][model8_pretrain.py][INFO] 
Epoch:[0/2](612300/4588595) loss:3.267 lr:0.0000100 epoch_Time:25268.0min: [2024-01-05 10:30:42,472][model8_pretrain.py][INFO] Epoch:[0/2](612300/4588595) loss:3.035 lr:0.0000100 epoch_Time:25268.0min: [2024-01-05 10:30:42,472][model8_pretrain.py][INFO] Epoch:[0/2](612300/4588595) loss:3.165 lr:0.0000100 epoch_Time:25268.0min: [2024-01-05 10:30:42,472][model8_pretrain.py][INFO] Epoch:[0/2](612300/4588595) loss:2.677 lr:0.0000100 epoch_Time:25268.0min: [2024-01-05 10:30:42,472][model8_pretrain.py][INFO] Epoch:[0/2](612300/4588595) loss:2.552 lr:0.0000100 epoch_Time:25268.0min: [2024-01-05 10:30:42,472][model8_pretrain.py][INFO] Epoch:[0/2](612300/4588595) loss:3.060 lr:0.0000100 epoch_Time:25268.0min: [2024-01-05 10:30:42,472][model8_pretrain.py][INFO] Epoch:[0/2](612300/4588595) loss:2.526 lr:0.0000100 epoch_Time:25268.0min: [2024-01-05 10:30:42,472][model8_pretrain.py][INFO] Epoch:[0/2](612300/4588595) loss:2.812 lr:0.0000100 epoch_Time:25268.0min: [2024-01-05 10:31:19,421][model8_pretrain.py][INFO] Epoch:[0/2](612400/4588595) loss:2.903 lr:0.0000100 epoch_Time:25267.0min: [2024-01-05 10:31:19,421][model8_pretrain.py][INFO] Epoch:[0/2](612400/4588595) loss:3.098 lr:0.0000100 epoch_Time:25267.0min: [2024-01-05 10:31:19,421][model8_pretrain.py][INFO] Epoch:[0/2](612400/4588595) loss:2.883 lr:0.0000100 epoch_Time:25267.0min: [2024-01-05 10:31:19,421][model8_pretrain.py][INFO] Epoch:[0/2](612400/4588595) loss:2.480 lr:0.0000100 epoch_Time:25267.0min: [2024-01-05 10:31:19,421][model8_pretrain.py][INFO] Epoch:[0/2](612400/4588595) loss:3.065 lr:0.0000100 epoch_Time:25267.0min: [2024-01-05 10:31:19,421][model8_pretrain.py][INFO] Epoch:[0/2](612400/4588595) loss:2.985 lr:0.0000100 epoch_Time:25267.0min: [2024-01-05 10:31:19,421][model8_pretrain.py][INFO] Epoch:[0/2](612400/4588595) loss:2.744 lr:0.0000100 epoch_Time:25267.0min: [2024-01-05 10:31:19,421][model8_pretrain.py][INFO] Epoch:[0/2](612400/4588595) loss:3.020 lr:0.0000100 epoch_Time:25267.0min: [2024-01-05 
10:31:56,376][model8_pretrain.py][INFO] Epoch:[0/2](612500/4588595) loss:3.071 lr:0.0000100 epoch_Time:25266.0min: [2024-01-05 10:31:56,376][model8_pretrain.py][INFO] Epoch:[0/2](612500/4588595) loss:3.315 lr:0.0000100 epoch_Time:25266.0min: [2024-01-05 10:31:56,376][model8_pretrain.py][INFO] Epoch:[0/2](612500/4588595) loss:2.628 lr:0.0000100 epoch_Time:25266.0min: [2024-01-05 10:31:56,376][model8_pretrain.py][INFO] Epoch:[0/2](612500/4588595) loss:3.079 lr:0.0000100 epoch_Time:25266.0min: [2024-01-05 10:31:56,376][model8_pretrain.py][INFO] Epoch:[0/2](612500/4588595) loss:2.270 lr:0.0000100 epoch_Time:25266.0min: [2024-01-05 10:31:56,376][model8_pretrain.py][INFO] Epoch:[0/2](612500/4588595) loss:3.056 lr:0.0000100 epoch_Time:25266.0min: [2024-01-05 10:31:56,376][model8_pretrain.py][INFO] Epoch:[0/2](612500/4588595) loss:2.459 lr:0.0000100 epoch_Time:25266.0min: [2024-01-05 10:31:56,376][model8_pretrain.py][INFO] Epoch:[0/2](612500/4588595) loss:2.761 lr:0.0000100 epoch_Time:25266.0min: [2024-01-05 10:32:33,329][model8_pretrain.py][INFO] Epoch:[0/2](612600/4588595) loss:3.000 lr:0.0000100 epoch_Time:25266.0min: [2024-01-05 10:32:33,329][model8_pretrain.py][INFO] Epoch:[0/2](612600/4588595) loss:3.184 lr:0.0000100 epoch_Time:25266.0min: [2024-01-05 10:32:33,329][model8_pretrain.py][INFO] Epoch:[0/2](612600/4588595) loss:2.746 lr:0.0000100 epoch_Time:25266.0min: [2024-01-05 10:32:33,329][model8_pretrain.py][INFO] Epoch:[0/2](612600/4588595) loss:3.126 lr:0.0000100 epoch_Time:25266.0min: [2024-01-05 10:32:33,329][model8_pretrain.py][INFO] Epoch:[0/2](612600/4588595) loss:2.390 lr:0.0000100 epoch_Time:25266.0min: [2024-01-05 10:32:33,329][model8_pretrain.py][INFO] Epoch:[0/2](612600/4588595) loss:2.854 lr:0.0000100 epoch_Time:25266.0min: [2024-01-05 10:32:33,329][model8_pretrain.py][INFO] Epoch:[0/2](612600/4588595) loss:3.031 lr:0.0000100 epoch_Time:25266.0min: [2024-01-05 10:32:33,329][model8_pretrain.py][INFO] Epoch:[0/2](612600/4588595) loss:2.229 lr:0.0000100 
epoch_Time:25266.0min: [2024-01-05 10:33:10,278][model8_pretrain.py][INFO] Epoch:[0/2](612700/4588595) loss:3.106 lr:0.0000100 epoch_Time:25265.0min: [2024-01-05 10:33:10,278][model8_pretrain.py][INFO] Epoch:[0/2](612700/4588595) loss:2.428 lr:0.0000100 epoch_Time:25264.0min: [2024-01-05 10:33:10,278][model8_pretrain.py][INFO] Epoch:[0/2](612700/4588595) loss:2.114 lr:0.0000100 epoch_Time:25264.0min: [2024-01-05 10:33:10,278][model8_pretrain.py][INFO] Epoch:[0/2](612700/4588595) loss:2.942 lr:0.0000100 epoch_Time:25265.0min: [2024-01-05 10:33:10,278][model8_pretrain.py][INFO] Epoch:[0/2](612700/4588595) loss:2.719 lr:0.0000100 epoch_Time:25264.0min: [2024-01-05 10:33:10,278][model8_pretrain.py][INFO] Epoch:[0/2](612700/4588595) loss:2.626 lr:0.0000100 epoch_Time:25264.0min: [2024-01-05 10:33:10,278][model8_pretrain.py][INFO] Epoch:[0/2](612700/4588595) loss:2.636 lr:0.0000100 epoch_Time:25264.0min: [2024-01-05 10:33:10,278][model8_pretrain.py][INFO] Epoch:[0/2](612700/4588595) loss:2.759 lr:0.0000100 epoch_Time:25264.0min: [2024-01-05 10:33:47,219][model8_pretrain.py][INFO] Epoch:[0/2](612800/4588595) loss:2.954 lr:0.0000100 epoch_Time:25264.0min: [2024-01-05 10:33:47,219][model8_pretrain.py][INFO] Epoch:[0/2](612800/4588595) loss:3.172 lr:0.0000100 epoch_Time:25264.0min: [2024-01-05 10:33:47,219][model8_pretrain.py][INFO] Epoch:[0/2](612800/4588595) loss:2.717 lr:0.0000100 epoch_Time:25264.0min: [2024-01-05 10:33:47,219][model8_pretrain.py][INFO] Epoch:[0/2](612800/4588595) loss:3.033 lr:0.0000100 epoch_Time:25264.0min: [2024-01-05 10:33:47,219][model8_pretrain.py][INFO] Epoch:[0/2](612800/4588595) loss:2.921 lr:0.0000100 epoch_Time:25264.0min: [2024-01-05 10:33:47,219][model8_pretrain.py][INFO] Epoch:[0/2](612800/4588595) loss:2.994 lr:0.0000100 epoch_Time:25264.0min: [2024-01-05 10:33:47,219][model8_pretrain.py][INFO] Epoch:[0/2](612800/4588595) loss:3.100 lr:0.0000100 epoch_Time:25264.0min: [2024-01-05 10:33:47,219][model8_pretrain.py][INFO] 
Epoch:[0/2](612800/4588595) loss:2.465 lr:0.0000100 epoch_Time:25264.0min: [2024-01-05 10:34:24,172][model8_pretrain.py][INFO] Epoch:[0/2](612900/4588595) loss:3.156 lr:0.0000100 epoch_Time:25263.0min: [2024-01-05 10:34:24,172][model8_pretrain.py][INFO] Epoch:[0/2](612900/4588595) loss:2.466 lr:0.0000100 epoch_Time:25263.0min: [2024-01-05 10:34:24,172][model8_pretrain.py][INFO] Epoch:[0/2](612900/4588595) loss:2.883 lr:0.0000100 epoch_Time:25263.0min: [2024-01-05 10:34:24,172][model8_pretrain.py][INFO] Epoch:[0/2](612900/4588595) loss:2.198 lr:0.0000100 epoch_Time:25263.0min: [2024-01-05 10:34:24,172][model8_pretrain.py][INFO] Epoch:[0/2](612900/4588595) loss:3.030 lr:0.0000100 epoch_Time:25263.0min: [2024-01-05 10:34:24,172][model8_pretrain.py][INFO] Epoch:[0/2](612900/4588595) loss:2.957 lr:0.0000100 epoch_Time:25263.0min: [2024-01-05 10:34:24,173][model8_pretrain.py][INFO] Epoch:[0/2](612900/4588595) loss:2.987 lr:0.0000100 epoch_Time:25263.0min: [2024-01-05 10:34:24,173][model8_pretrain.py][INFO] Epoch:[0/2](612900/4588595) loss:2.771 lr:0.0000100 epoch_Time:25263.0min: [2024-01-05 10:35:09,715][model8_pretrain.py][INFO] Epoch:[0/2](613000/4588595) loss:2.550 lr:0.0000100 epoch_Time:25263.0min: [2024-01-05 10:35:09,715][model8_pretrain.py][INFO] Epoch:[0/2](613000/4588595) loss:2.960 lr:0.0000100 epoch_Time:25263.0min: [2024-01-05 10:35:09,715][model8_pretrain.py][INFO] Epoch:[0/2](613000/4588595) loss:2.747 lr:0.0000100 epoch_Time:25263.0min: [2024-01-05 10:35:09,715][model8_pretrain.py][INFO] Epoch:[0/2](613000/4588595) loss:2.918 lr:0.0000100 epoch_Time:25263.0min: [2024-01-05 10:35:09,715][model8_pretrain.py][INFO] Epoch:[0/2](613000/4588595) loss:2.661 lr:0.0000100 epoch_Time:25263.0min: [2024-01-05 10:35:09,715][model8_pretrain.py][INFO] Epoch:[0/2](613000/4588595) loss:2.333 lr:0.0000100 epoch_Time:25263.0min: [2024-01-05 10:35:09,715][model8_pretrain.py][INFO] Epoch:[0/2](613000/4588595) loss:3.115 lr:0.0000100 epoch_Time:25263.0min: [2024-01-05 
10:35:09,715][model8_pretrain.py][INFO] Epoch:[0/2](613000/4588595) loss:2.681 lr:0.0000100 epoch_Time:25263.0min: [2024-01-05 10:35:50,127][model8_pretrain.py][INFO] Epoch:[0/2](613100/4588595) loss:3.131 lr:0.0000100 epoch_Time:25262.0min: [2024-01-05 10:35:50,127][model8_pretrain.py][INFO] Epoch:[0/2](613100/4588595) loss:2.639 lr:0.0000100 epoch_Time:25262.0min: [2024-01-05 10:35:50,127][model8_pretrain.py][INFO] Epoch:[0/2](613100/4588595) loss:2.815 lr:0.0000100 epoch_Time:25262.0min: [2024-01-05 10:35:50,127][model8_pretrain.py][INFO] Epoch:[0/2](613100/4588595) loss:2.815 lr:0.0000100 epoch_Time:25262.0min: [2024-01-05 10:35:50,127][model8_pretrain.py][INFO] Epoch:[0/2](613100/4588595) loss:2.275 lr:0.0000100 epoch_Time:25262.0min: [2024-01-05 10:35:50,127][model8_pretrain.py][INFO] Epoch:[0/2](613100/4588595) loss:2.979 lr:0.0000100 epoch_Time:25262.0min: [2024-01-05 10:35:50,127][model8_pretrain.py][INFO] Epoch:[0/2](613100/4588595) loss:3.036 lr:0.0000100 epoch_Time:25262.0min: [2024-01-05 10:35:50,128][model8_pretrain.py][INFO] Epoch:[0/2](613100/4588595) loss:2.828 lr:0.0000100 epoch_Time:25262.0min: [2024-01-05 10:36:27,060][model8_pretrain.py][INFO] Epoch:[0/2](613200/4588595) loss:2.245 lr:0.0000100 epoch_Time:25262.0min: [2024-01-05 10:36:27,060][model8_pretrain.py][INFO] Epoch:[0/2](613200/4588595) loss:2.733 lr:0.0000100 epoch_Time:25262.0min: [2024-01-05 10:36:27,060][model8_pretrain.py][INFO] Epoch:[0/2](613200/4588595) loss:3.161 lr:0.0000100 epoch_Time:25262.0min: [2024-01-05 10:36:27,060][model8_pretrain.py][INFO] Epoch:[0/2](613200/4588595) loss:3.551 lr:0.0000100 epoch_Time:25262.0min: [2024-01-05 10:36:27,060][model8_pretrain.py][INFO] Epoch:[0/2](613200/4588595) loss:3.034 lr:0.0000100 epoch_Time:25262.0min: [2024-01-05 10:36:27,060][model8_pretrain.py][INFO] Epoch:[0/2](613200/4588595) loss:2.770 lr:0.0000100 epoch_Time:25262.0min: [2024-01-05 10:36:27,060][model8_pretrain.py][INFO] Epoch:[0/2](613200/4588595) loss:3.202 lr:0.0000100 
epoch_Time:25262.0min: [2024-01-05 10:36:27,060][model8_pretrain.py][INFO] Epoch:[0/2](613200/4588595) loss:2.550 lr:0.0000100 epoch_Time:25262.0min: [2024-01-05 10:37:04,006][model8_pretrain.py][INFO] Epoch:[0/2](613300/4588595) loss:3.053 lr:0.0000100 epoch_Time:25261.0min: [2024-01-05 10:37:04,007][model8_pretrain.py][INFO] Epoch:[0/2](613300/4588595) loss:3.023 lr:0.0000100 epoch_Time:25261.0min: [2024-01-05 10:37:04,007][model8_pretrain.py][INFO] Epoch:[0/2](613300/4588595) loss:3.062 lr:0.0000100 epoch_Time:25261.0min: [2024-01-05 10:37:04,007][model8_pretrain.py][INFO] Epoch:[0/2](613300/4588595) loss:2.586 lr:0.0000100 epoch_Time:25261.0min: [2024-01-05 10:37:04,007][model8_pretrain.py][INFO] Epoch:[0/2](613300/4588595) loss:2.906 lr:0.0000100 epoch_Time:25261.0min: [2024-01-05 10:37:04,007][model8_pretrain.py][INFO] Epoch:[0/2](613300/4588595) loss:3.089 lr:0.0000100 epoch_Time:25261.0min: [2024-01-05 10:37:04,007][model8_pretrain.py][INFO] Epoch:[0/2](613300/4588595) loss:2.894 lr:0.0000100 epoch_Time:25261.0min: [2024-01-05 10:37:04,007][model8_pretrain.py][INFO] Epoch:[0/2](613300/4588595) loss:3.360 lr:0.0000100 epoch_Time:25261.0min: [2024-01-05 10:37:40,952][model8_pretrain.py][INFO] Epoch:[0/2](613400/4588595) loss:3.006 lr:0.0000100 epoch_Time:25261.0min: [2024-01-05 10:37:40,952][model8_pretrain.py][INFO] Epoch:[0/2](613400/4588595) loss:3.010 lr:0.0000100 epoch_Time:25261.0min: [2024-01-05 10:37:40,952][model8_pretrain.py][INFO] Epoch:[0/2](613400/4588595) loss:3.317 lr:0.0000100 epoch_Time:25261.0min: [2024-01-05 10:37:40,952][model8_pretrain.py][INFO] Epoch:[0/2](613400/4588595) loss:2.865 lr:0.0000100 epoch_Time:25261.0min: [2024-01-05 10:37:40,952][model8_pretrain.py][INFO] Epoch:[0/2](613400/4588595) loss:2.542 lr:0.0000100 epoch_Time:25261.0min: [2024-01-05 10:37:40,952][model8_pretrain.py][INFO] Epoch:[0/2](613400/4588595) loss:3.292 lr:0.0000100 epoch_Time:25261.0min: [2024-01-05 10:37:40,952][model8_pretrain.py][INFO] 
Epoch:[0/2](613400/4588595) loss:2.974 lr:0.0000100 epoch_Time:25261.0min: [2024-01-05 10:37:40,952][model8_pretrain.py][INFO] Epoch:[0/2](613400/4588595) loss:2.839 lr:0.0000100 epoch_Time:25261.0min: [2024-01-05 10:38:17,891][model8_pretrain.py][INFO] Epoch:[0/2](613500/4588595) loss:3.039 lr:0.0000100 epoch_Time:25260.0min: [2024-01-05 10:38:17,891][model8_pretrain.py][INFO] Epoch:[0/2](613500/4588595) loss:3.108 lr:0.0000100 epoch_Time:25260.0min: [2024-01-05 10:38:17,892][model8_pretrain.py][INFO] Epoch:[0/2](613500/4588595) loss:2.793 lr:0.0000100 epoch_Time:25260.0min: [2024-01-05 10:38:17,892][model8_pretrain.py][INFO] Epoch:[0/2](613500/4588595) loss:2.902 lr:0.0000100 epoch_Time:25260.0min: [2024-01-05 10:38:17,892][model8_pretrain.py][INFO] Epoch:[0/2](613500/4588595) loss:2.510 lr:0.0000100 epoch_Time:25260.0min: [2024-01-05 10:38:17,892][model8_pretrain.py][INFO] Epoch:[0/2](613500/4588595) loss:2.801 lr:0.0000100 epoch_Time:25260.0min: [2024-01-05 10:38:17,892][model8_pretrain.py][INFO] Epoch:[0/2](613500/4588595) loss:2.964 lr:0.0000100 epoch_Time:25260.0min: [2024-01-05 10:38:17,892][model8_pretrain.py][INFO] Epoch:[0/2](613500/4588595) loss:2.894 lr:0.0000100 epoch_Time:25260.0min: [2024-01-05 10:38:54,845][model8_pretrain.py][INFO] Epoch:[0/2](613600/4588595) loss:2.869 lr:0.0000100 epoch_Time:25259.0min: [2024-01-05 10:38:54,845][model8_pretrain.py][INFO] Epoch:[0/2](613600/4588595) loss:2.997 lr:0.0000100 epoch_Time:25259.0min: [2024-01-05 10:38:54,845][model8_pretrain.py][INFO] Epoch:[0/2](613600/4588595) loss:3.204 lr:0.0000100 epoch_Time:25259.0min: [2024-01-05 10:38:54,845][model8_pretrain.py][INFO] Epoch:[0/2](613600/4588595) loss:2.511 lr:0.0000100 epoch_Time:25259.0min: [2024-01-05 10:38:54,845][model8_pretrain.py][INFO] Epoch:[0/2](613600/4588595) loss:2.665 lr:0.0000100 epoch_Time:25259.0min: [2024-01-05 10:38:54,845][model8_pretrain.py][INFO] Epoch:[0/2](613600/4588595) loss:2.907 lr:0.0000100 epoch_Time:25259.0min: [2024-01-05 
10:38:54,845][model8_pretrain.py][INFO] Epoch:[0/2](613600/4588595) loss:3.251 lr:0.0000100 epoch_Time:25259.0min: [2024-01-05 10:38:54,845][model8_pretrain.py][INFO] Epoch:[0/2](613600/4588595) loss:2.562 lr:0.0000100 epoch_Time:25259.0min: [2024-01-05 10:39:31,794][model8_pretrain.py][INFO] Epoch:[0/2](613700/4588595) loss:2.883 lr:0.0000100 epoch_Time:25259.0min: [2024-01-05 10:39:31,794][model8_pretrain.py][INFO] Epoch:[0/2](613700/4588595) loss:2.679 lr:0.0000100 epoch_Time:25259.0min: [2024-01-05 10:39:31,794][model8_pretrain.py][INFO] Epoch:[0/2](613700/4588595) loss:2.466 lr:0.0000100 epoch_Time:25259.0min: [2024-01-05 10:39:31,794][model8_pretrain.py][INFO] Epoch:[0/2](613700/4588595) loss:3.533 lr:0.0000100 epoch_Time:25259.0min: [2024-01-05 10:39:31,794][model8_pretrain.py][INFO] Epoch:[0/2](613700/4588595) loss:2.882 lr:0.0000100 epoch_Time:25259.0min: [2024-01-05 10:39:31,794][model8_pretrain.py][INFO] Epoch:[0/2](613700/4588595) loss:2.934 lr:0.0000100 epoch_Time:25259.0min: [2024-01-05 10:39:31,795][model8_pretrain.py][INFO] Epoch:[0/2](613700/4588595) loss:2.649 lr:0.0000100 epoch_Time:25259.0min: [2024-01-05 10:39:31,795][model8_pretrain.py][INFO] Epoch:[0/2](613700/4588595) loss:3.033 lr:0.0000100 epoch_Time:25259.0min: [2024-01-05 10:40:15,481][model8_pretrain.py][INFO] Epoch:[0/2](613800/4588595) loss:2.755 lr:0.0000100 epoch_Time:25258.0min: [2024-01-05 10:40:15,481][model8_pretrain.py][INFO] Epoch:[0/2](613800/4588595) loss:2.797 lr:0.0000100 epoch_Time:25258.0min: [2024-01-05 10:40:15,481][model8_pretrain.py][INFO] Epoch:[0/2](613800/4588595) loss:2.617 lr:0.0000100 epoch_Time:25258.0min: [2024-01-05 10:40:15,486][model8_pretrain.py][INFO] Epoch:[0/2](613800/4588595) loss:2.725 lr:0.0000100 epoch_Time:25258.0min: [2024-01-05 10:40:15,486][model8_pretrain.py][INFO] Epoch:[0/2](613800/4588595) loss:2.567 lr:0.0000100 epoch_Time:25258.0min: [2024-01-05 10:40:15,486][model8_pretrain.py][INFO] Epoch:[0/2](613800/4588595) loss:3.203 lr:0.0000100 
epoch_Time:25258.0min: [2024-01-05 10:40:15,486][model8_pretrain.py][INFO] Epoch:[0/2](613800/4588595) loss:2.884 lr:0.0000100 epoch_Time:25258.0min: [2024-01-05 10:40:15,486][model8_pretrain.py][INFO] Epoch:[0/2](613800/4588595) loss:2.864 lr:0.0000100 epoch_Time:25258.0min: [2024-01-05 10:40:57,631][model8_pretrain.py][INFO] Epoch:[0/2](613900/4588595) loss:3.037 lr:0.0000100 epoch_Time:25258.0min: [2024-01-05 10:40:57,631][model8_pretrain.py][INFO] Epoch:[0/2](613900/4588595) loss:2.958 lr:0.0000100 epoch_Time:25258.0min: [2024-01-05 10:40:57,631][model8_pretrain.py][INFO] Epoch:[0/2](613900/4588595) loss:3.057 lr:0.0000100 epoch_Time:25258.0min: [2024-01-05 10:40:57,631][model8_pretrain.py][INFO] Epoch:[0/2](613900/4588595) loss:2.845 lr:0.0000100 epoch_Time:25258.0min: [2024-01-05 10:40:57,632][model8_pretrain.py][INFO] Epoch:[0/2](613900/4588595) loss:3.228 lr:0.0000100 epoch_Time:25258.0min: [2024-01-05 10:40:57,632][model8_pretrain.py][INFO] Epoch:[0/2](613900/4588595) loss:3.154 lr:0.0000100 epoch_Time:25258.0min: [2024-01-05 10:40:57,632][model8_pretrain.py][INFO] Epoch:[0/2](613900/4588595) loss:2.705 lr:0.0000100 epoch_Time:25258.0min: [2024-01-05 10:40:57,632][model8_pretrain.py][INFO] Epoch:[0/2](613900/4588595) loss:2.589 lr:0.0000100 epoch_Time:25258.0min: [2024-01-05 10:41:34,573][model8_pretrain.py][INFO] Epoch:[0/2](614000/4588595) loss:2.379 lr:0.0000100 epoch_Time:25258.0min: [2024-01-05 10:41:34,573][model8_pretrain.py][INFO] Epoch:[0/2](614000/4588595) loss:3.181 lr:0.0000100 epoch_Time:25258.0min: [2024-01-05 10:41:34,573][model8_pretrain.py][INFO] Epoch:[0/2](614000/4588595) loss:2.877 lr:0.0000100 epoch_Time:25258.0min: [2024-01-05 10:41:34,573][model8_pretrain.py][INFO] Epoch:[0/2](614000/4588595) loss:2.267 lr:0.0000100 epoch_Time:25258.0min: [2024-01-05 10:41:34,573][model8_pretrain.py][INFO] Epoch:[0/2](614000/4588595) loss:3.026 lr:0.0000100 epoch_Time:25258.0min: [2024-01-05 10:41:34,573][model8_pretrain.py][INFO] 
Epoch:[0/2](614000/4588595) loss:3.016 lr:0.0000100 epoch_Time:25258.0min: [2024-01-05 10:41:34,573][model8_pretrain.py][INFO] Epoch:[0/2](614000/4588595) loss:3.043 lr:0.0000100 epoch_Time:25258.0min: [2024-01-05 10:41:34,573][model8_pretrain.py][INFO] Epoch:[0/2](614000/4588595) loss:2.258 lr:0.0000100 epoch_Time:25258.0min: [2024-01-05 10:42:11,570][model8_pretrain.py][INFO] Epoch:[0/2](614100/4588595) loss:2.702 lr:0.0000100 epoch_Time:25256.0min: [2024-01-05 10:42:11,570][model8_pretrain.py][INFO] Epoch:[0/2](614100/4588595) loss:3.003 lr:0.0000100 epoch_Time:25256.0min: [2024-01-05 10:42:11,570][model8_pretrain.py][INFO] Epoch:[0/2](614100/4588595) loss:2.831 lr:0.0000100 epoch_Time:25256.0min: [2024-01-05 10:42:11,570][model8_pretrain.py][INFO] Epoch:[0/2](614100/4588595) loss:2.609 lr:0.0000100 epoch_Time:25256.0min: [2024-01-05 10:42:11,570][model8_pretrain.py][INFO] Epoch:[0/2](614100/4588595) loss:2.368 lr:0.0000100 epoch_Time:25256.0min: [2024-01-05 10:42:11,570][model8_pretrain.py][INFO] Epoch:[0/2](614100/4588595) loss:3.065 lr:0.0000100 epoch_Time:25256.0min: [2024-01-05 10:42:11,570][model8_pretrain.py][INFO] Epoch:[0/2](614100/4588595) loss:2.402 lr:0.0000100 epoch_Time:25256.0min: [2024-01-05 10:42:11,570][model8_pretrain.py][INFO] Epoch:[0/2](614100/4588595) loss:3.151 lr:0.0000100 epoch_Time:25256.0min: [2024-01-05 10:42:48,513][model8_pretrain.py][INFO] Epoch:[0/2](614200/4588595) loss:2.554 lr:0.0000100 epoch_Time:25255.0min: [2024-01-05 10:42:48,513][model8_pretrain.py][INFO] Epoch:[0/2](614200/4588595) loss:2.955 lr:0.0000100 epoch_Time:25255.0min: [2024-01-05 10:42:48,513][model8_pretrain.py][INFO] Epoch:[0/2](614200/4588595) loss:3.138 lr:0.0000100 epoch_Time:25255.0min: [2024-01-05 10:42:48,513][model8_pretrain.py][INFO] Epoch:[0/2](614200/4588595) loss:2.679 lr:0.0000100 epoch_Time:25255.0min: [2024-01-05 10:42:48,513][model8_pretrain.py][INFO] Epoch:[0/2](614200/4588595) loss:2.852 lr:0.0000100 epoch_Time:25255.0min: [2024-01-05 
10:42:48,513][model8_pretrain.py][INFO] Epoch:[0/2](614200/4588595) loss:3.022 lr:0.0000100 epoch_Time:25255.0min: [2024-01-05 10:42:48,513][model8_pretrain.py][INFO] Epoch:[0/2](614200/4588595) loss:3.376 lr:0.0000100 epoch_Time:25255.0min: [2024-01-05 10:42:48,513][model8_pretrain.py][INFO] Epoch:[0/2](614200/4588595) loss:2.569 lr:0.0000100 epoch_Time:25255.0min: [2024-01-05 10:43:25,487][model8_pretrain.py][INFO] Epoch:[0/2](614300/4588595) loss:1.560 lr:0.0000100 epoch_Time:25255.0min: [2024-01-05 10:43:25,487][model8_pretrain.py][INFO] Epoch:[0/2](614300/4588595) loss:3.332 lr:0.0000100 epoch_Time:25255.0min: [2024-01-05 10:43:25,487][model8_pretrain.py][INFO] Epoch:[0/2](614300/4588595) loss:3.154 lr:0.0000100 epoch_Time:25255.0min: [2024-01-05 10:43:25,487][model8_pretrain.py][INFO] Epoch:[0/2](614300/4588595) loss:2.732 lr:0.0000100 epoch_Time:25255.0min: [2024-01-05 10:43:25,487][model8_pretrain.py][INFO] Epoch:[0/2](614300/4588595) loss:3.074 lr:0.0000100 epoch_Time:25255.0min: [2024-01-05 10:43:25,487][model8_pretrain.py][INFO] Epoch:[0/2](614300/4588595) loss:3.017 lr:0.0000100 epoch_Time:25255.0min: [2024-01-05 10:43:25,488][model8_pretrain.py][INFO] Epoch:[0/2](614300/4588595) loss:2.846 lr:0.0000100 epoch_Time:25255.0min: [2024-01-05 10:43:25,488][model8_pretrain.py][INFO] Epoch:[0/2](614300/4588595) loss:2.706 lr:0.0000100 epoch_Time:25255.0min: [2024-01-05 10:44:02,436][model8_pretrain.py][INFO] Epoch:[0/2](614400/4588595) loss:2.409 lr:0.0000100 epoch_Time:25254.0min: [2024-01-05 10:44:02,436][model8_pretrain.py][INFO] Epoch:[0/2](614400/4588595) loss:3.044 lr:0.0000100 epoch_Time:25254.0min: [2024-01-05 10:44:02,436][model8_pretrain.py][INFO] Epoch:[0/2](614400/4588595) loss:3.077 lr:0.0000100 epoch_Time:25254.0min: [2024-01-05 10:44:02,436][model8_pretrain.py][INFO] Epoch:[0/2](614400/4588595) loss:2.763 lr:0.0000100 epoch_Time:25254.0min: [2024-01-05 10:44:02,436][model8_pretrain.py][INFO] Epoch:[0/2](614400/4588595) loss:2.854 lr:0.0000100 
epoch_Time:25254.0min: [2024-01-05 10:44:02,436][model8_pretrain.py][INFO] Epoch:[0/2](614400/4588595) loss:2.654 lr:0.0000100 epoch_Time:25254.0min: [2024-01-05 10:44:02,436][model8_pretrain.py][INFO] Epoch:[0/2](614400/4588595) loss:3.285 lr:0.0000100 epoch_Time:25254.0min: [2024-01-05 10:44:02,436][model8_pretrain.py][INFO] Epoch:[0/2](614400/4588595) loss:2.728 lr:0.0000100 epoch_Time:25254.0min: [2024-01-05 10:44:39,371][model8_pretrain.py][INFO] Epoch:[0/2](614500/4588595) loss:3.306 lr:0.0000100 epoch_Time:25254.0min: [2024-01-05 10:44:39,371][model8_pretrain.py][INFO] Epoch:[0/2](614500/4588595) loss:2.757 lr:0.0000100 epoch_Time:25254.0min: [2024-01-05 10:44:39,371][model8_pretrain.py][INFO] Epoch:[0/2](614500/4588595) loss:2.440 lr:0.0000100 epoch_Time:25254.0min: [2024-01-05 10:44:39,371][model8_pretrain.py][INFO] Epoch:[0/2](614500/4588595) loss:3.255 lr:0.0000100 epoch_Time:25254.0min: [2024-01-05 10:44:39,371][model8_pretrain.py][INFO] Epoch:[0/2](614500/4588595) loss:2.612 lr:0.0000100 epoch_Time:25254.0min: [2024-01-05 10:44:39,371][model8_pretrain.py][INFO] Epoch:[0/2](614500/4588595) loss:3.191 lr:0.0000100 epoch_Time:25254.0min: [2024-01-05 10:44:39,371][model8_pretrain.py][INFO] Epoch:[0/2](614500/4588595) loss:2.979 lr:0.0000100 epoch_Time:25254.0min: [2024-01-05 10:44:39,371][model8_pretrain.py][INFO] Epoch:[0/2](614500/4588595) loss:3.401 lr:0.0000100 epoch_Time:25254.0min: [2024-01-05 10:45:19,759][model8_pretrain.py][INFO] Epoch:[0/2](614600/4588595) loss:2.970 lr:0.0000100 epoch_Time:25253.0min: [2024-01-05 10:45:19,760][model8_pretrain.py][INFO] Epoch:[0/2](614600/4588595) loss:3.240 lr:0.0000100 epoch_Time:25253.0min: [2024-01-05 10:45:19,760][model8_pretrain.py][INFO] Epoch:[0/2](614600/4588595) loss:2.927 lr:0.0000100 epoch_Time:25253.0min: [2024-01-05 10:45:19,760][model8_pretrain.py][INFO] Epoch:[0/2](614600/4588595) loss:3.397 lr:0.0000100 epoch_Time:25253.0min: [2024-01-05 10:45:19,760][model8_pretrain.py][INFO] 
Epoch:[0/2](614600/4588595) loss:2.566 lr:0.0000100 epoch_Time:25253.0min: [2024-01-05 10:45:19,760][model8_pretrain.py][INFO] Epoch:[0/2](614600/4588595) loss:3.214 lr:0.0000100 epoch_Time:25253.0min: [2024-01-05 10:45:19,760][model8_pretrain.py][INFO] Epoch:[0/2](614600/4588595) loss:3.041 lr:0.0000100 epoch_Time:25253.0min: [2024-01-05 10:45:19,760][model8_pretrain.py][INFO] Epoch:[0/2](614600/4588595) loss:2.945 lr:0.0000100 epoch_Time:25253.0min: [2024-01-05 10:46:05,151][model8_pretrain.py][INFO] Epoch:[0/2](614700/4588595) loss:2.428 lr:0.0000100 epoch_Time:25253.0min: [2024-01-05 10:46:05,151][model8_pretrain.py][INFO] Epoch:[0/2](614700/4588595) loss:3.356 lr:0.0000100 epoch_Time:25253.0min: [2024-01-05 10:46:05,152][model8_pretrain.py][INFO] Epoch:[0/2](614700/4588595) loss:2.715 lr:0.0000100 epoch_Time:25253.0min: [2024-01-05 10:46:05,152][model8_pretrain.py][INFO] Epoch:[0/2](614700/4588595) loss:2.343 lr:0.0000100 epoch_Time:25253.0min: [2024-01-05 10:46:05,152][model8_pretrain.py][INFO] Epoch:[0/2](614700/4588595) loss:3.116 lr:0.0000100 epoch_Time:25253.0min: [2024-01-05 10:46:05,152][model8_pretrain.py][INFO] Epoch:[0/2](614700/4588595) loss:2.889 lr:0.0000100 epoch_Time:25253.0min: [2024-01-05 10:46:05,152][model8_pretrain.py][INFO] Epoch:[0/2](614700/4588595) loss:2.843 lr:0.0000100 epoch_Time:25253.0min: [2024-01-05 10:46:05,152][model8_pretrain.py][INFO] Epoch:[0/2](614700/4588595) loss:3.260 lr:0.0000100 epoch_Time:25253.0min: [2024-01-05 10:46:42,108][model8_pretrain.py][INFO] Epoch:[0/2](614800/4588595) loss:2.784 lr:0.0000100 epoch_Time:25253.0min: [2024-01-05 10:46:42,108][model8_pretrain.py][INFO] Epoch:[0/2](614800/4588595) loss:2.962 lr:0.0000100 epoch_Time:25253.0min: [2024-01-05 10:46:42,108][model8_pretrain.py][INFO] Epoch:[0/2](614800/4588595) loss:2.405 lr:0.0000100 epoch_Time:25253.0min: [2024-01-05 10:46:42,108][model8_pretrain.py][INFO] Epoch:[0/2](614800/4588595) loss:3.117 lr:0.0000100 epoch_Time:25253.0min: [2024-01-05 
10:46:42,108][model8_pretrain.py][INFO] Epoch:[0/2](614800/4588595) loss:2.466 lr:0.0000100 epoch_Time:25253.0min: [2024-01-05 10:46:42,108][model8_pretrain.py][INFO] Epoch:[0/2](614800/4588595) loss:2.537 lr:0.0000100 epoch_Time:25253.0min: [2024-01-05 10:46:42,108][model8_pretrain.py][INFO] Epoch:[0/2](614800/4588595) loss:3.159 lr:0.0000100 epoch_Time:25253.0min: [2024-01-05 10:46:42,108][model8_pretrain.py][INFO] Epoch:[0/2](614800/4588595) loss:2.834 lr:0.0000100 epoch_Time:25253.0min: [2024-01-05 10:47:19,051][model8_pretrain.py][INFO] Epoch:[0/2](614900/4588595) loss:3.081 lr:0.0000100 epoch_Time:25252.0min: [2024-01-05 10:47:19,051][model8_pretrain.py][INFO] Epoch:[0/2](614900/4588595) loss:3.062 lr:0.0000100 epoch_Time:25252.0min: [2024-01-05 10:47:19,051][model8_pretrain.py][INFO] Epoch:[0/2](614900/4588595) loss:2.718 lr:0.0000100 epoch_Time:25252.0min: [2024-01-05 10:47:19,051][model8_pretrain.py][INFO] Epoch:[0/2](614900/4588595) loss:2.834 lr:0.0000100 epoch_Time:25252.0min: [2024-01-05 10:47:19,051][model8_pretrain.py][INFO] Epoch:[0/2](614900/4588595) loss:3.275 lr:0.0000100 epoch_Time:25252.0min: [2024-01-05 10:47:19,051][model8_pretrain.py][INFO] Epoch:[0/2](614900/4588595) loss:2.770 lr:0.0000100 epoch_Time:25252.0min: [2024-01-05 10:47:19,051][model8_pretrain.py][INFO] Epoch:[0/2](614900/4588595) loss:2.449 lr:0.0000100 epoch_Time:25252.0min: [2024-01-05 10:47:19,051][model8_pretrain.py][INFO] Epoch:[0/2](614900/4588595) loss:3.094 lr:0.0000100 epoch_Time:25252.0min: [2024-01-05 10:47:55,988][model8_pretrain.py][INFO] Epoch:[0/2](615000/4588595) loss:2.913 lr:0.0000100 epoch_Time:25251.0min: [2024-01-05 10:47:55,988][model8_pretrain.py][INFO] Epoch:[0/2](615000/4588595) loss:2.738 lr:0.0000100 epoch_Time:25251.0min: [2024-01-05 10:47:55,988][model8_pretrain.py][INFO] Epoch:[0/2](615000/4588595) loss:2.795 lr:0.0000100 epoch_Time:25251.0min: [2024-01-05 10:47:55,988][model8_pretrain.py][INFO] Epoch:[0/2](615000/4588595) loss:2.906 lr:0.0000100 
epoch_Time:25251.0min: [2024-01-05 10:47:55,988][model8_pretrain.py][INFO] Epoch:[0/2](615000/4588595) loss:2.998 lr:0.0000100 epoch_Time:25251.0min: [2024-01-05 10:47:55,988][model8_pretrain.py][INFO] Epoch:[0/2](615000/4588595) loss:3.213 lr:0.0000100 epoch_Time:25251.0min: [2024-01-05 10:47:55,988][model8_pretrain.py][INFO] Epoch:[0/2](615000/4588595) loss:2.906 lr:0.0000100 epoch_Time:25251.0min: [2024-01-05 10:47:55,988][model8_pretrain.py][INFO] Epoch:[0/2](615000/4588595) loss:3.275 lr:0.0000100 epoch_Time:25251.0min: [2024-01-05 10:48:32,938][model8_pretrain.py][INFO] Epoch:[0/2](615100/4588595) loss:2.697 lr:0.0000100 epoch_Time:25250.0min: [2024-01-05 10:48:32,938][model8_pretrain.py][INFO] Epoch:[0/2](615100/4588595) loss:2.955 lr:0.0000100 epoch_Time:25250.0min: [2024-01-05 10:48:32,938][model8_pretrain.py][INFO] Epoch:[0/2](615100/4588595) loss:2.684 lr:0.0000100 epoch_Time:25250.0min: [2024-01-05 10:48:32,938][model8_pretrain.py][INFO] Epoch:[0/2](615100/4588595) loss:1.997 lr:0.0000100 epoch_Time:25250.0min: [2024-01-05 10:48:32,938][model8_pretrain.py][INFO] Epoch:[0/2](615100/4588595) loss:3.095 lr:0.0000100 epoch_Time:25250.0min: [2024-01-05 10:48:32,938][model8_pretrain.py][INFO] Epoch:[0/2](615100/4588595) loss:2.602 lr:0.0000100 epoch_Time:25250.0min: [2024-01-05 10:48:32,938][model8_pretrain.py][INFO] Epoch:[0/2](615100/4588595) loss:2.437 lr:0.0000100 epoch_Time:25250.0min: [2024-01-05 10:48:32,938][model8_pretrain.py][INFO] Epoch:[0/2](615100/4588595) loss:2.230 lr:0.0000100 epoch_Time:25250.0min: [2024-01-05 10:49:09,895][model8_pretrain.py][INFO] Epoch:[0/2](615200/4588595) loss:2.910 lr:0.0000100 epoch_Time:25249.0min: [2024-01-05 10:49:09,895][model8_pretrain.py][INFO] Epoch:[0/2](615200/4588595) loss:2.828 lr:0.0000100 epoch_Time:25249.0min: [2024-01-05 10:49:09,895][model8_pretrain.py][INFO] Epoch:[0/2](615200/4588595) loss:2.755 lr:0.0000100 epoch_Time:25249.0min: [2024-01-05 10:49:09,895][model8_pretrain.py][INFO] 
Epoch:[0/2](615200/4588595) loss:2.407 lr:0.0000100 epoch_Time:25249.0min: [2024-01-05 10:49:09,895][model8_pretrain.py][INFO] Epoch:[0/2](615200/4588595) loss:2.804 lr:0.0000100 epoch_Time:25249.0min: [2024-01-05 10:49:09,895][model8_pretrain.py][INFO] Epoch:[0/2](615200/4588595) loss:2.877 lr:0.0000100 epoch_Time:25249.0min: [2024-01-05 10:49:09,895][model8_pretrain.py][INFO] Epoch:[0/2](615200/4588595) loss:2.174 lr:0.0000100 epoch_Time:25249.0min: [2024-01-05 10:49:09,895][model8_pretrain.py][INFO] Epoch:[0/2](615200/4588595) loss:2.565 lr:0.0000100 epoch_Time:25249.0min: [2024-01-05 10:49:46,829][model8_pretrain.py][INFO] Epoch:[0/2](615300/4588595) loss:3.053 lr:0.0000100 epoch_Time:25249.0min: [2024-01-05 10:49:46,829][model8_pretrain.py][INFO] Epoch:[0/2](615300/4588595) loss:2.843 lr:0.0000100 epoch_Time:25249.0min: [2024-01-05 10:49:46,829][model8_pretrain.py][INFO] Epoch:[0/2](615300/4588595) loss:3.013 lr:0.0000100 epoch_Time:25249.0min: [2024-01-05 10:49:46,829][model8_pretrain.py][INFO] Epoch:[0/2](615300/4588595) loss:2.287 lr:0.0000100 epoch_Time:25249.0min: [2024-01-05 10:49:46,829][model8_pretrain.py][INFO] Epoch:[0/2](615300/4588595) loss:2.836 lr:0.0000100 epoch_Time:25249.0min: [2024-01-05 10:49:46,829][model8_pretrain.py][INFO] Epoch:[0/2](615300/4588595) loss:3.088 lr:0.0000100 epoch_Time:25249.0min: [2024-01-05 10:49:46,829][model8_pretrain.py][INFO] Epoch:[0/2](615300/4588595) loss:3.205 lr:0.0000100 epoch_Time:25249.0min: [2024-01-05 10:49:46,829][model8_pretrain.py][INFO] Epoch:[0/2](615300/4588595) loss:3.485 lr:0.0000100 epoch_Time:25249.0min: [2024-01-05 10:50:27,217][model8_pretrain.py][INFO] Epoch:[0/2](615400/4588595) loss:2.705 lr:0.0000100 epoch_Time:25248.0min: [2024-01-05 10:50:27,217][model8_pretrain.py][INFO] Epoch:[0/2](615400/4588595) loss:3.390 lr:0.0000100 epoch_Time:25248.0min: [2024-01-05 10:50:27,217][model8_pretrain.py][INFO] Epoch:[0/2](615400/4588595) loss:1.955 lr:0.0000100 epoch_Time:25248.0min: [2024-01-05 
10:50:27,217][model8_pretrain.py][INFO] Epoch:[0/2](615400/4588595) loss:2.842 lr:0.0000100 epoch_Time:25248.0min: [2024-01-05 10:50:27,217][model8_pretrain.py][INFO] Epoch:[0/2](615400/4588595) loss:3.352 lr:0.0000100 epoch_Time:25248.0min: [2024-01-05 10:50:27,217][model8_pretrain.py][INFO] Epoch:[0/2](615400/4588595) loss:2.917 lr:0.0000100 epoch_Time:25248.0min: [2024-01-05 10:50:27,217][model8_pretrain.py][INFO] Epoch:[0/2](615400/4588595) loss:2.862 lr:0.0000100 epoch_Time:25248.0min: [2024-01-05 10:50:27,217][model8_pretrain.py][INFO] Epoch:[0/2](615400/4588595) loss:3.281 lr:0.0000100 epoch_Time:25248.0min: [2024-01-05 10:51:12,599][model8_pretrain.py][INFO] Epoch:[0/2](615500/4588595) loss:3.061 lr:0.0000100 epoch_Time:25248.0min: [2024-01-05 10:51:12,600][model8_pretrain.py][INFO] Epoch:[0/2](615500/4588595) loss:2.704 lr:0.0000100 epoch_Time:25248.0min: [2024-01-05 10:51:12,600][model8_pretrain.py][INFO] Epoch:[0/2](615500/4588595) loss:3.159 lr:0.0000100 epoch_Time:25248.0min: [2024-01-05 10:51:12,600][model8_pretrain.py][INFO] Epoch:[0/2](615500/4588595) loss:2.799 lr:0.0000100 epoch_Time:25248.0min: [2024-01-05 10:51:12,600][model8_pretrain.py][INFO] Epoch:[0/2](615500/4588595) loss:2.094 lr:0.0000100 epoch_Time:25248.0min: [2024-01-05 10:51:12,600][model8_pretrain.py][INFO] Epoch:[0/2](615500/4588595) loss:2.692 lr:0.0000100 epoch_Time:25248.0min: [2024-01-05 10:51:12,600][model8_pretrain.py][INFO] Epoch:[0/2](615500/4588595) loss:2.181 lr:0.0000100 epoch_Time:25248.0min: [2024-01-05 10:51:12,600][model8_pretrain.py][INFO] Epoch:[0/2](615500/4588595) loss:2.828 lr:0.0000100 epoch_Time:25248.0min: [2024-01-05 10:51:49,532][model8_pretrain.py][INFO] Epoch:[0/2](615600/4588595) loss:2.851 lr:0.0000100 epoch_Time:25247.0min: [2024-01-05 10:51:49,532][model8_pretrain.py][INFO] Epoch:[0/2](615600/4588595) loss:3.433 lr:0.0000100 epoch_Time:25247.0min: [2024-01-05 10:51:49,532][model8_pretrain.py][INFO] Epoch:[0/2](615600/4588595) loss:2.612 lr:0.0000100 
epoch_Time:25247.0min: [2024-01-05 10:51:49,532][model8_pretrain.py][INFO] Epoch:[0/2](615600/4588595) loss:3.291 lr:0.0000100 epoch_Time:25247.0min: [2024-01-05 10:51:49,532][model8_pretrain.py][INFO] Epoch:[0/2](615600/4588595) loss:2.574 lr:0.0000100 epoch_Time:25247.0min: [2024-01-05 10:51:49,532][model8_pretrain.py][INFO] Epoch:[0/2](615600/4588595) loss:3.029 lr:0.0000100 epoch_Time:25247.0min: [2024-01-05 10:51:49,532][model8_pretrain.py][INFO] Epoch:[0/2](615600/4588595) loss:2.848 lr:0.0000100 epoch_Time:25247.0min: [2024-01-05 10:51:49,532][model8_pretrain.py][INFO] Epoch:[0/2](615600/4588595) loss:2.782 lr:0.0000100 epoch_Time:25247.0min: [2024-01-05 10:52:26,469][model8_pretrain.py][INFO] Epoch:[0/2](615700/4588595) loss:2.017 lr:0.0000100 epoch_Time:25247.0min: [2024-01-05 10:52:26,469][model8_pretrain.py][INFO] Epoch:[0/2](615700/4588595) loss:2.943 lr:0.0000100 epoch_Time:25247.0min: [2024-01-05 10:52:26,469][model8_pretrain.py][INFO] Epoch:[0/2](615700/4588595) loss:3.036 lr:0.0000100 epoch_Time:25247.0min: [2024-01-05 10:52:26,469][model8_pretrain.py][INFO] Epoch:[0/2](615700/4588595) loss:3.142 lr:0.0000100 epoch_Time:25247.0min: [2024-01-05 10:52:26,469][model8_pretrain.py][INFO] Epoch:[0/2](615700/4588595) loss:2.739 lr:0.0000100 epoch_Time:25247.0min: [2024-01-05 10:52:26,469][model8_pretrain.py][INFO] Epoch:[0/2](615700/4588595) loss:2.749 lr:0.0000100 epoch_Time:25247.0min: [2024-01-05 10:52:26,469][model8_pretrain.py][INFO] Epoch:[0/2](615700/4588595) loss:2.562 lr:0.0000100 epoch_Time:25247.0min: [2024-01-05 10:52:26,469][model8_pretrain.py][INFO] Epoch:[0/2](615700/4588595) loss:3.129 lr:0.0000100 epoch_Time:25247.0min: [2024-01-05 10:53:03,407][model8_pretrain.py][INFO] Epoch:[0/2](615800/4588595) loss:2.912 lr:0.0000100 epoch_Time:25246.0min: [2024-01-05 10:53:03,407][model8_pretrain.py][INFO] Epoch:[0/2](615800/4588595) loss:3.021 lr:0.0000100 epoch_Time:25246.0min: [2024-01-05 10:53:03,407][model8_pretrain.py][INFO] 
Epoch:[0/2](615800/4588595) loss:3.301 lr:0.0000100 epoch_Time:25246.0min: [2024-01-05 10:53:03,407][model8_pretrain.py][INFO] Epoch:[0/2](615800/4588595) loss:3.166 lr:0.0000100 epoch_Time:25246.0min: [2024-01-05 10:53:03,408][model8_pretrain.py][INFO] Epoch:[0/2](615800/4588595) loss:2.566 lr:0.0000100 epoch_Time:25246.0min: [2024-01-05 10:53:03,408][model8_pretrain.py][INFO] Epoch:[0/2](615800/4588595) loss:2.238 lr:0.0000100 epoch_Time:25246.0min: [2024-01-05 10:53:03,408][model8_pretrain.py][INFO] Epoch:[0/2](615800/4588595) loss:2.862 lr:0.0000100 epoch_Time:25246.0min: [2024-01-05 10:53:03,408][model8_pretrain.py][INFO] Epoch:[0/2](615800/4588595) loss:3.175 lr:0.0000100 epoch_Time:25246.0min: [2024-01-05 10:53:40,358][model8_pretrain.py][INFO] Epoch:[0/2](615900/4588595) loss:2.793 lr:0.0000100 epoch_Time:25246.0min: [2024-01-05 10:53:40,358][model8_pretrain.py][INFO] Epoch:[0/2](615900/4588595) loss:3.138 lr:0.0000100 epoch_Time:25246.0min: [2024-01-05 10:53:40,358][model8_pretrain.py][INFO] Epoch:[0/2](615900/4588595) loss:2.283 lr:0.0000100 epoch_Time:25246.0min: [2024-01-05 10:53:40,358][model8_pretrain.py][INFO] Epoch:[0/2](615900/4588595) loss:2.690 lr:0.0000100 epoch_Time:25246.0min: [2024-01-05 10:53:40,358][model8_pretrain.py][INFO] Epoch:[0/2](615900/4588595) loss:2.394 lr:0.0000100 epoch_Time:25246.0min: [2024-01-05 10:53:40,358][model8_pretrain.py][INFO] Epoch:[0/2](615900/4588595) loss:3.254 lr:0.0000100 epoch_Time:25246.0min: [2024-01-05 10:53:40,359][model8_pretrain.py][INFO] Epoch:[0/2](615900/4588595) loss:2.665 lr:0.0000100 epoch_Time:25246.0min: [2024-01-05 10:53:40,359][model8_pretrain.py][INFO] Epoch:[0/2](615900/4588595) loss:2.805 lr:0.0000100 epoch_Time:25246.0min: [2024-01-05 10:54:17,309][model8_pretrain.py][INFO] Epoch:[0/2](616000/4588595) loss:2.495 lr:0.0000100 epoch_Time:25245.0min: [2024-01-05 10:54:17,309][model8_pretrain.py][INFO] Epoch:[0/2](616000/4588595) loss:2.870 lr:0.0000100 epoch_Time:25245.0min: [2024-01-05 
10:54:17,309][model8_pretrain.py][INFO] Epoch:[0/2](616000/4588595) loss:1.976 lr:0.0000100 epoch_Time:25245.0min: [2024-01-05 10:54:17,309][model8_pretrain.py][INFO] Epoch:[0/2](616000/4588595) loss:2.783 lr:0.0000100 epoch_Time:25245.0min: [2024-01-05 10:54:17,309][model8_pretrain.py][INFO] Epoch:[0/2](616000/4588595) loss:3.031 lr:0.0000100 epoch_Time:25245.0min: [2024-01-05 10:54:17,309][model8_pretrain.py][INFO] Epoch:[0/2](616000/4588595) loss:2.234 lr:0.0000100 epoch_Time:25245.0min: [2024-01-05 10:54:17,309][model8_pretrain.py][INFO] Epoch:[0/2](616000/4588595) loss:3.118 lr:0.0000100 epoch_Time:25245.0min: [2024-01-05 10:54:17,309][model8_pretrain.py][INFO] Epoch:[0/2](616000/4588595) loss:3.232 lr:0.0000100 epoch_Time:25245.0min: [2024-01-05 10:54:54,256][model8_pretrain.py][INFO] Epoch:[0/2](616100/4588595) loss:2.856 lr:0.0000100 epoch_Time:25243.0min: [2024-01-05 10:54:54,256][model8_pretrain.py][INFO] Epoch:[0/2](616100/4588595) loss:2.911 lr:0.0000100 epoch_Time:25243.0min: [2024-01-05 10:54:54,256][model8_pretrain.py][INFO] Epoch:[0/2](616100/4588595) loss:2.956 lr:0.0000100 epoch_Time:25243.0min: [2024-01-05 10:54:54,256][model8_pretrain.py][INFO] Epoch:[0/2](616100/4588595) loss:2.112 lr:0.0000100 epoch_Time:25243.0min: [2024-01-05 10:54:54,256][model8_pretrain.py][INFO] Epoch:[0/2](616100/4588595) loss:3.028 lr:0.0000100 epoch_Time:25243.0min: [2024-01-05 10:54:54,256][model8_pretrain.py][INFO] Epoch:[0/2](616100/4588595) loss:2.624 lr:0.0000100 epoch_Time:25243.0min: [2024-01-05 10:54:54,256][model8_pretrain.py][INFO] Epoch:[0/2](616100/4588595) loss:2.827 lr:0.0000100 epoch_Time:25243.0min: [2024-01-05 10:54:54,256][model8_pretrain.py][INFO] Epoch:[0/2](616100/4588595) loss:2.551 lr:0.0000100 epoch_Time:25243.0min: [2024-01-05 10:55:32,926][model8_pretrain.py][INFO] Epoch:[0/2](616200/4588595) loss:2.710 lr:0.0000100 epoch_Time:25244.0min: [2024-01-05 10:55:32,926][model8_pretrain.py][INFO] Epoch:[0/2](616200/4588595) loss:2.523 lr:0.0000100 
epoch_Time:25244.0min: [2024-01-05 10:55:32,926][model8_pretrain.py][INFO] Epoch:[0/2](616200/4588595) loss:3.079 lr:0.0000100 epoch_Time:25244.0min: [2024-01-05 10:55:32,926][model8_pretrain.py][INFO] Epoch:[0/2](616200/4588595) loss:2.866 lr:0.0000100 epoch_Time:25244.0min: [2024-01-05 10:55:32,926][model8_pretrain.py][INFO] Epoch:[0/2](616200/4588595) loss:2.622 lr:0.0000100 epoch_Time:25244.0min: [2024-01-05 10:55:32,926][model8_pretrain.py][INFO] Epoch:[0/2](616200/4588595) loss:2.262 lr:0.0000100 epoch_Time:25244.0min: [2024-01-05 10:55:32,926][model8_pretrain.py][INFO] Epoch:[0/2](616200/4588595) loss:2.975 lr:0.0000100 epoch_Time:25244.0min: [2024-01-05 10:55:34,638][model8_pretrain.py][INFO] Epoch:[0/2](616200/4588595) loss:2.918 lr:0.0000100 epoch_Time:25244.0min: [2024-01-05 10:56:20,118][model8_pretrain.py][INFO] Epoch:[0/2](616300/4588595) loss:2.307 lr:0.0000100 epoch_Time:25244.0min: [2024-01-05 10:56:20,118][model8_pretrain.py][INFO] Epoch:[0/2](616300/4588595) loss:3.046 lr:0.0000100 epoch_Time:25244.0min: [2024-01-05 10:56:20,118][model8_pretrain.py][INFO] Epoch:[0/2](616300/4588595) loss:3.153 lr:0.0000100 epoch_Time:25244.0min: [2024-01-05 10:56:20,118][model8_pretrain.py][INFO] Epoch:[0/2](616300/4588595) loss:2.862 lr:0.0000100 epoch_Time:25244.0min: [2024-01-05 10:56:20,119][model8_pretrain.py][INFO] Epoch:[0/2](616300/4588595) loss:3.002 lr:0.0000100 epoch_Time:25244.0min: [2024-01-05 10:56:20,119][model8_pretrain.py][INFO] Epoch:[0/2](616300/4588595) loss:2.907 lr:0.0000100 epoch_Time:25244.0min: [2024-01-05 10:56:20,119][model8_pretrain.py][INFO] Epoch:[0/2](616300/4588595) loss:2.984 lr:0.0000100 epoch_Time:25244.0min: [2024-01-05 10:56:20,119][model8_pretrain.py][INFO] Epoch:[0/2](616300/4588595) loss:2.572 lr:0.0000100 epoch_Time:25244.0min: [2024-01-05 10:56:57,073][model8_pretrain.py][INFO] Epoch:[0/2](616400/4588595) loss:2.970 lr:0.0000100 epoch_Time:25242.0min: [2024-01-05 10:56:57,073][model8_pretrain.py][INFO] 
Epoch:[0/2](616400/4588595) loss:2.961 lr:0.0000100 epoch_Time:25242.0min: [2024-01-05 10:56:57,073][model8_pretrain.py][INFO] Epoch:[0/2](616400/4588595) loss:2.977 lr:0.0000100 epoch_Time:25242.0min: [2024-01-05 10:56:57,073][model8_pretrain.py][INFO] Epoch:[0/2](616400/4588595) loss:3.529 lr:0.0000100 epoch_Time:25242.0min: [2024-01-05 10:56:57,073][model8_pretrain.py][INFO] Epoch:[0/2](616400/4588595) loss:3.187 lr:0.0000100 epoch_Time:25242.0min: [2024-01-05 10:56:57,073][model8_pretrain.py][INFO] Epoch:[0/2](616400/4588595) loss:2.412 lr:0.0000100 epoch_Time:25242.0min: [2024-01-05 10:56:57,073][model8_pretrain.py][INFO] Epoch:[0/2](616400/4588595) loss:2.411 lr:0.0000100 epoch_Time:25243.0min: [2024-01-05 10:56:57,073][model8_pretrain.py][INFO] Epoch:[0/2](616400/4588595) loss:2.936 lr:0.0000100 epoch_Time:25242.0min: [2024-01-05 10:57:34,015][model8_pretrain.py][INFO] Epoch:[0/2](616500/4588595) loss:3.092 lr:0.0000100 epoch_Time:25242.0min: [2024-01-05 10:57:34,015][model8_pretrain.py][INFO] Epoch:[0/2](616500/4588595) loss:2.624 lr:0.0000100 epoch_Time:25242.0min: [2024-01-05 10:57:34,015][model8_pretrain.py][INFO] Epoch:[0/2](616500/4588595) loss:3.194 lr:0.0000100 epoch_Time:25242.0min: [2024-01-05 10:57:34,015][model8_pretrain.py][INFO] Epoch:[0/2](616500/4588595) loss:2.464 lr:0.0000100 epoch_Time:25242.0min: [2024-01-05 10:57:34,015][model8_pretrain.py][INFO] Epoch:[0/2](616500/4588595) loss:2.949 lr:0.0000100 epoch_Time:25242.0min: [2024-01-05 10:57:34,015][model8_pretrain.py][INFO] Epoch:[0/2](616500/4588595) loss:2.721 lr:0.0000100 epoch_Time:25242.0min: [2024-01-05 10:57:34,015][model8_pretrain.py][INFO] Epoch:[0/2](616500/4588595) loss:2.744 lr:0.0000100 epoch_Time:25242.0min: [2024-01-05 10:57:34,015][model8_pretrain.py][INFO] Epoch:[0/2](616500/4588595) loss:2.726 lr:0.0000100 epoch_Time:25242.0min: [2024-01-05 10:58:10,965][model8_pretrain.py][INFO] Epoch:[0/2](616600/4588595) loss:3.195 lr:0.0000100 epoch_Time:25241.0min: [2024-01-05 
10:58:10,965][model8_pretrain.py][INFO] Epoch:[0/2](616600/4588595) loss:2.952 lr:0.0000100 epoch_Time:25241.0min: [2024-01-05 10:58:10,965][model8_pretrain.py][INFO] Epoch:[0/2](616600/4588595) loss:3.070 lr:0.0000100 epoch_Time:25241.0min: [2024-01-05 10:58:10,965][model8_pretrain.py][INFO] Epoch:[0/2](616600/4588595) loss:3.294 lr:0.0000100 epoch_Time:25241.0min: [2024-01-05 10:58:10,965][model8_pretrain.py][INFO] Epoch:[0/2](616600/4588595) loss:3.188 lr:0.0000100 epoch_Time:25241.0min: [2024-01-05 10:58:10,965][model8_pretrain.py][INFO] Epoch:[0/2](616600/4588595) loss:3.304 lr:0.0000100 epoch_Time:25241.0min: [2024-01-05 10:58:10,965][model8_pretrain.py][INFO] Epoch:[0/2](616600/4588595) loss:2.842 lr:0.0000100 epoch_Time:25241.0min: [2024-01-05 10:58:10,965][model8_pretrain.py][INFO] Epoch:[0/2](616600/4588595) loss:2.395 lr:0.0000100 epoch_Time:25241.0min: [2024-01-05 10:58:47,910][model8_pretrain.py][INFO] Epoch:[0/2](616700/4588595) loss:2.647 lr:0.0000100 epoch_Time:25240.0min: [2024-01-05 10:58:47,910][model8_pretrain.py][INFO] Epoch:[0/2](616700/4588595) loss:1.865 lr:0.0000100 epoch_Time:25240.0min: [2024-01-05 10:58:47,910][model8_pretrain.py][INFO] Epoch:[0/2](616700/4588595) loss:2.244 lr:0.0000100 epoch_Time:25240.0min: [2024-01-05 10:58:47,910][model8_pretrain.py][INFO] Epoch:[0/2](616700/4588595) loss:3.215 lr:0.0000100 epoch_Time:25240.0min: [2024-01-05 10:58:47,910][model8_pretrain.py][INFO] Epoch:[0/2](616700/4588595) loss:3.146 lr:0.0000100 epoch_Time:25240.0min: [2024-01-05 10:58:47,910][model8_pretrain.py][INFO] Epoch:[0/2](616700/4588595) loss:2.187 lr:0.0000100 epoch_Time:25240.0min: [2024-01-05 10:58:47,910][model8_pretrain.py][INFO] Epoch:[0/2](616700/4588595) loss:2.273 lr:0.0000100 epoch_Time:25240.0min: [2024-01-05 10:58:47,910][model8_pretrain.py][INFO] Epoch:[0/2](616700/4588595) loss:2.752 lr:0.0000100 epoch_Time:25240.0min: [2024-01-05 10:59:24,893][model8_pretrain.py][INFO] Epoch:[0/2](616800/4588595) loss:2.763 lr:0.0000100 
epoch_Time:25240.0min: [2024-01-05 10:59:24,893][model8_pretrain.py][INFO] Epoch:[0/2](616800/4588595) loss:3.049 lr:0.0000100 epoch_Time:25240.0min: [2024-01-05 10:59:24,893][model8_pretrain.py][INFO] Epoch:[0/2](616800/4588595) loss:2.851 lr:0.0000100 epoch_Time:25240.0min: [2024-01-05 10:59:24,893][model8_pretrain.py][INFO] Epoch:[0/2](616800/4588595) loss:2.798 lr:0.0000100 epoch_Time:25240.0min: [2024-01-05 10:59:24,893][model8_pretrain.py][INFO] Epoch:[0/2](616800/4588595) loss:3.008 lr:0.0000100 epoch_Time:25240.0min: [2024-01-05 10:59:24,893][model8_pretrain.py][INFO] Epoch:[0/2](616800/4588595) loss:2.560 lr:0.0000100 epoch_Time:25240.0min: [2024-01-05 10:59:24,893][model8_pretrain.py][INFO] Epoch:[0/2](616800/4588595) loss:2.940 lr:0.0000100 epoch_Time:25240.0min: [2024-01-05 10:59:24,894][model8_pretrain.py][INFO] Epoch:[0/2](616800/4588595) loss:2.536 lr:0.0000100 epoch_Time:25240.0min: [2024-01-05 11:00:01,852][model8_pretrain.py][INFO] Epoch:[0/2](616900/4588595) loss:2.917 lr:0.0000100 epoch_Time:25239.0min: [2024-01-05 11:00:01,852][model8_pretrain.py][INFO] Epoch:[0/2](616900/4588595) loss:3.273 lr:0.0000100 epoch_Time:25239.0min: [2024-01-05 11:00:01,852][model8_pretrain.py][INFO] Epoch:[0/2](616900/4588595) loss:2.860 lr:0.0000100 epoch_Time:25239.0min: [2024-01-05 11:00:01,852][model8_pretrain.py][INFO] Epoch:[0/2](616900/4588595) loss:2.462 lr:0.0000100 epoch_Time:25239.0min: [2024-01-05 11:00:01,852][model8_pretrain.py][INFO] Epoch:[0/2](616900/4588595) loss:3.054 lr:0.0000100 epoch_Time:25239.0min: [2024-01-05 11:00:01,852][model8_pretrain.py][INFO] Epoch:[0/2](616900/4588595) loss:3.470 lr:0.0000100 epoch_Time:25239.0min: [2024-01-05 11:00:01,853][model8_pretrain.py][INFO] Epoch:[0/2](616900/4588595) loss:2.534 lr:0.0000100 epoch_Time:25239.0min: [2024-01-05 11:00:01,853][model8_pretrain.py][INFO] Epoch:[0/2](616900/4588595) loss:2.606 lr:0.0000100 epoch_Time:25239.0min: [2024-01-05 11:00:38,807][model8_pretrain.py][INFO] 
Epoch:[0/2](617000/4588595) loss:3.045 lr:0.0000100 epoch_Time:25239.0min: [2024-01-05 11:00:38,807][model8_pretrain.py][INFO] Epoch:[0/2](617000/4588595) loss:3.102 lr:0.0000100 epoch_Time:25239.0min: [2024-01-05 11:00:38,807][model8_pretrain.py][INFO] Epoch:[0/2](617000/4588595) loss:2.733 lr:0.0000100 epoch_Time:25239.0min: [2024-01-05 11:00:38,807][model8_pretrain.py][INFO] Epoch:[0/2](617000/4588595) loss:3.109 lr:0.0000100 epoch_Time:25239.0min: [2024-01-05 11:00:38,807][model8_pretrain.py][INFO] Epoch:[0/2](617000/4588595) loss:2.993 lr:0.0000100 epoch_Time:25239.0min: [2024-01-05 11:00:38,807][model8_pretrain.py][INFO] Epoch:[0/2](617000/4588595) loss:2.341 lr:0.0000100 epoch_Time:25239.0min: [2024-01-05 11:00:38,807][model8_pretrain.py][INFO] Epoch:[0/2](617000/4588595) loss:2.833 lr:0.0000100 epoch_Time:25239.0min: [2024-01-05 11:00:38,808][model8_pretrain.py][INFO] Epoch:[0/2](617000/4588595) loss:2.665 lr:0.0000100 epoch_Time:25239.0min: [2024-01-05 11:01:27,940][model8_pretrain.py][INFO] Epoch:[0/2](617100/4588595) loss:2.375 lr:0.0000100 epoch_Time:25239.0min: [2024-01-05 11:01:27,940][model8_pretrain.py][INFO] Epoch:[0/2](617100/4588595) loss:2.503 lr:0.0000100 epoch_Time:25239.0min: [2024-01-05 11:01:27,940][model8_pretrain.py][INFO] Epoch:[0/2](617100/4588595) loss:2.960 lr:0.0000100 epoch_Time:25239.0min: [2024-01-05 11:01:27,940][model8_pretrain.py][INFO] Epoch:[0/2](617100/4588595) loss:2.600 lr:0.0000100 epoch_Time:25239.0min: [2024-01-05 11:01:27,940][model8_pretrain.py][INFO] Epoch:[0/2](617100/4588595) loss:2.818 lr:0.0000100 epoch_Time:25239.0min: [2024-01-05 11:01:27,940][model8_pretrain.py][INFO] Epoch:[0/2](617100/4588595) loss:2.810 lr:0.0000100 epoch_Time:25239.0min: [2024-01-05 11:01:27,940][model8_pretrain.py][INFO] Epoch:[0/2](617100/4588595) loss:3.033 lr:0.0000100 epoch_Time:25239.0min: [2024-01-05 11:01:27,940][model8_pretrain.py][INFO] Epoch:[0/2](617100/4588595) loss:2.703 lr:0.0000100 epoch_Time:25239.0min: [2024-01-05 
11:02:04,909][model8_pretrain.py][INFO] Epoch:[0/2](617200/4588595) loss:2.373 lr:0.0000100 epoch_Time:25238.0min: [2024-01-05 11:02:04,909][model8_pretrain.py][INFO] Epoch:[0/2](617200/4588595) loss:2.715 lr:0.0000100 epoch_Time:25238.0min: [2024-01-05 11:02:04,909][model8_pretrain.py][INFO] Epoch:[0/2](617200/4588595) loss:2.788 lr:0.0000100 epoch_Time:25238.0min: [2024-01-05 11:02:04,909][model8_pretrain.py][INFO] Epoch:[0/2](617200/4588595) loss:2.637 lr:0.0000100 epoch_Time:25238.0min: [2024-01-05 11:02:04,909][model8_pretrain.py][INFO] Epoch:[0/2](617200/4588595) loss:2.744 lr:0.0000100 epoch_Time:25238.0min: [2024-01-05 11:02:04,909][model8_pretrain.py][INFO] Epoch:[0/2](617200/4588595) loss:2.812 lr:0.0000100 epoch_Time:25238.0min: [2024-01-05 11:02:04,909][model8_pretrain.py][INFO] Epoch:[0/2](617200/4588595) loss:3.512 lr:0.0000100 epoch_Time:25238.0min: [2024-01-05 11:02:04,909][model8_pretrain.py][INFO] Epoch:[0/2](617200/4588595) loss:2.870 lr:0.0000100 epoch_Time:25238.0min: [2024-01-05 11:02:41,832][model8_pretrain.py][INFO] Epoch:[0/2](617300/4588595) loss:2.625 lr:0.0000100 epoch_Time:25238.0min: [2024-01-05 11:02:41,832][model8_pretrain.py][INFO] Epoch:[0/2](617300/4588595) loss:2.357 lr:0.0000100 epoch_Time:25238.0min: [2024-01-05 11:02:41,832][model8_pretrain.py][INFO] Epoch:[0/2](617300/4588595) loss:2.633 lr:0.0000100 epoch_Time:25238.0min: [2024-01-05 11:02:41,832][model8_pretrain.py][INFO] Epoch:[0/2](617300/4588595) loss:2.890 lr:0.0000100 epoch_Time:25238.0min: [2024-01-05 11:02:41,832][model8_pretrain.py][INFO] Epoch:[0/2](617300/4588595) loss:2.824 lr:0.0000100 epoch_Time:25238.0min: [2024-01-05 11:02:41,832][model8_pretrain.py][INFO] Epoch:[0/2](617300/4588595) loss:2.470 lr:0.0000100 epoch_Time:25238.0min: [2024-01-05 11:02:41,832][model8_pretrain.py][INFO] Epoch:[0/2](617300/4588595) loss:2.577 lr:0.0000100 epoch_Time:25238.0min: [2024-01-05 11:02:41,832][model8_pretrain.py][INFO] Epoch:[0/2](617300/4588595) loss:2.784 lr:0.0000100 
epoch_Time:25238.0min: [2024-01-05 11:03:18,784][model8_pretrain.py][INFO] Epoch:[0/2](617400/4588595) loss:2.810 lr:0.0000100 epoch_Time:25237.0min: [2024-01-05 11:03:18,784][model8_pretrain.py][INFO] Epoch:[0/2](617400/4588595) loss:2.211 lr:0.0000100 epoch_Time:25237.0min: [2024-01-05 11:03:18,784][model8_pretrain.py][INFO] Epoch:[0/2](617400/4588595) loss:2.758 lr:0.0000100 epoch_Time:25237.0min: [2024-01-05 11:03:18,784][model8_pretrain.py][INFO] Epoch:[0/2](617400/4588595) loss:3.047 lr:0.0000100 epoch_Time:25237.0min: [2024-01-05 11:03:18,784][model8_pretrain.py][INFO] Epoch:[0/2](617400/4588595) loss:2.400 lr:0.0000100 epoch_Time:25237.0min: [2024-01-05 11:03:18,784][model8_pretrain.py][INFO] Epoch:[0/2](617400/4588595) loss:2.934 lr:0.0000100 epoch_Time:25237.0min: [2024-01-05 11:03:18,784][model8_pretrain.py][INFO] Epoch:[0/2](617400/4588595) loss:2.702 lr:0.0000100 epoch_Time:25237.0min: [2024-01-05 11:03:18,784][model8_pretrain.py][INFO] Epoch:[0/2](617400/4588595) loss:2.449 lr:0.0000100 epoch_Time:25237.0min: [2024-01-05 11:03:55,719][model8_pretrain.py][INFO] Epoch:[0/2](617500/4588595) loss:2.903 lr:0.0000100 epoch_Time:25235.0min: [2024-01-05 11:03:55,719][model8_pretrain.py][INFO] Epoch:[0/2](617500/4588595) loss:2.689 lr:0.0000100 epoch_Time:25235.0min: [2024-01-05 11:03:55,719][model8_pretrain.py][INFO] Epoch:[0/2](617500/4588595) loss:3.108 lr:0.0000100 epoch_Time:25235.0min: [2024-01-05 11:03:55,719][model8_pretrain.py][INFO] Epoch:[0/2](617500/4588595) loss:2.553 lr:0.0000100 epoch_Time:25235.0min: [2024-01-05 11:03:55,719][model8_pretrain.py][INFO] Epoch:[0/2](617500/4588595) loss:2.328 lr:0.0000100 epoch_Time:25235.0min: [2024-01-05 11:03:55,719][model8_pretrain.py][INFO] Epoch:[0/2](617500/4588595) loss:2.760 lr:0.0000100 epoch_Time:25235.0min: [2024-01-05 11:03:55,719][model8_pretrain.py][INFO] Epoch:[0/2](617500/4588595) loss:2.330 lr:0.0000100 epoch_Time:25235.0min: [2024-01-05 11:03:55,720][model8_pretrain.py][INFO] 
Epoch:[0/2](617500/4588595) loss:2.970 lr:0.0000100 epoch_Time:25235.0min: [2024-01-05 11:04:32,659][model8_pretrain.py][INFO] Epoch:[0/2](617600/4588595) loss:2.562 lr:0.0000100 epoch_Time:25235.0min: [2024-01-05 11:04:32,659][model8_pretrain.py][INFO] Epoch:[0/2](617600/4588595) loss:2.206 lr:0.0000100 epoch_Time:25235.0min: [2024-01-05 11:04:32,659][model8_pretrain.py][INFO] Epoch:[0/2](617600/4588595) loss:3.213 lr:0.0000100 epoch_Time:25235.0min: [2024-01-05 11:04:32,659][model8_pretrain.py][INFO] Epoch:[0/2](617600/4588595) loss:3.151 lr:0.0000100 epoch_Time:25235.0min: [2024-01-05 11:04:32,659][model8_pretrain.py][INFO] Epoch:[0/2](617600/4588595) loss:3.490 lr:0.0000100 epoch_Time:25235.0min: [2024-01-05 11:04:32,659][model8_pretrain.py][INFO] Epoch:[0/2](617600/4588595) loss:3.465 lr:0.0000100 epoch_Time:25235.0min: [2024-01-05 11:04:32,659][model8_pretrain.py][INFO] Epoch:[0/2](617600/4588595) loss:2.763 lr:0.0000100 epoch_Time:25235.0min: [2024-01-05 11:04:32,659][model8_pretrain.py][INFO] Epoch:[0/2](617600/4588595) loss:3.047 lr:0.0000100 epoch_Time:25235.0min: [2024-01-05 11:05:09,608][model8_pretrain.py][INFO] Epoch:[0/2](617700/4588595) loss:3.029 lr:0.0000100 epoch_Time:25234.0min: [2024-01-05 11:05:09,608][model8_pretrain.py][INFO] Epoch:[0/2](617700/4588595) loss:2.899 lr:0.0000100 epoch_Time:25234.0min: [2024-01-05 11:05:09,608][model8_pretrain.py][INFO] Epoch:[0/2](617700/4588595) loss:2.556 lr:0.0000100 epoch_Time:25234.0min: [2024-01-05 11:05:09,608][model8_pretrain.py][INFO] Epoch:[0/2](617700/4588595) loss:2.580 lr:0.0000100 epoch_Time:25234.0min: [2024-01-05 11:05:09,608][model8_pretrain.py][INFO] Epoch:[0/2](617700/4588595) loss:2.741 lr:0.0000100 epoch_Time:25234.0min: [2024-01-05 11:05:09,608][model8_pretrain.py][INFO] Epoch:[0/2](617700/4588595) loss:2.425 lr:0.0000100 epoch_Time:25234.0min: [2024-01-05 11:05:09,608][model8_pretrain.py][INFO] Epoch:[0/2](617700/4588595) loss:2.350 lr:0.0000100 epoch_Time:25234.0min: [2024-01-05 
11:05:09,608][model8_pretrain.py][INFO] Epoch:[0/2](617700/4588595) loss:2.549 lr:0.0000100 epoch_Time:25234.0min: [2024-01-05 11:05:46,561][model8_pretrain.py][INFO] Epoch:[0/2](617800/4588595) loss:2.432 lr:0.0000100 epoch_Time:25234.0min: [2024-01-05 11:05:46,561][model8_pretrain.py][INFO] Epoch:[0/2](617800/4588595) loss:3.076 lr:0.0000100 epoch_Time:25234.0min: [2024-01-05 11:05:46,561][model8_pretrain.py][INFO] Epoch:[0/2](617800/4588595) loss:2.848 lr:0.0000100 epoch_Time:25234.0min: [2024-01-05 11:05:46,562][model8_pretrain.py][INFO] Epoch:[0/2](617800/4588595) loss:2.638 lr:0.0000100 epoch_Time:25234.0min: [2024-01-05 11:05:46,562][model8_pretrain.py][INFO] Epoch:[0/2](617800/4588595) loss:2.933 lr:0.0000100 epoch_Time:25234.0min: [2024-01-05 11:05:46,561][model8_pretrain.py][INFO] Epoch:[0/2](617800/4588595) loss:3.397 lr:0.0000100 epoch_Time:25234.0min: [2024-01-05 11:05:46,562][model8_pretrain.py][INFO] Epoch:[0/2](617800/4588595) loss:3.195 lr:0.0000100 epoch_Time:25234.0min: [2024-01-05 11:05:46,562][model8_pretrain.py][INFO] Epoch:[0/2](617800/4588595) loss:3.116 lr:0.0000100 epoch_Time:25234.0min: [2024-01-05 11:06:35,588][model8_pretrain.py][INFO] Epoch:[0/2](617900/4588595) loss:3.456 lr:0.0000100 epoch_Time:25234.0min: [2024-01-05 11:06:35,588][model8_pretrain.py][INFO] Epoch:[0/2](617900/4588595) loss:3.261 lr:0.0000100 epoch_Time:25234.0min: [2024-01-05 11:06:35,588][model8_pretrain.py][INFO] Epoch:[0/2](617900/4588595) loss:3.012 lr:0.0000100 epoch_Time:25234.0min: [2024-01-05 11:06:35,588][model8_pretrain.py][INFO] Epoch:[0/2](617900/4588595) loss:2.837 lr:0.0000100 epoch_Time:25234.0min: [2024-01-05 11:06:35,588][model8_pretrain.py][INFO] Epoch:[0/2](617900/4588595) loss:2.433 lr:0.0000100 epoch_Time:25234.0min: [2024-01-05 11:06:35,588][model8_pretrain.py][INFO] Epoch:[0/2](617900/4588595) loss:2.796 lr:0.0000100 epoch_Time:25234.0min: [2024-01-05 11:06:35,588][model8_pretrain.py][INFO] Epoch:[0/2](617900/4588595) loss:2.820 lr:0.0000100 
epoch_Time:25234.0min: [2024-01-05 11:06:35,588][model8_pretrain.py][INFO] Epoch:[0/2](617900/4588595) loss:2.456 lr:0.0000100 epoch_Time:25234.0min: [2024-01-05 11:07:12,529][model8_pretrain.py][INFO] Epoch:[0/2](618000/4588595) loss:2.745 lr:0.0000100 epoch_Time:25233.0min: [2024-01-05 11:07:12,529][model8_pretrain.py][INFO] Epoch:[0/2](618000/4588595) loss:3.235 lr:0.0000100 epoch_Time:25233.0min: [2024-01-05 11:07:12,529][model8_pretrain.py][INFO] Epoch:[0/2](618000/4588595) loss:2.575 lr:0.0000100 epoch_Time:25233.0min: [2024-01-05 11:07:12,529][model8_pretrain.py][INFO] Epoch:[0/2](618000/4588595) loss:3.029 lr:0.0000100 epoch_Time:25233.0min: [2024-01-05 11:07:12,529][model8_pretrain.py][INFO] Epoch:[0/2](618000/4588595) loss:2.459 lr:0.0000100 epoch_Time:25233.0min: [2024-01-05 11:07:12,529][model8_pretrain.py][INFO] Epoch:[0/2](618000/4588595) loss:2.662 lr:0.0000100 epoch_Time:25233.0min: [2024-01-05 11:07:12,529][model8_pretrain.py][INFO] Epoch:[0/2](618000/4588595) loss:3.008 lr:0.0000100 epoch_Time:25233.0min: [2024-01-05 11:07:12,529][model8_pretrain.py][INFO] Epoch:[0/2](618000/4588595) loss:3.366 lr:0.0000100 epoch_Time:25233.0min: [2024-01-05 11:07:49,470][model8_pretrain.py][INFO] Epoch:[0/2](618100/4588595) loss:2.886 lr:0.0000100 epoch_Time:25232.0min: [2024-01-05 11:07:49,470][model8_pretrain.py][INFO] Epoch:[0/2](618100/4588595) loss:2.841 lr:0.0000100 epoch_Time:25232.0min: [2024-01-05 11:07:49,470][model8_pretrain.py][INFO] Epoch:[0/2](618100/4588595) loss:2.754 lr:0.0000100 epoch_Time:25232.0min: [2024-01-05 11:07:49,470][model8_pretrain.py][INFO] Epoch:[0/2](618100/4588595) loss:2.842 lr:0.0000100 epoch_Time:25232.0min: [2024-01-05 11:07:49,470][model8_pretrain.py][INFO] Epoch:[0/2](618100/4588595) loss:3.063 lr:0.0000100 epoch_Time:25232.0min: [2024-01-05 11:07:49,470][model8_pretrain.py][INFO] Epoch:[0/2](618100/4588595) loss:2.197 lr:0.0000100 epoch_Time:25232.0min: [2024-01-05 11:07:49,470][model8_pretrain.py][INFO] 
Epoch:[0/2](618100/4588595) loss:3.060 lr:0.0000100 epoch_Time:25232.0min: [2024-01-05 11:07:49,470][model8_pretrain.py][INFO] Epoch:[0/2](618100/4588595) loss:2.605 lr:0.0000100 epoch_Time:25232.0min: [2024-01-05 11:08:26,417][model8_pretrain.py][INFO] Epoch:[0/2](618200/4588595) loss:3.147 lr:0.0000100 epoch_Time:25232.0min: [2024-01-05 11:08:26,417][model8_pretrain.py][INFO] Epoch:[0/2](618200/4588595) loss:3.094 lr:0.0000100 epoch_Time:25232.0min: [2024-01-05 11:08:26,417][model8_pretrain.py][INFO] Epoch:[0/2](618200/4588595) loss:2.778 lr:0.0000100 epoch_Time:25232.0min: [2024-01-05 11:08:26,417][model8_pretrain.py][INFO] Epoch:[0/2](618200/4588595) loss:2.972 lr:0.0000100 epoch_Time:25232.0min: [2024-01-05 11:08:26,417][model8_pretrain.py][INFO] Epoch:[0/2](618200/4588595) loss:3.056 lr:0.0000100 epoch_Time:25232.0min: [2024-01-05 11:08:26,417][model8_pretrain.py][INFO] Epoch:[0/2](618200/4588595) loss:2.764 lr:0.0000100 epoch_Time:25232.0min: [2024-01-05 11:08:26,417][model8_pretrain.py][INFO] Epoch:[0/2](618200/4588595) loss:2.819 lr:0.0000100 epoch_Time:25232.0min: [2024-01-05 11:08:26,417][model8_pretrain.py][INFO] Epoch:[0/2](618200/4588595) loss:3.040 lr:0.0000100 epoch_Time:25232.0min: [2024-01-05 11:09:03,366][model8_pretrain.py][INFO] Epoch:[0/2](618300/4588595) loss:2.757 lr:0.0000100 epoch_Time:25231.0min: [2024-01-05 11:09:03,366][model8_pretrain.py][INFO] Epoch:[0/2](618300/4588595) loss:2.238 lr:0.0000100 epoch_Time:25231.0min: [2024-01-05 11:09:03,366][model8_pretrain.py][INFO] Epoch:[0/2](618300/4588595) loss:2.317 lr:0.0000100 epoch_Time:25231.0min: [2024-01-05 11:09:03,366][model8_pretrain.py][INFO] Epoch:[0/2](618300/4588595) loss:2.920 lr:0.0000100 epoch_Time:25231.0min: [2024-01-05 11:09:03,366][model8_pretrain.py][INFO] Epoch:[0/2](618300/4588595) loss:2.821 lr:0.0000100 epoch_Time:25231.0min: [2024-01-05 11:09:03,366][model8_pretrain.py][INFO] Epoch:[0/2](618300/4588595) loss:2.691 lr:0.0000100 epoch_Time:25231.0min: [2024-01-05 
11:09:03,366][model8_pretrain.py][INFO] Epoch:[0/2](618300/4588595) loss:3.256 lr:0.0000100 epoch_Time:25231.0min: [2024-01-05 11:09:03,366][model8_pretrain.py][INFO] Epoch:[0/2](618300/4588595) loss:3.425 lr:0.0000100 epoch_Time:25231.0min: [2024-01-05 11:09:40,306][model8_pretrain.py][INFO] Epoch:[0/2](618400/4588595) loss:3.058 lr:0.0000100 epoch_Time:25231.0min: [2024-01-05 11:09:40,306][model8_pretrain.py][INFO] Epoch:[0/2](618400/4588595) loss:2.719 lr:0.0000100 epoch_Time:25231.0min: [2024-01-05 11:09:40,307][model8_pretrain.py][INFO] Epoch:[0/2](618400/4588595) loss:2.711 lr:0.0000100 epoch_Time:25231.0min: [2024-01-05 11:09:40,306][model8_pretrain.py][INFO] Epoch:[0/2](618400/4588595) loss:2.953 lr:0.0000100 epoch_Time:25231.0min: [2024-01-05 11:09:40,307][model8_pretrain.py][INFO] Epoch:[0/2](618400/4588595) loss:2.346 lr:0.0000100 epoch_Time:25231.0min: [2024-01-05 11:09:40,307][model8_pretrain.py][INFO] Epoch:[0/2](618400/4588595) loss:2.867 lr:0.0000100 epoch_Time:25231.0min: [2024-01-05 11:09:40,307][model8_pretrain.py][INFO] Epoch:[0/2](618400/4588595) loss:3.136 lr:0.0000100 epoch_Time:25231.0min: [2024-01-05 11:09:40,307][model8_pretrain.py][INFO] Epoch:[0/2](618400/4588595) loss:2.854 lr:0.0000100 epoch_Time:25231.0min: [2024-01-05 11:10:17,252][model8_pretrain.py][INFO] Epoch:[0/2](618500/4588595) loss:2.598 lr:0.0000100 epoch_Time:25229.0min: [2024-01-05 11:10:17,252][model8_pretrain.py][INFO] Epoch:[0/2](618500/4588595) loss:3.189 lr:0.0000100 epoch_Time:25229.0min: [2024-01-05 11:10:17,252][model8_pretrain.py][INFO] Epoch:[0/2](618500/4588595) loss:3.377 lr:0.0000100 epoch_Time:25229.0min: [2024-01-05 11:10:17,252][model8_pretrain.py][INFO] Epoch:[0/2](618500/4588595) loss:2.219 lr:0.0000100 epoch_Time:25229.0min: [2024-01-05 11:10:17,252][model8_pretrain.py][INFO] Epoch:[0/2](618500/4588595) loss:3.038 lr:0.0000100 epoch_Time:25229.0min: [2024-01-05 11:10:17,252][model8_pretrain.py][INFO] Epoch:[0/2](618500/4588595) loss:3.044 lr:0.0000100 
epoch_Time:25229.0min: [2024-01-05 11:10:17,252][model8_pretrain.py][INFO] Epoch:[0/2](618500/4588595) loss:3.153 lr:0.0000100 epoch_Time:25229.0min: [2024-01-05 11:10:17,252][model8_pretrain.py][INFO] Epoch:[0/2](618500/4588595) loss:2.634 lr:0.0000100 epoch_Time:25229.0min: [2024-01-05 11:10:54,195][model8_pretrain.py][INFO] Epoch:[0/2](618600/4588595) loss:3.009 lr:0.0000100 epoch_Time:25228.0min: [2024-01-05 11:10:54,195][model8_pretrain.py][INFO] Epoch:[0/2](618600/4588595) loss:2.652 lr:0.0000100 epoch_Time:25228.0min: [2024-01-05 11:10:54,195][model8_pretrain.py][INFO] Epoch:[0/2](618600/4588595) loss:2.867 lr:0.0000100 epoch_Time:25228.0min: [2024-01-05 11:10:54,195][model8_pretrain.py][INFO] Epoch:[0/2](618600/4588595) loss:2.624 lr:0.0000100 epoch_Time:25228.0min: [2024-01-05 11:10:54,195][model8_pretrain.py][INFO] Epoch:[0/2](618600/4588595) loss:2.667 lr:0.0000100 epoch_Time:25228.0min: [2024-01-05 11:10:54,195][model8_pretrain.py][INFO] Epoch:[0/2](618600/4588595) loss:2.373 lr:0.0000100 epoch_Time:25228.0min: [2024-01-05 11:10:54,195][model8_pretrain.py][INFO] Epoch:[0/2](618600/4588595) loss:3.058 lr:0.0000100 epoch_Time:25228.0min: [2024-01-05 11:10:54,195][model8_pretrain.py][INFO] Epoch:[0/2](618600/4588595) loss:2.849 lr:0.0000100 epoch_Time:25228.0min: [2024-01-05 11:11:43,112][model8_pretrain.py][INFO] Epoch:[0/2](618700/4588595) loss:2.814 lr:0.0000100 epoch_Time:25230.0min: [2024-01-05 11:11:43,112][model8_pretrain.py][INFO] Epoch:[0/2](618700/4588595) loss:2.748 lr:0.0000100 epoch_Time:25230.0min: [2024-01-05 11:11:43,112][model8_pretrain.py][INFO] Epoch:[0/2](618700/4588595) loss:3.059 lr:0.0000100 epoch_Time:25230.0min: [2024-01-05 11:11:43,112][model8_pretrain.py][INFO] Epoch:[0/2](618700/4588595) loss:3.082 lr:0.0000100 epoch_Time:25230.0min: [2024-01-05 11:11:43,112][model8_pretrain.py][INFO] Epoch:[0/2](618700/4588595) loss:2.156 lr:0.0000100 epoch_Time:25230.0min: [2024-01-05 11:11:43,112][model8_pretrain.py][INFO] 
Epoch:[0/2](618700/4588595) loss:2.340 lr:0.0000100 epoch_Time:25230.0min: [2024-01-05 11:11:43,112][model8_pretrain.py][INFO] Epoch:[0/2](618700/4588595) loss:2.617 lr:0.0000100 epoch_Time:25230.0min: [2024-01-05 11:11:43,112][model8_pretrain.py][INFO] Epoch:[0/2](618700/4588595) loss:3.033 lr:0.0000100 epoch_Time:25230.0min: [2024-01-05 11:12:20,047][model8_pretrain.py][INFO] Epoch:[0/2](618800/4588595) loss:2.747 lr:0.0000100 epoch_Time:25228.0min: [2024-01-05 11:12:20,047][model8_pretrain.py][INFO] Epoch:[0/2](618800/4588595) loss:3.390 lr:0.0000100 epoch_Time:25228.0min: [2024-01-05 11:12:20,047][model8_pretrain.py][INFO] Epoch:[0/2](618800/4588595) loss:2.307 lr:0.0000100 epoch_Time:25228.0min: [2024-01-05 11:12:20,047][model8_pretrain.py][INFO] Epoch:[0/2](618800/4588595) loss:3.474 lr:0.0000100 epoch_Time:25228.0min: [2024-01-05 11:12:20,047][model8_pretrain.py][INFO] Epoch:[0/2](618800/4588595) loss:3.221 lr:0.0000100 epoch_Time:25228.0min: [2024-01-05 11:12:20,047][model8_pretrain.py][INFO] Epoch:[0/2](618800/4588595) loss:2.773 lr:0.0000100 epoch_Time:25228.0min: [2024-01-05 11:12:20,047][model8_pretrain.py][INFO] Epoch:[0/2](618800/4588595) loss:2.949 lr:0.0000100 epoch_Time:25228.0min: [2024-01-05 11:12:20,047][model8_pretrain.py][INFO] Epoch:[0/2](618800/4588595) loss:3.205 lr:0.0000100 epoch_Time:25228.0min: [2024-01-05 11:12:56,985][model8_pretrain.py][INFO] Epoch:[0/2](618900/4588595) loss:2.673 lr:0.0000100 epoch_Time:25227.0min: [2024-01-05 11:12:56,985][model8_pretrain.py][INFO] Epoch:[0/2](618900/4588595) loss:1.853 lr:0.0000100 epoch_Time:25227.0min: [2024-01-05 11:12:56,985][model8_pretrain.py][INFO] Epoch:[0/2](618900/4588595) loss:2.827 lr:0.0000100 epoch_Time:25227.0min: [2024-01-05 11:12:56,985][model8_pretrain.py][INFO] Epoch:[0/2](618900/4588595) loss:2.218 lr:0.0000100 epoch_Time:25227.0min: [2024-01-05 11:12:56,985][model8_pretrain.py][INFO] Epoch:[0/2](618900/4588595) loss:2.951 lr:0.0000100 epoch_Time:25227.0min: [2024-01-05 
11:12:56,985][model8_pretrain.py][INFO] Epoch:[0/2](618900/4588595) loss:2.988 lr:0.0000100 epoch_Time:25227.0min: [2024-01-05 11:12:56,985][model8_pretrain.py][INFO] Epoch:[0/2](618900/4588595) loss:3.235 lr:0.0000100 epoch_Time:25227.0min: [2024-01-05 11:12:56,985][model8_pretrain.py][INFO] Epoch:[0/2](618900/4588595) loss:2.863 lr:0.0000100 epoch_Time:25227.0min: [2024-01-05 11:13:33,934][model8_pretrain.py][INFO] Epoch:[0/2](619000/4588595) loss:2.893 lr:0.0000100 epoch_Time:25227.0min: [2024-01-05 11:13:33,934][model8_pretrain.py][INFO] Epoch:[0/2](619000/4588595) loss:3.211 lr:0.0000100 epoch_Time:25227.0min: [2024-01-05 11:13:33,934][model8_pretrain.py][INFO] Epoch:[0/2](619000/4588595) loss:2.921 lr:0.0000100 epoch_Time:25227.0min: [2024-01-05 11:13:33,934][model8_pretrain.py][INFO] Epoch:[0/2](619000/4588595) loss:3.108 lr:0.0000100 epoch_Time:25227.0min: [2024-01-05 11:13:33,934][model8_pretrain.py][INFO] Epoch:[0/2](619000/4588595) loss:3.357 lr:0.0000100 epoch_Time:25227.0min: [2024-01-05 11:13:33,934][model8_pretrain.py][INFO] Epoch:[0/2](619000/4588595) loss:2.903 lr:0.0000100 epoch_Time:25227.0min: [2024-01-05 11:13:33,934][model8_pretrain.py][INFO] Epoch:[0/2](619000/4588595) loss:2.363 lr:0.0000100 epoch_Time:25227.0min: [2024-01-05 11:13:33,934][model8_pretrain.py][INFO] Epoch:[0/2](619000/4588595) loss:2.943 lr:0.0000100 epoch_Time:25227.0min: [2024-01-05 11:14:10,878][model8_pretrain.py][INFO] Epoch:[0/2](619100/4588595) loss:2.715 lr:0.0000100 epoch_Time:25226.0min: [2024-01-05 11:14:10,878][model8_pretrain.py][INFO] Epoch:[0/2](619100/4588595) loss:2.925 lr:0.0000100 epoch_Time:25226.0min: [2024-01-05 11:14:10,878][model8_pretrain.py][INFO] Epoch:[0/2](619100/4588595) loss:3.181 lr:0.0000100 epoch_Time:25226.0min: [2024-01-05 11:14:10,878][model8_pretrain.py][INFO] Epoch:[0/2](619100/4588595) loss:2.553 lr:0.0000100 epoch_Time:25226.0min: [2024-01-05 11:14:10,878][model8_pretrain.py][INFO] Epoch:[0/2](619100/4588595) loss:2.756 lr:0.0000100 
epoch_Time:25226.0min: [2024-01-05 11:14:10,878][model8_pretrain.py][INFO] Epoch:[0/2](619100/4588595) loss:2.781 lr:0.0000100 epoch_Time:25226.0min: [2024-01-05 11:14:10,878][model8_pretrain.py][INFO] Epoch:[0/2](619100/4588595) loss:2.591 lr:0.0000100 epoch_Time:25226.0min: [2024-01-05 11:14:10,878][model8_pretrain.py][INFO] Epoch:[0/2](619100/4588595) loss:3.272 lr:0.0000100 epoch_Time:25226.0min: [2024-01-05 11:14:47,819][model8_pretrain.py][INFO] Epoch:[0/2](619200/4588595) loss:2.834 lr:0.0000100 epoch_Time:25225.0min: [2024-01-05 11:14:47,819][model8_pretrain.py][INFO] Epoch:[0/2](619200/4588595) loss:3.233 lr:0.0000100 epoch_Time:25225.0min: [2024-01-05 11:14:47,819][model8_pretrain.py][INFO] Epoch:[0/2](619200/4588595) loss:2.707 lr:0.0000100 epoch_Time:25225.0min: [2024-01-05 11:14:47,819][model8_pretrain.py][INFO] Epoch:[0/2](619200/4588595) loss:2.877 lr:0.0000100 epoch_Time:25225.0min: [2024-01-05 11:14:47,819][model8_pretrain.py][INFO] Epoch:[0/2](619200/4588595) loss:3.247 lr:0.0000100 epoch_Time:25225.0min: [2024-01-05 11:14:47,819][model8_pretrain.py][INFO] Epoch:[0/2](619200/4588595) loss:1.891 lr:0.0000100 epoch_Time:25225.0min: [2024-01-05 11:14:47,819][model8_pretrain.py][INFO] Epoch:[0/2](619200/4588595) loss:3.088 lr:0.0000100 epoch_Time:25225.0min: [2024-01-05 11:14:47,820][model8_pretrain.py][INFO] Epoch:[0/2](619200/4588595) loss:2.488 lr:0.0000100 epoch_Time:25225.0min: [2024-01-05 11:15:24,770][model8_pretrain.py][INFO] Epoch:[0/2](619300/4588595) loss:2.452 lr:0.0000100 epoch_Time:25225.0min: [2024-01-05 11:15:24,770][model8_pretrain.py][INFO] Epoch:[0/2](619300/4588595) loss:2.680 lr:0.0000100 epoch_Time:25225.0min: [2024-01-05 11:15:24,770][model8_pretrain.py][INFO] Epoch:[0/2](619300/4588595) loss:2.615 lr:0.0000100 epoch_Time:25225.0min: [2024-01-05 11:15:24,770][model8_pretrain.py][INFO] Epoch:[0/2](619300/4588595) loss:2.885 lr:0.0000100 epoch_Time:25225.0min: [2024-01-05 11:15:24,770][model8_pretrain.py][INFO] 
Epoch:[0/2](619300/4588595) loss:3.376 lr:0.0000100 epoch_Time:25225.0min: [2024-01-05 11:15:24,770][model8_pretrain.py][INFO] Epoch:[0/2](619300/4588595) loss:3.057 lr:0.0000100 epoch_Time:25225.0min: [2024-01-05 11:15:24,770][model8_pretrain.py][INFO] Epoch:[0/2](619300/4588595) loss:2.660 lr:0.0000100 epoch_Time:25225.0min: [2024-01-05 11:15:24,770][model8_pretrain.py][INFO] Epoch:[0/2](619300/4588595) loss:2.352 lr:0.0000100 epoch_Time:25225.0min: [2024-01-05 11:16:01,721][model8_pretrain.py][INFO] Epoch:[0/2](619400/4588595) loss:3.307 lr:0.0000100 epoch_Time:25224.0min: [2024-01-05 11:16:01,721][model8_pretrain.py][INFO] Epoch:[0/2](619400/4588595) loss:2.774 lr:0.0000100 epoch_Time:25224.0min: [2024-01-05 11:16:01,721][model8_pretrain.py][INFO] Epoch:[0/2](619400/4588595) loss:3.144 lr:0.0000100 epoch_Time:25224.0min: [2024-01-05 11:16:01,721][model8_pretrain.py][INFO] Epoch:[0/2](619400/4588595) loss:2.427 lr:0.0000100 epoch_Time:25224.0min: [2024-01-05 11:16:01,721][model8_pretrain.py][INFO] Epoch:[0/2](619400/4588595) loss:2.813 lr:0.0000100 epoch_Time:25224.0min: [2024-01-05 11:16:01,721][model8_pretrain.py][INFO] Epoch:[0/2](619400/4588595) loss:3.028 lr:0.0000100 epoch_Time:25224.0min: [2024-01-05 11:16:01,721][model8_pretrain.py][INFO] Epoch:[0/2](619400/4588595) loss:2.767 lr:0.0000100 epoch_Time:25224.0min: [2024-01-05 11:16:01,721][model8_pretrain.py][INFO] Epoch:[0/2](619400/4588595) loss:2.803 lr:0.0000100 epoch_Time:25224.0min: [2024-01-05 11:16:50,740][model8_pretrain.py][INFO] Epoch:[0/2](619500/4588595) loss:3.060 lr:0.0000100 epoch_Time:25224.0min: [2024-01-05 11:16:50,740][model8_pretrain.py][INFO] Epoch:[0/2](619500/4588595) loss:2.713 lr:0.0000100 epoch_Time:25224.0min: [2024-01-05 11:16:50,740][model8_pretrain.py][INFO] Epoch:[0/2](619500/4588595) loss:2.702 lr:0.0000100 epoch_Time:25224.0min: [2024-01-05 11:16:50,740][model8_pretrain.py][INFO] Epoch:[0/2](619500/4588595) loss:2.520 lr:0.0000100 epoch_Time:25224.0min: [2024-01-05 
11:16:50,740][model8_pretrain.py][INFO] Epoch:[0/2](619500/4588595) loss:2.501 lr:0.0000100 epoch_Time:25224.0min: [2024-01-05 11:16:50,740][model8_pretrain.py][INFO] Epoch:[0/2](619500/4588595) loss:2.844 lr:0.0000100 epoch_Time:25224.0min: [2024-01-05 11:16:50,740][model8_pretrain.py][INFO] Epoch:[0/2](619500/4588595) loss:2.442 lr:0.0000100 epoch_Time:25224.0min: [2024-01-05 11:16:50,740][model8_pretrain.py][INFO] Epoch:[0/2](619500/4588595) loss:2.922 lr:0.0000100 epoch_Time:25224.0min: [2024-01-05 11:17:27,685][model8_pretrain.py][INFO] Epoch:[0/2](619600/4588595) loss:2.678 lr:0.0000100 epoch_Time:25224.0min: [2024-01-05 11:17:27,685][model8_pretrain.py][INFO] Epoch:[0/2](619600/4588595) loss:1.821 lr:0.0000100 epoch_Time:25224.0min: [2024-01-05 11:17:27,685][model8_pretrain.py][INFO] Epoch:[0/2](619600/4588595) loss:2.932 lr:0.0000100 epoch_Time:25224.0min: [2024-01-05 11:17:27,685][model8_pretrain.py][INFO] Epoch:[0/2](619600/4588595) loss:2.955 lr:0.0000100 epoch_Time:25224.0min: [2024-01-05 11:17:27,685][model8_pretrain.py][INFO] Epoch:[0/2](619600/4588595) loss:2.741 lr:0.0000100 epoch_Time:25224.0min: [2024-01-05 11:17:27,685][model8_pretrain.py][INFO] Epoch:[0/2](619600/4588595) loss:2.964 lr:0.0000100 epoch_Time:25224.0min: [2024-01-05 11:17:27,685][model8_pretrain.py][INFO] Epoch:[0/2](619600/4588595) loss:3.154 lr:0.0000100 epoch_Time:25224.0min: [2024-01-05 11:17:27,685][model8_pretrain.py][INFO] Epoch:[0/2](619600/4588595) loss:3.031 lr:0.0000100 epoch_Time:25224.0min: [2024-01-05 11:18:04,634][model8_pretrain.py][INFO] Epoch:[0/2](619700/4588595) loss:2.544 lr:0.0000100 epoch_Time:25223.0min: [2024-01-05 11:18:04,634][model8_pretrain.py][INFO] Epoch:[0/2](619700/4588595) loss:2.513 lr:0.0000100 epoch_Time:25223.0min: [2024-01-05 11:18:04,634][model8_pretrain.py][INFO] Epoch:[0/2](619700/4588595) loss:2.620 lr:0.0000100 epoch_Time:25223.0min: [2024-01-05 11:18:04,634][model8_pretrain.py][INFO] Epoch:[0/2](619700/4588595) loss:2.702 lr:0.0000100 
epoch_Time:25223.0min: [2024-01-05 11:18:04,634][model8_pretrain.py][INFO] Epoch:[0/2](619700/4588595) loss:2.353 lr:0.0000100 epoch_Time:25223.0min: [2024-01-05 11:18:04,634][model8_pretrain.py][INFO] Epoch:[0/2](619700/4588595) loss:2.968 lr:0.0000100 epoch_Time:25223.0min: [2024-01-05 11:18:04,634][model8_pretrain.py][INFO] Epoch:[0/2](619700/4588595) loss:3.404 lr:0.0000100 epoch_Time:25223.0min: [2024-01-05 11:18:04,634][model8_pretrain.py][INFO] Epoch:[0/2](619700/4588595) loss:2.967 lr:0.0000100 epoch_Time:25223.0min: [2024-01-05 11:18:41,587][model8_pretrain.py][INFO] Epoch:[0/2](619800/4588595) loss:2.573 lr:0.0000100 epoch_Time:25222.0min: [2024-01-05 11:18:41,587][model8_pretrain.py][INFO] Epoch:[0/2](619800/4588595) loss:2.615 lr:0.0000100 epoch_Time:25222.0min: [2024-01-05 11:18:41,587][model8_pretrain.py][INFO] Epoch:[0/2](619800/4588595) loss:2.562 lr:0.0000100 epoch_Time:25222.0min: [2024-01-05 11:18:41,587][model8_pretrain.py][INFO] Epoch:[0/2](619800/4588595) loss:2.642 lr:0.0000100 epoch_Time:25223.0min: [2024-01-05 11:18:41,588][model8_pretrain.py][INFO] Epoch:[0/2](619800/4588595) loss:2.684 lr:0.0000100 epoch_Time:25222.0min: [2024-01-05 11:18:41,588][model8_pretrain.py][INFO] Epoch:[0/2](619800/4588595) loss:2.917 lr:0.0000100 epoch_Time:25223.0min: [2024-01-05 11:18:41,588][model8_pretrain.py][INFO] Epoch:[0/2](619800/4588595) loss:2.771 lr:0.0000100 epoch_Time:25222.0min: [2024-01-05 11:18:41,588][model8_pretrain.py][INFO] Epoch:[0/2](619800/4588595) loss:3.047 lr:0.0000100 epoch_Time:25222.0min: [2024-01-05 11:19:18,546][model8_pretrain.py][INFO] Epoch:[0/2](619900/4588595) loss:2.644 lr:0.0000100 epoch_Time:25221.0min: [2024-01-05 11:19:18,546][model8_pretrain.py][INFO] Epoch:[0/2](619900/4588595) loss:2.825 lr:0.0000100 epoch_Time:25221.0min: [2024-01-05 11:19:18,546][model8_pretrain.py][INFO] Epoch:[0/2](619900/4588595) loss:2.909 lr:0.0000100 epoch_Time:25221.0min: [2024-01-05 11:19:18,546][model8_pretrain.py][INFO] 
Epoch:[0/2](619900/4588595) loss:2.984 lr:0.0000100 epoch_Time:25221.0min: [2024-01-05 11:19:18,546][model8_pretrain.py][INFO] Epoch:[0/2](619900/4588595) loss:3.008 lr:0.0000100 epoch_Time:25221.0min: [2024-01-05 11:19:18,546][model8_pretrain.py][INFO] Epoch:[0/2](619900/4588595) loss:3.094 lr:0.0000100 epoch_Time:25221.0min: [2024-01-05 11:19:18,546][model8_pretrain.py][INFO] Epoch:[0/2](619900/4588595) loss:2.989 lr:0.0000100 epoch_Time:25221.0min: [2024-01-05 11:19:18,546][model8_pretrain.py][INFO] Epoch:[0/2](619900/4588595) loss:2.973 lr:0.0000100 epoch_Time:25221.0min: [2024-01-05 11:19:55,500][model8_pretrain.py][INFO] Epoch:[0/2](620000/4588595) loss:2.655 lr:0.0000100 epoch_Time:25220.0min: [2024-01-05 11:19:55,500][model8_pretrain.py][INFO] Epoch:[0/2](620000/4588595) loss:2.869 lr:0.0000100 epoch_Time:25220.0min: [2024-01-05 11:19:55,500][model8_pretrain.py][INFO] Epoch:[0/2](620000/4588595) loss:2.380 lr:0.0000100 epoch_Time:25220.0min: [2024-01-05 11:19:55,500][model8_pretrain.py][INFO] Epoch:[0/2](620000/4588595) loss:2.874 lr:0.0000100 epoch_Time:25220.0min: [2024-01-05 11:19:55,500][model8_pretrain.py][INFO] Epoch:[0/2](620000/4588595) loss:2.936 lr:0.0000100 epoch_Time:25220.0min: [2024-01-05 11:19:55,500][model8_pretrain.py][INFO] Epoch:[0/2](620000/4588595) loss:2.504 lr:0.0000100 epoch_Time:25220.0min: [2024-01-05 11:19:55,500][model8_pretrain.py][INFO] Epoch:[0/2](620000/4588595) loss:2.259 lr:0.0000100 epoch_Time:25220.0min: [2024-01-05 11:19:55,501][model8_pretrain.py][INFO] Epoch:[0/2](620000/4588595) loss:2.768 lr:0.0000100 epoch_Time:25220.0min: [2024-01-05 11:20:32,432][model8_pretrain.py][INFO] Epoch:[0/2](620100/4588595) loss:3.252 lr:0.0000100 epoch_Time:25220.0min: [2024-01-05 11:20:32,432][model8_pretrain.py][INFO] Epoch:[0/2](620100/4588595) loss:2.789 lr:0.0000100 epoch_Time:25220.0min: [2024-01-05 11:20:32,432][model8_pretrain.py][INFO] Epoch:[0/2](620100/4588595) loss:2.405 lr:0.0000100 epoch_Time:25220.0min: [2024-01-05 
11:20:32,432][model8_pretrain.py][INFO] Epoch:[0/2](620100/4588595) loss:2.414 lr:0.0000100 epoch_Time:25220.0min: [2024-01-05 11:20:32,432][model8_pretrain.py][INFO] Epoch:[0/2](620100/4588595) loss:2.807 lr:0.0000100 epoch_Time:25220.0min: [2024-01-05 11:20:32,432][model8_pretrain.py][INFO] Epoch:[0/2](620100/4588595) loss:2.854 lr:0.0000100 epoch_Time:25220.0min: [2024-01-05 11:20:32,432][model8_pretrain.py][INFO] Epoch:[0/2](620100/4588595) loss:2.754 lr:0.0000100 epoch_Time:25220.0min: [2024-01-05 11:20:32,432][model8_pretrain.py][INFO] Epoch:[0/2](620100/4588595) loss:2.820 lr:0.0000100 epoch_Time:25220.0min: [2024-01-05 11:21:09,388][model8_pretrain.py][INFO] Epoch:[0/2](620200/4588595) loss:2.702 lr:0.0000100 epoch_Time:25219.0min: [2024-01-05 11:21:09,388][model8_pretrain.py][INFO] Epoch:[0/2](620200/4588595) loss:2.685 lr:0.0000100 epoch_Time:25219.0min: [2024-01-05 11:21:09,388][model8_pretrain.py][INFO] Epoch:[0/2](620200/4588595) loss:3.213 lr:0.0000100 epoch_Time:25219.0min: [2024-01-05 11:21:09,388][model8_pretrain.py][INFO] Epoch:[0/2](620200/4588595) loss:2.614 lr:0.0000100 epoch_Time:25219.0min: [2024-01-05 11:21:09,388][model8_pretrain.py][INFO] Epoch:[0/2](620200/4588595) loss:3.102 lr:0.0000100 epoch_Time:25219.0min: [2024-01-05 11:21:09,388][model8_pretrain.py][INFO] Epoch:[0/2](620200/4588595) loss:2.865 lr:0.0000100 epoch_Time:25219.0min: [2024-01-05 11:21:09,388][model8_pretrain.py][INFO] Epoch:[0/2](620200/4588595) loss:2.748 lr:0.0000100 epoch_Time:25219.0min: [2024-01-05 11:21:09,388][model8_pretrain.py][INFO] Epoch:[0/2](620200/4588595) loss:2.544 lr:0.0000100 epoch_Time:25219.0min: [2024-01-05 11:21:58,447][model8_pretrain.py][INFO] Epoch:[0/2](620300/4588595) loss:2.731 lr:0.0000100 epoch_Time:25219.0min: [2024-01-05 11:21:58,447][model8_pretrain.py][INFO] Epoch:[0/2](620300/4588595) loss:3.151 lr:0.0000100 epoch_Time:25219.0min: [2024-01-05 11:21:58,447][model8_pretrain.py][INFO] Epoch:[0/2](620300/4588595) loss:2.657 lr:0.0000100 
epoch_Time:25219.0min: [2024-01-05 11:21:58,447][model8_pretrain.py][INFO] Epoch:[0/2](620300/4588595) loss:2.537 lr:0.0000100 epoch_Time:25219.0min: [2024-01-05 11:21:58,447][model8_pretrain.py][INFO] Epoch:[0/2](620300/4588595) loss:3.029 lr:0.0000100 epoch_Time:25219.0min: [2024-01-05 11:21:58,447][model8_pretrain.py][INFO] Epoch:[0/2](620300/4588595) loss:2.601 lr:0.0000100 epoch_Time:25219.0min: [2024-01-05 11:21:58,447][model8_pretrain.py][INFO] Epoch:[0/2](620300/4588595) loss:2.704 lr:0.0000100 epoch_Time:25219.0min: [2024-01-05 11:21:58,447][model8_pretrain.py][INFO] Epoch:[0/2](620300/4588595) loss:3.469 lr:0.0000100 epoch_Time:25219.0min: [2024-01-05 11:22:35,389][model8_pretrain.py][INFO] Epoch:[0/2](620400/4588595) loss:3.115 lr:0.0000100 epoch_Time:25219.0min: [2024-01-05 11:22:35,389][model8_pretrain.py][INFO] Epoch:[0/2](620400/4588595) loss:3.149 lr:0.0000100 epoch_Time:25219.0min: [2024-01-05 11:22:35,389][model8_pretrain.py][INFO] Epoch:[0/2](620400/4588595) loss:2.630 lr:0.0000100 epoch_Time:25219.0min: [2024-01-05 11:22:35,389][model8_pretrain.py][INFO] Epoch:[0/2](620400/4588595) loss:3.119 lr:0.0000100 epoch_Time:25219.0min: [2024-01-05 11:22:35,389][model8_pretrain.py][INFO] Epoch:[0/2](620400/4588595) loss:3.086 lr:0.0000100 epoch_Time:25219.0min: [2024-01-05 11:22:35,389][model8_pretrain.py][INFO] Epoch:[0/2](620400/4588595) loss:2.921 lr:0.0000100 epoch_Time:25219.0min: [2024-01-05 11:22:35,389][model8_pretrain.py][INFO] Epoch:[0/2](620400/4588595) loss:3.033 lr:0.0000100 epoch_Time:25219.0min: [2024-01-05 11:22:35,389][model8_pretrain.py][INFO] Epoch:[0/2](620400/4588595) loss:3.074 lr:0.0000100 epoch_Time:25219.0min: [2024-01-05 11:23:12,334][model8_pretrain.py][INFO] Epoch:[0/2](620500/4588595) loss:2.475 lr:0.0000100 epoch_Time:25218.0min: [2024-01-05 11:23:12,334][model8_pretrain.py][INFO] Epoch:[0/2](620500/4588595) loss:3.098 lr:0.0000100 epoch_Time:25218.0min: [2024-01-05 11:23:12,334][model8_pretrain.py][INFO] 
Epoch:[0/2](620500/4588595) loss:2.813 lr:0.0000100 epoch_Time:25218.0min: [2024-01-05 11:23:12,334][model8_pretrain.py][INFO] Epoch:[0/2](620500/4588595) loss:3.122 lr:0.0000100 epoch_Time:25218.0min: [2024-01-05 11:23:12,334][model8_pretrain.py][INFO] Epoch:[0/2](620500/4588595) loss:2.895 lr:0.0000100 epoch_Time:25218.0min: [2024-01-05 11:23:12,334][model8_pretrain.py][INFO] Epoch:[0/2](620500/4588595) loss:3.258 lr:0.0000100 epoch_Time:25218.0min: [2024-01-05 11:23:12,334][model8_pretrain.py][INFO] Epoch:[0/2](620500/4588595) loss:3.236 lr:0.0000100 epoch_Time:25218.0min: [2024-01-05 11:23:12,334][model8_pretrain.py][INFO] Epoch:[0/2](620500/4588595) loss:2.280 lr:0.0000100 epoch_Time:25218.0min: [2024-01-05 11:23:49,290][model8_pretrain.py][INFO] Epoch:[0/2](620600/4588595) loss:2.670 lr:0.0000100 epoch_Time:25217.0min: [2024-01-05 11:23:49,290][model8_pretrain.py][INFO] Epoch:[0/2](620600/4588595) loss:2.851 lr:0.0000100 epoch_Time:25217.0min: [2024-01-05 11:23:49,290][model8_pretrain.py][INFO] Epoch:[0/2](620600/4588595) loss:2.790 lr:0.0000100 epoch_Time:25217.0min: [2024-01-05 11:23:49,290][model8_pretrain.py][INFO] Epoch:[0/2](620600/4588595) loss:2.439 lr:0.0000100 epoch_Time:25217.0min: [2024-01-05 11:23:49,290][model8_pretrain.py][INFO] Epoch:[0/2](620600/4588595) loss:2.844 lr:0.0000100 epoch_Time:25217.0min: [2024-01-05 11:23:49,290][model8_pretrain.py][INFO] Epoch:[0/2](620600/4588595) loss:2.728 lr:0.0000100 epoch_Time:25217.0min: [2024-01-05 11:23:49,290][model8_pretrain.py][INFO] Epoch:[0/2](620600/4588595) loss:3.438 lr:0.0000100 epoch_Time:25217.0min: [2024-01-05 11:23:49,291][model8_pretrain.py][INFO] Epoch:[0/2](620600/4588595) loss:2.992 lr:0.0000100 epoch_Time:25217.0min: [2024-01-05 11:24:26,239][model8_pretrain.py][INFO] Epoch:[0/2](620700/4588595) loss:2.963 lr:0.0000100 epoch_Time:25217.0min: [2024-01-05 11:24:26,239][model8_pretrain.py][INFO] Epoch:[0/2](620700/4588595) loss:2.925 lr:0.0000100 epoch_Time:25217.0min: [2024-01-05 
11:24:26,239][model8_pretrain.py][INFO] Epoch:[0/2](620700/4588595) loss:3.017 lr:0.0000100 epoch_Time:25217.0min: [2024-01-05 11:24:26,239][model8_pretrain.py][INFO] Epoch:[0/2](620700/4588595) loss:2.854 lr:0.0000100 epoch_Time:25217.0min: [2024-01-05 11:24:26,239][model8_pretrain.py][INFO] Epoch:[0/2](620700/4588595) loss:3.174 lr:0.0000100 epoch_Time:25217.0min: [2024-01-05 11:24:26,239][model8_pretrain.py][INFO] Epoch:[0/2](620700/4588595) loss:3.332 lr:0.0000100 epoch_Time:25217.0min: [2024-01-05 11:24:26,239][model8_pretrain.py][INFO] Epoch:[0/2](620700/4588595) loss:3.249 lr:0.0000100 epoch_Time:25217.0min: [2024-01-05 11:24:26,239][model8_pretrain.py][INFO] Epoch:[0/2](620700/4588595) loss:2.756 lr:0.0000100 epoch_Time:25217.0min: [2024-01-05 11:25:03,200][model8_pretrain.py][INFO] Epoch:[0/2](620800/4588595) loss:2.573 lr:0.0000100 epoch_Time:25216.0min: [2024-01-05 11:25:03,200][model8_pretrain.py][INFO] Epoch:[0/2](620800/4588595) loss:2.777 lr:0.0000100 epoch_Time:25216.0min: [2024-01-05 11:25:03,200][model8_pretrain.py][INFO] Epoch:[0/2](620800/4588595) loss:2.920 lr:0.0000100 epoch_Time:25216.0min: [2024-01-05 11:25:03,200][model8_pretrain.py][INFO] Epoch:[0/2](620800/4588595) loss:2.948 lr:0.0000100 epoch_Time:25216.0min: [2024-01-05 11:25:03,200][model8_pretrain.py][INFO] Epoch:[0/2](620800/4588595) loss:2.226 lr:0.0000100 epoch_Time:25216.0min: [2024-01-05 11:25:03,200][model8_pretrain.py][INFO] Epoch:[0/2](620800/4588595) loss:3.243 lr:0.0000100 epoch_Time:25216.0min: [2024-01-05 11:25:03,200][model8_pretrain.py][INFO] Epoch:[0/2](620800/4588595) loss:2.981 lr:0.0000100 epoch_Time:25216.0min: [2024-01-05 11:25:03,200][model8_pretrain.py][INFO] Epoch:[0/2](620800/4588595) loss:3.169 lr:0.0000100 epoch_Time:25216.0min: [2024-01-05 11:25:40,155][model8_pretrain.py][INFO] Epoch:[0/2](620900/4588595) loss:2.825 lr:0.0000100 epoch_Time:25215.0min: [2024-01-05 11:25:40,155][model8_pretrain.py][INFO] Epoch:[0/2](620900/4588595) loss:3.009 lr:0.0000100 
epoch_Time:25215.0min: [2024-01-05 11:25:40,155][model8_pretrain.py][INFO] Epoch:[0/2](620900/4588595) loss:2.732 lr:0.0000100 epoch_Time:25215.0min: [2024-01-05 11:25:40,155][model8_pretrain.py][INFO] Epoch:[0/2](620900/4588595) loss:2.526 lr:0.0000100 epoch_Time:25215.0min: [2024-01-05 11:25:40,155][model8_pretrain.py][INFO] Epoch:[0/2](620900/4588595) loss:2.844 lr:0.0000100 epoch_Time:25215.0min: [2024-01-05 11:25:40,155][model8_pretrain.py][INFO] Epoch:[0/2](620900/4588595) loss:3.029 lr:0.0000100 epoch_Time:25215.0min: [2024-01-05 11:25:40,155][model8_pretrain.py][INFO] Epoch:[0/2](620900/4588595) loss:3.156 lr:0.0000100 epoch_Time:25215.0min: [2024-01-05 11:25:40,155][model8_pretrain.py][INFO] Epoch:[0/2](620900/4588595) loss:2.975 lr:0.0000100 epoch_Time:25215.0min: [2024-01-05 11:26:17,110][model8_pretrain.py][INFO] Epoch:[0/2](621000/4588595) loss:2.701 lr:0.0000100 epoch_Time:25214.0min: [2024-01-05 11:26:17,110][model8_pretrain.py][INFO] Epoch:[0/2](621000/4588595) loss:2.769 lr:0.0000100 epoch_Time:25214.0min: [2024-01-05 11:26:17,110][model8_pretrain.py][INFO] Epoch:[0/2](621000/4588595) loss:2.990 lr:0.0000100 epoch_Time:25214.0min: [2024-01-05 11:26:17,110][model8_pretrain.py][INFO] Epoch:[0/2](621000/4588595) loss:3.044 lr:0.0000100 epoch_Time:25214.0min: [2024-01-05 11:26:17,110][model8_pretrain.py][INFO] Epoch:[0/2](621000/4588595) loss:3.232 lr:0.0000100 epoch_Time:25214.0min: [2024-01-05 11:26:17,110][model8_pretrain.py][INFO] Epoch:[0/2](621000/4588595) loss:3.133 lr:0.0000100 epoch_Time:25214.0min: [2024-01-05 11:26:17,110][model8_pretrain.py][INFO] Epoch:[0/2](621000/4588595) loss:2.596 lr:0.0000100 epoch_Time:25214.0min: [2024-01-05 11:26:17,111][model8_pretrain.py][INFO] Epoch:[0/2](621000/4588595) loss:2.499 lr:0.0000100 epoch_Time:25214.0min: [2024-01-05 11:27:06,127][model8_pretrain.py][INFO] Epoch:[0/2](621100/4588595) loss:2.951 lr:0.0000100 epoch_Time:25215.0min: [2024-01-05 11:27:06,127][model8_pretrain.py][INFO] 
Epoch:[0/2](621100/4588595) loss:2.995 lr:0.0000100 epoch_Time:25215.0min: [2024-01-05 11:27:06,128][model8_pretrain.py][INFO] Epoch:[0/2](621100/4588595) loss:2.936 lr:0.0000100 epoch_Time:25215.0min: [2024-01-05 11:27:06,128][model8_pretrain.py][INFO] Epoch:[0/2](621100/4588595) loss:2.875 lr:0.0000100 epoch_Time:25215.0min: [2024-01-05 11:27:06,128][model8_pretrain.py][INFO] Epoch:[0/2](621100/4588595) loss:2.274 lr:0.0000100 epoch_Time:25215.0min: [2024-01-05 11:27:06,128][model8_pretrain.py][INFO] Epoch:[0/2](621100/4588595) loss:2.993 lr:0.0000100 epoch_Time:25215.0min: [2024-01-05 11:27:06,128][model8_pretrain.py][INFO] Epoch:[0/2](621100/4588595) loss:3.269 lr:0.0000100 epoch_Time:25215.0min: [2024-01-05 11:27:06,128][model8_pretrain.py][INFO] Epoch:[0/2](621100/4588595) loss:2.584 lr:0.0000100 epoch_Time:25215.0min: [2024-01-05 11:27:43,034][model8_pretrain.py][INFO] Epoch:[0/2](621200/4588595) loss:2.508 lr:0.0000100 epoch_Time:25214.0min: [2024-01-05 11:27:43,034][model8_pretrain.py][INFO] Epoch:[0/2](621200/4588595) loss:3.016 lr:0.0000100 epoch_Time:25214.0min: [2024-01-05 11:27:43,034][model8_pretrain.py][INFO] Epoch:[0/2](621200/4588595) loss:2.991 lr:0.0000100 epoch_Time:25214.0min: [2024-01-05 11:27:43,034][model8_pretrain.py][INFO] Epoch:[0/2](621200/4588595) loss:2.605 lr:0.0000100 epoch_Time:25214.0min: [2024-01-05 11:27:43,035][model8_pretrain.py][INFO] Epoch:[0/2](621200/4588595) loss:2.733 lr:0.0000100 epoch_Time:25214.0min: [2024-01-05 11:27:43,035][model8_pretrain.py][INFO] Epoch:[0/2](621200/4588595) loss:2.314 lr:0.0000100 epoch_Time:25214.0min: [2024-01-05 11:27:43,035][model8_pretrain.py][INFO] Epoch:[0/2](621200/4588595) loss:3.184 lr:0.0000100 epoch_Time:25214.0min: [2024-01-05 11:27:43,035][model8_pretrain.py][INFO] Epoch:[0/2](621200/4588595) loss:2.306 lr:0.0000100 epoch_Time:25214.0min: [2024-01-05 11:28:19,975][model8_pretrain.py][INFO] Epoch:[0/2](621300/4588595) loss:2.742 lr:0.0000100 epoch_Time:25213.0min: [2024-01-05 
11:28:19,975][model8_pretrain.py][INFO] Epoch:[0/2](621300/4588595) loss:3.240 lr:0.0000100 epoch_Time:25213.0min: [2024-01-05 11:28:19,975][model8_pretrain.py][INFO] Epoch:[0/2](621300/4588595) loss:3.539 lr:0.0000100 epoch_Time:25213.0min: [2024-01-05 11:28:19,975][model8_pretrain.py][INFO] Epoch:[0/2](621300/4588595) loss:2.946 lr:0.0000100 epoch_Time:25213.0min: [2024-01-05 11:28:19,975][model8_pretrain.py][INFO] Epoch:[0/2](621300/4588595) loss:2.673 lr:0.0000100 epoch_Time:25213.0min: [2024-01-05 11:28:19,975][model8_pretrain.py][INFO] Epoch:[0/2](621300/4588595) loss:2.859 lr:0.0000100 epoch_Time:25213.0min: [2024-01-05 11:28:19,975][model8_pretrain.py][INFO] Epoch:[0/2](621300/4588595) loss:2.468 lr:0.0000100 epoch_Time:25213.0min: [2024-01-05 11:28:19,975][model8_pretrain.py][INFO] Epoch:[0/2](621300/4588595) loss:2.489 lr:0.0000100 epoch_Time:25213.0min: [2024-01-05 11:28:56,921][model8_pretrain.py][INFO] Epoch:[0/2](621400/4588595) loss:3.042 lr:0.0000100 epoch_Time:25212.0min: [2024-01-05 11:28:56,921][model8_pretrain.py][INFO] Epoch:[0/2](621400/4588595) loss:2.585 lr:0.0000100 epoch_Time:25212.0min: [2024-01-05 11:28:56,921][model8_pretrain.py][INFO] Epoch:[0/2](621400/4588595) loss:2.209 lr:0.0000100 epoch_Time:25212.0min: [2024-01-05 11:28:56,921][model8_pretrain.py][INFO] Epoch:[0/2](621400/4588595) loss:2.507 lr:0.0000100 epoch_Time:25212.0min: [2024-01-05 11:28:56,921][model8_pretrain.py][INFO] Epoch:[0/2](621400/4588595) loss:2.683 lr:0.0000100 epoch_Time:25212.0min: [2024-01-05 11:28:56,921][model8_pretrain.py][INFO] Epoch:[0/2](621400/4588595) loss:2.764 lr:0.0000100 epoch_Time:25212.0min: [2024-01-05 11:28:56,921][model8_pretrain.py][INFO] Epoch:[0/2](621400/4588595) loss:2.913 lr:0.0000100 epoch_Time:25212.0min: [2024-01-05 11:28:56,921][model8_pretrain.py][INFO] Epoch:[0/2](621400/4588595) loss:2.924 lr:0.0000100 epoch_Time:25212.0min: [2024-01-05 11:29:33,864][model8_pretrain.py][INFO] Epoch:[0/2](621500/4588595) loss:2.641 lr:0.0000100 
epoch_Time:25212.0min: [2024-01-05 11:29:33,864][model8_pretrain.py][INFO] Epoch:[0/2](621500/4588595) loss:2.929 lr:0.0000100 epoch_Time:25212.0min: [2024-01-05 11:29:33,864][model8_pretrain.py][INFO] Epoch:[0/2](621500/4588595) loss:2.691 lr:0.0000100 epoch_Time:25212.0min: [2024-01-05 11:29:33,864][model8_pretrain.py][INFO] Epoch:[0/2](621500/4588595) loss:2.753 lr:0.0000100 epoch_Time:25212.0min: [2024-01-05 11:29:33,864][model8_pretrain.py][INFO] Epoch:[0/2](621500/4588595) loss:3.109 lr:0.0000100 epoch_Time:25212.0min: [2024-01-05 11:29:33,864][model8_pretrain.py][INFO] Epoch:[0/2](621500/4588595) loss:2.549 lr:0.0000100 epoch_Time:25212.0min: [2024-01-05 11:29:33,864][model8_pretrain.py][INFO] Epoch:[0/2](621500/4588595) loss:2.843 lr:0.0000100 epoch_Time:25212.0min: [2024-01-05 11:29:33,864][model8_pretrain.py][INFO] Epoch:[0/2](621500/4588595) loss:2.032 lr:0.0000100 epoch_Time:25212.0min: [2024-01-05 11:30:10,811][model8_pretrain.py][INFO] Epoch:[0/2](621600/4588595) loss:3.249 lr:0.0000100 epoch_Time:25211.0min: [2024-01-05 11:30:10,811][model8_pretrain.py][INFO] Epoch:[0/2](621600/4588595) loss:2.868 lr:0.0000100 epoch_Time:25211.0min: [2024-01-05 11:30:10,811][model8_pretrain.py][INFO] Epoch:[0/2](621600/4588595) loss:3.204 lr:0.0000100 epoch_Time:25211.0min: [2024-01-05 11:30:10,811][model8_pretrain.py][INFO] Epoch:[0/2](621600/4588595) loss:2.626 lr:0.0000100 epoch_Time:25211.0min: [2024-01-05 11:30:10,811][model8_pretrain.py][INFO] Epoch:[0/2](621600/4588595) loss:2.584 lr:0.0000100 epoch_Time:25211.0min: [2024-01-05 11:30:10,811][model8_pretrain.py][INFO] Epoch:[0/2](621600/4588595) loss:2.612 lr:0.0000100 epoch_Time:25211.0min: [2024-01-05 11:30:10,811][model8_pretrain.py][INFO] Epoch:[0/2](621600/4588595) loss:3.110 lr:0.0000100 epoch_Time:25211.0min: [2024-01-05 11:30:10,812][model8_pretrain.py][INFO] Epoch:[0/2](621600/4588595) loss:2.813 lr:0.0000100 epoch_Time:25211.0min: [2024-01-05 11:30:47,758][model8_pretrain.py][INFO] 
Epoch:[0/2](621700/4588595) loss:2.927 lr:0.0000100 epoch_Time:25210.0min: [2024-01-05 11:30:47,758][model8_pretrain.py][INFO] Epoch:[0/2](621700/4588595) loss:2.422 lr:0.0000100 epoch_Time:25210.0min: [2024-01-05 11:30:47,758][model8_pretrain.py][INFO] Epoch:[0/2](621700/4588595) loss:2.949 lr:0.0000100 epoch_Time:25210.0min: [2024-01-05 11:30:47,758][model8_pretrain.py][INFO] Epoch:[0/2](621700/4588595) loss:3.148 lr:0.0000100 epoch_Time:25210.0min: [2024-01-05 11:30:47,758][model8_pretrain.py][INFO] Epoch:[0/2](621700/4588595) loss:2.958 lr:0.0000100 epoch_Time:25210.0min: [2024-01-05 11:30:47,758][model8_pretrain.py][INFO] Epoch:[0/2](621700/4588595) loss:3.095 lr:0.0000100 epoch_Time:25210.0min: [2024-01-05 11:30:47,758][model8_pretrain.py][INFO] Epoch:[0/2](621700/4588595) loss:3.425 lr:0.0000100 epoch_Time:25210.0min: [2024-01-05 11:30:47,758][model8_pretrain.py][INFO] Epoch:[0/2](621700/4588595) loss:2.869 lr:0.0000100 epoch_Time:25210.0min: [2024-01-05 11:31:24,706][model8_pretrain.py][INFO] Epoch:[0/2](621800/4588595) loss:3.008 lr:0.0000100 epoch_Time:25210.0min: [2024-01-05 11:31:24,706][model8_pretrain.py][INFO] Epoch:[0/2](621800/4588595) loss:2.502 lr:0.0000100 epoch_Time:25210.0min: [2024-01-05 11:31:24,707][model8_pretrain.py][INFO] Epoch:[0/2](621800/4588595) loss:3.313 lr:0.0000100 epoch_Time:25210.0min: [2024-01-05 11:31:24,707][model8_pretrain.py][INFO] Epoch:[0/2](621800/4588595) loss:2.828 lr:0.0000100 epoch_Time:25210.0min: [2024-01-05 11:31:24,707][model8_pretrain.py][INFO] Epoch:[0/2](621800/4588595) loss:2.327 lr:0.0000100 epoch_Time:25210.0min: [2024-01-05 11:31:24,707][model8_pretrain.py][INFO] Epoch:[0/2](621800/4588595) loss:3.115 lr:0.0000100 epoch_Time:25210.0min: [2024-01-05 11:31:24,707][model8_pretrain.py][INFO] Epoch:[0/2](621800/4588595) loss:3.156 lr:0.0000100 epoch_Time:25210.0min: [2024-01-05 11:31:24,707][model8_pretrain.py][INFO] Epoch:[0/2](621800/4588595) loss:2.687 lr:0.0000100 epoch_Time:25210.0min: [2024-01-05 
11:32:13,752][model8_pretrain.py][INFO] Epoch:[0/2](621900/4588595) loss:2.354 lr:0.0000100 epoch_Time:25210.0min: [2024-01-05 11:32:13,752][model8_pretrain.py][INFO] Epoch:[0/2](621900/4588595) loss:2.720 lr:0.0000100 epoch_Time:25210.0min: [2024-01-05 11:32:13,752][model8_pretrain.py][INFO] Epoch:[0/2](621900/4588595) loss:2.885 lr:0.0000100 epoch_Time:25210.0min: [2024-01-05 11:32:13,752][model8_pretrain.py][INFO] Epoch:[0/2](621900/4588595) loss:2.658 lr:0.0000100 epoch_Time:25210.0min: [2024-01-05 11:32:13,752][model8_pretrain.py][INFO] Epoch:[0/2](621900/4588595) loss:3.041 lr:0.0000100 epoch_Time:25210.0min: [2024-01-05 11:32:13,752][model8_pretrain.py][INFO] Epoch:[0/2](621900/4588595) loss:2.925 lr:0.0000100 epoch_Time:25210.0min: [2024-01-05 11:32:13,752][model8_pretrain.py][INFO] Epoch:[0/2](621900/4588595) loss:2.676 lr:0.0000100 epoch_Time:25210.0min: [2024-01-05 11:32:13,752][model8_pretrain.py][INFO] Epoch:[0/2](621900/4588595) loss:2.608 lr:0.0000100 epoch_Time:25210.0min: [2024-01-05 11:32:50,699][model8_pretrain.py][INFO] Epoch:[0/2](622000/4588595) loss:3.059 lr:0.0000100 epoch_Time:25209.0min: [2024-01-05 11:32:50,699][model8_pretrain.py][INFO] Epoch:[0/2](622000/4588595) loss:2.456 lr:0.0000100 epoch_Time:25209.0min: [2024-01-05 11:32:50,699][model8_pretrain.py][INFO] Epoch:[0/2](622000/4588595) loss:3.089 lr:0.0000100 epoch_Time:25209.0min: [2024-01-05 11:32:50,699][model8_pretrain.py][INFO] Epoch:[0/2](622000/4588595) loss:2.459 lr:0.0000100 epoch_Time:25209.0min: [2024-01-05 11:32:50,699][model8_pretrain.py][INFO] Epoch:[0/2](622000/4588595) loss:3.102 lr:0.0000100 epoch_Time:25209.0min: [2024-01-05 11:32:50,699][model8_pretrain.py][INFO] Epoch:[0/2](622000/4588595) loss:2.891 lr:0.0000100 epoch_Time:25209.0min: [2024-01-05 11:32:50,699][model8_pretrain.py][INFO] Epoch:[0/2](622000/4588595) loss:2.453 lr:0.0000100 epoch_Time:25209.0min: [2024-01-05 11:32:50,699][model8_pretrain.py][INFO] Epoch:[0/2](622000/4588595) loss:3.098 lr:0.0000100 
epoch_Time:25209.0min: [2024-01-05 11:33:27,647][model8_pretrain.py][INFO] Epoch:[0/2](622100/4588595) loss:2.762 lr:0.0000100 epoch_Time:25209.0min: [2024-01-05 11:33:27,647][model8_pretrain.py][INFO] Epoch:[0/2](622100/4588595) loss:2.978 lr:0.0000100 epoch_Time:25209.0min: [2024-01-05 11:33:27,647][model8_pretrain.py][INFO] Epoch:[0/2](622100/4588595) loss:2.713 lr:0.0000100 epoch_Time:25209.0min: [2024-01-05 11:33:27,647][model8_pretrain.py][INFO] Epoch:[0/2](622100/4588595) loss:2.887 lr:0.0000100 epoch_Time:25209.0min: [2024-01-05 11:33:27,647][model8_pretrain.py][INFO] Epoch:[0/2](622100/4588595) loss:2.145 lr:0.0000100 epoch_Time:25209.0min: [2024-01-05 11:33:27,647][model8_pretrain.py][INFO] Epoch:[0/2](622100/4588595) loss:2.648 lr:0.0000100 epoch_Time:25209.0min: [2024-01-05 11:33:27,647][model8_pretrain.py][INFO] Epoch:[0/2](622100/4588595) loss:2.262 lr:0.0000100 epoch_Time:25209.0min: [2024-01-05 11:33:27,647][model8_pretrain.py][INFO] Epoch:[0/2](622100/4588595) loss:2.518 lr:0.0000100 epoch_Time:25209.0min: [2024-01-05 11:34:04,591][model8_pretrain.py][INFO] Epoch:[0/2](622200/4588595) loss:2.947 lr:0.0000100 epoch_Time:25207.0min: [2024-01-05 11:34:04,591][model8_pretrain.py][INFO] Epoch:[0/2](622200/4588595) loss:2.747 lr:0.0000100 epoch_Time:25207.0min: [2024-01-05 11:34:04,591][model8_pretrain.py][INFO] Epoch:[0/2](622200/4588595) loss:2.971 lr:0.0000100 epoch_Time:25207.0min: [2024-01-05 11:34:04,591][model8_pretrain.py][INFO] Epoch:[0/2](622200/4588595) loss:2.840 lr:0.0000100 epoch_Time:25207.0min: [2024-01-05 11:34:04,591][model8_pretrain.py][INFO] Epoch:[0/2](622200/4588595) loss:2.550 lr:0.0000100 epoch_Time:25207.0min: [2024-01-05 11:34:04,591][model8_pretrain.py][INFO] Epoch:[0/2](622200/4588595) loss:3.367 lr:0.0000100 epoch_Time:25207.0min: [2024-01-05 11:34:04,592][model8_pretrain.py][INFO] Epoch:[0/2](622200/4588595) loss:2.522 lr:0.0000100 epoch_Time:25207.0min: [2024-01-05 11:34:04,592][model8_pretrain.py][INFO] 
Epoch:[0/2](622200/4588595) loss:3.213 lr:0.0000100 epoch_Time:25207.0min: [2024-01-05 11:34:41,558][model8_pretrain.py][INFO] Epoch:[0/2](622300/4588595) loss:2.571 lr:0.0000100 epoch_Time:25207.0min: [2024-01-05 11:34:41,558][model8_pretrain.py][INFO] Epoch:[0/2](622300/4588595) loss:2.524 lr:0.0000100 epoch_Time:25207.0min: [2024-01-05 11:34:41,558][model8_pretrain.py][INFO] Epoch:[0/2](622300/4588595) loss:3.039 lr:0.0000100 epoch_Time:25207.0min: [2024-01-05 11:34:41,558][model8_pretrain.py][INFO] Epoch:[0/2](622300/4588595) loss:2.447 lr:0.0000100 epoch_Time:25207.0min: [2024-01-05 11:34:41,558][model8_pretrain.py][INFO] Epoch:[0/2](622300/4588595) loss:3.009 lr:0.0000100 epoch_Time:25207.0min: [2024-01-05 11:34:41,558][model8_pretrain.py][INFO] Epoch:[0/2](622300/4588595) loss:3.122 lr:0.0000100 epoch_Time:25207.0min: [2024-01-05 11:34:41,559][model8_pretrain.py][INFO] Epoch:[0/2](622300/4588595) loss:2.226 lr:0.0000100 epoch_Time:25207.0min: [2024-01-05 11:34:41,559][model8_pretrain.py][INFO] Epoch:[0/2](622300/4588595) loss:3.346 lr:0.0000100 epoch_Time:25207.0min: [2024-01-05 11:35:18,512][model8_pretrain.py][INFO] Epoch:[0/2](622400/4588595) loss:2.790 lr:0.0000100 epoch_Time:25206.0min: [2024-01-05 11:35:18,512][model8_pretrain.py][INFO] Epoch:[0/2](622400/4588595) loss:2.712 lr:0.0000100 epoch_Time:25206.0min: [2024-01-05 11:35:18,512][model8_pretrain.py][INFO] Epoch:[0/2](622400/4588595) loss:2.662 lr:0.0000100 epoch_Time:25206.0min: [2024-01-05 11:35:18,512][model8_pretrain.py][INFO] Epoch:[0/2](622400/4588595) loss:2.984 lr:0.0000100 epoch_Time:25206.0min: [2024-01-05 11:35:18,512][model8_pretrain.py][INFO] Epoch:[0/2](622400/4588595) loss:2.615 lr:0.0000100 epoch_Time:25206.0min: [2024-01-05 11:35:18,512][model8_pretrain.py][INFO] Epoch:[0/2](622400/4588595) loss:2.572 lr:0.0000100 epoch_Time:25206.0min: [2024-01-05 11:35:18,512][model8_pretrain.py][INFO] Epoch:[0/2](622400/4588595) loss:2.695 lr:0.0000100 epoch_Time:25206.0min: [2024-01-05 
11:35:18,512][model8_pretrain.py][INFO] Epoch:[0/2](622400/4588595) loss:3.199 lr:0.0000100 epoch_Time:25206.0min: [2024-01-05 11:35:55,450][model8_pretrain.py][INFO] Epoch:[0/2](622500/4588595) loss:2.432 lr:0.0000100 epoch_Time:25205.0min: [2024-01-05 11:35:55,450][model8_pretrain.py][INFO] Epoch:[0/2](622500/4588595) loss:2.945 lr:0.0000100 epoch_Time:25205.0min: [2024-01-05 11:35:55,450][model8_pretrain.py][INFO] Epoch:[0/2](622500/4588595) loss:2.666 lr:0.0000100 epoch_Time:25205.0min: [2024-01-05 11:35:55,450][model8_pretrain.py][INFO] Epoch:[0/2](622500/4588595) loss:2.730 lr:0.0000100 epoch_Time:25205.0min: [2024-01-05 11:35:55,450][model8_pretrain.py][INFO] Epoch:[0/2](622500/4588595) loss:2.863 lr:0.0000100 epoch_Time:25205.0min: [2024-01-05 11:35:55,450][model8_pretrain.py][INFO] Epoch:[0/2](622500/4588595) loss:2.285 lr:0.0000100 epoch_Time:25205.0min: [2024-01-05 11:35:55,450][model8_pretrain.py][INFO] Epoch:[0/2](622500/4588595) loss:3.388 lr:0.0000100 epoch_Time:25205.0min: [2024-01-05 11:35:55,450][model8_pretrain.py][INFO] Epoch:[0/2](622500/4588595) loss:3.000 lr:0.0000100 epoch_Time:25205.0min: [2024-01-05 11:36:32,388][model8_pretrain.py][INFO] Epoch:[0/2](622600/4588595) loss:2.823 lr:0.0000100 epoch_Time:25205.0min: [2024-01-05 11:36:32,388][model8_pretrain.py][INFO] Epoch:[0/2](622600/4588595) loss:2.841 lr:0.0000100 epoch_Time:25205.0min: [2024-01-05 11:36:32,388][model8_pretrain.py][INFO] Epoch:[0/2](622600/4588595) loss:2.315 lr:0.0000100 epoch_Time:25205.0min: [2024-01-05 11:36:32,388][model8_pretrain.py][INFO] Epoch:[0/2](622600/4588595) loss:2.193 lr:0.0000100 epoch_Time:25205.0min: [2024-01-05 11:36:32,388][model8_pretrain.py][INFO] Epoch:[0/2](622600/4588595) loss:3.727 lr:0.0000100 epoch_Time:25205.0min: [2024-01-05 11:36:32,388][model8_pretrain.py][INFO] Epoch:[0/2](622600/4588595) loss:2.882 lr:0.0000100 epoch_Time:25205.0min: [2024-01-05 11:36:32,388][model8_pretrain.py][INFO] Epoch:[0/2](622600/4588595) loss:3.185 lr:0.0000100 
epoch_Time:25205.0min: [2024-01-05 11:36:32,388][model8_pretrain.py][INFO] Epoch:[0/2](622600/4588595) loss:2.556 lr:0.0000100 epoch_Time:25205.0min: [2024-01-05 11:37:21,255][model8_pretrain.py][INFO] Epoch:[0/2](622700/4588595) loss:3.153 lr:0.0000100 epoch_Time:25205.0min: [2024-01-05 11:37:21,255][model8_pretrain.py][INFO] Epoch:[0/2](622700/4588595) loss:2.863 lr:0.0000100 epoch_Time:25205.0min: [2024-01-05 11:37:21,255][model8_pretrain.py][INFO] Epoch:[0/2](622700/4588595) loss:2.845 lr:0.0000100 epoch_Time:25205.0min: [2024-01-05 11:37:21,255][model8_pretrain.py][INFO] Epoch:[0/2](622700/4588595) loss:2.713 lr:0.0000100 epoch_Time:25205.0min: [2024-01-05 11:37:21,255][model8_pretrain.py][INFO] Epoch:[0/2](622700/4588595) loss:3.027 lr:0.0000100 epoch_Time:25205.0min: [2024-01-05 11:37:21,255][model8_pretrain.py][INFO] Epoch:[0/2](622700/4588595) loss:2.156 lr:0.0000100 epoch_Time:25205.0min: [2024-01-05 11:37:21,255][model8_pretrain.py][INFO] Epoch:[0/2](622700/4588595) loss:2.584 lr:0.0000100 epoch_Time:25205.0min: [2024-01-05 11:37:21,255][model8_pretrain.py][INFO] Epoch:[0/2](622700/4588595) loss:2.815 lr:0.0000100 epoch_Time:25205.0min: [2024-01-05 11:37:58,179][model8_pretrain.py][INFO] Epoch:[0/2](622800/4588595) loss:2.715 lr:0.0000100 epoch_Time:25204.0min: [2024-01-05 11:37:58,180][model8_pretrain.py][INFO] Epoch:[0/2](622800/4588595) loss:2.602 lr:0.0000100 epoch_Time:25204.0min: [2024-01-05 11:37:58,180][model8_pretrain.py][INFO] Epoch:[0/2](622800/4588595) loss:2.985 lr:0.0000100 epoch_Time:25204.0min: [2024-01-05 11:37:58,180][model8_pretrain.py][INFO] Epoch:[0/2](622800/4588595) loss:2.753 lr:0.0000100 epoch_Time:25204.0min: [2024-01-05 11:37:58,180][model8_pretrain.py][INFO] Epoch:[0/2](622800/4588595) loss:3.375 lr:0.0000100 epoch_Time:25204.0min: [2024-01-05 11:37:58,180][model8_pretrain.py][INFO] Epoch:[0/2](622800/4588595) loss:2.676 lr:0.0000100 epoch_Time:25204.0min: [2024-01-05 11:37:58,180][model8_pretrain.py][INFO] 
Epoch:[0/2](622800/4588595) loss:3.011 lr:0.0000100 epoch_Time:25204.0min: [2024-01-05 11:37:58,180][model8_pretrain.py][INFO] Epoch:[0/2](622800/4588595) loss:2.850 lr:0.0000100 epoch_Time:25204.0min: [2024-01-05 11:38:35,124][model8_pretrain.py][INFO] Epoch:[0/2](622900/4588595) loss:3.257 lr:0.0000100 epoch_Time:25204.0min: [2024-01-05 11:38:35,124][model8_pretrain.py][INFO] Epoch:[0/2](622900/4588595) loss:3.024 lr:0.0000100 epoch_Time:25204.0min: [2024-01-05 11:38:35,124][model8_pretrain.py][INFO] Epoch:[0/2](622900/4588595) loss:2.515 lr:0.0000100 epoch_Time:25204.0min: [2024-01-05 11:38:35,124][model8_pretrain.py][INFO] Epoch:[0/2](622900/4588595) loss:3.150 lr:0.0000100 epoch_Time:25204.0min: [2024-01-05 11:38:35,124][model8_pretrain.py][INFO] Epoch:[0/2](622900/4588595) loss:2.451 lr:0.0000100 epoch_Time:25204.0min: [2024-01-05 11:38:35,124][model8_pretrain.py][INFO] Epoch:[0/2](622900/4588595) loss:2.859 lr:0.0000100 epoch_Time:25204.0min: [2024-01-05 11:38:35,124][model8_pretrain.py][INFO] Epoch:[0/2](622900/4588595) loss:2.907 lr:0.0000100 epoch_Time:25204.0min: [2024-01-05 11:38:35,124][model8_pretrain.py][INFO] Epoch:[0/2](622900/4588595) loss:2.956 lr:0.0000100 epoch_Time:25204.0min: [2024-01-05 11:39:12,065][model8_pretrain.py][INFO] Epoch:[0/2](623000/4588595) loss:2.703 lr:0.0000100 epoch_Time:25203.0min: [2024-01-05 11:39:12,065][model8_pretrain.py][INFO] Epoch:[0/2](623000/4588595) loss:2.296 lr:0.0000100 epoch_Time:25203.0min: [2024-01-05 11:39:12,065][model8_pretrain.py][INFO] Epoch:[0/2](623000/4588595) loss:2.668 lr:0.0000100 epoch_Time:25203.0min: [2024-01-05 11:39:12,065][model8_pretrain.py][INFO] Epoch:[0/2](623000/4588595) loss:2.944 lr:0.0000100 epoch_Time:25203.0min: [2024-01-05 11:39:12,065][model8_pretrain.py][INFO] Epoch:[0/2](623000/4588595) loss:2.866 lr:0.0000100 epoch_Time:25203.0min: [2024-01-05 11:39:12,065][model8_pretrain.py][INFO] Epoch:[0/2](623000/4588595) loss:2.900 lr:0.0000100 epoch_Time:25203.0min: [2024-01-05 
11:39:12,065][model8_pretrain.py][INFO] Epoch:[0/2](623000/4588595) loss:2.769 lr:0.0000100 epoch_Time:25203.0min: [2024-01-05 11:39:12,065][model8_pretrain.py][INFO] Epoch:[0/2](623000/4588595) loss:3.028 lr:0.0000100 epoch_Time:25203.0min: [2024-01-05 11:39:49,001][model8_pretrain.py][INFO] Epoch:[0/2](623100/4588595) loss:3.187 lr:0.0000100 epoch_Time:25202.0min: [2024-01-05 11:39:49,001][model8_pretrain.py][INFO] Epoch:[0/2](623100/4588595) loss:3.226 lr:0.0000100 epoch_Time:25202.0min: [2024-01-05 11:39:49,001][model8_pretrain.py][INFO] Epoch:[0/2](623100/4588595) loss:3.187 lr:0.0000100 epoch_Time:25202.0min: [2024-01-05 11:39:49,001][model8_pretrain.py][INFO] Epoch:[0/2](623100/4588595) loss:3.181 lr:0.0000100 epoch_Time:25202.0min: [2024-01-05 11:39:49,001][model8_pretrain.py][INFO] Epoch:[0/2](623100/4588595) loss:3.084 lr:0.0000100 epoch_Time:25202.0min: [2024-01-05 11:39:49,001][model8_pretrain.py][INFO] Epoch:[0/2](623100/4588595) loss:3.305 lr:0.0000100 epoch_Time:25202.0min: [2024-01-05 11:39:49,001][model8_pretrain.py][INFO] Epoch:[0/2](623100/4588595) loss:2.889 lr:0.0000100 epoch_Time:25202.0min: [2024-01-05 11:39:49,001][model8_pretrain.py][INFO] Epoch:[0/2](623100/4588595) loss:2.325 lr:0.0000100 epoch_Time:25202.0min: [2024-01-05 11:40:25,938][model8_pretrain.py][INFO] Epoch:[0/2](623200/4588595) loss:3.284 lr:0.0000100 epoch_Time:25201.0min: [2024-01-05 11:40:25,938][model8_pretrain.py][INFO] Epoch:[0/2](623200/4588595) loss:2.757 lr:0.0000100 epoch_Time:25201.0min: [2024-01-05 11:40:25,938][model8_pretrain.py][INFO] Epoch:[0/2](623200/4588595) loss:2.517 lr:0.0000100 epoch_Time:25201.0min: [2024-01-05 11:40:25,938][model8_pretrain.py][INFO] Epoch:[0/2](623200/4588595) loss:3.044 lr:0.0000100 epoch_Time:25201.0min: [2024-01-05 11:40:25,938][model8_pretrain.py][INFO] Epoch:[0/2](623200/4588595) loss:2.784 lr:0.0000100 epoch_Time:25201.0min: [2024-01-05 11:40:25,938][model8_pretrain.py][INFO] Epoch:[0/2](623200/4588595) loss:3.501 lr:0.0000100 
epoch_Time:25201.0min: [2024-01-05 11:40:25,938][model8_pretrain.py][INFO] Epoch:[0/2](623200/4588595) loss:3.113 lr:0.0000100 epoch_Time:25201.0min: [2024-01-05 11:40:25,939][model8_pretrain.py][INFO] Epoch:[0/2](623200/4588595) loss:2.608 lr:0.0000100 epoch_Time:25201.0min: [2024-01-05 11:41:02,900][model8_pretrain.py][INFO] Epoch:[0/2](623300/4588595) loss:3.149 lr:0.0000100 epoch_Time:25200.0min: [2024-01-05 11:41:02,900][model8_pretrain.py][INFO] Epoch:[0/2](623300/4588595) loss:2.829 lr:0.0000100 epoch_Time:25200.0min: [2024-01-05 11:41:02,900][model8_pretrain.py][INFO] Epoch:[0/2](623300/4588595) loss:2.489 lr:0.0000100 epoch_Time:25200.0min: [2024-01-05 11:41:02,900][model8_pretrain.py][INFO] Epoch:[0/2](623300/4588595) loss:2.473 lr:0.0000100 epoch_Time:25200.0min: [2024-01-05 11:41:02,900][model8_pretrain.py][INFO] Epoch:[0/2](623300/4588595) loss:2.585 lr:0.0000100 epoch_Time:25200.0min: [2024-01-05 11:41:02,900][model8_pretrain.py][INFO] Epoch:[0/2](623300/4588595) loss:2.839 lr:0.0000100 epoch_Time:25200.0min: [2024-01-05 11:41:02,900][model8_pretrain.py][INFO] Epoch:[0/2](623300/4588595) loss:3.019 lr:0.0000100 epoch_Time:25200.0min: [2024-01-05 11:41:02,900][model8_pretrain.py][INFO] Epoch:[0/2](623300/4588595) loss:2.608 lr:0.0000100 epoch_Time:25200.0min: [2024-01-05 11:41:39,866][model8_pretrain.py][INFO] Epoch:[0/2](623400/4588595) loss:2.896 lr:0.0000100 epoch_Time:25200.0min: [2024-01-05 11:41:39,866][model8_pretrain.py][INFO] Epoch:[0/2](623400/4588595) loss:2.800 lr:0.0000100 epoch_Time:25200.0min: [2024-01-05 11:41:39,866][model8_pretrain.py][INFO] Epoch:[0/2](623400/4588595) loss:2.992 lr:0.0000100 epoch_Time:25200.0min: [2024-01-05 11:41:39,866][model8_pretrain.py][INFO] Epoch:[0/2](623400/4588595) loss:2.927 lr:0.0000100 epoch_Time:25200.0min: [2024-01-05 11:41:39,866][model8_pretrain.py][INFO] Epoch:[0/2](623400/4588595) loss:3.368 lr:0.0000100 epoch_Time:25200.0min: [2024-01-05 11:41:39,866][model8_pretrain.py][INFO] 
Epoch:[0/2](623400/4588595) loss:2.532 lr:0.0000100 epoch_Time:25200.0min: [2024-01-05 11:41:39,866][model8_pretrain.py][INFO] Epoch:[0/2](623400/4588595) loss:2.672 lr:0.0000100 epoch_Time:25200.0min: [2024-01-05 11:41:39,866][model8_pretrain.py][INFO] Epoch:[0/2](623400/4588595) loss:2.858 lr:0.0000100 epoch_Time:25200.0min: [2024-01-05 11:42:28,779][model8_pretrain.py][INFO] Epoch:[0/2](623500/4588595) loss:2.282 lr:0.0000100 epoch_Time:25201.0min: [2024-01-05 11:42:28,779][model8_pretrain.py][INFO] Epoch:[0/2](623500/4588595) loss:2.431 lr:0.0000100 epoch_Time:25201.0min: [2024-01-05 11:42:28,779][model8_pretrain.py][INFO] Epoch:[0/2](623500/4588595) loss:2.356 lr:0.0000100 epoch_Time:25201.0min: [2024-01-05 11:42:28,779][model8_pretrain.py][INFO] Epoch:[0/2](623500/4588595) loss:3.218 lr:0.0000100 epoch_Time:25201.0min: [2024-01-05 11:42:28,779][model8_pretrain.py][INFO] Epoch:[0/2](623500/4588595) loss:2.585 lr:0.0000100 epoch_Time:25201.0min: [2024-01-05 11:42:28,779][model8_pretrain.py][INFO] Epoch:[0/2](623500/4588595) loss:2.849 lr:0.0000100 epoch_Time:25201.0min: [2024-01-05 11:42:28,779][model8_pretrain.py][INFO] Epoch:[0/2](623500/4588595) loss:3.103 lr:0.0000100 epoch_Time:25201.0min: [2024-01-05 11:42:28,779][model8_pretrain.py][INFO] Epoch:[0/2](623500/4588595) loss:3.150 lr:0.0000100 epoch_Time:25201.0min: [2024-01-05 11:43:05,694][model8_pretrain.py][INFO] Epoch:[0/2](623600/4588595) loss:3.215 lr:0.0000100 epoch_Time:25199.0min: [2024-01-05 11:43:05,694][model8_pretrain.py][INFO] Epoch:[0/2](623600/4588595) loss:2.941 lr:0.0000100 epoch_Time:25199.0min: [2024-01-05 11:43:05,694][model8_pretrain.py][INFO] Epoch:[0/2](623600/4588595) loss:3.099 lr:0.0000100 epoch_Time:25199.0min: [2024-01-05 11:43:05,694][model8_pretrain.py][INFO] Epoch:[0/2](623600/4588595) loss:2.768 lr:0.0000100 epoch_Time:25199.0min: [2024-01-05 11:43:05,694][model8_pretrain.py][INFO] Epoch:[0/2](623600/4588595) loss:3.075 lr:0.0000100 epoch_Time:25199.0min: [2024-01-05 
11:43:05,694][model8_pretrain.py][INFO] Epoch:[0/2](623600/4588595) loss:2.463 lr:0.0000100 epoch_Time:25199.0min: [2024-01-05 11:43:05,694][model8_pretrain.py][INFO] Epoch:[0/2](623600/4588595) loss:3.144 lr:0.0000100 epoch_Time:25199.0min: [2024-01-05 11:43:05,694][model8_pretrain.py][INFO] Epoch:[0/2](623600/4588595) loss:2.892 lr:0.0000100 epoch_Time:25199.0min: [2024-01-05 11:43:42,672][model8_pretrain.py][INFO] Epoch:[0/2](623700/4588595) loss:2.649 lr:0.0000100 epoch_Time:25199.0min: [2024-01-05 11:43:42,672][model8_pretrain.py][INFO] Epoch:[0/2](623700/4588595) loss:3.123 lr:0.0000100 epoch_Time:25199.0min: [2024-01-05 11:43:42,672][model8_pretrain.py][INFO] Epoch:[0/2](623700/4588595) loss:2.666 lr:0.0000100 epoch_Time:25199.0min: [2024-01-05 11:43:42,672][model8_pretrain.py][INFO] Epoch:[0/2](623700/4588595) loss:2.631 lr:0.0000100 epoch_Time:25199.0min: [2024-01-05 11:43:42,672][model8_pretrain.py][INFO] Epoch:[0/2](623700/4588595) loss:2.503 lr:0.0000100 epoch_Time:25199.0min: [2024-01-05 11:43:42,672][model8_pretrain.py][INFO] Epoch:[0/2](623700/4588595) loss:2.589 lr:0.0000100 epoch_Time:25199.0min: [2024-01-05 11:43:42,672][model8_pretrain.py][INFO] Epoch:[0/2](623700/4588595) loss:2.260 lr:0.0000100 epoch_Time:25199.0min: [2024-01-05 11:43:42,672][model8_pretrain.py][INFO] Epoch:[0/2](623700/4588595) loss:2.939 lr:0.0000100 epoch_Time:25199.0min: [2024-01-05 11:44:19,621][model8_pretrain.py][INFO] Epoch:[0/2](623800/4588595) loss:3.039 lr:0.0000100 epoch_Time:25198.0min: [2024-01-05 11:44:19,621][model8_pretrain.py][INFO] Epoch:[0/2](623800/4588595) loss:2.780 lr:0.0000100 epoch_Time:25198.0min: [2024-01-05 11:44:19,621][model8_pretrain.py][INFO] Epoch:[0/2](623800/4588595) loss:2.821 lr:0.0000100 epoch_Time:25198.0min: [2024-01-05 11:44:19,621][model8_pretrain.py][INFO] Epoch:[0/2](623800/4588595) loss:2.794 lr:0.0000100 epoch_Time:25198.0min: [2024-01-05 11:44:19,621][model8_pretrain.py][INFO] Epoch:[0/2](623800/4588595) loss:1.856 lr:0.0000100 
epoch_Time:25198.0min: [2024-01-05 11:44:19,621][model8_pretrain.py][INFO] Epoch:[0/2](623800/4588595) loss:2.493 lr:0.0000100 epoch_Time:25198.0min: [2024-01-05 11:44:19,621][model8_pretrain.py][INFO] Epoch:[0/2](623800/4588595) loss:3.440 lr:0.0000100 epoch_Time:25198.0min: [2024-01-05 11:44:19,622][model8_pretrain.py][INFO] Epoch:[0/2](623800/4588595) loss:2.905 lr:0.0000100 epoch_Time:25198.0min: [2024-01-05 11:44:56,553][model8_pretrain.py][INFO] Epoch:[0/2](623900/4588595) loss:2.312 lr:0.0000100 epoch_Time:25197.0min: [2024-01-05 11:44:56,553][model8_pretrain.py][INFO] Epoch:[0/2](623900/4588595) loss:2.661 lr:0.0000100 epoch_Time:25197.0min: [2024-01-05 11:44:56,553][model8_pretrain.py][INFO] Epoch:[0/2](623900/4588595) loss:3.133 lr:0.0000100 epoch_Time:25197.0min: [2024-01-05 11:44:56,553][model8_pretrain.py][INFO] Epoch:[0/2](623900/4588595) loss:3.052 lr:0.0000100 epoch_Time:25197.0min: [2024-01-05 11:44:56,553][model8_pretrain.py][INFO] Epoch:[0/2](623900/4588595) loss:3.136 lr:0.0000100 epoch_Time:25197.0min: [2024-01-05 11:44:56,553][model8_pretrain.py][INFO] Epoch:[0/2](623900/4588595) loss:2.872 lr:0.0000100 epoch_Time:25197.0min: [2024-01-05 11:44:56,553][model8_pretrain.py][INFO] Epoch:[0/2](623900/4588595) loss:2.675 lr:0.0000100 epoch_Time:25197.0min: [2024-01-05 11:44:56,553][model8_pretrain.py][INFO] Epoch:[0/2](623900/4588595) loss:2.867 lr:0.0000100 epoch_Time:25197.0min: [2024-01-05 11:45:33,495][model8_pretrain.py][INFO] Epoch:[0/2](624000/4588595) loss:2.375 lr:0.0000100 epoch_Time:25197.0min: [2024-01-05 11:45:33,495][model8_pretrain.py][INFO] Epoch:[0/2](624000/4588595) loss:3.177 lr:0.0000100 epoch_Time:25197.0min: [2024-01-05 11:45:33,495][model8_pretrain.py][INFO] Epoch:[0/2](624000/4588595) loss:2.880 lr:0.0000100 epoch_Time:25197.0min: [2024-01-05 11:45:33,495][model8_pretrain.py][INFO] Epoch:[0/2](624000/4588595) loss:2.182 lr:0.0000100 epoch_Time:25197.0min: [2024-01-05 11:45:33,495][model8_pretrain.py][INFO] 
Epoch:[0/2](624000/4588595) loss:2.721 lr:0.0000100 epoch_Time:25197.0min: [2024-01-05 11:45:33,495][model8_pretrain.py][INFO] Epoch:[0/2](624000/4588595) loss:3.026 lr:0.0000100 epoch_Time:25197.0min: [2024-01-05 11:45:33,495][model8_pretrain.py][INFO] Epoch:[0/2](624000/4588595) loss:2.936 lr:0.0000100 epoch_Time:25197.0min: [2024-01-05 11:45:33,495][model8_pretrain.py][INFO] Epoch:[0/2](624000/4588595) loss:3.015 lr:0.0000100 epoch_Time:25197.0min: [2024-01-05 11:46:10,455][model8_pretrain.py][INFO] Epoch:[0/2](624100/4588595) loss:3.246 lr:0.0000100 epoch_Time:25196.0min: [2024-01-05 11:46:10,455][model8_pretrain.py][INFO] Epoch:[0/2](624100/4588595) loss:2.967 lr:0.0000100 epoch_Time:25196.0min: [2024-01-05 11:46:10,455][model8_pretrain.py][INFO] Epoch:[0/2](624100/4588595) loss:2.701 lr:0.0000100 epoch_Time:25196.0min: [2024-01-05 11:46:10,455][model8_pretrain.py][INFO] Epoch:[0/2](624100/4588595) loss:2.912 lr:0.0000100 epoch_Time:25196.0min: [2024-01-05 11:46:10,455][model8_pretrain.py][INFO] Epoch:[0/2](624100/4588595) loss:3.281 lr:0.0000100 epoch_Time:25196.0min: [2024-01-05 11:46:10,455][model8_pretrain.py][INFO] Epoch:[0/2](624100/4588595) loss:2.835 lr:0.0000100 epoch_Time:25196.0min: [2024-01-05 11:46:10,455][model8_pretrain.py][INFO] Epoch:[0/2](624100/4588595) loss:3.036 lr:0.0000100 epoch_Time:25196.0min: [2024-01-05 11:46:10,455][model8_pretrain.py][INFO] Epoch:[0/2](624100/4588595) loss:2.932 lr:0.0000100 epoch_Time:25196.0min: [2024-01-05 11:46:47,388][model8_pretrain.py][INFO] Epoch:[0/2](624200/4588595) loss:2.863 lr:0.0000100 epoch_Time:25195.0min: [2024-01-05 11:46:47,388][model8_pretrain.py][INFO] Epoch:[0/2](624200/4588595) loss:3.336 lr:0.0000100 epoch_Time:25195.0min: [2024-01-05 11:46:47,388][model8_pretrain.py][INFO] Epoch:[0/2](624200/4588595) loss:3.160 lr:0.0000100 epoch_Time:25195.0min: [2024-01-05 11:46:47,388][model8_pretrain.py][INFO] Epoch:[0/2](624200/4588595) loss:2.655 lr:0.0000100 epoch_Time:25195.0min: [2024-01-05 
11:46:47,388][model8_pretrain.py][INFO] Epoch:[0/2](624200/4588595) loss:3.102 lr:0.0000100 epoch_Time:25195.0min: [2024-01-05 11:46:47,388][model8_pretrain.py][INFO] Epoch:[0/2](624200/4588595) loss:1.826 lr:0.0000100 epoch_Time:25195.0min: [2024-01-05 11:46:47,388][model8_pretrain.py][INFO] Epoch:[0/2](624200/4588595) loss:3.018 lr:0.0000100 epoch_Time:25195.0min: [2024-01-05 11:46:47,388][model8_pretrain.py][INFO] Epoch:[0/2](624200/4588595) loss:3.416 lr:0.0000100 epoch_Time:25195.0min: [2024-01-05 11:47:36,032][model8_pretrain.py][INFO] Epoch:[0/2](624300/4588595) loss:3.218 lr:0.0000100 epoch_Time:25196.0min: [2024-01-05 11:47:36,032][model8_pretrain.py][INFO] Epoch:[0/2](624300/4588595) loss:2.649 lr:0.0000100 epoch_Time:25196.0min: [2024-01-05 11:47:36,032][model8_pretrain.py][INFO] Epoch:[0/2](624300/4588595) loss:2.888 lr:0.0000100 epoch_Time:25196.0min: [2024-01-05 11:47:36,032][model8_pretrain.py][INFO] Epoch:[0/2](624300/4588595) loss:2.965 lr:0.0000100 epoch_Time:25196.0min: [2024-01-05 11:47:36,032][model8_pretrain.py][INFO] Epoch:[0/2](624300/4588595) loss:2.541 lr:0.0000100 epoch_Time:25196.0min: [2024-01-05 11:47:36,032][model8_pretrain.py][INFO] Epoch:[0/2](624300/4588595) loss:3.158 lr:0.0000100 epoch_Time:25196.0min: [2024-01-05 11:47:36,032][model8_pretrain.py][INFO] Epoch:[0/2](624300/4588595) loss:2.036 lr:0.0000100 epoch_Time:25196.0min: [2024-01-05 11:47:36,033][model8_pretrain.py][INFO] Epoch:[0/2](624300/4588595) loss:3.058 lr:0.0000100 epoch_Time:25196.0min: [2024-01-05 11:48:12,963][model8_pretrain.py][INFO] Epoch:[0/2](624400/4588595) loss:2.636 lr:0.0000100 epoch_Time:25195.0min: [2024-01-05 11:48:12,963][model8_pretrain.py][INFO] Epoch:[0/2](624400/4588595) loss:2.954 lr:0.0000100 epoch_Time:25195.0min: [2024-01-05 11:48:12,963][model8_pretrain.py][INFO] Epoch:[0/2](624400/4588595) loss:2.758 lr:0.0000100 epoch_Time:25195.0min: [2024-01-05 11:48:12,963][model8_pretrain.py][INFO] Epoch:[0/2](624400/4588595) loss:3.040 lr:0.0000100 
epoch_Time:25195.0min: [2024-01-05 11:48:12,963][model8_pretrain.py][INFO] Epoch:[0/2](624400/4588595) loss:3.101 lr:0.0000100 epoch_Time:25195.0min: [2024-01-05 11:48:12,963][model8_pretrain.py][INFO] Epoch:[0/2](624400/4588595) loss:3.350 lr:0.0000100 epoch_Time:25195.0min: [2024-01-05 11:48:12,963][model8_pretrain.py][INFO] Epoch:[0/2](624400/4588595) loss:2.739 lr:0.0000100 epoch_Time:25195.0min: [2024-01-05 11:48:12,964][model8_pretrain.py][INFO] Epoch:[0/2](624400/4588595) loss:2.881 lr:0.0000100 epoch_Time:25195.0min: [2024-01-05 11:48:49,908][model8_pretrain.py][INFO] Epoch:[0/2](624500/4588595) loss:2.710 lr:0.0000100 epoch_Time:25193.0min: [2024-01-05 11:48:49,908][model8_pretrain.py][INFO] Epoch:[0/2](624500/4588595) loss:3.319 lr:0.0000100 epoch_Time:25193.0min: [2024-01-05 11:48:49,908][model8_pretrain.py][INFO] Epoch:[0/2](624500/4588595) loss:2.849 lr:0.0000100 epoch_Time:25193.0min: [2024-01-05 11:48:49,908][model8_pretrain.py][INFO] Epoch:[0/2](624500/4588595) loss:3.032 lr:0.0000100 epoch_Time:25193.0min: [2024-01-05 11:48:49,908][model8_pretrain.py][INFO] Epoch:[0/2](624500/4588595) loss:2.862 lr:0.0000100 epoch_Time:25193.0min: [2024-01-05 11:48:49,908][model8_pretrain.py][INFO] Epoch:[0/2](624500/4588595) loss:3.075 lr:0.0000100 epoch_Time:25193.0min: [2024-01-05 11:48:49,908][model8_pretrain.py][INFO] Epoch:[0/2](624500/4588595) loss:2.461 lr:0.0000100 epoch_Time:25193.0min: [2024-01-05 11:48:49,909][model8_pretrain.py][INFO] Epoch:[0/2](624500/4588595) loss:2.395 lr:0.0000100 epoch_Time:25193.0min: [2024-01-05 11:49:26,848][model8_pretrain.py][INFO] Epoch:[0/2](624600/4588595) loss:2.899 lr:0.0000100 epoch_Time:25193.0min: [2024-01-05 11:49:26,848][model8_pretrain.py][INFO] Epoch:[0/2](624600/4588595) loss:3.290 lr:0.0000100 epoch_Time:25193.0min: [2024-01-05 11:49:26,848][model8_pretrain.py][INFO] Epoch:[0/2](624600/4588595) loss:2.579 lr:0.0000100 epoch_Time:25193.0min: [2024-01-05 11:49:26,848][model8_pretrain.py][INFO] 
Epoch:[0/2](624600/4588595) loss:2.348 lr:0.0000100 epoch_Time:25193.0min: [2024-01-05 11:49:26,848][model8_pretrain.py][INFO] Epoch:[0/2](624600/4588595) loss:2.440 lr:0.0000100 epoch_Time:25193.0min: [2024-01-05 11:49:26,848][model8_pretrain.py][INFO] Epoch:[0/2](624600/4588595) loss:3.271 lr:0.0000100 epoch_Time:25193.0min: [2024-01-05 11:49:26,849][model8_pretrain.py][INFO] Epoch:[0/2](624600/4588595) loss:2.544 lr:0.0000100 epoch_Time:25193.0min: [2024-01-05 11:49:26,849][model8_pretrain.py][INFO] Epoch:[0/2](624600/4588595) loss:3.353 lr:0.0000100 epoch_Time:25193.0min: [2024-01-05 11:50:03,796][model8_pretrain.py][INFO] Epoch:[0/2](624700/4588595) loss:3.160 lr:0.0000100 epoch_Time:25192.0min: [2024-01-05 11:50:03,796][model8_pretrain.py][INFO] Epoch:[0/2](624700/4588595) loss:2.790 lr:0.0000100 epoch_Time:25192.0min: [2024-01-05 11:50:03,796][model8_pretrain.py][INFO] Epoch:[0/2](624700/4588595) loss:2.440 lr:0.0000100 epoch_Time:25192.0min: [2024-01-05 11:50:03,796][model8_pretrain.py][INFO] Epoch:[0/2](624700/4588595) loss:2.076 lr:0.0000100 epoch_Time:25192.0min: [2024-01-05 11:50:03,796][model8_pretrain.py][INFO] Epoch:[0/2](624700/4588595) loss:2.890 lr:0.0000100 epoch_Time:25192.0min: [2024-01-05 11:50:03,796][model8_pretrain.py][INFO] Epoch:[0/2](624700/4588595) loss:2.576 lr:0.0000100 epoch_Time:25192.0min: [2024-01-05 11:50:03,796][model8_pretrain.py][INFO] Epoch:[0/2](624700/4588595) loss:3.034 lr:0.0000100 epoch_Time:25192.0min: [2024-01-05 11:50:03,796][model8_pretrain.py][INFO] Epoch:[0/2](624700/4588595) loss:3.326 lr:0.0000100 epoch_Time:25192.0min: [2024-01-05 11:50:40,759][model8_pretrain.py][INFO] Epoch:[0/2](624800/4588595) loss:3.140 lr:0.0000100 epoch_Time:25192.0min: [2024-01-05 11:50:40,759][model8_pretrain.py][INFO] Epoch:[0/2](624800/4588595) loss:2.951 lr:0.0000100 epoch_Time:25192.0min: [2024-01-05 11:50:40,759][model8_pretrain.py][INFO] Epoch:[0/2](624800/4588595) loss:3.170 lr:0.0000100 epoch_Time:25192.0min: [2024-01-05 
11:50:40,759][model8_pretrain.py][INFO] Epoch:[0/2](624800/4588595) loss:3.440 lr:0.0000100 epoch_Time:25192.0min: [2024-01-05 11:50:40,759][model8_pretrain.py][INFO] Epoch:[0/2](624800/4588595) loss:2.890 lr:0.0000100 epoch_Time:25192.0min: [2024-01-05 11:50:40,759][model8_pretrain.py][INFO] Epoch:[0/2](624800/4588595) loss:2.928 lr:0.0000100 epoch_Time:25192.0min: [2024-01-05 11:50:40,759][model8_pretrain.py][INFO] Epoch:[0/2](624800/4588595) loss:1.995 lr:0.0000100 epoch_Time:25192.0min: [2024-01-05 11:50:40,759][model8_pretrain.py][INFO] Epoch:[0/2](624800/4588595) loss:2.742 lr:0.0000100 epoch_Time:25192.0min: [2024-01-05 11:51:17,713][model8_pretrain.py][INFO] Epoch:[0/2](624900/4588595) loss:2.647 lr:0.0000100 epoch_Time:25191.0min: [2024-01-05 11:51:17,713][model8_pretrain.py][INFO] Epoch:[0/2](624900/4588595) loss:3.169 lr:0.0000100 epoch_Time:25191.0min: [2024-01-05 11:51:17,713][model8_pretrain.py][INFO] Epoch:[0/2](624900/4588595) loss:3.353 lr:0.0000100 epoch_Time:25191.0min: [2024-01-05 11:51:17,713][model8_pretrain.py][INFO] Epoch:[0/2](624900/4588595) loss:2.519 lr:0.0000100 epoch_Time:25191.0min: [2024-01-05 11:51:17,713][model8_pretrain.py][INFO] Epoch:[0/2](624900/4588595) loss:2.657 lr:0.0000100 epoch_Time:25191.0min: [2024-01-05 11:51:17,713][model8_pretrain.py][INFO] Epoch:[0/2](624900/4588595) loss:3.122 lr:0.0000100 epoch_Time:25191.0min: [2024-01-05 11:51:17,713][model8_pretrain.py][INFO] Epoch:[0/2](624900/4588595) loss:2.621 lr:0.0000100 epoch_Time:25191.0min: [2024-01-05 11:51:17,714][model8_pretrain.py][INFO] Epoch:[0/2](624900/4588595) loss:3.810 lr:0.0000100 epoch_Time:25191.0min: [2024-01-05 11:51:54,655][model8_pretrain.py][INFO] Epoch:[0/2](625000/4588595) loss:3.239 lr:0.0000100 epoch_Time:25190.0min: [2024-01-05 11:51:54,655][model8_pretrain.py][INFO] Epoch:[0/2](625000/4588595) loss:2.655 lr:0.0000100 epoch_Time:25190.0min: [2024-01-05 11:51:54,655][model8_pretrain.py][INFO] Epoch:[0/2](625000/4588595) loss:3.087 lr:0.0000100 
epoch_Time:25190.0min: [2024-01-05 11:51:54,655][model8_pretrain.py][INFO] Epoch:[0/2](625000/4588595) loss:2.917 lr:0.0000100 epoch_Time:25190.0min: [2024-01-05 11:51:54,655][model8_pretrain.py][INFO] Epoch:[0/2](625000/4588595) loss:3.225 lr:0.0000100 epoch_Time:25190.0min: [2024-01-05 11:51:54,655][model8_pretrain.py][INFO] Epoch:[0/2](625000/4588595) loss:2.569 lr:0.0000100 epoch_Time:25190.0min: [2024-01-05 11:51:54,655][model8_pretrain.py][INFO] Epoch:[0/2](625000/4588595) loss:3.038 lr:0.0000100 epoch_Time:25190.0min: [2024-01-05 11:51:54,655][model8_pretrain.py][INFO] Epoch:[0/2](625000/4588595) loss:2.781 lr:0.0000100 epoch_Time:25190.0min: [2024-01-05 11:52:43,484][model8_pretrain.py][INFO] Epoch:[0/2](625100/4588595) loss:2.963 lr:0.0000100 epoch_Time:25191.0min: [2024-01-05 11:52:43,484][model8_pretrain.py][INFO] Epoch:[0/2](625100/4588595) loss:3.295 lr:0.0000100 epoch_Time:25191.0min: [2024-01-05 11:52:43,484][model8_pretrain.py][INFO] Epoch:[0/2](625100/4588595) loss:2.982 lr:0.0000100 epoch_Time:25191.0min: [2024-01-05 11:52:43,484][model8_pretrain.py][INFO] Epoch:[0/2](625100/4588595) loss:2.680 lr:0.0000100 epoch_Time:25191.0min: [2024-01-05 11:52:43,484][model8_pretrain.py][INFO] Epoch:[0/2](625100/4588595) loss:3.134 lr:0.0000100 epoch_Time:25191.0min: [2024-01-05 11:52:43,484][model8_pretrain.py][INFO] Epoch:[0/2](625100/4588595) loss:3.172 lr:0.0000100 epoch_Time:25191.0min: [2024-01-05 11:52:43,484][model8_pretrain.py][INFO] Epoch:[0/2](625100/4588595) loss:3.108 lr:0.0000100 epoch_Time:25191.0min: [2024-01-05 11:52:43,485][model8_pretrain.py][INFO] Epoch:[0/2](625100/4588595) loss:2.887 lr:0.0000100 epoch_Time:25191.0min: [2024-01-05 11:53:20,420][model8_pretrain.py][INFO] Epoch:[0/2](625200/4588595) loss:2.551 lr:0.0000100 epoch_Time:25190.0min: [2024-01-05 11:53:20,420][model8_pretrain.py][INFO] Epoch:[0/2](625200/4588595) loss:2.432 lr:0.0000100 epoch_Time:25190.0min: [2024-01-05 11:53:20,420][model8_pretrain.py][INFO] 
Epoch:[0/2](625200/4588595) loss:2.624 lr:0.0000100 epoch_Time:25190.0min: [2024-01-05 11:53:20,420][model8_pretrain.py][INFO] Epoch:[0/2](625200/4588595) loss:3.239 lr:0.0000100 epoch_Time:25190.0min: [2024-01-05 11:53:20,420][model8_pretrain.py][INFO] Epoch:[0/2](625200/4588595) loss:3.302 lr:0.0000100 epoch_Time:25190.0min: [2024-01-05 11:53:20,420][model8_pretrain.py][INFO] Epoch:[0/2](625200/4588595) loss:2.815 lr:0.0000100 epoch_Time:25190.0min: [2024-01-05 11:53:20,420][model8_pretrain.py][INFO] Epoch:[0/2](625200/4588595) loss:3.301 lr:0.0000100 epoch_Time:25190.0min: [2024-01-05 11:53:20,420][model8_pretrain.py][INFO] Epoch:[0/2](625200/4588595) loss:3.167 lr:0.0000100 epoch_Time:25190.0min: [2024-01-05 11:53:57,365][model8_pretrain.py][INFO] Epoch:[0/2](625300/4588595) loss:2.900 lr:0.0000100 epoch_Time:25189.0min: [2024-01-05 11:53:57,365][model8_pretrain.py][INFO] Epoch:[0/2](625300/4588595) loss:2.480 lr:0.0000100 epoch_Time:25189.0min: [2024-01-05 11:53:57,365][model8_pretrain.py][INFO] Epoch:[0/2](625300/4588595) loss:2.795 lr:0.0000100 epoch_Time:25189.0min: [2024-01-05 11:53:57,365][model8_pretrain.py][INFO] Epoch:[0/2](625300/4588595) loss:3.126 lr:0.0000100 epoch_Time:25189.0min: [2024-01-05 11:53:57,365][model8_pretrain.py][INFO] Epoch:[0/2](625300/4588595) loss:3.195 lr:0.0000100 epoch_Time:25189.0min: [2024-01-05 11:53:57,365][model8_pretrain.py][INFO] Epoch:[0/2](625300/4588595) loss:2.166 lr:0.0000100 epoch_Time:25189.0min: [2024-01-05 11:53:57,365][model8_pretrain.py][INFO] Epoch:[0/2](625300/4588595) loss:3.160 lr:0.0000100 epoch_Time:25189.0min: [2024-01-05 11:53:57,365][model8_pretrain.py][INFO] Epoch:[0/2](625300/4588595) loss:2.692 lr:0.0000100 epoch_Time:25189.0min: [2024-01-05 11:54:34,319][model8_pretrain.py][INFO] Epoch:[0/2](625400/4588595) loss:2.697 lr:0.0000100 epoch_Time:25189.0min: [2024-01-05 11:54:34,319][model8_pretrain.py][INFO] Epoch:[0/2](625400/4588595) loss:2.747 lr:0.0000100 epoch_Time:25189.0min: [2024-01-05 
11:54:34,319][model8_pretrain.py][INFO] Epoch:[0/2](625400/4588595) loss:3.136 lr:0.0000100 epoch_Time:25189.0min: [2024-01-05 11:54:34,319][model8_pretrain.py][INFO] Epoch:[0/2](625400/4588595) loss:3.143 lr:0.0000100 epoch_Time:25189.0min: [2024-01-05 11:54:34,319][model8_pretrain.py][INFO] Epoch:[0/2](625400/4588595) loss:2.344 lr:0.0000100 epoch_Time:25189.0min: [2024-01-05 11:54:34,319][model8_pretrain.py][INFO] Epoch:[0/2](625400/4588595) loss:3.362 lr:0.0000100 epoch_Time:25189.0min: [2024-01-05 11:54:34,319][model8_pretrain.py][INFO] Epoch:[0/2](625400/4588595) loss:2.897 lr:0.0000100 epoch_Time:25189.0min: [2024-01-05 11:54:34,319][model8_pretrain.py][INFO] Epoch:[0/2](625400/4588595) loss:2.829 lr:0.0000100 epoch_Time:25189.0min: [2024-01-05 11:55:11,268][model8_pretrain.py][INFO] Epoch:[0/2](625500/4588595) loss:2.881 lr:0.0000100 epoch_Time:25187.0min: [2024-01-05 11:55:11,268][model8_pretrain.py][INFO] Epoch:[0/2](625500/4588595) loss:3.080 lr:0.0000100 epoch_Time:25187.0min: [2024-01-05 11:55:11,268][model8_pretrain.py][INFO] Epoch:[0/2](625500/4588595) loss:2.844 lr:0.0000100 epoch_Time:25187.0min: [2024-01-05 11:55:11,268][model8_pretrain.py][INFO] Epoch:[0/2](625500/4588595) loss:2.636 lr:0.0000100 epoch_Time:25187.0min: [2024-01-05 11:55:11,268][model8_pretrain.py][INFO] Epoch:[0/2](625500/4588595) loss:2.927 lr:0.0000100 epoch_Time:25187.0min: [2024-01-05 11:55:11,268][model8_pretrain.py][INFO] Epoch:[0/2](625500/4588595) loss:2.933 lr:0.0000100 epoch_Time:25187.0min: [2024-01-05 11:55:11,268][model8_pretrain.py][INFO] Epoch:[0/2](625500/4588595) loss:3.176 lr:0.0000100 epoch_Time:25187.0min: [2024-01-05 11:55:11,268][model8_pretrain.py][INFO] Epoch:[0/2](625500/4588595) loss:2.435 lr:0.0000100 epoch_Time:25187.0min: [2024-01-05 11:55:48,235][model8_pretrain.py][INFO] Epoch:[0/2](625600/4588595) loss:2.918 lr:0.0000100 epoch_Time:25186.0min: [2024-01-05 11:55:48,235][model8_pretrain.py][INFO] Epoch:[0/2](625600/4588595) loss:2.837 lr:0.0000100 
epoch_Time:25186.0min: [2024-01-05 11:55:48,235][model8_pretrain.py][INFO] Epoch:[0/2](625600/4588595) loss:3.021 lr:0.0000100 epoch_Time:25186.0min: [2024-01-05 11:55:48,235][model8_pretrain.py][INFO] Epoch:[0/2](625600/4588595) loss:3.405 lr:0.0000100 epoch_Time:25186.0min: [2024-01-05 11:55:48,235][model8_pretrain.py][INFO] Epoch:[0/2](625600/4588595) loss:2.588 lr:0.0000100 epoch_Time:25186.0min: [2024-01-05 11:55:48,235][model8_pretrain.py][INFO] Epoch:[0/2](625600/4588595) loss:3.185 lr:0.0000100 epoch_Time:25186.0min: [2024-01-05 11:55:48,235][model8_pretrain.py][INFO] Epoch:[0/2](625600/4588595) loss:2.885 lr:0.0000100 epoch_Time:25186.0min: [2024-01-05 11:55:48,235][model8_pretrain.py][INFO] Epoch:[0/2](625600/4588595) loss:2.880 lr:0.0000100 epoch_Time:25186.0min: [2024-01-05 11:56:25,199][model8_pretrain.py][INFO] Epoch:[0/2](625700/4588595) loss:2.986 lr:0.0000100 epoch_Time:25186.0min: [2024-01-05 11:56:25,199][model8_pretrain.py][INFO] Epoch:[0/2](625700/4588595) loss:3.034 lr:0.0000100 epoch_Time:25186.0min: [2024-01-05 11:56:25,199][model8_pretrain.py][INFO] Epoch:[0/2](625700/4588595) loss:2.741 lr:0.0000100 epoch_Time:25186.0min: [2024-01-05 11:56:25,199][model8_pretrain.py][INFO] Epoch:[0/2](625700/4588595) loss:2.934 lr:0.0000100 epoch_Time:25186.0min: [2024-01-05 11:56:25,199][model8_pretrain.py][INFO] Epoch:[0/2](625700/4588595) loss:2.639 lr:0.0000100 epoch_Time:25186.0min: [2024-01-05 11:56:25,199][model8_pretrain.py][INFO] Epoch:[0/2](625700/4588595) loss:2.922 lr:0.0000100 epoch_Time:25186.0min: [2024-01-05 11:56:25,199][model8_pretrain.py][INFO] Epoch:[0/2](625700/4588595) loss:3.087 lr:0.0000100 epoch_Time:25186.0min: [2024-01-05 11:56:25,199][model8_pretrain.py][INFO] Epoch:[0/2](625700/4588595) loss:2.306 lr:0.0000100 epoch_Time:25186.0min: [2024-01-05 11:57:02,151][model8_pretrain.py][INFO] Epoch:[0/2](625800/4588595) loss:2.765 lr:0.0000100 epoch_Time:25185.0min: [2024-01-05 11:57:02,151][model8_pretrain.py][INFO] 
Epoch:[0/2](625800/4588595) loss:2.943 lr:0.0000100 epoch_Time:25185.0min: [2024-01-05 11:57:02,151][model8_pretrain.py][INFO] Epoch:[0/2](625800/4588595) loss:2.760 lr:0.0000100 epoch_Time:25185.0min: [2024-01-05 11:57:02,151][model8_pretrain.py][INFO] Epoch:[0/2](625800/4588595) loss:3.052 lr:0.0000100 epoch_Time:25185.0min: [2024-01-05 11:57:02,151][model8_pretrain.py][INFO] Epoch:[0/2](625800/4588595) loss:2.890 lr:0.0000100 epoch_Time:25185.0min: [2024-01-05 11:57:02,151][model8_pretrain.py][INFO] Epoch:[0/2](625800/4588595) loss:2.924 lr:0.0000100 epoch_Time:25185.0min: [2024-01-05 11:57:02,151][model8_pretrain.py][INFO] Epoch:[0/2](625800/4588595) loss:3.114 lr:0.0000100 epoch_Time:25185.0min: [2024-01-05 11:57:02,151][model8_pretrain.py][INFO] Epoch:[0/2](625800/4588595) loss:3.002 lr:0.0000100 epoch_Time:25185.0min: [2024-01-05 11:57:51,160][model8_pretrain.py][INFO] Epoch:[0/2](625900/4588595) loss:2.707 lr:0.0000100 epoch_Time:25185.0min: [2024-01-05 11:57:51,160][model8_pretrain.py][INFO] Epoch:[0/2](625900/4588595) loss:3.071 lr:0.0000100 epoch_Time:25185.0min: [2024-01-05 11:57:51,160][model8_pretrain.py][INFO] Epoch:[0/2](625900/4588595) loss:3.109 lr:0.0000100 epoch_Time:25185.0min: [2024-01-05 11:57:51,160][model8_pretrain.py][INFO] Epoch:[0/2](625900/4588595) loss:3.111 lr:0.0000100 epoch_Time:25185.0min: [2024-01-05 11:57:51,160][model8_pretrain.py][INFO] Epoch:[0/2](625900/4588595) loss:3.301 lr:0.0000100 epoch_Time:25185.0min: [2024-01-05 11:57:51,160][model8_pretrain.py][INFO] Epoch:[0/2](625900/4588595) loss:2.843 lr:0.0000100 epoch_Time:25185.0min: [2024-01-05 11:57:51,160][model8_pretrain.py][INFO] Epoch:[0/2](625900/4588595) loss:2.651 lr:0.0000100 epoch_Time:25185.0min: [2024-01-05 11:57:51,160][model8_pretrain.py][INFO] Epoch:[0/2](625900/4588595) loss:2.408 lr:0.0000100 epoch_Time:25185.0min: [2024-01-05 11:58:28,095][model8_pretrain.py][INFO] Epoch:[0/2](626000/4588595) loss:3.125 lr:0.0000100 epoch_Time:25185.0min: [2024-01-05 
11:58:28,095][model8_pretrain.py][INFO] Epoch:[0/2](626000/4588595) loss:3.075 lr:0.0000100 epoch_Time:25185.0min: [2024-01-05 11:58:28,096][model8_pretrain.py][INFO] Epoch:[0/2](626000/4588595) loss:2.343 lr:0.0000100 epoch_Time:25185.0min: [2024-01-05 11:58:28,096][model8_pretrain.py][INFO] Epoch:[0/2](626000/4588595) loss:2.494 lr:0.0000100 epoch_Time:25185.0min: [2024-01-05 11:58:28,096][model8_pretrain.py][INFO] Epoch:[0/2](626000/4588595) loss:3.084 lr:0.0000100 epoch_Time:25185.0min: [2024-01-05 11:58:28,097][model8_pretrain.py][INFO] Epoch:[0/2](626000/4588595) loss:3.297 lr:0.0000100 epoch_Time:25185.0min: [2024-01-05 11:58:28,097][model8_pretrain.py][INFO] Epoch:[0/2](626000/4588595) loss:3.154 lr:0.0000100 epoch_Time:25185.0min: [2024-01-05 11:58:28,097][model8_pretrain.py][INFO] Epoch:[0/2](626000/4588595) loss:2.685 lr:0.0000100 epoch_Time:25185.0min: [2024-01-05 11:59:05,035][model8_pretrain.py][INFO] Epoch:[0/2](626100/4588595) loss:2.814 lr:0.0000100 epoch_Time:25184.0min: [2024-01-05 11:59:05,035][model8_pretrain.py][INFO] Epoch:[0/2](626100/4588595) loss:2.698 lr:0.0000100 epoch_Time:25184.0min: [2024-01-05 11:59:05,035][model8_pretrain.py][INFO] Epoch:[0/2](626100/4588595) loss:3.118 lr:0.0000100 epoch_Time:25184.0min: [2024-01-05 11:59:05,035][model8_pretrain.py][INFO] Epoch:[0/2](626100/4588595) loss:2.604 lr:0.0000100 epoch_Time:25184.0min: [2024-01-05 11:59:05,035][model8_pretrain.py][INFO] Epoch:[0/2](626100/4588595) loss:2.676 lr:0.0000100 epoch_Time:25184.0min: [2024-01-05 11:59:05,035][model8_pretrain.py][INFO] Epoch:[0/2](626100/4588595) loss:2.795 lr:0.0000100 epoch_Time:25184.0min: [2024-01-05 11:59:05,035][model8_pretrain.py][INFO] Epoch:[0/2](626100/4588595) loss:2.911 lr:0.0000100 epoch_Time:25184.0min: [2024-01-05 11:59:05,035][model8_pretrain.py][INFO] Epoch:[0/2](626100/4588595) loss:3.198 lr:0.0000100 epoch_Time:25184.0min: [2024-01-05 11:59:41,965][model8_pretrain.py][INFO] Epoch:[0/2](626200/4588595) loss:2.836 lr:0.0000100 
epoch_Time:25184.0min: [2024-01-05 11:59:41,965][model8_pretrain.py][INFO] Epoch:[0/2](626200/4588595) loss:3.005 lr:0.0000100 epoch_Time:25184.0min: [2024-01-05 11:59:41,966][model8_pretrain.py][INFO] Epoch:[0/2](626200/4588595) loss:3.161 lr:0.0000100 epoch_Time:25184.0min: [2024-01-05 11:59:41,966][model8_pretrain.py][INFO] Epoch:[0/2](626200/4588595) loss:2.954 lr:0.0000100 epoch_Time:25184.0min: [2024-01-05 11:59:41,966][model8_pretrain.py][INFO] Epoch:[0/2](626200/4588595) loss:2.874 lr:0.0000100 epoch_Time:25184.0min: [2024-01-05 11:59:41,966][model8_pretrain.py][INFO] Epoch:[0/2](626200/4588595) loss:3.312 lr:0.0000100 epoch_Time:25184.0min: [2024-01-05 11:59:41,966][model8_pretrain.py][INFO] Epoch:[0/2](626200/4588595) loss:2.889 lr:0.0000100 epoch_Time:25184.0min: [2024-01-05 11:59:41,966][model8_pretrain.py][INFO] Epoch:[0/2](626200/4588595) loss:2.957 lr:0.0000100 epoch_Time:25184.0min: [2024-01-05 12:00:18,912][model8_pretrain.py][INFO] Epoch:[0/2](626300/4588595) loss:2.633 lr:0.0000100 epoch_Time:25183.0min: [2024-01-05 12:00:18,912][model8_pretrain.py][INFO] Epoch:[0/2](626300/4588595) loss:3.103 lr:0.0000100 epoch_Time:25183.0min: [2024-01-05 12:00:18,912][model8_pretrain.py][INFO] Epoch:[0/2](626300/4588595) loss:2.044 lr:0.0000100 epoch_Time:25183.0min: [2024-01-05 12:00:18,912][model8_pretrain.py][INFO] Epoch:[0/2](626300/4588595) loss:2.201 lr:0.0000100 epoch_Time:25183.0min: [2024-01-05 12:00:18,912][model8_pretrain.py][INFO] Epoch:[0/2](626300/4588595) loss:2.763 lr:0.0000100 epoch_Time:25183.0min: [2024-01-05 12:00:18,912][model8_pretrain.py][INFO] Epoch:[0/2](626300/4588595) loss:2.218 lr:0.0000100 epoch_Time:25183.0min: [2024-01-05 12:00:18,912][model8_pretrain.py][INFO] Epoch:[0/2](626300/4588595) loss:2.747 lr:0.0000100 epoch_Time:25183.0min: [2024-01-05 12:00:18,912][model8_pretrain.py][INFO] Epoch:[0/2](626300/4588595) loss:2.819 lr:0.0000100 epoch_Time:25183.0min: [2024-01-05 12:00:55,877][model8_pretrain.py][INFO] 
Epoch:[0/2](626400/4588595) loss:3.146 lr:0.0000100 epoch_Time:25182.0min: [2024-01-05 12:00:55,877][model8_pretrain.py][INFO] Epoch:[0/2](626400/4588595) loss:2.843 lr:0.0000100 epoch_Time:25182.0min: [2024-01-05 12:00:55,877][model8_pretrain.py][INFO] Epoch:[0/2](626400/4588595) loss:3.081 lr:0.0000100 epoch_Time:25182.0min: [2024-01-05 12:00:55,877][model8_pretrain.py][INFO] Epoch:[0/2](626400/4588595) loss:2.935 lr:0.0000100 epoch_Time:25182.0min: [2024-01-05 12:00:55,877][model8_pretrain.py][INFO] Epoch:[0/2](626400/4588595) loss:3.514 lr:0.0000100 epoch_Time:25182.0min: [2024-01-05 12:00:55,877][model8_pretrain.py][INFO] Epoch:[0/2](626400/4588595) loss:2.529 lr:0.0000100 epoch_Time:25182.0min: [2024-01-05 12:00:55,877][model8_pretrain.py][INFO] Epoch:[0/2](626400/4588595) loss:2.548 lr:0.0000100 epoch_Time:25182.0min: [2024-01-05 12:00:55,877][model8_pretrain.py][INFO] Epoch:[0/2](626400/4588595) loss:2.845 lr:0.0000100 epoch_Time:25182.0min: [2024-01-05 12:01:32,832][model8_pretrain.py][INFO] Epoch:[0/2](626500/4588595) loss:2.889 lr:0.0000100 epoch_Time:25182.0min: [2024-01-05 12:01:32,832][model8_pretrain.py][INFO] Epoch:[0/2](626500/4588595) loss:2.712 lr:0.0000100 epoch_Time:25182.0min: [2024-01-05 12:01:32,832][model8_pretrain.py][INFO] Epoch:[0/2](626500/4588595) loss:2.715 lr:0.0000100 epoch_Time:25182.0min: [2024-01-05 12:01:32,832][model8_pretrain.py][INFO] Epoch:[0/2](626500/4588595) loss:2.786 lr:0.0000100 epoch_Time:25182.0min: [2024-01-05 12:01:32,832][model8_pretrain.py][INFO] Epoch:[0/2](626500/4588595) loss:2.958 lr:0.0000100 epoch_Time:25182.0min: [2024-01-05 12:01:32,832][model8_pretrain.py][INFO] Epoch:[0/2](626500/4588595) loss:2.353 lr:0.0000100 epoch_Time:25182.0min: [2024-01-05 12:01:32,832][model8_pretrain.py][INFO] Epoch:[0/2](626500/4588595) loss:2.389 lr:0.0000100 epoch_Time:25182.0min: [2024-01-05 12:01:32,833][model8_pretrain.py][INFO] Epoch:[0/2](626500/4588595) loss:2.861 lr:0.0000100 epoch_Time:25182.0min: [2024-01-05 
12:02:09,772][model8_pretrain.py][INFO] Epoch:[0/2](626600/4588595) loss:2.832 lr:0.0000100 epoch_Time:25180.0min: [2024-01-05 12:02:09,772][model8_pretrain.py][INFO] Epoch:[0/2](626600/4588595) loss:2.941 lr:0.0000100 epoch_Time:25180.0min: [2024-01-05 12:02:09,772][model8_pretrain.py][INFO] Epoch:[0/2](626600/4588595) loss:2.832 lr:0.0000100 epoch_Time:25180.0min: [2024-01-05 12:02:09,772][model8_pretrain.py][INFO] Epoch:[0/2](626600/4588595) loss:3.023 lr:0.0000100 epoch_Time:25180.0min: [2024-01-05 12:02:09,772][model8_pretrain.py][INFO] Epoch:[0/2](626600/4588595) loss:2.456 lr:0.0000100 epoch_Time:25180.0min: [2024-01-05 12:02:09,772][model8_pretrain.py][INFO] Epoch:[0/2](626600/4588595) loss:2.414 lr:0.0000100 epoch_Time:25180.0min: [2024-01-05 12:02:09,772][model8_pretrain.py][INFO] Epoch:[0/2](626600/4588595) loss:2.681 lr:0.0000100 epoch_Time:25180.0min: [2024-01-05 12:02:09,772][model8_pretrain.py][INFO] Epoch:[0/2](626600/4588595) loss:2.961 lr:0.0000100 epoch_Time:25180.0min: [2024-01-05 12:02:58,657][model8_pretrain.py][INFO] Epoch:[0/2](626700/4588595) loss:3.298 lr:0.0000100 epoch_Time:25181.0min: [2024-01-05 12:02:58,657][model8_pretrain.py][INFO] Epoch:[0/2](626700/4588595) loss:2.384 lr:0.0000100 epoch_Time:25181.0min: [2024-01-05 12:02:58,657][model8_pretrain.py][INFO] Epoch:[0/2](626700/4588595) loss:3.103 lr:0.0000100 epoch_Time:25181.0min: [2024-01-05 12:02:58,657][model8_pretrain.py][INFO] Epoch:[0/2](626700/4588595) loss:2.565 lr:0.0000100 epoch_Time:25181.0min: [2024-01-05 12:02:58,657][model8_pretrain.py][INFO] Epoch:[0/2](626700/4588595) loss:2.907 lr:0.0000100 epoch_Time:25181.0min: [2024-01-05 12:02:58,657][model8_pretrain.py][INFO] Epoch:[0/2](626700/4588595) loss:2.633 lr:0.0000100 epoch_Time:25181.0min: [2024-01-05 12:02:58,657][model8_pretrain.py][INFO] Epoch:[0/2](626700/4588595) loss:3.493 lr:0.0000100 epoch_Time:25181.0min: [2024-01-05 12:02:58,657][model8_pretrain.py][INFO] Epoch:[0/2](626700/4588595) loss:3.051 lr:0.0000100 
epoch_Time:25181.0min: [2024-01-05 12:03:35,578][model8_pretrain.py][INFO] Epoch:[0/2](626800/4588595) loss:2.676 lr:0.0000100 epoch_Time:25181.0min: [2024-01-05 12:03:35,578][model8_pretrain.py][INFO] Epoch:[0/2](626800/4588595) loss:3.151 lr:0.0000100 epoch_Time:25181.0min: [2024-01-05 12:03:35,578][model8_pretrain.py][INFO] Epoch:[0/2](626800/4588595) loss:2.623 lr:0.0000100 epoch_Time:25181.0min: [2024-01-05 12:03:35,578][model8_pretrain.py][INFO] Epoch:[0/2](626800/4588595) loss:3.003 lr:0.0000100 epoch_Time:25181.0min: [2024-01-05 12:03:35,578][model8_pretrain.py][INFO] Epoch:[0/2](626800/4588595) loss:2.283 lr:0.0000100 epoch_Time:25181.0min: [2024-01-05 12:03:35,579][model8_pretrain.py][INFO] Epoch:[0/2](626800/4588595) loss:3.269 lr:0.0000100 epoch_Time:25181.0min: [2024-01-05 12:03:35,579][model8_pretrain.py][INFO] Epoch:[0/2](626800/4588595) loss:2.976 lr:0.0000100 epoch_Time:25181.0min: [2024-01-05 12:03:35,579][model8_pretrain.py][INFO] Epoch:[0/2](626800/4588595) loss:3.015 lr:0.0000100 epoch_Time:25181.0min: [2024-01-05 12:04:12,515][model8_pretrain.py][INFO] Epoch:[0/2](626900/4588595) loss:3.257 lr:0.0000100 epoch_Time:25179.0min: [2024-01-05 12:04:12,515][model8_pretrain.py][INFO] Epoch:[0/2](626900/4588595) loss:2.961 lr:0.0000100 epoch_Time:25179.0min: [2024-01-05 12:04:12,515][model8_pretrain.py][INFO] Epoch:[0/2](626900/4588595) loss:2.601 lr:0.0000100 epoch_Time:25179.0min: [2024-01-05 12:04:12,515][model8_pretrain.py][INFO] Epoch:[0/2](626900/4588595) loss:2.839 lr:0.0000100 epoch_Time:25179.0min: [2024-01-05 12:04:12,515][model8_pretrain.py][INFO] Epoch:[0/2](626900/4588595) loss:3.375 lr:0.0000100 epoch_Time:25179.0min: [2024-01-05 12:04:12,515][model8_pretrain.py][INFO] Epoch:[0/2](626900/4588595) loss:3.195 lr:0.0000100 epoch_Time:25179.0min: [2024-01-05 12:04:12,515][model8_pretrain.py][INFO] Epoch:[0/2](626900/4588595) loss:2.598 lr:0.0000100 epoch_Time:25179.0min: [2024-01-05 12:04:12,515][model8_pretrain.py][INFO] 
Epoch:[0/2](626900/4588595) loss:3.384 lr:0.0000100 epoch_Time:25179.0min: [2024-01-05 12:04:49,462][model8_pretrain.py][INFO] Epoch:[0/2](627000/4588595) loss:2.967 lr:0.0000100 epoch_Time:25178.0min: [2024-01-05 12:04:49,462][model8_pretrain.py][INFO] Epoch:[0/2](627000/4588595) loss:2.738 lr:0.0000100 epoch_Time:25178.0min: [2024-01-05 12:04:49,462][model8_pretrain.py][INFO] Epoch:[0/2](627000/4588595) loss:2.979 lr:0.0000100 epoch_Time:25178.0min: [2024-01-05 12:04:49,462][model8_pretrain.py][INFO] Epoch:[0/2](627000/4588595) loss:3.474 lr:0.0000100 epoch_Time:25178.0min: [2024-01-05 12:04:49,462][model8_pretrain.py][INFO] Epoch:[0/2](627000/4588595) loss:2.842 lr:0.0000100 epoch_Time:25178.0min: [2024-01-05 12:04:49,462][model8_pretrain.py][INFO] Epoch:[0/2](627000/4588595) loss:3.259 lr:0.0000100 epoch_Time:25178.0min: [2024-01-05 12:04:49,463][model8_pretrain.py][INFO] Epoch:[0/2](627000/4588595) loss:3.244 lr:0.0000100 epoch_Time:25178.0min: [2024-01-05 12:04:49,463][model8_pretrain.py][INFO] Epoch:[0/2](627000/4588595) loss:3.215 lr:0.0000100 epoch_Time:25178.0min: [2024-01-05 12:05:26,399][model8_pretrain.py][INFO] Epoch:[0/2](627100/4588595) loss:2.641 lr:0.0000100 epoch_Time:25178.0min: [2024-01-05 12:05:26,399][model8_pretrain.py][INFO] Epoch:[0/2](627100/4588595) loss:2.650 lr:0.0000100 epoch_Time:25178.0min: [2024-01-05 12:05:26,399][model8_pretrain.py][INFO] Epoch:[0/2](627100/4588595) loss:3.051 lr:0.0000100 epoch_Time:25178.0min: [2024-01-05 12:05:26,399][model8_pretrain.py][INFO] Epoch:[0/2](627100/4588595) loss:2.956 lr:0.0000100 epoch_Time:25178.0min: [2024-01-05 12:05:26,399][model8_pretrain.py][INFO] Epoch:[0/2](627100/4588595) loss:2.797 lr:0.0000100 epoch_Time:25178.0min: [2024-01-05 12:05:26,399][model8_pretrain.py][INFO] Epoch:[0/2](627100/4588595) loss:2.659 lr:0.0000100 epoch_Time:25178.0min: [2024-01-05 12:05:26,400][model8_pretrain.py][INFO] Epoch:[0/2](627100/4588595) loss:3.198 lr:0.0000100 epoch_Time:25178.0min: [2024-01-05 
12:05:26,400][model8_pretrain.py][INFO] Epoch:[0/2](627100/4588595) loss:3.090 lr:0.0000100 epoch_Time:25178.0min: [2024-01-05 12:06:03,361][model8_pretrain.py][INFO] Epoch:[0/2](627200/4588595) loss:2.952 lr:0.0000100 epoch_Time:25177.0min: [2024-01-05 12:06:03,361][model8_pretrain.py][INFO] Epoch:[0/2](627200/4588595) loss:2.695 lr:0.0000100 epoch_Time:25177.0min: [2024-01-05 12:06:03,361][model8_pretrain.py][INFO] Epoch:[0/2](627200/4588595) loss:2.797 lr:0.0000100 epoch_Time:25177.0min: [2024-01-05 12:06:03,361][model8_pretrain.py][INFO] Epoch:[0/2](627200/4588595) loss:2.982 lr:0.0000100 epoch_Time:25177.0min: [2024-01-05 12:06:03,361][model8_pretrain.py][INFO] Epoch:[0/2](627200/4588595) loss:2.655 lr:0.0000100 epoch_Time:25177.0min: [2024-01-05 12:06:03,361][model8_pretrain.py][INFO] Epoch:[0/2](627200/4588595) loss:3.230 lr:0.0000100 epoch_Time:25177.0min: [2024-01-05 12:06:03,361][model8_pretrain.py][INFO] Epoch:[0/2](627200/4588595) loss:2.975 lr:0.0000100 epoch_Time:25177.0min: [2024-01-05 12:06:03,362][model8_pretrain.py][INFO] Epoch:[0/2](627200/4588595) loss:2.837 lr:0.0000100 epoch_Time:25177.0min: [2024-01-05 12:06:40,308][model8_pretrain.py][INFO] Epoch:[0/2](627300/4588595) loss:3.037 lr:0.0000100 epoch_Time:25177.0min: [2024-01-05 12:06:40,308][model8_pretrain.py][INFO] Epoch:[0/2](627300/4588595) loss:2.833 lr:0.0000100 epoch_Time:25177.0min: [2024-01-05 12:06:40,308][model8_pretrain.py][INFO] Epoch:[0/2](627300/4588595) loss:3.079 lr:0.0000100 epoch_Time:25177.0min: [2024-01-05 12:06:40,308][model8_pretrain.py][INFO] Epoch:[0/2](627300/4588595) loss:2.513 lr:0.0000100 epoch_Time:25177.0min: [2024-01-05 12:06:40,308][model8_pretrain.py][INFO] Epoch:[0/2](627300/4588595) loss:2.818 lr:0.0000100 epoch_Time:25177.0min: [2024-01-05 12:06:40,308][model8_pretrain.py][INFO] Epoch:[0/2](627300/4588595) loss:2.796 lr:0.0000100 epoch_Time:25177.0min: [2024-01-05 12:06:40,308][model8_pretrain.py][INFO] Epoch:[0/2](627300/4588595) loss:2.728 lr:0.0000100 
epoch_Time:25177.0min: [2024-01-05 12:06:40,309][model8_pretrain.py][INFO] Epoch:[0/2](627300/4588595) loss:2.682 lr:0.0000100 epoch_Time:25177.0min: [2024-01-05 12:07:17,239][model8_pretrain.py][INFO] Epoch:[0/2](627400/4588595) loss:3.138 lr:0.0000100 epoch_Time:25176.0min: [2024-01-05 12:07:17,240][model8_pretrain.py][INFO] Epoch:[0/2](627400/4588595) loss:3.182 lr:0.0000100 epoch_Time:25176.0min: [2024-01-05 12:07:17,240][model8_pretrain.py][INFO] Epoch:[0/2](627400/4588595) loss:2.518 lr:0.0000100 epoch_Time:25176.0min: [2024-01-05 12:07:17,240][model8_pretrain.py][INFO] Epoch:[0/2](627400/4588595) loss:2.257 lr:0.0000100 epoch_Time:25176.0min: [2024-01-05 12:07:17,240][model8_pretrain.py][INFO] Epoch:[0/2](627400/4588595) loss:3.033 lr:0.0000100 epoch_Time:25176.0min: [2024-01-05 12:07:17,239][model8_pretrain.py][INFO] Epoch:[0/2](627400/4588595) loss:2.769 lr:0.0000100 epoch_Time:25176.0min: [2024-01-05 12:07:17,240][model8_pretrain.py][INFO] Epoch:[0/2](627400/4588595) loss:3.106 lr:0.0000100 epoch_Time:25176.0min: [2024-01-05 12:07:17,240][model8_pretrain.py][INFO] Epoch:[0/2](627400/4588595) loss:2.593 lr:0.0000100 epoch_Time:25176.0min: [2024-01-05 12:08:06,176][model8_pretrain.py][INFO] Epoch:[0/2](627500/4588595) loss:2.982 lr:0.0000100 epoch_Time:25176.0min: [2024-01-05 12:08:06,176][model8_pretrain.py][INFO] Epoch:[0/2](627500/4588595) loss:2.748 lr:0.0000100 epoch_Time:25176.0min: [2024-01-05 12:08:06,176][model8_pretrain.py][INFO] Epoch:[0/2](627500/4588595) loss:2.540 lr:0.0000100 epoch_Time:25176.0min: [2024-01-05 12:08:06,176][model8_pretrain.py][INFO] Epoch:[0/2](627500/4588595) loss:2.814 lr:0.0000100 epoch_Time:25176.0min: [2024-01-05 12:08:06,176][model8_pretrain.py][INFO] Epoch:[0/2](627500/4588595) loss:3.249 lr:0.0000100 epoch_Time:25176.0min: [2024-01-05 12:08:06,176][model8_pretrain.py][INFO] Epoch:[0/2](627500/4588595) loss:2.616 lr:0.0000100 epoch_Time:25176.0min: [2024-01-05 12:08:06,176][model8_pretrain.py][INFO] 
Epoch:[0/2](627500/4588595) loss:3.161 lr:0.0000100 epoch_Time:25176.0min: [2024-01-05 12:08:06,177][model8_pretrain.py][INFO] Epoch:[0/2](627500/4588595) loss:3.133 lr:0.0000100 epoch_Time:25176.0min: [2024-01-05 12:08:43,080][model8_pretrain.py][INFO] Epoch:[0/2](627600/4588595) loss:2.824 lr:0.0000100 epoch_Time:25176.0min: [2024-01-05 12:08:43,080][model8_pretrain.py][INFO] Epoch:[0/2](627600/4588595) loss:2.180 lr:0.0000100 epoch_Time:25176.0min: [2024-01-05 12:08:43,080][model8_pretrain.py][INFO] Epoch:[0/2](627600/4588595) loss:2.649 lr:0.0000100 epoch_Time:25176.0min: [2024-01-05 12:08:43,080][model8_pretrain.py][INFO] Epoch:[0/2](627600/4588595) loss:3.124 lr:0.0000100 epoch_Time:25176.0min: [2024-01-05 12:08:43,080][model8_pretrain.py][INFO] Epoch:[0/2](627600/4588595) loss:2.506 lr:0.0000100 epoch_Time:25176.0min: [2024-01-05 12:08:43,080][model8_pretrain.py][INFO] Epoch:[0/2](627600/4588595) loss:1.790 lr:0.0000100 epoch_Time:25176.0min: [2024-01-05 12:08:43,080][model8_pretrain.py][INFO] Epoch:[0/2](627600/4588595) loss:2.647 lr:0.0000100 epoch_Time:25176.0min: [2024-01-05 12:08:43,081][model8_pretrain.py][INFO] Epoch:[0/2](627600/4588595) loss:2.774 lr:0.0000100 epoch_Time:25176.0min: [2024-01-05 12:09:20,027][model8_pretrain.py][INFO] Epoch:[0/2](627700/4588595) loss:2.364 lr:0.0000100 epoch_Time:25175.0min: [2024-01-05 12:09:20,027][model8_pretrain.py][INFO] Epoch:[0/2](627700/4588595) loss:3.433 lr:0.0000100 epoch_Time:25175.0min: [2024-01-05 12:09:20,027][model8_pretrain.py][INFO] Epoch:[0/2](627700/4588595) loss:3.267 lr:0.0000100 epoch_Time:25175.0min: [2024-01-05 12:09:20,027][model8_pretrain.py][INFO] Epoch:[0/2](627700/4588595) loss:2.803 lr:0.0000100 epoch_Time:25175.0min: [2024-01-05 12:09:20,027][model8_pretrain.py][INFO] Epoch:[0/2](627700/4588595) loss:2.749 lr:0.0000100 epoch_Time:25175.0min: [2024-01-05 12:09:20,028][model8_pretrain.py][INFO] Epoch:[0/2](627700/4588595) loss:3.006 lr:0.0000100 epoch_Time:25175.0min: [2024-01-05 
12:09:20,027][model8_pretrain.py][INFO] Epoch:[0/2](627700/4588595) loss:3.163 lr:0.0000100 epoch_Time:25175.0min: [2024-01-05 12:09:20,028][model8_pretrain.py][INFO] Epoch:[0/2](627700/4588595) loss:2.789 lr:0.0000100 epoch_Time:25175.0min: [2024-01-05 12:09:56,967][model8_pretrain.py][INFO] Epoch:[0/2](627800/4588595) loss:2.875 lr:0.0000100 epoch_Time:25174.0min: [2024-01-05 12:09:56,967][model8_pretrain.py][INFO] Epoch:[0/2](627800/4588595) loss:2.985 lr:0.0000100 epoch_Time:25174.0min: [2024-01-05 12:09:56,967][model8_pretrain.py][INFO] Epoch:[0/2](627800/4588595) loss:2.277 lr:0.0000100 epoch_Time:25174.0min: [2024-01-05 12:09:56,967][model8_pretrain.py][INFO] Epoch:[0/2](627800/4588595) loss:3.213 lr:0.0000100 epoch_Time:25174.0min: [2024-01-05 12:09:56,967][model8_pretrain.py][INFO] Epoch:[0/2](627800/4588595) loss:2.670 lr:0.0000100 epoch_Time:25174.0min: [2024-01-05 12:09:56,967][model8_pretrain.py][INFO] Epoch:[0/2](627800/4588595) loss:2.577 lr:0.0000100 epoch_Time:25174.0min: [2024-01-05 12:09:56,967][model8_pretrain.py][INFO] Epoch:[0/2](627800/4588595) loss:2.648 lr:0.0000100 epoch_Time:25174.0min: [2024-01-05 12:09:56,967][model8_pretrain.py][INFO] Epoch:[0/2](627800/4588595) loss:3.140 lr:0.0000100 epoch_Time:25174.0min: [2024-01-05 12:10:33,925][model8_pretrain.py][INFO] Epoch:[0/2](627900/4588595) loss:3.110 lr:0.0000100 epoch_Time:25173.0min: [2024-01-05 12:10:33,925][model8_pretrain.py][INFO] Epoch:[0/2](627900/4588595) loss:2.800 lr:0.0000100 epoch_Time:25173.0min: [2024-01-05 12:10:33,925][model8_pretrain.py][INFO] Epoch:[0/2](627900/4588595) loss:2.833 lr:0.0000100 epoch_Time:25173.0min: [2024-01-05 12:10:33,925][model8_pretrain.py][INFO] Epoch:[0/2](627900/4588595) loss:2.621 lr:0.0000100 epoch_Time:25173.0min: [2024-01-05 12:10:33,925][model8_pretrain.py][INFO] Epoch:[0/2](627900/4588595) loss:2.411 lr:0.0000100 epoch_Time:25173.0min: [2024-01-05 12:10:33,925][model8_pretrain.py][INFO] Epoch:[0/2](627900/4588595) loss:3.241 lr:0.0000100 
epoch_Time:25173.0min: [2024-01-05 12:10:33,925][model8_pretrain.py][INFO] Epoch:[0/2](627900/4588595) loss:3.401 lr:0.0000100 epoch_Time:25173.0min: [2024-01-05 12:10:33,925][model8_pretrain.py][INFO] Epoch:[0/2](627900/4588595) loss:2.732 lr:0.0000100 epoch_Time:25173.0min: [2024-01-05 12:11:10,923][model8_pretrain.py][INFO] Epoch:[0/2](628000/4588595) loss:3.005 lr:0.0000100 epoch_Time:25172.0min: [2024-01-05 12:11:10,923][model8_pretrain.py][INFO] Epoch:[0/2](628000/4588595) loss:3.117 lr:0.0000100 epoch_Time:25172.0min: [2024-01-05 12:11:10,923][model8_pretrain.py][INFO] Epoch:[0/2](628000/4588595) loss:3.045 lr:0.0000100 epoch_Time:25172.0min: [2024-01-05 12:11:10,923][model8_pretrain.py][INFO] Epoch:[0/2](628000/4588595) loss:2.873 lr:0.0000100 epoch_Time:25172.0min: [2024-01-05 12:11:10,923][model8_pretrain.py][INFO] Epoch:[0/2](628000/4588595) loss:2.870 lr:0.0000100 epoch_Time:25172.0min: [2024-01-05 12:11:10,923][model8_pretrain.py][INFO] Epoch:[0/2](628000/4588595) loss:3.440 lr:0.0000100 epoch_Time:25172.0min: [2024-01-05 12:11:10,923][model8_pretrain.py][INFO] Epoch:[0/2](628000/4588595) loss:3.065 lr:0.0000100 epoch_Time:25172.0min: [2024-01-05 12:11:10,923][model8_pretrain.py][INFO] Epoch:[0/2](628000/4588595) loss:3.107 lr:0.0000100 epoch_Time:25172.0min: [2024-01-05 12:11:47,863][model8_pretrain.py][INFO] Epoch:[0/2](628100/4588595) loss:3.189 lr:0.0000100 epoch_Time:25171.0min: [2024-01-05 12:11:47,863][model8_pretrain.py][INFO] Epoch:[0/2](628100/4588595) loss:2.786 lr:0.0000100 epoch_Time:25171.0min: [2024-01-05 12:11:47,863][model8_pretrain.py][INFO] Epoch:[0/2](628100/4588595) loss:2.531 lr:0.0000100 epoch_Time:25171.0min: [2024-01-05 12:11:47,863][model8_pretrain.py][INFO] Epoch:[0/2](628100/4588595) loss:2.761 lr:0.0000100 epoch_Time:25171.0min: [2024-01-05 12:11:47,863][model8_pretrain.py][INFO] Epoch:[0/2](628100/4588595) loss:2.953 lr:0.0000100 epoch_Time:25171.0min: [2024-01-05 12:11:47,863][model8_pretrain.py][INFO] 
Epoch:[0/2](628100/4588595) loss:3.085 lr:0.0000100 epoch_Time:25171.0min: [2024-01-05 12:11:47,863][model8_pretrain.py][INFO] Epoch:[0/2](628100/4588595) loss:2.945 lr:0.0000100 epoch_Time:25171.0min: [2024-01-05 12:11:47,863][model8_pretrain.py][INFO] Epoch:[0/2](628100/4588595) loss:3.491 lr:0.0000100 epoch_Time:25171.0min: [2024-01-05 12:12:24,809][model8_pretrain.py][INFO] Epoch:[0/2](628200/4588595) loss:3.479 lr:0.0000100 epoch_Time:25171.0min: [2024-01-05 12:12:24,809][model8_pretrain.py][INFO] Epoch:[0/2](628200/4588595) loss:3.044 lr:0.0000100 epoch_Time:25171.0min: [2024-01-05 12:12:24,809][model8_pretrain.py][INFO] Epoch:[0/2](628200/4588595) loss:2.761 lr:0.0000100 epoch_Time:25171.0min: [2024-01-05 12:12:24,809][model8_pretrain.py][INFO] Epoch:[0/2](628200/4588595) loss:2.776 lr:0.0000100 epoch_Time:25171.0min: [2024-01-05 12:12:24,809][model8_pretrain.py][INFO] Epoch:[0/2](628200/4588595) loss:3.039 lr:0.0000100 epoch_Time:25171.0min: [2024-01-05 12:12:24,809][model8_pretrain.py][INFO] Epoch:[0/2](628200/4588595) loss:2.476 lr:0.0000100 epoch_Time:25171.0min: [2024-01-05 12:12:24,809][model8_pretrain.py][INFO] Epoch:[0/2](628200/4588595) loss:2.209 lr:0.0000100 epoch_Time:25171.0min: [2024-01-05 12:12:24,809][model8_pretrain.py][INFO] Epoch:[0/2](628200/4588595) loss:3.205 lr:0.0000100 epoch_Time:25171.0min: [2024-01-05 12:13:13,884][model8_pretrain.py][INFO] Epoch:[0/2](628300/4588595) loss:2.606 lr:0.0000100 epoch_Time:25171.0min: [2024-01-05 12:13:13,884][model8_pretrain.py][INFO] Epoch:[0/2](628300/4588595) loss:2.459 lr:0.0000100 epoch_Time:25171.0min: [2024-01-05 12:13:13,884][model8_pretrain.py][INFO] Epoch:[0/2](628300/4588595) loss:2.600 lr:0.0000100 epoch_Time:25171.0min: [2024-01-05 12:13:13,884][model8_pretrain.py][INFO] Epoch:[0/2](628300/4588595) loss:2.543 lr:0.0000100 epoch_Time:25171.0min: [2024-01-05 12:13:13,884][model8_pretrain.py][INFO] Epoch:[0/2](628300/4588595) loss:2.920 lr:0.0000100 epoch_Time:25171.0min: [2024-01-05 
12:13:13,884][model8_pretrain.py][INFO] Epoch:[0/2](628300/4588595) loss:2.461 lr:0.0000100 epoch_Time:25171.0min: [2024-01-05 12:13:13,884][model8_pretrain.py][INFO] Epoch:[0/2](628300/4588595) loss:2.743 lr:0.0000100 epoch_Time:25171.0min: [2024-01-05 12:13:13,884][model8_pretrain.py][INFO] Epoch:[0/2](628300/4588595) loss:3.342 lr:0.0000100 epoch_Time:25171.0min: [2024-01-05 12:13:50,810][model8_pretrain.py][INFO] Epoch:[0/2](628400/4588595) loss:2.791 lr:0.0000100 epoch_Time:25170.0min: [2024-01-05 12:13:50,810][model8_pretrain.py][INFO] Epoch:[0/2](628400/4588595) loss:2.323 lr:0.0000100 epoch_Time:25170.0min: [2024-01-05 12:13:50,810][model8_pretrain.py][INFO] Epoch:[0/2](628400/4588595) loss:2.988 lr:0.0000100 epoch_Time:25170.0min: [2024-01-05 12:13:50,811][model8_pretrain.py][INFO] Epoch:[0/2](628400/4588595) loss:3.136 lr:0.0000100 epoch_Time:25170.0min: [2024-01-05 12:13:50,811][model8_pretrain.py][INFO] Epoch:[0/2](628400/4588595) loss:2.876 lr:0.0000100 epoch_Time:25170.0min: [2024-01-05 12:13:50,811][model8_pretrain.py][INFO] Epoch:[0/2](628400/4588595) loss:2.636 lr:0.0000100 epoch_Time:25170.0min: [2024-01-05 12:13:50,811][model8_pretrain.py][INFO] Epoch:[0/2](628400/4588595) loss:3.030 lr:0.0000100 epoch_Time:25170.0min: [2024-01-05 12:13:50,811][model8_pretrain.py][INFO] Epoch:[0/2](628400/4588595) loss:2.742 lr:0.0000100 epoch_Time:25170.0min: [2024-01-05 12:14:27,754][model8_pretrain.py][INFO] Epoch:[0/2](628500/4588595) loss:2.495 lr:0.0000100 epoch_Time:25170.0min: [2024-01-05 12:14:27,754][model8_pretrain.py][INFO] Epoch:[0/2](628500/4588595) loss:2.365 lr:0.0000100 epoch_Time:25170.0min: [2024-01-05 12:14:27,754][model8_pretrain.py][INFO] Epoch:[0/2](628500/4588595) loss:2.975 lr:0.0000100 epoch_Time:25170.0min: [2024-01-05 12:14:27,754][model8_pretrain.py][INFO] Epoch:[0/2](628500/4588595) loss:3.027 lr:0.0000100 epoch_Time:25170.0min: [2024-01-05 12:14:27,754][model8_pretrain.py][INFO] Epoch:[0/2](628500/4588595) loss:2.770 lr:0.0000100 
epoch_Time:25170.0min: [2024-01-05 12:14:27,754][model8_pretrain.py][INFO] Epoch:[0/2](628500/4588595) loss:3.135 lr:0.0000100 epoch_Time:25170.0min: [2024-01-05 12:14:27,754][model8_pretrain.py][INFO] Epoch:[0/2](628500/4588595) loss:3.037 lr:0.0000100 epoch_Time:25170.0min: [2024-01-05 12:14:27,754][model8_pretrain.py][INFO] Epoch:[0/2](628500/4588595) loss:2.350 lr:0.0000100 epoch_Time:25170.0min: [2024-01-05 12:15:04,700][model8_pretrain.py][INFO] Epoch:[0/2](628600/4588595) loss:2.709 lr:0.0000100 epoch_Time:25169.0min: [2024-01-05 12:15:04,700][model8_pretrain.py][INFO] Epoch:[0/2](628600/4588595) loss:2.901 lr:0.0000100 epoch_Time:25169.0min: [2024-01-05 12:15:04,700][model8_pretrain.py][INFO] Epoch:[0/2](628600/4588595) loss:3.020 lr:0.0000100 epoch_Time:25169.0min: [2024-01-05 12:15:04,700][model8_pretrain.py][INFO] Epoch:[0/2](628600/4588595) loss:1.795 lr:0.0000100 epoch_Time:25169.0min: [2024-01-05 12:15:04,700][model8_pretrain.py][INFO] Epoch:[0/2](628600/4588595) loss:3.148 lr:0.0000100 epoch_Time:25169.0min: [2024-01-05 12:15:04,700][model8_pretrain.py][INFO] Epoch:[0/2](628600/4588595) loss:2.771 lr:0.0000100 epoch_Time:25169.0min: [2024-01-05 12:15:04,700][model8_pretrain.py][INFO] Epoch:[0/2](628600/4588595) loss:2.769 lr:0.0000100 epoch_Time:25169.0min: [2024-01-05 12:15:04,700][model8_pretrain.py][INFO] Epoch:[0/2](628600/4588595) loss:2.740 lr:0.0000100 epoch_Time:25169.0min: [2024-01-05 12:15:41,638][model8_pretrain.py][INFO] Epoch:[0/2](628700/4588595) loss:2.705 lr:0.0000100 epoch_Time:25169.0min: [2024-01-05 12:15:41,638][model8_pretrain.py][INFO] Epoch:[0/2](628700/4588595) loss:2.933 lr:0.0000100 epoch_Time:25169.0min: [2024-01-05 12:15:41,638][model8_pretrain.py][INFO] Epoch:[0/2](628700/4588595) loss:3.211 lr:0.0000100 epoch_Time:25169.0min: [2024-01-05 12:15:41,638][model8_pretrain.py][INFO] Epoch:[0/2](628700/4588595) loss:2.540 lr:0.0000100 epoch_Time:25169.0min: [2024-01-05 12:15:41,638][model8_pretrain.py][INFO] 
Epoch:[0/2](628700/4588595) loss:2.080 lr:0.0000100 epoch_Time:25169.0min: [2024-01-05 12:15:41,638][model8_pretrain.py][INFO] Epoch:[0/2](628700/4588595) loss:2.647 lr:0.0000100 epoch_Time:25169.0min: [2024-01-05 12:15:41,638][model8_pretrain.py][INFO] Epoch:[0/2](628700/4588595) loss:2.383 lr:0.0000100 epoch_Time:25169.0min: [2024-01-05 12:15:41,638][model8_pretrain.py][INFO] Epoch:[0/2](628700/4588595) loss:2.695 lr:0.0000100 epoch_Time:25169.0min: [2024-01-05 12:16:18,599][model8_pretrain.py][INFO] Epoch:[0/2](628800/4588595) loss:2.895 lr:0.0000100 epoch_Time:25168.0min: [2024-01-05 12:16:18,599][model8_pretrain.py][INFO] Epoch:[0/2](628800/4588595) loss:3.244 lr:0.0000100 epoch_Time:25168.0min: [2024-01-05 12:16:18,599][model8_pretrain.py][INFO] Epoch:[0/2](628800/4588595) loss:3.357 lr:0.0000100 epoch_Time:25168.0min: [2024-01-05 12:16:18,599][model8_pretrain.py][INFO] Epoch:[0/2](628800/4588595) loss:2.872 lr:0.0000100 epoch_Time:25168.0min: [2024-01-05 12:16:18,599][model8_pretrain.py][INFO] Epoch:[0/2](628800/4588595) loss:2.313 lr:0.0000100 epoch_Time:25168.0min: [2024-01-05 12:16:18,599][model8_pretrain.py][INFO] Epoch:[0/2](628800/4588595) loss:3.007 lr:0.0000100 epoch_Time:25168.0min: [2024-01-05 12:16:18,599][model8_pretrain.py][INFO] Epoch:[0/2](628800/4588595) loss:2.185 lr:0.0000100 epoch_Time:25168.0min: [2024-01-05 12:16:18,599][model8_pretrain.py][INFO] Epoch:[0/2](628800/4588595) loss:2.841 lr:0.0000100 epoch_Time:25168.0min: [2024-01-05 12:16:55,545][model8_pretrain.py][INFO] Epoch:[0/2](628900/4588595) loss:2.744 lr:0.0000100 epoch_Time:25166.0min: [2024-01-05 12:16:55,545][model8_pretrain.py][INFO] Epoch:[0/2](628900/4588595) loss:2.050 lr:0.0000100 epoch_Time:25166.0min: [2024-01-05 12:16:55,545][model8_pretrain.py][INFO] Epoch:[0/2](628900/4588595) loss:2.853 lr:0.0000100 epoch_Time:25166.0min: [2024-01-05 12:16:55,545][model8_pretrain.py][INFO] Epoch:[0/2](628900/4588595) loss:2.616 lr:0.0000100 epoch_Time:25166.0min: [2024-01-05 
12:16:55,545][model8_pretrain.py][INFO] Epoch:[0/2](628900/4588595) loss:3.168 lr:0.0000100 epoch_Time:25166.0min: [2024-01-05 12:16:55,545][model8_pretrain.py][INFO] Epoch:[0/2](628900/4588595) loss:2.418 lr:0.0000100 epoch_Time:25166.0min: [2024-01-05 12:16:55,545][model8_pretrain.py][INFO] Epoch:[0/2](628900/4588595) loss:2.666 lr:0.0000100 epoch_Time:25166.0min: [2024-01-05 12:16:55,545][model8_pretrain.py][INFO] Epoch:[0/2](628900/4588595) loss:2.967 lr:0.0000100 epoch_Time:25166.0min: [2024-01-05 12:17:32,494][model8_pretrain.py][INFO] Epoch:[0/2](629000/4588595) loss:2.988 lr:0.0000100 epoch_Time:25166.0min: [2024-01-05 12:17:32,494][model8_pretrain.py][INFO] Epoch:[0/2](629000/4588595) loss:2.835 lr:0.0000100 epoch_Time:25166.0min: [2024-01-05 12:17:32,494][model8_pretrain.py][INFO] Epoch:[0/2](629000/4588595) loss:3.345 lr:0.0000100 epoch_Time:25166.0min: [2024-01-05 12:17:32,494][model8_pretrain.py][INFO] Epoch:[0/2](629000/4588595) loss:3.031 lr:0.0000100 epoch_Time:25166.0min: [2024-01-05 12:17:32,494][model8_pretrain.py][INFO] Epoch:[0/2](629000/4588595) loss:2.553 lr:0.0000100 epoch_Time:25166.0min: [2024-01-05 12:17:32,495][model8_pretrain.py][INFO] Epoch:[0/2](629000/4588595) loss:3.400 lr:0.0000100 epoch_Time:25166.0min: [2024-01-05 12:17:32,495][model8_pretrain.py][INFO] Epoch:[0/2](629000/4588595) loss:2.139 lr:0.0000100 epoch_Time:25166.0min: [2024-01-05 12:17:32,495][model8_pretrain.py][INFO] Epoch:[0/2](629000/4588595) loss:2.802 lr:0.0000100 epoch_Time:25166.0min: [2024-01-05 12:18:21,279][model8_pretrain.py][INFO] Epoch:[0/2](629100/4588595) loss:3.233 lr:0.0000100 epoch_Time:25167.0min: [2024-01-05 12:18:21,279][model8_pretrain.py][INFO] Epoch:[0/2](629100/4588595) loss:2.998 lr:0.0000100 epoch_Time:25167.0min: [2024-01-05 12:18:21,279][model8_pretrain.py][INFO] Epoch:[0/2](629100/4588595) loss:2.632 lr:0.0000100 epoch_Time:25167.0min: [2024-01-05 12:18:21,279][model8_pretrain.py][INFO] Epoch:[0/2](629100/4588595) loss:3.121 lr:0.0000100 
epoch_Time:25167.0min: [2024-01-05 12:18:21,279][model8_pretrain.py][INFO] Epoch:[0/2](629100/4588595) loss:3.025 lr:0.0000100 epoch_Time:25167.0min: [2024-01-05 12:18:21,279][model8_pretrain.py][INFO] Epoch:[0/2](629100/4588595) loss:2.975 lr:0.0000100 epoch_Time:25167.0min: [2024-01-05 12:18:21,279][model8_pretrain.py][INFO] Epoch:[0/2](629100/4588595) loss:2.664 lr:0.0000100 epoch_Time:25167.0min: [2024-01-05 12:18:21,279][model8_pretrain.py][INFO] Epoch:[0/2](629100/4588595) loss:2.948 lr:0.0000100 epoch_Time:25167.0min: [2024-01-05 12:18:58,206][model8_pretrain.py][INFO] Epoch:[0/2](629200/4588595) loss:3.033 lr:0.0000100 epoch_Time:25165.0min: [2024-01-05 12:18:58,206][model8_pretrain.py][INFO] Epoch:[0/2](629200/4588595) loss:2.994 lr:0.0000100 epoch_Time:25165.0min: [2024-01-05 12:18:58,206][model8_pretrain.py][INFO] Epoch:[0/2](629200/4588595) loss:3.216 lr:0.0000100 epoch_Time:25165.0min: [2024-01-05 12:18:58,206][model8_pretrain.py][INFO] Epoch:[0/2](629200/4588595) loss:2.871 lr:0.0000100 epoch_Time:25165.0min: [2024-01-05 12:18:58,206][model8_pretrain.py][INFO] Epoch:[0/2](629200/4588595) loss:2.578 lr:0.0000100 epoch_Time:25165.0min: [2024-01-05 12:18:58,207][model8_pretrain.py][INFO] Epoch:[0/2](629200/4588595) loss:2.499 lr:0.0000100 epoch_Time:25165.0min: [2024-01-05 12:18:58,206][model8_pretrain.py][INFO] Epoch:[0/2](629200/4588595) loss:2.426 lr:0.0000100 epoch_Time:25165.0min: [2024-01-05 12:18:58,207][model8_pretrain.py][INFO] Epoch:[0/2](629200/4588595) loss:2.599 lr:0.0000100 epoch_Time:25165.0min: [2024-01-05 12:19:35,145][model8_pretrain.py][INFO] Epoch:[0/2](629300/4588595) loss:2.934 lr:0.0000100 epoch_Time:25165.0min: [2024-01-05 12:19:35,145][model8_pretrain.py][INFO] Epoch:[0/2](629300/4588595) loss:3.213 lr:0.0000100 epoch_Time:25165.0min: [2024-01-05 12:19:35,145][model8_pretrain.py][INFO] Epoch:[0/2](629300/4588595) loss:2.659 lr:0.0000100 epoch_Time:25165.0min: [2024-01-05 12:19:35,145][model8_pretrain.py][INFO] 
Epoch:[0/2](629300/4588595) loss:3.023 lr:0.0000100 epoch_Time:25165.0min: [2024-01-05 12:19:35,145][model8_pretrain.py][INFO] Epoch:[0/2](629300/4588595) loss:3.006 lr:0.0000100 epoch_Time:25165.0min: [2024-01-05 12:19:35,145][model8_pretrain.py][INFO] Epoch:[0/2](629300/4588595) loss:2.883 lr:0.0000100 epoch_Time:25165.0min: [2024-01-05 12:19:35,145][model8_pretrain.py][INFO] Epoch:[0/2](629300/4588595) loss:2.234 lr:0.0000100 epoch_Time:25165.0min: [2024-01-05 12:19:35,146][model8_pretrain.py][INFO] Epoch:[0/2](629300/4588595) loss:2.419 lr:0.0000100 epoch_Time:25165.0min: [2024-01-05 12:20:12,096][model8_pretrain.py][INFO] Epoch:[0/2](629400/4588595) loss:2.908 lr:0.0000100 epoch_Time:25164.0min: [2024-01-05 12:20:12,096][model8_pretrain.py][INFO] Epoch:[0/2](629400/4588595) loss:2.913 lr:0.0000100 epoch_Time:25164.0min: [2024-01-05 12:20:12,096][model8_pretrain.py][INFO] Epoch:[0/2](629400/4588595) loss:2.829 lr:0.0000100 epoch_Time:25164.0min: [2024-01-05 12:20:12,096][model8_pretrain.py][INFO] Epoch:[0/2](629400/4588595) loss:2.892 lr:0.0000100 epoch_Time:25164.0min: [2024-01-05 12:20:12,096][model8_pretrain.py][INFO] Epoch:[0/2](629400/4588595) loss:3.134 lr:0.0000100 epoch_Time:25164.0min: [2024-01-05 12:20:12,096][model8_pretrain.py][INFO] Epoch:[0/2](629400/4588595) loss:3.131 lr:0.0000100 epoch_Time:25164.0min: [2024-01-05 12:20:12,096][model8_pretrain.py][INFO] Epoch:[0/2](629400/4588595) loss:2.794 lr:0.0000100 epoch_Time:25164.0min: [2024-01-05 12:20:12,097][model8_pretrain.py][INFO] Epoch:[0/2](629400/4588595) loss:3.129 lr:0.0000100 epoch_Time:25164.0min: [2024-01-05 12:20:49,032][model8_pretrain.py][INFO] Epoch:[0/2](629500/4588595) loss:2.904 lr:0.0000100 epoch_Time:25163.0min: [2024-01-05 12:20:49,032][model8_pretrain.py][INFO] Epoch:[0/2](629500/4588595) loss:2.473 lr:0.0000100 epoch_Time:25163.0min: [2024-01-05 12:20:49,032][model8_pretrain.py][INFO] Epoch:[0/2](629500/4588595) loss:2.629 lr:0.0000100 epoch_Time:25163.0min: [2024-01-05 
12:20:49,032][model8_pretrain.py][INFO] Epoch:[0/2](629500/4588595) loss:2.726 lr:0.0000100 epoch_Time:25163.0min: [2024-01-05 12:20:49,032][model8_pretrain.py][INFO] Epoch:[0/2](629500/4588595) loss:2.550 lr:0.0000100 epoch_Time:25163.0min: [2024-01-05 12:20:49,032][model8_pretrain.py][INFO] Epoch:[0/2](629500/4588595) loss:2.324 lr:0.0000100 epoch_Time:25163.0min: [2024-01-05 12:20:49,032][model8_pretrain.py][INFO] Epoch:[0/2](629500/4588595) loss:3.152 lr:0.0000100 epoch_Time:25163.0min: [2024-01-05 12:20:49,032][model8_pretrain.py][INFO] Epoch:[0/2](629500/4588595) loss:2.826 lr:0.0000100 epoch_Time:25163.0min: [2024-01-05 12:21:25,971][model8_pretrain.py][INFO] Epoch:[0/2](629600/4588595) loss:2.296 lr:0.0000100 epoch_Time:25163.0min: [2024-01-05 12:21:25,971][model8_pretrain.py][INFO] Epoch:[0/2](629600/4588595) loss:3.033 lr:0.0000100 epoch_Time:25163.0min: [2024-01-05 12:21:25,971][model8_pretrain.py][INFO] Epoch:[0/2](629600/4588595) loss:3.181 lr:0.0000100 epoch_Time:25163.0min: [2024-01-05 12:21:25,971][model8_pretrain.py][INFO] Epoch:[0/2](629600/4588595) loss:3.036 lr:0.0000100 epoch_Time:25163.0min: [2024-01-05 12:21:25,971][model8_pretrain.py][INFO] Epoch:[0/2](629600/4588595) loss:2.305 lr:0.0000100 epoch_Time:25163.0min: [2024-01-05 12:21:25,971][model8_pretrain.py][INFO] Epoch:[0/2](629600/4588595) loss:2.902 lr:0.0000100 epoch_Time:25163.0min: [2024-01-05 12:21:25,971][model8_pretrain.py][INFO] Epoch:[0/2](629600/4588595) loss:2.866 lr:0.0000100 epoch_Time:25163.0min: [2024-01-05 12:21:25,971][model8_pretrain.py][INFO] Epoch:[0/2](629600/4588595) loss:3.125 lr:0.0000100 epoch_Time:25163.0min: [2024-01-05 12:22:02,917][model8_pretrain.py][INFO] Epoch:[0/2](629700/4588595) loss:2.697 lr:0.0000100 epoch_Time:25162.0min: [2024-01-05 12:22:02,917][model8_pretrain.py][INFO] Epoch:[0/2](629700/4588595) loss:2.736 lr:0.0000100 epoch_Time:25162.0min: [2024-01-05 12:22:02,917][model8_pretrain.py][INFO] Epoch:[0/2](629700/4588595) loss:3.215 lr:0.0000100 
epoch_Time:25162.0min: [2024-01-05 12:22:02,917][model8_pretrain.py][INFO] Epoch:[0/2](629700/4588595) loss:2.795 lr:0.0000100 epoch_Time:25162.0min: [2024-01-05 12:22:02,917][model8_pretrain.py][INFO] Epoch:[0/2](629700/4588595) loss:3.222 lr:0.0000100 epoch_Time:25162.0min: [2024-01-05 12:22:02,917][model8_pretrain.py][INFO] Epoch:[0/2](629700/4588595) loss:3.207 lr:0.0000100 epoch_Time:25162.0min: [2024-01-05 12:22:02,917][model8_pretrain.py][INFO] Epoch:[0/2](629700/4588595) loss:3.106 lr:0.0000100 epoch_Time:25162.0min: [2024-01-05 12:22:02,917][model8_pretrain.py][INFO] Epoch:[0/2](629700/4588595) loss:2.843 lr:0.0000100 epoch_Time:25162.0min: [2024-01-05 12:22:39,861][model8_pretrain.py][INFO] Epoch:[0/2](629800/4588595) loss:2.814 lr:0.0000100 epoch_Time:25162.0min: [2024-01-05 12:22:39,861][model8_pretrain.py][INFO] Epoch:[0/2](629800/4588595) loss:2.719 lr:0.0000100 epoch_Time:25162.0min: [2024-01-05 12:22:39,861][model8_pretrain.py][INFO] Epoch:[0/2](629800/4588595) loss:2.789 lr:0.0000100 epoch_Time:25162.0min: [2024-01-05 12:22:39,861][model8_pretrain.py][INFO] Epoch:[0/2](629800/4588595) loss:2.440 lr:0.0000100 epoch_Time:25162.0min: [2024-01-05 12:22:39,861][model8_pretrain.py][INFO] Epoch:[0/2](629800/4588595) loss:3.432 lr:0.0000100 epoch_Time:25162.0min: [2024-01-05 12:22:39,861][model8_pretrain.py][INFO] Epoch:[0/2](629800/4588595) loss:2.485 lr:0.0000100 epoch_Time:25162.0min: [2024-01-05 12:22:39,861][model8_pretrain.py][INFO] Epoch:[0/2](629800/4588595) loss:2.719 lr:0.0000100 epoch_Time:25162.0min: [2024-01-05 12:22:39,861][model8_pretrain.py][INFO] Epoch:[0/2](629800/4588595) loss:2.431 lr:0.0000100 epoch_Time:25162.0min: [2024-01-05 12:23:28,483][model8_pretrain.py][INFO] Epoch:[0/2](629900/4588595) loss:2.501 lr:0.0000100 epoch_Time:25162.0min: [2024-01-05 12:23:28,483][model8_pretrain.py][INFO] Epoch:[0/2](629900/4588595) loss:2.808 lr:0.0000100 epoch_Time:25162.0min: [2024-01-05 12:23:28,483][model8_pretrain.py][INFO] 
Epoch:[0/2](629900/4588595) loss:2.580 lr:0.0000100 epoch_Time:25162.0min: [2024-01-05 12:23:28,484][model8_pretrain.py][INFO] Epoch:[0/2](629900/4588595) loss:2.995 lr:0.0000100 epoch_Time:25162.0min: [2024-01-05 12:23:28,483][model8_pretrain.py][INFO] Epoch:[0/2](629900/4588595) loss:2.812 lr:0.0000100 epoch_Time:25162.0min: [2024-01-05 12:23:28,484][model8_pretrain.py][INFO] Epoch:[0/2](629900/4588595) loss:2.959 lr:0.0000100 epoch_Time:25162.0min: [2024-01-05 12:23:28,484][model8_pretrain.py][INFO] Epoch:[0/2](629900/4588595) loss:2.537 lr:0.0000100 epoch_Time:25162.0min: [2024-01-05 12:23:28,484][model8_pretrain.py][INFO] Epoch:[0/2](629900/4588595) loss:3.036 lr:0.0000100 epoch_Time:25162.0min: [2024-01-05 12:24:05,406][model8_pretrain.py][INFO] Epoch:[0/2](630000/4588595) loss:2.367 lr:0.0000100 epoch_Time:25161.0min: [2024-01-05 12:24:05,406][model8_pretrain.py][INFO] Epoch:[0/2](630000/4588595) loss:2.850 lr:0.0000100 epoch_Time:25161.0min: [2024-01-05 12:24:05,406][model8_pretrain.py][INFO] Epoch:[0/2](630000/4588595) loss:2.569 lr:0.0000100 epoch_Time:25161.0min: [2024-01-05 12:24:05,406][model8_pretrain.py][INFO] Epoch:[0/2](630000/4588595) loss:2.601 lr:0.0000100 epoch_Time:25161.0min: [2024-01-05 12:24:05,406][model8_pretrain.py][INFO] Epoch:[0/2](630000/4588595) loss:2.945 lr:0.0000100 epoch_Time:25161.0min: [2024-01-05 12:24:05,406][model8_pretrain.py][INFO] Epoch:[0/2](630000/4588595) loss:2.891 lr:0.0000100 epoch_Time:25161.0min: [2024-01-05 12:24:05,406][model8_pretrain.py][INFO] Epoch:[0/2](630000/4588595) loss:3.031 lr:0.0000100 epoch_Time:25161.0min: [2024-01-05 12:24:05,406][model8_pretrain.py][INFO] Epoch:[0/2](630000/4588595) loss:2.753 lr:0.0000100 epoch_Time:25161.0min: [2024-01-05 12:24:42,356][model8_pretrain.py][INFO] Epoch:[0/2](630100/4588595) loss:2.565 lr:0.0000100 epoch_Time:25161.0min: [2024-01-05 12:24:42,356][model8_pretrain.py][INFO] Epoch:[0/2](630100/4588595) loss:3.435 lr:0.0000100 epoch_Time:25161.0min: [2024-01-05 
12:24:42,356][model8_pretrain.py][INFO] Epoch:[0/2](630100/4588595) loss:2.744 lr:0.0000100 epoch_Time:25161.0min: [2024-01-05 12:24:42,356][model8_pretrain.py][INFO] Epoch:[0/2](630100/4588595) loss:2.873 lr:0.0000100 epoch_Time:25161.0min: [2024-01-05 12:24:42,356][model8_pretrain.py][INFO] Epoch:[0/2](630100/4588595) loss:3.005 lr:0.0000100 epoch_Time:25161.0min: [2024-01-05 12:24:42,356][model8_pretrain.py][INFO] Epoch:[0/2](630100/4588595) loss:2.553 lr:0.0000100 epoch_Time:25161.0min: [2024-01-05 12:24:42,357][model8_pretrain.py][INFO] Epoch:[0/2](630100/4588595) loss:3.360 lr:0.0000100 epoch_Time:25161.0min: [2024-01-05 12:24:42,357][model8_pretrain.py][INFO] Epoch:[0/2](630100/4588595) loss:3.059 lr:0.0000100 epoch_Time:25161.0min: [2024-01-05 12:25:19,308][model8_pretrain.py][INFO] Epoch:[0/2](630200/4588595) loss:2.687 lr:0.0000100 epoch_Time:25159.0min: [2024-01-05 12:25:19,308][model8_pretrain.py][INFO] Epoch:[0/2](630200/4588595) loss:2.986 lr:0.0000100 epoch_Time:25159.0min: [2024-01-05 12:25:19,308][model8_pretrain.py][INFO] Epoch:[0/2](630200/4588595) loss:2.983 lr:0.0000100 epoch_Time:25159.0min: [2024-01-05 12:25:19,308][model8_pretrain.py][INFO] Epoch:[0/2](630200/4588595) loss:3.038 lr:0.0000100 epoch_Time:25159.0min: [2024-01-05 12:25:19,308][model8_pretrain.py][INFO] Epoch:[0/2](630200/4588595) loss:3.273 lr:0.0000100 epoch_Time:25159.0min: [2024-01-05 12:25:19,308][model8_pretrain.py][INFO] Epoch:[0/2](630200/4588595) loss:2.453 lr:0.0000100 epoch_Time:25159.0min: [2024-01-05 12:25:19,308][model8_pretrain.py][INFO] Epoch:[0/2](630200/4588595) loss:2.691 lr:0.0000100 epoch_Time:25159.0min: [2024-01-05 12:25:19,308][model8_pretrain.py][INFO] Epoch:[0/2](630200/4588595) loss:2.603 lr:0.0000100 epoch_Time:25159.0min: [2024-01-05 12:25:56,252][model8_pretrain.py][INFO] Epoch:[0/2](630300/4588595) loss:2.629 lr:0.0000100 epoch_Time:25158.0min: [2024-01-05 12:25:56,252][model8_pretrain.py][INFO] Epoch:[0/2](630300/4588595) loss:2.403 lr:0.0000100 
epoch_Time:25158.0min: [2024-01-05 12:25:56,252][model8_pretrain.py][INFO] Epoch:[0/2](630300/4588595) loss:2.544 lr:0.0000100 epoch_Time:25158.0min: [2024-01-05 12:25:56,252][model8_pretrain.py][INFO] Epoch:[0/2](630300/4588595) loss:2.659 lr:0.0000100 epoch_Time:25158.0min: [2024-01-05 12:25:56,252][model8_pretrain.py][INFO] Epoch:[0/2](630300/4588595) loss:3.145 lr:0.0000100 epoch_Time:25158.0min: [2024-01-05 12:25:56,253][model8_pretrain.py][INFO] Epoch:[0/2](630300/4588595) loss:3.042 lr:0.0000100 epoch_Time:25158.0min: [2024-01-05 12:25:56,253][model8_pretrain.py][INFO] Epoch:[0/2](630300/4588595) loss:3.263 lr:0.0000100 epoch_Time:25158.0min: [2024-01-05 12:25:56,253][model8_pretrain.py][INFO] Epoch:[0/2](630300/4588595) loss:2.706 lr:0.0000100 epoch_Time:25158.0min: [2024-01-05 12:26:33,214][model8_pretrain.py][INFO] Epoch:[0/2](630400/4588595) loss:3.059 lr:0.0000100 epoch_Time:25158.0min: [2024-01-05 12:26:33,214][model8_pretrain.py][INFO] Epoch:[0/2](630400/4588595) loss:2.525 lr:0.0000100 epoch_Time:25158.0min: [2024-01-05 12:26:33,214][model8_pretrain.py][INFO] Epoch:[0/2](630400/4588595) loss:2.767 lr:0.0000100 epoch_Time:25158.0min: [2024-01-05 12:26:33,214][model8_pretrain.py][INFO] Epoch:[0/2](630400/4588595) loss:2.534 lr:0.0000100 epoch_Time:25158.0min: [2024-01-05 12:26:33,214][model8_pretrain.py][INFO] Epoch:[0/2](630400/4588595) loss:3.027 lr:0.0000100 epoch_Time:25158.0min: [2024-01-05 12:26:33,214][model8_pretrain.py][INFO] Epoch:[0/2](630400/4588595) loss:2.203 lr:0.0000100 epoch_Time:25158.0min: [2024-01-05 12:26:33,214][model8_pretrain.py][INFO] Epoch:[0/2](630400/4588595) loss:2.303 lr:0.0000100 epoch_Time:25158.0min: [2024-01-05 12:26:33,214][model8_pretrain.py][INFO] Epoch:[0/2](630400/4588595) loss:2.904 lr:0.0000100 epoch_Time:25158.0min: [2024-01-05 12:27:10,163][model8_pretrain.py][INFO] Epoch:[0/2](630500/4588595) loss:2.710 lr:0.0000100 epoch_Time:25157.0min: [2024-01-05 12:27:10,163][model8_pretrain.py][INFO] 
Epoch:[0/2](630500/4588595) loss:2.833 lr:0.0000100 epoch_Time:25157.0min: [2024-01-05 12:27:10,163][model8_pretrain.py][INFO] Epoch:[0/2](630500/4588595) loss:2.823 lr:0.0000100 epoch_Time:25157.0min: [2024-01-05 12:27:10,163][model8_pretrain.py][INFO] Epoch:[0/2](630500/4588595) loss:2.757 lr:0.0000100 epoch_Time:25157.0min: [2024-01-05 12:27:10,163][model8_pretrain.py][INFO] Epoch:[0/2](630500/4588595) loss:2.829 lr:0.0000100 epoch_Time:25157.0min: [2024-01-05 12:27:10,163][model8_pretrain.py][INFO] Epoch:[0/2](630500/4588595) loss:2.582 lr:0.0000100 epoch_Time:25157.0min: [2024-01-05 12:27:10,163][model8_pretrain.py][INFO] Epoch:[0/2](630500/4588595) loss:3.298 lr:0.0000100 epoch_Time:25157.0min: [2024-01-05 12:27:10,164][model8_pretrain.py][INFO] Epoch:[0/2](630500/4588595) loss:3.228 lr:0.0000100 epoch_Time:25157.0min: [2024-01-05 12:27:47,100][model8_pretrain.py][INFO] Epoch:[0/2](630600/4588595) loss:2.770 lr:0.0000100 epoch_Time:25157.0min: [2024-01-05 12:27:47,100][model8_pretrain.py][INFO] Epoch:[0/2](630600/4588595) loss:2.743 lr:0.0000100 epoch_Time:25157.0min: [2024-01-05 12:27:47,100][model8_pretrain.py][INFO] Epoch:[0/2](630600/4588595) loss:2.483 lr:0.0000100 epoch_Time:25157.0min: [2024-01-05 12:27:47,100][model8_pretrain.py][INFO] Epoch:[0/2](630600/4588595) loss:3.086 lr:0.0000100 epoch_Time:25157.0min: [2024-01-05 12:27:47,100][model8_pretrain.py][INFO] Epoch:[0/2](630600/4588595) loss:3.233 lr:0.0000100 epoch_Time:25157.0min: [2024-01-05 12:27:47,101][model8_pretrain.py][INFO] Epoch:[0/2](630600/4588595) loss:2.873 lr:0.0000100 epoch_Time:25157.0min: [2024-01-05 12:27:47,101][model8_pretrain.py][INFO] Epoch:[0/2](630600/4588595) loss:2.699 lr:0.0000100 epoch_Time:25157.0min: [2024-01-05 12:27:47,101][model8_pretrain.py][INFO] Epoch:[0/2](630600/4588595) loss:2.711 lr:0.0000100 epoch_Time:25157.0min: [2024-01-05 12:28:34,308][model8_pretrain.py][INFO] Epoch:[0/2](630700/4588595) loss:2.878 lr:0.0000100 epoch_Time:25157.0min: [2024-01-05 
12:28:34,308][model8_pretrain.py][INFO] Epoch:[0/2](630700/4588595) loss:3.488 lr:0.0000100 epoch_Time:25157.0min: [2024-01-05 12:28:34,308][model8_pretrain.py][INFO] Epoch:[0/2](630700/4588595) loss:2.515 lr:0.0000100 epoch_Time:25157.0min: [2024-01-05 12:28:34,308][model8_pretrain.py][INFO] Epoch:[0/2](630700/4588595) loss:2.470 lr:0.0000100 epoch_Time:25157.0min: [2024-01-05 12:28:34,308][model8_pretrain.py][INFO] Epoch:[0/2](630700/4588595) loss:2.929 lr:0.0000100 epoch_Time:25157.0min: [2024-01-05 12:28:34,308][model8_pretrain.py][INFO] Epoch:[0/2](630700/4588595) loss:2.549 lr:0.0000100 epoch_Time:25157.0min: [2024-01-05 12:28:34,308][model8_pretrain.py][INFO] Epoch:[0/2](630700/4588595) loss:2.329 lr:0.0000100 epoch_Time:25157.0min: [2024-01-05 12:28:34,308][model8_pretrain.py][INFO] Epoch:[0/2](630700/4588595) loss:2.934 lr:0.0000100 epoch_Time:25157.0min: [2024-01-05 12:29:12,962][model8_pretrain.py][INFO] Epoch:[0/2](630800/4588595) loss:2.779 lr:0.0000100 epoch_Time:25156.0min: [2024-01-05 12:29:12,962][model8_pretrain.py][INFO] Epoch:[0/2](630800/4588595) loss:2.581 lr:0.0000100 epoch_Time:25156.0min: [2024-01-05 12:29:12,962][model8_pretrain.py][INFO] Epoch:[0/2](630800/4588595) loss:2.995 lr:0.0000100 epoch_Time:25156.0min: [2024-01-05 12:29:12,962][model8_pretrain.py][INFO] Epoch:[0/2](630800/4588595) loss:3.097 lr:0.0000100 epoch_Time:25156.0min: [2024-01-05 12:29:12,962][model8_pretrain.py][INFO] Epoch:[0/2](630800/4588595) loss:3.314 lr:0.0000100 epoch_Time:25156.0min: [2024-01-05 12:29:12,962][model8_pretrain.py][INFO] Epoch:[0/2](630800/4588595) loss:2.859 lr:0.0000100 epoch_Time:25156.0min: [2024-01-05 12:29:12,962][model8_pretrain.py][INFO] Epoch:[0/2](630800/4588595) loss:3.037 lr:0.0000100 epoch_Time:25156.0min: [2024-01-05 12:29:12,962][model8_pretrain.py][INFO] Epoch:[0/2](630800/4588595) loss:2.550 lr:0.0000100 epoch_Time:25156.0min: [2024-01-05 12:29:49,905][model8_pretrain.py][INFO] Epoch:[0/2](630900/4588595) loss:2.953 lr:0.0000100 
epoch_Time:25155.0min: [2024-01-05 12:29:49,905][model8_pretrain.py][INFO] Epoch:[0/2](630900/4588595) loss:2.993 lr:0.0000100 epoch_Time:25155.0min: [2024-01-05 12:29:49,905][model8_pretrain.py][INFO] Epoch:[0/2](630900/4588595) loss:3.227 lr:0.0000100 epoch_Time:25155.0min: [2024-01-05 12:29:49,905][model8_pretrain.py][INFO] Epoch:[0/2](630900/4588595) loss:3.360 lr:0.0000100 epoch_Time:25155.0min: [2024-01-05 12:29:49,905][model8_pretrain.py][INFO] Epoch:[0/2](630900/4588595) loss:2.507 lr:0.0000100 epoch_Time:25155.0min: [2024-01-05 12:29:49,905][model8_pretrain.py][INFO] Epoch:[0/2](630900/4588595) loss:2.991 lr:0.0000100 epoch_Time:25155.0min: [2024-01-05 12:29:49,905][model8_pretrain.py][INFO] Epoch:[0/2](630900/4588595) loss:2.742 lr:0.0000100 epoch_Time:25155.0min: [2024-01-05 12:29:49,905][model8_pretrain.py][INFO] Epoch:[0/2](630900/4588595) loss:2.994 lr:0.0000100 epoch_Time:25155.0min: [2024-01-05 12:30:26,859][model8_pretrain.py][INFO] Epoch:[0/2](631000/4588595) loss:3.011 lr:0.0000100 epoch_Time:25155.0min: [2024-01-05 12:30:26,859][model8_pretrain.py][INFO] Epoch:[0/2](631000/4588595) loss:2.740 lr:0.0000100 epoch_Time:25155.0min: [2024-01-05 12:30:26,859][model8_pretrain.py][INFO] Epoch:[0/2](631000/4588595) loss:3.270 lr:0.0000100 epoch_Time:25155.0min: [2024-01-05 12:30:26,859][model8_pretrain.py][INFO] Epoch:[0/2](631000/4588595) loss:2.958 lr:0.0000100 epoch_Time:25155.0min: [2024-01-05 12:30:26,859][model8_pretrain.py][INFO] Epoch:[0/2](631000/4588595) loss:2.764 lr:0.0000100 epoch_Time:25155.0min: [2024-01-05 12:30:26,859][model8_pretrain.py][INFO] Epoch:[0/2](631000/4588595) loss:2.390 lr:0.0000100 epoch_Time:25155.0min: [2024-01-05 12:30:26,859][model8_pretrain.py][INFO] Epoch:[0/2](631000/4588595) loss:3.095 lr:0.0000100 epoch_Time:25155.0min: [2024-01-05 12:30:26,859][model8_pretrain.py][INFO] Epoch:[0/2](631000/4588595) loss:2.880 lr:0.0000100 epoch_Time:25155.0min: [2024-01-05 12:31:03,814][model8_pretrain.py][INFO] 
Epoch:[0/2](631100/4588595) loss:2.677 lr:0.0000100 epoch_Time:25154.0min: [2024-01-05 12:31:03,814][model8_pretrain.py][INFO] Epoch:[0/2](631100/4588595) loss:2.455 lr:0.0000100 epoch_Time:25154.0min: [2024-01-05 12:31:03,814][model8_pretrain.py][INFO] Epoch:[0/2](631100/4588595) loss:2.389 lr:0.0000100 epoch_Time:25154.0min: [2024-01-05 12:31:03,814][model8_pretrain.py][INFO] Epoch:[0/2](631100/4588595) loss:3.015 lr:0.0000100 epoch_Time:25154.0min: [2024-01-05 12:31:03,814][model8_pretrain.py][INFO] Epoch:[0/2](631100/4588595) loss:3.087 lr:0.0000100 epoch_Time:25154.0min: [2024-01-05 12:31:03,815][model8_pretrain.py][INFO] Epoch:[0/2](631100/4588595) loss:2.415 lr:0.0000100 epoch_Time:25154.0min: [2024-01-05 12:31:03,815][model8_pretrain.py][INFO] Epoch:[0/2](631100/4588595) loss:3.162 lr:0.0000100 epoch_Time:25154.0min: [2024-01-05 12:31:03,815][model8_pretrain.py][INFO] Epoch:[0/2](631100/4588595) loss:2.565 lr:0.0000100 epoch_Time:25154.0min: [2024-01-05 12:31:40,755][model8_pretrain.py][INFO] Epoch:[0/2](631200/4588595) loss:2.848 lr:0.0000100 epoch_Time:25153.0min: [2024-01-05 12:31:40,755][model8_pretrain.py][INFO] Epoch:[0/2](631200/4588595) loss:2.733 lr:0.0000100 epoch_Time:25153.0min: [2024-01-05 12:31:40,755][model8_pretrain.py][INFO] Epoch:[0/2](631200/4588595) loss:2.380 lr:0.0000100 epoch_Time:25153.0min: [2024-01-05 12:31:40,755][model8_pretrain.py][INFO] Epoch:[0/2](631200/4588595) loss:2.908 lr:0.0000100 epoch_Time:25153.0min: [2024-01-05 12:31:40,755][model8_pretrain.py][INFO] Epoch:[0/2](631200/4588595) loss:2.766 lr:0.0000100 epoch_Time:25153.0min: [2024-01-05 12:31:40,755][model8_pretrain.py][INFO] Epoch:[0/2](631200/4588595) loss:2.615 lr:0.0000100 epoch_Time:25153.0min: [2024-01-05 12:31:40,756][model8_pretrain.py][INFO] Epoch:[0/2](631200/4588595) loss:3.201 lr:0.0000100 epoch_Time:25153.0min: [2024-01-05 12:31:40,756][model8_pretrain.py][INFO] Epoch:[0/2](631200/4588595) loss:2.463 lr:0.0000100 epoch_Time:25153.0min: [2024-01-05 
12:32:17,719][model8_pretrain.py][INFO] Epoch:[0/2](631300/4588595) loss:3.028 lr:0.0000100 epoch_Time:25152.0min: [2024-01-05 12:32:17,719][model8_pretrain.py][INFO] Epoch:[0/2](631300/4588595) loss:3.005 lr:0.0000100 epoch_Time:25152.0min: [2024-01-05 12:32:17,719][model8_pretrain.py][INFO] Epoch:[0/2](631300/4588595) loss:2.753 lr:0.0000100 epoch_Time:25152.0min: [2024-01-05 12:32:17,719][model8_pretrain.py][INFO] Epoch:[0/2](631300/4588595) loss:2.429 lr:0.0000100 epoch_Time:25152.0min: [2024-01-05 12:32:17,719][model8_pretrain.py][INFO] Epoch:[0/2](631300/4588595) loss:2.187 lr:0.0000100 epoch_Time:25152.0min: [2024-01-05 12:32:17,719][model8_pretrain.py][INFO] Epoch:[0/2](631300/4588595) loss:3.144 lr:0.0000100 epoch_Time:25152.0min: [2024-01-05 12:32:17,719][model8_pretrain.py][INFO] Epoch:[0/2](631300/4588595) loss:3.037 lr:0.0000100 epoch_Time:25152.0min: [2024-01-05 12:32:17,719][model8_pretrain.py][INFO] Epoch:[0/2](631300/4588595) loss:3.244 lr:0.0000100 epoch_Time:25152.0min: [2024-01-05 12:32:54,675][model8_pretrain.py][INFO] Epoch:[0/2](631400/4588595) loss:3.050 lr:0.0000100 epoch_Time:25151.0min: [2024-01-05 12:32:54,675][model8_pretrain.py][INFO] Epoch:[0/2](631400/4588595) loss:2.658 lr:0.0000100 epoch_Time:25151.0min: [2024-01-05 12:32:54,675][model8_pretrain.py][INFO] Epoch:[0/2](631400/4588595) loss:3.021 lr:0.0000100 epoch_Time:25151.0min: [2024-01-05 12:32:54,676][model8_pretrain.py][INFO] Epoch:[0/2](631400/4588595) loss:2.553 lr:0.0000100 epoch_Time:25151.0min: [2024-01-05 12:32:54,676][model8_pretrain.py][INFO] Epoch:[0/2](631400/4588595) loss:2.967 lr:0.0000100 epoch_Time:25151.0min: [2024-01-05 12:32:54,675][model8_pretrain.py][INFO] Epoch:[0/2](631400/4588595) loss:2.942 lr:0.0000100 epoch_Time:25151.0min: [2024-01-05 12:32:54,676][model8_pretrain.py][INFO] Epoch:[0/2](631400/4588595) loss:2.402 lr:0.0000100 epoch_Time:25151.0min: [2024-01-05 12:32:54,676][model8_pretrain.py][INFO] Epoch:[0/2](631400/4588595) loss:2.796 lr:0.0000100 
epoch_Time:25151.0min: [2024-01-05 12:33:41,882][model8_pretrain.py][INFO] Epoch:[0/2](631500/4588595) loss:3.195 lr:0.0000100 epoch_Time:25152.0min: [2024-01-05 12:33:41,882][model8_pretrain.py][INFO] Epoch:[0/2](631500/4588595) loss:3.008 lr:0.0000100 epoch_Time:25152.0min: [2024-01-05 12:33:41,882][model8_pretrain.py][INFO] Epoch:[0/2](631500/4588595) loss:2.564 lr:0.0000100 epoch_Time:25152.0min: [2024-01-05 12:33:41,882][model8_pretrain.py][INFO] Epoch:[0/2](631500/4588595) loss:2.630 lr:0.0000100 epoch_Time:25152.0min: [2024-01-05 12:33:41,882][model8_pretrain.py][INFO] Epoch:[0/2](631500/4588595) loss:3.197 lr:0.0000100 epoch_Time:25152.0min: [2024-01-05 12:33:41,882][model8_pretrain.py][INFO] Epoch:[0/2](631500/4588595) loss:2.579 lr:0.0000100 epoch_Time:25152.0min: [2024-01-05 12:33:41,882][model8_pretrain.py][INFO] Epoch:[0/2](631500/4588595) loss:3.046 lr:0.0000100 epoch_Time:25152.0min: [2024-01-05 12:33:41,882][model8_pretrain.py][INFO] Epoch:[0/2](631500/4588595) loss:2.895 lr:0.0000100 epoch_Time:25152.0min: [2024-01-05 12:34:20,455][model8_pretrain.py][INFO] Epoch:[0/2](631600/4588595) loss:2.770 lr:0.0000100 epoch_Time:25151.0min: [2024-01-05 12:34:20,455][model8_pretrain.py][INFO] Epoch:[0/2](631600/4588595) loss:2.699 lr:0.0000100 epoch_Time:25151.0min: [2024-01-05 12:34:20,455][model8_pretrain.py][INFO] Epoch:[0/2](631600/4588595) loss:2.572 lr:0.0000100 epoch_Time:25151.0min: [2024-01-05 12:34:20,455][model8_pretrain.py][INFO] Epoch:[0/2](631600/4588595) loss:2.594 lr:0.0000100 epoch_Time:25151.0min: [2024-01-05 12:34:20,455][model8_pretrain.py][INFO] Epoch:[0/2](631600/4588595) loss:2.511 lr:0.0000100 epoch_Time:25151.0min: [2024-01-05 12:34:20,455][model8_pretrain.py][INFO] Epoch:[0/2](631600/4588595) loss:2.680 lr:0.0000100 epoch_Time:25151.0min: [2024-01-05 12:34:20,456][model8_pretrain.py][INFO] Epoch:[0/2](631600/4588595) loss:2.315 lr:0.0000100 epoch_Time:25151.0min: [2024-01-05 12:34:20,456][model8_pretrain.py][INFO] 
Epoch:[0/2](631600/4588595) loss:2.902 lr:0.0000100 epoch_Time:25151.0min: [2024-01-05 12:34:57,401][model8_pretrain.py][INFO] Epoch:[0/2](631700/4588595) loss:2.916 lr:0.0000100 epoch_Time:25150.0min: [2024-01-05 12:34:57,401][model8_pretrain.py][INFO] Epoch:[0/2](631700/4588595) loss:3.268 lr:0.0000100 epoch_Time:25150.0min: [2024-01-05 12:34:57,401][model8_pretrain.py][INFO] Epoch:[0/2](631700/4588595) loss:3.118 lr:0.0000100 epoch_Time:25150.0min: [2024-01-05 12:34:57,401][model8_pretrain.py][INFO] Epoch:[0/2](631700/4588595) loss:2.263 lr:0.0000100 epoch_Time:25150.0min: [2024-01-05 12:34:57,401][model8_pretrain.py][INFO] Epoch:[0/2](631700/4588595) loss:3.092 lr:0.0000100 epoch_Time:25150.0min: [2024-01-05 12:34:57,401][model8_pretrain.py][INFO] Epoch:[0/2](631700/4588595) loss:2.827 lr:0.0000100 epoch_Time:25150.0min: [2024-01-05 12:34:57,401][model8_pretrain.py][INFO] Epoch:[0/2](631700/4588595) loss:2.453 lr:0.0000100 epoch_Time:25150.0min: [2024-01-05 12:34:57,401][model8_pretrain.py][INFO] Epoch:[0/2](631700/4588595) loss:2.808 lr:0.0000100 epoch_Time:25150.0min: [2024-01-05 12:35:34,350][model8_pretrain.py][INFO] Epoch:[0/2](631800/4588595) loss:3.212 lr:0.0000100 epoch_Time:25150.0min: [2024-01-05 12:35:34,350][model8_pretrain.py][INFO] Epoch:[0/2](631800/4588595) loss:2.573 lr:0.0000100 epoch_Time:25150.0min: [2024-01-05 12:35:34,350][model8_pretrain.py][INFO] Epoch:[0/2](631800/4588595) loss:2.571 lr:0.0000100 epoch_Time:25150.0min: [2024-01-05 12:35:34,350][model8_pretrain.py][INFO] Epoch:[0/2](631800/4588595) loss:2.876 lr:0.0000100 epoch_Time:25150.0min: [2024-01-05 12:35:34,350][model8_pretrain.py][INFO] Epoch:[0/2](631800/4588595) loss:2.052 lr:0.0000100 epoch_Time:25150.0min: [2024-01-05 12:35:34,350][model8_pretrain.py][INFO] Epoch:[0/2](631800/4588595) loss:2.879 lr:0.0000100 epoch_Time:25150.0min: [2024-01-05 12:35:34,350][model8_pretrain.py][INFO] Epoch:[0/2](631800/4588595) loss:3.108 lr:0.0000100 epoch_Time:25150.0min: [2024-01-05 
12:35:34,350][model8_pretrain.py][INFO] Epoch:[0/2](631800/4588595) loss:3.177 lr:0.0000100 epoch_Time:25150.0min: [2024-01-05 12:36:11,301][model8_pretrain.py][INFO] Epoch:[0/2](631900/4588595) loss:2.862 lr:0.0000100 epoch_Time:25149.0min: [2024-01-05 12:36:11,301][model8_pretrain.py][INFO] Epoch:[0/2](631900/4588595) loss:2.548 lr:0.0000100 epoch_Time:25149.0min: [2024-01-05 12:36:11,301][model8_pretrain.py][INFO] Epoch:[0/2](631900/4588595) loss:3.187 lr:0.0000100 epoch_Time:25149.0min: [2024-01-05 12:36:11,301][model8_pretrain.py][INFO] Epoch:[0/2](631900/4588595) loss:2.841 lr:0.0000100 epoch_Time:25149.0min: [2024-01-05 12:36:11,301][model8_pretrain.py][INFO] Epoch:[0/2](631900/4588595) loss:2.676 lr:0.0000100 epoch_Time:25149.0min: [2024-01-05 12:36:11,301][model8_pretrain.py][INFO] Epoch:[0/2](631900/4588595) loss:3.025 lr:0.0000100 epoch_Time:25149.0min: [2024-01-05 12:36:11,301][model8_pretrain.py][INFO] Epoch:[0/2](631900/4588595) loss:2.993 lr:0.0000100 epoch_Time:25149.0min: [2024-01-05 12:36:11,301][model8_pretrain.py][INFO] Epoch:[0/2](631900/4588595) loss:3.279 lr:0.0000100 epoch_Time:25149.0min: [2024-01-05 12:36:48,255][model8_pretrain.py][INFO] Epoch:[0/2](632000/4588595) loss:2.397 lr:0.0000100 epoch_Time:25148.0min: [2024-01-05 12:36:48,255][model8_pretrain.py][INFO] Epoch:[0/2](632000/4588595) loss:3.225 lr:0.0000100 epoch_Time:25148.0min: [2024-01-05 12:36:48,255][model8_pretrain.py][INFO] Epoch:[0/2](632000/4588595) loss:1.927 lr:0.0000100 epoch_Time:25148.0min: [2024-01-05 12:36:48,255][model8_pretrain.py][INFO] Epoch:[0/2](632000/4588595) loss:3.110 lr:0.0000100 epoch_Time:25148.0min: [2024-01-05 12:36:48,255][model8_pretrain.py][INFO] Epoch:[0/2](632000/4588595) loss:2.716 lr:0.0000100 epoch_Time:25148.0min: [2024-01-05 12:36:48,255][model8_pretrain.py][INFO] Epoch:[0/2](632000/4588595) loss:2.757 lr:0.0000100 epoch_Time:25148.0min: [2024-01-05 12:36:48,255][model8_pretrain.py][INFO] Epoch:[0/2](632000/4588595) loss:2.645 lr:0.0000100 
epoch_Time:25148.0min: [2024-01-05 12:36:48,255][model8_pretrain.py][INFO] Epoch:[0/2](632000/4588595) loss:2.752 lr:0.0000100 epoch_Time:25148.0min: [2024-01-05 12:37:25,199][model8_pretrain.py][INFO] Epoch:[0/2](632100/4588595) loss:2.989 lr:0.0000100 epoch_Time:25148.0min: [2024-01-05 12:37:25,199][model8_pretrain.py][INFO] Epoch:[0/2](632100/4588595) loss:3.392 lr:0.0000100 epoch_Time:25148.0min: [2024-01-05 12:37:25,199][model8_pretrain.py][INFO] Epoch:[0/2](632100/4588595) loss:2.881 lr:0.0000100 epoch_Time:25148.0min: [2024-01-05 12:37:25,199][model8_pretrain.py][INFO] Epoch:[0/2](632100/4588595) loss:2.804 lr:0.0000100 epoch_Time:25148.0min: [2024-01-05 12:37:25,199][model8_pretrain.py][INFO] Epoch:[0/2](632100/4588595) loss:2.939 lr:0.0000100 epoch_Time:25148.0min: [2024-01-05 12:37:25,199][model8_pretrain.py][INFO] Epoch:[0/2](632100/4588595) loss:2.916 lr:0.0000100 epoch_Time:25148.0min: [2024-01-05 12:37:25,199][model8_pretrain.py][INFO] Epoch:[0/2](632100/4588595) loss:2.407 lr:0.0000100 epoch_Time:25148.0min: [2024-01-05 12:37:25,199][model8_pretrain.py][INFO] Epoch:[0/2](632100/4588595) loss:2.253 lr:0.0000100 epoch_Time:25148.0min: [2024-01-05 12:38:02,136][model8_pretrain.py][INFO] Epoch:[0/2](632200/4588595) loss:2.386 lr:0.0000100 epoch_Time:25146.0min: [2024-01-05 12:38:02,136][model8_pretrain.py][INFO] Epoch:[0/2](632200/4588595) loss:2.790 lr:0.0000100 epoch_Time:25146.0min: [2024-01-05 12:38:02,136][model8_pretrain.py][INFO] Epoch:[0/2](632200/4588595) loss:3.250 lr:0.0000100 epoch_Time:25146.0min: [2024-01-05 12:38:02,136][model8_pretrain.py][INFO] Epoch:[0/2](632200/4588595) loss:3.177 lr:0.0000100 epoch_Time:25146.0min: [2024-01-05 12:38:02,136][model8_pretrain.py][INFO] Epoch:[0/2](632200/4588595) loss:2.535 lr:0.0000100 epoch_Time:25146.0min: [2024-01-05 12:38:02,136][model8_pretrain.py][INFO] Epoch:[0/2](632200/4588595) loss:2.758 lr:0.0000100 epoch_Time:25146.0min: [2024-01-05 12:38:02,136][model8_pretrain.py][INFO] 
Epoch:[0/2](632200/4588595) loss:2.873 lr:0.0000100 epoch_Time:25146.0min: [2024-01-05 12:38:02,136][model8_pretrain.py][INFO] Epoch:[0/2](632200/4588595) loss:3.003 lr:0.0000100 epoch_Time:25146.0min: [2024-01-05 12:38:47,739][model8_pretrain.py][INFO] Epoch:[0/2](632300/4588595) loss:2.567 lr:0.0000100 epoch_Time:25146.0min: [2024-01-05 12:38:47,739][model8_pretrain.py][INFO] Epoch:[0/2](632300/4588595) loss:2.457 lr:0.0000100 epoch_Time:25146.0min: [2024-01-05 12:38:47,739][model8_pretrain.py][INFO] Epoch:[0/2](632300/4588595) loss:3.178 lr:0.0000100 epoch_Time:25146.0min: [2024-01-05 12:38:47,739][model8_pretrain.py][INFO] Epoch:[0/2](632300/4588595) loss:2.838 lr:0.0000100 epoch_Time:25146.0min: [2024-01-05 12:38:47,739][model8_pretrain.py][INFO] Epoch:[0/2](632300/4588595) loss:2.991 lr:0.0000100 epoch_Time:25146.0min: [2024-01-05 12:38:47,739][model8_pretrain.py][INFO] Epoch:[0/2](632300/4588595) loss:2.932 lr:0.0000100 epoch_Time:25146.0min: [2024-01-05 12:38:47,739][model8_pretrain.py][INFO] Epoch:[0/2](632300/4588595) loss:2.633 lr:0.0000100 epoch_Time:25146.0min: [2024-01-05 12:38:47,739][model8_pretrain.py][INFO] Epoch:[0/2](632300/4588595) loss:2.679 lr:0.0000100 epoch_Time:25146.0min: [2024-01-05 12:39:28,137][model8_pretrain.py][INFO] Epoch:[0/2](632400/4588595) loss:2.242 lr:0.0000100 epoch_Time:25147.0min: [2024-01-05 12:39:28,137][model8_pretrain.py][INFO] Epoch:[0/2](632400/4588595) loss:2.902 lr:0.0000100 epoch_Time:25147.0min: [2024-01-05 12:39:28,137][model8_pretrain.py][INFO] Epoch:[0/2](632400/4588595) loss:3.151 lr:0.0000100 epoch_Time:25147.0min: [2024-01-05 12:39:28,137][model8_pretrain.py][INFO] Epoch:[0/2](632400/4588595) loss:3.115 lr:0.0000100 epoch_Time:25147.0min: [2024-01-05 12:39:28,137][model8_pretrain.py][INFO] Epoch:[0/2](632400/4588595) loss:3.415 lr:0.0000100 epoch_Time:25147.0min: [2024-01-05 12:39:28,137][model8_pretrain.py][INFO] Epoch:[0/2](632400/4588595) loss:3.025 lr:0.0000100 epoch_Time:25147.0min: [2024-01-05 
12:39:28,137][model8_pretrain.py][INFO] Epoch:[0/2](632400/4588595) loss:2.642 lr:0.0000100 epoch_Time:25147.0min: [2024-01-05 12:39:28,138][model8_pretrain.py][INFO] Epoch:[0/2](632400/4588595) loss:3.038 lr:0.0000100 epoch_Time:25147.0min: [2024-01-05 12:40:05,087][model8_pretrain.py][INFO] Epoch:[0/2](632500/4588595) loss:2.809 lr:0.0000100 epoch_Time:25145.0min: [2024-01-05 12:40:05,087][model8_pretrain.py][INFO] Epoch:[0/2](632500/4588595) loss:2.629 lr:0.0000100 epoch_Time:25145.0min: [2024-01-05 12:40:05,087][model8_pretrain.py][INFO] Epoch:[0/2](632500/4588595) loss:2.508 lr:0.0000100 epoch_Time:25145.0min: [2024-01-05 12:40:05,087][model8_pretrain.py][INFO] Epoch:[0/2](632500/4588595) loss:3.039 lr:0.0000100 epoch_Time:25145.0min: [2024-01-05 12:40:05,087][model8_pretrain.py][INFO] Epoch:[0/2](632500/4588595) loss:3.038 lr:0.0000100 epoch_Time:25145.0min: [2024-01-05 12:40:05,087][model8_pretrain.py][INFO] Epoch:[0/2](632500/4588595) loss:2.482 lr:0.0000100 epoch_Time:25145.0min: [2024-01-05 12:40:05,087][model8_pretrain.py][INFO] Epoch:[0/2](632500/4588595) loss:3.289 lr:0.0000100 epoch_Time:25145.0min: [2024-01-05 12:40:05,087][model8_pretrain.py][INFO] Epoch:[0/2](632500/4588595) loss:2.697 lr:0.0000100 epoch_Time:25145.0min: [2024-01-05 12:40:42,049][model8_pretrain.py][INFO] Epoch:[0/2](632600/4588595) loss:2.724 lr:0.0000100 epoch_Time:25145.0min: [2024-01-05 12:40:42,049][model8_pretrain.py][INFO] Epoch:[0/2](632600/4588595) loss:3.280 lr:0.0000100 epoch_Time:25145.0min: [2024-01-05 12:40:42,049][model8_pretrain.py][INFO] Epoch:[0/2](632600/4588595) loss:2.976 lr:0.0000100 epoch_Time:25145.0min: [2024-01-05 12:40:42,049][model8_pretrain.py][INFO] Epoch:[0/2](632600/4588595) loss:2.618 lr:0.0000100 epoch_Time:25145.0min: [2024-01-05 12:40:42,049][model8_pretrain.py][INFO] Epoch:[0/2](632600/4588595) loss:3.105 lr:0.0000100 epoch_Time:25145.0min: [2024-01-05 12:40:42,049][model8_pretrain.py][INFO] Epoch:[0/2](632600/4588595) loss:2.866 lr:0.0000100 
epoch_Time:25145.0min: [2024-01-05 12:40:42,049][model8_pretrain.py][INFO] Epoch:[0/2](632600/4588595) loss:2.847 lr:0.0000100 epoch_Time:25145.0min: [2024-01-05 12:40:42,050][model8_pretrain.py][INFO] Epoch:[0/2](632600/4588595) loss:2.766 lr:0.0000100 epoch_Time:25145.0min: [2024-01-05 12:41:19,017][model8_pretrain.py][INFO] Epoch:[0/2](632700/4588595) loss:2.140 lr:0.0000100 epoch_Time:25144.0min: [2024-01-05 12:41:19,017][model8_pretrain.py][INFO] Epoch:[0/2](632700/4588595) loss:1.953 lr:0.0000100 epoch_Time:25144.0min: [2024-01-05 12:41:19,017][model8_pretrain.py][INFO] Epoch:[0/2](632700/4588595) loss:3.214 lr:0.0000100 epoch_Time:25144.0min: [2024-01-05 12:41:19,017][model8_pretrain.py][INFO] Epoch:[0/2](632700/4588595) loss:3.169 lr:0.0000100 epoch_Time:25144.0min: [2024-01-05 12:41:19,017][model8_pretrain.py][INFO] Epoch:[0/2](632700/4588595) loss:2.468 lr:0.0000100 epoch_Time:25144.0min: [2024-01-05 12:41:19,017][model8_pretrain.py][INFO] Epoch:[0/2](632700/4588595) loss:2.755 lr:0.0000100 epoch_Time:25144.0min: [2024-01-05 12:41:19,017][model8_pretrain.py][INFO] Epoch:[0/2](632700/4588595) loss:2.783 lr:0.0000100 epoch_Time:25144.0min: [2024-01-05 12:41:19,017][model8_pretrain.py][INFO] Epoch:[0/2](632700/4588595) loss:3.099 lr:0.0000100 epoch_Time:25144.0min: [2024-01-05 12:41:55,973][model8_pretrain.py][INFO] Epoch:[0/2](632800/4588595) loss:2.858 lr:0.0000100 epoch_Time:25143.0min: [2024-01-05 12:41:55,973][model8_pretrain.py][INFO] Epoch:[0/2](632800/4588595) loss:2.777 lr:0.0000100 epoch_Time:25143.0min: [2024-01-05 12:41:55,973][model8_pretrain.py][INFO] Epoch:[0/2](632800/4588595) loss:3.177 lr:0.0000100 epoch_Time:25143.0min: [2024-01-05 12:41:55,973][model8_pretrain.py][INFO] Epoch:[0/2](632800/4588595) loss:2.508 lr:0.0000100 epoch_Time:25143.0min: [2024-01-05 12:41:55,973][model8_pretrain.py][INFO] Epoch:[0/2](632800/4588595) loss:2.506 lr:0.0000100 epoch_Time:25143.0min: [2024-01-05 12:41:55,973][model8_pretrain.py][INFO] 
Epoch:[0/2](632800/4588595) loss:2.711 lr:0.0000100 epoch_Time:25143.0min: [2024-01-05 12:41:55,973][model8_pretrain.py][INFO] Epoch:[0/2](632800/4588595) loss:2.752 lr:0.0000100 epoch_Time:25143.0min: [2024-01-05 12:41:55,973][model8_pretrain.py][INFO] Epoch:[0/2](632800/4588595) loss:3.249 lr:0.0000100 epoch_Time:25143.0min: [2024-01-05 12:42:32,938][model8_pretrain.py][INFO] Epoch:[0/2](632900/4588595) loss:2.831 lr:0.0000100 epoch_Time:25143.0min: [2024-01-05 12:42:32,938][model8_pretrain.py][INFO] Epoch:[0/2](632900/4588595) loss:2.268 lr:0.0000100 epoch_Time:25143.0min: [2024-01-05 12:42:32,938][model8_pretrain.py][INFO] Epoch:[0/2](632900/4588595) loss:2.535 lr:0.0000100 epoch_Time:25143.0min: [2024-01-05 12:42:32,938][model8_pretrain.py][INFO] Epoch:[0/2](632900/4588595) loss:2.695 lr:0.0000100 epoch_Time:25143.0min: [2024-01-05 12:42:32,938][model8_pretrain.py][INFO] Epoch:[0/2](632900/4588595) loss:2.838 lr:0.0000100 epoch_Time:25143.0min: [2024-01-05 12:42:32,938][model8_pretrain.py][INFO] Epoch:[0/2](632900/4588595) loss:2.690 lr:0.0000100 epoch_Time:25143.0min: [2024-01-05 12:42:32,938][model8_pretrain.py][INFO] Epoch:[0/2](632900/4588595) loss:2.719 lr:0.0000100 epoch_Time:25143.0min: [2024-01-05 12:42:32,939][model8_pretrain.py][INFO] Epoch:[0/2](632900/4588595) loss:2.649 lr:0.0000100 epoch_Time:25143.0min: [2024-01-05 12:43:09,893][model8_pretrain.py][INFO] Epoch:[0/2](633000/4588595) loss:2.823 lr:0.0000100 epoch_Time:25142.0min: [2024-01-05 12:43:09,893][model8_pretrain.py][INFO] Epoch:[0/2](633000/4588595) loss:2.692 lr:0.0000100 epoch_Time:25142.0min: [2024-01-05 12:43:09,893][model8_pretrain.py][INFO] Epoch:[0/2](633000/4588595) loss:3.121 lr:0.0000100 epoch_Time:25142.0min: [2024-01-05 12:43:09,893][model8_pretrain.py][INFO] Epoch:[0/2](633000/4588595) loss:2.599 lr:0.0000100 epoch_Time:25142.0min: [2024-01-05 12:43:09,893][model8_pretrain.py][INFO] Epoch:[0/2](633000/4588595) loss:2.745 lr:0.0000100 epoch_Time:25142.0min: [2024-01-05 
12:43:09,893][model8_pretrain.py][INFO] Epoch:[0/2](633000/4588595) loss:2.775 lr:0.0000100 epoch_Time:25142.0min: [2024-01-05 12:43:09,893][model8_pretrain.py][INFO] Epoch:[0/2](633000/4588595) loss:2.660 lr:0.0000100 epoch_Time:25142.0min: [2024-01-05 12:43:09,893][model8_pretrain.py][INFO] Epoch:[0/2](633000/4588595) loss:3.005 lr:0.0000100 epoch_Time:25142.0min: [2024-01-05 12:43:55,523][model8_pretrain.py][INFO] Epoch:[0/2](633100/4588595) loss:2.925 lr:0.0000100 epoch_Time:25142.0min: [2024-01-05 12:43:55,523][model8_pretrain.py][INFO] Epoch:[0/2](633100/4588595) loss:3.200 lr:0.0000100 epoch_Time:25142.0min: [2024-01-05 12:43:55,523][model8_pretrain.py][INFO] Epoch:[0/2](633100/4588595) loss:2.641 lr:0.0000100 epoch_Time:25142.0min: [2024-01-05 12:43:55,523][model8_pretrain.py][INFO] Epoch:[0/2](633100/4588595) loss:2.999 lr:0.0000100 epoch_Time:25142.0min: [2024-01-05 12:43:55,523][model8_pretrain.py][INFO] Epoch:[0/2](633100/4588595) loss:2.754 lr:0.0000100 epoch_Time:25142.0min: [2024-01-05 12:43:55,523][model8_pretrain.py][INFO] Epoch:[0/2](633100/4588595) loss:2.117 lr:0.0000100 epoch_Time:25142.0min: [2024-01-05 12:43:55,523][model8_pretrain.py][INFO] Epoch:[0/2](633100/4588595) loss:2.829 lr:0.0000100 epoch_Time:25142.0min: [2024-01-05 12:43:55,523][model8_pretrain.py][INFO] Epoch:[0/2](633100/4588595) loss:2.575 lr:0.0000100 epoch_Time:25142.0min: [2024-01-05 12:44:35,948][model8_pretrain.py][INFO] Epoch:[0/2](633200/4588595) loss:3.038 lr:0.0000100 epoch_Time:25142.0min: [2024-01-05 12:44:35,948][model8_pretrain.py][INFO] Epoch:[0/2](633200/4588595) loss:1.856 lr:0.0000100 epoch_Time:25142.0min: [2024-01-05 12:44:35,948][model8_pretrain.py][INFO] Epoch:[0/2](633200/4588595) loss:2.882 lr:0.0000100 epoch_Time:25142.0min: [2024-01-05 12:44:35,948][model8_pretrain.py][INFO] Epoch:[0/2](633200/4588595) loss:3.094 lr:0.0000100 epoch_Time:25142.0min: [2024-01-05 12:44:35,948][model8_pretrain.py][INFO] Epoch:[0/2](633200/4588595) loss:3.130 lr:0.0000100 
epoch_Time:25142.0min: [2024-01-05 12:44:35,949][model8_pretrain.py][INFO] Epoch:[0/2](633200/4588595) loss:3.056 lr:0.0000100 epoch_Time:25142.0min: [2024-01-05 12:44:35,949][model8_pretrain.py][INFO] Epoch:[0/2](633200/4588595) loss:2.563 lr:0.0000100 epoch_Time:25142.0min: [2024-01-05 12:44:35,949][model8_pretrain.py][INFO] Epoch:[0/2](633200/4588595) loss:3.243 lr:0.0000100 epoch_Time:25142.0min: [2024-01-05 12:45:12,908][model8_pretrain.py][INFO] Epoch:[0/2](633300/4588595) loss:3.174 lr:0.0000100 epoch_Time:25141.0min: [2024-01-05 12:45:12,908][model8_pretrain.py][INFO] Epoch:[0/2](633300/4588595) loss:2.393 lr:0.0000100 epoch_Time:25141.0min: [2024-01-05 12:45:12,908][model8_pretrain.py][INFO] Epoch:[0/2](633300/4588595) loss:2.698 lr:0.0000100 epoch_Time:25141.0min: [2024-01-05 12:45:12,908][model8_pretrain.py][INFO] Epoch:[0/2](633300/4588595) loss:3.193 lr:0.0000100 epoch_Time:25141.0min: [2024-01-05 12:45:12,908][model8_pretrain.py][INFO] Epoch:[0/2](633300/4588595) loss:2.808 lr:0.0000100 epoch_Time:25141.0min: [2024-01-05 12:45:12,908][model8_pretrain.py][INFO] Epoch:[0/2](633300/4588595) loss:2.896 lr:0.0000100 epoch_Time:25141.0min: [2024-01-05 12:45:12,908][model8_pretrain.py][INFO] Epoch:[0/2](633300/4588595) loss:3.255 lr:0.0000100 epoch_Time:25141.0min: [2024-01-05 12:45:12,908][model8_pretrain.py][INFO] Epoch:[0/2](633300/4588595) loss:3.079 lr:0.0000100 epoch_Time:25141.0min: [2024-01-05 12:45:49,865][model8_pretrain.py][INFO] Epoch:[0/2](633400/4588595) loss:2.870 lr:0.0000100 epoch_Time:25140.0min: [2024-01-05 12:45:49,865][model8_pretrain.py][INFO] Epoch:[0/2](633400/4588595) loss:2.910 lr:0.0000100 epoch_Time:25140.0min: [2024-01-05 12:45:49,865][model8_pretrain.py][INFO] Epoch:[0/2](633400/4588595) loss:3.001 lr:0.0000100 epoch_Time:25140.0min: [2024-01-05 12:45:49,865][model8_pretrain.py][INFO] Epoch:[0/2](633400/4588595) loss:3.010 lr:0.0000100 epoch_Time:25140.0min: [2024-01-05 12:45:49,865][model8_pretrain.py][INFO] 
Epoch:[0/2](633400/4588595) loss:2.968 lr:0.0000100 epoch_Time:25140.0min: [2024-01-05 12:45:49,865][model8_pretrain.py][INFO] Epoch:[0/2](633400/4588595) loss:2.978 lr:0.0000100 epoch_Time:25140.0min: [2024-01-05 12:45:49,865][model8_pretrain.py][INFO] Epoch:[0/2](633400/4588595) loss:2.990 lr:0.0000100 epoch_Time:25140.0min: [2024-01-05 12:45:49,865][model8_pretrain.py][INFO] Epoch:[0/2](633400/4588595) loss:2.525 lr:0.0000100 epoch_Time:25140.0min: [2024-01-05 12:46:26,827][model8_pretrain.py][INFO] Epoch:[0/2](633500/4588595) loss:3.327 lr:0.0000100 epoch_Time:25139.0min: [2024-01-05 12:46:26,827][model8_pretrain.py][INFO] Epoch:[0/2](633500/4588595) loss:2.624 lr:0.0000100 epoch_Time:25139.0min: [2024-01-05 12:46:26,827][model8_pretrain.py][INFO] Epoch:[0/2](633500/4588595) loss:2.992 lr:0.0000100 epoch_Time:25139.0min: [2024-01-05 12:46:26,827][model8_pretrain.py][INFO] Epoch:[0/2](633500/4588595) loss:2.725 lr:0.0000100 epoch_Time:25139.0min: [2024-01-05 12:46:26,827][model8_pretrain.py][INFO] Epoch:[0/2](633500/4588595) loss:2.858 lr:0.0000100 epoch_Time:25139.0min: [2024-01-05 12:46:26,827][model8_pretrain.py][INFO] Epoch:[0/2](633500/4588595) loss:2.615 lr:0.0000100 epoch_Time:25139.0min: [2024-01-05 12:46:26,827][model8_pretrain.py][INFO] Epoch:[0/2](633500/4588595) loss:2.422 lr:0.0000100 epoch_Time:25139.0min: [2024-01-05 12:46:26,827][model8_pretrain.py][INFO] Epoch:[0/2](633500/4588595) loss:3.194 lr:0.0000100 epoch_Time:25139.0min: [2024-01-05 12:47:03,788][model8_pretrain.py][INFO] Epoch:[0/2](633600/4588595) loss:2.939 lr:0.0000100 epoch_Time:25138.0min: [2024-01-05 12:47:03,788][model8_pretrain.py][INFO] Epoch:[0/2](633600/4588595) loss:2.377 lr:0.0000100 epoch_Time:25138.0min: [2024-01-05 12:47:03,788][model8_pretrain.py][INFO] Epoch:[0/2](633600/4588595) loss:2.916 lr:0.0000100 epoch_Time:25138.0min: [2024-01-05 12:47:03,788][model8_pretrain.py][INFO] Epoch:[0/2](633600/4588595) loss:2.970 lr:0.0000100 epoch_Time:25138.0min: [2024-01-05 
12:47:03,788][model8_pretrain.py][INFO] Epoch:[0/2](633600/4588595) loss:2.820 lr:0.0000100 epoch_Time:25138.0min: [2024-01-05 12:47:03,789][model8_pretrain.py][INFO] Epoch:[0/2](633600/4588595) loss:2.845 lr:0.0000100 epoch_Time:25138.0min: [2024-01-05 12:47:03,789][model8_pretrain.py][INFO] Epoch:[0/2](633600/4588595) loss:2.865 lr:0.0000100 epoch_Time:25138.0min: [2024-01-05 12:47:03,789][model8_pretrain.py][INFO] Epoch:[0/2](633600/4588595) loss:2.814 lr:0.0000100 epoch_Time:25138.0min: [2024-01-05 12:47:40,766][model8_pretrain.py][INFO] Epoch:[0/2](633700/4588595) loss:2.831 lr:0.0000100 epoch_Time:25138.0min: [2024-01-05 12:47:40,766][model8_pretrain.py][INFO] Epoch:[0/2](633700/4588595) loss:2.885 lr:0.0000100 epoch_Time:25138.0min: [2024-01-05 12:47:40,766][model8_pretrain.py][INFO] Epoch:[0/2](633700/4588595) loss:2.733 lr:0.0000100 epoch_Time:25138.0min: [2024-01-05 12:47:40,766][model8_pretrain.py][INFO] Epoch:[0/2](633700/4588595) loss:2.794 lr:0.0000100 epoch_Time:25138.0min: [2024-01-05 12:47:40,766][model8_pretrain.py][INFO] Epoch:[0/2](633700/4588595) loss:2.974 lr:0.0000100 epoch_Time:25138.0min: [2024-01-05 12:47:40,766][model8_pretrain.py][INFO] Epoch:[0/2](633700/4588595) loss:2.864 lr:0.0000100 epoch_Time:25138.0min: [2024-01-05 12:47:40,766][model8_pretrain.py][INFO] Epoch:[0/2](633700/4588595) loss:3.046 lr:0.0000100 epoch_Time:25138.0min: [2024-01-05 12:47:40,766][model8_pretrain.py][INFO] Epoch:[0/2](633700/4588595) loss:2.876 lr:0.0000100 epoch_Time:25138.0min: [2024-01-05 12:48:17,699][model8_pretrain.py][INFO] Epoch:[0/2](633800/4588595) loss:2.650 lr:0.0000100 epoch_Time:25137.0min: [2024-01-05 12:48:17,700][model8_pretrain.py][INFO] Epoch:[0/2](633800/4588595) loss:2.184 lr:0.0000100 epoch_Time:25137.0min: [2024-01-05 12:48:17,699][model8_pretrain.py][INFO] Epoch:[0/2](633800/4588595) loss:2.680 lr:0.0000100 epoch_Time:25137.0min: [2024-01-05 12:48:17,700][model8_pretrain.py][INFO] Epoch:[0/2](633800/4588595) loss:2.531 lr:0.0000100 
epoch_Time:25137.0min: [2024-01-05 12:48:17,700][model8_pretrain.py][INFO] Epoch:[0/2](633800/4588595) loss:3.460 lr:0.0000100 epoch_Time:25137.0min: [2024-01-05 12:48:17,700][model8_pretrain.py][INFO] Epoch:[0/2](633800/4588595) loss:3.443 lr:0.0000100 epoch_Time:25137.0min: [2024-01-05 12:48:17,700][model8_pretrain.py][INFO] Epoch:[0/2](633800/4588595) loss:2.897 lr:0.0000100 epoch_Time:25137.0min: [2024-01-05 12:48:17,700][model8_pretrain.py][INFO] Epoch:[0/2](633800/4588595) loss:2.489 lr:0.0000100 epoch_Time:25137.0min: [2024-01-05 12:49:01,530][model8_pretrain.py][INFO] Epoch:[0/2](633900/4588595) loss:2.809 lr:0.0000100 epoch_Time:25137.0min: [2024-01-05 12:49:01,530][model8_pretrain.py][INFO] Epoch:[0/2](633900/4588595) loss:2.962 lr:0.0000100 epoch_Time:25137.0min: [2024-01-05 12:49:01,530][model8_pretrain.py][INFO] Epoch:[0/2](633900/4588595) loss:3.514 lr:0.0000100 epoch_Time:25137.0min: [2024-01-05 12:49:01,535][model8_pretrain.py][INFO] Epoch:[0/2](633900/4588595) loss:2.419 lr:0.0000100 epoch_Time:25137.0min: [2024-01-05 12:49:01,535][model8_pretrain.py][INFO] Epoch:[0/2](633900/4588595) loss:2.662 lr:0.0000100 epoch_Time:25137.0min: [2024-01-05 12:49:01,535][model8_pretrain.py][INFO] Epoch:[0/2](633900/4588595) loss:2.031 lr:0.0000100 epoch_Time:25137.0min: [2024-01-05 12:49:01,535][model8_pretrain.py][INFO] Epoch:[0/2](633900/4588595) loss:3.222 lr:0.0000100 epoch_Time:25137.0min: [2024-01-05 12:49:01,536][model8_pretrain.py][INFO] Epoch:[0/2](633900/4588595) loss:3.022 lr:0.0000100 epoch_Time:25137.0min: [2024-01-05 12:49:43,717][model8_pretrain.py][INFO] Epoch:[0/2](634000/4588595) loss:2.902 lr:0.0000100 epoch_Time:25137.0min: [2024-01-05 12:49:43,717][model8_pretrain.py][INFO] Epoch:[0/2](634000/4588595) loss:2.698 lr:0.0000100 epoch_Time:25137.0min: [2024-01-05 12:49:43,717][model8_pretrain.py][INFO] Epoch:[0/2](634000/4588595) loss:2.826 lr:0.0000100 epoch_Time:25137.0min: [2024-01-05 12:49:43,717][model8_pretrain.py][INFO] 
Epoch:[0/2](634000/4588595) loss:2.927 lr:0.0000100 epoch_Time:25137.0min: [2024-01-05 12:49:43,717][model8_pretrain.py][INFO] Epoch:[0/2](634000/4588595) loss:2.691 lr:0.0000100 epoch_Time:25137.0min: [2024-01-05 12:49:43,717][model8_pretrain.py][INFO] Epoch:[0/2](634000/4588595) loss:2.908 lr:0.0000100 epoch_Time:25137.0min: [2024-01-05 12:49:43,717][model8_pretrain.py][INFO] Epoch:[0/2](634000/4588595) loss:2.987 lr:0.0000100 epoch_Time:25137.0min: [2024-01-05 12:49:43,718][model8_pretrain.py][INFO] Epoch:[0/2](634000/4588595) loss:2.844 lr:0.0000100 epoch_Time:25137.0min: [2024-01-05 12:50:20,661][model8_pretrain.py][INFO] Epoch:[0/2](634100/4588595) loss:2.597 lr:0.0000100 epoch_Time:25136.0min: [2024-01-05 12:50:20,661][model8_pretrain.py][INFO] Epoch:[0/2](634100/4588595) loss:3.218 lr:0.0000100 epoch_Time:25136.0min: [2024-01-05 12:50:20,661][model8_pretrain.py][INFO] Epoch:[0/2](634100/4588595) loss:2.890 lr:0.0000100 epoch_Time:25136.0min: [2024-01-05 12:50:20,661][model8_pretrain.py][INFO] Epoch:[0/2](634100/4588595) loss:2.940 lr:0.0000100 epoch_Time:25136.0min: [2024-01-05 12:50:20,661][model8_pretrain.py][INFO] Epoch:[0/2](634100/4588595) loss:2.588 lr:0.0000100 epoch_Time:25136.0min: [2024-01-05 12:50:20,661][model8_pretrain.py][INFO] Epoch:[0/2](634100/4588595) loss:2.636 lr:0.0000100 epoch_Time:25136.0min: [2024-01-05 12:50:20,661][model8_pretrain.py][INFO] Epoch:[0/2](634100/4588595) loss:3.098 lr:0.0000100 epoch_Time:25136.0min: [2024-01-05 12:50:20,661][model8_pretrain.py][INFO] Epoch:[0/2](634100/4588595) loss:2.018 lr:0.0000100 epoch_Time:25136.0min: [2024-01-05 12:50:57,604][model8_pretrain.py][INFO] Epoch:[0/2](634200/4588595) loss:2.681 lr:0.0000100 epoch_Time:25135.0min: [2024-01-05 12:50:57,604][model8_pretrain.py][INFO] Epoch:[0/2](634200/4588595) loss:3.107 lr:0.0000100 epoch_Time:25135.0min: [2024-01-05 12:50:57,604][model8_pretrain.py][INFO] Epoch:[0/2](634200/4588595) loss:2.884 lr:0.0000100 epoch_Time:25135.0min: [2024-01-05 
12:50:57,604][model8_pretrain.py][INFO] Epoch:[0/2](634200/4588595) loss:3.086 lr:0.0000100 epoch_Time:25135.0min: [2024-01-05 12:50:57,604][model8_pretrain.py][INFO] Epoch:[0/2](634200/4588595) loss:2.093 lr:0.0000100 epoch_Time:25135.0min: [2024-01-05 12:50:57,604][model8_pretrain.py][INFO] Epoch:[0/2](634200/4588595) loss:2.688 lr:0.0000100 epoch_Time:25135.0min: [2024-01-05 12:50:57,604][model8_pretrain.py][INFO] Epoch:[0/2](634200/4588595) loss:3.349 lr:0.0000100 epoch_Time:25135.0min: [2024-01-05 12:50:57,604][model8_pretrain.py][INFO] Epoch:[0/2](634200/4588595) loss:2.547 lr:0.0000100 epoch_Time:25135.0min: [2024-01-05 12:51:34,553][model8_pretrain.py][INFO] Epoch:[0/2](634300/4588595) loss:2.749 lr:0.0000100 epoch_Time:25135.0min: [2024-01-05 12:51:34,553][model8_pretrain.py][INFO] Epoch:[0/2](634300/4588595) loss:3.089 lr:0.0000100 epoch_Time:25135.0min: [2024-01-05 12:51:34,553][model8_pretrain.py][INFO] Epoch:[0/2](634300/4588595) loss:2.723 lr:0.0000100 epoch_Time:25135.0min: [2024-01-05 12:51:34,553][model8_pretrain.py][INFO] Epoch:[0/2](634300/4588595) loss:2.700 lr:0.0000100 epoch_Time:25135.0min: [2024-01-05 12:51:34,553][model8_pretrain.py][INFO] Epoch:[0/2](634300/4588595) loss:2.564 lr:0.0000100 epoch_Time:25135.0min: [2024-01-05 12:51:34,553][model8_pretrain.py][INFO] Epoch:[0/2](634300/4588595) loss:2.665 lr:0.0000100 epoch_Time:25135.0min: [2024-01-05 12:51:34,553][model8_pretrain.py][INFO] Epoch:[0/2](634300/4588595) loss:2.772 lr:0.0000100 epoch_Time:25135.0min: [2024-01-05 12:51:34,553][model8_pretrain.py][INFO] Epoch:[0/2](634300/4588595) loss:2.818 lr:0.0000100 epoch_Time:25135.0min: [2024-01-05 12:52:11,497][model8_pretrain.py][INFO] Epoch:[0/2](634400/4588595) loss:3.096 lr:0.0000100 epoch_Time:25134.0min: [2024-01-05 12:52:11,497][model8_pretrain.py][INFO] Epoch:[0/2](634400/4588595) loss:2.627 lr:0.0000100 epoch_Time:25134.0min: [2024-01-05 12:52:11,497][model8_pretrain.py][INFO] Epoch:[0/2](634400/4588595) loss:2.162 lr:0.0000100 
epoch_Time:25134.0min: [2024-01-05 12:52:11,497][model8_pretrain.py][INFO] Epoch:[0/2](634400/4588595) loss:2.406 lr:0.0000100 epoch_Time:25134.0min: [2024-01-05 12:52:11,497][model8_pretrain.py][INFO] Epoch:[0/2](634400/4588595) loss:2.872 lr:0.0000100 epoch_Time:25134.0min: [2024-01-05 12:52:11,497][model8_pretrain.py][INFO] Epoch:[0/2](634400/4588595) loss:2.913 lr:0.0000100 epoch_Time:25134.0min: [2024-01-05 12:52:11,497][model8_pretrain.py][INFO] Epoch:[0/2](634400/4588595) loss:2.870 lr:0.0000100 epoch_Time:25134.0min: [2024-01-05 12:52:11,497][model8_pretrain.py][INFO] Epoch:[0/2](634400/4588595) loss:2.866 lr:0.0000100 epoch_Time:25134.0min: [2024-01-05 12:52:48,438][model8_pretrain.py][INFO] Epoch:[0/2](634500/4588595) loss:2.632 lr:0.0000100 epoch_Time:25133.0min: [2024-01-05 12:52:48,438][model8_pretrain.py][INFO] Epoch:[0/2](634500/4588595) loss:3.074 lr:0.0000100 epoch_Time:25133.0min: [2024-01-05 12:52:48,438][model8_pretrain.py][INFO] Epoch:[0/2](634500/4588595) loss:2.513 lr:0.0000100 epoch_Time:25133.0min: [2024-01-05 12:52:48,438][model8_pretrain.py][INFO] Epoch:[0/2](634500/4588595) loss:2.817 lr:0.0000100 epoch_Time:25133.0min: [2024-01-05 12:52:48,438][model8_pretrain.py][INFO] Epoch:[0/2](634500/4588595) loss:2.767 lr:0.0000100 epoch_Time:25133.0min: [2024-01-05 12:52:48,438][model8_pretrain.py][INFO] Epoch:[0/2](634500/4588595) loss:3.336 lr:0.0000100 epoch_Time:25133.0min: [2024-01-05 12:52:48,438][model8_pretrain.py][INFO] Epoch:[0/2](634500/4588595) loss:3.189 lr:0.0000100 epoch_Time:25133.0min: [2024-01-05 12:52:48,438][model8_pretrain.py][INFO] Epoch:[0/2](634500/4588595) loss:2.587 lr:0.0000100 epoch_Time:25133.0min: [2024-01-05 12:53:25,375][model8_pretrain.py][INFO] Epoch:[0/2](634600/4588595) loss:2.716 lr:0.0000100 epoch_Time:25132.0min: [2024-01-05 12:53:25,375][model8_pretrain.py][INFO] Epoch:[0/2](634600/4588595) loss:2.890 lr:0.0000100 epoch_Time:25132.0min: [2024-01-05 12:53:25,375][model8_pretrain.py][INFO] 
Epoch:[0/2](634600/4588595) loss:2.812 lr:0.0000100 epoch_Time:25132.0min: [2024-01-05 12:53:25,375][model8_pretrain.py][INFO] Epoch:[0/2](634600/4588595) loss:2.890 lr:0.0000100 epoch_Time:25132.0min: [2024-01-05 12:53:25,375][model8_pretrain.py][INFO] Epoch:[0/2](634600/4588595) loss:2.821 lr:0.0000100 epoch_Time:25132.0min: [2024-01-05 12:53:25,375][model8_pretrain.py][INFO] Epoch:[0/2](634600/4588595) loss:2.368 lr:0.0000100 epoch_Time:25132.0min: [2024-01-05 12:53:25,375][model8_pretrain.py][INFO] Epoch:[0/2](634600/4588595) loss:3.078 lr:0.0000100 epoch_Time:25132.0min: [2024-01-05 12:53:25,375][model8_pretrain.py][INFO] Epoch:[0/2](634600/4588595) loss:2.968 lr:0.0000100 epoch_Time:25132.0min: [2024-01-05 12:54:05,778][model8_pretrain.py][INFO] Epoch:[0/2](634700/4588595) loss:3.021 lr:0.0000100 epoch_Time:25132.0min: [2024-01-05 12:54:05,778][model8_pretrain.py][INFO] Epoch:[0/2](634700/4588595) loss:3.002 lr:0.0000100 epoch_Time:25132.0min: [2024-01-05 12:54:05,778][model8_pretrain.py][INFO] Epoch:[0/2](634700/4588595) loss:2.847 lr:0.0000100 epoch_Time:25132.0min: [2024-01-05 12:54:05,778][model8_pretrain.py][INFO] Epoch:[0/2](634700/4588595) loss:2.369 lr:0.0000100 epoch_Time:25132.0min: [2024-01-05 12:54:05,778][model8_pretrain.py][INFO] Epoch:[0/2](634700/4588595) loss:2.901 lr:0.0000100 epoch_Time:25132.0min: [2024-01-05 12:54:05,778][model8_pretrain.py][INFO] Epoch:[0/2](634700/4588595) loss:2.875 lr:0.0000100 epoch_Time:25132.0min: [2024-01-05 12:54:05,778][model8_pretrain.py][INFO] Epoch:[0/2](634700/4588595) loss:2.947 lr:0.0000100 epoch_Time:25132.0min: [2024-01-05 12:54:05,778][model8_pretrain.py][INFO] Epoch:[0/2](634700/4588595) loss:3.358 lr:0.0000100 epoch_Time:25132.0min: [2024-01-05 12:54:51,143][model8_pretrain.py][INFO] Epoch:[0/2](634800/4588595) loss:3.277 lr:0.0000100 epoch_Time:25132.0min: [2024-01-05 12:54:51,143][model8_pretrain.py][INFO] Epoch:[0/2](634800/4588595) loss:3.033 lr:0.0000100 epoch_Time:25132.0min: [2024-01-05 
12:54:51,144][model8_pretrain.py][INFO] Epoch:[0/2](634800/4588595) loss:2.866 lr:0.0000100 epoch_Time:25132.0min: [2024-01-05 12:54:51,144][model8_pretrain.py][INFO] Epoch:[0/2](634800/4588595) loss:3.113 lr:0.0000100 epoch_Time:25132.0min: [2024-01-05 12:54:51,144][model8_pretrain.py][INFO] Epoch:[0/2](634800/4588595) loss:2.845 lr:0.0000100 epoch_Time:25132.0min: [2024-01-05 12:54:51,144][model8_pretrain.py][INFO] Epoch:[0/2](634800/4588595) loss:2.533 lr:0.0000100 epoch_Time:25132.0min: [2024-01-05 12:54:51,144][model8_pretrain.py][INFO] Epoch:[0/2](634800/4588595) loss:3.030 lr:0.0000100 epoch_Time:25132.0min: [2024-01-05 12:54:51,144][model8_pretrain.py][INFO] Epoch:[0/2](634800/4588595) loss:2.809 lr:0.0000100 epoch_Time:25132.0min: [2024-01-05 12:55:28,084][model8_pretrain.py][INFO] Epoch:[0/2](634900/4588595) loss:2.987 lr:0.0000100 epoch_Time:25131.0min: [2024-01-05 12:55:28,084][model8_pretrain.py][INFO] Epoch:[0/2](634900/4588595) loss:2.649 lr:0.0000100 epoch_Time:25131.0min: [2024-01-05 12:55:28,084][model8_pretrain.py][INFO] Epoch:[0/2](634900/4588595) loss:2.783 lr:0.0000100 epoch_Time:25131.0min: [2024-01-05 12:55:28,084][model8_pretrain.py][INFO] Epoch:[0/2](634900/4588595) loss:2.641 lr:0.0000100 epoch_Time:25131.0min: [2024-01-05 12:55:28,084][model8_pretrain.py][INFO] Epoch:[0/2](634900/4588595) loss:2.893 lr:0.0000100 epoch_Time:25131.0min: [2024-01-05 12:55:28,085][model8_pretrain.py][INFO] Epoch:[0/2](634900/4588595) loss:3.208 lr:0.0000100 epoch_Time:25131.0min: [2024-01-05 12:55:28,085][model8_pretrain.py][INFO] Epoch:[0/2](634900/4588595) loss:3.140 lr:0.0000100 epoch_Time:25131.0min: [2024-01-05 12:55:28,085][model8_pretrain.py][INFO] Epoch:[0/2](634900/4588595) loss:3.121 lr:0.0000100 epoch_Time:25131.0min: [2024-01-05 12:56:05,040][model8_pretrain.py][INFO] Epoch:[0/2](635000/4588595) loss:2.552 lr:0.0000100 epoch_Time:25130.0min: [2024-01-05 12:56:05,040][model8_pretrain.py][INFO] Epoch:[0/2](635000/4588595) loss:2.685 lr:0.0000100 
epoch_Time:25130.0min: [2024-01-05 12:56:05,040][model8_pretrain.py][INFO] Epoch:[0/2](635000/4588595) loss:2.886 lr:0.0000100 epoch_Time:25130.0min: [2024-01-05 12:56:05,040][model8_pretrain.py][INFO] Epoch:[0/2](635000/4588595) loss:3.028 lr:0.0000100 epoch_Time:25130.0min: [2024-01-05 12:56:05,040][model8_pretrain.py][INFO] Epoch:[0/2](635000/4588595) loss:2.675 lr:0.0000100 epoch_Time:25130.0min: [2024-01-05 12:56:05,040][model8_pretrain.py][INFO] Epoch:[0/2](635000/4588595) loss:2.742 lr:0.0000100 epoch_Time:25130.0min: [2024-01-05 12:56:05,040][model8_pretrain.py][INFO] Epoch:[0/2](635000/4588595) loss:2.924 lr:0.0000100 epoch_Time:25130.0min: [2024-01-05 12:56:05,040][model8_pretrain.py][INFO] Epoch:[0/2](635000/4588595) loss:2.654 lr:0.0000100 epoch_Time:25130.0min: [2024-01-05 12:56:41,983][model8_pretrain.py][INFO] Epoch:[0/2](635100/4588595) loss:2.682 lr:0.0000100 epoch_Time:25130.0min: [2024-01-05 12:56:41,983][model8_pretrain.py][INFO] Epoch:[0/2](635100/4588595) loss:2.046 lr:0.0000100 epoch_Time:25130.0min: [2024-01-05 12:56:41,983][model8_pretrain.py][INFO] Epoch:[0/2](635100/4588595) loss:2.754 lr:0.0000100 epoch_Time:25130.0min: [2024-01-05 12:56:41,983][model8_pretrain.py][INFO] Epoch:[0/2](635100/4588595) loss:2.745 lr:0.0000100 epoch_Time:25130.0min: [2024-01-05 12:56:41,983][model8_pretrain.py][INFO] Epoch:[0/2](635100/4588595) loss:2.918 lr:0.0000100 epoch_Time:25130.0min: [2024-01-05 12:56:41,983][model8_pretrain.py][INFO] Epoch:[0/2](635100/4588595) loss:2.921 lr:0.0000100 epoch_Time:25130.0min: [2024-01-05 12:56:41,983][model8_pretrain.py][INFO] Epoch:[0/2](635100/4588595) loss:2.851 lr:0.0000100 epoch_Time:25130.0min: [2024-01-05 12:56:41,983][model8_pretrain.py][INFO] Epoch:[0/2](635100/4588595) loss:3.595 lr:0.0000100 epoch_Time:25130.0min: [2024-01-05 12:57:18,922][model8_pretrain.py][INFO] Epoch:[0/2](635200/4588595) loss:2.905 lr:0.0000100 epoch_Time:25129.0min: [2024-01-05 12:57:18,922][model8_pretrain.py][INFO] 
Epoch:[0/2](635200/4588595) loss:2.782 lr:0.0000100 epoch_Time:25129.0min: [2024-01-05 12:57:18,922][model8_pretrain.py][INFO] Epoch:[0/2](635200/4588595) loss:2.726 lr:0.0000100 epoch_Time:25129.0min: [2024-01-05 12:57:18,922][model8_pretrain.py][INFO] Epoch:[0/2](635200/4588595) loss:2.207 lr:0.0000100 epoch_Time:25129.0min: [2024-01-05 12:57:18,922][model8_pretrain.py][INFO] Epoch:[0/2](635200/4588595) loss:2.892 lr:0.0000100 epoch_Time:25129.0min: [2024-01-05 12:57:18,923][model8_pretrain.py][INFO] Epoch:[0/2](635200/4588595) loss:2.927 lr:0.0000100 epoch_Time:25129.0min: [2024-01-05 12:57:18,922][model8_pretrain.py][INFO] Epoch:[0/2](635200/4588595) loss:2.239 lr:0.0000100 epoch_Time:25129.0min: [2024-01-05 12:57:18,923][model8_pretrain.py][INFO] Epoch:[0/2](635200/4588595) loss:3.587 lr:0.0000100 epoch_Time:25129.0min: [2024-01-05 12:57:55,853][model8_pretrain.py][INFO] Epoch:[0/2](635300/4588595) loss:3.068 lr:0.0000100 epoch_Time:25128.0min: [2024-01-05 12:57:55,853][model8_pretrain.py][INFO] Epoch:[0/2](635300/4588595) loss:2.097 lr:0.0000100 epoch_Time:25128.0min: [2024-01-05 12:57:55,853][model8_pretrain.py][INFO] Epoch:[0/2](635300/4588595) loss:2.891 lr:0.0000100 epoch_Time:25128.0min: [2024-01-05 12:57:55,853][model8_pretrain.py][INFO] Epoch:[0/2](635300/4588595) loss:2.725 lr:0.0000100 epoch_Time:25128.0min: [2024-01-05 12:57:55,853][model8_pretrain.py][INFO] Epoch:[0/2](635300/4588595) loss:2.430 lr:0.0000100 epoch_Time:25128.0min: [2024-01-05 12:57:55,853][model8_pretrain.py][INFO] Epoch:[0/2](635300/4588595) loss:2.941 lr:0.0000100 epoch_Time:25128.0min: [2024-01-05 12:57:55,854][model8_pretrain.py][INFO] Epoch:[0/2](635300/4588595) loss:2.390 lr:0.0000100 epoch_Time:25128.0min: [2024-01-05 12:57:55,854][model8_pretrain.py][INFO] Epoch:[0/2](635300/4588595) loss:2.640 lr:0.0000100 epoch_Time:25128.0min: [2024-01-05 12:58:32,788][model8_pretrain.py][INFO] Epoch:[0/2](635400/4588595) loss:2.934 lr:0.0000100 epoch_Time:25128.0min: [2024-01-05 
12:58:32,788][model8_pretrain.py][INFO] Epoch:[0/2](635400/4588595) loss:2.544 lr:0.0000100 epoch_Time:25128.0min: [2024-01-05 12:58:32,788][model8_pretrain.py][INFO] Epoch:[0/2](635400/4588595) loss:2.957 lr:0.0000100 epoch_Time:25128.0min: [2024-01-05 12:58:32,788][model8_pretrain.py][INFO] Epoch:[0/2](635400/4588595) loss:2.858 lr:0.0000100 epoch_Time:25128.0min: [2024-01-05 12:58:32,788][model8_pretrain.py][INFO] Epoch:[0/2](635400/4588595) loss:2.470 lr:0.0000100 epoch_Time:25128.0min: [2024-01-05 12:58:32,788][model8_pretrain.py][INFO] Epoch:[0/2](635400/4588595) loss:2.720 lr:0.0000100 epoch_Time:25128.0min: [2024-01-05 12:58:32,788][model8_pretrain.py][INFO] Epoch:[0/2](635400/4588595) loss:2.710 lr:0.0000100 epoch_Time:25128.0min: [2024-01-05 12:58:32,788][model8_pretrain.py][INFO] Epoch:[0/2](635400/4588595) loss:2.667 lr:0.0000100 epoch_Time:25128.0min: [2024-01-05 12:59:13,175][model8_pretrain.py][INFO] Epoch:[0/2](635500/4588595) loss:2.820 lr:0.0000100 epoch_Time:25127.0min: [2024-01-05 12:59:13,175][model8_pretrain.py][INFO] Epoch:[0/2](635500/4588595) loss:2.870 lr:0.0000100 epoch_Time:25127.0min: [2024-01-05 12:59:13,175][model8_pretrain.py][INFO] Epoch:[0/2](635500/4588595) loss:3.027 lr:0.0000100 epoch_Time:25127.0min: [2024-01-05 12:59:13,175][model8_pretrain.py][INFO] Epoch:[0/2](635500/4588595) loss:2.478 lr:0.0000100 epoch_Time:25127.0min: [2024-01-05 12:59:13,175][model8_pretrain.py][INFO] Epoch:[0/2](635500/4588595) loss:2.746 lr:0.0000100 epoch_Time:25127.0min: [2024-01-05 12:59:13,175][model8_pretrain.py][INFO] Epoch:[0/2](635500/4588595) loss:3.443 lr:0.0000100 epoch_Time:25127.0min: [2024-01-05 12:59:13,176][model8_pretrain.py][INFO] Epoch:[0/2](635500/4588595) loss:2.473 lr:0.0000100 epoch_Time:25127.0min: [2024-01-05 12:59:13,176][model8_pretrain.py][INFO] Epoch:[0/2](635500/4588595) loss:3.068 lr:0.0000100 epoch_Time:25127.0min: [2024-01-05 12:59:58,593][model8_pretrain.py][INFO] Epoch:[0/2](635600/4588595) loss:2.831 lr:0.0000100 
epoch_Time:25127.0min: [2024-01-05 12:59:58,593][model8_pretrain.py][INFO] Epoch:[0/2](635600/4588595) loss:2.841 lr:0.0000100 epoch_Time:25127.0min: [2024-01-05 12:59:58,593][model8_pretrain.py][INFO] Epoch:[0/2](635600/4588595) loss:2.788 lr:0.0000100 epoch_Time:25127.0min: [2024-01-05 12:59:58,593][model8_pretrain.py][INFO] Epoch:[0/2](635600/4588595) loss:2.740 lr:0.0000100 epoch_Time:25127.0min: [2024-01-05 12:59:58,593][model8_pretrain.py][INFO] Epoch:[0/2](635600/4588595) loss:2.534 lr:0.0000100 epoch_Time:25127.0min: [2024-01-05 12:59:58,593][model8_pretrain.py][INFO] Epoch:[0/2](635600/4588595) loss:2.320 lr:0.0000100 epoch_Time:25127.0min: [2024-01-05 12:59:58,593][model8_pretrain.py][INFO] Epoch:[0/2](635600/4588595) loss:3.188 lr:0.0000100 epoch_Time:25127.0min: [2024-01-05 12:59:58,593][model8_pretrain.py][INFO] Epoch:[0/2](635600/4588595) loss:2.520 lr:0.0000100 epoch_Time:25127.0min: [2024-01-05 13:00:35,537][model8_pretrain.py][INFO] Epoch:[0/2](635700/4588595) loss:2.880 lr:0.0000100 epoch_Time:25127.0min: [2024-01-05 13:00:35,537][model8_pretrain.py][INFO] Epoch:[0/2](635700/4588595) loss:2.802 lr:0.0000100 epoch_Time:25127.0min: [2024-01-05 13:00:35,537][model8_pretrain.py][INFO] Epoch:[0/2](635700/4588595) loss:2.380 lr:0.0000100 epoch_Time:25127.0min: [2024-01-05 13:00:35,538][model8_pretrain.py][INFO] Epoch:[0/2](635700/4588595) loss:2.640 lr:0.0000100 epoch_Time:25127.0min: [2024-01-05 13:00:35,537][model8_pretrain.py][INFO] Epoch:[0/2](635700/4588595) loss:3.093 lr:0.0000100 epoch_Time:25127.0min: [2024-01-05 13:00:35,537][model8_pretrain.py][INFO] Epoch:[0/2](635700/4588595) loss:2.667 lr:0.0000100 epoch_Time:25127.0min: [2024-01-05 13:00:35,538][model8_pretrain.py][INFO] Epoch:[0/2](635700/4588595) loss:2.957 lr:0.0000100 epoch_Time:25127.0min: [2024-01-05 13:00:35,538][model8_pretrain.py][INFO] Epoch:[0/2](635700/4588595) loss:2.733 lr:0.0000100 epoch_Time:25127.0min: [2024-01-05 13:01:12,491][model8_pretrain.py][INFO] 
Epoch:[0/2](635800/4588595) loss:1.704 lr:0.0000100 epoch_Time:25126.0min: [2024-01-05 13:01:12,491][model8_pretrain.py][INFO] Epoch:[0/2](635800/4588595) loss:2.562 lr:0.0000100 epoch_Time:25126.0min: [2024-01-05 13:01:12,491][model8_pretrain.py][INFO] Epoch:[0/2](635800/4588595) loss:3.047 lr:0.0000100 epoch_Time:25126.0min: [2024-01-05 13:01:12,491][model8_pretrain.py][INFO] Epoch:[0/2](635800/4588595) loss:2.428 lr:0.0000100 epoch_Time:25126.0min: [2024-01-05 13:01:12,491][model8_pretrain.py][INFO] Epoch:[0/2](635800/4588595) loss:2.644 lr:0.0000100 epoch_Time:25126.0min: [2024-01-05 13:01:12,491][model8_pretrain.py][INFO] Epoch:[0/2](635800/4588595) loss:3.194 lr:0.0000100 epoch_Time:25126.0min: [2024-01-05 13:01:12,491][model8_pretrain.py][INFO] Epoch:[0/2](635800/4588595) loss:2.943 lr:0.0000100 epoch_Time:25126.0min: [2024-01-05 13:01:12,491][model8_pretrain.py][INFO] Epoch:[0/2](635800/4588595) loss:2.455 lr:0.0000100 epoch_Time:25126.0min: [2024-01-05 13:01:49,410][model8_pretrain.py][INFO] Epoch:[0/2](635900/4588595) loss:3.072 lr:0.0000100 epoch_Time:25124.0min: [2024-01-05 13:01:49,410][model8_pretrain.py][INFO] Epoch:[0/2](635900/4588595) loss:2.280 lr:0.0000100 epoch_Time:25124.0min: [2024-01-05 13:01:49,410][model8_pretrain.py][INFO] Epoch:[0/2](635900/4588595) loss:2.299 lr:0.0000100 epoch_Time:25124.0min: [2024-01-05 13:01:49,410][model8_pretrain.py][INFO] Epoch:[0/2](635900/4588595) loss:2.703 lr:0.0000100 epoch_Time:25124.0min: [2024-01-05 13:01:49,410][model8_pretrain.py][INFO] Epoch:[0/2](635900/4588595) loss:2.528 lr:0.0000100 epoch_Time:25124.0min: [2024-01-05 13:01:49,410][model8_pretrain.py][INFO] Epoch:[0/2](635900/4588595) loss:2.619 lr:0.0000100 epoch_Time:25124.0min: [2024-01-05 13:01:49,410][model8_pretrain.py][INFO] Epoch:[0/2](635900/4588595) loss:3.189 lr:0.0000100 epoch_Time:25124.0min: [2024-01-05 13:01:49,410][model8_pretrain.py][INFO] Epoch:[0/2](635900/4588595) loss:2.871 lr:0.0000100 epoch_Time:25124.0min: [2024-01-05 
13:02:26,353][model8_pretrain.py][INFO] Epoch:[0/2](636000/4588595) loss:3.046 lr:0.0000100 epoch_Time:25124.0min: [2024-01-05 13:02:26,353][model8_pretrain.py][INFO] Epoch:[0/2](636000/4588595) loss:2.845 lr:0.0000100 epoch_Time:25124.0min: [2024-01-05 13:02:26,353][model8_pretrain.py][INFO] Epoch:[0/2](636000/4588595) loss:2.132 lr:0.0000100 epoch_Time:25124.0min: [2024-01-05 13:02:26,353][model8_pretrain.py][INFO] Epoch:[0/2](636000/4588595) loss:2.778 lr:0.0000100 epoch_Time:25124.0min: [2024-01-05 13:02:26,353][model8_pretrain.py][INFO] Epoch:[0/2](636000/4588595) loss:2.434 lr:0.0000100 epoch_Time:25124.0min: [2024-01-05 13:02:26,353][model8_pretrain.py][INFO] Epoch:[0/2](636000/4588595) loss:2.642 lr:0.0000100 epoch_Time:25124.0min: [2024-01-05 13:02:26,353][model8_pretrain.py][INFO] Epoch:[0/2](636000/4588595) loss:2.916 lr:0.0000100 epoch_Time:25124.0min: [2024-01-05 13:02:26,353][model8_pretrain.py][INFO] Epoch:[0/2](636000/4588595) loss:2.281 lr:0.0000100 epoch_Time:25124.0min: [2024-01-05 13:03:03,301][model8_pretrain.py][INFO] Epoch:[0/2](636100/4588595) loss:2.155 lr:0.0000100 epoch_Time:25123.0min: [2024-01-05 13:03:03,301][model8_pretrain.py][INFO] Epoch:[0/2](636100/4588595) loss:2.717 lr:0.0000100 epoch_Time:25123.0min: [2024-01-05 13:03:03,301][model8_pretrain.py][INFO] Epoch:[0/2](636100/4588595) loss:2.675 lr:0.0000100 epoch_Time:25123.0min: [2024-01-05 13:03:03,301][model8_pretrain.py][INFO] Epoch:[0/2](636100/4588595) loss:2.272 lr:0.0000100 epoch_Time:25123.0min: [2024-01-05 13:03:03,301][model8_pretrain.py][INFO] Epoch:[0/2](636100/4588595) loss:2.823 lr:0.0000100 epoch_Time:25123.0min: [2024-01-05 13:03:03,301][model8_pretrain.py][INFO] Epoch:[0/2](636100/4588595) loss:2.933 lr:0.0000100 epoch_Time:25123.0min: [2024-01-05 13:03:03,302][model8_pretrain.py][INFO] Epoch:[0/2](636100/4588595) loss:3.092 lr:0.0000100 epoch_Time:25123.0min: [2024-01-05 13:03:03,302][model8_pretrain.py][INFO] Epoch:[0/2](636100/4588595) loss:3.010 lr:0.0000100 
epoch_Time:25123.0min: [2024-01-05 13:03:40,240][model8_pretrain.py][INFO] Epoch:[0/2](636200/4588595) loss:2.861 lr:0.0000100 epoch_Time:25123.0min: [2024-01-05 13:03:40,240][model8_pretrain.py][INFO] Epoch:[0/2](636200/4588595) loss:2.632 lr:0.0000100 epoch_Time:25123.0min: [2024-01-05 13:03:40,240][model8_pretrain.py][INFO] Epoch:[0/2](636200/4588595) loss:2.961 lr:0.0000100 epoch_Time:25123.0min: [2024-01-05 13:03:40,240][model8_pretrain.py][INFO] Epoch:[0/2](636200/4588595) loss:3.283 lr:0.0000100 epoch_Time:25123.0min: [2024-01-05 13:03:40,240][model8_pretrain.py][INFO] Epoch:[0/2](636200/4588595) loss:2.861 lr:0.0000100 epoch_Time:25123.0min: [2024-01-05 13:03:40,240][model8_pretrain.py][INFO] Epoch:[0/2](636200/4588595) loss:2.441 lr:0.0000100 epoch_Time:25123.0min: [2024-01-05 13:03:40,240][model8_pretrain.py][INFO] Epoch:[0/2](636200/4588595) loss:3.113 lr:0.0000100 epoch_Time:25123.0min: [2024-01-05 13:03:40,241][model8_pretrain.py][INFO] Epoch:[0/2](636200/4588595) loss:3.324 lr:0.0000100 epoch_Time:25123.0min: [2024-01-05 13:04:20,686][model8_pretrain.py][INFO] Epoch:[0/2](636300/4588595) loss:2.938 lr:0.0000100 epoch_Time:25122.0min: [2024-01-05 13:04:20,686][model8_pretrain.py][INFO] Epoch:[0/2](636300/4588595) loss:2.816 lr:0.0000100 epoch_Time:25122.0min: [2024-01-05 13:04:20,686][model8_pretrain.py][INFO] Epoch:[0/2](636300/4588595) loss:2.977 lr:0.0000100 epoch_Time:25122.0min: [2024-01-05 13:04:20,686][model8_pretrain.py][INFO] Epoch:[0/2](636300/4588595) loss:2.830 lr:0.0000100 epoch_Time:25122.0min: [2024-01-05 13:04:20,686][model8_pretrain.py][INFO] Epoch:[0/2](636300/4588595) loss:2.867 lr:0.0000100 epoch_Time:25122.0min: [2024-01-05 13:04:20,686][model8_pretrain.py][INFO] Epoch:[0/2](636300/4588595) loss:2.860 lr:0.0000100 epoch_Time:25122.0min: [2024-01-05 13:04:20,686][model8_pretrain.py][INFO] Epoch:[0/2](636300/4588595) loss:2.491 lr:0.0000100 epoch_Time:25122.0min: [2024-01-05 13:04:20,686][model8_pretrain.py][INFO] 
Epoch:[0/2](636300/4588595) loss:2.787 lr:0.0000100 epoch_Time:25122.0min: [2024-01-05 13:05:06,797][model8_pretrain.py][INFO] Epoch:[0/2](636400/4588595) loss:3.245 lr:0.0000100 epoch_Time:25122.0min: [2024-01-05 13:05:06,797][model8_pretrain.py][INFO] Epoch:[0/2](636400/4588595) loss:3.057 lr:0.0000100 epoch_Time:25122.0min: [2024-01-05 13:05:06,797][model8_pretrain.py][INFO] Epoch:[0/2](636400/4588595) loss:2.733 lr:0.0000100 epoch_Time:25122.0min: [2024-01-05 13:05:06,797][model8_pretrain.py][INFO] Epoch:[0/2](636400/4588595) loss:2.792 lr:0.0000100 epoch_Time:25122.0min: [2024-01-05 13:05:06,797][model8_pretrain.py][INFO] Epoch:[0/2](636400/4588595) loss:3.010 lr:0.0000100 epoch_Time:25122.0min: [2024-01-05 13:05:06,797][model8_pretrain.py][INFO] Epoch:[0/2](636400/4588595) loss:2.275 lr:0.0000100 epoch_Time:25122.0min: [2024-01-05 13:05:06,797][model8_pretrain.py][INFO] Epoch:[0/2](636400/4588595) loss:2.931 lr:0.0000100 epoch_Time:25122.0min: [2024-01-05 13:05:06,797][model8_pretrain.py][INFO] Epoch:[0/2](636400/4588595) loss:3.038 lr:0.0000100 epoch_Time:25122.0min: [2024-01-05 13:05:43,722][model8_pretrain.py][INFO] Epoch:[0/2](636500/4588595) loss:3.046 lr:0.0000100 epoch_Time:25122.0min: [2024-01-05 13:05:43,722][model8_pretrain.py][INFO] Epoch:[0/2](636500/4588595) loss:2.959 lr:0.0000100 epoch_Time:25122.0min: [2024-01-05 13:05:43,722][model8_pretrain.py][INFO] Epoch:[0/2](636500/4588595) loss:2.085 lr:0.0000100 epoch_Time:25122.0min: [2024-01-05 13:05:43,722][model8_pretrain.py][INFO] Epoch:[0/2](636500/4588595) loss:3.185 lr:0.0000100 epoch_Time:25122.0min: [2024-01-05 13:05:43,722][model8_pretrain.py][INFO] Epoch:[0/2](636500/4588595) loss:2.720 lr:0.0000100 epoch_Time:25122.0min: [2024-01-05 13:05:43,722][model8_pretrain.py][INFO] Epoch:[0/2](636500/4588595) loss:2.929 lr:0.0000100 epoch_Time:25122.0min: [2024-01-05 13:05:43,722][model8_pretrain.py][INFO] Epoch:[0/2](636500/4588595) loss:2.600 lr:0.0000100 epoch_Time:25122.0min: [2024-01-05 
13:05:43,722][model8_pretrain.py][INFO] Epoch:[0/2](636500/4588595) loss:3.017 lr:0.0000100 epoch_Time:25122.0min: [2024-01-05 13:06:20,678][model8_pretrain.py][INFO] Epoch:[0/2](636600/4588595) loss:2.478 lr:0.0000100 epoch_Time:25121.0min: [2024-01-05 13:06:20,678][model8_pretrain.py][INFO] Epoch:[0/2](636600/4588595) loss:2.734 lr:0.0000100 epoch_Time:25121.0min: [2024-01-05 13:06:20,678][model8_pretrain.py][INFO] Epoch:[0/2](636600/4588595) loss:3.049 lr:0.0000100 epoch_Time:25121.0min: [2024-01-05 13:06:20,678][model8_pretrain.py][INFO] Epoch:[0/2](636600/4588595) loss:2.248 lr:0.0000100 epoch_Time:25121.0min: [2024-01-05 13:06:20,678][model8_pretrain.py][INFO] Epoch:[0/2](636600/4588595) loss:2.792 lr:0.0000100 epoch_Time:25121.0min: [2024-01-05 13:06:20,678][model8_pretrain.py][INFO] Epoch:[0/2](636600/4588595) loss:3.189 lr:0.0000100 epoch_Time:25121.0min: [2024-01-05 13:06:20,678][model8_pretrain.py][INFO] Epoch:[0/2](636600/4588595) loss:2.302 lr:0.0000100 epoch_Time:25121.0min: [2024-01-05 13:06:20,678][model8_pretrain.py][INFO] Epoch:[0/2](636600/4588595) loss:3.299 lr:0.0000100 epoch_Time:25121.0min: [2024-01-05 13:06:57,625][model8_pretrain.py][INFO] Epoch:[0/2](636700/4588595) loss:3.113 lr:0.0000100 epoch_Time:25120.0min: [2024-01-05 13:06:57,625][model8_pretrain.py][INFO] Epoch:[0/2](636700/4588595) loss:2.894 lr:0.0000100 epoch_Time:25120.0min: [2024-01-05 13:06:57,625][model8_pretrain.py][INFO] Epoch:[0/2](636700/4588595) loss:2.588 lr:0.0000100 epoch_Time:25120.0min: [2024-01-05 13:06:57,625][model8_pretrain.py][INFO] Epoch:[0/2](636700/4588595) loss:3.022 lr:0.0000100 epoch_Time:25120.0min: [2024-01-05 13:06:57,625][model8_pretrain.py][INFO] Epoch:[0/2](636700/4588595) loss:3.468 lr:0.0000100 epoch_Time:25120.0min: [2024-01-05 13:06:57,625][model8_pretrain.py][INFO] Epoch:[0/2](636700/4588595) loss:3.238 lr:0.0000100 epoch_Time:25120.0min: [2024-01-05 13:06:57,625][model8_pretrain.py][INFO] Epoch:[0/2](636700/4588595) loss:3.000 lr:0.0000100 
epoch_Time:25120.0min: [2024-01-05 13:06:57,625][model8_pretrain.py][INFO] Epoch:[0/2](636700/4588595) loss:2.932 lr:0.0000100 epoch_Time:25120.0min: [2024-01-05 13:07:34,569][model8_pretrain.py][INFO] Epoch:[0/2](636800/4588595) loss:2.463 lr:0.0000100 epoch_Time:25120.0min: [2024-01-05 13:07:34,569][model8_pretrain.py][INFO] Epoch:[0/2](636800/4588595) loss:3.112 lr:0.0000100 epoch_Time:25120.0min: [2024-01-05 13:07:34,569][model8_pretrain.py][INFO] Epoch:[0/2](636800/4588595) loss:2.907 lr:0.0000100 epoch_Time:25120.0min: [2024-01-05 13:07:34,569][model8_pretrain.py][INFO] Epoch:[0/2](636800/4588595) loss:1.720 lr:0.0000100 epoch_Time:25120.0min: [2024-01-05 13:07:34,569][model8_pretrain.py][INFO] Epoch:[0/2](636800/4588595) loss:3.095 lr:0.0000100 epoch_Time:25120.0min: [2024-01-05 13:07:34,569][model8_pretrain.py][INFO] Epoch:[0/2](636800/4588595) loss:2.924 lr:0.0000100 epoch_Time:25120.0min: [2024-01-05 13:07:34,569][model8_pretrain.py][INFO] Epoch:[0/2](636800/4588595) loss:2.545 lr:0.0000100 epoch_Time:25120.0min: [2024-01-05 13:07:34,569][model8_pretrain.py][INFO] Epoch:[0/2](636800/4588595) loss:2.642 lr:0.0000100 epoch_Time:25120.0min: [2024-01-05 13:08:11,517][model8_pretrain.py][INFO] Epoch:[0/2](636900/4588595) loss:3.034 lr:0.0000100 epoch_Time:25118.0min: [2024-01-05 13:08:11,517][model8_pretrain.py][INFO] Epoch:[0/2](636900/4588595) loss:2.970 lr:0.0000100 epoch_Time:25118.0min: [2024-01-05 13:08:11,517][model8_pretrain.py][INFO] Epoch:[0/2](636900/4588595) loss:3.123 lr:0.0000100 epoch_Time:25118.0min: [2024-01-05 13:08:11,517][model8_pretrain.py][INFO] Epoch:[0/2](636900/4588595) loss:3.387 lr:0.0000100 epoch_Time:25118.0min: [2024-01-05 13:08:11,517][model8_pretrain.py][INFO] Epoch:[0/2](636900/4588595) loss:2.896 lr:0.0000100 epoch_Time:25118.0min: [2024-01-05 13:08:11,517][model8_pretrain.py][INFO] Epoch:[0/2](636900/4588595) loss:2.153 lr:0.0000100 epoch_Time:25118.0min: [2024-01-05 13:08:11,517][model8_pretrain.py][INFO] 
Epoch:[0/2](636900/4588595) loss:3.136 lr:0.0000100 epoch_Time:25118.0min: [2024-01-05 13:08:11,517][model8_pretrain.py][INFO] Epoch:[0/2](636900/4588595) loss:2.736 lr:0.0000100 epoch_Time:25118.0min: [2024-01-05 13:08:48,461][model8_pretrain.py][INFO] Epoch:[0/2](637000/4588595) loss:2.877 lr:0.0000100 epoch_Time:25117.0min: [2024-01-05 13:08:48,461][model8_pretrain.py][INFO] Epoch:[0/2](637000/4588595) loss:2.795 lr:0.0000100 epoch_Time:25117.0min: [2024-01-05 13:08:48,461][model8_pretrain.py][INFO] Epoch:[0/2](637000/4588595) loss:2.890 lr:0.0000100 epoch_Time:25117.0min: [2024-01-05 13:08:48,461][model8_pretrain.py][INFO] Epoch:[0/2](637000/4588595) loss:2.877 lr:0.0000100 epoch_Time:25117.0min: [2024-01-05 13:08:48,461][model8_pretrain.py][INFO] Epoch:[0/2](637000/4588595) loss:2.948 lr:0.0000100 epoch_Time:25117.0min: [2024-01-05 13:08:48,461][model8_pretrain.py][INFO] Epoch:[0/2](637000/4588595) loss:3.333 lr:0.0000100 epoch_Time:25117.0min: [2024-01-05 13:08:48,461][model8_pretrain.py][INFO] Epoch:[0/2](637000/4588595) loss:3.215 lr:0.0000100 epoch_Time:25117.0min: [2024-01-05 13:08:48,462][model8_pretrain.py][INFO] Epoch:[0/2](637000/4588595) loss:2.100 lr:0.0000100 epoch_Time:25117.0min: [2024-01-05 13:09:27,138][model8_pretrain.py][INFO] Epoch:[0/2](637100/4588595) loss:2.967 lr:0.0000100 epoch_Time:25117.0min: [2024-01-05 13:09:27,138][model8_pretrain.py][INFO] Epoch:[0/2](637100/4588595) loss:2.898 lr:0.0000100 epoch_Time:25117.0min: [2024-01-05 13:09:27,138][model8_pretrain.py][INFO] Epoch:[0/2](637100/4588595) loss:2.984 lr:0.0000100 epoch_Time:25117.0min: [2024-01-05 13:09:27,138][model8_pretrain.py][INFO] Epoch:[0/2](637100/4588595) loss:2.963 lr:0.0000100 epoch_Time:25117.0min: [2024-01-05 13:09:27,138][model8_pretrain.py][INFO] Epoch:[0/2](637100/4588595) loss:2.973 lr:0.0000100 epoch_Time:25117.0min: [2024-01-05 13:09:27,138][model8_pretrain.py][INFO] Epoch:[0/2](637100/4588595) loss:3.103 lr:0.0000100 epoch_Time:25117.0min: [2024-01-05 
13:09:27,138][model8_pretrain.py][INFO] Epoch:[0/2](637100/4588595) loss:2.474 lr:0.0000100 epoch_Time:25117.0min: [2024-01-05 13:09:27,138][model8_pretrain.py][INFO] Epoch:[0/2](637100/4588595) loss:2.129 lr:0.0000100 epoch_Time:25117.0min: [2024-01-05 13:10:14,332][model8_pretrain.py][INFO] Epoch:[0/2](637200/4588595) loss:2.889 lr:0.0000100 epoch_Time:25117.0min: [2024-01-05 13:10:14,332][model8_pretrain.py][INFO] Epoch:[0/2](637200/4588595) loss:3.035 lr:0.0000100 epoch_Time:25117.0min: [2024-01-05 13:10:14,332][model8_pretrain.py][INFO] Epoch:[0/2](637200/4588595) loss:2.753 lr:0.0000100 epoch_Time:25117.0min: [2024-01-05 13:10:14,332][model8_pretrain.py][INFO] Epoch:[0/2](637200/4588595) loss:3.313 lr:0.0000100 epoch_Time:25117.0min: [2024-01-05 13:10:14,332][model8_pretrain.py][INFO] Epoch:[0/2](637200/4588595) loss:3.089 lr:0.0000100 epoch_Time:25117.0min: [2024-01-05 13:10:14,332][model8_pretrain.py][INFO] Epoch:[0/2](637200/4588595) loss:2.105 lr:0.0000100 epoch_Time:25117.0min: [2024-01-05 13:10:14,332][model8_pretrain.py][INFO] Epoch:[0/2](637200/4588595) loss:2.949 lr:0.0000100 epoch_Time:25117.0min: [2024-01-05 13:10:14,332][model8_pretrain.py][INFO] Epoch:[0/2](637200/4588595) loss:2.837 lr:0.0000100 epoch_Time:25117.0min: [2024-01-05 13:10:51,269][model8_pretrain.py][INFO] Epoch:[0/2](637300/4588595) loss:3.065 lr:0.0000100 epoch_Time:25116.0min: [2024-01-05 13:10:51,269][model8_pretrain.py][INFO] Epoch:[0/2](637300/4588595) loss:2.845 lr:0.0000100 epoch_Time:25116.0min: [2024-01-05 13:10:51,269][model8_pretrain.py][INFO] Epoch:[0/2](637300/4588595) loss:2.922 lr:0.0000100 epoch_Time:25116.0min: [2024-01-05 13:10:51,269][model8_pretrain.py][INFO] Epoch:[0/2](637300/4588595) loss:3.219 lr:0.0000100 epoch_Time:25116.0min: [2024-01-05 13:10:51,269][model8_pretrain.py][INFO] Epoch:[0/2](637300/4588595) loss:3.236 lr:0.0000100 epoch_Time:25116.0min: [2024-01-05 13:10:51,269][model8_pretrain.py][INFO] Epoch:[0/2](637300/4588595) loss:2.634 lr:0.0000100 
epoch_Time:25116.0min: [2024-01-05 13:10:51,269][model8_pretrain.py][INFO] Epoch:[0/2](637300/4588595) loss:2.640 lr:0.0000100 epoch_Time:25116.0min: [2024-01-05 13:10:51,269][model8_pretrain.py][INFO] Epoch:[0/2](637300/4588595) loss:2.673 lr:0.0000100 epoch_Time:25116.0min: [2024-01-05 13:11:28,205][model8_pretrain.py][INFO] Epoch:[0/2](637400/4588595) loss:2.954 lr:0.0000100 epoch_Time:25116.0min: [2024-01-05 13:11:28,205][model8_pretrain.py][INFO] Epoch:[0/2](637400/4588595) loss:2.905 lr:0.0000100 epoch_Time:25116.0min: [2024-01-05 13:11:28,205][model8_pretrain.py][INFO] Epoch:[0/2](637400/4588595) loss:2.572 lr:0.0000100 epoch_Time:25116.0min: [2024-01-05 13:11:28,205][model8_pretrain.py][INFO] Epoch:[0/2](637400/4588595) loss:2.854 lr:0.0000100 epoch_Time:25116.0min: [2024-01-05 13:11:28,205][model8_pretrain.py][INFO] Epoch:[0/2](637400/4588595) loss:2.568 lr:0.0000100 epoch_Time:25116.0min: [2024-01-05 13:11:28,205][model8_pretrain.py][INFO] Epoch:[0/2](637400/4588595) loss:2.473 lr:0.0000100 epoch_Time:25116.0min: [2024-01-05 13:11:28,205][model8_pretrain.py][INFO] Epoch:[0/2](637400/4588595) loss:2.385 lr:0.0000100 epoch_Time:25116.0min: [2024-01-05 13:11:28,205][model8_pretrain.py][INFO] Epoch:[0/2](637400/4588595) loss:2.983 lr:0.0000100 epoch_Time:25116.0min: [2024-01-05 13:12:05,146][model8_pretrain.py][INFO] Epoch:[0/2](637500/4588595) loss:2.698 lr:0.0000100 epoch_Time:25115.0min: [2024-01-05 13:12:05,146][model8_pretrain.py][INFO] Epoch:[0/2](637500/4588595) loss:2.772 lr:0.0000100 epoch_Time:25115.0min: [2024-01-05 13:12:05,146][model8_pretrain.py][INFO] Epoch:[0/2](637500/4588595) loss:2.551 lr:0.0000100 epoch_Time:25115.0min: [2024-01-05 13:12:05,146][model8_pretrain.py][INFO] Epoch:[0/2](637500/4588595) loss:2.650 lr:0.0000100 epoch_Time:25115.0min: [2024-01-05 13:12:05,146][model8_pretrain.py][INFO] Epoch:[0/2](637500/4588595) loss:3.102 lr:0.0000100 epoch_Time:25115.0min: [2024-01-05 13:12:05,146][model8_pretrain.py][INFO] 
Epoch:[0/2](637500/4588595) loss:3.071 lr:0.0000100 epoch_Time:25115.0min: [2024-01-05 13:12:05,146][model8_pretrain.py][INFO] Epoch:[0/2](637500/4588595) loss:3.002 lr:0.0000100 epoch_Time:25115.0min: [2024-01-05 13:12:05,146][model8_pretrain.py][INFO] Epoch:[0/2](637500/4588595) loss:2.925 lr:0.0000100 epoch_Time:25115.0min: [2024-01-05 13:12:42,084][model8_pretrain.py][INFO] Epoch:[0/2](637600/4588595) loss:2.910 lr:0.0000100 epoch_Time:25115.0min: [2024-01-05 13:12:42,084][model8_pretrain.py][INFO] Epoch:[0/2](637600/4588595) loss:2.932 lr:0.0000100 epoch_Time:25115.0min: [2024-01-05 13:12:42,084][model8_pretrain.py][INFO] Epoch:[0/2](637600/4588595) loss:2.895 lr:0.0000100 epoch_Time:25115.0min: [2024-01-05 13:12:42,084][model8_pretrain.py][INFO] Epoch:[0/2](637600/4588595) loss:2.897 lr:0.0000100 epoch_Time:25115.0min: [2024-01-05 13:12:42,084][model8_pretrain.py][INFO] Epoch:[0/2](637600/4588595) loss:2.828 lr:0.0000100 epoch_Time:25115.0min: [2024-01-05 13:12:42,084][model8_pretrain.py][INFO] Epoch:[0/2](637600/4588595) loss:2.959 lr:0.0000100 epoch_Time:25115.0min: [2024-01-05 13:12:42,084][model8_pretrain.py][INFO] Epoch:[0/2](637600/4588595) loss:2.727 lr:0.0000100 epoch_Time:25115.0min: [2024-01-05 13:12:42,084][model8_pretrain.py][INFO] Epoch:[0/2](637600/4588595) loss:2.916 lr:0.0000100 epoch_Time:25115.0min: [2024-01-05 13:13:19,022][model8_pretrain.py][INFO] Epoch:[0/2](637700/4588595) loss:3.041 lr:0.0000100 epoch_Time:25114.0min: [2024-01-05 13:13:19,022][model8_pretrain.py][INFO] Epoch:[0/2](637700/4588595) loss:2.498 lr:0.0000100 epoch_Time:25114.0min: [2024-01-05 13:13:19,022][model8_pretrain.py][INFO] Epoch:[0/2](637700/4588595) loss:2.700 lr:0.0000100 epoch_Time:25114.0min: [2024-01-05 13:13:19,022][model8_pretrain.py][INFO] Epoch:[0/2](637700/4588595) loss:2.569 lr:0.0000100 epoch_Time:25114.0min: [2024-01-05 13:13:19,022][model8_pretrain.py][INFO] Epoch:[0/2](637700/4588595) loss:3.082 lr:0.0000100 epoch_Time:25114.0min: [2024-01-05 
13:13:19,022][model8_pretrain.py][INFO] Epoch:[0/2](637700/4588595) loss:2.788 lr:0.0000100 epoch_Time:25114.0min: [2024-01-05 13:13:19,022][model8_pretrain.py][INFO] Epoch:[0/2](637700/4588595) loss:2.675 lr:0.0000100 epoch_Time:25114.0min: [2024-01-05 13:13:19,022][model8_pretrain.py][INFO] Epoch:[0/2](637700/4588595) loss:2.728 lr:0.0000100 epoch_Time:25114.0min: [2024-01-05 13:13:55,950][model8_pretrain.py][INFO] Epoch:[0/2](637800/4588595) loss:2.542 lr:0.0000100 epoch_Time:25113.0min: [2024-01-05 13:13:55,950][model8_pretrain.py][INFO] Epoch:[0/2](637800/4588595) loss:2.924 lr:0.0000100 epoch_Time:25113.0min: [2024-01-05 13:13:55,950][model8_pretrain.py][INFO] Epoch:[0/2](637800/4588595) loss:2.851 lr:0.0000100 epoch_Time:25113.0min: [2024-01-05 13:13:55,950][model8_pretrain.py][INFO] Epoch:[0/2](637800/4588595) loss:2.515 lr:0.0000100 epoch_Time:25113.0min: [2024-01-05 13:13:55,950][model8_pretrain.py][INFO] Epoch:[0/2](637800/4588595) loss:2.877 lr:0.0000100 epoch_Time:25113.0min: [2024-01-05 13:13:55,950][model8_pretrain.py][INFO] Epoch:[0/2](637800/4588595) loss:2.492 lr:0.0000100 epoch_Time:25113.0min: [2024-01-05 13:13:55,950][model8_pretrain.py][INFO] Epoch:[0/2](637800/4588595) loss:3.094 lr:0.0000100 epoch_Time:25113.0min: [2024-01-05 13:13:55,950][model8_pretrain.py][INFO] Epoch:[0/2](637800/4588595) loss:2.729 lr:0.0000100 epoch_Time:25113.0min: [2024-01-05 13:14:34,597][model8_pretrain.py][INFO] Epoch:[0/2](637900/4588595) loss:2.581 lr:0.0000100 epoch_Time:25113.0min: [2024-01-05 13:14:34,597][model8_pretrain.py][INFO] Epoch:[0/2](637900/4588595) loss:2.240 lr:0.0000100 epoch_Time:25113.0min: [2024-01-05 13:14:34,597][model8_pretrain.py][INFO] Epoch:[0/2](637900/4588595) loss:2.789 lr:0.0000100 epoch_Time:25113.0min: [2024-01-05 13:14:34,597][model8_pretrain.py][INFO] Epoch:[0/2](637900/4588595) loss:2.319 lr:0.0000100 epoch_Time:25113.0min: [2024-01-05 13:14:34,598][model8_pretrain.py][INFO] Epoch:[0/2](637900/4588595) loss:2.500 lr:0.0000100 
epoch_Time:25113.0min: [2024-01-05 13:14:34,597][model8_pretrain.py][INFO] Epoch:[0/2](637900/4588595) loss:2.921 lr:0.0000100 epoch_Time:25113.0min: [2024-01-05 13:14:34,598][model8_pretrain.py][INFO] Epoch:[0/2](637900/4588595) loss:2.943 lr:0.0000100 epoch_Time:25113.0min: [2024-01-05 13:14:34,598][model8_pretrain.py][INFO] Epoch:[0/2](637900/4588595) loss:2.788 lr:0.0000100 epoch_Time:25113.0min: [2024-01-05 13:15:22,004][model8_pretrain.py][INFO] Epoch:[0/2](638000/4588595) loss:2.996 lr:0.0000100 epoch_Time:25113.0min: [2024-01-05 13:15:22,004][model8_pretrain.py][INFO] Epoch:[0/2](638000/4588595) loss:2.653 lr:0.0000100 epoch_Time:25113.0min: [2024-01-05 13:15:22,004][model8_pretrain.py][INFO] Epoch:[0/2](638000/4588595) loss:2.590 lr:0.0000100 epoch_Time:25113.0min: [2024-01-05 13:15:22,004][model8_pretrain.py][INFO] Epoch:[0/2](638000/4588595) loss:2.848 lr:0.0000100 epoch_Time:25113.0min: [2024-01-05 13:15:22,004][model8_pretrain.py][INFO] Epoch:[0/2](638000/4588595) loss:2.341 lr:0.0000100 epoch_Time:25113.0min: [2024-01-05 13:15:22,004][model8_pretrain.py][INFO] Epoch:[0/2](638000/4588595) loss:2.872 lr:0.0000100 epoch_Time:25113.0min: [2024-01-05 13:15:22,004][model8_pretrain.py][INFO] Epoch:[0/2](638000/4588595) loss:3.289 lr:0.0000100 epoch_Time:25113.0min: [2024-01-05 13:15:22,004][model8_pretrain.py][INFO] Epoch:[0/2](638000/4588595) loss:1.836 lr:0.0000100 epoch_Time:25113.0min: [2024-01-05 13:15:58,941][model8_pretrain.py][INFO] Epoch:[0/2](638100/4588595) loss:3.159 lr:0.0000100 epoch_Time:25112.0min: [2024-01-05 13:15:58,941][model8_pretrain.py][INFO] Epoch:[0/2](638100/4588595) loss:2.982 lr:0.0000100 epoch_Time:25112.0min: [2024-01-05 13:15:58,941][model8_pretrain.py][INFO] Epoch:[0/2](638100/4588595) loss:3.439 lr:0.0000100 epoch_Time:25112.0min: [2024-01-05 13:15:58,941][model8_pretrain.py][INFO] Epoch:[0/2](638100/4588595) loss:2.666 lr:0.0000100 epoch_Time:25112.0min: [2024-01-05 13:15:58,941][model8_pretrain.py][INFO] 
Epoch:[0/2](638100/4588595) loss:3.340 lr:0.0000100 epoch_Time:25112.0min: [2024-01-05 13:15:58,941][model8_pretrain.py][INFO] Epoch:[0/2](638100/4588595) loss:2.118 lr:0.0000100 epoch_Time:25112.0min: [2024-01-05 13:15:58,941][model8_pretrain.py][INFO] Epoch:[0/2](638100/4588595) loss:2.899 lr:0.0000100 epoch_Time:25112.0min: [2024-01-05 13:15:58,941][model8_pretrain.py][INFO] Epoch:[0/2](638100/4588595) loss:3.000 lr:0.0000100 epoch_Time:25112.0min: [2024-01-05 13:16:35,884][model8_pretrain.py][INFO] Epoch:[0/2](638200/4588595) loss:2.574 lr:0.0000100 epoch_Time:25111.0min: [2024-01-05 13:16:35,884][model8_pretrain.py][INFO] Epoch:[0/2](638200/4588595) loss:2.952 lr:0.0000100 epoch_Time:25111.0min: [2024-01-05 13:16:35,884][model8_pretrain.py][INFO] Epoch:[0/2](638200/4588595) loss:2.887 lr:0.0000100 epoch_Time:25111.0min: [2024-01-05 13:16:35,884][model8_pretrain.py][INFO] Epoch:[0/2](638200/4588595) loss:3.031 lr:0.0000100 epoch_Time:25111.0min: [2024-01-05 13:16:35,884][model8_pretrain.py][INFO] Epoch:[0/2](638200/4588595) loss:3.096 lr:0.0000100 epoch_Time:25111.0min: [2024-01-05 13:16:35,884][model8_pretrain.py][INFO] Epoch:[0/2](638200/4588595) loss:2.977 lr:0.0000100 epoch_Time:25111.0min: [2024-01-05 13:16:35,884][model8_pretrain.py][INFO] Epoch:[0/2](638200/4588595) loss:3.376 lr:0.0000100 epoch_Time:25111.0min: [2024-01-05 13:16:35,884][model8_pretrain.py][INFO] Epoch:[0/2](638200/4588595) loss:1.794 lr:0.0000100 epoch_Time:25111.0min: [2024-01-05 13:17:12,839][model8_pretrain.py][INFO] Epoch:[0/2](638300/4588595) loss:2.870 lr:0.0000100 epoch_Time:25110.0min: [2024-01-05 13:17:12,839][model8_pretrain.py][INFO] Epoch:[0/2](638300/4588595) loss:2.845 lr:0.0000100 epoch_Time:25110.0min: [2024-01-05 13:17:12,839][model8_pretrain.py][INFO] Epoch:[0/2](638300/4588595) loss:2.428 lr:0.0000100 epoch_Time:25110.0min: [2024-01-05 13:17:12,839][model8_pretrain.py][INFO] Epoch:[0/2](638300/4588595) loss:2.989 lr:0.0000100 epoch_Time:25110.0min: [2024-01-05 
13:17:12,839][model8_pretrain.py][INFO] Epoch:[0/2](638300/4588595) loss:2.885 lr:0.0000100 epoch_Time:25110.0min: [2024-01-05 13:17:12,839][model8_pretrain.py][INFO] Epoch:[0/2](638300/4588595) loss:3.458 lr:0.0000100 epoch_Time:25110.0min: [2024-01-05 13:17:12,839][model8_pretrain.py][INFO] Epoch:[0/2](638300/4588595) loss:2.914 lr:0.0000100 epoch_Time:25110.0min: [2024-01-05 13:17:12,839][model8_pretrain.py][INFO] Epoch:[0/2](638300/4588595) loss:3.073 lr:0.0000100 epoch_Time:25110.0min: [2024-01-05 13:17:49,784][model8_pretrain.py][INFO] Epoch:[0/2](638400/4588595) loss:3.180 lr:0.0000100 epoch_Time:25109.0min: [2024-01-05 13:17:49,784][model8_pretrain.py][INFO] Epoch:[0/2](638400/4588595) loss:2.601 lr:0.0000100 epoch_Time:25109.0min: [2024-01-05 13:17:49,784][model8_pretrain.py][INFO] Epoch:[0/2](638400/4588595) loss:3.010 lr:0.0000100 epoch_Time:25109.0min: [2024-01-05 13:17:49,784][model8_pretrain.py][INFO] Epoch:[0/2](638400/4588595) loss:2.838 lr:0.0000100 epoch_Time:25109.0min: [2024-01-05 13:17:49,784][model8_pretrain.py][INFO] Epoch:[0/2](638400/4588595) loss:3.321 lr:0.0000100 epoch_Time:25109.0min: [2024-01-05 13:17:49,784][model8_pretrain.py][INFO] Epoch:[0/2](638400/4588595) loss:2.993 lr:0.0000100 epoch_Time:25109.0min: [2024-01-05 13:17:49,784][model8_pretrain.py][INFO] Epoch:[0/2](638400/4588595) loss:2.578 lr:0.0000100 epoch_Time:25109.0min: [2024-01-05 13:17:49,784][model8_pretrain.py][INFO] Epoch:[0/2](638400/4588595) loss:2.502 lr:0.0000100 epoch_Time:25109.0min: [2024-01-05 13:18:26,728][model8_pretrain.py][INFO] Epoch:[0/2](638500/4588595) loss:2.224 lr:0.0000100 epoch_Time:25109.0min: [2024-01-05 13:18:26,728][model8_pretrain.py][INFO] Epoch:[0/2](638500/4588595) loss:2.657 lr:0.0000100 epoch_Time:25109.0min: [2024-01-05 13:18:26,728][model8_pretrain.py][INFO] Epoch:[0/2](638500/4588595) loss:2.924 lr:0.0000100 epoch_Time:25109.0min: [2024-01-05 13:18:26,728][model8_pretrain.py][INFO] Epoch:[0/2](638500/4588595) loss:2.493 lr:0.0000100 
epoch_Time:25109.0min: [2024-01-05 13:18:26,728][model8_pretrain.py][INFO] Epoch:[0/2](638500/4588595) loss:3.122 lr:0.0000100 epoch_Time:25109.0min: [2024-01-05 13:18:26,728][model8_pretrain.py][INFO] Epoch:[0/2](638500/4588595) loss:3.085 lr:0.0000100 epoch_Time:25109.0min: [2024-01-05 13:18:26,729][model8_pretrain.py][INFO] Epoch:[0/2](638500/4588595) loss:2.954 lr:0.0000100 epoch_Time:25109.0min: [2024-01-05 13:18:26,729][model8_pretrain.py][INFO] Epoch:[0/2](638500/4588595) loss:2.793 lr:0.0000100 epoch_Time:25109.0min: [2024-01-05 13:19:03,678][model8_pretrain.py][INFO] Epoch:[0/2](638600/4588595) loss:2.432 lr:0.0000100 epoch_Time:25108.0min: [2024-01-05 13:19:03,678][model8_pretrain.py][INFO] Epoch:[0/2](638600/4588595) loss:3.044 lr:0.0000100 epoch_Time:25108.0min: [2024-01-05 13:19:03,678][model8_pretrain.py][INFO] Epoch:[0/2](638600/4588595) loss:2.224 lr:0.0000100 epoch_Time:25108.0min: [2024-01-05 13:19:03,678][model8_pretrain.py][INFO] Epoch:[0/2](638600/4588595) loss:2.908 lr:0.0000100 epoch_Time:25108.0min: [2024-01-05 13:19:03,679][model8_pretrain.py][INFO] Epoch:[0/2](638600/4588595) loss:3.012 lr:0.0000100 epoch_Time:25108.0min: [2024-01-05 13:19:03,679][model8_pretrain.py][INFO] Epoch:[0/2](638600/4588595) loss:2.894 lr:0.0000100 epoch_Time:25108.0min: [2024-01-05 13:19:03,679][model8_pretrain.py][INFO] Epoch:[0/2](638600/4588595) loss:2.941 lr:0.0000100 epoch_Time:25108.0min: [2024-01-05 13:19:03,679][model8_pretrain.py][INFO] Epoch:[0/2](638600/4588595) loss:3.479 lr:0.0000100 epoch_Time:25108.0min: [2024-01-05 13:19:40,625][model8_pretrain.py][INFO] Epoch:[0/2](638700/4588595) loss:3.288 lr:0.0000100 epoch_Time:25108.0min: [2024-01-05 13:19:40,625][model8_pretrain.py][INFO] Epoch:[0/2](638700/4588595) loss:2.444 lr:0.0000100 epoch_Time:25108.0min: [2024-01-05 13:19:40,625][model8_pretrain.py][INFO] Epoch:[0/2](638700/4588595) loss:2.987 lr:0.0000100 epoch_Time:25108.0min: [2024-01-05 13:19:40,625][model8_pretrain.py][INFO] 
Epoch:[0/2](638700/4588595) loss:2.693 lr:0.0000100 epoch_Time:25108.0min: [2024-01-05 13:19:40,625][model8_pretrain.py][INFO] Epoch:[0/2](638700/4588595) loss:2.852 lr:0.0000100 epoch_Time:25108.0min: [2024-01-05 13:19:40,626][model8_pretrain.py][INFO] Epoch:[0/2](638700/4588595) loss:3.047 lr:0.0000100 epoch_Time:25108.0min: [2024-01-05 13:19:40,626][model8_pretrain.py][INFO] Epoch:[0/2](638700/4588595) loss:2.770 lr:0.0000100 epoch_Time:25108.0min: [2024-01-05 13:19:42,327][model8_pretrain.py][INFO] Epoch:[0/2](638700/4588595) loss:3.101 lr:0.0000100 epoch_Time:25108.0min: [2024-01-05 13:20:29,813][model8_pretrain.py][INFO] Epoch:[0/2](638800/4588595) loss:2.236 lr:0.0000100 epoch_Time:25108.0min: [2024-01-05 13:20:29,813][model8_pretrain.py][INFO] Epoch:[0/2](638800/4588595) loss:2.984 lr:0.0000100 epoch_Time:25108.0min: [2024-01-05 13:20:29,813][model8_pretrain.py][INFO] Epoch:[0/2](638800/4588595) loss:2.998 lr:0.0000100 epoch_Time:25108.0min: [2024-01-05 13:20:29,813][model8_pretrain.py][INFO] Epoch:[0/2](638800/4588595) loss:2.600 lr:0.0000100 epoch_Time:25108.0min: [2024-01-05 13:20:29,813][model8_pretrain.py][INFO] Epoch:[0/2](638800/4588595) loss:2.667 lr:0.0000100 epoch_Time:25108.0min: [2024-01-05 13:20:29,813][model8_pretrain.py][INFO] Epoch:[0/2](638800/4588595) loss:2.756 lr:0.0000100 epoch_Time:25108.0min: [2024-01-05 13:20:29,813][model8_pretrain.py][INFO] Epoch:[0/2](638800/4588595) loss:3.121 lr:0.0000100 epoch_Time:25108.0min: [2024-01-05 13:20:29,813][model8_pretrain.py][INFO] Epoch:[0/2](638800/4588595) loss:2.764 lr:0.0000100 epoch_Time:25108.0min: [2024-01-05 13:21:06,722][model8_pretrain.py][INFO] Epoch:[0/2](638900/4588595) loss:2.753 lr:0.0000100 epoch_Time:25107.0min: [2024-01-05 13:21:06,722][model8_pretrain.py][INFO] Epoch:[0/2](638900/4588595) loss:2.833 lr:0.0000100 epoch_Time:25107.0min: [2024-01-05 13:21:06,722][model8_pretrain.py][INFO] Epoch:[0/2](638900/4588595) loss:2.567 lr:0.0000100 epoch_Time:25107.0min: [2024-01-05 
13:21:06,722][model8_pretrain.py][INFO] Epoch:[0/2](638900/4588595) loss:3.070 lr:0.0000100 epoch_Time:25107.0min: [2024-01-05 13:21:06,722][model8_pretrain.py][INFO] Epoch:[0/2](638900/4588595) loss:2.435 lr:0.0000100 epoch_Time:25107.0min: [2024-01-05 13:21:06,722][model8_pretrain.py][INFO] Epoch:[0/2](638900/4588595) loss:2.207 lr:0.0000100 epoch_Time:25107.0min: [2024-01-05 13:21:06,722][model8_pretrain.py][INFO] Epoch:[0/2](638900/4588595) loss:3.042 lr:0.0000100 epoch_Time:25107.0min: [2024-01-05 13:21:06,722][model8_pretrain.py][INFO] Epoch:[0/2](638900/4588595) loss:3.234 lr:0.0000100 epoch_Time:25107.0min: [2024-01-05 13:21:43,675][model8_pretrain.py][INFO] Epoch:[0/2](639000/4588595) loss:3.134 lr:0.0000100 epoch_Time:25107.0min: [2024-01-05 13:21:43,675][model8_pretrain.py][INFO] Epoch:[0/2](639000/4588595) loss:3.268 lr:0.0000100 epoch_Time:25107.0min: [2024-01-05 13:21:43,675][model8_pretrain.py][INFO] Epoch:[0/2](639000/4588595) loss:2.887 lr:0.0000100 epoch_Time:25107.0min: [2024-01-05 13:21:43,675][model8_pretrain.py][INFO] Epoch:[0/2](639000/4588595) loss:3.020 lr:0.0000100 epoch_Time:25107.0min: [2024-01-05 13:21:43,675][model8_pretrain.py][INFO] Epoch:[0/2](639000/4588595) loss:2.829 lr:0.0000100 epoch_Time:25107.0min: [2024-01-05 13:21:43,675][model8_pretrain.py][INFO] Epoch:[0/2](639000/4588595) loss:2.866 lr:0.0000100 epoch_Time:25107.0min: [2024-01-05 13:21:43,675][model8_pretrain.py][INFO] Epoch:[0/2](639000/4588595) loss:2.982 lr:0.0000100 epoch_Time:25107.0min: [2024-01-05 13:21:43,675][model8_pretrain.py][INFO] Epoch:[0/2](639000/4588595) loss:3.364 lr:0.0000100 epoch_Time:25107.0min: [2024-01-05 13:22:20,616][model8_pretrain.py][INFO] Epoch:[0/2](639100/4588595) loss:2.979 lr:0.0000100 epoch_Time:25106.0min: [2024-01-05 13:22:20,616][model8_pretrain.py][INFO] Epoch:[0/2](639100/4588595) loss:2.935 lr:0.0000100 epoch_Time:25106.0min: [2024-01-05 13:22:20,616][model8_pretrain.py][INFO] Epoch:[0/2](639100/4588595) loss:2.880 lr:0.0000100 
epoch_Time:25106.0min: [2024-01-05 13:22:20,616][model8_pretrain.py][INFO] Epoch:[0/2](639100/4588595) loss:2.922 lr:0.0000100 epoch_Time:25106.0min: [2024-01-05 13:22:20,616][model8_pretrain.py][INFO] Epoch:[0/2](639100/4588595) loss:2.536 lr:0.0000100 epoch_Time:25106.0min: [2024-01-05 13:22:20,616][model8_pretrain.py][INFO] Epoch:[0/2](639100/4588595) loss:1.968 lr:0.0000100 epoch_Time:25106.0min: [2024-01-05 13:22:20,616][model8_pretrain.py][INFO] Epoch:[0/2](639100/4588595) loss:2.591 lr:0.0000100 epoch_Time:25106.0min: [2024-01-05 13:22:20,616][model8_pretrain.py][INFO] Epoch:[0/2](639100/4588595) loss:2.107 lr:0.0000100 epoch_Time:25106.0min: [2024-01-05 13:22:57,551][model8_pretrain.py][INFO] Epoch:[0/2](639200/4588595) loss:2.463 lr:0.0000100 epoch_Time:25105.0min: [2024-01-05 13:22:57,551][model8_pretrain.py][INFO] Epoch:[0/2](639200/4588595) loss:3.366 lr:0.0000100 epoch_Time:25105.0min: [2024-01-05 13:22:57,551][model8_pretrain.py][INFO] Epoch:[0/2](639200/4588595) loss:2.559 lr:0.0000100 epoch_Time:25105.0min: [2024-01-05 13:22:57,551][model8_pretrain.py][INFO] Epoch:[0/2](639200/4588595) loss:2.213 lr:0.0000100 epoch_Time:25105.0min: [2024-01-05 13:22:57,551][model8_pretrain.py][INFO] Epoch:[0/2](639200/4588595) loss:2.569 lr:0.0000100 epoch_Time:25105.0min: [2024-01-05 13:22:57,551][model8_pretrain.py][INFO] Epoch:[0/2](639200/4588595) loss:3.224 lr:0.0000100 epoch_Time:25105.0min: [2024-01-05 13:22:57,551][model8_pretrain.py][INFO] Epoch:[0/2](639200/4588595) loss:3.064 lr:0.0000100 epoch_Time:25105.0min: [2024-01-05 13:22:57,551][model8_pretrain.py][INFO] Epoch:[0/2](639200/4588595) loss:2.721 lr:0.0000100 epoch_Time:25105.0min: [2024-01-05 13:23:34,528][model8_pretrain.py][INFO] Epoch:[0/2](639300/4588595) loss:2.651 lr:0.0000100 epoch_Time:25104.0min: [2024-01-05 13:23:34,528][model8_pretrain.py][INFO] Epoch:[0/2](639300/4588595) loss:2.184 lr:0.0000100 epoch_Time:25104.0min: [2024-01-05 13:23:34,528][model8_pretrain.py][INFO] 
Epoch:[0/2](639300/4588595) loss:3.034 lr:0.0000100 epoch_Time:25104.0min: [2024-01-05 13:23:34,529][model8_pretrain.py][INFO] Epoch:[0/2](639300/4588595) loss:2.908 lr:0.0000100 epoch_Time:25104.0min: [2024-01-05 13:23:34,529][model8_pretrain.py][INFO] Epoch:[0/2](639300/4588595) loss:3.288 lr:0.0000100 epoch_Time:25104.0min: [2024-01-05 13:23:34,529][model8_pretrain.py][INFO] Epoch:[0/2](639300/4588595) loss:2.381 lr:0.0000100 epoch_Time:25104.0min: [2024-01-05 13:23:34,529][model8_pretrain.py][INFO] Epoch:[0/2](639300/4588595) loss:2.471 lr:0.0000100 epoch_Time:25104.0min: [2024-01-05 13:23:34,529][model8_pretrain.py][INFO] Epoch:[0/2](639300/4588595) loss:3.407 lr:0.0000100 epoch_Time:25104.0min: [2024-01-05 13:24:11,473][model8_pretrain.py][INFO] Epoch:[0/2](639400/4588595) loss:3.126 lr:0.0000100 epoch_Time:25103.0min: [2024-01-05 13:24:11,473][model8_pretrain.py][INFO] Epoch:[0/2](639400/4588595) loss:2.977 lr:0.0000100 epoch_Time:25103.0min: [2024-01-05 13:24:11,473][model8_pretrain.py][INFO] Epoch:[0/2](639400/4588595) loss:2.765 lr:0.0000100 epoch_Time:25103.0min: [2024-01-05 13:24:11,473][model8_pretrain.py][INFO] Epoch:[0/2](639400/4588595) loss:3.028 lr:0.0000100 epoch_Time:25103.0min: [2024-01-05 13:24:11,473][model8_pretrain.py][INFO] Epoch:[0/2](639400/4588595) loss:2.831 lr:0.0000100 epoch_Time:25103.0min: [2024-01-05 13:24:11,473][model8_pretrain.py][INFO] Epoch:[0/2](639400/4588595) loss:2.075 lr:0.0000100 epoch_Time:25103.0min: [2024-01-05 13:24:11,473][model8_pretrain.py][INFO] Epoch:[0/2](639400/4588595) loss:2.878 lr:0.0000100 epoch_Time:25103.0min: [2024-01-05 13:24:11,473][model8_pretrain.py][INFO] Epoch:[0/2](639400/4588595) loss:2.687 lr:0.0000100 epoch_Time:25103.0min: [2024-01-05 13:24:48,414][model8_pretrain.py][INFO] Epoch:[0/2](639500/4588595) loss:2.686 lr:0.0000100 epoch_Time:25102.0min: [2024-01-05 13:24:48,414][model8_pretrain.py][INFO] Epoch:[0/2](639500/4588595) loss:2.636 lr:0.0000100 epoch_Time:25102.0min: [2024-01-05 
13:24:48,414][model8_pretrain.py][INFO] Epoch:[0/2](639500/4588595) loss:2.828 lr:0.0000100 epoch_Time:25102.0min: [2024-01-05 13:24:48,414][model8_pretrain.py][INFO] Epoch:[0/2](639500/4588595) loss:2.947 lr:0.0000100 epoch_Time:25102.0min: [2024-01-05 13:24:48,414][model8_pretrain.py][INFO] Epoch:[0/2](639500/4588595) loss:2.880 lr:0.0000100 epoch_Time:25102.0min: [2024-01-05 13:24:48,414][model8_pretrain.py][INFO] Epoch:[0/2](639500/4588595) loss:3.138 lr:0.0000100 epoch_Time:25102.0min: [2024-01-05 13:24:48,415][model8_pretrain.py][INFO] Epoch:[0/2](639500/4588595) loss:2.110 lr:0.0000100 epoch_Time:25102.0min: [2024-01-05 13:24:48,415][model8_pretrain.py][INFO] Epoch:[0/2](639500/4588595) loss:3.211 lr:0.0000100 epoch_Time:25102.0min: [2024-01-05 13:25:37,352][model8_pretrain.py][INFO] Epoch:[0/2](639600/4588595) loss:2.467 lr:0.0000100 epoch_Time:25103.0min: [2024-01-05 13:25:37,352][model8_pretrain.py][INFO] Epoch:[0/2](639600/4588595) loss:2.326 lr:0.0000100 epoch_Time:25103.0min: [2024-01-05 13:25:37,352][model8_pretrain.py][INFO] Epoch:[0/2](639600/4588595) loss:3.302 lr:0.0000100 epoch_Time:25103.0min: [2024-01-05 13:25:37,352][model8_pretrain.py][INFO] Epoch:[0/2](639600/4588595) loss:2.826 lr:0.0000100 epoch_Time:25103.0min: [2024-01-05 13:25:37,352][model8_pretrain.py][INFO] Epoch:[0/2](639600/4588595) loss:2.746 lr:0.0000100 epoch_Time:25103.0min: [2024-01-05 13:25:37,352][model8_pretrain.py][INFO] Epoch:[0/2](639600/4588595) loss:2.679 lr:0.0000100 epoch_Time:25103.0min: [2024-01-05 13:25:37,352][model8_pretrain.py][INFO] Epoch:[0/2](639600/4588595) loss:3.295 lr:0.0000100 epoch_Time:25103.0min: [2024-01-05 13:25:37,352][model8_pretrain.py][INFO] Epoch:[0/2](639600/4588595) loss:2.517 lr:0.0000100 epoch_Time:25103.0min: [2024-01-05 13:26:14,284][model8_pretrain.py][INFO] Epoch:[0/2](639700/4588595) loss:2.799 lr:0.0000100 epoch_Time:25102.0min: [2024-01-05 13:26:14,284][model8_pretrain.py][INFO] Epoch:[0/2](639700/4588595) loss:2.873 lr:0.0000100 
epoch_Time:25102.0min: [2024-01-05 13:26:14,284][model8_pretrain.py][INFO] Epoch:[0/2](639700/4588595) loss:2.120 lr:0.0000100 epoch_Time:25102.0min: [2024-01-05 13:26:14,284][model8_pretrain.py][INFO] Epoch:[0/2](639700/4588595) loss:2.658 lr:0.0000100 epoch_Time:25102.0min: [2024-01-05 13:26:14,284][model8_pretrain.py][INFO] Epoch:[0/2](639700/4588595) loss:3.127 lr:0.0000100 epoch_Time:25102.0min: [2024-01-05 13:26:14,284][model8_pretrain.py][INFO] Epoch:[0/2](639700/4588595) loss:2.975 lr:0.0000100 epoch_Time:25102.0min: [2024-01-05 13:26:14,284][model8_pretrain.py][INFO] Epoch:[0/2](639700/4588595) loss:2.618 lr:0.0000100 epoch_Time:25102.0min: [2024-01-05 13:26:14,285][model8_pretrain.py][INFO] Epoch:[0/2](639700/4588595) loss:2.957 lr:0.0000100 epoch_Time:25102.0min: [2024-01-05 13:26:51,230][model8_pretrain.py][INFO] Epoch:[0/2](639800/4588595) loss:2.588 lr:0.0000100 epoch_Time:25101.0min: [2024-01-05 13:26:51,230][model8_pretrain.py][INFO] Epoch:[0/2](639800/4588595) loss:2.542 lr:0.0000100 epoch_Time:25101.0min: [2024-01-05 13:26:51,230][model8_pretrain.py][INFO] Epoch:[0/2](639800/4588595) loss:2.753 lr:0.0000100 epoch_Time:25101.0min: [2024-01-05 13:26:51,230][model8_pretrain.py][INFO] Epoch:[0/2](639800/4588595) loss:2.736 lr:0.0000100 epoch_Time:25101.0min: [2024-01-05 13:26:51,230][model8_pretrain.py][INFO] Epoch:[0/2](639800/4588595) loss:2.435 lr:0.0000100 epoch_Time:25101.0min: [2024-01-05 13:26:51,230][model8_pretrain.py][INFO] Epoch:[0/2](639800/4588595) loss:2.927 lr:0.0000100 epoch_Time:25101.0min: [2024-01-05 13:26:51,230][model8_pretrain.py][INFO] Epoch:[0/2](639800/4588595) loss:2.717 lr:0.0000100 epoch_Time:25101.0min: [2024-01-05 13:26:51,230][model8_pretrain.py][INFO] Epoch:[0/2](639800/4588595) loss:2.905 lr:0.0000100 epoch_Time:25101.0min: [2024-01-05 13:27:28,173][model8_pretrain.py][INFO] Epoch:[0/2](639900/4588595) loss:3.206 lr:0.0000100 epoch_Time:25101.0min: [2024-01-05 13:27:28,173][model8_pretrain.py][INFO] 
Epoch:[0/2](639900/4588595) loss:3.047 lr:0.0000100 epoch_Time:25101.0min: [2024-01-05 13:27:28,173][model8_pretrain.py][INFO] Epoch:[0/2](639900/4588595) loss:2.633 lr:0.0000100 epoch_Time:25101.0min: [2024-01-05 13:27:28,173][model8_pretrain.py][INFO] Epoch:[0/2](639900/4588595) loss:2.712 lr:0.0000100 epoch_Time:25101.0min: [2024-01-05 13:27:28,173][model8_pretrain.py][INFO] Epoch:[0/2](639900/4588595) loss:2.694 lr:0.0000100 epoch_Time:25101.0min: [2024-01-05 13:27:28,173][model8_pretrain.py][INFO] Epoch:[0/2](639900/4588595) loss:3.090 lr:0.0000100 epoch_Time:25101.0min: [2024-01-05 13:27:28,173][model8_pretrain.py][INFO] Epoch:[0/2](639900/4588595) loss:3.303 lr:0.0000100 epoch_Time:25101.0min: [2024-01-05 13:27:28,173][model8_pretrain.py][INFO] Epoch:[0/2](639900/4588595) loss:2.207 lr:0.0000100 epoch_Time:25101.0min: [2024-01-05 13:28:05,119][model8_pretrain.py][INFO] Epoch:[0/2](640000/4588595) loss:2.973 lr:0.0000100 epoch_Time:25100.0min: [2024-01-05 13:28:05,119][model8_pretrain.py][INFO] Epoch:[0/2](640000/4588595) loss:2.719 lr:0.0000100 epoch_Time:25100.0min: [2024-01-05 13:28:05,119][model8_pretrain.py][INFO] Epoch:[0/2](640000/4588595) loss:2.377 lr:0.0000100 epoch_Time:25100.0min: [2024-01-05 13:28:05,119][model8_pretrain.py][INFO] Epoch:[0/2](640000/4588595) loss:2.669 lr:0.0000100 epoch_Time:25100.0min: [2024-01-05 13:28:05,119][model8_pretrain.py][INFO] Epoch:[0/2](640000/4588595) loss:2.863 lr:0.0000100 epoch_Time:25100.0min: [2024-01-05 13:28:05,119][model8_pretrain.py][INFO] Epoch:[0/2](640000/4588595) loss:2.614 lr:0.0000100 epoch_Time:25100.0min: [2024-01-05 13:28:05,119][model8_pretrain.py][INFO] Epoch:[0/2](640000/4588595) loss:3.217 lr:0.0000100 epoch_Time:25100.0min: [2024-01-05 13:28:05,119][model8_pretrain.py][INFO] Epoch:[0/2](640000/4588595) loss:3.164 lr:0.0000100 epoch_Time:25100.0min: [2024-01-05 13:28:42,048][model8_pretrain.py][INFO] Epoch:[0/2](640100/4588595) loss:3.046 lr:0.0000100 epoch_Time:25100.0min: [2024-01-05 
13:28:42,048][model8_pretrain.py][INFO] Epoch:[0/2](640100/4588595) loss:2.862 lr:0.0000100 epoch_Time:25100.0min: [2024-01-05 13:28:42,048][model8_pretrain.py][INFO] Epoch:[0/2](640100/4588595) loss:2.833 lr:0.0000100 epoch_Time:25100.0min: [2024-01-05 13:28:42,048][model8_pretrain.py][INFO] Epoch:[0/2](640100/4588595) loss:3.149 lr:0.0000100 epoch_Time:25100.0min: [2024-01-05 13:28:42,049][model8_pretrain.py][INFO] Epoch:[0/2](640100/4588595) loss:2.747 lr:0.0000100 epoch_Time:25100.0min: [2024-01-05 13:28:42,048][model8_pretrain.py][INFO] Epoch:[0/2](640100/4588595) loss:2.974 lr:0.0000100 epoch_Time:25100.0min: [2024-01-05 13:28:42,048][model8_pretrain.py][INFO] Epoch:[0/2](640100/4588595) loss:2.450 lr:0.0000100 epoch_Time:25100.0min: [2024-01-05 13:28:42,049][model8_pretrain.py][INFO] Epoch:[0/2](640100/4588595) loss:2.915 lr:0.0000100 epoch_Time:25100.0min: [2024-01-05 13:29:18,989][model8_pretrain.py][INFO] Epoch:[0/2](640200/4588595) loss:3.119 lr:0.0000100 epoch_Time:25099.0min: [2024-01-05 13:29:18,989][model8_pretrain.py][INFO] Epoch:[0/2](640200/4588595) loss:2.530 lr:0.0000100 epoch_Time:25099.0min: [2024-01-05 13:29:18,989][model8_pretrain.py][INFO] Epoch:[0/2](640200/4588595) loss:2.890 lr:0.0000100 epoch_Time:25099.0min: [2024-01-05 13:29:18,989][model8_pretrain.py][INFO] Epoch:[0/2](640200/4588595) loss:2.429 lr:0.0000100 epoch_Time:25099.0min: [2024-01-05 13:29:18,989][model8_pretrain.py][INFO] Epoch:[0/2](640200/4588595) loss:2.813 lr:0.0000100 epoch_Time:25099.0min: [2024-01-05 13:29:18,989][model8_pretrain.py][INFO] Epoch:[0/2](640200/4588595) loss:2.652 lr:0.0000100 epoch_Time:25099.0min: [2024-01-05 13:29:18,989][model8_pretrain.py][INFO] Epoch:[0/2](640200/4588595) loss:2.539 lr:0.0000100 epoch_Time:25099.0min: [2024-01-05 13:29:18,989][model8_pretrain.py][INFO] Epoch:[0/2](640200/4588595) loss:3.029 lr:0.0000100 epoch_Time:25099.0min: [2024-01-05 13:29:55,930][model8_pretrain.py][INFO] Epoch:[0/2](640300/4588595) loss:3.130 lr:0.0000100 
epoch_Time:25097.0min: [2024-01-05 13:29:55,930][model8_pretrain.py][INFO] Epoch:[0/2](640300/4588595) loss:3.265 lr:0.0000100 epoch_Time:25097.0min: [2024-01-05 13:29:55,930][model8_pretrain.py][INFO] Epoch:[0/2](640300/4588595) loss:3.163 lr:0.0000100 epoch_Time:25097.0min: [2024-01-05 13:29:55,930][model8_pretrain.py][INFO] Epoch:[0/2](640300/4588595) loss:2.876 lr:0.0000100 epoch_Time:25097.0min: [2024-01-05 13:29:55,930][model8_pretrain.py][INFO] Epoch:[0/2](640300/4588595) loss:3.087 lr:0.0000100 epoch_Time:25097.0min: [2024-01-05 13:29:55,930][model8_pretrain.py][INFO] Epoch:[0/2](640300/4588595) loss:2.843 lr:0.0000100 epoch_Time:25097.0min: [2024-01-05 13:29:55,930][model8_pretrain.py][INFO] Epoch:[0/2](640300/4588595) loss:2.273 lr:0.0000100 epoch_Time:25097.0min: [2024-01-05 13:29:55,930][model8_pretrain.py][INFO] Epoch:[0/2](640300/4588595) loss:2.598 lr:0.0000100 epoch_Time:25097.0min: [2024-01-05 13:30:44,732][model8_pretrain.py][INFO] Epoch:[0/2](640400/4588595) loss:2.965 lr:0.0000100 epoch_Time:25099.0min: [2024-01-05 13:30:44,732][model8_pretrain.py][INFO] Epoch:[0/2](640400/4588595) loss:2.921 lr:0.0000100 epoch_Time:25099.0min: [2024-01-05 13:30:44,732][model8_pretrain.py][INFO] Epoch:[0/2](640400/4588595) loss:2.705 lr:0.0000100 epoch_Time:25099.0min: [2024-01-05 13:30:44,732][model8_pretrain.py][INFO] Epoch:[0/2](640400/4588595) loss:2.523 lr:0.0000100 epoch_Time:25099.0min: [2024-01-05 13:30:44,732][model8_pretrain.py][INFO] Epoch:[0/2](640400/4588595) loss:2.882 lr:0.0000100 epoch_Time:25099.0min: [2024-01-05 13:30:44,732][model8_pretrain.py][INFO] Epoch:[0/2](640400/4588595) loss:3.071 lr:0.0000100 epoch_Time:25099.0min: [2024-01-05 13:30:44,732][model8_pretrain.py][INFO] Epoch:[0/2](640400/4588595) loss:2.618 lr:0.0000100 epoch_Time:25099.0min: [2024-01-05 13:30:44,732][model8_pretrain.py][INFO] Epoch:[0/2](640400/4588595) loss:2.171 lr:0.0000100 epoch_Time:25099.0min: [2024-01-05 13:31:21,664][model8_pretrain.py][INFO] 
Epoch:[0/2](640500/4588595) loss:3.050 lr:0.0000100 epoch_Time:25098.0min: [2024-01-05 13:31:21,664][model8_pretrain.py][INFO] Epoch:[0/2](640500/4588595) loss:2.901 lr:0.0000100 epoch_Time:25098.0min: [2024-01-05 13:31:21,664][model8_pretrain.py][INFO] Epoch:[0/2](640500/4588595) loss:2.616 lr:0.0000100 epoch_Time:25098.0min: [2024-01-05 13:31:21,664][model8_pretrain.py][INFO] Epoch:[0/2](640500/4588595) loss:2.940 lr:0.0000100 epoch_Time:25098.0min: [2024-01-05 13:31:21,664][model8_pretrain.py][INFO] Epoch:[0/2](640500/4588595) loss:3.293 lr:0.0000100 epoch_Time:25098.0min: [2024-01-05 13:31:21,664][model8_pretrain.py][INFO] Epoch:[0/2](640500/4588595) loss:2.847 lr:0.0000100 epoch_Time:25098.0min: [2024-01-05 13:31:21,664][model8_pretrain.py][INFO] Epoch:[0/2](640500/4588595) loss:2.944 lr:0.0000100 epoch_Time:25098.0min: [2024-01-05 13:31:21,665][model8_pretrain.py][INFO] Epoch:[0/2](640500/4588595) loss:2.698 lr:0.0000100 epoch_Time:25098.0min: [2024-01-05 13:31:58,609][model8_pretrain.py][INFO] Epoch:[0/2](640600/4588595) loss:2.966 lr:0.0000100 epoch_Time:25096.0min: [2024-01-05 13:31:58,609][model8_pretrain.py][INFO] Epoch:[0/2](640600/4588595) loss:2.251 lr:0.0000100 epoch_Time:25096.0min: [2024-01-05 13:31:58,609][model8_pretrain.py][INFO] Epoch:[0/2](640600/4588595) loss:3.259 lr:0.0000100 epoch_Time:25096.0min: [2024-01-05 13:31:58,609][model8_pretrain.py][INFO] Epoch:[0/2](640600/4588595) loss:3.260 lr:0.0000100 epoch_Time:25096.0min: [2024-01-05 13:31:58,609][model8_pretrain.py][INFO] Epoch:[0/2](640600/4588595) loss:3.248 lr:0.0000100 epoch_Time:25096.0min: [2024-01-05 13:31:58,609][model8_pretrain.py][INFO] Epoch:[0/2](640600/4588595) loss:2.512 lr:0.0000100 epoch_Time:25096.0min: [2024-01-05 13:31:58,609][model8_pretrain.py][INFO] Epoch:[0/2](640600/4588595) loss:2.583 lr:0.0000100 epoch_Time:25096.0min: [2024-01-05 13:31:58,609][model8_pretrain.py][INFO] Epoch:[0/2](640600/4588595) loss:3.224 lr:0.0000100 epoch_Time:25096.0min: [2024-01-05 
13:32:35,553][model8_pretrain.py][INFO] Epoch:[0/2](640700/4588595) loss:2.105 lr:0.0000100 epoch_Time:25096.0min: [2024-01-05 13:32:35,553][model8_pretrain.py][INFO] Epoch:[0/2](640700/4588595) loss:2.075 lr:0.0000100 epoch_Time:25096.0min: [2024-01-05 13:32:35,553][model8_pretrain.py][INFO] Epoch:[0/2](640700/4588595) loss:2.979 lr:0.0000100 epoch_Time:25096.0min: [2024-01-05 13:32:35,553][model8_pretrain.py][INFO] Epoch:[0/2](640700/4588595) loss:2.340 lr:0.0000100 epoch_Time:25096.0min: [2024-01-05 13:32:35,553][model8_pretrain.py][INFO] Epoch:[0/2](640700/4588595) loss:3.131 lr:0.0000100 epoch_Time:25096.0min: [2024-01-05 13:32:35,553][model8_pretrain.py][INFO] Epoch:[0/2](640700/4588595) loss:3.265 lr:0.0000100 epoch_Time:25096.0min: [2024-01-05 13:32:35,553][model8_pretrain.py][INFO] Epoch:[0/2](640700/4588595) loss:2.975 lr:0.0000100 epoch_Time:25096.0min: [2024-01-05 13:32:35,553][model8_pretrain.py][INFO] Epoch:[0/2](640700/4588595) loss:2.365 lr:0.0000100 epoch_Time:25096.0min: [2024-01-05 13:33:12,492][model8_pretrain.py][INFO] Epoch:[0/2](640800/4588595) loss:2.829 lr:0.0000100 epoch_Time:25095.0min: [2024-01-05 13:33:12,492][model8_pretrain.py][INFO] Epoch:[0/2](640800/4588595) loss:2.696 lr:0.0000100 epoch_Time:25095.0min: [2024-01-05 13:33:12,492][model8_pretrain.py][INFO] Epoch:[0/2](640800/4588595) loss:3.284 lr:0.0000100 epoch_Time:25095.0min: [2024-01-05 13:33:12,492][model8_pretrain.py][INFO] Epoch:[0/2](640800/4588595) loss:2.862 lr:0.0000100 epoch_Time:25095.0min: [2024-01-05 13:33:12,492][model8_pretrain.py][INFO] Epoch:[0/2](640800/4588595) loss:2.382 lr:0.0000100 epoch_Time:25095.0min: [2024-01-05 13:33:12,492][model8_pretrain.py][INFO] Epoch:[0/2](640800/4588595) loss:3.144 lr:0.0000100 epoch_Time:25095.0min: [2024-01-05 13:33:12,492][model8_pretrain.py][INFO] Epoch:[0/2](640800/4588595) loss:2.932 lr:0.0000100 epoch_Time:25095.0min: [2024-01-05 13:33:12,492][model8_pretrain.py][INFO] Epoch:[0/2](640800/4588595) loss:3.225 lr:0.0000100 
epoch_Time:25095.0min: [2024-01-05 13:33:49,430][model8_pretrain.py][INFO] Epoch:[0/2](640900/4588595) loss:3.006 lr:0.0000100 epoch_Time:25094.0min: [2024-01-05 13:33:49,430][model8_pretrain.py][INFO] Epoch:[0/2](640900/4588595) loss:2.544 lr:0.0000100 epoch_Time:25094.0min: [2024-01-05 13:33:49,431][model8_pretrain.py][INFO] Epoch:[0/2](640900/4588595) loss:3.193 lr:0.0000100 epoch_Time:25094.0min: [2024-01-05 13:33:49,431][model8_pretrain.py][INFO] Epoch:[0/2](640900/4588595) loss:2.174 lr:0.0000100 epoch_Time:25094.0min: [2024-01-05 13:33:49,431][model8_pretrain.py][INFO] Epoch:[0/2](640900/4588595) loss:2.206 lr:0.0000100 epoch_Time:25094.0min: [2024-01-05 13:33:49,431][model8_pretrain.py][INFO] Epoch:[0/2](640900/4588595) loss:2.668 lr:0.0000100 epoch_Time:25094.0min: [2024-01-05 13:33:49,431][model8_pretrain.py][INFO] Epoch:[0/2](640900/4588595) loss:3.188 lr:0.0000100 epoch_Time:25094.0min: [2024-01-05 13:33:49,431][model8_pretrain.py][INFO] Epoch:[0/2](640900/4588595) loss:2.171 lr:0.0000100 epoch_Time:25094.0min: [2024-01-05 13:34:26,372][model8_pretrain.py][INFO] Epoch:[0/2](641000/4588595) loss:1.902 lr:0.0000100 epoch_Time:25094.0min: [2024-01-05 13:34:26,372][model8_pretrain.py][INFO] Epoch:[0/2](641000/4588595) loss:3.106 lr:0.0000100 epoch_Time:25094.0min: [2024-01-05 13:34:26,372][model8_pretrain.py][INFO] Epoch:[0/2](641000/4588595) loss:3.277 lr:0.0000100 epoch_Time:25094.0min: [2024-01-05 13:34:26,372][model8_pretrain.py][INFO] Epoch:[0/2](641000/4588595) loss:2.749 lr:0.0000100 epoch_Time:25094.0min: [2024-01-05 13:34:26,372][model8_pretrain.py][INFO] Epoch:[0/2](641000/4588595) loss:2.385 lr:0.0000100 epoch_Time:25094.0min: [2024-01-05 13:34:26,372][model8_pretrain.py][INFO] Epoch:[0/2](641000/4588595) loss:3.102 lr:0.0000100 epoch_Time:25094.0min: [2024-01-05 13:34:26,372][model8_pretrain.py][INFO] Epoch:[0/2](641000/4588595) loss:2.182 lr:0.0000100 epoch_Time:25094.0min: [2024-01-05 13:34:26,372][model8_pretrain.py][INFO] 
Epoch:[0/2](641000/4588595) loss:3.321 lr:0.0000100 epoch_Time:25094.0min: [2024-01-05 13:35:03,314][model8_pretrain.py][INFO] Epoch:[0/2](641100/4588595) loss:3.088 lr:0.0000100 epoch_Time:25093.0min: [2024-01-05 13:35:03,314][model8_pretrain.py][INFO] Epoch:[0/2](641100/4588595) loss:2.587 lr:0.0000100 epoch_Time:25093.0min: [2024-01-05 13:35:03,314][model8_pretrain.py][INFO] Epoch:[0/2](641100/4588595) loss:2.030 lr:0.0000100 epoch_Time:25093.0min: [2024-01-05 13:35:03,314][model8_pretrain.py][INFO] Epoch:[0/2](641100/4588595) loss:2.672 lr:0.0000100 epoch_Time:25093.0min: [2024-01-05 13:35:03,314][model8_pretrain.py][INFO] Epoch:[0/2](641100/4588595) loss:3.272 lr:0.0000100 epoch_Time:25093.0min: [2024-01-05 13:35:03,314][model8_pretrain.py][INFO] Epoch:[0/2](641100/4588595) loss:3.392 lr:0.0000100 epoch_Time:25093.0min: [2024-01-05 13:35:03,314][model8_pretrain.py][INFO] Epoch:[0/2](641100/4588595) loss:2.365 lr:0.0000100 epoch_Time:25093.0min: [2024-01-05 13:35:03,315][model8_pretrain.py][INFO] Epoch:[0/2](641100/4588595) loss:2.638 lr:0.0000100 epoch_Time:25093.0min: [2024-01-05 13:35:52,030][model8_pretrain.py][INFO] Epoch:[0/2](641200/4588595) loss:2.749 lr:0.0000100 epoch_Time:25093.0min: [2024-01-05 13:35:52,030][model8_pretrain.py][INFO] Epoch:[0/2](641200/4588595) loss:2.715 lr:0.0000100 epoch_Time:25093.0min: [2024-01-05 13:35:52,030][model8_pretrain.py][INFO] Epoch:[0/2](641200/4588595) loss:3.255 lr:0.0000100 epoch_Time:25093.0min: [2024-01-05 13:35:52,030][model8_pretrain.py][INFO] Epoch:[0/2](641200/4588595) loss:2.333 lr:0.0000100 epoch_Time:25093.0min: [2024-01-05 13:35:52,030][model8_pretrain.py][INFO] Epoch:[0/2](641200/4588595) loss:3.142 lr:0.0000100 epoch_Time:25093.0min: [2024-01-05 13:35:52,030][model8_pretrain.py][INFO] Epoch:[0/2](641200/4588595) loss:3.049 lr:0.0000100 epoch_Time:25093.0min: [2024-01-05 13:35:52,030][model8_pretrain.py][INFO] Epoch:[0/2](641200/4588595) loss:2.383 lr:0.0000100 epoch_Time:25093.0min: [2024-01-05 
13:35:52,031][model8_pretrain.py][INFO] Epoch:[0/2](641200/4588595) loss:2.475 lr:0.0000100 epoch_Time:25093.0min: [2024-01-05 13:36:28,969][model8_pretrain.py][INFO] Epoch:[0/2](641300/4588595) loss:2.918 lr:0.0000100 epoch_Time:25093.0min: [2024-01-05 13:36:28,970][model8_pretrain.py][INFO] Epoch:[0/2](641300/4588595) loss:2.235 lr:0.0000100 epoch_Time:25093.0min: [2024-01-05 13:36:28,970][model8_pretrain.py][INFO] Epoch:[0/2](641300/4588595) loss:3.109 lr:0.0000100 epoch_Time:25093.0min: [2024-01-05 13:36:28,970][model8_pretrain.py][INFO] Epoch:[0/2](641300/4588595) loss:2.722 lr:0.0000100 epoch_Time:25093.0min: [2024-01-05 13:36:28,970][model8_pretrain.py][INFO] Epoch:[0/2](641300/4588595) loss:2.884 lr:0.0000100 epoch_Time:25093.0min: [2024-01-05 13:36:28,970][model8_pretrain.py][INFO] Epoch:[0/2](641300/4588595) loss:2.333 lr:0.0000100 epoch_Time:25093.0min: [2024-01-05 13:36:28,970][model8_pretrain.py][INFO] Epoch:[0/2](641300/4588595) loss:1.990 lr:0.0000100 epoch_Time:25093.0min: [2024-01-05 13:36:28,970][model8_pretrain.py][INFO] Epoch:[0/2](641300/4588595) loss:2.979 lr:0.0000100 epoch_Time:25093.0min: [2024-01-05 13:37:05,919][model8_pretrain.py][INFO] Epoch:[0/2](641400/4588595) loss:3.380 lr:0.0000100 epoch_Time:25092.0min: [2024-01-05 13:37:05,919][model8_pretrain.py][INFO] Epoch:[0/2](641400/4588595) loss:3.210 lr:0.0000100 epoch_Time:25092.0min: [2024-01-05 13:37:05,919][model8_pretrain.py][INFO] Epoch:[0/2](641400/4588595) loss:2.968 lr:0.0000100 epoch_Time:25092.0min: [2024-01-05 13:37:05,919][model8_pretrain.py][INFO] Epoch:[0/2](641400/4588595) loss:2.964 lr:0.0000100 epoch_Time:25092.0min: [2024-01-05 13:37:05,919][model8_pretrain.py][INFO] Epoch:[0/2](641400/4588595) loss:2.442 lr:0.0000100 epoch_Time:25092.0min: [2024-01-05 13:37:05,919][model8_pretrain.py][INFO] Epoch:[0/2](641400/4588595) loss:2.891 lr:0.0000100 epoch_Time:25092.0min: [2024-01-05 13:37:05,919][model8_pretrain.py][INFO] Epoch:[0/2](641400/4588595) loss:2.319 lr:0.0000100 
epoch_Time:25092.0min: [2024-01-05 13:37:05,919][model8_pretrain.py][INFO] Epoch:[0/2](641400/4588595) loss:2.985 lr:0.0000100 epoch_Time:25092.0min: [2024-01-05 13:37:42,836][model8_pretrain.py][INFO] Epoch:[0/2](641500/4588595) loss:2.131 lr:0.0000100 epoch_Time:25091.0min: [2024-01-05 13:37:42,836][model8_pretrain.py][INFO] Epoch:[0/2](641500/4588595) loss:2.863 lr:0.0000100 epoch_Time:25091.0min: [2024-01-05 13:37:42,836][model8_pretrain.py][INFO] Epoch:[0/2](641500/4588595) loss:3.146 lr:0.0000100 epoch_Time:25091.0min: [2024-01-05 13:37:42,836][model8_pretrain.py][INFO] Epoch:[0/2](641500/4588595) loss:3.778 lr:0.0000100 epoch_Time:25091.0min: [2024-01-05 13:37:42,836][model8_pretrain.py][INFO] Epoch:[0/2](641500/4588595) loss:2.925 lr:0.0000100 epoch_Time:25091.0min: [2024-01-05 13:37:42,836][model8_pretrain.py][INFO] Epoch:[0/2](641500/4588595) loss:2.666 lr:0.0000100 epoch_Time:25091.0min: [2024-01-05 13:37:42,836][model8_pretrain.py][INFO] Epoch:[0/2](641500/4588595) loss:3.232 lr:0.0000100 epoch_Time:25091.0min: [2024-01-05 13:37:42,836][model8_pretrain.py][INFO] Epoch:[0/2](641500/4588595) loss:3.096 lr:0.0000100 epoch_Time:25091.0min: [2024-01-05 13:38:19,772][model8_pretrain.py][INFO] Epoch:[0/2](641600/4588595) loss:2.668 lr:0.0000100 epoch_Time:25090.0min: [2024-01-05 13:38:19,772][model8_pretrain.py][INFO] Epoch:[0/2](641600/4588595) loss:2.868 lr:0.0000100 epoch_Time:25090.0min: [2024-01-05 13:38:19,772][model8_pretrain.py][INFO] Epoch:[0/2](641600/4588595) loss:2.243 lr:0.0000100 epoch_Time:25090.0min: [2024-01-05 13:38:19,772][model8_pretrain.py][INFO] Epoch:[0/2](641600/4588595) loss:3.164 lr:0.0000100 epoch_Time:25090.0min: [2024-01-05 13:38:19,772][model8_pretrain.py][INFO] Epoch:[0/2](641600/4588595) loss:3.143 lr:0.0000100 epoch_Time:25090.0min: [2024-01-05 13:38:19,772][model8_pretrain.py][INFO] Epoch:[0/2](641600/4588595) loss:2.437 lr:0.0000100 epoch_Time:25090.0min: [2024-01-05 13:38:19,772][model8_pretrain.py][INFO] 
Epoch:[0/2](641600/4588595) loss:2.475 lr:0.0000100 epoch_Time:25090.0min: [2024-01-05 13:38:19,772][model8_pretrain.py][INFO] Epoch:[0/2](641600/4588595) loss:2.287 lr:0.0000100 epoch_Time:25090.0min: [2024-01-05 13:38:56,698][model8_pretrain.py][INFO] Epoch:[0/2](641700/4588595) loss:3.102 lr:0.0000100 epoch_Time:25089.0min: [2024-01-05 13:38:56,698][model8_pretrain.py][INFO] Epoch:[0/2](641700/4588595) loss:2.668 lr:0.0000100 epoch_Time:25089.0min: [2024-01-05 13:38:56,698][model8_pretrain.py][INFO] Epoch:[0/2](641700/4588595) loss:2.642 lr:0.0000100 epoch_Time:25089.0min: [2024-01-05 13:38:56,698][model8_pretrain.py][INFO] Epoch:[0/2](641700/4588595) loss:2.735 lr:0.0000100 epoch_Time:25089.0min: [2024-01-05 13:38:56,698][model8_pretrain.py][INFO] Epoch:[0/2](641700/4588595) loss:2.820 lr:0.0000100 epoch_Time:25089.0min: [2024-01-05 13:38:56,698][model8_pretrain.py][INFO] Epoch:[0/2](641700/4588595) loss:2.345 lr:0.0000100 epoch_Time:25089.0min: [2024-01-05 13:38:56,698][model8_pretrain.py][INFO] Epoch:[0/2](641700/4588595) loss:3.163 lr:0.0000100 epoch_Time:25089.0min: [2024-01-05 13:38:56,698][model8_pretrain.py][INFO] Epoch:[0/2](641700/4588595) loss:2.623 lr:0.0000100 epoch_Time:25089.0min: [2024-01-05 13:39:33,632][model8_pretrain.py][INFO] Epoch:[0/2](641800/4588595) loss:3.317 lr:0.0000100 epoch_Time:25089.0min: [2024-01-05 13:39:33,632][model8_pretrain.py][INFO] Epoch:[0/2](641800/4588595) loss:2.897 lr:0.0000100 epoch_Time:25089.0min: [2024-01-05 13:39:33,632][model8_pretrain.py][INFO] Epoch:[0/2](641800/4588595) loss:2.909 lr:0.0000100 epoch_Time:25089.0min: [2024-01-05 13:39:33,632][model8_pretrain.py][INFO] Epoch:[0/2](641800/4588595) loss:2.109 lr:0.0000100 epoch_Time:25089.0min: [2024-01-05 13:39:33,632][model8_pretrain.py][INFO] Epoch:[0/2](641800/4588595) loss:2.709 lr:0.0000100 epoch_Time:25089.0min: [2024-01-05 13:39:33,632][model8_pretrain.py][INFO] Epoch:[0/2](641800/4588595) loss:2.961 lr:0.0000100 epoch_Time:25089.0min: [2024-01-05 
13:39:33,632][model8_pretrain.py][INFO] Epoch:[0/2](641800/4588595) loss:2.987 lr:0.0000100 epoch_Time:25089.0min: [2024-01-05 13:39:33,632][model8_pretrain.py][INFO] Epoch:[0/2](641800/4588595) loss:2.977 lr:0.0000100 epoch_Time:25089.0min: [2024-01-05 13:40:10,568][model8_pretrain.py][INFO] Epoch:[0/2](641900/4588595) loss:3.204 lr:0.0000100 epoch_Time:25088.0min: [2024-01-05 13:40:10,568][model8_pretrain.py][INFO] Epoch:[0/2](641900/4588595) loss:2.610 lr:0.0000100 epoch_Time:25088.0min: [2024-01-05 13:40:10,568][model8_pretrain.py][INFO] Epoch:[0/2](641900/4588595) loss:3.187 lr:0.0000100 epoch_Time:25088.0min: [2024-01-05 13:40:10,568][model8_pretrain.py][INFO] Epoch:[0/2](641900/4588595) loss:2.975 lr:0.0000100 epoch_Time:25088.0min: [2024-01-05 13:40:10,568][model8_pretrain.py][INFO] Epoch:[0/2](641900/4588595) loss:2.750 lr:0.0000100 epoch_Time:25088.0min: [2024-01-05 13:40:10,568][model8_pretrain.py][INFO] Epoch:[0/2](641900/4588595) loss:2.839 lr:0.0000100 epoch_Time:25088.0min: [2024-01-05 13:40:10,568][model8_pretrain.py][INFO] Epoch:[0/2](641900/4588595) loss:3.250 lr:0.0000100 epoch_Time:25088.0min: [2024-01-05 13:40:10,569][model8_pretrain.py][INFO] Epoch:[0/2](641900/4588595) loss:3.226 lr:0.0000100 epoch_Time:25088.0min: [2024-01-05 13:40:59,217][model8_pretrain.py][INFO] Epoch:[0/2](642000/4588595) loss:2.371 lr:0.0000100 epoch_Time:25088.0min: [2024-01-05 13:40:59,217][model8_pretrain.py][INFO] Epoch:[0/2](642000/4588595) loss:2.827 lr:0.0000100 epoch_Time:25088.0min: [2024-01-05 13:40:59,217][model8_pretrain.py][INFO] Epoch:[0/2](642000/4588595) loss:3.106 lr:0.0000100 epoch_Time:25088.0min: [2024-01-05 13:40:59,217][model8_pretrain.py][INFO] Epoch:[0/2](642000/4588595) loss:3.176 lr:0.0000100 epoch_Time:25088.0min: [2024-01-05 13:40:59,217][model8_pretrain.py][INFO] Epoch:[0/2](642000/4588595) loss:3.161 lr:0.0000100 epoch_Time:25088.0min: [2024-01-05 13:40:59,217][model8_pretrain.py][INFO] Epoch:[0/2](642000/4588595) loss:2.834 lr:0.0000100 
epoch_Time:25088.0min: [2024-01-05 13:40:59,217][model8_pretrain.py][INFO] Epoch:[0/2](642000/4588595) loss:3.100 lr:0.0000100 epoch_Time:25088.0min: [2024-01-05 13:40:59,217][model8_pretrain.py][INFO] Epoch:[0/2](642000/4588595) loss:3.212 lr:0.0000100 epoch_Time:25088.0min: [2024-01-05 13:41:36,152][model8_pretrain.py][INFO] Epoch:[0/2](642100/4588595) loss:2.698 lr:0.0000100 epoch_Time:25088.0min: [2024-01-05 13:41:36,152][model8_pretrain.py][INFO] Epoch:[0/2](642100/4588595) loss:2.968 lr:0.0000100 epoch_Time:25088.0min: [2024-01-05 13:41:36,152][model8_pretrain.py][INFO] Epoch:[0/2](642100/4588595) loss:3.357 lr:0.0000100 epoch_Time:25088.0min: [2024-01-05 13:41:36,152][model8_pretrain.py][INFO] Epoch:[0/2](642100/4588595) loss:2.966 lr:0.0000100 epoch_Time:25088.0min: [2024-01-05 13:41:36,152][model8_pretrain.py][INFO] Epoch:[0/2](642100/4588595) loss:2.628 lr:0.0000100 epoch_Time:25088.0min: [2024-01-05 13:41:36,152][model8_pretrain.py][INFO] Epoch:[0/2](642100/4588595) loss:2.767 lr:0.0000100 epoch_Time:25088.0min: [2024-01-05 13:41:36,152][model8_pretrain.py][INFO] Epoch:[0/2](642100/4588595) loss:3.340 lr:0.0000100 epoch_Time:25088.0min: [2024-01-05 13:41:36,152][model8_pretrain.py][INFO] Epoch:[0/2](642100/4588595) loss:3.183 lr:0.0000100 epoch_Time:25088.0min: [2024-01-05 13:42:13,096][model8_pretrain.py][INFO] Epoch:[0/2](642200/4588595) loss:2.776 lr:0.0000100 epoch_Time:25087.0min: [2024-01-05 13:42:13,096][model8_pretrain.py][INFO] Epoch:[0/2](642200/4588595) loss:3.032 lr:0.0000100 epoch_Time:25087.0min: [2024-01-05 13:42:13,096][model8_pretrain.py][INFO] Epoch:[0/2](642200/4588595) loss:2.924 lr:0.0000100 epoch_Time:25087.0min: [2024-01-05 13:42:13,096][model8_pretrain.py][INFO] Epoch:[0/2](642200/4588595) loss:3.029 lr:0.0000100 epoch_Time:25087.0min: [2024-01-05 13:42:13,096][model8_pretrain.py][INFO] Epoch:[0/2](642200/4588595) loss:3.103 lr:0.0000100 epoch_Time:25087.0min: [2024-01-05 13:42:13,096][model8_pretrain.py][INFO] 
Epoch:[0/2](642200/4588595) loss:3.313 lr:0.0000100 epoch_Time:25087.0min: [2024-01-05 13:42:13,096][model8_pretrain.py][INFO] Epoch:[0/2](642200/4588595) loss:3.292 lr:0.0000100 epoch_Time:25087.0min: [2024-01-05 13:42:13,096][model8_pretrain.py][INFO] Epoch:[0/2](642200/4588595) loss:2.582 lr:0.0000100 epoch_Time:25087.0min: [2024-01-05 13:42:50,038][model8_pretrain.py][INFO] Epoch:[0/2](642300/4588595) loss:2.744 lr:0.0000100 epoch_Time:25086.0min: [2024-01-05 13:42:50,038][model8_pretrain.py][INFO] Epoch:[0/2](642300/4588595) loss:2.810 lr:0.0000100 epoch_Time:25086.0min: [2024-01-05 13:42:50,038][model8_pretrain.py][INFO] Epoch:[0/2](642300/4588595) loss:2.946 lr:0.0000100 epoch_Time:25086.0min: [2024-01-05 13:42:50,038][model8_pretrain.py][INFO] Epoch:[0/2](642300/4588595) loss:3.122 lr:0.0000100 epoch_Time:25086.0min: [2024-01-05 13:42:50,038][model8_pretrain.py][INFO] Epoch:[0/2](642300/4588595) loss:2.897 lr:0.0000100 epoch_Time:25086.0min: [2024-01-05 13:42:50,038][model8_pretrain.py][INFO] Epoch:[0/2](642300/4588595) loss:3.396 lr:0.0000100 epoch_Time:25086.0min: [2024-01-05 13:42:50,038][model8_pretrain.py][INFO] Epoch:[0/2](642300/4588595) loss:2.836 lr:0.0000100 epoch_Time:25086.0min: [2024-01-05 13:42:50,038][model8_pretrain.py][INFO] Epoch:[0/2](642300/4588595) loss:3.027 lr:0.0000100 epoch_Time:25086.0min: [2024-01-05 13:43:26,972][model8_pretrain.py][INFO] Epoch:[0/2](642400/4588595) loss:2.474 lr:0.0000100 epoch_Time:25086.0min: [2024-01-05 13:43:26,972][model8_pretrain.py][INFO] Epoch:[0/2](642400/4588595) loss:3.007 lr:0.0000100 epoch_Time:25086.0min: [2024-01-05 13:43:26,972][model8_pretrain.py][INFO] Epoch:[0/2](642400/4588595) loss:2.993 lr:0.0000100 epoch_Time:25086.0min: [2024-01-05 13:43:26,972][model8_pretrain.py][INFO] Epoch:[0/2](642400/4588595) loss:2.680 lr:0.0000100 epoch_Time:25086.0min: [2024-01-05 13:43:26,972][model8_pretrain.py][INFO] Epoch:[0/2](642400/4588595) loss:3.059 lr:0.0000100 epoch_Time:25086.0min: [2024-01-05 
13:43:26,972][model8_pretrain.py][INFO] Epoch:[0/2](642400/4588595) loss:2.536 lr:0.0000100 epoch_Time:25086.0min: [2024-01-05 13:43:26,972][model8_pretrain.py][INFO] Epoch:[0/2](642400/4588595) loss:3.374 lr:0.0000100 epoch_Time:25086.0min: [2024-01-05 13:43:26,972][model8_pretrain.py][INFO] Epoch:[0/2](642400/4588595) loss:2.471 lr:0.0000100 epoch_Time:25086.0min: [2024-01-05 13:44:03,902][model8_pretrain.py][INFO] Epoch:[0/2](642500/4588595) loss:2.564 lr:0.0000100 epoch_Time:25084.0min: [2024-01-05 13:44:03,902][model8_pretrain.py][INFO] Epoch:[0/2](642500/4588595) loss:2.780 lr:0.0000100 epoch_Time:25084.0min: [2024-01-05 13:44:03,902][model8_pretrain.py][INFO] Epoch:[0/2](642500/4588595) loss:2.894 lr:0.0000100 epoch_Time:25084.0min: [2024-01-05 13:44:03,902][model8_pretrain.py][INFO] Epoch:[0/2](642500/4588595) loss:2.631 lr:0.0000100 epoch_Time:25084.0min: [2024-01-05 13:44:03,902][model8_pretrain.py][INFO] Epoch:[0/2](642500/4588595) loss:2.643 lr:0.0000100 epoch_Time:25084.0min: [2024-01-05 13:44:03,902][model8_pretrain.py][INFO] Epoch:[0/2](642500/4588595) loss:2.942 lr:0.0000100 epoch_Time:25084.0min: [2024-01-05 13:44:03,902][model8_pretrain.py][INFO] Epoch:[0/2](642500/4588595) loss:2.653 lr:0.0000100 epoch_Time:25084.0min: [2024-01-05 13:44:03,902][model8_pretrain.py][INFO] Epoch:[0/2](642500/4588595) loss:2.893 lr:0.0000100 epoch_Time:25084.0min: [2024-01-05 13:44:40,834][model8_pretrain.py][INFO] Epoch:[0/2](642600/4588595) loss:2.622 lr:0.0000100 epoch_Time:25084.0min: [2024-01-05 13:44:40,834][model8_pretrain.py][INFO] Epoch:[0/2](642600/4588595) loss:2.850 lr:0.0000100 epoch_Time:25084.0min: [2024-01-05 13:44:40,834][model8_pretrain.py][INFO] Epoch:[0/2](642600/4588595) loss:3.019 lr:0.0000100 epoch_Time:25084.0min: [2024-01-05 13:44:40,834][model8_pretrain.py][INFO] Epoch:[0/2](642600/4588595) loss:2.889 lr:0.0000100 epoch_Time:25084.0min: [2024-01-05 13:44:40,834][model8_pretrain.py][INFO] Epoch:[0/2](642600/4588595) loss:2.792 lr:0.0000100 
epoch_Time:25084.0min: [2024-01-05 13:44:40,834][model8_pretrain.py][INFO] Epoch:[0/2](642600/4588595) loss:3.064 lr:0.0000100 epoch_Time:25084.0min: [2024-01-05 13:44:40,834][model8_pretrain.py][INFO] Epoch:[0/2](642600/4588595) loss:2.985 lr:0.0000100 epoch_Time:25084.0min: [2024-01-05 13:44:40,834][model8_pretrain.py][INFO] Epoch:[0/2](642600/4588595) loss:3.126 lr:0.0000100 epoch_Time:25084.0min: [2024-01-05 13:45:17,762][model8_pretrain.py][INFO] Epoch:[0/2](642700/4588595) loss:2.487 lr:0.0000100 epoch_Time:25083.0min: [2024-01-05 13:45:17,762][model8_pretrain.py][INFO] Epoch:[0/2](642700/4588595) loss:3.220 lr:0.0000100 epoch_Time:25083.0min: [2024-01-05 13:45:17,763][model8_pretrain.py][INFO] Epoch:[0/2](642700/4588595) loss:2.917 lr:0.0000100 epoch_Time:25083.0min: [2024-01-05 13:45:17,763][model8_pretrain.py][INFO] Epoch:[0/2](642700/4588595) loss:2.724 lr:0.0000100 epoch_Time:25083.0min: [2024-01-05 13:45:17,763][model8_pretrain.py][INFO] Epoch:[0/2](642700/4588595) loss:2.709 lr:0.0000100 epoch_Time:25083.0min: [2024-01-05 13:45:17,763][model8_pretrain.py][INFO] Epoch:[0/2](642700/4588595) loss:3.158 lr:0.0000100 epoch_Time:25083.0min: [2024-01-05 13:45:17,763][model8_pretrain.py][INFO] Epoch:[0/2](642700/4588595) loss:2.792 lr:0.0000100 epoch_Time:25083.0min: [2024-01-05 13:45:17,763][model8_pretrain.py][INFO] Epoch:[0/2](642700/4588595) loss:2.408 lr:0.0000100 epoch_Time:25083.0min: [2024-01-05 13:46:06,362][model8_pretrain.py][INFO] Epoch:[0/2](642800/4588595) loss:2.663 lr:0.0000100 epoch_Time:25083.0min: [2024-01-05 13:46:06,362][model8_pretrain.py][INFO] Epoch:[0/2](642800/4588595) loss:2.633 lr:0.0000100 epoch_Time:25083.0min: [2024-01-05 13:46:06,362][model8_pretrain.py][INFO] Epoch:[0/2](642800/4588595) loss:2.682 lr:0.0000100 epoch_Time:25083.0min: [2024-01-05 13:46:06,362][model8_pretrain.py][INFO] Epoch:[0/2](642800/4588595) loss:3.155 lr:0.0000100 epoch_Time:25083.0min: [2024-01-05 13:46:06,362][model8_pretrain.py][INFO] 
Epoch:[0/2](642800/4588595) loss:2.242 lr:0.0000100 epoch_Time:25083.0min: [2024-01-05 13:46:06,362][model8_pretrain.py][INFO] Epoch:[0/2](642800/4588595) loss:2.650 lr:0.0000100 epoch_Time:25083.0min: [2024-01-05 13:46:06,362][model8_pretrain.py][INFO] Epoch:[0/2](642800/4588595) loss:2.971 lr:0.0000100 epoch_Time:25083.0min: [2024-01-05 13:46:06,362][model8_pretrain.py][INFO] Epoch:[0/2](642800/4588595) loss:3.062 lr:0.0000100 epoch_Time:25083.0min: [2024-01-05 13:46:43,302][model8_pretrain.py][INFO] Epoch:[0/2](642900/4588595) loss:2.648 lr:0.0000100 epoch_Time:25083.0min: [2024-01-05 13:46:43,302][model8_pretrain.py][INFO] Epoch:[0/2](642900/4588595) loss:3.207 lr:0.0000100 epoch_Time:25083.0min: [2024-01-05 13:46:43,302][model8_pretrain.py][INFO] Epoch:[0/2](642900/4588595) loss:2.852 lr:0.0000100 epoch_Time:25083.0min: [2024-01-05 13:46:43,302][model8_pretrain.py][INFO] Epoch:[0/2](642900/4588595) loss:2.894 lr:0.0000100 epoch_Time:25083.0min: [2024-01-05 13:46:43,302][model8_pretrain.py][INFO] Epoch:[0/2](642900/4588595) loss:2.901 lr:0.0000100 epoch_Time:25083.0min: [2024-01-05 13:46:43,302][model8_pretrain.py][INFO] Epoch:[0/2](642900/4588595) loss:3.352 lr:0.0000100 epoch_Time:25083.0min: [2024-01-05 13:46:43,302][model8_pretrain.py][INFO] Epoch:[0/2](642900/4588595) loss:3.072 lr:0.0000100 epoch_Time:25083.0min: [2024-01-05 13:46:43,302][model8_pretrain.py][INFO] Epoch:[0/2](642900/4588595) loss:2.426 lr:0.0000100 epoch_Time:25083.0min: [2024-01-05 13:47:20,243][model8_pretrain.py][INFO] Epoch:[0/2](643000/4588595) loss:2.993 lr:0.0000100 epoch_Time:25082.0min: [2024-01-05 13:47:20,243][model8_pretrain.py][INFO] Epoch:[0/2](643000/4588595) loss:2.975 lr:0.0000100 epoch_Time:25082.0min: [2024-01-05 13:47:20,243][model8_pretrain.py][INFO] Epoch:[0/2](643000/4588595) loss:2.839 lr:0.0000100 epoch_Time:25082.0min: [2024-01-05 13:47:20,243][model8_pretrain.py][INFO] Epoch:[0/2](643000/4588595) loss:2.962 lr:0.0000100 epoch_Time:25082.0min: [2024-01-05 
13:47:20,243][model8_pretrain.py][INFO] Epoch:[0/2](643000/4588595) loss:3.218 lr:0.0000100 epoch_Time:25082.0min: [2024-01-05 13:47:20,243][model8_pretrain.py][INFO] Epoch:[0/2](643000/4588595) loss:2.874 lr:0.0000100 epoch_Time:25082.0min: [2024-01-05 13:47:20,243][model8_pretrain.py][INFO] Epoch:[0/2](643000/4588595) loss:2.317 lr:0.0000100 epoch_Time:25082.0min: [2024-01-05 13:47:20,244][model8_pretrain.py][INFO] Epoch:[0/2](643000/4588595) loss:2.576 lr:0.0000100 epoch_Time:25082.0min: [2024-01-05 13:47:57,176][model8_pretrain.py][INFO] Epoch:[0/2](643100/4588595) loss:2.860 lr:0.0000100 epoch_Time:25081.0min: [2024-01-05 13:47:57,176][model8_pretrain.py][INFO] Epoch:[0/2](643100/4588595) loss:2.551 lr:0.0000100 epoch_Time:25081.0min: [2024-01-05 13:47:57,176][model8_pretrain.py][INFO] Epoch:[0/2](643100/4588595) loss:3.401 lr:0.0000100 epoch_Time:25081.0min: [2024-01-05 13:47:57,176][model8_pretrain.py][INFO] Epoch:[0/2](643100/4588595) loss:2.495 lr:0.0000100 epoch_Time:25081.0min: [2024-01-05 13:47:57,176][model8_pretrain.py][INFO] Epoch:[0/2](643100/4588595) loss:2.537 lr:0.0000100 epoch_Time:25081.0min: [2024-01-05 13:47:57,176][model8_pretrain.py][INFO] Epoch:[0/2](643100/4588595) loss:2.401 lr:0.0000100 epoch_Time:25081.0min: [2024-01-05 13:47:57,176][model8_pretrain.py][INFO] Epoch:[0/2](643100/4588595) loss:3.100 lr:0.0000100 epoch_Time:25081.0min: [2024-01-05 13:47:57,176][model8_pretrain.py][INFO] Epoch:[0/2](643100/4588595) loss:2.551 lr:0.0000100 epoch_Time:25081.0min: [2024-01-05 13:48:34,111][model8_pretrain.py][INFO] Epoch:[0/2](643200/4588595) loss:2.019 lr:0.0000100 epoch_Time:25081.0min: [2024-01-05 13:48:34,111][model8_pretrain.py][INFO] Epoch:[0/2](643200/4588595) loss:3.077 lr:0.0000100 epoch_Time:25081.0min: [2024-01-05 13:48:34,111][model8_pretrain.py][INFO] Epoch:[0/2](643200/4588595) loss:2.878 lr:0.0000100 epoch_Time:25081.0min: [2024-01-05 13:48:34,111][model8_pretrain.py][INFO] Epoch:[0/2](643200/4588595) loss:2.871 lr:0.0000100 
epoch_Time:25081.0min: [2024-01-05 13:48:34,111][model8_pretrain.py][INFO] Epoch:[0/2](643200/4588595) loss:2.828 lr:0.0000100 epoch_Time:25081.0min: [2024-01-05 13:48:34,111][model8_pretrain.py][INFO] Epoch:[0/2](643200/4588595) loss:3.179 lr:0.0000100 epoch_Time:25081.0min: [2024-01-05 13:48:34,111][model8_pretrain.py][INFO] Epoch:[0/2](643200/4588595) loss:2.693 lr:0.0000100 epoch_Time:25081.0min: [2024-01-05 13:48:34,112][model8_pretrain.py][INFO] Epoch:[0/2](643200/4588595) loss:2.693 lr:0.0000100 epoch_Time:25081.0min: [2024-01-05 13:49:11,077][model8_pretrain.py][INFO] Epoch:[0/2](643300/4588595) loss:2.702 lr:0.0000100 epoch_Time:25080.0min: [2024-01-05 13:49:11,077][model8_pretrain.py][INFO] Epoch:[0/2](643300/4588595) loss:2.742 lr:0.0000100 epoch_Time:25080.0min: [2024-01-05 13:49:11,077][model8_pretrain.py][INFO] Epoch:[0/2](643300/4588595) loss:3.031 lr:0.0000100 epoch_Time:25080.0min: [2024-01-05 13:49:11,077][model8_pretrain.py][INFO] Epoch:[0/2](643300/4588595) loss:2.619 lr:0.0000100 epoch_Time:25080.0min: [2024-01-05 13:49:11,077][model8_pretrain.py][INFO] Epoch:[0/2](643300/4588595) loss:2.611 lr:0.0000100 epoch_Time:25080.0min: [2024-01-05 13:49:11,077][model8_pretrain.py][INFO] Epoch:[0/2](643300/4588595) loss:2.933 lr:0.0000100 epoch_Time:25080.0min: [2024-01-05 13:49:11,078][model8_pretrain.py][INFO] Epoch:[0/2](643300/4588595) loss:2.498 lr:0.0000100 epoch_Time:25080.0min: [2024-01-05 13:49:11,078][model8_pretrain.py][INFO] Epoch:[0/2](643300/4588595) loss:2.769 lr:0.0000100 epoch_Time:25080.0min: [2024-01-05 13:49:48,017][model8_pretrain.py][INFO] Epoch:[0/2](643400/4588595) loss:3.309 lr:0.0000100 epoch_Time:25079.0min: [2024-01-05 13:49:48,017][model8_pretrain.py][INFO] Epoch:[0/2](643400/4588595) loss:1.719 lr:0.0000100 epoch_Time:25079.0min: [2024-01-05 13:49:48,017][model8_pretrain.py][INFO] Epoch:[0/2](643400/4588595) loss:2.714 lr:0.0000100 epoch_Time:25079.0min: [2024-01-05 13:49:48,017][model8_pretrain.py][INFO] 
Epoch:[0/2](643400/4588595) loss:2.553 lr:0.0000100 epoch_Time:25079.0min: [2024-01-05 13:49:48,017][model8_pretrain.py][INFO] Epoch:[0/2](643400/4588595) loss:2.288 lr:0.0000100 epoch_Time:25079.0min: [2024-01-05 13:49:48,017][model8_pretrain.py][INFO] Epoch:[0/2](643400/4588595) loss:2.645 lr:0.0000100 epoch_Time:25079.0min: [2024-01-05 13:49:48,017][model8_pretrain.py][INFO] Epoch:[0/2](643400/4588595) loss:2.964 lr:0.0000100 epoch_Time:25079.0min: [2024-01-05 13:49:48,018][model8_pretrain.py][INFO] Epoch:[0/2](643400/4588595) loss:3.179 lr:0.0000100 epoch_Time:25079.0min: [2024-01-05 13:50:24,950][model8_pretrain.py][INFO] Epoch:[0/2](643500/4588595) loss:2.385 lr:0.0000100 epoch_Time:25078.0min: [2024-01-05 13:50:24,950][model8_pretrain.py][INFO] Epoch:[0/2](643500/4588595) loss:2.580 lr:0.0000100 epoch_Time:25078.0min: [2024-01-05 13:50:24,950][model8_pretrain.py][INFO] Epoch:[0/2](643500/4588595) loss:2.783 lr:0.0000100 epoch_Time:25078.0min: [2024-01-05 13:50:24,950][model8_pretrain.py][INFO] Epoch:[0/2](643500/4588595) loss:3.257 lr:0.0000100 epoch_Time:25078.0min: [2024-01-05 13:50:24,950][model8_pretrain.py][INFO] Epoch:[0/2](643500/4588595) loss:2.892 lr:0.0000100 epoch_Time:25078.0min: [2024-01-05 13:50:24,950][model8_pretrain.py][INFO] Epoch:[0/2](643500/4588595) loss:2.677 lr:0.0000100 epoch_Time:25078.0min: [2024-01-05 13:50:24,951][model8_pretrain.py][INFO] Epoch:[0/2](643500/4588595) loss:2.662 lr:0.0000100 epoch_Time:25078.0min: [2024-01-05 13:50:24,951][model8_pretrain.py][INFO] Epoch:[0/2](643500/4588595) loss:3.109 lr:0.0000100 epoch_Time:25078.0min: [2024-01-05 13:51:13,995][model8_pretrain.py][INFO] Epoch:[0/2](643600/4588595) loss:3.027 lr:0.0000100 epoch_Time:25079.0min: [2024-01-05 13:51:13,995][model8_pretrain.py][INFO] Epoch:[0/2](643600/4588595) loss:2.646 lr:0.0000100 epoch_Time:25079.0min: [2024-01-05 13:51:13,995][model8_pretrain.py][INFO] Epoch:[0/2](643600/4588595) loss:2.612 lr:0.0000100 epoch_Time:25079.0min: [2024-01-05 
13:51:13,995][model8_pretrain.py][INFO] Epoch:[0/2](643600/4588595) loss:2.881 lr:0.0000100 epoch_Time:25079.0min: [2024-01-05 13:51:13,995][model8_pretrain.py][INFO] Epoch:[0/2](643600/4588595) loss:2.916 lr:0.0000100 epoch_Time:25079.0min: [2024-01-05 13:51:13,995][model8_pretrain.py][INFO] Epoch:[0/2](643600/4588595) loss:3.005 lr:0.0000100 epoch_Time:25079.0min: [2024-01-05 13:51:13,995][model8_pretrain.py][INFO] Epoch:[0/2](643600/4588595) loss:2.960 lr:0.0000100 epoch_Time:25079.0min: [2024-01-05 13:51:13,995][model8_pretrain.py][INFO] Epoch:[0/2](643600/4588595) loss:3.231 lr:0.0000100 epoch_Time:25079.0min: [2024-01-05 13:51:50,929][model8_pretrain.py][INFO] Epoch:[0/2](643700/4588595) loss:1.963 lr:0.0000100 epoch_Time:25078.0min: [2024-01-05 13:51:50,929][model8_pretrain.py][INFO] Epoch:[0/2](643700/4588595) loss:1.461 lr:0.0000100 epoch_Time:25078.0min: [2024-01-05 13:51:50,929][model8_pretrain.py][INFO] Epoch:[0/2](643700/4588595) loss:3.088 lr:0.0000100 epoch_Time:25078.0min: [2024-01-05 13:51:50,929][model8_pretrain.py][INFO] Epoch:[0/2](643700/4588595) loss:3.235 lr:0.0000100 epoch_Time:25078.0min: [2024-01-05 13:51:50,929][model8_pretrain.py][INFO] Epoch:[0/2](643700/4588595) loss:2.944 lr:0.0000100 epoch_Time:25078.0min: [2024-01-05 13:51:50,929][model8_pretrain.py][INFO] Epoch:[0/2](643700/4588595) loss:3.141 lr:0.0000100 epoch_Time:25078.0min: [2024-01-05 13:51:50,929][model8_pretrain.py][INFO] Epoch:[0/2](643700/4588595) loss:3.355 lr:0.0000100 epoch_Time:25078.0min: [2024-01-05 13:51:50,929][model8_pretrain.py][INFO] Epoch:[0/2](643700/4588595) loss:2.538 lr:0.0000100 epoch_Time:25078.0min: [2024-01-05 13:52:27,868][model8_pretrain.py][INFO] Epoch:[0/2](643800/4588595) loss:2.540 lr:0.0000100 epoch_Time:25077.0min: [2024-01-05 13:52:27,868][model8_pretrain.py][INFO] Epoch:[0/2](643800/4588595) loss:3.244 lr:0.0000100 epoch_Time:25077.0min: [2024-01-05 13:52:27,868][model8_pretrain.py][INFO] Epoch:[0/2](643800/4588595) loss:2.635 lr:0.0000100 
epoch_Time:25077.0min: [2024-01-05 13:52:27,868][model8_pretrain.py][INFO] Epoch:[0/2](643800/4588595) loss:3.136 lr:0.0000100 epoch_Time:25077.0min: [2024-01-05 13:52:27,868][model8_pretrain.py][INFO] Epoch:[0/2](643800/4588595) loss:2.020 lr:0.0000100 epoch_Time:25077.0min: [2024-01-05 13:52:27,868][model8_pretrain.py][INFO] Epoch:[0/2](643800/4588595) loss:3.342 lr:0.0000100 epoch_Time:25077.0min: [2024-01-05 13:52:27,868][model8_pretrain.py][INFO] Epoch:[0/2](643800/4588595) loss:2.868 lr:0.0000100 epoch_Time:25077.0min: [2024-01-05 13:52:27,869][model8_pretrain.py][INFO] Epoch:[0/2](643800/4588595) loss:2.467 lr:0.0000100 epoch_Time:25077.0min: [2024-01-05 13:53:04,799][model8_pretrain.py][INFO] Epoch:[0/2](643900/4588595) loss:2.770 lr:0.0000100 epoch_Time:25076.0min: [2024-01-05 13:53:04,799][model8_pretrain.py][INFO] Epoch:[0/2](643900/4588595) loss:2.838 lr:0.0000100 epoch_Time:25076.0min: [2024-01-05 13:53:04,799][model8_pretrain.py][INFO] Epoch:[0/2](643900/4588595) loss:2.701 lr:0.0000100 epoch_Time:25076.0min: [2024-01-05 13:53:04,799][model8_pretrain.py][INFO] Epoch:[0/2](643900/4588595) loss:3.081 lr:0.0000100 epoch_Time:25076.0min: [2024-01-05 13:53:04,800][model8_pretrain.py][INFO] Epoch:[0/2](643900/4588595) loss:3.201 lr:0.0000100 epoch_Time:25076.0min: [2024-01-05 13:53:04,799][model8_pretrain.py][INFO] Epoch:[0/2](643900/4588595) loss:3.167 lr:0.0000100 epoch_Time:25076.0min: [2024-01-05 13:53:04,800][model8_pretrain.py][INFO] Epoch:[0/2](643900/4588595) loss:2.855 lr:0.0000100 epoch_Time:25076.0min: [2024-01-05 13:53:04,800][model8_pretrain.py][INFO] Epoch:[0/2](643900/4588595) loss:2.888 lr:0.0000100 epoch_Time:25076.0min: [2024-01-05 13:53:41,739][model8_pretrain.py][INFO] Epoch:[0/2](644000/4588595) loss:2.620 lr:0.0000100 epoch_Time:25076.0min: [2024-01-05 13:53:41,739][model8_pretrain.py][INFO] Epoch:[0/2](644000/4588595) loss:3.041 lr:0.0000100 epoch_Time:25076.0min: [2024-01-05 13:53:41,739][model8_pretrain.py][INFO] 
Epoch:[0/2](644000/4588595) loss:2.655 lr:0.0000100 epoch_Time:25076.0min: [2024-01-05 13:53:41,739][model8_pretrain.py][INFO] Epoch:[0/2](644000/4588595) loss:3.099 lr:0.0000100 epoch_Time:25076.0min: [2024-01-05 13:53:41,739][model8_pretrain.py][INFO] Epoch:[0/2](644000/4588595) loss:3.123 lr:0.0000100 epoch_Time:25076.0min: [2024-01-05 13:53:41,739][model8_pretrain.py][INFO] Epoch:[0/2](644000/4588595) loss:3.287 lr:0.0000100 epoch_Time:25076.0min: [2024-01-05 13:53:41,739][model8_pretrain.py][INFO] Epoch:[0/2](644000/4588595) loss:2.917 lr:0.0000100 epoch_Time:25076.0min: [2024-01-05 13:53:41,739][model8_pretrain.py][INFO] Epoch:[0/2](644000/4588595) loss:3.021 lr:0.0000100 epoch_Time:25076.0min: [2024-01-05 13:54:18,693][model8_pretrain.py][INFO] Epoch:[0/2](644100/4588595) loss:3.369 lr:0.0000100 epoch_Time:25075.0min: [2024-01-05 13:54:18,694][model8_pretrain.py][INFO] Epoch:[0/2](644100/4588595) loss:2.456 lr:0.0000100 epoch_Time:25075.0min: [2024-01-05 13:54:18,694][model8_pretrain.py][INFO] Epoch:[0/2](644100/4588595) loss:2.525 lr:0.0000100 epoch_Time:25075.0min: [2024-01-05 13:54:18,694][model8_pretrain.py][INFO] Epoch:[0/2](644100/4588595) loss:2.792 lr:0.0000100 epoch_Time:25075.0min: [2024-01-05 13:54:18,694][model8_pretrain.py][INFO] Epoch:[0/2](644100/4588595) loss:3.019 lr:0.0000100 epoch_Time:25075.0min: [2024-01-05 13:54:18,694][model8_pretrain.py][INFO] Epoch:[0/2](644100/4588595) loss:2.728 lr:0.0000100 epoch_Time:25075.0min: [2024-01-05 13:54:18,694][model8_pretrain.py][INFO] Epoch:[0/2](644100/4588595) loss:2.263 lr:0.0000100 epoch_Time:25075.0min: [2024-01-05 13:54:18,694][model8_pretrain.py][INFO] Epoch:[0/2](644100/4588595) loss:2.942 lr:0.0000100 epoch_Time:25075.0min: [2024-01-05 13:54:55,625][model8_pretrain.py][INFO] Epoch:[0/2](644200/4588595) loss:2.713 lr:0.0000100 epoch_Time:25074.0min: [2024-01-05 13:54:55,625][model8_pretrain.py][INFO] Epoch:[0/2](644200/4588595) loss:2.328 lr:0.0000100 epoch_Time:25074.0min: [2024-01-05 
13:54:55,625][model8_pretrain.py][INFO] Epoch:[0/2](644200/4588595) loss:2.549 lr:0.0000100 epoch_Time:25074.0min: [2024-01-05 13:54:55,625][model8_pretrain.py][INFO] Epoch:[0/2](644200/4588595) loss:2.649 lr:0.0000100 epoch_Time:25074.0min: [2024-01-05 13:54:55,625][model8_pretrain.py][INFO] Epoch:[0/2](644200/4588595) loss:2.167 lr:0.0000100 epoch_Time:25074.0min: [2024-01-05 13:54:55,625][model8_pretrain.py][INFO] Epoch:[0/2](644200/4588595) loss:2.466 lr:0.0000100 epoch_Time:25074.0min: [2024-01-05 13:54:55,625][model8_pretrain.py][INFO] Epoch:[0/2](644200/4588595) loss:2.830 lr:0.0000100 epoch_Time:25074.0min: [2024-01-05 13:54:55,625][model8_pretrain.py][INFO] Epoch:[0/2](644200/4588595) loss:3.081 lr:0.0000100 epoch_Time:25074.0min: [2024-01-05 13:55:32,554][model8_pretrain.py][INFO] Epoch:[0/2](644300/4588595) loss:2.787 lr:0.0000100 epoch_Time:25074.0min: [2024-01-05 13:55:32,554][model8_pretrain.py][INFO] Epoch:[0/2](644300/4588595) loss:2.530 lr:0.0000100 epoch_Time:25074.0min: [2024-01-05 13:55:32,554][model8_pretrain.py][INFO] Epoch:[0/2](644300/4588595) loss:2.804 lr:0.0000100 epoch_Time:25074.0min: [2024-01-05 13:55:32,554][model8_pretrain.py][INFO] Epoch:[0/2](644300/4588595) loss:2.532 lr:0.0000100 epoch_Time:25074.0min: [2024-01-05 13:55:32,554][model8_pretrain.py][INFO] Epoch:[0/2](644300/4588595) loss:2.698 lr:0.0000100 epoch_Time:25074.0min: [2024-01-05 13:55:32,554][model8_pretrain.py][INFO] Epoch:[0/2](644300/4588595) loss:2.879 lr:0.0000100 epoch_Time:25074.0min: [2024-01-05 13:55:32,554][model8_pretrain.py][INFO] Epoch:[0/2](644300/4588595) loss:2.090 lr:0.0000100 epoch_Time:25074.0min: [2024-01-05 13:55:32,554][model8_pretrain.py][INFO] Epoch:[0/2](644300/4588595) loss:2.564 lr:0.0000100 epoch_Time:25074.0min: [2024-01-05 13:56:21,448][model8_pretrain.py][INFO] Epoch:[0/2](644400/4588595) loss:3.152 lr:0.0000100 epoch_Time:25074.0min: [2024-01-05 13:56:21,448][model8_pretrain.py][INFO] Epoch:[0/2](644400/4588595) loss:3.218 lr:0.0000100 
epoch_Time:25074.0min: [2024-01-05 13:56:21,448][model8_pretrain.py][INFO] Epoch:[0/2](644400/4588595) loss:2.651 lr:0.0000100 epoch_Time:25074.0min: [2024-01-05 13:56:21,448][model8_pretrain.py][INFO] Epoch:[0/2](644400/4588595) loss:2.521 lr:0.0000100 epoch_Time:25074.0min: [2024-01-05 13:56:21,448][model8_pretrain.py][INFO] Epoch:[0/2](644400/4588595) loss:3.154 lr:0.0000100 epoch_Time:25074.0min: [2024-01-05 13:56:21,448][model8_pretrain.py][INFO] Epoch:[0/2](644400/4588595) loss:2.724 lr:0.0000100 epoch_Time:25074.0min: [2024-01-05 13:56:21,448][model8_pretrain.py][INFO] Epoch:[0/2](644400/4588595) loss:2.890 lr:0.0000100 epoch_Time:25074.0min: [2024-01-05 13:56:21,448][model8_pretrain.py][INFO] Epoch:[0/2](644400/4588595) loss:3.152 lr:0.0000100 epoch_Time:25074.0min: [2024-01-05 13:56:58,384][model8_pretrain.py][INFO] Epoch:[0/2](644500/4588595) loss:3.199 lr:0.0000100 epoch_Time:25073.0min: [2024-01-05 13:56:58,384][model8_pretrain.py][INFO] Epoch:[0/2](644500/4588595) loss:2.430 lr:0.0000100 epoch_Time:25073.0min: [2024-01-05 13:56:58,384][model8_pretrain.py][INFO] Epoch:[0/2](644500/4588595) loss:3.205 lr:0.0000100 epoch_Time:25073.0min: [2024-01-05 13:56:58,384][model8_pretrain.py][INFO] Epoch:[0/2](644500/4588595) loss:3.146 lr:0.0000100 epoch_Time:25073.0min: [2024-01-05 13:56:58,384][model8_pretrain.py][INFO] Epoch:[0/2](644500/4588595) loss:2.775 lr:0.0000100 epoch_Time:25073.0min: [2024-01-05 13:56:58,384][model8_pretrain.py][INFO] Epoch:[0/2](644500/4588595) loss:3.393 lr:0.0000100 epoch_Time:25073.0min: [2024-01-05 13:56:58,384][model8_pretrain.py][INFO] Epoch:[0/2](644500/4588595) loss:2.317 lr:0.0000100 epoch_Time:25073.0min: [2024-01-05 13:56:58,384][model8_pretrain.py][INFO] Epoch:[0/2](644500/4588595) loss:2.833 lr:0.0000100 epoch_Time:25073.0min: [2024-01-05 13:57:35,296][model8_pretrain.py][INFO] Epoch:[0/2](644600/4588595) loss:2.929 lr:0.0000100 epoch_Time:25073.0min: [2024-01-05 13:57:35,296][model8_pretrain.py][INFO] 
Epoch:[0/2](644600/4588595) loss:2.455 lr:0.0000100 epoch_Time:25073.0min: [2024-01-05 13:57:35,296][model8_pretrain.py][INFO] Epoch:[0/2](644600/4588595) loss:2.619 lr:0.0000100 epoch_Time:25073.0min: [2024-01-05 13:57:35,296][model8_pretrain.py][INFO] Epoch:[0/2](644600/4588595) loss:2.668 lr:0.0000100 epoch_Time:25073.0min: [2024-01-05 13:57:35,296][model8_pretrain.py][INFO] Epoch:[0/2](644600/4588595) loss:3.275 lr:0.0000100 epoch_Time:25073.0min: [2024-01-05 13:57:35,296][model8_pretrain.py][INFO] Epoch:[0/2](644600/4588595) loss:3.084 lr:0.0000100 epoch_Time:25073.0min: [2024-01-05 13:57:35,296][model8_pretrain.py][INFO] Epoch:[0/2](644600/4588595) loss:3.184 lr:0.0000100 epoch_Time:25073.0min: [2024-01-05 13:57:35,297][model8_pretrain.py][INFO] Epoch:[0/2](644600/4588595) loss:3.218 lr:0.0000100 epoch_Time:25073.0min: [2024-01-05 13:58:12,234][model8_pretrain.py][INFO] Epoch:[0/2](644700/4588595) loss:3.034 lr:0.0000100 epoch_Time:25072.0min: [2024-01-05 13:58:12,235][model8_pretrain.py][INFO] Epoch:[0/2](644700/4588595) loss:3.254 lr:0.0000100 epoch_Time:25072.0min: [2024-01-05 13:58:12,234][model8_pretrain.py][INFO] Epoch:[0/2](644700/4588595) loss:2.392 lr:0.0000100 epoch_Time:25072.0min: [2024-01-05 13:58:12,235][model8_pretrain.py][INFO] Epoch:[0/2](644700/4588595) loss:2.549 lr:0.0000100 epoch_Time:25072.0min: [2024-01-05 13:58:12,235][model8_pretrain.py][INFO] Epoch:[0/2](644700/4588595) loss:2.547 lr:0.0000100 epoch_Time:25072.0min: [2024-01-05 13:58:12,235][model8_pretrain.py][INFO] Epoch:[0/2](644700/4588595) loss:2.660 lr:0.0000100 epoch_Time:25072.0min: [2024-01-05 13:58:12,235][model8_pretrain.py][INFO] Epoch:[0/2](644700/4588595) loss:3.524 lr:0.0000100 epoch_Time:25072.0min: [2024-01-05 13:58:12,235][model8_pretrain.py][INFO] Epoch:[0/2](644700/4588595) loss:2.772 lr:0.0000100 epoch_Time:25072.0min: [2024-01-05 13:58:49,169][model8_pretrain.py][INFO] Epoch:[0/2](644800/4588595) loss:2.925 lr:0.0000100 epoch_Time:25070.0min: [2024-01-05 
13:58:49,169][model8_pretrain.py][INFO] Epoch:[0/2](644800/4588595) loss:2.910 lr:0.0000100 epoch_Time:25070.0min: [2024-01-05 13:58:49,169][model8_pretrain.py][INFO] Epoch:[0/2](644800/4588595) loss:2.796 lr:0.0000100 epoch_Time:25070.0min: [2024-01-05 13:58:49,169][model8_pretrain.py][INFO] Epoch:[0/2](644800/4588595) loss:2.332 lr:0.0000100 epoch_Time:25070.0min: [2024-01-05 13:58:49,169][model8_pretrain.py][INFO] Epoch:[0/2](644800/4588595) loss:2.908 lr:0.0000100 epoch_Time:25070.0min: [2024-01-05 13:58:49,169][model8_pretrain.py][INFO] Epoch:[0/2](644800/4588595) loss:2.416 lr:0.0000100 epoch_Time:25070.0min: [2024-01-05 13:58:49,169][model8_pretrain.py][INFO] Epoch:[0/2](644800/4588595) loss:2.970 lr:0.0000100 epoch_Time:25070.0min: [2024-01-05 13:58:49,169][model8_pretrain.py][INFO] Epoch:[0/2](644800/4588595) loss:3.309 lr:0.0000100 epoch_Time:25070.0min: [2024-01-05 13:59:26,109][model8_pretrain.py][INFO] Epoch:[0/2](644900/4588595) loss:3.064 lr:0.0000100 epoch_Time:25070.0min: [2024-01-05 13:59:26,109][model8_pretrain.py][INFO] Epoch:[0/2](644900/4588595) loss:3.148 lr:0.0000100 epoch_Time:25070.0min: [2024-01-05 13:59:26,109][model8_pretrain.py][INFO] Epoch:[0/2](644900/4588595) loss:2.521 lr:0.0000100 epoch_Time:25070.0min: [2024-01-05 13:59:26,109][model8_pretrain.py][INFO] Epoch:[0/2](644900/4588595) loss:2.547 lr:0.0000100 epoch_Time:25070.0min: [2024-01-05 13:59:26,109][model8_pretrain.py][INFO] Epoch:[0/2](644900/4588595) loss:3.167 lr:0.0000100 epoch_Time:25070.0min: [2024-01-05 13:59:26,109][model8_pretrain.py][INFO] Epoch:[0/2](644900/4588595) loss:2.856 lr:0.0000100 epoch_Time:25070.0min: [2024-01-05 13:59:26,109][model8_pretrain.py][INFO] Epoch:[0/2](644900/4588595) loss:3.079 lr:0.0000100 epoch_Time:25070.0min: [2024-01-05 13:59:26,109][model8_pretrain.py][INFO] Epoch:[0/2](644900/4588595) loss:2.506 lr:0.0000100 epoch_Time:25070.0min: [2024-01-05 14:00:03,043][model8_pretrain.py][INFO] Epoch:[0/2](645000/4588595) loss:3.347 lr:0.0000100 
epoch_Time:25069.0min: [2024-01-05 14:00:03,043][model8_pretrain.py][INFO] Epoch:[0/2](645000/4588595) loss:2.636 lr:0.0000100 epoch_Time:25069.0min: [2024-01-05 14:00:03,043][model8_pretrain.py][INFO] Epoch:[0/2](645000/4588595) loss:3.017 lr:0.0000100 epoch_Time:25069.0min: [2024-01-05 14:00:03,043][model8_pretrain.py][INFO] Epoch:[0/2](645000/4588595) loss:2.780 lr:0.0000100 epoch_Time:25069.0min: [2024-01-05 14:00:03,043][model8_pretrain.py][INFO] Epoch:[0/2](645000/4588595) loss:2.772 lr:0.0000100 epoch_Time:25069.0min: [2024-01-05 14:00:03,043][model8_pretrain.py][INFO] Epoch:[0/2](645000/4588595) loss:2.795 lr:0.0000100 epoch_Time:25069.0min: [2024-01-05 14:00:03,044][model8_pretrain.py][INFO] Epoch:[0/2](645000/4588595) loss:3.060 lr:0.0000100 epoch_Time:25069.0min: [2024-01-05 14:00:03,044][model8_pretrain.py][INFO] Epoch:[0/2](645000/4588595) loss:2.773 lr:0.0000100 epoch_Time:25069.0min: [2024-01-05 14:00:39,980][model8_pretrain.py][INFO] Epoch:[0/2](645100/4588595) loss:2.883 lr:0.0000100 epoch_Time:25069.0min: [2024-01-05 14:00:39,980][model8_pretrain.py][INFO] Epoch:[0/2](645100/4588595) loss:3.130 lr:0.0000100 epoch_Time:25069.0min: [2024-01-05 14:00:39,980][model8_pretrain.py][INFO] Epoch:[0/2](645100/4588595) loss:3.127 lr:0.0000100 epoch_Time:25069.0min: [2024-01-05 14:00:39,980][model8_pretrain.py][INFO] Epoch:[0/2](645100/4588595) loss:2.510 lr:0.0000100 epoch_Time:25069.0min: [2024-01-05 14:00:39,980][model8_pretrain.py][INFO] Epoch:[0/2](645100/4588595) loss:2.438 lr:0.0000100 epoch_Time:25069.0min: [2024-01-05 14:00:39,980][model8_pretrain.py][INFO] Epoch:[0/2](645100/4588595) loss:2.401 lr:0.0000100 epoch_Time:25069.0min: [2024-01-05 14:00:39,980][model8_pretrain.py][INFO] Epoch:[0/2](645100/4588595) loss:3.235 lr:0.0000100 epoch_Time:25069.0min: [2024-01-05 14:00:39,980][model8_pretrain.py][INFO] Epoch:[0/2](645100/4588595) loss:3.371 lr:0.0000100 epoch_Time:25069.0min: [2024-01-05 14:01:28,896][model8_pretrain.py][INFO] 
Epoch:[0/2](645200/4588595) loss:2.282 lr:0.0000100 epoch_Time:25069.0min: [2024-01-05 14:01:28,896][model8_pretrain.py][INFO] Epoch:[0/2](645200/4588595) loss:2.851 lr:0.0000100 epoch_Time:25069.0min: [2024-01-05 14:01:28,896][model8_pretrain.py][INFO] Epoch:[0/2](645200/4588595) loss:3.151 lr:0.0000100 epoch_Time:25069.0min: [2024-01-05 14:01:28,896][model8_pretrain.py][INFO] Epoch:[0/2](645200/4588595) loss:3.043 lr:0.0000100 epoch_Time:25069.0min: [2024-01-05 14:01:28,896][model8_pretrain.py][INFO] Epoch:[0/2](645200/4588595) loss:3.180 lr:0.0000100 epoch_Time:25069.0min: [2024-01-05 14:01:28,896][model8_pretrain.py][INFO] Epoch:[0/2](645200/4588595) loss:3.490 lr:0.0000100 epoch_Time:25069.0min: [2024-01-05 14:01:28,896][model8_pretrain.py][INFO] Epoch:[0/2](645200/4588595) loss:2.300 lr:0.0000100 epoch_Time:25069.0min: [2024-01-05 14:01:28,896][model8_pretrain.py][INFO] Epoch:[0/2](645200/4588595) loss:3.032 lr:0.0000100 epoch_Time:25069.0min: [2024-01-05 14:02:05,830][model8_pretrain.py][INFO] Epoch:[0/2](645300/4588595) loss:2.930 lr:0.0000100 epoch_Time:25068.0min: [2024-01-05 14:02:05,830][model8_pretrain.py][INFO] Epoch:[0/2](645300/4588595) loss:2.550 lr:0.0000100 epoch_Time:25068.0min: [2024-01-05 14:02:05,830][model8_pretrain.py][INFO] Epoch:[0/2](645300/4588595) loss:3.343 lr:0.0000100 epoch_Time:25068.0min: [2024-01-05 14:02:05,830][model8_pretrain.py][INFO] Epoch:[0/2](645300/4588595) loss:2.766 lr:0.0000100 epoch_Time:25068.0min: [2024-01-05 14:02:05,830][model8_pretrain.py][INFO] Epoch:[0/2](645300/4588595) loss:3.006 lr:0.0000100 epoch_Time:25068.0min: [2024-01-05 14:02:05,831][model8_pretrain.py][INFO] Epoch:[0/2](645300/4588595) loss:2.971 lr:0.0000100 epoch_Time:25068.0min: [2024-01-05 14:02:05,831][model8_pretrain.py][INFO] Epoch:[0/2](645300/4588595) loss:3.050 lr:0.0000100 epoch_Time:25068.0min: [2024-01-05 14:02:05,831][model8_pretrain.py][INFO] Epoch:[0/2](645300/4588595) loss:3.385 lr:0.0000100 epoch_Time:25068.0min: [2024-01-05 
14:02:42,780][model8_pretrain.py][INFO] Epoch:[0/2](645400/4588595) loss:3.341 lr:0.0000100 epoch_Time:25068.0min: [2024-01-05 14:02:42,780][model8_pretrain.py][INFO] Epoch:[0/2](645400/4588595) loss:2.672 lr:0.0000100 epoch_Time:25068.0min: [2024-01-05 14:02:42,780][model8_pretrain.py][INFO] Epoch:[0/2](645400/4588595) loss:3.040 lr:0.0000100 epoch_Time:25068.0min: [2024-01-05 14:02:42,780][model8_pretrain.py][INFO] Epoch:[0/2](645400/4588595) loss:2.888 lr:0.0000100 epoch_Time:25068.0min: [2024-01-05 14:02:42,780][model8_pretrain.py][INFO] Epoch:[0/2](645400/4588595) loss:2.557 lr:0.0000100 epoch_Time:25068.0min: [2024-01-05 14:02:42,780][model8_pretrain.py][INFO] Epoch:[0/2](645400/4588595) loss:2.823 lr:0.0000100 epoch_Time:25068.0min: [2024-01-05 14:02:42,780][model8_pretrain.py][INFO] Epoch:[0/2](645400/4588595) loss:2.691 lr:0.0000100 epoch_Time:25068.0min: [2024-01-05 14:02:42,780][model8_pretrain.py][INFO] Epoch:[0/2](645400/4588595) loss:2.819 lr:0.0000100 epoch_Time:25068.0min: [2024-01-05 14:03:19,722][model8_pretrain.py][INFO] Epoch:[0/2](645500/4588595) loss:2.661 lr:0.0000100 epoch_Time:25067.0min: [2024-01-05 14:03:19,722][model8_pretrain.py][INFO] Epoch:[0/2](645500/4588595) loss:2.900 lr:0.0000100 epoch_Time:25067.0min: [2024-01-05 14:03:19,722][model8_pretrain.py][INFO] Epoch:[0/2](645500/4588595) loss:2.899 lr:0.0000100 epoch_Time:25067.0min: [2024-01-05 14:03:19,722][model8_pretrain.py][INFO] Epoch:[0/2](645500/4588595) loss:2.834 lr:0.0000100 epoch_Time:25067.0min: [2024-01-05 14:03:19,722][model8_pretrain.py][INFO] Epoch:[0/2](645500/4588595) loss:2.993 lr:0.0000100 epoch_Time:25067.0min: [2024-01-05 14:03:19,722][model8_pretrain.py][INFO] Epoch:[0/2](645500/4588595) loss:2.379 lr:0.0000100 epoch_Time:25067.0min: [2024-01-05 14:03:19,722][model8_pretrain.py][INFO] Epoch:[0/2](645500/4588595) loss:2.863 lr:0.0000100 epoch_Time:25067.0min: [2024-01-05 14:03:19,723][model8_pretrain.py][INFO] Epoch:[0/2](645500/4588595) loss:2.559 lr:0.0000100 
epoch_Time:25067.0min: [2024-01-05 14:03:56,662][model8_pretrain.py][INFO] Epoch:[0/2](645600/4588595) loss:2.745 lr:0.0000100 epoch_Time:25066.0min: [2024-01-05 14:03:56,662][model8_pretrain.py][INFO] Epoch:[0/2](645600/4588595) loss:2.757 lr:0.0000100 epoch_Time:25066.0min: [2024-01-05 14:03:56,662][model8_pretrain.py][INFO] Epoch:[0/2](645600/4588595) loss:2.589 lr:0.0000100 epoch_Time:25066.0min: [2024-01-05 14:03:56,662][model8_pretrain.py][INFO] Epoch:[0/2](645600/4588595) loss:2.322 lr:0.0000100 epoch_Time:25066.0min: [2024-01-05 14:03:56,662][model8_pretrain.py][INFO] Epoch:[0/2](645600/4588595) loss:3.041 lr:0.0000100 epoch_Time:25066.0min: [2024-01-05 14:03:56,662][model8_pretrain.py][INFO] Epoch:[0/2](645600/4588595) loss:2.471 lr:0.0000100 epoch_Time:25066.0min: [2024-01-05 14:03:56,662][model8_pretrain.py][INFO] Epoch:[0/2](645600/4588595) loss:3.112 lr:0.0000100 epoch_Time:25066.0min: [2024-01-05 14:03:56,662][model8_pretrain.py][INFO] Epoch:[0/2](645600/4588595) loss:2.618 lr:0.0000100 epoch_Time:25066.0min: [2024-01-05 14:04:33,618][model8_pretrain.py][INFO] Epoch:[0/2](645700/4588595) loss:2.889 lr:0.0000100 epoch_Time:25066.0min: [2024-01-05 14:04:33,618][model8_pretrain.py][INFO] Epoch:[0/2](645700/4588595) loss:3.391 lr:0.0000100 epoch_Time:25066.0min: [2024-01-05 14:04:33,618][model8_pretrain.py][INFO] Epoch:[0/2](645700/4588595) loss:3.205 lr:0.0000100 epoch_Time:25066.0min: [2024-01-05 14:04:33,618][model8_pretrain.py][INFO] Epoch:[0/2](645700/4588595) loss:2.991 lr:0.0000100 epoch_Time:25066.0min: [2024-01-05 14:04:33,618][model8_pretrain.py][INFO] Epoch:[0/2](645700/4588595) loss:2.971 lr:0.0000100 epoch_Time:25066.0min: [2024-01-05 14:04:33,618][model8_pretrain.py][INFO] Epoch:[0/2](645700/4588595) loss:2.760 lr:0.0000100 epoch_Time:25066.0min: [2024-01-05 14:04:33,618][model8_pretrain.py][INFO] Epoch:[0/2](645700/4588595) loss:2.488 lr:0.0000100 epoch_Time:25066.0min: [2024-01-05 14:04:33,618][model8_pretrain.py][INFO] 
Epoch:[0/2](645700/4588595) loss:2.234 lr:0.0000100 epoch_Time:25066.0min: [2024-01-05 14:05:10,571][model8_pretrain.py][INFO] Epoch:[0/2](645800/4588595) loss:2.796 lr:0.0000100 epoch_Time:25064.0min: [2024-01-05 14:05:10,571][model8_pretrain.py][INFO] Epoch:[0/2](645800/4588595) loss:2.896 lr:0.0000100 epoch_Time:25064.0min: [2024-01-05 14:05:10,571][model8_pretrain.py][INFO] Epoch:[0/2](645800/4588595) loss:2.065 lr:0.0000100 epoch_Time:25064.0min: [2024-01-05 14:05:10,571][model8_pretrain.py][INFO] Epoch:[0/2](645800/4588595) loss:2.326 lr:0.0000100 epoch_Time:25064.0min: [2024-01-05 14:05:10,571][model8_pretrain.py][INFO] Epoch:[0/2](645800/4588595) loss:3.218 lr:0.0000100 epoch_Time:25064.0min: [2024-01-05 14:05:10,571][model8_pretrain.py][INFO] Epoch:[0/2](645800/4588595) loss:3.305 lr:0.0000100 epoch_Time:25064.0min: [2024-01-05 14:05:10,571][model8_pretrain.py][INFO] Epoch:[0/2](645800/4588595) loss:2.568 lr:0.0000100 epoch_Time:25064.0min: [2024-01-05 14:05:10,572][model8_pretrain.py][INFO] Epoch:[0/2](645800/4588595) loss:2.627 lr:0.0000100 epoch_Time:25064.0min: [2024-01-05 14:05:47,505][model8_pretrain.py][INFO] Epoch:[0/2](645900/4588595) loss:2.663 lr:0.0000100 epoch_Time:25064.0min: [2024-01-05 14:05:47,505][model8_pretrain.py][INFO] Epoch:[0/2](645900/4588595) loss:2.159 lr:0.0000100 epoch_Time:25064.0min: [2024-01-05 14:05:47,505][model8_pretrain.py][INFO] Epoch:[0/2](645900/4588595) loss:3.179 lr:0.0000100 epoch_Time:25064.0min: [2024-01-05 14:05:47,505][model8_pretrain.py][INFO] Epoch:[0/2](645900/4588595) loss:2.803 lr:0.0000100 epoch_Time:25064.0min: [2024-01-05 14:05:47,505][model8_pretrain.py][INFO] Epoch:[0/2](645900/4588595) loss:2.516 lr:0.0000100 epoch_Time:25064.0min: [2024-01-05 14:05:47,505][model8_pretrain.py][INFO] Epoch:[0/2](645900/4588595) loss:2.703 lr:0.0000100 epoch_Time:25064.0min: [2024-01-05 14:05:47,505][model8_pretrain.py][INFO] Epoch:[0/2](645900/4588595) loss:2.710 lr:0.0000100 epoch_Time:25064.0min: [2024-01-05 
14:05:47,505][model8_pretrain.py][INFO] Epoch:[0/2](645900/4588595) loss:3.489 lr:0.0000100 epoch_Time:25064.0min: [2024-01-05 14:06:36,412][model8_pretrain.py][INFO] Epoch:[0/2](646000/4588595) loss:2.849 lr:0.0000100 epoch_Time:25065.0min: [2024-01-05 14:06:36,412][model8_pretrain.py][INFO] Epoch:[0/2](646000/4588595) loss:2.437 lr:0.0000100 epoch_Time:25065.0min: [2024-01-05 14:06:36,412][model8_pretrain.py][INFO] Epoch:[0/2](646000/4588595) loss:2.906 lr:0.0000100 epoch_Time:25065.0min: [2024-01-05 14:06:36,412][model8_pretrain.py][INFO] Epoch:[0/2](646000/4588595) loss:3.271 lr:0.0000100 epoch_Time:25065.0min: [2024-01-05 14:06:36,412][model8_pretrain.py][INFO] Epoch:[0/2](646000/4588595) loss:2.544 lr:0.0000100 epoch_Time:25065.0min: [2024-01-05 14:06:36,412][model8_pretrain.py][INFO] Epoch:[0/2](646000/4588595) loss:2.311 lr:0.0000100 epoch_Time:25065.0min: [2024-01-05 14:06:36,412][model8_pretrain.py][INFO] Epoch:[0/2](646000/4588595) loss:2.756 lr:0.0000100 epoch_Time:25065.0min: [2024-01-05 14:06:36,412][model8_pretrain.py][INFO] Epoch:[0/2](646000/4588595) loss:2.991 lr:0.0000100 epoch_Time:25065.0min: [2024-01-05 14:07:13,342][model8_pretrain.py][INFO] Epoch:[0/2](646100/4588595) loss:2.629 lr:0.0000100 epoch_Time:25063.0min: [2024-01-05 14:07:13,342][model8_pretrain.py][INFO] Epoch:[0/2](646100/4588595) loss:2.814 lr:0.0000100 epoch_Time:25063.0min: [2024-01-05 14:07:13,342][model8_pretrain.py][INFO] Epoch:[0/2](646100/4588595) loss:2.739 lr:0.0000100 epoch_Time:25063.0min: [2024-01-05 14:07:13,342][model8_pretrain.py][INFO] Epoch:[0/2](646100/4588595) loss:2.904 lr:0.0000100 epoch_Time:25063.0min: [2024-01-05 14:07:13,342][model8_pretrain.py][INFO] Epoch:[0/2](646100/4588595) loss:2.565 lr:0.0000100 epoch_Time:25063.0min: [2024-01-05 14:07:13,342][model8_pretrain.py][INFO] Epoch:[0/2](646100/4588595) loss:2.912 lr:0.0000100 epoch_Time:25063.0min: [2024-01-05 14:07:13,342][model8_pretrain.py][INFO] Epoch:[0/2](646100/4588595) loss:3.033 lr:0.0000100 
epoch_Time:25063.0min: [2024-01-05 14:07:13,342][model8_pretrain.py][INFO] Epoch:[0/2](646100/4588595) loss:2.909 lr:0.0000100 epoch_Time:25063.0min: [2024-01-05 14:07:50,277][model8_pretrain.py][INFO] Epoch:[0/2](646200/4588595) loss:2.686 lr:0.0000100 epoch_Time:25062.0min: [2024-01-05 14:07:50,277][model8_pretrain.py][INFO] Epoch:[0/2](646200/4588595) loss:2.101 lr:0.0000100 epoch_Time:25062.0min: [2024-01-05 14:07:50,277][model8_pretrain.py][INFO] Epoch:[0/2](646200/4588595) loss:3.057 lr:0.0000100 epoch_Time:25062.0min: [2024-01-05 14:07:50,277][model8_pretrain.py][INFO] Epoch:[0/2](646200/4588595) loss:3.204 lr:0.0000100 epoch_Time:25062.0min: [2024-01-05 14:07:50,277][model8_pretrain.py][INFO] Epoch:[0/2](646200/4588595) loss:3.007 lr:0.0000100 epoch_Time:25062.0min: [2024-01-05 14:07:50,277][model8_pretrain.py][INFO] Epoch:[0/2](646200/4588595) loss:3.181 lr:0.0000100 epoch_Time:25062.0min: [2024-01-05 14:07:50,277][model8_pretrain.py][INFO] Epoch:[0/2](646200/4588595) loss:2.432 lr:0.0000100 epoch_Time:25062.0min: [2024-01-05 14:07:50,277][model8_pretrain.py][INFO] Epoch:[0/2](646200/4588595) loss:2.782 lr:0.0000100 epoch_Time:25062.0min: [2024-01-05 14:08:27,213][model8_pretrain.py][INFO] Epoch:[0/2](646300/4588595) loss:2.266 lr:0.0000100 epoch_Time:25062.0min: [2024-01-05 14:08:27,213][model8_pretrain.py][INFO] Epoch:[0/2](646300/4588595) loss:2.829 lr:0.0000100 epoch_Time:25062.0min: [2024-01-05 14:08:27,213][model8_pretrain.py][INFO] Epoch:[0/2](646300/4588595) loss:2.568 lr:0.0000100 epoch_Time:25062.0min: [2024-01-05 14:08:27,213][model8_pretrain.py][INFO] Epoch:[0/2](646300/4588595) loss:2.986 lr:0.0000100 epoch_Time:25062.0min: [2024-01-05 14:08:27,213][model8_pretrain.py][INFO] Epoch:[0/2](646300/4588595) loss:2.730 lr:0.0000100 epoch_Time:25062.0min: [2024-01-05 14:08:27,213][model8_pretrain.py][INFO] Epoch:[0/2](646300/4588595) loss:2.397 lr:0.0000100 epoch_Time:25062.0min: [2024-01-05 14:08:27,213][model8_pretrain.py][INFO] 
Epoch:[0/2](646300/4588595) loss:3.173 lr:0.0000100 epoch_Time:25062.0min: [2024-01-05 14:08:27,214][model8_pretrain.py][INFO] Epoch:[0/2](646300/4588595) loss:2.839 lr:0.0000100 epoch_Time:25062.0min: [2024-01-05 14:09:04,146][model8_pretrain.py][INFO] Epoch:[0/2](646400/4588595) loss:2.556 lr:0.0000100 epoch_Time:25061.0min: [2024-01-05 14:09:04,146][model8_pretrain.py][INFO] Epoch:[0/2](646400/4588595) loss:2.981 lr:0.0000100 epoch_Time:25061.0min: [2024-01-05 14:09:04,146][model8_pretrain.py][INFO] Epoch:[0/2](646400/4588595) loss:3.143 lr:0.0000100 epoch_Time:25061.0min: [2024-01-05 14:09:04,146][model8_pretrain.py][INFO] Epoch:[0/2](646400/4588595) loss:3.411 lr:0.0000100 epoch_Time:25061.0min: [2024-01-05 14:09:04,146][model8_pretrain.py][INFO] Epoch:[0/2](646400/4588595) loss:2.454 lr:0.0000100 epoch_Time:25061.0min: [2024-01-05 14:09:04,146][model8_pretrain.py][INFO] Epoch:[0/2](646400/4588595) loss:2.696 lr:0.0000100 epoch_Time:25061.0min: [2024-01-05 14:09:04,146][model8_pretrain.py][INFO] Epoch:[0/2](646400/4588595) loss:2.625 lr:0.0000100 epoch_Time:25061.0min: [2024-01-05 14:09:04,146][model8_pretrain.py][INFO] Epoch:[0/2](646400/4588595) loss:2.933 lr:0.0000100 epoch_Time:25061.0min: [2024-01-05 14:09:41,080][model8_pretrain.py][INFO] Epoch:[0/2](646500/4588595) loss:3.211 lr:0.0000100 epoch_Time:25061.0min: [2024-01-05 14:09:41,080][model8_pretrain.py][INFO] Epoch:[0/2](646500/4588595) loss:3.190 lr:0.0000100 epoch_Time:25061.0min: [2024-01-05 14:09:41,080][model8_pretrain.py][INFO] Epoch:[0/2](646500/4588595) loss:1.951 lr:0.0000100 epoch_Time:25061.0min: [2024-01-05 14:09:41,080][model8_pretrain.py][INFO] Epoch:[0/2](646500/4588595) loss:3.043 lr:0.0000100 epoch_Time:25061.0min: [2024-01-05 14:09:41,081][model8_pretrain.py][INFO] Epoch:[0/2](646500/4588595) loss:3.260 lr:0.0000100 epoch_Time:25061.0min: [2024-01-05 14:09:41,081][model8_pretrain.py][INFO] Epoch:[0/2](646500/4588595) loss:2.542 lr:0.0000100 epoch_Time:25061.0min: [2024-01-05 
14:09:41,081][model8_pretrain.py][INFO] Epoch:[0/2](646500/4588595) loss:3.218 lr:0.0000100 epoch_Time:25061.0min: [2024-01-05 14:09:41,081][model8_pretrain.py][INFO] Epoch:[0/2](646500/4588595) loss:2.645 lr:0.0000100 epoch_Time:25061.0min: [2024-01-05 14:10:18,015][model8_pretrain.py][INFO] Epoch:[0/2](646600/4588595) loss:2.976 lr:0.0000100 epoch_Time:25060.0min: [2024-01-05 14:10:18,015][model8_pretrain.py][INFO] Epoch:[0/2](646600/4588595) loss:3.192 lr:0.0000100 epoch_Time:25060.0min: [2024-01-05 14:10:18,015][model8_pretrain.py][INFO] Epoch:[0/2](646600/4588595) loss:2.857 lr:0.0000100 epoch_Time:25060.0min: [2024-01-05 14:10:18,015][model8_pretrain.py][INFO] Epoch:[0/2](646600/4588595) loss:2.993 lr:0.0000100 epoch_Time:25060.0min: [2024-01-05 14:10:18,015][model8_pretrain.py][INFO] Epoch:[0/2](646600/4588595) loss:2.813 lr:0.0000100 epoch_Time:25060.0min: [2024-01-05 14:10:18,015][model8_pretrain.py][INFO] Epoch:[0/2](646600/4588595) loss:3.197 lr:0.0000100 epoch_Time:25060.0min: [2024-01-05 14:10:18,015][model8_pretrain.py][INFO] Epoch:[0/2](646600/4588595) loss:3.203 lr:0.0000100 epoch_Time:25060.0min: [2024-01-05 14:10:18,015][model8_pretrain.py][INFO] Epoch:[0/2](646600/4588595) loss:2.418 lr:0.0000100 epoch_Time:25060.0min: [2024-01-05 14:10:54,956][model8_pretrain.py][INFO] Epoch:[0/2](646700/4588595) loss:2.979 lr:0.0000100 epoch_Time:25059.0min: [2024-01-05 14:10:54,956][model8_pretrain.py][INFO] Epoch:[0/2](646700/4588595) loss:2.952 lr:0.0000100 epoch_Time:25059.0min: [2024-01-05 14:10:54,956][model8_pretrain.py][INFO] Epoch:[0/2](646700/4588595) loss:3.089 lr:0.0000100 epoch_Time:25059.0min: [2024-01-05 14:10:54,956][model8_pretrain.py][INFO] Epoch:[0/2](646700/4588595) loss:3.078 lr:0.0000100 epoch_Time:25059.0min: [2024-01-05 14:10:54,956][model8_pretrain.py][INFO] Epoch:[0/2](646700/4588595) loss:3.355 lr:0.0000100 epoch_Time:25059.0min: [2024-01-05 14:10:54,956][model8_pretrain.py][INFO] Epoch:[0/2](646700/4588595) loss:3.001 lr:0.0000100 
epoch_Time:25059.0min: [2024-01-05 14:10:54,956][model8_pretrain.py][INFO] Epoch:[0/2](646700/4588595) loss:2.501 lr:0.0000100 epoch_Time:25059.0min: [2024-01-05 14:10:54,956][model8_pretrain.py][INFO] Epoch:[0/2](646700/4588595) loss:2.709 lr:0.0000100 epoch_Time:25059.0min: [2024-01-05 14:11:43,788][model8_pretrain.py][INFO] Epoch:[0/2](646800/4588595) loss:3.048 lr:0.0000100 epoch_Time:25060.0min: [2024-01-05 14:11:43,788][model8_pretrain.py][INFO] Epoch:[0/2](646800/4588595) loss:3.113 lr:0.0000100 epoch_Time:25060.0min: [2024-01-05 14:11:43,788][model8_pretrain.py][INFO] Epoch:[0/2](646800/4588595) loss:3.453 lr:0.0000100 epoch_Time:25060.0min: [2024-01-05 14:11:43,788][model8_pretrain.py][INFO] Epoch:[0/2](646800/4588595) loss:2.852 lr:0.0000100 epoch_Time:25060.0min: [2024-01-05 14:11:43,788][model8_pretrain.py][INFO] Epoch:[0/2](646800/4588595) loss:2.913 lr:0.0000100 epoch_Time:25060.0min: [2024-01-05 14:11:43,788][model8_pretrain.py][INFO] Epoch:[0/2](646800/4588595) loss:3.409 lr:0.0000100 epoch_Time:25060.0min: [2024-01-05 14:11:43,788][model8_pretrain.py][INFO] Epoch:[0/2](646800/4588595) loss:2.700 lr:0.0000100 epoch_Time:25060.0min: [2024-01-05 14:11:43,789][model8_pretrain.py][INFO] Epoch:[0/2](646800/4588595) loss:2.826 lr:0.0000100 epoch_Time:25060.0min: [2024-01-05 14:12:20,724][model8_pretrain.py][INFO] Epoch:[0/2](646900/4588595) loss:2.772 lr:0.0000100 epoch_Time:25059.0min: [2024-01-05 14:12:20,724][model8_pretrain.py][INFO] Epoch:[0/2](646900/4588595) loss:2.634 lr:0.0000100 epoch_Time:25059.0min: [2024-01-05 14:12:20,724][model8_pretrain.py][INFO] Epoch:[0/2](646900/4588595) loss:2.785 lr:0.0000100 epoch_Time:25059.0min: [2024-01-05 14:12:20,724][model8_pretrain.py][INFO] Epoch:[0/2](646900/4588595) loss:3.316 lr:0.0000100 epoch_Time:25059.0min: [2024-01-05 14:12:20,724][model8_pretrain.py][INFO] Epoch:[0/2](646900/4588595) loss:3.129 lr:0.0000100 epoch_Time:25059.0min: [2024-01-05 14:12:20,724][model8_pretrain.py][INFO] 
Epoch:[0/2](646900/4588595) loss:2.686 lr:0.0000100 epoch_Time:25059.0min: [2024-01-05 14:12:20,724][model8_pretrain.py][INFO] Epoch:[0/2](646900/4588595) loss:3.181 lr:0.0000100 epoch_Time:25059.0min: [2024-01-05 14:12:20,726][model8_pretrain.py][INFO] Epoch:[0/2](646900/4588595) loss:2.796 lr:0.0000100 epoch_Time:25059.0min: [2024-01-05 14:12:57,673][model8_pretrain.py][INFO] Epoch:[0/2](647000/4588595) loss:2.801 lr:0.0000100 epoch_Time:25057.0min: [2024-01-05 14:12:57,673][model8_pretrain.py][INFO] Epoch:[0/2](647000/4588595) loss:2.126 lr:0.0000100 epoch_Time:25057.0min: [2024-01-05 14:12:57,673][model8_pretrain.py][INFO] Epoch:[0/2](647000/4588595) loss:3.375 lr:0.0000100 epoch_Time:25057.0min: [2024-01-05 14:12:57,673][model8_pretrain.py][INFO] Epoch:[0/2](647000/4588595) loss:3.071 lr:0.0000100 epoch_Time:25057.0min: [2024-01-05 14:12:57,673][model8_pretrain.py][INFO] Epoch:[0/2](647000/4588595) loss:2.738 lr:0.0000100 epoch_Time:25057.0min: [2024-01-05 14:12:57,673][model8_pretrain.py][INFO] Epoch:[0/2](647000/4588595) loss:2.894 lr:0.0000100 epoch_Time:25057.0min: [2024-01-05 14:12:57,673][model8_pretrain.py][INFO] Epoch:[0/2](647000/4588595) loss:3.004 lr:0.0000100 epoch_Time:25057.0min: [2024-01-05 14:12:57,673][model8_pretrain.py][INFO] Epoch:[0/2](647000/4588595) loss:3.187 lr:0.0000100 epoch_Time:25057.0min: [2024-01-05 14:13:34,607][model8_pretrain.py][INFO] Epoch:[0/2](647100/4588595) loss:3.194 lr:0.0000100 epoch_Time:25057.0min: [2024-01-05 14:13:34,607][model8_pretrain.py][INFO] Epoch:[0/2](647100/4588595) loss:2.639 lr:0.0000100 epoch_Time:25057.0min: [2024-01-05 14:13:34,608][model8_pretrain.py][INFO] Epoch:[0/2](647100/4588595) loss:3.149 lr:0.0000100 epoch_Time:25057.0min: [2024-01-05 14:13:34,608][model8_pretrain.py][INFO] Epoch:[0/2](647100/4588595) loss:2.840 lr:0.0000100 epoch_Time:25057.0min: [2024-01-05 14:13:34,608][model8_pretrain.py][INFO] Epoch:[0/2](647100/4588595) loss:3.571 lr:0.0000100 epoch_Time:25057.0min: [2024-01-05 
14:13:34,608][model8_pretrain.py][INFO] Epoch:[0/2](647100/4588595) loss:2.640 lr:0.0000100 epoch_Time:25057.0min: [2024-01-05 14:13:34,608][model8_pretrain.py][INFO] Epoch:[0/2](647100/4588595) loss:2.889 lr:0.0000100 epoch_Time:25057.0min: [2024-01-05 14:13:34,608][model8_pretrain.py][INFO] Epoch:[0/2](647100/4588595) loss:2.493 lr:0.0000100 epoch_Time:25057.0min: [2024-01-05 14:14:11,550][model8_pretrain.py][INFO] Epoch:[0/2](647200/4588595) loss:2.860 lr:0.0000100 epoch_Time:25056.0min: [2024-01-05 14:14:11,550][model8_pretrain.py][INFO] Epoch:[0/2](647200/4588595) loss:2.991 lr:0.0000100 epoch_Time:25056.0min: [2024-01-05 14:14:11,550][model8_pretrain.py][INFO] Epoch:[0/2](647200/4588595) loss:2.907 lr:0.0000100 epoch_Time:25056.0min: [2024-01-05 14:14:11,551][model8_pretrain.py][INFO] Epoch:[0/2](647200/4588595) loss:2.762 lr:0.0000100 epoch_Time:25056.0min: [2024-01-05 14:14:11,551][model8_pretrain.py][INFO] Epoch:[0/2](647200/4588595) loss:2.756 lr:0.0000100 epoch_Time:25056.0min: [2024-01-05 14:14:11,551][model8_pretrain.py][INFO] Epoch:[0/2](647200/4588595) loss:3.182 lr:0.0000100 epoch_Time:25056.0min: [2024-01-05 14:14:11,551][model8_pretrain.py][INFO] Epoch:[0/2](647200/4588595) loss:3.392 lr:0.0000100 epoch_Time:25056.0min: [2024-01-05 14:14:11,551][model8_pretrain.py][INFO] Epoch:[0/2](647200/4588595) loss:2.733 lr:0.0000100 epoch_Time:25056.0min: [2024-01-05 14:14:48,482][model8_pretrain.py][INFO] Epoch:[0/2](647300/4588595) loss:2.262 lr:0.0000100 epoch_Time:25055.0min: [2024-01-05 14:14:48,482][model8_pretrain.py][INFO] Epoch:[0/2](647300/4588595) loss:3.035 lr:0.0000100 epoch_Time:25055.0min: [2024-01-05 14:14:48,482][model8_pretrain.py][INFO] Epoch:[0/2](647300/4588595) loss:2.907 lr:0.0000100 epoch_Time:25055.0min: [2024-01-05 14:14:48,482][model8_pretrain.py][INFO] Epoch:[0/2](647300/4588595) loss:2.446 lr:0.0000100 epoch_Time:25055.0min: [2024-01-05 14:14:48,482][model8_pretrain.py][INFO] Epoch:[0/2](647300/4588595) loss:2.708 lr:0.0000100 
epoch_Time:25055.0min: [2024-01-05 14:14:48,482][model8_pretrain.py][INFO] Epoch:[0/2](647300/4588595) loss:2.769 lr:0.0000100 epoch_Time:25055.0min: [2024-01-05 14:14:48,482][model8_pretrain.py][INFO] Epoch:[0/2](647300/4588595) loss:2.780 lr:0.0000100 epoch_Time:25055.0min: [2024-01-05 14:14:48,482][model8_pretrain.py][INFO] Epoch:[0/2](647300/4588595) loss:2.879 lr:0.0000100 epoch_Time:25055.0min: [2024-01-05 14:15:25,415][model8_pretrain.py][INFO] Epoch:[0/2](647400/4588595) loss:2.780 lr:0.0000100 epoch_Time:25055.0min: [2024-01-05 14:15:25,415][model8_pretrain.py][INFO] Epoch:[0/2](647400/4588595) loss:3.041 lr:0.0000100 epoch_Time:25055.0min: [2024-01-05 14:15:25,415][model8_pretrain.py][INFO] Epoch:[0/2](647400/4588595) loss:2.395 lr:0.0000100 epoch_Time:25055.0min: [2024-01-05 14:15:25,415][model8_pretrain.py][INFO] Epoch:[0/2](647400/4588595) loss:2.526 lr:0.0000100 epoch_Time:25055.0min: [2024-01-05 14:15:25,415][model8_pretrain.py][INFO] Epoch:[0/2](647400/4588595) loss:2.773 lr:0.0000100 epoch_Time:25055.0min: [2024-01-05 14:15:25,415][model8_pretrain.py][INFO] Epoch:[0/2](647400/4588595) loss:2.462 lr:0.0000100 epoch_Time:25055.0min: [2024-01-05 14:15:25,416][model8_pretrain.py][INFO] Epoch:[0/2](647400/4588595) loss:2.947 lr:0.0000100 epoch_Time:25055.0min: [2024-01-05 14:15:25,416][model8_pretrain.py][INFO] Epoch:[0/2](647400/4588595) loss:2.922 lr:0.0000100 epoch_Time:25055.0min: [2024-01-05 14:16:02,346][model8_pretrain.py][INFO] Epoch:[0/2](647500/4588595) loss:2.677 lr:0.0000100 epoch_Time:25054.0min: [2024-01-05 14:16:02,346][model8_pretrain.py][INFO] Epoch:[0/2](647500/4588595) loss:2.286 lr:0.0000100 epoch_Time:25054.0min: [2024-01-05 14:16:02,346][model8_pretrain.py][INFO] Epoch:[0/2](647500/4588595) loss:2.929 lr:0.0000100 epoch_Time:25054.0min: [2024-01-05 14:16:02,346][model8_pretrain.py][INFO] Epoch:[0/2](647500/4588595) loss:2.670 lr:0.0000100 epoch_Time:25054.0min: [2024-01-05 14:16:02,346][model8_pretrain.py][INFO] 
Epoch:[0/2](647500/4588595) loss:2.815 lr:0.0000100 epoch_Time:25054.0min: [2024-01-05 14:16:02,346][model8_pretrain.py][INFO] Epoch:[0/2](647500/4588595) loss:3.018 lr:0.0000100 epoch_Time:25054.0min: [2024-01-05 14:16:02,346][model8_pretrain.py][INFO] Epoch:[0/2](647500/4588595) loss:2.920 lr:0.0000100 epoch_Time:25054.0min: [2024-01-05 14:16:02,346][model8_pretrain.py][INFO] Epoch:[0/2](647500/4588595) loss:3.002 lr:0.0000100 epoch_Time:25054.0min: [2024-01-05 14:16:51,007][model8_pretrain.py][INFO] Epoch:[0/2](647600/4588595) loss:2.864 lr:0.0000100 epoch_Time:25054.0min: [2024-01-05 14:16:51,007][model8_pretrain.py][INFO] Epoch:[0/2](647600/4588595) loss:2.846 lr:0.0000100 epoch_Time:25054.0min: [2024-01-05 14:16:51,007][model8_pretrain.py][INFO] Epoch:[0/2](647600/4588595) loss:3.342 lr:0.0000100 epoch_Time:25054.0min: [2024-01-05 14:16:51,007][model8_pretrain.py][INFO] Epoch:[0/2](647600/4588595) loss:2.848 lr:0.0000100 epoch_Time:25054.0min: [2024-01-05 14:16:51,007][model8_pretrain.py][INFO] Epoch:[0/2](647600/4588595) loss:2.690 lr:0.0000100 epoch_Time:25054.0min: [2024-01-05 14:16:51,007][model8_pretrain.py][INFO] Epoch:[0/2](647600/4588595) loss:2.324 lr:0.0000100 epoch_Time:25054.0min: [2024-01-05 14:16:51,007][model8_pretrain.py][INFO] Epoch:[0/2](647600/4588595) loss:2.959 lr:0.0000100 epoch_Time:25054.0min: [2024-01-05 14:16:51,007][model8_pretrain.py][INFO] Epoch:[0/2](647600/4588595) loss:2.830 lr:0.0000100 epoch_Time:25054.0min: [2024-01-05 14:17:27,930][model8_pretrain.py][INFO] Epoch:[0/2](647700/4588595) loss:2.824 lr:0.0000100 epoch_Time:25054.0min: [2024-01-05 14:17:27,930][model8_pretrain.py][INFO] Epoch:[0/2](647700/4588595) loss:2.776 lr:0.0000100 epoch_Time:25054.0min: [2024-01-05 14:17:27,930][model8_pretrain.py][INFO] Epoch:[0/2](647700/4588595) loss:3.185 lr:0.0000100 epoch_Time:25054.0min: [2024-01-05 14:17:27,930][model8_pretrain.py][INFO] Epoch:[0/2](647700/4588595) loss:2.982 lr:0.0000100 epoch_Time:25054.0min: [2024-01-05 
14:17:27,930][model8_pretrain.py][INFO] Epoch:[0/2](647700/4588595) loss:2.215 lr:0.0000100 epoch_Time:25054.0min: [2024-01-05 14:17:27,930][model8_pretrain.py][INFO] Epoch:[0/2](647700/4588595) loss:2.741 lr:0.0000100 epoch_Time:25054.0min: [2024-01-05 14:17:27,930][model8_pretrain.py][INFO] Epoch:[0/2](647700/4588595) loss:2.854 lr:0.0000100 epoch_Time:25054.0min: [2024-01-05 14:17:27,930][model8_pretrain.py][INFO] Epoch:[0/2](647700/4588595) loss:3.086 lr:0.0000100 epoch_Time:25054.0min: [2024-01-05 14:18:04,878][model8_pretrain.py][INFO] Epoch:[0/2](647800/4588595) loss:2.579 lr:0.0000100 epoch_Time:25053.0min: [2024-01-05 14:18:04,878][model8_pretrain.py][INFO] Epoch:[0/2](647800/4588595) loss:3.007 lr:0.0000100 epoch_Time:25053.0min: [2024-01-05 14:18:04,878][model8_pretrain.py][INFO] Epoch:[0/2](647800/4588595) loss:2.544 lr:0.0000100 epoch_Time:25053.0min: [2024-01-05 14:18:04,878][model8_pretrain.py][INFO] Epoch:[0/2](647800/4588595) loss:3.189 lr:0.0000100 epoch_Time:25053.0min: [2024-01-05 14:18:04,878][model8_pretrain.py][INFO] Epoch:[0/2](647800/4588595) loss:3.142 lr:0.0000100 epoch_Time:25053.0min: [2024-01-05 14:18:04,878][model8_pretrain.py][INFO] Epoch:[0/2](647800/4588595) loss:2.832 lr:0.0000100 epoch_Time:25053.0min: [2024-01-05 14:18:04,878][model8_pretrain.py][INFO] Epoch:[0/2](647800/4588595) loss:2.354 lr:0.0000100 epoch_Time:25053.0min: [2024-01-05 14:18:04,879][model8_pretrain.py][INFO] Epoch:[0/2](647800/4588595) loss:2.745 lr:0.0000100 epoch_Time:25053.0min: [2024-01-05 14:18:41,827][model8_pretrain.py][INFO] Epoch:[0/2](647900/4588595) loss:2.510 lr:0.0000100 epoch_Time:25053.0min: [2024-01-05 14:18:41,827][model8_pretrain.py][INFO] Epoch:[0/2](647900/4588595) loss:2.822 lr:0.0000100 epoch_Time:25053.0min: [2024-01-05 14:18:41,827][model8_pretrain.py][INFO] Epoch:[0/2](647900/4588595) loss:2.893 lr:0.0000100 epoch_Time:25053.0min: [2024-01-05 14:18:41,827][model8_pretrain.py][INFO] Epoch:[0/2](647900/4588595) loss:2.603 lr:0.0000100 
epoch_Time:25053.0min: [2024-01-05 14:18:41,827][model8_pretrain.py][INFO] Epoch:[0/2](647900/4588595) loss:2.951 lr:0.0000100 epoch_Time:25053.0min: [2024-01-05 14:18:41,827][model8_pretrain.py][INFO] Epoch:[0/2](647900/4588595) loss:2.556 lr:0.0000100 epoch_Time:25053.0min: [2024-01-05 14:18:41,827][model8_pretrain.py][INFO] Epoch:[0/2](647900/4588595) loss:2.923 lr:0.0000100 epoch_Time:25053.0min: [2024-01-05 14:18:41,827][model8_pretrain.py][INFO] Epoch:[0/2](647900/4588595) loss:3.049 lr:0.0000100 epoch_Time:25053.0min: [2024-01-05 14:19:18,772][model8_pretrain.py][INFO] Epoch:[0/2](648000/4588595) loss:2.798 lr:0.0000100 epoch_Time:25051.0min: [2024-01-05 14:19:18,772][model8_pretrain.py][INFO] Epoch:[0/2](648000/4588595) loss:2.745 lr:0.0000100 epoch_Time:25051.0min: [2024-01-05 14:19:18,772][model8_pretrain.py][INFO] Epoch:[0/2](648000/4588595) loss:2.983 lr:0.0000100 epoch_Time:25051.0min: [2024-01-05 14:19:18,772][model8_pretrain.py][INFO] Epoch:[0/2](648000/4588595) loss:2.826 lr:0.0000100 epoch_Time:25051.0min: [2024-01-05 14:19:18,772][model8_pretrain.py][INFO] Epoch:[0/2](648000/4588595) loss:2.337 lr:0.0000100 epoch_Time:25051.0min: [2024-01-05 14:19:18,773][model8_pretrain.py][INFO] Epoch:[0/2](648000/4588595) loss:3.297 lr:0.0000100 epoch_Time:25051.0min: [2024-01-05 14:19:18,773][model8_pretrain.py][INFO] Epoch:[0/2](648000/4588595) loss:2.813 lr:0.0000100 epoch_Time:25051.0min: [2024-01-05 14:19:18,773][model8_pretrain.py][INFO] Epoch:[0/2](648000/4588595) loss:2.596 lr:0.0000100 epoch_Time:25051.0min: [2024-01-05 14:19:55,700][model8_pretrain.py][INFO] Epoch:[0/2](648100/4588595) loss:2.651 lr:0.0000100 epoch_Time:25050.0min: [2024-01-05 14:19:55,700][model8_pretrain.py][INFO] Epoch:[0/2](648100/4588595) loss:2.801 lr:0.0000100 epoch_Time:25050.0min: [2024-01-05 14:19:55,700][model8_pretrain.py][INFO] Epoch:[0/2](648100/4588595) loss:3.406 lr:0.0000100 epoch_Time:25050.0min: [2024-01-05 14:19:55,700][model8_pretrain.py][INFO] 
Epoch:[0/2](648100/4588595) loss:3.138 lr:0.0000100 epoch_Time:25050.0min: [2024-01-05 14:19:55,700][model8_pretrain.py][INFO] Epoch:[0/2](648100/4588595) loss:2.624 lr:0.0000100 epoch_Time:25050.0min: [2024-01-05 14:19:55,700][model8_pretrain.py][INFO] Epoch:[0/2](648100/4588595) loss:2.536 lr:0.0000100 epoch_Time:25050.0min: [2024-01-05 14:19:55,700][model8_pretrain.py][INFO] Epoch:[0/2](648100/4588595) loss:2.806 lr:0.0000100 epoch_Time:25050.0min: [2024-01-05 14:19:55,700][model8_pretrain.py][INFO] Epoch:[0/2](648100/4588595) loss:3.175 lr:0.0000100 epoch_Time:25050.0min: [2024-01-05 14:20:32,633][model8_pretrain.py][INFO] Epoch:[0/2](648200/4588595) loss:2.705 lr:0.0000100 epoch_Time:25050.0min: [2024-01-05 14:20:32,633][model8_pretrain.py][INFO] Epoch:[0/2](648200/4588595) loss:2.907 lr:0.0000100 epoch_Time:25050.0min: [2024-01-05 14:20:32,633][model8_pretrain.py][INFO] Epoch:[0/2](648200/4588595) loss:2.938 lr:0.0000100 epoch_Time:25050.0min: [2024-01-05 14:20:32,633][model8_pretrain.py][INFO] Epoch:[0/2](648200/4588595) loss:3.094 lr:0.0000100 epoch_Time:25050.0min: [2024-01-05 14:20:32,633][model8_pretrain.py][INFO] Epoch:[0/2](648200/4588595) loss:3.191 lr:0.0000100 epoch_Time:25050.0min: [2024-01-05 14:20:32,633][model8_pretrain.py][INFO] Epoch:[0/2](648200/4588595) loss:2.846 lr:0.0000100 epoch_Time:25050.0min: [2024-01-05 14:20:32,633][model8_pretrain.py][INFO] Epoch:[0/2](648200/4588595) loss:3.008 lr:0.0000100 epoch_Time:25050.0min: [2024-01-05 14:20:32,633][model8_pretrain.py][INFO] Epoch:[0/2](648200/4588595) loss:2.346 lr:0.0000100 epoch_Time:25050.0min: [2024-01-05 14:21:09,562][model8_pretrain.py][INFO] Epoch:[0/2](648300/4588595) loss:2.578 lr:0.0000100 epoch_Time:25049.0min: [2024-01-05 14:21:09,562][model8_pretrain.py][INFO] Epoch:[0/2](648300/4588595) loss:2.963 lr:0.0000100 epoch_Time:25049.0min: [2024-01-05 14:21:09,562][model8_pretrain.py][INFO] Epoch:[0/2](648300/4588595) loss:2.577 lr:0.0000100 epoch_Time:25049.0min: [2024-01-05 
14:21:09,562][model8_pretrain.py][INFO] Epoch:[0/2](648300/4588595) loss:2.743 lr:0.0000100 epoch_Time:25049.0min: [2024-01-05 14:21:09,562][model8_pretrain.py][INFO] Epoch:[0/2](648300/4588595) loss:2.911 lr:0.0000100 epoch_Time:25049.0min: [2024-01-05 14:21:09,562][model8_pretrain.py][INFO] Epoch:[0/2](648300/4588595) loss:3.220 lr:0.0000100 epoch_Time:25049.0min: [2024-01-05 14:21:09,562][model8_pretrain.py][INFO] Epoch:[0/2](648300/4588595) loss:2.605 lr:0.0000100 epoch_Time:25049.0min: [2024-01-05 14:21:09,563][model8_pretrain.py][INFO] Epoch:[0/2](648300/4588595) loss:2.669 lr:0.0000100 epoch_Time:25049.0min: [2024-01-05 14:21:58,433][model8_pretrain.py][INFO] Epoch:[0/2](648400/4588595) loss:2.911 lr:0.0000100 epoch_Time:25049.0min: [2024-01-05 14:21:58,433][model8_pretrain.py][INFO] Epoch:[0/2](648400/4588595) loss:2.995 lr:0.0000100 epoch_Time:25049.0min: [2024-01-05 14:21:58,433][model8_pretrain.py][INFO] Epoch:[0/2](648400/4588595) loss:2.854 lr:0.0000100 epoch_Time:25049.0min: [2024-01-05 14:21:58,433][model8_pretrain.py][INFO] Epoch:[0/2](648400/4588595) loss:3.096 lr:0.0000100 epoch_Time:25049.0min: [2024-01-05 14:21:58,433][model8_pretrain.py][INFO] Epoch:[0/2](648400/4588595) loss:2.883 lr:0.0000100 epoch_Time:25049.0min: [2024-01-05 14:21:58,433][model8_pretrain.py][INFO] Epoch:[0/2](648400/4588595) loss:3.380 lr:0.0000100 epoch_Time:25049.0min: [2024-01-05 14:21:58,433][model8_pretrain.py][INFO] Epoch:[0/2](648400/4588595) loss:2.526 lr:0.0000100 epoch_Time:25049.0min: [2024-01-05 14:21:58,433][model8_pretrain.py][INFO] Epoch:[0/2](648400/4588595) loss:2.541 lr:0.0000100 epoch_Time:25049.0min: [2024-01-05 14:22:35,370][model8_pretrain.py][INFO] Epoch:[0/2](648500/4588595) loss:2.959 lr:0.0000100 epoch_Time:25049.0min: [2024-01-05 14:22:35,370][model8_pretrain.py][INFO] Epoch:[0/2](648500/4588595) loss:2.806 lr:0.0000100 epoch_Time:25049.0min: [2024-01-05 14:22:35,370][model8_pretrain.py][INFO] Epoch:[0/2](648500/4588595) loss:3.094 lr:0.0000100 
epoch_Time:25049.0min: [2024-01-05 14:22:35,370][model8_pretrain.py][INFO] Epoch:[0/2](648500/4588595) loss:2.456 lr:0.0000100 epoch_Time:25049.0min: [2024-01-05 14:22:35,370][model8_pretrain.py][INFO] Epoch:[0/2](648500/4588595) loss:2.675 lr:0.0000100 epoch_Time:25049.0min: [2024-01-05 14:22:35,371][model8_pretrain.py][INFO] Epoch:[0/2](648500/4588595) loss:3.109 lr:0.0000100 epoch_Time:25049.0min: [2024-01-05 14:22:35,371][model8_pretrain.py][INFO] Epoch:[0/2](648500/4588595) loss:2.662 lr:0.0000100 epoch_Time:25049.0min: [2024-01-05 14:22:35,371][model8_pretrain.py][INFO] Epoch:[0/2](648500/4588595) loss:3.220 lr:0.0000100 epoch_Time:25049.0min: [2024-01-05 14:23:12,319][model8_pretrain.py][INFO] Epoch:[0/2](648600/4588595) loss:2.835 lr:0.0000100 epoch_Time:25048.0min: [2024-01-05 14:23:12,319][model8_pretrain.py][INFO] Epoch:[0/2](648600/4588595) loss:3.073 lr:0.0000100 epoch_Time:25048.0min: [2024-01-05 14:23:12,319][model8_pretrain.py][INFO] Epoch:[0/2](648600/4588595) loss:2.838 lr:0.0000100 epoch_Time:25048.0min: [2024-01-05 14:23:12,319][model8_pretrain.py][INFO] Epoch:[0/2](648600/4588595) loss:3.072 lr:0.0000100 epoch_Time:25048.0min: [2024-01-05 14:23:12,319][model8_pretrain.py][INFO] Epoch:[0/2](648600/4588595) loss:2.605 lr:0.0000100 epoch_Time:25048.0min: [2024-01-05 14:23:12,319][model8_pretrain.py][INFO] Epoch:[0/2](648600/4588595) loss:3.105 lr:0.0000100 epoch_Time:25048.0min: [2024-01-05 14:23:12,319][model8_pretrain.py][INFO] Epoch:[0/2](648600/4588595) loss:2.544 lr:0.0000100 epoch_Time:25048.0min: [2024-01-05 14:23:12,320][model8_pretrain.py][INFO] Epoch:[0/2](648600/4588595) loss:3.531 lr:0.0000100 epoch_Time:25048.0min: [2024-01-05 14:23:49,270][model8_pretrain.py][INFO] Epoch:[0/2](648700/4588595) loss:2.807 lr:0.0000100 epoch_Time:25047.0min: [2024-01-05 14:23:49,270][model8_pretrain.py][INFO] Epoch:[0/2](648700/4588595) loss:2.899 lr:0.0000100 epoch_Time:25047.0min: [2024-01-05 14:23:49,270][model8_pretrain.py][INFO] 
Epoch:[0/2](648700/4588595) loss:3.152 lr:0.0000100 epoch_Time:25047.0min: [2024-01-05 14:23:49,270][model8_pretrain.py][INFO] Epoch:[0/2](648700/4588595) loss:2.481 lr:0.0000100 epoch_Time:25047.0min: [2024-01-05 14:23:49,270][model8_pretrain.py][INFO] Epoch:[0/2](648700/4588595) loss:2.821 lr:0.0000100 epoch_Time:25047.0min: [2024-01-05 14:23:49,270][model8_pretrain.py][INFO] Epoch:[0/2](648700/4588595) loss:2.789 lr:0.0000100 epoch_Time:25047.0min: [2024-01-05 14:23:49,270][model8_pretrain.py][INFO] Epoch:[0/2](648700/4588595) loss:2.749 lr:0.0000100 epoch_Time:25047.0min: [2024-01-05 14:23:49,270][model8_pretrain.py][INFO] Epoch:[0/2](648700/4588595) loss:2.293 lr:0.0000100 epoch_Time:25047.0min: [2024-01-05 14:24:26,228][model8_pretrain.py][INFO] Epoch:[0/2](648800/4588595) loss:2.803 lr:0.0000100 epoch_Time:25047.0min: [2024-01-05 14:24:26,228][model8_pretrain.py][INFO] Epoch:[0/2](648800/4588595) loss:2.699 lr:0.0000100 epoch_Time:25047.0min: [2024-01-05 14:24:26,228][model8_pretrain.py][INFO] Epoch:[0/2](648800/4588595) loss:3.261 lr:0.0000100 epoch_Time:25047.0min: [2024-01-05 14:24:26,228][model8_pretrain.py][INFO] Epoch:[0/2](648800/4588595) loss:2.662 lr:0.0000100 epoch_Time:25047.0min: [2024-01-05 14:24:26,228][model8_pretrain.py][INFO] Epoch:[0/2](648800/4588595) loss:2.538 lr:0.0000100 epoch_Time:25047.0min: [2024-01-05 14:24:26,228][model8_pretrain.py][INFO] Epoch:[0/2](648800/4588595) loss:2.832 lr:0.0000100 epoch_Time:25047.0min: [2024-01-05 14:24:26,228][model8_pretrain.py][INFO] Epoch:[0/2](648800/4588595) loss:3.247 lr:0.0000100 epoch_Time:25047.0min: [2024-01-05 14:24:26,228][model8_pretrain.py][INFO] Epoch:[0/2](648800/4588595) loss:2.917 lr:0.0000100 epoch_Time:25047.0min: [2024-01-05 14:25:03,220][model8_pretrain.py][INFO] Epoch:[0/2](648900/4588595) loss:3.009 lr:0.0000100 epoch_Time:25046.0min: [2024-01-05 14:25:03,220][model8_pretrain.py][INFO] Epoch:[0/2](648900/4588595) loss:2.778 lr:0.0000100 epoch_Time:25046.0min: [2024-01-05 
14:25:03,220][model8_pretrain.py][INFO] Epoch:[0/2](648900/4588595) loss:2.664 lr:0.0000100 epoch_Time:25046.0min: [2024-01-05 14:25:03,220][model8_pretrain.py][INFO] Epoch:[0/2](648900/4588595) loss:3.174 lr:0.0000100 epoch_Time:25046.0min: [2024-01-05 14:25:03,220][model8_pretrain.py][INFO] Epoch:[0/2](648900/4588595) loss:3.290 lr:0.0000100 epoch_Time:25046.0min: [2024-01-05 14:25:03,221][model8_pretrain.py][INFO] Epoch:[0/2](648900/4588595) loss:3.282 lr:0.0000100 epoch_Time:25046.0min: [2024-01-05 14:25:03,221][model8_pretrain.py][INFO] Epoch:[0/2](648900/4588595) loss:3.095 lr:0.0000100 epoch_Time:25046.0min: [2024-01-05 14:25:03,221][model8_pretrain.py][INFO] Epoch:[0/2](648900/4588595) loss:2.643 lr:0.0000100 epoch_Time:25046.0min: [2024-01-05 14:25:40,145][model8_pretrain.py][INFO] Epoch:[0/2](649000/4588595) loss:2.987 lr:0.0000100 epoch_Time:25045.0min: [2024-01-05 14:25:40,145][model8_pretrain.py][INFO] Epoch:[0/2](649000/4588595) loss:2.530 lr:0.0000100 epoch_Time:25045.0min: [2024-01-05 14:25:40,145][model8_pretrain.py][INFO] Epoch:[0/2](649000/4588595) loss:2.347 lr:0.0000100 epoch_Time:25045.0min: [2024-01-05 14:25:40,145][model8_pretrain.py][INFO] Epoch:[0/2](649000/4588595) loss:2.777 lr:0.0000100 epoch_Time:25045.0min: [2024-01-05 14:25:40,145][model8_pretrain.py][INFO] Epoch:[0/2](649000/4588595) loss:2.339 lr:0.0000100 epoch_Time:25045.0min: [2024-01-05 14:25:40,145][model8_pretrain.py][INFO] Epoch:[0/2](649000/4588595) loss:2.929 lr:0.0000100 epoch_Time:25045.0min: [2024-01-05 14:25:40,145][model8_pretrain.py][INFO] Epoch:[0/2](649000/4588595) loss:3.109 lr:0.0000100 epoch_Time:25045.0min: [2024-01-05 14:25:40,146][model8_pretrain.py][INFO] Epoch:[0/2](649000/4588595) loss:2.957 lr:0.0000100 epoch_Time:25045.0min: [2024-01-05 14:26:17,088][model8_pretrain.py][INFO] Epoch:[0/2](649100/4588595) loss:2.979 lr:0.0000100 epoch_Time:25044.0min: [2024-01-05 14:26:17,088][model8_pretrain.py][INFO] Epoch:[0/2](649100/4588595) loss:2.820 lr:0.0000100 
epoch_Time:25044.0min: [2024-01-05 14:26:17,088][model8_pretrain.py][INFO] Epoch:[0/2](649100/4588595) loss:3.075 lr:0.0000100 epoch_Time:25044.0min: [2024-01-05 14:26:17,088][model8_pretrain.py][INFO] Epoch:[0/2](649100/4588595) loss:2.705 lr:0.0000100 epoch_Time:25044.0min: [2024-01-05 14:26:17,088][model8_pretrain.py][INFO] Epoch:[0/2](649100/4588595) loss:2.686 lr:0.0000100 epoch_Time:25044.0min: [2024-01-05 14:26:17,088][model8_pretrain.py][INFO] Epoch:[0/2](649100/4588595) loss:2.620 lr:0.0000100 epoch_Time:25044.0min: [2024-01-05 14:26:17,088][model8_pretrain.py][INFO] Epoch:[0/2](649100/4588595) loss:2.664 lr:0.0000100 epoch_Time:25044.0min: [2024-01-05 14:26:17,088][model8_pretrain.py][INFO] Epoch:[0/2](649100/4588595) loss:3.134 lr:0.0000100 epoch_Time:25044.0min: [2024-01-05 14:27:05,957][model8_pretrain.py][INFO] Epoch:[0/2](649200/4588595) loss:2.925 lr:0.0000100 epoch_Time:25045.0min: [2024-01-05 14:27:05,957][model8_pretrain.py][INFO] Epoch:[0/2](649200/4588595) loss:2.689 lr:0.0000100 epoch_Time:25045.0min: [2024-01-05 14:27:05,957][model8_pretrain.py][INFO] Epoch:[0/2](649200/4588595) loss:2.742 lr:0.0000100 epoch_Time:25045.0min: [2024-01-05 14:27:05,957][model8_pretrain.py][INFO] Epoch:[0/2](649200/4588595) loss:1.923 lr:0.0000100 epoch_Time:25045.0min: [2024-01-05 14:27:05,957][model8_pretrain.py][INFO] Epoch:[0/2](649200/4588595) loss:3.188 lr:0.0000100 epoch_Time:25045.0min: [2024-01-05 14:27:05,957][model8_pretrain.py][INFO] Epoch:[0/2](649200/4588595) loss:3.132 lr:0.0000100 epoch_Time:25045.0min: [2024-01-05 14:27:05,957][model8_pretrain.py][INFO] Epoch:[0/2](649200/4588595) loss:2.762 lr:0.0000100 epoch_Time:25045.0min: [2024-01-05 14:27:05,957][model8_pretrain.py][INFO] Epoch:[0/2](649200/4588595) loss:2.830 lr:0.0000100 epoch_Time:25045.0min: [2024-01-05 14:27:42,888][model8_pretrain.py][INFO] Epoch:[0/2](649300/4588595) loss:2.953 lr:0.0000100 epoch_Time:25044.0min: [2024-01-05 14:27:42,888][model8_pretrain.py][INFO] 
Epoch:[0/2](649300/4588595) loss:2.310 lr:0.0000100 epoch_Time:25044.0min: [2024-01-05 14:27:42,888][model8_pretrain.py][INFO] Epoch:[0/2](649300/4588595) loss:2.940 lr:0.0000100 epoch_Time:25044.0min: [2024-01-05 14:27:42,888][model8_pretrain.py][INFO] Epoch:[0/2](649300/4588595) loss:2.463 lr:0.0000100 epoch_Time:25044.0min: [2024-01-05 14:27:42,888][model8_pretrain.py][INFO] Epoch:[0/2](649300/4588595) loss:2.875 lr:0.0000100 epoch_Time:25044.0min: [2024-01-05 14:27:42,888][model8_pretrain.py][INFO] Epoch:[0/2](649300/4588595) loss:2.753 lr:0.0000100 epoch_Time:25044.0min: [2024-01-05 14:27:42,888][model8_pretrain.py][INFO] Epoch:[0/2](649300/4588595) loss:2.955 lr:0.0000100 epoch_Time:25044.0min: [2024-01-05 14:27:42,888][model8_pretrain.py][INFO] Epoch:[0/2](649300/4588595) loss:3.097 lr:0.0000100 epoch_Time:25044.0min: [2024-01-05 14:28:19,834][model8_pretrain.py][INFO] Epoch:[0/2](649400/4588595) loss:3.041 lr:0.0000100 epoch_Time:25043.0min: [2024-01-05 14:28:19,834][model8_pretrain.py][INFO] Epoch:[0/2](649400/4588595) loss:3.112 lr:0.0000100 epoch_Time:25043.0min: [2024-01-05 14:28:19,834][model8_pretrain.py][INFO] Epoch:[0/2](649400/4588595) loss:2.961 lr:0.0000100 epoch_Time:25043.0min: [2024-01-05 14:28:19,834][model8_pretrain.py][INFO] Epoch:[0/2](649400/4588595) loss:2.549 lr:0.0000100 epoch_Time:25043.0min: [2024-01-05 14:28:19,834][model8_pretrain.py][INFO] Epoch:[0/2](649400/4588595) loss:2.053 lr:0.0000100 epoch_Time:25043.0min: [2024-01-05 14:28:19,834][model8_pretrain.py][INFO] Epoch:[0/2](649400/4588595) loss:2.313 lr:0.0000100 epoch_Time:25043.0min: [2024-01-05 14:28:19,834][model8_pretrain.py][INFO] Epoch:[0/2](649400/4588595) loss:3.089 lr:0.0000100 epoch_Time:25043.0min: [2024-01-05 14:28:19,834][model8_pretrain.py][INFO] Epoch:[0/2](649400/4588595) loss:2.816 lr:0.0000100 epoch_Time:25043.0min: [2024-01-05 14:28:56,768][model8_pretrain.py][INFO] Epoch:[0/2](649500/4588595) loss:2.994 lr:0.0000100 epoch_Time:25042.0min: [2024-01-05 
14:28:56,768][model8_pretrain.py][INFO] Epoch:[0/2](649500/4588595) loss:2.878 lr:0.0000100 epoch_Time:25042.0min: [2024-01-05 14:28:56,768][model8_pretrain.py][INFO] Epoch:[0/2](649500/4588595) loss:2.345 lr:0.0000100 epoch_Time:25042.0min: [2024-01-05 14:28:56,768][model8_pretrain.py][INFO] Epoch:[0/2](649500/4588595) loss:2.489 lr:0.0000100 epoch_Time:25042.0min: [2024-01-05 14:28:56,768][model8_pretrain.py][INFO] Epoch:[0/2](649500/4588595) loss:2.439 lr:0.0000100 epoch_Time:25042.0min: [2024-01-05 14:28:56,768][model8_pretrain.py][INFO] Epoch:[0/2](649500/4588595) loss:3.009 lr:0.0000100 epoch_Time:25042.0min: [2024-01-05 14:28:56,768][model8_pretrain.py][INFO] Epoch:[0/2](649500/4588595) loss:2.616 lr:0.0000100 epoch_Time:25042.0min: [2024-01-05 14:28:56,768][model8_pretrain.py][INFO] Epoch:[0/2](649500/4588595) loss:2.161 lr:0.0000100 epoch_Time:25042.0min: [2024-01-05 14:29:33,726][model8_pretrain.py][INFO] Epoch:[0/2](649600/4588595) loss:2.868 lr:0.0000100 epoch_Time:25042.0min: [2024-01-05 14:29:33,727][model8_pretrain.py][INFO] Epoch:[0/2](649600/4588595) loss:2.301 lr:0.0000100 epoch_Time:25042.0min: [2024-01-05 14:29:33,727][model8_pretrain.py][INFO] Epoch:[0/2](649600/4588595) loss:2.415 lr:0.0000100 epoch_Time:25042.0min: [2024-01-05 14:29:33,727][model8_pretrain.py][INFO] Epoch:[0/2](649600/4588595) loss:3.012 lr:0.0000100 epoch_Time:25042.0min: [2024-01-05 14:29:33,727][model8_pretrain.py][INFO] Epoch:[0/2](649600/4588595) loss:3.121 lr:0.0000100 epoch_Time:25042.0min: [2024-01-05 14:29:33,727][model8_pretrain.py][INFO] Epoch:[0/2](649600/4588595) loss:2.829 lr:0.0000100 epoch_Time:25042.0min: [2024-01-05 14:29:33,727][model8_pretrain.py][INFO] Epoch:[0/2](649600/4588595) loss:2.924 lr:0.0000100 epoch_Time:25042.0min: [2024-01-05 14:29:33,727][model8_pretrain.py][INFO] Epoch:[0/2](649600/4588595) loss:2.771 lr:0.0000100 epoch_Time:25042.0min: [2024-01-05 14:30:10,670][model8_pretrain.py][INFO] Epoch:[0/2](649700/4588595) loss:2.286 lr:0.0000100 
epoch_Time:25041.0min: [2024-01-05 14:30:10,670][model8_pretrain.py][INFO] Epoch:[0/2](649700/4588595) loss:2.991 lr:0.0000100 epoch_Time:25041.0min: [2024-01-05 14:30:10,670][model8_pretrain.py][INFO] Epoch:[0/2](649700/4588595) loss:2.639 lr:0.0000100 epoch_Time:25041.0min: [2024-01-05 14:30:10,670][model8_pretrain.py][INFO] Epoch:[0/2](649700/4588595) loss:2.688 lr:0.0000100 epoch_Time:25041.0min: [2024-01-05 14:30:10,670][model8_pretrain.py][INFO] Epoch:[0/2](649700/4588595) loss:2.983 lr:0.0000100 epoch_Time:25041.0min: [2024-01-05 14:30:10,670][model8_pretrain.py][INFO] Epoch:[0/2](649700/4588595) loss:3.064 lr:0.0000100 epoch_Time:25041.0min: [2024-01-05 14:30:10,670][model8_pretrain.py][INFO] Epoch:[0/2](649700/4588595) loss:2.323 lr:0.0000100 epoch_Time:25041.0min: [2024-01-05 14:30:10,670][model8_pretrain.py][INFO] Epoch:[0/2](649700/4588595) loss:3.203 lr:0.0000100 epoch_Time:25041.0min: [2024-01-05 14:30:47,597][model8_pretrain.py][INFO] Epoch:[0/2](649800/4588595) loss:2.844 lr:0.0000100 epoch_Time:25041.0min: [2024-01-05 14:30:47,597][model8_pretrain.py][INFO] Epoch:[0/2](649800/4588595) loss:2.769 lr:0.0000100 epoch_Time:25040.0min: [2024-01-05 14:30:47,597][model8_pretrain.py][INFO] Epoch:[0/2](649800/4588595) loss:2.653 lr:0.0000100 epoch_Time:25041.0min: [2024-01-05 14:30:47,597][model8_pretrain.py][INFO] Epoch:[0/2](649800/4588595) loss:3.103 lr:0.0000100 epoch_Time:25041.0min: [2024-01-05 14:30:47,597][model8_pretrain.py][INFO] Epoch:[0/2](649800/4588595) loss:2.507 lr:0.0000100 epoch_Time:25041.0min: [2024-01-05 14:30:47,597][model8_pretrain.py][INFO] Epoch:[0/2](649800/4588595) loss:3.043 lr:0.0000100 epoch_Time:25041.0min: [2024-01-05 14:30:47,597][model8_pretrain.py][INFO] Epoch:[0/2](649800/4588595) loss:2.783 lr:0.0000100 epoch_Time:25041.0min: [2024-01-05 14:30:47,597][model8_pretrain.py][INFO] Epoch:[0/2](649800/4588595) loss:2.693 lr:0.0000100 epoch_Time:25040.0min: [2024-01-05 14:31:24,543][model8_pretrain.py][INFO] 
Epoch:[0/2](649900/4588595) loss:2.903 lr:0.0000100 epoch_Time:25040.0min: [2024-01-05 14:31:24,543][model8_pretrain.py][INFO] Epoch:[0/2](649900/4588595) loss:3.662 lr:0.0000100 epoch_Time:25040.0min: [2024-01-05 14:31:24,544][model8_pretrain.py][INFO] Epoch:[0/2](649900/4588595) loss:2.789 lr:0.0000100 epoch_Time:25040.0min: [2024-01-05 14:31:24,544][model8_pretrain.py][INFO] Epoch:[0/2](649900/4588595) loss:2.559 lr:0.0000100 epoch_Time:25040.0min: [2024-01-05 14:31:24,544][model8_pretrain.py][INFO] Epoch:[0/2](649900/4588595) loss:2.741 lr:0.0000100 epoch_Time:25040.0min: [2024-01-05 14:31:24,544][model8_pretrain.py][INFO] Epoch:[0/2](649900/4588595) loss:3.017 lr:0.0000100 epoch_Time:25040.0min: [2024-01-05 14:31:24,544][model8_pretrain.py][INFO] Epoch:[0/2](649900/4588595) loss:3.058 lr:0.0000100 epoch_Time:25040.0min: [2024-01-05 14:31:24,544][model8_pretrain.py][INFO] Epoch:[0/2](649900/4588595) loss:2.516 lr:0.0000100 epoch_Time:25040.0min: [2024-01-05 14:32:13,510][model8_pretrain.py][INFO] Epoch:[0/2](650000/4588595) loss:2.894 lr:0.0000100 epoch_Time:25040.0min: [2024-01-05 14:32:13,510][model8_pretrain.py][INFO] Epoch:[0/2](650000/4588595) loss:2.865 lr:0.0000100 epoch_Time:25040.0min: [2024-01-05 14:32:13,510][model8_pretrain.py][INFO] Epoch:[0/2](650000/4588595) loss:2.501 lr:0.0000100 epoch_Time:25040.0min: [2024-01-05 14:32:13,510][model8_pretrain.py][INFO] Epoch:[0/2](650000/4588595) loss:3.389 lr:0.0000100 epoch_Time:25040.0min: [2024-01-05 14:32:13,510][model8_pretrain.py][INFO] Epoch:[0/2](650000/4588595) loss:3.123 lr:0.0000100 epoch_Time:25040.0min: [2024-01-05 14:32:13,510][model8_pretrain.py][INFO] Epoch:[0/2](650000/4588595) loss:2.271 lr:0.0000100 epoch_Time:25040.0min: [2024-01-05 14:32:13,510][model8_pretrain.py][INFO] Epoch:[0/2](650000/4588595) loss:1.848 lr:0.0000100 epoch_Time:25040.0min: [2024-01-05 14:32:13,510][model8_pretrain.py][INFO] Epoch:[0/2](650000/4588595) loss:3.090 lr:0.0000100 epoch_Time:25040.0min: [2024-01-05 
14:32:50,441][model8_pretrain.py][INFO] Epoch:[0/2](650100/4588595) loss:2.740 lr:0.0000100 epoch_Time:25039.0min: [2024-01-05 14:32:50,441][model8_pretrain.py][INFO] Epoch:[0/2](650100/4588595) loss:2.801 lr:0.0000100 epoch_Time:25039.0min: [2024-01-05 14:32:50,441][model8_pretrain.py][INFO] Epoch:[0/2](650100/4588595) loss:2.940 lr:0.0000100 epoch_Time:25039.0min: [2024-01-05 14:32:50,441][model8_pretrain.py][INFO] Epoch:[0/2](650100/4588595) loss:2.617 lr:0.0000100 epoch_Time:25039.0min: [2024-01-05 14:32:50,441][model8_pretrain.py][INFO] Epoch:[0/2](650100/4588595) loss:2.602 lr:0.0000100 epoch_Time:25039.0min: [2024-01-05 14:32:50,441][model8_pretrain.py][INFO] Epoch:[0/2](650100/4588595) loss:2.233 lr:0.0000100 epoch_Time:25039.0min: [2024-01-05 14:32:50,441][model8_pretrain.py][INFO] Epoch:[0/2](650100/4588595) loss:2.634 lr:0.0000100 epoch_Time:25039.0min: [2024-01-05 14:32:50,441][model8_pretrain.py][INFO] Epoch:[0/2](650100/4588595) loss:2.596 lr:0.0000100 epoch_Time:25039.0min: [2024-01-05 14:33:27,396][model8_pretrain.py][INFO] Epoch:[0/2](650200/4588595) loss:2.464 lr:0.0000100 epoch_Time:25039.0min: [2024-01-05 14:33:27,396][model8_pretrain.py][INFO] Epoch:[0/2](650200/4588595) loss:2.719 lr:0.0000100 epoch_Time:25039.0min: [2024-01-05 14:33:27,396][model8_pretrain.py][INFO] Epoch:[0/2](650200/4588595) loss:2.217 lr:0.0000100 epoch_Time:25039.0min: [2024-01-05 14:33:27,396][model8_pretrain.py][INFO] Epoch:[0/2](650200/4588595) loss:2.788 lr:0.0000100 epoch_Time:25039.0min: [2024-01-05 14:33:27,396][model8_pretrain.py][INFO] Epoch:[0/2](650200/4588595) loss:2.817 lr:0.0000100 epoch_Time:25039.0min: [2024-01-05 14:33:27,396][model8_pretrain.py][INFO] Epoch:[0/2](650200/4588595) loss:3.045 lr:0.0000100 epoch_Time:25039.0min: [2024-01-05 14:33:27,396][model8_pretrain.py][INFO] Epoch:[0/2](650200/4588595) loss:2.802 lr:0.0000100 epoch_Time:25039.0min: [2024-01-05 14:33:27,396][model8_pretrain.py][INFO] Epoch:[0/2](650200/4588595) loss:3.009 lr:0.0000100 
epoch_Time:25039.0min: [2024-01-05 14:34:04,361][model8_pretrain.py][INFO] Epoch:[0/2](650300/4588595) loss:2.978 lr:0.0000100 epoch_Time:25037.0min: [2024-01-05 14:34:04,361][model8_pretrain.py][INFO] Epoch:[0/2](650300/4588595) loss:2.749 lr:0.0000100 epoch_Time:25037.0min: [2024-01-05 14:34:04,361][model8_pretrain.py][INFO] Epoch:[0/2](650300/4588595) loss:2.752 lr:0.0000100 epoch_Time:25037.0min: [2024-01-05 14:34:04,361][model8_pretrain.py][INFO] Epoch:[0/2](650300/4588595) loss:3.064 lr:0.0000100 epoch_Time:25037.0min: [2024-01-05 14:34:04,361][model8_pretrain.py][INFO] Epoch:[0/2](650300/4588595) loss:2.430 lr:0.0000100 epoch_Time:25037.0min: [2024-01-05 14:34:04,361][model8_pretrain.py][INFO] Epoch:[0/2](650300/4588595) loss:3.004 lr:0.0000100 epoch_Time:25037.0min: [2024-01-05 14:34:04,361][model8_pretrain.py][INFO] Epoch:[0/2](650300/4588595) loss:3.031 lr:0.0000100 epoch_Time:25037.0min: [2024-01-05 14:34:04,361][model8_pretrain.py][INFO] Epoch:[0/2](650300/4588595) loss:2.954 lr:0.0000100 epoch_Time:25037.0min: [2024-01-05 14:34:41,315][model8_pretrain.py][INFO] Epoch:[0/2](650400/4588595) loss:3.353 lr:0.0000100 epoch_Time:25037.0min: [2024-01-05 14:34:41,315][model8_pretrain.py][INFO] Epoch:[0/2](650400/4588595) loss:2.806 lr:0.0000100 epoch_Time:25037.0min: [2024-01-05 14:34:41,315][model8_pretrain.py][INFO] Epoch:[0/2](650400/4588595) loss:3.057 lr:0.0000100 epoch_Time:25037.0min: [2024-01-05 14:34:41,315][model8_pretrain.py][INFO] Epoch:[0/2](650400/4588595) loss:3.015 lr:0.0000100 epoch_Time:25037.0min: [2024-01-05 14:34:41,315][model8_pretrain.py][INFO] Epoch:[0/2](650400/4588595) loss:2.689 lr:0.0000100 epoch_Time:25037.0min: [2024-01-05 14:34:41,315][model8_pretrain.py][INFO] Epoch:[0/2](650400/4588595) loss:2.403 lr:0.0000100 epoch_Time:25037.0min: [2024-01-05 14:34:41,315][model8_pretrain.py][INFO] Epoch:[0/2](650400/4588595) loss:2.660 lr:0.0000100 epoch_Time:25037.0min: [2024-01-05 14:34:41,315][model8_pretrain.py][INFO] 
Epoch:[0/2](650400/4588595) loss:3.417 lr:0.0000100 epoch_Time:25037.0min: [2024-01-05 14:35:18,263][model8_pretrain.py][INFO] Epoch:[0/2](650500/4588595) loss:3.154 lr:0.0000100 epoch_Time:25036.0min: [2024-01-05 14:35:18,263][model8_pretrain.py][INFO] Epoch:[0/2](650500/4588595) loss:2.607 lr:0.0000100 epoch_Time:25036.0min: [2024-01-05 14:35:18,263][model8_pretrain.py][INFO] Epoch:[0/2](650500/4588595) loss:2.835 lr:0.0000100 epoch_Time:25036.0min: [2024-01-05 14:35:18,263][model8_pretrain.py][INFO] Epoch:[0/2](650500/4588595) loss:2.434 lr:0.0000100 epoch_Time:25036.0min: [2024-01-05 14:35:18,263][model8_pretrain.py][INFO] Epoch:[0/2](650500/4588595) loss:2.870 lr:0.0000100 epoch_Time:25036.0min: [2024-01-05 14:35:18,263][model8_pretrain.py][INFO] Epoch:[0/2](650500/4588595) loss:2.556 lr:0.0000100 epoch_Time:25036.0min: [2024-01-05 14:35:18,263][model8_pretrain.py][INFO] Epoch:[0/2](650500/4588595) loss:2.885 lr:0.0000100 epoch_Time:25036.0min: [2024-01-05 14:35:18,263][model8_pretrain.py][INFO] Epoch:[0/2](650500/4588595) loss:2.973 lr:0.0000100 epoch_Time:25036.0min: [2024-01-05 14:35:55,208][model8_pretrain.py][INFO] Epoch:[0/2](650600/4588595) loss:2.880 lr:0.0000100 epoch_Time:25035.0min: [2024-01-05 14:35:55,208][model8_pretrain.py][INFO] Epoch:[0/2](650600/4588595) loss:2.600 lr:0.0000100 epoch_Time:25035.0min: [2024-01-05 14:35:55,208][model8_pretrain.py][INFO] Epoch:[0/2](650600/4588595) loss:2.747 lr:0.0000100 epoch_Time:25035.0min: [2024-01-05 14:35:55,208][model8_pretrain.py][INFO] Epoch:[0/2](650600/4588595) loss:2.889 lr:0.0000100 epoch_Time:25035.0min: [2024-01-05 14:35:55,208][model8_pretrain.py][INFO] Epoch:[0/2](650600/4588595) loss:2.903 lr:0.0000100 epoch_Time:25035.0min: [2024-01-05 14:35:55,208][model8_pretrain.py][INFO] Epoch:[0/2](650600/4588595) loss:2.665 lr:0.0000100 epoch_Time:25035.0min: [2024-01-05 14:35:55,208][model8_pretrain.py][INFO] Epoch:[0/2](650600/4588595) loss:2.657 lr:0.0000100 epoch_Time:25035.0min: [2024-01-05 
14:35:55,209][model8_pretrain.py][INFO] Epoch:[0/2](650600/4588595) loss:2.893 lr:0.0000100 epoch_Time:25035.0min: [2024-01-05 14:36:32,180][model8_pretrain.py][INFO] Epoch:[0/2](650700/4588595) loss:3.229 lr:0.0000100 epoch_Time:25035.0min: [2024-01-05 14:36:32,180][model8_pretrain.py][INFO] Epoch:[0/2](650700/4588595) loss:2.911 lr:0.0000100 epoch_Time:25035.0min: [2024-01-05 14:36:32,180][model8_pretrain.py][INFO] Epoch:[0/2](650700/4588595) loss:3.279 lr:0.0000100 epoch_Time:25035.0min: [2024-01-05 14:36:32,180][model8_pretrain.py][INFO] Epoch:[0/2](650700/4588595) loss:2.408 lr:0.0000100 epoch_Time:25035.0min: [2024-01-05 14:36:32,180][model8_pretrain.py][INFO] Epoch:[0/2](650700/4588595) loss:2.695 lr:0.0000100 epoch_Time:25035.0min: [2024-01-05 14:36:32,180][model8_pretrain.py][INFO] Epoch:[0/2](650700/4588595) loss:2.432 lr:0.0000100 epoch_Time:25035.0min: [2024-01-05 14:36:32,180][model8_pretrain.py][INFO] Epoch:[0/2](650700/4588595) loss:2.327 lr:0.0000100 epoch_Time:25035.0min: [2024-01-05 14:36:32,180][model8_pretrain.py][INFO] Epoch:[0/2](650700/4588595) loss:2.887 lr:0.0000100 epoch_Time:25035.0min: [2024-01-05 14:37:19,407][model8_pretrain.py][INFO] Epoch:[0/2](650800/4588595) loss:2.814 lr:0.0000100 epoch_Time:25035.0min: [2024-01-05 14:37:19,407][model8_pretrain.py][INFO] Epoch:[0/2](650800/4588595) loss:3.003 lr:0.0000100 epoch_Time:25035.0min: [2024-01-05 14:37:19,408][model8_pretrain.py][INFO] Epoch:[0/2](650800/4588595) loss:2.903 lr:0.0000100 epoch_Time:25035.0min: [2024-01-05 14:37:19,408][model8_pretrain.py][INFO] Epoch:[0/2](650800/4588595) loss:2.872 lr:0.0000100 epoch_Time:25035.0min: [2024-01-05 14:37:19,408][model8_pretrain.py][INFO] Epoch:[0/2](650800/4588595) loss:2.974 lr:0.0000100 epoch_Time:25035.0min: [2024-01-05 14:37:19,408][model8_pretrain.py][INFO] Epoch:[0/2](650800/4588595) loss:2.824 lr:0.0000100 epoch_Time:25035.0min: [2024-01-05 14:37:19,408][model8_pretrain.py][INFO] Epoch:[0/2](650800/4588595) loss:2.892 lr:0.0000100 
epoch_Time:25035.0min: [2024-01-05 14:37:19,408][model8_pretrain.py][INFO] Epoch:[0/2](650800/4588595) loss:2.698 lr:0.0000100 epoch_Time:25035.0min: [2024-01-05 14:37:58,022][model8_pretrain.py][INFO] Epoch:[0/2](650900/4588595) loss:2.362 lr:0.0000100 epoch_Time:25034.0min: [2024-01-05 14:37:58,022][model8_pretrain.py][INFO] Epoch:[0/2](650900/4588595) loss:2.964 lr:0.0000100 epoch_Time:25034.0min: [2024-01-05 14:37:58,022][model8_pretrain.py][INFO] Epoch:[0/2](650900/4588595) loss:2.967 lr:0.0000100 epoch_Time:25034.0min: [2024-01-05 14:37:58,022][model8_pretrain.py][INFO] Epoch:[0/2](650900/4588595) loss:2.647 lr:0.0000100 epoch_Time:25034.0min: [2024-01-05 14:37:58,022][model8_pretrain.py][INFO] Epoch:[0/2](650900/4588595) loss:2.751 lr:0.0000100 epoch_Time:25034.0min: [2024-01-05 14:37:58,022][model8_pretrain.py][INFO] Epoch:[0/2](650900/4588595) loss:3.155 lr:0.0000100 epoch_Time:25034.0min: [2024-01-05 14:37:58,022][model8_pretrain.py][INFO] Epoch:[0/2](650900/4588595) loss:2.560 lr:0.0000100 epoch_Time:25034.0min: [2024-01-05 14:37:58,022][model8_pretrain.py][INFO] Epoch:[0/2](650900/4588595) loss:3.218 lr:0.0000100 epoch_Time:25034.0min: [2024-01-05 14:38:34,969][model8_pretrain.py][INFO] Epoch:[0/2](651000/4588595) loss:2.921 lr:0.0000100 epoch_Time:25034.0min: [2024-01-05 14:38:34,969][model8_pretrain.py][INFO] Epoch:[0/2](651000/4588595) loss:2.698 lr:0.0000100 epoch_Time:25034.0min: [2024-01-05 14:38:34,969][model8_pretrain.py][INFO] Epoch:[0/2](651000/4588595) loss:2.796 lr:0.0000100 epoch_Time:25034.0min: [2024-01-05 14:38:34,969][model8_pretrain.py][INFO] Epoch:[0/2](651000/4588595) loss:2.288 lr:0.0000100 epoch_Time:25034.0min: [2024-01-05 14:38:34,969][model8_pretrain.py][INFO] Epoch:[0/2](651000/4588595) loss:2.652 lr:0.0000100 epoch_Time:25034.0min: [2024-01-05 14:38:34,969][model8_pretrain.py][INFO] Epoch:[0/2](651000/4588595) loss:2.926 lr:0.0000100 epoch_Time:25034.0min: [2024-01-05 14:38:34,969][model8_pretrain.py][INFO] 
Epoch:[0/2](651000/4588595) loss:2.769 lr:0.0000100 epoch_Time:25034.0min: [2024-01-05 14:38:34,969][model8_pretrain.py][INFO] Epoch:[0/2](651000/4588595) loss:2.844 lr:0.0000100 epoch_Time:25034.0min: [2024-01-05 14:39:11,907][model8_pretrain.py][INFO] Epoch:[0/2](651100/4588595) loss:2.680 lr:0.0000100 epoch_Time:25033.0min: [2024-01-05 14:39:11,907][model8_pretrain.py][INFO] Epoch:[0/2](651100/4588595) loss:3.163 lr:0.0000100 epoch_Time:25033.0min: [2024-01-05 14:39:11,907][model8_pretrain.py][INFO] Epoch:[0/2](651100/4588595) loss:3.122 lr:0.0000100 epoch_Time:25033.0min: [2024-01-05 14:39:11,907][model8_pretrain.py][INFO] Epoch:[0/2](651100/4588595) loss:3.131 lr:0.0000100 epoch_Time:25033.0min: [2024-01-05 14:39:11,907][model8_pretrain.py][INFO] Epoch:[0/2](651100/4588595) loss:2.763 lr:0.0000100 epoch_Time:25033.0min: [2024-01-05 14:39:11,907][model8_pretrain.py][INFO] Epoch:[0/2](651100/4588595) loss:2.537 lr:0.0000100 epoch_Time:25033.0min: [2024-01-05 14:39:11,907][model8_pretrain.py][INFO] Epoch:[0/2](651100/4588595) loss:3.462 lr:0.0000100 epoch_Time:25033.0min: [2024-01-05 14:39:11,907][model8_pretrain.py][INFO] Epoch:[0/2](651100/4588595) loss:2.908 lr:0.0000100 epoch_Time:25033.0min: [2024-01-05 14:39:48,850][model8_pretrain.py][INFO] Epoch:[0/2](651200/4588595) loss:2.354 lr:0.0000100 epoch_Time:25032.0min: [2024-01-05 14:39:48,850][model8_pretrain.py][INFO] Epoch:[0/2](651200/4588595) loss:3.158 lr:0.0000100 epoch_Time:25032.0min: [2024-01-05 14:39:48,850][model8_pretrain.py][INFO] Epoch:[0/2](651200/4588595) loss:2.756 lr:0.0000100 epoch_Time:25032.0min: [2024-01-05 14:39:48,850][model8_pretrain.py][INFO] Epoch:[0/2](651200/4588595) loss:3.525 lr:0.0000100 epoch_Time:25032.0min: [2024-01-05 14:39:48,850][model8_pretrain.py][INFO] Epoch:[0/2](651200/4588595) loss:2.372 lr:0.0000100 epoch_Time:25032.0min: [2024-01-05 14:39:48,850][model8_pretrain.py][INFO] Epoch:[0/2](651200/4588595) loss:3.155 lr:0.0000100 epoch_Time:25032.0min: [2024-01-05 
14:39:48,850][model8_pretrain.py][INFO] Epoch:[0/2](651200/4588595) loss:3.412 lr:0.0000100 epoch_Time:25032.0min: [2024-01-05 14:39:48,850][model8_pretrain.py][INFO] Epoch:[0/2](651200/4588595) loss:2.876 lr:0.0000100 epoch_Time:25032.0min: [2024-01-05 14:40:25,778][model8_pretrain.py][INFO] Epoch:[0/2](651300/4588595) loss:3.194 lr:0.0000100 epoch_Time:25031.0min: [2024-01-05 14:40:25,778][model8_pretrain.py][INFO] Epoch:[0/2](651300/4588595) loss:1.909 lr:0.0000100 epoch_Time:25031.0min: [2024-01-05 14:40:25,778][model8_pretrain.py][INFO] Epoch:[0/2](651300/4588595) loss:3.183 lr:0.0000100 epoch_Time:25031.0min: [2024-01-05 14:40:25,778][model8_pretrain.py][INFO] Epoch:[0/2](651300/4588595) loss:2.772 lr:0.0000100 epoch_Time:25031.0min: [2024-01-05 14:40:25,778][model8_pretrain.py][INFO] Epoch:[0/2](651300/4588595) loss:2.290 lr:0.0000100 epoch_Time:25031.0min: [2024-01-05 14:40:25,778][model8_pretrain.py][INFO] Epoch:[0/2](651300/4588595) loss:2.822 lr:0.0000100 epoch_Time:25031.0min: [2024-01-05 14:40:25,778][model8_pretrain.py][INFO] Epoch:[0/2](651300/4588595) loss:2.592 lr:0.0000100 epoch_Time:25031.0min: [2024-01-05 14:40:25,778][model8_pretrain.py][INFO] Epoch:[0/2](651300/4588595) loss:3.179 lr:0.0000100 epoch_Time:25031.0min: [2024-01-05 14:41:02,723][model8_pretrain.py][INFO] Epoch:[0/2](651400/4588595) loss:3.190 lr:0.0000100 epoch_Time:25030.0min: [2024-01-05 14:41:02,723][model8_pretrain.py][INFO] Epoch:[0/2](651400/4588595) loss:2.912 lr:0.0000100 epoch_Time:25030.0min: [2024-01-05 14:41:02,723][model8_pretrain.py][INFO] Epoch:[0/2](651400/4588595) loss:2.358 lr:0.0000100 epoch_Time:25030.0min: [2024-01-05 14:41:02,723][model8_pretrain.py][INFO] Epoch:[0/2](651400/4588595) loss:2.946 lr:0.0000100 epoch_Time:25030.0min: [2024-01-05 14:41:02,723][model8_pretrain.py][INFO] Epoch:[0/2](651400/4588595) loss:3.019 lr:0.0000100 epoch_Time:25030.0min: [2024-01-05 14:41:02,723][model8_pretrain.py][INFO] Epoch:[0/2](651400/4588595) loss:2.907 lr:0.0000100 
epoch_Time:25030.0min: [2024-01-05 14:41:02,723][model8_pretrain.py][INFO] Epoch:[0/2](651400/4588595) loss:2.837 lr:0.0000100 epoch_Time:25030.0min: [2024-01-05 14:41:02,723][model8_pretrain.py][INFO] Epoch:[0/2](651400/4588595) loss:2.419 lr:0.0000100 epoch_Time:25030.0min: [2024-01-05 14:41:39,655][model8_pretrain.py][INFO] Epoch:[0/2](651500/4588595) loss:2.876 lr:0.0000100 epoch_Time:25030.0min: [2024-01-05 14:41:39,655][model8_pretrain.py][INFO] Epoch:[0/2](651500/4588595) loss:2.839 lr:0.0000100 epoch_Time:25030.0min: [2024-01-05 14:41:39,655][model8_pretrain.py][INFO] Epoch:[0/2](651500/4588595) loss:2.534 lr:0.0000100 epoch_Time:25030.0min: [2024-01-05 14:41:39,655][model8_pretrain.py][INFO] Epoch:[0/2](651500/4588595) loss:2.875 lr:0.0000100 epoch_Time:25030.0min: [2024-01-05 14:41:39,655][model8_pretrain.py][INFO] Epoch:[0/2](651500/4588595) loss:3.122 lr:0.0000100 epoch_Time:25030.0min: [2024-01-05 14:41:39,655][model8_pretrain.py][INFO] Epoch:[0/2](651500/4588595) loss:2.713 lr:0.0000100 epoch_Time:25030.0min: [2024-01-05 14:41:39,655][model8_pretrain.py][INFO] Epoch:[0/2](651500/4588595) loss:2.892 lr:0.0000100 epoch_Time:25030.0min: [2024-01-05 14:41:39,656][model8_pretrain.py][INFO] Epoch:[0/2](651500/4588595) loss:2.527 lr:0.0000100 epoch_Time:25030.0min: [2024-01-05 14:42:27,001][model8_pretrain.py][INFO] Epoch:[0/2](651600/4588595) loss:3.374 lr:0.0000100 epoch_Time:25030.0min: [2024-01-05 14:42:27,001][model8_pretrain.py][INFO] Epoch:[0/2](651600/4588595) loss:2.485 lr:0.0000100 epoch_Time:25030.0min: [2024-01-05 14:42:27,001][model8_pretrain.py][INFO] Epoch:[0/2](651600/4588595) loss:3.352 lr:0.0000100 epoch_Time:25030.0min: [2024-01-05 14:42:27,001][model8_pretrain.py][INFO] Epoch:[0/2](651600/4588595) loss:2.851 lr:0.0000100 epoch_Time:25030.0min: [2024-01-05 14:42:27,001][model8_pretrain.py][INFO] Epoch:[0/2](651600/4588595) loss:2.692 lr:0.0000100 epoch_Time:25030.0min: [2024-01-05 14:42:27,001][model8_pretrain.py][INFO] 
Epoch:[0/2](651600/4588595) loss:3.101 lr:0.0000100 epoch_Time:25030.0min: [2024-01-05 14:42:27,001][model8_pretrain.py][INFO] Epoch:[0/2](651600/4588595) loss:2.733 lr:0.0000100 epoch_Time:25030.0min: [2024-01-05 14:42:27,001][model8_pretrain.py][INFO] Epoch:[0/2](651600/4588595) loss:2.540 lr:0.0000100 epoch_Time:25030.0min: [2024-01-05 14:43:05,600][model8_pretrain.py][INFO] Epoch:[0/2](651700/4588595) loss:2.879 lr:0.0000100 epoch_Time:25029.0min: [2024-01-05 14:43:05,600][model8_pretrain.py][INFO] Epoch:[0/2](651700/4588595) loss:2.447 lr:0.0000100 epoch_Time:25029.0min: [2024-01-05 14:43:05,600][model8_pretrain.py][INFO] Epoch:[0/2](651700/4588595) loss:2.879 lr:0.0000100 epoch_Time:25029.0min: [2024-01-05 14:43:05,600][model8_pretrain.py][INFO] Epoch:[0/2](651700/4588595) loss:2.550 lr:0.0000100 epoch_Time:25029.0min: [2024-01-05 14:43:05,600][model8_pretrain.py][INFO] Epoch:[0/2](651700/4588595) loss:3.245 lr:0.0000100 epoch_Time:25029.0min: [2024-01-05 14:43:05,600][model8_pretrain.py][INFO] Epoch:[0/2](651700/4588595) loss:2.669 lr:0.0000100 epoch_Time:25029.0min: [2024-01-05 14:43:05,600][model8_pretrain.py][INFO] Epoch:[0/2](651700/4588595) loss:2.821 lr:0.0000100 epoch_Time:25029.0min: [2024-01-05 14:43:05,600][model8_pretrain.py][INFO] Epoch:[0/2](651700/4588595) loss:3.131 lr:0.0000100 epoch_Time:25029.0min: [2024-01-05 14:43:42,542][model8_pretrain.py][INFO] Epoch:[0/2](651800/4588595) loss:2.901 lr:0.0000100 epoch_Time:25029.0min: [2024-01-05 14:43:42,542][model8_pretrain.py][INFO] Epoch:[0/2](651800/4588595) loss:2.487 lr:0.0000100 epoch_Time:25029.0min: [2024-01-05 14:43:42,542][model8_pretrain.py][INFO] Epoch:[0/2](651800/4588595) loss:3.564 lr:0.0000100 epoch_Time:25029.0min: [2024-01-05 14:43:42,542][model8_pretrain.py][INFO] Epoch:[0/2](651800/4588595) loss:2.790 lr:0.0000100 epoch_Time:25029.0min: [2024-01-05 14:43:42,542][model8_pretrain.py][INFO] Epoch:[0/2](651800/4588595) loss:2.245 lr:0.0000100 epoch_Time:25029.0min: [2024-01-05 
14:43:42,542][model8_pretrain.py][INFO] Epoch:[0/2](651800/4588595) loss:2.686 lr:0.0000100 epoch_Time:25029.0min: [2024-01-05 14:43:42,542][model8_pretrain.py][INFO] Epoch:[0/2](651800/4588595) loss:3.514 lr:0.0000100 epoch_Time:25029.0min: [2024-01-05 14:43:42,542][model8_pretrain.py][INFO] Epoch:[0/2](651800/4588595) loss:2.765 lr:0.0000100 epoch_Time:25029.0min: [2024-01-05 14:44:19,483][model8_pretrain.py][INFO] Epoch:[0/2](651900/4588595) loss:2.638 lr:0.0000100 epoch_Time:25028.0min: [2024-01-05 14:44:19,483][model8_pretrain.py][INFO] Epoch:[0/2](651900/4588595) loss:2.781 lr:0.0000100 epoch_Time:25028.0min: [2024-01-05 14:44:19,483][model8_pretrain.py][INFO] Epoch:[0/2](651900/4588595) loss:3.095 lr:0.0000100 epoch_Time:25028.0min: [2024-01-05 14:44:19,483][model8_pretrain.py][INFO] Epoch:[0/2](651900/4588595) loss:2.693 lr:0.0000100 epoch_Time:25028.0min: [2024-01-05 14:44:19,483][model8_pretrain.py][INFO] Epoch:[0/2](651900/4588595) loss:2.776 lr:0.0000100 epoch_Time:25028.0min: [2024-01-05 14:44:19,483][model8_pretrain.py][INFO] Epoch:[0/2](651900/4588595) loss:3.023 lr:0.0000100 epoch_Time:25028.0min: [2024-01-05 14:44:19,483][model8_pretrain.py][INFO] Epoch:[0/2](651900/4588595) loss:2.532 lr:0.0000100 epoch_Time:25028.0min: [2024-01-05 14:44:19,484][model8_pretrain.py][INFO] Epoch:[0/2](651900/4588595) loss:3.667 lr:0.0000100 epoch_Time:25028.0min: [2024-01-05 14:44:56,415][model8_pretrain.py][INFO] Epoch:[0/2](652000/4588595) loss:2.540 lr:0.0000100 epoch_Time:25027.0min: [2024-01-05 14:44:56,416][model8_pretrain.py][INFO] Epoch:[0/2](652000/4588595) loss:3.266 lr:0.0000100 epoch_Time:25027.0min: [2024-01-05 14:44:56,416][model8_pretrain.py][INFO] Epoch:[0/2](652000/4588595) loss:2.798 lr:0.0000100 epoch_Time:25027.0min: [2024-01-05 14:44:56,416][model8_pretrain.py][INFO] Epoch:[0/2](652000/4588595) loss:2.398 lr:0.0000100 epoch_Time:25027.0min: [2024-01-05 14:44:56,416][model8_pretrain.py][INFO] Epoch:[0/2](652000/4588595) loss:2.783 lr:0.0000100 
epoch_Time:25027.0min: [2024-01-05 14:44:56,416][model8_pretrain.py][INFO] Epoch:[0/2](652000/4588595) loss:2.950 lr:0.0000100 epoch_Time:25027.0min: [2024-01-05 14:44:56,416][model8_pretrain.py][INFO] Epoch:[0/2](652000/4588595) loss:2.963 lr:0.0000100 epoch_Time:25027.0min: [2024-01-05 14:44:56,416][model8_pretrain.py][INFO] Epoch:[0/2](652000/4588595) loss:3.252 lr:0.0000100 epoch_Time:25027.0min: [2024-01-05 14:45:33,370][model8_pretrain.py][INFO] Epoch:[0/2](652100/4588595) loss:2.876 lr:0.0000100 epoch_Time:25027.0min: [2024-01-05 14:45:33,370][model8_pretrain.py][INFO] Epoch:[0/2](652100/4588595) loss:2.528 lr:0.0000100 epoch_Time:25027.0min: [2024-01-05 14:45:33,370][model8_pretrain.py][INFO] Epoch:[0/2](652100/4588595) loss:3.309 lr:0.0000100 epoch_Time:25027.0min: [2024-01-05 14:45:33,370][model8_pretrain.py][INFO] Epoch:[0/2](652100/4588595) loss:2.797 lr:0.0000100 epoch_Time:25027.0min: [2024-01-05 14:45:33,370][model8_pretrain.py][INFO] Epoch:[0/2](652100/4588595) loss:2.661 lr:0.0000100 epoch_Time:25027.0min: [2024-01-05 14:45:33,370][model8_pretrain.py][INFO] Epoch:[0/2](652100/4588595) loss:3.035 lr:0.0000100 epoch_Time:25027.0min: [2024-01-05 14:45:33,370][model8_pretrain.py][INFO] Epoch:[0/2](652100/4588595) loss:2.919 lr:0.0000100 epoch_Time:25027.0min: [2024-01-05 14:45:33,370][model8_pretrain.py][INFO] Epoch:[0/2](652100/4588595) loss:2.684 lr:0.0000100 epoch_Time:25027.0min: [2024-01-05 14:46:10,318][model8_pretrain.py][INFO] Epoch:[0/2](652200/4588595) loss:2.686 lr:0.0000100 epoch_Time:25026.0min: [2024-01-05 14:46:10,318][model8_pretrain.py][INFO] Epoch:[0/2](652200/4588595) loss:2.625 lr:0.0000100 epoch_Time:25026.0min: [2024-01-05 14:46:10,318][model8_pretrain.py][INFO] Epoch:[0/2](652200/4588595) loss:3.007 lr:0.0000100 epoch_Time:25026.0min: [2024-01-05 14:46:10,318][model8_pretrain.py][INFO] Epoch:[0/2](652200/4588595) loss:2.870 lr:0.0000100 epoch_Time:25026.0min: [2024-01-05 14:46:10,318][model8_pretrain.py][INFO] 
Epoch:[0/2](652200/4588595) loss:2.955 lr:0.0000100 epoch_Time:25026.0min: [2024-01-05 14:46:10,318][model8_pretrain.py][INFO] Epoch:[0/2](652200/4588595) loss:2.911 lr:0.0000100 epoch_Time:25026.0min: [2024-01-05 14:46:10,318][model8_pretrain.py][INFO] Epoch:[0/2](652200/4588595) loss:2.821 lr:0.0000100 epoch_Time:25026.0min: [2024-01-05 14:46:10,318][model8_pretrain.py][INFO] Epoch:[0/2](652200/4588595) loss:3.127 lr:0.0000100 epoch_Time:25026.0min: [2024-01-05 14:46:47,258][model8_pretrain.py][INFO] Epoch:[0/2](652300/4588595) loss:3.285 lr:0.0000100 epoch_Time:25025.0min: [2024-01-05 14:46:47,258][model8_pretrain.py][INFO] Epoch:[0/2](652300/4588595) loss:3.376 lr:0.0000100 epoch_Time:25025.0min: [2024-01-05 14:46:47,258][model8_pretrain.py][INFO] Epoch:[0/2](652300/4588595) loss:2.981 lr:0.0000100 epoch_Time:25025.0min: [2024-01-05 14:46:47,258][model8_pretrain.py][INFO] Epoch:[0/2](652300/4588595) loss:2.680 lr:0.0000100 epoch_Time:25025.0min: [2024-01-05 14:46:47,258][model8_pretrain.py][INFO] Epoch:[0/2](652300/4588595) loss:2.921 lr:0.0000100 epoch_Time:25025.0min: [2024-01-05 14:46:47,258][model8_pretrain.py][INFO] Epoch:[0/2](652300/4588595) loss:2.442 lr:0.0000100 epoch_Time:25025.0min: [2024-01-05 14:46:47,258][model8_pretrain.py][INFO] Epoch:[0/2](652300/4588595) loss:3.096 lr:0.0000100 epoch_Time:25025.0min: [2024-01-05 14:46:47,259][model8_pretrain.py][INFO] Epoch:[0/2](652300/4588595) loss:1.867 lr:0.0000100 epoch_Time:25025.0min: [2024-01-05 14:47:32,857][model8_pretrain.py][INFO] Epoch:[0/2](652400/4588595) loss:2.471 lr:0.0000100 epoch_Time:25025.0min: [2024-01-05 14:47:32,857][model8_pretrain.py][INFO] Epoch:[0/2](652400/4588595) loss:2.730 lr:0.0000100 epoch_Time:25025.0min: [2024-01-05 14:47:32,857][model8_pretrain.py][INFO] Epoch:[0/2](652400/4588595) loss:3.361 lr:0.0000100 epoch_Time:25025.0min: [2024-01-05 14:47:32,857][model8_pretrain.py][INFO] Epoch:[0/2](652400/4588595) loss:2.237 lr:0.0000100 epoch_Time:25025.0min: [2024-01-05 
14:47:32,857][model8_pretrain.py][INFO] Epoch:[0/2](652400/4588595) loss:2.927 lr:0.0000100 epoch_Time:25025.0min: [2024-01-05 14:47:32,857][model8_pretrain.py][INFO] Epoch:[0/2](652400/4588595) loss:2.595 lr:0.0000100 epoch_Time:25025.0min: [2024-01-05 14:47:32,857][model8_pretrain.py][INFO] Epoch:[0/2](652400/4588595) loss:3.372 lr:0.0000100 epoch_Time:25025.0min: [2024-01-05 14:47:32,857][model8_pretrain.py][INFO] Epoch:[0/2](652400/4588595) loss:2.831 lr:0.0000100 epoch_Time:25025.0min: [2024-01-05 14:48:13,242][model8_pretrain.py][INFO] Epoch:[0/2](652500/4588595) loss:3.120 lr:0.0000100 epoch_Time:25025.0min: [2024-01-05 14:48:13,242][model8_pretrain.py][INFO] Epoch:[0/2](652500/4588595) loss:2.680 lr:0.0000100 epoch_Time:25025.0min: [2024-01-05 14:48:13,242][model8_pretrain.py][INFO] Epoch:[0/2](652500/4588595) loss:2.368 lr:0.0000100 epoch_Time:25025.0min: [2024-01-05 14:48:13,242][model8_pretrain.py][INFO] Epoch:[0/2](652500/4588595) loss:3.156 lr:0.0000100 epoch_Time:25025.0min: [2024-01-05 14:48:13,242][model8_pretrain.py][INFO] Epoch:[0/2](652500/4588595) loss:2.967 lr:0.0000100 epoch_Time:25025.0min: [2024-01-05 14:48:13,243][model8_pretrain.py][INFO] Epoch:[0/2](652500/4588595) loss:2.902 lr:0.0000100 epoch_Time:25025.0min: [2024-01-05 14:48:13,243][model8_pretrain.py][INFO] Epoch:[0/2](652500/4588595) loss:2.598 lr:0.0000100 epoch_Time:25025.0min: [2024-01-05 14:48:13,243][model8_pretrain.py][INFO] Epoch:[0/2](652500/4588595) loss:2.638 lr:0.0000100 epoch_Time:25025.0min: [2024-01-05 14:48:50,170][model8_pretrain.py][INFO] Epoch:[0/2](652600/4588595) loss:2.633 lr:0.0000100 epoch_Time:25023.0min: [2024-01-05 14:48:50,170][model8_pretrain.py][INFO] Epoch:[0/2](652600/4588595) loss:2.770 lr:0.0000100 epoch_Time:25023.0min: [2024-01-05 14:48:50,170][model8_pretrain.py][INFO] Epoch:[0/2](652600/4588595) loss:2.541 lr:0.0000100 epoch_Time:25023.0min: [2024-01-05 14:48:50,170][model8_pretrain.py][INFO] Epoch:[0/2](652600/4588595) loss:2.763 lr:0.0000100 
epoch_Time:25023.0min: [2024-01-05 14:48:50,170][model8_pretrain.py][INFO] Epoch:[0/2](652600/4588595) loss:2.921 lr:0.0000100 epoch_Time:25023.0min: [2024-01-05 14:48:50,171][model8_pretrain.py][INFO] Epoch:[0/2](652600/4588595) loss:2.977 lr:0.0000100 epoch_Time:25023.0min: [2024-01-05 14:48:50,171][model8_pretrain.py][INFO] Epoch:[0/2](652600/4588595) loss:2.797 lr:0.0000100 epoch_Time:25023.0min: [2024-01-05 14:48:50,172][model8_pretrain.py][INFO] Epoch:[0/2](652600/4588595) loss:2.883 lr:0.0000100 epoch_Time:25023.0min: [2024-01-05 14:49:27,103][model8_pretrain.py][INFO] Epoch:[0/2](652700/4588595) loss:2.835 lr:0.0000100 epoch_Time:25023.0min: [2024-01-05 14:49:27,103][model8_pretrain.py][INFO] Epoch:[0/2](652700/4588595) loss:2.580 lr:0.0000100 epoch_Time:25023.0min: [2024-01-05 14:49:27,103][model8_pretrain.py][INFO] Epoch:[0/2](652700/4588595) loss:2.142 lr:0.0000100 epoch_Time:25023.0min: [2024-01-05 14:49:27,103][model8_pretrain.py][INFO] Epoch:[0/2](652700/4588595) loss:3.021 lr:0.0000100 epoch_Time:25023.0min: [2024-01-05 14:49:27,103][model8_pretrain.py][INFO] Epoch:[0/2](652700/4588595) loss:2.818 lr:0.0000100 epoch_Time:25023.0min: [2024-01-05 14:49:27,103][model8_pretrain.py][INFO] Epoch:[0/2](652700/4588595) loss:3.072 lr:0.0000100 epoch_Time:25023.0min: [2024-01-05 14:49:27,103][model8_pretrain.py][INFO] Epoch:[0/2](652700/4588595) loss:2.932 lr:0.0000100 epoch_Time:25023.0min: [2024-01-05 14:49:27,103][model8_pretrain.py][INFO] Epoch:[0/2](652700/4588595) loss:2.734 lr:0.0000100 epoch_Time:25023.0min: [2024-01-05 14:50:04,032][model8_pretrain.py][INFO] Epoch:[0/2](652800/4588595) loss:2.825 lr:0.0000100 epoch_Time:25022.0min: [2024-01-05 14:50:04,032][model8_pretrain.py][INFO] Epoch:[0/2](652800/4588595) loss:2.278 lr:0.0000100 epoch_Time:25022.0min: [2024-01-05 14:50:04,032][model8_pretrain.py][INFO] Epoch:[0/2](652800/4588595) loss:2.604 lr:0.0000100 epoch_Time:25022.0min: [2024-01-05 14:50:04,032][model8_pretrain.py][INFO] 
Epoch:[0/2](652800/4588595) loss:3.061 lr:0.0000100 epoch_Time:25022.0min: [2024-01-05 14:50:04,032][model8_pretrain.py][INFO] Epoch:[0/2](652800/4588595) loss:2.566 lr:0.0000100 epoch_Time:25022.0min: [2024-01-05 14:50:04,032][model8_pretrain.py][INFO] Epoch:[0/2](652800/4588595) loss:3.004 lr:0.0000100 epoch_Time:25022.0min: [2024-01-05 14:50:04,033][model8_pretrain.py][INFO] Epoch:[0/2](652800/4588595) loss:3.336 lr:0.0000100 epoch_Time:25022.0min: [2024-01-05 14:50:04,033][model8_pretrain.py][INFO] Epoch:[0/2](652800/4588595) loss:2.425 lr:0.0000100 epoch_Time:25022.0min: [2024-01-05 14:50:40,974][model8_pretrain.py][INFO] Epoch:[0/2](652900/4588595) loss:2.099 lr:0.0000100 epoch_Time:25022.0min: [2024-01-05 14:50:40,974][model8_pretrain.py][INFO] Epoch:[0/2](652900/4588595) loss:3.128 lr:0.0000100 epoch_Time:25022.0min: [2024-01-05 14:50:40,974][model8_pretrain.py][INFO] Epoch:[0/2](652900/4588595) loss:2.870 lr:0.0000100 epoch_Time:25022.0min: [2024-01-05 14:50:40,974][model8_pretrain.py][INFO] Epoch:[0/2](652900/4588595) loss:3.004 lr:0.0000100 epoch_Time:25022.0min: [2024-01-05 14:50:40,974][model8_pretrain.py][INFO] Epoch:[0/2](652900/4588595) loss:2.251 lr:0.0000100 epoch_Time:25022.0min: [2024-01-05 14:50:40,974][model8_pretrain.py][INFO] Epoch:[0/2](652900/4588595) loss:2.583 lr:0.0000100 epoch_Time:25022.0min: [2024-01-05 14:50:40,974][model8_pretrain.py][INFO] Epoch:[0/2](652900/4588595) loss:2.968 lr:0.0000100 epoch_Time:25022.0min: [2024-01-05 14:50:40,974][model8_pretrain.py][INFO] Epoch:[0/2](652900/4588595) loss:2.383 lr:0.0000100 epoch_Time:25022.0min: [2024-01-05 14:51:17,901][model8_pretrain.py][INFO] Epoch:[0/2](653000/4588595) loss:2.660 lr:0.0000100 epoch_Time:25021.0min: [2024-01-05 14:51:17,901][model8_pretrain.py][INFO] Epoch:[0/2](653000/4588595) loss:3.103 lr:0.0000100 epoch_Time:25021.0min: [2024-01-05 14:51:17,901][model8_pretrain.py][INFO] Epoch:[0/2](653000/4588595) loss:2.032 lr:0.0000100 epoch_Time:25021.0min: [2024-01-05 
14:51:17,901][model8_pretrain.py][INFO] Epoch:[0/2](653000/4588595) loss:2.518 lr:0.0000100 epoch_Time:25021.0min: [2024-01-05 14:51:17,901][model8_pretrain.py][INFO] Epoch:[0/2](653000/4588595) loss:2.710 lr:0.0000100 epoch_Time:25021.0min: [2024-01-05 14:51:17,901][model8_pretrain.py][INFO] Epoch:[0/2](653000/4588595) loss:2.909 lr:0.0000100 epoch_Time:25021.0min: [2024-01-05 14:51:17,902][model8_pretrain.py][INFO] Epoch:[0/2](653000/4588595) loss:2.789 lr:0.0000100 epoch_Time:25021.0min: [2024-01-05 14:51:17,901][model8_pretrain.py][INFO] Epoch:[0/2](653000/4588595) loss:2.454 lr:0.0000100 epoch_Time:25021.0min: [2024-01-05 14:51:54,841][model8_pretrain.py][INFO] Epoch:[0/2](653100/4588595) loss:2.767 lr:0.0000100 epoch_Time:25020.0min: [2024-01-05 14:51:54,841][model8_pretrain.py][INFO] Epoch:[0/2](653100/4588595) loss:2.718 lr:0.0000100 epoch_Time:25020.0min: [2024-01-05 14:51:54,841][model8_pretrain.py][INFO] Epoch:[0/2](653100/4588595) loss:2.812 lr:0.0000100 epoch_Time:25020.0min: [2024-01-05 14:51:54,841][model8_pretrain.py][INFO] Epoch:[0/2](653100/4588595) loss:2.520 lr:0.0000100 epoch_Time:25020.0min: [2024-01-05 14:51:54,841][model8_pretrain.py][INFO] Epoch:[0/2](653100/4588595) loss:3.054 lr:0.0000100 epoch_Time:25020.0min: [2024-01-05 14:51:54,841][model8_pretrain.py][INFO] Epoch:[0/2](653100/4588595) loss:3.323 lr:0.0000100 epoch_Time:25020.0min: [2024-01-05 14:51:54,841][model8_pretrain.py][INFO] Epoch:[0/2](653100/4588595) loss:3.287 lr:0.0000100 epoch_Time:25020.0min: [2024-01-05 14:51:54,841][model8_pretrain.py][INFO] Epoch:[0/2](653100/4588595) loss:2.612 lr:0.0000100 epoch_Time:25020.0min: [2024-01-05 14:52:40,427][model8_pretrain.py][INFO] Epoch:[0/2](653200/4588595) loss:3.228 lr:0.0000100 epoch_Time:25021.0min: [2024-01-05 14:52:40,427][model8_pretrain.py][INFO] Epoch:[0/2](653200/4588595) loss:2.924 lr:0.0000100 epoch_Time:25021.0min: [2024-01-05 14:52:40,427][model8_pretrain.py][INFO] Epoch:[0/2](653200/4588595) loss:2.735 lr:0.0000100 
epoch_Time:25021.0min: [2024-01-05 14:52:40,427][model8_pretrain.py][INFO] Epoch:[0/2](653200/4588595) loss:2.779 lr:0.0000100 epoch_Time:25021.0min: [2024-01-05 14:52:40,431][model8_pretrain.py][INFO] Epoch:[0/2](653200/4588595) loss:2.802 lr:0.0000100 epoch_Time:25021.0min: [2024-01-05 14:52:40,432][model8_pretrain.py][INFO] Epoch:[0/2](653200/4588595) loss:2.883 lr:0.0000100 epoch_Time:25021.0min: [2024-01-05 14:52:40,432][model8_pretrain.py][INFO] Epoch:[0/2](653200/4588595) loss:3.020 lr:0.0000100 epoch_Time:25021.0min: [2024-01-05 14:52:40,432][model8_pretrain.py][INFO] Epoch:[0/2](653200/4588595) loss:2.646 lr:0.0000100 epoch_Time:25021.0min: [2024-01-05 14:53:20,841][model8_pretrain.py][INFO] Epoch:[0/2](653300/4588595) loss:2.645 lr:0.0000100 epoch_Time:25020.0min: [2024-01-05 14:53:20,841][model8_pretrain.py][INFO] Epoch:[0/2](653300/4588595) loss:2.712 lr:0.0000100 epoch_Time:25020.0min: [2024-01-05 14:53:20,841][model8_pretrain.py][INFO] Epoch:[0/2](653300/4588595) loss:2.918 lr:0.0000100 epoch_Time:25020.0min: [2024-01-05 14:53:20,841][model8_pretrain.py][INFO] Epoch:[0/2](653300/4588595) loss:2.913 lr:0.0000100 epoch_Time:25020.0min: [2024-01-05 14:53:20,841][model8_pretrain.py][INFO] Epoch:[0/2](653300/4588595) loss:2.967 lr:0.0000100 epoch_Time:25020.0min: [2024-01-05 14:53:20,841][model8_pretrain.py][INFO] Epoch:[0/2](653300/4588595) loss:2.819 lr:0.0000100 epoch_Time:25020.0min: [2024-01-05 14:53:20,841][model8_pretrain.py][INFO] Epoch:[0/2](653300/4588595) loss:2.593 lr:0.0000100 epoch_Time:25020.0min: [2024-01-05 14:53:20,842][model8_pretrain.py][INFO] Epoch:[0/2](653300/4588595) loss:2.456 lr:0.0000100 epoch_Time:25020.0min: [2024-01-05 14:53:57,779][model8_pretrain.py][INFO] Epoch:[0/2](653400/4588595) loss:2.625 lr:0.0000100 epoch_Time:25019.0min: [2024-01-05 14:53:57,779][model8_pretrain.py][INFO] Epoch:[0/2](653400/4588595) loss:3.194 lr:0.0000100 epoch_Time:25019.0min: [2024-01-05 14:53:57,779][model8_pretrain.py][INFO] 
Epoch:[0/2](653400/4588595) loss:2.528 lr:0.0000100 epoch_Time:25019.0min: [2024-01-05 14:53:57,779][model8_pretrain.py][INFO] Epoch:[0/2](653400/4588595) loss:3.290 lr:0.0000100 epoch_Time:25019.0min: [2024-01-05 14:53:57,779][model8_pretrain.py][INFO] Epoch:[0/2](653400/4588595) loss:2.699 lr:0.0000100 epoch_Time:25019.0min: [2024-01-05 14:53:57,779][model8_pretrain.py][INFO] Epoch:[0/2](653400/4588595) loss:3.178 lr:0.0000100 epoch_Time:25019.0min: [2024-01-05 14:53:57,780][model8_pretrain.py][INFO] Epoch:[0/2](653400/4588595) loss:2.702 lr:0.0000100 epoch_Time:25019.0min: [2024-01-05 14:53:57,780][model8_pretrain.py][INFO] Epoch:[0/2](653400/4588595) loss:3.001 lr:0.0000100 epoch_Time:25019.0min: [2024-01-05 14:54:34,742][model8_pretrain.py][INFO] Epoch:[0/2](653500/4588595) loss:2.794 lr:0.0000100 epoch_Time:25019.0min: [2024-01-05 14:54:34,742][model8_pretrain.py][INFO] Epoch:[0/2](653500/4588595) loss:2.616 lr:0.0000100 epoch_Time:25019.0min: [2024-01-05 14:54:34,742][model8_pretrain.py][INFO] Epoch:[0/2](653500/4588595) loss:3.354 lr:0.0000100 epoch_Time:25019.0min: [2024-01-05 14:54:34,742][model8_pretrain.py][INFO] Epoch:[0/2](653500/4588595) loss:2.987 lr:0.0000100 epoch_Time:25019.0min: [2024-01-05 14:54:34,742][model8_pretrain.py][INFO] Epoch:[0/2](653500/4588595) loss:2.397 lr:0.0000100 epoch_Time:25019.0min: [2024-01-05 14:54:34,742][model8_pretrain.py][INFO] Epoch:[0/2](653500/4588595) loss:2.940 lr:0.0000100 epoch_Time:25019.0min: [2024-01-05 14:54:34,742][model8_pretrain.py][INFO] Epoch:[0/2](653500/4588595) loss:2.660 lr:0.0000100 epoch_Time:25019.0min: [2024-01-05 14:54:34,742][model8_pretrain.py][INFO] Epoch:[0/2](653500/4588595) loss:2.269 lr:0.0000100 epoch_Time:25019.0min: [2024-01-05 14:55:11,705][model8_pretrain.py][INFO] Epoch:[0/2](653600/4588595) loss:3.033 lr:0.0000100 epoch_Time:25017.0min: [2024-01-05 14:55:11,705][model8_pretrain.py][INFO] Epoch:[0/2](653600/4588595) loss:3.187 lr:0.0000100 epoch_Time:25017.0min: [2024-01-05 
14:55:11,705][model8_pretrain.py][INFO] Epoch:[0/2](653600/4588595) loss:3.076 lr:0.0000100 epoch_Time:25017.0min: [2024-01-05 14:55:11,705][model8_pretrain.py][INFO] Epoch:[0/2](653600/4588595) loss:1.917 lr:0.0000100 epoch_Time:25017.0min: [2024-01-05 14:55:11,705][model8_pretrain.py][INFO] Epoch:[0/2](653600/4588595) loss:2.932 lr:0.0000100 epoch_Time:25017.0min: [2024-01-05 14:55:11,705][model8_pretrain.py][INFO] Epoch:[0/2](653600/4588595) loss:2.100 lr:0.0000100 epoch_Time:25017.0min: [2024-01-05 14:55:11,705][model8_pretrain.py][INFO] Epoch:[0/2](653600/4588595) loss:3.242 lr:0.0000100 epoch_Time:25017.0min: [2024-01-05 14:55:11,705][model8_pretrain.py][INFO] Epoch:[0/2](653600/4588595) loss:3.163 lr:0.0000100 epoch_Time:25017.0min: [2024-01-05 14:55:48,642][model8_pretrain.py][INFO] Epoch:[0/2](653700/4588595) loss:2.793 lr:0.0000100 epoch_Time:25016.0min: [2024-01-05 14:55:48,642][model8_pretrain.py][INFO] Epoch:[0/2](653700/4588595) loss:2.831 lr:0.0000100 epoch_Time:25016.0min: [2024-01-05 14:55:48,642][model8_pretrain.py][INFO] Epoch:[0/2](653700/4588595) loss:2.727 lr:0.0000100 epoch_Time:25016.0min: [2024-01-05 14:55:48,642][model8_pretrain.py][INFO] Epoch:[0/2](653700/4588595) loss:2.825 lr:0.0000100 epoch_Time:25016.0min: [2024-01-05 14:55:48,642][model8_pretrain.py][INFO] Epoch:[0/2](653700/4588595) loss:2.975 lr:0.0000100 epoch_Time:25016.0min: [2024-01-05 14:55:48,642][model8_pretrain.py][INFO] Epoch:[0/2](653700/4588595) loss:2.563 lr:0.0000100 epoch_Time:25016.0min: [2024-01-05 14:55:48,642][model8_pretrain.py][INFO] Epoch:[0/2](653700/4588595) loss:2.427 lr:0.0000100 epoch_Time:25016.0min: [2024-01-05 14:55:48,642][model8_pretrain.py][INFO] Epoch:[0/2](653700/4588595) loss:2.391 lr:0.0000100 epoch_Time:25016.0min: [2024-01-05 14:56:25,582][model8_pretrain.py][INFO] Epoch:[0/2](653800/4588595) loss:3.393 lr:0.0000100 epoch_Time:25016.0min: [2024-01-05 14:56:25,582][model8_pretrain.py][INFO] Epoch:[0/2](653800/4588595) loss:2.929 lr:0.0000100 
epoch_Time:25016.0min: [2024-01-05 14:56:25,582][model8_pretrain.py][INFO] Epoch:[0/2](653800/4588595) loss:2.942 lr:0.0000100 epoch_Time:25016.0min: [2024-01-05 14:56:25,582][model8_pretrain.py][INFO] Epoch:[0/2](653800/4588595) loss:3.129 lr:0.0000100 epoch_Time:25016.0min: [2024-01-05 14:56:25,582][model8_pretrain.py][INFO] Epoch:[0/2](653800/4588595) loss:2.341 lr:0.0000100 epoch_Time:25016.0min: [2024-01-05 14:56:25,582][model8_pretrain.py][INFO] Epoch:[0/2](653800/4588595) loss:2.973 lr:0.0000100 epoch_Time:25016.0min: [2024-01-05 14:56:25,582][model8_pretrain.py][INFO] Epoch:[0/2](653800/4588595) loss:2.657 lr:0.0000100 epoch_Time:25016.0min: [2024-01-05 14:56:25,582][model8_pretrain.py][INFO] Epoch:[0/2](653800/4588595) loss:2.530 lr:0.0000100 epoch_Time:25016.0min: [2024-01-05 14:57:02,493][model8_pretrain.py][INFO] Epoch:[0/2](653900/4588595) loss:2.874 lr:0.0000100 epoch_Time:25015.0min: [2024-01-05 14:57:02,493][model8_pretrain.py][INFO] Epoch:[0/2](653900/4588595) loss:2.648 lr:0.0000100 epoch_Time:25015.0min: [2024-01-05 14:57:02,493][model8_pretrain.py][INFO] Epoch:[0/2](653900/4588595) loss:2.964 lr:0.0000100 epoch_Time:25015.0min: [2024-01-05 14:57:02,493][model8_pretrain.py][INFO] Epoch:[0/2](653900/4588595) loss:3.065 lr:0.0000100 epoch_Time:25015.0min: [2024-01-05 14:57:02,493][model8_pretrain.py][INFO] Epoch:[0/2](653900/4588595) loss:2.909 lr:0.0000100 epoch_Time:25015.0min: [2024-01-05 14:57:02,493][model8_pretrain.py][INFO] Epoch:[0/2](653900/4588595) loss:3.052 lr:0.0000100 epoch_Time:25015.0min: [2024-01-05 14:57:02,494][model8_pretrain.py][INFO] Epoch:[0/2](653900/4588595) loss:2.913 lr:0.0000100 epoch_Time:25015.0min: [2024-01-05 14:57:02,494][model8_pretrain.py][INFO] Epoch:[0/2](653900/4588595) loss:3.016 lr:0.0000100 epoch_Time:25015.0min: [2024-01-05 14:57:46,168][model8_pretrain.py][INFO] Epoch:[0/2](654000/4588595) loss:2.396 lr:0.0000100 epoch_Time:25016.0min: [2024-01-05 14:57:46,168][model8_pretrain.py][INFO] 
Epoch:[0/2](654000/4588595) loss:2.345 lr:0.0000100 epoch_Time:25016.0min: [2024-01-05 14:57:46,168][model8_pretrain.py][INFO] Epoch:[0/2](654000/4588595) loss:3.408 lr:0.0000100 epoch_Time:25016.0min: [2024-01-05 14:57:46,172][model8_pretrain.py][INFO] Epoch:[0/2](654000/4588595) loss:2.810 lr:0.0000100 epoch_Time:25016.0min: [2024-01-05 14:57:46,173][model8_pretrain.py][INFO] Epoch:[0/2](654000/4588595) loss:2.611 lr:0.0000100 epoch_Time:25016.0min: [2024-01-05 14:57:46,173][model8_pretrain.py][INFO] Epoch:[0/2](654000/4588595) loss:3.100 lr:0.0000100 epoch_Time:25016.0min: [2024-01-05 14:57:46,173][model8_pretrain.py][INFO] Epoch:[0/2](654000/4588595) loss:2.725 lr:0.0000100 epoch_Time:25016.0min: [2024-01-05 14:57:46,173][model8_pretrain.py][INFO] Epoch:[0/2](654000/4588595) loss:3.123 lr:0.0000100 epoch_Time:25016.0min: [2024-01-05 14:58:28,341][model8_pretrain.py][INFO] Epoch:[0/2](654100/4588595) loss:3.162 lr:0.0000100 epoch_Time:25015.0min: [2024-01-05 14:58:28,342][model8_pretrain.py][INFO] Epoch:[0/2](654100/4588595) loss:2.922 lr:0.0000100 epoch_Time:25015.0min: [2024-01-05 14:58:28,342][model8_pretrain.py][INFO] Epoch:[0/2](654100/4588595) loss:2.584 lr:0.0000100 epoch_Time:25015.0min: [2024-01-05 14:58:28,342][model8_pretrain.py][INFO] Epoch:[0/2](654100/4588595) loss:3.157 lr:0.0000100 epoch_Time:25015.0min: [2024-01-05 14:58:28,342][model8_pretrain.py][INFO] Epoch:[0/2](654100/4588595) loss:2.725 lr:0.0000100 epoch_Time:25015.0min: [2024-01-05 14:58:28,342][model8_pretrain.py][INFO] Epoch:[0/2](654100/4588595) loss:2.608 lr:0.0000100 epoch_Time:25015.0min: [2024-01-05 14:58:28,342][model8_pretrain.py][INFO] Epoch:[0/2](654100/4588595) loss:2.808 lr:0.0000100 epoch_Time:25015.0min: [2024-01-05 14:58:28,342][model8_pretrain.py][INFO] Epoch:[0/2](654100/4588595) loss:2.948 lr:0.0000100 epoch_Time:25015.0min: [2024-01-05 14:59:05,282][model8_pretrain.py][INFO] Epoch:[0/2](654200/4588595) loss:2.585 lr:0.0000100 epoch_Time:25014.0min: [2024-01-05 
14:59:05,282][model8_pretrain.py][INFO] Epoch:[0/2](654200/4588595) loss:2.387 lr:0.0000100 epoch_Time:25014.0min: [2024-01-05 14:59:05,283][model8_pretrain.py][INFO] Epoch:[0/2](654200/4588595) loss:2.873 lr:0.0000100 epoch_Time:25014.0min: [2024-01-05 14:59:05,283][model8_pretrain.py][INFO] Epoch:[0/2](654200/4588595) loss:2.890 lr:0.0000100 epoch_Time:25014.0min: [2024-01-05 14:59:05,283][model8_pretrain.py][INFO] Epoch:[0/2](654200/4588595) loss:2.851 lr:0.0000100 epoch_Time:25014.0min: [2024-01-05 14:59:05,283][model8_pretrain.py][INFO] Epoch:[0/2](654200/4588595) loss:2.112 lr:0.0000100 epoch_Time:25014.0min: [2024-01-05 14:59:05,283][model8_pretrain.py][INFO] Epoch:[0/2](654200/4588595) loss:3.344 lr:0.0000100 epoch_Time:25014.0min: [2024-01-05 14:59:05,283][model8_pretrain.py][INFO] Epoch:[0/2](654200/4588595) loss:2.610 lr:0.0000100 epoch_Time:25014.0min: [2024-01-05 14:59:42,314][model8_pretrain.py][INFO] Epoch:[0/2](654300/4588595) loss:2.684 lr:0.0000100 epoch_Time:25014.0min: [2024-01-05 14:59:42,314][model8_pretrain.py][INFO] Epoch:[0/2](654300/4588595) loss:2.518 lr:0.0000100 epoch_Time:25014.0min: [2024-01-05 14:59:42,314][model8_pretrain.py][INFO] Epoch:[0/2](654300/4588595) loss:2.765 lr:0.0000100 epoch_Time:25014.0min: [2024-01-05 14:59:42,314][model8_pretrain.py][INFO] Epoch:[0/2](654300/4588595) loss:3.155 lr:0.0000100 epoch_Time:25014.0min: [2024-01-05 14:59:42,314][model8_pretrain.py][INFO] Epoch:[0/2](654300/4588595) loss:2.835 lr:0.0000100 epoch_Time:25014.0min: [2024-01-05 14:59:42,314][model8_pretrain.py][INFO] Epoch:[0/2](654300/4588595) loss:2.653 lr:0.0000100 epoch_Time:25014.0min: [2024-01-05 14:59:42,314][model8_pretrain.py][INFO] Epoch:[0/2](654300/4588595) loss:2.617 lr:0.0000100 epoch_Time:25014.0min: [2024-01-05 14:59:42,314][model8_pretrain.py][INFO] Epoch:[0/2](654300/4588595) loss:2.810 lr:0.0000100 epoch_Time:25014.0min: [2024-01-05 15:00:19,278][model8_pretrain.py][INFO] Epoch:[0/2](654400/4588595) loss:2.824 lr:0.0000100 
epoch_Time:25013.0min: [2024-01-05 15:00:19,278][model8_pretrain.py][INFO] Epoch:[0/2](654400/4588595) loss:2.929 lr:0.0000100 epoch_Time:25013.0min: [2024-01-05 15:00:19,278][model8_pretrain.py][INFO] Epoch:[0/2](654400/4588595) loss:2.658 lr:0.0000100 epoch_Time:25013.0min: [2024-01-05 15:00:19,278][model8_pretrain.py][INFO] Epoch:[0/2](654400/4588595) loss:2.702 lr:0.0000100 epoch_Time:25013.0min: [2024-01-05 15:00:19,278][model8_pretrain.py][INFO] Epoch:[0/2](654400/4588595) loss:2.947 lr:0.0000100 epoch_Time:25013.0min: [2024-01-05 15:00:19,278][model8_pretrain.py][INFO] Epoch:[0/2](654400/4588595) loss:2.830 lr:0.0000100 epoch_Time:25013.0min: [2024-01-05 15:00:19,278][model8_pretrain.py][INFO] Epoch:[0/2](654400/4588595) loss:2.884 lr:0.0000100 epoch_Time:25013.0min: [2024-01-05 15:00:19,278][model8_pretrain.py][INFO] Epoch:[0/2](654400/4588595) loss:2.932 lr:0.0000100 epoch_Time:25013.0min: [2024-01-05 15:00:56,325][model8_pretrain.py][INFO] Epoch:[0/2](654500/4588595) loss:2.664 lr:0.0000100 epoch_Time:25012.0min: [2024-01-05 15:00:56,325][model8_pretrain.py][INFO] Epoch:[0/2](654500/4588595) loss:3.184 lr:0.0000100 epoch_Time:25012.0min: [2024-01-05 15:00:56,325][model8_pretrain.py][INFO] Epoch:[0/2](654500/4588595) loss:2.906 lr:0.0000100 epoch_Time:25012.0min: [2024-01-05 15:00:56,325][model8_pretrain.py][INFO] Epoch:[0/2](654500/4588595) loss:3.154 lr:0.0000100 epoch_Time:25012.0min: [2024-01-05 15:00:56,325][model8_pretrain.py][INFO] Epoch:[0/2](654500/4588595) loss:2.886 lr:0.0000100 epoch_Time:25012.0min: [2024-01-05 15:00:56,325][model8_pretrain.py][INFO] Epoch:[0/2](654500/4588595) loss:2.965 lr:0.0000100 epoch_Time:25012.0min: [2024-01-05 15:00:56,325][model8_pretrain.py][INFO] Epoch:[0/2](654500/4588595) loss:2.818 lr:0.0000100 epoch_Time:25012.0min: [2024-01-05 15:00:56,325][model8_pretrain.py][INFO] Epoch:[0/2](654500/4588595) loss:2.828 lr:0.0000100 epoch_Time:25012.0min: [2024-01-05 15:01:33,290][model8_pretrain.py][INFO] 
Epoch:[0/2](654600/4588595) loss:2.535 lr:0.0000100 epoch_Time:25011.0min: [2024-01-05 15:01:33,290][model8_pretrain.py][INFO] Epoch:[0/2](654600/4588595) loss:2.857 lr:0.0000100 epoch_Time:25011.0min: [2024-01-05 15:01:33,290][model8_pretrain.py][INFO] Epoch:[0/2](654600/4588595) loss:2.534 lr:0.0000100 epoch_Time:25011.0min: [2024-01-05 15:01:33,290][model8_pretrain.py][INFO] Epoch:[0/2](654600/4588595) loss:2.702 lr:0.0000100 epoch_Time:25011.0min: [2024-01-05 15:01:33,290][model8_pretrain.py][INFO] Epoch:[0/2](654600/4588595) loss:2.732 lr:0.0000100 epoch_Time:25011.0min: [2024-01-05 15:01:33,290][model8_pretrain.py][INFO] Epoch:[0/2](654600/4588595) loss:2.830 lr:0.0000100 epoch_Time:25011.0min: [2024-01-05 15:01:33,291][model8_pretrain.py][INFO] Epoch:[0/2](654600/4588595) loss:2.768 lr:0.0000100 epoch_Time:25011.0min: [2024-01-05 15:01:33,291][model8_pretrain.py][INFO] Epoch:[0/2](654600/4588595) loss:2.894 lr:0.0000100 epoch_Time:25011.0min: [2024-01-05 15:02:10,267][model8_pretrain.py][INFO] Epoch:[0/2](654700/4588595) loss:2.682 lr:0.0000100 epoch_Time:25010.0min: [2024-01-05 15:02:10,267][model8_pretrain.py][INFO] Epoch:[0/2](654700/4588595) loss:2.978 lr:0.0000100 epoch_Time:25010.0min: [2024-01-05 15:02:10,267][model8_pretrain.py][INFO] Epoch:[0/2](654700/4588595) loss:2.869 lr:0.0000100 epoch_Time:25010.0min: [2024-01-05 15:02:10,267][model8_pretrain.py][INFO] Epoch:[0/2](654700/4588595) loss:2.806 lr:0.0000100 epoch_Time:25010.0min: [2024-01-05 15:02:10,267][model8_pretrain.py][INFO] Epoch:[0/2](654700/4588595) loss:2.569 lr:0.0000100 epoch_Time:25010.0min: [2024-01-05 15:02:10,267][model8_pretrain.py][INFO] Epoch:[0/2](654700/4588595) loss:2.977 lr:0.0000100 epoch_Time:25010.0min: [2024-01-05 15:02:10,267][model8_pretrain.py][INFO] Epoch:[0/2](654700/4588595) loss:2.889 lr:0.0000100 epoch_Time:25010.0min: [2024-01-05 15:02:10,267][model8_pretrain.py][INFO] Epoch:[0/2](654700/4588595) loss:2.335 lr:0.0000100 epoch_Time:25010.0min: [2024-01-05 
15:02:50,646][model8_pretrain.py][INFO] Epoch:[0/2](654800/4588595) loss:2.922 lr:0.0000100 epoch_Time:25010.0min: [2024-01-05 15:02:50,646][model8_pretrain.py][INFO] Epoch:[0/2](654800/4588595) loss:3.330 lr:0.0000100 epoch_Time:25010.0min: [2024-01-05 15:02:50,646][model8_pretrain.py][INFO] Epoch:[0/2](654800/4588595) loss:2.999 lr:0.0000100 epoch_Time:25010.0min: [2024-01-05 15:02:50,646][model8_pretrain.py][INFO] Epoch:[0/2](654800/4588595) loss:2.620 lr:0.0000100 epoch_Time:25010.0min: [2024-01-05 15:02:50,646][model8_pretrain.py][INFO] Epoch:[0/2](654800/4588595) loss:3.111 lr:0.0000100 epoch_Time:25010.0min: [2024-01-05 15:02:50,646][model8_pretrain.py][INFO] Epoch:[0/2](654800/4588595) loss:3.291 lr:0.0000100 epoch_Time:25010.0min: [2024-01-05 15:02:50,646][model8_pretrain.py][INFO] Epoch:[0/2](654800/4588595) loss:2.936 lr:0.0000100 epoch_Time:25010.0min: [2024-01-05 15:02:50,646][model8_pretrain.py][INFO] Epoch:[0/2](654800/4588595) loss:3.234 lr:0.0000100 epoch_Time:25010.0min: [2024-01-05 15:03:36,458][model8_pretrain.py][INFO] Epoch:[0/2](654900/4588595) loss:2.943 lr:0.0000100 epoch_Time:25010.0min: [2024-01-05 15:03:36,458][model8_pretrain.py][INFO] Epoch:[0/2](654900/4588595) loss:2.628 lr:0.0000100 epoch_Time:25010.0min: [2024-01-05 15:03:36,458][model8_pretrain.py][INFO] Epoch:[0/2](654900/4588595) loss:3.204 lr:0.0000100 epoch_Time:25010.0min: [2024-01-05 15:03:36,458][model8_pretrain.py][INFO] Epoch:[0/2](654900/4588595) loss:3.489 lr:0.0000100 epoch_Time:25010.0min: [2024-01-05 15:03:36,458][model8_pretrain.py][INFO] Epoch:[0/2](654900/4588595) loss:2.966 lr:0.0000100 epoch_Time:25010.0min: [2024-01-05 15:03:36,458][model8_pretrain.py][INFO] Epoch:[0/2](654900/4588595) loss:2.761 lr:0.0000100 epoch_Time:25010.0min: [2024-01-05 15:03:36,458][model8_pretrain.py][INFO] Epoch:[0/2](654900/4588595) loss:2.608 lr:0.0000100 epoch_Time:25010.0min: [2024-01-05 15:03:36,458][model8_pretrain.py][INFO] Epoch:[0/2](654900/4588595) loss:2.807 lr:0.0000100 
epoch_Time:25010.0min: [2024-01-05 15:04:13,393][model8_pretrain.py][INFO] Epoch:[0/2](655000/4588595) loss:2.660 lr:0.0000100 epoch_Time:25009.0min: [2024-01-05 15:04:13,393][model8_pretrain.py][INFO] Epoch:[0/2](655000/4588595) loss:2.438 lr:0.0000100 epoch_Time:25009.0min: [2024-01-05 15:04:13,393][model8_pretrain.py][INFO] Epoch:[0/2](655000/4588595) loss:2.198 lr:0.0000100 epoch_Time:25009.0min: [2024-01-05 15:04:13,393][model8_pretrain.py][INFO] Epoch:[0/2](655000/4588595) loss:3.168 lr:0.0000100 epoch_Time:25009.0min: [2024-01-05 15:04:13,393][model8_pretrain.py][INFO] Epoch:[0/2](655000/4588595) loss:2.817 lr:0.0000100 epoch_Time:25009.0min: [2024-01-05 15:04:13,393][model8_pretrain.py][INFO] Epoch:[0/2](655000/4588595) loss:2.733 lr:0.0000100 epoch_Time:25009.0min: [2024-01-05 15:04:13,393][model8_pretrain.py][INFO] Epoch:[0/2](655000/4588595) loss:2.875 lr:0.0000100 epoch_Time:25009.0min: [2024-01-05 15:04:13,393][model8_pretrain.py][INFO] Epoch:[0/2](655000/4588595) loss:2.612 lr:0.0000100 epoch_Time:25009.0min: [2024-01-05 15:04:50,340][model8_pretrain.py][INFO] Epoch:[0/2](655100/4588595) loss:2.864 lr:0.0000100 epoch_Time:25008.0min: [2024-01-05 15:04:50,340][model8_pretrain.py][INFO] Epoch:[0/2](655100/4588595) loss:2.643 lr:0.0000100 epoch_Time:25008.0min: [2024-01-05 15:04:50,340][model8_pretrain.py][INFO] Epoch:[0/2](655100/4588595) loss:3.070 lr:0.0000100 epoch_Time:25008.0min: [2024-01-05 15:04:50,340][model8_pretrain.py][INFO] Epoch:[0/2](655100/4588595) loss:2.388 lr:0.0000100 epoch_Time:25008.0min: [2024-01-05 15:04:50,340][model8_pretrain.py][INFO] Epoch:[0/2](655100/4588595) loss:2.676 lr:0.0000100 epoch_Time:25008.0min: [2024-01-05 15:04:50,340][model8_pretrain.py][INFO] Epoch:[0/2](655100/4588595) loss:2.528 lr:0.0000100 epoch_Time:25008.0min: [2024-01-05 15:04:50,340][model8_pretrain.py][INFO] Epoch:[0/2](655100/4588595) loss:3.034 lr:0.0000100 epoch_Time:25008.0min: [2024-01-05 15:04:50,341][model8_pretrain.py][INFO] 
Epoch:[0/2](655100/4588595) loss:2.682 lr:0.0000100 epoch_Time:25008.0min: [2024-01-05 15:05:27,275][model8_pretrain.py][INFO] Epoch:[0/2](655200/4588595) loss:2.257 lr:0.0000100 epoch_Time:25008.0min: [2024-01-05 15:05:27,275][model8_pretrain.py][INFO] Epoch:[0/2](655200/4588595) loss:2.999 lr:0.0000100 epoch_Time:25008.0min: [2024-01-05 15:05:27,275][model8_pretrain.py][INFO] Epoch:[0/2](655200/4588595) loss:2.973 lr:0.0000100 epoch_Time:25008.0min: [2024-01-05 15:05:27,275][model8_pretrain.py][INFO] Epoch:[0/2](655200/4588595) loss:3.321 lr:0.0000100 epoch_Time:25008.0min: [2024-01-05 15:05:27,275][model8_pretrain.py][INFO] Epoch:[0/2](655200/4588595) loss:2.420 lr:0.0000100 epoch_Time:25008.0min: [2024-01-05 15:05:27,275][model8_pretrain.py][INFO] Epoch:[0/2](655200/4588595) loss:2.577 lr:0.0000100 epoch_Time:25008.0min: [2024-01-05 15:05:27,275][model8_pretrain.py][INFO] Epoch:[0/2](655200/4588595) loss:3.173 lr:0.0000100 epoch_Time:25008.0min: [2024-01-05 15:05:27,276][model8_pretrain.py][INFO] Epoch:[0/2](655200/4588595) loss:3.045 lr:0.0000100 epoch_Time:25008.0min: [2024-01-05 15:06:04,201][model8_pretrain.py][INFO] Epoch:[0/2](655300/4588595) loss:2.746 lr:0.0000100 epoch_Time:25007.0min: [2024-01-05 15:06:04,201][model8_pretrain.py][INFO] Epoch:[0/2](655300/4588595) loss:2.949 lr:0.0000100 epoch_Time:25007.0min: [2024-01-05 15:06:04,201][model8_pretrain.py][INFO] Epoch:[0/2](655300/4588595) loss:2.651 lr:0.0000100 epoch_Time:25007.0min: [2024-01-05 15:06:04,201][model8_pretrain.py][INFO] Epoch:[0/2](655300/4588595) loss:2.998 lr:0.0000100 epoch_Time:25007.0min: [2024-01-05 15:06:04,201][model8_pretrain.py][INFO] Epoch:[0/2](655300/4588595) loss:2.830 lr:0.0000100 epoch_Time:25007.0min: [2024-01-05 15:06:04,201][model8_pretrain.py][INFO] Epoch:[0/2](655300/4588595) loss:2.917 lr:0.0000100 epoch_Time:25007.0min: [2024-01-05 15:06:04,201][model8_pretrain.py][INFO] Epoch:[0/2](655300/4588595) loss:3.093 lr:0.0000100 epoch_Time:25007.0min: [2024-01-05 
15:06:04,202][model8_pretrain.py][INFO] Epoch:[0/2](655300/4588595) loss:3.029 lr:0.0000100 epoch_Time:25007.0min: [2024-01-05 15:06:41,158][model8_pretrain.py][INFO] Epoch:[0/2](655400/4588595) loss:3.178 lr:0.0000100 epoch_Time:25007.0min: [2024-01-05 15:06:41,158][model8_pretrain.py][INFO] Epoch:[0/2](655400/4588595) loss:2.752 lr:0.0000100 epoch_Time:25007.0min: [2024-01-05 15:06:41,158][model8_pretrain.py][INFO] Epoch:[0/2](655400/4588595) loss:2.704 lr:0.0000100 epoch_Time:25007.0min: [2024-01-05 15:06:41,158][model8_pretrain.py][INFO] Epoch:[0/2](655400/4588595) loss:2.540 lr:0.0000100 epoch_Time:25007.0min: [2024-01-05 15:06:41,158][model8_pretrain.py][INFO] Epoch:[0/2](655400/4588595) loss:2.217 lr:0.0000100 epoch_Time:25007.0min: [2024-01-05 15:06:41,158][model8_pretrain.py][INFO] Epoch:[0/2](655400/4588595) loss:2.375 lr:0.0000100 epoch_Time:25007.0min: [2024-01-05 15:06:41,159][model8_pretrain.py][INFO] Epoch:[0/2](655400/4588595) loss:2.467 lr:0.0000100 epoch_Time:25007.0min: [2024-01-05 15:06:41,159][model8_pretrain.py][INFO] Epoch:[0/2](655400/4588595) loss:2.724 lr:0.0000100 epoch_Time:25007.0min: [2024-01-05 15:07:18,090][model8_pretrain.py][INFO] Epoch:[0/2](655500/4588595) loss:2.626 lr:0.0000100 epoch_Time:25006.0min: [2024-01-05 15:07:18,090][model8_pretrain.py][INFO] Epoch:[0/2](655500/4588595) loss:3.011 lr:0.0000100 epoch_Time:25006.0min: [2024-01-05 15:07:18,091][model8_pretrain.py][INFO] Epoch:[0/2](655500/4588595) loss:3.045 lr:0.0000100 epoch_Time:25006.0min: [2024-01-05 15:07:18,090][model8_pretrain.py][INFO] Epoch:[0/2](655500/4588595) loss:2.137 lr:0.0000100 epoch_Time:25006.0min: [2024-01-05 15:07:18,090][model8_pretrain.py][INFO] Epoch:[0/2](655500/4588595) loss:3.003 lr:0.0000100 epoch_Time:25006.0min: [2024-01-05 15:07:18,090][model8_pretrain.py][INFO] Epoch:[0/2](655500/4588595) loss:3.378 lr:0.0000100 epoch_Time:25006.0min: [2024-01-05 15:07:18,090][model8_pretrain.py][INFO] Epoch:[0/2](655500/4588595) loss:2.485 lr:0.0000100 
epoch_Time:25006.0min: [2024-01-05 15:07:18,090][model8_pretrain.py][INFO] Epoch:[0/2](655500/4588595) loss:2.903 lr:0.0000100 epoch_Time:25006.0min: [2024-01-05 15:07:58,513][model8_pretrain.py][INFO] Epoch:[0/2](655600/4588595) loss:3.026 lr:0.0000100 epoch_Time:25005.0min: [2024-01-05 15:07:58,513][model8_pretrain.py][INFO] Epoch:[0/2](655600/4588595) loss:2.969 lr:0.0000100 epoch_Time:25005.0min: [2024-01-05 15:07:58,513][model8_pretrain.py][INFO] Epoch:[0/2](655600/4588595) loss:1.794 lr:0.0000100 epoch_Time:25005.0min: [2024-01-05 15:07:58,513][model8_pretrain.py][INFO] Epoch:[0/2](655600/4588595) loss:2.969 lr:0.0000100 epoch_Time:25005.0min: [2024-01-05 15:07:58,513][model8_pretrain.py][INFO] Epoch:[0/2](655600/4588595) loss:2.725 lr:0.0000100 epoch_Time:25005.0min: [2024-01-05 15:07:58,513][model8_pretrain.py][INFO] Epoch:[0/2](655600/4588595) loss:3.273 lr:0.0000100 epoch_Time:25005.0min: [2024-01-05 15:07:58,513][model8_pretrain.py][INFO] Epoch:[0/2](655600/4588595) loss:3.022 lr:0.0000100 epoch_Time:25005.0min: [2024-01-05 15:07:58,513][model8_pretrain.py][INFO] Epoch:[0/2](655600/4588595) loss:2.562 lr:0.0000100 epoch_Time:25005.0min: [2024-01-05 15:08:43,865][model8_pretrain.py][INFO] Epoch:[0/2](655700/4588595) loss:2.272 lr:0.0000100 epoch_Time:25006.0min: [2024-01-05 15:08:43,865][model8_pretrain.py][INFO] Epoch:[0/2](655700/4588595) loss:2.026 lr:0.0000100 epoch_Time:25006.0min: [2024-01-05 15:08:43,865][model8_pretrain.py][INFO] Epoch:[0/2](655700/4588595) loss:2.562 lr:0.0000100 epoch_Time:25006.0min: [2024-01-05 15:08:43,865][model8_pretrain.py][INFO] Epoch:[0/2](655700/4588595) loss:2.459 lr:0.0000100 epoch_Time:25006.0min: [2024-01-05 15:08:43,865][model8_pretrain.py][INFO] Epoch:[0/2](655700/4588595) loss:3.083 lr:0.0000100 epoch_Time:25006.0min: [2024-01-05 15:08:43,865][model8_pretrain.py][INFO] Epoch:[0/2](655700/4588595) loss:2.399 lr:0.0000100 epoch_Time:25006.0min: [2024-01-05 15:08:43,865][model8_pretrain.py][INFO] 
Epoch:[0/2](655700/4588595) loss:3.223 lr:0.0000100 epoch_Time:25006.0min: [2024-01-05 15:08:43,865][model8_pretrain.py][INFO] Epoch:[0/2](655700/4588595) loss:3.105 lr:0.0000100 epoch_Time:25006.0min: [2024-01-05 15:09:20,806][model8_pretrain.py][INFO] Epoch:[0/2](655800/4588595) loss:3.190 lr:0.0000100 epoch_Time:25005.0min: [2024-01-05 15:09:20,806][model8_pretrain.py][INFO] Epoch:[0/2](655800/4588595) loss:2.292 lr:0.0000100 epoch_Time:25005.0min: [2024-01-05 15:09:20,806][model8_pretrain.py][INFO] Epoch:[0/2](655800/4588595) loss:2.742 lr:0.0000100 epoch_Time:25005.0min: [2024-01-05 15:09:20,806][model8_pretrain.py][INFO] Epoch:[0/2](655800/4588595) loss:2.849 lr:0.0000100 epoch_Time:25005.0min: [2024-01-05 15:09:20,806][model8_pretrain.py][INFO] Epoch:[0/2](655800/4588595) loss:2.710 lr:0.0000100 epoch_Time:25005.0min: [2024-01-05 15:09:20,806][model8_pretrain.py][INFO] Epoch:[0/2](655800/4588595) loss:3.235 lr:0.0000100 epoch_Time:25005.0min: [2024-01-05 15:09:20,806][model8_pretrain.py][INFO] Epoch:[0/2](655800/4588595) loss:2.982 lr:0.0000100 epoch_Time:25005.0min: [2024-01-05 15:09:20,806][model8_pretrain.py][INFO] Epoch:[0/2](655800/4588595) loss:2.546 lr:0.0000100 epoch_Time:25005.0min: [2024-01-05 15:09:57,748][model8_pretrain.py][INFO] Epoch:[0/2](655900/4588595) loss:3.233 lr:0.0000100 epoch_Time:25003.0min: [2024-01-05 15:09:57,748][model8_pretrain.py][INFO] Epoch:[0/2](655900/4588595) loss:2.529 lr:0.0000100 epoch_Time:25003.0min: [2024-01-05 15:09:57,748][model8_pretrain.py][INFO] Epoch:[0/2](655900/4588595) loss:3.160 lr:0.0000100 epoch_Time:25003.0min: [2024-01-05 15:09:57,748][model8_pretrain.py][INFO] Epoch:[0/2](655900/4588595) loss:2.946 lr:0.0000100 epoch_Time:25003.0min: [2024-01-05 15:09:57,748][model8_pretrain.py][INFO] Epoch:[0/2](655900/4588595) loss:2.790 lr:0.0000100 epoch_Time:25003.0min: [2024-01-05 15:09:57,748][model8_pretrain.py][INFO] Epoch:[0/2](655900/4588595) loss:2.602 lr:0.0000100 epoch_Time:25003.0min: [2024-01-05 
15:09:57,748][model8_pretrain.py][INFO] Epoch:[0/2](655900/4588595) loss:2.832 lr:0.0000100 epoch_Time:25003.0min: [2024-01-05 15:09:57,749][model8_pretrain.py][INFO] Epoch:[0/2](655900/4588595) loss:2.717 lr:0.0000100 epoch_Time:25003.0min: [2024-01-05 15:10:34,698][model8_pretrain.py][INFO] Epoch:[0/2](656000/4588595) loss:2.355 lr:0.0000100 epoch_Time:25003.0min: [2024-01-05 15:10:34,698][model8_pretrain.py][INFO] Epoch:[0/2](656000/4588595) loss:2.906 lr:0.0000100 epoch_Time:25003.0min: [2024-01-05 15:10:34,698][model8_pretrain.py][INFO] Epoch:[0/2](656000/4588595) loss:3.167 lr:0.0000100 epoch_Time:25003.0min: [2024-01-05 15:10:34,698][model8_pretrain.py][INFO] Epoch:[0/2](656000/4588595) loss:3.198 lr:0.0000100 epoch_Time:25003.0min: [2024-01-05 15:10:34,698][model8_pretrain.py][INFO] Epoch:[0/2](656000/4588595) loss:3.033 lr:0.0000100 epoch_Time:25003.0min: [2024-01-05 15:10:34,698][model8_pretrain.py][INFO] Epoch:[0/2](656000/4588595) loss:2.159 lr:0.0000100 epoch_Time:25003.0min: [2024-01-05 15:10:34,698][model8_pretrain.py][INFO] Epoch:[0/2](656000/4588595) loss:2.569 lr:0.0000100 epoch_Time:25003.0min: [2024-01-05 15:10:34,699][model8_pretrain.py][INFO] Epoch:[0/2](656000/4588595) loss:2.919 lr:0.0000100 epoch_Time:25003.0min: [2024-01-05 15:11:11,633][model8_pretrain.py][INFO] Epoch:[0/2](656100/4588595) loss:2.581 lr:0.0000100 epoch_Time:25002.0min: [2024-01-05 15:11:11,633][model8_pretrain.py][INFO] Epoch:[0/2](656100/4588595) loss:2.515 lr:0.0000100 epoch_Time:25002.0min: [2024-01-05 15:11:11,633][model8_pretrain.py][INFO] Epoch:[0/2](656100/4588595) loss:2.226 lr:0.0000100 epoch_Time:25002.0min: [2024-01-05 15:11:11,633][model8_pretrain.py][INFO] Epoch:[0/2](656100/4588595) loss:2.635 lr:0.0000100 epoch_Time:25002.0min: [2024-01-05 15:11:11,633][model8_pretrain.py][INFO] Epoch:[0/2](656100/4588595) loss:2.804 lr:0.0000100 epoch_Time:25002.0min: [2024-01-05 15:11:11,633][model8_pretrain.py][INFO] Epoch:[0/2](656100/4588595) loss:2.779 lr:0.0000100 
epoch_Time:25002.0min: [2024-01-05 15:11:11,634][model8_pretrain.py][INFO] Epoch:[0/2](656100/4588595) loss:2.629 lr:0.0000100 epoch_Time:25002.0min: [2024-01-05 15:11:11,634][model8_pretrain.py][INFO] Epoch:[0/2](656100/4588595) loss:3.037 lr:0.0000100 epoch_Time:25002.0min: [2024-01-05 15:11:48,584][model8_pretrain.py][INFO] Epoch:[0/2](656200/4588595) loss:3.106 lr:0.0000100 epoch_Time:25001.0min: [2024-01-05 15:11:48,584][model8_pretrain.py][INFO] Epoch:[0/2](656200/4588595) loss:3.214 lr:0.0000100 epoch_Time:25001.0min: [2024-01-05 15:11:48,584][model8_pretrain.py][INFO] Epoch:[0/2](656200/4588595) loss:3.440 lr:0.0000100 epoch_Time:25001.0min: [2024-01-05 15:11:48,584][model8_pretrain.py][INFO] Epoch:[0/2](656200/4588595) loss:2.601 lr:0.0000100 epoch_Time:25001.0min: [2024-01-05 15:11:48,584][model8_pretrain.py][INFO] Epoch:[0/2](656200/4588595) loss:2.674 lr:0.0000100 epoch_Time:25001.0min: [2024-01-05 15:11:48,584][model8_pretrain.py][INFO] Epoch:[0/2](656200/4588595) loss:3.325 lr:0.0000100 epoch_Time:25001.0min: [2024-01-05 15:11:48,585][model8_pretrain.py][INFO] Epoch:[0/2](656200/4588595) loss:2.644 lr:0.0000100 epoch_Time:25001.0min: [2024-01-05 15:11:48,585][model8_pretrain.py][INFO] Epoch:[0/2](656200/4588595) loss:2.587 lr:0.0000100 epoch_Time:25001.0min: [2024-01-05 15:12:25,521][model8_pretrain.py][INFO] Epoch:[0/2](656300/4588595) loss:2.874 lr:0.0000100 epoch_Time:25001.0min: [2024-01-05 15:12:25,521][model8_pretrain.py][INFO] Epoch:[0/2](656300/4588595) loss:2.854 lr:0.0000100 epoch_Time:25001.0min: [2024-01-05 15:12:25,521][model8_pretrain.py][INFO] Epoch:[0/2](656300/4588595) loss:2.811 lr:0.0000100 epoch_Time:25001.0min: [2024-01-05 15:12:25,521][model8_pretrain.py][INFO] Epoch:[0/2](656300/4588595) loss:2.910 lr:0.0000100 epoch_Time:25001.0min: [2024-01-05 15:12:25,521][model8_pretrain.py][INFO] Epoch:[0/2](656300/4588595) loss:2.826 lr:0.0000100 epoch_Time:25001.0min: [2024-01-05 15:12:25,521][model8_pretrain.py][INFO] 
Epoch:[0/2](656300/4588595) loss:1.964 lr:0.0000100 epoch_Time:25001.0min: [2024-01-05 15:12:25,521][model8_pretrain.py][INFO] Epoch:[0/2](656300/4588595) loss:2.843 lr:0.0000100 epoch_Time:25001.0min: [2024-01-05 15:12:25,522][model8_pretrain.py][INFO] Epoch:[0/2](656300/4588595) loss:2.558 lr:0.0000100 epoch_Time:25001.0min: [2024-01-05 15:13:05,979][model8_pretrain.py][INFO] Epoch:[0/2](656400/4588595) loss:2.826 lr:0.0000100 epoch_Time:25000.0min: [2024-01-05 15:13:05,979][model8_pretrain.py][INFO] Epoch:[0/2](656400/4588595) loss:3.051 lr:0.0000100 epoch_Time:25000.0min: [2024-01-05 15:13:05,979][model8_pretrain.py][INFO] Epoch:[0/2](656400/4588595) loss:2.709 lr:0.0000100 epoch_Time:25000.0min: [2024-01-05 15:13:05,979][model8_pretrain.py][INFO] Epoch:[0/2](656400/4588595) loss:2.576 lr:0.0000100 epoch_Time:25000.0min: [2024-01-05 15:13:05,979][model8_pretrain.py][INFO] Epoch:[0/2](656400/4588595) loss:2.100 lr:0.0000100 epoch_Time:25000.0min: [2024-01-05 15:13:05,979][model8_pretrain.py][INFO] Epoch:[0/2](656400/4588595) loss:2.812 lr:0.0000100 epoch_Time:25000.0min: [2024-01-05 15:13:05,979][model8_pretrain.py][INFO] Epoch:[0/2](656400/4588595) loss:2.590 lr:0.0000100 epoch_Time:25000.0min: [2024-01-05 15:13:05,979][model8_pretrain.py][INFO] Epoch:[0/2](656400/4588595) loss:2.856 lr:0.0000100 epoch_Time:25000.0min: [2024-01-05 15:13:51,344][model8_pretrain.py][INFO] Epoch:[0/2](656500/4588595) loss:2.467 lr:0.0000100 epoch_Time:25000.0min: [2024-01-05 15:13:51,344][model8_pretrain.py][INFO] Epoch:[0/2](656500/4588595) loss:3.208 lr:0.0000100 epoch_Time:25000.0min: [2024-01-05 15:13:51,344][model8_pretrain.py][INFO] Epoch:[0/2](656500/4588595) loss:2.639 lr:0.0000100 epoch_Time:25000.0min: [2024-01-05 15:13:51,344][model8_pretrain.py][INFO] Epoch:[0/2](656500/4588595) loss:2.533 lr:0.0000100 epoch_Time:25000.0min: [2024-01-05 15:13:51,344][model8_pretrain.py][INFO] Epoch:[0/2](656500/4588595) loss:3.305 lr:0.0000100 epoch_Time:25000.0min: [2024-01-05 
15:13:51,344][model8_pretrain.py][INFO] Epoch:[0/2](656500/4588595) loss:2.759 lr:0.0000100 epoch_Time:25000.0min: [2024-01-05 15:13:51,344][model8_pretrain.py][INFO] Epoch:[0/2](656500/4588595) loss:2.494 lr:0.0000100 epoch_Time:25000.0min: [2024-01-05 15:13:51,344][model8_pretrain.py][INFO] Epoch:[0/2](656500/4588595) loss:2.760 lr:0.0000100 epoch_Time:25000.0min: [2024-01-05 15:14:28,253][model8_pretrain.py][INFO] Epoch:[0/2](656600/4588595) loss:3.004 lr:0.0000100 epoch_Time:25000.0min: [2024-01-05 15:14:28,253][model8_pretrain.py][INFO] Epoch:[0/2](656600/4588595) loss:2.278 lr:0.0000100 epoch_Time:25000.0min: [2024-01-05 15:14:28,253][model8_pretrain.py][INFO] Epoch:[0/2](656600/4588595) loss:3.290 lr:0.0000100 epoch_Time:25000.0min: [2024-01-05 15:14:28,253][model8_pretrain.py][INFO] Epoch:[0/2](656600/4588595) loss:2.668 lr:0.0000100 epoch_Time:25000.0min: [2024-01-05 15:14:28,253][model8_pretrain.py][INFO] Epoch:[0/2](656600/4588595) loss:2.286 lr:0.0000100 epoch_Time:25000.0min: [2024-01-05 15:14:28,254][model8_pretrain.py][INFO] Epoch:[0/2](656600/4588595) loss:2.243 lr:0.0000100 epoch_Time:25000.0min: [2024-01-05 15:14:28,254][model8_pretrain.py][INFO] Epoch:[0/2](656600/4588595) loss:2.709 lr:0.0000100 epoch_Time:25000.0min: [2024-01-05 15:14:28,254][model8_pretrain.py][INFO] Epoch:[0/2](656600/4588595) loss:2.946 lr:0.0000100 epoch_Time:25000.0min: [2024-01-05 15:15:05,193][model8_pretrain.py][INFO] Epoch:[0/2](656700/4588595) loss:2.952 lr:0.0000100 epoch_Time:24999.0min: [2024-01-05 15:15:05,193][model8_pretrain.py][INFO] Epoch:[0/2](656700/4588595) loss:2.779 lr:0.0000100 epoch_Time:24999.0min: [2024-01-05 15:15:05,193][model8_pretrain.py][INFO] Epoch:[0/2](656700/4588595) loss:2.894 lr:0.0000100 epoch_Time:24999.0min: [2024-01-05 15:15:05,193][model8_pretrain.py][INFO] Epoch:[0/2](656700/4588595) loss:2.655 lr:0.0000100 epoch_Time:24999.0min: [2024-01-05 15:15:05,193][model8_pretrain.py][INFO] Epoch:[0/2](656700/4588595) loss:3.278 lr:0.0000100 
epoch_Time:24999.0min: [2024-01-05 15:15:05,193][model8_pretrain.py][INFO] Epoch:[0/2](656700/4588595) loss:2.328 lr:0.0000100 epoch_Time:24999.0min: [2024-01-05 15:15:05,193][model8_pretrain.py][INFO] Epoch:[0/2](656700/4588595) loss:2.783 lr:0.0000100 epoch_Time:24999.0min: [2024-01-05 15:15:05,193][model8_pretrain.py][INFO] Epoch:[0/2](656700/4588595) loss:2.595 lr:0.0000100 epoch_Time:24999.0min: [2024-01-05 15:15:42,140][model8_pretrain.py][INFO] Epoch:[0/2](656800/4588595) loss:3.044 lr:0.0000100 epoch_Time:24999.0min: [2024-01-05 15:15:42,140][model8_pretrain.py][INFO] Epoch:[0/2](656800/4588595) loss:2.606 lr:0.0000100 epoch_Time:24999.0min: [2024-01-05 15:15:42,140][model8_pretrain.py][INFO] Epoch:[0/2](656800/4588595) loss:2.729 lr:0.0000100 epoch_Time:24999.0min: [2024-01-05 15:15:42,140][model8_pretrain.py][INFO] Epoch:[0/2](656800/4588595) loss:2.725 lr:0.0000100 epoch_Time:24999.0min: [2024-01-05 15:15:42,141][model8_pretrain.py][INFO] Epoch:[0/2](656800/4588595) loss:2.672 lr:0.0000100 epoch_Time:24999.0min: [2024-01-05 15:15:42,140][model8_pretrain.py][INFO] Epoch:[0/2](656800/4588595) loss:3.167 lr:0.0000100 epoch_Time:24999.0min: [2024-01-05 15:15:42,141][model8_pretrain.py][INFO] Epoch:[0/2](656800/4588595) loss:3.079 lr:0.0000100 epoch_Time:24999.0min: [2024-01-05 15:15:42,141][model8_pretrain.py][INFO] Epoch:[0/2](656800/4588595) loss:2.920 lr:0.0000100 epoch_Time:24999.0min: [2024-01-05 15:16:19,094][model8_pretrain.py][INFO] Epoch:[0/2](656900/4588595) loss:2.915 lr:0.0000100 epoch_Time:24997.0min: [2024-01-05 15:16:19,095][model8_pretrain.py][INFO] Epoch:[0/2](656900/4588595) loss:3.134 lr:0.0000100 epoch_Time:24997.0min: [2024-01-05 15:16:19,095][model8_pretrain.py][INFO] Epoch:[0/2](656900/4588595) loss:3.140 lr:0.0000100 epoch_Time:24997.0min: [2024-01-05 15:16:19,095][model8_pretrain.py][INFO] Epoch:[0/2](656900/4588595) loss:3.084 lr:0.0000100 epoch_Time:24997.0min: [2024-01-05 15:16:19,095][model8_pretrain.py][INFO] 
Epoch:[0/2](656900/4588595) loss:3.048 lr:0.0000100 epoch_Time:24997.0min: [2024-01-05 15:16:19,095][model8_pretrain.py][INFO] Epoch:[0/2](656900/4588595) loss:3.000 lr:0.0000100 epoch_Time:24997.0min: [2024-01-05 15:16:19,095][model8_pretrain.py][INFO] Epoch:[0/2](656900/4588595) loss:2.940 lr:0.0000100 epoch_Time:24997.0min: [2024-01-05 15:16:19,095][model8_pretrain.py][INFO] Epoch:[0/2](656900/4588595) loss:2.844 lr:0.0000100 epoch_Time:24997.0min: [2024-01-05 15:16:56,040][model8_pretrain.py][INFO] Epoch:[0/2](657000/4588595) loss:2.853 lr:0.0000100 epoch_Time:24996.0min: [2024-01-05 15:16:56,040][model8_pretrain.py][INFO] Epoch:[0/2](657000/4588595) loss:3.187 lr:0.0000100 epoch_Time:24996.0min: [2024-01-05 15:16:56,041][model8_pretrain.py][INFO] Epoch:[0/2](657000/4588595) loss:2.508 lr:0.0000100 epoch_Time:24996.0min: [2024-01-05 15:16:56,041][model8_pretrain.py][INFO] Epoch:[0/2](657000/4588595) loss:2.770 lr:0.0000100 epoch_Time:24996.0min: [2024-01-05 15:16:56,041][model8_pretrain.py][INFO] Epoch:[0/2](657000/4588595) loss:2.892 lr:0.0000100 epoch_Time:24996.0min: [2024-01-05 15:16:56,041][model8_pretrain.py][INFO] Epoch:[0/2](657000/4588595) loss:3.291 lr:0.0000100 epoch_Time:24996.0min: [2024-01-05 15:16:56,041][model8_pretrain.py][INFO] Epoch:[0/2](657000/4588595) loss:2.787 lr:0.0000100 epoch_Time:24996.0min: [2024-01-05 15:16:56,041][model8_pretrain.py][INFO] Epoch:[0/2](657000/4588595) loss:2.982 lr:0.0000100 epoch_Time:24996.0min: [2024-01-05 15:17:33,002][model8_pretrain.py][INFO] Epoch:[0/2](657100/4588595) loss:2.262 lr:0.0000100 epoch_Time:24996.0min: [2024-01-05 15:17:33,002][model8_pretrain.py][INFO] Epoch:[0/2](657100/4588595) loss:2.283 lr:0.0000100 epoch_Time:24996.0min: [2024-01-05 15:17:33,002][model8_pretrain.py][INFO] Epoch:[0/2](657100/4588595) loss:2.791 lr:0.0000100 epoch_Time:24996.0min: [2024-01-05 15:17:33,002][model8_pretrain.py][INFO] Epoch:[0/2](657100/4588595) loss:2.618 lr:0.0000100 epoch_Time:24996.0min: [2024-01-05 
15:17:33,002][model8_pretrain.py][INFO] Epoch:[0/2](657100/4588595) loss:2.910 lr:0.0000100 epoch_Time:24996.0min: [2024-01-05 15:17:33,002][model8_pretrain.py][INFO] Epoch:[0/2](657100/4588595) loss:2.755 lr:0.0000100 epoch_Time:24996.0min: [2024-01-05 15:17:33,002][model8_pretrain.py][INFO] Epoch:[0/2](657100/4588595) loss:2.902 lr:0.0000100 epoch_Time:24996.0min: [2024-01-05 15:17:33,002][model8_pretrain.py][INFO] Epoch:[0/2](657100/4588595) loss:2.678 lr:0.0000100 epoch_Time:24996.0min: [2024-01-05 15:18:11,716][model8_pretrain.py][INFO] Epoch:[0/2](657200/4588595) loss:2.648 lr:0.0000100 epoch_Time:24995.0min: [2024-01-05 15:18:11,716][model8_pretrain.py][INFO] Epoch:[0/2](657200/4588595) loss:2.574 lr:0.0000100 epoch_Time:24995.0min: [2024-01-05 15:18:11,716][model8_pretrain.py][INFO] Epoch:[0/2](657200/4588595) loss:2.855 lr:0.0000100 epoch_Time:24995.0min: [2024-01-05 15:18:11,716][model8_pretrain.py][INFO] Epoch:[0/2](657200/4588595) loss:3.363 lr:0.0000100 epoch_Time:24995.0min: [2024-01-05 15:18:11,716][model8_pretrain.py][INFO] Epoch:[0/2](657200/4588595) loss:3.113 lr:0.0000100 epoch_Time:24995.0min: [2024-01-05 15:18:11,716][model8_pretrain.py][INFO] Epoch:[0/2](657200/4588595) loss:2.639 lr:0.0000100 epoch_Time:24995.0min: [2024-01-05 15:18:11,716][model8_pretrain.py][INFO] Epoch:[0/2](657200/4588595) loss:2.763 lr:0.0000100 epoch_Time:24995.0min: [2024-01-05 15:18:11,716][model8_pretrain.py][INFO] Epoch:[0/2](657200/4588595) loss:3.013 lr:0.0000100 epoch_Time:24995.0min: [2024-01-05 15:18:58,842][model8_pretrain.py][INFO] Epoch:[0/2](657300/4588595) loss:2.860 lr:0.0000100 epoch_Time:24995.0min: [2024-01-05 15:18:58,842][model8_pretrain.py][INFO] Epoch:[0/2](657300/4588595) loss:2.942 lr:0.0000100 epoch_Time:24995.0min: [2024-01-05 15:18:58,842][model8_pretrain.py][INFO] Epoch:[0/2](657300/4588595) loss:2.929 lr:0.0000100 epoch_Time:24995.0min: [2024-01-05 15:18:58,842][model8_pretrain.py][INFO] Epoch:[0/2](657300/4588595) loss:2.400 lr:0.0000100 
epoch_Time:24995.0min: [2024-01-05 15:18:58,842][model8_pretrain.py][INFO] Epoch:[0/2](657300/4588595) loss:2.974 lr:0.0000100 epoch_Time:24995.0min: [2024-01-05 15:18:58,842][model8_pretrain.py][INFO] Epoch:[0/2](657300/4588595) loss:2.858 lr:0.0000100 epoch_Time:24995.0min: [2024-01-05 15:18:58,842][model8_pretrain.py][INFO] Epoch:[0/2](657300/4588595) loss:3.586 lr:0.0000100 epoch_Time:24995.0min: [2024-01-05 15:18:58,842][model8_pretrain.py][INFO] Epoch:[0/2](657300/4588595) loss:2.897 lr:0.0000100 epoch_Time:24995.0min: [2024-01-05 15:19:35,788][model8_pretrain.py][INFO] Epoch:[0/2](657400/4588595) loss:3.188 lr:0.0000100 epoch_Time:24995.0min: [2024-01-05 15:19:35,788][model8_pretrain.py][INFO] Epoch:[0/2](657400/4588595) loss:2.863 lr:0.0000100 epoch_Time:24995.0min: [2024-01-05 15:19:35,788][model8_pretrain.py][INFO] Epoch:[0/2](657400/4588595) loss:3.535 lr:0.0000100 epoch_Time:24995.0min: [2024-01-05 15:19:35,788][model8_pretrain.py][INFO] Epoch:[0/2](657400/4588595) loss:3.409 lr:0.0000100 epoch_Time:24995.0min: [2024-01-05 15:19:35,788][model8_pretrain.py][INFO] Epoch:[0/2](657400/4588595) loss:3.184 lr:0.0000100 epoch_Time:24995.0min: [2024-01-05 15:19:35,788][model8_pretrain.py][INFO] Epoch:[0/2](657400/4588595) loss:2.707 lr:0.0000100 epoch_Time:24995.0min: [2024-01-05 15:19:35,788][model8_pretrain.py][INFO] Epoch:[0/2](657400/4588595) loss:2.561 lr:0.0000100 epoch_Time:24995.0min: [2024-01-05 15:19:35,788][model8_pretrain.py][INFO] Epoch:[0/2](657400/4588595) loss:3.098 lr:0.0000100 epoch_Time:24995.0min: [2024-01-05 15:20:12,740][model8_pretrain.py][INFO] Epoch:[0/2](657500/4588595) loss:2.318 lr:0.0000100 epoch_Time:24994.0min: [2024-01-05 15:20:12,740][model8_pretrain.py][INFO] Epoch:[0/2](657500/4588595) loss:3.092 lr:0.0000100 epoch_Time:24994.0min: [2024-01-05 15:20:12,740][model8_pretrain.py][INFO] Epoch:[0/2](657500/4588595) loss:2.929 lr:0.0000100 epoch_Time:24994.0min: [2024-01-05 15:20:12,740][model8_pretrain.py][INFO] 
Epoch:[0/2](657500/4588595) loss:3.183 lr:0.0000100 epoch_Time:24994.0min: [2024-01-05 15:20:12,740][model8_pretrain.py][INFO] Epoch:[0/2](657500/4588595) loss:3.215 lr:0.0000100 epoch_Time:24994.0min: [2024-01-05 15:20:12,740][model8_pretrain.py][INFO] Epoch:[0/2](657500/4588595) loss:2.708 lr:0.0000100 epoch_Time:24994.0min: [2024-01-05 15:20:12,740][model8_pretrain.py][INFO] Epoch:[0/2](657500/4588595) loss:2.841 lr:0.0000100 epoch_Time:24994.0min: [2024-01-05 15:20:12,741][model8_pretrain.py][INFO] Epoch:[0/2](657500/4588595) loss:3.011 lr:0.0000100 epoch_Time:24994.0min: [2024-01-05 15:20:49,685][model8_pretrain.py][INFO] Epoch:[0/2](657600/4588595) loss:2.709 lr:0.0000100 epoch_Time:24993.0min: [2024-01-05 15:20:49,685][model8_pretrain.py][INFO] Epoch:[0/2](657600/4588595) loss:2.950 lr:0.0000100 epoch_Time:24993.0min: [2024-01-05 15:20:49,685][model8_pretrain.py][INFO] Epoch:[0/2](657600/4588595) loss:2.307 lr:0.0000100 epoch_Time:24993.0min: [2024-01-05 15:20:49,685][model8_pretrain.py][INFO] Epoch:[0/2](657600/4588595) loss:3.140 lr:0.0000100 epoch_Time:24993.0min: [2024-01-05 15:20:49,685][model8_pretrain.py][INFO] Epoch:[0/2](657600/4588595) loss:2.617 lr:0.0000100 epoch_Time:24993.0min: [2024-01-05 15:20:49,685][model8_pretrain.py][INFO] Epoch:[0/2](657600/4588595) loss:2.874 lr:0.0000100 epoch_Time:24993.0min: [2024-01-05 15:20:49,685][model8_pretrain.py][INFO] Epoch:[0/2](657600/4588595) loss:2.904 lr:0.0000100 epoch_Time:24993.0min: [2024-01-05 15:20:49,686][model8_pretrain.py][INFO] Epoch:[0/2](657600/4588595) loss:3.174 lr:0.0000100 epoch_Time:24993.0min: [2024-01-05 15:21:26,621][model8_pretrain.py][INFO] Epoch:[0/2](657700/4588595) loss:2.613 lr:0.0000100 epoch_Time:24993.0min: [2024-01-05 15:21:26,621][model8_pretrain.py][INFO] Epoch:[0/2](657700/4588595) loss:2.540 lr:0.0000100 epoch_Time:24993.0min: [2024-01-05 15:21:26,621][model8_pretrain.py][INFO] Epoch:[0/2](657700/4588595) loss:3.258 lr:0.0000100 epoch_Time:24993.0min: [2024-01-05 
15:21:26,621][model8_pretrain.py][INFO] Epoch:[0/2](657700/4588595) loss:3.207 lr:0.0000100 epoch_Time:24993.0min: [2024-01-05 15:21:26,621][model8_pretrain.py][INFO] Epoch:[0/2](657700/4588595) loss:2.996 lr:0.0000100 epoch_Time:24993.0min: [2024-01-05 15:21:26,621][model8_pretrain.py][INFO] Epoch:[0/2](657700/4588595) loss:3.322 lr:0.0000100 epoch_Time:24993.0min: [2024-01-05 15:21:26,621][model8_pretrain.py][INFO] Epoch:[0/2](657700/4588595) loss:3.110 lr:0.0000100 epoch_Time:24993.0min: [2024-01-05 15:21:26,621][model8_pretrain.py][INFO] Epoch:[0/2](657700/4588595) loss:3.294 lr:0.0000100 epoch_Time:24993.0min: [2024-01-05 15:22:03,564][model8_pretrain.py][INFO] Epoch:[0/2](657800/4588595) loss:2.820 lr:0.0000100 epoch_Time:24992.0min: [2024-01-05 15:22:03,564][model8_pretrain.py][INFO] Epoch:[0/2](657800/4588595) loss:2.357 lr:0.0000100 epoch_Time:24992.0min: [2024-01-05 15:22:03,564][model8_pretrain.py][INFO] Epoch:[0/2](657800/4588595) loss:2.259 lr:0.0000100 epoch_Time:24992.0min: [2024-01-05 15:22:03,564][model8_pretrain.py][INFO] Epoch:[0/2](657800/4588595) loss:2.636 lr:0.0000100 epoch_Time:24992.0min: [2024-01-05 15:22:03,564][model8_pretrain.py][INFO] Epoch:[0/2](657800/4588595) loss:2.925 lr:0.0000100 epoch_Time:24992.0min: [2024-01-05 15:22:03,564][model8_pretrain.py][INFO] Epoch:[0/2](657800/4588595) loss:2.331 lr:0.0000100 epoch_Time:24992.0min: [2024-01-05 15:22:03,564][model8_pretrain.py][INFO] Epoch:[0/2](657800/4588595) loss:2.504 lr:0.0000100 epoch_Time:24992.0min: [2024-01-05 15:22:03,564][model8_pretrain.py][INFO] Epoch:[0/2](657800/4588595) loss:3.094 lr:0.0000100 epoch_Time:24992.0min: [2024-01-05 15:22:40,499][model8_pretrain.py][INFO] Epoch:[0/2](657900/4588595) loss:3.041 lr:0.0000100 epoch_Time:24991.0min: [2024-01-05 15:22:40,499][model8_pretrain.py][INFO] Epoch:[0/2](657900/4588595) loss:2.772 lr:0.0000100 epoch_Time:24991.0min: [2024-01-05 15:22:40,499][model8_pretrain.py][INFO] Epoch:[0/2](657900/4588595) loss:3.043 lr:0.0000100 
epoch_Time:24991.0min: [2024-01-05 15:22:40,499][model8_pretrain.py][INFO] Epoch:[0/2](657900/4588595) loss:2.968 lr:0.0000100 epoch_Time:24991.0min: [2024-01-05 15:22:40,499][model8_pretrain.py][INFO] Epoch:[0/2](657900/4588595) loss:2.683 lr:0.0000100 epoch_Time:24991.0min: [2024-01-05 15:22:40,499][model8_pretrain.py][INFO] Epoch:[0/2](657900/4588595) loss:2.945 lr:0.0000100 epoch_Time:24991.0min: [2024-01-05 15:22:40,499][model8_pretrain.py][INFO] Epoch:[0/2](657900/4588595) loss:2.640 lr:0.0000100 epoch_Time:24991.0min: [2024-01-05 15:22:40,499][model8_pretrain.py][INFO] Epoch:[0/2](657900/4588595) loss:2.919 lr:0.0000100 epoch_Time:24991.0min: [2024-01-05 15:23:19,147][model8_pretrain.py][INFO] Epoch:[0/2](658000/4588595) loss:3.146 lr:0.0000100 epoch_Time:24990.0min: [2024-01-05 15:23:19,147][model8_pretrain.py][INFO] Epoch:[0/2](658000/4588595) loss:3.193 lr:0.0000100 epoch_Time:24990.0min: [2024-01-05 15:23:19,147][model8_pretrain.py][INFO] Epoch:[0/2](658000/4588595) loss:2.854 lr:0.0000100 epoch_Time:24990.0min: [2024-01-05 15:23:19,147][model8_pretrain.py][INFO] Epoch:[0/2](658000/4588595) loss:3.097 lr:0.0000100 epoch_Time:24990.0min: [2024-01-05 15:23:19,147][model8_pretrain.py][INFO] Epoch:[0/2](658000/4588595) loss:3.256 lr:0.0000100 epoch_Time:24990.0min: [2024-01-05 15:23:19,147][model8_pretrain.py][INFO] Epoch:[0/2](658000/4588595) loss:2.816 lr:0.0000100 epoch_Time:24990.0min: [2024-01-05 15:23:19,148][model8_pretrain.py][INFO] Epoch:[0/2](658000/4588595) loss:3.040 lr:0.0000100 epoch_Time:24990.0min: [2024-01-05 15:23:19,147][model8_pretrain.py][INFO] Epoch:[0/2](658000/4588595) loss:3.231 lr:0.0000100 epoch_Time:24990.0min: [2024-01-05 15:24:06,040][model8_pretrain.py][INFO] Epoch:[0/2](658100/4588595) loss:2.502 lr:0.0000100 epoch_Time:24990.0min: [2024-01-05 15:24:06,040][model8_pretrain.py][INFO] Epoch:[0/2](658100/4588595) loss:2.889 lr:0.0000100 epoch_Time:24990.0min: [2024-01-05 15:24:06,040][model8_pretrain.py][INFO] 
Epoch:[0/2](658100/4588595) loss:3.021 lr:0.0000100 epoch_Time:24990.0min: [2024-01-05 15:24:06,040][model8_pretrain.py][INFO] Epoch:[0/2](658100/4588595) loss:3.041 lr:0.0000100 epoch_Time:24990.0min: [2024-01-05 15:24:06,040][model8_pretrain.py][INFO] Epoch:[0/2](658100/4588595) loss:2.889 lr:0.0000100 epoch_Time:24990.0min: [2024-01-05 15:24:06,040][model8_pretrain.py][INFO] Epoch:[0/2](658100/4588595) loss:2.943 lr:0.0000100 epoch_Time:24990.0min: [2024-01-05 15:24:06,040][model8_pretrain.py][INFO] Epoch:[0/2](658100/4588595) loss:2.795 lr:0.0000100 epoch_Time:24990.0min: [2024-01-05 15:24:06,040][model8_pretrain.py][INFO] Epoch:[0/2](658100/4588595) loss:2.410 lr:0.0000100 epoch_Time:24990.0min: [2024-01-05 15:24:42,977][model8_pretrain.py][INFO] Epoch:[0/2](658200/4588595) loss:2.946 lr:0.0000100 epoch_Time:24990.0min: [2024-01-05 15:24:42,977][model8_pretrain.py][INFO] Epoch:[0/2](658200/4588595) loss:3.028 lr:0.0000100 epoch_Time:24990.0min: [2024-01-05 15:24:42,977][model8_pretrain.py][INFO] Epoch:[0/2](658200/4588595) loss:3.236 lr:0.0000100 epoch_Time:24990.0min: [2024-01-05 15:24:42,977][model8_pretrain.py][INFO] Epoch:[0/2](658200/4588595) loss:3.260 lr:0.0000100 epoch_Time:24990.0min: [2024-01-05 15:24:42,977][model8_pretrain.py][INFO] Epoch:[0/2](658200/4588595) loss:2.703 lr:0.0000100 epoch_Time:24990.0min: [2024-01-05 15:24:42,977][model8_pretrain.py][INFO] Epoch:[0/2](658200/4588595) loss:2.966 lr:0.0000100 epoch_Time:24990.0min: [2024-01-05 15:24:42,977][model8_pretrain.py][INFO] Epoch:[0/2](658200/4588595) loss:2.347 lr:0.0000100 epoch_Time:24990.0min: [2024-01-05 15:24:42,978][model8_pretrain.py][INFO] Epoch:[0/2](658200/4588595) loss:2.706 lr:0.0000100 epoch_Time:24990.0min: [2024-01-05 15:25:19,910][model8_pretrain.py][INFO] Epoch:[0/2](658300/4588595) loss:1.959 lr:0.0000100 epoch_Time:24989.0min: [2024-01-05 15:25:19,910][model8_pretrain.py][INFO] Epoch:[0/2](658300/4588595) loss:1.978 lr:0.0000100 epoch_Time:24989.0min: [2024-01-05 
15:25:19,910][model8_pretrain.py][INFO] Epoch:[0/2](658300/4588595) loss:2.825 lr:0.0000100 epoch_Time:24989.0min: [2024-01-05 15:25:19,910][model8_pretrain.py][INFO] Epoch:[0/2](658300/4588595) loss:2.725 lr:0.0000100 epoch_Time:24989.0min: [2024-01-05 15:25:19,910][model8_pretrain.py][INFO] Epoch:[0/2](658300/4588595) loss:2.873 lr:0.0000100 epoch_Time:24989.0min: [2024-01-05 15:25:19,910][model8_pretrain.py][INFO] Epoch:[0/2](658300/4588595) loss:3.248 lr:0.0000100 epoch_Time:24989.0min: [2024-01-05 15:25:19,911][model8_pretrain.py][INFO] Epoch:[0/2](658300/4588595) loss:3.056 lr:0.0000100 epoch_Time:24989.0min: [2024-01-05 15:25:19,911][model8_pretrain.py][INFO] Epoch:[0/2](658300/4588595) loss:2.285 lr:0.0000100 epoch_Time:24989.0min: [2024-01-05 15:25:56,850][model8_pretrain.py][INFO] Epoch:[0/2](658400/4588595) loss:2.754 lr:0.0000100 epoch_Time:24988.0min: [2024-01-05 15:25:56,850][model8_pretrain.py][INFO] Epoch:[0/2](658400/4588595) loss:3.012 lr:0.0000100 epoch_Time:24988.0min: [2024-01-05 15:25:56,850][model8_pretrain.py][INFO] Epoch:[0/2](658400/4588595) loss:2.899 lr:0.0000100 epoch_Time:24988.0min: [2024-01-05 15:25:56,850][model8_pretrain.py][INFO] Epoch:[0/2](658400/4588595) loss:2.055 lr:0.0000100 epoch_Time:24988.0min: [2024-01-05 15:25:56,850][model8_pretrain.py][INFO] Epoch:[0/2](658400/4588595) loss:2.521 lr:0.0000100 epoch_Time:24988.0min: [2024-01-05 15:25:56,850][model8_pretrain.py][INFO] Epoch:[0/2](658400/4588595) loss:2.898 lr:0.0000100 epoch_Time:24988.0min: [2024-01-05 15:25:56,850][model8_pretrain.py][INFO] Epoch:[0/2](658400/4588595) loss:2.489 lr:0.0000100 epoch_Time:24988.0min: [2024-01-05 15:25:56,850][model8_pretrain.py][INFO] Epoch:[0/2](658400/4588595) loss:3.102 lr:0.0000100 epoch_Time:24988.0min: [2024-01-05 15:26:33,781][model8_pretrain.py][INFO] Epoch:[0/2](658500/4588595) loss:3.143 lr:0.0000100 epoch_Time:24988.0min: [2024-01-05 15:26:33,781][model8_pretrain.py][INFO] Epoch:[0/2](658500/4588595) loss:2.224 lr:0.0000100 
epoch_Time:24988.0min: [2024-01-05 15:26:33,781][model8_pretrain.py][INFO] Epoch:[0/2](658500/4588595) loss:2.843 lr:0.0000100 epoch_Time:24988.0min: [2024-01-05 15:26:33,781][model8_pretrain.py][INFO] Epoch:[0/2](658500/4588595) loss:2.739 lr:0.0000100 epoch_Time:24988.0min: [2024-01-05 15:26:33,781][model8_pretrain.py][INFO] Epoch:[0/2](658500/4588595) loss:2.640 lr:0.0000100 epoch_Time:24988.0min: [2024-01-05 15:26:33,781][model8_pretrain.py][INFO] Epoch:[0/2](658500/4588595) loss:2.388 lr:0.0000100 epoch_Time:24988.0min: [2024-01-05 15:26:33,781][model8_pretrain.py][INFO] Epoch:[0/2](658500/4588595) loss:3.343 lr:0.0000100 epoch_Time:24988.0min: [2024-01-05 15:26:33,781][model8_pretrain.py][INFO] Epoch:[0/2](658500/4588595) loss:3.122 lr:0.0000100 epoch_Time:24988.0min: [2024-01-05 15:27:10,719][model8_pretrain.py][INFO] Epoch:[0/2](658600/4588595) loss:2.805 lr:0.0000100 epoch_Time:24987.0min: [2024-01-05 15:27:10,719][model8_pretrain.py][INFO] Epoch:[0/2](658600/4588595) loss:3.061 lr:0.0000100 epoch_Time:24987.0min: [2024-01-05 15:27:10,719][model8_pretrain.py][INFO] Epoch:[0/2](658600/4588595) loss:2.899 lr:0.0000100 epoch_Time:24987.0min: [2024-01-05 15:27:10,719][model8_pretrain.py][INFO] Epoch:[0/2](658600/4588595) loss:2.445 lr:0.0000100 epoch_Time:24987.0min: [2024-01-05 15:27:10,719][model8_pretrain.py][INFO] Epoch:[0/2](658600/4588595) loss:3.119 lr:0.0000100 epoch_Time:24987.0min: [2024-01-05 15:27:10,719][model8_pretrain.py][INFO] Epoch:[0/2](658600/4588595) loss:2.680 lr:0.0000100 epoch_Time:24987.0min: [2024-01-05 15:27:10,719][model8_pretrain.py][INFO] Epoch:[0/2](658600/4588595) loss:2.957 lr:0.0000100 epoch_Time:24987.0min: [2024-01-05 15:27:10,719][model8_pretrain.py][INFO] Epoch:[0/2](658600/4588595) loss:2.895 lr:0.0000100 epoch_Time:24987.0min: [2024-01-05 15:27:47,640][model8_pretrain.py][INFO] Epoch:[0/2](658700/4588595) loss:2.574 lr:0.0000100 epoch_Time:24986.0min: [2024-01-05 15:27:47,640][model8_pretrain.py][INFO] 
Epoch:[0/2](658700/4588595) loss:3.059 lr:0.0000100 epoch_Time:24986.0min: [2024-01-05 15:27:47,640][model8_pretrain.py][INFO] Epoch:[0/2](658700/4588595) loss:3.098 lr:0.0000100 epoch_Time:24986.0min: [2024-01-05 15:27:47,640][model8_pretrain.py][INFO] Epoch:[0/2](658700/4588595) loss:3.062 lr:0.0000100 epoch_Time:24986.0min: [2024-01-05 15:27:47,640][model8_pretrain.py][INFO] Epoch:[0/2](658700/4588595) loss:2.668 lr:0.0000100 epoch_Time:24986.0min: [2024-01-05 15:27:47,640][model8_pretrain.py][INFO] Epoch:[0/2](658700/4588595) loss:2.523 lr:0.0000100 epoch_Time:24986.0min: [2024-01-05 15:27:47,640][model8_pretrain.py][INFO] Epoch:[0/2](658700/4588595) loss:2.232 lr:0.0000100 epoch_Time:24986.0min: [2024-01-05 15:27:47,640][model8_pretrain.py][INFO] Epoch:[0/2](658700/4588595) loss:3.249 lr:0.0000100 epoch_Time:24986.0min: [2024-01-05 15:28:24,577][model8_pretrain.py][INFO] Epoch:[0/2](658800/4588595) loss:3.196 lr:0.0000100 epoch_Time:24986.0min: [2024-01-05 15:28:24,577][model8_pretrain.py][INFO] Epoch:[0/2](658800/4588595) loss:3.100 lr:0.0000100 epoch_Time:24986.0min: [2024-01-05 15:28:24,577][model8_pretrain.py][INFO] Epoch:[0/2](658800/4588595) loss:2.669 lr:0.0000100 epoch_Time:24986.0min: [2024-01-05 15:28:24,577][model8_pretrain.py][INFO] Epoch:[0/2](658800/4588595) loss:2.799 lr:0.0000100 epoch_Time:24986.0min: [2024-01-05 15:28:24,577][model8_pretrain.py][INFO] Epoch:[0/2](658800/4588595) loss:3.045 lr:0.0000100 epoch_Time:24986.0min: [2024-01-05 15:28:24,577][model8_pretrain.py][INFO] Epoch:[0/2](658800/4588595) loss:2.451 lr:0.0000100 epoch_Time:24986.0min: [2024-01-05 15:28:24,577][model8_pretrain.py][INFO] Epoch:[0/2](658800/4588595) loss:2.728 lr:0.0000100 epoch_Time:24986.0min: [2024-01-05 15:28:26,280][model8_pretrain.py][INFO] Epoch:[0/2](658800/4588595) loss:2.878 lr:0.0000100 epoch_Time:24986.0min: [2024-01-05 15:29:13,808][model8_pretrain.py][INFO] Epoch:[0/2](658900/4588595) loss:2.329 lr:0.0000100 epoch_Time:24986.0min: [2024-01-05 
15:29:13,808][model8_pretrain.py][INFO] Epoch:[0/2](658900/4588595) loss:2.575 lr:0.0000100 epoch_Time:24986.0min: [2024-01-05 15:29:13,808][model8_pretrain.py][INFO] Epoch:[0/2](658900/4588595) loss:3.519 lr:0.0000100 epoch_Time:24986.0min: [2024-01-05 15:29:13,808][model8_pretrain.py][INFO] Epoch:[0/2](658900/4588595) loss:2.929 lr:0.0000100 epoch_Time:24986.0min: [2024-01-05 15:29:13,808][model8_pretrain.py][INFO] Epoch:[0/2](658900/4588595) loss:3.284 lr:0.0000100 epoch_Time:24986.0min: [2024-01-05 15:29:13,808][model8_pretrain.py][INFO] Epoch:[0/2](658900/4588595) loss:2.846 lr:0.0000100 epoch_Time:24986.0min: [2024-01-05 15:29:13,808][model8_pretrain.py][INFO] Epoch:[0/2](658900/4588595) loss:3.053 lr:0.0000100 epoch_Time:24986.0min: [2024-01-05 15:29:13,808][model8_pretrain.py][INFO] Epoch:[0/2](658900/4588595) loss:2.434 lr:0.0000100 epoch_Time:24986.0min: [2024-01-05 15:29:50,727][model8_pretrain.py][INFO] Epoch:[0/2](659000/4588595) loss:3.008 lr:0.0000100 epoch_Time:24985.0min: [2024-01-05 15:29:50,727][model8_pretrain.py][INFO] Epoch:[0/2](659000/4588595) loss:2.449 lr:0.0000100 epoch_Time:24985.0min: [2024-01-05 15:29:50,727][model8_pretrain.py][INFO] Epoch:[0/2](659000/4588595) loss:2.738 lr:0.0000100 epoch_Time:24985.0min: [2024-01-05 15:29:50,727][model8_pretrain.py][INFO] Epoch:[0/2](659000/4588595) loss:3.004 lr:0.0000100 epoch_Time:24985.0min: [2024-01-05 15:29:50,727][model8_pretrain.py][INFO] Epoch:[0/2](659000/4588595) loss:2.871 lr:0.0000100 epoch_Time:24985.0min: [2024-01-05 15:29:50,727][model8_pretrain.py][INFO] Epoch:[0/2](659000/4588595) loss:2.429 lr:0.0000100 epoch_Time:24985.0min: [2024-01-05 15:29:50,727][model8_pretrain.py][INFO] Epoch:[0/2](659000/4588595) loss:2.084 lr:0.0000100 epoch_Time:24985.0min: [2024-01-05 15:29:50,727][model8_pretrain.py][INFO] Epoch:[0/2](659000/4588595) loss:3.048 lr:0.0000100 epoch_Time:24985.0min: [2024-01-05 15:30:27,670][model8_pretrain.py][INFO] Epoch:[0/2](659100/4588595) loss:2.783 lr:0.0000100 
epoch_Time:24985.0min: [2024-01-05 15:30:27,670][model8_pretrain.py][INFO] Epoch:[0/2](659100/4588595) loss:3.088 lr:0.0000100 epoch_Time:24985.0min: [2024-01-05 15:30:27,670][model8_pretrain.py][INFO] Epoch:[0/2](659100/4588595) loss:3.126 lr:0.0000100 epoch_Time:24985.0min: [2024-01-05 15:30:27,670][model8_pretrain.py][INFO] Epoch:[0/2](659100/4588595) loss:2.851 lr:0.0000100 epoch_Time:24985.0min: [2024-01-05 15:30:27,670][model8_pretrain.py][INFO] Epoch:[0/2](659100/4588595) loss:2.618 lr:0.0000100 epoch_Time:24985.0min: [2024-01-05 15:30:27,670][model8_pretrain.py][INFO] Epoch:[0/2](659100/4588595) loss:2.954 lr:0.0000100 epoch_Time:24985.0min: [2024-01-05 15:30:27,670][model8_pretrain.py][INFO] Epoch:[0/2](659100/4588595) loss:3.014 lr:0.0000100 epoch_Time:24985.0min: [2024-01-05 15:30:27,670][model8_pretrain.py][INFO] Epoch:[0/2](659100/4588595) loss:2.786 lr:0.0000100 epoch_Time:24985.0min: [2024-01-05 15:31:04,614][model8_pretrain.py][INFO] Epoch:[0/2](659200/4588595) loss:2.411 lr:0.0000100 epoch_Time:24983.0min: [2024-01-05 15:31:04,614][model8_pretrain.py][INFO] Epoch:[0/2](659200/4588595) loss:2.832 lr:0.0000100 epoch_Time:24983.0min: [2024-01-05 15:31:04,614][model8_pretrain.py][INFO] Epoch:[0/2](659200/4588595) loss:3.025 lr:0.0000100 epoch_Time:24983.0min: [2024-01-05 15:31:04,614][model8_pretrain.py][INFO] Epoch:[0/2](659200/4588595) loss:3.075 lr:0.0000100 epoch_Time:24983.0min: [2024-01-05 15:31:04,614][model8_pretrain.py][INFO] Epoch:[0/2](659200/4588595) loss:3.483 lr:0.0000100 epoch_Time:24983.0min: [2024-01-05 15:31:04,614][model8_pretrain.py][INFO] Epoch:[0/2](659200/4588595) loss:3.118 lr:0.0000100 epoch_Time:24983.0min: [2024-01-05 15:31:04,614][model8_pretrain.py][INFO] Epoch:[0/2](659200/4588595) loss:2.370 lr:0.0000100 epoch_Time:24983.0min: [2024-01-05 15:31:04,614][model8_pretrain.py][INFO] Epoch:[0/2](659200/4588595) loss:3.320 lr:0.0000100 epoch_Time:24983.0min: [2024-01-05 15:31:41,555][model8_pretrain.py][INFO] 
Epoch:[0/2](659300/4588595) loss:2.935 lr:0.0000100 epoch_Time:24983.0min: [2024-01-05 15:31:41,555][model8_pretrain.py][INFO] Epoch:[0/2](659300/4588595) loss:2.999 lr:0.0000100 epoch_Time:24983.0min: [2024-01-05 15:31:41,555][model8_pretrain.py][INFO] Epoch:[0/2](659300/4588595) loss:3.035 lr:0.0000100 epoch_Time:24983.0min: [2024-01-05 15:31:41,555][model8_pretrain.py][INFO] Epoch:[0/2](659300/4588595) loss:2.716 lr:0.0000100 epoch_Time:24983.0min: [2024-01-05 15:31:41,555][model8_pretrain.py][INFO] Epoch:[0/2](659300/4588595) loss:2.047 lr:0.0000100 epoch_Time:24983.0min: [2024-01-05 15:31:41,555][model8_pretrain.py][INFO] Epoch:[0/2](659300/4588595) loss:3.163 lr:0.0000100 epoch_Time:24983.0min: [2024-01-05 15:31:41,555][model8_pretrain.py][INFO] Epoch:[0/2](659300/4588595) loss:2.690 lr:0.0000100 epoch_Time:24983.0min: [2024-01-05 15:31:41,555][model8_pretrain.py][INFO] Epoch:[0/2](659300/4588595) loss:2.820 lr:0.0000100 epoch_Time:24983.0min: [2024-01-05 15:32:18,501][model8_pretrain.py][INFO] Epoch:[0/2](659400/4588595) loss:2.966 lr:0.0000100 epoch_Time:24982.0min: [2024-01-05 15:32:18,501][model8_pretrain.py][INFO] Epoch:[0/2](659400/4588595) loss:3.466 lr:0.0000100 epoch_Time:24982.0min: [2024-01-05 15:32:18,501][model8_pretrain.py][INFO] Epoch:[0/2](659400/4588595) loss:2.323 lr:0.0000100 epoch_Time:24982.0min: [2024-01-05 15:32:18,501][model8_pretrain.py][INFO] Epoch:[0/2](659400/4588595) loss:2.703 lr:0.0000100 epoch_Time:24982.0min: [2024-01-05 15:32:18,501][model8_pretrain.py][INFO] Epoch:[0/2](659400/4588595) loss:3.126 lr:0.0000100 epoch_Time:24982.0min: [2024-01-05 15:32:18,501][model8_pretrain.py][INFO] Epoch:[0/2](659400/4588595) loss:2.927 lr:0.0000100 epoch_Time:24982.0min: [2024-01-05 15:32:18,501][model8_pretrain.py][INFO] Epoch:[0/2](659400/4588595) loss:3.429 lr:0.0000100 epoch_Time:24982.0min: [2024-01-05 15:32:18,501][model8_pretrain.py][INFO] Epoch:[0/2](659400/4588595) loss:3.211 lr:0.0000100 epoch_Time:24982.0min: [2024-01-05 
15:32:55,438][model8_pretrain.py][INFO] Epoch:[0/2](659500/4588595) loss:2.537 lr:0.0000100 epoch_Time:24981.0min: [2024-01-05 15:32:55,438][model8_pretrain.py][INFO] Epoch:[0/2](659500/4588595) loss:2.851 lr:0.0000100 epoch_Time:24981.0min: [2024-01-05 15:32:55,438][model8_pretrain.py][INFO] Epoch:[0/2](659500/4588595) loss:2.989 lr:0.0000100 epoch_Time:24981.0min: [2024-01-05 15:32:55,438][model8_pretrain.py][INFO] Epoch:[0/2](659500/4588595) loss:2.747 lr:0.0000100 epoch_Time:24981.0min: [2024-01-05 15:32:55,438][model8_pretrain.py][INFO] Epoch:[0/2](659500/4588595) loss:2.665 lr:0.0000100 epoch_Time:24981.0min: [2024-01-05 15:32:55,438][model8_pretrain.py][INFO] Epoch:[0/2](659500/4588595) loss:2.753 lr:0.0000100 epoch_Time:24981.0min: [2024-01-05 15:32:55,438][model8_pretrain.py][INFO] Epoch:[0/2](659500/4588595) loss:2.926 lr:0.0000100 epoch_Time:24981.0min: [2024-01-05 15:32:55,438][model8_pretrain.py][INFO] Epoch:[0/2](659500/4588595) loss:2.493 lr:0.0000100 epoch_Time:24981.0min: [2024-01-05 15:33:32,380][model8_pretrain.py][INFO] Epoch:[0/2](659600/4588595) loss:2.813 lr:0.0000100 epoch_Time:24981.0min: [2024-01-05 15:33:32,380][model8_pretrain.py][INFO] Epoch:[0/2](659600/4588595) loss:2.541 lr:0.0000100 epoch_Time:24981.0min: [2024-01-05 15:33:32,380][model8_pretrain.py][INFO] Epoch:[0/2](659600/4588595) loss:2.693 lr:0.0000100 epoch_Time:24981.0min: [2024-01-05 15:33:32,380][model8_pretrain.py][INFO] Epoch:[0/2](659600/4588595) loss:3.259 lr:0.0000100 epoch_Time:24981.0min: [2024-01-05 15:33:32,381][model8_pretrain.py][INFO] Epoch:[0/2](659600/4588595) loss:2.876 lr:0.0000100 epoch_Time:24981.0min: [2024-01-05 15:33:32,381][model8_pretrain.py][INFO] Epoch:[0/2](659600/4588595) loss:2.561 lr:0.0000100 epoch_Time:24981.0min: [2024-01-05 15:33:32,381][model8_pretrain.py][INFO] Epoch:[0/2](659600/4588595) loss:3.033 lr:0.0000100 epoch_Time:24981.0min: [2024-01-05 15:33:32,381][model8_pretrain.py][INFO] Epoch:[0/2](659600/4588595) loss:3.135 lr:0.0000100 
epoch_Time:24981.0min: [2024-01-05 15:34:21,024][model8_pretrain.py][INFO] Epoch:[0/2](659700/4588595) loss:3.363 lr:0.0000100 epoch_Time:24981.0min: [2024-01-05 15:34:21,024][model8_pretrain.py][INFO] Epoch:[0/2](659700/4588595) loss:2.735 lr:0.0000100 epoch_Time:24981.0min: [2024-01-05 15:34:21,024][model8_pretrain.py][INFO] Epoch:[0/2](659700/4588595) loss:2.718 lr:0.0000100 epoch_Time:24981.0min: [2024-01-05 15:34:21,024][model8_pretrain.py][INFO] Epoch:[0/2](659700/4588595) loss:2.779 lr:0.0000100 epoch_Time:24981.0min: [2024-01-05 15:34:21,024][model8_pretrain.py][INFO] Epoch:[0/2](659700/4588595) loss:3.140 lr:0.0000100 epoch_Time:24981.0min: [2024-01-05 15:34:21,024][model8_pretrain.py][INFO] Epoch:[0/2](659700/4588595) loss:2.958 lr:0.0000100 epoch_Time:24981.0min: [2024-01-05 15:34:21,025][model8_pretrain.py][INFO] Epoch:[0/2](659700/4588595) loss:2.453 lr:0.0000100 epoch_Time:24981.0min: [2024-01-05 15:34:21,025][model8_pretrain.py][INFO] Epoch:[0/2](659700/4588595) loss:2.879 lr:0.0000100 epoch_Time:24981.0min: [2024-01-05 15:34:57,972][model8_pretrain.py][INFO] Epoch:[0/2](659800/4588595) loss:2.683 lr:0.0000100 epoch_Time:24980.0min: [2024-01-05 15:34:57,972][model8_pretrain.py][INFO] Epoch:[0/2](659800/4588595) loss:2.884 lr:0.0000100 epoch_Time:24980.0min: [2024-01-05 15:34:57,972][model8_pretrain.py][INFO] Epoch:[0/2](659800/4588595) loss:2.957 lr:0.0000100 epoch_Time:24980.0min: [2024-01-05 15:34:57,973][model8_pretrain.py][INFO] Epoch:[0/2](659800/4588595) loss:2.889 lr:0.0000100 epoch_Time:24980.0min: [2024-01-05 15:34:57,972][model8_pretrain.py][INFO] Epoch:[0/2](659800/4588595) loss:3.168 lr:0.0000100 epoch_Time:24980.0min: [2024-01-05 15:34:57,973][model8_pretrain.py][INFO] Epoch:[0/2](659800/4588595) loss:2.897 lr:0.0000100 epoch_Time:24980.0min: [2024-01-05 15:34:57,973][model8_pretrain.py][INFO] Epoch:[0/2](659800/4588595) loss:3.255 lr:0.0000100 epoch_Time:24980.0min: [2024-01-05 15:34:57,973][model8_pretrain.py][INFO] 
Epoch:[0/2](659800/4588595) loss:2.384 lr:0.0000100 epoch_Time:24980.0min: [2024-01-05 15:35:34,928][model8_pretrain.py][INFO] Epoch:[0/2](659900/4588595) loss:2.175 lr:0.0000100 epoch_Time:24980.0min: [2024-01-05 15:35:34,928][model8_pretrain.py][INFO] Epoch:[0/2](659900/4588595) loss:2.835 lr:0.0000100 epoch_Time:24980.0min: [2024-01-05 15:35:34,928][model8_pretrain.py][INFO] Epoch:[0/2](659900/4588595) loss:2.797 lr:0.0000100 epoch_Time:24980.0min: [2024-01-05 15:35:34,928][model8_pretrain.py][INFO] Epoch:[0/2](659900/4588595) loss:2.647 lr:0.0000100 epoch_Time:24980.0min: [2024-01-05 15:35:34,928][model8_pretrain.py][INFO] Epoch:[0/2](659900/4588595) loss:2.790 lr:0.0000100 epoch_Time:24980.0min: [2024-01-05 15:35:34,928][model8_pretrain.py][INFO] Epoch:[0/2](659900/4588595) loss:2.916 lr:0.0000100 epoch_Time:24980.0min: [2024-01-05 15:35:34,928][model8_pretrain.py][INFO] Epoch:[0/2](659900/4588595) loss:2.869 lr:0.0000100 epoch_Time:24980.0min: [2024-01-05 15:35:34,928][model8_pretrain.py][INFO] Epoch:[0/2](659900/4588595) loss:2.909 lr:0.0000100 epoch_Time:24980.0min: [2024-01-05 15:36:11,888][model8_pretrain.py][INFO] Epoch:[0/2](660000/4588595) loss:3.247 lr:0.0000100 epoch_Time:24979.0min: [2024-01-05 15:36:11,888][model8_pretrain.py][INFO] Epoch:[0/2](660000/4588595) loss:2.732 lr:0.0000100 epoch_Time:24979.0min: [2024-01-05 15:36:11,888][model8_pretrain.py][INFO] Epoch:[0/2](660000/4588595) loss:3.143 lr:0.0000100 epoch_Time:24979.0min: [2024-01-05 15:36:11,888][model8_pretrain.py][INFO] Epoch:[0/2](660000/4588595) loss:2.765 lr:0.0000100 epoch_Time:24979.0min: [2024-01-05 15:36:11,888][model8_pretrain.py][INFO] Epoch:[0/2](660000/4588595) loss:2.870 lr:0.0000100 epoch_Time:24979.0min: [2024-01-05 15:36:11,888][model8_pretrain.py][INFO] Epoch:[0/2](660000/4588595) loss:3.290 lr:0.0000100 epoch_Time:24979.0min: [2024-01-05 15:36:11,889][model8_pretrain.py][INFO] Epoch:[0/2](660000/4588595) loss:3.172 lr:0.0000100 epoch_Time:24979.0min: [2024-01-05 
15:36:11,889][model8_pretrain.py][INFO] Epoch:[0/2](660000/4588595) loss:3.436 lr:0.0000100 epoch_Time:24979.0min: [2024-01-05 15:36:48,836][model8_pretrain.py][INFO] Epoch:[0/2](660100/4588595) loss:2.379 lr:0.0000100 epoch_Time:24977.0min: [2024-01-05 15:36:48,836][model8_pretrain.py][INFO] Epoch:[0/2](660100/4588595) loss:3.079 lr:0.0000100 epoch_Time:24977.0min: [2024-01-05 15:36:48,836][model8_pretrain.py][INFO] Epoch:[0/2](660100/4588595) loss:2.720 lr:0.0000100 epoch_Time:24977.0min: [2024-01-05 15:36:48,836][model8_pretrain.py][INFO] Epoch:[0/2](660100/4588595) loss:3.117 lr:0.0000100 epoch_Time:24977.0min: [2024-01-05 15:36:48,836][model8_pretrain.py][INFO] Epoch:[0/2](660100/4588595) loss:3.185 lr:0.0000100 epoch_Time:24977.0min: [2024-01-05 15:36:48,836][model8_pretrain.py][INFO] Epoch:[0/2](660100/4588595) loss:3.160 lr:0.0000100 epoch_Time:24977.0min: [2024-01-05 15:36:48,836][model8_pretrain.py][INFO] Epoch:[0/2](660100/4588595) loss:2.657 lr:0.0000100 epoch_Time:24977.0min: [2024-01-05 15:36:48,836][model8_pretrain.py][INFO] Epoch:[0/2](660100/4588595) loss:3.263 lr:0.0000100 epoch_Time:24977.0min: [2024-01-05 15:37:25,778][model8_pretrain.py][INFO] Epoch:[0/2](660200/4588595) loss:2.957 lr:0.0000100 epoch_Time:24977.0min: [2024-01-05 15:37:25,778][model8_pretrain.py][INFO] Epoch:[0/2](660200/4588595) loss:2.981 lr:0.0000100 epoch_Time:24977.0min: [2024-01-05 15:37:25,778][model8_pretrain.py][INFO] Epoch:[0/2](660200/4588595) loss:2.697 lr:0.0000100 epoch_Time:24977.0min: [2024-01-05 15:37:25,778][model8_pretrain.py][INFO] Epoch:[0/2](660200/4588595) loss:2.919 lr:0.0000100 epoch_Time:24977.0min: [2024-01-05 15:37:25,778][model8_pretrain.py][INFO] Epoch:[0/2](660200/4588595) loss:2.911 lr:0.0000100 epoch_Time:24977.0min: [2024-01-05 15:37:25,778][model8_pretrain.py][INFO] Epoch:[0/2](660200/4588595) loss:2.909 lr:0.0000100 epoch_Time:24977.0min: [2024-01-05 15:37:25,778][model8_pretrain.py][INFO] Epoch:[0/2](660200/4588595) loss:2.788 lr:0.0000100 
epoch_Time:24977.0min: [2024-01-05 15:37:25,778][model8_pretrain.py][INFO] Epoch:[0/2](660200/4588595) loss:2.622 lr:0.0000100 epoch_Time:24977.0min: [2024-01-05 15:38:02,725][model8_pretrain.py][INFO] Epoch:[0/2](660300/4588595) loss:2.737 lr:0.0000100 epoch_Time:24976.0min: [2024-01-05 15:38:02,725][model8_pretrain.py][INFO] Epoch:[0/2](660300/4588595) loss:2.804 lr:0.0000100 epoch_Time:24976.0min: [2024-01-05 15:38:02,725][model8_pretrain.py][INFO] Epoch:[0/2](660300/4588595) loss:2.510 lr:0.0000100 epoch_Time:24976.0min: [2024-01-05 15:38:02,725][model8_pretrain.py][INFO] Epoch:[0/2](660300/4588595) loss:2.546 lr:0.0000100 epoch_Time:24976.0min: [2024-01-05 15:38:02,725][model8_pretrain.py][INFO] Epoch:[0/2](660300/4588595) loss:3.462 lr:0.0000100 epoch_Time:24976.0min: [2024-01-05 15:38:02,725][model8_pretrain.py][INFO] Epoch:[0/2](660300/4588595) loss:2.712 lr:0.0000100 epoch_Time:24976.0min: [2024-01-05 15:38:02,725][model8_pretrain.py][INFO] Epoch:[0/2](660300/4588595) loss:3.175 lr:0.0000100 epoch_Time:24976.0min: [2024-01-05 15:38:02,725][model8_pretrain.py][INFO] Epoch:[0/2](660300/4588595) loss:3.122 lr:0.0000100 epoch_Time:24976.0min: [2024-01-05 15:38:39,677][model8_pretrain.py][INFO] Epoch:[0/2](660400/4588595) loss:2.933 lr:0.0000100 epoch_Time:24976.0min: [2024-01-05 15:38:39,677][model8_pretrain.py][INFO] Epoch:[0/2](660400/4588595) loss:2.510 lr:0.0000100 epoch_Time:24976.0min: [2024-01-05 15:38:39,677][model8_pretrain.py][INFO] Epoch:[0/2](660400/4588595) loss:2.647 lr:0.0000100 epoch_Time:24976.0min: [2024-01-05 15:38:39,677][model8_pretrain.py][INFO] Epoch:[0/2](660400/4588595) loss:2.833 lr:0.0000100 epoch_Time:24976.0min: [2024-01-05 15:38:39,677][model8_pretrain.py][INFO] Epoch:[0/2](660400/4588595) loss:2.875 lr:0.0000100 epoch_Time:24976.0min: [2024-01-05 15:38:39,677][model8_pretrain.py][INFO] Epoch:[0/2](660400/4588595) loss:3.263 lr:0.0000100 epoch_Time:24976.0min: [2024-01-05 15:38:39,677][model8_pretrain.py][INFO] 
Epoch:[0/2](660400/4588595) loss:3.124 lr:0.0000100 epoch_Time:24976.0min: [2024-01-05 15:38:39,677][model8_pretrain.py][INFO] Epoch:[0/2](660400/4588595) loss:2.574 lr:0.0000100 epoch_Time:24976.0min: [2024-01-05 15:39:28,474][model8_pretrain.py][INFO] Epoch:[0/2](660500/4588595) loss:3.050 lr:0.0000100 epoch_Time:24976.0min: [2024-01-05 15:39:28,474][model8_pretrain.py][INFO] Epoch:[0/2](660500/4588595) loss:2.787 lr:0.0000100 epoch_Time:24976.0min: [2024-01-05 15:39:28,474][model8_pretrain.py][INFO] Epoch:[0/2](660500/4588595) loss:3.078 lr:0.0000100 epoch_Time:24976.0min: [2024-01-05 15:39:28,474][model8_pretrain.py][INFO] Epoch:[0/2](660500/4588595) loss:2.523 lr:0.0000100 epoch_Time:24976.0min: [2024-01-05 15:39:28,474][model8_pretrain.py][INFO] Epoch:[0/2](660500/4588595) loss:3.249 lr:0.0000100 epoch_Time:24976.0min: [2024-01-05 15:39:28,474][model8_pretrain.py][INFO] Epoch:[0/2](660500/4588595) loss:2.814 lr:0.0000100 epoch_Time:24976.0min: [2024-01-05 15:39:28,474][model8_pretrain.py][INFO] Epoch:[0/2](660500/4588595) loss:2.808 lr:0.0000100 epoch_Time:24976.0min: [2024-01-05 15:39:28,474][model8_pretrain.py][INFO] Epoch:[0/2](660500/4588595) loss:2.760 lr:0.0000100 epoch_Time:24976.0min: [2024-01-05 15:40:05,431][model8_pretrain.py][INFO] Epoch:[0/2](660600/4588595) loss:2.572 lr:0.0000100 epoch_Time:24975.0min: [2024-01-05 15:40:05,431][model8_pretrain.py][INFO] Epoch:[0/2](660600/4588595) loss:2.433 lr:0.0000100 epoch_Time:24975.0min: [2024-01-05 15:40:05,431][model8_pretrain.py][INFO] Epoch:[0/2](660600/4588595) loss:2.318 lr:0.0000100 epoch_Time:24975.0min: [2024-01-05 15:40:05,431][model8_pretrain.py][INFO] Epoch:[0/2](660600/4588595) loss:2.902 lr:0.0000100 epoch_Time:24975.0min: [2024-01-05 15:40:05,431][model8_pretrain.py][INFO] Epoch:[0/2](660600/4588595) loss:3.205 lr:0.0000100 epoch_Time:24975.0min: [2024-01-05 15:40:05,431][model8_pretrain.py][INFO] Epoch:[0/2](660600/4588595) loss:2.472 lr:0.0000100 epoch_Time:24975.0min: [2024-01-05 
15:40:05,431][model8_pretrain.py][INFO] Epoch:[0/2](660600/4588595) loss:2.815 lr:0.0000100 epoch_Time:24975.0min: [2024-01-05 15:40:05,431][model8_pretrain.py][INFO] Epoch:[0/2](660600/4588595) loss:3.324 lr:0.0000100 epoch_Time:24975.0min: [2024-01-05 15:40:42,388][model8_pretrain.py][INFO] Epoch:[0/2](660700/4588595) loss:2.070 lr:0.0000100 epoch_Time:24975.0min: [2024-01-05 15:40:42,388][model8_pretrain.py][INFO] Epoch:[0/2](660700/4588595) loss:2.750 lr:0.0000100 epoch_Time:24975.0min: [2024-01-05 15:40:42,388][model8_pretrain.py][INFO] Epoch:[0/2](660700/4588595) loss:3.233 lr:0.0000100 epoch_Time:24975.0min: [2024-01-05 15:40:42,388][model8_pretrain.py][INFO] Epoch:[0/2](660700/4588595) loss:3.138 lr:0.0000100 epoch_Time:24975.0min: [2024-01-05 15:40:42,388][model8_pretrain.py][INFO] Epoch:[0/2](660700/4588595) loss:2.892 lr:0.0000100 epoch_Time:24975.0min: [2024-01-05 15:40:42,388][model8_pretrain.py][INFO] Epoch:[0/2](660700/4588595) loss:3.151 lr:0.0000100 epoch_Time:24975.0min: [2024-01-05 15:40:42,388][model8_pretrain.py][INFO] Epoch:[0/2](660700/4588595) loss:2.516 lr:0.0000100 epoch_Time:24975.0min: [2024-01-05 15:40:42,388][model8_pretrain.py][INFO] Epoch:[0/2](660700/4588595) loss:2.923 lr:0.0000100 epoch_Time:24975.0min: [2024-01-05 15:41:19,346][model8_pretrain.py][INFO] Epoch:[0/2](660800/4588595) loss:3.187 lr:0.0000100 epoch_Time:24974.0min: [2024-01-05 15:41:19,346][model8_pretrain.py][INFO] Epoch:[0/2](660800/4588595) loss:2.507 lr:0.0000100 epoch_Time:24974.0min: [2024-01-05 15:41:19,346][model8_pretrain.py][INFO] Epoch:[0/2](660800/4588595) loss:3.059 lr:0.0000100 epoch_Time:24974.0min: [2024-01-05 15:41:19,346][model8_pretrain.py][INFO] Epoch:[0/2](660800/4588595) loss:2.143 lr:0.0000100 epoch_Time:24974.0min: [2024-01-05 15:41:19,346][model8_pretrain.py][INFO] Epoch:[0/2](660800/4588595) loss:2.882 lr:0.0000100 epoch_Time:24974.0min: [2024-01-05 15:41:19,346][model8_pretrain.py][INFO] Epoch:[0/2](660800/4588595) loss:3.112 lr:0.0000100 
epoch_Time:24974.0min: [2024-01-05 15:41:19,346][model8_pretrain.py][INFO] Epoch:[0/2](660800/4588595) loss:3.066 lr:0.0000100 epoch_Time:24974.0min: [2024-01-05 15:41:19,346][model8_pretrain.py][INFO] Epoch:[0/2](660800/4588595) loss:3.332 lr:0.0000100 epoch_Time:24974.0min: [2024-01-05 15:41:56,294][model8_pretrain.py][INFO] Epoch:[0/2](660900/4588595) loss:2.756 lr:0.0000100 epoch_Time:24973.0min: [2024-01-05 15:41:56,294][model8_pretrain.py][INFO] Epoch:[0/2](660900/4588595) loss:2.723 lr:0.0000100 epoch_Time:24973.0min: [2024-01-05 15:41:56,294][model8_pretrain.py][INFO] Epoch:[0/2](660900/4588595) loss:2.999 lr:0.0000100 epoch_Time:24973.0min: [2024-01-05 15:41:56,294][model8_pretrain.py][INFO] Epoch:[0/2](660900/4588595) loss:2.762 lr:0.0000100 epoch_Time:24973.0min: [2024-01-05 15:41:56,294][model8_pretrain.py][INFO] Epoch:[0/2](660900/4588595) loss:2.387 lr:0.0000100 epoch_Time:24973.0min: [2024-01-05 15:41:56,294][model8_pretrain.py][INFO] Epoch:[0/2](660900/4588595) loss:2.716 lr:0.0000100 epoch_Time:24973.0min: [2024-01-05 15:41:56,294][model8_pretrain.py][INFO] Epoch:[0/2](660900/4588595) loss:2.313 lr:0.0000100 epoch_Time:24973.0min: [2024-01-05 15:41:56,294][model8_pretrain.py][INFO] Epoch:[0/2](660900/4588595) loss:2.305 lr:0.0000100 epoch_Time:24973.0min: [2024-01-05 15:42:33,253][model8_pretrain.py][INFO] Epoch:[0/2](661000/4588595) loss:2.547 lr:0.0000100 epoch_Time:24973.0min: [2024-01-05 15:42:33,254][model8_pretrain.py][INFO] Epoch:[0/2](661000/4588595) loss:3.055 lr:0.0000100 epoch_Time:24973.0min: [2024-01-05 15:42:33,254][model8_pretrain.py][INFO] Epoch:[0/2](661000/4588595) loss:2.481 lr:0.0000100 epoch_Time:24973.0min: [2024-01-05 15:42:33,254][model8_pretrain.py][INFO] Epoch:[0/2](661000/4588595) loss:3.309 lr:0.0000100 epoch_Time:24973.0min: [2024-01-05 15:42:33,254][model8_pretrain.py][INFO] Epoch:[0/2](661000/4588595) loss:2.643 lr:0.0000100 epoch_Time:24973.0min: [2024-01-05 15:42:33,254][model8_pretrain.py][INFO] 
Epoch:[0/2](661000/4588595) loss:3.275 lr:0.0000100 epoch_Time:24973.0min: [2024-01-05 15:42:33,254][model8_pretrain.py][INFO] Epoch:[0/2](661000/4588595) loss:3.157 lr:0.0000100 epoch_Time:24973.0min: [2024-01-05 15:42:33,254][model8_pretrain.py][INFO] Epoch:[0/2](661000/4588595) loss:2.741 lr:0.0000100 epoch_Time:24973.0min: [2024-01-05 15:43:10,202][model8_pretrain.py][INFO] Epoch:[0/2](661100/4588595) loss:3.091 lr:0.0000100 epoch_Time:24971.0min: [2024-01-05 15:43:10,202][model8_pretrain.py][INFO] Epoch:[0/2](661100/4588595) loss:2.657 lr:0.0000100 epoch_Time:24971.0min: [2024-01-05 15:43:10,202][model8_pretrain.py][INFO] Epoch:[0/2](661100/4588595) loss:2.706 lr:0.0000100 epoch_Time:24971.0min: [2024-01-05 15:43:10,202][model8_pretrain.py][INFO] Epoch:[0/2](661100/4588595) loss:2.716 lr:0.0000100 epoch_Time:24971.0min: [2024-01-05 15:43:10,202][model8_pretrain.py][INFO] Epoch:[0/2](661100/4588595) loss:3.004 lr:0.0000100 epoch_Time:24971.0min: [2024-01-05 15:43:10,202][model8_pretrain.py][INFO] Epoch:[0/2](661100/4588595) loss:2.546 lr:0.0000100 epoch_Time:24971.0min: [2024-01-05 15:43:10,202][model8_pretrain.py][INFO] Epoch:[0/2](661100/4588595) loss:2.592 lr:0.0000100 epoch_Time:24971.0min: [2024-01-05 15:43:10,203][model8_pretrain.py][INFO] Epoch:[0/2](661100/4588595) loss:2.393 lr:0.0000100 epoch_Time:24971.0min: [2024-01-05 15:43:47,151][model8_pretrain.py][INFO] Epoch:[0/2](661200/4588595) loss:2.636 lr:0.0000100 epoch_Time:24971.0min: [2024-01-05 15:43:47,151][model8_pretrain.py][INFO] Epoch:[0/2](661200/4588595) loss:3.051 lr:0.0000100 epoch_Time:24971.0min: [2024-01-05 15:43:47,151][model8_pretrain.py][INFO] Epoch:[0/2](661200/4588595) loss:2.116 lr:0.0000100 epoch_Time:24971.0min: [2024-01-05 15:43:47,151][model8_pretrain.py][INFO] Epoch:[0/2](661200/4588595) loss:2.750 lr:0.0000100 epoch_Time:24971.0min: [2024-01-05 15:43:47,151][model8_pretrain.py][INFO] Epoch:[0/2](661200/4588595) loss:2.646 lr:0.0000100 epoch_Time:24971.0min: [2024-01-05 
15:43:47,151][model8_pretrain.py][INFO] Epoch:[0/2](661200/4588595) loss:2.442 lr:0.0000100 epoch_Time:24971.0min: [2024-01-05 15:43:47,151][model8_pretrain.py][INFO] Epoch:[0/2](661200/4588595) loss:2.608 lr:0.0000100 epoch_Time:24971.0min: [2024-01-05 15:43:47,151][model8_pretrain.py][INFO] Epoch:[0/2](661200/4588595) loss:2.322 lr:0.0000100 epoch_Time:24971.0min: [2024-01-05 15:44:36,331][model8_pretrain.py][INFO] Epoch:[0/2](661300/4588595) loss:2.718 lr:0.0000100 epoch_Time:24972.0min: [2024-01-05 15:44:36,331][model8_pretrain.py][INFO] Epoch:[0/2](661300/4588595) loss:2.264 lr:0.0000100 epoch_Time:24972.0min: [2024-01-05 15:44:36,331][model8_pretrain.py][INFO] Epoch:[0/2](661300/4588595) loss:2.331 lr:0.0000100 epoch_Time:24972.0min: [2024-01-05 15:44:36,331][model8_pretrain.py][INFO] Epoch:[0/2](661300/4588595) loss:2.839 lr:0.0000100 epoch_Time:24972.0min: [2024-01-05 15:44:36,331][model8_pretrain.py][INFO] Epoch:[0/2](661300/4588595) loss:3.311 lr:0.0000100 epoch_Time:24972.0min: [2024-01-05 15:44:36,331][model8_pretrain.py][INFO] Epoch:[0/2](661300/4588595) loss:2.521 lr:0.0000100 epoch_Time:24972.0min: [2024-01-05 15:44:36,331][model8_pretrain.py][INFO] Epoch:[0/2](661300/4588595) loss:2.977 lr:0.0000100 epoch_Time:24972.0min: [2024-01-05 15:44:36,331][model8_pretrain.py][INFO] Epoch:[0/2](661300/4588595) loss:2.827 lr:0.0000100 epoch_Time:24972.0min: [2024-01-05 15:45:13,248][model8_pretrain.py][INFO] Epoch:[0/2](661400/4588595) loss:2.717 lr:0.0000100 epoch_Time:24970.0min: [2024-01-05 15:45:13,248][model8_pretrain.py][INFO] Epoch:[0/2](661400/4588595) loss:3.031 lr:0.0000100 epoch_Time:24970.0min: [2024-01-05 15:45:13,249][model8_pretrain.py][INFO] Epoch:[0/2](661400/4588595) loss:2.780 lr:0.0000100 epoch_Time:24970.0min: [2024-01-05 15:45:13,248][model8_pretrain.py][INFO] Epoch:[0/2](661400/4588595) loss:3.065 lr:0.0000100 epoch_Time:24970.0min: [2024-01-05 15:45:13,249][model8_pretrain.py][INFO] Epoch:[0/2](661400/4588595) loss:3.072 lr:0.0000100 
epoch_Time:24970.0min: [2024-01-05 15:45:13,249][model8_pretrain.py][INFO] Epoch:[0/2](661400/4588595) loss:2.601 lr:0.0000100 epoch_Time:24970.0min: [2024-01-05 15:45:13,249][model8_pretrain.py][INFO] Epoch:[0/2](661400/4588595) loss:3.319 lr:0.0000100 epoch_Time:24970.0min: [2024-01-05 15:45:13,249][model8_pretrain.py][INFO] Epoch:[0/2](661400/4588595) loss:2.819 lr:0.0000100 epoch_Time:24970.0min: [2024-01-05 15:45:50,190][model8_pretrain.py][INFO] Epoch:[0/2](661500/4588595) loss:2.895 lr:0.0000100 epoch_Time:24969.0min: [2024-01-05 15:45:50,190][model8_pretrain.py][INFO] Epoch:[0/2](661500/4588595) loss:2.603 lr:0.0000100 epoch_Time:24969.0min: [2024-01-05 15:45:50,190][model8_pretrain.py][INFO] Epoch:[0/2](661500/4588595) loss:2.935 lr:0.0000100 epoch_Time:24969.0min: [2024-01-05 15:45:50,190][model8_pretrain.py][INFO] Epoch:[0/2](661500/4588595) loss:3.181 lr:0.0000100 epoch_Time:24969.0min: [2024-01-05 15:45:50,190][model8_pretrain.py][INFO] Epoch:[0/2](661500/4588595) loss:2.655 lr:0.0000100 epoch_Time:24969.0min: [2024-01-05 15:45:50,191][model8_pretrain.py][INFO] Epoch:[0/2](661500/4588595) loss:2.538 lr:0.0000100 epoch_Time:24969.0min: [2024-01-05 15:45:50,191][model8_pretrain.py][INFO] Epoch:[0/2](661500/4588595) loss:2.958 lr:0.0000100 epoch_Time:24969.0min: [2024-01-05 15:45:50,191][model8_pretrain.py][INFO] Epoch:[0/2](661500/4588595) loss:2.726 lr:0.0000100 epoch_Time:24969.0min: [2024-01-05 15:46:27,121][model8_pretrain.py][INFO] Epoch:[0/2](661600/4588595) loss:2.245 lr:0.0000100 epoch_Time:24969.0min: [2024-01-05 15:46:27,121][model8_pretrain.py][INFO] Epoch:[0/2](661600/4588595) loss:2.972 lr:0.0000100 epoch_Time:24969.0min: [2024-01-05 15:46:27,121][model8_pretrain.py][INFO] Epoch:[0/2](661600/4588595) loss:2.737 lr:0.0000100 epoch_Time:24969.0min: [2024-01-05 15:46:27,121][model8_pretrain.py][INFO] Epoch:[0/2](661600/4588595) loss:3.226 lr:0.0000100 epoch_Time:24969.0min: [2024-01-05 15:46:27,121][model8_pretrain.py][INFO] 
Epoch:[0/2](661600/4588595) loss:2.791 lr:0.0000100 epoch_Time:24969.0min: [2024-01-05 15:46:27,121][model8_pretrain.py][INFO] Epoch:[0/2](661600/4588595) loss:2.818 lr:0.0000100 epoch_Time:24969.0min: [2024-01-05 15:46:27,121][model8_pretrain.py][INFO] Epoch:[0/2](661600/4588595) loss:3.342 lr:0.0000100 epoch_Time:24969.0min: [2024-01-05 15:46:27,121][model8_pretrain.py][INFO] Epoch:[0/2](661600/4588595) loss:3.108 lr:0.0000100 epoch_Time:24969.0min: [2024-01-05 15:47:04,061][model8_pretrain.py][INFO] Epoch:[0/2](661700/4588595) loss:2.540 lr:0.0000100 epoch_Time:24968.0min: [2024-01-05 15:47:04,061][model8_pretrain.py][INFO] Epoch:[0/2](661700/4588595) loss:2.632 lr:0.0000100 epoch_Time:24968.0min: [2024-01-05 15:47:04,061][model8_pretrain.py][INFO] Epoch:[0/2](661700/4588595) loss:2.484 lr:0.0000100 epoch_Time:24968.0min: [2024-01-05 15:47:04,061][model8_pretrain.py][INFO] Epoch:[0/2](661700/4588595) loss:2.616 lr:0.0000100 epoch_Time:24968.0min: [2024-01-05 15:47:04,061][model8_pretrain.py][INFO] Epoch:[0/2](661700/4588595) loss:3.105 lr:0.0000100 epoch_Time:24968.0min: [2024-01-05 15:47:04,061][model8_pretrain.py][INFO] Epoch:[0/2](661700/4588595) loss:2.350 lr:0.0000100 epoch_Time:24968.0min: [2024-01-05 15:47:04,061][model8_pretrain.py][INFO] Epoch:[0/2](661700/4588595) loss:2.771 lr:0.0000100 epoch_Time:24968.0min: [2024-01-05 15:47:04,061][model8_pretrain.py][INFO] Epoch:[0/2](661700/4588595) loss:3.112 lr:0.0000100 epoch_Time:24968.0min: [2024-01-05 15:47:40,985][model8_pretrain.py][INFO] Epoch:[0/2](661800/4588595) loss:2.890 lr:0.0000100 epoch_Time:24968.0min: [2024-01-05 15:47:40,985][model8_pretrain.py][INFO] Epoch:[0/2](661800/4588595) loss:2.298 lr:0.0000100 epoch_Time:24968.0min: [2024-01-05 15:47:40,985][model8_pretrain.py][INFO] Epoch:[0/2](661800/4588595) loss:2.316 lr:0.0000100 epoch_Time:24968.0min: [2024-01-05 15:47:40,985][model8_pretrain.py][INFO] Epoch:[0/2](661800/4588595) loss:3.051 lr:0.0000100 epoch_Time:24968.0min: [2024-01-05 
15:47:40,985][model8_pretrain.py][INFO] Epoch:[0/2](661800/4588595) loss:2.828 lr:0.0000100 epoch_Time:24968.0min: [2024-01-05 15:47:40,985][model8_pretrain.py][INFO] Epoch:[0/2](661800/4588595) loss:2.361 lr:0.0000100 epoch_Time:24968.0min: [2024-01-05 15:47:40,985][model8_pretrain.py][INFO] Epoch:[0/2](661800/4588595) loss:2.570 lr:0.0000100 epoch_Time:24968.0min: [2024-01-05 15:47:40,985][model8_pretrain.py][INFO] Epoch:[0/2](661800/4588595) loss:2.895 lr:0.0000100 epoch_Time:24968.0min: [2024-01-05 15:48:17,909][model8_pretrain.py][INFO] Epoch:[0/2](661900/4588595) loss:2.939 lr:0.0000100 epoch_Time:24967.0min: [2024-01-05 15:48:17,909][model8_pretrain.py][INFO] Epoch:[0/2](661900/4588595) loss:2.890 lr:0.0000100 epoch_Time:24967.0min: [2024-01-05 15:48:17,909][model8_pretrain.py][INFO] Epoch:[0/2](661900/4588595) loss:3.243 lr:0.0000100 epoch_Time:24967.0min: [2024-01-05 15:48:17,909][model8_pretrain.py][INFO] Epoch:[0/2](661900/4588595) loss:2.791 lr:0.0000100 epoch_Time:24967.0min: [2024-01-05 15:48:17,909][model8_pretrain.py][INFO] Epoch:[0/2](661900/4588595) loss:2.855 lr:0.0000100 epoch_Time:24967.0min: [2024-01-05 15:48:17,909][model8_pretrain.py][INFO] Epoch:[0/2](661900/4588595) loss:2.880 lr:0.0000100 epoch_Time:24967.0min: [2024-01-05 15:48:17,909][model8_pretrain.py][INFO] Epoch:[0/2](661900/4588595) loss:2.863 lr:0.0000100 epoch_Time:24967.0min: [2024-01-05 15:48:17,909][model8_pretrain.py][INFO] Epoch:[0/2](661900/4588595) loss:3.262 lr:0.0000100 epoch_Time:24967.0min: [2024-01-05 15:48:54,837][model8_pretrain.py][INFO] Epoch:[0/2](662000/4588595) loss:2.921 lr:0.0000100 epoch_Time:24966.0min: [2024-01-05 15:48:54,837][model8_pretrain.py][INFO] Epoch:[0/2](662000/4588595) loss:2.843 lr:0.0000100 epoch_Time:24966.0min: [2024-01-05 15:48:54,838][model8_pretrain.py][INFO] Epoch:[0/2](662000/4588595) loss:2.590 lr:0.0000100 epoch_Time:24966.0min: [2024-01-05 15:48:54,838][model8_pretrain.py][INFO] Epoch:[0/2](662000/4588595) loss:3.068 lr:0.0000100 
epoch_Time:24966.0min: [2024-01-05 15:48:54,838][model8_pretrain.py][INFO] Epoch:[0/2](662000/4588595) loss:2.358 lr:0.0000100 epoch_Time:24966.0min: [2024-01-05 15:48:54,838][model8_pretrain.py][INFO] Epoch:[0/2](662000/4588595) loss:2.614 lr:0.0000100 epoch_Time:24966.0min: [2024-01-05 15:48:54,838][model8_pretrain.py][INFO] Epoch:[0/2](662000/4588595) loss:2.986 lr:0.0000100 epoch_Time:24966.0min: [2024-01-05 15:48:54,838][model8_pretrain.py][INFO] Epoch:[0/2](662000/4588595) loss:2.878 lr:0.0000100 epoch_Time:24966.0min: [2024-01-05 15:49:43,953][model8_pretrain.py][INFO] Epoch:[0/2](662100/4588595) loss:2.634 lr:0.0000100 epoch_Time:24967.0min: [2024-01-05 15:49:43,953][model8_pretrain.py][INFO] Epoch:[0/2](662100/4588595) loss:3.163 lr:0.0000100 epoch_Time:24967.0min: [2024-01-05 15:49:43,953][model8_pretrain.py][INFO] Epoch:[0/2](662100/4588595) loss:2.498 lr:0.0000100 epoch_Time:24967.0min: [2024-01-05 15:49:43,953][model8_pretrain.py][INFO] Epoch:[0/2](662100/4588595) loss:3.218 lr:0.0000100 epoch_Time:24967.0min: [2024-01-05 15:49:43,953][model8_pretrain.py][INFO] Epoch:[0/2](662100/4588595) loss:2.499 lr:0.0000100 epoch_Time:24967.0min: [2024-01-05 15:49:43,953][model8_pretrain.py][INFO] Epoch:[0/2](662100/4588595) loss:2.307 lr:0.0000100 epoch_Time:24967.0min: [2024-01-05 15:49:43,953][model8_pretrain.py][INFO] Epoch:[0/2](662100/4588595) loss:3.192 lr:0.0000100 epoch_Time:24967.0min: [2024-01-05 15:49:43,953][model8_pretrain.py][INFO] Epoch:[0/2](662100/4588595) loss:2.760 lr:0.0000100 epoch_Time:24967.0min: [2024-01-05 15:50:20,890][model8_pretrain.py][INFO] Epoch:[0/2](662200/4588595) loss:2.854 lr:0.0000100 epoch_Time:24966.0min: [2024-01-05 15:50:20,890][model8_pretrain.py][INFO] Epoch:[0/2](662200/4588595) loss:2.561 lr:0.0000100 epoch_Time:24966.0min: [2024-01-05 15:50:20,890][model8_pretrain.py][INFO] Epoch:[0/2](662200/4588595) loss:2.816 lr:0.0000100 epoch_Time:24966.0min: [2024-01-05 15:50:20,890][model8_pretrain.py][INFO] 
Epoch:[0/2](662200/4588595) loss:3.020 lr:0.0000100 epoch_Time:24966.0min: [2024-01-05 15:50:20,890][model8_pretrain.py][INFO] Epoch:[0/2](662200/4588595) loss:2.991 lr:0.0000100 epoch_Time:24966.0min: [2024-01-05 15:50:20,890][model8_pretrain.py][INFO] Epoch:[0/2](662200/4588595) loss:2.664 lr:0.0000100 epoch_Time:24966.0min: [2024-01-05 15:50:20,890][model8_pretrain.py][INFO] Epoch:[0/2](662200/4588595) loss:3.045 lr:0.0000100 epoch_Time:24966.0min: [2024-01-05 15:50:20,890][model8_pretrain.py][INFO] Epoch:[0/2](662200/4588595) loss:2.796 lr:0.0000100 epoch_Time:24966.0min: [2024-01-05 15:50:57,830][model8_pretrain.py][INFO] Epoch:[0/2](662300/4588595) loss:2.468 lr:0.0000100 epoch_Time:24965.0min: [2024-01-05 15:50:57,830][model8_pretrain.py][INFO] Epoch:[0/2](662300/4588595) loss:3.320 lr:0.0000100 epoch_Time:24965.0min: [2024-01-05 15:50:57,830][model8_pretrain.py][INFO] Epoch:[0/2](662300/4588595) loss:2.861 lr:0.0000100 epoch_Time:24965.0min: [2024-01-05 15:50:57,830][model8_pretrain.py][INFO] Epoch:[0/2](662300/4588595) loss:2.973 lr:0.0000100 epoch_Time:24965.0min: [2024-01-05 15:50:57,830][model8_pretrain.py][INFO] Epoch:[0/2](662300/4588595) loss:2.761 lr:0.0000100 epoch_Time:24965.0min: [2024-01-05 15:50:57,830][model8_pretrain.py][INFO] Epoch:[0/2](662300/4588595) loss:2.738 lr:0.0000100 epoch_Time:24965.0min: [2024-01-05 15:50:57,830][model8_pretrain.py][INFO] Epoch:[0/2](662300/4588595) loss:2.811 lr:0.0000100 epoch_Time:24965.0min: [2024-01-05 15:50:57,830][model8_pretrain.py][INFO] Epoch:[0/2](662300/4588595) loss:3.157 lr:0.0000100 epoch_Time:24965.0min: [2024-01-05 15:51:34,767][model8_pretrain.py][INFO] Epoch:[0/2](662400/4588595) loss:2.341 lr:0.0000100 epoch_Time:24964.0min: [2024-01-05 15:51:34,767][model8_pretrain.py][INFO] Epoch:[0/2](662400/4588595) loss:2.450 lr:0.0000100 epoch_Time:24964.0min: [2024-01-05 15:51:34,767][model8_pretrain.py][INFO] Epoch:[0/2](662400/4588595) loss:2.630 lr:0.0000100 epoch_Time:24964.0min: [2024-01-05 
15:51:34,767][model8_pretrain.py][INFO] Epoch:[0/2](662400/4588595) loss:2.963 lr:0.0000100 epoch_Time:24964.0min: [2024-01-05 15:51:34,768][model8_pretrain.py][INFO] Epoch:[0/2](662400/4588595) loss:2.869 lr:0.0000100 epoch_Time:24964.0min: [2024-01-05 15:51:34,768][model8_pretrain.py][INFO] Epoch:[0/2](662400/4588595) loss:3.187 lr:0.0000100 epoch_Time:24964.0min: [2024-01-05 15:51:34,768][model8_pretrain.py][INFO] Epoch:[0/2](662400/4588595) loss:2.800 lr:0.0000100 epoch_Time:24964.0min: [2024-01-05 15:51:34,768][model8_pretrain.py][INFO] Epoch:[0/2](662400/4588595) loss:2.875 lr:0.0000100 epoch_Time:24964.0min: [2024-01-05 15:52:11,707][model8_pretrain.py][INFO] Epoch:[0/2](662500/4588595) loss:2.862 lr:0.0000100 epoch_Time:24963.0min: [2024-01-05 15:52:11,707][model8_pretrain.py][INFO] Epoch:[0/2](662500/4588595) loss:2.262 lr:0.0000100 epoch_Time:24963.0min: [2024-01-05 15:52:11,707][model8_pretrain.py][INFO] Epoch:[0/2](662500/4588595) loss:2.683 lr:0.0000100 epoch_Time:24963.0min: [2024-01-05 15:52:11,707][model8_pretrain.py][INFO] Epoch:[0/2](662500/4588595) loss:2.707 lr:0.0000100 epoch_Time:24963.0min: [2024-01-05 15:52:11,707][model8_pretrain.py][INFO] Epoch:[0/2](662500/4588595) loss:2.685 lr:0.0000100 epoch_Time:24963.0min: [2024-01-05 15:52:11,707][model8_pretrain.py][INFO] Epoch:[0/2](662500/4588595) loss:2.610 lr:0.0000100 epoch_Time:24963.0min: [2024-01-05 15:52:11,707][model8_pretrain.py][INFO] Epoch:[0/2](662500/4588595) loss:2.855 lr:0.0000100 epoch_Time:24963.0min: [2024-01-05 15:52:11,707][model8_pretrain.py][INFO] Epoch:[0/2](662500/4588595) loss:2.972 lr:0.0000100 epoch_Time:24963.0min: [2024-01-05 15:52:48,634][model8_pretrain.py][INFO] Epoch:[0/2](662600/4588595) loss:2.608 lr:0.0000100 epoch_Time:24962.0min: [2024-01-05 15:52:48,634][model8_pretrain.py][INFO] Epoch:[0/2](662600/4588595) loss:3.043 lr:0.0000100 epoch_Time:24962.0min: [2024-01-05 15:52:48,634][model8_pretrain.py][INFO] Epoch:[0/2](662600/4588595) loss:2.956 lr:0.0000100 
epoch_Time:24962.0min: [2024-01-05 15:52:48,634][model8_pretrain.py][INFO] Epoch:[0/2](662600/4588595) loss:2.850 lr:0.0000100 epoch_Time:24962.0min: [2024-01-05 15:52:48,634][model8_pretrain.py][INFO] Epoch:[0/2](662600/4588595) loss:3.254 lr:0.0000100 epoch_Time:24962.0min: [2024-01-05 15:52:48,634][model8_pretrain.py][INFO] Epoch:[0/2](662600/4588595) loss:2.665 lr:0.0000100 epoch_Time:24962.0min: [2024-01-05 15:52:48,634][model8_pretrain.py][INFO] Epoch:[0/2](662600/4588595) loss:2.943 lr:0.0000100 epoch_Time:24962.0min: [2024-01-05 15:52:48,634][model8_pretrain.py][INFO] Epoch:[0/2](662600/4588595) loss:2.552 lr:0.0000100 epoch_Time:24962.0min: [2024-01-05 15:53:25,586][model8_pretrain.py][INFO] Epoch:[0/2](662700/4588595) loss:2.881 lr:0.0000100 epoch_Time:24962.0min: [2024-01-05 15:53:25,586][model8_pretrain.py][INFO] Epoch:[0/2](662700/4588595) loss:2.743 lr:0.0000100 epoch_Time:24962.0min: [2024-01-05 15:53:25,586][model8_pretrain.py][INFO] Epoch:[0/2](662700/4588595) loss:2.579 lr:0.0000100 epoch_Time:24962.0min: [2024-01-05 15:53:25,586][model8_pretrain.py][INFO] Epoch:[0/2](662700/4588595) loss:3.264 lr:0.0000100 epoch_Time:24962.0min: [2024-01-05 15:53:25,586][model8_pretrain.py][INFO] Epoch:[0/2](662700/4588595) loss:3.247 lr:0.0000100 epoch_Time:24962.0min: [2024-01-05 15:53:25,586][model8_pretrain.py][INFO] Epoch:[0/2](662700/4588595) loss:3.152 lr:0.0000100 epoch_Time:24962.0min: [2024-01-05 15:53:25,586][model8_pretrain.py][INFO] Epoch:[0/2](662700/4588595) loss:2.725 lr:0.0000100 epoch_Time:24962.0min: [2024-01-05 15:53:25,587][model8_pretrain.py][INFO] Epoch:[0/2](662700/4588595) loss:3.270 lr:0.0000100 epoch_Time:24962.0min: [2024-01-05 15:54:02,519][model8_pretrain.py][INFO] Epoch:[0/2](662800/4588595) loss:2.695 lr:0.0000100 epoch_Time:24961.0min: [2024-01-05 15:54:02,519][model8_pretrain.py][INFO] Epoch:[0/2](662800/4588595) loss:2.116 lr:0.0000100 epoch_Time:24961.0min: [2024-01-05 15:54:02,519][model8_pretrain.py][INFO] 
Epoch:[0/2](662800/4588595) loss:3.298 lr:0.0000100 epoch_Time:24961.0min: [2024-01-05 15:54:02,519][model8_pretrain.py][INFO] Epoch:[0/2](662800/4588595) loss:2.726 lr:0.0000100 epoch_Time:24961.0min: [2024-01-05 15:54:02,519][model8_pretrain.py][INFO] Epoch:[0/2](662800/4588595) loss:2.312 lr:0.0000100 epoch_Time:24961.0min: [2024-01-05 15:54:02,519][model8_pretrain.py][INFO] Epoch:[0/2](662800/4588595) loss:2.946 lr:0.0000100 epoch_Time:24961.0min: [2024-01-05 15:54:02,519][model8_pretrain.py][INFO] Epoch:[0/2](662800/4588595) loss:2.647 lr:0.0000100 epoch_Time:24961.0min: [2024-01-05 15:54:02,519][model8_pretrain.py][INFO] Epoch:[0/2](662800/4588595) loss:2.887 lr:0.0000100 epoch_Time:24961.0min: [2024-01-05 15:54:51,704][model8_pretrain.py][INFO] Epoch:[0/2](662900/4588595) loss:2.864 lr:0.0000100 epoch_Time:24961.0min: [2024-01-05 15:54:51,704][model8_pretrain.py][INFO] Epoch:[0/2](662900/4588595) loss:2.668 lr:0.0000100 epoch_Time:24961.0min: [2024-01-05 15:54:51,704][model8_pretrain.py][INFO] Epoch:[0/2](662900/4588595) loss:2.827 lr:0.0000100 epoch_Time:24961.0min: [2024-01-05 15:54:51,704][model8_pretrain.py][INFO] Epoch:[0/2](662900/4588595) loss:2.670 lr:0.0000100 epoch_Time:24961.0min: [2024-01-05 15:54:51,704][model8_pretrain.py][INFO] Epoch:[0/2](662900/4588595) loss:2.805 lr:0.0000100 epoch_Time:24961.0min: [2024-01-05 15:54:51,704][model8_pretrain.py][INFO] Epoch:[0/2](662900/4588595) loss:1.853 lr:0.0000100 epoch_Time:24961.0min: [2024-01-05 15:54:51,704][model8_pretrain.py][INFO] Epoch:[0/2](662900/4588595) loss:2.128 lr:0.0000100 epoch_Time:24961.0min: [2024-01-05 15:54:51,704][model8_pretrain.py][INFO] Epoch:[0/2](662900/4588595) loss:2.772 lr:0.0000100 epoch_Time:24961.0min: [2024-01-05 15:55:28,651][model8_pretrain.py][INFO] Epoch:[0/2](663000/4588595) loss:2.616 lr:0.0000100 epoch_Time:24961.0min: [2024-01-05 15:55:28,651][model8_pretrain.py][INFO] Epoch:[0/2](663000/4588595) loss:2.494 lr:0.0000100 epoch_Time:24961.0min: [2024-01-05 
15:55:28,651][model8_pretrain.py][INFO] Epoch:[0/2](663000/4588595) loss:2.816 lr:0.0000100 epoch_Time:24961.0min: [2024-01-05 15:55:28,651][model8_pretrain.py][INFO] Epoch:[0/2](663000/4588595) loss:3.235 lr:0.0000100 epoch_Time:24961.0min: [2024-01-05 15:55:28,651][model8_pretrain.py][INFO] Epoch:[0/2](663000/4588595) loss:2.607 lr:0.0000100 epoch_Time:24961.0min: [2024-01-05 15:55:28,651][model8_pretrain.py][INFO] Epoch:[0/2](663000/4588595) loss:2.327 lr:0.0000100 epoch_Time:24961.0min: [2024-01-05 15:55:28,651][model8_pretrain.py][INFO] Epoch:[0/2](663000/4588595) loss:2.632 lr:0.0000100 epoch_Time:24961.0min: [2024-01-05 15:55:28,651][model8_pretrain.py][INFO] Epoch:[0/2](663000/4588595) loss:2.915 lr:0.0000100 epoch_Time:24961.0min: [2024-01-05 15:56:05,596][model8_pretrain.py][INFO] Epoch:[0/2](663100/4588595) loss:3.092 lr:0.0000100 epoch_Time:24960.0min: [2024-01-05 15:56:05,596][model8_pretrain.py][INFO] Epoch:[0/2](663100/4588595) loss:2.591 lr:0.0000100 epoch_Time:24960.0min: [2024-01-05 15:56:05,596][model8_pretrain.py][INFO] Epoch:[0/2](663100/4588595) loss:3.004 lr:0.0000100 epoch_Time:24960.0min: [2024-01-05 15:56:05,596][model8_pretrain.py][INFO] Epoch:[0/2](663100/4588595) loss:3.010 lr:0.0000100 epoch_Time:24960.0min: [2024-01-05 15:56:05,596][model8_pretrain.py][INFO] Epoch:[0/2](663100/4588595) loss:3.161 lr:0.0000100 epoch_Time:24960.0min: [2024-01-05 15:56:05,596][model8_pretrain.py][INFO] Epoch:[0/2](663100/4588595) loss:2.811 lr:0.0000100 epoch_Time:24960.0min: [2024-01-05 15:56:05,596][model8_pretrain.py][INFO] Epoch:[0/2](663100/4588595) loss:3.048 lr:0.0000100 epoch_Time:24960.0min: [2024-01-05 15:56:05,597][model8_pretrain.py][INFO] Epoch:[0/2](663100/4588595) loss:2.781 lr:0.0000100 epoch_Time:24960.0min: [2024-01-05 15:56:42,541][model8_pretrain.py][INFO] Epoch:[0/2](663200/4588595) loss:3.130 lr:0.0000100 epoch_Time:24960.0min: [2024-01-05 15:56:42,541][model8_pretrain.py][INFO] Epoch:[0/2](663200/4588595) loss:2.858 lr:0.0000100 
epoch_Time:24960.0min: [2024-01-05 15:56:42,541][model8_pretrain.py][INFO] Epoch:[0/2](663200/4588595) loss:2.735 lr:0.0000100 epoch_Time:24960.0min: [2024-01-05 15:56:42,541][model8_pretrain.py][INFO] Epoch:[0/2](663200/4588595) loss:2.704 lr:0.0000100 epoch_Time:24960.0min: [2024-01-05 15:56:42,541][model8_pretrain.py][INFO] Epoch:[0/2](663200/4588595) loss:3.340 lr:0.0000100 epoch_Time:24960.0min: [2024-01-05 15:56:42,541][model8_pretrain.py][INFO] Epoch:[0/2](663200/4588595) loss:2.731 lr:0.0000100 epoch_Time:24960.0min: [2024-01-05 15:56:42,541][model8_pretrain.py][INFO] Epoch:[0/2](663200/4588595) loss:2.603 lr:0.0000100 epoch_Time:24960.0min: [2024-01-05 15:56:42,541][model8_pretrain.py][INFO] Epoch:[0/2](663200/4588595) loss:2.502 lr:0.0000100 epoch_Time:24960.0min: [2024-01-05 15:57:19,490][model8_pretrain.py][INFO] Epoch:[0/2](663300/4588595) loss:3.420 lr:0.0000100 epoch_Time:24959.0min: [2024-01-05 15:57:19,490][model8_pretrain.py][INFO] Epoch:[0/2](663300/4588595) loss:2.213 lr:0.0000100 epoch_Time:24959.0min: [2024-01-05 15:57:19,490][model8_pretrain.py][INFO] Epoch:[0/2](663300/4588595) loss:2.948 lr:0.0000100 epoch_Time:24959.0min: [2024-01-05 15:57:19,490][model8_pretrain.py][INFO] Epoch:[0/2](663300/4588595) loss:2.604 lr:0.0000100 epoch_Time:24959.0min: [2024-01-05 15:57:19,490][model8_pretrain.py][INFO] Epoch:[0/2](663300/4588595) loss:3.302 lr:0.0000100 epoch_Time:24959.0min: [2024-01-05 15:57:19,490][model8_pretrain.py][INFO] Epoch:[0/2](663300/4588595) loss:2.869 lr:0.0000100 epoch_Time:24959.0min: [2024-01-05 15:57:19,490][model8_pretrain.py][INFO] Epoch:[0/2](663300/4588595) loss:2.525 lr:0.0000100 epoch_Time:24959.0min: [2024-01-05 15:57:19,490][model8_pretrain.py][INFO] Epoch:[0/2](663300/4588595) loss:1.990 lr:0.0000100 epoch_Time:24959.0min: [2024-01-05 15:57:56,433][model8_pretrain.py][INFO] Epoch:[0/2](663400/4588595) loss:2.792 lr:0.0000100 epoch_Time:24957.0min: [2024-01-05 15:57:56,433][model8_pretrain.py][INFO] 
Epoch:[0/2](663400/4588595) loss:2.747 lr:0.0000100 epoch_Time:24957.0min: [2024-01-05 15:57:56,433][model8_pretrain.py][INFO] Epoch:[0/2](663400/4588595) loss:2.997 lr:0.0000100 epoch_Time:24957.0min: [2024-01-05 15:57:56,433][model8_pretrain.py][INFO] Epoch:[0/2](663400/4588595) loss:3.071 lr:0.0000100 epoch_Time:24957.0min: [2024-01-05 15:57:56,433][model8_pretrain.py][INFO] Epoch:[0/2](663400/4588595) loss:2.458 lr:0.0000100 epoch_Time:24957.0min: [2024-01-05 15:57:56,433][model8_pretrain.py][INFO] Epoch:[0/2](663400/4588595) loss:3.011 lr:0.0000100 epoch_Time:24957.0min: [2024-01-05 15:57:56,433][model8_pretrain.py][INFO] Epoch:[0/2](663400/4588595) loss:2.671 lr:0.0000100 epoch_Time:24957.0min: [2024-01-05 15:57:56,433][model8_pretrain.py][INFO] Epoch:[0/2](663400/4588595) loss:2.586 lr:0.0000100 epoch_Time:24957.0min: [2024-01-05 15:58:33,374][model8_pretrain.py][INFO] Epoch:[0/2](663500/4588595) loss:2.827 lr:0.0000100 epoch_Time:24957.0min: [2024-01-05 15:58:33,374][model8_pretrain.py][INFO] Epoch:[0/2](663500/4588595) loss:2.484 lr:0.0000100 epoch_Time:24957.0min: [2024-01-05 15:58:33,374][model8_pretrain.py][INFO] Epoch:[0/2](663500/4588595) loss:2.676 lr:0.0000100 epoch_Time:24957.0min: [2024-01-05 15:58:33,374][model8_pretrain.py][INFO] Epoch:[0/2](663500/4588595) loss:3.366 lr:0.0000100 epoch_Time:24957.0min: [2024-01-05 15:58:33,374][model8_pretrain.py][INFO] Epoch:[0/2](663500/4588595) loss:2.703 lr:0.0000100 epoch_Time:24957.0min: [2024-01-05 15:58:33,374][model8_pretrain.py][INFO] Epoch:[0/2](663500/4588595) loss:2.339 lr:0.0000100 epoch_Time:24957.0min: [2024-01-05 15:58:33,374][model8_pretrain.py][INFO] Epoch:[0/2](663500/4588595) loss:2.764 lr:0.0000100 epoch_Time:24957.0min: [2024-01-05 15:58:33,374][model8_pretrain.py][INFO] Epoch:[0/2](663500/4588595) loss:2.743 lr:0.0000100 epoch_Time:24957.0min: [2024-01-05 15:59:10,323][model8_pretrain.py][INFO] Epoch:[0/2](663600/4588595) loss:2.969 lr:0.0000100 epoch_Time:24956.0min: [2024-01-05 
15:59:10,323][model8_pretrain.py][INFO] Epoch:[0/2](663600/4588595) loss:2.768 lr:0.0000100 epoch_Time:24956.0min: [2024-01-05 15:59:10,323][model8_pretrain.py][INFO] Epoch:[0/2](663600/4588595) loss:2.847 lr:0.0000100 epoch_Time:24956.0min: [2024-01-05 15:59:10,323][model8_pretrain.py][INFO] Epoch:[0/2](663600/4588595) loss:2.084 lr:0.0000100 epoch_Time:24956.0min: [2024-01-05 15:59:10,323][model8_pretrain.py][INFO] Epoch:[0/2](663600/4588595) loss:2.749 lr:0.0000100 epoch_Time:24956.0min: [2024-01-05 15:59:10,323][model8_pretrain.py][INFO] Epoch:[0/2](663600/4588595) loss:2.755 lr:0.0000100 epoch_Time:24956.0min: [2024-01-05 15:59:10,323][model8_pretrain.py][INFO] Epoch:[0/2](663600/4588595) loss:2.755 lr:0.0000100 epoch_Time:24956.0min: [2024-01-05 15:59:10,323][model8_pretrain.py][INFO] Epoch:[0/2](663600/4588595) loss:2.572 lr:0.0000100 epoch_Time:24956.0min: [2024-01-05 15:59:59,218][model8_pretrain.py][INFO] Epoch:[0/2](663700/4588595) loss:3.559 lr:0.0000100 epoch_Time:24956.0min: [2024-01-05 15:59:59,218][model8_pretrain.py][INFO] Epoch:[0/2](663700/4588595) loss:2.574 lr:0.0000100 epoch_Time:24956.0min: [2024-01-05 15:59:59,218][model8_pretrain.py][INFO] Epoch:[0/2](663700/4588595) loss:2.594 lr:0.0000100 epoch_Time:24956.0min: [2024-01-05 15:59:59,218][model8_pretrain.py][INFO] Epoch:[0/2](663700/4588595) loss:2.945 lr:0.0000100 epoch_Time:24956.0min: [2024-01-05 15:59:59,218][model8_pretrain.py][INFO] Epoch:[0/2](663700/4588595) loss:3.035 lr:0.0000100 epoch_Time:24956.0min: [2024-01-05 15:59:59,218][model8_pretrain.py][INFO] Epoch:[0/2](663700/4588595) loss:2.917 lr:0.0000100 epoch_Time:24956.0min: [2024-01-05 15:59:59,218][model8_pretrain.py][INFO] Epoch:[0/2](663700/4588595) loss:2.858 lr:0.0000100 epoch_Time:24956.0min: [2024-01-05 15:59:59,219][model8_pretrain.py][INFO] Epoch:[0/2](663700/4588595) loss:3.366 lr:0.0000100 epoch_Time:24956.0min: [2024-01-05 16:00:36,154][model8_pretrain.py][INFO] Epoch:[0/2](663800/4588595) loss:3.066 lr:0.0000100 
epoch_Time:24956.0min: [2024-01-05 16:00:36,154][model8_pretrain.py][INFO] Epoch:[0/2](663800/4588595) loss:3.318 lr:0.0000100 epoch_Time:24956.0min: [2024-01-05 16:00:36,154][model8_pretrain.py][INFO] Epoch:[0/2](663800/4588595) loss:3.007 lr:0.0000100 epoch_Time:24956.0min: [2024-01-05 16:00:36,154][model8_pretrain.py][INFO] Epoch:[0/2](663800/4588595) loss:2.406 lr:0.0000100 epoch_Time:24956.0min: [2024-01-05 16:00:36,154][model8_pretrain.py][INFO] Epoch:[0/2](663800/4588595) loss:2.464 lr:0.0000100 epoch_Time:24956.0min: [2024-01-05 16:00:36,154][model8_pretrain.py][INFO] Epoch:[0/2](663800/4588595) loss:2.332 lr:0.0000100 epoch_Time:24956.0min: [2024-01-05 16:00:36,154][model8_pretrain.py][INFO] Epoch:[0/2](663800/4588595) loss:2.239 lr:0.0000100 epoch_Time:24956.0min: [2024-01-05 16:00:36,154][model8_pretrain.py][INFO] Epoch:[0/2](663800/4588595) loss:2.009 lr:0.0000100 epoch_Time:24956.0min: [2024-01-05 16:01:13,096][model8_pretrain.py][INFO] Epoch:[0/2](663900/4588595) loss:2.865 lr:0.0000100 epoch_Time:24955.0min: [2024-01-05 16:01:13,096][model8_pretrain.py][INFO] Epoch:[0/2](663900/4588595) loss:2.809 lr:0.0000100 epoch_Time:24955.0min: [2024-01-05 16:01:13,096][model8_pretrain.py][INFO] Epoch:[0/2](663900/4588595) loss:2.672 lr:0.0000100 epoch_Time:24955.0min: [2024-01-05 16:01:13,096][model8_pretrain.py][INFO] Epoch:[0/2](663900/4588595) loss:2.676 lr:0.0000100 epoch_Time:24955.0min: [2024-01-05 16:01:13,096][model8_pretrain.py][INFO] Epoch:[0/2](663900/4588595) loss:3.184 lr:0.0000100 epoch_Time:24955.0min: [2024-01-05 16:01:13,096][model8_pretrain.py][INFO] Epoch:[0/2](663900/4588595) loss:2.648 lr:0.0000100 epoch_Time:24955.0min: [2024-01-05 16:01:13,096][model8_pretrain.py][INFO] Epoch:[0/2](663900/4588595) loss:2.485 lr:0.0000100 epoch_Time:24955.0min: [2024-01-05 16:01:13,096][model8_pretrain.py][INFO] Epoch:[0/2](663900/4588595) loss:2.866 lr:0.0000100 epoch_Time:24955.0min: [2024-01-05 16:01:50,015][model8_pretrain.py][INFO] 
Epoch:[0/2](664000/4588595) loss:2.872 lr:0.0000100 epoch_Time:24954.0min: [2024-01-05 16:01:50,015][model8_pretrain.py][INFO] Epoch:[0/2](664000/4588595) loss:2.706 lr:0.0000100 epoch_Time:24954.0min: [2024-01-05 16:01:50,015][model8_pretrain.py][INFO] Epoch:[0/2](664000/4588595) loss:2.664 lr:0.0000100 epoch_Time:24954.0min: [2024-01-05 16:01:50,015][model8_pretrain.py][INFO] Epoch:[0/2](664000/4588595) loss:3.144 lr:0.0000100 epoch_Time:24954.0min: [2024-01-05 16:01:50,015][model8_pretrain.py][INFO] Epoch:[0/2](664000/4588595) loss:3.170 lr:0.0000100 epoch_Time:24954.0min: [2024-01-05 16:01:50,015][model8_pretrain.py][INFO] Epoch:[0/2](664000/4588595) loss:2.847 lr:0.0000100 epoch_Time:24954.0min: [2024-01-05 16:01:50,015][model8_pretrain.py][INFO] Epoch:[0/2](664000/4588595) loss:3.057 lr:0.0000100 epoch_Time:24954.0min: [2024-01-05 16:01:50,015][model8_pretrain.py][INFO] Epoch:[0/2](664000/4588595) loss:3.109 lr:0.0000100 epoch_Time:24954.0min: [2024-01-05 16:02:26,956][model8_pretrain.py][INFO] Epoch:[0/2](664100/4588595) loss:2.435 lr:0.0000100 epoch_Time:24954.0min: [2024-01-05 16:02:26,956][model8_pretrain.py][INFO] Epoch:[0/2](664100/4588595) loss:2.645 lr:0.0000100 epoch_Time:24954.0min: [2024-01-05 16:02:26,956][model8_pretrain.py][INFO] Epoch:[0/2](664100/4588595) loss:2.926 lr:0.0000100 epoch_Time:24954.0min: [2024-01-05 16:02:26,956][model8_pretrain.py][INFO] Epoch:[0/2](664100/4588595) loss:2.979 lr:0.0000100 epoch_Time:24954.0min: [2024-01-05 16:02:26,956][model8_pretrain.py][INFO] Epoch:[0/2](664100/4588595) loss:2.916 lr:0.0000100 epoch_Time:24954.0min: [2024-01-05 16:02:26,957][model8_pretrain.py][INFO] Epoch:[0/2](664100/4588595) loss:2.637 lr:0.0000100 epoch_Time:24954.0min: [2024-01-05 16:02:26,957][model8_pretrain.py][INFO] Epoch:[0/2](664100/4588595) loss:2.589 lr:0.0000100 epoch_Time:24954.0min: [2024-01-05 16:02:26,957][model8_pretrain.py][INFO] Epoch:[0/2](664100/4588595) loss:3.525 lr:0.0000100 epoch_Time:24954.0min: [2024-01-05 
16:03:03,891][model8_pretrain.py][INFO] Epoch:[0/2](664200/4588595) loss:3.122 lr:0.0000100 epoch_Time:24953.0min: [2024-01-05 16:03:03,891][model8_pretrain.py][INFO] Epoch:[0/2](664200/4588595) loss:3.076 lr:0.0000100 epoch_Time:24953.0min: [2024-01-05 16:03:03,891][model8_pretrain.py][INFO] Epoch:[0/2](664200/4588595) loss:3.004 lr:0.0000100 epoch_Time:24953.0min: [2024-01-05 16:03:03,891][model8_pretrain.py][INFO] Epoch:[0/2](664200/4588595) loss:2.992 lr:0.0000100 epoch_Time:24953.0min: [2024-01-05 16:03:03,891][model8_pretrain.py][INFO] Epoch:[0/2](664200/4588595) loss:2.679 lr:0.0000100 epoch_Time:24953.0min: [2024-01-05 16:03:03,891][model8_pretrain.py][INFO] Epoch:[0/2](664200/4588595) loss:3.204 lr:0.0000100 epoch_Time:24953.0min: [2024-01-05 16:03:03,891][model8_pretrain.py][INFO] Epoch:[0/2](664200/4588595) loss:3.264 lr:0.0000100 epoch_Time:24953.0min: [2024-01-05 16:03:03,891][model8_pretrain.py][INFO] Epoch:[0/2](664200/4588595) loss:3.015 lr:0.0000100 epoch_Time:24953.0min: [2024-01-05 16:03:40,822][model8_pretrain.py][INFO] Epoch:[0/2](664300/4588595) loss:2.217 lr:0.0000100 epoch_Time:24953.0min: [2024-01-05 16:03:40,822][model8_pretrain.py][INFO] Epoch:[0/2](664300/4588595) loss:1.471 lr:0.0000100 epoch_Time:24953.0min: [2024-01-05 16:03:40,822][model8_pretrain.py][INFO] Epoch:[0/2](664300/4588595) loss:3.010 lr:0.0000100 epoch_Time:24953.0min: [2024-01-05 16:03:40,822][model8_pretrain.py][INFO] Epoch:[0/2](664300/4588595) loss:2.755 lr:0.0000100 epoch_Time:24953.0min: [2024-01-05 16:03:40,822][model8_pretrain.py][INFO] Epoch:[0/2](664300/4588595) loss:2.455 lr:0.0000100 epoch_Time:24953.0min: [2024-01-05 16:03:40,822][model8_pretrain.py][INFO] Epoch:[0/2](664300/4588595) loss:3.129 lr:0.0000100 epoch_Time:24953.0min: [2024-01-05 16:03:40,822][model8_pretrain.py][INFO] Epoch:[0/2](664300/4588595) loss:2.609 lr:0.0000100 epoch_Time:24953.0min: [2024-01-05 16:03:40,822][model8_pretrain.py][INFO] Epoch:[0/2](664300/4588595) loss:2.774 lr:0.0000100 
epoch_Time:24953.0min: [2024-01-05 16:04:17,753][model8_pretrain.py][INFO] Epoch:[0/2](664400/4588595) loss:2.444 lr:0.0000100 epoch_Time:24951.0min: [2024-01-05 16:04:17,753][model8_pretrain.py][INFO] Epoch:[0/2](664400/4588595) loss:3.370 lr:0.0000100 epoch_Time:24951.0min: [2024-01-05 16:04:17,753][model8_pretrain.py][INFO] Epoch:[0/2](664400/4588595) loss:3.056 lr:0.0000100 epoch_Time:24951.0min: [2024-01-05 16:04:17,753][model8_pretrain.py][INFO] Epoch:[0/2](664400/4588595) loss:2.790 lr:0.0000100 epoch_Time:24951.0min: [2024-01-05 16:04:17,753][model8_pretrain.py][INFO] Epoch:[0/2](664400/4588595) loss:3.246 lr:0.0000100 epoch_Time:24951.0min: [2024-01-05 16:04:17,753][model8_pretrain.py][INFO] Epoch:[0/2](664400/4588595) loss:2.464 lr:0.0000100 epoch_Time:24951.0min: [2024-01-05 16:04:17,753][model8_pretrain.py][INFO] Epoch:[0/2](664400/4588595) loss:3.247 lr:0.0000100 epoch_Time:24951.0min: [2024-01-05 16:04:17,753][model8_pretrain.py][INFO] Epoch:[0/2](664400/4588595) loss:2.597 lr:0.0000100 epoch_Time:24951.0min: [2024-01-05 16:05:06,551][model8_pretrain.py][INFO] Epoch:[0/2](664500/4588595) loss:2.603 lr:0.0000100 epoch_Time:24952.0min: [2024-01-05 16:05:06,552][model8_pretrain.py][INFO] Epoch:[0/2](664500/4588595) loss:2.867 lr:0.0000100 epoch_Time:24952.0min: [2024-01-05 16:05:06,551][model8_pretrain.py][INFO] Epoch:[0/2](664500/4588595) loss:2.544 lr:0.0000100 epoch_Time:24952.0min: [2024-01-05 16:05:06,552][model8_pretrain.py][INFO] Epoch:[0/2](664500/4588595) loss:2.620 lr:0.0000100 epoch_Time:24952.0min: [2024-01-05 16:05:06,552][model8_pretrain.py][INFO] Epoch:[0/2](664500/4588595) loss:2.972 lr:0.0000100 epoch_Time:24952.0min: [2024-01-05 16:05:06,551][model8_pretrain.py][INFO] Epoch:[0/2](664500/4588595) loss:2.774 lr:0.0000100 epoch_Time:24952.0min: [2024-01-05 16:05:06,552][model8_pretrain.py][INFO] Epoch:[0/2](664500/4588595) loss:2.637 lr:0.0000100 epoch_Time:24952.0min: [2024-01-05 16:05:06,552][model8_pretrain.py][INFO] 
Epoch:[0/2](664500/4588595) loss:2.004 lr:0.0000100 epoch_Time:24952.0min: [2024-01-05 16:05:43,472][model8_pretrain.py][INFO] Epoch:[0/2](664600/4588595) loss:2.441 lr:0.0000100 epoch_Time:24952.0min: [2024-01-05 16:05:43,472][model8_pretrain.py][INFO] Epoch:[0/2](664600/4588595) loss:3.455 lr:0.0000100 epoch_Time:24952.0min: [2024-01-05 16:05:43,472][model8_pretrain.py][INFO] Epoch:[0/2](664600/4588595) loss:2.793 lr:0.0000100 epoch_Time:24952.0min: [2024-01-05 16:05:43,472][model8_pretrain.py][INFO] Epoch:[0/2](664600/4588595) loss:2.817 lr:0.0000100 epoch_Time:24952.0min: [2024-01-05 16:05:43,472][model8_pretrain.py][INFO] Epoch:[0/2](664600/4588595) loss:2.970 lr:0.0000100 epoch_Time:24952.0min: [2024-01-05 16:05:43,472][model8_pretrain.py][INFO] Epoch:[0/2](664600/4588595) loss:2.771 lr:0.0000100 epoch_Time:24952.0min: [2024-01-05 16:05:43,472][model8_pretrain.py][INFO] Epoch:[0/2](664600/4588595) loss:2.904 lr:0.0000100 epoch_Time:24952.0min: [2024-01-05 16:05:43,473][model8_pretrain.py][INFO] Epoch:[0/2](664600/4588595) loss:2.455 lr:0.0000100 epoch_Time:24952.0min: [2024-01-05 16:06:20,412][model8_pretrain.py][INFO] Epoch:[0/2](664700/4588595) loss:2.801 lr:0.0000100 epoch_Time:24950.0min: [2024-01-05 16:06:20,412][model8_pretrain.py][INFO] Epoch:[0/2](664700/4588595) loss:2.303 lr:0.0000100 epoch_Time:24950.0min: [2024-01-05 16:06:20,413][model8_pretrain.py][INFO] Epoch:[0/2](664700/4588595) loss:2.580 lr:0.0000100 epoch_Time:24950.0min: [2024-01-05 16:06:20,413][model8_pretrain.py][INFO] Epoch:[0/2](664700/4588595) loss:2.851 lr:0.0000100 epoch_Time:24950.0min: [2024-01-05 16:06:20,413][model8_pretrain.py][INFO] Epoch:[0/2](664700/4588595) loss:3.115 lr:0.0000100 epoch_Time:24950.0min: [2024-01-05 16:06:20,413][model8_pretrain.py][INFO] Epoch:[0/2](664700/4588595) loss:2.225 lr:0.0000100 epoch_Time:24950.0min: [2024-01-05 16:06:20,413][model8_pretrain.py][INFO] Epoch:[0/2](664700/4588595) loss:3.211 lr:0.0000100 epoch_Time:24950.0min: [2024-01-05 
16:06:20,413][model8_pretrain.py][INFO] Epoch:[0/2](664700/4588595) loss:3.322 lr:0.0000100 epoch_Time:24950.0min: [2024-01-05 16:06:57,362][model8_pretrain.py][INFO] Epoch:[0/2](664800/4588595) loss:3.265 lr:0.0000100 epoch_Time:24949.0min: [2024-01-05 16:06:57,362][model8_pretrain.py][INFO] Epoch:[0/2](664800/4588595) loss:2.798 lr:0.0000100 epoch_Time:24949.0min: [2024-01-05 16:06:57,362][model8_pretrain.py][INFO] Epoch:[0/2](664800/4588595) loss:2.755 lr:0.0000100 epoch_Time:24949.0min: [2024-01-05 16:06:57,362][model8_pretrain.py][INFO] Epoch:[0/2](664800/4588595) loss:2.442 lr:0.0000100 epoch_Time:24949.0min: [2024-01-05 16:06:57,362][model8_pretrain.py][INFO] Epoch:[0/2](664800/4588595) loss:3.059 lr:0.0000100 epoch_Time:24949.0min: [2024-01-05 16:06:57,362][model8_pretrain.py][INFO] Epoch:[0/2](664800/4588595) loss:3.021 lr:0.0000100 epoch_Time:24949.0min: [2024-01-05 16:06:57,362][model8_pretrain.py][INFO] Epoch:[0/2](664800/4588595) loss:2.989 lr:0.0000100 epoch_Time:24949.0min: [2024-01-05 16:06:57,363][model8_pretrain.py][INFO] Epoch:[0/2](664800/4588595) loss:2.994 lr:0.0000100 epoch_Time:24949.0min: [2024-01-05 16:07:34,300][model8_pretrain.py][INFO] Epoch:[0/2](664900/4588595) loss:2.610 lr:0.0000100 epoch_Time:24949.0min: [2024-01-05 16:07:34,300][model8_pretrain.py][INFO] Epoch:[0/2](664900/4588595) loss:2.429 lr:0.0000100 epoch_Time:24949.0min: [2024-01-05 16:07:34,300][model8_pretrain.py][INFO] Epoch:[0/2](664900/4588595) loss:2.988 lr:0.0000100 epoch_Time:24949.0min: [2024-01-05 16:07:34,300][model8_pretrain.py][INFO] Epoch:[0/2](664900/4588595) loss:3.097 lr:0.0000100 epoch_Time:24949.0min: [2024-01-05 16:07:34,300][model8_pretrain.py][INFO] Epoch:[0/2](664900/4588595) loss:3.280 lr:0.0000100 epoch_Time:24949.0min: [2024-01-05 16:07:34,300][model8_pretrain.py][INFO] Epoch:[0/2](664900/4588595) loss:2.517 lr:0.0000100 epoch_Time:24949.0min: [2024-01-05 16:07:34,300][model8_pretrain.py][INFO] Epoch:[0/2](664900/4588595) loss:3.350 lr:0.0000100 
epoch_Time:24949.0min: [2024-01-05 16:07:34,300][model8_pretrain.py][INFO] Epoch:[0/2](664900/4588595) loss:3.279 lr:0.0000100 epoch_Time:24949.0min: [2024-01-05 16:08:11,235][model8_pretrain.py][INFO] Epoch:[0/2](665000/4588595) loss:3.293 lr:0.0000100 epoch_Time:24948.0min: [2024-01-05 16:08:11,235][model8_pretrain.py][INFO] Epoch:[0/2](665000/4588595) loss:2.658 lr:0.0000100 epoch_Time:24948.0min: [2024-01-05 16:08:11,235][model8_pretrain.py][INFO] Epoch:[0/2](665000/4588595) loss:3.017 lr:0.0000100 epoch_Time:24948.0min: [2024-01-05 16:08:11,235][model8_pretrain.py][INFO] Epoch:[0/2](665000/4588595) loss:3.047 lr:0.0000100 epoch_Time:24948.0min: [2024-01-05 16:08:11,235][model8_pretrain.py][INFO] Epoch:[0/2](665000/4588595) loss:3.111 lr:0.0000100 epoch_Time:24948.0min: [2024-01-05 16:08:11,235][model8_pretrain.py][INFO] Epoch:[0/2](665000/4588595) loss:2.857 lr:0.0000100 epoch_Time:24948.0min: [2024-01-05 16:08:11,235][model8_pretrain.py][INFO] Epoch:[0/2](665000/4588595) loss:2.261 lr:0.0000100 epoch_Time:24948.0min: [2024-01-05 16:08:11,235][model8_pretrain.py][INFO] Epoch:[0/2](665000/4588595) loss:2.956 lr:0.0000100 epoch_Time:24948.0min: [2024-01-05 16:08:48,177][model8_pretrain.py][INFO] Epoch:[0/2](665100/4588595) loss:2.864 lr:0.0000100 epoch_Time:24947.0min: [2024-01-05 16:08:48,177][model8_pretrain.py][INFO] Epoch:[0/2](665100/4588595) loss:3.213 lr:0.0000100 epoch_Time:24947.0min: [2024-01-05 16:08:48,177][model8_pretrain.py][INFO] Epoch:[0/2](665100/4588595) loss:2.201 lr:0.0000100 epoch_Time:24947.0min: [2024-01-05 16:08:48,177][model8_pretrain.py][INFO] Epoch:[0/2](665100/4588595) loss:2.848 lr:0.0000100 epoch_Time:24947.0min: [2024-01-05 16:08:48,177][model8_pretrain.py][INFO] Epoch:[0/2](665100/4588595) loss:2.836 lr:0.0000100 epoch_Time:24947.0min: [2024-01-05 16:08:48,177][model8_pretrain.py][INFO] Epoch:[0/2](665100/4588595) loss:2.408 lr:0.0000100 epoch_Time:24947.0min: [2024-01-05 16:08:48,178][model8_pretrain.py][INFO] 
Epoch:[0/2](665100/4588595) loss:2.846 lr:0.0000100 epoch_Time:24947.0min: [2024-01-05 16:08:48,178][model8_pretrain.py][INFO] Epoch:[0/2](665100/4588595) loss:3.156 lr:0.0000100 epoch_Time:24947.0min: [2024-01-05 16:09:25,113][model8_pretrain.py][INFO] Epoch:[0/2](665200/4588595) loss:2.926 lr:0.0000100 epoch_Time:24947.0min: [2024-01-05 16:09:25,113][model8_pretrain.py][INFO] Epoch:[0/2](665200/4588595) loss:3.147 lr:0.0000100 epoch_Time:24947.0min: [2024-01-05 16:09:25,113][model8_pretrain.py][INFO] Epoch:[0/2](665200/4588595) loss:2.677 lr:0.0000100 epoch_Time:24947.0min: [2024-01-05 16:09:25,113][model8_pretrain.py][INFO] Epoch:[0/2](665200/4588595) loss:3.159 lr:0.0000100 epoch_Time:24947.0min: [2024-01-05 16:09:25,113][model8_pretrain.py][INFO] Epoch:[0/2](665200/4588595) loss:2.868 lr:0.0000100 epoch_Time:24947.0min: [2024-01-05 16:09:25,113][model8_pretrain.py][INFO] Epoch:[0/2](665200/4588595) loss:3.130 lr:0.0000100 epoch_Time:24947.0min: [2024-01-05 16:09:25,113][model8_pretrain.py][INFO] Epoch:[0/2](665200/4588595) loss:3.138 lr:0.0000100 epoch_Time:24947.0min: [2024-01-05 16:09:25,113][model8_pretrain.py][INFO] Epoch:[0/2](665200/4588595) loss:2.726 lr:0.0000100 epoch_Time:24947.0min: [2024-01-05 16:10:14,083][model8_pretrain.py][INFO] Epoch:[0/2](665300/4588595) loss:2.964 lr:0.0000100 epoch_Time:24947.0min: [2024-01-05 16:10:14,083][model8_pretrain.py][INFO] Epoch:[0/2](665300/4588595) loss:2.977 lr:0.0000100 epoch_Time:24947.0min: [2024-01-05 16:10:14,083][model8_pretrain.py][INFO] Epoch:[0/2](665300/4588595) loss:2.601 lr:0.0000100 epoch_Time:24947.0min: [2024-01-05 16:10:14,083][model8_pretrain.py][INFO] Epoch:[0/2](665300/4588595) loss:2.491 lr:0.0000100 epoch_Time:24947.0min: [2024-01-05 16:10:14,083][model8_pretrain.py][INFO] Epoch:[0/2](665300/4588595) loss:3.364 lr:0.0000100 epoch_Time:24947.0min: [2024-01-05 16:10:14,083][model8_pretrain.py][INFO] Epoch:[0/2](665300/4588595) loss:3.390 lr:0.0000100 epoch_Time:24947.0min: [2024-01-05 
16:10:14,083][model8_pretrain.py][INFO] Epoch:[0/2](665300/4588595) loss:2.242 lr:0.0000100 epoch_Time:24947.0min: [2024-01-05 16:10:14,083][model8_pretrain.py][INFO] Epoch:[0/2](665300/4588595) loss:2.720 lr:0.0000100 epoch_Time:24947.0min: [2024-01-05 16:10:51,009][model8_pretrain.py][INFO] Epoch:[0/2](665400/4588595) loss:2.935 lr:0.0000100 epoch_Time:24946.0min: [2024-01-05 16:10:51,009][model8_pretrain.py][INFO] Epoch:[0/2](665400/4588595) loss:3.257 lr:0.0000100 epoch_Time:24946.0min: [2024-01-05 16:10:51,009][model8_pretrain.py][INFO] Epoch:[0/2](665400/4588595) loss:2.448 lr:0.0000100 epoch_Time:24946.0min: [2024-01-05 16:10:51,009][model8_pretrain.py][INFO] Epoch:[0/2](665400/4588595) loss:2.960 lr:0.0000100 epoch_Time:24946.0min: [2024-01-05 16:10:51,009][model8_pretrain.py][INFO] Epoch:[0/2](665400/4588595) loss:2.666 lr:0.0000100 epoch_Time:24946.0min: [2024-01-05 16:10:51,009][model8_pretrain.py][INFO] Epoch:[0/2](665400/4588595) loss:2.623 lr:0.0000100 epoch_Time:24946.0min: [2024-01-05 16:10:51,009][model8_pretrain.py][INFO] Epoch:[0/2](665400/4588595) loss:2.774 lr:0.0000100 epoch_Time:24946.0min: [2024-01-05 16:10:51,010][model8_pretrain.py][INFO] Epoch:[0/2](665400/4588595) loss:3.145 lr:0.0000100 epoch_Time:24946.0min: [2024-01-05 16:11:27,958][model8_pretrain.py][INFO] Epoch:[0/2](665500/4588595) loss:2.899 lr:0.0000100 epoch_Time:24946.0min: [2024-01-05 16:11:27,958][model8_pretrain.py][INFO] Epoch:[0/2](665500/4588595) loss:2.966 lr:0.0000100 epoch_Time:24946.0min: [2024-01-05 16:11:27,958][model8_pretrain.py][INFO] Epoch:[0/2](665500/4588595) loss:3.148 lr:0.0000100 epoch_Time:24946.0min: [2024-01-05 16:11:27,958][model8_pretrain.py][INFO] Epoch:[0/2](665500/4588595) loss:2.801 lr:0.0000100 epoch_Time:24946.0min: [2024-01-05 16:11:27,958][model8_pretrain.py][INFO] Epoch:[0/2](665500/4588595) loss:3.138 lr:0.0000100 epoch_Time:24946.0min: [2024-01-05 16:11:27,958][model8_pretrain.py][INFO] Epoch:[0/2](665500/4588595) loss:2.545 lr:0.0000100 
epoch_Time:24946.0min: [2024-01-05 16:11:27,958][model8_pretrain.py][INFO] Epoch:[0/2](665500/4588595) loss:2.889 lr:0.0000100 epoch_Time:24946.0min: [2024-01-05 16:11:27,958][model8_pretrain.py][INFO] Epoch:[0/2](665500/4588595) loss:2.572 lr:0.0000100 epoch_Time:24946.0min: [2024-01-05 16:12:04,904][model8_pretrain.py][INFO] Epoch:[0/2](665600/4588595) loss:3.055 lr:0.0000100 epoch_Time:24945.0min: [2024-01-05 16:12:04,904][model8_pretrain.py][INFO] Epoch:[0/2](665600/4588595) loss:3.010 lr:0.0000100 epoch_Time:24945.0min: [2024-01-05 16:12:04,904][model8_pretrain.py][INFO] Epoch:[0/2](665600/4588595) loss:3.026 lr:0.0000100 epoch_Time:24945.0min: [2024-01-05 16:12:04,904][model8_pretrain.py][INFO] Epoch:[0/2](665600/4588595) loss:2.527 lr:0.0000100 epoch_Time:24945.0min: [2024-01-05 16:12:04,904][model8_pretrain.py][INFO] Epoch:[0/2](665600/4588595) loss:3.091 lr:0.0000100 epoch_Time:24945.0min: [2024-01-05 16:12:04,904][model8_pretrain.py][INFO] Epoch:[0/2](665600/4588595) loss:2.660 lr:0.0000100 epoch_Time:24945.0min: [2024-01-05 16:12:04,904][model8_pretrain.py][INFO] Epoch:[0/2](665600/4588595) loss:2.902 lr:0.0000100 epoch_Time:24945.0min: [2024-01-05 16:12:04,904][model8_pretrain.py][INFO] Epoch:[0/2](665600/4588595) loss:2.281 lr:0.0000100 epoch_Time:24945.0min: [2024-01-05 16:12:41,839][model8_pretrain.py][INFO] Epoch:[0/2](665700/4588595) loss:2.710 lr:0.0000100 epoch_Time:24944.0min: [2024-01-05 16:12:41,839][model8_pretrain.py][INFO] Epoch:[0/2](665700/4588595) loss:2.790 lr:0.0000100 epoch_Time:24944.0min: [2024-01-05 16:12:41,839][model8_pretrain.py][INFO] Epoch:[0/2](665700/4588595) loss:3.136 lr:0.0000100 epoch_Time:24944.0min: [2024-01-05 16:12:41,839][model8_pretrain.py][INFO] Epoch:[0/2](665700/4588595) loss:2.963 lr:0.0000100 epoch_Time:24944.0min: [2024-01-05 16:12:41,839][model8_pretrain.py][INFO] Epoch:[0/2](665700/4588595) loss:2.897 lr:0.0000100 epoch_Time:24944.0min: [2024-01-05 16:12:41,839][model8_pretrain.py][INFO] 
Epoch:[0/2](665700/4588595) loss:2.877 lr:0.0000100 epoch_Time:24944.0min: [2024-01-05 16:12:41,839][model8_pretrain.py][INFO] Epoch:[0/2](665700/4588595) loss:2.584 lr:0.0000100 epoch_Time:24944.0min: [2024-01-05 16:12:41,840][model8_pretrain.py][INFO] Epoch:[0/2](665700/4588595) loss:2.669 lr:0.0000100 epoch_Time:24944.0min: [2024-01-05 16:13:18,780][model8_pretrain.py][INFO] Epoch:[0/2](665800/4588595) loss:2.798 lr:0.0000100 epoch_Time:24943.0min: [2024-01-05 16:13:18,780][model8_pretrain.py][INFO] Epoch:[0/2](665800/4588595) loss:2.337 lr:0.0000100 epoch_Time:24943.0min: [2024-01-05 16:13:18,780][model8_pretrain.py][INFO] Epoch:[0/2](665800/4588595) loss:2.592 lr:0.0000100 epoch_Time:24943.0min: [2024-01-05 16:13:18,780][model8_pretrain.py][INFO] Epoch:[0/2](665800/4588595) loss:2.489 lr:0.0000100 epoch_Time:24943.0min: [2024-01-05 16:13:18,780][model8_pretrain.py][INFO] Epoch:[0/2](665800/4588595) loss:3.024 lr:0.0000100 epoch_Time:24943.0min: [2024-01-05 16:13:18,781][model8_pretrain.py][INFO] Epoch:[0/2](665800/4588595) loss:3.144 lr:0.0000100 epoch_Time:24943.0min: [2024-01-05 16:13:18,781][model8_pretrain.py][INFO] Epoch:[0/2](665800/4588595) loss:2.527 lr:0.0000100 epoch_Time:24943.0min: [2024-01-05 16:13:18,781][model8_pretrain.py][INFO] Epoch:[0/2](665800/4588595) loss:2.987 lr:0.0000100 epoch_Time:24943.0min: [2024-01-05 16:13:55,714][model8_pretrain.py][INFO] Epoch:[0/2](665900/4588595) loss:2.000 lr:0.0000100 epoch_Time:24942.0min: [2024-01-05 16:13:55,714][model8_pretrain.py][INFO] Epoch:[0/2](665900/4588595) loss:2.624 lr:0.0000100 epoch_Time:24942.0min: [2024-01-05 16:13:55,714][model8_pretrain.py][INFO] Epoch:[0/2](665900/4588595) loss:3.369 lr:0.0000100 epoch_Time:24942.0min: [2024-01-05 16:13:55,714][model8_pretrain.py][INFO] Epoch:[0/2](665900/4588595) loss:2.511 lr:0.0000100 epoch_Time:24942.0min: [2024-01-05 16:13:55,714][model8_pretrain.py][INFO] Epoch:[0/2](665900/4588595) loss:2.879 lr:0.0000100 epoch_Time:24942.0min: [2024-01-05 
16:13:55,714][model8_pretrain.py][INFO] Epoch:[0/2](665900/4588595) loss:2.844 lr:0.0000100 epoch_Time:24942.0min: [2024-01-05 16:13:55,715][model8_pretrain.py][INFO] Epoch:[0/2](665900/4588595) loss:2.309 lr:0.0000100 epoch_Time:24942.0min: [2024-01-05 16:13:55,716][model8_pretrain.py][INFO] Epoch:[0/2](665900/4588595) loss:3.111 lr:0.0000100 epoch_Time:24942.0min: [2024-01-05 16:14:32,652][model8_pretrain.py][INFO] Epoch:[0/2](666000/4588595) loss:2.629 lr:0.0000100 epoch_Time:24942.0min: [2024-01-05 16:14:32,652][model8_pretrain.py][INFO] Epoch:[0/2](666000/4588595) loss:2.842 lr:0.0000100 epoch_Time:24942.0min: [2024-01-05 16:14:32,652][model8_pretrain.py][INFO] Epoch:[0/2](666000/4588595) loss:2.432 lr:0.0000100 epoch_Time:24942.0min: [2024-01-05 16:14:32,652][model8_pretrain.py][INFO] Epoch:[0/2](666000/4588595) loss:2.871 lr:0.0000100 epoch_Time:24942.0min: [2024-01-05 16:14:32,652][model8_pretrain.py][INFO] Epoch:[0/2](666000/4588595) loss:2.867 lr:0.0000100 epoch_Time:24942.0min: [2024-01-05 16:14:32,652][model8_pretrain.py][INFO] Epoch:[0/2](666000/4588595) loss:2.706 lr:0.0000100 epoch_Time:24942.0min: [2024-01-05 16:14:32,652][model8_pretrain.py][INFO] Epoch:[0/2](666000/4588595) loss:3.079 lr:0.0000100 epoch_Time:24942.0min: [2024-01-05 16:14:32,652][model8_pretrain.py][INFO] Epoch:[0/2](666000/4588595) loss:2.931 lr:0.0000100 epoch_Time:24942.0min: [2024-01-05 16:15:21,711][model8_pretrain.py][INFO] Epoch:[0/2](666100/4588595) loss:3.252 lr:0.0000100 epoch_Time:24942.0min: [2024-01-05 16:15:21,711][model8_pretrain.py][INFO] Epoch:[0/2](666100/4588595) loss:2.832 lr:0.0000100 epoch_Time:24942.0min: [2024-01-05 16:15:21,711][model8_pretrain.py][INFO] Epoch:[0/2](666100/4588595) loss:2.424 lr:0.0000100 epoch_Time:24942.0min: [2024-01-05 16:15:21,711][model8_pretrain.py][INFO] Epoch:[0/2](666100/4588595) loss:2.682 lr:0.0000100 epoch_Time:24942.0min: [2024-01-05 16:15:21,711][model8_pretrain.py][INFO] Epoch:[0/2](666100/4588595) loss:2.827 lr:0.0000100 
epoch_Time:24942.0min: [2024-01-05 16:15:21,711][model8_pretrain.py][INFO] Epoch:[0/2](666100/4588595) loss:2.931 lr:0.0000100 epoch_Time:24942.0min: [2024-01-05 16:15:21,711][model8_pretrain.py][INFO] Epoch:[0/2](666100/4588595) loss:2.611 lr:0.0000100 epoch_Time:24942.0min: [2024-01-05 16:15:21,711][model8_pretrain.py][INFO] Epoch:[0/2](666100/4588595) loss:2.934 lr:0.0000100 epoch_Time:24942.0min: [2024-01-05 16:15:58,642][model8_pretrain.py][INFO] Epoch:[0/2](666200/4588595) loss:3.144 lr:0.0000100 epoch_Time:24941.0min: [2024-01-05 16:15:58,642][model8_pretrain.py][INFO] Epoch:[0/2](666200/4588595) loss:3.105 lr:0.0000100 epoch_Time:24941.0min: [2024-01-05 16:15:58,642][model8_pretrain.py][INFO] Epoch:[0/2](666200/4588595) loss:2.906 lr:0.0000100 epoch_Time:24941.0min: [2024-01-05 16:15:58,642][model8_pretrain.py][INFO] Epoch:[0/2](666200/4588595) loss:3.109 lr:0.0000100 epoch_Time:24941.0min: [2024-01-05 16:15:58,642][model8_pretrain.py][INFO] Epoch:[0/2](666200/4588595) loss:3.011 lr:0.0000100 epoch_Time:24941.0min: [2024-01-05 16:15:58,642][model8_pretrain.py][INFO] Epoch:[0/2](666200/4588595) loss:3.112 lr:0.0000100 epoch_Time:24941.0min: [2024-01-05 16:15:58,642][model8_pretrain.py][INFO] Epoch:[0/2](666200/4588595) loss:3.014 lr:0.0000100 epoch_Time:24941.0min: [2024-01-05 16:15:58,642][model8_pretrain.py][INFO] Epoch:[0/2](666200/4588595) loss:2.928 lr:0.0000100 epoch_Time:24941.0min: [2024-01-05 16:16:35,588][model8_pretrain.py][INFO] Epoch:[0/2](666300/4588595) loss:2.665 lr:0.0000100 epoch_Time:24941.0min: [2024-01-05 16:16:35,588][model8_pretrain.py][INFO] Epoch:[0/2](666300/4588595) loss:2.652 lr:0.0000100 epoch_Time:24941.0min: [2024-01-05 16:16:35,588][model8_pretrain.py][INFO] Epoch:[0/2](666300/4588595) loss:2.688 lr:0.0000100 epoch_Time:24941.0min: [2024-01-05 16:16:35,588][model8_pretrain.py][INFO] Epoch:[0/2](666300/4588595) loss:3.148 lr:0.0000100 epoch_Time:24941.0min: [2024-01-05 16:16:35,588][model8_pretrain.py][INFO] 
Epoch:[0/2](666300/4588595) loss:2.401 lr:0.0000100 epoch_Time:24941.0min: [2024-01-05 16:16:35,588][model8_pretrain.py][INFO] Epoch:[0/2](666300/4588595) loss:3.099 lr:0.0000100 epoch_Time:24941.0min: [2024-01-05 16:16:35,588][model8_pretrain.py][INFO] Epoch:[0/2](666300/4588595) loss:2.521 lr:0.0000100 epoch_Time:24941.0min: [2024-01-05 16:16:35,588][model8_pretrain.py][INFO] Epoch:[0/2](666300/4588595) loss:2.860 lr:0.0000100 epoch_Time:24941.0min: [2024-01-05 16:17:12,538][model8_pretrain.py][INFO] Epoch:[0/2](666400/4588595) loss:2.679 lr:0.0000100 epoch_Time:24940.0min: [2024-01-05 16:17:12,538][model8_pretrain.py][INFO] Epoch:[0/2](666400/4588595) loss:2.843 lr:0.0000100 epoch_Time:24940.0min: [2024-01-05 16:17:12,538][model8_pretrain.py][INFO] Epoch:[0/2](666400/4588595) loss:3.054 lr:0.0000100 epoch_Time:24940.0min: [2024-01-05 16:17:12,538][model8_pretrain.py][INFO] Epoch:[0/2](666400/4588595) loss:3.292 lr:0.0000100 epoch_Time:24940.0min: [2024-01-05 16:17:12,538][model8_pretrain.py][INFO] Epoch:[0/2](666400/4588595) loss:2.763 lr:0.0000100 epoch_Time:24940.0min: [2024-01-05 16:17:12,538][model8_pretrain.py][INFO] Epoch:[0/2](666400/4588595) loss:3.127 lr:0.0000100 epoch_Time:24940.0min: [2024-01-05 16:17:12,538][model8_pretrain.py][INFO] Epoch:[0/2](666400/4588595) loss:2.788 lr:0.0000100 epoch_Time:24940.0min: [2024-01-05 16:17:12,538][model8_pretrain.py][INFO] Epoch:[0/2](666400/4588595) loss:3.380 lr:0.0000100 epoch_Time:24940.0min: [2024-01-05 16:17:49,471][model8_pretrain.py][INFO] Epoch:[0/2](666500/4588595) loss:2.860 lr:0.0000100 epoch_Time:24939.0min: [2024-01-05 16:17:49,471][model8_pretrain.py][INFO] Epoch:[0/2](666500/4588595) loss:2.510 lr:0.0000100 epoch_Time:24939.0min: [2024-01-05 16:17:49,471][model8_pretrain.py][INFO] Epoch:[0/2](666500/4588595) loss:3.042 lr:0.0000100 epoch_Time:24939.0min: [2024-01-05 16:17:49,471][model8_pretrain.py][INFO] Epoch:[0/2](666500/4588595) loss:3.060 lr:0.0000100 epoch_Time:24939.0min: [2024-01-05 
16:17:49,471][model8_pretrain.py][INFO] Epoch:[0/2](666500/4588595) loss:3.193 lr:0.0000100 epoch_Time:24939.0min: [2024-01-05 16:17:49,471][model8_pretrain.py][INFO] Epoch:[0/2](666500/4588595) loss:2.560 lr:0.0000100 epoch_Time:24939.0min: [2024-01-05 16:17:49,471][model8_pretrain.py][INFO] Epoch:[0/2](666500/4588595) loss:2.693 lr:0.0000100 epoch_Time:24939.0min: [2024-01-05 16:17:49,472][model8_pretrain.py][INFO] Epoch:[0/2](666500/4588595) loss:2.587 lr:0.0000100 epoch_Time:24939.0min: [2024-01-05 16:18:26,405][model8_pretrain.py][INFO] Epoch:[0/2](666600/4588595) loss:2.893 lr:0.0000100 epoch_Time:24939.0min: [2024-01-05 16:18:26,405][model8_pretrain.py][INFO] Epoch:[0/2](666600/4588595) loss:3.029 lr:0.0000100 epoch_Time:24939.0min: [2024-01-05 16:18:26,406][model8_pretrain.py][INFO] Epoch:[0/2](666600/4588595) loss:2.556 lr:0.0000100 epoch_Time:24939.0min: [2024-01-05 16:18:26,406][model8_pretrain.py][INFO] Epoch:[0/2](666600/4588595) loss:2.401 lr:0.0000100 epoch_Time:24939.0min: [2024-01-05 16:18:26,406][model8_pretrain.py][INFO] Epoch:[0/2](666600/4588595) loss:2.655 lr:0.0000100 epoch_Time:24939.0min: [2024-01-05 16:18:26,406][model8_pretrain.py][INFO] Epoch:[0/2](666600/4588595) loss:3.066 lr:0.0000100 epoch_Time:24939.0min: [2024-01-05 16:18:26,406][model8_pretrain.py][INFO] Epoch:[0/2](666600/4588595) loss:3.024 lr:0.0000100 epoch_Time:24939.0min: [2024-01-05 16:18:26,406][model8_pretrain.py][INFO] Epoch:[0/2](666600/4588595) loss:2.360 lr:0.0000100 epoch_Time:24939.0min: [2024-01-05 16:19:03,354][model8_pretrain.py][INFO] Epoch:[0/2](666700/4588595) loss:2.963 lr:0.0000100 epoch_Time:24937.0min: [2024-01-05 16:19:03,354][model8_pretrain.py][INFO] Epoch:[0/2](666700/4588595) loss:3.221 lr:0.0000100 epoch_Time:24937.0min: [2024-01-05 16:19:03,354][model8_pretrain.py][INFO] Epoch:[0/2](666700/4588595) loss:3.258 lr:0.0000100 epoch_Time:24937.0min: [2024-01-05 16:19:03,354][model8_pretrain.py][INFO] Epoch:[0/2](666700/4588595) loss:3.055 lr:0.0000100 
epoch_Time:24937.0min: [2024-01-05 16:19:03,354][model8_pretrain.py][INFO] Epoch:[0/2](666700/4588595) loss:2.571 lr:0.0000100 epoch_Time:24937.0min: [2024-01-05 16:19:03,354][model8_pretrain.py][INFO] Epoch:[0/2](666700/4588595) loss:3.564 lr:0.0000100 epoch_Time:24937.0min: [2024-01-05 16:19:03,354][model8_pretrain.py][INFO] Epoch:[0/2](666700/4588595) loss:2.930 lr:0.0000100 epoch_Time:24937.0min: [2024-01-05 16:19:03,354][model8_pretrain.py][INFO] Epoch:[0/2](666700/4588595) loss:2.704 lr:0.0000100 epoch_Time:24937.0min: [2024-01-05 16:19:40,290][model8_pretrain.py][INFO] Epoch:[0/2](666800/4588595) loss:2.562 lr:0.0000100 epoch_Time:24937.0min: [2024-01-05 16:19:40,290][model8_pretrain.py][INFO] Epoch:[0/2](666800/4588595) loss:3.156 lr:0.0000100 epoch_Time:24937.0min: [2024-01-05 16:19:40,290][model8_pretrain.py][INFO] Epoch:[0/2](666800/4588595) loss:3.083 lr:0.0000100 epoch_Time:24937.0min: [2024-01-05 16:19:40,290][model8_pretrain.py][INFO] Epoch:[0/2](666800/4588595) loss:3.209 lr:0.0000100 epoch_Time:24937.0min: [2024-01-05 16:19:40,290][model8_pretrain.py][INFO] Epoch:[0/2](666800/4588595) loss:3.595 lr:0.0000100 epoch_Time:24937.0min: [2024-01-05 16:19:40,290][model8_pretrain.py][INFO] Epoch:[0/2](666800/4588595) loss:2.463 lr:0.0000100 epoch_Time:24937.0min: [2024-01-05 16:19:40,290][model8_pretrain.py][INFO] Epoch:[0/2](666800/4588595) loss:3.187 lr:0.0000100 epoch_Time:24937.0min: [2024-01-05 16:19:40,290][model8_pretrain.py][INFO] Epoch:[0/2](666800/4588595) loss:2.699 lr:0.0000100 epoch_Time:24937.0min: [2024-01-05 16:20:29,135][model8_pretrain.py][INFO] Epoch:[0/2](666900/4588595) loss:2.173 lr:0.0000100 epoch_Time:24937.0min: [2024-01-05 16:20:29,135][model8_pretrain.py][INFO] Epoch:[0/2](666900/4588595) loss:3.204 lr:0.0000100 epoch_Time:24937.0min: [2024-01-05 16:20:29,135][model8_pretrain.py][INFO] Epoch:[0/2](666900/4588595) loss:2.806 lr:0.0000100 epoch_Time:24937.0min: [2024-01-05 16:20:29,135][model8_pretrain.py][INFO] 
Epoch:[0/2](666900/4588595) loss:2.800 lr:0.0000100 epoch_Time:24937.0min: [2024-01-05 16:20:29,135][model8_pretrain.py][INFO] Epoch:[0/2](666900/4588595) loss:3.007 lr:0.0000100 epoch_Time:24937.0min: [2024-01-05 16:20:29,135][model8_pretrain.py][INFO] Epoch:[0/2](666900/4588595) loss:2.334 lr:0.0000100 epoch_Time:24937.0min: [2024-01-05 16:20:29,135][model8_pretrain.py][INFO] Epoch:[0/2](666900/4588595) loss:2.892 lr:0.0000100 epoch_Time:24937.0min: [2024-01-05 16:20:29,135][model8_pretrain.py][INFO] Epoch:[0/2](666900/4588595) loss:3.023 lr:0.0000100 epoch_Time:24937.0min: [2024-01-05 16:21:06,068][model8_pretrain.py][INFO] Epoch:[0/2](667000/4588595) loss:2.780 lr:0.0000100 epoch_Time:24936.0min: [2024-01-05 16:21:06,068][model8_pretrain.py][INFO] Epoch:[0/2](667000/4588595) loss:2.661 lr:0.0000100 epoch_Time:24936.0min: [2024-01-05 16:21:06,068][model8_pretrain.py][INFO] Epoch:[0/2](667000/4588595) loss:3.191 lr:0.0000100 epoch_Time:24936.0min: [2024-01-05 16:21:06,068][model8_pretrain.py][INFO] Epoch:[0/2](667000/4588595) loss:2.621 lr:0.0000100 epoch_Time:24936.0min: [2024-01-05 16:21:06,068][model8_pretrain.py][INFO] Epoch:[0/2](667000/4588595) loss:2.386 lr:0.0000100 epoch_Time:24936.0min: [2024-01-05 16:21:06,068][model8_pretrain.py][INFO] Epoch:[0/2](667000/4588595) loss:2.670 lr:0.0000100 epoch_Time:24936.0min: [2024-01-05 16:21:06,068][model8_pretrain.py][INFO] Epoch:[0/2](667000/4588595) loss:2.387 lr:0.0000100 epoch_Time:24936.0min: [2024-01-05 16:21:06,068][model8_pretrain.py][INFO] Epoch:[0/2](667000/4588595) loss:2.442 lr:0.0000100 epoch_Time:24936.0min: [2024-01-05 16:21:43,018][model8_pretrain.py][INFO] Epoch:[0/2](667100/4588595) loss:2.914 lr:0.0000100 epoch_Time:24936.0min: [2024-01-05 16:21:43,018][model8_pretrain.py][INFO] Epoch:[0/2](667100/4588595) loss:2.612 lr:0.0000100 epoch_Time:24936.0min: [2024-01-05 16:21:43,018][model8_pretrain.py][INFO] Epoch:[0/2](667100/4588595) loss:2.597 lr:0.0000100 epoch_Time:24936.0min: [2024-01-05 
16:21:43,018][model8_pretrain.py][INFO] Epoch:[0/2](667100/4588595) loss:2.836 lr:0.0000100 epoch_Time:24936.0min: [2024-01-05 16:21:43,019][model8_pretrain.py][INFO] Epoch:[0/2](667100/4588595) loss:3.155 lr:0.0000100 epoch_Time:24936.0min: [2024-01-05 16:21:43,019][model8_pretrain.py][INFO] Epoch:[0/2](667100/4588595) loss:2.675 lr:0.0000100 epoch_Time:24936.0min: [2024-01-05 16:21:43,019][model8_pretrain.py][INFO] Epoch:[0/2](667100/4588595) loss:2.202 lr:0.0000100 epoch_Time:24936.0min: [2024-01-05 16:21:43,019][model8_pretrain.py][INFO] Epoch:[0/2](667100/4588595) loss:2.714 lr:0.0000100 epoch_Time:24936.0min: [2024-01-05 16:22:19,984][model8_pretrain.py][INFO] Epoch:[0/2](667200/4588595) loss:3.157 lr:0.0000100 epoch_Time:24935.0min: [2024-01-05 16:22:19,984][model8_pretrain.py][INFO] Epoch:[0/2](667200/4588595) loss:2.477 lr:0.0000100 epoch_Time:24935.0min: [2024-01-05 16:22:19,984][model8_pretrain.py][INFO] Epoch:[0/2](667200/4588595) loss:2.366 lr:0.0000100 epoch_Time:24935.0min: [2024-01-05 16:22:19,984][model8_pretrain.py][INFO] Epoch:[0/2](667200/4588595) loss:3.532 lr:0.0000100 epoch_Time:24935.0min: [2024-01-05 16:22:19,984][model8_pretrain.py][INFO] Epoch:[0/2](667200/4588595) loss:3.162 lr:0.0000100 epoch_Time:24935.0min: [2024-01-05 16:22:19,984][model8_pretrain.py][INFO] Epoch:[0/2](667200/4588595) loss:2.773 lr:0.0000100 epoch_Time:24935.0min: [2024-01-05 16:22:19,984][model8_pretrain.py][INFO] Epoch:[0/2](667200/4588595) loss:2.659 lr:0.0000100 epoch_Time:24935.0min: [2024-01-05 16:22:19,984][model8_pretrain.py][INFO] Epoch:[0/2](667200/4588595) loss:3.085 lr:0.0000100 epoch_Time:24935.0min: [2024-01-05 16:22:56,934][model8_pretrain.py][INFO] Epoch:[0/2](667300/4588595) loss:2.617 lr:0.0000100 epoch_Time:24934.0min: [2024-01-05 16:22:56,934][model8_pretrain.py][INFO] Epoch:[0/2](667300/4588595) loss:3.273 lr:0.0000100 epoch_Time:24934.0min: [2024-01-05 16:22:56,934][model8_pretrain.py][INFO] Epoch:[0/2](667300/4588595) loss:3.073 lr:0.0000100 
epoch_Time:24934.0min: [2024-01-05 16:22:56,934][model8_pretrain.py][INFO] Epoch:[0/2](667300/4588595) loss:2.757 lr:0.0000100 epoch_Time:24934.0min: [2024-01-05 16:22:56,934][model8_pretrain.py][INFO] Epoch:[0/2](667300/4588595) loss:2.678 lr:0.0000100 epoch_Time:24934.0min: [2024-01-05 16:22:56,934][model8_pretrain.py][INFO] Epoch:[0/2](667300/4588595) loss:2.814 lr:0.0000100 epoch_Time:24934.0min: [2024-01-05 16:22:56,934][model8_pretrain.py][INFO] Epoch:[0/2](667300/4588595) loss:2.872 lr:0.0000100 epoch_Time:24934.0min: [2024-01-05 16:22:56,935][model8_pretrain.py][INFO] Epoch:[0/2](667300/4588595) loss:2.684 lr:0.0000100 epoch_Time:24934.0min: [2024-01-05 16:23:33,905][model8_pretrain.py][INFO] Epoch:[0/2](667400/4588595) loss:3.190 lr:0.0000100 epoch_Time:24934.0min: [2024-01-05 16:23:33,905][model8_pretrain.py][INFO] Epoch:[0/2](667400/4588595) loss:2.738 lr:0.0000100 epoch_Time:24934.0min: [2024-01-05 16:23:33,905][model8_pretrain.py][INFO] Epoch:[0/2](667400/4588595) loss:1.953 lr:0.0000100 epoch_Time:24934.0min: [2024-01-05 16:23:33,905][model8_pretrain.py][INFO] Epoch:[0/2](667400/4588595) loss:2.865 lr:0.0000100 epoch_Time:24934.0min: [2024-01-05 16:23:33,905][model8_pretrain.py][INFO] Epoch:[0/2](667400/4588595) loss:2.798 lr:0.0000100 epoch_Time:24934.0min: [2024-01-05 16:23:33,905][model8_pretrain.py][INFO] Epoch:[0/2](667400/4588595) loss:2.484 lr:0.0000100 epoch_Time:24934.0min: [2024-01-05 16:23:33,905][model8_pretrain.py][INFO] Epoch:[0/2](667400/4588595) loss:3.288 lr:0.0000100 epoch_Time:24934.0min: [2024-01-05 16:23:33,905][model8_pretrain.py][INFO] Epoch:[0/2](667400/4588595) loss:2.691 lr:0.0000100 epoch_Time:24934.0min: [2024-01-05 16:24:10,850][model8_pretrain.py][INFO] Epoch:[0/2](667500/4588595) loss:2.382 lr:0.0000100 epoch_Time:24933.0min: [2024-01-05 16:24:10,850][model8_pretrain.py][INFO] Epoch:[0/2](667500/4588595) loss:2.564 lr:0.0000100 epoch_Time:24933.0min: [2024-01-05 16:24:10,850][model8_pretrain.py][INFO] 
Epoch:[0/2](667500/4588595) loss:3.081 lr:0.0000100 epoch_Time:24933.0min: [2024-01-05 16:24:10,850][model8_pretrain.py][INFO] Epoch:[0/2](667500/4588595) loss:3.066 lr:0.0000100 epoch_Time:24933.0min: [2024-01-05 16:24:10,850][model8_pretrain.py][INFO] Epoch:[0/2](667500/4588595) loss:2.683 lr:0.0000100 epoch_Time:24933.0min: [2024-01-05 16:24:10,850][model8_pretrain.py][INFO] Epoch:[0/2](667500/4588595) loss:3.366 lr:0.0000100 epoch_Time:24933.0min: [2024-01-05 16:24:10,850][model8_pretrain.py][INFO] Epoch:[0/2](667500/4588595) loss:2.393 lr:0.0000100 epoch_Time:24933.0min: [2024-01-05 16:24:10,850][model8_pretrain.py][INFO] Epoch:[0/2](667500/4588595) loss:2.451 lr:0.0000100 epoch_Time:24933.0min: [2024-01-05 16:24:47,786][model8_pretrain.py][INFO] Epoch:[0/2](667600/4588595) loss:2.627 lr:0.0000100 epoch_Time:24932.0min: [2024-01-05 16:24:47,786][model8_pretrain.py][INFO] Epoch:[0/2](667600/4588595) loss:3.149 lr:0.0000100 epoch_Time:24932.0min: [2024-01-05 16:24:47,786][model8_pretrain.py][INFO] Epoch:[0/2](667600/4588595) loss:2.685 lr:0.0000100 epoch_Time:24932.0min: [2024-01-05 16:24:47,786][model8_pretrain.py][INFO] Epoch:[0/2](667600/4588595) loss:3.156 lr:0.0000100 epoch_Time:24932.0min: [2024-01-05 16:24:47,786][model8_pretrain.py][INFO] Epoch:[0/2](667600/4588595) loss:2.188 lr:0.0000100 epoch_Time:24932.0min: [2024-01-05 16:24:47,786][model8_pretrain.py][INFO] Epoch:[0/2](667600/4588595) loss:3.198 lr:0.0000100 epoch_Time:24932.0min: [2024-01-05 16:24:47,786][model8_pretrain.py][INFO] Epoch:[0/2](667600/4588595) loss:2.015 lr:0.0000100 epoch_Time:24932.0min: [2024-01-05 16:24:47,787][model8_pretrain.py][INFO] Epoch:[0/2](667600/4588595) loss:2.348 lr:0.0000100 epoch_Time:24932.0min: [2024-01-05 16:25:37,010][model8_pretrain.py][INFO] Epoch:[0/2](667700/4588595) loss:3.031 lr:0.0000100 epoch_Time:24933.0min: [2024-01-05 16:25:37,010][model8_pretrain.py][INFO] Epoch:[0/2](667700/4588595) loss:2.978 lr:0.0000100 epoch_Time:24933.0min: [2024-01-05 
16:25:37,010][model8_pretrain.py][INFO] Epoch:[0/2](667700/4588595) loss:3.065 lr:0.0000100 epoch_Time:24933.0min: [2024-01-05 16:25:37,010][model8_pretrain.py][INFO] Epoch:[0/2](667700/4588595) loss:2.956 lr:0.0000100 epoch_Time:24933.0min: [2024-01-05 16:25:37,010][model8_pretrain.py][INFO] Epoch:[0/2](667700/4588595) loss:3.131 lr:0.0000100 epoch_Time:24933.0min: [2024-01-05 16:25:37,010][model8_pretrain.py][INFO] Epoch:[0/2](667700/4588595) loss:2.614 lr:0.0000100 epoch_Time:24933.0min: [2024-01-05 16:25:37,011][model8_pretrain.py][INFO] Epoch:[0/2](667700/4588595) loss:3.007 lr:0.0000100 epoch_Time:24933.0min: [2024-01-05 16:25:37,011][model8_pretrain.py][INFO] Epoch:[0/2](667700/4588595) loss:2.545 lr:0.0000100 epoch_Time:24933.0min: [2024-01-05 16:26:13,973][model8_pretrain.py][INFO] Epoch:[0/2](667800/4588595) loss:2.868 lr:0.0000100 epoch_Time:24932.0min: [2024-01-05 16:26:13,973][model8_pretrain.py][INFO] Epoch:[0/2](667800/4588595) loss:3.199 lr:0.0000100 epoch_Time:24932.0min: [2024-01-05 16:26:13,973][model8_pretrain.py][INFO] Epoch:[0/2](667800/4588595) loss:3.131 lr:0.0000100 epoch_Time:24932.0min: [2024-01-05 16:26:13,973][model8_pretrain.py][INFO] Epoch:[0/2](667800/4588595) loss:3.239 lr:0.0000100 epoch_Time:24932.0min: [2024-01-05 16:26:13,973][model8_pretrain.py][INFO] Epoch:[0/2](667800/4588595) loss:3.542 lr:0.0000100 epoch_Time:24932.0min: [2024-01-05 16:26:13,973][model8_pretrain.py][INFO] Epoch:[0/2](667800/4588595) loss:2.804 lr:0.0000100 epoch_Time:24932.0min: [2024-01-05 16:26:13,973][model8_pretrain.py][INFO] Epoch:[0/2](667800/4588595) loss:3.051 lr:0.0000100 epoch_Time:24932.0min: [2024-01-05 16:26:13,974][model8_pretrain.py][INFO] Epoch:[0/2](667800/4588595) loss:2.717 lr:0.0000100 epoch_Time:24932.0min: [2024-01-05 16:26:50,922][model8_pretrain.py][INFO] Epoch:[0/2](667900/4588595) loss:2.249 lr:0.0000100 epoch_Time:24931.0min: [2024-01-05 16:26:50,922][model8_pretrain.py][INFO] Epoch:[0/2](667900/4588595) loss:3.208 lr:0.0000100 
epoch_Time:24931.0min: [2024-01-05 16:26:50,922][model8_pretrain.py][INFO] Epoch:[0/2](667900/4588595) loss:2.699 lr:0.0000100 epoch_Time:24931.0min: [2024-01-05 16:26:50,922][model8_pretrain.py][INFO] Epoch:[0/2](667900/4588595) loss:2.587 lr:0.0000100 epoch_Time:24931.0min: [2024-01-05 16:26:50,922][model8_pretrain.py][INFO] Epoch:[0/2](667900/4588595) loss:2.459 lr:0.0000100 epoch_Time:24931.0min: [2024-01-05 16:26:50,922][model8_pretrain.py][INFO] Epoch:[0/2](667900/4588595) loss:2.675 lr:0.0000100 epoch_Time:24931.0min: [2024-01-05 16:26:50,922][model8_pretrain.py][INFO] Epoch:[0/2](667900/4588595) loss:2.508 lr:0.0000100 epoch_Time:24931.0min: [2024-01-05 16:26:50,922][model8_pretrain.py][INFO] Epoch:[0/2](667900/4588595) loss:2.488 lr:0.0000100 epoch_Time:24931.0min: [2024-01-05 16:27:27,881][model8_pretrain.py][INFO] Epoch:[0/2](668000/4588595) loss:2.685 lr:0.0000100 epoch_Time:24930.0min: [2024-01-05 16:27:27,881][model8_pretrain.py][INFO] Epoch:[0/2](668000/4588595) loss:2.695 lr:0.0000100 epoch_Time:24930.0min: [2024-01-05 16:27:27,881][model8_pretrain.py][INFO] Epoch:[0/2](668000/4588595) loss:2.925 lr:0.0000100 epoch_Time:24930.0min: [2024-01-05 16:27:27,881][model8_pretrain.py][INFO] Epoch:[0/2](668000/4588595) loss:2.989 lr:0.0000100 epoch_Time:24930.0min: [2024-01-05 16:27:27,881][model8_pretrain.py][INFO] Epoch:[0/2](668000/4588595) loss:3.116 lr:0.0000100 epoch_Time:24930.0min: [2024-01-05 16:27:27,882][model8_pretrain.py][INFO] Epoch:[0/2](668000/4588595) loss:3.605 lr:0.0000100 epoch_Time:24930.0min: [2024-01-05 16:27:27,882][model8_pretrain.py][INFO] Epoch:[0/2](668000/4588595) loss:2.918 lr:0.0000100 epoch_Time:24930.0min: [2024-01-05 16:27:27,882][model8_pretrain.py][INFO] Epoch:[0/2](668000/4588595) loss:3.019 lr:0.0000100 epoch_Time:24930.0min: [2024-01-05 16:28:04,837][model8_pretrain.py][INFO] Epoch:[0/2](668100/4588595) loss:2.532 lr:0.0000100 epoch_Time:24929.0min: [2024-01-05 16:28:04,837][model8_pretrain.py][INFO] 
Epoch:[0/2](668100/4588595) loss:2.692 lr:0.0000100 epoch_Time:24929.0min: [2024-01-05 16:28:04,837][model8_pretrain.py][INFO] Epoch:[0/2](668100/4588595) loss:2.888 lr:0.0000100 epoch_Time:24929.0min: [2024-01-05 16:28:04,837][model8_pretrain.py][INFO] Epoch:[0/2](668100/4588595) loss:1.656 lr:0.0000100 epoch_Time:24929.0min: [2024-01-05 16:28:04,837][model8_pretrain.py][INFO] Epoch:[0/2](668100/4588595) loss:2.945 lr:0.0000100 epoch_Time:24929.0min: [2024-01-05 16:28:04,837][model8_pretrain.py][INFO] Epoch:[0/2](668100/4588595) loss:2.953 lr:0.0000100 epoch_Time:24929.0min: [2024-01-05 16:28:04,837][model8_pretrain.py][INFO] Epoch:[0/2](668100/4588595) loss:2.903 lr:0.0000100 epoch_Time:24929.0min: [2024-01-05 16:28:04,837][model8_pretrain.py][INFO] Epoch:[0/2](668100/4588595) loss:3.058 lr:0.0000100 epoch_Time:24929.0min: [2024-01-05 16:28:41,799][model8_pretrain.py][INFO] Epoch:[0/2](668200/4588595) loss:2.777 lr:0.0000100 epoch_Time:24929.0min: [2024-01-05 16:28:41,799][model8_pretrain.py][INFO] Epoch:[0/2](668200/4588595) loss:2.674 lr:0.0000100 epoch_Time:24929.0min: [2024-01-05 16:28:41,799][model8_pretrain.py][INFO] Epoch:[0/2](668200/4588595) loss:2.252 lr:0.0000100 epoch_Time:24929.0min: [2024-01-05 16:28:41,799][model8_pretrain.py][INFO] Epoch:[0/2](668200/4588595) loss:2.649 lr:0.0000100 epoch_Time:24929.0min: [2024-01-05 16:28:41,799][model8_pretrain.py][INFO] Epoch:[0/2](668200/4588595) loss:3.091 lr:0.0000100 epoch_Time:24929.0min: [2024-01-05 16:28:41,799][model8_pretrain.py][INFO] Epoch:[0/2](668200/4588595) loss:2.054 lr:0.0000100 epoch_Time:24929.0min: [2024-01-05 16:28:41,799][model8_pretrain.py][INFO] Epoch:[0/2](668200/4588595) loss:2.924 lr:0.0000100 epoch_Time:24929.0min: [2024-01-05 16:28:41,799][model8_pretrain.py][INFO] Epoch:[0/2](668200/4588595) loss:3.170 lr:0.0000100 epoch_Time:24929.0min: [2024-01-05 16:29:18,782][model8_pretrain.py][INFO] Epoch:[0/2](668300/4588595) loss:2.894 lr:0.0000100 epoch_Time:24928.0min: [2024-01-05 
16:29:18,782][model8_pretrain.py][INFO] Epoch:[0/2](668300/4588595) loss:3.014 lr:0.0000100 epoch_Time:24928.0min: [2024-01-05 16:29:18,782][model8_pretrain.py][INFO] Epoch:[0/2](668300/4588595) loss:2.608 lr:0.0000100 epoch_Time:24928.0min: [2024-01-05 16:29:18,782][model8_pretrain.py][INFO] Epoch:[0/2](668300/4588595) loss:2.561 lr:0.0000100 epoch_Time:24928.0min: [2024-01-05 16:29:18,782][model8_pretrain.py][INFO] Epoch:[0/2](668300/4588595) loss:3.283 lr:0.0000100 epoch_Time:24928.0min: [2024-01-05 16:29:18,783][model8_pretrain.py][INFO] Epoch:[0/2](668300/4588595) loss:2.875 lr:0.0000100 epoch_Time:24928.0min: [2024-01-05 16:29:18,783][model8_pretrain.py][INFO] Epoch:[0/2](668300/4588595) loss:3.131 lr:0.0000100 epoch_Time:24928.0min: [2024-01-05 16:29:18,783][model8_pretrain.py][INFO] Epoch:[0/2](668300/4588595) loss:2.980 lr:0.0000100 epoch_Time:24928.0min: [2024-01-05 16:29:55,753][model8_pretrain.py][INFO] Epoch:[0/2](668400/4588595) loss:3.019 lr:0.0000100 epoch_Time:24927.0min: [2024-01-05 16:29:55,753][model8_pretrain.py][INFO] Epoch:[0/2](668400/4588595) loss:2.795 lr:0.0000100 epoch_Time:24927.0min: [2024-01-05 16:29:55,753][model8_pretrain.py][INFO] Epoch:[0/2](668400/4588595) loss:2.779 lr:0.0000100 epoch_Time:24927.0min: [2024-01-05 16:29:55,753][model8_pretrain.py][INFO] Epoch:[0/2](668400/4588595) loss:2.908 lr:0.0000100 epoch_Time:24927.0min: [2024-01-05 16:29:55,753][model8_pretrain.py][INFO] Epoch:[0/2](668400/4588595) loss:3.189 lr:0.0000100 epoch_Time:24927.0min: [2024-01-05 16:29:55,753][model8_pretrain.py][INFO] Epoch:[0/2](668400/4588595) loss:2.570 lr:0.0000100 epoch_Time:24927.0min: [2024-01-05 16:29:55,753][model8_pretrain.py][INFO] Epoch:[0/2](668400/4588595) loss:3.151 lr:0.0000100 epoch_Time:24927.0min: [2024-01-05 16:29:55,753][model8_pretrain.py][INFO] Epoch:[0/2](668400/4588595) loss:2.650 lr:0.0000100 epoch_Time:24927.0min: [2024-01-05 16:30:44,633][model8_pretrain.py][INFO] Epoch:[0/2](668500/4588595) loss:3.246 lr:0.0000100 
epoch_Time:24928.0min: [2024-01-05 16:30:44,633][model8_pretrain.py][INFO] Epoch:[0/2](668500/4588595) loss:2.913 lr:0.0000100 epoch_Time:24928.0min: [2024-01-05 16:30:44,633][model8_pretrain.py][INFO] Epoch:[0/2](668500/4588595) loss:2.994 lr:0.0000100 epoch_Time:24928.0min: [2024-01-05 16:30:44,633][model8_pretrain.py][INFO] Epoch:[0/2](668500/4588595) loss:2.467 lr:0.0000100 epoch_Time:24928.0min: [2024-01-05 16:30:44,633][model8_pretrain.py][INFO] Epoch:[0/2](668500/4588595) loss:2.897 lr:0.0000100 epoch_Time:24928.0min: [2024-01-05 16:30:44,633][model8_pretrain.py][INFO] Epoch:[0/2](668500/4588595) loss:1.970 lr:0.0000100 epoch_Time:24928.0min: [2024-01-05 16:30:44,633][model8_pretrain.py][INFO] Epoch:[0/2](668500/4588595) loss:2.693 lr:0.0000100 epoch_Time:24928.0min: [2024-01-05 16:30:44,633][model8_pretrain.py][INFO] Epoch:[0/2](668500/4588595) loss:3.202 lr:0.0000100 epoch_Time:24928.0min: [2024-01-05 16:31:21,570][model8_pretrain.py][INFO] Epoch:[0/2](668600/4588595) loss:2.743 lr:0.0000100 epoch_Time:24927.0min: [2024-01-05 16:31:21,570][model8_pretrain.py][INFO] Epoch:[0/2](668600/4588595) loss:2.694 lr:0.0000100 epoch_Time:24927.0min: [2024-01-05 16:31:21,570][model8_pretrain.py][INFO] Epoch:[0/2](668600/4588595) loss:2.365 lr:0.0000100 epoch_Time:24927.0min: [2024-01-05 16:31:21,570][model8_pretrain.py][INFO] Epoch:[0/2](668600/4588595) loss:2.703 lr:0.0000100 epoch_Time:24927.0min: [2024-01-05 16:31:21,570][model8_pretrain.py][INFO] Epoch:[0/2](668600/4588595) loss:3.161 lr:0.0000100 epoch_Time:24927.0min: [2024-01-05 16:31:21,570][model8_pretrain.py][INFO] Epoch:[0/2](668600/4588595) loss:2.714 lr:0.0000100 epoch_Time:24927.0min: [2024-01-05 16:31:21,570][model8_pretrain.py][INFO] Epoch:[0/2](668600/4588595) loss:2.963 lr:0.0000100 epoch_Time:24927.0min: [2024-01-05 16:31:21,570][model8_pretrain.py][INFO] Epoch:[0/2](668600/4588595) loss:2.752 lr:0.0000100 epoch_Time:24927.0min: [2024-01-05 16:31:58,536][model8_pretrain.py][INFO] 
Epoch:[0/2](668700/4588595) loss:2.861 lr:0.0000100 epoch_Time:24926.0min: [2024-01-05 16:31:58,536][model8_pretrain.py][INFO] Epoch:[0/2](668700/4588595) loss:3.061 lr:0.0000100 epoch_Time:24926.0min: [2024-01-05 16:31:58,536][model8_pretrain.py][INFO] Epoch:[0/2](668700/4588595) loss:2.915 lr:0.0000100 epoch_Time:24926.0min: [2024-01-05 16:31:58,536][model8_pretrain.py][INFO] Epoch:[0/2](668700/4588595) loss:2.766 lr:0.0000100 epoch_Time:24926.0min: [2024-01-05 16:31:58,536][model8_pretrain.py][INFO] Epoch:[0/2](668700/4588595) loss:2.147 lr:0.0000100 epoch_Time:24926.0min: [2024-01-05 16:31:58,536][model8_pretrain.py][INFO] Epoch:[0/2](668700/4588595) loss:2.426 lr:0.0000100 epoch_Time:24926.0min: [2024-01-05 16:31:58,536][model8_pretrain.py][INFO] Epoch:[0/2](668700/4588595) loss:2.739 lr:0.0000100 epoch_Time:24926.0min: [2024-01-05 16:31:58,536][model8_pretrain.py][INFO] Epoch:[0/2](668700/4588595) loss:2.719 lr:0.0000100 epoch_Time:24926.0min: [2024-01-05 16:32:35,485][model8_pretrain.py][INFO] Epoch:[0/2](668800/4588595) loss:2.800 lr:0.0000100 epoch_Time:24926.0min: [2024-01-05 16:32:35,485][model8_pretrain.py][INFO] Epoch:[0/2](668800/4588595) loss:2.821 lr:0.0000100 epoch_Time:24926.0min: [2024-01-05 16:32:35,486][model8_pretrain.py][INFO] Epoch:[0/2](668800/4588595) loss:2.652 lr:0.0000100 epoch_Time:24926.0min: [2024-01-05 16:32:35,486][model8_pretrain.py][INFO] Epoch:[0/2](668800/4588595) loss:2.869 lr:0.0000100 epoch_Time:24926.0min: [2024-01-05 16:32:35,486][model8_pretrain.py][INFO] Epoch:[0/2](668800/4588595) loss:2.848 lr:0.0000100 epoch_Time:24926.0min: [2024-01-05 16:32:35,486][model8_pretrain.py][INFO] Epoch:[0/2](668800/4588595) loss:3.038 lr:0.0000100 epoch_Time:24926.0min: [2024-01-05 16:32:35,486][model8_pretrain.py][INFO] Epoch:[0/2](668800/4588595) loss:2.994 lr:0.0000100 epoch_Time:24926.0min: [2024-01-05 16:32:35,486][model8_pretrain.py][INFO] Epoch:[0/2](668800/4588595) loss:2.409 lr:0.0000100 epoch_Time:24926.0min: [2024-01-05 
16:33:12,435][model8_pretrain.py][INFO] Epoch:[0/2](668900/4588595) loss:3.139 lr:0.0000100 epoch_Time:24924.0min: [2024-01-05 16:33:12,435][model8_pretrain.py][INFO] Epoch:[0/2](668900/4588595) loss:2.972 lr:0.0000100 epoch_Time:24925.0min: [2024-01-05 16:33:12,435][model8_pretrain.py][INFO] Epoch:[0/2](668900/4588595) loss:3.061 lr:0.0000100 epoch_Time:24925.0min: [2024-01-05 16:33:12,435][model8_pretrain.py][INFO] Epoch:[0/2](668900/4588595) loss:2.512 lr:0.0000100 epoch_Time:24924.0min: [2024-01-05 16:33:12,435][model8_pretrain.py][INFO] Epoch:[0/2](668900/4588595) loss:2.354 lr:0.0000100 epoch_Time:24925.0min: [2024-01-05 16:33:12,435][model8_pretrain.py][INFO] Epoch:[0/2](668900/4588595) loss:2.488 lr:0.0000100 epoch_Time:24925.0min: [2024-01-05 16:33:12,435][model8_pretrain.py][INFO] Epoch:[0/2](668900/4588595) loss:2.941 lr:0.0000100 epoch_Time:24925.0min: [2024-01-05 16:33:12,435][model8_pretrain.py][INFO] Epoch:[0/2](668900/4588595) loss:2.549 lr:0.0000100 epoch_Time:24925.0min: [2024-01-05 16:33:49,395][model8_pretrain.py][INFO] Epoch:[0/2](669000/4588595) loss:2.514 lr:0.0000100 epoch_Time:24923.0min: [2024-01-05 16:33:49,395][model8_pretrain.py][INFO] Epoch:[0/2](669000/4588595) loss:2.886 lr:0.0000100 epoch_Time:24923.0min: [2024-01-05 16:33:49,395][model8_pretrain.py][INFO] Epoch:[0/2](669000/4588595) loss:2.827 lr:0.0000100 epoch_Time:24923.0min: [2024-01-05 16:33:49,395][model8_pretrain.py][INFO] Epoch:[0/2](669000/4588595) loss:2.477 lr:0.0000100 epoch_Time:24923.0min: [2024-01-05 16:33:49,395][model8_pretrain.py][INFO] Epoch:[0/2](669000/4588595) loss:2.967 lr:0.0000100 epoch_Time:24923.0min: [2024-01-05 16:33:49,395][model8_pretrain.py][INFO] Epoch:[0/2](669000/4588595) loss:2.425 lr:0.0000100 epoch_Time:24923.0min: [2024-01-05 16:33:49,395][model8_pretrain.py][INFO] Epoch:[0/2](669000/4588595) loss:2.932 lr:0.0000100 epoch_Time:24923.0min: [2024-01-05 16:33:49,396][model8_pretrain.py][INFO] Epoch:[0/2](669000/4588595) loss:2.921 lr:0.0000100 
epoch_Time:24923.0min: [2024-01-05 16:34:26,351][model8_pretrain.py][INFO] Epoch:[0/2](669100/4588595) loss:2.719 lr:0.0000100 epoch_Time:24923.0min: [2024-01-05 16:34:26,351][model8_pretrain.py][INFO] Epoch:[0/2](669100/4588595) loss:3.104 lr:0.0000100 epoch_Time:24923.0min: [2024-01-05 16:34:26,351][model8_pretrain.py][INFO] Epoch:[0/2](669100/4588595) loss:2.827 lr:0.0000100 epoch_Time:24923.0min: [2024-01-05 16:34:26,351][model8_pretrain.py][INFO] Epoch:[0/2](669100/4588595) loss:3.116 lr:0.0000100 epoch_Time:24923.0min: [2024-01-05 16:34:26,351][model8_pretrain.py][INFO] Epoch:[0/2](669100/4588595) loss:2.888 lr:0.0000100 epoch_Time:24923.0min: [2024-01-05 16:34:26,351][model8_pretrain.py][INFO] Epoch:[0/2](669100/4588595) loss:3.081 lr:0.0000100 epoch_Time:24923.0min: [2024-01-05 16:34:26,352][model8_pretrain.py][INFO] Epoch:[0/2](669100/4588595) loss:2.553 lr:0.0000100 epoch_Time:24923.0min: [2024-01-05 16:34:26,352][model8_pretrain.py][INFO] Epoch:[0/2](669100/4588595) loss:3.194 lr:0.0000100 epoch_Time:24923.0min: [2024-01-05 16:35:03,305][model8_pretrain.py][INFO] Epoch:[0/2](669200/4588595) loss:2.659 lr:0.0000100 epoch_Time:24922.0min: [2024-01-05 16:35:03,305][model8_pretrain.py][INFO] Epoch:[0/2](669200/4588595) loss:2.926 lr:0.0000100 epoch_Time:24922.0min: [2024-01-05 16:35:03,305][model8_pretrain.py][INFO] Epoch:[0/2](669200/4588595) loss:2.804 lr:0.0000100 epoch_Time:24922.0min: [2024-01-05 16:35:03,305][model8_pretrain.py][INFO] Epoch:[0/2](669200/4588595) loss:2.770 lr:0.0000100 epoch_Time:24922.0min: [2024-01-05 16:35:03,305][model8_pretrain.py][INFO] Epoch:[0/2](669200/4588595) loss:2.539 lr:0.0000100 epoch_Time:24922.0min: [2024-01-05 16:35:03,305][model8_pretrain.py][INFO] Epoch:[0/2](669200/4588595) loss:2.880 lr:0.0000100 epoch_Time:24922.0min: [2024-01-05 16:35:03,305][model8_pretrain.py][INFO] Epoch:[0/2](669200/4588595) loss:2.797 lr:0.0000100 epoch_Time:24922.0min: [2024-01-05 16:35:03,306][model8_pretrain.py][INFO] 
Epoch:[0/2](669200/4588595) loss:2.797 lr:0.0000100 epoch_Time:24922.0min: [2024-01-05 16:35:52,193][model8_pretrain.py][INFO] Epoch:[0/2](669300/4588595) loss:2.544 lr:0.0000100 epoch_Time:24922.0min: [2024-01-05 16:35:52,193][model8_pretrain.py][INFO] Epoch:[0/2](669300/4588595) loss:2.659 lr:0.0000100 epoch_Time:24922.0min: [2024-01-05 16:35:52,193][model8_pretrain.py][INFO] Epoch:[0/2](669300/4588595) loss:2.054 lr:0.0000100 epoch_Time:24922.0min: [2024-01-05 16:35:52,193][model8_pretrain.py][INFO] Epoch:[0/2](669300/4588595) loss:2.823 lr:0.0000100 epoch_Time:24922.0min: [2024-01-05 16:35:52,193][model8_pretrain.py][INFO] Epoch:[0/2](669300/4588595) loss:3.002 lr:0.0000100 epoch_Time:24922.0min: [2024-01-05 16:35:52,193][model8_pretrain.py][INFO] Epoch:[0/2](669300/4588595) loss:3.489 lr:0.0000100 epoch_Time:24922.0min: [2024-01-05 16:35:52,193][model8_pretrain.py][INFO] Epoch:[0/2](669300/4588595) loss:3.225 lr:0.0000100 epoch_Time:24922.0min: [2024-01-05 16:35:52,193][model8_pretrain.py][INFO] Epoch:[0/2](669300/4588595) loss:2.855 lr:0.0000100 epoch_Time:24922.0min: [2024-01-05 16:36:29,132][model8_pretrain.py][INFO] Epoch:[0/2](669400/4588595) loss:2.467 lr:0.0000100 epoch_Time:24922.0min: [2024-01-05 16:36:29,133][model8_pretrain.py][INFO] Epoch:[0/2](669400/4588595) loss:2.965 lr:0.0000100 epoch_Time:24922.0min: [2024-01-05 16:36:29,133][model8_pretrain.py][INFO] Epoch:[0/2](669400/4588595) loss:2.500 lr:0.0000100 epoch_Time:24922.0min: [2024-01-05 16:36:29,133][model8_pretrain.py][INFO] Epoch:[0/2](669400/4588595) loss:2.832 lr:0.0000100 epoch_Time:24922.0min: [2024-01-05 16:36:29,133][model8_pretrain.py][INFO] Epoch:[0/2](669400/4588595) loss:2.973 lr:0.0000100 epoch_Time:24922.0min: [2024-01-05 16:36:29,133][model8_pretrain.py][INFO] Epoch:[0/2](669400/4588595) loss:2.872 lr:0.0000100 epoch_Time:24922.0min: [2024-01-05 16:36:29,133][model8_pretrain.py][INFO] Epoch:[0/2](669400/4588595) loss:2.970 lr:0.0000100 epoch_Time:24922.0min: [2024-01-05 
16:36:29,133][model8_pretrain.py][INFO] Epoch:[0/2](669400/4588595) loss:2.507 lr:0.0000100 epoch_Time:24922.0min: [2024-01-05 16:37:06,075][model8_pretrain.py][INFO] Epoch:[0/2](669500/4588595) loss:3.007 lr:0.0000100 epoch_Time:24921.0min: [2024-01-05 16:37:06,075][model8_pretrain.py][INFO] Epoch:[0/2](669500/4588595) loss:2.560 lr:0.0000100 epoch_Time:24921.0min: [2024-01-05 16:37:06,075][model8_pretrain.py][INFO] Epoch:[0/2](669500/4588595) loss:3.060 lr:0.0000100 epoch_Time:24921.0min: [2024-01-05 16:37:06,075][model8_pretrain.py][INFO] Epoch:[0/2](669500/4588595) loss:2.864 lr:0.0000100 epoch_Time:24921.0min: [2024-01-05 16:37:06,075][model8_pretrain.py][INFO] Epoch:[0/2](669500/4588595) loss:3.051 lr:0.0000100 epoch_Time:24921.0min: [2024-01-05 16:37:06,075][model8_pretrain.py][INFO] Epoch:[0/2](669500/4588595) loss:2.892 lr:0.0000100 epoch_Time:24921.0min: [2024-01-05 16:37:06,075][model8_pretrain.py][INFO] Epoch:[0/2](669500/4588595) loss:2.679 lr:0.0000100 epoch_Time:24921.0min: [2024-01-05 16:37:06,075][model8_pretrain.py][INFO] Epoch:[0/2](669500/4588595) loss:2.886 lr:0.0000100 epoch_Time:24921.0min: [2024-01-05 16:37:43,018][model8_pretrain.py][INFO] Epoch:[0/2](669600/4588595) loss:2.030 lr:0.0000100 epoch_Time:24921.0min: [2024-01-05 16:37:43,018][model8_pretrain.py][INFO] Epoch:[0/2](669600/4588595) loss:3.024 lr:0.0000100 epoch_Time:24921.0min: [2024-01-05 16:37:43,018][model8_pretrain.py][INFO] Epoch:[0/2](669600/4588595) loss:3.377 lr:0.0000100 epoch_Time:24921.0min: [2024-01-05 16:37:43,018][model8_pretrain.py][INFO] Epoch:[0/2](669600/4588595) loss:3.073 lr:0.0000100 epoch_Time:24921.0min: [2024-01-05 16:37:43,018][model8_pretrain.py][INFO] Epoch:[0/2](669600/4588595) loss:2.656 lr:0.0000100 epoch_Time:24921.0min: [2024-01-05 16:37:43,018][model8_pretrain.py][INFO] Epoch:[0/2](669600/4588595) loss:2.491 lr:0.0000100 epoch_Time:24921.0min: [2024-01-05 16:37:43,018][model8_pretrain.py][INFO] Epoch:[0/2](669600/4588595) loss:2.850 lr:0.0000100 
epoch_Time:24921.0min: [2024-01-05 16:37:43,018][model8_pretrain.py][INFO] Epoch:[0/2](669600/4588595) loss:2.872 lr:0.0000100 epoch_Time:24921.0min: [2024-01-05 16:38:19,968][model8_pretrain.py][INFO] Epoch:[0/2](669700/4588595) loss:3.519 lr:0.0000100 epoch_Time:24920.0min: [2024-01-05 16:38:19,968][model8_pretrain.py][INFO] Epoch:[0/2](669700/4588595) loss:2.711 lr:0.0000100 epoch_Time:24920.0min: [2024-01-05 16:38:19,968][model8_pretrain.py][INFO] Epoch:[0/2](669700/4588595) loss:3.364 lr:0.0000100 epoch_Time:24920.0min: [2024-01-05 16:38:19,968][model8_pretrain.py][INFO] Epoch:[0/2](669700/4588595) loss:2.481 lr:0.0000100 epoch_Time:24920.0min: [2024-01-05 16:38:19,968][model8_pretrain.py][INFO] Epoch:[0/2](669700/4588595) loss:2.755 lr:0.0000100 epoch_Time:24920.0min: [2024-01-05 16:38:19,968][model8_pretrain.py][INFO] Epoch:[0/2](669700/4588595) loss:3.294 lr:0.0000100 epoch_Time:24920.0min: [2024-01-05 16:38:19,968][model8_pretrain.py][INFO] Epoch:[0/2](669700/4588595) loss:3.017 lr:0.0000100 epoch_Time:24920.0min: [2024-01-05 16:38:19,968][model8_pretrain.py][INFO] Epoch:[0/2](669700/4588595) loss:2.744 lr:0.0000100 epoch_Time:24920.0min: [2024-01-05 16:38:56,849][model8_pretrain.py][INFO] Epoch:[0/2](669800/4588595) loss:2.712 lr:0.0000100 epoch_Time:24919.0min: [2024-01-05 16:38:56,849][model8_pretrain.py][INFO] Epoch:[0/2](669800/4588595) loss:2.544 lr:0.0000100 epoch_Time:24919.0min: [2024-01-05 16:38:56,849][model8_pretrain.py][INFO] Epoch:[0/2](669800/4588595) loss:2.699 lr:0.0000100 epoch_Time:24919.0min: [2024-01-05 16:38:56,849][model8_pretrain.py][INFO] Epoch:[0/2](669800/4588595) loss:2.406 lr:0.0000100 epoch_Time:24919.0min: [2024-01-05 16:38:56,849][model8_pretrain.py][INFO] Epoch:[0/2](669800/4588595) loss:2.822 lr:0.0000100 epoch_Time:24919.0min: [2024-01-05 16:38:56,849][model8_pretrain.py][INFO] Epoch:[0/2](669800/4588595) loss:2.495 lr:0.0000100 epoch_Time:24919.0min: [2024-01-05 16:38:56,849][model8_pretrain.py][INFO] 
Epoch:[0/2](669800/4588595) loss:2.565 lr:0.0000100 epoch_Time:24919.0min: [2024-01-05 16:38:56,849][model8_pretrain.py][INFO] Epoch:[0/2](669800/4588595) loss:2.783 lr:0.0000100 epoch_Time:24919.0min: [2024-01-05 16:39:33,785][model8_pretrain.py][INFO] Epoch:[0/2](669900/4588595) loss:2.875 lr:0.0000100 epoch_Time:24918.0min: [2024-01-05 16:39:33,785][model8_pretrain.py][INFO] Epoch:[0/2](669900/4588595) loss:2.685 lr:0.0000100 epoch_Time:24918.0min: [2024-01-05 16:39:33,785][model8_pretrain.py][INFO] Epoch:[0/2](669900/4588595) loss:2.906 lr:0.0000100 epoch_Time:24918.0min: [2024-01-05 16:39:33,785][model8_pretrain.py][INFO] Epoch:[0/2](669900/4588595) loss:2.965 lr:0.0000100 epoch_Time:24918.0min: [2024-01-05 16:39:33,785][model8_pretrain.py][INFO] Epoch:[0/2](669900/4588595) loss:2.591 lr:0.0000100 epoch_Time:24918.0min: [2024-01-05 16:39:33,785][model8_pretrain.py][INFO] Epoch:[0/2](669900/4588595) loss:2.790 lr:0.0000100 epoch_Time:24918.0min: [2024-01-05 16:39:33,785][model8_pretrain.py][INFO] Epoch:[0/2](669900/4588595) loss:2.329 lr:0.0000100 epoch_Time:24918.0min: [2024-01-05 16:39:33,785][model8_pretrain.py][INFO] Epoch:[0/2](669900/4588595) loss:2.786 lr:0.0000100 epoch_Time:24918.0min: [2024-01-05 16:40:10,714][model8_pretrain.py][INFO] Epoch:[0/2](670000/4588595) loss:2.500 lr:0.0000100 epoch_Time:24917.0min: [2024-01-05 16:40:10,714][model8_pretrain.py][INFO] Epoch:[0/2](670000/4588595) loss:2.887 lr:0.0000100 epoch_Time:24917.0min: [2024-01-05 16:40:10,714][model8_pretrain.py][INFO] Epoch:[0/2](670000/4588595) loss:2.593 lr:0.0000100 epoch_Time:24917.0min: [2024-01-05 16:40:10,714][model8_pretrain.py][INFO] Epoch:[0/2](670000/4588595) loss:3.252 lr:0.0000100 epoch_Time:24917.0min: [2024-01-05 16:40:10,714][model8_pretrain.py][INFO] Epoch:[0/2](670000/4588595) loss:2.597 lr:0.0000100 epoch_Time:24917.0min: [2024-01-05 16:40:10,714][model8_pretrain.py][INFO] Epoch:[0/2](670000/4588595) loss:2.914 lr:0.0000100 epoch_Time:24917.0min: [2024-01-05 
16:40:10,714][model8_pretrain.py][INFO] Epoch:[0/2](670000/4588595) loss:2.342 lr:0.0000100 epoch_Time:24917.0min: [2024-01-05 16:40:10,714][model8_pretrain.py][INFO] Epoch:[0/2](670000/4588595) loss:2.486 lr:0.0000100 epoch_Time:24917.0min: [2024-01-05 16:40:59,764][model8_pretrain.py][INFO] Epoch:[0/2](670100/4588595) loss:2.638 lr:0.0000100 epoch_Time:24918.0min: [2024-01-05 16:40:59,764][model8_pretrain.py][INFO] Epoch:[0/2](670100/4588595) loss:2.971 lr:0.0000100 epoch_Time:24918.0min: [2024-01-05 16:40:59,764][model8_pretrain.py][INFO] Epoch:[0/2](670100/4588595) loss:2.649 lr:0.0000100 epoch_Time:24918.0min: [2024-01-05 16:40:59,764][model8_pretrain.py][INFO] Epoch:[0/2](670100/4588595) loss:2.877 lr:0.0000100 epoch_Time:24918.0min: [2024-01-05 16:40:59,764][model8_pretrain.py][INFO] Epoch:[0/2](670100/4588595) loss:3.409 lr:0.0000100 epoch_Time:24918.0min: [2024-01-05 16:40:59,764][model8_pretrain.py][INFO] Epoch:[0/2](670100/4588595) loss:2.819 lr:0.0000100 epoch_Time:24918.0min: [2024-01-05 16:40:59,764][model8_pretrain.py][INFO] Epoch:[0/2](670100/4588595) loss:3.005 lr:0.0000100 epoch_Time:24918.0min: [2024-01-05 16:40:59,764][model8_pretrain.py][INFO] Epoch:[0/2](670100/4588595) loss:2.939 lr:0.0000100 epoch_Time:24918.0min: [2024-01-05 16:41:36,675][model8_pretrain.py][INFO] Epoch:[0/2](670200/4588595) loss:2.919 lr:0.0000100 epoch_Time:24917.0min: [2024-01-05 16:41:36,675][model8_pretrain.py][INFO] Epoch:[0/2](670200/4588595) loss:3.263 lr:0.0000100 epoch_Time:24917.0min: [2024-01-05 16:41:36,675][model8_pretrain.py][INFO] Epoch:[0/2](670200/4588595) loss:2.694 lr:0.0000100 epoch_Time:24917.0min: [2024-01-05 16:41:36,675][model8_pretrain.py][INFO] Epoch:[0/2](670200/4588595) loss:3.061 lr:0.0000100 epoch_Time:24917.0min: [2024-01-05 16:41:36,675][model8_pretrain.py][INFO] Epoch:[0/2](670200/4588595) loss:2.392 lr:0.0000100 epoch_Time:24917.0min: [2024-01-05 16:41:36,675][model8_pretrain.py][INFO] Epoch:[0/2](670200/4588595) loss:2.441 lr:0.0000100 
epoch_Time:24917.0min: [2024-01-05 16:41:36,675][model8_pretrain.py][INFO] Epoch:[0/2](670200/4588595) loss:2.952 lr:0.0000100 epoch_Time:24917.0min: [2024-01-05 16:41:36,675][model8_pretrain.py][INFO] Epoch:[0/2](670200/4588595) loss:3.251 lr:0.0000100 epoch_Time:24917.0min: [2024-01-05 16:42:13,579][model8_pretrain.py][INFO] Epoch:[0/2](670300/4588595) loss:2.874 lr:0.0000100 epoch_Time:24916.0min: [2024-01-05 16:42:13,579][model8_pretrain.py][INFO] Epoch:[0/2](670300/4588595) loss:2.894 lr:0.0000100 epoch_Time:24916.0min: [2024-01-05 16:42:13,579][model8_pretrain.py][INFO] Epoch:[0/2](670300/4588595) loss:2.502 lr:0.0000100 epoch_Time:24916.0min: [2024-01-05 16:42:13,579][model8_pretrain.py][INFO] Epoch:[0/2](670300/4588595) loss:3.008 lr:0.0000100 epoch_Time:24916.0min: [2024-01-05 16:42:13,579][model8_pretrain.py][INFO] Epoch:[0/2](670300/4588595) loss:3.180 lr:0.0000100 epoch_Time:24916.0min: [2024-01-05 16:42:13,579][model8_pretrain.py][INFO] Epoch:[0/2](670300/4588595) loss:2.677 lr:0.0000100 epoch_Time:24916.0min: [2024-01-05 16:42:13,580][model8_pretrain.py][INFO] Epoch:[0/2](670300/4588595) loss:2.721 lr:0.0000100 epoch_Time:24916.0min: [2024-01-05 16:42:13,580][model8_pretrain.py][INFO] Epoch:[0/2](670300/4588595) loss:2.834 lr:0.0000100 epoch_Time:24916.0min: [2024-01-05 16:42:50,504][model8_pretrain.py][INFO] Epoch:[0/2](670400/4588595) loss:3.034 lr:0.0000100 epoch_Time:24915.0min: [2024-01-05 16:42:50,504][model8_pretrain.py][INFO] Epoch:[0/2](670400/4588595) loss:2.552 lr:0.0000100 epoch_Time:24915.0min: [2024-01-05 16:42:50,504][model8_pretrain.py][INFO] Epoch:[0/2](670400/4588595) loss:2.549 lr:0.0000100 epoch_Time:24915.0min: [2024-01-05 16:42:50,504][model8_pretrain.py][INFO] Epoch:[0/2](670400/4588595) loss:2.323 lr:0.0000100 epoch_Time:24915.0min: [2024-01-05 16:42:50,504][model8_pretrain.py][INFO] Epoch:[0/2](670400/4588595) loss:3.200 lr:0.0000100 epoch_Time:24915.0min: [2024-01-05 16:42:50,504][model8_pretrain.py][INFO] 
Epoch:[0/2](670400/4588595) loss:3.114 lr:0.0000100 epoch_Time:24915.0min: [2024-01-05 16:42:50,504][model8_pretrain.py][INFO] Epoch:[0/2](670400/4588595) loss:2.134 lr:0.0000100 epoch_Time:24915.0min: [2024-01-05 16:42:50,504][model8_pretrain.py][INFO] Epoch:[0/2](670400/4588595) loss:3.147 lr:0.0000100 epoch_Time:24915.0min: [2024-01-05 16:43:27,432][model8_pretrain.py][INFO] Epoch:[0/2](670500/4588595) loss:3.021 lr:0.0000100 epoch_Time:24915.0min: [2024-01-05 16:43:27,432][model8_pretrain.py][INFO] Epoch:[0/2](670500/4588595) loss:1.910 lr:0.0000100 epoch_Time:24915.0min: [2024-01-05 16:43:27,432][model8_pretrain.py][INFO] Epoch:[0/2](670500/4588595) loss:2.990 lr:0.0000100 epoch_Time:24915.0min: [2024-01-05 16:43:27,432][model8_pretrain.py][INFO] Epoch:[0/2](670500/4588595) loss:3.004 lr:0.0000100 epoch_Time:24915.0min: [2024-01-05 16:43:27,432][model8_pretrain.py][INFO] Epoch:[0/2](670500/4588595) loss:3.006 lr:0.0000100 epoch_Time:24915.0min: [2024-01-05 16:43:27,432][model8_pretrain.py][INFO] Epoch:[0/2](670500/4588595) loss:2.626 lr:0.0000100 epoch_Time:24915.0min: [2024-01-05 16:43:27,432][model8_pretrain.py][INFO] Epoch:[0/2](670500/4588595) loss:2.627 lr:0.0000100 epoch_Time:24915.0min: [2024-01-05 16:43:27,432][model8_pretrain.py][INFO] Epoch:[0/2](670500/4588595) loss:3.089 lr:0.0000100 epoch_Time:24915.0min: [2024-01-05 16:44:04,360][model8_pretrain.py][INFO] Epoch:[0/2](670600/4588595) loss:2.940 lr:0.0000100 epoch_Time:24914.0min: [2024-01-05 16:44:04,360][model8_pretrain.py][INFO] Epoch:[0/2](670600/4588595) loss:2.746 lr:0.0000100 epoch_Time:24914.0min: [2024-01-05 16:44:04,360][model8_pretrain.py][INFO] Epoch:[0/2](670600/4588595) loss:2.355 lr:0.0000100 epoch_Time:24914.0min: [2024-01-05 16:44:04,360][model8_pretrain.py][INFO] Epoch:[0/2](670600/4588595) loss:2.723 lr:0.0000100 epoch_Time:24914.0min: [2024-01-05 16:44:04,360][model8_pretrain.py][INFO] Epoch:[0/2](670600/4588595) loss:2.874 lr:0.0000100 epoch_Time:24914.0min: [2024-01-05 
16:44:04,360][model8_pretrain.py][INFO] Epoch:[0/2](670600/4588595) loss:2.777 lr:0.0000100 epoch_Time:24914.0min: [2024-01-05 16:44:04,360][model8_pretrain.py][INFO] Epoch:[0/2](670600/4588595) loss:3.040 lr:0.0000100 epoch_Time:24914.0min: [2024-01-05 16:44:04,360][model8_pretrain.py][INFO] Epoch:[0/2](670600/4588595) loss:2.954 lr:0.0000100 epoch_Time:24914.0min: [2024-01-05 16:44:41,290][model8_pretrain.py][INFO] Epoch:[0/2](670700/4588595) loss:2.567 lr:0.0000100 epoch_Time:24914.0min: [2024-01-05 16:44:41,290][model8_pretrain.py][INFO] Epoch:[0/2](670700/4588595) loss:2.683 lr:0.0000100 epoch_Time:24914.0min: [2024-01-05 16:44:41,290][model8_pretrain.py][INFO] Epoch:[0/2](670700/4588595) loss:3.228 lr:0.0000100 epoch_Time:24914.0min: [2024-01-05 16:44:41,290][model8_pretrain.py][INFO] Epoch:[0/2](670700/4588595) loss:3.033 lr:0.0000100 epoch_Time:24914.0min: [2024-01-05 16:44:41,290][model8_pretrain.py][INFO] Epoch:[0/2](670700/4588595) loss:1.845 lr:0.0000100 epoch_Time:24914.0min: [2024-01-05 16:44:41,290][model8_pretrain.py][INFO] Epoch:[0/2](670700/4588595) loss:2.953 lr:0.0000100 epoch_Time:24914.0min: [2024-01-05 16:44:41,290][model8_pretrain.py][INFO] Epoch:[0/2](670700/4588595) loss:3.334 lr:0.0000100 epoch_Time:24914.0min: [2024-01-05 16:44:41,290][model8_pretrain.py][INFO] Epoch:[0/2](670700/4588595) loss:3.340 lr:0.0000100 epoch_Time:24914.0min: [2024-01-05 16:45:18,215][model8_pretrain.py][INFO] Epoch:[0/2](670800/4588595) loss:3.367 lr:0.0000100 epoch_Time:24913.0min: [2024-01-05 16:45:18,215][model8_pretrain.py][INFO] Epoch:[0/2](670800/4588595) loss:2.791 lr:0.0000100 epoch_Time:24913.0min: [2024-01-05 16:45:18,215][model8_pretrain.py][INFO] Epoch:[0/2](670800/4588595) loss:3.051 lr:0.0000100 epoch_Time:24913.0min: [2024-01-05 16:45:18,215][model8_pretrain.py][INFO] Epoch:[0/2](670800/4588595) loss:3.000 lr:0.0000100 epoch_Time:24913.0min: [2024-01-05 16:45:18,215][model8_pretrain.py][INFO] Epoch:[0/2](670800/4588595) loss:2.984 lr:0.0000100 
epoch_Time:24913.0min: [2024-01-05 16:45:18,215][model8_pretrain.py][INFO] Epoch:[0/2](670800/4588595) loss:2.371 lr:0.0000100 epoch_Time:24913.0min: [2024-01-05 16:45:18,215][model8_pretrain.py][INFO] Epoch:[0/2](670800/4588595) loss:2.335 lr:0.0000100 epoch_Time:24913.0min: [2024-01-05 16:45:18,215][model8_pretrain.py][INFO] Epoch:[0/2](670800/4588595) loss:2.910 lr:0.0000100 epoch_Time:24913.0min: [2024-01-05 16:46:05,653][model8_pretrain.py][INFO] Epoch:[0/2](670900/4588595) loss:2.927 lr:0.0000100 epoch_Time:24913.0min: [2024-01-05 16:46:05,653][model8_pretrain.py][INFO] Epoch:[0/2](670900/4588595) loss:3.165 lr:0.0000100 epoch_Time:24913.0min: [2024-01-05 16:46:05,653][model8_pretrain.py][INFO] Epoch:[0/2](670900/4588595) loss:2.907 lr:0.0000100 epoch_Time:24913.0min: [2024-01-05 16:46:05,653][model8_pretrain.py][INFO] Epoch:[0/2](670900/4588595) loss:2.848 lr:0.0000100 epoch_Time:24913.0min: [2024-01-05 16:46:05,653][model8_pretrain.py][INFO] Epoch:[0/2](670900/4588595) loss:2.756 lr:0.0000100 epoch_Time:24913.0min: [2024-01-05 16:46:05,653][model8_pretrain.py][INFO] Epoch:[0/2](670900/4588595) loss:2.373 lr:0.0000100 epoch_Time:24913.0min: [2024-01-05 16:46:05,653][model8_pretrain.py][INFO] Epoch:[0/2](670900/4588595) loss:2.836 lr:0.0000100 epoch_Time:24913.0min: [2024-01-05 16:46:05,653][model8_pretrain.py][INFO] Epoch:[0/2](670900/4588595) loss:2.710 lr:0.0000100 epoch_Time:24913.0min: [2024-01-05 16:46:44,282][model8_pretrain.py][INFO] Epoch:[0/2](671000/4588595) loss:2.188 lr:0.0000100 epoch_Time:24913.0min: [2024-01-05 16:46:44,283][model8_pretrain.py][INFO] Epoch:[0/2](671000/4588595) loss:2.783 lr:0.0000100 epoch_Time:24913.0min: [2024-01-05 16:46:44,283][model8_pretrain.py][INFO] Epoch:[0/2](671000/4588595) loss:3.096 lr:0.0000100 epoch_Time:24913.0min: [2024-01-05 16:46:44,282][model8_pretrain.py][INFO] Epoch:[0/2](671000/4588595) loss:3.271 lr:0.0000100 epoch_Time:24913.0min: [2024-01-05 16:46:44,282][model8_pretrain.py][INFO] 
Epoch:[0/2](671000/4588595) loss:3.347 lr:0.0000100 epoch_Time:24913.0min: [2024-01-05 16:46:44,283][model8_pretrain.py][INFO] Epoch:[0/2](671000/4588595) loss:3.676 lr:0.0000100 epoch_Time:24913.0min: [2024-01-05 16:46:44,283][model8_pretrain.py][INFO] Epoch:[0/2](671000/4588595) loss:2.557 lr:0.0000100 epoch_Time:24913.0min: [2024-01-05 16:46:44,283][model8_pretrain.py][INFO] Epoch:[0/2](671000/4588595) loss:3.055 lr:0.0000100 epoch_Time:24913.0min: [2024-01-05 16:47:21,211][model8_pretrain.py][INFO] Epoch:[0/2](671100/4588595) loss:2.988 lr:0.0000100 epoch_Time:24912.0min: [2024-01-05 16:47:21,211][model8_pretrain.py][INFO] Epoch:[0/2](671100/4588595) loss:2.661 lr:0.0000100 epoch_Time:24912.0min: [2024-01-05 16:47:21,211][model8_pretrain.py][INFO] Epoch:[0/2](671100/4588595) loss:2.947 lr:0.0000100 epoch_Time:24912.0min: [2024-01-05 16:47:21,211][model8_pretrain.py][INFO] Epoch:[0/2](671100/4588595) loss:2.625 lr:0.0000100 epoch_Time:24912.0min: [2024-01-05 16:47:21,211][model8_pretrain.py][INFO] Epoch:[0/2](671100/4588595) loss:3.073 lr:0.0000100 epoch_Time:24912.0min: [2024-01-05 16:47:21,211][model8_pretrain.py][INFO] Epoch:[0/2](671100/4588595) loss:2.100 lr:0.0000100 epoch_Time:24912.0min: [2024-01-05 16:47:21,211][model8_pretrain.py][INFO] Epoch:[0/2](671100/4588595) loss:3.067 lr:0.0000100 epoch_Time:24912.0min: [2024-01-05 16:47:21,211][model8_pretrain.py][INFO] Epoch:[0/2](671100/4588595) loss:2.127 lr:0.0000100 epoch_Time:24912.0min: [2024-01-05 16:47:58,152][model8_pretrain.py][INFO] Epoch:[0/2](671200/4588595) loss:2.628 lr:0.0000100 epoch_Time:24910.0min: [2024-01-05 16:47:58,152][model8_pretrain.py][INFO] Epoch:[0/2](671200/4588595) loss:3.548 lr:0.0000100 epoch_Time:24910.0min: [2024-01-05 16:47:58,152][model8_pretrain.py][INFO] Epoch:[0/2](671200/4588595) loss:2.854 lr:0.0000100 epoch_Time:24910.0min: [2024-01-05 16:47:58,152][model8_pretrain.py][INFO] Epoch:[0/2](671200/4588595) loss:3.043 lr:0.0000100 epoch_Time:24910.0min: [2024-01-05 
16:47:58,152][model8_pretrain.py][INFO] Epoch:[0/2](671200/4588595) loss:3.054 lr:0.0000100 epoch_Time:24910.0min: [2024-01-05 16:47:58,152][model8_pretrain.py][INFO] Epoch:[0/2](671200/4588595) loss:3.131 lr:0.0000100 epoch_Time:24910.0min: [2024-01-05 16:47:58,153][model8_pretrain.py][INFO] Epoch:[0/2](671200/4588595) loss:2.958 lr:0.0000100 epoch_Time:24910.0min: [2024-01-05 16:47:58,153][model8_pretrain.py][INFO] Epoch:[0/2](671200/4588595) loss:2.890 lr:0.0000100 epoch_Time:24910.0min: [2024-01-05 16:48:35,087][model8_pretrain.py][INFO] Epoch:[0/2](671300/4588595) loss:2.330 lr:0.0000100 epoch_Time:24910.0min: [2024-01-05 16:48:35,087][model8_pretrain.py][INFO] Epoch:[0/2](671300/4588595) loss:2.405 lr:0.0000100 epoch_Time:24910.0min: [2024-01-05 16:48:35,087][model8_pretrain.py][INFO] Epoch:[0/2](671300/4588595) loss:2.906 lr:0.0000100 epoch_Time:24910.0min: [2024-01-05 16:48:35,087][model8_pretrain.py][INFO] Epoch:[0/2](671300/4588595) loss:2.709 lr:0.0000100 epoch_Time:24910.0min: [2024-01-05 16:48:35,087][model8_pretrain.py][INFO] Epoch:[0/2](671300/4588595) loss:3.251 lr:0.0000100 epoch_Time:24910.0min: [2024-01-05 16:48:35,088][model8_pretrain.py][INFO] Epoch:[0/2](671300/4588595) loss:1.995 lr:0.0000100 epoch_Time:24910.0min: [2024-01-05 16:48:35,087][model8_pretrain.py][INFO] Epoch:[0/2](671300/4588595) loss:3.418 lr:0.0000100 epoch_Time:24910.0min: [2024-01-05 16:48:35,088][model8_pretrain.py][INFO] Epoch:[0/2](671300/4588595) loss:3.145 lr:0.0000100 epoch_Time:24910.0min: [2024-01-05 16:49:12,020][model8_pretrain.py][INFO] Epoch:[0/2](671400/4588595) loss:3.004 lr:0.0000100 epoch_Time:24909.0min: [2024-01-05 16:49:12,020][model8_pretrain.py][INFO] Epoch:[0/2](671400/4588595) loss:2.744 lr:0.0000100 epoch_Time:24909.0min: [2024-01-05 16:49:12,020][model8_pretrain.py][INFO] Epoch:[0/2](671400/4588595) loss:2.359 lr:0.0000100 epoch_Time:24909.0min: [2024-01-05 16:49:12,020][model8_pretrain.py][INFO] Epoch:[0/2](671400/4588595) loss:2.652 lr:0.0000100 
epoch_Time:24909.0min: [2024-01-05 16:49:12,020][model8_pretrain.py][INFO] Epoch:[0/2](671400/4588595) loss:3.009 lr:0.0000100 epoch_Time:24909.0min: [2024-01-05 16:49:12,020][model8_pretrain.py][INFO] Epoch:[0/2](671400/4588595) loss:3.232 lr:0.0000100 epoch_Time:24909.0min: [2024-01-05 16:49:12,020][model8_pretrain.py][INFO] Epoch:[0/2](671400/4588595) loss:2.978 lr:0.0000100 epoch_Time:24909.0min: [2024-01-05 16:49:12,020][model8_pretrain.py][INFO] Epoch:[0/2](671400/4588595) loss:2.759 lr:0.0000100 epoch_Time:24909.0min: [2024-01-05 16:49:48,947][model8_pretrain.py][INFO] Epoch:[0/2](671500/4588595) loss:2.762 lr:0.0000100 epoch_Time:24908.0min: [2024-01-05 16:49:48,947][model8_pretrain.py][INFO] Epoch:[0/2](671500/4588595) loss:2.940 lr:0.0000100 epoch_Time:24908.0min: [2024-01-05 16:49:48,947][model8_pretrain.py][INFO] Epoch:[0/2](671500/4588595) loss:2.897 lr:0.0000100 epoch_Time:24908.0min: [2024-01-05 16:49:48,947][model8_pretrain.py][INFO] Epoch:[0/2](671500/4588595) loss:2.984 lr:0.0000100 epoch_Time:24908.0min: [2024-01-05 16:49:48,947][model8_pretrain.py][INFO] Epoch:[0/2](671500/4588595) loss:3.091 lr:0.0000100 epoch_Time:24908.0min: [2024-01-05 16:49:48,947][model8_pretrain.py][INFO] Epoch:[0/2](671500/4588595) loss:3.158 lr:0.0000100 epoch_Time:24908.0min: [2024-01-05 16:49:48,947][model8_pretrain.py][INFO] Epoch:[0/2](671500/4588595) loss:2.481 lr:0.0000100 epoch_Time:24908.0min: [2024-01-05 16:49:48,948][model8_pretrain.py][INFO] Epoch:[0/2](671500/4588595) loss:3.325 lr:0.0000100 epoch_Time:24908.0min: [2024-01-05 16:50:25,878][model8_pretrain.py][INFO] Epoch:[0/2](671600/4588595) loss:3.314 lr:0.0000100 epoch_Time:24908.0min: [2024-01-05 16:50:25,878][model8_pretrain.py][INFO] Epoch:[0/2](671600/4588595) loss:3.179 lr:0.0000100 epoch_Time:24908.0min: [2024-01-05 16:50:25,878][model8_pretrain.py][INFO] Epoch:[0/2](671600/4588595) loss:2.537 lr:0.0000100 epoch_Time:24908.0min: [2024-01-05 16:50:25,878][model8_pretrain.py][INFO] 
Epoch:[0/2](671600/4588595) loss:2.502 lr:0.0000100 epoch_Time:24908.0min: [2024-01-05 16:50:25,878][model8_pretrain.py][INFO] Epoch:[0/2](671600/4588595) loss:2.778 lr:0.0000100 epoch_Time:24908.0min: [2024-01-05 16:50:25,878][model8_pretrain.py][INFO] Epoch:[0/2](671600/4588595) loss:2.720 lr:0.0000100 epoch_Time:24908.0min: [2024-01-05 16:50:25,879][model8_pretrain.py][INFO] Epoch:[0/2](671600/4588595) loss:2.392 lr:0.0000100 epoch_Time:24908.0min: [2024-01-05 16:50:25,879][model8_pretrain.py][INFO] Epoch:[0/2](671600/4588595) loss:3.035 lr:0.0000100 epoch_Time:24908.0min: [2024-01-05 16:51:13,271][model8_pretrain.py][INFO] Epoch:[0/2](671700/4588595) loss:2.648 lr:0.0000100 epoch_Time:24908.0min: [2024-01-05 16:51:13,271][model8_pretrain.py][INFO] Epoch:[0/2](671700/4588595) loss:2.550 lr:0.0000100 epoch_Time:24908.0min: [2024-01-05 16:51:13,271][model8_pretrain.py][INFO] Epoch:[0/2](671700/4588595) loss:3.408 lr:0.0000100 epoch_Time:24908.0min: [2024-01-05 16:51:13,271][model8_pretrain.py][INFO] Epoch:[0/2](671700/4588595) loss:2.329 lr:0.0000100 epoch_Time:24908.0min: [2024-01-05 16:51:13,271][model8_pretrain.py][INFO] Epoch:[0/2](671700/4588595) loss:2.987 lr:0.0000100 epoch_Time:24908.0min: [2024-01-05 16:51:13,271][model8_pretrain.py][INFO] Epoch:[0/2](671700/4588595) loss:2.836 lr:0.0000100 epoch_Time:24908.0min: [2024-01-05 16:51:13,271][model8_pretrain.py][INFO] Epoch:[0/2](671700/4588595) loss:3.278 lr:0.0000100 epoch_Time:24908.0min: [2024-01-05 16:51:13,272][model8_pretrain.py][INFO] Epoch:[0/2](671700/4588595) loss:3.314 lr:0.0000100 epoch_Time:24908.0min: [2024-01-05 16:51:51,879][model8_pretrain.py][INFO] Epoch:[0/2](671800/4588595) loss:2.252 lr:0.0000100 epoch_Time:24907.0min: [2024-01-05 16:51:51,879][model8_pretrain.py][INFO] Epoch:[0/2](671800/4588595) loss:2.458 lr:0.0000100 epoch_Time:24907.0min: [2024-01-05 16:51:51,879][model8_pretrain.py][INFO] Epoch:[0/2](671800/4588595) loss:2.637 lr:0.0000100 epoch_Time:24907.0min: [2024-01-05 
16:51:51,879][model8_pretrain.py][INFO] Epoch:[0/2](671800/4588595) loss:3.041 lr:0.0000100 epoch_Time:24907.0min: [2024-01-05 16:51:51,879][model8_pretrain.py][INFO] Epoch:[0/2](671800/4588595) loss:2.824 lr:0.0000100 epoch_Time:24907.0min: [2024-01-05 16:51:51,879][model8_pretrain.py][INFO] Epoch:[0/2](671800/4588595) loss:2.948 lr:0.0000100 epoch_Time:24907.0min: [2024-01-05 16:51:51,879][model8_pretrain.py][INFO] Epoch:[0/2](671800/4588595) loss:2.855 lr:0.0000100 epoch_Time:24907.0min: [2024-01-05 16:51:51,880][model8_pretrain.py][INFO] Epoch:[0/2](671800/4588595) loss:3.056 lr:0.0000100 epoch_Time:24907.0min: [2024-01-05 16:52:28,830][model8_pretrain.py][INFO] Epoch:[0/2](671900/4588595) loss:2.195 lr:0.0000100 epoch_Time:24907.0min: [2024-01-05 16:52:28,830][model8_pretrain.py][INFO] Epoch:[0/2](671900/4588595) loss:2.502 lr:0.0000100 epoch_Time:24907.0min: [2024-01-05 16:52:28,830][model8_pretrain.py][INFO] Epoch:[0/2](671900/4588595) loss:3.142 lr:0.0000100 epoch_Time:24907.0min: [2024-01-05 16:52:28,830][model8_pretrain.py][INFO] Epoch:[0/2](671900/4588595) loss:3.697 lr:0.0000100 epoch_Time:24907.0min: [2024-01-05 16:52:28,830][model8_pretrain.py][INFO] Epoch:[0/2](671900/4588595) loss:2.713 lr:0.0000100 epoch_Time:24907.0min: [2024-01-05 16:52:28,830][model8_pretrain.py][INFO] Epoch:[0/2](671900/4588595) loss:2.558 lr:0.0000100 epoch_Time:24907.0min: [2024-01-05 16:52:28,830][model8_pretrain.py][INFO] Epoch:[0/2](671900/4588595) loss:3.289 lr:0.0000100 epoch_Time:24907.0min: [2024-01-05 16:52:28,830][model8_pretrain.py][INFO] Epoch:[0/2](671900/4588595) loss:2.575 lr:0.0000100 epoch_Time:24907.0min: [2024-01-05 16:53:05,768][model8_pretrain.py][INFO] Epoch:[0/2](672000/4588595) loss:2.678 lr:0.0000100 epoch_Time:24906.0min: [2024-01-05 16:53:05,768][model8_pretrain.py][INFO] Epoch:[0/2](672000/4588595) loss:2.402 lr:0.0000100 epoch_Time:24906.0min: [2024-01-05 16:53:05,768][model8_pretrain.py][INFO] Epoch:[0/2](672000/4588595) loss:2.665 lr:0.0000100 
epoch_Time:24906.0min: [2024-01-05 16:53:05,768][model8_pretrain.py][INFO] Epoch:[0/2](672000/4588595) loss:2.835 lr:0.0000100 epoch_Time:24906.0min: [2024-01-05 16:53:05,768][model8_pretrain.py][INFO] Epoch:[0/2](672000/4588595) loss:3.082 lr:0.0000100 epoch_Time:24906.0min: [2024-01-05 16:53:05,768][model8_pretrain.py][INFO] Epoch:[0/2](672000/4588595) loss:3.348 lr:0.0000100 epoch_Time:24906.0min: [2024-01-05 16:53:05,768][model8_pretrain.py][INFO] Epoch:[0/2](672000/4588595) loss:3.041 lr:0.0000100 epoch_Time:24906.0min: [2024-01-05 16:53:05,768][model8_pretrain.py][INFO] Epoch:[0/2](672000/4588595) loss:3.180 lr:0.0000100 epoch_Time:24906.0min: [2024-01-05 16:53:42,707][model8_pretrain.py][INFO] Epoch:[0/2](672100/4588595) loss:2.960 lr:0.0000100 epoch_Time:24906.0min: [2024-01-05 16:53:42,707][model8_pretrain.py][INFO] Epoch:[0/2](672100/4588595) loss:2.697 lr:0.0000100 epoch_Time:24906.0min: [2024-01-05 16:53:42,707][model8_pretrain.py][INFO] Epoch:[0/2](672100/4588595) loss:2.742 lr:0.0000100 epoch_Time:24906.0min: [2024-01-05 16:53:42,707][model8_pretrain.py][INFO] Epoch:[0/2](672100/4588595) loss:2.974 lr:0.0000100 epoch_Time:24906.0min: [2024-01-05 16:53:42,707][model8_pretrain.py][INFO] Epoch:[0/2](672100/4588595) loss:2.447 lr:0.0000100 epoch_Time:24906.0min: [2024-01-05 16:53:42,707][model8_pretrain.py][INFO] Epoch:[0/2](672100/4588595) loss:2.986 lr:0.0000100 epoch_Time:24906.0min: [2024-01-05 16:53:42,707][model8_pretrain.py][INFO] Epoch:[0/2](672100/4588595) loss:2.931 lr:0.0000100 epoch_Time:24906.0min: [2024-01-05 16:53:42,707][model8_pretrain.py][INFO] Epoch:[0/2](672100/4588595) loss:2.833 lr:0.0000100 epoch_Time:24906.0min: [2024-01-05 16:54:19,647][model8_pretrain.py][INFO] Epoch:[0/2](672200/4588595) loss:2.775 lr:0.0000100 epoch_Time:24904.0min: [2024-01-05 16:54:19,647][model8_pretrain.py][INFO] Epoch:[0/2](672200/4588595) loss:2.993 lr:0.0000100 epoch_Time:24904.0min: [2024-01-05 16:54:19,647][model8_pretrain.py][INFO] 
Epoch:[0/2](672200/4588595) loss:3.137 lr:0.0000100 epoch_Time:24904.0min: [2024-01-05 16:54:19,647][model8_pretrain.py][INFO] Epoch:[0/2](672200/4588595) loss:2.609 lr:0.0000100 epoch_Time:24904.0min: [2024-01-05 16:54:19,647][model8_pretrain.py][INFO] Epoch:[0/2](672200/4588595) loss:3.053 lr:0.0000100 epoch_Time:24904.0min: [2024-01-05 16:54:19,647][model8_pretrain.py][INFO] Epoch:[0/2](672200/4588595) loss:2.611 lr:0.0000100 epoch_Time:24904.0min: [2024-01-05 16:54:19,647][model8_pretrain.py][INFO] Epoch:[0/2](672200/4588595) loss:2.742 lr:0.0000100 epoch_Time:24904.0min: [2024-01-05 16:54:19,647][model8_pretrain.py][INFO] Epoch:[0/2](672200/4588595) loss:2.245 lr:0.0000100 epoch_Time:24904.0min: [2024-01-05 16:54:56,606][model8_pretrain.py][INFO] Epoch:[0/2](672300/4588595) loss:2.729 lr:0.0000100 epoch_Time:24903.0min: [2024-01-05 16:54:56,606][model8_pretrain.py][INFO] Epoch:[0/2](672300/4588595) loss:2.572 lr:0.0000100 epoch_Time:24903.0min: [2024-01-05 16:54:56,606][model8_pretrain.py][INFO] Epoch:[0/2](672300/4588595) loss:3.379 lr:0.0000100 epoch_Time:24903.0min: [2024-01-05 16:54:56,606][model8_pretrain.py][INFO] Epoch:[0/2](672300/4588595) loss:3.180 lr:0.0000100 epoch_Time:24903.0min: [2024-01-05 16:54:56,606][model8_pretrain.py][INFO] Epoch:[0/2](672300/4588595) loss:2.955 lr:0.0000100 epoch_Time:24903.0min: [2024-01-05 16:54:56,606][model8_pretrain.py][INFO] Epoch:[0/2](672300/4588595) loss:3.094 lr:0.0000100 epoch_Time:24903.0min: [2024-01-05 16:54:56,606][model8_pretrain.py][INFO] Epoch:[0/2](672300/4588595) loss:2.771 lr:0.0000100 epoch_Time:24903.0min: [2024-01-05 16:54:56,606][model8_pretrain.py][INFO] Epoch:[0/2](672300/4588595) loss:2.591 lr:0.0000100 epoch_Time:24903.0min: [2024-01-05 16:55:33,552][model8_pretrain.py][INFO] Epoch:[0/2](672400/4588595) loss:2.678 lr:0.0000100 epoch_Time:24903.0min: [2024-01-05 16:55:33,552][model8_pretrain.py][INFO] Epoch:[0/2](672400/4588595) loss:3.238 lr:0.0000100 epoch_Time:24903.0min: [2024-01-05 
16:55:33,552][model8_pretrain.py][INFO] Epoch:[0/2](672400/4588595) loss:2.830 lr:0.0000100 epoch_Time:24903.0min: [2024-01-05 16:55:33,552][model8_pretrain.py][INFO] Epoch:[0/2](672400/4588595) loss:2.908 lr:0.0000100 epoch_Time:24903.0min: [2024-01-05 16:55:33,552][model8_pretrain.py][INFO] Epoch:[0/2](672400/4588595) loss:2.770 lr:0.0000100 epoch_Time:24903.0min: [2024-01-05 16:55:33,552][model8_pretrain.py][INFO] Epoch:[0/2](672400/4588595) loss:3.029 lr:0.0000100 epoch_Time:24903.0min: [2024-01-05 16:55:33,552][model8_pretrain.py][INFO] Epoch:[0/2](672400/4588595) loss:2.934 lr:0.0000100 epoch_Time:24903.0min: [2024-01-05 16:55:33,552][model8_pretrain.py][INFO] Epoch:[0/2](672400/4588595) loss:2.673 lr:0.0000100 epoch_Time:24903.0min: [2024-01-05 16:56:19,120][model8_pretrain.py][INFO] Epoch:[0/2](672500/4588595) loss:2.947 lr:0.0000100 epoch_Time:24903.0min: [2024-01-05 16:56:19,120][model8_pretrain.py][INFO] Epoch:[0/2](672500/4588595) loss:2.943 lr:0.0000100 epoch_Time:24903.0min: [2024-01-05 16:56:19,120][model8_pretrain.py][INFO] Epoch:[0/2](672500/4588595) loss:2.878 lr:0.0000100 epoch_Time:24903.0min: [2024-01-05 16:56:19,120][model8_pretrain.py][INFO] Epoch:[0/2](672500/4588595) loss:2.836 lr:0.0000100 epoch_Time:24903.0min: [2024-01-05 16:56:19,120][model8_pretrain.py][INFO] Epoch:[0/2](672500/4588595) loss:2.589 lr:0.0000100 epoch_Time:24903.0min: [2024-01-05 16:56:19,120][model8_pretrain.py][INFO] Epoch:[0/2](672500/4588595) loss:2.657 lr:0.0000100 epoch_Time:24903.0min: [2024-01-05 16:56:19,120][model8_pretrain.py][INFO] Epoch:[0/2](672500/4588595) loss:2.763 lr:0.0000100 epoch_Time:24903.0min: [2024-01-05 16:56:19,120][model8_pretrain.py][INFO] Epoch:[0/2](672500/4588595) loss:2.844 lr:0.0000100 epoch_Time:24903.0min: [2024-01-05 16:56:59,486][model8_pretrain.py][INFO] Epoch:[0/2](672600/4588595) loss:3.253 lr:0.0000100 epoch_Time:24902.0min: [2024-01-05 16:56:59,487][model8_pretrain.py][INFO] Epoch:[0/2](672600/4588595) loss:2.733 lr:0.0000100 
epoch_Time:24902.0min: [2024-01-05 16:56:59,487][model8_pretrain.py][INFO] Epoch:[0/2](672600/4588595) loss:2.388 lr:0.0000100 epoch_Time:24902.0min: [2024-01-05 16:56:59,487][model8_pretrain.py][INFO] Epoch:[0/2](672600/4588595) loss:2.931 lr:0.0000100 epoch_Time:24902.0min: [2024-01-05 16:56:59,487][model8_pretrain.py][INFO] Epoch:[0/2](672600/4588595) loss:2.874 lr:0.0000100 epoch_Time:24902.0min: [2024-01-05 16:56:59,487][model8_pretrain.py][INFO] Epoch:[0/2](672600/4588595) loss:3.000 lr:0.0000100 epoch_Time:24902.0min: [2024-01-05 16:56:59,487][model8_pretrain.py][INFO] Epoch:[0/2](672600/4588595) loss:2.710 lr:0.0000100 epoch_Time:24902.0min: [2024-01-05 16:56:59,487][model8_pretrain.py][INFO] Epoch:[0/2](672600/4588595) loss:2.840 lr:0.0000100 epoch_Time:24902.0min: [2024-01-05 16:57:36,431][model8_pretrain.py][INFO] Epoch:[0/2](672700/4588595) loss:2.742 lr:0.0000100 epoch_Time:24902.0min: [2024-01-05 16:57:36,431][model8_pretrain.py][INFO] Epoch:[0/2](672700/4588595) loss:2.833 lr:0.0000100 epoch_Time:24902.0min: [2024-01-05 16:57:36,431][model8_pretrain.py][INFO] Epoch:[0/2](672700/4588595) loss:2.713 lr:0.0000100 epoch_Time:24902.0min: [2024-01-05 16:57:36,431][model8_pretrain.py][INFO] Epoch:[0/2](672700/4588595) loss:2.656 lr:0.0000100 epoch_Time:24902.0min: [2024-01-05 16:57:36,431][model8_pretrain.py][INFO] Epoch:[0/2](672700/4588595) loss:2.613 lr:0.0000100 epoch_Time:24902.0min: [2024-01-05 16:57:36,431][model8_pretrain.py][INFO] Epoch:[0/2](672700/4588595) loss:2.639 lr:0.0000100 epoch_Time:24902.0min: [2024-01-05 16:57:36,431][model8_pretrain.py][INFO] Epoch:[0/2](672700/4588595) loss:2.661 lr:0.0000100 epoch_Time:24902.0min: [2024-01-05 16:57:36,431][model8_pretrain.py][INFO] Epoch:[0/2](672700/4588595) loss:2.673 lr:0.0000100 epoch_Time:24902.0min: [2024-01-05 16:58:13,388][model8_pretrain.py][INFO] Epoch:[0/2](672800/4588595) loss:2.621 lr:0.0000100 epoch_Time:24901.0min: [2024-01-05 16:58:13,388][model8_pretrain.py][INFO] 
Epoch:[0/2](672800/4588595) loss:3.243 lr:0.0000100 epoch_Time:24901.0min: [2024-01-05 16:58:13,388][model8_pretrain.py][INFO] Epoch:[0/2](672800/4588595) loss:3.092 lr:0.0000100 epoch_Time:24901.0min: [2024-01-05 16:58:13,388][model8_pretrain.py][INFO] Epoch:[0/2](672800/4588595) loss:3.029 lr:0.0000100 epoch_Time:24901.0min: [2024-01-05 16:58:13,388][model8_pretrain.py][INFO] Epoch:[0/2](672800/4588595) loss:3.321 lr:0.0000100 epoch_Time:24901.0min: [2024-01-05 16:58:13,388][model8_pretrain.py][INFO] Epoch:[0/2](672800/4588595) loss:2.794 lr:0.0000100 epoch_Time:24901.0min: [2024-01-05 16:58:13,388][model8_pretrain.py][INFO] Epoch:[0/2](672800/4588595) loss:2.847 lr:0.0000100 epoch_Time:24901.0min: [2024-01-05 16:58:13,388][model8_pretrain.py][INFO] Epoch:[0/2](672800/4588595) loss:2.888 lr:0.0000100 epoch_Time:24901.0min: [2024-01-05 16:58:50,339][model8_pretrain.py][INFO] Epoch:[0/2](672900/4588595) loss:2.332 lr:0.0000100 epoch_Time:24900.0min: [2024-01-05 16:58:50,339][model8_pretrain.py][INFO] Epoch:[0/2](672900/4588595) loss:2.348 lr:0.0000100 epoch_Time:24900.0min: [2024-01-05 16:58:50,339][model8_pretrain.py][INFO] Epoch:[0/2](672900/4588595) loss:2.161 lr:0.0000100 epoch_Time:24900.0min: [2024-01-05 16:58:50,339][model8_pretrain.py][INFO] Epoch:[0/2](672900/4588595) loss:2.446 lr:0.0000100 epoch_Time:24900.0min: [2024-01-05 16:58:50,339][model8_pretrain.py][INFO] Epoch:[0/2](672900/4588595) loss:3.126 lr:0.0000100 epoch_Time:24900.0min: [2024-01-05 16:58:50,339][model8_pretrain.py][INFO] Epoch:[0/2](672900/4588595) loss:2.824 lr:0.0000100 epoch_Time:24900.0min: [2024-01-05 16:58:50,339][model8_pretrain.py][INFO] Epoch:[0/2](672900/4588595) loss:3.167 lr:0.0000100 epoch_Time:24900.0min: [2024-01-05 16:58:50,340][model8_pretrain.py][INFO] Epoch:[0/2](672900/4588595) loss:2.094 lr:0.0000100 epoch_Time:24900.0min: [2024-01-05 16:59:27,315][model8_pretrain.py][INFO] Epoch:[0/2](673000/4588595) loss:2.724 lr:0.0000100 epoch_Time:24900.0min: [2024-01-05 
16:59:27,315][model8_pretrain.py][INFO] Epoch:[0/2](673000/4588595) loss:2.441 lr:0.0000100 epoch_Time:24900.0min: [2024-01-05 16:59:27,315][model8_pretrain.py][INFO] Epoch:[0/2](673000/4588595) loss:2.157 lr:0.0000100 epoch_Time:24900.0min: [2024-01-05 16:59:27,315][model8_pretrain.py][INFO] Epoch:[0/2](673000/4588595) loss:2.924 lr:0.0000100 epoch_Time:24900.0min: [2024-01-05 16:59:27,315][model8_pretrain.py][INFO] Epoch:[0/2](673000/4588595) loss:3.059 lr:0.0000100 epoch_Time:24900.0min: [2024-01-05 16:59:27,315][model8_pretrain.py][INFO] Epoch:[0/2](673000/4588595) loss:3.190 lr:0.0000100 epoch_Time:24900.0min: [2024-01-05 16:59:27,315][model8_pretrain.py][INFO] Epoch:[0/2](673000/4588595) loss:2.827 lr:0.0000100 epoch_Time:24900.0min: [2024-01-05 16:59:27,315][model8_pretrain.py][INFO] Epoch:[0/2](673000/4588595) loss:2.202 lr:0.0000100 epoch_Time:24900.0min: [2024-01-05 17:00:04,257][model8_pretrain.py][INFO] Epoch:[0/2](673100/4588595) loss:2.450 lr:0.0000100 epoch_Time:24899.0min: [2024-01-05 17:00:04,257][model8_pretrain.py][INFO] Epoch:[0/2](673100/4588595) loss:3.092 lr:0.0000100 epoch_Time:24899.0min: [2024-01-05 17:00:04,257][model8_pretrain.py][INFO] Epoch:[0/2](673100/4588595) loss:2.974 lr:0.0000100 epoch_Time:24899.0min: [2024-01-05 17:00:04,257][model8_pretrain.py][INFO] Epoch:[0/2](673100/4588595) loss:3.254 lr:0.0000100 epoch_Time:24899.0min: [2024-01-05 17:00:04,257][model8_pretrain.py][INFO] Epoch:[0/2](673100/4588595) loss:2.956 lr:0.0000100 epoch_Time:24899.0min: [2024-01-05 17:00:04,257][model8_pretrain.py][INFO] Epoch:[0/2](673100/4588595) loss:2.066 lr:0.0000100 epoch_Time:24899.0min: [2024-01-05 17:00:04,257][model8_pretrain.py][INFO] Epoch:[0/2](673100/4588595) loss:3.088 lr:0.0000100 epoch_Time:24899.0min: [2024-01-05 17:00:04,257][model8_pretrain.py][INFO] Epoch:[0/2](673100/4588595) loss:3.099 lr:0.0000100 epoch_Time:24899.0min: [2024-01-05 17:00:41,202][model8_pretrain.py][INFO] Epoch:[0/2](673200/4588595) loss:2.976 lr:0.0000100 
epoch_Time:24898.0min: [2024-01-05 17:00:41,202][model8_pretrain.py][INFO] Epoch:[0/2](673200/4588595) loss:2.930 lr:0.0000100 epoch_Time:24898.0min: [2024-01-05 17:00:41,202][model8_pretrain.py][INFO] Epoch:[0/2](673200/4588595) loss:2.395 lr:0.0000100 epoch_Time:24898.0min: [2024-01-05 17:00:41,202][model8_pretrain.py][INFO] Epoch:[0/2](673200/4588595) loss:2.341 lr:0.0000100 epoch_Time:24898.0min: [2024-01-05 17:00:41,202][model8_pretrain.py][INFO] Epoch:[0/2](673200/4588595) loss:2.922 lr:0.0000100 epoch_Time:24898.0min: [2024-01-05 17:00:41,202][model8_pretrain.py][INFO] Epoch:[0/2](673200/4588595) loss:2.790 lr:0.0000100 epoch_Time:24898.0min: [2024-01-05 17:00:41,202][model8_pretrain.py][INFO] Epoch:[0/2](673200/4588595) loss:2.490 lr:0.0000100 epoch_Time:24898.0min: [2024-01-05 17:00:41,202][model8_pretrain.py][INFO] Epoch:[0/2](673200/4588595) loss:2.530 lr:0.0000100 epoch_Time:24898.0min: [2024-01-05 17:01:24,890][model8_pretrain.py][INFO] Epoch:[0/2](673300/4588595) loss:2.598 lr:0.0000100 epoch_Time:24898.0min: [2024-01-05 17:01:24,890][model8_pretrain.py][INFO] Epoch:[0/2](673300/4588595) loss:3.078 lr:0.0000100 epoch_Time:24898.0min: [2024-01-05 17:01:24,890][model8_pretrain.py][INFO] Epoch:[0/2](673300/4588595) loss:2.956 lr:0.0000100 epoch_Time:24898.0min: [2024-01-05 17:01:24,890][model8_pretrain.py][INFO] Epoch:[0/2](673300/4588595) loss:2.583 lr:0.0000100 epoch_Time:24898.0min: [2024-01-05 17:01:24,890][model8_pretrain.py][INFO] Epoch:[0/2](673300/4588595) loss:3.020 lr:0.0000100 epoch_Time:24898.0min: [2024-01-05 17:01:24,890][model8_pretrain.py][INFO] Epoch:[0/2](673300/4588595) loss:2.988 lr:0.0000100 epoch_Time:24898.0min: [2024-01-05 17:01:24,890][model8_pretrain.py][INFO] Epoch:[0/2](673300/4588595) loss:3.256 lr:0.0000100 epoch_Time:24898.0min: [2024-01-05 17:01:26,592][model8_pretrain.py][INFO] Epoch:[0/2](673300/4588595) loss:2.766 lr:0.0000100 epoch_Time:24898.0min: [2024-01-05 17:02:07,036][model8_pretrain.py][INFO] 
Epoch:[0/2](673400/4588595) loss:2.882 lr:0.0000100 epoch_Time:24898.0min: [2024-01-05 17:02:07,036][model8_pretrain.py][INFO] Epoch:[0/2](673400/4588595) loss:2.847 lr:0.0000100 epoch_Time:24898.0min: [2024-01-05 17:02:07,036][model8_pretrain.py][INFO] Epoch:[0/2](673400/4588595) loss:3.113 lr:0.0000100 epoch_Time:24898.0min: [2024-01-05 17:02:07,036][model8_pretrain.py][INFO] Epoch:[0/2](673400/4588595) loss:2.784 lr:0.0000100 epoch_Time:24898.0min: [2024-01-05 17:02:07,036][model8_pretrain.py][INFO] Epoch:[0/2](673400/4588595) loss:3.027 lr:0.0000100 epoch_Time:24898.0min: [2024-01-05 17:02:07,036][model8_pretrain.py][INFO] Epoch:[0/2](673400/4588595) loss:3.019 lr:0.0000100 epoch_Time:24898.0min: [2024-01-05 17:02:07,036][model8_pretrain.py][INFO] Epoch:[0/2](673400/4588595) loss:1.766 lr:0.0000100 epoch_Time:24898.0min: [2024-01-05 17:02:07,036][model8_pretrain.py][INFO] Epoch:[0/2](673400/4588595) loss:2.616 lr:0.0000100 epoch_Time:24898.0min: [2024-01-05 17:02:43,971][model8_pretrain.py][INFO] Epoch:[0/2](673500/4588595) loss:2.714 lr:0.0000100 epoch_Time:24897.0min: [2024-01-05 17:02:43,971][model8_pretrain.py][INFO] Epoch:[0/2](673500/4588595) loss:2.616 lr:0.0000100 epoch_Time:24897.0min: [2024-01-05 17:02:43,971][model8_pretrain.py][INFO] Epoch:[0/2](673500/4588595) loss:2.655 lr:0.0000100 epoch_Time:24897.0min: [2024-01-05 17:02:43,971][model8_pretrain.py][INFO] Epoch:[0/2](673500/4588595) loss:2.689 lr:0.0000100 epoch_Time:24897.0min: [2024-01-05 17:02:43,971][model8_pretrain.py][INFO] Epoch:[0/2](673500/4588595) loss:2.557 lr:0.0000100 epoch_Time:24897.0min: [2024-01-05 17:02:43,971][model8_pretrain.py][INFO] Epoch:[0/2](673500/4588595) loss:2.166 lr:0.0000100 epoch_Time:24897.0min: [2024-01-05 17:02:43,971][model8_pretrain.py][INFO] Epoch:[0/2](673500/4588595) loss:2.566 lr:0.0000100 epoch_Time:24897.0min: [2024-01-05 17:02:43,971][model8_pretrain.py][INFO] Epoch:[0/2](673500/4588595) loss:2.619 lr:0.0000100 epoch_Time:24897.0min: [2024-01-05 
17:03:20,915][model8_pretrain.py][INFO] Epoch:[0/2](673600/4588595) loss:2.552 lr:0.0000100 epoch_Time:24896.0min: [2024-01-05 17:03:20,915][model8_pretrain.py][INFO] Epoch:[0/2](673600/4588595) loss:2.920 lr:0.0000100 epoch_Time:24896.0min: [2024-01-05 17:03:20,915][model8_pretrain.py][INFO] Epoch:[0/2](673600/4588595) loss:2.847 lr:0.0000100 epoch_Time:24896.0min: [2024-01-05 17:03:20,915][model8_pretrain.py][INFO] Epoch:[0/2](673600/4588595) loss:2.584 lr:0.0000100 epoch_Time:24896.0min: [2024-01-05 17:03:20,915][model8_pretrain.py][INFO] Epoch:[0/2](673600/4588595) loss:2.850 lr:0.0000100 epoch_Time:24896.0min: [2024-01-05 17:03:20,915][model8_pretrain.py][INFO] Epoch:[0/2](673600/4588595) loss:2.760 lr:0.0000100 epoch_Time:24896.0min: [2024-01-05 17:03:20,915][model8_pretrain.py][INFO] Epoch:[0/2](673600/4588595) loss:2.736 lr:0.0000100 epoch_Time:24896.0min: [2024-01-05 17:03:20,915][model8_pretrain.py][INFO] Epoch:[0/2](673600/4588595) loss:2.543 lr:0.0000100 epoch_Time:24896.0min: [2024-01-05 17:03:57,845][model8_pretrain.py][INFO] Epoch:[0/2](673700/4588595) loss:2.477 lr:0.0000100 epoch_Time:24895.0min: [2024-01-05 17:03:57,845][model8_pretrain.py][INFO] Epoch:[0/2](673700/4588595) loss:2.927 lr:0.0000100 epoch_Time:24895.0min: [2024-01-05 17:03:57,845][model8_pretrain.py][INFO] Epoch:[0/2](673700/4588595) loss:2.876 lr:0.0000100 epoch_Time:24895.0min: [2024-01-05 17:03:57,845][model8_pretrain.py][INFO] Epoch:[0/2](673700/4588595) loss:2.903 lr:0.0000100 epoch_Time:24895.0min: [2024-01-05 17:03:57,846][model8_pretrain.py][INFO] Epoch:[0/2](673700/4588595) loss:2.350 lr:0.0000100 epoch_Time:24895.0min: [2024-01-05 17:03:57,846][model8_pretrain.py][INFO] Epoch:[0/2](673700/4588595) loss:2.931 lr:0.0000100 epoch_Time:24895.0min: [2024-01-05 17:03:57,846][model8_pretrain.py][INFO] Epoch:[0/2](673700/4588595) loss:2.774 lr:0.0000100 epoch_Time:24895.0min: [2024-01-05 17:03:57,846][model8_pretrain.py][INFO] Epoch:[0/2](673700/4588595) loss:2.069 lr:0.0000100 
epoch_Time:24895.0min: [2024-01-05 17:04:34,775][model8_pretrain.py][INFO] Epoch:[0/2](673800/4588595) loss:2.656 lr:0.0000100 epoch_Time:24895.0min: [2024-01-05 17:04:34,775][model8_pretrain.py][INFO] Epoch:[0/2](673800/4588595) loss:3.271 lr:0.0000100 epoch_Time:24895.0min: [2024-01-05 17:04:34,775][model8_pretrain.py][INFO] Epoch:[0/2](673800/4588595) loss:2.535 lr:0.0000100 epoch_Time:24895.0min: [2024-01-05 17:04:34,775][model8_pretrain.py][INFO] Epoch:[0/2](673800/4588595) loss:2.323 lr:0.0000100 epoch_Time:24895.0min: [2024-01-05 17:04:34,775][model8_pretrain.py][INFO] Epoch:[0/2](673800/4588595) loss:2.646 lr:0.0000100 epoch_Time:24895.0min: [2024-01-05 17:04:34,775][model8_pretrain.py][INFO] Epoch:[0/2](673800/4588595) loss:2.760 lr:0.0000100 epoch_Time:24895.0min: [2024-01-05 17:04:34,776][model8_pretrain.py][INFO] Epoch:[0/2](673800/4588595) loss:3.162 lr:0.0000100 epoch_Time:24895.0min: [2024-01-05 17:04:34,776][model8_pretrain.py][INFO] Epoch:[0/2](673800/4588595) loss:3.470 lr:0.0000100 epoch_Time:24895.0min: [2024-01-05 17:05:11,709][model8_pretrain.py][INFO] Epoch:[0/2](673900/4588595) loss:1.753 lr:0.0000100 epoch_Time:24894.0min: [2024-01-05 17:05:11,709][model8_pretrain.py][INFO] Epoch:[0/2](673900/4588595) loss:2.590 lr:0.0000100 epoch_Time:24894.0min: [2024-01-05 17:05:11,709][model8_pretrain.py][INFO] Epoch:[0/2](673900/4588595) loss:2.602 lr:0.0000100 epoch_Time:24894.0min: [2024-01-05 17:05:11,709][model8_pretrain.py][INFO] Epoch:[0/2](673900/4588595) loss:2.514 lr:0.0000100 epoch_Time:24894.0min: [2024-01-05 17:05:11,709][model8_pretrain.py][INFO] Epoch:[0/2](673900/4588595) loss:2.792 lr:0.0000100 epoch_Time:24894.0min: [2024-01-05 17:05:11,709][model8_pretrain.py][INFO] Epoch:[0/2](673900/4588595) loss:2.503 lr:0.0000100 epoch_Time:24894.0min: [2024-01-05 17:05:11,709][model8_pretrain.py][INFO] Epoch:[0/2](673900/4588595) loss:2.760 lr:0.0000100 epoch_Time:24894.0min: [2024-01-05 17:05:11,709][model8_pretrain.py][INFO] 
Epoch:[0/2](673900/4588595) loss:3.082 lr:0.0000100 epoch_Time:24894.0min: [2024-01-05 17:05:48,640][model8_pretrain.py][INFO] Epoch:[0/2](674000/4588595) loss:2.814 lr:0.0000100 epoch_Time:24893.0min: [2024-01-05 17:05:48,640][model8_pretrain.py][INFO] Epoch:[0/2](674000/4588595) loss:3.092 lr:0.0000100 epoch_Time:24893.0min: [2024-01-05 17:05:48,640][model8_pretrain.py][INFO] Epoch:[0/2](674000/4588595) loss:2.500 lr:0.0000100 epoch_Time:24893.0min: [2024-01-05 17:05:48,640][model8_pretrain.py][INFO] Epoch:[0/2](674000/4588595) loss:2.704 lr:0.0000100 epoch_Time:24893.0min: [2024-01-05 17:05:48,640][model8_pretrain.py][INFO] Epoch:[0/2](674000/4588595) loss:3.012 lr:0.0000100 epoch_Time:24893.0min: [2024-01-05 17:05:48,640][model8_pretrain.py][INFO] Epoch:[0/2](674000/4588595) loss:2.387 lr:0.0000100 epoch_Time:24893.0min: [2024-01-05 17:05:48,640][model8_pretrain.py][INFO] Epoch:[0/2](674000/4588595) loss:3.059 lr:0.0000100 epoch_Time:24893.0min: [2024-01-05 17:05:48,640][model8_pretrain.py][INFO] Epoch:[0/2](674000/4588595) loss:2.948 lr:0.0000100 epoch_Time:24893.0min: [2024-01-05 17:06:32,346][model8_pretrain.py][INFO] Epoch:[0/2](674100/4588595) loss:2.855 lr:0.0000100 epoch_Time:24893.0min: [2024-01-05 17:06:32,346][model8_pretrain.py][INFO] Epoch:[0/2](674100/4588595) loss:2.554 lr:0.0000100 epoch_Time:24893.0min: [2024-01-05 17:06:32,346][model8_pretrain.py][INFO] Epoch:[0/2](674100/4588595) loss:2.850 lr:0.0000100 epoch_Time:24893.0min: [2024-01-05 17:06:32,350][model8_pretrain.py][INFO] Epoch:[0/2](674100/4588595) loss:3.134 lr:0.0000100 epoch_Time:24893.0min: [2024-01-05 17:06:32,350][model8_pretrain.py][INFO] Epoch:[0/2](674100/4588595) loss:3.047 lr:0.0000100 epoch_Time:24893.0min: [2024-01-05 17:06:32,350][model8_pretrain.py][INFO] Epoch:[0/2](674100/4588595) loss:2.223 lr:0.0000100 epoch_Time:24893.0min: [2024-01-05 17:06:32,351][model8_pretrain.py][INFO] Epoch:[0/2](674100/4588595) loss:2.454 lr:0.0000100 epoch_Time:24893.0min: [2024-01-05 
17:06:32,351][model8_pretrain.py][INFO] Epoch:[0/2](674100/4588595) loss:3.208 lr:0.0000100 epoch_Time:24893.0min: [2024-01-05 17:07:14,247][model8_pretrain.py][INFO] Epoch:[0/2](674200/4588595) loss:2.494 lr:0.0000100 epoch_Time:24893.0min: [2024-01-05 17:07:14,247][model8_pretrain.py][INFO] Epoch:[0/2](674200/4588595) loss:2.943 lr:0.0000100 epoch_Time:24893.0min: [2024-01-05 17:07:14,247][model8_pretrain.py][INFO] Epoch:[0/2](674200/4588595) loss:2.355 lr:0.0000100 epoch_Time:24893.0min: [2024-01-05 17:07:14,247][model8_pretrain.py][INFO] Epoch:[0/2](674200/4588595) loss:3.080 lr:0.0000100 epoch_Time:24893.0min: [2024-01-05 17:07:14,247][model8_pretrain.py][INFO] Epoch:[0/2](674200/4588595) loss:2.907 lr:0.0000100 epoch_Time:24893.0min: [2024-01-05 17:07:14,247][model8_pretrain.py][INFO] Epoch:[0/2](674200/4588595) loss:3.014 lr:0.0000100 epoch_Time:24893.0min: [2024-01-05 17:07:14,247][model8_pretrain.py][INFO] Epoch:[0/2](674200/4588595) loss:2.807 lr:0.0000100 epoch_Time:24893.0min: [2024-01-05 17:07:14,248][model8_pretrain.py][INFO] Epoch:[0/2](674200/4588595) loss:3.310 lr:0.0000100 epoch_Time:24893.0min: [2024-01-05 17:07:51,179][model8_pretrain.py][INFO] Epoch:[0/2](674300/4588595) loss:2.915 lr:0.0000100 epoch_Time:24892.0min: [2024-01-05 17:07:51,179][model8_pretrain.py][INFO] Epoch:[0/2](674300/4588595) loss:3.126 lr:0.0000100 epoch_Time:24892.0min: [2024-01-05 17:07:51,179][model8_pretrain.py][INFO] Epoch:[0/2](674300/4588595) loss:2.779 lr:0.0000100 epoch_Time:24892.0min: [2024-01-05 17:07:51,179][model8_pretrain.py][INFO] Epoch:[0/2](674300/4588595) loss:3.384 lr:0.0000100 epoch_Time:24892.0min: [2024-01-05 17:07:51,179][model8_pretrain.py][INFO] Epoch:[0/2](674300/4588595) loss:3.421 lr:0.0000100 epoch_Time:24892.0min: [2024-01-05 17:07:51,179][model8_pretrain.py][INFO] Epoch:[0/2](674300/4588595) loss:2.700 lr:0.0000100 epoch_Time:24892.0min: [2024-01-05 17:07:51,179][model8_pretrain.py][INFO] Epoch:[0/2](674300/4588595) loss:2.937 lr:0.0000100 
epoch_Time:24892.0min: [2024-01-05 17:07:51,179][model8_pretrain.py][INFO] Epoch:[0/2](674300/4588595) loss:2.990 lr:0.0000100 epoch_Time:24892.0min: [2024-01-05 17:08:28,117][model8_pretrain.py][INFO] Epoch:[0/2](674400/4588595) loss:2.575 lr:0.0000100 epoch_Time:24891.0min: [2024-01-05 17:08:28,118][model8_pretrain.py][INFO] Epoch:[0/2](674400/4588595) loss:2.814 lr:0.0000100 epoch_Time:24891.0min: [2024-01-05 17:08:28,118][model8_pretrain.py][INFO] Epoch:[0/2](674400/4588595) loss:2.673 lr:0.0000100 epoch_Time:24891.0min: [2024-01-05 17:08:28,118][model8_pretrain.py][INFO] Epoch:[0/2](674400/4588595) loss:2.177 lr:0.0000100 epoch_Time:24891.0min: [2024-01-05 17:08:28,118][model8_pretrain.py][INFO] Epoch:[0/2](674400/4588595) loss:3.244 lr:0.0000100 epoch_Time:24891.0min: [2024-01-05 17:08:28,118][model8_pretrain.py][INFO] Epoch:[0/2](674400/4588595) loss:3.094 lr:0.0000100 epoch_Time:24891.0min: [2024-01-05 17:08:28,118][model8_pretrain.py][INFO] Epoch:[0/2](674400/4588595) loss:2.482 lr:0.0000100 epoch_Time:24891.0min: [2024-01-05 17:08:28,118][model8_pretrain.py][INFO] Epoch:[0/2](674400/4588595) loss:2.953 lr:0.0000100 epoch_Time:24891.0min: [2024-01-05 17:09:05,049][model8_pretrain.py][INFO] Epoch:[0/2](674500/4588595) loss:3.032 lr:0.0000100 epoch_Time:24890.0min: [2024-01-05 17:09:05,049][model8_pretrain.py][INFO] Epoch:[0/2](674500/4588595) loss:2.778 lr:0.0000100 epoch_Time:24890.0min: [2024-01-05 17:09:05,049][model8_pretrain.py][INFO] Epoch:[0/2](674500/4588595) loss:2.233 lr:0.0000100 epoch_Time:24890.0min: [2024-01-05 17:09:05,049][model8_pretrain.py][INFO] Epoch:[0/2](674500/4588595) loss:3.084 lr:0.0000100 epoch_Time:24890.0min: [2024-01-05 17:09:05,049][model8_pretrain.py][INFO] Epoch:[0/2](674500/4588595) loss:2.996 lr:0.0000100 epoch_Time:24890.0min: [2024-01-05 17:09:05,049][model8_pretrain.py][INFO] Epoch:[0/2](674500/4588595) loss:2.966 lr:0.0000100 epoch_Time:24890.0min: [2024-01-05 17:09:05,049][model8_pretrain.py][INFO] 
Epoch:[0/2](674500/4588595) loss:3.149 lr:0.0000100 epoch_Time:24890.0min: [2024-01-05 17:09:05,049][model8_pretrain.py][INFO] Epoch:[0/2](674500/4588595) loss:2.546 lr:0.0000100 epoch_Time:24890.0min: [2024-01-05 17:09:41,976][model8_pretrain.py][INFO] Epoch:[0/2](674600/4588595) loss:2.817 lr:0.0000100 epoch_Time:24890.0min: [2024-01-05 17:09:41,976][model8_pretrain.py][INFO] Epoch:[0/2](674600/4588595) loss:2.935 lr:0.0000100 epoch_Time:24890.0min: [2024-01-05 17:09:41,977][model8_pretrain.py][INFO] Epoch:[0/2](674600/4588595) loss:2.962 lr:0.0000100 epoch_Time:24890.0min: [2024-01-05 17:09:41,976][model8_pretrain.py][INFO] Epoch:[0/2](674600/4588595) loss:1.722 lr:0.0000100 epoch_Time:24890.0min: [2024-01-05 17:09:41,977][model8_pretrain.py][INFO] Epoch:[0/2](674600/4588595) loss:2.863 lr:0.0000100 epoch_Time:24890.0min: [2024-01-05 17:09:41,977][model8_pretrain.py][INFO] Epoch:[0/2](674600/4588595) loss:2.361 lr:0.0000100 epoch_Time:24890.0min: [2024-01-05 17:09:41,977][model8_pretrain.py][INFO] Epoch:[0/2](674600/4588595) loss:2.713 lr:0.0000100 epoch_Time:24890.0min: [2024-01-05 17:09:41,977][model8_pretrain.py][INFO] Epoch:[0/2](674600/4588595) loss:2.237 lr:0.0000100 epoch_Time:24890.0min: [2024-01-05 17:10:18,911][model8_pretrain.py][INFO] Epoch:[0/2](674700/4588595) loss:3.072 lr:0.0000100 epoch_Time:24889.0min: [2024-01-05 17:10:18,911][model8_pretrain.py][INFO] Epoch:[0/2](674700/4588595) loss:2.482 lr:0.0000100 epoch_Time:24889.0min: [2024-01-05 17:10:18,911][model8_pretrain.py][INFO] Epoch:[0/2](674700/4588595) loss:3.131 lr:0.0000100 epoch_Time:24889.0min: [2024-01-05 17:10:18,911][model8_pretrain.py][INFO] Epoch:[0/2](674700/4588595) loss:2.834 lr:0.0000100 epoch_Time:24889.0min: [2024-01-05 17:10:18,911][model8_pretrain.py][INFO] Epoch:[0/2](674700/4588595) loss:3.262 lr:0.0000100 epoch_Time:24889.0min: [2024-01-05 17:10:18,911][model8_pretrain.py][INFO] Epoch:[0/2](674700/4588595) loss:2.706 lr:0.0000100 epoch_Time:24889.0min: [2024-01-05 
17:10:18,911][model8_pretrain.py][INFO] Epoch:[0/2](674700/4588595) loss:2.451 lr:0.0000100 epoch_Time:24889.0min: [2024-01-05 17:10:18,912][model8_pretrain.py][INFO] Epoch:[0/2](674700/4588595) loss:2.786 lr:0.0000100 epoch_Time:24889.0min: [2024-01-05 17:10:55,843][model8_pretrain.py][INFO] Epoch:[0/2](674800/4588595) loss:2.440 lr:0.0000100 epoch_Time:24888.0min: [2024-01-05 17:10:55,843][model8_pretrain.py][INFO] Epoch:[0/2](674800/4588595) loss:3.203 lr:0.0000100 epoch_Time:24888.0min: [2024-01-05 17:10:55,843][model8_pretrain.py][INFO] Epoch:[0/2](674800/4588595) loss:2.908 lr:0.0000100 epoch_Time:24888.0min: [2024-01-05 17:10:55,843][model8_pretrain.py][INFO] Epoch:[0/2](674800/4588595) loss:3.000 lr:0.0000100 epoch_Time:24888.0min: [2024-01-05 17:10:55,843][model8_pretrain.py][INFO] Epoch:[0/2](674800/4588595) loss:2.977 lr:0.0000100 epoch_Time:24888.0min: [2024-01-05 17:10:55,843][model8_pretrain.py][INFO] Epoch:[0/2](674800/4588595) loss:3.346 lr:0.0000100 epoch_Time:24888.0min: [2024-01-05 17:10:55,843][model8_pretrain.py][INFO] Epoch:[0/2](674800/4588595) loss:3.019 lr:0.0000100 epoch_Time:24888.0min: [2024-01-05 17:10:55,844][model8_pretrain.py][INFO] Epoch:[0/2](674800/4588595) loss:3.014 lr:0.0000100 epoch_Time:24888.0min: [2024-01-05 17:11:36,233][model8_pretrain.py][INFO] Epoch:[0/2](674900/4588595) loss:2.718 lr:0.0000100 epoch_Time:24888.0min: [2024-01-05 17:11:36,233][model8_pretrain.py][INFO] Epoch:[0/2](674900/4588595) loss:3.076 lr:0.0000100 epoch_Time:24888.0min: [2024-01-05 17:11:36,233][model8_pretrain.py][INFO] Epoch:[0/2](674900/4588595) loss:2.914 lr:0.0000100 epoch_Time:24888.0min: [2024-01-05 17:11:36,233][model8_pretrain.py][INFO] Epoch:[0/2](674900/4588595) loss:3.148 lr:0.0000100 epoch_Time:24888.0min: [2024-01-05 17:11:36,233][model8_pretrain.py][INFO] Epoch:[0/2](674900/4588595) loss:2.635 lr:0.0000100 epoch_Time:24888.0min: [2024-01-05 17:11:36,233][model8_pretrain.py][INFO] Epoch:[0/2](674900/4588595) loss:2.694 lr:0.0000100 
epoch_Time:24888.0min: [2024-01-05 17:11:36,233][model8_pretrain.py][INFO] Epoch:[0/2](674900/4588595) loss:2.895 lr:0.0000100 epoch_Time:24888.0min: [2024-01-05 17:11:36,234][model8_pretrain.py][INFO] Epoch:[0/2](674900/4588595) loss:2.954 lr:0.0000100 epoch_Time:24888.0min: [2024-01-05 17:12:21,371][model8_pretrain.py][INFO] Epoch:[0/2](675000/4588595) loss:2.165 lr:0.0000100 epoch_Time:24888.0min: [2024-01-05 17:12:21,371][model8_pretrain.py][INFO] Epoch:[0/2](675000/4588595) loss:2.671 lr:0.0000100 epoch_Time:24888.0min: [2024-01-05 17:12:21,371][model8_pretrain.py][INFO] Epoch:[0/2](675000/4588595) loss:2.909 lr:0.0000100 epoch_Time:24888.0min: [2024-01-05 17:12:21,371][model8_pretrain.py][INFO] Epoch:[0/2](675000/4588595) loss:3.513 lr:0.0000100 epoch_Time:24888.0min: [2024-01-05 17:12:21,371][model8_pretrain.py][INFO] Epoch:[0/2](675000/4588595) loss:2.699 lr:0.0000100 epoch_Time:24888.0min: [2024-01-05 17:12:21,371][model8_pretrain.py][INFO] Epoch:[0/2](675000/4588595) loss:2.540 lr:0.0000100 epoch_Time:24888.0min: [2024-01-05 17:12:21,371][model8_pretrain.py][INFO] Epoch:[0/2](675000/4588595) loss:2.753 lr:0.0000100 epoch_Time:24888.0min: [2024-01-05 17:12:21,372][model8_pretrain.py][INFO] Epoch:[0/2](675000/4588595) loss:2.561 lr:0.0000100 epoch_Time:24888.0min: [2024-01-05 17:12:58,318][model8_pretrain.py][INFO] Epoch:[0/2](675100/4588595) loss:2.833 lr:0.0000100 epoch_Time:24887.0min: [2024-01-05 17:12:58,318][model8_pretrain.py][INFO] Epoch:[0/2](675100/4588595) loss:2.355 lr:0.0000100 epoch_Time:24887.0min: [2024-01-05 17:12:58,318][model8_pretrain.py][INFO] Epoch:[0/2](675100/4588595) loss:3.277 lr:0.0000100 epoch_Time:24887.0min: [2024-01-05 17:12:58,318][model8_pretrain.py][INFO] Epoch:[0/2](675100/4588595) loss:2.427 lr:0.0000100 epoch_Time:24887.0min: [2024-01-05 17:12:58,318][model8_pretrain.py][INFO] Epoch:[0/2](675100/4588595) loss:2.636 lr:0.0000100 epoch_Time:24887.0min: [2024-01-05 17:12:58,318][model8_pretrain.py][INFO] 
Epoch:[0/2](675100/4588595) loss:2.822 lr:0.0000100 epoch_Time:24887.0min: [2024-01-05 17:12:58,318][model8_pretrain.py][INFO] Epoch:[0/2](675100/4588595) loss:2.430 lr:0.0000100 epoch_Time:24887.0min: [2024-01-05 17:12:58,318][model8_pretrain.py][INFO] Epoch:[0/2](675100/4588595) loss:2.466 lr:0.0000100 epoch_Time:24887.0min: [2024-01-05 17:13:35,258][model8_pretrain.py][INFO] Epoch:[0/2](675200/4588595) loss:2.922 lr:0.0000100 epoch_Time:24887.0min: [2024-01-05 17:13:35,258][model8_pretrain.py][INFO] Epoch:[0/2](675200/4588595) loss:2.629 lr:0.0000100 epoch_Time:24887.0min: [2024-01-05 17:13:35,258][model8_pretrain.py][INFO] Epoch:[0/2](675200/4588595) loss:2.321 lr:0.0000100 epoch_Time:24887.0min: [2024-01-05 17:13:35,258][model8_pretrain.py][INFO] Epoch:[0/2](675200/4588595) loss:2.831 lr:0.0000100 epoch_Time:24887.0min: [2024-01-05 17:13:35,258][model8_pretrain.py][INFO] Epoch:[0/2](675200/4588595) loss:2.533 lr:0.0000100 epoch_Time:24887.0min: [2024-01-05 17:13:35,258][model8_pretrain.py][INFO] Epoch:[0/2](675200/4588595) loss:3.133 lr:0.0000100 epoch_Time:24887.0min: [2024-01-05 17:13:35,258][model8_pretrain.py][INFO] Epoch:[0/2](675200/4588595) loss:2.969 lr:0.0000100 epoch_Time:24887.0min: [2024-01-05 17:13:35,258][model8_pretrain.py][INFO] Epoch:[0/2](675200/4588595) loss:2.847 lr:0.0000100 epoch_Time:24887.0min: [2024-01-05 17:14:12,202][model8_pretrain.py][INFO] Epoch:[0/2](675300/4588595) loss:3.185 lr:0.0000100 epoch_Time:24886.0min: [2024-01-05 17:14:12,202][model8_pretrain.py][INFO] Epoch:[0/2](675300/4588595) loss:2.674 lr:0.0000100 epoch_Time:24886.0min: [2024-01-05 17:14:12,202][model8_pretrain.py][INFO] Epoch:[0/2](675300/4588595) loss:2.509 lr:0.0000100 epoch_Time:24886.0min: [2024-01-05 17:14:12,202][model8_pretrain.py][INFO] Epoch:[0/2](675300/4588595) loss:2.914 lr:0.0000100 epoch_Time:24886.0min: [2024-01-05 17:14:12,202][model8_pretrain.py][INFO] Epoch:[0/2](675300/4588595) loss:2.830 lr:0.0000100 epoch_Time:24886.0min: [2024-01-05 
17:14:12,202][model8_pretrain.py][INFO] Epoch:[0/2](675300/4588595) loss:3.054 lr:0.0000100 epoch_Time:24886.0min: [2024-01-05 17:14:12,202][model8_pretrain.py][INFO] Epoch:[0/2](675300/4588595) loss:3.285 lr:0.0000100 epoch_Time:24886.0min: [2024-01-05 17:14:12,202][model8_pretrain.py][INFO] Epoch:[0/2](675300/4588595) loss:2.856 lr:0.0000100 epoch_Time:24886.0min: [2024-01-05 17:14:49,150][model8_pretrain.py][INFO] Epoch:[0/2](675400/4588595) loss:3.148 lr:0.0000100 epoch_Time:24884.0min: [2024-01-05 17:14:49,150][model8_pretrain.py][INFO] Epoch:[0/2](675400/4588595) loss:3.031 lr:0.0000100 epoch_Time:24884.0min: [2024-01-05 17:14:49,150][model8_pretrain.py][INFO] Epoch:[0/2](675400/4588595) loss:3.063 lr:0.0000100 epoch_Time:24884.0min: [2024-01-05 17:14:49,150][model8_pretrain.py][INFO] Epoch:[0/2](675400/4588595) loss:3.009 lr:0.0000100 epoch_Time:24884.0min: [2024-01-05 17:14:49,150][model8_pretrain.py][INFO] Epoch:[0/2](675400/4588595) loss:2.768 lr:0.0000100 epoch_Time:24884.0min: [2024-01-05 17:14:49,150][model8_pretrain.py][INFO] Epoch:[0/2](675400/4588595) loss:3.113 lr:0.0000100 epoch_Time:24884.0min: [2024-01-05 17:14:49,150][model8_pretrain.py][INFO] Epoch:[0/2](675400/4588595) loss:3.363 lr:0.0000100 epoch_Time:24884.0min: [2024-01-05 17:14:49,150][model8_pretrain.py][INFO] Epoch:[0/2](675400/4588595) loss:2.815 lr:0.0000100 epoch_Time:24884.0min: [2024-01-05 17:15:26,060][model8_pretrain.py][INFO] Epoch:[0/2](675500/4588595) loss:2.588 lr:0.0000100 epoch_Time:24884.0min: [2024-01-05 17:15:26,061][model8_pretrain.py][INFO] Epoch:[0/2](675500/4588595) loss:2.493 lr:0.0000100 epoch_Time:24884.0min: [2024-01-05 17:15:26,061][model8_pretrain.py][INFO] Epoch:[0/2](675500/4588595) loss:2.812 lr:0.0000100 epoch_Time:24884.0min: [2024-01-05 17:15:26,061][model8_pretrain.py][INFO] Epoch:[0/2](675500/4588595) loss:2.981 lr:0.0000100 epoch_Time:24884.0min: [2024-01-05 17:15:26,061][model8_pretrain.py][INFO] Epoch:[0/2](675500/4588595) loss:2.893 lr:0.0000100 
epoch_Time:24884.0min: [2024-01-05 17:15:26,061][model8_pretrain.py][INFO] Epoch:[0/2](675500/4588595) loss:2.650 lr:0.0000100 epoch_Time:24884.0min: [2024-01-05 17:15:26,061][model8_pretrain.py][INFO] Epoch:[0/2](675500/4588595) loss:2.542 lr:0.0000100 epoch_Time:24884.0min: [2024-01-05 17:15:26,061][model8_pretrain.py][INFO] Epoch:[0/2](675500/4588595) loss:2.617 lr:0.0000100 epoch_Time:24884.0min: [2024-01-05 17:16:02,992][model8_pretrain.py][INFO] Epoch:[0/2](675600/4588595) loss:2.857 lr:0.0000100 epoch_Time:24883.0min: [2024-01-05 17:16:02,992][model8_pretrain.py][INFO] Epoch:[0/2](675600/4588595) loss:2.506 lr:0.0000100 epoch_Time:24883.0min: [2024-01-05 17:16:02,992][model8_pretrain.py][INFO] Epoch:[0/2](675600/4588595) loss:2.733 lr:0.0000100 epoch_Time:24883.0min: [2024-01-05 17:16:02,992][model8_pretrain.py][INFO] Epoch:[0/2](675600/4588595) loss:2.696 lr:0.0000100 epoch_Time:24883.0min: [2024-01-05 17:16:02,992][model8_pretrain.py][INFO] Epoch:[0/2](675600/4588595) loss:3.006 lr:0.0000100 epoch_Time:24883.0min: [2024-01-05 17:16:02,992][model8_pretrain.py][INFO] Epoch:[0/2](675600/4588595) loss:2.732 lr:0.0000100 epoch_Time:24883.0min: [2024-01-05 17:16:02,992][model8_pretrain.py][INFO] Epoch:[0/2](675600/4588595) loss:2.905 lr:0.0000100 epoch_Time:24883.0min: [2024-01-05 17:16:02,992][model8_pretrain.py][INFO] Epoch:[0/2](675600/4588595) loss:2.574 lr:0.0000100 epoch_Time:24883.0min: [2024-01-05 17:16:43,374][model8_pretrain.py][INFO] Epoch:[0/2](675700/4588595) loss:3.044 lr:0.0000100 epoch_Time:24883.0min: [2024-01-05 17:16:43,374][model8_pretrain.py][INFO] Epoch:[0/2](675700/4588595) loss:3.212 lr:0.0000100 epoch_Time:24883.0min: [2024-01-05 17:16:43,374][model8_pretrain.py][INFO] Epoch:[0/2](675700/4588595) loss:2.791 lr:0.0000100 epoch_Time:24883.0min: [2024-01-05 17:16:43,374][model8_pretrain.py][INFO] Epoch:[0/2](675700/4588595) loss:2.803 lr:0.0000100 epoch_Time:24883.0min: [2024-01-05 17:16:43,374][model8_pretrain.py][INFO] 
Epoch:[0/2](675700/4588595) loss:2.884 lr:0.0000100 epoch_Time:24883.0min: [2024-01-05 17:16:43,374][model8_pretrain.py][INFO] Epoch:[0/2](675700/4588595) loss:2.767 lr:0.0000100 epoch_Time:24883.0min: [2024-01-05 17:16:43,374][model8_pretrain.py][INFO] Epoch:[0/2](675700/4588595) loss:3.282 lr:0.0000100 epoch_Time:24883.0min: [2024-01-05 17:16:43,374][model8_pretrain.py][INFO] Epoch:[0/2](675700/4588595) loss:3.281 lr:0.0000100 epoch_Time:24883.0min: [2024-01-05 17:17:28,421][model8_pretrain.py][INFO] Epoch:[0/2](675800/4588595) loss:3.270 lr:0.0000100 epoch_Time:24883.0min: [2024-01-05 17:17:28,421][model8_pretrain.py][INFO] Epoch:[0/2](675800/4588595) loss:2.988 lr:0.0000100 epoch_Time:24883.0min: [2024-01-05 17:17:28,422][model8_pretrain.py][INFO] Epoch:[0/2](675800/4588595) loss:3.051 lr:0.0000100 epoch_Time:24883.0min: [2024-01-05 17:17:28,422][model8_pretrain.py][INFO] Epoch:[0/2](675800/4588595) loss:2.896 lr:0.0000100 epoch_Time:24883.0min: [2024-01-05 17:17:28,422][model8_pretrain.py][INFO] Epoch:[0/2](675800/4588595) loss:3.038 lr:0.0000100 epoch_Time:24883.0min: [2024-01-05 17:17:28,422][model8_pretrain.py][INFO] Epoch:[0/2](675800/4588595) loss:2.416 lr:0.0000100 epoch_Time:24883.0min: [2024-01-05 17:17:28,422][model8_pretrain.py][INFO] Epoch:[0/2](675800/4588595) loss:2.881 lr:0.0000100 epoch_Time:24883.0min: [2024-01-05 17:17:28,422][model8_pretrain.py][INFO] Epoch:[0/2](675800/4588595) loss:3.031 lr:0.0000100 epoch_Time:24883.0min: [2024-01-05 17:18:05,346][model8_pretrain.py][INFO] Epoch:[0/2](675900/4588595) loss:2.939 lr:0.0000100 epoch_Time:24882.0min: [2024-01-05 17:18:05,346][model8_pretrain.py][INFO] Epoch:[0/2](675900/4588595) loss:1.969 lr:0.0000100 epoch_Time:24882.0min: [2024-01-05 17:18:05,346][model8_pretrain.py][INFO] Epoch:[0/2](675900/4588595) loss:3.374 lr:0.0000100 epoch_Time:24882.0min: [2024-01-05 17:18:05,346][model8_pretrain.py][INFO] Epoch:[0/2](675900/4588595) loss:2.655 lr:0.0000100 epoch_Time:24882.0min: [2024-01-05 
17:18:05,346][model8_pretrain.py][INFO] Epoch:[0/2](675900/4588595) loss:2.275 lr:0.0000100 epoch_Time:24882.0min: [2024-01-05 17:18:05,346][model8_pretrain.py][INFO] Epoch:[0/2](675900/4588595) loss:2.701 lr:0.0000100 epoch_Time:24882.0min: [2024-01-05 17:18:05,347][model8_pretrain.py][INFO] Epoch:[0/2](675900/4588595) loss:2.697 lr:0.0000100 epoch_Time:24882.0min: [2024-01-05 17:18:05,347][model8_pretrain.py][INFO] Epoch:[0/2](675900/4588595) loss:2.717 lr:0.0000100 epoch_Time:24882.0min: [2024-01-05 17:18:42,284][model8_pretrain.py][INFO] Epoch:[0/2](676000/4588595) loss:2.675 lr:0.0000100 epoch_Time:24882.0min: [2024-01-05 17:18:42,284][model8_pretrain.py][INFO] Epoch:[0/2](676000/4588595) loss:2.864 lr:0.0000100 epoch_Time:24882.0min: [2024-01-05 17:18:42,284][model8_pretrain.py][INFO] Epoch:[0/2](676000/4588595) loss:3.377 lr:0.0000100 epoch_Time:24882.0min: [2024-01-05 17:18:42,284][model8_pretrain.py][INFO] Epoch:[0/2](676000/4588595) loss:2.687 lr:0.0000100 epoch_Time:24882.0min: [2024-01-05 17:18:42,284][model8_pretrain.py][INFO] Epoch:[0/2](676000/4588595) loss:2.895 lr:0.0000100 epoch_Time:24882.0min: [2024-01-05 17:18:42,284][model8_pretrain.py][INFO] Epoch:[0/2](676000/4588595) loss:2.062 lr:0.0000100 epoch_Time:24882.0min: [2024-01-05 17:18:42,284][model8_pretrain.py][INFO] Epoch:[0/2](676000/4588595) loss:2.694 lr:0.0000100 epoch_Time:24882.0min: [2024-01-05 17:18:42,284][model8_pretrain.py][INFO] Epoch:[0/2](676000/4588595) loss:2.651 lr:0.0000100 epoch_Time:24882.0min: [2024-01-05 17:19:19,228][model8_pretrain.py][INFO] Epoch:[0/2](676100/4588595) loss:3.282 lr:0.0000100 epoch_Time:24881.0min: [2024-01-05 17:19:19,228][model8_pretrain.py][INFO] Epoch:[0/2](676100/4588595) loss:2.607 lr:0.0000100 epoch_Time:24881.0min: [2024-01-05 17:19:19,228][model8_pretrain.py][INFO] Epoch:[0/2](676100/4588595) loss:2.715 lr:0.0000100 epoch_Time:24881.0min: [2024-01-05 17:19:19,228][model8_pretrain.py][INFO] Epoch:[0/2](676100/4588595) loss:2.766 lr:0.0000100 
epoch_Time:24881.0min: [2024-01-05 17:19:19,228][model8_pretrain.py][INFO] Epoch:[0/2](676100/4588595) loss:2.889 lr:0.0000100 epoch_Time:24881.0min: [2024-01-05 17:19:19,228][model8_pretrain.py][INFO] Epoch:[0/2](676100/4588595) loss:2.914 lr:0.0000100 epoch_Time:24881.0min: [2024-01-05 17:19:19,228][model8_pretrain.py][INFO] Epoch:[0/2](676100/4588595) loss:3.108 lr:0.0000100 epoch_Time:24881.0min: [2024-01-05 17:19:19,229][model8_pretrain.py][INFO] Epoch:[0/2](676100/4588595) loss:2.987 lr:0.0000100 epoch_Time:24881.0min: [2024-01-05 17:19:56,169][model8_pretrain.py][INFO] Epoch:[0/2](676200/4588595) loss:3.026 lr:0.0000100 epoch_Time:24880.0min: [2024-01-05 17:19:56,169][model8_pretrain.py][INFO] Epoch:[0/2](676200/4588595) loss:3.005 lr:0.0000100 epoch_Time:24880.0min: [2024-01-05 17:19:56,169][model8_pretrain.py][INFO] Epoch:[0/2](676200/4588595) loss:2.808 lr:0.0000100 epoch_Time:24880.0min: [2024-01-05 17:19:56,169][model8_pretrain.py][INFO] Epoch:[0/2](676200/4588595) loss:2.679 lr:0.0000100 epoch_Time:24880.0min: [2024-01-05 17:19:56,169][model8_pretrain.py][INFO] Epoch:[0/2](676200/4588595) loss:2.774 lr:0.0000100 epoch_Time:24880.0min: [2024-01-05 17:19:56,169][model8_pretrain.py][INFO] Epoch:[0/2](676200/4588595) loss:2.899 lr:0.0000100 epoch_Time:24880.0min: [2024-01-05 17:19:56,169][model8_pretrain.py][INFO] Epoch:[0/2](676200/4588595) loss:2.108 lr:0.0000100 epoch_Time:24880.0min: [2024-01-05 17:19:56,169][model8_pretrain.py][INFO] Epoch:[0/2](676200/4588595) loss:2.875 lr:0.0000100 epoch_Time:24880.0min: [2024-01-05 17:20:33,111][model8_pretrain.py][INFO] Epoch:[0/2](676300/4588595) loss:3.012 lr:0.0000100 epoch_Time:24879.0min: [2024-01-05 17:20:33,111][model8_pretrain.py][INFO] Epoch:[0/2](676300/4588595) loss:3.142 lr:0.0000100 epoch_Time:24879.0min: [2024-01-05 17:20:33,111][model8_pretrain.py][INFO] Epoch:[0/2](676300/4588595) loss:2.825 lr:0.0000100 epoch_Time:24879.0min: [2024-01-05 17:20:33,111][model8_pretrain.py][INFO] 
Epoch:[0/2](676300/4588595) loss:2.961 lr:0.0000100 epoch_Time:24879.0min: [2024-01-05 17:20:33,111][model8_pretrain.py][INFO] Epoch:[0/2](676300/4588595) loss:2.754 lr:0.0000100 epoch_Time:24879.0min: [2024-01-05 17:20:33,111][model8_pretrain.py][INFO] Epoch:[0/2](676300/4588595) loss:2.671 lr:0.0000100 epoch_Time:24879.0min: [2024-01-05 17:20:33,111][model8_pretrain.py][INFO] Epoch:[0/2](676300/4588595) loss:2.912 lr:0.0000100 epoch_Time:24879.0min: [2024-01-05 17:20:33,111][model8_pretrain.py][INFO] Epoch:[0/2](676300/4588595) loss:2.885 lr:0.0000100 epoch_Time:24879.0min: [2024-01-05 17:21:10,044][model8_pretrain.py][INFO] Epoch:[0/2](676400/4588595) loss:3.359 lr:0.0000100 epoch_Time:24878.0min: [2024-01-05 17:21:10,044][model8_pretrain.py][INFO] Epoch:[0/2](676400/4588595) loss:2.322 lr:0.0000100 epoch_Time:24878.0min: [2024-01-05 17:21:10,044][model8_pretrain.py][INFO] Epoch:[0/2](676400/4588595) loss:2.803 lr:0.0000100 epoch_Time:24878.0min: [2024-01-05 17:21:10,044][model8_pretrain.py][INFO] Epoch:[0/2](676400/4588595) loss:2.732 lr:0.0000100 epoch_Time:24878.0min: [2024-01-05 17:21:10,044][model8_pretrain.py][INFO] Epoch:[0/2](676400/4588595) loss:2.602 lr:0.0000100 epoch_Time:24878.0min: [2024-01-05 17:21:10,044][model8_pretrain.py][INFO] Epoch:[0/2](676400/4588595) loss:2.619 lr:0.0000100 epoch_Time:24878.0min: [2024-01-05 17:21:10,044][model8_pretrain.py][INFO] Epoch:[0/2](676400/4588595) loss:2.649 lr:0.0000100 epoch_Time:24878.0min: [2024-01-05 17:21:10,044][model8_pretrain.py][INFO] Epoch:[0/2](676400/4588595) loss:2.698 lr:0.0000100 epoch_Time:24878.0min: [2024-01-05 17:21:50,394][model8_pretrain.py][INFO] Epoch:[0/2](676500/4588595) loss:2.598 lr:0.0000100 epoch_Time:24878.0min: [2024-01-05 17:21:50,394][model8_pretrain.py][INFO] Epoch:[0/2](676500/4588595) loss:2.018 lr:0.0000100 epoch_Time:24878.0min: [2024-01-05 17:21:50,394][model8_pretrain.py][INFO] Epoch:[0/2](676500/4588595) loss:2.168 lr:0.0000100 epoch_Time:24878.0min: [2024-01-05 
17:21:50,394][model8_pretrain.py][INFO] Epoch:[0/2](676500/4588595) loss:2.474 lr:0.0000100 epoch_Time:24878.0min: [2024-01-05 17:21:50,394][model8_pretrain.py][INFO] Epoch:[0/2](676500/4588595) loss:3.176 lr:0.0000100 epoch_Time:24878.0min: [2024-01-05 17:21:50,398][model8_pretrain.py][INFO] Epoch:[0/2](676500/4588595) loss:2.578 lr:0.0000100 epoch_Time:24878.0min: [2024-01-05 17:21:50,398][model8_pretrain.py][INFO] Epoch:[0/2](676500/4588595) loss:2.678 lr:0.0000100 epoch_Time:24878.0min: [2024-01-05 17:21:50,398][model8_pretrain.py][INFO] Epoch:[0/2](676500/4588595) loss:2.832 lr:0.0000100 epoch_Time:24878.0min: [2024-01-05 17:22:36,367][model8_pretrain.py][INFO] Epoch:[0/2](676600/4588595) loss:2.452 lr:0.0000100 epoch_Time:24878.0min: [2024-01-05 17:22:36,367][model8_pretrain.py][INFO] Epoch:[0/2](676600/4588595) loss:3.198 lr:0.0000100 epoch_Time:24878.0min: [2024-01-05 17:22:36,367][model8_pretrain.py][INFO] Epoch:[0/2](676600/4588595) loss:2.675 lr:0.0000100 epoch_Time:24878.0min: [2024-01-05 17:22:36,367][model8_pretrain.py][INFO] Epoch:[0/2](676600/4588595) loss:2.260 lr:0.0000100 epoch_Time:24878.0min: [2024-01-05 17:22:36,367][model8_pretrain.py][INFO] Epoch:[0/2](676600/4588595) loss:2.013 lr:0.0000100 epoch_Time:24878.0min: [2024-01-05 17:22:36,367][model8_pretrain.py][INFO] Epoch:[0/2](676600/4588595) loss:2.818 lr:0.0000100 epoch_Time:24878.0min: [2024-01-05 17:22:36,367][model8_pretrain.py][INFO] Epoch:[0/2](676600/4588595) loss:3.015 lr:0.0000100 epoch_Time:24878.0min: [2024-01-05 17:22:36,367][model8_pretrain.py][INFO] Epoch:[0/2](676600/4588595) loss:2.638 lr:0.0000100 epoch_Time:24878.0min: [2024-01-05 17:23:13,314][model8_pretrain.py][INFO] Epoch:[0/2](676700/4588595) loss:3.062 lr:0.0000100 epoch_Time:24877.0min: [2024-01-05 17:23:13,314][model8_pretrain.py][INFO] Epoch:[0/2](676700/4588595) loss:2.725 lr:0.0000100 epoch_Time:24877.0min: [2024-01-05 17:23:13,314][model8_pretrain.py][INFO] Epoch:[0/2](676700/4588595) loss:2.686 lr:0.0000100 
epoch_Time:24877.0min: [2024-01-05 17:23:13,314][model8_pretrain.py][INFO] Epoch:[0/2](676700/4588595) loss:2.973 lr:0.0000100 epoch_Time:24877.0min: [2024-01-05 17:23:13,314][model8_pretrain.py][INFO] Epoch:[0/2](676700/4588595) loss:2.628 lr:0.0000100 epoch_Time:24877.0min: [2024-01-05 17:23:13,314][model8_pretrain.py][INFO] Epoch:[0/2](676700/4588595) loss:2.661 lr:0.0000100 epoch_Time:24877.0min: [2024-01-05 17:23:13,314][model8_pretrain.py][INFO] Epoch:[0/2](676700/4588595) loss:2.808 lr:0.0000100 epoch_Time:24877.0min: [2024-01-05 17:23:13,315][model8_pretrain.py][INFO] Epoch:[0/2](676700/4588595) loss:3.113 lr:0.0000100 epoch_Time:24877.0min: [2024-01-05 17:23:50,266][model8_pretrain.py][INFO] Epoch:[0/2](676800/4588595) loss:3.104 lr:0.0000100 epoch_Time:24876.0min: [2024-01-05 17:23:50,267][model8_pretrain.py][INFO] Epoch:[0/2](676800/4588595) loss:3.348 lr:0.0000100 epoch_Time:24876.0min: [2024-01-05 17:23:50,267][model8_pretrain.py][INFO] Epoch:[0/2](676800/4588595) loss:2.361 lr:0.0000100 epoch_Time:24876.0min: [2024-01-05 17:23:50,267][model8_pretrain.py][INFO] Epoch:[0/2](676800/4588595) loss:3.424 lr:0.0000100 epoch_Time:24876.0min: [2024-01-05 17:23:50,267][model8_pretrain.py][INFO] Epoch:[0/2](676800/4588595) loss:2.486 lr:0.0000100 epoch_Time:24876.0min: [2024-01-05 17:23:50,267][model8_pretrain.py][INFO] Epoch:[0/2](676800/4588595) loss:2.632 lr:0.0000100 epoch_Time:24876.0min: [2024-01-05 17:23:50,267][model8_pretrain.py][INFO] Epoch:[0/2](676800/4588595) loss:2.665 lr:0.0000100 epoch_Time:24876.0min: [2024-01-05 17:23:50,267][model8_pretrain.py][INFO] Epoch:[0/2](676800/4588595) loss:2.783 lr:0.0000100 epoch_Time:24876.0min: [2024-01-05 17:24:27,208][model8_pretrain.py][INFO] Epoch:[0/2](676900/4588595) loss:3.079 lr:0.0000100 epoch_Time:24876.0min: [2024-01-05 17:24:27,208][model8_pretrain.py][INFO] Epoch:[0/2](676900/4588595) loss:2.719 lr:0.0000100 epoch_Time:24876.0min: [2024-01-05 17:24:27,208][model8_pretrain.py][INFO] 
Epoch:[0/2](676900/4588595) loss:3.580 lr:0.0000100 epoch_Time:24876.0min: [2024-01-05 17:24:27,208][model8_pretrain.py][INFO] Epoch:[0/2](676900/4588595) loss:3.035 lr:0.0000100 epoch_Time:24876.0min: [2024-01-05 17:24:27,208][model8_pretrain.py][INFO] Epoch:[0/2](676900/4588595) loss:3.221 lr:0.0000100 epoch_Time:24876.0min: [2024-01-05 17:24:27,208][model8_pretrain.py][INFO] Epoch:[0/2](676900/4588595) loss:3.281 lr:0.0000100 epoch_Time:24876.0min: [2024-01-05 17:24:27,208][model8_pretrain.py][INFO] Epoch:[0/2](676900/4588595) loss:2.572 lr:0.0000100 epoch_Time:24876.0min: [2024-01-05 17:24:27,209][model8_pretrain.py][INFO] Epoch:[0/2](676900/4588595) loss:3.083 lr:0.0000100 epoch_Time:24876.0min: [2024-01-05 17:25:04,139][model8_pretrain.py][INFO] Epoch:[0/2](677000/4588595) loss:2.389 lr:0.0000100 epoch_Time:24875.0min: [2024-01-05 17:25:04,139][model8_pretrain.py][INFO] Epoch:[0/2](677000/4588595) loss:2.665 lr:0.0000100 epoch_Time:24875.0min: [2024-01-05 17:25:04,139][model8_pretrain.py][INFO] Epoch:[0/2](677000/4588595) loss:2.249 lr:0.0000100 epoch_Time:24875.0min: [2024-01-05 17:25:04,139][model8_pretrain.py][INFO] Epoch:[0/2](677000/4588595) loss:2.169 lr:0.0000100 epoch_Time:24875.0min: [2024-01-05 17:25:04,139][model8_pretrain.py][INFO] Epoch:[0/2](677000/4588595) loss:2.933 lr:0.0000100 epoch_Time:24875.0min: [2024-01-05 17:25:04,139][model8_pretrain.py][INFO] Epoch:[0/2](677000/4588595) loss:3.097 lr:0.0000100 epoch_Time:24875.0min: [2024-01-05 17:25:04,139][model8_pretrain.py][INFO] Epoch:[0/2](677000/4588595) loss:3.347 lr:0.0000100 epoch_Time:24875.0min: [2024-01-05 17:25:04,139][model8_pretrain.py][INFO] Epoch:[0/2](677000/4588595) loss:2.819 lr:0.0000100 epoch_Time:24875.0min: [2024-01-05 17:25:41,081][model8_pretrain.py][INFO] Epoch:[0/2](677100/4588595) loss:2.301 lr:0.0000100 epoch_Time:24875.0min: [2024-01-05 17:25:41,081][model8_pretrain.py][INFO] Epoch:[0/2](677100/4588595) loss:2.759 lr:0.0000100 epoch_Time:24875.0min: [2024-01-05 
17:25:41,081][model8_pretrain.py][INFO] Epoch:[0/2](677100/4588595) loss:3.272 lr:0.0000100 epoch_Time:24875.0min: [2024-01-05 17:25:41,081][model8_pretrain.py][INFO] Epoch:[0/2](677100/4588595) loss:2.517 lr:0.0000100 epoch_Time:24875.0min: [2024-01-05 17:25:41,081][model8_pretrain.py][INFO] Epoch:[0/2](677100/4588595) loss:2.770 lr:0.0000100 epoch_Time:24875.0min: [2024-01-05 17:25:41,081][model8_pretrain.py][INFO] Epoch:[0/2](677100/4588595) loss:3.351 lr:0.0000100 epoch_Time:24875.0min: [2024-01-05 17:25:41,081][model8_pretrain.py][INFO] Epoch:[0/2](677100/4588595) loss:3.400 lr:0.0000100 epoch_Time:24875.0min: [2024-01-05 17:25:41,081][model8_pretrain.py][INFO] Epoch:[0/2](677100/4588595) loss:2.633 lr:0.0000100 epoch_Time:24875.0min: [2024-01-05 17:26:17,989][model8_pretrain.py][INFO] Epoch:[0/2](677200/4588595) loss:2.952 lr:0.0000100 epoch_Time:24874.0min: [2024-01-05 17:26:17,989][model8_pretrain.py][INFO] Epoch:[0/2](677200/4588595) loss:3.146 lr:0.0000100 epoch_Time:24874.0min: [2024-01-05 17:26:17,989][model8_pretrain.py][INFO] Epoch:[0/2](677200/4588595) loss:2.547 lr:0.0000100 epoch_Time:24874.0min: [2024-01-05 17:26:17,989][model8_pretrain.py][INFO] Epoch:[0/2](677200/4588595) loss:2.599 lr:0.0000100 epoch_Time:24874.0min: [2024-01-05 17:26:17,989][model8_pretrain.py][INFO] Epoch:[0/2](677200/4588595) loss:3.505 lr:0.0000100 epoch_Time:24874.0min: [2024-01-05 17:26:17,989][model8_pretrain.py][INFO] Epoch:[0/2](677200/4588595) loss:3.160 lr:0.0000100 epoch_Time:24874.0min: [2024-01-05 17:26:17,989][model8_pretrain.py][INFO] Epoch:[0/2](677200/4588595) loss:2.926 lr:0.0000100 epoch_Time:24874.0min: [2024-01-05 17:26:17,989][model8_pretrain.py][INFO] Epoch:[0/2](677200/4588595) loss:3.059 lr:0.0000100 epoch_Time:24874.0min: [2024-01-05 17:26:56,617][model8_pretrain.py][INFO] Epoch:[0/2](677300/4588595) loss:1.967 lr:0.0000100 epoch_Time:24873.0min: [2024-01-05 17:26:56,617][model8_pretrain.py][INFO] Epoch:[0/2](677300/4588595) loss:2.564 lr:0.0000100 
epoch_Time:24873.0min: [2024-01-05 17:26:56,617][model8_pretrain.py][INFO] Epoch:[0/2](677300/4588595) loss:3.287 lr:0.0000100 epoch_Time:24873.0min: [2024-01-05 17:26:56,617][model8_pretrain.py][INFO] Epoch:[0/2](677300/4588595) loss:3.316 lr:0.0000100 epoch_Time:24873.0min: [2024-01-05 17:26:56,617][model8_pretrain.py][INFO] Epoch:[0/2](677300/4588595) loss:2.965 lr:0.0000100 epoch_Time:24873.0min: [2024-01-05 17:26:56,617][model8_pretrain.py][INFO] Epoch:[0/2](677300/4588595) loss:3.255 lr:0.0000100 epoch_Time:24873.0min: [2024-01-05 17:26:56,617][model8_pretrain.py][INFO] Epoch:[0/2](677300/4588595) loss:3.071 lr:0.0000100 epoch_Time:24873.0min: [2024-01-05 17:26:56,617][model8_pretrain.py][INFO] Epoch:[0/2](677300/4588595) loss:2.902 lr:0.0000100 epoch_Time:24873.0min: [2024-01-05 17:27:43,863][model8_pretrain.py][INFO] Epoch:[0/2](677400/4588595) loss:3.028 lr:0.0000100 epoch_Time:24874.0min: [2024-01-05 17:27:43,863][model8_pretrain.py][INFO] Epoch:[0/2](677400/4588595) loss:2.981 lr:0.0000100 epoch_Time:24874.0min: [2024-01-05 17:27:43,863][model8_pretrain.py][INFO] Epoch:[0/2](677400/4588595) loss:3.105 lr:0.0000100 epoch_Time:24874.0min: [2024-01-05 17:27:43,863][model8_pretrain.py][INFO] Epoch:[0/2](677400/4588595) loss:3.070 lr:0.0000100 epoch_Time:24874.0min: [2024-01-05 17:27:43,863][model8_pretrain.py][INFO] Epoch:[0/2](677400/4588595) loss:3.087 lr:0.0000100 epoch_Time:24874.0min: [2024-01-05 17:27:43,863][model8_pretrain.py][INFO] Epoch:[0/2](677400/4588595) loss:2.400 lr:0.0000100 epoch_Time:24874.0min: [2024-01-05 17:27:43,863][model8_pretrain.py][INFO] Epoch:[0/2](677400/4588595) loss:2.803 lr:0.0000100 epoch_Time:24874.0min: [2024-01-05 17:27:43,863][model8_pretrain.py][INFO] Epoch:[0/2](677400/4588595) loss:2.706 lr:0.0000100 epoch_Time:24874.0min: [2024-01-05 17:28:20,806][model8_pretrain.py][INFO] Epoch:[0/2](677500/4588595) loss:2.961 lr:0.0000100 epoch_Time:24873.0min: [2024-01-05 17:28:20,806][model8_pretrain.py][INFO] 
Epoch:[0/2](677500/4588595) loss:3.131 lr:0.0000100 epoch_Time:24873.0min: [2024-01-05 17:28:20,806][model8_pretrain.py][INFO] Epoch:[0/2](677500/4588595) loss:3.013 lr:0.0000100 epoch_Time:24873.0min: [2024-01-05 17:28:20,806][model8_pretrain.py][INFO] Epoch:[0/2](677500/4588595) loss:3.167 lr:0.0000100 epoch_Time:24873.0min: [2024-01-05 17:28:20,806][model8_pretrain.py][INFO] Epoch:[0/2](677500/4588595) loss:3.034 lr:0.0000100 epoch_Time:24873.0min: [2024-01-05 17:28:20,806][model8_pretrain.py][INFO] Epoch:[0/2](677500/4588595) loss:2.724 lr:0.0000100 epoch_Time:24873.0min: [2024-01-05 17:28:20,806][model8_pretrain.py][INFO] Epoch:[0/2](677500/4588595) loss:2.888 lr:0.0000100 epoch_Time:24873.0min: [2024-01-05 17:28:20,806][model8_pretrain.py][INFO] Epoch:[0/2](677500/4588595) loss:2.984 lr:0.0000100 epoch_Time:24873.0min: [2024-01-05 17:28:57,747][model8_pretrain.py][INFO] Epoch:[0/2](677600/4588595) loss:2.511 lr:0.0000100 epoch_Time:24871.0min: [2024-01-05 17:28:57,747][model8_pretrain.py][INFO] Epoch:[0/2](677600/4588595) loss:2.966 lr:0.0000100 epoch_Time:24871.0min: [2024-01-05 17:28:57,747][model8_pretrain.py][INFO] Epoch:[0/2](677600/4588595) loss:2.694 lr:0.0000100 epoch_Time:24871.0min: [2024-01-05 17:28:57,747][model8_pretrain.py][INFO] Epoch:[0/2](677600/4588595) loss:3.071 lr:0.0000100 epoch_Time:24871.0min: [2024-01-05 17:28:57,747][model8_pretrain.py][INFO] Epoch:[0/2](677600/4588595) loss:2.608 lr:0.0000100 epoch_Time:24871.0min: [2024-01-05 17:28:57,747][model8_pretrain.py][INFO] Epoch:[0/2](677600/4588595) loss:2.898 lr:0.0000100 epoch_Time:24871.0min: [2024-01-05 17:28:57,747][model8_pretrain.py][INFO] Epoch:[0/2](677600/4588595) loss:2.921 lr:0.0000100 epoch_Time:24871.0min: [2024-01-05 17:28:57,748][model8_pretrain.py][INFO] Epoch:[0/2](677600/4588595) loss:3.007 lr:0.0000100 epoch_Time:24871.0min: [2024-01-05 17:29:34,693][model8_pretrain.py][INFO] Epoch:[0/2](677700/4588595) loss:2.191 lr:0.0000100 epoch_Time:24871.0min: [2024-01-05 
17:29:34,693][model8_pretrain.py][INFO] Epoch:[0/2](677700/4588595) loss:2.924 lr:0.0000100 epoch_Time:24871.0min: [2024-01-05 17:29:34,693][model8_pretrain.py][INFO] Epoch:[0/2](677700/4588595) loss:2.366 lr:0.0000100 epoch_Time:24871.0min: [2024-01-05 17:29:34,693][model8_pretrain.py][INFO] Epoch:[0/2](677700/4588595) loss:2.734 lr:0.0000100 epoch_Time:24871.0min: [2024-01-05 17:29:34,693][model8_pretrain.py][INFO] Epoch:[0/2](677700/4588595) loss:2.875 lr:0.0000100 epoch_Time:24871.0min: [2024-01-05 17:29:34,693][model8_pretrain.py][INFO] Epoch:[0/2](677700/4588595) loss:2.261 lr:0.0000100 epoch_Time:24871.0min: [2024-01-05 17:29:34,694][model8_pretrain.py][INFO] Epoch:[0/2](677700/4588595) loss:2.365 lr:0.0000100 epoch_Time:24871.0min: [2024-01-05 17:29:34,694][model8_pretrain.py][INFO] Epoch:[0/2](677700/4588595) loss:2.558 lr:0.0000100 epoch_Time:24871.0min: [2024-01-05 17:30:11,648][model8_pretrain.py][INFO] Epoch:[0/2](677800/4588595) loss:3.058 lr:0.0000100 epoch_Time:24870.0min: [2024-01-05 17:30:11,648][model8_pretrain.py][INFO] Epoch:[0/2](677800/4588595) loss:2.815 lr:0.0000100 epoch_Time:24870.0min: [2024-01-05 17:30:11,648][model8_pretrain.py][INFO] Epoch:[0/2](677800/4588595) loss:2.487 lr:0.0000100 epoch_Time:24870.0min: [2024-01-05 17:30:11,648][model8_pretrain.py][INFO] Epoch:[0/2](677800/4588595) loss:2.633 lr:0.0000100 epoch_Time:24870.0min: [2024-01-05 17:30:11,648][model8_pretrain.py][INFO] Epoch:[0/2](677800/4588595) loss:2.747 lr:0.0000100 epoch_Time:24870.0min: [2024-01-05 17:30:11,648][model8_pretrain.py][INFO] Epoch:[0/2](677800/4588595) loss:2.881 lr:0.0000100 epoch_Time:24870.0min: [2024-01-05 17:30:11,648][model8_pretrain.py][INFO] Epoch:[0/2](677800/4588595) loss:1.971 lr:0.0000100 epoch_Time:24870.0min: [2024-01-05 17:30:11,648][model8_pretrain.py][INFO] Epoch:[0/2](677800/4588595) loss:3.196 lr:0.0000100 epoch_Time:24870.0min: [2024-01-05 17:30:48,593][model8_pretrain.py][INFO] Epoch:[0/2](677900/4588595) loss:2.534 lr:0.0000100 
epoch_Time:24869.0min: [2024-01-05 17:30:48,593][model8_pretrain.py][INFO] Epoch:[0/2](677900/4588595) loss:2.523 lr:0.0000100 epoch_Time:24869.0min: [2024-01-05 17:30:48,593][model8_pretrain.py][INFO] Epoch:[0/2](677900/4588595) loss:2.970 lr:0.0000100 epoch_Time:24869.0min: [2024-01-05 17:30:48,593][model8_pretrain.py][INFO] Epoch:[0/2](677900/4588595) loss:3.129 lr:0.0000100 epoch_Time:24869.0min: [2024-01-05 17:30:48,593][model8_pretrain.py][INFO] Epoch:[0/2](677900/4588595) loss:2.763 lr:0.0000100 epoch_Time:24869.0min: [2024-01-05 17:30:48,593][model8_pretrain.py][INFO] Epoch:[0/2](677900/4588595) loss:2.923 lr:0.0000100 epoch_Time:24869.0min: [2024-01-05 17:30:48,594][model8_pretrain.py][INFO] Epoch:[0/2](677900/4588595) loss:3.221 lr:0.0000100 epoch_Time:24869.0min: [2024-01-05 17:30:48,594][model8_pretrain.py][INFO] Epoch:[0/2](677900/4588595) loss:2.869 lr:0.0000100 epoch_Time:24869.0min: [2024-01-05 17:31:25,531][model8_pretrain.py][INFO] Epoch:[0/2](678000/4588595) loss:2.764 lr:0.0000100 epoch_Time:24869.0min: [2024-01-05 17:31:25,531][model8_pretrain.py][INFO] Epoch:[0/2](678000/4588595) loss:3.054 lr:0.0000100 epoch_Time:24869.0min: [2024-01-05 17:31:25,531][model8_pretrain.py][INFO] Epoch:[0/2](678000/4588595) loss:3.465 lr:0.0000100 epoch_Time:24869.0min: [2024-01-05 17:31:25,531][model8_pretrain.py][INFO] Epoch:[0/2](678000/4588595) loss:2.461 lr:0.0000100 epoch_Time:24869.0min: [2024-01-05 17:31:25,531][model8_pretrain.py][INFO] Epoch:[0/2](678000/4588595) loss:2.859 lr:0.0000100 epoch_Time:24869.0min: [2024-01-05 17:31:25,531][model8_pretrain.py][INFO] Epoch:[0/2](678000/4588595) loss:2.719 lr:0.0000100 epoch_Time:24869.0min: [2024-01-05 17:31:25,531][model8_pretrain.py][INFO] Epoch:[0/2](678000/4588595) loss:2.531 lr:0.0000100 epoch_Time:24869.0min: [2024-01-05 17:31:25,531][model8_pretrain.py][INFO] Epoch:[0/2](678000/4588595) loss:3.274 lr:0.0000100 epoch_Time:24869.0min: [2024-01-05 17:32:04,181][model8_pretrain.py][INFO] 
Epoch:[0/2](678100/4588595) loss:2.010 lr:0.0000100 epoch_Time:24868.0min: [2024-01-05 17:32:04,181][model8_pretrain.py][INFO] Epoch:[0/2](678100/4588595) loss:3.153 lr:0.0000100 epoch_Time:24868.0min: [2024-01-05 17:32:04,181][model8_pretrain.py][INFO] Epoch:[0/2](678100/4588595) loss:2.429 lr:0.0000100 epoch_Time:24868.0min: [2024-01-05 17:32:04,181][model8_pretrain.py][INFO] Epoch:[0/2](678100/4588595) loss:2.603 lr:0.0000100 epoch_Time:24868.0min: [2024-01-05 17:32:04,181][model8_pretrain.py][INFO] Epoch:[0/2](678100/4588595) loss:3.168 lr:0.0000100 epoch_Time:24868.0min: [2024-01-05 17:32:04,181][model8_pretrain.py][INFO] Epoch:[0/2](678100/4588595) loss:2.970 lr:0.0000100 epoch_Time:24868.0min: [2024-01-05 17:32:04,181][model8_pretrain.py][INFO] Epoch:[0/2](678100/4588595) loss:2.472 lr:0.0000100 epoch_Time:24868.0min: [2024-01-05 17:32:04,181][model8_pretrain.py][INFO] Epoch:[0/2](678100/4588595) loss:2.307 lr:0.0000100 epoch_Time:24868.0min: [2024-01-05 17:32:51,400][model8_pretrain.py][INFO] Epoch:[0/2](678200/4588595) loss:3.037 lr:0.0000100 epoch_Time:24868.0min: [2024-01-05 17:32:51,400][model8_pretrain.py][INFO] Epoch:[0/2](678200/4588595) loss:3.186 lr:0.0000100 epoch_Time:24868.0min: [2024-01-05 17:32:51,400][model8_pretrain.py][INFO] Epoch:[0/2](678200/4588595) loss:2.537 lr:0.0000100 epoch_Time:24868.0min: [2024-01-05 17:32:51,400][model8_pretrain.py][INFO] Epoch:[0/2](678200/4588595) loss:2.848 lr:0.0000100 epoch_Time:24868.0min: [2024-01-05 17:32:51,400][model8_pretrain.py][INFO] Epoch:[0/2](678200/4588595) loss:3.027 lr:0.0000100 epoch_Time:24868.0min: [2024-01-05 17:32:51,400][model8_pretrain.py][INFO] Epoch:[0/2](678200/4588595) loss:2.408 lr:0.0000100 epoch_Time:24868.0min: [2024-01-05 17:32:51,400][model8_pretrain.py][INFO] Epoch:[0/2](678200/4588595) loss:2.139 lr:0.0000100 epoch_Time:24868.0min: [2024-01-05 17:32:51,400][model8_pretrain.py][INFO] Epoch:[0/2](678200/4588595) loss:3.296 lr:0.0000100 epoch_Time:24868.0min: [2024-01-05 
17:33:28,328][model8_pretrain.py][INFO] Epoch:[0/2](678300/4588595) loss:3.017 lr:0.0000100 epoch_Time:24868.0min: [2024-01-05 17:33:28,328][model8_pretrain.py][INFO] Epoch:[0/2](678300/4588595) loss:2.362 lr:0.0000100 epoch_Time:24868.0min: [2024-01-05 17:33:28,328][model8_pretrain.py][INFO] Epoch:[0/2](678300/4588595) loss:2.900 lr:0.0000100 epoch_Time:24868.0min: [2024-01-05 17:33:28,328][model8_pretrain.py][INFO] Epoch:[0/2](678300/4588595) loss:2.993 lr:0.0000100 epoch_Time:24868.0min: [2024-01-05 17:33:28,328][model8_pretrain.py][INFO] Epoch:[0/2](678300/4588595) loss:3.242 lr:0.0000100 epoch_Time:24868.0min: [2024-01-05 17:33:28,328][model8_pretrain.py][INFO] Epoch:[0/2](678300/4588595) loss:3.052 lr:0.0000100 epoch_Time:24868.0min: [2024-01-05 17:33:28,328][model8_pretrain.py][INFO] Epoch:[0/2](678300/4588595) loss:3.053 lr:0.0000100 epoch_Time:24868.0min: [2024-01-05 17:33:28,328][model8_pretrain.py][INFO] Epoch:[0/2](678300/4588595) loss:3.256 lr:0.0000100 epoch_Time:24868.0min: [2024-01-05 17:34:05,271][model8_pretrain.py][INFO] Epoch:[0/2](678400/4588595) loss:2.902 lr:0.0000100 epoch_Time:24867.0min: [2024-01-05 17:34:05,271][model8_pretrain.py][INFO] Epoch:[0/2](678400/4588595) loss:2.845 lr:0.0000100 epoch_Time:24867.0min: [2024-01-05 17:34:05,271][model8_pretrain.py][INFO] Epoch:[0/2](678400/4588595) loss:2.826 lr:0.0000100 epoch_Time:24867.0min: [2024-01-05 17:34:05,271][model8_pretrain.py][INFO] Epoch:[0/2](678400/4588595) loss:2.576 lr:0.0000100 epoch_Time:24867.0min: [2024-01-05 17:34:05,271][model8_pretrain.py][INFO] Epoch:[0/2](678400/4588595) loss:2.398 lr:0.0000100 epoch_Time:24867.0min: [2024-01-05 17:34:05,271][model8_pretrain.py][INFO] Epoch:[0/2](678400/4588595) loss:2.692 lr:0.0000100 epoch_Time:24867.0min: [2024-01-05 17:34:05,271][model8_pretrain.py][INFO] Epoch:[0/2](678400/4588595) loss:2.797 lr:0.0000100 epoch_Time:24867.0min: [2024-01-05 17:34:05,271][model8_pretrain.py][INFO] Epoch:[0/2](678400/4588595) loss:2.433 lr:0.0000100 
epoch_Time:24867.0min: [2024-01-05 17:34:42,210][model8_pretrain.py][INFO] Epoch:[0/2](678500/4588595) loss:2.056 lr:0.0000100 epoch_Time:24867.0min: [2024-01-05 17:34:42,210][model8_pretrain.py][INFO] Epoch:[0/2](678500/4588595) loss:2.879 lr:0.0000100 epoch_Time:24867.0min: [2024-01-05 17:34:42,210][model8_pretrain.py][INFO] Epoch:[0/2](678500/4588595) loss:2.773 lr:0.0000100 epoch_Time:24867.0min: [2024-01-05 17:34:42,210][model8_pretrain.py][INFO] Epoch:[0/2](678500/4588595) loss:2.750 lr:0.0000100 epoch_Time:24867.0min: [2024-01-05 17:34:42,210][model8_pretrain.py][INFO] Epoch:[0/2](678500/4588595) loss:2.947 lr:0.0000100 epoch_Time:24867.0min: [2024-01-05 17:34:42,210][model8_pretrain.py][INFO] Epoch:[0/2](678500/4588595) loss:2.796 lr:0.0000100 epoch_Time:24867.0min: [2024-01-05 17:34:42,210][model8_pretrain.py][INFO] Epoch:[0/2](678500/4588595) loss:2.606 lr:0.0000100 epoch_Time:24867.0min: [2024-01-05 17:34:42,211][model8_pretrain.py][INFO] Epoch:[0/2](678500/4588595) loss:2.861 lr:0.0000100 epoch_Time:24867.0min: [2024-01-05 17:35:19,158][model8_pretrain.py][INFO] Epoch:[0/2](678600/4588595) loss:3.545 lr:0.0000100 epoch_Time:24865.0min: [2024-01-05 17:35:19,158][model8_pretrain.py][INFO] Epoch:[0/2](678600/4588595) loss:2.663 lr:0.0000100 epoch_Time:24865.0min: [2024-01-05 17:35:19,159][model8_pretrain.py][INFO] Epoch:[0/2](678600/4588595) loss:2.929 lr:0.0000100 epoch_Time:24865.0min: [2024-01-05 17:35:19,159][model8_pretrain.py][INFO] Epoch:[0/2](678600/4588595) loss:2.772 lr:0.0000100 epoch_Time:24865.0min: [2024-01-05 17:35:19,159][model8_pretrain.py][INFO] Epoch:[0/2](678600/4588595) loss:2.695 lr:0.0000100 epoch_Time:24865.0min: [2024-01-05 17:35:19,159][model8_pretrain.py][INFO] Epoch:[0/2](678600/4588595) loss:2.643 lr:0.0000100 epoch_Time:24865.0min: [2024-01-05 17:35:19,159][model8_pretrain.py][INFO] Epoch:[0/2](678600/4588595) loss:3.118 lr:0.0000100 epoch_Time:24865.0min: [2024-01-05 17:35:19,159][model8_pretrain.py][INFO] 
Epoch:[0/2](678600/4588595) loss:3.139 lr:0.0000100 epoch_Time:24865.0min: [2024-01-05 17:35:56,099][model8_pretrain.py][INFO] Epoch:[0/2](678700/4588595) loss:3.174 lr:0.0000100 epoch_Time:24864.0min: [2024-01-05 17:35:56,099][model8_pretrain.py][INFO] Epoch:[0/2](678700/4588595) loss:2.797 lr:0.0000100 epoch_Time:24864.0min: [2024-01-05 17:35:56,099][model8_pretrain.py][INFO] Epoch:[0/2](678700/4588595) loss:2.895 lr:0.0000100 epoch_Time:24864.0min: [2024-01-05 17:35:56,099][model8_pretrain.py][INFO] Epoch:[0/2](678700/4588595) loss:2.992 lr:0.0000100 epoch_Time:24864.0min: [2024-01-05 17:35:56,099][model8_pretrain.py][INFO] Epoch:[0/2](678700/4588595) loss:2.656 lr:0.0000100 epoch_Time:24864.0min: [2024-01-05 17:35:56,099][model8_pretrain.py][INFO] Epoch:[0/2](678700/4588595) loss:3.005 lr:0.0000100 epoch_Time:24864.0min: [2024-01-05 17:35:56,099][model8_pretrain.py][INFO] Epoch:[0/2](678700/4588595) loss:2.497 lr:0.0000100 epoch_Time:24864.0min: [2024-01-05 17:35:56,099][model8_pretrain.py][INFO] Epoch:[0/2](678700/4588595) loss:2.986 lr:0.0000100 epoch_Time:24864.0min: [2024-01-05 17:36:33,039][model8_pretrain.py][INFO] Epoch:[0/2](678800/4588595) loss:3.350 lr:0.0000100 epoch_Time:24864.0min: [2024-01-05 17:36:33,039][model8_pretrain.py][INFO] Epoch:[0/2](678800/4588595) loss:3.121 lr:0.0000100 epoch_Time:24864.0min: [2024-01-05 17:36:33,039][model8_pretrain.py][INFO] Epoch:[0/2](678800/4588595) loss:3.206 lr:0.0000100 epoch_Time:24864.0min: [2024-01-05 17:36:33,039][model8_pretrain.py][INFO] Epoch:[0/2](678800/4588595) loss:3.360 lr:0.0000100 epoch_Time:24864.0min: [2024-01-05 17:36:33,039][model8_pretrain.py][INFO] Epoch:[0/2](678800/4588595) loss:3.298 lr:0.0000100 epoch_Time:24864.0min: [2024-01-05 17:36:33,040][model8_pretrain.py][INFO] Epoch:[0/2](678800/4588595) loss:3.188 lr:0.0000100 epoch_Time:24864.0min: [2024-01-05 17:36:33,040][model8_pretrain.py][INFO] Epoch:[0/2](678800/4588595) loss:3.178 lr:0.0000100 epoch_Time:24864.0min: [2024-01-05 
17:36:33,040][model8_pretrain.py][INFO] Epoch:[0/2](678800/4588595) loss:2.726 lr:0.0000100 epoch_Time:24864.0min: [2024-01-05 17:37:09,986][model8_pretrain.py][INFO] Epoch:[0/2](678900/4588595) loss:3.106 lr:0.0000100 epoch_Time:24863.0min: [2024-01-05 17:37:09,987][model8_pretrain.py][INFO] Epoch:[0/2](678900/4588595) loss:2.777 lr:0.0000100 epoch_Time:24863.0min: [2024-01-05 17:37:09,986][model8_pretrain.py][INFO] Epoch:[0/2](678900/4588595) loss:2.999 lr:0.0000100 epoch_Time:24863.0min: [2024-01-05 17:37:09,987][model8_pretrain.py][INFO] Epoch:[0/2](678900/4588595) loss:2.729 lr:0.0000100 epoch_Time:24863.0min: [2024-01-05 17:37:09,987][model8_pretrain.py][INFO] Epoch:[0/2](678900/4588595) loss:2.881 lr:0.0000100 epoch_Time:24863.0min: [2024-01-05 17:37:09,987][model8_pretrain.py][INFO] Epoch:[0/2](678900/4588595) loss:2.531 lr:0.0000100 epoch_Time:24863.0min: [2024-01-05 17:37:09,987][model8_pretrain.py][INFO] Epoch:[0/2](678900/4588595) loss:2.971 lr:0.0000100 epoch_Time:24863.0min: [2024-01-05 17:37:11,688][model8_pretrain.py][INFO] Epoch:[0/2](678900/4588595) loss:2.310 lr:0.0000100 epoch_Time:24863.0min: [2024-01-05 17:37:58,663][model8_pretrain.py][INFO] Epoch:[0/2](679000/4588595) loss:2.587 lr:0.0000100 epoch_Time:24863.0min: [2024-01-05 17:37:58,663][model8_pretrain.py][INFO] Epoch:[0/2](679000/4588595) loss:3.050 lr:0.0000100 epoch_Time:24863.0min: [2024-01-05 17:37:58,663][model8_pretrain.py][INFO] Epoch:[0/2](679000/4588595) loss:3.244 lr:0.0000100 epoch_Time:24863.0min: [2024-01-05 17:37:58,663][model8_pretrain.py][INFO] Epoch:[0/2](679000/4588595) loss:2.952 lr:0.0000100 epoch_Time:24863.0min: [2024-01-05 17:37:58,663][model8_pretrain.py][INFO] Epoch:[0/2](679000/4588595) loss:2.044 lr:0.0000100 epoch_Time:24863.0min: [2024-01-05 17:37:58,663][model8_pretrain.py][INFO] Epoch:[0/2](679000/4588595) loss:3.007 lr:0.0000100 epoch_Time:24863.0min: [2024-01-05 17:37:58,663][model8_pretrain.py][INFO] Epoch:[0/2](679000/4588595) loss:2.837 lr:0.0000100 
epoch_Time:24863.0min: [2024-01-05 17:37:58,663][model8_pretrain.py][INFO] Epoch:[0/2](679000/4588595) loss:2.448 lr:0.0000100 epoch_Time:24863.0min: [2024-01-05 17:38:35,597][model8_pretrain.py][INFO] Epoch:[0/2](679100/4588595) loss:3.311 lr:0.0000100 epoch_Time:24863.0min: [2024-01-05 17:38:35,597][model8_pretrain.py][INFO] Epoch:[0/2](679100/4588595) loss:2.463 lr:0.0000100 epoch_Time:24863.0min: [2024-01-05 17:38:35,597][model8_pretrain.py][INFO] Epoch:[0/2](679100/4588595) loss:3.123 lr:0.0000100 epoch_Time:24863.0min: [2024-01-05 17:38:35,597][model8_pretrain.py][INFO] Epoch:[0/2](679100/4588595) loss:2.692 lr:0.0000100 epoch_Time:24863.0min: [2024-01-05 17:38:35,597][model8_pretrain.py][INFO] Epoch:[0/2](679100/4588595) loss:2.753 lr:0.0000100 epoch_Time:24863.0min: [2024-01-05 17:38:35,597][model8_pretrain.py][INFO] Epoch:[0/2](679100/4588595) loss:2.193 lr:0.0000100 epoch_Time:24863.0min: [2024-01-05 17:38:35,597][model8_pretrain.py][INFO] Epoch:[0/2](679100/4588595) loss:2.917 lr:0.0000100 epoch_Time:24863.0min: [2024-01-05 17:38:35,597][model8_pretrain.py][INFO] Epoch:[0/2](679100/4588595) loss:3.229 lr:0.0000100 epoch_Time:24863.0min: [2024-01-05 17:39:12,533][model8_pretrain.py][INFO] Epoch:[0/2](679200/4588595) loss:3.556 lr:0.0000100 epoch_Time:24862.0min: [2024-01-05 17:39:12,533][model8_pretrain.py][INFO] Epoch:[0/2](679200/4588595) loss:2.736 lr:0.0000100 epoch_Time:24862.0min: [2024-01-05 17:39:12,533][model8_pretrain.py][INFO] Epoch:[0/2](679200/4588595) loss:2.271 lr:0.0000100 epoch_Time:24862.0min: [2024-01-05 17:39:12,533][model8_pretrain.py][INFO] Epoch:[0/2](679200/4588595) loss:2.909 lr:0.0000100 epoch_Time:24862.0min: [2024-01-05 17:39:12,533][model8_pretrain.py][INFO] Epoch:[0/2](679200/4588595) loss:3.314 lr:0.0000100 epoch_Time:24862.0min: [2024-01-05 17:39:12,533][model8_pretrain.py][INFO] Epoch:[0/2](679200/4588595) loss:2.780 lr:0.0000100 epoch_Time:24862.0min: [2024-01-05 17:39:12,533][model8_pretrain.py][INFO] 
Epoch:[0/2](679200/4588595) loss:2.784 lr:0.0000100 epoch_Time:24862.0min: [2024-01-05 17:39:12,533][model8_pretrain.py][INFO] Epoch:[0/2](679200/4588595) loss:2.925 lr:0.0000100 epoch_Time:24862.0min: [2024-01-05 17:39:49,473][model8_pretrain.py][INFO] Epoch:[0/2](679300/4588595) loss:2.890 lr:0.0000100 epoch_Time:24861.0min: [2024-01-05 17:39:49,473][model8_pretrain.py][INFO] Epoch:[0/2](679300/4588595) loss:2.795 lr:0.0000100 epoch_Time:24861.0min: [2024-01-05 17:39:49,473][model8_pretrain.py][INFO] Epoch:[0/2](679300/4588595) loss:2.874 lr:0.0000100 epoch_Time:24861.0min: [2024-01-05 17:39:49,473][model8_pretrain.py][INFO] Epoch:[0/2](679300/4588595) loss:3.066 lr:0.0000100 epoch_Time:24861.0min: [2024-01-05 17:39:49,474][model8_pretrain.py][INFO] Epoch:[0/2](679300/4588595) loss:3.070 lr:0.0000100 epoch_Time:24861.0min: [2024-01-05 17:39:49,474][model8_pretrain.py][INFO] Epoch:[0/2](679300/4588595) loss:3.049 lr:0.0000100 epoch_Time:24861.0min: [2024-01-05 17:39:49,474][model8_pretrain.py][INFO] Epoch:[0/2](679300/4588595) loss:3.123 lr:0.0000100 epoch_Time:24861.0min: [2024-01-05 17:39:49,474][model8_pretrain.py][INFO] Epoch:[0/2](679300/4588595) loss:2.740 lr:0.0000100 epoch_Time:24861.0min: [2024-01-05 17:40:26,417][model8_pretrain.py][INFO] Epoch:[0/2](679400/4588595) loss:2.469 lr:0.0000100 epoch_Time:24861.0min: [2024-01-05 17:40:26,417][model8_pretrain.py][INFO] Epoch:[0/2](679400/4588595) loss:2.607 lr:0.0000100 epoch_Time:24861.0min: [2024-01-05 17:40:26,417][model8_pretrain.py][INFO] Epoch:[0/2](679400/4588595) loss:2.584 lr:0.0000100 epoch_Time:24861.0min: [2024-01-05 17:40:26,417][model8_pretrain.py][INFO] Epoch:[0/2](679400/4588595) loss:3.080 lr:0.0000100 epoch_Time:24861.0min: [2024-01-05 17:40:26,418][model8_pretrain.py][INFO] Epoch:[0/2](679400/4588595) loss:2.550 lr:0.0000100 epoch_Time:24861.0min: [2024-01-05 17:40:26,418][model8_pretrain.py][INFO] Epoch:[0/2](679400/4588595) loss:2.911 lr:0.0000100 epoch_Time:24861.0min: [2024-01-05 
17:40:26,418][model8_pretrain.py][INFO] Epoch:[0/2](679400/4588595) loss:3.212 lr:0.0000100 epoch_Time:24861.0min: [2024-01-05 17:40:26,418][model8_pretrain.py][INFO] Epoch:[0/2](679400/4588595) loss:3.060 lr:0.0000100 epoch_Time:24861.0min: [2024-01-05 17:41:03,370][model8_pretrain.py][INFO] Epoch:[0/2](679500/4588595) loss:2.539 lr:0.0000100 epoch_Time:24860.0min: [2024-01-05 17:41:03,370][model8_pretrain.py][INFO] Epoch:[0/2](679500/4588595) loss:2.889 lr:0.0000100 epoch_Time:24860.0min: [2024-01-05 17:41:03,370][model8_pretrain.py][INFO] Epoch:[0/2](679500/4588595) loss:2.966 lr:0.0000100 epoch_Time:24860.0min: [2024-01-05 17:41:03,370][model8_pretrain.py][INFO] Epoch:[0/2](679500/4588595) loss:2.916 lr:0.0000100 epoch_Time:24860.0min: [2024-01-05 17:41:03,370][model8_pretrain.py][INFO] Epoch:[0/2](679500/4588595) loss:2.742 lr:0.0000100 epoch_Time:24860.0min: [2024-01-05 17:41:03,370][model8_pretrain.py][INFO] Epoch:[0/2](679500/4588595) loss:2.986 lr:0.0000100 epoch_Time:24860.0min: [2024-01-05 17:41:03,370][model8_pretrain.py][INFO] Epoch:[0/2](679500/4588595) loss:2.622 lr:0.0000100 epoch_Time:24860.0min: [2024-01-05 17:41:03,370][model8_pretrain.py][INFO] Epoch:[0/2](679500/4588595) loss:2.600 lr:0.0000100 epoch_Time:24860.0min: [2024-01-05 17:41:40,342][model8_pretrain.py][INFO] Epoch:[0/2](679600/4588595) loss:2.763 lr:0.0000100 epoch_Time:24859.0min: [2024-01-05 17:41:40,342][model8_pretrain.py][INFO] Epoch:[0/2](679600/4588595) loss:2.797 lr:0.0000100 epoch_Time:24859.0min: [2024-01-05 17:41:40,342][model8_pretrain.py][INFO] Epoch:[0/2](679600/4588595) loss:2.727 lr:0.0000100 epoch_Time:24859.0min: [2024-01-05 17:41:40,342][model8_pretrain.py][INFO] Epoch:[0/2](679600/4588595) loss:3.117 lr:0.0000100 epoch_Time:24859.0min: [2024-01-05 17:41:40,342][model8_pretrain.py][INFO] Epoch:[0/2](679600/4588595) loss:2.905 lr:0.0000100 epoch_Time:24859.0min: [2024-01-05 17:41:40,342][model8_pretrain.py][INFO] Epoch:[0/2](679600/4588595) loss:3.156 lr:0.0000100 
epoch_Time:24859.0min: [2024-01-05 17:41:40,342][model8_pretrain.py][INFO] Epoch:[0/2](679600/4588595) loss:3.208 lr:0.0000100 epoch_Time:24859.0min: [2024-01-05 17:41:40,342][model8_pretrain.py][INFO] Epoch:[0/2](679600/4588595) loss:2.957 lr:0.0000100 epoch_Time:24859.0min: [2024-01-05 17:42:17,278][model8_pretrain.py][INFO] Epoch:[0/2](679700/4588595) loss:2.882 lr:0.0000100 epoch_Time:24858.0min: [2024-01-05 17:42:17,278][model8_pretrain.py][INFO] Epoch:[0/2](679700/4588595) loss:2.691 lr:0.0000100 epoch_Time:24858.0min: [2024-01-05 17:42:17,278][model8_pretrain.py][INFO] Epoch:[0/2](679700/4588595) loss:2.642 lr:0.0000100 epoch_Time:24858.0min: [2024-01-05 17:42:17,278][model8_pretrain.py][INFO] Epoch:[0/2](679700/4588595) loss:3.336 lr:0.0000100 epoch_Time:24858.0min: [2024-01-05 17:42:17,278][model8_pretrain.py][INFO] Epoch:[0/2](679700/4588595) loss:2.812 lr:0.0000100 epoch_Time:24858.0min: [2024-01-05 17:42:17,278][model8_pretrain.py][INFO] Epoch:[0/2](679700/4588595) loss:2.539 lr:0.0000100 epoch_Time:24858.0min: [2024-01-05 17:42:17,278][model8_pretrain.py][INFO] Epoch:[0/2](679700/4588595) loss:2.832 lr:0.0000100 epoch_Time:24858.0min: [2024-01-05 17:42:17,278][model8_pretrain.py][INFO] Epoch:[0/2](679700/4588595) loss:2.978 lr:0.0000100 epoch_Time:24858.0min: [2024-01-05 17:43:06,027][model8_pretrain.py][INFO] Epoch:[0/2](679800/4588595) loss:2.433 lr:0.0000100 epoch_Time:24858.0min: [2024-01-05 17:43:06,027][model8_pretrain.py][INFO] Epoch:[0/2](679800/4588595) loss:3.177 lr:0.0000100 epoch_Time:24858.0min: [2024-01-05 17:43:06,027][model8_pretrain.py][INFO] Epoch:[0/2](679800/4588595) loss:2.919 lr:0.0000100 epoch_Time:24858.0min: [2024-01-05 17:43:06,027][model8_pretrain.py][INFO] Epoch:[0/2](679800/4588595) loss:3.253 lr:0.0000100 epoch_Time:24858.0min: [2024-01-05 17:43:06,027][model8_pretrain.py][INFO] Epoch:[0/2](679800/4588595) loss:3.185 lr:0.0000100 epoch_Time:24858.0min: [2024-01-05 17:43:06,027][model8_pretrain.py][INFO] 
Epoch:[0/2](679800/4588595) loss:2.811 lr:0.0000100 epoch_Time:24858.0min: [2024-01-05 17:43:06,027][model8_pretrain.py][INFO] Epoch:[0/2](679800/4588595) loss:2.675 lr:0.0000100 epoch_Time:24858.0min: [2024-01-05 17:43:06,027][model8_pretrain.py][INFO] Epoch:[0/2](679800/4588595) loss:2.632 lr:0.0000100 epoch_Time:24858.0min: [2024-01-05 17:43:42,958][model8_pretrain.py][INFO] Epoch:[0/2](679900/4588595) loss:1.915 lr:0.0000100 epoch_Time:24858.0min: [2024-01-05 17:43:42,958][model8_pretrain.py][INFO] Epoch:[0/2](679900/4588595) loss:2.340 lr:0.0000100 epoch_Time:24858.0min: [2024-01-05 17:43:42,958][model8_pretrain.py][INFO] Epoch:[0/2](679900/4588595) loss:2.690 lr:0.0000100 epoch_Time:24858.0min: [2024-01-05 17:43:42,958][model8_pretrain.py][INFO] Epoch:[0/2](679900/4588595) loss:3.192 lr:0.0000100 epoch_Time:24858.0min: [2024-01-05 17:43:42,958][model8_pretrain.py][INFO] Epoch:[0/2](679900/4588595) loss:3.020 lr:0.0000100 epoch_Time:24858.0min: [2024-01-05 17:43:42,958][model8_pretrain.py][INFO] Epoch:[0/2](679900/4588595) loss:2.455 lr:0.0000100 epoch_Time:24858.0min: [2024-01-05 17:43:42,958][model8_pretrain.py][INFO] Epoch:[0/2](679900/4588595) loss:3.080 lr:0.0000100 epoch_Time:24858.0min: [2024-01-05 17:43:42,959][model8_pretrain.py][INFO] Epoch:[0/2](679900/4588595) loss:3.129 lr:0.0000100 epoch_Time:24858.0min: [2024-01-05 17:44:19,862][model8_pretrain.py][INFO] Epoch:[0/2](680000/4588595) loss:3.374 lr:0.0000100 epoch_Time:24857.0min: [2024-01-05 17:44:19,862][model8_pretrain.py][INFO] Epoch:[0/2](680000/4588595) loss:2.856 lr:0.0000100 epoch_Time:24857.0min: [2024-01-05 17:44:19,862][model8_pretrain.py][INFO] Epoch:[0/2](680000/4588595) loss:3.353 lr:0.0000100 epoch_Time:24857.0min: [2024-01-05 17:44:19,862][model8_pretrain.py][INFO] Epoch:[0/2](680000/4588595) loss:2.719 lr:0.0000100 epoch_Time:24857.0min: [2024-01-05 17:44:19,862][model8_pretrain.py][INFO] Epoch:[0/2](680000/4588595) loss:2.858 lr:0.0000100 epoch_Time:24857.0min: [2024-01-05 
17:44:19,862][model8_pretrain.py][INFO] Epoch:[0/2](680000/4588595) loss:3.014 lr:0.0000100 epoch_Time:24857.0min: [2024-01-05 17:44:19,862][model8_pretrain.py][INFO] Epoch:[0/2](680000/4588595) loss:2.688 lr:0.0000100 epoch_Time:24857.0min: [2024-01-05 17:44:19,862][model8_pretrain.py][INFO] Epoch:[0/2](680000/4588595) loss:2.760 lr:0.0000100 epoch_Time:24857.0min: [2024-01-05 17:44:56,786][model8_pretrain.py][INFO] Epoch:[0/2](680100/4588595) loss:3.287 lr:0.0000100 epoch_Time:24856.0min: [2024-01-05 17:44:56,786][model8_pretrain.py][INFO] Epoch:[0/2](680100/4588595) loss:2.289 lr:0.0000100 epoch_Time:24856.0min: [2024-01-05 17:44:56,786][model8_pretrain.py][INFO] Epoch:[0/2](680100/4588595) loss:2.900 lr:0.0000100 epoch_Time:24856.0min: [2024-01-05 17:44:56,786][model8_pretrain.py][INFO] Epoch:[0/2](680100/4588595) loss:3.055 lr:0.0000100 epoch_Time:24856.0min: [2024-01-05 17:44:56,786][model8_pretrain.py][INFO] Epoch:[0/2](680100/4588595) loss:3.366 lr:0.0000100 epoch_Time:24856.0min: [2024-01-05 17:44:56,786][model8_pretrain.py][INFO] Epoch:[0/2](680100/4588595) loss:2.590 lr:0.0000100 epoch_Time:24856.0min: [2024-01-05 17:44:56,786][model8_pretrain.py][INFO] Epoch:[0/2](680100/4588595) loss:3.184 lr:0.0000100 epoch_Time:24856.0min: [2024-01-05 17:44:56,786][model8_pretrain.py][INFO] Epoch:[0/2](680100/4588595) loss:2.284 lr:0.0000100 epoch_Time:24856.0min: [2024-01-05 17:45:33,727][model8_pretrain.py][INFO] Epoch:[0/2](680200/4588595) loss:2.930 lr:0.0000100 epoch_Time:24856.0min: [2024-01-05 17:45:33,727][model8_pretrain.py][INFO] Epoch:[0/2](680200/4588595) loss:2.688 lr:0.0000100 epoch_Time:24856.0min: [2024-01-05 17:45:33,727][model8_pretrain.py][INFO] Epoch:[0/2](680200/4588595) loss:3.096 lr:0.0000100 epoch_Time:24856.0min: [2024-01-05 17:45:33,727][model8_pretrain.py][INFO] Epoch:[0/2](680200/4588595) loss:3.051 lr:0.0000100 epoch_Time:24856.0min: [2024-01-05 17:45:33,727][model8_pretrain.py][INFO] Epoch:[0/2](680200/4588595) loss:3.173 lr:0.0000100 
epoch_Time:24856.0min: [2024-01-05 17:45:33,727][model8_pretrain.py][INFO] Epoch:[0/2](680200/4588595) loss:2.316 lr:0.0000100 epoch_Time:24856.0min: [2024-01-05 17:45:33,727][model8_pretrain.py][INFO] Epoch:[0/2](680200/4588595) loss:2.783 lr:0.0000100 epoch_Time:24856.0min: [2024-01-05 17:45:33,728][model8_pretrain.py][INFO] Epoch:[0/2](680200/4588595) loss:2.468 lr:0.0000100 epoch_Time:24856.0min: [2024-01-05 17:46:10,674][model8_pretrain.py][INFO] Epoch:[0/2](680300/4588595) loss:3.036 lr:0.0000100 epoch_Time:24855.0min: [2024-01-05 17:46:10,674][model8_pretrain.py][INFO] Epoch:[0/2](680300/4588595) loss:2.635 lr:0.0000100 epoch_Time:24855.0min: [2024-01-05 17:46:10,674][model8_pretrain.py][INFO] Epoch:[0/2](680300/4588595) loss:2.831 lr:0.0000100 epoch_Time:24855.0min: [2024-01-05 17:46:10,674][model8_pretrain.py][INFO] Epoch:[0/2](680300/4588595) loss:2.786 lr:0.0000100 epoch_Time:24855.0min: [2024-01-05 17:46:10,674][model8_pretrain.py][INFO] Epoch:[0/2](680300/4588595) loss:2.667 lr:0.0000100 epoch_Time:24855.0min: [2024-01-05 17:46:10,674][model8_pretrain.py][INFO] Epoch:[0/2](680300/4588595) loss:2.967 lr:0.0000100 epoch_Time:24855.0min: [2024-01-05 17:46:10,675][model8_pretrain.py][INFO] Epoch:[0/2](680300/4588595) loss:2.791 lr:0.0000100 epoch_Time:24855.0min: [2024-01-05 17:46:10,675][model8_pretrain.py][INFO] Epoch:[0/2](680300/4588595) loss:3.172 lr:0.0000100 epoch_Time:24855.0min: [2024-01-05 17:46:47,605][model8_pretrain.py][INFO] Epoch:[0/2](680400/4588595) loss:2.828 lr:0.0000100 epoch_Time:24855.0min: [2024-01-05 17:46:47,605][model8_pretrain.py][INFO] Epoch:[0/2](680400/4588595) loss:3.210 lr:0.0000100 epoch_Time:24855.0min: [2024-01-05 17:46:47,605][model8_pretrain.py][INFO] Epoch:[0/2](680400/4588595) loss:2.335 lr:0.0000100 epoch_Time:24854.0min: [2024-01-05 17:46:47,605][model8_pretrain.py][INFO] Epoch:[0/2](680400/4588595) loss:2.683 lr:0.0000100 epoch_Time:24854.0min: [2024-01-05 17:46:47,605][model8_pretrain.py][INFO] 
Epoch:[0/2](680400/4588595) loss:2.693 lr:0.0000100 epoch_Time:24854.0min: [2024-01-05 17:46:47,605][model8_pretrain.py][INFO] Epoch:[0/2](680400/4588595) loss:2.925 lr:0.0000100 epoch_Time:24854.0min: [2024-01-05 17:46:47,606][model8_pretrain.py][INFO] Epoch:[0/2](680400/4588595) loss:2.911 lr:0.0000100 epoch_Time:24854.0min: [2024-01-05 17:46:47,606][model8_pretrain.py][INFO] Epoch:[0/2](680400/4588595) loss:3.165 lr:0.0000100 epoch_Time:24854.0min: [2024-01-05 17:47:24,534][model8_pretrain.py][INFO] Epoch:[0/2](680500/4588595) loss:2.984 lr:0.0000100 epoch_Time:24853.0min: [2024-01-05 17:47:24,534][model8_pretrain.py][INFO] Epoch:[0/2](680500/4588595) loss:3.067 lr:0.0000100 epoch_Time:24853.0min: [2024-01-05 17:47:24,534][model8_pretrain.py][INFO] Epoch:[0/2](680500/4588595) loss:2.952 lr:0.0000100 epoch_Time:24853.0min: [2024-01-05 17:47:24,534][model8_pretrain.py][INFO] Epoch:[0/2](680500/4588595) loss:2.961 lr:0.0000100 epoch_Time:24853.0min: [2024-01-05 17:47:24,534][model8_pretrain.py][INFO] Epoch:[0/2](680500/4588595) loss:2.773 lr:0.0000100 epoch_Time:24853.0min: [2024-01-05 17:47:24,534][model8_pretrain.py][INFO] Epoch:[0/2](680500/4588595) loss:2.868 lr:0.0000100 epoch_Time:24853.0min: [2024-01-05 17:47:24,534][model8_pretrain.py][INFO] Epoch:[0/2](680500/4588595) loss:2.766 lr:0.0000100 epoch_Time:24853.0min: [2024-01-05 17:47:24,534][model8_pretrain.py][INFO] Epoch:[0/2](680500/4588595) loss:3.200 lr:0.0000100 epoch_Time:24853.0min: [2024-01-05 17:48:13,426][model8_pretrain.py][INFO] Epoch:[0/2](680600/4588595) loss:2.504 lr:0.0000100 epoch_Time:24854.0min: [2024-01-05 17:48:13,426][model8_pretrain.py][INFO] Epoch:[0/2](680600/4588595) loss:2.529 lr:0.0000100 epoch_Time:24854.0min: [2024-01-05 17:48:13,426][model8_pretrain.py][INFO] Epoch:[0/2](680600/4588595) loss:2.674 lr:0.0000100 epoch_Time:24854.0min: [2024-01-05 17:48:13,426][model8_pretrain.py][INFO] Epoch:[0/2](680600/4588595) loss:3.095 lr:0.0000100 epoch_Time:24854.0min: [2024-01-05 
17:48:13,426][model8_pretrain.py][INFO] Epoch:[0/2](680600/4588595) loss:3.065 lr:0.0000100 epoch_Time:24854.0min: [2024-01-05 17:48:13,427][model8_pretrain.py][INFO] Epoch:[0/2](680600/4588595) loss:2.506 lr:0.0000100 epoch_Time:24854.0min: [2024-01-05 17:48:13,427][model8_pretrain.py][INFO] Epoch:[0/2](680600/4588595) loss:2.801 lr:0.0000100 epoch_Time:24854.0min: [2024-01-05 17:48:13,427][model8_pretrain.py][INFO] Epoch:[0/2](680600/4588595) loss:3.086 lr:0.0000100 epoch_Time:24854.0min: [2024-01-05 17:48:50,355][model8_pretrain.py][INFO] Epoch:[0/2](680700/4588595) loss:2.777 lr:0.0000100 epoch_Time:24853.0min: [2024-01-05 17:48:50,355][model8_pretrain.py][INFO] Epoch:[0/2](680700/4588595) loss:3.040 lr:0.0000100 epoch_Time:24853.0min: [2024-01-05 17:48:50,355][model8_pretrain.py][INFO] Epoch:[0/2](680700/4588595) loss:2.964 lr:0.0000100 epoch_Time:24853.0min: [2024-01-05 17:48:50,355][model8_pretrain.py][INFO] Epoch:[0/2](680700/4588595) loss:2.522 lr:0.0000100 epoch_Time:24853.0min: [2024-01-05 17:48:50,355][model8_pretrain.py][INFO] Epoch:[0/2](680700/4588595) loss:2.828 lr:0.0000100 epoch_Time:24853.0min: [2024-01-05 17:48:50,355][model8_pretrain.py][INFO] Epoch:[0/2](680700/4588595) loss:2.834 lr:0.0000100 epoch_Time:24853.0min: [2024-01-05 17:48:50,356][model8_pretrain.py][INFO] Epoch:[0/2](680700/4588595) loss:2.460 lr:0.0000100 epoch_Time:24853.0min: [2024-01-05 17:48:50,356][model8_pretrain.py][INFO] Epoch:[0/2](680700/4588595) loss:3.107 lr:0.0000100 epoch_Time:24853.0min: [2024-01-05 17:49:27,284][model8_pretrain.py][INFO] Epoch:[0/2](680800/4588595) loss:3.208 lr:0.0000100 epoch_Time:24852.0min: [2024-01-05 17:49:27,284][model8_pretrain.py][INFO] Epoch:[0/2](680800/4588595) loss:2.950 lr:0.0000100 epoch_Time:24852.0min: [2024-01-05 17:49:27,284][model8_pretrain.py][INFO] Epoch:[0/2](680800/4588595) loss:2.650 lr:0.0000100 epoch_Time:24852.0min: [2024-01-05 17:49:27,284][model8_pretrain.py][INFO] Epoch:[0/2](680800/4588595) loss:2.511 lr:0.0000100 
epoch_Time:24852.0min: [2024-01-05 17:49:27,284][model8_pretrain.py][INFO] Epoch:[0/2](680800/4588595) loss:2.854 lr:0.0000100 epoch_Time:24852.0min: [2024-01-05 17:49:27,284][model8_pretrain.py][INFO] Epoch:[0/2](680800/4588595) loss:2.657 lr:0.0000100 epoch_Time:24852.0min: [2024-01-05 17:49:27,284][model8_pretrain.py][INFO] Epoch:[0/2](680800/4588595) loss:2.923 lr:0.0000100 epoch_Time:24852.0min: [2024-01-05 17:49:27,284][model8_pretrain.py][INFO] Epoch:[0/2](680800/4588595) loss:2.731 lr:0.0000100 epoch_Time:24852.0min: [2024-01-05 17:50:04,213][model8_pretrain.py][INFO] Epoch:[0/2](680900/4588595) loss:2.934 lr:0.0000100 epoch_Time:24851.0min: [2024-01-05 17:50:04,213][model8_pretrain.py][INFO] Epoch:[0/2](680900/4588595) loss:2.976 lr:0.0000100 epoch_Time:24851.0min: [2024-01-05 17:50:04,213][model8_pretrain.py][INFO] Epoch:[0/2](680900/4588595) loss:2.712 lr:0.0000100 epoch_Time:24851.0min: [2024-01-05 17:50:04,213][model8_pretrain.py][INFO] Epoch:[0/2](680900/4588595) loss:3.179 lr:0.0000100 epoch_Time:24851.0min: [2024-01-05 17:50:04,213][model8_pretrain.py][INFO] Epoch:[0/2](680900/4588595) loss:2.011 lr:0.0000100 epoch_Time:24851.0min: [2024-01-05 17:50:04,213][model8_pretrain.py][INFO] Epoch:[0/2](680900/4588595) loss:3.073 lr:0.0000100 epoch_Time:24851.0min: [2024-01-05 17:50:04,213][model8_pretrain.py][INFO] Epoch:[0/2](680900/4588595) loss:3.414 lr:0.0000100 epoch_Time:24851.0min: [2024-01-05 17:50:04,213][model8_pretrain.py][INFO] Epoch:[0/2](680900/4588595) loss:2.804 lr:0.0000100 epoch_Time:24851.0min: [2024-01-05 17:50:41,148][model8_pretrain.py][INFO] Epoch:[0/2](681000/4588595) loss:2.572 lr:0.0000100 epoch_Time:24851.0min: [2024-01-05 17:50:41,148][model8_pretrain.py][INFO] Epoch:[0/2](681000/4588595) loss:2.279 lr:0.0000100 epoch_Time:24851.0min: [2024-01-05 17:50:41,148][model8_pretrain.py][INFO] Epoch:[0/2](681000/4588595) loss:2.811 lr:0.0000100 epoch_Time:24851.0min: [2024-01-05 17:50:41,148][model8_pretrain.py][INFO] 
Epoch:[0/2](681000/4588595) loss:3.043 lr:0.0000100 epoch_Time:24851.0min: [2024-01-05 17:50:41,148][model8_pretrain.py][INFO] Epoch:[0/2](681000/4588595) loss:2.201 lr:0.0000100 epoch_Time:24851.0min: [2024-01-05 17:50:41,148][model8_pretrain.py][INFO] Epoch:[0/2](681000/4588595) loss:3.029 lr:0.0000100 epoch_Time:24851.0min: [2024-01-05 17:50:41,148][model8_pretrain.py][INFO] Epoch:[0/2](681000/4588595) loss:3.037 lr:0.0000100 epoch_Time:24851.0min: [2024-01-05 17:50:41,148][model8_pretrain.py][INFO] Epoch:[0/2](681000/4588595) loss:3.019 lr:0.0000100 epoch_Time:24851.0min: [2024-01-05 17:51:18,086][model8_pretrain.py][INFO] Epoch:[0/2](681100/4588595) loss:2.481 lr:0.0000100 epoch_Time:24850.0min: [2024-01-05 17:51:18,086][model8_pretrain.py][INFO] Epoch:[0/2](681100/4588595) loss:2.411 lr:0.0000100 epoch_Time:24850.0min: [2024-01-05 17:51:18,086][model8_pretrain.py][INFO] Epoch:[0/2](681100/4588595) loss:2.697 lr:0.0000100 epoch_Time:24850.0min: [2024-01-05 17:51:18,086][model8_pretrain.py][INFO] Epoch:[0/2](681100/4588595) loss:2.412 lr:0.0000100 epoch_Time:24850.0min: [2024-01-05 17:51:18,086][model8_pretrain.py][INFO] Epoch:[0/2](681100/4588595) loss:3.056 lr:0.0000100 epoch_Time:24850.0min: [2024-01-05 17:51:18,087][model8_pretrain.py][INFO] Epoch:[0/2](681100/4588595) loss:2.664 lr:0.0000100 epoch_Time:24850.0min: [2024-01-05 17:51:18,087][model8_pretrain.py][INFO] Epoch:[0/2](681100/4588595) loss:2.348 lr:0.0000100 epoch_Time:24850.0min: [2024-01-05 17:51:18,087][model8_pretrain.py][INFO] Epoch:[0/2](681100/4588595) loss:2.659 lr:0.0000100 epoch_Time:24850.0min: [2024-01-05 17:51:55,023][model8_pretrain.py][INFO] Epoch:[0/2](681200/4588595) loss:2.870 lr:0.0000100 epoch_Time:24849.0min: [2024-01-05 17:51:55,023][model8_pretrain.py][INFO] Epoch:[0/2](681200/4588595) loss:2.386 lr:0.0000100 epoch_Time:24849.0min: [2024-01-05 17:51:55,023][model8_pretrain.py][INFO] Epoch:[0/2](681200/4588595) loss:2.039 lr:0.0000100 epoch_Time:24849.0min: [2024-01-05 
17:51:55,023][model8_pretrain.py][INFO] Epoch:[0/2](681200/4588595) loss:2.461 lr:0.0000100 epoch_Time:24849.0min: [2024-01-05 17:51:55,023][model8_pretrain.py][INFO] Epoch:[0/2](681200/4588595) loss:2.730 lr:0.0000100 epoch_Time:24849.0min: [2024-01-05 17:51:55,023][model8_pretrain.py][INFO] Epoch:[0/2](681200/4588595) loss:2.457 lr:0.0000100 epoch_Time:24849.0min: [2024-01-05 17:51:55,023][model8_pretrain.py][INFO] Epoch:[0/2](681200/4588595) loss:2.711 lr:0.0000100 epoch_Time:24849.0min: [2024-01-05 17:51:55,023][model8_pretrain.py][INFO] Epoch:[0/2](681200/4588595) loss:2.656 lr:0.0000100 epoch_Time:24849.0min: [2024-01-05 17:52:31,947][model8_pretrain.py][INFO] Epoch:[0/2](681300/4588595) loss:2.652 lr:0.0000100 epoch_Time:24849.0min: [2024-01-05 17:52:31,947][model8_pretrain.py][INFO] Epoch:[0/2](681300/4588595) loss:2.779 lr:0.0000100 epoch_Time:24849.0min: [2024-01-05 17:52:31,947][model8_pretrain.py][INFO] Epoch:[0/2](681300/4588595) loss:2.762 lr:0.0000100 epoch_Time:24849.0min: [2024-01-05 17:52:31,947][model8_pretrain.py][INFO] Epoch:[0/2](681300/4588595) loss:2.448 lr:0.0000100 epoch_Time:24849.0min: [2024-01-05 17:52:31,947][model8_pretrain.py][INFO] Epoch:[0/2](681300/4588595) loss:2.870 lr:0.0000100 epoch_Time:24849.0min: [2024-01-05 17:52:31,947][model8_pretrain.py][INFO] Epoch:[0/2](681300/4588595) loss:3.192 lr:0.0000100 epoch_Time:24849.0min: [2024-01-05 17:52:31,947][model8_pretrain.py][INFO] Epoch:[0/2](681300/4588595) loss:2.664 lr:0.0000100 epoch_Time:24849.0min: [2024-01-05 17:52:31,947][model8_pretrain.py][INFO] Epoch:[0/2](681300/4588595) loss:2.856 lr:0.0000100 epoch_Time:24849.0min: [2024-01-05 17:53:20,947][model8_pretrain.py][INFO] Epoch:[0/2](681400/4588595) loss:3.016 lr:0.0000100 epoch_Time:24849.0min: [2024-01-05 17:53:20,947][model8_pretrain.py][INFO] Epoch:[0/2](681400/4588595) loss:2.709 lr:0.0000100 epoch_Time:24849.0min: [2024-01-05 17:53:20,947][model8_pretrain.py][INFO] Epoch:[0/2](681400/4588595) loss:2.248 lr:0.0000100 
epoch_Time:24849.0min: [2024-01-05 17:53:20,947][model8_pretrain.py][INFO] Epoch:[0/2](681400/4588595) loss:2.347 lr:0.0000100 epoch_Time:24849.0min: [2024-01-05 17:53:20,947][model8_pretrain.py][INFO] Epoch:[0/2](681400/4588595) loss:2.669 lr:0.0000100 epoch_Time:24849.0min: [2024-01-05 17:53:20,947][model8_pretrain.py][INFO] Epoch:[0/2](681400/4588595) loss:3.453 lr:0.0000100 epoch_Time:24849.0min: [2024-01-05 17:53:20,947][model8_pretrain.py][INFO] Epoch:[0/2](681400/4588595) loss:3.214 lr:0.0000100 epoch_Time:24849.0min: [2024-01-05 17:53:20,947][model8_pretrain.py][INFO] Epoch:[0/2](681400/4588595) loss:2.405 lr:0.0000100 epoch_Time:24849.0min: [2024-01-05 17:53:57,881][model8_pretrain.py][INFO] Epoch:[0/2](681500/4588595) loss:2.689 lr:0.0000100 epoch_Time:24848.0min: [2024-01-05 17:53:57,881][model8_pretrain.py][INFO] Epoch:[0/2](681500/4588595) loss:2.954 lr:0.0000100 epoch_Time:24848.0min: [2024-01-05 17:53:57,881][model8_pretrain.py][INFO] Epoch:[0/2](681500/4588595) loss:2.517 lr:0.0000100 epoch_Time:24848.0min: [2024-01-05 17:53:57,881][model8_pretrain.py][INFO] Epoch:[0/2](681500/4588595) loss:2.699 lr:0.0000100 epoch_Time:24848.0min: [2024-01-05 17:53:57,881][model8_pretrain.py][INFO] Epoch:[0/2](681500/4588595) loss:3.092 lr:0.0000100 epoch_Time:24848.0min: [2024-01-05 17:53:57,881][model8_pretrain.py][INFO] Epoch:[0/2](681500/4588595) loss:3.035 lr:0.0000100 epoch_Time:24848.0min: [2024-01-05 17:53:57,881][model8_pretrain.py][INFO] Epoch:[0/2](681500/4588595) loss:2.737 lr:0.0000100 epoch_Time:24848.0min: [2024-01-05 17:53:57,881][model8_pretrain.py][INFO] Epoch:[0/2](681500/4588595) loss:2.662 lr:0.0000100 epoch_Time:24848.0min: [2024-01-05 17:54:34,837][model8_pretrain.py][INFO] Epoch:[0/2](681600/4588595) loss:2.782 lr:0.0000100 epoch_Time:24848.0min: [2024-01-05 17:54:34,837][model8_pretrain.py][INFO] Epoch:[0/2](681600/4588595) loss:3.345 lr:0.0000100 epoch_Time:24848.0min: [2024-01-05 17:54:34,837][model8_pretrain.py][INFO] 
Epoch:[0/2](681600/4588595) loss:3.066 lr:0.0000100 epoch_Time:24848.0min: [2024-01-05 17:54:34,837][model8_pretrain.py][INFO] Epoch:[0/2](681600/4588595) loss:2.831 lr:0.0000100 epoch_Time:24848.0min: [2024-01-05 17:54:34,837][model8_pretrain.py][INFO] Epoch:[0/2](681600/4588595) loss:2.673 lr:0.0000100 epoch_Time:24848.0min: [2024-01-05 17:54:34,837][model8_pretrain.py][INFO] Epoch:[0/2](681600/4588595) loss:2.315 lr:0.0000100 epoch_Time:24848.0min: [2024-01-05 17:54:34,837][model8_pretrain.py][INFO] Epoch:[0/2](681600/4588595) loss:1.894 lr:0.0000100 epoch_Time:24848.0min: [2024-01-05 17:54:34,837][model8_pretrain.py][INFO] Epoch:[0/2](681600/4588595) loss:2.220 lr:0.0000100 epoch_Time:24848.0min: [2024-01-05 17:55:11,793][model8_pretrain.py][INFO] Epoch:[0/2](681700/4588595) loss:2.397 lr:0.0000100 epoch_Time:24847.0min: [2024-01-05 17:55:11,793][model8_pretrain.py][INFO] Epoch:[0/2](681700/4588595) loss:2.191 lr:0.0000100 epoch_Time:24847.0min: [2024-01-05 17:55:11,793][model8_pretrain.py][INFO] Epoch:[0/2](681700/4588595) loss:2.675 lr:0.0000100 epoch_Time:24847.0min: [2024-01-05 17:55:11,793][model8_pretrain.py][INFO] Epoch:[0/2](681700/4588595) loss:2.866 lr:0.0000100 epoch_Time:24847.0min: [2024-01-05 17:55:11,793][model8_pretrain.py][INFO] Epoch:[0/2](681700/4588595) loss:2.839 lr:0.0000100 epoch_Time:24847.0min: [2024-01-05 17:55:11,793][model8_pretrain.py][INFO] Epoch:[0/2](681700/4588595) loss:2.854 lr:0.0000100 epoch_Time:24847.0min: [2024-01-05 17:55:11,793][model8_pretrain.py][INFO] Epoch:[0/2](681700/4588595) loss:3.016 lr:0.0000100 epoch_Time:24847.0min: [2024-01-05 17:55:11,793][model8_pretrain.py][INFO] Epoch:[0/2](681700/4588595) loss:2.934 lr:0.0000100 epoch_Time:24847.0min: [2024-01-05 17:55:48,733][model8_pretrain.py][INFO] Epoch:[0/2](681800/4588595) loss:2.905 lr:0.0000100 epoch_Time:24845.0min: [2024-01-05 17:55:48,733][model8_pretrain.py][INFO] Epoch:[0/2](681800/4588595) loss:2.785 lr:0.0000100 epoch_Time:24845.0min: [2024-01-05 
17:55:48,733][model8_pretrain.py][INFO] Epoch:[0/2](681800/4588595) loss:2.495 lr:0.0000100 epoch_Time:24845.0min: [2024-01-05 17:55:48,733][model8_pretrain.py][INFO] Epoch:[0/2](681800/4588595) loss:2.848 lr:0.0000100 epoch_Time:24845.0min: [2024-01-05 17:55:48,733][model8_pretrain.py][INFO] Epoch:[0/2](681800/4588595) loss:3.061 lr:0.0000100 epoch_Time:24845.0min: [2024-01-05 17:55:48,733][model8_pretrain.py][INFO] Epoch:[0/2](681800/4588595) loss:3.045 lr:0.0000100 epoch_Time:24845.0min: [2024-01-05 17:55:48,733][model8_pretrain.py][INFO] Epoch:[0/2](681800/4588595) loss:3.236 lr:0.0000100 epoch_Time:24845.0min: [2024-01-05 17:55:48,733][model8_pretrain.py][INFO] Epoch:[0/2](681800/4588595) loss:2.940 lr:0.0000100 epoch_Time:24845.0min: [2024-01-05 17:56:25,676][model8_pretrain.py][INFO] Epoch:[0/2](681900/4588595) loss:2.974 lr:0.0000100 epoch_Time:24845.0min: [2024-01-05 17:56:25,676][model8_pretrain.py][INFO] Epoch:[0/2](681900/4588595) loss:2.726 lr:0.0000100 epoch_Time:24845.0min: [2024-01-05 17:56:25,676][model8_pretrain.py][INFO] Epoch:[0/2](681900/4588595) loss:3.128 lr:0.0000100 epoch_Time:24845.0min: [2024-01-05 17:56:25,676][model8_pretrain.py][INFO] Epoch:[0/2](681900/4588595) loss:2.762 lr:0.0000100 epoch_Time:24845.0min: [2024-01-05 17:56:25,676][model8_pretrain.py][INFO] Epoch:[0/2](681900/4588595) loss:2.767 lr:0.0000100 epoch_Time:24845.0min: [2024-01-05 17:56:25,676][model8_pretrain.py][INFO] Epoch:[0/2](681900/4588595) loss:1.942 lr:0.0000100 epoch_Time:24845.0min: [2024-01-05 17:56:25,676][model8_pretrain.py][INFO] Epoch:[0/2](681900/4588595) loss:2.714 lr:0.0000100 epoch_Time:24845.0min: [2024-01-05 17:56:25,676][model8_pretrain.py][INFO] Epoch:[0/2](681900/4588595) loss:2.090 lr:0.0000100 epoch_Time:24845.0min: [2024-01-05 17:57:02,628][model8_pretrain.py][INFO] Epoch:[0/2](682000/4588595) loss:2.445 lr:0.0000100 epoch_Time:24844.0min: [2024-01-05 17:57:02,628][model8_pretrain.py][INFO] Epoch:[0/2](682000/4588595) loss:3.170 lr:0.0000100 
epoch_Time:24844.0min: [2024-01-05 17:57:02,628][model8_pretrain.py][INFO] Epoch:[0/2](682000/4588595) loss:2.514 lr:0.0000100 epoch_Time:24844.0min: [2024-01-05 17:57:02,628][model8_pretrain.py][INFO] Epoch:[0/2](682000/4588595) loss:2.396 lr:0.0000100 epoch_Time:24844.0min: [2024-01-05 17:57:02,628][model8_pretrain.py][INFO] Epoch:[0/2](682000/4588595) loss:2.749 lr:0.0000100 epoch_Time:24844.0min: [2024-01-05 17:57:02,628][model8_pretrain.py][INFO] Epoch:[0/2](682000/4588595) loss:2.983 lr:0.0000100 epoch_Time:24844.0min: [2024-01-05 17:57:02,628][model8_pretrain.py][INFO] Epoch:[0/2](682000/4588595) loss:3.120 lr:0.0000100 epoch_Time:24844.0min: [2024-01-05 17:57:02,628][model8_pretrain.py][INFO] Epoch:[0/2](682000/4588595) loss:3.191 lr:0.0000100 epoch_Time:24844.0min: [2024-01-05 17:57:39,572][model8_pretrain.py][INFO] Epoch:[0/2](682100/4588595) loss:2.066 lr:0.0000100 epoch_Time:24844.0min: [2024-01-05 17:57:39,572][model8_pretrain.py][INFO] Epoch:[0/2](682100/4588595) loss:2.972 lr:0.0000100 epoch_Time:24844.0min: [2024-01-05 17:57:39,572][model8_pretrain.py][INFO] Epoch:[0/2](682100/4588595) loss:2.626 lr:0.0000100 epoch_Time:24844.0min: [2024-01-05 17:57:39,572][model8_pretrain.py][INFO] Epoch:[0/2](682100/4588595) loss:2.737 lr:0.0000100 epoch_Time:24844.0min: [2024-01-05 17:57:39,572][model8_pretrain.py][INFO] Epoch:[0/2](682100/4588595) loss:2.790 lr:0.0000100 epoch_Time:24844.0min: [2024-01-05 17:57:39,572][model8_pretrain.py][INFO] Epoch:[0/2](682100/4588595) loss:2.119 lr:0.0000100 epoch_Time:24844.0min: [2024-01-05 17:57:39,572][model8_pretrain.py][INFO] Epoch:[0/2](682100/4588595) loss:2.910 lr:0.0000100 epoch_Time:24844.0min: [2024-01-05 17:57:39,572][model8_pretrain.py][INFO] Epoch:[0/2](682100/4588595) loss:2.535 lr:0.0000100 epoch_Time:24844.0min: [2024-01-05 17:58:28,511][model8_pretrain.py][INFO] Epoch:[0/2](682200/4588595) loss:2.774 lr:0.0000100 epoch_Time:24844.0min: [2024-01-05 17:58:28,511][model8_pretrain.py][INFO] 
Epoch:[0/2](682200/4588595) loss:2.892 lr:0.0000100 epoch_Time:24844.0min: [2024-01-05 17:58:28,511][model8_pretrain.py][INFO] Epoch:[0/2](682200/4588595) loss:2.462 lr:0.0000100 epoch_Time:24844.0min: [2024-01-05 17:58:28,511][model8_pretrain.py][INFO] Epoch:[0/2](682200/4588595) loss:2.833 lr:0.0000100 epoch_Time:24844.0min: [2024-01-05 17:58:28,511][model8_pretrain.py][INFO] Epoch:[0/2](682200/4588595) loss:2.646 lr:0.0000100 epoch_Time:24844.0min: [2024-01-05 17:58:28,511][model8_pretrain.py][INFO] Epoch:[0/2](682200/4588595) loss:3.297 lr:0.0000100 epoch_Time:24844.0min: [2024-01-05 17:58:28,511][model8_pretrain.py][INFO] Epoch:[0/2](682200/4588595) loss:2.852 lr:0.0000100 epoch_Time:24844.0min: [2024-01-05 17:58:28,512][model8_pretrain.py][INFO] Epoch:[0/2](682200/4588595) loss:2.825 lr:0.0000100 epoch_Time:24844.0min: [2024-01-05 17:59:05,440][model8_pretrain.py][INFO] Epoch:[0/2](682300/4588595) loss:2.412 lr:0.0000100 epoch_Time:24843.0min: [2024-01-05 17:59:05,440][model8_pretrain.py][INFO] Epoch:[0/2](682300/4588595) loss:2.727 lr:0.0000100 epoch_Time:24843.0min: [2024-01-05 17:59:05,440][model8_pretrain.py][INFO] Epoch:[0/2](682300/4588595) loss:2.741 lr:0.0000100 epoch_Time:24843.0min: [2024-01-05 17:59:05,440][model8_pretrain.py][INFO] Epoch:[0/2](682300/4588595) loss:2.826 lr:0.0000100 epoch_Time:24843.0min: [2024-01-05 17:59:05,440][model8_pretrain.py][INFO] Epoch:[0/2](682300/4588595) loss:3.045 lr:0.0000100 epoch_Time:24843.0min: [2024-01-05 17:59:05,440][model8_pretrain.py][INFO] Epoch:[0/2](682300/4588595) loss:2.920 lr:0.0000100 epoch_Time:24843.0min: [2024-01-05 17:59:05,440][model8_pretrain.py][INFO] Epoch:[0/2](682300/4588595) loss:3.090 lr:0.0000100 epoch_Time:24843.0min: [2024-01-05 17:59:05,441][model8_pretrain.py][INFO] Epoch:[0/2](682300/4588595) loss:2.761 lr:0.0000100 epoch_Time:24843.0min: [2024-01-05 17:59:42,381][model8_pretrain.py][INFO] Epoch:[0/2](682400/4588595) loss:3.004 lr:0.0000100 epoch_Time:24843.0min: [2024-01-05 
17:59:42,381][model8_pretrain.py][INFO] Epoch:[0/2](682400/4588595) loss:2.674 lr:0.0000100 epoch_Time:24843.0min: [2024-01-05 17:59:42,381][model8_pretrain.py][INFO] Epoch:[0/2](682400/4588595) loss:2.365 lr:0.0000100 epoch_Time:24843.0min: [2024-01-05 17:59:42,382][model8_pretrain.py][INFO] Epoch:[0/2](682400/4588595) loss:3.243 lr:0.0000100 epoch_Time:24843.0min: [2024-01-05 17:59:42,382][model8_pretrain.py][INFO] Epoch:[0/2](682400/4588595) loss:3.058 lr:0.0000100 epoch_Time:24843.0min: [2024-01-05 17:59:42,382][model8_pretrain.py][INFO] Epoch:[0/2](682400/4588595) loss:2.812 lr:0.0000100 epoch_Time:24843.0min: [2024-01-05 17:59:42,382][model8_pretrain.py][INFO] Epoch:[0/2](682400/4588595) loss:3.337 lr:0.0000100 epoch_Time:24843.0min: [2024-01-05 17:59:42,382][model8_pretrain.py][INFO] Epoch:[0/2](682400/4588595) loss:2.839 lr:0.0000100 epoch_Time:24843.0min: [2024-01-05 18:00:19,325][model8_pretrain.py][INFO] Epoch:[0/2](682500/4588595) loss:2.593 lr:0.0000100 epoch_Time:24842.0min: [2024-01-05 18:00:19,325][model8_pretrain.py][INFO] Epoch:[0/2](682500/4588595) loss:2.387 lr:0.0000100 epoch_Time:24842.0min: [2024-01-05 18:00:19,325][model8_pretrain.py][INFO] Epoch:[0/2](682500/4588595) loss:2.564 lr:0.0000100 epoch_Time:24842.0min: [2024-01-05 18:00:19,325][model8_pretrain.py][INFO] Epoch:[0/2](682500/4588595) loss:3.516 lr:0.0000100 epoch_Time:24842.0min: [2024-01-05 18:00:19,325][model8_pretrain.py][INFO] Epoch:[0/2](682500/4588595) loss:2.993 lr:0.0000100 epoch_Time:24842.0min: [2024-01-05 18:00:19,325][model8_pretrain.py][INFO] Epoch:[0/2](682500/4588595) loss:2.756 lr:0.0000100 epoch_Time:24842.0min: [2024-01-05 18:00:19,325][model8_pretrain.py][INFO] Epoch:[0/2](682500/4588595) loss:3.116 lr:0.0000100 epoch_Time:24842.0min: [2024-01-05 18:00:19,325][model8_pretrain.py][INFO] Epoch:[0/2](682500/4588595) loss:2.799 lr:0.0000100 epoch_Time:24842.0min: [2024-01-05 18:00:56,264][model8_pretrain.py][INFO] Epoch:[0/2](682600/4588595) loss:2.670 lr:0.0000100 
epoch_Time:24841.0min: [2024-01-05 18:00:56,265][model8_pretrain.py][INFO] Epoch:[0/2](682600/4588595) loss:2.847 lr:0.0000100 epoch_Time:24841.0min: [2024-01-05 18:00:56,265][model8_pretrain.py][INFO] Epoch:[0/2](682600/4588595) loss:3.445 lr:0.0000100 epoch_Time:24841.0min: [2024-01-05 18:00:56,265][model8_pretrain.py][INFO] Epoch:[0/2](682600/4588595) loss:2.341 lr:0.0000100 epoch_Time:24841.0min: [2024-01-05 18:00:56,265][model8_pretrain.py][INFO] Epoch:[0/2](682600/4588595) loss:3.018 lr:0.0000100 epoch_Time:24841.0min: [2024-01-05 18:00:56,265][model8_pretrain.py][INFO] Epoch:[0/2](682600/4588595) loss:2.489 lr:0.0000100 epoch_Time:24841.0min: [2024-01-05 18:00:56,265][model8_pretrain.py][INFO] Epoch:[0/2](682600/4588595) loss:3.162 lr:0.0000100 epoch_Time:24841.0min: [2024-01-05 18:00:56,265][model8_pretrain.py][INFO] Epoch:[0/2](682600/4588595) loss:2.814 lr:0.0000100 epoch_Time:24841.0min: [2024-01-05 18:01:33,218][model8_pretrain.py][INFO] Epoch:[0/2](682700/4588595) loss:3.190 lr:0.0000100 epoch_Time:24840.0min: [2024-01-05 18:01:33,218][model8_pretrain.py][INFO] Epoch:[0/2](682700/4588595) loss:2.775 lr:0.0000100 epoch_Time:24841.0min: [2024-01-05 18:01:33,218][model8_pretrain.py][INFO] Epoch:[0/2](682700/4588595) loss:2.513 lr:0.0000100 epoch_Time:24840.0min: [2024-01-05 18:01:33,218][model8_pretrain.py][INFO] Epoch:[0/2](682700/4588595) loss:2.738 lr:0.0000100 epoch_Time:24840.0min: [2024-01-05 18:01:33,218][model8_pretrain.py][INFO] Epoch:[0/2](682700/4588595) loss:3.075 lr:0.0000100 epoch_Time:24840.0min: [2024-01-05 18:01:33,218][model8_pretrain.py][INFO] Epoch:[0/2](682700/4588595) loss:2.619 lr:0.0000100 epoch_Time:24840.0min: [2024-01-05 18:01:33,218][model8_pretrain.py][INFO] Epoch:[0/2](682700/4588595) loss:2.626 lr:0.0000100 epoch_Time:24841.0min: [2024-01-05 18:01:33,219][model8_pretrain.py][INFO] Epoch:[0/2](682700/4588595) loss:2.963 lr:0.0000100 epoch_Time:24840.0min: [2024-01-05 18:02:10,166][model8_pretrain.py][INFO] 
Epoch:[0/2](682800/4588595) loss:3.003 lr:0.0000100 epoch_Time:24839.0min: [2024-01-05 18:02:10,166][model8_pretrain.py][INFO] Epoch:[0/2](682800/4588595) loss:2.809 lr:0.0000100 epoch_Time:24839.0min: [2024-01-05 18:02:10,166][model8_pretrain.py][INFO] Epoch:[0/2](682800/4588595) loss:2.872 lr:0.0000100 epoch_Time:24839.0min: [2024-01-05 18:02:10,166][model8_pretrain.py][INFO] Epoch:[0/2](682800/4588595) loss:2.676 lr:0.0000100 epoch_Time:24839.0min: [2024-01-05 18:02:10,166][model8_pretrain.py][INFO] Epoch:[0/2](682800/4588595) loss:3.344 lr:0.0000100 epoch_Time:24839.0min: [2024-01-05 18:02:10,166][model8_pretrain.py][INFO] Epoch:[0/2](682800/4588595) loss:2.613 lr:0.0000100 epoch_Time:24839.0min: [2024-01-05 18:02:10,166][model8_pretrain.py][INFO] Epoch:[0/2](682800/4588595) loss:2.362 lr:0.0000100 epoch_Time:24839.0min: [2024-01-05 18:02:10,166][model8_pretrain.py][INFO] Epoch:[0/2](682800/4588595) loss:2.348 lr:0.0000100 epoch_Time:24839.0min: [2024-01-05 18:02:47,115][model8_pretrain.py][INFO] Epoch:[0/2](682900/4588595) loss:3.169 lr:0.0000100 epoch_Time:24839.0min: [2024-01-05 18:02:47,115][model8_pretrain.py][INFO] Epoch:[0/2](682900/4588595) loss:2.853 lr:0.0000100 epoch_Time:24839.0min: [2024-01-05 18:02:47,115][model8_pretrain.py][INFO] Epoch:[0/2](682900/4588595) loss:3.158 lr:0.0000100 epoch_Time:24839.0min: [2024-01-05 18:02:47,115][model8_pretrain.py][INFO] Epoch:[0/2](682900/4588595) loss:3.167 lr:0.0000100 epoch_Time:24839.0min: [2024-01-05 18:02:47,115][model8_pretrain.py][INFO] Epoch:[0/2](682900/4588595) loss:3.062 lr:0.0000100 epoch_Time:24839.0min: [2024-01-05 18:02:47,115][model8_pretrain.py][INFO] Epoch:[0/2](682900/4588595) loss:2.785 lr:0.0000100 epoch_Time:24839.0min: [2024-01-05 18:02:47,115][model8_pretrain.py][INFO] Epoch:[0/2](682900/4588595) loss:2.898 lr:0.0000100 epoch_Time:24839.0min: [2024-01-05 18:02:47,115][model8_pretrain.py][INFO] Epoch:[0/2](682900/4588595) loss:2.981 lr:0.0000100 epoch_Time:24839.0min: [2024-01-05 
18:03:36,053][model8_pretrain.py][INFO] Epoch:[0/2](683000/4588595) loss:3.131 lr:0.0000100 epoch_Time:24839.0min: [2024-01-05 18:03:36,053][model8_pretrain.py][INFO] Epoch:[0/2](683000/4588595) loss:2.872 lr:0.0000100 epoch_Time:24839.0min: [2024-01-05 18:03:36,053][model8_pretrain.py][INFO] Epoch:[0/2](683000/4588595) loss:2.928 lr:0.0000100 epoch_Time:24839.0min: [2024-01-05 18:03:36,053][model8_pretrain.py][INFO] Epoch:[0/2](683000/4588595) loss:2.698 lr:0.0000100 epoch_Time:24839.0min: [2024-01-05 18:03:36,053][model8_pretrain.py][INFO] Epoch:[0/2](683000/4588595) loss:2.311 lr:0.0000100 epoch_Time:24839.0min: [2024-01-05 18:03:36,053][model8_pretrain.py][INFO] Epoch:[0/2](683000/4588595) loss:2.354 lr:0.0000100 epoch_Time:24839.0min: [2024-01-05 18:03:36,053][model8_pretrain.py][INFO] Epoch:[0/2](683000/4588595) loss:2.766 lr:0.0000100 epoch_Time:24839.0min: [2024-01-05 18:03:36,053][model8_pretrain.py][INFO] Epoch:[0/2](683000/4588595) loss:2.801 lr:0.0000100 epoch_Time:24839.0min: [2024-01-05 18:04:12,981][model8_pretrain.py][INFO] Epoch:[0/2](683100/4588595) loss:2.254 lr:0.0000100 epoch_Time:24838.0min: [2024-01-05 18:04:12,981][model8_pretrain.py][INFO] Epoch:[0/2](683100/4588595) loss:3.276 lr:0.0000100 epoch_Time:24838.0min: [2024-01-05 18:04:12,982][model8_pretrain.py][INFO] Epoch:[0/2](683100/4588595) loss:2.554 lr:0.0000100 epoch_Time:24838.0min: [2024-01-05 18:04:12,982][model8_pretrain.py][INFO] Epoch:[0/2](683100/4588595) loss:1.944 lr:0.0000100 epoch_Time:24838.0min: [2024-01-05 18:04:12,982][model8_pretrain.py][INFO] Epoch:[0/2](683100/4588595) loss:3.500 lr:0.0000100 epoch_Time:24838.0min: [2024-01-05 18:04:12,982][model8_pretrain.py][INFO] Epoch:[0/2](683100/4588595) loss:2.589 lr:0.0000100 epoch_Time:24838.0min: [2024-01-05 18:04:12,982][model8_pretrain.py][INFO] Epoch:[0/2](683100/4588595) loss:2.617 lr:0.0000100 epoch_Time:24838.0min: [2024-01-05 18:04:12,982][model8_pretrain.py][INFO] Epoch:[0/2](683100/4588595) loss:2.854 lr:0.0000100 
epoch_Time:24838.0min: [2024-01-05 18:04:49,927][model8_pretrain.py][INFO] Epoch:[0/2](683200/4588595) loss:2.658 lr:0.0000100 epoch_Time:24837.0min: [2024-01-05 18:04:49,927][model8_pretrain.py][INFO] Epoch:[0/2](683200/4588595) loss:2.921 lr:0.0000100 epoch_Time:24837.0min: [2024-01-05 18:04:49,927][model8_pretrain.py][INFO] Epoch:[0/2](683200/4588595) loss:2.848 lr:0.0000100 epoch_Time:24837.0min: [2024-01-05 18:04:49,927][model8_pretrain.py][INFO] Epoch:[0/2](683200/4588595) loss:3.000 lr:0.0000100 epoch_Time:24837.0min: [2024-01-05 18:04:49,927][model8_pretrain.py][INFO] Epoch:[0/2](683200/4588595) loss:2.577 lr:0.0000100 epoch_Time:24837.0min: [2024-01-05 18:04:49,927][model8_pretrain.py][INFO] Epoch:[0/2](683200/4588595) loss:2.426 lr:0.0000100 epoch_Time:24837.0min: [2024-01-05 18:04:49,927][model8_pretrain.py][INFO] Epoch:[0/2](683200/4588595) loss:2.541 lr:0.0000100 epoch_Time:24837.0min: [2024-01-05 18:04:49,927][model8_pretrain.py][INFO] Epoch:[0/2](683200/4588595) loss:2.778 lr:0.0000100 epoch_Time:24837.0min: [2024-01-05 18:05:26,868][model8_pretrain.py][INFO] Epoch:[0/2](683300/4588595) loss:2.462 lr:0.0000100 epoch_Time:24837.0min: [2024-01-05 18:05:26,868][model8_pretrain.py][INFO] Epoch:[0/2](683300/4588595) loss:2.520 lr:0.0000100 epoch_Time:24837.0min: [2024-01-05 18:05:26,868][model8_pretrain.py][INFO] Epoch:[0/2](683300/4588595) loss:2.344 lr:0.0000100 epoch_Time:24837.0min: [2024-01-05 18:05:26,868][model8_pretrain.py][INFO] Epoch:[0/2](683300/4588595) loss:3.034 lr:0.0000100 epoch_Time:24837.0min: [2024-01-05 18:05:26,868][model8_pretrain.py][INFO] Epoch:[0/2](683300/4588595) loss:3.230 lr:0.0000100 epoch_Time:24837.0min: [2024-01-05 18:05:26,868][model8_pretrain.py][INFO] Epoch:[0/2](683300/4588595) loss:2.912 lr:0.0000100 epoch_Time:24837.0min: [2024-01-05 18:05:26,868][model8_pretrain.py][INFO] Epoch:[0/2](683300/4588595) loss:2.819 lr:0.0000100 epoch_Time:24837.0min: [2024-01-05 18:05:26,868][model8_pretrain.py][INFO] 
Epoch:[0/2](683300/4588595) loss:2.783 lr:0.0000100 epoch_Time:24837.0min: [2024-01-05 18:06:03,813][model8_pretrain.py][INFO] Epoch:[0/2](683400/4588595) loss:2.867 lr:0.0000100 epoch_Time:24836.0min: [2024-01-05 18:06:03,813][model8_pretrain.py][INFO] Epoch:[0/2](683400/4588595) loss:3.287 lr:0.0000100 epoch_Time:24836.0min: [2024-01-05 18:06:03,813][model8_pretrain.py][INFO] Epoch:[0/2](683400/4588595) loss:2.905 lr:0.0000100 epoch_Time:24836.0min: [2024-01-05 18:06:03,813][model8_pretrain.py][INFO] Epoch:[0/2](683400/4588595) loss:3.215 lr:0.0000100 epoch_Time:24836.0min: [2024-01-05 18:06:03,813][model8_pretrain.py][INFO] Epoch:[0/2](683400/4588595) loss:3.386 lr:0.0000100 epoch_Time:24836.0min: [2024-01-05 18:06:03,813][model8_pretrain.py][INFO] Epoch:[0/2](683400/4588595) loss:2.634 lr:0.0000100 epoch_Time:24836.0min: [2024-01-05 18:06:03,813][model8_pretrain.py][INFO] Epoch:[0/2](683400/4588595) loss:3.284 lr:0.0000100 epoch_Time:24836.0min: [2024-01-05 18:06:03,814][model8_pretrain.py][INFO] Epoch:[0/2](683400/4588595) loss:2.147 lr:0.0000100 epoch_Time:24836.0min: [2024-01-05 18:06:40,756][model8_pretrain.py][INFO] Epoch:[0/2](683500/4588595) loss:3.110 lr:0.0000100 epoch_Time:24836.0min: [2024-01-05 18:06:40,756][model8_pretrain.py][INFO] Epoch:[0/2](683500/4588595) loss:3.018 lr:0.0000100 epoch_Time:24836.0min: [2024-01-05 18:06:40,756][model8_pretrain.py][INFO] Epoch:[0/2](683500/4588595) loss:2.920 lr:0.0000100 epoch_Time:24836.0min: [2024-01-05 18:06:40,757][model8_pretrain.py][INFO] Epoch:[0/2](683500/4588595) loss:2.717 lr:0.0000100 epoch_Time:24836.0min: [2024-01-05 18:06:40,757][model8_pretrain.py][INFO] Epoch:[0/2](683500/4588595) loss:2.685 lr:0.0000100 epoch_Time:24836.0min: [2024-01-05 18:06:40,757][model8_pretrain.py][INFO] Epoch:[0/2](683500/4588595) loss:2.439 lr:0.0000100 epoch_Time:24836.0min: [2024-01-05 18:06:40,757][model8_pretrain.py][INFO] Epoch:[0/2](683500/4588595) loss:2.487 lr:0.0000100 epoch_Time:24836.0min: [2024-01-05 
18:06:40,757][model8_pretrain.py][INFO] Epoch:[0/2](683500/4588595) loss:2.960 lr:0.0000100 epoch_Time:24836.0min: [2024-01-05 18:07:17,694][model8_pretrain.py][INFO] Epoch:[0/2](683600/4588595) loss:3.183 lr:0.0000100 epoch_Time:24835.0min: [2024-01-05 18:07:17,694][model8_pretrain.py][INFO] Epoch:[0/2](683600/4588595) loss:2.927 lr:0.0000100 epoch_Time:24835.0min: [2024-01-05 18:07:17,694][model8_pretrain.py][INFO] Epoch:[0/2](683600/4588595) loss:3.096 lr:0.0000100 epoch_Time:24835.0min: [2024-01-05 18:07:17,694][model8_pretrain.py][INFO] Epoch:[0/2](683600/4588595) loss:2.636 lr:0.0000100 epoch_Time:24835.0min: [2024-01-05 18:07:17,694][model8_pretrain.py][INFO] Epoch:[0/2](683600/4588595) loss:2.991 lr:0.0000100 epoch_Time:24835.0min: [2024-01-05 18:07:17,694][model8_pretrain.py][INFO] Epoch:[0/2](683600/4588595) loss:3.097 lr:0.0000100 epoch_Time:24835.0min: [2024-01-05 18:07:17,694][model8_pretrain.py][INFO] Epoch:[0/2](683600/4588595) loss:2.792 lr:0.0000100 epoch_Time:24835.0min: [2024-01-05 18:07:17,694][model8_pretrain.py][INFO] Epoch:[0/2](683600/4588595) loss:3.012 lr:0.0000100 epoch_Time:24835.0min: [2024-01-05 18:07:54,627][model8_pretrain.py][INFO] Epoch:[0/2](683700/4588595) loss:2.710 lr:0.0000100 epoch_Time:24833.0min: [2024-01-05 18:07:54,627][model8_pretrain.py][INFO] Epoch:[0/2](683700/4588595) loss:2.713 lr:0.0000100 epoch_Time:24833.0min: [2024-01-05 18:07:54,627][model8_pretrain.py][INFO] Epoch:[0/2](683700/4588595) loss:2.634 lr:0.0000100 epoch_Time:24833.0min: [2024-01-05 18:07:54,627][model8_pretrain.py][INFO] Epoch:[0/2](683700/4588595) loss:2.888 lr:0.0000100 epoch_Time:24833.0min: [2024-01-05 18:07:54,627][model8_pretrain.py][INFO] Epoch:[0/2](683700/4588595) loss:2.826 lr:0.0000100 epoch_Time:24833.0min: [2024-01-05 18:07:54,627][model8_pretrain.py][INFO] Epoch:[0/2](683700/4588595) loss:2.831 lr:0.0000100 epoch_Time:24833.0min: [2024-01-05 18:07:54,627][model8_pretrain.py][INFO] Epoch:[0/2](683700/4588595) loss:3.028 lr:0.0000100 
epoch_Time:24833.0min: [2024-01-05 18:07:54,627][model8_pretrain.py][INFO] Epoch:[0/2](683700/4588595) loss:2.825 lr:0.0000100 epoch_Time:24833.0min: [2024-01-05 18:08:43,691][model8_pretrain.py][INFO] Epoch:[0/2](683800/4588595) loss:2.913 lr:0.0000100 epoch_Time:24835.0min: [2024-01-05 18:08:43,691][model8_pretrain.py][INFO] Epoch:[0/2](683800/4588595) loss:3.041 lr:0.0000100 epoch_Time:24835.0min: [2024-01-05 18:08:43,691][model8_pretrain.py][INFO] Epoch:[0/2](683800/4588595) loss:2.865 lr:0.0000100 epoch_Time:24835.0min: [2024-01-05 18:08:43,691][model8_pretrain.py][INFO] Epoch:[0/2](683800/4588595) loss:2.967 lr:0.0000100 epoch_Time:24835.0min: [2024-01-05 18:08:43,691][model8_pretrain.py][INFO] Epoch:[0/2](683800/4588595) loss:3.103 lr:0.0000100 epoch_Time:24835.0min: [2024-01-05 18:08:43,692][model8_pretrain.py][INFO] Epoch:[0/2](683800/4588595) loss:2.569 lr:0.0000100 epoch_Time:24835.0min: [2024-01-05 18:08:43,692][model8_pretrain.py][INFO] Epoch:[0/2](683800/4588595) loss:2.662 lr:0.0000100 epoch_Time:24835.0min: [2024-01-05 18:08:43,692][model8_pretrain.py][INFO] Epoch:[0/2](683800/4588595) loss:2.632 lr:0.0000100 epoch_Time:24835.0min: [2024-01-05 18:09:20,628][model8_pretrain.py][INFO] Epoch:[0/2](683900/4588595) loss:3.005 lr:0.0000100 epoch_Time:24834.0min: [2024-01-05 18:09:20,628][model8_pretrain.py][INFO] Epoch:[0/2](683900/4588595) loss:2.197 lr:0.0000100 epoch_Time:24834.0min: [2024-01-05 18:09:20,628][model8_pretrain.py][INFO] Epoch:[0/2](683900/4588595) loss:2.555 lr:0.0000100 epoch_Time:24834.0min: [2024-01-05 18:09:20,628][model8_pretrain.py][INFO] Epoch:[0/2](683900/4588595) loss:3.204 lr:0.0000100 epoch_Time:24834.0min: [2024-01-05 18:09:20,628][model8_pretrain.py][INFO] Epoch:[0/2](683900/4588595) loss:3.092 lr:0.0000100 epoch_Time:24834.0min: [2024-01-05 18:09:20,628][model8_pretrain.py][INFO] Epoch:[0/2](683900/4588595) loss:2.319 lr:0.0000100 epoch_Time:24834.0min: [2024-01-05 18:09:20,628][model8_pretrain.py][INFO] 
Epoch:[0/2](683900/4588595) loss:2.900 lr:0.0000100 epoch_Time:24834.0min: [2024-01-05 18:09:20,628][model8_pretrain.py][INFO] Epoch:[0/2](683900/4588595) loss:3.084 lr:0.0000100 epoch_Time:24834.0min: [2024-01-05 18:09:57,579][model8_pretrain.py][INFO] Epoch:[0/2](684000/4588595) loss:2.761 lr:0.0000100 epoch_Time:24832.0min: [2024-01-05 18:09:57,579][model8_pretrain.py][INFO] Epoch:[0/2](684000/4588595) loss:3.010 lr:0.0000100 epoch_Time:24832.0min: [2024-01-05 18:09:57,579][model8_pretrain.py][INFO] Epoch:[0/2](684000/4588595) loss:2.783 lr:0.0000100 epoch_Time:24832.0min: [2024-01-05 18:09:57,579][model8_pretrain.py][INFO] Epoch:[0/2](684000/4588595) loss:3.287 lr:0.0000100 epoch_Time:24832.0min: [2024-01-05 18:09:57,579][model8_pretrain.py][INFO] Epoch:[0/2](684000/4588595) loss:3.145 lr:0.0000100 epoch_Time:24832.0min: [2024-01-05 18:09:57,579][model8_pretrain.py][INFO] Epoch:[0/2](684000/4588595) loss:2.346 lr:0.0000100 epoch_Time:24832.0min: [2024-01-05 18:09:57,579][model8_pretrain.py][INFO] Epoch:[0/2](684000/4588595) loss:2.787 lr:0.0000100 epoch_Time:24832.0min: [2024-01-05 18:09:57,579][model8_pretrain.py][INFO] Epoch:[0/2](684000/4588595) loss:2.732 lr:0.0000100 epoch_Time:24832.0min: [2024-01-05 18:10:34,533][model8_pretrain.py][INFO] Epoch:[0/2](684100/4588595) loss:2.935 lr:0.0000100 epoch_Time:24832.0min: [2024-01-05 18:10:34,533][model8_pretrain.py][INFO] Epoch:[0/2](684100/4588595) loss:2.558 lr:0.0000100 epoch_Time:24832.0min: [2024-01-05 18:10:34,533][model8_pretrain.py][INFO] Epoch:[0/2](684100/4588595) loss:2.772 lr:0.0000100 epoch_Time:24832.0min: [2024-01-05 18:10:34,533][model8_pretrain.py][INFO] Epoch:[0/2](684100/4588595) loss:2.947 lr:0.0000100 epoch_Time:24832.0min: [2024-01-05 18:10:34,533][model8_pretrain.py][INFO] Epoch:[0/2](684100/4588595) loss:2.840 lr:0.0000100 epoch_Time:24832.0min: [2024-01-05 18:10:34,533][model8_pretrain.py][INFO] Epoch:[0/2](684100/4588595) loss:2.750 lr:0.0000100 epoch_Time:24832.0min: [2024-01-05 
18:10:34,533][model8_pretrain.py][INFO] Epoch:[0/2](684100/4588595) loss:2.640 lr:0.0000100 epoch_Time:24832.0min: [2024-01-05 18:10:34,533][model8_pretrain.py][INFO] Epoch:[0/2](684100/4588595) loss:2.816 lr:0.0000100 epoch_Time:24832.0min: [2024-01-05 18:11:11,478][model8_pretrain.py][INFO] Epoch:[0/2](684200/4588595) loss:2.539 lr:0.0000100 epoch_Time:24831.0min: [2024-01-05 18:11:11,478][model8_pretrain.py][INFO] Epoch:[0/2](684200/4588595) loss:2.656 lr:0.0000100 epoch_Time:24831.0min: [2024-01-05 18:11:11,478][model8_pretrain.py][INFO] Epoch:[0/2](684200/4588595) loss:2.636 lr:0.0000100 epoch_Time:24831.0min: [2024-01-05 18:11:11,478][model8_pretrain.py][INFO] Epoch:[0/2](684200/4588595) loss:2.611 lr:0.0000100 epoch_Time:24831.0min: [2024-01-05 18:11:11,479][model8_pretrain.py][INFO] Epoch:[0/2](684200/4588595) loss:3.344 lr:0.0000100 epoch_Time:24831.0min: [2024-01-05 18:11:11,478][model8_pretrain.py][INFO] Epoch:[0/2](684200/4588595) loss:2.126 lr:0.0000100 epoch_Time:24831.0min: [2024-01-05 18:11:11,478][model8_pretrain.py][INFO] Epoch:[0/2](684200/4588595) loss:2.891 lr:0.0000100 epoch_Time:24831.0min: [2024-01-05 18:11:11,479][model8_pretrain.py][INFO] Epoch:[0/2](684200/4588595) loss:2.090 lr:0.0000100 epoch_Time:24831.0min: [2024-01-05 18:11:48,437][model8_pretrain.py][INFO] Epoch:[0/2](684300/4588595) loss:2.566 lr:0.0000100 epoch_Time:24830.0min: [2024-01-05 18:11:48,437][model8_pretrain.py][INFO] Epoch:[0/2](684300/4588595) loss:3.224 lr:0.0000100 epoch_Time:24830.0min: [2024-01-05 18:11:48,437][model8_pretrain.py][INFO] Epoch:[0/2](684300/4588595) loss:2.675 lr:0.0000100 epoch_Time:24830.0min: [2024-01-05 18:11:48,437][model8_pretrain.py][INFO] Epoch:[0/2](684300/4588595) loss:3.037 lr:0.0000100 epoch_Time:24830.0min: [2024-01-05 18:11:48,437][model8_pretrain.py][INFO] Epoch:[0/2](684300/4588595) loss:2.984 lr:0.0000100 epoch_Time:24830.0min: [2024-01-05 18:11:48,437][model8_pretrain.py][INFO] Epoch:[0/2](684300/4588595) loss:2.942 lr:0.0000100 
epoch_Time:24830.0min: [2024-01-05 18:11:48,437][model8_pretrain.py][INFO] Epoch:[0/2](684300/4588595) loss:2.846 lr:0.0000100 epoch_Time:24830.0min: [2024-01-05 18:11:48,437][model8_pretrain.py][INFO] Epoch:[0/2](684300/4588595) loss:3.035 lr:0.0000100 epoch_Time:24830.0min: [2024-01-05 18:12:25,386][model8_pretrain.py][INFO] Epoch:[0/2](684400/4588595) loss:3.411 lr:0.0000100 epoch_Time:24830.0min: [2024-01-05 18:12:25,386][model8_pretrain.py][INFO] Epoch:[0/2](684400/4588595) loss:2.954 lr:0.0000100 epoch_Time:24830.0min: [2024-01-05 18:12:25,386][model8_pretrain.py][INFO] Epoch:[0/2](684400/4588595) loss:2.243 lr:0.0000100 epoch_Time:24830.0min: [2024-01-05 18:12:25,386][model8_pretrain.py][INFO] Epoch:[0/2](684400/4588595) loss:3.454 lr:0.0000100 epoch_Time:24830.0min: [2024-01-05 18:12:25,386][model8_pretrain.py][INFO] Epoch:[0/2](684400/4588595) loss:3.199 lr:0.0000100 epoch_Time:24830.0min: [2024-01-05 18:12:25,386][model8_pretrain.py][INFO] Epoch:[0/2](684400/4588595) loss:2.842 lr:0.0000100 epoch_Time:24830.0min: [2024-01-05 18:12:25,386][model8_pretrain.py][INFO] Epoch:[0/2](684400/4588595) loss:2.470 lr:0.0000100 epoch_Time:24830.0min: [2024-01-05 18:12:25,386][model8_pretrain.py][INFO] Epoch:[0/2](684400/4588595) loss:2.222 lr:0.0000100 epoch_Time:24830.0min: [2024-01-05 18:13:02,344][model8_pretrain.py][INFO] Epoch:[0/2](684500/4588595) loss:2.439 lr:0.0000100 epoch_Time:24829.0min: [2024-01-05 18:13:02,344][model8_pretrain.py][INFO] Epoch:[0/2](684500/4588595) loss:2.879 lr:0.0000100 epoch_Time:24829.0min: [2024-01-05 18:13:02,344][model8_pretrain.py][INFO] Epoch:[0/2](684500/4588595) loss:2.543 lr:0.0000100 epoch_Time:24829.0min: [2024-01-05 18:13:02,344][model8_pretrain.py][INFO] Epoch:[0/2](684500/4588595) loss:2.125 lr:0.0000100 epoch_Time:24829.0min: [2024-01-05 18:13:02,344][model8_pretrain.py][INFO] Epoch:[0/2](684500/4588595) loss:2.419 lr:0.0000100 epoch_Time:24829.0min: [2024-01-05 18:13:02,344][model8_pretrain.py][INFO] 
Epoch:[0/2](684500/4588595) loss:2.798 lr:0.0000100 epoch_Time:24829.0min: [2024-01-05 18:13:02,344][model8_pretrain.py][INFO] Epoch:[0/2](684500/4588595) loss:2.746 lr:0.0000100 epoch_Time:24829.0min: [2024-01-05 18:13:02,344][model8_pretrain.py][INFO] Epoch:[0/2](684500/4588595) loss:3.241 lr:0.0000100 epoch_Time:24829.0min: [2024-01-05 18:13:51,443][model8_pretrain.py][INFO] Epoch:[0/2](684600/4588595) loss:3.124 lr:0.0000100 epoch_Time:24829.0min: [2024-01-05 18:13:51,443][model8_pretrain.py][INFO] Epoch:[0/2](684600/4588595) loss:3.096 lr:0.0000100 epoch_Time:24829.0min: [2024-01-05 18:13:51,444][model8_pretrain.py][INFO] Epoch:[0/2](684600/4588595) loss:1.982 lr:0.0000100 epoch_Time:24829.0min: [2024-01-05 18:13:51,444][model8_pretrain.py][INFO] Epoch:[0/2](684600/4588595) loss:2.882 lr:0.0000100 epoch_Time:24829.0min: [2024-01-05 18:13:51,444][model8_pretrain.py][INFO] Epoch:[0/2](684600/4588595) loss:2.589 lr:0.0000100 epoch_Time:24829.0min: [2024-01-05 18:13:51,444][model8_pretrain.py][INFO] Epoch:[0/2](684600/4588595) loss:2.603 lr:0.0000100 epoch_Time:24829.0min: [2024-01-05 18:13:51,444][model8_pretrain.py][INFO] Epoch:[0/2](684600/4588595) loss:2.570 lr:0.0000100 epoch_Time:24829.0min: [2024-01-05 18:13:51,444][model8_pretrain.py][INFO] Epoch:[0/2](684600/4588595) loss:2.988 lr:0.0000100 epoch_Time:24829.0min: [2024-01-05 18:14:28,336][model8_pretrain.py][INFO] Epoch:[0/2](684700/4588595) loss:2.759 lr:0.0000100 epoch_Time:24829.0min: [2024-01-05 18:14:28,336][model8_pretrain.py][INFO] Epoch:[0/2](684700/4588595) loss:3.086 lr:0.0000100 epoch_Time:24829.0min: [2024-01-05 18:14:28,336][model8_pretrain.py][INFO] Epoch:[0/2](684700/4588595) loss:2.959 lr:0.0000100 epoch_Time:24829.0min: [2024-01-05 18:14:28,336][model8_pretrain.py][INFO] Epoch:[0/2](684700/4588595) loss:2.871 lr:0.0000100 epoch_Time:24829.0min: [2024-01-05 18:14:28,336][model8_pretrain.py][INFO] Epoch:[0/2](684700/4588595) loss:2.594 lr:0.0000100 epoch_Time:24829.0min: [2024-01-05 
18:14:28,336][model8_pretrain.py][INFO] Epoch:[0/2](684700/4588595) loss:2.745 lr:0.0000100 epoch_Time:24829.0min: [2024-01-05 18:14:28,336][model8_pretrain.py][INFO] Epoch:[0/2](684700/4588595) loss:2.903 lr:0.0000100 epoch_Time:24829.0min: [2024-01-05 18:14:28,336][model8_pretrain.py][INFO] Epoch:[0/2](684700/4588595) loss:2.676 lr:0.0000100 epoch_Time:24829.0min: [2024-01-05 18:15:05,257][model8_pretrain.py][INFO] Epoch:[0/2](684800/4588595) loss:3.218 lr:0.0000100 epoch_Time:24828.0min: [2024-01-05 18:15:05,257][model8_pretrain.py][INFO] Epoch:[0/2](684800/4588595) loss:2.988 lr:0.0000100 epoch_Time:24828.0min: [2024-01-05 18:15:05,257][model8_pretrain.py][INFO] Epoch:[0/2](684800/4588595) loss:2.585 lr:0.0000100 epoch_Time:24828.0min: [2024-01-05 18:15:05,257][model8_pretrain.py][INFO] Epoch:[0/2](684800/4588595) loss:2.896 lr:0.0000100 epoch_Time:24828.0min: [2024-01-05 18:15:05,257][model8_pretrain.py][INFO] Epoch:[0/2](684800/4588595) loss:2.314 lr:0.0000100 epoch_Time:24828.0min: [2024-01-05 18:15:05,257][model8_pretrain.py][INFO] Epoch:[0/2](684800/4588595) loss:2.858 lr:0.0000100 epoch_Time:24828.0min: [2024-01-05 18:15:05,257][model8_pretrain.py][INFO] Epoch:[0/2](684800/4588595) loss:2.862 lr:0.0000100 epoch_Time:24828.0min: [2024-01-05 18:15:05,257][model8_pretrain.py][INFO] Epoch:[0/2](684800/4588595) loss:2.695 lr:0.0000100 epoch_Time:24828.0min: [2024-01-05 18:15:42,198][model8_pretrain.py][INFO] Epoch:[0/2](684900/4588595) loss:2.769 lr:0.0000100 epoch_Time:24828.0min: [2024-01-05 18:15:42,198][model8_pretrain.py][INFO] Epoch:[0/2](684900/4588595) loss:2.403 lr:0.0000100 epoch_Time:24828.0min: [2024-01-05 18:15:42,198][model8_pretrain.py][INFO] Epoch:[0/2](684900/4588595) loss:3.202 lr:0.0000100 epoch_Time:24828.0min: [2024-01-05 18:15:42,198][model8_pretrain.py][INFO] Epoch:[0/2](684900/4588595) loss:2.841 lr:0.0000100 epoch_Time:24828.0min: [2024-01-05 18:15:42,198][model8_pretrain.py][INFO] Epoch:[0/2](684900/4588595) loss:2.694 lr:0.0000100 
epoch_Time:24828.0min: [2024-01-05 18:15:42,198][model8_pretrain.py][INFO] Epoch:[0/2](684900/4588595) loss:3.096 lr:0.0000100 epoch_Time:24828.0min: [2024-01-05 18:15:42,198][model8_pretrain.py][INFO] Epoch:[0/2](684900/4588595) loss:2.784 lr:0.0000100 epoch_Time:24828.0min: [2024-01-05 18:15:42,198][model8_pretrain.py][INFO] Epoch:[0/2](684900/4588595) loss:3.235 lr:0.0000100 epoch_Time:24828.0min: [2024-01-05 18:16:19,139][model8_pretrain.py][INFO] Epoch:[0/2](685000/4588595) loss:2.459 lr:0.0000100 epoch_Time:24826.0min: [2024-01-05 18:16:19,139][model8_pretrain.py][INFO] Epoch:[0/2](685000/4588595) loss:2.488 lr:0.0000100 epoch_Time:24826.0min: [2024-01-05 18:16:19,139][model8_pretrain.py][INFO] Epoch:[0/2](685000/4588595) loss:2.885 lr:0.0000100 epoch_Time:24826.0min: [2024-01-05 18:16:19,139][model8_pretrain.py][INFO] Epoch:[0/2](685000/4588595) loss:3.144 lr:0.0000100 epoch_Time:24826.0min: [2024-01-05 18:16:19,139][model8_pretrain.py][INFO] Epoch:[0/2](685000/4588595) loss:2.847 lr:0.0000100 epoch_Time:24826.0min: [2024-01-05 18:16:19,139][model8_pretrain.py][INFO] Epoch:[0/2](685000/4588595) loss:2.771 lr:0.0000100 epoch_Time:24826.0min: [2024-01-05 18:16:19,140][model8_pretrain.py][INFO] Epoch:[0/2](685000/4588595) loss:3.115 lr:0.0000100 epoch_Time:24826.0min: [2024-01-05 18:16:19,140][model8_pretrain.py][INFO] Epoch:[0/2](685000/4588595) loss:2.851 lr:0.0000100 epoch_Time:24826.0min: [2024-01-05 18:16:56,069][model8_pretrain.py][INFO] Epoch:[0/2](685100/4588595) loss:2.722 lr:0.0000100 epoch_Time:24825.0min: [2024-01-05 18:16:56,069][model8_pretrain.py][INFO] Epoch:[0/2](685100/4588595) loss:2.670 lr:0.0000100 epoch_Time:24825.0min: [2024-01-05 18:16:56,069][model8_pretrain.py][INFO] Epoch:[0/2](685100/4588595) loss:2.923 lr:0.0000100 epoch_Time:24825.0min: [2024-01-05 18:16:56,069][model8_pretrain.py][INFO] Epoch:[0/2](685100/4588595) loss:2.996 lr:0.0000100 epoch_Time:24825.0min: [2024-01-05 18:16:56,069][model8_pretrain.py][INFO] 
Epoch:[0/2](685100/4588595) loss:3.039 lr:0.0000100 epoch_Time:24825.0min: [2024-01-05 18:16:56,069][model8_pretrain.py][INFO] Epoch:[0/2](685100/4588595) loss:3.067 lr:0.0000100 epoch_Time:24825.0min: [2024-01-05 18:16:56,069][model8_pretrain.py][INFO] Epoch:[0/2](685100/4588595) loss:3.422 lr:0.0000100 epoch_Time:24825.0min: [2024-01-05 18:16:56,069][model8_pretrain.py][INFO] Epoch:[0/2](685100/4588595) loss:2.831 lr:0.0000100 epoch_Time:24825.0min: [2024-01-05 18:17:33,009][model8_pretrain.py][INFO] Epoch:[0/2](685200/4588595) loss:2.409 lr:0.0000100 epoch_Time:24825.0min: [2024-01-05 18:17:33,009][model8_pretrain.py][INFO] Epoch:[0/2](685200/4588595) loss:3.047 lr:0.0000100 epoch_Time:24825.0min: [2024-01-05 18:17:33,009][model8_pretrain.py][INFO] Epoch:[0/2](685200/4588595) loss:3.237 lr:0.0000100 epoch_Time:24825.0min: [2024-01-05 18:17:33,009][model8_pretrain.py][INFO] Epoch:[0/2](685200/4588595) loss:2.831 lr:0.0000100 epoch_Time:24825.0min: [2024-01-05 18:17:33,009][model8_pretrain.py][INFO] Epoch:[0/2](685200/4588595) loss:2.520 lr:0.0000100 epoch_Time:24825.0min: [2024-01-05 18:17:33,009][model8_pretrain.py][INFO] Epoch:[0/2](685200/4588595) loss:3.193 lr:0.0000100 epoch_Time:24825.0min: [2024-01-05 18:17:33,009][model8_pretrain.py][INFO] Epoch:[0/2](685200/4588595) loss:3.005 lr:0.0000100 epoch_Time:24825.0min: [2024-01-05 18:17:33,010][model8_pretrain.py][INFO] Epoch:[0/2](685200/4588595) loss:2.952 lr:0.0000100 epoch_Time:24825.0min: [2024-01-05 18:18:09,945][model8_pretrain.py][INFO] Epoch:[0/2](685300/4588595) loss:2.888 lr:0.0000100 epoch_Time:24824.0min: [2024-01-05 18:18:09,945][model8_pretrain.py][INFO] Epoch:[0/2](685300/4588595) loss:2.625 lr:0.0000100 epoch_Time:24824.0min: [2024-01-05 18:18:09,945][model8_pretrain.py][INFO] Epoch:[0/2](685300/4588595) loss:2.246 lr:0.0000100 epoch_Time:24824.0min: [2024-01-05 18:18:09,945][model8_pretrain.py][INFO] Epoch:[0/2](685300/4588595) loss:3.082 lr:0.0000100 epoch_Time:24824.0min: [2024-01-05 
18:18:09,945][model8_pretrain.py][INFO] Epoch:[0/2](685300/4588595) loss:2.893 lr:0.0000100 epoch_Time:24824.0min: [2024-01-05 18:18:09,945][model8_pretrain.py][INFO] Epoch:[0/2](685300/4588595) loss:2.977 lr:0.0000100 epoch_Time:24824.0min: [2024-01-05 18:18:09,945][model8_pretrain.py][INFO] Epoch:[0/2](685300/4588595) loss:2.197 lr:0.0000100 epoch_Time:24824.0min: [2024-01-05 18:18:09,945][model8_pretrain.py][INFO] Epoch:[0/2](685300/4588595) loss:2.901 lr:0.0000100 epoch_Time:24824.0min: [2024-01-05 18:18:58,931][model8_pretrain.py][INFO] Epoch:[0/2](685400/4588595) loss:1.964 lr:0.0000100 epoch_Time:24824.0min: [2024-01-05 18:18:58,931][model8_pretrain.py][INFO] Epoch:[0/2](685400/4588595) loss:2.579 lr:0.0000100 epoch_Time:24824.0min: [2024-01-05 18:18:58,931][model8_pretrain.py][INFO] Epoch:[0/2](685400/4588595) loss:2.597 lr:0.0000100 epoch_Time:24824.0min: [2024-01-05 18:18:58,931][model8_pretrain.py][INFO] Epoch:[0/2](685400/4588595) loss:2.995 lr:0.0000100 epoch_Time:24824.0min: [2024-01-05 18:18:58,931][model8_pretrain.py][INFO] Epoch:[0/2](685400/4588595) loss:2.805 lr:0.0000100 epoch_Time:24824.0min: [2024-01-05 18:18:58,931][model8_pretrain.py][INFO] Epoch:[0/2](685400/4588595) loss:2.703 lr:0.0000100 epoch_Time:24824.0min: [2024-01-05 18:18:58,931][model8_pretrain.py][INFO] Epoch:[0/2](685400/4588595) loss:2.579 lr:0.0000100 epoch_Time:24824.0min: [2024-01-05 18:18:58,931][model8_pretrain.py][INFO] Epoch:[0/2](685400/4588595) loss:3.226 lr:0.0000100 epoch_Time:24824.0min: [2024-01-05 18:19:35,847][model8_pretrain.py][INFO] Epoch:[0/2](685500/4588595) loss:2.945 lr:0.0000100 epoch_Time:24824.0min: [2024-01-05 18:19:35,847][model8_pretrain.py][INFO] Epoch:[0/2](685500/4588595) loss:2.945 lr:0.0000100 epoch_Time:24824.0min: [2024-01-05 18:19:35,847][model8_pretrain.py][INFO] Epoch:[0/2](685500/4588595) loss:2.838 lr:0.0000100 epoch_Time:24824.0min: [2024-01-05 18:19:35,847][model8_pretrain.py][INFO] Epoch:[0/2](685500/4588595) loss:2.662 lr:0.0000100 
epoch_Time:24824.0min: [2024-01-05 18:19:35,847][model8_pretrain.py][INFO] Epoch:[0/2](685500/4588595) loss:2.747 lr:0.0000100 epoch_Time:24824.0min: [2024-01-05 18:19:35,847][model8_pretrain.py][INFO] Epoch:[0/2](685500/4588595) loss:3.165 lr:0.0000100 epoch_Time:24824.0min: [2024-01-05 18:19:35,847][model8_pretrain.py][INFO] Epoch:[0/2](685500/4588595) loss:2.511 lr:0.0000100 epoch_Time:24824.0min: [2024-01-05 18:19:35,847][model8_pretrain.py][INFO] Epoch:[0/2](685500/4588595) loss:2.358 lr:0.0000100 epoch_Time:24824.0min: [2024-01-05 18:20:12,789][model8_pretrain.py][INFO] Epoch:[0/2](685600/4588595) loss:2.898 lr:0.0000100 epoch_Time:24823.0min: [2024-01-05 18:20:12,789][model8_pretrain.py][INFO] Epoch:[0/2](685600/4588595) loss:2.580 lr:0.0000100 epoch_Time:24823.0min: [2024-01-05 18:20:12,789][model8_pretrain.py][INFO] Epoch:[0/2](685600/4588595) loss:2.652 lr:0.0000100 epoch_Time:24823.0min: [2024-01-05 18:20:12,789][model8_pretrain.py][INFO] Epoch:[0/2](685600/4588595) loss:2.210 lr:0.0000100 epoch_Time:24823.0min: [2024-01-05 18:20:12,789][model8_pretrain.py][INFO] Epoch:[0/2](685600/4588595) loss:3.154 lr:0.0000100 epoch_Time:24823.0min: [2024-01-05 18:20:12,789][model8_pretrain.py][INFO] Epoch:[0/2](685600/4588595) loss:2.917 lr:0.0000100 epoch_Time:24823.0min: [2024-01-05 18:20:12,789][model8_pretrain.py][INFO] Epoch:[0/2](685600/4588595) loss:3.014 lr:0.0000100 epoch_Time:24823.0min: [2024-01-05 18:20:12,789][model8_pretrain.py][INFO] Epoch:[0/2](685600/4588595) loss:2.814 lr:0.0000100 epoch_Time:24823.0min: [2024-01-05 18:20:49,725][model8_pretrain.py][INFO] Epoch:[0/2](685700/4588595) loss:2.706 lr:0.0000100 epoch_Time:24822.0min: [2024-01-05 18:20:49,725][model8_pretrain.py][INFO] Epoch:[0/2](685700/4588595) loss:2.654 lr:0.0000100 epoch_Time:24822.0min: [2024-01-05 18:20:49,725][model8_pretrain.py][INFO] Epoch:[0/2](685700/4588595) loss:3.113 lr:0.0000100 epoch_Time:24822.0min: [2024-01-05 18:20:49,725][model8_pretrain.py][INFO] 
Epoch:[0/2](685700/4588595) loss:2.373 lr:0.0000100 epoch_Time:24822.0min: [2024-01-05 18:20:49,725][model8_pretrain.py][INFO] Epoch:[0/2](685700/4588595) loss:2.636 lr:0.0000100 epoch_Time:24822.0min: [2024-01-05 18:20:49,725][model8_pretrain.py][INFO] Epoch:[0/2](685700/4588595) loss:3.243 lr:0.0000100 epoch_Time:24822.0min: [2024-01-05 18:20:49,725][model8_pretrain.py][INFO] Epoch:[0/2](685700/4588595) loss:2.611 lr:0.0000100 epoch_Time:24822.0min: [2024-01-05 18:20:49,725][model8_pretrain.py][INFO] Epoch:[0/2](685700/4588595) loss:2.871 lr:0.0000100 epoch_Time:24822.0min: [2024-01-05 18:21:26,654][model8_pretrain.py][INFO] Epoch:[0/2](685800/4588595) loss:2.707 lr:0.0000100 epoch_Time:24822.0min: [2024-01-05 18:21:26,654][model8_pretrain.py][INFO] Epoch:[0/2](685800/4588595) loss:2.860 lr:0.0000100 epoch_Time:24822.0min: [2024-01-05 18:21:26,654][model8_pretrain.py][INFO] Epoch:[0/2](685800/4588595) loss:3.207 lr:0.0000100 epoch_Time:24822.0min: [2024-01-05 18:21:26,654][model8_pretrain.py][INFO] Epoch:[0/2](685800/4588595) loss:2.537 lr:0.0000100 epoch_Time:24822.0min: [2024-01-05 18:21:26,654][model8_pretrain.py][INFO] Epoch:[0/2](685800/4588595) loss:2.990 lr:0.0000100 epoch_Time:24822.0min: [2024-01-05 18:21:26,654][model8_pretrain.py][INFO] Epoch:[0/2](685800/4588595) loss:2.391 lr:0.0000100 epoch_Time:24822.0min: [2024-01-05 18:21:26,654][model8_pretrain.py][INFO] Epoch:[0/2](685800/4588595) loss:2.868 lr:0.0000100 epoch_Time:24822.0min: [2024-01-05 18:21:26,654][model8_pretrain.py][INFO] Epoch:[0/2](685800/4588595) loss:2.554 lr:0.0000100 epoch_Time:24822.0min: [2024-01-05 18:22:03,593][model8_pretrain.py][INFO] Epoch:[0/2](685900/4588595) loss:3.129 lr:0.0000100 epoch_Time:24821.0min: [2024-01-05 18:22:03,593][model8_pretrain.py][INFO] Epoch:[0/2](685900/4588595) loss:3.048 lr:0.0000100 epoch_Time:24821.0min: [2024-01-05 18:22:03,593][model8_pretrain.py][INFO] Epoch:[0/2](685900/4588595) loss:3.032 lr:0.0000100 epoch_Time:24821.0min: [2024-01-05 
18:22:03,593][model8_pretrain.py][INFO] Epoch:[0/2](685900/4588595) loss:2.917 lr:0.0000100 epoch_Time:24821.0min: [2024-01-05 18:22:03,593][model8_pretrain.py][INFO] Epoch:[0/2](685900/4588595) loss:2.780 lr:0.0000100 epoch_Time:24821.0min: [2024-01-05 18:22:03,593][model8_pretrain.py][INFO] Epoch:[0/2](685900/4588595) loss:2.345 lr:0.0000100 epoch_Time:24821.0min: [2024-01-05 18:22:03,593][model8_pretrain.py][INFO] Epoch:[0/2](685900/4588595) loss:3.173 lr:0.0000100 epoch_Time:24821.0min: [2024-01-05 18:22:03,593][model8_pretrain.py][INFO] Epoch:[0/2](685900/4588595) loss:2.805 lr:0.0000100 epoch_Time:24821.0min: [2024-01-05 18:22:40,541][model8_pretrain.py][INFO] Epoch:[0/2](686000/4588595) loss:3.101 lr:0.0000100 epoch_Time:24820.0min: [2024-01-05 18:22:40,541][model8_pretrain.py][INFO] Epoch:[0/2](686000/4588595) loss:2.762 lr:0.0000100 epoch_Time:24820.0min: [2024-01-05 18:22:40,541][model8_pretrain.py][INFO] Epoch:[0/2](686000/4588595) loss:3.247 lr:0.0000100 epoch_Time:24820.0min: [2024-01-05 18:22:40,541][model8_pretrain.py][INFO] Epoch:[0/2](686000/4588595) loss:2.508 lr:0.0000100 epoch_Time:24820.0min: [2024-01-05 18:22:40,541][model8_pretrain.py][INFO] Epoch:[0/2](686000/4588595) loss:2.719 lr:0.0000100 epoch_Time:24820.0min: [2024-01-05 18:22:40,541][model8_pretrain.py][INFO] Epoch:[0/2](686000/4588595) loss:2.532 lr:0.0000100 epoch_Time:24820.0min: [2024-01-05 18:22:40,541][model8_pretrain.py][INFO] Epoch:[0/2](686000/4588595) loss:3.232 lr:0.0000100 epoch_Time:24820.0min: [2024-01-05 18:22:40,541][model8_pretrain.py][INFO] Epoch:[0/2](686000/4588595) loss:3.121 lr:0.0000100 epoch_Time:24820.0min: [2024-01-05 18:23:17,495][model8_pretrain.py][INFO] Epoch:[0/2](686100/4588595) loss:2.941 lr:0.0000100 epoch_Time:24819.0min: [2024-01-05 18:23:17,495][model8_pretrain.py][INFO] Epoch:[0/2](686100/4588595) loss:3.004 lr:0.0000100 epoch_Time:24819.0min: [2024-01-05 18:23:17,495][model8_pretrain.py][INFO] Epoch:[0/2](686100/4588595) loss:2.866 lr:0.0000100 
epoch_Time:24819.0min: [2024-01-05 18:23:17,495][model8_pretrain.py][INFO] Epoch:[0/2](686100/4588595) loss:3.122 lr:0.0000100 epoch_Time:24819.0min: [2024-01-05 18:23:17,495][model8_pretrain.py][INFO] Epoch:[0/2](686100/4588595) loss:2.921 lr:0.0000100 epoch_Time:24819.0min: [2024-01-05 18:23:17,495][model8_pretrain.py][INFO] Epoch:[0/2](686100/4588595) loss:2.891 lr:0.0000100 epoch_Time:24819.0min: [2024-01-05 18:23:17,495][model8_pretrain.py][INFO] Epoch:[0/2](686100/4588595) loss:2.745 lr:0.0000100 epoch_Time:24819.0min: [2024-01-05 18:23:17,495][model8_pretrain.py][INFO] Epoch:[0/2](686100/4588595) loss:2.786 lr:0.0000100 epoch_Time:24819.0min: [2024-01-05 18:24:06,514][model8_pretrain.py][INFO] Epoch:[0/2](686200/4588595) loss:2.940 lr:0.0000100 epoch_Time:24819.0min: [2024-01-05 18:24:06,514][model8_pretrain.py][INFO] Epoch:[0/2](686200/4588595) loss:2.442 lr:0.0000100 epoch_Time:24819.0min: [2024-01-05 18:24:06,514][model8_pretrain.py][INFO] Epoch:[0/2](686200/4588595) loss:2.668 lr:0.0000100 epoch_Time:24819.0min: [2024-01-05 18:24:06,514][model8_pretrain.py][INFO] Epoch:[0/2](686200/4588595) loss:2.977 lr:0.0000100 epoch_Time:24819.0min: [2024-01-05 18:24:06,514][model8_pretrain.py][INFO] Epoch:[0/2](686200/4588595) loss:2.880 lr:0.0000100 epoch_Time:24819.0min: [2024-01-05 18:24:06,514][model8_pretrain.py][INFO] Epoch:[0/2](686200/4588595) loss:3.058 lr:0.0000100 epoch_Time:24819.0min: [2024-01-05 18:24:06,515][model8_pretrain.py][INFO] Epoch:[0/2](686200/4588595) loss:2.917 lr:0.0000100 epoch_Time:24819.0min: [2024-01-05 18:24:06,514][model8_pretrain.py][INFO] Epoch:[0/2](686200/4588595) loss:3.439 lr:0.0000100 epoch_Time:24819.0min: [2024-01-05 18:24:43,438][model8_pretrain.py][INFO] Epoch:[0/2](686300/4588595) loss:3.272 lr:0.0000100 epoch_Time:24819.0min: [2024-01-05 18:24:43,438][model8_pretrain.py][INFO] Epoch:[0/2](686300/4588595) loss:2.842 lr:0.0000100 epoch_Time:24819.0min: [2024-01-05 18:24:43,438][model8_pretrain.py][INFO] 
Epoch:[0/2](686300/4588595) loss:3.297 lr:0.0000100 epoch_Time:24819.0min: [2024-01-05 18:24:43,438][model8_pretrain.py][INFO] Epoch:[0/2](686300/4588595) loss:2.915 lr:0.0000100 epoch_Time:24819.0min: [2024-01-05 18:24:43,438][model8_pretrain.py][INFO] Epoch:[0/2](686300/4588595) loss:3.057 lr:0.0000100 epoch_Time:24819.0min: [2024-01-05 18:24:43,438][model8_pretrain.py][INFO] Epoch:[0/2](686300/4588595) loss:2.442 lr:0.0000100 epoch_Time:24819.0min: [2024-01-05 18:24:43,438][model8_pretrain.py][INFO] Epoch:[0/2](686300/4588595) loss:3.147 lr:0.0000100 epoch_Time:24819.0min: [2024-01-05 18:24:43,439][model8_pretrain.py][INFO] Epoch:[0/2](686300/4588595) loss:2.488 lr:0.0000100 epoch_Time:24819.0min: [2024-01-05 18:25:20,383][model8_pretrain.py][INFO] Epoch:[0/2](686400/4588595) loss:2.439 lr:0.0000100 epoch_Time:24818.0min: [2024-01-05 18:25:20,383][model8_pretrain.py][INFO] Epoch:[0/2](686400/4588595) loss:3.299 lr:0.0000100 epoch_Time:24818.0min: [2024-01-05 18:25:20,383][model8_pretrain.py][INFO] Epoch:[0/2](686400/4588595) loss:3.281 lr:0.0000100 epoch_Time:24818.0min: [2024-01-05 18:25:20,383][model8_pretrain.py][INFO] Epoch:[0/2](686400/4588595) loss:3.052 lr:0.0000100 epoch_Time:24818.0min: [2024-01-05 18:25:20,383][model8_pretrain.py][INFO] Epoch:[0/2](686400/4588595) loss:3.152 lr:0.0000100 epoch_Time:24818.0min: [2024-01-05 18:25:20,383][model8_pretrain.py][INFO] Epoch:[0/2](686400/4588595) loss:3.375 lr:0.0000100 epoch_Time:24818.0min: [2024-01-05 18:25:20,383][model8_pretrain.py][INFO] Epoch:[0/2](686400/4588595) loss:2.992 lr:0.0000100 epoch_Time:24818.0min: [2024-01-05 18:25:20,383][model8_pretrain.py][INFO] Epoch:[0/2](686400/4588595) loss:2.468 lr:0.0000100 epoch_Time:24818.0min: [2024-01-05 18:25:57,314][model8_pretrain.py][INFO] Epoch:[0/2](686500/4588595) loss:2.691 lr:0.0000100 epoch_Time:24817.0min: [2024-01-05 18:25:57,314][model8_pretrain.py][INFO] Epoch:[0/2](686500/4588595) loss:3.032 lr:0.0000100 epoch_Time:24817.0min: [2024-01-05 
18:25:57,314][model8_pretrain.py][INFO] Epoch:[0/2](686500/4588595) loss:2.260 lr:0.0000100 epoch_Time:24817.0min: [2024-01-05 18:25:57,314][model8_pretrain.py][INFO] Epoch:[0/2](686500/4588595) loss:3.130 lr:0.0000100 epoch_Time:24817.0min: [2024-01-05 18:25:57,314][model8_pretrain.py][INFO] Epoch:[0/2](686500/4588595) loss:3.050 lr:0.0000100 epoch_Time:24817.0min: [2024-01-05 18:25:57,314][model8_pretrain.py][INFO] Epoch:[0/2](686500/4588595) loss:2.768 lr:0.0000100 epoch_Time:24817.0min: [2024-01-05 18:25:57,314][model8_pretrain.py][INFO] Epoch:[0/2](686500/4588595) loss:2.654 lr:0.0000100 epoch_Time:24817.0min: [2024-01-05 18:25:57,314][model8_pretrain.py][INFO] Epoch:[0/2](686500/4588595) loss:2.692 lr:0.0000100 epoch_Time:24817.0min: [2024-01-05 18:26:34,268][model8_pretrain.py][INFO] Epoch:[0/2](686600/4588595) loss:2.670 lr:0.0000100 epoch_Time:24817.0min: [2024-01-05 18:26:34,268][model8_pretrain.py][INFO] Epoch:[0/2](686600/4588595) loss:2.756 lr:0.0000100 epoch_Time:24817.0min: [2024-01-05 18:26:34,269][model8_pretrain.py][INFO] Epoch:[0/2](686600/4588595) loss:2.824 lr:0.0000100 epoch_Time:24817.0min: [2024-01-05 18:26:34,269][model8_pretrain.py][INFO] Epoch:[0/2](686600/4588595) loss:3.092 lr:0.0000100 epoch_Time:24817.0min: [2024-01-05 18:26:34,269][model8_pretrain.py][INFO] Epoch:[0/2](686600/4588595) loss:2.592 lr:0.0000100 epoch_Time:24817.0min: [2024-01-05 18:26:34,269][model8_pretrain.py][INFO] Epoch:[0/2](686600/4588595) loss:2.887 lr:0.0000100 epoch_Time:24817.0min: [2024-01-05 18:26:34,269][model8_pretrain.py][INFO] Epoch:[0/2](686600/4588595) loss:2.952 lr:0.0000100 epoch_Time:24817.0min: [2024-01-05 18:26:34,269][model8_pretrain.py][INFO] Epoch:[0/2](686600/4588595) loss:3.314 lr:0.0000100 epoch_Time:24817.0min: [2024-01-05 18:27:11,220][model8_pretrain.py][INFO] Epoch:[0/2](686700/4588595) loss:2.431 lr:0.0000100 epoch_Time:24816.0min: [2024-01-05 18:27:11,220][model8_pretrain.py][INFO] Epoch:[0/2](686700/4588595) loss:3.200 lr:0.0000100 
epoch_Time:24816.0min: [2024-01-05 18:27:11,220][model8_pretrain.py][INFO] Epoch:[0/2](686700/4588595) loss:2.422 lr:0.0000100 epoch_Time:24816.0min: [2024-01-05 18:27:11,220][model8_pretrain.py][INFO] Epoch:[0/2](686700/4588595) loss:2.809 lr:0.0000100 epoch_Time:24816.0min: [2024-01-05 18:27:11,220][model8_pretrain.py][INFO] Epoch:[0/2](686700/4588595) loss:1.731 lr:0.0000100 epoch_Time:24816.0min: [2024-01-05 18:27:11,220][model8_pretrain.py][INFO] Epoch:[0/2](686700/4588595) loss:2.450 lr:0.0000100 epoch_Time:24816.0min: [2024-01-05 18:27:11,220][model8_pretrain.py][INFO] Epoch:[0/2](686700/4588595) loss:2.946 lr:0.0000100 epoch_Time:24816.0min: [2024-01-05 18:27:11,221][model8_pretrain.py][INFO] Epoch:[0/2](686700/4588595) loss:3.004 lr:0.0000100 epoch_Time:24816.0min: [2024-01-05 18:27:48,173][model8_pretrain.py][INFO] Epoch:[0/2](686800/4588595) loss:2.671 lr:0.0000100 epoch_Time:24815.0min: [2024-01-05 18:27:48,173][model8_pretrain.py][INFO] Epoch:[0/2](686800/4588595) loss:2.870 lr:0.0000100 epoch_Time:24815.0min: [2024-01-05 18:27:48,173][model8_pretrain.py][INFO] Epoch:[0/2](686800/4588595) loss:2.472 lr:0.0000100 epoch_Time:24815.0min: [2024-01-05 18:27:48,173][model8_pretrain.py][INFO] Epoch:[0/2](686800/4588595) loss:2.714 lr:0.0000100 epoch_Time:24815.0min: [2024-01-05 18:27:48,173][model8_pretrain.py][INFO] Epoch:[0/2](686800/4588595) loss:2.423 lr:0.0000100 epoch_Time:24815.0min: [2024-01-05 18:27:48,173][model8_pretrain.py][INFO] Epoch:[0/2](686800/4588595) loss:2.859 lr:0.0000100 epoch_Time:24815.0min: [2024-01-05 18:27:48,173][model8_pretrain.py][INFO] Epoch:[0/2](686800/4588595) loss:2.843 lr:0.0000100 epoch_Time:24815.0min: [2024-01-05 18:27:48,173][model8_pretrain.py][INFO] Epoch:[0/2](686800/4588595) loss:2.310 lr:0.0000100 epoch_Time:24815.0min: [2024-01-05 18:28:25,117][model8_pretrain.py][INFO] Epoch:[0/2](686900/4588595) loss:3.024 lr:0.0000100 epoch_Time:24815.0min: [2024-01-05 18:28:25,117][model8_pretrain.py][INFO] 
Epoch:[0/2](686900/4588595) loss:3.099 lr:0.0000100 epoch_Time:24815.0min: [2024-01-05 18:28:25,117][model8_pretrain.py][INFO] Epoch:[0/2](686900/4588595) loss:3.338 lr:0.0000100 epoch_Time:24815.0min: [2024-01-05 18:28:25,117][model8_pretrain.py][INFO] Epoch:[0/2](686900/4588595) loss:2.798 lr:0.0000100 epoch_Time:24815.0min: [2024-01-05 18:28:25,117][model8_pretrain.py][INFO] Epoch:[0/2](686900/4588595) loss:2.988 lr:0.0000100 epoch_Time:24815.0min: [2024-01-05 18:28:25,117][model8_pretrain.py][INFO] Epoch:[0/2](686900/4588595) loss:1.978 lr:0.0000100 epoch_Time:24815.0min: [2024-01-05 18:28:25,117][model8_pretrain.py][INFO] Epoch:[0/2](686900/4588595) loss:2.272 lr:0.0000100 epoch_Time:24815.0min: [2024-01-05 18:28:25,117][model8_pretrain.py][INFO] Epoch:[0/2](686900/4588595) loss:2.912 lr:0.0000100 epoch_Time:24815.0min: [2024-01-05 18:29:14,055][model8_pretrain.py][INFO] Epoch:[0/2](687000/4588595) loss:2.789 lr:0.0000100 epoch_Time:24815.0min: [2024-01-05 18:29:14,055][model8_pretrain.py][INFO] Epoch:[0/2](687000/4588595) loss:2.934 lr:0.0000100 epoch_Time:24815.0min: [2024-01-05 18:29:14,055][model8_pretrain.py][INFO] Epoch:[0/2](687000/4588595) loss:2.183 lr:0.0000100 epoch_Time:24815.0min: [2024-01-05 18:29:14,055][model8_pretrain.py][INFO] Epoch:[0/2](687000/4588595) loss:2.880 lr:0.0000100 epoch_Time:24815.0min: [2024-01-05 18:29:14,055][model8_pretrain.py][INFO] Epoch:[0/2](687000/4588595) loss:2.509 lr:0.0000100 epoch_Time:24815.0min: [2024-01-05 18:29:14,055][model8_pretrain.py][INFO] Epoch:[0/2](687000/4588595) loss:2.230 lr:0.0000100 epoch_Time:24815.0min: [2024-01-05 18:29:14,055][model8_pretrain.py][INFO] Epoch:[0/2](687000/4588595) loss:3.005 lr:0.0000100 epoch_Time:24815.0min: [2024-01-05 18:29:14,055][model8_pretrain.py][INFO] Epoch:[0/2](687000/4588595) loss:3.055 lr:0.0000100 epoch_Time:24815.0min: [2024-01-05 18:29:50,991][model8_pretrain.py][INFO] Epoch:[0/2](687100/4588595) loss:2.812 lr:0.0000100 epoch_Time:24814.0min: [2024-01-05 
18:29:50,991][model8_pretrain.py][INFO] Epoch:[0/2](687100/4588595) loss:2.406 lr:0.0000100 epoch_Time:24814.0min: [2024-01-05 18:29:50,991][model8_pretrain.py][INFO] Epoch:[0/2](687100/4588595) loss:2.274 lr:0.0000100 epoch_Time:24814.0min: [2024-01-05 18:29:50,991][model8_pretrain.py][INFO] Epoch:[0/2](687100/4588595) loss:2.569 lr:0.0000100 epoch_Time:24814.0min: [2024-01-05 18:29:50,991][model8_pretrain.py][INFO] Epoch:[0/2](687100/4588595) loss:2.768 lr:0.0000100 epoch_Time:24814.0min: [2024-01-05 18:29:50,991][model8_pretrain.py][INFO] Epoch:[0/2](687100/4588595) loss:3.102 lr:0.0000100 epoch_Time:24814.0min: [2024-01-05 18:29:50,991][model8_pretrain.py][INFO] Epoch:[0/2](687100/4588595) loss:2.726 lr:0.0000100 epoch_Time:24814.0min: [2024-01-05 18:29:50,991][model8_pretrain.py][INFO] Epoch:[0/2](687100/4588595) loss:2.400 lr:0.0000100 epoch_Time:24814.0min: [2024-01-05 18:30:27,931][model8_pretrain.py][INFO] Epoch:[0/2](687200/4588595) loss:2.166 lr:0.0000100 epoch_Time:24813.0min: [2024-01-05 18:30:27,931][model8_pretrain.py][INFO] Epoch:[0/2](687200/4588595) loss:3.166 lr:0.0000100 epoch_Time:24813.0min: [2024-01-05 18:30:27,931][model8_pretrain.py][INFO] Epoch:[0/2](687200/4588595) loss:3.323 lr:0.0000100 epoch_Time:24813.0min: [2024-01-05 18:30:27,931][model8_pretrain.py][INFO] Epoch:[0/2](687200/4588595) loss:2.668 lr:0.0000100 epoch_Time:24813.0min: [2024-01-05 18:30:27,931][model8_pretrain.py][INFO] Epoch:[0/2](687200/4588595) loss:3.195 lr:0.0000100 epoch_Time:24813.0min: [2024-01-05 18:30:27,931][model8_pretrain.py][INFO] Epoch:[0/2](687200/4588595) loss:3.086 lr:0.0000100 epoch_Time:24813.0min: [2024-01-05 18:30:27,931][model8_pretrain.py][INFO] Epoch:[0/2](687200/4588595) loss:2.711 lr:0.0000100 epoch_Time:24813.0min: [2024-01-05 18:30:27,932][model8_pretrain.py][INFO] Epoch:[0/2](687200/4588595) loss:2.618 lr:0.0000100 epoch_Time:24813.0min: [2024-01-05 18:31:04,878][model8_pretrain.py][INFO] Epoch:[0/2](687300/4588595) loss:2.314 lr:0.0000100 
epoch_Time:24812.0min: [2024-01-05 18:31:04,878][model8_pretrain.py][INFO] Epoch:[0/2](687300/4588595) loss:2.505 lr:0.0000100 epoch_Time:24812.0min: [2024-01-05 18:31:04,878][model8_pretrain.py][INFO] Epoch:[0/2](687300/4588595) loss:3.211 lr:0.0000100 epoch_Time:24812.0min: [2024-01-05 18:31:04,878][model8_pretrain.py][INFO] Epoch:[0/2](687300/4588595) loss:3.045 lr:0.0000100 epoch_Time:24812.0min: [2024-01-05 18:31:04,878][model8_pretrain.py][INFO] Epoch:[0/2](687300/4588595) loss:2.697 lr:0.0000100 epoch_Time:24812.0min: [2024-01-05 18:31:04,878][model8_pretrain.py][INFO] Epoch:[0/2](687300/4588595) loss:2.781 lr:0.0000100 epoch_Time:24812.0min: [2024-01-05 18:31:04,878][model8_pretrain.py][INFO] Epoch:[0/2](687300/4588595) loss:3.289 lr:0.0000100 epoch_Time:24812.0min: [2024-01-05 18:31:04,878][model8_pretrain.py][INFO] Epoch:[0/2](687300/4588595) loss:2.885 lr:0.0000100 epoch_Time:24812.0min: [2024-01-05 18:31:41,819][model8_pretrain.py][INFO] Epoch:[0/2](687400/4588595) loss:2.700 lr:0.0000100 epoch_Time:24812.0min: [2024-01-05 18:31:41,819][model8_pretrain.py][INFO] Epoch:[0/2](687400/4588595) loss:3.050 lr:0.0000100 epoch_Time:24812.0min: [2024-01-05 18:31:41,819][model8_pretrain.py][INFO] Epoch:[0/2](687400/4588595) loss:3.074 lr:0.0000100 epoch_Time:24812.0min: [2024-01-05 18:31:41,819][model8_pretrain.py][INFO] Epoch:[0/2](687400/4588595) loss:2.563 lr:0.0000100 epoch_Time:24812.0min: [2024-01-05 18:31:41,819][model8_pretrain.py][INFO] Epoch:[0/2](687400/4588595) loss:3.007 lr:0.0000100 epoch_Time:24812.0min: [2024-01-05 18:31:41,819][model8_pretrain.py][INFO] Epoch:[0/2](687400/4588595) loss:2.742 lr:0.0000100 epoch_Time:24812.0min: [2024-01-05 18:31:41,819][model8_pretrain.py][INFO] Epoch:[0/2](687400/4588595) loss:3.141 lr:0.0000100 epoch_Time:24812.0min: [2024-01-05 18:31:41,819][model8_pretrain.py][INFO] Epoch:[0/2](687400/4588595) loss:3.120 lr:0.0000100 epoch_Time:24812.0min: [2024-01-05 18:32:18,771][model8_pretrain.py][INFO] 
Epoch:[0/2](687500/4588595) loss:2.790 lr:0.0000100 epoch_Time:24811.0min: [2024-01-05 18:32:18,771][model8_pretrain.py][INFO] Epoch:[0/2](687500/4588595) loss:3.360 lr:0.0000100 epoch_Time:24811.0min: [2024-01-05 18:32:18,771][model8_pretrain.py][INFO] Epoch:[0/2](687500/4588595) loss:2.509 lr:0.0000100 epoch_Time:24811.0min: [2024-01-05 18:32:18,771][model8_pretrain.py][INFO] Epoch:[0/2](687500/4588595) loss:3.141 lr:0.0000100 epoch_Time:24811.0min: [2024-01-05 18:32:18,771][model8_pretrain.py][INFO] Epoch:[0/2](687500/4588595) loss:2.525 lr:0.0000100 epoch_Time:24811.0min: [2024-01-05 18:32:18,771][model8_pretrain.py][INFO] Epoch:[0/2](687500/4588595) loss:3.222 lr:0.0000100 epoch_Time:24811.0min: [2024-01-05 18:32:18,771][model8_pretrain.py][INFO] Epoch:[0/2](687500/4588595) loss:3.245 lr:0.0000100 epoch_Time:24811.0min: [2024-01-05 18:32:18,771][model8_pretrain.py][INFO] Epoch:[0/2](687500/4588595) loss:2.887 lr:0.0000100 epoch_Time:24811.0min: [2024-01-05 18:32:55,718][model8_pretrain.py][INFO] Epoch:[0/2](687600/4588595) loss:2.721 lr:0.0000100 epoch_Time:24810.0min: [2024-01-05 18:32:55,718][model8_pretrain.py][INFO] Epoch:[0/2](687600/4588595) loss:2.459 lr:0.0000100 epoch_Time:24810.0min: [2024-01-05 18:32:55,718][model8_pretrain.py][INFO] Epoch:[0/2](687600/4588595) loss:2.767 lr:0.0000100 epoch_Time:24810.0min: [2024-01-05 18:32:55,718][model8_pretrain.py][INFO] Epoch:[0/2](687600/4588595) loss:2.719 lr:0.0000100 epoch_Time:24810.0min: [2024-01-05 18:32:55,718][model8_pretrain.py][INFO] Epoch:[0/2](687600/4588595) loss:2.518 lr:0.0000100 epoch_Time:24810.0min: [2024-01-05 18:32:55,718][model8_pretrain.py][INFO] Epoch:[0/2](687600/4588595) loss:2.691 lr:0.0000100 epoch_Time:24810.0min: [2024-01-05 18:32:55,718][model8_pretrain.py][INFO] Epoch:[0/2](687600/4588595) loss:3.177 lr:0.0000100 epoch_Time:24810.0min: [2024-01-05 18:32:55,718][model8_pretrain.py][INFO] Epoch:[0/2](687600/4588595) loss:2.832 lr:0.0000100 epoch_Time:24810.0min: [2024-01-05 
18:33:32,653][model8_pretrain.py][INFO] Epoch:[0/2](687700/4588595) loss:2.347 lr:0.0000100 epoch_Time:24810.0min: [2024-01-05 18:33:32,653][model8_pretrain.py][INFO] Epoch:[0/2](687700/4588595) loss:3.123 lr:0.0000100 epoch_Time:24810.0min: [2024-01-05 18:33:32,653][model8_pretrain.py][INFO] Epoch:[0/2](687700/4588595) loss:3.030 lr:0.0000100 epoch_Time:24810.0min: [2024-01-05 18:33:32,653][model8_pretrain.py][INFO] Epoch:[0/2](687700/4588595) loss:2.163 lr:0.0000100 epoch_Time:24810.0min: [2024-01-05 18:33:32,653][model8_pretrain.py][INFO] Epoch:[0/2](687700/4588595) loss:2.819 lr:0.0000100 epoch_Time:24810.0min: [2024-01-05 18:33:32,653][model8_pretrain.py][INFO] Epoch:[0/2](687700/4588595) loss:3.417 lr:0.0000100 epoch_Time:24810.0min: [2024-01-05 18:33:32,653][model8_pretrain.py][INFO] Epoch:[0/2](687700/4588595) loss:2.735 lr:0.0000100 epoch_Time:24810.0min: [2024-01-05 18:33:32,654][model8_pretrain.py][INFO] Epoch:[0/2](687700/4588595) loss:2.599 lr:0.0000100 epoch_Time:24810.0min: [2024-01-05 18:34:21,470][model8_pretrain.py][INFO] Epoch:[0/2](687800/4588595) loss:2.934 lr:0.0000100 epoch_Time:24810.0min: [2024-01-05 18:34:21,470][model8_pretrain.py][INFO] Epoch:[0/2](687800/4588595) loss:2.674 lr:0.0000100 epoch_Time:24810.0min: [2024-01-05 18:34:21,470][model8_pretrain.py][INFO] Epoch:[0/2](687800/4588595) loss:2.864 lr:0.0000100 epoch_Time:24810.0min: [2024-01-05 18:34:21,470][model8_pretrain.py][INFO] Epoch:[0/2](687800/4588595) loss:2.706 lr:0.0000100 epoch_Time:24810.0min: [2024-01-05 18:34:21,470][model8_pretrain.py][INFO] Epoch:[0/2](687800/4588595) loss:3.096 lr:0.0000100 epoch_Time:24810.0min: [2024-01-05 18:34:21,470][model8_pretrain.py][INFO] Epoch:[0/2](687800/4588595) loss:2.785 lr:0.0000100 epoch_Time:24810.0min: [2024-01-05 18:34:21,470][model8_pretrain.py][INFO] Epoch:[0/2](687800/4588595) loss:2.378 lr:0.0000100 epoch_Time:24810.0min: [2024-01-05 18:34:21,470][model8_pretrain.py][INFO] Epoch:[0/2](687800/4588595) loss:2.231 lr:0.0000100 
epoch_Time:24810.0min: [2024-01-05 18:34:58,396][model8_pretrain.py][INFO] Epoch:[0/2](687900/4588595) loss:2.367 lr:0.0000100 epoch_Time:24809.0min: [2024-01-05 18:34:58,396][model8_pretrain.py][INFO] Epoch:[0/2](687900/4588595) loss:2.524 lr:0.0000100 epoch_Time:24809.0min: [2024-01-05 18:34:58,396][model8_pretrain.py][INFO] Epoch:[0/2](687900/4588595) loss:2.709 lr:0.0000100 epoch_Time:24809.0min: [2024-01-05 18:34:58,396][model8_pretrain.py][INFO] Epoch:[0/2](687900/4588595) loss:3.085 lr:0.0000100 epoch_Time:24809.0min: [2024-01-05 18:34:58,396][model8_pretrain.py][INFO] Epoch:[0/2](687900/4588595) loss:3.273 lr:0.0000100 epoch_Time:24809.0min: [2024-01-05 18:34:58,396][model8_pretrain.py][INFO] Epoch:[0/2](687900/4588595) loss:2.836 lr:0.0000100 epoch_Time:24809.0min: [2024-01-05 18:34:58,396][model8_pretrain.py][INFO] Epoch:[0/2](687900/4588595) loss:3.074 lr:0.0000100 epoch_Time:24809.0min: [2024-01-05 18:34:58,396][model8_pretrain.py][INFO] Epoch:[0/2](687900/4588595) loss:2.285 lr:0.0000100 epoch_Time:24809.0min: [2024-01-05 18:35:35,326][model8_pretrain.py][INFO] Epoch:[0/2](688000/4588595) loss:3.127 lr:0.0000100 epoch_Time:24809.0min: [2024-01-05 18:35:35,326][model8_pretrain.py][INFO] Epoch:[0/2](688000/4588595) loss:2.771 lr:0.0000100 epoch_Time:24809.0min: [2024-01-05 18:35:35,326][model8_pretrain.py][INFO] Epoch:[0/2](688000/4588595) loss:2.986 lr:0.0000100 epoch_Time:24809.0min: [2024-01-05 18:35:35,326][model8_pretrain.py][INFO] Epoch:[0/2](688000/4588595) loss:2.660 lr:0.0000100 epoch_Time:24809.0min: [2024-01-05 18:35:35,327][model8_pretrain.py][INFO] Epoch:[0/2](688000/4588595) loss:2.656 lr:0.0000100 epoch_Time:24809.0min: [2024-01-05 18:35:35,327][model8_pretrain.py][INFO] Epoch:[0/2](688000/4588595) loss:2.961 lr:0.0000100 epoch_Time:24809.0min: [2024-01-05 18:35:35,327][model8_pretrain.py][INFO] Epoch:[0/2](688000/4588595) loss:2.446 lr:0.0000100 epoch_Time:24809.0min: [2024-01-05 18:35:35,327][model8_pretrain.py][INFO] 
Epoch:[0/2](688000/4588595) loss:3.097 lr:0.0000100 epoch_Time:24809.0min: [2024-01-05 18:36:12,270][model8_pretrain.py][INFO] Epoch:[0/2](688100/4588595) loss:3.205 lr:0.0000100 epoch_Time:24808.0min: [2024-01-05 18:36:12,270][model8_pretrain.py][INFO] Epoch:[0/2](688100/4588595) loss:2.901 lr:0.0000100 epoch_Time:24808.0min: [2024-01-05 18:36:12,270][model8_pretrain.py][INFO] Epoch:[0/2](688100/4588595) loss:2.464 lr:0.0000100 epoch_Time:24808.0min: [2024-01-05 18:36:12,270][model8_pretrain.py][INFO] Epoch:[0/2](688100/4588595) loss:2.368 lr:0.0000100 epoch_Time:24808.0min: [2024-01-05 18:36:12,270][model8_pretrain.py][INFO] Epoch:[0/2](688100/4588595) loss:2.548 lr:0.0000100 epoch_Time:24808.0min: [2024-01-05 18:36:12,270][model8_pretrain.py][INFO] Epoch:[0/2](688100/4588595) loss:2.151 lr:0.0000100 epoch_Time:24808.0min: [2024-01-05 18:36:12,270][model8_pretrain.py][INFO] Epoch:[0/2](688100/4588595) loss:2.635 lr:0.0000100 epoch_Time:24808.0min: [2024-01-05 18:36:12,270][model8_pretrain.py][INFO] Epoch:[0/2](688100/4588595) loss:2.902 lr:0.0000100 epoch_Time:24808.0min: [2024-01-05 18:36:49,234][model8_pretrain.py][INFO] Epoch:[0/2](688200/4588595) loss:2.724 lr:0.0000100 epoch_Time:24806.0min: [2024-01-05 18:36:49,234][model8_pretrain.py][INFO] Epoch:[0/2](688200/4588595) loss:2.936 lr:0.0000100 epoch_Time:24806.0min: [2024-01-05 18:36:49,234][model8_pretrain.py][INFO] Epoch:[0/2](688200/4588595) loss:3.104 lr:0.0000100 epoch_Time:24806.0min: [2024-01-05 18:36:49,234][model8_pretrain.py][INFO] Epoch:[0/2](688200/4588595) loss:2.933 lr:0.0000100 epoch_Time:24806.0min: [2024-01-05 18:36:49,234][model8_pretrain.py][INFO] Epoch:[0/2](688200/4588595) loss:3.202 lr:0.0000100 epoch_Time:24806.0min: [2024-01-05 18:36:49,234][model8_pretrain.py][INFO] Epoch:[0/2](688200/4588595) loss:2.362 lr:0.0000100 epoch_Time:24806.0min: [2024-01-05 18:36:49,234][model8_pretrain.py][INFO] Epoch:[0/2](688200/4588595) loss:2.803 lr:0.0000100 epoch_Time:24806.0min: [2024-01-05 
18:36:49,234][model8_pretrain.py][INFO] Epoch:[0/2](688200/4588595) loss:2.356 lr:0.0000100 epoch_Time:24806.0min: [2024-01-05 18:37:26,237][model8_pretrain.py][INFO] Epoch:[0/2](688300/4588595) loss:2.774 lr:0.0000100 epoch_Time:24806.0min: [2024-01-05 18:37:26,237][model8_pretrain.py][INFO] Epoch:[0/2](688300/4588595) loss:2.994 lr:0.0000100 epoch_Time:24806.0min: [2024-01-05 18:37:26,237][model8_pretrain.py][INFO] Epoch:[0/2](688300/4588595) loss:2.613 lr:0.0000100 epoch_Time:24806.0min: [2024-01-05 18:37:26,237][model8_pretrain.py][INFO] Epoch:[0/2](688300/4588595) loss:3.246 lr:0.0000100 epoch_Time:24806.0min: [2024-01-05 18:37:26,237][model8_pretrain.py][INFO] Epoch:[0/2](688300/4588595) loss:2.889 lr:0.0000100 epoch_Time:24806.0min: [2024-01-05 18:37:26,237][model8_pretrain.py][INFO] Epoch:[0/2](688300/4588595) loss:2.937 lr:0.0000100 epoch_Time:24806.0min: [2024-01-05 18:37:26,237][model8_pretrain.py][INFO] Epoch:[0/2](688300/4588595) loss:2.561 lr:0.0000100 epoch_Time:24806.0min: [2024-01-05 18:37:26,237][model8_pretrain.py][INFO] Epoch:[0/2](688300/4588595) loss:2.695 lr:0.0000100 epoch_Time:24806.0min: [2024-01-05 18:38:03,253][model8_pretrain.py][INFO] Epoch:[0/2](688400/4588595) loss:2.955 lr:0.0000100 epoch_Time:24805.0min: [2024-01-05 18:38:03,253][model8_pretrain.py][INFO] Epoch:[0/2](688400/4588595) loss:1.640 lr:0.0000100 epoch_Time:24805.0min: [2024-01-05 18:38:03,253][model8_pretrain.py][INFO] Epoch:[0/2](688400/4588595) loss:3.148 lr:0.0000100 epoch_Time:24805.0min: [2024-01-05 18:38:03,253][model8_pretrain.py][INFO] Epoch:[0/2](688400/4588595) loss:3.348 lr:0.0000100 epoch_Time:24805.0min: [2024-01-05 18:38:03,253][model8_pretrain.py][INFO] Epoch:[0/2](688400/4588595) loss:2.891 lr:0.0000100 epoch_Time:24805.0min: [2024-01-05 18:38:03,253][model8_pretrain.py][INFO] Epoch:[0/2](688400/4588595) loss:2.830 lr:0.0000100 epoch_Time:24805.0min: [2024-01-05 18:38:03,253][model8_pretrain.py][INFO] Epoch:[0/2](688400/4588595) loss:3.043 lr:0.0000100 
epoch_Time:24805.0min: [2024-01-05 18:38:03,254][model8_pretrain.py][INFO] Epoch:[0/2](688400/4588595) loss:2.634 lr:0.0000100 epoch_Time:24805.0min: [2024-01-05 18:38:40,187][model8_pretrain.py][INFO] Epoch:[0/2](688500/4588595) loss:3.254 lr:0.0000100 epoch_Time:24805.0min: [2024-01-05 18:38:40,187][model8_pretrain.py][INFO] Epoch:[0/2](688500/4588595) loss:2.339 lr:0.0000100 epoch_Time:24805.0min: [2024-01-05 18:38:40,187][model8_pretrain.py][INFO] Epoch:[0/2](688500/4588595) loss:2.446 lr:0.0000100 epoch_Time:24805.0min: [2024-01-05 18:38:40,187][model8_pretrain.py][INFO] Epoch:[0/2](688500/4588595) loss:2.537 lr:0.0000100 epoch_Time:24805.0min: [2024-01-05 18:38:40,187][model8_pretrain.py][INFO] Epoch:[0/2](688500/4588595) loss:3.330 lr:0.0000100 epoch_Time:24805.0min: [2024-01-05 18:38:40,187][model8_pretrain.py][INFO] Epoch:[0/2](688500/4588595) loss:3.205 lr:0.0000100 epoch_Time:24805.0min: [2024-01-05 18:38:40,187][model8_pretrain.py][INFO] Epoch:[0/2](688500/4588595) loss:2.825 lr:0.0000100 epoch_Time:24805.0min: [2024-01-05 18:38:40,187][model8_pretrain.py][INFO] Epoch:[0/2](688500/4588595) loss:3.349 lr:0.0000100 epoch_Time:24805.0min: [2024-01-05 18:39:29,164][model8_pretrain.py][INFO] Epoch:[0/2](688600/4588595) loss:2.931 lr:0.0000100 epoch_Time:24805.0min: [2024-01-05 18:39:29,164][model8_pretrain.py][INFO] Epoch:[0/2](688600/4588595) loss:2.596 lr:0.0000100 epoch_Time:24805.0min: [2024-01-05 18:39:29,164][model8_pretrain.py][INFO] Epoch:[0/2](688600/4588595) loss:2.990 lr:0.0000100 epoch_Time:24805.0min: [2024-01-05 18:39:29,164][model8_pretrain.py][INFO] Epoch:[0/2](688600/4588595) loss:2.687 lr:0.0000100 epoch_Time:24805.0min: [2024-01-05 18:39:29,164][model8_pretrain.py][INFO] Epoch:[0/2](688600/4588595) loss:2.931 lr:0.0000100 epoch_Time:24805.0min: [2024-01-05 18:39:29,164][model8_pretrain.py][INFO] Epoch:[0/2](688600/4588595) loss:2.422 lr:0.0000100 epoch_Time:24805.0min: [2024-01-05 18:39:29,164][model8_pretrain.py][INFO] 
Epoch:[0/2](688600/4588595) loss:2.668 lr:0.0000100 epoch_Time:24805.0min: [2024-01-05 18:39:29,165][model8_pretrain.py][INFO] Epoch:[0/2](688600/4588595) loss:1.948 lr:0.0000100 epoch_Time:24805.0min: [2024-01-05 18:40:06,091][model8_pretrain.py][INFO] Epoch:[0/2](688700/4588595) loss:2.292 lr:0.0000100 epoch_Time:24804.0min: [2024-01-05 18:40:06,091][model8_pretrain.py][INFO] Epoch:[0/2](688700/4588595) loss:2.817 lr:0.0000100 epoch_Time:24804.0min: [2024-01-05 18:40:06,091][model8_pretrain.py][INFO] Epoch:[0/2](688700/4588595) loss:2.707 lr:0.0000100 epoch_Time:24804.0min: [2024-01-05 18:40:06,091][model8_pretrain.py][INFO] Epoch:[0/2](688700/4588595) loss:3.118 lr:0.0000100 epoch_Time:24804.0min: [2024-01-05 18:40:06,091][model8_pretrain.py][INFO] Epoch:[0/2](688700/4588595) loss:3.144 lr:0.0000100 epoch_Time:24804.0min: [2024-01-05 18:40:06,091][model8_pretrain.py][INFO] Epoch:[0/2](688700/4588595) loss:3.123 lr:0.0000100 epoch_Time:24804.0min: [2024-01-05 18:40:06,091][model8_pretrain.py][INFO] Epoch:[0/2](688700/4588595) loss:2.746 lr:0.0000100 epoch_Time:24804.0min: [2024-01-05 18:40:06,091][model8_pretrain.py][INFO] Epoch:[0/2](688700/4588595) loss:2.918 lr:0.0000100 epoch_Time:24804.0min: [2024-01-05 18:40:43,035][model8_pretrain.py][INFO] Epoch:[0/2](688800/4588595) loss:3.188 lr:0.0000100 epoch_Time:24804.0min: [2024-01-05 18:40:43,035][model8_pretrain.py][INFO] Epoch:[0/2](688800/4588595) loss:2.964 lr:0.0000100 epoch_Time:24804.0min: [2024-01-05 18:40:43,035][model8_pretrain.py][INFO] Epoch:[0/2](688800/4588595) loss:2.611 lr:0.0000100 epoch_Time:24804.0min: [2024-01-05 18:40:43,035][model8_pretrain.py][INFO] Epoch:[0/2](688800/4588595) loss:2.857 lr:0.0000100 epoch_Time:24804.0min: [2024-01-05 18:40:43,035][model8_pretrain.py][INFO] Epoch:[0/2](688800/4588595) loss:2.727 lr:0.0000100 epoch_Time:24804.0min: [2024-01-05 18:40:43,035][model8_pretrain.py][INFO] Epoch:[0/2](688800/4588595) loss:2.901 lr:0.0000100 epoch_Time:24804.0min: [2024-01-05 
18:40:43,035][model8_pretrain.py][INFO] Epoch:[0/2](688800/4588595) loss:1.905 lr:0.0000100 epoch_Time:24804.0min: [2024-01-05 18:40:43,035][model8_pretrain.py][INFO] Epoch:[0/2](688800/4588595) loss:2.806 lr:0.0000100 epoch_Time:24804.0min: [2024-01-05 18:41:19,983][model8_pretrain.py][INFO] Epoch:[0/2](688900/4588595) loss:2.610 lr:0.0000100 epoch_Time:24803.0min: [2024-01-05 18:41:19,983][model8_pretrain.py][INFO] Epoch:[0/2](688900/4588595) loss:2.847 lr:0.0000100 epoch_Time:24803.0min: [2024-01-05 18:41:19,983][model8_pretrain.py][INFO] Epoch:[0/2](688900/4588595) loss:3.142 lr:0.0000100 epoch_Time:24803.0min: [2024-01-05 18:41:19,983][model8_pretrain.py][INFO] Epoch:[0/2](688900/4588595) loss:2.365 lr:0.0000100 epoch_Time:24803.0min: [2024-01-05 18:41:19,983][model8_pretrain.py][INFO] Epoch:[0/2](688900/4588595) loss:3.245 lr:0.0000100 epoch_Time:24803.0min: [2024-01-05 18:41:19,983][model8_pretrain.py][INFO] Epoch:[0/2](688900/4588595) loss:2.664 lr:0.0000100 epoch_Time:24803.0min: [2024-01-05 18:41:19,983][model8_pretrain.py][INFO] Epoch:[0/2](688900/4588595) loss:2.888 lr:0.0000100 epoch_Time:24803.0min: [2024-01-05 18:41:19,983][model8_pretrain.py][INFO] Epoch:[0/2](688900/4588595) loss:2.632 lr:0.0000100 epoch_Time:24803.0min: [2024-01-05 18:41:56,930][model8_pretrain.py][INFO] Epoch:[0/2](689000/4588595) loss:2.943 lr:0.0000100 epoch_Time:24802.0min: [2024-01-05 18:41:56,930][model8_pretrain.py][INFO] Epoch:[0/2](689000/4588595) loss:2.426 lr:0.0000100 epoch_Time:24802.0min: [2024-01-05 18:41:56,930][model8_pretrain.py][INFO] Epoch:[0/2](689000/4588595) loss:2.697 lr:0.0000100 epoch_Time:24802.0min: [2024-01-05 18:41:56,930][model8_pretrain.py][INFO] Epoch:[0/2](689000/4588595) loss:3.089 lr:0.0000100 epoch_Time:24802.0min: [2024-01-05 18:41:56,930][model8_pretrain.py][INFO] Epoch:[0/2](689000/4588595) loss:2.232 lr:0.0000100 epoch_Time:24802.0min: [2024-01-05 18:41:56,930][model8_pretrain.py][INFO] Epoch:[0/2](689000/4588595) loss:2.663 lr:0.0000100 
epoch_Time:24802.0min: [2024-01-05 18:41:56,931][model8_pretrain.py][INFO] Epoch:[0/2](689000/4588595) loss:2.991 lr:0.0000100 epoch_Time:24802.0min: [2024-01-05 18:41:56,931][model8_pretrain.py][INFO] Epoch:[0/2](689000/4588595) loss:3.135 lr:0.0000100 epoch_Time:24802.0min: [2024-01-05 18:42:33,883][model8_pretrain.py][INFO] Epoch:[0/2](689100/4588595) loss:2.961 lr:0.0000100 epoch_Time:24802.0min: [2024-01-05 18:42:33,883][model8_pretrain.py][INFO] Epoch:[0/2](689100/4588595) loss:2.758 lr:0.0000100 epoch_Time:24802.0min: [2024-01-05 18:42:33,883][model8_pretrain.py][INFO] Epoch:[0/2](689100/4588595) loss:3.376 lr:0.0000100 epoch_Time:24802.0min: [2024-01-05 18:42:33,883][model8_pretrain.py][INFO] Epoch:[0/2](689100/4588595) loss:3.006 lr:0.0000100 epoch_Time:24802.0min: [2024-01-05 18:42:33,883][model8_pretrain.py][INFO] Epoch:[0/2](689100/4588595) loss:2.465 lr:0.0000100 epoch_Time:24802.0min: [2024-01-05 18:42:33,883][model8_pretrain.py][INFO] Epoch:[0/2](689100/4588595) loss:2.957 lr:0.0000100 epoch_Time:24802.0min: [2024-01-05 18:42:33,883][model8_pretrain.py][INFO] Epoch:[0/2](689100/4588595) loss:2.887 lr:0.0000100 epoch_Time:24802.0min: [2024-01-05 18:42:33,884][model8_pretrain.py][INFO] Epoch:[0/2](689100/4588595) loss:2.987 lr:0.0000100 epoch_Time:24802.0min: [2024-01-05 18:43:10,831][model8_pretrain.py][INFO] Epoch:[0/2](689200/4588595) loss:2.940 lr:0.0000100 epoch_Time:24800.0min: [2024-01-05 18:43:10,831][model8_pretrain.py][INFO] Epoch:[0/2](689200/4588595) loss:2.586 lr:0.0000100 epoch_Time:24800.0min: [2024-01-05 18:43:10,831][model8_pretrain.py][INFO] Epoch:[0/2](689200/4588595) loss:2.840 lr:0.0000100 epoch_Time:24800.0min: [2024-01-05 18:43:10,831][model8_pretrain.py][INFO] Epoch:[0/2](689200/4588595) loss:2.758 lr:0.0000100 epoch_Time:24800.0min: [2024-01-05 18:43:10,831][model8_pretrain.py][INFO] Epoch:[0/2](689200/4588595) loss:2.745 lr:0.0000100 epoch_Time:24800.0min: [2024-01-05 18:43:10,831][model8_pretrain.py][INFO] 
Epoch:[0/2](689200/4588595) loss:3.000 lr:0.0000100 epoch_Time:24800.0min: [2024-01-05 18:43:10,831][model8_pretrain.py][INFO] Epoch:[0/2](689200/4588595) loss:2.843 lr:0.0000100 epoch_Time:24800.0min: [2024-01-05 18:43:10,832][model8_pretrain.py][INFO] Epoch:[0/2](689200/4588595) loss:3.203 lr:0.0000100 epoch_Time:24800.0min: [2024-01-05 18:43:47,781][model8_pretrain.py][INFO] Epoch:[0/2](689300/4588595) loss:2.262 lr:0.0000100 epoch_Time:24799.0min: [2024-01-05 18:43:47,781][model8_pretrain.py][INFO] Epoch:[0/2](689300/4588595) loss:3.055 lr:0.0000100 epoch_Time:24799.0min: [2024-01-05 18:43:47,781][model8_pretrain.py][INFO] Epoch:[0/2](689300/4588595) loss:2.322 lr:0.0000100 epoch_Time:24799.0min: [2024-01-05 18:43:47,781][model8_pretrain.py][INFO] Epoch:[0/2](689300/4588595) loss:2.776 lr:0.0000100 epoch_Time:24799.0min: [2024-01-05 18:43:47,781][model8_pretrain.py][INFO] Epoch:[0/2](689300/4588595) loss:3.113 lr:0.0000100 epoch_Time:24799.0min: [2024-01-05 18:43:47,781][model8_pretrain.py][INFO] Epoch:[0/2](689300/4588595) loss:2.204 lr:0.0000100 epoch_Time:24799.0min: [2024-01-05 18:43:47,781][model8_pretrain.py][INFO] Epoch:[0/2](689300/4588595) loss:2.564 lr:0.0000100 epoch_Time:24799.0min: [2024-01-05 18:43:47,781][model8_pretrain.py][INFO] Epoch:[0/2](689300/4588595) loss:2.809 lr:0.0000100 epoch_Time:24799.0min: [2024-01-05 18:44:36,686][model8_pretrain.py][INFO] Epoch:[0/2](689400/4588595) loss:2.543 lr:0.0000100 epoch_Time:24800.0min: [2024-01-05 18:44:36,686][model8_pretrain.py][INFO] Epoch:[0/2](689400/4588595) loss:2.863 lr:0.0000100 epoch_Time:24800.0min: [2024-01-05 18:44:36,686][model8_pretrain.py][INFO] Epoch:[0/2](689400/4588595) loss:2.496 lr:0.0000100 epoch_Time:24800.0min: [2024-01-05 18:44:36,686][model8_pretrain.py][INFO] Epoch:[0/2](689400/4588595) loss:3.101 lr:0.0000100 epoch_Time:24800.0min: [2024-01-05 18:44:36,686][model8_pretrain.py][INFO] Epoch:[0/2](689400/4588595) loss:2.865 lr:0.0000100 epoch_Time:24800.0min: [2024-01-05 
18:44:36,686][model8_pretrain.py][INFO] Epoch:[0/2](689400/4588595) loss:3.004 lr:0.0000100 epoch_Time:24800.0min: [2024-01-05 18:44:36,686][model8_pretrain.py][INFO] Epoch:[0/2](689400/4588595) loss:2.757 lr:0.0000100 epoch_Time:24800.0min: [2024-01-05 18:44:36,686][model8_pretrain.py][INFO] Epoch:[0/2](689400/4588595) loss:2.441 lr:0.0000100 epoch_Time:24800.0min: [2024-01-05 18:45:13,580][model8_pretrain.py][INFO] Epoch:[0/2](689500/4588595) loss:2.931 lr:0.0000100 epoch_Time:24799.0min: [2024-01-05 18:45:13,580][model8_pretrain.py][INFO] Epoch:[0/2](689500/4588595) loss:2.355 lr:0.0000100 epoch_Time:24799.0min: [2024-01-05 18:45:13,580][model8_pretrain.py][INFO] Epoch:[0/2](689500/4588595) loss:2.974 lr:0.0000100 epoch_Time:24799.0min: [2024-01-05 18:45:13,580][model8_pretrain.py][INFO] Epoch:[0/2](689500/4588595) loss:2.116 lr:0.0000100 epoch_Time:24799.0min: [2024-01-05 18:45:13,580][model8_pretrain.py][INFO] Epoch:[0/2](689500/4588595) loss:2.653 lr:0.0000100 epoch_Time:24799.0min: [2024-01-05 18:45:13,580][model8_pretrain.py][INFO] Epoch:[0/2](689500/4588595) loss:2.869 lr:0.0000100 epoch_Time:24799.0min: [2024-01-05 18:45:13,580][model8_pretrain.py][INFO] Epoch:[0/2](689500/4588595) loss:3.064 lr:0.0000100 epoch_Time:24799.0min: [2024-01-05 18:45:13,580][model8_pretrain.py][INFO] Epoch:[0/2](689500/4588595) loss:2.607 lr:0.0000100 epoch_Time:24799.0min: [2024-01-05 18:45:50,510][model8_pretrain.py][INFO] Epoch:[0/2](689600/4588595) loss:2.865 lr:0.0000100 epoch_Time:24798.0min: [2024-01-05 18:45:50,510][model8_pretrain.py][INFO] Epoch:[0/2](689600/4588595) loss:2.361 lr:0.0000100 epoch_Time:24798.0min: [2024-01-05 18:45:50,510][model8_pretrain.py][INFO] Epoch:[0/2](689600/4588595) loss:2.700 lr:0.0000100 epoch_Time:24798.0min: [2024-01-05 18:45:50,510][model8_pretrain.py][INFO] Epoch:[0/2](689600/4588595) loss:2.846 lr:0.0000100 epoch_Time:24798.0min: [2024-01-05 18:45:50,510][model8_pretrain.py][INFO] Epoch:[0/2](689600/4588595) loss:3.020 lr:0.0000100 
epoch_Time:24798.0min: [2024-01-05 18:45:50,510][model8_pretrain.py][INFO] Epoch:[0/2](689600/4588595) loss:3.002 lr:0.0000100 epoch_Time:24798.0min: [2024-01-05 18:45:50,510][model8_pretrain.py][INFO] Epoch:[0/2](689600/4588595) loss:3.095 lr:0.0000100 epoch_Time:24798.0min: [2024-01-05 18:45:50,511][model8_pretrain.py][INFO] Epoch:[0/2](689600/4588595) loss:2.858 lr:0.0000100 epoch_Time:24798.0min: [2024-01-05 18:46:27,444][model8_pretrain.py][INFO] Epoch:[0/2](689700/4588595) loss:2.380 lr:0.0000100 epoch_Time:24798.0min: [2024-01-05 18:46:27,444][model8_pretrain.py][INFO] Epoch:[0/2](689700/4588595) loss:2.349 lr:0.0000100 epoch_Time:24798.0min: [2024-01-05 18:46:27,444][model8_pretrain.py][INFO] Epoch:[0/2](689700/4588595) loss:3.137 lr:0.0000100 epoch_Time:24798.0min: [2024-01-05 18:46:27,445][model8_pretrain.py][INFO] Epoch:[0/2](689700/4588595) loss:3.019 lr:0.0000100 epoch_Time:24798.0min: [2024-01-05 18:46:27,445][model8_pretrain.py][INFO] Epoch:[0/2](689700/4588595) loss:2.845 lr:0.0000100 epoch_Time:24798.0min: [2024-01-05 18:46:27,445][model8_pretrain.py][INFO] Epoch:[0/2](689700/4588595) loss:3.123 lr:0.0000100 epoch_Time:24798.0min: [2024-01-05 18:46:27,445][model8_pretrain.py][INFO] Epoch:[0/2](689700/4588595) loss:2.789 lr:0.0000100 epoch_Time:24798.0min: [2024-01-05 18:46:27,445][model8_pretrain.py][INFO] Epoch:[0/2](689700/4588595) loss:3.242 lr:0.0000100 epoch_Time:24798.0min: [2024-01-05 18:47:04,424][model8_pretrain.py][INFO] Epoch:[0/2](689800/4588595) loss:2.798 lr:0.0000100 epoch_Time:24797.0min: [2024-01-05 18:47:04,424][model8_pretrain.py][INFO] Epoch:[0/2](689800/4588595) loss:2.995 lr:0.0000100 epoch_Time:24797.0min: [2024-01-05 18:47:04,424][model8_pretrain.py][INFO] Epoch:[0/2](689800/4588595) loss:2.920 lr:0.0000100 epoch_Time:24797.0min: [2024-01-05 18:47:04,424][model8_pretrain.py][INFO] Epoch:[0/2](689800/4588595) loss:2.893 lr:0.0000100 epoch_Time:24797.0min: [2024-01-05 18:47:04,424][model8_pretrain.py][INFO] 
Epoch:[0/2](689800/4588595) loss:2.796 lr:0.0000100 epoch_Time:24797.0min: [2024-01-05 18:47:04,424][model8_pretrain.py][INFO] Epoch:[0/2](689800/4588595) loss:3.173 lr:0.0000100 epoch_Time:24797.0min: [2024-01-05 18:47:04,424][model8_pretrain.py][INFO] Epoch:[0/2](689800/4588595) loss:3.331 lr:0.0000100 epoch_Time:24797.0min: [2024-01-05 18:47:04,424][model8_pretrain.py][INFO] Epoch:[0/2](689800/4588595) loss:3.187 lr:0.0000100 epoch_Time:24797.0min: [2024-01-05 18:47:41,377][model8_pretrain.py][INFO] Epoch:[0/2](689900/4588595) loss:2.595 lr:0.0000100 epoch_Time:24797.0min: [2024-01-05 18:47:41,377][model8_pretrain.py][INFO] Epoch:[0/2](689900/4588595) loss:2.636 lr:0.0000100 epoch_Time:24797.0min: [2024-01-05 18:47:41,377][model8_pretrain.py][INFO] Epoch:[0/2](689900/4588595) loss:2.534 lr:0.0000100 epoch_Time:24797.0min: [2024-01-05 18:47:41,377][model8_pretrain.py][INFO] Epoch:[0/2](689900/4588595) loss:2.847 lr:0.0000100 epoch_Time:24797.0min: [2024-01-05 18:47:41,377][model8_pretrain.py][INFO] Epoch:[0/2](689900/4588595) loss:3.038 lr:0.0000100 epoch_Time:24797.0min: [2024-01-05 18:47:41,377][model8_pretrain.py][INFO] Epoch:[0/2](689900/4588595) loss:3.107 lr:0.0000100 epoch_Time:24797.0min: [2024-01-05 18:47:41,377][model8_pretrain.py][INFO] Epoch:[0/2](689900/4588595) loss:3.605 lr:0.0000100 epoch_Time:24797.0min: [2024-01-05 18:47:41,377][model8_pretrain.py][INFO] Epoch:[0/2](689900/4588595) loss:3.147 lr:0.0000100 epoch_Time:24797.0min: [2024-01-05 18:48:18,318][model8_pretrain.py][INFO] Epoch:[0/2](690000/4588595) loss:3.069 lr:0.0000100 epoch_Time:24796.0min: [2024-01-05 18:48:18,318][model8_pretrain.py][INFO] Epoch:[0/2](690000/4588595) loss:2.673 lr:0.0000100 epoch_Time:24796.0min: [2024-01-05 18:48:18,318][model8_pretrain.py][INFO] Epoch:[0/2](690000/4588595) loss:2.950 lr:0.0000100 epoch_Time:24796.0min: [2024-01-05 18:48:18,318][model8_pretrain.py][INFO] Epoch:[0/2](690000/4588595) loss:2.698 lr:0.0000100 epoch_Time:24796.0min: [2024-01-05 
18:48:18,318][model8_pretrain.py][INFO] Epoch:[0/2](690000/4588595) loss:2.154 lr:0.0000100 epoch_Time:24796.0min: [2024-01-05 18:48:18,318][model8_pretrain.py][INFO] Epoch:[0/2](690000/4588595) loss:3.014 lr:0.0000100 epoch_Time:24796.0min: [2024-01-05 18:48:18,318][model8_pretrain.py][INFO] Epoch:[0/2](690000/4588595) loss:3.442 lr:0.0000100 epoch_Time:24796.0min: [2024-01-05 18:48:18,318][model8_pretrain.py][INFO] Epoch:[0/2](690000/4588595) loss:2.090 lr:0.0000100 epoch_Time:24796.0min: [2024-01-05 18:48:55,259][model8_pretrain.py][INFO] Epoch:[0/2](690100/4588595) loss:2.460 lr:0.0000100 epoch_Time:24795.0min: [2024-01-05 18:48:55,259][model8_pretrain.py][INFO] Epoch:[0/2](690100/4588595) loss:2.793 lr:0.0000100 epoch_Time:24795.0min: [2024-01-05 18:48:55,259][model8_pretrain.py][INFO] Epoch:[0/2](690100/4588595) loss:2.737 lr:0.0000100 epoch_Time:24795.0min: [2024-01-05 18:48:55,259][model8_pretrain.py][INFO] Epoch:[0/2](690100/4588595) loss:2.891 lr:0.0000100 epoch_Time:24795.0min: [2024-01-05 18:48:55,259][model8_pretrain.py][INFO] Epoch:[0/2](690100/4588595) loss:2.628 lr:0.0000100 epoch_Time:24795.0min: [2024-01-05 18:48:55,259][model8_pretrain.py][INFO] Epoch:[0/2](690100/4588595) loss:3.177 lr:0.0000100 epoch_Time:24795.0min: [2024-01-05 18:48:55,259][model8_pretrain.py][INFO] Epoch:[0/2](690100/4588595) loss:2.881 lr:0.0000100 epoch_Time:24795.0min: [2024-01-05 18:48:55,259][model8_pretrain.py][INFO] Epoch:[0/2](690100/4588595) loss:2.797 lr:0.0000100 epoch_Time:24795.0min: [2024-01-05 18:49:44,179][model8_pretrain.py][INFO] Epoch:[0/2](690200/4588595) loss:3.156 lr:0.0000100 epoch_Time:24796.0min: [2024-01-05 18:49:44,184][model8_pretrain.py][INFO] Epoch:[0/2](690200/4588595) loss:2.686 lr:0.0000100 epoch_Time:24796.0min: [2024-01-05 18:49:44,184][model8_pretrain.py][INFO] Epoch:[0/2](690200/4588595) loss:3.038 lr:0.0000100 epoch_Time:24796.0min: [2024-01-05 18:49:44,184][model8_pretrain.py][INFO] Epoch:[0/2](690200/4588595) loss:2.823 lr:0.0000100 
epoch_Time:24796.0min: [2024-01-05 18:49:44,184][model8_pretrain.py][INFO] Epoch:[0/2](690200/4588595) loss:3.000 lr:0.0000100 epoch_Time:24796.0min: [2024-01-05 18:49:44,184][model8_pretrain.py][INFO] Epoch:[0/2](690200/4588595) loss:2.175 lr:0.0000100 epoch_Time:24796.0min: [2024-01-05 18:49:44,184][model8_pretrain.py][INFO] Epoch:[0/2](690200/4588595) loss:2.333 lr:0.0000100 epoch_Time:24796.0min: [2024-01-05 18:49:44,184][model8_pretrain.py][INFO] Epoch:[0/2](690200/4588595) loss:3.251 lr:0.0000100 epoch_Time:24796.0min: [2024-01-05 18:50:21,110][model8_pretrain.py][INFO] Epoch:[0/2](690300/4588595) loss:2.896 lr:0.0000100 epoch_Time:24795.0min: [2024-01-05 18:50:21,110][model8_pretrain.py][INFO] Epoch:[0/2](690300/4588595) loss:2.963 lr:0.0000100 epoch_Time:24795.0min: [2024-01-05 18:50:21,110][model8_pretrain.py][INFO] Epoch:[0/2](690300/4588595) loss:3.380 lr:0.0000100 epoch_Time:24795.0min: [2024-01-05 18:50:21,110][model8_pretrain.py][INFO] Epoch:[0/2](690300/4588595) loss:2.975 lr:0.0000100 epoch_Time:24795.0min: [2024-01-05 18:50:21,110][model8_pretrain.py][INFO] Epoch:[0/2](690300/4588595) loss:2.328 lr:0.0000100 epoch_Time:24795.0min: [2024-01-05 18:50:21,110][model8_pretrain.py][INFO] Epoch:[0/2](690300/4588595) loss:2.708 lr:0.0000100 epoch_Time:24795.0min: [2024-01-05 18:50:21,110][model8_pretrain.py][INFO] Epoch:[0/2](690300/4588595) loss:2.948 lr:0.0000100 epoch_Time:24795.0min: [2024-01-05 18:50:21,110][model8_pretrain.py][INFO] Epoch:[0/2](690300/4588595) loss:3.040 lr:0.0000100 epoch_Time:24795.0min: [2024-01-05 18:50:58,039][model8_pretrain.py][INFO] Epoch:[0/2](690400/4588595) loss:3.097 lr:0.0000100 epoch_Time:24793.0min: [2024-01-05 18:50:58,039][model8_pretrain.py][INFO] Epoch:[0/2](690400/4588595) loss:2.636 lr:0.0000100 epoch_Time:24793.0min: [2024-01-05 18:50:58,039][model8_pretrain.py][INFO] Epoch:[0/2](690400/4588595) loss:2.999 lr:0.0000100 epoch_Time:24793.0min: [2024-01-05 18:50:58,039][model8_pretrain.py][INFO] 
Epoch:[0/2](690400/4588595) loss:2.512 lr:0.0000100 epoch_Time:24793.0min: [2024-01-05 18:50:58,039][model8_pretrain.py][INFO] Epoch:[0/2](690400/4588595) loss:2.301 lr:0.0000100 epoch_Time:24793.0min: [2024-01-05 18:50:58,039][model8_pretrain.py][INFO] Epoch:[0/2](690400/4588595) loss:2.861 lr:0.0000100 epoch_Time:24793.0min: [2024-01-05 18:50:58,039][model8_pretrain.py][INFO] Epoch:[0/2](690400/4588595) loss:2.894 lr:0.0000100 epoch_Time:24793.0min: [2024-01-05 18:50:58,039][model8_pretrain.py][INFO] Epoch:[0/2](690400/4588595) loss:2.976 lr:0.0000100 epoch_Time:24793.0min: [2024-01-05 18:51:34,961][model8_pretrain.py][INFO] Epoch:[0/2](690500/4588595) loss:2.943 lr:0.0000100 epoch_Time:24793.0min: [2024-01-05 18:51:34,961][model8_pretrain.py][INFO] Epoch:[0/2](690500/4588595) loss:2.834 lr:0.0000100 epoch_Time:24793.0min: [2024-01-05 18:51:34,961][model8_pretrain.py][INFO] Epoch:[0/2](690500/4588595) loss:3.116 lr:0.0000100 epoch_Time:24793.0min: [2024-01-05 18:51:34,961][model8_pretrain.py][INFO] Epoch:[0/2](690500/4588595) loss:3.107 lr:0.0000100 epoch_Time:24793.0min: [2024-01-05 18:51:34,961][model8_pretrain.py][INFO] Epoch:[0/2](690500/4588595) loss:3.086 lr:0.0000100 epoch_Time:24793.0min: [2024-01-05 18:51:34,962][model8_pretrain.py][INFO] Epoch:[0/2](690500/4588595) loss:2.645 lr:0.0000100 epoch_Time:24793.0min: [2024-01-05 18:51:34,962][model8_pretrain.py][INFO] Epoch:[0/2](690500/4588595) loss:2.544 lr:0.0000100 epoch_Time:24793.0min: [2024-01-05 18:51:34,962][model8_pretrain.py][INFO] Epoch:[0/2](690500/4588595) loss:3.134 lr:0.0000100 epoch_Time:24793.0min: [2024-01-05 18:52:11,892][model8_pretrain.py][INFO] Epoch:[0/2](690600/4588595) loss:2.791 lr:0.0000100 epoch_Time:24792.0min: [2024-01-05 18:52:11,892][model8_pretrain.py][INFO] Epoch:[0/2](690600/4588595) loss:3.119 lr:0.0000100 epoch_Time:24792.0min: [2024-01-05 18:52:11,892][model8_pretrain.py][INFO] Epoch:[0/2](690600/4588595) loss:2.681 lr:0.0000100 epoch_Time:24792.0min: [2024-01-05 
18:52:11,892][model8_pretrain.py][INFO] Epoch:[0/2](690600/4588595) loss:2.762 lr:0.0000100 epoch_Time:24792.0min: [2024-01-05 18:52:11,892][model8_pretrain.py][INFO] Epoch:[0/2](690600/4588595) loss:2.684 lr:0.0000100 epoch_Time:24792.0min: [2024-01-05 18:52:11,892][model8_pretrain.py][INFO] Epoch:[0/2](690600/4588595) loss:2.551 lr:0.0000100 epoch_Time:24792.0min: [2024-01-05 18:52:11,892][model8_pretrain.py][INFO] Epoch:[0/2](690600/4588595) loss:3.170 lr:0.0000100 epoch_Time:24792.0min: [2024-01-05 18:52:11,892][model8_pretrain.py][INFO] Epoch:[0/2](690600/4588595) loss:2.705 lr:0.0000100 epoch_Time:24792.0min: [2024-01-05 18:52:48,823][model8_pretrain.py][INFO] Epoch:[0/2](690700/4588595) loss:2.667 lr:0.0000100 epoch_Time:24791.0min: [2024-01-05 18:52:48,823][model8_pretrain.py][INFO] Epoch:[0/2](690700/4588595) loss:2.357 lr:0.0000100 epoch_Time:24791.0min: [2024-01-05 18:52:48,823][model8_pretrain.py][INFO] Epoch:[0/2](690700/4588595) loss:2.807 lr:0.0000100 epoch_Time:24791.0min: [2024-01-05 18:52:48,823][model8_pretrain.py][INFO] Epoch:[0/2](690700/4588595) loss:2.952 lr:0.0000100 epoch_Time:24791.0min: [2024-01-05 18:52:48,823][model8_pretrain.py][INFO] Epoch:[0/2](690700/4588595) loss:3.179 lr:0.0000100 epoch_Time:24791.0min: [2024-01-05 18:52:48,823][model8_pretrain.py][INFO] Epoch:[0/2](690700/4588595) loss:2.196 lr:0.0000100 epoch_Time:24791.0min: [2024-01-05 18:52:48,823][model8_pretrain.py][INFO] Epoch:[0/2](690700/4588595) loss:2.526 lr:0.0000100 epoch_Time:24791.0min: [2024-01-05 18:52:48,823][model8_pretrain.py][INFO] Epoch:[0/2](690700/4588595) loss:2.914 lr:0.0000100 epoch_Time:24791.0min: [2024-01-05 18:53:25,754][model8_pretrain.py][INFO] Epoch:[0/2](690800/4588595) loss:3.090 lr:0.0000100 epoch_Time:24791.0min: [2024-01-05 18:53:25,754][model8_pretrain.py][INFO] Epoch:[0/2](690800/4588595) loss:2.670 lr:0.0000100 epoch_Time:24791.0min: [2024-01-05 18:53:25,755][model8_pretrain.py][INFO] Epoch:[0/2](690800/4588595) loss:3.028 lr:0.0000100 
epoch_Time:24791.0min: [2024-01-05 18:53:25,755][model8_pretrain.py][INFO] Epoch:[0/2](690800/4588595) loss:2.409 lr:0.0000100 epoch_Time:24791.0min: [2024-01-05 18:53:25,754][model8_pretrain.py][INFO] Epoch:[0/2](690800/4588595) loss:3.100 lr:0.0000100 epoch_Time:24791.0min: [2024-01-05 18:53:25,755][model8_pretrain.py][INFO] Epoch:[0/2](690800/4588595) loss:2.933 lr:0.0000100 epoch_Time:24791.0min: [2024-01-05 18:53:25,755][model8_pretrain.py][INFO] Epoch:[0/2](690800/4588595) loss:2.697 lr:0.0000100 epoch_Time:24791.0min: [2024-01-05 18:53:25,755][model8_pretrain.py][INFO] Epoch:[0/2](690800/4588595) loss:2.901 lr:0.0000100 epoch_Time:24791.0min: [2024-01-05 18:54:02,698][model8_pretrain.py][INFO] Epoch:[0/2](690900/4588595) loss:2.527 lr:0.0000100 epoch_Time:24790.0min: [2024-01-05 18:54:02,698][model8_pretrain.py][INFO] Epoch:[0/2](690900/4588595) loss:2.614 lr:0.0000100 epoch_Time:24790.0min: [2024-01-05 18:54:02,698][model8_pretrain.py][INFO] Epoch:[0/2](690900/4588595) loss:2.723 lr:0.0000100 epoch_Time:24790.0min: [2024-01-05 18:54:02,698][model8_pretrain.py][INFO] Epoch:[0/2](690900/4588595) loss:2.666 lr:0.0000100 epoch_Time:24790.0min: [2024-01-05 18:54:02,698][model8_pretrain.py][INFO] Epoch:[0/2](690900/4588595) loss:2.626 lr:0.0000100 epoch_Time:24790.0min: [2024-01-05 18:54:02,698][model8_pretrain.py][INFO] Epoch:[0/2](690900/4588595) loss:3.325 lr:0.0000100 epoch_Time:24790.0min: [2024-01-05 18:54:02,698][model8_pretrain.py][INFO] Epoch:[0/2](690900/4588595) loss:2.498 lr:0.0000100 epoch_Time:24790.0min: [2024-01-05 18:54:02,698][model8_pretrain.py][INFO] Epoch:[0/2](690900/4588595) loss:2.635 lr:0.0000100 epoch_Time:24790.0min: [2024-01-05 18:54:49,862][model8_pretrain.py][INFO] Epoch:[0/2](691000/4588595) loss:3.253 lr:0.0000100 epoch_Time:24790.0min: [2024-01-05 18:54:49,862][model8_pretrain.py][INFO] Epoch:[0/2](691000/4588595) loss:2.155 lr:0.0000100 epoch_Time:24790.0min: [2024-01-05 18:54:49,862][model8_pretrain.py][INFO] 
Epoch:[0/2](691000/4588595) loss:2.602 lr:0.0000100 epoch_Time:24790.0min: [2024-01-05 18:54:49,862][model8_pretrain.py][INFO] Epoch:[0/2](691000/4588595) loss:2.946 lr:0.0000100 epoch_Time:24790.0min: [2024-01-05 18:54:49,862][model8_pretrain.py][INFO] Epoch:[0/2](691000/4588595) loss:2.499 lr:0.0000100 epoch_Time:24790.0min: [2024-01-05 18:54:49,862][model8_pretrain.py][INFO] Epoch:[0/2](691000/4588595) loss:2.540 lr:0.0000100 epoch_Time:24790.0min: [2024-01-05 18:54:49,862][model8_pretrain.py][INFO] Epoch:[0/2](691000/4588595) loss:2.499 lr:0.0000100 epoch_Time:24790.0min: [2024-01-05 18:54:49,862][model8_pretrain.py][INFO] Epoch:[0/2](691000/4588595) loss:3.100 lr:0.0000100 epoch_Time:24790.0min: [2024-01-05 18:55:28,477][model8_pretrain.py][INFO] Epoch:[0/2](691100/4588595) loss:2.843 lr:0.0000100 epoch_Time:24790.0min: [2024-01-05 18:55:28,477][model8_pretrain.py][INFO] Epoch:[0/2](691100/4588595) loss:2.691 lr:0.0000100 epoch_Time:24790.0min: [2024-01-05 18:55:28,477][model8_pretrain.py][INFO] Epoch:[0/2](691100/4588595) loss:2.577 lr:0.0000100 epoch_Time:24790.0min: [2024-01-05 18:55:28,477][model8_pretrain.py][INFO] Epoch:[0/2](691100/4588595) loss:2.969 lr:0.0000100 epoch_Time:24790.0min: [2024-01-05 18:55:28,477][model8_pretrain.py][INFO] Epoch:[0/2](691100/4588595) loss:3.210 lr:0.0000100 epoch_Time:24790.0min: [2024-01-05 18:55:28,477][model8_pretrain.py][INFO] Epoch:[0/2](691100/4588595) loss:2.796 lr:0.0000100 epoch_Time:24790.0min: [2024-01-05 18:55:28,477][model8_pretrain.py][INFO] Epoch:[0/2](691100/4588595) loss:2.349 lr:0.0000100 epoch_Time:24790.0min: [2024-01-05 18:55:28,477][model8_pretrain.py][INFO] Epoch:[0/2](691100/4588595) loss:3.198 lr:0.0000100 epoch_Time:24790.0min: [2024-01-05 18:56:05,414][model8_pretrain.py][INFO] Epoch:[0/2](691200/4588595) loss:3.055 lr:0.0000100 epoch_Time:24789.0min: [2024-01-05 18:56:05,414][model8_pretrain.py][INFO] Epoch:[0/2](691200/4588595) loss:2.885 lr:0.0000100 epoch_Time:24789.0min: [2024-01-05 
18:56:05,414][model8_pretrain.py][INFO] Epoch:[0/2](691200/4588595) loss:3.064 lr:0.0000100 epoch_Time:24789.0min: [2024-01-05 18:56:05,414][model8_pretrain.py][INFO] Epoch:[0/2](691200/4588595) loss:2.663 lr:0.0000100 epoch_Time:24789.0min: [2024-01-05 18:56:05,414][model8_pretrain.py][INFO] Epoch:[0/2](691200/4588595) loss:3.137 lr:0.0000100 epoch_Time:24789.0min: [2024-01-05 18:56:05,414][model8_pretrain.py][INFO] Epoch:[0/2](691200/4588595) loss:2.778 lr:0.0000100 epoch_Time:24789.0min: [2024-01-05 18:56:05,414][model8_pretrain.py][INFO] Epoch:[0/2](691200/4588595) loss:3.265 lr:0.0000100 epoch_Time:24789.0min: [2024-01-05 18:56:05,414][model8_pretrain.py][INFO] Epoch:[0/2](691200/4588595) loss:2.951 lr:0.0000100 epoch_Time:24789.0min: [2024-01-05 18:56:42,344][model8_pretrain.py][INFO] Epoch:[0/2](691300/4588595) loss:2.774 lr:0.0000100 epoch_Time:24789.0min: [2024-01-05 18:56:42,344][model8_pretrain.py][INFO] Epoch:[0/2](691300/4588595) loss:2.830 lr:0.0000100 epoch_Time:24789.0min: [2024-01-05 18:56:42,344][model8_pretrain.py][INFO] Epoch:[0/2](691300/4588595) loss:2.965 lr:0.0000100 epoch_Time:24789.0min: [2024-01-05 18:56:42,344][model8_pretrain.py][INFO] Epoch:[0/2](691300/4588595) loss:2.796 lr:0.0000100 epoch_Time:24789.0min: [2024-01-05 18:56:42,344][model8_pretrain.py][INFO] Epoch:[0/2](691300/4588595) loss:2.932 lr:0.0000100 epoch_Time:24789.0min: [2024-01-05 18:56:42,344][model8_pretrain.py][INFO] Epoch:[0/2](691300/4588595) loss:2.956 lr:0.0000100 epoch_Time:24789.0min: [2024-01-05 18:56:42,344][model8_pretrain.py][INFO] Epoch:[0/2](691300/4588595) loss:2.539 lr:0.0000100 epoch_Time:24789.0min: [2024-01-05 18:56:42,344][model8_pretrain.py][INFO] Epoch:[0/2](691300/4588595) loss:2.706 lr:0.0000100 epoch_Time:24789.0min: [2024-01-05 18:57:19,289][model8_pretrain.py][INFO] Epoch:[0/2](691400/4588595) loss:2.641 lr:0.0000100 epoch_Time:24787.0min: [2024-01-05 18:57:19,289][model8_pretrain.py][INFO] Epoch:[0/2](691400/4588595) loss:2.927 lr:0.0000100 
epoch_Time:24787.0min: [2024-01-05 18:57:19,289][model8_pretrain.py][INFO] Epoch:[0/2](691400/4588595) loss:2.681 lr:0.0000100 epoch_Time:24787.0min: [2024-01-05 18:57:19,289][model8_pretrain.py][INFO] Epoch:[0/2](691400/4588595) loss:3.111 lr:0.0000100 epoch_Time:24787.0min: [2024-01-05 18:57:19,289][model8_pretrain.py][INFO] Epoch:[0/2](691400/4588595) loss:3.165 lr:0.0000100 epoch_Time:24787.0min: [2024-01-05 18:57:19,289][model8_pretrain.py][INFO] Epoch:[0/2](691400/4588595) loss:3.021 lr:0.0000100 epoch_Time:24787.0min: [2024-01-05 18:57:19,289][model8_pretrain.py][INFO] Epoch:[0/2](691400/4588595) loss:2.346 lr:0.0000100 epoch_Time:24787.0min: [2024-01-05 18:57:19,289][model8_pretrain.py][INFO] Epoch:[0/2](691400/4588595) loss:2.965 lr:0.0000100 epoch_Time:24787.0min: [2024-01-05 18:57:56,230][model8_pretrain.py][INFO] Epoch:[0/2](691500/4588595) loss:2.783 lr:0.0000100 epoch_Time:24786.0min: [2024-01-05 18:57:56,230][model8_pretrain.py][INFO] Epoch:[0/2](691500/4588595) loss:3.290 lr:0.0000100 epoch_Time:24786.0min: [2024-01-05 18:57:56,230][model8_pretrain.py][INFO] Epoch:[0/2](691500/4588595) loss:2.839 lr:0.0000100 epoch_Time:24786.0min: [2024-01-05 18:57:56,230][model8_pretrain.py][INFO] Epoch:[0/2](691500/4588595) loss:2.557 lr:0.0000100 epoch_Time:24786.0min: [2024-01-05 18:57:56,230][model8_pretrain.py][INFO] Epoch:[0/2](691500/4588595) loss:2.692 lr:0.0000100 epoch_Time:24786.0min: [2024-01-05 18:57:56,230][model8_pretrain.py][INFO] Epoch:[0/2](691500/4588595) loss:2.610 lr:0.0000100 epoch_Time:24786.0min: [2024-01-05 18:57:56,230][model8_pretrain.py][INFO] Epoch:[0/2](691500/4588595) loss:1.976 lr:0.0000100 epoch_Time:24786.0min: [2024-01-05 18:57:56,230][model8_pretrain.py][INFO] Epoch:[0/2](691500/4588595) loss:2.528 lr:0.0000100 epoch_Time:24786.0min: [2024-01-05 18:58:33,182][model8_pretrain.py][INFO] Epoch:[0/2](691600/4588595) loss:3.246 lr:0.0000100 epoch_Time:24786.0min: [2024-01-05 18:58:33,183][model8_pretrain.py][INFO] 
Epoch:[0/2](691600/4588595) loss:2.619 lr:0.0000100 epoch_Time:24786.0min: [2024-01-05 18:58:33,183][model8_pretrain.py][INFO] Epoch:[0/2](691600/4588595) loss:3.265 lr:0.0000100 epoch_Time:24786.0min: [2024-01-05 18:58:33,183][model8_pretrain.py][INFO] Epoch:[0/2](691600/4588595) loss:2.798 lr:0.0000100 epoch_Time:24786.0min: [2024-01-05 18:58:33,183][model8_pretrain.py][INFO] Epoch:[0/2](691600/4588595) loss:2.992 lr:0.0000100 epoch_Time:24786.0min: [2024-01-05 18:58:33,183][model8_pretrain.py][INFO] Epoch:[0/2](691600/4588595) loss:2.515 lr:0.0000100 epoch_Time:24786.0min: [2024-01-05 18:58:33,183][model8_pretrain.py][INFO] Epoch:[0/2](691600/4588595) loss:3.030 lr:0.0000100 epoch_Time:24786.0min: [2024-01-05 18:58:33,183][model8_pretrain.py][INFO] Epoch:[0/2](691600/4588595) loss:2.884 lr:0.0000100 epoch_Time:24786.0min: [2024-01-05 18:59:10,118][model8_pretrain.py][INFO] Epoch:[0/2](691700/4588595) loss:2.507 lr:0.0000100 epoch_Time:24785.0min: [2024-01-05 18:59:10,118][model8_pretrain.py][INFO] Epoch:[0/2](691700/4588595) loss:2.382 lr:0.0000100 epoch_Time:24785.0min: [2024-01-05 18:59:10,118][model8_pretrain.py][INFO] Epoch:[0/2](691700/4588595) loss:2.734 lr:0.0000100 epoch_Time:24785.0min: [2024-01-05 18:59:10,118][model8_pretrain.py][INFO] Epoch:[0/2](691700/4588595) loss:2.893 lr:0.0000100 epoch_Time:24785.0min: [2024-01-05 18:59:10,118][model8_pretrain.py][INFO] Epoch:[0/2](691700/4588595) loss:3.070 lr:0.0000100 epoch_Time:24785.0min: [2024-01-05 18:59:10,118][model8_pretrain.py][INFO] Epoch:[0/2](691700/4588595) loss:2.794 lr:0.0000100 epoch_Time:24785.0min: [2024-01-05 18:59:10,118][model8_pretrain.py][INFO] Epoch:[0/2](691700/4588595) loss:3.162 lr:0.0000100 epoch_Time:24785.0min: [2024-01-05 18:59:10,118][model8_pretrain.py][INFO] Epoch:[0/2](691700/4588595) loss:3.007 lr:0.0000100 epoch_Time:24785.0min: [2024-01-05 18:59:57,072][model8_pretrain.py][INFO] Epoch:[0/2](691800/4588595) loss:3.236 lr:0.0000100 epoch_Time:24785.0min: [2024-01-05 
18:59:57,072][model8_pretrain.py][INFO] Epoch:[0/2](691800/4588595) loss:2.724 lr:0.0000100 epoch_Time:24785.0min: [2024-01-05 18:59:57,072][model8_pretrain.py][INFO] Epoch:[0/2](691800/4588595) loss:2.849 lr:0.0000100 epoch_Time:24785.0min: [2024-01-05 18:59:57,072][model8_pretrain.py][INFO] Epoch:[0/2](691800/4588595) loss:2.445 lr:0.0000100 epoch_Time:24785.0min: [2024-01-05 18:59:57,072][model8_pretrain.py][INFO] Epoch:[0/2](691800/4588595) loss:2.703 lr:0.0000100 epoch_Time:24785.0min: [2024-01-05 18:59:57,073][model8_pretrain.py][INFO] Epoch:[0/2](691800/4588595) loss:3.395 lr:0.0000100 epoch_Time:24785.0min: [2024-01-05 18:59:57,073][model8_pretrain.py][INFO] Epoch:[0/2](691800/4588595) loss:3.549 lr:0.0000100 epoch_Time:24785.0min: [2024-01-05 18:59:57,074][model8_pretrain.py][INFO] Epoch:[0/2](691800/4588595) loss:2.680 lr:0.0000100 epoch_Time:24785.0min: [2024-01-05 19:00:35,693][model8_pretrain.py][INFO] Epoch:[0/2](691900/4588595) loss:3.086 lr:0.0000100 epoch_Time:24785.0min: [2024-01-05 19:00:35,693][model8_pretrain.py][INFO] Epoch:[0/2](691900/4588595) loss:2.734 lr:0.0000100 epoch_Time:24785.0min: [2024-01-05 19:00:35,693][model8_pretrain.py][INFO] Epoch:[0/2](691900/4588595) loss:2.628 lr:0.0000100 epoch_Time:24785.0min: [2024-01-05 19:00:35,693][model8_pretrain.py][INFO] Epoch:[0/2](691900/4588595) loss:3.326 lr:0.0000100 epoch_Time:24785.0min: [2024-01-05 19:00:35,693][model8_pretrain.py][INFO] Epoch:[0/2](691900/4588595) loss:2.349 lr:0.0000100 epoch_Time:24785.0min: [2024-01-05 19:00:35,694][model8_pretrain.py][INFO] Epoch:[0/2](691900/4588595) loss:3.045 lr:0.0000100 epoch_Time:24785.0min: [2024-01-05 19:00:35,694][model8_pretrain.py][INFO] Epoch:[0/2](691900/4588595) loss:2.852 lr:0.0000100 epoch_Time:24785.0min: [2024-01-05 19:00:35,694][model8_pretrain.py][INFO] Epoch:[0/2](691900/4588595) loss:3.295 lr:0.0000100 epoch_Time:24785.0min: [2024-01-05 19:01:12,634][model8_pretrain.py][INFO] Epoch:[0/2](692000/4588595) loss:2.655 lr:0.0000100 
epoch_Time:24784.0min: [2024-01-05 19:01:12,634][model8_pretrain.py][INFO] Epoch:[0/2](692000/4588595) loss:2.586 lr:0.0000100 epoch_Time:24784.0min: [2024-01-05 19:01:12,634][model8_pretrain.py][INFO] Epoch:[0/2](692000/4588595) loss:3.194 lr:0.0000100 epoch_Time:24784.0min: [2024-01-05 19:01:12,634][model8_pretrain.py][INFO] Epoch:[0/2](692000/4588595) loss:3.004 lr:0.0000100 epoch_Time:24784.0min: [2024-01-05 19:01:12,634][model8_pretrain.py][INFO] Epoch:[0/2](692000/4588595) loss:2.969 lr:0.0000100 epoch_Time:24784.0min: [2024-01-05 19:01:12,634][model8_pretrain.py][INFO] Epoch:[0/2](692000/4588595) loss:3.080 lr:0.0000100 epoch_Time:24784.0min: [2024-01-05 19:01:12,634][model8_pretrain.py][INFO] Epoch:[0/2](692000/4588595) loss:2.641 lr:0.0000100 epoch_Time:24784.0min: [2024-01-05 19:01:12,634][model8_pretrain.py][INFO] Epoch:[0/2](692000/4588595) loss:2.830 lr:0.0000100 epoch_Time:24784.0min: [2024-01-05 19:01:49,566][model8_pretrain.py][INFO] Epoch:[0/2](692100/4588595) loss:2.908 lr:0.0000100 epoch_Time:24783.0min: [2024-01-05 19:01:49,566][model8_pretrain.py][INFO] Epoch:[0/2](692100/4588595) loss:3.219 lr:0.0000100 epoch_Time:24783.0min: [2024-01-05 19:01:49,566][model8_pretrain.py][INFO] Epoch:[0/2](692100/4588595) loss:3.072 lr:0.0000100 epoch_Time:24783.0min: [2024-01-05 19:01:49,566][model8_pretrain.py][INFO] Epoch:[0/2](692100/4588595) loss:2.682 lr:0.0000100 epoch_Time:24783.0min: [2024-01-05 19:01:49,566][model8_pretrain.py][INFO] Epoch:[0/2](692100/4588595) loss:3.187 lr:0.0000100 epoch_Time:24783.0min: [2024-01-05 19:01:49,566][model8_pretrain.py][INFO] Epoch:[0/2](692100/4588595) loss:2.448 lr:0.0000100 epoch_Time:24783.0min: [2024-01-05 19:01:49,566][model8_pretrain.py][INFO] Epoch:[0/2](692100/4588595) loss:3.046 lr:0.0000100 epoch_Time:24783.0min: [2024-01-05 19:01:49,566][model8_pretrain.py][INFO] Epoch:[0/2](692100/4588595) loss:3.084 lr:0.0000100 epoch_Time:24783.0min: [2024-01-05 19:02:26,512][model8_pretrain.py][INFO] 
Epoch:[0/2](692200/4588595) loss:3.146 lr:0.0000100 epoch_Time:24783.0min: [2024-01-05 19:02:26,512][model8_pretrain.py][INFO] Epoch:[0/2](692200/4588595) loss:3.184 lr:0.0000100 epoch_Time:24783.0min: [2024-01-05 19:02:26,512][model8_pretrain.py][INFO] Epoch:[0/2](692200/4588595) loss:3.179 lr:0.0000100 epoch_Time:24783.0min: [2024-01-05 19:02:26,512][model8_pretrain.py][INFO] Epoch:[0/2](692200/4588595) loss:2.789 lr:0.0000100 epoch_Time:24783.0min: [2024-01-05 19:02:26,512][model8_pretrain.py][INFO] Epoch:[0/2](692200/4588595) loss:2.625 lr:0.0000100 epoch_Time:24783.0min: [2024-01-05 19:02:26,512][model8_pretrain.py][INFO] Epoch:[0/2](692200/4588595) loss:2.939 lr:0.0000100 epoch_Time:24783.0min: [2024-01-05 19:02:26,513][model8_pretrain.py][INFO] Epoch:[0/2](692200/4588595) loss:3.032 lr:0.0000100 epoch_Time:24783.0min: [2024-01-05 19:02:26,513][model8_pretrain.py][INFO] Epoch:[0/2](692200/4588595) loss:2.667 lr:0.0000100 epoch_Time:24783.0min: [2024-01-05 19:03:03,491][model8_pretrain.py][INFO] Epoch:[0/2](692300/4588595) loss:2.458 lr:0.0000100 epoch_Time:24781.0min: [2024-01-05 19:03:03,491][model8_pretrain.py][INFO] Epoch:[0/2](692300/4588595) loss:2.411 lr:0.0000100 epoch_Time:24781.0min: [2024-01-05 19:03:03,491][model8_pretrain.py][INFO] Epoch:[0/2](692300/4588595) loss:2.667 lr:0.0000100 epoch_Time:24781.0min: [2024-01-05 19:03:03,491][model8_pretrain.py][INFO] Epoch:[0/2](692300/4588595) loss:2.998 lr:0.0000100 epoch_Time:24781.0min: [2024-01-05 19:03:03,491][model8_pretrain.py][INFO] Epoch:[0/2](692300/4588595) loss:2.476 lr:0.0000100 epoch_Time:24781.0min: [2024-01-05 19:03:03,491][model8_pretrain.py][INFO] Epoch:[0/2](692300/4588595) loss:2.848 lr:0.0000100 epoch_Time:24781.0min: [2024-01-05 19:03:03,491][model8_pretrain.py][INFO] Epoch:[0/2](692300/4588595) loss:2.794 lr:0.0000100 epoch_Time:24781.0min: [2024-01-05 19:03:03,491][model8_pretrain.py][INFO] Epoch:[0/2](692300/4588595) loss:2.413 lr:0.0000100 epoch_Time:24781.0min: [2024-01-05 
19:03:40,429][model8_pretrain.py][INFO] Epoch:[0/2](692400/4588595) loss:3.039 lr:0.0000100 epoch_Time:24781.0min: [2024-01-05 19:03:40,429][model8_pretrain.py][INFO] Epoch:[0/2](692400/4588595) loss:2.793 lr:0.0000100 epoch_Time:24781.0min: [2024-01-05 19:03:40,429][model8_pretrain.py][INFO] Epoch:[0/2](692400/4588595) loss:3.189 lr:0.0000100 epoch_Time:24781.0min: [2024-01-05 19:03:40,429][model8_pretrain.py][INFO] Epoch:[0/2](692400/4588595) loss:2.699 lr:0.0000100 epoch_Time:24781.0min: [2024-01-05 19:03:40,429][model8_pretrain.py][INFO] Epoch:[0/2](692400/4588595) loss:3.009 lr:0.0000100 epoch_Time:24781.0min: [2024-01-05 19:03:40,429][model8_pretrain.py][INFO] Epoch:[0/2](692400/4588595) loss:2.732 lr:0.0000100 epoch_Time:24781.0min: [2024-01-05 19:03:40,429][model8_pretrain.py][INFO] Epoch:[0/2](692400/4588595) loss:2.182 lr:0.0000100 epoch_Time:24781.0min: [2024-01-05 19:03:40,429][model8_pretrain.py][INFO] Epoch:[0/2](692400/4588595) loss:2.689 lr:0.0000100 epoch_Time:24781.0min: [2024-01-05 19:04:17,350][model8_pretrain.py][INFO] Epoch:[0/2](692500/4588595) loss:2.409 lr:0.0000100 epoch_Time:24780.0min: [2024-01-05 19:04:17,350][model8_pretrain.py][INFO] Epoch:[0/2](692500/4588595) loss:2.959 lr:0.0000100 epoch_Time:24780.0min: [2024-01-05 19:04:17,350][model8_pretrain.py][INFO] Epoch:[0/2](692500/4588595) loss:2.872 lr:0.0000100 epoch_Time:24780.0min: [2024-01-05 19:04:17,350][model8_pretrain.py][INFO] Epoch:[0/2](692500/4588595) loss:3.160 lr:0.0000100 epoch_Time:24780.0min: [2024-01-05 19:04:17,350][model8_pretrain.py][INFO] Epoch:[0/2](692500/4588595) loss:3.427 lr:0.0000100 epoch_Time:24780.0min: [2024-01-05 19:04:17,350][model8_pretrain.py][INFO] Epoch:[0/2](692500/4588595) loss:2.546 lr:0.0000100 epoch_Time:24780.0min: [2024-01-05 19:04:17,350][model8_pretrain.py][INFO] Epoch:[0/2](692500/4588595) loss:2.961 lr:0.0000100 epoch_Time:24780.0min: [2024-01-05 19:04:17,350][model8_pretrain.py][INFO] Epoch:[0/2](692500/4588595) loss:3.051 lr:0.0000100 
epoch_Time:24780.0min: [2024-01-05 19:05:02,668][model8_pretrain.py][INFO] Epoch:[0/2](692600/4588595) loss:2.664 lr:0.0000100 epoch_Time:24780.0min: [2024-01-05 19:05:02,668][model8_pretrain.py][INFO] Epoch:[0/2](692600/4588595) loss:3.015 lr:0.0000100 epoch_Time:24780.0min: [2024-01-05 19:05:02,668][model8_pretrain.py][INFO] Epoch:[0/2](692600/4588595) loss:2.909 lr:0.0000100 epoch_Time:24780.0min: [2024-01-05 19:05:02,668][model8_pretrain.py][INFO] Epoch:[0/2](692600/4588595) loss:2.908 lr:0.0000100 epoch_Time:24780.0min: [2024-01-05 19:05:02,668][model8_pretrain.py][INFO] Epoch:[0/2](692600/4588595) loss:2.353 lr:0.0000100 epoch_Time:24780.0min: [2024-01-05 19:05:02,668][model8_pretrain.py][INFO] Epoch:[0/2](692600/4588595) loss:3.225 lr:0.0000100 epoch_Time:24780.0min: [2024-01-05 19:05:02,668][model8_pretrain.py][INFO] Epoch:[0/2](692600/4588595) loss:2.750 lr:0.0000100 epoch_Time:24780.0min: [2024-01-05 19:05:02,668][model8_pretrain.py][INFO] Epoch:[0/2](692600/4588595) loss:2.530 lr:0.0000100 epoch_Time:24780.0min: [2024-01-05 19:05:43,072][model8_pretrain.py][INFO] Epoch:[0/2](692700/4588595) loss:2.582 lr:0.0000100 epoch_Time:24780.0min: [2024-01-05 19:05:43,072][model8_pretrain.py][INFO] Epoch:[0/2](692700/4588595) loss:2.672 lr:0.0000100 epoch_Time:24780.0min: [2024-01-05 19:05:43,072][model8_pretrain.py][INFO] Epoch:[0/2](692700/4588595) loss:2.909 lr:0.0000100 epoch_Time:24780.0min: [2024-01-05 19:05:43,072][model8_pretrain.py][INFO] Epoch:[0/2](692700/4588595) loss:3.012 lr:0.0000100 epoch_Time:24780.0min: [2024-01-05 19:05:43,072][model8_pretrain.py][INFO] Epoch:[0/2](692700/4588595) loss:2.349 lr:0.0000100 epoch_Time:24780.0min: [2024-01-05 19:05:43,072][model8_pretrain.py][INFO] Epoch:[0/2](692700/4588595) loss:2.453 lr:0.0000100 epoch_Time:24780.0min: [2024-01-05 19:05:43,072][model8_pretrain.py][INFO] Epoch:[0/2](692700/4588595) loss:2.429 lr:0.0000100 epoch_Time:24780.0min: [2024-01-05 19:05:43,072][model8_pretrain.py][INFO] 
Epoch:[0/2](692700/4588595) loss:2.980 lr:0.0000100 epoch_Time:24780.0min: [2024-01-05 19:06:20,000][model8_pretrain.py][INFO] Epoch:[0/2](692800/4588595) loss:3.081 lr:0.0000100 epoch_Time:24779.0min: [2024-01-05 19:06:20,000][model8_pretrain.py][INFO] Epoch:[0/2](692800/4588595) loss:2.787 lr:0.0000100 epoch_Time:24779.0min: [2024-01-05 19:06:20,000][model8_pretrain.py][INFO] Epoch:[0/2](692800/4588595) loss:3.102 lr:0.0000100 epoch_Time:24779.0min: [2024-01-05 19:06:20,000][model8_pretrain.py][INFO] Epoch:[0/2](692800/4588595) loss:3.159 lr:0.0000100 epoch_Time:24779.0min: [2024-01-05 19:06:20,000][model8_pretrain.py][INFO] Epoch:[0/2](692800/4588595) loss:1.723 lr:0.0000100 epoch_Time:24779.0min: [2024-01-05 19:06:20,001][model8_pretrain.py][INFO] Epoch:[0/2](692800/4588595) loss:3.003 lr:0.0000100 epoch_Time:24779.0min: [2024-01-05 19:06:20,001][model8_pretrain.py][INFO] Epoch:[0/2](692800/4588595) loss:3.047 lr:0.0000100 epoch_Time:24779.0min: [2024-01-05 19:06:20,001][model8_pretrain.py][INFO] Epoch:[0/2](692800/4588595) loss:2.780 lr:0.0000100 epoch_Time:24779.0min: [2024-01-05 19:06:56,958][model8_pretrain.py][INFO] Epoch:[0/2](692900/4588595) loss:2.079 lr:0.0000100 epoch_Time:24778.0min: [2024-01-05 19:06:56,958][model8_pretrain.py][INFO] Epoch:[0/2](692900/4588595) loss:2.786 lr:0.0000100 epoch_Time:24778.0min: [2024-01-05 19:06:56,958][model8_pretrain.py][INFO] Epoch:[0/2](692900/4588595) loss:2.697 lr:0.0000100 epoch_Time:24778.0min: [2024-01-05 19:06:56,958][model8_pretrain.py][INFO] Epoch:[0/2](692900/4588595) loss:2.652 lr:0.0000100 epoch_Time:24778.0min: [2024-01-05 19:06:56,958][model8_pretrain.py][INFO] Epoch:[0/2](692900/4588595) loss:2.522 lr:0.0000100 epoch_Time:24778.0min: [2024-01-05 19:06:56,958][model8_pretrain.py][INFO] Epoch:[0/2](692900/4588595) loss:2.719 lr:0.0000100 epoch_Time:24778.0min: [2024-01-05 19:06:56,958][model8_pretrain.py][INFO] Epoch:[0/2](692900/4588595) loss:2.780 lr:0.0000100 epoch_Time:24778.0min: [2024-01-05 
19:06:56,959][model8_pretrain.py][INFO] Epoch:[0/2](692900/4588595) loss:2.724 lr:0.0000100 epoch_Time:24778.0min: [2024-01-05 19:07:33,913][model8_pretrain.py][INFO] Epoch:[0/2](693000/4588595) loss:2.434 lr:0.0000100 epoch_Time:24778.0min: [2024-01-05 19:07:33,913][model8_pretrain.py][INFO] Epoch:[0/2](693000/4588595) loss:3.038 lr:0.0000100 epoch_Time:24778.0min: [2024-01-05 19:07:33,913][model8_pretrain.py][INFO] Epoch:[0/2](693000/4588595) loss:2.722 lr:0.0000100 epoch_Time:24778.0min: [2024-01-05 19:07:33,913][model8_pretrain.py][INFO] Epoch:[0/2](693000/4588595) loss:3.023 lr:0.0000100 epoch_Time:24778.0min: [2024-01-05 19:07:33,913][model8_pretrain.py][INFO] Epoch:[0/2](693000/4588595) loss:3.018 lr:0.0000100 epoch_Time:24778.0min: [2024-01-05 19:07:33,913][model8_pretrain.py][INFO] Epoch:[0/2](693000/4588595) loss:2.061 lr:0.0000100 epoch_Time:24778.0min: [2024-01-05 19:07:33,913][model8_pretrain.py][INFO] Epoch:[0/2](693000/4588595) loss:3.289 lr:0.0000100 epoch_Time:24778.0min: [2024-01-05 19:07:33,913][model8_pretrain.py][INFO] Epoch:[0/2](693000/4588595) loss:2.987 lr:0.0000100 epoch_Time:24778.0min: [2024-01-05 19:08:10,860][model8_pretrain.py][INFO] Epoch:[0/2](693100/4588595) loss:3.014 lr:0.0000100 epoch_Time:24777.0min: [2024-01-05 19:08:10,860][model8_pretrain.py][INFO] Epoch:[0/2](693100/4588595) loss:2.895 lr:0.0000100 epoch_Time:24777.0min: [2024-01-05 19:08:10,860][model8_pretrain.py][INFO] Epoch:[0/2](693100/4588595) loss:2.791 lr:0.0000100 epoch_Time:24777.0min: [2024-01-05 19:08:10,860][model8_pretrain.py][INFO] Epoch:[0/2](693100/4588595) loss:2.941 lr:0.0000100 epoch_Time:24777.0min: [2024-01-05 19:08:10,860][model8_pretrain.py][INFO] Epoch:[0/2](693100/4588595) loss:3.157 lr:0.0000100 epoch_Time:24777.0min: [2024-01-05 19:08:10,860][model8_pretrain.py][INFO] Epoch:[0/2](693100/4588595) loss:3.077 lr:0.0000100 epoch_Time:24777.0min: [2024-01-05 19:08:10,860][model8_pretrain.py][INFO] Epoch:[0/2](693100/4588595) loss:3.143 lr:0.0000100 
epoch_Time:24777.0min: [2024-01-05 19:08:10,860][model8_pretrain.py][INFO] Epoch:[0/2](693100/4588595) loss:2.757 lr:0.0000100 epoch_Time:24777.0min: [2024-01-05 19:08:47,815][model8_pretrain.py][INFO] Epoch:[0/2](693200/4588595) loss:3.031 lr:0.0000100 epoch_Time:24776.0min: [2024-01-05 19:08:47,815][model8_pretrain.py][INFO] Epoch:[0/2](693200/4588595) loss:2.466 lr:0.0000100 epoch_Time:24776.0min: [2024-01-05 19:08:47,815][model8_pretrain.py][INFO] Epoch:[0/2](693200/4588595) loss:2.673 lr:0.0000100 epoch_Time:24776.0min: [2024-01-05 19:08:47,815][model8_pretrain.py][INFO] Epoch:[0/2](693200/4588595) loss:2.690 lr:0.0000100 epoch_Time:24776.0min: [2024-01-05 19:08:47,815][model8_pretrain.py][INFO] Epoch:[0/2](693200/4588595) loss:2.874 lr:0.0000100 epoch_Time:24776.0min: [2024-01-05 19:08:47,815][model8_pretrain.py][INFO] Epoch:[0/2](693200/4588595) loss:2.390 lr:0.0000100 epoch_Time:24776.0min: [2024-01-05 19:08:47,815][model8_pretrain.py][INFO] Epoch:[0/2](693200/4588595) loss:3.162 lr:0.0000100 epoch_Time:24776.0min: [2024-01-05 19:08:47,815][model8_pretrain.py][INFO] Epoch:[0/2](693200/4588595) loss:3.207 lr:0.0000100 epoch_Time:24776.0min: [2024-01-05 19:09:24,764][model8_pretrain.py][INFO] Epoch:[0/2](693300/4588595) loss:2.844 lr:0.0000100 epoch_Time:24775.0min: [2024-01-05 19:09:24,764][model8_pretrain.py][INFO] Epoch:[0/2](693300/4588595) loss:3.280 lr:0.0000100 epoch_Time:24775.0min: [2024-01-05 19:09:24,764][model8_pretrain.py][INFO] Epoch:[0/2](693300/4588595) loss:2.438 lr:0.0000100 epoch_Time:24775.0min: [2024-01-05 19:09:24,764][model8_pretrain.py][INFO] Epoch:[0/2](693300/4588595) loss:3.446 lr:0.0000100 epoch_Time:24775.0min: [2024-01-05 19:09:24,764][model8_pretrain.py][INFO] Epoch:[0/2](693300/4588595) loss:3.151 lr:0.0000100 epoch_Time:24775.0min: [2024-01-05 19:09:24,764][model8_pretrain.py][INFO] Epoch:[0/2](693300/4588595) loss:2.914 lr:0.0000100 epoch_Time:24775.0min: [2024-01-05 19:09:24,765][model8_pretrain.py][INFO] 
Epoch:[0/2](693300/4588595) loss:2.958 lr:0.0000100 epoch_Time:24775.0min: [2024-01-05 19:09:24,765][model8_pretrain.py][INFO] Epoch:[0/2](693300/4588595) loss:2.966 lr:0.0000100 epoch_Time:24775.0min: [2024-01-05 19:10:08,526][model8_pretrain.py][INFO] Epoch:[0/2](693400/4588595) loss:3.153 lr:0.0000100 epoch_Time:24775.0min: [2024-01-05 19:10:08,526][model8_pretrain.py][INFO] Epoch:[0/2](693400/4588595) loss:2.555 lr:0.0000100 epoch_Time:24775.0min: [2024-01-05 19:10:08,526][model8_pretrain.py][INFO] Epoch:[0/2](693400/4588595) loss:2.924 lr:0.0000100 epoch_Time:24775.0min: [2024-01-05 19:10:08,526][model8_pretrain.py][INFO] Epoch:[0/2](693400/4588595) loss:2.772 lr:0.0000100 epoch_Time:24775.0min: [2024-01-05 19:10:08,526][model8_pretrain.py][INFO] Epoch:[0/2](693400/4588595) loss:3.343 lr:0.0000100 epoch_Time:24775.0min: [2024-01-05 19:10:08,526][model8_pretrain.py][INFO] Epoch:[0/2](693400/4588595) loss:2.781 lr:0.0000100 epoch_Time:24775.0min: [2024-01-05 19:10:08,526][model8_pretrain.py][INFO] Epoch:[0/2](693400/4588595) loss:2.574 lr:0.0000100 epoch_Time:24775.0min: [2024-01-05 19:10:10,237][model8_pretrain.py][INFO] Epoch:[0/2](693400/4588595) loss:3.339 lr:0.0000100 epoch_Time:24775.0min: [2024-01-05 19:10:50,634][model8_pretrain.py][INFO] Epoch:[0/2](693500/4588595) loss:2.689 lr:0.0000100 epoch_Time:24775.0min: [2024-01-05 19:10:50,634][model8_pretrain.py][INFO] Epoch:[0/2](693500/4588595) loss:2.750 lr:0.0000100 epoch_Time:24775.0min: [2024-01-05 19:10:50,634][model8_pretrain.py][INFO] Epoch:[0/2](693500/4588595) loss:2.884 lr:0.0000100 epoch_Time:24775.0min: [2024-01-05 19:10:50,634][model8_pretrain.py][INFO] Epoch:[0/2](693500/4588595) loss:2.617 lr:0.0000100 epoch_Time:24775.0min: [2024-01-05 19:10:50,634][model8_pretrain.py][INFO] Epoch:[0/2](693500/4588595) loss:3.312 lr:0.0000100 epoch_Time:24775.0min: [2024-01-05 19:10:50,634][model8_pretrain.py][INFO] Epoch:[0/2](693500/4588595) loss:3.112 lr:0.0000100 epoch_Time:24775.0min: [2024-01-05 
19:10:50,634][model8_pretrain.py][INFO] Epoch:[0/2](693500/4588595) loss:3.309 lr:0.0000100 epoch_Time:24775.0min: [2024-01-05 19:10:50,634][model8_pretrain.py][INFO] Epoch:[0/2](693500/4588595) loss:2.538 lr:0.0000100 epoch_Time:24775.0min: [2024-01-05 19:11:27,557][model8_pretrain.py][INFO] Epoch:[0/2](693600/4588595) loss:2.346 lr:0.0000100 epoch_Time:24774.0min: [2024-01-05 19:11:27,557][model8_pretrain.py][INFO] Epoch:[0/2](693600/4588595) loss:2.393 lr:0.0000100 epoch_Time:24774.0min: [2024-01-05 19:11:27,557][model8_pretrain.py][INFO] Epoch:[0/2](693600/4588595) loss:2.807 lr:0.0000100 epoch_Time:24774.0min: [2024-01-05 19:11:27,557][model8_pretrain.py][INFO] Epoch:[0/2](693600/4588595) loss:2.997 lr:0.0000100 epoch_Time:24774.0min: [2024-01-05 19:11:27,557][model8_pretrain.py][INFO] Epoch:[0/2](693600/4588595) loss:3.107 lr:0.0000100 epoch_Time:24774.0min: [2024-01-05 19:11:27,557][model8_pretrain.py][INFO] Epoch:[0/2](693600/4588595) loss:2.337 lr:0.0000100 epoch_Time:24774.0min: [2024-01-05 19:11:27,557][model8_pretrain.py][INFO] Epoch:[0/2](693600/4588595) loss:2.339 lr:0.0000100 epoch_Time:24774.0min: [2024-01-05 19:11:27,557][model8_pretrain.py][INFO] Epoch:[0/2](693600/4588595) loss:3.001 lr:0.0000100 epoch_Time:24774.0min: [2024-01-05 19:12:04,484][model8_pretrain.py][INFO] Epoch:[0/2](693700/4588595) loss:3.173 lr:0.0000100 epoch_Time:24773.0min: [2024-01-05 19:12:04,484][model8_pretrain.py][INFO] Epoch:[0/2](693700/4588595) loss:2.740 lr:0.0000100 epoch_Time:24773.0min: [2024-01-05 19:12:04,484][model8_pretrain.py][INFO] Epoch:[0/2](693700/4588595) loss:2.617 lr:0.0000100 epoch_Time:24773.0min: [2024-01-05 19:12:04,484][model8_pretrain.py][INFO] Epoch:[0/2](693700/4588595) loss:3.192 lr:0.0000100 epoch_Time:24773.0min: [2024-01-05 19:12:04,484][model8_pretrain.py][INFO] Epoch:[0/2](693700/4588595) loss:2.969 lr:0.0000100 epoch_Time:24773.0min: [2024-01-05 19:12:04,484][model8_pretrain.py][INFO] Epoch:[0/2](693700/4588595) loss:2.960 lr:0.0000100 
epoch_Time:24773.0min: [2024-01-05 19:12:04,484][model8_pretrain.py][INFO] Epoch:[0/2](693700/4588595) loss:2.786 lr:0.0000100 epoch_Time:24773.0min: [2024-01-05 19:12:04,484][model8_pretrain.py][INFO] Epoch:[0/2](693700/4588595) loss:2.757 lr:0.0000100 epoch_Time:24773.0min: [2024-01-05 19:12:41,409][model8_pretrain.py][INFO] Epoch:[0/2](693800/4588595) loss:2.788 lr:0.0000100 epoch_Time:24773.0min: [2024-01-05 19:12:41,409][model8_pretrain.py][INFO] Epoch:[0/2](693800/4588595) loss:2.625 lr:0.0000100 epoch_Time:24773.0min: [2024-01-05 19:12:41,409][model8_pretrain.py][INFO] Epoch:[0/2](693800/4588595) loss:3.193 lr:0.0000100 epoch_Time:24773.0min: [2024-01-05 19:12:41,409][model8_pretrain.py][INFO] Epoch:[0/2](693800/4588595) loss:2.785 lr:0.0000100 epoch_Time:24773.0min: [2024-01-05 19:12:41,409][model8_pretrain.py][INFO] Epoch:[0/2](693800/4588595) loss:2.774 lr:0.0000100 epoch_Time:24773.0min: [2024-01-05 19:12:41,409][model8_pretrain.py][INFO] Epoch:[0/2](693800/4588595) loss:3.123 lr:0.0000100 epoch_Time:24773.0min: [2024-01-05 19:12:41,409][model8_pretrain.py][INFO] Epoch:[0/2](693800/4588595) loss:2.818 lr:0.0000100 epoch_Time:24773.0min: [2024-01-05 19:12:41,409][model8_pretrain.py][INFO] Epoch:[0/2](693800/4588595) loss:2.132 lr:0.0000100 epoch_Time:24773.0min: [2024-01-05 19:13:18,333][model8_pretrain.py][INFO] Epoch:[0/2](693900/4588595) loss:2.719 lr:0.0000100 epoch_Time:24772.0min: [2024-01-05 19:13:18,333][model8_pretrain.py][INFO] Epoch:[0/2](693900/4588595) loss:2.293 lr:0.0000100 epoch_Time:24772.0min: [2024-01-05 19:13:18,333][model8_pretrain.py][INFO] Epoch:[0/2](693900/4588595) loss:3.041 lr:0.0000100 epoch_Time:24772.0min: [2024-01-05 19:13:18,333][model8_pretrain.py][INFO] Epoch:[0/2](693900/4588595) loss:2.525 lr:0.0000100 epoch_Time:24772.0min: [2024-01-05 19:13:18,333][model8_pretrain.py][INFO] Epoch:[0/2](693900/4588595) loss:2.880 lr:0.0000100 epoch_Time:24772.0min: [2024-01-05 19:13:18,333][model8_pretrain.py][INFO] 
Epoch:[0/2](693900/4588595) loss:2.855 lr:0.0000100 epoch_Time:24772.0min: [2024-01-05 19:13:18,333][model8_pretrain.py][INFO] Epoch:[0/2](693900/4588595) loss:3.281 lr:0.0000100 epoch_Time:24772.0min: [2024-01-05 19:13:18,333][model8_pretrain.py][INFO] Epoch:[0/2](693900/4588595) loss:2.894 lr:0.0000100 epoch_Time:24772.0min: [2024-01-05 19:13:55,266][model8_pretrain.py][INFO] Epoch:[0/2](694000/4588595) loss:3.205 lr:0.0000100 epoch_Time:24771.0min: [2024-01-05 19:13:55,266][model8_pretrain.py][INFO] Epoch:[0/2](694000/4588595) loss:2.749 lr:0.0000100 epoch_Time:24771.0min: [2024-01-05 19:13:55,266][model8_pretrain.py][INFO] Epoch:[0/2](694000/4588595) loss:2.933 lr:0.0000100 epoch_Time:24771.0min: [2024-01-05 19:13:55,266][model8_pretrain.py][INFO] Epoch:[0/2](694000/4588595) loss:3.070 lr:0.0000100 epoch_Time:24771.0min: [2024-01-05 19:13:55,266][model8_pretrain.py][INFO] Epoch:[0/2](694000/4588595) loss:2.755 lr:0.0000100 epoch_Time:24771.0min: [2024-01-05 19:13:55,266][model8_pretrain.py][INFO] Epoch:[0/2](694000/4588595) loss:2.685 lr:0.0000100 epoch_Time:24771.0min: [2024-01-05 19:13:55,266][model8_pretrain.py][INFO] Epoch:[0/2](694000/4588595) loss:2.548 lr:0.0000100 epoch_Time:24771.0min: [2024-01-05 19:13:55,266][model8_pretrain.py][INFO] Epoch:[0/2](694000/4588595) loss:2.883 lr:0.0000100 epoch_Time:24771.0min: [2024-01-05 19:14:32,185][model8_pretrain.py][INFO] Epoch:[0/2](694100/4588595) loss:2.649 lr:0.0000100 epoch_Time:24771.0min: [2024-01-05 19:14:32,185][model8_pretrain.py][INFO] Epoch:[0/2](694100/4588595) loss:2.862 lr:0.0000100 epoch_Time:24771.0min: [2024-01-05 19:14:32,186][model8_pretrain.py][INFO] Epoch:[0/2](694100/4588595) loss:3.149 lr:0.0000100 epoch_Time:24771.0min: [2024-01-05 19:14:32,186][model8_pretrain.py][INFO] Epoch:[0/2](694100/4588595) loss:3.017 lr:0.0000100 epoch_Time:24771.0min: [2024-01-05 19:14:32,186][model8_pretrain.py][INFO] Epoch:[0/2](694100/4588595) loss:2.457 lr:0.0000100 epoch_Time:24771.0min: [2024-01-05 
19:14:32,186][model8_pretrain.py][INFO] Epoch:[0/2](694100/4588595) loss:2.722 lr:0.0000100 epoch_Time:24771.0min: [2024-01-05 19:14:32,186][model8_pretrain.py][INFO] Epoch:[0/2](694100/4588595) loss:2.741 lr:0.0000100 epoch_Time:24771.0min: [2024-01-05 19:14:32,186][model8_pretrain.py][INFO] Epoch:[0/2](694100/4588595) loss:2.699 lr:0.0000100 epoch_Time:24771.0min: [2024-01-05 19:15:15,791][model8_pretrain.py][INFO] Epoch:[0/2](694200/4588595) loss:2.922 lr:0.0000100 epoch_Time:24770.0min: [2024-01-05 19:15:15,791][model8_pretrain.py][INFO] Epoch:[0/2](694200/4588595) loss:2.282 lr:0.0000100 epoch_Time:24770.0min: [2024-01-05 19:15:15,791][model8_pretrain.py][INFO] Epoch:[0/2](694200/4588595) loss:2.448 lr:0.0000100 epoch_Time:24770.0min: [2024-01-05 19:15:15,795][model8_pretrain.py][INFO] Epoch:[0/2](694200/4588595) loss:2.514 lr:0.0000100 epoch_Time:24770.0min: [2024-01-05 19:15:15,795][model8_pretrain.py][INFO] Epoch:[0/2](694200/4588595) loss:2.789 lr:0.0000100 epoch_Time:24770.0min: [2024-01-05 19:15:15,795][model8_pretrain.py][INFO] Epoch:[0/2](694200/4588595) loss:3.214 lr:0.0000100 epoch_Time:24770.0min: [2024-01-05 19:15:15,796][model8_pretrain.py][INFO] Epoch:[0/2](694200/4588595) loss:2.540 lr:0.0000100 epoch_Time:24770.0min: [2024-01-05 19:15:15,796][model8_pretrain.py][INFO] Epoch:[0/2](694200/4588595) loss:3.023 lr:0.0000100 epoch_Time:24770.0min: [2024-01-05 19:15:58,013][model8_pretrain.py][INFO] Epoch:[0/2](694300/4588595) loss:3.132 lr:0.0000100 epoch_Time:24770.0min: [2024-01-05 19:15:58,013][model8_pretrain.py][INFO] Epoch:[0/2](694300/4588595) loss:3.119 lr:0.0000100 epoch_Time:24770.0min: [2024-01-05 19:15:58,013][model8_pretrain.py][INFO] Epoch:[0/2](694300/4588595) loss:2.769 lr:0.0000100 epoch_Time:24770.0min: [2024-01-05 19:15:58,013][model8_pretrain.py][INFO] Epoch:[0/2](694300/4588595) loss:3.008 lr:0.0000100 epoch_Time:24770.0min: [2024-01-05 19:15:58,013][model8_pretrain.py][INFO] Epoch:[0/2](694300/4588595) loss:2.485 lr:0.0000100 
epoch_Time:24770.0min: [2024-01-05 19:15:58,013][model8_pretrain.py][INFO] Epoch:[0/2](694300/4588595) loss:2.658 lr:0.0000100 epoch_Time:24770.0min: [2024-01-05 19:15:58,014][model8_pretrain.py][INFO] Epoch:[0/2](694300/4588595) loss:2.241 lr:0.0000100 epoch_Time:24770.0min: [2024-01-05 19:15:58,014][model8_pretrain.py][INFO] Epoch:[0/2](694300/4588595) loss:2.001 lr:0.0000100 epoch_Time:24770.0min: [2024-01-05 19:16:34,929][model8_pretrain.py][INFO] Epoch:[0/2](694400/4588595) loss:3.121 lr:0.0000100 epoch_Time:24770.0min: [2024-01-05 19:16:34,929][model8_pretrain.py][INFO] Epoch:[0/2](694400/4588595) loss:3.133 lr:0.0000100 epoch_Time:24770.0min: [2024-01-05 19:16:34,929][model8_pretrain.py][INFO] Epoch:[0/2](694400/4588595) loss:2.940 lr:0.0000100 epoch_Time:24770.0min: [2024-01-05 19:16:34,929][model8_pretrain.py][INFO] Epoch:[0/2](694400/4588595) loss:2.973 lr:0.0000100 epoch_Time:24770.0min: [2024-01-05 19:16:34,929][model8_pretrain.py][INFO] Epoch:[0/2](694400/4588595) loss:3.144 lr:0.0000100 epoch_Time:24770.0min: [2024-01-05 19:16:34,929][model8_pretrain.py][INFO] Epoch:[0/2](694400/4588595) loss:2.772 lr:0.0000100 epoch_Time:24770.0min: [2024-01-05 19:16:34,929][model8_pretrain.py][INFO] Epoch:[0/2](694400/4588595) loss:2.964 lr:0.0000100 epoch_Time:24770.0min: [2024-01-05 19:16:34,929][model8_pretrain.py][INFO] Epoch:[0/2](694400/4588595) loss:2.975 lr:0.0000100 epoch_Time:24770.0min: [2024-01-05 19:17:11,848][model8_pretrain.py][INFO] Epoch:[0/2](694500/4588595) loss:2.495 lr:0.0000100 epoch_Time:24768.0min: [2024-01-05 19:17:11,848][model8_pretrain.py][INFO] Epoch:[0/2](694500/4588595) loss:2.510 lr:0.0000100 epoch_Time:24768.0min: [2024-01-05 19:17:11,848][model8_pretrain.py][INFO] Epoch:[0/2](694500/4588595) loss:3.183 lr:0.0000100 epoch_Time:24768.0min: [2024-01-05 19:17:11,848][model8_pretrain.py][INFO] Epoch:[0/2](694500/4588595) loss:2.664 lr:0.0000100 epoch_Time:24768.0min: [2024-01-05 19:17:11,848][model8_pretrain.py][INFO] 
Epoch:[0/2](694500/4588595) loss:2.807 lr:0.0000100 epoch_Time:24768.0min: [2024-01-05 19:17:11,848][model8_pretrain.py][INFO] Epoch:[0/2](694500/4588595) loss:2.784 lr:0.0000100 epoch_Time:24768.0min: [2024-01-05 19:17:11,848][model8_pretrain.py][INFO] Epoch:[0/2](694500/4588595) loss:3.134 lr:0.0000100 epoch_Time:24768.0min: [2024-01-05 19:17:11,848][model8_pretrain.py][INFO] Epoch:[0/2](694500/4588595) loss:3.085 lr:0.0000100 epoch_Time:24768.0min: [2024-01-05 19:17:48,776][model8_pretrain.py][INFO] Epoch:[0/2](694600/4588595) loss:2.953 lr:0.0000100 epoch_Time:24767.0min: [2024-01-05 19:17:48,776][model8_pretrain.py][INFO] Epoch:[0/2](694600/4588595) loss:2.947 lr:0.0000100 epoch_Time:24767.0min: [2024-01-05 19:17:48,776][model8_pretrain.py][INFO] Epoch:[0/2](694600/4588595) loss:2.792 lr:0.0000100 epoch_Time:24767.0min: [2024-01-05 19:17:48,776][model8_pretrain.py][INFO] Epoch:[0/2](694600/4588595) loss:2.690 lr:0.0000100 epoch_Time:24767.0min: [2024-01-05 19:17:48,776][model8_pretrain.py][INFO] Epoch:[0/2](694600/4588595) loss:2.869 lr:0.0000100 epoch_Time:24767.0min: [2024-01-05 19:17:48,776][model8_pretrain.py][INFO] Epoch:[0/2](694600/4588595) loss:3.203 lr:0.0000100 epoch_Time:24767.0min: [2024-01-05 19:17:48,776][model8_pretrain.py][INFO] Epoch:[0/2](694600/4588595) loss:2.520 lr:0.0000100 epoch_Time:24767.0min: [2024-01-05 19:17:48,776][model8_pretrain.py][INFO] Epoch:[0/2](694600/4588595) loss:2.362 lr:0.0000100 epoch_Time:24767.0min: [2024-01-05 19:18:25,709][model8_pretrain.py][INFO] Epoch:[0/2](694700/4588595) loss:2.695 lr:0.0000100 epoch_Time:24767.0min: [2024-01-05 19:18:25,709][model8_pretrain.py][INFO] Epoch:[0/2](694700/4588595) loss:2.402 lr:0.0000100 epoch_Time:24767.0min: [2024-01-05 19:18:25,709][model8_pretrain.py][INFO] Epoch:[0/2](694700/4588595) loss:2.818 lr:0.0000100 epoch_Time:24767.0min: [2024-01-05 19:18:25,709][model8_pretrain.py][INFO] Epoch:[0/2](694700/4588595) loss:2.849 lr:0.0000100 epoch_Time:24767.0min: [2024-01-05 
19:18:25,709][model8_pretrain.py][INFO] Epoch:[0/2](694700/4588595) loss:2.972 lr:0.0000100 epoch_Time:24767.0min: [2024-01-05 19:18:25,709][model8_pretrain.py][INFO] Epoch:[0/2](694700/4588595) loss:2.846 lr:0.0000100 epoch_Time:24767.0min: [2024-01-05 19:18:25,709][model8_pretrain.py][INFO] Epoch:[0/2](694700/4588595) loss:2.767 lr:0.0000100 epoch_Time:24767.0min: [2024-01-05 19:18:25,709][model8_pretrain.py][INFO] Epoch:[0/2](694700/4588595) loss:2.732 lr:0.0000100 epoch_Time:24767.0min: [2024-01-05 19:19:02,638][model8_pretrain.py][INFO] Epoch:[0/2](694800/4588595) loss:2.750 lr:0.0000100 epoch_Time:24766.0min: [2024-01-05 19:19:02,638][model8_pretrain.py][INFO] Epoch:[0/2](694800/4588595) loss:3.312 lr:0.0000100 epoch_Time:24766.0min: [2024-01-05 19:19:02,638][model8_pretrain.py][INFO] Epoch:[0/2](694800/4588595) loss:2.941 lr:0.0000100 epoch_Time:24766.0min: [2024-01-05 19:19:02,638][model8_pretrain.py][INFO] Epoch:[0/2](694800/4588595) loss:2.360 lr:0.0000100 epoch_Time:24766.0min: [2024-01-05 19:19:02,638][model8_pretrain.py][INFO] Epoch:[0/2](694800/4588595) loss:3.257 lr:0.0000100 epoch_Time:24766.0min: [2024-01-05 19:19:02,638][model8_pretrain.py][INFO] Epoch:[0/2](694800/4588595) loss:2.561 lr:0.0000100 epoch_Time:24766.0min: [2024-01-05 19:19:02,638][model8_pretrain.py][INFO] Epoch:[0/2](694800/4588595) loss:2.538 lr:0.0000100 epoch_Time:24766.0min: [2024-01-05 19:19:02,638][model8_pretrain.py][INFO] Epoch:[0/2](694800/4588595) loss:3.084 lr:0.0000100 epoch_Time:24766.0min: [2024-01-05 19:19:39,587][model8_pretrain.py][INFO] Epoch:[0/2](694900/4588595) loss:3.016 lr:0.0000100 epoch_Time:24766.0min: [2024-01-05 19:19:39,587][model8_pretrain.py][INFO] Epoch:[0/2](694900/4588595) loss:2.903 lr:0.0000100 epoch_Time:24766.0min: [2024-01-05 19:19:39,587][model8_pretrain.py][INFO] Epoch:[0/2](694900/4588595) loss:3.056 lr:0.0000100 epoch_Time:24766.0min: [2024-01-05 19:19:39,587][model8_pretrain.py][INFO] Epoch:[0/2](694900/4588595) loss:2.623 lr:0.0000100 
epoch_Time:24766.0min: [2024-01-05 19:19:39,587][model8_pretrain.py][INFO] Epoch:[0/2](694900/4588595) loss:2.694 lr:0.0000100 epoch_Time:24766.0min: [2024-01-05 19:19:39,587][model8_pretrain.py][INFO] Epoch:[0/2](694900/4588595) loss:2.677 lr:0.0000100 epoch_Time:24766.0min: [2024-01-05 19:19:39,587][model8_pretrain.py][INFO] Epoch:[0/2](694900/4588595) loss:2.980 lr:0.0000100 epoch_Time:24766.0min: [2024-01-05 19:19:39,587][model8_pretrain.py][INFO] Epoch:[0/2](694900/4588595) loss:3.467 lr:0.0000100 epoch_Time:24766.0min: [2024-01-05 19:20:20,001][model8_pretrain.py][INFO] Epoch:[0/2](695000/4588595) loss:2.570 lr:0.0000100 epoch_Time:24765.0min: [2024-01-05 19:20:20,001][model8_pretrain.py][INFO] Epoch:[0/2](695000/4588595) loss:2.038 lr:0.0000100 epoch_Time:24765.0min: [2024-01-05 19:20:20,001][model8_pretrain.py][INFO] Epoch:[0/2](695000/4588595) loss:2.349 lr:0.0000100 epoch_Time:24765.0min: [2024-01-05 19:20:20,001][model8_pretrain.py][INFO] Epoch:[0/2](695000/4588595) loss:2.490 lr:0.0000100 epoch_Time:24765.0min: [2024-01-05 19:20:20,001][model8_pretrain.py][INFO] Epoch:[0/2](695000/4588595) loss:2.370 lr:0.0000100 epoch_Time:24765.0min: [2024-01-05 19:20:20,001][model8_pretrain.py][INFO] Epoch:[0/2](695000/4588595) loss:3.045 lr:0.0000100 epoch_Time:24765.0min: [2024-01-05 19:20:20,001][model8_pretrain.py][INFO] Epoch:[0/2](695000/4588595) loss:2.661 lr:0.0000100 epoch_Time:24765.0min: [2024-01-05 19:20:20,001][model8_pretrain.py][INFO] Epoch:[0/2](695000/4588595) loss:2.650 lr:0.0000100 epoch_Time:24765.0min: [2024-01-05 19:21:05,391][model8_pretrain.py][INFO] Epoch:[0/2](695100/4588595) loss:2.320 lr:0.0000100 epoch_Time:24765.0min: [2024-01-05 19:21:05,391][model8_pretrain.py][INFO] Epoch:[0/2](695100/4588595) loss:2.726 lr:0.0000100 epoch_Time:24765.0min: [2024-01-05 19:21:05,391][model8_pretrain.py][INFO] Epoch:[0/2](695100/4588595) loss:2.628 lr:0.0000100 epoch_Time:24765.0min: [2024-01-05 19:21:05,391][model8_pretrain.py][INFO] 
Epoch:[0/2](695100/4588595) loss:2.876 lr:0.0000100 epoch_Time:24765.0min: [2024-01-05 19:21:05,391][model8_pretrain.py][INFO] Epoch:[0/2](695100/4588595) loss:2.860 lr:0.0000100 epoch_Time:24765.0min: [2024-01-05 19:21:05,391][model8_pretrain.py][INFO] Epoch:[0/2](695100/4588595) loss:2.983 lr:0.0000100 epoch_Time:24765.0min: [2024-01-05 19:21:05,391][model8_pretrain.py][INFO] Epoch:[0/2](695100/4588595) loss:2.712 lr:0.0000100 epoch_Time:24765.0min: [2024-01-05 19:21:05,391][model8_pretrain.py][INFO] Epoch:[0/2](695100/4588595) loss:2.879 lr:0.0000100 epoch_Time:24765.0min: [2024-01-05 19:21:42,319][model8_pretrain.py][INFO] Epoch:[0/2](695200/4588595) loss:2.513 lr:0.0000100 epoch_Time:24765.0min: [2024-01-05 19:21:42,319][model8_pretrain.py][INFO] Epoch:[0/2](695200/4588595) loss:3.265 lr:0.0000100 epoch_Time:24765.0min: [2024-01-05 19:21:42,319][model8_pretrain.py][INFO] Epoch:[0/2](695200/4588595) loss:2.316 lr:0.0000100 epoch_Time:24765.0min: [2024-01-05 19:21:42,319][model8_pretrain.py][INFO] Epoch:[0/2](695200/4588595) loss:3.146 lr:0.0000100 epoch_Time:24765.0min: [2024-01-05 19:21:42,319][model8_pretrain.py][INFO] Epoch:[0/2](695200/4588595) loss:2.580 lr:0.0000100 epoch_Time:24765.0min: [2024-01-05 19:21:42,319][model8_pretrain.py][INFO] Epoch:[0/2](695200/4588595) loss:2.683 lr:0.0000100 epoch_Time:24765.0min: [2024-01-05 19:21:42,319][model8_pretrain.py][INFO] Epoch:[0/2](695200/4588595) loss:2.777 lr:0.0000100 epoch_Time:24765.0min: [2024-01-05 19:21:42,319][model8_pretrain.py][INFO] Epoch:[0/2](695200/4588595) loss:3.139 lr:0.0000100 epoch_Time:24765.0min: [2024-01-05 19:22:19,247][model8_pretrain.py][INFO] Epoch:[0/2](695300/4588595) loss:2.284 lr:0.0000100 epoch_Time:24764.0min: [2024-01-05 19:22:19,247][model8_pretrain.py][INFO] Epoch:[0/2](695300/4588595) loss:3.248 lr:0.0000100 epoch_Time:24764.0min: [2024-01-05 19:22:19,247][model8_pretrain.py][INFO] Epoch:[0/2](695300/4588595) loss:2.808 lr:0.0000100 epoch_Time:24764.0min: [2024-01-05 
19:22:19,247][model8_pretrain.py][INFO] Epoch:[0/2](695300/4588595) loss:2.867 lr:0.0000100 epoch_Time:24764.0min: [2024-01-05 19:22:19,247][model8_pretrain.py][INFO] Epoch:[0/2](695300/4588595) loss:2.660 lr:0.0000100 epoch_Time:24764.0min: [2024-01-05 19:22:19,247][model8_pretrain.py][INFO] Epoch:[0/2](695300/4588595) loss:2.850 lr:0.0000100 epoch_Time:24764.0min: [2024-01-05 19:22:19,248][model8_pretrain.py][INFO] Epoch:[0/2](695300/4588595) loss:3.031 lr:0.0000100 epoch_Time:24764.0min: [2024-01-05 19:22:19,248][model8_pretrain.py][INFO] Epoch:[0/2](695300/4588595) loss:3.204 lr:0.0000100 epoch_Time:24764.0min: [2024-01-05 19:22:56,183][model8_pretrain.py][INFO] Epoch:[0/2](695400/4588595) loss:3.252 lr:0.0000100 epoch_Time:24763.0min: [2024-01-05 19:22:56,183][model8_pretrain.py][INFO] Epoch:[0/2](695400/4588595) loss:2.809 lr:0.0000100 epoch_Time:24763.0min: [2024-01-05 19:22:56,183][model8_pretrain.py][INFO] Epoch:[0/2](695400/4588595) loss:2.097 lr:0.0000100 epoch_Time:24763.0min: [2024-01-05 19:22:56,183][model8_pretrain.py][INFO] Epoch:[0/2](695400/4588595) loss:3.071 lr:0.0000100 epoch_Time:24763.0min: [2024-01-05 19:22:56,183][model8_pretrain.py][INFO] Epoch:[0/2](695400/4588595) loss:2.846 lr:0.0000100 epoch_Time:24763.0min: [2024-01-05 19:22:56,183][model8_pretrain.py][INFO] Epoch:[0/2](695400/4588595) loss:2.871 lr:0.0000100 epoch_Time:24763.0min: [2024-01-05 19:22:56,183][model8_pretrain.py][INFO] Epoch:[0/2](695400/4588595) loss:2.484 lr:0.0000100 epoch_Time:24763.0min: [2024-01-05 19:22:56,184][model8_pretrain.py][INFO] Epoch:[0/2](695400/4588595) loss:2.203 lr:0.0000100 epoch_Time:24763.0min: [2024-01-05 19:23:33,110][model8_pretrain.py][INFO] Epoch:[0/2](695500/4588595) loss:2.642 lr:0.0000100 epoch_Time:24762.0min: [2024-01-05 19:23:33,110][model8_pretrain.py][INFO] Epoch:[0/2](695500/4588595) loss:2.711 lr:0.0000100 epoch_Time:24762.0min: [2024-01-05 19:23:33,110][model8_pretrain.py][INFO] Epoch:[0/2](695500/4588595) loss:2.627 lr:0.0000100 
epoch_Time:24762.0min: [2024-01-05 19:23:33,110][model8_pretrain.py][INFO] Epoch:[0/2](695500/4588595) loss:3.211 lr:0.0000100 epoch_Time:24762.0min: [2024-01-05 19:23:33,110][model8_pretrain.py][INFO] Epoch:[0/2](695500/4588595) loss:2.473 lr:0.0000100 epoch_Time:24762.0min: [2024-01-05 19:23:33,110][model8_pretrain.py][INFO] Epoch:[0/2](695500/4588595) loss:2.893 lr:0.0000100 epoch_Time:24762.0min: [2024-01-05 19:23:33,110][model8_pretrain.py][INFO] Epoch:[0/2](695500/4588595) loss:3.101 lr:0.0000100 epoch_Time:24762.0min: [2024-01-05 19:23:33,110][model8_pretrain.py][INFO] Epoch:[0/2](695500/4588595) loss:2.356 lr:0.0000100 epoch_Time:24762.0min: [2024-01-05 19:24:10,041][model8_pretrain.py][INFO] Epoch:[0/2](695600/4588595) loss:3.070 lr:0.0000100 epoch_Time:24761.0min: [2024-01-05 19:24:10,041][model8_pretrain.py][INFO] Epoch:[0/2](695600/4588595) loss:3.079 lr:0.0000100 epoch_Time:24761.0min: [2024-01-05 19:24:10,041][model8_pretrain.py][INFO] Epoch:[0/2](695600/4588595) loss:2.885 lr:0.0000100 epoch_Time:24761.0min: [2024-01-05 19:24:10,041][model8_pretrain.py][INFO] Epoch:[0/2](695600/4588595) loss:2.821 lr:0.0000100 epoch_Time:24761.0min: [2024-01-05 19:24:10,041][model8_pretrain.py][INFO] Epoch:[0/2](695600/4588595) loss:2.630 lr:0.0000100 epoch_Time:24761.0min: [2024-01-05 19:24:10,041][model8_pretrain.py][INFO] Epoch:[0/2](695600/4588595) loss:2.249 lr:0.0000100 epoch_Time:24761.0min: [2024-01-05 19:24:10,041][model8_pretrain.py][INFO] Epoch:[0/2](695600/4588595) loss:2.898 lr:0.0000100 epoch_Time:24761.0min: [2024-01-05 19:24:10,041][model8_pretrain.py][INFO] Epoch:[0/2](695600/4588595) loss:3.433 lr:0.0000100 epoch_Time:24761.0min: [2024-01-05 19:24:46,984][model8_pretrain.py][INFO] Epoch:[0/2](695700/4588595) loss:3.554 lr:0.0000100 epoch_Time:24761.0min: [2024-01-05 19:24:46,984][model8_pretrain.py][INFO] Epoch:[0/2](695700/4588595) loss:3.416 lr:0.0000100 epoch_Time:24761.0min: [2024-01-05 19:24:46,984][model8_pretrain.py][INFO] 
Epoch:[0/2](695700/4588595) loss:2.995 lr:0.0000100 epoch_Time:24761.0min: [2024-01-05 19:24:46,984][model8_pretrain.py][INFO] Epoch:[0/2](695700/4588595) loss:2.876 lr:0.0000100 epoch_Time:24761.0min: [2024-01-05 19:24:46,984][model8_pretrain.py][INFO] Epoch:[0/2](695700/4588595) loss:3.273 lr:0.0000100 epoch_Time:24761.0min: [2024-01-05 19:24:46,984][model8_pretrain.py][INFO] Epoch:[0/2](695700/4588595) loss:2.568 lr:0.0000100 epoch_Time:24761.0min: [2024-01-05 19:24:46,984][model8_pretrain.py][INFO] Epoch:[0/2](695700/4588595) loss:2.743 lr:0.0000100 epoch_Time:24761.0min: [2024-01-05 19:24:46,984][model8_pretrain.py][INFO] Epoch:[0/2](695700/4588595) loss:2.796 lr:0.0000100 epoch_Time:24761.0min: [2024-01-05 19:25:27,395][model8_pretrain.py][INFO] Epoch:[0/2](695800/4588595) loss:2.884 lr:0.0000100 epoch_Time:24760.0min: [2024-01-05 19:25:27,395][model8_pretrain.py][INFO] Epoch:[0/2](695800/4588595) loss:2.789 lr:0.0000100 epoch_Time:24760.0min: [2024-01-05 19:25:27,395][model8_pretrain.py][INFO] Epoch:[0/2](695800/4588595) loss:2.774 lr:0.0000100 epoch_Time:24760.0min: [2024-01-05 19:25:27,395][model8_pretrain.py][INFO] Epoch:[0/2](695800/4588595) loss:3.249 lr:0.0000100 epoch_Time:24760.0min: [2024-01-05 19:25:27,395][model8_pretrain.py][INFO] Epoch:[0/2](695800/4588595) loss:2.468 lr:0.0000100 epoch_Time:24760.0min: [2024-01-05 19:25:27,395][model8_pretrain.py][INFO] Epoch:[0/2](695800/4588595) loss:2.967 lr:0.0000100 epoch_Time:24760.0min: [2024-01-05 19:25:27,395][model8_pretrain.py][INFO] Epoch:[0/2](695800/4588595) loss:3.180 lr:0.0000100 epoch_Time:24760.0min: [2024-01-05 19:25:27,396][model8_pretrain.py][INFO] Epoch:[0/2](695800/4588595) loss:2.600 lr:0.0000100 epoch_Time:24760.0min: [2024-01-05 19:26:12,869][model8_pretrain.py][INFO] Epoch:[0/2](695900/4588595) loss:3.109 lr:0.0000100 epoch_Time:24760.0min: [2024-01-05 19:26:12,869][model8_pretrain.py][INFO] Epoch:[0/2](695900/4588595) loss:2.917 lr:0.0000100 epoch_Time:24760.0min: [2024-01-05 
19:26:12,869][model8_pretrain.py][INFO] Epoch:[0/2](695900/4588595) loss:2.890 lr:0.0000100 epoch_Time:24760.0min: [2024-01-05 19:26:12,869][model8_pretrain.py][INFO] Epoch:[0/2](695900/4588595) loss:3.063 lr:0.0000100 epoch_Time:24760.0min: [2024-01-05 19:26:12,869][model8_pretrain.py][INFO] Epoch:[0/2](695900/4588595) loss:3.165 lr:0.0000100 epoch_Time:24760.0min: [2024-01-05 19:26:12,869][model8_pretrain.py][INFO] Epoch:[0/2](695900/4588595) loss:2.620 lr:0.0000100 epoch_Time:24760.0min: [2024-01-05 19:26:12,869][model8_pretrain.py][INFO] Epoch:[0/2](695900/4588595) loss:3.086 lr:0.0000100 epoch_Time:24760.0min: [2024-01-05 19:26:12,869][model8_pretrain.py][INFO] Epoch:[0/2](695900/4588595) loss:2.853 lr:0.0000100 epoch_Time:24760.0min: [2024-01-05 19:26:49,807][model8_pretrain.py][INFO] Epoch:[0/2](696000/4588595) loss:2.873 lr:0.0000100 epoch_Time:24759.0min: [2024-01-05 19:26:49,807][model8_pretrain.py][INFO] Epoch:[0/2](696000/4588595) loss:2.867 lr:0.0000100 epoch_Time:24759.0min: [2024-01-05 19:26:49,807][model8_pretrain.py][INFO] Epoch:[0/2](696000/4588595) loss:2.826 lr:0.0000100 epoch_Time:24759.0min: [2024-01-05 19:26:49,807][model8_pretrain.py][INFO] Epoch:[0/2](696000/4588595) loss:2.898 lr:0.0000100 epoch_Time:24759.0min: [2024-01-05 19:26:49,807][model8_pretrain.py][INFO] Epoch:[0/2](696000/4588595) loss:2.683 lr:0.0000100 epoch_Time:24759.0min: [2024-01-05 19:26:49,807][model8_pretrain.py][INFO] Epoch:[0/2](696000/4588595) loss:2.808 lr:0.0000100 epoch_Time:24759.0min: [2024-01-05 19:26:49,807][model8_pretrain.py][INFO] Epoch:[0/2](696000/4588595) loss:1.988 lr:0.0000100 epoch_Time:24759.0min: [2024-01-05 19:26:49,807][model8_pretrain.py][INFO] Epoch:[0/2](696000/4588595) loss:2.446 lr:0.0000100 epoch_Time:24759.0min: [2024-01-05 19:27:26,739][model8_pretrain.py][INFO] Epoch:[0/2](696100/4588595) loss:3.313 lr:0.0000100 epoch_Time:24759.0min: [2024-01-05 19:27:26,739][model8_pretrain.py][INFO] Epoch:[0/2](696100/4588595) loss:3.331 lr:0.0000100 
epoch_Time:24759.0min: [2024-01-05 19:27:26,739][model8_pretrain.py][INFO] Epoch:[0/2](696100/4588595) loss:3.336 lr:0.0000100 epoch_Time:24759.0min: [2024-01-05 19:27:26,739][model8_pretrain.py][INFO] Epoch:[0/2](696100/4588595) loss:2.828 lr:0.0000100 epoch_Time:24759.0min: [2024-01-05 19:27:26,739][model8_pretrain.py][INFO] Epoch:[0/2](696100/4588595) loss:3.281 lr:0.0000100 epoch_Time:24759.0min: [2024-01-05 19:27:26,739][model8_pretrain.py][INFO] Epoch:[0/2](696100/4588595) loss:2.522 lr:0.0000100 epoch_Time:24759.0min: [2024-01-05 19:27:26,739][model8_pretrain.py][INFO] Epoch:[0/2](696100/4588595) loss:2.968 lr:0.0000100 epoch_Time:24759.0min: [2024-01-05 19:27:26,739][model8_pretrain.py][INFO] Epoch:[0/2](696100/4588595) loss:3.040 lr:0.0000100 epoch_Time:24759.0min: [2024-01-05 19:28:03,678][model8_pretrain.py][INFO] Epoch:[0/2](696200/4588595) loss:2.529 lr:0.0000100 epoch_Time:24758.0min: [2024-01-05 19:28:03,678][model8_pretrain.py][INFO] Epoch:[0/2](696200/4588595) loss:2.689 lr:0.0000100 epoch_Time:24758.0min: [2024-01-05 19:28:03,678][model8_pretrain.py][INFO] Epoch:[0/2](696200/4588595) loss:2.381 lr:0.0000100 epoch_Time:24758.0min: [2024-01-05 19:28:03,678][model8_pretrain.py][INFO] Epoch:[0/2](696200/4588595) loss:2.707 lr:0.0000100 epoch_Time:24758.0min: [2024-01-05 19:28:03,678][model8_pretrain.py][INFO] Epoch:[0/2](696200/4588595) loss:2.969 lr:0.0000100 epoch_Time:24758.0min: [2024-01-05 19:28:03,678][model8_pretrain.py][INFO] Epoch:[0/2](696200/4588595) loss:2.452 lr:0.0000100 epoch_Time:24758.0min: [2024-01-05 19:28:03,678][model8_pretrain.py][INFO] Epoch:[0/2](696200/4588595) loss:2.673 lr:0.0000100 epoch_Time:24758.0min: [2024-01-05 19:28:03,678][model8_pretrain.py][INFO] Epoch:[0/2](696200/4588595) loss:2.208 lr:0.0000100 epoch_Time:24758.0min: [2024-01-05 19:28:40,610][model8_pretrain.py][INFO] Epoch:[0/2](696300/4588595) loss:3.069 lr:0.0000100 epoch_Time:24758.0min: [2024-01-05 19:28:40,610][model8_pretrain.py][INFO] 
Epoch:[0/2](696300/4588595) loss:3.115 lr:0.0000100 epoch_Time:24758.0min: [2024-01-05 19:28:40,610][model8_pretrain.py][INFO] Epoch:[0/2](696300/4588595) loss:2.756 lr:0.0000100 epoch_Time:24758.0min: [2024-01-05 19:28:40,610][model8_pretrain.py][INFO] Epoch:[0/2](696300/4588595) loss:2.810 lr:0.0000100 epoch_Time:24758.0min: [2024-01-05 19:28:40,610][model8_pretrain.py][INFO] Epoch:[0/2](696300/4588595) loss:2.244 lr:0.0000100 epoch_Time:24758.0min: [2024-01-05 19:28:40,610][model8_pretrain.py][INFO] Epoch:[0/2](696300/4588595) loss:2.330 lr:0.0000100 epoch_Time:24758.0min: [2024-01-05 19:28:40,610][model8_pretrain.py][INFO] Epoch:[0/2](696300/4588595) loss:2.952 lr:0.0000100 epoch_Time:24758.0min: [2024-01-05 19:28:40,610][model8_pretrain.py][INFO] Epoch:[0/2](696300/4588595) loss:2.922 lr:0.0000100 epoch_Time:24758.0min: [2024-01-05 19:29:17,545][model8_pretrain.py][INFO] Epoch:[0/2](696400/4588595) loss:2.648 lr:0.0000100 epoch_Time:24757.0min: [2024-01-05 19:29:17,545][model8_pretrain.py][INFO] Epoch:[0/2](696400/4588595) loss:2.896 lr:0.0000100 epoch_Time:24757.0min: [2024-01-05 19:29:17,545][model8_pretrain.py][INFO] Epoch:[0/2](696400/4588595) loss:1.970 lr:0.0000100 epoch_Time:24757.0min: [2024-01-05 19:29:17,545][model8_pretrain.py][INFO] Epoch:[0/2](696400/4588595) loss:2.619 lr:0.0000100 epoch_Time:24757.0min: [2024-01-05 19:29:17,545][model8_pretrain.py][INFO] Epoch:[0/2](696400/4588595) loss:2.315 lr:0.0000100 epoch_Time:24757.0min: [2024-01-05 19:29:17,545][model8_pretrain.py][INFO] Epoch:[0/2](696400/4588595) loss:3.187 lr:0.0000100 epoch_Time:24757.0min: [2024-01-05 19:29:17,545][model8_pretrain.py][INFO] Epoch:[0/2](696400/4588595) loss:2.868 lr:0.0000100 epoch_Time:24757.0min: [2024-01-05 19:29:17,546][model8_pretrain.py][INFO] Epoch:[0/2](696400/4588595) loss:2.726 lr:0.0000100 epoch_Time:24757.0min: [2024-01-05 19:29:54,490][model8_pretrain.py][INFO] Epoch:[0/2](696500/4588595) loss:3.010 lr:0.0000100 epoch_Time:24755.0min: [2024-01-05 
19:29:54,490][model8_pretrain.py][INFO] Epoch:[0/2](696500/4588595) loss:2.434 lr:0.0000100 epoch_Time:24755.0min: [2024-01-05 19:29:54,490][model8_pretrain.py][INFO] Epoch:[0/2](696500/4588595) loss:3.067 lr:0.0000100 epoch_Time:24755.0min: [2024-01-05 19:29:54,490][model8_pretrain.py][INFO] Epoch:[0/2](696500/4588595) loss:3.015 lr:0.0000100 epoch_Time:24755.0min: [2024-01-05 19:29:54,490][model8_pretrain.py][INFO] Epoch:[0/2](696500/4588595) loss:2.998 lr:0.0000100 epoch_Time:24755.0min: [2024-01-05 19:29:54,490][model8_pretrain.py][INFO] Epoch:[0/2](696500/4588595) loss:3.014 lr:0.0000100 epoch_Time:24755.0min: [2024-01-05 19:29:54,490][model8_pretrain.py][INFO] Epoch:[0/2](696500/4588595) loss:2.571 lr:0.0000100 epoch_Time:24755.0min: [2024-01-05 19:29:54,490][model8_pretrain.py][INFO] Epoch:[0/2](696500/4588595) loss:2.841 lr:0.0000100 epoch_Time:24755.0min: [2024-01-05 19:30:34,939][model8_pretrain.py][INFO] Epoch:[0/2](696600/4588595) loss:2.745 lr:0.0000100 epoch_Time:24756.0min: [2024-01-05 19:30:34,939][model8_pretrain.py][INFO] Epoch:[0/2](696600/4588595) loss:2.767 lr:0.0000100 epoch_Time:24756.0min: [2024-01-05 19:30:34,939][model8_pretrain.py][INFO] Epoch:[0/2](696600/4588595) loss:3.102 lr:0.0000100 epoch_Time:24756.0min: [2024-01-05 19:30:34,939][model8_pretrain.py][INFO] Epoch:[0/2](696600/4588595) loss:2.747 lr:0.0000100 epoch_Time:24756.0min: [2024-01-05 19:30:34,939][model8_pretrain.py][INFO] Epoch:[0/2](696600/4588595) loss:2.602 lr:0.0000100 epoch_Time:24756.0min: [2024-01-05 19:30:34,943][model8_pretrain.py][INFO] Epoch:[0/2](696600/4588595) loss:3.517 lr:0.0000100 epoch_Time:24756.0min: [2024-01-05 19:30:34,944][model8_pretrain.py][INFO] Epoch:[0/2](696600/4588595) loss:2.726 lr:0.0000100 epoch_Time:24756.0min: [2024-01-05 19:30:34,944][model8_pretrain.py][INFO] Epoch:[0/2](696600/4588595) loss:2.122 lr:0.0000100 epoch_Time:24756.0min: [2024-01-05 19:31:20,432][model8_pretrain.py][INFO] Epoch:[0/2](696700/4588595) loss:2.716 lr:0.0000100 
epoch_Time:24755.0min: [2024-01-05 19:31:20,432][model8_pretrain.py][INFO] Epoch:[0/2](696700/4588595) loss:2.862 lr:0.0000100 epoch_Time:24755.0min: [2024-01-05 19:31:20,432][model8_pretrain.py][INFO] Epoch:[0/2](696700/4588595) loss:2.652 lr:0.0000100 epoch_Time:24755.0min: [2024-01-05 19:31:20,432][model8_pretrain.py][INFO] Epoch:[0/2](696700/4588595) loss:2.831 lr:0.0000100 epoch_Time:24755.0min: [2024-01-05 19:31:20,432][model8_pretrain.py][INFO] Epoch:[0/2](696700/4588595) loss:2.476 lr:0.0000100 epoch_Time:24755.0min: [2024-01-05 19:31:20,432][model8_pretrain.py][INFO] Epoch:[0/2](696700/4588595) loss:2.805 lr:0.0000100 epoch_Time:24755.0min: [2024-01-05 19:31:20,432][model8_pretrain.py][INFO] Epoch:[0/2](696700/4588595) loss:2.257 lr:0.0000100 epoch_Time:24755.0min: [2024-01-05 19:31:20,432][model8_pretrain.py][INFO] Epoch:[0/2](696700/4588595) loss:2.748 lr:0.0000100 epoch_Time:24755.0min: [2024-01-05 19:31:57,359][model8_pretrain.py][INFO] Epoch:[0/2](696800/4588595) loss:2.582 lr:0.0000100 epoch_Time:24754.0min: [2024-01-05 19:31:57,360][model8_pretrain.py][INFO] Epoch:[0/2](696800/4588595) loss:2.221 lr:0.0000100 epoch_Time:24754.0min: [2024-01-05 19:31:57,360][model8_pretrain.py][INFO] Epoch:[0/2](696800/4588595) loss:2.837 lr:0.0000100 epoch_Time:24754.0min: [2024-01-05 19:31:57,359][model8_pretrain.py][INFO] Epoch:[0/2](696800/4588595) loss:2.667 lr:0.0000100 epoch_Time:24754.0min: [2024-01-05 19:31:57,360][model8_pretrain.py][INFO] Epoch:[0/2](696800/4588595) loss:3.274 lr:0.0000100 epoch_Time:24754.0min: [2024-01-05 19:31:57,360][model8_pretrain.py][INFO] Epoch:[0/2](696800/4588595) loss:2.888 lr:0.0000100 epoch_Time:24754.0min: [2024-01-05 19:31:57,360][model8_pretrain.py][INFO] Epoch:[0/2](696800/4588595) loss:2.661 lr:0.0000100 epoch_Time:24754.0min: [2024-01-05 19:31:57,360][model8_pretrain.py][INFO] Epoch:[0/2](696800/4588595) loss:2.907 lr:0.0000100 epoch_Time:24754.0min: [2024-01-05 19:32:34,300][model8_pretrain.py][INFO] 
Epoch:[0/2](696900/4588595) loss:3.262 lr:0.0000100 epoch_Time:24754.0min: [2024-01-05 19:32:34,300][model8_pretrain.py][INFO] Epoch:[0/2](696900/4588595) loss:2.666 lr:0.0000100 epoch_Time:24754.0min: [2024-01-05 19:32:34,300][model8_pretrain.py][INFO] Epoch:[0/2](696900/4588595) loss:2.738 lr:0.0000100 epoch_Time:24754.0min: [2024-01-05 19:32:34,300][model8_pretrain.py][INFO] Epoch:[0/2](696900/4588595) loss:2.876 lr:0.0000100 epoch_Time:24754.0min: [2024-01-05 19:32:34,300][model8_pretrain.py][INFO] Epoch:[0/2](696900/4588595) loss:3.033 lr:0.0000100 epoch_Time:24754.0min: [2024-01-05 19:32:34,300][model8_pretrain.py][INFO] Epoch:[0/2](696900/4588595) loss:3.030 lr:0.0000100 epoch_Time:24754.0min: [2024-01-05 19:32:34,300][model8_pretrain.py][INFO] Epoch:[0/2](696900/4588595) loss:2.787 lr:0.0000100 epoch_Time:24754.0min: [2024-01-05 19:32:34,300][model8_pretrain.py][INFO] Epoch:[0/2](696900/4588595) loss:3.180 lr:0.0000100 epoch_Time:24754.0min: [2024-01-05 19:33:11,233][model8_pretrain.py][INFO] Epoch:[0/2](697000/4588595) loss:3.350 lr:0.0000100 epoch_Time:24753.0min: [2024-01-05 19:33:11,233][model8_pretrain.py][INFO] Epoch:[0/2](697000/4588595) loss:1.403 lr:0.0000100 epoch_Time:24753.0min: [2024-01-05 19:33:11,233][model8_pretrain.py][INFO] Epoch:[0/2](697000/4588595) loss:3.081 lr:0.0000100 epoch_Time:24753.0min: [2024-01-05 19:33:11,233][model8_pretrain.py][INFO] Epoch:[0/2](697000/4588595) loss:3.119 lr:0.0000100 epoch_Time:24753.0min: [2024-01-05 19:33:11,233][model8_pretrain.py][INFO] Epoch:[0/2](697000/4588595) loss:2.957 lr:0.0000100 epoch_Time:24753.0min: [2024-01-05 19:33:11,234][model8_pretrain.py][INFO] Epoch:[0/2](697000/4588595) loss:2.908 lr:0.0000100 epoch_Time:24753.0min: [2024-01-05 19:33:11,234][model8_pretrain.py][INFO] Epoch:[0/2](697000/4588595) loss:2.765 lr:0.0000100 epoch_Time:24753.0min: [2024-01-05 19:33:11,234][model8_pretrain.py][INFO] Epoch:[0/2](697000/4588595) loss:2.242 lr:0.0000100 epoch_Time:24753.0min: [2024-01-05 
19:33:48,163][model8_pretrain.py][INFO] Epoch:[0/2](697100/4588595) loss:3.158 lr:0.0000100 epoch_Time:24752.0min: [2024-01-05 19:33:48,163][model8_pretrain.py][INFO] Epoch:[0/2](697100/4588595) loss:3.237 lr:0.0000100 epoch_Time:24752.0min: [2024-01-05 19:33:48,163][model8_pretrain.py][INFO] Epoch:[0/2](697100/4588595) loss:3.351 lr:0.0000100 epoch_Time:24752.0min: [2024-01-05 19:33:48,163][model8_pretrain.py][INFO] Epoch:[0/2](697100/4588595) loss:2.910 lr:0.0000100 epoch_Time:24752.0min: [2024-01-05 19:33:48,163][model8_pretrain.py][INFO] Epoch:[0/2](697100/4588595) loss:2.503 lr:0.0000100 epoch_Time:24752.0min: [2024-01-05 19:33:48,163][model8_pretrain.py][INFO] Epoch:[0/2](697100/4588595) loss:2.842 lr:0.0000100 epoch_Time:24752.0min: [2024-01-05 19:33:48,164][model8_pretrain.py][INFO] Epoch:[0/2](697100/4588595) loss:3.088 lr:0.0000100 epoch_Time:24752.0min: [2024-01-05 19:33:48,164][model8_pretrain.py][INFO] Epoch:[0/2](697100/4588595) loss:3.256 lr:0.0000100 epoch_Time:24752.0min: [2024-01-05 19:34:25,103][model8_pretrain.py][INFO] Epoch:[0/2](697200/4588595) loss:2.576 lr:0.0000100 epoch_Time:24752.0min: [2024-01-05 19:34:25,103][model8_pretrain.py][INFO] Epoch:[0/2](697200/4588595) loss:2.486 lr:0.0000100 epoch_Time:24752.0min: [2024-01-05 19:34:25,103][model8_pretrain.py][INFO] Epoch:[0/2](697200/4588595) loss:3.019 lr:0.0000100 epoch_Time:24752.0min: [2024-01-05 19:34:25,103][model8_pretrain.py][INFO] Epoch:[0/2](697200/4588595) loss:2.708 lr:0.0000100 epoch_Time:24752.0min: [2024-01-05 19:34:25,103][model8_pretrain.py][INFO] Epoch:[0/2](697200/4588595) loss:2.938 lr:0.0000100 epoch_Time:24752.0min: [2024-01-05 19:34:25,103][model8_pretrain.py][INFO] Epoch:[0/2](697200/4588595) loss:3.263 lr:0.0000100 epoch_Time:24752.0min: [2024-01-05 19:34:25,103][model8_pretrain.py][INFO] Epoch:[0/2](697200/4588595) loss:2.605 lr:0.0000100 epoch_Time:24752.0min: [2024-01-05 19:34:25,103][model8_pretrain.py][INFO] Epoch:[0/2](697200/4588595) loss:2.759 lr:0.0000100 
epoch_Time:24752.0min: [2024-01-05 19:35:02,038][model8_pretrain.py][INFO] Epoch:[0/2](697300/4588595) loss:1.899 lr:0.0000100 epoch_Time:24751.0min: [2024-01-05 19:35:02,038][model8_pretrain.py][INFO] Epoch:[0/2](697300/4588595) loss:2.193 lr:0.0000100 epoch_Time:24751.0min: [2024-01-05 19:35:02,038][model8_pretrain.py][INFO] Epoch:[0/2](697300/4588595) loss:2.309 lr:0.0000100 epoch_Time:24751.0min: [2024-01-05 19:35:02,038][model8_pretrain.py][INFO] Epoch:[0/2](697300/4588595) loss:3.139 lr:0.0000100 epoch_Time:24751.0min: [2024-01-05 19:35:02,038][model8_pretrain.py][INFO] Epoch:[0/2](697300/4588595) loss:2.396 lr:0.0000100 epoch_Time:24751.0min: [2024-01-05 19:35:02,038][model8_pretrain.py][INFO] Epoch:[0/2](697300/4588595) loss:2.789 lr:0.0000100 epoch_Time:24751.0min: [2024-01-05 19:35:02,038][model8_pretrain.py][INFO] Epoch:[0/2](697300/4588595) loss:2.847 lr:0.0000100 epoch_Time:24751.0min: [2024-01-05 19:35:02,038][model8_pretrain.py][INFO] Epoch:[0/2](697300/4588595) loss:2.668 lr:0.0000100 epoch_Time:24751.0min: [2024-01-05 19:35:40,703][model8_pretrain.py][INFO] Epoch:[0/2](697400/4588595) loss:3.379 lr:0.0000100 epoch_Time:24751.0min: [2024-01-05 19:35:40,703][model8_pretrain.py][INFO] Epoch:[0/2](697400/4588595) loss:3.060 lr:0.0000100 epoch_Time:24751.0min: [2024-01-05 19:35:40,703][model8_pretrain.py][INFO] Epoch:[0/2](697400/4588595) loss:2.538 lr:0.0000100 epoch_Time:24751.0min: [2024-01-05 19:35:40,703][model8_pretrain.py][INFO] Epoch:[0/2](697400/4588595) loss:3.018 lr:0.0000100 epoch_Time:24751.0min: [2024-01-05 19:35:40,703][model8_pretrain.py][INFO] Epoch:[0/2](697400/4588595) loss:2.770 lr:0.0000100 epoch_Time:24751.0min: [2024-01-05 19:35:40,703][model8_pretrain.py][INFO] Epoch:[0/2](697400/4588595) loss:2.439 lr:0.0000100 epoch_Time:24751.0min: [2024-01-05 19:35:40,703][model8_pretrain.py][INFO] Epoch:[0/2](697400/4588595) loss:3.581 lr:0.0000100 epoch_Time:24751.0min: [2024-01-05 19:35:40,704][model8_pretrain.py][INFO] 
Epoch:[0/2](697400/4588595) loss:2.729 lr:0.0000100 epoch_Time:24751.0min: [2024-01-05 19:36:27,945][model8_pretrain.py][INFO] Epoch:[0/2](697500/4588595) loss:2.818 lr:0.0000100 epoch_Time:24751.0min: [2024-01-05 19:36:27,945][model8_pretrain.py][INFO] Epoch:[0/2](697500/4588595) loss:2.729 lr:0.0000100 epoch_Time:24751.0min: [2024-01-05 19:36:27,945][model8_pretrain.py][INFO] Epoch:[0/2](697500/4588595) loss:2.244 lr:0.0000100 epoch_Time:24751.0min: [2024-01-05 19:36:27,945][model8_pretrain.py][INFO] Epoch:[0/2](697500/4588595) loss:2.845 lr:0.0000100 epoch_Time:24751.0min: [2024-01-05 19:36:27,945][model8_pretrain.py][INFO] Epoch:[0/2](697500/4588595) loss:2.521 lr:0.0000100 epoch_Time:24751.0min: [2024-01-05 19:36:27,945][model8_pretrain.py][INFO] Epoch:[0/2](697500/4588595) loss:2.348 lr:0.0000100 epoch_Time:24751.0min: [2024-01-05 19:36:27,945][model8_pretrain.py][INFO] Epoch:[0/2](697500/4588595) loss:2.804 lr:0.0000100 epoch_Time:24751.0min: [2024-01-05 19:36:27,945][model8_pretrain.py][INFO] Epoch:[0/2](697500/4588595) loss:2.526 lr:0.0000100 epoch_Time:24751.0min: [2024-01-05 19:37:04,875][model8_pretrain.py][INFO] Epoch:[0/2](697600/4588595) loss:2.798 lr:0.0000100 epoch_Time:24750.0min: [2024-01-05 19:37:04,875][model8_pretrain.py][INFO] Epoch:[0/2](697600/4588595) loss:2.628 lr:0.0000100 epoch_Time:24750.0min: [2024-01-05 19:37:04,875][model8_pretrain.py][INFO] Epoch:[0/2](697600/4588595) loss:2.373 lr:0.0000100 epoch_Time:24750.0min: [2024-01-05 19:37:04,875][model8_pretrain.py][INFO] Epoch:[0/2](697600/4588595) loss:2.483 lr:0.0000100 epoch_Time:24750.0min: [2024-01-05 19:37:04,875][model8_pretrain.py][INFO] Epoch:[0/2](697600/4588595) loss:3.101 lr:0.0000100 epoch_Time:24750.0min: [2024-01-05 19:37:04,875][model8_pretrain.py][INFO] Epoch:[0/2](697600/4588595) loss:2.519 lr:0.0000100 epoch_Time:24750.0min: [2024-01-05 19:37:04,875][model8_pretrain.py][INFO] Epoch:[0/2](697600/4588595) loss:2.966 lr:0.0000100 epoch_Time:24750.0min: [2024-01-05 
19:37:04,875][model8_pretrain.py][INFO] Epoch:[0/2](697600/4588595) loss:2.766 lr:0.0000100 epoch_Time:24750.0min: [2024-01-05 19:37:41,807][model8_pretrain.py][INFO] Epoch:[0/2](697700/4588595) loss:3.332 lr:0.0000100 epoch_Time:24749.0min: [2024-01-05 19:37:41,807][model8_pretrain.py][INFO] Epoch:[0/2](697700/4588595) loss:3.277 lr:0.0000100 epoch_Time:24749.0min: [2024-01-05 19:37:41,807][model8_pretrain.py][INFO] Epoch:[0/2](697700/4588595) loss:2.620 lr:0.0000100 epoch_Time:24749.0min: [2024-01-05 19:37:41,808][model8_pretrain.py][INFO] Epoch:[0/2](697700/4588595) loss:2.946 lr:0.0000100 epoch_Time:24749.0min: [2024-01-05 19:37:41,808][model8_pretrain.py][INFO] Epoch:[0/2](697700/4588595) loss:2.895 lr:0.0000100 epoch_Time:24749.0min: [2024-01-05 19:37:41,808][model8_pretrain.py][INFO] Epoch:[0/2](697700/4588595) loss:2.678 lr:0.0000100 epoch_Time:24749.0min: [2024-01-05 19:37:41,808][model8_pretrain.py][INFO] Epoch:[0/2](697700/4588595) loss:2.722 lr:0.0000100 epoch_Time:24749.0min: [2024-01-05 19:37:41,808][model8_pretrain.py][INFO] Epoch:[0/2](697700/4588595) loss:2.355 lr:0.0000100 epoch_Time:24749.0min: [2024-01-05 19:38:18,737][model8_pretrain.py][INFO] Epoch:[0/2](697800/4588595) loss:3.079 lr:0.0000100 epoch_Time:24748.0min: [2024-01-05 19:38:18,737][model8_pretrain.py][INFO] Epoch:[0/2](697800/4588595) loss:2.808 lr:0.0000100 epoch_Time:24748.0min: [2024-01-05 19:38:18,737][model8_pretrain.py][INFO] Epoch:[0/2](697800/4588595) loss:2.580 lr:0.0000100 epoch_Time:24748.0min: [2024-01-05 19:38:18,737][model8_pretrain.py][INFO] Epoch:[0/2](697800/4588595) loss:2.876 lr:0.0000100 epoch_Time:24748.0min: [2024-01-05 19:38:18,737][model8_pretrain.py][INFO] Epoch:[0/2](697800/4588595) loss:2.616 lr:0.0000100 epoch_Time:24748.0min: [2024-01-05 19:38:18,737][model8_pretrain.py][INFO] Epoch:[0/2](697800/4588595) loss:2.820 lr:0.0000100 epoch_Time:24748.0min: [2024-01-05 19:38:18,737][model8_pretrain.py][INFO] Epoch:[0/2](697800/4588595) loss:2.737 lr:0.0000100 
epoch_Time:24748.0min: [2024-01-05 19:38:18,737][model8_pretrain.py][INFO] Epoch:[0/2](697800/4588595) loss:3.048 lr:0.0000100 epoch_Time:24748.0min: [2024-01-05 19:38:55,670][model8_pretrain.py][INFO] Epoch:[0/2](697900/4588595) loss:2.524 lr:0.0000100 epoch_Time:24747.0min: [2024-01-05 19:38:55,670][model8_pretrain.py][INFO] Epoch:[0/2](697900/4588595) loss:2.586 lr:0.0000100 epoch_Time:24747.0min: [2024-01-05 19:38:55,670][model8_pretrain.py][INFO] Epoch:[0/2](697900/4588595) loss:3.435 lr:0.0000100 epoch_Time:24747.0min: [2024-01-05 19:38:55,670][model8_pretrain.py][INFO] Epoch:[0/2](697900/4588595) loss:2.662 lr:0.0000100 epoch_Time:24747.0min: [2024-01-05 19:38:55,670][model8_pretrain.py][INFO] Epoch:[0/2](697900/4588595) loss:2.836 lr:0.0000100 epoch_Time:24747.0min: [2024-01-05 19:38:55,670][model8_pretrain.py][INFO] Epoch:[0/2](697900/4588595) loss:2.736 lr:0.0000100 epoch_Time:24747.0min: [2024-01-05 19:38:55,670][model8_pretrain.py][INFO] Epoch:[0/2](697900/4588595) loss:3.022 lr:0.0000100 epoch_Time:24747.0min: [2024-01-05 19:38:55,670][model8_pretrain.py][INFO] Epoch:[0/2](697900/4588595) loss:3.218 lr:0.0000100 epoch_Time:24747.0min: [2024-01-05 19:39:32,598][model8_pretrain.py][INFO] Epoch:[0/2](698000/4588595) loss:3.141 lr:0.0000100 epoch_Time:24747.0min: [2024-01-05 19:39:32,598][model8_pretrain.py][INFO] Epoch:[0/2](698000/4588595) loss:2.079 lr:0.0000100 epoch_Time:24747.0min: [2024-01-05 19:39:32,598][model8_pretrain.py][INFO] Epoch:[0/2](698000/4588595) loss:3.096 lr:0.0000100 epoch_Time:24747.0min: [2024-01-05 19:39:32,598][model8_pretrain.py][INFO] Epoch:[0/2](698000/4588595) loss:3.507 lr:0.0000100 epoch_Time:24747.0min: [2024-01-05 19:39:32,598][model8_pretrain.py][INFO] Epoch:[0/2](698000/4588595) loss:2.790 lr:0.0000100 epoch_Time:24747.0min: [2024-01-05 19:39:32,598][model8_pretrain.py][INFO] Epoch:[0/2](698000/4588595) loss:2.753 lr:0.0000100 epoch_Time:24747.0min: [2024-01-05 19:39:32,599][model8_pretrain.py][INFO] 
Epoch:[0/2](698000/4588595) loss:3.159 lr:0.0000100 epoch_Time:24747.0min: [2024-01-05 19:39:32,599][model8_pretrain.py][INFO] Epoch:[0/2](698000/4588595) loss:2.923 lr:0.0000100 epoch_Time:24747.0min: [2024-01-05 19:40:09,541][model8_pretrain.py][INFO] Epoch:[0/2](698100/4588595) loss:2.779 lr:0.0000100 epoch_Time:24746.0min: [2024-01-05 19:40:09,541][model8_pretrain.py][INFO] Epoch:[0/2](698100/4588595) loss:2.556 lr:0.0000100 epoch_Time:24746.0min: [2024-01-05 19:40:09,541][model8_pretrain.py][INFO] Epoch:[0/2](698100/4588595) loss:2.678 lr:0.0000100 epoch_Time:24746.0min: [2024-01-05 19:40:09,541][model8_pretrain.py][INFO] Epoch:[0/2](698100/4588595) loss:2.277 lr:0.0000100 epoch_Time:24746.0min: [2024-01-05 19:40:09,541][model8_pretrain.py][INFO] Epoch:[0/2](698100/4588595) loss:2.636 lr:0.0000100 epoch_Time:24746.0min: [2024-01-05 19:40:09,541][model8_pretrain.py][INFO] Epoch:[0/2](698100/4588595) loss:2.796 lr:0.0000100 epoch_Time:24746.0min: [2024-01-05 19:40:09,541][model8_pretrain.py][INFO] Epoch:[0/2](698100/4588595) loss:2.946 lr:0.0000100 epoch_Time:24746.0min: [2024-01-05 19:40:09,541][model8_pretrain.py][INFO] Epoch:[0/2](698100/4588595) loss:2.602 lr:0.0000100 epoch_Time:24746.0min: [2024-01-05 19:40:48,211][model8_pretrain.py][INFO] Epoch:[0/2](698200/4588595) loss:2.326 lr:0.0000100 epoch_Time:24745.0min: [2024-01-05 19:40:48,211][model8_pretrain.py][INFO] Epoch:[0/2](698200/4588595) loss:3.082 lr:0.0000100 epoch_Time:24745.0min: [2024-01-05 19:40:48,211][model8_pretrain.py][INFO] Epoch:[0/2](698200/4588595) loss:2.288 lr:0.0000100 epoch_Time:24745.0min: [2024-01-05 19:40:48,211][model8_pretrain.py][INFO] Epoch:[0/2](698200/4588595) loss:3.016 lr:0.0000100 epoch_Time:24745.0min: [2024-01-05 19:40:48,211][model8_pretrain.py][INFO] Epoch:[0/2](698200/4588595) loss:2.926 lr:0.0000100 epoch_Time:24745.0min: [2024-01-05 19:40:48,211][model8_pretrain.py][INFO] Epoch:[0/2](698200/4588595) loss:3.180 lr:0.0000100 epoch_Time:24745.0min: [2024-01-05 
19:40:48,212][model8_pretrain.py][INFO] Epoch:[0/2](698200/4588595) loss:2.870 lr:0.0000100 epoch_Time:24745.0min: [2024-01-05 19:40:48,212][model8_pretrain.py][INFO] Epoch:[0/2](698200/4588595) loss:2.893 lr:0.0000100 epoch_Time:24745.0min: [2024-01-05 19:41:35,268][model8_pretrain.py][INFO] Epoch:[0/2](698300/4588595) loss:2.524 lr:0.0000100 epoch_Time:24746.0min: [2024-01-05 19:41:35,269][model8_pretrain.py][INFO] Epoch:[0/2](698300/4588595) loss:3.155 lr:0.0000100 epoch_Time:24746.0min: [2024-01-05 19:41:35,269][model8_pretrain.py][INFO] Epoch:[0/2](698300/4588595) loss:2.568 lr:0.0000100 epoch_Time:24746.0min: [2024-01-05 19:41:35,269][model8_pretrain.py][INFO] Epoch:[0/2](698300/4588595) loss:2.615 lr:0.0000100 epoch_Time:24746.0min: [2024-01-05 19:41:35,269][model8_pretrain.py][INFO] Epoch:[0/2](698300/4588595) loss:3.357 lr:0.0000100 epoch_Time:24746.0min: [2024-01-05 19:41:35,269][model8_pretrain.py][INFO] Epoch:[0/2](698300/4588595) loss:2.839 lr:0.0000100 epoch_Time:24746.0min: [2024-01-05 19:41:35,269][model8_pretrain.py][INFO] Epoch:[0/2](698300/4588595) loss:2.982 lr:0.0000100 epoch_Time:24746.0min: [2024-01-05 19:41:35,269][model8_pretrain.py][INFO] Epoch:[0/2](698300/4588595) loss:2.963 lr:0.0000100 epoch_Time:24746.0min: [2024-01-05 19:42:12,198][model8_pretrain.py][INFO] Epoch:[0/2](698400/4588595) loss:3.120 lr:0.0000100 epoch_Time:24745.0min: [2024-01-05 19:42:12,198][model8_pretrain.py][INFO] Epoch:[0/2](698400/4588595) loss:2.416 lr:0.0000100 epoch_Time:24745.0min: [2024-01-05 19:42:12,198][model8_pretrain.py][INFO] Epoch:[0/2](698400/4588595) loss:2.814 lr:0.0000100 epoch_Time:24745.0min: [2024-01-05 19:42:12,198][model8_pretrain.py][INFO] Epoch:[0/2](698400/4588595) loss:2.857 lr:0.0000100 epoch_Time:24745.0min: [2024-01-05 19:42:12,198][model8_pretrain.py][INFO] Epoch:[0/2](698400/4588595) loss:3.065 lr:0.0000100 epoch_Time:24745.0min: [2024-01-05 19:42:12,198][model8_pretrain.py][INFO] Epoch:[0/2](698400/4588595) loss:2.720 lr:0.0000100 
epoch_Time:24745.0min: [2024-01-05 19:42:12,198][model8_pretrain.py][INFO] Epoch:[0/2](698400/4588595) loss:2.983 lr:0.0000100 epoch_Time:24745.0min: [2024-01-05 19:42:12,198][model8_pretrain.py][INFO] Epoch:[0/2](698400/4588595) loss:2.645 lr:0.0000100 epoch_Time:24745.0min: [2024-01-05 19:42:49,124][model8_pretrain.py][INFO] Epoch:[0/2](698500/4588595) loss:2.841 lr:0.0000100 epoch_Time:24744.0min: [2024-01-05 19:42:49,124][model8_pretrain.py][INFO] Epoch:[0/2](698500/4588595) loss:2.713 lr:0.0000100 epoch_Time:24744.0min: [2024-01-05 19:42:49,124][model8_pretrain.py][INFO] Epoch:[0/2](698500/4588595) loss:3.132 lr:0.0000100 epoch_Time:24744.0min: [2024-01-05 19:42:49,124][model8_pretrain.py][INFO] Epoch:[0/2](698500/4588595) loss:2.945 lr:0.0000100 epoch_Time:24744.0min: [2024-01-05 19:42:49,124][model8_pretrain.py][INFO] Epoch:[0/2](698500/4588595) loss:2.253 lr:0.0000100 epoch_Time:24744.0min: [2024-01-05 19:42:49,124][model8_pretrain.py][INFO] Epoch:[0/2](698500/4588595) loss:3.201 lr:0.0000100 epoch_Time:24744.0min: [2024-01-05 19:42:49,124][model8_pretrain.py][INFO] Epoch:[0/2](698500/4588595) loss:2.925 lr:0.0000100 epoch_Time:24744.0min: [2024-01-05 19:42:49,124][model8_pretrain.py][INFO] Epoch:[0/2](698500/4588595) loss:2.263 lr:0.0000100 epoch_Time:24744.0min: [2024-01-05 19:43:26,055][model8_pretrain.py][INFO] Epoch:[0/2](698600/4588595) loss:2.695 lr:0.0000100 epoch_Time:24744.0min: [2024-01-05 19:43:26,055][model8_pretrain.py][INFO] Epoch:[0/2](698600/4588595) loss:2.553 lr:0.0000100 epoch_Time:24743.0min: [2024-01-05 19:43:26,055][model8_pretrain.py][INFO] Epoch:[0/2](698600/4588595) loss:2.757 lr:0.0000100 epoch_Time:24744.0min: [2024-01-05 19:43:26,055][model8_pretrain.py][INFO] Epoch:[0/2](698600/4588595) loss:3.344 lr:0.0000100 epoch_Time:24744.0min: [2024-01-05 19:43:26,055][model8_pretrain.py][INFO] Epoch:[0/2](698600/4588595) loss:2.963 lr:0.0000100 epoch_Time:24744.0min: [2024-01-05 19:43:26,055][model8_pretrain.py][INFO] 
Epoch:[0/2](698600/4588595) loss:2.815 lr:0.0000100 epoch_Time:24744.0min: [2024-01-05 19:43:26,055][model8_pretrain.py][INFO] Epoch:[0/2](698600/4588595) loss:2.297 lr:0.0000100 epoch_Time:24744.0min: [2024-01-05 19:43:26,055][model8_pretrain.py][INFO] Epoch:[0/2](698600/4588595) loss:3.159 lr:0.0000100 epoch_Time:24744.0min: [2024-01-05 19:44:02,993][model8_pretrain.py][INFO] Epoch:[0/2](698700/4588595) loss:3.069 lr:0.0000100 epoch_Time:24742.0min: [2024-01-05 19:44:02,993][model8_pretrain.py][INFO] Epoch:[0/2](698700/4588595) loss:3.072 lr:0.0000100 epoch_Time:24742.0min: [2024-01-05 19:44:02,993][model8_pretrain.py][INFO] Epoch:[0/2](698700/4588595) loss:2.848 lr:0.0000100 epoch_Time:24742.0min: [2024-01-05 19:44:02,993][model8_pretrain.py][INFO] Epoch:[0/2](698700/4588595) loss:2.999 lr:0.0000100 epoch_Time:24742.0min: [2024-01-05 19:44:02,993][model8_pretrain.py][INFO] Epoch:[0/2](698700/4588595) loss:3.021 lr:0.0000100 epoch_Time:24742.0min: [2024-01-05 19:44:02,993][model8_pretrain.py][INFO] Epoch:[0/2](698700/4588595) loss:3.465 lr:0.0000100 epoch_Time:24742.0min: [2024-01-05 19:44:02,993][model8_pretrain.py][INFO] Epoch:[0/2](698700/4588595) loss:2.862 lr:0.0000100 epoch_Time:24742.0min: [2024-01-05 19:44:02,993][model8_pretrain.py][INFO] Epoch:[0/2](698700/4588595) loss:2.513 lr:0.0000100 epoch_Time:24742.0min: [2024-01-05 19:44:39,932][model8_pretrain.py][INFO] Epoch:[0/2](698800/4588595) loss:2.999 lr:0.0000100 epoch_Time:24742.0min: [2024-01-05 19:44:39,932][model8_pretrain.py][INFO] Epoch:[0/2](698800/4588595) loss:2.941 lr:0.0000100 epoch_Time:24742.0min: [2024-01-05 19:44:39,932][model8_pretrain.py][INFO] Epoch:[0/2](698800/4588595) loss:2.864 lr:0.0000100 epoch_Time:24742.0min: [2024-01-05 19:44:39,932][model8_pretrain.py][INFO] Epoch:[0/2](698800/4588595) loss:2.594 lr:0.0000100 epoch_Time:24742.0min: [2024-01-05 19:44:39,932][model8_pretrain.py][INFO] Epoch:[0/2](698800/4588595) loss:2.924 lr:0.0000100 epoch_Time:24742.0min: [2024-01-05 
19:44:39,932][model8_pretrain.py][INFO] Epoch:[0/2](698800/4588595) loss:3.115 lr:0.0000100 epoch_Time:24742.0min: [2024-01-05 19:44:39,932][model8_pretrain.py][INFO] Epoch:[0/2](698800/4588595) loss:3.222 lr:0.0000100 epoch_Time:24742.0min: [2024-01-05 19:44:39,932][model8_pretrain.py][INFO] Epoch:[0/2](698800/4588595) loss:3.141 lr:0.0000100 epoch_Time:24742.0min: [2024-01-05 19:45:16,865][model8_pretrain.py][INFO] Epoch:[0/2](698900/4588595) loss:2.766 lr:0.0000100 epoch_Time:24741.0min: [2024-01-05 19:45:16,865][model8_pretrain.py][INFO] Epoch:[0/2](698900/4588595) loss:2.777 lr:0.0000100 epoch_Time:24741.0min: [2024-01-05 19:45:16,865][model8_pretrain.py][INFO] Epoch:[0/2](698900/4588595) loss:2.888 lr:0.0000100 epoch_Time:24741.0min: [2024-01-05 19:45:16,865][model8_pretrain.py][INFO] Epoch:[0/2](698900/4588595) loss:2.846 lr:0.0000100 epoch_Time:24741.0min: [2024-01-05 19:45:16,866][model8_pretrain.py][INFO] Epoch:[0/2](698900/4588595) loss:2.931 lr:0.0000100 epoch_Time:24741.0min: [2024-01-05 19:45:16,866][model8_pretrain.py][INFO] Epoch:[0/2](698900/4588595) loss:2.832 lr:0.0000100 epoch_Time:24741.0min: [2024-01-05 19:45:16,866][model8_pretrain.py][INFO] Epoch:[0/2](698900/4588595) loss:2.121 lr:0.0000100 epoch_Time:24741.0min: [2024-01-05 19:45:16,866][model8_pretrain.py][INFO] Epoch:[0/2](698900/4588595) loss:2.568 lr:0.0000100 epoch_Time:24741.0min: [2024-01-05 19:45:53,794][model8_pretrain.py][INFO] Epoch:[0/2](699000/4588595) loss:2.216 lr:0.0000100 epoch_Time:24740.0min: [2024-01-05 19:45:53,794][model8_pretrain.py][INFO] Epoch:[0/2](699000/4588595) loss:2.497 lr:0.0000100 epoch_Time:24740.0min: [2024-01-05 19:45:53,795][model8_pretrain.py][INFO] Epoch:[0/2](699000/4588595) loss:3.336 lr:0.0000100 epoch_Time:24740.0min: [2024-01-05 19:45:53,795][model8_pretrain.py][INFO] Epoch:[0/2](699000/4588595) loss:2.199 lr:0.0000100 epoch_Time:24740.0min: [2024-01-05 19:45:53,795][model8_pretrain.py][INFO] Epoch:[0/2](699000/4588595) loss:2.826 lr:0.0000100 
epoch_Time:24740.0min: [2024-01-05 19:45:53,795][model8_pretrain.py][INFO] Epoch:[0/2](699000/4588595) loss:2.648 lr:0.0000100 epoch_Time:24740.0min: [2024-01-05 19:45:53,796][model8_pretrain.py][INFO] Epoch:[0/2](699000/4588595) loss:2.591 lr:0.0000100 epoch_Time:24740.0min: [2024-01-05 19:45:55,504][model8_pretrain.py][INFO] Epoch:[0/2](699000/4588595) loss:2.957 lr:0.0000100 epoch_Time:24740.0min: [2024-01-05 19:46:42,666][model8_pretrain.py][INFO] Epoch:[0/2](699100/4588595) loss:3.007 lr:0.0000100 epoch_Time:24741.0min: [2024-01-05 19:46:42,666][model8_pretrain.py][INFO] Epoch:[0/2](699100/4588595) loss:2.998 lr:0.0000100 epoch_Time:24741.0min: [2024-01-05 19:46:42,666][model8_pretrain.py][INFO] Epoch:[0/2](699100/4588595) loss:2.066 lr:0.0000100 epoch_Time:24741.0min: [2024-01-05 19:46:42,666][model8_pretrain.py][INFO] Epoch:[0/2](699100/4588595) loss:2.819 lr:0.0000100 epoch_Time:24741.0min: [2024-01-05 19:46:42,666][model8_pretrain.py][INFO] Epoch:[0/2](699100/4588595) loss:2.948 lr:0.0000100 epoch_Time:24741.0min: [2024-01-05 19:46:42,666][model8_pretrain.py][INFO] Epoch:[0/2](699100/4588595) loss:3.185 lr:0.0000100 epoch_Time:24741.0min: [2024-01-05 19:46:42,666][model8_pretrain.py][INFO] Epoch:[0/2](699100/4588595) loss:3.093 lr:0.0000100 epoch_Time:24741.0min: [2024-01-05 19:46:42,666][model8_pretrain.py][INFO] Epoch:[0/2](699100/4588595) loss:2.520 lr:0.0000100 epoch_Time:24741.0min: [2024-01-05 19:47:19,605][model8_pretrain.py][INFO] Epoch:[0/2](699200/4588595) loss:2.909 lr:0.0000100 epoch_Time:24740.0min: [2024-01-05 19:47:19,605][model8_pretrain.py][INFO] Epoch:[0/2](699200/4588595) loss:2.561 lr:0.0000100 epoch_Time:24740.0min: [2024-01-05 19:47:19,605][model8_pretrain.py][INFO] Epoch:[0/2](699200/4588595) loss:2.584 lr:0.0000100 epoch_Time:24740.0min: [2024-01-05 19:47:19,605][model8_pretrain.py][INFO] Epoch:[0/2](699200/4588595) loss:2.946 lr:0.0000100 epoch_Time:24740.0min: [2024-01-05 19:47:19,605][model8_pretrain.py][INFO] 
Epoch:[0/2](699200/4588595) loss:3.302 lr:0.0000100 epoch_Time:24740.0min: [2024-01-05 19:47:19,605][model8_pretrain.py][INFO] Epoch:[0/2](699200/4588595) loss:3.066 lr:0.0000100 epoch_Time:24740.0min: [2024-01-05 19:47:19,605][model8_pretrain.py][INFO] Epoch:[0/2](699200/4588595) loss:2.554 lr:0.0000100 epoch_Time:24740.0min: [2024-01-05 19:47:19,605][model8_pretrain.py][INFO] Epoch:[0/2](699200/4588595) loss:2.787 lr:0.0000100 epoch_Time:24740.0min: [2024-01-05 19:47:56,542][model8_pretrain.py][INFO] Epoch:[0/2](699300/4588595) loss:2.727 lr:0.0000100 epoch_Time:24739.0min: [2024-01-05 19:47:56,542][model8_pretrain.py][INFO] Epoch:[0/2](699300/4588595) loss:2.609 lr:0.0000100 epoch_Time:24739.0min: [2024-01-05 19:47:56,542][model8_pretrain.py][INFO] Epoch:[0/2](699300/4588595) loss:2.643 lr:0.0000100 epoch_Time:24739.0min: [2024-01-05 19:47:56,542][model8_pretrain.py][INFO] Epoch:[0/2](699300/4588595) loss:2.989 lr:0.0000100 epoch_Time:24739.0min: [2024-01-05 19:47:56,542][model8_pretrain.py][INFO] Epoch:[0/2](699300/4588595) loss:2.750 lr:0.0000100 epoch_Time:24739.0min: [2024-01-05 19:47:56,542][model8_pretrain.py][INFO] Epoch:[0/2](699300/4588595) loss:3.014 lr:0.0000100 epoch_Time:24739.0min: [2024-01-05 19:47:56,542][model8_pretrain.py][INFO] Epoch:[0/2](699300/4588595) loss:3.034 lr:0.0000100 epoch_Time:24739.0min: [2024-01-05 19:47:56,542][model8_pretrain.py][INFO] Epoch:[0/2](699300/4588595) loss:3.175 lr:0.0000100 epoch_Time:24739.0min: [2024-01-05 19:48:33,490][model8_pretrain.py][INFO] Epoch:[0/2](699400/4588595) loss:3.515 lr:0.0000100 epoch_Time:24739.0min: [2024-01-05 19:48:33,490][model8_pretrain.py][INFO] Epoch:[0/2](699400/4588595) loss:2.970 lr:0.0000100 epoch_Time:24739.0min: [2024-01-05 19:48:33,490][model8_pretrain.py][INFO] Epoch:[0/2](699400/4588595) loss:3.115 lr:0.0000100 epoch_Time:24739.0min: [2024-01-05 19:48:33,490][model8_pretrain.py][INFO] Epoch:[0/2](699400/4588595) loss:3.224 lr:0.0000100 epoch_Time:24739.0min: [2024-01-05 
19:48:33,490][model8_pretrain.py][INFO] Epoch:[0/2](699400/4588595) loss:3.301 lr:0.0000100 epoch_Time:24739.0min: [2024-01-05 19:48:33,490][model8_pretrain.py][INFO] Epoch:[0/2](699400/4588595) loss:2.778 lr:0.0000100 epoch_Time:24739.0min: [2024-01-05 19:48:33,490][model8_pretrain.py][INFO] Epoch:[0/2](699400/4588595) loss:2.388 lr:0.0000100 epoch_Time:24739.0min: [2024-01-05 19:48:33,490][model8_pretrain.py][INFO] Epoch:[0/2](699400/4588595) loss:3.243 lr:0.0000100 epoch_Time:24739.0min: [2024-01-05 19:49:10,449][model8_pretrain.py][INFO] Epoch:[0/2](699500/4588595) loss:2.748 lr:0.0000100 epoch_Time:24738.0min: [2024-01-05 19:49:10,449][model8_pretrain.py][INFO] Epoch:[0/2](699500/4588595) loss:3.228 lr:0.0000100 epoch_Time:24738.0min: [2024-01-05 19:49:10,449][model8_pretrain.py][INFO] Epoch:[0/2](699500/4588595) loss:3.038 lr:0.0000100 epoch_Time:24738.0min: [2024-01-05 19:49:10,449][model8_pretrain.py][INFO] Epoch:[0/2](699500/4588595) loss:2.681 lr:0.0000100 epoch_Time:24738.0min: [2024-01-05 19:49:10,449][model8_pretrain.py][INFO] Epoch:[0/2](699500/4588595) loss:2.986 lr:0.0000100 epoch_Time:24738.0min: [2024-01-05 19:49:10,449][model8_pretrain.py][INFO] Epoch:[0/2](699500/4588595) loss:3.013 lr:0.0000100 epoch_Time:24738.0min: [2024-01-05 19:49:10,449][model8_pretrain.py][INFO] Epoch:[0/2](699500/4588595) loss:3.285 lr:0.0000100 epoch_Time:24738.0min: [2024-01-05 19:49:10,449][model8_pretrain.py][INFO] Epoch:[0/2](699500/4588595) loss:2.738 lr:0.0000100 epoch_Time:24738.0min: [2024-01-05 19:49:47,453][model8_pretrain.py][INFO] Epoch:[0/2](699600/4588595) loss:3.036 lr:0.0000100 epoch_Time:24737.0min: [2024-01-05 19:49:47,453][model8_pretrain.py][INFO] Epoch:[0/2](699600/4588595) loss:2.404 lr:0.0000100 epoch_Time:24737.0min: [2024-01-05 19:49:47,453][model8_pretrain.py][INFO] Epoch:[0/2](699600/4588595) loss:2.625 lr:0.0000100 epoch_Time:24737.0min: [2024-01-05 19:49:47,453][model8_pretrain.py][INFO] Epoch:[0/2](699600/4588595) loss:3.018 lr:0.0000100 
epoch_Time:24737.0min: [2024-01-05 19:49:47,453][model8_pretrain.py][INFO] Epoch:[0/2](699600/4588595) loss:2.769 lr:0.0000100 epoch_Time:24737.0min: [2024-01-05 19:49:47,453][model8_pretrain.py][INFO] Epoch:[0/2](699600/4588595) loss:3.102 lr:0.0000100 epoch_Time:24737.0min: [2024-01-05 19:49:47,453][model8_pretrain.py][INFO] Epoch:[0/2](699600/4588595) loss:2.405 lr:0.0000100 epoch_Time:24737.0min: [2024-01-05 19:49:47,453][model8_pretrain.py][INFO] Epoch:[0/2](699600/4588595) loss:2.223 lr:0.0000100 epoch_Time:24737.0min: [2024-01-05 19:50:24,431][model8_pretrain.py][INFO] Epoch:[0/2](699700/4588595) loss:2.601 lr:0.0000100 epoch_Time:24736.0min: [2024-01-05 19:50:24,431][model8_pretrain.py][INFO] Epoch:[0/2](699700/4588595) loss:2.708 lr:0.0000100 epoch_Time:24736.0min: [2024-01-05 19:50:24,431][model8_pretrain.py][INFO] Epoch:[0/2](699700/4588595) loss:2.880 lr:0.0000100 epoch_Time:24736.0min: [2024-01-05 19:50:24,431][model8_pretrain.py][INFO] Epoch:[0/2](699700/4588595) loss:3.145 lr:0.0000100 epoch_Time:24736.0min: [2024-01-05 19:50:24,431][model8_pretrain.py][INFO] Epoch:[0/2](699700/4588595) loss:1.995 lr:0.0000100 epoch_Time:24736.0min: [2024-01-05 19:50:24,431][model8_pretrain.py][INFO] Epoch:[0/2](699700/4588595) loss:2.466 lr:0.0000100 epoch_Time:24736.0min: [2024-01-05 19:50:24,431][model8_pretrain.py][INFO] Epoch:[0/2](699700/4588595) loss:2.862 lr:0.0000100 epoch_Time:24736.0min: [2024-01-05 19:50:24,431][model8_pretrain.py][INFO] Epoch:[0/2](699700/4588595) loss:2.492 lr:0.0000100 epoch_Time:24736.0min: [2024-01-05 19:51:01,429][model8_pretrain.py][INFO] Epoch:[0/2](699800/4588595) loss:3.432 lr:0.0000100 epoch_Time:24735.0min: [2024-01-05 19:51:01,429][model8_pretrain.py][INFO] Epoch:[0/2](699800/4588595) loss:2.702 lr:0.0000100 epoch_Time:24735.0min: [2024-01-05 19:51:01,429][model8_pretrain.py][INFO] Epoch:[0/2](699800/4588595) loss:2.996 lr:0.0000100 epoch_Time:24735.0min: [2024-01-05 19:51:01,429][model8_pretrain.py][INFO] 
Epoch:[0/2](699800/4588595) loss:3.439 lr:0.0000100 epoch_Time:24735.0min: [2024-01-05 19:51:01,429][model8_pretrain.py][INFO] Epoch:[0/2](699800/4588595) loss:3.278 lr:0.0000100 epoch_Time:24735.0min: [2024-01-05 19:51:01,429][model8_pretrain.py][INFO] Epoch:[0/2](699800/4588595) loss:2.854 lr:0.0000100 epoch_Time:24735.0min: [2024-01-05 19:51:01,429][model8_pretrain.py][INFO] Epoch:[0/2](699800/4588595) loss:3.253 lr:0.0000100 epoch_Time:24735.0min: [2024-01-05 19:51:01,429][model8_pretrain.py][INFO] Epoch:[0/2](699800/4588595) loss:2.645 lr:0.0000100 epoch_Time:24735.0min: [2024-01-05 19:51:50,455][model8_pretrain.py][INFO] Epoch:[0/2](699900/4588595) loss:2.977 lr:0.0000100 epoch_Time:24735.0min: [2024-01-05 19:51:50,455][model8_pretrain.py][INFO] Epoch:[0/2](699900/4588595) loss:2.759 lr:0.0000100 epoch_Time:24735.0min: [2024-01-05 19:51:50,455][model8_pretrain.py][INFO] Epoch:[0/2](699900/4588595) loss:3.546 lr:0.0000100 epoch_Time:24735.0min: [2024-01-05 19:51:50,455][model8_pretrain.py][INFO] Epoch:[0/2](699900/4588595) loss:3.215 lr:0.0000100 epoch_Time:24735.0min: [2024-01-05 19:51:50,455][model8_pretrain.py][INFO] Epoch:[0/2](699900/4588595) loss:2.629 lr:0.0000100 epoch_Time:24735.0min: [2024-01-05 19:51:50,455][model8_pretrain.py][INFO] Epoch:[0/2](699900/4588595) loss:2.473 lr:0.0000100 epoch_Time:24735.0min: [2024-01-05 19:51:50,455][model8_pretrain.py][INFO] Epoch:[0/2](699900/4588595) loss:2.654 lr:0.0000100 epoch_Time:24735.0min: [2024-01-05 19:51:50,455][model8_pretrain.py][INFO] Epoch:[0/2](699900/4588595) loss:3.224 lr:0.0000100 epoch_Time:24735.0min: [2024-01-05 19:52:27,376][model8_pretrain.py][INFO] Epoch:[0/2](700000/4588595) loss:2.719 lr:0.0000100 epoch_Time:24735.0min: [2024-01-05 19:52:27,376][model8_pretrain.py][INFO] Epoch:[0/2](700000/4588595) loss:3.266 lr:0.0000100 epoch_Time:24735.0min: [2024-01-05 19:52:27,376][model8_pretrain.py][INFO] Epoch:[0/2](700000/4588595) loss:2.979 lr:0.0000100 epoch_Time:24735.0min: [2024-01-05 
19:52:27,376][model8_pretrain.py][INFO] Epoch:[0/2](700000/4588595) loss:2.670 lr:0.0000100 epoch_Time:24735.0min: [2024-01-05 19:52:27,376][model8_pretrain.py][INFO] Epoch:[0/2](700000/4588595) loss:3.027 lr:0.0000100 epoch_Time:24735.0min: [2024-01-05 19:52:27,376][model8_pretrain.py][INFO] Epoch:[0/2](700000/4588595) loss:2.770 lr:0.0000100 epoch_Time:24735.0min: [2024-01-05 19:52:27,376][model8_pretrain.py][INFO] Epoch:[0/2](700000/4588595) loss:2.869 lr:0.0000100 epoch_Time:24735.0min: [2024-01-05 19:52:27,376][model8_pretrain.py][INFO] Epoch:[0/2](700000/4588595) loss:3.124 lr:0.0000100 epoch_Time:24735.0min: [2024-01-05 19:53:04,318][model8_pretrain.py][INFO] Epoch:[0/2](700100/4588595) loss:2.555 lr:0.0000100 epoch_Time:24734.0min: [2024-01-05 19:53:04,318][model8_pretrain.py][INFO] Epoch:[0/2](700100/4588595) loss:3.387 lr:0.0000100 epoch_Time:24734.0min: [2024-01-05 19:53:04,318][model8_pretrain.py][INFO] Epoch:[0/2](700100/4588595) loss:3.116 lr:0.0000100 epoch_Time:24734.0min: [2024-01-05 19:53:04,318][model8_pretrain.py][INFO] Epoch:[0/2](700100/4588595) loss:2.642 lr:0.0000100 epoch_Time:24734.0min: [2024-01-05 19:53:04,318][model8_pretrain.py][INFO] Epoch:[0/2](700100/4588595) loss:3.287 lr:0.0000100 epoch_Time:24734.0min: [2024-01-05 19:53:04,318][model8_pretrain.py][INFO] Epoch:[0/2](700100/4588595) loss:2.789 lr:0.0000100 epoch_Time:24734.0min: [2024-01-05 19:53:04,319][model8_pretrain.py][INFO] Epoch:[0/2](700100/4588595) loss:2.818 lr:0.0000100 epoch_Time:24734.0min: [2024-01-05 19:53:04,319][model8_pretrain.py][INFO] Epoch:[0/2](700100/4588595) loss:2.174 lr:0.0000100 epoch_Time:24734.0min: [2024-01-05 19:53:41,260][model8_pretrain.py][INFO] Epoch:[0/2](700200/4588595) loss:3.215 lr:0.0000100 epoch_Time:24734.0min: [2024-01-05 19:53:41,260][model8_pretrain.py][INFO] Epoch:[0/2](700200/4588595) loss:3.229 lr:0.0000100 epoch_Time:24734.0min: [2024-01-05 19:53:41,260][model8_pretrain.py][INFO] Epoch:[0/2](700200/4588595) loss:3.334 lr:0.0000100 
epoch_Time:24734.0min: [2024-01-05 19:53:41,260][model8_pretrain.py][INFO] Epoch:[0/2](700200/4588595) loss:2.368 lr:0.0000100 epoch_Time:24734.0min: [2024-01-05 19:53:41,260][model8_pretrain.py][INFO] Epoch:[0/2](700200/4588595) loss:2.823 lr:0.0000100 epoch_Time:24734.0min: [2024-01-05 19:53:41,260][model8_pretrain.py][INFO] Epoch:[0/2](700200/4588595) loss:2.995 lr:0.0000100 epoch_Time:24734.0min: [2024-01-05 19:53:41,260][model8_pretrain.py][INFO] Epoch:[0/2](700200/4588595) loss:2.808 lr:0.0000100 epoch_Time:24734.0min: [2024-01-05 19:53:41,260][model8_pretrain.py][INFO] Epoch:[0/2](700200/4588595) loss:2.846 lr:0.0000100 epoch_Time:24734.0min: [2024-01-05 19:54:18,198][model8_pretrain.py][INFO] Epoch:[0/2](700300/4588595) loss:2.797 lr:0.0000100 epoch_Time:24733.0min: [2024-01-05 19:54:18,198][model8_pretrain.py][INFO] Epoch:[0/2](700300/4588595) loss:3.291 lr:0.0000100 epoch_Time:24733.0min: [2024-01-05 19:54:18,198][model8_pretrain.py][INFO] Epoch:[0/2](700300/4588595) loss:2.744 lr:0.0000100 epoch_Time:24733.0min: [2024-01-05 19:54:18,199][model8_pretrain.py][INFO] Epoch:[0/2](700300/4588595) loss:2.961 lr:0.0000100 epoch_Time:24733.0min: [2024-01-05 19:54:18,198][model8_pretrain.py][INFO] Epoch:[0/2](700300/4588595) loss:2.519 lr:0.0000100 epoch_Time:24733.0min: [2024-01-05 19:54:18,199][model8_pretrain.py][INFO] Epoch:[0/2](700300/4588595) loss:2.876 lr:0.0000100 epoch_Time:24733.0min: [2024-01-05 19:54:18,198][model8_pretrain.py][INFO] Epoch:[0/2](700300/4588595) loss:2.977 lr:0.0000100 epoch_Time:24733.0min: [2024-01-05 19:54:18,198][model8_pretrain.py][INFO] Epoch:[0/2](700300/4588595) loss:2.159 lr:0.0000100 epoch_Time:24733.0min: [2024-01-05 19:54:55,132][model8_pretrain.py][INFO] Epoch:[0/2](700400/4588595) loss:3.014 lr:0.0000100 epoch_Time:24732.0min: [2024-01-05 19:54:55,132][model8_pretrain.py][INFO] Epoch:[0/2](700400/4588595) loss:3.192 lr:0.0000100 epoch_Time:24732.0min: [2024-01-05 19:54:55,132][model8_pretrain.py][INFO] 
Epoch:[0/2](700400/4588595) loss:2.900 lr:0.0000100 epoch_Time:24732.0min: [2024-01-05 19:54:55,133][model8_pretrain.py][INFO] Epoch:[0/2](700400/4588595) loss:2.618 lr:0.0000100 epoch_Time:24732.0min: [2024-01-05 19:54:55,133][model8_pretrain.py][INFO] Epoch:[0/2](700400/4588595) loss:3.284 lr:0.0000100 epoch_Time:24732.0min: [2024-01-05 19:54:55,133][model8_pretrain.py][INFO] Epoch:[0/2](700400/4588595) loss:3.202 lr:0.0000100 epoch_Time:24732.0min: [2024-01-05 19:54:55,133][model8_pretrain.py][INFO] Epoch:[0/2](700400/4588595) loss:2.677 lr:0.0000100 epoch_Time:24732.0min: [2024-01-05 19:54:55,133][model8_pretrain.py][INFO] Epoch:[0/2](700400/4588595) loss:2.130 lr:0.0000100 epoch_Time:24732.0min: [2024-01-05 19:55:32,069][model8_pretrain.py][INFO] Epoch:[0/2](700500/4588595) loss:2.329 lr:0.0000100 epoch_Time:24732.0min: [2024-01-05 19:55:32,069][model8_pretrain.py][INFO] Epoch:[0/2](700500/4588595) loss:3.122 lr:0.0000100 epoch_Time:24732.0min: [2024-01-05 19:55:32,069][model8_pretrain.py][INFO] Epoch:[0/2](700500/4588595) loss:2.365 lr:0.0000100 epoch_Time:24732.0min: [2024-01-05 19:55:32,069][model8_pretrain.py][INFO] Epoch:[0/2](700500/4588595) loss:3.199 lr:0.0000100 epoch_Time:24732.0min: [2024-01-05 19:55:32,069][model8_pretrain.py][INFO] Epoch:[0/2](700500/4588595) loss:2.862 lr:0.0000100 epoch_Time:24732.0min: [2024-01-05 19:55:32,069][model8_pretrain.py][INFO] Epoch:[0/2](700500/4588595) loss:2.338 lr:0.0000100 epoch_Time:24732.0min: [2024-01-05 19:55:32,069][model8_pretrain.py][INFO] Epoch:[0/2](700500/4588595) loss:2.665 lr:0.0000100 epoch_Time:24732.0min: [2024-01-05 19:55:32,069][model8_pretrain.py][INFO] Epoch:[0/2](700500/4588595) loss:2.695 lr:0.0000100 epoch_Time:24732.0min: [2024-01-05 19:56:09,007][model8_pretrain.py][INFO] Epoch:[0/2](700600/4588595) loss:2.327 lr:0.0000100 epoch_Time:24730.0min: [2024-01-05 19:56:09,007][model8_pretrain.py][INFO] Epoch:[0/2](700600/4588595) loss:3.084 lr:0.0000100 epoch_Time:24730.0min: [2024-01-05 
19:56:09,007][model8_pretrain.py][INFO] Epoch:[0/2](700600/4588595) loss:2.764 lr:0.0000100 epoch_Time:24730.0min: [2024-01-05 19:56:09,007][model8_pretrain.py][INFO] Epoch:[0/2](700600/4588595) loss:2.726 lr:0.0000100 epoch_Time:24730.0min: [2024-01-05 19:56:09,007][model8_pretrain.py][INFO] Epoch:[0/2](700600/4588595) loss:2.267 lr:0.0000100 epoch_Time:24730.0min: [2024-01-05 19:56:09,007][model8_pretrain.py][INFO] Epoch:[0/2](700600/4588595) loss:2.724 lr:0.0000100 epoch_Time:24730.0min: [2024-01-05 19:56:09,007][model8_pretrain.py][INFO] Epoch:[0/2](700600/4588595) loss:2.907 lr:0.0000100 epoch_Time:24730.0min: [2024-01-05 19:56:09,007][model8_pretrain.py][INFO] Epoch:[0/2](700600/4588595) loss:2.551 lr:0.0000100 epoch_Time:24730.0min: [2024-01-05 19:56:57,870][model8_pretrain.py][INFO] Epoch:[0/2](700700/4588595) loss:2.596 lr:0.0000100 epoch_Time:24731.0min: [2024-01-05 19:56:57,870][model8_pretrain.py][INFO] Epoch:[0/2](700700/4588595) loss:2.766 lr:0.0000100 epoch_Time:24731.0min: [2024-01-05 19:56:57,871][model8_pretrain.py][INFO] Epoch:[0/2](700700/4588595) loss:2.670 lr:0.0000100 epoch_Time:24731.0min: [2024-01-05 19:56:57,871][model8_pretrain.py][INFO] Epoch:[0/2](700700/4588595) loss:2.823 lr:0.0000100 epoch_Time:24731.0min: [2024-01-05 19:56:57,871][model8_pretrain.py][INFO] Epoch:[0/2](700700/4588595) loss:2.544 lr:0.0000100 epoch_Time:24731.0min: [2024-01-05 19:56:57,871][model8_pretrain.py][INFO] Epoch:[0/2](700700/4588595) loss:2.662 lr:0.0000100 epoch_Time:24731.0min: [2024-01-05 19:56:57,871][model8_pretrain.py][INFO] Epoch:[0/2](700700/4588595) loss:3.020 lr:0.0000100 epoch_Time:24731.0min: [2024-01-05 19:56:57,871][model8_pretrain.py][INFO] Epoch:[0/2](700700/4588595) loss:2.544 lr:0.0000100 epoch_Time:24731.0min: [2024-01-05 19:57:34,804][model8_pretrain.py][INFO] Epoch:[0/2](700800/4588595) loss:3.123 lr:0.0000100 epoch_Time:24730.0min: [2024-01-05 19:57:34,804][model8_pretrain.py][INFO] Epoch:[0/2](700800/4588595) loss:3.090 lr:0.0000100 
epoch_Time:24730.0min: [2024-01-05 19:57:34,804][model8_pretrain.py][INFO] Epoch:[0/2](700800/4588595) loss:2.626 lr:0.0000100 epoch_Time:24731.0min: [2024-01-05 19:57:34,804][model8_pretrain.py][INFO] Epoch:[0/2](700800/4588595) loss:3.103 lr:0.0000100 epoch_Time:24730.0min: [2024-01-05 19:57:34,804][model8_pretrain.py][INFO] Epoch:[0/2](700800/4588595) loss:2.755 lr:0.0000100 epoch_Time:24730.0min: [2024-01-05 19:57:34,804][model8_pretrain.py][INFO] Epoch:[0/2](700800/4588595) loss:3.162 lr:0.0000100 epoch_Time:24730.0min: [2024-01-05 19:57:34,804][model8_pretrain.py][INFO] Epoch:[0/2](700800/4588595) loss:2.964 lr:0.0000100 epoch_Time:24730.0min: [2024-01-05 19:57:34,804][model8_pretrain.py][INFO] Epoch:[0/2](700800/4588595) loss:3.162 lr:0.0000100 epoch_Time:24730.0min: [2024-01-05 19:58:11,747][model8_pretrain.py][INFO] Epoch:[0/2](700900/4588595) loss:2.212 lr:0.0000100 epoch_Time:24729.0min: [2024-01-05 19:58:11,747][model8_pretrain.py][INFO] Epoch:[0/2](700900/4588595) loss:2.503 lr:0.0000100 epoch_Time:24729.0min: [2024-01-05 19:58:11,747][model8_pretrain.py][INFO] Epoch:[0/2](700900/4588595) loss:2.985 lr:0.0000100 epoch_Time:24729.0min: [2024-01-05 19:58:11,747][model8_pretrain.py][INFO] Epoch:[0/2](700900/4588595) loss:2.742 lr:0.0000100 epoch_Time:24729.0min: [2024-01-05 19:58:11,747][model8_pretrain.py][INFO] Epoch:[0/2](700900/4588595) loss:2.893 lr:0.0000100 epoch_Time:24729.0min: [2024-01-05 19:58:11,747][model8_pretrain.py][INFO] Epoch:[0/2](700900/4588595) loss:2.763 lr:0.0000100 epoch_Time:24729.0min: [2024-01-05 19:58:11,747][model8_pretrain.py][INFO] Epoch:[0/2](700900/4588595) loss:2.984 lr:0.0000100 epoch_Time:24729.0min: [2024-01-05 19:58:11,747][model8_pretrain.py][INFO] Epoch:[0/2](700900/4588595) loss:2.460 lr:0.0000100 epoch_Time:24729.0min: [2024-01-05 19:58:48,691][model8_pretrain.py][INFO] Epoch:[0/2](701000/4588595) loss:2.572 lr:0.0000100 epoch_Time:24728.0min: [2024-01-05 19:58:48,691][model8_pretrain.py][INFO] 
Epoch:[0/2](701000/4588595) loss:2.412 lr:0.0000100 epoch_Time:24728.0min: [2024-01-05 19:58:48,691][model8_pretrain.py][INFO] Epoch:[0/2](701000/4588595) loss:3.281 lr:0.0000100 epoch_Time:24728.0min: [2024-01-05 19:58:48,691][model8_pretrain.py][INFO] Epoch:[0/2](701000/4588595) loss:3.379 lr:0.0000100 epoch_Time:24728.0min: [2024-01-05 19:58:48,691][model8_pretrain.py][INFO] Epoch:[0/2](701000/4588595) loss:2.825 lr:0.0000100 epoch_Time:24728.0min: [2024-01-05 19:58:48,691][model8_pretrain.py][INFO] Epoch:[0/2](701000/4588595) loss:2.985 lr:0.0000100 epoch_Time:24728.0min: [2024-01-05 19:58:48,691][model8_pretrain.py][INFO] Epoch:[0/2](701000/4588595) loss:2.649 lr:0.0000100 epoch_Time:24728.0min: [2024-01-05 19:58:48,691][model8_pretrain.py][INFO] Epoch:[0/2](701000/4588595) loss:2.352 lr:0.0000100 epoch_Time:24728.0min: [2024-01-05 19:59:25,631][model8_pretrain.py][INFO] Epoch:[0/2](701100/4588595) loss:2.484 lr:0.0000100 epoch_Time:24728.0min: [2024-01-05 19:59:25,631][model8_pretrain.py][INFO] Epoch:[0/2](701100/4588595) loss:3.163 lr:0.0000100 epoch_Time:24728.0min: [2024-01-05 19:59:25,631][model8_pretrain.py][INFO] Epoch:[0/2](701100/4588595) loss:3.461 lr:0.0000100 epoch_Time:24728.0min: [2024-01-05 19:59:25,631][model8_pretrain.py][INFO] Epoch:[0/2](701100/4588595) loss:2.966 lr:0.0000100 epoch_Time:24728.0min: [2024-01-05 19:59:25,631][model8_pretrain.py][INFO] Epoch:[0/2](701100/4588595) loss:3.083 lr:0.0000100 epoch_Time:24728.0min: [2024-01-05 19:59:25,631][model8_pretrain.py][INFO] Epoch:[0/2](701100/4588595) loss:2.612 lr:0.0000100 epoch_Time:24728.0min: [2024-01-05 19:59:25,631][model8_pretrain.py][INFO] Epoch:[0/2](701100/4588595) loss:2.612 lr:0.0000100 epoch_Time:24728.0min: [2024-01-05 19:59:25,631][model8_pretrain.py][INFO] Epoch:[0/2](701100/4588595) loss:2.269 lr:0.0000100 epoch_Time:24728.0min: [2024-01-05 20:00:02,600][model8_pretrain.py][INFO] Epoch:[0/2](701200/4588595) loss:3.148 lr:0.0000100 epoch_Time:24727.0min: [2024-01-05 
20:00:02,600][model8_pretrain.py][INFO] Epoch:[0/2](701200/4588595) loss:2.983 lr:0.0000100 epoch_Time:24727.0min: [2024-01-05 20:00:02,600][model8_pretrain.py][INFO] Epoch:[0/2](701200/4588595) loss:2.441 lr:0.0000100 epoch_Time:24727.0min: [2024-01-05 20:00:02,600][model8_pretrain.py][INFO] Epoch:[0/2](701200/4588595) loss:2.690 lr:0.0000100 epoch_Time:24727.0min: [2024-01-05 20:00:02,600][model8_pretrain.py][INFO] Epoch:[0/2](701200/4588595) loss:3.088 lr:0.0000100 epoch_Time:24727.0min: [2024-01-05 20:00:02,600][model8_pretrain.py][INFO] Epoch:[0/2](701200/4588595) loss:2.876 lr:0.0000100 epoch_Time:24727.0min: [2024-01-05 20:00:02,600][model8_pretrain.py][INFO] Epoch:[0/2](701200/4588595) loss:3.175 lr:0.0000100 epoch_Time:24727.0min: [2024-01-05 20:00:02,600][model8_pretrain.py][INFO] Epoch:[0/2](701200/4588595) loss:2.826 lr:0.0000100 epoch_Time:24727.0min: [2024-01-05 20:00:39,546][model8_pretrain.py][INFO] Epoch:[0/2](701300/4588595) loss:2.785 lr:0.0000100 epoch_Time:24727.0min: [2024-01-05 20:00:39,546][model8_pretrain.py][INFO] Epoch:[0/2](701300/4588595) loss:3.318 lr:0.0000100 epoch_Time:24727.0min: [2024-01-05 20:00:39,546][model8_pretrain.py][INFO] Epoch:[0/2](701300/4588595) loss:3.037 lr:0.0000100 epoch_Time:24727.0min: [2024-01-05 20:00:39,546][model8_pretrain.py][INFO] Epoch:[0/2](701300/4588595) loss:3.045 lr:0.0000100 epoch_Time:24727.0min: [2024-01-05 20:00:39,546][model8_pretrain.py][INFO] Epoch:[0/2](701300/4588595) loss:2.781 lr:0.0000100 epoch_Time:24727.0min: [2024-01-05 20:00:39,546][model8_pretrain.py][INFO] Epoch:[0/2](701300/4588595) loss:2.945 lr:0.0000100 epoch_Time:24727.0min: [2024-01-05 20:00:39,546][model8_pretrain.py][INFO] Epoch:[0/2](701300/4588595) loss:3.025 lr:0.0000100 epoch_Time:24727.0min: [2024-01-05 20:00:39,546][model8_pretrain.py][INFO] Epoch:[0/2](701300/4588595) loss:2.345 lr:0.0000100 epoch_Time:24727.0min: [2024-01-05 20:01:16,488][model8_pretrain.py][INFO] Epoch:[0/2](701400/4588595) loss:3.111 lr:0.0000100 
epoch_Time:24726.0min: [2024-01-05 20:01:16,488][model8_pretrain.py][INFO] Epoch:[0/2](701400/4588595) loss:2.695 lr:0.0000100 epoch_Time:24726.0min: [2024-01-05 20:01:16,488][model8_pretrain.py][INFO] Epoch:[0/2](701400/4588595) loss:3.205 lr:0.0000100 epoch_Time:24726.0min: [2024-01-05 20:01:16,488][model8_pretrain.py][INFO] Epoch:[0/2](701400/4588595) loss:2.971 lr:0.0000100 epoch_Time:24726.0min: [2024-01-05 20:01:16,488][model8_pretrain.py][INFO] Epoch:[0/2](701400/4588595) loss:2.883 lr:0.0000100 epoch_Time:24726.0min: [2024-01-05 20:01:16,488][model8_pretrain.py][INFO] Epoch:[0/2](701400/4588595) loss:2.658 lr:0.0000100 epoch_Time:24726.0min: [2024-01-05 20:01:16,489][model8_pretrain.py][INFO] Epoch:[0/2](701400/4588595) loss:3.064 lr:0.0000100 epoch_Time:24726.0min: [2024-01-05 20:01:16,490][model8_pretrain.py][INFO] Epoch:[0/2](701400/4588595) loss:3.151 lr:0.0000100 epoch_Time:24726.0min: [2024-01-05 20:02:05,541][model8_pretrain.py][INFO] Epoch:[0/2](701500/4588595) loss:2.609 lr:0.0000100 epoch_Time:24726.0min: [2024-01-05 20:02:05,541][model8_pretrain.py][INFO] Epoch:[0/2](701500/4588595) loss:2.932 lr:0.0000100 epoch_Time:24726.0min: [2024-01-05 20:02:05,541][model8_pretrain.py][INFO] Epoch:[0/2](701500/4588595) loss:3.102 lr:0.0000100 epoch_Time:24726.0min: [2024-01-05 20:02:05,541][model8_pretrain.py][INFO] Epoch:[0/2](701500/4588595) loss:2.654 lr:0.0000100 epoch_Time:24726.0min: [2024-01-05 20:02:05,541][model8_pretrain.py][INFO] Epoch:[0/2](701500/4588595) loss:3.034 lr:0.0000100 epoch_Time:24726.0min: [2024-01-05 20:02:05,541][model8_pretrain.py][INFO] Epoch:[0/2](701500/4588595) loss:2.979 lr:0.0000100 epoch_Time:24726.0min: [2024-01-05 20:02:05,541][model8_pretrain.py][INFO] Epoch:[0/2](701500/4588595) loss:2.117 lr:0.0000100 epoch_Time:24726.0min: [2024-01-05 20:02:05,541][model8_pretrain.py][INFO] Epoch:[0/2](701500/4588595) loss:3.210 lr:0.0000100 epoch_Time:24726.0min: [2024-01-05 20:02:42,469][model8_pretrain.py][INFO] 
Epoch:[0/2](701600/4588595) loss:2.435 lr:0.0000100 epoch_Time:24726.0min: [2024-01-05 20:02:42,469][model8_pretrain.py][INFO] Epoch:[0/2](701600/4588595) loss:3.191 lr:0.0000100 epoch_Time:24726.0min: [2024-01-05 20:02:42,469][model8_pretrain.py][INFO] Epoch:[0/2](701600/4588595) loss:2.877 lr:0.0000100 epoch_Time:24726.0min: [2024-01-05 20:02:42,469][model8_pretrain.py][INFO] Epoch:[0/2](701600/4588595) loss:2.984 lr:0.0000100 epoch_Time:24726.0min: [2024-01-05 20:02:42,469][model8_pretrain.py][INFO] Epoch:[0/2](701600/4588595) loss:2.966 lr:0.0000100 epoch_Time:24726.0min: [2024-01-05 20:02:42,469][model8_pretrain.py][INFO] Epoch:[0/2](701600/4588595) loss:2.416 lr:0.0000100 epoch_Time:24726.0min: [2024-01-05 20:02:42,469][model8_pretrain.py][INFO] Epoch:[0/2](701600/4588595) loss:3.054 lr:0.0000100 epoch_Time:24726.0min: [2024-01-05 20:02:42,470][model8_pretrain.py][INFO] Epoch:[0/2](701600/4588595) loss:3.238 lr:0.0000100 epoch_Time:24726.0min: [2024-01-05 20:03:19,396][model8_pretrain.py][INFO] Epoch:[0/2](701700/4588595) loss:2.857 lr:0.0000100 epoch_Time:24725.0min: [2024-01-05 20:03:19,396][model8_pretrain.py][INFO] Epoch:[0/2](701700/4588595) loss:3.182 lr:0.0000100 epoch_Time:24725.0min: [2024-01-05 20:03:19,396][model8_pretrain.py][INFO] Epoch:[0/2](701700/4588595) loss:3.405 lr:0.0000100 epoch_Time:24725.0min: [2024-01-05 20:03:19,396][model8_pretrain.py][INFO] Epoch:[0/2](701700/4588595) loss:2.896 lr:0.0000100 epoch_Time:24725.0min: [2024-01-05 20:03:19,396][model8_pretrain.py][INFO] Epoch:[0/2](701700/4588595) loss:3.495 lr:0.0000100 epoch_Time:24725.0min: [2024-01-05 20:03:19,396][model8_pretrain.py][INFO] Epoch:[0/2](701700/4588595) loss:2.571 lr:0.0000100 epoch_Time:24725.0min: [2024-01-05 20:03:19,396][model8_pretrain.py][INFO] Epoch:[0/2](701700/4588595) loss:3.087 lr:0.0000100 epoch_Time:24725.0min: [2024-01-05 20:03:19,397][model8_pretrain.py][INFO] Epoch:[0/2](701700/4588595) loss:2.568 lr:0.0000100 epoch_Time:24725.0min: [2024-01-05 
20:03:56,319][model8_pretrain.py][INFO] Epoch:[0/2](701800/4588595) loss:1.909 lr:0.0000100 epoch_Time:24723.0min: [2024-01-05 20:03:56,319][model8_pretrain.py][INFO] Epoch:[0/2](701800/4588595) loss:2.024 lr:0.0000100 epoch_Time:24723.0min: [2024-01-05 20:03:56,319][model8_pretrain.py][INFO] Epoch:[0/2](701800/4588595) loss:2.814 lr:0.0000100 epoch_Time:24723.0min: [2024-01-05 20:03:56,319][model8_pretrain.py][INFO] Epoch:[0/2](701800/4588595) loss:2.883 lr:0.0000100 epoch_Time:24723.0min: [2024-01-05 20:03:56,319][model8_pretrain.py][INFO] Epoch:[0/2](701800/4588595) loss:3.074 lr:0.0000100 epoch_Time:24723.0min: [2024-01-05 20:03:56,319][model8_pretrain.py][INFO] Epoch:[0/2](701800/4588595) loss:2.610 lr:0.0000100 epoch_Time:24723.0min: [2024-01-05 20:03:56,319][model8_pretrain.py][INFO] Epoch:[0/2](701800/4588595) loss:2.939 lr:0.0000100 epoch_Time:24723.0min: [2024-01-05 20:03:56,320][model8_pretrain.py][INFO] Epoch:[0/2](701800/4588595) loss:3.141 lr:0.0000100 epoch_Time:24723.0min: [2024-01-05 20:04:33,264][model8_pretrain.py][INFO] Epoch:[0/2](701900/4588595) loss:2.829 lr:0.0000100 epoch_Time:24723.0min: [2024-01-05 20:04:33,264][model8_pretrain.py][INFO] Epoch:[0/2](701900/4588595) loss:3.065 lr:0.0000100 epoch_Time:24723.0min: [2024-01-05 20:04:33,264][model8_pretrain.py][INFO] Epoch:[0/2](701900/4588595) loss:3.329 lr:0.0000100 epoch_Time:24723.0min: [2024-01-05 20:04:33,264][model8_pretrain.py][INFO] Epoch:[0/2](701900/4588595) loss:2.727 lr:0.0000100 epoch_Time:24723.0min: [2024-01-05 20:04:33,264][model8_pretrain.py][INFO] Epoch:[0/2](701900/4588595) loss:3.142 lr:0.0000100 epoch_Time:24723.0min: [2024-01-05 20:04:33,264][model8_pretrain.py][INFO] Epoch:[0/2](701900/4588595) loss:2.833 lr:0.0000100 epoch_Time:24723.0min: [2024-01-05 20:04:33,264][model8_pretrain.py][INFO] Epoch:[0/2](701900/4588595) loss:3.200 lr:0.0000100 epoch_Time:24723.0min: [2024-01-05 20:04:33,264][model8_pretrain.py][INFO] Epoch:[0/2](701900/4588595) loss:2.557 lr:0.0000100 
epoch_Time:24723.0min: [2024-01-05 20:05:10,201][model8_pretrain.py][INFO] Epoch:[0/2](702000/4588595) loss:3.045 lr:0.0000100 epoch_Time:24722.0min: [2024-01-05 20:05:10,201][model8_pretrain.py][INFO] Epoch:[0/2](702000/4588595) loss:2.955 lr:0.0000100 epoch_Time:24722.0min: [2024-01-05 20:05:10,201][model8_pretrain.py][INFO] Epoch:[0/2](702000/4588595) loss:2.403 lr:0.0000100 epoch_Time:24722.0min: [2024-01-05 20:05:10,201][model8_pretrain.py][INFO] Epoch:[0/2](702000/4588595) loss:2.987 lr:0.0000100 epoch_Time:24722.0min: [2024-01-05 20:05:10,201][model8_pretrain.py][INFO] Epoch:[0/2](702000/4588595) loss:2.543 lr:0.0000100 epoch_Time:24722.0min: [2024-01-05 20:05:10,201][model8_pretrain.py][INFO] Epoch:[0/2](702000/4588595) loss:2.392 lr:0.0000100 epoch_Time:24722.0min: [2024-01-05 20:05:10,201][model8_pretrain.py][INFO] Epoch:[0/2](702000/4588595) loss:2.624 lr:0.0000100 epoch_Time:24722.0min: [2024-01-05 20:05:10,201][model8_pretrain.py][INFO] Epoch:[0/2](702000/4588595) loss:2.468 lr:0.0000100 epoch_Time:24722.0min: [2024-01-05 20:05:47,135][model8_pretrain.py][INFO] Epoch:[0/2](702100/4588595) loss:2.864 lr:0.0000100 epoch_Time:24722.0min: [2024-01-05 20:05:47,135][model8_pretrain.py][INFO] Epoch:[0/2](702100/4588595) loss:2.444 lr:0.0000100 epoch_Time:24722.0min: [2024-01-05 20:05:47,135][model8_pretrain.py][INFO] Epoch:[0/2](702100/4588595) loss:3.151 lr:0.0000100 epoch_Time:24722.0min: [2024-01-05 20:05:47,135][model8_pretrain.py][INFO] Epoch:[0/2](702100/4588595) loss:2.803 lr:0.0000100 epoch_Time:24722.0min: [2024-01-05 20:05:47,135][model8_pretrain.py][INFO] Epoch:[0/2](702100/4588595) loss:2.949 lr:0.0000100 epoch_Time:24722.0min: [2024-01-05 20:05:47,135][model8_pretrain.py][INFO] Epoch:[0/2](702100/4588595) loss:3.014 lr:0.0000100 epoch_Time:24722.0min: [2024-01-05 20:05:47,135][model8_pretrain.py][INFO] Epoch:[0/2](702100/4588595) loss:2.598 lr:0.0000100 epoch_Time:24722.0min: [2024-01-05 20:05:47,135][model8_pretrain.py][INFO] 
Epoch:[0/2](702100/4588595) loss:2.817 lr:0.0000100 epoch_Time:24722.0min: [2024-01-05 20:06:24,079][model8_pretrain.py][INFO] Epoch:[0/2](702200/4588595) loss:2.705 lr:0.0000100 epoch_Time:24721.0min: [2024-01-05 20:06:24,079][model8_pretrain.py][INFO] Epoch:[0/2](702200/4588595) loss:2.882 lr:0.0000100 epoch_Time:24721.0min: [2024-01-05 20:06:24,079][model8_pretrain.py][INFO] Epoch:[0/2](702200/4588595) loss:3.217 lr:0.0000100 epoch_Time:24721.0min: [2024-01-05 20:06:24,079][model8_pretrain.py][INFO] Epoch:[0/2](702200/4588595) loss:2.433 lr:0.0000100 epoch_Time:24721.0min: [2024-01-05 20:06:24,079][model8_pretrain.py][INFO] Epoch:[0/2](702200/4588595) loss:2.561 lr:0.0000100 epoch_Time:24721.0min: [2024-01-05 20:06:24,079][model8_pretrain.py][INFO] Epoch:[0/2](702200/4588595) loss:2.593 lr:0.0000100 epoch_Time:24721.0min: [2024-01-05 20:06:24,079][model8_pretrain.py][INFO] Epoch:[0/2](702200/4588595) loss:2.350 lr:0.0000100 epoch_Time:24721.0min: [2024-01-05 20:06:24,079][model8_pretrain.py][INFO] Epoch:[0/2](702200/4588595) loss:3.238 lr:0.0000100 epoch_Time:24721.0min: [2024-01-05 20:07:13,160][model8_pretrain.py][INFO] Epoch:[0/2](702300/4588595) loss:2.290 lr:0.0000100 epoch_Time:24721.0min: [2024-01-05 20:07:13,160][model8_pretrain.py][INFO] Epoch:[0/2](702300/4588595) loss:2.948 lr:0.0000100 epoch_Time:24721.0min: [2024-01-05 20:07:13,160][model8_pretrain.py][INFO] Epoch:[0/2](702300/4588595) loss:1.936 lr:0.0000100 epoch_Time:24721.0min: [2024-01-05 20:07:13,160][model8_pretrain.py][INFO] Epoch:[0/2](702300/4588595) loss:2.563 lr:0.0000100 epoch_Time:24721.0min: [2024-01-05 20:07:13,160][model8_pretrain.py][INFO] Epoch:[0/2](702300/4588595) loss:2.786 lr:0.0000100 epoch_Time:24721.0min: [2024-01-05 20:07:13,160][model8_pretrain.py][INFO] Epoch:[0/2](702300/4588595) loss:2.218 lr:0.0000100 epoch_Time:24721.0min: [2024-01-05 20:07:13,160][model8_pretrain.py][INFO] Epoch:[0/2](702300/4588595) loss:2.693 lr:0.0000100 epoch_Time:24721.0min: [2024-01-05 
20:07:13,160][model8_pretrain.py][INFO] Epoch:[0/2](702300/4588595) loss:2.678 lr:0.0000100 epoch_Time:24721.0min: [2024-01-05 20:07:50,090][model8_pretrain.py][INFO] Epoch:[0/2](702400/4588595) loss:2.743 lr:0.0000100 epoch_Time:24720.0min: [2024-01-05 20:07:50,090][model8_pretrain.py][INFO] Epoch:[0/2](702400/4588595) loss:2.986 lr:0.0000100 epoch_Time:24720.0min: [2024-01-05 20:07:50,090][model8_pretrain.py][INFO] Epoch:[0/2](702400/4588595) loss:2.706 lr:0.0000100 epoch_Time:24720.0min: [2024-01-05 20:07:50,090][model8_pretrain.py][INFO] Epoch:[0/2](702400/4588595) loss:2.929 lr:0.0000100 epoch_Time:24720.0min: [2024-01-05 20:07:50,090][model8_pretrain.py][INFO] Epoch:[0/2](702400/4588595) loss:2.713 lr:0.0000100 epoch_Time:24720.0min: [2024-01-05 20:07:50,090][model8_pretrain.py][INFO] Epoch:[0/2](702400/4588595) loss:3.048 lr:0.0000100 epoch_Time:24720.0min: [2024-01-05 20:07:50,090][model8_pretrain.py][INFO] Epoch:[0/2](702400/4588595) loss:2.929 lr:0.0000100 epoch_Time:24720.0min: [2024-01-05 20:07:50,090][model8_pretrain.py][INFO] Epoch:[0/2](702400/4588595) loss:2.590 lr:0.0000100 epoch_Time:24720.0min: [2024-01-05 20:08:27,033][model8_pretrain.py][INFO] Epoch:[0/2](702500/4588595) loss:3.208 lr:0.0000100 epoch_Time:24720.0min: [2024-01-05 20:08:27,033][model8_pretrain.py][INFO] Epoch:[0/2](702500/4588595) loss:2.968 lr:0.0000100 epoch_Time:24720.0min: [2024-01-05 20:08:27,033][model8_pretrain.py][INFO] Epoch:[0/2](702500/4588595) loss:3.132 lr:0.0000100 epoch_Time:24720.0min: [2024-01-05 20:08:27,033][model8_pretrain.py][INFO] Epoch:[0/2](702500/4588595) loss:2.564 lr:0.0000100 epoch_Time:24720.0min: [2024-01-05 20:08:27,033][model8_pretrain.py][INFO] Epoch:[0/2](702500/4588595) loss:2.841 lr:0.0000100 epoch_Time:24720.0min: [2024-01-05 20:08:27,033][model8_pretrain.py][INFO] Epoch:[0/2](702500/4588595) loss:2.820 lr:0.0000100 epoch_Time:24720.0min: [2024-01-05 20:08:27,033][model8_pretrain.py][INFO] Epoch:[0/2](702500/4588595) loss:3.346 lr:0.0000100 
epoch_Time:24720.0min: [2024-01-05 20:08:27,033][model8_pretrain.py][INFO] Epoch:[0/2](702500/4588595) loss:2.445 lr:0.0000100 epoch_Time:24720.0min: [2024-01-05 20:09:03,982][model8_pretrain.py][INFO] Epoch:[0/2](702600/4588595) loss:2.783 lr:0.0000100 epoch_Time:24719.0min: [2024-01-05 20:09:03,982][model8_pretrain.py][INFO] Epoch:[0/2](702600/4588595) loss:3.120 lr:0.0000100 epoch_Time:24719.0min: [2024-01-05 20:09:03,982][model8_pretrain.py][INFO] Epoch:[0/2](702600/4588595) loss:3.607 lr:0.0000100 epoch_Time:24719.0min: [2024-01-05 20:09:03,982][model8_pretrain.py][INFO] Epoch:[0/2](702600/4588595) loss:2.531 lr:0.0000100 epoch_Time:24719.0min: [2024-01-05 20:09:03,982][model8_pretrain.py][INFO] Epoch:[0/2](702600/4588595) loss:2.810 lr:0.0000100 epoch_Time:24719.0min: [2024-01-05 20:09:03,982][model8_pretrain.py][INFO] Epoch:[0/2](702600/4588595) loss:2.855 lr:0.0000100 epoch_Time:24719.0min: [2024-01-05 20:09:03,982][model8_pretrain.py][INFO] Epoch:[0/2](702600/4588595) loss:3.079 lr:0.0000100 epoch_Time:24719.0min: [2024-01-05 20:09:03,982][model8_pretrain.py][INFO] Epoch:[0/2](702600/4588595) loss:2.920 lr:0.0000100 epoch_Time:24719.0min: [2024-01-05 20:09:40,922][model8_pretrain.py][INFO] Epoch:[0/2](702700/4588595) loss:3.053 lr:0.0000100 epoch_Time:24719.0min: [2024-01-05 20:09:40,922][model8_pretrain.py][INFO] Epoch:[0/2](702700/4588595) loss:2.727 lr:0.0000100 epoch_Time:24719.0min: [2024-01-05 20:09:40,922][model8_pretrain.py][INFO] Epoch:[0/2](702700/4588595) loss:2.635 lr:0.0000100 epoch_Time:24719.0min: [2024-01-05 20:09:40,922][model8_pretrain.py][INFO] Epoch:[0/2](702700/4588595) loss:2.864 lr:0.0000100 epoch_Time:24719.0min: [2024-01-05 20:09:40,922][model8_pretrain.py][INFO] Epoch:[0/2](702700/4588595) loss:2.880 lr:0.0000100 epoch_Time:24719.0min: [2024-01-05 20:09:40,922][model8_pretrain.py][INFO] Epoch:[0/2](702700/4588595) loss:2.946 lr:0.0000100 epoch_Time:24719.0min: [2024-01-05 20:09:40,922][model8_pretrain.py][INFO] 
Epoch:[0/2](702700/4588595) loss:3.511 lr:0.0000100 epoch_Time:24719.0min: [2024-01-05 20:09:40,922][model8_pretrain.py][INFO] Epoch:[0/2](702700/4588595) loss:2.442 lr:0.0000100 epoch_Time:24719.0min: [2024-01-05 20:10:17,865][model8_pretrain.py][INFO] Epoch:[0/2](702800/4588595) loss:3.358 lr:0.0000100 epoch_Time:24717.0min: [2024-01-05 20:10:17,865][model8_pretrain.py][INFO] Epoch:[0/2](702800/4588595) loss:2.721 lr:0.0000100 epoch_Time:24717.0min: [2024-01-05 20:10:17,865][model8_pretrain.py][INFO] Epoch:[0/2](702800/4588595) loss:2.794 lr:0.0000100 epoch_Time:24717.0min: [2024-01-05 20:10:17,865][model8_pretrain.py][INFO] Epoch:[0/2](702800/4588595) loss:2.787 lr:0.0000100 epoch_Time:24717.0min: [2024-01-05 20:10:17,865][model8_pretrain.py][INFO] Epoch:[0/2](702800/4588595) loss:2.882 lr:0.0000100 epoch_Time:24717.0min: [2024-01-05 20:10:17,865][model8_pretrain.py][INFO] Epoch:[0/2](702800/4588595) loss:2.603 lr:0.0000100 epoch_Time:24717.0min: [2024-01-05 20:10:17,865][model8_pretrain.py][INFO] Epoch:[0/2](702800/4588595) loss:2.839 lr:0.0000100 epoch_Time:24717.0min: [2024-01-05 20:10:17,865][model8_pretrain.py][INFO] Epoch:[0/2](702800/4588595) loss:3.090 lr:0.0000100 epoch_Time:24717.0min: [2024-01-05 20:10:54,816][model8_pretrain.py][INFO] Epoch:[0/2](702900/4588595) loss:2.560 lr:0.0000100 epoch_Time:24716.0min: [2024-01-05 20:10:54,816][model8_pretrain.py][INFO] Epoch:[0/2](702900/4588595) loss:2.554 lr:0.0000100 epoch_Time:24716.0min: [2024-01-05 20:10:54,816][model8_pretrain.py][INFO] Epoch:[0/2](702900/4588595) loss:3.106 lr:0.0000100 epoch_Time:24716.0min: [2024-01-05 20:10:54,816][model8_pretrain.py][INFO] Epoch:[0/2](702900/4588595) loss:2.214 lr:0.0000100 epoch_Time:24716.0min: [2024-01-05 20:10:54,816][model8_pretrain.py][INFO] Epoch:[0/2](702900/4588595) loss:2.619 lr:0.0000100 epoch_Time:24716.0min: [2024-01-05 20:10:54,816][model8_pretrain.py][INFO] Epoch:[0/2](702900/4588595) loss:3.153 lr:0.0000100 epoch_Time:24716.0min: [2024-01-05 
20:10:54,816][model8_pretrain.py][INFO] Epoch:[0/2](702900/4588595) loss:2.539 lr:0.0000100 epoch_Time:24716.0min: [2024-01-05 20:10:54,816][model8_pretrain.py][INFO] Epoch:[0/2](702900/4588595) loss:3.135 lr:0.0000100 epoch_Time:24716.0min: [2024-01-05 20:11:31,765][model8_pretrain.py][INFO] Epoch:[0/2](703000/4588595) loss:2.724 lr:0.0000100 epoch_Time:24716.0min: [2024-01-05 20:11:31,765][model8_pretrain.py][INFO] Epoch:[0/2](703000/4588595) loss:2.582 lr:0.0000100 epoch_Time:24716.0min: [2024-01-05 20:11:31,765][model8_pretrain.py][INFO] Epoch:[0/2](703000/4588595) loss:2.843 lr:0.0000100 epoch_Time:24716.0min: [2024-01-05 20:11:31,765][model8_pretrain.py][INFO] Epoch:[0/2](703000/4588595) loss:2.066 lr:0.0000100 epoch_Time:24716.0min: [2024-01-05 20:11:31,765][model8_pretrain.py][INFO] Epoch:[0/2](703000/4588595) loss:2.900 lr:0.0000100 epoch_Time:24716.0min: [2024-01-05 20:11:31,765][model8_pretrain.py][INFO] Epoch:[0/2](703000/4588595) loss:3.120 lr:0.0000100 epoch_Time:24716.0min: [2024-01-05 20:11:31,765][model8_pretrain.py][INFO] Epoch:[0/2](703000/4588595) loss:2.878 lr:0.0000100 epoch_Time:24716.0min: [2024-01-05 20:11:31,765][model8_pretrain.py][INFO] Epoch:[0/2](703000/4588595) loss:2.527 lr:0.0000100 epoch_Time:24716.0min: [2024-01-05 20:12:20,890][model8_pretrain.py][INFO] Epoch:[0/2](703100/4588595) loss:2.949 lr:0.0000100 epoch_Time:24716.0min: [2024-01-05 20:12:20,890][model8_pretrain.py][INFO] Epoch:[0/2](703100/4588595) loss:2.280 lr:0.0000100 epoch_Time:24716.0min: [2024-01-05 20:12:20,890][model8_pretrain.py][INFO] Epoch:[0/2](703100/4588595) loss:3.477 lr:0.0000100 epoch_Time:24716.0min: [2024-01-05 20:12:20,890][model8_pretrain.py][INFO] Epoch:[0/2](703100/4588595) loss:3.116 lr:0.0000100 epoch_Time:24716.0min: [2024-01-05 20:12:20,890][model8_pretrain.py][INFO] Epoch:[0/2](703100/4588595) loss:2.706 lr:0.0000100 epoch_Time:24716.0min: [2024-01-05 20:12:20,890][model8_pretrain.py][INFO] Epoch:[0/2](703100/4588595) loss:2.824 lr:0.0000100 
epoch_Time:24716.0min: [2024-01-05 20:12:20,890][model8_pretrain.py][INFO] Epoch:[0/2](703100/4588595) loss:3.125 lr:0.0000100 epoch_Time:24716.0min: [2024-01-05 20:12:20,890][model8_pretrain.py][INFO] Epoch:[0/2](703100/4588595) loss:2.434 lr:0.0000100 epoch_Time:24716.0min: [2024-01-05 20:12:57,829][model8_pretrain.py][INFO] Epoch:[0/2](703200/4588595) loss:2.800 lr:0.0000100 epoch_Time:24715.0min: [2024-01-05 20:12:57,829][model8_pretrain.py][INFO] Epoch:[0/2](703200/4588595) loss:2.636 lr:0.0000100 epoch_Time:24715.0min: [2024-01-05 20:12:57,829][model8_pretrain.py][INFO] Epoch:[0/2](703200/4588595) loss:2.686 lr:0.0000100 epoch_Time:24715.0min: [2024-01-05 20:12:57,829][model8_pretrain.py][INFO] Epoch:[0/2](703200/4588595) loss:2.651 lr:0.0000100 epoch_Time:24715.0min: [2024-01-05 20:12:57,829][model8_pretrain.py][INFO] Epoch:[0/2](703200/4588595) loss:2.911 lr:0.0000100 epoch_Time:24715.0min: [2024-01-05 20:12:57,829][model8_pretrain.py][INFO] Epoch:[0/2](703200/4588595) loss:2.328 lr:0.0000100 epoch_Time:24715.0min: [2024-01-05 20:12:57,829][model8_pretrain.py][INFO] Epoch:[0/2](703200/4588595) loss:2.645 lr:0.0000100 epoch_Time:24715.0min: [2024-01-05 20:12:57,830][model8_pretrain.py][INFO] Epoch:[0/2](703200/4588595) loss:2.476 lr:0.0000100 epoch_Time:24715.0min: [2024-01-05 20:13:34,747][model8_pretrain.py][INFO] Epoch:[0/2](703300/4588595) loss:2.863 lr:0.0000100 epoch_Time:24715.0min: [2024-01-05 20:13:34,747][model8_pretrain.py][INFO] Epoch:[0/2](703300/4588595) loss:3.356 lr:0.0000100 epoch_Time:24715.0min: [2024-01-05 20:13:34,748][model8_pretrain.py][INFO] Epoch:[0/2](703300/4588595) loss:2.004 lr:0.0000100 epoch_Time:24715.0min: [2024-01-05 20:13:34,748][model8_pretrain.py][INFO] Epoch:[0/2](703300/4588595) loss:2.402 lr:0.0000100 epoch_Time:24715.0min: [2024-01-05 20:13:34,748][model8_pretrain.py][INFO] Epoch:[0/2](703300/4588595) loss:2.857 lr:0.0000100 epoch_Time:24715.0min: [2024-01-05 20:13:34,748][model8_pretrain.py][INFO] 
Epoch:[0/2](703300/4588595) loss:2.630 lr:0.0000100 epoch_Time:24715.0min: [2024-01-05 20:13:34,748][model8_pretrain.py][INFO] Epoch:[0/2](703300/4588595) loss:2.954 lr:0.0000100 epoch_Time:24715.0min: [2024-01-05 20:13:34,748][model8_pretrain.py][INFO] Epoch:[0/2](703300/4588595) loss:2.971 lr:0.0000100 epoch_Time:24715.0min: [2024-01-05 20:14:11,702][model8_pretrain.py][INFO] Epoch:[0/2](703400/4588595) loss:3.384 lr:0.0000100 epoch_Time:24714.0min: [2024-01-05 20:14:11,702][model8_pretrain.py][INFO] Epoch:[0/2](703400/4588595) loss:2.697 lr:0.0000100 epoch_Time:24714.0min: [2024-01-05 20:14:11,702][model8_pretrain.py][INFO] Epoch:[0/2](703400/4588595) loss:2.551 lr:0.0000100 epoch_Time:24714.0min: [2024-01-05 20:14:11,702][model8_pretrain.py][INFO] Epoch:[0/2](703400/4588595) loss:2.943 lr:0.0000100 epoch_Time:24714.0min: [2024-01-05 20:14:11,702][model8_pretrain.py][INFO] Epoch:[0/2](703400/4588595) loss:2.699 lr:0.0000100 epoch_Time:24714.0min: [2024-01-05 20:14:11,702][model8_pretrain.py][INFO] Epoch:[0/2](703400/4588595) loss:2.775 lr:0.0000100 epoch_Time:24714.0min: [2024-01-05 20:14:11,702][model8_pretrain.py][INFO] Epoch:[0/2](703400/4588595) loss:2.772 lr:0.0000100 epoch_Time:24714.0min: [2024-01-05 20:14:11,702][model8_pretrain.py][INFO] Epoch:[0/2](703400/4588595) loss:3.117 lr:0.0000100 epoch_Time:24714.0min: [2024-01-05 20:14:48,645][model8_pretrain.py][INFO] Epoch:[0/2](703500/4588595) loss:2.711 lr:0.0000100 epoch_Time:24713.0min: [2024-01-05 20:14:48,645][model8_pretrain.py][INFO] Epoch:[0/2](703500/4588595) loss:3.017 lr:0.0000100 epoch_Time:24713.0min: [2024-01-05 20:14:48,645][model8_pretrain.py][INFO] Epoch:[0/2](703500/4588595) loss:2.575 lr:0.0000100 epoch_Time:24713.0min: [2024-01-05 20:14:48,645][model8_pretrain.py][INFO] Epoch:[0/2](703500/4588595) loss:2.689 lr:0.0000100 epoch_Time:24713.0min: [2024-01-05 20:14:48,645][model8_pretrain.py][INFO] Epoch:[0/2](703500/4588595) loss:3.168 lr:0.0000100 epoch_Time:24713.0min: [2024-01-05 
20:14:48,645][model8_pretrain.py][INFO] Epoch:[0/2](703500/4588595) loss:2.812 lr:0.0000100 epoch_Time:24713.0min: [2024-01-05 20:14:48,645][model8_pretrain.py][INFO] Epoch:[0/2](703500/4588595) loss:3.215 lr:0.0000100 epoch_Time:24713.0min: [2024-01-05 20:14:48,645][model8_pretrain.py][INFO] Epoch:[0/2](703500/4588595) loss:2.686 lr:0.0000100 epoch_Time:24713.0min: [2024-01-05 20:15:25,581][model8_pretrain.py][INFO] Epoch:[0/2](703600/4588595) loss:2.896 lr:0.0000100 epoch_Time:24713.0min: [2024-01-05 20:15:25,581][model8_pretrain.py][INFO] Epoch:[0/2](703600/4588595) loss:2.741 lr:0.0000100 epoch_Time:24713.0min: [2024-01-05 20:15:25,581][model8_pretrain.py][INFO] Epoch:[0/2](703600/4588595) loss:2.014 lr:0.0000100 epoch_Time:24713.0min: [2024-01-05 20:15:25,581][model8_pretrain.py][INFO] Epoch:[0/2](703600/4588595) loss:2.459 lr:0.0000100 epoch_Time:24713.0min: [2024-01-05 20:15:25,581][model8_pretrain.py][INFO] Epoch:[0/2](703600/4588595) loss:2.339 lr:0.0000100 epoch_Time:24713.0min: [2024-01-05 20:15:25,581][model8_pretrain.py][INFO] Epoch:[0/2](703600/4588595) loss:2.840 lr:0.0000100 epoch_Time:24713.0min: [2024-01-05 20:15:25,581][model8_pretrain.py][INFO] Epoch:[0/2](703600/4588595) loss:2.478 lr:0.0000100 epoch_Time:24713.0min: [2024-01-05 20:15:25,581][model8_pretrain.py][INFO] Epoch:[0/2](703600/4588595) loss:3.123 lr:0.0000100 epoch_Time:24713.0min: [2024-01-05 20:16:02,532][model8_pretrain.py][INFO] Epoch:[0/2](703700/4588595) loss:2.783 lr:0.0000100 epoch_Time:24712.0min: [2024-01-05 20:16:02,532][model8_pretrain.py][INFO] Epoch:[0/2](703700/4588595) loss:3.239 lr:0.0000100 epoch_Time:24712.0min: [2024-01-05 20:16:02,532][model8_pretrain.py][INFO] Epoch:[0/2](703700/4588595) loss:2.975 lr:0.0000100 epoch_Time:24712.0min: [2024-01-05 20:16:02,532][model8_pretrain.py][INFO] Epoch:[0/2](703700/4588595) loss:3.290 lr:0.0000100 epoch_Time:24712.0min: [2024-01-05 20:16:02,532][model8_pretrain.py][INFO] Epoch:[0/2](703700/4588595) loss:2.474 lr:0.0000100 
epoch_Time:24712.0min: [2024-01-05 20:16:02,532][model8_pretrain.py][INFO] Epoch:[0/2](703700/4588595) loss:2.574 lr:0.0000100 epoch_Time:24712.0min: [2024-01-05 20:16:02,532][model8_pretrain.py][INFO] Epoch:[0/2](703700/4588595) loss:2.439 lr:0.0000100 epoch_Time:24712.0min: [2024-01-05 20:16:02,532][model8_pretrain.py][INFO] Epoch:[0/2](703700/4588595) loss:2.782 lr:0.0000100 epoch_Time:24712.0min: [2024-01-05 20:16:39,477][model8_pretrain.py][INFO] Epoch:[0/2](703800/4588595) loss:3.354 lr:0.0000100 epoch_Time:24711.0min: [2024-01-05 20:16:39,477][model8_pretrain.py][INFO] Epoch:[0/2](703800/4588595) loss:3.150 lr:0.0000100 epoch_Time:24711.0min: [2024-01-05 20:16:39,477][model8_pretrain.py][INFO] Epoch:[0/2](703800/4588595) loss:2.837 lr:0.0000100 epoch_Time:24711.0min: [2024-01-05 20:16:39,477][model8_pretrain.py][INFO] Epoch:[0/2](703800/4588595) loss:3.244 lr:0.0000100 epoch_Time:24711.0min: [2024-01-05 20:16:39,477][model8_pretrain.py][INFO] Epoch:[0/2](703800/4588595) loss:2.742 lr:0.0000100 epoch_Time:24711.0min: [2024-01-05 20:16:39,477][model8_pretrain.py][INFO] Epoch:[0/2](703800/4588595) loss:2.774 lr:0.0000100 epoch_Time:24711.0min: [2024-01-05 20:16:39,477][model8_pretrain.py][INFO] Epoch:[0/2](703800/4588595) loss:3.041 lr:0.0000100 epoch_Time:24711.0min: [2024-01-05 20:16:39,477][model8_pretrain.py][INFO] Epoch:[0/2](703800/4588595) loss:2.580 lr:0.0000100 epoch_Time:24711.0min: [2024-01-05 20:17:28,394][model8_pretrain.py][INFO] Epoch:[0/2](703900/4588595) loss:2.573 lr:0.0000100 epoch_Time:24712.0min: [2024-01-05 20:17:28,394][model8_pretrain.py][INFO] Epoch:[0/2](703900/4588595) loss:3.023 lr:0.0000100 epoch_Time:24712.0min: [2024-01-05 20:17:28,394][model8_pretrain.py][INFO] Epoch:[0/2](703900/4588595) loss:3.141 lr:0.0000100 epoch_Time:24712.0min: [2024-01-05 20:17:28,394][model8_pretrain.py][INFO] Epoch:[0/2](703900/4588595) loss:2.765 lr:0.0000100 epoch_Time:24712.0min: [2024-01-05 20:17:28,394][model8_pretrain.py][INFO] 
Epoch:[0/2](703900/4588595) loss:2.655 lr:0.0000100 epoch_Time:24712.0min: [2024-01-05 20:17:28,394][model8_pretrain.py][INFO] Epoch:[0/2](703900/4588595) loss:3.357 lr:0.0000100 epoch_Time:24712.0min: [2024-01-05 20:17:28,394][model8_pretrain.py][INFO] Epoch:[0/2](703900/4588595) loss:2.913 lr:0.0000100 epoch_Time:24712.0min: [2024-01-05 20:17:28,394][model8_pretrain.py][INFO] Epoch:[0/2](703900/4588595) loss:2.537 lr:0.0000100 epoch_Time:24712.0min: [2024-01-05 20:18:05,328][model8_pretrain.py][INFO] Epoch:[0/2](704000/4588595) loss:2.757 lr:0.0000100 epoch_Time:24710.0min: [2024-01-05 20:18:05,328][model8_pretrain.py][INFO] Epoch:[0/2](704000/4588595) loss:3.200 lr:0.0000100 epoch_Time:24711.0min: [2024-01-05 20:18:05,328][model8_pretrain.py][INFO] Epoch:[0/2](704000/4588595) loss:3.016 lr:0.0000100 epoch_Time:24710.0min: [2024-01-05 20:18:05,328][model8_pretrain.py][INFO] Epoch:[0/2](704000/4588595) loss:2.897 lr:0.0000100 epoch_Time:24710.0min: [2024-01-05 20:18:05,328][model8_pretrain.py][INFO] Epoch:[0/2](704000/4588595) loss:2.693 lr:0.0000100 epoch_Time:24711.0min: [2024-01-05 20:18:05,328][model8_pretrain.py][INFO] Epoch:[0/2](704000/4588595) loss:2.593 lr:0.0000100 epoch_Time:24710.0min: [2024-01-05 20:18:05,328][model8_pretrain.py][INFO] Epoch:[0/2](704000/4588595) loss:3.022 lr:0.0000100 epoch_Time:24710.0min: [2024-01-05 20:18:05,328][model8_pretrain.py][INFO] Epoch:[0/2](704000/4588595) loss:3.030 lr:0.0000100 epoch_Time:24710.0min: [2024-01-05 20:18:42,268][model8_pretrain.py][INFO] Epoch:[0/2](704100/4588595) loss:3.379 lr:0.0000100 epoch_Time:24710.0min: [2024-01-05 20:18:42,268][model8_pretrain.py][INFO] Epoch:[0/2](704100/4588595) loss:2.990 lr:0.0000100 epoch_Time:24710.0min: [2024-01-05 20:18:42,268][model8_pretrain.py][INFO] Epoch:[0/2](704100/4588595) loss:2.672 lr:0.0000100 epoch_Time:24710.0min: [2024-01-05 20:18:42,269][model8_pretrain.py][INFO] Epoch:[0/2](704100/4588595) loss:2.973 lr:0.0000100 epoch_Time:24710.0min: [2024-01-05 
20:18:42,269][model8_pretrain.py][INFO] Epoch:[0/2](704100/4588595) loss:2.059 lr:0.0000100 epoch_Time:24710.0min: [2024-01-05 20:18:42,269][model8_pretrain.py][INFO] Epoch:[0/2](704100/4588595) loss:2.991 lr:0.0000100 epoch_Time:24710.0min: [2024-01-05 20:18:42,269][model8_pretrain.py][INFO] Epoch:[0/2](704100/4588595) loss:3.145 lr:0.0000100 epoch_Time:24710.0min: [2024-01-05 20:18:42,269][model8_pretrain.py][INFO] Epoch:[0/2](704100/4588595) loss:2.243 lr:0.0000100 epoch_Time:24710.0min: [2024-01-05 20:19:19,217][model8_pretrain.py][INFO] Epoch:[0/2](704200/4588595) loss:2.764 lr:0.0000100 epoch_Time:24709.0min: [2024-01-05 20:19:19,217][model8_pretrain.py][INFO] Epoch:[0/2](704200/4588595) loss:3.072 lr:0.0000100 epoch_Time:24709.0min: [2024-01-05 20:19:19,217][model8_pretrain.py][INFO] Epoch:[0/2](704200/4588595) loss:2.884 lr:0.0000100 epoch_Time:24709.0min: [2024-01-05 20:19:19,217][model8_pretrain.py][INFO] Epoch:[0/2](704200/4588595) loss:3.258 lr:0.0000100 epoch_Time:24709.0min: [2024-01-05 20:19:19,217][model8_pretrain.py][INFO] Epoch:[0/2](704200/4588595) loss:2.575 lr:0.0000100 epoch_Time:24709.0min: [2024-01-05 20:19:19,217][model8_pretrain.py][INFO] Epoch:[0/2](704200/4588595) loss:2.716 lr:0.0000100 epoch_Time:24709.0min: [2024-01-05 20:19:19,217][model8_pretrain.py][INFO] Epoch:[0/2](704200/4588595) loss:3.073 lr:0.0000100 epoch_Time:24709.0min: [2024-01-05 20:19:19,217][model8_pretrain.py][INFO] Epoch:[0/2](704200/4588595) loss:2.682 lr:0.0000100 epoch_Time:24709.0min: [2024-01-05 20:19:56,147][model8_pretrain.py][INFO] Epoch:[0/2](704300/4588595) loss:2.724 lr:0.0000100 epoch_Time:24708.0min: [2024-01-05 20:19:56,147][model8_pretrain.py][INFO] Epoch:[0/2](704300/4588595) loss:2.951 lr:0.0000100 epoch_Time:24708.0min: [2024-01-05 20:19:56,147][model8_pretrain.py][INFO] Epoch:[0/2](704300/4588595) loss:3.324 lr:0.0000100 epoch_Time:24708.0min: [2024-01-05 20:19:56,147][model8_pretrain.py][INFO] Epoch:[0/2](704300/4588595) loss:2.502 lr:0.0000100 
epoch_Time:24708.0min: [2024-01-05 20:19:56,147][model8_pretrain.py][INFO] Epoch:[0/2](704300/4588595) loss:3.201 lr:0.0000100 epoch_Time:24708.0min: [2024-01-05 20:19:56,147][model8_pretrain.py][INFO] Epoch:[0/2](704300/4588595) loss:2.690 lr:0.0000100 epoch_Time:24708.0min: [2024-01-05 20:19:56,147][model8_pretrain.py][INFO] Epoch:[0/2](704300/4588595) loss:2.558 lr:0.0000100 epoch_Time:24708.0min: [2024-01-05 20:19:56,147][model8_pretrain.py][INFO] Epoch:[0/2](704300/4588595) loss:3.045 lr:0.0000100 epoch_Time:24708.0min: [2024-01-05 20:20:33,089][model8_pretrain.py][INFO] Epoch:[0/2](704400/4588595) loss:2.793 lr:0.0000100 epoch_Time:24708.0min: [2024-01-05 20:20:33,089][model8_pretrain.py][INFO] Epoch:[0/2](704400/4588595) loss:2.818 lr:0.0000100 epoch_Time:24708.0min: [2024-01-05 20:20:33,089][model8_pretrain.py][INFO] Epoch:[0/2](704400/4588595) loss:2.885 lr:0.0000100 epoch_Time:24708.0min: [2024-01-05 20:20:33,089][model8_pretrain.py][INFO] Epoch:[0/2](704400/4588595) loss:2.503 lr:0.0000100 epoch_Time:24708.0min: [2024-01-05 20:20:33,089][model8_pretrain.py][INFO] Epoch:[0/2](704400/4588595) loss:2.464 lr:0.0000100 epoch_Time:24708.0min: [2024-01-05 20:20:33,089][model8_pretrain.py][INFO] Epoch:[0/2](704400/4588595) loss:2.243 lr:0.0000100 epoch_Time:24708.0min: [2024-01-05 20:20:33,090][model8_pretrain.py][INFO] Epoch:[0/2](704400/4588595) loss:2.560 lr:0.0000100 epoch_Time:24708.0min: [2024-01-05 20:20:33,090][model8_pretrain.py][INFO] Epoch:[0/2](704400/4588595) loss:2.417 lr:0.0000100 epoch_Time:24708.0min: [2024-01-05 20:21:10,031][model8_pretrain.py][INFO] Epoch:[0/2](704500/4588595) loss:2.864 lr:0.0000100 epoch_Time:24707.0min: [2024-01-05 20:21:10,031][model8_pretrain.py][INFO] Epoch:[0/2](704500/4588595) loss:2.717 lr:0.0000100 epoch_Time:24707.0min: [2024-01-05 20:21:10,031][model8_pretrain.py][INFO] Epoch:[0/2](704500/4588595) loss:3.019 lr:0.0000100 epoch_Time:24707.0min: [2024-01-05 20:21:10,031][model8_pretrain.py][INFO] 
Epoch:[0/2](704500/4588595) loss:2.929 lr:0.0000100 epoch_Time:24707.0min: [2024-01-05 20:21:10,031][model8_pretrain.py][INFO] Epoch:[0/2](704500/4588595) loss:2.988 lr:0.0000100 epoch_Time:24707.0min: [2024-01-05 20:21:10,031][model8_pretrain.py][INFO] Epoch:[0/2](704500/4588595) loss:2.948 lr:0.0000100 epoch_Time:24707.0min: [2024-01-05 20:21:10,031][model8_pretrain.py][INFO] Epoch:[0/2](704500/4588595) loss:2.981 lr:0.0000100 epoch_Time:24707.0min: [2024-01-05 20:21:10,031][model8_pretrain.py][INFO] Epoch:[0/2](704500/4588595) loss:2.588 lr:0.0000100 epoch_Time:24707.0min: [2024-01-05 20:21:46,970][model8_pretrain.py][INFO] Epoch:[0/2](704600/4588595) loss:2.762 lr:0.0000100 epoch_Time:24707.0min: [2024-01-05 20:21:46,970][model8_pretrain.py][INFO] Epoch:[0/2](704600/4588595) loss:3.017 lr:0.0000100 epoch_Time:24707.0min: [2024-01-05 20:21:46,970][model8_pretrain.py][INFO] Epoch:[0/2](704600/4588595) loss:2.785 lr:0.0000100 epoch_Time:24707.0min: [2024-01-05 20:21:46,970][model8_pretrain.py][INFO] Epoch:[0/2](704600/4588595) loss:2.813 lr:0.0000100 epoch_Time:24707.0min: [2024-01-05 20:21:46,970][model8_pretrain.py][INFO] Epoch:[0/2](704600/4588595) loss:2.489 lr:0.0000100 epoch_Time:24707.0min: [2024-01-05 20:21:46,970][model8_pretrain.py][INFO] Epoch:[0/2](704600/4588595) loss:2.894 lr:0.0000100 epoch_Time:24707.0min: [2024-01-05 20:21:46,970][model8_pretrain.py][INFO] Epoch:[0/2](704600/4588595) loss:3.126 lr:0.0000100 epoch_Time:24707.0min: [2024-01-05 20:21:46,970][model8_pretrain.py][INFO] Epoch:[0/2](704600/4588595) loss:3.090 lr:0.0000100 epoch_Time:24707.0min: [2024-01-05 20:22:35,831][model8_pretrain.py][INFO] Epoch:[0/2](704700/4588595) loss:2.762 lr:0.0000100 epoch_Time:24707.0min: [2024-01-05 20:22:35,831][model8_pretrain.py][INFO] Epoch:[0/2](704700/4588595) loss:2.667 lr:0.0000100 epoch_Time:24707.0min: [2024-01-05 20:22:35,831][model8_pretrain.py][INFO] Epoch:[0/2](704700/4588595) loss:2.722 lr:0.0000100 epoch_Time:24707.0min: [2024-01-05 
20:22:35,831][model8_pretrain.py][INFO] Epoch:[0/2](704700/4588595) loss:2.776 lr:0.0000100 epoch_Time:24707.0min: [2024-01-05 20:22:35,831][model8_pretrain.py][INFO] Epoch:[0/2](704700/4588595) loss:2.357 lr:0.0000100 epoch_Time:24707.0min: [2024-01-05 20:22:35,831][model8_pretrain.py][INFO] Epoch:[0/2](704700/4588595) loss:3.343 lr:0.0000100 epoch_Time:24707.0min: [2024-01-05 20:22:35,831][model8_pretrain.py][INFO] Epoch:[0/2](704700/4588595) loss:2.945 lr:0.0000100 epoch_Time:24707.0min: [2024-01-05 20:22:35,832][model8_pretrain.py][INFO] Epoch:[0/2](704700/4588595) loss:3.297 lr:0.0000100 epoch_Time:24707.0min: [2024-01-05 20:23:12,751][model8_pretrain.py][INFO] Epoch:[0/2](704800/4588595) loss:2.622 lr:0.0000100 epoch_Time:24706.0min: [2024-01-05 20:23:12,751][model8_pretrain.py][INFO] Epoch:[0/2](704800/4588595) loss:3.010 lr:0.0000100 epoch_Time:24706.0min: [2024-01-05 20:23:12,751][model8_pretrain.py][INFO] Epoch:[0/2](704800/4588595) loss:3.123 lr:0.0000100 epoch_Time:24706.0min: [2024-01-05 20:23:12,751][model8_pretrain.py][INFO] Epoch:[0/2](704800/4588595) loss:3.046 lr:0.0000100 epoch_Time:24706.0min: [2024-01-05 20:23:12,751][model8_pretrain.py][INFO] Epoch:[0/2](704800/4588595) loss:2.812 lr:0.0000100 epoch_Time:24706.0min: [2024-01-05 20:23:12,751][model8_pretrain.py][INFO] Epoch:[0/2](704800/4588595) loss:2.308 lr:0.0000100 epoch_Time:24706.0min: [2024-01-05 20:23:12,751][model8_pretrain.py][INFO] Epoch:[0/2](704800/4588595) loss:2.661 lr:0.0000100 epoch_Time:24706.0min: [2024-01-05 20:23:12,752][model8_pretrain.py][INFO] Epoch:[0/2](704800/4588595) loss:3.053 lr:0.0000100 epoch_Time:24706.0min: [2024-01-05 20:23:49,685][model8_pretrain.py][INFO] Epoch:[0/2](704900/4588595) loss:2.340 lr:0.0000100 epoch_Time:24705.0min: [2024-01-05 20:23:49,685][model8_pretrain.py][INFO] Epoch:[0/2](704900/4588595) loss:2.368 lr:0.0000100 epoch_Time:24705.0min: [2024-01-05 20:23:49,685][model8_pretrain.py][INFO] Epoch:[0/2](704900/4588595) loss:2.786 lr:0.0000100 
epoch_Time:24705.0min: [2024-01-05 20:23:49,685][model8_pretrain.py][INFO] Epoch:[0/2](704900/4588595) loss:2.831 lr:0.0000100 epoch_Time:24705.0min: [2024-01-05 20:23:49,685][model8_pretrain.py][INFO] Epoch:[0/2](704900/4588595) loss:3.437 lr:0.0000100 epoch_Time:24705.0min: [2024-01-05 20:23:49,685][model8_pretrain.py][INFO] Epoch:[0/2](704900/4588595) loss:2.689 lr:0.0000100 epoch_Time:24705.0min: [2024-01-05 20:23:49,686][model8_pretrain.py][INFO] Epoch:[0/2](704900/4588595) loss:2.513 lr:0.0000100 epoch_Time:24705.0min: [2024-01-05 20:23:49,686][model8_pretrain.py][INFO] Epoch:[0/2](704900/4588595) loss:3.171 lr:0.0000100 epoch_Time:24705.0min: [2024-01-05 20:24:26,627][model8_pretrain.py][INFO] Epoch:[0/2](705000/4588595) loss:2.361 lr:0.0000100 epoch_Time:24704.0min: [2024-01-05 20:24:26,627][model8_pretrain.py][INFO] Epoch:[0/2](705000/4588595) loss:3.381 lr:0.0000100 epoch_Time:24704.0min: [2024-01-05 20:24:26,627][model8_pretrain.py][INFO] Epoch:[0/2](705000/4588595) loss:2.475 lr:0.0000100 epoch_Time:24704.0min: [2024-01-05 20:24:26,627][model8_pretrain.py][INFO] Epoch:[0/2](705000/4588595) loss:2.040 lr:0.0000100 epoch_Time:24704.0min: [2024-01-05 20:24:26,627][model8_pretrain.py][INFO] Epoch:[0/2](705000/4588595) loss:3.175 lr:0.0000100 epoch_Time:24704.0min: [2024-01-05 20:24:26,627][model8_pretrain.py][INFO] Epoch:[0/2](705000/4588595) loss:2.658 lr:0.0000100 epoch_Time:24704.0min: [2024-01-05 20:24:26,627][model8_pretrain.py][INFO] Epoch:[0/2](705000/4588595) loss:2.670 lr:0.0000100 epoch_Time:24704.0min: [2024-01-05 20:24:26,627][model8_pretrain.py][INFO] Epoch:[0/2](705000/4588595) loss:3.052 lr:0.0000100 epoch_Time:24704.0min: [2024-01-05 20:25:03,570][model8_pretrain.py][INFO] Epoch:[0/2](705100/4588595) loss:2.686 lr:0.0000100 epoch_Time:24703.0min: [2024-01-05 20:25:03,570][model8_pretrain.py][INFO] Epoch:[0/2](705100/4588595) loss:2.765 lr:0.0000100 epoch_Time:24703.0min: [2024-01-05 20:25:03,570][model8_pretrain.py][INFO] 
Epoch:[0/2](705100/4588595) loss:2.493 lr:0.0000100 epoch_Time:24703.0min: [2024-01-05 20:25:03,570][model8_pretrain.py][INFO] Epoch:[0/2](705100/4588595) loss:2.670 lr:0.0000100 epoch_Time:24703.0min: [2024-01-05 20:25:03,570][model8_pretrain.py][INFO] Epoch:[0/2](705100/4588595) loss:2.563 lr:0.0000100 epoch_Time:24703.0min: [2024-01-05 20:25:03,570][model8_pretrain.py][INFO] Epoch:[0/2](705100/4588595) loss:2.429 lr:0.0000100 epoch_Time:24703.0min: [2024-01-05 20:25:03,570][model8_pretrain.py][INFO] Epoch:[0/2](705100/4588595) loss:2.735 lr:0.0000100 epoch_Time:24703.0min: [2024-01-05 20:25:03,570][model8_pretrain.py][INFO] Epoch:[0/2](705100/4588595) loss:3.035 lr:0.0000100 epoch_Time:24703.0min: [2024-01-05 20:25:40,507][model8_pretrain.py][INFO] Epoch:[0/2](705200/4588595) loss:2.404 lr:0.0000100 epoch_Time:24703.0min: [2024-01-05 20:25:40,507][model8_pretrain.py][INFO] Epoch:[0/2](705200/4588595) loss:2.943 lr:0.0000100 epoch_Time:24703.0min: [2024-01-05 20:25:40,507][model8_pretrain.py][INFO] Epoch:[0/2](705200/4588595) loss:2.749 lr:0.0000100 epoch_Time:24703.0min: [2024-01-05 20:25:40,507][model8_pretrain.py][INFO] Epoch:[0/2](705200/4588595) loss:3.173 lr:0.0000100 epoch_Time:24703.0min: [2024-01-05 20:25:40,507][model8_pretrain.py][INFO] Epoch:[0/2](705200/4588595) loss:3.381 lr:0.0000100 epoch_Time:24703.0min: [2024-01-05 20:25:40,507][model8_pretrain.py][INFO] Epoch:[0/2](705200/4588595) loss:2.717 lr:0.0000100 epoch_Time:24703.0min: [2024-01-05 20:25:40,507][model8_pretrain.py][INFO] Epoch:[0/2](705200/4588595) loss:3.095 lr:0.0000100 epoch_Time:24703.0min: [2024-01-05 20:25:40,507][model8_pretrain.py][INFO] Epoch:[0/2](705200/4588595) loss:2.565 lr:0.0000100 epoch_Time:24703.0min: [2024-01-05 20:26:17,449][model8_pretrain.py][INFO] Epoch:[0/2](705300/4588595) loss:3.103 lr:0.0000100 epoch_Time:24702.0min: [2024-01-05 20:26:17,449][model8_pretrain.py][INFO] Epoch:[0/2](705300/4588595) loss:2.607 lr:0.0000100 epoch_Time:24702.0min: [2024-01-05 
20:26:17,449][model8_pretrain.py][INFO] Epoch:[0/2](705300/4588595) loss:2.871 lr:0.0000100 epoch_Time:24702.0min: [2024-01-05 20:26:17,449][model8_pretrain.py][INFO] Epoch:[0/2](705300/4588595) loss:2.490 lr:0.0000100 epoch_Time:24702.0min: [2024-01-05 20:26:17,449][model8_pretrain.py][INFO] Epoch:[0/2](705300/4588595) loss:3.062 lr:0.0000100 epoch_Time:24702.0min: [2024-01-05 20:26:17,449][model8_pretrain.py][INFO] Epoch:[0/2](705300/4588595) loss:2.676 lr:0.0000100 epoch_Time:24702.0min: [2024-01-05 20:26:17,449][model8_pretrain.py][INFO] Epoch:[0/2](705300/4588595) loss:2.848 lr:0.0000100 epoch_Time:24702.0min: [2024-01-05 20:26:17,449][model8_pretrain.py][INFO] Epoch:[0/2](705300/4588595) loss:2.706 lr:0.0000100 epoch_Time:24702.0min: [2024-01-05 20:26:54,399][model8_pretrain.py][INFO] Epoch:[0/2](705400/4588595) loss:2.693 lr:0.0000100 epoch_Time:24701.0min: [2024-01-05 20:26:54,399][model8_pretrain.py][INFO] Epoch:[0/2](705400/4588595) loss:2.283 lr:0.0000100 epoch_Time:24701.0min: [2024-01-05 20:26:54,399][model8_pretrain.py][INFO] Epoch:[0/2](705400/4588595) loss:3.154 lr:0.0000100 epoch_Time:24701.0min: [2024-01-05 20:26:54,399][model8_pretrain.py][INFO] Epoch:[0/2](705400/4588595) loss:2.886 lr:0.0000100 epoch_Time:24701.0min: [2024-01-05 20:26:54,399][model8_pretrain.py][INFO] Epoch:[0/2](705400/4588595) loss:2.268 lr:0.0000100 epoch_Time:24701.0min: [2024-01-05 20:26:54,399][model8_pretrain.py][INFO] Epoch:[0/2](705400/4588595) loss:2.664 lr:0.0000100 epoch_Time:24701.0min: [2024-01-05 20:26:54,399][model8_pretrain.py][INFO] Epoch:[0/2](705400/4588595) loss:2.774 lr:0.0000100 epoch_Time:24701.0min: [2024-01-05 20:26:54,399][model8_pretrain.py][INFO] Epoch:[0/2](705400/4588595) loss:2.907 lr:0.0000100 epoch_Time:24701.0min: [2024-01-05 20:27:43,132][model8_pretrain.py][INFO] Epoch:[0/2](705500/4588595) loss:3.032 lr:0.0000100 epoch_Time:24702.0min: [2024-01-05 20:27:43,132][model8_pretrain.py][INFO] Epoch:[0/2](705500/4588595) loss:2.310 lr:0.0000100 
epoch_Time:24702.0min: [2024-01-05 20:27:43,132][model8_pretrain.py][INFO] Epoch:[0/2](705500/4588595) loss:2.725 lr:0.0000100 epoch_Time:24702.0min: [2024-01-05 20:27:43,132][model8_pretrain.py][INFO] Epoch:[0/2](705500/4588595) loss:3.147 lr:0.0000100 epoch_Time:24702.0min: [2024-01-05 20:27:43,132][model8_pretrain.py][INFO] Epoch:[0/2](705500/4588595) loss:2.597 lr:0.0000100 epoch_Time:24702.0min: [2024-01-05 20:27:43,132][model8_pretrain.py][INFO] Epoch:[0/2](705500/4588595) loss:2.952 lr:0.0000100 epoch_Time:24702.0min: [2024-01-05 20:27:43,132][model8_pretrain.py][INFO] Epoch:[0/2](705500/4588595) loss:2.161 lr:0.0000100 epoch_Time:24702.0min: [2024-01-05 20:27:43,132][model8_pretrain.py][INFO] Epoch:[0/2](705500/4588595) loss:3.296 lr:0.0000100 epoch_Time:24702.0min: [2024-01-05 20:28:20,063][model8_pretrain.py][INFO] Epoch:[0/2](705600/4588595) loss:2.805 lr:0.0000100 epoch_Time:24701.0min: [2024-01-05 20:28:20,064][model8_pretrain.py][INFO] Epoch:[0/2](705600/4588595) loss:3.073 lr:0.0000100 epoch_Time:24701.0min: [2024-01-05 20:28:20,064][model8_pretrain.py][INFO] Epoch:[0/2](705600/4588595) loss:3.251 lr:0.0000100 epoch_Time:24701.0min: [2024-01-05 20:28:20,064][model8_pretrain.py][INFO] Epoch:[0/2](705600/4588595) loss:3.016 lr:0.0000100 epoch_Time:24701.0min: [2024-01-05 20:28:20,064][model8_pretrain.py][INFO] Epoch:[0/2](705600/4588595) loss:3.193 lr:0.0000100 epoch_Time:24701.0min: [2024-01-05 20:28:20,064][model8_pretrain.py][INFO] Epoch:[0/2](705600/4588595) loss:3.052 lr:0.0000100 epoch_Time:24701.0min: [2024-01-05 20:28:20,064][model8_pretrain.py][INFO] Epoch:[0/2](705600/4588595) loss:2.793 lr:0.0000100 epoch_Time:24701.0min: [2024-01-05 20:28:20,064][model8_pretrain.py][INFO] Epoch:[0/2](705600/4588595) loss:3.343 lr:0.0000100 epoch_Time:24701.0min: [2024-01-05 20:28:57,004][model8_pretrain.py][INFO] Epoch:[0/2](705700/4588595) loss:2.753 lr:0.0000100 epoch_Time:24700.0min: [2024-01-05 20:28:57,004][model8_pretrain.py][INFO] 
Epoch:[0/2](705700/4588595) loss:2.665 lr:0.0000100 epoch_Time:24700.0min: [2024-01-05 20:28:57,004][model8_pretrain.py][INFO] Epoch:[0/2](705700/4588595) loss:3.142 lr:0.0000100 epoch_Time:24700.0min: [2024-01-05 20:28:57,004][model8_pretrain.py][INFO] Epoch:[0/2](705700/4588595) loss:2.778 lr:0.0000100 epoch_Time:24700.0min: [2024-01-05 20:28:57,004][model8_pretrain.py][INFO] Epoch:[0/2](705700/4588595) loss:2.989 lr:0.0000100 epoch_Time:24700.0min: [2024-01-05 20:28:57,004][model8_pretrain.py][INFO] Epoch:[0/2](705700/4588595) loss:2.356 lr:0.0000100 epoch_Time:24700.0min: [2024-01-05 20:28:57,004][model8_pretrain.py][INFO] Epoch:[0/2](705700/4588595) loss:2.377 lr:0.0000100 epoch_Time:24700.0min: [2024-01-05 20:28:57,004][model8_pretrain.py][INFO] Epoch:[0/2](705700/4588595) loss:2.537 lr:0.0000100 epoch_Time:24700.0min: [2024-01-05 20:29:33,953][model8_pretrain.py][INFO] Epoch:[0/2](705800/4588595) loss:2.296 lr:0.0000100 epoch_Time:24700.0min: [2024-01-05 20:29:33,953][model8_pretrain.py][INFO] Epoch:[0/2](705800/4588595) loss:3.011 lr:0.0000100 epoch_Time:24700.0min: [2024-01-05 20:29:33,953][model8_pretrain.py][INFO] Epoch:[0/2](705800/4588595) loss:2.943 lr:0.0000100 epoch_Time:24700.0min: [2024-01-05 20:29:33,953][model8_pretrain.py][INFO] Epoch:[0/2](705800/4588595) loss:1.598 lr:0.0000100 epoch_Time:24700.0min: [2024-01-05 20:29:33,953][model8_pretrain.py][INFO] Epoch:[0/2](705800/4588595) loss:3.090 lr:0.0000100 epoch_Time:24700.0min: [2024-01-05 20:29:33,953][model8_pretrain.py][INFO] Epoch:[0/2](705800/4588595) loss:3.045 lr:0.0000100 epoch_Time:24700.0min: [2024-01-05 20:29:33,953][model8_pretrain.py][INFO] Epoch:[0/2](705800/4588595) loss:2.869 lr:0.0000100 epoch_Time:24700.0min: [2024-01-05 20:29:33,953][model8_pretrain.py][INFO] Epoch:[0/2](705800/4588595) loss:2.915 lr:0.0000100 epoch_Time:24700.0min: [2024-01-05 20:30:10,899][model8_pretrain.py][INFO] Epoch:[0/2](705900/4588595) loss:2.759 lr:0.0000100 epoch_Time:24699.0min: [2024-01-05 
20:30:10,899][model8_pretrain.py][INFO] Epoch:[0/2](705900/4588595) loss:3.161 lr:0.0000100 epoch_Time:24699.0min: [2024-01-05 20:30:10,899][model8_pretrain.py][INFO] Epoch:[0/2](705900/4588595) loss:2.843 lr:0.0000100 epoch_Time:24699.0min: [2024-01-05 20:30:10,899][model8_pretrain.py][INFO] Epoch:[0/2](705900/4588595) loss:2.614 lr:0.0000100 epoch_Time:24699.0min: [2024-01-05 20:30:10,899][model8_pretrain.py][INFO] Epoch:[0/2](705900/4588595) loss:2.567 lr:0.0000100 epoch_Time:24699.0min: [2024-01-05 20:30:10,899][model8_pretrain.py][INFO] Epoch:[0/2](705900/4588595) loss:2.528 lr:0.0000100 epoch_Time:24699.0min: [2024-01-05 20:30:10,899][model8_pretrain.py][INFO] Epoch:[0/2](705900/4588595) loss:2.853 lr:0.0000100 epoch_Time:24699.0min: [2024-01-05 20:30:10,900][model8_pretrain.py][INFO] Epoch:[0/2](705900/4588595) loss:2.777 lr:0.0000100 epoch_Time:24699.0min: [2024-01-05 20:30:47,834][model8_pretrain.py][INFO] Epoch:[0/2](706000/4588595) loss:2.885 lr:0.0000100 epoch_Time:24697.0min: [2024-01-05 20:30:47,834][model8_pretrain.py][INFO] Epoch:[0/2](706000/4588595) loss:2.859 lr:0.0000100 epoch_Time:24697.0min: [2024-01-05 20:30:47,834][model8_pretrain.py][INFO] Epoch:[0/2](706000/4588595) loss:2.994 lr:0.0000100 epoch_Time:24697.0min: [2024-01-05 20:30:47,834][model8_pretrain.py][INFO] Epoch:[0/2](706000/4588595) loss:2.857 lr:0.0000100 epoch_Time:24697.0min: [2024-01-05 20:30:47,834][model8_pretrain.py][INFO] Epoch:[0/2](706000/4588595) loss:2.704 lr:0.0000100 epoch_Time:24697.0min: [2024-01-05 20:30:47,834][model8_pretrain.py][INFO] Epoch:[0/2](706000/4588595) loss:3.165 lr:0.0000100 epoch_Time:24697.0min: [2024-01-05 20:30:47,834][model8_pretrain.py][INFO] Epoch:[0/2](706000/4588595) loss:2.782 lr:0.0000100 epoch_Time:24697.0min: [2024-01-05 20:30:47,834][model8_pretrain.py][INFO] Epoch:[0/2](706000/4588595) loss:2.105 lr:0.0000100 epoch_Time:24697.0min: [2024-01-05 20:31:24,770][model8_pretrain.py][INFO] Epoch:[0/2](706100/4588595) loss:3.070 lr:0.0000100 
epoch_Time:24697.0min: [2024-01-05 20:31:24,770][model8_pretrain.py][INFO] Epoch:[0/2](706100/4588595) loss:2.844 lr:0.0000100 epoch_Time:24697.0min: [2024-01-05 20:31:24,770][model8_pretrain.py][INFO] Epoch:[0/2](706100/4588595) loss:2.255 lr:0.0000100 epoch_Time:24697.0min: [2024-01-05 20:31:24,770][model8_pretrain.py][INFO] Epoch:[0/2](706100/4588595) loss:2.635 lr:0.0000100 epoch_Time:24697.0min: [2024-01-05 20:31:24,770][model8_pretrain.py][INFO] Epoch:[0/2](706100/4588595) loss:2.328 lr:0.0000100 epoch_Time:24697.0min: [2024-01-05 20:31:24,770][model8_pretrain.py][INFO] Epoch:[0/2](706100/4588595) loss:3.039 lr:0.0000100 epoch_Time:24697.0min: [2024-01-05 20:31:24,770][model8_pretrain.py][INFO] Epoch:[0/2](706100/4588595) loss:3.261 lr:0.0000100 epoch_Time:24697.0min: [2024-01-05 20:31:24,771][model8_pretrain.py][INFO] Epoch:[0/2](706100/4588595) loss:3.100 lr:0.0000100 epoch_Time:24697.0min: [2024-01-05 20:32:01,713][model8_pretrain.py][INFO] Epoch:[0/2](706200/4588595) loss:3.161 lr:0.0000100 epoch_Time:24696.0min: [2024-01-05 20:32:01,713][model8_pretrain.py][INFO] Epoch:[0/2](706200/4588595) loss:2.547 lr:0.0000100 epoch_Time:24696.0min: [2024-01-05 20:32:01,713][model8_pretrain.py][INFO] Epoch:[0/2](706200/4588595) loss:3.354 lr:0.0000100 epoch_Time:24696.0min: [2024-01-05 20:32:01,713][model8_pretrain.py][INFO] Epoch:[0/2](706200/4588595) loss:2.647 lr:0.0000100 epoch_Time:24696.0min: [2024-01-05 20:32:01,713][model8_pretrain.py][INFO] Epoch:[0/2](706200/4588595) loss:2.642 lr:0.0000100 epoch_Time:24696.0min: [2024-01-05 20:32:01,714][model8_pretrain.py][INFO] Epoch:[0/2](706200/4588595) loss:2.954 lr:0.0000100 epoch_Time:24696.0min: [2024-01-05 20:32:01,714][model8_pretrain.py][INFO] Epoch:[0/2](706200/4588595) loss:3.002 lr:0.0000100 epoch_Time:24696.0min: [2024-01-05 20:32:01,714][model8_pretrain.py][INFO] Epoch:[0/2](706200/4588595) loss:2.100 lr:0.0000100 epoch_Time:24696.0min: [2024-01-05 20:32:50,688][model8_pretrain.py][INFO] 
Epoch:[0/2](706300/4588595) loss:3.676 lr:0.0000100 epoch_Time:24696.0min: [2024-01-05 20:32:50,688][model8_pretrain.py][INFO] Epoch:[0/2](706300/4588595) loss:2.995 lr:0.0000100 epoch_Time:24696.0min: [2024-01-05 20:32:50,688][model8_pretrain.py][INFO] Epoch:[0/2](706300/4588595) loss:3.086 lr:0.0000100 epoch_Time:24696.0min: [2024-01-05 20:32:50,688][model8_pretrain.py][INFO] Epoch:[0/2](706300/4588595) loss:3.075 lr:0.0000100 epoch_Time:24696.0min: [2024-01-05 20:32:50,688][model8_pretrain.py][INFO] Epoch:[0/2](706300/4588595) loss:2.686 lr:0.0000100 epoch_Time:24696.0min: [2024-01-05 20:32:50,688][model8_pretrain.py][INFO] Epoch:[0/2](706300/4588595) loss:2.383 lr:0.0000100 epoch_Time:24696.0min: [2024-01-05 20:32:50,688][model8_pretrain.py][INFO] Epoch:[0/2](706300/4588595) loss:2.732 lr:0.0000100 epoch_Time:24696.0min: [2024-01-05 20:32:50,688][model8_pretrain.py][INFO] Epoch:[0/2](706300/4588595) loss:3.091 lr:0.0000100 epoch_Time:24696.0min: [2024-01-05 20:33:27,615][model8_pretrain.py][INFO] Epoch:[0/2](706400/4588595) loss:3.132 lr:0.0000100 epoch_Time:24696.0min: [2024-01-05 20:33:27,615][model8_pretrain.py][INFO] Epoch:[0/2](706400/4588595) loss:2.847 lr:0.0000100 epoch_Time:24696.0min: [2024-01-05 20:33:27,615][model8_pretrain.py][INFO] Epoch:[0/2](706400/4588595) loss:2.653 lr:0.0000100 epoch_Time:24696.0min: [2024-01-05 20:33:27,615][model8_pretrain.py][INFO] Epoch:[0/2](706400/4588595) loss:2.471 lr:0.0000100 epoch_Time:24696.0min: [2024-01-05 20:33:27,615][model8_pretrain.py][INFO] Epoch:[0/2](706400/4588595) loss:2.496 lr:0.0000100 epoch_Time:24696.0min: [2024-01-05 20:33:27,615][model8_pretrain.py][INFO] Epoch:[0/2](706400/4588595) loss:3.456 lr:0.0000100 epoch_Time:24696.0min: [2024-01-05 20:33:27,615][model8_pretrain.py][INFO] Epoch:[0/2](706400/4588595) loss:2.712 lr:0.0000100 epoch_Time:24696.0min: [2024-01-05 20:33:27,616][model8_pretrain.py][INFO] Epoch:[0/2](706400/4588595) loss:2.069 lr:0.0000100 epoch_Time:24696.0min: [2024-01-05 
20:34:04,550][model8_pretrain.py][INFO] Epoch:[0/2](706500/4588595) loss:2.850 lr:0.0000100 epoch_Time:24695.0min: [2024-01-05 20:34:04,550][model8_pretrain.py][INFO] Epoch:[0/2](706500/4588595) loss:2.796 lr:0.0000100 epoch_Time:24695.0min: [2024-01-05 20:34:04,550][model8_pretrain.py][INFO] Epoch:[0/2](706500/4588595) loss:2.987 lr:0.0000100 epoch_Time:24695.0min: [2024-01-05 20:34:04,550][model8_pretrain.py][INFO] Epoch:[0/2](706500/4588595) loss:2.670 lr:0.0000100 epoch_Time:24695.0min: [2024-01-05 20:34:04,550][model8_pretrain.py][INFO] Epoch:[0/2](706500/4588595) loss:2.682 lr:0.0000100 epoch_Time:24695.0min: [2024-01-05 20:34:04,550][model8_pretrain.py][INFO] Epoch:[0/2](706500/4588595) loss:2.193 lr:0.0000100 epoch_Time:24695.0min: [2024-01-05 20:34:04,550][model8_pretrain.py][INFO] Epoch:[0/2](706500/4588595) loss:2.772 lr:0.0000100 epoch_Time:24695.0min: [2024-01-05 20:34:04,550][model8_pretrain.py][INFO] Epoch:[0/2](706500/4588595) loss:2.457 lr:0.0000100 epoch_Time:24695.0min: [2024-01-05 20:34:41,491][model8_pretrain.py][INFO] Epoch:[0/2](706600/4588595) loss:3.209 lr:0.0000100 epoch_Time:24695.0min: [2024-01-05 20:34:41,491][model8_pretrain.py][INFO] Epoch:[0/2](706600/4588595) loss:2.701 lr:0.0000100 epoch_Time:24695.0min: [2024-01-05 20:34:41,491][model8_pretrain.py][INFO] Epoch:[0/2](706600/4588595) loss:2.895 lr:0.0000100 epoch_Time:24695.0min: [2024-01-05 20:34:41,491][model8_pretrain.py][INFO] Epoch:[0/2](706600/4588595) loss:2.961 lr:0.0000100 epoch_Time:24695.0min: [2024-01-05 20:34:41,491][model8_pretrain.py][INFO] Epoch:[0/2](706600/4588595) loss:3.263 lr:0.0000100 epoch_Time:24695.0min: [2024-01-05 20:34:41,491][model8_pretrain.py][INFO] Epoch:[0/2](706600/4588595) loss:2.700 lr:0.0000100 epoch_Time:24695.0min: [2024-01-05 20:34:41,491][model8_pretrain.py][INFO] Epoch:[0/2](706600/4588595) loss:2.895 lr:0.0000100 epoch_Time:24695.0min: [2024-01-05 20:34:41,491][model8_pretrain.py][INFO] Epoch:[0/2](706600/4588595) loss:3.117 lr:0.0000100 
epoch_Time:24695.0min: [2024-01-05 20:35:18,438][model8_pretrain.py][INFO] Epoch:[0/2](706700/4588595) loss:3.390 lr:0.0000100 epoch_Time:24694.0min: [2024-01-05 20:35:18,438][model8_pretrain.py][INFO] Epoch:[0/2](706700/4588595) loss:2.709 lr:0.0000100 epoch_Time:24694.0min: [2024-01-05 20:35:18,438][model8_pretrain.py][INFO] Epoch:[0/2](706700/4588595) loss:2.762 lr:0.0000100 epoch_Time:24694.0min: [2024-01-05 20:35:18,438][model8_pretrain.py][INFO] Epoch:[0/2](706700/4588595) loss:2.739 lr:0.0000100 epoch_Time:24694.0min: [2024-01-05 20:35:18,438][model8_pretrain.py][INFO] Epoch:[0/2](706700/4588595) loss:2.630 lr:0.0000100 epoch_Time:24694.0min: [2024-01-05 20:35:18,438][model8_pretrain.py][INFO] Epoch:[0/2](706700/4588595) loss:2.494 lr:0.0000100 epoch_Time:24694.0min: [2024-01-05 20:35:18,438][model8_pretrain.py][INFO] Epoch:[0/2](706700/4588595) loss:3.107 lr:0.0000100 epoch_Time:24694.0min: [2024-01-05 20:35:18,438][model8_pretrain.py][INFO] Epoch:[0/2](706700/4588595) loss:2.697 lr:0.0000100 epoch_Time:24694.0min: [2024-01-05 20:35:55,396][model8_pretrain.py][INFO] Epoch:[0/2](706800/4588595) loss:3.029 lr:0.0000100 epoch_Time:24693.0min: [2024-01-05 20:35:55,396][model8_pretrain.py][INFO] Epoch:[0/2](706800/4588595) loss:2.961 lr:0.0000100 epoch_Time:24693.0min: [2024-01-05 20:35:55,396][model8_pretrain.py][INFO] Epoch:[0/2](706800/4588595) loss:2.800 lr:0.0000100 epoch_Time:24693.0min: [2024-01-05 20:35:55,396][model8_pretrain.py][INFO] Epoch:[0/2](706800/4588595) loss:2.862 lr:0.0000100 epoch_Time:24693.0min: [2024-01-05 20:35:55,396][model8_pretrain.py][INFO] Epoch:[0/2](706800/4588595) loss:2.823 lr:0.0000100 epoch_Time:24693.0min: [2024-01-05 20:35:55,396][model8_pretrain.py][INFO] Epoch:[0/2](706800/4588595) loss:3.211 lr:0.0000100 epoch_Time:24693.0min: [2024-01-05 20:35:55,397][model8_pretrain.py][INFO] Epoch:[0/2](706800/4588595) loss:3.390 lr:0.0000100 epoch_Time:24693.0min: [2024-01-05 20:35:55,397][model8_pretrain.py][INFO] 
Epoch:[0/2](706800/4588595) loss:2.852 lr:0.0000100 epoch_Time:24693.0min: [2024-01-05 20:36:32,340][model8_pretrain.py][INFO] Epoch:[0/2](706900/4588595) loss:2.799 lr:0.0000100 epoch_Time:24693.0min: [2024-01-05 20:36:32,340][model8_pretrain.py][INFO] Epoch:[0/2](706900/4588595) loss:3.131 lr:0.0000100 epoch_Time:24693.0min: [2024-01-05 20:36:32,340][model8_pretrain.py][INFO] Epoch:[0/2](706900/4588595) loss:2.937 lr:0.0000100 epoch_Time:24693.0min: [2024-01-05 20:36:32,340][model8_pretrain.py][INFO] Epoch:[0/2](706900/4588595) loss:2.544 lr:0.0000100 epoch_Time:24693.0min: [2024-01-05 20:36:32,340][model8_pretrain.py][INFO] Epoch:[0/2](706900/4588595) loss:3.046 lr:0.0000100 epoch_Time:24693.0min: [2024-01-05 20:36:32,340][model8_pretrain.py][INFO] Epoch:[0/2](706900/4588595) loss:2.757 lr:0.0000100 epoch_Time:24693.0min: [2024-01-05 20:36:32,340][model8_pretrain.py][INFO] Epoch:[0/2](706900/4588595) loss:2.347 lr:0.0000100 epoch_Time:24693.0min: [2024-01-05 20:36:32,340][model8_pretrain.py][INFO] Epoch:[0/2](706900/4588595) loss:2.559 lr:0.0000100 epoch_Time:24693.0min: [2024-01-05 20:37:09,286][model8_pretrain.py][INFO] Epoch:[0/2](707000/4588595) loss:2.154 lr:0.0000100 epoch_Time:24691.0min: [2024-01-05 20:37:09,286][model8_pretrain.py][INFO] Epoch:[0/2](707000/4588595) loss:2.679 lr:0.0000100 epoch_Time:24691.0min: [2024-01-05 20:37:09,286][model8_pretrain.py][INFO] Epoch:[0/2](707000/4588595) loss:2.972 lr:0.0000100 epoch_Time:24691.0min: [2024-01-05 20:37:09,286][model8_pretrain.py][INFO] Epoch:[0/2](707000/4588595) loss:2.682 lr:0.0000100 epoch_Time:24691.0min: [2024-01-05 20:37:09,286][model8_pretrain.py][INFO] Epoch:[0/2](707000/4588595) loss:2.572 lr:0.0000100 epoch_Time:24691.0min: [2024-01-05 20:37:09,286][model8_pretrain.py][INFO] Epoch:[0/2](707000/4588595) loss:2.448 lr:0.0000100 epoch_Time:24691.0min: [2024-01-05 20:37:09,286][model8_pretrain.py][INFO] Epoch:[0/2](707000/4588595) loss:2.875 lr:0.0000100 epoch_Time:24691.0min: [2024-01-05 
20:37:09,286][model8_pretrain.py][INFO] Epoch:[0/2](707000/4588595) loss:3.222 lr:0.0000100 epoch_Time:24691.0min: [2024-01-05 20:37:58,117][model8_pretrain.py][INFO] Epoch:[0/2](707100/4588595) loss:3.367 lr:0.0000100 epoch_Time:24692.0min: [2024-01-05 20:37:58,117][model8_pretrain.py][INFO] Epoch:[0/2](707100/4588595) loss:3.090 lr:0.0000100 epoch_Time:24692.0min: [2024-01-05 20:37:58,117][model8_pretrain.py][INFO] Epoch:[0/2](707100/4588595) loss:2.396 lr:0.0000100 epoch_Time:24692.0min: [2024-01-05 20:37:58,117][model8_pretrain.py][INFO] Epoch:[0/2](707100/4588595) loss:2.665 lr:0.0000100 epoch_Time:24692.0min: [2024-01-05 20:37:58,117][model8_pretrain.py][INFO] Epoch:[0/2](707100/4588595) loss:3.244 lr:0.0000100 epoch_Time:24692.0min: [2024-01-05 20:37:58,117][model8_pretrain.py][INFO] Epoch:[0/2](707100/4588595) loss:2.913 lr:0.0000100 epoch_Time:24692.0min: [2024-01-05 20:37:58,117][model8_pretrain.py][INFO] Epoch:[0/2](707100/4588595) loss:2.848 lr:0.0000100 epoch_Time:24692.0min: [2024-01-05 20:37:58,118][model8_pretrain.py][INFO] Epoch:[0/2](707100/4588595) loss:3.068 lr:0.0000100 epoch_Time:24692.0min: [2024-01-05 20:38:35,012][model8_pretrain.py][INFO] Epoch:[0/2](707200/4588595) loss:2.764 lr:0.0000100 epoch_Time:24691.0min: [2024-01-05 20:38:35,012][model8_pretrain.py][INFO] Epoch:[0/2](707200/4588595) loss:2.798 lr:0.0000100 epoch_Time:24691.0min: [2024-01-05 20:38:35,012][model8_pretrain.py][INFO] Epoch:[0/2](707200/4588595) loss:2.855 lr:0.0000100 epoch_Time:24691.0min: [2024-01-05 20:38:35,012][model8_pretrain.py][INFO] Epoch:[0/2](707200/4588595) loss:2.851 lr:0.0000100 epoch_Time:24691.0min: [2024-01-05 20:38:35,012][model8_pretrain.py][INFO] Epoch:[0/2](707200/4588595) loss:2.663 lr:0.0000100 epoch_Time:24691.0min: [2024-01-05 20:38:35,012][model8_pretrain.py][INFO] Epoch:[0/2](707200/4588595) loss:2.653 lr:0.0000100 epoch_Time:24691.0min: [2024-01-05 20:38:35,012][model8_pretrain.py][INFO] Epoch:[0/2](707200/4588595) loss:2.384 lr:0.0000100 
epoch_Time:24691.0min: [2024-01-05 20:38:35,012][model8_pretrain.py][INFO] Epoch:[0/2](707200/4588595) loss:2.269 lr:0.0000100 epoch_Time:24691.0min: [2024-01-05 20:39:11,945][model8_pretrain.py][INFO] Epoch:[0/2](707300/4588595) loss:2.367 lr:0.0000100 epoch_Time:24690.0min: [2024-01-05 20:39:11,945][model8_pretrain.py][INFO] Epoch:[0/2](707300/4588595) loss:3.077 lr:0.0000100 epoch_Time:24690.0min: [2024-01-05 20:39:11,945][model8_pretrain.py][INFO] Epoch:[0/2](707300/4588595) loss:3.014 lr:0.0000100 epoch_Time:24690.0min: [2024-01-05 20:39:11,945][model8_pretrain.py][INFO] Epoch:[0/2](707300/4588595) loss:2.690 lr:0.0000100 epoch_Time:24690.0min: [2024-01-05 20:39:11,945][model8_pretrain.py][INFO] Epoch:[0/2](707300/4588595) loss:2.637 lr:0.0000100 epoch_Time:24690.0min: [2024-01-05 20:39:11,945][model8_pretrain.py][INFO] Epoch:[0/2](707300/4588595) loss:2.770 lr:0.0000100 epoch_Time:24690.0min: [2024-01-05 20:39:11,945][model8_pretrain.py][INFO] Epoch:[0/2](707300/4588595) loss:3.168 lr:0.0000100 epoch_Time:24690.0min: [2024-01-05 20:39:11,945][model8_pretrain.py][INFO] Epoch:[0/2](707300/4588595) loss:3.047 lr:0.0000100 epoch_Time:24690.0min: [2024-01-05 20:39:48,879][model8_pretrain.py][INFO] Epoch:[0/2](707400/4588595) loss:2.376 lr:0.0000100 epoch_Time:24689.0min: [2024-01-05 20:39:48,879][model8_pretrain.py][INFO] Epoch:[0/2](707400/4588595) loss:2.898 lr:0.0000100 epoch_Time:24689.0min: [2024-01-05 20:39:48,879][model8_pretrain.py][INFO] Epoch:[0/2](707400/4588595) loss:2.584 lr:0.0000100 epoch_Time:24689.0min: [2024-01-05 20:39:48,879][model8_pretrain.py][INFO] Epoch:[0/2](707400/4588595) loss:2.508 lr:0.0000100 epoch_Time:24689.0min: [2024-01-05 20:39:48,879][model8_pretrain.py][INFO] Epoch:[0/2](707400/4588595) loss:3.692 lr:0.0000100 epoch_Time:24689.0min: [2024-01-05 20:39:48,879][model8_pretrain.py][INFO] Epoch:[0/2](707400/4588595) loss:2.293 lr:0.0000100 epoch_Time:24689.0min: [2024-01-05 20:39:48,879][model8_pretrain.py][INFO] 
Epoch:[0/2](707400/4588595) loss:2.251 lr:0.0000100 epoch_Time:24689.0min: [2024-01-05 20:39:48,879][model8_pretrain.py][INFO] Epoch:[0/2](707400/4588595) loss:1.743 lr:0.0000100 epoch_Time:24689.0min: [2024-01-05 20:40:25,813][model8_pretrain.py][INFO] Epoch:[0/2](707500/4588595) loss:2.648 lr:0.0000100 epoch_Time:24689.0min: [2024-01-05 20:40:25,813][model8_pretrain.py][INFO] Epoch:[0/2](707500/4588595) loss:2.609 lr:0.0000100 epoch_Time:24689.0min: [2024-01-05 20:40:25,813][model8_pretrain.py][INFO] Epoch:[0/2](707500/4588595) loss:2.654 lr:0.0000100 epoch_Time:24689.0min: [2024-01-05 20:40:25,813][model8_pretrain.py][INFO] Epoch:[0/2](707500/4588595) loss:3.219 lr:0.0000100 epoch_Time:24689.0min: [2024-01-05 20:40:25,813][model8_pretrain.py][INFO] Epoch:[0/2](707500/4588595) loss:2.507 lr:0.0000100 epoch_Time:24689.0min: [2024-01-05 20:40:25,813][model8_pretrain.py][INFO] Epoch:[0/2](707500/4588595) loss:3.086 lr:0.0000100 epoch_Time:24689.0min: [2024-01-05 20:40:25,813][model8_pretrain.py][INFO] Epoch:[0/2](707500/4588595) loss:2.982 lr:0.0000100 epoch_Time:24689.0min: [2024-01-05 20:40:25,813][model8_pretrain.py][INFO] Epoch:[0/2](707500/4588595) loss:2.570 lr:0.0000100 epoch_Time:24689.0min: [2024-01-05 20:41:02,735][model8_pretrain.py][INFO] Epoch:[0/2](707600/4588595) loss:3.464 lr:0.0000100 epoch_Time:24688.0min: [2024-01-05 20:41:02,736][model8_pretrain.py][INFO] Epoch:[0/2](707600/4588595) loss:2.961 lr:0.0000100 epoch_Time:24688.0min: [2024-01-05 20:41:02,736][model8_pretrain.py][INFO] Epoch:[0/2](707600/4588595) loss:2.486 lr:0.0000100 epoch_Time:24688.0min: [2024-01-05 20:41:02,736][model8_pretrain.py][INFO] Epoch:[0/2](707600/4588595) loss:3.071 lr:0.0000100 epoch_Time:24688.0min: [2024-01-05 20:41:02,736][model8_pretrain.py][INFO] Epoch:[0/2](707600/4588595) loss:2.444 lr:0.0000100 epoch_Time:24688.0min: [2024-01-05 20:41:02,736][model8_pretrain.py][INFO] Epoch:[0/2](707600/4588595) loss:2.619 lr:0.0000100 epoch_Time:24688.0min: [2024-01-05 
20:41:02,736][model8_pretrain.py][INFO] Epoch:[0/2](707600/4588595) loss:2.999 lr:0.0000100 epoch_Time:24688.0min: [2024-01-05 20:41:02,736][model8_pretrain.py][INFO] Epoch:[0/2](707600/4588595) loss:2.949 lr:0.0000100 epoch_Time:24688.0min: [2024-01-05 20:41:39,667][model8_pretrain.py][INFO] Epoch:[0/2](707700/4588595) loss:2.493 lr:0.0000100 epoch_Time:24688.0min: [2024-01-05 20:41:39,667][model8_pretrain.py][INFO] Epoch:[0/2](707700/4588595) loss:2.073 lr:0.0000100 epoch_Time:24688.0min: [2024-01-05 20:41:39,667][model8_pretrain.py][INFO] Epoch:[0/2](707700/4588595) loss:2.561 lr:0.0000100 epoch_Time:24688.0min: [2024-01-05 20:41:39,667][model8_pretrain.py][INFO] Epoch:[0/2](707700/4588595) loss:2.445 lr:0.0000100 epoch_Time:24688.0min: [2024-01-05 20:41:39,667][model8_pretrain.py][INFO] Epoch:[0/2](707700/4588595) loss:2.478 lr:0.0000100 epoch_Time:24688.0min: [2024-01-05 20:41:39,667][model8_pretrain.py][INFO] Epoch:[0/2](707700/4588595) loss:2.485 lr:0.0000100 epoch_Time:24688.0min: [2024-01-05 20:41:39,667][model8_pretrain.py][INFO] Epoch:[0/2](707700/4588595) loss:2.574 lr:0.0000100 epoch_Time:24688.0min: [2024-01-05 20:41:39,667][model8_pretrain.py][INFO] Epoch:[0/2](707700/4588595) loss:2.971 lr:0.0000100 epoch_Time:24688.0min: [2024-01-05 20:42:16,590][model8_pretrain.py][INFO] Epoch:[0/2](707800/4588595) loss:2.412 lr:0.0000100 epoch_Time:24687.0min: [2024-01-05 20:42:16,590][model8_pretrain.py][INFO] Epoch:[0/2](707800/4588595) loss:2.985 lr:0.0000100 epoch_Time:24687.0min: [2024-01-05 20:42:16,590][model8_pretrain.py][INFO] Epoch:[0/2](707800/4588595) loss:3.082 lr:0.0000100 epoch_Time:24687.0min: [2024-01-05 20:42:16,590][model8_pretrain.py][INFO] Epoch:[0/2](707800/4588595) loss:3.410 lr:0.0000100 epoch_Time:24687.0min: [2024-01-05 20:42:16,590][model8_pretrain.py][INFO] Epoch:[0/2](707800/4588595) loss:2.649 lr:0.0000100 epoch_Time:24687.0min: [2024-01-05 20:42:16,590][model8_pretrain.py][INFO] Epoch:[0/2](707800/4588595) loss:2.937 lr:0.0000100 
epoch_Time:24687.0min: [2024-01-05 20:42:16,590][model8_pretrain.py][INFO] Epoch:[0/2](707800/4588595) loss:2.782 lr:0.0000100 epoch_Time:24687.0min: [2024-01-05 20:42:16,590][model8_pretrain.py][INFO] Epoch:[0/2](707800/4588595) loss:2.990 lr:0.0000100 epoch_Time:24687.0min: [2024-01-05 20:43:05,361][model8_pretrain.py][INFO] Epoch:[0/2](707900/4588595) loss:2.650 lr:0.0000100 epoch_Time:24687.0min: [2024-01-05 20:43:05,361][model8_pretrain.py][INFO] Epoch:[0/2](707900/4588595) loss:2.988 lr:0.0000100 epoch_Time:24687.0min: [2024-01-05 20:43:05,361][model8_pretrain.py][INFO] Epoch:[0/2](707900/4588595) loss:2.960 lr:0.0000100 epoch_Time:24687.0min: [2024-01-05 20:43:05,361][model8_pretrain.py][INFO] Epoch:[0/2](707900/4588595) loss:2.898 lr:0.0000100 epoch_Time:24687.0min: [2024-01-05 20:43:05,361][model8_pretrain.py][INFO] Epoch:[0/2](707900/4588595) loss:2.925 lr:0.0000100 epoch_Time:24687.0min: [2024-01-05 20:43:05,361][model8_pretrain.py][INFO] Epoch:[0/2](707900/4588595) loss:2.786 lr:0.0000100 epoch_Time:24687.0min: [2024-01-05 20:43:05,361][model8_pretrain.py][INFO] Epoch:[0/2](707900/4588595) loss:2.840 lr:0.0000100 epoch_Time:24687.0min: [2024-01-05 20:43:05,361][model8_pretrain.py][INFO] Epoch:[0/2](707900/4588595) loss:2.736 lr:0.0000100 epoch_Time:24687.0min: [2024-01-05 20:43:42,282][model8_pretrain.py][INFO] Epoch:[0/2](708000/4588595) loss:2.600 lr:0.0000100 epoch_Time:24687.0min: [2024-01-05 20:43:42,282][model8_pretrain.py][INFO] Epoch:[0/2](708000/4588595) loss:2.585 lr:0.0000100 epoch_Time:24687.0min: [2024-01-05 20:43:42,282][model8_pretrain.py][INFO] Epoch:[0/2](708000/4588595) loss:3.437 lr:0.0000100 epoch_Time:24687.0min: [2024-01-05 20:43:42,282][model8_pretrain.py][INFO] Epoch:[0/2](708000/4588595) loss:2.909 lr:0.0000100 epoch_Time:24687.0min: [2024-01-05 20:43:42,282][model8_pretrain.py][INFO] Epoch:[0/2](708000/4588595) loss:2.852 lr:0.0000100 epoch_Time:24687.0min: [2024-01-05 20:43:42,282][model8_pretrain.py][INFO] 
Epoch:[0/2](708000/4588595) loss:2.940 lr:0.0000100 epoch_Time:24687.0min: [2024-01-05 20:43:42,282][model8_pretrain.py][INFO] Epoch:[0/2](708000/4588595) loss:2.685 lr:0.0000100 epoch_Time:24687.0min: [2024-01-05 20:43:42,283][model8_pretrain.py][INFO] Epoch:[0/2](708000/4588595) loss:2.908 lr:0.0000100 epoch_Time:24687.0min: [2024-01-05 20:44:19,227][model8_pretrain.py][INFO] Epoch:[0/2](708100/4588595) loss:3.003 lr:0.0000100 epoch_Time:24685.0min: [2024-01-05 20:44:19,227][model8_pretrain.py][INFO] Epoch:[0/2](708100/4588595) loss:3.384 lr:0.0000100 epoch_Time:24685.0min: [2024-01-05 20:44:19,227][model8_pretrain.py][INFO] Epoch:[0/2](708100/4588595) loss:2.456 lr:0.0000100 epoch_Time:24685.0min: [2024-01-05 20:44:19,227][model8_pretrain.py][INFO] Epoch:[0/2](708100/4588595) loss:3.175 lr:0.0000100 epoch_Time:24685.0min: [2024-01-05 20:44:19,227][model8_pretrain.py][INFO] Epoch:[0/2](708100/4588595) loss:2.678 lr:0.0000100 epoch_Time:24685.0min: [2024-01-05 20:44:19,227][model8_pretrain.py][INFO] Epoch:[0/2](708100/4588595) loss:2.795 lr:0.0000100 epoch_Time:24685.0min: [2024-01-05 20:44:19,227][model8_pretrain.py][INFO] Epoch:[0/2](708100/4588595) loss:2.377 lr:0.0000100 epoch_Time:24685.0min: [2024-01-05 20:44:19,227][model8_pretrain.py][INFO] Epoch:[0/2](708100/4588595) loss:2.437 lr:0.0000100 epoch_Time:24685.0min: [2024-01-05 20:44:56,168][model8_pretrain.py][INFO] Epoch:[0/2](708200/4588595) loss:3.002 lr:0.0000100 epoch_Time:24684.0min: [2024-01-05 20:44:56,168][model8_pretrain.py][INFO] Epoch:[0/2](708200/4588595) loss:2.535 lr:0.0000100 epoch_Time:24684.0min: [2024-01-05 20:44:56,168][model8_pretrain.py][INFO] Epoch:[0/2](708200/4588595) loss:2.787 lr:0.0000100 epoch_Time:24684.0min: [2024-01-05 20:44:56,168][model8_pretrain.py][INFO] Epoch:[0/2](708200/4588595) loss:2.764 lr:0.0000100 epoch_Time:24684.0min: [2024-01-05 20:44:56,169][model8_pretrain.py][INFO] Epoch:[0/2](708200/4588595) loss:2.938 lr:0.0000100 epoch_Time:24684.0min: [2024-01-05 
20:44:56,168][model8_pretrain.py][INFO] Epoch:[0/2](708200/4588595) loss:2.550 lr:0.0000100 epoch_Time:24684.0min: [2024-01-05 20:44:56,169][model8_pretrain.py][INFO] Epoch:[0/2](708200/4588595) loss:2.389 lr:0.0000100 epoch_Time:24684.0min: [2024-01-05 20:44:56,169][model8_pretrain.py][INFO] Epoch:[0/2](708200/4588595) loss:3.104 lr:0.0000100 epoch_Time:24684.0min: [2024-01-05 20:45:33,106][model8_pretrain.py][INFO] Epoch:[0/2](708300/4588595) loss:2.954 lr:0.0000100 epoch_Time:24684.0min: [2024-01-05 20:45:33,106][model8_pretrain.py][INFO] Epoch:[0/2](708300/4588595) loss:3.101 lr:0.0000100 epoch_Time:24684.0min: [2024-01-05 20:45:33,106][model8_pretrain.py][INFO] Epoch:[0/2](708300/4588595) loss:2.844 lr:0.0000100 epoch_Time:24684.0min: [2024-01-05 20:45:33,106][model8_pretrain.py][INFO] Epoch:[0/2](708300/4588595) loss:2.623 lr:0.0000100 epoch_Time:24684.0min: [2024-01-05 20:45:33,106][model8_pretrain.py][INFO] Epoch:[0/2](708300/4588595) loss:2.195 lr:0.0000100 epoch_Time:24684.0min: [2024-01-05 20:45:33,106][model8_pretrain.py][INFO] Epoch:[0/2](708300/4588595) loss:3.445 lr:0.0000100 epoch_Time:24684.0min: [2024-01-05 20:45:33,106][model8_pretrain.py][INFO] Epoch:[0/2](708300/4588595) loss:2.690 lr:0.0000100 epoch_Time:24684.0min: [2024-01-05 20:45:33,106][model8_pretrain.py][INFO] Epoch:[0/2](708300/4588595) loss:2.802 lr:0.0000100 epoch_Time:24684.0min: [2024-01-05 20:46:10,049][model8_pretrain.py][INFO] Epoch:[0/2](708400/4588595) loss:2.747 lr:0.0000100 epoch_Time:24683.0min: [2024-01-05 20:46:10,049][model8_pretrain.py][INFO] Epoch:[0/2](708400/4588595) loss:3.071 lr:0.0000100 epoch_Time:24683.0min: [2024-01-05 20:46:10,049][model8_pretrain.py][INFO] Epoch:[0/2](708400/4588595) loss:2.734 lr:0.0000100 epoch_Time:24683.0min: [2024-01-05 20:46:10,049][model8_pretrain.py][INFO] Epoch:[0/2](708400/4588595) loss:2.565 lr:0.0000100 epoch_Time:24683.0min: [2024-01-05 20:46:10,049][model8_pretrain.py][INFO] Epoch:[0/2](708400/4588595) loss:2.503 lr:0.0000100 
epoch_Time:24683.0min: [2024-01-05 20:46:10,049][model8_pretrain.py][INFO] Epoch:[0/2](708400/4588595) loss:2.577 lr:0.0000100 epoch_Time:24683.0min: [2024-01-05 20:46:10,049][model8_pretrain.py][INFO] Epoch:[0/2](708400/4588595) loss:2.733 lr:0.0000100 epoch_Time:24683.0min: [2024-01-05 20:46:10,049][model8_pretrain.py][INFO] Epoch:[0/2](708400/4588595) loss:2.924 lr:0.0000100 epoch_Time:24683.0min: [2024-01-05 20:46:46,992][model8_pretrain.py][INFO] Epoch:[0/2](708500/4588595) loss:3.039 lr:0.0000100 epoch_Time:24683.0min: [2024-01-05 20:46:46,992][model8_pretrain.py][INFO] Epoch:[0/2](708500/4588595) loss:1.946 lr:0.0000100 epoch_Time:24683.0min: [2024-01-05 20:46:46,992][model8_pretrain.py][INFO] Epoch:[0/2](708500/4588595) loss:2.677 lr:0.0000100 epoch_Time:24683.0min: [2024-01-05 20:46:46,992][model8_pretrain.py][INFO] Epoch:[0/2](708500/4588595) loss:2.861 lr:0.0000100 epoch_Time:24683.0min: [2024-01-05 20:46:46,992][model8_pretrain.py][INFO] Epoch:[0/2](708500/4588595) loss:2.520 lr:0.0000100 epoch_Time:24683.0min: [2024-01-05 20:46:46,992][model8_pretrain.py][INFO] Epoch:[0/2](708500/4588595) loss:2.441 lr:0.0000100 epoch_Time:24683.0min: [2024-01-05 20:46:46,992][model8_pretrain.py][INFO] Epoch:[0/2](708500/4588595) loss:2.822 lr:0.0000100 epoch_Time:24683.0min: [2024-01-05 20:46:46,992][model8_pretrain.py][INFO] Epoch:[0/2](708500/4588595) loss:2.987 lr:0.0000100 epoch_Time:24683.0min: [2024-01-05 20:47:23,928][model8_pretrain.py][INFO] Epoch:[0/2](708600/4588595) loss:2.698 lr:0.0000100 epoch_Time:24682.0min: [2024-01-05 20:47:23,928][model8_pretrain.py][INFO] Epoch:[0/2](708600/4588595) loss:2.197 lr:0.0000100 epoch_Time:24682.0min: [2024-01-05 20:47:23,928][model8_pretrain.py][INFO] Epoch:[0/2](708600/4588595) loss:2.918 lr:0.0000100 epoch_Time:24682.0min: [2024-01-05 20:47:23,928][model8_pretrain.py][INFO] Epoch:[0/2](708600/4588595) loss:2.610 lr:0.0000100 epoch_Time:24682.0min: [2024-01-05 20:47:23,928][model8_pretrain.py][INFO] 
Epoch:[0/2](708600/4588595) loss:3.118 lr:0.0000100 epoch_Time:24682.0min: [2024-01-05 20:47:23,928][model8_pretrain.py][INFO] Epoch:[0/2](708600/4588595) loss:3.030 lr:0.0000100 epoch_Time:24682.0min: [2024-01-05 20:47:23,928][model8_pretrain.py][INFO] Epoch:[0/2](708600/4588595) loss:2.876 lr:0.0000100 epoch_Time:24682.0min: [2024-01-05 20:47:23,928][model8_pretrain.py][INFO] Epoch:[0/2](708600/4588595) loss:2.920 lr:0.0000100 epoch_Time:24682.0min: [2024-01-05 20:48:12,892][model8_pretrain.py][INFO] Epoch:[0/2](708700/4588595) loss:2.584 lr:0.0000100 epoch_Time:24682.0min: [2024-01-05 20:48:12,892][model8_pretrain.py][INFO] Epoch:[0/2](708700/4588595) loss:2.889 lr:0.0000100 epoch_Time:24682.0min: [2024-01-05 20:48:12,892][model8_pretrain.py][INFO] Epoch:[0/2](708700/4588595) loss:2.891 lr:0.0000100 epoch_Time:24682.0min: [2024-01-05 20:48:12,892][model8_pretrain.py][INFO] Epoch:[0/2](708700/4588595) loss:2.868 lr:0.0000100 epoch_Time:24682.0min: [2024-01-05 20:48:12,892][model8_pretrain.py][INFO] Epoch:[0/2](708700/4588595) loss:2.938 lr:0.0000100 epoch_Time:24682.0min: [2024-01-05 20:48:12,892][model8_pretrain.py][INFO] Epoch:[0/2](708700/4588595) loss:3.186 lr:0.0000100 epoch_Time:24682.0min: [2024-01-05 20:48:12,892][model8_pretrain.py][INFO] Epoch:[0/2](708700/4588595) loss:3.113 lr:0.0000100 epoch_Time:24682.0min: [2024-01-05 20:48:12,892][model8_pretrain.py][INFO] Epoch:[0/2](708700/4588595) loss:2.444 lr:0.0000100 epoch_Time:24682.0min: [2024-01-05 20:48:49,824][model8_pretrain.py][INFO] Epoch:[0/2](708800/4588595) loss:3.032 lr:0.0000100 epoch_Time:24681.0min: [2024-01-05 20:48:49,824][model8_pretrain.py][INFO] Epoch:[0/2](708800/4588595) loss:2.639 lr:0.0000100 epoch_Time:24681.0min: [2024-01-05 20:48:49,824][model8_pretrain.py][INFO] Epoch:[0/2](708800/4588595) loss:2.613 lr:0.0000100 epoch_Time:24681.0min: [2024-01-05 20:48:49,824][model8_pretrain.py][INFO] Epoch:[0/2](708800/4588595) loss:3.160 lr:0.0000100 epoch_Time:24681.0min: [2024-01-05 
20:48:49,824][model8_pretrain.py][INFO] Epoch:[0/2](708800/4588595) loss:2.417 lr:0.0000100 epoch_Time:24681.0min: [2024-01-05 20:48:49,824][model8_pretrain.py][INFO] Epoch:[0/2](708800/4588595) loss:3.092 lr:0.0000100 epoch_Time:24681.0min: [2024-01-05 20:48:49,824][model8_pretrain.py][INFO] Epoch:[0/2](708800/4588595) loss:3.118 lr:0.0000100 epoch_Time:24681.0min: [2024-01-05 20:48:49,824][model8_pretrain.py][INFO] Epoch:[0/2](708800/4588595) loss:2.909 lr:0.0000100 epoch_Time:24681.0min: [2024-01-05 20:49:26,771][model8_pretrain.py][INFO] Epoch:[0/2](708900/4588595) loss:2.815 lr:0.0000100 epoch_Time:24681.0min: [2024-01-05 20:49:26,771][model8_pretrain.py][INFO] Epoch:[0/2](708900/4588595) loss:3.111 lr:0.0000100 epoch_Time:24681.0min: [2024-01-05 20:49:26,771][model8_pretrain.py][INFO] Epoch:[0/2](708900/4588595) loss:3.312 lr:0.0000100 epoch_Time:24681.0min: [2024-01-05 20:49:26,771][model8_pretrain.py][INFO] Epoch:[0/2](708900/4588595) loss:2.733 lr:0.0000100 epoch_Time:24681.0min: [2024-01-05 20:49:26,771][model8_pretrain.py][INFO] Epoch:[0/2](708900/4588595) loss:2.541 lr:0.0000100 epoch_Time:24681.0min: [2024-01-05 20:49:26,771][model8_pretrain.py][INFO] Epoch:[0/2](708900/4588595) loss:2.585 lr:0.0000100 epoch_Time:24681.0min: [2024-01-05 20:49:26,771][model8_pretrain.py][INFO] Epoch:[0/2](708900/4588595) loss:3.171 lr:0.0000100 epoch_Time:24681.0min: [2024-01-05 20:49:26,772][model8_pretrain.py][INFO] Epoch:[0/2](708900/4588595) loss:2.758 lr:0.0000100 epoch_Time:24681.0min: [2024-01-05 20:50:03,722][model8_pretrain.py][INFO] Epoch:[0/2](709000/4588595) loss:2.154 lr:0.0000100 epoch_Time:24680.0min: [2024-01-05 20:50:03,722][model8_pretrain.py][INFO] Epoch:[0/2](709000/4588595) loss:3.017 lr:0.0000100 epoch_Time:24680.0min: [2024-01-05 20:50:03,722][model8_pretrain.py][INFO] Epoch:[0/2](709000/4588595) loss:2.875 lr:0.0000100 epoch_Time:24680.0min: [2024-01-05 20:50:03,722][model8_pretrain.py][INFO] Epoch:[0/2](709000/4588595) loss:2.849 lr:0.0000100 
epoch_Time:24680.0min: [2024-01-05 20:50:03,722][model8_pretrain.py][INFO] Epoch:[0/2](709000/4588595) loss:2.910 lr:0.0000100 epoch_Time:24680.0min: [2024-01-05 20:50:03,722][model8_pretrain.py][INFO] Epoch:[0/2](709000/4588595) loss:2.593 lr:0.0000100 epoch_Time:24680.0min: [2024-01-05 20:50:03,722][model8_pretrain.py][INFO] Epoch:[0/2](709000/4588595) loss:2.972 lr:0.0000100 epoch_Time:24680.0min: [2024-01-05 20:50:03,722][model8_pretrain.py][INFO] Epoch:[0/2](709000/4588595) loss:3.193 lr:0.0000100 epoch_Time:24680.0min: [2024-01-05 20:50:40,645][model8_pretrain.py][INFO] Epoch:[0/2](709100/4588595) loss:2.912 lr:0.0000100 epoch_Time:24679.0min: [2024-01-05 20:50:40,645][model8_pretrain.py][INFO] Epoch:[0/2](709100/4588595) loss:2.784 lr:0.0000100 epoch_Time:24679.0min: [2024-01-05 20:50:40,645][model8_pretrain.py][INFO] Epoch:[0/2](709100/4588595) loss:3.431 lr:0.0000100 epoch_Time:24679.0min: [2024-01-05 20:50:40,645][model8_pretrain.py][INFO] Epoch:[0/2](709100/4588595) loss:2.793 lr:0.0000100 epoch_Time:24679.0min: [2024-01-05 20:50:40,645][model8_pretrain.py][INFO] Epoch:[0/2](709100/4588595) loss:2.905 lr:0.0000100 epoch_Time:24679.0min: [2024-01-05 20:50:40,645][model8_pretrain.py][INFO] Epoch:[0/2](709100/4588595) loss:2.827 lr:0.0000100 epoch_Time:24679.0min: [2024-01-05 20:50:40,646][model8_pretrain.py][INFO] Epoch:[0/2](709100/4588595) loss:2.861 lr:0.0000100 epoch_Time:24679.0min: [2024-01-05 20:50:40,646][model8_pretrain.py][INFO] Epoch:[0/2](709100/4588595) loss:2.576 lr:0.0000100 epoch_Time:24679.0min: [2024-01-05 20:51:17,590][model8_pretrain.py][INFO] Epoch:[0/2](709200/4588595) loss:2.615 lr:0.0000100 epoch_Time:24678.0min: [2024-01-05 20:51:17,590][model8_pretrain.py][INFO] Epoch:[0/2](709200/4588595) loss:2.349 lr:0.0000100 epoch_Time:24678.0min: [2024-01-05 20:51:17,590][model8_pretrain.py][INFO] Epoch:[0/2](709200/4588595) loss:2.769 lr:0.0000100 epoch_Time:24678.0min: [2024-01-05 20:51:17,590][model8_pretrain.py][INFO] 
Epoch:[0/2](709200/4588595) loss:2.966 lr:0.0000100 epoch_Time:24678.0min: [2024-01-05 20:51:17,590][model8_pretrain.py][INFO] Epoch:[0/2](709200/4588595) loss:2.552 lr:0.0000100 epoch_Time:24678.0min: [2024-01-05 20:51:17,590][model8_pretrain.py][INFO] Epoch:[0/2](709200/4588595) loss:2.590 lr:0.0000100 epoch_Time:24678.0min: [2024-01-05 20:51:17,590][model8_pretrain.py][INFO] Epoch:[0/2](709200/4588595) loss:2.744 lr:0.0000100 epoch_Time:24678.0min: [2024-01-05 20:51:17,590][model8_pretrain.py][INFO] Epoch:[0/2](709200/4588595) loss:2.615 lr:0.0000100 epoch_Time:24678.0min: [2024-01-05 20:51:54,530][model8_pretrain.py][INFO] Epoch:[0/2](709300/4588595) loss:3.080 lr:0.0000100 epoch_Time:24677.0min: [2024-01-05 20:51:54,530][model8_pretrain.py][INFO] Epoch:[0/2](709300/4588595) loss:2.811 lr:0.0000100 epoch_Time:24677.0min: [2024-01-05 20:51:54,530][model8_pretrain.py][INFO] Epoch:[0/2](709300/4588595) loss:2.649 lr:0.0000100 epoch_Time:24677.0min: [2024-01-05 20:51:54,530][model8_pretrain.py][INFO] Epoch:[0/2](709300/4588595) loss:2.895 lr:0.0000100 epoch_Time:24677.0min: [2024-01-05 20:51:54,530][model8_pretrain.py][INFO] Epoch:[0/2](709300/4588595) loss:3.147 lr:0.0000100 epoch_Time:24677.0min: [2024-01-05 20:51:54,530][model8_pretrain.py][INFO] Epoch:[0/2](709300/4588595) loss:2.496 lr:0.0000100 epoch_Time:24677.0min: [2024-01-05 20:51:54,530][model8_pretrain.py][INFO] Epoch:[0/2](709300/4588595) loss:2.840 lr:0.0000100 epoch_Time:24677.0min: [2024-01-05 20:51:54,530][model8_pretrain.py][INFO] Epoch:[0/2](709300/4588595) loss:2.381 lr:0.0000100 epoch_Time:24677.0min: [2024-01-05 20:52:31,471][model8_pretrain.py][INFO] Epoch:[0/2](709400/4588595) loss:2.428 lr:0.0000100 epoch_Time:24677.0min: [2024-01-05 20:52:31,471][model8_pretrain.py][INFO] Epoch:[0/2](709400/4588595) loss:3.035 lr:0.0000100 epoch_Time:24677.0min: [2024-01-05 20:52:31,471][model8_pretrain.py][INFO] Epoch:[0/2](709400/4588595) loss:2.956 lr:0.0000100 epoch_Time:24677.0min: [2024-01-05 
20:52:31,471][model8_pretrain.py][INFO] Epoch:[0/2](709400/4588595) loss:3.115 lr:0.0000100 epoch_Time:24677.0min: [2024-01-05 20:52:31,471][model8_pretrain.py][INFO] Epoch:[0/2](709400/4588595) loss:2.777 lr:0.0000100 epoch_Time:24677.0min: [2024-01-05 20:52:31,471][model8_pretrain.py][INFO] Epoch:[0/2](709400/4588595) loss:2.062 lr:0.0000100 epoch_Time:24677.0min: [2024-01-05 20:52:31,471][model8_pretrain.py][INFO] Epoch:[0/2](709400/4588595) loss:2.734 lr:0.0000100 epoch_Time:24677.0min: [2024-01-05 20:52:31,471][model8_pretrain.py][INFO] Epoch:[0/2](709400/4588595) loss:3.255 lr:0.0000100 epoch_Time:24677.0min: [2024-01-05 20:53:20,536][model8_pretrain.py][INFO] Epoch:[0/2](709500/4588595) loss:3.161 lr:0.0000100 epoch_Time:24677.0min: [2024-01-05 20:53:20,536][model8_pretrain.py][INFO] Epoch:[0/2](709500/4588595) loss:2.674 lr:0.0000100 epoch_Time:24677.0min: [2024-01-05 20:53:20,536][model8_pretrain.py][INFO] Epoch:[0/2](709500/4588595) loss:2.764 lr:0.0000100 epoch_Time:24677.0min: [2024-01-05 20:53:20,536][model8_pretrain.py][INFO] Epoch:[0/2](709500/4588595) loss:3.046 lr:0.0000100 epoch_Time:24677.0min: [2024-01-05 20:53:20,536][model8_pretrain.py][INFO] Epoch:[0/2](709500/4588595) loss:3.337 lr:0.0000100 epoch_Time:24677.0min: [2024-01-05 20:53:20,536][model8_pretrain.py][INFO] Epoch:[0/2](709500/4588595) loss:3.067 lr:0.0000100 epoch_Time:24677.0min: [2024-01-05 20:53:20,537][model8_pretrain.py][INFO] Epoch:[0/2](709500/4588595) loss:3.140 lr:0.0000100 epoch_Time:24677.0min: [2024-01-05 20:53:20,537][model8_pretrain.py][INFO] Epoch:[0/2](709500/4588595) loss:3.140 lr:0.0000100 epoch_Time:24677.0min: [2024-01-05 20:53:57,447][model8_pretrain.py][INFO] Epoch:[0/2](709600/4588595) loss:2.500 lr:0.0000100 epoch_Time:24676.0min: [2024-01-05 20:53:57,447][model8_pretrain.py][INFO] Epoch:[0/2](709600/4588595) loss:3.357 lr:0.0000100 epoch_Time:24676.0min: [2024-01-05 20:53:57,447][model8_pretrain.py][INFO] Epoch:[0/2](709600/4588595) loss:2.505 lr:0.0000100 
epoch_Time:24676.0min: [2024-01-05 20:53:57,447][model8_pretrain.py][INFO] Epoch:[0/2](709600/4588595) loss:3.103 lr:0.0000100 epoch_Time:24676.0min: [2024-01-05 20:53:57,447][model8_pretrain.py][INFO] Epoch:[0/2](709600/4588595) loss:3.188 lr:0.0000100 epoch_Time:24676.0min: [2024-01-05 20:53:57,447][model8_pretrain.py][INFO] Epoch:[0/2](709600/4588595) loss:2.938 lr:0.0000100 epoch_Time:24676.0min: [2024-01-05 20:53:57,447][model8_pretrain.py][INFO] Epoch:[0/2](709600/4588595) loss:2.765 lr:0.0000100 epoch_Time:24676.0min: [2024-01-05 20:53:57,447][model8_pretrain.py][INFO] Epoch:[0/2](709600/4588595) loss:3.003 lr:0.0000100 epoch_Time:24676.0min: [2024-01-05 20:54:34,377][model8_pretrain.py][INFO] Epoch:[0/2](709700/4588595) loss:2.451 lr:0.0000100 epoch_Time:24676.0min: [2024-01-05 20:54:34,377][model8_pretrain.py][INFO] Epoch:[0/2](709700/4588595) loss:2.945 lr:0.0000100 epoch_Time:24676.0min: [2024-01-05 20:54:34,377][model8_pretrain.py][INFO] Epoch:[0/2](709700/4588595) loss:2.401 lr:0.0000100 epoch_Time:24676.0min: [2024-01-05 20:54:34,377][model8_pretrain.py][INFO] Epoch:[0/2](709700/4588595) loss:2.339 lr:0.0000100 epoch_Time:24676.0min: [2024-01-05 20:54:34,377][model8_pretrain.py][INFO] Epoch:[0/2](709700/4588595) loss:2.699 lr:0.0000100 epoch_Time:24676.0min: [2024-01-05 20:54:34,377][model8_pretrain.py][INFO] Epoch:[0/2](709700/4588595) loss:3.104 lr:0.0000100 epoch_Time:24676.0min: [2024-01-05 20:54:34,377][model8_pretrain.py][INFO] Epoch:[0/2](709700/4588595) loss:3.316 lr:0.0000100 epoch_Time:24676.0min: [2024-01-05 20:54:34,377][model8_pretrain.py][INFO] Epoch:[0/2](709700/4588595) loss:3.075 lr:0.0000100 epoch_Time:24676.0min: [2024-01-05 20:55:11,299][model8_pretrain.py][INFO] Epoch:[0/2](709800/4588595) loss:2.738 lr:0.0000100 epoch_Time:24675.0min: [2024-01-05 20:55:11,300][model8_pretrain.py][INFO] Epoch:[0/2](709800/4588595) loss:2.902 lr:0.0000100 epoch_Time:24675.0min: [2024-01-05 20:55:11,300][model8_pretrain.py][INFO] 
Epoch:[0/2](709800/4588595) loss:2.860 lr:0.0000100 epoch_Time:24675.0min: [2024-01-05 20:55:11,300][model8_pretrain.py][INFO] Epoch:[0/2](709800/4588595) loss:3.115 lr:0.0000100 epoch_Time:24675.0min: [2024-01-05 20:55:11,300][model8_pretrain.py][INFO] Epoch:[0/2](709800/4588595) loss:3.039 lr:0.0000100 epoch_Time:24675.0min: [2024-01-05 20:55:11,300][model8_pretrain.py][INFO] Epoch:[0/2](709800/4588595) loss:2.491 lr:0.0000100 epoch_Time:24675.0min: [2024-01-05 20:55:11,300][model8_pretrain.py][INFO] Epoch:[0/2](709800/4588595) loss:2.624 lr:0.0000100 epoch_Time:24675.0min: [2024-01-05 20:55:11,300][model8_pretrain.py][INFO] Epoch:[0/2](709800/4588595) loss:3.109 lr:0.0000100 epoch_Time:24675.0min: [2024-01-05 20:55:48,223][model8_pretrain.py][INFO] Epoch:[0/2](709900/4588595) loss:2.428 lr:0.0000100 epoch_Time:24674.0min: [2024-01-05 20:55:48,223][model8_pretrain.py][INFO] Epoch:[0/2](709900/4588595) loss:2.527 lr:0.0000100 epoch_Time:24674.0min: [2024-01-05 20:55:48,223][model8_pretrain.py][INFO] Epoch:[0/2](709900/4588595) loss:3.014 lr:0.0000100 epoch_Time:24674.0min: [2024-01-05 20:55:48,223][model8_pretrain.py][INFO] Epoch:[0/2](709900/4588595) loss:2.856 lr:0.0000100 epoch_Time:24674.0min: [2024-01-05 20:55:48,223][model8_pretrain.py][INFO] Epoch:[0/2](709900/4588595) loss:2.965 lr:0.0000100 epoch_Time:24674.0min: [2024-01-05 20:55:48,223][model8_pretrain.py][INFO] Epoch:[0/2](709900/4588595) loss:2.677 lr:0.0000100 epoch_Time:24674.0min: [2024-01-05 20:55:48,223][model8_pretrain.py][INFO] Epoch:[0/2](709900/4588595) loss:3.000 lr:0.0000100 epoch_Time:24674.0min: [2024-01-05 20:55:48,223][model8_pretrain.py][INFO] Epoch:[0/2](709900/4588595) loss:2.574 lr:0.0000100 epoch_Time:24674.0min: [2024-01-05 20:56:25,152][model8_pretrain.py][INFO] Epoch:[0/2](710000/4588595) loss:3.157 lr:0.0000100 epoch_Time:24674.0min: [2024-01-05 20:56:25,152][model8_pretrain.py][INFO] Epoch:[0/2](710000/4588595) loss:1.977 lr:0.0000100 epoch_Time:24674.0min: [2024-01-05 
20:56:25,152][model8_pretrain.py][INFO] Epoch:[0/2](710000/4588595) loss:2.635 lr:0.0000100 epoch_Time:24674.0min: [2024-01-05 20:56:25,152][model8_pretrain.py][INFO] Epoch:[0/2](710000/4588595) loss:3.092 lr:0.0000100 epoch_Time:24674.0min: [2024-01-05 20:56:25,152][model8_pretrain.py][INFO] Epoch:[0/2](710000/4588595) loss:3.072 lr:0.0000100 epoch_Time:24674.0min: [2024-01-05 20:56:25,152][model8_pretrain.py][INFO] Epoch:[0/2](710000/4588595) loss:2.632 lr:0.0000100 epoch_Time:24674.0min: [2024-01-05 20:56:25,152][model8_pretrain.py][INFO] Epoch:[0/2](710000/4588595) loss:2.749 lr:0.0000100 epoch_Time:24674.0min: [2024-01-05 20:56:25,152][model8_pretrain.py][INFO] Epoch:[0/2](710000/4588595) loss:2.856 lr:0.0000100 epoch_Time:24674.0min: [2024-01-05 20:57:02,072][model8_pretrain.py][INFO] Epoch:[0/2](710100/4588595) loss:2.741 lr:0.0000100 epoch_Time:24672.0min: [2024-01-05 20:57:02,072][model8_pretrain.py][INFO] Epoch:[0/2](710100/4588595) loss:3.080 lr:0.0000100 epoch_Time:24672.0min: [2024-01-05 20:57:02,072][model8_pretrain.py][INFO] Epoch:[0/2](710100/4588595) loss:3.059 lr:0.0000100 epoch_Time:24672.0min: [2024-01-05 20:57:02,072][model8_pretrain.py][INFO] Epoch:[0/2](710100/4588595) loss:3.014 lr:0.0000100 epoch_Time:24672.0min: [2024-01-05 20:57:02,072][model8_pretrain.py][INFO] Epoch:[0/2](710100/4588595) loss:2.838 lr:0.0000100 epoch_Time:24672.0min: [2024-01-05 20:57:02,072][model8_pretrain.py][INFO] Epoch:[0/2](710100/4588595) loss:2.362 lr:0.0000100 epoch_Time:24672.0min: [2024-01-05 20:57:02,072][model8_pretrain.py][INFO] Epoch:[0/2](710100/4588595) loss:2.845 lr:0.0000100 epoch_Time:24672.0min: [2024-01-05 20:57:02,072][model8_pretrain.py][INFO] Epoch:[0/2](710100/4588595) loss:3.463 lr:0.0000100 epoch_Time:24672.0min: [2024-01-05 20:57:38,988][model8_pretrain.py][INFO] Epoch:[0/2](710200/4588595) loss:2.844 lr:0.0000100 epoch_Time:24672.0min: [2024-01-05 20:57:38,988][model8_pretrain.py][INFO] Epoch:[0/2](710200/4588595) loss:2.639 lr:0.0000100 
epoch_Time:24672.0min: [2024-01-05 20:57:38,988][model8_pretrain.py][INFO] Epoch:[0/2](710200/4588595) loss:2.870 lr:0.0000100 epoch_Time:24672.0min: [2024-01-05 20:57:38,988][model8_pretrain.py][INFO] Epoch:[0/2](710200/4588595) loss:1.558 lr:0.0000100 epoch_Time:24672.0min: [2024-01-05 20:57:38,988][model8_pretrain.py][INFO] Epoch:[0/2](710200/4588595) loss:3.163 lr:0.0000100 epoch_Time:24672.0min: [2024-01-05 20:57:38,988][model8_pretrain.py][INFO] Epoch:[0/2](710200/4588595) loss:2.949 lr:0.0000100 epoch_Time:24672.0min: [2024-01-05 20:57:38,988][model8_pretrain.py][INFO] Epoch:[0/2](710200/4588595) loss:3.221 lr:0.0000100 epoch_Time:24672.0min: [2024-01-05 20:57:38,988][model8_pretrain.py][INFO] Epoch:[0/2](710200/4588595) loss:3.110 lr:0.0000100 epoch_Time:24672.0min: [2024-01-05 20:58:27,914][model8_pretrain.py][INFO] Epoch:[0/2](710300/4588595) loss:2.967 lr:0.0000100 epoch_Time:24672.0min: [2024-01-05 20:58:27,918][model8_pretrain.py][INFO] Epoch:[0/2](710300/4588595) loss:2.650 lr:0.0000100 epoch_Time:24672.0min: [2024-01-05 20:58:27,918][model8_pretrain.py][INFO] Epoch:[0/2](710300/4588595) loss:2.840 lr:0.0000100 epoch_Time:24672.0min: [2024-01-05 20:58:27,918][model8_pretrain.py][INFO] Epoch:[0/2](710300/4588595) loss:3.041 lr:0.0000100 epoch_Time:24672.0min: [2024-01-05 20:58:27,918][model8_pretrain.py][INFO] Epoch:[0/2](710300/4588595) loss:3.181 lr:0.0000100 epoch_Time:24672.0min: [2024-01-05 20:58:27,918][model8_pretrain.py][INFO] Epoch:[0/2](710300/4588595) loss:3.107 lr:0.0000100 epoch_Time:24672.0min: [2024-01-05 20:58:27,918][model8_pretrain.py][INFO] Epoch:[0/2](710300/4588595) loss:2.986 lr:0.0000100 epoch_Time:24672.0min: [2024-01-05 20:58:27,919][model8_pretrain.py][INFO] Epoch:[0/2](710300/4588595) loss:2.866 lr:0.0000100 epoch_Time:24672.0min: [2024-01-05 20:59:04,838][model8_pretrain.py][INFO] Epoch:[0/2](710400/4588595) loss:3.041 lr:0.0000100 epoch_Time:24671.0min: [2024-01-05 20:59:04,838][model8_pretrain.py][INFO] 
Epoch:[0/2](710400/4588595) loss:2.682 lr:0.0000100 epoch_Time:24671.0min: [2024-01-05 20:59:04,838][model8_pretrain.py][INFO] Epoch:[0/2](710400/4588595) loss:2.774 lr:0.0000100 epoch_Time:24671.0min: [2024-01-05 20:59:04,838][model8_pretrain.py][INFO] Epoch:[0/2](710400/4588595) loss:2.938 lr:0.0000100 epoch_Time:24671.0min: [2024-01-05 20:59:04,839][model8_pretrain.py][INFO] Epoch:[0/2](710400/4588595) loss:2.561 lr:0.0000100 epoch_Time:24671.0min: [2024-01-05 20:59:04,839][model8_pretrain.py][INFO] Epoch:[0/2](710400/4588595) loss:3.203 lr:0.0000100 epoch_Time:24671.0min: [2024-01-05 20:59:04,840][model8_pretrain.py][INFO] Epoch:[0/2](710400/4588595) loss:2.817 lr:0.0000100 epoch_Time:24671.0min: [2024-01-05 20:59:04,840][model8_pretrain.py][INFO] Epoch:[0/2](710400/4588595) loss:2.816 lr:0.0000100 epoch_Time:24671.0min: [2024-01-05 20:59:41,771][model8_pretrain.py][INFO] Epoch:[0/2](710500/4588595) loss:2.444 lr:0.0000100 epoch_Time:24671.0min: [2024-01-05 20:59:41,771][model8_pretrain.py][INFO] Epoch:[0/2](710500/4588595) loss:2.927 lr:0.0000100 epoch_Time:24671.0min: [2024-01-05 20:59:41,771][model8_pretrain.py][INFO] Epoch:[0/2](710500/4588595) loss:2.232 lr:0.0000100 epoch_Time:24671.0min: [2024-01-05 20:59:41,771][model8_pretrain.py][INFO] Epoch:[0/2](710500/4588595) loss:2.475 lr:0.0000100 epoch_Time:24671.0min: [2024-01-05 20:59:41,771][model8_pretrain.py][INFO] Epoch:[0/2](710500/4588595) loss:2.529 lr:0.0000100 epoch_Time:24671.0min: [2024-01-05 20:59:41,771][model8_pretrain.py][INFO] Epoch:[0/2](710500/4588595) loss:2.630 lr:0.0000100 epoch_Time:24671.0min: [2024-01-05 20:59:41,771][model8_pretrain.py][INFO] Epoch:[0/2](710500/4588595) loss:3.041 lr:0.0000100 epoch_Time:24671.0min: [2024-01-05 20:59:41,771][model8_pretrain.py][INFO] Epoch:[0/2](710500/4588595) loss:2.807 lr:0.0000100 epoch_Time:24671.0min: [2024-01-05 21:00:18,698][model8_pretrain.py][INFO] Epoch:[0/2](710600/4588595) loss:2.980 lr:0.0000100 epoch_Time:24670.0min: [2024-01-05 
21:00:18,698][model8_pretrain.py][INFO] Epoch:[0/2](710600/4588595) loss:2.300 lr:0.0000100 epoch_Time:24670.0min: [2024-01-05 21:00:18,698][model8_pretrain.py][INFO] Epoch:[0/2](710600/4588595) loss:2.742 lr:0.0000100 epoch_Time:24670.0min: [2024-01-05 21:00:18,698][model8_pretrain.py][INFO] Epoch:[0/2](710600/4588595) loss:2.482 lr:0.0000100 epoch_Time:24670.0min: [2024-01-05 21:00:18,698][model8_pretrain.py][INFO] Epoch:[0/2](710600/4588595) loss:3.583 lr:0.0000100 epoch_Time:24670.0min: [2024-01-05 21:00:18,698][model8_pretrain.py][INFO] Epoch:[0/2](710600/4588595) loss:2.228 lr:0.0000100 epoch_Time:24670.0min: [2024-01-05 21:00:18,699][model8_pretrain.py][INFO] Epoch:[0/2](710600/4588595) loss:2.820 lr:0.0000100 epoch_Time:24670.0min: [2024-01-05 21:00:18,699][model8_pretrain.py][INFO] Epoch:[0/2](710600/4588595) loss:3.006 lr:0.0000100 epoch_Time:24670.0min: [2024-01-05 21:00:55,625][model8_pretrain.py][INFO] Epoch:[0/2](710700/4588595) loss:2.219 lr:0.0000100 epoch_Time:24669.0min: [2024-01-05 21:00:55,626][model8_pretrain.py][INFO] Epoch:[0/2](710700/4588595) loss:3.014 lr:0.0000100 epoch_Time:24669.0min: [2024-01-05 21:00:55,626][model8_pretrain.py][INFO] Epoch:[0/2](710700/4588595) loss:3.116 lr:0.0000100 epoch_Time:24669.0min: [2024-01-05 21:00:55,626][model8_pretrain.py][INFO] Epoch:[0/2](710700/4588595) loss:3.034 lr:0.0000100 epoch_Time:24669.0min: [2024-01-05 21:00:55,626][model8_pretrain.py][INFO] Epoch:[0/2](710700/4588595) loss:2.627 lr:0.0000100 epoch_Time:24669.0min: [2024-01-05 21:00:55,626][model8_pretrain.py][INFO] Epoch:[0/2](710700/4588595) loss:3.152 lr:0.0000100 epoch_Time:24669.0min: [2024-01-05 21:00:55,626][model8_pretrain.py][INFO] Epoch:[0/2](710700/4588595) loss:2.815 lr:0.0000100 epoch_Time:24669.0min: [2024-01-05 21:00:55,626][model8_pretrain.py][INFO] Epoch:[0/2](710700/4588595) loss:2.696 lr:0.0000100 epoch_Time:24669.0min: [2024-01-05 21:01:32,548][model8_pretrain.py][INFO] Epoch:[0/2](710800/4588595) loss:3.368 lr:0.0000100 
epoch_Time:24669.0min: [2024-01-05 21:01:32,548][model8_pretrain.py][INFO] Epoch:[0/2](710800/4588595) loss:3.082 lr:0.0000100 epoch_Time:24669.0min: [2024-01-05 21:01:32,548][model8_pretrain.py][INFO] Epoch:[0/2](710800/4588595) loss:2.766 lr:0.0000100 epoch_Time:24669.0min: [2024-01-05 21:01:32,548][model8_pretrain.py][INFO] Epoch:[0/2](710800/4588595) loss:2.585 lr:0.0000100 epoch_Time:24669.0min: [2024-01-05 21:01:32,548][model8_pretrain.py][INFO] Epoch:[0/2](710800/4588595) loss:3.099 lr:0.0000100 epoch_Time:24669.0min: [2024-01-05 21:01:32,548][model8_pretrain.py][INFO] Epoch:[0/2](710800/4588595) loss:2.838 lr:0.0000100 epoch_Time:24669.0min: [2024-01-05 21:01:32,548][model8_pretrain.py][INFO] Epoch:[0/2](710800/4588595) loss:2.033 lr:0.0000100 epoch_Time:24669.0min: [2024-01-05 21:01:32,548][model8_pretrain.py][INFO] Epoch:[0/2](710800/4588595) loss:2.848 lr:0.0000100 epoch_Time:24669.0min: [2024-01-05 21:02:09,493][model8_pretrain.py][INFO] Epoch:[0/2](710900/4588595) loss:3.422 lr:0.0000100 epoch_Time:24668.0min: [2024-01-05 21:02:09,493][model8_pretrain.py][INFO] Epoch:[0/2](710900/4588595) loss:2.440 lr:0.0000100 epoch_Time:24668.0min: [2024-01-05 21:02:09,493][model8_pretrain.py][INFO] Epoch:[0/2](710900/4588595) loss:2.296 lr:0.0000100 epoch_Time:24668.0min: [2024-01-05 21:02:09,493][model8_pretrain.py][INFO] Epoch:[0/2](710900/4588595) loss:2.968 lr:0.0000100 epoch_Time:24668.0min: [2024-01-05 21:02:09,493][model8_pretrain.py][INFO] Epoch:[0/2](710900/4588595) loss:2.797 lr:0.0000100 epoch_Time:24668.0min: [2024-01-05 21:02:09,493][model8_pretrain.py][INFO] Epoch:[0/2](710900/4588595) loss:2.733 lr:0.0000100 epoch_Time:24668.0min: [2024-01-05 21:02:09,493][model8_pretrain.py][INFO] Epoch:[0/2](710900/4588595) loss:2.894 lr:0.0000100 epoch_Time:24668.0min: [2024-01-05 21:02:09,493][model8_pretrain.py][INFO] Epoch:[0/2](710900/4588595) loss:3.220 lr:0.0000100 epoch_Time:24668.0min: [2024-01-05 21:02:46,416][model8_pretrain.py][INFO] 
Epoch:[0/2](711000/4588595) loss:3.188 lr:0.0000100 epoch_Time:24668.0min: [2024-01-05 21:02:46,416][model8_pretrain.py][INFO] Epoch:[0/2](711000/4588595) loss:2.852 lr:0.0000100 epoch_Time:24668.0min: [2024-01-05 21:02:46,416][model8_pretrain.py][INFO] Epoch:[0/2](711000/4588595) loss:2.423 lr:0.0000100 epoch_Time:24668.0min: [2024-01-05 21:02:46,416][model8_pretrain.py][INFO] Epoch:[0/2](711000/4588595) loss:2.775 lr:0.0000100 epoch_Time:24668.0min: [2024-01-05 21:02:46,416][model8_pretrain.py][INFO] Epoch:[0/2](711000/4588595) loss:3.383 lr:0.0000100 epoch_Time:24668.0min: [2024-01-05 21:02:46,416][model8_pretrain.py][INFO] Epoch:[0/2](711000/4588595) loss:2.516 lr:0.0000100 epoch_Time:24668.0min: [2024-01-05 21:02:46,416][model8_pretrain.py][INFO] Epoch:[0/2](711000/4588595) loss:2.597 lr:0.0000100 epoch_Time:24668.0min: [2024-01-05 21:02:46,416][model8_pretrain.py][INFO] Epoch:[0/2](711000/4588595) loss:3.111 lr:0.0000100 epoch_Time:24668.0min: [2024-01-05 21:03:33,545][model8_pretrain.py][INFO] Epoch:[0/2](711100/4588595) loss:2.949 lr:0.0000100 epoch_Time:24667.0min: [2024-01-05 21:03:33,545][model8_pretrain.py][INFO] Epoch:[0/2](711100/4588595) loss:2.231 lr:0.0000100 epoch_Time:24667.0min: [2024-01-05 21:03:33,545][model8_pretrain.py][INFO] Epoch:[0/2](711100/4588595) loss:2.753 lr:0.0000100 epoch_Time:24667.0min: [2024-01-05 21:03:33,545][model8_pretrain.py][INFO] Epoch:[0/2](711100/4588595) loss:2.215 lr:0.0000100 epoch_Time:24667.0min: [2024-01-05 21:03:33,545][model8_pretrain.py][INFO] Epoch:[0/2](711100/4588595) loss:2.643 lr:0.0000100 epoch_Time:24667.0min: [2024-01-05 21:03:33,545][model8_pretrain.py][INFO] Epoch:[0/2](711100/4588595) loss:2.368 lr:0.0000100 epoch_Time:24667.0min: [2024-01-05 21:03:33,545][model8_pretrain.py][INFO] Epoch:[0/2](711100/4588595) loss:2.905 lr:0.0000100 epoch_Time:24667.0min: [2024-01-05 21:03:33,545][model8_pretrain.py][INFO] Epoch:[0/2](711100/4588595) loss:2.581 lr:0.0000100 epoch_Time:24667.0min: [2024-01-05 
21:04:12,185][model8_pretrain.py][INFO] Epoch:[0/2](711200/4588595) loss:2.933 lr:0.0000100 epoch_Time:24667.0min: [2024-01-05 21:04:12,185][model8_pretrain.py][INFO] Epoch:[0/2](711200/4588595) loss:2.842 lr:0.0000100 epoch_Time:24667.0min: [2024-01-05 21:04:12,185][model8_pretrain.py][INFO] Epoch:[0/2](711200/4588595) loss:1.880 lr:0.0000100 epoch_Time:24667.0min: [2024-01-05 21:04:12,185][model8_pretrain.py][INFO] Epoch:[0/2](711200/4588595) loss:2.866 lr:0.0000100 epoch_Time:24667.0min: [2024-01-05 21:04:12,185][model8_pretrain.py][INFO] Epoch:[0/2](711200/4588595) loss:2.937 lr:0.0000100 epoch_Time:24667.0min: [2024-01-05 21:04:12,185][model8_pretrain.py][INFO] Epoch:[0/2](711200/4588595) loss:3.056 lr:0.0000100 epoch_Time:24667.0min: [2024-01-05 21:04:12,185][model8_pretrain.py][INFO] Epoch:[0/2](711200/4588595) loss:3.278 lr:0.0000100 epoch_Time:24667.0min: [2024-01-05 21:04:12,185][model8_pretrain.py][INFO] Epoch:[0/2](711200/4588595) loss:2.999 lr:0.0000100 epoch_Time:24667.0min: [2024-01-05 21:04:49,110][model8_pretrain.py][INFO] Epoch:[0/2](711300/4588595) loss:1.998 lr:0.0000100 epoch_Time:24665.0min: [2024-01-05 21:04:49,110][model8_pretrain.py][INFO] Epoch:[0/2](711300/4588595) loss:2.855 lr:0.0000100 epoch_Time:24665.0min: [2024-01-05 21:04:49,110][model8_pretrain.py][INFO] Epoch:[0/2](711300/4588595) loss:3.278 lr:0.0000100 epoch_Time:24665.0min: [2024-01-05 21:04:49,110][model8_pretrain.py][INFO] Epoch:[0/2](711300/4588595) loss:3.433 lr:0.0000100 epoch_Time:24665.0min: [2024-01-05 21:04:49,110][model8_pretrain.py][INFO] Epoch:[0/2](711300/4588595) loss:2.262 lr:0.0000100 epoch_Time:24665.0min: [2024-01-05 21:04:49,110][model8_pretrain.py][INFO] Epoch:[0/2](711300/4588595) loss:2.890 lr:0.0000100 epoch_Time:24665.0min: [2024-01-05 21:04:49,110][model8_pretrain.py][INFO] Epoch:[0/2](711300/4588595) loss:2.542 lr:0.0000100 epoch_Time:24665.0min: [2024-01-05 21:04:49,110][model8_pretrain.py][INFO] Epoch:[0/2](711300/4588595) loss:2.641 lr:0.0000100 
epoch_Time:24665.0min: [2024-01-05 21:05:26,035][model8_pretrain.py][INFO] Epoch:[0/2](711400/4588595) loss:2.411 lr:0.0000100 epoch_Time:24665.0min: [2024-01-05 21:05:26,035][model8_pretrain.py][INFO] Epoch:[0/2](711400/4588595) loss:3.407 lr:0.0000100 epoch_Time:24665.0min: [2024-01-05 21:05:26,035][model8_pretrain.py][INFO] Epoch:[0/2](711400/4588595) loss:2.678 lr:0.0000100 epoch_Time:24665.0min: [2024-01-05 21:05:26,035][model8_pretrain.py][INFO] Epoch:[0/2](711400/4588595) loss:2.442 lr:0.0000100 epoch_Time:24665.0min: [2024-01-05 21:05:26,035][model8_pretrain.py][INFO] Epoch:[0/2](711400/4588595) loss:2.796 lr:0.0000100 epoch_Time:24665.0min: [2024-01-05 21:05:26,035][model8_pretrain.py][INFO] Epoch:[0/2](711400/4588595) loss:3.016 lr:0.0000100 epoch_Time:24665.0min: [2024-01-05 21:05:26,035][model8_pretrain.py][INFO] Epoch:[0/2](711400/4588595) loss:2.509 lr:0.0000100 epoch_Time:24665.0min: [2024-01-05 21:05:26,035][model8_pretrain.py][INFO] Epoch:[0/2](711400/4588595) loss:2.884 lr:0.0000100 epoch_Time:24665.0min: [2024-01-05 21:06:02,968][model8_pretrain.py][INFO] Epoch:[0/2](711500/4588595) loss:2.713 lr:0.0000100 epoch_Time:24664.0min: [2024-01-05 21:06:02,968][model8_pretrain.py][INFO] Epoch:[0/2](711500/4588595) loss:3.017 lr:0.0000100 epoch_Time:24664.0min: [2024-01-05 21:06:02,968][model8_pretrain.py][INFO] Epoch:[0/2](711500/4588595) loss:2.405 lr:0.0000100 epoch_Time:24664.0min: [2024-01-05 21:06:02,968][model8_pretrain.py][INFO] Epoch:[0/2](711500/4588595) loss:2.928 lr:0.0000100 epoch_Time:24664.0min: [2024-01-05 21:06:02,968][model8_pretrain.py][INFO] Epoch:[0/2](711500/4588595) loss:3.010 lr:0.0000100 epoch_Time:24664.0min: [2024-01-05 21:06:02,968][model8_pretrain.py][INFO] Epoch:[0/2](711500/4588595) loss:3.007 lr:0.0000100 epoch_Time:24664.0min: [2024-01-05 21:06:02,968][model8_pretrain.py][INFO] Epoch:[0/2](711500/4588595) loss:3.175 lr:0.0000100 epoch_Time:24664.0min: [2024-01-05 21:06:02,968][model8_pretrain.py][INFO] 
Epoch:[0/2](711500/4588595) loss:2.314 lr:0.0000100 epoch_Time:24664.0min: [2024-01-05 21:06:39,902][model8_pretrain.py][INFO] Epoch:[0/2](711600/4588595) loss:2.801 lr:0.0000100 epoch_Time:24664.0min: [2024-01-05 21:06:39,903][model8_pretrain.py][INFO] Epoch:[0/2](711600/4588595) loss:2.886 lr:0.0000100 epoch_Time:24664.0min: [2024-01-05 21:06:39,903][model8_pretrain.py][INFO] Epoch:[0/2](711600/4588595) loss:2.930 lr:0.0000100 epoch_Time:24664.0min: [2024-01-05 21:06:39,903][model8_pretrain.py][INFO] Epoch:[0/2](711600/4588595) loss:2.735 lr:0.0000100 epoch_Time:24664.0min: [2024-01-05 21:06:39,903][model8_pretrain.py][INFO] Epoch:[0/2](711600/4588595) loss:3.137 lr:0.0000100 epoch_Time:24664.0min: [2024-01-05 21:06:39,903][model8_pretrain.py][INFO] Epoch:[0/2](711600/4588595) loss:2.950 lr:0.0000100 epoch_Time:24664.0min: [2024-01-05 21:06:39,903][model8_pretrain.py][INFO] Epoch:[0/2](711600/4588595) loss:2.801 lr:0.0000100 epoch_Time:24664.0min: [2024-01-05 21:06:39,903][model8_pretrain.py][INFO] Epoch:[0/2](711600/4588595) loss:2.926 lr:0.0000100 epoch_Time:24664.0min: [2024-01-05 21:07:16,841][model8_pretrain.py][INFO] Epoch:[0/2](711700/4588595) loss:3.007 lr:0.0000100 epoch_Time:24663.0min: [2024-01-05 21:07:16,841][model8_pretrain.py][INFO] Epoch:[0/2](711700/4588595) loss:3.045 lr:0.0000100 epoch_Time:24663.0min: [2024-01-05 21:07:16,841][model8_pretrain.py][INFO] Epoch:[0/2](711700/4588595) loss:3.105 lr:0.0000100 epoch_Time:24663.0min: [2024-01-05 21:07:16,841][model8_pretrain.py][INFO] Epoch:[0/2](711700/4588595) loss:2.609 lr:0.0000100 epoch_Time:24663.0min: [2024-01-05 21:07:16,841][model8_pretrain.py][INFO] Epoch:[0/2](711700/4588595) loss:2.952 lr:0.0000100 epoch_Time:24663.0min: [2024-01-05 21:07:16,841][model8_pretrain.py][INFO] Epoch:[0/2](711700/4588595) loss:3.422 lr:0.0000100 epoch_Time:24663.0min: [2024-01-05 21:07:16,841][model8_pretrain.py][INFO] Epoch:[0/2](711700/4588595) loss:2.669 lr:0.0000100 epoch_Time:24663.0min: [2024-01-05 
21:07:16,842][model8_pretrain.py][INFO] Epoch:[0/2](711700/4588595) loss:3.281 lr:0.0000100 epoch_Time:24663.0min: [2024-01-05 21:07:53,794][model8_pretrain.py][INFO] Epoch:[0/2](711800/4588595) loss:3.036 lr:0.0000100 epoch_Time:24662.0min: [2024-01-05 21:07:53,794][model8_pretrain.py][INFO] Epoch:[0/2](711800/4588595) loss:2.665 lr:0.0000100 epoch_Time:24662.0min: [2024-01-05 21:07:53,794][model8_pretrain.py][INFO] Epoch:[0/2](711800/4588595) loss:3.126 lr:0.0000100 epoch_Time:24662.0min: [2024-01-05 21:07:53,794][model8_pretrain.py][INFO] Epoch:[0/2](711800/4588595) loss:3.006 lr:0.0000100 epoch_Time:24662.0min: [2024-01-05 21:07:53,794][model8_pretrain.py][INFO] Epoch:[0/2](711800/4588595) loss:2.967 lr:0.0000100 epoch_Time:24662.0min: [2024-01-05 21:07:53,794][model8_pretrain.py][INFO] Epoch:[0/2](711800/4588595) loss:3.123 lr:0.0000100 epoch_Time:24662.0min: [2024-01-05 21:07:53,794][model8_pretrain.py][INFO] Epoch:[0/2](711800/4588595) loss:2.030 lr:0.0000100 epoch_Time:24662.0min: [2024-01-05 21:07:53,795][model8_pretrain.py][INFO] Epoch:[0/2](711800/4588595) loss:3.074 lr:0.0000100 epoch_Time:24662.0min: [2024-01-05 21:08:40,821][model8_pretrain.py][INFO] Epoch:[0/2](711900/4588595) loss:2.626 lr:0.0000100 epoch_Time:24663.0min: [2024-01-05 21:08:40,821][model8_pretrain.py][INFO] Epoch:[0/2](711900/4588595) loss:3.196 lr:0.0000100 epoch_Time:24663.0min: [2024-01-05 21:08:40,821][model8_pretrain.py][INFO] Epoch:[0/2](711900/4588595) loss:3.247 lr:0.0000100 epoch_Time:24663.0min: [2024-01-05 21:08:40,821][model8_pretrain.py][INFO] Epoch:[0/2](711900/4588595) loss:2.800 lr:0.0000100 epoch_Time:24663.0min: [2024-01-05 21:08:40,821][model8_pretrain.py][INFO] Epoch:[0/2](711900/4588595) loss:3.331 lr:0.0000100 epoch_Time:24663.0min: [2024-01-05 21:08:40,821][model8_pretrain.py][INFO] Epoch:[0/2](711900/4588595) loss:3.015 lr:0.0000100 epoch_Time:24663.0min: [2024-01-05 21:08:40,821][model8_pretrain.py][INFO] Epoch:[0/2](711900/4588595) loss:3.043 lr:0.0000100 
epoch_Time:24663.0min: [2024-01-05 21:08:40,821][model8_pretrain.py][INFO] Epoch:[0/2](711900/4588595) loss:3.165 lr:0.0000100 epoch_Time:24663.0min: [2024-01-05 21:09:19,423][model8_pretrain.py][INFO] Epoch:[0/2](712000/4588595) loss:2.326 lr:0.0000100 epoch_Time:24662.0min: [2024-01-05 21:09:19,423][model8_pretrain.py][INFO] Epoch:[0/2](712000/4588595) loss:3.229 lr:0.0000100 epoch_Time:24662.0min: [2024-01-05 21:09:19,423][model8_pretrain.py][INFO] Epoch:[0/2](712000/4588595) loss:3.295 lr:0.0000100 epoch_Time:24662.0min: [2024-01-05 21:09:19,423][model8_pretrain.py][INFO] Epoch:[0/2](712000/4588595) loss:2.440 lr:0.0000100 epoch_Time:24662.0min: [2024-01-05 21:09:19,423][model8_pretrain.py][INFO] Epoch:[0/2](712000/4588595) loss:3.351 lr:0.0000100 epoch_Time:24662.0min: [2024-01-05 21:09:19,424][model8_pretrain.py][INFO] Epoch:[0/2](712000/4588595) loss:3.158 lr:0.0000100 epoch_Time:24662.0min: [2024-01-05 21:09:19,424][model8_pretrain.py][INFO] Epoch:[0/2](712000/4588595) loss:2.387 lr:0.0000100 epoch_Time:24662.0min: [2024-01-05 21:09:19,424][model8_pretrain.py][INFO] Epoch:[0/2](712000/4588595) loss:2.758 lr:0.0000100 epoch_Time:24662.0min: [2024-01-05 21:09:56,350][model8_pretrain.py][INFO] Epoch:[0/2](712100/4588595) loss:3.074 lr:0.0000100 epoch_Time:24661.0min: [2024-01-05 21:09:56,350][model8_pretrain.py][INFO] Epoch:[0/2](712100/4588595) loss:3.093 lr:0.0000100 epoch_Time:24661.0min: [2024-01-05 21:09:56,350][model8_pretrain.py][INFO] Epoch:[0/2](712100/4588595) loss:2.505 lr:0.0000100 epoch_Time:24661.0min: [2024-01-05 21:09:56,350][model8_pretrain.py][INFO] Epoch:[0/2](712100/4588595) loss:3.013 lr:0.0000100 epoch_Time:24661.0min: [2024-01-05 21:09:56,350][model8_pretrain.py][INFO] Epoch:[0/2](712100/4588595) loss:2.781 lr:0.0000100 epoch_Time:24661.0min: [2024-01-05 21:09:56,350][model8_pretrain.py][INFO] Epoch:[0/2](712100/4588595) loss:3.071 lr:0.0000100 epoch_Time:24661.0min: [2024-01-05 21:09:56,350][model8_pretrain.py][INFO] 
Epoch:[0/2](712100/4588595) loss:3.053 lr:0.0000100 epoch_Time:24661.0min: [2024-01-05 21:09:56,350][model8_pretrain.py][INFO] Epoch:[0/2](712100/4588595) loss:2.857 lr:0.0000100 epoch_Time:24661.0min: [2024-01-05 21:10:33,291][model8_pretrain.py][INFO] Epoch:[0/2](712200/4588595) loss:2.961 lr:0.0000100 epoch_Time:24660.0min: [2024-01-05 21:10:33,291][model8_pretrain.py][INFO] Epoch:[0/2](712200/4588595) loss:2.157 lr:0.0000100 epoch_Time:24660.0min: [2024-01-05 21:10:33,291][model8_pretrain.py][INFO] Epoch:[0/2](712200/4588595) loss:2.938 lr:0.0000100 epoch_Time:24660.0min: [2024-01-05 21:10:33,291][model8_pretrain.py][INFO] Epoch:[0/2](712200/4588595) loss:3.120 lr:0.0000100 epoch_Time:24660.0min: [2024-01-05 21:10:33,291][model8_pretrain.py][INFO] Epoch:[0/2](712200/4588595) loss:2.837 lr:0.0000100 epoch_Time:24660.0min: [2024-01-05 21:10:33,291][model8_pretrain.py][INFO] Epoch:[0/2](712200/4588595) loss:2.570 lr:0.0000100 epoch_Time:24660.0min: [2024-01-05 21:10:33,291][model8_pretrain.py][INFO] Epoch:[0/2](712200/4588595) loss:2.725 lr:0.0000100 epoch_Time:24660.0min: [2024-01-05 21:10:33,292][model8_pretrain.py][INFO] Epoch:[0/2](712200/4588595) loss:2.968 lr:0.0000100 epoch_Time:24660.0min: [2024-01-05 21:11:10,226][model8_pretrain.py][INFO] Epoch:[0/2](712300/4588595) loss:2.783 lr:0.0000100 epoch_Time:24659.0min: [2024-01-05 21:11:10,226][model8_pretrain.py][INFO] Epoch:[0/2](712300/4588595) loss:2.926 lr:0.0000100 epoch_Time:24659.0min: [2024-01-05 21:11:10,226][model8_pretrain.py][INFO] Epoch:[0/2](712300/4588595) loss:2.814 lr:0.0000100 epoch_Time:24659.0min: [2024-01-05 21:11:10,226][model8_pretrain.py][INFO] Epoch:[0/2](712300/4588595) loss:2.447 lr:0.0000100 epoch_Time:24659.0min: [2024-01-05 21:11:10,226][model8_pretrain.py][INFO] Epoch:[0/2](712300/4588595) loss:3.201 lr:0.0000100 epoch_Time:24659.0min: [2024-01-05 21:11:10,226][model8_pretrain.py][INFO] Epoch:[0/2](712300/4588595) loss:2.707 lr:0.0000100 epoch_Time:24659.0min: [2024-01-05 
21:11:10,226][model8_pretrain.py][INFO] Epoch:[0/2](712300/4588595) loss:2.205 lr:0.0000100 epoch_Time:24659.0min: [2024-01-05 21:11:10,227][model8_pretrain.py][INFO] Epoch:[0/2](712300/4588595) loss:2.481 lr:0.0000100 epoch_Time:24659.0min: [2024-01-05 21:11:47,158][model8_pretrain.py][INFO] Epoch:[0/2](712400/4588595) loss:2.606 lr:0.0000100 epoch_Time:24659.0min: [2024-01-05 21:11:47,158][model8_pretrain.py][INFO] Epoch:[0/2](712400/4588595) loss:2.883 lr:0.0000100 epoch_Time:24659.0min: [2024-01-05 21:11:47,158][model8_pretrain.py][INFO] Epoch:[0/2](712400/4588595) loss:2.730 lr:0.0000100 epoch_Time:24659.0min: [2024-01-05 21:11:47,158][model8_pretrain.py][INFO] Epoch:[0/2](712400/4588595) loss:2.678 lr:0.0000100 epoch_Time:24659.0min: [2024-01-05 21:11:47,158][model8_pretrain.py][INFO] Epoch:[0/2](712400/4588595) loss:2.319 lr:0.0000100 epoch_Time:24659.0min: [2024-01-05 21:11:47,158][model8_pretrain.py][INFO] Epoch:[0/2](712400/4588595) loss:2.244 lr:0.0000100 epoch_Time:24659.0min: [2024-01-05 21:11:47,158][model8_pretrain.py][INFO] Epoch:[0/2](712400/4588595) loss:3.128 lr:0.0000100 epoch_Time:24659.0min: [2024-01-05 21:11:47,158][model8_pretrain.py][INFO] Epoch:[0/2](712400/4588595) loss:2.408 lr:0.0000100 epoch_Time:24659.0min: [2024-01-05 21:12:24,096][model8_pretrain.py][INFO] Epoch:[0/2](712500/4588595) loss:2.261 lr:0.0000100 epoch_Time:24658.0min: [2024-01-05 21:12:24,096][model8_pretrain.py][INFO] Epoch:[0/2](712500/4588595) loss:3.185 lr:0.0000100 epoch_Time:24658.0min: [2024-01-05 21:12:24,096][model8_pretrain.py][INFO] Epoch:[0/2](712500/4588595) loss:3.120 lr:0.0000100 epoch_Time:24658.0min: [2024-01-05 21:12:24,096][model8_pretrain.py][INFO] Epoch:[0/2](712500/4588595) loss:2.784 lr:0.0000100 epoch_Time:24658.0min: [2024-01-05 21:12:24,096][model8_pretrain.py][INFO] Epoch:[0/2](712500/4588595) loss:2.707 lr:0.0000100 epoch_Time:24658.0min: [2024-01-05 21:12:24,096][model8_pretrain.py][INFO] Epoch:[0/2](712500/4588595) loss:2.926 lr:0.0000100 
epoch_Time:24658.0min: [2024-01-05 21:12:24,097][model8_pretrain.py][INFO] Epoch:[0/2](712500/4588595) loss:2.822 lr:0.0000100 epoch_Time:24658.0min: [2024-01-05 21:12:24,097][model8_pretrain.py][INFO] Epoch:[0/2](712500/4588595) loss:2.648 lr:0.0000100 epoch_Time:24658.0min: [2024-01-05 21:13:01,038][model8_pretrain.py][INFO] Epoch:[0/2](712600/4588595) loss:2.639 lr:0.0000100 epoch_Time:24657.0min: [2024-01-05 21:13:01,038][model8_pretrain.py][INFO] Epoch:[0/2](712600/4588595) loss:2.310 lr:0.0000100 epoch_Time:24657.0min: [2024-01-05 21:13:01,038][model8_pretrain.py][INFO] Epoch:[0/2](712600/4588595) loss:2.347 lr:0.0000100 epoch_Time:24657.0min: [2024-01-05 21:13:01,038][model8_pretrain.py][INFO] Epoch:[0/2](712600/4588595) loss:3.129 lr:0.0000100 epoch_Time:24657.0min: [2024-01-05 21:13:01,038][model8_pretrain.py][INFO] Epoch:[0/2](712600/4588595) loss:2.884 lr:0.0000100 epoch_Time:24657.0min: [2024-01-05 21:13:01,038][model8_pretrain.py][INFO] Epoch:[0/2](712600/4588595) loss:3.098 lr:0.0000100 epoch_Time:24657.0min: [2024-01-05 21:13:01,038][model8_pretrain.py][INFO] Epoch:[0/2](712600/4588595) loss:2.815 lr:0.0000100 epoch_Time:24657.0min: [2024-01-05 21:13:01,038][model8_pretrain.py][INFO] Epoch:[0/2](712600/4588595) loss:2.811 lr:0.0000100 epoch_Time:24657.0min: [2024-01-05 21:13:46,212][model8_pretrain.py][INFO] Epoch:[0/2](712700/4588595) loss:2.791 lr:0.0000100 epoch_Time:24658.0min: [2024-01-05 21:13:46,212][model8_pretrain.py][INFO] Epoch:[0/2](712700/4588595) loss:2.868 lr:0.0000100 epoch_Time:24658.0min: [2024-01-05 21:13:46,212][model8_pretrain.py][INFO] Epoch:[0/2](712700/4588595) loss:2.370 lr:0.0000100 epoch_Time:24658.0min: [2024-01-05 21:13:46,212][model8_pretrain.py][INFO] Epoch:[0/2](712700/4588595) loss:2.321 lr:0.0000100 epoch_Time:24658.0min: [2024-01-05 21:13:46,212][model8_pretrain.py][INFO] Epoch:[0/2](712700/4588595) loss:2.913 lr:0.0000100 epoch_Time:24658.0min: [2024-01-05 21:13:46,212][model8_pretrain.py][INFO] 
Epoch:[0/2](712700/4588595) loss:3.226 lr:0.0000100 epoch_Time:24658.0min: [2024-01-05 21:13:46,212][model8_pretrain.py][INFO] Epoch:[0/2](712700/4588595) loss:2.627 lr:0.0000100 epoch_Time:24658.0min: [2024-01-05 21:13:46,212][model8_pretrain.py][INFO] Epoch:[0/2](712700/4588595) loss:3.073 lr:0.0000100 epoch_Time:24658.0min: [2024-01-05 21:14:26,646][model8_pretrain.py][INFO] Epoch:[0/2](712800/4588595) loss:3.020 lr:0.0000100 epoch_Time:24657.0min: [2024-01-05 21:14:26,646][model8_pretrain.py][INFO] Epoch:[0/2](712800/4588595) loss:2.351 lr:0.0000100 epoch_Time:24657.0min: [2024-01-05 21:14:26,646][model8_pretrain.py][INFO] Epoch:[0/2](712800/4588595) loss:2.946 lr:0.0000100 epoch_Time:24657.0min: [2024-01-05 21:14:26,646][model8_pretrain.py][INFO] Epoch:[0/2](712800/4588595) loss:2.782 lr:0.0000100 epoch_Time:24657.0min: [2024-01-05 21:14:26,646][model8_pretrain.py][INFO] Epoch:[0/2](712800/4588595) loss:3.104 lr:0.0000100 epoch_Time:24657.0min: [2024-01-05 21:14:26,646][model8_pretrain.py][INFO] Epoch:[0/2](712800/4588595) loss:3.243 lr:0.0000100 epoch_Time:24657.0min: [2024-01-05 21:14:26,646][model8_pretrain.py][INFO] Epoch:[0/2](712800/4588595) loss:2.923 lr:0.0000100 epoch_Time:24657.0min: [2024-01-05 21:14:26,646][model8_pretrain.py][INFO] Epoch:[0/2](712800/4588595) loss:2.775 lr:0.0000100 epoch_Time:24657.0min: [2024-01-05 21:15:03,593][model8_pretrain.py][INFO] Epoch:[0/2](712900/4588595) loss:2.889 lr:0.0000100 epoch_Time:24656.0min: [2024-01-05 21:15:03,593][model8_pretrain.py][INFO] Epoch:[0/2](712900/4588595) loss:2.854 lr:0.0000100 epoch_Time:24656.0min: [2024-01-05 21:15:03,593][model8_pretrain.py][INFO] Epoch:[0/2](712900/4588595) loss:2.473 lr:0.0000100 epoch_Time:24656.0min: [2024-01-05 21:15:03,593][model8_pretrain.py][INFO] Epoch:[0/2](712900/4588595) loss:2.916 lr:0.0000100 epoch_Time:24656.0min: [2024-01-05 21:15:03,593][model8_pretrain.py][INFO] Epoch:[0/2](712900/4588595) loss:2.939 lr:0.0000100 epoch_Time:24656.0min: [2024-01-05 
21:15:03,593][model8_pretrain.py][INFO] Epoch:[0/2](712900/4588595) loss:2.625 lr:0.0000100 epoch_Time:24656.0min: [2024-01-05 21:15:03,593][model8_pretrain.py][INFO] Epoch:[0/2](712900/4588595) loss:2.705 lr:0.0000100 epoch_Time:24656.0min: [2024-01-05 21:15:03,593][model8_pretrain.py][INFO] Epoch:[0/2](712900/4588595) loss:3.069 lr:0.0000100 epoch_Time:24656.0min: [2024-01-05 21:15:40,528][model8_pretrain.py][INFO] Epoch:[0/2](713000/4588595) loss:3.076 lr:0.0000100 epoch_Time:24656.0min: [2024-01-05 21:15:40,528][model8_pretrain.py][INFO] Epoch:[0/2](713000/4588595) loss:2.832 lr:0.0000100 epoch_Time:24656.0min: [2024-01-05 21:15:40,528][model8_pretrain.py][INFO] Epoch:[0/2](713000/4588595) loss:2.725 lr:0.0000100 epoch_Time:24656.0min: [2024-01-05 21:15:40,528][model8_pretrain.py][INFO] Epoch:[0/2](713000/4588595) loss:3.082 lr:0.0000100 epoch_Time:24656.0min: [2024-01-05 21:15:40,528][model8_pretrain.py][INFO] Epoch:[0/2](713000/4588595) loss:2.804 lr:0.0000100 epoch_Time:24656.0min: [2024-01-05 21:15:40,528][model8_pretrain.py][INFO] Epoch:[0/2](713000/4588595) loss:2.967 lr:0.0000100 epoch_Time:24656.0min: [2024-01-05 21:15:40,528][model8_pretrain.py][INFO] Epoch:[0/2](713000/4588595) loss:3.175 lr:0.0000100 epoch_Time:24656.0min: [2024-01-05 21:15:40,528][model8_pretrain.py][INFO] Epoch:[0/2](713000/4588595) loss:3.171 lr:0.0000100 epoch_Time:24656.0min: [2024-01-05 21:16:17,476][model8_pretrain.py][INFO] Epoch:[0/2](713100/4588595) loss:2.746 lr:0.0000100 epoch_Time:24655.0min: [2024-01-05 21:16:17,476][model8_pretrain.py][INFO] Epoch:[0/2](713100/4588595) loss:3.277 lr:0.0000100 epoch_Time:24655.0min: [2024-01-05 21:16:17,476][model8_pretrain.py][INFO] Epoch:[0/2](713100/4588595) loss:3.287 lr:0.0000100 epoch_Time:24655.0min: [2024-01-05 21:16:17,476][model8_pretrain.py][INFO] Epoch:[0/2](713100/4588595) loss:2.424 lr:0.0000100 epoch_Time:24655.0min: [2024-01-05 21:16:17,476][model8_pretrain.py][INFO] Epoch:[0/2](713100/4588595) loss:2.989 lr:0.0000100 
epoch_Time:24655.0min: [2024-01-05 21:16:17,477][model8_pretrain.py][INFO] Epoch:[0/2](713100/4588595) loss:2.887 lr:0.0000100 epoch_Time:24655.0min: [2024-01-05 21:16:17,476][model8_pretrain.py][INFO] Epoch:[0/2](713100/4588595) loss:2.956 lr:0.0000100 epoch_Time:24655.0min: [2024-01-05 21:16:17,477][model8_pretrain.py][INFO] Epoch:[0/2](713100/4588595) loss:2.665 lr:0.0000100 epoch_Time:24655.0min: [2024-01-05 21:16:54,428][model8_pretrain.py][INFO] Epoch:[0/2](713200/4588595) loss:2.853 lr:0.0000100 epoch_Time:24653.0min: [2024-01-05 21:16:54,428][model8_pretrain.py][INFO] Epoch:[0/2](713200/4588595) loss:3.111 lr:0.0000100 epoch_Time:24653.0min: [2024-01-05 21:16:54,428][model8_pretrain.py][INFO] Epoch:[0/2](713200/4588595) loss:2.755 lr:0.0000100 epoch_Time:24653.0min: [2024-01-05 21:16:54,428][model8_pretrain.py][INFO] Epoch:[0/2](713200/4588595) loss:2.571 lr:0.0000100 epoch_Time:24653.0min: [2024-01-05 21:16:54,428][model8_pretrain.py][INFO] Epoch:[0/2](713200/4588595) loss:2.410 lr:0.0000100 epoch_Time:24653.0min: [2024-01-05 21:16:54,428][model8_pretrain.py][INFO] Epoch:[0/2](713200/4588595) loss:2.287 lr:0.0000100 epoch_Time:24653.0min: [2024-01-05 21:16:54,428][model8_pretrain.py][INFO] Epoch:[0/2](713200/4588595) loss:2.946 lr:0.0000100 epoch_Time:24653.0min: [2024-01-05 21:16:54,428][model8_pretrain.py][INFO] Epoch:[0/2](713200/4588595) loss:2.957 lr:0.0000100 epoch_Time:24653.0min: [2024-01-05 21:17:31,362][model8_pretrain.py][INFO] Epoch:[0/2](713300/4588595) loss:2.806 lr:0.0000100 epoch_Time:24653.0min: [2024-01-05 21:17:31,362][model8_pretrain.py][INFO] Epoch:[0/2](713300/4588595) loss:2.699 lr:0.0000100 epoch_Time:24653.0min: [2024-01-05 21:17:31,362][model8_pretrain.py][INFO] Epoch:[0/2](713300/4588595) loss:2.547 lr:0.0000100 epoch_Time:24653.0min: [2024-01-05 21:17:31,362][model8_pretrain.py][INFO] Epoch:[0/2](713300/4588595) loss:3.015 lr:0.0000100 epoch_Time:24653.0min: [2024-01-05 21:17:31,362][model8_pretrain.py][INFO] 
Epoch:[0/2](713300/4588595) loss:3.032 lr:0.0000100 epoch_Time:24653.0min: [2024-01-05 21:17:31,362][model8_pretrain.py][INFO] Epoch:[0/2](713300/4588595) loss:2.771 lr:0.0000100 epoch_Time:24653.0min: [2024-01-05 21:17:31,362][model8_pretrain.py][INFO] Epoch:[0/2](713300/4588595) loss:2.359 lr:0.0000100 epoch_Time:24653.0min: [2024-01-05 21:17:31,362][model8_pretrain.py][INFO] Epoch:[0/2](713300/4588595) loss:3.316 lr:0.0000100 epoch_Time:24653.0min: [2024-01-05 21:18:08,308][model8_pretrain.py][INFO] Epoch:[0/2](713400/4588595) loss:2.324 lr:0.0000100 epoch_Time:24652.0min: [2024-01-05 21:18:08,308][model8_pretrain.py][INFO] Epoch:[0/2](713400/4588595) loss:2.829 lr:0.0000100 epoch_Time:24652.0min: [2024-01-05 21:18:08,308][model8_pretrain.py][INFO] Epoch:[0/2](713400/4588595) loss:3.045 lr:0.0000100 epoch_Time:24652.0min: [2024-01-05 21:18:08,308][model8_pretrain.py][INFO] Epoch:[0/2](713400/4588595) loss:2.932 lr:0.0000100 epoch_Time:24652.0min: [2024-01-05 21:18:08,308][model8_pretrain.py][INFO] Epoch:[0/2](713400/4588595) loss:2.996 lr:0.0000100 epoch_Time:24652.0min: [2024-01-05 21:18:08,308][model8_pretrain.py][INFO] Epoch:[0/2](713400/4588595) loss:3.138 lr:0.0000100 epoch_Time:24652.0min: [2024-01-05 21:18:08,308][model8_pretrain.py][INFO] Epoch:[0/2](713400/4588595) loss:2.618 lr:0.0000100 epoch_Time:24652.0min: [2024-01-05 21:18:08,308][model8_pretrain.py][INFO] Epoch:[0/2](713400/4588595) loss:2.559 lr:0.0000100 epoch_Time:24652.0min: [2024-01-05 21:18:51,669][model8_pretrain.py][INFO] Epoch:[0/2](713500/4588595) loss:2.964 lr:0.0000100 epoch_Time:24652.0min: [2024-01-05 21:18:51,669][model8_pretrain.py][INFO] Epoch:[0/2](713500/4588595) loss:3.072 lr:0.0000100 epoch_Time:24652.0min: [2024-01-05 21:18:51,669][model8_pretrain.py][INFO] Epoch:[0/2](713500/4588595) loss:2.528 lr:0.0000100 epoch_Time:24652.0min: [2024-01-05 21:18:51,669][model8_pretrain.py][INFO] Epoch:[0/2](713500/4588595) loss:2.788 lr:0.0000100 epoch_Time:24652.0min: [2024-01-05 
21:18:51,669][model8_pretrain.py][INFO] Epoch:[0/2](713500/4588595) loss:3.041 lr:0.0000100 epoch_Time:24652.0min: [2024-01-05 21:18:51,669][model8_pretrain.py][INFO] Epoch:[0/2](713500/4588595) loss:3.124 lr:0.0000100 epoch_Time:24652.0min: [2024-01-05 21:18:51,670][model8_pretrain.py][INFO] Epoch:[0/2](713500/4588595) loss:3.120 lr:0.0000100 epoch_Time:24652.0min: [2024-01-05 21:18:51,670][model8_pretrain.py][INFO] Epoch:[0/2](713500/4588595) loss:2.608 lr:0.0000100 epoch_Time:24652.0min: [2024-01-05 21:19:33,763][model8_pretrain.py][INFO] Epoch:[0/2](713600/4588595) loss:2.873 lr:0.0000100 epoch_Time:24652.0min: [2024-01-05 21:19:33,763][model8_pretrain.py][INFO] Epoch:[0/2](713600/4588595) loss:3.059 lr:0.0000100 epoch_Time:24652.0min: [2024-01-05 21:19:33,763][model8_pretrain.py][INFO] Epoch:[0/2](713600/4588595) loss:2.808 lr:0.0000100 epoch_Time:24652.0min: [2024-01-05 21:19:33,763][model8_pretrain.py][INFO] Epoch:[0/2](713600/4588595) loss:2.544 lr:0.0000100 epoch_Time:24652.0min: [2024-01-05 21:19:33,763][model8_pretrain.py][INFO] Epoch:[0/2](713600/4588595) loss:2.920 lr:0.0000100 epoch_Time:24652.0min: [2024-01-05 21:19:33,763][model8_pretrain.py][INFO] Epoch:[0/2](713600/4588595) loss:2.424 lr:0.0000100 epoch_Time:24652.0min: [2024-01-05 21:19:33,763][model8_pretrain.py][INFO] Epoch:[0/2](713600/4588595) loss:2.950 lr:0.0000100 epoch_Time:24652.0min: [2024-01-05 21:19:33,763][model8_pretrain.py][INFO] Epoch:[0/2](713600/4588595) loss:2.701 lr:0.0000100 epoch_Time:24652.0min: [2024-01-05 21:20:10,699][model8_pretrain.py][INFO] Epoch:[0/2](713700/4588595) loss:2.982 lr:0.0000100 epoch_Time:24651.0min: [2024-01-05 21:20:10,699][model8_pretrain.py][INFO] Epoch:[0/2](713700/4588595) loss:3.083 lr:0.0000100 epoch_Time:24651.0min: [2024-01-05 21:20:10,699][model8_pretrain.py][INFO] Epoch:[0/2](713700/4588595) loss:2.671 lr:0.0000100 epoch_Time:24651.0min: [2024-01-05 21:20:10,699][model8_pretrain.py][INFO] Epoch:[0/2](713700/4588595) loss:2.678 lr:0.0000100 
epoch_Time:24651.0min: [2024-01-05 21:20:10,699][model8_pretrain.py][INFO] Epoch:[0/2](713700/4588595) loss:2.948 lr:0.0000100 epoch_Time:24651.0min: [2024-01-05 21:20:10,699][model8_pretrain.py][INFO] Epoch:[0/2](713700/4588595) loss:3.274 lr:0.0000100 epoch_Time:24651.0min: [2024-01-05 21:20:10,699][model8_pretrain.py][INFO] Epoch:[0/2](713700/4588595) loss:2.895 lr:0.0000100 epoch_Time:24651.0min: [2024-01-05 21:20:10,699][model8_pretrain.py][INFO] Epoch:[0/2](713700/4588595) loss:2.504 lr:0.0000100 epoch_Time:24651.0min: [2024-01-05 21:20:47,632][model8_pretrain.py][INFO] Epoch:[0/2](713800/4588595) loss:2.413 lr:0.0000100 epoch_Time:24650.0min: [2024-01-05 21:20:47,632][model8_pretrain.py][INFO] Epoch:[0/2](713800/4588595) loss:2.690 lr:0.0000100 epoch_Time:24650.0min: [2024-01-05 21:20:47,632][model8_pretrain.py][INFO] Epoch:[0/2](713800/4588595) loss:3.123 lr:0.0000100 epoch_Time:24650.0min: [2024-01-05 21:20:47,632][model8_pretrain.py][INFO] Epoch:[0/2](713800/4588595) loss:3.050 lr:0.0000100 epoch_Time:24650.0min: [2024-01-05 21:20:47,632][model8_pretrain.py][INFO] Epoch:[0/2](713800/4588595) loss:2.466 lr:0.0000100 epoch_Time:24650.0min: [2024-01-05 21:20:47,632][model8_pretrain.py][INFO] Epoch:[0/2](713800/4588595) loss:2.772 lr:0.0000100 epoch_Time:24650.0min: [2024-01-05 21:20:47,632][model8_pretrain.py][INFO] Epoch:[0/2](713800/4588595) loss:2.798 lr:0.0000100 epoch_Time:24650.0min: [2024-01-05 21:20:47,632][model8_pretrain.py][INFO] Epoch:[0/2](713800/4588595) loss:2.648 lr:0.0000100 epoch_Time:24650.0min: [2024-01-05 21:21:24,564][model8_pretrain.py][INFO] Epoch:[0/2](713900/4588595) loss:2.983 lr:0.0000100 epoch_Time:24650.0min: [2024-01-05 21:21:24,564][model8_pretrain.py][INFO] Epoch:[0/2](713900/4588595) loss:3.117 lr:0.0000100 epoch_Time:24650.0min: [2024-01-05 21:21:24,564][model8_pretrain.py][INFO] Epoch:[0/2](713900/4588595) loss:2.411 lr:0.0000100 epoch_Time:24650.0min: [2024-01-05 21:21:24,564][model8_pretrain.py][INFO] 
Epoch:[0/2](713900/4588595) loss:2.463 lr:0.0000100 epoch_Time:24650.0min: [2024-01-05 21:21:24,565][model8_pretrain.py][INFO] Epoch:[0/2](713900/4588595) loss:3.246 lr:0.0000100 epoch_Time:24650.0min: [2024-01-05 21:21:24,565][model8_pretrain.py][INFO] Epoch:[0/2](713900/4588595) loss:2.731 lr:0.0000100 epoch_Time:24650.0min: [2024-01-05 21:21:24,565][model8_pretrain.py][INFO] Epoch:[0/2](713900/4588595) loss:2.825 lr:0.0000100 epoch_Time:24650.0min: [2024-01-05 21:21:24,565][model8_pretrain.py][INFO] Epoch:[0/2](713900/4588595) loss:3.051 lr:0.0000100 epoch_Time:24650.0min: [2024-01-05 21:22:01,509][model8_pretrain.py][INFO] Epoch:[0/2](714000/4588595) loss:2.689 lr:0.0000100 epoch_Time:24649.0min: [2024-01-05 21:22:01,509][model8_pretrain.py][INFO] Epoch:[0/2](714000/4588595) loss:2.666 lr:0.0000100 epoch_Time:24649.0min: [2024-01-05 21:22:01,509][model8_pretrain.py][INFO] Epoch:[0/2](714000/4588595) loss:3.236 lr:0.0000100 epoch_Time:24649.0min: [2024-01-05 21:22:01,509][model8_pretrain.py][INFO] Epoch:[0/2](714000/4588595) loss:2.669 lr:0.0000100 epoch_Time:24649.0min: [2024-01-05 21:22:01,509][model8_pretrain.py][INFO] Epoch:[0/2](714000/4588595) loss:3.019 lr:0.0000100 epoch_Time:24649.0min: [2024-01-05 21:22:01,509][model8_pretrain.py][INFO] Epoch:[0/2](714000/4588595) loss:3.012 lr:0.0000100 epoch_Time:24649.0min: [2024-01-05 21:22:01,509][model8_pretrain.py][INFO] Epoch:[0/2](714000/4588595) loss:2.997 lr:0.0000100 epoch_Time:24649.0min: [2024-01-05 21:22:01,509][model8_pretrain.py][INFO] Epoch:[0/2](714000/4588595) loss:2.847 lr:0.0000100 epoch_Time:24649.0min: [2024-01-05 21:22:38,447][model8_pretrain.py][INFO] Epoch:[0/2](714100/4588595) loss:3.021 lr:0.0000100 epoch_Time:24648.0min: [2024-01-05 21:22:38,447][model8_pretrain.py][INFO] Epoch:[0/2](714100/4588595) loss:3.027 lr:0.0000100 epoch_Time:24648.0min: [2024-01-05 21:22:38,447][model8_pretrain.py][INFO] Epoch:[0/2](714100/4588595) loss:2.963 lr:0.0000100 epoch_Time:24648.0min: [2024-01-05 
21:22:38,447][model8_pretrain.py][INFO] Epoch:[0/2](714100/4588595) loss:2.960 lr:0.0000100 epoch_Time:24648.0min: [2024-01-05 21:22:38,447][model8_pretrain.py][INFO] Epoch:[0/2](714100/4588595) loss:3.044 lr:0.0000100 epoch_Time:24648.0min: [2024-01-05 21:22:38,447][model8_pretrain.py][INFO] Epoch:[0/2](714100/4588595) loss:3.163 lr:0.0000100 epoch_Time:24648.0min: [2024-01-05 21:22:38,447][model8_pretrain.py][INFO] Epoch:[0/2](714100/4588595) loss:3.075 lr:0.0000100 epoch_Time:24648.0min: [2024-01-05 21:22:38,447][model8_pretrain.py][INFO] Epoch:[0/2](714100/4588595) loss:2.366 lr:0.0000100 epoch_Time:24648.0min: [2024-01-05 21:23:15,387][model8_pretrain.py][INFO] Epoch:[0/2](714200/4588595) loss:3.038 lr:0.0000100 epoch_Time:24647.0min: [2024-01-05 21:23:15,387][model8_pretrain.py][INFO] Epoch:[0/2](714200/4588595) loss:2.539 lr:0.0000100 epoch_Time:24647.0min: [2024-01-05 21:23:15,387][model8_pretrain.py][INFO] Epoch:[0/2](714200/4588595) loss:2.713 lr:0.0000100 epoch_Time:24647.0min: [2024-01-05 21:23:15,387][model8_pretrain.py][INFO] Epoch:[0/2](714200/4588595) loss:2.903 lr:0.0000100 epoch_Time:24647.0min: [2024-01-05 21:23:15,387][model8_pretrain.py][INFO] Epoch:[0/2](714200/4588595) loss:2.720 lr:0.0000100 epoch_Time:24647.0min: [2024-01-05 21:23:15,387][model8_pretrain.py][INFO] Epoch:[0/2](714200/4588595) loss:2.445 lr:0.0000100 epoch_Time:24647.0min: [2024-01-05 21:23:15,387][model8_pretrain.py][INFO] Epoch:[0/2](714200/4588595) loss:2.862 lr:0.0000100 epoch_Time:24647.0min: [2024-01-05 21:23:15,387][model8_pretrain.py][INFO] Epoch:[0/2](714200/4588595) loss:3.340 lr:0.0000100 epoch_Time:24647.0min: [2024-01-05 21:23:58,975][model8_pretrain.py][INFO] Epoch:[0/2](714300/4588595) loss:2.767 lr:0.0000100 epoch_Time:24647.0min: [2024-01-05 21:23:58,975][model8_pretrain.py][INFO] Epoch:[0/2](714300/4588595) loss:2.941 lr:0.0000100 epoch_Time:24647.0min: [2024-01-05 21:23:58,975][model8_pretrain.py][INFO] Epoch:[0/2](714300/4588595) loss:2.981 lr:0.0000100 
epoch_Time:24647.0min: [2024-01-05 21:23:58,975][model8_pretrain.py][INFO] Epoch:[0/2](714300/4588595) loss:3.062 lr:0.0000100 epoch_Time:24647.0min: [2024-01-05 21:23:58,975][model8_pretrain.py][INFO] Epoch:[0/2](714300/4588595) loss:2.494 lr:0.0000100 epoch_Time:24647.0min: [2024-01-05 21:23:58,976][model8_pretrain.py][INFO] Epoch:[0/2](714300/4588595) loss:2.989 lr:0.0000100 epoch_Time:24647.0min: [2024-01-05 21:23:58,976][model8_pretrain.py][INFO] Epoch:[0/2](714300/4588595) loss:2.595 lr:0.0000100 epoch_Time:24647.0min: [2024-01-05 21:23:58,980][model8_pretrain.py][INFO] Epoch:[0/2](714300/4588595) loss:2.987 lr:0.0000100 epoch_Time:24647.0min: [2024-01-05 21:24:41,089][model8_pretrain.py][INFO] Epoch:[0/2](714400/4588595) loss:2.432 lr:0.0000100 epoch_Time:24647.0min: [2024-01-05 21:24:41,089][model8_pretrain.py][INFO] Epoch:[0/2](714400/4588595) loss:2.664 lr:0.0000100 epoch_Time:24647.0min: [2024-01-05 21:24:41,089][model8_pretrain.py][INFO] Epoch:[0/2](714400/4588595) loss:2.970 lr:0.0000100 epoch_Time:24647.0min: [2024-01-05 21:24:41,089][model8_pretrain.py][INFO] Epoch:[0/2](714400/4588595) loss:3.278 lr:0.0000100 epoch_Time:24647.0min: [2024-01-05 21:24:41,089][model8_pretrain.py][INFO] Epoch:[0/2](714400/4588595) loss:3.077 lr:0.0000100 epoch_Time:24647.0min: [2024-01-05 21:24:41,089][model8_pretrain.py][INFO] Epoch:[0/2](714400/4588595) loss:2.972 lr:0.0000100 epoch_Time:24647.0min: [2024-01-05 21:24:41,089][model8_pretrain.py][INFO] Epoch:[0/2](714400/4588595) loss:2.389 lr:0.0000100 epoch_Time:24647.0min: [2024-01-05 21:24:41,089][model8_pretrain.py][INFO] Epoch:[0/2](714400/4588595) loss:2.824 lr:0.0000100 epoch_Time:24647.0min: [2024-01-05 21:25:18,041][model8_pretrain.py][INFO] Epoch:[0/2](714500/4588595) loss:2.186 lr:0.0000100 epoch_Time:24646.0min: [2024-01-05 21:25:18,041][model8_pretrain.py][INFO] Epoch:[0/2](714500/4588595) loss:3.376 lr:0.0000100 epoch_Time:24646.0min: [2024-01-05 21:25:18,041][model8_pretrain.py][INFO] 
Epoch:[0/2](714500/4588595) loss:3.262 lr:0.0000100 epoch_Time:24646.0min: [2024-01-05 21:25:18,041][model8_pretrain.py][INFO] Epoch:[0/2](714500/4588595) loss:2.926 lr:0.0000100 epoch_Time:24646.0min: [2024-01-05 21:25:18,041][model8_pretrain.py][INFO] Epoch:[0/2](714500/4588595) loss:2.825 lr:0.0000100 epoch_Time:24646.0min: [2024-01-05 21:25:18,041][model8_pretrain.py][INFO] Epoch:[0/2](714500/4588595) loss:2.573 lr:0.0000100 epoch_Time:24646.0min: [2024-01-05 21:25:18,041][model8_pretrain.py][INFO] Epoch:[0/2](714500/4588595) loss:2.908 lr:0.0000100 epoch_Time:24646.0min: [2024-01-05 21:25:18,041][model8_pretrain.py][INFO] Epoch:[0/2](714500/4588595) loss:2.919 lr:0.0000100 epoch_Time:24646.0min: [2024-01-05 21:25:54,997][model8_pretrain.py][INFO] Epoch:[0/2](714600/4588595) loss:2.617 lr:0.0000100 epoch_Time:24645.0min: [2024-01-05 21:25:54,997][model8_pretrain.py][INFO] Epoch:[0/2](714600/4588595) loss:2.987 lr:0.0000100 epoch_Time:24645.0min: [2024-01-05 21:25:54,997][model8_pretrain.py][INFO] Epoch:[0/2](714600/4588595) loss:2.844 lr:0.0000100 epoch_Time:24645.0min: [2024-01-05 21:25:54,997][model8_pretrain.py][INFO] Epoch:[0/2](714600/4588595) loss:2.361 lr:0.0000100 epoch_Time:24645.0min: [2024-01-05 21:25:54,997][model8_pretrain.py][INFO] Epoch:[0/2](714600/4588595) loss:3.326 lr:0.0000100 epoch_Time:24645.0min: [2024-01-05 21:25:54,997][model8_pretrain.py][INFO] Epoch:[0/2](714600/4588595) loss:2.755 lr:0.0000100 epoch_Time:24645.0min: [2024-01-05 21:25:54,997][model8_pretrain.py][INFO] Epoch:[0/2](714600/4588595) loss:2.859 lr:0.0000100 epoch_Time:24645.0min: [2024-01-05 21:25:54,997][model8_pretrain.py][INFO] Epoch:[0/2](714600/4588595) loss:2.941 lr:0.0000100 epoch_Time:24645.0min: [2024-01-05 21:26:31,963][model8_pretrain.py][INFO] Epoch:[0/2](714700/4588595) loss:2.916 lr:0.0000100 epoch_Time:24645.0min: [2024-01-05 21:26:31,963][model8_pretrain.py][INFO] Epoch:[0/2](714700/4588595) loss:2.849 lr:0.0000100 epoch_Time:24645.0min: [2024-01-05 
21:26:31,963][model8_pretrain.py][INFO] Epoch:[0/2](714700/4588595) loss:2.736 lr:0.0000100 epoch_Time:24645.0min: [2024-01-05 21:26:31,963][model8_pretrain.py][INFO] Epoch:[0/2](714700/4588595) loss:3.213 lr:0.0000100 epoch_Time:24645.0min: [2024-01-05 21:26:31,964][model8_pretrain.py][INFO] Epoch:[0/2](714700/4588595) loss:2.220 lr:0.0000100 epoch_Time:24645.0min: [2024-01-05 21:26:31,964][model8_pretrain.py][INFO] Epoch:[0/2](714700/4588595) loss:3.615 lr:0.0000100 epoch_Time:24645.0min: [2024-01-05 21:26:31,964][model8_pretrain.py][INFO] Epoch:[0/2](714700/4588595) loss:3.180 lr:0.0000100 epoch_Time:24645.0min: [2024-01-05 21:26:31,964][model8_pretrain.py][INFO] Epoch:[0/2](714700/4588595) loss:2.946 lr:0.0000100 epoch_Time:24645.0min: [2024-01-05 21:27:08,920][model8_pretrain.py][INFO] Epoch:[0/2](714800/4588595) loss:2.435 lr:0.0000100 epoch_Time:24644.0min: [2024-01-05 21:27:08,920][model8_pretrain.py][INFO] Epoch:[0/2](714800/4588595) loss:2.154 lr:0.0000100 epoch_Time:24644.0min: [2024-01-05 21:27:08,920][model8_pretrain.py][INFO] Epoch:[0/2](714800/4588595) loss:2.630 lr:0.0000100 epoch_Time:24644.0min: [2024-01-05 21:27:08,920][model8_pretrain.py][INFO] Epoch:[0/2](714800/4588595) loss:3.237 lr:0.0000100 epoch_Time:24644.0min: [2024-01-05 21:27:08,920][model8_pretrain.py][INFO] Epoch:[0/2](714800/4588595) loss:3.039 lr:0.0000100 epoch_Time:24644.0min: [2024-01-05 21:27:08,921][model8_pretrain.py][INFO] Epoch:[0/2](714800/4588595) loss:3.157 lr:0.0000100 epoch_Time:24644.0min: [2024-01-05 21:27:08,921][model8_pretrain.py][INFO] Epoch:[0/2](714800/4588595) loss:3.490 lr:0.0000100 epoch_Time:24644.0min: [2024-01-05 21:27:08,921][model8_pretrain.py][INFO] Epoch:[0/2](714800/4588595) loss:3.026 lr:0.0000100 epoch_Time:24644.0min: [2024-01-05 21:27:45,872][model8_pretrain.py][INFO] Epoch:[0/2](714900/4588595) loss:2.209 lr:0.0000100 epoch_Time:24644.0min: [2024-01-05 21:27:45,872][model8_pretrain.py][INFO] Epoch:[0/2](714900/4588595) loss:2.770 lr:0.0000100 
epoch_Time:24644.0min: [2024-01-05 21:27:45,872][model8_pretrain.py][INFO] Epoch:[0/2](714900/4588595) loss:2.762 lr:0.0000100 epoch_Time:24644.0min: [2024-01-05 21:27:45,872][model8_pretrain.py][INFO] Epoch:[0/2](714900/4588595) loss:2.854 lr:0.0000100 epoch_Time:24644.0min: [2024-01-05 21:27:45,872][model8_pretrain.py][INFO] Epoch:[0/2](714900/4588595) loss:2.949 lr:0.0000100 epoch_Time:24644.0min: [2024-01-05 21:27:45,872][model8_pretrain.py][INFO] Epoch:[0/2](714900/4588595) loss:3.304 lr:0.0000100 epoch_Time:24644.0min: [2024-01-05 21:27:45,872][model8_pretrain.py][INFO] Epoch:[0/2](714900/4588595) loss:3.144 lr:0.0000100 epoch_Time:24644.0min: [2024-01-05 21:27:45,872][model8_pretrain.py][INFO] Epoch:[0/2](714900/4588595) loss:2.825 lr:0.0000100 epoch_Time:24644.0min: [2024-01-05 21:28:22,818][model8_pretrain.py][INFO] Epoch:[0/2](715000/4588595) loss:1.900 lr:0.0000100 epoch_Time:24643.0min: [2024-01-05 21:28:22,818][model8_pretrain.py][INFO] Epoch:[0/2](715000/4588595) loss:2.420 lr:0.0000100 epoch_Time:24643.0min: [2024-01-05 21:28:22,818][model8_pretrain.py][INFO] Epoch:[0/2](715000/4588595) loss:2.549 lr:0.0000100 epoch_Time:24643.0min: [2024-01-05 21:28:22,818][model8_pretrain.py][INFO] Epoch:[0/2](715000/4588595) loss:2.354 lr:0.0000100 epoch_Time:24643.0min: [2024-01-05 21:28:22,818][model8_pretrain.py][INFO] Epoch:[0/2](715000/4588595) loss:2.632 lr:0.0000100 epoch_Time:24643.0min: [2024-01-05 21:28:22,818][model8_pretrain.py][INFO] Epoch:[0/2](715000/4588595) loss:2.763 lr:0.0000100 epoch_Time:24643.0min: [2024-01-05 21:28:22,818][model8_pretrain.py][INFO] Epoch:[0/2](715000/4588595) loss:3.141 lr:0.0000100 epoch_Time:24643.0min: [2024-01-05 21:28:22,818][model8_pretrain.py][INFO] Epoch:[0/2](715000/4588595) loss:2.839 lr:0.0000100 epoch_Time:24643.0min: [2024-01-05 21:29:03,275][model8_pretrain.py][INFO] Epoch:[0/2](715100/4588595) loss:3.348 lr:0.0000100 epoch_Time:24642.0min: [2024-01-05 21:29:03,275][model8_pretrain.py][INFO] 
Epoch:[0/2](715100/4588595) loss:2.674 lr:0.0000100 epoch_Time:24642.0min: [2024-01-05 21:29:03,275][model8_pretrain.py][INFO] Epoch:[0/2](715100/4588595) loss:3.147 lr:0.0000100 epoch_Time:24642.0min: [2024-01-05 21:29:03,275][model8_pretrain.py][INFO] Epoch:[0/2](715100/4588595) loss:3.082 lr:0.0000100 epoch_Time:24642.0min: [2024-01-05 21:29:03,275][model8_pretrain.py][INFO] Epoch:[0/2](715100/4588595) loss:2.581 lr:0.0000100 epoch_Time:24642.0min: [2024-01-05 21:29:03,275][model8_pretrain.py][INFO] Epoch:[0/2](715100/4588595) loss:3.212 lr:0.0000100 epoch_Time:24642.0min: [2024-01-05 21:29:03,275][model8_pretrain.py][INFO] Epoch:[0/2](715100/4588595) loss:2.605 lr:0.0000100 epoch_Time:24642.0min: [2024-01-05 21:29:03,275][model8_pretrain.py][INFO] Epoch:[0/2](715100/4588595) loss:2.995 lr:0.0000100 epoch_Time:24642.0min: [2024-01-05 21:29:48,563][model8_pretrain.py][INFO] Epoch:[0/2](715200/4588595) loss:3.008 lr:0.0000100 epoch_Time:24642.0min: [2024-01-05 21:29:48,563][model8_pretrain.py][INFO] Epoch:[0/2](715200/4588595) loss:3.273 lr:0.0000100 epoch_Time:24642.0min: [2024-01-05 21:29:48,563][model8_pretrain.py][INFO] Epoch:[0/2](715200/4588595) loss:2.300 lr:0.0000100 epoch_Time:24642.0min: [2024-01-05 21:29:48,563][model8_pretrain.py][INFO] Epoch:[0/2](715200/4588595) loss:2.851 lr:0.0000100 epoch_Time:24642.0min: [2024-01-05 21:29:48,563][model8_pretrain.py][INFO] Epoch:[0/2](715200/4588595) loss:2.494 lr:0.0000100 epoch_Time:24642.0min: [2024-01-05 21:29:48,563][model8_pretrain.py][INFO] Epoch:[0/2](715200/4588595) loss:2.553 lr:0.0000100 epoch_Time:24642.0min: [2024-01-05 21:29:48,563][model8_pretrain.py][INFO] Epoch:[0/2](715200/4588595) loss:2.623 lr:0.0000100 epoch_Time:24642.0min: [2024-01-05 21:29:48,563][model8_pretrain.py][INFO] Epoch:[0/2](715200/4588595) loss:2.179 lr:0.0000100 epoch_Time:24642.0min: [2024-01-05 21:30:25,509][model8_pretrain.py][INFO] Epoch:[0/2](715300/4588595) loss:2.682 lr:0.0000100 epoch_Time:24641.0min: [2024-01-05 
21:30:25,509][model8_pretrain.py][INFO] Epoch:[0/2](715300/4588595) loss:3.033 lr:0.0000100 epoch_Time:24641.0min: [2024-01-05 21:30:25,509][model8_pretrain.py][INFO] Epoch:[0/2](715300/4588595) loss:2.997 lr:0.0000100 epoch_Time:24641.0min: [2024-01-05 21:30:25,509][model8_pretrain.py][INFO] Epoch:[0/2](715300/4588595) loss:2.672 lr:0.0000100 epoch_Time:24641.0min: [2024-01-05 21:30:25,509][model8_pretrain.py][INFO] Epoch:[0/2](715300/4588595) loss:3.236 lr:0.0000100 epoch_Time:24641.0min: [2024-01-05 21:30:25,509][model8_pretrain.py][INFO] Epoch:[0/2](715300/4588595) loss:2.888 lr:0.0000100 epoch_Time:24641.0min: [2024-01-05 21:30:25,509][model8_pretrain.py][INFO] Epoch:[0/2](715300/4588595) loss:2.700 lr:0.0000100 epoch_Time:24641.0min: [2024-01-05 21:30:25,509][model8_pretrain.py][INFO] Epoch:[0/2](715300/4588595) loss:3.312 lr:0.0000100 epoch_Time:24641.0min: [2024-01-05 21:31:02,468][model8_pretrain.py][INFO] Epoch:[0/2](715400/4588595) loss:3.030 lr:0.0000100 epoch_Time:24640.0min: [2024-01-05 21:31:02,468][model8_pretrain.py][INFO] Epoch:[0/2](715400/4588595) loss:1.877 lr:0.0000100 epoch_Time:24640.0min: [2024-01-05 21:31:02,468][model8_pretrain.py][INFO] Epoch:[0/2](715400/4588595) loss:3.102 lr:0.0000100 epoch_Time:24640.0min: [2024-01-05 21:31:02,468][model8_pretrain.py][INFO] Epoch:[0/2](715400/4588595) loss:2.986 lr:0.0000100 epoch_Time:24640.0min: [2024-01-05 21:31:02,468][model8_pretrain.py][INFO] Epoch:[0/2](715400/4588595) loss:2.457 lr:0.0000100 epoch_Time:24640.0min: [2024-01-05 21:31:02,468][model8_pretrain.py][INFO] Epoch:[0/2](715400/4588595) loss:2.751 lr:0.0000100 epoch_Time:24640.0min: [2024-01-05 21:31:02,468][model8_pretrain.py][INFO] Epoch:[0/2](715400/4588595) loss:2.756 lr:0.0000100 epoch_Time:24640.0min: [2024-01-05 21:31:02,468][model8_pretrain.py][INFO] Epoch:[0/2](715400/4588595) loss:3.127 lr:0.0000100 epoch_Time:24640.0min: [2024-01-05 21:31:39,398][model8_pretrain.py][INFO] Epoch:[0/2](715500/4588595) loss:2.960 lr:0.0000100 
epoch_Time:24640.0min: [2024-01-05 21:31:39,398][model8_pretrain.py][INFO] Epoch:[0/2](715500/4588595) loss:2.542 lr:0.0000100 epoch_Time:24640.0min: [2024-01-05 21:31:39,398][model8_pretrain.py][INFO] Epoch:[0/2](715500/4588595) loss:2.978 lr:0.0000100 epoch_Time:24640.0min: [2024-01-05 21:31:39,398][model8_pretrain.py][INFO] Epoch:[0/2](715500/4588595) loss:2.938 lr:0.0000100 epoch_Time:24640.0min: [2024-01-05 21:31:39,398][model8_pretrain.py][INFO] Epoch:[0/2](715500/4588595) loss:2.727 lr:0.0000100 epoch_Time:24640.0min: [2024-01-05 21:31:39,398][model8_pretrain.py][INFO] Epoch:[0/2](715500/4588595) loss:2.826 lr:0.0000100 epoch_Time:24640.0min: [2024-01-05 21:31:39,398][model8_pretrain.py][INFO] Epoch:[0/2](715500/4588595) loss:2.744 lr:0.0000100 epoch_Time:24640.0min: [2024-01-05 21:31:39,398][model8_pretrain.py][INFO] Epoch:[0/2](715500/4588595) loss:2.991 lr:0.0000100 epoch_Time:24640.0min: [2024-01-05 21:32:16,350][model8_pretrain.py][INFO] Epoch:[0/2](715600/4588595) loss:2.401 lr:0.0000100 epoch_Time:24639.0min: [2024-01-05 21:32:16,350][model8_pretrain.py][INFO] Epoch:[0/2](715600/4588595) loss:2.842 lr:0.0000100 epoch_Time:24639.0min: [2024-01-05 21:32:16,350][model8_pretrain.py][INFO] Epoch:[0/2](715600/4588595) loss:2.838 lr:0.0000100 epoch_Time:24639.0min: [2024-01-05 21:32:16,350][model8_pretrain.py][INFO] Epoch:[0/2](715600/4588595) loss:3.058 lr:0.0000100 epoch_Time:24639.0min: [2024-01-05 21:32:16,350][model8_pretrain.py][INFO] Epoch:[0/2](715600/4588595) loss:2.409 lr:0.0000100 epoch_Time:24639.0min: [2024-01-05 21:32:16,350][model8_pretrain.py][INFO] Epoch:[0/2](715600/4588595) loss:2.659 lr:0.0000100 epoch_Time:24639.0min: [2024-01-05 21:32:16,350][model8_pretrain.py][INFO] Epoch:[0/2](715600/4588595) loss:2.202 lr:0.0000100 epoch_Time:24639.0min: [2024-01-05 21:32:16,350][model8_pretrain.py][INFO] Epoch:[0/2](715600/4588595) loss:3.099 lr:0.0000100 epoch_Time:24639.0min: [2024-01-05 21:32:53,312][model8_pretrain.py][INFO] 
Epoch:[0/2](715700/4588595) loss:3.079 lr:0.0000100 epoch_Time:24638.0min: [2024-01-05 21:32:53,312][model8_pretrain.py][INFO] Epoch:[0/2](715700/4588595) loss:2.418 lr:0.0000100 epoch_Time:24638.0min: [2024-01-05 21:32:53,312][model8_pretrain.py][INFO] Epoch:[0/2](715700/4588595) loss:2.849 lr:0.0000100 epoch_Time:24638.0min: [2024-01-05 21:32:53,312][model8_pretrain.py][INFO] Epoch:[0/2](715700/4588595) loss:2.820 lr:0.0000100 epoch_Time:24638.0min: [2024-01-05 21:32:53,313][model8_pretrain.py][INFO] Epoch:[0/2](715700/4588595) loss:2.399 lr:0.0000100 epoch_Time:24638.0min: [2024-01-05 21:32:53,313][model8_pretrain.py][INFO] Epoch:[0/2](715700/4588595) loss:2.945 lr:0.0000100 epoch_Time:24638.0min: [2024-01-05 21:32:53,313][model8_pretrain.py][INFO] Epoch:[0/2](715700/4588595) loss:2.590 lr:0.0000100 epoch_Time:24638.0min: [2024-01-05 21:32:53,313][model8_pretrain.py][INFO] Epoch:[0/2](715700/4588595) loss:2.527 lr:0.0000100 epoch_Time:24638.0min: [2024-01-05 21:33:30,315][model8_pretrain.py][INFO] Epoch:[0/2](715800/4588595) loss:2.581 lr:0.0000100 epoch_Time:24638.0min: [2024-01-05 21:33:30,315][model8_pretrain.py][INFO] Epoch:[0/2](715800/4588595) loss:2.727 lr:0.0000100 epoch_Time:24638.0min: [2024-01-05 21:33:30,315][model8_pretrain.py][INFO] Epoch:[0/2](715800/4588595) loss:3.161 lr:0.0000100 epoch_Time:24638.0min: [2024-01-05 21:33:30,315][model8_pretrain.py][INFO] Epoch:[0/2](715800/4588595) loss:2.887 lr:0.0000100 epoch_Time:24638.0min: [2024-01-05 21:33:30,315][model8_pretrain.py][INFO] Epoch:[0/2](715800/4588595) loss:2.811 lr:0.0000100 epoch_Time:24638.0min: [2024-01-05 21:33:30,315][model8_pretrain.py][INFO] Epoch:[0/2](715800/4588595) loss:2.636 lr:0.0000100 epoch_Time:24638.0min: [2024-01-05 21:33:30,315][model8_pretrain.py][INFO] Epoch:[0/2](715800/4588595) loss:2.842 lr:0.0000100 epoch_Time:24638.0min: [2024-01-05 21:33:30,315][model8_pretrain.py][INFO] Epoch:[0/2](715800/4588595) loss:2.489 lr:0.0000100 epoch_Time:24638.0min: [2024-01-05 
21:34:10,825][model8_pretrain.py][INFO] Epoch:[0/2](715900/4588595) loss:2.771 lr:0.0000100 epoch_Time:24637.0min: [2024-01-05 21:34:10,825][model8_pretrain.py][INFO] Epoch:[0/2](715900/4588595) loss:3.079 lr:0.0000100 epoch_Time:24637.0min: [2024-01-05 21:34:10,825][model8_pretrain.py][INFO] Epoch:[0/2](715900/4588595) loss:2.786 lr:0.0000100 epoch_Time:24637.0min: [2024-01-05 21:34:10,825][model8_pretrain.py][INFO] Epoch:[0/2](715900/4588595) loss:3.330 lr:0.0000100 epoch_Time:24637.0min: [2024-01-05 21:34:10,825][model8_pretrain.py][INFO] Epoch:[0/2](715900/4588595) loss:3.435 lr:0.0000100 epoch_Time:24637.0min: [2024-01-05 21:34:10,825][model8_pretrain.py][INFO] Epoch:[0/2](715900/4588595) loss:2.275 lr:0.0000100 epoch_Time:24637.0min: [2024-01-05 21:34:10,825][model8_pretrain.py][INFO] Epoch:[0/2](715900/4588595) loss:3.077 lr:0.0000100 epoch_Time:24637.0min: [2024-01-05 21:34:10,825][model8_pretrain.py][INFO] Epoch:[0/2](715900/4588595) loss:3.227 lr:0.0000100 epoch_Time:24637.0min: [2024-01-05 21:34:56,286][model8_pretrain.py][INFO] Epoch:[0/2](716000/4588595) loss:2.822 lr:0.0000100 epoch_Time:24637.0min: [2024-01-05 21:34:56,286][model8_pretrain.py][INFO] Epoch:[0/2](716000/4588595) loss:2.876 lr:0.0000100 epoch_Time:24637.0min: [2024-01-05 21:34:56,286][model8_pretrain.py][INFO] Epoch:[0/2](716000/4588595) loss:2.988 lr:0.0000100 epoch_Time:24637.0min: [2024-01-05 21:34:56,286][model8_pretrain.py][INFO] Epoch:[0/2](716000/4588595) loss:2.778 lr:0.0000100 epoch_Time:24637.0min: [2024-01-05 21:34:56,286][model8_pretrain.py][INFO] Epoch:[0/2](716000/4588595) loss:2.781 lr:0.0000100 epoch_Time:24637.0min: [2024-01-05 21:34:56,286][model8_pretrain.py][INFO] Epoch:[0/2](716000/4588595) loss:3.394 lr:0.0000100 epoch_Time:24637.0min: [2024-01-05 21:34:56,286][model8_pretrain.py][INFO] Epoch:[0/2](716000/4588595) loss:3.564 lr:0.0000100 epoch_Time:24637.0min: [2024-01-05 21:34:56,286][model8_pretrain.py][INFO] Epoch:[0/2](716000/4588595) loss:2.821 lr:0.0000100 
epoch_Time:24637.0min: [2024-01-05 21:35:33,248][model8_pretrain.py][INFO] Epoch:[0/2](716100/4588595) loss:3.088 lr:0.0000100 epoch_Time:24637.0min: [2024-01-05 21:35:33,248][model8_pretrain.py][INFO] Epoch:[0/2](716100/4588595) loss:2.888 lr:0.0000100 epoch_Time:24637.0min: [2024-01-05 21:35:33,249][model8_pretrain.py][INFO] Epoch:[0/2](716100/4588595) loss:2.593 lr:0.0000100 epoch_Time:24637.0min: [2024-01-05 21:35:33,249][model8_pretrain.py][INFO] Epoch:[0/2](716100/4588595) loss:3.255 lr:0.0000100 epoch_Time:24637.0min: [2024-01-05 21:35:33,249][model8_pretrain.py][INFO] Epoch:[0/2](716100/4588595) loss:2.816 lr:0.0000100 epoch_Time:24637.0min: [2024-01-05 21:35:33,249][model8_pretrain.py][INFO] Epoch:[0/2](716100/4588595) loss:2.840 lr:0.0000100 epoch_Time:24637.0min: [2024-01-05 21:35:33,249][model8_pretrain.py][INFO] Epoch:[0/2](716100/4588595) loss:3.288 lr:0.0000100 epoch_Time:24637.0min: [2024-01-05 21:35:33,249][model8_pretrain.py][INFO] Epoch:[0/2](716100/4588595) loss:3.089 lr:0.0000100 epoch_Time:24637.0min: [2024-01-05 21:36:10,199][model8_pretrain.py][INFO] Epoch:[0/2](716200/4588595) loss:3.276 lr:0.0000100 epoch_Time:24636.0min: [2024-01-05 21:36:10,199][model8_pretrain.py][INFO] Epoch:[0/2](716200/4588595) loss:3.238 lr:0.0000100 epoch_Time:24636.0min: [2024-01-05 21:36:10,199][model8_pretrain.py][INFO] Epoch:[0/2](716200/4588595) loss:2.943 lr:0.0000100 epoch_Time:24636.0min: [2024-01-05 21:36:10,199][model8_pretrain.py][INFO] Epoch:[0/2](716200/4588595) loss:2.488 lr:0.0000100 epoch_Time:24636.0min: [2024-01-05 21:36:10,199][model8_pretrain.py][INFO] Epoch:[0/2](716200/4588595) loss:3.112 lr:0.0000100 epoch_Time:24636.0min: [2024-01-05 21:36:10,199][model8_pretrain.py][INFO] Epoch:[0/2](716200/4588595) loss:3.019 lr:0.0000100 epoch_Time:24636.0min: [2024-01-05 21:36:10,199][model8_pretrain.py][INFO] Epoch:[0/2](716200/4588595) loss:2.734 lr:0.0000100 epoch_Time:24636.0min: [2024-01-05 21:36:10,199][model8_pretrain.py][INFO] 
Epoch:[0/2](716200/4588595) loss:3.087 lr:0.0000100 epoch_Time:24636.0min: [2024-01-05 21:36:47,144][model8_pretrain.py][INFO] Epoch:[0/2](716300/4588595) loss:2.644 lr:0.0000100 epoch_Time:24635.0min: [2024-01-05 21:36:47,144][model8_pretrain.py][INFO] Epoch:[0/2](716300/4588595) loss:2.798 lr:0.0000100 epoch_Time:24635.0min: [2024-01-05 21:36:47,144][model8_pretrain.py][INFO] Epoch:[0/2](716300/4588595) loss:2.862 lr:0.0000100 epoch_Time:24635.0min: [2024-01-05 21:36:47,144][model8_pretrain.py][INFO] Epoch:[0/2](716300/4588595) loss:2.469 lr:0.0000100 epoch_Time:24635.0min: [2024-01-05 21:36:47,144][model8_pretrain.py][INFO] Epoch:[0/2](716300/4588595) loss:3.254 lr:0.0000100 epoch_Time:24635.0min: [2024-01-05 21:36:47,144][model8_pretrain.py][INFO] Epoch:[0/2](716300/4588595) loss:2.959 lr:0.0000100 epoch_Time:24635.0min: [2024-01-05 21:36:47,144][model8_pretrain.py][INFO] Epoch:[0/2](716300/4588595) loss:2.830 lr:0.0000100 epoch_Time:24635.0min: [2024-01-05 21:36:47,144][model8_pretrain.py][INFO] Epoch:[0/2](716300/4588595) loss:2.898 lr:0.0000100 epoch_Time:24635.0min: [2024-01-05 21:37:24,104][model8_pretrain.py][INFO] Epoch:[0/2](716400/4588595) loss:2.374 lr:0.0000100 epoch_Time:24634.0min: [2024-01-05 21:37:24,104][model8_pretrain.py][INFO] Epoch:[0/2](716400/4588595) loss:3.157 lr:0.0000100 epoch_Time:24634.0min: [2024-01-05 21:37:24,104][model8_pretrain.py][INFO] Epoch:[0/2](716400/4588595) loss:3.102 lr:0.0000100 epoch_Time:24634.0min: [2024-01-05 21:37:24,104][model8_pretrain.py][INFO] Epoch:[0/2](716400/4588595) loss:3.096 lr:0.0000100 epoch_Time:24634.0min: [2024-01-05 21:37:24,105][model8_pretrain.py][INFO] Epoch:[0/2](716400/4588595) loss:2.767 lr:0.0000100 epoch_Time:24634.0min: [2024-01-05 21:37:24,105][model8_pretrain.py][INFO] Epoch:[0/2](716400/4588595) loss:3.087 lr:0.0000100 epoch_Time:24634.0min: [2024-01-05 21:37:24,105][model8_pretrain.py][INFO] Epoch:[0/2](716400/4588595) loss:2.621 lr:0.0000100 epoch_Time:24634.0min: [2024-01-05 
21:37:24,105][model8_pretrain.py][INFO] Epoch:[0/2](716400/4588595) loss:2.662 lr:0.0000100 epoch_Time:24634.0min: [2024-01-05 21:38:01,063][model8_pretrain.py][INFO] Epoch:[0/2](716500/4588595) loss:2.970 lr:0.0000100 epoch_Time:24633.0min: [2024-01-05 21:38:01,063][model8_pretrain.py][INFO] Epoch:[0/2](716500/4588595) loss:2.270 lr:0.0000100 epoch_Time:24633.0min: [2024-01-05 21:38:01,063][model8_pretrain.py][INFO] Epoch:[0/2](716500/4588595) loss:3.050 lr:0.0000100 epoch_Time:24633.0min: [2024-01-05 21:38:01,063][model8_pretrain.py][INFO] Epoch:[0/2](716500/4588595) loss:2.317 lr:0.0000100 epoch_Time:24633.0min: [2024-01-05 21:38:01,063][model8_pretrain.py][INFO] Epoch:[0/2](716500/4588595) loss:2.596 lr:0.0000100 epoch_Time:24633.0min: [2024-01-05 21:38:01,063][model8_pretrain.py][INFO] Epoch:[0/2](716500/4588595) loss:2.742 lr:0.0000100 epoch_Time:24633.0min: [2024-01-05 21:38:01,063][model8_pretrain.py][INFO] Epoch:[0/2](716500/4588595) loss:2.710 lr:0.0000100 epoch_Time:24633.0min: [2024-01-05 21:38:01,063][model8_pretrain.py][INFO] Epoch:[0/2](716500/4588595) loss:2.654 lr:0.0000100 epoch_Time:24633.0min: [2024-01-05 21:38:38,008][model8_pretrain.py][INFO] Epoch:[0/2](716600/4588595) loss:3.123 lr:0.0000100 epoch_Time:24633.0min: [2024-01-05 21:38:38,008][model8_pretrain.py][INFO] Epoch:[0/2](716600/4588595) loss:3.029 lr:0.0000100 epoch_Time:24633.0min: [2024-01-05 21:38:38,008][model8_pretrain.py][INFO] Epoch:[0/2](716600/4588595) loss:2.451 lr:0.0000100 epoch_Time:24633.0min: [2024-01-05 21:38:38,008][model8_pretrain.py][INFO] Epoch:[0/2](716600/4588595) loss:2.669 lr:0.0000100 epoch_Time:24633.0min: [2024-01-05 21:38:38,008][model8_pretrain.py][INFO] Epoch:[0/2](716600/4588595) loss:2.955 lr:0.0000100 epoch_Time:24633.0min: [2024-01-05 21:38:38,008][model8_pretrain.py][INFO] Epoch:[0/2](716600/4588595) loss:2.155 lr:0.0000100 epoch_Time:24633.0min: [2024-01-05 21:38:38,008][model8_pretrain.py][INFO] Epoch:[0/2](716600/4588595) loss:2.144 lr:0.0000100 
epoch_Time:24633.0min: [2024-01-05 21:38:38,008][model8_pretrain.py][INFO] Epoch:[0/2](716600/4588595) loss:2.757 lr:0.0000100 epoch_Time:24633.0min: [2024-01-05 21:39:18,470][model8_pretrain.py][INFO] Epoch:[0/2](716700/4588595) loss:2.961 lr:0.0000100 epoch_Time:24632.0min: [2024-01-05 21:39:18,470][model8_pretrain.py][INFO] Epoch:[0/2](716700/4588595) loss:2.584 lr:0.0000100 epoch_Time:24632.0min: [2024-01-05 21:39:18,470][model8_pretrain.py][INFO] Epoch:[0/2](716700/4588595) loss:2.907 lr:0.0000100 epoch_Time:24632.0min: [2024-01-05 21:39:18,474][model8_pretrain.py][INFO] Epoch:[0/2](716700/4588595) loss:2.753 lr:0.0000100 epoch_Time:24632.0min: [2024-01-05 21:39:18,474][model8_pretrain.py][INFO] Epoch:[0/2](716700/4588595) loss:2.930 lr:0.0000100 epoch_Time:24632.0min: [2024-01-05 21:39:18,475][model8_pretrain.py][INFO] Epoch:[0/2](716700/4588595) loss:2.660 lr:0.0000100 epoch_Time:24632.0min: [2024-01-05 21:39:18,475][model8_pretrain.py][INFO] Epoch:[0/2](716700/4588595) loss:2.551 lr:0.0000100 epoch_Time:24632.0min: [2024-01-05 21:39:18,475][model8_pretrain.py][INFO] Epoch:[0/2](716700/4588595) loss:2.996 lr:0.0000100 epoch_Time:24632.0min: [2024-01-05 21:40:03,992][model8_pretrain.py][INFO] Epoch:[0/2](716800/4588595) loss:3.157 lr:0.0000100 epoch_Time:24632.0min: [2024-01-05 21:40:03,992][model8_pretrain.py][INFO] Epoch:[0/2](716800/4588595) loss:2.896 lr:0.0000100 epoch_Time:24632.0min: [2024-01-05 21:40:03,992][model8_pretrain.py][INFO] Epoch:[0/2](716800/4588595) loss:2.783 lr:0.0000100 epoch_Time:24632.0min: [2024-01-05 21:40:03,992][model8_pretrain.py][INFO] Epoch:[0/2](716800/4588595) loss:2.958 lr:0.0000100 epoch_Time:24632.0min: [2024-01-05 21:40:03,992][model8_pretrain.py][INFO] Epoch:[0/2](716800/4588595) loss:3.173 lr:0.0000100 epoch_Time:24632.0min: [2024-01-05 21:40:03,992][model8_pretrain.py][INFO] Epoch:[0/2](716800/4588595) loss:2.937 lr:0.0000100 epoch_Time:24632.0min: [2024-01-05 21:40:03,992][model8_pretrain.py][INFO] 
Epoch:[0/2](716800/4588595) loss:2.435 lr:0.0000100 epoch_Time:24632.0min: [2024-01-05 21:40:03,992][model8_pretrain.py][INFO] Epoch:[0/2](716800/4588595) loss:2.729 lr:0.0000100 epoch_Time:24632.0min: [2024-01-05 21:40:40,928][model8_pretrain.py][INFO] Epoch:[0/2](716900/4588595) loss:3.213 lr:0.0000100 epoch_Time:24632.0min: [2024-01-05 21:40:40,928][model8_pretrain.py][INFO] Epoch:[0/2](716900/4588595) loss:2.717 lr:0.0000100 epoch_Time:24632.0min: [2024-01-05 21:40:40,928][model8_pretrain.py][INFO] Epoch:[0/2](716900/4588595) loss:2.720 lr:0.0000100 epoch_Time:24632.0min: [2024-01-05 21:40:40,928][model8_pretrain.py][INFO] Epoch:[0/2](716900/4588595) loss:2.439 lr:0.0000100 epoch_Time:24632.0min: [2024-01-05 21:40:40,928][model8_pretrain.py][INFO] Epoch:[0/2](716900/4588595) loss:3.134 lr:0.0000100 epoch_Time:24632.0min: [2024-01-05 21:40:40,928][model8_pretrain.py][INFO] Epoch:[0/2](716900/4588595) loss:2.457 lr:0.0000100 epoch_Time:24632.0min: [2024-01-05 21:40:40,928][model8_pretrain.py][INFO] Epoch:[0/2](716900/4588595) loss:2.707 lr:0.0000100 epoch_Time:24632.0min: [2024-01-05 21:40:40,929][model8_pretrain.py][INFO] Epoch:[0/2](716900/4588595) loss:3.358 lr:0.0000100 epoch_Time:24632.0min: [2024-01-05 21:41:17,863][model8_pretrain.py][INFO] Epoch:[0/2](717000/4588595) loss:2.626 lr:0.0000100 epoch_Time:24631.0min: [2024-01-05 21:41:17,863][model8_pretrain.py][INFO] Epoch:[0/2](717000/4588595) loss:2.935 lr:0.0000100 epoch_Time:24631.0min: [2024-01-05 21:41:17,863][model8_pretrain.py][INFO] Epoch:[0/2](717000/4588595) loss:3.197 lr:0.0000100 epoch_Time:24631.0min: [2024-01-05 21:41:17,863][model8_pretrain.py][INFO] Epoch:[0/2](717000/4588595) loss:2.947 lr:0.0000100 epoch_Time:24631.0min: [2024-01-05 21:41:17,863][model8_pretrain.py][INFO] Epoch:[0/2](717000/4588595) loss:2.106 lr:0.0000100 epoch_Time:24631.0min: [2024-01-05 21:41:17,863][model8_pretrain.py][INFO] Epoch:[0/2](717000/4588595) loss:2.620 lr:0.0000100 epoch_Time:24631.0min: [2024-01-05 
21:41:17,863][model8_pretrain.py][INFO] Epoch:[0/2](717000/4588595) loss:3.644 lr:0.0000100 epoch_Time:24631.0min: [2024-01-05 21:41:17,863][model8_pretrain.py][INFO] Epoch:[0/2](717000/4588595) loss:2.777 lr:0.0000100 epoch_Time:24631.0min: [2024-01-05 21:41:54,808][model8_pretrain.py][INFO] Epoch:[0/2](717100/4588595) loss:2.890 lr:0.0000100 epoch_Time:24630.0min: [2024-01-05 21:41:54,808][model8_pretrain.py][INFO] Epoch:[0/2](717100/4588595) loss:2.694 lr:0.0000100 epoch_Time:24630.0min: [2024-01-05 21:41:54,808][model8_pretrain.py][INFO] Epoch:[0/2](717100/4588595) loss:2.853 lr:0.0000100 epoch_Time:24630.0min: [2024-01-05 21:41:54,808][model8_pretrain.py][INFO] Epoch:[0/2](717100/4588595) loss:2.795 lr:0.0000100 epoch_Time:24630.0min: [2024-01-05 21:41:54,808][model8_pretrain.py][INFO] Epoch:[0/2](717100/4588595) loss:2.558 lr:0.0000100 epoch_Time:24630.0min: [2024-01-05 21:41:54,808][model8_pretrain.py][INFO] Epoch:[0/2](717100/4588595) loss:2.824 lr:0.0000100 epoch_Time:24630.0min: [2024-01-05 21:41:54,808][model8_pretrain.py][INFO] Epoch:[0/2](717100/4588595) loss:3.016 lr:0.0000100 epoch_Time:24630.0min: [2024-01-05 21:41:54,808][model8_pretrain.py][INFO] Epoch:[0/2](717100/4588595) loss:3.450 lr:0.0000100 epoch_Time:24630.0min: [2024-01-05 21:42:31,763][model8_pretrain.py][INFO] Epoch:[0/2](717200/4588595) loss:2.730 lr:0.0000100 epoch_Time:24630.0min: [2024-01-05 21:42:31,763][model8_pretrain.py][INFO] Epoch:[0/2](717200/4588595) loss:2.453 lr:0.0000100 epoch_Time:24630.0min: [2024-01-05 21:42:31,763][model8_pretrain.py][INFO] Epoch:[0/2](717200/4588595) loss:3.171 lr:0.0000100 epoch_Time:24630.0min: [2024-01-05 21:42:31,763][model8_pretrain.py][INFO] Epoch:[0/2](717200/4588595) loss:2.759 lr:0.0000100 epoch_Time:24630.0min: [2024-01-05 21:42:31,763][model8_pretrain.py][INFO] Epoch:[0/2](717200/4588595) loss:2.545 lr:0.0000100 epoch_Time:24630.0min: [2024-01-05 21:42:31,763][model8_pretrain.py][INFO] Epoch:[0/2](717200/4588595) loss:2.718 lr:0.0000100 
epoch_Time:24630.0min: [2024-01-05 21:42:31,763][model8_pretrain.py][INFO] Epoch:[0/2](717200/4588595) loss:2.194 lr:0.0000100 epoch_Time:24630.0min: [2024-01-05 21:42:31,764][model8_pretrain.py][INFO] Epoch:[0/2](717200/4588595) loss:2.776 lr:0.0000100 epoch_Time:24630.0min: [2024-01-05 21:43:08,709][model8_pretrain.py][INFO] Epoch:[0/2](717300/4588595) loss:3.048 lr:0.0000100 epoch_Time:24628.0min: [2024-01-05 21:43:08,709][model8_pretrain.py][INFO] Epoch:[0/2](717300/4588595) loss:2.599 lr:0.0000100 epoch_Time:24628.0min: [2024-01-05 21:43:08,710][model8_pretrain.py][INFO] Epoch:[0/2](717300/4588595) loss:2.355 lr:0.0000100 epoch_Time:24628.0min: [2024-01-05 21:43:08,710][model8_pretrain.py][INFO] Epoch:[0/2](717300/4588595) loss:2.894 lr:0.0000100 epoch_Time:24628.0min: [2024-01-05 21:43:08,710][model8_pretrain.py][INFO] Epoch:[0/2](717300/4588595) loss:2.577 lr:0.0000100 epoch_Time:24628.0min: [2024-01-05 21:43:08,710][model8_pretrain.py][INFO] Epoch:[0/2](717300/4588595) loss:2.643 lr:0.0000100 epoch_Time:24628.0min: [2024-01-05 21:43:08,710][model8_pretrain.py][INFO] Epoch:[0/2](717300/4588595) loss:2.711 lr:0.0000100 epoch_Time:24628.0min: [2024-01-05 21:43:08,710][model8_pretrain.py][INFO] Epoch:[0/2](717300/4588595) loss:2.539 lr:0.0000100 epoch_Time:24628.0min: [2024-01-05 21:43:45,647][model8_pretrain.py][INFO] Epoch:[0/2](717400/4588595) loss:2.477 lr:0.0000100 epoch_Time:24628.0min: [2024-01-05 21:43:45,647][model8_pretrain.py][INFO] Epoch:[0/2](717400/4588595) loss:2.790 lr:0.0000100 epoch_Time:24628.0min: [2024-01-05 21:43:45,647][model8_pretrain.py][INFO] Epoch:[0/2](717400/4588595) loss:2.869 lr:0.0000100 epoch_Time:24628.0min: [2024-01-05 21:43:45,647][model8_pretrain.py][INFO] Epoch:[0/2](717400/4588595) loss:2.836 lr:0.0000100 epoch_Time:24628.0min: [2024-01-05 21:43:45,647][model8_pretrain.py][INFO] Epoch:[0/2](717400/4588595) loss:3.459 lr:0.0000100 epoch_Time:24628.0min: [2024-01-05 21:43:45,647][model8_pretrain.py][INFO] 
Epoch:[0/2](717400/4588595) loss:2.488 lr:0.0000100 epoch_Time:24628.0min: [2024-01-05 21:43:45,647][model8_pretrain.py][INFO] Epoch:[0/2](717400/4588595) loss:3.188 lr:0.0000100 epoch_Time:24628.0min: [2024-01-05 21:43:45,647][model8_pretrain.py][INFO] Epoch:[0/2](717400/4588595) loss:2.405 lr:0.0000100 epoch_Time:24628.0min: [2024-01-05 21:44:24,291][model8_pretrain.py][INFO] Epoch:[0/2](717500/4588595) loss:2.843 lr:0.0000100 epoch_Time:24627.0min: [2024-01-05 21:44:24,291][model8_pretrain.py][INFO] Epoch:[0/2](717500/4588595) loss:2.416 lr:0.0000100 epoch_Time:24627.0min: [2024-01-05 21:44:24,291][model8_pretrain.py][INFO] Epoch:[0/2](717500/4588595) loss:2.896 lr:0.0000100 epoch_Time:24627.0min: [2024-01-05 21:44:24,291][model8_pretrain.py][INFO] Epoch:[0/2](717500/4588595) loss:2.707 lr:0.0000100 epoch_Time:24627.0min: [2024-01-05 21:44:24,291][model8_pretrain.py][INFO] Epoch:[0/2](717500/4588595) loss:2.875 lr:0.0000100 epoch_Time:24627.0min: [2024-01-05 21:44:24,291][model8_pretrain.py][INFO] Epoch:[0/2](717500/4588595) loss:2.732 lr:0.0000100 epoch_Time:24627.0min: [2024-01-05 21:44:24,291][model8_pretrain.py][INFO] Epoch:[0/2](717500/4588595) loss:2.800 lr:0.0000100 epoch_Time:24627.0min: [2024-01-05 21:44:24,292][model8_pretrain.py][INFO] Epoch:[0/2](717500/4588595) loss:2.847 lr:0.0000100 epoch_Time:24627.0min: [2024-01-05 21:45:11,396][model8_pretrain.py][INFO] Epoch:[0/2](717600/4588595) loss:3.339 lr:0.0000100 epoch_Time:24627.0min: [2024-01-05 21:45:11,396][model8_pretrain.py][INFO] Epoch:[0/2](717600/4588595) loss:2.606 lr:0.0000100 epoch_Time:24627.0min: [2024-01-05 21:45:11,396][model8_pretrain.py][INFO] Epoch:[0/2](717600/4588595) loss:3.043 lr:0.0000100 epoch_Time:24627.0min: [2024-01-05 21:45:11,396][model8_pretrain.py][INFO] Epoch:[0/2](717600/4588595) loss:2.550 lr:0.0000100 epoch_Time:24627.0min: [2024-01-05 21:45:11,396][model8_pretrain.py][INFO] Epoch:[0/2](717600/4588595) loss:3.050 lr:0.0000100 epoch_Time:24627.0min: [2024-01-05 
21:45:11,396][model8_pretrain.py][INFO] Epoch:[0/2](717600/4588595) loss:3.183 lr:0.0000100 epoch_Time:24627.0min: [2024-01-05 21:45:11,396][model8_pretrain.py][INFO] Epoch:[0/2](717600/4588595) loss:2.760 lr:0.0000100 epoch_Time:24627.0min: [2024-01-05 21:45:11,396][model8_pretrain.py][INFO] Epoch:[0/2](717600/4588595) loss:2.452 lr:0.0000100 epoch_Time:24627.0min: [2024-01-05 21:45:48,325][model8_pretrain.py][INFO] Epoch:[0/2](717700/4588595) loss:2.589 lr:0.0000100 epoch_Time:24626.0min: [2024-01-05 21:45:48,325][model8_pretrain.py][INFO] Epoch:[0/2](717700/4588595) loss:2.422 lr:0.0000100 epoch_Time:24626.0min: [2024-01-05 21:45:48,325][model8_pretrain.py][INFO] Epoch:[0/2](717700/4588595) loss:2.508 lr:0.0000100 epoch_Time:24626.0min: [2024-01-05 21:45:48,325][model8_pretrain.py][INFO] Epoch:[0/2](717700/4588595) loss:2.933 lr:0.0000100 epoch_Time:24626.0min: [2024-01-05 21:45:48,325][model8_pretrain.py][INFO] Epoch:[0/2](717700/4588595) loss:3.003 lr:0.0000100 epoch_Time:24626.0min: [2024-01-05 21:45:48,325][model8_pretrain.py][INFO] Epoch:[0/2](717700/4588595) loss:2.209 lr:0.0000100 epoch_Time:24626.0min: [2024-01-05 21:45:48,325][model8_pretrain.py][INFO] Epoch:[0/2](717700/4588595) loss:2.622 lr:0.0000100 epoch_Time:24626.0min: [2024-01-05 21:45:48,325][model8_pretrain.py][INFO] Epoch:[0/2](717700/4588595) loss:2.626 lr:0.0000100 epoch_Time:24626.0min: [2024-01-05 21:46:25,256][model8_pretrain.py][INFO] Epoch:[0/2](717800/4588595) loss:2.520 lr:0.0000100 epoch_Time:24626.0min: [2024-01-05 21:46:25,256][model8_pretrain.py][INFO] Epoch:[0/2](717800/4588595) loss:2.128 lr:0.0000100 epoch_Time:24626.0min: [2024-01-05 21:46:25,256][model8_pretrain.py][INFO] Epoch:[0/2](717800/4588595) loss:2.805 lr:0.0000100 epoch_Time:24626.0min: [2024-01-05 21:46:25,256][model8_pretrain.py][INFO] Epoch:[0/2](717800/4588595) loss:3.280 lr:0.0000100 epoch_Time:24626.0min: [2024-01-05 21:46:25,256][model8_pretrain.py][INFO] Epoch:[0/2](717800/4588595) loss:2.959 lr:0.0000100 
epoch_Time:24626.0min: [2024-01-05 21:46:25,256][model8_pretrain.py][INFO] Epoch:[0/2](717800/4588595) loss:2.695 lr:0.0000100 epoch_Time:24626.0min: [2024-01-05 21:46:25,256][model8_pretrain.py][INFO] Epoch:[0/2](717800/4588595) loss:2.874 lr:0.0000100 epoch_Time:24626.0min: [2024-01-05 21:46:25,256][model8_pretrain.py][INFO] Epoch:[0/2](717800/4588595) loss:2.486 lr:0.0000100 epoch_Time:24626.0min: [2024-01-05 21:47:02,197][model8_pretrain.py][INFO] Epoch:[0/2](717900/4588595) loss:3.316 lr:0.0000100 epoch_Time:24625.0min: [2024-01-05 21:47:02,197][model8_pretrain.py][INFO] Epoch:[0/2](717900/4588595) loss:2.673 lr:0.0000100 epoch_Time:24625.0min: [2024-01-05 21:47:02,197][model8_pretrain.py][INFO] Epoch:[0/2](717900/4588595) loss:3.120 lr:0.0000100 epoch_Time:24625.0min: [2024-01-05 21:47:02,197][model8_pretrain.py][INFO] Epoch:[0/2](717900/4588595) loss:2.984 lr:0.0000100 epoch_Time:24625.0min: [2024-01-05 21:47:02,198][model8_pretrain.py][INFO] Epoch:[0/2](717900/4588595) loss:2.592 lr:0.0000100 epoch_Time:24625.0min: [2024-01-05 21:47:02,198][model8_pretrain.py][INFO] Epoch:[0/2](717900/4588595) loss:2.759 lr:0.0000100 epoch_Time:24625.0min: [2024-01-05 21:47:02,198][model8_pretrain.py][INFO] Epoch:[0/2](717900/4588595) loss:3.040 lr:0.0000100 epoch_Time:24625.0min: [2024-01-05 21:47:02,199][model8_pretrain.py][INFO] Epoch:[0/2](717900/4588595) loss:2.129 lr:0.0000100 epoch_Time:24625.0min: [2024-01-05 21:47:39,128][model8_pretrain.py][INFO] Epoch:[0/2](718000/4588595) loss:2.483 lr:0.0000100 epoch_Time:24625.0min: [2024-01-05 21:47:39,128][model8_pretrain.py][INFO] Epoch:[0/2](718000/4588595) loss:2.620 lr:0.0000100 epoch_Time:24625.0min: [2024-01-05 21:47:39,128][model8_pretrain.py][INFO] Epoch:[0/2](718000/4588595) loss:2.628 lr:0.0000100 epoch_Time:24625.0min: [2024-01-05 21:47:39,128][model8_pretrain.py][INFO] Epoch:[0/2](718000/4588595) loss:3.107 lr:0.0000100 epoch_Time:24625.0min: [2024-01-05 21:47:39,128][model8_pretrain.py][INFO] 
Epoch:[0/2](718000/4588595) loss:3.279 lr:0.0000100 epoch_Time:24625.0min: [2024-01-05 21:47:39,128][model8_pretrain.py][INFO] Epoch:[0/2](718000/4588595) loss:2.941 lr:0.0000100 epoch_Time:24625.0min: [2024-01-05 21:47:39,128][model8_pretrain.py][INFO] Epoch:[0/2](718000/4588595) loss:2.517 lr:0.0000100 epoch_Time:24625.0min: [2024-01-05 21:47:39,128][model8_pretrain.py][INFO] Epoch:[0/2](718000/4588595) loss:2.638 lr:0.0000100 epoch_Time:24625.0min: [2024-01-05 21:48:16,061][model8_pretrain.py][INFO] Epoch:[0/2](718100/4588595) loss:2.884 lr:0.0000100 epoch_Time:24624.0min: [2024-01-05 21:48:16,061][model8_pretrain.py][INFO] Epoch:[0/2](718100/4588595) loss:3.036 lr:0.0000100 epoch_Time:24624.0min: [2024-01-05 21:48:16,061][model8_pretrain.py][INFO] Epoch:[0/2](718100/4588595) loss:3.216 lr:0.0000100 epoch_Time:24624.0min: [2024-01-05 21:48:16,061][model8_pretrain.py][INFO] Epoch:[0/2](718100/4588595) loss:2.409 lr:0.0000100 epoch_Time:24624.0min: [2024-01-05 21:48:16,061][model8_pretrain.py][INFO] Epoch:[0/2](718100/4588595) loss:2.989 lr:0.0000100 epoch_Time:24624.0min: [2024-01-05 21:48:16,061][model8_pretrain.py][INFO] Epoch:[0/2](718100/4588595) loss:2.649 lr:0.0000100 epoch_Time:24624.0min: [2024-01-05 21:48:16,061][model8_pretrain.py][INFO] Epoch:[0/2](718100/4588595) loss:3.324 lr:0.0000100 epoch_Time:24624.0min: [2024-01-05 21:48:16,061][model8_pretrain.py][INFO] Epoch:[0/2](718100/4588595) loss:3.210 lr:0.0000100 epoch_Time:24624.0min: [2024-01-05 21:48:52,992][model8_pretrain.py][INFO] Epoch:[0/2](718200/4588595) loss:2.309 lr:0.0000100 epoch_Time:24622.0min: [2024-01-05 21:48:52,992][model8_pretrain.py][INFO] Epoch:[0/2](718200/4588595) loss:2.787 lr:0.0000100 epoch_Time:24622.0min: [2024-01-05 21:48:52,992][model8_pretrain.py][INFO] Epoch:[0/2](718200/4588595) loss:2.799 lr:0.0000100 epoch_Time:24622.0min: [2024-01-05 21:48:52,992][model8_pretrain.py][INFO] Epoch:[0/2](718200/4588595) loss:3.032 lr:0.0000100 epoch_Time:24622.0min: [2024-01-05 
21:48:52,992][model8_pretrain.py][INFO] Epoch:[0/2](718200/4588595) loss:2.953 lr:0.0000100 epoch_Time:24622.0min: [2024-01-05 21:48:52,992][model8_pretrain.py][INFO] Epoch:[0/2](718200/4588595) loss:3.188 lr:0.0000100 epoch_Time:24622.0min: [2024-01-05 21:48:52,992][model8_pretrain.py][INFO] Epoch:[0/2](718200/4588595) loss:3.189 lr:0.0000100 epoch_Time:24622.0min: [2024-01-05 21:48:52,992][model8_pretrain.py][INFO] Epoch:[0/2](718200/4588595) loss:2.761 lr:0.0000100 epoch_Time:24622.0min: [2024-01-05 21:49:31,641][model8_pretrain.py][INFO] Epoch:[0/2](718300/4588595) loss:2.782 lr:0.0000100 epoch_Time:24623.0min: [2024-01-05 21:49:31,641][model8_pretrain.py][INFO] Epoch:[0/2](718300/4588595) loss:2.136 lr:0.0000100 epoch_Time:24623.0min: [2024-01-05 21:49:31,641][model8_pretrain.py][INFO] Epoch:[0/2](718300/4588595) loss:2.838 lr:0.0000100 epoch_Time:24623.0min: [2024-01-05 21:49:31,641][model8_pretrain.py][INFO] Epoch:[0/2](718300/4588595) loss:3.189 lr:0.0000100 epoch_Time:24623.0min: [2024-01-05 21:49:31,641][model8_pretrain.py][INFO] Epoch:[0/2](718300/4588595) loss:2.908 lr:0.0000100 epoch_Time:24623.0min: [2024-01-05 21:49:31,641][model8_pretrain.py][INFO] Epoch:[0/2](718300/4588595) loss:2.611 lr:0.0000100 epoch_Time:24623.0min: [2024-01-05 21:49:31,641][model8_pretrain.py][INFO] Epoch:[0/2](718300/4588595) loss:2.816 lr:0.0000100 epoch_Time:24623.0min: [2024-01-05 21:49:31,641][model8_pretrain.py][INFO] Epoch:[0/2](718300/4588595) loss:2.078 lr:0.0000100 epoch_Time:24623.0min: [2024-01-05 21:50:18,724][model8_pretrain.py][INFO] Epoch:[0/2](718400/4588595) loss:2.618 lr:0.0000100 epoch_Time:24622.0min: [2024-01-05 21:50:18,724][model8_pretrain.py][INFO] Epoch:[0/2](718400/4588595) loss:2.621 lr:0.0000100 epoch_Time:24622.0min: [2024-01-05 21:50:18,724][model8_pretrain.py][INFO] Epoch:[0/2](718400/4588595) loss:2.778 lr:0.0000100 epoch_Time:24622.0min: [2024-01-05 21:50:18,724][model8_pretrain.py][INFO] Epoch:[0/2](718400/4588595) loss:3.046 lr:0.0000100 
epoch_Time:24622.0min: [2024-01-05 21:50:18,724][model8_pretrain.py][INFO] Epoch:[0/2](718400/4588595) loss:3.058 lr:0.0000100 epoch_Time:24622.0min: [2024-01-05 21:50:18,724][model8_pretrain.py][INFO] Epoch:[0/2](718400/4588595) loss:3.009 lr:0.0000100 epoch_Time:24622.0min: [2024-01-05 21:50:18,724][model8_pretrain.py][INFO] Epoch:[0/2](718400/4588595) loss:2.831 lr:0.0000100 epoch_Time:24622.0min: [2024-01-05 21:50:18,724][model8_pretrain.py][INFO] Epoch:[0/2](718400/4588595) loss:2.877 lr:0.0000100 epoch_Time:24622.0min: [2024-01-05 21:50:55,649][model8_pretrain.py][INFO] Epoch:[0/2](718500/4588595) loss:3.008 lr:0.0000100 epoch_Time:24621.0min: [2024-01-05 21:50:55,649][model8_pretrain.py][INFO] Epoch:[0/2](718500/4588595) loss:3.020 lr:0.0000100 epoch_Time:24621.0min: [2024-01-05 21:50:55,649][model8_pretrain.py][INFO] Epoch:[0/2](718500/4588595) loss:3.328 lr:0.0000100 epoch_Time:24621.0min: [2024-01-05 21:50:55,649][model8_pretrain.py][INFO] Epoch:[0/2](718500/4588595) loss:3.041 lr:0.0000100 epoch_Time:24621.0min: [2024-01-05 21:50:55,649][model8_pretrain.py][INFO] Epoch:[0/2](718500/4588595) loss:3.141 lr:0.0000100 epoch_Time:24621.0min: [2024-01-05 21:50:55,649][model8_pretrain.py][INFO] Epoch:[0/2](718500/4588595) loss:2.773 lr:0.0000100 epoch_Time:24621.0min: [2024-01-05 21:50:55,649][model8_pretrain.py][INFO] Epoch:[0/2](718500/4588595) loss:3.270 lr:0.0000100 epoch_Time:24621.0min: [2024-01-05 21:50:55,650][model8_pretrain.py][INFO] Epoch:[0/2](718500/4588595) loss:2.739 lr:0.0000100 epoch_Time:24621.0min: [2024-01-05 21:51:32,587][model8_pretrain.py][INFO] Epoch:[0/2](718600/4588595) loss:2.843 lr:0.0000100 epoch_Time:24621.0min: [2024-01-05 21:51:32,587][model8_pretrain.py][INFO] Epoch:[0/2](718600/4588595) loss:2.914 lr:0.0000100 epoch_Time:24621.0min: [2024-01-05 21:51:32,587][model8_pretrain.py][INFO] Epoch:[0/2](718600/4588595) loss:2.556 lr:0.0000100 epoch_Time:24621.0min: [2024-01-05 21:51:32,587][model8_pretrain.py][INFO] 
Epoch:[0/2](718600/4588595) loss:1.938 lr:0.0000100 epoch_Time:24621.0min: [2024-01-05 21:51:32,587][model8_pretrain.py][INFO] Epoch:[0/2](718600/4588595) loss:3.300 lr:0.0000100 epoch_Time:24621.0min: [2024-01-05 21:51:32,587][model8_pretrain.py][INFO] Epoch:[0/2](718600/4588595) loss:2.841 lr:0.0000100 epoch_Time:24621.0min: [2024-01-05 21:51:32,587][model8_pretrain.py][INFO] Epoch:[0/2](718600/4588595) loss:2.648 lr:0.0000100 epoch_Time:24621.0min: [2024-01-05 21:51:32,587][model8_pretrain.py][INFO] Epoch:[0/2](718600/4588595) loss:2.597 lr:0.0000100 epoch_Time:24621.0min: [2024-01-05 21:52:09,530][model8_pretrain.py][INFO] Epoch:[0/2](718700/4588595) loss:2.452 lr:0.0000100 epoch_Time:24620.0min: [2024-01-05 21:52:09,530][model8_pretrain.py][INFO] Epoch:[0/2](718700/4588595) loss:3.134 lr:0.0000100 epoch_Time:24620.0min: [2024-01-05 21:52:09,530][model8_pretrain.py][INFO] Epoch:[0/2](718700/4588595) loss:2.897 lr:0.0000100 epoch_Time:24620.0min: [2024-01-05 21:52:09,530][model8_pretrain.py][INFO] Epoch:[0/2](718700/4588595) loss:2.626 lr:0.0000100 epoch_Time:24620.0min: [2024-01-05 21:52:09,530][model8_pretrain.py][INFO] Epoch:[0/2](718700/4588595) loss:2.793 lr:0.0000100 epoch_Time:24620.0min: [2024-01-05 21:52:09,530][model8_pretrain.py][INFO] Epoch:[0/2](718700/4588595) loss:2.918 lr:0.0000100 epoch_Time:24620.0min: [2024-01-05 21:52:09,530][model8_pretrain.py][INFO] Epoch:[0/2](718700/4588595) loss:2.368 lr:0.0000100 epoch_Time:24620.0min: [2024-01-05 21:52:09,530][model8_pretrain.py][INFO] Epoch:[0/2](718700/4588595) loss:3.210 lr:0.0000100 epoch_Time:24620.0min: [2024-01-05 21:52:46,465][model8_pretrain.py][INFO] Epoch:[0/2](718800/4588595) loss:2.942 lr:0.0000100 epoch_Time:24620.0min: [2024-01-05 21:52:46,465][model8_pretrain.py][INFO] Epoch:[0/2](718800/4588595) loss:2.876 lr:0.0000100 epoch_Time:24620.0min: [2024-01-05 21:52:46,465][model8_pretrain.py][INFO] Epoch:[0/2](718800/4588595) loss:3.063 lr:0.0000100 epoch_Time:24620.0min: [2024-01-05 
21:52:46,465][model8_pretrain.py][INFO] Epoch:[0/2](718800/4588595) loss:2.830 lr:0.0000100 epoch_Time:24620.0min: [2024-01-05 21:52:46,465][model8_pretrain.py][INFO] Epoch:[0/2](718800/4588595) loss:3.054 lr:0.0000100 epoch_Time:24620.0min: [2024-01-05 21:52:46,465][model8_pretrain.py][INFO] Epoch:[0/2](718800/4588595) loss:3.293 lr:0.0000100 epoch_Time:24620.0min: [2024-01-05 21:52:46,465][model8_pretrain.py][INFO] Epoch:[0/2](718800/4588595) loss:3.045 lr:0.0000100 epoch_Time:24620.0min: [2024-01-05 21:52:46,465][model8_pretrain.py][INFO] Epoch:[0/2](718800/4588595) loss:2.907 lr:0.0000100 epoch_Time:24620.0min: [2024-01-05 21:53:23,410][model8_pretrain.py][INFO] Epoch:[0/2](718900/4588595) loss:2.773 lr:0.0000100 epoch_Time:24619.0min: [2024-01-05 21:53:23,410][model8_pretrain.py][INFO] Epoch:[0/2](718900/4588595) loss:2.458 lr:0.0000100 epoch_Time:24619.0min: [2024-01-05 21:53:23,410][model8_pretrain.py][INFO] Epoch:[0/2](718900/4588595) loss:3.200 lr:0.0000100 epoch_Time:24619.0min: [2024-01-05 21:53:23,410][model8_pretrain.py][INFO] Epoch:[0/2](718900/4588595) loss:2.051 lr:0.0000100 epoch_Time:24619.0min: [2024-01-05 21:53:23,411][model8_pretrain.py][INFO] Epoch:[0/2](718900/4588595) loss:2.833 lr:0.0000100 epoch_Time:24619.0min: [2024-01-05 21:53:23,411][model8_pretrain.py][INFO] Epoch:[0/2](718900/4588595) loss:3.047 lr:0.0000100 epoch_Time:24619.0min: [2024-01-05 21:53:23,411][model8_pretrain.py][INFO] Epoch:[0/2](718900/4588595) loss:2.686 lr:0.0000100 epoch_Time:24619.0min: [2024-01-05 21:53:23,411][model8_pretrain.py][INFO] Epoch:[0/2](718900/4588595) loss:2.383 lr:0.0000100 epoch_Time:24619.0min: [2024-01-05 21:54:00,366][model8_pretrain.py][INFO] Epoch:[0/2](719000/4588595) loss:2.677 lr:0.0000100 epoch_Time:24618.0min: [2024-01-05 21:54:00,366][model8_pretrain.py][INFO] Epoch:[0/2](719000/4588595) loss:2.491 lr:0.0000100 epoch_Time:24618.0min: [2024-01-05 21:54:00,366][model8_pretrain.py][INFO] Epoch:[0/2](719000/4588595) loss:2.847 lr:0.0000100 
epoch_Time:24618.0min: [2024-01-05 21:54:00,366][model8_pretrain.py][INFO] Epoch:[0/2](719000/4588595) loss:2.849 lr:0.0000100 epoch_Time:24618.0min: [2024-01-05 21:54:00,366][model8_pretrain.py][INFO] Epoch:[0/2](719000/4588595) loss:3.389 lr:0.0000100 epoch_Time:24618.0min: [2024-01-05 21:54:00,366][model8_pretrain.py][INFO] Epoch:[0/2](719000/4588595) loss:2.771 lr:0.0000100 epoch_Time:24618.0min: [2024-01-05 21:54:00,366][model8_pretrain.py][INFO] Epoch:[0/2](719000/4588595) loss:3.007 lr:0.0000100 epoch_Time:24618.0min: [2024-01-05 21:54:00,366][model8_pretrain.py][INFO] Epoch:[0/2](719000/4588595) loss:2.768 lr:0.0000100 epoch_Time:24618.0min: [2024-01-05 21:54:37,312][model8_pretrain.py][INFO] Epoch:[0/2](719100/4588595) loss:2.854 lr:0.0000100 epoch_Time:24618.0min: [2024-01-05 21:54:37,312][model8_pretrain.py][INFO] Epoch:[0/2](719100/4588595) loss:3.096 lr:0.0000100 epoch_Time:24618.0min: [2024-01-05 21:54:37,312][model8_pretrain.py][INFO] Epoch:[0/2](719100/4588595) loss:3.126 lr:0.0000100 epoch_Time:24618.0min: [2024-01-05 21:54:37,312][model8_pretrain.py][INFO] Epoch:[0/2](719100/4588595) loss:2.479 lr:0.0000100 epoch_Time:24618.0min: [2024-01-05 21:54:37,312][model8_pretrain.py][INFO] Epoch:[0/2](719100/4588595) loss:2.679 lr:0.0000100 epoch_Time:24618.0min: [2024-01-05 21:54:37,312][model8_pretrain.py][INFO] Epoch:[0/2](719100/4588595) loss:2.439 lr:0.0000100 epoch_Time:24618.0min: [2024-01-05 21:54:37,312][model8_pretrain.py][INFO] Epoch:[0/2](719100/4588595) loss:2.779 lr:0.0000100 epoch_Time:24618.0min: [2024-01-05 21:54:37,312][model8_pretrain.py][INFO] Epoch:[0/2](719100/4588595) loss:2.782 lr:0.0000100 epoch_Time:24618.0min: [2024-01-05 21:55:26,155][model8_pretrain.py][INFO] Epoch:[0/2](719200/4588595) loss:2.779 lr:0.0000100 epoch_Time:24618.0min: [2024-01-05 21:55:26,155][model8_pretrain.py][INFO] Epoch:[0/2](719200/4588595) loss:3.004 lr:0.0000100 epoch_Time:24618.0min: [2024-01-05 21:55:26,155][model8_pretrain.py][INFO] 
Epoch:[0/2](719200/4588595) loss:3.007 lr:0.0000100 epoch_Time:24618.0min: [2024-01-05 21:55:26,155][model8_pretrain.py][INFO] Epoch:[0/2](719200/4588595) loss:2.800 lr:0.0000100 epoch_Time:24618.0min: [2024-01-05 21:55:26,155][model8_pretrain.py][INFO] Epoch:[0/2](719200/4588595) loss:3.142 lr:0.0000100 epoch_Time:24618.0min: [2024-01-05 21:55:26,155][model8_pretrain.py][INFO] Epoch:[0/2](719200/4588595) loss:2.333 lr:0.0000100 epoch_Time:24618.0min: [2024-01-05 21:55:26,155][model8_pretrain.py][INFO] Epoch:[0/2](719200/4588595) loss:3.099 lr:0.0000100 epoch_Time:24618.0min: [2024-01-05 21:55:26,155][model8_pretrain.py][INFO] Epoch:[0/2](719200/4588595) loss:2.704 lr:0.0000100 epoch_Time:24618.0min: [2024-01-05 21:56:03,089][model8_pretrain.py][INFO] Epoch:[0/2](719300/4588595) loss:2.853 lr:0.0000100 epoch_Time:24617.0min: [2024-01-05 21:56:03,089][model8_pretrain.py][INFO] Epoch:[0/2](719300/4588595) loss:2.758 lr:0.0000100 epoch_Time:24617.0min: [2024-01-05 21:56:03,089][model8_pretrain.py][INFO] Epoch:[0/2](719300/4588595) loss:2.907 lr:0.0000100 epoch_Time:24617.0min: [2024-01-05 21:56:03,089][model8_pretrain.py][INFO] Epoch:[0/2](719300/4588595) loss:3.040 lr:0.0000100 epoch_Time:24617.0min: [2024-01-05 21:56:03,089][model8_pretrain.py][INFO] Epoch:[0/2](719300/4588595) loss:2.527 lr:0.0000100 epoch_Time:24617.0min: [2024-01-05 21:56:03,089][model8_pretrain.py][INFO] Epoch:[0/2](719300/4588595) loss:2.984 lr:0.0000100 epoch_Time:24617.0min: [2024-01-05 21:56:03,089][model8_pretrain.py][INFO] Epoch:[0/2](719300/4588595) loss:2.626 lr:0.0000100 epoch_Time:24617.0min: [2024-01-05 21:56:03,089][model8_pretrain.py][INFO] Epoch:[0/2](719300/4588595) loss:2.996 lr:0.0000100 epoch_Time:24617.0min: [2024-01-05 21:56:40,028][model8_pretrain.py][INFO] Epoch:[0/2](719400/4588595) loss:2.856 lr:0.0000100 epoch_Time:24616.0min: [2024-01-05 21:56:40,028][model8_pretrain.py][INFO] Epoch:[0/2](719400/4588595) loss:2.943 lr:0.0000100 epoch_Time:24616.0min: [2024-01-05 
21:56:40,028][model8_pretrain.py][INFO] Epoch:[0/2](719400/4588595) loss:3.086 lr:0.0000100 epoch_Time:24616.0min: [2024-01-05 21:56:40,028][model8_pretrain.py][INFO] Epoch:[0/2](719400/4588595) loss:2.996 lr:0.0000100 epoch_Time:24616.0min: [2024-01-05 21:56:40,028][model8_pretrain.py][INFO] Epoch:[0/2](719400/4588595) loss:3.132 lr:0.0000100 epoch_Time:24616.0min: [2024-01-05 21:56:40,028][model8_pretrain.py][INFO] Epoch:[0/2](719400/4588595) loss:3.014 lr:0.0000100 epoch_Time:24616.0min: [2024-01-05 21:56:40,028][model8_pretrain.py][INFO] Epoch:[0/2](719400/4588595) loss:2.285 lr:0.0000100 epoch_Time:24616.0min: [2024-01-05 21:56:40,028][model8_pretrain.py][INFO] Epoch:[0/2](719400/4588595) loss:3.186 lr:0.0000100 epoch_Time:24616.0min: [2024-01-05 21:57:16,975][model8_pretrain.py][INFO] Epoch:[0/2](719500/4588595) loss:3.047 lr:0.0000100 epoch_Time:24615.0min: [2024-01-05 21:57:16,975][model8_pretrain.py][INFO] Epoch:[0/2](719500/4588595) loss:2.548 lr:0.0000100 epoch_Time:24615.0min: [2024-01-05 21:57:16,975][model8_pretrain.py][INFO] Epoch:[0/2](719500/4588595) loss:3.127 lr:0.0000100 epoch_Time:24615.0min: [2024-01-05 21:57:16,975][model8_pretrain.py][INFO] Epoch:[0/2](719500/4588595) loss:3.130 lr:0.0000100 epoch_Time:24615.0min: [2024-01-05 21:57:16,975][model8_pretrain.py][INFO] Epoch:[0/2](719500/4588595) loss:2.956 lr:0.0000100 epoch_Time:24615.0min: [2024-01-05 21:57:16,975][model8_pretrain.py][INFO] Epoch:[0/2](719500/4588595) loss:2.681 lr:0.0000100 epoch_Time:24615.0min: [2024-01-05 21:57:16,975][model8_pretrain.py][INFO] Epoch:[0/2](719500/4588595) loss:2.778 lr:0.0000100 epoch_Time:24615.0min: [2024-01-05 21:57:16,975][model8_pretrain.py][INFO] Epoch:[0/2](719500/4588595) loss:2.705 lr:0.0000100 epoch_Time:24615.0min: [2024-01-05 21:57:53,908][model8_pretrain.py][INFO] Epoch:[0/2](719600/4588595) loss:2.713 lr:0.0000100 epoch_Time:24614.0min: [2024-01-05 21:57:53,908][model8_pretrain.py][INFO] Epoch:[0/2](719600/4588595) loss:2.770 lr:0.0000100 
epoch_Time:24614.0min: [2024-01-05 21:57:53,908][model8_pretrain.py][INFO] Epoch:[0/2](719600/4588595) loss:2.950 lr:0.0000100 epoch_Time:24614.0min: [2024-01-05 21:57:53,909][model8_pretrain.py][INFO] Epoch:[0/2](719600/4588595) loss:2.576 lr:0.0000100 epoch_Time:24614.0min: [2024-01-05 21:57:53,909][model8_pretrain.py][INFO] Epoch:[0/2](719600/4588595) loss:2.943 lr:0.0000100 epoch_Time:24614.0min: [2024-01-05 21:57:53,908][model8_pretrain.py][INFO] Epoch:[0/2](719600/4588595) loss:3.026 lr:0.0000100 epoch_Time:24614.0min: [2024-01-05 21:57:53,909][model8_pretrain.py][INFO] Epoch:[0/2](719600/4588595) loss:2.878 lr:0.0000100 epoch_Time:24614.0min: [2024-01-05 21:57:53,909][model8_pretrain.py][INFO] Epoch:[0/2](719600/4588595) loss:2.582 lr:0.0000100 epoch_Time:24614.0min: [2024-01-05 21:58:30,817][model8_pretrain.py][INFO] Epoch:[0/2](719700/4588595) loss:2.680 lr:0.0000100 epoch_Time:24614.0min: [2024-01-05 21:58:30,817][model8_pretrain.py][INFO] Epoch:[0/2](719700/4588595) loss:2.675 lr:0.0000100 epoch_Time:24614.0min: [2024-01-05 21:58:30,817][model8_pretrain.py][INFO] Epoch:[0/2](719700/4588595) loss:2.816 lr:0.0000100 epoch_Time:24614.0min: [2024-01-05 21:58:30,817][model8_pretrain.py][INFO] Epoch:[0/2](719700/4588595) loss:2.968 lr:0.0000100 epoch_Time:24614.0min: [2024-01-05 21:58:30,817][model8_pretrain.py][INFO] Epoch:[0/2](719700/4588595) loss:2.861 lr:0.0000100 epoch_Time:24614.0min: [2024-01-05 21:58:30,817][model8_pretrain.py][INFO] Epoch:[0/2](719700/4588595) loss:3.152 lr:0.0000100 epoch_Time:24614.0min: [2024-01-05 21:58:30,817][model8_pretrain.py][INFO] Epoch:[0/2](719700/4588595) loss:2.407 lr:0.0000100 epoch_Time:24614.0min: [2024-01-05 21:58:30,817][model8_pretrain.py][INFO] Epoch:[0/2](719700/4588595) loss:1.649 lr:0.0000100 epoch_Time:24614.0min: [2024-01-05 21:59:07,746][model8_pretrain.py][INFO] Epoch:[0/2](719800/4588595) loss:2.579 lr:0.0000100 epoch_Time:24613.0min: [2024-01-05 21:59:07,746][model8_pretrain.py][INFO] 
Epoch:[0/2](719800/4588595) loss:3.117 lr:0.0000100 epoch_Time:24613.0min: [2024-01-05 21:59:07,746][model8_pretrain.py][INFO] Epoch:[0/2](719800/4588595) loss:2.477 lr:0.0000100 epoch_Time:24613.0min: [2024-01-05 21:59:07,746][model8_pretrain.py][INFO] Epoch:[0/2](719800/4588595) loss:2.844 lr:0.0000100 epoch_Time:24613.0min: [2024-01-05 21:59:07,746][model8_pretrain.py][INFO] Epoch:[0/2](719800/4588595) loss:3.323 lr:0.0000100 epoch_Time:24613.0min: [2024-01-05 21:59:07,746][model8_pretrain.py][INFO] Epoch:[0/2](719800/4588595) loss:2.906 lr:0.0000100 epoch_Time:24613.0min: [2024-01-05 21:59:07,746][model8_pretrain.py][INFO] Epoch:[0/2](719800/4588595) loss:2.551 lr:0.0000100 epoch_Time:24613.0min: [2024-01-05 21:59:07,746][model8_pretrain.py][INFO] Epoch:[0/2](719800/4588595) loss:2.967 lr:0.0000100 epoch_Time:24613.0min: [2024-01-05 21:59:44,676][model8_pretrain.py][INFO] Epoch:[0/2](719900/4588595) loss:3.267 lr:0.0000100 epoch_Time:24613.0min: [2024-01-05 21:59:44,676][model8_pretrain.py][INFO] Epoch:[0/2](719900/4588595) loss:2.624 lr:0.0000100 epoch_Time:24613.0min: [2024-01-05 21:59:44,676][model8_pretrain.py][INFO] Epoch:[0/2](719900/4588595) loss:2.527 lr:0.0000100 epoch_Time:24613.0min: [2024-01-05 21:59:44,676][model8_pretrain.py][INFO] Epoch:[0/2](719900/4588595) loss:2.716 lr:0.0000100 epoch_Time:24613.0min: [2024-01-05 21:59:44,676][model8_pretrain.py][INFO] Epoch:[0/2](719900/4588595) loss:2.948 lr:0.0000100 epoch_Time:24613.0min: [2024-01-05 21:59:44,676][model8_pretrain.py][INFO] Epoch:[0/2](719900/4588595) loss:3.277 lr:0.0000100 epoch_Time:24613.0min: [2024-01-05 21:59:44,676][model8_pretrain.py][INFO] Epoch:[0/2](719900/4588595) loss:2.908 lr:0.0000100 epoch_Time:24613.0min: [2024-01-05 21:59:44,676][model8_pretrain.py][INFO] Epoch:[0/2](719900/4588595) loss:2.652 lr:0.0000100 epoch_Time:24613.0min: [2024-01-05 22:00:33,551][model8_pretrain.py][INFO] Epoch:[0/2](720000/4588595) loss:2.662 lr:0.0000100 epoch_Time:24613.0min: [2024-01-05 
22:00:33,551][model8_pretrain.py][INFO] Epoch:[0/2](720000/4588595) loss:2.428 lr:0.0000100 epoch_Time:24613.0min: [2024-01-05 22:00:33,551][model8_pretrain.py][INFO] Epoch:[0/2](720000/4588595) loss:3.038 lr:0.0000100 epoch_Time:24613.0min: [2024-01-05 22:00:33,551][model8_pretrain.py][INFO] Epoch:[0/2](720000/4588595) loss:2.878 lr:0.0000100 epoch_Time:24613.0min: [2024-01-05 22:00:33,551][model8_pretrain.py][INFO] Epoch:[0/2](720000/4588595) loss:3.139 lr:0.0000100 epoch_Time:24613.0min: [2024-01-05 22:00:33,551][model8_pretrain.py][INFO] Epoch:[0/2](720000/4588595) loss:2.252 lr:0.0000100 epoch_Time:24613.0min: [2024-01-05 22:00:33,551][model8_pretrain.py][INFO] Epoch:[0/2](720000/4588595) loss:3.174 lr:0.0000100 epoch_Time:24613.0min: [2024-01-05 22:00:33,551][model8_pretrain.py][INFO] Epoch:[0/2](720000/4588595) loss:2.757 lr:0.0000100 epoch_Time:24613.0min: [2024-01-05 22:01:10,479][model8_pretrain.py][INFO] Epoch:[0/2](720100/4588595) loss:2.767 lr:0.0000100 epoch_Time:24612.0min: [2024-01-05 22:01:10,479][model8_pretrain.py][INFO] Epoch:[0/2](720100/4588595) loss:3.055 lr:0.0000100 epoch_Time:24612.0min: [2024-01-05 22:01:10,479][model8_pretrain.py][INFO] Epoch:[0/2](720100/4588595) loss:2.646 lr:0.0000100 epoch_Time:24612.0min: [2024-01-05 22:01:10,479][model8_pretrain.py][INFO] Epoch:[0/2](720100/4588595) loss:2.703 lr:0.0000100 epoch_Time:24612.0min: [2024-01-05 22:01:10,479][model8_pretrain.py][INFO] Epoch:[0/2](720100/4588595) loss:2.996 lr:0.0000100 epoch_Time:24612.0min: [2024-01-05 22:01:10,479][model8_pretrain.py][INFO] Epoch:[0/2](720100/4588595) loss:2.792 lr:0.0000100 epoch_Time:24612.0min: [2024-01-05 22:01:10,479][model8_pretrain.py][INFO] Epoch:[0/2](720100/4588595) loss:2.670 lr:0.0000100 epoch_Time:24612.0min: [2024-01-05 22:01:10,479][model8_pretrain.py][INFO] Epoch:[0/2](720100/4588595) loss:2.874 lr:0.0000100 epoch_Time:24612.0min: [2024-01-05 22:01:47,413][model8_pretrain.py][INFO] Epoch:[0/2](720200/4588595) loss:3.166 lr:0.0000100 
epoch_Time:24612.0min: [2024-01-05 22:01:47,413][model8_pretrain.py][INFO] Epoch:[0/2](720200/4588595) loss:2.596 lr:0.0000100 epoch_Time:24612.0min: [2024-01-05 22:01:47,413][model8_pretrain.py][INFO] Epoch:[0/2](720200/4588595) loss:3.181 lr:0.0000100 epoch_Time:24612.0min: [2024-01-05 22:01:47,413][model8_pretrain.py][INFO] Epoch:[0/2](720200/4588595) loss:2.939 lr:0.0000100 epoch_Time:24612.0min: [2024-01-05 22:01:47,413][model8_pretrain.py][INFO] Epoch:[0/2](720200/4588595) loss:2.540 lr:0.0000100 epoch_Time:24612.0min: [2024-01-05 22:01:47,413][model8_pretrain.py][INFO] Epoch:[0/2](720200/4588595) loss:2.443 lr:0.0000100 epoch_Time:24612.0min: [2024-01-05 22:01:47,413][model8_pretrain.py][INFO] Epoch:[0/2](720200/4588595) loss:3.429 lr:0.0000100 epoch_Time:24612.0min: [2024-01-05 22:01:47,413][model8_pretrain.py][INFO] Epoch:[0/2](720200/4588595) loss:2.966 lr:0.0000100 epoch_Time:24612.0min: [2024-01-05 22:02:24,360][model8_pretrain.py][INFO] Epoch:[0/2](720300/4588595) loss:2.535 lr:0.0000100 epoch_Time:24611.0min: [2024-01-05 22:02:24,360][model8_pretrain.py][INFO] Epoch:[0/2](720300/4588595) loss:3.150 lr:0.0000100 epoch_Time:24611.0min: [2024-01-05 22:02:24,360][model8_pretrain.py][INFO] Epoch:[0/2](720300/4588595) loss:2.632 lr:0.0000100 epoch_Time:24611.0min: [2024-01-05 22:02:24,360][model8_pretrain.py][INFO] Epoch:[0/2](720300/4588595) loss:3.340 lr:0.0000100 epoch_Time:24611.0min: [2024-01-05 22:02:24,360][model8_pretrain.py][INFO] Epoch:[0/2](720300/4588595) loss:3.291 lr:0.0000100 epoch_Time:24611.0min: [2024-01-05 22:02:24,360][model8_pretrain.py][INFO] Epoch:[0/2](720300/4588595) loss:2.993 lr:0.0000100 epoch_Time:24611.0min: [2024-01-05 22:02:24,360][model8_pretrain.py][INFO] Epoch:[0/2](720300/4588595) loss:1.987 lr:0.0000100 epoch_Time:24611.0min: [2024-01-05 22:02:24,360][model8_pretrain.py][INFO] Epoch:[0/2](720300/4588595) loss:2.298 lr:0.0000100 epoch_Time:24611.0min: [2024-01-05 22:03:01,308][model8_pretrain.py][INFO] 
Epoch:[0/2](720400/4588595) loss:3.083 lr:0.0000100 epoch_Time:24609.0min: [2024-01-05 22:03:01,308][model8_pretrain.py][INFO] Epoch:[0/2](720400/4588595) loss:3.145 lr:0.0000100 epoch_Time:24609.0min: [2024-01-05 22:03:01,308][model8_pretrain.py][INFO] Epoch:[0/2](720400/4588595) loss:2.935 lr:0.0000100 epoch_Time:24609.0min: [2024-01-05 22:03:01,308][model8_pretrain.py][INFO] Epoch:[0/2](720400/4588595) loss:2.986 lr:0.0000100 epoch_Time:24609.0min: [2024-01-05 22:03:01,308][model8_pretrain.py][INFO] Epoch:[0/2](720400/4588595) loss:2.406 lr:0.0000100 epoch_Time:24609.0min: [2024-01-05 22:03:01,308][model8_pretrain.py][INFO] Epoch:[0/2](720400/4588595) loss:3.164 lr:0.0000100 epoch_Time:24609.0min: [2024-01-05 22:03:01,308][model8_pretrain.py][INFO] Epoch:[0/2](720400/4588595) loss:2.768 lr:0.0000100 epoch_Time:24609.0min: [2024-01-05 22:03:01,308][model8_pretrain.py][INFO] Epoch:[0/2](720400/4588595) loss:2.528 lr:0.0000100 epoch_Time:24609.0min: [2024-01-05 22:03:38,233][model8_pretrain.py][INFO] Epoch:[0/2](720500/4588595) loss:2.960 lr:0.0000100 epoch_Time:24609.0min: [2024-01-05 22:03:38,233][model8_pretrain.py][INFO] Epoch:[0/2](720500/4588595) loss:2.353 lr:0.0000100 epoch_Time:24609.0min: [2024-01-05 22:03:38,233][model8_pretrain.py][INFO] Epoch:[0/2](720500/4588595) loss:2.671 lr:0.0000100 epoch_Time:24609.0min: [2024-01-05 22:03:38,233][model8_pretrain.py][INFO] Epoch:[0/2](720500/4588595) loss:3.394 lr:0.0000100 epoch_Time:24609.0min: [2024-01-05 22:03:38,233][model8_pretrain.py][INFO] Epoch:[0/2](720500/4588595) loss:2.656 lr:0.0000100 epoch_Time:24609.0min: [2024-01-05 22:03:38,233][model8_pretrain.py][INFO] Epoch:[0/2](720500/4588595) loss:2.450 lr:0.0000100 epoch_Time:24609.0min: [2024-01-05 22:03:38,233][model8_pretrain.py][INFO] Epoch:[0/2](720500/4588595) loss:3.033 lr:0.0000100 epoch_Time:24609.0min: [2024-01-05 22:03:38,233][model8_pretrain.py][INFO] Epoch:[0/2](720500/4588595) loss:2.948 lr:0.0000100 epoch_Time:24609.0min: [2024-01-05 
22:04:15,167][model8_pretrain.py][INFO] Epoch:[0/2](720600/4588595) loss:2.973 lr:0.0000100 epoch_Time:24608.0min: [2024-01-05 22:04:15,167][model8_pretrain.py][INFO] Epoch:[0/2](720600/4588595) loss:2.726 lr:0.0000100 epoch_Time:24608.0min: [2024-01-05 22:04:15,167][model8_pretrain.py][INFO] Epoch:[0/2](720600/4588595) loss:2.949 lr:0.0000100 epoch_Time:24608.0min: [2024-01-05 22:04:15,167][model8_pretrain.py][INFO] Epoch:[0/2](720600/4588595) loss:2.340 lr:0.0000100 epoch_Time:24608.0min: [2024-01-05 22:04:15,167][model8_pretrain.py][INFO] Epoch:[0/2](720600/4588595) loss:3.329 lr:0.0000100 epoch_Time:24608.0min: [2024-01-05 22:04:15,167][model8_pretrain.py][INFO] Epoch:[0/2](720600/4588595) loss:2.975 lr:0.0000100 epoch_Time:24608.0min: [2024-01-05 22:04:15,167][model8_pretrain.py][INFO] Epoch:[0/2](720600/4588595) loss:2.989 lr:0.0000100 epoch_Time:24608.0min: [2024-01-05 22:04:15,167][model8_pretrain.py][INFO] Epoch:[0/2](720600/4588595) loss:3.235 lr:0.0000100 epoch_Time:24608.0min: [2024-01-05 22:04:52,074][model8_pretrain.py][INFO] Epoch:[0/2](720700/4588595) loss:3.136 lr:0.0000100 epoch_Time:24607.0min: [2024-01-05 22:04:52,074][model8_pretrain.py][INFO] Epoch:[0/2](720700/4588595) loss:2.710 lr:0.0000100 epoch_Time:24607.0min: [2024-01-05 22:04:52,074][model8_pretrain.py][INFO] Epoch:[0/2](720700/4588595) loss:3.314 lr:0.0000100 epoch_Time:24607.0min: [2024-01-05 22:04:52,074][model8_pretrain.py][INFO] Epoch:[0/2](720700/4588595) loss:2.605 lr:0.0000100 epoch_Time:24607.0min: [2024-01-05 22:04:52,074][model8_pretrain.py][INFO] Epoch:[0/2](720700/4588595) loss:2.936 lr:0.0000100 epoch_Time:24607.0min: [2024-01-05 22:04:52,074][model8_pretrain.py][INFO] Epoch:[0/2](720700/4588595) loss:2.389 lr:0.0000100 epoch_Time:24607.0min: [2024-01-05 22:04:52,074][model8_pretrain.py][INFO] Epoch:[0/2](720700/4588595) loss:2.884 lr:0.0000100 epoch_Time:24607.0min: [2024-01-05 22:04:52,074][model8_pretrain.py][INFO] Epoch:[0/2](720700/4588595) loss:3.084 lr:0.0000100 
epoch_Time:24607.0min: [2024-01-05 22:05:41,272][model8_pretrain.py][INFO] Epoch:[0/2](720800/4588595) loss:3.048 lr:0.0000100 epoch_Time:24608.0min: [2024-01-05 22:05:41,272][model8_pretrain.py][INFO] Epoch:[0/2](720800/4588595) loss:3.353 lr:0.0000100 epoch_Time:24608.0min: [2024-01-05 22:05:41,272][model8_pretrain.py][INFO] Epoch:[0/2](720800/4588595) loss:2.445 lr:0.0000100 epoch_Time:24608.0min: [2024-01-05 22:05:41,272][model8_pretrain.py][INFO] Epoch:[0/2](720800/4588595) loss:2.721 lr:0.0000100 epoch_Time:24608.0min: [2024-01-05 22:05:41,272][model8_pretrain.py][INFO] Epoch:[0/2](720800/4588595) loss:2.979 lr:0.0000100 epoch_Time:24608.0min: [2024-01-05 22:05:41,272][model8_pretrain.py][INFO] Epoch:[0/2](720800/4588595) loss:3.085 lr:0.0000100 epoch_Time:24608.0min: [2024-01-05 22:05:41,272][model8_pretrain.py][INFO] Epoch:[0/2](720800/4588595) loss:2.799 lr:0.0000100 epoch_Time:24608.0min: [2024-01-05 22:05:41,273][model8_pretrain.py][INFO] Epoch:[0/2](720800/4588595) loss:2.899 lr:0.0000100 epoch_Time:24608.0min: [2024-01-05 22:06:18,198][model8_pretrain.py][INFO] Epoch:[0/2](720900/4588595) loss:2.753 lr:0.0000100 epoch_Time:24607.0min: [2024-01-05 22:06:18,198][model8_pretrain.py][INFO] Epoch:[0/2](720900/4588595) loss:2.728 lr:0.0000100 epoch_Time:24607.0min: [2024-01-05 22:06:18,198][model8_pretrain.py][INFO] Epoch:[0/2](720900/4588595) loss:3.280 lr:0.0000100 epoch_Time:24607.0min: [2024-01-05 22:06:18,198][model8_pretrain.py][INFO] Epoch:[0/2](720900/4588595) loss:2.587 lr:0.0000100 epoch_Time:24607.0min: [2024-01-05 22:06:18,198][model8_pretrain.py][INFO] Epoch:[0/2](720900/4588595) loss:2.867 lr:0.0000100 epoch_Time:24607.0min: [2024-01-05 22:06:18,198][model8_pretrain.py][INFO] Epoch:[0/2](720900/4588595) loss:2.729 lr:0.0000100 epoch_Time:24607.0min: [2024-01-05 22:06:18,199][model8_pretrain.py][INFO] Epoch:[0/2](720900/4588595) loss:2.430 lr:0.0000100 epoch_Time:24607.0min: [2024-01-05 22:06:18,199][model8_pretrain.py][INFO] 
Epoch:[0/2](720900/4588595) loss:2.903 lr:0.0000100 epoch_Time:24607.0min: [2024-01-05 22:06:55,125][model8_pretrain.py][INFO] Epoch:[0/2](721000/4588595) loss:2.720 lr:0.0000100 epoch_Time:24606.0min: [2024-01-05 22:06:55,125][model8_pretrain.py][INFO] Epoch:[0/2](721000/4588595) loss:2.705 lr:0.0000100 epoch_Time:24606.0min: [2024-01-05 22:06:55,125][model8_pretrain.py][INFO] Epoch:[0/2](721000/4588595) loss:3.082 lr:0.0000100 epoch_Time:24606.0min: [2024-01-05 22:06:55,125][model8_pretrain.py][INFO] Epoch:[0/2](721000/4588595) loss:2.466 lr:0.0000100 epoch_Time:24606.0min: [2024-01-05 22:06:55,125][model8_pretrain.py][INFO] Epoch:[0/2](721000/4588595) loss:2.522 lr:0.0000100 epoch_Time:24606.0min: [2024-01-05 22:06:55,125][model8_pretrain.py][INFO] Epoch:[0/2](721000/4588595) loss:3.175 lr:0.0000100 epoch_Time:24606.0min: [2024-01-05 22:06:55,125][model8_pretrain.py][INFO] Epoch:[0/2](721000/4588595) loss:2.713 lr:0.0000100 epoch_Time:24606.0min: [2024-01-05 22:06:55,126][model8_pretrain.py][INFO] Epoch:[0/2](721000/4588595) loss:3.674 lr:0.0000100 epoch_Time:24606.0min: [2024-01-05 22:07:32,063][model8_pretrain.py][INFO] Epoch:[0/2](721100/4588595) loss:2.723 lr:0.0000100 epoch_Time:24606.0min: [2024-01-05 22:07:32,063][model8_pretrain.py][INFO] Epoch:[0/2](721100/4588595) loss:2.971 lr:0.0000100 epoch_Time:24606.0min: [2024-01-05 22:07:32,063][model8_pretrain.py][INFO] Epoch:[0/2](721100/4588595) loss:2.999 lr:0.0000100 epoch_Time:24606.0min: [2024-01-05 22:07:32,063][model8_pretrain.py][INFO] Epoch:[0/2](721100/4588595) loss:2.547 lr:0.0000100 epoch_Time:24606.0min: [2024-01-05 22:07:32,063][model8_pretrain.py][INFO] Epoch:[0/2](721100/4588595) loss:2.928 lr:0.0000100 epoch_Time:24606.0min: [2024-01-05 22:07:32,063][model8_pretrain.py][INFO] Epoch:[0/2](721100/4588595) loss:2.885 lr:0.0000100 epoch_Time:24606.0min: [2024-01-05 22:07:32,063][model8_pretrain.py][INFO] Epoch:[0/2](721100/4588595) loss:3.093 lr:0.0000100 epoch_Time:24606.0min: [2024-01-05 
22:07:32,063][model8_pretrain.py][INFO] Epoch:[0/2](721100/4588595) loss:2.999 lr:0.0000100 epoch_Time:24606.0min: [2024-01-05 22:08:09,006][model8_pretrain.py][INFO] Epoch:[0/2](721200/4588595) loss:2.953 lr:0.0000100 epoch_Time:24605.0min: [2024-01-05 22:08:09,006][model8_pretrain.py][INFO] Epoch:[0/2](721200/4588595) loss:3.340 lr:0.0000100 epoch_Time:24605.0min: [2024-01-05 22:08:09,006][model8_pretrain.py][INFO] Epoch:[0/2](721200/4588595) loss:2.986 lr:0.0000100 epoch_Time:24605.0min: [2024-01-05 22:08:09,006][model8_pretrain.py][INFO] Epoch:[0/2](721200/4588595) loss:3.072 lr:0.0000100 epoch_Time:24605.0min: [2024-01-05 22:08:09,006][model8_pretrain.py][INFO] Epoch:[0/2](721200/4588595) loss:3.016 lr:0.0000100 epoch_Time:24605.0min: [2024-01-05 22:08:09,006][model8_pretrain.py][INFO] Epoch:[0/2](721200/4588595) loss:3.293 lr:0.0000100 epoch_Time:24605.0min: [2024-01-05 22:08:09,006][model8_pretrain.py][INFO] Epoch:[0/2](721200/4588595) loss:2.437 lr:0.0000100 epoch_Time:24605.0min: [2024-01-05 22:08:09,006][model8_pretrain.py][INFO] Epoch:[0/2](721200/4588595) loss:2.819 lr:0.0000100 epoch_Time:24605.0min: [2024-01-05 22:08:45,941][model8_pretrain.py][INFO] Epoch:[0/2](721300/4588595) loss:2.973 lr:0.0000100 epoch_Time:24604.0min: [2024-01-05 22:08:45,941][model8_pretrain.py][INFO] Epoch:[0/2](721300/4588595) loss:3.137 lr:0.0000100 epoch_Time:24604.0min: [2024-01-05 22:08:45,941][model8_pretrain.py][INFO] Epoch:[0/2](721300/4588595) loss:2.636 lr:0.0000100 epoch_Time:24604.0min: [2024-01-05 22:08:45,941][model8_pretrain.py][INFO] Epoch:[0/2](721300/4588595) loss:2.846 lr:0.0000100 epoch_Time:24604.0min: [2024-01-05 22:08:45,941][model8_pretrain.py][INFO] Epoch:[0/2](721300/4588595) loss:3.279 lr:0.0000100 epoch_Time:24604.0min: [2024-01-05 22:08:45,941][model8_pretrain.py][INFO] Epoch:[0/2](721300/4588595) loss:3.000 lr:0.0000100 epoch_Time:24604.0min: [2024-01-05 22:08:45,941][model8_pretrain.py][INFO] Epoch:[0/2](721300/4588595) loss:2.374 lr:0.0000100 
epoch_Time:24604.0min: [2024-01-05 22:08:45,941][model8_pretrain.py][INFO] Epoch:[0/2](721300/4588595) loss:3.197 lr:0.0000100 epoch_Time:24604.0min: [2024-01-05 22:09:22,870][model8_pretrain.py][INFO] Epoch:[0/2](721400/4588595) loss:2.651 lr:0.0000100 epoch_Time:24603.0min: [2024-01-05 22:09:22,870][model8_pretrain.py][INFO] Epoch:[0/2](721400/4588595) loss:2.935 lr:0.0000100 epoch_Time:24603.0min: [2024-01-05 22:09:22,870][model8_pretrain.py][INFO] Epoch:[0/2](721400/4588595) loss:3.121 lr:0.0000100 epoch_Time:24603.0min: [2024-01-05 22:09:22,870][model8_pretrain.py][INFO] Epoch:[0/2](721400/4588595) loss:2.876 lr:0.0000100 epoch_Time:24603.0min: [2024-01-05 22:09:22,870][model8_pretrain.py][INFO] Epoch:[0/2](721400/4588595) loss:2.813 lr:0.0000100 epoch_Time:24603.0min: [2024-01-05 22:09:22,870][model8_pretrain.py][INFO] Epoch:[0/2](721400/4588595) loss:2.873 lr:0.0000100 epoch_Time:24603.0min: [2024-01-05 22:09:22,871][model8_pretrain.py][INFO] Epoch:[0/2](721400/4588595) loss:2.192 lr:0.0000100 epoch_Time:24603.0min: [2024-01-05 22:09:22,871][model8_pretrain.py][INFO] Epoch:[0/2](721400/4588595) loss:2.743 lr:0.0000100 epoch_Time:24603.0min: [2024-01-05 22:09:59,798][model8_pretrain.py][INFO] Epoch:[0/2](721500/4588595) loss:2.935 lr:0.0000100 epoch_Time:24602.0min: [2024-01-05 22:09:59,798][model8_pretrain.py][INFO] Epoch:[0/2](721500/4588595) loss:2.265 lr:0.0000100 epoch_Time:24602.0min: [2024-01-05 22:09:59,798][model8_pretrain.py][INFO] Epoch:[0/2](721500/4588595) loss:2.634 lr:0.0000100 epoch_Time:24602.0min: [2024-01-05 22:09:59,798][model8_pretrain.py][INFO] Epoch:[0/2](721500/4588595) loss:2.514 lr:0.0000100 epoch_Time:24602.0min: [2024-01-05 22:09:59,798][model8_pretrain.py][INFO] Epoch:[0/2](721500/4588595) loss:2.477 lr:0.0000100 epoch_Time:24602.0min: [2024-01-05 22:09:59,798][model8_pretrain.py][INFO] Epoch:[0/2](721500/4588595) loss:1.758 lr:0.0000100 epoch_Time:24602.0min: [2024-01-05 22:09:59,798][model8_pretrain.py][INFO] 
Epoch:[0/2](721500/4588595) loss:2.830 lr:0.0000100 epoch_Time:24602.0min: [2024-01-05 22:09:59,798][model8_pretrain.py][INFO] Epoch:[0/2](721500/4588595) loss:2.747 lr:0.0000100 epoch_Time:24602.0min: [2024-01-05 22:10:48,499][model8_pretrain.py][INFO] Epoch:[0/2](721600/4588595) loss:2.311 lr:0.0000100 epoch_Time:24602.0min: [2024-01-05 22:10:48,499][model8_pretrain.py][INFO] Epoch:[0/2](721600/4588595) loss:2.648 lr:0.0000100 epoch_Time:24602.0min: [2024-01-05 22:10:48,499][model8_pretrain.py][INFO] Epoch:[0/2](721600/4588595) loss:3.270 lr:0.0000100 epoch_Time:24602.0min: [2024-01-05 22:10:48,499][model8_pretrain.py][INFO] Epoch:[0/2](721600/4588595) loss:3.131 lr:0.0000100 epoch_Time:24602.0min: [2024-01-05 22:10:48,499][model8_pretrain.py][INFO] Epoch:[0/2](721600/4588595) loss:3.075 lr:0.0000100 epoch_Time:24602.0min: [2024-01-05 22:10:48,499][model8_pretrain.py][INFO] Epoch:[0/2](721600/4588595) loss:2.932 lr:0.0000100 epoch_Time:24602.0min: [2024-01-05 22:10:48,499][model8_pretrain.py][INFO] Epoch:[0/2](721600/4588595) loss:2.576 lr:0.0000100 epoch_Time:24602.0min: [2024-01-05 22:10:48,499][model8_pretrain.py][INFO] Epoch:[0/2](721600/4588595) loss:3.202 lr:0.0000100 epoch_Time:24602.0min: [2024-01-05 22:11:25,438][model8_pretrain.py][INFO] Epoch:[0/2](721700/4588595) loss:2.396 lr:0.0000100 epoch_Time:24602.0min: [2024-01-05 22:11:25,438][model8_pretrain.py][INFO] Epoch:[0/2](721700/4588595) loss:2.998 lr:0.0000100 epoch_Time:24602.0min: [2024-01-05 22:11:25,438][model8_pretrain.py][INFO] Epoch:[0/2](721700/4588595) loss:2.970 lr:0.0000100 epoch_Time:24602.0min: [2024-01-05 22:11:25,438][model8_pretrain.py][INFO] Epoch:[0/2](721700/4588595) loss:2.358 lr:0.0000100 epoch_Time:24602.0min: [2024-01-05 22:11:25,438][model8_pretrain.py][INFO] Epoch:[0/2](721700/4588595) loss:2.832 lr:0.0000100 epoch_Time:24602.0min: [2024-01-05 22:11:25,438][model8_pretrain.py][INFO] Epoch:[0/2](721700/4588595) loss:3.364 lr:0.0000100 epoch_Time:24602.0min: [2024-01-05 
22:11:25,438][model8_pretrain.py][INFO] Epoch:[0/2](721700/4588595) loss:2.969 lr:0.0000100 epoch_Time:24602.0min: [2024-01-05 22:11:25,438][model8_pretrain.py][INFO] Epoch:[0/2](721700/4588595) loss:3.167 lr:0.0000100 epoch_Time:24602.0min: [2024-01-05 22:12:02,383][model8_pretrain.py][INFO] Epoch:[0/2](721800/4588595) loss:3.010 lr:0.0000100 epoch_Time:24601.0min: [2024-01-05 22:12:02,384][model8_pretrain.py][INFO] Epoch:[0/2](721800/4588595) loss:2.870 lr:0.0000100 epoch_Time:24601.0min: [2024-01-05 22:12:02,384][model8_pretrain.py][INFO] Epoch:[0/2](721800/4588595) loss:2.983 lr:0.0000100 epoch_Time:24601.0min: [2024-01-05 22:12:02,384][model8_pretrain.py][INFO] Epoch:[0/2](721800/4588595) loss:2.762 lr:0.0000100 epoch_Time:24601.0min: [2024-01-05 22:12:02,384][model8_pretrain.py][INFO] Epoch:[0/2](721800/4588595) loss:2.994 lr:0.0000100 epoch_Time:24601.0min: [2024-01-05 22:12:02,384][model8_pretrain.py][INFO] Epoch:[0/2](721800/4588595) loss:3.048 lr:0.0000100 epoch_Time:24601.0min: [2024-01-05 22:12:02,384][model8_pretrain.py][INFO] Epoch:[0/2](721800/4588595) loss:3.043 lr:0.0000100 epoch_Time:24601.0min: [2024-01-05 22:12:02,384][model8_pretrain.py][INFO] Epoch:[0/2](721800/4588595) loss:2.728 lr:0.0000100 epoch_Time:24601.0min: [2024-01-05 22:12:39,342][model8_pretrain.py][INFO] Epoch:[0/2](721900/4588595) loss:2.299 lr:0.0000100 epoch_Time:24601.0min: [2024-01-05 22:12:39,342][model8_pretrain.py][INFO] Epoch:[0/2](721900/4588595) loss:2.359 lr:0.0000100 epoch_Time:24601.0min: [2024-01-05 22:12:39,342][model8_pretrain.py][INFO] Epoch:[0/2](721900/4588595) loss:2.979 lr:0.0000100 epoch_Time:24601.0min: [2024-01-05 22:12:39,342][model8_pretrain.py][INFO] Epoch:[0/2](721900/4588595) loss:2.358 lr:0.0000100 epoch_Time:24601.0min: [2024-01-05 22:12:39,342][model8_pretrain.py][INFO] Epoch:[0/2](721900/4588595) loss:2.655 lr:0.0000100 epoch_Time:24601.0min: [2024-01-05 22:12:39,342][model8_pretrain.py][INFO] Epoch:[0/2](721900/4588595) loss:2.652 lr:0.0000100 
epoch_Time:24601.0min: [2024-01-05 22:12:39,342][model8_pretrain.py][INFO] Epoch:[0/2](721900/4588595) loss:2.817 lr:0.0000100 epoch_Time:24601.0min: [2024-01-05 22:12:39,342][model8_pretrain.py][INFO] Epoch:[0/2](721900/4588595) loss:3.433 lr:0.0000100 epoch_Time:24601.0min: [2024-01-05 22:13:16,286][model8_pretrain.py][INFO] Epoch:[0/2](722000/4588595) loss:2.502 lr:0.0000100 epoch_Time:24600.0min: [2024-01-05 22:13:16,286][model8_pretrain.py][INFO] Epoch:[0/2](722000/4588595) loss:2.251 lr:0.0000100 epoch_Time:24600.0min: [2024-01-05 22:13:16,286][model8_pretrain.py][INFO] Epoch:[0/2](722000/4588595) loss:2.927 lr:0.0000100 epoch_Time:24600.0min: [2024-01-05 22:13:16,286][model8_pretrain.py][INFO] Epoch:[0/2](722000/4588595) loss:3.139 lr:0.0000100 epoch_Time:24600.0min: [2024-01-05 22:13:16,286][model8_pretrain.py][INFO] Epoch:[0/2](722000/4588595) loss:2.748 lr:0.0000100 epoch_Time:24600.0min: [2024-01-05 22:13:16,286][model8_pretrain.py][INFO] Epoch:[0/2](722000/4588595) loss:2.961 lr:0.0000100 epoch_Time:24600.0min: [2024-01-05 22:13:16,286][model8_pretrain.py][INFO] Epoch:[0/2](722000/4588595) loss:2.860 lr:0.0000100 epoch_Time:24600.0min: [2024-01-05 22:13:16,286][model8_pretrain.py][INFO] Epoch:[0/2](722000/4588595) loss:2.499 lr:0.0000100 epoch_Time:24600.0min: [2024-01-05 22:13:53,235][model8_pretrain.py][INFO] Epoch:[0/2](722100/4588595) loss:3.022 lr:0.0000100 epoch_Time:24599.0min: [2024-01-05 22:13:53,235][model8_pretrain.py][INFO] Epoch:[0/2](722100/4588595) loss:3.214 lr:0.0000100 epoch_Time:24599.0min: [2024-01-05 22:13:53,235][model8_pretrain.py][INFO] Epoch:[0/2](722100/4588595) loss:3.125 lr:0.0000100 epoch_Time:24599.0min: [2024-01-05 22:13:53,235][model8_pretrain.py][INFO] Epoch:[0/2](722100/4588595) loss:2.559 lr:0.0000100 epoch_Time:24599.0min: [2024-01-05 22:13:53,235][model8_pretrain.py][INFO] Epoch:[0/2](722100/4588595) loss:3.044 lr:0.0000100 epoch_Time:24599.0min: [2024-01-05 22:13:53,235][model8_pretrain.py][INFO] 
Epoch:[0/2](722100/4588595) loss:2.224 lr:0.0000100 epoch_Time:24599.0min: [2024-01-05 22:13:53,235][model8_pretrain.py][INFO] Epoch:[0/2](722100/4588595) loss:2.124 lr:0.0000100 epoch_Time:24599.0min: [2024-01-05 22:13:53,235][model8_pretrain.py][INFO] Epoch:[0/2](722100/4588595) loss:2.768 lr:0.0000100 epoch_Time:24599.0min: [2024-01-05 22:14:30,180][model8_pretrain.py][INFO] Epoch:[0/2](722200/4588595) loss:2.554 lr:0.0000100 epoch_Time:24599.0min: [2024-01-05 22:14:30,180][model8_pretrain.py][INFO] Epoch:[0/2](722200/4588595) loss:2.567 lr:0.0000100 epoch_Time:24599.0min: [2024-01-05 22:14:30,180][model8_pretrain.py][INFO] Epoch:[0/2](722200/4588595) loss:3.080 lr:0.0000100 epoch_Time:24599.0min: [2024-01-05 22:14:30,180][model8_pretrain.py][INFO] Epoch:[0/2](722200/4588595) loss:3.019 lr:0.0000100 epoch_Time:24599.0min: [2024-01-05 22:14:30,180][model8_pretrain.py][INFO] Epoch:[0/2](722200/4588595) loss:2.800 lr:0.0000100 epoch_Time:24599.0min: [2024-01-05 22:14:30,180][model8_pretrain.py][INFO] Epoch:[0/2](722200/4588595) loss:2.837 lr:0.0000100 epoch_Time:24599.0min: [2024-01-05 22:14:30,180][model8_pretrain.py][INFO] Epoch:[0/2](722200/4588595) loss:2.577 lr:0.0000100 epoch_Time:24599.0min: [2024-01-05 22:14:30,181][model8_pretrain.py][INFO] Epoch:[0/2](722200/4588595) loss:3.641 lr:0.0000100 epoch_Time:24599.0min: [2024-01-05 22:15:07,123][model8_pretrain.py][INFO] Epoch:[0/2](722300/4588595) loss:2.825 lr:0.0000100 epoch_Time:24597.0min: [2024-01-05 22:15:07,123][model8_pretrain.py][INFO] Epoch:[0/2](722300/4588595) loss:2.866 lr:0.0000100 epoch_Time:24597.0min: [2024-01-05 22:15:07,123][model8_pretrain.py][INFO] Epoch:[0/2](722300/4588595) loss:3.103 lr:0.0000100 epoch_Time:24597.0min: [2024-01-05 22:15:07,123][model8_pretrain.py][INFO] Epoch:[0/2](722300/4588595) loss:3.303 lr:0.0000100 epoch_Time:24597.0min: [2024-01-05 22:15:07,123][model8_pretrain.py][INFO] Epoch:[0/2](722300/4588595) loss:2.782 lr:0.0000100 epoch_Time:24597.0min: [2024-01-05 
22:15:07,123][model8_pretrain.py][INFO] Epoch:[0/2](722300/4588595) loss:2.157 lr:0.0000100 epoch_Time:24597.0min: [2024-01-05 22:15:07,123][model8_pretrain.py][INFO] Epoch:[0/2](722300/4588595) loss:2.998 lr:0.0000100 epoch_Time:24597.0min: [2024-01-05 22:15:07,123][model8_pretrain.py][INFO] Epoch:[0/2](722300/4588595) loss:3.002 lr:0.0000100 epoch_Time:24597.0min: [2024-01-05 22:15:55,838][model8_pretrain.py][INFO] Epoch:[0/2](722400/4588595) loss:2.809 lr:0.0000100 epoch_Time:24598.0min: [2024-01-05 22:15:55,838][model8_pretrain.py][INFO] Epoch:[0/2](722400/4588595) loss:2.912 lr:0.0000100 epoch_Time:24598.0min: [2024-01-05 22:15:55,838][model8_pretrain.py][INFO] Epoch:[0/2](722400/4588595) loss:2.926 lr:0.0000100 epoch_Time:24598.0min: [2024-01-05 22:15:55,838][model8_pretrain.py][INFO] Epoch:[0/2](722400/4588595) loss:3.083 lr:0.0000100 epoch_Time:24598.0min: [2024-01-05 22:15:55,838][model8_pretrain.py][INFO] Epoch:[0/2](722400/4588595) loss:3.039 lr:0.0000100 epoch_Time:24598.0min: [2024-01-05 22:15:55,838][model8_pretrain.py][INFO] Epoch:[0/2](722400/4588595) loss:2.611 lr:0.0000100 epoch_Time:24598.0min: [2024-01-05 22:15:55,838][model8_pretrain.py][INFO] Epoch:[0/2](722400/4588595) loss:2.892 lr:0.0000100 epoch_Time:24598.0min: [2024-01-05 22:15:55,838][model8_pretrain.py][INFO] Epoch:[0/2](722400/4588595) loss:2.672 lr:0.0000100 epoch_Time:24598.0min: [2024-01-05 22:16:32,771][model8_pretrain.py][INFO] Epoch:[0/2](722500/4588595) loss:3.038 lr:0.0000100 epoch_Time:24597.0min: [2024-01-05 22:16:32,771][model8_pretrain.py][INFO] Epoch:[0/2](722500/4588595) loss:2.929 lr:0.0000100 epoch_Time:24597.0min: [2024-01-05 22:16:32,771][model8_pretrain.py][INFO] Epoch:[0/2](722500/4588595) loss:3.278 lr:0.0000100 epoch_Time:24597.0min: [2024-01-05 22:16:32,771][model8_pretrain.py][INFO] Epoch:[0/2](722500/4588595) loss:2.781 lr:0.0000100 epoch_Time:24597.0min: [2024-01-05 22:16:32,771][model8_pretrain.py][INFO] Epoch:[0/2](722500/4588595) loss:3.009 lr:0.0000100 
epoch_Time:24597.0min: [2024-01-05 22:16:32,771][model8_pretrain.py][INFO] Epoch:[0/2](722500/4588595) loss:2.695 lr:0.0000100 epoch_Time:24597.0min: [2024-01-05 22:16:32,771][model8_pretrain.py][INFO] Epoch:[0/2](722500/4588595) loss:2.875 lr:0.0000100 epoch_Time:24597.0min: [2024-01-05 22:16:32,772][model8_pretrain.py][INFO] Epoch:[0/2](722500/4588595) loss:2.525 lr:0.0000100 epoch_Time:24597.0min: [2024-01-05 22:17:09,709][model8_pretrain.py][INFO] Epoch:[0/2](722600/4588595) loss:2.377 lr:0.0000100 epoch_Time:24596.0min: [2024-01-05 22:17:09,709][model8_pretrain.py][INFO] Epoch:[0/2](722600/4588595) loss:2.161 lr:0.0000100 epoch_Time:24596.0min: [2024-01-05 22:17:09,709][model8_pretrain.py][INFO] Epoch:[0/2](722600/4588595) loss:2.860 lr:0.0000100 epoch_Time:24596.0min: [2024-01-05 22:17:09,709][model8_pretrain.py][INFO] Epoch:[0/2](722600/4588595) loss:2.760 lr:0.0000100 epoch_Time:24596.0min: [2024-01-05 22:17:09,709][model8_pretrain.py][INFO] Epoch:[0/2](722600/4588595) loss:2.801 lr:0.0000100 epoch_Time:24596.0min: [2024-01-05 22:17:09,709][model8_pretrain.py][INFO] Epoch:[0/2](722600/4588595) loss:2.514 lr:0.0000100 epoch_Time:24596.0min: [2024-01-05 22:17:09,709][model8_pretrain.py][INFO] Epoch:[0/2](722600/4588595) loss:2.569 lr:0.0000100 epoch_Time:24596.0min: [2024-01-05 22:17:09,709][model8_pretrain.py][INFO] Epoch:[0/2](722600/4588595) loss:2.734 lr:0.0000100 epoch_Time:24596.0min: [2024-01-05 22:17:46,653][model8_pretrain.py][INFO] Epoch:[0/2](722700/4588595) loss:2.622 lr:0.0000100 epoch_Time:24596.0min: [2024-01-05 22:17:46,653][model8_pretrain.py][INFO] Epoch:[0/2](722700/4588595) loss:3.014 lr:0.0000100 epoch_Time:24596.0min: [2024-01-05 22:17:46,653][model8_pretrain.py][INFO] Epoch:[0/2](722700/4588595) loss:1.775 lr:0.0000100 epoch_Time:24596.0min: [2024-01-05 22:17:46,653][model8_pretrain.py][INFO] Epoch:[0/2](722700/4588595) loss:2.750 lr:0.0000100 epoch_Time:24596.0min: [2024-01-05 22:17:46,653][model8_pretrain.py][INFO] 
Epoch:[0/2](722700/4588595) loss:2.993 lr:0.0000100 epoch_Time:24596.0min: [2024-01-05 22:17:46,654][model8_pretrain.py][INFO] Epoch:[0/2](722700/4588595) loss:2.649 lr:0.0000100 epoch_Time:24596.0min: [2024-01-05 22:17:46,654][model8_pretrain.py][INFO] Epoch:[0/2](722700/4588595) loss:2.886 lr:0.0000100 epoch_Time:24596.0min: [2024-01-05 22:17:46,654][model8_pretrain.py][INFO] Epoch:[0/2](722700/4588595) loss:2.957 lr:0.0000100 epoch_Time:24596.0min: [2024-01-05 22:18:23,614][model8_pretrain.py][INFO] Epoch:[0/2](722800/4588595) loss:2.657 lr:0.0000100 epoch_Time:24595.0min: [2024-01-05 22:18:23,614][model8_pretrain.py][INFO] Epoch:[0/2](722800/4588595) loss:3.120 lr:0.0000100 epoch_Time:24595.0min: [2024-01-05 22:18:23,614][model8_pretrain.py][INFO] Epoch:[0/2](722800/4588595) loss:2.452 lr:0.0000100 epoch_Time:24595.0min: [2024-01-05 22:18:23,614][model8_pretrain.py][INFO] Epoch:[0/2](722800/4588595) loss:2.276 lr:0.0000100 epoch_Time:24595.0min: [2024-01-05 22:18:23,614][model8_pretrain.py][INFO] Epoch:[0/2](722800/4588595) loss:2.890 lr:0.0000100 epoch_Time:24595.0min: [2024-01-05 22:18:23,614][model8_pretrain.py][INFO] Epoch:[0/2](722800/4588595) loss:3.203 lr:0.0000100 epoch_Time:24595.0min: [2024-01-05 22:18:23,614][model8_pretrain.py][INFO] Epoch:[0/2](722800/4588595) loss:3.432 lr:0.0000100 epoch_Time:24595.0min: [2024-01-05 22:18:23,614][model8_pretrain.py][INFO] Epoch:[0/2](722800/4588595) loss:3.109 lr:0.0000100 epoch_Time:24595.0min: [2024-01-05 22:19:00,545][model8_pretrain.py][INFO] Epoch:[0/2](722900/4588595) loss:2.867 lr:0.0000100 epoch_Time:24594.0min: [2024-01-05 22:19:00,545][model8_pretrain.py][INFO] Epoch:[0/2](722900/4588595) loss:3.268 lr:0.0000100 epoch_Time:24594.0min: [2024-01-05 22:19:00,545][model8_pretrain.py][INFO] Epoch:[0/2](722900/4588595) loss:2.993 lr:0.0000100 epoch_Time:24594.0min: [2024-01-05 22:19:00,545][model8_pretrain.py][INFO] Epoch:[0/2](722900/4588595) loss:2.468 lr:0.0000100 epoch_Time:24594.0min: [2024-01-05 
22:19:00,545][model8_pretrain.py][INFO] Epoch:[0/2](722900/4588595) loss:2.423 lr:0.0000100 epoch_Time:24594.0min: [2024-01-05 22:19:00,545][model8_pretrain.py][INFO] Epoch:[0/2](722900/4588595) loss:2.793 lr:0.0000100 epoch_Time:24594.0min: [2024-01-05 22:19:00,545][model8_pretrain.py][INFO] Epoch:[0/2](722900/4588595) loss:2.783 lr:0.0000100 epoch_Time:24594.0min: [2024-01-05 22:19:00,545][model8_pretrain.py][INFO] Epoch:[0/2](722900/4588595) loss:3.093 lr:0.0000100 epoch_Time:24594.0min: [2024-01-05 22:19:37,471][model8_pretrain.py][INFO] Epoch:[0/2](723000/4588595) loss:2.563 lr:0.0000100 epoch_Time:24594.0min: [2024-01-05 22:19:37,471][model8_pretrain.py][INFO] Epoch:[0/2](723000/4588595) loss:2.092 lr:0.0000100 epoch_Time:24594.0min: [2024-01-05 22:19:37,471][model8_pretrain.py][INFO] Epoch:[0/2](723000/4588595) loss:3.016 lr:0.0000100 epoch_Time:24594.0min: [2024-01-05 22:19:37,471][model8_pretrain.py][INFO] Epoch:[0/2](723000/4588595) loss:2.301 lr:0.0000100 epoch_Time:24594.0min: [2024-01-05 22:19:37,471][model8_pretrain.py][INFO] Epoch:[0/2](723000/4588595) loss:2.106 lr:0.0000100 epoch_Time:24594.0min: [2024-01-05 22:19:37,471][model8_pretrain.py][INFO] Epoch:[0/2](723000/4588595) loss:3.240 lr:0.0000100 epoch_Time:24594.0min: [2024-01-05 22:19:37,472][model8_pretrain.py][INFO] Epoch:[0/2](723000/4588595) loss:2.015 lr:0.0000100 epoch_Time:24594.0min: [2024-01-05 22:19:37,472][model8_pretrain.py][INFO] Epoch:[0/2](723000/4588595) loss:3.110 lr:0.0000100 epoch_Time:24594.0min: [2024-01-05 22:20:14,411][model8_pretrain.py][INFO] Epoch:[0/2](723100/4588595) loss:3.202 lr:0.0000100 epoch_Time:24593.0min: [2024-01-05 22:20:14,411][model8_pretrain.py][INFO] Epoch:[0/2](723100/4588595) loss:3.013 lr:0.0000100 epoch_Time:24593.0min: [2024-01-05 22:20:14,411][model8_pretrain.py][INFO] Epoch:[0/2](723100/4588595) loss:2.517 lr:0.0000100 epoch_Time:24593.0min: [2024-01-05 22:20:14,411][model8_pretrain.py][INFO] Epoch:[0/2](723100/4588595) loss:2.842 lr:0.0000100 
epoch_Time:24593.0min: [2024-01-05 22:20:14,411][model8_pretrain.py][INFO] Epoch:[0/2](723100/4588595) loss:2.553 lr:0.0000100 epoch_Time:24593.0min: [2024-01-05 22:20:14,411][model8_pretrain.py][INFO] Epoch:[0/2](723100/4588595) loss:3.151 lr:0.0000100 epoch_Time:24593.0min: [2024-01-05 22:20:14,411][model8_pretrain.py][INFO] Epoch:[0/2](723100/4588595) loss:2.043 lr:0.0000100 epoch_Time:24593.0min: [2024-01-05 22:20:14,412][model8_pretrain.py][INFO] Epoch:[0/2](723100/4588595) loss:3.216 lr:0.0000100 epoch_Time:24593.0min: [2024-01-05 22:21:03,282][model8_pretrain.py][INFO] Epoch:[0/2](723200/4588595) loss:2.756 lr:0.0000100 epoch_Time:24593.0min: [2024-01-05 22:21:03,282][model8_pretrain.py][INFO] Epoch:[0/2](723200/4588595) loss:2.450 lr:0.0000100 epoch_Time:24593.0min: [2024-01-05 22:21:03,282][model8_pretrain.py][INFO] Epoch:[0/2](723200/4588595) loss:2.071 lr:0.0000100 epoch_Time:24593.0min: [2024-01-05 22:21:03,282][model8_pretrain.py][INFO] Epoch:[0/2](723200/4588595) loss:3.353 lr:0.0000100 epoch_Time:24593.0min: [2024-01-05 22:21:03,282][model8_pretrain.py][INFO] Epoch:[0/2](723200/4588595) loss:2.871 lr:0.0000100 epoch_Time:24593.0min: [2024-01-05 22:21:03,282][model8_pretrain.py][INFO] Epoch:[0/2](723200/4588595) loss:3.080 lr:0.0000100 epoch_Time:24593.0min: [2024-01-05 22:21:03,282][model8_pretrain.py][INFO] Epoch:[0/2](723200/4588595) loss:3.261 lr:0.0000100 epoch_Time:24593.0min: [2024-01-05 22:21:03,282][model8_pretrain.py][INFO] Epoch:[0/2](723200/4588595) loss:3.181 lr:0.0000100 epoch_Time:24593.0min: [2024-01-05 22:21:40,211][model8_pretrain.py][INFO] Epoch:[0/2](723300/4588595) loss:3.227 lr:0.0000100 epoch_Time:24593.0min: [2024-01-05 22:21:40,211][model8_pretrain.py][INFO] Epoch:[0/2](723300/4588595) loss:2.723 lr:0.0000100 epoch_Time:24593.0min: [2024-01-05 22:21:40,211][model8_pretrain.py][INFO] Epoch:[0/2](723300/4588595) loss:2.818 lr:0.0000100 epoch_Time:24593.0min: [2024-01-05 22:21:40,211][model8_pretrain.py][INFO] 
Epoch:[0/2](723300/4588595) loss:2.766 lr:0.0000100 epoch_Time:24593.0min: [2024-01-05 22:21:40,211][model8_pretrain.py][INFO] Epoch:[0/2](723300/4588595) loss:2.355 lr:0.0000100 epoch_Time:24593.0min: [2024-01-05 22:21:40,211][model8_pretrain.py][INFO] Epoch:[0/2](723300/4588595) loss:2.912 lr:0.0000100 epoch_Time:24593.0min: [2024-01-05 22:21:40,211][model8_pretrain.py][INFO] Epoch:[0/2](723300/4588595) loss:2.055 lr:0.0000100 epoch_Time:24593.0min: [2024-01-05 22:21:40,211][model8_pretrain.py][INFO] Epoch:[0/2](723300/4588595) loss:2.813 lr:0.0000100 epoch_Time:24593.0min: [2024-01-05 22:22:17,143][model8_pretrain.py][INFO] Epoch:[0/2](723400/4588595) loss:2.714 lr:0.0000100 epoch_Time:24592.0min: [2024-01-05 22:22:17,143][model8_pretrain.py][INFO] Epoch:[0/2](723400/4588595) loss:2.417 lr:0.0000100 epoch_Time:24592.0min: [2024-01-05 22:22:17,143][model8_pretrain.py][INFO] Epoch:[0/2](723400/4588595) loss:2.911 lr:0.0000100 epoch_Time:24592.0min: [2024-01-05 22:22:17,143][model8_pretrain.py][INFO] Epoch:[0/2](723400/4588595) loss:3.163 lr:0.0000100 epoch_Time:24592.0min: [2024-01-05 22:22:17,143][model8_pretrain.py][INFO] Epoch:[0/2](723400/4588595) loss:3.119 lr:0.0000100 epoch_Time:24592.0min: [2024-01-05 22:22:17,143][model8_pretrain.py][INFO] Epoch:[0/2](723400/4588595) loss:2.477 lr:0.0000100 epoch_Time:24592.0min: [2024-01-05 22:22:17,144][model8_pretrain.py][INFO] Epoch:[0/2](723400/4588595) loss:3.162 lr:0.0000100 epoch_Time:24592.0min: [2024-01-05 22:22:17,144][model8_pretrain.py][INFO] Epoch:[0/2](723400/4588595) loss:2.850 lr:0.0000100 epoch_Time:24592.0min: [2024-01-05 22:22:54,077][model8_pretrain.py][INFO] Epoch:[0/2](723500/4588595) loss:3.024 lr:0.0000100 epoch_Time:24590.0min: [2024-01-05 22:22:54,077][model8_pretrain.py][INFO] Epoch:[0/2](723500/4588595) loss:3.103 lr:0.0000100 epoch_Time:24590.0min: [2024-01-05 22:22:54,077][model8_pretrain.py][INFO] Epoch:[0/2](723500/4588595) loss:2.636 lr:0.0000100 epoch_Time:24590.0min: [2024-01-05 
22:22:54,077][model8_pretrain.py][INFO] Epoch:[0/2](723500/4588595) loss:2.648 lr:0.0000100 epoch_Time:24590.0min: [2024-01-05 22:22:54,077][model8_pretrain.py][INFO] Epoch:[0/2](723500/4588595) loss:3.262 lr:0.0000100 epoch_Time:24590.0min: [2024-01-05 22:22:54,077][model8_pretrain.py][INFO] Epoch:[0/2](723500/4588595) loss:2.932 lr:0.0000100 epoch_Time:24590.0min: [2024-01-05 22:22:54,078][model8_pretrain.py][INFO] Epoch:[0/2](723500/4588595) loss:2.844 lr:0.0000100 epoch_Time:24590.0min: [2024-01-05 22:22:54,078][model8_pretrain.py][INFO] Epoch:[0/2](723500/4588595) loss:2.567 lr:0.0000100 epoch_Time:24590.0min: [2024-01-05 22:23:31,007][model8_pretrain.py][INFO] Epoch:[0/2](723600/4588595) loss:3.441 lr:0.0000100 epoch_Time:24590.0min: [2024-01-05 22:23:31,007][model8_pretrain.py][INFO] Epoch:[0/2](723600/4588595) loss:3.289 lr:0.0000100 epoch_Time:24590.0min: [2024-01-05 22:23:31,007][model8_pretrain.py][INFO] Epoch:[0/2](723600/4588595) loss:2.965 lr:0.0000100 epoch_Time:24590.0min: [2024-01-05 22:23:31,007][model8_pretrain.py][INFO] Epoch:[0/2](723600/4588595) loss:2.934 lr:0.0000100 epoch_Time:24590.0min: [2024-01-05 22:23:31,007][model8_pretrain.py][INFO] Epoch:[0/2](723600/4588595) loss:3.043 lr:0.0000100 epoch_Time:24590.0min: [2024-01-05 22:23:31,007][model8_pretrain.py][INFO] Epoch:[0/2](723600/4588595) loss:3.332 lr:0.0000100 epoch_Time:24590.0min: [2024-01-05 22:23:31,007][model8_pretrain.py][INFO] Epoch:[0/2](723600/4588595) loss:3.158 lr:0.0000100 epoch_Time:24590.0min: [2024-01-05 22:23:31,007][model8_pretrain.py][INFO] Epoch:[0/2](723600/4588595) loss:3.228 lr:0.0000100 epoch_Time:24590.0min: [2024-01-05 22:24:07,932][model8_pretrain.py][INFO] Epoch:[0/2](723700/4588595) loss:3.025 lr:0.0000100 epoch_Time:24589.0min: [2024-01-05 22:24:07,932][model8_pretrain.py][INFO] Epoch:[0/2](723700/4588595) loss:3.301 lr:0.0000100 epoch_Time:24589.0min: [2024-01-05 22:24:07,932][model8_pretrain.py][INFO] Epoch:[0/2](723700/4588595) loss:2.472 lr:0.0000100 
epoch_Time:24589.0min: [2024-01-05 22:24:07,932][model8_pretrain.py][INFO] Epoch:[0/2](723700/4588595) loss:2.987 lr:0.0000100 epoch_Time:24589.0min: [2024-01-05 22:24:07,932][model8_pretrain.py][INFO] Epoch:[0/2](723700/4588595) loss:2.999 lr:0.0000100 epoch_Time:24589.0min: [2024-01-05 22:24:07,932][model8_pretrain.py][INFO] Epoch:[0/2](723700/4588595) loss:2.765 lr:0.0000100 epoch_Time:24589.0min: [2024-01-05 22:24:07,932][model8_pretrain.py][INFO] Epoch:[0/2](723700/4588595) loss:2.638 lr:0.0000100 epoch_Time:24589.0min: [2024-01-05 22:24:07,932][model8_pretrain.py][INFO] Epoch:[0/2](723700/4588595) loss:2.704 lr:0.0000100 epoch_Time:24589.0min: [2024-01-05 22:24:44,858][model8_pretrain.py][INFO] Epoch:[0/2](723800/4588595) loss:2.633 lr:0.0000100 epoch_Time:24589.0min: [2024-01-05 22:24:44,858][model8_pretrain.py][INFO] Epoch:[0/2](723800/4588595) loss:3.447 lr:0.0000100 epoch_Time:24589.0min: [2024-01-05 22:24:44,858][model8_pretrain.py][INFO] Epoch:[0/2](723800/4588595) loss:3.313 lr:0.0000100 epoch_Time:24589.0min: [2024-01-05 22:24:44,858][model8_pretrain.py][INFO] Epoch:[0/2](723800/4588595) loss:2.805 lr:0.0000100 epoch_Time:24589.0min: [2024-01-05 22:24:44,858][model8_pretrain.py][INFO] Epoch:[0/2](723800/4588595) loss:2.729 lr:0.0000100 epoch_Time:24589.0min: [2024-01-05 22:24:44,858][model8_pretrain.py][INFO] Epoch:[0/2](723800/4588595) loss:2.888 lr:0.0000100 epoch_Time:24589.0min: [2024-01-05 22:24:44,858][model8_pretrain.py][INFO] Epoch:[0/2](723800/4588595) loss:2.617 lr:0.0000100 epoch_Time:24589.0min: [2024-01-05 22:24:44,858][model8_pretrain.py][INFO] Epoch:[0/2](723800/4588595) loss:2.858 lr:0.0000100 epoch_Time:24589.0min: [2024-01-05 22:25:21,791][model8_pretrain.py][INFO] Epoch:[0/2](723900/4588595) loss:3.159 lr:0.0000100 epoch_Time:24588.0min: [2024-01-05 22:25:21,791][model8_pretrain.py][INFO] Epoch:[0/2](723900/4588595) loss:3.155 lr:0.0000100 epoch_Time:24588.0min: [2024-01-05 22:25:21,791][model8_pretrain.py][INFO] 
Epoch:[0/2](723900/4588595) loss:2.730 lr:0.0000100 epoch_Time:24588.0min: [2024-01-05 22:25:21,791][model8_pretrain.py][INFO] Epoch:[0/2](723900/4588595) loss:2.921 lr:0.0000100 epoch_Time:24588.0min: [2024-01-05 22:25:21,791][model8_pretrain.py][INFO] Epoch:[0/2](723900/4588595) loss:2.406 lr:0.0000100 epoch_Time:24588.0min: [2024-01-05 22:25:21,791][model8_pretrain.py][INFO] Epoch:[0/2](723900/4588595) loss:2.637 lr:0.0000100 epoch_Time:24588.0min: [2024-01-05 22:25:21,791][model8_pretrain.py][INFO] Epoch:[0/2](723900/4588595) loss:2.768 lr:0.0000100 epoch_Time:24588.0min: [2024-01-05 22:25:21,791][model8_pretrain.py][INFO] Epoch:[0/2](723900/4588595) loss:2.553 lr:0.0000100 epoch_Time:24588.0min: [2024-01-05 22:26:10,448][model8_pretrain.py][INFO] Epoch:[0/2](724000/4588595) loss:3.193 lr:0.0000100 epoch_Time:24588.0min: [2024-01-05 22:26:10,449][model8_pretrain.py][INFO] Epoch:[0/2](724000/4588595) loss:2.853 lr:0.0000100 epoch_Time:24588.0min: [2024-01-05 22:26:10,448][model8_pretrain.py][INFO] Epoch:[0/2](724000/4588595) loss:2.228 lr:0.0000100 epoch_Time:24588.0min: [2024-01-05 22:26:10,449][model8_pretrain.py][INFO] Epoch:[0/2](724000/4588595) loss:2.555 lr:0.0000100 epoch_Time:24588.0min: [2024-01-05 22:26:10,449][model8_pretrain.py][INFO] Epoch:[0/2](724000/4588595) loss:3.072 lr:0.0000100 epoch_Time:24588.0min: [2024-01-05 22:26:10,449][model8_pretrain.py][INFO] Epoch:[0/2](724000/4588595) loss:3.368 lr:0.0000100 epoch_Time:24588.0min: [2024-01-05 22:26:10,449][model8_pretrain.py][INFO] Epoch:[0/2](724000/4588595) loss:3.192 lr:0.0000100 epoch_Time:24588.0min: [2024-01-05 22:26:10,449][model8_pretrain.py][INFO] Epoch:[0/2](724000/4588595) loss:2.653 lr:0.0000100 epoch_Time:24588.0min: [2024-01-05 22:26:47,379][model8_pretrain.py][INFO] Epoch:[0/2](724100/4588595) loss:2.396 lr:0.0000100 epoch_Time:24588.0min: [2024-01-05 22:26:47,379][model8_pretrain.py][INFO] Epoch:[0/2](724100/4588595) loss:2.472 lr:0.0000100 epoch_Time:24588.0min: [2024-01-05 
22:26:47,379][model8_pretrain.py][INFO] Epoch:[0/2](724100/4588595) loss:2.641 lr:0.0000100 epoch_Time:24588.0min: [2024-01-05 22:26:47,379][model8_pretrain.py][INFO] Epoch:[0/2](724100/4588595) loss:2.530 lr:0.0000100 epoch_Time:24588.0min: [2024-01-05 22:26:47,379][model8_pretrain.py][INFO] Epoch:[0/2](724100/4588595) loss:2.325 lr:0.0000100 epoch_Time:24588.0min: [2024-01-05 22:26:47,379][model8_pretrain.py][INFO] Epoch:[0/2](724100/4588595) loss:2.709 lr:0.0000100 epoch_Time:24588.0min: [2024-01-05 22:26:47,379][model8_pretrain.py][INFO] Epoch:[0/2](724100/4588595) loss:2.573 lr:0.0000100 epoch_Time:24588.0min: [2024-01-05 22:26:47,379][model8_pretrain.py][INFO] Epoch:[0/2](724100/4588595) loss:2.983 lr:0.0000100 epoch_Time:24588.0min: [2024-01-05 22:27:24,327][model8_pretrain.py][INFO] Epoch:[0/2](724200/4588595) loss:2.771 lr:0.0000100 epoch_Time:24587.0min: [2024-01-05 22:27:24,327][model8_pretrain.py][INFO] Epoch:[0/2](724200/4588595) loss:3.426 lr:0.0000100 epoch_Time:24587.0min: [2024-01-05 22:27:24,327][model8_pretrain.py][INFO] Epoch:[0/2](724200/4588595) loss:3.163 lr:0.0000100 epoch_Time:24587.0min: [2024-01-05 22:27:24,327][model8_pretrain.py][INFO] Epoch:[0/2](724200/4588595) loss:3.119 lr:0.0000100 epoch_Time:24587.0min: [2024-01-05 22:27:24,327][model8_pretrain.py][INFO] Epoch:[0/2](724200/4588595) loss:2.596 lr:0.0000100 epoch_Time:24587.0min: [2024-01-05 22:27:24,327][model8_pretrain.py][INFO] Epoch:[0/2](724200/4588595) loss:2.724 lr:0.0000100 epoch_Time:24587.0min: [2024-01-05 22:27:24,327][model8_pretrain.py][INFO] Epoch:[0/2](724200/4588595) loss:2.294 lr:0.0000100 epoch_Time:24587.0min: [2024-01-05 22:27:24,327][model8_pretrain.py][INFO] Epoch:[0/2](724200/4588595) loss:2.595 lr:0.0000100 epoch_Time:24587.0min: [2024-01-05 22:28:01,267][model8_pretrain.py][INFO] Epoch:[0/2](724300/4588595) loss:3.191 lr:0.0000100 epoch_Time:24586.0min: [2024-01-05 22:28:01,267][model8_pretrain.py][INFO] Epoch:[0/2](724300/4588595) loss:2.803 lr:0.0000100 
epoch_Time:24586.0min: [2024-01-05 22:28:01,267][model8_pretrain.py][INFO] Epoch:[0/2](724300/4588595) loss:2.790 lr:0.0000100 epoch_Time:24586.0min: [2024-01-05 22:28:01,267][model8_pretrain.py][INFO] Epoch:[0/2](724300/4588595) loss:2.800 lr:0.0000100 epoch_Time:24586.0min: [2024-01-05 22:28:01,267][model8_pretrain.py][INFO] Epoch:[0/2](724300/4588595) loss:2.607 lr:0.0000100 epoch_Time:24586.0min: [2024-01-05 22:28:01,267][model8_pretrain.py][INFO] Epoch:[0/2](724300/4588595) loss:2.549 lr:0.0000100 epoch_Time:24586.0min: [2024-01-05 22:28:01,267][model8_pretrain.py][INFO] Epoch:[0/2](724300/4588595) loss:3.142 lr:0.0000100 epoch_Time:24586.0min: [2024-01-05 22:28:01,267][model8_pretrain.py][INFO] Epoch:[0/2](724300/4588595) loss:2.335 lr:0.0000100 epoch_Time:24586.0min: [2024-01-05 22:28:38,210][model8_pretrain.py][INFO] Epoch:[0/2](724400/4588595) loss:3.055 lr:0.0000100 epoch_Time:24585.0min: [2024-01-05 22:28:38,210][model8_pretrain.py][INFO] Epoch:[0/2](724400/4588595) loss:2.739 lr:0.0000100 epoch_Time:24585.0min: [2024-01-05 22:28:38,210][model8_pretrain.py][INFO] Epoch:[0/2](724400/4588595) loss:2.577 lr:0.0000100 epoch_Time:24585.0min: [2024-01-05 22:28:38,210][model8_pretrain.py][INFO] Epoch:[0/2](724400/4588595) loss:2.616 lr:0.0000100 epoch_Time:24585.0min: [2024-01-05 22:28:38,210][model8_pretrain.py][INFO] Epoch:[0/2](724400/4588595) loss:2.523 lr:0.0000100 epoch_Time:24585.0min: [2024-01-05 22:28:38,210][model8_pretrain.py][INFO] Epoch:[0/2](724400/4588595) loss:1.915 lr:0.0000100 epoch_Time:24585.0min: [2024-01-05 22:28:38,210][model8_pretrain.py][INFO] Epoch:[0/2](724400/4588595) loss:2.829 lr:0.0000100 epoch_Time:24585.0min: [2024-01-05 22:28:38,211][model8_pretrain.py][INFO] Epoch:[0/2](724400/4588595) loss:3.195 lr:0.0000100 epoch_Time:24585.0min: [2024-01-05 22:29:15,155][model8_pretrain.py][INFO] Epoch:[0/2](724500/4588595) loss:3.012 lr:0.0000100 epoch_Time:24584.0min: [2024-01-05 22:29:15,155][model8_pretrain.py][INFO] 
Epoch:[0/2](724500/4588595) loss:3.384 lr:0.0000100 epoch_Time:24584.0min: [2024-01-05 22:29:15,155][model8_pretrain.py][INFO] Epoch:[0/2](724500/4588595) loss:2.775 lr:0.0000100 epoch_Time:24584.0min: [2024-01-05 22:29:15,155][model8_pretrain.py][INFO] Epoch:[0/2](724500/4588595) loss:2.806 lr:0.0000100 epoch_Time:24584.0min: [2024-01-05 22:29:15,155][model8_pretrain.py][INFO] Epoch:[0/2](724500/4588595) loss:2.776 lr:0.0000100 epoch_Time:24584.0min: [2024-01-05 22:29:15,155][model8_pretrain.py][INFO] Epoch:[0/2](724500/4588595) loss:2.662 lr:0.0000100 epoch_Time:24584.0min: [2024-01-05 22:29:15,156][model8_pretrain.py][INFO] Epoch:[0/2](724500/4588595) loss:2.988 lr:0.0000100 epoch_Time:24584.0min: [2024-01-05 22:29:15,156][model8_pretrain.py][INFO] Epoch:[0/2](724500/4588595) loss:3.083 lr:0.0000100 epoch_Time:24584.0min: [2024-01-05 22:29:52,108][model8_pretrain.py][INFO] Epoch:[0/2](724600/4588595) loss:3.101 lr:0.0000100 epoch_Time:24583.0min: [2024-01-05 22:29:52,108][model8_pretrain.py][INFO] Epoch:[0/2](724600/4588595) loss:3.037 lr:0.0000100 epoch_Time:24583.0min: [2024-01-05 22:29:52,108][model8_pretrain.py][INFO] Epoch:[0/2](724600/4588595) loss:2.540 lr:0.0000100 epoch_Time:24583.0min: [2024-01-05 22:29:52,108][model8_pretrain.py][INFO] Epoch:[0/2](724600/4588595) loss:3.157 lr:0.0000100 epoch_Time:24583.0min: [2024-01-05 22:29:52,108][model8_pretrain.py][INFO] Epoch:[0/2](724600/4588595) loss:3.113 lr:0.0000100 epoch_Time:24583.0min: [2024-01-05 22:29:52,108][model8_pretrain.py][INFO] Epoch:[0/2](724600/4588595) loss:2.749 lr:0.0000100 epoch_Time:24583.0min: [2024-01-05 22:29:52,108][model8_pretrain.py][INFO] Epoch:[0/2](724600/4588595) loss:2.877 lr:0.0000100 epoch_Time:24583.0min: [2024-01-05 22:29:52,108][model8_pretrain.py][INFO] Epoch:[0/2](724600/4588595) loss:2.271 lr:0.0000100 epoch_Time:24583.0min: [2024-01-05 22:30:29,032][model8_pretrain.py][INFO] Epoch:[0/2](724700/4588595) loss:2.858 lr:0.0000100 epoch_Time:24583.0min: [2024-01-05 
22:30:29,032][model8_pretrain.py][INFO] Epoch:[0/2](724700/4588595) loss:2.934 lr:0.0000100 epoch_Time:24583.0min: [2024-01-05 22:30:29,032][model8_pretrain.py][INFO] Epoch:[0/2](724700/4588595) loss:2.927 lr:0.0000100 epoch_Time:24583.0min: [2024-01-05 22:30:29,032][model8_pretrain.py][INFO] Epoch:[0/2](724700/4588595) loss:2.576 lr:0.0000100 epoch_Time:24583.0min: [2024-01-05 22:30:29,032][model8_pretrain.py][INFO] Epoch:[0/2](724700/4588595) loss:2.759 lr:0.0000100 epoch_Time:24583.0min: [2024-01-05 22:30:29,033][model8_pretrain.py][INFO] Epoch:[0/2](724700/4588595) loss:3.024 lr:0.0000100 epoch_Time:24583.0min: [2024-01-05 22:30:29,032][model8_pretrain.py][INFO] Epoch:[0/2](724700/4588595) loss:2.803 lr:0.0000100 epoch_Time:24583.0min: [2024-01-05 22:30:29,032][model8_pretrain.py][INFO] Epoch:[0/2](724700/4588595) loss:3.460 lr:0.0000100 epoch_Time:24583.0min: [2024-01-05 22:31:17,826][model8_pretrain.py][INFO] Epoch:[0/2](724800/4588595) loss:2.956 lr:0.0000100 epoch_Time:24583.0min: [2024-01-05 22:31:17,826][model8_pretrain.py][INFO] Epoch:[0/2](724800/4588595) loss:2.523 lr:0.0000100 epoch_Time:24583.0min: [2024-01-05 22:31:17,826][model8_pretrain.py][INFO] Epoch:[0/2](724800/4588595) loss:2.269 lr:0.0000100 epoch_Time:24583.0min: [2024-01-05 22:31:17,826][model8_pretrain.py][INFO] Epoch:[0/2](724800/4588595) loss:2.398 lr:0.0000100 epoch_Time:24583.0min: [2024-01-05 22:31:17,826][model8_pretrain.py][INFO] Epoch:[0/2](724800/4588595) loss:2.422 lr:0.0000100 epoch_Time:24583.0min: [2024-01-05 22:31:17,826][model8_pretrain.py][INFO] Epoch:[0/2](724800/4588595) loss:2.846 lr:0.0000100 epoch_Time:24583.0min: [2024-01-05 22:31:17,826][model8_pretrain.py][INFO] Epoch:[0/2](724800/4588595) loss:3.308 lr:0.0000100 epoch_Time:24583.0min: [2024-01-05 22:31:17,826][model8_pretrain.py][INFO] Epoch:[0/2](724800/4588595) loss:2.837 lr:0.0000100 epoch_Time:24583.0min: [2024-01-05 22:31:54,773][model8_pretrain.py][INFO] Epoch:[0/2](724900/4588595) loss:2.672 lr:0.0000100 
epoch_Time:24582.0min: [2024-01-05 22:31:54,773][model8_pretrain.py][INFO] Epoch:[0/2](724900/4588595) loss:3.026 lr:0.0000100 epoch_Time:24582.0min: [2024-01-05 22:31:54,774][model8_pretrain.py][INFO] Epoch:[0/2](724900/4588595) loss:3.040 lr:0.0000100 epoch_Time:24582.0min: [2024-01-05 22:31:54,774][model8_pretrain.py][INFO] Epoch:[0/2](724900/4588595) loss:2.936 lr:0.0000100 epoch_Time:24582.0min: [2024-01-05 22:31:54,774][model8_pretrain.py][INFO] Epoch:[0/2](724900/4588595) loss:2.858 lr:0.0000100 epoch_Time:24582.0min: [2024-01-05 22:31:54,774][model8_pretrain.py][INFO] Epoch:[0/2](724900/4588595) loss:2.672 lr:0.0000100 epoch_Time:24582.0min: [2024-01-05 22:31:54,774][model8_pretrain.py][INFO] Epoch:[0/2](724900/4588595) loss:2.705 lr:0.0000100 epoch_Time:24582.0min: [2024-01-05 22:31:54,774][model8_pretrain.py][INFO] Epoch:[0/2](724900/4588595) loss:2.727 lr:0.0000100 epoch_Time:24582.0min: [2024-01-05 22:32:31,713][model8_pretrain.py][INFO] Epoch:[0/2](725000/4588595) loss:2.380 lr:0.0000100 epoch_Time:24582.0min: [2024-01-05 22:32:31,713][model8_pretrain.py][INFO] Epoch:[0/2](725000/4588595) loss:2.361 lr:0.0000100 epoch_Time:24582.0min: [2024-01-05 22:32:31,713][model8_pretrain.py][INFO] Epoch:[0/2](725000/4588595) loss:2.887 lr:0.0000100 epoch_Time:24582.0min: [2024-01-05 22:32:31,713][model8_pretrain.py][INFO] Epoch:[0/2](725000/4588595) loss:2.708 lr:0.0000100 epoch_Time:24582.0min: [2024-01-05 22:32:31,713][model8_pretrain.py][INFO] Epoch:[0/2](725000/4588595) loss:2.905 lr:0.0000100 epoch_Time:24582.0min: [2024-01-05 22:32:31,713][model8_pretrain.py][INFO] Epoch:[0/2](725000/4588595) loss:2.561 lr:0.0000100 epoch_Time:24582.0min: [2024-01-05 22:32:31,713][model8_pretrain.py][INFO] Epoch:[0/2](725000/4588595) loss:2.156 lr:0.0000100 epoch_Time:24582.0min: [2024-01-05 22:32:31,713][model8_pretrain.py][INFO] Epoch:[0/2](725000/4588595) loss:3.100 lr:0.0000100 epoch_Time:24582.0min: [2024-01-05 22:33:08,658][model8_pretrain.py][INFO] 
Epoch:[0/2](725100/4588595) loss:2.414 lr:0.0000100 epoch_Time:24581.0min: [2024-01-05 22:33:08,658][model8_pretrain.py][INFO] Epoch:[0/2](725100/4588595) loss:2.626 lr:0.0000100 epoch_Time:24581.0min: [2024-01-05 22:33:08,658][model8_pretrain.py][INFO] Epoch:[0/2](725100/4588595) loss:2.823 lr:0.0000100 epoch_Time:24581.0min: [2024-01-05 22:33:08,658][model8_pretrain.py][INFO] Epoch:[0/2](725100/4588595) loss:3.081 lr:0.0000100 epoch_Time:24581.0min: [2024-01-05 22:33:08,658][model8_pretrain.py][INFO] Epoch:[0/2](725100/4588595) loss:2.621 lr:0.0000100 epoch_Time:24581.0min: [2024-01-05 22:33:08,658][model8_pretrain.py][INFO] Epoch:[0/2](725100/4588595) loss:2.570 lr:0.0000100 epoch_Time:24581.0min: [2024-01-05 22:33:08,658][model8_pretrain.py][INFO] Epoch:[0/2](725100/4588595) loss:2.928 lr:0.0000100 epoch_Time:24581.0min: [2024-01-05 22:33:08,658][model8_pretrain.py][INFO] Epoch:[0/2](725100/4588595) loss:3.214 lr:0.0000100 epoch_Time:24581.0min: [2024-01-05 22:33:45,600][model8_pretrain.py][INFO] Epoch:[0/2](725200/4588595) loss:2.892 lr:0.0000100 epoch_Time:24581.0min: [2024-01-05 22:33:45,600][model8_pretrain.py][INFO] Epoch:[0/2](725200/4588595) loss:2.888 lr:0.0000100 epoch_Time:24581.0min: [2024-01-05 22:33:45,600][model8_pretrain.py][INFO] Epoch:[0/2](725200/4588595) loss:3.188 lr:0.0000100 epoch_Time:24581.0min: [2024-01-05 22:33:45,600][model8_pretrain.py][INFO] Epoch:[0/2](725200/4588595) loss:3.116 lr:0.0000100 epoch_Time:24581.0min: [2024-01-05 22:33:45,601][model8_pretrain.py][INFO] Epoch:[0/2](725200/4588595) loss:2.985 lr:0.0000100 epoch_Time:24581.0min: [2024-01-05 22:33:45,601][model8_pretrain.py][INFO] Epoch:[0/2](725200/4588595) loss:2.644 lr:0.0000100 epoch_Time:24581.0min: [2024-01-05 22:33:45,601][model8_pretrain.py][INFO] Epoch:[0/2](725200/4588595) loss:2.366 lr:0.0000100 epoch_Time:24581.0min: [2024-01-05 22:33:45,601][model8_pretrain.py][INFO] Epoch:[0/2](725200/4588595) loss:3.049 lr:0.0000100 epoch_Time:24581.0min: [2024-01-05 
22:34:22,560][model8_pretrain.py][INFO] Epoch:[0/2](725300/4588595) loss:3.447 lr:0.0000100 epoch_Time:24580.0min: [2024-01-05 22:34:22,560][model8_pretrain.py][INFO] Epoch:[0/2](725300/4588595) loss:2.296 lr:0.0000100 epoch_Time:24580.0min: [2024-01-05 22:34:22,560][model8_pretrain.py][INFO] Epoch:[0/2](725300/4588595) loss:2.904 lr:0.0000100 epoch_Time:24580.0min: [2024-01-05 22:34:22,560][model8_pretrain.py][INFO] Epoch:[0/2](725300/4588595) loss:3.014 lr:0.0000100 epoch_Time:24580.0min: [2024-01-05 22:34:22,560][model8_pretrain.py][INFO] Epoch:[0/2](725300/4588595) loss:3.155 lr:0.0000100 epoch_Time:24580.0min: [2024-01-05 22:34:22,560][model8_pretrain.py][INFO] Epoch:[0/2](725300/4588595) loss:3.022 lr:0.0000100 epoch_Time:24580.0min: [2024-01-05 22:34:22,560][model8_pretrain.py][INFO] Epoch:[0/2](725300/4588595) loss:2.693 lr:0.0000100 epoch_Time:24580.0min: [2024-01-05 22:34:22,561][model8_pretrain.py][INFO] Epoch:[0/2](725300/4588595) loss:2.583 lr:0.0000100 epoch_Time:24580.0min: [2024-01-05 22:34:59,496][model8_pretrain.py][INFO] Epoch:[0/2](725400/4588595) loss:2.973 lr:0.0000100 epoch_Time:24578.0min: [2024-01-05 22:34:59,496][model8_pretrain.py][INFO] Epoch:[0/2](725400/4588595) loss:2.929 lr:0.0000100 epoch_Time:24578.0min: [2024-01-05 22:34:59,496][model8_pretrain.py][INFO] Epoch:[0/2](725400/4588595) loss:2.836 lr:0.0000100 epoch_Time:24578.0min: [2024-01-05 22:34:59,496][model8_pretrain.py][INFO] Epoch:[0/2](725400/4588595) loss:2.724 lr:0.0000100 epoch_Time:24578.0min: [2024-01-05 22:34:59,496][model8_pretrain.py][INFO] Epoch:[0/2](725400/4588595) loss:2.753 lr:0.0000100 epoch_Time:24578.0min: [2024-01-05 22:34:59,496][model8_pretrain.py][INFO] Epoch:[0/2](725400/4588595) loss:2.796 lr:0.0000100 epoch_Time:24578.0min: [2024-01-05 22:34:59,497][model8_pretrain.py][INFO] Epoch:[0/2](725400/4588595) loss:3.578 lr:0.0000100 epoch_Time:24578.0min: [2024-01-05 22:34:59,497][model8_pretrain.py][INFO] Epoch:[0/2](725400/4588595) loss:2.514 lr:0.0000100 
epoch_Time:24578.0min: [2024-01-05 22:35:36,428][model8_pretrain.py][INFO] Epoch:[0/2](725500/4588595) loss:2.713 lr:0.0000100 epoch_Time:24578.0min: [2024-01-05 22:35:36,428][model8_pretrain.py][INFO] Epoch:[0/2](725500/4588595) loss:2.482 lr:0.0000100 epoch_Time:24578.0min: [2024-01-05 22:35:36,428][model8_pretrain.py][INFO] Epoch:[0/2](725500/4588595) loss:2.243 lr:0.0000100 epoch_Time:24578.0min: [2024-01-05 22:35:36,428][model8_pretrain.py][INFO] Epoch:[0/2](725500/4588595) loss:2.776 lr:0.0000100 epoch_Time:24578.0min: [2024-01-05 22:35:36,428][model8_pretrain.py][INFO] Epoch:[0/2](725500/4588595) loss:2.896 lr:0.0000100 epoch_Time:24578.0min: [2024-01-05 22:35:36,428][model8_pretrain.py][INFO] Epoch:[0/2](725500/4588595) loss:2.370 lr:0.0000100 epoch_Time:24578.0min: [2024-01-05 22:35:36,428][model8_pretrain.py][INFO] Epoch:[0/2](725500/4588595) loss:2.813 lr:0.0000100 epoch_Time:24578.0min: [2024-01-05 22:35:36,428][model8_pretrain.py][INFO] Epoch:[0/2](725500/4588595) loss:2.527 lr:0.0000100 epoch_Time:24578.0min: [2024-01-05 22:36:25,150][model8_pretrain.py][INFO] Epoch:[0/2](725600/4588595) loss:2.609 lr:0.0000100 epoch_Time:24578.0min: [2024-01-05 22:36:25,150][model8_pretrain.py][INFO] Epoch:[0/2](725600/4588595) loss:2.361 lr:0.0000100 epoch_Time:24578.0min: [2024-01-05 22:36:25,150][model8_pretrain.py][INFO] Epoch:[0/2](725600/4588595) loss:2.420 lr:0.0000100 epoch_Time:24578.0min: [2024-01-05 22:36:25,150][model8_pretrain.py][INFO] Epoch:[0/2](725600/4588595) loss:2.600 lr:0.0000100 epoch_Time:24578.0min: [2024-01-05 22:36:25,150][model8_pretrain.py][INFO] Epoch:[0/2](725600/4588595) loss:3.213 lr:0.0000100 epoch_Time:24578.0min: [2024-01-05 22:36:25,150][model8_pretrain.py][INFO] Epoch:[0/2](725600/4588595) loss:2.847 lr:0.0000100 epoch_Time:24578.0min: [2024-01-05 22:36:25,150][model8_pretrain.py][INFO] Epoch:[0/2](725600/4588595) loss:2.747 lr:0.0000100 epoch_Time:24578.0min: [2024-01-05 22:36:25,150][model8_pretrain.py][INFO] 
Epoch:[0/2](725600/4588595) loss:2.982 lr:0.0000100 epoch_Time:24578.0min: [2024-01-05 22:37:02,080][model8_pretrain.py][INFO] Epoch:[0/2](725700/4588595) loss:2.630 lr:0.0000100 epoch_Time:24577.0min: [2024-01-05 22:37:02,080][model8_pretrain.py][INFO] Epoch:[0/2](725700/4588595) loss:2.450 lr:0.0000100 epoch_Time:24577.0min: [2024-01-05 22:37:02,080][model8_pretrain.py][INFO] Epoch:[0/2](725700/4588595) loss:2.701 lr:0.0000100 epoch_Time:24577.0min: [2024-01-05 22:37:02,080][model8_pretrain.py][INFO] Epoch:[0/2](725700/4588595) loss:3.587 lr:0.0000100 epoch_Time:24577.0min: [2024-01-05 22:37:02,080][model8_pretrain.py][INFO] Epoch:[0/2](725700/4588595) loss:2.717 lr:0.0000100 epoch_Time:24577.0min: [2024-01-05 22:37:02,080][model8_pretrain.py][INFO] Epoch:[0/2](725700/4588595) loss:2.476 lr:0.0000100 epoch_Time:24577.0min: [2024-01-05 22:37:02,080][model8_pretrain.py][INFO] Epoch:[0/2](725700/4588595) loss:2.927 lr:0.0000100 epoch_Time:24577.0min: [2024-01-05 22:37:02,080][model8_pretrain.py][INFO] Epoch:[0/2](725700/4588595) loss:2.933 lr:0.0000100 epoch_Time:24577.0min: [2024-01-05 22:37:39,015][model8_pretrain.py][INFO] Epoch:[0/2](725800/4588595) loss:3.133 lr:0.0000100 epoch_Time:24577.0min: [2024-01-05 22:37:39,015][model8_pretrain.py][INFO] Epoch:[0/2](725800/4588595) loss:2.866 lr:0.0000100 epoch_Time:24577.0min: [2024-01-05 22:37:39,015][model8_pretrain.py][INFO] Epoch:[0/2](725800/4588595) loss:2.725 lr:0.0000100 epoch_Time:24577.0min: [2024-01-05 22:37:39,015][model8_pretrain.py][INFO] Epoch:[0/2](725800/4588595) loss:3.063 lr:0.0000100 epoch_Time:24577.0min: [2024-01-05 22:37:39,015][model8_pretrain.py][INFO] Epoch:[0/2](725800/4588595) loss:2.668 lr:0.0000100 epoch_Time:24577.0min: [2024-01-05 22:37:39,015][model8_pretrain.py][INFO] Epoch:[0/2](725800/4588595) loss:2.879 lr:0.0000100 epoch_Time:24577.0min: [2024-01-05 22:37:39,015][model8_pretrain.py][INFO] Epoch:[0/2](725800/4588595) loss:2.786 lr:0.0000100 epoch_Time:24577.0min: [2024-01-05 
22:37:39,016][model8_pretrain.py][INFO] Epoch:[0/2](725800/4588595) loss:2.755 lr:0.0000100 epoch_Time:24577.0min: [2024-01-05 22:38:15,964][model8_pretrain.py][INFO] Epoch:[0/2](725900/4588595) loss:2.773 lr:0.0000100 epoch_Time:24576.0min: [2024-01-05 22:38:15,964][model8_pretrain.py][INFO] Epoch:[0/2](725900/4588595) loss:3.129 lr:0.0000100 epoch_Time:24576.0min: [2024-01-05 22:38:15,964][model8_pretrain.py][INFO] Epoch:[0/2](725900/4588595) loss:2.889 lr:0.0000100 epoch_Time:24576.0min: [2024-01-05 22:38:15,964][model8_pretrain.py][INFO] Epoch:[0/2](725900/4588595) loss:2.967 lr:0.0000100 epoch_Time:24576.0min: [2024-01-05 22:38:15,964][model8_pretrain.py][INFO] Epoch:[0/2](725900/4588595) loss:2.695 lr:0.0000100 epoch_Time:24576.0min: [2024-01-05 22:38:15,964][model8_pretrain.py][INFO] Epoch:[0/2](725900/4588595) loss:2.759 lr:0.0000100 epoch_Time:24576.0min: [2024-01-05 22:38:15,964][model8_pretrain.py][INFO] Epoch:[0/2](725900/4588595) loss:2.868 lr:0.0000100 epoch_Time:24576.0min: [2024-01-05 22:38:15,964][model8_pretrain.py][INFO] Epoch:[0/2](725900/4588595) loss:2.648 lr:0.0000100 epoch_Time:24576.0min: [2024-01-05 22:38:52,904][model8_pretrain.py][INFO] Epoch:[0/2](726000/4588595) loss:3.161 lr:0.0000100 epoch_Time:24575.0min: [2024-01-05 22:38:52,904][model8_pretrain.py][INFO] Epoch:[0/2](726000/4588595) loss:2.918 lr:0.0000100 epoch_Time:24575.0min: [2024-01-05 22:38:52,904][model8_pretrain.py][INFO] Epoch:[0/2](726000/4588595) loss:2.554 lr:0.0000100 epoch_Time:24575.0min: [2024-01-05 22:38:52,904][model8_pretrain.py][INFO] Epoch:[0/2](726000/4588595) loss:3.189 lr:0.0000100 epoch_Time:24575.0min: [2024-01-05 22:38:52,904][model8_pretrain.py][INFO] Epoch:[0/2](726000/4588595) loss:2.220 lr:0.0000100 epoch_Time:24575.0min: [2024-01-05 22:38:52,904][model8_pretrain.py][INFO] Epoch:[0/2](726000/4588595) loss:2.835 lr:0.0000100 epoch_Time:24575.0min: [2024-01-05 22:38:52,904][model8_pretrain.py][INFO] Epoch:[0/2](726000/4588595) loss:2.991 lr:0.0000100 
epoch_Time:24575.0min: [2024-01-05 22:38:52,904][model8_pretrain.py][INFO] Epoch:[0/2](726000/4588595) loss:2.359 lr:0.0000100 epoch_Time:24575.0min: [2024-01-05 22:39:29,843][model8_pretrain.py][INFO] Epoch:[0/2](726100/4588595) loss:3.189 lr:0.0000100 epoch_Time:24575.0min: [2024-01-05 22:39:29,843][model8_pretrain.py][INFO] Epoch:[0/2](726100/4588595) loss:2.435 lr:0.0000100 epoch_Time:24575.0min: [2024-01-05 22:39:29,843][model8_pretrain.py][INFO] Epoch:[0/2](726100/4588595) loss:2.971 lr:0.0000100 epoch_Time:24575.0min: [2024-01-05 22:39:29,843][model8_pretrain.py][INFO] Epoch:[0/2](726100/4588595) loss:3.299 lr:0.0000100 epoch_Time:24575.0min: [2024-01-05 22:39:29,843][model8_pretrain.py][INFO] Epoch:[0/2](726100/4588595) loss:2.853 lr:0.0000100 epoch_Time:24575.0min: [2024-01-05 22:39:29,843][model8_pretrain.py][INFO] Epoch:[0/2](726100/4588595) loss:3.092 lr:0.0000100 epoch_Time:24575.0min: [2024-01-05 22:39:29,843][model8_pretrain.py][INFO] Epoch:[0/2](726100/4588595) loss:2.729 lr:0.0000100 epoch_Time:24575.0min: [2024-01-05 22:39:29,844][model8_pretrain.py][INFO] Epoch:[0/2](726100/4588595) loss:2.870 lr:0.0000100 epoch_Time:24575.0min: [2024-01-05 22:40:06,770][model8_pretrain.py][INFO] Epoch:[0/2](726200/4588595) loss:2.410 lr:0.0000100 epoch_Time:24574.0min: [2024-01-05 22:40:06,770][model8_pretrain.py][INFO] Epoch:[0/2](726200/4588595) loss:2.499 lr:0.0000100 epoch_Time:24574.0min: [2024-01-05 22:40:06,770][model8_pretrain.py][INFO] Epoch:[0/2](726200/4588595) loss:2.669 lr:0.0000100 epoch_Time:24574.0min: [2024-01-05 22:40:06,770][model8_pretrain.py][INFO] Epoch:[0/2](726200/4588595) loss:3.021 lr:0.0000100 epoch_Time:24574.0min: [2024-01-05 22:40:06,770][model8_pretrain.py][INFO] Epoch:[0/2](726200/4588595) loss:3.001 lr:0.0000100 epoch_Time:24574.0min: [2024-01-05 22:40:06,770][model8_pretrain.py][INFO] Epoch:[0/2](726200/4588595) loss:2.604 lr:0.0000100 epoch_Time:24574.0min: [2024-01-05 22:40:06,770][model8_pretrain.py][INFO] 
Epoch:[0/2](726200/4588595) loss:2.297 lr:0.0000100 epoch_Time:24574.0min: [2024-01-05 22:40:06,770][model8_pretrain.py][INFO] Epoch:[0/2](726200/4588595) loss:2.741 lr:0.0000100 epoch_Time:24574.0min: [2024-01-05 22:40:43,714][model8_pretrain.py][INFO] Epoch:[0/2](726300/4588595) loss:2.894 lr:0.0000100 epoch_Time:24573.0min: [2024-01-05 22:40:43,714][model8_pretrain.py][INFO] Epoch:[0/2](726300/4588595) loss:2.819 lr:0.0000100 epoch_Time:24573.0min: [2024-01-05 22:40:43,714][model8_pretrain.py][INFO] Epoch:[0/2](726300/4588595) loss:2.433 lr:0.0000100 epoch_Time:24573.0min: [2024-01-05 22:40:43,714][model8_pretrain.py][INFO] Epoch:[0/2](726300/4588595) loss:3.140 lr:0.0000100 epoch_Time:24573.0min: [2024-01-05 22:40:43,714][model8_pretrain.py][INFO] Epoch:[0/2](726300/4588595) loss:2.236 lr:0.0000100 epoch_Time:24573.0min: [2024-01-05 22:40:43,714][model8_pretrain.py][INFO] Epoch:[0/2](726300/4588595) loss:3.069 lr:0.0000100 epoch_Time:24573.0min: [2024-01-05 22:40:43,714][model8_pretrain.py][INFO] Epoch:[0/2](726300/4588595) loss:3.317 lr:0.0000100 epoch_Time:24573.0min: [2024-01-05 22:40:43,714][model8_pretrain.py][INFO] Epoch:[0/2](726300/4588595) loss:2.223 lr:0.0000100 epoch_Time:24573.0min: [2024-01-05 22:41:32,575][model8_pretrain.py][INFO] Epoch:[0/2](726400/4588595) loss:2.849 lr:0.0000100 epoch_Time:24574.0min: [2024-01-05 22:41:32,575][model8_pretrain.py][INFO] Epoch:[0/2](726400/4588595) loss:2.896 lr:0.0000100 epoch_Time:24574.0min: [2024-01-05 22:41:32,575][model8_pretrain.py][INFO] Epoch:[0/2](726400/4588595) loss:2.750 lr:0.0000100 epoch_Time:24574.0min: [2024-01-05 22:41:32,575][model8_pretrain.py][INFO] Epoch:[0/2](726400/4588595) loss:2.797 lr:0.0000100 epoch_Time:24574.0min: [2024-01-05 22:41:32,575][model8_pretrain.py][INFO] Epoch:[0/2](726400/4588595) loss:2.891 lr:0.0000100 epoch_Time:24574.0min: [2024-01-05 22:41:32,575][model8_pretrain.py][INFO] Epoch:[0/2](726400/4588595) loss:2.653 lr:0.0000100 epoch_Time:24574.0min: [2024-01-05 
22:41:32,575][model8_pretrain.py][INFO] Epoch:[0/2](726400/4588595) loss:3.095 lr:0.0000100 epoch_Time:24574.0min: [2024-01-05 22:41:32,575][model8_pretrain.py][INFO] Epoch:[0/2](726400/4588595) loss:3.351 lr:0.0000100 epoch_Time:24574.0min: [2024-01-05 22:42:09,515][model8_pretrain.py][INFO] Epoch:[0/2](726500/4588595) loss:3.254 lr:0.0000100 epoch_Time:24572.0min: [2024-01-05 22:42:09,515][model8_pretrain.py][INFO] Epoch:[0/2](726500/4588595) loss:3.416 lr:0.0000100 epoch_Time:24572.0min: [2024-01-05 22:42:09,515][model8_pretrain.py][INFO] Epoch:[0/2](726500/4588595) loss:3.074 lr:0.0000100 epoch_Time:24572.0min: [2024-01-05 22:42:09,515][model8_pretrain.py][INFO] Epoch:[0/2](726500/4588595) loss:2.759 lr:0.0000100 epoch_Time:24572.0min: [2024-01-05 22:42:09,515][model8_pretrain.py][INFO] Epoch:[0/2](726500/4588595) loss:2.484 lr:0.0000100 epoch_Time:24572.0min: [2024-01-05 22:42:09,515][model8_pretrain.py][INFO] Epoch:[0/2](726500/4588595) loss:3.076 lr:0.0000100 epoch_Time:24572.0min: [2024-01-05 22:42:09,515][model8_pretrain.py][INFO] Epoch:[0/2](726500/4588595) loss:3.123 lr:0.0000100 epoch_Time:24572.0min: [2024-01-05 22:42:09,515][model8_pretrain.py][INFO] Epoch:[0/2](726500/4588595) loss:2.905 lr:0.0000100 epoch_Time:24572.0min: [2024-01-05 22:42:46,443][model8_pretrain.py][INFO] Epoch:[0/2](726600/4588595) loss:2.932 lr:0.0000100 epoch_Time:24572.0min: [2024-01-05 22:42:46,443][model8_pretrain.py][INFO] Epoch:[0/2](726600/4588595) loss:2.789 lr:0.0000100 epoch_Time:24572.0min: [2024-01-05 22:42:46,443][model8_pretrain.py][INFO] Epoch:[0/2](726600/4588595) loss:3.546 lr:0.0000100 epoch_Time:24572.0min: [2024-01-05 22:42:46,443][model8_pretrain.py][INFO] Epoch:[0/2](726600/4588595) loss:2.311 lr:0.0000100 epoch_Time:24572.0min: [2024-01-05 22:42:46,443][model8_pretrain.py][INFO] Epoch:[0/2](726600/4588595) loss:2.472 lr:0.0000100 epoch_Time:24572.0min: [2024-01-05 22:42:46,443][model8_pretrain.py][INFO] Epoch:[0/2](726600/4588595) loss:2.588 lr:0.0000100 
epoch_Time:24572.0min: [2024-01-05 22:42:46,443][model8_pretrain.py][INFO] Epoch:[0/2](726600/4588595) loss:2.680 lr:0.0000100 epoch_Time:24572.0min: [2024-01-05 22:42:46,443][model8_pretrain.py][INFO] Epoch:[0/2](726600/4588595) loss:2.493 lr:0.0000100 epoch_Time:24572.0min: [2024-01-05 22:43:23,391][model8_pretrain.py][INFO] Epoch:[0/2](726700/4588595) loss:2.942 lr:0.0000100 epoch_Time:24571.0min: [2024-01-05 22:43:23,391][model8_pretrain.py][INFO] Epoch:[0/2](726700/4588595) loss:2.795 lr:0.0000100 epoch_Time:24571.0min: [2024-01-05 22:43:23,391][model8_pretrain.py][INFO] Epoch:[0/2](726700/4588595) loss:3.006 lr:0.0000100 epoch_Time:24571.0min: [2024-01-05 22:43:23,391][model8_pretrain.py][INFO] Epoch:[0/2](726700/4588595) loss:2.710 lr:0.0000100 epoch_Time:24571.0min: [2024-01-05 22:43:23,391][model8_pretrain.py][INFO] Epoch:[0/2](726700/4588595) loss:3.181 lr:0.0000100 epoch_Time:24571.0min: [2024-01-05 22:43:23,392][model8_pretrain.py][INFO] Epoch:[0/2](726700/4588595) loss:2.115 lr:0.0000100 epoch_Time:24571.0min: [2024-01-05 22:43:23,392][model8_pretrain.py][INFO] Epoch:[0/2](726700/4588595) loss:2.848 lr:0.0000100 epoch_Time:24571.0min: [2024-01-05 22:43:23,393][model8_pretrain.py][INFO] Epoch:[0/2](726700/4588595) loss:3.335 lr:0.0000100 epoch_Time:24571.0min: [2024-01-05 22:44:00,330][model8_pretrain.py][INFO] Epoch:[0/2](726800/4588595) loss:2.307 lr:0.0000100 epoch_Time:24570.0min: [2024-01-05 22:44:00,330][model8_pretrain.py][INFO] Epoch:[0/2](726800/4588595) loss:2.858 lr:0.0000100 epoch_Time:24570.0min: [2024-01-05 22:44:00,330][model8_pretrain.py][INFO] Epoch:[0/2](726800/4588595) loss:2.935 lr:0.0000100 epoch_Time:24570.0min: [2024-01-05 22:44:00,330][model8_pretrain.py][INFO] Epoch:[0/2](726800/4588595) loss:2.876 lr:0.0000100 epoch_Time:24570.0min: [2024-01-05 22:44:00,330][model8_pretrain.py][INFO] Epoch:[0/2](726800/4588595) loss:2.530 lr:0.0000100 epoch_Time:24570.0min: [2024-01-05 22:44:00,330][model8_pretrain.py][INFO] 
Epoch:[0/2](726800/4588595) loss:2.250 lr:0.0000100 epoch_Time:24570.0min: [2024-01-05 22:44:00,330][model8_pretrain.py][INFO] Epoch:[0/2](726800/4588595) loss:2.341 lr:0.0000100 epoch_Time:24570.0min: [2024-01-05 22:44:00,330][model8_pretrain.py][INFO] Epoch:[0/2](726800/4588595) loss:3.076 lr:0.0000100 epoch_Time:24570.0min: [2024-01-05 22:44:37,270][model8_pretrain.py][INFO] Epoch:[0/2](726900/4588595) loss:2.902 lr:0.0000100 epoch_Time:24570.0min: [2024-01-05 22:44:37,270][model8_pretrain.py][INFO] Epoch:[0/2](726900/4588595) loss:2.975 lr:0.0000100 epoch_Time:24570.0min: [2024-01-05 22:44:37,270][model8_pretrain.py][INFO] Epoch:[0/2](726900/4588595) loss:2.965 lr:0.0000100 epoch_Time:24570.0min: [2024-01-05 22:44:37,270][model8_pretrain.py][INFO] Epoch:[0/2](726900/4588595) loss:2.626 lr:0.0000100 epoch_Time:24570.0min: [2024-01-05 22:44:37,270][model8_pretrain.py][INFO] Epoch:[0/2](726900/4588595) loss:2.704 lr:0.0000100 epoch_Time:24570.0min: [2024-01-05 22:44:37,270][model8_pretrain.py][INFO] Epoch:[0/2](726900/4588595) loss:2.696 lr:0.0000100 epoch_Time:24570.0min: [2024-01-05 22:44:37,270][model8_pretrain.py][INFO] Epoch:[0/2](726900/4588595) loss:2.916 lr:0.0000100 epoch_Time:24570.0min: [2024-01-05 22:44:37,270][model8_pretrain.py][INFO] Epoch:[0/2](726900/4588595) loss:2.711 lr:0.0000100 epoch_Time:24570.0min: [2024-01-05 22:45:14,199][model8_pretrain.py][INFO] Epoch:[0/2](727000/4588595) loss:2.962 lr:0.0000100 epoch_Time:24569.0min: [2024-01-05 22:45:14,199][model8_pretrain.py][INFO] Epoch:[0/2](727000/4588595) loss:3.009 lr:0.0000100 epoch_Time:24569.0min: [2024-01-05 22:45:14,199][model8_pretrain.py][INFO] Epoch:[0/2](727000/4588595) loss:2.892 lr:0.0000100 epoch_Time:24569.0min: [2024-01-05 22:45:14,199][model8_pretrain.py][INFO] Epoch:[0/2](727000/4588595) loss:2.701 lr:0.0000100 epoch_Time:24569.0min: [2024-01-05 22:45:14,199][model8_pretrain.py][INFO] Epoch:[0/2](727000/4588595) loss:2.096 lr:0.0000100 epoch_Time:24569.0min: [2024-01-05 
22:45:14,199][model8_pretrain.py][INFO] Epoch:[0/2](727000/4588595) loss:2.706 lr:0.0000100 epoch_Time:24569.0min: [2024-01-05 22:45:14,199][model8_pretrain.py][INFO] Epoch:[0/2](727000/4588595) loss:2.222 lr:0.0000100 epoch_Time:24569.0min: [2024-01-05 22:45:14,199][model8_pretrain.py][INFO] Epoch:[0/2](727000/4588595) loss:2.787 lr:0.0000100 epoch_Time:24569.0min: [2024-01-05 22:45:51,133][model8_pretrain.py][INFO] Epoch:[0/2](727100/4588595) loss:2.805 lr:0.0000100 epoch_Time:24568.0min: [2024-01-05 22:45:51,133][model8_pretrain.py][INFO] Epoch:[0/2](727100/4588595) loss:3.271 lr:0.0000100 epoch_Time:24568.0min: [2024-01-05 22:45:51,133][model8_pretrain.py][INFO] Epoch:[0/2](727100/4588595) loss:2.868 lr:0.0000100 epoch_Time:24568.0min: [2024-01-05 22:45:51,133][model8_pretrain.py][INFO] Epoch:[0/2](727100/4588595) loss:2.848 lr:0.0000100 epoch_Time:24568.0min: [2024-01-05 22:45:51,133][model8_pretrain.py][INFO] Epoch:[0/2](727100/4588595) loss:2.146 lr:0.0000100 epoch_Time:24568.0min: [2024-01-05 22:45:51,133][model8_pretrain.py][INFO] Epoch:[0/2](727100/4588595) loss:3.182 lr:0.0000100 epoch_Time:24568.0min: [2024-01-05 22:45:51,133][model8_pretrain.py][INFO] Epoch:[0/2](727100/4588595) loss:2.667 lr:0.0000100 epoch_Time:24568.0min: [2024-01-05 22:45:51,133][model8_pretrain.py][INFO] Epoch:[0/2](727100/4588595) loss:2.980 lr:0.0000100 epoch_Time:24568.0min: [2024-01-05 22:46:40,225][model8_pretrain.py][INFO] Epoch:[0/2](727200/4588595) loss:2.861 lr:0.0000100 epoch_Time:24569.0min: [2024-01-05 22:46:40,225][model8_pretrain.py][INFO] Epoch:[0/2](727200/4588595) loss:2.874 lr:0.0000100 epoch_Time:24569.0min: [2024-01-05 22:46:40,225][model8_pretrain.py][INFO] Epoch:[0/2](727200/4588595) loss:3.300 lr:0.0000100 epoch_Time:24569.0min: [2024-01-05 22:46:40,225][model8_pretrain.py][INFO] Epoch:[0/2](727200/4588595) loss:2.619 lr:0.0000100 epoch_Time:24569.0min: [2024-01-05 22:46:40,225][model8_pretrain.py][INFO] Epoch:[0/2](727200/4588595) loss:2.749 lr:0.0000100 
epoch_Time:24569.0min: [2024-01-05 22:46:40,225][model8_pretrain.py][INFO] Epoch:[0/2](727200/4588595) loss:2.751 lr:0.0000100 epoch_Time:24569.0min: [2024-01-05 22:46:40,225][model8_pretrain.py][INFO] Epoch:[0/2](727200/4588595) loss:3.305 lr:0.0000100 epoch_Time:24569.0min: [2024-01-05 22:46:40,225][model8_pretrain.py][INFO] Epoch:[0/2](727200/4588595) loss:2.657 lr:0.0000100 epoch_Time:24569.0min: [2024-01-05 22:47:17,150][model8_pretrain.py][INFO] Epoch:[0/2](727300/4588595) loss:2.606 lr:0.0000100 epoch_Time:24568.0min: [2024-01-05 22:47:17,150][model8_pretrain.py][INFO] Epoch:[0/2](727300/4588595) loss:2.999 lr:0.0000100 epoch_Time:24568.0min: [2024-01-05 22:47:17,150][model8_pretrain.py][INFO] Epoch:[0/2](727300/4588595) loss:2.964 lr:0.0000100 epoch_Time:24568.0min: [2024-01-05 22:47:17,150][model8_pretrain.py][INFO] Epoch:[0/2](727300/4588595) loss:2.901 lr:0.0000100 epoch_Time:24568.0min: [2024-01-05 22:47:17,151][model8_pretrain.py][INFO] Epoch:[0/2](727300/4588595) loss:2.682 lr:0.0000100 epoch_Time:24568.0min: [2024-01-05 22:47:17,151][model8_pretrain.py][INFO] Epoch:[0/2](727300/4588595) loss:3.177 lr:0.0000100 epoch_Time:24568.0min: [2024-01-05 22:47:17,151][model8_pretrain.py][INFO] Epoch:[0/2](727300/4588595) loss:2.865 lr:0.0000100 epoch_Time:24568.0min: [2024-01-05 22:47:17,151][model8_pretrain.py][INFO] Epoch:[0/2](727300/4588595) loss:2.791 lr:0.0000100 epoch_Time:24568.0min: [2024-01-05 22:47:54,092][model8_pretrain.py][INFO] Epoch:[0/2](727400/4588595) loss:2.721 lr:0.0000100 epoch_Time:24567.0min: [2024-01-05 22:47:54,092][model8_pretrain.py][INFO] Epoch:[0/2](727400/4588595) loss:2.598 lr:0.0000100 epoch_Time:24567.0min: [2024-01-05 22:47:54,092][model8_pretrain.py][INFO] Epoch:[0/2](727400/4588595) loss:3.106 lr:0.0000100 epoch_Time:24567.0min: [2024-01-05 22:47:54,092][model8_pretrain.py][INFO] Epoch:[0/2](727400/4588595) loss:2.533 lr:0.0000100 epoch_Time:24567.0min: [2024-01-05 22:47:54,092][model8_pretrain.py][INFO] 
Epoch:[0/2](727400/4588595) loss:2.189 lr:0.0000100 epoch_Time:24567.0min: [2024-01-05 22:47:54,092][model8_pretrain.py][INFO] Epoch:[0/2](727400/4588595) loss:3.005 lr:0.0000100 epoch_Time:24567.0min: [2024-01-05 22:47:54,092][model8_pretrain.py][INFO] Epoch:[0/2](727400/4588595) loss:2.936 lr:0.0000100 epoch_Time:24567.0min: [2024-01-05 22:47:54,092][model8_pretrain.py][INFO] Epoch:[0/2](727400/4588595) loss:2.644 lr:0.0000100 epoch_Time:24567.0min: [2024-01-05 22:48:31,033][model8_pretrain.py][INFO] Epoch:[0/2](727500/4588595) loss:1.945 lr:0.0000100 epoch_Time:24566.0min: [2024-01-05 22:48:31,033][model8_pretrain.py][INFO] Epoch:[0/2](727500/4588595) loss:3.028 lr:0.0000100 epoch_Time:24566.0min: [2024-01-05 22:48:31,033][model8_pretrain.py][INFO] Epoch:[0/2](727500/4588595) loss:2.705 lr:0.0000100 epoch_Time:24566.0min: [2024-01-05 22:48:31,033][model8_pretrain.py][INFO] Epoch:[0/2](727500/4588595) loss:2.830 lr:0.0000100 epoch_Time:24566.0min: [2024-01-05 22:48:31,033][model8_pretrain.py][INFO] Epoch:[0/2](727500/4588595) loss:2.879 lr:0.0000100 epoch_Time:24566.0min: [2024-01-05 22:48:31,033][model8_pretrain.py][INFO] Epoch:[0/2](727500/4588595) loss:2.758 lr:0.0000100 epoch_Time:24566.0min: [2024-01-05 22:48:31,033][model8_pretrain.py][INFO] Epoch:[0/2](727500/4588595) loss:2.719 lr:0.0000100 epoch_Time:24566.0min: [2024-01-05 22:48:31,033][model8_pretrain.py][INFO] Epoch:[0/2](727500/4588595) loss:3.035 lr:0.0000100 epoch_Time:24566.0min: [2024-01-05 22:49:07,963][model8_pretrain.py][INFO] Epoch:[0/2](727600/4588595) loss:3.023 lr:0.0000100 epoch_Time:24565.0min: [2024-01-05 22:49:07,963][model8_pretrain.py][INFO] Epoch:[0/2](727600/4588595) loss:3.095 lr:0.0000100 epoch_Time:24565.0min: [2024-01-05 22:49:07,963][model8_pretrain.py][INFO] Epoch:[0/2](727600/4588595) loss:2.570 lr:0.0000100 epoch_Time:24565.0min: [2024-01-05 22:49:07,963][model8_pretrain.py][INFO] Epoch:[0/2](727600/4588595) loss:3.234 lr:0.0000100 epoch_Time:24565.0min: [2024-01-05 
22:49:07,963][model8_pretrain.py][INFO] Epoch:[0/2](727600/4588595) loss:2.370 lr:0.0000100 epoch_Time:24565.0min: [2024-01-05 22:49:07,963][model8_pretrain.py][INFO] Epoch:[0/2](727600/4588595) loss:2.870 lr:0.0000100 epoch_Time:24565.0min: [2024-01-05 22:49:07,963][model8_pretrain.py][INFO] Epoch:[0/2](727600/4588595) loss:3.013 lr:0.0000100 epoch_Time:24565.0min: [2024-01-05 22:49:07,963][model8_pretrain.py][INFO] Epoch:[0/2](727600/4588595) loss:3.144 lr:0.0000100 epoch_Time:24565.0min: [2024-01-05 22:49:44,905][model8_pretrain.py][INFO] Epoch:[0/2](727700/4588595) loss:3.252 lr:0.0000100 epoch_Time:24565.0min: [2024-01-05 22:49:44,905][model8_pretrain.py][INFO] Epoch:[0/2](727700/4588595) loss:2.716 lr:0.0000100 epoch_Time:24565.0min: [2024-01-05 22:49:44,905][model8_pretrain.py][INFO] Epoch:[0/2](727700/4588595) loss:2.872 lr:0.0000100 epoch_Time:24565.0min: [2024-01-05 22:49:44,905][model8_pretrain.py][INFO] Epoch:[0/2](727700/4588595) loss:3.166 lr:0.0000100 epoch_Time:24565.0min: [2024-01-05 22:49:44,905][model8_pretrain.py][INFO] Epoch:[0/2](727700/4588595) loss:3.214 lr:0.0000100 epoch_Time:24565.0min: [2024-01-05 22:49:44,905][model8_pretrain.py][INFO] Epoch:[0/2](727700/4588595) loss:3.030 lr:0.0000100 epoch_Time:24565.0min: [2024-01-05 22:49:44,905][model8_pretrain.py][INFO] Epoch:[0/2](727700/4588595) loss:2.934 lr:0.0000100 epoch_Time:24565.0min: [2024-01-05 22:49:44,905][model8_pretrain.py][INFO] Epoch:[0/2](727700/4588595) loss:2.924 lr:0.0000100 epoch_Time:24565.0min: [2024-01-05 22:50:21,839][model8_pretrain.py][INFO] Epoch:[0/2](727800/4588595) loss:2.951 lr:0.0000100 epoch_Time:24564.0min: [2024-01-05 22:50:21,839][model8_pretrain.py][INFO] Epoch:[0/2](727800/4588595) loss:3.333 lr:0.0000100 epoch_Time:24564.0min: [2024-01-05 22:50:21,839][model8_pretrain.py][INFO] Epoch:[0/2](727800/4588595) loss:2.853 lr:0.0000100 epoch_Time:24564.0min: [2024-01-05 22:50:21,839][model8_pretrain.py][INFO] Epoch:[0/2](727800/4588595) loss:2.538 lr:0.0000100 
epoch_Time:24564.0min: [2024-01-05 22:50:21,839][model8_pretrain.py][INFO] Epoch:[0/2](727800/4588595) loss:2.913 lr:0.0000100 epoch_Time:24564.0min: [2024-01-05 22:50:21,839][model8_pretrain.py][INFO] Epoch:[0/2](727800/4588595) loss:2.438 lr:0.0000100 epoch_Time:24564.0min: [2024-01-05 22:50:21,839][model8_pretrain.py][INFO] Epoch:[0/2](727800/4588595) loss:2.918 lr:0.0000100 epoch_Time:24564.0min: [2024-01-05 22:50:21,839][model8_pretrain.py][INFO] Epoch:[0/2](727800/4588595) loss:2.743 lr:0.0000100 epoch_Time:24564.0min: [2024-01-05 22:50:58,775][model8_pretrain.py][INFO] Epoch:[0/2](727900/4588595) loss:2.418 lr:0.0000100 epoch_Time:24563.0min: [2024-01-05 22:50:58,775][model8_pretrain.py][INFO] Epoch:[0/2](727900/4588595) loss:2.778 lr:0.0000100 epoch_Time:24563.0min: [2024-01-05 22:50:58,775][model8_pretrain.py][INFO] Epoch:[0/2](727900/4588595) loss:3.054 lr:0.0000100 epoch_Time:24563.0min: [2024-01-05 22:50:58,775][model8_pretrain.py][INFO] Epoch:[0/2](727900/4588595) loss:2.795 lr:0.0000100 epoch_Time:24563.0min: [2024-01-05 22:50:58,775][model8_pretrain.py][INFO] Epoch:[0/2](727900/4588595) loss:3.036 lr:0.0000100 epoch_Time:24563.0min: [2024-01-05 22:50:58,775][model8_pretrain.py][INFO] Epoch:[0/2](727900/4588595) loss:2.031 lr:0.0000100 epoch_Time:24563.0min: [2024-01-05 22:50:58,775][model8_pretrain.py][INFO] Epoch:[0/2](727900/4588595) loss:2.654 lr:0.0000100 epoch_Time:24563.0min: [2024-01-05 22:50:58,775][model8_pretrain.py][INFO] Epoch:[0/2](727900/4588595) loss:2.954 lr:0.0000100 epoch_Time:24563.0min: [2024-01-05 22:51:47,762][model8_pretrain.py][INFO] Epoch:[0/2](728000/4588595) loss:2.751 lr:0.0000100 epoch_Time:24563.0min: [2024-01-05 22:51:47,762][model8_pretrain.py][INFO] Epoch:[0/2](728000/4588595) loss:3.268 lr:0.0000100 epoch_Time:24563.0min: [2024-01-05 22:51:47,762][model8_pretrain.py][INFO] Epoch:[0/2](728000/4588595) loss:2.788 lr:0.0000100 epoch_Time:24563.0min: [2024-01-05 22:51:47,762][model8_pretrain.py][INFO] 
Epoch:[0/2](728000/4588595) loss:2.316 lr:0.0000100 epoch_Time:24563.0min: [2024-01-05 22:51:47,762][model8_pretrain.py][INFO] Epoch:[0/2](728000/4588595) loss:2.602 lr:0.0000100 epoch_Time:24563.0min: [2024-01-05 22:51:47,762][model8_pretrain.py][INFO] Epoch:[0/2](728000/4588595) loss:2.157 lr:0.0000100 epoch_Time:24563.0min: [2024-01-05 22:51:47,762][model8_pretrain.py][INFO] Epoch:[0/2](728000/4588595) loss:2.324 lr:0.0000100 epoch_Time:24563.0min: [2024-01-05 22:51:47,762][model8_pretrain.py][INFO] Epoch:[0/2](728000/4588595) loss:2.692 lr:0.0000100 epoch_Time:24563.0min: [2024-01-05 22:52:24,699][model8_pretrain.py][INFO] Epoch:[0/2](728100/4588595) loss:2.828 lr:0.0000100 epoch_Time:24563.0min: [2024-01-05 22:52:24,699][model8_pretrain.py][INFO] Epoch:[0/2](728100/4588595) loss:2.612 lr:0.0000100 epoch_Time:24563.0min: [2024-01-05 22:52:24,699][model8_pretrain.py][INFO] Epoch:[0/2](728100/4588595) loss:2.641 lr:0.0000100 epoch_Time:24563.0min: [2024-01-05 22:52:24,699][model8_pretrain.py][INFO] Epoch:[0/2](728100/4588595) loss:2.094 lr:0.0000100 epoch_Time:24563.0min: [2024-01-05 22:52:24,699][model8_pretrain.py][INFO] Epoch:[0/2](728100/4588595) loss:2.841 lr:0.0000100 epoch_Time:24563.0min: [2024-01-05 22:52:24,699][model8_pretrain.py][INFO] Epoch:[0/2](728100/4588595) loss:2.965 lr:0.0000100 epoch_Time:24563.0min: [2024-01-05 22:52:24,699][model8_pretrain.py][INFO] Epoch:[0/2](728100/4588595) loss:3.229 lr:0.0000100 epoch_Time:24563.0min: [2024-01-05 22:52:24,699][model8_pretrain.py][INFO] Epoch:[0/2](728100/4588595) loss:2.414 lr:0.0000100 epoch_Time:24563.0min: [2024-01-05 22:53:01,652][model8_pretrain.py][INFO] Epoch:[0/2](728200/4588595) loss:3.265 lr:0.0000100 epoch_Time:24562.0min: [2024-01-05 22:53:01,652][model8_pretrain.py][INFO] Epoch:[0/2](728200/4588595) loss:3.260 lr:0.0000100 epoch_Time:24562.0min: [2024-01-05 22:53:01,652][model8_pretrain.py][INFO] Epoch:[0/2](728200/4588595) loss:3.149 lr:0.0000100 epoch_Time:24562.0min: [2024-01-05 
22:53:01,652][model8_pretrain.py][INFO] Epoch:[0/2](728200/4588595) loss:2.393 lr:0.0000100 epoch_Time:24562.0min: [2024-01-05 22:53:01,652][model8_pretrain.py][INFO] Epoch:[0/2](728200/4588595) loss:3.106 lr:0.0000100 epoch_Time:24562.0min: [2024-01-05 22:53:01,652][model8_pretrain.py][INFO] Epoch:[0/2](728200/4588595) loss:2.745 lr:0.0000100 epoch_Time:24562.0min: [2024-01-05 22:53:01,652][model8_pretrain.py][INFO] Epoch:[0/2](728200/4588595) loss:3.339 lr:0.0000100 epoch_Time:24562.0min: [2024-01-05 22:53:01,652][model8_pretrain.py][INFO] Epoch:[0/2](728200/4588595) loss:2.345 lr:0.0000100 epoch_Time:24562.0min: [2024-01-05 22:53:38,594][model8_pretrain.py][INFO] Epoch:[0/2](728300/4588595) loss:2.839 lr:0.0000100 epoch_Time:24562.0min: [2024-01-05 22:53:38,594][model8_pretrain.py][INFO] Epoch:[0/2](728300/4588595) loss:3.160 lr:0.0000100 epoch_Time:24562.0min: [2024-01-05 22:53:38,594][model8_pretrain.py][INFO] Epoch:[0/2](728300/4588595) loss:2.612 lr:0.0000100 epoch_Time:24562.0min: [2024-01-05 22:53:38,594][model8_pretrain.py][INFO] Epoch:[0/2](728300/4588595) loss:3.136 lr:0.0000100 epoch_Time:24562.0min: [2024-01-05 22:53:38,594][model8_pretrain.py][INFO] Epoch:[0/2](728300/4588595) loss:2.945 lr:0.0000100 epoch_Time:24562.0min: [2024-01-05 22:53:38,594][model8_pretrain.py][INFO] Epoch:[0/2](728300/4588595) loss:2.458 lr:0.0000100 epoch_Time:24562.0min: [2024-01-05 22:53:38,594][model8_pretrain.py][INFO] Epoch:[0/2](728300/4588595) loss:2.711 lr:0.0000100 epoch_Time:24562.0min: [2024-01-05 22:53:38,594][model8_pretrain.py][INFO] Epoch:[0/2](728300/4588595) loss:3.068 lr:0.0000100 epoch_Time:24562.0min: [2024-01-05 22:54:15,548][model8_pretrain.py][INFO] Epoch:[0/2](728400/4588595) loss:2.976 lr:0.0000100 epoch_Time:24561.0min: [2024-01-05 22:54:15,548][model8_pretrain.py][INFO] Epoch:[0/2](728400/4588595) loss:3.226 lr:0.0000100 epoch_Time:24561.0min: [2024-01-05 22:54:15,548][model8_pretrain.py][INFO] Epoch:[0/2](728400/4588595) loss:2.585 lr:0.0000100 
epoch_Time:24561.0min: [2024-01-05 22:54:15,548][model8_pretrain.py][INFO] Epoch:[0/2](728400/4588595) loss:2.886 lr:0.0000100 epoch_Time:24561.0min: [2024-01-05 22:54:15,548][model8_pretrain.py][INFO] Epoch:[0/2](728400/4588595) loss:2.520 lr:0.0000100 epoch_Time:24561.0min: [2024-01-05 22:54:15,548][model8_pretrain.py][INFO] Epoch:[0/2](728400/4588595) loss:3.097 lr:0.0000100 epoch_Time:24561.0min: [2024-01-05 22:54:15,549][model8_pretrain.py][INFO] Epoch:[0/2](728400/4588595) loss:2.483 lr:0.0000100 epoch_Time:24561.0min: [2024-01-05 22:54:15,549][model8_pretrain.py][INFO] Epoch:[0/2](728400/4588595) loss:2.717 lr:0.0000100 epoch_Time:24561.0min: [2024-01-05 22:54:52,508][model8_pretrain.py][INFO] Epoch:[0/2](728500/4588595) loss:3.181 lr:0.0000100 epoch_Time:24559.0min: [2024-01-05 22:54:52,508][model8_pretrain.py][INFO] Epoch:[0/2](728500/4588595) loss:2.566 lr:0.0000100 epoch_Time:24559.0min: [2024-01-05 22:54:52,508][model8_pretrain.py][INFO] Epoch:[0/2](728500/4588595) loss:2.767 lr:0.0000100 epoch_Time:24559.0min: [2024-01-05 22:54:52,508][model8_pretrain.py][INFO] Epoch:[0/2](728500/4588595) loss:2.943 lr:0.0000100 epoch_Time:24559.0min: [2024-01-05 22:54:52,508][model8_pretrain.py][INFO] Epoch:[0/2](728500/4588595) loss:2.989 lr:0.0000100 epoch_Time:24559.0min: [2024-01-05 22:54:52,509][model8_pretrain.py][INFO] Epoch:[0/2](728500/4588595) loss:2.572 lr:0.0000100 epoch_Time:24559.0min: [2024-01-05 22:54:52,509][model8_pretrain.py][INFO] Epoch:[0/2](728500/4588595) loss:2.696 lr:0.0000100 epoch_Time:24559.0min: [2024-01-05 22:54:52,509][model8_pretrain.py][INFO] Epoch:[0/2](728500/4588595) loss:2.347 lr:0.0000100 epoch_Time:24559.0min: [2024-01-05 22:55:29,445][model8_pretrain.py][INFO] Epoch:[0/2](728600/4588595) loss:2.710 lr:0.0000100 epoch_Time:24559.0min: [2024-01-05 22:55:29,445][model8_pretrain.py][INFO] Epoch:[0/2](728600/4588595) loss:2.870 lr:0.0000100 epoch_Time:24559.0min: [2024-01-05 22:55:29,445][model8_pretrain.py][INFO] 
Epoch:[0/2](728600/4588595) loss:2.543 lr:0.0000100 epoch_Time:24559.0min: [2024-01-05 22:55:29,445][model8_pretrain.py][INFO] Epoch:[0/2](728600/4588595) loss:2.872 lr:0.0000100 epoch_Time:24559.0min: [2024-01-05 22:55:29,445][model8_pretrain.py][INFO] Epoch:[0/2](728600/4588595) loss:2.915 lr:0.0000100 epoch_Time:24559.0min: [2024-01-05 22:55:29,445][model8_pretrain.py][INFO] Epoch:[0/2](728600/4588595) loss:2.715 lr:0.0000100 epoch_Time:24559.0min: [2024-01-05 22:55:29,445][model8_pretrain.py][INFO] Epoch:[0/2](728600/4588595) loss:2.530 lr:0.0000100 epoch_Time:24559.0min: [2024-01-05 22:55:29,446][model8_pretrain.py][INFO] Epoch:[0/2](728600/4588595) loss:2.796 lr:0.0000100 epoch_Time:24559.0min: [2024-01-05 22:56:06,387][model8_pretrain.py][INFO] Epoch:[0/2](728700/4588595) loss:2.757 lr:0.0000100 epoch_Time:24558.0min: [2024-01-05 22:56:06,387][model8_pretrain.py][INFO] Epoch:[0/2](728700/4588595) loss:2.793 lr:0.0000100 epoch_Time:24558.0min: [2024-01-05 22:56:06,387][model8_pretrain.py][INFO] Epoch:[0/2](728700/4588595) loss:2.942 lr:0.0000100 epoch_Time:24558.0min: [2024-01-05 22:56:06,387][model8_pretrain.py][INFO] Epoch:[0/2](728700/4588595) loss:3.152 lr:0.0000100 epoch_Time:24558.0min: [2024-01-05 22:56:06,387][model8_pretrain.py][INFO] Epoch:[0/2](728700/4588595) loss:3.024 lr:0.0000100 epoch_Time:24558.0min: [2024-01-05 22:56:06,387][model8_pretrain.py][INFO] Epoch:[0/2](728700/4588595) loss:2.468 lr:0.0000100 epoch_Time:24558.0min: [2024-01-05 22:56:06,387][model8_pretrain.py][INFO] Epoch:[0/2](728700/4588595) loss:3.140 lr:0.0000100 epoch_Time:24558.0min: [2024-01-05 22:56:06,387][model8_pretrain.py][INFO] Epoch:[0/2](728700/4588595) loss:2.652 lr:0.0000100 epoch_Time:24558.0min: [2024-01-05 22:56:55,385][model8_pretrain.py][INFO] Epoch:[0/2](728800/4588595) loss:2.429 lr:0.0000100 epoch_Time:24558.0min: [2024-01-05 22:56:55,385][model8_pretrain.py][INFO] Epoch:[0/2](728800/4588595) loss:2.673 lr:0.0000100 epoch_Time:24558.0min: [2024-01-05 
22:56:55,385][model8_pretrain.py][INFO] Epoch:[0/2](728800/4588595) loss:2.938 lr:0.0000100 epoch_Time:24558.0min: [2024-01-05 22:56:55,386][model8_pretrain.py][INFO] Epoch:[0/2](728800/4588595) loss:2.996 lr:0.0000100 epoch_Time:24558.0min: [2024-01-05 22:56:55,386][model8_pretrain.py][INFO] Epoch:[0/2](728800/4588595) loss:3.099 lr:0.0000100 epoch_Time:24558.0min: [2024-01-05 22:56:55,386][model8_pretrain.py][INFO] Epoch:[0/2](728800/4588595) loss:3.547 lr:0.0000100 epoch_Time:24558.0min: [2024-01-05 22:56:55,386][model8_pretrain.py][INFO] Epoch:[0/2](728800/4588595) loss:2.714 lr:0.0000100 epoch_Time:24558.0min: [2024-01-05 22:56:55,386][model8_pretrain.py][INFO] Epoch:[0/2](728800/4588595) loss:3.095 lr:0.0000100 epoch_Time:24558.0min: [2024-01-05 22:57:32,305][model8_pretrain.py][INFO] Epoch:[0/2](728900/4588595) loss:2.548 lr:0.0000100 epoch_Time:24558.0min: [2024-01-05 22:57:32,305][model8_pretrain.py][INFO] Epoch:[0/2](728900/4588595) loss:3.158 lr:0.0000100 epoch_Time:24558.0min: [2024-01-05 22:57:32,305][model8_pretrain.py][INFO] Epoch:[0/2](728900/4588595) loss:3.602 lr:0.0000100 epoch_Time:24558.0min: [2024-01-05 22:57:32,305][model8_pretrain.py][INFO] Epoch:[0/2](728900/4588595) loss:2.871 lr:0.0000100 epoch_Time:24558.0min: [2024-01-05 22:57:32,305][model8_pretrain.py][INFO] Epoch:[0/2](728900/4588595) loss:2.562 lr:0.0000100 epoch_Time:24558.0min: [2024-01-05 22:57:32,305][model8_pretrain.py][INFO] Epoch:[0/2](728900/4588595) loss:3.304 lr:0.0000100 epoch_Time:24558.0min: [2024-01-05 22:57:32,305][model8_pretrain.py][INFO] Epoch:[0/2](728900/4588595) loss:2.307 lr:0.0000100 epoch_Time:24558.0min: [2024-01-05 22:57:32,305][model8_pretrain.py][INFO] Epoch:[0/2](728900/4588595) loss:2.963 lr:0.0000100 epoch_Time:24558.0min: [2024-01-05 22:58:09,248][model8_pretrain.py][INFO] Epoch:[0/2](729000/4588595) loss:2.955 lr:0.0000100 epoch_Time:24557.0min: [2024-01-05 22:58:09,248][model8_pretrain.py][INFO] Epoch:[0/2](729000/4588595) loss:2.229 lr:0.0000100 
epoch_Time:24557.0min: [2024-01-05 22:58:09,248][model8_pretrain.py][INFO] Epoch:[0/2](729000/4588595) loss:3.187 lr:0.0000100 epoch_Time:24557.0min: [2024-01-05 22:58:09,248][model8_pretrain.py][INFO] Epoch:[0/2](729000/4588595) loss:3.278 lr:0.0000100 epoch_Time:24557.0min: [2024-01-05 22:58:09,248][model8_pretrain.py][INFO] Epoch:[0/2](729000/4588595) loss:3.300 lr:0.0000100 epoch_Time:24557.0min: [2024-01-05 22:58:09,248][model8_pretrain.py][INFO] Epoch:[0/2](729000/4588595) loss:2.858 lr:0.0000100 epoch_Time:24557.0min: [2024-01-05 22:58:09,248][model8_pretrain.py][INFO] Epoch:[0/2](729000/4588595) loss:2.462 lr:0.0000100 epoch_Time:24557.0min: [2024-01-05 22:58:09,249][model8_pretrain.py][INFO] Epoch:[0/2](729000/4588595) loss:3.063 lr:0.0000100 epoch_Time:24557.0min: [2024-01-05 22:58:46,184][model8_pretrain.py][INFO] Epoch:[0/2](729100/4588595) loss:3.278 lr:0.0000100 epoch_Time:24557.0min: [2024-01-05 22:58:46,184][model8_pretrain.py][INFO] Epoch:[0/2](729100/4588595) loss:2.789 lr:0.0000100 epoch_Time:24557.0min: [2024-01-05 22:58:46,184][model8_pretrain.py][INFO] Epoch:[0/2](729100/4588595) loss:3.151 lr:0.0000100 epoch_Time:24557.0min: [2024-01-05 22:58:46,184][model8_pretrain.py][INFO] Epoch:[0/2](729100/4588595) loss:2.840 lr:0.0000100 epoch_Time:24557.0min: [2024-01-05 22:58:46,184][model8_pretrain.py][INFO] Epoch:[0/2](729100/4588595) loss:2.506 lr:0.0000100 epoch_Time:24557.0min: [2024-01-05 22:58:46,184][model8_pretrain.py][INFO] Epoch:[0/2](729100/4588595) loss:2.253 lr:0.0000100 epoch_Time:24557.0min: [2024-01-05 22:58:46,184][model8_pretrain.py][INFO] Epoch:[0/2](729100/4588595) loss:3.209 lr:0.0000100 epoch_Time:24557.0min: [2024-01-05 22:58:46,185][model8_pretrain.py][INFO] Epoch:[0/2](729100/4588595) loss:2.682 lr:0.0000100 epoch_Time:24557.0min: [2024-01-05 22:59:23,114][model8_pretrain.py][INFO] Epoch:[0/2](729200/4588595) loss:2.662 lr:0.0000100 epoch_Time:24556.0min: [2024-01-05 22:59:23,114][model8_pretrain.py][INFO] 
Epoch:[0/2](729200/4588595) loss:2.755 lr:0.0000100 epoch_Time:24556.0min: [2024-01-05 22:59:23,114][model8_pretrain.py][INFO] Epoch:[0/2](729200/4588595) loss:2.736 lr:0.0000100 epoch_Time:24556.0min: [2024-01-05 22:59:23,114][model8_pretrain.py][INFO] Epoch:[0/2](729200/4588595) loss:2.749 lr:0.0000100 epoch_Time:24556.0min: [2024-01-05 22:59:23,114][model8_pretrain.py][INFO] Epoch:[0/2](729200/4588595) loss:2.840 lr:0.0000100 epoch_Time:24556.0min: [2024-01-05 22:59:23,114][model8_pretrain.py][INFO] Epoch:[0/2](729200/4588595) loss:2.626 lr:0.0000100 epoch_Time:24556.0min: [2024-01-05 22:59:23,114][model8_pretrain.py][INFO] Epoch:[0/2](729200/4588595) loss:3.105 lr:0.0000100 epoch_Time:24556.0min: [2024-01-05 22:59:23,115][model8_pretrain.py][INFO] Epoch:[0/2](729200/4588595) loss:2.553 lr:0.0000100 epoch_Time:24556.0min: [2024-01-05 23:00:00,052][model8_pretrain.py][INFO] Epoch:[0/2](729300/4588595) loss:2.879 lr:0.0000100 epoch_Time:24555.0min: [2024-01-05 23:00:00,052][model8_pretrain.py][INFO] Epoch:[0/2](729300/4588595) loss:2.551 lr:0.0000100 epoch_Time:24555.0min: [2024-01-05 23:00:00,052][model8_pretrain.py][INFO] Epoch:[0/2](729300/4588595) loss:2.902 lr:0.0000100 epoch_Time:24555.0min: [2024-01-05 23:00:00,052][model8_pretrain.py][INFO] Epoch:[0/2](729300/4588595) loss:3.030 lr:0.0000100 epoch_Time:24555.0min: [2024-01-05 23:00:00,052][model8_pretrain.py][INFO] Epoch:[0/2](729300/4588595) loss:2.056 lr:0.0000100 epoch_Time:24555.0min: [2024-01-05 23:00:00,052][model8_pretrain.py][INFO] Epoch:[0/2](729300/4588595) loss:2.590 lr:0.0000100 epoch_Time:24555.0min: [2024-01-05 23:00:00,052][model8_pretrain.py][INFO] Epoch:[0/2](729300/4588595) loss:3.131 lr:0.0000100 epoch_Time:24555.0min: [2024-01-05 23:00:00,052][model8_pretrain.py][INFO] Epoch:[0/2](729300/4588595) loss:2.927 lr:0.0000100 epoch_Time:24555.0min: [2024-01-05 23:00:36,994][model8_pretrain.py][INFO] Epoch:[0/2](729400/4588595) loss:2.983 lr:0.0000100 epoch_Time:24554.0min: [2024-01-05 
23:00:36,994][model8_pretrain.py][INFO] Epoch:[0/2](729400/4588595) loss:2.834 lr:0.0000100 epoch_Time:24554.0min: [2024-01-05 23:00:36,994][model8_pretrain.py][INFO] Epoch:[0/2](729400/4588595) loss:2.986 lr:0.0000100 epoch_Time:24554.0min: [2024-01-05 23:00:36,994][model8_pretrain.py][INFO] Epoch:[0/2](729400/4588595) loss:2.550 lr:0.0000100 epoch_Time:24554.0min: [2024-01-05 23:00:36,994][model8_pretrain.py][INFO] Epoch:[0/2](729400/4588595) loss:2.971 lr:0.0000100 epoch_Time:24554.0min: [2024-01-05 23:00:36,994][model8_pretrain.py][INFO] Epoch:[0/2](729400/4588595) loss:2.082 lr:0.0000100 epoch_Time:24554.0min: [2024-01-05 23:00:36,994][model8_pretrain.py][INFO] Epoch:[0/2](729400/4588595) loss:2.942 lr:0.0000100 epoch_Time:24554.0min: [2024-01-05 23:00:36,995][model8_pretrain.py][INFO] Epoch:[0/2](729400/4588595) loss:3.039 lr:0.0000100 epoch_Time:24554.0min: [2024-01-05 23:01:13,939][model8_pretrain.py][INFO] Epoch:[0/2](729500/4588595) loss:2.593 lr:0.0000100 epoch_Time:24553.0min: [2024-01-05 23:01:13,939][model8_pretrain.py][INFO] Epoch:[0/2](729500/4588595) loss:2.660 lr:0.0000100 epoch_Time:24553.0min: [2024-01-05 23:01:13,939][model8_pretrain.py][INFO] Epoch:[0/2](729500/4588595) loss:3.216 lr:0.0000100 epoch_Time:24553.0min: [2024-01-05 23:01:13,939][model8_pretrain.py][INFO] Epoch:[0/2](729500/4588595) loss:2.899 lr:0.0000100 epoch_Time:24553.0min: [2024-01-05 23:01:13,939][model8_pretrain.py][INFO] Epoch:[0/2](729500/4588595) loss:2.188 lr:0.0000100 epoch_Time:24553.0min: [2024-01-05 23:01:13,939][model8_pretrain.py][INFO] Epoch:[0/2](729500/4588595) loss:2.846 lr:0.0000100 epoch_Time:24553.0min: [2024-01-05 23:01:13,939][model8_pretrain.py][INFO] Epoch:[0/2](729500/4588595) loss:2.962 lr:0.0000100 epoch_Time:24553.0min: [2024-01-05 23:01:13,940][model8_pretrain.py][INFO] Epoch:[0/2](729500/4588595) loss:2.765 lr:0.0000100 epoch_Time:24553.0min: [2024-01-05 23:02:02,696][model8_pretrain.py][INFO] Epoch:[0/2](729600/4588595) loss:2.494 lr:0.0000100 
epoch_Time:24553.0min: [2024-01-05 23:02:02,696][model8_pretrain.py][INFO] Epoch:[0/2](729600/4588595) loss:2.407 lr:0.0000100 epoch_Time:24553.0min: [2024-01-05 23:02:02,696][model8_pretrain.py][INFO] Epoch:[0/2](729600/4588595) loss:3.333 lr:0.0000100 epoch_Time:24553.0min: [2024-01-05 23:02:02,696][model8_pretrain.py][INFO] Epoch:[0/2](729600/4588595) loss:2.838 lr:0.0000100 epoch_Time:24553.0min: [2024-01-05 23:02:02,696][model8_pretrain.py][INFO] Epoch:[0/2](729600/4588595) loss:3.083 lr:0.0000100 epoch_Time:24553.0min: [2024-01-05 23:02:02,696][model8_pretrain.py][INFO] Epoch:[0/2](729600/4588595) loss:3.062 lr:0.0000100 epoch_Time:24553.0min: [2024-01-05 23:02:02,696][model8_pretrain.py][INFO] Epoch:[0/2](729600/4588595) loss:3.078 lr:0.0000100 epoch_Time:24553.0min: [2024-01-05 23:02:02,697][model8_pretrain.py][INFO] Epoch:[0/2](729600/4588595) loss:2.754 lr:0.0000100 epoch_Time:24553.0min: [2024-01-05 23:02:39,629][model8_pretrain.py][INFO] Epoch:[0/2](729700/4588595) loss:3.142 lr:0.0000100 epoch_Time:24553.0min: [2024-01-05 23:02:39,629][model8_pretrain.py][INFO] Epoch:[0/2](729700/4588595) loss:3.115 lr:0.0000100 epoch_Time:24553.0min: [2024-01-05 23:02:39,629][model8_pretrain.py][INFO] Epoch:[0/2](729700/4588595) loss:2.907 lr:0.0000100 epoch_Time:24553.0min: [2024-01-05 23:02:39,629][model8_pretrain.py][INFO] Epoch:[0/2](729700/4588595) loss:3.031 lr:0.0000100 epoch_Time:24553.0min: [2024-01-05 23:02:39,629][model8_pretrain.py][INFO] Epoch:[0/2](729700/4588595) loss:2.857 lr:0.0000100 epoch_Time:24553.0min: [2024-01-05 23:02:39,629][model8_pretrain.py][INFO] Epoch:[0/2](729700/4588595) loss:2.392 lr:0.0000100 epoch_Time:24553.0min: [2024-01-05 23:02:39,629][model8_pretrain.py][INFO] Epoch:[0/2](729700/4588595) loss:2.607 lr:0.0000100 epoch_Time:24553.0min: [2024-01-05 23:02:39,629][model8_pretrain.py][INFO] Epoch:[0/2](729700/4588595) loss:3.159 lr:0.0000100 epoch_Time:24553.0min: [2024-01-05 23:03:16,572][model8_pretrain.py][INFO] 
Epoch:[0/2](729800/4588595) loss:2.786 lr:0.0000100 epoch_Time:24552.0min: [2024-01-05 23:03:16,572][model8_pretrain.py][INFO] Epoch:[0/2](729800/4588595) loss:1.949 lr:0.0000100 epoch_Time:24552.0min: [2024-01-05 23:03:16,572][model8_pretrain.py][INFO] Epoch:[0/2](729800/4588595) loss:2.504 lr:0.0000100 epoch_Time:24552.0min: [2024-01-05 23:03:16,572][model8_pretrain.py][INFO] Epoch:[0/2](729800/4588595) loss:2.836 lr:0.0000100 epoch_Time:24552.0min: [2024-01-05 23:03:16,572][model8_pretrain.py][INFO] Epoch:[0/2](729800/4588595) loss:2.758 lr:0.0000100 epoch_Time:24552.0min: [2024-01-05 23:03:16,572][model8_pretrain.py][INFO] Epoch:[0/2](729800/4588595) loss:3.115 lr:0.0000100 epoch_Time:24552.0min: [2024-01-05 23:03:16,572][model8_pretrain.py][INFO] Epoch:[0/2](729800/4588595) loss:2.554 lr:0.0000100 epoch_Time:24552.0min: [2024-01-05 23:03:16,572][model8_pretrain.py][INFO] Epoch:[0/2](729800/4588595) loss:3.013 lr:0.0000100 epoch_Time:24552.0min: [2024-01-05 23:03:53,518][model8_pretrain.py][INFO] Epoch:[0/2](729900/4588595) loss:2.869 lr:0.0000100 epoch_Time:24551.0min: [2024-01-05 23:03:53,518][model8_pretrain.py][INFO] Epoch:[0/2](729900/4588595) loss:2.680 lr:0.0000100 epoch_Time:24551.0min: [2024-01-05 23:03:53,518][model8_pretrain.py][INFO] Epoch:[0/2](729900/4588595) loss:3.571 lr:0.0000100 epoch_Time:24551.0min: [2024-01-05 23:03:53,518][model8_pretrain.py][INFO] Epoch:[0/2](729900/4588595) loss:2.462 lr:0.0000100 epoch_Time:24551.0min: [2024-01-05 23:03:53,518][model8_pretrain.py][INFO] Epoch:[0/2](729900/4588595) loss:2.479 lr:0.0000100 epoch_Time:24551.0min: [2024-01-05 23:03:53,518][model8_pretrain.py][INFO] Epoch:[0/2](729900/4588595) loss:2.464 lr:0.0000100 epoch_Time:24551.0min: [2024-01-05 23:03:53,518][model8_pretrain.py][INFO] Epoch:[0/2](729900/4588595) loss:2.675 lr:0.0000100 epoch_Time:24551.0min: [2024-01-05 23:03:53,518][model8_pretrain.py][INFO] Epoch:[0/2](729900/4588595) loss:3.064 lr:0.0000100 epoch_Time:24551.0min: [2024-01-05 
23:04:30,457][model8_pretrain.py][INFO] Epoch:[0/2](730000/4588595) loss:3.161 lr:0.0000100 epoch_Time:24551.0min: [2024-01-05 23:04:30,457][model8_pretrain.py][INFO] Epoch:[0/2](730000/4588595) loss:2.540 lr:0.0000100 epoch_Time:24551.0min: [2024-01-05 23:04:30,457][model8_pretrain.py][INFO] Epoch:[0/2](730000/4588595) loss:2.481 lr:0.0000100 epoch_Time:24551.0min: [2024-01-05 23:04:30,457][model8_pretrain.py][INFO] Epoch:[0/2](730000/4588595) loss:2.687 lr:0.0000100 epoch_Time:24551.0min: [2024-01-05 23:04:30,457][model8_pretrain.py][INFO] Epoch:[0/2](730000/4588595) loss:2.613 lr:0.0000100 epoch_Time:24551.0min: [2024-01-05 23:04:30,457][model8_pretrain.py][INFO] Epoch:[0/2](730000/4588595) loss:2.795 lr:0.0000100 epoch_Time:24551.0min: [2024-01-05 23:04:30,457][model8_pretrain.py][INFO] Epoch:[0/2](730000/4588595) loss:2.392 lr:0.0000100 epoch_Time:24551.0min: [2024-01-05 23:04:30,458][model8_pretrain.py][INFO] Epoch:[0/2](730000/4588595) loss:2.769 lr:0.0000100 epoch_Time:24551.0min: [2024-01-05 23:05:07,396][model8_pretrain.py][INFO] Epoch:[0/2](730100/4588595) loss:3.085 lr:0.0000100 epoch_Time:24550.0min: [2024-01-05 23:05:07,396][model8_pretrain.py][INFO] Epoch:[0/2](730100/4588595) loss:2.703 lr:0.0000100 epoch_Time:24550.0min: [2024-01-05 23:05:07,396][model8_pretrain.py][INFO] Epoch:[0/2](730100/4588595) loss:2.928 lr:0.0000100 epoch_Time:24550.0min: [2024-01-05 23:05:07,396][model8_pretrain.py][INFO] Epoch:[0/2](730100/4588595) loss:2.712 lr:0.0000100 epoch_Time:24550.0min: [2024-01-05 23:05:07,396][model8_pretrain.py][INFO] Epoch:[0/2](730100/4588595) loss:2.270 lr:0.0000100 epoch_Time:24550.0min: [2024-01-05 23:05:07,396][model8_pretrain.py][INFO] Epoch:[0/2](730100/4588595) loss:3.142 lr:0.0000100 epoch_Time:24550.0min: [2024-01-05 23:05:07,396][model8_pretrain.py][INFO] Epoch:[0/2](730100/4588595) loss:3.127 lr:0.0000100 epoch_Time:24550.0min: [2024-01-05 23:05:07,396][model8_pretrain.py][INFO] Epoch:[0/2](730100/4588595) loss:3.231 lr:0.0000100 
epoch_Time:24550.0min: [2024-01-05 23:05:44,333][model8_pretrain.py][INFO] Epoch:[0/2](730200/4588595) loss:2.823 lr:0.0000100 epoch_Time:24550.0min: [2024-01-05 23:05:44,333][model8_pretrain.py][INFO] Epoch:[0/2](730200/4588595) loss:2.230 lr:0.0000100 epoch_Time:24550.0min: [2024-01-05 23:05:44,333][model8_pretrain.py][INFO] Epoch:[0/2](730200/4588595) loss:3.315 lr:0.0000100 epoch_Time:24550.0min: [2024-01-05 23:05:44,333][model8_pretrain.py][INFO] Epoch:[0/2](730200/4588595) loss:2.826 lr:0.0000100 epoch_Time:24550.0min: [2024-01-05 23:05:44,333][model8_pretrain.py][INFO] Epoch:[0/2](730200/4588595) loss:3.429 lr:0.0000100 epoch_Time:24550.0min: [2024-01-05 23:05:44,333][model8_pretrain.py][INFO] Epoch:[0/2](730200/4588595) loss:3.232 lr:0.0000100 epoch_Time:24550.0min: [2024-01-05 23:05:44,333][model8_pretrain.py][INFO] Epoch:[0/2](730200/4588595) loss:2.905 lr:0.0000100 epoch_Time:24550.0min: [2024-01-05 23:05:44,333][model8_pretrain.py][INFO] Epoch:[0/2](730200/4588595) loss:2.615 lr:0.0000100 epoch_Time:24550.0min: [2024-01-05 23:06:21,290][model8_pretrain.py][INFO] Epoch:[0/2](730300/4588595) loss:2.882 lr:0.0000100 epoch_Time:24549.0min: [2024-01-05 23:06:21,290][model8_pretrain.py][INFO] Epoch:[0/2](730300/4588595) loss:2.938 lr:0.0000100 epoch_Time:24549.0min: [2024-01-05 23:06:21,290][model8_pretrain.py][INFO] Epoch:[0/2](730300/4588595) loss:2.940 lr:0.0000100 epoch_Time:24549.0min: [2024-01-05 23:06:21,290][model8_pretrain.py][INFO] Epoch:[0/2](730300/4588595) loss:3.282 lr:0.0000100 epoch_Time:24549.0min: [2024-01-05 23:06:21,290][model8_pretrain.py][INFO] Epoch:[0/2](730300/4588595) loss:2.840 lr:0.0000100 epoch_Time:24549.0min: [2024-01-05 23:06:21,290][model8_pretrain.py][INFO] Epoch:[0/2](730300/4588595) loss:2.663 lr:0.0000100 epoch_Time:24549.0min: [2024-01-05 23:06:21,290][model8_pretrain.py][INFO] Epoch:[0/2](730300/4588595) loss:3.236 lr:0.0000100 epoch_Time:24549.0min: [2024-01-05 23:06:21,290][model8_pretrain.py][INFO] 
Epoch:[0/2](730300/4588595) loss:2.958 lr:0.0000100 epoch_Time:24549.0min: [2024-01-05 23:07:08,439][model8_pretrain.py][INFO] Epoch:[0/2](730400/4588595) loss:2.702 lr:0.0000100 epoch_Time:24549.0min: [2024-01-05 23:07:08,439][model8_pretrain.py][INFO] Epoch:[0/2](730400/4588595) loss:2.566 lr:0.0000100 epoch_Time:24549.0min: [2024-01-05 23:07:08,439][model8_pretrain.py][INFO] Epoch:[0/2](730400/4588595) loss:2.656 lr:0.0000100 epoch_Time:24549.0min: [2024-01-05 23:07:08,440][model8_pretrain.py][INFO] Epoch:[0/2](730400/4588595) loss:2.975 lr:0.0000100 epoch_Time:24549.0min: [2024-01-05 23:07:08,440][model8_pretrain.py][INFO] Epoch:[0/2](730400/4588595) loss:2.895 lr:0.0000100 epoch_Time:24549.0min: [2024-01-05 23:07:08,440][model8_pretrain.py][INFO] Epoch:[0/2](730400/4588595) loss:2.695 lr:0.0000100 epoch_Time:24549.0min: [2024-01-05 23:07:08,440][model8_pretrain.py][INFO] Epoch:[0/2](730400/4588595) loss:2.862 lr:0.0000100 epoch_Time:24549.0min: [2024-01-05 23:07:10,130][model8_pretrain.py][INFO] Epoch:[0/2](730400/4588595) loss:3.155 lr:0.0000100 epoch_Time:24549.0min: [2024-01-05 23:07:47,059][model8_pretrain.py][INFO] Epoch:[0/2](730500/4588595) loss:2.424 lr:0.0000100 epoch_Time:24549.0min: [2024-01-05 23:07:47,059][model8_pretrain.py][INFO] Epoch:[0/2](730500/4588595) loss:3.298 lr:0.0000100 epoch_Time:24549.0min: [2024-01-05 23:07:47,059][model8_pretrain.py][INFO] Epoch:[0/2](730500/4588595) loss:3.175 lr:0.0000100 epoch_Time:24549.0min: [2024-01-05 23:07:47,059][model8_pretrain.py][INFO] Epoch:[0/2](730500/4588595) loss:3.004 lr:0.0000100 epoch_Time:24549.0min: [2024-01-05 23:07:47,059][model8_pretrain.py][INFO] Epoch:[0/2](730500/4588595) loss:3.254 lr:0.0000100 epoch_Time:24549.0min: [2024-01-05 23:07:47,059][model8_pretrain.py][INFO] Epoch:[0/2](730500/4588595) loss:3.168 lr:0.0000100 epoch_Time:24549.0min: [2024-01-05 23:07:47,059][model8_pretrain.py][INFO] Epoch:[0/2](730500/4588595) loss:2.909 lr:0.0000100 epoch_Time:24549.0min: [2024-01-05 
23:07:47,059][model8_pretrain.py][INFO] Epoch:[0/2](730500/4588595) loss:2.584 lr:0.0000100 epoch_Time:24549.0min: [2024-01-05 23:08:24,024][model8_pretrain.py][INFO] Epoch:[0/2](730600/4588595) loss:2.850 lr:0.0000100 epoch_Time:24547.0min: [2024-01-05 23:08:24,024][model8_pretrain.py][INFO] Epoch:[0/2](730600/4588595) loss:2.879 lr:0.0000100 epoch_Time:24547.0min: [2024-01-05 23:08:24,024][model8_pretrain.py][INFO] Epoch:[0/2](730600/4588595) loss:2.616 lr:0.0000100 epoch_Time:24547.0min: [2024-01-05 23:08:24,024][model8_pretrain.py][INFO] Epoch:[0/2](730600/4588595) loss:2.411 lr:0.0000100 epoch_Time:24547.0min: [2024-01-05 23:08:24,024][model8_pretrain.py][INFO] Epoch:[0/2](730600/4588595) loss:2.928 lr:0.0000100 epoch_Time:24547.0min: [2024-01-05 23:08:24,024][model8_pretrain.py][INFO] Epoch:[0/2](730600/4588595) loss:2.883 lr:0.0000100 epoch_Time:24547.0min: [2024-01-05 23:08:24,024][model8_pretrain.py][INFO] Epoch:[0/2](730600/4588595) loss:2.493 lr:0.0000100 epoch_Time:24547.0min: [2024-01-05 23:08:24,024][model8_pretrain.py][INFO] Epoch:[0/2](730600/4588595) loss:2.693 lr:0.0000100 epoch_Time:24547.0min: [2024-01-05 23:09:01,033][model8_pretrain.py][INFO] Epoch:[0/2](730700/4588595) loss:3.341 lr:0.0000100 epoch_Time:24546.0min: [2024-01-05 23:09:01,033][model8_pretrain.py][INFO] Epoch:[0/2](730700/4588595) loss:3.215 lr:0.0000100 epoch_Time:24546.0min: [2024-01-05 23:09:01,033][model8_pretrain.py][INFO] Epoch:[0/2](730700/4588595) loss:2.842 lr:0.0000100 epoch_Time:24546.0min: [2024-01-05 23:09:01,033][model8_pretrain.py][INFO] Epoch:[0/2](730700/4588595) loss:2.652 lr:0.0000100 epoch_Time:24546.0min: [2024-01-05 23:09:01,033][model8_pretrain.py][INFO] Epoch:[0/2](730700/4588595) loss:3.046 lr:0.0000100 epoch_Time:24546.0min: [2024-01-05 23:09:01,033][model8_pretrain.py][INFO] Epoch:[0/2](730700/4588595) loss:2.839 lr:0.0000100 epoch_Time:24546.0min: [2024-01-05 23:09:01,034][model8_pretrain.py][INFO] Epoch:[0/2](730700/4588595) loss:2.627 lr:0.0000100 
epoch_Time:24546.0min: [2024-01-05 23:09:01,034][model8_pretrain.py][INFO] Epoch:[0/2](730700/4588595) loss:2.237 lr:0.0000100 epoch_Time:24546.0min: [2024-01-05 23:09:37,987][model8_pretrain.py][INFO] Epoch:[0/2](730800/4588595) loss:2.809 lr:0.0000100 epoch_Time:24546.0min: [2024-01-05 23:09:37,987][model8_pretrain.py][INFO] Epoch:[0/2](730800/4588595) loss:2.780 lr:0.0000100 epoch_Time:24546.0min: [2024-01-05 23:09:37,987][model8_pretrain.py][INFO] Epoch:[0/2](730800/4588595) loss:3.050 lr:0.0000100 epoch_Time:24546.0min: [2024-01-05 23:09:37,987][model8_pretrain.py][INFO] Epoch:[0/2](730800/4588595) loss:2.537 lr:0.0000100 epoch_Time:24546.0min: [2024-01-05 23:09:37,987][model8_pretrain.py][INFO] Epoch:[0/2](730800/4588595) loss:3.201 lr:0.0000100 epoch_Time:24546.0min: [2024-01-05 23:09:37,987][model8_pretrain.py][INFO] Epoch:[0/2](730800/4588595) loss:2.713 lr:0.0000100 epoch_Time:24546.0min: [2024-01-05 23:09:37,987][model8_pretrain.py][INFO] Epoch:[0/2](730800/4588595) loss:2.933 lr:0.0000100 epoch_Time:24546.0min: [2024-01-05 23:09:37,987][model8_pretrain.py][INFO] Epoch:[0/2](730800/4588595) loss:2.333 lr:0.0000100 epoch_Time:24546.0min: [2024-01-05 23:10:14,933][model8_pretrain.py][INFO] Epoch:[0/2](730900/4588595) loss:2.644 lr:0.0000100 epoch_Time:24545.0min: [2024-01-05 23:10:14,933][model8_pretrain.py][INFO] Epoch:[0/2](730900/4588595) loss:3.036 lr:0.0000100 epoch_Time:24545.0min: [2024-01-05 23:10:14,933][model8_pretrain.py][INFO] Epoch:[0/2](730900/4588595) loss:2.744 lr:0.0000100 epoch_Time:24545.0min: [2024-01-05 23:10:14,933][model8_pretrain.py][INFO] Epoch:[0/2](730900/4588595) loss:2.910 lr:0.0000100 epoch_Time:24545.0min: [2024-01-05 23:10:14,933][model8_pretrain.py][INFO] Epoch:[0/2](730900/4588595) loss:2.584 lr:0.0000100 epoch_Time:24545.0min: [2024-01-05 23:10:14,933][model8_pretrain.py][INFO] Epoch:[0/2](730900/4588595) loss:3.065 lr:0.0000100 epoch_Time:24545.0min: [2024-01-05 23:10:14,933][model8_pretrain.py][INFO] 
Epoch:[0/2](730900/4588595) loss:2.670 lr:0.0000100 epoch_Time:24545.0min: [2024-01-05 23:10:14,933][model8_pretrain.py][INFO] Epoch:[0/2](730900/4588595) loss:2.723 lr:0.0000100 epoch_Time:24545.0min: [2024-01-05 23:10:51,878][model8_pretrain.py][INFO] Epoch:[0/2](731000/4588595) loss:3.240 lr:0.0000100 epoch_Time:24544.0min: [2024-01-05 23:10:51,878][model8_pretrain.py][INFO] Epoch:[0/2](731000/4588595) loss:3.301 lr:0.0000100 epoch_Time:24544.0min: [2024-01-05 23:10:51,878][model8_pretrain.py][INFO] Epoch:[0/2](731000/4588595) loss:2.483 lr:0.0000100 epoch_Time:24544.0min: [2024-01-05 23:10:51,878][model8_pretrain.py][INFO] Epoch:[0/2](731000/4588595) loss:3.299 lr:0.0000100 epoch_Time:24544.0min: [2024-01-05 23:10:51,878][model8_pretrain.py][INFO] Epoch:[0/2](731000/4588595) loss:2.259 lr:0.0000100 epoch_Time:24544.0min: [2024-01-05 23:10:51,878][model8_pretrain.py][INFO] Epoch:[0/2](731000/4588595) loss:2.599 lr:0.0000100 epoch_Time:24544.0min: [2024-01-05 23:10:51,878][model8_pretrain.py][INFO] Epoch:[0/2](731000/4588595) loss:3.250 lr:0.0000100 epoch_Time:24544.0min: [2024-01-05 23:10:51,878][model8_pretrain.py][INFO] Epoch:[0/2](731000/4588595) loss:2.925 lr:0.0000100 epoch_Time:24544.0min: [2024-01-05 23:11:28,823][model8_pretrain.py][INFO] Epoch:[0/2](731100/4588595) loss:2.178 lr:0.0000100 epoch_Time:24544.0min: [2024-01-05 23:11:28,823][model8_pretrain.py][INFO] Epoch:[0/2](731100/4588595) loss:2.511 lr:0.0000100 epoch_Time:24544.0min: [2024-01-05 23:11:28,823][model8_pretrain.py][INFO] Epoch:[0/2](731100/4588595) loss:2.861 lr:0.0000100 epoch_Time:24544.0min: [2024-01-05 23:11:28,823][model8_pretrain.py][INFO] Epoch:[0/2](731100/4588595) loss:2.412 lr:0.0000100 epoch_Time:24544.0min: [2024-01-05 23:11:28,823][model8_pretrain.py][INFO] Epoch:[0/2](731100/4588595) loss:2.440 lr:0.0000100 epoch_Time:24544.0min: [2024-01-05 23:11:28,823][model8_pretrain.py][INFO] Epoch:[0/2](731100/4588595) loss:2.863 lr:0.0000100 epoch_Time:24544.0min: [2024-01-05 
23:11:28,823][model8_pretrain.py][INFO] Epoch:[0/2](731100/4588595) loss:2.404 lr:0.0000100 epoch_Time:24544.0min: [2024-01-05 23:11:28,823][model8_pretrain.py][INFO] Epoch:[0/2](731100/4588595) loss:2.834 lr:0.0000100 epoch_Time:24544.0min: [2024-01-05 23:12:15,924][model8_pretrain.py][INFO] Epoch:[0/2](731200/4588595) loss:2.769 lr:0.0000100 epoch_Time:24544.0min: [2024-01-05 23:12:15,924][model8_pretrain.py][INFO] Epoch:[0/2](731200/4588595) loss:2.724 lr:0.0000100 epoch_Time:24544.0min: [2024-01-05 23:12:15,924][model8_pretrain.py][INFO] Epoch:[0/2](731200/4588595) loss:2.976 lr:0.0000100 epoch_Time:24544.0min: [2024-01-05 23:12:15,924][model8_pretrain.py][INFO] Epoch:[0/2](731200/4588595) loss:2.269 lr:0.0000100 epoch_Time:24544.0min: [2024-01-05 23:12:15,924][model8_pretrain.py][INFO] Epoch:[0/2](731200/4588595) loss:2.952 lr:0.0000100 epoch_Time:24544.0min: [2024-01-05 23:12:15,924][model8_pretrain.py][INFO] Epoch:[0/2](731200/4588595) loss:2.859 lr:0.0000100 epoch_Time:24544.0min: [2024-01-05 23:12:15,924][model8_pretrain.py][INFO] Epoch:[0/2](731200/4588595) loss:3.202 lr:0.0000100 epoch_Time:24544.0min: [2024-01-05 23:12:15,924][model8_pretrain.py][INFO] Epoch:[0/2](731200/4588595) loss:2.730 lr:0.0000100 epoch_Time:24544.0min: [2024-01-05 23:12:54,601][model8_pretrain.py][INFO] Epoch:[0/2](731300/4588595) loss:2.842 lr:0.0000100 epoch_Time:24543.0min: [2024-01-05 23:12:54,601][model8_pretrain.py][INFO] Epoch:[0/2](731300/4588595) loss:2.503 lr:0.0000100 epoch_Time:24543.0min: [2024-01-05 23:12:54,601][model8_pretrain.py][INFO] Epoch:[0/2](731300/4588595) loss:2.862 lr:0.0000100 epoch_Time:24543.0min: [2024-01-05 23:12:54,601][model8_pretrain.py][INFO] Epoch:[0/2](731300/4588595) loss:2.586 lr:0.0000100 epoch_Time:24543.0min: [2024-01-05 23:12:54,601][model8_pretrain.py][INFO] Epoch:[0/2](731300/4588595) loss:3.001 lr:0.0000100 epoch_Time:24543.0min: [2024-01-05 23:12:54,601][model8_pretrain.py][INFO] Epoch:[0/2](731300/4588595) loss:2.750 lr:0.0000100 
epoch_Time:24543.0min: [2024-01-05 23:12:54,601][model8_pretrain.py][INFO] Epoch:[0/2](731300/4588595) loss:3.091 lr:0.0000100 epoch_Time:24543.0min: [2024-01-05 23:12:54,601][model8_pretrain.py][INFO] Epoch:[0/2](731300/4588595) loss:2.991 lr:0.0000100 epoch_Time:24543.0min: [2024-01-05 23:13:31,543][model8_pretrain.py][INFO] Epoch:[0/2](731400/4588595) loss:2.834 lr:0.0000100 epoch_Time:24543.0min: [2024-01-05 23:13:31,543][model8_pretrain.py][INFO] Epoch:[0/2](731400/4588595) loss:2.608 lr:0.0000100 epoch_Time:24543.0min: [2024-01-05 23:13:31,543][model8_pretrain.py][INFO] Epoch:[0/2](731400/4588595) loss:2.579 lr:0.0000100 epoch_Time:24543.0min: [2024-01-05 23:13:31,543][model8_pretrain.py][INFO] Epoch:[0/2](731400/4588595) loss:2.960 lr:0.0000100 epoch_Time:24543.0min: [2024-01-05 23:13:31,543][model8_pretrain.py][INFO] Epoch:[0/2](731400/4588595) loss:3.255 lr:0.0000100 epoch_Time:24543.0min: [2024-01-05 23:13:31,543][model8_pretrain.py][INFO] Epoch:[0/2](731400/4588595) loss:2.120 lr:0.0000100 epoch_Time:24543.0min: [2024-01-05 23:13:31,543][model8_pretrain.py][INFO] Epoch:[0/2](731400/4588595) loss:2.496 lr:0.0000100 epoch_Time:24543.0min: [2024-01-05 23:13:31,543][model8_pretrain.py][INFO] Epoch:[0/2](731400/4588595) loss:2.549 lr:0.0000100 epoch_Time:24543.0min: [2024-01-05 23:14:08,491][model8_pretrain.py][INFO] Epoch:[0/2](731500/4588595) loss:3.002 lr:0.0000100 epoch_Time:24542.0min: [2024-01-05 23:14:08,491][model8_pretrain.py][INFO] Epoch:[0/2](731500/4588595) loss:3.445 lr:0.0000100 epoch_Time:24542.0min: [2024-01-05 23:14:08,491][model8_pretrain.py][INFO] Epoch:[0/2](731500/4588595) loss:2.991 lr:0.0000100 epoch_Time:24542.0min: [2024-01-05 23:14:08,491][model8_pretrain.py][INFO] Epoch:[0/2](731500/4588595) loss:3.247 lr:0.0000100 epoch_Time:24542.0min: [2024-01-05 23:14:08,491][model8_pretrain.py][INFO] Epoch:[0/2](731500/4588595) loss:3.603 lr:0.0000100 epoch_Time:24542.0min: [2024-01-05 23:14:08,491][model8_pretrain.py][INFO] 
Epoch:[0/2](731500/4588595) loss:2.849 lr:0.0000100 epoch_Time:24542.0min: [2024-01-05 23:14:08,491][model8_pretrain.py][INFO] Epoch:[0/2](731500/4588595) loss:3.073 lr:0.0000100 epoch_Time:24542.0min: [2024-01-05 23:14:08,491][model8_pretrain.py][INFO] Epoch:[0/2](731500/4588595) loss:2.621 lr:0.0000100 epoch_Time:24542.0min: [2024-01-05 23:14:45,422][model8_pretrain.py][INFO] Epoch:[0/2](731600/4588595) loss:3.120 lr:0.0000100 epoch_Time:24541.0min: [2024-01-05 23:14:45,422][model8_pretrain.py][INFO] Epoch:[0/2](731600/4588595) loss:2.531 lr:0.0000100 epoch_Time:24541.0min: [2024-01-05 23:14:45,422][model8_pretrain.py][INFO] Epoch:[0/2](731600/4588595) loss:2.625 lr:0.0000100 epoch_Time:24541.0min: [2024-01-05 23:14:45,422][model8_pretrain.py][INFO] Epoch:[0/2](731600/4588595) loss:2.695 lr:0.0000100 epoch_Time:24541.0min: [2024-01-05 23:14:45,422][model8_pretrain.py][INFO] Epoch:[0/2](731600/4588595) loss:2.950 lr:0.0000100 epoch_Time:24541.0min: [2024-01-05 23:14:45,422][model8_pretrain.py][INFO] Epoch:[0/2](731600/4588595) loss:3.128 lr:0.0000100 epoch_Time:24541.0min: [2024-01-05 23:14:45,422][model8_pretrain.py][INFO] Epoch:[0/2](731600/4588595) loss:2.598 lr:0.0000100 epoch_Time:24541.0min: [2024-01-05 23:14:45,422][model8_pretrain.py][INFO] Epoch:[0/2](731600/4588595) loss:2.927 lr:0.0000100 epoch_Time:24541.0min: [2024-01-05 23:15:22,348][model8_pretrain.py][INFO] Epoch:[0/2](731700/4588595) loss:2.668 lr:0.0000100 epoch_Time:24540.0min: [2024-01-05 23:15:22,348][model8_pretrain.py][INFO] Epoch:[0/2](731700/4588595) loss:3.010 lr:0.0000100 epoch_Time:24540.0min: [2024-01-05 23:15:22,348][model8_pretrain.py][INFO] Epoch:[0/2](731700/4588595) loss:3.052 lr:0.0000100 epoch_Time:24540.0min: [2024-01-05 23:15:22,348][model8_pretrain.py][INFO] Epoch:[0/2](731700/4588595) loss:2.668 lr:0.0000100 epoch_Time:24540.0min: [2024-01-05 23:15:22,348][model8_pretrain.py][INFO] Epoch:[0/2](731700/4588595) loss:3.010 lr:0.0000100 epoch_Time:24540.0min: [2024-01-05 
23:15:22,348][model8_pretrain.py][INFO] Epoch:[0/2](731700/4588595) loss:2.834 lr:0.0000100 epoch_Time:24540.0min: [2024-01-05 23:15:22,348][model8_pretrain.py][INFO] Epoch:[0/2](731700/4588595) loss:3.059 lr:0.0000100 epoch_Time:24540.0min: [2024-01-05 23:15:22,349][model8_pretrain.py][INFO] Epoch:[0/2](731700/4588595) loss:2.471 lr:0.0000100 epoch_Time:24540.0min: [2024-01-05 23:15:59,291][model8_pretrain.py][INFO] Epoch:[0/2](731800/4588595) loss:3.116 lr:0.0000100 epoch_Time:24539.0min: [2024-01-05 23:15:59,291][model8_pretrain.py][INFO] Epoch:[0/2](731800/4588595) loss:2.863 lr:0.0000100 epoch_Time:24539.0min: [2024-01-05 23:15:59,291][model8_pretrain.py][INFO] Epoch:[0/2](731800/4588595) loss:2.342 lr:0.0000100 epoch_Time:24539.0min: [2024-01-05 23:15:59,291][model8_pretrain.py][INFO] Epoch:[0/2](731800/4588595) loss:2.755 lr:0.0000100 epoch_Time:24539.0min: [2024-01-05 23:15:59,291][model8_pretrain.py][INFO] Epoch:[0/2](731800/4588595) loss:2.656 lr:0.0000100 epoch_Time:24539.0min: [2024-01-05 23:15:59,291][model8_pretrain.py][INFO] Epoch:[0/2](731800/4588595) loss:3.324 lr:0.0000100 epoch_Time:24539.0min: [2024-01-05 23:15:59,291][model8_pretrain.py][INFO] Epoch:[0/2](731800/4588595) loss:2.499 lr:0.0000100 epoch_Time:24539.0min: [2024-01-05 23:15:59,291][model8_pretrain.py][INFO] Epoch:[0/2](731800/4588595) loss:2.861 lr:0.0000100 epoch_Time:24539.0min: [2024-01-05 23:16:36,227][model8_pretrain.py][INFO] Epoch:[0/2](731900/4588595) loss:2.575 lr:0.0000100 epoch_Time:24539.0min: [2024-01-05 23:16:36,227][model8_pretrain.py][INFO] Epoch:[0/2](731900/4588595) loss:2.631 lr:0.0000100 epoch_Time:24539.0min: [2024-01-05 23:16:36,227][model8_pretrain.py][INFO] Epoch:[0/2](731900/4588595) loss:2.817 lr:0.0000100 epoch_Time:24539.0min: [2024-01-05 23:16:36,227][model8_pretrain.py][INFO] Epoch:[0/2](731900/4588595) loss:3.070 lr:0.0000100 epoch_Time:24539.0min: [2024-01-05 23:16:36,227][model8_pretrain.py][INFO] Epoch:[0/2](731900/4588595) loss:3.087 lr:0.0000100 
epoch_Time:24539.0min: [2024-01-05 23:16:36,227][model8_pretrain.py][INFO] Epoch:[0/2](731900/4588595) loss:3.323 lr:0.0000100 epoch_Time:24539.0min: [2024-01-05 23:16:36,227][model8_pretrain.py][INFO] Epoch:[0/2](731900/4588595) loss:2.762 lr:0.0000100 epoch_Time:24539.0min: [2024-01-05 23:16:36,227][model8_pretrain.py][INFO] Epoch:[0/2](731900/4588595) loss:2.569 lr:0.0000100 epoch_Time:24539.0min: [2024-01-05 23:17:23,386][model8_pretrain.py][INFO] Epoch:[0/2](732000/4588595) loss:3.158 lr:0.0000100 epoch_Time:24539.0min: [2024-01-05 23:17:23,386][model8_pretrain.py][INFO] Epoch:[0/2](732000/4588595) loss:3.144 lr:0.0000100 epoch_Time:24539.0min: [2024-01-05 23:17:23,386][model8_pretrain.py][INFO] Epoch:[0/2](732000/4588595) loss:3.126 lr:0.0000100 epoch_Time:24539.0min: [2024-01-05 23:17:23,386][model8_pretrain.py][INFO] Epoch:[0/2](732000/4588595) loss:3.002 lr:0.0000100 epoch_Time:24539.0min: [2024-01-05 23:17:23,386][model8_pretrain.py][INFO] Epoch:[0/2](732000/4588595) loss:3.372 lr:0.0000100 epoch_Time:24539.0min: [2024-01-05 23:17:23,386][model8_pretrain.py][INFO] Epoch:[0/2](732000/4588595) loss:2.241 lr:0.0000100 epoch_Time:24539.0min: [2024-01-05 23:17:23,386][model8_pretrain.py][INFO] Epoch:[0/2](732000/4588595) loss:2.739 lr:0.0000100 epoch_Time:24539.0min: [2024-01-05 23:17:23,386][model8_pretrain.py][INFO] Epoch:[0/2](732000/4588595) loss:3.299 lr:0.0000100 epoch_Time:24539.0min: [2024-01-05 23:18:01,985][model8_pretrain.py][INFO] Epoch:[0/2](732100/4588595) loss:2.816 lr:0.0000100 epoch_Time:24538.0min: [2024-01-05 23:18:01,985][model8_pretrain.py][INFO] Epoch:[0/2](732100/4588595) loss:2.518 lr:0.0000100 epoch_Time:24538.0min: [2024-01-05 23:18:01,985][model8_pretrain.py][INFO] Epoch:[0/2](732100/4588595) loss:2.991 lr:0.0000100 epoch_Time:24538.0min: [2024-01-05 23:18:01,985][model8_pretrain.py][INFO] Epoch:[0/2](732100/4588595) loss:3.133 lr:0.0000100 epoch_Time:24538.0min: [2024-01-05 23:18:01,985][model8_pretrain.py][INFO] 
Epoch:[0/2](732100/4588595) loss:3.429 lr:0.0000100 epoch_Time:24538.0min: [2024-01-05 23:18:01,985][model8_pretrain.py][INFO] Epoch:[0/2](732100/4588595) loss:2.802 lr:0.0000100 epoch_Time:24538.0min: [2024-01-05 23:18:01,985][model8_pretrain.py][INFO] Epoch:[0/2](732100/4588595) loss:2.677 lr:0.0000100 epoch_Time:24538.0min: [2024-01-05 23:18:01,985][model8_pretrain.py][INFO] Epoch:[0/2](732100/4588595) loss:2.802 lr:0.0000100 epoch_Time:24538.0min: [2024-01-05 23:18:38,931][model8_pretrain.py][INFO] Epoch:[0/2](732200/4588595) loss:2.402 lr:0.0000100 epoch_Time:24538.0min: [2024-01-05 23:18:38,931][model8_pretrain.py][INFO] Epoch:[0/2](732200/4588595) loss:2.101 lr:0.0000100 epoch_Time:24538.0min: [2024-01-05 23:18:38,931][model8_pretrain.py][INFO] Epoch:[0/2](732200/4588595) loss:2.253 lr:0.0000100 epoch_Time:24538.0min: [2024-01-05 23:18:38,931][model8_pretrain.py][INFO] Epoch:[0/2](732200/4588595) loss:3.074 lr:0.0000100 epoch_Time:24538.0min: [2024-01-05 23:18:38,931][model8_pretrain.py][INFO] Epoch:[0/2](732200/4588595) loss:2.879 lr:0.0000100 epoch_Time:24538.0min: [2024-01-05 23:18:38,931][model8_pretrain.py][INFO] Epoch:[0/2](732200/4588595) loss:3.152 lr:0.0000100 epoch_Time:24538.0min: [2024-01-05 23:18:38,931][model8_pretrain.py][INFO] Epoch:[0/2](732200/4588595) loss:3.143 lr:0.0000100 epoch_Time:24538.0min: [2024-01-05 23:18:38,932][model8_pretrain.py][INFO] Epoch:[0/2](732200/4588595) loss:2.838 lr:0.0000100 epoch_Time:24538.0min: [2024-01-05 23:19:15,877][model8_pretrain.py][INFO] Epoch:[0/2](732300/4588595) loss:2.653 lr:0.0000100 epoch_Time:24537.0min: [2024-01-05 23:19:15,877][model8_pretrain.py][INFO] Epoch:[0/2](732300/4588595) loss:2.794 lr:0.0000100 epoch_Time:24537.0min: [2024-01-05 23:19:15,877][model8_pretrain.py][INFO] Epoch:[0/2](732300/4588595) loss:2.280 lr:0.0000100 epoch_Time:24537.0min: [2024-01-05 23:19:15,877][model8_pretrain.py][INFO] Epoch:[0/2](732300/4588595) loss:3.063 lr:0.0000100 epoch_Time:24537.0min: [2024-01-05 
23:19:15,877][model8_pretrain.py][INFO] Epoch:[0/2](732300/4588595) loss:2.974 lr:0.0000100 epoch_Time:24537.0min: [2024-01-05 23:19:15,877][model8_pretrain.py][INFO] Epoch:[0/2](732300/4588595) loss:3.225 lr:0.0000100 epoch_Time:24537.0min: [2024-01-05 23:19:15,877][model8_pretrain.py][INFO] Epoch:[0/2](732300/4588595) loss:2.506 lr:0.0000100 epoch_Time:24537.0min: [2024-01-05 23:19:15,877][model8_pretrain.py][INFO] Epoch:[0/2](732300/4588595) loss:3.478 lr:0.0000100 epoch_Time:24537.0min: [2024-01-05 23:19:52,818][model8_pretrain.py][INFO] Epoch:[0/2](732400/4588595) loss:2.828 lr:0.0000100 epoch_Time:24536.0min: [2024-01-05 23:19:52,818][model8_pretrain.py][INFO] Epoch:[0/2](732400/4588595) loss:2.900 lr:0.0000100 epoch_Time:24536.0min: [2024-01-05 23:19:52,818][model8_pretrain.py][INFO] Epoch:[0/2](732400/4588595) loss:3.146 lr:0.0000100 epoch_Time:24536.0min: [2024-01-05 23:19:52,818][model8_pretrain.py][INFO] Epoch:[0/2](732400/4588595) loss:2.716 lr:0.0000100 epoch_Time:24536.0min: [2024-01-05 23:19:52,818][model8_pretrain.py][INFO] Epoch:[0/2](732400/4588595) loss:2.664 lr:0.0000100 epoch_Time:24536.0min: [2024-01-05 23:19:52,818][model8_pretrain.py][INFO] Epoch:[0/2](732400/4588595) loss:2.965 lr:0.0000100 epoch_Time:24536.0min: [2024-01-05 23:19:52,818][model8_pretrain.py][INFO] Epoch:[0/2](732400/4588595) loss:2.938 lr:0.0000100 epoch_Time:24536.0min: [2024-01-05 23:19:52,818][model8_pretrain.py][INFO] Epoch:[0/2](732400/4588595) loss:2.785 lr:0.0000100 epoch_Time:24536.0min: [2024-01-05 23:20:29,757][model8_pretrain.py][INFO] Epoch:[0/2](732500/4588595) loss:2.969 lr:0.0000100 epoch_Time:24535.0min: [2024-01-05 23:20:29,757][model8_pretrain.py][INFO] Epoch:[0/2](732500/4588595) loss:3.281 lr:0.0000100 epoch_Time:24535.0min: [2024-01-05 23:20:29,757][model8_pretrain.py][INFO] Epoch:[0/2](732500/4588595) loss:2.601 lr:0.0000100 epoch_Time:24535.0min: [2024-01-05 23:20:29,757][model8_pretrain.py][INFO] Epoch:[0/2](732500/4588595) loss:2.577 lr:0.0000100 
epoch_Time:24535.0min: [2024-01-05 23:20:29,757][model8_pretrain.py][INFO] Epoch:[0/2](732500/4588595) loss:2.582 lr:0.0000100 epoch_Time:24535.0min: [2024-01-05 23:20:29,757][model8_pretrain.py][INFO] Epoch:[0/2](732500/4588595) loss:1.936 lr:0.0000100 epoch_Time:24535.0min: [2024-01-05 23:20:29,757][model8_pretrain.py][INFO] Epoch:[0/2](732500/4588595) loss:2.493 lr:0.0000100 epoch_Time:24535.0min: [2024-01-05 23:20:29,757][model8_pretrain.py][INFO] Epoch:[0/2](732500/4588595) loss:2.723 lr:0.0000100 epoch_Time:24535.0min: [2024-01-05 23:21:06,703][model8_pretrain.py][INFO] Epoch:[0/2](732600/4588595) loss:2.192 lr:0.0000100 epoch_Time:24534.0min: [2024-01-05 23:21:06,703][model8_pretrain.py][INFO] Epoch:[0/2](732600/4588595) loss:2.997 lr:0.0000100 epoch_Time:24534.0min: [2024-01-05 23:21:06,703][model8_pretrain.py][INFO] Epoch:[0/2](732600/4588595) loss:2.161 lr:0.0000100 epoch_Time:24534.0min: [2024-01-05 23:21:06,703][model8_pretrain.py][INFO] Epoch:[0/2](732600/4588595) loss:2.446 lr:0.0000100 epoch_Time:24534.0min: [2024-01-05 23:21:06,703][model8_pretrain.py][INFO] Epoch:[0/2](732600/4588595) loss:3.118 lr:0.0000100 epoch_Time:24534.0min: [2024-01-05 23:21:06,703][model8_pretrain.py][INFO] Epoch:[0/2](732600/4588595) loss:3.223 lr:0.0000100 epoch_Time:24534.0min: [2024-01-05 23:21:06,703][model8_pretrain.py][INFO] Epoch:[0/2](732600/4588595) loss:2.282 lr:0.0000100 epoch_Time:24534.0min: [2024-01-05 23:21:06,703][model8_pretrain.py][INFO] Epoch:[0/2](732600/4588595) loss:2.938 lr:0.0000100 epoch_Time:24534.0min: [2024-01-05 23:21:43,638][model8_pretrain.py][INFO] Epoch:[0/2](732700/4588595) loss:2.255 lr:0.0000100 epoch_Time:24534.0min: [2024-01-05 23:21:43,638][model8_pretrain.py][INFO] Epoch:[0/2](732700/4588595) loss:2.834 lr:0.0000100 epoch_Time:24534.0min: [2024-01-05 23:21:43,638][model8_pretrain.py][INFO] Epoch:[0/2](732700/4588595) loss:2.447 lr:0.0000100 epoch_Time:24534.0min: [2024-01-05 23:21:43,638][model8_pretrain.py][INFO] 
Epoch:[0/2](732700/4588595) loss:2.774 lr:0.0000100 epoch_Time:24534.0min: [2024-01-05 23:21:43,638][model8_pretrain.py][INFO] Epoch:[0/2](732700/4588595) loss:2.769 lr:0.0000100 epoch_Time:24534.0min: [2024-01-05 23:21:43,638][model8_pretrain.py][INFO] Epoch:[0/2](732700/4588595) loss:2.319 lr:0.0000100 epoch_Time:24534.0min: [2024-01-05 23:21:43,638][model8_pretrain.py][INFO] Epoch:[0/2](732700/4588595) loss:2.782 lr:0.0000100 epoch_Time:24534.0min: [2024-01-05 23:21:43,638][model8_pretrain.py][INFO] Epoch:[0/2](732700/4588595) loss:2.757 lr:0.0000100 epoch_Time:24534.0min: [2024-01-05 23:22:29,010][model8_pretrain.py][INFO] Epoch:[0/2](732800/4588595) loss:2.934 lr:0.0000100 epoch_Time:24534.0min: [2024-01-05 23:22:29,011][model8_pretrain.py][INFO] Epoch:[0/2](732800/4588595) loss:2.438 lr:0.0000100 epoch_Time:24534.0min: [2024-01-05 23:22:29,011][model8_pretrain.py][INFO] Epoch:[0/2](732800/4588595) loss:2.348 lr:0.0000100 epoch_Time:24534.0min: [2024-01-05 23:22:29,011][model8_pretrain.py][INFO] Epoch:[0/2](732800/4588595) loss:2.677 lr:0.0000100 epoch_Time:24534.0min: [2024-01-05 23:22:29,011][model8_pretrain.py][INFO] Epoch:[0/2](732800/4588595) loss:2.584 lr:0.0000100 epoch_Time:24534.0min: [2024-01-05 23:22:29,011][model8_pretrain.py][INFO] Epoch:[0/2](732800/4588595) loss:3.010 lr:0.0000100 epoch_Time:24534.0min: [2024-01-05 23:22:29,011][model8_pretrain.py][INFO] Epoch:[0/2](732800/4588595) loss:3.064 lr:0.0000100 epoch_Time:24534.0min: [2024-01-05 23:22:29,011][model8_pretrain.py][INFO] Epoch:[0/2](732800/4588595) loss:3.066 lr:0.0000100 epoch_Time:24534.0min: [2024-01-05 23:23:09,383][model8_pretrain.py][INFO] Epoch:[0/2](732900/4588595) loss:3.116 lr:0.0000100 epoch_Time:24533.0min: [2024-01-05 23:23:09,383][model8_pretrain.py][INFO] Epoch:[0/2](732900/4588595) loss:2.991 lr:0.0000100 epoch_Time:24533.0min: [2024-01-05 23:23:09,383][model8_pretrain.py][INFO] Epoch:[0/2](732900/4588595) loss:2.839 lr:0.0000100 epoch_Time:24533.0min: [2024-01-05 
23:23:09,383][model8_pretrain.py][INFO] Epoch:[0/2](732900/4588595) loss:2.800 lr:0.0000100 epoch_Time:24533.0min: [2024-01-05 23:23:09,383][model8_pretrain.py][INFO] Epoch:[0/2](732900/4588595) loss:3.176 lr:0.0000100 epoch_Time:24533.0min: [2024-01-05 23:23:09,383][model8_pretrain.py][INFO] Epoch:[0/2](732900/4588595) loss:3.046 lr:0.0000100 epoch_Time:24533.0min: [2024-01-05 23:23:09,384][model8_pretrain.py][INFO] Epoch:[0/2](732900/4588595) loss:2.368 lr:0.0000100 epoch_Time:24533.0min: [2024-01-05 23:23:09,384][model8_pretrain.py][INFO] Epoch:[0/2](732900/4588595) loss:2.216 lr:0.0000100 epoch_Time:24533.0min: [2024-01-05 23:23:46,319][model8_pretrain.py][INFO] Epoch:[0/2](733000/4588595) loss:2.716 lr:0.0000100 epoch_Time:24533.0min: [2024-01-05 23:23:46,319][model8_pretrain.py][INFO] Epoch:[0/2](733000/4588595) loss:3.049 lr:0.0000100 epoch_Time:24533.0min: [2024-01-05 23:23:46,319][model8_pretrain.py][INFO] Epoch:[0/2](733000/4588595) loss:2.987 lr:0.0000100 epoch_Time:24533.0min: [2024-01-05 23:23:46,319][model8_pretrain.py][INFO] Epoch:[0/2](733000/4588595) loss:2.449 lr:0.0000100 epoch_Time:24533.0min: [2024-01-05 23:23:46,319][model8_pretrain.py][INFO] Epoch:[0/2](733000/4588595) loss:3.096 lr:0.0000100 epoch_Time:24533.0min: [2024-01-05 23:23:46,319][model8_pretrain.py][INFO] Epoch:[0/2](733000/4588595) loss:2.831 lr:0.0000100 epoch_Time:24533.0min: [2024-01-05 23:23:46,319][model8_pretrain.py][INFO] Epoch:[0/2](733000/4588595) loss:3.121 lr:0.0000100 epoch_Time:24533.0min: [2024-01-05 23:23:46,319][model8_pretrain.py][INFO] Epoch:[0/2](733000/4588595) loss:2.559 lr:0.0000100 epoch_Time:24533.0min: [2024-01-05 23:24:23,236][model8_pretrain.py][INFO] Epoch:[0/2](733100/4588595) loss:3.020 lr:0.0000100 epoch_Time:24532.0min: [2024-01-05 23:24:23,236][model8_pretrain.py][INFO] Epoch:[0/2](733100/4588595) loss:3.101 lr:0.0000100 epoch_Time:24532.0min: [2024-01-05 23:24:23,236][model8_pretrain.py][INFO] Epoch:[0/2](733100/4588595) loss:2.482 lr:0.0000100 
epoch_Time:24532.0min: [2024-01-05 23:24:23,236][model8_pretrain.py][INFO] Epoch:[0/2](733100/4588595) loss:2.937 lr:0.0000100 epoch_Time:24532.0min: [2024-01-05 23:24:23,236][model8_pretrain.py][INFO] Epoch:[0/2](733100/4588595) loss:3.015 lr:0.0000100 epoch_Time:24532.0min: [2024-01-05 23:24:23,236][model8_pretrain.py][INFO] Epoch:[0/2](733100/4588595) loss:2.713 lr:0.0000100 epoch_Time:24532.0min: [2024-01-05 23:24:23,236][model8_pretrain.py][INFO] Epoch:[0/2](733100/4588595) loss:3.225 lr:0.0000100 epoch_Time:24532.0min: [2024-01-05 23:24:23,237][model8_pretrain.py][INFO] Epoch:[0/2](733100/4588595) loss:2.782 lr:0.0000100 epoch_Time:24532.0min: [2024-01-05 23:25:00,180][model8_pretrain.py][INFO] Epoch:[0/2](733200/4588595) loss:3.216 lr:0.0000100 epoch_Time:24531.0min: [2024-01-05 23:25:00,180][model8_pretrain.py][INFO] Epoch:[0/2](733200/4588595) loss:2.419 lr:0.0000100 epoch_Time:24531.0min: [2024-01-05 23:25:00,180][model8_pretrain.py][INFO] Epoch:[0/2](733200/4588595) loss:3.320 lr:0.0000100 epoch_Time:24531.0min: [2024-01-05 23:25:00,180][model8_pretrain.py][INFO] Epoch:[0/2](733200/4588595) loss:2.910 lr:0.0000100 epoch_Time:24531.0min: [2024-01-05 23:25:00,180][model8_pretrain.py][INFO] Epoch:[0/2](733200/4588595) loss:2.970 lr:0.0000100 epoch_Time:24531.0min: [2024-01-05 23:25:00,180][model8_pretrain.py][INFO] Epoch:[0/2](733200/4588595) loss:3.096 lr:0.0000100 epoch_Time:24531.0min: [2024-01-05 23:25:00,181][model8_pretrain.py][INFO] Epoch:[0/2](733200/4588595) loss:2.400 lr:0.0000100 epoch_Time:24531.0min: [2024-01-05 23:25:00,181][model8_pretrain.py][INFO] Epoch:[0/2](733200/4588595) loss:3.179 lr:0.0000100 epoch_Time:24531.0min: [2024-01-05 23:25:37,108][model8_pretrain.py][INFO] Epoch:[0/2](733300/4588595) loss:3.152 lr:0.0000100 epoch_Time:24531.0min: [2024-01-05 23:25:37,108][model8_pretrain.py][INFO] Epoch:[0/2](733300/4588595) loss:2.788 lr:0.0000100 epoch_Time:24531.0min: [2024-01-05 23:25:37,108][model8_pretrain.py][INFO] 
Epoch:[0/2](733300/4588595) loss:3.261 lr:0.0000100 epoch_Time:24531.0min: [2024-01-05 23:25:37,108][model8_pretrain.py][INFO] Epoch:[0/2](733300/4588595) loss:2.772 lr:0.0000100 epoch_Time:24531.0min: [2024-01-05 23:25:37,108][model8_pretrain.py][INFO] Epoch:[0/2](733300/4588595) loss:2.985 lr:0.0000100 epoch_Time:24531.0min: [2024-01-05 23:25:37,108][model8_pretrain.py][INFO] Epoch:[0/2](733300/4588595) loss:3.024 lr:0.0000100 epoch_Time:24531.0min: [2024-01-05 23:25:37,108][model8_pretrain.py][INFO] Epoch:[0/2](733300/4588595) loss:2.559 lr:0.0000100 epoch_Time:24531.0min: [2024-01-05 23:25:37,108][model8_pretrain.py][INFO] Epoch:[0/2](733300/4588595) loss:2.791 lr:0.0000100 epoch_Time:24531.0min: [2024-01-05 23:26:14,049][model8_pretrain.py][INFO] Epoch:[0/2](733400/4588595) loss:2.767 lr:0.0000100 epoch_Time:24530.0min: [2024-01-05 23:26:14,049][model8_pretrain.py][INFO] Epoch:[0/2](733400/4588595) loss:2.677 lr:0.0000100 epoch_Time:24530.0min: [2024-01-05 23:26:14,049][model8_pretrain.py][INFO] Epoch:[0/2](733400/4588595) loss:3.167 lr:0.0000100 epoch_Time:24530.0min: [2024-01-05 23:26:14,049][model8_pretrain.py][INFO] Epoch:[0/2](733400/4588595) loss:2.898 lr:0.0000100 epoch_Time:24530.0min: [2024-01-05 23:26:14,049][model8_pretrain.py][INFO] Epoch:[0/2](733400/4588595) loss:3.523 lr:0.0000100 epoch_Time:24530.0min: [2024-01-05 23:26:14,049][model8_pretrain.py][INFO] Epoch:[0/2](733400/4588595) loss:1.920 lr:0.0000100 epoch_Time:24530.0min: [2024-01-05 23:26:14,050][model8_pretrain.py][INFO] Epoch:[0/2](733400/4588595) loss:2.580 lr:0.0000100 epoch_Time:24530.0min: [2024-01-05 23:26:14,050][model8_pretrain.py][INFO] Epoch:[0/2](733400/4588595) loss:2.744 lr:0.0000100 epoch_Time:24530.0min: [2024-01-05 23:26:50,991][model8_pretrain.py][INFO] Epoch:[0/2](733500/4588595) loss:2.878 lr:0.0000100 epoch_Time:24528.0min: [2024-01-05 23:26:50,991][model8_pretrain.py][INFO] Epoch:[0/2](733500/4588595) loss:2.861 lr:0.0000100 epoch_Time:24528.0min: [2024-01-05 
23:26:50,991][model8_pretrain.py][INFO] Epoch:[0/2](733500/4588595) loss:2.518 lr:0.0000100 epoch_Time:24528.0min: [2024-01-05 23:26:50,991][model8_pretrain.py][INFO] Epoch:[0/2](733500/4588595) loss:2.077 lr:0.0000100 epoch_Time:24528.0min: [2024-01-05 23:26:50,991][model8_pretrain.py][INFO] Epoch:[0/2](733500/4588595) loss:3.380 lr:0.0000100 epoch_Time:24528.0min: [2024-01-05 23:26:50,991][model8_pretrain.py][INFO] Epoch:[0/2](733500/4588595) loss:3.148 lr:0.0000100 epoch_Time:24528.0min: [2024-01-05 23:26:50,991][model8_pretrain.py][INFO] Epoch:[0/2](733500/4588595) loss:3.078 lr:0.0000100 epoch_Time:24528.0min: [2024-01-05 23:26:50,991][model8_pretrain.py][INFO] Epoch:[0/2](733500/4588595) loss:2.927 lr:0.0000100 epoch_Time:24528.0min: [2024-01-05 23:27:36,583][model8_pretrain.py][INFO] Epoch:[0/2](733600/4588595) loss:2.773 lr:0.0000100 epoch_Time:24529.0min: [2024-01-05 23:27:36,583][model8_pretrain.py][INFO] Epoch:[0/2](733600/4588595) loss:3.266 lr:0.0000100 epoch_Time:24529.0min: [2024-01-05 23:27:36,583][model8_pretrain.py][INFO] Epoch:[0/2](733600/4588595) loss:2.756 lr:0.0000100 epoch_Time:24529.0min: [2024-01-05 23:27:36,583][model8_pretrain.py][INFO] Epoch:[0/2](733600/4588595) loss:2.881 lr:0.0000100 epoch_Time:24529.0min: [2024-01-05 23:27:36,583][model8_pretrain.py][INFO] Epoch:[0/2](733600/4588595) loss:2.840 lr:0.0000100 epoch_Time:24529.0min: [2024-01-05 23:27:36,583][model8_pretrain.py][INFO] Epoch:[0/2](733600/4588595) loss:2.887 lr:0.0000100 epoch_Time:24529.0min: [2024-01-05 23:27:36,583][model8_pretrain.py][INFO] Epoch:[0/2](733600/4588595) loss:2.858 lr:0.0000100 epoch_Time:24529.0min: [2024-01-05 23:27:36,583][model8_pretrain.py][INFO] Epoch:[0/2](733600/4588595) loss:2.424 lr:0.0000100 epoch_Time:24529.0min: [2024-01-05 23:28:16,993][model8_pretrain.py][INFO] Epoch:[0/2](733700/4588595) loss:2.766 lr:0.0000100 epoch_Time:24528.0min: [2024-01-05 23:28:16,993][model8_pretrain.py][INFO] Epoch:[0/2](733700/4588595) loss:2.803 lr:0.0000100 
epoch_Time:24528.0min: [2024-01-05 23:28:16,993][model8_pretrain.py][INFO] Epoch:[0/2](733700/4588595) loss:2.750 lr:0.0000100 epoch_Time:24528.0min: [2024-01-05 23:28:16,993][model8_pretrain.py][INFO] Epoch:[0/2](733700/4588595) loss:2.808 lr:0.0000100 epoch_Time:24528.0min: [2024-01-05 23:28:16,993][model8_pretrain.py][INFO] Epoch:[0/2](733700/4588595) loss:2.537 lr:0.0000100 epoch_Time:24528.0min: [2024-01-05 23:28:16,993][model8_pretrain.py][INFO] Epoch:[0/2](733700/4588595) loss:3.052 lr:0.0000100 epoch_Time:24528.0min: [2024-01-05 23:28:16,994][model8_pretrain.py][INFO] Epoch:[0/2](733700/4588595) loss:2.533 lr:0.0000100 epoch_Time:24528.0min: [2024-01-05 23:28:16,994][model8_pretrain.py][INFO] Epoch:[0/2](733700/4588595) loss:3.314 lr:0.0000100 epoch_Time:24528.0min: [2024-01-05 23:28:53,940][model8_pretrain.py][INFO] Epoch:[0/2](733800/4588595) loss:2.921 lr:0.0000100 epoch_Time:24527.0min: [2024-01-05 23:28:53,940][model8_pretrain.py][INFO] Epoch:[0/2](733800/4588595) loss:2.983 lr:0.0000100 epoch_Time:24527.0min: [2024-01-05 23:28:53,940][model8_pretrain.py][INFO] Epoch:[0/2](733800/4588595) loss:2.810 lr:0.0000100 epoch_Time:24527.0min: [2024-01-05 23:28:53,940][model8_pretrain.py][INFO] Epoch:[0/2](733800/4588595) loss:2.716 lr:0.0000100 epoch_Time:24527.0min: [2024-01-05 23:28:53,940][model8_pretrain.py][INFO] Epoch:[0/2](733800/4588595) loss:2.560 lr:0.0000100 epoch_Time:24527.0min: [2024-01-05 23:28:53,941][model8_pretrain.py][INFO] Epoch:[0/2](733800/4588595) loss:3.029 lr:0.0000100 epoch_Time:24527.0min: [2024-01-05 23:28:53,940][model8_pretrain.py][INFO] Epoch:[0/2](733800/4588595) loss:2.854 lr:0.0000100 epoch_Time:24527.0min: [2024-01-05 23:28:53,941][model8_pretrain.py][INFO] Epoch:[0/2](733800/4588595) loss:2.723 lr:0.0000100 epoch_Time:24527.0min: [2024-01-05 23:29:30,872][model8_pretrain.py][INFO] Epoch:[0/2](733900/4588595) loss:2.975 lr:0.0000100 epoch_Time:24527.0min: [2024-01-05 23:29:30,872][model8_pretrain.py][INFO] 
Epoch:[0/2](733900/4588595) loss:2.804 lr:0.0000100 epoch_Time:24527.0min: [2024-01-05 23:29:30,872][model8_pretrain.py][INFO] Epoch:[0/2](733900/4588595) loss:2.947 lr:0.0000100 epoch_Time:24527.0min: [2024-01-05 23:29:30,872][model8_pretrain.py][INFO] Epoch:[0/2](733900/4588595) loss:3.087 lr:0.0000100 epoch_Time:24527.0min: [2024-01-05 23:29:30,872][model8_pretrain.py][INFO] Epoch:[0/2](733900/4588595) loss:2.508 lr:0.0000100 epoch_Time:24527.0min: [2024-01-05 23:29:30,872][model8_pretrain.py][INFO] Epoch:[0/2](733900/4588595) loss:3.268 lr:0.0000100 epoch_Time:24527.0min: [2024-01-05 23:29:30,872][model8_pretrain.py][INFO] Epoch:[0/2](733900/4588595) loss:2.858 lr:0.0000100 epoch_Time:24527.0min: [2024-01-05 23:29:30,872][model8_pretrain.py][INFO] Epoch:[0/2](733900/4588595) loss:2.810 lr:0.0000100 epoch_Time:24527.0min: [2024-01-05 23:30:07,805][model8_pretrain.py][INFO] Epoch:[0/2](734000/4588595) loss:2.563 lr:0.0000100 epoch_Time:24526.0min: [2024-01-05 23:30:07,805][model8_pretrain.py][INFO] Epoch:[0/2](734000/4588595) loss:2.937 lr:0.0000100 epoch_Time:24526.0min: [2024-01-05 23:30:07,805][model8_pretrain.py][INFO] Epoch:[0/2](734000/4588595) loss:2.045 lr:0.0000100 epoch_Time:24526.0min: [2024-01-05 23:30:07,805][model8_pretrain.py][INFO] Epoch:[0/2](734000/4588595) loss:2.892 lr:0.0000100 epoch_Time:24526.0min: [2024-01-05 23:30:07,805][model8_pretrain.py][INFO] Epoch:[0/2](734000/4588595) loss:2.796 lr:0.0000100 epoch_Time:24526.0min: [2024-01-05 23:30:07,805][model8_pretrain.py][INFO] Epoch:[0/2](734000/4588595) loss:2.465 lr:0.0000100 epoch_Time:24526.0min: [2024-01-05 23:30:07,805][model8_pretrain.py][INFO] Epoch:[0/2](734000/4588595) loss:3.092 lr:0.0000100 epoch_Time:24526.0min: [2024-01-05 23:30:07,805][model8_pretrain.py][INFO] Epoch:[0/2](734000/4588595) loss:3.558 lr:0.0000100 epoch_Time:24526.0min: [2024-01-05 23:30:44,741][model8_pretrain.py][INFO] Epoch:[0/2](734100/4588595) loss:2.663 lr:0.0000100 epoch_Time:24526.0min: [2024-01-05 
23:30:44,741][model8_pretrain.py][INFO] Epoch:[0/2](734100/4588595) loss:2.166 lr:0.0000100 epoch_Time:24526.0min: [2024-01-05 23:30:44,741][model8_pretrain.py][INFO] Epoch:[0/2](734100/4588595) loss:3.076 lr:0.0000100 epoch_Time:24526.0min: [2024-01-05 23:30:44,741][model8_pretrain.py][INFO] Epoch:[0/2](734100/4588595) loss:3.071 lr:0.0000100 epoch_Time:24526.0min: [2024-01-05 23:30:44,741][model8_pretrain.py][INFO] Epoch:[0/2](734100/4588595) loss:2.835 lr:0.0000100 epoch_Time:24526.0min: [2024-01-05 23:30:44,742][model8_pretrain.py][INFO] Epoch:[0/2](734100/4588595) loss:2.735 lr:0.0000100 epoch_Time:24526.0min: [2024-01-05 23:30:44,742][model8_pretrain.py][INFO] Epoch:[0/2](734100/4588595) loss:2.342 lr:0.0000100 epoch_Time:24526.0min: [2024-01-05 23:30:44,742][model8_pretrain.py][INFO] Epoch:[0/2](734100/4588595) loss:2.691 lr:0.0000100 epoch_Time:24526.0min: [2024-01-05 23:31:21,665][model8_pretrain.py][INFO] Epoch:[0/2](734200/4588595) loss:2.857 lr:0.0000100 epoch_Time:24525.0min: [2024-01-05 23:31:21,665][model8_pretrain.py][INFO] Epoch:[0/2](734200/4588595) loss:2.445 lr:0.0000100 epoch_Time:24525.0min: [2024-01-05 23:31:21,665][model8_pretrain.py][INFO] Epoch:[0/2](734200/4588595) loss:3.013 lr:0.0000100 epoch_Time:24525.0min: [2024-01-05 23:31:21,665][model8_pretrain.py][INFO] Epoch:[0/2](734200/4588595) loss:2.653 lr:0.0000100 epoch_Time:24525.0min: [2024-01-05 23:31:21,665][model8_pretrain.py][INFO] Epoch:[0/2](734200/4588595) loss:2.797 lr:0.0000100 epoch_Time:24525.0min: [2024-01-05 23:31:21,665][model8_pretrain.py][INFO] Epoch:[0/2](734200/4588595) loss:3.584 lr:0.0000100 epoch_Time:24525.0min: [2024-01-05 23:31:21,665][model8_pretrain.py][INFO] Epoch:[0/2](734200/4588595) loss:2.685 lr:0.0000100 epoch_Time:24525.0min: [2024-01-05 23:31:21,665][model8_pretrain.py][INFO] Epoch:[0/2](734200/4588595) loss:2.845 lr:0.0000100 epoch_Time:24525.0min: [2024-01-05 23:31:58,595][model8_pretrain.py][INFO] Epoch:[0/2](734300/4588595) loss:2.785 lr:0.0000100 
epoch_Time:24524.0min: [2024-01-05 23:31:58,595][model8_pretrain.py][INFO] Epoch:[0/2](734300/4588595) loss:2.740 lr:0.0000100 epoch_Time:24524.0min: [2024-01-05 23:31:58,595][model8_pretrain.py][INFO] Epoch:[0/2](734300/4588595) loss:2.859 lr:0.0000100 epoch_Time:24524.0min: [2024-01-05 23:31:58,596][model8_pretrain.py][INFO] Epoch:[0/2](734300/4588595) loss:2.838 lr:0.0000100 epoch_Time:24524.0min: [2024-01-05 23:31:58,596][model8_pretrain.py][INFO] Epoch:[0/2](734300/4588595) loss:3.109 lr:0.0000100 epoch_Time:24524.0min: [2024-01-05 23:31:58,596][model8_pretrain.py][INFO] Epoch:[0/2](734300/4588595) loss:2.767 lr:0.0000100 epoch_Time:24524.0min: [2024-01-05 23:31:58,596][model8_pretrain.py][INFO] Epoch:[0/2](734300/4588595) loss:2.665 lr:0.0000100 epoch_Time:24524.0min: [2024-01-05 23:31:58,596][model8_pretrain.py][INFO] Epoch:[0/2](734300/4588595) loss:2.592 lr:0.0000100 epoch_Time:24524.0min: [2024-01-05 23:32:44,230][model8_pretrain.py][INFO] Epoch:[0/2](734400/4588595) loss:2.837 lr:0.0000100 epoch_Time:24524.0min: [2024-01-05 23:32:44,230][model8_pretrain.py][INFO] Epoch:[0/2](734400/4588595) loss:2.632 lr:0.0000100 epoch_Time:24524.0min: [2024-01-05 23:32:44,230][model8_pretrain.py][INFO] Epoch:[0/2](734400/4588595) loss:3.024 lr:0.0000100 epoch_Time:24524.0min: [2024-01-05 23:32:44,230][model8_pretrain.py][INFO] Epoch:[0/2](734400/4588595) loss:2.805 lr:0.0000100 epoch_Time:24524.0min: [2024-01-05 23:32:44,230][model8_pretrain.py][INFO] Epoch:[0/2](734400/4588595) loss:2.791 lr:0.0000100 epoch_Time:24524.0min: [2024-01-05 23:32:44,230][model8_pretrain.py][INFO] Epoch:[0/2](734400/4588595) loss:2.788 lr:0.0000100 epoch_Time:24524.0min: [2024-01-05 23:32:44,230][model8_pretrain.py][INFO] Epoch:[0/2](734400/4588595) loss:3.069 lr:0.0000100 epoch_Time:24524.0min: [2024-01-05 23:32:44,235][model8_pretrain.py][INFO] Epoch:[0/2](734400/4588595) loss:2.613 lr:0.0000100 epoch_Time:24524.0min: [2024-01-05 23:33:24,655][model8_pretrain.py][INFO] 
Epoch:[0/2](734500/4588595) loss:2.731 lr:0.0000100 epoch_Time:24524.0min: [2024-01-05 23:33:24,655][model8_pretrain.py][INFO] Epoch:[0/2](734500/4588595) loss:2.938 lr:0.0000100 epoch_Time:24524.0min: [2024-01-05 23:33:24,655][model8_pretrain.py][INFO] Epoch:[0/2](734500/4588595) loss:2.809 lr:0.0000100 epoch_Time:24524.0min: [2024-01-05 23:33:24,655][model8_pretrain.py][INFO] Epoch:[0/2](734500/4588595) loss:2.561 lr:0.0000100 epoch_Time:24524.0min: [2024-01-05 23:33:24,655][model8_pretrain.py][INFO] Epoch:[0/2](734500/4588595) loss:2.589 lr:0.0000100 epoch_Time:24524.0min: [2024-01-05 23:33:24,655][model8_pretrain.py][INFO] Epoch:[0/2](734500/4588595) loss:3.320 lr:0.0000100 epoch_Time:24524.0min: [2024-01-05 23:33:24,655][model8_pretrain.py][INFO] Epoch:[0/2](734500/4588595) loss:2.159 lr:0.0000100 epoch_Time:24524.0min: [2024-01-05 23:33:24,655][model8_pretrain.py][INFO] Epoch:[0/2](734500/4588595) loss:2.599 lr:0.0000100 epoch_Time:24524.0min: [2024-01-05 23:34:01,586][model8_pretrain.py][INFO] Epoch:[0/2](734600/4588595) loss:2.590 lr:0.0000100 epoch_Time:24523.0min: [2024-01-05 23:34:01,586][model8_pretrain.py][INFO] Epoch:[0/2](734600/4588595) loss:3.053 lr:0.0000100 epoch_Time:24523.0min: [2024-01-05 23:34:01,586][model8_pretrain.py][INFO] Epoch:[0/2](734600/4588595) loss:2.783 lr:0.0000100 epoch_Time:24523.0min: [2024-01-05 23:34:01,586][model8_pretrain.py][INFO] Epoch:[0/2](734600/4588595) loss:3.029 lr:0.0000100 epoch_Time:24523.0min: [2024-01-05 23:34:01,586][model8_pretrain.py][INFO] Epoch:[0/2](734600/4588595) loss:2.739 lr:0.0000100 epoch_Time:24523.0min: [2024-01-05 23:34:01,586][model8_pretrain.py][INFO] Epoch:[0/2](734600/4588595) loss:3.084 lr:0.0000100 epoch_Time:24523.0min: [2024-01-05 23:34:01,586][model8_pretrain.py][INFO] Epoch:[0/2](734600/4588595) loss:2.868 lr:0.0000100 epoch_Time:24523.0min: [2024-01-05 23:34:01,586][model8_pretrain.py][INFO] Epoch:[0/2](734600/4588595) loss:2.957 lr:0.0000100 epoch_Time:24523.0min: [2024-01-05 
23:34:38,532][model8_pretrain.py][INFO] Epoch:[0/2](734700/4588595) loss:3.001 lr:0.0000100 epoch_Time:24522.0min: [2024-01-05 23:34:38,532][model8_pretrain.py][INFO] Epoch:[0/2](734700/4588595) loss:2.760 lr:0.0000100 epoch_Time:24522.0min: [2024-01-05 23:34:38,532][model8_pretrain.py][INFO] Epoch:[0/2](734700/4588595) loss:3.231 lr:0.0000100 epoch_Time:24522.0min: [2024-01-05 23:34:38,532][model8_pretrain.py][INFO] Epoch:[0/2](734700/4588595) loss:3.075 lr:0.0000100 epoch_Time:24522.0min: [2024-01-05 23:34:38,533][model8_pretrain.py][INFO] Epoch:[0/2](734700/4588595) loss:1.884 lr:0.0000100 epoch_Time:24522.0min: [2024-01-05 23:34:38,532][model8_pretrain.py][INFO] Epoch:[0/2](734700/4588595) loss:2.973 lr:0.0000100 epoch_Time:24522.0min: [2024-01-05 23:34:38,532][model8_pretrain.py][INFO] Epoch:[0/2](734700/4588595) loss:3.134 lr:0.0000100 epoch_Time:24522.0min: [2024-01-05 23:34:38,533][model8_pretrain.py][INFO] Epoch:[0/2](734700/4588595) loss:3.040 lr:0.0000100 epoch_Time:24522.0min: [2024-01-05 23:35:15,499][model8_pretrain.py][INFO] Epoch:[0/2](734800/4588595) loss:2.943 lr:0.0000100 epoch_Time:24521.0min: [2024-01-05 23:35:15,499][model8_pretrain.py][INFO] Epoch:[0/2](734800/4588595) loss:2.646 lr:0.0000100 epoch_Time:24521.0min: [2024-01-05 23:35:15,499][model8_pretrain.py][INFO] Epoch:[0/2](734800/4588595) loss:3.156 lr:0.0000100 epoch_Time:24521.0min: [2024-01-05 23:35:15,499][model8_pretrain.py][INFO] Epoch:[0/2](734800/4588595) loss:3.304 lr:0.0000100 epoch_Time:24521.0min: [2024-01-05 23:35:15,499][model8_pretrain.py][INFO] Epoch:[0/2](734800/4588595) loss:2.602 lr:0.0000100 epoch_Time:24521.0min: [2024-01-05 23:35:15,499][model8_pretrain.py][INFO] Epoch:[0/2](734800/4588595) loss:2.776 lr:0.0000100 epoch_Time:24521.0min: [2024-01-05 23:35:15,499][model8_pretrain.py][INFO] Epoch:[0/2](734800/4588595) loss:3.141 lr:0.0000100 epoch_Time:24521.0min: [2024-01-05 23:35:15,499][model8_pretrain.py][INFO] Epoch:[0/2](734800/4588595) loss:2.728 lr:0.0000100 
epoch_Time:24521.0min: [2024-01-05 23:35:52,449][model8_pretrain.py][INFO] Epoch:[0/2](734900/4588595) loss:2.971 lr:0.0000100 epoch_Time:24520.0min: [2024-01-05 23:35:52,449][model8_pretrain.py][INFO] Epoch:[0/2](734900/4588595) loss:2.279 lr:0.0000100 epoch_Time:24520.0min: [2024-01-05 23:35:52,449][model8_pretrain.py][INFO] Epoch:[0/2](734900/4588595) loss:2.864 lr:0.0000100 epoch_Time:24520.0min: [2024-01-05 23:35:52,449][model8_pretrain.py][INFO] Epoch:[0/2](734900/4588595) loss:3.000 lr:0.0000100 epoch_Time:24520.0min: [2024-01-05 23:35:52,449][model8_pretrain.py][INFO] Epoch:[0/2](734900/4588595) loss:2.830 lr:0.0000100 epoch_Time:24520.0min: [2024-01-05 23:35:52,449][model8_pretrain.py][INFO] Epoch:[0/2](734900/4588595) loss:3.217 lr:0.0000100 epoch_Time:24520.0min: [2024-01-05 23:35:52,449][model8_pretrain.py][INFO] Epoch:[0/2](734900/4588595) loss:2.467 lr:0.0000100 epoch_Time:24520.0min: [2024-01-05 23:35:52,449][model8_pretrain.py][INFO] Epoch:[0/2](734900/4588595) loss:3.083 lr:0.0000100 epoch_Time:24520.0min: [2024-01-05 23:36:29,389][model8_pretrain.py][INFO] Epoch:[0/2](735000/4588595) loss:2.672 lr:0.0000100 epoch_Time:24520.0min: [2024-01-05 23:36:29,389][model8_pretrain.py][INFO] Epoch:[0/2](735000/4588595) loss:3.479 lr:0.0000100 epoch_Time:24520.0min: [2024-01-05 23:36:29,389][model8_pretrain.py][INFO] Epoch:[0/2](735000/4588595) loss:2.835 lr:0.0000100 epoch_Time:24520.0min: [2024-01-05 23:36:29,389][model8_pretrain.py][INFO] Epoch:[0/2](735000/4588595) loss:2.289 lr:0.0000100 epoch_Time:24520.0min: [2024-01-05 23:36:29,389][model8_pretrain.py][INFO] Epoch:[0/2](735000/4588595) loss:2.196 lr:0.0000100 epoch_Time:24520.0min: [2024-01-05 23:36:29,389][model8_pretrain.py][INFO] Epoch:[0/2](735000/4588595) loss:2.596 lr:0.0000100 epoch_Time:24520.0min: [2024-01-05 23:36:29,389][model8_pretrain.py][INFO] Epoch:[0/2](735000/4588595) loss:2.842 lr:0.0000100 epoch_Time:24520.0min: [2024-01-05 23:36:29,389][model8_pretrain.py][INFO] 
Epoch:[0/2](735000/4588595) loss:3.211 lr:0.0000100 epoch_Time:24520.0min: [2024-01-05 23:37:06,329][model8_pretrain.py][INFO] Epoch:[0/2](735100/4588595) loss:3.521 lr:0.0000100 epoch_Time:24519.0min: [2024-01-05 23:37:06,329][model8_pretrain.py][INFO] Epoch:[0/2](735100/4588595) loss:3.324 lr:0.0000100 epoch_Time:24519.0min: [2024-01-05 23:37:06,329][model8_pretrain.py][INFO] Epoch:[0/2](735100/4588595) loss:2.907 lr:0.0000100 epoch_Time:24519.0min: [2024-01-05 23:37:06,329][model8_pretrain.py][INFO] Epoch:[0/2](735100/4588595) loss:3.153 lr:0.0000100 epoch_Time:24519.0min: [2024-01-05 23:37:06,329][model8_pretrain.py][INFO] Epoch:[0/2](735100/4588595) loss:2.637 lr:0.0000100 epoch_Time:24519.0min: [2024-01-05 23:37:06,329][model8_pretrain.py][INFO] Epoch:[0/2](735100/4588595) loss:3.094 lr:0.0000100 epoch_Time:24519.0min: [2024-01-05 23:37:06,329][model8_pretrain.py][INFO] Epoch:[0/2](735100/4588595) loss:2.866 lr:0.0000100 epoch_Time:24519.0min: [2024-01-05 23:37:06,330][model8_pretrain.py][INFO] Epoch:[0/2](735100/4588595) loss:2.436 lr:0.0000100 epoch_Time:24519.0min: [2024-01-05 23:37:48,469][model8_pretrain.py][INFO] Epoch:[0/2](735200/4588595) loss:2.645 lr:0.0000100 epoch_Time:24518.0min: [2024-01-05 23:37:48,469][model8_pretrain.py][INFO] Epoch:[0/2](735200/4588595) loss:2.876 lr:0.0000100 epoch_Time:24518.0min: [2024-01-05 23:37:48,469][model8_pretrain.py][INFO] Epoch:[0/2](735200/4588595) loss:3.194 lr:0.0000100 epoch_Time:24518.0min: [2024-01-05 23:37:48,469][model8_pretrain.py][INFO] Epoch:[0/2](735200/4588595) loss:3.197 lr:0.0000100 epoch_Time:24518.0min: [2024-01-05 23:37:48,469][model8_pretrain.py][INFO] Epoch:[0/2](735200/4588595) loss:2.829 lr:0.0000100 epoch_Time:24518.0min: [2024-01-05 23:37:48,469][model8_pretrain.py][INFO] Epoch:[0/2](735200/4588595) loss:3.167 lr:0.0000100 epoch_Time:24518.0min: [2024-01-05 23:37:48,469][model8_pretrain.py][INFO] Epoch:[0/2](735200/4588595) loss:3.396 lr:0.0000100 epoch_Time:24518.0min: [2024-01-05 
23:37:48,469][model8_pretrain.py][INFO] Epoch:[0/2](735200/4588595) loss:2.853 lr:0.0000100 epoch_Time:24518.0min: [2024-01-05 23:38:32,356][model8_pretrain.py][INFO] Epoch:[0/2](735300/4588595) loss:2.653 lr:0.0000100 epoch_Time:24519.0min: [2024-01-05 23:38:32,356][model8_pretrain.py][INFO] Epoch:[0/2](735300/4588595) loss:2.595 lr:0.0000100 epoch_Time:24519.0min: [2024-01-05 23:38:32,356][model8_pretrain.py][INFO] Epoch:[0/2](735300/4588595) loss:2.870 lr:0.0000100 epoch_Time:24519.0min: [2024-01-05 23:38:32,356][model8_pretrain.py][INFO] Epoch:[0/2](735300/4588595) loss:3.342 lr:0.0000100 epoch_Time:24519.0min: [2024-01-05 23:38:32,356][model8_pretrain.py][INFO] Epoch:[0/2](735300/4588595) loss:2.561 lr:0.0000100 epoch_Time:24519.0min: [2024-01-05 23:38:32,356][model8_pretrain.py][INFO] Epoch:[0/2](735300/4588595) loss:2.503 lr:0.0000100 epoch_Time:24519.0min: [2024-01-05 23:38:32,356][model8_pretrain.py][INFO] Epoch:[0/2](735300/4588595) loss:2.734 lr:0.0000100 epoch_Time:24519.0min: [2024-01-05 23:38:32,357][model8_pretrain.py][INFO] Epoch:[0/2](735300/4588595) loss:2.555 lr:0.0000100 epoch_Time:24519.0min: [2024-01-05 23:39:09,296][model8_pretrain.py][INFO] Epoch:[0/2](735400/4588595) loss:2.671 lr:0.0000100 epoch_Time:24518.0min: [2024-01-05 23:39:09,296][model8_pretrain.py][INFO] Epoch:[0/2](735400/4588595) loss:2.755 lr:0.0000100 epoch_Time:24518.0min: [2024-01-05 23:39:09,296][model8_pretrain.py][INFO] Epoch:[0/2](735400/4588595) loss:2.965 lr:0.0000100 epoch_Time:24518.0min: [2024-01-05 23:39:09,296][model8_pretrain.py][INFO] Epoch:[0/2](735400/4588595) loss:3.255 lr:0.0000100 epoch_Time:24518.0min: [2024-01-05 23:39:09,296][model8_pretrain.py][INFO] Epoch:[0/2](735400/4588595) loss:3.200 lr:0.0000100 epoch_Time:24518.0min: [2024-01-05 23:39:09,296][model8_pretrain.py][INFO] Epoch:[0/2](735400/4588595) loss:2.575 lr:0.0000100 epoch_Time:24518.0min: [2024-01-05 23:39:09,296][model8_pretrain.py][INFO] Epoch:[0/2](735400/4588595) loss:3.276 lr:0.0000100 
epoch_Time:24518.0min: [2024-01-05 23:39:09,297][model8_pretrain.py][INFO] Epoch:[0/2](735400/4588595) loss:2.547 lr:0.0000100 epoch_Time:24518.0min: [2024-01-05 23:39:46,205][model8_pretrain.py][INFO] Epoch:[0/2](735500/4588595) loss:3.023 lr:0.0000100 epoch_Time:24518.0min: [2024-01-05 23:39:46,205][model8_pretrain.py][INFO] Epoch:[0/2](735500/4588595) loss:3.031 lr:0.0000100 epoch_Time:24518.0min: [2024-01-05 23:39:46,205][model8_pretrain.py][INFO] Epoch:[0/2](735500/4588595) loss:3.055 lr:0.0000100 epoch_Time:24518.0min: [2024-01-05 23:39:46,205][model8_pretrain.py][INFO] Epoch:[0/2](735500/4588595) loss:3.295 lr:0.0000100 epoch_Time:24518.0min: [2024-01-05 23:39:46,205][model8_pretrain.py][INFO] Epoch:[0/2](735500/4588595) loss:2.617 lr:0.0000100 epoch_Time:24518.0min: [2024-01-05 23:39:46,205][model8_pretrain.py][INFO] Epoch:[0/2](735500/4588595) loss:2.637 lr:0.0000100 epoch_Time:24518.0min: [2024-01-05 23:39:46,205][model8_pretrain.py][INFO] Epoch:[0/2](735500/4588595) loss:2.963 lr:0.0000100 epoch_Time:24518.0min: [2024-01-05 23:39:46,205][model8_pretrain.py][INFO] Epoch:[0/2](735500/4588595) loss:2.833 lr:0.0000100 epoch_Time:24518.0min: [2024-01-05 23:40:23,145][model8_pretrain.py][INFO] Epoch:[0/2](735600/4588595) loss:3.218 lr:0.0000100 epoch_Time:24516.0min: [2024-01-05 23:40:23,145][model8_pretrain.py][INFO] Epoch:[0/2](735600/4588595) loss:2.489 lr:0.0000100 epoch_Time:24517.0min: [2024-01-05 23:40:23,145][model8_pretrain.py][INFO] Epoch:[0/2](735600/4588595) loss:2.687 lr:0.0000100 epoch_Time:24517.0min: [2024-01-05 23:40:23,145][model8_pretrain.py][INFO] Epoch:[0/2](735600/4588595) loss:2.739 lr:0.0000100 epoch_Time:24517.0min: [2024-01-05 23:40:23,145][model8_pretrain.py][INFO] Epoch:[0/2](735600/4588595) loss:2.712 lr:0.0000100 epoch_Time:24517.0min: [2024-01-05 23:40:23,145][model8_pretrain.py][INFO] Epoch:[0/2](735600/4588595) loss:2.370 lr:0.0000100 epoch_Time:24517.0min: [2024-01-05 23:40:23,145][model8_pretrain.py][INFO] 
Epoch:[0/2](735600/4588595) loss:2.828 lr:0.0000100 epoch_Time:24517.0min: [2024-01-05 23:40:23,145][model8_pretrain.py][INFO] Epoch:[0/2](735600/4588595) loss:2.691 lr:0.0000100 epoch_Time:24516.0min: [2024-01-05 23:41:00,080][model8_pretrain.py][INFO] Epoch:[0/2](735700/4588595) loss:2.878 lr:0.0000100 epoch_Time:24515.0min: [2024-01-05 23:41:00,080][model8_pretrain.py][INFO] Epoch:[0/2](735700/4588595) loss:3.312 lr:0.0000100 epoch_Time:24515.0min: [2024-01-05 23:41:00,080][model8_pretrain.py][INFO] Epoch:[0/2](735700/4588595) loss:2.956 lr:0.0000100 epoch_Time:24515.0min: [2024-01-05 23:41:00,080][model8_pretrain.py][INFO] Epoch:[0/2](735700/4588595) loss:2.658 lr:0.0000100 epoch_Time:24515.0min: [2024-01-05 23:41:00,080][model8_pretrain.py][INFO] Epoch:[0/2](735700/4588595) loss:2.853 lr:0.0000100 epoch_Time:24515.0min: [2024-01-05 23:41:00,080][model8_pretrain.py][INFO] Epoch:[0/2](735700/4588595) loss:2.616 lr:0.0000100 epoch_Time:24515.0min: [2024-01-05 23:41:00,080][model8_pretrain.py][INFO] Epoch:[0/2](735700/4588595) loss:2.691 lr:0.0000100 epoch_Time:24515.0min: [2024-01-05 23:41:00,080][model8_pretrain.py][INFO] Epoch:[0/2](735700/4588595) loss:3.014 lr:0.0000100 epoch_Time:24515.0min: [2024-01-05 23:41:37,006][model8_pretrain.py][INFO] Epoch:[0/2](735800/4588595) loss:3.011 lr:0.0000100 epoch_Time:24515.0min: [2024-01-05 23:41:37,006][model8_pretrain.py][INFO] Epoch:[0/2](735800/4588595) loss:2.469 lr:0.0000100 epoch_Time:24515.0min: [2024-01-05 23:41:37,006][model8_pretrain.py][INFO] Epoch:[0/2](735800/4588595) loss:2.896 lr:0.0000100 epoch_Time:24515.0min: [2024-01-05 23:41:37,006][model8_pretrain.py][INFO] Epoch:[0/2](735800/4588595) loss:3.022 lr:0.0000100 epoch_Time:24515.0min: [2024-01-05 23:41:37,006][model8_pretrain.py][INFO] Epoch:[0/2](735800/4588595) loss:2.841 lr:0.0000100 epoch_Time:24515.0min: [2024-01-05 23:41:37,006][model8_pretrain.py][INFO] Epoch:[0/2](735800/4588595) loss:3.569 lr:0.0000100 epoch_Time:24515.0min: [2024-01-05 
23:41:37,006][model8_pretrain.py][INFO] Epoch:[0/2](735800/4588595) loss:2.962 lr:0.0000100 epoch_Time:24515.0min: [2024-01-05 23:41:37,006][model8_pretrain.py][INFO] Epoch:[0/2](735800/4588595) loss:2.744 lr:0.0000100 epoch_Time:24515.0min: [2024-01-05 23:42:13,935][model8_pretrain.py][INFO] Epoch:[0/2](735900/4588595) loss:2.350 lr:0.0000100 epoch_Time:24514.0min: [2024-01-05 23:42:13,935][model8_pretrain.py][INFO] Epoch:[0/2](735900/4588595) loss:3.119 lr:0.0000100 epoch_Time:24514.0min: [2024-01-05 23:42:13,935][model8_pretrain.py][INFO] Epoch:[0/2](735900/4588595) loss:2.960 lr:0.0000100 epoch_Time:24514.0min: [2024-01-05 23:42:13,935][model8_pretrain.py][INFO] Epoch:[0/2](735900/4588595) loss:3.065 lr:0.0000100 epoch_Time:24514.0min: [2024-01-05 23:42:13,935][model8_pretrain.py][INFO] Epoch:[0/2](735900/4588595) loss:2.456 lr:0.0000100 epoch_Time:24514.0min: [2024-01-05 23:42:13,935][model8_pretrain.py][INFO] Epoch:[0/2](735900/4588595) loss:2.743 lr:0.0000100 epoch_Time:24514.0min: [2024-01-05 23:42:13,935][model8_pretrain.py][INFO] Epoch:[0/2](735900/4588595) loss:2.085 lr:0.0000100 epoch_Time:24514.0min: [2024-01-05 23:42:13,935][model8_pretrain.py][INFO] Epoch:[0/2](735900/4588595) loss:3.268 lr:0.0000100 epoch_Time:24514.0min: [2024-01-05 23:42:54,359][model8_pretrain.py][INFO] Epoch:[0/2](736000/4588595) loss:2.757 lr:0.0000100 epoch_Time:24513.0min: [2024-01-05 23:42:54,359][model8_pretrain.py][INFO] Epoch:[0/2](736000/4588595) loss:2.767 lr:0.0000100 epoch_Time:24513.0min: [2024-01-05 23:42:54,359][model8_pretrain.py][INFO] Epoch:[0/2](736000/4588595) loss:2.631 lr:0.0000100 epoch_Time:24513.0min: [2024-01-05 23:42:54,359][model8_pretrain.py][INFO] Epoch:[0/2](736000/4588595) loss:3.104 lr:0.0000100 epoch_Time:24513.0min: [2024-01-05 23:42:54,359][model8_pretrain.py][INFO] Epoch:[0/2](736000/4588595) loss:3.095 lr:0.0000100 epoch_Time:24513.0min: [2024-01-05 23:42:54,359][model8_pretrain.py][INFO] Epoch:[0/2](736000/4588595) loss:2.757 lr:0.0000100 
epoch_Time:24513.0min: [2024-01-05 23:42:54,359][model8_pretrain.py][INFO] Epoch:[0/2](736000/4588595) loss:2.963 lr:0.0000100 epoch_Time:24513.0min: [2024-01-05 23:42:54,359][model8_pretrain.py][INFO] Epoch:[0/2](736000/4588595) loss:2.987 lr:0.0000100 epoch_Time:24513.0min: [2024-01-05 23:43:39,795][model8_pretrain.py][INFO] Epoch:[0/2](736100/4588595) loss:2.475 lr:0.0000100 epoch_Time:24514.0min: [2024-01-05 23:43:39,795][model8_pretrain.py][INFO] Epoch:[0/2](736100/4588595) loss:2.461 lr:0.0000100 epoch_Time:24514.0min: [2024-01-05 23:43:39,795][model8_pretrain.py][INFO] Epoch:[0/2](736100/4588595) loss:2.878 lr:0.0000100 epoch_Time:24514.0min: [2024-01-05 23:43:39,795][model8_pretrain.py][INFO] Epoch:[0/2](736100/4588595) loss:3.017 lr:0.0000100 epoch_Time:24514.0min: [2024-01-05 23:43:39,795][model8_pretrain.py][INFO] Epoch:[0/2](736100/4588595) loss:2.605 lr:0.0000100 epoch_Time:24514.0min: [2024-01-05 23:43:39,795][model8_pretrain.py][INFO] Epoch:[0/2](736100/4588595) loss:3.232 lr:0.0000100 epoch_Time:24514.0min: [2024-01-05 23:43:39,795][model8_pretrain.py][INFO] Epoch:[0/2](736100/4588595) loss:3.417 lr:0.0000100 epoch_Time:24514.0min: [2024-01-05 23:43:39,796][model8_pretrain.py][INFO] Epoch:[0/2](736100/4588595) loss:2.827 lr:0.0000100 epoch_Time:24514.0min: [2024-01-05 23:44:16,724][model8_pretrain.py][INFO] Epoch:[0/2](736200/4588595) loss:2.816 lr:0.0000100 epoch_Time:24513.0min: [2024-01-05 23:44:16,724][model8_pretrain.py][INFO] Epoch:[0/2](736200/4588595) loss:3.098 lr:0.0000100 epoch_Time:24513.0min: [2024-01-05 23:44:16,724][model8_pretrain.py][INFO] Epoch:[0/2](736200/4588595) loss:3.440 lr:0.0000100 epoch_Time:24513.0min: [2024-01-05 23:44:16,724][model8_pretrain.py][INFO] Epoch:[0/2](736200/4588595) loss:2.693 lr:0.0000100 epoch_Time:24513.0min: [2024-01-05 23:44:16,724][model8_pretrain.py][INFO] Epoch:[0/2](736200/4588595) loss:2.698 lr:0.0000100 epoch_Time:24513.0min: [2024-01-05 23:44:16,724][model8_pretrain.py][INFO] 
Epoch:[0/2](736200/4588595) loss:3.017 lr:0.0000100 epoch_Time:24513.0min: [2024-01-05 23:44:16,724][model8_pretrain.py][INFO] Epoch:[0/2](736200/4588595) loss:3.084 lr:0.0000100 epoch_Time:24513.0min: [2024-01-05 23:44:16,724][model8_pretrain.py][INFO] Epoch:[0/2](736200/4588595) loss:2.572 lr:0.0000100 epoch_Time:24513.0min: [2024-01-05 23:44:53,663][model8_pretrain.py][INFO] Epoch:[0/2](736300/4588595) loss:2.748 lr:0.0000100 epoch_Time:24512.0min: [2024-01-05 23:44:53,663][model8_pretrain.py][INFO] Epoch:[0/2](736300/4588595) loss:2.378 lr:0.0000100 epoch_Time:24512.0min: [2024-01-05 23:44:53,663][model8_pretrain.py][INFO] Epoch:[0/2](736300/4588595) loss:3.213 lr:0.0000100 epoch_Time:24512.0min: [2024-01-05 23:44:53,663][model8_pretrain.py][INFO] Epoch:[0/2](736300/4588595) loss:2.638 lr:0.0000100 epoch_Time:24512.0min: [2024-01-05 23:44:53,663][model8_pretrain.py][INFO] Epoch:[0/2](736300/4588595) loss:2.992 lr:0.0000100 epoch_Time:24512.0min: [2024-01-05 23:44:53,663][model8_pretrain.py][INFO] Epoch:[0/2](736300/4588595) loss:2.600 lr:0.0000100 epoch_Time:24512.0min: [2024-01-05 23:44:53,663][model8_pretrain.py][INFO] Epoch:[0/2](736300/4588595) loss:2.545 lr:0.0000100 epoch_Time:24512.0min: [2024-01-05 23:44:53,663][model8_pretrain.py][INFO] Epoch:[0/2](736300/4588595) loss:2.682 lr:0.0000100 epoch_Time:24512.0min: [2024-01-05 23:45:30,607][model8_pretrain.py][INFO] Epoch:[0/2](736400/4588595) loss:2.556 lr:0.0000100 epoch_Time:24512.0min: [2024-01-05 23:45:30,607][model8_pretrain.py][INFO] Epoch:[0/2](736400/4588595) loss:2.398 lr:0.0000100 epoch_Time:24512.0min: [2024-01-05 23:45:30,607][model8_pretrain.py][INFO] Epoch:[0/2](736400/4588595) loss:2.942 lr:0.0000100 epoch_Time:24512.0min: [2024-01-05 23:45:30,607][model8_pretrain.py][INFO] Epoch:[0/2](736400/4588595) loss:3.038 lr:0.0000100 epoch_Time:24512.0min: [2024-01-05 23:45:30,607][model8_pretrain.py][INFO] Epoch:[0/2](736400/4588595) loss:2.961 lr:0.0000100 epoch_Time:24512.0min: [2024-01-05 
23:45:30,607][model8_pretrain.py][INFO] Epoch:[0/2](736400/4588595) loss:2.581 lr:0.0000100 epoch_Time:24512.0min: [2024-01-05 23:45:30,607][model8_pretrain.py][INFO] Epoch:[0/2](736400/4588595) loss:2.909 lr:0.0000100 epoch_Time:24512.0min: [2024-01-05 23:45:30,607][model8_pretrain.py][INFO] Epoch:[0/2](736400/4588595) loss:2.917 lr:0.0000100 epoch_Time:24512.0min: [2024-01-05 23:46:07,538][model8_pretrain.py][INFO] Epoch:[0/2](736500/4588595) loss:3.349 lr:0.0000100 epoch_Time:24511.0min: [2024-01-05 23:46:07,538][model8_pretrain.py][INFO] Epoch:[0/2](736500/4588595) loss:2.888 lr:0.0000100 epoch_Time:24511.0min: [2024-01-05 23:46:07,538][model8_pretrain.py][INFO] Epoch:[0/2](736500/4588595) loss:2.935 lr:0.0000100 epoch_Time:24511.0min: [2024-01-05 23:46:07,538][model8_pretrain.py][INFO] Epoch:[0/2](736500/4588595) loss:2.460 lr:0.0000100 epoch_Time:24511.0min: [2024-01-05 23:46:07,538][model8_pretrain.py][INFO] Epoch:[0/2](736500/4588595) loss:2.501 lr:0.0000100 epoch_Time:24511.0min: [2024-01-05 23:46:07,538][model8_pretrain.py][INFO] Epoch:[0/2](736500/4588595) loss:3.117 lr:0.0000100 epoch_Time:24511.0min: [2024-01-05 23:46:07,538][model8_pretrain.py][INFO] Epoch:[0/2](736500/4588595) loss:2.877 lr:0.0000100 epoch_Time:24511.0min: [2024-01-05 23:46:07,538][model8_pretrain.py][INFO] Epoch:[0/2](736500/4588595) loss:2.721 lr:0.0000100 epoch_Time:24511.0min: [2024-01-05 23:46:44,478][model8_pretrain.py][INFO] Epoch:[0/2](736600/4588595) loss:2.383 lr:0.0000100 epoch_Time:24510.0min: [2024-01-05 23:46:44,479][model8_pretrain.py][INFO] Epoch:[0/2](736600/4588595) loss:2.436 lr:0.0000100 epoch_Time:24510.0min: [2024-01-05 23:46:44,479][model8_pretrain.py][INFO] Epoch:[0/2](736600/4588595) loss:3.266 lr:0.0000100 epoch_Time:24510.0min: [2024-01-05 23:46:44,479][model8_pretrain.py][INFO] Epoch:[0/2](736600/4588595) loss:3.093 lr:0.0000100 epoch_Time:24510.0min: [2024-01-05 23:46:44,479][model8_pretrain.py][INFO] Epoch:[0/2](736600/4588595) loss:2.900 lr:0.0000100 
epoch_Time:24510.0min: [2024-01-05 23:46:44,479][model8_pretrain.py][INFO] Epoch:[0/2](736600/4588595) loss:3.187 lr:0.0000100 epoch_Time:24510.0min: [2024-01-05 23:46:44,479][model8_pretrain.py][INFO] Epoch:[0/2](736600/4588595) loss:2.948 lr:0.0000100 epoch_Time:24510.0min: [2024-01-05 23:46:44,479][model8_pretrain.py][INFO] Epoch:[0/2](736600/4588595) loss:2.858 lr:0.0000100 epoch_Time:24510.0min: [2024-01-05 23:47:21,429][model8_pretrain.py][INFO] Epoch:[0/2](736700/4588595) loss:2.905 lr:0.0000100 epoch_Time:24509.0min: [2024-01-05 23:47:21,429][model8_pretrain.py][INFO] Epoch:[0/2](736700/4588595) loss:2.796 lr:0.0000100 epoch_Time:24509.0min: [2024-01-05 23:47:21,429][model8_pretrain.py][INFO] Epoch:[0/2](736700/4588595) loss:3.056 lr:0.0000100 epoch_Time:24509.0min: [2024-01-05 23:47:21,429][model8_pretrain.py][INFO] Epoch:[0/2](736700/4588595) loss:3.200 lr:0.0000100 epoch_Time:24509.0min: [2024-01-05 23:47:21,429][model8_pretrain.py][INFO] Epoch:[0/2](736700/4588595) loss:3.384 lr:0.0000100 epoch_Time:24509.0min: [2024-01-05 23:47:21,429][model8_pretrain.py][INFO] Epoch:[0/2](736700/4588595) loss:2.392 lr:0.0000100 epoch_Time:24509.0min: [2024-01-05 23:47:21,429][model8_pretrain.py][INFO] Epoch:[0/2](736700/4588595) loss:2.564 lr:0.0000100 epoch_Time:24509.0min: [2024-01-05 23:47:21,430][model8_pretrain.py][INFO] Epoch:[0/2](736700/4588595) loss:2.898 lr:0.0000100 epoch_Time:24509.0min: [2024-01-05 23:48:01,918][model8_pretrain.py][INFO] Epoch:[0/2](736800/4588595) loss:3.470 lr:0.0000100 epoch_Time:24509.0min: [2024-01-05 23:48:01,918][model8_pretrain.py][INFO] Epoch:[0/2](736800/4588595) loss:2.881 lr:0.0000100 epoch_Time:24509.0min: [2024-01-05 23:48:01,918][model8_pretrain.py][INFO] Epoch:[0/2](736800/4588595) loss:3.142 lr:0.0000100 epoch_Time:24509.0min: [2024-01-05 23:48:01,918][model8_pretrain.py][INFO] Epoch:[0/2](736800/4588595) loss:2.045 lr:0.0000100 epoch_Time:24509.0min: [2024-01-05 23:48:01,919][model8_pretrain.py][INFO] 
Epoch:[0/2](736800/4588595) loss:2.496 lr:0.0000100 epoch_Time:24509.0min: [2024-01-05 23:48:01,919][model8_pretrain.py][INFO] Epoch:[0/2](736800/4588595) loss:3.215 lr:0.0000100 epoch_Time:24509.0min: [2024-01-05 23:48:01,922][model8_pretrain.py][INFO] Epoch:[0/2](736800/4588595) loss:2.647 lr:0.0000100 epoch_Time:24509.0min: [2024-01-05 23:48:01,922][model8_pretrain.py][INFO] Epoch:[0/2](736800/4588595) loss:3.051 lr:0.0000100 epoch_Time:24509.0min: [2024-01-05 23:48:47,372][model8_pretrain.py][INFO] Epoch:[0/2](736900/4588595) loss:2.854 lr:0.0000100 epoch_Time:24509.0min: [2024-01-05 23:48:47,372][model8_pretrain.py][INFO] Epoch:[0/2](736900/4588595) loss:3.084 lr:0.0000100 epoch_Time:24509.0min: [2024-01-05 23:48:47,372][model8_pretrain.py][INFO] Epoch:[0/2](736900/4588595) loss:2.866 lr:0.0000100 epoch_Time:24509.0min: [2024-01-05 23:48:47,372][model8_pretrain.py][INFO] Epoch:[0/2](736900/4588595) loss:3.569 lr:0.0000100 epoch_Time:24509.0min: [2024-01-05 23:48:47,372][model8_pretrain.py][INFO] Epoch:[0/2](736900/4588595) loss:2.677 lr:0.0000100 epoch_Time:24509.0min: [2024-01-05 23:48:47,372][model8_pretrain.py][INFO] Epoch:[0/2](736900/4588595) loss:2.590 lr:0.0000100 epoch_Time:24509.0min: [2024-01-05 23:48:47,372][model8_pretrain.py][INFO] Epoch:[0/2](736900/4588595) loss:2.418 lr:0.0000100 epoch_Time:24509.0min: [2024-01-05 23:48:47,372][model8_pretrain.py][INFO] Epoch:[0/2](736900/4588595) loss:2.410 lr:0.0000100 epoch_Time:24509.0min: [2024-01-05 23:49:24,303][model8_pretrain.py][INFO] Epoch:[0/2](737000/4588595) loss:2.989 lr:0.0000100 epoch_Time:24508.0min: [2024-01-05 23:49:24,303][model8_pretrain.py][INFO] Epoch:[0/2](737000/4588595) loss:2.133 lr:0.0000100 epoch_Time:24508.0min: [2024-01-05 23:49:24,303][model8_pretrain.py][INFO] Epoch:[0/2](737000/4588595) loss:2.273 lr:0.0000100 epoch_Time:24508.0min: [2024-01-05 23:49:24,303][model8_pretrain.py][INFO] Epoch:[0/2](737000/4588595) loss:2.544 lr:0.0000100 epoch_Time:24508.0min: [2024-01-05 
23:49:24,303][model8_pretrain.py][INFO] Epoch:[0/2](737000/4588595) loss:2.611 lr:0.0000100 epoch_Time:24508.0min: [2024-01-05 23:49:24,303][model8_pretrain.py][INFO] Epoch:[0/2](737000/4588595) loss:2.950 lr:0.0000100 epoch_Time:24508.0min: [2024-01-05 23:49:24,303][model8_pretrain.py][INFO] Epoch:[0/2](737000/4588595) loss:1.818 lr:0.0000100 epoch_Time:24508.0min: [2024-01-05 23:49:24,303][model8_pretrain.py][INFO] Epoch:[0/2](737000/4588595) loss:2.346 lr:0.0000100 epoch_Time:24508.0min: [2024-01-05 23:50:01,241][model8_pretrain.py][INFO] Epoch:[0/2](737100/4588595) loss:2.856 lr:0.0000100 epoch_Time:24507.0min: [2024-01-05 23:50:01,241][model8_pretrain.py][INFO] Epoch:[0/2](737100/4588595) loss:2.693 lr:0.0000100 epoch_Time:24507.0min: [2024-01-05 23:50:01,241][model8_pretrain.py][INFO] Epoch:[0/2](737100/4588595) loss:2.885 lr:0.0000100 epoch_Time:24507.0min: [2024-01-05 23:50:01,241][model8_pretrain.py][INFO] Epoch:[0/2](737100/4588595) loss:2.515 lr:0.0000100 epoch_Time:24507.0min: [2024-01-05 23:50:01,241][model8_pretrain.py][INFO] Epoch:[0/2](737100/4588595) loss:3.189 lr:0.0000100 epoch_Time:24507.0min: [2024-01-05 23:50:01,241][model8_pretrain.py][INFO] Epoch:[0/2](737100/4588595) loss:1.775 lr:0.0000100 epoch_Time:24507.0min: [2024-01-05 23:50:01,241][model8_pretrain.py][INFO] Epoch:[0/2](737100/4588595) loss:2.918 lr:0.0000100 epoch_Time:24507.0min: [2024-01-05 23:50:01,242][model8_pretrain.py][INFO] Epoch:[0/2](737100/4588595) loss:2.740 lr:0.0000100 epoch_Time:24507.0min: [2024-01-05 23:50:38,178][model8_pretrain.py][INFO] Epoch:[0/2](737200/4588595) loss:2.843 lr:0.0000100 epoch_Time:24507.0min: [2024-01-05 23:50:38,179][model8_pretrain.py][INFO] Epoch:[0/2](737200/4588595) loss:2.726 lr:0.0000100 epoch_Time:24507.0min: [2024-01-05 23:50:38,179][model8_pretrain.py][INFO] Epoch:[0/2](737200/4588595) loss:2.588 lr:0.0000100 epoch_Time:24507.0min: [2024-01-05 23:50:38,179][model8_pretrain.py][INFO] Epoch:[0/2](737200/4588595) loss:2.957 lr:0.0000100 
epoch_Time:24507.0min: [2024-01-05 23:50:38,179][model8_pretrain.py][INFO] Epoch:[0/2](737200/4588595) loss:2.827 lr:0.0000100 epoch_Time:24507.0min: [2024-01-05 23:50:38,179][model8_pretrain.py][INFO] Epoch:[0/2](737200/4588595) loss:2.806 lr:0.0000100 epoch_Time:24507.0min: [2024-01-05 23:50:38,179][model8_pretrain.py][INFO] Epoch:[0/2](737200/4588595) loss:3.065 lr:0.0000100 epoch_Time:24507.0min: [2024-01-05 23:50:38,179][model8_pretrain.py][INFO] Epoch:[0/2](737200/4588595) loss:2.491 lr:0.0000100 epoch_Time:24507.0min: [2024-01-05 23:51:15,122][model8_pretrain.py][INFO] Epoch:[0/2](737300/4588595) loss:3.150 lr:0.0000100 epoch_Time:24506.0min: [2024-01-05 23:51:15,122][model8_pretrain.py][INFO] Epoch:[0/2](737300/4588595) loss:2.847 lr:0.0000100 epoch_Time:24506.0min: [2024-01-05 23:51:15,122][model8_pretrain.py][INFO] Epoch:[0/2](737300/4588595) loss:2.905 lr:0.0000100 epoch_Time:24506.0min: [2024-01-05 23:51:15,122][model8_pretrain.py][INFO] Epoch:[0/2](737300/4588595) loss:2.846 lr:0.0000100 epoch_Time:24506.0min: [2024-01-05 23:51:15,122][model8_pretrain.py][INFO] Epoch:[0/2](737300/4588595) loss:3.052 lr:0.0000100 epoch_Time:24506.0min: [2024-01-05 23:51:15,122][model8_pretrain.py][INFO] Epoch:[0/2](737300/4588595) loss:3.006 lr:0.0000100 epoch_Time:24506.0min: [2024-01-05 23:51:15,122][model8_pretrain.py][INFO] Epoch:[0/2](737300/4588595) loss:3.183 lr:0.0000100 epoch_Time:24506.0min: [2024-01-05 23:51:15,122][model8_pretrain.py][INFO] Epoch:[0/2](737300/4588595) loss:3.000 lr:0.0000100 epoch_Time:24506.0min: [2024-01-05 23:51:52,077][model8_pretrain.py][INFO] Epoch:[0/2](737400/4588595) loss:2.257 lr:0.0000100 epoch_Time:24505.0min: [2024-01-05 23:51:52,077][model8_pretrain.py][INFO] Epoch:[0/2](737400/4588595) loss:3.204 lr:0.0000100 epoch_Time:24505.0min: [2024-01-05 23:51:52,077][model8_pretrain.py][INFO] Epoch:[0/2](737400/4588595) loss:3.012 lr:0.0000100 epoch_Time:24505.0min: [2024-01-05 23:51:52,077][model8_pretrain.py][INFO] 
Epoch:[0/2](737400/4588595) loss:2.397 lr:0.0000100 epoch_Time:24505.0min: [2024-01-05 23:51:52,077][model8_pretrain.py][INFO] Epoch:[0/2](737400/4588595) loss:2.723 lr:0.0000100 epoch_Time:24505.0min: [2024-01-05 23:51:52,077][model8_pretrain.py][INFO] Epoch:[0/2](737400/4588595) loss:2.748 lr:0.0000100 epoch_Time:24505.0min: [2024-01-05 23:51:52,077][model8_pretrain.py][INFO] Epoch:[0/2](737400/4588595) loss:2.871 lr:0.0000100 epoch_Time:24505.0min: [2024-01-05 23:51:52,077][model8_pretrain.py][INFO] Epoch:[0/2](737400/4588595) loss:2.206 lr:0.0000100 epoch_Time:24505.0min: [2024-01-05 23:52:29,013][model8_pretrain.py][INFO] Epoch:[0/2](737500/4588595) loss:3.135 lr:0.0000100 epoch_Time:24505.0min: [2024-01-05 23:52:29,013][model8_pretrain.py][INFO] Epoch:[0/2](737500/4588595) loss:3.109 lr:0.0000100 epoch_Time:24505.0min: [2024-01-05 23:52:29,013][model8_pretrain.py][INFO] Epoch:[0/2](737500/4588595) loss:2.532 lr:0.0000100 epoch_Time:24505.0min: [2024-01-05 23:52:29,013][model8_pretrain.py][INFO] Epoch:[0/2](737500/4588595) loss:2.719 lr:0.0000100 epoch_Time:24505.0min: [2024-01-05 23:52:29,013][model8_pretrain.py][INFO] Epoch:[0/2](737500/4588595) loss:3.394 lr:0.0000100 epoch_Time:24505.0min: [2024-01-05 23:52:29,013][model8_pretrain.py][INFO] Epoch:[0/2](737500/4588595) loss:2.207 lr:0.0000100 epoch_Time:24505.0min: [2024-01-05 23:52:29,013][model8_pretrain.py][INFO] Epoch:[0/2](737500/4588595) loss:3.154 lr:0.0000100 epoch_Time:24505.0min: [2024-01-05 23:52:29,013][model8_pretrain.py][INFO] Epoch:[0/2](737500/4588595) loss:3.001 lr:0.0000100 epoch_Time:24505.0min: [2024-01-05 23:53:07,651][model8_pretrain.py][INFO] Epoch:[0/2](737600/4588595) loss:3.261 lr:0.0000100 epoch_Time:24504.0min: [2024-01-05 23:53:07,652][model8_pretrain.py][INFO] Epoch:[0/2](737600/4588595) loss:2.935 lr:0.0000100 epoch_Time:24504.0min: [2024-01-05 23:53:07,652][model8_pretrain.py][INFO] Epoch:[0/2](737600/4588595) loss:2.730 lr:0.0000100 epoch_Time:24504.0min: [2024-01-05 
23:53:07,652][model8_pretrain.py][INFO] Epoch:[0/2](737600/4588595) loss:2.634 lr:0.0000100 epoch_Time:24504.0min: [2024-01-05 23:53:07,652][model8_pretrain.py][INFO] Epoch:[0/2](737600/4588595) loss:2.941 lr:0.0000100 epoch_Time:24504.0min: [2024-01-05 23:53:07,652][model8_pretrain.py][INFO] Epoch:[0/2](737600/4588595) loss:2.958 lr:0.0000100 epoch_Time:24504.0min: [2024-01-05 23:53:07,652][model8_pretrain.py][INFO] Epoch:[0/2](737600/4588595) loss:2.960 lr:0.0000100 epoch_Time:24504.0min: [2024-01-05 23:53:07,652][model8_pretrain.py][INFO] Epoch:[0/2](737600/4588595) loss:3.044 lr:0.0000100 epoch_Time:24504.0min: [2024-01-05 23:53:54,683][model8_pretrain.py][INFO] Epoch:[0/2](737700/4588595) loss:2.215 lr:0.0000100 epoch_Time:24504.0min: [2024-01-05 23:53:54,683][model8_pretrain.py][INFO] Epoch:[0/2](737700/4588595) loss:3.022 lr:0.0000100 epoch_Time:24504.0min: [2024-01-05 23:53:54,683][model8_pretrain.py][INFO] Epoch:[0/2](737700/4588595) loss:2.594 lr:0.0000100 epoch_Time:24504.0min: [2024-01-05 23:53:54,683][model8_pretrain.py][INFO] Epoch:[0/2](737700/4588595) loss:2.477 lr:0.0000100 epoch_Time:24504.0min: [2024-01-05 23:53:54,683][model8_pretrain.py][INFO] Epoch:[0/2](737700/4588595) loss:2.199 lr:0.0000100 epoch_Time:24504.0min: [2024-01-05 23:53:54,683][model8_pretrain.py][INFO] Epoch:[0/2](737700/4588595) loss:2.817 lr:0.0000100 epoch_Time:24504.0min: [2024-01-05 23:53:54,683][model8_pretrain.py][INFO] Epoch:[0/2](737700/4588595) loss:2.918 lr:0.0000100 epoch_Time:24504.0min: [2024-01-05 23:53:54,683][model8_pretrain.py][INFO] Epoch:[0/2](737700/4588595) loss:2.983 lr:0.0000100 epoch_Time:24504.0min: [2024-01-05 23:54:31,610][model8_pretrain.py][INFO] Epoch:[0/2](737800/4588595) loss:2.614 lr:0.0000100 epoch_Time:24503.0min: [2024-01-05 23:54:31,610][model8_pretrain.py][INFO] Epoch:[0/2](737800/4588595) loss:3.053 lr:0.0000100 epoch_Time:24503.0min: [2024-01-05 23:54:31,610][model8_pretrain.py][INFO] Epoch:[0/2](737800/4588595) loss:2.500 lr:0.0000100 
epoch_Time:24503.0min: [2024-01-05 23:54:31,610][model8_pretrain.py][INFO] Epoch:[0/2](737800/4588595) loss:3.070 lr:0.0000100 epoch_Time:24503.0min: [2024-01-05 23:54:31,610][model8_pretrain.py][INFO] Epoch:[0/2](737800/4588595) loss:2.924 lr:0.0000100 epoch_Time:24503.0min: [2024-01-05 23:54:31,610][model8_pretrain.py][INFO] Epoch:[0/2](737800/4588595) loss:3.137 lr:0.0000100 epoch_Time:24503.0min: [2024-01-05 23:54:31,610][model8_pretrain.py][INFO] Epoch:[0/2](737800/4588595) loss:3.014 lr:0.0000100 epoch_Time:24503.0min: [2024-01-05 23:54:31,610][model8_pretrain.py][INFO] Epoch:[0/2](737800/4588595) loss:2.820 lr:0.0000100 epoch_Time:24503.0min: [2024-01-05 23:55:08,556][model8_pretrain.py][INFO] Epoch:[0/2](737900/4588595) loss:3.222 lr:0.0000100 epoch_Time:24502.0min: [2024-01-05 23:55:08,556][model8_pretrain.py][INFO] Epoch:[0/2](737900/4588595) loss:2.630 lr:0.0000100 epoch_Time:24502.0min: [2024-01-05 23:55:08,556][model8_pretrain.py][INFO] Epoch:[0/2](737900/4588595) loss:3.103 lr:0.0000100 epoch_Time:24502.0min: [2024-01-05 23:55:08,556][model8_pretrain.py][INFO] Epoch:[0/2](737900/4588595) loss:3.007 lr:0.0000100 epoch_Time:24502.0min: [2024-01-05 23:55:08,556][model8_pretrain.py][INFO] Epoch:[0/2](737900/4588595) loss:2.800 lr:0.0000100 epoch_Time:24502.0min: [2024-01-05 23:55:08,556][model8_pretrain.py][INFO] Epoch:[0/2](737900/4588595) loss:2.434 lr:0.0000100 epoch_Time:24502.0min: [2024-01-05 23:55:08,556][model8_pretrain.py][INFO] Epoch:[0/2](737900/4588595) loss:2.646 lr:0.0000100 epoch_Time:24502.0min: [2024-01-05 23:55:08,557][model8_pretrain.py][INFO] Epoch:[0/2](737900/4588595) loss:3.075 lr:0.0000100 epoch_Time:24502.0min: [2024-01-05 23:55:45,493][model8_pretrain.py][INFO] Epoch:[0/2](738000/4588595) loss:2.846 lr:0.0000100 epoch_Time:24502.0min: [2024-01-05 23:55:45,493][model8_pretrain.py][INFO] Epoch:[0/2](738000/4588595) loss:2.842 lr:0.0000100 epoch_Time:24502.0min: [2024-01-05 23:55:45,493][model8_pretrain.py][INFO] 
Epoch:[0/2](738000/4588595) loss:2.360 lr:0.0000100 epoch_Time:24502.0min: [2024-01-05 23:55:45,493][model8_pretrain.py][INFO] Epoch:[0/2](738000/4588595) loss:2.814 lr:0.0000100 epoch_Time:24502.0min: [2024-01-05 23:55:45,493][model8_pretrain.py][INFO] Epoch:[0/2](738000/4588595) loss:3.189 lr:0.0000100 epoch_Time:24502.0min: [2024-01-05 23:55:45,493][model8_pretrain.py][INFO] Epoch:[0/2](738000/4588595) loss:3.036 lr:0.0000100 epoch_Time:24502.0min: [2024-01-05 23:55:45,493][model8_pretrain.py][INFO] Epoch:[0/2](738000/4588595) loss:2.582 lr:0.0000100 epoch_Time:24502.0min: [2024-01-05 23:55:45,494][model8_pretrain.py][INFO] Epoch:[0/2](738000/4588595) loss:2.928 lr:0.0000100 epoch_Time:24502.0min: [2024-01-05 23:56:22,415][model8_pretrain.py][INFO] Epoch:[0/2](738100/4588595) loss:2.832 lr:0.0000100 epoch_Time:24501.0min: [2024-01-05 23:56:22,415][model8_pretrain.py][INFO] Epoch:[0/2](738100/4588595) loss:2.885 lr:0.0000100 epoch_Time:24501.0min: [2024-01-05 23:56:22,415][model8_pretrain.py][INFO] Epoch:[0/2](738100/4588595) loss:3.090 lr:0.0000100 epoch_Time:24501.0min: [2024-01-05 23:56:22,415][model8_pretrain.py][INFO] Epoch:[0/2](738100/4588595) loss:2.366 lr:0.0000100 epoch_Time:24501.0min: [2024-01-05 23:56:22,415][model8_pretrain.py][INFO] Epoch:[0/2](738100/4588595) loss:3.157 lr:0.0000100 epoch_Time:24501.0min: [2024-01-05 23:56:22,415][model8_pretrain.py][INFO] Epoch:[0/2](738100/4588595) loss:2.853 lr:0.0000100 epoch_Time:24501.0min: [2024-01-05 23:56:22,415][model8_pretrain.py][INFO] Epoch:[0/2](738100/4588595) loss:3.004 lr:0.0000100 epoch_Time:24501.0min: [2024-01-05 23:56:22,415][model8_pretrain.py][INFO] Epoch:[0/2](738100/4588595) loss:3.116 lr:0.0000100 epoch_Time:24501.0min: [2024-01-05 23:56:59,336][model8_pretrain.py][INFO] Epoch:[0/2](738200/4588595) loss:2.895 lr:0.0000100 epoch_Time:24500.0min: [2024-01-05 23:56:59,336][model8_pretrain.py][INFO] Epoch:[0/2](738200/4588595) loss:2.604 lr:0.0000100 epoch_Time:24500.0min: [2024-01-05 
23:56:59,336][model8_pretrain.py][INFO] Epoch:[0/2](738200/4588595) loss:2.727 lr:0.0000100 epoch_Time:24500.0min: [2024-01-05 23:56:59,336][model8_pretrain.py][INFO] Epoch:[0/2](738200/4588595) loss:2.832 lr:0.0000100 epoch_Time:24500.0min: [2024-01-05 23:56:59,336][model8_pretrain.py][INFO] Epoch:[0/2](738200/4588595) loss:2.589 lr:0.0000100 epoch_Time:24500.0min: [2024-01-05 23:56:59,336][model8_pretrain.py][INFO] Epoch:[0/2](738200/4588595) loss:2.494 lr:0.0000100 epoch_Time:24500.0min: [2024-01-05 23:56:59,336][model8_pretrain.py][INFO] Epoch:[0/2](738200/4588595) loss:2.590 lr:0.0000100 epoch_Time:24500.0min: [2024-01-05 23:56:59,336][model8_pretrain.py][INFO] Epoch:[0/2](738200/4588595) loss:2.424 lr:0.0000100 epoch_Time:24500.0min: [2024-01-05 23:57:36,265][model8_pretrain.py][INFO] Epoch:[0/2](738300/4588595) loss:2.279 lr:0.0000100 epoch_Time:24500.0min: [2024-01-05 23:57:36,265][model8_pretrain.py][INFO] Epoch:[0/2](738300/4588595) loss:2.989 lr:0.0000100 epoch_Time:24500.0min: [2024-01-05 23:57:36,265][model8_pretrain.py][INFO] Epoch:[0/2](738300/4588595) loss:2.141 lr:0.0000100 epoch_Time:24500.0min: [2024-01-05 23:57:36,265][model8_pretrain.py][INFO] Epoch:[0/2](738300/4588595) loss:2.629 lr:0.0000100 epoch_Time:24500.0min: [2024-01-05 23:57:36,265][model8_pretrain.py][INFO] Epoch:[0/2](738300/4588595) loss:2.858 lr:0.0000100 epoch_Time:24500.0min: [2024-01-05 23:57:36,265][model8_pretrain.py][INFO] Epoch:[0/2](738300/4588595) loss:2.351 lr:0.0000100 epoch_Time:24500.0min: [2024-01-05 23:57:36,265][model8_pretrain.py][INFO] Epoch:[0/2](738300/4588595) loss:3.125 lr:0.0000100 epoch_Time:24500.0min: [2024-01-05 23:57:36,265][model8_pretrain.py][INFO] Epoch:[0/2](738300/4588595) loss:1.877 lr:0.0000100 epoch_Time:24500.0min: [2024-01-05 23:58:14,918][model8_pretrain.py][INFO] Epoch:[0/2](738400/4588595) loss:3.054 lr:0.0000100 epoch_Time:24499.0min: [2024-01-05 23:58:14,918][model8_pretrain.py][INFO] Epoch:[0/2](738400/4588595) loss:2.506 lr:0.0000100 
epoch_Time:24499.0min: [2024-01-05 23:58:14,918][model8_pretrain.py][INFO] Epoch:[0/2](738400/4588595) loss:2.762 lr:0.0000100 epoch_Time:24499.0min: [2024-01-05 23:58:14,918][model8_pretrain.py][INFO] Epoch:[0/2](738400/4588595) loss:2.437 lr:0.0000100 epoch_Time:24499.0min: [2024-01-05 23:58:14,918][model8_pretrain.py][INFO] Epoch:[0/2](738400/4588595) loss:2.917 lr:0.0000100 epoch_Time:24499.0min: [2024-01-05 23:58:14,918][model8_pretrain.py][INFO] Epoch:[0/2](738400/4588595) loss:2.769 lr:0.0000100 epoch_Time:24499.0min: [2024-01-05 23:58:14,918][model8_pretrain.py][INFO] Epoch:[0/2](738400/4588595) loss:2.767 lr:0.0000100 epoch_Time:24499.0min: [2024-01-05 23:58:14,918][model8_pretrain.py][INFO] Epoch:[0/2](738400/4588595) loss:2.677 lr:0.0000100 epoch_Time:24499.0min: [2024-01-05 23:59:02,646][model8_pretrain.py][INFO] Epoch:[0/2](738500/4588595) loss:2.519 lr:0.0000100 epoch_Time:24499.0min: [2024-01-05 23:59:02,646][model8_pretrain.py][INFO] Epoch:[0/2](738500/4588595) loss:2.708 lr:0.0000100 epoch_Time:24499.0min: [2024-01-05 23:59:02,646][model8_pretrain.py][INFO] Epoch:[0/2](738500/4588595) loss:3.053 lr:0.0000100 epoch_Time:24499.0min: [2024-01-05 23:59:02,646][model8_pretrain.py][INFO] Epoch:[0/2](738500/4588595) loss:3.075 lr:0.0000100 epoch_Time:24499.0min: [2024-01-05 23:59:02,646][model8_pretrain.py][INFO] Epoch:[0/2](738500/4588595) loss:3.223 lr:0.0000100 epoch_Time:24499.0min: [2024-01-05 23:59:02,646][model8_pretrain.py][INFO] Epoch:[0/2](738500/4588595) loss:3.468 lr:0.0000100 epoch_Time:24499.0min: [2024-01-05 23:59:02,647][model8_pretrain.py][INFO] Epoch:[0/2](738500/4588595) loss:3.177 lr:0.0000100 epoch_Time:24499.0min: [2024-01-05 23:59:02,647][model8_pretrain.py][INFO] Epoch:[0/2](738500/4588595) loss:3.320 lr:0.0000100 epoch_Time:24499.0min: [2024-01-05 23:59:39,581][model8_pretrain.py][INFO] Epoch:[0/2](738600/4588595) loss:2.937 lr:0.0000100 epoch_Time:24499.0min: [2024-01-05 23:59:39,581][model8_pretrain.py][INFO] 
Epoch:[0/2](738600/4588595) loss:2.402 lr:0.0000100 epoch_Time:24499.0min: [2024-01-05 23:59:39,581][model8_pretrain.py][INFO] Epoch:[0/2](738600/4588595) loss:2.606 lr:0.0000100 epoch_Time:24499.0min: [2024-01-05 23:59:39,581][model8_pretrain.py][INFO] Epoch:[0/2](738600/4588595) loss:2.751 lr:0.0000100 epoch_Time:24499.0min: [2024-01-05 23:59:39,581][model8_pretrain.py][INFO] Epoch:[0/2](738600/4588595) loss:2.768 lr:0.0000100 epoch_Time:24499.0min: [2024-01-05 23:59:39,581][model8_pretrain.py][INFO] Epoch:[0/2](738600/4588595) loss:2.400 lr:0.0000100 epoch_Time:24499.0min: [2024-01-05 23:59:39,581][model8_pretrain.py][INFO] Epoch:[0/2](738600/4588595) loss:2.574 lr:0.0000100 epoch_Time:24499.0min: [2024-01-05 23:59:39,581][model8_pretrain.py][INFO] Epoch:[0/2](738600/4588595) loss:2.553 lr:0.0000100 epoch_Time:24499.0min: [2024-01-06 00:00:16,523][model8_pretrain.py][INFO] Epoch:[0/2](738700/4588595) loss:2.857 lr:0.0000100 epoch_Time:24498.0min: [2024-01-06 00:00:16,523][model8_pretrain.py][INFO] Epoch:[0/2](738700/4588595) loss:2.032 lr:0.0000100 epoch_Time:24498.0min: [2024-01-06 00:00:16,523][model8_pretrain.py][INFO] Epoch:[0/2](738700/4588595) loss:2.498 lr:0.0000100 epoch_Time:24498.0min: [2024-01-06 00:00:16,523][model8_pretrain.py][INFO] Epoch:[0/2](738700/4588595) loss:2.629 lr:0.0000100 epoch_Time:24498.0min: [2024-01-06 00:00:16,524][model8_pretrain.py][INFO] Epoch:[0/2](738700/4588595) loss:3.042 lr:0.0000100 epoch_Time:24498.0min: [2024-01-06 00:00:16,524][model8_pretrain.py][INFO] Epoch:[0/2](738700/4588595) loss:2.472 lr:0.0000100 epoch_Time:24498.0min: [2024-01-06 00:00:16,524][model8_pretrain.py][INFO] Epoch:[0/2](738700/4588595) loss:2.446 lr:0.0000100 epoch_Time:24498.0min: [2024-01-06 00:00:16,524][model8_pretrain.py][INFO] Epoch:[0/2](738700/4588595) loss:2.767 lr:0.0000100 epoch_Time:24498.0min: [2024-01-06 00:00:53,452][model8_pretrain.py][INFO] Epoch:[0/2](738800/4588595) loss:2.906 lr:0.0000100 epoch_Time:24496.0min: [2024-01-06 
00:00:53,452][model8_pretrain.py][INFO] Epoch:[0/2](738800/4588595) loss:2.676 lr:0.0000100 epoch_Time:24496.0min: [2024-01-06 00:00:53,452][model8_pretrain.py][INFO] Epoch:[0/2](738800/4588595) loss:2.722 lr:0.0000100 epoch_Time:24496.0min: [2024-01-06 00:00:53,452][model8_pretrain.py][INFO] Epoch:[0/2](738800/4588595) loss:2.097 lr:0.0000100 epoch_Time:24496.0min: [2024-01-06 00:00:53,452][model8_pretrain.py][INFO] Epoch:[0/2](738800/4588595) loss:2.805 lr:0.0000100 epoch_Time:24496.0min: [2024-01-06 00:00:53,452][model8_pretrain.py][INFO] Epoch:[0/2](738800/4588595) loss:2.328 lr:0.0000100 epoch_Time:24496.0min: [2024-01-06 00:00:53,452][model8_pretrain.py][INFO] Epoch:[0/2](738800/4588595) loss:2.153 lr:0.0000100 epoch_Time:24496.0min: [2024-01-06 00:00:53,452][model8_pretrain.py][INFO] Epoch:[0/2](738800/4588595) loss:3.423 lr:0.0000100 epoch_Time:24496.0min: [2024-01-06 00:01:30,386][model8_pretrain.py][INFO] Epoch:[0/2](738900/4588595) loss:2.962 lr:0.0000100 epoch_Time:24496.0min: [2024-01-06 00:01:30,386][model8_pretrain.py][INFO] Epoch:[0/2](738900/4588595) loss:2.581 lr:0.0000100 epoch_Time:24496.0min: [2024-01-06 00:01:30,386][model8_pretrain.py][INFO] Epoch:[0/2](738900/4588595) loss:2.525 lr:0.0000100 epoch_Time:24496.0min: [2024-01-06 00:01:30,386][model8_pretrain.py][INFO] Epoch:[0/2](738900/4588595) loss:2.855 lr:0.0000100 epoch_Time:24496.0min: [2024-01-06 00:01:30,386][model8_pretrain.py][INFO] Epoch:[0/2](738900/4588595) loss:2.366 lr:0.0000100 epoch_Time:24496.0min: [2024-01-06 00:01:30,386][model8_pretrain.py][INFO] Epoch:[0/2](738900/4588595) loss:2.625 lr:0.0000100 epoch_Time:24496.0min: [2024-01-06 00:01:30,387][model8_pretrain.py][INFO] Epoch:[0/2](738900/4588595) loss:2.477 lr:0.0000100 epoch_Time:24496.0min: [2024-01-06 00:01:30,387][model8_pretrain.py][INFO] Epoch:[0/2](738900/4588595) loss:2.791 lr:0.0000100 epoch_Time:24496.0min: [2024-01-06 00:02:07,316][model8_pretrain.py][INFO] Epoch:[0/2](739000/4588595) loss:2.744 lr:0.0000100 
epoch_Time:24495.0min: [2024-01-06 00:02:07,316][model8_pretrain.py][INFO] Epoch:[0/2](739000/4588595) loss:2.716 lr:0.0000100 epoch_Time:24495.0min: [2024-01-06 00:02:07,316][model8_pretrain.py][INFO] Epoch:[0/2](739000/4588595) loss:2.270 lr:0.0000100 epoch_Time:24495.0min: [2024-01-06 00:02:07,317][model8_pretrain.py][INFO] Epoch:[0/2](739000/4588595) loss:2.984 lr:0.0000100 epoch_Time:24495.0min: [2024-01-06 00:02:07,317][model8_pretrain.py][INFO] Epoch:[0/2](739000/4588595) loss:3.070 lr:0.0000100 epoch_Time:24495.0min: [2024-01-06 00:02:07,317][model8_pretrain.py][INFO] Epoch:[0/2](739000/4588595) loss:2.710 lr:0.0000100 epoch_Time:24495.0min: [2024-01-06 00:02:07,317][model8_pretrain.py][INFO] Epoch:[0/2](739000/4588595) loss:2.446 lr:0.0000100 epoch_Time:24495.0min: [2024-01-06 00:02:07,317][model8_pretrain.py][INFO] Epoch:[0/2](739000/4588595) loss:2.610 lr:0.0000100 epoch_Time:24495.0min: [2024-01-06 00:02:44,248][model8_pretrain.py][INFO] Epoch:[0/2](739100/4588595) loss:2.300 lr:0.0000100 epoch_Time:24495.0min: [2024-01-06 00:02:44,248][model8_pretrain.py][INFO] Epoch:[0/2](739100/4588595) loss:2.865 lr:0.0000100 epoch_Time:24495.0min: [2024-01-06 00:02:44,248][model8_pretrain.py][INFO] Epoch:[0/2](739100/4588595) loss:3.172 lr:0.0000100 epoch_Time:24495.0min: [2024-01-06 00:02:44,248][model8_pretrain.py][INFO] Epoch:[0/2](739100/4588595) loss:2.892 lr:0.0000100 epoch_Time:24495.0min: [2024-01-06 00:02:44,248][model8_pretrain.py][INFO] Epoch:[0/2](739100/4588595) loss:2.685 lr:0.0000100 epoch_Time:24495.0min: [2024-01-06 00:02:44,248][model8_pretrain.py][INFO] Epoch:[0/2](739100/4588595) loss:2.154 lr:0.0000100 epoch_Time:24495.0min: [2024-01-06 00:02:44,248][model8_pretrain.py][INFO] Epoch:[0/2](739100/4588595) loss:2.573 lr:0.0000100 epoch_Time:24495.0min: [2024-01-06 00:02:44,249][model8_pretrain.py][INFO] Epoch:[0/2](739100/4588595) loss:2.923 lr:0.0000100 epoch_Time:24495.0min: [2024-01-06 00:03:21,174][model8_pretrain.py][INFO] 
Epoch:[0/2](739200/4588595) loss:2.319 lr:0.0000100 epoch_Time:24494.0min: [2024-01-06 00:03:21,174][model8_pretrain.py][INFO] Epoch:[0/2](739200/4588595) loss:3.160 lr:0.0000100 epoch_Time:24494.0min: [2024-01-06 00:03:21,174][model8_pretrain.py][INFO] Epoch:[0/2](739200/4588595) loss:2.723 lr:0.0000100 epoch_Time:24494.0min: [2024-01-06 00:03:21,174][model8_pretrain.py][INFO] Epoch:[0/2](739200/4588595) loss:2.547 lr:0.0000100 epoch_Time:24494.0min: [2024-01-06 00:03:21,174][model8_pretrain.py][INFO] Epoch:[0/2](739200/4588595) loss:2.541 lr:0.0000100 epoch_Time:24494.0min: [2024-01-06 00:03:21,174][model8_pretrain.py][INFO] Epoch:[0/2](739200/4588595) loss:3.519 lr:0.0000100 epoch_Time:24494.0min: [2024-01-06 00:03:21,174][model8_pretrain.py][INFO] Epoch:[0/2](739200/4588595) loss:2.898 lr:0.0000100 epoch_Time:24494.0min: [2024-01-06 00:03:21,174][model8_pretrain.py][INFO] Epoch:[0/2](739200/4588595) loss:2.489 lr:0.0000100 epoch_Time:24494.0min: [2024-01-06 00:04:10,020][model8_pretrain.py][INFO] Epoch:[0/2](739300/4588595) loss:2.441 lr:0.0000100 epoch_Time:24494.0min: [2024-01-06 00:04:10,020][model8_pretrain.py][INFO] Epoch:[0/2](739300/4588595) loss:2.280 lr:0.0000100 epoch_Time:24494.0min: [2024-01-06 00:04:10,020][model8_pretrain.py][INFO] Epoch:[0/2](739300/4588595) loss:3.073 lr:0.0000100 epoch_Time:24494.0min: [2024-01-06 00:04:10,020][model8_pretrain.py][INFO] Epoch:[0/2](739300/4588595) loss:3.293 lr:0.0000100 epoch_Time:24494.0min: [2024-01-06 00:04:10,020][model8_pretrain.py][INFO] Epoch:[0/2](739300/4588595) loss:2.627 lr:0.0000100 epoch_Time:24494.0min: [2024-01-06 00:04:10,020][model8_pretrain.py][INFO] Epoch:[0/2](739300/4588595) loss:2.881 lr:0.0000100 epoch_Time:24494.0min: [2024-01-06 00:04:10,020][model8_pretrain.py][INFO] Epoch:[0/2](739300/4588595) loss:3.111 lr:0.0000100 epoch_Time:24494.0min: [2024-01-06 00:04:10,021][model8_pretrain.py][INFO] Epoch:[0/2](739300/4588595) loss:2.442 lr:0.0000100 epoch_Time:24494.0min: [2024-01-06 
00:04:46,955][model8_pretrain.py][INFO] Epoch:[0/2](739400/4588595) loss:2.641 lr:0.0000100 epoch_Time:24494.0min: [2024-01-06 00:04:46,955][model8_pretrain.py][INFO] Epoch:[0/2](739400/4588595) loss:2.558 lr:0.0000100 epoch_Time:24494.0min: [2024-01-06 00:04:46,955][model8_pretrain.py][INFO] Epoch:[0/2](739400/4588595) loss:2.462 lr:0.0000100 epoch_Time:24494.0min: [2024-01-06 00:04:46,955][model8_pretrain.py][INFO] Epoch:[0/2](739400/4588595) loss:2.623 lr:0.0000100 epoch_Time:24494.0min: [2024-01-06 00:04:46,955][model8_pretrain.py][INFO] Epoch:[0/2](739400/4588595) loss:2.757 lr:0.0000100 epoch_Time:24494.0min: [2024-01-06 00:04:46,955][model8_pretrain.py][INFO] Epoch:[0/2](739400/4588595) loss:2.783 lr:0.0000100 epoch_Time:24494.0min: [2024-01-06 00:04:46,955][model8_pretrain.py][INFO] Epoch:[0/2](739400/4588595) loss:3.026 lr:0.0000100 epoch_Time:24494.0min: [2024-01-06 00:04:46,955][model8_pretrain.py][INFO] Epoch:[0/2](739400/4588595) loss:2.999 lr:0.0000100 epoch_Time:24494.0min: [2024-01-06 00:05:23,893][model8_pretrain.py][INFO] Epoch:[0/2](739500/4588595) loss:2.711 lr:0.0000100 epoch_Time:24493.0min: [2024-01-06 00:05:23,893][model8_pretrain.py][INFO] Epoch:[0/2](739500/4588595) loss:2.558 lr:0.0000100 epoch_Time:24493.0min: [2024-01-06 00:05:23,893][model8_pretrain.py][INFO] Epoch:[0/2](739500/4588595) loss:2.867 lr:0.0000100 epoch_Time:24493.0min: [2024-01-06 00:05:23,893][model8_pretrain.py][INFO] Epoch:[0/2](739500/4588595) loss:2.963 lr:0.0000100 epoch_Time:24493.0min: [2024-01-06 00:05:23,894][model8_pretrain.py][INFO] Epoch:[0/2](739500/4588595) loss:2.732 lr:0.0000100 epoch_Time:24493.0min: [2024-01-06 00:05:23,894][model8_pretrain.py][INFO] Epoch:[0/2](739500/4588595) loss:2.465 lr:0.0000100 epoch_Time:24493.0min: [2024-01-06 00:05:23,894][model8_pretrain.py][INFO] Epoch:[0/2](739500/4588595) loss:3.022 lr:0.0000100 epoch_Time:24493.0min: [2024-01-06 00:05:23,894][model8_pretrain.py][INFO] Epoch:[0/2](739500/4588595) loss:3.230 lr:0.0000100 
epoch_Time:24493.0min: [2024-01-06 00:06:00,829][model8_pretrain.py][INFO] Epoch:[0/2](739600/4588595) loss:3.000 lr:0.0000100 epoch_Time:24492.0min: [2024-01-06 00:06:00,829][model8_pretrain.py][INFO] Epoch:[0/2](739600/4588595) loss:3.018 lr:0.0000100 epoch_Time:24492.0min: [2024-01-06 00:06:00,829][model8_pretrain.py][INFO] Epoch:[0/2](739600/4588595) loss:2.896 lr:0.0000100 epoch_Time:24492.0min: [2024-01-06 00:06:00,829][model8_pretrain.py][INFO] Epoch:[0/2](739600/4588595) loss:2.901 lr:0.0000100 epoch_Time:24492.0min: [2024-01-06 00:06:00,829][model8_pretrain.py][INFO] Epoch:[0/2](739600/4588595) loss:2.364 lr:0.0000100 epoch_Time:24492.0min: [2024-01-06 00:06:00,829][model8_pretrain.py][INFO] Epoch:[0/2](739600/4588595) loss:2.943 lr:0.0000100 epoch_Time:24492.0min: [2024-01-06 00:06:00,829][model8_pretrain.py][INFO] Epoch:[0/2](739600/4588595) loss:3.118 lr:0.0000100 epoch_Time:24492.0min: [2024-01-06 00:06:00,829][model8_pretrain.py][INFO] Epoch:[0/2](739600/4588595) loss:2.604 lr:0.0000100 epoch_Time:24492.0min: [2024-01-06 00:06:37,772][model8_pretrain.py][INFO] Epoch:[0/2](739700/4588595) loss:2.788 lr:0.0000100 epoch_Time:24491.0min: [2024-01-06 00:06:37,772][model8_pretrain.py][INFO] Epoch:[0/2](739700/4588595) loss:2.752 lr:0.0000100 epoch_Time:24491.0min: [2024-01-06 00:06:37,772][model8_pretrain.py][INFO] Epoch:[0/2](739700/4588595) loss:3.052 lr:0.0000100 epoch_Time:24491.0min: [2024-01-06 00:06:37,772][model8_pretrain.py][INFO] Epoch:[0/2](739700/4588595) loss:2.947 lr:0.0000100 epoch_Time:24491.0min: [2024-01-06 00:06:37,772][model8_pretrain.py][INFO] Epoch:[0/2](739700/4588595) loss:2.880 lr:0.0000100 epoch_Time:24491.0min: [2024-01-06 00:06:37,772][model8_pretrain.py][INFO] Epoch:[0/2](739700/4588595) loss:2.845 lr:0.0000100 epoch_Time:24491.0min: [2024-01-06 00:06:37,772][model8_pretrain.py][INFO] Epoch:[0/2](739700/4588595) loss:2.710 lr:0.0000100 epoch_Time:24491.0min: [2024-01-06 00:06:37,772][model8_pretrain.py][INFO] 
Epoch:[0/2](739700/4588595) loss:2.813 lr:0.0000100 epoch_Time:24491.0min: [2024-01-06 00:07:14,708][model8_pretrain.py][INFO] Epoch:[0/2](739800/4588595) loss:2.760 lr:0.0000100 epoch_Time:24490.0min: [2024-01-06 00:07:14,708][model8_pretrain.py][INFO] Epoch:[0/2](739800/4588595) loss:2.688 lr:0.0000100 epoch_Time:24490.0min: [2024-01-06 00:07:14,708][model8_pretrain.py][INFO] Epoch:[0/2](739800/4588595) loss:3.200 lr:0.0000100 epoch_Time:24490.0min: [2024-01-06 00:07:14,708][model8_pretrain.py][INFO] Epoch:[0/2](739800/4588595) loss:3.328 lr:0.0000100 epoch_Time:24490.0min: [2024-01-06 00:07:14,708][model8_pretrain.py][INFO] Epoch:[0/2](739800/4588595) loss:2.858 lr:0.0000100 epoch_Time:24490.0min: [2024-01-06 00:07:14,708][model8_pretrain.py][INFO] Epoch:[0/2](739800/4588595) loss:2.872 lr:0.0000100 epoch_Time:24490.0min: [2024-01-06 00:07:14,708][model8_pretrain.py][INFO] Epoch:[0/2](739800/4588595) loss:2.954 lr:0.0000100 epoch_Time:24490.0min: [2024-01-06 00:07:14,708][model8_pretrain.py][INFO] Epoch:[0/2](739800/4588595) loss:3.248 lr:0.0000100 epoch_Time:24490.0min: [2024-01-06 00:07:51,641][model8_pretrain.py][INFO] Epoch:[0/2](739900/4588595) loss:3.131 lr:0.0000100 epoch_Time:24489.0min: [2024-01-06 00:07:51,641][model8_pretrain.py][INFO] Epoch:[0/2](739900/4588595) loss:3.048 lr:0.0000100 epoch_Time:24489.0min: [2024-01-06 00:07:51,641][model8_pretrain.py][INFO] Epoch:[0/2](739900/4588595) loss:2.710 lr:0.0000100 epoch_Time:24489.0min: [2024-01-06 00:07:51,641][model8_pretrain.py][INFO] Epoch:[0/2](739900/4588595) loss:3.251 lr:0.0000100 epoch_Time:24489.0min: [2024-01-06 00:07:51,641][model8_pretrain.py][INFO] Epoch:[0/2](739900/4588595) loss:2.805 lr:0.0000100 epoch_Time:24489.0min: [2024-01-06 00:07:51,641][model8_pretrain.py][INFO] Epoch:[0/2](739900/4588595) loss:2.654 lr:0.0000100 epoch_Time:24489.0min: [2024-01-06 00:07:51,641][model8_pretrain.py][INFO] Epoch:[0/2](739900/4588595) loss:2.833 lr:0.0000100 epoch_Time:24489.0min: [2024-01-06 
00:07:51,642][model8_pretrain.py][INFO] Epoch:[0/2](739900/4588595) loss:2.853 lr:0.0000100 epoch_Time:24489.0min: [2024-01-06 00:08:28,575][model8_pretrain.py][INFO] Epoch:[0/2](740000/4588595) loss:3.119 lr:0.0000100 epoch_Time:24489.0min: [2024-01-06 00:08:28,575][model8_pretrain.py][INFO] Epoch:[0/2](740000/4588595) loss:2.606 lr:0.0000100 epoch_Time:24489.0min: [2024-01-06 00:08:28,575][model8_pretrain.py][INFO] Epoch:[0/2](740000/4588595) loss:3.287 lr:0.0000100 epoch_Time:24489.0min: [2024-01-06 00:08:28,575][model8_pretrain.py][INFO] Epoch:[0/2](740000/4588595) loss:3.584 lr:0.0000100 epoch_Time:24489.0min: [2024-01-06 00:08:28,575][model8_pretrain.py][INFO] Epoch:[0/2](740000/4588595) loss:2.832 lr:0.0000100 epoch_Time:24489.0min: [2024-01-06 00:08:28,575][model8_pretrain.py][INFO] Epoch:[0/2](740000/4588595) loss:2.272 lr:0.0000100 epoch_Time:24489.0min: [2024-01-06 00:08:28,575][model8_pretrain.py][INFO] Epoch:[0/2](740000/4588595) loss:3.041 lr:0.0000100 epoch_Time:24489.0min: [2024-01-06 00:08:28,576][model8_pretrain.py][INFO] Epoch:[0/2](740000/4588595) loss:2.827 lr:0.0000100 epoch_Time:24489.0min: [2024-01-06 00:09:17,376][model8_pretrain.py][INFO] Epoch:[0/2](740100/4588595) loss:2.080 lr:0.0000100 epoch_Time:24489.0min: [2024-01-06 00:09:17,376][model8_pretrain.py][INFO] Epoch:[0/2](740100/4588595) loss:3.176 lr:0.0000100 epoch_Time:24489.0min: [2024-01-06 00:09:17,376][model8_pretrain.py][INFO] Epoch:[0/2](740100/4588595) loss:2.945 lr:0.0000100 epoch_Time:24489.0min: [2024-01-06 00:09:17,376][model8_pretrain.py][INFO] Epoch:[0/2](740100/4588595) loss:2.634 lr:0.0000100 epoch_Time:24489.0min: [2024-01-06 00:09:17,376][model8_pretrain.py][INFO] Epoch:[0/2](740100/4588595) loss:2.657 lr:0.0000100 epoch_Time:24489.0min: [2024-01-06 00:09:17,376][model8_pretrain.py][INFO] Epoch:[0/2](740100/4588595) loss:2.893 lr:0.0000100 epoch_Time:24489.0min: [2024-01-06 00:09:17,376][model8_pretrain.py][INFO] Epoch:[0/2](740100/4588595) loss:3.044 lr:0.0000100 
epoch_Time:24489.0min: [2024-01-06 00:09:17,376][model8_pretrain.py][INFO] Epoch:[0/2](740100/4588595) loss:3.058 lr:0.0000100 epoch_Time:24489.0min: [2024-01-06 00:09:54,318][model8_pretrain.py][INFO] Epoch:[0/2](740200/4588595) loss:2.286 lr:0.0000100 epoch_Time:24488.0min: [2024-01-06 00:09:54,318][model8_pretrain.py][INFO] Epoch:[0/2](740200/4588595) loss:2.862 lr:0.0000100 epoch_Time:24488.0min: [2024-01-06 00:09:54,318][model8_pretrain.py][INFO] Epoch:[0/2](740200/4588595) loss:3.103 lr:0.0000100 epoch_Time:24488.0min: [2024-01-06 00:09:54,318][model8_pretrain.py][INFO] Epoch:[0/2](740200/4588595) loss:2.830 lr:0.0000100 epoch_Time:24488.0min: [2024-01-06 00:09:54,318][model8_pretrain.py][INFO] Epoch:[0/2](740200/4588595) loss:3.360 lr:0.0000100 epoch_Time:24488.0min: [2024-01-06 00:09:54,318][model8_pretrain.py][INFO] Epoch:[0/2](740200/4588595) loss:2.439 lr:0.0000100 epoch_Time:24488.0min: [2024-01-06 00:09:54,318][model8_pretrain.py][INFO] Epoch:[0/2](740200/4588595) loss:2.662 lr:0.0000100 epoch_Time:24488.0min: [2024-01-06 00:09:54,319][model8_pretrain.py][INFO] Epoch:[0/2](740200/4588595) loss:3.217 lr:0.0000100 epoch_Time:24488.0min: [2024-01-06 00:10:31,262][model8_pretrain.py][INFO] Epoch:[0/2](740300/4588595) loss:2.899 lr:0.0000100 epoch_Time:24488.0min: [2024-01-06 00:10:31,262][model8_pretrain.py][INFO] Epoch:[0/2](740300/4588595) loss:2.923 lr:0.0000100 epoch_Time:24488.0min: [2024-01-06 00:10:31,262][model8_pretrain.py][INFO] Epoch:[0/2](740300/4588595) loss:1.922 lr:0.0000100 epoch_Time:24488.0min: [2024-01-06 00:10:31,262][model8_pretrain.py][INFO] Epoch:[0/2](740300/4588595) loss:3.075 lr:0.0000100 epoch_Time:24488.0min: [2024-01-06 00:10:31,262][model8_pretrain.py][INFO] Epoch:[0/2](740300/4588595) loss:2.279 lr:0.0000100 epoch_Time:24488.0min: [2024-01-06 00:10:31,262][model8_pretrain.py][INFO] Epoch:[0/2](740300/4588595) loss:2.787 lr:0.0000100 epoch_Time:24488.0min: [2024-01-06 00:10:31,262][model8_pretrain.py][INFO] 
Epoch:[0/2](740300/4588595) loss:2.595 lr:0.0000100 epoch_Time:24488.0min: [2024-01-06 00:10:31,262][model8_pretrain.py][INFO] Epoch:[0/2](740300/4588595) loss:2.849 lr:0.0000100 epoch_Time:24488.0min: [2024-01-06 00:11:08,211][model8_pretrain.py][INFO] Epoch:[0/2](740400/4588595) loss:3.137 lr:0.0000100 epoch_Time:24487.0min: [2024-01-06 00:11:08,211][model8_pretrain.py][INFO] Epoch:[0/2](740400/4588595) loss:3.129 lr:0.0000100 epoch_Time:24487.0min: [2024-01-06 00:11:08,211][model8_pretrain.py][INFO] Epoch:[0/2](740400/4588595) loss:2.798 lr:0.0000100 epoch_Time:24487.0min: [2024-01-06 00:11:08,211][model8_pretrain.py][INFO] Epoch:[0/2](740400/4588595) loss:2.492 lr:0.0000100 epoch_Time:24487.0min: [2024-01-06 00:11:08,211][model8_pretrain.py][INFO] Epoch:[0/2](740400/4588595) loss:3.250 lr:0.0000100 epoch_Time:24487.0min: [2024-01-06 00:11:08,211][model8_pretrain.py][INFO] Epoch:[0/2](740400/4588595) loss:2.922 lr:0.0000100 epoch_Time:24487.0min: [2024-01-06 00:11:08,211][model8_pretrain.py][INFO] Epoch:[0/2](740400/4588595) loss:2.928 lr:0.0000100 epoch_Time:24487.0min: [2024-01-06 00:11:08,211][model8_pretrain.py][INFO] Epoch:[0/2](740400/4588595) loss:2.707 lr:0.0000100 epoch_Time:24487.0min: [2024-01-06 00:11:45,144][model8_pretrain.py][INFO] Epoch:[0/2](740500/4588595) loss:3.211 lr:0.0000100 epoch_Time:24487.0min: [2024-01-06 00:11:45,144][model8_pretrain.py][INFO] Epoch:[0/2](740500/4588595) loss:2.958 lr:0.0000100 epoch_Time:24487.0min: [2024-01-06 00:11:45,144][model8_pretrain.py][INFO] Epoch:[0/2](740500/4588595) loss:3.155 lr:0.0000100 epoch_Time:24487.0min: [2024-01-06 00:11:45,144][model8_pretrain.py][INFO] Epoch:[0/2](740500/4588595) loss:1.920 lr:0.0000100 epoch_Time:24487.0min: [2024-01-06 00:11:45,144][model8_pretrain.py][INFO] Epoch:[0/2](740500/4588595) loss:2.530 lr:0.0000100 epoch_Time:24487.0min: [2024-01-06 00:11:45,144][model8_pretrain.py][INFO] Epoch:[0/2](740500/4588595) loss:2.991 lr:0.0000100 epoch_Time:24487.0min: [2024-01-06 
00:11:45,144][model8_pretrain.py][INFO] Epoch:[0/2](740500/4588595) loss:2.458 lr:0.0000100 epoch_Time:24487.0min: [2024-01-06 00:11:45,144][model8_pretrain.py][INFO] Epoch:[0/2](740500/4588595) loss:2.357 lr:0.0000100 epoch_Time:24487.0min: [2024-01-06 00:12:22,078][model8_pretrain.py][INFO] Epoch:[0/2](740600/4588595) loss:2.401 lr:0.0000100 epoch_Time:24486.0min: [2024-01-06 00:12:22,078][model8_pretrain.py][INFO] Epoch:[0/2](740600/4588595) loss:3.355 lr:0.0000100 epoch_Time:24486.0min: [2024-01-06 00:12:22,078][model8_pretrain.py][INFO] Epoch:[0/2](740600/4588595) loss:2.991 lr:0.0000100 epoch_Time:24486.0min: [2024-01-06 00:12:22,078][model8_pretrain.py][INFO] Epoch:[0/2](740600/4588595) loss:2.291 lr:0.0000100 epoch_Time:24486.0min: [2024-01-06 00:12:22,078][model8_pretrain.py][INFO] Epoch:[0/2](740600/4588595) loss:2.979 lr:0.0000100 epoch_Time:24486.0min: [2024-01-06 00:12:22,078][model8_pretrain.py][INFO] Epoch:[0/2](740600/4588595) loss:2.505 lr:0.0000100 epoch_Time:24486.0min: [2024-01-06 00:12:22,078][model8_pretrain.py][INFO] Epoch:[0/2](740600/4588595) loss:2.964 lr:0.0000100 epoch_Time:24486.0min: [2024-01-06 00:12:22,078][model8_pretrain.py][INFO] Epoch:[0/2](740600/4588595) loss:3.080 lr:0.0000100 epoch_Time:24486.0min: [2024-01-06 00:12:59,012][model8_pretrain.py][INFO] Epoch:[0/2](740700/4588595) loss:2.747 lr:0.0000100 epoch_Time:24484.0min: [2024-01-06 00:12:59,012][model8_pretrain.py][INFO] Epoch:[0/2](740700/4588595) loss:2.288 lr:0.0000100 epoch_Time:24484.0min: [2024-01-06 00:12:59,012][model8_pretrain.py][INFO] Epoch:[0/2](740700/4588595) loss:3.062 lr:0.0000100 epoch_Time:24484.0min: [2024-01-06 00:12:59,013][model8_pretrain.py][INFO] Epoch:[0/2](740700/4588595) loss:2.864 lr:0.0000100 epoch_Time:24484.0min: [2024-01-06 00:12:59,013][model8_pretrain.py][INFO] Epoch:[0/2](740700/4588595) loss:2.548 lr:0.0000100 epoch_Time:24484.0min: [2024-01-06 00:12:59,013][model8_pretrain.py][INFO] Epoch:[0/2](740700/4588595) loss:2.655 lr:0.0000100 
epoch_Time:24484.0min: [2024-01-06 00:12:59,012][model8_pretrain.py][INFO] Epoch:[0/2](740700/4588595) loss:2.808 lr:0.0000100 epoch_Time:24484.0min: [2024-01-06 00:12:59,012][model8_pretrain.py][INFO] Epoch:[0/2](740700/4588595) loss:2.833 lr:0.0000100 epoch_Time:24484.0min: [2024-01-06 00:13:35,958][model8_pretrain.py][INFO] Epoch:[0/2](740800/4588595) loss:2.556 lr:0.0000100 epoch_Time:24484.0min: [2024-01-06 00:13:35,958][model8_pretrain.py][INFO] Epoch:[0/2](740800/4588595) loss:2.580 lr:0.0000100 epoch_Time:24484.0min: [2024-01-06 00:13:35,958][model8_pretrain.py][INFO] Epoch:[0/2](740800/4588595) loss:3.208 lr:0.0000100 epoch_Time:24484.0min: [2024-01-06 00:13:35,958][model8_pretrain.py][INFO] Epoch:[0/2](740800/4588595) loss:3.378 lr:0.0000100 epoch_Time:24484.0min: [2024-01-06 00:13:35,958][model8_pretrain.py][INFO] Epoch:[0/2](740800/4588595) loss:2.834 lr:0.0000100 epoch_Time:24484.0min: [2024-01-06 00:13:35,958][model8_pretrain.py][INFO] Epoch:[0/2](740800/4588595) loss:2.965 lr:0.0000100 epoch_Time:24484.0min: [2024-01-06 00:13:35,958][model8_pretrain.py][INFO] Epoch:[0/2](740800/4588595) loss:2.899 lr:0.0000100 epoch_Time:24484.0min: [2024-01-06 00:13:35,958][model8_pretrain.py][INFO] Epoch:[0/2](740800/4588595) loss:2.779 lr:0.0000100 epoch_Time:24484.0min: [2024-01-06 00:14:24,937][model8_pretrain.py][INFO] Epoch:[0/2](740900/4588595) loss:3.303 lr:0.0000100 epoch_Time:24484.0min: [2024-01-06 00:14:24,937][model8_pretrain.py][INFO] Epoch:[0/2](740900/4588595) loss:3.225 lr:0.0000100 epoch_Time:24484.0min: [2024-01-06 00:14:24,937][model8_pretrain.py][INFO] Epoch:[0/2](740900/4588595) loss:3.320 lr:0.0000100 epoch_Time:24484.0min: [2024-01-06 00:14:24,937][model8_pretrain.py][INFO] Epoch:[0/2](740900/4588595) loss:2.468 lr:0.0000100 epoch_Time:24484.0min: [2024-01-06 00:14:24,937][model8_pretrain.py][INFO] Epoch:[0/2](740900/4588595) loss:2.504 lr:0.0000100 epoch_Time:24484.0min: [2024-01-06 00:14:24,937][model8_pretrain.py][INFO] 
Epoch:[0/2](740900/4588595) loss:3.074 lr:0.0000100 epoch_Time:24484.0min: [2024-01-06 00:14:24,937][model8_pretrain.py][INFO] Epoch:[0/2](740900/4588595) loss:2.660 lr:0.0000100 epoch_Time:24484.0min: [2024-01-06 00:14:24,937][model8_pretrain.py][INFO] Epoch:[0/2](740900/4588595) loss:3.096 lr:0.0000100 epoch_Time:24484.0min: [2024-01-06 00:15:01,875][model8_pretrain.py][INFO] Epoch:[0/2](741000/4588595) loss:3.167 lr:0.0000100 epoch_Time:24483.0min: [2024-01-06 00:15:01,875][model8_pretrain.py][INFO] Epoch:[0/2](741000/4588595) loss:3.125 lr:0.0000100 epoch_Time:24483.0min: [2024-01-06 00:15:01,875][model8_pretrain.py][INFO] Epoch:[0/2](741000/4588595) loss:3.196 lr:0.0000100 epoch_Time:24483.0min: [2024-01-06 00:15:01,875][model8_pretrain.py][INFO] Epoch:[0/2](741000/4588595) loss:2.528 lr:0.0000100 epoch_Time:24483.0min: [2024-01-06 00:15:01,875][model8_pretrain.py][INFO] Epoch:[0/2](741000/4588595) loss:2.664 lr:0.0000100 epoch_Time:24483.0min: [2024-01-06 00:15:01,875][model8_pretrain.py][INFO] Epoch:[0/2](741000/4588595) loss:2.761 lr:0.0000100 epoch_Time:24483.0min: [2024-01-06 00:15:01,875][model8_pretrain.py][INFO] Epoch:[0/2](741000/4588595) loss:2.743 lr:0.0000100 epoch_Time:24483.0min: [2024-01-06 00:15:01,875][model8_pretrain.py][INFO] Epoch:[0/2](741000/4588595) loss:2.547 lr:0.0000100 epoch_Time:24483.0min: [2024-01-06 00:15:38,821][model8_pretrain.py][INFO] Epoch:[0/2](741100/4588595) loss:3.212 lr:0.0000100 epoch_Time:24483.0min: [2024-01-06 00:15:38,822][model8_pretrain.py][INFO] Epoch:[0/2](741100/4588595) loss:3.121 lr:0.0000100 epoch_Time:24483.0min: [2024-01-06 00:15:38,822][model8_pretrain.py][INFO] Epoch:[0/2](741100/4588595) loss:2.961 lr:0.0000100 epoch_Time:24483.0min: [2024-01-06 00:15:38,822][model8_pretrain.py][INFO] Epoch:[0/2](741100/4588595) loss:2.764 lr:0.0000100 epoch_Time:24483.0min: [2024-01-06 00:15:38,822][model8_pretrain.py][INFO] Epoch:[0/2](741100/4588595) loss:3.046 lr:0.0000100 epoch_Time:24483.0min: [2024-01-06 
00:15:38,822][model8_pretrain.py][INFO] Epoch:[0/2](741100/4588595) loss:2.940 lr:0.0000100 epoch_Time:24483.0min: [2024-01-06 00:15:38,822][model8_pretrain.py][INFO] Epoch:[0/2](741100/4588595) loss:3.213 lr:0.0000100 epoch_Time:24483.0min: [2024-01-06 00:15:38,822][model8_pretrain.py][INFO] Epoch:[0/2](741100/4588595) loss:2.942 lr:0.0000100 epoch_Time:24483.0min: [2024-01-06 00:16:15,763][model8_pretrain.py][INFO] Epoch:[0/2](741200/4588595) loss:2.823 lr:0.0000100 epoch_Time:24482.0min: [2024-01-06 00:16:15,764][model8_pretrain.py][INFO] Epoch:[0/2](741200/4588595) loss:2.931 lr:0.0000100 epoch_Time:24482.0min: [2024-01-06 00:16:15,763][model8_pretrain.py][INFO] Epoch:[0/2](741200/4588595) loss:3.287 lr:0.0000100 epoch_Time:24482.0min: [2024-01-06 00:16:15,763][model8_pretrain.py][INFO] Epoch:[0/2](741200/4588595) loss:2.943 lr:0.0000100 epoch_Time:24482.0min: [2024-01-06 00:16:15,764][model8_pretrain.py][INFO] Epoch:[0/2](741200/4588595) loss:2.840 lr:0.0000100 epoch_Time:24482.0min: [2024-01-06 00:16:15,764][model8_pretrain.py][INFO] Epoch:[0/2](741200/4588595) loss:2.013 lr:0.0000100 epoch_Time:24482.0min: [2024-01-06 00:16:15,764][model8_pretrain.py][INFO] Epoch:[0/2](741200/4588595) loss:2.547 lr:0.0000100 epoch_Time:24482.0min: [2024-01-06 00:16:15,764][model8_pretrain.py][INFO] Epoch:[0/2](741200/4588595) loss:2.998 lr:0.0000100 epoch_Time:24482.0min: [2024-01-06 00:16:52,700][model8_pretrain.py][INFO] Epoch:[0/2](741300/4588595) loss:2.290 lr:0.0000100 epoch_Time:24481.0min: [2024-01-06 00:16:52,700][model8_pretrain.py][INFO] Epoch:[0/2](741300/4588595) loss:2.826 lr:0.0000100 epoch_Time:24481.0min: [2024-01-06 00:16:52,700][model8_pretrain.py][INFO] Epoch:[0/2](741300/4588595) loss:2.852 lr:0.0000100 epoch_Time:24481.0min: [2024-01-06 00:16:52,700][model8_pretrain.py][INFO] Epoch:[0/2](741300/4588595) loss:2.896 lr:0.0000100 epoch_Time:24481.0min: [2024-01-06 00:16:52,700][model8_pretrain.py][INFO] Epoch:[0/2](741300/4588595) loss:2.750 lr:0.0000100 
epoch_Time:24481.0min: [2024-01-06 00:16:52,700][model8_pretrain.py][INFO] Epoch:[0/2](741300/4588595) loss:2.330 lr:0.0000100 epoch_Time:24481.0min: [2024-01-06 00:16:52,700][model8_pretrain.py][INFO] Epoch:[0/2](741300/4588595) loss:3.151 lr:0.0000100 epoch_Time:24481.0min: [2024-01-06 00:16:52,700][model8_pretrain.py][INFO] Epoch:[0/2](741300/4588595) loss:3.129 lr:0.0000100 epoch_Time:24481.0min: [2024-01-06 00:17:29,611][model8_pretrain.py][INFO] Epoch:[0/2](741400/4588595) loss:2.800 lr:0.0000100 epoch_Time:24481.0min: [2024-01-06 00:17:29,611][model8_pretrain.py][INFO] Epoch:[0/2](741400/4588595) loss:3.180 lr:0.0000100 epoch_Time:24481.0min: [2024-01-06 00:17:29,611][model8_pretrain.py][INFO] Epoch:[0/2](741400/4588595) loss:2.292 lr:0.0000100 epoch_Time:24481.0min: [2024-01-06 00:17:29,611][model8_pretrain.py][INFO] Epoch:[0/2](741400/4588595) loss:2.751 lr:0.0000100 epoch_Time:24481.0min: [2024-01-06 00:17:29,611][model8_pretrain.py][INFO] Epoch:[0/2](741400/4588595) loss:2.930 lr:0.0000100 epoch_Time:24481.0min: [2024-01-06 00:17:29,611][model8_pretrain.py][INFO] Epoch:[0/2](741400/4588595) loss:2.755 lr:0.0000100 epoch_Time:24481.0min: [2024-01-06 00:17:29,611][model8_pretrain.py][INFO] Epoch:[0/2](741400/4588595) loss:2.743 lr:0.0000100 epoch_Time:24481.0min: [2024-01-06 00:17:29,611][model8_pretrain.py][INFO] Epoch:[0/2](741400/4588595) loss:2.444 lr:0.0000100 epoch_Time:24481.0min: [2024-01-06 00:18:06,544][model8_pretrain.py][INFO] Epoch:[0/2](741500/4588595) loss:2.762 lr:0.0000100 epoch_Time:24480.0min: [2024-01-06 00:18:06,544][model8_pretrain.py][INFO] Epoch:[0/2](741500/4588595) loss:2.660 lr:0.0000100 epoch_Time:24480.0min: [2024-01-06 00:18:06,545][model8_pretrain.py][INFO] Epoch:[0/2](741500/4588595) loss:3.177 lr:0.0000100 epoch_Time:24480.0min: [2024-01-06 00:18:06,545][model8_pretrain.py][INFO] Epoch:[0/2](741500/4588595) loss:3.489 lr:0.0000100 epoch_Time:24480.0min: [2024-01-06 00:18:06,544][model8_pretrain.py][INFO] 
Epoch:[0/2](741500/4588595) loss:2.748 lr:0.0000100 epoch_Time:24480.0min: [2024-01-06 00:18:06,545][model8_pretrain.py][INFO] Epoch:[0/2](741500/4588595) loss:3.129 lr:0.0000100 epoch_Time:24480.0min: [2024-01-06 00:18:06,545][model8_pretrain.py][INFO] Epoch:[0/2](741500/4588595) loss:3.213 lr:0.0000100 epoch_Time:24480.0min: [2024-01-06 00:18:06,545][model8_pretrain.py][INFO] Epoch:[0/2](741500/4588595) loss:3.157 lr:0.0000100 epoch_Time:24480.0min: [2024-01-06 00:18:43,475][model8_pretrain.py][INFO] Epoch:[0/2](741600/4588595) loss:2.811 lr:0.0000100 epoch_Time:24479.0min: [2024-01-06 00:18:43,475][model8_pretrain.py][INFO] Epoch:[0/2](741600/4588595) loss:2.787 lr:0.0000100 epoch_Time:24479.0min: [2024-01-06 00:18:43,475][model8_pretrain.py][INFO] Epoch:[0/2](741600/4588595) loss:2.443 lr:0.0000100 epoch_Time:24480.0min: [2024-01-06 00:18:43,475][model8_pretrain.py][INFO] Epoch:[0/2](741600/4588595) loss:2.567 lr:0.0000100 epoch_Time:24479.0min: [2024-01-06 00:18:43,475][model8_pretrain.py][INFO] Epoch:[0/2](741600/4588595) loss:2.942 lr:0.0000100 epoch_Time:24479.0min: [2024-01-06 00:18:43,475][model8_pretrain.py][INFO] Epoch:[0/2](741600/4588595) loss:2.880 lr:0.0000100 epoch_Time:24479.0min: [2024-01-06 00:18:43,475][model8_pretrain.py][INFO] Epoch:[0/2](741600/4588595) loss:2.492 lr:0.0000100 epoch_Time:24479.0min: [2024-01-06 00:18:43,476][model8_pretrain.py][INFO] Epoch:[0/2](741600/4588595) loss:2.978 lr:0.0000100 epoch_Time:24479.0min: [2024-01-06 00:19:32,576][model8_pretrain.py][INFO] Epoch:[0/2](741700/4588595) loss:3.082 lr:0.0000100 epoch_Time:24480.0min: [2024-01-06 00:19:32,576][model8_pretrain.py][INFO] Epoch:[0/2](741700/4588595) loss:2.693 lr:0.0000100 epoch_Time:24480.0min: [2024-01-06 00:19:32,576][model8_pretrain.py][INFO] Epoch:[0/2](741700/4588595) loss:3.316 lr:0.0000100 epoch_Time:24480.0min: [2024-01-06 00:19:32,576][model8_pretrain.py][INFO] Epoch:[0/2](741700/4588595) loss:2.841 lr:0.0000100 epoch_Time:24480.0min: [2024-01-06 
00:19:32,576][model8_pretrain.py][INFO] Epoch:[0/2](741700/4588595) loss:2.923 lr:0.0000100 epoch_Time:24480.0min: [2024-01-06 00:19:32,576][model8_pretrain.py][INFO] Epoch:[0/2](741700/4588595) loss:2.264 lr:0.0000100 epoch_Time:24480.0min: [2024-01-06 00:19:32,576][model8_pretrain.py][INFO] Epoch:[0/2](741700/4588595) loss:3.291 lr:0.0000100 epoch_Time:24480.0min: [2024-01-06 00:19:32,576][model8_pretrain.py][INFO] Epoch:[0/2](741700/4588595) loss:2.400 lr:0.0000100 epoch_Time:24480.0min: [2024-01-06 00:20:09,507][model8_pretrain.py][INFO] Epoch:[0/2](741800/4588595) loss:2.786 lr:0.0000100 epoch_Time:24478.0min: [2024-01-06 00:20:09,507][model8_pretrain.py][INFO] Epoch:[0/2](741800/4588595) loss:3.188 lr:0.0000100 epoch_Time:24478.0min: [2024-01-06 00:20:09,507][model8_pretrain.py][INFO] Epoch:[0/2](741800/4588595) loss:3.025 lr:0.0000100 epoch_Time:24478.0min: [2024-01-06 00:20:09,507][model8_pretrain.py][INFO] Epoch:[0/2](741800/4588595) loss:2.977 lr:0.0000100 epoch_Time:24478.0min: [2024-01-06 00:20:09,507][model8_pretrain.py][INFO] Epoch:[0/2](741800/4588595) loss:2.921 lr:0.0000100 epoch_Time:24478.0min: [2024-01-06 00:20:09,507][model8_pretrain.py][INFO] Epoch:[0/2](741800/4588595) loss:3.251 lr:0.0000100 epoch_Time:24478.0min: [2024-01-06 00:20:09,507][model8_pretrain.py][INFO] Epoch:[0/2](741800/4588595) loss:3.119 lr:0.0000100 epoch_Time:24478.0min: [2024-01-06 00:20:09,507][model8_pretrain.py][INFO] Epoch:[0/2](741800/4588595) loss:2.645 lr:0.0000100 epoch_Time:24478.0min: [2024-01-06 00:20:46,438][model8_pretrain.py][INFO] Epoch:[0/2](741900/4588595) loss:2.895 lr:0.0000100 epoch_Time:24478.0min: [2024-01-06 00:20:46,438][model8_pretrain.py][INFO] Epoch:[0/2](741900/4588595) loss:2.824 lr:0.0000100 epoch_Time:24478.0min: [2024-01-06 00:20:46,438][model8_pretrain.py][INFO] Epoch:[0/2](741900/4588595) loss:2.696 lr:0.0000100 epoch_Time:24478.0min: [2024-01-06 00:20:46,438][model8_pretrain.py][INFO] Epoch:[0/2](741900/4588595) loss:2.453 lr:0.0000100 
epoch_Time:24478.0min: [2024-01-06 00:20:46,438][model8_pretrain.py][INFO] Epoch:[0/2](741900/4588595) loss:2.992 lr:0.0000100 epoch_Time:24478.0min: [2024-01-06 00:20:46,439][model8_pretrain.py][INFO] Epoch:[0/2](741900/4588595) loss:2.580 lr:0.0000100 epoch_Time:24478.0min: [2024-01-06 00:20:46,439][model8_pretrain.py][INFO] Epoch:[0/2](741900/4588595) loss:2.891 lr:0.0000100 epoch_Time:24478.0min: [2024-01-06 00:20:46,439][model8_pretrain.py][INFO] Epoch:[0/2](741900/4588595) loss:2.848 lr:0.0000100 epoch_Time:24478.0min: [2024-01-06 00:21:23,397][model8_pretrain.py][INFO] Epoch:[0/2](742000/4588595) loss:2.568 lr:0.0000100 epoch_Time:24477.0min: [2024-01-06 00:21:23,397][model8_pretrain.py][INFO] Epoch:[0/2](742000/4588595) loss:2.684 lr:0.0000100 epoch_Time:24477.0min: [2024-01-06 00:21:23,397][model8_pretrain.py][INFO] Epoch:[0/2](742000/4588595) loss:2.466 lr:0.0000100 epoch_Time:24477.0min: [2024-01-06 00:21:23,397][model8_pretrain.py][INFO] Epoch:[0/2](742000/4588595) loss:3.028 lr:0.0000100 epoch_Time:24477.0min: [2024-01-06 00:21:23,397][model8_pretrain.py][INFO] Epoch:[0/2](742000/4588595) loss:2.975 lr:0.0000100 epoch_Time:24477.0min: [2024-01-06 00:21:23,397][model8_pretrain.py][INFO] Epoch:[0/2](742000/4588595) loss:3.118 lr:0.0000100 epoch_Time:24477.0min: [2024-01-06 00:21:23,398][model8_pretrain.py][INFO] Epoch:[0/2](742000/4588595) loss:2.791 lr:0.0000100 epoch_Time:24477.0min: [2024-01-06 00:21:23,398][model8_pretrain.py][INFO] Epoch:[0/2](742000/4588595) loss:2.977 lr:0.0000100 epoch_Time:24477.0min: [2024-01-06 00:22:00,325][model8_pretrain.py][INFO] Epoch:[0/2](742100/4588595) loss:2.322 lr:0.0000100 epoch_Time:24476.0min: [2024-01-06 00:22:00,325][model8_pretrain.py][INFO] Epoch:[0/2](742100/4588595) loss:2.766 lr:0.0000100 epoch_Time:24476.0min: [2024-01-06 00:22:00,325][model8_pretrain.py][INFO] Epoch:[0/2](742100/4588595) loss:2.434 lr:0.0000100 epoch_Time:24476.0min: [2024-01-06 00:22:00,325][model8_pretrain.py][INFO] 
Epoch:[0/2](742100/4588595) loss:2.686 lr:0.0000100 epoch_Time:24476.0min: [2024-01-06 00:22:00,325][model8_pretrain.py][INFO] Epoch:[0/2](742100/4588595) loss:3.096 lr:0.0000100 epoch_Time:24476.0min: [2024-01-06 00:22:00,326][model8_pretrain.py][INFO] Epoch:[0/2](742100/4588595) loss:3.109 lr:0.0000100 epoch_Time:24476.0min: [2024-01-06 00:22:00,326][model8_pretrain.py][INFO] Epoch:[0/2](742100/4588595) loss:2.321 lr:0.0000100 epoch_Time:24476.0min: [2024-01-06 00:22:00,326][model8_pretrain.py][INFO] Epoch:[0/2](742100/4588595) loss:2.577 lr:0.0000100 epoch_Time:24476.0min: [2024-01-06 00:22:37,248][model8_pretrain.py][INFO] Epoch:[0/2](742200/4588595) loss:2.667 lr:0.0000100 epoch_Time:24476.0min: [2024-01-06 00:22:37,248][model8_pretrain.py][INFO] Epoch:[0/2](742200/4588595) loss:2.976 lr:0.0000100 epoch_Time:24476.0min: [2024-01-06 00:22:37,248][model8_pretrain.py][INFO] Epoch:[0/2](742200/4588595) loss:3.069 lr:0.0000100 epoch_Time:24476.0min: [2024-01-06 00:22:37,248][model8_pretrain.py][INFO] Epoch:[0/2](742200/4588595) loss:2.863 lr:0.0000100 epoch_Time:24476.0min: [2024-01-06 00:22:37,248][model8_pretrain.py][INFO] Epoch:[0/2](742200/4588595) loss:2.718 lr:0.0000100 epoch_Time:24476.0min: [2024-01-06 00:22:37,248][model8_pretrain.py][INFO] Epoch:[0/2](742200/4588595) loss:2.814 lr:0.0000100 epoch_Time:24476.0min: [2024-01-06 00:22:37,248][model8_pretrain.py][INFO] Epoch:[0/2](742200/4588595) loss:2.426 lr:0.0000100 epoch_Time:24476.0min: [2024-01-06 00:22:37,248][model8_pretrain.py][INFO] Epoch:[0/2](742200/4588595) loss:3.296 lr:0.0000100 epoch_Time:24476.0min: [2024-01-06 00:23:14,177][model8_pretrain.py][INFO] Epoch:[0/2](742300/4588595) loss:2.928 lr:0.0000100 epoch_Time:24475.0min: [2024-01-06 00:23:14,177][model8_pretrain.py][INFO] Epoch:[0/2](742300/4588595) loss:2.562 lr:0.0000100 epoch_Time:24475.0min: [2024-01-06 00:23:14,177][model8_pretrain.py][INFO] Epoch:[0/2](742300/4588595) loss:3.003 lr:0.0000100 epoch_Time:24475.0min: [2024-01-06 
00:23:14,177][model8_pretrain.py][INFO] Epoch:[0/2](742300/4588595) loss:2.768 lr:0.0000100 epoch_Time:24475.0min: [2024-01-06 00:23:14,177][model8_pretrain.py][INFO] Epoch:[0/2](742300/4588595) loss:3.017 lr:0.0000100 epoch_Time:24475.0min: [2024-01-06 00:23:14,177][model8_pretrain.py][INFO] Epoch:[0/2](742300/4588595) loss:2.674 lr:0.0000100 epoch_Time:24475.0min: [2024-01-06 00:23:14,177][model8_pretrain.py][INFO] Epoch:[0/2](742300/4588595) loss:3.361 lr:0.0000100 epoch_Time:24475.0min: [2024-01-06 00:23:14,177][model8_pretrain.py][INFO] Epoch:[0/2](742300/4588595) loss:2.558 lr:0.0000100 epoch_Time:24475.0min: [2024-01-06 00:23:51,106][model8_pretrain.py][INFO] Epoch:[0/2](742400/4588595) loss:3.229 lr:0.0000100 epoch_Time:24474.0min: [2024-01-06 00:23:51,106][model8_pretrain.py][INFO] Epoch:[0/2](742400/4588595) loss:2.578 lr:0.0000100 epoch_Time:24474.0min: [2024-01-06 00:23:51,106][model8_pretrain.py][INFO] Epoch:[0/2](742400/4588595) loss:2.358 lr:0.0000100 epoch_Time:24474.0min: [2024-01-06 00:23:51,106][model8_pretrain.py][INFO] Epoch:[0/2](742400/4588595) loss:2.820 lr:0.0000100 epoch_Time:24474.0min: [2024-01-06 00:23:51,106][model8_pretrain.py][INFO] Epoch:[0/2](742400/4588595) loss:2.142 lr:0.0000100 epoch_Time:24474.0min: [2024-01-06 00:23:51,106][model8_pretrain.py][INFO] Epoch:[0/2](742400/4588595) loss:3.011 lr:0.0000100 epoch_Time:24474.0min: [2024-01-06 00:23:51,106][model8_pretrain.py][INFO] Epoch:[0/2](742400/4588595) loss:3.051 lr:0.0000100 epoch_Time:24474.0min: [2024-01-06 00:23:51,106][model8_pretrain.py][INFO] Epoch:[0/2](742400/4588595) loss:2.910 lr:0.0000100 epoch_Time:24474.0min: [2024-01-06 00:24:40,211][model8_pretrain.py][INFO] Epoch:[0/2](742500/4588595) loss:2.850 lr:0.0000100 epoch_Time:24475.0min: [2024-01-06 00:24:40,211][model8_pretrain.py][INFO] Epoch:[0/2](742500/4588595) loss:2.738 lr:0.0000100 epoch_Time:24475.0min: [2024-01-06 00:24:40,212][model8_pretrain.py][INFO] Epoch:[0/2](742500/4588595) loss:3.070 lr:0.0000100 
epoch_Time:24475.0min: [2024-01-06 00:24:40,212][model8_pretrain.py][INFO] Epoch:[0/2](742500/4588595) loss:3.162 lr:0.0000100 epoch_Time:24475.0min: [2024-01-06 00:24:40,212][model8_pretrain.py][INFO] Epoch:[0/2](742500/4588595) loss:2.687 lr:0.0000100 epoch_Time:24475.0min: [2024-01-06 00:24:40,212][model8_pretrain.py][INFO] Epoch:[0/2](742500/4588595) loss:3.069 lr:0.0000100 epoch_Time:24475.0min: [2024-01-06 00:24:40,212][model8_pretrain.py][INFO] Epoch:[0/2](742500/4588595) loss:2.705 lr:0.0000100 epoch_Time:24475.0min: [2024-01-06 00:24:40,212][model8_pretrain.py][INFO] Epoch:[0/2](742500/4588595) loss:2.905 lr:0.0000100 epoch_Time:24475.0min: [2024-01-06 00:25:17,138][model8_pretrain.py][INFO] Epoch:[0/2](742600/4588595) loss:2.895 lr:0.0000100 epoch_Time:24474.0min: [2024-01-06 00:25:17,138][model8_pretrain.py][INFO] Epoch:[0/2](742600/4588595) loss:3.288 lr:0.0000100 epoch_Time:24474.0min: [2024-01-06 00:25:17,138][model8_pretrain.py][INFO] Epoch:[0/2](742600/4588595) loss:2.885 lr:0.0000100 epoch_Time:24474.0min: [2024-01-06 00:25:17,138][model8_pretrain.py][INFO] Epoch:[0/2](742600/4588595) loss:3.339 lr:0.0000100 epoch_Time:24474.0min: [2024-01-06 00:25:17,138][model8_pretrain.py][INFO] Epoch:[0/2](742600/4588595) loss:3.062 lr:0.0000100 epoch_Time:24474.0min: [2024-01-06 00:25:17,138][model8_pretrain.py][INFO] Epoch:[0/2](742600/4588595) loss:2.475 lr:0.0000100 epoch_Time:24474.0min: [2024-01-06 00:25:17,138][model8_pretrain.py][INFO] Epoch:[0/2](742600/4588595) loss:2.770 lr:0.0000100 epoch_Time:24474.0min: [2024-01-06 00:25:17,138][model8_pretrain.py][INFO] Epoch:[0/2](742600/4588595) loss:2.865 lr:0.0000100 epoch_Time:24474.0min: [2024-01-06 00:25:54,070][model8_pretrain.py][INFO] Epoch:[0/2](742700/4588595) loss:2.526 lr:0.0000100 epoch_Time:24473.0min: [2024-01-06 00:25:54,070][model8_pretrain.py][INFO] Epoch:[0/2](742700/4588595) loss:3.088 lr:0.0000100 epoch_Time:24473.0min: [2024-01-06 00:25:54,070][model8_pretrain.py][INFO] 
Epoch:[0/2](742700/4588595) loss:2.395 lr:0.0000100 epoch_Time:24473.0min: [2024-01-06 00:25:54,070][model8_pretrain.py][INFO] Epoch:[0/2](742700/4588595) loss:3.135 lr:0.0000100 epoch_Time:24473.0min: [2024-01-06 00:25:54,070][model8_pretrain.py][INFO] Epoch:[0/2](742700/4588595) loss:3.061 lr:0.0000100 epoch_Time:24473.0min: [2024-01-06 00:25:54,070][model8_pretrain.py][INFO] Epoch:[0/2](742700/4588595) loss:2.849 lr:0.0000100 epoch_Time:24473.0min: [2024-01-06 00:25:54,070][model8_pretrain.py][INFO] Epoch:[0/2](742700/4588595) loss:2.633 lr:0.0000100 epoch_Time:24473.0min: [2024-01-06 00:25:54,070][model8_pretrain.py][INFO] Epoch:[0/2](742700/4588595) loss:3.127 lr:0.0000100 epoch_Time:24473.0min: [2024-01-06 00:26:31,005][model8_pretrain.py][INFO] Epoch:[0/2](742800/4588595) loss:2.831 lr:0.0000100 epoch_Time:24472.0min: [2024-01-06 00:26:31,005][model8_pretrain.py][INFO] Epoch:[0/2](742800/4588595) loss:2.124 lr:0.0000100 epoch_Time:24472.0min: [2024-01-06 00:26:31,005][model8_pretrain.py][INFO] Epoch:[0/2](742800/4588595) loss:2.897 lr:0.0000100 epoch_Time:24472.0min: [2024-01-06 00:26:31,005][model8_pretrain.py][INFO] Epoch:[0/2](742800/4588595) loss:2.756 lr:0.0000100 epoch_Time:24472.0min: [2024-01-06 00:26:31,005][model8_pretrain.py][INFO] Epoch:[0/2](742800/4588595) loss:3.057 lr:0.0000100 epoch_Time:24472.0min: [2024-01-06 00:26:31,005][model8_pretrain.py][INFO] Epoch:[0/2](742800/4588595) loss:3.384 lr:0.0000100 epoch_Time:24472.0min: [2024-01-06 00:26:31,005][model8_pretrain.py][INFO] Epoch:[0/2](742800/4588595) loss:2.694 lr:0.0000100 epoch_Time:24472.0min: [2024-01-06 00:26:31,006][model8_pretrain.py][INFO] Epoch:[0/2](742800/4588595) loss:2.847 lr:0.0000100 epoch_Time:24472.0min: [2024-01-06 00:27:07,948][model8_pretrain.py][INFO] Epoch:[0/2](742900/4588595) loss:2.405 lr:0.0000100 epoch_Time:24471.0min: [2024-01-06 00:27:07,948][model8_pretrain.py][INFO] Epoch:[0/2](742900/4588595) loss:2.743 lr:0.0000100 epoch_Time:24471.0min: [2024-01-06 
00:27:07,948][model8_pretrain.py][INFO] Epoch:[0/2](742900/4588595) loss:2.853 lr:0.0000100 epoch_Time:24471.0min: [2024-01-06 00:27:07,948][model8_pretrain.py][INFO] Epoch:[0/2](742900/4588595) loss:2.933 lr:0.0000100 epoch_Time:24471.0min: [2024-01-06 00:27:07,948][model8_pretrain.py][INFO] Epoch:[0/2](742900/4588595) loss:2.717 lr:0.0000100 epoch_Time:24471.0min: [2024-01-06 00:27:07,948][model8_pretrain.py][INFO] Epoch:[0/2](742900/4588595) loss:2.059 lr:0.0000100 epoch_Time:24471.0min: [2024-01-06 00:27:07,948][model8_pretrain.py][INFO] Epoch:[0/2](742900/4588595) loss:2.682 lr:0.0000100 epoch_Time:24471.0min: [2024-01-06 00:27:07,948][model8_pretrain.py][INFO] Epoch:[0/2](742900/4588595) loss:2.904 lr:0.0000100 epoch_Time:24471.0min: [2024-01-06 00:27:44,879][model8_pretrain.py][INFO] Epoch:[0/2](743000/4588595) loss:3.143 lr:0.0000100 epoch_Time:24471.0min: [2024-01-06 00:27:44,879][model8_pretrain.py][INFO] Epoch:[0/2](743000/4588595) loss:2.801 lr:0.0000100 epoch_Time:24471.0min: [2024-01-06 00:27:44,879][model8_pretrain.py][INFO] Epoch:[0/2](743000/4588595) loss:2.905 lr:0.0000100 epoch_Time:24471.0min: [2024-01-06 00:27:44,879][model8_pretrain.py][INFO] Epoch:[0/2](743000/4588595) loss:2.597 lr:0.0000100 epoch_Time:24471.0min: [2024-01-06 00:27:44,879][model8_pretrain.py][INFO] Epoch:[0/2](743000/4588595) loss:2.824 lr:0.0000100 epoch_Time:24471.0min: [2024-01-06 00:27:44,879][model8_pretrain.py][INFO] Epoch:[0/2](743000/4588595) loss:3.099 lr:0.0000100 epoch_Time:24471.0min: [2024-01-06 00:27:44,880][model8_pretrain.py][INFO] Epoch:[0/2](743000/4588595) loss:3.021 lr:0.0000100 epoch_Time:24471.0min: [2024-01-06 00:27:44,879][model8_pretrain.py][INFO] Epoch:[0/2](743000/4588595) loss:2.637 lr:0.0000100 epoch_Time:24471.0min: [2024-01-06 00:28:21,813][model8_pretrain.py][INFO] Epoch:[0/2](743100/4588595) loss:2.878 lr:0.0000100 epoch_Time:24470.0min: [2024-01-06 00:28:21,813][model8_pretrain.py][INFO] Epoch:[0/2](743100/4588595) loss:3.073 lr:0.0000100 
epoch_Time:24470.0min: [2024-01-06 00:28:21,813][model8_pretrain.py][INFO] Epoch:[0/2](743100/4588595) loss:2.353 lr:0.0000100 epoch_Time:24470.0min: [2024-01-06 00:28:21,813][model8_pretrain.py][INFO] Epoch:[0/2](743100/4588595) loss:3.169 lr:0.0000100 epoch_Time:24470.0min: [2024-01-06 00:28:21,813][model8_pretrain.py][INFO] Epoch:[0/2](743100/4588595) loss:2.734 lr:0.0000100 epoch_Time:24470.0min: [2024-01-06 00:28:21,813][model8_pretrain.py][INFO] Epoch:[0/2](743100/4588595) loss:3.165 lr:0.0000100 epoch_Time:24470.0min: [2024-01-06 00:28:21,813][model8_pretrain.py][INFO] Epoch:[0/2](743100/4588595) loss:3.158 lr:0.0000100 epoch_Time:24470.0min: [2024-01-06 00:28:21,814][model8_pretrain.py][INFO] Epoch:[0/2](743100/4588595) loss:2.885 lr:0.0000100 epoch_Time:24470.0min: [2024-01-06 00:28:58,744][model8_pretrain.py][INFO] Epoch:[0/2](743200/4588595) loss:2.443 lr:0.0000100 epoch_Time:24469.0min: [2024-01-06 00:28:58,744][model8_pretrain.py][INFO] Epoch:[0/2](743200/4588595) loss:3.041 lr:0.0000100 epoch_Time:24469.0min: [2024-01-06 00:28:58,744][model8_pretrain.py][INFO] Epoch:[0/2](743200/4588595) loss:2.698 lr:0.0000100 epoch_Time:24469.0min: [2024-01-06 00:28:58,744][model8_pretrain.py][INFO] Epoch:[0/2](743200/4588595) loss:3.088 lr:0.0000100 epoch_Time:24469.0min: [2024-01-06 00:28:58,744][model8_pretrain.py][INFO] Epoch:[0/2](743200/4588595) loss:2.492 lr:0.0000100 epoch_Time:24469.0min: [2024-01-06 00:28:58,744][model8_pretrain.py][INFO] Epoch:[0/2](743200/4588595) loss:2.413 lr:0.0000100 epoch_Time:24469.0min: [2024-01-06 00:28:58,744][model8_pretrain.py][INFO] Epoch:[0/2](743200/4588595) loss:2.905 lr:0.0000100 epoch_Time:24469.0min: [2024-01-06 00:28:58,744][model8_pretrain.py][INFO] Epoch:[0/2](743200/4588595) loss:3.036 lr:0.0000100 epoch_Time:24469.0min: [2024-01-06 00:29:47,504][model8_pretrain.py][INFO] Epoch:[0/2](743300/4588595) loss:2.440 lr:0.0000100 epoch_Time:24470.0min: [2024-01-06 00:29:47,504][model8_pretrain.py][INFO] 
Epoch:[0/2](743300/4588595) loss:2.944 lr:0.0000100 epoch_Time:24470.0min: [2024-01-06 00:29:47,504][model8_pretrain.py][INFO] Epoch:[0/2](743300/4588595) loss:3.263 lr:0.0000100 epoch_Time:24470.0min: [2024-01-06 00:29:47,504][model8_pretrain.py][INFO] Epoch:[0/2](743300/4588595) loss:2.932 lr:0.0000100 epoch_Time:24470.0min: [2024-01-06 00:29:47,504][model8_pretrain.py][INFO] Epoch:[0/2](743300/4588595) loss:2.964 lr:0.0000100 epoch_Time:24470.0min: [2024-01-06 00:29:47,504][model8_pretrain.py][INFO] Epoch:[0/2](743300/4588595) loss:2.849 lr:0.0000100 epoch_Time:24470.0min: [2024-01-06 00:29:47,504][model8_pretrain.py][INFO] Epoch:[0/2](743300/4588595) loss:2.886 lr:0.0000100 epoch_Time:24470.0min: [2024-01-06 00:29:47,504][model8_pretrain.py][INFO] Epoch:[0/2](743300/4588595) loss:3.248 lr:0.0000100 epoch_Time:24470.0min: [2024-01-06 00:30:24,430][model8_pretrain.py][INFO] Epoch:[0/2](743400/4588595) loss:2.816 lr:0.0000100 epoch_Time:24469.0min: [2024-01-06 00:30:24,430][model8_pretrain.py][INFO] Epoch:[0/2](743400/4588595) loss:2.357 lr:0.0000100 epoch_Time:24469.0min: [2024-01-06 00:30:24,430][model8_pretrain.py][INFO] Epoch:[0/2](743400/4588595) loss:2.434 lr:0.0000100 epoch_Time:24469.0min: [2024-01-06 00:30:24,430][model8_pretrain.py][INFO] Epoch:[0/2](743400/4588595) loss:3.292 lr:0.0000100 epoch_Time:24469.0min: [2024-01-06 00:30:24,430][model8_pretrain.py][INFO] Epoch:[0/2](743400/4588595) loss:3.004 lr:0.0000100 epoch_Time:24469.0min: [2024-01-06 00:30:24,430][model8_pretrain.py][INFO] Epoch:[0/2](743400/4588595) loss:2.913 lr:0.0000100 epoch_Time:24469.0min: [2024-01-06 00:30:24,430][model8_pretrain.py][INFO] Epoch:[0/2](743400/4588595) loss:2.459 lr:0.0000100 epoch_Time:24469.0min: [2024-01-06 00:30:24,430][model8_pretrain.py][INFO] Epoch:[0/2](743400/4588595) loss:2.681 lr:0.0000100 epoch_Time:24469.0min: [2024-01-06 00:31:01,367][model8_pretrain.py][INFO] Epoch:[0/2](743500/4588595) loss:2.390 lr:0.0000100 epoch_Time:24468.0min: [2024-01-06 
00:31:01,367][model8_pretrain.py][INFO] Epoch:[0/2](743500/4588595) loss:3.355 lr:0.0000100 epoch_Time:24468.0min: [2024-01-06 00:31:01,367][model8_pretrain.py][INFO] Epoch:[0/2](743500/4588595) loss:3.183 lr:0.0000100 epoch_Time:24468.0min: [2024-01-06 00:31:01,367][model8_pretrain.py][INFO] Epoch:[0/2](743500/4588595) loss:3.200 lr:0.0000100 epoch_Time:24468.0min: [2024-01-06 00:31:01,367][model8_pretrain.py][INFO] Epoch:[0/2](743500/4588595) loss:2.621 lr:0.0000100 epoch_Time:24468.0min: [2024-01-06 00:31:01,367][model8_pretrain.py][INFO] Epoch:[0/2](743500/4588595) loss:2.999 lr:0.0000100 epoch_Time:24468.0min: [2024-01-06 00:31:01,367][model8_pretrain.py][INFO] Epoch:[0/2](743500/4588595) loss:3.054 lr:0.0000100 epoch_Time:24468.0min: [2024-01-06 00:31:01,367][model8_pretrain.py][INFO] Epoch:[0/2](743500/4588595) loss:2.329 lr:0.0000100 epoch_Time:24468.0min: [2024-01-06 00:31:38,309][model8_pretrain.py][INFO] Epoch:[0/2](743600/4588595) loss:2.792 lr:0.0000100 epoch_Time:24468.0min: [2024-01-06 00:31:38,309][model8_pretrain.py][INFO] Epoch:[0/2](743600/4588595) loss:2.878 lr:0.0000100 epoch_Time:24468.0min: [2024-01-06 00:31:38,310][model8_pretrain.py][INFO] Epoch:[0/2](743600/4588595) loss:2.939 lr:0.0000100 epoch_Time:24468.0min: [2024-01-06 00:31:38,310][model8_pretrain.py][INFO] Epoch:[0/2](743600/4588595) loss:2.739 lr:0.0000100 epoch_Time:24468.0min: [2024-01-06 00:31:38,310][model8_pretrain.py][INFO] Epoch:[0/2](743600/4588595) loss:3.256 lr:0.0000100 epoch_Time:24468.0min: [2024-01-06 00:31:38,309][model8_pretrain.py][INFO] Epoch:[0/2](743600/4588595) loss:3.325 lr:0.0000100 epoch_Time:24468.0min: [2024-01-06 00:31:38,310][model8_pretrain.py][INFO] Epoch:[0/2](743600/4588595) loss:2.581 lr:0.0000100 epoch_Time:24468.0min: [2024-01-06 00:31:38,310][model8_pretrain.py][INFO] Epoch:[0/2](743600/4588595) loss:2.821 lr:0.0000100 epoch_Time:24468.0min: [2024-01-06 00:32:15,248][model8_pretrain.py][INFO] Epoch:[0/2](743700/4588595) loss:3.064 lr:0.0000100 
epoch_Time:24467.0min: [2024-01-06 00:32:15,248][model8_pretrain.py][INFO] Epoch:[0/2](743700/4588595) loss:2.568 lr:0.0000100 epoch_Time:24467.0min: [2024-01-06 00:32:15,248][model8_pretrain.py][INFO] Epoch:[0/2](743700/4588595) loss:2.683 lr:0.0000100 epoch_Time:24467.0min: [2024-01-06 00:32:15,248][model8_pretrain.py][INFO] Epoch:[0/2](743700/4588595) loss:2.684 lr:0.0000100 epoch_Time:24467.0min: [2024-01-06 00:32:15,248][model8_pretrain.py][INFO] Epoch:[0/2](743700/4588595) loss:2.482 lr:0.0000100 epoch_Time:24467.0min: [2024-01-06 00:32:15,248][model8_pretrain.py][INFO] Epoch:[0/2](743700/4588595) loss:3.096 lr:0.0000100 epoch_Time:24467.0min: [2024-01-06 00:32:15,248][model8_pretrain.py][INFO] Epoch:[0/2](743700/4588595) loss:2.636 lr:0.0000100 epoch_Time:24467.0min: [2024-01-06 00:32:15,248][model8_pretrain.py][INFO] Epoch:[0/2](743700/4588595) loss:3.012 lr:0.0000100 epoch_Time:24467.0min: [2024-01-06 00:32:52,191][model8_pretrain.py][INFO] Epoch:[0/2](743800/4588595) loss:2.880 lr:0.0000100 epoch_Time:24465.0min: [2024-01-06 00:32:52,191][model8_pretrain.py][INFO] Epoch:[0/2](743800/4588595) loss:2.892 lr:0.0000100 epoch_Time:24465.0min: [2024-01-06 00:32:52,191][model8_pretrain.py][INFO] Epoch:[0/2](743800/4588595) loss:3.365 lr:0.0000100 epoch_Time:24465.0min: [2024-01-06 00:32:52,191][model8_pretrain.py][INFO] Epoch:[0/2](743800/4588595) loss:3.088 lr:0.0000100 epoch_Time:24465.0min: [2024-01-06 00:32:52,191][model8_pretrain.py][INFO] Epoch:[0/2](743800/4588595) loss:3.064 lr:0.0000100 epoch_Time:24465.0min: [2024-01-06 00:32:52,191][model8_pretrain.py][INFO] Epoch:[0/2](743800/4588595) loss:3.172 lr:0.0000100 epoch_Time:24465.0min: [2024-01-06 00:32:52,191][model8_pretrain.py][INFO] Epoch:[0/2](743800/4588595) loss:3.087 lr:0.0000100 epoch_Time:24465.0min: [2024-01-06 00:32:52,191][model8_pretrain.py][INFO] Epoch:[0/2](743800/4588595) loss:2.966 lr:0.0000100 epoch_Time:24465.0min: [2024-01-06 00:33:29,130][model8_pretrain.py][INFO] 
Epoch:[0/2](743900/4588595) loss:2.755 lr:0.0000100 epoch_Time:24465.0min: [2024-01-06 00:33:29,130][model8_pretrain.py][INFO] Epoch:[0/2](743900/4588595) loss:2.435 lr:0.0000100 epoch_Time:24465.0min: [2024-01-06 00:33:29,130][model8_pretrain.py][INFO] Epoch:[0/2](743900/4588595) loss:2.777 lr:0.0000100 epoch_Time:24465.0min: [2024-01-06 00:33:29,130][model8_pretrain.py][INFO] Epoch:[0/2](743900/4588595) loss:2.791 lr:0.0000100 epoch_Time:24465.0min: [2024-01-06 00:33:29,130][model8_pretrain.py][INFO] Epoch:[0/2](743900/4588595) loss:2.371 lr:0.0000100 epoch_Time:24465.0min: [2024-01-06 00:33:29,130][model8_pretrain.py][INFO] Epoch:[0/2](743900/4588595) loss:2.731 lr:0.0000100 epoch_Time:24465.0min: [2024-01-06 00:33:29,130][model8_pretrain.py][INFO] Epoch:[0/2](743900/4588595) loss:2.238 lr:0.0000100 epoch_Time:24465.0min: [2024-01-06 00:33:29,130][model8_pretrain.py][INFO] Epoch:[0/2](743900/4588595) loss:2.960 lr:0.0000100 epoch_Time:24465.0min: [2024-01-06 00:34:06,062][model8_pretrain.py][INFO] Epoch:[0/2](744000/4588595) loss:2.552 lr:0.0000100 epoch_Time:24464.0min: [2024-01-06 00:34:06,062][model8_pretrain.py][INFO] Epoch:[0/2](744000/4588595) loss:2.965 lr:0.0000100 epoch_Time:24464.0min: [2024-01-06 00:34:06,062][model8_pretrain.py][INFO] Epoch:[0/2](744000/4588595) loss:2.964 lr:0.0000100 epoch_Time:24464.0min: [2024-01-06 00:34:06,062][model8_pretrain.py][INFO] Epoch:[0/2](744000/4588595) loss:3.444 lr:0.0000100 epoch_Time:24464.0min: [2024-01-06 00:34:06,062][model8_pretrain.py][INFO] Epoch:[0/2](744000/4588595) loss:2.980 lr:0.0000100 epoch_Time:24464.0min: [2024-01-06 00:34:06,062][model8_pretrain.py][INFO] Epoch:[0/2](744000/4588595) loss:3.211 lr:0.0000100 epoch_Time:24464.0min: [2024-01-06 00:34:06,062][model8_pretrain.py][INFO] Epoch:[0/2](744000/4588595) loss:3.230 lr:0.0000100 epoch_Time:24464.0min: [2024-01-06 00:34:06,062][model8_pretrain.py][INFO] Epoch:[0/2](744000/4588595) loss:2.909 lr:0.0000100 epoch_Time:24464.0min: [2024-01-06 
00:34:54,741][model8_pretrain.py][INFO] Epoch:[0/2](744100/4588595) loss:2.722 lr:0.0000100 epoch_Time:24464.0min: [2024-01-06 00:34:54,741][model8_pretrain.py][INFO] Epoch:[0/2](744100/4588595) loss:2.220 lr:0.0000100 epoch_Time:24464.0min: [2024-01-06 00:34:54,742][model8_pretrain.py][INFO] Epoch:[0/2](744100/4588595) loss:2.401 lr:0.0000100 epoch_Time:24464.0min: [2024-01-06 00:34:54,742][model8_pretrain.py][INFO] Epoch:[0/2](744100/4588595) loss:2.990 lr:0.0000100 epoch_Time:24464.0min: [2024-01-06 00:34:54,742][model8_pretrain.py][INFO] Epoch:[0/2](744100/4588595) loss:2.207 lr:0.0000100 epoch_Time:24464.0min: [2024-01-06 00:34:54,742][model8_pretrain.py][INFO] Epoch:[0/2](744100/4588595) loss:2.443 lr:0.0000100 epoch_Time:24464.0min: [2024-01-06 00:34:54,742][model8_pretrain.py][INFO] Epoch:[0/2](744100/4588595) loss:2.464 lr:0.0000100 epoch_Time:24464.0min: [2024-01-06 00:34:54,742][model8_pretrain.py][INFO] Epoch:[0/2](744100/4588595) loss:2.574 lr:0.0000100 epoch_Time:24464.0min: [2024-01-06 00:35:31,713][model8_pretrain.py][INFO] Epoch:[0/2](744200/4588595) loss:2.855 lr:0.0000100 epoch_Time:24464.0min: [2024-01-06 00:35:31,713][model8_pretrain.py][INFO] Epoch:[0/2](744200/4588595) loss:2.736 lr:0.0000100 epoch_Time:24464.0min: [2024-01-06 00:35:31,713][model8_pretrain.py][INFO] Epoch:[0/2](744200/4588595) loss:2.408 lr:0.0000100 epoch_Time:24464.0min: [2024-01-06 00:35:31,713][model8_pretrain.py][INFO] Epoch:[0/2](744200/4588595) loss:2.193 lr:0.0000100 epoch_Time:24464.0min: [2024-01-06 00:35:31,713][model8_pretrain.py][INFO] Epoch:[0/2](744200/4588595) loss:3.039 lr:0.0000100 epoch_Time:24464.0min: [2024-01-06 00:35:31,713][model8_pretrain.py][INFO] Epoch:[0/2](744200/4588595) loss:2.624 lr:0.0000100 epoch_Time:24464.0min: [2024-01-06 00:35:31,713][model8_pretrain.py][INFO] Epoch:[0/2](744200/4588595) loss:2.645 lr:0.0000100 epoch_Time:24464.0min: [2024-01-06 00:35:31,713][model8_pretrain.py][INFO] Epoch:[0/2](744200/4588595) loss:2.523 lr:0.0000100 
epoch_Time:24464.0min: [2024-01-06 00:36:08,656][model8_pretrain.py][INFO] Epoch:[0/2](744300/4588595) loss:2.601 lr:0.0000100 epoch_Time:24463.0min: [2024-01-06 00:36:08,656][model8_pretrain.py][INFO] Epoch:[0/2](744300/4588595) loss:2.609 lr:0.0000100 epoch_Time:24463.0min: [2024-01-06 00:36:08,656][model8_pretrain.py][INFO] Epoch:[0/2](744300/4588595) loss:2.993 lr:0.0000100 epoch_Time:24463.0min: [2024-01-06 00:36:08,656][model8_pretrain.py][INFO] Epoch:[0/2](744300/4588595) loss:2.718 lr:0.0000100 epoch_Time:24463.0min: [2024-01-06 00:36:08,656][model8_pretrain.py][INFO] Epoch:[0/2](744300/4588595) loss:2.076 lr:0.0000100 epoch_Time:24463.0min: [2024-01-06 00:36:08,656][model8_pretrain.py][INFO] Epoch:[0/2](744300/4588595) loss:2.400 lr:0.0000100 epoch_Time:24463.0min: [2024-01-06 00:36:08,657][model8_pretrain.py][INFO] Epoch:[0/2](744300/4588595) loss:2.816 lr:0.0000100 epoch_Time:24463.0min: [2024-01-06 00:36:08,657][model8_pretrain.py][INFO] Epoch:[0/2](744300/4588595) loss:3.041 lr:0.0000100 epoch_Time:24463.0min: [2024-01-06 00:36:45,598][model8_pretrain.py][INFO] Epoch:[0/2](744400/4588595) loss:2.496 lr:0.0000100 epoch_Time:24463.0min: [2024-01-06 00:36:45,598][model8_pretrain.py][INFO] Epoch:[0/2](744400/4588595) loss:3.036 lr:0.0000100 epoch_Time:24463.0min: [2024-01-06 00:36:45,598][model8_pretrain.py][INFO] Epoch:[0/2](744400/4588595) loss:3.020 lr:0.0000100 epoch_Time:24463.0min: [2024-01-06 00:36:45,598][model8_pretrain.py][INFO] Epoch:[0/2](744400/4588595) loss:3.083 lr:0.0000100 epoch_Time:24463.0min: [2024-01-06 00:36:45,598][model8_pretrain.py][INFO] Epoch:[0/2](744400/4588595) loss:2.520 lr:0.0000100 epoch_Time:24463.0min: [2024-01-06 00:36:45,598][model8_pretrain.py][INFO] Epoch:[0/2](744400/4588595) loss:3.404 lr:0.0000100 epoch_Time:24463.0min: [2024-01-06 00:36:45,598][model8_pretrain.py][INFO] Epoch:[0/2](744400/4588595) loss:2.975 lr:0.0000100 epoch_Time:24463.0min: [2024-01-06 00:36:45,598][model8_pretrain.py][INFO] 
Epoch:[0/2](744400/4588595) loss:2.149 lr:0.0000100 epoch_Time:24463.0min: [2024-01-06 00:37:22,542][model8_pretrain.py][INFO] Epoch:[0/2](744500/4588595) loss:2.692 lr:0.0000100 epoch_Time:24462.0min: [2024-01-06 00:37:22,542][model8_pretrain.py][INFO] Epoch:[0/2](744500/4588595) loss:2.922 lr:0.0000100 epoch_Time:24462.0min: [2024-01-06 00:37:22,542][model8_pretrain.py][INFO] Epoch:[0/2](744500/4588595) loss:2.667 lr:0.0000100 epoch_Time:24462.0min: [2024-01-06 00:37:22,542][model8_pretrain.py][INFO] Epoch:[0/2](744500/4588595) loss:3.408 lr:0.0000100 epoch_Time:24462.0min: [2024-01-06 00:37:22,542][model8_pretrain.py][INFO] Epoch:[0/2](744500/4588595) loss:3.093 lr:0.0000100 epoch_Time:24462.0min: [2024-01-06 00:37:22,542][model8_pretrain.py][INFO] Epoch:[0/2](744500/4588595) loss:2.972 lr:0.0000100 epoch_Time:24462.0min: [2024-01-06 00:37:22,542][model8_pretrain.py][INFO] Epoch:[0/2](744500/4588595) loss:2.487 lr:0.0000100 epoch_Time:24462.0min: [2024-01-06 00:37:22,542][model8_pretrain.py][INFO] Epoch:[0/2](744500/4588595) loss:2.900 lr:0.0000100 epoch_Time:24462.0min: [2024-01-06 00:37:59,482][model8_pretrain.py][INFO] Epoch:[0/2](744600/4588595) loss:2.813 lr:0.0000100 epoch_Time:24461.0min: [2024-01-06 00:37:59,482][model8_pretrain.py][INFO] Epoch:[0/2](744600/4588595) loss:2.874 lr:0.0000100 epoch_Time:24461.0min: [2024-01-06 00:37:59,482][model8_pretrain.py][INFO] Epoch:[0/2](744600/4588595) loss:3.137 lr:0.0000100 epoch_Time:24461.0min: [2024-01-06 00:37:59,482][model8_pretrain.py][INFO] Epoch:[0/2](744600/4588595) loss:2.124 lr:0.0000100 epoch_Time:24461.0min: [2024-01-06 00:37:59,482][model8_pretrain.py][INFO] Epoch:[0/2](744600/4588595) loss:2.776 lr:0.0000100 epoch_Time:24461.0min: [2024-01-06 00:37:59,482][model8_pretrain.py][INFO] Epoch:[0/2](744600/4588595) loss:2.752 lr:0.0000100 epoch_Time:24461.0min: [2024-01-06 00:37:59,482][model8_pretrain.py][INFO] Epoch:[0/2](744600/4588595) loss:3.553 lr:0.0000100 epoch_Time:24461.0min: [2024-01-06 
00:37:59,483][model8_pretrain.py][INFO] Epoch:[0/2](744600/4588595) loss:2.651 lr:0.0000100 epoch_Time:24461.0min: [2024-01-06 00:38:36,423][model8_pretrain.py][INFO] Epoch:[0/2](744700/4588595) loss:2.354 lr:0.0000100 epoch_Time:24460.0min: [2024-01-06 00:38:36,423][model8_pretrain.py][INFO] Epoch:[0/2](744700/4588595) loss:2.452 lr:0.0000100 epoch_Time:24460.0min: [2024-01-06 00:38:36,423][model8_pretrain.py][INFO] Epoch:[0/2](744700/4588595) loss:3.077 lr:0.0000100 epoch_Time:24460.0min: [2024-01-06 00:38:36,423][model8_pretrain.py][INFO] Epoch:[0/2](744700/4588595) loss:2.910 lr:0.0000100 epoch_Time:24460.0min: [2024-01-06 00:38:36,424][model8_pretrain.py][INFO] Epoch:[0/2](744700/4588595) loss:3.056 lr:0.0000100 epoch_Time:24460.0min: [2024-01-06 00:38:36,423][model8_pretrain.py][INFO] Epoch:[0/2](744700/4588595) loss:2.664 lr:0.0000100 epoch_Time:24460.0min: [2024-01-06 00:38:36,424][model8_pretrain.py][INFO] Epoch:[0/2](744700/4588595) loss:3.118 lr:0.0000100 epoch_Time:24460.0min: [2024-01-06 00:38:36,424][model8_pretrain.py][INFO] Epoch:[0/2](744700/4588595) loss:3.156 lr:0.0000100 epoch_Time:24460.0min: [2024-01-06 00:39:13,371][model8_pretrain.py][INFO] Epoch:[0/2](744800/4588595) loss:3.032 lr:0.0000100 epoch_Time:24459.0min: [2024-01-06 00:39:13,371][model8_pretrain.py][INFO] Epoch:[0/2](744800/4588595) loss:3.276 lr:0.0000100 epoch_Time:24459.0min: [2024-01-06 00:39:13,371][model8_pretrain.py][INFO] Epoch:[0/2](744800/4588595) loss:3.226 lr:0.0000100 epoch_Time:24459.0min: [2024-01-06 00:39:13,371][model8_pretrain.py][INFO] Epoch:[0/2](744800/4588595) loss:2.738 lr:0.0000100 epoch_Time:24459.0min: [2024-01-06 00:39:13,371][model8_pretrain.py][INFO] Epoch:[0/2](744800/4588595) loss:2.596 lr:0.0000100 epoch_Time:24459.0min: [2024-01-06 00:39:13,371][model8_pretrain.py][INFO] Epoch:[0/2](744800/4588595) loss:3.403 lr:0.0000100 epoch_Time:24459.0min: [2024-01-06 00:39:13,371][model8_pretrain.py][INFO] Epoch:[0/2](744800/4588595) loss:2.082 lr:0.0000100 
epoch_Time:24459.0min: [2024-01-06 00:39:13,371][model8_pretrain.py][INFO] Epoch:[0/2](744800/4588595) loss:2.502 lr:0.0000100 epoch_Time:24459.0min: [2024-01-06 00:40:02,035][model8_pretrain.py][INFO] Epoch:[0/2](744900/4588595) loss:2.748 lr:0.0000100 epoch_Time:24459.0min: [2024-01-06 00:40:02,035][model8_pretrain.py][INFO] Epoch:[0/2](744900/4588595) loss:3.124 lr:0.0000100 epoch_Time:24459.0min: [2024-01-06 00:40:02,035][model8_pretrain.py][INFO] Epoch:[0/2](744900/4588595) loss:2.324 lr:0.0000100 epoch_Time:24459.0min: [2024-01-06 00:40:02,035][model8_pretrain.py][INFO] Epoch:[0/2](744900/4588595) loss:2.650 lr:0.0000100 epoch_Time:24459.0min: [2024-01-06 00:40:02,035][model8_pretrain.py][INFO] Epoch:[0/2](744900/4588595) loss:2.814 lr:0.0000100 epoch_Time:24459.0min: [2024-01-06 00:40:02,036][model8_pretrain.py][INFO] Epoch:[0/2](744900/4588595) loss:3.381 lr:0.0000100 epoch_Time:24459.0min: [2024-01-06 00:40:02,036][model8_pretrain.py][INFO] Epoch:[0/2](744900/4588595) loss:3.040 lr:0.0000100 epoch_Time:24459.0min: [2024-01-06 00:40:02,036][model8_pretrain.py][INFO] Epoch:[0/2](744900/4588595) loss:2.677 lr:0.0000100 epoch_Time:24459.0min: [2024-01-06 00:40:38,965][model8_pretrain.py][INFO] Epoch:[0/2](745000/4588595) loss:3.120 lr:0.0000100 epoch_Time:24459.0min: [2024-01-06 00:40:38,965][model8_pretrain.py][INFO] Epoch:[0/2](745000/4588595) loss:2.441 lr:0.0000100 epoch_Time:24459.0min: [2024-01-06 00:40:38,965][model8_pretrain.py][INFO] Epoch:[0/2](745000/4588595) loss:1.990 lr:0.0000100 epoch_Time:24459.0min: [2024-01-06 00:40:38,965][model8_pretrain.py][INFO] Epoch:[0/2](745000/4588595) loss:2.457 lr:0.0000100 epoch_Time:24459.0min: [2024-01-06 00:40:38,965][model8_pretrain.py][INFO] Epoch:[0/2](745000/4588595) loss:2.806 lr:0.0000100 epoch_Time:24459.0min: [2024-01-06 00:40:38,965][model8_pretrain.py][INFO] Epoch:[0/2](745000/4588595) loss:3.028 lr:0.0000100 epoch_Time:24459.0min: [2024-01-06 00:40:38,965][model8_pretrain.py][INFO] 
Epoch:[0/2](745000/4588595) loss:3.230 lr:0.0000100 epoch_Time:24459.0min: [2024-01-06 00:40:38,965][model8_pretrain.py][INFO] Epoch:[0/2](745000/4588595) loss:2.760 lr:0.0000100 epoch_Time:24459.0min: [2024-01-06 00:41:15,903][model8_pretrain.py][INFO] Epoch:[0/2](745100/4588595) loss:2.647 lr:0.0000100 epoch_Time:24458.0min: [2024-01-06 00:41:15,903][model8_pretrain.py][INFO] Epoch:[0/2](745100/4588595) loss:2.720 lr:0.0000100 epoch_Time:24458.0min: [2024-01-06 00:41:15,903][model8_pretrain.py][INFO] Epoch:[0/2](745100/4588595) loss:2.800 lr:0.0000100 epoch_Time:24458.0min: [2024-01-06 00:41:15,903][model8_pretrain.py][INFO] Epoch:[0/2](745100/4588595) loss:3.045 lr:0.0000100 epoch_Time:24458.0min: [2024-01-06 00:41:15,903][model8_pretrain.py][INFO] Epoch:[0/2](745100/4588595) loss:2.809 lr:0.0000100 epoch_Time:24458.0min: [2024-01-06 00:41:15,903][model8_pretrain.py][INFO] Epoch:[0/2](745100/4588595) loss:2.592 lr:0.0000100 epoch_Time:24458.0min: [2024-01-06 00:41:15,903][model8_pretrain.py][INFO] Epoch:[0/2](745100/4588595) loss:2.821 lr:0.0000100 epoch_Time:24458.0min: [2024-01-06 00:41:15,903][model8_pretrain.py][INFO] Epoch:[0/2](745100/4588595) loss:2.927 lr:0.0000100 epoch_Time:24458.0min: [2024-01-06 00:41:52,856][model8_pretrain.py][INFO] Epoch:[0/2](745200/4588595) loss:2.231 lr:0.0000100 epoch_Time:24457.0min: [2024-01-06 00:41:52,856][model8_pretrain.py][INFO] Epoch:[0/2](745200/4588595) loss:2.926 lr:0.0000100 epoch_Time:24457.0min: [2024-01-06 00:41:52,856][model8_pretrain.py][INFO] Epoch:[0/2](745200/4588595) loss:3.090 lr:0.0000100 epoch_Time:24457.0min: [2024-01-06 00:41:52,856][model8_pretrain.py][INFO] Epoch:[0/2](745200/4588595) loss:3.237 lr:0.0000100 epoch_Time:24457.0min: [2024-01-06 00:41:52,856][model8_pretrain.py][INFO] Epoch:[0/2](745200/4588595) loss:2.782 lr:0.0000100 epoch_Time:24457.0min: [2024-01-06 00:41:52,856][model8_pretrain.py][INFO] Epoch:[0/2](745200/4588595) loss:2.453 lr:0.0000100 epoch_Time:24457.0min: [2024-01-06 
00:41:52,856][model8_pretrain.py][INFO] Epoch:[0/2](745200/4588595) loss:1.902 lr:0.0000100 epoch_Time:24457.0min: [2024-01-06 00:41:52,857][model8_pretrain.py][INFO] Epoch:[0/2](745200/4588595) loss:2.682 lr:0.0000100 epoch_Time:24457.0min: [2024-01-06 00:42:29,805][model8_pretrain.py][INFO] Epoch:[0/2](745300/4588595) loss:2.667 lr:0.0000100 epoch_Time:24457.0min: [2024-01-06 00:42:29,805][model8_pretrain.py][INFO] Epoch:[0/2](745300/4588595) loss:2.477 lr:0.0000100 epoch_Time:24457.0min: [2024-01-06 00:42:29,805][model8_pretrain.py][INFO] Epoch:[0/2](745300/4588595) loss:2.993 lr:0.0000100 epoch_Time:24457.0min: [2024-01-06 00:42:29,805][model8_pretrain.py][INFO] Epoch:[0/2](745300/4588595) loss:3.198 lr:0.0000100 epoch_Time:24457.0min: [2024-01-06 00:42:29,805][model8_pretrain.py][INFO] Epoch:[0/2](745300/4588595) loss:3.571 lr:0.0000100 epoch_Time:24457.0min: [2024-01-06 00:42:29,805][model8_pretrain.py][INFO] Epoch:[0/2](745300/4588595) loss:2.237 lr:0.0000100 epoch_Time:24457.0min: [2024-01-06 00:42:29,805][model8_pretrain.py][INFO] Epoch:[0/2](745300/4588595) loss:2.854 lr:0.0000100 epoch_Time:24457.0min: [2024-01-06 00:42:29,805][model8_pretrain.py][INFO] Epoch:[0/2](745300/4588595) loss:2.469 lr:0.0000100 epoch_Time:24457.0min: [2024-01-06 00:43:06,751][model8_pretrain.py][INFO] Epoch:[0/2](745400/4588595) loss:2.877 lr:0.0000100 epoch_Time:24456.0min: [2024-01-06 00:43:06,751][model8_pretrain.py][INFO] Epoch:[0/2](745400/4588595) loss:3.210 lr:0.0000100 epoch_Time:24456.0min: [2024-01-06 00:43:06,751][model8_pretrain.py][INFO] Epoch:[0/2](745400/4588595) loss:3.018 lr:0.0000100 epoch_Time:24456.0min: [2024-01-06 00:43:06,751][model8_pretrain.py][INFO] Epoch:[0/2](745400/4588595) loss:3.278 lr:0.0000100 epoch_Time:24456.0min: [2024-01-06 00:43:06,752][model8_pretrain.py][INFO] Epoch:[0/2](745400/4588595) loss:2.899 lr:0.0000100 epoch_Time:24456.0min: [2024-01-06 00:43:06,752][model8_pretrain.py][INFO] Epoch:[0/2](745400/4588595) loss:2.660 lr:0.0000100 
epoch_Time:24456.0min: [2024-01-06 00:43:06,752][model8_pretrain.py][INFO] Epoch:[0/2](745400/4588595) loss:2.793 lr:0.0000100 epoch_Time:24456.0min: [2024-01-06 00:43:06,752][model8_pretrain.py][INFO] Epoch:[0/2](745400/4588595) loss:2.777 lr:0.0000100 epoch_Time:24456.0min: [2024-01-06 00:43:43,712][model8_pretrain.py][INFO] Epoch:[0/2](745500/4588595) loss:2.997 lr:0.0000100 epoch_Time:24456.0min: [2024-01-06 00:43:43,712][model8_pretrain.py][INFO] Epoch:[0/2](745500/4588595) loss:2.936 lr:0.0000100 epoch_Time:24456.0min: [2024-01-06 00:43:43,712][model8_pretrain.py][INFO] Epoch:[0/2](745500/4588595) loss:2.946 lr:0.0000100 epoch_Time:24456.0min: [2024-01-06 00:43:43,712][model8_pretrain.py][INFO] Epoch:[0/2](745500/4588595) loss:2.599 lr:0.0000100 epoch_Time:24456.0min: [2024-01-06 00:43:43,712][model8_pretrain.py][INFO] Epoch:[0/2](745500/4588595) loss:3.229 lr:0.0000100 epoch_Time:24456.0min: [2024-01-06 00:43:43,712][model8_pretrain.py][INFO] Epoch:[0/2](745500/4588595) loss:3.301 lr:0.0000100 epoch_Time:24456.0min: [2024-01-06 00:43:43,712][model8_pretrain.py][INFO] Epoch:[0/2](745500/4588595) loss:2.090 lr:0.0000100 epoch_Time:24456.0min: [2024-01-06 00:43:43,712][model8_pretrain.py][INFO] Epoch:[0/2](745500/4588595) loss:2.485 lr:0.0000100 epoch_Time:24456.0min: [2024-01-06 00:44:20,662][model8_pretrain.py][INFO] Epoch:[0/2](745600/4588595) loss:2.305 lr:0.0000100 epoch_Time:24455.0min: [2024-01-06 00:44:20,662][model8_pretrain.py][INFO] Epoch:[0/2](745600/4588595) loss:3.055 lr:0.0000100 epoch_Time:24455.0min: [2024-01-06 00:44:20,662][model8_pretrain.py][INFO] Epoch:[0/2](745600/4588595) loss:2.681 lr:0.0000100 epoch_Time:24455.0min: [2024-01-06 00:44:20,662][model8_pretrain.py][INFO] Epoch:[0/2](745600/4588595) loss:3.008 lr:0.0000100 epoch_Time:24455.0min: [2024-01-06 00:44:20,662][model8_pretrain.py][INFO] Epoch:[0/2](745600/4588595) loss:2.956 lr:0.0000100 epoch_Time:24455.0min: [2024-01-06 00:44:20,662][model8_pretrain.py][INFO] 
Epoch:[0/2](745600/4588595) loss:2.690 lr:0.0000100 epoch_Time:24455.0min: [2024-01-06 00:44:20,662][model8_pretrain.py][INFO] Epoch:[0/2](745600/4588595) loss:2.884 lr:0.0000100 epoch_Time:24455.0min: [2024-01-06 00:44:20,662][model8_pretrain.py][INFO] Epoch:[0/2](745600/4588595) loss:2.250 lr:0.0000100 epoch_Time:24455.0min: [2024-01-06 00:45:09,454][model8_pretrain.py][INFO] Epoch:[0/2](745700/4588595) loss:2.558 lr:0.0000100 epoch_Time:24455.0min: [2024-01-06 00:45:09,454][model8_pretrain.py][INFO] Epoch:[0/2](745700/4588595) loss:3.123 lr:0.0000100 epoch_Time:24455.0min: [2024-01-06 00:45:09,454][model8_pretrain.py][INFO] Epoch:[0/2](745700/4588595) loss:3.113 lr:0.0000100 epoch_Time:24455.0min: [2024-01-06 00:45:09,454][model8_pretrain.py][INFO] Epoch:[0/2](745700/4588595) loss:2.668 lr:0.0000100 epoch_Time:24455.0min: [2024-01-06 00:45:09,454][model8_pretrain.py][INFO] Epoch:[0/2](745700/4588595) loss:3.208 lr:0.0000100 epoch_Time:24455.0min: [2024-01-06 00:45:09,454][model8_pretrain.py][INFO] Epoch:[0/2](745700/4588595) loss:3.195 lr:0.0000100 epoch_Time:24455.0min: [2024-01-06 00:45:09,454][model8_pretrain.py][INFO] Epoch:[0/2](745700/4588595) loss:3.006 lr:0.0000100 epoch_Time:24455.0min: [2024-01-06 00:45:09,455][model8_pretrain.py][INFO] Epoch:[0/2](745700/4588595) loss:2.583 lr:0.0000100 epoch_Time:24455.0min: [2024-01-06 00:45:46,386][model8_pretrain.py][INFO] Epoch:[0/2](745800/4588595) loss:3.329 lr:0.0000100 epoch_Time:24454.0min: [2024-01-06 00:45:46,386][model8_pretrain.py][INFO] Epoch:[0/2](745800/4588595) loss:3.110 lr:0.0000100 epoch_Time:24454.0min: [2024-01-06 00:45:46,386][model8_pretrain.py][INFO] Epoch:[0/2](745800/4588595) loss:2.941 lr:0.0000100 epoch_Time:24454.0min: [2024-01-06 00:45:46,386][model8_pretrain.py][INFO] Epoch:[0/2](745800/4588595) loss:2.838 lr:0.0000100 epoch_Time:24454.0min: [2024-01-06 00:45:46,386][model8_pretrain.py][INFO] Epoch:[0/2](745800/4588595) loss:3.010 lr:0.0000100 epoch_Time:24454.0min: [2024-01-06 
00:45:46,386][model8_pretrain.py][INFO] Epoch:[0/2](745800/4588595) loss:2.939 lr:0.0000100 epoch_Time:24454.0min: [2024-01-06 00:45:46,386][model8_pretrain.py][INFO] Epoch:[0/2](745800/4588595) loss:3.334 lr:0.0000100 epoch_Time:24454.0min: [2024-01-06 00:45:46,387][model8_pretrain.py][INFO] Epoch:[0/2](745800/4588595) loss:2.891 lr:0.0000100 epoch_Time:24454.0min: [2024-01-06 00:46:23,344][model8_pretrain.py][INFO] Epoch:[0/2](745900/4588595) loss:3.230 lr:0.0000100 epoch_Time:24453.0min: [2024-01-06 00:46:23,344][model8_pretrain.py][INFO] Epoch:[0/2](745900/4588595) loss:2.959 lr:0.0000100 epoch_Time:24453.0min: [2024-01-06 00:46:23,344][model8_pretrain.py][INFO] Epoch:[0/2](745900/4588595) loss:2.903 lr:0.0000100 epoch_Time:24453.0min: [2024-01-06 00:46:23,344][model8_pretrain.py][INFO] Epoch:[0/2](745900/4588595) loss:2.945 lr:0.0000100 epoch_Time:24453.0min: [2024-01-06 00:46:23,344][model8_pretrain.py][INFO] Epoch:[0/2](745900/4588595) loss:2.657 lr:0.0000100 epoch_Time:24453.0min: [2024-01-06 00:46:23,344][model8_pretrain.py][INFO] Epoch:[0/2](745900/4588595) loss:2.332 lr:0.0000100 epoch_Time:24453.0min: [2024-01-06 00:46:23,344][model8_pretrain.py][INFO] Epoch:[0/2](745900/4588595) loss:2.609 lr:0.0000100 epoch_Time:24453.0min: [2024-01-06 00:46:23,344][model8_pretrain.py][INFO] Epoch:[0/2](745900/4588595) loss:2.176 lr:0.0000100 epoch_Time:24453.0min: [2024-01-06 00:47:00,294][model8_pretrain.py][INFO] Epoch:[0/2](746000/4588595) loss:2.885 lr:0.0000100 epoch_Time:24452.0min: [2024-01-06 00:47:00,294][model8_pretrain.py][INFO] Epoch:[0/2](746000/4588595) loss:3.102 lr:0.0000100 epoch_Time:24452.0min: [2024-01-06 00:47:00,294][model8_pretrain.py][INFO] Epoch:[0/2](746000/4588595) loss:2.549 lr:0.0000100 epoch_Time:24452.0min: [2024-01-06 00:47:00,294][model8_pretrain.py][INFO] Epoch:[0/2](746000/4588595) loss:2.746 lr:0.0000100 epoch_Time:24452.0min: [2024-01-06 00:47:00,294][model8_pretrain.py][INFO] Epoch:[0/2](746000/4588595) loss:2.455 lr:0.0000100 
epoch_Time:24452.0min: [2024-01-06 00:47:00,294][model8_pretrain.py][INFO] Epoch:[0/2](746000/4588595) loss:3.013 lr:0.0000100 epoch_Time:24452.0min: [2024-01-06 00:47:00,295][model8_pretrain.py][INFO] Epoch:[0/2](746000/4588595) loss:2.471 lr:0.0000100 epoch_Time:24452.0min: [2024-01-06 00:47:00,295][model8_pretrain.py][INFO] Epoch:[0/2](746000/4588595) loss:2.632 lr:0.0000100 epoch_Time:24452.0min: [2024-01-06 00:47:37,238][model8_pretrain.py][INFO] Epoch:[0/2](746100/4588595) loss:2.789 lr:0.0000100 epoch_Time:24452.0min: [2024-01-06 00:47:37,238][model8_pretrain.py][INFO] Epoch:[0/2](746100/4588595) loss:2.863 lr:0.0000100 epoch_Time:24452.0min: [2024-01-06 00:47:37,238][model8_pretrain.py][INFO] Epoch:[0/2](746100/4588595) loss:1.990 lr:0.0000100 epoch_Time:24452.0min: [2024-01-06 00:47:37,239][model8_pretrain.py][INFO] Epoch:[0/2](746100/4588595) loss:2.966 lr:0.0000100 epoch_Time:24452.0min: [2024-01-06 00:47:37,239][model8_pretrain.py][INFO] Epoch:[0/2](746100/4588595) loss:2.396 lr:0.0000100 epoch_Time:24452.0min: [2024-01-06 00:47:37,239][model8_pretrain.py][INFO] Epoch:[0/2](746100/4588595) loss:2.849 lr:0.0000100 epoch_Time:24452.0min: [2024-01-06 00:47:37,239][model8_pretrain.py][INFO] Epoch:[0/2](746100/4588595) loss:2.717 lr:0.0000100 epoch_Time:24452.0min: [2024-01-06 00:47:37,239][model8_pretrain.py][INFO] Epoch:[0/2](746100/4588595) loss:2.372 lr:0.0000100 epoch_Time:24452.0min: [2024-01-06 00:48:14,179][model8_pretrain.py][INFO] Epoch:[0/2](746200/4588595) loss:2.805 lr:0.0000100 epoch_Time:24451.0min: [2024-01-06 00:48:14,179][model8_pretrain.py][INFO] Epoch:[0/2](746200/4588595) loss:2.747 lr:0.0000100 epoch_Time:24451.0min: [2024-01-06 00:48:14,179][model8_pretrain.py][INFO] Epoch:[0/2](746200/4588595) loss:2.596 lr:0.0000100 epoch_Time:24451.0min: [2024-01-06 00:48:14,179][model8_pretrain.py][INFO] Epoch:[0/2](746200/4588595) loss:2.400 lr:0.0000100 epoch_Time:24451.0min: [2024-01-06 00:48:14,179][model8_pretrain.py][INFO] 
Epoch:[0/2](746200/4588595) loss:2.998 lr:0.0000100 epoch_Time:24451.0min: [2024-01-06 00:48:14,179][model8_pretrain.py][INFO] Epoch:[0/2](746200/4588595) loss:2.960 lr:0.0000100 epoch_Time:24451.0min: [2024-01-06 00:48:14,179][model8_pretrain.py][INFO] Epoch:[0/2](746200/4588595) loss:2.003 lr:0.0000100 epoch_Time:24451.0min: [2024-01-06 00:48:14,179][model8_pretrain.py][INFO] Epoch:[0/2](746200/4588595) loss:3.022 lr:0.0000100 epoch_Time:24451.0min: [2024-01-06 00:48:51,118][model8_pretrain.py][INFO] Epoch:[0/2](746300/4588595) loss:3.026 lr:0.0000100 epoch_Time:24450.0min: [2024-01-06 00:48:51,118][model8_pretrain.py][INFO] Epoch:[0/2](746300/4588595) loss:2.633 lr:0.0000100 epoch_Time:24450.0min: [2024-01-06 00:48:51,118][model8_pretrain.py][INFO] Epoch:[0/2](746300/4588595) loss:2.546 lr:0.0000100 epoch_Time:24450.0min: [2024-01-06 00:48:51,118][model8_pretrain.py][INFO] Epoch:[0/2](746300/4588595) loss:2.684 lr:0.0000100 epoch_Time:24450.0min: [2024-01-06 00:48:51,118][model8_pretrain.py][INFO] Epoch:[0/2](746300/4588595) loss:2.217 lr:0.0000100 epoch_Time:24450.0min: [2024-01-06 00:48:51,118][model8_pretrain.py][INFO] Epoch:[0/2](746300/4588595) loss:2.788 lr:0.0000100 epoch_Time:24450.0min: [2024-01-06 00:48:51,118][model8_pretrain.py][INFO] Epoch:[0/2](746300/4588595) loss:2.581 lr:0.0000100 epoch_Time:24450.0min: [2024-01-06 00:48:51,118][model8_pretrain.py][INFO] Epoch:[0/2](746300/4588595) loss:2.805 lr:0.0000100 epoch_Time:24450.0min: [2024-01-06 00:49:28,049][model8_pretrain.py][INFO] Epoch:[0/2](746400/4588595) loss:2.896 lr:0.0000100 epoch_Time:24450.0min: [2024-01-06 00:49:28,049][model8_pretrain.py][INFO] Epoch:[0/2](746400/4588595) loss:2.640 lr:0.0000100 epoch_Time:24450.0min: [2024-01-06 00:49:28,049][model8_pretrain.py][INFO] Epoch:[0/2](746400/4588595) loss:2.129 lr:0.0000100 epoch_Time:24450.0min: [2024-01-06 00:49:28,049][model8_pretrain.py][INFO] Epoch:[0/2](746400/4588595) loss:2.345 lr:0.0000100 epoch_Time:24450.0min: [2024-01-06 
00:49:28,049][model8_pretrain.py][INFO] Epoch:[0/2](746400/4588595) loss:2.732 lr:0.0000100 epoch_Time:24450.0min: [2024-01-06 00:49:28,049][model8_pretrain.py][INFO] Epoch:[0/2](746400/4588595) loss:2.996 lr:0.0000100 epoch_Time:24450.0min: [2024-01-06 00:49:28,049][model8_pretrain.py][INFO] Epoch:[0/2](746400/4588595) loss:2.716 lr:0.0000100 epoch_Time:24450.0min: [2024-01-06 00:49:28,049][model8_pretrain.py][INFO] Epoch:[0/2](746400/4588595) loss:2.928 lr:0.0000100 epoch_Time:24450.0min: [2024-01-06 00:50:17,247][model8_pretrain.py][INFO] Epoch:[0/2](746500/4588595) loss:2.914 lr:0.0000100 epoch_Time:24450.0min: [2024-01-06 00:50:17,248][model8_pretrain.py][INFO] Epoch:[0/2](746500/4588595) loss:2.768 lr:0.0000100 epoch_Time:24450.0min: [2024-01-06 00:50:17,247][model8_pretrain.py][INFO] Epoch:[0/2](746500/4588595) loss:2.345 lr:0.0000100 epoch_Time:24450.0min: [2024-01-06 00:50:17,247][model8_pretrain.py][INFO] Epoch:[0/2](746500/4588595) loss:3.027 lr:0.0000100 epoch_Time:24450.0min: [2024-01-06 00:50:17,247][model8_pretrain.py][INFO] Epoch:[0/2](746500/4588595) loss:3.073 lr:0.0000100 epoch_Time:24450.0min: [2024-01-06 00:50:17,248][model8_pretrain.py][INFO] Epoch:[0/2](746500/4588595) loss:2.305 lr:0.0000100 epoch_Time:24450.0min: [2024-01-06 00:50:17,248][model8_pretrain.py][INFO] Epoch:[0/2](746500/4588595) loss:2.808 lr:0.0000100 epoch_Time:24450.0min: [2024-01-06 00:50:17,248][model8_pretrain.py][INFO] Epoch:[0/2](746500/4588595) loss:2.842 lr:0.0000100 epoch_Time:24450.0min: [2024-01-06 00:50:54,182][model8_pretrain.py][INFO] Epoch:[0/2](746600/4588595) loss:2.546 lr:0.0000100 epoch_Time:24449.0min: [2024-01-06 00:50:54,182][model8_pretrain.py][INFO] Epoch:[0/2](746600/4588595) loss:2.344 lr:0.0000100 epoch_Time:24449.0min: [2024-01-06 00:50:54,182][model8_pretrain.py][INFO] Epoch:[0/2](746600/4588595) loss:3.005 lr:0.0000100 epoch_Time:24449.0min: [2024-01-06 00:50:54,182][model8_pretrain.py][INFO] Epoch:[0/2](746600/4588595) loss:2.266 lr:0.0000100 
epoch_Time:24449.0min: [2024-01-06 00:50:54,182][model8_pretrain.py][INFO] Epoch:[0/2](746600/4588595) loss:2.981 lr:0.0000100 epoch_Time:24449.0min: [2024-01-06 00:50:54,182][model8_pretrain.py][INFO] Epoch:[0/2](746600/4588595) loss:2.708 lr:0.0000100 epoch_Time:24449.0min: [2024-01-06 00:50:54,182][model8_pretrain.py][INFO] Epoch:[0/2](746600/4588595) loss:2.756 lr:0.0000100 epoch_Time:24449.0min: [2024-01-06 00:50:54,182][model8_pretrain.py][INFO] Epoch:[0/2](746600/4588595) loss:2.887 lr:0.0000100 epoch_Time:24449.0min: [2024-01-06 00:51:31,098][model8_pretrain.py][INFO] Epoch:[0/2](746700/4588595) loss:3.060 lr:0.0000100 epoch_Time:24449.0min: [2024-01-06 00:51:31,098][model8_pretrain.py][INFO] Epoch:[0/2](746700/4588595) loss:2.540 lr:0.0000100 epoch_Time:24449.0min: [2024-01-06 00:51:31,098][model8_pretrain.py][INFO] Epoch:[0/2](746700/4588595) loss:3.052 lr:0.0000100 epoch_Time:24449.0min: [2024-01-06 00:51:31,098][model8_pretrain.py][INFO] Epoch:[0/2](746700/4588595) loss:2.695 lr:0.0000100 epoch_Time:24449.0min: [2024-01-06 00:51:31,098][model8_pretrain.py][INFO] Epoch:[0/2](746700/4588595) loss:2.858 lr:0.0000100 epoch_Time:24449.0min: [2024-01-06 00:51:31,098][model8_pretrain.py][INFO] Epoch:[0/2](746700/4588595) loss:2.583 lr:0.0000100 epoch_Time:24449.0min: [2024-01-06 00:51:31,098][model8_pretrain.py][INFO] Epoch:[0/2](746700/4588595) loss:2.932 lr:0.0000100 epoch_Time:24449.0min: [2024-01-06 00:51:31,098][model8_pretrain.py][INFO] Epoch:[0/2](746700/4588595) loss:2.451 lr:0.0000100 epoch_Time:24449.0min: [2024-01-06 00:52:08,035][model8_pretrain.py][INFO] Epoch:[0/2](746800/4588595) loss:3.048 lr:0.0000100 epoch_Time:24447.0min: [2024-01-06 00:52:08,035][model8_pretrain.py][INFO] Epoch:[0/2](746800/4588595) loss:2.215 lr:0.0000100 epoch_Time:24447.0min: [2024-01-06 00:52:08,035][model8_pretrain.py][INFO] Epoch:[0/2](746800/4588595) loss:3.129 lr:0.0000100 epoch_Time:24447.0min: [2024-01-06 00:52:08,035][model8_pretrain.py][INFO] 
Epoch:[0/2](746800/4588595) loss:3.043 lr:0.0000100 epoch_Time:24447.0min: [2024-01-06 00:52:08,035][model8_pretrain.py][INFO] Epoch:[0/2](746800/4588595) loss:2.925 lr:0.0000100 epoch_Time:24447.0min: [2024-01-06 00:52:08,035][model8_pretrain.py][INFO] Epoch:[0/2](746800/4588595) loss:2.829 lr:0.0000100 epoch_Time:24447.0min: [2024-01-06 00:52:08,036][model8_pretrain.py][INFO] Epoch:[0/2](746800/4588595) loss:3.070 lr:0.0000100 epoch_Time:24447.0min: [2024-01-06 00:52:08,036][model8_pretrain.py][INFO] Epoch:[0/2](746800/4588595) loss:2.918 lr:0.0000100 epoch_Time:24447.0min: [2024-01-06 00:52:44,990][model8_pretrain.py][INFO] Epoch:[0/2](746900/4588595) loss:2.772 lr:0.0000100 epoch_Time:24447.0min: [2024-01-06 00:52:44,990][model8_pretrain.py][INFO] Epoch:[0/2](746900/4588595) loss:2.908 lr:0.0000100 epoch_Time:24447.0min: [2024-01-06 00:52:44,990][model8_pretrain.py][INFO] Epoch:[0/2](746900/4588595) loss:3.586 lr:0.0000100 epoch_Time:24447.0min: [2024-01-06 00:52:44,990][model8_pretrain.py][INFO] Epoch:[0/2](746900/4588595) loss:2.627 lr:0.0000100 epoch_Time:24447.0min: [2024-01-06 00:52:44,990][model8_pretrain.py][INFO] Epoch:[0/2](746900/4588595) loss:2.509 lr:0.0000100 epoch_Time:24447.0min: [2024-01-06 00:52:44,990][model8_pretrain.py][INFO] Epoch:[0/2](746900/4588595) loss:2.996 lr:0.0000100 epoch_Time:24447.0min: [2024-01-06 00:52:44,990][model8_pretrain.py][INFO] Epoch:[0/2](746900/4588595) loss:3.005 lr:0.0000100 epoch_Time:24447.0min: [2024-01-06 00:52:44,990][model8_pretrain.py][INFO] Epoch:[0/2](746900/4588595) loss:2.883 lr:0.0000100 epoch_Time:24447.0min: [2024-01-06 00:53:21,939][model8_pretrain.py][INFO] Epoch:[0/2](747000/4588595) loss:2.729 lr:0.0000100 epoch_Time:24446.0min: [2024-01-06 00:53:21,939][model8_pretrain.py][INFO] Epoch:[0/2](747000/4588595) loss:2.417 lr:0.0000100 epoch_Time:24446.0min: [2024-01-06 00:53:21,939][model8_pretrain.py][INFO] Epoch:[0/2](747000/4588595) loss:3.131 lr:0.0000100 epoch_Time:24446.0min: [2024-01-06 
00:53:21,939][model8_pretrain.py][INFO] Epoch:[0/2](747000/4588595) loss:2.963 lr:0.0000100 epoch_Time:24446.0min: [2024-01-06 00:53:21,939][model8_pretrain.py][INFO] Epoch:[0/2](747000/4588595) loss:2.446 lr:0.0000100 epoch_Time:24446.0min: [2024-01-06 00:53:21,939][model8_pretrain.py][INFO] Epoch:[0/2](747000/4588595) loss:2.332 lr:0.0000100 epoch_Time:24446.0min: [2024-01-06 00:53:21,939][model8_pretrain.py][INFO] Epoch:[0/2](747000/4588595) loss:3.077 lr:0.0000100 epoch_Time:24446.0min: [2024-01-06 00:53:21,939][model8_pretrain.py][INFO] Epoch:[0/2](747000/4588595) loss:3.523 lr:0.0000100 epoch_Time:24446.0min: [2024-01-06 00:53:58,852][model8_pretrain.py][INFO] Epoch:[0/2](747100/4588595) loss:2.954 lr:0.0000100 epoch_Time:24445.0min: [2024-01-06 00:53:58,852][model8_pretrain.py][INFO] Epoch:[0/2](747100/4588595) loss:2.827 lr:0.0000100 epoch_Time:24445.0min: [2024-01-06 00:53:58,852][model8_pretrain.py][INFO] Epoch:[0/2](747100/4588595) loss:3.202 lr:0.0000100 epoch_Time:24445.0min: [2024-01-06 00:53:58,852][model8_pretrain.py][INFO] Epoch:[0/2](747100/4588595) loss:2.662 lr:0.0000100 epoch_Time:24445.0min: [2024-01-06 00:53:58,852][model8_pretrain.py][INFO] Epoch:[0/2](747100/4588595) loss:3.073 lr:0.0000100 epoch_Time:24445.0min: [2024-01-06 00:53:58,852][model8_pretrain.py][INFO] Epoch:[0/2](747100/4588595) loss:3.201 lr:0.0000100 epoch_Time:24445.0min: [2024-01-06 00:53:58,852][model8_pretrain.py][INFO] Epoch:[0/2](747100/4588595) loss:2.515 lr:0.0000100 epoch_Time:24445.0min: [2024-01-06 00:53:58,852][model8_pretrain.py][INFO] Epoch:[0/2](747100/4588595) loss:2.379 lr:0.0000100 epoch_Time:24445.0min: [2024-01-06 00:54:35,815][model8_pretrain.py][INFO] Epoch:[0/2](747200/4588595) loss:3.172 lr:0.0000100 epoch_Time:24445.0min: [2024-01-06 00:54:35,815][model8_pretrain.py][INFO] Epoch:[0/2](747200/4588595) loss:2.806 lr:0.0000100 epoch_Time:24445.0min: [2024-01-06 00:54:35,815][model8_pretrain.py][INFO] Epoch:[0/2](747200/4588595) loss:2.868 lr:0.0000100 
epoch_Time:24445.0min: [2024-01-06 00:54:35,815][model8_pretrain.py][INFO] Epoch:[0/2](747200/4588595) loss:2.979 lr:0.0000100 epoch_Time:24445.0min: [2024-01-06 00:54:35,815][model8_pretrain.py][INFO] Epoch:[0/2](747200/4588595) loss:2.372 lr:0.0000100 epoch_Time:24445.0min: [2024-01-06 00:54:35,815][model8_pretrain.py][INFO] Epoch:[0/2](747200/4588595) loss:2.390 lr:0.0000100 epoch_Time:24445.0min: [2024-01-06 00:54:35,815][model8_pretrain.py][INFO] Epoch:[0/2](747200/4588595) loss:3.051 lr:0.0000100 epoch_Time:24445.0min: [2024-01-06 00:54:35,815][model8_pretrain.py][INFO] Epoch:[0/2](747200/4588595) loss:2.996 lr:0.0000100 epoch_Time:24445.0min: [2024-01-06 00:55:23,196][model8_pretrain.py][INFO] Epoch:[0/2](747300/4588595) loss:2.483 lr:0.0000100 epoch_Time:24445.0min: [2024-01-06 00:55:23,196][model8_pretrain.py][INFO] Epoch:[0/2](747300/4588595) loss:2.908 lr:0.0000100 epoch_Time:24445.0min: [2024-01-06 00:55:23,196][model8_pretrain.py][INFO] Epoch:[0/2](747300/4588595) loss:3.152 lr:0.0000100 epoch_Time:24445.0min: [2024-01-06 00:55:23,196][model8_pretrain.py][INFO] Epoch:[0/2](747300/4588595) loss:2.894 lr:0.0000100 epoch_Time:24445.0min: [2024-01-06 00:55:23,196][model8_pretrain.py][INFO] Epoch:[0/2](747300/4588595) loss:2.561 lr:0.0000100 epoch_Time:24445.0min: [2024-01-06 00:55:23,196][model8_pretrain.py][INFO] Epoch:[0/2](747300/4588595) loss:2.879 lr:0.0000100 epoch_Time:24445.0min: [2024-01-06 00:55:23,196][model8_pretrain.py][INFO] Epoch:[0/2](747300/4588595) loss:2.857 lr:0.0000100 epoch_Time:24445.0min: [2024-01-06 00:55:23,196][model8_pretrain.py][INFO] Epoch:[0/2](747300/4588595) loss:2.898 lr:0.0000100 epoch_Time:24445.0min: [2024-01-06 00:56:00,127][model8_pretrain.py][INFO] Epoch:[0/2](747400/4588595) loss:2.844 lr:0.0000100 epoch_Time:24444.0min: [2024-01-06 00:56:00,127][model8_pretrain.py][INFO] Epoch:[0/2](747400/4588595) loss:2.625 lr:0.0000100 epoch_Time:24444.0min: [2024-01-06 00:56:00,127][model8_pretrain.py][INFO] 
Epoch:[0/2](747400/4588595) loss:2.750 lr:0.0000100 epoch_Time:24444.0min: [2024-01-06 00:56:00,127][model8_pretrain.py][INFO] Epoch:[0/2](747400/4588595) loss:2.709 lr:0.0000100 epoch_Time:24444.0min: [2024-01-06 00:56:00,127][model8_pretrain.py][INFO] Epoch:[0/2](747400/4588595) loss:2.854 lr:0.0000100 epoch_Time:24444.0min: [2024-01-06 00:56:00,127][model8_pretrain.py][INFO] Epoch:[0/2](747400/4588595) loss:3.038 lr:0.0000100 epoch_Time:24444.0min: [2024-01-06 00:56:00,127][model8_pretrain.py][INFO] Epoch:[0/2](747400/4588595) loss:3.051 lr:0.0000100 epoch_Time:24444.0min: [2024-01-06 00:56:00,127][model8_pretrain.py][INFO] Epoch:[0/2](747400/4588595) loss:2.555 lr:0.0000100 epoch_Time:24444.0min: [2024-01-06 00:56:37,069][model8_pretrain.py][INFO] Epoch:[0/2](747500/4588595) loss:2.844 lr:0.0000100 epoch_Time:24444.0min: [2024-01-06 00:56:37,069][model8_pretrain.py][INFO] Epoch:[0/2](747500/4588595) loss:2.962 lr:0.0000100 epoch_Time:24444.0min: [2024-01-06 00:56:37,069][model8_pretrain.py][INFO] Epoch:[0/2](747500/4588595) loss:3.239 lr:0.0000100 epoch_Time:24444.0min: [2024-01-06 00:56:37,069][model8_pretrain.py][INFO] Epoch:[0/2](747500/4588595) loss:2.964 lr:0.0000100 epoch_Time:24444.0min: [2024-01-06 00:56:37,069][model8_pretrain.py][INFO] Epoch:[0/2](747500/4588595) loss:3.217 lr:0.0000100 epoch_Time:24444.0min: [2024-01-06 00:56:37,069][model8_pretrain.py][INFO] Epoch:[0/2](747500/4588595) loss:2.868 lr:0.0000100 epoch_Time:24444.0min: [2024-01-06 00:56:37,069][model8_pretrain.py][INFO] Epoch:[0/2](747500/4588595) loss:2.846 lr:0.0000100 epoch_Time:24444.0min: [2024-01-06 00:56:37,069][model8_pretrain.py][INFO] Epoch:[0/2](747500/4588595) loss:2.566 lr:0.0000100 epoch_Time:24444.0min: [2024-01-06 00:57:14,010][model8_pretrain.py][INFO] Epoch:[0/2](747600/4588595) loss:2.814 lr:0.0000100 epoch_Time:24443.0min: [2024-01-06 00:57:14,010][model8_pretrain.py][INFO] Epoch:[0/2](747600/4588595) loss:2.505 lr:0.0000100 epoch_Time:24443.0min: [2024-01-06 
00:57:14,010][model8_pretrain.py][INFO] Epoch:[0/2](747600/4588595) loss:2.952 lr:0.0000100 epoch_Time:24443.0min: [2024-01-06 00:57:14,010][model8_pretrain.py][INFO] Epoch:[0/2](747600/4588595) loss:2.753 lr:0.0000100 epoch_Time:24443.0min: [2024-01-06 00:57:14,010][model8_pretrain.py][INFO] Epoch:[0/2](747600/4588595) loss:2.544 lr:0.0000100 epoch_Time:24443.0min: [2024-01-06 00:57:14,010][model8_pretrain.py][INFO] Epoch:[0/2](747600/4588595) loss:3.024 lr:0.0000100 epoch_Time:24443.0min: [2024-01-06 00:57:14,010][model8_pretrain.py][INFO] Epoch:[0/2](747600/4588595) loss:2.744 lr:0.0000100 epoch_Time:24443.0min: [2024-01-06 00:57:14,011][model8_pretrain.py][INFO] Epoch:[0/2](747600/4588595) loss:2.826 lr:0.0000100 epoch_Time:24443.0min: [2024-01-06 00:57:50,955][model8_pretrain.py][INFO] Epoch:[0/2](747700/4588595) loss:3.090 lr:0.0000100 epoch_Time:24441.0min: [2024-01-06 00:57:50,955][model8_pretrain.py][INFO] Epoch:[0/2](747700/4588595) loss:2.772 lr:0.0000100 epoch_Time:24441.0min: [2024-01-06 00:57:50,955][model8_pretrain.py][INFO] Epoch:[0/2](747700/4588595) loss:3.040 lr:0.0000100 epoch_Time:24441.0min: [2024-01-06 00:57:50,955][model8_pretrain.py][INFO] Epoch:[0/2](747700/4588595) loss:2.820 lr:0.0000100 epoch_Time:24441.0min: [2024-01-06 00:57:50,955][model8_pretrain.py][INFO] Epoch:[0/2](747700/4588595) loss:2.594 lr:0.0000100 epoch_Time:24441.0min: [2024-01-06 00:57:50,955][model8_pretrain.py][INFO] Epoch:[0/2](747700/4588595) loss:2.499 lr:0.0000100 epoch_Time:24441.0min: [2024-01-06 00:57:50,955][model8_pretrain.py][INFO] Epoch:[0/2](747700/4588595) loss:2.354 lr:0.0000100 epoch_Time:24441.0min: [2024-01-06 00:57:50,955][model8_pretrain.py][INFO] Epoch:[0/2](747700/4588595) loss:2.737 lr:0.0000100 epoch_Time:24441.0min: [2024-01-06 00:58:27,905][model8_pretrain.py][INFO] Epoch:[0/2](747800/4588595) loss:2.523 lr:0.0000100 epoch_Time:24441.0min: [2024-01-06 00:58:27,905][model8_pretrain.py][INFO] Epoch:[0/2](747800/4588595) loss:2.789 lr:0.0000100 
epoch_Time:24441.0min: [2024-01-06 00:58:27,905][model8_pretrain.py][INFO] Epoch:[0/2](747800/4588595) loss:3.143 lr:0.0000100 epoch_Time:24441.0min: [2024-01-06 00:58:27,905][model8_pretrain.py][INFO] Epoch:[0/2](747800/4588595) loss:3.044 lr:0.0000100 epoch_Time:24441.0min: [2024-01-06 00:58:27,905][model8_pretrain.py][INFO] Epoch:[0/2](747800/4588595) loss:1.970 lr:0.0000100 epoch_Time:24441.0min: [2024-01-06 00:58:27,906][model8_pretrain.py][INFO] Epoch:[0/2](747800/4588595) loss:3.538 lr:0.0000100 epoch_Time:24441.0min: [2024-01-06 00:58:27,906][model8_pretrain.py][INFO] Epoch:[0/2](747800/4588595) loss:2.496 lr:0.0000100 epoch_Time:24441.0min: [2024-01-06 00:58:27,906][model8_pretrain.py][INFO] Epoch:[0/2](747800/4588595) loss:2.702 lr:0.0000100 epoch_Time:24441.0min: [2024-01-06 00:59:04,841][model8_pretrain.py][INFO] Epoch:[0/2](747900/4588595) loss:3.021 lr:0.0000100 epoch_Time:24440.0min: [2024-01-06 00:59:04,842][model8_pretrain.py][INFO] Epoch:[0/2](747900/4588595) loss:2.645 lr:0.0000100 epoch_Time:24440.0min: [2024-01-06 00:59:04,842][model8_pretrain.py][INFO] Epoch:[0/2](747900/4588595) loss:2.729 lr:0.0000100 epoch_Time:24440.0min: [2024-01-06 00:59:04,842][model8_pretrain.py][INFO] Epoch:[0/2](747900/4588595) loss:2.894 lr:0.0000100 epoch_Time:24440.0min: [2024-01-06 00:59:04,842][model8_pretrain.py][INFO] Epoch:[0/2](747900/4588595) loss:3.115 lr:0.0000100 epoch_Time:24440.0min: [2024-01-06 00:59:04,842][model8_pretrain.py][INFO] Epoch:[0/2](747900/4588595) loss:2.908 lr:0.0000100 epoch_Time:24440.0min: [2024-01-06 00:59:04,842][model8_pretrain.py][INFO] Epoch:[0/2](747900/4588595) loss:2.961 lr:0.0000100 epoch_Time:24440.0min: [2024-01-06 00:59:04,842][model8_pretrain.py][INFO] Epoch:[0/2](747900/4588595) loss:3.177 lr:0.0000100 epoch_Time:24440.0min: [2024-01-06 00:59:41,784][model8_pretrain.py][INFO] Epoch:[0/2](748000/4588595) loss:2.549 lr:0.0000100 epoch_Time:24440.0min: [2024-01-06 00:59:41,784][model8_pretrain.py][INFO] 
Epoch:[0/2](748000/4588595) loss:3.096 lr:0.0000100 epoch_Time:24440.0min: [2024-01-06 00:59:41,784][model8_pretrain.py][INFO] Epoch:[0/2](748000/4588595) loss:2.837 lr:0.0000100 epoch_Time:24440.0min: [2024-01-06 00:59:41,784][model8_pretrain.py][INFO] Epoch:[0/2](748000/4588595) loss:2.513 lr:0.0000100 epoch_Time:24440.0min: [2024-01-06 00:59:41,784][model8_pretrain.py][INFO] Epoch:[0/2](748000/4588595) loss:2.653 lr:0.0000100 epoch_Time:24440.0min: [2024-01-06 00:59:41,784][model8_pretrain.py][INFO] Epoch:[0/2](748000/4588595) loss:2.684 lr:0.0000100 epoch_Time:24440.0min: [2024-01-06 00:59:41,784][model8_pretrain.py][INFO] Epoch:[0/2](748000/4588595) loss:3.096 lr:0.0000100 epoch_Time:24440.0min: [2024-01-06 00:59:41,784][model8_pretrain.py][INFO] Epoch:[0/2](748000/4588595) loss:2.866 lr:0.0000100 epoch_Time:24440.0min: [2024-01-06 01:00:29,192][model8_pretrain.py][INFO] Epoch:[0/2](748100/4588595) loss:2.581 lr:0.0000100 epoch_Time:24440.0min: [2024-01-06 01:00:29,192][model8_pretrain.py][INFO] Epoch:[0/2](748100/4588595) loss:3.062 lr:0.0000100 epoch_Time:24440.0min: [2024-01-06 01:00:29,192][model8_pretrain.py][INFO] Epoch:[0/2](748100/4588595) loss:2.723 lr:0.0000100 epoch_Time:24440.0min: [2024-01-06 01:00:29,192][model8_pretrain.py][INFO] Epoch:[0/2](748100/4588595) loss:3.312 lr:0.0000100 epoch_Time:24440.0min: [2024-01-06 01:00:29,193][model8_pretrain.py][INFO] Epoch:[0/2](748100/4588595) loss:2.858 lr:0.0000100 epoch_Time:24440.0min: [2024-01-06 01:00:29,193][model8_pretrain.py][INFO] Epoch:[0/2](748100/4588595) loss:2.665 lr:0.0000100 epoch_Time:24440.0min: [2024-01-06 01:00:29,193][model8_pretrain.py][INFO] Epoch:[0/2](748100/4588595) loss:3.195 lr:0.0000100 epoch_Time:24440.0min: [2024-01-06 01:00:29,193][model8_pretrain.py][INFO] Epoch:[0/2](748100/4588595) loss:3.010 lr:0.0000100 epoch_Time:24440.0min: [2024-01-06 01:01:06,126][model8_pretrain.py][INFO] Epoch:[0/2](748200/4588595) loss:2.951 lr:0.0000100 epoch_Time:24439.0min: [2024-01-06 
01:01:06,126][model8_pretrain.py][INFO] Epoch:[0/2](748200/4588595) loss:2.615 lr:0.0000100 epoch_Time:24439.0min: [2024-01-06 01:01:06,126][model8_pretrain.py][INFO] Epoch:[0/2](748200/4588595) loss:2.567 lr:0.0000100 epoch_Time:24439.0min: [2024-01-06 01:01:06,126][model8_pretrain.py][INFO] Epoch:[0/2](748200/4588595) loss:2.536 lr:0.0000100 epoch_Time:24439.0min: [2024-01-06 01:01:06,126][model8_pretrain.py][INFO] Epoch:[0/2](748200/4588595) loss:2.517 lr:0.0000100 epoch_Time:24439.0min: [2024-01-06 01:01:06,126][model8_pretrain.py][INFO] Epoch:[0/2](748200/4588595) loss:2.728 lr:0.0000100 epoch_Time:24439.0min: [2024-01-06 01:01:06,127][model8_pretrain.py][INFO] Epoch:[0/2](748200/4588595) loss:3.190 lr:0.0000100 epoch_Time:24439.0min: [2024-01-06 01:01:06,127][model8_pretrain.py][INFO] Epoch:[0/2](748200/4588595) loss:3.285 lr:0.0000100 epoch_Time:24439.0min: [2024-01-06 01:01:43,068][model8_pretrain.py][INFO] Epoch:[0/2](748300/4588595) loss:2.466 lr:0.0000100 epoch_Time:24439.0min: [2024-01-06 01:01:43,068][model8_pretrain.py][INFO] Epoch:[0/2](748300/4588595) loss:2.622 lr:0.0000100 epoch_Time:24439.0min: [2024-01-06 01:01:43,068][model8_pretrain.py][INFO] Epoch:[0/2](748300/4588595) loss:2.825 lr:0.0000100 epoch_Time:24439.0min: [2024-01-06 01:01:43,068][model8_pretrain.py][INFO] Epoch:[0/2](748300/4588595) loss:2.540 lr:0.0000100 epoch_Time:24439.0min: [2024-01-06 01:01:43,068][model8_pretrain.py][INFO] Epoch:[0/2](748300/4588595) loss:2.807 lr:0.0000100 epoch_Time:24439.0min: [2024-01-06 01:01:43,068][model8_pretrain.py][INFO] Epoch:[0/2](748300/4588595) loss:3.300 lr:0.0000100 epoch_Time:24439.0min: [2024-01-06 01:01:43,068][model8_pretrain.py][INFO] Epoch:[0/2](748300/4588595) loss:3.194 lr:0.0000100 epoch_Time:24439.0min: [2024-01-06 01:01:43,069][model8_pretrain.py][INFO] Epoch:[0/2](748300/4588595) loss:2.598 lr:0.0000100 epoch_Time:24439.0min: [2024-01-06 01:02:20,002][model8_pretrain.py][INFO] Epoch:[0/2](748400/4588595) loss:2.333 lr:0.0000100 
epoch_Time:24438.0min: [2024-01-06 01:02:20,002][model8_pretrain.py][INFO] Epoch:[0/2](748400/4588595) loss:3.067 lr:0.0000100 epoch_Time:24438.0min: [2024-01-06 01:02:20,002][model8_pretrain.py][INFO] Epoch:[0/2](748400/4588595) loss:3.195 lr:0.0000100 epoch_Time:24438.0min: [2024-01-06 01:02:20,002][model8_pretrain.py][INFO] Epoch:[0/2](748400/4588595) loss:2.741 lr:0.0000100 epoch_Time:24438.0min: [2024-01-06 01:02:20,002][model8_pretrain.py][INFO] Epoch:[0/2](748400/4588595) loss:3.124 lr:0.0000100 epoch_Time:24438.0min: [2024-01-06 01:02:20,002][model8_pretrain.py][INFO] Epoch:[0/2](748400/4588595) loss:3.225 lr:0.0000100 epoch_Time:24438.0min: [2024-01-06 01:02:20,002][model8_pretrain.py][INFO] Epoch:[0/2](748400/4588595) loss:2.828 lr:0.0000100 epoch_Time:24438.0min: [2024-01-06 01:02:20,002][model8_pretrain.py][INFO] Epoch:[0/2](748400/4588595) loss:2.638 lr:0.0000100 epoch_Time:24438.0min: [2024-01-06 01:02:56,940][model8_pretrain.py][INFO] Epoch:[0/2](748500/4588595) loss:3.039 lr:0.0000100 epoch_Time:24436.0min: [2024-01-06 01:02:56,940][model8_pretrain.py][INFO] Epoch:[0/2](748500/4588595) loss:2.588 lr:0.0000100 epoch_Time:24436.0min: [2024-01-06 01:02:56,940][model8_pretrain.py][INFO] Epoch:[0/2](748500/4588595) loss:2.350 lr:0.0000100 epoch_Time:24436.0min: [2024-01-06 01:02:56,940][model8_pretrain.py][INFO] Epoch:[0/2](748500/4588595) loss:2.697 lr:0.0000100 epoch_Time:24436.0min: [2024-01-06 01:02:56,940][model8_pretrain.py][INFO] Epoch:[0/2](748500/4588595) loss:2.906 lr:0.0000100 epoch_Time:24436.0min: [2024-01-06 01:02:56,940][model8_pretrain.py][INFO] Epoch:[0/2](748500/4588595) loss:2.426 lr:0.0000100 epoch_Time:24436.0min: [2024-01-06 01:02:56,940][model8_pretrain.py][INFO] Epoch:[0/2](748500/4588595) loss:2.438 lr:0.0000100 epoch_Time:24436.0min: [2024-01-06 01:02:56,940][model8_pretrain.py][INFO] Epoch:[0/2](748500/4588595) loss:2.563 lr:0.0000100 epoch_Time:24436.0min: [2024-01-06 01:03:33,880][model8_pretrain.py][INFO] 
Epoch:[0/2](748600/4588595) loss:2.608 lr:0.0000100 epoch_Time:24436.0min: [2024-01-06 01:03:33,880][model8_pretrain.py][INFO] Epoch:[0/2](748600/4588595) loss:3.299 lr:0.0000100 epoch_Time:24436.0min: [2024-01-06 01:03:33,880][model8_pretrain.py][INFO] Epoch:[0/2](748600/4588595) loss:3.273 lr:0.0000100 epoch_Time:24436.0min: [2024-01-06 01:03:33,880][model8_pretrain.py][INFO] Epoch:[0/2](748600/4588595) loss:2.918 lr:0.0000100 epoch_Time:24436.0min: [2024-01-06 01:03:33,880][model8_pretrain.py][INFO] Epoch:[0/2](748600/4588595) loss:3.304 lr:0.0000100 epoch_Time:24436.0min: [2024-01-06 01:03:33,880][model8_pretrain.py][INFO] Epoch:[0/2](748600/4588595) loss:2.677 lr:0.0000100 epoch_Time:24436.0min: [2024-01-06 01:03:33,880][model8_pretrain.py][INFO] Epoch:[0/2](748600/4588595) loss:3.117 lr:0.0000100 epoch_Time:24436.0min: [2024-01-06 01:03:33,880][model8_pretrain.py][INFO] Epoch:[0/2](748600/4588595) loss:3.031 lr:0.0000100 epoch_Time:24436.0min: [2024-01-06 01:04:10,818][model8_pretrain.py][INFO] Epoch:[0/2](748700/4588595) loss:3.083 lr:0.0000100 epoch_Time:24435.0min: [2024-01-06 01:04:10,818][model8_pretrain.py][INFO] Epoch:[0/2](748700/4588595) loss:3.054 lr:0.0000100 epoch_Time:24435.0min: [2024-01-06 01:04:10,819][model8_pretrain.py][INFO] Epoch:[0/2](748700/4588595) loss:3.011 lr:0.0000100 epoch_Time:24435.0min: [2024-01-06 01:04:10,819][model8_pretrain.py][INFO] Epoch:[0/2](748700/4588595) loss:2.949 lr:0.0000100 epoch_Time:24435.0min: [2024-01-06 01:04:10,819][model8_pretrain.py][INFO] Epoch:[0/2](748700/4588595) loss:2.500 lr:0.0000100 epoch_Time:24435.0min: [2024-01-06 01:04:10,819][model8_pretrain.py][INFO] Epoch:[0/2](748700/4588595) loss:2.619 lr:0.0000100 epoch_Time:24435.0min: [2024-01-06 01:04:10,819][model8_pretrain.py][INFO] Epoch:[0/2](748700/4588595) loss:2.542 lr:0.0000100 epoch_Time:24435.0min: [2024-01-06 01:04:10,819][model8_pretrain.py][INFO] Epoch:[0/2](748700/4588595) loss:3.198 lr:0.0000100 epoch_Time:24435.0min: [2024-01-06 
01:04:47,763][model8_pretrain.py][INFO] Epoch:[0/2](748800/4588595) loss:2.957 lr:0.0000100 epoch_Time:24434.0min: [2024-01-06 01:04:47,763][model8_pretrain.py][INFO] Epoch:[0/2](748800/4588595) loss:2.973 lr:0.0000100 epoch_Time:24434.0min: [2024-01-06 01:04:47,763][model8_pretrain.py][INFO] Epoch:[0/2](748800/4588595) loss:2.372 lr:0.0000100 epoch_Time:24434.0min: [2024-01-06 01:04:47,763][model8_pretrain.py][INFO] Epoch:[0/2](748800/4588595) loss:2.414 lr:0.0000100 epoch_Time:24434.0min: [2024-01-06 01:04:47,763][model8_pretrain.py][INFO] Epoch:[0/2](748800/4588595) loss:2.903 lr:0.0000100 epoch_Time:24434.0min: [2024-01-06 01:04:47,763][model8_pretrain.py][INFO] Epoch:[0/2](748800/4588595) loss:2.507 lr:0.0000100 epoch_Time:24434.0min: [2024-01-06 01:04:47,763][model8_pretrain.py][INFO] Epoch:[0/2](748800/4588595) loss:3.040 lr:0.0000100 epoch_Time:24434.0min: [2024-01-06 01:04:47,763][model8_pretrain.py][INFO] Epoch:[0/2](748800/4588595) loss:2.183 lr:0.0000100 epoch_Time:24434.0min: [2024-01-06 01:05:34,907][model8_pretrain.py][INFO] Epoch:[0/2](748900/4588595) loss:3.579 lr:0.0000100 epoch_Time:24435.0min: [2024-01-06 01:05:34,907][model8_pretrain.py][INFO] Epoch:[0/2](748900/4588595) loss:3.376 lr:0.0000100 epoch_Time:24435.0min: [2024-01-06 01:05:34,907][model8_pretrain.py][INFO] Epoch:[0/2](748900/4588595) loss:2.943 lr:0.0000100 epoch_Time:24435.0min: [2024-01-06 01:05:34,907][model8_pretrain.py][INFO] Epoch:[0/2](748900/4588595) loss:2.545 lr:0.0000100 epoch_Time:24435.0min: [2024-01-06 01:05:34,907][model8_pretrain.py][INFO] Epoch:[0/2](748900/4588595) loss:2.751 lr:0.0000100 epoch_Time:24435.0min: [2024-01-06 01:05:34,907][model8_pretrain.py][INFO] Epoch:[0/2](748900/4588595) loss:2.932 lr:0.0000100 epoch_Time:24435.0min: [2024-01-06 01:05:34,907][model8_pretrain.py][INFO] Epoch:[0/2](748900/4588595) loss:2.810 lr:0.0000100 epoch_Time:24435.0min: [2024-01-06 01:05:34,907][model8_pretrain.py][INFO] Epoch:[0/2](748900/4588595) loss:2.986 lr:0.0000100 
epoch_Time:24435.0min: [2024-01-06 01:06:11,845][model8_pretrain.py][INFO] Epoch:[0/2](749000/4588595) loss:2.929 lr:0.0000100 epoch_Time:24434.0min: [2024-01-06 01:06:11,846][model8_pretrain.py][INFO] Epoch:[0/2](749000/4588595) loss:2.468 lr:0.0000100 epoch_Time:24434.0min: [2024-01-06 01:06:11,846][model8_pretrain.py][INFO] Epoch:[0/2](749000/4588595) loss:2.751 lr:0.0000100 epoch_Time:24434.0min: [2024-01-06 01:06:11,846][model8_pretrain.py][INFO] Epoch:[0/2](749000/4588595) loss:3.016 lr:0.0000100 epoch_Time:24434.0min: [2024-01-06 01:06:11,846][model8_pretrain.py][INFO] Epoch:[0/2](749000/4588595) loss:2.728 lr:0.0000100 epoch_Time:24434.0min: [2024-01-06 01:06:11,846][model8_pretrain.py][INFO] Epoch:[0/2](749000/4588595) loss:2.619 lr:0.0000100 epoch_Time:24434.0min: [2024-01-06 01:06:11,846][model8_pretrain.py][INFO] Epoch:[0/2](749000/4588595) loss:3.251 lr:0.0000100 epoch_Time:24434.0min: [2024-01-06 01:06:11,846][model8_pretrain.py][INFO] Epoch:[0/2](749000/4588595) loss:3.159 lr:0.0000100 epoch_Time:24434.0min: [2024-01-06 01:06:48,794][model8_pretrain.py][INFO] Epoch:[0/2](749100/4588595) loss:3.248 lr:0.0000100 epoch_Time:24433.0min: [2024-01-06 01:06:48,794][model8_pretrain.py][INFO] Epoch:[0/2](749100/4588595) loss:2.314 lr:0.0000100 epoch_Time:24433.0min: [2024-01-06 01:06:48,794][model8_pretrain.py][INFO] Epoch:[0/2](749100/4588595) loss:2.672 lr:0.0000100 epoch_Time:24433.0min: [2024-01-06 01:06:48,794][model8_pretrain.py][INFO] Epoch:[0/2](749100/4588595) loss:2.648 lr:0.0000100 epoch_Time:24433.0min: [2024-01-06 01:06:48,794][model8_pretrain.py][INFO] Epoch:[0/2](749100/4588595) loss:3.077 lr:0.0000100 epoch_Time:24433.0min: [2024-01-06 01:06:48,794][model8_pretrain.py][INFO] Epoch:[0/2](749100/4588595) loss:2.877 lr:0.0000100 epoch_Time:24433.0min: [2024-01-06 01:06:48,794][model8_pretrain.py][INFO] Epoch:[0/2](749100/4588595) loss:3.183 lr:0.0000100 epoch_Time:24433.0min: [2024-01-06 01:06:48,795][model8_pretrain.py][INFO] 
Epoch:[0/2](749100/4588595) loss:2.838 lr:0.0000100 epoch_Time:24433.0min: [2024-01-06 01:07:25,736][model8_pretrain.py][INFO] Epoch:[0/2](749200/4588595) loss:2.309 lr:0.0000100 epoch_Time:24433.0min: [2024-01-06 01:07:25,736][model8_pretrain.py][INFO] Epoch:[0/2](749200/4588595) loss:2.681 lr:0.0000100 epoch_Time:24433.0min: [2024-01-06 01:07:25,736][model8_pretrain.py][INFO] Epoch:[0/2](749200/4588595) loss:3.159 lr:0.0000100 epoch_Time:24433.0min: [2024-01-06 01:07:25,736][model8_pretrain.py][INFO] Epoch:[0/2](749200/4588595) loss:1.987 lr:0.0000100 epoch_Time:24433.0min: [2024-01-06 01:07:25,736][model8_pretrain.py][INFO] Epoch:[0/2](749200/4588595) loss:2.663 lr:0.0000100 epoch_Time:24433.0min: [2024-01-06 01:07:25,736][model8_pretrain.py][INFO] Epoch:[0/2](749200/4588595) loss:2.978 lr:0.0000100 epoch_Time:24433.0min: [2024-01-06 01:07:25,736][model8_pretrain.py][INFO] Epoch:[0/2](749200/4588595) loss:2.734 lr:0.0000100 epoch_Time:24433.0min: [2024-01-06 01:07:25,736][model8_pretrain.py][INFO] Epoch:[0/2](749200/4588595) loss:2.406 lr:0.0000100 epoch_Time:24433.0min: [2024-01-06 01:08:02,682][model8_pretrain.py][INFO] Epoch:[0/2](749300/4588595) loss:3.292 lr:0.0000100 epoch_Time:24431.0min: [2024-01-06 01:08:02,682][model8_pretrain.py][INFO] Epoch:[0/2](749300/4588595) loss:2.699 lr:0.0000100 epoch_Time:24431.0min: [2024-01-06 01:08:02,682][model8_pretrain.py][INFO] Epoch:[0/2](749300/4588595) loss:3.484 lr:0.0000100 epoch_Time:24431.0min: [2024-01-06 01:08:02,682][model8_pretrain.py][INFO] Epoch:[0/2](749300/4588595) loss:2.636 lr:0.0000100 epoch_Time:24431.0min: [2024-01-06 01:08:02,683][model8_pretrain.py][INFO] Epoch:[0/2](749300/4588595) loss:2.440 lr:0.0000100 epoch_Time:24431.0min: [2024-01-06 01:08:02,683][model8_pretrain.py][INFO] Epoch:[0/2](749300/4588595) loss:2.109 lr:0.0000100 epoch_Time:24431.0min: [2024-01-06 01:08:02,683][model8_pretrain.py][INFO] Epoch:[0/2](749300/4588595) loss:2.837 lr:0.0000100 epoch_Time:24431.0min: [2024-01-06 
01:08:02,683][model8_pretrain.py][INFO] Epoch:[0/2](749300/4588595) loss:2.015 lr:0.0000100 epoch_Time:24431.0min: [2024-01-06 01:08:39,630][model8_pretrain.py][INFO] Epoch:[0/2](749400/4588595) loss:3.115 lr:0.0000100 epoch_Time:24431.0min: [2024-01-06 01:08:39,630][model8_pretrain.py][INFO] Epoch:[0/2](749400/4588595) loss:2.973 lr:0.0000100 epoch_Time:24431.0min: [2024-01-06 01:08:39,630][model8_pretrain.py][INFO] Epoch:[0/2](749400/4588595) loss:2.771 lr:0.0000100 epoch_Time:24431.0min: [2024-01-06 01:08:39,630][model8_pretrain.py][INFO] Epoch:[0/2](749400/4588595) loss:2.171 lr:0.0000100 epoch_Time:24431.0min: [2024-01-06 01:08:39,630][model8_pretrain.py][INFO] Epoch:[0/2](749400/4588595) loss:2.456 lr:0.0000100 epoch_Time:24431.0min: [2024-01-06 01:08:39,630][model8_pretrain.py][INFO] Epoch:[0/2](749400/4588595) loss:2.635 lr:0.0000100 epoch_Time:24431.0min: [2024-01-06 01:08:39,630][model8_pretrain.py][INFO] Epoch:[0/2](749400/4588595) loss:3.451 lr:0.0000100 epoch_Time:24431.0min: [2024-01-06 01:08:39,630][model8_pretrain.py][INFO] Epoch:[0/2](749400/4588595) loss:2.915 lr:0.0000100 epoch_Time:24431.0min: [2024-01-06 01:09:16,578][model8_pretrain.py][INFO] Epoch:[0/2](749500/4588595) loss:2.810 lr:0.0000100 epoch_Time:24430.0min: [2024-01-06 01:09:16,578][model8_pretrain.py][INFO] Epoch:[0/2](749500/4588595) loss:2.283 lr:0.0000100 epoch_Time:24430.0min: [2024-01-06 01:09:16,578][model8_pretrain.py][INFO] Epoch:[0/2](749500/4588595) loss:2.488 lr:0.0000100 epoch_Time:24430.0min: [2024-01-06 01:09:16,578][model8_pretrain.py][INFO] Epoch:[0/2](749500/4588595) loss:2.870 lr:0.0000100 epoch_Time:24430.0min: [2024-01-06 01:09:16,578][model8_pretrain.py][INFO] Epoch:[0/2](749500/4588595) loss:2.847 lr:0.0000100 epoch_Time:24430.0min: [2024-01-06 01:09:16,578][model8_pretrain.py][INFO] Epoch:[0/2](749500/4588595) loss:2.462 lr:0.0000100 epoch_Time:24430.0min: [2024-01-06 01:09:16,579][model8_pretrain.py][INFO] Epoch:[0/2](749500/4588595) loss:2.997 lr:0.0000100 
epoch_Time:24430.0min: [2024-01-06 01:09:16,579][model8_pretrain.py][INFO] Epoch:[0/2](749500/4588595) loss:2.688 lr:0.0000100 epoch_Time:24430.0min: [2024-01-06 01:09:53,534][model8_pretrain.py][INFO] Epoch:[0/2](749600/4588595) loss:2.427 lr:0.0000100 epoch_Time:24429.0min: [2024-01-06 01:09:53,534][model8_pretrain.py][INFO] Epoch:[0/2](749600/4588595) loss:2.241 lr:0.0000100 epoch_Time:24429.0min: [2024-01-06 01:09:53,534][model8_pretrain.py][INFO] Epoch:[0/2](749600/4588595) loss:2.216 lr:0.0000100 epoch_Time:24429.0min: [2024-01-06 01:09:53,534][model8_pretrain.py][INFO] Epoch:[0/2](749600/4588595) loss:2.735 lr:0.0000100 epoch_Time:24429.0min: [2024-01-06 01:09:53,534][model8_pretrain.py][INFO] Epoch:[0/2](749600/4588595) loss:2.668 lr:0.0000100 epoch_Time:24429.0min: [2024-01-06 01:09:53,534][model8_pretrain.py][INFO] Epoch:[0/2](749600/4588595) loss:2.502 lr:0.0000100 epoch_Time:24429.0min: [2024-01-06 01:09:53,534][model8_pretrain.py][INFO] Epoch:[0/2](749600/4588595) loss:2.503 lr:0.0000100 epoch_Time:24429.0min: [2024-01-06 01:09:53,535][model8_pretrain.py][INFO] Epoch:[0/2](749600/4588595) loss:3.030 lr:0.0000100 epoch_Time:24429.0min: [2024-01-06 01:10:40,568][model8_pretrain.py][INFO] Epoch:[0/2](749700/4588595) loss:3.351 lr:0.0000100 epoch_Time:24430.0min: [2024-01-06 01:10:40,568][model8_pretrain.py][INFO] Epoch:[0/2](749700/4588595) loss:2.929 lr:0.0000100 epoch_Time:24430.0min: [2024-01-06 01:10:40,568][model8_pretrain.py][INFO] Epoch:[0/2](749700/4588595) loss:2.712 lr:0.0000100 epoch_Time:24430.0min: [2024-01-06 01:10:40,568][model8_pretrain.py][INFO] Epoch:[0/2](749700/4588595) loss:2.907 lr:0.0000100 epoch_Time:24430.0min: [2024-01-06 01:10:40,568][model8_pretrain.py][INFO] Epoch:[0/2](749700/4588595) loss:2.848 lr:0.0000100 epoch_Time:24430.0min: [2024-01-06 01:10:40,568][model8_pretrain.py][INFO] Epoch:[0/2](749700/4588595) loss:2.968 lr:0.0000100 epoch_Time:24430.0min: [2024-01-06 01:10:40,568][model8_pretrain.py][INFO] 
Epoch:[0/2](749700/4588595) loss:2.533 lr:0.0000100 epoch_Time:24430.0min: [2024-01-06 01:10:40,569][model8_pretrain.py][INFO] Epoch:[0/2](749700/4588595) loss:2.066 lr:0.0000100 epoch_Time:24430.0min: [2024-01-06 01:11:17,486][model8_pretrain.py][INFO] Epoch:[0/2](749800/4588595) loss:2.941 lr:0.0000100 epoch_Time:24429.0min: [2024-01-06 01:11:17,486][model8_pretrain.py][INFO] Epoch:[0/2](749800/4588595) loss:2.827 lr:0.0000100 epoch_Time:24429.0min: [2024-01-06 01:11:17,486][model8_pretrain.py][INFO] Epoch:[0/2](749800/4588595) loss:2.912 lr:0.0000100 epoch_Time:24429.0min: [2024-01-06 01:11:17,486][model8_pretrain.py][INFO] Epoch:[0/2](749800/4588595) loss:2.999 lr:0.0000100 epoch_Time:24429.0min: [2024-01-06 01:11:17,486][model8_pretrain.py][INFO] Epoch:[0/2](749800/4588595) loss:2.072 lr:0.0000100 epoch_Time:24429.0min: [2024-01-06 01:11:17,486][model8_pretrain.py][INFO] Epoch:[0/2](749800/4588595) loss:2.815 lr:0.0000100 epoch_Time:24429.0min: [2024-01-06 01:11:17,486][model8_pretrain.py][INFO] Epoch:[0/2](749800/4588595) loss:1.910 lr:0.0000100 epoch_Time:24429.0min: [2024-01-06 01:11:17,486][model8_pretrain.py][INFO] Epoch:[0/2](749800/4588595) loss:3.064 lr:0.0000100 epoch_Time:24429.0min: [2024-01-06 01:11:54,413][model8_pretrain.py][INFO] Epoch:[0/2](749900/4588595) loss:2.956 lr:0.0000100 epoch_Time:24428.0min: [2024-01-06 01:11:54,413][model8_pretrain.py][INFO] Epoch:[0/2](749900/4588595) loss:2.839 lr:0.0000100 epoch_Time:24428.0min: [2024-01-06 01:11:54,413][model8_pretrain.py][INFO] Epoch:[0/2](749900/4588595) loss:2.919 lr:0.0000100 epoch_Time:24428.0min: [2024-01-06 01:11:54,413][model8_pretrain.py][INFO] Epoch:[0/2](749900/4588595) loss:2.452 lr:0.0000100 epoch_Time:24428.0min: [2024-01-06 01:11:54,414][model8_pretrain.py][INFO] Epoch:[0/2](749900/4588595) loss:2.577 lr:0.0000100 epoch_Time:24428.0min: [2024-01-06 01:11:54,414][model8_pretrain.py][INFO] Epoch:[0/2](749900/4588595) loss:2.761 lr:0.0000100 epoch_Time:24428.0min: [2024-01-06 
01:11:54,414][model8_pretrain.py][INFO] Epoch:[0/2](749900/4588595) loss:2.695 lr:0.0000100 epoch_Time:24428.0min: [2024-01-06 01:11:54,414][model8_pretrain.py][INFO] Epoch:[0/2](749900/4588595) loss:3.094 lr:0.0000100 epoch_Time:24428.0min: [2024-01-06 01:12:31,356][model8_pretrain.py][INFO] Epoch:[0/2](750000/4588595) loss:2.677 lr:0.0000100 epoch_Time:24428.0min: [2024-01-06 01:12:31,356][model8_pretrain.py][INFO] Epoch:[0/2](750000/4588595) loss:2.935 lr:0.0000100 epoch_Time:24428.0min: [2024-01-06 01:12:31,356][model8_pretrain.py][INFO] Epoch:[0/2](750000/4588595) loss:2.366 lr:0.0000100 epoch_Time:24428.0min: [2024-01-06 01:12:31,356][model8_pretrain.py][INFO] Epoch:[0/2](750000/4588595) loss:2.893 lr:0.0000100 epoch_Time:24428.0min: [2024-01-06 01:12:31,356][model8_pretrain.py][INFO] Epoch:[0/2](750000/4588595) loss:2.964 lr:0.0000100 epoch_Time:24428.0min: [2024-01-06 01:12:31,356][model8_pretrain.py][INFO] Epoch:[0/2](750000/4588595) loss:3.311 lr:0.0000100 epoch_Time:24428.0min: [2024-01-06 01:12:31,356][model8_pretrain.py][INFO] Epoch:[0/2](750000/4588595) loss:2.708 lr:0.0000100 epoch_Time:24428.0min: [2024-01-06 01:12:31,356][model8_pretrain.py][INFO] Epoch:[0/2](750000/4588595) loss:3.123 lr:0.0000100 epoch_Time:24428.0min: [2024-01-06 01:13:08,297][model8_pretrain.py][INFO] Epoch:[0/2](750100/4588595) loss:3.090 lr:0.0000100 epoch_Time:24426.0min: [2024-01-06 01:13:08,297][model8_pretrain.py][INFO] Epoch:[0/2](750100/4588595) loss:2.623 lr:0.0000100 epoch_Time:24426.0min: [2024-01-06 01:13:08,297][model8_pretrain.py][INFO] Epoch:[0/2](750100/4588595) loss:3.133 lr:0.0000100 epoch_Time:24426.0min: [2024-01-06 01:13:08,297][model8_pretrain.py][INFO] Epoch:[0/2](750100/4588595) loss:2.767 lr:0.0000100 epoch_Time:24426.0min: [2024-01-06 01:13:08,297][model8_pretrain.py][INFO] Epoch:[0/2](750100/4588595) loss:2.868 lr:0.0000100 epoch_Time:24427.0min: [2024-01-06 01:13:08,297][model8_pretrain.py][INFO] Epoch:[0/2](750100/4588595) loss:2.349 lr:0.0000100 
epoch_Time:24426.0min: [2024-01-06 01:13:08,297][model8_pretrain.py][INFO] Epoch:[0/2](750100/4588595) loss:3.239 lr:0.0000100 epoch_Time:24426.0min: [2024-01-06 01:13:08,298][model8_pretrain.py][INFO] Epoch:[0/2](750100/4588595) loss:2.503 lr:0.0000100 epoch_Time:24426.0min: [2024-01-06 01:13:45,243][model8_pretrain.py][INFO] Epoch:[0/2](750200/4588595) loss:2.953 lr:0.0000100 epoch_Time:24426.0min: [2024-01-06 01:13:45,243][model8_pretrain.py][INFO] Epoch:[0/2](750200/4588595) loss:2.433 lr:0.0000100 epoch_Time:24426.0min: [2024-01-06 01:13:45,243][model8_pretrain.py][INFO] Epoch:[0/2](750200/4588595) loss:2.884 lr:0.0000100 epoch_Time:24426.0min: [2024-01-06 01:13:45,243][model8_pretrain.py][INFO] Epoch:[0/2](750200/4588595) loss:2.484 lr:0.0000100 epoch_Time:24426.0min: [2024-01-06 01:13:45,243][model8_pretrain.py][INFO] Epoch:[0/2](750200/4588595) loss:2.813 lr:0.0000100 epoch_Time:24426.0min: [2024-01-06 01:13:45,243][model8_pretrain.py][INFO] Epoch:[0/2](750200/4588595) loss:2.954 lr:0.0000100 epoch_Time:24426.0min: [2024-01-06 01:13:45,244][model8_pretrain.py][INFO] Epoch:[0/2](750200/4588595) loss:2.946 lr:0.0000100 epoch_Time:24426.0min: [2024-01-06 01:13:45,244][model8_pretrain.py][INFO] Epoch:[0/2](750200/4588595) loss:3.306 lr:0.0000100 epoch_Time:24426.0min: [2024-01-06 01:14:22,240][model8_pretrain.py][INFO] Epoch:[0/2](750300/4588595) loss:2.761 lr:0.0000100 epoch_Time:24425.0min: [2024-01-06 01:14:22,241][model8_pretrain.py][INFO] Epoch:[0/2](750300/4588595) loss:2.839 lr:0.0000100 epoch_Time:24425.0min: [2024-01-06 01:14:22,241][model8_pretrain.py][INFO] Epoch:[0/2](750300/4588595) loss:2.883 lr:0.0000100 epoch_Time:24425.0min: [2024-01-06 01:14:22,241][model8_pretrain.py][INFO] Epoch:[0/2](750300/4588595) loss:2.592 lr:0.0000100 epoch_Time:24425.0min: [2024-01-06 01:14:22,241][model8_pretrain.py][INFO] Epoch:[0/2](750300/4588595) loss:3.105 lr:0.0000100 epoch_Time:24425.0min: [2024-01-06 01:14:22,241][model8_pretrain.py][INFO] 
Epoch:[0/2](750300/4588595) loss:3.076 lr:0.0000100 epoch_Time:24425.0min: [2024-01-06 01:14:22,241][model8_pretrain.py][INFO] Epoch:[0/2](750300/4588595) loss:2.395 lr:0.0000100 epoch_Time:24425.0min: [2024-01-06 01:14:22,241][model8_pretrain.py][INFO] Epoch:[0/2](750300/4588595) loss:3.146 lr:0.0000100 epoch_Time:24425.0min: [2024-01-06 01:14:59,189][model8_pretrain.py][INFO] Epoch:[0/2](750400/4588595) loss:2.472 lr:0.0000100 epoch_Time:24424.0min: [2024-01-06 01:14:59,189][model8_pretrain.py][INFO] Epoch:[0/2](750400/4588595) loss:2.184 lr:0.0000100 epoch_Time:24424.0min: [2024-01-06 01:14:59,189][model8_pretrain.py][INFO] Epoch:[0/2](750400/4588595) loss:2.784 lr:0.0000100 epoch_Time:24424.0min: [2024-01-06 01:14:59,189][model8_pretrain.py][INFO] Epoch:[0/2](750400/4588595) loss:2.846 lr:0.0000100 epoch_Time:24424.0min: [2024-01-06 01:14:59,189][model8_pretrain.py][INFO] Epoch:[0/2](750400/4588595) loss:2.868 lr:0.0000100 epoch_Time:24424.0min: [2024-01-06 01:14:59,189][model8_pretrain.py][INFO] Epoch:[0/2](750400/4588595) loss:2.259 lr:0.0000100 epoch_Time:24424.0min: [2024-01-06 01:14:59,190][model8_pretrain.py][INFO] Epoch:[0/2](750400/4588595) loss:2.084 lr:0.0000100 epoch_Time:24424.0min: [2024-01-06 01:14:59,190][model8_pretrain.py][INFO] Epoch:[0/2](750400/4588595) loss:2.462 lr:0.0000100 epoch_Time:24424.0min: [2024-01-06 01:15:44,803][model8_pretrain.py][INFO] Epoch:[0/2](750500/4588595) loss:3.051 lr:0.0000100 epoch_Time:24425.0min: [2024-01-06 01:15:44,803][model8_pretrain.py][INFO] Epoch:[0/2](750500/4588595) loss:2.708 lr:0.0000100 epoch_Time:24425.0min: [2024-01-06 01:15:44,803][model8_pretrain.py][INFO] Epoch:[0/2](750500/4588595) loss:3.108 lr:0.0000100 epoch_Time:24425.0min: [2024-01-06 01:15:44,803][model8_pretrain.py][INFO] Epoch:[0/2](750500/4588595) loss:3.218 lr:0.0000100 epoch_Time:24425.0min: [2024-01-06 01:15:44,803][model8_pretrain.py][INFO] Epoch:[0/2](750500/4588595) loss:2.146 lr:0.0000100 epoch_Time:24425.0min: [2024-01-06 
01:15:44,803][model8_pretrain.py][INFO] Epoch:[0/2](750500/4588595) loss:2.601 lr:0.0000100 epoch_Time:24425.0min: [2024-01-06 01:15:44,803][model8_pretrain.py][INFO] Epoch:[0/2](750500/4588595) loss:3.154 lr:0.0000100 epoch_Time:24425.0min: [2024-01-06 01:15:46,188][model8_pretrain.py][INFO] Epoch:[0/2](750500/4588595) loss:2.784 lr:0.0000100 epoch_Time:24425.0min: [2024-01-06 01:16:23,123][model8_pretrain.py][INFO] Epoch:[0/2](750600/4588595) loss:2.266 lr:0.0000100 epoch_Time:24424.0min: [2024-01-06 01:16:23,123][model8_pretrain.py][INFO] Epoch:[0/2](750600/4588595) loss:1.698 lr:0.0000100 epoch_Time:24424.0min: [2024-01-06 01:16:23,123][model8_pretrain.py][INFO] Epoch:[0/2](750600/4588595) loss:2.882 lr:0.0000100 epoch_Time:24424.0min: [2024-01-06 01:16:23,123][model8_pretrain.py][INFO] Epoch:[0/2](750600/4588595) loss:3.335 lr:0.0000100 epoch_Time:24424.0min: [2024-01-06 01:16:23,123][model8_pretrain.py][INFO] Epoch:[0/2](750600/4588595) loss:2.598 lr:0.0000100 epoch_Time:24424.0min: [2024-01-06 01:16:23,123][model8_pretrain.py][INFO] Epoch:[0/2](750600/4588595) loss:2.845 lr:0.0000100 epoch_Time:24424.0min: [2024-01-06 01:16:23,123][model8_pretrain.py][INFO] Epoch:[0/2](750600/4588595) loss:2.469 lr:0.0000100 epoch_Time:24424.0min: [2024-01-06 01:16:23,123][model8_pretrain.py][INFO] Epoch:[0/2](750600/4588595) loss:2.795 lr:0.0000100 epoch_Time:24424.0min: [2024-01-06 01:17:00,066][model8_pretrain.py][INFO] Epoch:[0/2](750700/4588595) loss:2.236 lr:0.0000100 epoch_Time:24423.0min: [2024-01-06 01:17:00,066][model8_pretrain.py][INFO] Epoch:[0/2](750700/4588595) loss:2.479 lr:0.0000100 epoch_Time:24423.0min: [2024-01-06 01:17:00,066][model8_pretrain.py][INFO] Epoch:[0/2](750700/4588595) loss:3.007 lr:0.0000100 epoch_Time:24423.0min: [2024-01-06 01:17:00,066][model8_pretrain.py][INFO] Epoch:[0/2](750700/4588595) loss:2.823 lr:0.0000100 epoch_Time:24423.0min: [2024-01-06 01:17:00,066][model8_pretrain.py][INFO] Epoch:[0/2](750700/4588595) loss:2.617 lr:0.0000100 
epoch_Time:24423.0min: [2024-01-06 01:17:00,066][model8_pretrain.py][INFO] Epoch:[0/2](750700/4588595) loss:3.233 lr:0.0000100 epoch_Time:24423.0min: [2024-01-06 01:17:00,066][model8_pretrain.py][INFO] Epoch:[0/2](750700/4588595) loss:3.119 lr:0.0000100 epoch_Time:24423.0min: [2024-01-06 01:17:00,066][model8_pretrain.py][INFO] Epoch:[0/2](750700/4588595) loss:3.183 lr:0.0000100 epoch_Time:24423.0min: [2024-01-06 01:17:37,001][model8_pretrain.py][INFO] Epoch:[0/2](750800/4588595) loss:2.793 lr:0.0000100 epoch_Time:24423.0min: [2024-01-06 01:17:37,001][model8_pretrain.py][INFO] Epoch:[0/2](750800/4588595) loss:2.837 lr:0.0000100 epoch_Time:24423.0min: [2024-01-06 01:17:37,001][model8_pretrain.py][INFO] Epoch:[0/2](750800/4588595) loss:2.740 lr:0.0000100 epoch_Time:24423.0min: [2024-01-06 01:17:37,001][model8_pretrain.py][INFO] Epoch:[0/2](750800/4588595) loss:2.737 lr:0.0000100 epoch_Time:24423.0min: [2024-01-06 01:17:37,001][model8_pretrain.py][INFO] Epoch:[0/2](750800/4588595) loss:2.992 lr:0.0000100 epoch_Time:24423.0min: [2024-01-06 01:17:37,001][model8_pretrain.py][INFO] Epoch:[0/2](750800/4588595) loss:2.842 lr:0.0000100 epoch_Time:24423.0min: [2024-01-06 01:17:37,001][model8_pretrain.py][INFO] Epoch:[0/2](750800/4588595) loss:2.567 lr:0.0000100 epoch_Time:24423.0min: [2024-01-06 01:17:37,001][model8_pretrain.py][INFO] Epoch:[0/2](750800/4588595) loss:2.536 lr:0.0000100 epoch_Time:24423.0min: [2024-01-06 01:18:13,948][model8_pretrain.py][INFO] Epoch:[0/2](750900/4588595) loss:2.306 lr:0.0000100 epoch_Time:24422.0min: [2024-01-06 01:18:13,948][model8_pretrain.py][INFO] Epoch:[0/2](750900/4588595) loss:2.663 lr:0.0000100 epoch_Time:24422.0min: [2024-01-06 01:18:13,948][model8_pretrain.py][INFO] Epoch:[0/2](750900/4588595) loss:2.884 lr:0.0000100 epoch_Time:24422.0min: [2024-01-06 01:18:13,948][model8_pretrain.py][INFO] Epoch:[0/2](750900/4588595) loss:2.853 lr:0.0000100 epoch_Time:24422.0min: [2024-01-06 01:18:13,948][model8_pretrain.py][INFO] 
Epoch:[0/2](750900/4588595) loss:2.784 lr:0.0000100 epoch_Time:24422.0min: [2024-01-06 01:18:13,948][model8_pretrain.py][INFO] Epoch:[0/2](750900/4588595) loss:2.906 lr:0.0000100 epoch_Time:24422.0min: [2024-01-06 01:18:13,948][model8_pretrain.py][INFO] Epoch:[0/2](750900/4588595) loss:2.779 lr:0.0000100 epoch_Time:24422.0min: [2024-01-06 01:18:13,948][model8_pretrain.py][INFO] Epoch:[0/2](750900/4588595) loss:2.463 lr:0.0000100 epoch_Time:24422.0min: [2024-01-06 01:18:50,901][model8_pretrain.py][INFO] Epoch:[0/2](751000/4588595) loss:2.738 lr:0.0000100 epoch_Time:24420.0min: [2024-01-06 01:18:50,901][model8_pretrain.py][INFO] Epoch:[0/2](751000/4588595) loss:3.063 lr:0.0000100 epoch_Time:24420.0min: [2024-01-06 01:18:50,901][model8_pretrain.py][INFO] Epoch:[0/2](751000/4588595) loss:2.852 lr:0.0000100 epoch_Time:24420.0min: [2024-01-06 01:18:50,901][model8_pretrain.py][INFO] Epoch:[0/2](751000/4588595) loss:3.272 lr:0.0000100 epoch_Time:24420.0min: [2024-01-06 01:18:50,901][model8_pretrain.py][INFO] Epoch:[0/2](751000/4588595) loss:2.909 lr:0.0000100 epoch_Time:24420.0min: [2024-01-06 01:18:50,901][model8_pretrain.py][INFO] Epoch:[0/2](751000/4588595) loss:2.979 lr:0.0000100 epoch_Time:24420.0min: [2024-01-06 01:18:50,901][model8_pretrain.py][INFO] Epoch:[0/2](751000/4588595) loss:2.198 lr:0.0000100 epoch_Time:24420.0min: [2024-01-06 01:18:50,901][model8_pretrain.py][INFO] Epoch:[0/2](751000/4588595) loss:2.836 lr:0.0000100 epoch_Time:24420.0min: [2024-01-06 01:19:27,841][model8_pretrain.py][INFO] Epoch:[0/2](751100/4588595) loss:3.197 lr:0.0000100 epoch_Time:24420.0min: [2024-01-06 01:19:27,841][model8_pretrain.py][INFO] Epoch:[0/2](751100/4588595) loss:2.510 lr:0.0000100 epoch_Time:24420.0min: [2024-01-06 01:19:27,841][model8_pretrain.py][INFO] Epoch:[0/2](751100/4588595) loss:2.759 lr:0.0000100 epoch_Time:24420.0min: [2024-01-06 01:19:27,841][model8_pretrain.py][INFO] Epoch:[0/2](751100/4588595) loss:2.402 lr:0.0000100 epoch_Time:24420.0min: [2024-01-06 
01:19:27,841][model8_pretrain.py][INFO] Epoch:[0/2](751100/4588595) loss:2.421 lr:0.0000100 epoch_Time:24420.0min: [2024-01-06 01:19:27,841][model8_pretrain.py][INFO] Epoch:[0/2](751100/4588595) loss:3.016 lr:0.0000100 epoch_Time:24420.0min: [2024-01-06 01:19:27,841][model8_pretrain.py][INFO] Epoch:[0/2](751100/4588595) loss:3.103 lr:0.0000100 epoch_Time:24420.0min: [2024-01-06 01:19:27,841][model8_pretrain.py][INFO] Epoch:[0/2](751100/4588595) loss:2.601 lr:0.0000100 epoch_Time:24420.0min: [2024-01-06 01:20:04,788][model8_pretrain.py][INFO] Epoch:[0/2](751200/4588595) loss:3.467 lr:0.0000100 epoch_Time:24419.0min: [2024-01-06 01:20:04,788][model8_pretrain.py][INFO] Epoch:[0/2](751200/4588595) loss:2.768 lr:0.0000100 epoch_Time:24419.0min: [2024-01-06 01:20:04,788][model8_pretrain.py][INFO] Epoch:[0/2](751200/4588595) loss:2.819 lr:0.0000100 epoch_Time:24419.0min: [2024-01-06 01:20:04,788][model8_pretrain.py][INFO] Epoch:[0/2](751200/4588595) loss:2.914 lr:0.0000100 epoch_Time:24419.0min: [2024-01-06 01:20:04,788][model8_pretrain.py][INFO] Epoch:[0/2](751200/4588595) loss:2.980 lr:0.0000100 epoch_Time:24419.0min: [2024-01-06 01:20:04,788][model8_pretrain.py][INFO] Epoch:[0/2](751200/4588595) loss:2.830 lr:0.0000100 epoch_Time:24419.0min: [2024-01-06 01:20:04,788][model8_pretrain.py][INFO] Epoch:[0/2](751200/4588595) loss:3.250 lr:0.0000100 epoch_Time:24419.0min: [2024-01-06 01:20:04,788][model8_pretrain.py][INFO] Epoch:[0/2](751200/4588595) loss:2.754 lr:0.0000100 epoch_Time:24419.0min: [2024-01-06 01:20:50,548][model8_pretrain.py][INFO] Epoch:[0/2](751300/4588595) loss:2.773 lr:0.0000100 epoch_Time:24419.0min: [2024-01-06 01:20:50,548][model8_pretrain.py][INFO] Epoch:[0/2](751300/4588595) loss:2.701 lr:0.0000100 epoch_Time:24419.0min: [2024-01-06 01:20:50,548][model8_pretrain.py][INFO] Epoch:[0/2](751300/4588595) loss:2.879 lr:0.0000100 epoch_Time:24419.0min: [2024-01-06 01:20:50,548][model8_pretrain.py][INFO] Epoch:[0/2](751300/4588595) loss:3.008 lr:0.0000100 
epoch_Time:24419.0min: [2024-01-06 01:20:50,548][model8_pretrain.py][INFO] Epoch:[0/2](751300/4588595) loss:2.616 lr:0.0000100 epoch_Time:24419.0min: [2024-01-06 01:20:50,548][model8_pretrain.py][INFO] Epoch:[0/2](751300/4588595) loss:2.599 lr:0.0000100 epoch_Time:24419.0min: [2024-01-06 01:20:50,548][model8_pretrain.py][INFO] Epoch:[0/2](751300/4588595) loss:2.675 lr:0.0000100 epoch_Time:24419.0min: [2024-01-06 01:20:50,548][model8_pretrain.py][INFO] Epoch:[0/2](751300/4588595) loss:2.886 lr:0.0000100 epoch_Time:24419.0min: [2024-01-06 01:21:29,034][model8_pretrain.py][INFO] Epoch:[0/2](751400/4588595) loss:2.902 lr:0.0000100 epoch_Time:24419.0min: [2024-01-06 01:21:29,034][model8_pretrain.py][INFO] Epoch:[0/2](751400/4588595) loss:2.914 lr:0.0000100 epoch_Time:24419.0min: [2024-01-06 01:21:29,034][model8_pretrain.py][INFO] Epoch:[0/2](751400/4588595) loss:2.318 lr:0.0000100 epoch_Time:24419.0min: [2024-01-06 01:21:29,034][model8_pretrain.py][INFO] Epoch:[0/2](751400/4588595) loss:2.637 lr:0.0000100 epoch_Time:24419.0min: [2024-01-06 01:21:29,034][model8_pretrain.py][INFO] Epoch:[0/2](751400/4588595) loss:2.901 lr:0.0000100 epoch_Time:24419.0min: [2024-01-06 01:21:29,034][model8_pretrain.py][INFO] Epoch:[0/2](751400/4588595) loss:2.914 lr:0.0000100 epoch_Time:24419.0min: [2024-01-06 01:21:29,034][model8_pretrain.py][INFO] Epoch:[0/2](751400/4588595) loss:2.937 lr:0.0000100 epoch_Time:24419.0min: [2024-01-06 01:21:29,034][model8_pretrain.py][INFO] Epoch:[0/2](751400/4588595) loss:2.412 lr:0.0000100 epoch_Time:24419.0min: [2024-01-06 01:22:05,970][model8_pretrain.py][INFO] Epoch:[0/2](751500/4588595) loss:2.284 lr:0.0000100 epoch_Time:24418.0min: [2024-01-06 01:22:05,970][model8_pretrain.py][INFO] Epoch:[0/2](751500/4588595) loss:3.041 lr:0.0000100 epoch_Time:24418.0min: [2024-01-06 01:22:05,970][model8_pretrain.py][INFO] Epoch:[0/2](751500/4588595) loss:3.007 lr:0.0000100 epoch_Time:24418.0min: [2024-01-06 01:22:05,970][model8_pretrain.py][INFO] 
Epoch:[0/2](751500/4588595) loss:2.804 lr:0.0000100 epoch_Time:24418.0min: [2024-01-06 01:22:05,970][model8_pretrain.py][INFO] Epoch:[0/2](751500/4588595) loss:2.558 lr:0.0000100 epoch_Time:24418.0min: [2024-01-06 01:22:05,970][model8_pretrain.py][INFO] Epoch:[0/2](751500/4588595) loss:3.151 lr:0.0000100 epoch_Time:24418.0min: [2024-01-06 01:22:05,970][model8_pretrain.py][INFO] Epoch:[0/2](751500/4588595) loss:2.759 lr:0.0000100 epoch_Time:24418.0min: [2024-01-06 01:22:05,970][model8_pretrain.py][INFO] Epoch:[0/2](751500/4588595) loss:2.871 lr:0.0000100 epoch_Time:24418.0min: [2024-01-06 01:22:42,909][model8_pretrain.py][INFO] Epoch:[0/2](751600/4588595) loss:2.687 lr:0.0000100 epoch_Time:24418.0min: [2024-01-06 01:22:42,909][model8_pretrain.py][INFO] Epoch:[0/2](751600/4588595) loss:2.783 lr:0.0000100 epoch_Time:24418.0min: [2024-01-06 01:22:42,909][model8_pretrain.py][INFO] Epoch:[0/2](751600/4588595) loss:2.869 lr:0.0000100 epoch_Time:24418.0min: [2024-01-06 01:22:42,909][model8_pretrain.py][INFO] Epoch:[0/2](751600/4588595) loss:2.713 lr:0.0000100 epoch_Time:24418.0min: [2024-01-06 01:22:42,909][model8_pretrain.py][INFO] Epoch:[0/2](751600/4588595) loss:2.802 lr:0.0000100 epoch_Time:24418.0min: [2024-01-06 01:22:42,909][model8_pretrain.py][INFO] Epoch:[0/2](751600/4588595) loss:3.162 lr:0.0000100 epoch_Time:24418.0min: [2024-01-06 01:22:42,909][model8_pretrain.py][INFO] Epoch:[0/2](751600/4588595) loss:2.610 lr:0.0000100 epoch_Time:24418.0min: [2024-01-06 01:22:42,909][model8_pretrain.py][INFO] Epoch:[0/2](751600/4588595) loss:2.805 lr:0.0000100 epoch_Time:24418.0min: [2024-01-06 01:23:19,852][model8_pretrain.py][INFO] Epoch:[0/2](751700/4588595) loss:3.211 lr:0.0000100 epoch_Time:24417.0min: [2024-01-06 01:23:19,852][model8_pretrain.py][INFO] Epoch:[0/2](751700/4588595) loss:2.790 lr:0.0000100 epoch_Time:24417.0min: [2024-01-06 01:23:19,852][model8_pretrain.py][INFO] Epoch:[0/2](751700/4588595) loss:2.769 lr:0.0000100 epoch_Time:24417.0min: [2024-01-06 
01:23:19,852][model8_pretrain.py][INFO] Epoch:[0/2](751700/4588595) loss:2.979 lr:0.0000100 epoch_Time:24417.0min: [2024-01-06 01:23:19,852][model8_pretrain.py][INFO] Epoch:[0/2](751700/4588595) loss:2.091 lr:0.0000100 epoch_Time:24417.0min: [2024-01-06 01:23:19,852][model8_pretrain.py][INFO] Epoch:[0/2](751700/4588595) loss:2.562 lr:0.0000100 epoch_Time:24417.0min: [2024-01-06 01:23:19,852][model8_pretrain.py][INFO] Epoch:[0/2](751700/4588595) loss:2.697 lr:0.0000100 epoch_Time:24417.0min: [2024-01-06 01:23:19,852][model8_pretrain.py][INFO] Epoch:[0/2](751700/4588595) loss:2.954 lr:0.0000100 epoch_Time:24417.0min: [2024-01-06 01:23:56,797][model8_pretrain.py][INFO] Epoch:[0/2](751800/4588595) loss:2.576 lr:0.0000100 epoch_Time:24415.0min: [2024-01-06 01:23:56,797][model8_pretrain.py][INFO] Epoch:[0/2](751800/4588595) loss:3.751 lr:0.0000100 epoch_Time:24415.0min: [2024-01-06 01:23:56,797][model8_pretrain.py][INFO] Epoch:[0/2](751800/4588595) loss:2.743 lr:0.0000100 epoch_Time:24415.0min: [2024-01-06 01:23:56,797][model8_pretrain.py][INFO] Epoch:[0/2](751800/4588595) loss:2.180 lr:0.0000100 epoch_Time:24415.0min: [2024-01-06 01:23:56,797][model8_pretrain.py][INFO] Epoch:[0/2](751800/4588595) loss:2.434 lr:0.0000100 epoch_Time:24415.0min: [2024-01-06 01:23:56,797][model8_pretrain.py][INFO] Epoch:[0/2](751800/4588595) loss:2.658 lr:0.0000100 epoch_Time:24415.0min: [2024-01-06 01:23:56,798][model8_pretrain.py][INFO] Epoch:[0/2](751800/4588595) loss:3.021 lr:0.0000100 epoch_Time:24415.0min: [2024-01-06 01:23:56,798][model8_pretrain.py][INFO] Epoch:[0/2](751800/4588595) loss:3.154 lr:0.0000100 epoch_Time:24415.0min: [2024-01-06 01:24:33,736][model8_pretrain.py][INFO] Epoch:[0/2](751900/4588595) loss:3.102 lr:0.0000100 epoch_Time:24415.0min: [2024-01-06 01:24:33,736][model8_pretrain.py][INFO] Epoch:[0/2](751900/4588595) loss:2.578 lr:0.0000100 epoch_Time:24415.0min: [2024-01-06 01:24:33,736][model8_pretrain.py][INFO] Epoch:[0/2](751900/4588595) loss:2.516 lr:0.0000100 
epoch_Time:24415.0min: [2024-01-06 01:24:33,736][model8_pretrain.py][INFO] Epoch:[0/2](751900/4588595) loss:2.700 lr:0.0000100 epoch_Time:24415.0min: [2024-01-06 01:24:33,736][model8_pretrain.py][INFO] Epoch:[0/2](751900/4588595) loss:3.058 lr:0.0000100 epoch_Time:24415.0min: [2024-01-06 01:24:33,737][model8_pretrain.py][INFO] Epoch:[0/2](751900/4588595) loss:2.369 lr:0.0000100 epoch_Time:24415.0min: [2024-01-06 01:24:33,737][model8_pretrain.py][INFO] Epoch:[0/2](751900/4588595) loss:2.867 lr:0.0000100 epoch_Time:24415.0min: [2024-01-06 01:24:33,737][model8_pretrain.py][INFO] Epoch:[0/2](751900/4588595) loss:2.732 lr:0.0000100 epoch_Time:24415.0min: [2024-01-06 01:25:10,685][model8_pretrain.py][INFO] Epoch:[0/2](752000/4588595) loss:2.807 lr:0.0000100 epoch_Time:24414.0min: [2024-01-06 01:25:10,685][model8_pretrain.py][INFO] Epoch:[0/2](752000/4588595) loss:2.556 lr:0.0000100 epoch_Time:24414.0min: [2024-01-06 01:25:10,685][model8_pretrain.py][INFO] Epoch:[0/2](752000/4588595) loss:2.610 lr:0.0000100 epoch_Time:24414.0min: [2024-01-06 01:25:10,685][model8_pretrain.py][INFO] Epoch:[0/2](752000/4588595) loss:2.899 lr:0.0000100 epoch_Time:24414.0min: [2024-01-06 01:25:10,685][model8_pretrain.py][INFO] Epoch:[0/2](752000/4588595) loss:3.218 lr:0.0000100 epoch_Time:24414.0min: [2024-01-06 01:25:10,685][model8_pretrain.py][INFO] Epoch:[0/2](752000/4588595) loss:2.601 lr:0.0000100 epoch_Time:24414.0min: [2024-01-06 01:25:10,685][model8_pretrain.py][INFO] Epoch:[0/2](752000/4588595) loss:2.053 lr:0.0000100 epoch_Time:24414.0min: [2024-01-06 01:25:10,685][model8_pretrain.py][INFO] Epoch:[0/2](752000/4588595) loss:2.688 lr:0.0000100 epoch_Time:24414.0min: [2024-01-06 01:25:56,376][model8_pretrain.py][INFO] Epoch:[0/2](752100/4588595) loss:2.761 lr:0.0000100 epoch_Time:24414.0min: [2024-01-06 01:25:56,376][model8_pretrain.py][INFO] Epoch:[0/2](752100/4588595) loss:2.886 lr:0.0000100 epoch_Time:24414.0min: [2024-01-06 01:25:56,376][model8_pretrain.py][INFO] 
Epoch:[0/2](752100/4588595) loss:2.458 lr:0.0000100 epoch_Time:24414.0min: [2024-01-06 01:25:56,376][model8_pretrain.py][INFO] Epoch:[0/2](752100/4588595) loss:2.257 lr:0.0000100 epoch_Time:24414.0min: [2024-01-06 01:25:56,376][model8_pretrain.py][INFO] Epoch:[0/2](752100/4588595) loss:2.768 lr:0.0000100 epoch_Time:24414.0min: [2024-01-06 01:25:56,376][model8_pretrain.py][INFO] Epoch:[0/2](752100/4588595) loss:3.110 lr:0.0000100 epoch_Time:24414.0min: [2024-01-06 01:25:56,376][model8_pretrain.py][INFO] Epoch:[0/2](752100/4588595) loss:2.813 lr:0.0000100 epoch_Time:24414.0min: [2024-01-06 01:25:56,376][model8_pretrain.py][INFO] Epoch:[0/2](752100/4588595) loss:2.372 lr:0.0000100 epoch_Time:24414.0min: [2024-01-06 01:26:34,983][model8_pretrain.py][INFO] Epoch:[0/2](752200/4588595) loss:3.047 lr:0.0000100 epoch_Time:24414.0min: [2024-01-06 01:26:34,983][model8_pretrain.py][INFO] Epoch:[0/2](752200/4588595) loss:2.289 lr:0.0000100 epoch_Time:24414.0min: [2024-01-06 01:26:34,983][model8_pretrain.py][INFO] Epoch:[0/2](752200/4588595) loss:2.622 lr:0.0000100 epoch_Time:24414.0min: [2024-01-06 01:26:34,983][model8_pretrain.py][INFO] Epoch:[0/2](752200/4588595) loss:2.977 lr:0.0000100 epoch_Time:24414.0min: [2024-01-06 01:26:34,983][model8_pretrain.py][INFO] Epoch:[0/2](752200/4588595) loss:2.671 lr:0.0000100 epoch_Time:24414.0min: [2024-01-06 01:26:34,983][model8_pretrain.py][INFO] Epoch:[0/2](752200/4588595) loss:3.607 lr:0.0000100 epoch_Time:24414.0min: [2024-01-06 01:26:34,984][model8_pretrain.py][INFO] Epoch:[0/2](752200/4588595) loss:2.503 lr:0.0000100 epoch_Time:24414.0min: [2024-01-06 01:26:34,984][model8_pretrain.py][INFO] Epoch:[0/2](752200/4588595) loss:3.274 lr:0.0000100 epoch_Time:24414.0min: [2024-01-06 01:27:11,935][model8_pretrain.py][INFO] Epoch:[0/2](752300/4588595) loss:2.283 lr:0.0000100 epoch_Time:24413.0min: [2024-01-06 01:27:11,935][model8_pretrain.py][INFO] Epoch:[0/2](752300/4588595) loss:2.651 lr:0.0000100 epoch_Time:24413.0min: [2024-01-06 
01:27:11,935][model8_pretrain.py][INFO] Epoch:[0/2](752300/4588595) loss:2.867 lr:0.0000100 epoch_Time:24413.0min: [2024-01-06 01:27:11,935][model8_pretrain.py][INFO] Epoch:[0/2](752300/4588595) loss:2.746 lr:0.0000100 epoch_Time:24413.0min: [2024-01-06 01:27:11,935][model8_pretrain.py][INFO] Epoch:[0/2](752300/4588595) loss:3.026 lr:0.0000100 epoch_Time:24413.0min: [2024-01-06 01:27:11,935][model8_pretrain.py][INFO] Epoch:[0/2](752300/4588595) loss:2.697 lr:0.0000100 epoch_Time:24413.0min: [2024-01-06 01:27:11,935][model8_pretrain.py][INFO] Epoch:[0/2](752300/4588595) loss:2.501 lr:0.0000100 epoch_Time:24413.0min: [2024-01-06 01:27:11,935][model8_pretrain.py][INFO] Epoch:[0/2](752300/4588595) loss:3.174 lr:0.0000100 epoch_Time:24413.0min: [2024-01-06 01:27:48,890][model8_pretrain.py][INFO] Epoch:[0/2](752400/4588595) loss:3.171 lr:0.0000100 epoch_Time:24412.0min: [2024-01-06 01:27:48,890][model8_pretrain.py][INFO] Epoch:[0/2](752400/4588595) loss:3.068 lr:0.0000100 epoch_Time:24412.0min: [2024-01-06 01:27:48,890][model8_pretrain.py][INFO] Epoch:[0/2](752400/4588595) loss:2.239 lr:0.0000100 epoch_Time:24412.0min: [2024-01-06 01:27:48,890][model8_pretrain.py][INFO] Epoch:[0/2](752400/4588595) loss:2.478 lr:0.0000100 epoch_Time:24412.0min: [2024-01-06 01:27:48,890][model8_pretrain.py][INFO] Epoch:[0/2](752400/4588595) loss:2.590 lr:0.0000100 epoch_Time:24412.0min: [2024-01-06 01:27:48,890][model8_pretrain.py][INFO] Epoch:[0/2](752400/4588595) loss:2.714 lr:0.0000100 epoch_Time:24412.0min: [2024-01-06 01:27:48,891][model8_pretrain.py][INFO] Epoch:[0/2](752400/4588595) loss:2.779 lr:0.0000100 epoch_Time:24412.0min: [2024-01-06 01:27:48,891][model8_pretrain.py][INFO] Epoch:[0/2](752400/4588595) loss:2.866 lr:0.0000100 epoch_Time:24412.0min: [2024-01-06 01:28:25,847][model8_pretrain.py][INFO] Epoch:[0/2](752500/4588595) loss:2.721 lr:0.0000100 epoch_Time:24412.0min: [2024-01-06 01:28:25,847][model8_pretrain.py][INFO] Epoch:[0/2](752500/4588595) loss:2.023 lr:0.0000100 
epoch_Time:24412.0min: [2024-01-06 01:28:25,847][model8_pretrain.py][INFO] Epoch:[0/2](752500/4588595) loss:3.285 lr:0.0000100 epoch_Time:24412.0min: [2024-01-06 01:28:25,847][model8_pretrain.py][INFO] Epoch:[0/2](752500/4588595) loss:2.815 lr:0.0000100 epoch_Time:24412.0min: [2024-01-06 01:28:25,847][model8_pretrain.py][INFO] Epoch:[0/2](752500/4588595) loss:2.963 lr:0.0000100 epoch_Time:24412.0min: [2024-01-06 01:28:25,847][model8_pretrain.py][INFO] Epoch:[0/2](752500/4588595) loss:2.842 lr:0.0000100 epoch_Time:24412.0min: [2024-01-06 01:28:25,847][model8_pretrain.py][INFO] Epoch:[0/2](752500/4588595) loss:2.867 lr:0.0000100 epoch_Time:24412.0min: [2024-01-06 01:28:25,847][model8_pretrain.py][INFO] Epoch:[0/2](752500/4588595) loss:2.763 lr:0.0000100 epoch_Time:24412.0min: [2024-01-06 01:29:02,805][model8_pretrain.py][INFO] Epoch:[0/2](752600/4588595) loss:2.334 lr:0.0000100 epoch_Time:24410.0min: [2024-01-06 01:29:02,805][model8_pretrain.py][INFO] Epoch:[0/2](752600/4588595) loss:3.067 lr:0.0000100 epoch_Time:24410.0min: [2024-01-06 01:29:02,805][model8_pretrain.py][INFO] Epoch:[0/2](752600/4588595) loss:2.804 lr:0.0000100 epoch_Time:24410.0min: [2024-01-06 01:29:02,805][model8_pretrain.py][INFO] Epoch:[0/2](752600/4588595) loss:2.925 lr:0.0000100 epoch_Time:24410.0min: [2024-01-06 01:29:02,805][model8_pretrain.py][INFO] Epoch:[0/2](752600/4588595) loss:2.983 lr:0.0000100 epoch_Time:24410.0min: [2024-01-06 01:29:02,805][model8_pretrain.py][INFO] Epoch:[0/2](752600/4588595) loss:2.893 lr:0.0000100 epoch_Time:24410.0min: [2024-01-06 01:29:02,805][model8_pretrain.py][INFO] Epoch:[0/2](752600/4588595) loss:3.318 lr:0.0000100 epoch_Time:24410.0min: [2024-01-06 01:29:02,805][model8_pretrain.py][INFO] Epoch:[0/2](752600/4588595) loss:2.854 lr:0.0000100 epoch_Time:24410.0min: [2024-01-06 01:29:39,759][model8_pretrain.py][INFO] Epoch:[0/2](752700/4588595) loss:2.983 lr:0.0000100 epoch_Time:24410.0min: [2024-01-06 01:29:39,759][model8_pretrain.py][INFO] 
Epoch:[0/2](752700/4588595) loss:3.002 lr:0.0000100 epoch_Time:24410.0min: [2024-01-06 01:29:39,759][model8_pretrain.py][INFO] Epoch:[0/2](752700/4588595) loss:2.946 lr:0.0000100 epoch_Time:24410.0min: [2024-01-06 01:29:39,759][model8_pretrain.py][INFO] Epoch:[0/2](752700/4588595) loss:3.363 lr:0.0000100 epoch_Time:24410.0min: [2024-01-06 01:29:39,759][model8_pretrain.py][INFO] Epoch:[0/2](752700/4588595) loss:2.686 lr:0.0000100 epoch_Time:24410.0min: [2024-01-06 01:29:39,759][model8_pretrain.py][INFO] Epoch:[0/2](752700/4588595) loss:1.824 lr:0.0000100 epoch_Time:24410.0min: [2024-01-06 01:29:39,760][model8_pretrain.py][INFO] Epoch:[0/2](752700/4588595) loss:2.913 lr:0.0000100 epoch_Time:24410.0min: [2024-01-06 01:29:39,760][model8_pretrain.py][INFO] Epoch:[0/2](752700/4588595) loss:1.883 lr:0.0000100 epoch_Time:24410.0min: [2024-01-06 01:30:16,717][model8_pretrain.py][INFO] Epoch:[0/2](752800/4588595) loss:2.795 lr:0.0000100 epoch_Time:24409.0min: [2024-01-06 01:30:16,717][model8_pretrain.py][INFO] Epoch:[0/2](752800/4588595) loss:2.872 lr:0.0000100 epoch_Time:24409.0min: [2024-01-06 01:30:16,717][model8_pretrain.py][INFO] Epoch:[0/2](752800/4588595) loss:2.351 lr:0.0000100 epoch_Time:24409.0min: [2024-01-06 01:30:16,717][model8_pretrain.py][INFO] Epoch:[0/2](752800/4588595) loss:3.213 lr:0.0000100 epoch_Time:24409.0min: [2024-01-06 01:30:16,717][model8_pretrain.py][INFO] Epoch:[0/2](752800/4588595) loss:2.857 lr:0.0000100 epoch_Time:24409.0min: [2024-01-06 01:30:16,717][model8_pretrain.py][INFO] Epoch:[0/2](752800/4588595) loss:3.098 lr:0.0000100 epoch_Time:24409.0min: [2024-01-06 01:30:16,717][model8_pretrain.py][INFO] Epoch:[0/2](752800/4588595) loss:2.843 lr:0.0000100 epoch_Time:24409.0min: [2024-01-06 01:30:16,717][model8_pretrain.py][INFO] Epoch:[0/2](752800/4588595) loss:2.662 lr:0.0000100 epoch_Time:24409.0min: [2024-01-06 01:31:00,378][model8_pretrain.py][INFO] Epoch:[0/2](752900/4588595) loss:2.737 lr:0.0000100 epoch_Time:24409.0min: [2024-01-06 
01:31:00,378][model8_pretrain.py][INFO] Epoch:[0/2](752900/4588595) loss:2.710 lr:0.0000100 epoch_Time:24409.0min: [2024-01-06 01:31:00,378][model8_pretrain.py][INFO] Epoch:[0/2](752900/4588595) loss:2.973 lr:0.0000100 epoch_Time:24409.0min: [2024-01-06 01:31:00,378][model8_pretrain.py][INFO] Epoch:[0/2](752900/4588595) loss:3.051 lr:0.0000100 epoch_Time:24409.0min: [2024-01-06 01:31:00,378][model8_pretrain.py][INFO] Epoch:[0/2](752900/4588595) loss:2.205 lr:0.0000100 epoch_Time:24409.0min: [2024-01-06 01:31:00,378][model8_pretrain.py][INFO] Epoch:[0/2](752900/4588595) loss:3.005 lr:0.0000100 epoch_Time:24409.0min: [2024-01-06 01:31:00,378][model8_pretrain.py][INFO] Epoch:[0/2](752900/4588595) loss:2.341 lr:0.0000100 epoch_Time:24409.0min: [2024-01-06 01:31:00,378][model8_pretrain.py][INFO] Epoch:[0/2](752900/4588595) loss:3.076 lr:0.0000100 epoch_Time:24409.0min: [2024-01-06 01:31:40,813][model8_pretrain.py][INFO] Epoch:[0/2](753000/4588595) loss:2.516 lr:0.0000100 epoch_Time:24409.0min: [2024-01-06 01:31:40,813][model8_pretrain.py][INFO] Epoch:[0/2](753000/4588595) loss:2.734 lr:0.0000100 epoch_Time:24409.0min: [2024-01-06 01:31:40,813][model8_pretrain.py][INFO] Epoch:[0/2](753000/4588595) loss:2.948 lr:0.0000100 epoch_Time:24409.0min: [2024-01-06 01:31:40,813][model8_pretrain.py][INFO] Epoch:[0/2](753000/4588595) loss:2.897 lr:0.0000100 epoch_Time:24409.0min: [2024-01-06 01:31:40,813][model8_pretrain.py][INFO] Epoch:[0/2](753000/4588595) loss:2.927 lr:0.0000100 epoch_Time:24409.0min: [2024-01-06 01:31:40,813][model8_pretrain.py][INFO] Epoch:[0/2](753000/4588595) loss:2.962 lr:0.0000100 epoch_Time:24409.0min: [2024-01-06 01:31:40,813][model8_pretrain.py][INFO] Epoch:[0/2](753000/4588595) loss:2.978 lr:0.0000100 epoch_Time:24409.0min: [2024-01-06 01:31:40,813][model8_pretrain.py][INFO] Epoch:[0/2](753000/4588595) loss:2.912 lr:0.0000100 epoch_Time:24409.0min: [2024-01-06 01:32:17,731][model8_pretrain.py][INFO] Epoch:[0/2](753100/4588595) loss:2.918 lr:0.0000100 
epoch_Time:24408.0min: [2024-01-06 01:32:17,731][model8_pretrain.py][INFO] Epoch:[0/2](753100/4588595) loss:2.620 lr:0.0000100 epoch_Time:24408.0min: [2024-01-06 01:32:17,731][model8_pretrain.py][INFO] Epoch:[0/2](753100/4588595) loss:2.611 lr:0.0000100 epoch_Time:24408.0min: [2024-01-06 01:32:17,731][model8_pretrain.py][INFO] Epoch:[0/2](753100/4588595) loss:2.757 lr:0.0000100 epoch_Time:24408.0min: [2024-01-06 01:32:17,731][model8_pretrain.py][INFO] Epoch:[0/2](753100/4588595) loss:2.901 lr:0.0000100 epoch_Time:24408.0min: [2024-01-06 01:32:17,732][model8_pretrain.py][INFO] Epoch:[0/2](753100/4588595) loss:1.832 lr:0.0000100 epoch_Time:24408.0min: [2024-01-06 01:32:17,732][model8_pretrain.py][INFO] Epoch:[0/2](753100/4588595) loss:2.665 lr:0.0000100 epoch_Time:24408.0min: [2024-01-06 01:32:17,732][model8_pretrain.py][INFO] Epoch:[0/2](753100/4588595) loss:2.393 lr:0.0000100 epoch_Time:24408.0min: [2024-01-06 01:32:54,674][model8_pretrain.py][INFO] Epoch:[0/2](753200/4588595) loss:3.040 lr:0.0000100 epoch_Time:24407.0min: [2024-01-06 01:32:54,674][model8_pretrain.py][INFO] Epoch:[0/2](753200/4588595) loss:3.084 lr:0.0000100 epoch_Time:24407.0min: [2024-01-06 01:32:54,674][model8_pretrain.py][INFO] Epoch:[0/2](753200/4588595) loss:3.282 lr:0.0000100 epoch_Time:24407.0min: [2024-01-06 01:32:54,674][model8_pretrain.py][INFO] Epoch:[0/2](753200/4588595) loss:2.804 lr:0.0000100 epoch_Time:24407.0min: [2024-01-06 01:32:54,674][model8_pretrain.py][INFO] Epoch:[0/2](753200/4588595) loss:2.712 lr:0.0000100 epoch_Time:24407.0min: [2024-01-06 01:32:54,674][model8_pretrain.py][INFO] Epoch:[0/2](753200/4588595) loss:2.806 lr:0.0000100 epoch_Time:24407.0min: [2024-01-06 01:32:54,674][model8_pretrain.py][INFO] Epoch:[0/2](753200/4588595) loss:3.258 lr:0.0000100 epoch_Time:24407.0min: [2024-01-06 01:32:54,675][model8_pretrain.py][INFO] Epoch:[0/2](753200/4588595) loss:2.503 lr:0.0000100 epoch_Time:24407.0min: [2024-01-06 01:33:31,613][model8_pretrain.py][INFO] 
Epoch:[0/2](753300/4588595) loss:2.500 lr:0.0000100 epoch_Time:24407.0min: [2024-01-06 01:33:31,613][model8_pretrain.py][INFO] Epoch:[0/2](753300/4588595) loss:2.995 lr:0.0000100 epoch_Time:24407.0min: [2024-01-06 01:33:31,613][model8_pretrain.py][INFO] Epoch:[0/2](753300/4588595) loss:2.766 lr:0.0000100 epoch_Time:24407.0min: [2024-01-06 01:33:31,613][model8_pretrain.py][INFO] Epoch:[0/2](753300/4588595) loss:2.828 lr:0.0000100 epoch_Time:24407.0min: [2024-01-06 01:33:31,613][model8_pretrain.py][INFO] Epoch:[0/2](753300/4588595) loss:2.484 lr:0.0000100 epoch_Time:24407.0min: [2024-01-06 01:33:31,613][model8_pretrain.py][INFO] Epoch:[0/2](753300/4588595) loss:2.852 lr:0.0000100 epoch_Time:24407.0min: [2024-01-06 01:33:31,613][model8_pretrain.py][INFO] Epoch:[0/2](753300/4588595) loss:2.986 lr:0.0000100 epoch_Time:24407.0min: [2024-01-06 01:33:31,614][model8_pretrain.py][INFO] Epoch:[0/2](753300/4588595) loss:2.917 lr:0.0000100 epoch_Time:24407.0min: [2024-01-06 01:34:08,562][model8_pretrain.py][INFO] Epoch:[0/2](753400/4588595) loss:2.741 lr:0.0000100 epoch_Time:24406.0min: [2024-01-06 01:34:08,562][model8_pretrain.py][INFO] Epoch:[0/2](753400/4588595) loss:2.894 lr:0.0000100 epoch_Time:24406.0min: [2024-01-06 01:34:08,562][model8_pretrain.py][INFO] Epoch:[0/2](753400/4588595) loss:2.518 lr:0.0000100 epoch_Time:24406.0min: [2024-01-06 01:34:08,562][model8_pretrain.py][INFO] Epoch:[0/2](753400/4588595) loss:3.304 lr:0.0000100 epoch_Time:24406.0min: [2024-01-06 01:34:08,562][model8_pretrain.py][INFO] Epoch:[0/2](753400/4588595) loss:2.551 lr:0.0000100 epoch_Time:24406.0min: [2024-01-06 01:34:08,562][model8_pretrain.py][INFO] Epoch:[0/2](753400/4588595) loss:2.919 lr:0.0000100 epoch_Time:24406.0min: [2024-01-06 01:34:08,562][model8_pretrain.py][INFO] Epoch:[0/2](753400/4588595) loss:2.770 lr:0.0000100 epoch_Time:24406.0min: [2024-01-06 01:34:08,563][model8_pretrain.py][INFO] Epoch:[0/2](753400/4588595) loss:2.995 lr:0.0000100 epoch_Time:24406.0min: [2024-01-06 
01:34:45,513][model8_pretrain.py][INFO] Epoch:[0/2](753500/4588595) loss:2.938 lr:0.0000100 epoch_Time:24405.0min: [2024-01-06 01:34:45,513][model8_pretrain.py][INFO] Epoch:[0/2](753500/4588595) loss:3.082 lr:0.0000100 epoch_Time:24405.0min: [2024-01-06 01:34:45,513][model8_pretrain.py][INFO] Epoch:[0/2](753500/4588595) loss:2.669 lr:0.0000100 epoch_Time:24405.0min: [2024-01-06 01:34:45,514][model8_pretrain.py][INFO] Epoch:[0/2](753500/4588595) loss:2.859 lr:0.0000100 epoch_Time:24405.0min: [2024-01-06 01:34:45,514][model8_pretrain.py][INFO] Epoch:[0/2](753500/4588595) loss:2.757 lr:0.0000100 epoch_Time:24405.0min: [2024-01-06 01:34:45,514][model8_pretrain.py][INFO] Epoch:[0/2](753500/4588595) loss:2.762 lr:0.0000100 epoch_Time:24405.0min: [2024-01-06 01:34:45,514][model8_pretrain.py][INFO] Epoch:[0/2](753500/4588595) loss:2.731 lr:0.0000100 epoch_Time:24405.0min: [2024-01-06 01:34:45,514][model8_pretrain.py][INFO] Epoch:[0/2](753500/4588595) loss:2.220 lr:0.0000100 epoch_Time:24405.0min: [2024-01-06 01:35:22,435][model8_pretrain.py][INFO] Epoch:[0/2](753600/4588595) loss:2.933 lr:0.0000100 epoch_Time:24404.0min: [2024-01-06 01:35:22,435][model8_pretrain.py][INFO] Epoch:[0/2](753600/4588595) loss:3.261 lr:0.0000100 epoch_Time:24404.0min: [2024-01-06 01:35:22,435][model8_pretrain.py][INFO] Epoch:[0/2](753600/4588595) loss:3.290 lr:0.0000100 epoch_Time:24404.0min: [2024-01-06 01:35:22,435][model8_pretrain.py][INFO] Epoch:[0/2](753600/4588595) loss:2.714 lr:0.0000100 epoch_Time:24404.0min: [2024-01-06 01:35:22,435][model8_pretrain.py][INFO] Epoch:[0/2](753600/4588595) loss:2.560 lr:0.0000100 epoch_Time:24404.0min: [2024-01-06 01:35:22,435][model8_pretrain.py][INFO] Epoch:[0/2](753600/4588595) loss:2.672 lr:0.0000100 epoch_Time:24404.0min: [2024-01-06 01:35:22,435][model8_pretrain.py][INFO] Epoch:[0/2](753600/4588595) loss:3.131 lr:0.0000100 epoch_Time:24404.0min: [2024-01-06 01:35:22,436][model8_pretrain.py][INFO] Epoch:[0/2](753600/4588595) loss:2.868 lr:0.0000100 
epoch_Time:24404.0min: [2024-01-06 01:36:06,028][model8_pretrain.py][INFO] Epoch:[0/2](753700/4588595) loss:2.804 lr:0.0000100 epoch_Time:24404.0min: [2024-01-06 01:36:06,028][model8_pretrain.py][INFO] Epoch:[0/2](753700/4588595) loss:3.003 lr:0.0000100 epoch_Time:24404.0min: [2024-01-06 01:36:06,028][model8_pretrain.py][INFO] Epoch:[0/2](753700/4588595) loss:2.656 lr:0.0000100 epoch_Time:24404.0min: [2024-01-06 01:36:06,028][model8_pretrain.py][INFO] Epoch:[0/2](753700/4588595) loss:3.063 lr:0.0000100 epoch_Time:24404.0min: [2024-01-06 01:36:06,028][model8_pretrain.py][INFO] Epoch:[0/2](753700/4588595) loss:2.613 lr:0.0000100 epoch_Time:24404.0min: [2024-01-06 01:36:06,028][model8_pretrain.py][INFO] Epoch:[0/2](753700/4588595) loss:2.984 lr:0.0000100 epoch_Time:24404.0min: [2024-01-06 01:36:06,028][model8_pretrain.py][INFO] Epoch:[0/2](753700/4588595) loss:2.926 lr:0.0000100 epoch_Time:24404.0min: [2024-01-06 01:36:06,028][model8_pretrain.py][INFO] Epoch:[0/2](753700/4588595) loss:3.064 lr:0.0000100 epoch_Time:24404.0min: [2024-01-06 01:36:46,576][model8_pretrain.py][INFO] Epoch:[0/2](753800/4588595) loss:2.796 lr:0.0000100 epoch_Time:24404.0min: [2024-01-06 01:36:46,576][model8_pretrain.py][INFO] Epoch:[0/2](753800/4588595) loss:2.717 lr:0.0000100 epoch_Time:24404.0min: [2024-01-06 01:36:46,576][model8_pretrain.py][INFO] Epoch:[0/2](753800/4588595) loss:2.957 lr:0.0000100 epoch_Time:24404.0min: [2024-01-06 01:36:46,576][model8_pretrain.py][INFO] Epoch:[0/2](753800/4588595) loss:2.635 lr:0.0000100 epoch_Time:24404.0min: [2024-01-06 01:36:46,576][model8_pretrain.py][INFO] Epoch:[0/2](753800/4588595) loss:2.548 lr:0.0000100 epoch_Time:24404.0min: [2024-01-06 01:36:46,576][model8_pretrain.py][INFO] Epoch:[0/2](753800/4588595) loss:3.403 lr:0.0000100 epoch_Time:24404.0min: [2024-01-06 01:36:46,576][model8_pretrain.py][INFO] Epoch:[0/2](753800/4588595) loss:2.357 lr:0.0000100 epoch_Time:24404.0min: [2024-01-06 01:36:46,576][model8_pretrain.py][INFO] 
Epoch:[0/2](753800/4588595) loss:3.124 lr:0.0000100 epoch_Time:24404.0min: [2024-01-06 01:37:23,512][model8_pretrain.py][INFO] Epoch:[0/2](753900/4588595) loss:2.741 lr:0.0000100 epoch_Time:24403.0min: [2024-01-06 01:37:23,512][model8_pretrain.py][INFO] Epoch:[0/2](753900/4588595) loss:2.898 lr:0.0000100 epoch_Time:24403.0min: [2024-01-06 01:37:23,512][model8_pretrain.py][INFO] Epoch:[0/2](753900/4588595) loss:2.963 lr:0.0000100 epoch_Time:24403.0min: [2024-01-06 01:37:23,512][model8_pretrain.py][INFO] Epoch:[0/2](753900/4588595) loss:3.475 lr:0.0000100 epoch_Time:24403.0min: [2024-01-06 01:37:23,512][model8_pretrain.py][INFO] Epoch:[0/2](753900/4588595) loss:2.377 lr:0.0000100 epoch_Time:24403.0min: [2024-01-06 01:37:23,512][model8_pretrain.py][INFO] Epoch:[0/2](753900/4588595) loss:2.495 lr:0.0000100 epoch_Time:24403.0min: [2024-01-06 01:37:23,512][model8_pretrain.py][INFO] Epoch:[0/2](753900/4588595) loss:3.098 lr:0.0000100 epoch_Time:24403.0min: [2024-01-06 01:37:23,512][model8_pretrain.py][INFO] Epoch:[0/2](753900/4588595) loss:3.509 lr:0.0000100 epoch_Time:24403.0min: [2024-01-06 01:38:00,452][model8_pretrain.py][INFO] Epoch:[0/2](754000/4588595) loss:2.749 lr:0.0000100 epoch_Time:24402.0min: [2024-01-06 01:38:00,452][model8_pretrain.py][INFO] Epoch:[0/2](754000/4588595) loss:3.341 lr:0.0000100 epoch_Time:24402.0min: [2024-01-06 01:38:00,452][model8_pretrain.py][INFO] Epoch:[0/2](754000/4588595) loss:2.706 lr:0.0000100 epoch_Time:24402.0min: [2024-01-06 01:38:00,452][model8_pretrain.py][INFO] Epoch:[0/2](754000/4588595) loss:3.191 lr:0.0000100 epoch_Time:24402.0min: [2024-01-06 01:38:00,452][model8_pretrain.py][INFO] Epoch:[0/2](754000/4588595) loss:2.340 lr:0.0000100 epoch_Time:24402.0min: [2024-01-06 01:38:00,452][model8_pretrain.py][INFO] Epoch:[0/2](754000/4588595) loss:2.993 lr:0.0000100 epoch_Time:24402.0min: [2024-01-06 01:38:00,452][model8_pretrain.py][INFO] Epoch:[0/2](754000/4588595) loss:2.392 lr:0.0000100 epoch_Time:24402.0min: [2024-01-06 
01:38:00,452][model8_pretrain.py][INFO] Epoch:[0/2](754000/4588595) loss:2.987 lr:0.0000100 epoch_Time:24402.0min: [2024-01-06 01:38:37,391][model8_pretrain.py][INFO] Epoch:[0/2](754100/4588595) loss:2.684 lr:0.0000100 epoch_Time:24402.0min: [2024-01-06 01:38:37,391][model8_pretrain.py][INFO] Epoch:[0/2](754100/4588595) loss:3.179 lr:0.0000100 epoch_Time:24402.0min: [2024-01-06 01:38:37,391][model8_pretrain.py][INFO] Epoch:[0/2](754100/4588595) loss:2.859 lr:0.0000100 epoch_Time:24402.0min: [2024-01-06 01:38:37,391][model8_pretrain.py][INFO] Epoch:[0/2](754100/4588595) loss:3.190 lr:0.0000100 epoch_Time:24402.0min: [2024-01-06 01:38:37,391][model8_pretrain.py][INFO] Epoch:[0/2](754100/4588595) loss:2.325 lr:0.0000100 epoch_Time:24402.0min: [2024-01-06 01:38:37,391][model8_pretrain.py][INFO] Epoch:[0/2](754100/4588595) loss:2.691 lr:0.0000100 epoch_Time:24402.0min: [2024-01-06 01:38:37,391][model8_pretrain.py][INFO] Epoch:[0/2](754100/4588595) loss:2.752 lr:0.0000100 epoch_Time:24402.0min: [2024-01-06 01:38:37,391][model8_pretrain.py][INFO] Epoch:[0/2](754100/4588595) loss:3.195 lr:0.0000100 epoch_Time:24402.0min: [2024-01-06 01:39:14,336][model8_pretrain.py][INFO] Epoch:[0/2](754200/4588595) loss:2.872 lr:0.0000100 epoch_Time:24401.0min: [2024-01-06 01:39:14,336][model8_pretrain.py][INFO] Epoch:[0/2](754200/4588595) loss:2.685 lr:0.0000100 epoch_Time:24401.0min: [2024-01-06 01:39:14,336][model8_pretrain.py][INFO] Epoch:[0/2](754200/4588595) loss:3.162 lr:0.0000100 epoch_Time:24401.0min: [2024-01-06 01:39:14,336][model8_pretrain.py][INFO] Epoch:[0/2](754200/4588595) loss:3.120 lr:0.0000100 epoch_Time:24401.0min: [2024-01-06 01:39:14,337][model8_pretrain.py][INFO] Epoch:[0/2](754200/4588595) loss:2.674 lr:0.0000100 epoch_Time:24401.0min: [2024-01-06 01:39:14,337][model8_pretrain.py][INFO] Epoch:[0/2](754200/4588595) loss:2.789 lr:0.0000100 epoch_Time:24401.0min: [2024-01-06 01:39:14,337][model8_pretrain.py][INFO] Epoch:[0/2](754200/4588595) loss:3.083 lr:0.0000100 
epoch_Time:24401.0min: [2024-01-06 01:39:14,337][model8_pretrain.py][INFO] Epoch:[0/2](754200/4588595) loss:2.345 lr:0.0000100 epoch_Time:24401.0min: [2024-01-06 01:39:51,284][model8_pretrain.py][INFO] Epoch:[0/2](754300/4588595) loss:3.275 lr:0.0000100 epoch_Time:24399.0min: [2024-01-06 01:39:51,284][model8_pretrain.py][INFO] Epoch:[0/2](754300/4588595) loss:2.777 lr:0.0000100 epoch_Time:24399.0min: [2024-01-06 01:39:51,284][model8_pretrain.py][INFO] Epoch:[0/2](754300/4588595) loss:2.213 lr:0.0000100 epoch_Time:24399.0min: [2024-01-06 01:39:51,284][model8_pretrain.py][INFO] Epoch:[0/2](754300/4588595) loss:3.515 lr:0.0000100 epoch_Time:24399.0min: [2024-01-06 01:39:51,284][model8_pretrain.py][INFO] Epoch:[0/2](754300/4588595) loss:3.168 lr:0.0000100 epoch_Time:24399.0min: [2024-01-06 01:39:51,284][model8_pretrain.py][INFO] Epoch:[0/2](754300/4588595) loss:2.355 lr:0.0000100 epoch_Time:24399.0min: [2024-01-06 01:39:51,284][model8_pretrain.py][INFO] Epoch:[0/2](754300/4588595) loss:2.537 lr:0.0000100 epoch_Time:24399.0min: [2024-01-06 01:39:51,284][model8_pretrain.py][INFO] Epoch:[0/2](754300/4588595) loss:2.297 lr:0.0000100 epoch_Time:24399.0min: [2024-01-06 01:40:28,228][model8_pretrain.py][INFO] Epoch:[0/2](754400/4588595) loss:2.837 lr:0.0000100 epoch_Time:24399.0min: [2024-01-06 01:40:28,228][model8_pretrain.py][INFO] Epoch:[0/2](754400/4588595) loss:3.114 lr:0.0000100 epoch_Time:24399.0min: [2024-01-06 01:40:28,228][model8_pretrain.py][INFO] Epoch:[0/2](754400/4588595) loss:3.180 lr:0.0000100 epoch_Time:24399.0min: [2024-01-06 01:40:28,228][model8_pretrain.py][INFO] Epoch:[0/2](754400/4588595) loss:2.834 lr:0.0000100 epoch_Time:24399.0min: [2024-01-06 01:40:28,228][model8_pretrain.py][INFO] Epoch:[0/2](754400/4588595) loss:2.896 lr:0.0000100 epoch_Time:24399.0min: [2024-01-06 01:40:28,228][model8_pretrain.py][INFO] Epoch:[0/2](754400/4588595) loss:3.053 lr:0.0000100 epoch_Time:24399.0min: [2024-01-06 01:40:28,228][model8_pretrain.py][INFO] 
Epoch:[0/2](754400/4588595) loss:3.264 lr:0.0000100 epoch_Time:24399.0min: [2024-01-06 01:40:28,228][model8_pretrain.py][INFO] Epoch:[0/2](754400/4588595) loss:2.907 lr:0.0000100 epoch_Time:24399.0min: [2024-01-06 01:41:11,925][model8_pretrain.py][INFO] Epoch:[0/2](754500/4588595) loss:3.123 lr:0.0000100 epoch_Time:24399.0min: [2024-01-06 01:41:11,925][model8_pretrain.py][INFO] Epoch:[0/2](754500/4588595) loss:2.654 lr:0.0000100 epoch_Time:24399.0min: [2024-01-06 01:41:11,925][model8_pretrain.py][INFO] Epoch:[0/2](754500/4588595) loss:3.102 lr:0.0000100 epoch_Time:24399.0min: [2024-01-06 01:41:11,925][model8_pretrain.py][INFO] Epoch:[0/2](754500/4588595) loss:2.369 lr:0.0000100 epoch_Time:24399.0min: [2024-01-06 01:41:11,925][model8_pretrain.py][INFO] Epoch:[0/2](754500/4588595) loss:2.817 lr:0.0000100 epoch_Time:24399.0min: [2024-01-06 01:41:11,925][model8_pretrain.py][INFO] Epoch:[0/2](754500/4588595) loss:3.513 lr:0.0000100 epoch_Time:24399.0min: [2024-01-06 01:41:11,929][model8_pretrain.py][INFO] Epoch:[0/2](754500/4588595) loss:2.289 lr:0.0000100 epoch_Time:24399.0min: [2024-01-06 01:41:11,930][model8_pretrain.py][INFO] Epoch:[0/2](754500/4588595) loss:2.435 lr:0.0000100 epoch_Time:24399.0min: [2024-01-06 01:41:52,381][model8_pretrain.py][INFO] Epoch:[0/2](754600/4588595) loss:2.741 lr:0.0000100 epoch_Time:24398.0min: [2024-01-06 01:41:52,381][model8_pretrain.py][INFO] Epoch:[0/2](754600/4588595) loss:3.352 lr:0.0000100 epoch_Time:24398.0min: [2024-01-06 01:41:52,381][model8_pretrain.py][INFO] Epoch:[0/2](754600/4588595) loss:3.131 lr:0.0000100 epoch_Time:24398.0min: [2024-01-06 01:41:52,381][model8_pretrain.py][INFO] Epoch:[0/2](754600/4588595) loss:3.229 lr:0.0000100 epoch_Time:24398.0min: [2024-01-06 01:41:52,381][model8_pretrain.py][INFO] Epoch:[0/2](754600/4588595) loss:2.774 lr:0.0000100 epoch_Time:24398.0min: [2024-01-06 01:41:52,381][model8_pretrain.py][INFO] Epoch:[0/2](754600/4588595) loss:2.632 lr:0.0000100 epoch_Time:24398.0min: [2024-01-06 
01:41:52,381][model8_pretrain.py][INFO] Epoch:[0/2](754600/4588595) loss:2.517 lr:0.0000100 epoch_Time:24398.0min: [2024-01-06 01:41:52,381][model8_pretrain.py][INFO] Epoch:[0/2](754600/4588595) loss:2.634 lr:0.0000100 epoch_Time:24398.0min: [2024-01-06 01:42:29,319][model8_pretrain.py][INFO] Epoch:[0/2](754700/4588595) loss:3.039 lr:0.0000100 epoch_Time:24398.0min: [2024-01-06 01:42:29,319][model8_pretrain.py][INFO] Epoch:[0/2](754700/4588595) loss:2.673 lr:0.0000100 epoch_Time:24398.0min: [2024-01-06 01:42:29,319][model8_pretrain.py][INFO] Epoch:[0/2](754700/4588595) loss:2.567 lr:0.0000100 epoch_Time:24398.0min: [2024-01-06 01:42:29,319][model8_pretrain.py][INFO] Epoch:[0/2](754700/4588595) loss:2.757 lr:0.0000100 epoch_Time:24398.0min: [2024-01-06 01:42:29,319][model8_pretrain.py][INFO] Epoch:[0/2](754700/4588595) loss:3.110 lr:0.0000100 epoch_Time:24398.0min: [2024-01-06 01:42:29,319][model8_pretrain.py][INFO] Epoch:[0/2](754700/4588595) loss:2.438 lr:0.0000100 epoch_Time:24398.0min: [2024-01-06 01:42:29,319][model8_pretrain.py][INFO] Epoch:[0/2](754700/4588595) loss:2.934 lr:0.0000100 epoch_Time:24398.0min: [2024-01-06 01:42:29,319][model8_pretrain.py][INFO] Epoch:[0/2](754700/4588595) loss:2.547 lr:0.0000100 epoch_Time:24398.0min: [2024-01-06 01:43:06,252][model8_pretrain.py][INFO] Epoch:[0/2](754800/4588595) loss:2.733 lr:0.0000100 epoch_Time:24397.0min: [2024-01-06 01:43:06,252][model8_pretrain.py][INFO] Epoch:[0/2](754800/4588595) loss:2.799 lr:0.0000100 epoch_Time:24397.0min: [2024-01-06 01:43:06,252][model8_pretrain.py][INFO] Epoch:[0/2](754800/4588595) loss:2.400 lr:0.0000100 epoch_Time:24397.0min: [2024-01-06 01:43:06,252][model8_pretrain.py][INFO] Epoch:[0/2](754800/4588595) loss:2.385 lr:0.0000100 epoch_Time:24397.0min: [2024-01-06 01:43:06,252][model8_pretrain.py][INFO] Epoch:[0/2](754800/4588595) loss:2.696 lr:0.0000100 epoch_Time:24397.0min: [2024-01-06 01:43:06,252][model8_pretrain.py][INFO] Epoch:[0/2](754800/4588595) loss:2.865 lr:0.0000100 
epoch_Time:24397.0min: [2024-01-06 01:43:06,252][model8_pretrain.py][INFO] Epoch:[0/2](754800/4588595) loss:3.000 lr:0.0000100 epoch_Time:24397.0min: [2024-01-06 01:43:06,252][model8_pretrain.py][INFO] Epoch:[0/2](754800/4588595) loss:2.128 lr:0.0000100 epoch_Time:24397.0min: [2024-01-06 01:43:43,185][model8_pretrain.py][INFO] Epoch:[0/2](754900/4588595) loss:2.997 lr:0.0000100 epoch_Time:24397.0min: [2024-01-06 01:43:43,185][model8_pretrain.py][INFO] Epoch:[0/2](754900/4588595) loss:3.350 lr:0.0000100 epoch_Time:24397.0min: [2024-01-06 01:43:43,185][model8_pretrain.py][INFO] Epoch:[0/2](754900/4588595) loss:2.332 lr:0.0000100 epoch_Time:24397.0min: [2024-01-06 01:43:43,185][model8_pretrain.py][INFO] Epoch:[0/2](754900/4588595) loss:3.396 lr:0.0000100 epoch_Time:24397.0min: [2024-01-06 01:43:43,185][model8_pretrain.py][INFO] Epoch:[0/2](754900/4588595) loss:3.160 lr:0.0000100 epoch_Time:24397.0min: [2024-01-06 01:43:43,185][model8_pretrain.py][INFO] Epoch:[0/2](754900/4588595) loss:2.995 lr:0.0000100 epoch_Time:24397.0min: [2024-01-06 01:43:43,185][model8_pretrain.py][INFO] Epoch:[0/2](754900/4588595) loss:2.341 lr:0.0000100 epoch_Time:24397.0min: [2024-01-06 01:43:43,185][model8_pretrain.py][INFO] Epoch:[0/2](754900/4588595) loss:2.263 lr:0.0000100 epoch_Time:24397.0min: [2024-01-06 01:44:20,124][model8_pretrain.py][INFO] Epoch:[0/2](755000/4588595) loss:2.519 lr:0.0000100 epoch_Time:24396.0min: [2024-01-06 01:44:20,124][model8_pretrain.py][INFO] Epoch:[0/2](755000/4588595) loss:2.934 lr:0.0000100 epoch_Time:24396.0min: [2024-01-06 01:44:20,124][model8_pretrain.py][INFO] Epoch:[0/2](755000/4588595) loss:3.108 lr:0.0000100 epoch_Time:24396.0min: [2024-01-06 01:44:20,124][model8_pretrain.py][INFO] Epoch:[0/2](755000/4588595) loss:3.210 lr:0.0000100 epoch_Time:24396.0min: [2024-01-06 01:44:20,124][model8_pretrain.py][INFO] Epoch:[0/2](755000/4588595) loss:2.233 lr:0.0000100 epoch_Time:24396.0min: [2024-01-06 01:44:20,124][model8_pretrain.py][INFO] 
Epoch:[0/2](755000/4588595) loss:3.123 lr:0.0000100 epoch_Time:24396.0min: [2024-01-06 01:44:20,124][model8_pretrain.py][INFO] Epoch:[0/2](755000/4588595) loss:2.817 lr:0.0000100 epoch_Time:24396.0min: [2024-01-06 01:44:20,124][model8_pretrain.py][INFO] Epoch:[0/2](755000/4588595) loss:2.644 lr:0.0000100 epoch_Time:24396.0min: [2024-01-06 01:44:57,064][model8_pretrain.py][INFO] Epoch:[0/2](755100/4588595) loss:3.042 lr:0.0000100 epoch_Time:24394.0min: [2024-01-06 01:44:57,064][model8_pretrain.py][INFO] Epoch:[0/2](755100/4588595) loss:2.935 lr:0.0000100 epoch_Time:24394.0min: [2024-01-06 01:44:57,064][model8_pretrain.py][INFO] Epoch:[0/2](755100/4588595) loss:2.919 lr:0.0000100 epoch_Time:24394.0min: [2024-01-06 01:44:57,064][model8_pretrain.py][INFO] Epoch:[0/2](755100/4588595) loss:2.375 lr:0.0000100 epoch_Time:24394.0min: [2024-01-06 01:44:57,064][model8_pretrain.py][INFO] Epoch:[0/2](755100/4588595) loss:2.249 lr:0.0000100 epoch_Time:24394.0min: [2024-01-06 01:44:57,064][model8_pretrain.py][INFO] Epoch:[0/2](755100/4588595) loss:3.270 lr:0.0000100 epoch_Time:24394.0min: [2024-01-06 01:44:57,064][model8_pretrain.py][INFO] Epoch:[0/2](755100/4588595) loss:2.466 lr:0.0000100 epoch_Time:24394.0min: [2024-01-06 01:44:57,064][model8_pretrain.py][INFO] Epoch:[0/2](755100/4588595) loss:2.676 lr:0.0000100 epoch_Time:24394.0min: [2024-01-06 01:45:34,008][model8_pretrain.py][INFO] Epoch:[0/2](755200/4588595) loss:2.646 lr:0.0000100 epoch_Time:24394.0min: [2024-01-06 01:45:34,008][model8_pretrain.py][INFO] Epoch:[0/2](755200/4588595) loss:2.402 lr:0.0000100 epoch_Time:24394.0min: [2024-01-06 01:45:34,008][model8_pretrain.py][INFO] Epoch:[0/2](755200/4588595) loss:3.404 lr:0.0000100 epoch_Time:24394.0min: [2024-01-06 01:45:34,008][model8_pretrain.py][INFO] Epoch:[0/2](755200/4588595) loss:2.981 lr:0.0000100 epoch_Time:24394.0min: [2024-01-06 01:45:34,008][model8_pretrain.py][INFO] Epoch:[0/2](755200/4588595) loss:2.847 lr:0.0000100 epoch_Time:24394.0min: [2024-01-06 
01:45:34,008][model8_pretrain.py][INFO] Epoch:[0/2](755200/4588595) loss:3.255 lr:0.0000100 epoch_Time:24394.0min: [2024-01-06 01:45:34,008][model8_pretrain.py][INFO] Epoch:[0/2](755200/4588595) loss:2.408 lr:0.0000100 epoch_Time:24394.0min: [2024-01-06 01:45:34,008][model8_pretrain.py][INFO] Epoch:[0/2](755200/4588595) loss:2.641 lr:0.0000100 epoch_Time:24394.0min: [2024-01-06 01:46:16,055][model8_pretrain.py][INFO] Epoch:[0/2](755300/4588595) loss:3.330 lr:0.0000100 epoch_Time:24394.0min: [2024-01-06 01:46:16,055][model8_pretrain.py][INFO] Epoch:[0/2](755300/4588595) loss:3.135 lr:0.0000100 epoch_Time:24394.0min: [2024-01-06 01:46:16,055][model8_pretrain.py][INFO] Epoch:[0/2](755300/4588595) loss:2.526 lr:0.0000100 epoch_Time:24394.0min: [2024-01-06 01:46:16,055][model8_pretrain.py][INFO] Epoch:[0/2](755300/4588595) loss:2.873 lr:0.0000100 epoch_Time:24394.0min: [2024-01-06 01:46:16,055][model8_pretrain.py][INFO] Epoch:[0/2](755300/4588595) loss:2.598 lr:0.0000100 epoch_Time:24394.0min: [2024-01-06 01:46:16,055][model8_pretrain.py][INFO] Epoch:[0/2](755300/4588595) loss:3.143 lr:0.0000100 epoch_Time:24394.0min: [2024-01-06 01:46:16,055][model8_pretrain.py][INFO] Epoch:[0/2](755300/4588595) loss:3.291 lr:0.0000100 epoch_Time:24394.0min: [2024-01-06 01:46:16,055][model8_pretrain.py][INFO] Epoch:[0/2](755300/4588595) loss:2.392 lr:0.0000100 epoch_Time:24394.0min: [2024-01-06 01:46:58,256][model8_pretrain.py][INFO] Epoch:[0/2](755400/4588595) loss:3.002 lr:0.0000100 epoch_Time:24393.0min: [2024-01-06 01:46:58,256][model8_pretrain.py][INFO] Epoch:[0/2](755400/4588595) loss:3.117 lr:0.0000100 epoch_Time:24393.0min: [2024-01-06 01:46:58,256][model8_pretrain.py][INFO] Epoch:[0/2](755400/4588595) loss:2.710 lr:0.0000100 epoch_Time:24393.0min: [2024-01-06 01:46:58,256][model8_pretrain.py][INFO] Epoch:[0/2](755400/4588595) loss:2.678 lr:0.0000100 epoch_Time:24393.0min: [2024-01-06 01:46:58,256][model8_pretrain.py][INFO] Epoch:[0/2](755400/4588595) loss:2.677 lr:0.0000100 
epoch_Time:24393.0min: [2024-01-06 01:46:58,256][model8_pretrain.py][INFO] Epoch:[0/2](755400/4588595) loss:3.019 lr:0.0000100 epoch_Time:24393.0min: [2024-01-06 01:46:58,256][model8_pretrain.py][INFO] Epoch:[0/2](755400/4588595) loss:2.861 lr:0.0000100 epoch_Time:24393.0min: [2024-01-06 01:46:58,256][model8_pretrain.py][INFO] Epoch:[0/2](755400/4588595) loss:3.274 lr:0.0000100 epoch_Time:24393.0min: [2024-01-06 01:47:35,206][model8_pretrain.py][INFO] Epoch:[0/2](755500/4588595) loss:2.467 lr:0.0000100 epoch_Time:24393.0min: [2024-01-06 01:47:35,206][model8_pretrain.py][INFO] Epoch:[0/2](755500/4588595) loss:2.754 lr:0.0000100 epoch_Time:24393.0min: [2024-01-06 01:47:35,206][model8_pretrain.py][INFO] Epoch:[0/2](755500/4588595) loss:3.127 lr:0.0000100 epoch_Time:24393.0min: [2024-01-06 01:47:35,206][model8_pretrain.py][INFO] Epoch:[0/2](755500/4588595) loss:2.872 lr:0.0000100 epoch_Time:24393.0min: [2024-01-06 01:47:35,206][model8_pretrain.py][INFO] Epoch:[0/2](755500/4588595) loss:3.085 lr:0.0000100 epoch_Time:24393.0min: [2024-01-06 01:47:35,206][model8_pretrain.py][INFO] Epoch:[0/2](755500/4588595) loss:2.885 lr:0.0000100 epoch_Time:24393.0min: [2024-01-06 01:47:35,206][model8_pretrain.py][INFO] Epoch:[0/2](755500/4588595) loss:2.803 lr:0.0000100 epoch_Time:24393.0min: [2024-01-06 01:47:35,206][model8_pretrain.py][INFO] Epoch:[0/2](755500/4588595) loss:2.938 lr:0.0000100 epoch_Time:24393.0min: [2024-01-06 01:48:12,160][model8_pretrain.py][INFO] Epoch:[0/2](755600/4588595) loss:2.462 lr:0.0000100 epoch_Time:24392.0min: [2024-01-06 01:48:12,160][model8_pretrain.py][INFO] Epoch:[0/2](755600/4588595) loss:2.518 lr:0.0000100 epoch_Time:24392.0min: [2024-01-06 01:48:12,160][model8_pretrain.py][INFO] Epoch:[0/2](755600/4588595) loss:3.031 lr:0.0000100 epoch_Time:24392.0min: [2024-01-06 01:48:12,160][model8_pretrain.py][INFO] Epoch:[0/2](755600/4588595) loss:2.921 lr:0.0000100 epoch_Time:24392.0min: [2024-01-06 01:48:12,160][model8_pretrain.py][INFO] 
Epoch:[0/2](755600/4588595) loss:2.867 lr:0.0000100 epoch_Time:24392.0min: [2024-01-06 01:48:12,160][model8_pretrain.py][INFO] Epoch:[0/2](755600/4588595) loss:2.965 lr:0.0000100 epoch_Time:24392.0min: [2024-01-06 01:48:12,160][model8_pretrain.py][INFO] Epoch:[0/2](755600/4588595) loss:2.570 lr:0.0000100 epoch_Time:24392.0min: [2024-01-06 01:48:12,160][model8_pretrain.py][INFO] Epoch:[0/2](755600/4588595) loss:2.790 lr:0.0000100 epoch_Time:24392.0min: [2024-01-06 01:48:49,114][model8_pretrain.py][INFO] Epoch:[0/2](755700/4588595) loss:2.821 lr:0.0000100 epoch_Time:24391.0min: [2024-01-06 01:48:49,115][model8_pretrain.py][INFO] Epoch:[0/2](755700/4588595) loss:2.681 lr:0.0000100 epoch_Time:24391.0min: [2024-01-06 01:48:49,115][model8_pretrain.py][INFO] Epoch:[0/2](755700/4588595) loss:3.338 lr:0.0000100 epoch_Time:24391.0min: [2024-01-06 01:48:49,115][model8_pretrain.py][INFO] Epoch:[0/2](755700/4588595) loss:3.071 lr:0.0000100 epoch_Time:24391.0min: [2024-01-06 01:48:49,115][model8_pretrain.py][INFO] Epoch:[0/2](755700/4588595) loss:2.651 lr:0.0000100 epoch_Time:24391.0min: [2024-01-06 01:48:49,115][model8_pretrain.py][INFO] Epoch:[0/2](755700/4588595) loss:2.529 lr:0.0000100 epoch_Time:24391.0min: [2024-01-06 01:48:49,115][model8_pretrain.py][INFO] Epoch:[0/2](755700/4588595) loss:2.860 lr:0.0000100 epoch_Time:24391.0min: [2024-01-06 01:48:49,115][model8_pretrain.py][INFO] Epoch:[0/2](755700/4588595) loss:2.918 lr:0.0000100 epoch_Time:24391.0min: [2024-01-06 01:49:26,081][model8_pretrain.py][INFO] Epoch:[0/2](755800/4588595) loss:2.619 lr:0.0000100 epoch_Time:24391.0min: [2024-01-06 01:49:26,081][model8_pretrain.py][INFO] Epoch:[0/2](755800/4588595) loss:2.683 lr:0.0000100 epoch_Time:24391.0min: [2024-01-06 01:49:26,081][model8_pretrain.py][INFO] Epoch:[0/2](755800/4588595) loss:3.247 lr:0.0000100 epoch_Time:24391.0min: [2024-01-06 01:49:26,081][model8_pretrain.py][INFO] Epoch:[0/2](755800/4588595) loss:2.493 lr:0.0000100 epoch_Time:24391.0min: [2024-01-06 
01:49:26,081][model8_pretrain.py][INFO] Epoch:[0/2](755800/4588595) loss:2.129 lr:0.0000100 epoch_Time:24391.0min: [2024-01-06 01:49:26,081][model8_pretrain.py][INFO] Epoch:[0/2](755800/4588595) loss:2.905 lr:0.0000100 epoch_Time:24391.0min: [2024-01-06 01:49:26,081][model8_pretrain.py][INFO] Epoch:[0/2](755800/4588595) loss:2.608 lr:0.0000100 epoch_Time:24391.0min: [2024-01-06 01:49:26,083][model8_pretrain.py][INFO] Epoch:[0/2](755800/4588595) loss:2.614 lr:0.0000100 epoch_Time:24391.0min: [2024-01-06 01:50:03,006][model8_pretrain.py][INFO] Epoch:[0/2](755900/4588595) loss:2.529 lr:0.0000100 epoch_Time:24390.0min: [2024-01-06 01:50:03,006][model8_pretrain.py][INFO] Epoch:[0/2](755900/4588595) loss:2.578 lr:0.0000100 epoch_Time:24390.0min: [2024-01-06 01:50:03,007][model8_pretrain.py][INFO] Epoch:[0/2](755900/4588595) loss:3.071 lr:0.0000100 epoch_Time:24390.0min: [2024-01-06 01:50:03,007][model8_pretrain.py][INFO] Epoch:[0/2](755900/4588595) loss:3.128 lr:0.0000100 epoch_Time:24390.0min: [2024-01-06 01:50:03,007][model8_pretrain.py][INFO] Epoch:[0/2](755900/4588595) loss:2.410 lr:0.0000100 epoch_Time:24390.0min: [2024-01-06 01:50:03,007][model8_pretrain.py][INFO] Epoch:[0/2](755900/4588595) loss:2.850 lr:0.0000100 epoch_Time:24390.0min: [2024-01-06 01:50:03,007][model8_pretrain.py][INFO] Epoch:[0/2](755900/4588595) loss:2.223 lr:0.0000100 epoch_Time:24390.0min: [2024-01-06 01:50:03,007][model8_pretrain.py][INFO] Epoch:[0/2](755900/4588595) loss:3.192 lr:0.0000100 epoch_Time:24390.0min: [2024-01-06 01:50:39,966][model8_pretrain.py][INFO] Epoch:[0/2](756000/4588595) loss:3.126 lr:0.0000100 epoch_Time:24389.0min: [2024-01-06 01:50:39,966][model8_pretrain.py][INFO] Epoch:[0/2](756000/4588595) loss:3.040 lr:0.0000100 epoch_Time:24389.0min: [2024-01-06 01:50:39,966][model8_pretrain.py][INFO] Epoch:[0/2](756000/4588595) loss:3.134 lr:0.0000100 epoch_Time:24389.0min: [2024-01-06 01:50:39,966][model8_pretrain.py][INFO] Epoch:[0/2](756000/4588595) loss:2.804 lr:0.0000100 
epoch_Time:24389.0min: [2024-01-06 01:50:39,966][model8_pretrain.py][INFO] Epoch:[0/2](756000/4588595) loss:2.907 lr:0.0000100 epoch_Time:24389.0min: [2024-01-06 01:50:39,966][model8_pretrain.py][INFO] Epoch:[0/2](756000/4588595) loss:1.934 lr:0.0000100 epoch_Time:24389.0min: [2024-01-06 01:50:39,966][model8_pretrain.py][INFO] Epoch:[0/2](756000/4588595) loss:2.351 lr:0.0000100 epoch_Time:24389.0min: [2024-01-06 01:50:39,967][model8_pretrain.py][INFO] Epoch:[0/2](756000/4588595) loss:3.298 lr:0.0000100 epoch_Time:24389.0min: [2024-01-06 01:51:20,464][model8_pretrain.py][INFO] Epoch:[0/2](756100/4588595) loss:2.870 lr:0.0000100 epoch_Time:24389.0min: [2024-01-06 01:51:20,464][model8_pretrain.py][INFO] Epoch:[0/2](756100/4588595) loss:2.658 lr:0.0000100 epoch_Time:24389.0min: [2024-01-06 01:51:20,464][model8_pretrain.py][INFO] Epoch:[0/2](756100/4588595) loss:3.115 lr:0.0000100 epoch_Time:24389.0min: [2024-01-06 01:51:20,464][model8_pretrain.py][INFO] Epoch:[0/2](756100/4588595) loss:2.583 lr:0.0000100 epoch_Time:24389.0min: [2024-01-06 01:51:20,464][model8_pretrain.py][INFO] Epoch:[0/2](756100/4588595) loss:2.983 lr:0.0000100 epoch_Time:24389.0min: [2024-01-06 01:51:20,464][model8_pretrain.py][INFO] Epoch:[0/2](756100/4588595) loss:2.936 lr:0.0000100 epoch_Time:24389.0min: [2024-01-06 01:51:20,464][model8_pretrain.py][INFO] Epoch:[0/2](756100/4588595) loss:3.045 lr:0.0000100 epoch_Time:24389.0min: [2024-01-06 01:51:20,464][model8_pretrain.py][INFO] Epoch:[0/2](756100/4588595) loss:2.362 lr:0.0000100 epoch_Time:24389.0min: [2024-01-06 01:52:04,314][model8_pretrain.py][INFO] Epoch:[0/2](756200/4588595) loss:3.326 lr:0.0000100 epoch_Time:24388.0min: [2024-01-06 01:52:04,314][model8_pretrain.py][INFO] Epoch:[0/2](756200/4588595) loss:2.640 lr:0.0000100 epoch_Time:24388.0min: [2024-01-06 01:52:04,314][model8_pretrain.py][INFO] Epoch:[0/2](756200/4588595) loss:2.299 lr:0.0000100 epoch_Time:24388.0min: [2024-01-06 01:52:04,314][model8_pretrain.py][INFO] 
Epoch:[0/2](756200/4588595) loss:3.032 lr:0.0000100 epoch_Time:24388.0min: [2024-01-06 01:52:04,314][model8_pretrain.py][INFO] Epoch:[0/2](756200/4588595) loss:2.578 lr:0.0000100 epoch_Time:24388.0min: [2024-01-06 01:52:04,314][model8_pretrain.py][INFO] Epoch:[0/2](756200/4588595) loss:2.930 lr:0.0000100 epoch_Time:24388.0min: [2024-01-06 01:52:04,314][model8_pretrain.py][INFO] Epoch:[0/2](756200/4588595) loss:3.115 lr:0.0000100 epoch_Time:24388.0min: [2024-01-06 01:52:04,314][model8_pretrain.py][INFO] Epoch:[0/2](756200/4588595) loss:2.181 lr:0.0000100 epoch_Time:24388.0min: [2024-01-06 01:52:41,267][model8_pretrain.py][INFO] Epoch:[0/2](756300/4588595) loss:2.094 lr:0.0000100 epoch_Time:24388.0min: [2024-01-06 01:52:41,267][model8_pretrain.py][INFO] Epoch:[0/2](756300/4588595) loss:2.214 lr:0.0000100 epoch_Time:24388.0min: [2024-01-06 01:52:41,267][model8_pretrain.py][INFO] Epoch:[0/2](756300/4588595) loss:2.839 lr:0.0000100 epoch_Time:24388.0min: [2024-01-06 01:52:41,267][model8_pretrain.py][INFO] Epoch:[0/2](756300/4588595) loss:2.737 lr:0.0000100 epoch_Time:24388.0min: [2024-01-06 01:52:41,268][model8_pretrain.py][INFO] Epoch:[0/2](756300/4588595) loss:2.405 lr:0.0000100 epoch_Time:24388.0min: [2024-01-06 01:52:41,267][model8_pretrain.py][INFO] Epoch:[0/2](756300/4588595) loss:2.742 lr:0.0000100 epoch_Time:24388.0min: [2024-01-06 01:52:41,267][model8_pretrain.py][INFO] Epoch:[0/2](756300/4588595) loss:2.903 lr:0.0000100 epoch_Time:24388.0min: [2024-01-06 01:52:41,267][model8_pretrain.py][INFO] Epoch:[0/2](756300/4588595) loss:2.730 lr:0.0000100 epoch_Time:24388.0min: [2024-01-06 01:53:18,224][model8_pretrain.py][INFO] Epoch:[0/2](756400/4588595) loss:2.101 lr:0.0000100 epoch_Time:24387.0min: [2024-01-06 01:53:18,224][model8_pretrain.py][INFO] Epoch:[0/2](756400/4588595) loss:2.783 lr:0.0000100 epoch_Time:24387.0min: [2024-01-06 01:53:18,224][model8_pretrain.py][INFO] Epoch:[0/2](756400/4588595) loss:2.691 lr:0.0000100 epoch_Time:24387.0min: [2024-01-06 
01:53:18,224][model8_pretrain.py][INFO] Epoch:[0/2](756400/4588595) loss:2.860 lr:0.0000100 epoch_Time:24387.0min: [2024-01-06 01:53:18,224][model8_pretrain.py][INFO] Epoch:[0/2](756400/4588595) loss:2.511 lr:0.0000100 epoch_Time:24387.0min: [2024-01-06 01:53:18,224][model8_pretrain.py][INFO] Epoch:[0/2](756400/4588595) loss:2.344 lr:0.0000100 epoch_Time:24387.0min: [2024-01-06 01:53:18,224][model8_pretrain.py][INFO] Epoch:[0/2](756400/4588595) loss:2.476 lr:0.0000100 epoch_Time:24387.0min: [2024-01-06 01:53:18,224][model8_pretrain.py][INFO] Epoch:[0/2](756400/4588595) loss:2.927 lr:0.0000100 epoch_Time:24387.0min: [2024-01-06 01:53:55,170][model8_pretrain.py][INFO] Epoch:[0/2](756500/4588595) loss:2.741 lr:0.0000100 epoch_Time:24386.0min: [2024-01-06 01:53:55,170][model8_pretrain.py][INFO] Epoch:[0/2](756500/4588595) loss:2.938 lr:0.0000100 epoch_Time:24386.0min: [2024-01-06 01:53:55,170][model8_pretrain.py][INFO] Epoch:[0/2](756500/4588595) loss:2.245 lr:0.0000100 epoch_Time:24386.0min: [2024-01-06 01:53:55,170][model8_pretrain.py][INFO] Epoch:[0/2](756500/4588595) loss:2.653 lr:0.0000100 epoch_Time:24386.0min: [2024-01-06 01:53:55,170][model8_pretrain.py][INFO] Epoch:[0/2](756500/4588595) loss:3.118 lr:0.0000100 epoch_Time:24386.0min: [2024-01-06 01:53:55,170][model8_pretrain.py][INFO] Epoch:[0/2](756500/4588595) loss:2.753 lr:0.0000100 epoch_Time:24386.0min: [2024-01-06 01:53:55,170][model8_pretrain.py][INFO] Epoch:[0/2](756500/4588595) loss:2.429 lr:0.0000100 epoch_Time:24386.0min: [2024-01-06 01:53:55,170][model8_pretrain.py][INFO] Epoch:[0/2](756500/4588595) loss:3.269 lr:0.0000100 epoch_Time:24386.0min: [2024-01-06 01:54:32,117][model8_pretrain.py][INFO] Epoch:[0/2](756600/4588595) loss:2.545 lr:0.0000100 epoch_Time:24386.0min: [2024-01-06 01:54:32,117][model8_pretrain.py][INFO] Epoch:[0/2](756600/4588595) loss:2.687 lr:0.0000100 epoch_Time:24386.0min: [2024-01-06 01:54:32,117][model8_pretrain.py][INFO] Epoch:[0/2](756600/4588595) loss:2.879 lr:0.0000100 
epoch_Time:24386.0min: [2024-01-06 01:54:32,117][model8_pretrain.py][INFO] Epoch:[0/2](756600/4588595) loss:2.349 lr:0.0000100 epoch_Time:24386.0min: [2024-01-06 01:54:32,117][model8_pretrain.py][INFO] Epoch:[0/2](756600/4588595) loss:2.570 lr:0.0000100 epoch_Time:24386.0min: [2024-01-06 01:54:32,117][model8_pretrain.py][INFO] Epoch:[0/2](756600/4588595) loss:2.528 lr:0.0000100 epoch_Time:24386.0min: [2024-01-06 01:54:32,117][model8_pretrain.py][INFO] Epoch:[0/2](756600/4588595) loss:3.120 lr:0.0000100 epoch_Time:24386.0min: [2024-01-06 01:54:32,118][model8_pretrain.py][INFO] Epoch:[0/2](756600/4588595) loss:3.307 lr:0.0000100 epoch_Time:24386.0min: [2024-01-06 01:55:09,085][model8_pretrain.py][INFO] Epoch:[0/2](756700/4588595) loss:3.073 lr:0.0000100 epoch_Time:24385.0min: [2024-01-06 01:55:09,085][model8_pretrain.py][INFO] Epoch:[0/2](756700/4588595) loss:2.799 lr:0.0000100 epoch_Time:24385.0min: [2024-01-06 01:55:09,085][model8_pretrain.py][INFO] Epoch:[0/2](756700/4588595) loss:3.048 lr:0.0000100 epoch_Time:24385.0min: [2024-01-06 01:55:09,085][model8_pretrain.py][INFO] Epoch:[0/2](756700/4588595) loss:2.936 lr:0.0000100 epoch_Time:24385.0min: [2024-01-06 01:55:09,085][model8_pretrain.py][INFO] Epoch:[0/2](756700/4588595) loss:2.754 lr:0.0000100 epoch_Time:24385.0min: [2024-01-06 01:55:09,085][model8_pretrain.py][INFO] Epoch:[0/2](756700/4588595) loss:2.272 lr:0.0000100 epoch_Time:24385.0min: [2024-01-06 01:55:09,085][model8_pretrain.py][INFO] Epoch:[0/2](756700/4588595) loss:2.668 lr:0.0000100 epoch_Time:24385.0min: [2024-01-06 01:55:09,086][model8_pretrain.py][INFO] Epoch:[0/2](756700/4588595) loss:2.810 lr:0.0000100 epoch_Time:24385.0min: [2024-01-06 01:55:46,037][model8_pretrain.py][INFO] Epoch:[0/2](756800/4588595) loss:2.486 lr:0.0000100 epoch_Time:24384.0min: [2024-01-06 01:55:46,037][model8_pretrain.py][INFO] Epoch:[0/2](756800/4588595) loss:2.123 lr:0.0000100 epoch_Time:24384.0min: [2024-01-06 01:55:46,037][model8_pretrain.py][INFO] 
Epoch:[0/2](756800/4588595) loss:2.593 lr:0.0000100 epoch_Time:24384.0min: [2024-01-06 01:55:46,037][model8_pretrain.py][INFO] Epoch:[0/2](756800/4588595) loss:2.987 lr:0.0000100 epoch_Time:24384.0min: [2024-01-06 01:55:46,037][model8_pretrain.py][INFO] Epoch:[0/2](756800/4588595) loss:3.012 lr:0.0000100 epoch_Time:24384.0min: [2024-01-06 01:55:46,037][model8_pretrain.py][INFO] Epoch:[0/2](756800/4588595) loss:2.780 lr:0.0000100 epoch_Time:24384.0min: [2024-01-06 01:55:46,037][model8_pretrain.py][INFO] Epoch:[0/2](756800/4588595) loss:3.099 lr:0.0000100 epoch_Time:24384.0min: [2024-01-06 01:55:46,037][model8_pretrain.py][INFO] Epoch:[0/2](756800/4588595) loss:3.269 lr:0.0000100 epoch_Time:24384.0min: [2024-01-06 01:56:26,466][model8_pretrain.py][INFO] Epoch:[0/2](756900/4588595) loss:3.301 lr:0.0000100 epoch_Time:24384.0min: [2024-01-06 01:56:26,466][model8_pretrain.py][INFO] Epoch:[0/2](756900/4588595) loss:2.633 lr:0.0000100 epoch_Time:24384.0min: [2024-01-06 01:56:26,466][model8_pretrain.py][INFO] Epoch:[0/2](756900/4588595) loss:2.631 lr:0.0000100 epoch_Time:24384.0min: [2024-01-06 01:56:26,470][model8_pretrain.py][INFO] Epoch:[0/2](756900/4588595) loss:3.165 lr:0.0000100 epoch_Time:24384.0min: [2024-01-06 01:56:26,471][model8_pretrain.py][INFO] Epoch:[0/2](756900/4588595) loss:2.486 lr:0.0000100 epoch_Time:24384.0min: [2024-01-06 01:56:26,471][model8_pretrain.py][INFO] Epoch:[0/2](756900/4588595) loss:2.312 lr:0.0000100 epoch_Time:24384.0min: [2024-01-06 01:56:26,471][model8_pretrain.py][INFO] Epoch:[0/2](756900/4588595) loss:2.878 lr:0.0000100 epoch_Time:24384.0min: [2024-01-06 01:56:26,471][model8_pretrain.py][INFO] Epoch:[0/2](756900/4588595) loss:2.871 lr:0.0000100 epoch_Time:24384.0min: [2024-01-06 01:57:10,325][model8_pretrain.py][INFO] Epoch:[0/2](757000/4588595) loss:2.722 lr:0.0000100 epoch_Time:24383.0min: [2024-01-06 01:57:10,325][model8_pretrain.py][INFO] Epoch:[0/2](757000/4588595) loss:1.828 lr:0.0000100 epoch_Time:24383.0min: [2024-01-06 
01:57:10,325][model8_pretrain.py][INFO] Epoch:[0/2](757000/4588595) loss:3.332 lr:0.0000100 epoch_Time:24383.0min: [2024-01-06 01:57:10,325][model8_pretrain.py][INFO] Epoch:[0/2](757000/4588595) loss:2.849 lr:0.0000100 epoch_Time:24383.0min: [2024-01-06 01:57:10,325][model8_pretrain.py][INFO] Epoch:[0/2](757000/4588595) loss:2.869 lr:0.0000100 epoch_Time:24383.0min: [2024-01-06 01:57:10,325][model8_pretrain.py][INFO] Epoch:[0/2](757000/4588595) loss:2.771 lr:0.0000100 epoch_Time:24383.0min: [2024-01-06 01:57:10,325][model8_pretrain.py][INFO] Epoch:[0/2](757000/4588595) loss:2.997 lr:0.0000100 epoch_Time:24383.0min: [2024-01-06 01:57:10,325][model8_pretrain.py][INFO] Epoch:[0/2](757000/4588595) loss:2.620 lr:0.0000100 epoch_Time:24383.0min: [2024-01-06 01:57:47,256][model8_pretrain.py][INFO] Epoch:[0/2](757100/4588595) loss:3.281 lr:0.0000100 epoch_Time:24383.0min: [2024-01-06 01:57:47,256][model8_pretrain.py][INFO] Epoch:[0/2](757100/4588595) loss:2.812 lr:0.0000100 epoch_Time:24383.0min: [2024-01-06 01:57:47,256][model8_pretrain.py][INFO] Epoch:[0/2](757100/4588595) loss:3.011 lr:0.0000100 epoch_Time:24383.0min: [2024-01-06 01:57:47,257][model8_pretrain.py][INFO] Epoch:[0/2](757100/4588595) loss:3.483 lr:0.0000100 epoch_Time:24383.0min: [2024-01-06 01:57:47,256][model8_pretrain.py][INFO] Epoch:[0/2](757100/4588595) loss:2.695 lr:0.0000100 epoch_Time:24383.0min: [2024-01-06 01:57:47,256][model8_pretrain.py][INFO] Epoch:[0/2](757100/4588595) loss:3.292 lr:0.0000100 epoch_Time:24383.0min: [2024-01-06 01:57:47,257][model8_pretrain.py][INFO] Epoch:[0/2](757100/4588595) loss:2.980 lr:0.0000100 epoch_Time:24383.0min: [2024-01-06 01:57:47,257][model8_pretrain.py][INFO] Epoch:[0/2](757100/4588595) loss:2.836 lr:0.0000100 epoch_Time:24383.0min: [2024-01-06 01:58:24,194][model8_pretrain.py][INFO] Epoch:[0/2](757200/4588595) loss:3.014 lr:0.0000100 epoch_Time:24382.0min: [2024-01-06 01:58:24,194][model8_pretrain.py][INFO] Epoch:[0/2](757200/4588595) loss:2.858 lr:0.0000100 
epoch_Time:24382.0min: [2024-01-06 01:58:24,194][model8_pretrain.py][INFO] Epoch:[0/2](757200/4588595) loss:2.722 lr:0.0000100 epoch_Time:24382.0min: [2024-01-06 01:58:24,194][model8_pretrain.py][INFO] Epoch:[0/2](757200/4588595) loss:3.439 lr:0.0000100 epoch_Time:24382.0min: [2024-01-06 01:58:24,194][model8_pretrain.py][INFO] Epoch:[0/2](757200/4588595) loss:3.213 lr:0.0000100 epoch_Time:24382.0min: [2024-01-06 01:58:24,194][model8_pretrain.py][INFO] Epoch:[0/2](757200/4588595) loss:2.477 lr:0.0000100 epoch_Time:24382.0min: [2024-01-06 01:58:24,194][model8_pretrain.py][INFO] Epoch:[0/2](757200/4588595) loss:3.069 lr:0.0000100 epoch_Time:24382.0min: [2024-01-06 01:58:24,194][model8_pretrain.py][INFO] Epoch:[0/2](757200/4588595) loss:2.459 lr:0.0000100 epoch_Time:24382.0min: [2024-01-06 01:59:01,139][model8_pretrain.py][INFO] Epoch:[0/2](757300/4588595) loss:2.304 lr:0.0000100 epoch_Time:24381.0min: [2024-01-06 01:59:01,139][model8_pretrain.py][INFO] Epoch:[0/2](757300/4588595) loss:3.061 lr:0.0000100 epoch_Time:24381.0min: [2024-01-06 01:59:01,139][model8_pretrain.py][INFO] Epoch:[0/2](757300/4588595) loss:3.318 lr:0.0000100 epoch_Time:24381.0min: [2024-01-06 01:59:01,139][model8_pretrain.py][INFO] Epoch:[0/2](757300/4588595) loss:3.063 lr:0.0000100 epoch_Time:24381.0min: [2024-01-06 01:59:01,139][model8_pretrain.py][INFO] Epoch:[0/2](757300/4588595) loss:2.780 lr:0.0000100 epoch_Time:24381.0min: [2024-01-06 01:59:01,139][model8_pretrain.py][INFO] Epoch:[0/2](757300/4588595) loss:2.434 lr:0.0000100 epoch_Time:24381.0min: [2024-01-06 01:59:01,139][model8_pretrain.py][INFO] Epoch:[0/2](757300/4588595) loss:3.112 lr:0.0000100 epoch_Time:24381.0min: [2024-01-06 01:59:01,139][model8_pretrain.py][INFO] Epoch:[0/2](757300/4588595) loss:2.376 lr:0.0000100 epoch_Time:24381.0min: [2024-01-06 01:59:38,076][model8_pretrain.py][INFO] Epoch:[0/2](757400/4588595) loss:2.824 lr:0.0000100 epoch_Time:24381.0min: [2024-01-06 01:59:38,076][model8_pretrain.py][INFO] 
Epoch:[0/2](757400/4588595) loss:2.402 lr:0.0000100 epoch_Time:24381.0min: [2024-01-06 01:59:38,076][model8_pretrain.py][INFO] Epoch:[0/2](757400/4588595) loss:2.064 lr:0.0000100 epoch_Time:24381.0min: [2024-01-06 01:59:38,076][model8_pretrain.py][INFO] Epoch:[0/2](757400/4588595) loss:2.406 lr:0.0000100 epoch_Time:24381.0min: [2024-01-06 01:59:38,076][model8_pretrain.py][INFO] Epoch:[0/2](757400/4588595) loss:2.879 lr:0.0000100 epoch_Time:24381.0min: [2024-01-06 01:59:38,076][model8_pretrain.py][INFO] Epoch:[0/2](757400/4588595) loss:2.941 lr:0.0000100 epoch_Time:24381.0min: [2024-01-06 01:59:38,077][model8_pretrain.py][INFO] Epoch:[0/2](757400/4588595) loss:2.772 lr:0.0000100 epoch_Time:24381.0min: [2024-01-06 01:59:38,077][model8_pretrain.py][INFO] Epoch:[0/2](757400/4588595) loss:2.928 lr:0.0000100 epoch_Time:24381.0min: [2024-01-06 02:00:15,016][model8_pretrain.py][INFO] Epoch:[0/2](757500/4588595) loss:2.184 lr:0.0000100 epoch_Time:24380.0min: [2024-01-06 02:00:15,016][model8_pretrain.py][INFO] Epoch:[0/2](757500/4588595) loss:2.687 lr:0.0000100 epoch_Time:24380.0min: [2024-01-06 02:00:15,016][model8_pretrain.py][INFO] Epoch:[0/2](757500/4588595) loss:2.856 lr:0.0000100 epoch_Time:24380.0min: [2024-01-06 02:00:15,016][model8_pretrain.py][INFO] Epoch:[0/2](757500/4588595) loss:3.162 lr:0.0000100 epoch_Time:24380.0min: [2024-01-06 02:00:15,016][model8_pretrain.py][INFO] Epoch:[0/2](757500/4588595) loss:2.907 lr:0.0000100 epoch_Time:24380.0min: [2024-01-06 02:00:15,016][model8_pretrain.py][INFO] Epoch:[0/2](757500/4588595) loss:2.710 lr:0.0000100 epoch_Time:24380.0min: [2024-01-06 02:00:15,016][model8_pretrain.py][INFO] Epoch:[0/2](757500/4588595) loss:3.147 lr:0.0000100 epoch_Time:24380.0min: [2024-01-06 02:00:15,016][model8_pretrain.py][INFO] Epoch:[0/2](757500/4588595) loss:2.596 lr:0.0000100 epoch_Time:24380.0min: [2024-01-06 02:00:51,955][model8_pretrain.py][INFO] Epoch:[0/2](757600/4588595) loss:2.708 lr:0.0000100 epoch_Time:24379.0min: [2024-01-06 
02:00:51,955][model8_pretrain.py][INFO] Epoch:[0/2](757600/4588595) loss:2.989 lr:0.0000100 epoch_Time:24379.0min: [2024-01-06 02:00:51,955][model8_pretrain.py][INFO] Epoch:[0/2](757600/4588595) loss:2.681 lr:0.0000100 epoch_Time:24378.0min: [2024-01-06 02:00:51,955][model8_pretrain.py][INFO] Epoch:[0/2](757600/4588595) loss:2.558 lr:0.0000100 epoch_Time:24379.0min: [2024-01-06 02:00:51,955][model8_pretrain.py][INFO] Epoch:[0/2](757600/4588595) loss:2.800 lr:0.0000100 epoch_Time:24379.0min: [2024-01-06 02:00:51,955][model8_pretrain.py][INFO] Epoch:[0/2](757600/4588595) loss:2.659 lr:0.0000100 epoch_Time:24379.0min: [2024-01-06 02:00:51,955][model8_pretrain.py][INFO] Epoch:[0/2](757600/4588595) loss:3.023 lr:0.0000100 epoch_Time:24378.0min: [2024-01-06 02:00:51,956][model8_pretrain.py][INFO] Epoch:[0/2](757600/4588595) loss:2.805 lr:0.0000100 epoch_Time:24379.0min: [2024-01-06 02:01:30,639][model8_pretrain.py][INFO] Epoch:[0/2](757700/4588595) loss:2.498 lr:0.0000100 epoch_Time:24379.0min: [2024-01-06 02:01:30,639][model8_pretrain.py][INFO] Epoch:[0/2](757700/4588595) loss:2.478 lr:0.0000100 epoch_Time:24379.0min: [2024-01-06 02:01:30,639][model8_pretrain.py][INFO] Epoch:[0/2](757700/4588595) loss:3.188 lr:0.0000100 epoch_Time:24379.0min: [2024-01-06 02:01:30,639][model8_pretrain.py][INFO] Epoch:[0/2](757700/4588595) loss:2.748 lr:0.0000100 epoch_Time:24379.0min: [2024-01-06 02:01:30,639][model8_pretrain.py][INFO] Epoch:[0/2](757700/4588595) loss:2.043 lr:0.0000100 epoch_Time:24379.0min: [2024-01-06 02:01:30,639][model8_pretrain.py][INFO] Epoch:[0/2](757700/4588595) loss:2.743 lr:0.0000100 epoch_Time:24379.0min: [2024-01-06 02:01:30,639][model8_pretrain.py][INFO] Epoch:[0/2](757700/4588595) loss:3.052 lr:0.0000100 epoch_Time:24379.0min: [2024-01-06 02:01:30,640][model8_pretrain.py][INFO] Epoch:[0/2](757700/4588595) loss:2.975 lr:0.0000100 epoch_Time:24379.0min: [2024-01-06 02:02:16,278][model8_pretrain.py][INFO] Epoch:[0/2](757800/4588595) loss:2.641 lr:0.0000100 
epoch_Time:24378.0min: [2024-01-06 02:02:16,278][model8_pretrain.py][INFO] Epoch:[0/2](757800/4588595) loss:3.697 lr:0.0000100 epoch_Time:24378.0min: [2024-01-06 02:02:16,278][model8_pretrain.py][INFO] Epoch:[0/2](757800/4588595) loss:3.297 lr:0.0000100 epoch_Time:24378.0min: [2024-01-06 02:02:16,278][model8_pretrain.py][INFO] Epoch:[0/2](757800/4588595) loss:2.838 lr:0.0000100 epoch_Time:24378.0min: [2024-01-06 02:02:16,278][model8_pretrain.py][INFO] Epoch:[0/2](757800/4588595) loss:2.657 lr:0.0000100 epoch_Time:24378.0min: [2024-01-06 02:02:16,278][model8_pretrain.py][INFO] Epoch:[0/2](757800/4588595) loss:2.796 lr:0.0000100 epoch_Time:24378.0min: [2024-01-06 02:02:16,278][model8_pretrain.py][INFO] Epoch:[0/2](757800/4588595) loss:2.574 lr:0.0000100 epoch_Time:24378.0min: [2024-01-06 02:02:16,278][model8_pretrain.py][INFO] Epoch:[0/2](757800/4588595) loss:2.775 lr:0.0000100 epoch_Time:24378.0min: [2024-01-06 02:02:53,216][model8_pretrain.py][INFO] Epoch:[0/2](757900/4588595) loss:2.214 lr:0.0000100 epoch_Time:24377.0min: [2024-01-06 02:02:53,216][model8_pretrain.py][INFO] Epoch:[0/2](757900/4588595) loss:3.027 lr:0.0000100 epoch_Time:24377.0min: [2024-01-06 02:02:53,216][model8_pretrain.py][INFO] Epoch:[0/2](757900/4588595) loss:2.653 lr:0.0000100 epoch_Time:24377.0min: [2024-01-06 02:02:53,216][model8_pretrain.py][INFO] Epoch:[0/2](757900/4588595) loss:2.948 lr:0.0000100 epoch_Time:24377.0min: [2024-01-06 02:02:53,216][model8_pretrain.py][INFO] Epoch:[0/2](757900/4588595) loss:2.944 lr:0.0000100 epoch_Time:24377.0min: [2024-01-06 02:02:53,216][model8_pretrain.py][INFO] Epoch:[0/2](757900/4588595) loss:2.674 lr:0.0000100 epoch_Time:24377.0min: [2024-01-06 02:02:53,216][model8_pretrain.py][INFO] Epoch:[0/2](757900/4588595) loss:2.712 lr:0.0000100 epoch_Time:24377.0min: [2024-01-06 02:02:53,216][model8_pretrain.py][INFO] Epoch:[0/2](757900/4588595) loss:3.095 lr:0.0000100 epoch_Time:24377.0min: [2024-01-06 02:03:30,164][model8_pretrain.py][INFO] 
Epoch:[0/2](758000/4588595) loss:2.438 lr:0.0000100 epoch_Time:24377.0min: [2024-01-06 02:03:30,164][model8_pretrain.py][INFO] Epoch:[0/2](758000/4588595) loss:3.305 lr:0.0000100 epoch_Time:24377.0min: [2024-01-06 02:03:30,164][model8_pretrain.py][INFO] Epoch:[0/2](758000/4588595) loss:2.431 lr:0.0000100 epoch_Time:24377.0min: [2024-01-06 02:03:30,164][model8_pretrain.py][INFO] Epoch:[0/2](758000/4588595) loss:3.042 lr:0.0000100 epoch_Time:24377.0min: [2024-01-06 02:03:30,165][model8_pretrain.py][INFO] Epoch:[0/2](758000/4588595) loss:2.817 lr:0.0000100 epoch_Time:24377.0min: [2024-01-06 02:03:30,165][model8_pretrain.py][INFO] Epoch:[0/2](758000/4588595) loss:3.289 lr:0.0000100 epoch_Time:24377.0min: [2024-01-06 02:03:30,165][model8_pretrain.py][INFO] Epoch:[0/2](758000/4588595) loss:3.134 lr:0.0000100 epoch_Time:24377.0min: [2024-01-06 02:03:30,165][model8_pretrain.py][INFO] Epoch:[0/2](758000/4588595) loss:2.311 lr:0.0000100 epoch_Time:24377.0min: [2024-01-06 02:04:07,117][model8_pretrain.py][INFO] Epoch:[0/2](758100/4588595) loss:2.868 lr:0.0000100 epoch_Time:24376.0min: [2024-01-06 02:04:07,117][model8_pretrain.py][INFO] Epoch:[0/2](758100/4588595) loss:2.660 lr:0.0000100 epoch_Time:24376.0min: [2024-01-06 02:04:07,117][model8_pretrain.py][INFO] Epoch:[0/2](758100/4588595) loss:2.834 lr:0.0000100 epoch_Time:24376.0min: [2024-01-06 02:04:07,117][model8_pretrain.py][INFO] Epoch:[0/2](758100/4588595) loss:2.955 lr:0.0000100 epoch_Time:24376.0min: [2024-01-06 02:04:07,117][model8_pretrain.py][INFO] Epoch:[0/2](758100/4588595) loss:2.812 lr:0.0000100 epoch_Time:24376.0min: [2024-01-06 02:04:07,117][model8_pretrain.py][INFO] Epoch:[0/2](758100/4588595) loss:2.856 lr:0.0000100 epoch_Time:24376.0min: [2024-01-06 02:04:07,117][model8_pretrain.py][INFO] Epoch:[0/2](758100/4588595) loss:2.378 lr:0.0000100 epoch_Time:24376.0min: [2024-01-06 02:04:07,117][model8_pretrain.py][INFO] Epoch:[0/2](758100/4588595) loss:2.689 lr:0.0000100 epoch_Time:24376.0min: [2024-01-06 
02:04:44,075][model8_pretrain.py][INFO] Epoch:[0/2](758200/4588595) loss:2.453 lr:0.0000100 epoch_Time:24376.0min: [2024-01-06 02:04:44,075][model8_pretrain.py][INFO] Epoch:[0/2](758200/4588595) loss:2.922 lr:0.0000100 epoch_Time:24376.0min: [2024-01-06 02:04:44,075][model8_pretrain.py][INFO] Epoch:[0/2](758200/4588595) loss:3.207 lr:0.0000100 epoch_Time:24376.0min: [2024-01-06 02:04:44,075][model8_pretrain.py][INFO] Epoch:[0/2](758200/4588595) loss:2.908 lr:0.0000100 epoch_Time:24376.0min: [2024-01-06 02:04:44,075][model8_pretrain.py][INFO] Epoch:[0/2](758200/4588595) loss:3.152 lr:0.0000100 epoch_Time:24376.0min: [2024-01-06 02:04:44,075][model8_pretrain.py][INFO] Epoch:[0/2](758200/4588595) loss:2.724 lr:0.0000100 epoch_Time:24376.0min: [2024-01-06 02:04:44,075][model8_pretrain.py][INFO] Epoch:[0/2](758200/4588595) loss:2.808 lr:0.0000100 epoch_Time:24376.0min: [2024-01-06 02:04:44,075][model8_pretrain.py][INFO] Epoch:[0/2](758200/4588595) loss:3.214 lr:0.0000100 epoch_Time:24376.0min: [2024-01-06 02:05:21,028][model8_pretrain.py][INFO] Epoch:[0/2](758300/4588595) loss:2.617 lr:0.0000100 epoch_Time:24375.0min: [2024-01-06 02:05:21,028][model8_pretrain.py][INFO] Epoch:[0/2](758300/4588595) loss:2.374 lr:0.0000100 epoch_Time:24375.0min: [2024-01-06 02:05:21,028][model8_pretrain.py][INFO] Epoch:[0/2](758300/4588595) loss:1.892 lr:0.0000100 epoch_Time:24375.0min: [2024-01-06 02:05:21,028][model8_pretrain.py][INFO] Epoch:[0/2](758300/4588595) loss:2.973 lr:0.0000100 epoch_Time:24375.0min: [2024-01-06 02:05:21,028][model8_pretrain.py][INFO] Epoch:[0/2](758300/4588595) loss:3.420 lr:0.0000100 epoch_Time:24375.0min: [2024-01-06 02:05:21,028][model8_pretrain.py][INFO] Epoch:[0/2](758300/4588595) loss:2.468 lr:0.0000100 epoch_Time:24375.0min: [2024-01-06 02:05:21,028][model8_pretrain.py][INFO] Epoch:[0/2](758300/4588595) loss:2.668 lr:0.0000100 epoch_Time:24375.0min: [2024-01-06 02:05:21,028][model8_pretrain.py][INFO] Epoch:[0/2](758300/4588595) loss:3.091 lr:0.0000100 
epoch_Time:24375.0min: [2024-01-06 02:05:57,945][model8_pretrain.py][INFO] Epoch:[0/2](758400/4588595) loss:2.629 lr:0.0000100 epoch_Time:24374.0min: [2024-01-06 02:05:57,946][model8_pretrain.py][INFO] Epoch:[0/2](758400/4588595) loss:3.116 lr:0.0000100 epoch_Time:24374.0min: [2024-01-06 02:05:57,946][model8_pretrain.py][INFO] Epoch:[0/2](758400/4588595) loss:2.886 lr:0.0000100 epoch_Time:24374.0min: [2024-01-06 02:05:57,946][model8_pretrain.py][INFO] Epoch:[0/2](758400/4588595) loss:2.939 lr:0.0000100 epoch_Time:24374.0min: [2024-01-06 02:05:57,946][model8_pretrain.py][INFO] Epoch:[0/2](758400/4588595) loss:2.800 lr:0.0000100 epoch_Time:24374.0min: [2024-01-06 02:05:57,946][model8_pretrain.py][INFO] Epoch:[0/2](758400/4588595) loss:2.942 lr:0.0000100 epoch_Time:24374.0min: [2024-01-06 02:05:57,946][model8_pretrain.py][INFO] Epoch:[0/2](758400/4588595) loss:2.365 lr:0.0000100 epoch_Time:24374.0min: [2024-01-06 02:05:57,946][model8_pretrain.py][INFO] Epoch:[0/2](758400/4588595) loss:3.244 lr:0.0000100 epoch_Time:24374.0min: [2024-01-06 02:06:36,527][model8_pretrain.py][INFO] Epoch:[0/2](758500/4588595) loss:3.160 lr:0.0000100 epoch_Time:24374.0min: [2024-01-06 02:06:36,527][model8_pretrain.py][INFO] Epoch:[0/2](758500/4588595) loss:3.034 lr:0.0000100 epoch_Time:24374.0min: [2024-01-06 02:06:36,527][model8_pretrain.py][INFO] Epoch:[0/2](758500/4588595) loss:2.841 lr:0.0000100 epoch_Time:24374.0min: [2024-01-06 02:06:36,527][model8_pretrain.py][INFO] Epoch:[0/2](758500/4588595) loss:2.896 lr:0.0000100 epoch_Time:24374.0min: [2024-01-06 02:06:36,527][model8_pretrain.py][INFO] Epoch:[0/2](758500/4588595) loss:2.437 lr:0.0000100 epoch_Time:24374.0min: [2024-01-06 02:06:36,527][model8_pretrain.py][INFO] Epoch:[0/2](758500/4588595) loss:2.950 lr:0.0000100 epoch_Time:24374.0min: [2024-01-06 02:06:36,528][model8_pretrain.py][INFO] Epoch:[0/2](758500/4588595) loss:1.764 lr:0.0000100 epoch_Time:24374.0min: [2024-01-06 02:06:36,528][model8_pretrain.py][INFO] 
Epoch:[0/2](758500/4588595) loss:3.212 lr:0.0000100 epoch_Time:24374.0min: [2024-01-06 02:07:22,293][model8_pretrain.py][INFO] Epoch:[0/2](758600/4588595) loss:2.730 lr:0.0000100 epoch_Time:24373.0min: [2024-01-06 02:07:22,293][model8_pretrain.py][INFO] Epoch:[0/2](758600/4588595) loss:3.276 lr:0.0000100 epoch_Time:24373.0min: [2024-01-06 02:07:22,293][model8_pretrain.py][INFO] Epoch:[0/2](758600/4588595) loss:2.495 lr:0.0000100 epoch_Time:24373.0min: [2024-01-06 02:07:22,293][model8_pretrain.py][INFO] Epoch:[0/2](758600/4588595) loss:3.131 lr:0.0000100 epoch_Time:24373.0min: [2024-01-06 02:07:22,293][model8_pretrain.py][INFO] Epoch:[0/2](758600/4588595) loss:3.217 lr:0.0000100 epoch_Time:24373.0min: [2024-01-06 02:07:22,293][model8_pretrain.py][INFO] Epoch:[0/2](758600/4588595) loss:3.061 lr:0.0000100 epoch_Time:24373.0min: [2024-01-06 02:07:22,293][model8_pretrain.py][INFO] Epoch:[0/2](758600/4588595) loss:3.610 lr:0.0000100 epoch_Time:24373.0min: [2024-01-06 02:07:22,293][model8_pretrain.py][INFO] Epoch:[0/2](758600/4588595) loss:3.315 lr:0.0000100 epoch_Time:24373.0min: [2024-01-06 02:07:59,225][model8_pretrain.py][INFO] Epoch:[0/2](758700/4588595) loss:2.738 lr:0.0000100 epoch_Time:24372.0min: [2024-01-06 02:07:59,225][model8_pretrain.py][INFO] Epoch:[0/2](758700/4588595) loss:2.893 lr:0.0000100 epoch_Time:24372.0min: [2024-01-06 02:07:59,225][model8_pretrain.py][INFO] Epoch:[0/2](758700/4588595) loss:2.911 lr:0.0000100 epoch_Time:24372.0min: [2024-01-06 02:07:59,225][model8_pretrain.py][INFO] Epoch:[0/2](758700/4588595) loss:2.462 lr:0.0000100 epoch_Time:24372.0min: [2024-01-06 02:07:59,225][model8_pretrain.py][INFO] Epoch:[0/2](758700/4588595) loss:2.713 lr:0.0000100 epoch_Time:24372.0min: [2024-01-06 02:07:59,225][model8_pretrain.py][INFO] Epoch:[0/2](758700/4588595) loss:2.680 lr:0.0000100 epoch_Time:24372.0min: [2024-01-06 02:07:59,225][model8_pretrain.py][INFO] Epoch:[0/2](758700/4588595) loss:2.996 lr:0.0000100 epoch_Time:24372.0min: [2024-01-06 
02:07:59,226][model8_pretrain.py][INFO] Epoch:[0/2](758700/4588595) loss:3.062 lr:0.0000100 epoch_Time:24372.0min: [2024-01-06 02:08:36,162][model8_pretrain.py][INFO] Epoch:[0/2](758800/4588595) loss:2.445 lr:0.0000100 epoch_Time:24372.0min: [2024-01-06 02:08:36,162][model8_pretrain.py][INFO] Epoch:[0/2](758800/4588595) loss:2.888 lr:0.0000100 epoch_Time:24372.0min: [2024-01-06 02:08:36,162][model8_pretrain.py][INFO] Epoch:[0/2](758800/4588595) loss:2.823 lr:0.0000100 epoch_Time:24372.0min: [2024-01-06 02:08:36,162][model8_pretrain.py][INFO] Epoch:[0/2](758800/4588595) loss:2.960 lr:0.0000100 epoch_Time:24372.0min: [2024-01-06 02:08:36,162][model8_pretrain.py][INFO] Epoch:[0/2](758800/4588595) loss:2.848 lr:0.0000100 epoch_Time:24372.0min: [2024-01-06 02:08:36,162][model8_pretrain.py][INFO] Epoch:[0/2](758800/4588595) loss:2.369 lr:0.0000100 epoch_Time:24372.0min: [2024-01-06 02:08:36,162][model8_pretrain.py][INFO] Epoch:[0/2](758800/4588595) loss:2.567 lr:0.0000100 epoch_Time:24372.0min: [2024-01-06 02:08:36,162][model8_pretrain.py][INFO] Epoch:[0/2](758800/4588595) loss:3.069 lr:0.0000100 epoch_Time:24372.0min: [2024-01-06 02:09:13,098][model8_pretrain.py][INFO] Epoch:[0/2](758900/4588595) loss:2.691 lr:0.0000100 epoch_Time:24371.0min: [2024-01-06 02:09:13,098][model8_pretrain.py][INFO] Epoch:[0/2](758900/4588595) loss:3.134 lr:0.0000100 epoch_Time:24371.0min: [2024-01-06 02:09:13,098][model8_pretrain.py][INFO] Epoch:[0/2](758900/4588595) loss:2.493 lr:0.0000100 epoch_Time:24371.0min: [2024-01-06 02:09:13,098][model8_pretrain.py][INFO] Epoch:[0/2](758900/4588595) loss:2.683 lr:0.0000100 epoch_Time:24371.0min: [2024-01-06 02:09:13,098][model8_pretrain.py][INFO] Epoch:[0/2](758900/4588595) loss:2.784 lr:0.0000100 epoch_Time:24371.0min: [2024-01-06 02:09:13,098][model8_pretrain.py][INFO] Epoch:[0/2](758900/4588595) loss:2.944 lr:0.0000100 epoch_Time:24371.0min: [2024-01-06 02:09:13,098][model8_pretrain.py][INFO] Epoch:[0/2](758900/4588595) loss:2.929 lr:0.0000100 
epoch_Time:24371.0min: [2024-01-06 02:09:13,098][model8_pretrain.py][INFO] Epoch:[0/2](758900/4588595) loss:2.605 lr:0.0000100 epoch_Time:24371.0min: [2024-01-06 02:09:50,031][model8_pretrain.py][INFO] Epoch:[0/2](759000/4588595) loss:2.475 lr:0.0000100 epoch_Time:24370.0min: [2024-01-06 02:09:50,031][model8_pretrain.py][INFO] Epoch:[0/2](759000/4588595) loss:2.782 lr:0.0000100 epoch_Time:24370.0min: [2024-01-06 02:09:50,031][model8_pretrain.py][INFO] Epoch:[0/2](759000/4588595) loss:2.705 lr:0.0000100 epoch_Time:24370.0min: [2024-01-06 02:09:50,031][model8_pretrain.py][INFO] Epoch:[0/2](759000/4588595) loss:2.970 lr:0.0000100 epoch_Time:24370.0min: [2024-01-06 02:09:50,031][model8_pretrain.py][INFO] Epoch:[0/2](759000/4588595) loss:2.716 lr:0.0000100 epoch_Time:24370.0min: [2024-01-06 02:09:50,031][model8_pretrain.py][INFO] Epoch:[0/2](759000/4588595) loss:3.125 lr:0.0000100 epoch_Time:24370.0min: [2024-01-06 02:09:50,031][model8_pretrain.py][INFO] Epoch:[0/2](759000/4588595) loss:2.397 lr:0.0000100 epoch_Time:24370.0min: [2024-01-06 02:09:50,032][model8_pretrain.py][INFO] Epoch:[0/2](759000/4588595) loss:2.850 lr:0.0000100 epoch_Time:24370.0min: [2024-01-06 02:10:26,970][model8_pretrain.py][INFO] Epoch:[0/2](759100/4588595) loss:2.560 lr:0.0000100 epoch_Time:24370.0min: [2024-01-06 02:10:26,970][model8_pretrain.py][INFO] Epoch:[0/2](759100/4588595) loss:2.738 lr:0.0000100 epoch_Time:24370.0min: [2024-01-06 02:10:26,970][model8_pretrain.py][INFO] Epoch:[0/2](759100/4588595) loss:2.433 lr:0.0000100 epoch_Time:24370.0min: [2024-01-06 02:10:26,970][model8_pretrain.py][INFO] Epoch:[0/2](759100/4588595) loss:2.944 lr:0.0000100 epoch_Time:24370.0min: [2024-01-06 02:10:26,970][model8_pretrain.py][INFO] Epoch:[0/2](759100/4588595) loss:2.659 lr:0.0000100 epoch_Time:24370.0min: [2024-01-06 02:10:26,970][model8_pretrain.py][INFO] Epoch:[0/2](759100/4588595) loss:3.148 lr:0.0000100 epoch_Time:24370.0min: [2024-01-06 02:10:26,970][model8_pretrain.py][INFO] 
Epoch:[0/2](759100/4588595) loss:2.650 lr:0.0000100 epoch_Time:24370.0min: [2024-01-06 02:10:26,970][model8_pretrain.py][INFO] Epoch:[0/2](759100/4588595) loss:2.743 lr:0.0000100 epoch_Time:24370.0min: [2024-01-06 02:11:03,911][model8_pretrain.py][INFO] Epoch:[0/2](759200/4588595) loss:2.754 lr:0.0000100 epoch_Time:24369.0min: [2024-01-06 02:11:03,911][model8_pretrain.py][INFO] Epoch:[0/2](759200/4588595) loss:2.353 lr:0.0000100 epoch_Time:24369.0min: [2024-01-06 02:11:03,911][model8_pretrain.py][INFO] Epoch:[0/2](759200/4588595) loss:1.669 lr:0.0000100 epoch_Time:24369.0min: [2024-01-06 02:11:03,911][model8_pretrain.py][INFO] Epoch:[0/2](759200/4588595) loss:2.568 lr:0.0000100 epoch_Time:24369.0min: [2024-01-06 02:11:03,911][model8_pretrain.py][INFO] Epoch:[0/2](759200/4588595) loss:2.837 lr:0.0000100 epoch_Time:24369.0min: [2024-01-06 02:11:03,911][model8_pretrain.py][INFO] Epoch:[0/2](759200/4588595) loss:3.006 lr:0.0000100 epoch_Time:24369.0min: [2024-01-06 02:11:03,911][model8_pretrain.py][INFO] Epoch:[0/2](759200/4588595) loss:2.864 lr:0.0000100 epoch_Time:24369.0min: [2024-01-06 02:11:03,911][model8_pretrain.py][INFO] Epoch:[0/2](759200/4588595) loss:2.072 lr:0.0000100 epoch_Time:24369.0min: [2024-01-06 02:11:40,843][model8_pretrain.py][INFO] Epoch:[0/2](759300/4588595) loss:2.981 lr:0.0000100 epoch_Time:24368.0min: [2024-01-06 02:11:40,843][model8_pretrain.py][INFO] Epoch:[0/2](759300/4588595) loss:2.806 lr:0.0000100 epoch_Time:24368.0min: [2024-01-06 02:11:40,843][model8_pretrain.py][INFO] Epoch:[0/2](759300/4588595) loss:3.101 lr:0.0000100 epoch_Time:24368.0min: [2024-01-06 02:11:40,843][model8_pretrain.py][INFO] Epoch:[0/2](759300/4588595) loss:2.119 lr:0.0000100 epoch_Time:24368.0min: [2024-01-06 02:11:40,843][model8_pretrain.py][INFO] Epoch:[0/2](759300/4588595) loss:3.162 lr:0.0000100 epoch_Time:24368.0min: [2024-01-06 02:11:40,843][model8_pretrain.py][INFO] Epoch:[0/2](759300/4588595) loss:2.060 lr:0.0000100 epoch_Time:24368.0min: [2024-01-06 
02:11:40,843][model8_pretrain.py][INFO] Epoch:[0/2](759300/4588595) loss:3.306 lr:0.0000100 epoch_Time:24368.0min: [2024-01-06 02:11:40,843][model8_pretrain.py][INFO] Epoch:[0/2](759300/4588595) loss:2.885 lr:0.0000100 epoch_Time:24368.0min: [2024-01-06 02:12:27,877][model8_pretrain.py][INFO] Epoch:[0/2](759400/4588595) loss:2.936 lr:0.0000100 epoch_Time:24368.0min: [2024-01-06 02:12:27,877][model8_pretrain.py][INFO] Epoch:[0/2](759400/4588595) loss:2.296 lr:0.0000100 epoch_Time:24368.0min: [2024-01-06 02:12:27,877][model8_pretrain.py][INFO] Epoch:[0/2](759400/4588595) loss:2.418 lr:0.0000100 epoch_Time:24368.0min: [2024-01-06 02:12:27,878][model8_pretrain.py][INFO] Epoch:[0/2](759400/4588595) loss:3.264 lr:0.0000100 epoch_Time:24368.0min: [2024-01-06 02:12:27,878][model8_pretrain.py][INFO] Epoch:[0/2](759400/4588595) loss:2.454 lr:0.0000100 epoch_Time:24368.0min: [2024-01-06 02:12:27,878][model8_pretrain.py][INFO] Epoch:[0/2](759400/4588595) loss:3.217 lr:0.0000100 epoch_Time:24368.0min: [2024-01-06 02:12:27,878][model8_pretrain.py][INFO] Epoch:[0/2](759400/4588595) loss:2.445 lr:0.0000100 epoch_Time:24368.0min: [2024-01-06 02:12:27,878][model8_pretrain.py][INFO] Epoch:[0/2](759400/4588595) loss:2.918 lr:0.0000100 epoch_Time:24368.0min: [2024-01-06 02:13:04,808][model8_pretrain.py][INFO] Epoch:[0/2](759500/4588595) loss:3.115 lr:0.0000100 epoch_Time:24367.0min: [2024-01-06 02:13:04,808][model8_pretrain.py][INFO] Epoch:[0/2](759500/4588595) loss:3.303 lr:0.0000100 epoch_Time:24367.0min: [2024-01-06 02:13:04,808][model8_pretrain.py][INFO] Epoch:[0/2](759500/4588595) loss:2.905 lr:0.0000100 epoch_Time:24367.0min: [2024-01-06 02:13:04,808][model8_pretrain.py][INFO] Epoch:[0/2](759500/4588595) loss:3.099 lr:0.0000100 epoch_Time:24367.0min: [2024-01-06 02:13:04,808][model8_pretrain.py][INFO] Epoch:[0/2](759500/4588595) loss:2.465 lr:0.0000100 epoch_Time:24367.0min: [2024-01-06 02:13:04,808][model8_pretrain.py][INFO] Epoch:[0/2](759500/4588595) loss:1.930 lr:0.0000100 
epoch_Time:24367.0min: [2024-01-06 02:13:04,808][model8_pretrain.py][INFO] Epoch:[0/2](759500/4588595) loss:2.807 lr:0.0000100 epoch_Time:24367.0min: [2024-01-06 02:13:04,808][model8_pretrain.py][INFO] Epoch:[0/2](759500/4588595) loss:3.048 lr:0.0000100 epoch_Time:24367.0min: [2024-01-06 02:13:41,737][model8_pretrain.py][INFO] Epoch:[0/2](759600/4588595) loss:2.900 lr:0.0000100 epoch_Time:24367.0min: [2024-01-06 02:13:41,737][model8_pretrain.py][INFO] Epoch:[0/2](759600/4588595) loss:2.920 lr:0.0000100 epoch_Time:24367.0min: [2024-01-06 02:13:41,737][model8_pretrain.py][INFO] Epoch:[0/2](759600/4588595) loss:2.753 lr:0.0000100 epoch_Time:24367.0min: [2024-01-06 02:13:41,737][model8_pretrain.py][INFO] Epoch:[0/2](759600/4588595) loss:2.832 lr:0.0000100 epoch_Time:24367.0min: [2024-01-06 02:13:41,737][model8_pretrain.py][INFO] Epoch:[0/2](759600/4588595) loss:2.763 lr:0.0000100 epoch_Time:24367.0min: [2024-01-06 02:13:41,737][model8_pretrain.py][INFO] Epoch:[0/2](759600/4588595) loss:2.792 lr:0.0000100 epoch_Time:24367.0min: [2024-01-06 02:13:41,737][model8_pretrain.py][INFO] Epoch:[0/2](759600/4588595) loss:2.781 lr:0.0000100 epoch_Time:24367.0min: [2024-01-06 02:13:41,737][model8_pretrain.py][INFO] Epoch:[0/2](759600/4588595) loss:2.653 lr:0.0000100 epoch_Time:24367.0min: [2024-01-06 02:14:18,677][model8_pretrain.py][INFO] Epoch:[0/2](759700/4588595) loss:2.794 lr:0.0000100 epoch_Time:24366.0min: [2024-01-06 02:14:18,677][model8_pretrain.py][INFO] Epoch:[0/2](759700/4588595) loss:2.827 lr:0.0000100 epoch_Time:24366.0min: [2024-01-06 02:14:18,677][model8_pretrain.py][INFO] Epoch:[0/2](759700/4588595) loss:3.365 lr:0.0000100 epoch_Time:24366.0min: [2024-01-06 02:14:18,677][model8_pretrain.py][INFO] Epoch:[0/2](759700/4588595) loss:2.876 lr:0.0000100 epoch_Time:24366.0min: [2024-01-06 02:14:18,677][model8_pretrain.py][INFO] Epoch:[0/2](759700/4588595) loss:2.564 lr:0.0000100 epoch_Time:24366.0min: [2024-01-06 02:14:18,677][model8_pretrain.py][INFO] 
Epoch:[0/2](759700/4588595) loss:2.812 lr:0.0000100 epoch_Time:24366.0min: [2024-01-06 02:14:18,677][model8_pretrain.py][INFO] Epoch:[0/2](759700/4588595) loss:2.791 lr:0.0000100 epoch_Time:24366.0min: [2024-01-06 02:14:18,677][model8_pretrain.py][INFO] Epoch:[0/2](759700/4588595) loss:2.867 lr:0.0000100 epoch_Time:24366.0min: [2024-01-06 02:14:55,617][model8_pretrain.py][INFO] Epoch:[0/2](759800/4588595) loss:3.039 lr:0.0000100 epoch_Time:24365.0min: [2024-01-06 02:14:55,617][model8_pretrain.py][INFO] Epoch:[0/2](759800/4588595) loss:2.821 lr:0.0000100 epoch_Time:24365.0min: [2024-01-06 02:14:55,617][model8_pretrain.py][INFO] Epoch:[0/2](759800/4588595) loss:2.833 lr:0.0000100 epoch_Time:24365.0min: [2024-01-06 02:14:55,617][model8_pretrain.py][INFO] Epoch:[0/2](759800/4588595) loss:3.055 lr:0.0000100 epoch_Time:24365.0min: [2024-01-06 02:14:55,617][model8_pretrain.py][INFO] Epoch:[0/2](759800/4588595) loss:2.292 lr:0.0000100 epoch_Time:24365.0min: [2024-01-06 02:14:55,617][model8_pretrain.py][INFO] Epoch:[0/2](759800/4588595) loss:2.914 lr:0.0000100 epoch_Time:24365.0min: [2024-01-06 02:14:55,617][model8_pretrain.py][INFO] Epoch:[0/2](759800/4588595) loss:2.705 lr:0.0000100 epoch_Time:24365.0min: [2024-01-06 02:14:55,617][model8_pretrain.py][INFO] Epoch:[0/2](759800/4588595) loss:2.898 lr:0.0000100 epoch_Time:24365.0min: [2024-01-06 02:15:32,561][model8_pretrain.py][INFO] Epoch:[0/2](759900/4588595) loss:2.644 lr:0.0000100 epoch_Time:24365.0min: [2024-01-06 02:15:32,561][model8_pretrain.py][INFO] Epoch:[0/2](759900/4588595) loss:2.901 lr:0.0000100 epoch_Time:24365.0min: [2024-01-06 02:15:32,561][model8_pretrain.py][INFO] Epoch:[0/2](759900/4588595) loss:3.272 lr:0.0000100 epoch_Time:24365.0min: [2024-01-06 02:15:32,561][model8_pretrain.py][INFO] Epoch:[0/2](759900/4588595) loss:2.993 lr:0.0000100 epoch_Time:24365.0min: [2024-01-06 02:15:32,561][model8_pretrain.py][INFO] Epoch:[0/2](759900/4588595) loss:2.708 lr:0.0000100 epoch_Time:24365.0min: [2024-01-06 
02:15:32,561][model8_pretrain.py][INFO] Epoch:[0/2](759900/4588595) loss:2.564 lr:0.0000100 epoch_Time:24365.0min: [2024-01-06 02:15:32,561][model8_pretrain.py][INFO] Epoch:[0/2](759900/4588595) loss:3.132 lr:0.0000100 epoch_Time:24365.0min: [2024-01-06 02:15:32,561][model8_pretrain.py][INFO] Epoch:[0/2](759900/4588595) loss:2.588 lr:0.0000100 epoch_Time:24365.0min: [2024-01-06 02:16:09,507][model8_pretrain.py][INFO] Epoch:[0/2](760000/4588595) loss:2.771 lr:0.0000100 epoch_Time:24364.0min: [2024-01-06 02:16:09,507][model8_pretrain.py][INFO] Epoch:[0/2](760000/4588595) loss:2.811 lr:0.0000100 epoch_Time:24364.0min: [2024-01-06 02:16:09,507][model8_pretrain.py][INFO] Epoch:[0/2](760000/4588595) loss:3.204 lr:0.0000100 epoch_Time:24364.0min: [2024-01-06 02:16:09,507][model8_pretrain.py][INFO] Epoch:[0/2](760000/4588595) loss:2.478 lr:0.0000100 epoch_Time:24364.0min: [2024-01-06 02:16:09,507][model8_pretrain.py][INFO] Epoch:[0/2](760000/4588595) loss:2.337 lr:0.0000100 epoch_Time:24364.0min: [2024-01-06 02:16:09,507][model8_pretrain.py][INFO] Epoch:[0/2](760000/4588595) loss:2.860 lr:0.0000100 epoch_Time:24364.0min: [2024-01-06 02:16:09,507][model8_pretrain.py][INFO] Epoch:[0/2](760000/4588595) loss:2.687 lr:0.0000100 epoch_Time:24364.0min: [2024-01-06 02:16:09,507][model8_pretrain.py][INFO] Epoch:[0/2](760000/4588595) loss:2.769 lr:0.0000100 epoch_Time:24364.0min: [2024-01-06 02:16:46,437][model8_pretrain.py][INFO] Epoch:[0/2](760100/4588595) loss:2.849 lr:0.0000100 epoch_Time:24363.0min: [2024-01-06 02:16:46,437][model8_pretrain.py][INFO] Epoch:[0/2](760100/4588595) loss:2.938 lr:0.0000100 epoch_Time:24363.0min: [2024-01-06 02:16:46,437][model8_pretrain.py][INFO] Epoch:[0/2](760100/4588595) loss:3.226 lr:0.0000100 epoch_Time:24363.0min: [2024-01-06 02:16:46,437][model8_pretrain.py][INFO] Epoch:[0/2](760100/4588595) loss:3.267 lr:0.0000100 epoch_Time:24363.0min: [2024-01-06 02:16:46,437][model8_pretrain.py][INFO] Epoch:[0/2](760100/4588595) loss:2.859 lr:0.0000100 
epoch_Time:24363.0min: [2024-01-06 02:16:46,437][model8_pretrain.py][INFO] Epoch:[0/2](760100/4588595) loss:3.056 lr:0.0000100 epoch_Time:24363.0min: [2024-01-06 02:16:46,437][model8_pretrain.py][INFO] Epoch:[0/2](760100/4588595) loss:2.675 lr:0.0000100 epoch_Time:24363.0min: [2024-01-06 02:16:46,438][model8_pretrain.py][INFO] Epoch:[0/2](760100/4588595) loss:2.716 lr:0.0000100 epoch_Time:24363.0min: [2024-01-06 02:17:33,632][model8_pretrain.py][INFO] Epoch:[0/2](760200/4588595) loss:2.654 lr:0.0000100 epoch_Time:24363.0min: [2024-01-06 02:17:33,633][model8_pretrain.py][INFO] Epoch:[0/2](760200/4588595) loss:2.539 lr:0.0000100 epoch_Time:24363.0min: [2024-01-06 02:17:33,633][model8_pretrain.py][INFO] Epoch:[0/2](760200/4588595) loss:2.625 lr:0.0000100 epoch_Time:24363.0min: [2024-01-06 02:17:33,633][model8_pretrain.py][INFO] Epoch:[0/2](760200/4588595) loss:2.575 lr:0.0000100 epoch_Time:24363.0min: [2024-01-06 02:17:33,633][model8_pretrain.py][INFO] Epoch:[0/2](760200/4588595) loss:3.388 lr:0.0000100 epoch_Time:24363.0min: [2024-01-06 02:17:33,633][model8_pretrain.py][INFO] Epoch:[0/2](760200/4588595) loss:2.729 lr:0.0000100 epoch_Time:24363.0min: [2024-01-06 02:17:33,633][model8_pretrain.py][INFO] Epoch:[0/2](760200/4588595) loss:2.842 lr:0.0000100 epoch_Time:24363.0min: [2024-01-06 02:17:33,634][model8_pretrain.py][INFO] Epoch:[0/2](760200/4588595) loss:2.858 lr:0.0000100 epoch_Time:24363.0min: [2024-01-06 02:18:10,562][model8_pretrain.py][INFO] Epoch:[0/2](760300/4588595) loss:2.990 lr:0.0000100 epoch_Time:24362.0min: [2024-01-06 02:18:10,562][model8_pretrain.py][INFO] Epoch:[0/2](760300/4588595) loss:2.704 lr:0.0000100 epoch_Time:24362.0min: [2024-01-06 02:18:10,562][model8_pretrain.py][INFO] Epoch:[0/2](760300/4588595) loss:2.527 lr:0.0000100 epoch_Time:24362.0min: [2024-01-06 02:18:10,562][model8_pretrain.py][INFO] Epoch:[0/2](760300/4588595) loss:2.691 lr:0.0000100 epoch_Time:24362.0min: [2024-01-06 02:18:10,562][model8_pretrain.py][INFO] 
Epoch:[0/2](760300/4588595) loss:2.665 lr:0.0000100 epoch_Time:24362.0min: [2024-01-06 02:18:10,562][model8_pretrain.py][INFO] Epoch:[0/2](760300/4588595) loss:2.754 lr:0.0000100 epoch_Time:24362.0min: [2024-01-06 02:18:10,562][model8_pretrain.py][INFO] Epoch:[0/2](760300/4588595) loss:2.482 lr:0.0000100 epoch_Time:24362.0min: [2024-01-06 02:18:10,563][model8_pretrain.py][INFO] Epoch:[0/2](760300/4588595) loss:2.860 lr:0.0000100 epoch_Time:24362.0min: [2024-01-06 02:18:47,495][model8_pretrain.py][INFO] Epoch:[0/2](760400/4588595) loss:3.241 lr:0.0000100 epoch_Time:24362.0min: [2024-01-06 02:18:47,495][model8_pretrain.py][INFO] Epoch:[0/2](760400/4588595) loss:3.218 lr:0.0000100 epoch_Time:24362.0min: [2024-01-06 02:18:47,495][model8_pretrain.py][INFO] Epoch:[0/2](760400/4588595) loss:2.968 lr:0.0000100 epoch_Time:24362.0min: [2024-01-06 02:18:47,495][model8_pretrain.py][INFO] Epoch:[0/2](760400/4588595) loss:2.668 lr:0.0000100 epoch_Time:24362.0min: [2024-01-06 02:18:47,495][model8_pretrain.py][INFO] Epoch:[0/2](760400/4588595) loss:2.397 lr:0.0000100 epoch_Time:24362.0min: [2024-01-06 02:18:47,495][model8_pretrain.py][INFO] Epoch:[0/2](760400/4588595) loss:3.629 lr:0.0000100 epoch_Time:24362.0min: [2024-01-06 02:18:47,495][model8_pretrain.py][INFO] Epoch:[0/2](760400/4588595) loss:2.201 lr:0.0000100 epoch_Time:24362.0min: [2024-01-06 02:18:47,496][model8_pretrain.py][INFO] Epoch:[0/2](760400/4588595) loss:2.954 lr:0.0000100 epoch_Time:24362.0min: [2024-01-06 02:19:24,437][model8_pretrain.py][INFO] Epoch:[0/2](760500/4588595) loss:2.887 lr:0.0000100 epoch_Time:24361.0min: [2024-01-06 02:19:24,437][model8_pretrain.py][INFO] Epoch:[0/2](760500/4588595) loss:2.504 lr:0.0000100 epoch_Time:24361.0min: [2024-01-06 02:19:24,437][model8_pretrain.py][INFO] Epoch:[0/2](760500/4588595) loss:3.235 lr:0.0000100 epoch_Time:24361.0min: [2024-01-06 02:19:24,437][model8_pretrain.py][INFO] Epoch:[0/2](760500/4588595) loss:2.852 lr:0.0000100 epoch_Time:24361.0min: [2024-01-06 
02:19:24,437][model8_pretrain.py][INFO] Epoch:[0/2](760500/4588595) loss:3.165 lr:0.0000100 epoch_Time:24361.0min: [2024-01-06 02:19:24,437][model8_pretrain.py][INFO] Epoch:[0/2](760500/4588595) loss:2.858 lr:0.0000100 epoch_Time:24361.0min: [2024-01-06 02:19:24,437][model8_pretrain.py][INFO] Epoch:[0/2](760500/4588595) loss:2.657 lr:0.0000100 epoch_Time:24361.0min: [2024-01-06 02:19:24,437][model8_pretrain.py][INFO] Epoch:[0/2](760500/4588595) loss:3.166 lr:0.0000100 epoch_Time:24361.0min: [2024-01-06 02:20:01,379][model8_pretrain.py][INFO] Epoch:[0/2](760600/4588595) loss:2.953 lr:0.0000100 epoch_Time:24360.0min: [2024-01-06 02:20:01,379][model8_pretrain.py][INFO] Epoch:[0/2](760600/4588595) loss:2.670 lr:0.0000100 epoch_Time:24360.0min: [2024-01-06 02:20:01,379][model8_pretrain.py][INFO] Epoch:[0/2](760600/4588595) loss:2.701 lr:0.0000100 epoch_Time:24360.0min: [2024-01-06 02:20:01,379][model8_pretrain.py][INFO] Epoch:[0/2](760600/4588595) loss:3.022 lr:0.0000100 epoch_Time:24360.0min: [2024-01-06 02:20:01,379][model8_pretrain.py][INFO] Epoch:[0/2](760600/4588595) loss:1.921 lr:0.0000100 epoch_Time:24360.0min: [2024-01-06 02:20:01,379][model8_pretrain.py][INFO] Epoch:[0/2](760600/4588595) loss:3.142 lr:0.0000100 epoch_Time:24360.0min: [2024-01-06 02:20:01,379][model8_pretrain.py][INFO] Epoch:[0/2](760600/4588595) loss:2.788 lr:0.0000100 epoch_Time:24360.0min: [2024-01-06 02:20:01,379][model8_pretrain.py][INFO] Epoch:[0/2](760600/4588595) loss:2.495 lr:0.0000100 epoch_Time:24360.0min: [2024-01-06 02:20:38,323][model8_pretrain.py][INFO] Epoch:[0/2](760700/4588595) loss:2.964 lr:0.0000100 epoch_Time:24360.0min: [2024-01-06 02:20:38,324][model8_pretrain.py][INFO] Epoch:[0/2](760700/4588595) loss:2.444 lr:0.0000100 epoch_Time:24360.0min: [2024-01-06 02:20:38,324][model8_pretrain.py][INFO] Epoch:[0/2](760700/4588595) loss:3.378 lr:0.0000100 epoch_Time:24360.0min: [2024-01-06 02:20:38,324][model8_pretrain.py][INFO] Epoch:[0/2](760700/4588595) loss:3.107 lr:0.0000100 
epoch_Time:24360.0min: [2024-01-06 02:20:38,324][model8_pretrain.py][INFO] Epoch:[0/2](760700/4588595) loss:2.520 lr:0.0000100 epoch_Time:24360.0min: [2024-01-06 02:20:38,323][model8_pretrain.py][INFO] Epoch:[0/2](760700/4588595) loss:2.986 lr:0.0000100 epoch_Time:24360.0min: [2024-01-06 02:20:38,323][model8_pretrain.py][INFO] Epoch:[0/2](760700/4588595) loss:2.749 lr:0.0000100 epoch_Time:24360.0min: [2024-01-06 02:20:38,324][model8_pretrain.py][INFO] Epoch:[0/2](760700/4588595) loss:2.772 lr:0.0000100 epoch_Time:24360.0min: [2024-01-06 02:21:15,259][model8_pretrain.py][INFO] Epoch:[0/2](760800/4588595) loss:2.540 lr:0.0000100 epoch_Time:24359.0min: [2024-01-06 02:21:15,259][model8_pretrain.py][INFO] Epoch:[0/2](760800/4588595) loss:3.002 lr:0.0000100 epoch_Time:24359.0min: [2024-01-06 02:21:15,259][model8_pretrain.py][INFO] Epoch:[0/2](760800/4588595) loss:2.603 lr:0.0000100 epoch_Time:24359.0min: [2024-01-06 02:21:15,259][model8_pretrain.py][INFO] Epoch:[0/2](760800/4588595) loss:2.750 lr:0.0000100 epoch_Time:24359.0min: [2024-01-06 02:21:15,259][model8_pretrain.py][INFO] Epoch:[0/2](760800/4588595) loss:3.064 lr:0.0000100 epoch_Time:24359.0min: [2024-01-06 02:21:15,259][model8_pretrain.py][INFO] Epoch:[0/2](760800/4588595) loss:2.492 lr:0.0000100 epoch_Time:24359.0min: [2024-01-06 02:21:15,259][model8_pretrain.py][INFO] Epoch:[0/2](760800/4588595) loss:2.706 lr:0.0000100 epoch_Time:24359.0min: [2024-01-06 02:21:15,259][model8_pretrain.py][INFO] Epoch:[0/2](760800/4588595) loss:3.031 lr:0.0000100 epoch_Time:24359.0min: [2024-01-06 02:21:52,208][model8_pretrain.py][INFO] Epoch:[0/2](760900/4588595) loss:2.984 lr:0.0000100 epoch_Time:24358.0min: [2024-01-06 02:21:52,208][model8_pretrain.py][INFO] Epoch:[0/2](760900/4588595) loss:2.743 lr:0.0000100 epoch_Time:24358.0min: [2024-01-06 02:21:52,208][model8_pretrain.py][INFO] Epoch:[0/2](760900/4588595) loss:3.091 lr:0.0000100 epoch_Time:24358.0min: [2024-01-06 02:21:52,208][model8_pretrain.py][INFO] 
Epoch:[0/2](760900/4588595) loss:2.464 lr:0.0000100 epoch_Time:24358.0min: [2024-01-06 02:21:52,208][model8_pretrain.py][INFO] Epoch:[0/2](760900/4588595) loss:2.928 lr:0.0000100 epoch_Time:24358.0min: [2024-01-06 02:21:52,208][model8_pretrain.py][INFO] Epoch:[0/2](760900/4588595) loss:1.910 lr:0.0000100 epoch_Time:24358.0min: [2024-01-06 02:21:52,208][model8_pretrain.py][INFO] Epoch:[0/2](760900/4588595) loss:2.751 lr:0.0000100 epoch_Time:24358.0min: [2024-01-06 02:21:52,209][model8_pretrain.py][INFO] Epoch:[0/2](760900/4588595) loss:3.128 lr:0.0000100 epoch_Time:24358.0min: [2024-01-06 02:22:39,342][model8_pretrain.py][INFO] Epoch:[0/2](761000/4588595) loss:2.911 lr:0.0000100 epoch_Time:24358.0min: [2024-01-06 02:22:39,342][model8_pretrain.py][INFO] Epoch:[0/2](761000/4588595) loss:3.019 lr:0.0000100 epoch_Time:24358.0min: [2024-01-06 02:22:39,342][model8_pretrain.py][INFO] Epoch:[0/2](761000/4588595) loss:2.614 lr:0.0000100 epoch_Time:24358.0min: [2024-01-06 02:22:39,342][model8_pretrain.py][INFO] Epoch:[0/2](761000/4588595) loss:3.035 lr:0.0000100 epoch_Time:24358.0min: [2024-01-06 02:22:39,342][model8_pretrain.py][INFO] Epoch:[0/2](761000/4588595) loss:2.880 lr:0.0000100 epoch_Time:24358.0min: [2024-01-06 02:22:39,342][model8_pretrain.py][INFO] Epoch:[0/2](761000/4588595) loss:2.844 lr:0.0000100 epoch_Time:24358.0min: [2024-01-06 02:22:39,342][model8_pretrain.py][INFO] Epoch:[0/2](761000/4588595) loss:2.225 lr:0.0000100 epoch_Time:24358.0min: [2024-01-06 02:22:39,342][model8_pretrain.py][INFO] Epoch:[0/2](761000/4588595) loss:2.821 lr:0.0000100 epoch_Time:24358.0min: [2024-01-06 02:23:16,289][model8_pretrain.py][INFO] Epoch:[0/2](761100/4588595) loss:3.002 lr:0.0000100 epoch_Time:24357.0min: [2024-01-06 02:23:16,289][model8_pretrain.py][INFO] Epoch:[0/2](761100/4588595) loss:2.568 lr:0.0000100 epoch_Time:24357.0min: [2024-01-06 02:23:16,289][model8_pretrain.py][INFO] Epoch:[0/2](761100/4588595) loss:3.613 lr:0.0000100 epoch_Time:24357.0min: [2024-01-06 
02:23:16,289][model8_pretrain.py][INFO] Epoch:[0/2](761100/4588595) loss:2.795 lr:0.0000100 epoch_Time:24357.0min: [2024-01-06 02:23:16,289][model8_pretrain.py][INFO] Epoch:[0/2](761100/4588595) loss:2.855 lr:0.0000100 epoch_Time:24357.0min: [2024-01-06 02:23:16,289][model8_pretrain.py][INFO] Epoch:[0/2](761100/4588595) loss:2.086 lr:0.0000100 epoch_Time:24357.0min: [2024-01-06 02:23:16,289][model8_pretrain.py][INFO] Epoch:[0/2](761100/4588595) loss:2.897 lr:0.0000100 epoch_Time:24357.0min: [2024-01-06 02:23:16,289][model8_pretrain.py][INFO] Epoch:[0/2](761100/4588595) loss:2.957 lr:0.0000100 epoch_Time:24357.0min: [2024-01-06 02:23:53,223][model8_pretrain.py][INFO] Epoch:[0/2](761200/4588595) loss:3.196 lr:0.0000100 epoch_Time:24356.0min: [2024-01-06 02:23:53,223][model8_pretrain.py][INFO] Epoch:[0/2](761200/4588595) loss:2.439 lr:0.0000100 epoch_Time:24356.0min: [2024-01-06 02:23:53,223][model8_pretrain.py][INFO] Epoch:[0/2](761200/4588595) loss:2.732 lr:0.0000100 epoch_Time:24356.0min: [2024-01-06 02:23:53,223][model8_pretrain.py][INFO] Epoch:[0/2](761200/4588595) loss:2.883 lr:0.0000100 epoch_Time:24356.0min: [2024-01-06 02:23:53,223][model8_pretrain.py][INFO] Epoch:[0/2](761200/4588595) loss:3.065 lr:0.0000100 epoch_Time:24356.0min: [2024-01-06 02:23:53,223][model8_pretrain.py][INFO] Epoch:[0/2](761200/4588595) loss:2.728 lr:0.0000100 epoch_Time:24356.0min: [2024-01-06 02:23:53,223][model8_pretrain.py][INFO] Epoch:[0/2](761200/4588595) loss:2.769 lr:0.0000100 epoch_Time:24356.0min: [2024-01-06 02:23:53,223][model8_pretrain.py][INFO] Epoch:[0/2](761200/4588595) loss:2.727 lr:0.0000100 epoch_Time:24356.0min: [2024-01-06 02:24:30,159][model8_pretrain.py][INFO] Epoch:[0/2](761300/4588595) loss:3.086 lr:0.0000100 epoch_Time:24356.0min: [2024-01-06 02:24:30,159][model8_pretrain.py][INFO] Epoch:[0/2](761300/4588595) loss:2.699 lr:0.0000100 epoch_Time:24356.0min: [2024-01-06 02:24:30,159][model8_pretrain.py][INFO] Epoch:[0/2](761300/4588595) loss:2.941 lr:0.0000100 
epoch_Time:24356.0min: [2024-01-06 02:24:30,159][model8_pretrain.py][INFO] Epoch:[0/2](761300/4588595) loss:2.592 lr:0.0000100 epoch_Time:24356.0min: [2024-01-06 02:24:30,159][model8_pretrain.py][INFO] Epoch:[0/2](761300/4588595) loss:2.085 lr:0.0000100 epoch_Time:24356.0min: [2024-01-06 02:24:30,160][model8_pretrain.py][INFO] Epoch:[0/2](761300/4588595) loss:2.043 lr:0.0000100 epoch_Time:24356.0min: [2024-01-06 02:24:30,160][model8_pretrain.py][INFO] Epoch:[0/2](761300/4588595) loss:2.734 lr:0.0000100 epoch_Time:24356.0min: [2024-01-06 02:24:30,160][model8_pretrain.py][INFO] Epoch:[0/2](761300/4588595) loss:2.641 lr:0.0000100 epoch_Time:24356.0min: [2024-01-06 02:25:07,103][model8_pretrain.py][INFO] Epoch:[0/2](761400/4588595) loss:3.038 lr:0.0000100 epoch_Time:24355.0min: [2024-01-06 02:25:07,103][model8_pretrain.py][INFO] Epoch:[0/2](761400/4588595) loss:3.173 lr:0.0000100 epoch_Time:24355.0min: [2024-01-06 02:25:07,103][model8_pretrain.py][INFO] Epoch:[0/2](761400/4588595) loss:3.330 lr:0.0000100 epoch_Time:24355.0min: [2024-01-06 02:25:07,103][model8_pretrain.py][INFO] Epoch:[0/2](761400/4588595) loss:2.772 lr:0.0000100 epoch_Time:24355.0min: [2024-01-06 02:25:07,103][model8_pretrain.py][INFO] Epoch:[0/2](761400/4588595) loss:3.269 lr:0.0000100 epoch_Time:24355.0min: [2024-01-06 02:25:07,103][model8_pretrain.py][INFO] Epoch:[0/2](761400/4588595) loss:2.870 lr:0.0000100 epoch_Time:24355.0min: [2024-01-06 02:25:07,103][model8_pretrain.py][INFO] Epoch:[0/2](761400/4588595) loss:2.561 lr:0.0000100 epoch_Time:24355.0min: [2024-01-06 02:25:07,103][model8_pretrain.py][INFO] Epoch:[0/2](761400/4588595) loss:3.042 lr:0.0000100 epoch_Time:24355.0min: [2024-01-06 02:25:44,045][model8_pretrain.py][INFO] Epoch:[0/2](761500/4588595) loss:2.886 lr:0.0000100 epoch_Time:24355.0min: [2024-01-06 02:25:44,045][model8_pretrain.py][INFO] Epoch:[0/2](761500/4588595) loss:2.619 lr:0.0000100 epoch_Time:24355.0min: [2024-01-06 02:25:44,045][model8_pretrain.py][INFO] 
Epoch:[0/2](761500/4588595) loss:2.663 lr:0.0000100 epoch_Time:24355.0min: [2024-01-06 02:25:44,045][model8_pretrain.py][INFO] Epoch:[0/2](761500/4588595) loss:3.405 lr:0.0000100 epoch_Time:24355.0min: [2024-01-06 02:25:44,045][model8_pretrain.py][INFO] Epoch:[0/2](761500/4588595) loss:3.254 lr:0.0000100 epoch_Time:24355.0min: [2024-01-06 02:25:44,045][model8_pretrain.py][INFO] Epoch:[0/2](761500/4588595) loss:3.223 lr:0.0000100 epoch_Time:24355.0min: [2024-01-06 02:25:44,045][model8_pretrain.py][INFO] Epoch:[0/2](761500/4588595) loss:3.295 lr:0.0000100 epoch_Time:24355.0min: [2024-01-06 02:25:44,045][model8_pretrain.py][INFO] Epoch:[0/2](761500/4588595) loss:3.232 lr:0.0000100 epoch_Time:24355.0min: [2024-01-06 02:26:20,988][model8_pretrain.py][INFO] Epoch:[0/2](761600/4588595) loss:2.618 lr:0.0000100 epoch_Time:24354.0min: [2024-01-06 02:26:20,988][model8_pretrain.py][INFO] Epoch:[0/2](761600/4588595) loss:2.725 lr:0.0000100 epoch_Time:24354.0min: [2024-01-06 02:26:20,988][model8_pretrain.py][INFO] Epoch:[0/2](761600/4588595) loss:2.773 lr:0.0000100 epoch_Time:24354.0min: [2024-01-06 02:26:20,988][model8_pretrain.py][INFO] Epoch:[0/2](761600/4588595) loss:2.559 lr:0.0000100 epoch_Time:24354.0min: [2024-01-06 02:26:20,988][model8_pretrain.py][INFO] Epoch:[0/2](761600/4588595) loss:2.656 lr:0.0000100 epoch_Time:24354.0min: [2024-01-06 02:26:20,988][model8_pretrain.py][INFO] Epoch:[0/2](761600/4588595) loss:2.836 lr:0.0000100 epoch_Time:24354.0min: [2024-01-06 02:26:20,988][model8_pretrain.py][INFO] Epoch:[0/2](761600/4588595) loss:2.280 lr:0.0000100 epoch_Time:24354.0min: [2024-01-06 02:26:20,988][model8_pretrain.py][INFO] Epoch:[0/2](761600/4588595) loss:3.360 lr:0.0000100 epoch_Time:24354.0min: [2024-01-06 02:26:57,927][model8_pretrain.py][INFO] Epoch:[0/2](761700/4588595) loss:3.380 lr:0.0000100 epoch_Time:24353.0min: [2024-01-06 02:26:57,927][model8_pretrain.py][INFO] Epoch:[0/2](761700/4588595) loss:2.859 lr:0.0000100 epoch_Time:24353.0min: [2024-01-06 
02:26:57,927][model8_pretrain.py][INFO] Epoch:[0/2](761700/4588595) loss:3.476 lr:0.0000100 epoch_Time:24353.0min: [2024-01-06 02:26:57,927][model8_pretrain.py][INFO] Epoch:[0/2](761700/4588595) loss:2.711 lr:0.0000100 epoch_Time:24353.0min: [2024-01-06 02:26:57,927][model8_pretrain.py][INFO] Epoch:[0/2](761700/4588595) loss:2.608 lr:0.0000100 epoch_Time:24353.0min: [2024-01-06 02:26:57,927][model8_pretrain.py][INFO] Epoch:[0/2](761700/4588595) loss:2.679 lr:0.0000100 epoch_Time:24353.0min: [2024-01-06 02:26:57,928][model8_pretrain.py][INFO] Epoch:[0/2](761700/4588595) loss:3.243 lr:0.0000100 epoch_Time:24353.0min: [2024-01-06 02:26:57,928][model8_pretrain.py][INFO] Epoch:[0/2](761700/4588595) loss:2.594 lr:0.0000100 epoch_Time:24353.0min: [2024-01-06 02:27:45,177][model8_pretrain.py][INFO] Epoch:[0/2](761800/4588595) loss:2.936 lr:0.0000100 epoch_Time:24353.0min: [2024-01-06 02:27:45,177][model8_pretrain.py][INFO] Epoch:[0/2](761800/4588595) loss:2.752 lr:0.0000100 epoch_Time:24353.0min: [2024-01-06 02:27:45,177][model8_pretrain.py][INFO] Epoch:[0/2](761800/4588595) loss:3.328 lr:0.0000100 epoch_Time:24353.0min: [2024-01-06 02:27:45,177][model8_pretrain.py][INFO] Epoch:[0/2](761800/4588595) loss:2.952 lr:0.0000100 epoch_Time:24353.0min: [2024-01-06 02:27:45,177][model8_pretrain.py][INFO] Epoch:[0/2](761800/4588595) loss:3.025 lr:0.0000100 epoch_Time:24353.0min: [2024-01-06 02:27:45,177][model8_pretrain.py][INFO] Epoch:[0/2](761800/4588595) loss:2.959 lr:0.0000100 epoch_Time:24353.0min: [2024-01-06 02:27:45,177][model8_pretrain.py][INFO] Epoch:[0/2](761800/4588595) loss:3.348 lr:0.0000100 epoch_Time:24353.0min: [2024-01-06 02:27:45,177][model8_pretrain.py][INFO] Epoch:[0/2](761800/4588595) loss:1.964 lr:0.0000100 epoch_Time:24353.0min: [2024-01-06 02:28:22,112][model8_pretrain.py][INFO] Epoch:[0/2](761900/4588595) loss:2.398 lr:0.0000100 epoch_Time:24352.0min: [2024-01-06 02:28:22,112][model8_pretrain.py][INFO] Epoch:[0/2](761900/4588595) loss:2.949 lr:0.0000100 
epoch_Time:24352.0min: [2024-01-06 02:28:22,112][model8_pretrain.py][INFO] Epoch:[0/2](761900/4588595) loss:2.660 lr:0.0000100 epoch_Time:24352.0min: [2024-01-06 02:28:22,112][model8_pretrain.py][INFO] Epoch:[0/2](761900/4588595) loss:2.835 lr:0.0000100 epoch_Time:24352.0min: [2024-01-06 02:28:22,112][model8_pretrain.py][INFO] Epoch:[0/2](761900/4588595) loss:3.036 lr:0.0000100 epoch_Time:24352.0min: [2024-01-06 02:28:22,112][model8_pretrain.py][INFO] Epoch:[0/2](761900/4588595) loss:2.638 lr:0.0000100 epoch_Time:24352.0min: [2024-01-06 02:28:22,112][model8_pretrain.py][INFO] Epoch:[0/2](761900/4588595) loss:3.186 lr:0.0000100 epoch_Time:24352.0min: [2024-01-06 02:28:22,112][model8_pretrain.py][INFO] Epoch:[0/2](761900/4588595) loss:2.967 lr:0.0000100 epoch_Time:24352.0min: [2024-01-06 02:28:59,059][model8_pretrain.py][INFO] Epoch:[0/2](762000/4588595) loss:2.954 lr:0.0000100 epoch_Time:24351.0min: [2024-01-06 02:28:59,059][model8_pretrain.py][INFO] Epoch:[0/2](762000/4588595) loss:2.556 lr:0.0000100 epoch_Time:24351.0min: [2024-01-06 02:28:59,059][model8_pretrain.py][INFO] Epoch:[0/2](762000/4588595) loss:2.623 lr:0.0000100 epoch_Time:24351.0min: [2024-01-06 02:28:59,059][model8_pretrain.py][INFO] Epoch:[0/2](762000/4588595) loss:2.942 lr:0.0000100 epoch_Time:24351.0min: [2024-01-06 02:28:59,059][model8_pretrain.py][INFO] Epoch:[0/2](762000/4588595) loss:2.931 lr:0.0000100 epoch_Time:24351.0min: [2024-01-06 02:28:59,059][model8_pretrain.py][INFO] Epoch:[0/2](762000/4588595) loss:2.619 lr:0.0000100 epoch_Time:24351.0min: [2024-01-06 02:28:59,059][model8_pretrain.py][INFO] Epoch:[0/2](762000/4588595) loss:2.478 lr:0.0000100 epoch_Time:24351.0min: [2024-01-06 02:28:59,059][model8_pretrain.py][INFO] Epoch:[0/2](762000/4588595) loss:2.569 lr:0.0000100 epoch_Time:24351.0min: [2024-01-06 02:29:35,998][model8_pretrain.py][INFO] Epoch:[0/2](762100/4588595) loss:2.992 lr:0.0000100 epoch_Time:24351.0min: [2024-01-06 02:29:35,998][model8_pretrain.py][INFO] 
Epoch:[0/2](762100/4588595) loss:2.378 lr:0.0000100 epoch_Time:24351.0min: [2024-01-06 02:29:35,998][model8_pretrain.py][INFO] Epoch:[0/2](762100/4588595) loss:2.939 lr:0.0000100 epoch_Time:24351.0min: [2024-01-06 02:29:35,998][model8_pretrain.py][INFO] Epoch:[0/2](762100/4588595) loss:2.388 lr:0.0000100 epoch_Time:24351.0min: [2024-01-06 02:29:35,998][model8_pretrain.py][INFO] Epoch:[0/2](762100/4588595) loss:2.960 lr:0.0000100 epoch_Time:24351.0min: [2024-01-06 02:29:35,998][model8_pretrain.py][INFO] Epoch:[0/2](762100/4588595) loss:2.527 lr:0.0000100 epoch_Time:24351.0min: [2024-01-06 02:29:35,998][model8_pretrain.py][INFO] Epoch:[0/2](762100/4588595) loss:2.655 lr:0.0000100 epoch_Time:24351.0min: [2024-01-06 02:29:35,998][model8_pretrain.py][INFO] Epoch:[0/2](762100/4588595) loss:2.993 lr:0.0000100 epoch_Time:24351.0min: [2024-01-06 02:30:12,917][model8_pretrain.py][INFO] Epoch:[0/2](762200/4588595) loss:3.023 lr:0.0000100 epoch_Time:24350.0min: [2024-01-06 02:30:12,917][model8_pretrain.py][INFO] Epoch:[0/2](762200/4588595) loss:2.788 lr:0.0000100 epoch_Time:24350.0min: [2024-01-06 02:30:12,917][model8_pretrain.py][INFO] Epoch:[0/2](762200/4588595) loss:3.181 lr:0.0000100 epoch_Time:24350.0min: [2024-01-06 02:30:12,917][model8_pretrain.py][INFO] Epoch:[0/2](762200/4588595) loss:2.738 lr:0.0000100 epoch_Time:24350.0min: [2024-01-06 02:30:12,917][model8_pretrain.py][INFO] Epoch:[0/2](762200/4588595) loss:3.609 lr:0.0000100 epoch_Time:24350.0min: [2024-01-06 02:30:12,917][model8_pretrain.py][INFO] Epoch:[0/2](762200/4588595) loss:3.022 lr:0.0000100 epoch_Time:24350.0min: [2024-01-06 02:30:12,917][model8_pretrain.py][INFO] Epoch:[0/2](762200/4588595) loss:2.494 lr:0.0000100 epoch_Time:24350.0min: [2024-01-06 02:30:12,917][model8_pretrain.py][INFO] Epoch:[0/2](762200/4588595) loss:2.850 lr:0.0000100 epoch_Time:24350.0min: [2024-01-06 02:30:49,856][model8_pretrain.py][INFO] Epoch:[0/2](762300/4588595) loss:2.745 lr:0.0000100 epoch_Time:24349.0min: [2024-01-06 
02:30:49,856][model8_pretrain.py][INFO] Epoch:[0/2](762300/4588595) loss:3.173 lr:0.0000100 epoch_Time:24349.0min: [2024-01-06 02:30:49,856][model8_pretrain.py][INFO] Epoch:[0/2](762300/4588595) loss:2.001 lr:0.0000100 epoch_Time:24349.0min: [2024-01-06 02:30:49,856][model8_pretrain.py][INFO] Epoch:[0/2](762300/4588595) loss:2.841 lr:0.0000100 epoch_Time:24349.0min: [2024-01-06 02:30:49,856][model8_pretrain.py][INFO] Epoch:[0/2](762300/4588595) loss:3.125 lr:0.0000100 epoch_Time:24349.0min: [2024-01-06 02:30:49,856][model8_pretrain.py][INFO] Epoch:[0/2](762300/4588595) loss:2.826 lr:0.0000100 epoch_Time:24349.0min: [2024-01-06 02:30:49,856][model8_pretrain.py][INFO] Epoch:[0/2](762300/4588595) loss:3.472 lr:0.0000100 epoch_Time:24349.0min: [2024-01-06 02:30:49,857][model8_pretrain.py][INFO] Epoch:[0/2](762300/4588595) loss:3.413 lr:0.0000100 epoch_Time:24349.0min: [2024-01-06 02:31:26,794][model8_pretrain.py][INFO] Epoch:[0/2](762400/4588595) loss:3.227 lr:0.0000100 epoch_Time:24349.0min: [2024-01-06 02:31:26,794][model8_pretrain.py][INFO] Epoch:[0/2](762400/4588595) loss:1.985 lr:0.0000100 epoch_Time:24349.0min: [2024-01-06 02:31:26,794][model8_pretrain.py][INFO] Epoch:[0/2](762400/4588595) loss:2.819 lr:0.0000100 epoch_Time:24349.0min: [2024-01-06 02:31:26,794][model8_pretrain.py][INFO] Epoch:[0/2](762400/4588595) loss:2.657 lr:0.0000100 epoch_Time:24349.0min: [2024-01-06 02:31:26,794][model8_pretrain.py][INFO] Epoch:[0/2](762400/4588595) loss:2.769 lr:0.0000100 epoch_Time:24349.0min: [2024-01-06 02:31:26,794][model8_pretrain.py][INFO] Epoch:[0/2](762400/4588595) loss:2.960 lr:0.0000100 epoch_Time:24349.0min: [2024-01-06 02:31:26,795][model8_pretrain.py][INFO] Epoch:[0/2](762400/4588595) loss:2.473 lr:0.0000100 epoch_Time:24349.0min: [2024-01-06 02:31:26,795][model8_pretrain.py][INFO] Epoch:[0/2](762400/4588595) loss:3.385 lr:0.0000100 epoch_Time:24349.0min: [2024-01-06 02:32:03,741][model8_pretrain.py][INFO] Epoch:[0/2](762500/4588595) loss:2.362 lr:0.0000100 
epoch_Time:24348.0min: [2024-01-06 02:32:03,741][model8_pretrain.py][INFO] Epoch:[0/2](762500/4588595) loss:3.469 lr:0.0000100 epoch_Time:24348.0min: [2024-01-06 02:32:03,741][model8_pretrain.py][INFO] Epoch:[0/2](762500/4588595) loss:2.729 lr:0.0000100 epoch_Time:24348.0min: [2024-01-06 02:32:03,741][model8_pretrain.py][INFO] Epoch:[0/2](762500/4588595) loss:2.967 lr:0.0000100 epoch_Time:24348.0min: [2024-01-06 02:32:03,741][model8_pretrain.py][INFO] Epoch:[0/2](762500/4588595) loss:3.092 lr:0.0000100 epoch_Time:24348.0min: [2024-01-06 02:32:03,741][model8_pretrain.py][INFO] Epoch:[0/2](762500/4588595) loss:2.687 lr:0.0000100 epoch_Time:24348.0min: [2024-01-06 02:32:03,741][model8_pretrain.py][INFO] Epoch:[0/2](762500/4588595) loss:2.921 lr:0.0000100 epoch_Time:24348.0min: [2024-01-06 02:32:03,741][model8_pretrain.py][INFO] Epoch:[0/2](762500/4588595) loss:2.583 lr:0.0000100 epoch_Time:24348.0min: [2024-01-06 02:32:51,129][model8_pretrain.py][INFO] Epoch:[0/2](762600/4588595) loss:3.000 lr:0.0000100 epoch_Time:24348.0min: [2024-01-06 02:32:51,129][model8_pretrain.py][INFO] Epoch:[0/2](762600/4588595) loss:2.682 lr:0.0000100 epoch_Time:24348.0min: [2024-01-06 02:32:51,129][model8_pretrain.py][INFO] Epoch:[0/2](762600/4588595) loss:2.603 lr:0.0000100 epoch_Time:24348.0min: [2024-01-06 02:32:51,129][model8_pretrain.py][INFO] Epoch:[0/2](762600/4588595) loss:3.118 lr:0.0000100 epoch_Time:24348.0min: [2024-01-06 02:32:51,129][model8_pretrain.py][INFO] Epoch:[0/2](762600/4588595) loss:2.891 lr:0.0000100 epoch_Time:24348.0min: [2024-01-06 02:32:51,129][model8_pretrain.py][INFO] Epoch:[0/2](762600/4588595) loss:2.680 lr:0.0000100 epoch_Time:24348.0min: [2024-01-06 02:32:51,130][model8_pretrain.py][INFO] Epoch:[0/2](762600/4588595) loss:2.817 lr:0.0000100 epoch_Time:24348.0min: [2024-01-06 02:32:51,131][model8_pretrain.py][INFO] Epoch:[0/2](762600/4588595) loss:2.982 lr:0.0000100 epoch_Time:24348.0min: [2024-01-06 02:33:28,076][model8_pretrain.py][INFO] 
Epoch:[0/2](762700/4588595) loss:2.715 lr:0.0000100 epoch_Time:24347.0min: [2024-01-06 02:33:28,076][model8_pretrain.py][INFO] Epoch:[0/2](762700/4588595) loss:2.714 lr:0.0000100 epoch_Time:24347.0min: [2024-01-06 02:33:28,076][model8_pretrain.py][INFO] Epoch:[0/2](762700/4588595) loss:2.734 lr:0.0000100 epoch_Time:24347.0min: [2024-01-06 02:33:28,076][model8_pretrain.py][INFO] Epoch:[0/2](762700/4588595) loss:2.644 lr:0.0000100 epoch_Time:24347.0min: [2024-01-06 02:33:28,076][model8_pretrain.py][INFO] Epoch:[0/2](762700/4588595) loss:2.720 lr:0.0000100 epoch_Time:24347.0min: [2024-01-06 02:33:28,076][model8_pretrain.py][INFO] Epoch:[0/2](762700/4588595) loss:2.884 lr:0.0000100 epoch_Time:24347.0min: [2024-01-06 02:33:28,077][model8_pretrain.py][INFO] Epoch:[0/2](762700/4588595) loss:2.355 lr:0.0000100 epoch_Time:24347.0min: [2024-01-06 02:33:28,077][model8_pretrain.py][INFO] Epoch:[0/2](762700/4588595) loss:2.271 lr:0.0000100 epoch_Time:24347.0min: [2024-01-06 02:34:05,021][model8_pretrain.py][INFO] Epoch:[0/2](762800/4588595) loss:3.167 lr:0.0000100 epoch_Time:24346.0min: [2024-01-06 02:34:05,021][model8_pretrain.py][INFO] Epoch:[0/2](762800/4588595) loss:3.143 lr:0.0000100 epoch_Time:24346.0min: [2024-01-06 02:34:05,021][model8_pretrain.py][INFO] Epoch:[0/2](762800/4588595) loss:3.067 lr:0.0000100 epoch_Time:24346.0min: [2024-01-06 02:34:05,021][model8_pretrain.py][INFO] Epoch:[0/2](762800/4588595) loss:2.548 lr:0.0000100 epoch_Time:24346.0min: [2024-01-06 02:34:05,021][model8_pretrain.py][INFO] Epoch:[0/2](762800/4588595) loss:2.201 lr:0.0000100 epoch_Time:24346.0min: [2024-01-06 02:34:05,021][model8_pretrain.py][INFO] Epoch:[0/2](762800/4588595) loss:3.006 lr:0.0000100 epoch_Time:24346.0min: [2024-01-06 02:34:05,021][model8_pretrain.py][INFO] Epoch:[0/2](762800/4588595) loss:2.830 lr:0.0000100 epoch_Time:24346.0min: [2024-01-06 02:34:05,021][model8_pretrain.py][INFO] Epoch:[0/2](762800/4588595) loss:2.891 lr:0.0000100 epoch_Time:24346.0min: [2024-01-06 
02:34:41,968][model8_pretrain.py][INFO] Epoch:[0/2](762900/4588595) loss:3.096 lr:0.0000100 epoch_Time:24346.0min: [2024-01-06 02:34:41,968][model8_pretrain.py][INFO] Epoch:[0/2](762900/4588595) loss:2.589 lr:0.0000100 epoch_Time:24346.0min: [2024-01-06 02:34:41,968][model8_pretrain.py][INFO] Epoch:[0/2](762900/4588595) loss:2.727 lr:0.0000100 epoch_Time:24346.0min: [2024-01-06 02:34:41,968][model8_pretrain.py][INFO] Epoch:[0/2](762900/4588595) loss:2.744 lr:0.0000100 epoch_Time:24346.0min: [2024-01-06 02:34:41,969][model8_pretrain.py][INFO] Epoch:[0/2](762900/4588595) loss:2.721 lr:0.0000100 epoch_Time:24346.0min: [2024-01-06 02:34:41,969][model8_pretrain.py][INFO] Epoch:[0/2](762900/4588595) loss:2.777 lr:0.0000100 epoch_Time:24346.0min: [2024-01-06 02:34:41,969][model8_pretrain.py][INFO] Epoch:[0/2](762900/4588595) loss:3.066 lr:0.0000100 epoch_Time:24346.0min: [2024-01-06 02:34:41,969][model8_pretrain.py][INFO] Epoch:[0/2](762900/4588595) loss:2.704 lr:0.0000100 epoch_Time:24346.0min: [2024-01-06 02:35:18,924][model8_pretrain.py][INFO] Epoch:[0/2](763000/4588595) loss:3.170 lr:0.0000100 epoch_Time:24345.0min: [2024-01-06 02:35:18,924][model8_pretrain.py][INFO] Epoch:[0/2](763000/4588595) loss:3.067 lr:0.0000100 epoch_Time:24345.0min: [2024-01-06 02:35:18,924][model8_pretrain.py][INFO] Epoch:[0/2](763000/4588595) loss:3.353 lr:0.0000100 epoch_Time:24345.0min: [2024-01-06 02:35:18,924][model8_pretrain.py][INFO] Epoch:[0/2](763000/4588595) loss:2.574 lr:0.0000100 epoch_Time:24345.0min: [2024-01-06 02:35:18,924][model8_pretrain.py][INFO] Epoch:[0/2](763000/4588595) loss:3.189 lr:0.0000100 epoch_Time:24345.0min: [2024-01-06 02:35:18,924][model8_pretrain.py][INFO] Epoch:[0/2](763000/4588595) loss:2.412 lr:0.0000100 epoch_Time:24345.0min: [2024-01-06 02:35:18,924][model8_pretrain.py][INFO] Epoch:[0/2](763000/4588595) loss:3.319 lr:0.0000100 epoch_Time:24345.0min: [2024-01-06 02:35:18,924][model8_pretrain.py][INFO] Epoch:[0/2](763000/4588595) loss:2.523 lr:0.0000100 
epoch_Time:24345.0min: [2024-01-06 02:35:55,875][model8_pretrain.py][INFO] Epoch:[0/2](763100/4588595) loss:2.653 lr:0.0000100 epoch_Time:24344.0min: [2024-01-06 02:35:55,875][model8_pretrain.py][INFO] Epoch:[0/2](763100/4588595) loss:2.549 lr:0.0000100 epoch_Time:24344.0min: [2024-01-06 02:35:55,875][model8_pretrain.py][INFO] Epoch:[0/2](763100/4588595) loss:3.378 lr:0.0000100 epoch_Time:24344.0min: [2024-01-06 02:35:55,875][model8_pretrain.py][INFO] Epoch:[0/2](763100/4588595) loss:2.742 lr:0.0000100 epoch_Time:24344.0min: [2024-01-06 02:35:55,875][model8_pretrain.py][INFO] Epoch:[0/2](763100/4588595) loss:2.782 lr:0.0000100 epoch_Time:24344.0min: [2024-01-06 02:35:55,876][model8_pretrain.py][INFO] Epoch:[0/2](763100/4588595) loss:3.315 lr:0.0000100 epoch_Time:24344.0min: [2024-01-06 02:35:55,876][model8_pretrain.py][INFO] Epoch:[0/2](763100/4588595) loss:2.961 lr:0.0000100 epoch_Time:24344.0min: [2024-01-06 02:35:55,876][model8_pretrain.py][INFO] Epoch:[0/2](763100/4588595) loss:3.267 lr:0.0000100 epoch_Time:24344.0min: [2024-01-06 02:36:32,828][model8_pretrain.py][INFO] Epoch:[0/2](763200/4588595) loss:3.021 lr:0.0000100 epoch_Time:24344.0min: [2024-01-06 02:36:32,828][model8_pretrain.py][INFO] Epoch:[0/2](763200/4588595) loss:2.761 lr:0.0000100 epoch_Time:24344.0min: [2024-01-06 02:36:32,828][model8_pretrain.py][INFO] Epoch:[0/2](763200/4588595) loss:2.259 lr:0.0000100 epoch_Time:24344.0min: [2024-01-06 02:36:32,828][model8_pretrain.py][INFO] Epoch:[0/2](763200/4588595) loss:2.908 lr:0.0000100 epoch_Time:24344.0min: [2024-01-06 02:36:32,828][model8_pretrain.py][INFO] Epoch:[0/2](763200/4588595) loss:2.734 lr:0.0000100 epoch_Time:24344.0min: [2024-01-06 02:36:32,828][model8_pretrain.py][INFO] Epoch:[0/2](763200/4588595) loss:3.017 lr:0.0000100 epoch_Time:24344.0min: [2024-01-06 02:36:32,828][model8_pretrain.py][INFO] Epoch:[0/2](763200/4588595) loss:3.113 lr:0.0000100 epoch_Time:24344.0min: [2024-01-06 02:36:32,828][model8_pretrain.py][INFO] 
Epoch:[0/2](763200/4588595) loss:2.837 lr:0.0000100 epoch_Time:24344.0min: [2024-01-06 02:37:09,803][model8_pretrain.py][INFO] Epoch:[0/2](763300/4588595) loss:3.248 lr:0.0000100 epoch_Time:24343.0min: [2024-01-06 02:37:09,804][model8_pretrain.py][INFO] Epoch:[0/2](763300/4588595) loss:2.860 lr:0.0000100 epoch_Time:24343.0min: [2024-01-06 02:37:09,804][model8_pretrain.py][INFO] Epoch:[0/2](763300/4588595) loss:3.112 lr:0.0000100 epoch_Time:24343.0min: [2024-01-06 02:37:09,804][model8_pretrain.py][INFO] Epoch:[0/2](763300/4588595) loss:3.033 lr:0.0000100 epoch_Time:24343.0min: [2024-01-06 02:37:09,804][model8_pretrain.py][INFO] Epoch:[0/2](763300/4588595) loss:2.326 lr:0.0000100 epoch_Time:24343.0min: [2024-01-06 02:37:09,804][model8_pretrain.py][INFO] Epoch:[0/2](763300/4588595) loss:3.220 lr:0.0000100 epoch_Time:24343.0min: [2024-01-06 02:37:09,804][model8_pretrain.py][INFO] Epoch:[0/2](763300/4588595) loss:2.869 lr:0.0000100 epoch_Time:24343.0min: [2024-01-06 02:37:09,804][model8_pretrain.py][INFO] Epoch:[0/2](763300/4588595) loss:2.834 lr:0.0000100 epoch_Time:24343.0min: [2024-01-06 02:37:57,309][model8_pretrain.py][INFO] Epoch:[0/2](763400/4588595) loss:2.655 lr:0.0000100 epoch_Time:24343.0min: [2024-01-06 02:37:57,309][model8_pretrain.py][INFO] Epoch:[0/2](763400/4588595) loss:2.162 lr:0.0000100 epoch_Time:24343.0min: [2024-01-06 02:37:57,309][model8_pretrain.py][INFO] Epoch:[0/2](763400/4588595) loss:2.942 lr:0.0000100 epoch_Time:24343.0min: [2024-01-06 02:37:57,309][model8_pretrain.py][INFO] Epoch:[0/2](763400/4588595) loss:2.635 lr:0.0000100 epoch_Time:24343.0min: [2024-01-06 02:37:57,309][model8_pretrain.py][INFO] Epoch:[0/2](763400/4588595) loss:2.707 lr:0.0000100 epoch_Time:24343.0min: [2024-01-06 02:37:57,309][model8_pretrain.py][INFO] Epoch:[0/2](763400/4588595) loss:2.267 lr:0.0000100 epoch_Time:24343.0min: [2024-01-06 02:37:57,309][model8_pretrain.py][INFO] Epoch:[0/2](763400/4588595) loss:3.366 lr:0.0000100 epoch_Time:24343.0min: [2024-01-06 
02:37:57,309][model8_pretrain.py][INFO] Epoch:[0/2](763400/4588595) loss:3.033 lr:0.0000100 epoch_Time:24343.0min: [2024-01-06 02:38:34,247][model8_pretrain.py][INFO] Epoch:[0/2](763500/4588595) loss:2.848 lr:0.0000100 epoch_Time:24342.0min: [2024-01-06 02:38:34,248][model8_pretrain.py][INFO] Epoch:[0/2](763500/4588595) loss:2.633 lr:0.0000100 epoch_Time:24342.0min: [2024-01-06 02:38:34,248][model8_pretrain.py][INFO] Epoch:[0/2](763500/4588595) loss:3.156 lr:0.0000100 epoch_Time:24342.0min: [2024-01-06 02:38:34,248][model8_pretrain.py][INFO] Epoch:[0/2](763500/4588595) loss:3.121 lr:0.0000100 epoch_Time:24342.0min: [2024-01-06 02:38:34,248][model8_pretrain.py][INFO] Epoch:[0/2](763500/4588595) loss:2.593 lr:0.0000100 epoch_Time:24342.0min: [2024-01-06 02:38:34,248][model8_pretrain.py][INFO] Epoch:[0/2](763500/4588595) loss:3.238 lr:0.0000100 epoch_Time:24342.0min: [2024-01-06 02:38:34,248][model8_pretrain.py][INFO] Epoch:[0/2](763500/4588595) loss:2.677 lr:0.0000100 epoch_Time:24342.0min: [2024-01-06 02:38:34,248][model8_pretrain.py][INFO] Epoch:[0/2](763500/4588595) loss:2.771 lr:0.0000100 epoch_Time:24342.0min: [2024-01-06 02:39:11,194][model8_pretrain.py][INFO] Epoch:[0/2](763600/4588595) loss:3.283 lr:0.0000100 epoch_Time:24341.0min: [2024-01-06 02:39:11,194][model8_pretrain.py][INFO] Epoch:[0/2](763600/4588595) loss:2.964 lr:0.0000100 epoch_Time:24341.0min: [2024-01-06 02:39:11,194][model8_pretrain.py][INFO] Epoch:[0/2](763600/4588595) loss:3.185 lr:0.0000100 epoch_Time:24341.0min: [2024-01-06 02:39:11,194][model8_pretrain.py][INFO] Epoch:[0/2](763600/4588595) loss:2.507 lr:0.0000100 epoch_Time:24341.0min: [2024-01-06 02:39:11,194][model8_pretrain.py][INFO] Epoch:[0/2](763600/4588595) loss:2.466 lr:0.0000100 epoch_Time:24341.0min: [2024-01-06 02:39:11,194][model8_pretrain.py][INFO] Epoch:[0/2](763600/4588595) loss:3.060 lr:0.0000100 epoch_Time:24341.0min: [2024-01-06 02:39:11,194][model8_pretrain.py][INFO] Epoch:[0/2](763600/4588595) loss:3.187 lr:0.0000100 
epoch_Time:24341.0min: [2024-01-06 02:39:11,195][model8_pretrain.py][INFO] Epoch:[0/2](763600/4588595) loss:2.729 lr:0.0000100 epoch_Time:24341.0min: [2024-01-06 02:39:48,141][model8_pretrain.py][INFO] Epoch:[0/2](763700/4588595) loss:3.081 lr:0.0000100 epoch_Time:24340.0min: [2024-01-06 02:39:48,142][model8_pretrain.py][INFO] Epoch:[0/2](763700/4588595) loss:2.433 lr:0.0000100 epoch_Time:24340.0min: [2024-01-06 02:39:48,142][model8_pretrain.py][INFO] Epoch:[0/2](763700/4588595) loss:2.488 lr:0.0000100 epoch_Time:24340.0min: [2024-01-06 02:39:48,142][model8_pretrain.py][INFO] Epoch:[0/2](763700/4588595) loss:2.735 lr:0.0000100 epoch_Time:24340.0min: [2024-01-06 02:39:48,142][model8_pretrain.py][INFO] Epoch:[0/2](763700/4588595) loss:3.243 lr:0.0000100 epoch_Time:24340.0min: [2024-01-06 02:39:48,142][model8_pretrain.py][INFO] Epoch:[0/2](763700/4588595) loss:3.101 lr:0.0000100 epoch_Time:24340.0min: [2024-01-06 02:39:48,142][model8_pretrain.py][INFO] Epoch:[0/2](763700/4588595) loss:2.971 lr:0.0000100 epoch_Time:24340.0min: [2024-01-06 02:39:48,143][model8_pretrain.py][INFO] Epoch:[0/2](763700/4588595) loss:3.209 lr:0.0000100 epoch_Time:24340.0min: [2024-01-06 02:40:25,089][model8_pretrain.py][INFO] Epoch:[0/2](763800/4588595) loss:2.890 lr:0.0000100 epoch_Time:24340.0min: [2024-01-06 02:40:25,089][model8_pretrain.py][INFO] Epoch:[0/2](763800/4588595) loss:2.551 lr:0.0000100 epoch_Time:24340.0min: [2024-01-06 02:40:25,090][model8_pretrain.py][INFO] Epoch:[0/2](763800/4588595) loss:2.997 lr:0.0000100 epoch_Time:24340.0min: [2024-01-06 02:40:25,090][model8_pretrain.py][INFO] Epoch:[0/2](763800/4588595) loss:2.581 lr:0.0000100 epoch_Time:24340.0min: [2024-01-06 02:40:25,090][model8_pretrain.py][INFO] Epoch:[0/2](763800/4588595) loss:2.850 lr:0.0000100 epoch_Time:24340.0min: [2024-01-06 02:40:25,090][model8_pretrain.py][INFO] Epoch:[0/2](763800/4588595) loss:3.036 lr:0.0000100 epoch_Time:24340.0min: [2024-01-06 02:40:25,090][model8_pretrain.py][INFO] 
Epoch:[0/2](763800/4588595) loss:3.019 lr:0.0000100 epoch_Time:24340.0min: [2024-01-06 02:40:25,090][model8_pretrain.py][INFO] Epoch:[0/2](763800/4588595) loss:2.489 lr:0.0000100 epoch_Time:24340.0min: [2024-01-06 02:41:02,041][model8_pretrain.py][INFO] Epoch:[0/2](763900/4588595) loss:2.629 lr:0.0000100 epoch_Time:24339.0min: [2024-01-06 02:41:02,041][model8_pretrain.py][INFO] Epoch:[0/2](763900/4588595) loss:2.768 lr:0.0000100 epoch_Time:24339.0min: [2024-01-06 02:41:02,041][model8_pretrain.py][INFO] Epoch:[0/2](763900/4588595) loss:2.862 lr:0.0000100 epoch_Time:24339.0min: [2024-01-06 02:41:02,041][model8_pretrain.py][INFO] Epoch:[0/2](763900/4588595) loss:2.261 lr:0.0000100 epoch_Time:24339.0min: [2024-01-06 02:41:02,041][model8_pretrain.py][INFO] Epoch:[0/2](763900/4588595) loss:2.712 lr:0.0000100 epoch_Time:24339.0min: [2024-01-06 02:41:02,041][model8_pretrain.py][INFO] Epoch:[0/2](763900/4588595) loss:2.765 lr:0.0000100 epoch_Time:24339.0min: [2024-01-06 02:41:02,042][model8_pretrain.py][INFO] Epoch:[0/2](763900/4588595) loss:2.521 lr:0.0000100 epoch_Time:24339.0min: [2024-01-06 02:41:02,042][model8_pretrain.py][INFO] Epoch:[0/2](763900/4588595) loss:2.662 lr:0.0000100 epoch_Time:24339.0min: [2024-01-06 02:41:38,994][model8_pretrain.py][INFO] Epoch:[0/2](764000/4588595) loss:2.931 lr:0.0000100 epoch_Time:24339.0min: [2024-01-06 02:41:38,994][model8_pretrain.py][INFO] Epoch:[0/2](764000/4588595) loss:3.175 lr:0.0000100 epoch_Time:24339.0min: [2024-01-06 02:41:38,994][model8_pretrain.py][INFO] Epoch:[0/2](764000/4588595) loss:2.989 lr:0.0000100 epoch_Time:24339.0min: [2024-01-06 02:41:38,994][model8_pretrain.py][INFO] Epoch:[0/2](764000/4588595) loss:3.004 lr:0.0000100 epoch_Time:24339.0min: [2024-01-06 02:41:38,994][model8_pretrain.py][INFO] Epoch:[0/2](764000/4588595) loss:2.777 lr:0.0000100 epoch_Time:24339.0min: [2024-01-06 02:41:38,994][model8_pretrain.py][INFO] Epoch:[0/2](764000/4588595) loss:2.668 lr:0.0000100 epoch_Time:24339.0min: [2024-01-06 
02:41:38,994][model8_pretrain.py][INFO] Epoch:[0/2](764000/4588595) loss:2.878 lr:0.0000100 epoch_Time:24339.0min: [2024-01-06 02:41:38,994][model8_pretrain.py][INFO] Epoch:[0/2](764000/4588595) loss:2.759 lr:0.0000100 epoch_Time:24339.0min: [2024-01-06 02:42:15,934][model8_pretrain.py][INFO] Epoch:[0/2](764100/4588595) loss:2.898 lr:0.0000100 epoch_Time:24338.0min: [2024-01-06 02:42:15,934][model8_pretrain.py][INFO] Epoch:[0/2](764100/4588595) loss:2.736 lr:0.0000100 epoch_Time:24338.0min: [2024-01-06 02:42:15,934][model8_pretrain.py][INFO] Epoch:[0/2](764100/4588595) loss:2.667 lr:0.0000100 epoch_Time:24338.0min: [2024-01-06 02:42:15,934][model8_pretrain.py][INFO] Epoch:[0/2](764100/4588595) loss:2.898 lr:0.0000100 epoch_Time:24338.0min: [2024-01-06 02:42:15,934][model8_pretrain.py][INFO] Epoch:[0/2](764100/4588595) loss:3.280 lr:0.0000100 epoch_Time:24338.0min: [2024-01-06 02:42:15,934][model8_pretrain.py][INFO] Epoch:[0/2](764100/4588595) loss:3.101 lr:0.0000100 epoch_Time:24338.0min: [2024-01-06 02:42:15,934][model8_pretrain.py][INFO] Epoch:[0/2](764100/4588595) loss:3.456 lr:0.0000100 epoch_Time:24338.0min: [2024-01-06 02:42:15,934][model8_pretrain.py][INFO] Epoch:[0/2](764100/4588595) loss:3.152 lr:0.0000100 epoch_Time:24338.0min: [2024-01-06 02:43:03,272][model8_pretrain.py][INFO] Epoch:[0/2](764200/4588595) loss:2.887 lr:0.0000100 epoch_Time:24338.0min: [2024-01-06 02:43:03,272][model8_pretrain.py][INFO] Epoch:[0/2](764200/4588595) loss:2.352 lr:0.0000100 epoch_Time:24338.0min: [2024-01-06 02:43:03,272][model8_pretrain.py][INFO] Epoch:[0/2](764200/4588595) loss:2.407 lr:0.0000100 epoch_Time:24338.0min: [2024-01-06 02:43:03,272][model8_pretrain.py][INFO] Epoch:[0/2](764200/4588595) loss:2.694 lr:0.0000100 epoch_Time:24338.0min: [2024-01-06 02:43:03,272][model8_pretrain.py][INFO] Epoch:[0/2](764200/4588595) loss:2.793 lr:0.0000100 epoch_Time:24338.0min: [2024-01-06 02:43:03,272][model8_pretrain.py][INFO] Epoch:[0/2](764200/4588595) loss:3.034 lr:0.0000100 
epoch_Time:24338.0min: [2024-01-06 02:43:03,272][model8_pretrain.py][INFO] Epoch:[0/2](764200/4588595) loss:2.943 lr:0.0000100 epoch_Time:24338.0min: [2024-01-06 02:43:03,272][model8_pretrain.py][INFO] Epoch:[0/2](764200/4588595) loss:2.733 lr:0.0000100 epoch_Time:24338.0min: [2024-01-06 02:43:40,175][model8_pretrain.py][INFO] Epoch:[0/2](764300/4588595) loss:2.379 lr:0.0000100 epoch_Time:24338.0min: [2024-01-06 02:43:40,175][model8_pretrain.py][INFO] Epoch:[0/2](764300/4588595) loss:2.711 lr:0.0000100 epoch_Time:24338.0min: [2024-01-06 02:43:40,175][model8_pretrain.py][INFO] Epoch:[0/2](764300/4588595) loss:2.357 lr:0.0000100 epoch_Time:24338.0min: [2024-01-06 02:43:40,175][model8_pretrain.py][INFO] Epoch:[0/2](764300/4588595) loss:2.533 lr:0.0000100 epoch_Time:24338.0min: [2024-01-06 02:43:40,175][model8_pretrain.py][INFO] Epoch:[0/2](764300/4588595) loss:3.056 lr:0.0000100 epoch_Time:24338.0min: [2024-01-06 02:43:40,175][model8_pretrain.py][INFO] Epoch:[0/2](764300/4588595) loss:2.854 lr:0.0000100 epoch_Time:24338.0min: [2024-01-06 02:43:40,175][model8_pretrain.py][INFO] Epoch:[0/2](764300/4588595) loss:2.904 lr:0.0000100 epoch_Time:24338.0min: [2024-01-06 02:43:40,175][model8_pretrain.py][INFO] Epoch:[0/2](764300/4588595) loss:3.017 lr:0.0000100 epoch_Time:24338.0min: [2024-01-06 02:44:17,109][model8_pretrain.py][INFO] Epoch:[0/2](764400/4588595) loss:2.083 lr:0.0000100 epoch_Time:24336.0min: [2024-01-06 02:44:17,109][model8_pretrain.py][INFO] Epoch:[0/2](764400/4588595) loss:2.679 lr:0.0000100 epoch_Time:24336.0min: [2024-01-06 02:44:17,109][model8_pretrain.py][INFO] Epoch:[0/2](764400/4588595) loss:2.739 lr:0.0000100 epoch_Time:24336.0min: [2024-01-06 02:44:17,109][model8_pretrain.py][INFO] Epoch:[0/2](764400/4588595) loss:2.775 lr:0.0000100 epoch_Time:24336.0min: [2024-01-06 02:44:17,109][model8_pretrain.py][INFO] Epoch:[0/2](764400/4588595) loss:3.181 lr:0.0000100 epoch_Time:24336.0min: [2024-01-06 02:44:17,109][model8_pretrain.py][INFO] 
Epoch:[0/2](764400/4588595) loss:2.734 lr:0.0000100 epoch_Time:24336.0min: [2024-01-06 02:44:17,109][model8_pretrain.py][INFO] Epoch:[0/2](764400/4588595) loss:2.723 lr:0.0000100 epoch_Time:24336.0min: [2024-01-06 02:44:17,109][model8_pretrain.py][INFO] Epoch:[0/2](764400/4588595) loss:2.631 lr:0.0000100 epoch_Time:24336.0min: [2024-01-06 02:44:54,047][model8_pretrain.py][INFO] Epoch:[0/2](764500/4588595) loss:2.778 lr:0.0000100 epoch_Time:24335.0min: [2024-01-06 02:44:54,047][model8_pretrain.py][INFO] Epoch:[0/2](764500/4588595) loss:2.311 lr:0.0000100 epoch_Time:24335.0min: [2024-01-06 02:44:54,047][model8_pretrain.py][INFO] Epoch:[0/2](764500/4588595) loss:2.935 lr:0.0000100 epoch_Time:24335.0min: [2024-01-06 02:44:54,047][model8_pretrain.py][INFO] Epoch:[0/2](764500/4588595) loss:2.578 lr:0.0000100 epoch_Time:24335.0min: [2024-01-06 02:44:54,047][model8_pretrain.py][INFO] Epoch:[0/2](764500/4588595) loss:3.041 lr:0.0000100 epoch_Time:24335.0min: [2024-01-06 02:44:54,047][model8_pretrain.py][INFO] Epoch:[0/2](764500/4588595) loss:2.803 lr:0.0000100 epoch_Time:24335.0min: [2024-01-06 02:44:54,047][model8_pretrain.py][INFO] Epoch:[0/2](764500/4588595) loss:2.351 lr:0.0000100 epoch_Time:24335.0min: [2024-01-06 02:44:54,047][model8_pretrain.py][INFO] Epoch:[0/2](764500/4588595) loss:2.969 lr:0.0000100 epoch_Time:24335.0min: [2024-01-06 02:45:30,986][model8_pretrain.py][INFO] Epoch:[0/2](764600/4588595) loss:2.709 lr:0.0000100 epoch_Time:24335.0min: [2024-01-06 02:45:30,986][model8_pretrain.py][INFO] Epoch:[0/2](764600/4588595) loss:2.607 lr:0.0000100 epoch_Time:24335.0min: [2024-01-06 02:45:30,986][model8_pretrain.py][INFO] Epoch:[0/2](764600/4588595) loss:2.616 lr:0.0000100 epoch_Time:24335.0min: [2024-01-06 02:45:30,986][model8_pretrain.py][INFO] Epoch:[0/2](764600/4588595) loss:3.118 lr:0.0000100 epoch_Time:24335.0min: [2024-01-06 02:45:30,986][model8_pretrain.py][INFO] Epoch:[0/2](764600/4588595) loss:2.645 lr:0.0000100 epoch_Time:24335.0min: [2024-01-06 
02:45:30,986][model8_pretrain.py][INFO] Epoch:[0/2](764600/4588595) loss:2.600 lr:0.0000100 epoch_Time:24335.0min: [2024-01-06 02:45:30,986][model8_pretrain.py][INFO] Epoch:[0/2](764600/4588595) loss:2.733 lr:0.0000100 epoch_Time:24335.0min: [2024-01-06 02:45:30,986][model8_pretrain.py][INFO] Epoch:[0/2](764600/4588595) loss:2.927 lr:0.0000100 epoch_Time:24335.0min: [2024-01-06 02:46:07,921][model8_pretrain.py][INFO] Epoch:[0/2](764700/4588595) loss:2.655 lr:0.0000100 epoch_Time:24334.0min: [2024-01-06 02:46:07,921][model8_pretrain.py][INFO] Epoch:[0/2](764700/4588595) loss:2.959 lr:0.0000100 epoch_Time:24334.0min: [2024-01-06 02:46:07,921][model8_pretrain.py][INFO] Epoch:[0/2](764700/4588595) loss:3.331 lr:0.0000100 epoch_Time:24334.0min: [2024-01-06 02:46:07,921][model8_pretrain.py][INFO] Epoch:[0/2](764700/4588595) loss:3.085 lr:0.0000100 epoch_Time:24334.0min: [2024-01-06 02:46:07,921][model8_pretrain.py][INFO] Epoch:[0/2](764700/4588595) loss:2.844 lr:0.0000100 epoch_Time:24334.0min: [2024-01-06 02:46:07,921][model8_pretrain.py][INFO] Epoch:[0/2](764700/4588595) loss:3.118 lr:0.0000100 epoch_Time:24334.0min: [2024-01-06 02:46:07,921][model8_pretrain.py][INFO] Epoch:[0/2](764700/4588595) loss:2.765 lr:0.0000100 epoch_Time:24334.0min: [2024-01-06 02:46:07,921][model8_pretrain.py][INFO] Epoch:[0/2](764700/4588595) loss:2.827 lr:0.0000100 epoch_Time:24334.0min: [2024-01-06 02:46:44,864][model8_pretrain.py][INFO] Epoch:[0/2](764800/4588595) loss:2.816 lr:0.0000100 epoch_Time:24334.0min: [2024-01-06 02:46:44,864][model8_pretrain.py][INFO] Epoch:[0/2](764800/4588595) loss:2.098 lr:0.0000100 epoch_Time:24334.0min: [2024-01-06 02:46:44,865][model8_pretrain.py][INFO] Epoch:[0/2](764800/4588595) loss:2.921 lr:0.0000100 epoch_Time:24334.0min: [2024-01-06 02:46:44,865][model8_pretrain.py][INFO] Epoch:[0/2](764800/4588595) loss:2.421 lr:0.0000100 epoch_Time:24334.0min: [2024-01-06 02:46:44,865][model8_pretrain.py][INFO] Epoch:[0/2](764800/4588595) loss:3.210 lr:0.0000100 
epoch_Time:24334.0min: [2024-01-06 02:46:44,865][model8_pretrain.py][INFO] Epoch:[0/2](764800/4588595) loss:3.110 lr:0.0000100 epoch_Time:24334.0min: [2024-01-06 02:46:44,865][model8_pretrain.py][INFO] Epoch:[0/2](764800/4588595) loss:2.907 lr:0.0000100 epoch_Time:24334.0min: [2024-01-06 02:46:44,865][model8_pretrain.py][INFO] Epoch:[0/2](764800/4588595) loss:2.655 lr:0.0000100 epoch_Time:24334.0min: [2024-01-06 02:47:21,815][model8_pretrain.py][INFO] Epoch:[0/2](764900/4588595) loss:2.784 lr:0.0000100 epoch_Time:24333.0min: [2024-01-06 02:47:21,815][model8_pretrain.py][INFO] Epoch:[0/2](764900/4588595) loss:2.414 lr:0.0000100 epoch_Time:24333.0min: [2024-01-06 02:47:21,815][model8_pretrain.py][INFO] Epoch:[0/2](764900/4588595) loss:2.263 lr:0.0000100 epoch_Time:24333.0min: [2024-01-06 02:47:21,815][model8_pretrain.py][INFO] Epoch:[0/2](764900/4588595) loss:3.149 lr:0.0000100 epoch_Time:24333.0min: [2024-01-06 02:47:21,815][model8_pretrain.py][INFO] Epoch:[0/2](764900/4588595) loss:2.788 lr:0.0000100 epoch_Time:24333.0min: [2024-01-06 02:47:21,815][model8_pretrain.py][INFO] Epoch:[0/2](764900/4588595) loss:2.974 lr:0.0000100 epoch_Time:24333.0min: [2024-01-06 02:47:21,815][model8_pretrain.py][INFO] Epoch:[0/2](764900/4588595) loss:2.538 lr:0.0000100 epoch_Time:24333.0min: [2024-01-06 02:47:21,815][model8_pretrain.py][INFO] Epoch:[0/2](764900/4588595) loss:3.017 lr:0.0000100 epoch_Time:24333.0min: [2024-01-06 02:48:08,962][model8_pretrain.py][INFO] Epoch:[0/2](765000/4588595) loss:2.870 lr:0.0000100 epoch_Time:24333.0min: [2024-01-06 02:48:08,962][model8_pretrain.py][INFO] Epoch:[0/2](765000/4588595) loss:2.349 lr:0.0000100 epoch_Time:24333.0min: [2024-01-06 02:48:08,962][model8_pretrain.py][INFO] Epoch:[0/2](765000/4588595) loss:2.080 lr:0.0000100 epoch_Time:24333.0min: [2024-01-06 02:48:08,962][model8_pretrain.py][INFO] Epoch:[0/2](765000/4588595) loss:2.948 lr:0.0000100 epoch_Time:24333.0min: [2024-01-06 02:48:08,962][model8_pretrain.py][INFO] 
Epoch:[0/2](765000/4588595) loss:2.288 lr:0.0000100 epoch_Time:24333.0min: [2024-01-06 02:48:08,962][model8_pretrain.py][INFO] Epoch:[0/2](765000/4588595) loss:2.373 lr:0.0000100 epoch_Time:24333.0min: [2024-01-06 02:48:08,962][model8_pretrain.py][INFO] Epoch:[0/2](765000/4588595) loss:2.736 lr:0.0000100 epoch_Time:24333.0min: [2024-01-06 02:48:08,962][model8_pretrain.py][INFO] Epoch:[0/2](765000/4588595) loss:3.079 lr:0.0000100 epoch_Time:24333.0min: [2024-01-06 02:48:45,898][model8_pretrain.py][INFO] Epoch:[0/2](765100/4588595) loss:3.033 lr:0.0000100 epoch_Time:24333.0min: [2024-01-06 02:48:45,898][model8_pretrain.py][INFO] Epoch:[0/2](765100/4588595) loss:2.932 lr:0.0000100 epoch_Time:24333.0min: [2024-01-06 02:48:45,898][model8_pretrain.py][INFO] Epoch:[0/2](765100/4588595) loss:2.975 lr:0.0000100 epoch_Time:24333.0min: [2024-01-06 02:48:45,898][model8_pretrain.py][INFO] Epoch:[0/2](765100/4588595) loss:2.398 lr:0.0000100 epoch_Time:24333.0min: [2024-01-06 02:48:45,898][model8_pretrain.py][INFO] Epoch:[0/2](765100/4588595) loss:3.085 lr:0.0000100 epoch_Time:24333.0min: [2024-01-06 02:48:45,898][model8_pretrain.py][INFO] Epoch:[0/2](765100/4588595) loss:3.234 lr:0.0000100 epoch_Time:24333.0min: [2024-01-06 02:48:45,898][model8_pretrain.py][INFO] Epoch:[0/2](765100/4588595) loss:2.618 lr:0.0000100 epoch_Time:24333.0min: [2024-01-06 02:48:45,898][model8_pretrain.py][INFO] Epoch:[0/2](765100/4588595) loss:3.378 lr:0.0000100 epoch_Time:24333.0min: [2024-01-06 02:49:22,844][model8_pretrain.py][INFO] Epoch:[0/2](765200/4588595) loss:2.698 lr:0.0000100 epoch_Time:24331.0min: [2024-01-06 02:49:22,844][model8_pretrain.py][INFO] Epoch:[0/2](765200/4588595) loss:2.703 lr:0.0000100 epoch_Time:24331.0min: [2024-01-06 02:49:22,844][model8_pretrain.py][INFO] Epoch:[0/2](765200/4588595) loss:2.580 lr:0.0000100 epoch_Time:24331.0min: [2024-01-06 02:49:22,844][model8_pretrain.py][INFO] Epoch:[0/2](765200/4588595) loss:2.945 lr:0.0000100 epoch_Time:24331.0min: [2024-01-06 
02:49:22,844][model8_pretrain.py][INFO] Epoch:[0/2](765200/4588595) loss:2.272 lr:0.0000100 epoch_Time:24331.0min: [2024-01-06 02:49:22,844][model8_pretrain.py][INFO] Epoch:[0/2](765200/4588595) loss:2.365 lr:0.0000100 epoch_Time:24331.0min: [2024-01-06 02:49:22,845][model8_pretrain.py][INFO] Epoch:[0/2](765200/4588595) loss:2.755 lr:0.0000100 epoch_Time:24331.0min: [2024-01-06 02:49:22,845][model8_pretrain.py][INFO] Epoch:[0/2](765200/4588595) loss:3.161 lr:0.0000100 epoch_Time:24331.0min: [2024-01-06 02:49:59,791][model8_pretrain.py][INFO] Epoch:[0/2](765300/4588595) loss:2.655 lr:0.0000100 epoch_Time:24330.0min: [2024-01-06 02:49:59,791][model8_pretrain.py][INFO] Epoch:[0/2](765300/4588595) loss:2.521 lr:0.0000100 epoch_Time:24330.0min: [2024-01-06 02:49:59,791][model8_pretrain.py][INFO] Epoch:[0/2](765300/4588595) loss:2.590 lr:0.0000100 epoch_Time:24330.0min: [2024-01-06 02:49:59,791][model8_pretrain.py][INFO] Epoch:[0/2](765300/4588595) loss:2.588 lr:0.0000100 epoch_Time:24330.0min: [2024-01-06 02:49:59,791][model8_pretrain.py][INFO] Epoch:[0/2](765300/4588595) loss:2.764 lr:0.0000100 epoch_Time:24330.0min: [2024-01-06 02:49:59,791][model8_pretrain.py][INFO] Epoch:[0/2](765300/4588595) loss:2.825 lr:0.0000100 epoch_Time:24330.0min: [2024-01-06 02:49:59,791][model8_pretrain.py][INFO] Epoch:[0/2](765300/4588595) loss:2.665 lr:0.0000100 epoch_Time:24330.0min: [2024-01-06 02:49:59,791][model8_pretrain.py][INFO] Epoch:[0/2](765300/4588595) loss:2.831 lr:0.0000100 epoch_Time:24330.0min: [2024-01-06 02:50:36,735][model8_pretrain.py][INFO] Epoch:[0/2](765400/4588595) loss:2.852 lr:0.0000100 epoch_Time:24330.0min: [2024-01-06 02:50:36,735][model8_pretrain.py][INFO] Epoch:[0/2](765400/4588595) loss:3.255 lr:0.0000100 epoch_Time:24330.0min: [2024-01-06 02:50:36,735][model8_pretrain.py][INFO] Epoch:[0/2](765400/4588595) loss:2.823 lr:0.0000100 epoch_Time:24330.0min: [2024-01-06 02:50:36,735][model8_pretrain.py][INFO] Epoch:[0/2](765400/4588595) loss:2.504 lr:0.0000100 
epoch_Time:24330.0min: [2024-01-06 02:50:36,735][model8_pretrain.py][INFO] Epoch:[0/2](765400/4588595) loss:2.835 lr:0.0000100 epoch_Time:24330.0min: [2024-01-06 02:50:36,735][model8_pretrain.py][INFO] Epoch:[0/2](765400/4588595) loss:2.125 lr:0.0000100 epoch_Time:24330.0min: [2024-01-06 02:50:36,735][model8_pretrain.py][INFO] Epoch:[0/2](765400/4588595) loss:3.025 lr:0.0000100 epoch_Time:24330.0min: [2024-01-06 02:50:36,735][model8_pretrain.py][INFO] Epoch:[0/2](765400/4588595) loss:2.558 lr:0.0000100 epoch_Time:24330.0min: [2024-01-06 02:51:13,675][model8_pretrain.py][INFO] Epoch:[0/2](765500/4588595) loss:2.673 lr:0.0000100 epoch_Time:24329.0min: [2024-01-06 02:51:13,675][model8_pretrain.py][INFO] Epoch:[0/2](765500/4588595) loss:2.666 lr:0.0000100 epoch_Time:24329.0min: [2024-01-06 02:51:13,675][model8_pretrain.py][INFO] Epoch:[0/2](765500/4588595) loss:2.797 lr:0.0000100 epoch_Time:24329.0min: [2024-01-06 02:51:13,675][model8_pretrain.py][INFO] Epoch:[0/2](765500/4588595) loss:2.938 lr:0.0000100 epoch_Time:24329.0min: [2024-01-06 02:51:13,675][model8_pretrain.py][INFO] Epoch:[0/2](765500/4588595) loss:2.602 lr:0.0000100 epoch_Time:24329.0min: [2024-01-06 02:51:13,675][model8_pretrain.py][INFO] Epoch:[0/2](765500/4588595) loss:2.911 lr:0.0000100 epoch_Time:24329.0min: [2024-01-06 02:51:13,675][model8_pretrain.py][INFO] Epoch:[0/2](765500/4588595) loss:2.804 lr:0.0000100 epoch_Time:24329.0min: [2024-01-06 02:51:13,675][model8_pretrain.py][INFO] Epoch:[0/2](765500/4588595) loss:2.942 lr:0.0000100 epoch_Time:24329.0min: [2024-01-06 02:51:50,632][model8_pretrain.py][INFO] Epoch:[0/2](765600/4588595) loss:3.179 lr:0.0000100 epoch_Time:24328.0min: [2024-01-06 02:51:50,632][model8_pretrain.py][INFO] Epoch:[0/2](765600/4588595) loss:2.389 lr:0.0000100 epoch_Time:24328.0min: [2024-01-06 02:51:50,632][model8_pretrain.py][INFO] Epoch:[0/2](765600/4588595) loss:2.091 lr:0.0000100 epoch_Time:24328.0min: [2024-01-06 02:51:50,632][model8_pretrain.py][INFO] 
Epoch:[0/2](765600/4588595) loss:2.448 lr:0.0000100 epoch_Time:24328.0min: [2024-01-06 02:51:50,632][model8_pretrain.py][INFO] Epoch:[0/2](765600/4588595) loss:2.457 lr:0.0000100 epoch_Time:24328.0min: [2024-01-06 02:51:50,632][model8_pretrain.py][INFO] Epoch:[0/2](765600/4588595) loss:2.634 lr:0.0000100 epoch_Time:24328.0min: [2024-01-06 02:51:50,632][model8_pretrain.py][INFO] Epoch:[0/2](765600/4588595) loss:2.791 lr:0.0000100 epoch_Time:24328.0min: [2024-01-06 02:51:50,632][model8_pretrain.py][INFO] Epoch:[0/2](765600/4588595) loss:2.847 lr:0.0000100 epoch_Time:24328.0min: [2024-01-06 02:52:27,582][model8_pretrain.py][INFO] Epoch:[0/2](765700/4588595) loss:3.169 lr:0.0000100 epoch_Time:24328.0min: [2024-01-06 02:52:27,582][model8_pretrain.py][INFO] Epoch:[0/2](765700/4588595) loss:2.524 lr:0.0000100 epoch_Time:24328.0min: [2024-01-06 02:52:27,582][model8_pretrain.py][INFO] Epoch:[0/2](765700/4588595) loss:3.346 lr:0.0000100 epoch_Time:24328.0min: [2024-01-06 02:52:27,582][model8_pretrain.py][INFO] Epoch:[0/2](765700/4588595) loss:2.869 lr:0.0000100 epoch_Time:24328.0min: [2024-01-06 02:52:27,582][model8_pretrain.py][INFO] Epoch:[0/2](765700/4588595) loss:2.934 lr:0.0000100 epoch_Time:24328.0min: [2024-01-06 02:52:27,582][model8_pretrain.py][INFO] Epoch:[0/2](765700/4588595) loss:2.717 lr:0.0000100 epoch_Time:24328.0min: [2024-01-06 02:52:27,582][model8_pretrain.py][INFO] Epoch:[0/2](765700/4588595) loss:2.567 lr:0.0000100 epoch_Time:24328.0min: [2024-01-06 02:52:27,583][model8_pretrain.py][INFO] Epoch:[0/2](765700/4588595) loss:2.998 lr:0.0000100 epoch_Time:24328.0min: [2024-01-06 02:53:14,712][model8_pretrain.py][INFO] Epoch:[0/2](765800/4588595) loss:2.911 lr:0.0000100 epoch_Time:24328.0min: [2024-01-06 02:53:14,712][model8_pretrain.py][INFO] Epoch:[0/2](765800/4588595) loss:3.203 lr:0.0000100 epoch_Time:24328.0min: [2024-01-06 02:53:14,712][model8_pretrain.py][INFO] Epoch:[0/2](765800/4588595) loss:2.612 lr:0.0000100 epoch_Time:24328.0min: [2024-01-06 
02:53:14,712][model8_pretrain.py][INFO] Epoch:[0/2](765800/4588595) loss:1.999 lr:0.0000100 epoch_Time:24328.0min: [2024-01-06 02:53:14,712][model8_pretrain.py][INFO] Epoch:[0/2](765800/4588595) loss:2.834 lr:0.0000100 epoch_Time:24328.0min: [2024-01-06 02:53:14,712][model8_pretrain.py][INFO] Epoch:[0/2](765800/4588595) loss:2.826 lr:0.0000100 epoch_Time:24328.0min: [2024-01-06 02:53:14,712][model8_pretrain.py][INFO] Epoch:[0/2](765800/4588595) loss:3.103 lr:0.0000100 epoch_Time:24328.0min: [2024-01-06 02:53:14,712][model8_pretrain.py][INFO] Epoch:[0/2](765800/4588595) loss:2.436 lr:0.0000100 epoch_Time:24328.0min: [2024-01-06 02:53:51,643][model8_pretrain.py][INFO] Epoch:[0/2](765900/4588595) loss:2.916 lr:0.0000100 epoch_Time:24327.0min: [2024-01-06 02:53:51,643][model8_pretrain.py][INFO] Epoch:[0/2](765900/4588595) loss:2.931 lr:0.0000100 epoch_Time:24327.0min: [2024-01-06 02:53:51,643][model8_pretrain.py][INFO] Epoch:[0/2](765900/4588595) loss:2.580 lr:0.0000100 epoch_Time:24327.0min: [2024-01-06 02:53:51,643][model8_pretrain.py][INFO] Epoch:[0/2](765900/4588595) loss:2.503 lr:0.0000100 epoch_Time:24327.0min: [2024-01-06 02:53:51,643][model8_pretrain.py][INFO] Epoch:[0/2](765900/4588595) loss:2.862 lr:0.0000100 epoch_Time:24327.0min: [2024-01-06 02:53:51,643][model8_pretrain.py][INFO] Epoch:[0/2](765900/4588595) loss:2.105 lr:0.0000100 epoch_Time:24327.0min: [2024-01-06 02:53:51,643][model8_pretrain.py][INFO] Epoch:[0/2](765900/4588595) loss:3.578 lr:0.0000100 epoch_Time:24327.0min: [2024-01-06 02:53:51,643][model8_pretrain.py][INFO] Epoch:[0/2](765900/4588595) loss:2.663 lr:0.0000100 epoch_Time:24327.0min: [2024-01-06 02:54:28,582][model8_pretrain.py][INFO] Epoch:[0/2](766000/4588595) loss:2.869 lr:0.0000100 epoch_Time:24326.0min: [2024-01-06 02:54:28,582][model8_pretrain.py][INFO] Epoch:[0/2](766000/4588595) loss:2.680 lr:0.0000100 epoch_Time:24326.0min: [2024-01-06 02:54:28,582][model8_pretrain.py][INFO] Epoch:[0/2](766000/4588595) loss:2.840 lr:0.0000100 
epoch_Time:24326.0min: [2024-01-06 02:54:28,582][model8_pretrain.py][INFO] Epoch:[0/2](766000/4588595) loss:2.331 lr:0.0000100 epoch_Time:24326.0min: [2024-01-06 02:54:28,582][model8_pretrain.py][INFO] Epoch:[0/2](766000/4588595) loss:2.576 lr:0.0000100 epoch_Time:24326.0min: [2024-01-06 02:54:28,582][model8_pretrain.py][INFO] Epoch:[0/2](766000/4588595) loss:2.067 lr:0.0000100 epoch_Time:24326.0min: [2024-01-06 02:54:28,582][model8_pretrain.py][INFO] Epoch:[0/2](766000/4588595) loss:2.683 lr:0.0000100 epoch_Time:24326.0min: [2024-01-06 02:54:28,582][model8_pretrain.py][INFO] Epoch:[0/2](766000/4588595) loss:3.028 lr:0.0000100 epoch_Time:24326.0min: [2024-01-06 02:55:05,516][model8_pretrain.py][INFO] Epoch:[0/2](766100/4588595) loss:2.745 lr:0.0000100 epoch_Time:24325.0min: [2024-01-06 02:55:05,516][model8_pretrain.py][INFO] Epoch:[0/2](766100/4588595) loss:2.187 lr:0.0000100 epoch_Time:24325.0min: [2024-01-06 02:55:05,516][model8_pretrain.py][INFO] Epoch:[0/2](766100/4588595) loss:2.289 lr:0.0000100 epoch_Time:24325.0min: [2024-01-06 02:55:05,516][model8_pretrain.py][INFO] Epoch:[0/2](766100/4588595) loss:2.139 lr:0.0000100 epoch_Time:24325.0min: [2024-01-06 02:55:05,516][model8_pretrain.py][INFO] Epoch:[0/2](766100/4588595) loss:3.189 lr:0.0000100 epoch_Time:24325.0min: [2024-01-06 02:55:05,516][model8_pretrain.py][INFO] Epoch:[0/2](766100/4588595) loss:2.866 lr:0.0000100 epoch_Time:24325.0min: [2024-01-06 02:55:05,516][model8_pretrain.py][INFO] Epoch:[0/2](766100/4588595) loss:3.289 lr:0.0000100 epoch_Time:24325.0min: [2024-01-06 02:55:05,517][model8_pretrain.py][INFO] Epoch:[0/2](766100/4588595) loss:2.528 lr:0.0000100 epoch_Time:24325.0min: [2024-01-06 02:55:42,458][model8_pretrain.py][INFO] Epoch:[0/2](766200/4588595) loss:2.937 lr:0.0000100 epoch_Time:24325.0min: [2024-01-06 02:55:42,458][model8_pretrain.py][INFO] Epoch:[0/2](766200/4588595) loss:2.851 lr:0.0000100 epoch_Time:24325.0min: [2024-01-06 02:55:42,458][model8_pretrain.py][INFO] 
Epoch:[0/2](766200/4588595) loss:3.182 lr:0.0000100 epoch_Time:24325.0min: [2024-01-06 02:55:42,458][model8_pretrain.py][INFO] Epoch:[0/2](766200/4588595) loss:2.981 lr:0.0000100 epoch_Time:24325.0min: [2024-01-06 02:55:42,458][model8_pretrain.py][INFO] Epoch:[0/2](766200/4588595) loss:2.430 lr:0.0000100 epoch_Time:24325.0min: [2024-01-06 02:55:42,458][model8_pretrain.py][INFO] Epoch:[0/2](766200/4588595) loss:2.816 lr:0.0000100 epoch_Time:24325.0min: [2024-01-06 02:55:42,458][model8_pretrain.py][INFO] Epoch:[0/2](766200/4588595) loss:3.110 lr:0.0000100 epoch_Time:24325.0min: [2024-01-06 02:55:42,458][model8_pretrain.py][INFO] Epoch:[0/2](766200/4588595) loss:3.221 lr:0.0000100 epoch_Time:24325.0min: [2024-01-06 02:56:19,396][model8_pretrain.py][INFO] Epoch:[0/2](766300/4588595) loss:2.613 lr:0.0000100 epoch_Time:24324.0min: [2024-01-06 02:56:19,396][model8_pretrain.py][INFO] Epoch:[0/2](766300/4588595) loss:2.813 lr:0.0000100 epoch_Time:24324.0min: [2024-01-06 02:56:19,396][model8_pretrain.py][INFO] Epoch:[0/2](766300/4588595) loss:2.486 lr:0.0000100 epoch_Time:24324.0min: [2024-01-06 02:56:19,396][model8_pretrain.py][INFO] Epoch:[0/2](766300/4588595) loss:2.855 lr:0.0000100 epoch_Time:24324.0min: [2024-01-06 02:56:19,396][model8_pretrain.py][INFO] Epoch:[0/2](766300/4588595) loss:2.690 lr:0.0000100 epoch_Time:24324.0min: [2024-01-06 02:56:19,396][model8_pretrain.py][INFO] Epoch:[0/2](766300/4588595) loss:2.780 lr:0.0000100 epoch_Time:24324.0min: [2024-01-06 02:56:19,396][model8_pretrain.py][INFO] Epoch:[0/2](766300/4588595) loss:2.811 lr:0.0000100 epoch_Time:24324.0min: [2024-01-06 02:56:19,396][model8_pretrain.py][INFO] Epoch:[0/2](766300/4588595) loss:2.633 lr:0.0000100 epoch_Time:24324.0min: [2024-01-06 02:56:56,361][model8_pretrain.py][INFO] Epoch:[0/2](766400/4588595) loss:2.695 lr:0.0000100 epoch_Time:24323.0min: [2024-01-06 02:56:56,361][model8_pretrain.py][INFO] Epoch:[0/2](766400/4588595) loss:2.489 lr:0.0000100 epoch_Time:24323.0min: [2024-01-06 
02:56:56,361][model8_pretrain.py][INFO] Epoch:[0/2](766400/4588595) loss:2.542 lr:0.0000100 epoch_Time:24323.0min: [2024-01-06 02:56:56,361][model8_pretrain.py][INFO] Epoch:[0/2](766400/4588595) loss:2.578 lr:0.0000100 epoch_Time:24323.0min: [2024-01-06 02:56:56,361][model8_pretrain.py][INFO] Epoch:[0/2](766400/4588595) loss:2.846 lr:0.0000100 epoch_Time:24323.0min: [2024-01-06 02:56:56,361][model8_pretrain.py][INFO] Epoch:[0/2](766400/4588595) loss:2.841 lr:0.0000100 epoch_Time:24323.0min: [2024-01-06 02:56:56,361][model8_pretrain.py][INFO] Epoch:[0/2](766400/4588595) loss:2.848 lr:0.0000100 epoch_Time:24323.0min: [2024-01-06 02:56:56,362][model8_pretrain.py][INFO] Epoch:[0/2](766400/4588595) loss:3.259 lr:0.0000100 epoch_Time:24323.0min: [2024-01-06 02:57:33,312][model8_pretrain.py][INFO] Epoch:[0/2](766500/4588595) loss:2.624 lr:0.0000100 epoch_Time:24323.0min: [2024-01-06 02:57:33,312][model8_pretrain.py][INFO] Epoch:[0/2](766500/4588595) loss:2.311 lr:0.0000100 epoch_Time:24323.0min: [2024-01-06 02:57:33,312][model8_pretrain.py][INFO] Epoch:[0/2](766500/4588595) loss:2.809 lr:0.0000100 epoch_Time:24323.0min: [2024-01-06 02:57:33,312][model8_pretrain.py][INFO] Epoch:[0/2](766500/4588595) loss:2.642 lr:0.0000100 epoch_Time:24323.0min: [2024-01-06 02:57:33,312][model8_pretrain.py][INFO] Epoch:[0/2](766500/4588595) loss:2.976 lr:0.0000100 epoch_Time:24323.0min: [2024-01-06 02:57:33,312][model8_pretrain.py][INFO] Epoch:[0/2](766500/4588595) loss:1.627 lr:0.0000100 epoch_Time:24323.0min: [2024-01-06 02:57:33,313][model8_pretrain.py][INFO] Epoch:[0/2](766500/4588595) loss:2.925 lr:0.0000100 epoch_Time:24323.0min: [2024-01-06 02:57:33,313][model8_pretrain.py][INFO] Epoch:[0/2](766500/4588595) loss:2.556 lr:0.0000100 epoch_Time:24323.0min: [2024-01-06 02:58:20,221][model8_pretrain.py][INFO] Epoch:[0/2](766600/4588595) loss:2.493 lr:0.0000100 epoch_Time:24323.0min: [2024-01-06 02:58:20,222][model8_pretrain.py][INFO] Epoch:[0/2](766600/4588595) loss:3.043 lr:0.0000100 
epoch_Time:24323.0min: [2024-01-06 02:58:20,222][model8_pretrain.py][INFO] Epoch:[0/2](766600/4588595) loss:2.548 lr:0.0000100 epoch_Time:24323.0min: [2024-01-06 02:58:20,222][model8_pretrain.py][INFO] Epoch:[0/2](766600/4588595) loss:3.175 lr:0.0000100 epoch_Time:24323.0min: [2024-01-06 02:58:20,222][model8_pretrain.py][INFO] Epoch:[0/2](766600/4588595) loss:2.591 lr:0.0000100 epoch_Time:24323.0min: [2024-01-06 02:58:20,222][model8_pretrain.py][INFO] Epoch:[0/2](766600/4588595) loss:2.514 lr:0.0000100 epoch_Time:24323.0min: [2024-01-06 02:58:20,222][model8_pretrain.py][INFO] Epoch:[0/2](766600/4588595) loss:2.756 lr:0.0000100 epoch_Time:24323.0min: [2024-01-06 02:58:20,222][model8_pretrain.py][INFO] Epoch:[0/2](766600/4588595) loss:2.439 lr:0.0000100 epoch_Time:24323.0min: [2024-01-06 02:58:57,150][model8_pretrain.py][INFO] Epoch:[0/2](766700/4588595) loss:3.177 lr:0.0000100 epoch_Time:24322.0min: [2024-01-06 02:58:57,150][model8_pretrain.py][INFO] Epoch:[0/2](766700/4588595) loss:3.156 lr:0.0000100 epoch_Time:24322.0min: [2024-01-06 02:58:57,150][model8_pretrain.py][INFO] Epoch:[0/2](766700/4588595) loss:3.046 lr:0.0000100 epoch_Time:24322.0min: [2024-01-06 02:58:57,150][model8_pretrain.py][INFO] Epoch:[0/2](766700/4588595) loss:2.655 lr:0.0000100 epoch_Time:24322.0min: [2024-01-06 02:58:57,150][model8_pretrain.py][INFO] Epoch:[0/2](766700/4588595) loss:2.607 lr:0.0000100 epoch_Time:24322.0min: [2024-01-06 02:58:57,150][model8_pretrain.py][INFO] Epoch:[0/2](766700/4588595) loss:2.873 lr:0.0000100 epoch_Time:24322.0min: [2024-01-06 02:58:57,150][model8_pretrain.py][INFO] Epoch:[0/2](766700/4588595) loss:3.245 lr:0.0000100 epoch_Time:24322.0min: [2024-01-06 02:58:57,150][model8_pretrain.py][INFO] Epoch:[0/2](766700/4588595) loss:2.916 lr:0.0000100 epoch_Time:24322.0min: [2024-01-06 02:59:34,089][model8_pretrain.py][INFO] Epoch:[0/2](766800/4588595) loss:3.071 lr:0.0000100 epoch_Time:24321.0min: [2024-01-06 02:59:34,089][model8_pretrain.py][INFO] 
Epoch:[0/2](766800/4588595) loss:2.542 lr:0.0000100 epoch_Time:24321.0min: [2024-01-06 02:59:34,089][model8_pretrain.py][INFO] Epoch:[0/2](766800/4588595) loss:3.210 lr:0.0000100 epoch_Time:24321.0min: [2024-01-06 02:59:34,089][model8_pretrain.py][INFO] Epoch:[0/2](766800/4588595) loss:2.981 lr:0.0000100 epoch_Time:24321.0min: [2024-01-06 02:59:34,089][model8_pretrain.py][INFO] Epoch:[0/2](766800/4588595) loss:2.773 lr:0.0000100 epoch_Time:24321.0min: [2024-01-06 02:59:34,089][model8_pretrain.py][INFO] Epoch:[0/2](766800/4588595) loss:2.562 lr:0.0000100 epoch_Time:24321.0min: [2024-01-06 02:59:34,089][model8_pretrain.py][INFO] Epoch:[0/2](766800/4588595) loss:2.975 lr:0.0000100 epoch_Time:24321.0min: [2024-01-06 02:59:34,090][model8_pretrain.py][INFO] Epoch:[0/2](766800/4588595) loss:2.225 lr:0.0000100 epoch_Time:24321.0min: [2024-01-06 03:00:11,028][model8_pretrain.py][INFO] Epoch:[0/2](766900/4588595) loss:2.460 lr:0.0000100 epoch_Time:24320.0min: [2024-01-06 03:00:11,029][model8_pretrain.py][INFO] Epoch:[0/2](766900/4588595) loss:2.902 lr:0.0000100 epoch_Time:24320.0min: [2024-01-06 03:00:11,029][model8_pretrain.py][INFO] Epoch:[0/2](766900/4588595) loss:2.678 lr:0.0000100 epoch_Time:24320.0min: [2024-01-06 03:00:11,029][model8_pretrain.py][INFO] Epoch:[0/2](766900/4588595) loss:2.735 lr:0.0000100 epoch_Time:24320.0min: [2024-01-06 03:00:11,029][model8_pretrain.py][INFO] Epoch:[0/2](766900/4588595) loss:2.814 lr:0.0000100 epoch_Time:24320.0min: [2024-01-06 03:00:11,029][model8_pretrain.py][INFO] Epoch:[0/2](766900/4588595) loss:2.911 lr:0.0000100 epoch_Time:24320.0min: [2024-01-06 03:00:11,029][model8_pretrain.py][INFO] Epoch:[0/2](766900/4588595) loss:2.919 lr:0.0000100 epoch_Time:24320.0min: [2024-01-06 03:00:11,029][model8_pretrain.py][INFO] Epoch:[0/2](766900/4588595) loss:2.864 lr:0.0000100 epoch_Time:24320.0min: [2024-01-06 03:00:47,962][model8_pretrain.py][INFO] Epoch:[0/2](767000/4588595) loss:2.901 lr:0.0000100 epoch_Time:24319.0min: [2024-01-06 
03:00:47,962][model8_pretrain.py][INFO] Epoch:[0/2](767000/4588595) loss:2.993 lr:0.0000100 epoch_Time:24319.0min: [2024-01-06 03:00:47,962][model8_pretrain.py][INFO] Epoch:[0/2](767000/4588595) loss:2.593 lr:0.0000100 epoch_Time:24319.0min: [2024-01-06 03:00:47,962][model8_pretrain.py][INFO] Epoch:[0/2](767000/4588595) loss:2.790 lr:0.0000100 epoch_Time:24319.0min: [2024-01-06 03:00:47,962][model8_pretrain.py][INFO] Epoch:[0/2](767000/4588595) loss:2.549 lr:0.0000100 epoch_Time:24319.0min: [2024-01-06 03:00:47,962][model8_pretrain.py][INFO] Epoch:[0/2](767000/4588595) loss:3.050 lr:0.0000100 epoch_Time:24319.0min: [2024-01-06 03:00:47,962][model8_pretrain.py][INFO] Epoch:[0/2](767000/4588595) loss:2.528 lr:0.0000100 epoch_Time:24319.0min: [2024-01-06 03:00:47,962][model8_pretrain.py][INFO] Epoch:[0/2](767000/4588595) loss:3.602 lr:0.0000100 epoch_Time:24319.0min: [2024-01-06 03:01:24,891][model8_pretrain.py][INFO] Epoch:[0/2](767100/4588595) loss:2.757 lr:0.0000100 epoch_Time:24319.0min: [2024-01-06 03:01:24,891][model8_pretrain.py][INFO] Epoch:[0/2](767100/4588595) loss:2.611 lr:0.0000100 epoch_Time:24319.0min: [2024-01-06 03:01:24,891][model8_pretrain.py][INFO] Epoch:[0/2](767100/4588595) loss:2.827 lr:0.0000100 epoch_Time:24319.0min: [2024-01-06 03:01:24,891][model8_pretrain.py][INFO] Epoch:[0/2](767100/4588595) loss:2.839 lr:0.0000100 epoch_Time:24319.0min: [2024-01-06 03:01:24,891][model8_pretrain.py][INFO] Epoch:[0/2](767100/4588595) loss:2.802 lr:0.0000100 epoch_Time:24319.0min: [2024-01-06 03:01:24,891][model8_pretrain.py][INFO] Epoch:[0/2](767100/4588595) loss:2.613 lr:0.0000100 epoch_Time:24319.0min: [2024-01-06 03:01:24,891][model8_pretrain.py][INFO] Epoch:[0/2](767100/4588595) loss:2.394 lr:0.0000100 epoch_Time:24319.0min: [2024-01-06 03:01:24,891][model8_pretrain.py][INFO] Epoch:[0/2](767100/4588595) loss:2.487 lr:0.0000100 epoch_Time:24319.0min: [2024-01-06 03:02:01,823][model8_pretrain.py][INFO] Epoch:[0/2](767200/4588595) loss:2.854 lr:0.0000100 
epoch_Time:24318.0min: [2024-01-06 03:02:01,823][model8_pretrain.py][INFO] Epoch:[0/2](767200/4588595) loss:3.147 lr:0.0000100 epoch_Time:24318.0min: [2024-01-06 03:02:01,823][model8_pretrain.py][INFO] Epoch:[0/2](767200/4588595) loss:2.739 lr:0.0000100 epoch_Time:24318.0min: [2024-01-06 03:02:01,823][model8_pretrain.py][INFO] Epoch:[0/2](767200/4588595) loss:3.006 lr:0.0000100 epoch_Time:24318.0min: [2024-01-06 03:02:01,823][model8_pretrain.py][INFO] Epoch:[0/2](767200/4588595) loss:2.758 lr:0.0000100 epoch_Time:24318.0min: [2024-01-06 03:02:01,823][model8_pretrain.py][INFO] Epoch:[0/2](767200/4588595) loss:2.494 lr:0.0000100 epoch_Time:24318.0min: [2024-01-06 03:02:01,823][model8_pretrain.py][INFO] Epoch:[0/2](767200/4588595) loss:2.854 lr:0.0000100 epoch_Time:24318.0min: [2024-01-06 03:02:01,823][model8_pretrain.py][INFO] Epoch:[0/2](767200/4588595) loss:2.593 lr:0.0000100 epoch_Time:24318.0min: [2024-01-06 03:02:38,771][model8_pretrain.py][INFO] Epoch:[0/2](767300/4588595) loss:2.789 lr:0.0000100 epoch_Time:24318.0min: [2024-01-06 03:02:38,771][model8_pretrain.py][INFO] Epoch:[0/2](767300/4588595) loss:3.242 lr:0.0000100 epoch_Time:24318.0min: [2024-01-06 03:02:38,771][model8_pretrain.py][INFO] Epoch:[0/2](767300/4588595) loss:3.279 lr:0.0000100 epoch_Time:24318.0min: [2024-01-06 03:02:38,771][model8_pretrain.py][INFO] Epoch:[0/2](767300/4588595) loss:2.590 lr:0.0000100 epoch_Time:24318.0min: [2024-01-06 03:02:38,771][model8_pretrain.py][INFO] Epoch:[0/2](767300/4588595) loss:2.489 lr:0.0000100 epoch_Time:24318.0min: [2024-01-06 03:02:38,771][model8_pretrain.py][INFO] Epoch:[0/2](767300/4588595) loss:2.860 lr:0.0000100 epoch_Time:24318.0min: [2024-01-06 03:02:38,771][model8_pretrain.py][INFO] Epoch:[0/2](767300/4588595) loss:2.975 lr:0.0000100 epoch_Time:24318.0min: [2024-01-06 03:02:38,771][model8_pretrain.py][INFO] Epoch:[0/2](767300/4588595) loss:2.503 lr:0.0000100 epoch_Time:24318.0min: [2024-01-06 03:03:25,826][model8_pretrain.py][INFO] 
Epoch:[0/2](767400/4588595) loss:2.871 lr:0.0000100 epoch_Time:24318.0min: [2024-01-06 03:03:25,826][model8_pretrain.py][INFO] Epoch:[0/2](767400/4588595) loss:2.712 lr:0.0000100 epoch_Time:24318.0min: [2024-01-06 03:03:25,826][model8_pretrain.py][INFO] Epoch:[0/2](767400/4588595) loss:2.838 lr:0.0000100 epoch_Time:24318.0min: [2024-01-06 03:03:25,826][model8_pretrain.py][INFO] Epoch:[0/2](767400/4588595) loss:2.314 lr:0.0000100 epoch_Time:24318.0min: [2024-01-06 03:03:25,826][model8_pretrain.py][INFO] Epoch:[0/2](767400/4588595) loss:2.928 lr:0.0000100 epoch_Time:24318.0min: [2024-01-06 03:03:25,826][model8_pretrain.py][INFO] Epoch:[0/2](767400/4588595) loss:3.066 lr:0.0000100 epoch_Time:24318.0min: [2024-01-06 03:03:25,827][model8_pretrain.py][INFO] Epoch:[0/2](767400/4588595) loss:3.321 lr:0.0000100 epoch_Time:24318.0min: [2024-01-06 03:03:25,827][model8_pretrain.py][INFO] Epoch:[0/2](767400/4588595) loss:3.125 lr:0.0000100 epoch_Time:24318.0min: [2024-01-06 03:04:02,757][model8_pretrain.py][INFO] Epoch:[0/2](767500/4588595) loss:2.682 lr:0.0000100 epoch_Time:24317.0min: [2024-01-06 03:04:02,758][model8_pretrain.py][INFO] Epoch:[0/2](767500/4588595) loss:2.056 lr:0.0000100 epoch_Time:24317.0min: [2024-01-06 03:04:02,758][model8_pretrain.py][INFO] Epoch:[0/2](767500/4588595) loss:3.448 lr:0.0000100 epoch_Time:24317.0min: [2024-01-06 03:04:02,758][model8_pretrain.py][INFO] Epoch:[0/2](767500/4588595) loss:1.794 lr:0.0000100 epoch_Time:24317.0min: [2024-01-06 03:04:02,758][model8_pretrain.py][INFO] Epoch:[0/2](767500/4588595) loss:2.819 lr:0.0000100 epoch_Time:24317.0min: [2024-01-06 03:04:02,758][model8_pretrain.py][INFO] Epoch:[0/2](767500/4588595) loss:2.905 lr:0.0000100 epoch_Time:24317.0min: [2024-01-06 03:04:02,758][model8_pretrain.py][INFO] Epoch:[0/2](767500/4588595) loss:2.945 lr:0.0000100 epoch_Time:24317.0min: [2024-01-06 03:04:02,758][model8_pretrain.py][INFO] Epoch:[0/2](767500/4588595) loss:2.680 lr:0.0000100 epoch_Time:24317.0min: [2024-01-06 
03:04:39,703][model8_pretrain.py][INFO] Epoch:[0/2](767600/4588595) loss:2.886 lr:0.0000100 epoch_Time:24316.0min: [2024-01-06 03:04:39,703][model8_pretrain.py][INFO] Epoch:[0/2](767600/4588595) loss:3.288 lr:0.0000100 epoch_Time:24316.0min: [2024-01-06 03:04:39,703][model8_pretrain.py][INFO] Epoch:[0/2](767600/4588595) loss:2.708 lr:0.0000100 epoch_Time:24316.0min: [2024-01-06 03:04:39,703][model8_pretrain.py][INFO] Epoch:[0/2](767600/4588595) loss:3.123 lr:0.0000100 epoch_Time:24316.0min: [2024-01-06 03:04:39,703][model8_pretrain.py][INFO] Epoch:[0/2](767600/4588595) loss:2.851 lr:0.0000100 epoch_Time:24316.0min: [2024-01-06 03:04:39,703][model8_pretrain.py][INFO] Epoch:[0/2](767600/4588595) loss:2.625 lr:0.0000100 epoch_Time:24316.0min: [2024-01-06 03:04:39,703][model8_pretrain.py][INFO] Epoch:[0/2](767600/4588595) loss:1.984 lr:0.0000100 epoch_Time:24316.0min: [2024-01-06 03:04:39,703][model8_pretrain.py][INFO] Epoch:[0/2](767600/4588595) loss:2.584 lr:0.0000100 epoch_Time:24316.0min: [2024-01-06 03:05:16,648][model8_pretrain.py][INFO] Epoch:[0/2](767700/4588595) loss:2.890 lr:0.0000100 epoch_Time:24315.0min: [2024-01-06 03:05:16,648][model8_pretrain.py][INFO] Epoch:[0/2](767700/4588595) loss:2.528 lr:0.0000100 epoch_Time:24315.0min: [2024-01-06 03:05:16,648][model8_pretrain.py][INFO] Epoch:[0/2](767700/4588595) loss:1.921 lr:0.0000100 epoch_Time:24315.0min: [2024-01-06 03:05:16,648][model8_pretrain.py][INFO] Epoch:[0/2](767700/4588595) loss:2.589 lr:0.0000100 epoch_Time:24315.0min: [2024-01-06 03:05:16,648][model8_pretrain.py][INFO] Epoch:[0/2](767700/4588595) loss:2.729 lr:0.0000100 epoch_Time:24315.0min: [2024-01-06 03:05:16,648][model8_pretrain.py][INFO] Epoch:[0/2](767700/4588595) loss:2.988 lr:0.0000100 epoch_Time:24315.0min: [2024-01-06 03:05:16,648][model8_pretrain.py][INFO] Epoch:[0/2](767700/4588595) loss:2.626 lr:0.0000100 epoch_Time:24315.0min: [2024-01-06 03:05:16,648][model8_pretrain.py][INFO] Epoch:[0/2](767700/4588595) loss:2.499 lr:0.0000100 
epoch_Time:24315.0min: [2024-01-06 03:05:53,595][model8_pretrain.py][INFO] Epoch:[0/2](767800/4588595) loss:3.096 lr:0.0000100 epoch_Time:24314.0min: [2024-01-06 03:05:53,595][model8_pretrain.py][INFO] Epoch:[0/2](767800/4588595) loss:2.338 lr:0.0000100 epoch_Time:24314.0min: [2024-01-06 03:05:53,595][model8_pretrain.py][INFO] Epoch:[0/2](767800/4588595) loss:2.935 lr:0.0000100 epoch_Time:24314.0min: [2024-01-06 03:05:53,595][model8_pretrain.py][INFO] Epoch:[0/2](767800/4588595) loss:3.127 lr:0.0000100 epoch_Time:24314.0min: [2024-01-06 03:05:53,595][model8_pretrain.py][INFO] Epoch:[0/2](767800/4588595) loss:2.682 lr:0.0000100 epoch_Time:24314.0min: [2024-01-06 03:05:53,595][model8_pretrain.py][INFO] Epoch:[0/2](767800/4588595) loss:2.444 lr:0.0000100 epoch_Time:24314.0min: [2024-01-06 03:05:53,596][model8_pretrain.py][INFO] Epoch:[0/2](767800/4588595) loss:2.894 lr:0.0000100 epoch_Time:24314.0min: [2024-01-06 03:05:53,597][model8_pretrain.py][INFO] Epoch:[0/2](767800/4588595) loss:2.974 lr:0.0000100 epoch_Time:24314.0min: [2024-01-06 03:06:30,553][model8_pretrain.py][INFO] Epoch:[0/2](767900/4588595) loss:2.347 lr:0.0000100 epoch_Time:24314.0min: [2024-01-06 03:06:30,553][model8_pretrain.py][INFO] Epoch:[0/2](767900/4588595) loss:2.598 lr:0.0000100 epoch_Time:24314.0min: [2024-01-06 03:06:30,553][model8_pretrain.py][INFO] Epoch:[0/2](767900/4588595) loss:2.551 lr:0.0000100 epoch_Time:24314.0min: [2024-01-06 03:06:30,553][model8_pretrain.py][INFO] Epoch:[0/2](767900/4588595) loss:2.520 lr:0.0000100 epoch_Time:24314.0min: [2024-01-06 03:06:30,553][model8_pretrain.py][INFO] Epoch:[0/2](767900/4588595) loss:2.865 lr:0.0000100 epoch_Time:24314.0min: [2024-01-06 03:06:30,553][model8_pretrain.py][INFO] Epoch:[0/2](767900/4588595) loss:3.090 lr:0.0000100 epoch_Time:24314.0min: [2024-01-06 03:06:30,553][model8_pretrain.py][INFO] Epoch:[0/2](767900/4588595) loss:2.891 lr:0.0000100 epoch_Time:24314.0min: [2024-01-06 03:06:30,553][model8_pretrain.py][INFO] 
Epoch:[0/2](767900/4588595) loss:2.492 lr:0.0000100 epoch_Time:24314.0min: [2024-01-06 03:07:07,493][model8_pretrain.py][INFO] Epoch:[0/2](768000/4588595) loss:2.169 lr:0.0000100 epoch_Time:24313.0min: [2024-01-06 03:07:07,493][model8_pretrain.py][INFO] Epoch:[0/2](768000/4588595) loss:3.233 lr:0.0000100 epoch_Time:24313.0min: [2024-01-06 03:07:07,493][model8_pretrain.py][INFO] Epoch:[0/2](768000/4588595) loss:2.810 lr:0.0000100 epoch_Time:24313.0min: [2024-01-06 03:07:07,493][model8_pretrain.py][INFO] Epoch:[0/2](768000/4588595) loss:2.616 lr:0.0000100 epoch_Time:24313.0min: [2024-01-06 03:07:07,493][model8_pretrain.py][INFO] Epoch:[0/2](768000/4588595) loss:2.743 lr:0.0000100 epoch_Time:24313.0min: [2024-01-06 03:07:07,493][model8_pretrain.py][INFO] Epoch:[0/2](768000/4588595) loss:1.911 lr:0.0000100 epoch_Time:24313.0min: [2024-01-06 03:07:07,493][model8_pretrain.py][INFO] Epoch:[0/2](768000/4588595) loss:3.088 lr:0.0000100 epoch_Time:24313.0min: [2024-01-06 03:07:07,493][model8_pretrain.py][INFO] Epoch:[0/2](768000/4588595) loss:3.117 lr:0.0000100 epoch_Time:24313.0min: [2024-01-06 03:07:44,437][model8_pretrain.py][INFO] Epoch:[0/2](768100/4588595) loss:2.913 lr:0.0000100 epoch_Time:24313.0min: [2024-01-06 03:07:44,437][model8_pretrain.py][INFO] Epoch:[0/2](768100/4588595) loss:2.810 lr:0.0000100 epoch_Time:24313.0min: [2024-01-06 03:07:44,437][model8_pretrain.py][INFO] Epoch:[0/2](768100/4588595) loss:2.946 lr:0.0000100 epoch_Time:24313.0min: [2024-01-06 03:07:44,437][model8_pretrain.py][INFO] Epoch:[0/2](768100/4588595) loss:2.846 lr:0.0000100 epoch_Time:24313.0min: [2024-01-06 03:07:44,437][model8_pretrain.py][INFO] Epoch:[0/2](768100/4588595) loss:2.412 lr:0.0000100 epoch_Time:24313.0min: [2024-01-06 03:07:44,437][model8_pretrain.py][INFO] Epoch:[0/2](768100/4588595) loss:2.980 lr:0.0000100 epoch_Time:24313.0min: [2024-01-06 03:07:44,437][model8_pretrain.py][INFO] Epoch:[0/2](768100/4588595) loss:3.206 lr:0.0000100 epoch_Time:24313.0min: [2024-01-06 
03:07:44,438][model8_pretrain.py][INFO] Epoch:[0/2](768100/4588595) loss:2.553 lr:0.0000100 epoch_Time:24313.0min: [2024-01-06 03:08:31,434][model8_pretrain.py][INFO] Epoch:[0/2](768200/4588595) loss:2.772 lr:0.0000100 epoch_Time:24313.0min: [2024-01-06 03:08:31,434][model8_pretrain.py][INFO] Epoch:[0/2](768200/4588595) loss:2.557 lr:0.0000100 epoch_Time:24313.0min: [2024-01-06 03:08:31,434][model8_pretrain.py][INFO] Epoch:[0/2](768200/4588595) loss:2.544 lr:0.0000100 epoch_Time:24313.0min: [2024-01-06 03:08:31,434][model8_pretrain.py][INFO] Epoch:[0/2](768200/4588595) loss:3.127 lr:0.0000100 epoch_Time:24313.0min: [2024-01-06 03:08:31,434][model8_pretrain.py][INFO] Epoch:[0/2](768200/4588595) loss:2.261 lr:0.0000100 epoch_Time:24313.0min: [2024-01-06 03:08:31,434][model8_pretrain.py][INFO] Epoch:[0/2](768200/4588595) loss:2.224 lr:0.0000100 epoch_Time:24313.0min: [2024-01-06 03:08:31,434][model8_pretrain.py][INFO] Epoch:[0/2](768200/4588595) loss:3.268 lr:0.0000100 epoch_Time:24313.0min: [2024-01-06 03:08:31,434][model8_pretrain.py][INFO] Epoch:[0/2](768200/4588595) loss:2.708 lr:0.0000100 epoch_Time:24313.0min: [2024-01-06 03:09:08,362][model8_pretrain.py][INFO] Epoch:[0/2](768300/4588595) loss:2.852 lr:0.0000100 epoch_Time:24312.0min: [2024-01-06 03:09:08,362][model8_pretrain.py][INFO] Epoch:[0/2](768300/4588595) loss:2.168 lr:0.0000100 epoch_Time:24312.0min: [2024-01-06 03:09:08,362][model8_pretrain.py][INFO] Epoch:[0/2](768300/4588595) loss:2.049 lr:0.0000100 epoch_Time:24312.0min: [2024-01-06 03:09:08,362][model8_pretrain.py][INFO] Epoch:[0/2](768300/4588595) loss:2.945 lr:0.0000100 epoch_Time:24312.0min: [2024-01-06 03:09:08,362][model8_pretrain.py][INFO] Epoch:[0/2](768300/4588595) loss:2.437 lr:0.0000100 epoch_Time:24312.0min: [2024-01-06 03:09:08,362][model8_pretrain.py][INFO] Epoch:[0/2](768300/4588595) loss:2.235 lr:0.0000100 epoch_Time:24312.0min: [2024-01-06 03:09:08,362][model8_pretrain.py][INFO] Epoch:[0/2](768300/4588595) loss:2.986 lr:0.0000100 
epoch_Time:24312.0min: [2024-01-06 03:09:08,362][model8_pretrain.py][INFO] Epoch:[0/2](768300/4588595) loss:2.872 lr:0.0000100 epoch_Time:24312.0min: [2024-01-06 03:09:45,306][model8_pretrain.py][INFO] Epoch:[0/2](768400/4588595) loss:2.717 lr:0.0000100 epoch_Time:24311.0min: [2024-01-06 03:09:45,306][model8_pretrain.py][INFO] Epoch:[0/2](768400/4588595) loss:2.660 lr:0.0000100 epoch_Time:24311.0min: [2024-01-06 03:09:45,306][model8_pretrain.py][INFO] Epoch:[0/2](768400/4588595) loss:2.593 lr:0.0000100 epoch_Time:24311.0min: [2024-01-06 03:09:45,306][model8_pretrain.py][INFO] Epoch:[0/2](768400/4588595) loss:3.314 lr:0.0000100 epoch_Time:24311.0min: [2024-01-06 03:09:45,306][model8_pretrain.py][INFO] Epoch:[0/2](768400/4588595) loss:2.498 lr:0.0000100 epoch_Time:24311.0min: [2024-01-06 03:09:45,306][model8_pretrain.py][INFO] Epoch:[0/2](768400/4588595) loss:2.192 lr:0.0000100 epoch_Time:24311.0min: [2024-01-06 03:09:45,306][model8_pretrain.py][INFO] Epoch:[0/2](768400/4588595) loss:2.843 lr:0.0000100 epoch_Time:24311.0min: [2024-01-06 03:09:45,306][model8_pretrain.py][INFO] Epoch:[0/2](768400/4588595) loss:2.500 lr:0.0000100 epoch_Time:24311.0min: [2024-01-06 03:10:22,269][model8_pretrain.py][INFO] Epoch:[0/2](768500/4588595) loss:2.684 lr:0.0000100 epoch_Time:24310.0min: [2024-01-06 03:10:22,269][model8_pretrain.py][INFO] Epoch:[0/2](768500/4588595) loss:2.731 lr:0.0000100 epoch_Time:24310.0min: [2024-01-06 03:10:22,269][model8_pretrain.py][INFO] Epoch:[0/2](768500/4588595) loss:2.691 lr:0.0000100 epoch_Time:24310.0min: [2024-01-06 03:10:22,269][model8_pretrain.py][INFO] Epoch:[0/2](768500/4588595) loss:2.713 lr:0.0000100 epoch_Time:24310.0min: [2024-01-06 03:10:22,269][model8_pretrain.py][INFO] Epoch:[0/2](768500/4588595) loss:2.964 lr:0.0000100 epoch_Time:24310.0min: [2024-01-06 03:10:22,269][model8_pretrain.py][INFO] Epoch:[0/2](768500/4588595) loss:2.979 lr:0.0000100 epoch_Time:24310.0min: [2024-01-06 03:10:22,269][model8_pretrain.py][INFO] 
Epoch:[0/2](768500/4588595) loss:2.889 lr:0.0000100 epoch_Time:24310.0min: [2024-01-06 03:10:22,269][model8_pretrain.py][INFO] Epoch:[0/2](768500/4588595) loss:3.001 lr:0.0000100 epoch_Time:24310.0min: [2024-01-06 03:10:59,214][model8_pretrain.py][INFO] Epoch:[0/2](768600/4588595) loss:2.703 lr:0.0000100 epoch_Time:24309.0min: [2024-01-06 03:10:59,214][model8_pretrain.py][INFO] Epoch:[0/2](768600/4588595) loss:2.869 lr:0.0000100 epoch_Time:24309.0min: [2024-01-06 03:10:59,214][model8_pretrain.py][INFO] Epoch:[0/2](768600/4588595) loss:3.243 lr:0.0000100 epoch_Time:24309.0min: [2024-01-06 03:10:59,214][model8_pretrain.py][INFO] Epoch:[0/2](768600/4588595) loss:2.932 lr:0.0000100 epoch_Time:24309.0min: [2024-01-06 03:10:59,214][model8_pretrain.py][INFO] Epoch:[0/2](768600/4588595) loss:2.549 lr:0.0000100 epoch_Time:24309.0min: [2024-01-06 03:10:59,214][model8_pretrain.py][INFO] Epoch:[0/2](768600/4588595) loss:2.747 lr:0.0000100 epoch_Time:24309.0min: [2024-01-06 03:10:59,214][model8_pretrain.py][INFO] Epoch:[0/2](768600/4588595) loss:3.292 lr:0.0000100 epoch_Time:24309.0min: [2024-01-06 03:10:59,214][model8_pretrain.py][INFO] Epoch:[0/2](768600/4588595) loss:3.108 lr:0.0000100 epoch_Time:24309.0min: [2024-01-06 03:11:36,163][model8_pretrain.py][INFO] Epoch:[0/2](768700/4588595) loss:2.306 lr:0.0000100 epoch_Time:24309.0min: [2024-01-06 03:11:36,163][model8_pretrain.py][INFO] Epoch:[0/2](768700/4588595) loss:3.147 lr:0.0000100 epoch_Time:24309.0min: [2024-01-06 03:11:36,163][model8_pretrain.py][INFO] Epoch:[0/2](768700/4588595) loss:2.867 lr:0.0000100 epoch_Time:24309.0min: [2024-01-06 03:11:36,163][model8_pretrain.py][INFO] Epoch:[0/2](768700/4588595) loss:2.813 lr:0.0000100 epoch_Time:24309.0min: [2024-01-06 03:11:36,163][model8_pretrain.py][INFO] Epoch:[0/2](768700/4588595) loss:3.010 lr:0.0000100 epoch_Time:24309.0min: [2024-01-06 03:11:36,163][model8_pretrain.py][INFO] Epoch:[0/2](768700/4588595) loss:2.832 lr:0.0000100 epoch_Time:24309.0min: [2024-01-06 
03:11:36,163][model8_pretrain.py][INFO] Epoch:[0/2](768700/4588595) loss:3.268 lr:0.0000100 epoch_Time:24309.0min: [2024-01-06 03:11:36,163][model8_pretrain.py][INFO] Epoch:[0/2](768700/4588595) loss:3.201 lr:0.0000100 epoch_Time:24309.0min: [2024-01-06 03:12:13,107][model8_pretrain.py][INFO] Epoch:[0/2](768800/4588595) loss:2.806 lr:0.0000100 epoch_Time:24308.0min: [2024-01-06 03:12:13,107][model8_pretrain.py][INFO] Epoch:[0/2](768800/4588595) loss:2.564 lr:0.0000100 epoch_Time:24308.0min: [2024-01-06 03:12:13,107][model8_pretrain.py][INFO] Epoch:[0/2](768800/4588595) loss:2.393 lr:0.0000100 epoch_Time:24308.0min: [2024-01-06 03:12:13,107][model8_pretrain.py][INFO] Epoch:[0/2](768800/4588595) loss:3.057 lr:0.0000100 epoch_Time:24308.0min: [2024-01-06 03:12:13,107][model8_pretrain.py][INFO] Epoch:[0/2](768800/4588595) loss:2.989 lr:0.0000100 epoch_Time:24308.0min: [2024-01-06 03:12:13,107][model8_pretrain.py][INFO] Epoch:[0/2](768800/4588595) loss:2.966 lr:0.0000100 epoch_Time:24308.0min: [2024-01-06 03:12:13,107][model8_pretrain.py][INFO] Epoch:[0/2](768800/4588595) loss:2.888 lr:0.0000100 epoch_Time:24308.0min: [2024-01-06 03:12:13,107][model8_pretrain.py][INFO] Epoch:[0/2](768800/4588595) loss:2.563 lr:0.0000100 epoch_Time:24308.0min: [2024-01-06 03:12:50,012][model8_pretrain.py][INFO] Epoch:[0/2](768900/4588595) loss:2.937 lr:0.0000100 epoch_Time:24307.0min: [2024-01-06 03:12:50,012][model8_pretrain.py][INFO] Epoch:[0/2](768900/4588595) loss:2.572 lr:0.0000100 epoch_Time:24307.0min: [2024-01-06 03:12:50,013][model8_pretrain.py][INFO] Epoch:[0/2](768900/4588595) loss:3.255 lr:0.0000100 epoch_Time:24307.0min: [2024-01-06 03:12:50,013][model8_pretrain.py][INFO] Epoch:[0/2](768900/4588595) loss:2.763 lr:0.0000100 epoch_Time:24307.0min: [2024-01-06 03:12:50,013][model8_pretrain.py][INFO] Epoch:[0/2](768900/4588595) loss:2.636 lr:0.0000100 epoch_Time:24307.0min: [2024-01-06 03:12:50,013][model8_pretrain.py][INFO] Epoch:[0/2](768900/4588595) loss:2.766 lr:0.0000100 
epoch_Time:24307.0min: [2024-01-06 03:12:50,013][model8_pretrain.py][INFO] Epoch:[0/2](768900/4588595) loss:2.750 lr:0.0000100 epoch_Time:24307.0min: [2024-01-06 03:12:50,013][model8_pretrain.py][INFO] Epoch:[0/2](768900/4588595) loss:2.693 lr:0.0000100 epoch_Time:24307.0min: [2024-01-06 03:13:37,253][model8_pretrain.py][INFO] Epoch:[0/2](769000/4588595) loss:2.562 lr:0.0000100 epoch_Time:24308.0min: [2024-01-06 03:13:37,253][model8_pretrain.py][INFO] Epoch:[0/2](769000/4588595) loss:2.722 lr:0.0000100 epoch_Time:24308.0min: [2024-01-06 03:13:37,253][model8_pretrain.py][INFO] Epoch:[0/2](769000/4588595) loss:2.522 lr:0.0000100 epoch_Time:24308.0min: [2024-01-06 03:13:37,253][model8_pretrain.py][INFO] Epoch:[0/2](769000/4588595) loss:2.536 lr:0.0000100 epoch_Time:24308.0min: [2024-01-06 03:13:37,253][model8_pretrain.py][INFO] Epoch:[0/2](769000/4588595) loss:2.513 lr:0.0000100 epoch_Time:24308.0min: [2024-01-06 03:13:37,253][model8_pretrain.py][INFO] Epoch:[0/2](769000/4588595) loss:2.618 lr:0.0000100 epoch_Time:24308.0min: [2024-01-06 03:13:37,253][model8_pretrain.py][INFO] Epoch:[0/2](769000/4588595) loss:2.788 lr:0.0000100 epoch_Time:24308.0min: [2024-01-06 03:13:37,254][model8_pretrain.py][INFO] Epoch:[0/2](769000/4588595) loss:2.583 lr:0.0000100 epoch_Time:24308.0min: [2024-01-06 03:14:14,150][model8_pretrain.py][INFO] Epoch:[0/2](769100/4588595) loss:3.345 lr:0.0000100 epoch_Time:24307.0min: [2024-01-06 03:14:14,150][model8_pretrain.py][INFO] Epoch:[0/2](769100/4588595) loss:2.915 lr:0.0000100 epoch_Time:24307.0min: [2024-01-06 03:14:14,150][model8_pretrain.py][INFO] Epoch:[0/2](769100/4588595) loss:3.141 lr:0.0000100 epoch_Time:24307.0min: [2024-01-06 03:14:14,150][model8_pretrain.py][INFO] Epoch:[0/2](769100/4588595) loss:2.925 lr:0.0000100 epoch_Time:24307.0min: [2024-01-06 03:14:14,150][model8_pretrain.py][INFO] Epoch:[0/2](769100/4588595) loss:2.938 lr:0.0000100 epoch_Time:24307.0min: [2024-01-06 03:14:14,150][model8_pretrain.py][INFO] 
Epoch:[0/2](769100/4588595) loss:3.113 lr:0.0000100 epoch_Time:24307.0min: [2024-01-06 03:14:14,150][model8_pretrain.py][INFO] Epoch:[0/2](769100/4588595) loss:2.631 lr:0.0000100 epoch_Time:24307.0min: [2024-01-06 03:14:14,150][model8_pretrain.py][INFO] Epoch:[0/2](769100/4588595) loss:2.940 lr:0.0000100 epoch_Time:24307.0min: [2024-01-06 03:14:51,083][model8_pretrain.py][INFO] Epoch:[0/2](769200/4588595) loss:2.833 lr:0.0000100 epoch_Time:24305.0min: [2024-01-06 03:14:51,083][model8_pretrain.py][INFO] Epoch:[0/2](769200/4588595) loss:2.506 lr:0.0000100 epoch_Time:24306.0min: [2024-01-06 03:14:51,083][model8_pretrain.py][INFO] Epoch:[0/2](769200/4588595) loss:3.313 lr:0.0000100 epoch_Time:24306.0min: [2024-01-06 03:14:51,083][model8_pretrain.py][INFO] Epoch:[0/2](769200/4588595) loss:2.748 lr:0.0000100 epoch_Time:24306.0min: [2024-01-06 03:14:51,083][model8_pretrain.py][INFO] Epoch:[0/2](769200/4588595) loss:3.312 lr:0.0000100 epoch_Time:24306.0min: [2024-01-06 03:14:51,083][model8_pretrain.py][INFO] Epoch:[0/2](769200/4588595) loss:2.982 lr:0.0000100 epoch_Time:24305.0min: [2024-01-06 03:14:51,083][model8_pretrain.py][INFO] Epoch:[0/2](769200/4588595) loss:3.378 lr:0.0000100 epoch_Time:24306.0min: [2024-01-06 03:14:51,083][model8_pretrain.py][INFO] Epoch:[0/2](769200/4588595) loss:3.096 lr:0.0000100 epoch_Time:24306.0min: [2024-01-06 03:15:28,020][model8_pretrain.py][INFO] Epoch:[0/2](769300/4588595) loss:3.153 lr:0.0000100 epoch_Time:24305.0min: [2024-01-06 03:15:28,021][model8_pretrain.py][INFO] Epoch:[0/2](769300/4588595) loss:2.822 lr:0.0000100 epoch_Time:24305.0min: [2024-01-06 03:15:28,020][model8_pretrain.py][INFO] Epoch:[0/2](769300/4588595) loss:2.880 lr:0.0000100 epoch_Time:24305.0min: [2024-01-06 03:15:28,021][model8_pretrain.py][INFO] Epoch:[0/2](769300/4588595) loss:2.691 lr:0.0000100 epoch_Time:24305.0min: [2024-01-06 03:15:28,021][model8_pretrain.py][INFO] Epoch:[0/2](769300/4588595) loss:2.798 lr:0.0000100 epoch_Time:24305.0min: [2024-01-06 
03:15:28,021][model8_pretrain.py][INFO] Epoch:[0/2](769300/4588595) loss:2.972 lr:0.0000100 epoch_Time:24305.0min: [2024-01-06 03:15:28,021][model8_pretrain.py][INFO] Epoch:[0/2](769300/4588595) loss:3.256 lr:0.0000100 epoch_Time:24305.0min: [2024-01-06 03:15:28,021][model8_pretrain.py][INFO] Epoch:[0/2](769300/4588595) loss:2.568 lr:0.0000100 epoch_Time:24305.0min: [2024-01-06 03:16:04,959][model8_pretrain.py][INFO] Epoch:[0/2](769400/4588595) loss:2.941 lr:0.0000100 epoch_Time:24304.0min: [2024-01-06 03:16:04,959][model8_pretrain.py][INFO] Epoch:[0/2](769400/4588595) loss:2.971 lr:0.0000100 epoch_Time:24304.0min: [2024-01-06 03:16:04,959][model8_pretrain.py][INFO] Epoch:[0/2](769400/4588595) loss:3.075 lr:0.0000100 epoch_Time:24304.0min: [2024-01-06 03:16:04,959][model8_pretrain.py][INFO] Epoch:[0/2](769400/4588595) loss:2.148 lr:0.0000100 epoch_Time:24304.0min: [2024-01-06 03:16:04,959][model8_pretrain.py][INFO] Epoch:[0/2](769400/4588595) loss:3.017 lr:0.0000100 epoch_Time:24304.0min: [2024-01-06 03:16:04,959][model8_pretrain.py][INFO] Epoch:[0/2](769400/4588595) loss:2.422 lr:0.0000100 epoch_Time:24304.0min: [2024-01-06 03:16:04,959][model8_pretrain.py][INFO] Epoch:[0/2](769400/4588595) loss:2.707 lr:0.0000100 epoch_Time:24304.0min: [2024-01-06 03:16:04,959][model8_pretrain.py][INFO] Epoch:[0/2](769400/4588595) loss:2.897 lr:0.0000100 epoch_Time:24304.0min: [2024-01-06 03:16:41,894][model8_pretrain.py][INFO] Epoch:[0/2](769500/4588595) loss:2.872 lr:0.0000100 epoch_Time:24304.0min: [2024-01-06 03:16:41,894][model8_pretrain.py][INFO] Epoch:[0/2](769500/4588595) loss:2.933 lr:0.0000100 epoch_Time:24304.0min: [2024-01-06 03:16:41,894][model8_pretrain.py][INFO] Epoch:[0/2](769500/4588595) loss:2.916 lr:0.0000100 epoch_Time:24304.0min: [2024-01-06 03:16:41,894][model8_pretrain.py][INFO] Epoch:[0/2](769500/4588595) loss:3.151 lr:0.0000100 epoch_Time:24304.0min: [2024-01-06 03:16:41,894][model8_pretrain.py][INFO] Epoch:[0/2](769500/4588595) loss:3.400 lr:0.0000100 
epoch_Time:24304.0min: [2024-01-06 03:16:41,894][model8_pretrain.py][INFO] Epoch:[0/2](769500/4588595) loss:2.874 lr:0.0000100 epoch_Time:24304.0min: [2024-01-06 03:16:41,894][model8_pretrain.py][INFO] Epoch:[0/2](769500/4588595) loss:3.187 lr:0.0000100 epoch_Time:24304.0min: [2024-01-06 03:16:41,894][model8_pretrain.py][INFO] Epoch:[0/2](769500/4588595) loss:3.068 lr:0.0000100 epoch_Time:24304.0min: [2024-01-06 03:17:18,840][model8_pretrain.py][INFO] Epoch:[0/2](769600/4588595) loss:2.590 lr:0.0000100 epoch_Time:24303.0min: [2024-01-06 03:17:18,840][model8_pretrain.py][INFO] Epoch:[0/2](769600/4588595) loss:2.509 lr:0.0000100 epoch_Time:24303.0min: [2024-01-06 03:17:18,840][model8_pretrain.py][INFO] Epoch:[0/2](769600/4588595) loss:3.392 lr:0.0000100 epoch_Time:24303.0min: [2024-01-06 03:17:18,840][model8_pretrain.py][INFO] Epoch:[0/2](769600/4588595) loss:2.949 lr:0.0000100 epoch_Time:24303.0min: [2024-01-06 03:17:18,840][model8_pretrain.py][INFO] Epoch:[0/2](769600/4588595) loss:3.120 lr:0.0000100 epoch_Time:24303.0min: [2024-01-06 03:17:18,840][model8_pretrain.py][INFO] Epoch:[0/2](769600/4588595) loss:2.974 lr:0.0000100 epoch_Time:24303.0min: [2024-01-06 03:17:18,840][model8_pretrain.py][INFO] Epoch:[0/2](769600/4588595) loss:2.590 lr:0.0000100 epoch_Time:24303.0min: [2024-01-06 03:17:18,840][model8_pretrain.py][INFO] Epoch:[0/2](769600/4588595) loss:2.534 lr:0.0000100 epoch_Time:24303.0min: [2024-01-06 03:17:55,778][model8_pretrain.py][INFO] Epoch:[0/2](769700/4588595) loss:2.585 lr:0.0000100 epoch_Time:24302.0min: [2024-01-06 03:17:55,778][model8_pretrain.py][INFO] Epoch:[0/2](769700/4588595) loss:2.566 lr:0.0000100 epoch_Time:24302.0min: [2024-01-06 03:17:55,778][model8_pretrain.py][INFO] Epoch:[0/2](769700/4588595) loss:2.753 lr:0.0000100 epoch_Time:24302.0min: [2024-01-06 03:17:55,778][model8_pretrain.py][INFO] Epoch:[0/2](769700/4588595) loss:3.088 lr:0.0000100 epoch_Time:24302.0min: [2024-01-06 03:17:55,778][model8_pretrain.py][INFO] 
Epoch:[0/2](769700/4588595) loss:3.294 lr:0.0000100 epoch_Time:24302.0min: [2024-01-06 03:17:55,779][model8_pretrain.py][INFO] Epoch:[0/2](769700/4588595) loss:2.367 lr:0.0000100 epoch_Time:24302.0min: [2024-01-06 03:17:55,779][model8_pretrain.py][INFO] Epoch:[0/2](769700/4588595) loss:2.707 lr:0.0000100 epoch_Time:24302.0min: [2024-01-06 03:17:55,779][model8_pretrain.py][INFO] Epoch:[0/2](769700/4588595) loss:2.596 lr:0.0000100 epoch_Time:24302.0min: [2024-01-06 03:18:43,287][model8_pretrain.py][INFO] Epoch:[0/2](769800/4588595) loss:3.017 lr:0.0000100 epoch_Time:24303.0min: [2024-01-06 03:18:43,287][model8_pretrain.py][INFO] Epoch:[0/2](769800/4588595) loss:2.673 lr:0.0000100 epoch_Time:24303.0min: [2024-01-06 03:18:43,287][model8_pretrain.py][INFO] Epoch:[0/2](769800/4588595) loss:2.449 lr:0.0000100 epoch_Time:24303.0min: [2024-01-06 03:18:43,287][model8_pretrain.py][INFO] Epoch:[0/2](769800/4588595) loss:2.177 lr:0.0000100 epoch_Time:24303.0min: [2024-01-06 03:18:43,287][model8_pretrain.py][INFO] Epoch:[0/2](769800/4588595) loss:2.563 lr:0.0000100 epoch_Time:24303.0min: [2024-01-06 03:18:43,287][model8_pretrain.py][INFO] Epoch:[0/2](769800/4588595) loss:2.889 lr:0.0000100 epoch_Time:24303.0min: [2024-01-06 03:18:43,287][model8_pretrain.py][INFO] Epoch:[0/2](769800/4588595) loss:3.262 lr:0.0000100 epoch_Time:24303.0min: [2024-01-06 03:18:43,287][model8_pretrain.py][INFO] Epoch:[0/2](769800/4588595) loss:2.906 lr:0.0000100 epoch_Time:24303.0min: [2024-01-06 03:19:20,205][model8_pretrain.py][INFO] Epoch:[0/2](769900/4588595) loss:2.944 lr:0.0000100 epoch_Time:24302.0min: [2024-01-06 03:19:20,205][model8_pretrain.py][INFO] Epoch:[0/2](769900/4588595) loss:2.891 lr:0.0000100 epoch_Time:24302.0min: [2024-01-06 03:19:20,205][model8_pretrain.py][INFO] Epoch:[0/2](769900/4588595) loss:2.661 lr:0.0000100 epoch_Time:24302.0min: [2024-01-06 03:19:20,205][model8_pretrain.py][INFO] Epoch:[0/2](769900/4588595) loss:2.407 lr:0.0000100 epoch_Time:24302.0min: [2024-01-06 
03:19:20,205][model8_pretrain.py][INFO] Epoch:[0/2](769900/4588595) loss:2.800 lr:0.0000100 epoch_Time:24302.0min: [2024-01-06 03:19:20,205][model8_pretrain.py][INFO] Epoch:[0/2](769900/4588595) loss:2.311 lr:0.0000100 epoch_Time:24302.0min: [2024-01-06 03:19:20,205][model8_pretrain.py][INFO] Epoch:[0/2](769900/4588595) loss:3.183 lr:0.0000100 epoch_Time:24302.0min: [2024-01-06 03:19:20,205][model8_pretrain.py][INFO] Epoch:[0/2](769900/4588595) loss:2.016 lr:0.0000100 epoch_Time:24302.0min: [2024-01-06 03:19:57,179][model8_pretrain.py][INFO] Epoch:[0/2](770000/4588595) loss:2.532 lr:0.0000100 epoch_Time:24301.0min: [2024-01-06 03:19:57,179][model8_pretrain.py][INFO] Epoch:[0/2](770000/4588595) loss:3.053 lr:0.0000100 epoch_Time:24301.0min: [2024-01-06 03:19:57,179][model8_pretrain.py][INFO] Epoch:[0/2](770000/4588595) loss:3.239 lr:0.0000100 epoch_Time:24301.0min: [2024-01-06 03:19:57,179][model8_pretrain.py][INFO] Epoch:[0/2](770000/4588595) loss:3.438 lr:0.0000100 epoch_Time:24301.0min: [2024-01-06 03:19:57,180][model8_pretrain.py][INFO] Epoch:[0/2](770000/4588595) loss:3.024 lr:0.0000100 epoch_Time:24301.0min: [2024-01-06 03:19:57,180][model8_pretrain.py][INFO] Epoch:[0/2](770000/4588595) loss:2.355 lr:0.0000100 epoch_Time:24301.0min: [2024-01-06 03:19:57,180][model8_pretrain.py][INFO] Epoch:[0/2](770000/4588595) loss:2.808 lr:0.0000100 epoch_Time:24301.0min: [2024-01-06 03:19:57,180][model8_pretrain.py][INFO] Epoch:[0/2](770000/4588595) loss:2.818 lr:0.0000100 epoch_Time:24301.0min: [2024-01-06 03:20:34,150][model8_pretrain.py][INFO] Epoch:[0/2](770100/4588595) loss:3.085 lr:0.0000100 epoch_Time:24300.0min: [2024-01-06 03:20:34,150][model8_pretrain.py][INFO] Epoch:[0/2](770100/4588595) loss:2.699 lr:0.0000100 epoch_Time:24300.0min: [2024-01-06 03:20:34,150][model8_pretrain.py][INFO] Epoch:[0/2](770100/4588595) loss:2.841 lr:0.0000100 epoch_Time:24300.0min: [2024-01-06 03:20:34,150][model8_pretrain.py][INFO] Epoch:[0/2](770100/4588595) loss:2.975 lr:0.0000100 
epoch_Time:24300.0min: [2024-01-06 03:20:34,150][model8_pretrain.py][INFO] Epoch:[0/2](770100/4588595) loss:2.526 lr:0.0000100 epoch_Time:24300.0min: [2024-01-06 03:20:34,150][model8_pretrain.py][INFO] Epoch:[0/2](770100/4588595) loss:2.731 lr:0.0000100 epoch_Time:24300.0min: [2024-01-06 03:20:34,150][model8_pretrain.py][INFO] Epoch:[0/2](770100/4588595) loss:2.262 lr:0.0000100 epoch_Time:24300.0min: [2024-01-06 03:20:34,150][model8_pretrain.py][INFO] Epoch:[0/2](770100/4588595) loss:2.583 lr:0.0000100 epoch_Time:24300.0min: [2024-01-06 03:21:11,100][model8_pretrain.py][INFO] Epoch:[0/2](770200/4588595) loss:2.754 lr:0.0000100 epoch_Time:24299.0min: [2024-01-06 03:21:11,100][model8_pretrain.py][INFO] Epoch:[0/2](770200/4588595) loss:2.386 lr:0.0000100 epoch_Time:24299.0min: [2024-01-06 03:21:11,100][model8_pretrain.py][INFO] Epoch:[0/2](770200/4588595) loss:3.107 lr:0.0000100 epoch_Time:24299.0min: [2024-01-06 03:21:11,100][model8_pretrain.py][INFO] Epoch:[0/2](770200/4588595) loss:2.853 lr:0.0000100 epoch_Time:24299.0min: [2024-01-06 03:21:11,100][model8_pretrain.py][INFO] Epoch:[0/2](770200/4588595) loss:3.307 lr:0.0000100 epoch_Time:24299.0min: [2024-01-06 03:21:11,100][model8_pretrain.py][INFO] Epoch:[0/2](770200/4588595) loss:3.107 lr:0.0000100 epoch_Time:24299.0min: [2024-01-06 03:21:11,101][model8_pretrain.py][INFO] Epoch:[0/2](770200/4588595) loss:2.666 lr:0.0000100 epoch_Time:24299.0min: [2024-01-06 03:21:11,100][model8_pretrain.py][INFO] Epoch:[0/2](770200/4588595) loss:3.043 lr:0.0000100 epoch_Time:24299.0min: [2024-01-06 03:21:48,043][model8_pretrain.py][INFO] Epoch:[0/2](770300/4588595) loss:3.039 lr:0.0000100 epoch_Time:24298.0min: [2024-01-06 03:21:48,044][model8_pretrain.py][INFO] Epoch:[0/2](770300/4588595) loss:3.016 lr:0.0000100 epoch_Time:24298.0min: [2024-01-06 03:21:48,044][model8_pretrain.py][INFO] Epoch:[0/2](770300/4588595) loss:3.218 lr:0.0000100 epoch_Time:24298.0min: [2024-01-06 03:21:48,044][model8_pretrain.py][INFO] 
Epoch:[0/2](770300/4588595) loss:2.953 lr:0.0000100 epoch_Time:24298.0min: [2024-01-06 03:21:48,044][model8_pretrain.py][INFO] Epoch:[0/2](770300/4588595) loss:2.907 lr:0.0000100 epoch_Time:24298.0min: [2024-01-06 03:21:48,044][model8_pretrain.py][INFO] Epoch:[0/2](770300/4588595) loss:3.321 lr:0.0000100 epoch_Time:24298.0min: [2024-01-06 03:21:48,044][model8_pretrain.py][INFO] Epoch:[0/2](770300/4588595) loss:2.532 lr:0.0000100 epoch_Time:24298.0min: [2024-01-06 03:21:48,044][model8_pretrain.py][INFO] Epoch:[0/2](770300/4588595) loss:3.071 lr:0.0000100 epoch_Time:24298.0min: [2024-01-06 03:22:24,989][model8_pretrain.py][INFO] Epoch:[0/2](770400/4588595) loss:3.339 lr:0.0000100 epoch_Time:24298.0min: [2024-01-06 03:22:24,989][model8_pretrain.py][INFO] Epoch:[0/2](770400/4588595) loss:2.564 lr:0.0000100 epoch_Time:24298.0min: [2024-01-06 03:22:24,989][model8_pretrain.py][INFO] Epoch:[0/2](770400/4588595) loss:3.114 lr:0.0000100 epoch_Time:24298.0min: [2024-01-06 03:22:24,990][model8_pretrain.py][INFO] Epoch:[0/2](770400/4588595) loss:2.686 lr:0.0000100 epoch_Time:24298.0min: [2024-01-06 03:22:24,990][model8_pretrain.py][INFO] Epoch:[0/2](770400/4588595) loss:2.827 lr:0.0000100 epoch_Time:24298.0min: [2024-01-06 03:22:24,990][model8_pretrain.py][INFO] Epoch:[0/2](770400/4588595) loss:2.978 lr:0.0000100 epoch_Time:24298.0min: [2024-01-06 03:22:24,990][model8_pretrain.py][INFO] Epoch:[0/2](770400/4588595) loss:2.844 lr:0.0000100 epoch_Time:24298.0min: [2024-01-06 03:22:24,990][model8_pretrain.py][INFO] Epoch:[0/2](770400/4588595) loss:3.107 lr:0.0000100 epoch_Time:24298.0min: [2024-01-06 03:23:01,920][model8_pretrain.py][INFO] Epoch:[0/2](770500/4588595) loss:2.415 lr:0.0000100 epoch_Time:24297.0min: [2024-01-06 03:23:01,920][model8_pretrain.py][INFO] Epoch:[0/2](770500/4588595) loss:3.030 lr:0.0000100 epoch_Time:24297.0min: [2024-01-06 03:23:01,920][model8_pretrain.py][INFO] Epoch:[0/2](770500/4588595) loss:2.832 lr:0.0000100 epoch_Time:24297.0min: [2024-01-06 
03:23:01,920][model8_pretrain.py][INFO] Epoch:[0/2](770500/4588595) loss:3.058 lr:0.0000100 epoch_Time:24297.0min: [2024-01-06 03:23:01,920][model8_pretrain.py][INFO] Epoch:[0/2](770500/4588595) loss:2.434 lr:0.0000100 epoch_Time:24297.0min: [2024-01-06 03:23:01,920][model8_pretrain.py][INFO] Epoch:[0/2](770500/4588595) loss:2.834 lr:0.0000100 epoch_Time:24297.0min: [2024-01-06 03:23:01,920][model8_pretrain.py][INFO] Epoch:[0/2](770500/4588595) loss:2.669 lr:0.0000100 epoch_Time:24297.0min: [2024-01-06 03:23:01,920][model8_pretrain.py][INFO] Epoch:[0/2](770500/4588595) loss:2.929 lr:0.0000100 epoch_Time:24297.0min: [2024-01-06 03:23:47,714][model8_pretrain.py][INFO] Epoch:[0/2](770600/4588595) loss:2.745 lr:0.0000100 epoch_Time:24297.0min: [2024-01-06 03:23:47,714][model8_pretrain.py][INFO] Epoch:[0/2](770600/4588595) loss:2.759 lr:0.0000100 epoch_Time:24297.0min: [2024-01-06 03:23:47,714][model8_pretrain.py][INFO] Epoch:[0/2](770600/4588595) loss:2.496 lr:0.0000100 epoch_Time:24297.0min: [2024-01-06 03:23:47,714][model8_pretrain.py][INFO] Epoch:[0/2](770600/4588595) loss:2.623 lr:0.0000100 epoch_Time:24297.0min: [2024-01-06 03:23:47,714][model8_pretrain.py][INFO] Epoch:[0/2](770600/4588595) loss:3.032 lr:0.0000100 epoch_Time:24297.0min: [2024-01-06 03:23:47,714][model8_pretrain.py][INFO] Epoch:[0/2](770600/4588595) loss:3.256 lr:0.0000100 epoch_Time:24297.0min: [2024-01-06 03:23:47,714][model8_pretrain.py][INFO] Epoch:[0/2](770600/4588595) loss:3.046 lr:0.0000100 epoch_Time:24297.0min: [2024-01-06 03:23:49,453][model8_pretrain.py][INFO] Epoch:[0/2](770600/4588595) loss:2.942 lr:0.0000100 epoch_Time:24297.0min: [2024-01-06 03:24:26,396][model8_pretrain.py][INFO] Epoch:[0/2](770700/4588595) loss:2.844 lr:0.0000100 epoch_Time:24297.0min: [2024-01-06 03:24:26,396][model8_pretrain.py][INFO] Epoch:[0/2](770700/4588595) loss:2.479 lr:0.0000100 epoch_Time:24297.0min: [2024-01-06 03:24:26,396][model8_pretrain.py][INFO] Epoch:[0/2](770700/4588595) loss:3.179 lr:0.0000100 
epoch_Time:24297.0min: [2024-01-06 03:24:26,397][model8_pretrain.py][INFO] Epoch:[0/2](770700/4588595) loss:2.725 lr:0.0000100 epoch_Time:24297.0min: [2024-01-06 03:24:26,397][model8_pretrain.py][INFO] Epoch:[0/2](770700/4588595) loss:2.703 lr:0.0000100 epoch_Time:24297.0min: [2024-01-06 03:24:26,397][model8_pretrain.py][INFO] Epoch:[0/2](770700/4588595) loss:2.420 lr:0.0000100 epoch_Time:24297.0min: [2024-01-06 03:24:26,397][model8_pretrain.py][INFO] Epoch:[0/2](770700/4588595) loss:3.253 lr:0.0000100 epoch_Time:24297.0min: [2024-01-06 03:24:26,397][model8_pretrain.py][INFO] Epoch:[0/2](770700/4588595) loss:2.238 lr:0.0000100 epoch_Time:24297.0min: [2024-01-06 03:25:03,334][model8_pretrain.py][INFO] Epoch:[0/2](770800/4588595) loss:2.960 lr:0.0000100 epoch_Time:24296.0min: [2024-01-06 03:25:03,334][model8_pretrain.py][INFO] Epoch:[0/2](770800/4588595) loss:2.599 lr:0.0000100 epoch_Time:24296.0min: [2024-01-06 03:25:03,334][model8_pretrain.py][INFO] Epoch:[0/2](770800/4588595) loss:3.008 lr:0.0000100 epoch_Time:24296.0min: [2024-01-06 03:25:03,334][model8_pretrain.py][INFO] Epoch:[0/2](770800/4588595) loss:2.706 lr:0.0000100 epoch_Time:24296.0min: [2024-01-06 03:25:03,334][model8_pretrain.py][INFO] Epoch:[0/2](770800/4588595) loss:2.675 lr:0.0000100 epoch_Time:24296.0min: [2024-01-06 03:25:03,334][model8_pretrain.py][INFO] Epoch:[0/2](770800/4588595) loss:2.175 lr:0.0000100 epoch_Time:24296.0min: [2024-01-06 03:25:03,334][model8_pretrain.py][INFO] Epoch:[0/2](770800/4588595) loss:2.949 lr:0.0000100 epoch_Time:24296.0min: [2024-01-06 03:25:03,334][model8_pretrain.py][INFO] Epoch:[0/2](770800/4588595) loss:2.760 lr:0.0000100 epoch_Time:24296.0min: [2024-01-06 03:25:40,282][model8_pretrain.py][INFO] Epoch:[0/2](770900/4588595) loss:2.539 lr:0.0000100 epoch_Time:24296.0min: [2024-01-06 03:25:40,282][model8_pretrain.py][INFO] Epoch:[0/2](770900/4588595) loss:3.088 lr:0.0000100 epoch_Time:24296.0min: [2024-01-06 03:25:40,282][model8_pretrain.py][INFO] 
Epoch:[0/2](770900/4588595) loss:3.079 lr:0.0000100 epoch_Time:24296.0min: [2024-01-06 03:25:40,282][model8_pretrain.py][INFO] Epoch:[0/2](770900/4588595) loss:3.026 lr:0.0000100 epoch_Time:24296.0min: [2024-01-06 03:25:40,282][model8_pretrain.py][INFO] Epoch:[0/2](770900/4588595) loss:2.581 lr:0.0000100 epoch_Time:24296.0min: [2024-01-06 03:25:40,282][model8_pretrain.py][INFO] Epoch:[0/2](770900/4588595) loss:3.281 lr:0.0000100 epoch_Time:24296.0min: [2024-01-06 03:25:40,282][model8_pretrain.py][INFO] Epoch:[0/2](770900/4588595) loss:2.536 lr:0.0000100 epoch_Time:24296.0min: [2024-01-06 03:25:40,282][model8_pretrain.py][INFO] Epoch:[0/2](770900/4588595) loss:2.887 lr:0.0000100 epoch_Time:24296.0min: [2024-01-06 03:26:17,226][model8_pretrain.py][INFO] Epoch:[0/2](771000/4588595) loss:2.532 lr:0.0000100 epoch_Time:24294.0min: [2024-01-06 03:26:17,226][model8_pretrain.py][INFO] Epoch:[0/2](771000/4588595) loss:3.184 lr:0.0000100 epoch_Time:24294.0min: [2024-01-06 03:26:17,226][model8_pretrain.py][INFO] Epoch:[0/2](771000/4588595) loss:2.511 lr:0.0000100 epoch_Time:24294.0min: [2024-01-06 03:26:17,226][model8_pretrain.py][INFO] Epoch:[0/2](771000/4588595) loss:2.519 lr:0.0000100 epoch_Time:24294.0min: [2024-01-06 03:26:17,226][model8_pretrain.py][INFO] Epoch:[0/2](771000/4588595) loss:2.135 lr:0.0000100 epoch_Time:24294.0min: [2024-01-06 03:26:17,226][model8_pretrain.py][INFO] Epoch:[0/2](771000/4588595) loss:2.498 lr:0.0000100 epoch_Time:24294.0min: [2024-01-06 03:26:17,226][model8_pretrain.py][INFO] Epoch:[0/2](771000/4588595) loss:2.453 lr:0.0000100 epoch_Time:24294.0min: [2024-01-06 03:26:17,226][model8_pretrain.py][INFO] Epoch:[0/2](771000/4588595) loss:2.670 lr:0.0000100 epoch_Time:24294.0min: [2024-01-06 03:26:54,162][model8_pretrain.py][INFO] Epoch:[0/2](771100/4588595) loss:2.872 lr:0.0000100 epoch_Time:24293.0min: [2024-01-06 03:26:54,162][model8_pretrain.py][INFO] Epoch:[0/2](771100/4588595) loss:2.712 lr:0.0000100 epoch_Time:24293.0min: [2024-01-06 
03:26:54,162][model8_pretrain.py][INFO] Epoch:[0/2](771100/4588595) loss:2.635 lr:0.0000100 epoch_Time:24293.0min: [2024-01-06 03:26:54,162][model8_pretrain.py][INFO] Epoch:[0/2](771100/4588595) loss:2.790 lr:0.0000100 epoch_Time:24293.0min: [2024-01-06 03:26:54,162][model8_pretrain.py][INFO] Epoch:[0/2](771100/4588595) loss:2.693 lr:0.0000100 epoch_Time:24293.0min: [2024-01-06 03:26:54,162][model8_pretrain.py][INFO] Epoch:[0/2](771100/4588595) loss:3.036 lr:0.0000100 epoch_Time:24293.0min: [2024-01-06 03:26:54,162][model8_pretrain.py][INFO] Epoch:[0/2](771100/4588595) loss:2.831 lr:0.0000100 epoch_Time:24293.0min: [2024-01-06 03:26:54,162][model8_pretrain.py][INFO] Epoch:[0/2](771100/4588595) loss:2.532 lr:0.0000100 epoch_Time:24293.0min: [2024-01-06 03:27:31,114][model8_pretrain.py][INFO] Epoch:[0/2](771200/4588595) loss:3.051 lr:0.0000100 epoch_Time:24293.0min: [2024-01-06 03:27:31,114][model8_pretrain.py][INFO] Epoch:[0/2](771200/4588595) loss:3.503 lr:0.0000100 epoch_Time:24293.0min: [2024-01-06 03:27:31,114][model8_pretrain.py][INFO] Epoch:[0/2](771200/4588595) loss:2.967 lr:0.0000100 epoch_Time:24293.0min: [2024-01-06 03:27:31,114][model8_pretrain.py][INFO] Epoch:[0/2](771200/4588595) loss:2.743 lr:0.0000100 epoch_Time:24293.0min: [2024-01-06 03:27:31,114][model8_pretrain.py][INFO] Epoch:[0/2](771200/4588595) loss:3.111 lr:0.0000100 epoch_Time:24293.0min: [2024-01-06 03:27:31,114][model8_pretrain.py][INFO] Epoch:[0/2](771200/4588595) loss:2.904 lr:0.0000100 epoch_Time:24293.0min: [2024-01-06 03:27:31,115][model8_pretrain.py][INFO] Epoch:[0/2](771200/4588595) loss:2.426 lr:0.0000100 epoch_Time:24293.0min: [2024-01-06 03:27:31,115][model8_pretrain.py][INFO] Epoch:[0/2](771200/4588595) loss:2.961 lr:0.0000100 epoch_Time:24293.0min: [2024-01-06 03:28:08,062][model8_pretrain.py][INFO] Epoch:[0/2](771300/4588595) loss:2.639 lr:0.0000100 epoch_Time:24292.0min: [2024-01-06 03:28:08,062][model8_pretrain.py][INFO] Epoch:[0/2](771300/4588595) loss:2.705 lr:0.0000100 
epoch_Time:24292.0min: [2024-01-06 03:28:08,062][model8_pretrain.py][INFO] Epoch:[0/2](771300/4588595) loss:2.325 lr:0.0000100 epoch_Time:24292.0min: [2024-01-06 03:28:08,062][model8_pretrain.py][INFO] Epoch:[0/2](771300/4588595) loss:2.730 lr:0.0000100 epoch_Time:24292.0min: [2024-01-06 03:28:08,062][model8_pretrain.py][INFO] Epoch:[0/2](771300/4588595) loss:2.925 lr:0.0000100 epoch_Time:24292.0min: [2024-01-06 03:28:08,062][model8_pretrain.py][INFO] Epoch:[0/2](771300/4588595) loss:2.521 lr:0.0000100 epoch_Time:24292.0min: [2024-01-06 03:28:08,062][model8_pretrain.py][INFO] Epoch:[0/2](771300/4588595) loss:2.375 lr:0.0000100 epoch_Time:24292.0min: [2024-01-06 03:28:08,064][model8_pretrain.py][INFO] Epoch:[0/2](771300/4588595) loss:2.955 lr:0.0000100 epoch_Time:24292.0min: [2024-01-06 03:28:53,805][model8_pretrain.py][INFO] Epoch:[0/2](771400/4588595) loss:2.401 lr:0.0000100 epoch_Time:24292.0min: [2024-01-06 03:28:53,805][model8_pretrain.py][INFO] Epoch:[0/2](771400/4588595) loss:2.737 lr:0.0000100 epoch_Time:24292.0min: [2024-01-06 03:28:53,805][model8_pretrain.py][INFO] Epoch:[0/2](771400/4588595) loss:2.685 lr:0.0000100 epoch_Time:24292.0min: [2024-01-06 03:28:53,805][model8_pretrain.py][INFO] Epoch:[0/2](771400/4588595) loss:3.078 lr:0.0000100 epoch_Time:24292.0min: [2024-01-06 03:28:53,805][model8_pretrain.py][INFO] Epoch:[0/2](771400/4588595) loss:2.777 lr:0.0000100 epoch_Time:24292.0min: [2024-01-06 03:28:53,805][model8_pretrain.py][INFO] Epoch:[0/2](771400/4588595) loss:3.128 lr:0.0000100 epoch_Time:24292.0min: [2024-01-06 03:28:53,805][model8_pretrain.py][INFO] Epoch:[0/2](771400/4588595) loss:2.156 lr:0.0000100 epoch_Time:24292.0min: [2024-01-06 03:28:53,805][model8_pretrain.py][INFO] Epoch:[0/2](771400/4588595) loss:2.852 lr:0.0000100 epoch_Time:24292.0min: [2024-01-06 03:29:32,411][model8_pretrain.py][INFO] Epoch:[0/2](771500/4588595) loss:3.270 lr:0.0000100 epoch_Time:24292.0min: [2024-01-06 03:29:32,411][model8_pretrain.py][INFO] 
Epoch:[0/2](771500/4588595) loss:2.740 lr:0.0000100 epoch_Time:24292.0min: [2024-01-06 03:29:32,411][model8_pretrain.py][INFO] Epoch:[0/2](771500/4588595) loss:2.960 lr:0.0000100 epoch_Time:24292.0min: [2024-01-06 03:29:32,411][model8_pretrain.py][INFO] Epoch:[0/2](771500/4588595) loss:3.387 lr:0.0000100 epoch_Time:24292.0min: [2024-01-06 03:29:32,411][model8_pretrain.py][INFO] Epoch:[0/2](771500/4588595) loss:3.160 lr:0.0000100 epoch_Time:24292.0min: [2024-01-06 03:29:32,411][model8_pretrain.py][INFO] Epoch:[0/2](771500/4588595) loss:2.836 lr:0.0000100 epoch_Time:24292.0min: [2024-01-06 03:29:32,411][model8_pretrain.py][INFO] Epoch:[0/2](771500/4588595) loss:2.705 lr:0.0000100 epoch_Time:24292.0min: [2024-01-06 03:29:32,411][model8_pretrain.py][INFO] Epoch:[0/2](771500/4588595) loss:2.700 lr:0.0000100 epoch_Time:24292.0min: [2024-01-06 03:30:09,353][model8_pretrain.py][INFO] Epoch:[0/2](771600/4588595) loss:2.855 lr:0.0000100 epoch_Time:24291.0min: [2024-01-06 03:30:09,353][model8_pretrain.py][INFO] Epoch:[0/2](771600/4588595) loss:2.538 lr:0.0000100 epoch_Time:24291.0min: [2024-01-06 03:30:09,353][model8_pretrain.py][INFO] Epoch:[0/2](771600/4588595) loss:2.811 lr:0.0000100 epoch_Time:24291.0min: [2024-01-06 03:30:09,353][model8_pretrain.py][INFO] Epoch:[0/2](771600/4588595) loss:2.905 lr:0.0000100 epoch_Time:24291.0min: [2024-01-06 03:30:09,353][model8_pretrain.py][INFO] Epoch:[0/2](771600/4588595) loss:2.837 lr:0.0000100 epoch_Time:24291.0min: [2024-01-06 03:30:09,353][model8_pretrain.py][INFO] Epoch:[0/2](771600/4588595) loss:3.374 lr:0.0000100 epoch_Time:24291.0min: [2024-01-06 03:30:09,354][model8_pretrain.py][INFO] Epoch:[0/2](771600/4588595) loss:2.690 lr:0.0000100 epoch_Time:24291.0min: [2024-01-06 03:30:09,354][model8_pretrain.py][INFO] Epoch:[0/2](771600/4588595) loss:2.789 lr:0.0000100 epoch_Time:24291.0min: [2024-01-06 03:30:46,301][model8_pretrain.py][INFO] Epoch:[0/2](771700/4588595) loss:3.078 lr:0.0000100 epoch_Time:24291.0min: [2024-01-06 
03:30:46,301][model8_pretrain.py][INFO] Epoch:[0/2](771700/4588595) loss:3.278 lr:0.0000100 epoch_Time:24291.0min: [2024-01-06 03:30:46,301][model8_pretrain.py][INFO] Epoch:[0/2](771700/4588595) loss:2.810 lr:0.0000100 epoch_Time:24291.0min: [2024-01-06 03:30:46,301][model8_pretrain.py][INFO] Epoch:[0/2](771700/4588595) loss:2.887 lr:0.0000100 epoch_Time:24291.0min: [2024-01-06 03:30:46,301][model8_pretrain.py][INFO] Epoch:[0/2](771700/4588595) loss:2.548 lr:0.0000100 epoch_Time:24291.0min: [2024-01-06 03:30:46,301][model8_pretrain.py][INFO] Epoch:[0/2](771700/4588595) loss:2.186 lr:0.0000100 epoch_Time:24291.0min: [2024-01-06 03:30:46,301][model8_pretrain.py][INFO] Epoch:[0/2](771700/4588595) loss:3.027 lr:0.0000100 epoch_Time:24291.0min: [2024-01-06 03:30:46,301][model8_pretrain.py][INFO] Epoch:[0/2](771700/4588595) loss:3.132 lr:0.0000100 epoch_Time:24291.0min: [2024-01-06 03:31:23,250][model8_pretrain.py][INFO] Epoch:[0/2](771800/4588595) loss:2.623 lr:0.0000100 epoch_Time:24289.0min: [2024-01-06 03:31:23,250][model8_pretrain.py][INFO] Epoch:[0/2](771800/4588595) loss:2.295 lr:0.0000100 epoch_Time:24289.0min: [2024-01-06 03:31:23,250][model8_pretrain.py][INFO] Epoch:[0/2](771800/4588595) loss:2.760 lr:0.0000100 epoch_Time:24289.0min: [2024-01-06 03:31:23,250][model8_pretrain.py][INFO] Epoch:[0/2](771800/4588595) loss:2.940 lr:0.0000100 epoch_Time:24289.0min: [2024-01-06 03:31:23,250][model8_pretrain.py][INFO] Epoch:[0/2](771800/4588595) loss:2.799 lr:0.0000100 epoch_Time:24289.0min: [2024-01-06 03:31:23,250][model8_pretrain.py][INFO] Epoch:[0/2](771800/4588595) loss:3.071 lr:0.0000100 epoch_Time:24289.0min: [2024-01-06 03:31:23,250][model8_pretrain.py][INFO] Epoch:[0/2](771800/4588595) loss:2.708 lr:0.0000100 epoch_Time:24289.0min: [2024-01-06 03:31:23,250][model8_pretrain.py][INFO] Epoch:[0/2](771800/4588595) loss:2.567 lr:0.0000100 epoch_Time:24289.0min: [2024-01-06 03:32:00,193][model8_pretrain.py][INFO] Epoch:[0/2](771900/4588595) loss:2.692 lr:0.0000100 
epoch_Time:24288.0min: [2024-01-06 03:32:00,193][model8_pretrain.py][INFO] Epoch:[0/2](771900/4588595) loss:2.916 lr:0.0000100 epoch_Time:24288.0min: [2024-01-06 03:32:00,193][model8_pretrain.py][INFO] Epoch:[0/2](771900/4588595) loss:2.715 lr:0.0000100 epoch_Time:24288.0min: [2024-01-06 03:32:00,193][model8_pretrain.py][INFO] Epoch:[0/2](771900/4588595) loss:2.754 lr:0.0000100 epoch_Time:24288.0min: [2024-01-06 03:32:00,193][model8_pretrain.py][INFO] Epoch:[0/2](771900/4588595) loss:1.948 lr:0.0000100 epoch_Time:24288.0min: [2024-01-06 03:32:00,193][model8_pretrain.py][INFO] Epoch:[0/2](771900/4588595) loss:2.957 lr:0.0000100 epoch_Time:24288.0min: [2024-01-06 03:32:00,193][model8_pretrain.py][INFO] Epoch:[0/2](771900/4588595) loss:2.889 lr:0.0000100 epoch_Time:24288.0min: [2024-01-06 03:32:00,194][model8_pretrain.py][INFO] Epoch:[0/2](771900/4588595) loss:3.074 lr:0.0000100 epoch_Time:24288.0min: [2024-01-06 03:32:37,137][model8_pretrain.py][INFO] Epoch:[0/2](772000/4588595) loss:2.654 lr:0.0000100 epoch_Time:24288.0min: [2024-01-06 03:32:37,137][model8_pretrain.py][INFO] Epoch:[0/2](772000/4588595) loss:2.699 lr:0.0000100 epoch_Time:24288.0min: [2024-01-06 03:32:37,137][model8_pretrain.py][INFO] Epoch:[0/2](772000/4588595) loss:2.584 lr:0.0000100 epoch_Time:24288.0min: [2024-01-06 03:32:37,137][model8_pretrain.py][INFO] Epoch:[0/2](772000/4588595) loss:2.531 lr:0.0000100 epoch_Time:24288.0min: [2024-01-06 03:32:37,137][model8_pretrain.py][INFO] Epoch:[0/2](772000/4588595) loss:2.864 lr:0.0000100 epoch_Time:24288.0min: [2024-01-06 03:32:37,137][model8_pretrain.py][INFO] Epoch:[0/2](772000/4588595) loss:2.838 lr:0.0000100 epoch_Time:24288.0min: [2024-01-06 03:32:37,138][model8_pretrain.py][INFO] Epoch:[0/2](772000/4588595) loss:3.262 lr:0.0000100 epoch_Time:24288.0min: [2024-01-06 03:32:37,138][model8_pretrain.py][INFO] Epoch:[0/2](772000/4588595) loss:3.055 lr:0.0000100 epoch_Time:24288.0min: [2024-01-06 03:33:14,084][model8_pretrain.py][INFO] 
Epoch:[0/2](772100/4588595) loss:2.592 lr:0.0000100 epoch_Time:24287.0min: [2024-01-06 03:33:14,084][model8_pretrain.py][INFO] Epoch:[0/2](772100/4588595) loss:3.533 lr:0.0000100 epoch_Time:24287.0min: [2024-01-06 03:33:14,084][model8_pretrain.py][INFO] Epoch:[0/2](772100/4588595) loss:3.210 lr:0.0000100 epoch_Time:24287.0min: [2024-01-06 03:33:14,084][model8_pretrain.py][INFO] Epoch:[0/2](772100/4588595) loss:2.621 lr:0.0000100 epoch_Time:24287.0min: [2024-01-06 03:33:14,084][model8_pretrain.py][INFO] Epoch:[0/2](772100/4588595) loss:3.095 lr:0.0000100 epoch_Time:24287.0min: [2024-01-06 03:33:14,084][model8_pretrain.py][INFO] Epoch:[0/2](772100/4588595) loss:2.317 lr:0.0000100 epoch_Time:24287.0min: [2024-01-06 03:33:14,084][model8_pretrain.py][INFO] Epoch:[0/2](772100/4588595) loss:2.833 lr:0.0000100 epoch_Time:24287.0min: [2024-01-06 03:33:14,085][model8_pretrain.py][INFO] Epoch:[0/2](772100/4588595) loss:2.443 lr:0.0000100 epoch_Time:24287.0min: [2024-01-06 03:33:59,822][model8_pretrain.py][INFO] Epoch:[0/2](772200/4588595) loss:1.712 lr:0.0000100 epoch_Time:24287.0min: [2024-01-06 03:33:59,822][model8_pretrain.py][INFO] Epoch:[0/2](772200/4588595) loss:2.901 lr:0.0000100 epoch_Time:24287.0min: [2024-01-06 03:33:59,822][model8_pretrain.py][INFO] Epoch:[0/2](772200/4588595) loss:2.392 lr:0.0000100 epoch_Time:24287.0min: [2024-01-06 03:33:59,822][model8_pretrain.py][INFO] Epoch:[0/2](772200/4588595) loss:2.731 lr:0.0000100 epoch_Time:24287.0min: [2024-01-06 03:33:59,822][model8_pretrain.py][INFO] Epoch:[0/2](772200/4588595) loss:2.867 lr:0.0000100 epoch_Time:24287.0min: [2024-01-06 03:33:59,822][model8_pretrain.py][INFO] Epoch:[0/2](772200/4588595) loss:2.652 lr:0.0000100 epoch_Time:24287.0min: [2024-01-06 03:33:59,822][model8_pretrain.py][INFO] Epoch:[0/2](772200/4588595) loss:3.090 lr:0.0000100 epoch_Time:24287.0min: [2024-01-06 03:33:59,822][model8_pretrain.py][INFO] Epoch:[0/2](772200/4588595) loss:2.963 lr:0.0000100 epoch_Time:24287.0min: [2024-01-06 
03:34:38,444][model8_pretrain.py][INFO] Epoch:[0/2](772300/4588595) loss:2.336 lr:0.0000100 epoch_Time:24287.0min: [2024-01-06 03:34:38,444][model8_pretrain.py][INFO] Epoch:[0/2](772300/4588595) loss:2.480 lr:0.0000100 epoch_Time:24287.0min: [2024-01-06 03:34:38,444][model8_pretrain.py][INFO] Epoch:[0/2](772300/4588595) loss:3.070 lr:0.0000100 epoch_Time:24287.0min: [2024-01-06 03:34:38,444][model8_pretrain.py][INFO] Epoch:[0/2](772300/4588595) loss:3.211 lr:0.0000100 epoch_Time:24287.0min: [2024-01-06 03:34:38,444][model8_pretrain.py][INFO] Epoch:[0/2](772300/4588595) loss:2.407 lr:0.0000100 epoch_Time:24287.0min: [2024-01-06 03:34:38,444][model8_pretrain.py][INFO] Epoch:[0/2](772300/4588595) loss:2.695 lr:0.0000100 epoch_Time:24287.0min: [2024-01-06 03:34:38,444][model8_pretrain.py][INFO] Epoch:[0/2](772300/4588595) loss:3.063 lr:0.0000100 epoch_Time:24287.0min: [2024-01-06 03:34:38,444][model8_pretrain.py][INFO] Epoch:[0/2](772300/4588595) loss:2.753 lr:0.0000100 epoch_Time:24287.0min: [2024-01-06 03:35:15,389][model8_pretrain.py][INFO] Epoch:[0/2](772400/4588595) loss:2.843 lr:0.0000100 epoch_Time:24286.0min: [2024-01-06 03:35:15,389][model8_pretrain.py][INFO] Epoch:[0/2](772400/4588595) loss:2.904 lr:0.0000100 epoch_Time:24286.0min: [2024-01-06 03:35:15,389][model8_pretrain.py][INFO] Epoch:[0/2](772400/4588595) loss:3.001 lr:0.0000100 epoch_Time:24286.0min: [2024-01-06 03:35:15,389][model8_pretrain.py][INFO] Epoch:[0/2](772400/4588595) loss:2.369 lr:0.0000100 epoch_Time:24286.0min: [2024-01-06 03:35:15,389][model8_pretrain.py][INFO] Epoch:[0/2](772400/4588595) loss:3.210 lr:0.0000100 epoch_Time:24286.0min: [2024-01-06 03:35:15,389][model8_pretrain.py][INFO] Epoch:[0/2](772400/4588595) loss:2.610 lr:0.0000100 epoch_Time:24286.0min: [2024-01-06 03:35:15,389][model8_pretrain.py][INFO] Epoch:[0/2](772400/4588595) loss:2.592 lr:0.0000100 epoch_Time:24286.0min: [2024-01-06 03:35:15,390][model8_pretrain.py][INFO] Epoch:[0/2](772400/4588595) loss:3.063 lr:0.0000100 
epoch_Time:24286.0min: [2024-01-06 03:35:52,343][model8_pretrain.py][INFO] Epoch:[0/2](772500/4588595) loss:2.819 lr:0.0000100 epoch_Time:24285.0min: [2024-01-06 03:35:52,343][model8_pretrain.py][INFO] Epoch:[0/2](772500/4588595) loss:2.833 lr:0.0000100 epoch_Time:24285.0min: [2024-01-06 03:35:52,343][model8_pretrain.py][INFO] Epoch:[0/2](772500/4588595) loss:2.931 lr:0.0000100 epoch_Time:24285.0min: [2024-01-06 03:35:52,343][model8_pretrain.py][INFO] Epoch:[0/2](772500/4588595) loss:2.742 lr:0.0000100 epoch_Time:24285.0min: [2024-01-06 03:35:52,343][model8_pretrain.py][INFO] Epoch:[0/2](772500/4588595) loss:2.931 lr:0.0000100 epoch_Time:24285.0min: [2024-01-06 03:35:52,343][model8_pretrain.py][INFO] Epoch:[0/2](772500/4588595) loss:2.400 lr:0.0000100 epoch_Time:24285.0min: [2024-01-06 03:35:52,343][model8_pretrain.py][INFO] Epoch:[0/2](772500/4588595) loss:2.986 lr:0.0000100 epoch_Time:24285.0min: [2024-01-06 03:35:52,343][model8_pretrain.py][INFO] Epoch:[0/2](772500/4588595) loss:2.632 lr:0.0000100 epoch_Time:24285.0min: [2024-01-06 03:36:29,295][model8_pretrain.py][INFO] Epoch:[0/2](772600/4588595) loss:2.454 lr:0.0000100 epoch_Time:24285.0min: [2024-01-06 03:36:29,295][model8_pretrain.py][INFO] Epoch:[0/2](772600/4588595) loss:2.817 lr:0.0000100 epoch_Time:24284.0min: [2024-01-06 03:36:29,295][model8_pretrain.py][INFO] Epoch:[0/2](772600/4588595) loss:3.346 lr:0.0000100 epoch_Time:24284.0min: [2024-01-06 03:36:29,295][model8_pretrain.py][INFO] Epoch:[0/2](772600/4588595) loss:3.285 lr:0.0000100 epoch_Time:24284.0min: [2024-01-06 03:36:29,295][model8_pretrain.py][INFO] Epoch:[0/2](772600/4588595) loss:3.066 lr:0.0000100 epoch_Time:24284.0min: [2024-01-06 03:36:29,295][model8_pretrain.py][INFO] Epoch:[0/2](772600/4588595) loss:2.773 lr:0.0000100 epoch_Time:24285.0min: [2024-01-06 03:36:29,295][model8_pretrain.py][INFO] Epoch:[0/2](772600/4588595) loss:3.001 lr:0.0000100 epoch_Time:24284.0min: [2024-01-06 03:36:29,295][model8_pretrain.py][INFO] 
Epoch:[0/2](772600/4588595) loss:2.949 lr:0.0000100 epoch_Time:24284.0min: [2024-01-06 03:37:06,241][model8_pretrain.py][INFO] Epoch:[0/2](772700/4588595) loss:2.809 lr:0.0000100 epoch_Time:24283.0min: [2024-01-06 03:37:06,241][model8_pretrain.py][INFO] Epoch:[0/2](772700/4588595) loss:2.419 lr:0.0000100 epoch_Time:24283.0min: [2024-01-06 03:37:06,241][model8_pretrain.py][INFO] Epoch:[0/2](772700/4588595) loss:2.830 lr:0.0000100 epoch_Time:24283.0min: [2024-01-06 03:37:06,241][model8_pretrain.py][INFO] Epoch:[0/2](772700/4588595) loss:2.728 lr:0.0000100 epoch_Time:24283.0min: [2024-01-06 03:37:06,241][model8_pretrain.py][INFO] Epoch:[0/2](772700/4588595) loss:2.639 lr:0.0000100 epoch_Time:24283.0min: [2024-01-06 03:37:06,241][model8_pretrain.py][INFO] Epoch:[0/2](772700/4588595) loss:3.186 lr:0.0000100 epoch_Time:24283.0min: [2024-01-06 03:37:06,241][model8_pretrain.py][INFO] Epoch:[0/2](772700/4588595) loss:2.187 lr:0.0000100 epoch_Time:24283.0min: [2024-01-06 03:37:06,241][model8_pretrain.py][INFO] Epoch:[0/2](772700/4588595) loss:3.080 lr:0.0000100 epoch_Time:24283.0min: [2024-01-06 03:37:43,197][model8_pretrain.py][INFO] Epoch:[0/2](772800/4588595) loss:2.348 lr:0.0000100 epoch_Time:24283.0min: [2024-01-06 03:37:43,197][model8_pretrain.py][INFO] Epoch:[0/2](772800/4588595) loss:3.156 lr:0.0000100 epoch_Time:24283.0min: [2024-01-06 03:37:43,197][model8_pretrain.py][INFO] Epoch:[0/2](772800/4588595) loss:2.231 lr:0.0000100 epoch_Time:24283.0min: [2024-01-06 03:37:43,197][model8_pretrain.py][INFO] Epoch:[0/2](772800/4588595) loss:2.451 lr:0.0000100 epoch_Time:24283.0min: [2024-01-06 03:37:43,197][model8_pretrain.py][INFO] Epoch:[0/2](772800/4588595) loss:2.462 lr:0.0000100 epoch_Time:24283.0min: [2024-01-06 03:37:43,197][model8_pretrain.py][INFO] Epoch:[0/2](772800/4588595) loss:2.591 lr:0.0000100 epoch_Time:24283.0min: [2024-01-06 03:37:43,197][model8_pretrain.py][INFO] Epoch:[0/2](772800/4588595) loss:2.808 lr:0.0000100 epoch_Time:24283.0min: [2024-01-06 
03:37:43,197][model8_pretrain.py][INFO] Epoch:[0/2](772800/4588595) loss:2.778 lr:0.0000100 epoch_Time:24283.0min: [2024-01-06 03:38:20,148][model8_pretrain.py][INFO] Epoch:[0/2](772900/4588595) loss:2.706 lr:0.0000100 epoch_Time:24282.0min: [2024-01-06 03:38:20,149][model8_pretrain.py][INFO] Epoch:[0/2](772900/4588595) loss:2.558 lr:0.0000100 epoch_Time:24282.0min: [2024-01-06 03:38:20,149][model8_pretrain.py][INFO] Epoch:[0/2](772900/4588595) loss:2.504 lr:0.0000100 epoch_Time:24282.0min: [2024-01-06 03:38:20,149][model8_pretrain.py][INFO] Epoch:[0/2](772900/4588595) loss:2.967 lr:0.0000100 epoch_Time:24282.0min: [2024-01-06 03:38:20,149][model8_pretrain.py][INFO] Epoch:[0/2](772900/4588595) loss:2.826 lr:0.0000100 epoch_Time:24282.0min: [2024-01-06 03:38:20,149][model8_pretrain.py][INFO] Epoch:[0/2](772900/4588595) loss:2.290 lr:0.0000100 epoch_Time:24282.0min: [2024-01-06 03:38:20,149][model8_pretrain.py][INFO] Epoch:[0/2](772900/4588595) loss:2.519 lr:0.0000100 epoch_Time:24282.0min: [2024-01-06 03:38:20,150][model8_pretrain.py][INFO] Epoch:[0/2](772900/4588595) loss:2.588 lr:0.0000100 epoch_Time:24282.0min: [2024-01-06 03:39:04,020][model8_pretrain.py][INFO] Epoch:[0/2](773000/4588595) loss:2.815 lr:0.0000100 epoch_Time:24282.0min: [2024-01-06 03:39:04,020][model8_pretrain.py][INFO] Epoch:[0/2](773000/4588595) loss:2.454 lr:0.0000100 epoch_Time:24282.0min: [2024-01-06 03:39:04,020][model8_pretrain.py][INFO] Epoch:[0/2](773000/4588595) loss:2.897 lr:0.0000100 epoch_Time:24282.0min: [2024-01-06 03:39:04,020][model8_pretrain.py][INFO] Epoch:[0/2](773000/4588595) loss:2.072 lr:0.0000100 epoch_Time:24282.0min: [2024-01-06 03:39:04,020][model8_pretrain.py][INFO] Epoch:[0/2](773000/4588595) loss:2.926 lr:0.0000100 epoch_Time:24282.0min: [2024-01-06 03:39:04,020][model8_pretrain.py][INFO] Epoch:[0/2](773000/4588595) loss:1.837 lr:0.0000100 epoch_Time:24282.0min: [2024-01-06 03:39:04,020][model8_pretrain.py][INFO] Epoch:[0/2](773000/4588595) loss:2.919 lr:0.0000100 
epoch_Time:24282.0min: [2024-01-06 03:39:04,020][model8_pretrain.py][INFO] Epoch:[0/2](773000/4588595) loss:2.765 lr:0.0000100 epoch_Time:24282.0min: [2024-01-06 03:39:44,435][model8_pretrain.py][INFO] Epoch:[0/2](773100/4588595) loss:2.748 lr:0.0000100 epoch_Time:24282.0min: [2024-01-06 03:39:44,435][model8_pretrain.py][INFO] Epoch:[0/2](773100/4588595) loss:2.873 lr:0.0000100 epoch_Time:24282.0min: [2024-01-06 03:39:44,435][model8_pretrain.py][INFO] Epoch:[0/2](773100/4588595) loss:2.720 lr:0.0000100 epoch_Time:24282.0min: [2024-01-06 03:39:44,435][model8_pretrain.py][INFO] Epoch:[0/2](773100/4588595) loss:2.709 lr:0.0000100 epoch_Time:24282.0min: [2024-01-06 03:39:44,435][model8_pretrain.py][INFO] Epoch:[0/2](773100/4588595) loss:2.959 lr:0.0000100 epoch_Time:24282.0min: [2024-01-06 03:39:44,435][model8_pretrain.py][INFO] Epoch:[0/2](773100/4588595) loss:3.152 lr:0.0000100 epoch_Time:24282.0min: [2024-01-06 03:39:44,435][model8_pretrain.py][INFO] Epoch:[0/2](773100/4588595) loss:2.831 lr:0.0000100 epoch_Time:24282.0min: [2024-01-06 03:39:44,435][model8_pretrain.py][INFO] Epoch:[0/2](773100/4588595) loss:2.861 lr:0.0000100 epoch_Time:24282.0min: [2024-01-06 03:40:21,320][model8_pretrain.py][INFO] Epoch:[0/2](773200/4588595) loss:2.790 lr:0.0000100 epoch_Time:24281.0min: [2024-01-06 03:40:21,320][model8_pretrain.py][INFO] Epoch:[0/2](773200/4588595) loss:2.324 lr:0.0000100 epoch_Time:24281.0min: [2024-01-06 03:40:21,320][model8_pretrain.py][INFO] Epoch:[0/2](773200/4588595) loss:3.369 lr:0.0000100 epoch_Time:24281.0min: [2024-01-06 03:40:21,320][model8_pretrain.py][INFO] Epoch:[0/2](773200/4588595) loss:2.672 lr:0.0000100 epoch_Time:24281.0min: [2024-01-06 03:40:21,320][model8_pretrain.py][INFO] Epoch:[0/2](773200/4588595) loss:2.953 lr:0.0000100 epoch_Time:24281.0min: [2024-01-06 03:40:21,321][model8_pretrain.py][INFO] Epoch:[0/2](773200/4588595) loss:3.058 lr:0.0000100 epoch_Time:24281.0min: [2024-01-06 03:40:21,321][model8_pretrain.py][INFO] 
Epoch:[0/2](773200/4588595) loss:2.637 lr:0.0000100 epoch_Time:24281.0min: [2024-01-06 03:40:21,321][model8_pretrain.py][INFO] Epoch:[0/2](773200/4588595) loss:2.977 lr:0.0000100 epoch_Time:24281.0min: [2024-01-06 03:40:58,259][model8_pretrain.py][INFO] Epoch:[0/2](773300/4588595) loss:3.164 lr:0.0000100 epoch_Time:24280.0min: [2024-01-06 03:40:58,259][model8_pretrain.py][INFO] Epoch:[0/2](773300/4588595) loss:2.919 lr:0.0000100 epoch_Time:24280.0min: [2024-01-06 03:40:58,259][model8_pretrain.py][INFO] Epoch:[0/2](773300/4588595) loss:2.994 lr:0.0000100 epoch_Time:24280.0min: [2024-01-06 03:40:58,259][model8_pretrain.py][INFO] Epoch:[0/2](773300/4588595) loss:2.925 lr:0.0000100 epoch_Time:24280.0min: [2024-01-06 03:40:58,259][model8_pretrain.py][INFO] Epoch:[0/2](773300/4588595) loss:2.351 lr:0.0000100 epoch_Time:24280.0min: [2024-01-06 03:40:58,259][model8_pretrain.py][INFO] Epoch:[0/2](773300/4588595) loss:2.484 lr:0.0000100 epoch_Time:24280.0min: [2024-01-06 03:40:58,259][model8_pretrain.py][INFO] Epoch:[0/2](773300/4588595) loss:3.462 lr:0.0000100 epoch_Time:24280.0min: [2024-01-06 03:40:58,260][model8_pretrain.py][INFO] Epoch:[0/2](773300/4588595) loss:2.514 lr:0.0000100 epoch_Time:24280.0min: [2024-01-06 03:41:35,195][model8_pretrain.py][INFO] Epoch:[0/2](773400/4588595) loss:2.754 lr:0.0000100 epoch_Time:24280.0min: [2024-01-06 03:41:35,195][model8_pretrain.py][INFO] Epoch:[0/2](773400/4588595) loss:3.117 lr:0.0000100 epoch_Time:24280.0min: [2024-01-06 03:41:35,195][model8_pretrain.py][INFO] Epoch:[0/2](773400/4588595) loss:2.926 lr:0.0000100 epoch_Time:24280.0min: [2024-01-06 03:41:35,195][model8_pretrain.py][INFO] Epoch:[0/2](773400/4588595) loss:2.858 lr:0.0000100 epoch_Time:24280.0min: [2024-01-06 03:41:35,195][model8_pretrain.py][INFO] Epoch:[0/2](773400/4588595) loss:3.111 lr:0.0000100 epoch_Time:24280.0min: [2024-01-06 03:41:35,195][model8_pretrain.py][INFO] Epoch:[0/2](773400/4588595) loss:2.729 lr:0.0000100 epoch_Time:24280.0min: [2024-01-06 
03:41:35,195][model8_pretrain.py][INFO] Epoch:[0/2](773400/4588595) loss:2.739 lr:0.0000100 epoch_Time:24280.0min: [2024-01-06 03:41:35,195][model8_pretrain.py][INFO] Epoch:[0/2](773400/4588595) loss:3.478 lr:0.0000100 epoch_Time:24280.0min: [2024-01-06 03:42:12,143][model8_pretrain.py][INFO] Epoch:[0/2](773500/4588595) loss:2.919 lr:0.0000100 epoch_Time:24278.0min: [2024-01-06 03:42:12,143][model8_pretrain.py][INFO] Epoch:[0/2](773500/4588595) loss:3.043 lr:0.0000100 epoch_Time:24278.0min: [2024-01-06 03:42:12,143][model8_pretrain.py][INFO] Epoch:[0/2](773500/4588595) loss:2.968 lr:0.0000100 epoch_Time:24278.0min: [2024-01-06 03:42:12,143][model8_pretrain.py][INFO] Epoch:[0/2](773500/4588595) loss:2.697 lr:0.0000100 epoch_Time:24278.0min: [2024-01-06 03:42:12,143][model8_pretrain.py][INFO] Epoch:[0/2](773500/4588595) loss:2.775 lr:0.0000100 epoch_Time:24278.0min: [2024-01-06 03:42:12,143][model8_pretrain.py][INFO] Epoch:[0/2](773500/4588595) loss:2.696 lr:0.0000100 epoch_Time:24278.0min: [2024-01-06 03:42:12,143][model8_pretrain.py][INFO] Epoch:[0/2](773500/4588595) loss:3.024 lr:0.0000100 epoch_Time:24278.0min: [2024-01-06 03:42:12,143][model8_pretrain.py][INFO] Epoch:[0/2](773500/4588595) loss:3.174 lr:0.0000100 epoch_Time:24278.0min: [2024-01-06 03:42:49,092][model8_pretrain.py][INFO] Epoch:[0/2](773600/4588595) loss:2.251 lr:0.0000100 epoch_Time:24277.0min: [2024-01-06 03:42:49,092][model8_pretrain.py][INFO] Epoch:[0/2](773600/4588595) loss:2.985 lr:0.0000100 epoch_Time:24277.0min: [2024-01-06 03:42:49,092][model8_pretrain.py][INFO] Epoch:[0/2](773600/4588595) loss:3.139 lr:0.0000100 epoch_Time:24277.0min: [2024-01-06 03:42:49,092][model8_pretrain.py][INFO] Epoch:[0/2](773600/4588595) loss:2.593 lr:0.0000100 epoch_Time:24277.0min: [2024-01-06 03:42:49,092][model8_pretrain.py][INFO] Epoch:[0/2](773600/4588595) loss:2.604 lr:0.0000100 epoch_Time:24277.0min: [2024-01-06 03:42:49,092][model8_pretrain.py][INFO] Epoch:[0/2](773600/4588595) loss:2.622 lr:0.0000100 
epoch_Time:24277.0min: [2024-01-06 03:42:49,092][model8_pretrain.py][INFO] Epoch:[0/2](773600/4588595) loss:2.996 lr:0.0000100 epoch_Time:24277.0min: [2024-01-06 03:42:49,092][model8_pretrain.py][INFO] Epoch:[0/2](773600/4588595) loss:3.198 lr:0.0000100 epoch_Time:24277.0min: [2024-01-06 03:43:26,035][model8_pretrain.py][INFO] Epoch:[0/2](773700/4588595) loss:3.108 lr:0.0000100 epoch_Time:24277.0min: [2024-01-06 03:43:26,035][model8_pretrain.py][INFO] Epoch:[0/2](773700/4588595) loss:2.689 lr:0.0000100 epoch_Time:24277.0min: [2024-01-06 03:43:26,035][model8_pretrain.py][INFO] Epoch:[0/2](773700/4588595) loss:2.307 lr:0.0000100 epoch_Time:24277.0min: [2024-01-06 03:43:26,035][model8_pretrain.py][INFO] Epoch:[0/2](773700/4588595) loss:2.497 lr:0.0000100 epoch_Time:24277.0min: [2024-01-06 03:43:26,035][model8_pretrain.py][INFO] Epoch:[0/2](773700/4588595) loss:2.630 lr:0.0000100 epoch_Time:24277.0min: [2024-01-06 03:43:26,035][model8_pretrain.py][INFO] Epoch:[0/2](773700/4588595) loss:2.245 lr:0.0000100 epoch_Time:24277.0min: [2024-01-06 03:43:26,035][model8_pretrain.py][INFO] Epoch:[0/2](773700/4588595) loss:3.276 lr:0.0000100 epoch_Time:24277.0min: [2024-01-06 03:43:26,035][model8_pretrain.py][INFO] Epoch:[0/2](773700/4588595) loss:2.589 lr:0.0000100 epoch_Time:24277.0min: [2024-01-06 03:44:10,078][model8_pretrain.py][INFO] Epoch:[0/2](773800/4588595) loss:2.674 lr:0.0000100 epoch_Time:24277.0min: [2024-01-06 03:44:10,078][model8_pretrain.py][INFO] Epoch:[0/2](773800/4588595) loss:2.602 lr:0.0000100 epoch_Time:24277.0min: [2024-01-06 03:44:10,078][model8_pretrain.py][INFO] Epoch:[0/2](773800/4588595) loss:2.893 lr:0.0000100 epoch_Time:24277.0min: [2024-01-06 03:44:10,078][model8_pretrain.py][INFO] Epoch:[0/2](773800/4588595) loss:3.441 lr:0.0000100 epoch_Time:24277.0min: [2024-01-06 03:44:10,078][model8_pretrain.py][INFO] Epoch:[0/2](773800/4588595) loss:3.026 lr:0.0000100 epoch_Time:24277.0min: [2024-01-06 03:44:10,078][model8_pretrain.py][INFO] 
Epoch:[0/2](773800/4588595) loss:2.483 lr:0.0000100 epoch_Time:24277.0min: [2024-01-06 03:44:10,078][model8_pretrain.py][INFO] Epoch:[0/2](773800/4588595) loss:3.068 lr:0.0000100 epoch_Time:24277.0min: [2024-01-06 03:44:10,078][model8_pretrain.py][INFO] Epoch:[0/2](773800/4588595) loss:3.010 lr:0.0000100 epoch_Time:24277.0min: [2024-01-06 03:44:50,518][model8_pretrain.py][INFO] Epoch:[0/2](773900/4588595) loss:3.355 lr:0.0000100 epoch_Time:24276.0min: [2024-01-06 03:44:50,518][model8_pretrain.py][INFO] Epoch:[0/2](773900/4588595) loss:2.733 lr:0.0000100 epoch_Time:24276.0min: [2024-01-06 03:44:50,518][model8_pretrain.py][INFO] Epoch:[0/2](773900/4588595) loss:1.932 lr:0.0000100 epoch_Time:24276.0min: [2024-01-06 03:44:50,518][model8_pretrain.py][INFO] Epoch:[0/2](773900/4588595) loss:2.981 lr:0.0000100 epoch_Time:24276.0min: [2024-01-06 03:44:50,518][model8_pretrain.py][INFO] Epoch:[0/2](773900/4588595) loss:2.918 lr:0.0000100 epoch_Time:24276.0min: [2024-01-06 03:44:50,518][model8_pretrain.py][INFO] Epoch:[0/2](773900/4588595) loss:2.769 lr:0.0000100 epoch_Time:24276.0min: [2024-01-06 03:44:50,518][model8_pretrain.py][INFO] Epoch:[0/2](773900/4588595) loss:2.799 lr:0.0000100 epoch_Time:24276.0min: [2024-01-06 03:44:50,518][model8_pretrain.py][INFO] Epoch:[0/2](773900/4588595) loss:2.823 lr:0.0000100 epoch_Time:24276.0min: [2024-01-06 03:45:27,447][model8_pretrain.py][INFO] Epoch:[0/2](774000/4588595) loss:2.916 lr:0.0000100 epoch_Time:24276.0min: [2024-01-06 03:45:27,447][model8_pretrain.py][INFO] Epoch:[0/2](774000/4588595) loss:2.612 lr:0.0000100 epoch_Time:24276.0min: [2024-01-06 03:45:27,447][model8_pretrain.py][INFO] Epoch:[0/2](774000/4588595) loss:2.469 lr:0.0000100 epoch_Time:24276.0min: [2024-01-06 03:45:27,447][model8_pretrain.py][INFO] Epoch:[0/2](774000/4588595) loss:2.906 lr:0.0000100 epoch_Time:24276.0min: [2024-01-06 03:45:27,447][model8_pretrain.py][INFO] Epoch:[0/2](774000/4588595) loss:2.533 lr:0.0000100 epoch_Time:24276.0min: [2024-01-06 
03:45:27,447][model8_pretrain.py][INFO] Epoch:[0/2](774000/4588595) loss:2.844 lr:0.0000100 epoch_Time:24276.0min: [2024-01-06 03:45:27,447][model8_pretrain.py][INFO] Epoch:[0/2](774000/4588595) loss:2.596 lr:0.0000100 epoch_Time:24276.0min: [2024-01-06 03:45:27,448][model8_pretrain.py][INFO] Epoch:[0/2](774000/4588595) loss:2.845 lr:0.0000100 epoch_Time:24276.0min: [2024-01-06 03:46:04,393][model8_pretrain.py][INFO] Epoch:[0/2](774100/4588595) loss:2.976 lr:0.0000100 epoch_Time:24275.0min: [2024-01-06 03:46:04,393][model8_pretrain.py][INFO] Epoch:[0/2](774100/4588595) loss:2.940 lr:0.0000100 epoch_Time:24275.0min: [2024-01-06 03:46:04,393][model8_pretrain.py][INFO] Epoch:[0/2](774100/4588595) loss:2.939 lr:0.0000100 epoch_Time:24275.0min: [2024-01-06 03:46:04,393][model8_pretrain.py][INFO] Epoch:[0/2](774100/4588595) loss:2.813 lr:0.0000100 epoch_Time:24275.0min: [2024-01-06 03:46:04,393][model8_pretrain.py][INFO] Epoch:[0/2](774100/4588595) loss:2.718 lr:0.0000100 epoch_Time:24275.0min: [2024-01-06 03:46:04,393][model8_pretrain.py][INFO] Epoch:[0/2](774100/4588595) loss:3.177 lr:0.0000100 epoch_Time:24275.0min: [2024-01-06 03:46:04,393][model8_pretrain.py][INFO] Epoch:[0/2](774100/4588595) loss:3.168 lr:0.0000100 epoch_Time:24275.0min: [2024-01-06 03:46:04,393][model8_pretrain.py][INFO] Epoch:[0/2](774100/4588595) loss:2.570 lr:0.0000100 epoch_Time:24275.0min: [2024-01-06 03:46:41,341][model8_pretrain.py][INFO] Epoch:[0/2](774200/4588595) loss:3.062 lr:0.0000100 epoch_Time:24275.0min: [2024-01-06 03:46:41,341][model8_pretrain.py][INFO] Epoch:[0/2](774200/4588595) loss:2.225 lr:0.0000100 epoch_Time:24275.0min: [2024-01-06 03:46:41,341][model8_pretrain.py][INFO] Epoch:[0/2](774200/4588595) loss:2.752 lr:0.0000100 epoch_Time:24275.0min: [2024-01-06 03:46:41,341][model8_pretrain.py][INFO] Epoch:[0/2](774200/4588595) loss:2.457 lr:0.0000100 epoch_Time:24275.0min: [2024-01-06 03:46:41,341][model8_pretrain.py][INFO] Epoch:[0/2](774200/4588595) loss:2.672 lr:0.0000100 
epoch_Time:24275.0min: [2024-01-06 03:46:41,341][model8_pretrain.py][INFO] Epoch:[0/2](774200/4588595) loss:3.267 lr:0.0000100 epoch_Time:24275.0min: [2024-01-06 03:46:41,341][model8_pretrain.py][INFO] Epoch:[0/2](774200/4588595) loss:2.260 lr:0.0000100 epoch_Time:24275.0min: [2024-01-06 03:46:41,341][model8_pretrain.py][INFO] Epoch:[0/2](774200/4588595) loss:2.695 lr:0.0000100 epoch_Time:24275.0min: [2024-01-06 03:47:18,288][model8_pretrain.py][INFO] Epoch:[0/2](774300/4588595) loss:3.020 lr:0.0000100 epoch_Time:24273.0min: [2024-01-06 03:47:18,288][model8_pretrain.py][INFO] Epoch:[0/2](774300/4588595) loss:3.138 lr:0.0000100 epoch_Time:24273.0min: [2024-01-06 03:47:18,288][model8_pretrain.py][INFO] Epoch:[0/2](774300/4588595) loss:3.008 lr:0.0000100 epoch_Time:24273.0min: [2024-01-06 03:47:18,288][model8_pretrain.py][INFO] Epoch:[0/2](774300/4588595) loss:3.201 lr:0.0000100 epoch_Time:24273.0min: [2024-01-06 03:47:18,288][model8_pretrain.py][INFO] Epoch:[0/2](774300/4588595) loss:3.102 lr:0.0000100 epoch_Time:24273.0min: [2024-01-06 03:47:18,288][model8_pretrain.py][INFO] Epoch:[0/2](774300/4588595) loss:1.941 lr:0.0000100 epoch_Time:24273.0min: [2024-01-06 03:47:18,288][model8_pretrain.py][INFO] Epoch:[0/2](774300/4588595) loss:3.374 lr:0.0000100 epoch_Time:24273.0min: [2024-01-06 03:47:18,289][model8_pretrain.py][INFO] Epoch:[0/2](774300/4588595) loss:2.823 lr:0.0000100 epoch_Time:24273.0min: [2024-01-06 03:47:55,225][model8_pretrain.py][INFO] Epoch:[0/2](774400/4588595) loss:2.754 lr:0.0000100 epoch_Time:24272.0min: [2024-01-06 03:47:55,225][model8_pretrain.py][INFO] Epoch:[0/2](774400/4588595) loss:3.093 lr:0.0000100 epoch_Time:24272.0min: [2024-01-06 03:47:55,225][model8_pretrain.py][INFO] Epoch:[0/2](774400/4588595) loss:3.216 lr:0.0000100 epoch_Time:24272.0min: [2024-01-06 03:47:55,225][model8_pretrain.py][INFO] Epoch:[0/2](774400/4588595) loss:2.490 lr:0.0000100 epoch_Time:24272.0min: [2024-01-06 03:47:55,225][model8_pretrain.py][INFO] 
Epoch:[0/2](774400/4588595) loss:2.540 lr:0.0000100 epoch_Time:24272.0min: [2024-01-06 03:47:55,225][model8_pretrain.py][INFO] Epoch:[0/2](774400/4588595) loss:2.798 lr:0.0000100 epoch_Time:24272.0min: [2024-01-06 03:47:55,225][model8_pretrain.py][INFO] Epoch:[0/2](774400/4588595) loss:2.681 lr:0.0000100 epoch_Time:24272.0min: [2024-01-06 03:47:55,225][model8_pretrain.py][INFO] Epoch:[0/2](774400/4588595) loss:3.199 lr:0.0000100 epoch_Time:24272.0min: [2024-01-06 03:48:32,167][model8_pretrain.py][INFO] Epoch:[0/2](774500/4588595) loss:2.220 lr:0.0000100 epoch_Time:24272.0min: [2024-01-06 03:48:32,167][model8_pretrain.py][INFO] Epoch:[0/2](774500/4588595) loss:2.896 lr:0.0000100 epoch_Time:24272.0min: [2024-01-06 03:48:32,167][model8_pretrain.py][INFO] Epoch:[0/2](774500/4588595) loss:3.085 lr:0.0000100 epoch_Time:24272.0min: [2024-01-06 03:48:32,167][model8_pretrain.py][INFO] Epoch:[0/2](774500/4588595) loss:2.942 lr:0.0000100 epoch_Time:24272.0min: [2024-01-06 03:48:32,167][model8_pretrain.py][INFO] Epoch:[0/2](774500/4588595) loss:2.800 lr:0.0000100 epoch_Time:24272.0min: [2024-01-06 03:48:32,167][model8_pretrain.py][INFO] Epoch:[0/2](774500/4588595) loss:2.923 lr:0.0000100 epoch_Time:24272.0min: [2024-01-06 03:48:32,167][model8_pretrain.py][INFO] Epoch:[0/2](774500/4588595) loss:3.020 lr:0.0000100 epoch_Time:24272.0min: [2024-01-06 03:48:32,167][model8_pretrain.py][INFO] Epoch:[0/2](774500/4588595) loss:2.525 lr:0.0000100 epoch_Time:24272.0min: [2024-01-06 03:49:16,059][model8_pretrain.py][INFO] Epoch:[0/2](774600/4588595) loss:2.743 lr:0.0000100 epoch_Time:24272.0min: [2024-01-06 03:49:16,059][model8_pretrain.py][INFO] Epoch:[0/2](774600/4588595) loss:2.534 lr:0.0000100 epoch_Time:24272.0min: [2024-01-06 03:49:16,060][model8_pretrain.py][INFO] Epoch:[0/2](774600/4588595) loss:2.638 lr:0.0000100 epoch_Time:24272.0min: [2024-01-06 03:49:16,062][model8_pretrain.py][INFO] Epoch:[0/2](774600/4588595) loss:2.155 lr:0.0000100 epoch_Time:24272.0min: [2024-01-06 
03:49:16,062][model8_pretrain.py][INFO] Epoch:[0/2](774600/4588595) loss:3.012 lr:0.0000100 epoch_Time:24272.0min: [2024-01-06 03:49:16,062][model8_pretrain.py][INFO] Epoch:[0/2](774600/4588595) loss:2.955 lr:0.0000100 epoch_Time:24272.0min: [2024-01-06 03:49:16,063][model8_pretrain.py][INFO] Epoch:[0/2](774600/4588595) loss:2.860 lr:0.0000100 epoch_Time:24272.0min: [2024-01-06 03:49:16,064][model8_pretrain.py][INFO] Epoch:[0/2](774600/4588595) loss:3.048 lr:0.0000100 epoch_Time:24272.0min: [2024-01-06 03:49:56,452][model8_pretrain.py][INFO] Epoch:[0/2](774700/4588595) loss:3.041 lr:0.0000100 epoch_Time:24271.0min: [2024-01-06 03:49:56,452][model8_pretrain.py][INFO] Epoch:[0/2](774700/4588595) loss:3.161 lr:0.0000100 epoch_Time:24271.0min: [2024-01-06 03:49:56,452][model8_pretrain.py][INFO] Epoch:[0/2](774700/4588595) loss:2.883 lr:0.0000100 epoch_Time:24271.0min: [2024-01-06 03:49:56,452][model8_pretrain.py][INFO] Epoch:[0/2](774700/4588595) loss:2.634 lr:0.0000100 epoch_Time:24271.0min: [2024-01-06 03:49:56,452][model8_pretrain.py][INFO] Epoch:[0/2](774700/4588595) loss:2.658 lr:0.0000100 epoch_Time:24271.0min: [2024-01-06 03:49:56,452][model8_pretrain.py][INFO] Epoch:[0/2](774700/4588595) loss:3.261 lr:0.0000100 epoch_Time:24271.0min: [2024-01-06 03:49:56,452][model8_pretrain.py][INFO] Epoch:[0/2](774700/4588595) loss:2.444 lr:0.0000100 epoch_Time:24271.0min: [2024-01-06 03:49:56,452][model8_pretrain.py][INFO] Epoch:[0/2](774700/4588595) loss:2.648 lr:0.0000100 epoch_Time:24271.0min: [2024-01-06 03:50:33,387][model8_pretrain.py][INFO] Epoch:[0/2](774800/4588595) loss:2.833 lr:0.0000100 epoch_Time:24271.0min: [2024-01-06 03:50:33,387][model8_pretrain.py][INFO] Epoch:[0/2](774800/4588595) loss:2.537 lr:0.0000100 epoch_Time:24271.0min: [2024-01-06 03:50:33,387][model8_pretrain.py][INFO] Epoch:[0/2](774800/4588595) loss:2.929 lr:0.0000100 epoch_Time:24271.0min: [2024-01-06 03:50:33,387][model8_pretrain.py][INFO] Epoch:[0/2](774800/4588595) loss:2.601 lr:0.0000100 
epoch_Time:24271.0min: [2024-01-06 03:50:33,387][model8_pretrain.py][INFO] Epoch:[0/2](774800/4588595) loss:2.418 lr:0.0000100 epoch_Time:24271.0min: [2024-01-06 03:50:33,387][model8_pretrain.py][INFO] Epoch:[0/2](774800/4588595) loss:3.168 lr:0.0000100 epoch_Time:24271.0min: [2024-01-06 03:50:33,388][model8_pretrain.py][INFO] Epoch:[0/2](774800/4588595) loss:2.695 lr:0.0000100 epoch_Time:24271.0min: [2024-01-06 03:50:33,388][model8_pretrain.py][INFO] Epoch:[0/2](774800/4588595) loss:3.162 lr:0.0000100 epoch_Time:24271.0min: [2024-01-06 03:51:10,337][model8_pretrain.py][INFO] Epoch:[0/2](774900/4588595) loss:3.269 lr:0.0000100 epoch_Time:24270.0min: [2024-01-06 03:51:10,337][model8_pretrain.py][INFO] Epoch:[0/2](774900/4588595) loss:3.248 lr:0.0000100 epoch_Time:24270.0min: [2024-01-06 03:51:10,337][model8_pretrain.py][INFO] Epoch:[0/2](774900/4588595) loss:2.700 lr:0.0000100 epoch_Time:24270.0min: [2024-01-06 03:51:10,337][model8_pretrain.py][INFO] Epoch:[0/2](774900/4588595) loss:3.023 lr:0.0000100 epoch_Time:24270.0min: [2024-01-06 03:51:10,337][model8_pretrain.py][INFO] Epoch:[0/2](774900/4588595) loss:2.785 lr:0.0000100 epoch_Time:24270.0min: [2024-01-06 03:51:10,337][model8_pretrain.py][INFO] Epoch:[0/2](774900/4588595) loss:3.533 lr:0.0000100 epoch_Time:24270.0min: [2024-01-06 03:51:10,338][model8_pretrain.py][INFO] Epoch:[0/2](774900/4588595) loss:2.945 lr:0.0000100 epoch_Time:24270.0min: [2024-01-06 03:51:10,339][model8_pretrain.py][INFO] Epoch:[0/2](774900/4588595) loss:3.056 lr:0.0000100 epoch_Time:24270.0min: [2024-01-06 03:51:47,280][model8_pretrain.py][INFO] Epoch:[0/2](775000/4588595) loss:3.164 lr:0.0000100 epoch_Time:24270.0min: [2024-01-06 03:51:47,280][model8_pretrain.py][INFO] Epoch:[0/2](775000/4588595) loss:2.897 lr:0.0000100 epoch_Time:24270.0min: [2024-01-06 03:51:47,280][model8_pretrain.py][INFO] Epoch:[0/2](775000/4588595) loss:2.445 lr:0.0000100 epoch_Time:24270.0min: [2024-01-06 03:51:47,280][model8_pretrain.py][INFO] 
Epoch:[0/2](775000/4588595) loss:2.722 lr:0.0000100 epoch_Time:24270.0min: [2024-01-06 03:51:47,280][model8_pretrain.py][INFO] Epoch:[0/2](775000/4588595) loss:3.153 lr:0.0000100 epoch_Time:24270.0min: [2024-01-06 03:51:47,280][model8_pretrain.py][INFO] Epoch:[0/2](775000/4588595) loss:2.501 lr:0.0000100 epoch_Time:24270.0min: [2024-01-06 03:51:47,280][model8_pretrain.py][INFO] Epoch:[0/2](775000/4588595) loss:2.946 lr:0.0000100 epoch_Time:24270.0min: [2024-01-06 03:51:47,280][model8_pretrain.py][INFO] Epoch:[0/2](775000/4588595) loss:3.576 lr:0.0000100 epoch_Time:24270.0min: [2024-01-06 03:52:24,224][model8_pretrain.py][INFO] Epoch:[0/2](775100/4588595) loss:2.228 lr:0.0000100 epoch_Time:24269.0min: [2024-01-06 03:52:24,224][model8_pretrain.py][INFO] Epoch:[0/2](775100/4588595) loss:2.444 lr:0.0000100 epoch_Time:24269.0min: [2024-01-06 03:52:24,224][model8_pretrain.py][INFO] Epoch:[0/2](775100/4588595) loss:2.753 lr:0.0000100 epoch_Time:24269.0min: [2024-01-06 03:52:24,224][model8_pretrain.py][INFO] Epoch:[0/2](775100/4588595) loss:2.320 lr:0.0000100 epoch_Time:24269.0min: [2024-01-06 03:52:24,224][model8_pretrain.py][INFO] Epoch:[0/2](775100/4588595) loss:3.152 lr:0.0000100 epoch_Time:24269.0min: [2024-01-06 03:52:24,224][model8_pretrain.py][INFO] Epoch:[0/2](775100/4588595) loss:2.472 lr:0.0000100 epoch_Time:24269.0min: [2024-01-06 03:52:24,224][model8_pretrain.py][INFO] Epoch:[0/2](775100/4588595) loss:2.640 lr:0.0000100 epoch_Time:24269.0min: [2024-01-06 03:52:24,224][model8_pretrain.py][INFO] Epoch:[0/2](775100/4588595) loss:2.890 lr:0.0000100 epoch_Time:24269.0min: [2024-01-06 03:53:01,173][model8_pretrain.py][INFO] Epoch:[0/2](775200/4588595) loss:2.784 lr:0.0000100 epoch_Time:24267.0min: [2024-01-06 03:53:01,173][model8_pretrain.py][INFO] Epoch:[0/2](775200/4588595) loss:2.464 lr:0.0000100 epoch_Time:24267.0min: [2024-01-06 03:53:01,173][model8_pretrain.py][INFO] Epoch:[0/2](775200/4588595) loss:2.884 lr:0.0000100 epoch_Time:24267.0min: [2024-01-06 
03:53:01,173][model8_pretrain.py][INFO] Epoch:[0/2](775200/4588595) loss:2.494 lr:0.0000100 epoch_Time:24267.0min: [2024-01-06 03:53:01,173][model8_pretrain.py][INFO] Epoch:[0/2](775200/4588595) loss:3.277 lr:0.0000100 epoch_Time:24267.0min: [2024-01-06 03:53:01,173][model8_pretrain.py][INFO] Epoch:[0/2](775200/4588595) loss:3.042 lr:0.0000100 epoch_Time:24267.0min: [2024-01-06 03:53:01,173][model8_pretrain.py][INFO] Epoch:[0/2](775200/4588595) loss:2.255 lr:0.0000100 epoch_Time:24267.0min: [2024-01-06 03:53:01,173][model8_pretrain.py][INFO] Epoch:[0/2](775200/4588595) loss:2.552 lr:0.0000100 epoch_Time:24267.0min: [2024-01-06 03:53:38,126][model8_pretrain.py][INFO] Epoch:[0/2](775300/4588595) loss:2.983 lr:0.0000100 epoch_Time:24267.0min: [2024-01-06 03:53:38,126][model8_pretrain.py][INFO] Epoch:[0/2](775300/4588595) loss:2.798 lr:0.0000100 epoch_Time:24267.0min: [2024-01-06 03:53:38,126][model8_pretrain.py][INFO] Epoch:[0/2](775300/4588595) loss:2.620 lr:0.0000100 epoch_Time:24267.0min: [2024-01-06 03:53:38,127][model8_pretrain.py][INFO] Epoch:[0/2](775300/4588595) loss:2.603 lr:0.0000100 epoch_Time:24267.0min: [2024-01-06 03:53:38,127][model8_pretrain.py][INFO] Epoch:[0/2](775300/4588595) loss:2.767 lr:0.0000100 epoch_Time:24267.0min: [2024-01-06 03:53:38,127][model8_pretrain.py][INFO] Epoch:[0/2](775300/4588595) loss:2.747 lr:0.0000100 epoch_Time:24267.0min: [2024-01-06 03:53:38,127][model8_pretrain.py][INFO] Epoch:[0/2](775300/4588595) loss:2.590 lr:0.0000100 epoch_Time:24267.0min: [2024-01-06 03:53:38,127][model8_pretrain.py][INFO] Epoch:[0/2](775300/4588595) loss:2.605 lr:0.0000100 epoch_Time:24267.0min: [2024-01-06 03:54:20,129][model8_pretrain.py][INFO] Epoch:[0/2](775400/4588595) loss:2.498 lr:0.0000100 epoch_Time:24267.0min: [2024-01-06 03:54:20,129][model8_pretrain.py][INFO] Epoch:[0/2](775400/4588595) loss:2.276 lr:0.0000100 epoch_Time:24267.0min: [2024-01-06 03:54:20,129][model8_pretrain.py][INFO] Epoch:[0/2](775400/4588595) loss:2.357 lr:0.0000100 
epoch_Time:24267.0min: [2024-01-06 03:54:20,129][model8_pretrain.py][INFO] Epoch:[0/2](775400/4588595) loss:2.450 lr:0.0000100 epoch_Time:24267.0min: [2024-01-06 03:54:20,129][model8_pretrain.py][INFO] Epoch:[0/2](775400/4588595) loss:2.649 lr:0.0000100 epoch_Time:24267.0min: [2024-01-06 03:54:20,129][model8_pretrain.py][INFO] Epoch:[0/2](775400/4588595) loss:2.734 lr:0.0000100 epoch_Time:24267.0min: [2024-01-06 03:54:20,129][model8_pretrain.py][INFO] Epoch:[0/2](775400/4588595) loss:3.413 lr:0.0000100 epoch_Time:24267.0min: [2024-01-06 03:54:20,129][model8_pretrain.py][INFO] Epoch:[0/2](775400/4588595) loss:3.090 lr:0.0000100 epoch_Time:24267.0min: [2024-01-06 03:55:02,239][model8_pretrain.py][INFO] Epoch:[0/2](775500/4588595) loss:2.384 lr:0.0000100 epoch_Time:24266.0min: [2024-01-06 03:55:02,239][model8_pretrain.py][INFO] Epoch:[0/2](775500/4588595) loss:2.747 lr:0.0000100 epoch_Time:24266.0min: [2024-01-06 03:55:02,239][model8_pretrain.py][INFO] Epoch:[0/2](775500/4588595) loss:2.944 lr:0.0000100 epoch_Time:24266.0min: [2024-01-06 03:55:02,239][model8_pretrain.py][INFO] Epoch:[0/2](775500/4588595) loss:3.086 lr:0.0000100 epoch_Time:24266.0min: [2024-01-06 03:55:02,239][model8_pretrain.py][INFO] Epoch:[0/2](775500/4588595) loss:2.690 lr:0.0000100 epoch_Time:24266.0min: [2024-01-06 03:55:02,239][model8_pretrain.py][INFO] Epoch:[0/2](775500/4588595) loss:3.011 lr:0.0000100 epoch_Time:24266.0min: [2024-01-06 03:55:02,239][model8_pretrain.py][INFO] Epoch:[0/2](775500/4588595) loss:3.392 lr:0.0000100 epoch_Time:24266.0min: [2024-01-06 03:55:02,240][model8_pretrain.py][INFO] Epoch:[0/2](775500/4588595) loss:2.885 lr:0.0000100 epoch_Time:24266.0min: [2024-01-06 03:55:39,182][model8_pretrain.py][INFO] Epoch:[0/2](775600/4588595) loss:2.219 lr:0.0000100 epoch_Time:24266.0min: [2024-01-06 03:55:39,182][model8_pretrain.py][INFO] Epoch:[0/2](775600/4588595) loss:2.533 lr:0.0000100 epoch_Time:24266.0min: [2024-01-06 03:55:39,182][model8_pretrain.py][INFO] 
Epoch:[0/2](775600/4588595) loss:2.955 lr:0.0000100 epoch_Time:24266.0min: [2024-01-06 03:55:39,182][model8_pretrain.py][INFO] Epoch:[0/2](775600/4588595) loss:2.831 lr:0.0000100 epoch_Time:24266.0min: [2024-01-06 03:55:39,182][model8_pretrain.py][INFO] Epoch:[0/2](775600/4588595) loss:3.046 lr:0.0000100 epoch_Time:24266.0min: [2024-01-06 03:55:39,182][model8_pretrain.py][INFO] Epoch:[0/2](775600/4588595) loss:2.743 lr:0.0000100 epoch_Time:24266.0min: [2024-01-06 03:55:39,182][model8_pretrain.py][INFO] Epoch:[0/2](775600/4588595) loss:2.483 lr:0.0000100 epoch_Time:24266.0min: [2024-01-06 03:55:39,182][model8_pretrain.py][INFO] Epoch:[0/2](775600/4588595) loss:2.577 lr:0.0000100 epoch_Time:24266.0min: [2024-01-06 03:56:16,120][model8_pretrain.py][INFO] Epoch:[0/2](775700/4588595) loss:2.955 lr:0.0000100 epoch_Time:24265.0min: [2024-01-06 03:56:16,120][model8_pretrain.py][INFO] Epoch:[0/2](775700/4588595) loss:2.806 lr:0.0000100 epoch_Time:24265.0min: [2024-01-06 03:56:16,120][model8_pretrain.py][INFO] Epoch:[0/2](775700/4588595) loss:3.030 lr:0.0000100 epoch_Time:24265.0min: [2024-01-06 03:56:16,120][model8_pretrain.py][INFO] Epoch:[0/2](775700/4588595) loss:2.537 lr:0.0000100 epoch_Time:24265.0min: [2024-01-06 03:56:16,120][model8_pretrain.py][INFO] Epoch:[0/2](775700/4588595) loss:2.735 lr:0.0000100 epoch_Time:24265.0min: [2024-01-06 03:56:16,120][model8_pretrain.py][INFO] Epoch:[0/2](775700/4588595) loss:2.855 lr:0.0000100 epoch_Time:24265.0min: [2024-01-06 03:56:16,120][model8_pretrain.py][INFO] Epoch:[0/2](775700/4588595) loss:2.688 lr:0.0000100 epoch_Time:24265.0min: [2024-01-06 03:56:16,120][model8_pretrain.py][INFO] Epoch:[0/2](775700/4588595) loss:3.096 lr:0.0000100 epoch_Time:24265.0min: [2024-01-06 03:56:53,061][model8_pretrain.py][INFO] Epoch:[0/2](775800/4588595) loss:3.152 lr:0.0000100 epoch_Time:24264.0min: [2024-01-06 03:56:53,061][model8_pretrain.py][INFO] Epoch:[0/2](775800/4588595) loss:2.233 lr:0.0000100 epoch_Time:24264.0min: [2024-01-06 
03:56:53,061][model8_pretrain.py][INFO] Epoch:[0/2](775800/4588595) loss:2.333 lr:0.0000100 epoch_Time:24264.0min: [2024-01-06 03:56:53,061][model8_pretrain.py][INFO] Epoch:[0/2](775800/4588595) loss:2.856 lr:0.0000100 epoch_Time:24264.0min: [2024-01-06 03:56:53,062][model8_pretrain.py][INFO] Epoch:[0/2](775800/4588595) loss:2.439 lr:0.0000100 epoch_Time:24264.0min: [2024-01-06 03:56:53,061][model8_pretrain.py][INFO] Epoch:[0/2](775800/4588595) loss:2.746 lr:0.0000100 epoch_Time:24264.0min: [2024-01-06 03:56:53,062][model8_pretrain.py][INFO] Epoch:[0/2](775800/4588595) loss:2.764 lr:0.0000100 epoch_Time:24264.0min: [2024-01-06 03:56:53,062][model8_pretrain.py][INFO] Epoch:[0/2](775800/4588595) loss:3.251 lr:0.0000100 epoch_Time:24264.0min: [2024-01-06 03:57:30,015][model8_pretrain.py][INFO] Epoch:[0/2](775900/4588595) loss:2.642 lr:0.0000100 epoch_Time:24264.0min: [2024-01-06 03:57:30,015][model8_pretrain.py][INFO] Epoch:[0/2](775900/4588595) loss:3.019 lr:0.0000100 epoch_Time:24264.0min: [2024-01-06 03:57:30,015][model8_pretrain.py][INFO] Epoch:[0/2](775900/4588595) loss:2.485 lr:0.0000100 epoch_Time:24264.0min: [2024-01-06 03:57:30,015][model8_pretrain.py][INFO] Epoch:[0/2](775900/4588595) loss:2.603 lr:0.0000100 epoch_Time:24264.0min: [2024-01-06 03:57:30,015][model8_pretrain.py][INFO] Epoch:[0/2](775900/4588595) loss:3.046 lr:0.0000100 epoch_Time:24264.0min: [2024-01-06 03:57:30,015][model8_pretrain.py][INFO] Epoch:[0/2](775900/4588595) loss:2.572 lr:0.0000100 epoch_Time:24264.0min: [2024-01-06 03:57:30,015][model8_pretrain.py][INFO] Epoch:[0/2](775900/4588595) loss:2.595 lr:0.0000100 epoch_Time:24264.0min: [2024-01-06 03:57:30,015][model8_pretrain.py][INFO] Epoch:[0/2](775900/4588595) loss:2.676 lr:0.0000100 epoch_Time:24264.0min: [2024-01-06 03:58:06,960][model8_pretrain.py][INFO] Epoch:[0/2](776000/4588595) loss:2.900 lr:0.0000100 epoch_Time:24262.0min: [2024-01-06 03:58:06,960][model8_pretrain.py][INFO] Epoch:[0/2](776000/4588595) loss:3.024 lr:0.0000100 
epoch_Time:24262.0min: [2024-01-06 03:58:06,960][model8_pretrain.py][INFO] Epoch:[0/2](776000/4588595) loss:2.373 lr:0.0000100 epoch_Time:24262.0min: [2024-01-06 03:58:06,960][model8_pretrain.py][INFO] Epoch:[0/2](776000/4588595) loss:3.007 lr:0.0000100 epoch_Time:24262.0min: [2024-01-06 03:58:06,960][model8_pretrain.py][INFO] Epoch:[0/2](776000/4588595) loss:2.660 lr:0.0000100 epoch_Time:24262.0min: [2024-01-06 03:58:06,961][model8_pretrain.py][INFO] Epoch:[0/2](776000/4588595) loss:2.621 lr:0.0000100 epoch_Time:24262.0min: [2024-01-06 03:58:06,961][model8_pretrain.py][INFO] Epoch:[0/2](776000/4588595) loss:2.506 lr:0.0000100 epoch_Time:24262.0min: [2024-01-06 03:58:06,962][model8_pretrain.py][INFO] Epoch:[0/2](776000/4588595) loss:2.299 lr:0.0000100 epoch_Time:24262.0min: [2024-01-06 03:58:43,906][model8_pretrain.py][INFO] Epoch:[0/2](776100/4588595) loss:3.100 lr:0.0000100 epoch_Time:24262.0min: [2024-01-06 03:58:43,906][model8_pretrain.py][INFO] Epoch:[0/2](776100/4588595) loss:2.877 lr:0.0000100 epoch_Time:24262.0min: [2024-01-06 03:58:43,906][model8_pretrain.py][INFO] Epoch:[0/2](776100/4588595) loss:2.722 lr:0.0000100 epoch_Time:24262.0min: [2024-01-06 03:58:43,906][model8_pretrain.py][INFO] Epoch:[0/2](776100/4588595) loss:2.716 lr:0.0000100 epoch_Time:24262.0min: [2024-01-06 03:58:43,906][model8_pretrain.py][INFO] Epoch:[0/2](776100/4588595) loss:2.209 lr:0.0000100 epoch_Time:24262.0min: [2024-01-06 03:58:43,906][model8_pretrain.py][INFO] Epoch:[0/2](776100/4588595) loss:3.001 lr:0.0000100 epoch_Time:24262.0min: [2024-01-06 03:58:43,906][model8_pretrain.py][INFO] Epoch:[0/2](776100/4588595) loss:3.416 lr:0.0000100 epoch_Time:24262.0min: [2024-01-06 03:58:43,906][model8_pretrain.py][INFO] Epoch:[0/2](776100/4588595) loss:3.002 lr:0.0000100 epoch_Time:24262.0min: [2024-01-06 03:59:25,586][model8_pretrain.py][INFO] Epoch:[0/2](776200/4588595) loss:2.364 lr:0.0000100 epoch_Time:24262.0min: [2024-01-06 03:59:25,586][model8_pretrain.py][INFO] 
Epoch:[0/2](776200/4588595) loss:2.858 lr:0.0000100 epoch_Time:24262.0min: [2024-01-06 03:59:25,586][model8_pretrain.py][INFO] Epoch:[0/2](776200/4588595) loss:3.217 lr:0.0000100 epoch_Time:24262.0min: [2024-01-06 03:59:25,586][model8_pretrain.py][INFO] Epoch:[0/2](776200/4588595) loss:3.072 lr:0.0000100 epoch_Time:24262.0min: [2024-01-06 03:59:25,586][model8_pretrain.py][INFO] Epoch:[0/2](776200/4588595) loss:2.453 lr:0.0000100 epoch_Time:24262.0min: [2024-01-06 03:59:25,586][model8_pretrain.py][INFO] Epoch:[0/2](776200/4588595) loss:3.151 lr:0.0000100 epoch_Time:24262.0min: [2024-01-06 03:59:25,586][model8_pretrain.py][INFO] Epoch:[0/2](776200/4588595) loss:2.905 lr:0.0000100 epoch_Time:24262.0min: [2024-01-06 03:59:25,586][model8_pretrain.py][INFO] Epoch:[0/2](776200/4588595) loss:3.116 lr:0.0000100 epoch_Time:24262.0min: [2024-01-06 04:00:07,708][model8_pretrain.py][INFO] Epoch:[0/2](776300/4588595) loss:2.460 lr:0.0000100 epoch_Time:24261.0min: [2024-01-06 04:00:07,708][model8_pretrain.py][INFO] Epoch:[0/2](776300/4588595) loss:2.627 lr:0.0000100 epoch_Time:24261.0min: [2024-01-06 04:00:07,708][model8_pretrain.py][INFO] Epoch:[0/2](776300/4588595) loss:2.843 lr:0.0000100 epoch_Time:24261.0min: [2024-01-06 04:00:07,708][model8_pretrain.py][INFO] Epoch:[0/2](776300/4588595) loss:2.742 lr:0.0000100 epoch_Time:24261.0min: [2024-01-06 04:00:07,708][model8_pretrain.py][INFO] Epoch:[0/2](776300/4588595) loss:2.934 lr:0.0000100 epoch_Time:24261.0min: [2024-01-06 04:00:07,708][model8_pretrain.py][INFO] Epoch:[0/2](776300/4588595) loss:2.421 lr:0.0000100 epoch_Time:24261.0min: [2024-01-06 04:00:07,708][model8_pretrain.py][INFO] Epoch:[0/2](776300/4588595) loss:2.974 lr:0.0000100 epoch_Time:24261.0min: [2024-01-06 04:00:07,708][model8_pretrain.py][INFO] Epoch:[0/2](776300/4588595) loss:2.891 lr:0.0000100 epoch_Time:24261.0min: [2024-01-06 04:00:44,655][model8_pretrain.py][INFO] Epoch:[0/2](776400/4588595) loss:2.678 lr:0.0000100 epoch_Time:24261.0min: [2024-01-06 
04:00:44,655][model8_pretrain.py][INFO] Epoch:[0/2](776400/4588595) loss:3.520 lr:0.0000100 epoch_Time:24261.0min: [2024-01-06 04:00:44,655][model8_pretrain.py][INFO] Epoch:[0/2](776400/4588595) loss:3.102 lr:0.0000100 epoch_Time:24261.0min: [2024-01-06 04:00:44,655][model8_pretrain.py][INFO] Epoch:[0/2](776400/4588595) loss:2.254 lr:0.0000100 epoch_Time:24261.0min: [2024-01-06 04:00:44,655][model8_pretrain.py][INFO] Epoch:[0/2](776400/4588595) loss:2.930 lr:0.0000100 epoch_Time:24261.0min: [2024-01-06 04:00:44,655][model8_pretrain.py][INFO] Epoch:[0/2](776400/4588595) loss:2.737 lr:0.0000100 epoch_Time:24261.0min: [2024-01-06 04:00:44,655][model8_pretrain.py][INFO] Epoch:[0/2](776400/4588595) loss:3.006 lr:0.0000100 epoch_Time:24261.0min: [2024-01-06 04:00:44,655][model8_pretrain.py][INFO] Epoch:[0/2](776400/4588595) loss:2.748 lr:0.0000100 epoch_Time:24261.0min: [2024-01-06 04:01:21,603][model8_pretrain.py][INFO] Epoch:[0/2](776500/4588595) loss:2.630 lr:0.0000100 epoch_Time:24260.0min: [2024-01-06 04:01:21,603][model8_pretrain.py][INFO] Epoch:[0/2](776500/4588595) loss:2.489 lr:0.0000100 epoch_Time:24260.0min: [2024-01-06 04:01:21,603][model8_pretrain.py][INFO] Epoch:[0/2](776500/4588595) loss:2.868 lr:0.0000100 epoch_Time:24260.0min: [2024-01-06 04:01:21,603][model8_pretrain.py][INFO] Epoch:[0/2](776500/4588595) loss:3.051 lr:0.0000100 epoch_Time:24260.0min: [2024-01-06 04:01:21,603][model8_pretrain.py][INFO] Epoch:[0/2](776500/4588595) loss:2.931 lr:0.0000100 epoch_Time:24260.0min: [2024-01-06 04:01:21,603][model8_pretrain.py][INFO] Epoch:[0/2](776500/4588595) loss:3.062 lr:0.0000100 epoch_Time:24260.0min: [2024-01-06 04:01:21,604][model8_pretrain.py][INFO] Epoch:[0/2](776500/4588595) loss:2.967 lr:0.0000100 epoch_Time:24260.0min: [2024-01-06 04:01:21,604][model8_pretrain.py][INFO] Epoch:[0/2](776500/4588595) loss:2.608 lr:0.0000100 epoch_Time:24260.0min: [2024-01-06 04:01:58,552][model8_pretrain.py][INFO] Epoch:[0/2](776600/4588595) loss:2.480 lr:0.0000100 
epoch_Time:24259.0min: [2024-01-06 04:01:58,552][model8_pretrain.py][INFO] Epoch:[0/2](776600/4588595) loss:2.788 lr:0.0000100 epoch_Time:24259.0min: [2024-01-06 04:01:58,552][model8_pretrain.py][INFO] Epoch:[0/2](776600/4588595) loss:2.460 lr:0.0000100 epoch_Time:24259.0min: [2024-01-06 04:01:58,552][model8_pretrain.py][INFO] Epoch:[0/2](776600/4588595) loss:3.519 lr:0.0000100 epoch_Time:24259.0min: [2024-01-06 04:01:58,552][model8_pretrain.py][INFO] Epoch:[0/2](776600/4588595) loss:2.111 lr:0.0000100 epoch_Time:24259.0min: [2024-01-06 04:01:58,552][model8_pretrain.py][INFO] Epoch:[0/2](776600/4588595) loss:2.325 lr:0.0000100 epoch_Time:24259.0min: [2024-01-06 04:01:58,552][model8_pretrain.py][INFO] Epoch:[0/2](776600/4588595) loss:3.108 lr:0.0000100 epoch_Time:24259.0min: [2024-01-06 04:01:58,552][model8_pretrain.py][INFO] Epoch:[0/2](776600/4588595) loss:2.811 lr:0.0000100 epoch_Time:24259.0min: [2024-01-06 04:02:35,507][model8_pretrain.py][INFO] Epoch:[0/2](776700/4588595) loss:2.762 lr:0.0000100 epoch_Time:24259.0min: [2024-01-06 04:02:35,507][model8_pretrain.py][INFO] Epoch:[0/2](776700/4588595) loss:3.012 lr:0.0000100 epoch_Time:24259.0min: [2024-01-06 04:02:35,507][model8_pretrain.py][INFO] Epoch:[0/2](776700/4588595) loss:3.307 lr:0.0000100 epoch_Time:24259.0min: [2024-01-06 04:02:35,507][model8_pretrain.py][INFO] Epoch:[0/2](776700/4588595) loss:2.815 lr:0.0000100 epoch_Time:24259.0min: [2024-01-06 04:02:35,507][model8_pretrain.py][INFO] Epoch:[0/2](776700/4588595) loss:2.973 lr:0.0000100 epoch_Time:24259.0min: [2024-01-06 04:02:35,507][model8_pretrain.py][INFO] Epoch:[0/2](776700/4588595) loss:2.739 lr:0.0000100 epoch_Time:24259.0min: [2024-01-06 04:02:35,507][model8_pretrain.py][INFO] Epoch:[0/2](776700/4588595) loss:2.648 lr:0.0000100 epoch_Time:24259.0min: [2024-01-06 04:02:35,507][model8_pretrain.py][INFO] Epoch:[0/2](776700/4588595) loss:3.039 lr:0.0000100 epoch_Time:24259.0min: [2024-01-06 04:03:12,458][model8_pretrain.py][INFO] 
Epoch:[0/2](776800/4588595) loss:3.004 lr:0.0000100 epoch_Time:24257.0min: [2024-01-06 04:03:12,458][model8_pretrain.py][INFO] Epoch:[0/2](776800/4588595) loss:2.976 lr:0.0000100 epoch_Time:24257.0min: [2024-01-06 04:03:12,458][model8_pretrain.py][INFO] Epoch:[0/2](776800/4588595) loss:2.253 lr:0.0000100 epoch_Time:24257.0min: [2024-01-06 04:03:12,458][model8_pretrain.py][INFO] Epoch:[0/2](776800/4588595) loss:3.099 lr:0.0000100 epoch_Time:24257.0min: [2024-01-06 04:03:12,458][model8_pretrain.py][INFO] Epoch:[0/2](776800/4588595) loss:3.066 lr:0.0000100 epoch_Time:24257.0min: [2024-01-06 04:03:12,459][model8_pretrain.py][INFO] Epoch:[0/2](776800/4588595) loss:2.628 lr:0.0000100 epoch_Time:24257.0min: [2024-01-06 04:03:12,459][model8_pretrain.py][INFO] Epoch:[0/2](776800/4588595) loss:2.705 lr:0.0000100 epoch_Time:24257.0min: [2024-01-06 04:03:12,459][model8_pretrain.py][INFO] Epoch:[0/2](776800/4588595) loss:2.633 lr:0.0000100 epoch_Time:24257.0min: [2024-01-06 04:03:49,398][model8_pretrain.py][INFO] Epoch:[0/2](776900/4588595) loss:2.870 lr:0.0000100 epoch_Time:24256.0min: [2024-01-06 04:03:49,398][model8_pretrain.py][INFO] Epoch:[0/2](776900/4588595) loss:3.227 lr:0.0000100 epoch_Time:24256.0min: [2024-01-06 04:03:49,398][model8_pretrain.py][INFO] Epoch:[0/2](776900/4588595) loss:2.352 lr:0.0000100 epoch_Time:24256.0min: [2024-01-06 04:03:49,398][model8_pretrain.py][INFO] Epoch:[0/2](776900/4588595) loss:2.830 lr:0.0000100 epoch_Time:24256.0min: [2024-01-06 04:03:49,398][model8_pretrain.py][INFO] Epoch:[0/2](776900/4588595) loss:2.735 lr:0.0000100 epoch_Time:24256.0min: [2024-01-06 04:03:49,398][model8_pretrain.py][INFO] Epoch:[0/2](776900/4588595) loss:2.660 lr:0.0000100 epoch_Time:24256.0min: [2024-01-06 04:03:49,399][model8_pretrain.py][INFO] Epoch:[0/2](776900/4588595) loss:2.823 lr:0.0000100 epoch_Time:24256.0min: [2024-01-06 04:03:49,399][model8_pretrain.py][INFO] Epoch:[0/2](776900/4588595) loss:2.305 lr:0.0000100 epoch_Time:24256.0min: [2024-01-06 
04:04:30,936][model8_pretrain.py][INFO] Epoch:[0/2](777000/4588595) loss:2.149 lr:0.0000100 epoch_Time:24257.0min: [2024-01-06 04:04:30,936][model8_pretrain.py][INFO] Epoch:[0/2](777000/4588595) loss:2.919 lr:0.0000100 epoch_Time:24257.0min: [2024-01-06 04:04:30,936][model8_pretrain.py][INFO] Epoch:[0/2](777000/4588595) loss:3.133 lr:0.0000100 epoch_Time:24257.0min: [2024-01-06 04:04:30,936][model8_pretrain.py][INFO] Epoch:[0/2](777000/4588595) loss:3.127 lr:0.0000100 epoch_Time:24257.0min: [2024-01-06 04:04:30,936][model8_pretrain.py][INFO] Epoch:[0/2](777000/4588595) loss:2.904 lr:0.0000100 epoch_Time:24257.0min: [2024-01-06 04:04:30,936][model8_pretrain.py][INFO] Epoch:[0/2](777000/4588595) loss:2.816 lr:0.0000100 epoch_Time:24257.0min: [2024-01-06 04:04:30,940][model8_pretrain.py][INFO] Epoch:[0/2](777000/4588595) loss:2.907 lr:0.0000100 epoch_Time:24257.0min: [2024-01-06 04:04:30,941][model8_pretrain.py][INFO] Epoch:[0/2](777000/4588595) loss:2.466 lr:0.0000100 epoch_Time:24257.0min: [2024-01-06 04:05:13,012][model8_pretrain.py][INFO] Epoch:[0/2](777100/4588595) loss:3.176 lr:0.0000100 epoch_Time:24256.0min: [2024-01-06 04:05:13,012][model8_pretrain.py][INFO] Epoch:[0/2](777100/4588595) loss:2.601 lr:0.0000100 epoch_Time:24256.0min: [2024-01-06 04:05:13,012][model8_pretrain.py][INFO] Epoch:[0/2](777100/4588595) loss:2.804 lr:0.0000100 epoch_Time:24256.0min: [2024-01-06 04:05:13,012][model8_pretrain.py][INFO] Epoch:[0/2](777100/4588595) loss:2.434 lr:0.0000100 epoch_Time:24256.0min: [2024-01-06 04:05:13,012][model8_pretrain.py][INFO] Epoch:[0/2](777100/4588595) loss:2.860 lr:0.0000100 epoch_Time:24256.0min: [2024-01-06 04:05:13,012][model8_pretrain.py][INFO] Epoch:[0/2](777100/4588595) loss:2.899 lr:0.0000100 epoch_Time:24256.0min: [2024-01-06 04:05:13,012][model8_pretrain.py][INFO] Epoch:[0/2](777100/4588595) loss:3.361 lr:0.0000100 epoch_Time:24256.0min: [2024-01-06 04:05:13,012][model8_pretrain.py][INFO] Epoch:[0/2](777100/4588595) loss:2.832 lr:0.0000100 
epoch_Time:24256.0min: [2024-01-06 04:05:49,964][model8_pretrain.py][INFO] Epoch:[0/2](777200/4588595) loss:2.834 lr:0.0000100 epoch_Time:24255.0min: [2024-01-06 04:05:49,964][model8_pretrain.py][INFO] Epoch:[0/2](777200/4588595) loss:2.941 lr:0.0000100 epoch_Time:24255.0min: [2024-01-06 04:05:49,964][model8_pretrain.py][INFO] Epoch:[0/2](777200/4588595) loss:2.406 lr:0.0000100 epoch_Time:24255.0min: [2024-01-06 04:05:49,964][model8_pretrain.py][INFO] Epoch:[0/2](777200/4588595) loss:2.390 lr:0.0000100 epoch_Time:24255.0min: [2024-01-06 04:05:49,964][model8_pretrain.py][INFO] Epoch:[0/2](777200/4588595) loss:2.809 lr:0.0000100 epoch_Time:24255.0min: [2024-01-06 04:05:49,964][model8_pretrain.py][INFO] Epoch:[0/2](777200/4588595) loss:2.465 lr:0.0000100 epoch_Time:24255.0min: [2024-01-06 04:05:49,964][model8_pretrain.py][INFO] Epoch:[0/2](777200/4588595) loss:3.097 lr:0.0000100 epoch_Time:24255.0min: [2024-01-06 04:05:49,964][model8_pretrain.py][INFO] Epoch:[0/2](777200/4588595) loss:2.904 lr:0.0000100 epoch_Time:24255.0min: [2024-01-06 04:06:26,919][model8_pretrain.py][INFO] Epoch:[0/2](777300/4588595) loss:3.385 lr:0.0000100 epoch_Time:24255.0min: [2024-01-06 04:06:26,919][model8_pretrain.py][INFO] Epoch:[0/2](777300/4588595) loss:2.782 lr:0.0000100 epoch_Time:24255.0min: [2024-01-06 04:06:26,919][model8_pretrain.py][INFO] Epoch:[0/2](777300/4588595) loss:2.434 lr:0.0000100 epoch_Time:24255.0min: [2024-01-06 04:06:26,919][model8_pretrain.py][INFO] Epoch:[0/2](777300/4588595) loss:2.863 lr:0.0000100 epoch_Time:24255.0min: [2024-01-06 04:06:26,919][model8_pretrain.py][INFO] Epoch:[0/2](777300/4588595) loss:2.736 lr:0.0000100 epoch_Time:24255.0min: [2024-01-06 04:06:26,919][model8_pretrain.py][INFO] Epoch:[0/2](777300/4588595) loss:3.236 lr:0.0000100 epoch_Time:24255.0min: [2024-01-06 04:06:26,919][model8_pretrain.py][INFO] Epoch:[0/2](777300/4588595) loss:2.796 lr:0.0000100 epoch_Time:24255.0min: [2024-01-06 04:06:26,919][model8_pretrain.py][INFO] 
Epoch:[0/2](777300/4588595) loss:2.506 lr:0.0000100 epoch_Time:24255.0min: [2024-01-06 04:07:03,897][model8_pretrain.py][INFO] Epoch:[0/2](777400/4588595) loss:2.367 lr:0.0000100 epoch_Time:24254.0min: [2024-01-06 04:07:03,897][model8_pretrain.py][INFO] Epoch:[0/2](777400/4588595) loss:2.511 lr:0.0000100 epoch_Time:24254.0min: [2024-01-06 04:07:03,897][model8_pretrain.py][INFO] Epoch:[0/2](777400/4588595) loss:3.071 lr:0.0000100 epoch_Time:24254.0min: [2024-01-06 04:07:03,897][model8_pretrain.py][INFO] Epoch:[0/2](777400/4588595) loss:2.839 lr:0.0000100 epoch_Time:24254.0min: [2024-01-06 04:07:03,897][model8_pretrain.py][INFO] Epoch:[0/2](777400/4588595) loss:2.969 lr:0.0000100 epoch_Time:24254.0min: [2024-01-06 04:07:03,897][model8_pretrain.py][INFO] Epoch:[0/2](777400/4588595) loss:2.158 lr:0.0000100 epoch_Time:24254.0min: [2024-01-06 04:07:03,897][model8_pretrain.py][INFO] Epoch:[0/2](777400/4588595) loss:2.755 lr:0.0000100 epoch_Time:24254.0min: [2024-01-06 04:07:03,897][model8_pretrain.py][INFO] Epoch:[0/2](777400/4588595) loss:2.711 lr:0.0000100 epoch_Time:24254.0min: [2024-01-06 04:07:40,877][model8_pretrain.py][INFO] Epoch:[0/2](777500/4588595) loss:2.859 lr:0.0000100 epoch_Time:24254.0min: [2024-01-06 04:07:40,877][model8_pretrain.py][INFO] Epoch:[0/2](777500/4588595) loss:2.983 lr:0.0000100 epoch_Time:24254.0min: [2024-01-06 04:07:40,877][model8_pretrain.py][INFO] Epoch:[0/2](777500/4588595) loss:2.263 lr:0.0000100 epoch_Time:24254.0min: [2024-01-06 04:07:40,878][model8_pretrain.py][INFO] Epoch:[0/2](777500/4588595) loss:2.630 lr:0.0000100 epoch_Time:24254.0min: [2024-01-06 04:07:40,877][model8_pretrain.py][INFO] Epoch:[0/2](777500/4588595) loss:3.209 lr:0.0000100 epoch_Time:24254.0min: [2024-01-06 04:07:40,877][model8_pretrain.py][INFO] Epoch:[0/2](777500/4588595) loss:2.769 lr:0.0000100 epoch_Time:24254.0min: [2024-01-06 04:07:40,877][model8_pretrain.py][INFO] Epoch:[0/2](777500/4588595) loss:2.507 lr:0.0000100 epoch_Time:24254.0min: [2024-01-06 
04:07:40,877][model8_pretrain.py][INFO] Epoch:[0/2](777500/4588595) loss:2.390 lr:0.0000100 epoch_Time:24254.0min: [2024-01-06 04:08:17,841][model8_pretrain.py][INFO] Epoch:[0/2](777600/4588595) loss:2.505 lr:0.0000100 epoch_Time:24252.0min: [2024-01-06 04:08:17,841][model8_pretrain.py][INFO] Epoch:[0/2](777600/4588595) loss:2.315 lr:0.0000100 epoch_Time:24252.0min: [2024-01-06 04:08:17,841][model8_pretrain.py][INFO] Epoch:[0/2](777600/4588595) loss:2.811 lr:0.0000100 epoch_Time:24252.0min: [2024-01-06 04:08:17,841][model8_pretrain.py][INFO] Epoch:[0/2](777600/4588595) loss:3.290 lr:0.0000100 epoch_Time:24252.0min: [2024-01-06 04:08:17,841][model8_pretrain.py][INFO] Epoch:[0/2](777600/4588595) loss:3.121 lr:0.0000100 epoch_Time:24252.0min: [2024-01-06 04:08:17,841][model8_pretrain.py][INFO] Epoch:[0/2](777600/4588595) loss:2.717 lr:0.0000100 epoch_Time:24252.0min: [2024-01-06 04:08:17,841][model8_pretrain.py][INFO] Epoch:[0/2](777600/4588595) loss:2.582 lr:0.0000100 epoch_Time:24252.0min: [2024-01-06 04:08:17,841][model8_pretrain.py][INFO] Epoch:[0/2](777600/4588595) loss:2.338 lr:0.0000100 epoch_Time:24252.0min: [2024-01-06 04:08:54,790][model8_pretrain.py][INFO] Epoch:[0/2](777700/4588595) loss:2.831 lr:0.0000100 epoch_Time:24251.0min: [2024-01-06 04:08:54,790][model8_pretrain.py][INFO] Epoch:[0/2](777700/4588595) loss:2.575 lr:0.0000100 epoch_Time:24251.0min: [2024-01-06 04:08:54,790][model8_pretrain.py][INFO] Epoch:[0/2](777700/4588595) loss:2.426 lr:0.0000100 epoch_Time:24251.0min: [2024-01-06 04:08:54,790][model8_pretrain.py][INFO] Epoch:[0/2](777700/4588595) loss:2.729 lr:0.0000100 epoch_Time:24251.0min: [2024-01-06 04:08:54,790][model8_pretrain.py][INFO] Epoch:[0/2](777700/4588595) loss:3.036 lr:0.0000100 epoch_Time:24251.0min: [2024-01-06 04:08:54,790][model8_pretrain.py][INFO] Epoch:[0/2](777700/4588595) loss:2.784 lr:0.0000100 epoch_Time:24251.0min: [2024-01-06 04:08:54,790][model8_pretrain.py][INFO] Epoch:[0/2](777700/4588595) loss:2.903 lr:0.0000100 
epoch_Time:24251.0min: [2024-01-06 04:08:54,790][model8_pretrain.py][INFO] Epoch:[0/2](777700/4588595) loss:2.529 lr:0.0000100 epoch_Time:24251.0min: [2024-01-06 04:09:34,799][model8_pretrain.py][INFO] Epoch:[0/2](777800/4588595) loss:2.730 lr:0.0000100 epoch_Time:24251.0min: [2024-01-06 04:09:34,799][model8_pretrain.py][INFO] Epoch:[0/2](777800/4588595) loss:2.871 lr:0.0000100 epoch_Time:24251.0min: [2024-01-06 04:09:34,799][model8_pretrain.py][INFO] Epoch:[0/2](777800/4588595) loss:3.244 lr:0.0000100 epoch_Time:24252.0min: [2024-01-06 04:09:34,799][model8_pretrain.py][INFO] Epoch:[0/2](777800/4588595) loss:2.963 lr:0.0000100 epoch_Time:24251.0min: [2024-01-06 04:09:34,799][model8_pretrain.py][INFO] Epoch:[0/2](777800/4588595) loss:3.316 lr:0.0000100 epoch_Time:24251.0min: [2024-01-06 04:09:34,799][model8_pretrain.py][INFO] Epoch:[0/2](777800/4588595) loss:2.468 lr:0.0000100 epoch_Time:24251.0min: [2024-01-06 04:09:34,799][model8_pretrain.py][INFO] Epoch:[0/2](777800/4588595) loss:2.978 lr:0.0000100 epoch_Time:24252.0min: [2024-01-06 04:09:34,799][model8_pretrain.py][INFO] Epoch:[0/2](777800/4588595) loss:2.818 lr:0.0000100 epoch_Time:24251.0min: [2024-01-06 04:10:18,781][model8_pretrain.py][INFO] Epoch:[0/2](777900/4588595) loss:3.164 lr:0.0000100 epoch_Time:24251.0min: [2024-01-06 04:10:18,781][model8_pretrain.py][INFO] Epoch:[0/2](777900/4588595) loss:2.725 lr:0.0000100 epoch_Time:24251.0min: [2024-01-06 04:10:18,782][model8_pretrain.py][INFO] Epoch:[0/2](777900/4588595) loss:2.543 lr:0.0000100 epoch_Time:24251.0min: [2024-01-06 04:10:18,782][model8_pretrain.py][INFO] Epoch:[0/2](777900/4588595) loss:2.910 lr:0.0000100 epoch_Time:24251.0min: [2024-01-06 04:10:18,782][model8_pretrain.py][INFO] Epoch:[0/2](777900/4588595) loss:2.637 lr:0.0000100 epoch_Time:24251.0min: [2024-01-06 04:10:18,782][model8_pretrain.py][INFO] Epoch:[0/2](777900/4588595) loss:2.921 lr:0.0000100 epoch_Time:24251.0min: [2024-01-06 04:10:18,782][model8_pretrain.py][INFO] 
Epoch:[0/2](777900/4588595) loss:2.789 lr:0.0000100 epoch_Time:24251.0min: [2024-01-06 04:10:18,782][model8_pretrain.py][INFO] Epoch:[0/2](777900/4588595) loss:2.365 lr:0.0000100 epoch_Time:24251.0min: [2024-01-06 04:10:55,721][model8_pretrain.py][INFO] Epoch:[0/2](778000/4588595) loss:3.165 lr:0.0000100 epoch_Time:24250.0min: [2024-01-06 04:10:55,722][model8_pretrain.py][INFO] Epoch:[0/2](778000/4588595) loss:2.632 lr:0.0000100 epoch_Time:24250.0min: [2024-01-06 04:10:55,722][model8_pretrain.py][INFO] Epoch:[0/2](778000/4588595) loss:2.988 lr:0.0000100 epoch_Time:24250.0min: [2024-01-06 04:10:55,722][model8_pretrain.py][INFO] Epoch:[0/2](778000/4588595) loss:3.286 lr:0.0000100 epoch_Time:24250.0min: [2024-01-06 04:10:55,722][model8_pretrain.py][INFO] Epoch:[0/2](778000/4588595) loss:2.476 lr:0.0000100 epoch_Time:24250.0min: [2024-01-06 04:10:55,722][model8_pretrain.py][INFO] Epoch:[0/2](778000/4588595) loss:2.753 lr:0.0000100 epoch_Time:24250.0min: [2024-01-06 04:10:55,722][model8_pretrain.py][INFO] Epoch:[0/2](778000/4588595) loss:2.446 lr:0.0000100 epoch_Time:24250.0min: [2024-01-06 04:10:55,722][model8_pretrain.py][INFO] Epoch:[0/2](778000/4588595) loss:2.741 lr:0.0000100 epoch_Time:24250.0min: [2024-01-06 04:11:32,674][model8_pretrain.py][INFO] Epoch:[0/2](778100/4588595) loss:2.534 lr:0.0000100 epoch_Time:24250.0min: [2024-01-06 04:11:32,674][model8_pretrain.py][INFO] Epoch:[0/2](778100/4588595) loss:2.897 lr:0.0000100 epoch_Time:24250.0min: [2024-01-06 04:11:32,674][model8_pretrain.py][INFO] Epoch:[0/2](778100/4588595) loss:3.337 lr:0.0000100 epoch_Time:24250.0min: [2024-01-06 04:11:32,674][model8_pretrain.py][INFO] Epoch:[0/2](778100/4588595) loss:2.864 lr:0.0000100 epoch_Time:24250.0min: [2024-01-06 04:11:32,674][model8_pretrain.py][INFO] Epoch:[0/2](778100/4588595) loss:3.196 lr:0.0000100 epoch_Time:24250.0min: [2024-01-06 04:11:32,674][model8_pretrain.py][INFO] Epoch:[0/2](778100/4588595) loss:2.645 lr:0.0000100 epoch_Time:24250.0min: [2024-01-06 
04:11:32,674][model8_pretrain.py][INFO] Epoch:[0/2](778100/4588595) loss:2.648 lr:0.0000100 epoch_Time:24250.0min: [2024-01-06 04:11:32,674][model8_pretrain.py][INFO] Epoch:[0/2](778100/4588595) loss:2.877 lr:0.0000100 epoch_Time:24250.0min: [2024-01-06 04:12:09,620][model8_pretrain.py][INFO] Epoch:[0/2](778200/4588595) loss:2.970 lr:0.0000100 epoch_Time:24249.0min: [2024-01-06 04:12:09,620][model8_pretrain.py][INFO] Epoch:[0/2](778200/4588595) loss:3.190 lr:0.0000100 epoch_Time:24249.0min: [2024-01-06 04:12:09,620][model8_pretrain.py][INFO] Epoch:[0/2](778200/4588595) loss:2.641 lr:0.0000100 epoch_Time:24249.0min: [2024-01-06 04:12:09,620][model8_pretrain.py][INFO] Epoch:[0/2](778200/4588595) loss:2.713 lr:0.0000100 epoch_Time:24249.0min: [2024-01-06 04:12:09,620][model8_pretrain.py][INFO] Epoch:[0/2](778200/4588595) loss:3.183 lr:0.0000100 epoch_Time:24249.0min: [2024-01-06 04:12:09,621][model8_pretrain.py][INFO] Epoch:[0/2](778200/4588595) loss:3.474 lr:0.0000100 epoch_Time:24249.0min: [2024-01-06 04:12:09,621][model8_pretrain.py][INFO] Epoch:[0/2](778200/4588595) loss:2.881 lr:0.0000100 epoch_Time:24249.0min: [2024-01-06 04:12:09,621][model8_pretrain.py][INFO] Epoch:[0/2](778200/4588595) loss:2.942 lr:0.0000100 epoch_Time:24249.0min: [2024-01-06 04:12:46,577][model8_pretrain.py][INFO] Epoch:[0/2](778300/4588595) loss:2.595 lr:0.0000100 epoch_Time:24249.0min: [2024-01-06 04:12:46,577][model8_pretrain.py][INFO] Epoch:[0/2](778300/4588595) loss:2.868 lr:0.0000100 epoch_Time:24249.0min: [2024-01-06 04:12:46,577][model8_pretrain.py][INFO] Epoch:[0/2](778300/4588595) loss:2.865 lr:0.0000100 epoch_Time:24249.0min: [2024-01-06 04:12:46,577][model8_pretrain.py][INFO] Epoch:[0/2](778300/4588595) loss:3.046 lr:0.0000100 epoch_Time:24249.0min: [2024-01-06 04:12:46,577][model8_pretrain.py][INFO] Epoch:[0/2](778300/4588595) loss:2.474 lr:0.0000100 epoch_Time:24249.0min: [2024-01-06 04:12:46,577][model8_pretrain.py][INFO] Epoch:[0/2](778300/4588595) loss:2.135 lr:0.0000100 
epoch_Time:24249.0min: [2024-01-06 04:12:46,577][model8_pretrain.py][INFO] Epoch:[0/2](778300/4588595) loss:2.712 lr:0.0000100 epoch_Time:24249.0min: [2024-01-06 04:12:46,577][model8_pretrain.py][INFO] Epoch:[0/2](778300/4588595) loss:3.473 lr:0.0000100 epoch_Time:24249.0min: [2024-01-06 04:13:23,533][model8_pretrain.py][INFO] Epoch:[0/2](778400/4588595) loss:2.742 lr:0.0000100 epoch_Time:24247.0min: [2024-01-06 04:13:23,533][model8_pretrain.py][INFO] Epoch:[0/2](778400/4588595) loss:2.767 lr:0.0000100 epoch_Time:24247.0min: [2024-01-06 04:13:23,533][model8_pretrain.py][INFO] Epoch:[0/2](778400/4588595) loss:2.529 lr:0.0000100 epoch_Time:24247.0min: [2024-01-06 04:13:23,533][model8_pretrain.py][INFO] Epoch:[0/2](778400/4588595) loss:2.708 lr:0.0000100 epoch_Time:24247.0min: [2024-01-06 04:13:23,533][model8_pretrain.py][INFO] Epoch:[0/2](778400/4588595) loss:2.851 lr:0.0000100 epoch_Time:24247.0min: [2024-01-06 04:13:23,533][model8_pretrain.py][INFO] Epoch:[0/2](778400/4588595) loss:2.653 lr:0.0000100 epoch_Time:24247.0min: [2024-01-06 04:13:23,533][model8_pretrain.py][INFO] Epoch:[0/2](778400/4588595) loss:2.507 lr:0.0000100 epoch_Time:24247.0min: [2024-01-06 04:13:23,533][model8_pretrain.py][INFO] Epoch:[0/2](778400/4588595) loss:2.713 lr:0.0000100 epoch_Time:24247.0min: [2024-01-06 04:14:00,480][model8_pretrain.py][INFO] Epoch:[0/2](778500/4588595) loss:2.846 lr:0.0000100 epoch_Time:24246.0min: [2024-01-06 04:14:00,480][model8_pretrain.py][INFO] Epoch:[0/2](778500/4588595) loss:3.222 lr:0.0000100 epoch_Time:24246.0min: [2024-01-06 04:14:00,480][model8_pretrain.py][INFO] Epoch:[0/2](778500/4588595) loss:2.729 lr:0.0000100 epoch_Time:24246.0min: [2024-01-06 04:14:00,480][model8_pretrain.py][INFO] Epoch:[0/2](778500/4588595) loss:2.927 lr:0.0000100 epoch_Time:24246.0min: [2024-01-06 04:14:00,480][model8_pretrain.py][INFO] Epoch:[0/2](778500/4588595) loss:2.785 lr:0.0000100 epoch_Time:24246.0min: [2024-01-06 04:14:00,480][model8_pretrain.py][INFO] 
Epoch:[0/2](778500/4588595) loss:2.583 lr:0.0000100 epoch_Time:24246.0min: [2024-01-06 04:14:00,480][model8_pretrain.py][INFO] Epoch:[0/2](778500/4588595) loss:3.055 lr:0.0000100 epoch_Time:24246.0min: [2024-01-06 04:14:00,480][model8_pretrain.py][INFO] Epoch:[0/2](778500/4588595) loss:3.078 lr:0.0000100 epoch_Time:24246.0min: [2024-01-06 04:14:39,121][model8_pretrain.py][INFO] Epoch:[0/2](778600/4588595) loss:3.287 lr:0.0000100 epoch_Time:24246.0min: [2024-01-06 04:14:39,121][model8_pretrain.py][INFO] Epoch:[0/2](778600/4588595) loss:3.349 lr:0.0000100 epoch_Time:24246.0min: [2024-01-06 04:14:39,121][model8_pretrain.py][INFO] Epoch:[0/2](778600/4588595) loss:2.839 lr:0.0000100 epoch_Time:24246.0min: [2024-01-06 04:14:39,121][model8_pretrain.py][INFO] Epoch:[0/2](778600/4588595) loss:2.867 lr:0.0000100 epoch_Time:24246.0min: [2024-01-06 04:14:39,121][model8_pretrain.py][INFO] Epoch:[0/2](778600/4588595) loss:2.764 lr:0.0000100 epoch_Time:24246.0min: [2024-01-06 04:14:39,121][model8_pretrain.py][INFO] Epoch:[0/2](778600/4588595) loss:2.875 lr:0.0000100 epoch_Time:24246.0min: [2024-01-06 04:14:39,121][model8_pretrain.py][INFO] Epoch:[0/2](778600/4588595) loss:2.603 lr:0.0000100 epoch_Time:24246.0min: [2024-01-06 04:14:39,121][model8_pretrain.py][INFO] Epoch:[0/2](778600/4588595) loss:2.247 lr:0.0000100 epoch_Time:24246.0min: [2024-01-06 04:15:24,689][model8_pretrain.py][INFO] Epoch:[0/2](778700/4588595) loss:2.824 lr:0.0000100 epoch_Time:24246.0min: [2024-01-06 04:15:24,690][model8_pretrain.py][INFO] Epoch:[0/2](778700/4588595) loss:2.703 lr:0.0000100 epoch_Time:24246.0min: [2024-01-06 04:15:24,690][model8_pretrain.py][INFO] Epoch:[0/2](778700/4588595) loss:3.027 lr:0.0000100 epoch_Time:24246.0min: [2024-01-06 04:15:24,690][model8_pretrain.py][INFO] Epoch:[0/2](778700/4588595) loss:3.097 lr:0.0000100 epoch_Time:24246.0min: [2024-01-06 04:15:24,690][model8_pretrain.py][INFO] Epoch:[0/2](778700/4588595) loss:2.840 lr:0.0000100 epoch_Time:24246.0min: [2024-01-06 
04:15:24,690][model8_pretrain.py][INFO] Epoch:[0/2](778700/4588595) loss:1.983 lr:0.0000100 epoch_Time:24246.0min: [2024-01-06 04:15:24,690][model8_pretrain.py][INFO] Epoch:[0/2](778700/4588595) loss:3.532 lr:0.0000100 epoch_Time:24246.0min: [2024-01-06 04:15:24,690][model8_pretrain.py][INFO] Epoch:[0/2](778700/4588595) loss:2.075 lr:0.0000100 epoch_Time:24246.0min: [2024-01-06 04:16:01,616][model8_pretrain.py][INFO] Epoch:[0/2](778800/4588595) loss:2.624 lr:0.0000100 epoch_Time:24245.0min: [2024-01-06 04:16:01,616][model8_pretrain.py][INFO] Epoch:[0/2](778800/4588595) loss:3.091 lr:0.0000100 epoch_Time:24245.0min: [2024-01-06 04:16:01,616][model8_pretrain.py][INFO] Epoch:[0/2](778800/4588595) loss:3.258 lr:0.0000100 epoch_Time:24245.0min: [2024-01-06 04:16:01,616][model8_pretrain.py][INFO] Epoch:[0/2](778800/4588595) loss:2.531 lr:0.0000100 epoch_Time:24245.0min: [2024-01-06 04:16:01,616][model8_pretrain.py][INFO] Epoch:[0/2](778800/4588595) loss:2.840 lr:0.0000100 epoch_Time:24245.0min: [2024-01-06 04:16:01,616][model8_pretrain.py][INFO] Epoch:[0/2](778800/4588595) loss:2.853 lr:0.0000100 epoch_Time:24245.0min: [2024-01-06 04:16:01,616][model8_pretrain.py][INFO] Epoch:[0/2](778800/4588595) loss:2.560 lr:0.0000100 epoch_Time:24245.0min: [2024-01-06 04:16:01,616][model8_pretrain.py][INFO] Epoch:[0/2](778800/4588595) loss:2.768 lr:0.0000100 epoch_Time:24245.0min: [2024-01-06 04:16:38,561][model8_pretrain.py][INFO] Epoch:[0/2](778900/4588595) loss:2.996 lr:0.0000100 epoch_Time:24245.0min: [2024-01-06 04:16:38,561][model8_pretrain.py][INFO] Epoch:[0/2](778900/4588595) loss:2.998 lr:0.0000100 epoch_Time:24245.0min: [2024-01-06 04:16:38,561][model8_pretrain.py][INFO] Epoch:[0/2](778900/4588595) loss:2.620 lr:0.0000100 epoch_Time:24245.0min: [2024-01-06 04:16:38,561][model8_pretrain.py][INFO] Epoch:[0/2](778900/4588595) loss:2.145 lr:0.0000100 epoch_Time:24245.0min: [2024-01-06 04:16:38,561][model8_pretrain.py][INFO] Epoch:[0/2](778900/4588595) loss:2.939 lr:0.0000100 
epoch_Time:24245.0min: [2024-01-06 04:16:38,561][model8_pretrain.py][INFO] Epoch:[0/2](778900/4588595) loss:2.666 lr:0.0000100 epoch_Time:24245.0min: [2024-01-06 04:16:38,561][model8_pretrain.py][INFO] Epoch:[0/2](778900/4588595) loss:2.797 lr:0.0000100 epoch_Time:24245.0min: [2024-01-06 04:16:38,561][model8_pretrain.py][INFO] Epoch:[0/2](778900/4588595) loss:2.995 lr:0.0000100 epoch_Time:24245.0min: [2024-01-06 04:17:15,507][model8_pretrain.py][INFO] Epoch:[0/2](779000/4588595) loss:3.078 lr:0.0000100 epoch_Time:24244.0min: [2024-01-06 04:17:15,507][model8_pretrain.py][INFO] Epoch:[0/2](779000/4588595) loss:2.054 lr:0.0000100 epoch_Time:24244.0min: [2024-01-06 04:17:15,507][model8_pretrain.py][INFO] Epoch:[0/2](779000/4588595) loss:2.407 lr:0.0000100 epoch_Time:24244.0min: [2024-01-06 04:17:15,507][model8_pretrain.py][INFO] Epoch:[0/2](779000/4588595) loss:2.868 lr:0.0000100 epoch_Time:24244.0min: [2024-01-06 04:17:15,507][model8_pretrain.py][INFO] Epoch:[0/2](779000/4588595) loss:2.993 lr:0.0000100 epoch_Time:24244.0min: [2024-01-06 04:17:15,507][model8_pretrain.py][INFO] Epoch:[0/2](779000/4588595) loss:2.393 lr:0.0000100 epoch_Time:24244.0min: [2024-01-06 04:17:15,507][model8_pretrain.py][INFO] Epoch:[0/2](779000/4588595) loss:3.408 lr:0.0000100 epoch_Time:24244.0min: [2024-01-06 04:17:15,508][model8_pretrain.py][INFO] Epoch:[0/2](779000/4588595) loss:3.132 lr:0.0000100 epoch_Time:24244.0min: [2024-01-06 04:17:52,446][model8_pretrain.py][INFO] Epoch:[0/2](779100/4588595) loss:2.343 lr:0.0000100 epoch_Time:24243.0min: [2024-01-06 04:17:52,446][model8_pretrain.py][INFO] Epoch:[0/2](779100/4588595) loss:3.014 lr:0.0000100 epoch_Time:24243.0min: [2024-01-06 04:17:52,446][model8_pretrain.py][INFO] Epoch:[0/2](779100/4588595) loss:3.017 lr:0.0000100 epoch_Time:24243.0min: [2024-01-06 04:17:52,446][model8_pretrain.py][INFO] Epoch:[0/2](779100/4588595) loss:2.657 lr:0.0000100 epoch_Time:24243.0min: [2024-01-06 04:17:52,446][model8_pretrain.py][INFO] 
Epoch:[0/2](779100/4588595) loss:2.346 lr:0.0000100 epoch_Time:24243.0min: [2024-01-06 04:17:52,446][model8_pretrain.py][INFO] Epoch:[0/2](779100/4588595) loss:2.951 lr:0.0000100 epoch_Time:24243.0min: [2024-01-06 04:17:52,446][model8_pretrain.py][INFO] Epoch:[0/2](779100/4588595) loss:2.576 lr:0.0000100 epoch_Time:24243.0min: [2024-01-06 04:17:52,446][model8_pretrain.py][INFO] Epoch:[0/2](779100/4588595) loss:2.431 lr:0.0000100 epoch_Time:24243.0min: [2024-01-06 04:18:29,395][model8_pretrain.py][INFO] Epoch:[0/2](779200/4588595) loss:2.884 lr:0.0000100 epoch_Time:24243.0min: [2024-01-06 04:18:29,395][model8_pretrain.py][INFO] Epoch:[0/2](779200/4588595) loss:2.927 lr:0.0000100 epoch_Time:24243.0min: [2024-01-06 04:18:29,395][model8_pretrain.py][INFO] Epoch:[0/2](779200/4588595) loss:3.034 lr:0.0000100 epoch_Time:24243.0min: [2024-01-06 04:18:29,395][model8_pretrain.py][INFO] Epoch:[0/2](779200/4588595) loss:2.941 lr:0.0000100 epoch_Time:24243.0min: [2024-01-06 04:18:29,395][model8_pretrain.py][INFO] Epoch:[0/2](779200/4588595) loss:3.126 lr:0.0000100 epoch_Time:24243.0min: [2024-01-06 04:18:29,395][model8_pretrain.py][INFO] Epoch:[0/2](779200/4588595) loss:2.721 lr:0.0000100 epoch_Time:24243.0min: [2024-01-06 04:18:29,395][model8_pretrain.py][INFO] Epoch:[0/2](779200/4588595) loss:3.048 lr:0.0000100 epoch_Time:24243.0min: [2024-01-06 04:18:29,395][model8_pretrain.py][INFO] Epoch:[0/2](779200/4588595) loss:2.753 lr:0.0000100 epoch_Time:24243.0min: [2024-01-06 04:19:06,334][model8_pretrain.py][INFO] Epoch:[0/2](779300/4588595) loss:1.809 lr:0.0000100 epoch_Time:24241.0min: [2024-01-06 04:19:06,334][model8_pretrain.py][INFO] Epoch:[0/2](779300/4588595) loss:2.887 lr:0.0000100 epoch_Time:24241.0min: [2024-01-06 04:19:06,334][model8_pretrain.py][INFO] Epoch:[0/2](779300/4588595) loss:2.139 lr:0.0000100 epoch_Time:24241.0min: [2024-01-06 04:19:06,334][model8_pretrain.py][INFO] Epoch:[0/2](779300/4588595) loss:2.926 lr:0.0000100 epoch_Time:24241.0min: [2024-01-06 
04:19:06,334][model8_pretrain.py][INFO] Epoch:[0/2](779300/4588595) loss:2.693 lr:0.0000100 epoch_Time:24241.0min: [2024-01-06 04:19:06,334][model8_pretrain.py][INFO] Epoch:[0/2](779300/4588595) loss:3.325 lr:0.0000100 epoch_Time:24241.0min: [2024-01-06 04:19:06,334][model8_pretrain.py][INFO] Epoch:[0/2](779300/4588595) loss:2.498 lr:0.0000100 epoch_Time:24241.0min: [2024-01-06 04:19:06,335][model8_pretrain.py][INFO] Epoch:[0/2](779300/4588595) loss:3.117 lr:0.0000100 epoch_Time:24241.0min: [2024-01-06 04:19:43,275][model8_pretrain.py][INFO] Epoch:[0/2](779400/4588595) loss:3.077 lr:0.0000100 epoch_Time:24241.0min: [2024-01-06 04:19:43,275][model8_pretrain.py][INFO] Epoch:[0/2](779400/4588595) loss:2.329 lr:0.0000100 epoch_Time:24241.0min: [2024-01-06 04:19:43,275][model8_pretrain.py][INFO] Epoch:[0/2](779400/4588595) loss:2.589 lr:0.0000100 epoch_Time:24241.0min: [2024-01-06 04:19:43,275][model8_pretrain.py][INFO] Epoch:[0/2](779400/4588595) loss:2.495 lr:0.0000100 epoch_Time:24241.0min: [2024-01-06 04:19:43,275][model8_pretrain.py][INFO] Epoch:[0/2](779400/4588595) loss:2.549 lr:0.0000100 epoch_Time:24241.0min: [2024-01-06 04:19:43,275][model8_pretrain.py][INFO] Epoch:[0/2](779400/4588595) loss:3.371 lr:0.0000100 epoch_Time:24241.0min: [2024-01-06 04:19:43,275][model8_pretrain.py][INFO] Epoch:[0/2](779400/4588595) loss:2.675 lr:0.0000100 epoch_Time:24241.0min: [2024-01-06 04:19:43,275][model8_pretrain.py][INFO] Epoch:[0/2](779400/4588595) loss:2.925 lr:0.0000100 epoch_Time:24241.0min: [2024-01-06 04:20:30,782][model8_pretrain.py][INFO] Epoch:[0/2](779500/4588595) loss:2.982 lr:0.0000100 epoch_Time:24241.0min: [2024-01-06 04:20:30,782][model8_pretrain.py][INFO] Epoch:[0/2](779500/4588595) loss:2.131 lr:0.0000100 epoch_Time:24241.0min: [2024-01-06 04:20:30,782][model8_pretrain.py][INFO] Epoch:[0/2](779500/4588595) loss:2.536 lr:0.0000100 epoch_Time:24241.0min: [2024-01-06 04:20:30,782][model8_pretrain.py][INFO] Epoch:[0/2](779500/4588595) loss:2.827 lr:0.0000100 
epoch_Time:24241.0min: [2024-01-06 04:20:30,782][model8_pretrain.py][INFO] Epoch:[0/2](779500/4588595) loss:3.045 lr:0.0000100 epoch_Time:24241.0min: [2024-01-06 04:20:30,782][model8_pretrain.py][INFO] Epoch:[0/2](779500/4588595) loss:2.613 lr:0.0000100 epoch_Time:24241.0min: [2024-01-06 04:20:30,782][model8_pretrain.py][INFO] Epoch:[0/2](779500/4588595) loss:1.969 lr:0.0000100 epoch_Time:24241.0min: [2024-01-06 04:20:30,782][model8_pretrain.py][INFO] Epoch:[0/2](779500/4588595) loss:2.514 lr:0.0000100 epoch_Time:24241.0min: [2024-01-06 04:21:07,718][model8_pretrain.py][INFO] Epoch:[0/2](779600/4588595) loss:2.508 lr:0.0000100 epoch_Time:24240.0min: [2024-01-06 04:21:07,718][model8_pretrain.py][INFO] Epoch:[0/2](779600/4588595) loss:2.499 lr:0.0000100 epoch_Time:24240.0min: [2024-01-06 04:21:07,718][model8_pretrain.py][INFO] Epoch:[0/2](779600/4588595) loss:3.197 lr:0.0000100 epoch_Time:24240.0min: [2024-01-06 04:21:07,718][model8_pretrain.py][INFO] Epoch:[0/2](779600/4588595) loss:3.200 lr:0.0000100 epoch_Time:24240.0min: [2024-01-06 04:21:07,718][model8_pretrain.py][INFO] Epoch:[0/2](779600/4588595) loss:2.679 lr:0.0000100 epoch_Time:24240.0min: [2024-01-06 04:21:07,718][model8_pretrain.py][INFO] Epoch:[0/2](779600/4588595) loss:2.991 lr:0.0000100 epoch_Time:24240.0min: [2024-01-06 04:21:07,718][model8_pretrain.py][INFO] Epoch:[0/2](779600/4588595) loss:2.896 lr:0.0000100 epoch_Time:24240.0min: [2024-01-06 04:21:07,719][model8_pretrain.py][INFO] Epoch:[0/2](779600/4588595) loss:2.861 lr:0.0000100 epoch_Time:24240.0min: [2024-01-06 04:21:44,665][model8_pretrain.py][INFO] Epoch:[0/2](779700/4588595) loss:2.507 lr:0.0000100 epoch_Time:24240.0min: [2024-01-06 04:21:44,665][model8_pretrain.py][INFO] Epoch:[0/2](779700/4588595) loss:3.027 lr:0.0000100 epoch_Time:24240.0min: [2024-01-06 04:21:44,665][model8_pretrain.py][INFO] Epoch:[0/2](779700/4588595) loss:1.950 lr:0.0000100 epoch_Time:24240.0min: [2024-01-06 04:21:44,665][model8_pretrain.py][INFO] 
Epoch:[0/2](779700/4588595) loss:3.060 lr:0.0000100 epoch_Time:24240.0min: [2024-01-06 04:21:44,665][model8_pretrain.py][INFO] Epoch:[0/2](779700/4588595) loss:3.118 lr:0.0000100 epoch_Time:24240.0min: [2024-01-06 04:21:44,665][model8_pretrain.py][INFO] Epoch:[0/2](779700/4588595) loss:2.717 lr:0.0000100 epoch_Time:24240.0min: [2024-01-06 04:21:44,665][model8_pretrain.py][INFO] Epoch:[0/2](779700/4588595) loss:3.069 lr:0.0000100 epoch_Time:24240.0min: [2024-01-06 04:21:44,665][model8_pretrain.py][INFO] Epoch:[0/2](779700/4588595) loss:3.101 lr:0.0000100 epoch_Time:24240.0min: [2024-01-06 04:22:21,606][model8_pretrain.py][INFO] Epoch:[0/2](779800/4588595) loss:2.880 lr:0.0000100 epoch_Time:24239.0min: [2024-01-06 04:22:21,606][model8_pretrain.py][INFO] Epoch:[0/2](779800/4588595) loss:2.985 lr:0.0000100 epoch_Time:24239.0min: [2024-01-06 04:22:21,606][model8_pretrain.py][INFO] Epoch:[0/2](779800/4588595) loss:3.007 lr:0.0000100 epoch_Time:24239.0min: [2024-01-06 04:22:21,606][model8_pretrain.py][INFO] Epoch:[0/2](779800/4588595) loss:3.469 lr:0.0000100 epoch_Time:24239.0min: [2024-01-06 04:22:21,606][model8_pretrain.py][INFO] Epoch:[0/2](779800/4588595) loss:3.520 lr:0.0000100 epoch_Time:24239.0min: [2024-01-06 04:22:21,606][model8_pretrain.py][INFO] Epoch:[0/2](779800/4588595) loss:3.094 lr:0.0000100 epoch_Time:24239.0min: [2024-01-06 04:22:21,606][model8_pretrain.py][INFO] Epoch:[0/2](779800/4588595) loss:2.617 lr:0.0000100 epoch_Time:24239.0min: [2024-01-06 04:22:21,606][model8_pretrain.py][INFO] Epoch:[0/2](779800/4588595) loss:2.830 lr:0.0000100 epoch_Time:24239.0min: [2024-01-06 04:22:58,557][model8_pretrain.py][INFO] Epoch:[0/2](779900/4588595) loss:2.810 lr:0.0000100 epoch_Time:24238.0min: [2024-01-06 04:22:58,557][model8_pretrain.py][INFO] Epoch:[0/2](779900/4588595) loss:2.948 lr:0.0000100 epoch_Time:24238.0min: [2024-01-06 04:22:58,557][model8_pretrain.py][INFO] Epoch:[0/2](779900/4588595) loss:3.019 lr:0.0000100 epoch_Time:24238.0min: [2024-01-06 
04:22:58,557][model8_pretrain.py][INFO] Epoch:[0/2](779900/4588595) loss:2.690 lr:0.0000100 epoch_Time:24238.0min: [2024-01-06 04:22:58,557][model8_pretrain.py][INFO] Epoch:[0/2](779900/4588595) loss:2.620 lr:0.0000100 epoch_Time:24238.0min: [2024-01-06 04:22:58,557][model8_pretrain.py][INFO] Epoch:[0/2](779900/4588595) loss:3.213 lr:0.0000100 epoch_Time:24238.0min: [2024-01-06 04:22:58,557][model8_pretrain.py][INFO] Epoch:[0/2](779900/4588595) loss:2.522 lr:0.0000100 epoch_Time:24238.0min: [2024-01-06 04:22:58,557][model8_pretrain.py][INFO] Epoch:[0/2](779900/4588595) loss:2.523 lr:0.0000100 epoch_Time:24238.0min: [2024-01-06 04:23:35,505][model8_pretrain.py][INFO] Epoch:[0/2](780000/4588595) loss:2.822 lr:0.0000100 epoch_Time:24238.0min: [2024-01-06 04:23:35,505][model8_pretrain.py][INFO] Epoch:[0/2](780000/4588595) loss:2.784 lr:0.0000100 epoch_Time:24238.0min: [2024-01-06 04:23:35,505][model8_pretrain.py][INFO] Epoch:[0/2](780000/4588595) loss:3.276 lr:0.0000100 epoch_Time:24238.0min: [2024-01-06 04:23:35,505][model8_pretrain.py][INFO] Epoch:[0/2](780000/4588595) loss:2.725 lr:0.0000100 epoch_Time:24238.0min: [2024-01-06 04:23:35,505][model8_pretrain.py][INFO] Epoch:[0/2](780000/4588595) loss:2.194 lr:0.0000100 epoch_Time:24238.0min: [2024-01-06 04:23:35,505][model8_pretrain.py][INFO] Epoch:[0/2](780000/4588595) loss:2.939 lr:0.0000100 epoch_Time:24238.0min: [2024-01-06 04:23:35,505][model8_pretrain.py][INFO] Epoch:[0/2](780000/4588595) loss:3.126 lr:0.0000100 epoch_Time:24238.0min: [2024-01-06 04:23:35,505][model8_pretrain.py][INFO] Epoch:[0/2](780000/4588595) loss:2.879 lr:0.0000100 epoch_Time:24238.0min: [2024-01-06 04:24:12,447][model8_pretrain.py][INFO] Epoch:[0/2](780100/4588595) loss:2.193 lr:0.0000100 epoch_Time:24236.0min: [2024-01-06 04:24:12,447][model8_pretrain.py][INFO] Epoch:[0/2](780100/4588595) loss:3.035 lr:0.0000100 epoch_Time:24236.0min: [2024-01-06 04:24:12,447][model8_pretrain.py][INFO] Epoch:[0/2](780100/4588595) loss:2.947 lr:0.0000100 
epoch_Time:24236.0min: [2024-01-06 04:24:12,447][model8_pretrain.py][INFO] Epoch:[0/2](780100/4588595) loss:2.633 lr:0.0000100 epoch_Time:24236.0min: [2024-01-06 04:24:12,447][model8_pretrain.py][INFO] Epoch:[0/2](780100/4588595) loss:2.549 lr:0.0000100 epoch_Time:24236.0min: [2024-01-06 04:24:12,447][model8_pretrain.py][INFO] Epoch:[0/2](780100/4588595) loss:2.894 lr:0.0000100 epoch_Time:24236.0min: [2024-01-06 04:24:12,447][model8_pretrain.py][INFO] Epoch:[0/2](780100/4588595) loss:2.965 lr:0.0000100 epoch_Time:24236.0min: [2024-01-06 04:24:12,447][model8_pretrain.py][INFO] Epoch:[0/2](780100/4588595) loss:3.046 lr:0.0000100 epoch_Time:24236.0min: [2024-01-06 04:24:49,385][model8_pretrain.py][INFO] Epoch:[0/2](780200/4588595) loss:2.796 lr:0.0000100 epoch_Time:24235.0min: [2024-01-06 04:24:49,385][model8_pretrain.py][INFO] Epoch:[0/2](780200/4588595) loss:2.914 lr:0.0000100 epoch_Time:24235.0min: [2024-01-06 04:24:49,385][model8_pretrain.py][INFO] Epoch:[0/2](780200/4588595) loss:3.008 lr:0.0000100 epoch_Time:24235.0min: [2024-01-06 04:24:49,385][model8_pretrain.py][INFO] Epoch:[0/2](780200/4588595) loss:2.868 lr:0.0000100 epoch_Time:24235.0min: [2024-01-06 04:24:49,385][model8_pretrain.py][INFO] Epoch:[0/2](780200/4588595) loss:3.096 lr:0.0000100 epoch_Time:24235.0min: [2024-01-06 04:24:49,385][model8_pretrain.py][INFO] Epoch:[0/2](780200/4588595) loss:3.197 lr:0.0000100 epoch_Time:24235.0min: [2024-01-06 04:24:49,385][model8_pretrain.py][INFO] Epoch:[0/2](780200/4588595) loss:2.994 lr:0.0000100 epoch_Time:24235.0min: [2024-01-06 04:24:49,385][model8_pretrain.py][INFO] Epoch:[0/2](780200/4588595) loss:3.203 lr:0.0000100 epoch_Time:24235.0min: [2024-01-06 04:25:36,769][model8_pretrain.py][INFO] Epoch:[0/2](780300/4588595) loss:2.252 lr:0.0000100 epoch_Time:24236.0min: [2024-01-06 04:25:36,769][model8_pretrain.py][INFO] Epoch:[0/2](780300/4588595) loss:2.526 lr:0.0000100 epoch_Time:24236.0min: [2024-01-06 04:25:36,769][model8_pretrain.py][INFO] 
Epoch:[0/2](780300/4588595) loss:2.225 lr:0.0000100 epoch_Time:24236.0min: [2024-01-06 04:25:36,769][model8_pretrain.py][INFO] Epoch:[0/2](780300/4588595) loss:3.076 lr:0.0000100 epoch_Time:24236.0min: [2024-01-06 04:25:36,769][model8_pretrain.py][INFO] Epoch:[0/2](780300/4588595) loss:2.903 lr:0.0000100 epoch_Time:24236.0min: [2024-01-06 04:25:36,769][model8_pretrain.py][INFO] Epoch:[0/2](780300/4588595) loss:2.829 lr:0.0000100 epoch_Time:24236.0min: [2024-01-06 04:25:36,769][model8_pretrain.py][INFO] Epoch:[0/2](780300/4588595) loss:3.045 lr:0.0000100 epoch_Time:24236.0min: [2024-01-06 04:25:36,769][model8_pretrain.py][INFO] Epoch:[0/2](780300/4588595) loss:2.582 lr:0.0000100 epoch_Time:24236.0min: [2024-01-06 04:26:13,716][model8_pretrain.py][INFO] Epoch:[0/2](780400/4588595) loss:2.843 lr:0.0000100 epoch_Time:24235.0min: [2024-01-06 04:26:13,716][model8_pretrain.py][INFO] Epoch:[0/2](780400/4588595) loss:2.925 lr:0.0000100 epoch_Time:24235.0min: [2024-01-06 04:26:13,716][model8_pretrain.py][INFO] Epoch:[0/2](780400/4588595) loss:2.875 lr:0.0000100 epoch_Time:24235.0min: [2024-01-06 04:26:13,716][model8_pretrain.py][INFO] Epoch:[0/2](780400/4588595) loss:2.660 lr:0.0000100 epoch_Time:24235.0min: [2024-01-06 04:26:13,716][model8_pretrain.py][INFO] Epoch:[0/2](780400/4588595) loss:3.138 lr:0.0000100 epoch_Time:24235.0min: [2024-01-06 04:26:13,716][model8_pretrain.py][INFO] Epoch:[0/2](780400/4588595) loss:2.720 lr:0.0000100 epoch_Time:24235.0min: [2024-01-06 04:26:13,716][model8_pretrain.py][INFO] Epoch:[0/2](780400/4588595) loss:3.164 lr:0.0000100 epoch_Time:24235.0min: [2024-01-06 04:26:13,716][model8_pretrain.py][INFO] Epoch:[0/2](780400/4588595) loss:3.016 lr:0.0000100 epoch_Time:24235.0min: [2024-01-06 04:26:50,667][model8_pretrain.py][INFO] Epoch:[0/2](780500/4588595) loss:2.351 lr:0.0000100 epoch_Time:24234.0min: [2024-01-06 04:26:50,667][model8_pretrain.py][INFO] Epoch:[0/2](780500/4588595) loss:2.320 lr:0.0000100 epoch_Time:24234.0min: [2024-01-06 
04:26:50,667][model8_pretrain.py][INFO] Epoch:[0/2](780500/4588595) loss:3.029 lr:0.0000100 epoch_Time:24234.0min: [2024-01-06 04:26:50,667][model8_pretrain.py][INFO] Epoch:[0/2](780500/4588595) loss:2.885 lr:0.0000100 epoch_Time:24234.0min: [2024-01-06 04:26:50,667][model8_pretrain.py][INFO] Epoch:[0/2](780500/4588595) loss:3.164 lr:0.0000100 epoch_Time:24234.0min: [2024-01-06 04:26:50,667][model8_pretrain.py][INFO] Epoch:[0/2](780500/4588595) loss:2.674 lr:0.0000100 epoch_Time:24234.0min: [2024-01-06 04:26:50,667][model8_pretrain.py][INFO] Epoch:[0/2](780500/4588595) loss:2.674 lr:0.0000100 epoch_Time:24234.0min: [2024-01-06 04:26:50,667][model8_pretrain.py][INFO] Epoch:[0/2](780500/4588595) loss:3.012 lr:0.0000100 epoch_Time:24234.0min: [2024-01-06 04:27:27,604][model8_pretrain.py][INFO] Epoch:[0/2](780600/4588595) loss:2.474 lr:0.0000100 epoch_Time:24234.0min: [2024-01-06 04:27:27,604][model8_pretrain.py][INFO] Epoch:[0/2](780600/4588595) loss:2.583 lr:0.0000100 epoch_Time:24234.0min: [2024-01-06 04:27:27,604][model8_pretrain.py][INFO] Epoch:[0/2](780600/4588595) loss:2.429 lr:0.0000100 epoch_Time:24234.0min: [2024-01-06 04:27:27,604][model8_pretrain.py][INFO] Epoch:[0/2](780600/4588595) loss:2.569 lr:0.0000100 epoch_Time:24234.0min: [2024-01-06 04:27:27,604][model8_pretrain.py][INFO] Epoch:[0/2](780600/4588595) loss:2.372 lr:0.0000100 epoch_Time:24234.0min: [2024-01-06 04:27:27,604][model8_pretrain.py][INFO] Epoch:[0/2](780600/4588595) loss:2.493 lr:0.0000100 epoch_Time:24234.0min: [2024-01-06 04:27:27,604][model8_pretrain.py][INFO] Epoch:[0/2](780600/4588595) loss:3.164 lr:0.0000100 epoch_Time:24234.0min: [2024-01-06 04:27:27,604][model8_pretrain.py][INFO] Epoch:[0/2](780600/4588595) loss:3.266 lr:0.0000100 epoch_Time:24234.0min: [2024-01-06 04:28:04,554][model8_pretrain.py][INFO] Epoch:[0/2](780700/4588595) loss:2.905 lr:0.0000100 epoch_Time:24233.0min: [2024-01-06 04:28:04,554][model8_pretrain.py][INFO] Epoch:[0/2](780700/4588595) loss:3.123 lr:0.0000100 
epoch_Time:24233.0min: [2024-01-06 04:28:04,554][model8_pretrain.py][INFO] Epoch:[0/2](780700/4588595) loss:3.194 lr:0.0000100 epoch_Time:24233.0min: [2024-01-06 04:28:04,554][model8_pretrain.py][INFO] Epoch:[0/2](780700/4588595) loss:2.886 lr:0.0000100 epoch_Time:24233.0min: [2024-01-06 04:28:04,554][model8_pretrain.py][INFO] Epoch:[0/2](780700/4588595) loss:2.990 lr:0.0000100 epoch_Time:24233.0min: [2024-01-06 04:28:04,554][model8_pretrain.py][INFO] Epoch:[0/2](780700/4588595) loss:2.332 lr:0.0000100 epoch_Time:24233.0min: [2024-01-06 04:28:04,554][model8_pretrain.py][INFO] Epoch:[0/2](780700/4588595) loss:2.042 lr:0.0000100 epoch_Time:24233.0min: [2024-01-06 04:28:04,554][model8_pretrain.py][INFO] Epoch:[0/2](780700/4588595) loss:2.245 lr:0.0000100 epoch_Time:24233.0min: [2024-01-06 04:28:41,500][model8_pretrain.py][INFO] Epoch:[0/2](780800/4588595) loss:2.596 lr:0.0000100 epoch_Time:24233.0min: [2024-01-06 04:28:41,500][model8_pretrain.py][INFO] Epoch:[0/2](780800/4588595) loss:3.179 lr:0.0000100 epoch_Time:24233.0min: [2024-01-06 04:28:41,500][model8_pretrain.py][INFO] Epoch:[0/2](780800/4588595) loss:2.391 lr:0.0000100 epoch_Time:24233.0min: [2024-01-06 04:28:41,500][model8_pretrain.py][INFO] Epoch:[0/2](780800/4588595) loss:2.708 lr:0.0000100 epoch_Time:24233.0min: [2024-01-06 04:28:41,500][model8_pretrain.py][INFO] Epoch:[0/2](780800/4588595) loss:2.581 lr:0.0000100 epoch_Time:24233.0min: [2024-01-06 04:28:41,500][model8_pretrain.py][INFO] Epoch:[0/2](780800/4588595) loss:2.998 lr:0.0000100 epoch_Time:24233.0min: [2024-01-06 04:28:41,500][model8_pretrain.py][INFO] Epoch:[0/2](780800/4588595) loss:2.859 lr:0.0000100 epoch_Time:24233.0min: [2024-01-06 04:28:41,500][model8_pretrain.py][INFO] Epoch:[0/2](780800/4588595) loss:3.176 lr:0.0000100 epoch_Time:24233.0min: [2024-01-06 04:29:18,432][model8_pretrain.py][INFO] Epoch:[0/2](780900/4588595) loss:2.599 lr:0.0000100 epoch_Time:24231.0min: [2024-01-06 04:29:18,432][model8_pretrain.py][INFO] 
Epoch:[0/2](780900/4588595) loss:2.829 lr:0.0000100 epoch_Time:24231.0min: [2024-01-06 04:29:18,432][model8_pretrain.py][INFO] Epoch:[0/2](780900/4588595) loss:2.757 lr:0.0000100 epoch_Time:24231.0min: [2024-01-06 04:29:18,432][model8_pretrain.py][INFO] Epoch:[0/2](780900/4588595) loss:3.314 lr:0.0000100 epoch_Time:24231.0min: [2024-01-06 04:29:18,432][model8_pretrain.py][INFO] Epoch:[0/2](780900/4588595) loss:2.807 lr:0.0000100 epoch_Time:24231.0min: [2024-01-06 04:29:18,432][model8_pretrain.py][INFO] Epoch:[0/2](780900/4588595) loss:3.289 lr:0.0000100 epoch_Time:24231.0min: [2024-01-06 04:29:18,432][model8_pretrain.py][INFO] Epoch:[0/2](780900/4588595) loss:2.695 lr:0.0000100 epoch_Time:24231.0min: [2024-01-06 04:29:18,432][model8_pretrain.py][INFO] Epoch:[0/2](780900/4588595) loss:3.373 lr:0.0000100 epoch_Time:24231.0min: [2024-01-06 04:29:55,378][model8_pretrain.py][INFO] Epoch:[0/2](781000/4588595) loss:2.478 lr:0.0000100 epoch_Time:24230.0min: [2024-01-06 04:29:55,378][model8_pretrain.py][INFO] Epoch:[0/2](781000/4588595) loss:2.953 lr:0.0000100 epoch_Time:24230.0min: [2024-01-06 04:29:55,378][model8_pretrain.py][INFO] Epoch:[0/2](781000/4588595) loss:3.031 lr:0.0000100 epoch_Time:24230.0min: [2024-01-06 04:29:55,378][model8_pretrain.py][INFO] Epoch:[0/2](781000/4588595) loss:2.632 lr:0.0000100 epoch_Time:24230.0min: [2024-01-06 04:29:55,378][model8_pretrain.py][INFO] Epoch:[0/2](781000/4588595) loss:2.853 lr:0.0000100 epoch_Time:24230.0min: [2024-01-06 04:29:55,378][model8_pretrain.py][INFO] Epoch:[0/2](781000/4588595) loss:3.151 lr:0.0000100 epoch_Time:24230.0min: [2024-01-06 04:29:55,379][model8_pretrain.py][INFO] Epoch:[0/2](781000/4588595) loss:2.378 lr:0.0000100 epoch_Time:24230.0min: [2024-01-06 04:29:55,379][model8_pretrain.py][INFO] Epoch:[0/2](781000/4588595) loss:2.779 lr:0.0000100 epoch_Time:24230.0min: [2024-01-06 04:30:42,787][model8_pretrain.py][INFO] Epoch:[0/2](781100/4588595) loss:2.953 lr:0.0000100 epoch_Time:24231.0min: [2024-01-06 
04:30:42,787][model8_pretrain.py][INFO] Epoch:[0/2](781100/4588595) loss:3.076 lr:0.0000100 epoch_Time:24231.0min: [2024-01-06 04:30:42,787][model8_pretrain.py][INFO] Epoch:[0/2](781100/4588595) loss:3.515 lr:0.0000100 epoch_Time:24231.0min: [2024-01-06 04:30:42,787][model8_pretrain.py][INFO] Epoch:[0/2](781100/4588595) loss:3.321 lr:0.0000100 epoch_Time:24231.0min: [2024-01-06 04:30:42,787][model8_pretrain.py][INFO] Epoch:[0/2](781100/4588595) loss:2.646 lr:0.0000100 epoch_Time:24231.0min: [2024-01-06 04:30:42,787][model8_pretrain.py][INFO] Epoch:[0/2](781100/4588595) loss:3.030 lr:0.0000100 epoch_Time:24231.0min: [2024-01-06 04:30:42,787][model8_pretrain.py][INFO] Epoch:[0/2](781100/4588595) loss:2.718 lr:0.0000100 epoch_Time:24231.0min: [2024-01-06 04:30:42,787][model8_pretrain.py][INFO] Epoch:[0/2](781100/4588595) loss:2.503 lr:0.0000100 epoch_Time:24231.0min: [2024-01-06 04:31:19,726][model8_pretrain.py][INFO] Epoch:[0/2](781200/4588595) loss:2.584 lr:0.0000100 epoch_Time:24230.0min: [2024-01-06 04:31:19,726][model8_pretrain.py][INFO] Epoch:[0/2](781200/4588595) loss:3.011 lr:0.0000100 epoch_Time:24230.0min: [2024-01-06 04:31:19,726][model8_pretrain.py][INFO] Epoch:[0/2](781200/4588595) loss:3.125 lr:0.0000100 epoch_Time:24230.0min: [2024-01-06 04:31:19,726][model8_pretrain.py][INFO] Epoch:[0/2](781200/4588595) loss:3.198 lr:0.0000100 epoch_Time:24230.0min: [2024-01-06 04:31:19,726][model8_pretrain.py][INFO] Epoch:[0/2](781200/4588595) loss:2.742 lr:0.0000100 epoch_Time:24230.0min: [2024-01-06 04:31:19,726][model8_pretrain.py][INFO] Epoch:[0/2](781200/4588595) loss:3.139 lr:0.0000100 epoch_Time:24230.0min: [2024-01-06 04:31:19,726][model8_pretrain.py][INFO] Epoch:[0/2](781200/4588595) loss:3.309 lr:0.0000100 epoch_Time:24230.0min: [2024-01-06 04:31:19,726][model8_pretrain.py][INFO] Epoch:[0/2](781200/4588595) loss:2.538 lr:0.0000100 epoch_Time:24230.0min: [2024-01-06 04:31:56,684][model8_pretrain.py][INFO] Epoch:[0/2](781300/4588595) loss:3.280 lr:0.0000100 
epoch_Time:24229.0min: [2024-01-06 04:31:56,684][model8_pretrain.py][INFO] Epoch:[0/2](781300/4588595) loss:2.639 lr:0.0000100 epoch_Time:24229.0min: [2024-01-06 04:31:56,684][model8_pretrain.py][INFO] Epoch:[0/2](781300/4588595) loss:2.974 lr:0.0000100 epoch_Time:24229.0min: [2024-01-06 04:31:56,684][model8_pretrain.py][INFO] Epoch:[0/2](781300/4588595) loss:2.624 lr:0.0000100 epoch_Time:24229.0min: [2024-01-06 04:31:56,684][model8_pretrain.py][INFO] Epoch:[0/2](781300/4588595) loss:3.000 lr:0.0000100 epoch_Time:24229.0min: [2024-01-06 04:31:56,684][model8_pretrain.py][INFO] Epoch:[0/2](781300/4588595) loss:2.783 lr:0.0000100 epoch_Time:24229.0min: [2024-01-06 04:31:56,685][model8_pretrain.py][INFO] Epoch:[0/2](781300/4588595) loss:2.919 lr:0.0000100 epoch_Time:24229.0min: [2024-01-06 04:31:56,685][model8_pretrain.py][INFO] Epoch:[0/2](781300/4588595) loss:2.410 lr:0.0000100 epoch_Time:24229.0min: [2024-01-06 04:32:33,645][model8_pretrain.py][INFO] Epoch:[0/2](781400/4588595) loss:2.978 lr:0.0000100 epoch_Time:24229.0min: [2024-01-06 04:32:33,645][model8_pretrain.py][INFO] Epoch:[0/2](781400/4588595) loss:2.928 lr:0.0000100 epoch_Time:24229.0min: [2024-01-06 04:32:33,645][model8_pretrain.py][INFO] Epoch:[0/2](781400/4588595) loss:2.751 lr:0.0000100 epoch_Time:24229.0min: [2024-01-06 04:32:33,645][model8_pretrain.py][INFO] Epoch:[0/2](781400/4588595) loss:2.706 lr:0.0000100 epoch_Time:24229.0min: [2024-01-06 04:32:33,645][model8_pretrain.py][INFO] Epoch:[0/2](781400/4588595) loss:2.567 lr:0.0000100 epoch_Time:24229.0min: [2024-01-06 04:32:33,645][model8_pretrain.py][INFO] Epoch:[0/2](781400/4588595) loss:2.847 lr:0.0000100 epoch_Time:24229.0min: [2024-01-06 04:32:33,645][model8_pretrain.py][INFO] Epoch:[0/2](781400/4588595) loss:3.277 lr:0.0000100 epoch_Time:24229.0min: [2024-01-06 04:32:33,645][model8_pretrain.py][INFO] Epoch:[0/2](781400/4588595) loss:2.821 lr:0.0000100 epoch_Time:24229.0min: [2024-01-06 04:33:10,603][model8_pretrain.py][INFO] 
Epoch:[0/2](781500/4588595) loss:3.329 lr:0.0000100 epoch_Time:24228.0min: [2024-01-06 04:33:10,603][model8_pretrain.py][INFO] Epoch:[0/2](781500/4588595) loss:2.983 lr:0.0000100 epoch_Time:24228.0min: [2024-01-06 04:33:10,603][model8_pretrain.py][INFO] Epoch:[0/2](781500/4588595) loss:2.998 lr:0.0000100 epoch_Time:24228.0min: [2024-01-06 04:33:10,603][model8_pretrain.py][INFO] Epoch:[0/2](781500/4588595) loss:3.016 lr:0.0000100 epoch_Time:24228.0min: [2024-01-06 04:33:10,603][model8_pretrain.py][INFO] Epoch:[0/2](781500/4588595) loss:2.679 lr:0.0000100 epoch_Time:24228.0min: [2024-01-06 04:33:10,603][model8_pretrain.py][INFO] Epoch:[0/2](781500/4588595) loss:2.697 lr:0.0000100 epoch_Time:24228.0min: [2024-01-06 04:33:10,603][model8_pretrain.py][INFO] Epoch:[0/2](781500/4588595) loss:2.486 lr:0.0000100 epoch_Time:24228.0min: [2024-01-06 04:33:10,603][model8_pretrain.py][INFO] Epoch:[0/2](781500/4588595) loss:2.572 lr:0.0000100 epoch_Time:24228.0min: [2024-01-06 04:33:47,548][model8_pretrain.py][INFO] Epoch:[0/2](781600/4588595) loss:2.339 lr:0.0000100 epoch_Time:24228.0min: [2024-01-06 04:33:47,548][model8_pretrain.py][INFO] Epoch:[0/2](781600/4588595) loss:2.601 lr:0.0000100 epoch_Time:24228.0min: [2024-01-06 04:33:47,549][model8_pretrain.py][INFO] Epoch:[0/2](781600/4588595) loss:2.928 lr:0.0000100 epoch_Time:24228.0min: [2024-01-06 04:33:47,548][model8_pretrain.py][INFO] Epoch:[0/2](781600/4588595) loss:3.042 lr:0.0000100 epoch_Time:24228.0min: [2024-01-06 04:33:47,549][model8_pretrain.py][INFO] Epoch:[0/2](781600/4588595) loss:2.745 lr:0.0000100 epoch_Time:24228.0min: [2024-01-06 04:33:47,549][model8_pretrain.py][INFO] Epoch:[0/2](781600/4588595) loss:2.615 lr:0.0000100 epoch_Time:24228.0min: [2024-01-06 04:33:47,549][model8_pretrain.py][INFO] Epoch:[0/2](781600/4588595) loss:2.489 lr:0.0000100 epoch_Time:24228.0min: [2024-01-06 04:33:47,549][model8_pretrain.py][INFO] Epoch:[0/2](781600/4588595) loss:3.225 lr:0.0000100 epoch_Time:24228.0min: [2024-01-06 
04:34:24,500][model8_pretrain.py][INFO] Epoch:[0/2](781700/4588595) loss:2.682 lr:0.0000100 epoch_Time:24227.0min: [2024-01-06 04:34:24,500][model8_pretrain.py][INFO] Epoch:[0/2](781700/4588595) loss:3.288 lr:0.0000100 epoch_Time:24227.0min: [2024-01-06 04:34:24,500][model8_pretrain.py][INFO] Epoch:[0/2](781700/4588595) loss:3.216 lr:0.0000100 epoch_Time:24227.0min: [2024-01-06 04:34:24,500][model8_pretrain.py][INFO] Epoch:[0/2](781700/4588595) loss:2.618 lr:0.0000100 epoch_Time:24227.0min: [2024-01-06 04:34:24,500][model8_pretrain.py][INFO] Epoch:[0/2](781700/4588595) loss:2.962 lr:0.0000100 epoch_Time:24227.0min: [2024-01-06 04:34:24,500][model8_pretrain.py][INFO] Epoch:[0/2](781700/4588595) loss:2.721 lr:0.0000100 epoch_Time:24227.0min: [2024-01-06 04:34:24,501][model8_pretrain.py][INFO] Epoch:[0/2](781700/4588595) loss:2.922 lr:0.0000100 epoch_Time:24227.0min: [2024-01-06 04:34:24,501][model8_pretrain.py][INFO] Epoch:[0/2](781700/4588595) loss:3.166 lr:0.0000100 epoch_Time:24227.0min: [2024-01-06 04:35:01,455][model8_pretrain.py][INFO] Epoch:[0/2](781800/4588595) loss:3.301 lr:0.0000100 epoch_Time:24225.0min: [2024-01-06 04:35:01,455][model8_pretrain.py][INFO] Epoch:[0/2](781800/4588595) loss:2.852 lr:0.0000100 epoch_Time:24225.0min: [2024-01-06 04:35:01,455][model8_pretrain.py][INFO] Epoch:[0/2](781800/4588595) loss:2.615 lr:0.0000100 epoch_Time:24225.0min: [2024-01-06 04:35:01,455][model8_pretrain.py][INFO] Epoch:[0/2](781800/4588595) loss:3.051 lr:0.0000100 epoch_Time:24225.0min: [2024-01-06 04:35:01,455][model8_pretrain.py][INFO] Epoch:[0/2](781800/4588595) loss:3.070 lr:0.0000100 epoch_Time:24225.0min: [2024-01-06 04:35:01,455][model8_pretrain.py][INFO] Epoch:[0/2](781800/4588595) loss:2.705 lr:0.0000100 epoch_Time:24225.0min: [2024-01-06 04:35:01,455][model8_pretrain.py][INFO] Epoch:[0/2](781800/4588595) loss:2.858 lr:0.0000100 epoch_Time:24225.0min: [2024-01-06 04:35:01,455][model8_pretrain.py][INFO] Epoch:[0/2](781800/4588595) loss:2.949 lr:0.0000100 
epoch_Time:24225.0min: [2024-01-06 04:35:48,644][model8_pretrain.py][INFO] Epoch:[0/2](781900/4588595) loss:2.883 lr:0.0000100 epoch_Time:24225.0min: [2024-01-06 04:35:48,644][model8_pretrain.py][INFO] Epoch:[0/2](781900/4588595) loss:2.441 lr:0.0000100 epoch_Time:24225.0min: [2024-01-06 04:35:48,644][model8_pretrain.py][INFO] Epoch:[0/2](781900/4588595) loss:2.581 lr:0.0000100 epoch_Time:24225.0min: [2024-01-06 04:35:48,644][model8_pretrain.py][INFO] Epoch:[0/2](781900/4588595) loss:2.586 lr:0.0000100 epoch_Time:24225.0min: [2024-01-06 04:35:48,644][model8_pretrain.py][INFO] Epoch:[0/2](781900/4588595) loss:3.419 lr:0.0000100 epoch_Time:24225.0min: [2024-01-06 04:35:48,644][model8_pretrain.py][INFO] Epoch:[0/2](781900/4588595) loss:2.111 lr:0.0000100 epoch_Time:24225.0min: [2024-01-06 04:35:48,644][model8_pretrain.py][INFO] Epoch:[0/2](781900/4588595) loss:3.014 lr:0.0000100 epoch_Time:24225.0min: [2024-01-06 04:35:48,644][model8_pretrain.py][INFO] Epoch:[0/2](781900/4588595) loss:2.650 lr:0.0000100 epoch_Time:24225.0min: [2024-01-06 04:36:25,612][model8_pretrain.py][INFO] Epoch:[0/2](782000/4588595) loss:3.103 lr:0.0000100 epoch_Time:24225.0min: [2024-01-06 04:36:25,612][model8_pretrain.py][INFO] Epoch:[0/2](782000/4588595) loss:2.151 lr:0.0000100 epoch_Time:24225.0min: [2024-01-06 04:36:25,612][model8_pretrain.py][INFO] Epoch:[0/2](782000/4588595) loss:2.564 lr:0.0000100 epoch_Time:24225.0min: [2024-01-06 04:36:25,612][model8_pretrain.py][INFO] Epoch:[0/2](782000/4588595) loss:1.536 lr:0.0000100 epoch_Time:24225.0min: [2024-01-06 04:36:25,612][model8_pretrain.py][INFO] Epoch:[0/2](782000/4588595) loss:2.768 lr:0.0000100 epoch_Time:24225.0min: [2024-01-06 04:36:25,613][model8_pretrain.py][INFO] Epoch:[0/2](782000/4588595) loss:2.635 lr:0.0000100 epoch_Time:24225.0min: [2024-01-06 04:36:25,613][model8_pretrain.py][INFO] Epoch:[0/2](782000/4588595) loss:2.463 lr:0.0000100 epoch_Time:24225.0min: [2024-01-06 04:36:25,613][model8_pretrain.py][INFO] 
Epoch:[0/2](782000/4588595) loss:2.856 lr:0.0000100 epoch_Time:24225.0min: [2024-01-06 04:37:02,567][model8_pretrain.py][INFO] Epoch:[0/2](782100/4588595) loss:2.796 lr:0.0000100 epoch_Time:24224.0min: [2024-01-06 04:37:02,567][model8_pretrain.py][INFO] Epoch:[0/2](782100/4588595) loss:2.752 lr:0.0000100 epoch_Time:24224.0min: [2024-01-06 04:37:02,567][model8_pretrain.py][INFO] Epoch:[0/2](782100/4588595) loss:2.478 lr:0.0000100 epoch_Time:24224.0min: [2024-01-06 04:37:02,567][model8_pretrain.py][INFO] Epoch:[0/2](782100/4588595) loss:2.969 lr:0.0000100 epoch_Time:24224.0min: [2024-01-06 04:37:02,567][model8_pretrain.py][INFO] Epoch:[0/2](782100/4588595) loss:3.175 lr:0.0000100 epoch_Time:24224.0min: [2024-01-06 04:37:02,567][model8_pretrain.py][INFO] Epoch:[0/2](782100/4588595) loss:2.240 lr:0.0000100 epoch_Time:24224.0min: [2024-01-06 04:37:02,567][model8_pretrain.py][INFO] Epoch:[0/2](782100/4588595) loss:3.080 lr:0.0000100 epoch_Time:24224.0min: [2024-01-06 04:37:02,567][model8_pretrain.py][INFO] Epoch:[0/2](782100/4588595) loss:2.799 lr:0.0000100 epoch_Time:24224.0min: [2024-01-06 04:37:39,509][model8_pretrain.py][INFO] Epoch:[0/2](782200/4588595) loss:3.198 lr:0.0000100 epoch_Time:24224.0min: [2024-01-06 04:37:39,509][model8_pretrain.py][INFO] Epoch:[0/2](782200/4588595) loss:2.762 lr:0.0000100 epoch_Time:24224.0min: [2024-01-06 04:37:39,510][model8_pretrain.py][INFO] Epoch:[0/2](782200/4588595) loss:2.833 lr:0.0000100 epoch_Time:24224.0min: [2024-01-06 04:37:39,510][model8_pretrain.py][INFO] Epoch:[0/2](782200/4588595) loss:2.729 lr:0.0000100 epoch_Time:24224.0min: [2024-01-06 04:37:39,510][model8_pretrain.py][INFO] Epoch:[0/2](782200/4588595) loss:2.969 lr:0.0000100 epoch_Time:24224.0min: [2024-01-06 04:37:39,510][model8_pretrain.py][INFO] Epoch:[0/2](782200/4588595) loss:2.949 lr:0.0000100 epoch_Time:24224.0min: [2024-01-06 04:37:39,510][model8_pretrain.py][INFO] Epoch:[0/2](782200/4588595) loss:3.384 lr:0.0000100 epoch_Time:24224.0min: [2024-01-06 
04:37:39,510][model8_pretrain.py][INFO] Epoch:[0/2](782200/4588595) loss:2.673 lr:0.0000100 epoch_Time:24224.0min: [2024-01-06 04:38:16,463][model8_pretrain.py][INFO] Epoch:[0/2](782300/4588595) loss:2.909 lr:0.0000100 epoch_Time:24223.0min: [2024-01-06 04:38:16,463][model8_pretrain.py][INFO] Epoch:[0/2](782300/4588595) loss:2.688 lr:0.0000100 epoch_Time:24223.0min: [2024-01-06 04:38:16,463][model8_pretrain.py][INFO] Epoch:[0/2](782300/4588595) loss:2.430 lr:0.0000100 epoch_Time:24223.0min: [2024-01-06 04:38:16,463][model8_pretrain.py][INFO] Epoch:[0/2](782300/4588595) loss:2.577 lr:0.0000100 epoch_Time:24223.0min: [2024-01-06 04:38:16,463][model8_pretrain.py][INFO] Epoch:[0/2](782300/4588595) loss:2.476 lr:0.0000100 epoch_Time:24223.0min: [2024-01-06 04:38:16,463][model8_pretrain.py][INFO] Epoch:[0/2](782300/4588595) loss:3.111 lr:0.0000100 epoch_Time:24223.0min: [2024-01-06 04:38:16,464][model8_pretrain.py][INFO] Epoch:[0/2](782300/4588595) loss:3.344 lr:0.0000100 epoch_Time:24223.0min: [2024-01-06 04:38:16,464][model8_pretrain.py][INFO] Epoch:[0/2](782300/4588595) loss:3.231 lr:0.0000100 epoch_Time:24223.0min: [2024-01-06 04:38:53,445][model8_pretrain.py][INFO] Epoch:[0/2](782400/4588595) loss:2.058 lr:0.0000100 epoch_Time:24222.0min: [2024-01-06 04:38:53,445][model8_pretrain.py][INFO] Epoch:[0/2](782400/4588595) loss:2.841 lr:0.0000100 epoch_Time:24222.0min: [2024-01-06 04:38:53,445][model8_pretrain.py][INFO] Epoch:[0/2](782400/4588595) loss:2.827 lr:0.0000100 epoch_Time:24222.0min: [2024-01-06 04:38:53,445][model8_pretrain.py][INFO] Epoch:[0/2](782400/4588595) loss:3.197 lr:0.0000100 epoch_Time:24222.0min: [2024-01-06 04:38:53,445][model8_pretrain.py][INFO] Epoch:[0/2](782400/4588595) loss:2.777 lr:0.0000100 epoch_Time:24222.0min: [2024-01-06 04:38:53,445][model8_pretrain.py][INFO] Epoch:[0/2](782400/4588595) loss:2.412 lr:0.0000100 epoch_Time:24222.0min: [2024-01-06 04:38:53,445][model8_pretrain.py][INFO] Epoch:[0/2](782400/4588595) loss:2.869 lr:0.0000100 
epoch_Time:24222.0min: [2024-01-06 04:38:53,445][model8_pretrain.py][INFO] Epoch:[0/2](782400/4588595) loss:2.922 lr:0.0000100 epoch_Time:24222.0min: [2024-01-06 04:39:30,433][model8_pretrain.py][INFO] Epoch:[0/2](782500/4588595) loss:3.022 lr:0.0000100 epoch_Time:24222.0min: [2024-01-06 04:39:30,433][model8_pretrain.py][INFO] Epoch:[0/2](782500/4588595) loss:2.824 lr:0.0000100 epoch_Time:24222.0min: [2024-01-06 04:39:30,433][model8_pretrain.py][INFO] Epoch:[0/2](782500/4588595) loss:3.419 lr:0.0000100 epoch_Time:24222.0min: [2024-01-06 04:39:30,433][model8_pretrain.py][INFO] Epoch:[0/2](782500/4588595) loss:2.619 lr:0.0000100 epoch_Time:24222.0min: [2024-01-06 04:39:30,433][model8_pretrain.py][INFO] Epoch:[0/2](782500/4588595) loss:2.746 lr:0.0000100 epoch_Time:24222.0min: [2024-01-06 04:39:30,433][model8_pretrain.py][INFO] Epoch:[0/2](782500/4588595) loss:2.704 lr:0.0000100 epoch_Time:24222.0min: [2024-01-06 04:39:30,434][model8_pretrain.py][INFO] Epoch:[0/2](782500/4588595) loss:3.132 lr:0.0000100 epoch_Time:24222.0min: [2024-01-06 04:39:30,434][model8_pretrain.py][INFO] Epoch:[0/2](782500/4588595) loss:3.231 lr:0.0000100 epoch_Time:24222.0min: [2024-01-06 04:40:07,430][model8_pretrain.py][INFO] Epoch:[0/2](782600/4588595) loss:3.116 lr:0.0000100 epoch_Time:24220.0min: [2024-01-06 04:40:07,430][model8_pretrain.py][INFO] Epoch:[0/2](782600/4588595) loss:3.242 lr:0.0000100 epoch_Time:24220.0min: [2024-01-06 04:40:07,430][model8_pretrain.py][INFO] Epoch:[0/2](782600/4588595) loss:2.890 lr:0.0000100 epoch_Time:24220.0min: [2024-01-06 04:40:07,430][model8_pretrain.py][INFO] Epoch:[0/2](782600/4588595) loss:2.757 lr:0.0000100 epoch_Time:24220.0min: [2024-01-06 04:40:07,430][model8_pretrain.py][INFO] Epoch:[0/2](782600/4588595) loss:2.912 lr:0.0000100 epoch_Time:24220.0min: [2024-01-06 04:40:07,431][model8_pretrain.py][INFO] Epoch:[0/2](782600/4588595) loss:2.618 lr:0.0000100 epoch_Time:24220.0min: [2024-01-06 04:40:07,431][model8_pretrain.py][INFO] 
Epoch:[0/2](782600/4588595) loss:2.530 lr:0.0000100 epoch_Time:24220.0min: [2024-01-06 04:40:07,431][model8_pretrain.py][INFO] Epoch:[0/2](782600/4588595) loss:3.078 lr:0.0000100 epoch_Time:24220.0min: [2024-01-06 04:40:54,621][model8_pretrain.py][INFO] Epoch:[0/2](782700/4588595) loss:2.901 lr:0.0000100 epoch_Time:24220.0min: [2024-01-06 04:40:54,621][model8_pretrain.py][INFO] Epoch:[0/2](782700/4588595) loss:2.757 lr:0.0000100 epoch_Time:24220.0min: [2024-01-06 04:40:54,621][model8_pretrain.py][INFO] Epoch:[0/2](782700/4588595) loss:2.985 lr:0.0000100 epoch_Time:24220.0min: [2024-01-06 04:40:54,621][model8_pretrain.py][INFO] Epoch:[0/2](782700/4588595) loss:2.708 lr:0.0000100 epoch_Time:24220.0min: [2024-01-06 04:40:54,621][model8_pretrain.py][INFO] Epoch:[0/2](782700/4588595) loss:2.847 lr:0.0000100 epoch_Time:24220.0min: [2024-01-06 04:40:54,621][model8_pretrain.py][INFO] Epoch:[0/2](782700/4588595) loss:2.994 lr:0.0000100 epoch_Time:24220.0min: [2024-01-06 04:40:54,621][model8_pretrain.py][INFO] Epoch:[0/2](782700/4588595) loss:2.395 lr:0.0000100 epoch_Time:24220.0min: [2024-01-06 04:40:54,621][model8_pretrain.py][INFO] Epoch:[0/2](782700/4588595) loss:3.489 lr:0.0000100 epoch_Time:24220.0min: [2024-01-06 04:41:31,561][model8_pretrain.py][INFO] Epoch:[0/2](782800/4588595) loss:2.954 lr:0.0000100 epoch_Time:24220.0min: [2024-01-06 04:41:31,561][model8_pretrain.py][INFO] Epoch:[0/2](782800/4588595) loss:2.507 lr:0.0000100 epoch_Time:24220.0min: [2024-01-06 04:41:31,561][model8_pretrain.py][INFO] Epoch:[0/2](782800/4588595) loss:2.253 lr:0.0000100 epoch_Time:24220.0min: [2024-01-06 04:41:31,561][model8_pretrain.py][INFO] Epoch:[0/2](782800/4588595) loss:2.442 lr:0.0000100 epoch_Time:24220.0min: [2024-01-06 04:41:31,561][model8_pretrain.py][INFO] Epoch:[0/2](782800/4588595) loss:2.490 lr:0.0000100 epoch_Time:24220.0min: [2024-01-06 04:41:31,561][model8_pretrain.py][INFO] Epoch:[0/2](782800/4588595) loss:2.602 lr:0.0000100 epoch_Time:24220.0min: [2024-01-06 
04:41:31,561][model8_pretrain.py][INFO] Epoch:[0/2](782800/4588595) loss:2.886 lr:0.0000100 epoch_Time:24220.0min: [2024-01-06 04:41:31,561][model8_pretrain.py][INFO] Epoch:[0/2](782800/4588595) loss:2.935 lr:0.0000100 epoch_Time:24220.0min: [2024-01-06 04:42:08,511][model8_pretrain.py][INFO] Epoch:[0/2](782900/4588595) loss:3.327 lr:0.0000100 epoch_Time:24219.0min: [2024-01-06 04:42:08,511][model8_pretrain.py][INFO] Epoch:[0/2](782900/4588595) loss:3.041 lr:0.0000100 epoch_Time:24219.0min: [2024-01-06 04:42:08,511][model8_pretrain.py][INFO] Epoch:[0/2](782900/4588595) loss:2.054 lr:0.0000100 epoch_Time:24219.0min: [2024-01-06 04:42:08,511][model8_pretrain.py][INFO] Epoch:[0/2](782900/4588595) loss:2.937 lr:0.0000100 epoch_Time:24219.0min: [2024-01-06 04:42:08,511][model8_pretrain.py][INFO] Epoch:[0/2](782900/4588595) loss:3.298 lr:0.0000100 epoch_Time:24219.0min: [2024-01-06 04:42:08,511][model8_pretrain.py][INFO] Epoch:[0/2](782900/4588595) loss:3.369 lr:0.0000100 epoch_Time:24219.0min: [2024-01-06 04:42:08,511][model8_pretrain.py][INFO] Epoch:[0/2](782900/4588595) loss:2.814 lr:0.0000100 epoch_Time:24219.0min: [2024-01-06 04:42:08,511][model8_pretrain.py][INFO] Epoch:[0/2](782900/4588595) loss:3.228 lr:0.0000100 epoch_Time:24219.0min: [2024-01-06 04:42:45,416][model8_pretrain.py][INFO] Epoch:[0/2](783000/4588595) loss:2.358 lr:0.0000100 epoch_Time:24219.0min: [2024-01-06 04:42:45,416][model8_pretrain.py][INFO] Epoch:[0/2](783000/4588595) loss:3.103 lr:0.0000100 epoch_Time:24219.0min: [2024-01-06 04:42:45,416][model8_pretrain.py][INFO] Epoch:[0/2](783000/4588595) loss:2.427 lr:0.0000100 epoch_Time:24219.0min: [2024-01-06 04:42:45,416][model8_pretrain.py][INFO] Epoch:[0/2](783000/4588595) loss:3.005 lr:0.0000100 epoch_Time:24219.0min: [2024-01-06 04:42:45,416][model8_pretrain.py][INFO] Epoch:[0/2](783000/4588595) loss:2.736 lr:0.0000100 epoch_Time:24219.0min: [2024-01-06 04:42:45,416][model8_pretrain.py][INFO] Epoch:[0/2](783000/4588595) loss:2.881 lr:0.0000100 
epoch_Time:24219.0min: [2024-01-06 04:42:45,416][model8_pretrain.py][INFO] Epoch:[0/2](783000/4588595) loss:2.548 lr:0.0000100 epoch_Time:24219.0min: [2024-01-06 04:42:45,416][model8_pretrain.py][INFO] Epoch:[0/2](783000/4588595) loss:2.354 lr:0.0000100 epoch_Time:24219.0min: [2024-01-06 04:43:22,354][model8_pretrain.py][INFO] Epoch:[0/2](783100/4588595) loss:2.686 lr:0.0000100 epoch_Time:24218.0min: [2024-01-06 04:43:22,354][model8_pretrain.py][INFO] Epoch:[0/2](783100/4588595) loss:2.513 lr:0.0000100 epoch_Time:24218.0min: [2024-01-06 04:43:22,354][model8_pretrain.py][INFO] Epoch:[0/2](783100/4588595) loss:2.602 lr:0.0000100 epoch_Time:24218.0min: [2024-01-06 04:43:22,354][model8_pretrain.py][INFO] Epoch:[0/2](783100/4588595) loss:2.701 lr:0.0000100 epoch_Time:24218.0min: [2024-01-06 04:43:22,354][model8_pretrain.py][INFO] Epoch:[0/2](783100/4588595) loss:2.234 lr:0.0000100 epoch_Time:24218.0min: [2024-01-06 04:43:22,354][model8_pretrain.py][INFO] Epoch:[0/2](783100/4588595) loss:3.287 lr:0.0000100 epoch_Time:24218.0min: [2024-01-06 04:43:22,354][model8_pretrain.py][INFO] Epoch:[0/2](783100/4588595) loss:2.804 lr:0.0000100 epoch_Time:24218.0min: [2024-01-06 04:43:22,354][model8_pretrain.py][INFO] Epoch:[0/2](783100/4588595) loss:3.239 lr:0.0000100 epoch_Time:24218.0min: [2024-01-06 04:43:59,293][model8_pretrain.py][INFO] Epoch:[0/2](783200/4588595) loss:2.986 lr:0.0000100 epoch_Time:24217.0min: [2024-01-06 04:43:59,293][model8_pretrain.py][INFO] Epoch:[0/2](783200/4588595) loss:2.708 lr:0.0000100 epoch_Time:24217.0min: [2024-01-06 04:43:59,293][model8_pretrain.py][INFO] Epoch:[0/2](783200/4588595) loss:2.850 lr:0.0000100 epoch_Time:24217.0min: [2024-01-06 04:43:59,293][model8_pretrain.py][INFO] Epoch:[0/2](783200/4588595) loss:2.501 lr:0.0000100 epoch_Time:24217.0min: [2024-01-06 04:43:59,293][model8_pretrain.py][INFO] Epoch:[0/2](783200/4588595) loss:2.611 lr:0.0000100 epoch_Time:24217.0min: [2024-01-06 04:43:59,294][model8_pretrain.py][INFO] 
Epoch:[0/2](783200/4588595) loss:3.127 lr:0.0000100 epoch_Time:24217.0min: [2024-01-06 04:43:59,294][model8_pretrain.py][INFO] Epoch:[0/2](783200/4588595) loss:3.266 lr:0.0000100 epoch_Time:24217.0min: [2024-01-06 04:43:59,294][model8_pretrain.py][INFO] Epoch:[0/2](783200/4588595) loss:2.659 lr:0.0000100 epoch_Time:24217.0min: [2024-01-06 04:44:36,236][model8_pretrain.py][INFO] Epoch:[0/2](783300/4588595) loss:2.880 lr:0.0000100 epoch_Time:24217.0min: [2024-01-06 04:44:36,236][model8_pretrain.py][INFO] Epoch:[0/2](783300/4588595) loss:3.230 lr:0.0000100 epoch_Time:24217.0min: [2024-01-06 04:44:36,236][model8_pretrain.py][INFO] Epoch:[0/2](783300/4588595) loss:2.703 lr:0.0000100 epoch_Time:24217.0min: [2024-01-06 04:44:36,236][model8_pretrain.py][INFO] Epoch:[0/2](783300/4588595) loss:2.537 lr:0.0000100 epoch_Time:24217.0min: [2024-01-06 04:44:36,236][model8_pretrain.py][INFO] Epoch:[0/2](783300/4588595) loss:2.519 lr:0.0000100 epoch_Time:24217.0min: [2024-01-06 04:44:36,236][model8_pretrain.py][INFO] Epoch:[0/2](783300/4588595) loss:3.168 lr:0.0000100 epoch_Time:24217.0min: [2024-01-06 04:44:36,236][model8_pretrain.py][INFO] Epoch:[0/2](783300/4588595) loss:2.932 lr:0.0000100 epoch_Time:24217.0min: [2024-01-06 04:44:36,236][model8_pretrain.py][INFO] Epoch:[0/2](783300/4588595) loss:2.713 lr:0.0000100 epoch_Time:24217.0min: [2024-01-06 04:45:13,209][model8_pretrain.py][INFO] Epoch:[0/2](783400/4588595) loss:3.267 lr:0.0000100 epoch_Time:24216.0min: [2024-01-06 04:45:13,209][model8_pretrain.py][INFO] Epoch:[0/2](783400/4588595) loss:2.858 lr:0.0000100 epoch_Time:24216.0min: [2024-01-06 04:45:13,209][model8_pretrain.py][INFO] Epoch:[0/2](783400/4588595) loss:3.030 lr:0.0000100 epoch_Time:24216.0min: [2024-01-06 04:45:13,209][model8_pretrain.py][INFO] Epoch:[0/2](783400/4588595) loss:2.811 lr:0.0000100 epoch_Time:24216.0min: [2024-01-06 04:45:13,209][model8_pretrain.py][INFO] Epoch:[0/2](783400/4588595) loss:2.936 lr:0.0000100 epoch_Time:24216.0min: [2024-01-06 
04:45:13,209][model8_pretrain.py][INFO] Epoch:[0/2](783400/4588595) loss:2.729 lr:0.0000100 epoch_Time:24216.0min: [2024-01-06 04:45:13,209][model8_pretrain.py][INFO] Epoch:[0/2](783400/4588595) loss:2.510 lr:0.0000100 epoch_Time:24216.0min: [2024-01-06 04:45:13,209][model8_pretrain.py][INFO] Epoch:[0/2](783400/4588595) loss:3.193 lr:0.0000100 epoch_Time:24216.0min: [2024-01-06 04:46:00,307][model8_pretrain.py][INFO] Epoch:[0/2](783500/4588595) loss:2.993 lr:0.0000100 epoch_Time:24215.0min: [2024-01-06 04:46:00,307][model8_pretrain.py][INFO] Epoch:[0/2](783500/4588595) loss:2.986 lr:0.0000100 epoch_Time:24215.0min: [2024-01-06 04:46:00,307][model8_pretrain.py][INFO] Epoch:[0/2](783500/4588595) loss:2.485 lr:0.0000100 epoch_Time:24215.0min: [2024-01-06 04:46:00,307][model8_pretrain.py][INFO] Epoch:[0/2](783500/4588595) loss:3.179 lr:0.0000100 epoch_Time:24215.0min: [2024-01-06 04:46:00,307][model8_pretrain.py][INFO] Epoch:[0/2](783500/4588595) loss:2.685 lr:0.0000100 epoch_Time:24215.0min: [2024-01-06 04:46:00,307][model8_pretrain.py][INFO] Epoch:[0/2](783500/4588595) loss:2.610 lr:0.0000100 epoch_Time:24215.0min: [2024-01-06 04:46:00,307][model8_pretrain.py][INFO] Epoch:[0/2](783500/4588595) loss:2.933 lr:0.0000100 epoch_Time:24215.0min: [2024-01-06 04:46:00,307][model8_pretrain.py][INFO] Epoch:[0/2](783500/4588595) loss:2.574 lr:0.0000100 epoch_Time:24215.0min: [2024-01-06 04:46:37,243][model8_pretrain.py][INFO] Epoch:[0/2](783600/4588595) loss:2.657 lr:0.0000100 epoch_Time:24215.0min: [2024-01-06 04:46:37,243][model8_pretrain.py][INFO] Epoch:[0/2](783600/4588595) loss:2.896 lr:0.0000100 epoch_Time:24215.0min: [2024-01-06 04:46:37,243][model8_pretrain.py][INFO] Epoch:[0/2](783600/4588595) loss:3.005 lr:0.0000100 epoch_Time:24215.0min: [2024-01-06 04:46:37,243][model8_pretrain.py][INFO] Epoch:[0/2](783600/4588595) loss:3.000 lr:0.0000100 epoch_Time:24215.0min: [2024-01-06 04:46:37,243][model8_pretrain.py][INFO] Epoch:[0/2](783600/4588595) loss:2.561 lr:0.0000100 
epoch_Time:24215.0min: [2024-01-06 04:46:37,243][model8_pretrain.py][INFO] Epoch:[0/2](783600/4588595) loss:3.093 lr:0.0000100 epoch_Time:24215.0min: [2024-01-06 04:46:37,243][model8_pretrain.py][INFO] Epoch:[0/2](783600/4588595) loss:2.528 lr:0.0000100 epoch_Time:24215.0min: [2024-01-06 04:46:37,243][model8_pretrain.py][INFO] Epoch:[0/2](783600/4588595) loss:2.717 lr:0.0000100 epoch_Time:24215.0min: [2024-01-06 04:47:14,181][model8_pretrain.py][INFO] Epoch:[0/2](783700/4588595) loss:2.218 lr:0.0000100 epoch_Time:24214.0min: [2024-01-06 04:47:14,181][model8_pretrain.py][INFO] Epoch:[0/2](783700/4588595) loss:2.836 lr:0.0000100 epoch_Time:24214.0min: [2024-01-06 04:47:14,181][model8_pretrain.py][INFO] Epoch:[0/2](783700/4588595) loss:2.857 lr:0.0000100 epoch_Time:24214.0min: [2024-01-06 04:47:14,181][model8_pretrain.py][INFO] Epoch:[0/2](783700/4588595) loss:3.025 lr:0.0000100 epoch_Time:24214.0min: [2024-01-06 04:47:14,182][model8_pretrain.py][INFO] Epoch:[0/2](783700/4588595) loss:2.422 lr:0.0000100 epoch_Time:24214.0min: [2024-01-06 04:47:14,182][model8_pretrain.py][INFO] Epoch:[0/2](783700/4588595) loss:2.844 lr:0.0000100 epoch_Time:24214.0min: [2024-01-06 04:47:14,182][model8_pretrain.py][INFO] Epoch:[0/2](783700/4588595) loss:3.398 lr:0.0000100 epoch_Time:24214.0min: [2024-01-06 04:47:14,182][model8_pretrain.py][INFO] Epoch:[0/2](783700/4588595) loss:2.676 lr:0.0000100 epoch_Time:24214.0min: [2024-01-06 04:47:51,119][model8_pretrain.py][INFO] Epoch:[0/2](783800/4588595) loss:2.633 lr:0.0000100 epoch_Time:24213.0min: [2024-01-06 04:47:51,119][model8_pretrain.py][INFO] Epoch:[0/2](783800/4588595) loss:2.934 lr:0.0000100 epoch_Time:24213.0min: [2024-01-06 04:47:51,119][model8_pretrain.py][INFO] Epoch:[0/2](783800/4588595) loss:2.597 lr:0.0000100 epoch_Time:24213.0min: [2024-01-06 04:47:51,119][model8_pretrain.py][INFO] Epoch:[0/2](783800/4588595) loss:3.057 lr:0.0000100 epoch_Time:24213.0min: [2024-01-06 04:47:51,119][model8_pretrain.py][INFO] 
Epoch:[0/2](783800/4588595) loss:2.449 lr:0.0000100 epoch_Time:24213.0min: [2024-01-06 04:47:51,119][model8_pretrain.py][INFO] Epoch:[0/2](783800/4588595) loss:2.931 lr:0.0000100 epoch_Time:24213.0min: [2024-01-06 04:47:51,119][model8_pretrain.py][INFO] Epoch:[0/2](783800/4588595) loss:2.632 lr:0.0000100 epoch_Time:24213.0min: [2024-01-06 04:47:51,119][model8_pretrain.py][INFO] Epoch:[0/2](783800/4588595) loss:2.941 lr:0.0000100 epoch_Time:24213.0min: [2024-01-06 04:48:28,056][model8_pretrain.py][INFO] Epoch:[0/2](783900/4588595) loss:2.349 lr:0.0000100 epoch_Time:24213.0min: [2024-01-06 04:48:28,056][model8_pretrain.py][INFO] Epoch:[0/2](783900/4588595) loss:2.467 lr:0.0000100 epoch_Time:24213.0min: [2024-01-06 04:48:28,056][model8_pretrain.py][INFO] Epoch:[0/2](783900/4588595) loss:3.026 lr:0.0000100 epoch_Time:24213.0min: [2024-01-06 04:48:28,056][model8_pretrain.py][INFO] Epoch:[0/2](783900/4588595) loss:2.812 lr:0.0000100 epoch_Time:24213.0min: [2024-01-06 04:48:28,056][model8_pretrain.py][INFO] Epoch:[0/2](783900/4588595) loss:2.903 lr:0.0000100 epoch_Time:24213.0min: [2024-01-06 04:48:28,056][model8_pretrain.py][INFO] Epoch:[0/2](783900/4588595) loss:2.283 lr:0.0000100 epoch_Time:24213.0min: [2024-01-06 04:48:28,056][model8_pretrain.py][INFO] Epoch:[0/2](783900/4588595) loss:2.583 lr:0.0000100 epoch_Time:24213.0min: [2024-01-06 04:48:28,056][model8_pretrain.py][INFO] Epoch:[0/2](783900/4588595) loss:2.600 lr:0.0000100 epoch_Time:24213.0min: [2024-01-06 04:49:04,996][model8_pretrain.py][INFO] Epoch:[0/2](784000/4588595) loss:3.442 lr:0.0000100 epoch_Time:24212.0min: [2024-01-06 04:49:04,996][model8_pretrain.py][INFO] Epoch:[0/2](784000/4588595) loss:2.842 lr:0.0000100 epoch_Time:24212.0min: [2024-01-06 04:49:04,996][model8_pretrain.py][INFO] Epoch:[0/2](784000/4588595) loss:3.078 lr:0.0000100 epoch_Time:24212.0min: [2024-01-06 04:49:04,996][model8_pretrain.py][INFO] Epoch:[0/2](784000/4588595) loss:3.500 lr:0.0000100 epoch_Time:24212.0min: [2024-01-06 
04:49:04,996][model8_pretrain.py][INFO] Epoch:[0/2](784000/4588595) loss:2.550 lr:0.0000100 epoch_Time:24212.0min: [2024-01-06 04:49:04,996][model8_pretrain.py][INFO] Epoch:[0/2](784000/4588595) loss:2.959 lr:0.0000100 epoch_Time:24212.0min: [2024-01-06 04:49:04,996][model8_pretrain.py][INFO] Epoch:[0/2](784000/4588595) loss:2.192 lr:0.0000100 epoch_Time:24212.0min: [2024-01-06 04:49:04,996][model8_pretrain.py][INFO] Epoch:[0/2](784000/4588595) loss:2.954 lr:0.0000100 epoch_Time:24212.0min: [2024-01-06 04:49:41,936][model8_pretrain.py][INFO] Epoch:[0/2](784100/4588595) loss:2.439 lr:0.0000100 epoch_Time:24212.0min: [2024-01-06 04:49:41,936][model8_pretrain.py][INFO] Epoch:[0/2](784100/4588595) loss:3.061 lr:0.0000100 epoch_Time:24212.0min: [2024-01-06 04:49:41,936][model8_pretrain.py][INFO] Epoch:[0/2](784100/4588595) loss:3.006 lr:0.0000100 epoch_Time:24212.0min: [2024-01-06 04:49:41,936][model8_pretrain.py][INFO] Epoch:[0/2](784100/4588595) loss:3.220 lr:0.0000100 epoch_Time:24212.0min: [2024-01-06 04:49:41,937][model8_pretrain.py][INFO] Epoch:[0/2](784100/4588595) loss:2.949 lr:0.0000100 epoch_Time:24212.0min: [2024-01-06 04:49:41,937][model8_pretrain.py][INFO] Epoch:[0/2](784100/4588595) loss:2.616 lr:0.0000100 epoch_Time:24212.0min: [2024-01-06 04:49:41,937][model8_pretrain.py][INFO] Epoch:[0/2](784100/4588595) loss:3.242 lr:0.0000100 epoch_Time:24212.0min: [2024-01-06 04:49:41,937][model8_pretrain.py][INFO] Epoch:[0/2](784100/4588595) loss:2.957 lr:0.0000100 epoch_Time:24212.0min: [2024-01-06 04:50:18,901][model8_pretrain.py][INFO] Epoch:[0/2](784200/4588595) loss:2.703 lr:0.0000100 epoch_Time:24211.0min: [2024-01-06 04:50:18,901][model8_pretrain.py][INFO] Epoch:[0/2](784200/4588595) loss:2.379 lr:0.0000100 epoch_Time:24211.0min: [2024-01-06 04:50:18,901][model8_pretrain.py][INFO] Epoch:[0/2](784200/4588595) loss:2.405 lr:0.0000100 epoch_Time:24211.0min: [2024-01-06 04:50:18,901][model8_pretrain.py][INFO] Epoch:[0/2](784200/4588595) loss:3.252 lr:0.0000100 
epoch_Time:24211.0min: [2024-01-06 04:50:18,901][model8_pretrain.py][INFO] Epoch:[0/2](784200/4588595) loss:3.105 lr:0.0000100 epoch_Time:24211.0min: [2024-01-06 04:50:18,901][model8_pretrain.py][INFO] Epoch:[0/2](784200/4588595) loss:2.221 lr:0.0000100 epoch_Time:24211.0min: [2024-01-06 04:50:18,901][model8_pretrain.py][INFO] Epoch:[0/2](784200/4588595) loss:3.153 lr:0.0000100 epoch_Time:24211.0min: [2024-01-06 04:50:18,901][model8_pretrain.py][INFO] Epoch:[0/2](784200/4588595) loss:3.027 lr:0.0000100 epoch_Time:24211.0min: [2024-01-06 04:51:06,074][model8_pretrain.py][INFO] Epoch:[0/2](784300/4588595) loss:2.869 lr:0.0000100 epoch_Time:24210.0min: [2024-01-06 04:51:06,074][model8_pretrain.py][INFO] Epoch:[0/2](784300/4588595) loss:2.645 lr:0.0000100 epoch_Time:24210.0min: [2024-01-06 04:51:06,074][model8_pretrain.py][INFO] Epoch:[0/2](784300/4588595) loss:3.195 lr:0.0000100 epoch_Time:24210.0min: [2024-01-06 04:51:06,074][model8_pretrain.py][INFO] Epoch:[0/2](784300/4588595) loss:2.664 lr:0.0000100 epoch_Time:24210.0min: [2024-01-06 04:51:06,074][model8_pretrain.py][INFO] Epoch:[0/2](784300/4588595) loss:2.437 lr:0.0000100 epoch_Time:24210.0min: [2024-01-06 04:51:06,074][model8_pretrain.py][INFO] Epoch:[0/2](784300/4588595) loss:3.206 lr:0.0000100 epoch_Time:24210.0min: [2024-01-06 04:51:06,074][model8_pretrain.py][INFO] Epoch:[0/2](784300/4588595) loss:3.109 lr:0.0000100 epoch_Time:24210.0min: [2024-01-06 04:51:06,075][model8_pretrain.py][INFO] Epoch:[0/2](784300/4588595) loss:3.083 lr:0.0000100 epoch_Time:24210.0min: [2024-01-06 04:51:43,004][model8_pretrain.py][INFO] Epoch:[0/2](784400/4588595) loss:2.750 lr:0.0000100 epoch_Time:24210.0min: [2024-01-06 04:51:43,004][model8_pretrain.py][INFO] Epoch:[0/2](784400/4588595) loss:3.317 lr:0.0000100 epoch_Time:24210.0min: [2024-01-06 04:51:43,004][model8_pretrain.py][INFO] Epoch:[0/2](784400/4588595) loss:2.389 lr:0.0000100 epoch_Time:24210.0min: [2024-01-06 04:51:43,004][model8_pretrain.py][INFO] 
Epoch:[0/2](784400/4588595) loss:2.788 lr:0.0000100 epoch_Time:24210.0min: [2024-01-06 04:51:43,004][model8_pretrain.py][INFO] Epoch:[0/2](784400/4588595) loss:2.795 lr:0.0000100 epoch_Time:24210.0min: [2024-01-06 04:51:43,004][model8_pretrain.py][INFO] Epoch:[0/2](784400/4588595) loss:2.884 lr:0.0000100 epoch_Time:24210.0min: [2024-01-06 04:51:43,004][model8_pretrain.py][INFO] Epoch:[0/2](784400/4588595) loss:3.128 lr:0.0000100 epoch_Time:24210.0min: [2024-01-06 04:51:43,004][model8_pretrain.py][INFO] Epoch:[0/2](784400/4588595) loss:2.255 lr:0.0000100 epoch_Time:24210.0min: [2024-01-06 04:52:19,953][model8_pretrain.py][INFO] Epoch:[0/2](784500/4588595) loss:3.020 lr:0.0000100 epoch_Time:24209.0min: [2024-01-06 04:52:19,953][model8_pretrain.py][INFO] Epoch:[0/2](784500/4588595) loss:2.760 lr:0.0000100 epoch_Time:24209.0min: [2024-01-06 04:52:19,953][model8_pretrain.py][INFO] Epoch:[0/2](784500/4588595) loss:2.812 lr:0.0000100 epoch_Time:24209.0min: [2024-01-06 04:52:19,953][model8_pretrain.py][INFO] Epoch:[0/2](784500/4588595) loss:3.000 lr:0.0000100 epoch_Time:24209.0min: [2024-01-06 04:52:19,953][model8_pretrain.py][INFO] Epoch:[0/2](784500/4588595) loss:3.169 lr:0.0000100 epoch_Time:24209.0min: [2024-01-06 04:52:19,953][model8_pretrain.py][INFO] Epoch:[0/2](784500/4588595) loss:2.540 lr:0.0000100 epoch_Time:24209.0min: [2024-01-06 04:52:19,953][model8_pretrain.py][INFO] Epoch:[0/2](784500/4588595) loss:2.803 lr:0.0000100 epoch_Time:24209.0min: [2024-01-06 04:52:19,953][model8_pretrain.py][INFO] Epoch:[0/2](784500/4588595) loss:2.740 lr:0.0000100 epoch_Time:24209.0min: [2024-01-06 04:52:56,890][model8_pretrain.py][INFO] Epoch:[0/2](784600/4588595) loss:2.613 lr:0.0000100 epoch_Time:24208.0min: [2024-01-06 04:52:56,890][model8_pretrain.py][INFO] Epoch:[0/2](784600/4588595) loss:2.319 lr:0.0000100 epoch_Time:24208.0min: [2024-01-06 04:52:56,890][model8_pretrain.py][INFO] Epoch:[0/2](784600/4588595) loss:2.602 lr:0.0000100 epoch_Time:24208.0min: [2024-01-06 
04:52:56,890][model8_pretrain.py][INFO] Epoch:[0/2](784600/4588595) loss:2.763 lr:0.0000100 epoch_Time:24208.0min: [2024-01-06 04:52:56,890][model8_pretrain.py][INFO] Epoch:[0/2](784600/4588595) loss:2.420 lr:0.0000100 epoch_Time:24208.0min: [2024-01-06 04:52:56,890][model8_pretrain.py][INFO] Epoch:[0/2](784600/4588595) loss:2.431 lr:0.0000100 epoch_Time:24208.0min: [2024-01-06 04:52:56,890][model8_pretrain.py][INFO] Epoch:[0/2](784600/4588595) loss:3.076 lr:0.0000100 epoch_Time:24208.0min: [2024-01-06 04:52:56,890][model8_pretrain.py][INFO] Epoch:[0/2](784600/4588595) loss:2.958 lr:0.0000100 epoch_Time:24208.0min: [2024-01-06 04:53:33,824][model8_pretrain.py][INFO] Epoch:[0/2](784700/4588595) loss:3.014 lr:0.0000100 epoch_Time:24208.0min: [2024-01-06 04:53:33,824][model8_pretrain.py][INFO] Epoch:[0/2](784700/4588595) loss:2.829 lr:0.0000100 epoch_Time:24208.0min: [2024-01-06 04:53:33,824][model8_pretrain.py][INFO] Epoch:[0/2](784700/4588595) loss:3.008 lr:0.0000100 epoch_Time:24208.0min: [2024-01-06 04:53:33,824][model8_pretrain.py][INFO] Epoch:[0/2](784700/4588595) loss:2.548 lr:0.0000100 epoch_Time:24208.0min: [2024-01-06 04:53:33,824][model8_pretrain.py][INFO] Epoch:[0/2](784700/4588595) loss:2.404 lr:0.0000100 epoch_Time:24208.0min: [2024-01-06 04:53:33,824][model8_pretrain.py][INFO] Epoch:[0/2](784700/4588595) loss:2.776 lr:0.0000100 epoch_Time:24208.0min: [2024-01-06 04:53:33,824][model8_pretrain.py][INFO] Epoch:[0/2](784700/4588595) loss:2.612 lr:0.0000100 epoch_Time:24208.0min: [2024-01-06 04:53:33,824][model8_pretrain.py][INFO] Epoch:[0/2](784700/4588595) loss:2.364 lr:0.0000100 epoch_Time:24208.0min: [2024-01-06 04:54:10,769][model8_pretrain.py][INFO] Epoch:[0/2](784800/4588595) loss:2.230 lr:0.0000100 epoch_Time:24207.0min: [2024-01-06 04:54:10,769][model8_pretrain.py][INFO] Epoch:[0/2](784800/4588595) loss:2.842 lr:0.0000100 epoch_Time:24207.0min: [2024-01-06 04:54:10,770][model8_pretrain.py][INFO] Epoch:[0/2](784800/4588595) loss:2.739 lr:0.0000100 
epoch_Time:24207.0min: [2024-01-06 04:54:10,770][model8_pretrain.py][INFO] Epoch:[0/2](784800/4588595) loss:2.817 lr:0.0000100 epoch_Time:24207.0min: [2024-01-06 04:54:10,770][model8_pretrain.py][INFO] Epoch:[0/2](784800/4588595) loss:2.644 lr:0.0000100 epoch_Time:24207.0min: [2024-01-06 04:54:10,770][model8_pretrain.py][INFO] Epoch:[0/2](784800/4588595) loss:3.172 lr:0.0000100 epoch_Time:24207.0min: [2024-01-06 04:54:10,770][model8_pretrain.py][INFO] Epoch:[0/2](784800/4588595) loss:2.441 lr:0.0000100 epoch_Time:24207.0min: [2024-01-06 04:54:10,770][model8_pretrain.py][INFO] Epoch:[0/2](784800/4588595) loss:2.360 lr:0.0000100 epoch_Time:24207.0min: [2024-01-06 04:54:47,729][model8_pretrain.py][INFO] Epoch:[0/2](784900/4588595) loss:2.983 lr:0.0000100 epoch_Time:24206.0min: [2024-01-06 04:54:47,729][model8_pretrain.py][INFO] Epoch:[0/2](784900/4588595) loss:2.633 lr:0.0000100 epoch_Time:24206.0min: [2024-01-06 04:54:47,729][model8_pretrain.py][INFO] Epoch:[0/2](784900/4588595) loss:2.699 lr:0.0000100 epoch_Time:24206.0min: [2024-01-06 04:54:47,729][model8_pretrain.py][INFO] Epoch:[0/2](784900/4588595) loss:3.035 lr:0.0000100 epoch_Time:24206.0min: [2024-01-06 04:54:47,730][model8_pretrain.py][INFO] Epoch:[0/2](784900/4588595) loss:2.219 lr:0.0000100 epoch_Time:24206.0min: [2024-01-06 04:54:47,730][model8_pretrain.py][INFO] Epoch:[0/2](784900/4588595) loss:3.586 lr:0.0000100 epoch_Time:24206.0min: [2024-01-06 04:54:47,730][model8_pretrain.py][INFO] Epoch:[0/2](784900/4588595) loss:2.998 lr:0.0000100 epoch_Time:24206.0min: [2024-01-06 04:54:47,730][model8_pretrain.py][INFO] Epoch:[0/2](784900/4588595) loss:2.569 lr:0.0000100 epoch_Time:24206.0min: [2024-01-06 04:55:24,672][model8_pretrain.py][INFO] Epoch:[0/2](785000/4588595) loss:3.052 lr:0.0000100 epoch_Time:24206.0min: [2024-01-06 04:55:24,672][model8_pretrain.py][INFO] Epoch:[0/2](785000/4588595) loss:3.153 lr:0.0000100 epoch_Time:24206.0min: [2024-01-06 04:55:24,672][model8_pretrain.py][INFO] 
Epoch:[0/2](785000/4588595) loss:2.757 lr:0.0000100 epoch_Time:24206.0min: [2024-01-06 04:55:24,672][model8_pretrain.py][INFO] Epoch:[0/2](785000/4588595) loss:2.814 lr:0.0000100 epoch_Time:24206.0min: [2024-01-06 04:55:24,672][model8_pretrain.py][INFO] Epoch:[0/2](785000/4588595) loss:3.003 lr:0.0000100 epoch_Time:24206.0min: [2024-01-06 04:55:24,672][model8_pretrain.py][INFO] Epoch:[0/2](785000/4588595) loss:2.924 lr:0.0000100 epoch_Time:24206.0min: [2024-01-06 04:55:24,672][model8_pretrain.py][INFO] Epoch:[0/2](785000/4588595) loss:2.800 lr:0.0000100 epoch_Time:24206.0min: [2024-01-06 04:55:24,672][model8_pretrain.py][INFO] Epoch:[0/2](785000/4588595) loss:2.203 lr:0.0000100 epoch_Time:24206.0min: [2024-01-06 04:56:11,857][model8_pretrain.py][INFO] Epoch:[0/2](785100/4588595) loss:2.830 lr:0.0000100 epoch_Time:24205.0min: [2024-01-06 04:56:11,857][model8_pretrain.py][INFO] Epoch:[0/2](785100/4588595) loss:2.454 lr:0.0000100 epoch_Time:24205.0min: [2024-01-06 04:56:11,857][model8_pretrain.py][INFO] Epoch:[0/2](785100/4588595) loss:2.680 lr:0.0000100 epoch_Time:24205.0min: [2024-01-06 04:56:11,857][model8_pretrain.py][INFO] Epoch:[0/2](785100/4588595) loss:2.738 lr:0.0000100 epoch_Time:24205.0min: [2024-01-06 04:56:11,857][model8_pretrain.py][INFO] Epoch:[0/2](785100/4588595) loss:2.628 lr:0.0000100 epoch_Time:24205.0min: [2024-01-06 04:56:11,857][model8_pretrain.py][INFO] Epoch:[0/2](785100/4588595) loss:3.054 lr:0.0000100 epoch_Time:24205.0min: [2024-01-06 04:56:11,857][model8_pretrain.py][INFO] Epoch:[0/2](785100/4588595) loss:2.523 lr:0.0000100 epoch_Time:24205.0min: [2024-01-06 04:56:11,857][model8_pretrain.py][INFO] Epoch:[0/2](785100/4588595) loss:2.486 lr:0.0000100 epoch_Time:24205.0min: [2024-01-06 04:56:48,797][model8_pretrain.py][INFO] Epoch:[0/2](785200/4588595) loss:2.704 lr:0.0000100 epoch_Time:24204.0min: [2024-01-06 04:56:48,797][model8_pretrain.py][INFO] Epoch:[0/2](785200/4588595) loss:2.935 lr:0.0000100 epoch_Time:24204.0min: [2024-01-06 
04:56:48,797][model8_pretrain.py][INFO] Epoch:[0/2](785200/4588595) loss:2.935 lr:0.0000100 epoch_Time:24204.0min: [2024-01-06 04:56:48,797][model8_pretrain.py][INFO] Epoch:[0/2](785200/4588595) loss:2.368 lr:0.0000100 epoch_Time:24204.0min: [2024-01-06 04:56:48,797][model8_pretrain.py][INFO] Epoch:[0/2](785200/4588595) loss:2.206 lr:0.0000100 epoch_Time:24204.0min: [2024-01-06 04:56:48,797][model8_pretrain.py][INFO] Epoch:[0/2](785200/4588595) loss:2.601 lr:0.0000100 epoch_Time:24204.0min: [2024-01-06 04:56:48,797][model8_pretrain.py][INFO] Epoch:[0/2](785200/4588595) loss:3.262 lr:0.0000100 epoch_Time:24204.0min: [2024-01-06 04:56:48,797][model8_pretrain.py][INFO] Epoch:[0/2](785200/4588595) loss:2.911 lr:0.0000100 epoch_Time:24204.0min: [2024-01-06 04:57:25,726][model8_pretrain.py][INFO] Epoch:[0/2](785300/4588595) loss:3.202 lr:0.0000100 epoch_Time:24204.0min: [2024-01-06 04:57:25,726][model8_pretrain.py][INFO] Epoch:[0/2](785300/4588595) loss:1.952 lr:0.0000100 epoch_Time:24204.0min: [2024-01-06 04:57:25,726][model8_pretrain.py][INFO] Epoch:[0/2](785300/4588595) loss:3.350 lr:0.0000100 epoch_Time:24204.0min: [2024-01-06 04:57:25,726][model8_pretrain.py][INFO] Epoch:[0/2](785300/4588595) loss:2.998 lr:0.0000100 epoch_Time:24204.0min: [2024-01-06 04:57:25,726][model8_pretrain.py][INFO] Epoch:[0/2](785300/4588595) loss:2.682 lr:0.0000100 epoch_Time:24204.0min: [2024-01-06 04:57:25,726][model8_pretrain.py][INFO] Epoch:[0/2](785300/4588595) loss:2.493 lr:0.0000100 epoch_Time:24204.0min: [2024-01-06 04:57:25,727][model8_pretrain.py][INFO] Epoch:[0/2](785300/4588595) loss:2.792 lr:0.0000100 epoch_Time:24204.0min: [2024-01-06 04:57:25,727][model8_pretrain.py][INFO] Epoch:[0/2](785300/4588595) loss:2.867 lr:0.0000100 epoch_Time:24204.0min: [2024-01-06 04:58:02,672][model8_pretrain.py][INFO] Epoch:[0/2](785400/4588595) loss:2.941 lr:0.0000100 epoch_Time:24203.0min: [2024-01-06 04:58:02,672][model8_pretrain.py][INFO] Epoch:[0/2](785400/4588595) loss:2.483 lr:0.0000100 
epoch_Time:24203.0min: [2024-01-06 04:58:02,672][model8_pretrain.py][INFO] Epoch:[0/2](785400/4588595) loss:2.582 lr:0.0000100 epoch_Time:24203.0min: [2024-01-06 04:58:02,672][model8_pretrain.py][INFO] Epoch:[0/2](785400/4588595) loss:2.604 lr:0.0000100 epoch_Time:24203.0min: [2024-01-06 04:58:02,672][model8_pretrain.py][INFO] Epoch:[0/2](785400/4588595) loss:2.638 lr:0.0000100 epoch_Time:24203.0min: [2024-01-06 04:58:02,672][model8_pretrain.py][INFO] Epoch:[0/2](785400/4588595) loss:2.638 lr:0.0000100 epoch_Time:24203.0min: [2024-01-06 04:58:02,672][model8_pretrain.py][INFO] Epoch:[0/2](785400/4588595) loss:2.713 lr:0.0000100 epoch_Time:24203.0min: [2024-01-06 04:58:02,672][model8_pretrain.py][INFO] Epoch:[0/2](785400/4588595) loss:3.028 lr:0.0000100 epoch_Time:24203.0min: [2024-01-06 04:58:39,616][model8_pretrain.py][INFO] Epoch:[0/2](785500/4588595) loss:3.164 lr:0.0000100 epoch_Time:24203.0min: [2024-01-06 04:58:39,616][model8_pretrain.py][INFO] Epoch:[0/2](785500/4588595) loss:2.921 lr:0.0000100 epoch_Time:24203.0min: [2024-01-06 04:58:39,616][model8_pretrain.py][INFO] Epoch:[0/2](785500/4588595) loss:2.961 lr:0.0000100 epoch_Time:24203.0min: [2024-01-06 04:58:39,616][model8_pretrain.py][INFO] Epoch:[0/2](785500/4588595) loss:3.095 lr:0.0000100 epoch_Time:24203.0min: [2024-01-06 04:58:39,616][model8_pretrain.py][INFO] Epoch:[0/2](785500/4588595) loss:2.772 lr:0.0000100 epoch_Time:24203.0min: [2024-01-06 04:58:39,616][model8_pretrain.py][INFO] Epoch:[0/2](785500/4588595) loss:3.121 lr:0.0000100 epoch_Time:24203.0min: [2024-01-06 04:58:39,617][model8_pretrain.py][INFO] Epoch:[0/2](785500/4588595) loss:1.982 lr:0.0000100 epoch_Time:24203.0min: [2024-01-06 04:58:39,617][model8_pretrain.py][INFO] Epoch:[0/2](785500/4588595) loss:2.739 lr:0.0000100 epoch_Time:24203.0min: [2024-01-06 04:59:16,580][model8_pretrain.py][INFO] Epoch:[0/2](785600/4588595) loss:2.972 lr:0.0000100 epoch_Time:24202.0min: [2024-01-06 04:59:16,580][model8_pretrain.py][INFO] 
Epoch:[0/2](785600/4588595) loss:2.432 lr:0.0000100 epoch_Time:24202.0min: [2024-01-06 04:59:16,580][model8_pretrain.py][INFO] Epoch:[0/2](785600/4588595) loss:2.600 lr:0.0000100 epoch_Time:24202.0min: [2024-01-06 04:59:16,580][model8_pretrain.py][INFO] Epoch:[0/2](785600/4588595) loss:2.682 lr:0.0000100 epoch_Time:24202.0min: [2024-01-06 04:59:16,580][model8_pretrain.py][INFO] Epoch:[0/2](785600/4588595) loss:3.052 lr:0.0000100 epoch_Time:24202.0min: [2024-01-06 04:59:16,580][model8_pretrain.py][INFO] Epoch:[0/2](785600/4588595) loss:3.071 lr:0.0000100 epoch_Time:24202.0min: [2024-01-06 04:59:16,580][model8_pretrain.py][INFO] Epoch:[0/2](785600/4588595) loss:2.907 lr:0.0000100 epoch_Time:24202.0min: [2024-01-06 04:59:16,580][model8_pretrain.py][INFO] Epoch:[0/2](785600/4588595) loss:2.116 lr:0.0000100 epoch_Time:24202.0min: [2024-01-06 04:59:53,516][model8_pretrain.py][INFO] Epoch:[0/2](785700/4588595) loss:2.748 lr:0.0000100 epoch_Time:24201.0min: [2024-01-06 04:59:53,516][model8_pretrain.py][INFO] Epoch:[0/2](785700/4588595) loss:2.851 lr:0.0000100 epoch_Time:24201.0min: [2024-01-06 04:59:53,516][model8_pretrain.py][INFO] Epoch:[0/2](785700/4588595) loss:3.288 lr:0.0000100 epoch_Time:24201.0min: [2024-01-06 04:59:53,516][model8_pretrain.py][INFO] Epoch:[0/2](785700/4588595) loss:2.913 lr:0.0000100 epoch_Time:24201.0min: [2024-01-06 04:59:53,516][model8_pretrain.py][INFO] Epoch:[0/2](785700/4588595) loss:2.824 lr:0.0000100 epoch_Time:24201.0min: [2024-01-06 04:59:53,516][model8_pretrain.py][INFO] Epoch:[0/2](785700/4588595) loss:2.374 lr:0.0000100 epoch_Time:24201.0min: [2024-01-06 04:59:53,516][model8_pretrain.py][INFO] Epoch:[0/2](785700/4588595) loss:2.442 lr:0.0000100 epoch_Time:24201.0min: [2024-01-06 04:59:53,516][model8_pretrain.py][INFO] Epoch:[0/2](785700/4588595) loss:1.814 lr:0.0000100 epoch_Time:24201.0min: [2024-01-06 05:00:30,433][model8_pretrain.py][INFO] Epoch:[0/2](785800/4588595) loss:2.812 lr:0.0000100 epoch_Time:24201.0min: [2024-01-06 
05:00:30,433][model8_pretrain.py][INFO] Epoch:[0/2](785800/4588595) loss:2.872 lr:0.0000100 epoch_Time:24201.0min: [2024-01-06 05:00:30,433][model8_pretrain.py][INFO] Epoch:[0/2](785800/4588595) loss:2.273 lr:0.0000100 epoch_Time:24201.0min: [2024-01-06 05:00:30,433][model8_pretrain.py][INFO] Epoch:[0/2](785800/4588595) loss:2.453 lr:0.0000100 epoch_Time:24201.0min: [2024-01-06 05:00:30,433][model8_pretrain.py][INFO] Epoch:[0/2](785800/4588595) loss:2.169 lr:0.0000100 epoch_Time:24201.0min: [2024-01-06 05:00:30,433][model8_pretrain.py][INFO] Epoch:[0/2](785800/4588595) loss:2.877 lr:0.0000100 epoch_Time:24201.0min: [2024-01-06 05:00:30,433][model8_pretrain.py][INFO] Epoch:[0/2](785800/4588595) loss:2.761 lr:0.0000100 epoch_Time:24201.0min: [2024-01-06 05:00:30,433][model8_pretrain.py][INFO] Epoch:[0/2](785800/4588595) loss:2.946 lr:0.0000100 epoch_Time:24201.0min: [2024-01-06 05:01:17,800][model8_pretrain.py][INFO] Epoch:[0/2](785900/4588595) loss:2.311 lr:0.0000100 epoch_Time:24200.0min: [2024-01-06 05:01:17,800][model8_pretrain.py][INFO] Epoch:[0/2](785900/4588595) loss:3.092 lr:0.0000100 epoch_Time:24200.0min: [2024-01-06 05:01:17,800][model8_pretrain.py][INFO] Epoch:[0/2](785900/4588595) loss:2.611 lr:0.0000100 epoch_Time:24200.0min: [2024-01-06 05:01:17,800][model8_pretrain.py][INFO] Epoch:[0/2](785900/4588595) loss:2.845 lr:0.0000100 epoch_Time:24200.0min: [2024-01-06 05:01:17,800][model8_pretrain.py][INFO] Epoch:[0/2](785900/4588595) loss:2.494 lr:0.0000100 epoch_Time:24200.0min: [2024-01-06 05:01:17,800][model8_pretrain.py][INFO] Epoch:[0/2](785900/4588595) loss:2.249 lr:0.0000100 epoch_Time:24200.0min: [2024-01-06 05:01:17,800][model8_pretrain.py][INFO] Epoch:[0/2](785900/4588595) loss:3.344 lr:0.0000100 epoch_Time:24200.0min: [2024-01-06 05:01:17,800][model8_pretrain.py][INFO] Epoch:[0/2](785900/4588595) loss:3.230 lr:0.0000100 epoch_Time:24200.0min: [2024-01-06 05:01:54,732][model8_pretrain.py][INFO] Epoch:[0/2](786000/4588595) loss:2.554 lr:0.0000100 
epoch_Time:24199.0min: [2024-01-06 05:01:54,732][model8_pretrain.py][INFO] Epoch:[0/2](786000/4588595) loss:2.706 lr:0.0000100 epoch_Time:24199.0min: [2024-01-06 05:01:54,733][model8_pretrain.py][INFO] Epoch:[0/2](786000/4588595) loss:2.251 lr:0.0000100 epoch_Time:24199.0min: [2024-01-06 05:01:54,733][model8_pretrain.py][INFO] Epoch:[0/2](786000/4588595) loss:3.163 lr:0.0000100 epoch_Time:24199.0min: [2024-01-06 05:01:54,733][model8_pretrain.py][INFO] Epoch:[0/2](786000/4588595) loss:2.881 lr:0.0000100 epoch_Time:24199.0min: [2024-01-06 05:01:54,733][model8_pretrain.py][INFO] Epoch:[0/2](786000/4588595) loss:2.599 lr:0.0000100 epoch_Time:24199.0min: [2024-01-06 05:01:54,733][model8_pretrain.py][INFO] Epoch:[0/2](786000/4588595) loss:2.985 lr:0.0000100 epoch_Time:24199.0min: [2024-01-06 05:01:54,733][model8_pretrain.py][INFO] Epoch:[0/2](786000/4588595) loss:3.063 lr:0.0000100 epoch_Time:24199.0min: [2024-01-06 05:02:31,673][model8_pretrain.py][INFO] Epoch:[0/2](786100/4588595) loss:2.583 lr:0.0000100 epoch_Time:24199.0min: [2024-01-06 05:02:31,673][model8_pretrain.py][INFO] Epoch:[0/2](786100/4588595) loss:2.589 lr:0.0000100 epoch_Time:24199.0min: [2024-01-06 05:02:31,673][model8_pretrain.py][INFO] Epoch:[0/2](786100/4588595) loss:3.098 lr:0.0000100 epoch_Time:24199.0min: [2024-01-06 05:02:31,673][model8_pretrain.py][INFO] Epoch:[0/2](786100/4588595) loss:2.704 lr:0.0000100 epoch_Time:24199.0min: [2024-01-06 05:02:31,673][model8_pretrain.py][INFO] Epoch:[0/2](786100/4588595) loss:2.844 lr:0.0000100 epoch_Time:24199.0min: [2024-01-06 05:02:31,673][model8_pretrain.py][INFO] Epoch:[0/2](786100/4588595) loss:3.005 lr:0.0000100 epoch_Time:24199.0min: [2024-01-06 05:02:31,673][model8_pretrain.py][INFO] Epoch:[0/2](786100/4588595) loss:2.620 lr:0.0000100 epoch_Time:24199.0min: [2024-01-06 05:02:31,674][model8_pretrain.py][INFO] Epoch:[0/2](786100/4588595) loss:2.497 lr:0.0000100 epoch_Time:24199.0min: [2024-01-06 05:03:08,622][model8_pretrain.py][INFO] 
Epoch:[0/2](786200/4588595) loss:3.077 lr:0.0000100 epoch_Time:24198.0min: [2024-01-06 05:03:08,622][model8_pretrain.py][INFO] Epoch:[0/2](786200/4588595) loss:2.794 lr:0.0000100 epoch_Time:24198.0min: [2024-01-06 05:03:08,622][model8_pretrain.py][INFO] Epoch:[0/2](786200/4588595) loss:2.776 lr:0.0000100 epoch_Time:24198.0min: [2024-01-06 05:03:08,622][model8_pretrain.py][INFO] Epoch:[0/2](786200/4588595) loss:2.918 lr:0.0000100 epoch_Time:24198.0min: [2024-01-06 05:03:08,622][model8_pretrain.py][INFO] Epoch:[0/2](786200/4588595) loss:2.566 lr:0.0000100 epoch_Time:24198.0min: [2024-01-06 05:03:08,622][model8_pretrain.py][INFO] Epoch:[0/2](786200/4588595) loss:2.639 lr:0.0000100 epoch_Time:24198.0min: [2024-01-06 05:03:08,622][model8_pretrain.py][INFO] Epoch:[0/2](786200/4588595) loss:2.905 lr:0.0000100 epoch_Time:24198.0min: [2024-01-06 05:03:08,622][model8_pretrain.py][INFO] Epoch:[0/2](786200/4588595) loss:2.565 lr:0.0000100 epoch_Time:24198.0min: [2024-01-06 05:03:45,564][model8_pretrain.py][INFO] Epoch:[0/2](786300/4588595) loss:2.888 lr:0.0000100 epoch_Time:24198.0min: [2024-01-06 05:03:45,564][model8_pretrain.py][INFO] Epoch:[0/2](786300/4588595) loss:2.562 lr:0.0000100 epoch_Time:24198.0min: [2024-01-06 05:03:45,564][model8_pretrain.py][INFO] Epoch:[0/2](786300/4588595) loss:2.840 lr:0.0000100 epoch_Time:24198.0min: [2024-01-06 05:03:45,564][model8_pretrain.py][INFO] Epoch:[0/2](786300/4588595) loss:2.346 lr:0.0000100 epoch_Time:24198.0min: [2024-01-06 05:03:45,564][model8_pretrain.py][INFO] Epoch:[0/2](786300/4588595) loss:3.165 lr:0.0000100 epoch_Time:24198.0min: [2024-01-06 05:03:45,564][model8_pretrain.py][INFO] Epoch:[0/2](786300/4588595) loss:2.963 lr:0.0000100 epoch_Time:24198.0min: [2024-01-06 05:03:45,564][model8_pretrain.py][INFO] Epoch:[0/2](786300/4588595) loss:2.889 lr:0.0000100 epoch_Time:24198.0min: [2024-01-06 05:03:45,564][model8_pretrain.py][INFO] Epoch:[0/2](786300/4588595) loss:2.894 lr:0.0000100 epoch_Time:24198.0min: [2024-01-06 
05:04:22,508][model8_pretrain.py][INFO] Epoch:[0/2](786400/4588595) loss:3.111 lr:0.0000100 epoch_Time:24197.0min: [2024-01-06 05:04:22,508][model8_pretrain.py][INFO] Epoch:[0/2](786400/4588595) loss:2.299 lr:0.0000100 epoch_Time:24197.0min: [2024-01-06 05:04:22,508][model8_pretrain.py][INFO] Epoch:[0/2](786400/4588595) loss:2.919 lr:0.0000100 epoch_Time:24197.0min: [2024-01-06 05:04:22,508][model8_pretrain.py][INFO] Epoch:[0/2](786400/4588595) loss:3.115 lr:0.0000100 epoch_Time:24197.0min: [2024-01-06 05:04:22,508][model8_pretrain.py][INFO] Epoch:[0/2](786400/4588595) loss:2.846 lr:0.0000100 epoch_Time:24197.0min: [2024-01-06 05:04:22,508][model8_pretrain.py][INFO] Epoch:[0/2](786400/4588595) loss:2.430 lr:0.0000100 epoch_Time:24197.0min: [2024-01-06 05:04:22,509][model8_pretrain.py][INFO] Epoch:[0/2](786400/4588595) loss:3.101 lr:0.0000100 epoch_Time:24197.0min: [2024-01-06 05:04:22,509][model8_pretrain.py][INFO] Epoch:[0/2](786400/4588595) loss:3.033 lr:0.0000100 epoch_Time:24197.0min: [2024-01-06 05:04:59,456][model8_pretrain.py][INFO] Epoch:[0/2](786500/4588595) loss:2.652 lr:0.0000100 epoch_Time:24196.0min: [2024-01-06 05:04:59,456][model8_pretrain.py][INFO] Epoch:[0/2](786500/4588595) loss:2.818 lr:0.0000100 epoch_Time:24196.0min: [2024-01-06 05:04:59,456][model8_pretrain.py][INFO] Epoch:[0/2](786500/4588595) loss:2.403 lr:0.0000100 epoch_Time:24196.0min: [2024-01-06 05:04:59,457][model8_pretrain.py][INFO] Epoch:[0/2](786500/4588595) loss:2.940 lr:0.0000100 epoch_Time:24196.0min: [2024-01-06 05:04:59,457][model8_pretrain.py][INFO] Epoch:[0/2](786500/4588595) loss:3.158 lr:0.0000100 epoch_Time:24196.0min: [2024-01-06 05:04:59,457][model8_pretrain.py][INFO] Epoch:[0/2](786500/4588595) loss:2.996 lr:0.0000100 epoch_Time:24196.0min: [2024-01-06 05:04:59,457][model8_pretrain.py][INFO] Epoch:[0/2](786500/4588595) loss:3.070 lr:0.0000100 epoch_Time:24196.0min: [2024-01-06 05:04:59,457][model8_pretrain.py][INFO] Epoch:[0/2](786500/4588595) loss:2.495 lr:0.0000100 
epoch_Time:24196.0min: [2024-01-06 05:05:36,405][model8_pretrain.py][INFO] Epoch:[0/2](786600/4588595) loss:3.208 lr:0.0000100 epoch_Time:24196.0min: [2024-01-06 05:05:36,405][model8_pretrain.py][INFO] Epoch:[0/2](786600/4588595) loss:3.021 lr:0.0000100 epoch_Time:24196.0min: [2024-01-06 05:05:36,405][model8_pretrain.py][INFO] Epoch:[0/2](786600/4588595) loss:2.994 lr:0.0000100 epoch_Time:24196.0min: [2024-01-06 05:05:36,405][model8_pretrain.py][INFO] Epoch:[0/2](786600/4588595) loss:3.271 lr:0.0000100 epoch_Time:24196.0min: [2024-01-06 05:05:36,405][model8_pretrain.py][INFO] Epoch:[0/2](786600/4588595) loss:2.818 lr:0.0000100 epoch_Time:24196.0min: [2024-01-06 05:05:36,405][model8_pretrain.py][INFO] Epoch:[0/2](786600/4588595) loss:2.773 lr:0.0000100 epoch_Time:24196.0min: [2024-01-06 05:05:36,405][model8_pretrain.py][INFO] Epoch:[0/2](786600/4588595) loss:2.893 lr:0.0000100 epoch_Time:24196.0min: [2024-01-06 05:05:36,405][model8_pretrain.py][INFO] Epoch:[0/2](786600/4588595) loss:2.901 lr:0.0000100 epoch_Time:24196.0min: [2024-01-06 05:06:23,865][model8_pretrain.py][INFO] Epoch:[0/2](786700/4588595) loss:2.661 lr:0.0000100 epoch_Time:24196.0min: [2024-01-06 05:06:23,865][model8_pretrain.py][INFO] Epoch:[0/2](786700/4588595) loss:2.759 lr:0.0000100 epoch_Time:24196.0min: [2024-01-06 05:06:23,865][model8_pretrain.py][INFO] Epoch:[0/2](786700/4588595) loss:2.949 lr:0.0000100 epoch_Time:24196.0min: [2024-01-06 05:06:23,865][model8_pretrain.py][INFO] Epoch:[0/2](786700/4588595) loss:2.239 lr:0.0000100 epoch_Time:24196.0min: [2024-01-06 05:06:23,865][model8_pretrain.py][INFO] Epoch:[0/2](786700/4588595) loss:2.714 lr:0.0000100 epoch_Time:24196.0min: [2024-01-06 05:06:23,865][model8_pretrain.py][INFO] Epoch:[0/2](786700/4588595) loss:3.292 lr:0.0000100 epoch_Time:24196.0min: [2024-01-06 05:06:23,866][model8_pretrain.py][INFO] Epoch:[0/2](786700/4588595) loss:2.757 lr:0.0000100 epoch_Time:24196.0min: [2024-01-06 05:06:23,866][model8_pretrain.py][INFO] 
Epoch:[0/2](786700/4588595) loss:3.212 lr:0.0000100 epoch_Time:24196.0min: [2024-01-06 05:07:00,794][model8_pretrain.py][INFO] Epoch:[0/2](786800/4588595) loss:2.927 lr:0.0000100 epoch_Time:24194.0min: [2024-01-06 05:07:00,794][model8_pretrain.py][INFO] Epoch:[0/2](786800/4588595) loss:2.577 lr:0.0000100 epoch_Time:24194.0min: [2024-01-06 05:07:00,794][model8_pretrain.py][INFO] Epoch:[0/2](786800/4588595) loss:2.552 lr:0.0000100 epoch_Time:24194.0min: [2024-01-06 05:07:00,794][model8_pretrain.py][INFO] Epoch:[0/2](786800/4588595) loss:1.840 lr:0.0000100 epoch_Time:24194.0min: [2024-01-06 05:07:00,794][model8_pretrain.py][INFO] Epoch:[0/2](786800/4588595) loss:2.717 lr:0.0000100 epoch_Time:24194.0min: [2024-01-06 05:07:00,794][model8_pretrain.py][INFO] Epoch:[0/2](786800/4588595) loss:2.512 lr:0.0000100 epoch_Time:24194.0min: [2024-01-06 05:07:00,794][model8_pretrain.py][INFO] Epoch:[0/2](786800/4588595) loss:2.812 lr:0.0000100 epoch_Time:24194.0min: [2024-01-06 05:07:00,794][model8_pretrain.py][INFO] Epoch:[0/2](786800/4588595) loss:2.989 lr:0.0000100 epoch_Time:24194.0min: [2024-01-06 05:07:37,736][model8_pretrain.py][INFO] Epoch:[0/2](786900/4588595) loss:3.130 lr:0.0000100 epoch_Time:24194.0min: [2024-01-06 05:07:37,736][model8_pretrain.py][INFO] Epoch:[0/2](786900/4588595) loss:3.081 lr:0.0000100 epoch_Time:24194.0min: [2024-01-06 05:07:37,736][model8_pretrain.py][INFO] Epoch:[0/2](786900/4588595) loss:2.927 lr:0.0000100 epoch_Time:24194.0min: [2024-01-06 05:07:37,736][model8_pretrain.py][INFO] Epoch:[0/2](786900/4588595) loss:2.434 lr:0.0000100 epoch_Time:24194.0min: [2024-01-06 05:07:37,736][model8_pretrain.py][INFO] Epoch:[0/2](786900/4588595) loss:3.415 lr:0.0000100 epoch_Time:24194.0min: [2024-01-06 05:07:37,736][model8_pretrain.py][INFO] Epoch:[0/2](786900/4588595) loss:2.806 lr:0.0000100 epoch_Time:24194.0min: [2024-01-06 05:07:37,736][model8_pretrain.py][INFO] Epoch:[0/2](786900/4588595) loss:2.856 lr:0.0000100 epoch_Time:24194.0min: [2024-01-06 
05:07:37,736][model8_pretrain.py][INFO] Epoch:[0/2](786900/4588595) loss:2.674 lr:0.0000100 epoch_Time:24194.0min: [2024-01-06 05:08:14,689][model8_pretrain.py][INFO] Epoch:[0/2](787000/4588595) loss:2.886 lr:0.0000100 epoch_Time:24193.0min: [2024-01-06 05:08:14,689][model8_pretrain.py][INFO] Epoch:[0/2](787000/4588595) loss:3.313 lr:0.0000100 epoch_Time:24193.0min: [2024-01-06 05:08:14,689][model8_pretrain.py][INFO] Epoch:[0/2](787000/4588595) loss:2.542 lr:0.0000100 epoch_Time:24193.0min: [2024-01-06 05:08:14,689][model8_pretrain.py][INFO] Epoch:[0/2](787000/4588595) loss:2.999 lr:0.0000100 epoch_Time:24193.0min: [2024-01-06 05:08:14,689][model8_pretrain.py][INFO] Epoch:[0/2](787000/4588595) loss:3.414 lr:0.0000100 epoch_Time:24193.0min: [2024-01-06 05:08:14,689][model8_pretrain.py][INFO] Epoch:[0/2](787000/4588595) loss:2.091 lr:0.0000100 epoch_Time:24193.0min: [2024-01-06 05:08:14,689][model8_pretrain.py][INFO] Epoch:[0/2](787000/4588595) loss:3.328 lr:0.0000100 epoch_Time:24193.0min: [2024-01-06 05:08:14,689][model8_pretrain.py][INFO] Epoch:[0/2](787000/4588595) loss:3.161 lr:0.0000100 epoch_Time:24193.0min: [2024-01-06 05:08:51,642][model8_pretrain.py][INFO] Epoch:[0/2](787100/4588595) loss:2.835 lr:0.0000100 epoch_Time:24192.0min: [2024-01-06 05:08:51,642][model8_pretrain.py][INFO] Epoch:[0/2](787100/4588595) loss:2.864 lr:0.0000100 epoch_Time:24192.0min: [2024-01-06 05:08:51,642][model8_pretrain.py][INFO] Epoch:[0/2](787100/4588595) loss:2.622 lr:0.0000100 epoch_Time:24192.0min: [2024-01-06 05:08:51,642][model8_pretrain.py][INFO] Epoch:[0/2](787100/4588595) loss:3.007 lr:0.0000100 epoch_Time:24192.0min: [2024-01-06 05:08:51,642][model8_pretrain.py][INFO] Epoch:[0/2](787100/4588595) loss:2.775 lr:0.0000100 epoch_Time:24192.0min: [2024-01-06 05:08:51,642][model8_pretrain.py][INFO] Epoch:[0/2](787100/4588595) loss:3.186 lr:0.0000100 epoch_Time:24192.0min: [2024-01-06 05:08:51,642][model8_pretrain.py][INFO] Epoch:[0/2](787100/4588595) loss:2.700 lr:0.0000100 
epoch_Time:24192.0min: [2024-01-06 05:08:51,642][model8_pretrain.py][INFO] Epoch:[0/2](787100/4588595) loss:2.828 lr:0.0000100 epoch_Time:24192.0min: [2024-01-06 05:09:28,585][model8_pretrain.py][INFO] Epoch:[0/2](787200/4588595) loss:2.962 lr:0.0000100 epoch_Time:24192.0min: [2024-01-06 05:09:28,585][model8_pretrain.py][INFO] Epoch:[0/2](787200/4588595) loss:2.729 lr:0.0000100 epoch_Time:24192.0min: [2024-01-06 05:09:28,585][model8_pretrain.py][INFO] Epoch:[0/2](787200/4588595) loss:2.356 lr:0.0000100 epoch_Time:24192.0min: [2024-01-06 05:09:28,585][model8_pretrain.py][INFO] Epoch:[0/2](787200/4588595) loss:3.035 lr:0.0000100 epoch_Time:24192.0min: [2024-01-06 05:09:28,585][model8_pretrain.py][INFO] Epoch:[0/2](787200/4588595) loss:2.709 lr:0.0000100 epoch_Time:24192.0min: [2024-01-06 05:09:28,585][model8_pretrain.py][INFO] Epoch:[0/2](787200/4588595) loss:3.004 lr:0.0000100 epoch_Time:24192.0min: [2024-01-06 05:09:28,585][model8_pretrain.py][INFO] Epoch:[0/2](787200/4588595) loss:2.656 lr:0.0000100 epoch_Time:24192.0min: [2024-01-06 05:09:28,585][model8_pretrain.py][INFO] Epoch:[0/2](787200/4588595) loss:3.082 lr:0.0000100 epoch_Time:24192.0min: [2024-01-06 05:10:05,536][model8_pretrain.py][INFO] Epoch:[0/2](787300/4588595) loss:2.506 lr:0.0000100 epoch_Time:24191.0min: [2024-01-06 05:10:05,536][model8_pretrain.py][INFO] Epoch:[0/2](787300/4588595) loss:2.268 lr:0.0000100 epoch_Time:24191.0min: [2024-01-06 05:10:05,536][model8_pretrain.py][INFO] Epoch:[0/2](787300/4588595) loss:2.475 lr:0.0000100 epoch_Time:24191.0min: [2024-01-06 05:10:05,536][model8_pretrain.py][INFO] Epoch:[0/2](787300/4588595) loss:2.668 lr:0.0000100 epoch_Time:24191.0min: [2024-01-06 05:10:05,536][model8_pretrain.py][INFO] Epoch:[0/2](787300/4588595) loss:2.910 lr:0.0000100 epoch_Time:24191.0min: [2024-01-06 05:10:05,536][model8_pretrain.py][INFO] Epoch:[0/2](787300/4588595) loss:2.897 lr:0.0000100 epoch_Time:24191.0min: [2024-01-06 05:10:05,537][model8_pretrain.py][INFO] 
Epoch:[0/2](787300/4588595) loss:3.066 lr:0.0000100 epoch_Time:24191.0min: [2024-01-06 05:10:05,537][model8_pretrain.py][INFO] Epoch:[0/2](787300/4588595) loss:3.211 lr:0.0000100 epoch_Time:24191.0min: [2024-01-06 05:10:42,503][model8_pretrain.py][INFO] Epoch:[0/2](787400/4588595) loss:2.835 lr:0.0000100 epoch_Time:24191.0min: [2024-01-06 05:10:42,503][model8_pretrain.py][INFO] Epoch:[0/2](787400/4588595) loss:3.137 lr:0.0000100 epoch_Time:24191.0min: [2024-01-06 05:10:42,503][model8_pretrain.py][INFO] Epoch:[0/2](787400/4588595) loss:3.523 lr:0.0000100 epoch_Time:24191.0min: [2024-01-06 05:10:42,503][model8_pretrain.py][INFO] Epoch:[0/2](787400/4588595) loss:2.437 lr:0.0000100 epoch_Time:24191.0min: [2024-01-06 05:10:42,503][model8_pretrain.py][INFO] Epoch:[0/2](787400/4588595) loss:3.084 lr:0.0000100 epoch_Time:24191.0min: [2024-01-06 05:10:42,503][model8_pretrain.py][INFO] Epoch:[0/2](787400/4588595) loss:2.871 lr:0.0000100 epoch_Time:24191.0min: [2024-01-06 05:10:42,503][model8_pretrain.py][INFO] Epoch:[0/2](787400/4588595) loss:3.343 lr:0.0000100 epoch_Time:24191.0min: [2024-01-06 05:10:42,503][model8_pretrain.py][INFO] Epoch:[0/2](787400/4588595) loss:3.230 lr:0.0000100 epoch_Time:24191.0min: [2024-01-06 05:11:29,967][model8_pretrain.py][INFO] Epoch:[0/2](787500/4588595) loss:2.775 lr:0.0000100 epoch_Time:24191.0min: [2024-01-06 05:11:29,967][model8_pretrain.py][INFO] Epoch:[0/2](787500/4588595) loss:2.222 lr:0.0000100 epoch_Time:24191.0min: [2024-01-06 05:11:29,967][model8_pretrain.py][INFO] Epoch:[0/2](787500/4588595) loss:2.973 lr:0.0000100 epoch_Time:24191.0min: [2024-01-06 05:11:29,967][model8_pretrain.py][INFO] Epoch:[0/2](787500/4588595) loss:2.380 lr:0.0000100 epoch_Time:24191.0min: [2024-01-06 05:11:29,967][model8_pretrain.py][INFO] Epoch:[0/2](787500/4588595) loss:2.933 lr:0.0000100 epoch_Time:24191.0min: [2024-01-06 05:11:29,967][model8_pretrain.py][INFO] Epoch:[0/2](787500/4588595) loss:3.122 lr:0.0000100 epoch_Time:24191.0min: [2024-01-06 
05:11:29,967][model8_pretrain.py][INFO] Epoch:[0/2](787500/4588595) loss:2.695 lr:0.0000100 epoch_Time:24191.0min: [2024-01-06 05:11:29,967][model8_pretrain.py][INFO] Epoch:[0/2](787500/4588595) loss:2.723 lr:0.0000100 epoch_Time:24191.0min: [2024-01-06 05:12:06,895][model8_pretrain.py][INFO] Epoch:[0/2](787600/4588595) loss:2.918 lr:0.0000100 epoch_Time:24189.0min: [2024-01-06 05:12:06,895][model8_pretrain.py][INFO] Epoch:[0/2](787600/4588595) loss:3.164 lr:0.0000100 epoch_Time:24189.0min: [2024-01-06 05:12:06,895][model8_pretrain.py][INFO] Epoch:[0/2](787600/4588595) loss:2.545 lr:0.0000100 epoch_Time:24189.0min: [2024-01-06 05:12:06,895][model8_pretrain.py][INFO] Epoch:[0/2](787600/4588595) loss:2.515 lr:0.0000100 epoch_Time:24189.0min: [2024-01-06 05:12:06,895][model8_pretrain.py][INFO] Epoch:[0/2](787600/4588595) loss:2.989 lr:0.0000100 epoch_Time:24189.0min: [2024-01-06 05:12:06,895][model8_pretrain.py][INFO] Epoch:[0/2](787600/4588595) loss:2.842 lr:0.0000100 epoch_Time:24189.0min: [2024-01-06 05:12:06,895][model8_pretrain.py][INFO] Epoch:[0/2](787600/4588595) loss:2.444 lr:0.0000100 epoch_Time:24189.0min: [2024-01-06 05:12:06,895][model8_pretrain.py][INFO] Epoch:[0/2](787600/4588595) loss:3.120 lr:0.0000100 epoch_Time:24189.0min: [2024-01-06 05:12:43,831][model8_pretrain.py][INFO] Epoch:[0/2](787700/4588595) loss:2.669 lr:0.0000100 epoch_Time:24189.0min: [2024-01-06 05:12:43,831][model8_pretrain.py][INFO] Epoch:[0/2](787700/4588595) loss:2.827 lr:0.0000100 epoch_Time:24189.0min: [2024-01-06 05:12:43,831][model8_pretrain.py][INFO] Epoch:[0/2](787700/4588595) loss:2.566 lr:0.0000100 epoch_Time:24189.0min: [2024-01-06 05:12:43,831][model8_pretrain.py][INFO] Epoch:[0/2](787700/4588595) loss:2.870 lr:0.0000100 epoch_Time:24189.0min: [2024-01-06 05:12:43,831][model8_pretrain.py][INFO] Epoch:[0/2](787700/4588595) loss:2.845 lr:0.0000100 epoch_Time:24189.0min: [2024-01-06 05:12:43,831][model8_pretrain.py][INFO] Epoch:[0/2](787700/4588595) loss:3.205 lr:0.0000100 
epoch_Time:24189.0min: [2024-01-06 05:12:43,831][model8_pretrain.py][INFO] Epoch:[0/2](787700/4588595) loss:2.591 lr:0.0000100 epoch_Time:24189.0min: [2024-01-06 05:12:43,831][model8_pretrain.py][INFO] Epoch:[0/2](787700/4588595) loss:2.547 lr:0.0000100 epoch_Time:24189.0min: [2024-01-06 05:13:20,780][model8_pretrain.py][INFO] Epoch:[0/2](787800/4588595) loss:3.284 lr:0.0000100 epoch_Time:24188.0min: [2024-01-06 05:13:20,780][model8_pretrain.py][INFO] Epoch:[0/2](787800/4588595) loss:2.463 lr:0.0000100 epoch_Time:24188.0min: [2024-01-06 05:13:20,780][model8_pretrain.py][INFO] Epoch:[0/2](787800/4588595) loss:2.845 lr:0.0000100 epoch_Time:24188.0min: [2024-01-06 05:13:20,780][model8_pretrain.py][INFO] Epoch:[0/2](787800/4588595) loss:2.347 lr:0.0000100 epoch_Time:24188.0min: [2024-01-06 05:13:20,780][model8_pretrain.py][INFO] Epoch:[0/2](787800/4588595) loss:2.727 lr:0.0000100 epoch_Time:24188.0min: [2024-01-06 05:13:20,780][model8_pretrain.py][INFO] Epoch:[0/2](787800/4588595) loss:2.713 lr:0.0000100 epoch_Time:24188.0min: [2024-01-06 05:13:20,780][model8_pretrain.py][INFO] Epoch:[0/2](787800/4588595) loss:3.190 lr:0.0000100 epoch_Time:24188.0min: [2024-01-06 05:13:20,780][model8_pretrain.py][INFO] Epoch:[0/2](787800/4588595) loss:2.787 lr:0.0000100 epoch_Time:24188.0min: [2024-01-06 05:13:57,723][model8_pretrain.py][INFO] Epoch:[0/2](787900/4588595) loss:2.879 lr:0.0000100 epoch_Time:24187.0min: [2024-01-06 05:13:57,723][model8_pretrain.py][INFO] Epoch:[0/2](787900/4588595) loss:2.218 lr:0.0000100 epoch_Time:24187.0min: [2024-01-06 05:13:57,723][model8_pretrain.py][INFO] Epoch:[0/2](787900/4588595) loss:2.910 lr:0.0000100 epoch_Time:24187.0min: [2024-01-06 05:13:57,723][model8_pretrain.py][INFO] Epoch:[0/2](787900/4588595) loss:2.636 lr:0.0000100 epoch_Time:24187.0min: [2024-01-06 05:13:57,723][model8_pretrain.py][INFO] Epoch:[0/2](787900/4588595) loss:3.222 lr:0.0000100 epoch_Time:24187.0min: [2024-01-06 05:13:57,723][model8_pretrain.py][INFO] 
Epoch:[0/2](787900/4588595) loss:2.693 lr:0.0000100 epoch_Time:24187.0min: [2024-01-06 05:13:57,723][model8_pretrain.py][INFO] Epoch:[0/2](787900/4588595) loss:3.285 lr:0.0000100 epoch_Time:24187.0min: [2024-01-06 05:13:57,724][model8_pretrain.py][INFO] Epoch:[0/2](787900/4588595) loss:3.175 lr:0.0000100 epoch_Time:24187.0min: [2024-01-06 05:14:34,689][model8_pretrain.py][INFO] Epoch:[0/2](788000/4588595) loss:2.869 lr:0.0000100 epoch_Time:24187.0min: [2024-01-06 05:14:34,689][model8_pretrain.py][INFO] Epoch:[0/2](788000/4588595) loss:3.351 lr:0.0000100 epoch_Time:24187.0min: [2024-01-06 05:14:34,689][model8_pretrain.py][INFO] Epoch:[0/2](788000/4588595) loss:3.119 lr:0.0000100 epoch_Time:24187.0min: [2024-01-06 05:14:34,689][model8_pretrain.py][INFO] Epoch:[0/2](788000/4588595) loss:2.558 lr:0.0000100 epoch_Time:24187.0min: [2024-01-06 05:14:34,689][model8_pretrain.py][INFO] Epoch:[0/2](788000/4588595) loss:2.630 lr:0.0000100 epoch_Time:24187.0min: [2024-01-06 05:14:34,689][model8_pretrain.py][INFO] Epoch:[0/2](788000/4588595) loss:2.804 lr:0.0000100 epoch_Time:24187.0min: [2024-01-06 05:14:34,689][model8_pretrain.py][INFO] Epoch:[0/2](788000/4588595) loss:3.316 lr:0.0000100 epoch_Time:24187.0min: [2024-01-06 05:14:34,689][model8_pretrain.py][INFO] Epoch:[0/2](788000/4588595) loss:3.144 lr:0.0000100 epoch_Time:24187.0min: [2024-01-06 05:15:11,637][model8_pretrain.py][INFO] Epoch:[0/2](788100/4588595) loss:2.440 lr:0.0000100 epoch_Time:24186.0min: [2024-01-06 05:15:11,637][model8_pretrain.py][INFO] Epoch:[0/2](788100/4588595) loss:2.830 lr:0.0000100 epoch_Time:24186.0min: [2024-01-06 05:15:11,637][model8_pretrain.py][INFO] Epoch:[0/2](788100/4588595) loss:2.025 lr:0.0000100 epoch_Time:24186.0min: [2024-01-06 05:15:11,637][model8_pretrain.py][INFO] Epoch:[0/2](788100/4588595) loss:2.749 lr:0.0000100 epoch_Time:24186.0min: [2024-01-06 05:15:11,637][model8_pretrain.py][INFO] Epoch:[0/2](788100/4588595) loss:3.152 lr:0.0000100 epoch_Time:24186.0min: [2024-01-06 
05:15:11,637][model8_pretrain.py][INFO] Epoch:[0/2](788100/4588595) loss:2.938 lr:0.0000100 epoch_Time:24186.0min: [2024-01-06 05:15:11,637][model8_pretrain.py][INFO] Epoch:[0/2](788100/4588595) loss:2.718 lr:0.0000100 epoch_Time:24186.0min: [2024-01-06 05:15:11,637][model8_pretrain.py][INFO] Epoch:[0/2](788100/4588595) loss:2.951 lr:0.0000100 epoch_Time:24186.0min: [2024-01-06 05:15:48,583][model8_pretrain.py][INFO] Epoch:[0/2](788200/4588595) loss:3.012 lr:0.0000100 epoch_Time:24185.0min: [2024-01-06 05:15:48,583][model8_pretrain.py][INFO] Epoch:[0/2](788200/4588595) loss:2.667 lr:0.0000100 epoch_Time:24185.0min: [2024-01-06 05:15:48,583][model8_pretrain.py][INFO] Epoch:[0/2](788200/4588595) loss:2.541 lr:0.0000100 epoch_Time:24185.0min: [2024-01-06 05:15:48,583][model8_pretrain.py][INFO] Epoch:[0/2](788200/4588595) loss:3.330 lr:0.0000100 epoch_Time:24185.0min: [2024-01-06 05:15:48,583][model8_pretrain.py][INFO] Epoch:[0/2](788200/4588595) loss:2.921 lr:0.0000100 epoch_Time:24185.0min: [2024-01-06 05:15:48,583][model8_pretrain.py][INFO] Epoch:[0/2](788200/4588595) loss:3.174 lr:0.0000100 epoch_Time:24185.0min: [2024-01-06 05:15:48,583][model8_pretrain.py][INFO] Epoch:[0/2](788200/4588595) loss:2.423 lr:0.0000100 epoch_Time:24185.0min: [2024-01-06 05:15:48,583][model8_pretrain.py][INFO] Epoch:[0/2](788200/4588595) loss:2.778 lr:0.0000100 epoch_Time:24185.0min: [2024-01-06 05:16:35,939][model8_pretrain.py][INFO] Epoch:[0/2](788300/4588595) loss:2.407 lr:0.0000100 epoch_Time:24186.0min: [2024-01-06 05:16:35,939][model8_pretrain.py][INFO] Epoch:[0/2](788300/4588595) loss:2.877 lr:0.0000100 epoch_Time:24186.0min: [2024-01-06 05:16:35,939][model8_pretrain.py][INFO] Epoch:[0/2](788300/4588595) loss:2.517 lr:0.0000100 epoch_Time:24186.0min: [2024-01-06 05:16:35,939][model8_pretrain.py][INFO] Epoch:[0/2](788300/4588595) loss:2.690 lr:0.0000100 epoch_Time:24186.0min: [2024-01-06 05:16:35,939][model8_pretrain.py][INFO] Epoch:[0/2](788300/4588595) loss:2.850 lr:0.0000100 
epoch_Time:24186.0min: [2024-01-06 05:16:35,939][model8_pretrain.py][INFO] Epoch:[0/2](788300/4588595) loss:2.380 lr:0.0000100 epoch_Time:24186.0min: [2024-01-06 05:16:35,939][model8_pretrain.py][INFO] Epoch:[0/2](788300/4588595) loss:2.225 lr:0.0000100 epoch_Time:24186.0min: [2024-01-06 05:16:35,939][model8_pretrain.py][INFO] Epoch:[0/2](788300/4588595) loss:3.207 lr:0.0000100 epoch_Time:24186.0min: [2024-01-06 05:17:12,875][model8_pretrain.py][INFO] Epoch:[0/2](788400/4588595) loss:2.479 lr:0.0000100 epoch_Time:24185.0min: [2024-01-06 05:17:12,875][model8_pretrain.py][INFO] Epoch:[0/2](788400/4588595) loss:3.012 lr:0.0000100 epoch_Time:24185.0min: [2024-01-06 05:17:12,875][model8_pretrain.py][INFO] Epoch:[0/2](788400/4588595) loss:2.468 lr:0.0000100 epoch_Time:24185.0min: [2024-01-06 05:17:12,875][model8_pretrain.py][INFO] Epoch:[0/2](788400/4588595) loss:2.484 lr:0.0000100 epoch_Time:24185.0min: [2024-01-06 05:17:12,875][model8_pretrain.py][INFO] Epoch:[0/2](788400/4588595) loss:2.128 lr:0.0000100 epoch_Time:24185.0min: [2024-01-06 05:17:12,875][model8_pretrain.py][INFO] Epoch:[0/2](788400/4588595) loss:2.218 lr:0.0000100 epoch_Time:24185.0min: [2024-01-06 05:17:12,875][model8_pretrain.py][INFO] Epoch:[0/2](788400/4588595) loss:3.253 lr:0.0000100 epoch_Time:24185.0min: [2024-01-06 05:17:12,875][model8_pretrain.py][INFO] Epoch:[0/2](788400/4588595) loss:3.102 lr:0.0000100 epoch_Time:24185.0min: [2024-01-06 05:17:49,779][model8_pretrain.py][INFO] Epoch:[0/2](788500/4588595) loss:2.998 lr:0.0000100 epoch_Time:24183.0min: [2024-01-06 05:17:49,779][model8_pretrain.py][INFO] Epoch:[0/2](788500/4588595) loss:2.584 lr:0.0000100 epoch_Time:24183.0min: [2024-01-06 05:17:49,779][model8_pretrain.py][INFO] Epoch:[0/2](788500/4588595) loss:2.946 lr:0.0000100 epoch_Time:24183.0min: [2024-01-06 05:17:49,779][model8_pretrain.py][INFO] Epoch:[0/2](788500/4588595) loss:3.052 lr:0.0000100 epoch_Time:24183.0min: [2024-01-06 05:17:49,779][model8_pretrain.py][INFO] 
Epoch:[0/2](788500/4588595) loss:2.697 lr:0.0000100 epoch_Time:24183.0min: [2024-01-06 05:17:49,779][model8_pretrain.py][INFO] Epoch:[0/2](788500/4588595) loss:2.713 lr:0.0000100 epoch_Time:24183.0min: [2024-01-06 05:17:49,779][model8_pretrain.py][INFO] Epoch:[0/2](788500/4588595) loss:2.444 lr:0.0000100 epoch_Time:24183.0min: [2024-01-06 05:17:49,779][model8_pretrain.py][INFO] Epoch:[0/2](788500/4588595) loss:2.583 lr:0.0000100 epoch_Time:24183.0min: [2024-01-06 05:18:26,728][model8_pretrain.py][INFO] Epoch:[0/2](788600/4588595) loss:2.550 lr:0.0000100 epoch_Time:24183.0min: [2024-01-06 05:18:26,728][model8_pretrain.py][INFO] Epoch:[0/2](788600/4588595) loss:2.506 lr:0.0000100 epoch_Time:24183.0min: [2024-01-06 05:18:26,728][model8_pretrain.py][INFO] Epoch:[0/2](788600/4588595) loss:3.012 lr:0.0000100 epoch_Time:24183.0min: [2024-01-06 05:18:26,728][model8_pretrain.py][INFO] Epoch:[0/2](788600/4588595) loss:2.663 lr:0.0000100 epoch_Time:24183.0min: [2024-01-06 05:18:26,728][model8_pretrain.py][INFO] Epoch:[0/2](788600/4588595) loss:3.018 lr:0.0000100 epoch_Time:24183.0min: [2024-01-06 05:18:26,728][model8_pretrain.py][INFO] Epoch:[0/2](788600/4588595) loss:2.987 lr:0.0000100 epoch_Time:24183.0min: [2024-01-06 05:18:26,728][model8_pretrain.py][INFO] Epoch:[0/2](788600/4588595) loss:2.577 lr:0.0000100 epoch_Time:24183.0min: [2024-01-06 05:18:26,728][model8_pretrain.py][INFO] Epoch:[0/2](788600/4588595) loss:1.623 lr:0.0000100 epoch_Time:24183.0min: [2024-01-06 05:19:03,685][model8_pretrain.py][INFO] Epoch:[0/2](788700/4588595) loss:2.768 lr:0.0000100 epoch_Time:24182.0min: [2024-01-06 05:19:03,685][model8_pretrain.py][INFO] Epoch:[0/2](788700/4588595) loss:2.953 lr:0.0000100 epoch_Time:24182.0min: [2024-01-06 05:19:03,685][model8_pretrain.py][INFO] Epoch:[0/2](788700/4588595) loss:2.700 lr:0.0000100 epoch_Time:24182.0min: [2024-01-06 05:19:03,685][model8_pretrain.py][INFO] Epoch:[0/2](788700/4588595) loss:2.501 lr:0.0000100 epoch_Time:24182.0min: [2024-01-06 
05:19:03,685][model8_pretrain.py][INFO] Epoch:[0/2](788700/4588595) loss:3.077 lr:0.0000100 epoch_Time:24182.0min: [2024-01-06 05:19:03,685][model8_pretrain.py][INFO] Epoch:[0/2](788700/4588595) loss:3.105 lr:0.0000100 epoch_Time:24182.0min: [2024-01-06 05:19:03,685][model8_pretrain.py][INFO] Epoch:[0/2](788700/4588595) loss:2.976 lr:0.0000100 epoch_Time:24182.0min: [2024-01-06 05:19:03,685][model8_pretrain.py][INFO] Epoch:[0/2](788700/4588595) loss:2.848 lr:0.0000100 epoch_Time:24182.0min: [2024-01-06 05:19:40,621][model8_pretrain.py][INFO] Epoch:[0/2](788800/4588595) loss:2.245 lr:0.0000100 epoch_Time:24182.0min: [2024-01-06 05:19:40,621][model8_pretrain.py][INFO] Epoch:[0/2](788800/4588595) loss:3.140 lr:0.0000100 epoch_Time:24182.0min: [2024-01-06 05:19:40,621][model8_pretrain.py][INFO] Epoch:[0/2](788800/4588595) loss:2.660 lr:0.0000100 epoch_Time:24182.0min: [2024-01-06 05:19:40,621][model8_pretrain.py][INFO] Epoch:[0/2](788800/4588595) loss:2.974 lr:0.0000100 epoch_Time:24182.0min: [2024-01-06 05:19:40,621][model8_pretrain.py][INFO] Epoch:[0/2](788800/4588595) loss:3.191 lr:0.0000100 epoch_Time:24182.0min: [2024-01-06 05:19:40,621][model8_pretrain.py][INFO] Epoch:[0/2](788800/4588595) loss:3.034 lr:0.0000100 epoch_Time:24182.0min: [2024-01-06 05:19:40,621][model8_pretrain.py][INFO] Epoch:[0/2](788800/4588595) loss:2.581 lr:0.0000100 epoch_Time:24182.0min: [2024-01-06 05:19:40,621][model8_pretrain.py][INFO] Epoch:[0/2](788800/4588595) loss:3.244 lr:0.0000100 epoch_Time:24182.0min: [2024-01-06 05:20:17,557][model8_pretrain.py][INFO] Epoch:[0/2](788900/4588595) loss:2.965 lr:0.0000100 epoch_Time:24181.0min: [2024-01-06 05:20:17,557][model8_pretrain.py][INFO] Epoch:[0/2](788900/4588595) loss:3.452 lr:0.0000100 epoch_Time:24181.0min: [2024-01-06 05:20:17,557][model8_pretrain.py][INFO] Epoch:[0/2](788900/4588595) loss:3.079 lr:0.0000100 epoch_Time:24181.0min: [2024-01-06 05:20:17,557][model8_pretrain.py][INFO] Epoch:[0/2](788900/4588595) loss:2.956 lr:0.0000100 
epoch_Time:24181.0min: [2024-01-06 05:20:17,557][model8_pretrain.py][INFO] Epoch:[0/2](788900/4588595) loss:2.402 lr:0.0000100 epoch_Time:24181.0min: [2024-01-06 05:20:17,557][model8_pretrain.py][INFO] Epoch:[0/2](788900/4588595) loss:2.287 lr:0.0000100 epoch_Time:24181.0min: [2024-01-06 05:20:17,557][model8_pretrain.py][INFO] Epoch:[0/2](788900/4588595) loss:3.076 lr:0.0000100 epoch_Time:24181.0min: [2024-01-06 05:20:17,557][model8_pretrain.py][INFO] Epoch:[0/2](788900/4588595) loss:2.892 lr:0.0000100 epoch_Time:24181.0min: [2024-01-06 05:20:54,490][model8_pretrain.py][INFO] Epoch:[0/2](789000/4588595) loss:2.553 lr:0.0000100 epoch_Time:24180.0min: [2024-01-06 05:20:54,490][model8_pretrain.py][INFO] Epoch:[0/2](789000/4588595) loss:2.755 lr:0.0000100 epoch_Time:24180.0min: [2024-01-06 05:20:54,490][model8_pretrain.py][INFO] Epoch:[0/2](789000/4588595) loss:2.962 lr:0.0000100 epoch_Time:24180.0min: [2024-01-06 05:20:54,490][model8_pretrain.py][INFO] Epoch:[0/2](789000/4588595) loss:2.897 lr:0.0000100 epoch_Time:24180.0min: [2024-01-06 05:20:54,490][model8_pretrain.py][INFO] Epoch:[0/2](789000/4588595) loss:3.423 lr:0.0000100 epoch_Time:24180.0min: [2024-01-06 05:20:54,490][model8_pretrain.py][INFO] Epoch:[0/2](789000/4588595) loss:3.079 lr:0.0000100 epoch_Time:24180.0min: [2024-01-06 05:20:54,491][model8_pretrain.py][INFO] Epoch:[0/2](789000/4588595) loss:2.604 lr:0.0000100 epoch_Time:24180.0min: [2024-01-06 05:20:54,491][model8_pretrain.py][INFO] Epoch:[0/2](789000/4588595) loss:2.656 lr:0.0000100 epoch_Time:24180.0min: [2024-01-06 05:21:41,900][model8_pretrain.py][INFO] Epoch:[0/2](789100/4588595) loss:3.053 lr:0.0000100 epoch_Time:24181.0min: [2024-01-06 05:21:41,900][model8_pretrain.py][INFO] Epoch:[0/2](789100/4588595) loss:2.727 lr:0.0000100 epoch_Time:24181.0min: [2024-01-06 05:21:41,900][model8_pretrain.py][INFO] Epoch:[0/2](789100/4588595) loss:2.517 lr:0.0000100 epoch_Time:24181.0min: [2024-01-06 05:21:41,900][model8_pretrain.py][INFO] 
Epoch:[0/2](789100/4588595) loss:3.159 lr:0.0000100 epoch_Time:24181.0min: [2024-01-06 05:21:41,900][model8_pretrain.py][INFO] Epoch:[0/2](789100/4588595) loss:2.980 lr:0.0000100 epoch_Time:24181.0min: [2024-01-06 05:21:41,900][model8_pretrain.py][INFO] Epoch:[0/2](789100/4588595) loss:2.830 lr:0.0000100 epoch_Time:24181.0min: [2024-01-06 05:21:41,900][model8_pretrain.py][INFO] Epoch:[0/2](789100/4588595) loss:2.298 lr:0.0000100 epoch_Time:24181.0min: [2024-01-06 05:21:41,900][model8_pretrain.py][INFO] Epoch:[0/2](789100/4588595) loss:3.152 lr:0.0000100 epoch_Time:24181.0min: [2024-01-06 05:22:18,829][model8_pretrain.py][INFO] Epoch:[0/2](789200/4588595) loss:2.452 lr:0.0000100 epoch_Time:24180.0min: [2024-01-06 05:22:18,829][model8_pretrain.py][INFO] Epoch:[0/2](789200/4588595) loss:3.087 lr:0.0000100 epoch_Time:24180.0min: [2024-01-06 05:22:18,830][model8_pretrain.py][INFO] Epoch:[0/2](789200/4588595) loss:2.331 lr:0.0000100 epoch_Time:24180.0min: [2024-01-06 05:22:18,830][model8_pretrain.py][INFO] Epoch:[0/2](789200/4588595) loss:2.879 lr:0.0000100 epoch_Time:24180.0min: [2024-01-06 05:22:18,830][model8_pretrain.py][INFO] Epoch:[0/2](789200/4588595) loss:3.466 lr:0.0000100 epoch_Time:24180.0min: [2024-01-06 05:22:18,830][model8_pretrain.py][INFO] Epoch:[0/2](789200/4588595) loss:3.099 lr:0.0000100 epoch_Time:24180.0min: [2024-01-06 05:22:18,830][model8_pretrain.py][INFO] Epoch:[0/2](789200/4588595) loss:2.907 lr:0.0000100 epoch_Time:24180.0min: [2024-01-06 05:22:18,830][model8_pretrain.py][INFO] Epoch:[0/2](789200/4588595) loss:2.812 lr:0.0000100 epoch_Time:24180.0min: [2024-01-06 05:22:55,764][model8_pretrain.py][INFO] Epoch:[0/2](789300/4588595) loss:3.427 lr:0.0000100 epoch_Time:24178.0min: [2024-01-06 05:22:55,764][model8_pretrain.py][INFO] Epoch:[0/2](789300/4588595) loss:2.189 lr:0.0000100 epoch_Time:24178.0min: [2024-01-06 05:22:55,765][model8_pretrain.py][INFO] Epoch:[0/2](789300/4588595) loss:2.711 lr:0.0000100 epoch_Time:24178.0min: [2024-01-06 
05:22:55,765][model8_pretrain.py][INFO] Epoch:[0/2](789300/4588595) loss:3.028 lr:0.0000100 epoch_Time:24178.0min: [2024-01-06 05:22:55,765][model8_pretrain.py][INFO] Epoch:[0/2](789300/4588595) loss:2.638 lr:0.0000100 epoch_Time:24178.0min: [2024-01-06 05:22:55,765][model8_pretrain.py][INFO] Epoch:[0/2](789300/4588595) loss:2.536 lr:0.0000100 epoch_Time:24178.0min: [2024-01-06 05:22:55,765][model8_pretrain.py][INFO] Epoch:[0/2](789300/4588595) loss:2.628 lr:0.0000100 epoch_Time:24178.0min: [2024-01-06 05:22:55,765][model8_pretrain.py][INFO] Epoch:[0/2](789300/4588595) loss:2.669 lr:0.0000100 epoch_Time:24178.0min: [2024-01-06 05:23:32,706][model8_pretrain.py][INFO] Epoch:[0/2](789400/4588595) loss:2.862 lr:0.0000100 epoch_Time:24178.0min: [2024-01-06 05:23:32,706][model8_pretrain.py][INFO] Epoch:[0/2](789400/4588595) loss:2.847 lr:0.0000100 epoch_Time:24178.0min: [2024-01-06 05:23:32,706][model8_pretrain.py][INFO] Epoch:[0/2](789400/4588595) loss:2.254 lr:0.0000100 epoch_Time:24178.0min: [2024-01-06 05:23:32,706][model8_pretrain.py][INFO] Epoch:[0/2](789400/4588595) loss:3.307 lr:0.0000100 epoch_Time:24178.0min: [2024-01-06 05:23:32,706][model8_pretrain.py][INFO] Epoch:[0/2](789400/4588595) loss:2.387 lr:0.0000100 epoch_Time:24178.0min: [2024-01-06 05:23:32,706][model8_pretrain.py][INFO] Epoch:[0/2](789400/4588595) loss:3.495 lr:0.0000100 epoch_Time:24178.0min: [2024-01-06 05:23:32,706][model8_pretrain.py][INFO] Epoch:[0/2](789400/4588595) loss:3.031 lr:0.0000100 epoch_Time:24178.0min: [2024-01-06 05:23:32,706][model8_pretrain.py][INFO] Epoch:[0/2](789400/4588595) loss:2.666 lr:0.0000100 epoch_Time:24178.0min: [2024-01-06 05:24:09,656][model8_pretrain.py][INFO] Epoch:[0/2](789500/4588595) loss:2.333 lr:0.0000100 epoch_Time:24177.0min: [2024-01-06 05:24:09,656][model8_pretrain.py][INFO] Epoch:[0/2](789500/4588595) loss:2.952 lr:0.0000100 epoch_Time:24177.0min: [2024-01-06 05:24:09,656][model8_pretrain.py][INFO] Epoch:[0/2](789500/4588595) loss:2.557 lr:0.0000100 
epoch_Time:24177.0min: [2024-01-06 05:24:09,656][model8_pretrain.py][INFO] Epoch:[0/2](789500/4588595) loss:2.884 lr:0.0000100 epoch_Time:24177.0min: [2024-01-06 05:24:09,657][model8_pretrain.py][INFO] Epoch:[0/2](789500/4588595) loss:3.077 lr:0.0000100 epoch_Time:24177.0min: [2024-01-06 05:24:09,657][model8_pretrain.py][INFO] Epoch:[0/2](789500/4588595) loss:2.426 lr:0.0000100 epoch_Time:24177.0min: [2024-01-06 05:24:09,657][model8_pretrain.py][INFO] Epoch:[0/2](789500/4588595) loss:2.992 lr:0.0000100 epoch_Time:24177.0min: [2024-01-06 05:24:09,657][model8_pretrain.py][INFO] Epoch:[0/2](789500/4588595) loss:3.447 lr:0.0000100 epoch_Time:24177.0min: [2024-01-06 05:24:46,592][model8_pretrain.py][INFO] Epoch:[0/2](789600/4588595) loss:2.413 lr:0.0000100 epoch_Time:24177.0min: [2024-01-06 05:24:46,592][model8_pretrain.py][INFO] Epoch:[0/2](789600/4588595) loss:3.165 lr:0.0000100 epoch_Time:24177.0min: [2024-01-06 05:24:46,592][model8_pretrain.py][INFO] Epoch:[0/2](789600/4588595) loss:2.859 lr:0.0000100 epoch_Time:24177.0min: [2024-01-06 05:24:46,592][model8_pretrain.py][INFO] Epoch:[0/2](789600/4588595) loss:2.956 lr:0.0000100 epoch_Time:24177.0min: [2024-01-06 05:24:46,592][model8_pretrain.py][INFO] Epoch:[0/2](789600/4588595) loss:2.981 lr:0.0000100 epoch_Time:24177.0min: [2024-01-06 05:24:46,592][model8_pretrain.py][INFO] Epoch:[0/2](789600/4588595) loss:2.956 lr:0.0000100 epoch_Time:24177.0min: [2024-01-06 05:24:46,592][model8_pretrain.py][INFO] Epoch:[0/2](789600/4588595) loss:3.153 lr:0.0000100 epoch_Time:24177.0min: [2024-01-06 05:24:46,592][model8_pretrain.py][INFO] Epoch:[0/2](789600/4588595) loss:2.428 lr:0.0000100 epoch_Time:24177.0min: [2024-01-06 05:25:23,546][model8_pretrain.py][INFO] Epoch:[0/2](789700/4588595) loss:3.019 lr:0.0000100 epoch_Time:24176.0min: [2024-01-06 05:25:23,546][model8_pretrain.py][INFO] Epoch:[0/2](789700/4588595) loss:2.947 lr:0.0000100 epoch_Time:24176.0min: [2024-01-06 05:25:23,546][model8_pretrain.py][INFO] 
Epoch:[0/2](789700/4588595) loss:2.356 lr:0.0000100 epoch_Time:24176.0min: [2024-01-06 05:25:23,546][model8_pretrain.py][INFO] Epoch:[0/2](789700/4588595) loss:2.959 lr:0.0000100 epoch_Time:24176.0min: [2024-01-06 05:25:23,546][model8_pretrain.py][INFO] Epoch:[0/2](789700/4588595) loss:3.078 lr:0.0000100 epoch_Time:24176.0min: [2024-01-06 05:25:23,546][model8_pretrain.py][INFO] Epoch:[0/2](789700/4588595) loss:2.952 lr:0.0000100 epoch_Time:24176.0min: [2024-01-06 05:25:23,546][model8_pretrain.py][INFO] Epoch:[0/2](789700/4588595) loss:2.999 lr:0.0000100 epoch_Time:24176.0min: [2024-01-06 05:25:23,546][model8_pretrain.py][INFO] Epoch:[0/2](789700/4588595) loss:2.713 lr:0.0000100 epoch_Time:24176.0min: [2024-01-06 05:26:00,481][model8_pretrain.py][INFO] Epoch:[0/2](789800/4588595) loss:2.806 lr:0.0000100 epoch_Time:24175.0min: [2024-01-06 05:26:00,481][model8_pretrain.py][INFO] Epoch:[0/2](789800/4588595) loss:2.510 lr:0.0000100 epoch_Time:24175.0min: [2024-01-06 05:26:00,481][model8_pretrain.py][INFO] Epoch:[0/2](789800/4588595) loss:2.887 lr:0.0000100 epoch_Time:24175.0min: [2024-01-06 05:26:00,481][model8_pretrain.py][INFO] Epoch:[0/2](789800/4588595) loss:3.519 lr:0.0000100 epoch_Time:24175.0min: [2024-01-06 05:26:00,481][model8_pretrain.py][INFO] Epoch:[0/2](789800/4588595) loss:2.731 lr:0.0000100 epoch_Time:24175.0min: [2024-01-06 05:26:00,481][model8_pretrain.py][INFO] Epoch:[0/2](789800/4588595) loss:2.554 lr:0.0000100 epoch_Time:24175.0min: [2024-01-06 05:26:00,481][model8_pretrain.py][INFO] Epoch:[0/2](789800/4588595) loss:2.665 lr:0.0000100 epoch_Time:24175.0min: [2024-01-06 05:26:00,481][model8_pretrain.py][INFO] Epoch:[0/2](789800/4588595) loss:2.512 lr:0.0000100 epoch_Time:24175.0min: [2024-01-06 05:26:47,630][model8_pretrain.py][INFO] Epoch:[0/2](789900/4588595) loss:2.492 lr:0.0000100 epoch_Time:24175.0min: [2024-01-06 05:26:47,630][model8_pretrain.py][INFO] Epoch:[0/2](789900/4588595) loss:2.756 lr:0.0000100 epoch_Time:24175.0min: [2024-01-06 
05:26:47,630][model8_pretrain.py][INFO] Epoch:[0/2](789900/4588595) loss:2.438 lr:0.0000100 epoch_Time:24175.0min: [2024-01-06 05:26:47,630][model8_pretrain.py][INFO] Epoch:[0/2](789900/4588595) loss:2.761 lr:0.0000100 epoch_Time:24175.0min: [2024-01-06 05:26:47,630][model8_pretrain.py][INFO] Epoch:[0/2](789900/4588595) loss:2.950 lr:0.0000100 epoch_Time:24175.0min: [2024-01-06 05:26:47,630][model8_pretrain.py][INFO] Epoch:[0/2](789900/4588595) loss:2.693 lr:0.0000100 epoch_Time:24175.0min: [2024-01-06 05:26:47,630][model8_pretrain.py][INFO] Epoch:[0/2](789900/4588595) loss:3.256 lr:0.0000100 epoch_Time:24175.0min: [2024-01-06 05:26:47,630][model8_pretrain.py][INFO] Epoch:[0/2](789900/4588595) loss:2.734 lr:0.0000100 epoch_Time:24175.0min: [2024-01-06 05:27:24,549][model8_pretrain.py][INFO] Epoch:[0/2](790000/4588595) loss:3.201 lr:0.0000100 epoch_Time:24175.0min: [2024-01-06 05:27:24,549][model8_pretrain.py][INFO] Epoch:[0/2](790000/4588595) loss:2.847 lr:0.0000100 epoch_Time:24175.0min: [2024-01-06 05:27:24,549][model8_pretrain.py][INFO] Epoch:[0/2](790000/4588595) loss:2.910 lr:0.0000100 epoch_Time:24175.0min: [2024-01-06 05:27:24,549][model8_pretrain.py][INFO] Epoch:[0/2](790000/4588595) loss:2.912 lr:0.0000100 epoch_Time:24175.0min: [2024-01-06 05:27:24,549][model8_pretrain.py][INFO] Epoch:[0/2](790000/4588595) loss:3.082 lr:0.0000100 epoch_Time:24175.0min: [2024-01-06 05:27:24,549][model8_pretrain.py][INFO] Epoch:[0/2](790000/4588595) loss:2.516 lr:0.0000100 epoch_Time:24175.0min: [2024-01-06 05:27:24,549][model8_pretrain.py][INFO] Epoch:[0/2](790000/4588595) loss:3.141 lr:0.0000100 epoch_Time:24175.0min: [2024-01-06 05:27:24,549][model8_pretrain.py][INFO] Epoch:[0/2](790000/4588595) loss:3.070 lr:0.0000100 epoch_Time:24175.0min: [2024-01-06 05:28:01,487][model8_pretrain.py][INFO] Epoch:[0/2](790100/4588595) loss:2.890 lr:0.0000100 epoch_Time:24173.0min: [2024-01-06 05:28:01,487][model8_pretrain.py][INFO] Epoch:[0/2](790100/4588595) loss:3.192 lr:0.0000100 
epoch_Time:24173.0min: [2024-01-06 05:28:01,487][model8_pretrain.py][INFO] Epoch:[0/2](790100/4588595) loss:2.377 lr:0.0000100 epoch_Time:24173.0min: [2024-01-06 05:28:01,487][model8_pretrain.py][INFO] Epoch:[0/2](790100/4588595) loss:2.740 lr:0.0000100 epoch_Time:24173.0min: [2024-01-06 05:28:01,487][model8_pretrain.py][INFO] Epoch:[0/2](790100/4588595) loss:2.427 lr:0.0000100 epoch_Time:24173.0min: [2024-01-06 05:28:01,487][model8_pretrain.py][INFO] Epoch:[0/2](790100/4588595) loss:2.862 lr:0.0000100 epoch_Time:24173.0min: [2024-01-06 05:28:01,487][model8_pretrain.py][INFO] Epoch:[0/2](790100/4588595) loss:2.688 lr:0.0000100 epoch_Time:24173.0min: [2024-01-06 05:28:01,488][model8_pretrain.py][INFO] Epoch:[0/2](790100/4588595) loss:2.610 lr:0.0000100 epoch_Time:24173.0min: [2024-01-06 05:28:38,431][model8_pretrain.py][INFO] Epoch:[0/2](790200/4588595) loss:2.797 lr:0.0000100 epoch_Time:24173.0min: [2024-01-06 05:28:38,431][model8_pretrain.py][INFO] Epoch:[0/2](790200/4588595) loss:2.810 lr:0.0000100 epoch_Time:24173.0min: [2024-01-06 05:28:38,431][model8_pretrain.py][INFO] Epoch:[0/2](790200/4588595) loss:2.928 lr:0.0000100 epoch_Time:24173.0min: [2024-01-06 05:28:38,431][model8_pretrain.py][INFO] Epoch:[0/2](790200/4588595) loss:2.621 lr:0.0000100 epoch_Time:24173.0min: [2024-01-06 05:28:38,431][model8_pretrain.py][INFO] Epoch:[0/2](790200/4588595) loss:3.150 lr:0.0000100 epoch_Time:24173.0min: [2024-01-06 05:28:38,431][model8_pretrain.py][INFO] Epoch:[0/2](790200/4588595) loss:3.045 lr:0.0000100 epoch_Time:24173.0min: [2024-01-06 05:28:38,431][model8_pretrain.py][INFO] Epoch:[0/2](790200/4588595) loss:2.837 lr:0.0000100 epoch_Time:24173.0min: [2024-01-06 05:28:38,431][model8_pretrain.py][INFO] Epoch:[0/2](790200/4588595) loss:1.971 lr:0.0000100 epoch_Time:24173.0min: [2024-01-06 05:29:15,367][model8_pretrain.py][INFO] Epoch:[0/2](790300/4588595) loss:2.820 lr:0.0000100 epoch_Time:24172.0min: [2024-01-06 05:29:15,367][model8_pretrain.py][INFO] 
Epoch:[0/2](790300/4588595) loss:2.564 lr:0.0000100 epoch_Time:24172.0min: [2024-01-06 05:29:15,367][model8_pretrain.py][INFO] Epoch:[0/2](790300/4588595) loss:3.149 lr:0.0000100 epoch_Time:24172.0min: [2024-01-06 05:29:15,367][model8_pretrain.py][INFO] Epoch:[0/2](790300/4588595) loss:2.495 lr:0.0000100 epoch_Time:24172.0min: [2024-01-06 05:29:15,367][model8_pretrain.py][INFO] Epoch:[0/2](790300/4588595) loss:3.294 lr:0.0000100 epoch_Time:24172.0min: [2024-01-06 05:29:15,367][model8_pretrain.py][INFO] Epoch:[0/2](790300/4588595) loss:2.669 lr:0.0000100 epoch_Time:24172.0min: [2024-01-06 05:29:15,367][model8_pretrain.py][INFO] Epoch:[0/2](790300/4588595) loss:2.118 lr:0.0000100 epoch_Time:24172.0min: [2024-01-06 05:29:15,367][model8_pretrain.py][INFO] Epoch:[0/2](790300/4588595) loss:2.819 lr:0.0000100 epoch_Time:24172.0min: [2024-01-06 05:29:52,308][model8_pretrain.py][INFO] Epoch:[0/2](790400/4588595) loss:2.646 lr:0.0000100 epoch_Time:24171.0min: [2024-01-06 05:29:52,308][model8_pretrain.py][INFO] Epoch:[0/2](790400/4588595) loss:2.869 lr:0.0000100 epoch_Time:24171.0min: [2024-01-06 05:29:52,308][model8_pretrain.py][INFO] Epoch:[0/2](790400/4588595) loss:2.432 lr:0.0000100 epoch_Time:24171.0min: [2024-01-06 05:29:52,308][model8_pretrain.py][INFO] Epoch:[0/2](790400/4588595) loss:3.027 lr:0.0000100 epoch_Time:24171.0min: [2024-01-06 05:29:52,308][model8_pretrain.py][INFO] Epoch:[0/2](790400/4588595) loss:2.715 lr:0.0000100 epoch_Time:24171.0min: [2024-01-06 05:29:52,308][model8_pretrain.py][INFO] Epoch:[0/2](790400/4588595) loss:2.713 lr:0.0000100 epoch_Time:24171.0min: [2024-01-06 05:29:52,308][model8_pretrain.py][INFO] Epoch:[0/2](790400/4588595) loss:2.439 lr:0.0000100 epoch_Time:24171.0min: [2024-01-06 05:29:52,309][model8_pretrain.py][INFO] Epoch:[0/2](790400/4588595) loss:2.659 lr:0.0000100 epoch_Time:24171.0min: [2024-01-06 05:30:29,251][model8_pretrain.py][INFO] Epoch:[0/2](790500/4588595) loss:3.354 lr:0.0000100 epoch_Time:24171.0min: [2024-01-06 
05:30:29,251][model8_pretrain.py][INFO] Epoch:[0/2](790500/4588595) loss:2.929 lr:0.0000100 epoch_Time:24171.0min: [2024-01-06 05:30:29,251][model8_pretrain.py][INFO] Epoch:[0/2](790500/4588595) loss:2.523 lr:0.0000100 epoch_Time:24171.0min: [2024-01-06 05:30:29,251][model8_pretrain.py][INFO] Epoch:[0/2](790500/4588595) loss:2.875 lr:0.0000100 epoch_Time:24171.0min: [2024-01-06 05:30:29,251][model8_pretrain.py][INFO] Epoch:[0/2](790500/4588595) loss:2.798 lr:0.0000100 epoch_Time:24171.0min: [2024-01-06 05:30:29,251][model8_pretrain.py][INFO] Epoch:[0/2](790500/4588595) loss:2.632 lr:0.0000100 epoch_Time:24171.0min: [2024-01-06 05:30:29,251][model8_pretrain.py][INFO] Epoch:[0/2](790500/4588595) loss:2.880 lr:0.0000100 epoch_Time:24171.0min: [2024-01-06 05:30:29,251][model8_pretrain.py][INFO] Epoch:[0/2](790500/4588595) loss:3.218 lr:0.0000100 epoch_Time:24171.0min: [2024-01-06 05:31:06,204][model8_pretrain.py][INFO] Epoch:[0/2](790600/4588595) loss:3.120 lr:0.0000100 epoch_Time:24170.0min: [2024-01-06 05:31:06,204][model8_pretrain.py][INFO] Epoch:[0/2](790600/4588595) loss:3.306 lr:0.0000100 epoch_Time:24170.0min: [2024-01-06 05:31:06,204][model8_pretrain.py][INFO] Epoch:[0/2](790600/4588595) loss:3.245 lr:0.0000100 epoch_Time:24170.0min: [2024-01-06 05:31:06,204][model8_pretrain.py][INFO] Epoch:[0/2](790600/4588595) loss:3.384 lr:0.0000100 epoch_Time:24170.0min: [2024-01-06 05:31:06,204][model8_pretrain.py][INFO] Epoch:[0/2](790600/4588595) loss:3.111 lr:0.0000100 epoch_Time:24170.0min: [2024-01-06 05:31:06,204][model8_pretrain.py][INFO] Epoch:[0/2](790600/4588595) loss:2.632 lr:0.0000100 epoch_Time:24170.0min: [2024-01-06 05:31:06,204][model8_pretrain.py][INFO] Epoch:[0/2](790600/4588595) loss:3.339 lr:0.0000100 epoch_Time:24170.0min: [2024-01-06 05:31:06,204][model8_pretrain.py][INFO] Epoch:[0/2](790600/4588595) loss:2.553 lr:0.0000100 epoch_Time:24170.0min: [2024-01-06 05:31:51,658][model8_pretrain.py][INFO] Epoch:[0/2](790700/4588595) loss:2.924 lr:0.0000100 
epoch_Time:24170.0min: [2024-01-06 05:31:51,658][model8_pretrain.py][INFO] Epoch:[0/2](790700/4588595) loss:2.789 lr:0.0000100 epoch_Time:24170.0min: [2024-01-06 05:31:51,658][model8_pretrain.py][INFO] Epoch:[0/2](790700/4588595) loss:2.871 lr:0.0000100 epoch_Time:24170.0min: [2024-01-06 05:31:51,658][model8_pretrain.py][INFO] Epoch:[0/2](790700/4588595) loss:2.796 lr:0.0000100 epoch_Time:24170.0min: [2024-01-06 05:31:51,658][model8_pretrain.py][INFO] Epoch:[0/2](790700/4588595) loss:3.298 lr:0.0000100 epoch_Time:24170.0min: [2024-01-06 05:31:51,658][model8_pretrain.py][INFO] Epoch:[0/2](790700/4588595) loss:3.271 lr:0.0000100 epoch_Time:24170.0min: [2024-01-06 05:31:51,659][model8_pretrain.py][INFO] Epoch:[0/2](790700/4588595) loss:3.028 lr:0.0000100 epoch_Time:24170.0min: [2024-01-06 05:31:53,342][model8_pretrain.py][INFO] Epoch:[0/2](790700/4588595) loss:3.062 lr:0.0000100 epoch_Time:24170.0min: [2024-01-06 05:32:30,238][model8_pretrain.py][INFO] Epoch:[0/2](790800/4588595) loss:2.936 lr:0.0000100 epoch_Time:24170.0min: [2024-01-06 05:32:30,238][model8_pretrain.py][INFO] Epoch:[0/2](790800/4588595) loss:2.721 lr:0.0000100 epoch_Time:24170.0min: [2024-01-06 05:32:30,238][model8_pretrain.py][INFO] Epoch:[0/2](790800/4588595) loss:2.473 lr:0.0000100 epoch_Time:24170.0min: [2024-01-06 05:32:30,238][model8_pretrain.py][INFO] Epoch:[0/2](790800/4588595) loss:2.344 lr:0.0000100 epoch_Time:24170.0min: [2024-01-06 05:32:30,238][model8_pretrain.py][INFO] Epoch:[0/2](790800/4588595) loss:3.323 lr:0.0000100 epoch_Time:24170.0min: [2024-01-06 05:32:30,239][model8_pretrain.py][INFO] Epoch:[0/2](790800/4588595) loss:3.158 lr:0.0000100 epoch_Time:24170.0min: [2024-01-06 05:32:30,238][model8_pretrain.py][INFO] Epoch:[0/2](790800/4588595) loss:2.802 lr:0.0000100 epoch_Time:24170.0min: [2024-01-06 05:32:30,239][model8_pretrain.py][INFO] Epoch:[0/2](790800/4588595) loss:2.669 lr:0.0000100 epoch_Time:24170.0min: [2024-01-06 05:33:07,167][model8_pretrain.py][INFO] 
Epoch:[0/2](790900/4588595) loss:3.033 lr:0.0000100 epoch_Time:24169.0min: [2024-01-06 05:33:07,167][model8_pretrain.py][INFO] Epoch:[0/2](790900/4588595) loss:2.235 lr:0.0000100 epoch_Time:24169.0min: [2024-01-06 05:33:07,167][model8_pretrain.py][INFO] Epoch:[0/2](790900/4588595) loss:2.791 lr:0.0000100 epoch_Time:24169.0min: [2024-01-06 05:33:07,167][model8_pretrain.py][INFO] Epoch:[0/2](790900/4588595) loss:2.500 lr:0.0000100 epoch_Time:24169.0min: [2024-01-06 05:33:07,167][model8_pretrain.py][INFO] Epoch:[0/2](790900/4588595) loss:2.832 lr:0.0000100 epoch_Time:24169.0min: [2024-01-06 05:33:07,167][model8_pretrain.py][INFO] Epoch:[0/2](790900/4588595) loss:3.118 lr:0.0000100 epoch_Time:24169.0min: [2024-01-06 05:33:07,167][model8_pretrain.py][INFO] Epoch:[0/2](790900/4588595) loss:3.004 lr:0.0000100 epoch_Time:24169.0min: [2024-01-06 05:33:07,167][model8_pretrain.py][INFO] Epoch:[0/2](790900/4588595) loss:2.672 lr:0.0000100 epoch_Time:24169.0min: [2024-01-06 05:33:44,104][model8_pretrain.py][INFO] Epoch:[0/2](791000/4588595) loss:3.415 lr:0.0000100 epoch_Time:24168.0min: [2024-01-06 05:33:44,104][model8_pretrain.py][INFO] Epoch:[0/2](791000/4588595) loss:2.717 lr:0.0000100 epoch_Time:24168.0min: [2024-01-06 05:33:44,105][model8_pretrain.py][INFO] Epoch:[0/2](791000/4588595) loss:2.974 lr:0.0000100 epoch_Time:24168.0min: [2024-01-06 05:33:44,105][model8_pretrain.py][INFO] Epoch:[0/2](791000/4588595) loss:3.303 lr:0.0000100 epoch_Time:24168.0min: [2024-01-06 05:33:44,105][model8_pretrain.py][INFO] Epoch:[0/2](791000/4588595) loss:3.214 lr:0.0000100 epoch_Time:24168.0min: [2024-01-06 05:33:44,105][model8_pretrain.py][INFO] Epoch:[0/2](791000/4588595) loss:3.176 lr:0.0000100 epoch_Time:24168.0min: [2024-01-06 05:33:44,105][model8_pretrain.py][INFO] Epoch:[0/2](791000/4588595) loss:2.986 lr:0.0000100 epoch_Time:24168.0min: [2024-01-06 05:33:44,105][model8_pretrain.py][INFO] Epoch:[0/2](791000/4588595) loss:2.350 lr:0.0000100 epoch_Time:24168.0min: [2024-01-06 
05:34:21,051][model8_pretrain.py][INFO] Epoch:[0/2](791100/4588595) loss:2.958 lr:0.0000100 epoch_Time:24167.0min: [2024-01-06 05:34:21,051][model8_pretrain.py][INFO] Epoch:[0/2](791100/4588595) loss:2.832 lr:0.0000100 epoch_Time:24167.0min: [2024-01-06 05:34:21,051][model8_pretrain.py][INFO] Epoch:[0/2](791100/4588595) loss:2.667 lr:0.0000100 epoch_Time:24167.0min: [2024-01-06 05:34:21,051][model8_pretrain.py][INFO] Epoch:[0/2](791100/4588595) loss:2.232 lr:0.0000100 epoch_Time:24167.0min: [2024-01-06 05:34:21,051][model8_pretrain.py][INFO] Epoch:[0/2](791100/4588595) loss:3.113 lr:0.0000100 epoch_Time:24167.0min: [2024-01-06 05:34:21,051][model8_pretrain.py][INFO] Epoch:[0/2](791100/4588595) loss:3.194 lr:0.0000100 epoch_Time:24167.0min: [2024-01-06 05:34:21,051][model8_pretrain.py][INFO] Epoch:[0/2](791100/4588595) loss:2.951 lr:0.0000100 epoch_Time:24167.0min: [2024-01-06 05:34:21,051][model8_pretrain.py][INFO] Epoch:[0/2](791100/4588595) loss:2.591 lr:0.0000100 epoch_Time:24167.0min: [2024-01-06 05:34:57,978][model8_pretrain.py][INFO] Epoch:[0/2](791200/4588595) loss:2.555 lr:0.0000100 epoch_Time:24166.0min: [2024-01-06 05:34:57,979][model8_pretrain.py][INFO] Epoch:[0/2](791200/4588595) loss:3.090 lr:0.0000100 epoch_Time:24166.0min: [2024-01-06 05:34:57,979][model8_pretrain.py][INFO] Epoch:[0/2](791200/4588595) loss:2.654 lr:0.0000100 epoch_Time:24166.0min: [2024-01-06 05:34:57,979][model8_pretrain.py][INFO] Epoch:[0/2](791200/4588595) loss:2.757 lr:0.0000100 epoch_Time:24166.0min: [2024-01-06 05:34:57,979][model8_pretrain.py][INFO] Epoch:[0/2](791200/4588595) loss:3.000 lr:0.0000100 epoch_Time:24166.0min: [2024-01-06 05:34:57,979][model8_pretrain.py][INFO] Epoch:[0/2](791200/4588595) loss:3.054 lr:0.0000100 epoch_Time:24166.0min: [2024-01-06 05:34:57,979][model8_pretrain.py][INFO] Epoch:[0/2](791200/4588595) loss:2.449 lr:0.0000100 epoch_Time:24166.0min: [2024-01-06 05:34:57,979][model8_pretrain.py][INFO] Epoch:[0/2](791200/4588595) loss:2.849 lr:0.0000100 
epoch_Time:24166.0min: [2024-01-06 05:35:34,913][model8_pretrain.py][INFO] Epoch:[0/2](791300/4588595) loss:2.679 lr:0.0000100 epoch_Time:24166.0min: [2024-01-06 05:35:34,913][model8_pretrain.py][INFO] Epoch:[0/2](791300/4588595) loss:2.771 lr:0.0000100 epoch_Time:24166.0min: [2024-01-06 05:35:34,913][model8_pretrain.py][INFO] Epoch:[0/2](791300/4588595) loss:2.500 lr:0.0000100 epoch_Time:24166.0min: [2024-01-06 05:35:34,913][model8_pretrain.py][INFO] Epoch:[0/2](791300/4588595) loss:3.373 lr:0.0000100 epoch_Time:24166.0min: [2024-01-06 05:35:34,913][model8_pretrain.py][INFO] Epoch:[0/2](791300/4588595) loss:2.305 lr:0.0000100 epoch_Time:24166.0min: [2024-01-06 05:35:34,913][model8_pretrain.py][INFO] Epoch:[0/2](791300/4588595) loss:2.641 lr:0.0000100 epoch_Time:24166.0min: [2024-01-06 05:35:34,913][model8_pretrain.py][INFO] Epoch:[0/2](791300/4588595) loss:2.716 lr:0.0000100 epoch_Time:24166.0min: [2024-01-06 05:35:34,913][model8_pretrain.py][INFO] Epoch:[0/2](791300/4588595) loss:3.073 lr:0.0000100 epoch_Time:24166.0min: [2024-01-06 05:36:11,861][model8_pretrain.py][INFO] Epoch:[0/2](791400/4588595) loss:3.780 lr:0.0000100 epoch_Time:24165.0min: [2024-01-06 05:36:11,861][model8_pretrain.py][INFO] Epoch:[0/2](791400/4588595) loss:2.731 lr:0.0000100 epoch_Time:24165.0min: [2024-01-06 05:36:11,861][model8_pretrain.py][INFO] Epoch:[0/2](791400/4588595) loss:2.472 lr:0.0000100 epoch_Time:24165.0min: [2024-01-06 05:36:11,861][model8_pretrain.py][INFO] Epoch:[0/2](791400/4588595) loss:3.146 lr:0.0000100 epoch_Time:24165.0min: [2024-01-06 05:36:11,861][model8_pretrain.py][INFO] Epoch:[0/2](791400/4588595) loss:3.011 lr:0.0000100 epoch_Time:24165.0min: [2024-01-06 05:36:11,862][model8_pretrain.py][INFO] Epoch:[0/2](791400/4588595) loss:2.787 lr:0.0000100 epoch_Time:24165.0min: [2024-01-06 05:36:11,862][model8_pretrain.py][INFO] Epoch:[0/2](791400/4588595) loss:3.039 lr:0.0000100 epoch_Time:24165.0min: [2024-01-06 05:36:11,862][model8_pretrain.py][INFO] 
Epoch:[0/2](791400/4588595) loss:2.840 lr:0.0000100 epoch_Time:24165.0min: [2024-01-06 05:36:57,268][model8_pretrain.py][INFO] Epoch:[0/2](791500/4588595) loss:1.888 lr:0.0000100 epoch_Time:24165.0min: [2024-01-06 05:36:57,268][model8_pretrain.py][INFO] Epoch:[0/2](791500/4588595) loss:2.449 lr:0.0000100 epoch_Time:24165.0min: [2024-01-06 05:36:57,268][model8_pretrain.py][INFO] Epoch:[0/2](791500/4588595) loss:2.834 lr:0.0000100 epoch_Time:24165.0min: [2024-01-06 05:36:57,268][model8_pretrain.py][INFO] Epoch:[0/2](791500/4588595) loss:2.517 lr:0.0000100 epoch_Time:24165.0min: [2024-01-06 05:36:57,268][model8_pretrain.py][INFO] Epoch:[0/2](791500/4588595) loss:2.486 lr:0.0000100 epoch_Time:24165.0min: [2024-01-06 05:36:57,269][model8_pretrain.py][INFO] Epoch:[0/2](791500/4588595) loss:2.951 lr:0.0000100 epoch_Time:24165.0min: [2024-01-06 05:36:57,268][model8_pretrain.py][INFO] Epoch:[0/2](791500/4588595) loss:2.352 lr:0.0000100 epoch_Time:24165.0min: [2024-01-06 05:36:57,269][model8_pretrain.py][INFO] Epoch:[0/2](791500/4588595) loss:2.850 lr:0.0000100 epoch_Time:24165.0min: [2024-01-06 05:37:35,887][model8_pretrain.py][INFO] Epoch:[0/2](791600/4588595) loss:3.352 lr:0.0000100 epoch_Time:24165.0min: [2024-01-06 05:37:35,887][model8_pretrain.py][INFO] Epoch:[0/2](791600/4588595) loss:2.607 lr:0.0000100 epoch_Time:24165.0min: [2024-01-06 05:37:35,887][model8_pretrain.py][INFO] Epoch:[0/2](791600/4588595) loss:2.703 lr:0.0000100 epoch_Time:24165.0min: [2024-01-06 05:37:35,887][model8_pretrain.py][INFO] Epoch:[0/2](791600/4588595) loss:2.559 lr:0.0000100 epoch_Time:24165.0min: [2024-01-06 05:37:35,887][model8_pretrain.py][INFO] Epoch:[0/2](791600/4588595) loss:2.630 lr:0.0000100 epoch_Time:24165.0min: [2024-01-06 05:37:35,887][model8_pretrain.py][INFO] Epoch:[0/2](791600/4588595) loss:2.882 lr:0.0000100 epoch_Time:24165.0min: [2024-01-06 05:37:35,888][model8_pretrain.py][INFO] Epoch:[0/2](791600/4588595) loss:3.173 lr:0.0000100 epoch_Time:24165.0min: [2024-01-06 
05:37:35,888][model8_pretrain.py][INFO] Epoch:[0/2](791600/4588595) loss:2.553 lr:0.0000100 epoch_Time:24165.0min: [2024-01-06 05:38:12,826][model8_pretrain.py][INFO] Epoch:[0/2](791700/4588595) loss:3.080 lr:0.0000100 epoch_Time:24164.0min: [2024-01-06 05:38:12,826][model8_pretrain.py][INFO] Epoch:[0/2](791700/4588595) loss:3.247 lr:0.0000100 epoch_Time:24164.0min: [2024-01-06 05:38:12,826][model8_pretrain.py][INFO] Epoch:[0/2](791700/4588595) loss:2.960 lr:0.0000100 epoch_Time:24164.0min: [2024-01-06 05:38:12,826][model8_pretrain.py][INFO] Epoch:[0/2](791700/4588595) loss:3.241 lr:0.0000100 epoch_Time:24164.0min: [2024-01-06 05:38:12,826][model8_pretrain.py][INFO] Epoch:[0/2](791700/4588595) loss:2.286 lr:0.0000100 epoch_Time:24164.0min: [2024-01-06 05:38:12,826][model8_pretrain.py][INFO] Epoch:[0/2](791700/4588595) loss:3.050 lr:0.0000100 epoch_Time:24164.0min: [2024-01-06 05:38:12,827][model8_pretrain.py][INFO] Epoch:[0/2](791700/4588595) loss:2.040 lr:0.0000100 epoch_Time:24164.0min: [2024-01-06 05:38:12,827][model8_pretrain.py][INFO] Epoch:[0/2](791700/4588595) loss:3.253 lr:0.0000100 epoch_Time:24164.0min: [2024-01-06 05:38:49,760][model8_pretrain.py][INFO] Epoch:[0/2](791800/4588595) loss:2.794 lr:0.0000100 epoch_Time:24162.0min: [2024-01-06 05:38:49,760][model8_pretrain.py][INFO] Epoch:[0/2](791800/4588595) loss:2.719 lr:0.0000100 epoch_Time:24162.0min: [2024-01-06 05:38:49,760][model8_pretrain.py][INFO] Epoch:[0/2](791800/4588595) loss:2.822 lr:0.0000100 epoch_Time:24162.0min: [2024-01-06 05:38:49,760][model8_pretrain.py][INFO] Epoch:[0/2](791800/4588595) loss:2.591 lr:0.0000100 epoch_Time:24162.0min: [2024-01-06 05:38:49,760][model8_pretrain.py][INFO] Epoch:[0/2](791800/4588595) loss:2.787 lr:0.0000100 epoch_Time:24162.0min: [2024-01-06 05:38:49,761][model8_pretrain.py][INFO] Epoch:[0/2](791800/4588595) loss:2.730 lr:0.0000100 epoch_Time:24162.0min: [2024-01-06 05:38:49,761][model8_pretrain.py][INFO] Epoch:[0/2](791800/4588595) loss:3.178 lr:0.0000100 
epoch_Time:24162.0min: [2024-01-06 05:38:49,761][model8_pretrain.py][INFO] Epoch:[0/2](791800/4588595) loss:3.183 lr:0.0000100 epoch_Time:24162.0min: [2024-01-06 05:39:26,733][model8_pretrain.py][INFO] Epoch:[0/2](791900/4588595) loss:2.964 lr:0.0000100 epoch_Time:24162.0min: [2024-01-06 05:39:26,733][model8_pretrain.py][INFO] Epoch:[0/2](791900/4588595) loss:2.957 lr:0.0000100 epoch_Time:24162.0min: [2024-01-06 05:39:26,733][model8_pretrain.py][INFO] Epoch:[0/2](791900/4588595) loss:2.710 lr:0.0000100 epoch_Time:24162.0min: [2024-01-06 05:39:26,733][model8_pretrain.py][INFO] Epoch:[0/2](791900/4588595) loss:2.970 lr:0.0000100 epoch_Time:24162.0min: [2024-01-06 05:39:26,734][model8_pretrain.py][INFO] Epoch:[0/2](791900/4588595) loss:2.889 lr:0.0000100 epoch_Time:24162.0min: [2024-01-06 05:39:26,733][model8_pretrain.py][INFO] Epoch:[0/2](791900/4588595) loss:2.642 lr:0.0000100 epoch_Time:24162.0min: [2024-01-06 05:39:26,734][model8_pretrain.py][INFO] Epoch:[0/2](791900/4588595) loss:2.694 lr:0.0000100 epoch_Time:24162.0min: [2024-01-06 05:39:26,734][model8_pretrain.py][INFO] Epoch:[0/2](791900/4588595) loss:2.429 lr:0.0000100 epoch_Time:24162.0min: [2024-01-06 05:40:03,755][model8_pretrain.py][INFO] Epoch:[0/2](792000/4588595) loss:2.725 lr:0.0000100 epoch_Time:24161.0min: [2024-01-06 05:40:03,755][model8_pretrain.py][INFO] Epoch:[0/2](792000/4588595) loss:2.685 lr:0.0000100 epoch_Time:24161.0min: [2024-01-06 05:40:03,755][model8_pretrain.py][INFO] Epoch:[0/2](792000/4588595) loss:2.471 lr:0.0000100 epoch_Time:24161.0min: [2024-01-06 05:40:03,756][model8_pretrain.py][INFO] Epoch:[0/2](792000/4588595) loss:3.067 lr:0.0000100 epoch_Time:24161.0min: [2024-01-06 05:40:03,756][model8_pretrain.py][INFO] Epoch:[0/2](792000/4588595) loss:3.107 lr:0.0000100 epoch_Time:24161.0min: [2024-01-06 05:40:03,756][model8_pretrain.py][INFO] Epoch:[0/2](792000/4588595) loss:2.345 lr:0.0000100 epoch_Time:24161.0min: [2024-01-06 05:40:03,756][model8_pretrain.py][INFO] 
Epoch:[0/2](792000/4588595) loss:2.630 lr:0.0000100 epoch_Time:24161.0min: [2024-01-06 05:40:03,756][model8_pretrain.py][INFO] Epoch:[0/2](792000/4588595) loss:3.077 lr:0.0000100 epoch_Time:24161.0min: [2024-01-06 05:40:40,713][model8_pretrain.py][INFO] Epoch:[0/2](792100/4588595) loss:3.005 lr:0.0000100 epoch_Time:24161.0min: [2024-01-06 05:40:40,713][model8_pretrain.py][INFO] Epoch:[0/2](792100/4588595) loss:3.086 lr:0.0000100 epoch_Time:24161.0min: [2024-01-06 05:40:40,713][model8_pretrain.py][INFO] Epoch:[0/2](792100/4588595) loss:2.379 lr:0.0000100 epoch_Time:24161.0min: [2024-01-06 05:40:40,713][model8_pretrain.py][INFO] Epoch:[0/2](792100/4588595) loss:2.787 lr:0.0000100 epoch_Time:24161.0min: [2024-01-06 05:40:40,713][model8_pretrain.py][INFO] Epoch:[0/2](792100/4588595) loss:3.011 lr:0.0000100 epoch_Time:24161.0min: [2024-01-06 05:40:40,713][model8_pretrain.py][INFO] Epoch:[0/2](792100/4588595) loss:3.141 lr:0.0000100 epoch_Time:24161.0min: [2024-01-06 05:40:40,713][model8_pretrain.py][INFO] Epoch:[0/2](792100/4588595) loss:2.634 lr:0.0000100 epoch_Time:24161.0min: [2024-01-06 05:40:40,713][model8_pretrain.py][INFO] Epoch:[0/2](792100/4588595) loss:3.377 lr:0.0000100 epoch_Time:24161.0min: [2024-01-06 05:41:17,667][model8_pretrain.py][INFO] Epoch:[0/2](792200/4588595) loss:2.364 lr:0.0000100 epoch_Time:24160.0min: [2024-01-06 05:41:17,667][model8_pretrain.py][INFO] Epoch:[0/2](792200/4588595) loss:2.810 lr:0.0000100 epoch_Time:24160.0min: [2024-01-06 05:41:17,667][model8_pretrain.py][INFO] Epoch:[0/2](792200/4588595) loss:2.863 lr:0.0000100 epoch_Time:24160.0min: [2024-01-06 05:41:17,667][model8_pretrain.py][INFO] Epoch:[0/2](792200/4588595) loss:2.831 lr:0.0000100 epoch_Time:24160.0min: [2024-01-06 05:41:17,668][model8_pretrain.py][INFO] Epoch:[0/2](792200/4588595) loss:3.246 lr:0.0000100 epoch_Time:24160.0min: [2024-01-06 05:41:17,668][model8_pretrain.py][INFO] Epoch:[0/2](792200/4588595) loss:2.717 lr:0.0000100 epoch_Time:24160.0min: [2024-01-06 
05:41:17,668][model8_pretrain.py][INFO] Epoch:[0/2](792200/4588595) loss:2.826 lr:0.0000100 epoch_Time:24160.0min: [2024-01-06 05:41:17,668][model8_pretrain.py][INFO] Epoch:[0/2](792200/4588595) loss:2.519 lr:0.0000100 epoch_Time:24160.0min: [2024-01-06 05:42:03,279][model8_pretrain.py][INFO] Epoch:[0/2](792300/4588595) loss:3.147 lr:0.0000100 epoch_Time:24160.0min: [2024-01-06 05:42:03,279][model8_pretrain.py][INFO] Epoch:[0/2](792300/4588595) loss:2.778 lr:0.0000100 epoch_Time:24160.0min: [2024-01-06 05:42:03,279][model8_pretrain.py][INFO] Epoch:[0/2](792300/4588595) loss:2.889 lr:0.0000100 epoch_Time:24160.0min: [2024-01-06 05:42:03,280][model8_pretrain.py][INFO] Epoch:[0/2](792300/4588595) loss:2.455 lr:0.0000100 epoch_Time:24160.0min: [2024-01-06 05:42:03,280][model8_pretrain.py][INFO] Epoch:[0/2](792300/4588595) loss:2.602 lr:0.0000100 epoch_Time:24160.0min: [2024-01-06 05:42:03,280][model8_pretrain.py][INFO] Epoch:[0/2](792300/4588595) loss:2.843 lr:0.0000100 epoch_Time:24160.0min: [2024-01-06 05:42:03,280][model8_pretrain.py][INFO] Epoch:[0/2](792300/4588595) loss:3.204 lr:0.0000100 epoch_Time:24160.0min: [2024-01-06 05:42:03,281][model8_pretrain.py][INFO] Epoch:[0/2](792300/4588595) loss:2.315 lr:0.0000100 epoch_Time:24160.0min: [2024-01-06 05:42:41,885][model8_pretrain.py][INFO] Epoch:[0/2](792400/4588595) loss:2.308 lr:0.0000100 epoch_Time:24160.0min: [2024-01-06 05:42:41,885][model8_pretrain.py][INFO] Epoch:[0/2](792400/4588595) loss:2.259 lr:0.0000100 epoch_Time:24160.0min: [2024-01-06 05:42:41,885][model8_pretrain.py][INFO] Epoch:[0/2](792400/4588595) loss:3.015 lr:0.0000100 epoch_Time:24160.0min: [2024-01-06 05:42:41,885][model8_pretrain.py][INFO] Epoch:[0/2](792400/4588595) loss:1.874 lr:0.0000100 epoch_Time:24160.0min: [2024-01-06 05:42:41,885][model8_pretrain.py][INFO] Epoch:[0/2](792400/4588595) loss:2.990 lr:0.0000100 epoch_Time:24160.0min: [2024-01-06 05:42:41,885][model8_pretrain.py][INFO] Epoch:[0/2](792400/4588595) loss:2.839 lr:0.0000100 
epoch_Time:24160.0min: [2024-01-06 05:42:41,885][model8_pretrain.py][INFO] Epoch:[0/2](792400/4588595) loss:3.193 lr:0.0000100 epoch_Time:24160.0min: [2024-01-06 05:42:41,885][model8_pretrain.py][INFO] Epoch:[0/2](792400/4588595) loss:3.298 lr:0.0000100 epoch_Time:24160.0min: [2024-01-06 05:43:18,816][model8_pretrain.py][INFO] Epoch:[0/2](792500/4588595) loss:2.595 lr:0.0000100 epoch_Time:24159.0min: [2024-01-06 05:43:18,816][model8_pretrain.py][INFO] Epoch:[0/2](792500/4588595) loss:3.114 lr:0.0000100 epoch_Time:24159.0min: [2024-01-06 05:43:18,816][model8_pretrain.py][INFO] Epoch:[0/2](792500/4588595) loss:1.788 lr:0.0000100 epoch_Time:24159.0min: [2024-01-06 05:43:18,816][model8_pretrain.py][INFO] Epoch:[0/2](792500/4588595) loss:3.085 lr:0.0000100 epoch_Time:24159.0min: [2024-01-06 05:43:18,816][model8_pretrain.py][INFO] Epoch:[0/2](792500/4588595) loss:3.002 lr:0.0000100 epoch_Time:24159.0min: [2024-01-06 05:43:18,816][model8_pretrain.py][INFO] Epoch:[0/2](792500/4588595) loss:2.727 lr:0.0000100 epoch_Time:24159.0min: [2024-01-06 05:43:18,816][model8_pretrain.py][INFO] Epoch:[0/2](792500/4588595) loss:2.672 lr:0.0000100 epoch_Time:24159.0min: [2024-01-06 05:43:18,817][model8_pretrain.py][INFO] Epoch:[0/2](792500/4588595) loss:2.480 lr:0.0000100 epoch_Time:24159.0min: [2024-01-06 05:43:55,752][model8_pretrain.py][INFO] Epoch:[0/2](792600/4588595) loss:2.738 lr:0.0000100 epoch_Time:24157.0min: [2024-01-06 05:43:55,752][model8_pretrain.py][INFO] Epoch:[0/2](792600/4588595) loss:3.027 lr:0.0000100 epoch_Time:24157.0min: [2024-01-06 05:43:55,752][model8_pretrain.py][INFO] Epoch:[0/2](792600/4588595) loss:2.604 lr:0.0000100 epoch_Time:24157.0min: [2024-01-06 05:43:55,752][model8_pretrain.py][INFO] Epoch:[0/2](792600/4588595) loss:2.574 lr:0.0000100 epoch_Time:24157.0min: [2024-01-06 05:43:55,752][model8_pretrain.py][INFO] Epoch:[0/2](792600/4588595) loss:3.122 lr:0.0000100 epoch_Time:24157.0min: [2024-01-06 05:43:55,752][model8_pretrain.py][INFO] 
Epoch:[0/2](792600/4588595) loss:2.834 lr:0.0000100 epoch_Time:24157.0min: [2024-01-06 05:43:55,752][model8_pretrain.py][INFO] Epoch:[0/2](792600/4588595) loss:2.536 lr:0.0000100 epoch_Time:24157.0min: [2024-01-06 05:43:55,752][model8_pretrain.py][INFO] Epoch:[0/2](792600/4588595) loss:3.041 lr:0.0000100 epoch_Time:24157.0min: [2024-01-06 05:44:32,682][model8_pretrain.py][INFO] Epoch:[0/2](792700/4588595) loss:2.957 lr:0.0000100 epoch_Time:24157.0min: [2024-01-06 05:44:32,682][model8_pretrain.py][INFO] Epoch:[0/2](792700/4588595) loss:2.729 lr:0.0000100 epoch_Time:24157.0min: [2024-01-06 05:44:32,682][model8_pretrain.py][INFO] Epoch:[0/2](792700/4588595) loss:2.528 lr:0.0000100 epoch_Time:24157.0min: [2024-01-06 05:44:32,682][model8_pretrain.py][INFO] Epoch:[0/2](792700/4588595) loss:2.359 lr:0.0000100 epoch_Time:24157.0min: [2024-01-06 05:44:32,682][model8_pretrain.py][INFO] Epoch:[0/2](792700/4588595) loss:3.018 lr:0.0000100 epoch_Time:24157.0min: [2024-01-06 05:44:32,682][model8_pretrain.py][INFO] Epoch:[0/2](792700/4588595) loss:2.103 lr:0.0000100 epoch_Time:24157.0min: [2024-01-06 05:44:32,682][model8_pretrain.py][INFO] Epoch:[0/2](792700/4588595) loss:2.965 lr:0.0000100 epoch_Time:24157.0min: [2024-01-06 05:44:32,683][model8_pretrain.py][INFO] Epoch:[0/2](792700/4588595) loss:2.658 lr:0.0000100 epoch_Time:24157.0min: [2024-01-06 05:45:09,635][model8_pretrain.py][INFO] Epoch:[0/2](792800/4588595) loss:2.935 lr:0.0000100 epoch_Time:24156.0min: [2024-01-06 05:45:09,635][model8_pretrain.py][INFO] Epoch:[0/2](792800/4588595) loss:3.161 lr:0.0000100 epoch_Time:24156.0min: [2024-01-06 05:45:09,636][model8_pretrain.py][INFO] Epoch:[0/2](792800/4588595) loss:3.006 lr:0.0000100 epoch_Time:24156.0min: [2024-01-06 05:45:09,636][model8_pretrain.py][INFO] Epoch:[0/2](792800/4588595) loss:2.552 lr:0.0000100 epoch_Time:24156.0min: [2024-01-06 05:45:09,636][model8_pretrain.py][INFO] Epoch:[0/2](792800/4588595) loss:3.096 lr:0.0000100 epoch_Time:24156.0min: [2024-01-06 
05:45:09,636][model8_pretrain.py][INFO] Epoch:[0/2](792800/4588595) loss:3.006 lr:0.0000100 epoch_Time:24156.0min: [2024-01-06 05:45:09,636][model8_pretrain.py][INFO] Epoch:[0/2](792800/4588595) loss:2.502 lr:0.0000100 epoch_Time:24156.0min: [2024-01-06 05:45:09,636][model8_pretrain.py][INFO] Epoch:[0/2](792800/4588595) loss:2.927 lr:0.0000100 epoch_Time:24156.0min: [2024-01-06 05:45:46,589][model8_pretrain.py][INFO] Epoch:[0/2](792900/4588595) loss:2.499 lr:0.0000100 epoch_Time:24156.0min: [2024-01-06 05:45:46,589][model8_pretrain.py][INFO] Epoch:[0/2](792900/4588595) loss:2.062 lr:0.0000100 epoch_Time:24156.0min: [2024-01-06 05:45:46,589][model8_pretrain.py][INFO] Epoch:[0/2](792900/4588595) loss:2.839 lr:0.0000100 epoch_Time:24156.0min: [2024-01-06 05:45:46,589][model8_pretrain.py][INFO] Epoch:[0/2](792900/4588595) loss:3.099 lr:0.0000100 epoch_Time:24156.0min: [2024-01-06 05:45:46,589][model8_pretrain.py][INFO] Epoch:[0/2](792900/4588595) loss:2.779 lr:0.0000100 epoch_Time:24156.0min: [2024-01-06 05:45:46,590][model8_pretrain.py][INFO] Epoch:[0/2](792900/4588595) loss:3.091 lr:0.0000100 epoch_Time:24156.0min: [2024-01-06 05:45:46,589][model8_pretrain.py][INFO] Epoch:[0/2](792900/4588595) loss:3.068 lr:0.0000100 epoch_Time:24156.0min: [2024-01-06 05:45:46,590][model8_pretrain.py][INFO] Epoch:[0/2](792900/4588595) loss:2.968 lr:0.0000100 epoch_Time:24156.0min: [2024-01-06 05:46:23,534][model8_pretrain.py][INFO] Epoch:[0/2](793000/4588595) loss:2.999 lr:0.0000100 epoch_Time:24155.0min: [2024-01-06 05:46:23,534][model8_pretrain.py][INFO] Epoch:[0/2](793000/4588595) loss:2.848 lr:0.0000100 epoch_Time:24155.0min: [2024-01-06 05:46:23,534][model8_pretrain.py][INFO] Epoch:[0/2](793000/4588595) loss:3.023 lr:0.0000100 epoch_Time:24155.0min: [2024-01-06 05:46:23,534][model8_pretrain.py][INFO] Epoch:[0/2](793000/4588595) loss:2.828 lr:0.0000100 epoch_Time:24155.0min: [2024-01-06 05:46:23,534][model8_pretrain.py][INFO] Epoch:[0/2](793000/4588595) loss:2.849 lr:0.0000100 
epoch_Time:24155.0min: [2024-01-06 05:46:23,534][model8_pretrain.py][INFO] Epoch:[0/2](793000/4588595) loss:2.338 lr:0.0000100 epoch_Time:24155.0min: [2024-01-06 05:46:23,534][model8_pretrain.py][INFO] Epoch:[0/2](793000/4588595) loss:2.971 lr:0.0000100 epoch_Time:24155.0min: [2024-01-06 05:46:23,534][model8_pretrain.py][INFO] Epoch:[0/2](793000/4588595) loss:2.394 lr:0.0000100 epoch_Time:24155.0min: [2024-01-06 05:47:07,430][model8_pretrain.py][INFO] Epoch:[0/2](793100/4588595) loss:2.740 lr:0.0000100 epoch_Time:24155.0min: [2024-01-06 05:47:07,430][model8_pretrain.py][INFO] Epoch:[0/2](793100/4588595) loss:2.792 lr:0.0000100 epoch_Time:24155.0min: [2024-01-06 05:47:07,430][model8_pretrain.py][INFO] Epoch:[0/2](793100/4588595) loss:2.791 lr:0.0000100 epoch_Time:24155.0min: [2024-01-06 05:47:07,430][model8_pretrain.py][INFO] Epoch:[0/2](793100/4588595) loss:2.485 lr:0.0000100 epoch_Time:24155.0min: [2024-01-06 05:47:07,430][model8_pretrain.py][INFO] Epoch:[0/2](793100/4588595) loss:2.235 lr:0.0000100 epoch_Time:24155.0min: [2024-01-06 05:47:07,430][model8_pretrain.py][INFO] Epoch:[0/2](793100/4588595) loss:3.049 lr:0.0000100 epoch_Time:24155.0min: [2024-01-06 05:47:07,430][model8_pretrain.py][INFO] Epoch:[0/2](793100/4588595) loss:2.812 lr:0.0000100 epoch_Time:24155.0min: [2024-01-06 05:47:07,430][model8_pretrain.py][INFO] Epoch:[0/2](793100/4588595) loss:2.579 lr:0.0000100 epoch_Time:24155.0min: [2024-01-06 05:47:47,833][model8_pretrain.py][INFO] Epoch:[0/2](793200/4588595) loss:3.003 lr:0.0000100 epoch_Time:24154.0min: [2024-01-06 05:47:47,833][model8_pretrain.py][INFO] Epoch:[0/2](793200/4588595) loss:2.791 lr:0.0000100 epoch_Time:24154.0min: [2024-01-06 05:47:47,833][model8_pretrain.py][INFO] Epoch:[0/2](793200/4588595) loss:2.626 lr:0.0000100 epoch_Time:24154.0min: [2024-01-06 05:47:47,833][model8_pretrain.py][INFO] Epoch:[0/2](793200/4588595) loss:2.836 lr:0.0000100 epoch_Time:24154.0min: [2024-01-06 05:47:47,833][model8_pretrain.py][INFO] 
Epoch:[0/2](793200/4588595) loss:2.553 lr:0.0000100 epoch_Time:24154.0min: [2024-01-06 05:47:47,833][model8_pretrain.py][INFO] Epoch:[0/2](793200/4588595) loss:2.887 lr:0.0000100 epoch_Time:24154.0min: [2024-01-06 05:47:47,833][model8_pretrain.py][INFO] Epoch:[0/2](793200/4588595) loss:2.754 lr:0.0000100 epoch_Time:24154.0min: [2024-01-06 05:47:47,833][model8_pretrain.py][INFO] Epoch:[0/2](793200/4588595) loss:3.146 lr:0.0000100 epoch_Time:24154.0min: [2024-01-06 05:48:24,777][model8_pretrain.py][INFO] Epoch:[0/2](793300/4588595) loss:3.095 lr:0.0000100 epoch_Time:24154.0min: [2024-01-06 05:48:24,777][model8_pretrain.py][INFO] Epoch:[0/2](793300/4588595) loss:3.141 lr:0.0000100 epoch_Time:24154.0min: [2024-01-06 05:48:24,777][model8_pretrain.py][INFO] Epoch:[0/2](793300/4588595) loss:2.256 lr:0.0000100 epoch_Time:24154.0min: [2024-01-06 05:48:24,777][model8_pretrain.py][INFO] Epoch:[0/2](793300/4588595) loss:2.954 lr:0.0000100 epoch_Time:24154.0min: [2024-01-06 05:48:24,777][model8_pretrain.py][INFO] Epoch:[0/2](793300/4588595) loss:3.409 lr:0.0000100 epoch_Time:24154.0min: [2024-01-06 05:48:24,777][model8_pretrain.py][INFO] Epoch:[0/2](793300/4588595) loss:3.010 lr:0.0000100 epoch_Time:24154.0min: [2024-01-06 05:48:24,777][model8_pretrain.py][INFO] Epoch:[0/2](793300/4588595) loss:2.421 lr:0.0000100 epoch_Time:24154.0min: [2024-01-06 05:48:24,777][model8_pretrain.py][INFO] Epoch:[0/2](793300/4588595) loss:3.055 lr:0.0000100 epoch_Time:24154.0min: [2024-01-06 05:49:01,720][model8_pretrain.py][INFO] Epoch:[0/2](793400/4588595) loss:2.975 lr:0.0000100 epoch_Time:24152.0min: [2024-01-06 05:49:01,720][model8_pretrain.py][INFO] Epoch:[0/2](793400/4588595) loss:2.817 lr:0.0000100 epoch_Time:24153.0min: [2024-01-06 05:49:01,720][model8_pretrain.py][INFO] Epoch:[0/2](793400/4588595) loss:2.709 lr:0.0000100 epoch_Time:24153.0min: [2024-01-06 05:49:01,720][model8_pretrain.py][INFO] Epoch:[0/2](793400/4588595) loss:2.889 lr:0.0000100 epoch_Time:24152.0min: [2024-01-06 
05:49:01,720][model8_pretrain.py][INFO] Epoch:[0/2](793400/4588595) loss:2.672 lr:0.0000100 epoch_Time:24153.0min: [2024-01-06 05:49:01,720][model8_pretrain.py][INFO] Epoch:[0/2](793400/4588595) loss:2.676 lr:0.0000100 epoch_Time:24153.0min: [2024-01-06 05:49:01,721][model8_pretrain.py][INFO] Epoch:[0/2](793400/4588595) loss:2.897 lr:0.0000100 epoch_Time:24152.0min: [2024-01-06 05:49:01,721][model8_pretrain.py][INFO] Epoch:[0/2](793400/4588595) loss:2.581 lr:0.0000100 epoch_Time:24153.0min: [2024-01-06 05:49:38,635][model8_pretrain.py][INFO] Epoch:[0/2](793500/4588595) loss:2.789 lr:0.0000100 epoch_Time:24152.0min: [2024-01-06 05:49:38,635][model8_pretrain.py][INFO] Epoch:[0/2](793500/4588595) loss:1.414 lr:0.0000100 epoch_Time:24152.0min: [2024-01-06 05:49:38,635][model8_pretrain.py][INFO] Epoch:[0/2](793500/4588595) loss:2.706 lr:0.0000100 epoch_Time:24152.0min: [2024-01-06 05:49:38,635][model8_pretrain.py][INFO] Epoch:[0/2](793500/4588595) loss:2.478 lr:0.0000100 epoch_Time:24152.0min: [2024-01-06 05:49:38,635][model8_pretrain.py][INFO] Epoch:[0/2](793500/4588595) loss:2.609 lr:0.0000100 epoch_Time:24152.0min: [2024-01-06 05:49:38,635][model8_pretrain.py][INFO] Epoch:[0/2](793500/4588595) loss:2.566 lr:0.0000100 epoch_Time:24152.0min: [2024-01-06 05:49:38,636][model8_pretrain.py][INFO] Epoch:[0/2](793500/4588595) loss:2.719 lr:0.0000100 epoch_Time:24152.0min: [2024-01-06 05:49:38,636][model8_pretrain.py][INFO] Epoch:[0/2](793500/4588595) loss:2.699 lr:0.0000100 epoch_Time:24152.0min: [2024-01-06 05:50:15,578][model8_pretrain.py][INFO] Epoch:[0/2](793600/4588595) loss:2.872 lr:0.0000100 epoch_Time:24151.0min: [2024-01-06 05:50:15,578][model8_pretrain.py][INFO] Epoch:[0/2](793600/4588595) loss:2.891 lr:0.0000100 epoch_Time:24151.0min: [2024-01-06 05:50:15,578][model8_pretrain.py][INFO] Epoch:[0/2](793600/4588595) loss:3.100 lr:0.0000100 epoch_Time:24151.0min: [2024-01-06 05:50:15,578][model8_pretrain.py][INFO] Epoch:[0/2](793600/4588595) loss:3.086 lr:0.0000100 
epoch_Time:24151.0min: [2024-01-06 05:50:15,578][model8_pretrain.py][INFO] Epoch:[0/2](793600/4588595) loss:2.433 lr:0.0000100 epoch_Time:24151.0min: [2024-01-06 05:50:15,578][model8_pretrain.py][INFO] Epoch:[0/2](793600/4588595) loss:2.893 lr:0.0000100 epoch_Time:24151.0min: [2024-01-06 05:50:15,578][model8_pretrain.py][INFO] Epoch:[0/2](793600/4588595) loss:3.125 lr:0.0000100 epoch_Time:24151.0min: [2024-01-06 05:50:15,578][model8_pretrain.py][INFO] Epoch:[0/2](793600/4588595) loss:3.153 lr:0.0000100 epoch_Time:24151.0min: [2024-01-06 05:50:52,523][model8_pretrain.py][INFO] Epoch:[0/2](793700/4588595) loss:2.335 lr:0.0000100 epoch_Time:24150.0min: [2024-01-06 05:50:52,523][model8_pretrain.py][INFO] Epoch:[0/2](793700/4588595) loss:2.111 lr:0.0000100 epoch_Time:24150.0min: [2024-01-06 05:50:52,523][model8_pretrain.py][INFO] Epoch:[0/2](793700/4588595) loss:2.812 lr:0.0000100 epoch_Time:24150.0min: [2024-01-06 05:50:52,523][model8_pretrain.py][INFO] Epoch:[0/2](793700/4588595) loss:2.861 lr:0.0000100 epoch_Time:24150.0min: [2024-01-06 05:50:52,523][model8_pretrain.py][INFO] Epoch:[0/2](793700/4588595) loss:2.437 lr:0.0000100 epoch_Time:24150.0min: [2024-01-06 05:50:52,523][model8_pretrain.py][INFO] Epoch:[0/2](793700/4588595) loss:2.506 lr:0.0000100 epoch_Time:24150.0min: [2024-01-06 05:50:52,523][model8_pretrain.py][INFO] Epoch:[0/2](793700/4588595) loss:2.954 lr:0.0000100 epoch_Time:24150.0min: [2024-01-06 05:50:52,524][model8_pretrain.py][INFO] Epoch:[0/2](793700/4588595) loss:2.664 lr:0.0000100 epoch_Time:24150.0min: [2024-01-06 05:51:29,482][model8_pretrain.py][INFO] Epoch:[0/2](793800/4588595) loss:3.022 lr:0.0000100 epoch_Time:24150.0min: [2024-01-06 05:51:29,482][model8_pretrain.py][INFO] Epoch:[0/2](793800/4588595) loss:2.954 lr:0.0000100 epoch_Time:24150.0min: [2024-01-06 05:51:29,482][model8_pretrain.py][INFO] Epoch:[0/2](793800/4588595) loss:3.071 lr:0.0000100 epoch_Time:24150.0min: [2024-01-06 05:51:29,482][model8_pretrain.py][INFO] 
Epoch:[0/2](793800/4588595) loss:3.060 lr:0.0000100 epoch_Time:24150.0min: [2024-01-06 05:51:29,482][model8_pretrain.py][INFO] Epoch:[0/2](793800/4588595) loss:3.065 lr:0.0000100 epoch_Time:24150.0min: [2024-01-06 05:51:29,482][model8_pretrain.py][INFO] Epoch:[0/2](793800/4588595) loss:2.414 lr:0.0000100 epoch_Time:24150.0min: [2024-01-06 05:51:29,482][model8_pretrain.py][INFO] Epoch:[0/2](793800/4588595) loss:3.164 lr:0.0000100 epoch_Time:24150.0min: [2024-01-06 05:51:29,482][model8_pretrain.py][INFO] Epoch:[0/2](793800/4588595) loss:2.807 lr:0.0000100 epoch_Time:24150.0min: [2024-01-06 05:52:13,423][model8_pretrain.py][INFO] Epoch:[0/2](793900/4588595) loss:2.142 lr:0.0000100 epoch_Time:24150.0min: [2024-01-06 05:52:13,423][model8_pretrain.py][INFO] Epoch:[0/2](793900/4588595) loss:3.144 lr:0.0000100 epoch_Time:24150.0min: [2024-01-06 05:52:13,423][model8_pretrain.py][INFO] Epoch:[0/2](793900/4588595) loss:3.120 lr:0.0000100 epoch_Time:24150.0min: [2024-01-06 05:52:13,423][model8_pretrain.py][INFO] Epoch:[0/2](793900/4588595) loss:2.797 lr:0.0000100 epoch_Time:24150.0min: [2024-01-06 05:52:13,423][model8_pretrain.py][INFO] Epoch:[0/2](793900/4588595) loss:3.107 lr:0.0000100 epoch_Time:24150.0min: [2024-01-06 05:52:13,423][model8_pretrain.py][INFO] Epoch:[0/2](793900/4588595) loss:2.984 lr:0.0000100 epoch_Time:24150.0min: [2024-01-06 05:52:13,423][model8_pretrain.py][INFO] Epoch:[0/2](793900/4588595) loss:2.708 lr:0.0000100 epoch_Time:24150.0min: [2024-01-06 05:52:13,424][model8_pretrain.py][INFO] Epoch:[0/2](793900/4588595) loss:2.707 lr:0.0000100 epoch_Time:24150.0min: [2024-01-06 05:52:53,835][model8_pretrain.py][INFO] Epoch:[0/2](794000/4588595) loss:3.026 lr:0.0000100 epoch_Time:24149.0min: [2024-01-06 05:52:53,835][model8_pretrain.py][INFO] Epoch:[0/2](794000/4588595) loss:2.747 lr:0.0000100 epoch_Time:24149.0min: [2024-01-06 05:52:53,835][model8_pretrain.py][INFO] Epoch:[0/2](794000/4588595) loss:2.967 lr:0.0000100 epoch_Time:24149.0min: [2024-01-06 
05:52:53,835][model8_pretrain.py][INFO] Epoch:[0/2](794000/4588595) loss:2.616 lr:0.0000100 epoch_Time:24149.0min: [2024-01-06 05:52:53,835][model8_pretrain.py][INFO] Epoch:[0/2](794000/4588595) loss:2.871 lr:0.0000100 epoch_Time:24149.0min: [2024-01-06 05:52:53,835][model8_pretrain.py][INFO] Epoch:[0/2](794000/4588595) loss:3.137 lr:0.0000100 epoch_Time:24149.0min: [2024-01-06 05:52:53,835][model8_pretrain.py][INFO] Epoch:[0/2](794000/4588595) loss:3.223 lr:0.0000100 epoch_Time:24149.0min: [2024-01-06 05:52:53,835][model8_pretrain.py][INFO] Epoch:[0/2](794000/4588595) loss:2.728 lr:0.0000100 epoch_Time:24149.0min: [2024-01-06 05:53:30,796][model8_pretrain.py][INFO] Epoch:[0/2](794100/4588595) loss:2.890 lr:0.0000100 epoch_Time:24149.0min: [2024-01-06 05:53:30,797][model8_pretrain.py][INFO] Epoch:[0/2](794100/4588595) loss:2.498 lr:0.0000100 epoch_Time:24149.0min: [2024-01-06 05:53:30,797][model8_pretrain.py][INFO] Epoch:[0/2](794100/4588595) loss:2.948 lr:0.0000100 epoch_Time:24149.0min: [2024-01-06 05:53:30,797][model8_pretrain.py][INFO] Epoch:[0/2](794100/4588595) loss:2.531 lr:0.0000100 epoch_Time:24149.0min: [2024-01-06 05:53:30,797][model8_pretrain.py][INFO] Epoch:[0/2](794100/4588595) loss:2.973 lr:0.0000100 epoch_Time:24149.0min: [2024-01-06 05:53:30,797][model8_pretrain.py][INFO] Epoch:[0/2](794100/4588595) loss:3.240 lr:0.0000100 epoch_Time:24149.0min: [2024-01-06 05:53:30,797][model8_pretrain.py][INFO] Epoch:[0/2](794100/4588595) loss:2.519 lr:0.0000100 epoch_Time:24149.0min: [2024-01-06 05:53:30,797][model8_pretrain.py][INFO] Epoch:[0/2](794100/4588595) loss:3.070 lr:0.0000100 epoch_Time:24149.0min: [2024-01-06 05:54:07,762][model8_pretrain.py][INFO] Epoch:[0/2](794200/4588595) loss:2.596 lr:0.0000100 epoch_Time:24148.0min: [2024-01-06 05:54:07,762][model8_pretrain.py][INFO] Epoch:[0/2](794200/4588595) loss:2.811 lr:0.0000100 epoch_Time:24148.0min: [2024-01-06 05:54:07,762][model8_pretrain.py][INFO] Epoch:[0/2](794200/4588595) loss:3.537 lr:0.0000100 
epoch_Time:24148.0min: [2024-01-06 05:54:07,762][model8_pretrain.py][INFO] Epoch:[0/2](794200/4588595) loss:2.915 lr:0.0000100 epoch_Time:24148.0min: [2024-01-06 05:54:07,762][model8_pretrain.py][INFO] Epoch:[0/2](794200/4588595) loss:3.183 lr:0.0000100 epoch_Time:24148.0min: [2024-01-06 05:54:07,762][model8_pretrain.py][INFO] Epoch:[0/2](794200/4588595) loss:2.354 lr:0.0000100 epoch_Time:24148.0min: [2024-01-06 05:54:07,762][model8_pretrain.py][INFO] Epoch:[0/2](794200/4588595) loss:2.905 lr:0.0000100 epoch_Time:24148.0min: [2024-01-06 05:54:07,762][model8_pretrain.py][INFO] Epoch:[0/2](794200/4588595) loss:3.115 lr:0.0000100 epoch_Time:24148.0min: [2024-01-06 05:54:44,724][model8_pretrain.py][INFO] Epoch:[0/2](794300/4588595) loss:2.704 lr:0.0000100 epoch_Time:24147.0min: [2024-01-06 05:54:44,724][model8_pretrain.py][INFO] Epoch:[0/2](794300/4588595) loss:2.802 lr:0.0000100 epoch_Time:24147.0min: [2024-01-06 05:54:44,724][model8_pretrain.py][INFO] Epoch:[0/2](794300/4588595) loss:3.346 lr:0.0000100 epoch_Time:24147.0min: [2024-01-06 05:54:44,724][model8_pretrain.py][INFO] Epoch:[0/2](794300/4588595) loss:2.751 lr:0.0000100 epoch_Time:24147.0min: [2024-01-06 05:54:44,724][model8_pretrain.py][INFO] Epoch:[0/2](794300/4588595) loss:2.259 lr:0.0000100 epoch_Time:24147.0min: [2024-01-06 05:54:44,724][model8_pretrain.py][INFO] Epoch:[0/2](794300/4588595) loss:2.848 lr:0.0000100 epoch_Time:24147.0min: [2024-01-06 05:54:44,724][model8_pretrain.py][INFO] Epoch:[0/2](794300/4588595) loss:2.584 lr:0.0000100 epoch_Time:24147.0min: [2024-01-06 05:54:44,724][model8_pretrain.py][INFO] Epoch:[0/2](794300/4588595) loss:3.108 lr:0.0000100 epoch_Time:24147.0min: [2024-01-06 05:55:21,680][model8_pretrain.py][INFO] Epoch:[0/2](794400/4588595) loss:3.024 lr:0.0000100 epoch_Time:24146.0min: [2024-01-06 05:55:21,680][model8_pretrain.py][INFO] Epoch:[0/2](794400/4588595) loss:2.435 lr:0.0000100 epoch_Time:24146.0min: [2024-01-06 05:55:21,680][model8_pretrain.py][INFO] 
Epoch:[0/2](794400/4588595) loss:2.819 lr:0.0000100 epoch_Time:24146.0min: [2024-01-06 05:55:21,680][model8_pretrain.py][INFO] Epoch:[0/2](794400/4588595) loss:3.088 lr:0.0000100 epoch_Time:24146.0min: [2024-01-06 05:55:21,680][model8_pretrain.py][INFO] Epoch:[0/2](794400/4588595) loss:2.149 lr:0.0000100 epoch_Time:24146.0min: [2024-01-06 05:55:21,680][model8_pretrain.py][INFO] Epoch:[0/2](794400/4588595) loss:2.848 lr:0.0000100 epoch_Time:24146.0min: [2024-01-06 05:55:21,680][model8_pretrain.py][INFO] Epoch:[0/2](794400/4588595) loss:2.704 lr:0.0000100 epoch_Time:24146.0min: [2024-01-06 05:55:21,680][model8_pretrain.py][INFO] Epoch:[0/2](794400/4588595) loss:2.787 lr:0.0000100 epoch_Time:24146.0min: [2024-01-06 05:55:58,637][model8_pretrain.py][INFO] Epoch:[0/2](794500/4588595) loss:2.724 lr:0.0000100 epoch_Time:24145.0min: [2024-01-06 05:55:58,637][model8_pretrain.py][INFO] Epoch:[0/2](794500/4588595) loss:3.319 lr:0.0000100 epoch_Time:24145.0min: [2024-01-06 05:55:58,637][model8_pretrain.py][INFO] Epoch:[0/2](794500/4588595) loss:2.764 lr:0.0000100 epoch_Time:24145.0min: [2024-01-06 05:55:58,637][model8_pretrain.py][INFO] Epoch:[0/2](794500/4588595) loss:3.303 lr:0.0000100 epoch_Time:24145.0min: [2024-01-06 05:55:58,637][model8_pretrain.py][INFO] Epoch:[0/2](794500/4588595) loss:3.037 lr:0.0000100 epoch_Time:24145.0min: [2024-01-06 05:55:58,637][model8_pretrain.py][INFO] Epoch:[0/2](794500/4588595) loss:2.424 lr:0.0000100 epoch_Time:24145.0min: [2024-01-06 05:55:58,637][model8_pretrain.py][INFO] Epoch:[0/2](794500/4588595) loss:2.705 lr:0.0000100 epoch_Time:24145.0min: [2024-01-06 05:55:58,637][model8_pretrain.py][INFO] Epoch:[0/2](794500/4588595) loss:3.055 lr:0.0000100 epoch_Time:24145.0min: [2024-01-06 05:56:35,593][model8_pretrain.py][INFO] Epoch:[0/2](794600/4588595) loss:2.265 lr:0.0000100 epoch_Time:24145.0min: [2024-01-06 05:56:35,593][model8_pretrain.py][INFO] Epoch:[0/2](794600/4588595) loss:2.807 lr:0.0000100 epoch_Time:24145.0min: [2024-01-06 
05:56:35,593][model8_pretrain.py][INFO] Epoch:[0/2](794600/4588595) loss:3.371 lr:0.0000100 epoch_Time:24145.0min: [2024-01-06 05:56:35,593][model8_pretrain.py][INFO] Epoch:[0/2](794600/4588595) loss:3.129 lr:0.0000100 epoch_Time:24145.0min: [2024-01-06 05:56:35,593][model8_pretrain.py][INFO] Epoch:[0/2](794600/4588595) loss:2.519 lr:0.0000100 epoch_Time:24145.0min: [2024-01-06 05:56:35,593][model8_pretrain.py][INFO] Epoch:[0/2](794600/4588595) loss:3.043 lr:0.0000100 epoch_Time:24145.0min: [2024-01-06 05:56:35,593][model8_pretrain.py][INFO] Epoch:[0/2](794600/4588595) loss:3.193 lr:0.0000100 epoch_Time:24145.0min: [2024-01-06 05:56:35,593][model8_pretrain.py][INFO] Epoch:[0/2](794600/4588595) loss:2.702 lr:0.0000100 epoch_Time:24145.0min: [2024-01-06 05:57:19,599][model8_pretrain.py][INFO] Epoch:[0/2](794700/4588595) loss:3.176 lr:0.0000100 epoch_Time:24145.0min: [2024-01-06 05:57:19,599][model8_pretrain.py][INFO] Epoch:[0/2](794700/4588595) loss:3.000 lr:0.0000100 epoch_Time:24145.0min: [2024-01-06 05:57:19,603][model8_pretrain.py][INFO] Epoch:[0/2](794700/4588595) loss:3.065 lr:0.0000100 epoch_Time:24145.0min: [2024-01-06 05:57:19,603][model8_pretrain.py][INFO] Epoch:[0/2](794700/4588595) loss:2.681 lr:0.0000100 epoch_Time:24145.0min: [2024-01-06 05:57:19,603][model8_pretrain.py][INFO] Epoch:[0/2](794700/4588595) loss:3.025 lr:0.0000100 epoch_Time:24145.0min: [2024-01-06 05:57:19,603][model8_pretrain.py][INFO] Epoch:[0/2](794700/4588595) loss:3.087 lr:0.0000100 epoch_Time:24145.0min: [2024-01-06 05:57:19,604][model8_pretrain.py][INFO] Epoch:[0/2](794700/4588595) loss:2.745 lr:0.0000100 epoch_Time:24145.0min: [2024-01-06 05:57:19,605][model8_pretrain.py][INFO] Epoch:[0/2](794700/4588595) loss:2.737 lr:0.0000100 epoch_Time:24145.0min: [2024-01-06 05:58:00,006][model8_pretrain.py][INFO] Epoch:[0/2](794800/4588595) loss:2.075 lr:0.0000100 epoch_Time:24144.0min: [2024-01-06 05:58:00,006][model8_pretrain.py][INFO] Epoch:[0/2](794800/4588595) loss:2.888 lr:0.0000100 
epoch_Time:24144.0min: [2024-01-06 05:58:00,006][model8_pretrain.py][INFO] Epoch:[0/2](794800/4588595) loss:2.584 lr:0.0000100 epoch_Time:24144.0min: [2024-01-06 05:58:00,006][model8_pretrain.py][INFO] Epoch:[0/2](794800/4588595) loss:2.903 lr:0.0000100 epoch_Time:24144.0min: [2024-01-06 05:58:00,006][model8_pretrain.py][INFO] Epoch:[0/2](794800/4588595) loss:2.854 lr:0.0000100 epoch_Time:24144.0min: [2024-01-06 05:58:00,006][model8_pretrain.py][INFO] Epoch:[0/2](794800/4588595) loss:2.622 lr:0.0000100 epoch_Time:24144.0min: [2024-01-06 05:58:00,006][model8_pretrain.py][INFO] Epoch:[0/2](794800/4588595) loss:2.964 lr:0.0000100 epoch_Time:24144.0min: [2024-01-06 05:58:00,006][model8_pretrain.py][INFO] Epoch:[0/2](794800/4588595) loss:2.704 lr:0.0000100 epoch_Time:24144.0min: [2024-01-06 05:58:36,953][model8_pretrain.py][INFO] Epoch:[0/2](794900/4588595) loss:3.361 lr:0.0000100 epoch_Time:24144.0min: [2024-01-06 05:58:36,953][model8_pretrain.py][INFO] Epoch:[0/2](794900/4588595) loss:3.291 lr:0.0000100 epoch_Time:24144.0min: [2024-01-06 05:58:36,953][model8_pretrain.py][INFO] Epoch:[0/2](794900/4588595) loss:2.985 lr:0.0000100 epoch_Time:24144.0min: [2024-01-06 05:58:36,953][model8_pretrain.py][INFO] Epoch:[0/2](794900/4588595) loss:2.392 lr:0.0000100 epoch_Time:24144.0min: [2024-01-06 05:58:36,953][model8_pretrain.py][INFO] Epoch:[0/2](794900/4588595) loss:3.068 lr:0.0000100 epoch_Time:24144.0min: [2024-01-06 05:58:36,953][model8_pretrain.py][INFO] Epoch:[0/2](794900/4588595) loss:2.643 lr:0.0000100 epoch_Time:24144.0min: [2024-01-06 05:58:36,953][model8_pretrain.py][INFO] Epoch:[0/2](794900/4588595) loss:2.796 lr:0.0000100 epoch_Time:24144.0min: [2024-01-06 05:58:36,954][model8_pretrain.py][INFO] Epoch:[0/2](794900/4588595) loss:2.586 lr:0.0000100 epoch_Time:24144.0min: [2024-01-06 05:59:13,929][model8_pretrain.py][INFO] Epoch:[0/2](795000/4588595) loss:2.601 lr:0.0000100 epoch_Time:24143.0min: [2024-01-06 05:59:13,929][model8_pretrain.py][INFO] 
Epoch:[0/2](795000/4588595) loss:3.078 lr:0.0000100 epoch_Time:24143.0min: [2024-01-06 05:59:13,929][model8_pretrain.py][INFO] Epoch:[0/2](795000/4588595) loss:2.576 lr:0.0000100 epoch_Time:24143.0min: [2024-01-06 05:59:13,930][model8_pretrain.py][INFO] Epoch:[0/2](795000/4588595) loss:2.666 lr:0.0000100 epoch_Time:24143.0min: [2024-01-06 05:59:13,929][model8_pretrain.py][INFO] Epoch:[0/2](795000/4588595) loss:2.867 lr:0.0000100 epoch_Time:24143.0min: [2024-01-06 05:59:13,930][model8_pretrain.py][INFO] Epoch:[0/2](795000/4588595) loss:2.460 lr:0.0000100 epoch_Time:24143.0min: [2024-01-06 05:59:13,929][model8_pretrain.py][INFO] Epoch:[0/2](795000/4588595) loss:2.762 lr:0.0000100 epoch_Time:24143.0min: [2024-01-06 05:59:13,930][model8_pretrain.py][INFO] Epoch:[0/2](795000/4588595) loss:2.693 lr:0.0000100 epoch_Time:24143.0min: [2024-01-06 05:59:50,907][model8_pretrain.py][INFO] Epoch:[0/2](795100/4588595) loss:2.414 lr:0.0000100 epoch_Time:24142.0min: [2024-01-06 05:59:50,907][model8_pretrain.py][INFO] Epoch:[0/2](795100/4588595) loss:3.027 lr:0.0000100 epoch_Time:24142.0min: [2024-01-06 05:59:50,907][model8_pretrain.py][INFO] Epoch:[0/2](795100/4588595) loss:2.582 lr:0.0000100 epoch_Time:24142.0min: [2024-01-06 05:59:50,907][model8_pretrain.py][INFO] Epoch:[0/2](795100/4588595) loss:3.083 lr:0.0000100 epoch_Time:24142.0min: [2024-01-06 05:59:50,907][model8_pretrain.py][INFO] Epoch:[0/2](795100/4588595) loss:3.258 lr:0.0000100 epoch_Time:24142.0min: [2024-01-06 05:59:50,907][model8_pretrain.py][INFO] Epoch:[0/2](795100/4588595) loss:3.275 lr:0.0000100 epoch_Time:24142.0min: [2024-01-06 05:59:50,907][model8_pretrain.py][INFO] Epoch:[0/2](795100/4588595) loss:2.924 lr:0.0000100 epoch_Time:24142.0min: [2024-01-06 05:59:50,907][model8_pretrain.py][INFO] Epoch:[0/2](795100/4588595) loss:2.540 lr:0.0000100 epoch_Time:24142.0min: [2024-01-06 06:00:27,869][model8_pretrain.py][INFO] Epoch:[0/2](795200/4588595) loss:2.712 lr:0.0000100 epoch_Time:24141.0min: [2024-01-06 
06:00:27,869][model8_pretrain.py][INFO] Epoch:[0/2](795200/4588595) loss:2.391 lr:0.0000100 epoch_Time:24141.0min: [2024-01-06 06:00:27,869][model8_pretrain.py][INFO] Epoch:[0/2](795200/4588595) loss:2.860 lr:0.0000100 epoch_Time:24141.0min: [2024-01-06 06:00:27,869][model8_pretrain.py][INFO] Epoch:[0/2](795200/4588595) loss:2.559 lr:0.0000100 epoch_Time:24141.0min: [2024-01-06 06:00:27,869][model8_pretrain.py][INFO] Epoch:[0/2](795200/4588595) loss:2.818 lr:0.0000100 epoch_Time:24141.0min: [2024-01-06 06:00:27,869][model8_pretrain.py][INFO] Epoch:[0/2](795200/4588595) loss:2.262 lr:0.0000100 epoch_Time:24141.0min: [2024-01-06 06:00:27,869][model8_pretrain.py][INFO] Epoch:[0/2](795200/4588595) loss:3.012 lr:0.0000100 epoch_Time:24141.0min: [2024-01-06 06:00:27,869][model8_pretrain.py][INFO] Epoch:[0/2](795200/4588595) loss:2.162 lr:0.0000100 epoch_Time:24141.0min: [2024-01-06 06:01:04,836][model8_pretrain.py][INFO] Epoch:[0/2](795300/4588595) loss:2.709 lr:0.0000100 epoch_Time:24140.0min: [2024-01-06 06:01:04,836][model8_pretrain.py][INFO] Epoch:[0/2](795300/4588595) loss:2.650 lr:0.0000100 epoch_Time:24140.0min: [2024-01-06 06:01:04,836][model8_pretrain.py][INFO] Epoch:[0/2](795300/4588595) loss:2.511 lr:0.0000100 epoch_Time:24140.0min: [2024-01-06 06:01:04,836][model8_pretrain.py][INFO] Epoch:[0/2](795300/4588595) loss:2.586 lr:0.0000100 epoch_Time:24140.0min: [2024-01-06 06:01:04,836][model8_pretrain.py][INFO] Epoch:[0/2](795300/4588595) loss:2.962 lr:0.0000100 epoch_Time:24140.0min: [2024-01-06 06:01:04,836][model8_pretrain.py][INFO] Epoch:[0/2](795300/4588595) loss:2.899 lr:0.0000100 epoch_Time:24140.0min: [2024-01-06 06:01:04,836][model8_pretrain.py][INFO] Epoch:[0/2](795300/4588595) loss:2.902 lr:0.0000100 epoch_Time:24140.0min: [2024-01-06 06:01:04,836][model8_pretrain.py][INFO] Epoch:[0/2](795300/4588595) loss:2.668 lr:0.0000100 epoch_Time:24140.0min: [2024-01-06 06:01:41,786][model8_pretrain.py][INFO] Epoch:[0/2](795400/4588595) loss:2.805 lr:0.0000100 
epoch_Time:24140.0min: [2024-01-06 06:01:41,786][model8_pretrain.py][INFO] Epoch:[0/2](795400/4588595) loss:2.965 lr:0.0000100 epoch_Time:24140.0min: [2024-01-06 06:01:41,786][model8_pretrain.py][INFO] Epoch:[0/2](795400/4588595) loss:2.437 lr:0.0000100 epoch_Time:24140.0min: [2024-01-06 06:01:41,786][model8_pretrain.py][INFO] Epoch:[0/2](795400/4588595) loss:3.110 lr:0.0000100 epoch_Time:24140.0min: [2024-01-06 06:01:41,786][model8_pretrain.py][INFO] Epoch:[0/2](795400/4588595) loss:2.948 lr:0.0000100 epoch_Time:24140.0min: [2024-01-06 06:01:41,786][model8_pretrain.py][INFO] Epoch:[0/2](795400/4588595) loss:1.813 lr:0.0000100 epoch_Time:24140.0min: [2024-01-06 06:01:41,786][model8_pretrain.py][INFO] Epoch:[0/2](795400/4588595) loss:2.616 lr:0.0000100 epoch_Time:24140.0min: [2024-01-06 06:01:41,786][model8_pretrain.py][INFO] Epoch:[0/2](795400/4588595) loss:2.309 lr:0.0000100 epoch_Time:24140.0min: [2024-01-06 06:02:24,022][model8_pretrain.py][INFO] Epoch:[0/2](795500/4588595) loss:2.866 lr:0.0000100 epoch_Time:24140.0min: [2024-01-06 06:02:24,022][model8_pretrain.py][INFO] Epoch:[0/2](795500/4588595) loss:2.614 lr:0.0000100 epoch_Time:24140.0min: [2024-01-06 06:02:24,022][model8_pretrain.py][INFO] Epoch:[0/2](795500/4588595) loss:2.992 lr:0.0000100 epoch_Time:24140.0min: [2024-01-06 06:02:24,022][model8_pretrain.py][INFO] Epoch:[0/2](795500/4588595) loss:2.975 lr:0.0000100 epoch_Time:24140.0min: [2024-01-06 06:02:24,022][model8_pretrain.py][INFO] Epoch:[0/2](795500/4588595) loss:2.788 lr:0.0000100 epoch_Time:24140.0min: [2024-01-06 06:02:24,022][model8_pretrain.py][INFO] Epoch:[0/2](795500/4588595) loss:3.135 lr:0.0000100 epoch_Time:24140.0min: [2024-01-06 06:02:24,022][model8_pretrain.py][INFO] Epoch:[0/2](795500/4588595) loss:2.657 lr:0.0000100 epoch_Time:24140.0min: [2024-01-06 06:02:24,022][model8_pretrain.py][INFO] Epoch:[0/2](795500/4588595) loss:3.226 lr:0.0000100 epoch_Time:24140.0min: [2024-01-06 06:03:06,143][model8_pretrain.py][INFO] 
Epoch:[0/2](795600/4588595) loss:3.280 lr:0.0000100 epoch_Time:24139.0min: [2024-01-06 06:03:06,143][model8_pretrain.py][INFO] Epoch:[0/2](795600/4588595) loss:2.374 lr:0.0000100 epoch_Time:24139.0min: [2024-01-06 06:03:06,143][model8_pretrain.py][INFO] Epoch:[0/2](795600/4588595) loss:3.258 lr:0.0000100 epoch_Time:24139.0min: [2024-01-06 06:03:06,143][model8_pretrain.py][INFO] Epoch:[0/2](795600/4588595) loss:2.792 lr:0.0000100 epoch_Time:24139.0min: [2024-01-06 06:03:06,143][model8_pretrain.py][INFO] Epoch:[0/2](795600/4588595) loss:2.137 lr:0.0000100 epoch_Time:24139.0min: [2024-01-06 06:03:06,143][model8_pretrain.py][INFO] Epoch:[0/2](795600/4588595) loss:3.061 lr:0.0000100 epoch_Time:24139.0min: [2024-01-06 06:03:06,143][model8_pretrain.py][INFO] Epoch:[0/2](795600/4588595) loss:2.691 lr:0.0000100 epoch_Time:24139.0min: [2024-01-06 06:03:06,143][model8_pretrain.py][INFO] Epoch:[0/2](795600/4588595) loss:2.950 lr:0.0000100 epoch_Time:24139.0min: [2024-01-06 06:03:43,078][model8_pretrain.py][INFO] Epoch:[0/2](795700/4588595) loss:3.404 lr:0.0000100 epoch_Time:24139.0min: [2024-01-06 06:03:43,078][model8_pretrain.py][INFO] Epoch:[0/2](795700/4588595) loss:2.607 lr:0.0000100 epoch_Time:24139.0min: [2024-01-06 06:03:43,078][model8_pretrain.py][INFO] Epoch:[0/2](795700/4588595) loss:2.634 lr:0.0000100 epoch_Time:24139.0min: [2024-01-06 06:03:43,078][model8_pretrain.py][INFO] Epoch:[0/2](795700/4588595) loss:2.786 lr:0.0000100 epoch_Time:24139.0min: [2024-01-06 06:03:43,079][model8_pretrain.py][INFO] Epoch:[0/2](795700/4588595) loss:2.374 lr:0.0000100 epoch_Time:24139.0min: [2024-01-06 06:03:43,079][model8_pretrain.py][INFO] Epoch:[0/2](795700/4588595) loss:2.984 lr:0.0000100 epoch_Time:24139.0min: [2024-01-06 06:03:43,079][model8_pretrain.py][INFO] Epoch:[0/2](795700/4588595) loss:2.788 lr:0.0000100 epoch_Time:24139.0min: [2024-01-06 06:03:43,079][model8_pretrain.py][INFO] Epoch:[0/2](795700/4588595) loss:3.045 lr:0.0000100 epoch_Time:24139.0min: [2024-01-06 
06:04:20,019][model8_pretrain.py][INFO] Epoch:[0/2](795800/4588595) loss:2.711 lr:0.0000100 epoch_Time:24138.0min: [2024-01-06 06:04:20,019][model8_pretrain.py][INFO] Epoch:[0/2](795800/4588595) loss:2.626 lr:0.0000100 epoch_Time:24138.0min: [2024-01-06 06:04:20,019][model8_pretrain.py][INFO] Epoch:[0/2](795800/4588595) loss:2.166 lr:0.0000100 epoch_Time:24138.0min: [2024-01-06 06:04:20,019][model8_pretrain.py][INFO] Epoch:[0/2](795800/4588595) loss:1.971 lr:0.0000100 epoch_Time:24138.0min: [2024-01-06 06:04:20,019][model8_pretrain.py][INFO] Epoch:[0/2](795800/4588595) loss:2.802 lr:0.0000100 epoch_Time:24138.0min: [2024-01-06 06:04:20,019][model8_pretrain.py][INFO] Epoch:[0/2](795800/4588595) loss:2.903 lr:0.0000100 epoch_Time:24138.0min: [2024-01-06 06:04:20,019][model8_pretrain.py][INFO] Epoch:[0/2](795800/4588595) loss:2.959 lr:0.0000100 epoch_Time:24138.0min: [2024-01-06 06:04:20,019][model8_pretrain.py][INFO] Epoch:[0/2](795800/4588595) loss:2.839 lr:0.0000100 epoch_Time:24138.0min: [2024-01-06 06:04:56,998][model8_pretrain.py][INFO] Epoch:[0/2](795900/4588595) loss:2.642 lr:0.0000100 epoch_Time:24137.0min: [2024-01-06 06:04:56,998][model8_pretrain.py][INFO] Epoch:[0/2](795900/4588595) loss:2.509 lr:0.0000100 epoch_Time:24137.0min: [2024-01-06 06:04:56,998][model8_pretrain.py][INFO] Epoch:[0/2](795900/4588595) loss:1.883 lr:0.0000100 epoch_Time:24137.0min: [2024-01-06 06:04:56,998][model8_pretrain.py][INFO] Epoch:[0/2](795900/4588595) loss:3.203 lr:0.0000100 epoch_Time:24137.0min: [2024-01-06 06:04:56,998][model8_pretrain.py][INFO] Epoch:[0/2](795900/4588595) loss:2.313 lr:0.0000100 epoch_Time:24137.0min: [2024-01-06 06:04:56,998][model8_pretrain.py][INFO] Epoch:[0/2](795900/4588595) loss:2.912 lr:0.0000100 epoch_Time:24137.0min: [2024-01-06 06:04:56,998][model8_pretrain.py][INFO] Epoch:[0/2](795900/4588595) loss:2.986 lr:0.0000100 epoch_Time:24137.0min: [2024-01-06 06:04:56,999][model8_pretrain.py][INFO] Epoch:[0/2](795900/4588595) loss:3.166 lr:0.0000100 
epoch_Time:24137.0min: [2024-01-06 06:05:34,004][model8_pretrain.py][INFO] Epoch:[0/2](796000/4588595) loss:2.385 lr:0.0000100 epoch_Time:24136.0min: [2024-01-06 06:05:34,004][model8_pretrain.py][INFO] Epoch:[0/2](796000/4588595) loss:3.053 lr:0.0000100 epoch_Time:24136.0min: [2024-01-06 06:05:34,004][model8_pretrain.py][INFO] Epoch:[0/2](796000/4588595) loss:2.772 lr:0.0000100 epoch_Time:24136.0min: [2024-01-06 06:05:34,004][model8_pretrain.py][INFO] Epoch:[0/2](796000/4588595) loss:2.364 lr:0.0000100 epoch_Time:24136.0min: [2024-01-06 06:05:34,004][model8_pretrain.py][INFO] Epoch:[0/2](796000/4588595) loss:3.060 lr:0.0000100 epoch_Time:24136.0min: [2024-01-06 06:05:34,004][model8_pretrain.py][INFO] Epoch:[0/2](796000/4588595) loss:2.417 lr:0.0000100 epoch_Time:24136.0min: [2024-01-06 06:05:34,005][model8_pretrain.py][INFO] Epoch:[0/2](796000/4588595) loss:3.006 lr:0.0000100 epoch_Time:24136.0min: [2024-01-06 06:05:34,018][model8_pretrain.py][INFO] Epoch:[0/2](796000/4588595) loss:2.812 lr:0.0000100 epoch_Time:24136.0min: [2024-01-06 06:06:11,014][model8_pretrain.py][INFO] Epoch:[0/2](796100/4588595) loss:2.557 lr:0.0000100 epoch_Time:24135.0min: [2024-01-06 06:06:11,014][model8_pretrain.py][INFO] Epoch:[0/2](796100/4588595) loss:2.655 lr:0.0000100 epoch_Time:24135.0min: [2024-01-06 06:06:11,014][model8_pretrain.py][INFO] Epoch:[0/2](796100/4588595) loss:3.488 lr:0.0000100 epoch_Time:24135.0min: [2024-01-06 06:06:11,014][model8_pretrain.py][INFO] Epoch:[0/2](796100/4588595) loss:2.343 lr:0.0000100 epoch_Time:24135.0min: [2024-01-06 06:06:11,014][model8_pretrain.py][INFO] Epoch:[0/2](796100/4588595) loss:2.875 lr:0.0000100 epoch_Time:24135.0min: [2024-01-06 06:06:11,014][model8_pretrain.py][INFO] Epoch:[0/2](796100/4588595) loss:3.215 lr:0.0000100 epoch_Time:24135.0min: [2024-01-06 06:06:11,014][model8_pretrain.py][INFO] Epoch:[0/2](796100/4588595) loss:2.721 lr:0.0000100 epoch_Time:24135.0min: [2024-01-06 06:06:11,014][model8_pretrain.py][INFO] 
Epoch:[0/2](796100/4588595) loss:2.764 lr:0.0000100 epoch_Time:24135.0min: [2024-01-06 06:06:47,961][model8_pretrain.py][INFO] Epoch:[0/2](796200/4588595) loss:2.312 lr:0.0000100 epoch_Time:24134.0min: [2024-01-06 06:06:47,961][model8_pretrain.py][INFO] Epoch:[0/2](796200/4588595) loss:2.331 lr:0.0000100 epoch_Time:24134.0min: [2024-01-06 06:06:47,961][model8_pretrain.py][INFO] Epoch:[0/2](796200/4588595) loss:2.537 lr:0.0000100 epoch_Time:24134.0min: [2024-01-06 06:06:47,961][model8_pretrain.py][INFO] Epoch:[0/2](796200/4588595) loss:2.558 lr:0.0000100 epoch_Time:24134.0min: [2024-01-06 06:06:47,961][model8_pretrain.py][INFO] Epoch:[0/2](796200/4588595) loss:2.788 lr:0.0000100 epoch_Time:24134.0min: [2024-01-06 06:06:47,961][model8_pretrain.py][INFO] Epoch:[0/2](796200/4588595) loss:2.956 lr:0.0000100 epoch_Time:24134.0min: [2024-01-06 06:06:47,961][model8_pretrain.py][INFO] Epoch:[0/2](796200/4588595) loss:2.544 lr:0.0000100 epoch_Time:24134.0min: [2024-01-06 06:06:47,961][model8_pretrain.py][INFO] Epoch:[0/2](796200/4588595) loss:2.933 lr:0.0000100 epoch_Time:24134.0min: [2024-01-06 06:07:30,142][model8_pretrain.py][INFO] Epoch:[0/2](796300/4588595) loss:2.543 lr:0.0000100 epoch_Time:24135.0min: [2024-01-06 06:07:30,142][model8_pretrain.py][INFO] Epoch:[0/2](796300/4588595) loss:2.918 lr:0.0000100 epoch_Time:24135.0min: [2024-01-06 06:07:30,142][model8_pretrain.py][INFO] Epoch:[0/2](796300/4588595) loss:3.046 lr:0.0000100 epoch_Time:24135.0min: [2024-01-06 06:07:30,142][model8_pretrain.py][INFO] Epoch:[0/2](796300/4588595) loss:2.983 lr:0.0000100 epoch_Time:24135.0min: [2024-01-06 06:07:30,142][model8_pretrain.py][INFO] Epoch:[0/2](796300/4588595) loss:2.936 lr:0.0000100 epoch_Time:24135.0min: [2024-01-06 06:07:30,142][model8_pretrain.py][INFO] Epoch:[0/2](796300/4588595) loss:2.647 lr:0.0000100 epoch_Time:24135.0min: [2024-01-06 06:07:30,142][model8_pretrain.py][INFO] Epoch:[0/2](796300/4588595) loss:2.843 lr:0.0000100 epoch_Time:24135.0min: [2024-01-06 
06:07:30,142][model8_pretrain.py][INFO] Epoch:[0/2](796300/4588595) loss:2.850 lr:0.0000100 epoch_Time:24135.0min: [2024-01-06 06:08:12,274][model8_pretrain.py][INFO] Epoch:[0/2](796400/4588595) loss:2.842 lr:0.0000100 epoch_Time:24134.0min: [2024-01-06 06:08:12,274][model8_pretrain.py][INFO] Epoch:[0/2](796400/4588595) loss:3.143 lr:0.0000100 epoch_Time:24134.0min: [2024-01-06 06:08:12,274][model8_pretrain.py][INFO] Epoch:[0/2](796400/4588595) loss:2.524 lr:0.0000100 epoch_Time:24134.0min: [2024-01-06 06:08:12,274][model8_pretrain.py][INFO] Epoch:[0/2](796400/4588595) loss:2.995 lr:0.0000100 epoch_Time:24134.0min: [2024-01-06 06:08:12,274][model8_pretrain.py][INFO] Epoch:[0/2](796400/4588595) loss:3.000 lr:0.0000100 epoch_Time:24134.0min: [2024-01-06 06:08:12,274][model8_pretrain.py][INFO] Epoch:[0/2](796400/4588595) loss:3.279 lr:0.0000100 epoch_Time:24134.0min: [2024-01-06 06:08:12,274][model8_pretrain.py][INFO] Epoch:[0/2](796400/4588595) loss:2.669 lr:0.0000100 epoch_Time:24134.0min: [2024-01-06 06:08:12,274][model8_pretrain.py][INFO] Epoch:[0/2](796400/4588595) loss:2.854 lr:0.0000100 epoch_Time:24134.0min: [2024-01-06 06:08:49,265][model8_pretrain.py][INFO] Epoch:[0/2](796500/4588595) loss:3.026 lr:0.0000100 epoch_Time:24133.0min: [2024-01-06 06:08:49,265][model8_pretrain.py][INFO] Epoch:[0/2](796500/4588595) loss:2.831 lr:0.0000100 epoch_Time:24133.0min: [2024-01-06 06:08:49,265][model8_pretrain.py][INFO] Epoch:[0/2](796500/4588595) loss:3.076 lr:0.0000100 epoch_Time:24133.0min: [2024-01-06 06:08:49,265][model8_pretrain.py][INFO] Epoch:[0/2](796500/4588595) loss:2.684 lr:0.0000100 epoch_Time:24133.0min: [2024-01-06 06:08:49,265][model8_pretrain.py][INFO] Epoch:[0/2](796500/4588595) loss:2.633 lr:0.0000100 epoch_Time:24133.0min: [2024-01-06 06:08:49,265][model8_pretrain.py][INFO] Epoch:[0/2](796500/4588595) loss:2.161 lr:0.0000100 epoch_Time:24133.0min: [2024-01-06 06:08:49,265][model8_pretrain.py][INFO] Epoch:[0/2](796500/4588595) loss:2.792 lr:0.0000100 
epoch_Time:24133.0min: [2024-01-06 06:08:49,266][model8_pretrain.py][INFO] Epoch:[0/2](796500/4588595) loss:1.922 lr:0.0000100 epoch_Time:24133.0min: [2024-01-06 06:09:26,234][model8_pretrain.py][INFO] Epoch:[0/2](796600/4588595) loss:2.244 lr:0.0000100 epoch_Time:24133.0min: [2024-01-06 06:09:26,234][model8_pretrain.py][INFO] Epoch:[0/2](796600/4588595) loss:2.961 lr:0.0000100 epoch_Time:24133.0min: [2024-01-06 06:09:26,234][model8_pretrain.py][INFO] Epoch:[0/2](796600/4588595) loss:3.003 lr:0.0000100 epoch_Time:24133.0min: [2024-01-06 06:09:26,234][model8_pretrain.py][INFO] Epoch:[0/2](796600/4588595) loss:2.768 lr:0.0000100 epoch_Time:24133.0min: [2024-01-06 06:09:26,234][model8_pretrain.py][INFO] Epoch:[0/2](796600/4588595) loss:2.972 lr:0.0000100 epoch_Time:24133.0min: [2024-01-06 06:09:26,234][model8_pretrain.py][INFO] Epoch:[0/2](796600/4588595) loss:1.859 lr:0.0000100 epoch_Time:24133.0min: [2024-01-06 06:09:26,235][model8_pretrain.py][INFO] Epoch:[0/2](796600/4588595) loss:3.390 lr:0.0000100 epoch_Time:24133.0min: [2024-01-06 06:09:26,235][model8_pretrain.py][INFO] Epoch:[0/2](796600/4588595) loss:2.887 lr:0.0000100 epoch_Time:24133.0min: [2024-01-06 06:10:03,165][model8_pretrain.py][INFO] Epoch:[0/2](796700/4588595) loss:2.183 lr:0.0000100 epoch_Time:24132.0min: [2024-01-06 06:10:03,165][model8_pretrain.py][INFO] Epoch:[0/2](796700/4588595) loss:2.520 lr:0.0000100 epoch_Time:24132.0min: [2024-01-06 06:10:03,166][model8_pretrain.py][INFO] Epoch:[0/2](796700/4588595) loss:2.789 lr:0.0000100 epoch_Time:24132.0min: [2024-01-06 06:10:03,166][model8_pretrain.py][INFO] Epoch:[0/2](796700/4588595) loss:3.112 lr:0.0000100 epoch_Time:24132.0min: [2024-01-06 06:10:03,166][model8_pretrain.py][INFO] Epoch:[0/2](796700/4588595) loss:2.572 lr:0.0000100 epoch_Time:24132.0min: [2024-01-06 06:10:03,166][model8_pretrain.py][INFO] Epoch:[0/2](796700/4588595) loss:2.651 lr:0.0000100 epoch_Time:24132.0min: [2024-01-06 06:10:03,166][model8_pretrain.py][INFO] 
Epoch:[0/2](796700/4588595) loss:2.849 lr:0.0000100 epoch_Time:24132.0min: [2024-01-06 06:10:03,166][model8_pretrain.py][INFO] Epoch:[0/2](796700/4588595) loss:2.474 lr:0.0000100 epoch_Time:24132.0min: [2024-01-06 06:10:40,107][model8_pretrain.py][INFO] Epoch:[0/2](796800/4588595) loss:2.999 lr:0.0000100 epoch_Time:24132.0min: [2024-01-06 06:10:40,107][model8_pretrain.py][INFO] Epoch:[0/2](796800/4588595) loss:3.128 lr:0.0000100 epoch_Time:24132.0min: [2024-01-06 06:10:40,107][model8_pretrain.py][INFO] Epoch:[0/2](796800/4588595) loss:3.028 lr:0.0000100 epoch_Time:24132.0min: [2024-01-06 06:10:40,107][model8_pretrain.py][INFO] Epoch:[0/2](796800/4588595) loss:2.334 lr:0.0000100 epoch_Time:24132.0min: [2024-01-06 06:10:40,107][model8_pretrain.py][INFO] Epoch:[0/2](796800/4588595) loss:2.229 lr:0.0000100 epoch_Time:24132.0min: [2024-01-06 06:10:40,107][model8_pretrain.py][INFO] Epoch:[0/2](796800/4588595) loss:2.147 lr:0.0000100 epoch_Time:24132.0min: [2024-01-06 06:10:40,107][model8_pretrain.py][INFO] Epoch:[0/2](796800/4588595) loss:1.849 lr:0.0000100 epoch_Time:24132.0min: [2024-01-06 06:10:40,107][model8_pretrain.py][INFO] Epoch:[0/2](796800/4588595) loss:3.183 lr:0.0000100 epoch_Time:24132.0min: [2024-01-06 06:11:17,040][model8_pretrain.py][INFO] Epoch:[0/2](796900/4588595) loss:2.143 lr:0.0000100 epoch_Time:24130.0min: [2024-01-06 06:11:17,040][model8_pretrain.py][INFO] Epoch:[0/2](796900/4588595) loss:1.982 lr:0.0000100 epoch_Time:24130.0min: [2024-01-06 06:11:17,040][model8_pretrain.py][INFO] Epoch:[0/2](796900/4588595) loss:3.292 lr:0.0000100 epoch_Time:24130.0min: [2024-01-06 06:11:17,040][model8_pretrain.py][INFO] Epoch:[0/2](796900/4588595) loss:2.935 lr:0.0000100 epoch_Time:24130.0min: [2024-01-06 06:11:17,040][model8_pretrain.py][INFO] Epoch:[0/2](796900/4588595) loss:2.966 lr:0.0000100 epoch_Time:24130.0min: [2024-01-06 06:11:17,040][model8_pretrain.py][INFO] Epoch:[0/2](796900/4588595) loss:2.820 lr:0.0000100 epoch_Time:24130.0min: [2024-01-06 
06:11:17,040][model8_pretrain.py][INFO] Epoch:[0/2](796900/4588595) loss:1.996 lr:0.0000100 epoch_Time:24130.0min: [2024-01-06 06:11:17,041][model8_pretrain.py][INFO] Epoch:[0/2](796900/4588595) loss:2.727 lr:0.0000100 epoch_Time:24130.0min: [2024-01-06 06:11:53,979][model8_pretrain.py][INFO] Epoch:[0/2](797000/4588595) loss:3.163 lr:0.0000100 epoch_Time:24129.0min: [2024-01-06 06:11:53,979][model8_pretrain.py][INFO] Epoch:[0/2](797000/4588595) loss:3.168 lr:0.0000100 epoch_Time:24129.0min: [2024-01-06 06:11:53,979][model8_pretrain.py][INFO] Epoch:[0/2](797000/4588595) loss:3.176 lr:0.0000100 epoch_Time:24129.0min: [2024-01-06 06:11:53,979][model8_pretrain.py][INFO] Epoch:[0/2](797000/4588595) loss:3.273 lr:0.0000100 epoch_Time:24129.0min: [2024-01-06 06:11:53,979][model8_pretrain.py][INFO] Epoch:[0/2](797000/4588595) loss:2.714 lr:0.0000100 epoch_Time:24129.0min: [2024-01-06 06:11:53,979][model8_pretrain.py][INFO] Epoch:[0/2](797000/4588595) loss:3.472 lr:0.0000100 epoch_Time:24129.0min: [2024-01-06 06:11:53,979][model8_pretrain.py][INFO] Epoch:[0/2](797000/4588595) loss:3.276 lr:0.0000100 epoch_Time:24129.0min: [2024-01-06 06:11:53,979][model8_pretrain.py][INFO] Epoch:[0/2](797000/4588595) loss:3.127 lr:0.0000100 epoch_Time:24129.0min: [2024-01-06 06:12:36,198][model8_pretrain.py][INFO] Epoch:[0/2](797100/4588595) loss:2.833 lr:0.0000100 epoch_Time:24130.0min: [2024-01-06 06:12:36,198][model8_pretrain.py][INFO] Epoch:[0/2](797100/4588595) loss:3.407 lr:0.0000100 epoch_Time:24130.0min: [2024-01-06 06:12:36,198][model8_pretrain.py][INFO] Epoch:[0/2](797100/4588595) loss:3.080 lr:0.0000100 epoch_Time:24130.0min: [2024-01-06 06:12:36,198][model8_pretrain.py][INFO] Epoch:[0/2](797100/4588595) loss:2.787 lr:0.0000100 epoch_Time:24130.0min: [2024-01-06 06:12:36,198][model8_pretrain.py][INFO] Epoch:[0/2](797100/4588595) loss:2.724 lr:0.0000100 epoch_Time:24130.0min: [2024-01-06 06:12:36,202][model8_pretrain.py][INFO] Epoch:[0/2](797100/4588595) loss:2.650 lr:0.0000100 
epoch_Time:24130.0min: [2024-01-06 06:12:36,202][model8_pretrain.py][INFO] Epoch:[0/2](797100/4588595) loss:2.854 lr:0.0000100 epoch_Time:24130.0min: [2024-01-06 06:12:36,203][model8_pretrain.py][INFO] Epoch:[0/2](797100/4588595) loss:2.514 lr:0.0000100 epoch_Time:24130.0min: [2024-01-06 06:13:18,307][model8_pretrain.py][INFO] Epoch:[0/2](797200/4588595) loss:2.222 lr:0.0000100 epoch_Time:24129.0min: [2024-01-06 06:13:18,307][model8_pretrain.py][INFO] Epoch:[0/2](797200/4588595) loss:3.049 lr:0.0000100 epoch_Time:24129.0min: [2024-01-06 06:13:18,307][model8_pretrain.py][INFO] Epoch:[0/2](797200/4588595) loss:2.495 lr:0.0000100 epoch_Time:24129.0min: [2024-01-06 06:13:18,307][model8_pretrain.py][INFO] Epoch:[0/2](797200/4588595) loss:2.913 lr:0.0000100 epoch_Time:24129.0min: [2024-01-06 06:13:18,308][model8_pretrain.py][INFO] Epoch:[0/2](797200/4588595) loss:2.966 lr:0.0000100 epoch_Time:24129.0min: [2024-01-06 06:13:18,307][model8_pretrain.py][INFO] Epoch:[0/2](797200/4588595) loss:2.288 lr:0.0000100 epoch_Time:24129.0min: [2024-01-06 06:13:18,307][model8_pretrain.py][INFO] Epoch:[0/2](797200/4588595) loss:2.980 lr:0.0000100 epoch_Time:24129.0min: [2024-01-06 06:13:18,308][model8_pretrain.py][INFO] Epoch:[0/2](797200/4588595) loss:2.637 lr:0.0000100 epoch_Time:24129.0min: [2024-01-06 06:13:55,239][model8_pretrain.py][INFO] Epoch:[0/2](797300/4588595) loss:2.911 lr:0.0000100 epoch_Time:24128.0min: [2024-01-06 06:13:55,239][model8_pretrain.py][INFO] Epoch:[0/2](797300/4588595) loss:2.924 lr:0.0000100 epoch_Time:24128.0min: [2024-01-06 06:13:55,239][model8_pretrain.py][INFO] Epoch:[0/2](797300/4588595) loss:2.674 lr:0.0000100 epoch_Time:24128.0min: [2024-01-06 06:13:55,239][model8_pretrain.py][INFO] Epoch:[0/2](797300/4588595) loss:2.835 lr:0.0000100 epoch_Time:24128.0min: [2024-01-06 06:13:55,239][model8_pretrain.py][INFO] Epoch:[0/2](797300/4588595) loss:2.783 lr:0.0000100 epoch_Time:24128.0min: [2024-01-06 06:13:55,239][model8_pretrain.py][INFO] 
Epoch:[0/2](797300/4588595) loss:2.608 lr:0.0000100 epoch_Time:24128.0min: [2024-01-06 06:13:55,239][model8_pretrain.py][INFO] Epoch:[0/2](797300/4588595) loss:2.973 lr:0.0000100 epoch_Time:24128.0min: [2024-01-06 06:13:55,239][model8_pretrain.py][INFO] Epoch:[0/2](797300/4588595) loss:2.486 lr:0.0000100 epoch_Time:24128.0min: [2024-01-06 06:14:32,187][model8_pretrain.py][INFO] Epoch:[0/2](797400/4588595) loss:3.167 lr:0.0000100 epoch_Time:24128.0min: [2024-01-06 06:14:32,187][model8_pretrain.py][INFO] Epoch:[0/2](797400/4588595) loss:2.946 lr:0.0000100 epoch_Time:24128.0min: [2024-01-06 06:14:32,187][model8_pretrain.py][INFO] Epoch:[0/2](797400/4588595) loss:2.978 lr:0.0000100 epoch_Time:24128.0min: [2024-01-06 06:14:32,187][model8_pretrain.py][INFO] Epoch:[0/2](797400/4588595) loss:2.698 lr:0.0000100 epoch_Time:24128.0min: [2024-01-06 06:14:32,187][model8_pretrain.py][INFO] Epoch:[0/2](797400/4588595) loss:2.980 lr:0.0000100 epoch_Time:24128.0min: [2024-01-06 06:14:32,187][model8_pretrain.py][INFO] Epoch:[0/2](797400/4588595) loss:2.621 lr:0.0000100 epoch_Time:24128.0min: [2024-01-06 06:14:32,187][model8_pretrain.py][INFO] Epoch:[0/2](797400/4588595) loss:2.972 lr:0.0000100 epoch_Time:24128.0min: [2024-01-06 06:14:32,187][model8_pretrain.py][INFO] Epoch:[0/2](797400/4588595) loss:2.523 lr:0.0000100 epoch_Time:24128.0min: [2024-01-06 06:15:09,137][model8_pretrain.py][INFO] Epoch:[0/2](797500/4588595) loss:2.980 lr:0.0000100 epoch_Time:24127.0min: [2024-01-06 06:15:09,137][model8_pretrain.py][INFO] Epoch:[0/2](797500/4588595) loss:2.754 lr:0.0000100 epoch_Time:24127.0min: [2024-01-06 06:15:09,137][model8_pretrain.py][INFO] Epoch:[0/2](797500/4588595) loss:2.766 lr:0.0000100 epoch_Time:24127.0min: [2024-01-06 06:15:09,137][model8_pretrain.py][INFO] Epoch:[0/2](797500/4588595) loss:2.739 lr:0.0000100 epoch_Time:24127.0min: [2024-01-06 06:15:09,137][model8_pretrain.py][INFO] Epoch:[0/2](797500/4588595) loss:3.192 lr:0.0000100 epoch_Time:24127.0min: [2024-01-06 
06:15:09,137][model8_pretrain.py][INFO] Epoch:[0/2](797500/4588595) loss:1.947 lr:0.0000100 epoch_Time:24127.0min: [2024-01-06 06:15:09,137][model8_pretrain.py][INFO] Epoch:[0/2](797500/4588595) loss:3.098 lr:0.0000100 epoch_Time:24127.0min: [2024-01-06 06:15:09,137][model8_pretrain.py][INFO] Epoch:[0/2](797500/4588595) loss:2.959 lr:0.0000100 epoch_Time:24127.0min: [2024-01-06 06:15:46,080][model8_pretrain.py][INFO] Epoch:[0/2](797600/4588595) loss:2.781 lr:0.0000100 epoch_Time:24127.0min: [2024-01-06 06:15:46,080][model8_pretrain.py][INFO] Epoch:[0/2](797600/4588595) loss:3.011 lr:0.0000100 epoch_Time:24127.0min: [2024-01-06 06:15:46,080][model8_pretrain.py][INFO] Epoch:[0/2](797600/4588595) loss:2.321 lr:0.0000100 epoch_Time:24127.0min: [2024-01-06 06:15:46,080][model8_pretrain.py][INFO] Epoch:[0/2](797600/4588595) loss:2.974 lr:0.0000100 epoch_Time:24127.0min: [2024-01-06 06:15:46,080][model8_pretrain.py][INFO] Epoch:[0/2](797600/4588595) loss:2.905 lr:0.0000100 epoch_Time:24127.0min: [2024-01-06 06:15:46,080][model8_pretrain.py][INFO] Epoch:[0/2](797600/4588595) loss:2.822 lr:0.0000100 epoch_Time:24127.0min: [2024-01-06 06:15:46,080][model8_pretrain.py][INFO] Epoch:[0/2](797600/4588595) loss:3.537 lr:0.0000100 epoch_Time:24127.0min: [2024-01-06 06:15:46,080][model8_pretrain.py][INFO] Epoch:[0/2](797600/4588595) loss:2.678 lr:0.0000100 epoch_Time:24127.0min: [2024-01-06 06:16:22,989][model8_pretrain.py][INFO] Epoch:[0/2](797700/4588595) loss:2.869 lr:0.0000100 epoch_Time:24125.0min: [2024-01-06 06:16:22,989][model8_pretrain.py][INFO] Epoch:[0/2](797700/4588595) loss:2.913 lr:0.0000100 epoch_Time:24125.0min: [2024-01-06 06:16:22,989][model8_pretrain.py][INFO] Epoch:[0/2](797700/4588595) loss:2.037 lr:0.0000100 epoch_Time:24125.0min: [2024-01-06 06:16:22,989][model8_pretrain.py][INFO] Epoch:[0/2](797700/4588595) loss:2.645 lr:0.0000100 epoch_Time:24125.0min: [2024-01-06 06:16:22,989][model8_pretrain.py][INFO] Epoch:[0/2](797700/4588595) loss:3.097 lr:0.0000100 
epoch_Time:24125.0min: [2024-01-06 06:16:22,989][model8_pretrain.py][INFO] Epoch:[0/2](797700/4588595) loss:2.572 lr:0.0000100 epoch_Time:24125.0min: [2024-01-06 06:16:22,989][model8_pretrain.py][INFO] Epoch:[0/2](797700/4588595) loss:1.920 lr:0.0000100 epoch_Time:24125.0min: [2024-01-06 06:16:22,990][model8_pretrain.py][INFO] Epoch:[0/2](797700/4588595) loss:2.690 lr:0.0000100 epoch_Time:24125.0min: [2024-01-06 06:16:59,928][model8_pretrain.py][INFO] Epoch:[0/2](797800/4588595) loss:2.514 lr:0.0000100 epoch_Time:24124.0min: [2024-01-06 06:16:59,928][model8_pretrain.py][INFO] Epoch:[0/2](797800/4588595) loss:3.005 lr:0.0000100 epoch_Time:24124.0min: [2024-01-06 06:16:59,928][model8_pretrain.py][INFO] Epoch:[0/2](797800/4588595) loss:3.158 lr:0.0000100 epoch_Time:24124.0min: [2024-01-06 06:16:59,928][model8_pretrain.py][INFO] Epoch:[0/2](797800/4588595) loss:3.008 lr:0.0000100 epoch_Time:24124.0min: [2024-01-06 06:16:59,928][model8_pretrain.py][INFO] Epoch:[0/2](797800/4588595) loss:3.414 lr:0.0000100 epoch_Time:24124.0min: [2024-01-06 06:16:59,928][model8_pretrain.py][INFO] Epoch:[0/2](797800/4588595) loss:2.838 lr:0.0000100 epoch_Time:24124.0min: [2024-01-06 06:16:59,929][model8_pretrain.py][INFO] Epoch:[0/2](797800/4588595) loss:2.353 lr:0.0000100 epoch_Time:24124.0min: [2024-01-06 06:16:59,929][model8_pretrain.py][INFO] Epoch:[0/2](797800/4588595) loss:3.144 lr:0.0000100 epoch_Time:24124.0min: [2024-01-06 06:17:40,348][model8_pretrain.py][INFO] Epoch:[0/2](797900/4588595) loss:2.970 lr:0.0000100 epoch_Time:24125.0min: [2024-01-06 06:17:40,348][model8_pretrain.py][INFO] Epoch:[0/2](797900/4588595) loss:3.315 lr:0.0000100 epoch_Time:24125.0min: [2024-01-06 06:17:40,348][model8_pretrain.py][INFO] Epoch:[0/2](797900/4588595) loss:1.883 lr:0.0000100 epoch_Time:24125.0min: [2024-01-06 06:17:40,348][model8_pretrain.py][INFO] Epoch:[0/2](797900/4588595) loss:3.431 lr:0.0000100 epoch_Time:24125.0min: [2024-01-06 06:17:40,348][model8_pretrain.py][INFO] 
Epoch:[0/2](797900/4588595) loss:2.753 lr:0.0000100 epoch_Time:24125.0min: [2024-01-06 06:17:40,348][model8_pretrain.py][INFO] Epoch:[0/2](797900/4588595) loss:3.020 lr:0.0000100 epoch_Time:24125.0min: [2024-01-06 06:17:40,348][model8_pretrain.py][INFO] Epoch:[0/2](797900/4588595) loss:3.114 lr:0.0000100 epoch_Time:24125.0min: [2024-01-06 06:17:40,349][model8_pretrain.py][INFO] Epoch:[0/2](797900/4588595) loss:2.994 lr:0.0000100 epoch_Time:24125.0min: [2024-01-06 06:18:24,195][model8_pretrain.py][INFO] Epoch:[0/2](798000/4588595) loss:3.348 lr:0.0000100 epoch_Time:24124.0min: [2024-01-06 06:18:24,195][model8_pretrain.py][INFO] Epoch:[0/2](798000/4588595) loss:2.690 lr:0.0000100 epoch_Time:24124.0min: [2024-01-06 06:18:24,195][model8_pretrain.py][INFO] Epoch:[0/2](798000/4588595) loss:2.371 lr:0.0000100 epoch_Time:24124.0min: [2024-01-06 06:18:24,195][model8_pretrain.py][INFO] Epoch:[0/2](798000/4588595) loss:2.584 lr:0.0000100 epoch_Time:24124.0min: [2024-01-06 06:18:24,195][model8_pretrain.py][INFO] Epoch:[0/2](798000/4588595) loss:3.545 lr:0.0000100 epoch_Time:24124.0min: [2024-01-06 06:18:24,195][model8_pretrain.py][INFO] Epoch:[0/2](798000/4588595) loss:2.640 lr:0.0000100 epoch_Time:24124.0min: [2024-01-06 06:18:24,195][model8_pretrain.py][INFO] Epoch:[0/2](798000/4588595) loss:2.619 lr:0.0000100 epoch_Time:24124.0min: [2024-01-06 06:18:24,195][model8_pretrain.py][INFO] Epoch:[0/2](798000/4588595) loss:3.040 lr:0.0000100 epoch_Time:24124.0min: [2024-01-06 06:19:01,124][model8_pretrain.py][INFO] Epoch:[0/2](798100/4588595) loss:3.120 lr:0.0000100 epoch_Time:24123.0min: [2024-01-06 06:19:01,124][model8_pretrain.py][INFO] Epoch:[0/2](798100/4588595) loss:2.558 lr:0.0000100 epoch_Time:24123.0min: [2024-01-06 06:19:01,124][model8_pretrain.py][INFO] Epoch:[0/2](798100/4588595) loss:2.952 lr:0.0000100 epoch_Time:24123.0min: [2024-01-06 06:19:01,124][model8_pretrain.py][INFO] Epoch:[0/2](798100/4588595) loss:3.002 lr:0.0000100 epoch_Time:24123.0min: [2024-01-06 
06:19:01,124][model8_pretrain.py][INFO] Epoch:[0/2](798100/4588595) loss:3.346 lr:0.0000100 epoch_Time:24123.0min: [2024-01-06 06:19:01,124][model8_pretrain.py][INFO] Epoch:[0/2](798100/4588595) loss:3.010 lr:0.0000100 epoch_Time:24123.0min: [2024-01-06 06:19:01,124][model8_pretrain.py][INFO] Epoch:[0/2](798100/4588595) loss:2.909 lr:0.0000100 epoch_Time:24123.0min: [2024-01-06 06:19:01,124][model8_pretrain.py][INFO] Epoch:[0/2](798100/4588595) loss:2.682 lr:0.0000100 epoch_Time:24123.0min: [2024-01-06 06:19:38,071][model8_pretrain.py][INFO] Epoch:[0/2](798200/4588595) loss:3.169 lr:0.0000100 epoch_Time:24123.0min: [2024-01-06 06:19:38,071][model8_pretrain.py][INFO] Epoch:[0/2](798200/4588595) loss:3.001 lr:0.0000100 epoch_Time:24123.0min: [2024-01-06 06:19:38,071][model8_pretrain.py][INFO] Epoch:[0/2](798200/4588595) loss:2.925 lr:0.0000100 epoch_Time:24123.0min: [2024-01-06 06:19:38,072][model8_pretrain.py][INFO] Epoch:[0/2](798200/4588595) loss:3.287 lr:0.0000100 epoch_Time:24123.0min: [2024-01-06 06:19:38,071][model8_pretrain.py][INFO] Epoch:[0/2](798200/4588595) loss:3.228 lr:0.0000100 epoch_Time:24123.0min: [2024-01-06 06:19:38,071][model8_pretrain.py][INFO] Epoch:[0/2](798200/4588595) loss:2.534 lr:0.0000100 epoch_Time:24123.0min: [2024-01-06 06:19:38,071][model8_pretrain.py][INFO] Epoch:[0/2](798200/4588595) loss:1.942 lr:0.0000100 epoch_Time:24123.0min: [2024-01-06 06:19:38,071][model8_pretrain.py][INFO] Epoch:[0/2](798200/4588595) loss:3.114 lr:0.0000100 epoch_Time:24123.0min: [2024-01-06 06:20:15,007][model8_pretrain.py][INFO] Epoch:[0/2](798300/4588595) loss:2.916 lr:0.0000100 epoch_Time:24122.0min: [2024-01-06 06:20:15,007][model8_pretrain.py][INFO] Epoch:[0/2](798300/4588595) loss:2.524 lr:0.0000100 epoch_Time:24122.0min: [2024-01-06 06:20:15,007][model8_pretrain.py][INFO] Epoch:[0/2](798300/4588595) loss:3.161 lr:0.0000100 epoch_Time:24122.0min: [2024-01-06 06:20:15,007][model8_pretrain.py][INFO] Epoch:[0/2](798300/4588595) loss:2.596 lr:0.0000100 
epoch_Time:24122.0min: [2024-01-06 06:20:15,007][model8_pretrain.py][INFO] Epoch:[0/2](798300/4588595) loss:3.378 lr:0.0000100 epoch_Time:24122.0min: [2024-01-06 06:20:15,007][model8_pretrain.py][INFO] Epoch:[0/2](798300/4588595) loss:2.922 lr:0.0000100 epoch_Time:24122.0min: [2024-01-06 06:20:15,007][model8_pretrain.py][INFO] Epoch:[0/2](798300/4588595) loss:2.711 lr:0.0000100 epoch_Time:24122.0min: [2024-01-06 06:20:15,007][model8_pretrain.py][INFO] Epoch:[0/2](798300/4588595) loss:2.552 lr:0.0000100 epoch_Time:24122.0min: [2024-01-06 06:20:51,934][model8_pretrain.py][INFO] Epoch:[0/2](798400/4588595) loss:2.988 lr:0.0000100 epoch_Time:24121.0min: [2024-01-06 06:20:51,934][model8_pretrain.py][INFO] Epoch:[0/2](798400/4588595) loss:3.221 lr:0.0000100 epoch_Time:24121.0min: [2024-01-06 06:20:51,934][model8_pretrain.py][INFO] Epoch:[0/2](798400/4588595) loss:2.854 lr:0.0000100 epoch_Time:24121.0min: [2024-01-06 06:20:51,934][model8_pretrain.py][INFO] Epoch:[0/2](798400/4588595) loss:2.515 lr:0.0000100 epoch_Time:24121.0min: [2024-01-06 06:20:51,934][model8_pretrain.py][INFO] Epoch:[0/2](798400/4588595) loss:2.751 lr:0.0000100 epoch_Time:24121.0min: [2024-01-06 06:20:51,935][model8_pretrain.py][INFO] Epoch:[0/2](798400/4588595) loss:3.177 lr:0.0000100 epoch_Time:24121.0min: [2024-01-06 06:20:51,935][model8_pretrain.py][INFO] Epoch:[0/2](798400/4588595) loss:2.161 lr:0.0000100 epoch_Time:24121.0min: [2024-01-06 06:20:51,935][model8_pretrain.py][INFO] Epoch:[0/2](798400/4588595) loss:2.244 lr:0.0000100 epoch_Time:24121.0min: [2024-01-06 06:21:28,865][model8_pretrain.py][INFO] Epoch:[0/2](798500/4588595) loss:3.257 lr:0.0000100 epoch_Time:24120.0min: [2024-01-06 06:21:28,866][model8_pretrain.py][INFO] Epoch:[0/2](798500/4588595) loss:2.696 lr:0.0000100 epoch_Time:24120.0min: [2024-01-06 06:21:28,865][model8_pretrain.py][INFO] Epoch:[0/2](798500/4588595) loss:3.220 lr:0.0000100 epoch_Time:24120.0min: [2024-01-06 06:21:28,866][model8_pretrain.py][INFO] 
Epoch:[0/2](798500/4588595) loss:2.634 lr:0.0000100 epoch_Time:24120.0min: [2024-01-06 06:21:28,865][model8_pretrain.py][INFO] Epoch:[0/2](798500/4588595) loss:3.239 lr:0.0000100 epoch_Time:24120.0min: [2024-01-06 06:21:28,866][model8_pretrain.py][INFO] Epoch:[0/2](798500/4588595) loss:3.062 lr:0.0000100 epoch_Time:24120.0min: [2024-01-06 06:21:28,866][model8_pretrain.py][INFO] Epoch:[0/2](798500/4588595) loss:2.692 lr:0.0000100 epoch_Time:24120.0min: [2024-01-06 06:21:28,866][model8_pretrain.py][INFO] Epoch:[0/2](798500/4588595) loss:2.963 lr:0.0000100 epoch_Time:24120.0min: [2024-01-06 06:22:05,807][model8_pretrain.py][INFO] Epoch:[0/2](798600/4588595) loss:2.124 lr:0.0000100 epoch_Time:24119.0min: [2024-01-06 06:22:05,807][model8_pretrain.py][INFO] Epoch:[0/2](798600/4588595) loss:2.433 lr:0.0000100 epoch_Time:24119.0min: [2024-01-06 06:22:05,807][model8_pretrain.py][INFO] Epoch:[0/2](798600/4588595) loss:2.700 lr:0.0000100 epoch_Time:24119.0min: [2024-01-06 06:22:05,808][model8_pretrain.py][INFO] Epoch:[0/2](798600/4588595) loss:2.536 lr:0.0000100 epoch_Time:24119.0min: [2024-01-06 06:22:05,808][model8_pretrain.py][INFO] Epoch:[0/2](798600/4588595) loss:2.795 lr:0.0000100 epoch_Time:24119.0min: [2024-01-06 06:22:05,808][model8_pretrain.py][INFO] Epoch:[0/2](798600/4588595) loss:2.846 lr:0.0000100 epoch_Time:24119.0min: [2024-01-06 06:22:05,808][model8_pretrain.py][INFO] Epoch:[0/2](798600/4588595) loss:2.899 lr:0.0000100 epoch_Time:24119.0min: [2024-01-06 06:22:05,808][model8_pretrain.py][INFO] Epoch:[0/2](798600/4588595) loss:2.636 lr:0.0000100 epoch_Time:24119.0min: [2024-01-06 06:22:44,452][model8_pretrain.py][INFO] Epoch:[0/2](798700/4588595) loss:2.722 lr:0.0000100 epoch_Time:24119.0min: [2024-01-06 06:22:44,452][model8_pretrain.py][INFO] Epoch:[0/2](798700/4588595) loss:2.436 lr:0.0000100 epoch_Time:24119.0min: [2024-01-06 06:22:44,452][model8_pretrain.py][INFO] Epoch:[0/2](798700/4588595) loss:2.997 lr:0.0000100 epoch_Time:24119.0min: [2024-01-06 
06:22:44,452][model8_pretrain.py][INFO] Epoch:[0/2](798700/4588595) loss:2.839 lr:0.0000100 epoch_Time:24119.0min: [2024-01-06 06:22:44,452][model8_pretrain.py][INFO] Epoch:[0/2](798700/4588595) loss:2.848 lr:0.0000100 epoch_Time:24119.0min: [2024-01-06 06:22:44,452][model8_pretrain.py][INFO] Epoch:[0/2](798700/4588595) loss:2.816 lr:0.0000100 epoch_Time:24119.0min: [2024-01-06 06:22:44,452][model8_pretrain.py][INFO] Epoch:[0/2](798700/4588595) loss:2.993 lr:0.0000100 epoch_Time:24119.0min: [2024-01-06 06:22:44,453][model8_pretrain.py][INFO] Epoch:[0/2](798700/4588595) loss:2.718 lr:0.0000100 epoch_Time:24119.0min: [2024-01-06 06:23:29,780][model8_pretrain.py][INFO] Epoch:[0/2](798800/4588595) loss:2.235 lr:0.0000100 epoch_Time:24119.0min: [2024-01-06 06:23:29,780][model8_pretrain.py][INFO] Epoch:[0/2](798800/4588595) loss:3.131 lr:0.0000100 epoch_Time:24119.0min: [2024-01-06 06:23:29,780][model8_pretrain.py][INFO] Epoch:[0/2](798800/4588595) loss:2.528 lr:0.0000100 epoch_Time:24119.0min: [2024-01-06 06:23:29,780][model8_pretrain.py][INFO] Epoch:[0/2](798800/4588595) loss:2.866 lr:0.0000100 epoch_Time:24119.0min: [2024-01-06 06:23:29,780][model8_pretrain.py][INFO] Epoch:[0/2](798800/4588595) loss:2.486 lr:0.0000100 epoch_Time:24119.0min: [2024-01-06 06:23:29,780][model8_pretrain.py][INFO] Epoch:[0/2](798800/4588595) loss:2.761 lr:0.0000100 epoch_Time:24119.0min: [2024-01-06 06:23:29,780][model8_pretrain.py][INFO] Epoch:[0/2](798800/4588595) loss:2.903 lr:0.0000100 epoch_Time:24119.0min: [2024-01-06 06:23:29,780][model8_pretrain.py][INFO] Epoch:[0/2](798800/4588595) loss:2.890 lr:0.0000100 epoch_Time:24119.0min: [2024-01-06 06:24:06,710][model8_pretrain.py][INFO] Epoch:[0/2](798900/4588595) loss:2.839 lr:0.0000100 epoch_Time:24118.0min: [2024-01-06 06:24:06,710][model8_pretrain.py][INFO] Epoch:[0/2](798900/4588595) loss:2.512 lr:0.0000100 epoch_Time:24118.0min: [2024-01-06 06:24:06,710][model8_pretrain.py][INFO] Epoch:[0/2](798900/4588595) loss:2.979 lr:0.0000100 
epoch_Time:24118.0min: [2024-01-06 06:24:06,710][model8_pretrain.py][INFO] Epoch:[0/2](798900/4588595) loss:2.734 lr:0.0000100 epoch_Time:24118.0min: [2024-01-06 06:24:06,710][model8_pretrain.py][INFO] Epoch:[0/2](798900/4588595) loss:2.456 lr:0.0000100 epoch_Time:24118.0min: [2024-01-06 06:24:06,710][model8_pretrain.py][INFO] Epoch:[0/2](798900/4588595) loss:2.788 lr:0.0000100 epoch_Time:24118.0min: [2024-01-06 06:24:06,711][model8_pretrain.py][INFO] Epoch:[0/2](798900/4588595) loss:2.215 lr:0.0000100 epoch_Time:24118.0min: [2024-01-06 06:24:06,711][model8_pretrain.py][INFO] Epoch:[0/2](798900/4588595) loss:2.645 lr:0.0000100 epoch_Time:24118.0min: [2024-01-06 06:24:43,648][model8_pretrain.py][INFO] Epoch:[0/2](799000/4588595) loss:2.791 lr:0.0000100 epoch_Time:24118.0min: [2024-01-06 06:24:43,648][model8_pretrain.py][INFO] Epoch:[0/2](799000/4588595) loss:2.672 lr:0.0000100 epoch_Time:24118.0min: [2024-01-06 06:24:43,648][model8_pretrain.py][INFO] Epoch:[0/2](799000/4588595) loss:2.798 lr:0.0000100 epoch_Time:24118.0min: [2024-01-06 06:24:43,648][model8_pretrain.py][INFO] Epoch:[0/2](799000/4588595) loss:3.014 lr:0.0000100 epoch_Time:24118.0min: [2024-01-06 06:24:43,648][model8_pretrain.py][INFO] Epoch:[0/2](799000/4588595) loss:2.791 lr:0.0000100 epoch_Time:24118.0min: [2024-01-06 06:24:43,648][model8_pretrain.py][INFO] Epoch:[0/2](799000/4588595) loss:3.071 lr:0.0000100 epoch_Time:24118.0min: [2024-01-06 06:24:43,648][model8_pretrain.py][INFO] Epoch:[0/2](799000/4588595) loss:3.136 lr:0.0000100 epoch_Time:24118.0min: [2024-01-06 06:24:43,648][model8_pretrain.py][INFO] Epoch:[0/2](799000/4588595) loss:3.022 lr:0.0000100 epoch_Time:24118.0min: [2024-01-06 06:25:20,599][model8_pretrain.py][INFO] Epoch:[0/2](799100/4588595) loss:3.154 lr:0.0000100 epoch_Time:24117.0min: [2024-01-06 06:25:20,599][model8_pretrain.py][INFO] Epoch:[0/2](799100/4588595) loss:2.624 lr:0.0000100 epoch_Time:24117.0min: [2024-01-06 06:25:20,599][model8_pretrain.py][INFO] 
Epoch:[0/2](799100/4588595) loss:2.960 lr:0.0000100 epoch_Time:24117.0min: [2024-01-06 06:25:20,599][model8_pretrain.py][INFO] Epoch:[0/2](799100/4588595) loss:2.666 lr:0.0000100 epoch_Time:24117.0min: [2024-01-06 06:25:20,599][model8_pretrain.py][INFO] Epoch:[0/2](799100/4588595) loss:2.841 lr:0.0000100 epoch_Time:24117.0min: [2024-01-06 06:25:20,599][model8_pretrain.py][INFO] Epoch:[0/2](799100/4588595) loss:2.920 lr:0.0000100 epoch_Time:24117.0min: [2024-01-06 06:25:20,599][model8_pretrain.py][INFO] Epoch:[0/2](799100/4588595) loss:2.359 lr:0.0000100 epoch_Time:24117.0min: [2024-01-06 06:25:20,599][model8_pretrain.py][INFO] Epoch:[0/2](799100/4588595) loss:2.160 lr:0.0000100 epoch_Time:24117.0min: [2024-01-06 06:25:57,539][model8_pretrain.py][INFO] Epoch:[0/2](799200/4588595) loss:2.703 lr:0.0000100 epoch_Time:24116.0min: [2024-01-06 06:25:57,539][model8_pretrain.py][INFO] Epoch:[0/2](799200/4588595) loss:2.704 lr:0.0000100 epoch_Time:24116.0min: [2024-01-06 06:25:57,539][model8_pretrain.py][INFO] Epoch:[0/2](799200/4588595) loss:2.820 lr:0.0000100 epoch_Time:24116.0min: [2024-01-06 06:25:57,539][model8_pretrain.py][INFO] Epoch:[0/2](799200/4588595) loss:2.189 lr:0.0000100 epoch_Time:24116.0min: [2024-01-06 06:25:57,539][model8_pretrain.py][INFO] Epoch:[0/2](799200/4588595) loss:3.161 lr:0.0000100 epoch_Time:24116.0min: [2024-01-06 06:25:57,539][model8_pretrain.py][INFO] Epoch:[0/2](799200/4588595) loss:3.337 lr:0.0000100 epoch_Time:24116.0min: [2024-01-06 06:25:57,539][model8_pretrain.py][INFO] Epoch:[0/2](799200/4588595) loss:2.300 lr:0.0000100 epoch_Time:24116.0min: [2024-01-06 06:25:57,539][model8_pretrain.py][INFO] Epoch:[0/2](799200/4588595) loss:2.661 lr:0.0000100 epoch_Time:24116.0min: [2024-01-06 06:26:34,475][model8_pretrain.py][INFO] Epoch:[0/2](799300/4588595) loss:2.198 lr:0.0000100 epoch_Time:24115.0min: [2024-01-06 06:26:34,475][model8_pretrain.py][INFO] Epoch:[0/2](799300/4588595) loss:3.353 lr:0.0000100 epoch_Time:24115.0min: [2024-01-06 
06:26:34,475][model8_pretrain.py][INFO] Epoch:[0/2](799300/4588595) loss:2.157 lr:0.0000100 epoch_Time:24115.0min: [2024-01-06 06:26:34,475][model8_pretrain.py][INFO] Epoch:[0/2](799300/4588595) loss:2.430 lr:0.0000100 epoch_Time:24115.0min: [2024-01-06 06:26:34,475][model8_pretrain.py][INFO] Epoch:[0/2](799300/4588595) loss:2.873 lr:0.0000100 epoch_Time:24115.0min: [2024-01-06 06:26:34,475][model8_pretrain.py][INFO] Epoch:[0/2](799300/4588595) loss:2.986 lr:0.0000100 epoch_Time:24115.0min: [2024-01-06 06:26:34,475][model8_pretrain.py][INFO] Epoch:[0/2](799300/4588595) loss:2.806 lr:0.0000100 epoch_Time:24115.0min: [2024-01-06 06:26:34,475][model8_pretrain.py][INFO] Epoch:[0/2](799300/4588595) loss:3.212 lr:0.0000100 epoch_Time:24115.0min: [2024-01-06 06:27:11,408][model8_pretrain.py][INFO] Epoch:[0/2](799400/4588595) loss:2.535 lr:0.0000100 epoch_Time:24114.0min: [2024-01-06 06:27:11,408][model8_pretrain.py][INFO] Epoch:[0/2](799400/4588595) loss:3.031 lr:0.0000100 epoch_Time:24114.0min: [2024-01-06 06:27:11,408][model8_pretrain.py][INFO] Epoch:[0/2](799400/4588595) loss:2.792 lr:0.0000100 epoch_Time:24114.0min: [2024-01-06 06:27:11,408][model8_pretrain.py][INFO] Epoch:[0/2](799400/4588595) loss:3.021 lr:0.0000100 epoch_Time:24114.0min: [2024-01-06 06:27:11,408][model8_pretrain.py][INFO] Epoch:[0/2](799400/4588595) loss:2.949 lr:0.0000100 epoch_Time:24114.0min: [2024-01-06 06:27:11,408][model8_pretrain.py][INFO] Epoch:[0/2](799400/4588595) loss:3.056 lr:0.0000100 epoch_Time:24114.0min: [2024-01-06 06:27:11,408][model8_pretrain.py][INFO] Epoch:[0/2](799400/4588595) loss:2.502 lr:0.0000100 epoch_Time:24114.0min: [2024-01-06 06:27:11,408][model8_pretrain.py][INFO] Epoch:[0/2](799400/4588595) loss:2.498 lr:0.0000100 epoch_Time:24114.0min: [2024-01-06 06:27:48,333][model8_pretrain.py][INFO] Epoch:[0/2](799500/4588595) loss:3.307 lr:0.0000100 epoch_Time:24113.0min: [2024-01-06 06:27:48,333][model8_pretrain.py][INFO] Epoch:[0/2](799500/4588595) loss:3.060 lr:0.0000100 
epoch_Time:24113.0min: [2024-01-06 06:27:48,333][model8_pretrain.py][INFO] Epoch:[0/2](799500/4588595) loss:3.153 lr:0.0000100 epoch_Time:24113.0min: [2024-01-06 06:27:48,333][model8_pretrain.py][INFO] Epoch:[0/2](799500/4588595) loss:2.595 lr:0.0000100 epoch_Time:24113.0min: [2024-01-06 06:27:48,333][model8_pretrain.py][INFO] Epoch:[0/2](799500/4588595) loss:2.769 lr:0.0000100 epoch_Time:24113.0min: [2024-01-06 06:27:48,333][model8_pretrain.py][INFO] Epoch:[0/2](799500/4588595) loss:3.162 lr:0.0000100 epoch_Time:24113.0min: [2024-01-06 06:27:48,333][model8_pretrain.py][INFO] Epoch:[0/2](799500/4588595) loss:2.240 lr:0.0000100 epoch_Time:24113.0min: [2024-01-06 06:27:48,333][model8_pretrain.py][INFO] Epoch:[0/2](799500/4588595) loss:3.462 lr:0.0000100 epoch_Time:24113.0min: [2024-01-06 06:28:35,362][model8_pretrain.py][INFO] Epoch:[0/2](799600/4588595) loss:2.865 lr:0.0000100 epoch_Time:24114.0min: [2024-01-06 06:28:35,362][model8_pretrain.py][INFO] Epoch:[0/2](799600/4588595) loss:2.044 lr:0.0000100 epoch_Time:24114.0min: [2024-01-06 06:28:35,362][model8_pretrain.py][INFO] Epoch:[0/2](799600/4588595) loss:3.126 lr:0.0000100 epoch_Time:24114.0min: [2024-01-06 06:28:35,362][model8_pretrain.py][INFO] Epoch:[0/2](799600/4588595) loss:2.962 lr:0.0000100 epoch_Time:24114.0min: [2024-01-06 06:28:35,362][model8_pretrain.py][INFO] Epoch:[0/2](799600/4588595) loss:3.066 lr:0.0000100 epoch_Time:24114.0min: [2024-01-06 06:28:35,362][model8_pretrain.py][INFO] Epoch:[0/2](799600/4588595) loss:2.205 lr:0.0000100 epoch_Time:24114.0min: [2024-01-06 06:28:35,362][model8_pretrain.py][INFO] Epoch:[0/2](799600/4588595) loss:2.055 lr:0.0000100 epoch_Time:24114.0min: [2024-01-06 06:28:35,362][model8_pretrain.py][INFO] Epoch:[0/2](799600/4588595) loss:2.906 lr:0.0000100 epoch_Time:24114.0min: [2024-01-06 06:29:12,287][model8_pretrain.py][INFO] Epoch:[0/2](799700/4588595) loss:2.796 lr:0.0000100 epoch_Time:24113.0min: [2024-01-06 06:29:12,287][model8_pretrain.py][INFO] 
Epoch:[0/2](799700/4588595) loss:2.940 lr:0.0000100 epoch_Time:24113.0min: [2024-01-06 06:29:12,287][model8_pretrain.py][INFO] Epoch:[0/2](799700/4588595) loss:2.898 lr:0.0000100 epoch_Time:24113.0min: [2024-01-06 06:29:12,287][model8_pretrain.py][INFO] Epoch:[0/2](799700/4588595) loss:2.906 lr:0.0000100 epoch_Time:24113.0min: [2024-01-06 06:29:12,287][model8_pretrain.py][INFO] Epoch:[0/2](799700/4588595) loss:2.980 lr:0.0000100 epoch_Time:24113.0min: [2024-01-06 06:29:12,287][model8_pretrain.py][INFO] Epoch:[0/2](799700/4588595) loss:2.904 lr:0.0000100 epoch_Time:24113.0min: [2024-01-06 06:29:12,287][model8_pretrain.py][INFO] Epoch:[0/2](799700/4588595) loss:2.938 lr:0.0000100 epoch_Time:24113.0min: [2024-01-06 06:29:12,287][model8_pretrain.py][INFO] Epoch:[0/2](799700/4588595) loss:3.140 lr:0.0000100 epoch_Time:24113.0min: [2024-01-06 06:29:49,237][model8_pretrain.py][INFO] Epoch:[0/2](799800/4588595) loss:2.614 lr:0.0000100 epoch_Time:24112.0min: [2024-01-06 06:29:49,237][model8_pretrain.py][INFO] Epoch:[0/2](799800/4588595) loss:2.645 lr:0.0000100 epoch_Time:24112.0min: [2024-01-06 06:29:49,237][model8_pretrain.py][INFO] Epoch:[0/2](799800/4588595) loss:3.294 lr:0.0000100 epoch_Time:24112.0min: [2024-01-06 06:29:49,237][model8_pretrain.py][INFO] Epoch:[0/2](799800/4588595) loss:2.669 lr:0.0000100 epoch_Time:24112.0min: [2024-01-06 06:29:49,237][model8_pretrain.py][INFO] Epoch:[0/2](799800/4588595) loss:2.540 lr:0.0000100 epoch_Time:24112.0min: [2024-01-06 06:29:49,237][model8_pretrain.py][INFO] Epoch:[0/2](799800/4588595) loss:2.598 lr:0.0000100 epoch_Time:24112.0min: [2024-01-06 06:29:49,237][model8_pretrain.py][INFO] Epoch:[0/2](799800/4588595) loss:2.950 lr:0.0000100 epoch_Time:24112.0min: [2024-01-06 06:29:49,237][model8_pretrain.py][INFO] Epoch:[0/2](799800/4588595) loss:2.513 lr:0.0000100 epoch_Time:24112.0min: [2024-01-06 06:30:26,188][model8_pretrain.py][INFO] Epoch:[0/2](799900/4588595) loss:2.965 lr:0.0000100 epoch_Time:24112.0min: [2024-01-06 
06:30:26,188][model8_pretrain.py][INFO] Epoch:[0/2](799900/4588595) loss:3.147 lr:0.0000100 epoch_Time:24112.0min: [2024-01-06 06:30:26,188][model8_pretrain.py][INFO] Epoch:[0/2](799900/4588595) loss:2.793 lr:0.0000100 epoch_Time:24112.0min: [2024-01-06 06:30:26,188][model8_pretrain.py][INFO] Epoch:[0/2](799900/4588595) loss:3.022 lr:0.0000100 epoch_Time:24112.0min: [2024-01-06 06:30:26,188][model8_pretrain.py][INFO] Epoch:[0/2](799900/4588595) loss:2.549 lr:0.0000100 epoch_Time:24112.0min: [2024-01-06 06:30:26,188][model8_pretrain.py][INFO] Epoch:[0/2](799900/4588595) loss:2.810 lr:0.0000100 epoch_Time:24112.0min: [2024-01-06 06:30:26,188][model8_pretrain.py][INFO] Epoch:[0/2](799900/4588595) loss:2.196 lr:0.0000100 epoch_Time:24112.0min: [2024-01-06 06:30:26,188][model8_pretrain.py][INFO] Epoch:[0/2](799900/4588595) loss:2.937 lr:0.0000100 epoch_Time:24112.0min: [2024-01-06 06:31:03,114][model8_pretrain.py][INFO] Epoch:[0/2](800000/4588595) loss:2.821 lr:0.0000100 epoch_Time:24111.0min: [2024-01-06 06:31:03,114][model8_pretrain.py][INFO] Epoch:[0/2](800000/4588595) loss:3.074 lr:0.0000100 epoch_Time:24111.0min: [2024-01-06 06:31:03,114][model8_pretrain.py][INFO] Epoch:[0/2](800000/4588595) loss:3.154 lr:0.0000100 epoch_Time:24111.0min: [2024-01-06 06:31:03,114][model8_pretrain.py][INFO] Epoch:[0/2](800000/4588595) loss:2.841 lr:0.0000100 epoch_Time:24111.0min: [2024-01-06 06:31:03,114][model8_pretrain.py][INFO] Epoch:[0/2](800000/4588595) loss:2.512 lr:0.0000100 epoch_Time:24111.0min: [2024-01-06 06:31:03,114][model8_pretrain.py][INFO] Epoch:[0/2](800000/4588595) loss:2.741 lr:0.0000100 epoch_Time:24111.0min: [2024-01-06 06:31:03,114][model8_pretrain.py][INFO] Epoch:[0/2](800000/4588595) loss:2.664 lr:0.0000100 epoch_Time:24111.0min: [2024-01-06 06:31:03,114][model8_pretrain.py][INFO] Epoch:[0/2](800000/4588595) loss:3.257 lr:0.0000100 epoch_Time:24111.0min: [2024-01-06 06:31:21,245][model8_pretrain.py][INFO] Saved checkpoint to 
out/model8/checkpoint_step_800000.pth [2024-01-06 06:31:21,245][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_800000.pth [2024-01-06 06:31:21,245][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_800000.pth [2024-01-06 06:31:21,245][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_800000.pth [2024-01-06 06:31:21,245][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_800000.pth [2024-01-06 06:31:21,249][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_800000.pth [2024-01-06 06:31:21,250][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_800000.pth [2024-01-06 06:31:21,255][model8_pretrain.py][INFO] Saved checkpoint to out/model8/checkpoint_step_800000.pth [2024-01-06 06:31:58,203][model8_pretrain.py][INFO] Epoch:[0/2](800100/4588595) loss:2.143 lr:0.0000100 epoch_Time:24111.0min: [2024-01-06 06:31:58,203][model8_pretrain.py][INFO] Epoch:[0/2](800100/4588595) loss:3.015 lr:0.0000100 epoch_Time:24111.0min: [2024-01-06 06:31:58,204][model8_pretrain.py][INFO] Epoch:[0/2](800100/4588595) loss:3.091 lr:0.0000100 epoch_Time:24111.0min: [2024-01-06 06:31:58,203][model8_pretrain.py][INFO] Epoch:[0/2](800100/4588595) loss:2.729 lr:0.0000100 epoch_Time:24111.0min: [2024-01-06 06:31:58,204][model8_pretrain.py][INFO] Epoch:[0/2](800100/4588595) loss:2.583 lr:0.0000100 epoch_Time:24111.0min: [2024-01-06 06:31:58,204][model8_pretrain.py][INFO] Epoch:[0/2](800100/4588595) loss:2.821 lr:0.0000100 epoch_Time:24111.0min: [2024-01-06 06:31:58,204][model8_pretrain.py][INFO] Epoch:[0/2](800100/4588595) loss:2.969 lr:0.0000100 epoch_Time:24111.0min: [2024-01-06 06:31:58,204][model8_pretrain.py][INFO] Epoch:[0/2](800100/4588595) loss:2.100 lr:0.0000100 epoch_Time:24111.0min: [2024-01-06 06:32:35,152][model8_pretrain.py][INFO] Epoch:[0/2](800200/4588595) loss:2.826 lr:0.0000100 epoch_Time:24111.0min: [2024-01-06 
06:32:35,152][model8_pretrain.py][INFO] Epoch:[0/2](800200/4588595) loss:2.684 lr:0.0000100 epoch_Time:24111.0min: [2024-01-06 06:32:35,152][model8_pretrain.py][INFO] Epoch:[0/2](800200/4588595) loss:2.904 lr:0.0000100 epoch_Time:24111.0min: [2024-01-06 06:32:35,152][model8_pretrain.py][INFO] Epoch:[0/2](800200/4588595) loss:2.716 lr:0.0000100 epoch_Time:24111.0min: [2024-01-06 06:32:35,152][model8_pretrain.py][INFO] Epoch:[0/2](800200/4588595) loss:2.751 lr:0.0000100 epoch_Time:24111.0min: [2024-01-06 06:32:35,152][model8_pretrain.py][INFO] Epoch:[0/2](800200/4588595) loss:3.191 lr:0.0000100 epoch_Time:24111.0min: [2024-01-06 06:32:35,152][model8_pretrain.py][INFO] Epoch:[0/2](800200/4588595) loss:2.990 lr:0.0000100 epoch_Time:24111.0min: [2024-01-06 06:32:35,152][model8_pretrain.py][INFO] Epoch:[0/2](800200/4588595) loss:2.605 lr:0.0000100 epoch_Time:24111.0min: [2024-01-06 06:33:13,796][model8_pretrain.py][INFO] Epoch:[0/2](800300/4588595) loss:2.780 lr:0.0000100 epoch_Time:24110.0min: [2024-01-06 06:33:13,796][model8_pretrain.py][INFO] Epoch:[0/2](800300/4588595) loss:2.946 lr:0.0000100 epoch_Time:24110.0min: [2024-01-06 06:33:13,796][model8_pretrain.py][INFO] Epoch:[0/2](800300/4588595) loss:3.368 lr:0.0000100 epoch_Time:24110.0min: [2024-01-06 06:33:13,796][model8_pretrain.py][INFO] Epoch:[0/2](800300/4588595) loss:2.959 lr:0.0000100 epoch_Time:24110.0min: [2024-01-06 06:33:13,796][model8_pretrain.py][INFO] Epoch:[0/2](800300/4588595) loss:2.671 lr:0.0000100 epoch_Time:24110.0min: [2024-01-06 06:33:13,796][model8_pretrain.py][INFO] Epoch:[0/2](800300/4588595) loss:2.111 lr:0.0000100 epoch_Time:24110.0min: [2024-01-06 06:33:13,796][model8_pretrain.py][INFO] Epoch:[0/2](800300/4588595) loss:3.207 lr:0.0000100 epoch_Time:24110.0min: [2024-01-06 06:33:13,796][model8_pretrain.py][INFO] Epoch:[0/2](800300/4588595) loss:2.848 lr:0.0000100 epoch_Time:24110.0min: [2024-01-06 06:33:59,586][model8_pretrain.py][INFO] Epoch:[0/2](800400/4588595) loss:3.014 lr:0.0000100 
epoch_Time:24110.0min: [2024-01-06 06:33:59,586][model8_pretrain.py][INFO] Epoch:[0/2](800400/4588595) loss:2.706 lr:0.0000100 epoch_Time:24110.0min: [2024-01-06 06:33:59,586][model8_pretrain.py][INFO] Epoch:[0/2](800400/4588595) loss:2.403 lr:0.0000100 epoch_Time:24110.0min: [2024-01-06 06:33:59,586][model8_pretrain.py][INFO] Epoch:[0/2](800400/4588595) loss:3.088 lr:0.0000100 epoch_Time:24110.0min: [2024-01-06 06:33:59,586][model8_pretrain.py][INFO] Epoch:[0/2](800400/4588595) loss:2.346 lr:0.0000100 epoch_Time:24110.0min: [2024-01-06 06:33:59,586][model8_pretrain.py][INFO] Epoch:[0/2](800400/4588595) loss:2.621 lr:0.0000100 epoch_Time:24110.0min: [2024-01-06 06:33:59,586][model8_pretrain.py][INFO] Epoch:[0/2](800400/4588595) loss:2.926 lr:0.0000100 epoch_Time:24110.0min: [2024-01-06 06:33:59,586][model8_pretrain.py][INFO] Epoch:[0/2](800400/4588595) loss:3.223 lr:0.0000100 epoch_Time:24110.0min: [2024-01-06 06:34:36,517][model8_pretrain.py][INFO] Epoch:[0/2](800500/4588595) loss:2.807 lr:0.0000100 epoch_Time:24110.0min: [2024-01-06 06:34:36,517][model8_pretrain.py][INFO] Epoch:[0/2](800500/4588595) loss:2.446 lr:0.0000100 epoch_Time:24110.0min: [2024-01-06 06:34:36,517][model8_pretrain.py][INFO] Epoch:[0/2](800500/4588595) loss:2.834 lr:0.0000100 epoch_Time:24110.0min: [2024-01-06 06:34:36,517][model8_pretrain.py][INFO] Epoch:[0/2](800500/4588595) loss:3.070 lr:0.0000100 epoch_Time:24110.0min: [2024-01-06 06:34:36,517][model8_pretrain.py][INFO] Epoch:[0/2](800500/4588595) loss:3.127 lr:0.0000100 epoch_Time:24110.0min: [2024-01-06 06:34:36,518][model8_pretrain.py][INFO] Epoch:[0/2](800500/4588595) loss:2.536 lr:0.0000100 epoch_Time:24110.0min: [2024-01-06 06:34:36,518][model8_pretrain.py][INFO] Epoch:[0/2](800500/4588595) loss:2.388 lr:0.0000100 epoch_Time:24110.0min: [2024-01-06 06:34:36,518][model8_pretrain.py][INFO] Epoch:[0/2](800500/4588595) loss:2.786 lr:0.0000100 epoch_Time:24110.0min: [2024-01-06 06:35:13,455][model8_pretrain.py][INFO] 
Epoch:[0/2](800600/4588595) loss:2.352 lr:0.0000100 epoch_Time:24109.0min: [2024-01-06 06:35:13,455][model8_pretrain.py][INFO] Epoch:[0/2](800600/4588595) loss:2.793 lr:0.0000100 epoch_Time:24109.0min: [2024-01-06 06:35:13,455][model8_pretrain.py][INFO] Epoch:[0/2](800600/4588595) loss:2.615 lr:0.0000100 epoch_Time:24109.0min: [2024-01-06 06:35:13,455][model8_pretrain.py][INFO] Epoch:[0/2](800600/4588595) loss:2.757 lr:0.0000100 epoch_Time:24109.0min: [2024-01-06 06:35:13,455][model8_pretrain.py][INFO] Epoch:[0/2](800600/4588595) loss:2.859 lr:0.0000100 epoch_Time:24109.0min: [2024-01-06 06:35:13,455][model8_pretrain.py][INFO] Epoch:[0/2](800600/4588595) loss:3.179 lr:0.0000100 epoch_Time:24109.0min: [2024-01-06 06:35:13,455][model8_pretrain.py][INFO] Epoch:[0/2](800600/4588595) loss:2.708 lr:0.0000100 epoch_Time:24109.0min: [2024-01-06 06:35:13,455][model8_pretrain.py][INFO] Epoch:[0/2](800600/4588595) loss:2.704 lr:0.0000100 epoch_Time:24109.0min: [2024-01-06 06:35:50,401][model8_pretrain.py][INFO] Epoch:[0/2](800700/4588595) loss:2.824 lr:0.0000100 epoch_Time:24108.0min: [2024-01-06 06:35:50,401][model8_pretrain.py][INFO] Epoch:[0/2](800700/4588595) loss:2.757 lr:0.0000100 epoch_Time:24108.0min: [2024-01-06 06:35:50,401][model8_pretrain.py][INFO] Epoch:[0/2](800700/4588595) loss:3.239 lr:0.0000100 epoch_Time:24108.0min: [2024-01-06 06:35:50,401][model8_pretrain.py][INFO] Epoch:[0/2](800700/4588595) loss:3.082 lr:0.0000100 epoch_Time:24108.0min: [2024-01-06 06:35:50,401][model8_pretrain.py][INFO] Epoch:[0/2](800700/4588595) loss:2.433 lr:0.0000100 epoch_Time:24108.0min: [2024-01-06 06:35:50,401][model8_pretrain.py][INFO] Epoch:[0/2](800700/4588595) loss:2.934 lr:0.0000100 epoch_Time:24108.0min: [2024-01-06 06:35:50,401][model8_pretrain.py][INFO] Epoch:[0/2](800700/4588595) loss:1.994 lr:0.0000100 epoch_Time:24108.0min: [2024-01-06 06:35:50,401][model8_pretrain.py][INFO] Epoch:[0/2](800700/4588595) loss:3.383 lr:0.0000100 epoch_Time:24108.0min: [2024-01-06 
06:36:27,353][model8_pretrain.py][INFO] Epoch:[0/2](800800/4588595) loss:2.237 lr:0.0000100 epoch_Time:24107.0min: [2024-01-06 06:36:27,353][model8_pretrain.py][INFO] Epoch:[0/2](800800/4588595) loss:2.952 lr:0.0000100 epoch_Time:24107.0min: [2024-01-06 06:36:27,353][model8_pretrain.py][INFO] Epoch:[0/2](800800/4588595) loss:3.165 lr:0.0000100 epoch_Time:24107.0min: [2024-01-06 06:36:27,353][model8_pretrain.py][INFO] Epoch:[0/2](800800/4588595) loss:1.938 lr:0.0000100 epoch_Time:24107.0min: [2024-01-06 06:36:27,353][model8_pretrain.py][INFO] Epoch:[0/2](800800/4588595) loss:2.464 lr:0.0000100 epoch_Time:24107.0min: [2024-01-06 06:36:27,353][model8_pretrain.py][INFO] Epoch:[0/2](800800/4588595) loss:2.867 lr:0.0000100 epoch_Time:24107.0min: [2024-01-06 06:36:27,353][model8_pretrain.py][INFO] Epoch:[0/2](800800/4588595) loss:3.049 lr:0.0000100 epoch_Time:24107.0min: [2024-01-06 06:36:27,353][model8_pretrain.py][INFO] Epoch:[0/2](800800/4588595) loss:2.168 lr:0.0000100 epoch_Time:24107.0min: [2024-01-06 06:37:04,306][model8_pretrain.py][INFO] Epoch:[0/2](800900/4588595) loss:3.130 lr:0.0000100 epoch_Time:24106.0min: [2024-01-06 06:37:04,306][model8_pretrain.py][INFO] Epoch:[0/2](800900/4588595) loss:2.768 lr:0.0000100 epoch_Time:24106.0min: [2024-01-06 06:37:04,306][model8_pretrain.py][INFO] Epoch:[0/2](800900/4588595) loss:2.646 lr:0.0000100 epoch_Time:24106.0min: [2024-01-06 06:37:04,306][model8_pretrain.py][INFO] Epoch:[0/2](800900/4588595) loss:2.552 lr:0.0000100 epoch_Time:24106.0min: [2024-01-06 06:37:04,306][model8_pretrain.py][INFO] Epoch:[0/2](800900/4588595) loss:2.884 lr:0.0000100 epoch_Time:24106.0min: [2024-01-06 06:37:04,306][model8_pretrain.py][INFO] Epoch:[0/2](800900/4588595) loss:2.927 lr:0.0000100 epoch_Time:24106.0min: [2024-01-06 06:37:04,306][model8_pretrain.py][INFO] Epoch:[0/2](800900/4588595) loss:2.439 lr:0.0000100 epoch_Time:24106.0min: [2024-01-06 06:37:04,306][model8_pretrain.py][INFO] Epoch:[0/2](800900/4588595) loss:2.630 lr:0.0000100 
epoch_Time:24106.0min: [2024-01-06 06:37:41,237][model8_pretrain.py][INFO] Epoch:[0/2](801000/4588595) loss:2.878 lr:0.0000100 epoch_Time:24106.0min: [2024-01-06 06:37:41,237][model8_pretrain.py][INFO] Epoch:[0/2](801000/4588595) loss:3.364 lr:0.0000100 epoch_Time:24106.0min: [2024-01-06 06:37:41,237][model8_pretrain.py][INFO] Epoch:[0/2](801000/4588595) loss:2.600 lr:0.0000100 epoch_Time:24106.0min: [2024-01-06 06:37:41,237][model8_pretrain.py][INFO] Epoch:[0/2](801000/4588595) loss:2.357 lr:0.0000100 epoch_Time:24106.0min: [2024-01-06 06:37:41,237][model8_pretrain.py][INFO] Epoch:[0/2](801000/4588595) loss:3.081 lr:0.0000100 epoch_Time:24106.0min: [2024-01-06 06:37:41,237][model8_pretrain.py][INFO] Epoch:[0/2](801000/4588595) loss:3.146 lr:0.0000100 epoch_Time:24106.0min: [2024-01-06 06:37:41,237][model8_pretrain.py][INFO] Epoch:[0/2](801000/4588595) loss:2.811 lr:0.0000100 epoch_Time:24106.0min: [2024-01-06 06:37:41,237][model8_pretrain.py][INFO] Epoch:[0/2](801000/4588595) loss:2.512 lr:0.0000100 epoch_Time:24106.0min: [2024-01-06 06:38:19,840][model8_pretrain.py][INFO] Epoch:[0/2](801100/4588595) loss:1.837 lr:0.0000100 epoch_Time:24105.0min: [2024-01-06 06:38:19,840][model8_pretrain.py][INFO] Epoch:[0/2](801100/4588595) loss:2.895 lr:0.0000100 epoch_Time:24105.0min: [2024-01-06 06:38:19,840][model8_pretrain.py][INFO] Epoch:[0/2](801100/4588595) loss:2.798 lr:0.0000100 epoch_Time:24105.0min: [2024-01-06 06:38:19,840][model8_pretrain.py][INFO] Epoch:[0/2](801100/4588595) loss:2.859 lr:0.0000100 epoch_Time:24105.0min: [2024-01-06 06:38:19,840][model8_pretrain.py][INFO] Epoch:[0/2](801100/4588595) loss:2.887 lr:0.0000100 epoch_Time:24105.0min: [2024-01-06 06:38:19,840][model8_pretrain.py][INFO] Epoch:[0/2](801100/4588595) loss:3.177 lr:0.0000100 epoch_Time:24105.0min: [2024-01-06 06:38:19,840][model8_pretrain.py][INFO] Epoch:[0/2](801100/4588595) loss:2.513 lr:0.0000100 epoch_Time:24105.0min: [2024-01-06 06:38:19,840][model8_pretrain.py][INFO] 
Epoch:[0/2](801100/4588595) loss:2.432 lr:0.0000100 epoch_Time:24105.0min: [2024-01-06 06:39:05,516][model8_pretrain.py][INFO] Epoch:[0/2](801200/4588595) loss:2.634 lr:0.0000100 epoch_Time:24105.0min: [2024-01-06 06:39:05,516][model8_pretrain.py][INFO] Epoch:[0/2](801200/4588595) loss:2.807 lr:0.0000100 epoch_Time:24105.0min: [2024-01-06 06:39:05,516][model8_pretrain.py][INFO] Epoch:[0/2](801200/4588595) loss:2.522 lr:0.0000100 epoch_Time:24105.0min: [2024-01-06 06:39:05,516][model8_pretrain.py][INFO] Epoch:[0/2](801200/4588595) loss:2.386 lr:0.0000100 epoch_Time:24105.0min: [2024-01-06 06:39:05,516][model8_pretrain.py][INFO] Epoch:[0/2](801200/4588595) loss:2.677 lr:0.0000100 epoch_Time:24105.0min: [2024-01-06 06:39:05,516][model8_pretrain.py][INFO] Epoch:[0/2](801200/4588595) loss:2.219 lr:0.0000100 epoch_Time:24105.0min: [2024-01-06 06:39:05,516][model8_pretrain.py][INFO] Epoch:[0/2](801200/4588595) loss:3.270 lr:0.0000100 epoch_Time:24105.0min: [2024-01-06 06:39:05,516][model8_pretrain.py][INFO] Epoch:[0/2](801200/4588595) loss:2.467 lr:0.0000100 epoch_Time:24105.0min: [2024-01-06 06:39:42,464][model8_pretrain.py][INFO] Epoch:[0/2](801300/4588595) loss:2.386 lr:0.0000100 epoch_Time:24105.0min: [2024-01-06 06:39:42,464][model8_pretrain.py][INFO] Epoch:[0/2](801300/4588595) loss:3.082 lr:0.0000100 epoch_Time:24105.0min: [2024-01-06 06:39:42,464][model8_pretrain.py][INFO] Epoch:[0/2](801300/4588595) loss:3.050 lr:0.0000100 epoch_Time:24105.0min: [2024-01-06 06:39:42,464][model8_pretrain.py][INFO] Epoch:[0/2](801300/4588595) loss:3.261 lr:0.0000100 epoch_Time:24105.0min: [2024-01-06 06:39:42,464][model8_pretrain.py][INFO] Epoch:[0/2](801300/4588595) loss:2.361 lr:0.0000100 epoch_Time:24105.0min: [2024-01-06 06:39:42,464][model8_pretrain.py][INFO] Epoch:[0/2](801300/4588595) loss:3.188 lr:0.0000100 epoch_Time:24105.0min: [2024-01-06 06:39:42,464][model8_pretrain.py][INFO] Epoch:[0/2](801300/4588595) loss:2.592 lr:0.0000100 epoch_Time:24105.0min: [2024-01-06 
06:39:42,464][model8_pretrain.py][INFO] Epoch:[0/2](801300/4588595) loss:2.248 lr:0.0000100 epoch_Time:24105.0min: [2024-01-06 06:40:19,410][model8_pretrain.py][INFO] Epoch:[0/2](801400/4588595) loss:2.435 lr:0.0000100 epoch_Time:24104.0min: [2024-01-06 06:40:19,410][model8_pretrain.py][INFO] Epoch:[0/2](801400/4588595) loss:3.355 lr:0.0000100 epoch_Time:24104.0min: [2024-01-06 06:40:19,410][model8_pretrain.py][INFO] Epoch:[0/2](801400/4588595) loss:2.883 lr:0.0000100 epoch_Time:24104.0min: [2024-01-06 06:40:19,410][model8_pretrain.py][INFO] Epoch:[0/2](801400/4588595) loss:2.357 lr:0.0000100 epoch_Time:24104.0min: [2024-01-06 06:40:19,410][model8_pretrain.py][INFO] Epoch:[0/2](801400/4588595) loss:2.744 lr:0.0000100 epoch_Time:24104.0min: [2024-01-06 06:40:19,410][model8_pretrain.py][INFO] Epoch:[0/2](801400/4588595) loss:2.436 lr:0.0000100 epoch_Time:24104.0min: [2024-01-06 06:40:19,410][model8_pretrain.py][INFO] Epoch:[0/2](801400/4588595) loss:3.300 lr:0.0000100 epoch_Time:24104.0min: [2024-01-06 06:40:19,410][model8_pretrain.py][INFO] Epoch:[0/2](801400/4588595) loss:3.314 lr:0.0000100 epoch_Time:24104.0min: [2024-01-06 06:40:56,364][model8_pretrain.py][INFO] Epoch:[0/2](801500/4588595) loss:2.988 lr:0.0000100 epoch_Time:24103.0min: [2024-01-06 06:40:56,364][model8_pretrain.py][INFO] Epoch:[0/2](801500/4588595) loss:2.486 lr:0.0000100 epoch_Time:24103.0min: [2024-01-06 06:40:56,364][model8_pretrain.py][INFO] Epoch:[0/2](801500/4588595) loss:2.228 lr:0.0000100 epoch_Time:24103.0min: [2024-01-06 06:40:56,364][model8_pretrain.py][INFO] Epoch:[0/2](801500/4588595) loss:3.050 lr:0.0000100 epoch_Time:24103.0min: [2024-01-06 06:40:56,364][model8_pretrain.py][INFO] Epoch:[0/2](801500/4588595) loss:2.691 lr:0.0000100 epoch_Time:24103.0min: [2024-01-06 06:40:56,364][model8_pretrain.py][INFO] Epoch:[0/2](801500/4588595) loss:2.936 lr:0.0000100 epoch_Time:24103.0min: [2024-01-06 06:40:56,364][model8_pretrain.py][INFO] Epoch:[0/2](801500/4588595) loss:3.093 lr:0.0000100 
epoch_Time:24103.0min: [2024-01-06 06:40:56,364][model8_pretrain.py][INFO] Epoch:[0/2](801500/4588595) loss:2.508 lr:0.0000100 epoch_Time:24103.0min: [2024-01-06 06:41:33,320][model8_pretrain.py][INFO] Epoch:[0/2](801600/4588595) loss:2.881 lr:0.0000100 epoch_Time:24102.0min: [2024-01-06 06:41:33,320][model8_pretrain.py][INFO] Epoch:[0/2](801600/4588595) loss:3.206 lr:0.0000100 epoch_Time:24102.0min: [2024-01-06 06:41:33,320][model8_pretrain.py][INFO] Epoch:[0/2](801600/4588595) loss:2.873 lr:0.0000100 epoch_Time:24102.0min: [2024-01-06 06:41:33,320][model8_pretrain.py][INFO] Epoch:[0/2](801600/4588595) loss:3.298 lr:0.0000100 epoch_Time:24102.0min: [2024-01-06 06:41:33,320][model8_pretrain.py][INFO] Epoch:[0/2](801600/4588595) loss:2.573 lr:0.0000100 epoch_Time:24102.0min: [2024-01-06 06:41:33,320][model8_pretrain.py][INFO] Epoch:[0/2](801600/4588595) loss:2.510 lr:0.0000100 epoch_Time:24102.0min: [2024-01-06 06:41:33,320][model8_pretrain.py][INFO] Epoch:[0/2](801600/4588595) loss:2.870 lr:0.0000100 epoch_Time:24102.0min: [2024-01-06 06:41:33,320][model8_pretrain.py][INFO] Epoch:[0/2](801600/4588595) loss:2.386 lr:0.0000100 epoch_Time:24102.0min: [2024-01-06 06:42:10,279][model8_pretrain.py][INFO] Epoch:[0/2](801700/4588595) loss:2.459 lr:0.0000100 epoch_Time:24101.0min: [2024-01-06 06:42:10,279][model8_pretrain.py][INFO] Epoch:[0/2](801700/4588595) loss:2.999 lr:0.0000100 epoch_Time:24101.0min: [2024-01-06 06:42:10,279][model8_pretrain.py][INFO] Epoch:[0/2](801700/4588595) loss:3.148 lr:0.0000100 epoch_Time:24101.0min: [2024-01-06 06:42:10,279][model8_pretrain.py][INFO] Epoch:[0/2](801700/4588595) loss:2.313 lr:0.0000100 epoch_Time:24101.0min: [2024-01-06 06:42:10,279][model8_pretrain.py][INFO] Epoch:[0/2](801700/4588595) loss:3.073 lr:0.0000100 epoch_Time:24101.0min: [2024-01-06 06:42:10,279][model8_pretrain.py][INFO] Epoch:[0/2](801700/4588595) loss:2.803 lr:0.0000100 epoch_Time:24101.0min: [2024-01-06 06:42:10,280][model8_pretrain.py][INFO] 
Epoch:[0/2](801700/4588595) loss:2.951 lr:0.0000100 epoch_Time:24101.0min: [2024-01-06 06:42:10,280][model8_pretrain.py][INFO] Epoch:[0/2](801700/4588595) loss:2.624 lr:0.0000100 epoch_Time:24101.0min: [2024-01-06 06:42:47,218][model8_pretrain.py][INFO] Epoch:[0/2](801800/4588595) loss:2.826 lr:0.0000100 epoch_Time:24101.0min: [2024-01-06 06:42:47,218][model8_pretrain.py][INFO] Epoch:[0/2](801800/4588595) loss:3.312 lr:0.0000100 epoch_Time:24101.0min: [2024-01-06 06:42:47,218][model8_pretrain.py][INFO] Epoch:[0/2](801800/4588595) loss:1.900 lr:0.0000100 epoch_Time:24101.0min: [2024-01-06 06:42:47,218][model8_pretrain.py][INFO] Epoch:[0/2](801800/4588595) loss:3.310 lr:0.0000100 epoch_Time:24101.0min: [2024-01-06 06:42:47,218][model8_pretrain.py][INFO] Epoch:[0/2](801800/4588595) loss:1.972 lr:0.0000100 epoch_Time:24101.0min: [2024-01-06 06:42:47,218][model8_pretrain.py][INFO] Epoch:[0/2](801800/4588595) loss:3.243 lr:0.0000100 epoch_Time:24101.0min: [2024-01-06 06:42:47,218][model8_pretrain.py][INFO] Epoch:[0/2](801800/4588595) loss:2.660 lr:0.0000100 epoch_Time:24101.0min: [2024-01-06 06:42:47,218][model8_pretrain.py][INFO] Epoch:[0/2](801800/4588595) loss:3.162 lr:0.0000100 epoch_Time:24101.0min: [2024-01-06 06:43:25,847][model8_pretrain.py][INFO] Epoch:[0/2](801900/4588595) loss:3.090 lr:0.0000100 epoch_Time:24100.0min: [2024-01-06 06:43:25,847][model8_pretrain.py][INFO] Epoch:[0/2](801900/4588595) loss:2.959 lr:0.0000100 epoch_Time:24100.0min: [2024-01-06 06:43:25,847][model8_pretrain.py][INFO] Epoch:[0/2](801900/4588595) loss:3.287 lr:0.0000100 epoch_Time:24100.0min: [2024-01-06 06:43:25,847][model8_pretrain.py][INFO] Epoch:[0/2](801900/4588595) loss:2.928 lr:0.0000100 epoch_Time:24100.0min: [2024-01-06 06:43:25,847][model8_pretrain.py][INFO] Epoch:[0/2](801900/4588595) loss:2.735 lr:0.0000100 epoch_Time:24100.0min: [2024-01-06 06:43:25,847][model8_pretrain.py][INFO] Epoch:[0/2](801900/4588595) loss:2.887 lr:0.0000100 epoch_Time:24100.0min: [2024-01-06 
06:43:25,847][model8_pretrain.py][INFO] Epoch:[0/2](801900/4588595) loss:2.922 lr:0.0000100 epoch_Time:24100.0min: [2024-01-06 06:43:25,847][model8_pretrain.py][INFO] Epoch:[0/2](801900/4588595) loss:2.974 lr:0.0000100 epoch_Time:24100.0min: [2024-01-06 06:44:11,506][model8_pretrain.py][INFO] Epoch:[0/2](802000/4588595) loss:2.494 lr:0.0000100 epoch_Time:24100.0min: [2024-01-06 06:44:11,506][model8_pretrain.py][INFO] Epoch:[0/2](802000/4588595) loss:2.291 lr:0.0000100 epoch_Time:24100.0min: [2024-01-06 06:44:11,506][model8_pretrain.py][INFO] Epoch:[0/2](802000/4588595) loss:2.921 lr:0.0000100 epoch_Time:24100.0min: [2024-01-06 06:44:11,506][model8_pretrain.py][INFO] Epoch:[0/2](802000/4588595) loss:3.209 lr:0.0000100 epoch_Time:24100.0min: [2024-01-06 06:44:11,506][model8_pretrain.py][INFO] Epoch:[0/2](802000/4588595) loss:2.676 lr:0.0000100 epoch_Time:24100.0min: [2024-01-06 06:44:11,507][model8_pretrain.py][INFO] Epoch:[0/2](802000/4588595) loss:3.113 lr:0.0000100 epoch_Time:24100.0min: [2024-01-06 06:44:11,507][model8_pretrain.py][INFO] Epoch:[0/2](802000/4588595) loss:2.617 lr:0.0000100 epoch_Time:24100.0min: [2024-01-06 06:44:11,507][model8_pretrain.py][INFO] Epoch:[0/2](802000/4588595) loss:2.304 lr:0.0000100 epoch_Time:24100.0min: [2024-01-06 06:44:48,452][model8_pretrain.py][INFO] Epoch:[0/2](802100/4588595) loss:3.257 lr:0.0000100 epoch_Time:24099.0min: [2024-01-06 06:44:48,452][model8_pretrain.py][INFO] Epoch:[0/2](802100/4588595) loss:2.479 lr:0.0000100 epoch_Time:24099.0min: [2024-01-06 06:44:48,452][model8_pretrain.py][INFO] Epoch:[0/2](802100/4588595) loss:3.081 lr:0.0000100 epoch_Time:24099.0min: [2024-01-06 06:44:48,452][model8_pretrain.py][INFO] Epoch:[0/2](802100/4588595) loss:2.779 lr:0.0000100 epoch_Time:24099.0min: [2024-01-06 06:44:48,452][model8_pretrain.py][INFO] Epoch:[0/2](802100/4588595) loss:2.475 lr:0.0000100 epoch_Time:24099.0min: [2024-01-06 06:44:48,452][model8_pretrain.py][INFO] Epoch:[0/2](802100/4588595) loss:2.979 lr:0.0000100 
epoch_Time:24099.0min: [2024-01-06 06:44:48,453][model8_pretrain.py][INFO] Epoch:[0/2](802100/4588595) loss:3.184 lr:0.0000100 epoch_Time:24099.0min: [2024-01-06 06:44:48,454][model8_pretrain.py][INFO] Epoch:[0/2](802100/4588595) loss:1.987 lr:0.0000100 epoch_Time:24099.0min: [2024-01-06 06:45:25,387][model8_pretrain.py][INFO] Epoch:[0/2](802200/4588595) loss:2.562 lr:0.0000100 epoch_Time:24099.0min: [2024-01-06 06:45:25,387][model8_pretrain.py][INFO] Epoch:[0/2](802200/4588595) loss:2.915 lr:0.0000100 epoch_Time:24099.0min: [2024-01-06 06:45:25,387][model8_pretrain.py][INFO] Epoch:[0/2](802200/4588595) loss:2.752 lr:0.0000100 epoch_Time:24099.0min: [2024-01-06 06:45:25,387][model8_pretrain.py][INFO] Epoch:[0/2](802200/4588595) loss:2.981 lr:0.0000100 epoch_Time:24099.0min: [2024-01-06 06:45:25,387][model8_pretrain.py][INFO] Epoch:[0/2](802200/4588595) loss:2.875 lr:0.0000100 epoch_Time:24099.0min: [2024-01-06 06:45:25,387][model8_pretrain.py][INFO] Epoch:[0/2](802200/4588595) loss:2.417 lr:0.0000100 epoch_Time:24099.0min: [2024-01-06 06:45:25,387][model8_pretrain.py][INFO] Epoch:[0/2](802200/4588595) loss:2.729 lr:0.0000100 epoch_Time:24099.0min: [2024-01-06 06:45:25,388][model8_pretrain.py][INFO] Epoch:[0/2](802200/4588595) loss:3.092 lr:0.0000100 epoch_Time:24099.0min: [2024-01-06 06:46:02,331][model8_pretrain.py][INFO] Epoch:[0/2](802300/4588595) loss:2.659 lr:0.0000100 epoch_Time:24098.0min: [2024-01-06 06:46:02,331][model8_pretrain.py][INFO] Epoch:[0/2](802300/4588595) loss:2.535 lr:0.0000100 epoch_Time:24098.0min: [2024-01-06 06:46:02,331][model8_pretrain.py][INFO] Epoch:[0/2](802300/4588595) loss:2.611 lr:0.0000100 epoch_Time:24098.0min: [2024-01-06 06:46:02,331][model8_pretrain.py][INFO] Epoch:[0/2](802300/4588595) loss:3.043 lr:0.0000100 epoch_Time:24098.0min: [2024-01-06 06:46:02,331][model8_pretrain.py][INFO] Epoch:[0/2](802300/4588595) loss:2.533 lr:0.0000100 epoch_Time:24098.0min: [2024-01-06 06:46:02,331][model8_pretrain.py][INFO] 
Epoch:[0/2](802300/4588595) loss:2.884 lr:0.0000100 epoch_Time:24098.0min: [2024-01-06 06:46:02,331][model8_pretrain.py][INFO] Epoch:[0/2](802300/4588595) loss:2.380 lr:0.0000100 epoch_Time:24098.0min: [2024-01-06 06:46:02,332][model8_pretrain.py][INFO] Epoch:[0/2](802300/4588595) loss:2.910 lr:0.0000100 epoch_Time:24098.0min: [2024-01-06 06:46:39,272][model8_pretrain.py][INFO] Epoch:[0/2](802400/4588595) loss:2.598 lr:0.0000100 epoch_Time:24097.0min: [2024-01-06 06:46:39,272][model8_pretrain.py][INFO] Epoch:[0/2](802400/4588595) loss:2.829 lr:0.0000100 epoch_Time:24097.0min: [2024-01-06 06:46:39,272][model8_pretrain.py][INFO] Epoch:[0/2](802400/4588595) loss:3.123 lr:0.0000100 epoch_Time:24098.0min: [2024-01-06 06:46:39,272][model8_pretrain.py][INFO] Epoch:[0/2](802400/4588595) loss:2.852 lr:0.0000100 epoch_Time:24097.0min: [2024-01-06 06:46:39,272][model8_pretrain.py][INFO] Epoch:[0/2](802400/4588595) loss:2.967 lr:0.0000100 epoch_Time:24098.0min: [2024-01-06 06:46:39,272][model8_pretrain.py][INFO] Epoch:[0/2](802400/4588595) loss:2.412 lr:0.0000100 epoch_Time:24097.0min: [2024-01-06 06:46:39,273][model8_pretrain.py][INFO] Epoch:[0/2](802400/4588595) loss:2.343 lr:0.0000100 epoch_Time:24097.0min: [2024-01-06 06:46:39,273][model8_pretrain.py][INFO] Epoch:[0/2](802400/4588595) loss:2.833 lr:0.0000100 epoch_Time:24097.0min: [2024-01-06 06:47:16,207][model8_pretrain.py][INFO] Epoch:[0/2](802500/4588595) loss:2.609 lr:0.0000100 epoch_Time:24096.0min: [2024-01-06 06:47:16,207][model8_pretrain.py][INFO] Epoch:[0/2](802500/4588595) loss:2.616 lr:0.0000100 epoch_Time:24096.0min: [2024-01-06 06:47:16,207][model8_pretrain.py][INFO] Epoch:[0/2](802500/4588595) loss:2.521 lr:0.0000100 epoch_Time:24096.0min: [2024-01-06 06:47:16,207][model8_pretrain.py][INFO] Epoch:[0/2](802500/4588595) loss:2.438 lr:0.0000100 epoch_Time:24096.0min: [2024-01-06 06:47:16,207][model8_pretrain.py][INFO] Epoch:[0/2](802500/4588595) loss:1.863 lr:0.0000100 epoch_Time:24096.0min: [2024-01-06 
06:47:16,207][model8_pretrain.py][INFO] Epoch:[0/2](802500/4588595) loss:2.605 lr:0.0000100 epoch_Time:24096.0min: [2024-01-06 06:47:16,207][model8_pretrain.py][INFO] Epoch:[0/2](802500/4588595) loss:2.779 lr:0.0000100 epoch_Time:24096.0min: [2024-01-06 06:47:16,207][model8_pretrain.py][INFO] Epoch:[0/2](802500/4588595) loss:2.567 lr:0.0000100 epoch_Time:24096.0min: [2024-01-06 06:47:53,133][model8_pretrain.py][INFO] Epoch:[0/2](802600/4588595) loss:3.165 lr:0.0000100 epoch_Time:24095.0min: [2024-01-06 06:47:53,133][model8_pretrain.py][INFO] Epoch:[0/2](802600/4588595) loss:3.033 lr:0.0000100 epoch_Time:24095.0min: [2024-01-06 06:47:53,133][model8_pretrain.py][INFO] Epoch:[0/2](802600/4588595) loss:2.431 lr:0.0000100 epoch_Time:24095.0min: [2024-01-06 06:47:53,133][model8_pretrain.py][INFO] Epoch:[0/2](802600/4588595) loss:2.521 lr:0.0000100 epoch_Time:24095.0min: [2024-01-06 06:47:53,133][model8_pretrain.py][INFO] Epoch:[0/2](802600/4588595) loss:2.870 lr:0.0000100 epoch_Time:24095.0min: [2024-01-06 06:47:53,133][model8_pretrain.py][INFO] Epoch:[0/2](802600/4588595) loss:2.642 lr:0.0000100 epoch_Time:24095.0min: [2024-01-06 06:47:53,133][model8_pretrain.py][INFO] Epoch:[0/2](802600/4588595) loss:3.040 lr:0.0000100 epoch_Time:24095.0min: [2024-01-06 06:47:53,133][model8_pretrain.py][INFO] Epoch:[0/2](802600/4588595) loss:2.650 lr:0.0000100 epoch_Time:24095.0min: [2024-01-06 06:48:31,795][model8_pretrain.py][INFO] Epoch:[0/2](802700/4588595) loss:2.952 lr:0.0000100 epoch_Time:24095.0min: [2024-01-06 06:48:31,795][model8_pretrain.py][INFO] Epoch:[0/2](802700/4588595) loss:3.189 lr:0.0000100 epoch_Time:24095.0min: [2024-01-06 06:48:31,795][model8_pretrain.py][INFO] Epoch:[0/2](802700/4588595) loss:2.569 lr:0.0000100 epoch_Time:24095.0min: [2024-01-06 06:48:31,795][model8_pretrain.py][INFO] Epoch:[0/2](802700/4588595) loss:3.081 lr:0.0000100 epoch_Time:24095.0min: [2024-01-06 06:48:31,795][model8_pretrain.py][INFO] Epoch:[0/2](802700/4588595) loss:2.252 lr:0.0000100 
epoch_Time:24095.0min: [2024-01-06 06:48:31,795][model8_pretrain.py][INFO] Epoch:[0/2](802700/4588595) loss:3.018 lr:0.0000100 epoch_Time:24095.0min: [2024-01-06 06:48:31,796][model8_pretrain.py][INFO] Epoch:[0/2](802700/4588595) loss:2.786 lr:0.0000100 epoch_Time:24095.0min: [2024-01-06 06:48:31,796][model8_pretrain.py][INFO] Epoch:[0/2](802700/4588595) loss:2.370 lr:0.0000100 epoch_Time:24095.0min: [2024-01-06 06:49:17,067][model8_pretrain.py][INFO] Epoch:[0/2](802800/4588595) loss:2.219 lr:0.0000100 epoch_Time:24095.0min: [2024-01-06 06:49:17,067][model8_pretrain.py][INFO] Epoch:[0/2](802800/4588595) loss:2.477 lr:0.0000100 epoch_Time:24095.0min: [2024-01-06 06:49:17,067][model8_pretrain.py][INFO] Epoch:[0/2](802800/4588595) loss:2.463 lr:0.0000100 epoch_Time:24095.0min: [2024-01-06 06:49:17,067][model8_pretrain.py][INFO] Epoch:[0/2](802800/4588595) loss:2.251 lr:0.0000100 epoch_Time:24095.0min: [2024-01-06 06:49:17,067][model8_pretrain.py][INFO] Epoch:[0/2](802800/4588595) loss:2.504 lr:0.0000100 epoch_Time:24095.0min: [2024-01-06 06:49:17,067][model8_pretrain.py][INFO] Epoch:[0/2](802800/4588595) loss:2.651 lr:0.0000100 epoch_Time:24095.0min: [2024-01-06 06:49:17,067][model8_pretrain.py][INFO] Epoch:[0/2](802800/4588595) loss:2.538 lr:0.0000100 epoch_Time:24095.0min: [2024-01-06 06:49:17,067][model8_pretrain.py][INFO] Epoch:[0/2](802800/4588595) loss:2.830 lr:0.0000100 epoch_Time:24095.0min: [2024-01-06 06:49:54,008][model8_pretrain.py][INFO] Epoch:[0/2](802900/4588595) loss:3.167 lr:0.0000100 epoch_Time:24094.0min: [2024-01-06 06:49:54,008][model8_pretrain.py][INFO] Epoch:[0/2](802900/4588595) loss:2.931 lr:0.0000100 epoch_Time:24094.0min: [2024-01-06 06:49:54,008][model8_pretrain.py][INFO] Epoch:[0/2](802900/4588595) loss:2.866 lr:0.0000100 epoch_Time:24094.0min: [2024-01-06 06:49:54,008][model8_pretrain.py][INFO] Epoch:[0/2](802900/4588595) loss:2.932 lr:0.0000100 epoch_Time:24094.0min: [2024-01-06 06:49:54,008][model8_pretrain.py][INFO] 
Epoch:[0/2](802900/4588595) loss:2.360 lr:0.0000100 epoch_Time:24094.0min: [2024-01-06 06:49:54,008][model8_pretrain.py][INFO] Epoch:[0/2](802900/4588595) loss:3.156 lr:0.0000100 epoch_Time:24094.0min: [2024-01-06 06:49:54,008][model8_pretrain.py][INFO] Epoch:[0/2](802900/4588595) loss:2.306 lr:0.0000100 epoch_Time:24094.0min: [2024-01-06 06:49:54,008][model8_pretrain.py][INFO] Epoch:[0/2](802900/4588595) loss:3.128 lr:0.0000100 epoch_Time:24094.0min: [2024-01-06 06:50:30,948][model8_pretrain.py][INFO] Epoch:[0/2](803000/4588595) loss:3.269 lr:0.0000100 epoch_Time:24094.0min: [2024-01-06 06:50:30,948][model8_pretrain.py][INFO] Epoch:[0/2](803000/4588595) loss:2.608 lr:0.0000100 epoch_Time:24094.0min: [2024-01-06 06:50:30,948][model8_pretrain.py][INFO] Epoch:[0/2](803000/4588595) loss:3.063 lr:0.0000100 epoch_Time:24094.0min: [2024-01-06 06:50:30,948][model8_pretrain.py][INFO] Epoch:[0/2](803000/4588595) loss:2.688 lr:0.0000100 epoch_Time:24094.0min: [2024-01-06 06:50:30,948][model8_pretrain.py][INFO] Epoch:[0/2](803000/4588595) loss:3.003 lr:0.0000100 epoch_Time:24094.0min: [2024-01-06 06:50:30,948][model8_pretrain.py][INFO] Epoch:[0/2](803000/4588595) loss:2.938 lr:0.0000100 epoch_Time:24094.0min: [2024-01-06 06:50:30,948][model8_pretrain.py][INFO] Epoch:[0/2](803000/4588595) loss:2.269 lr:0.0000100 epoch_Time:24094.0min: [2024-01-06 06:50:30,948][model8_pretrain.py][INFO] Epoch:[0/2](803000/4588595) loss:3.109 lr:0.0000100 epoch_Time:24094.0min: [2024-01-06 06:51:07,888][model8_pretrain.py][INFO] Epoch:[0/2](803100/4588595) loss:3.119 lr:0.0000100 epoch_Time:24093.0min: [2024-01-06 06:51:07,888][model8_pretrain.py][INFO] Epoch:[0/2](803100/4588595) loss:3.016 lr:0.0000100 epoch_Time:24093.0min: [2024-01-06 06:51:07,888][model8_pretrain.py][INFO] Epoch:[0/2](803100/4588595) loss:3.204 lr:0.0000100 epoch_Time:24093.0min: [2024-01-06 06:51:07,888][model8_pretrain.py][INFO] Epoch:[0/2](803100/4588595) loss:2.934 lr:0.0000100 epoch_Time:24093.0min: [2024-01-06 
06:51:07,888][model8_pretrain.py][INFO] Epoch:[0/2](803100/4588595) loss:2.646 lr:0.0000100 epoch_Time:24093.0min: [2024-01-06 06:51:07,888][model8_pretrain.py][INFO] Epoch:[0/2](803100/4588595) loss:2.097 lr:0.0000100 epoch_Time:24093.0min: [2024-01-06 06:51:07,888][model8_pretrain.py][INFO] Epoch:[0/2](803100/4588595) loss:3.213 lr:0.0000100 epoch_Time:24093.0min: [2024-01-06 06:51:07,888][model8_pretrain.py][INFO] Epoch:[0/2](803100/4588595) loss:3.508 lr:0.0000100 epoch_Time:24093.0min: [2024-01-06 06:51:44,823][model8_pretrain.py][INFO] Epoch:[0/2](803200/4588595) loss:2.796 lr:0.0000100 epoch_Time:24093.0min: [2024-01-06 06:51:44,824][model8_pretrain.py][INFO] Epoch:[0/2](803200/4588595) loss:3.151 lr:0.0000100 epoch_Time:24093.0min: [2024-01-06 06:51:44,824][model8_pretrain.py][INFO] Epoch:[0/2](803200/4588595) loss:2.970 lr:0.0000100 epoch_Time:24093.0min: [2024-01-06 06:51:44,824][model8_pretrain.py][INFO] Epoch:[0/2](803200/4588595) loss:2.886 lr:0.0000100 epoch_Time:24093.0min: [2024-01-06 06:51:44,824][model8_pretrain.py][INFO] Epoch:[0/2](803200/4588595) loss:3.025 lr:0.0000100 epoch_Time:24093.0min: [2024-01-06 06:51:44,824][model8_pretrain.py][INFO] Epoch:[0/2](803200/4588595) loss:2.664 lr:0.0000100 epoch_Time:24093.0min: [2024-01-06 06:51:44,824][model8_pretrain.py][INFO] Epoch:[0/2](803200/4588595) loss:2.069 lr:0.0000100 epoch_Time:24093.0min: [2024-01-06 06:51:44,824][model8_pretrain.py][INFO] Epoch:[0/2](803200/4588595) loss:3.235 lr:0.0000100 epoch_Time:24093.0min: [2024-01-06 06:52:21,760][model8_pretrain.py][INFO] Epoch:[0/2](803300/4588595) loss:2.518 lr:0.0000100 epoch_Time:24091.0min: [2024-01-06 06:52:21,760][model8_pretrain.py][INFO] Epoch:[0/2](803300/4588595) loss:3.118 lr:0.0000100 epoch_Time:24091.0min: [2024-01-06 06:52:21,760][model8_pretrain.py][INFO] Epoch:[0/2](803300/4588595) loss:2.793 lr:0.0000100 epoch_Time:24091.0min: [2024-01-06 06:52:21,760][model8_pretrain.py][INFO] Epoch:[0/2](803300/4588595) loss:3.069 lr:0.0000100 
epoch_Time:24091.0min: [2024-01-06 06:52:21,761][model8_pretrain.py][INFO] Epoch:[0/2](803300/4588595) loss:2.499 lr:0.0000100 epoch_Time:24091.0min: [2024-01-06 06:52:21,761][model8_pretrain.py][INFO] Epoch:[0/2](803300/4588595) loss:2.347 lr:0.0000100 epoch_Time:24091.0min: [2024-01-06 06:52:21,761][model8_pretrain.py][INFO] Epoch:[0/2](803300/4588595) loss:2.914 lr:0.0000100 epoch_Time:24091.0min: [2024-01-06 06:52:21,761][model8_pretrain.py][INFO] Epoch:[0/2](803300/4588595) loss:2.579 lr:0.0000100 epoch_Time:24091.0min: [2024-01-06 06:52:58,695][model8_pretrain.py][INFO] Epoch:[0/2](803400/4588595) loss:2.623 lr:0.0000100 epoch_Time:24090.0min: [2024-01-06 06:52:58,695][model8_pretrain.py][INFO] Epoch:[0/2](803400/4588595) loss:2.869 lr:0.0000100 epoch_Time:24090.0min: [2024-01-06 06:52:58,695][model8_pretrain.py][INFO] Epoch:[0/2](803400/4588595) loss:2.898 lr:0.0000100 epoch_Time:24090.0min: [2024-01-06 06:52:58,695][model8_pretrain.py][INFO] Epoch:[0/2](803400/4588595) loss:2.803 lr:0.0000100 epoch_Time:24090.0min: [2024-01-06 06:52:58,695][model8_pretrain.py][INFO] Epoch:[0/2](803400/4588595) loss:2.851 lr:0.0000100 epoch_Time:24090.0min: [2024-01-06 06:52:58,695][model8_pretrain.py][INFO] Epoch:[0/2](803400/4588595) loss:2.988 lr:0.0000100 epoch_Time:24090.0min: [2024-01-06 06:52:58,695][model8_pretrain.py][INFO] Epoch:[0/2](803400/4588595) loss:3.336 lr:0.0000100 epoch_Time:24090.0min: [2024-01-06 06:52:58,695][model8_pretrain.py][INFO] Epoch:[0/2](803400/4588595) loss:2.734 lr:0.0000100 epoch_Time:24090.0min: [2024-01-06 06:53:37,319][model8_pretrain.py][INFO] Epoch:[0/2](803500/4588595) loss:2.178 lr:0.0000100 epoch_Time:24090.0min: [2024-01-06 06:53:37,319][model8_pretrain.py][INFO] Epoch:[0/2](803500/4588595) loss:2.546 lr:0.0000100 epoch_Time:24090.0min: [2024-01-06 06:53:37,319][model8_pretrain.py][INFO] Epoch:[0/2](803500/4588595) loss:2.799 lr:0.0000100 epoch_Time:24090.0min: [2024-01-06 06:53:37,319][model8_pretrain.py][INFO] 
Epoch:[0/2](803500/4588595) loss:3.037 lr:0.0000100 epoch_Time:24090.0min: [2024-01-06 06:53:37,319][model8_pretrain.py][INFO] Epoch:[0/2](803500/4588595) loss:2.788 lr:0.0000100 epoch_Time:24090.0min: [2024-01-06 06:53:37,319][model8_pretrain.py][INFO] Epoch:[0/2](803500/4588595) loss:2.697 lr:0.0000100 epoch_Time:24090.0min: [2024-01-06 06:53:37,319][model8_pretrain.py][INFO] Epoch:[0/2](803500/4588595) loss:2.704 lr:0.0000100 epoch_Time:24090.0min: [2024-01-06 06:53:37,319][model8_pretrain.py][INFO] Epoch:[0/2](803500/4588595) loss:2.516 lr:0.0000100 epoch_Time:24090.0min: [2024-01-06 06:54:23,088][model8_pretrain.py][INFO] Epoch:[0/2](803600/4588595) loss:3.467 lr:0.0000100 epoch_Time:24090.0min: [2024-01-06 06:54:23,088][model8_pretrain.py][INFO] Epoch:[0/2](803600/4588595) loss:2.852 lr:0.0000100 epoch_Time:24090.0min: [2024-01-06 06:54:23,088][model8_pretrain.py][INFO] Epoch:[0/2](803600/4588595) loss:2.583 lr:0.0000100 epoch_Time:24090.0min: [2024-01-06 06:54:23,088][model8_pretrain.py][INFO] Epoch:[0/2](803600/4588595) loss:2.829 lr:0.0000100 epoch_Time:24090.0min: [2024-01-06 06:54:23,088][model8_pretrain.py][INFO] Epoch:[0/2](803600/4588595) loss:3.001 lr:0.0000100 epoch_Time:24090.0min: [2024-01-06 06:54:23,088][model8_pretrain.py][INFO] Epoch:[0/2](803600/4588595) loss:2.822 lr:0.0000100 epoch_Time:24090.0min: [2024-01-06 06:54:23,088][model8_pretrain.py][INFO] Epoch:[0/2](803600/4588595) loss:2.892 lr:0.0000100 epoch_Time:24090.0min: [2024-01-06 06:54:23,088][model8_pretrain.py][INFO] Epoch:[0/2](803600/4588595) loss:3.053 lr:0.0000100 epoch_Time:24090.0min: [2024-01-06 06:55:00,024][model8_pretrain.py][INFO] Epoch:[0/2](803700/4588595) loss:2.668 lr:0.0000100 epoch_Time:24089.0min: [2024-01-06 06:55:00,024][model8_pretrain.py][INFO] Epoch:[0/2](803700/4588595) loss:2.803 lr:0.0000100 epoch_Time:24089.0min: [2024-01-06 06:55:00,024][model8_pretrain.py][INFO] Epoch:[0/2](803700/4588595) loss:2.975 lr:0.0000100 epoch_Time:24089.0min: [2024-01-06 
06:55:00,024][model8_pretrain.py][INFO] Epoch:[0/2](803700/4588595) loss:3.303 lr:0.0000100 epoch_Time:24089.0min: [2024-01-06 06:55:00,024][model8_pretrain.py][INFO] Epoch:[0/2](803700/4588595) loss:2.243 lr:0.0000100 epoch_Time:24089.0min: [2024-01-06 06:55:00,024][model8_pretrain.py][INFO] Epoch:[0/2](803700/4588595) loss:2.498 lr:0.0000100 epoch_Time:24089.0min: [2024-01-06 06:55:00,024][model8_pretrain.py][INFO] Epoch:[0/2](803700/4588595) loss:2.529 lr:0.0000100 epoch_Time:24089.0min: [2024-01-06 06:55:00,024][model8_pretrain.py][INFO] Epoch:[0/2](803700/4588595) loss:3.041 lr:0.0000100 epoch_Time:24089.0min: [2024-01-06 06:55:36,972][model8_pretrain.py][INFO] Epoch:[0/2](803800/4588595) loss:3.030 lr:0.0000100 epoch_Time:24089.0min: [2024-01-06 06:55:36,972][model8_pretrain.py][INFO] Epoch:[0/2](803800/4588595) loss:2.700 lr:0.0000100 epoch_Time:24089.0min: [2024-01-06 06:55:36,972][model8_pretrain.py][INFO] Epoch:[0/2](803800/4588595) loss:2.772 lr:0.0000100 epoch_Time:24089.0min: [2024-01-06 06:55:36,972][model8_pretrain.py][INFO] Epoch:[0/2](803800/4588595) loss:3.028 lr:0.0000100 epoch_Time:24089.0min: [2024-01-06 06:55:36,972][model8_pretrain.py][INFO] Epoch:[0/2](803800/4588595) loss:2.368 lr:0.0000100 epoch_Time:24089.0min: [2024-01-06 06:55:36,972][model8_pretrain.py][INFO] Epoch:[0/2](803800/4588595) loss:2.687 lr:0.0000100 epoch_Time:24089.0min: [2024-01-06 06:55:36,972][model8_pretrain.py][INFO] Epoch:[0/2](803800/4588595) loss:2.861 lr:0.0000100 epoch_Time:24089.0min: [2024-01-06 06:55:36,972][model8_pretrain.py][INFO] Epoch:[0/2](803800/4588595) loss:2.569 lr:0.0000100 epoch_Time:24089.0min: [2024-01-06 06:56:13,917][model8_pretrain.py][INFO] Epoch:[0/2](803900/4588595) loss:2.470 lr:0.0000100 epoch_Time:24088.0min: [2024-01-06 06:56:13,918][model8_pretrain.py][INFO] Epoch:[0/2](803900/4588595) loss:2.179 lr:0.0000100 epoch_Time:24088.0min: [2024-01-06 06:56:13,918][model8_pretrain.py][INFO] Epoch:[0/2](803900/4588595) loss:2.554 lr:0.0000100 
epoch_Time:24088.0min: [2024-01-06 06:56:13,918][model8_pretrain.py][INFO] Epoch:[0/2](803900/4588595) loss:2.727 lr:0.0000100 epoch_Time:24088.0min: [2024-01-06 06:56:13,918][model8_pretrain.py][INFO] Epoch:[0/2](803900/4588595) loss:3.119 lr:0.0000100 epoch_Time:24088.0min: [2024-01-06 06:56:13,918][model8_pretrain.py][INFO] Epoch:[0/2](803900/4588595) loss:2.632 lr:0.0000100 epoch_Time:24088.0min: [2024-01-06 06:56:13,918][model8_pretrain.py][INFO] Epoch:[0/2](803900/4588595) loss:3.175 lr:0.0000100 epoch_Time:24088.0min: [2024-01-06 06:56:13,918][model8_pretrain.py][INFO] Epoch:[0/2](803900/4588595) loss:3.139 lr:0.0000100 epoch_Time:24088.0min: [2024-01-06 06:56:50,851][model8_pretrain.py][INFO] Epoch:[0/2](804000/4588595) loss:2.700 lr:0.0000100 epoch_Time:24087.0min: [2024-01-06 06:56:50,851][model8_pretrain.py][INFO] Epoch:[0/2](804000/4588595) loss:3.033 lr:0.0000100 epoch_Time:24087.0min: [2024-01-06 06:56:50,851][model8_pretrain.py][INFO] Epoch:[0/2](804000/4588595) loss:3.005 lr:0.0000100 epoch_Time:24087.0min: [2024-01-06 06:56:50,851][model8_pretrain.py][INFO] Epoch:[0/2](804000/4588595) loss:2.758 lr:0.0000100 epoch_Time:24087.0min: [2024-01-06 06:56:50,851][model8_pretrain.py][INFO] Epoch:[0/2](804000/4588595) loss:2.943 lr:0.0000100 epoch_Time:24087.0min: [2024-01-06 06:56:50,851][model8_pretrain.py][INFO] Epoch:[0/2](804000/4588595) loss:3.106 lr:0.0000100 epoch_Time:24087.0min: [2024-01-06 06:56:50,851][model8_pretrain.py][INFO] Epoch:[0/2](804000/4588595) loss:3.013 lr:0.0000100 epoch_Time:24087.0min: [2024-01-06 06:56:50,851][model8_pretrain.py][INFO] Epoch:[0/2](804000/4588595) loss:2.291 lr:0.0000100 epoch_Time:24087.0min: [2024-01-06 06:57:27,788][model8_pretrain.py][INFO] Epoch:[0/2](804100/4588595) loss:2.606 lr:0.0000100 epoch_Time:24086.0min: [2024-01-06 06:57:27,788][model8_pretrain.py][INFO] Epoch:[0/2](804100/4588595) loss:2.440 lr:0.0000100 epoch_Time:24086.0min: [2024-01-06 06:57:27,788][model8_pretrain.py][INFO] 
Epoch:[0/2](804100/4588595) loss:3.062 lr:0.0000100 epoch_Time:24086.0min: [2024-01-06 06:57:27,788][model8_pretrain.py][INFO] Epoch:[0/2](804100/4588595) loss:2.909 lr:0.0000100 epoch_Time:24086.0min: [2024-01-06 06:57:27,788][model8_pretrain.py][INFO] Epoch:[0/2](804100/4588595) loss:3.104 lr:0.0000100 epoch_Time:24086.0min: [2024-01-06 06:57:27,788][model8_pretrain.py][INFO] Epoch:[0/2](804100/4588595) loss:3.033 lr:0.0000100 epoch_Time:24086.0min: [2024-01-06 06:57:27,788][model8_pretrain.py][INFO] Epoch:[0/2](804100/4588595) loss:3.127 lr:0.0000100 epoch_Time:24086.0min: [2024-01-06 06:57:27,788][model8_pretrain.py][INFO] Epoch:[0/2](804100/4588595) loss:2.706 lr:0.0000100 epoch_Time:24086.0min: [2024-01-06 06:58:04,749][model8_pretrain.py][INFO] Epoch:[0/2](804200/4588595) loss:2.674 lr:0.0000100 epoch_Time:24085.0min: [2024-01-06 06:58:04,749][model8_pretrain.py][INFO] Epoch:[0/2](804200/4588595) loss:3.197 lr:0.0000100 epoch_Time:24085.0min: [2024-01-06 06:58:04,749][model8_pretrain.py][INFO] Epoch:[0/2](804200/4588595) loss:2.431 lr:0.0000100 epoch_Time:24085.0min: [2024-01-06 06:58:04,749][model8_pretrain.py][INFO] Epoch:[0/2](804200/4588595) loss:2.640 lr:0.0000100 epoch_Time:24085.0min: [2024-01-06 06:58:04,749][model8_pretrain.py][INFO] Epoch:[0/2](804200/4588595) loss:2.701 lr:0.0000100 epoch_Time:24085.0min: [2024-01-06 06:58:04,749][model8_pretrain.py][INFO] Epoch:[0/2](804200/4588595) loss:2.795 lr:0.0000100 epoch_Time:24085.0min: [2024-01-06 06:58:04,749][model8_pretrain.py][INFO] Epoch:[0/2](804200/4588595) loss:2.062 lr:0.0000100 epoch_Time:24085.0min: [2024-01-06 06:58:04,749][model8_pretrain.py][INFO] Epoch:[0/2](804200/4588595) loss:2.572 lr:0.0000100 epoch_Time:24085.0min: [2024-01-06 06:58:43,349][model8_pretrain.py][INFO] Epoch:[0/2](804300/4588595) loss:2.928 lr:0.0000100 epoch_Time:24085.0min: [2024-01-06 06:58:43,349][model8_pretrain.py][INFO] Epoch:[0/2](804300/4588595) loss:2.822 lr:0.0000100 epoch_Time:24085.0min: [2024-01-06 
06:58:43,349][model8_pretrain.py][INFO] Epoch:[0/2](804300/4588595) loss:2.742 lr:0.0000100 epoch_Time:24085.0min: [2024-01-06 06:58:43,349][model8_pretrain.py][INFO] Epoch:[0/2](804300/4588595) loss:2.592 lr:0.0000100 epoch_Time:24085.0min: [2024-01-06 06:58:43,349][model8_pretrain.py][INFO] Epoch:[0/2](804300/4588595) loss:2.592 lr:0.0000100 epoch_Time:24085.0min: [2024-01-06 06:58:43,349][model8_pretrain.py][INFO] Epoch:[0/2](804300/4588595) loss:2.619 lr:0.0000100 epoch_Time:24085.0min: [2024-01-06 06:58:43,349][model8_pretrain.py][INFO] Epoch:[0/2](804300/4588595) loss:2.531 lr:0.0000100 epoch_Time:24085.0min: [2024-01-06 06:58:43,349][model8_pretrain.py][INFO] Epoch:[0/2](804300/4588595) loss:2.809 lr:0.0000100 epoch_Time:24085.0min: [2024-01-06 06:59:29,326][model8_pretrain.py][INFO] Epoch:[0/2](804400/4588595) loss:3.146 lr:0.0000100 epoch_Time:24085.0min: [2024-01-06 06:59:29,326][model8_pretrain.py][INFO] Epoch:[0/2](804400/4588595) loss:2.236 lr:0.0000100 epoch_Time:24085.0min: [2024-01-06 06:59:29,326][model8_pretrain.py][INFO] Epoch:[0/2](804400/4588595) loss:2.557 lr:0.0000100 epoch_Time:24085.0min: [2024-01-06 06:59:29,326][model8_pretrain.py][INFO] Epoch:[0/2](804400/4588595) loss:2.745 lr:0.0000100 epoch_Time:24085.0min: [2024-01-06 06:59:29,326][model8_pretrain.py][INFO] Epoch:[0/2](804400/4588595) loss:2.530 lr:0.0000100 epoch_Time:24085.0min: [2024-01-06 06:59:29,326][model8_pretrain.py][INFO] Epoch:[0/2](804400/4588595) loss:3.082 lr:0.0000100 epoch_Time:24085.0min: [2024-01-06 06:59:29,326][model8_pretrain.py][INFO] Epoch:[0/2](804400/4588595) loss:3.014 lr:0.0000100 epoch_Time:24085.0min: [2024-01-06 06:59:29,326][model8_pretrain.py][INFO] Epoch:[0/2](804400/4588595) loss:3.275 lr:0.0000100 epoch_Time:24085.0min: [2024-01-06 07:00:06,266][model8_pretrain.py][INFO] Epoch:[0/2](804500/4588595) loss:2.940 lr:0.0000100 epoch_Time:24084.0min: [2024-01-06 07:00:06,266][model8_pretrain.py][INFO] Epoch:[0/2](804500/4588595) loss:1.936 lr:0.0000100 
epoch_Time:24084.0min: [2024-01-06 07:00:06,266][model8_pretrain.py][INFO] Epoch:[0/2](804500/4588595) loss:2.572 lr:0.0000100 epoch_Time:24084.0min: [2024-01-06 07:00:06,266][model8_pretrain.py][INFO] Epoch:[0/2](804500/4588595) loss:2.625 lr:0.0000100 epoch_Time:24084.0min: [2024-01-06 07:00:06,266][model8_pretrain.py][INFO] Epoch:[0/2](804500/4588595) loss:2.613 lr:0.0000100 epoch_Time:24084.0min: [2024-01-06 07:00:06,266][model8_pretrain.py][INFO] Epoch:[0/2](804500/4588595) loss:3.193 lr:0.0000100 epoch_Time:24084.0min: [2024-01-06 07:00:06,266][model8_pretrain.py][INFO] Epoch:[0/2](804500/4588595) loss:2.936 lr:0.0000100 epoch_Time:24084.0min: [2024-01-06 07:00:06,266][model8_pretrain.py][INFO] Epoch:[0/2](804500/4588595) loss:2.914 lr:0.0000100 epoch_Time:24084.0min: [2024-01-06 07:00:43,232][model8_pretrain.py][INFO] Epoch:[0/2](804600/4588595) loss:2.777 lr:0.0000100 epoch_Time:24084.0min: [2024-01-06 07:00:43,232][model8_pretrain.py][INFO] Epoch:[0/2](804600/4588595) loss:2.709 lr:0.0000100 epoch_Time:24084.0min: [2024-01-06 07:00:43,232][model8_pretrain.py][INFO] Epoch:[0/2](804600/4588595) loss:2.882 lr:0.0000100 epoch_Time:24084.0min: [2024-01-06 07:00:43,232][model8_pretrain.py][INFO] Epoch:[0/2](804600/4588595) loss:3.183 lr:0.0000100 epoch_Time:24084.0min: [2024-01-06 07:00:43,232][model8_pretrain.py][INFO] Epoch:[0/2](804600/4588595) loss:2.767 lr:0.0000100 epoch_Time:24084.0min: [2024-01-06 07:00:43,232][model8_pretrain.py][INFO] Epoch:[0/2](804600/4588595) loss:3.038 lr:0.0000100 epoch_Time:24084.0min: [2024-01-06 07:00:43,232][model8_pretrain.py][INFO] Epoch:[0/2](804600/4588595) loss:2.872 lr:0.0000100 epoch_Time:24084.0min: [2024-01-06 07:00:43,232][model8_pretrain.py][INFO] Epoch:[0/2](804600/4588595) loss:2.901 lr:0.0000100 epoch_Time:24084.0min: [2024-01-06 07:01:20,188][model8_pretrain.py][INFO] Epoch:[0/2](804700/4588595) loss:2.782 lr:0.0000100 epoch_Time:24083.0min: [2024-01-06 07:01:20,188][model8_pretrain.py][INFO] 
Epoch:[0/2](804700/4588595) loss:2.892 lr:0.0000100 epoch_Time:24083.0min: [2024-01-06 07:01:20,188][model8_pretrain.py][INFO] Epoch:[0/2](804700/4588595) loss:2.925 lr:0.0000100 epoch_Time:24083.0min: [2024-01-06 07:01:20,188][model8_pretrain.py][INFO] Epoch:[0/2](804700/4588595) loss:2.684 lr:0.0000100 epoch_Time:24083.0min: [2024-01-06 07:01:20,188][model8_pretrain.py][INFO] Epoch:[0/2](804700/4588595) loss:3.081 lr:0.0000100 epoch_Time:24083.0min: [2024-01-06 07:01:20,188][model8_pretrain.py][INFO] Epoch:[0/2](804700/4588595) loss:3.077 lr:0.0000100 epoch_Time:24083.0min: [2024-01-06 07:01:20,188][model8_pretrain.py][INFO] Epoch:[0/2](804700/4588595) loss:3.053 lr:0.0000100 epoch_Time:24083.0min: [2024-01-06 07:01:20,188][model8_pretrain.py][INFO] Epoch:[0/2](804700/4588595) loss:2.727 lr:0.0000100 epoch_Time:24083.0min: [2024-01-06 07:01:57,140][model8_pretrain.py][INFO] Epoch:[0/2](804800/4588595) loss:2.664 lr:0.0000100 epoch_Time:24082.0min: [2024-01-06 07:01:57,140][model8_pretrain.py][INFO] Epoch:[0/2](804800/4588595) loss:1.555 lr:0.0000100 epoch_Time:24082.0min: [2024-01-06 07:01:57,140][model8_pretrain.py][INFO] Epoch:[0/2](804800/4588595) loss:2.255 lr:0.0000100 epoch_Time:24082.0min: [2024-01-06 07:01:57,140][model8_pretrain.py][INFO] Epoch:[0/2](804800/4588595) loss:2.993 lr:0.0000100 epoch_Time:24082.0min: [2024-01-06 07:01:57,141][model8_pretrain.py][INFO] Epoch:[0/2](804800/4588595) loss:2.827 lr:0.0000100 epoch_Time:24082.0min: [2024-01-06 07:01:57,141][model8_pretrain.py][INFO] Epoch:[0/2](804800/4588595) loss:2.754 lr:0.0000100 epoch_Time:24082.0min: [2024-01-06 07:01:57,141][model8_pretrain.py][INFO] Epoch:[0/2](804800/4588595) loss:2.934 lr:0.0000100 epoch_Time:24082.0min: [2024-01-06 07:01:57,141][model8_pretrain.py][INFO] Epoch:[0/2](804800/4588595) loss:2.648 lr:0.0000100 epoch_Time:24082.0min: [2024-01-06 07:02:34,092][model8_pretrain.py][INFO] Epoch:[0/2](804900/4588595) loss:3.289 lr:0.0000100 epoch_Time:24082.0min: [2024-01-06 
07:02:34,092][model8_pretrain.py][INFO] Epoch:[0/2](804900/4588595) loss:2.848 lr:0.0000100 epoch_Time:24082.0min: [2024-01-06 07:02:34,092][model8_pretrain.py][INFO] Epoch:[0/2](804900/4588595) loss:2.467 lr:0.0000100 epoch_Time:24082.0min: [2024-01-06 07:02:34,092][model8_pretrain.py][INFO] Epoch:[0/2](804900/4588595) loss:2.818 lr:0.0000100 epoch_Time:24082.0min: [2024-01-06 07:02:34,092][model8_pretrain.py][INFO] Epoch:[0/2](804900/4588595) loss:2.836 lr:0.0000100 epoch_Time:24082.0min: [2024-01-06 07:02:34,092][model8_pretrain.py][INFO] Epoch:[0/2](804900/4588595) loss:2.697 lr:0.0000100 epoch_Time:24082.0min: [2024-01-06 07:02:34,092][model8_pretrain.py][INFO] Epoch:[0/2](804900/4588595) loss:3.135 lr:0.0000100 epoch_Time:24082.0min: [2024-01-06 07:02:34,092][model8_pretrain.py][INFO] Epoch:[0/2](804900/4588595) loss:2.867 lr:0.0000100 epoch_Time:24082.0min: [2024-01-06 07:03:11,041][model8_pretrain.py][INFO] Epoch:[0/2](805000/4588595) loss:2.804 lr:0.0000100 epoch_Time:24080.0min: [2024-01-06 07:03:11,041][model8_pretrain.py][INFO] Epoch:[0/2](805000/4588595) loss:2.536 lr:0.0000100 epoch_Time:24080.0min: [2024-01-06 07:03:11,041][model8_pretrain.py][INFO] Epoch:[0/2](805000/4588595) loss:2.520 lr:0.0000100 epoch_Time:24080.0min: [2024-01-06 07:03:11,041][model8_pretrain.py][INFO] Epoch:[0/2](805000/4588595) loss:2.512 lr:0.0000100 epoch_Time:24080.0min: [2024-01-06 07:03:11,041][model8_pretrain.py][INFO] Epoch:[0/2](805000/4588595) loss:2.634 lr:0.0000100 epoch_Time:24080.0min: [2024-01-06 07:03:11,042][model8_pretrain.py][INFO] Epoch:[0/2](805000/4588595) loss:3.037 lr:0.0000100 epoch_Time:24080.0min: [2024-01-06 07:03:11,042][model8_pretrain.py][INFO] Epoch:[0/2](805000/4588595) loss:3.151 lr:0.0000100 epoch_Time:24080.0min: [2024-01-06 07:03:11,042][model8_pretrain.py][INFO] Epoch:[0/2](805000/4588595) loss:3.103 lr:0.0000100 epoch_Time:24080.0min: [2024-01-06 07:03:47,984][model8_pretrain.py][INFO] Epoch:[0/2](805100/4588595) loss:3.025 lr:0.0000100 
epoch_Time:24079.0min: [2024-01-06 07:03:47,984][model8_pretrain.py][INFO] Epoch:[0/2](805100/4588595) loss:2.391 lr:0.0000100 epoch_Time:24079.0min: [2024-01-06 07:03:47,984][model8_pretrain.py][INFO] Epoch:[0/2](805100/4588595) loss:2.419 lr:0.0000100 epoch_Time:24079.0min: [2024-01-06 07:03:47,984][model8_pretrain.py][INFO] Epoch:[0/2](805100/4588595) loss:3.141 lr:0.0000100 epoch_Time:24079.0min: [2024-01-06 07:03:47,984][model8_pretrain.py][INFO] Epoch:[0/2](805100/4588595) loss:2.620 lr:0.0000100 epoch_Time:24079.0min: [2024-01-06 07:03:47,984][model8_pretrain.py][INFO] Epoch:[0/2](805100/4588595) loss:3.200 lr:0.0000100 epoch_Time:24079.0min: [2024-01-06 07:03:47,984][model8_pretrain.py][INFO] Epoch:[0/2](805100/4588595) loss:2.255 lr:0.0000100 epoch_Time:24079.0min: [2024-01-06 07:03:47,984][model8_pretrain.py][INFO] Epoch:[0/2](805100/4588595) loss:3.040 lr:0.0000100 epoch_Time:24079.0min: [2024-01-06 07:04:35,339][model8_pretrain.py][INFO] Epoch:[0/2](805200/4588595) loss:2.709 lr:0.0000100 epoch_Time:24080.0min: [2024-01-06 07:04:35,339][model8_pretrain.py][INFO] Epoch:[0/2](805200/4588595) loss:2.864 lr:0.0000100 epoch_Time:24080.0min: [2024-01-06 07:04:35,339][model8_pretrain.py][INFO] Epoch:[0/2](805200/4588595) loss:2.251 lr:0.0000100 epoch_Time:24080.0min: [2024-01-06 07:04:35,339][model8_pretrain.py][INFO] Epoch:[0/2](805200/4588595) loss:2.807 lr:0.0000100 epoch_Time:24080.0min: [2024-01-06 07:04:35,339][model8_pretrain.py][INFO] Epoch:[0/2](805200/4588595) loss:2.857 lr:0.0000100 epoch_Time:24080.0min: [2024-01-06 07:04:35,339][model8_pretrain.py][INFO] Epoch:[0/2](805200/4588595) loss:2.497 lr:0.0000100 epoch_Time:24080.0min: [2024-01-06 07:04:35,339][model8_pretrain.py][INFO] Epoch:[0/2](805200/4588595) loss:2.478 lr:0.0000100 epoch_Time:24080.0min: [2024-01-06 07:04:35,339][model8_pretrain.py][INFO] Epoch:[0/2](805200/4588595) loss:2.858 lr:0.0000100 epoch_Time:24080.0min: [2024-01-06 07:05:12,277][model8_pretrain.py][INFO] 
Epoch:[0/2](805300/4588595) loss:3.035 lr:0.0000100 epoch_Time:24079.0min: [2024-01-06 07:05:12,277][model8_pretrain.py][INFO] Epoch:[0/2](805300/4588595) loss:3.175 lr:0.0000100 epoch_Time:24079.0min: [2024-01-06 07:05:12,277][model8_pretrain.py][INFO] Epoch:[0/2](805300/4588595) loss:2.532 lr:0.0000100 epoch_Time:24079.0min: [2024-01-06 07:05:12,277][model8_pretrain.py][INFO] Epoch:[0/2](805300/4588595) loss:2.538 lr:0.0000100 epoch_Time:24079.0min: [2024-01-06 07:05:12,277][model8_pretrain.py][INFO] Epoch:[0/2](805300/4588595) loss:2.681 lr:0.0000100 epoch_Time:24079.0min: [2024-01-06 07:05:12,277][model8_pretrain.py][INFO] Epoch:[0/2](805300/4588595) loss:2.892 lr:0.0000100 epoch_Time:24079.0min: [2024-01-06 07:05:12,277][model8_pretrain.py][INFO] Epoch:[0/2](805300/4588595) loss:2.240 lr:0.0000100 epoch_Time:24079.0min: [2024-01-06 07:05:12,277][model8_pretrain.py][INFO] Epoch:[0/2](805300/4588595) loss:2.828 lr:0.0000100 epoch_Time:24079.0min: [2024-01-06 07:05:49,219][model8_pretrain.py][INFO] Epoch:[0/2](805400/4588595) loss:2.850 lr:0.0000100 epoch_Time:24078.0min: [2024-01-06 07:05:49,219][model8_pretrain.py][INFO] Epoch:[0/2](805400/4588595) loss:2.219 lr:0.0000100 epoch_Time:24078.0min: [2024-01-06 07:05:49,219][model8_pretrain.py][INFO] Epoch:[0/2](805400/4588595) loss:2.749 lr:0.0000100 epoch_Time:24078.0min: [2024-01-06 07:05:49,219][model8_pretrain.py][INFO] Epoch:[0/2](805400/4588595) loss:2.617 lr:0.0000100 epoch_Time:24078.0min: [2024-01-06 07:05:49,219][model8_pretrain.py][INFO] Epoch:[0/2](805400/4588595) loss:2.704 lr:0.0000100 epoch_Time:24078.0min: [2024-01-06 07:05:49,219][model8_pretrain.py][INFO] Epoch:[0/2](805400/4588595) loss:3.169 lr:0.0000100 epoch_Time:24078.0min: [2024-01-06 07:05:49,219][model8_pretrain.py][INFO] Epoch:[0/2](805400/4588595) loss:2.580 lr:0.0000100 epoch_Time:24078.0min: [2024-01-06 07:05:49,219][model8_pretrain.py][INFO] Epoch:[0/2](805400/4588595) loss:2.871 lr:0.0000100 epoch_Time:24078.0min: [2024-01-06 
07:06:26,160][model8_pretrain.py][INFO] Epoch:[0/2](805500/4588595) loss:2.682 lr:0.0000100 epoch_Time:24078.0min: [2024-01-06 07:06:26,160][model8_pretrain.py][INFO] Epoch:[0/2](805500/4588595) loss:2.667 lr:0.0000100 epoch_Time:24078.0min: [2024-01-06 07:06:26,160][model8_pretrain.py][INFO] Epoch:[0/2](805500/4588595) loss:2.684 lr:0.0000100 epoch_Time:24078.0min: [2024-01-06 07:06:26,160][model8_pretrain.py][INFO] Epoch:[0/2](805500/4588595) loss:3.127 lr:0.0000100 epoch_Time:24078.0min: [2024-01-06 07:06:26,160][model8_pretrain.py][INFO] Epoch:[0/2](805500/4588595) loss:2.618 lr:0.0000100 epoch_Time:24078.0min: [2024-01-06 07:06:26,160][model8_pretrain.py][INFO] Epoch:[0/2](805500/4588595) loss:3.039 lr:0.0000100 epoch_Time:24078.0min: [2024-01-06 07:06:26,160][model8_pretrain.py][INFO] Epoch:[0/2](805500/4588595) loss:3.359 lr:0.0000100 epoch_Time:24078.0min: [2024-01-06 07:06:26,160][model8_pretrain.py][INFO] Epoch:[0/2](805500/4588595) loss:3.069 lr:0.0000100 epoch_Time:24078.0min: [2024-01-06 07:07:03,095][model8_pretrain.py][INFO] Epoch:[0/2](805600/4588595) loss:2.686 lr:0.0000100 epoch_Time:24077.0min: [2024-01-06 07:07:03,095][model8_pretrain.py][INFO] Epoch:[0/2](805600/4588595) loss:2.221 lr:0.0000100 epoch_Time:24077.0min: [2024-01-06 07:07:03,095][model8_pretrain.py][INFO] Epoch:[0/2](805600/4588595) loss:2.770 lr:0.0000100 epoch_Time:24077.0min: [2024-01-06 07:07:03,095][model8_pretrain.py][INFO] Epoch:[0/2](805600/4588595) loss:3.493 lr:0.0000100 epoch_Time:24077.0min: [2024-01-06 07:07:03,095][model8_pretrain.py][INFO] Epoch:[0/2](805600/4588595) loss:3.329 lr:0.0000100 epoch_Time:24077.0min: [2024-01-06 07:07:03,095][model8_pretrain.py][INFO] Epoch:[0/2](805600/4588595) loss:2.700 lr:0.0000100 epoch_Time:24077.0min: [2024-01-06 07:07:03,095][model8_pretrain.py][INFO] Epoch:[0/2](805600/4588595) loss:2.972 lr:0.0000100 epoch_Time:24077.0min: [2024-01-06 07:07:03,095][model8_pretrain.py][INFO] Epoch:[0/2](805600/4588595) loss:3.019 lr:0.0000100 
epoch_Time:24077.0min: [2024-01-06 07:07:40,027][model8_pretrain.py][INFO] Epoch:[0/2](805700/4588595) loss:2.953 lr:0.0000100 epoch_Time:24077.0min: [2024-01-06 07:07:40,027][model8_pretrain.py][INFO] Epoch:[0/2](805700/4588595) loss:2.808 lr:0.0000100 epoch_Time:24077.0min: [2024-01-06 07:07:40,027][model8_pretrain.py][INFO] Epoch:[0/2](805700/4588595) loss:2.476 lr:0.0000100 epoch_Time:24077.0min: [2024-01-06 07:07:40,027][model8_pretrain.py][INFO] Epoch:[0/2](805700/4588595) loss:2.575 lr:0.0000100 epoch_Time:24077.0min: [2024-01-06 07:07:40,027][model8_pretrain.py][INFO] Epoch:[0/2](805700/4588595) loss:2.217 lr:0.0000100 epoch_Time:24077.0min: [2024-01-06 07:07:40,027][model8_pretrain.py][INFO] Epoch:[0/2](805700/4588595) loss:2.505 lr:0.0000100 epoch_Time:24077.0min: [2024-01-06 07:07:40,027][model8_pretrain.py][INFO] Epoch:[0/2](805700/4588595) loss:2.610 lr:0.0000100 epoch_Time:24077.0min: [2024-01-06 07:07:40,027][model8_pretrain.py][INFO] Epoch:[0/2](805700/4588595) loss:2.525 lr:0.0000100 epoch_Time:24077.0min: [2024-01-06 07:08:16,956][model8_pretrain.py][INFO] Epoch:[0/2](805800/4588595) loss:2.582 lr:0.0000100 epoch_Time:24075.0min: [2024-01-06 07:08:16,956][model8_pretrain.py][INFO] Epoch:[0/2](805800/4588595) loss:2.934 lr:0.0000100 epoch_Time:24075.0min: [2024-01-06 07:08:16,956][model8_pretrain.py][INFO] Epoch:[0/2](805800/4588595) loss:3.274 lr:0.0000100 epoch_Time:24075.0min: [2024-01-06 07:08:16,956][model8_pretrain.py][INFO] Epoch:[0/2](805800/4588595) loss:2.755 lr:0.0000100 epoch_Time:24075.0min: [2024-01-06 07:08:16,956][model8_pretrain.py][INFO] Epoch:[0/2](805800/4588595) loss:2.630 lr:0.0000100 epoch_Time:24075.0min: [2024-01-06 07:08:16,956][model8_pretrain.py][INFO] Epoch:[0/2](805800/4588595) loss:2.531 lr:0.0000100 epoch_Time:24075.0min: [2024-01-06 07:08:16,956][model8_pretrain.py][INFO] Epoch:[0/2](805800/4588595) loss:3.059 lr:0.0000100 epoch_Time:24075.0min: [2024-01-06 07:08:16,957][model8_pretrain.py][INFO] 
Epoch:[0/2](805800/4588595) loss:3.021 lr:0.0000100 epoch_Time:24075.0min: [2024-01-06 07:08:53,883][model8_pretrain.py][INFO] Epoch:[0/2](805900/4588595) loss:2.877 lr:0.0000100 epoch_Time:24074.0min: [2024-01-06 07:08:53,883][model8_pretrain.py][INFO] Epoch:[0/2](805900/4588595) loss:3.187 lr:0.0000100 epoch_Time:24074.0min: [2024-01-06 07:08:53,883][model8_pretrain.py][INFO] Epoch:[0/2](805900/4588595) loss:2.599 lr:0.0000100 epoch_Time:24074.0min: [2024-01-06 07:08:53,883][model8_pretrain.py][INFO] Epoch:[0/2](805900/4588595) loss:2.874 lr:0.0000100 epoch_Time:24074.0min: [2024-01-06 07:08:53,883][model8_pretrain.py][INFO] Epoch:[0/2](805900/4588595) loss:2.772 lr:0.0000100 epoch_Time:24074.0min: [2024-01-06 07:08:53,883][model8_pretrain.py][INFO] Epoch:[0/2](805900/4588595) loss:2.950 lr:0.0000100 epoch_Time:24074.0min: [2024-01-06 07:08:53,883][model8_pretrain.py][INFO] Epoch:[0/2](805900/4588595) loss:2.644 lr:0.0000100 epoch_Time:24074.0min: [2024-01-06 07:08:53,883][model8_pretrain.py][INFO] Epoch:[0/2](805900/4588595) loss:2.666 lr:0.0000100 epoch_Time:24074.0min: [2024-01-06 07:09:41,333][model8_pretrain.py][INFO] Epoch:[0/2](806000/4588595) loss:3.032 lr:0.0000100 epoch_Time:24075.0min: [2024-01-06 07:09:41,333][model8_pretrain.py][INFO] Epoch:[0/2](806000/4588595) loss:2.649 lr:0.0000100 epoch_Time:24075.0min: [2024-01-06 07:09:41,333][model8_pretrain.py][INFO] Epoch:[0/2](806000/4588595) loss:3.030 lr:0.0000100 epoch_Time:24075.0min: [2024-01-06 07:09:41,333][model8_pretrain.py][INFO] Epoch:[0/2](806000/4588595) loss:2.819 lr:0.0000100 epoch_Time:24075.0min: [2024-01-06 07:09:41,333][model8_pretrain.py][INFO] Epoch:[0/2](806000/4588595) loss:3.028 lr:0.0000100 epoch_Time:24075.0min: [2024-01-06 07:09:41,333][model8_pretrain.py][INFO] Epoch:[0/2](806000/4588595) loss:2.657 lr:0.0000100 epoch_Time:24075.0min: [2024-01-06 07:09:41,333][model8_pretrain.py][INFO] Epoch:[0/2](806000/4588595) loss:2.714 lr:0.0000100 epoch_Time:24075.0min: [2024-01-06 
07:09:41,333][model8_pretrain.py][INFO] Epoch:[0/2](806000/4588595) loss:2.681 lr:0.0000100 epoch_Time:24075.0min: [2024-01-06 07:10:18,264][model8_pretrain.py][INFO] Epoch:[0/2](806100/4588595) loss:2.474 lr:0.0000100 epoch_Time:24074.0min: [2024-01-06 07:10:18,264][model8_pretrain.py][INFO] Epoch:[0/2](806100/4588595) loss:2.915 lr:0.0000100 epoch_Time:24074.0min: [2024-01-06 07:10:18,264][model8_pretrain.py][INFO] Epoch:[0/2](806100/4588595) loss:2.696 lr:0.0000100 epoch_Time:24074.0min: [2024-01-06 07:10:18,264][model8_pretrain.py][INFO] Epoch:[0/2](806100/4588595) loss:2.778 lr:0.0000100 epoch_Time:24074.0min: [2024-01-06 07:10:18,264][model8_pretrain.py][INFO] Epoch:[0/2](806100/4588595) loss:3.185 lr:0.0000100 epoch_Time:24074.0min: [2024-01-06 07:10:18,264][model8_pretrain.py][INFO] Epoch:[0/2](806100/4588595) loss:3.335 lr:0.0000100 epoch_Time:24074.0min: [2024-01-06 07:10:18,264][model8_pretrain.py][INFO] Epoch:[0/2](806100/4588595) loss:2.806 lr:0.0000100 epoch_Time:24074.0min: [2024-01-06 07:10:18,264][model8_pretrain.py][INFO] Epoch:[0/2](806100/4588595) loss:3.207 lr:0.0000100 epoch_Time:24074.0min: [2024-01-06 07:10:55,207][model8_pretrain.py][INFO] Epoch:[0/2](806200/4588595) loss:2.827 lr:0.0000100 epoch_Time:24073.0min: [2024-01-06 07:10:55,207][model8_pretrain.py][INFO] Epoch:[0/2](806200/4588595) loss:2.967 lr:0.0000100 epoch_Time:24073.0min: [2024-01-06 07:10:55,207][model8_pretrain.py][INFO] Epoch:[0/2](806200/4588595) loss:2.898 lr:0.0000100 epoch_Time:24073.0min: [2024-01-06 07:10:55,207][model8_pretrain.py][INFO] Epoch:[0/2](806200/4588595) loss:2.827 lr:0.0000100 epoch_Time:24073.0min: [2024-01-06 07:10:55,207][model8_pretrain.py][INFO] Epoch:[0/2](806200/4588595) loss:2.300 lr:0.0000100 epoch_Time:24073.0min: [2024-01-06 07:10:55,207][model8_pretrain.py][INFO] Epoch:[0/2](806200/4588595) loss:3.233 lr:0.0000100 epoch_Time:24073.0min: [2024-01-06 07:10:55,207][model8_pretrain.py][INFO] Epoch:[0/2](806200/4588595) loss:2.593 lr:0.0000100 
epoch_Time:24073.0min: [2024-01-06 07:10:55,207][model8_pretrain.py][INFO] Epoch:[0/2](806200/4588595) loss:2.599 lr:0.0000100 epoch_Time:24073.0min: [2024-01-06 07:11:32,162][model8_pretrain.py][INFO] Epoch:[0/2](806300/4588595) loss:3.427 lr:0.0000100 epoch_Time:24073.0min: [2024-01-06 07:11:32,162][model8_pretrain.py][INFO] Epoch:[0/2](806300/4588595) loss:2.939 lr:0.0000100 epoch_Time:24073.0min: [2024-01-06 07:11:32,162][model8_pretrain.py][INFO] Epoch:[0/2](806300/4588595) loss:2.589 lr:0.0000100 epoch_Time:24073.0min: [2024-01-06 07:11:32,162][model8_pretrain.py][INFO] Epoch:[0/2](806300/4588595) loss:2.557 lr:0.0000100 epoch_Time:24073.0min: [2024-01-06 07:11:32,162][model8_pretrain.py][INFO] Epoch:[0/2](806300/4588595) loss:2.693 lr:0.0000100 epoch_Time:24073.0min: [2024-01-06 07:11:32,162][model8_pretrain.py][INFO] Epoch:[0/2](806300/4588595) loss:2.658 lr:0.0000100 epoch_Time:24073.0min: [2024-01-06 07:11:32,162][model8_pretrain.py][INFO] Epoch:[0/2](806300/4588595) loss:2.770 lr:0.0000100 epoch_Time:24073.0min: [2024-01-06 07:11:32,162][model8_pretrain.py][INFO] Epoch:[0/2](806300/4588595) loss:2.553 lr:0.0000100 epoch_Time:24073.0min: [2024-01-06 07:12:09,112][model8_pretrain.py][INFO] Epoch:[0/2](806400/4588595) loss:3.047 lr:0.0000100 epoch_Time:24072.0min: [2024-01-06 07:12:09,112][model8_pretrain.py][INFO] Epoch:[0/2](806400/4588595) loss:2.738 lr:0.0000100 epoch_Time:24072.0min: [2024-01-06 07:12:09,112][model8_pretrain.py][INFO] Epoch:[0/2](806400/4588595) loss:2.779 lr:0.0000100 epoch_Time:24072.0min: [2024-01-06 07:12:09,112][model8_pretrain.py][INFO] Epoch:[0/2](806400/4588595) loss:2.742 lr:0.0000100 epoch_Time:24072.0min: [2024-01-06 07:12:09,112][model8_pretrain.py][INFO] Epoch:[0/2](806400/4588595) loss:2.892 lr:0.0000100 epoch_Time:24072.0min: [2024-01-06 07:12:09,112][model8_pretrain.py][INFO] Epoch:[0/2](806400/4588595) loss:2.446 lr:0.0000100 epoch_Time:24072.0min: [2024-01-06 07:12:09,112][model8_pretrain.py][INFO] 
Epoch:[0/2](806400/4588595) loss:3.385 lr:0.0000100 epoch_Time:24072.0min: [2024-01-06 07:12:09,112][model8_pretrain.py][INFO] Epoch:[0/2](806400/4588595) loss:2.669 lr:0.0000100 epoch_Time:24072.0min: [2024-01-06 07:12:46,050][model8_pretrain.py][INFO] Epoch:[0/2](806500/4588595) loss:3.141 lr:0.0000100 epoch_Time:24072.0min: [2024-01-06 07:12:46,050][model8_pretrain.py][INFO] Epoch:[0/2](806500/4588595) loss:3.073 lr:0.0000100 epoch_Time:24072.0min: [2024-01-06 07:12:46,050][model8_pretrain.py][INFO] Epoch:[0/2](806500/4588595) loss:2.776 lr:0.0000100 epoch_Time:24072.0min: [2024-01-06 07:12:46,050][model8_pretrain.py][INFO] Epoch:[0/2](806500/4588595) loss:2.945 lr:0.0000100 epoch_Time:24072.0min: [2024-01-06 07:12:46,050][model8_pretrain.py][INFO] Epoch:[0/2](806500/4588595) loss:2.810 lr:0.0000100 epoch_Time:24072.0min: [2024-01-06 07:12:46,050][model8_pretrain.py][INFO] Epoch:[0/2](806500/4588595) loss:2.662 lr:0.0000100 epoch_Time:24072.0min: [2024-01-06 07:12:46,050][model8_pretrain.py][INFO] Epoch:[0/2](806500/4588595) loss:3.354 lr:0.0000100 epoch_Time:24072.0min: [2024-01-06 07:12:46,050][model8_pretrain.py][INFO] Epoch:[0/2](806500/4588595) loss:3.094 lr:0.0000100 epoch_Time:24072.0min: [2024-01-06 07:13:22,989][model8_pretrain.py][INFO] Epoch:[0/2](806600/4588595) loss:2.944 lr:0.0000100 epoch_Time:24070.0min: [2024-01-06 07:13:22,989][model8_pretrain.py][INFO] Epoch:[0/2](806600/4588595) loss:2.902 lr:0.0000100 epoch_Time:24070.0min: [2024-01-06 07:13:22,989][model8_pretrain.py][INFO] Epoch:[0/2](806600/4588595) loss:3.148 lr:0.0000100 epoch_Time:24070.0min: [2024-01-06 07:13:22,989][model8_pretrain.py][INFO] Epoch:[0/2](806600/4588595) loss:2.887 lr:0.0000100 epoch_Time:24070.0min: [2024-01-06 07:13:22,989][model8_pretrain.py][INFO] Epoch:[0/2](806600/4588595) loss:2.993 lr:0.0000100 epoch_Time:24070.0min: [2024-01-06 07:13:22,989][model8_pretrain.py][INFO] Epoch:[0/2](806600/4588595) loss:2.913 lr:0.0000100 epoch_Time:24070.0min: [2024-01-06 
07:13:22,989][model8_pretrain.py][INFO] Epoch:[0/2](806600/4588595) loss:2.615 lr:0.0000100 epoch_Time:24070.0min: [2024-01-06 07:13:22,989][model8_pretrain.py][INFO] Epoch:[0/2](806600/4588595) loss:2.278 lr:0.0000100 epoch_Time:24070.0min: [2024-01-06 07:13:59,928][model8_pretrain.py][INFO] Epoch:[0/2](806700/4588595) loss:3.220 lr:0.0000100 epoch_Time:24069.0min: [2024-01-06 07:13:59,928][model8_pretrain.py][INFO] Epoch:[0/2](806700/4588595) loss:2.736 lr:0.0000100 epoch_Time:24069.0min: [2024-01-06 07:13:59,928][model8_pretrain.py][INFO] Epoch:[0/2](806700/4588595) loss:2.448 lr:0.0000100 epoch_Time:24069.0min: [2024-01-06 07:13:59,928][model8_pretrain.py][INFO] Epoch:[0/2](806700/4588595) loss:2.112 lr:0.0000100 epoch_Time:24069.0min: [2024-01-06 07:13:59,928][model8_pretrain.py][INFO] Epoch:[0/2](806700/4588595) loss:2.643 lr:0.0000100 epoch_Time:24069.0min: [2024-01-06 07:13:59,928][model8_pretrain.py][INFO] Epoch:[0/2](806700/4588595) loss:2.445 lr:0.0000100 epoch_Time:24069.0min: [2024-01-06 07:13:59,928][model8_pretrain.py][INFO] Epoch:[0/2](806700/4588595) loss:3.116 lr:0.0000100 epoch_Time:24069.0min: [2024-01-06 07:13:59,928][model8_pretrain.py][INFO] Epoch:[0/2](806700/4588595) loss:2.829 lr:0.0000100 epoch_Time:24069.0min: [2024-01-06 07:14:47,301][model8_pretrain.py][INFO] Epoch:[0/2](806800/4588595) loss:2.839 lr:0.0000100 epoch_Time:24070.0min: [2024-01-06 07:14:47,301][model8_pretrain.py][INFO] Epoch:[0/2](806800/4588595) loss:2.997 lr:0.0000100 epoch_Time:24070.0min: [2024-01-06 07:14:47,301][model8_pretrain.py][INFO] Epoch:[0/2](806800/4588595) loss:3.087 lr:0.0000100 epoch_Time:24070.0min: [2024-01-06 07:14:47,301][model8_pretrain.py][INFO] Epoch:[0/2](806800/4588595) loss:2.889 lr:0.0000100 epoch_Time:24070.0min: [2024-01-06 07:14:47,301][model8_pretrain.py][INFO] Epoch:[0/2](806800/4588595) loss:3.397 lr:0.0000100 epoch_Time:24070.0min: [2024-01-06 07:14:47,301][model8_pretrain.py][INFO] Epoch:[0/2](806800/4588595) loss:2.661 lr:0.0000100 
epoch_Time:24070.0min: [2024-01-06 07:14:47,301][model8_pretrain.py][INFO] Epoch:[0/2](806800/4588595) loss:2.530 lr:0.0000100 epoch_Time:24070.0min: [2024-01-06 07:14:47,302][model8_pretrain.py][INFO] Epoch:[0/2](806800/4588595) loss:2.640 lr:0.0000100 epoch_Time:24070.0min: [2024-01-06 07:15:24,233][model8_pretrain.py][INFO] Epoch:[0/2](806900/4588595) loss:3.005 lr:0.0000100 epoch_Time:24069.0min: [2024-01-06 07:15:24,233][model8_pretrain.py][INFO] Epoch:[0/2](806900/4588595) loss:2.370 lr:0.0000100 epoch_Time:24069.0min: [2024-01-06 07:15:24,233][model8_pretrain.py][INFO] Epoch:[0/2](806900/4588595) loss:2.599 lr:0.0000100 epoch_Time:24069.0min: [2024-01-06 07:15:24,233][model8_pretrain.py][INFO] Epoch:[0/2](806900/4588595) loss:2.692 lr:0.0000100 epoch_Time:24069.0min: [2024-01-06 07:15:24,233][model8_pretrain.py][INFO] Epoch:[0/2](806900/4588595) loss:2.609 lr:0.0000100 epoch_Time:24069.0min: [2024-01-06 07:15:24,233][model8_pretrain.py][INFO] Epoch:[0/2](806900/4588595) loss:2.629 lr:0.0000100 epoch_Time:24069.0min: [2024-01-06 07:15:24,233][model8_pretrain.py][INFO] Epoch:[0/2](806900/4588595) loss:2.820 lr:0.0000100 epoch_Time:24069.0min: [2024-01-06 07:15:24,233][model8_pretrain.py][INFO] Epoch:[0/2](806900/4588595) loss:3.073 lr:0.0000100 epoch_Time:24069.0min: [2024-01-06 07:16:01,175][model8_pretrain.py][INFO] Epoch:[0/2](807000/4588595) loss:2.521 lr:0.0000100 epoch_Time:24068.0min: [2024-01-06 07:16:01,175][model8_pretrain.py][INFO] Epoch:[0/2](807000/4588595) loss:2.894 lr:0.0000100 epoch_Time:24068.0min: [2024-01-06 07:16:01,175][model8_pretrain.py][INFO] Epoch:[0/2](807000/4588595) loss:2.893 lr:0.0000100 epoch_Time:24068.0min: [2024-01-06 07:16:01,175][model8_pretrain.py][INFO] Epoch:[0/2](807000/4588595) loss:3.043 lr:0.0000100 epoch_Time:24068.0min: [2024-01-06 07:16:01,175][model8_pretrain.py][INFO] Epoch:[0/2](807000/4588595) loss:3.121 lr:0.0000100 epoch_Time:24068.0min: [2024-01-06 07:16:01,175][model8_pretrain.py][INFO] 
Epoch:[0/2](807000/4588595) loss:3.294 lr:0.0000100 epoch_Time:24068.0min: [2024-01-06 07:16:01,175][model8_pretrain.py][INFO] Epoch:[0/2](807000/4588595) loss:2.760 lr:0.0000100 epoch_Time:24068.0min: [2024-01-06 07:16:01,175][model8_pretrain.py][INFO] Epoch:[0/2](807000/4588595) loss:3.186 lr:0.0000100 epoch_Time:24068.0min: [2024-01-06 07:16:38,108][model8_pretrain.py][INFO] Epoch:[0/2](807100/4588595) loss:2.959 lr:0.0000100 epoch_Time:24068.0min: [2024-01-06 07:16:38,108][model8_pretrain.py][INFO] Epoch:[0/2](807100/4588595) loss:2.832 lr:0.0000100 epoch_Time:24068.0min: [2024-01-06 07:16:38,108][model8_pretrain.py][INFO] Epoch:[0/2](807100/4588595) loss:3.015 lr:0.0000100 epoch_Time:24068.0min: [2024-01-06 07:16:38,108][model8_pretrain.py][INFO] Epoch:[0/2](807100/4588595) loss:2.656 lr:0.0000100 epoch_Time:24068.0min: [2024-01-06 07:16:38,108][model8_pretrain.py][INFO] Epoch:[0/2](807100/4588595) loss:2.832 lr:0.0000100 epoch_Time:24068.0min: [2024-01-06 07:16:38,108][model8_pretrain.py][INFO] Epoch:[0/2](807100/4588595) loss:2.942 lr:0.0000100 epoch_Time:24068.0min: [2024-01-06 07:16:38,108][model8_pretrain.py][INFO] Epoch:[0/2](807100/4588595) loss:2.663 lr:0.0000100 epoch_Time:24068.0min: [2024-01-06 07:16:38,108][model8_pretrain.py][INFO] Epoch:[0/2](807100/4588595) loss:2.951 lr:0.0000100 epoch_Time:24068.0min: [2024-01-06 07:17:15,054][model8_pretrain.py][INFO] Epoch:[0/2](807200/4588595) loss:3.001 lr:0.0000100 epoch_Time:24067.0min: [2024-01-06 07:17:15,055][model8_pretrain.py][INFO] Epoch:[0/2](807200/4588595) loss:2.321 lr:0.0000100 epoch_Time:24067.0min: [2024-01-06 07:17:15,055][model8_pretrain.py][INFO] Epoch:[0/2](807200/4588595) loss:2.998 lr:0.0000100 epoch_Time:24067.0min: [2024-01-06 07:17:15,055][model8_pretrain.py][INFO] Epoch:[0/2](807200/4588595) loss:2.693 lr:0.0000100 epoch_Time:24067.0min: [2024-01-06 07:17:15,055][model8_pretrain.py][INFO] Epoch:[0/2](807200/4588595) loss:2.960 lr:0.0000100 epoch_Time:24067.0min: [2024-01-06 
07:17:15,055][model8_pretrain.py][INFO] Epoch:[0/2](807200/4588595) loss:1.786 lr:0.0000100 epoch_Time:24067.0min: [2024-01-06 07:17:15,055][model8_pretrain.py][INFO] Epoch:[0/2](807200/4588595) loss:2.955 lr:0.0000100 epoch_Time:24067.0min: [2024-01-06 07:17:15,055][model8_pretrain.py][INFO] Epoch:[0/2](807200/4588595) loss:2.801 lr:0.0000100 epoch_Time:24067.0min: [2024-01-06 07:17:51,991][model8_pretrain.py][INFO] Epoch:[0/2](807300/4588595) loss:2.531 lr:0.0000100 epoch_Time:24066.0min: [2024-01-06 07:17:51,991][model8_pretrain.py][INFO] Epoch:[0/2](807300/4588595) loss:2.693 lr:0.0000100 epoch_Time:24066.0min: [2024-01-06 07:17:51,991][model8_pretrain.py][INFO] Epoch:[0/2](807300/4588595) loss:2.827 lr:0.0000100 epoch_Time:24066.0min: [2024-01-06 07:17:51,991][model8_pretrain.py][INFO] Epoch:[0/2](807300/4588595) loss:2.410 lr:0.0000100 epoch_Time:24066.0min: [2024-01-06 07:17:51,991][model8_pretrain.py][INFO] Epoch:[0/2](807300/4588595) loss:2.972 lr:0.0000100 epoch_Time:24066.0min: [2024-01-06 07:17:51,991][model8_pretrain.py][INFO] Epoch:[0/2](807300/4588595) loss:3.053 lr:0.0000100 epoch_Time:24066.0min: [2024-01-06 07:17:51,992][model8_pretrain.py][INFO] Epoch:[0/2](807300/4588595) loss:2.867 lr:0.0000100 epoch_Time:24066.0min: [2024-01-06 07:17:51,992][model8_pretrain.py][INFO] Epoch:[0/2](807300/4588595) loss:2.071 lr:0.0000100 epoch_Time:24066.0min: [2024-01-06 07:18:28,927][model8_pretrain.py][INFO] Epoch:[0/2](807400/4588595) loss:3.126 lr:0.0000100 epoch_Time:24066.0min: [2024-01-06 07:18:28,927][model8_pretrain.py][INFO] Epoch:[0/2](807400/4588595) loss:2.871 lr:0.0000100 epoch_Time:24066.0min: [2024-01-06 07:18:28,927][model8_pretrain.py][INFO] Epoch:[0/2](807400/4588595) loss:2.499 lr:0.0000100 epoch_Time:24066.0min: [2024-01-06 07:18:28,927][model8_pretrain.py][INFO] Epoch:[0/2](807400/4588595) loss:2.839 lr:0.0000100 epoch_Time:24066.0min: [2024-01-06 07:18:28,927][model8_pretrain.py][INFO] Epoch:[0/2](807400/4588595) loss:3.443 lr:0.0000100 
epoch_Time:24066.0min: [2024-01-06 07:18:28,927][model8_pretrain.py][INFO] Epoch:[0/2](807400/4588595) loss:2.831 lr:0.0000100 epoch_Time:24066.0min: [2024-01-06 07:18:28,927][model8_pretrain.py][INFO] Epoch:[0/2](807400/4588595) loss:2.708 lr:0.0000100 epoch_Time:24066.0min: [2024-01-06 07:18:28,927][model8_pretrain.py][INFO] Epoch:[0/2](807400/4588595) loss:2.967 lr:0.0000100 epoch_Time:24066.0min: [2024-01-06 07:19:05,860][model8_pretrain.py][INFO] Epoch:[0/2](807500/4588595) loss:1.881 lr:0.0000100 epoch_Time:24064.0min: [2024-01-06 07:19:05,860][model8_pretrain.py][INFO] Epoch:[0/2](807500/4588595) loss:2.867 lr:0.0000100 epoch_Time:24064.0min: [2024-01-06 07:19:05,860][model8_pretrain.py][INFO] Epoch:[0/2](807500/4588595) loss:3.374 lr:0.0000100 epoch_Time:24064.0min: [2024-01-06 07:19:05,860][model8_pretrain.py][INFO] Epoch:[0/2](807500/4588595) loss:2.605 lr:0.0000100 epoch_Time:24064.0min: [2024-01-06 07:19:05,860][model8_pretrain.py][INFO] Epoch:[0/2](807500/4588595) loss:3.307 lr:0.0000100 epoch_Time:24064.0min: [2024-01-06 07:19:05,860][model8_pretrain.py][INFO] Epoch:[0/2](807500/4588595) loss:2.425 lr:0.0000100 epoch_Time:24064.0min: [2024-01-06 07:19:05,860][model8_pretrain.py][INFO] Epoch:[0/2](807500/4588595) loss:3.198 lr:0.0000100 epoch_Time:24064.0min: [2024-01-06 07:19:05,860][model8_pretrain.py][INFO] Epoch:[0/2](807500/4588595) loss:2.744 lr:0.0000100 epoch_Time:24064.0min: [2024-01-06 07:19:53,276][model8_pretrain.py][INFO] Epoch:[0/2](807600/4588595) loss:2.740 lr:0.0000100 epoch_Time:24064.0min: [2024-01-06 07:19:53,276][model8_pretrain.py][INFO] Epoch:[0/2](807600/4588595) loss:3.426 lr:0.0000100 epoch_Time:24064.0min: [2024-01-06 07:19:53,276][model8_pretrain.py][INFO] Epoch:[0/2](807600/4588595) loss:2.989 lr:0.0000100 epoch_Time:24064.0min: [2024-01-06 07:19:53,276][model8_pretrain.py][INFO] Epoch:[0/2](807600/4588595) loss:2.829 lr:0.0000100 epoch_Time:24064.0min: [2024-01-06 07:19:53,276][model8_pretrain.py][INFO] 
Epoch:[0/2](807600/4588595) loss:2.654 lr:0.0000100 epoch_Time:24064.0min: [2024-01-06 07:19:53,276][model8_pretrain.py][INFO] Epoch:[0/2](807600/4588595) loss:2.934 lr:0.0000100 epoch_Time:24064.0min: [2024-01-06 07:19:53,276][model8_pretrain.py][INFO] Epoch:[0/2](807600/4588595) loss:3.254 lr:0.0000100 epoch_Time:24064.0min: [2024-01-06 07:19:53,276][model8_pretrain.py][INFO] Epoch:[0/2](807600/4588595) loss:2.663 lr:0.0000100 epoch_Time:24064.0min: [2024-01-06 07:20:30,202][model8_pretrain.py][INFO] Epoch:[0/2](807700/4588595) loss:2.936 lr:0.0000100 epoch_Time:24064.0min: [2024-01-06 07:20:30,202][model8_pretrain.py][INFO] Epoch:[0/2](807700/4588595) loss:3.002 lr:0.0000100 epoch_Time:24064.0min: [2024-01-06 07:20:30,202][model8_pretrain.py][INFO] Epoch:[0/2](807700/4588595) loss:2.340 lr:0.0000100 epoch_Time:24064.0min: [2024-01-06 07:20:30,202][model8_pretrain.py][INFO] Epoch:[0/2](807700/4588595) loss:3.280 lr:0.0000100 epoch_Time:24064.0min: [2024-01-06 07:20:30,202][model8_pretrain.py][INFO] Epoch:[0/2](807700/4588595) loss:2.933 lr:0.0000100 epoch_Time:24064.0min: [2024-01-06 07:20:30,202][model8_pretrain.py][INFO] Epoch:[0/2](807700/4588595) loss:2.336 lr:0.0000100 epoch_Time:24064.0min: [2024-01-06 07:20:30,202][model8_pretrain.py][INFO] Epoch:[0/2](807700/4588595) loss:2.707 lr:0.0000100 epoch_Time:24064.0min: [2024-01-06 07:20:30,202][model8_pretrain.py][INFO] Epoch:[0/2](807700/4588595) loss:2.929 lr:0.0000100 epoch_Time:24064.0min: [2024-01-06 07:21:07,140][model8_pretrain.py][INFO] Epoch:[0/2](807800/4588595) loss:3.116 lr:0.0000100 epoch_Time:24063.0min: [2024-01-06 07:21:07,140][model8_pretrain.py][INFO] Epoch:[0/2](807800/4588595) loss:3.270 lr:0.0000100 epoch_Time:24063.0min: [2024-01-06 07:21:07,140][model8_pretrain.py][INFO] Epoch:[0/2](807800/4588595) loss:2.831 lr:0.0000100 epoch_Time:24063.0min: [2024-01-06 07:21:07,140][model8_pretrain.py][INFO] Epoch:[0/2](807800/4588595) loss:2.863 lr:0.0000100 epoch_Time:24063.0min: [2024-01-06 
07:21:07,140][model8_pretrain.py][INFO] Epoch:[0/2](807800/4588595) loss:2.869 lr:0.0000100 epoch_Time:24063.0min: [2024-01-06 07:21:07,140][model8_pretrain.py][INFO] Epoch:[0/2](807800/4588595) loss:3.034 lr:0.0000100 epoch_Time:24063.0min: [2024-01-06 07:21:07,140][model8_pretrain.py][INFO] Epoch:[0/2](807800/4588595) loss:2.349 lr:0.0000100 epoch_Time:24063.0min: [2024-01-06 07:21:07,140][model8_pretrain.py][INFO] Epoch:[0/2](807800/4588595) loss:2.894 lr:0.0000100 epoch_Time:24063.0min: [2024-01-06 07:21:44,078][model8_pretrain.py][INFO] Epoch:[0/2](807900/4588595) loss:2.917 lr:0.0000100 epoch_Time:24063.0min: [2024-01-06 07:21:44,078][model8_pretrain.py][INFO] Epoch:[0/2](807900/4588595) loss:2.918 lr:0.0000100 epoch_Time:24063.0min: [2024-01-06 07:21:44,078][model8_pretrain.py][INFO] Epoch:[0/2](807900/4588595) loss:3.191 lr:0.0000100 epoch_Time:24063.0min: [2024-01-06 07:21:44,078][model8_pretrain.py][INFO] Epoch:[0/2](807900/4588595) loss:2.995 lr:0.0000100 epoch_Time:24063.0min: [2024-01-06 07:21:44,078][model8_pretrain.py][INFO] Epoch:[0/2](807900/4588595) loss:2.715 lr:0.0000100 epoch_Time:24063.0min: [2024-01-06 07:21:44,078][model8_pretrain.py][INFO] Epoch:[0/2](807900/4588595) loss:2.545 lr:0.0000100 epoch_Time:24063.0min: [2024-01-06 07:21:44,078][model8_pretrain.py][INFO] Epoch:[0/2](807900/4588595) loss:3.310 lr:0.0000100 epoch_Time:24063.0min: [2024-01-06 07:21:44,078][model8_pretrain.py][INFO] Epoch:[0/2](807900/4588595) loss:2.856 lr:0.0000100 epoch_Time:24063.0min: [2024-01-06 07:22:21,012][model8_pretrain.py][INFO] Epoch:[0/2](808000/4588595) loss:2.712 lr:0.0000100 epoch_Time:24062.0min: [2024-01-06 07:22:21,012][model8_pretrain.py][INFO] Epoch:[0/2](808000/4588595) loss:2.584 lr:0.0000100 epoch_Time:24062.0min: [2024-01-06 07:22:21,012][model8_pretrain.py][INFO] Epoch:[0/2](808000/4588595) loss:2.733 lr:0.0000100 epoch_Time:24062.0min: [2024-01-06 07:22:21,012][model8_pretrain.py][INFO] Epoch:[0/2](808000/4588595) loss:2.961 lr:0.0000100 
epoch_Time:24062.0min: [2024-01-06 07:22:21,012][model8_pretrain.py][INFO] Epoch:[0/2](808000/4588595) loss:2.809 lr:0.0000100 epoch_Time:24062.0min: [2024-01-06 07:22:21,012][model8_pretrain.py][INFO] Epoch:[0/2](808000/4588595) loss:2.573 lr:0.0000100 epoch_Time:24062.0min: [2024-01-06 07:22:21,012][model8_pretrain.py][INFO] Epoch:[0/2](808000/4588595) loss:3.082 lr:0.0000100 epoch_Time:24062.0min: [2024-01-06 07:22:21,012][model8_pretrain.py][INFO] Epoch:[0/2](808000/4588595) loss:2.978 lr:0.0000100 epoch_Time:24062.0min: [2024-01-06 07:22:57,936][model8_pretrain.py][INFO] Epoch:[0/2](808100/4588595) loss:2.906 lr:0.0000100 epoch_Time:24061.0min: [2024-01-06 07:22:57,936][model8_pretrain.py][INFO] Epoch:[0/2](808100/4588595) loss:2.239 lr:0.0000100 epoch_Time:24061.0min: [2024-01-06 07:22:57,936][model8_pretrain.py][INFO] Epoch:[0/2](808100/4588595) loss:2.672 lr:0.0000100 epoch_Time:24061.0min: [2024-01-06 07:22:57,936][model8_pretrain.py][INFO] Epoch:[0/2](808100/4588595) loss:2.615 lr:0.0000100 epoch_Time:24061.0min: [2024-01-06 07:22:57,936][model8_pretrain.py][INFO] Epoch:[0/2](808100/4588595) loss:2.730 lr:0.0000100 epoch_Time:24061.0min: [2024-01-06 07:22:57,936][model8_pretrain.py][INFO] Epoch:[0/2](808100/4588595) loss:2.768 lr:0.0000100 epoch_Time:24061.0min: [2024-01-06 07:22:57,936][model8_pretrain.py][INFO] Epoch:[0/2](808100/4588595) loss:3.035 lr:0.0000100 epoch_Time:24061.0min: [2024-01-06 07:22:57,937][model8_pretrain.py][INFO] Epoch:[0/2](808100/4588595) loss:2.317 lr:0.0000100 epoch_Time:24061.0min: [2024-01-06 07:23:34,866][model8_pretrain.py][INFO] Epoch:[0/2](808200/4588595) loss:2.720 lr:0.0000100 epoch_Time:24061.0min: [2024-01-06 07:23:34,866][model8_pretrain.py][INFO] Epoch:[0/2](808200/4588595) loss:2.946 lr:0.0000100 epoch_Time:24061.0min: [2024-01-06 07:23:34,866][model8_pretrain.py][INFO] Epoch:[0/2](808200/4588595) loss:2.973 lr:0.0000100 epoch_Time:24061.0min: [2024-01-06 07:23:34,866][model8_pretrain.py][INFO] 
Epoch:[0/2](808200/4588595) loss:2.722 lr:0.0000100 epoch_Time:24061.0min: [2024-01-06 07:23:34,866][model8_pretrain.py][INFO] Epoch:[0/2](808200/4588595) loss:2.474 lr:0.0000100 epoch_Time:24061.0min: [2024-01-06 07:23:34,866][model8_pretrain.py][INFO] Epoch:[0/2](808200/4588595) loss:2.793 lr:0.0000100 epoch_Time:24061.0min: [2024-01-06 07:23:34,866][model8_pretrain.py][INFO] Epoch:[0/2](808200/4588595) loss:2.674 lr:0.0000100 epoch_Time:24061.0min: [2024-01-06 07:23:34,867][model8_pretrain.py][INFO] Epoch:[0/2](808200/4588595) loss:2.800 lr:0.0000100 epoch_Time:24061.0min: [2024-01-06 07:24:11,770][model8_pretrain.py][INFO] Epoch:[0/2](808300/4588595) loss:2.920 lr:0.0000100 epoch_Time:24059.0min: [2024-01-06 07:24:11,770][model8_pretrain.py][INFO] Epoch:[0/2](808300/4588595) loss:2.130 lr:0.0000100 epoch_Time:24059.0min: [2024-01-06 07:24:11,770][model8_pretrain.py][INFO] Epoch:[0/2](808300/4588595) loss:3.090 lr:0.0000100 epoch_Time:24059.0min: [2024-01-06 07:24:11,770][model8_pretrain.py][INFO] Epoch:[0/2](808300/4588595) loss:3.013 lr:0.0000100 epoch_Time:24059.0min: [2024-01-06 07:24:11,770][model8_pretrain.py][INFO] Epoch:[0/2](808300/4588595) loss:2.928 lr:0.0000100 epoch_Time:24059.0min: [2024-01-06 07:24:11,770][model8_pretrain.py][INFO] Epoch:[0/2](808300/4588595) loss:2.387 lr:0.0000100 epoch_Time:24059.0min: [2024-01-06 07:24:11,770][model8_pretrain.py][INFO] Epoch:[0/2](808300/4588595) loss:3.157 lr:0.0000100 epoch_Time:24059.0min: [2024-01-06 07:24:11,770][model8_pretrain.py][INFO] Epoch:[0/2](808300/4588595) loss:2.529 lr:0.0000100 epoch_Time:24059.0min: [2024-01-06 07:24:59,214][model8_pretrain.py][INFO] Epoch:[0/2](808400/4588595) loss:2.766 lr:0.0000100 epoch_Time:24059.0min: [2024-01-06 07:24:59,214][model8_pretrain.py][INFO] Epoch:[0/2](808400/4588595) loss:2.604 lr:0.0000100 epoch_Time:24059.0min: [2024-01-06 07:24:59,214][model8_pretrain.py][INFO] Epoch:[0/2](808400/4588595) loss:2.626 lr:0.0000100 epoch_Time:24059.0min: [2024-01-06 
07:24:59,214][model8_pretrain.py][INFO] Epoch:[0/2](808400/4588595) loss:3.069 lr:0.0000100 epoch_Time:24059.0min: [2024-01-06 07:24:59,214][model8_pretrain.py][INFO] Epoch:[0/2](808400/4588595) loss:2.545 lr:0.0000100 epoch_Time:24059.0min: [2024-01-06 07:24:59,214][model8_pretrain.py][INFO] Epoch:[0/2](808400/4588595) loss:2.700 lr:0.0000100 epoch_Time:24059.0min: [2024-01-06 07:24:59,214][model8_pretrain.py][INFO] Epoch:[0/2](808400/4588595) loss:2.892 lr:0.0000100 epoch_Time:24059.0min: [2024-01-06 07:24:59,214][model8_pretrain.py][INFO] Epoch:[0/2](808400/4588595) loss:2.649 lr:0.0000100 epoch_Time:24059.0min: [2024-01-06 07:25:36,136][model8_pretrain.py][INFO] Epoch:[0/2](808500/4588595) loss:3.324 lr:0.0000100 epoch_Time:24059.0min: [2024-01-06 07:25:36,136][model8_pretrain.py][INFO] Epoch:[0/2](808500/4588595) loss:2.627 lr:0.0000100 epoch_Time:24059.0min: [2024-01-06 07:25:36,136][model8_pretrain.py][INFO] Epoch:[0/2](808500/4588595) loss:2.847 lr:0.0000100 epoch_Time:24059.0min: [2024-01-06 07:25:36,136][model8_pretrain.py][INFO] Epoch:[0/2](808500/4588595) loss:2.986 lr:0.0000100 epoch_Time:24059.0min: [2024-01-06 07:25:36,136][model8_pretrain.py][INFO] Epoch:[0/2](808500/4588595) loss:3.244 lr:0.0000100 epoch_Time:24059.0min: [2024-01-06 07:25:36,136][model8_pretrain.py][INFO] Epoch:[0/2](808500/4588595) loss:2.799 lr:0.0000100 epoch_Time:24059.0min: [2024-01-06 07:25:36,136][model8_pretrain.py][INFO] Epoch:[0/2](808500/4588595) loss:2.620 lr:0.0000100 epoch_Time:24059.0min: [2024-01-06 07:25:36,136][model8_pretrain.py][INFO] Epoch:[0/2](808500/4588595) loss:2.595 lr:0.0000100 epoch_Time:24059.0min: [2024-01-06 07:26:13,075][model8_pretrain.py][INFO] Epoch:[0/2](808600/4588595) loss:3.046 lr:0.0000100 epoch_Time:24058.0min: [2024-01-06 07:26:13,075][model8_pretrain.py][INFO] Epoch:[0/2](808600/4588595) loss:2.658 lr:0.0000100 epoch_Time:24058.0min: [2024-01-06 07:26:13,075][model8_pretrain.py][INFO] Epoch:[0/2](808600/4588595) loss:2.440 lr:0.0000100 
epoch_Time:24058.0min: [2024-01-06 07:26:13,075][model8_pretrain.py][INFO] Epoch:[0/2](808600/4588595) loss:3.224 lr:0.0000100 epoch_Time:24058.0min: [2024-01-06 07:26:13,075][model8_pretrain.py][INFO] Epoch:[0/2](808600/4588595) loss:2.716 lr:0.0000100 epoch_Time:24058.0min: [2024-01-06 07:26:13,075][model8_pretrain.py][INFO] Epoch:[0/2](808600/4588595) loss:2.878 lr:0.0000100 epoch_Time:24058.0min: [2024-01-06 07:26:13,075][model8_pretrain.py][INFO] Epoch:[0/2](808600/4588595) loss:2.504 lr:0.0000100 epoch_Time:24058.0min: [2024-01-06 07:26:13,075][model8_pretrain.py][INFO] Epoch:[0/2](808600/4588595) loss:2.648 lr:0.0000100 epoch_Time:24058.0min: [2024-01-06 07:26:50,008][model8_pretrain.py][INFO] Epoch:[0/2](808700/4588595) loss:2.650 lr:0.0000100 epoch_Time:24057.0min: [2024-01-06 07:26:50,008][model8_pretrain.py][INFO] Epoch:[0/2](808700/4588595) loss:2.724 lr:0.0000100 epoch_Time:24057.0min: [2024-01-06 07:26:50,008][model8_pretrain.py][INFO] Epoch:[0/2](808700/4588595) loss:2.738 lr:0.0000100 epoch_Time:24057.0min: [2024-01-06 07:26:50,008][model8_pretrain.py][INFO] Epoch:[0/2](808700/4588595) loss:3.158 lr:0.0000100 epoch_Time:24057.0min: [2024-01-06 07:26:50,008][model8_pretrain.py][INFO] Epoch:[0/2](808700/4588595) loss:2.716 lr:0.0000100 epoch_Time:24057.0min: [2024-01-06 07:26:50,008][model8_pretrain.py][INFO] Epoch:[0/2](808700/4588595) loss:2.748 lr:0.0000100 epoch_Time:24057.0min: [2024-01-06 07:26:50,008][model8_pretrain.py][INFO] Epoch:[0/2](808700/4588595) loss:2.849 lr:0.0000100 epoch_Time:24057.0min: [2024-01-06 07:26:50,008][model8_pretrain.py][INFO] Epoch:[0/2](808700/4588595) loss:2.188 lr:0.0000100 epoch_Time:24057.0min: [2024-01-06 07:27:26,955][model8_pretrain.py][INFO] Epoch:[0/2](808800/4588595) loss:2.595 lr:0.0000100 epoch_Time:24057.0min: [2024-01-06 07:27:26,955][model8_pretrain.py][INFO] Epoch:[0/2](808800/4588595) loss:2.885 lr:0.0000100 epoch_Time:24057.0min: [2024-01-06 07:27:26,955][model8_pretrain.py][INFO] 
Epoch:[0/2](808800/4588595) loss:2.662 lr:0.0000100 epoch_Time:24057.0min: [2024-01-06 07:27:26,955][model8_pretrain.py][INFO] Epoch:[0/2](808800/4588595) loss:3.195 lr:0.0000100 epoch_Time:24057.0min: [2024-01-06 07:27:26,955][model8_pretrain.py][INFO] Epoch:[0/2](808800/4588595) loss:2.994 lr:0.0000100 epoch_Time:24057.0min: [2024-01-06 07:27:26,955][model8_pretrain.py][INFO] Epoch:[0/2](808800/4588595) loss:2.626 lr:0.0000100 epoch_Time:24057.0min: [2024-01-06 07:27:26,955][model8_pretrain.py][INFO] Epoch:[0/2](808800/4588595) loss:2.668 lr:0.0000100 epoch_Time:24057.0min: [2024-01-06 07:27:26,955][model8_pretrain.py][INFO] Epoch:[0/2](808800/4588595) loss:3.045 lr:0.0000100 epoch_Time:24057.0min: [2024-01-06 07:28:03,887][model8_pretrain.py][INFO] Epoch:[0/2](808900/4588595) loss:3.300 lr:0.0000100 epoch_Time:24056.0min: [2024-01-06 07:28:03,887][model8_pretrain.py][INFO] Epoch:[0/2](808900/4588595) loss:2.367 lr:0.0000100 epoch_Time:24056.0min: [2024-01-06 07:28:03,887][model8_pretrain.py][INFO] Epoch:[0/2](808900/4588595) loss:3.140 lr:0.0000100 epoch_Time:24056.0min: [2024-01-06 07:28:03,887][model8_pretrain.py][INFO] Epoch:[0/2](808900/4588595) loss:2.328 lr:0.0000100 epoch_Time:24056.0min: [2024-01-06 07:28:03,887][model8_pretrain.py][INFO] Epoch:[0/2](808900/4588595) loss:2.606 lr:0.0000100 epoch_Time:24056.0min: [2024-01-06 07:28:03,887][model8_pretrain.py][INFO] Epoch:[0/2](808900/4588595) loss:3.178 lr:0.0000100 epoch_Time:24056.0min: [2024-01-06 07:28:03,887][model8_pretrain.py][INFO] Epoch:[0/2](808900/4588595) loss:2.940 lr:0.0000100 epoch_Time:24056.0min: [2024-01-06 07:28:03,887][model8_pretrain.py][INFO] Epoch:[0/2](808900/4588595) loss:2.859 lr:0.0000100 epoch_Time:24056.0min: [2024-01-06 07:28:40,820][model8_pretrain.py][INFO] Epoch:[0/2](809000/4588595) loss:2.815 lr:0.0000100 epoch_Time:24056.0min: [2024-01-06 07:28:40,820][model8_pretrain.py][INFO] Epoch:[0/2](809000/4588595) loss:2.576 lr:0.0000100 epoch_Time:24056.0min: [2024-01-06 
07:28:40,820][model8_pretrain.py][INFO] Epoch:[0/2](809000/4588595) loss:2.552 lr:0.0000100 epoch_Time:24056.0min: [2024-01-06 07:28:40,820][model8_pretrain.py][INFO] Epoch:[0/2](809000/4588595) loss:1.366 lr:0.0000100 epoch_Time:24056.0min: [2024-01-06 07:28:40,820][model8_pretrain.py][INFO] Epoch:[0/2](809000/4588595) loss:2.504 lr:0.0000100 epoch_Time:24056.0min: [2024-01-06 07:28:40,820][model8_pretrain.py][INFO] Epoch:[0/2](809000/4588595) loss:1.849 lr:0.0000100 epoch_Time:24056.0min: [2024-01-06 07:28:40,820][model8_pretrain.py][INFO] Epoch:[0/2](809000/4588595) loss:2.479 lr:0.0000100 epoch_Time:24056.0min: [2024-01-06 07:28:40,820][model8_pretrain.py][INFO] Epoch:[0/2](809000/4588595) loss:3.373 lr:0.0000100 epoch_Time:24056.0min: [2024-01-06 07:29:17,751][model8_pretrain.py][INFO] Epoch:[0/2](809100/4588595) loss:2.633 lr:0.0000100 epoch_Time:24054.0min: [2024-01-06 07:29:17,751][model8_pretrain.py][INFO] Epoch:[0/2](809100/4588595) loss:3.111 lr:0.0000100 epoch_Time:24054.0min: [2024-01-06 07:29:17,751][model8_pretrain.py][INFO] Epoch:[0/2](809100/4588595) loss:2.534 lr:0.0000100 epoch_Time:24054.0min: [2024-01-06 07:29:17,751][model8_pretrain.py][INFO] Epoch:[0/2](809100/4588595) loss:2.944 lr:0.0000100 epoch_Time:24054.0min: [2024-01-06 07:29:17,751][model8_pretrain.py][INFO] Epoch:[0/2](809100/4588595) loss:2.948 lr:0.0000100 epoch_Time:24054.0min: [2024-01-06 07:29:17,751][model8_pretrain.py][INFO] Epoch:[0/2](809100/4588595) loss:2.696 lr:0.0000100 epoch_Time:24054.0min: [2024-01-06 07:29:17,751][model8_pretrain.py][INFO] Epoch:[0/2](809100/4588595) loss:3.116 lr:0.0000100 epoch_Time:24054.0min: [2024-01-06 07:29:17,751][model8_pretrain.py][INFO] Epoch:[0/2](809100/4588595) loss:3.045 lr:0.0000100 epoch_Time:24054.0min: [2024-01-06 07:30:05,254][model8_pretrain.py][INFO] Epoch:[0/2](809200/4588595) loss:2.952 lr:0.0000100 epoch_Time:24054.0min: [2024-01-06 07:30:05,254][model8_pretrain.py][INFO] Epoch:[0/2](809200/4588595) loss:2.659 lr:0.0000100 
epoch_Time:24054.0min: [2024-01-06 07:30:05,254][model8_pretrain.py][INFO] Epoch:[0/2](809200/4588595) loss:3.129 lr:0.0000100 epoch_Time:24054.0min: [2024-01-06 07:30:05,254][model8_pretrain.py][INFO] Epoch:[0/2](809200/4588595) loss:2.605 lr:0.0000100 epoch_Time:24054.0min: [2024-01-06 07:30:05,254][model8_pretrain.py][INFO] Epoch:[0/2](809200/4588595) loss:2.408 lr:0.0000100 epoch_Time:24054.0min: [2024-01-06 07:30:05,254][model8_pretrain.py][INFO] Epoch:[0/2](809200/4588595) loss:2.952 lr:0.0000100 epoch_Time:24054.0min: [2024-01-06 07:30:05,254][model8_pretrain.py][INFO] Epoch:[0/2](809200/4588595) loss:2.577 lr:0.0000100 epoch_Time:24054.0min: [2024-01-06 07:30:05,254][model8_pretrain.py][INFO] Epoch:[0/2](809200/4588595) loss:2.831 lr:0.0000100 epoch_Time:24054.0min: [2024-01-06 07:30:42,181][model8_pretrain.py][INFO] Epoch:[0/2](809300/4588595) loss:2.833 lr:0.0000100 epoch_Time:24054.0min: [2024-01-06 07:30:42,181][model8_pretrain.py][INFO] Epoch:[0/2](809300/4588595) loss:1.833 lr:0.0000100 epoch_Time:24054.0min: [2024-01-06 07:30:42,181][model8_pretrain.py][INFO] Epoch:[0/2](809300/4588595) loss:3.549 lr:0.0000100 epoch_Time:24054.0min: [2024-01-06 07:30:42,181][model8_pretrain.py][INFO] Epoch:[0/2](809300/4588595) loss:2.613 lr:0.0000100 epoch_Time:24054.0min: [2024-01-06 07:30:42,181][model8_pretrain.py][INFO] Epoch:[0/2](809300/4588595) loss:3.138 lr:0.0000100 epoch_Time:24054.0min: [2024-01-06 07:30:42,182][model8_pretrain.py][INFO] Epoch:[0/2](809300/4588595) loss:2.325 lr:0.0000100 epoch_Time:24054.0min: [2024-01-06 07:30:42,182][model8_pretrain.py][INFO] Epoch:[0/2](809300/4588595) loss:3.016 lr:0.0000100 epoch_Time:24054.0min: [2024-01-06 07:30:42,182][model8_pretrain.py][INFO] Epoch:[0/2](809300/4588595) loss:2.908 lr:0.0000100 epoch_Time:24054.0min: [2024-01-06 07:31:19,126][model8_pretrain.py][INFO] Epoch:[0/2](809400/4588595) loss:3.060 lr:0.0000100 epoch_Time:24053.0min: [2024-01-06 07:31:19,126][model8_pretrain.py][INFO] 
Epoch:[0/2](809400/4588595) loss:2.926 lr:0.0000100 epoch_Time:24053.0min: [2024-01-06 07:31:19,126][model8_pretrain.py][INFO] Epoch:[0/2](809400/4588595) loss:2.819 lr:0.0000100 epoch_Time:24053.0min: [2024-01-06 07:31:19,126][model8_pretrain.py][INFO] Epoch:[0/2](809400/4588595) loss:2.548 lr:0.0000100 epoch_Time:24053.0min: [2024-01-06 07:31:19,126][model8_pretrain.py][INFO] Epoch:[0/2](809400/4588595) loss:2.949 lr:0.0000100 epoch_Time:24053.0min: [2024-01-06 07:31:19,126][model8_pretrain.py][INFO] Epoch:[0/2](809400/4588595) loss:2.917 lr:0.0000100 epoch_Time:24053.0min: [2024-01-06 07:31:19,127][model8_pretrain.py][INFO] Epoch:[0/2](809400/4588595) loss:3.034 lr:0.0000100 epoch_Time:24053.0min: [2024-01-06 07:31:19,127][model8_pretrain.py][INFO] Epoch:[0/2](809400/4588595) loss:2.543 lr:0.0000100 epoch_Time:24053.0min: [2024-01-06 07:31:56,055][model8_pretrain.py][INFO] Epoch:[0/2](809500/4588595) loss:2.411 lr:0.0000100 epoch_Time:24052.0min: [2024-01-06 07:31:56,055][model8_pretrain.py][INFO] Epoch:[0/2](809500/4588595) loss:2.666 lr:0.0000100 epoch_Time:24052.0min: [2024-01-06 07:31:56,055][model8_pretrain.py][INFO] Epoch:[0/2](809500/4588595) loss:2.761 lr:0.0000100 epoch_Time:24052.0min: [2024-01-06 07:31:56,055][model8_pretrain.py][INFO] Epoch:[0/2](809500/4588595) loss:2.633 lr:0.0000100 epoch_Time:24052.0min: [2024-01-06 07:31:56,055][model8_pretrain.py][INFO] Epoch:[0/2](809500/4588595) loss:3.064 lr:0.0000100 epoch_Time:24052.0min: [2024-01-06 07:31:56,055][model8_pretrain.py][INFO] Epoch:[0/2](809500/4588595) loss:2.662 lr:0.0000100 epoch_Time:24052.0min: [2024-01-06 07:31:56,055][model8_pretrain.py][INFO] Epoch:[0/2](809500/4588595) loss:2.993 lr:0.0000100 epoch_Time:24052.0min: [2024-01-06 07:31:56,055][model8_pretrain.py][INFO] Epoch:[0/2](809500/4588595) loss:2.344 lr:0.0000100 epoch_Time:24052.0min: [2024-01-06 07:32:32,990][model8_pretrain.py][INFO] Epoch:[0/2](809600/4588595) loss:2.522 lr:0.0000100 epoch_Time:24052.0min: [2024-01-06 
07:32:32,990][model8_pretrain.py][INFO] Epoch:[0/2](809600/4588595) loss:2.755 lr:0.0000100 epoch_Time:24052.0min: [2024-01-06 07:32:32,989][model8_pretrain.py][INFO] Epoch:[0/2](809600/4588595) loss:2.987 lr:0.0000100 epoch_Time:24052.0min: [2024-01-06 07:32:32,990][model8_pretrain.py][INFO] Epoch:[0/2](809600/4588595) loss:3.010 lr:0.0000100 epoch_Time:24052.0min: [2024-01-06 07:32:32,990][model8_pretrain.py][INFO] Epoch:[0/2](809600/4588595) loss:2.501 lr:0.0000100 epoch_Time:24052.0min: [2024-01-06 07:32:32,990][model8_pretrain.py][INFO] Epoch:[0/2](809600/4588595) loss:2.935 lr:0.0000100 epoch_Time:24052.0min: [2024-01-06 07:32:32,990][model8_pretrain.py][INFO] Epoch:[0/2](809600/4588595) loss:2.739 lr:0.0000100 epoch_Time:24052.0min: [2024-01-06 07:32:32,990][model8_pretrain.py][INFO] Epoch:[0/2](809600/4588595) loss:3.228 lr:0.0000100 epoch_Time:24052.0min: [2024-01-06 07:33:09,939][model8_pretrain.py][INFO] Epoch:[0/2](809700/4588595) loss:2.739 lr:0.0000100 epoch_Time:24051.0min: [2024-01-06 07:33:09,939][model8_pretrain.py][INFO] Epoch:[0/2](809700/4588595) loss:3.070 lr:0.0000100 epoch_Time:24051.0min: [2024-01-06 07:33:09,939][model8_pretrain.py][INFO] Epoch:[0/2](809700/4588595) loss:3.116 lr:0.0000100 epoch_Time:24051.0min: [2024-01-06 07:33:09,939][model8_pretrain.py][INFO] Epoch:[0/2](809700/4588595) loss:2.859 lr:0.0000100 epoch_Time:24051.0min: [2024-01-06 07:33:09,939][model8_pretrain.py][INFO] Epoch:[0/2](809700/4588595) loss:2.338 lr:0.0000100 epoch_Time:24051.0min: [2024-01-06 07:33:09,939][model8_pretrain.py][INFO] Epoch:[0/2](809700/4588595) loss:3.149 lr:0.0000100 epoch_Time:24051.0min: [2024-01-06 07:33:09,939][model8_pretrain.py][INFO] Epoch:[0/2](809700/4588595) loss:2.967 lr:0.0000100 epoch_Time:24051.0min: [2024-01-06 07:33:09,939][model8_pretrain.py][INFO] Epoch:[0/2](809700/4588595) loss:2.752 lr:0.0000100 epoch_Time:24051.0min: [2024-01-06 07:33:46,865][model8_pretrain.py][INFO] Epoch:[0/2](809800/4588595) loss:3.269 lr:0.0000100 
epoch_Time:24051.0min: [2024-01-06 07:33:46,865][model8_pretrain.py][INFO] Epoch:[0/2](809800/4588595) loss:3.039 lr:0.0000100 epoch_Time:24051.0min: [2024-01-06 07:33:46,865][model8_pretrain.py][INFO] Epoch:[0/2](809800/4588595) loss:2.676 lr:0.0000100 epoch_Time:24051.0min: [2024-01-06 07:33:46,865][model8_pretrain.py][INFO] Epoch:[0/2](809800/4588595) loss:3.073 lr:0.0000100 epoch_Time:24051.0min: [2024-01-06 07:33:46,865][model8_pretrain.py][INFO] Epoch:[0/2](809800/4588595) loss:3.034 lr:0.0000100 epoch_Time:24051.0min: [2024-01-06 07:33:46,865][model8_pretrain.py][INFO] Epoch:[0/2](809800/4588595) loss:2.730 lr:0.0000100 epoch_Time:24051.0min: [2024-01-06 07:33:46,865][model8_pretrain.py][INFO] Epoch:[0/2](809800/4588595) loss:3.303 lr:0.0000100 epoch_Time:24051.0min: [2024-01-06 07:33:46,865][model8_pretrain.py][INFO] Epoch:[0/2](809800/4588595) loss:3.365 lr:0.0000100 epoch_Time:24051.0min: [2024-01-06 07:34:23,797][model8_pretrain.py][INFO] Epoch:[0/2](809900/4588595) loss:2.638 lr:0.0000100 epoch_Time:24050.0min: [2024-01-06 07:34:23,797][model8_pretrain.py][INFO] Epoch:[0/2](809900/4588595) loss:2.619 lr:0.0000100 epoch_Time:24050.0min: [2024-01-06 07:34:23,797][model8_pretrain.py][INFO] Epoch:[0/2](809900/4588595) loss:2.955 lr:0.0000100 epoch_Time:24050.0min: [2024-01-06 07:34:23,797][model8_pretrain.py][INFO] Epoch:[0/2](809900/4588595) loss:2.730 lr:0.0000100 epoch_Time:24050.0min: [2024-01-06 07:34:23,797][model8_pretrain.py][INFO] Epoch:[0/2](809900/4588595) loss:3.228 lr:0.0000100 epoch_Time:24050.0min: [2024-01-06 07:34:23,797][model8_pretrain.py][INFO] Epoch:[0/2](809900/4588595) loss:2.591 lr:0.0000100 epoch_Time:24050.0min: [2024-01-06 07:34:23,797][model8_pretrain.py][INFO] Epoch:[0/2](809900/4588595) loss:2.740 lr:0.0000100 epoch_Time:24050.0min: [2024-01-06 07:34:23,797][model8_pretrain.py][INFO] Epoch:[0/2](809900/4588595) loss:2.679 lr:0.0000100 epoch_Time:24050.0min: [2024-01-06 07:35:11,085][model8_pretrain.py][INFO] 
Epoch:[0/2](810000/4588595) loss:2.877 lr:0.0000100 epoch_Time:24049.0min: [2024-01-06 07:35:11,086][model8_pretrain.py][INFO] Epoch:[0/2](810000/4588595) loss:2.450 lr:0.0000100 epoch_Time:24049.0min: [2024-01-06 07:35:11,086][model8_pretrain.py][INFO] Epoch:[0/2](810000/4588595) loss:2.313 lr:0.0000100 epoch_Time:24049.0min: [2024-01-06 07:35:11,086][model8_pretrain.py][INFO] Epoch:[0/2](810000/4588595) loss:3.012 lr:0.0000100 epoch_Time:24049.0min: [2024-01-06 07:35:11,086][model8_pretrain.py][INFO] Epoch:[0/2](810000/4588595) loss:2.794 lr:0.0000100 epoch_Time:24049.0min: [2024-01-06 07:35:11,086][model8_pretrain.py][INFO] Epoch:[0/2](810000/4588595) loss:1.948 lr:0.0000100 epoch_Time:24049.0min: [2024-01-06 07:35:11,086][model8_pretrain.py][INFO] Epoch:[0/2](810000/4588595) loss:2.724 lr:0.0000100 epoch_Time:24049.0min: [2024-01-06 07:35:11,086][model8_pretrain.py][INFO] Epoch:[0/2](810000/4588595) loss:3.063 lr:0.0000100 epoch_Time:24049.0min: [2024-01-06 07:35:48,012][model8_pretrain.py][INFO] Epoch:[0/2](810100/4588595) loss:2.571 lr:0.0000100 epoch_Time:24048.0min: [2024-01-06 07:35:48,012][model8_pretrain.py][INFO] Epoch:[0/2](810100/4588595) loss:2.703 lr:0.0000100 epoch_Time:24048.0min: [2024-01-06 07:35:48,012][model8_pretrain.py][INFO] Epoch:[0/2](810100/4588595) loss:2.445 lr:0.0000100 epoch_Time:24048.0min: [2024-01-06 07:35:48,012][model8_pretrain.py][INFO] Epoch:[0/2](810100/4588595) loss:2.567 lr:0.0000100 epoch_Time:24048.0min: [2024-01-06 07:35:48,012][model8_pretrain.py][INFO] Epoch:[0/2](810100/4588595) loss:2.687 lr:0.0000100 epoch_Time:24048.0min: [2024-01-06 07:35:48,012][model8_pretrain.py][INFO] Epoch:[0/2](810100/4588595) loss:2.873 lr:0.0000100 epoch_Time:24048.0min: [2024-01-06 07:35:48,012][model8_pretrain.py][INFO] Epoch:[0/2](810100/4588595) loss:2.754 lr:0.0000100 epoch_Time:24048.0min: [2024-01-06 07:35:48,012][model8_pretrain.py][INFO] Epoch:[0/2](810100/4588595) loss:2.868 lr:0.0000100 epoch_Time:24048.0min: [2024-01-06 
07:36:24,954][model8_pretrain.py][INFO] Epoch:[0/2](810200/4588595) loss:2.524 lr:0.0000100 epoch_Time:24048.0min: [2024-01-06 07:36:24,954][model8_pretrain.py][INFO] Epoch:[0/2](810200/4588595) loss:3.019 lr:0.0000100 epoch_Time:24048.0min: [2024-01-06 07:36:24,954][model8_pretrain.py][INFO] Epoch:[0/2](810200/4588595) loss:2.278 lr:0.0000100 epoch_Time:24048.0min: [2024-01-06 07:36:24,954][model8_pretrain.py][INFO] Epoch:[0/2](810200/4588595) loss:3.108 lr:0.0000100 epoch_Time:24048.0min: [2024-01-06 07:36:24,954][model8_pretrain.py][INFO] Epoch:[0/2](810200/4588595) loss:2.730 lr:0.0000100 epoch_Time:24048.0min: [2024-01-06 07:36:24,954][model8_pretrain.py][INFO] Epoch:[0/2](810200/4588595) loss:2.468 lr:0.0000100 epoch_Time:24048.0min: [2024-01-06 07:36:24,954][model8_pretrain.py][INFO] Epoch:[0/2](810200/4588595) loss:3.353 lr:0.0000100 epoch_Time:24048.0min: [2024-01-06 07:36:24,955][model8_pretrain.py][INFO] Epoch:[0/2](810200/4588595) loss:2.612 lr:0.0000100 epoch_Time:24048.0min: [2024-01-06 07:37:01,911][model8_pretrain.py][INFO] Epoch:[0/2](810300/4588595) loss:2.866 lr:0.0000100 epoch_Time:24047.0min: [2024-01-06 07:37:01,911][model8_pretrain.py][INFO] Epoch:[0/2](810300/4588595) loss:2.611 lr:0.0000100 epoch_Time:24047.0min: [2024-01-06 07:37:01,911][model8_pretrain.py][INFO] Epoch:[0/2](810300/4588595) loss:2.487 lr:0.0000100 epoch_Time:24047.0min: [2024-01-06 07:37:01,911][model8_pretrain.py][INFO] Epoch:[0/2](810300/4588595) loss:2.540 lr:0.0000100 epoch_Time:24047.0min: [2024-01-06 07:37:01,911][model8_pretrain.py][INFO] Epoch:[0/2](810300/4588595) loss:2.539 lr:0.0000100 epoch_Time:24047.0min: [2024-01-06 07:37:01,911][model8_pretrain.py][INFO] Epoch:[0/2](810300/4588595) loss:2.632 lr:0.0000100 epoch_Time:24047.0min: [2024-01-06 07:37:01,912][model8_pretrain.py][INFO] Epoch:[0/2](810300/4588595) loss:3.419 lr:0.0000100 epoch_Time:24047.0min: [2024-01-06 07:37:01,912][model8_pretrain.py][INFO] Epoch:[0/2](810300/4588595) loss:2.837 lr:0.0000100 
epoch_Time:24047.0min: [2024-01-06 07:37:38,869][model8_pretrain.py][INFO] Epoch:[0/2](810400/4588595) loss:2.744 lr:0.0000100 epoch_Time:24047.0min: [2024-01-06 07:37:38,869][model8_pretrain.py][INFO] Epoch:[0/2](810400/4588595) loss:2.977 lr:0.0000100 epoch_Time:24047.0min: [2024-01-06 07:37:38,870][model8_pretrain.py][INFO] Epoch:[0/2](810400/4588595) loss:2.889 lr:0.0000100 epoch_Time:24047.0min: [2024-01-06 07:37:38,870][model8_pretrain.py][INFO] Epoch:[0/2](810400/4588595) loss:2.718 lr:0.0000100 epoch_Time:24047.0min: [2024-01-06 07:37:38,870][model8_pretrain.py][INFO] Epoch:[0/2](810400/4588595) loss:2.935 lr:0.0000100 epoch_Time:24047.0min: [2024-01-06 07:37:38,870][model8_pretrain.py][INFO] Epoch:[0/2](810400/4588595) loss:3.011 lr:0.0000100 epoch_Time:24047.0min: [2024-01-06 07:37:38,870][model8_pretrain.py][INFO] Epoch:[0/2](810400/4588595) loss:2.884 lr:0.0000100 epoch_Time:24047.0min: [2024-01-06 07:37:38,870][model8_pretrain.py][INFO] Epoch:[0/2](810400/4588595) loss:2.784 lr:0.0000100 epoch_Time:24047.0min: [2024-01-06 07:38:15,816][model8_pretrain.py][INFO] Epoch:[0/2](810500/4588595) loss:2.903 lr:0.0000100 epoch_Time:24046.0min: [2024-01-06 07:38:15,816][model8_pretrain.py][INFO] Epoch:[0/2](810500/4588595) loss:2.911 lr:0.0000100 epoch_Time:24046.0min: [2024-01-06 07:38:15,816][model8_pretrain.py][INFO] Epoch:[0/2](810500/4588595) loss:2.895 lr:0.0000100 epoch_Time:24046.0min: [2024-01-06 07:38:15,816][model8_pretrain.py][INFO] Epoch:[0/2](810500/4588595) loss:2.741 lr:0.0000100 epoch_Time:24046.0min: [2024-01-06 07:38:15,816][model8_pretrain.py][INFO] Epoch:[0/2](810500/4588595) loss:2.595 lr:0.0000100 epoch_Time:24046.0min: [2024-01-06 07:38:15,816][model8_pretrain.py][INFO] Epoch:[0/2](810500/4588595) loss:2.318 lr:0.0000100 epoch_Time:24046.0min: [2024-01-06 07:38:15,816][model8_pretrain.py][INFO] Epoch:[0/2](810500/4588595) loss:2.402 lr:0.0000100 epoch_Time:24046.0min: [2024-01-06 07:38:15,817][model8_pretrain.py][INFO] 
Epoch:[0/2](810500/4588595) loss:3.391 lr:0.0000100 epoch_Time:24046.0min: [2024-01-06 07:38:52,757][model8_pretrain.py][INFO] Epoch:[0/2](810600/4588595) loss:3.138 lr:0.0000100 epoch_Time:24045.0min: [2024-01-06 07:38:52,757][model8_pretrain.py][INFO] Epoch:[0/2](810600/4588595) loss:3.193 lr:0.0000100 epoch_Time:24045.0min: [2024-01-06 07:38:52,757][model8_pretrain.py][INFO] Epoch:[0/2](810600/4588595) loss:3.124 lr:0.0000100 epoch_Time:24045.0min: [2024-01-06 07:38:52,757][model8_pretrain.py][INFO] Epoch:[0/2](810600/4588595) loss:3.059 lr:0.0000100 epoch_Time:24045.0min: [2024-01-06 07:38:52,757][model8_pretrain.py][INFO] Epoch:[0/2](810600/4588595) loss:2.716 lr:0.0000100 epoch_Time:24045.0min: [2024-01-06 07:38:52,757][model8_pretrain.py][INFO] Epoch:[0/2](810600/4588595) loss:2.946 lr:0.0000100 epoch_Time:24045.0min: [2024-01-06 07:38:52,758][model8_pretrain.py][INFO] Epoch:[0/2](810600/4588595) loss:2.798 lr:0.0000100 epoch_Time:24045.0min: [2024-01-06 07:38:52,758][model8_pretrain.py][INFO] Epoch:[0/2](810600/4588595) loss:3.172 lr:0.0000100 epoch_Time:24045.0min: [2024-01-06 07:39:29,701][model8_pretrain.py][INFO] Epoch:[0/2](810700/4588595) loss:2.609 lr:0.0000100 epoch_Time:24045.0min: [2024-01-06 07:39:29,701][model8_pretrain.py][INFO] Epoch:[0/2](810700/4588595) loss:3.133 lr:0.0000100 epoch_Time:24045.0min: [2024-01-06 07:39:29,701][model8_pretrain.py][INFO] Epoch:[0/2](810700/4588595) loss:3.368 lr:0.0000100 epoch_Time:24045.0min: [2024-01-06 07:39:29,701][model8_pretrain.py][INFO] Epoch:[0/2](810700/4588595) loss:3.394 lr:0.0000100 epoch_Time:24045.0min: [2024-01-06 07:39:29,701][model8_pretrain.py][INFO] Epoch:[0/2](810700/4588595) loss:2.988 lr:0.0000100 epoch_Time:24045.0min: [2024-01-06 07:39:29,701][model8_pretrain.py][INFO] Epoch:[0/2](810700/4588595) loss:2.819 lr:0.0000100 epoch_Time:24045.0min: [2024-01-06 07:39:29,701][model8_pretrain.py][INFO] Epoch:[0/2](810700/4588595) loss:3.231 lr:0.0000100 epoch_Time:24045.0min: [2024-01-06 
07:39:29,701][model8_pretrain.py][INFO] Epoch:[0/2](810700/4588595) loss:3.028 lr:0.0000100 epoch_Time:24045.0min: [2024-01-06 07:40:16,780][model8_pretrain.py][INFO] Epoch:[0/2](810800/4588595) loss:3.149 lr:0.0000100 epoch_Time:24044.0min: [2024-01-06 07:40:16,780][model8_pretrain.py][INFO] Epoch:[0/2](810800/4588595) loss:2.809 lr:0.0000100 epoch_Time:24044.0min: [2024-01-06 07:40:16,780][model8_pretrain.py][INFO] Epoch:[0/2](810800/4588595) loss:2.605 lr:0.0000100 epoch_Time:24044.0min: [2024-01-06 07:40:16,780][model8_pretrain.py][INFO] Epoch:[0/2](810800/4588595) loss:3.072 lr:0.0000100 epoch_Time:24044.0min: [2024-01-06 07:40:16,781][model8_pretrain.py][INFO] Epoch:[0/2](810800/4588595) loss:2.613 lr:0.0000100 epoch_Time:24044.0min: [2024-01-06 07:40:16,781][model8_pretrain.py][INFO] Epoch:[0/2](810800/4588595) loss:2.713 lr:0.0000100 epoch_Time:24044.0min: [2024-01-06 07:40:16,781][model8_pretrain.py][INFO] Epoch:[0/2](810800/4588595) loss:3.261 lr:0.0000100 epoch_Time:24044.0min: [2024-01-06 07:40:16,781][model8_pretrain.py][INFO] Epoch:[0/2](810800/4588595) loss:2.955 lr:0.0000100 epoch_Time:24044.0min: [2024-01-06 07:40:53,708][model8_pretrain.py][INFO] Epoch:[0/2](810900/4588595) loss:3.068 lr:0.0000100 epoch_Time:24043.0min: [2024-01-06 07:40:53,708][model8_pretrain.py][INFO] Epoch:[0/2](810900/4588595) loss:2.988 lr:0.0000100 epoch_Time:24043.0min: [2024-01-06 07:40:53,708][model8_pretrain.py][INFO] Epoch:[0/2](810900/4588595) loss:2.524 lr:0.0000100 epoch_Time:24043.0min: [2024-01-06 07:40:53,708][model8_pretrain.py][INFO] Epoch:[0/2](810900/4588595) loss:2.111 lr:0.0000100 epoch_Time:24043.0min: [2024-01-06 07:40:53,708][model8_pretrain.py][INFO] Epoch:[0/2](810900/4588595) loss:2.686 lr:0.0000100 epoch_Time:24043.0min: [2024-01-06 07:40:53,708][model8_pretrain.py][INFO] Epoch:[0/2](810900/4588595) loss:2.849 lr:0.0000100 epoch_Time:24043.0min: [2024-01-06 07:40:53,709][model8_pretrain.py][INFO] Epoch:[0/2](810900/4588595) loss:2.711 lr:0.0000100 
epoch_Time:24043.0min: [2024-01-06 07:40:53,709][model8_pretrain.py][INFO] Epoch:[0/2](810900/4588595) loss:2.827 lr:0.0000100 epoch_Time:24043.0min: [2024-01-06 07:41:30,649][model8_pretrain.py][INFO] Epoch:[0/2](811000/4588595) loss:3.123 lr:0.0000100 epoch_Time:24043.0min: [2024-01-06 07:41:30,649][model8_pretrain.py][INFO] Epoch:[0/2](811000/4588595) loss:2.965 lr:0.0000100 epoch_Time:24043.0min: [2024-01-06 07:41:30,649][model8_pretrain.py][INFO] Epoch:[0/2](811000/4588595) loss:2.539 lr:0.0000100 epoch_Time:24043.0min: [2024-01-06 07:41:30,649][model8_pretrain.py][INFO] Epoch:[0/2](811000/4588595) loss:2.968 lr:0.0000100 epoch_Time:24043.0min: [2024-01-06 07:41:30,649][model8_pretrain.py][INFO] Epoch:[0/2](811000/4588595) loss:3.041 lr:0.0000100 epoch_Time:24043.0min: [2024-01-06 07:41:30,649][model8_pretrain.py][INFO] Epoch:[0/2](811000/4588595) loss:2.504 lr:0.0000100 epoch_Time:24043.0min: [2024-01-06 07:41:30,649][model8_pretrain.py][INFO] Epoch:[0/2](811000/4588595) loss:1.905 lr:0.0000100 epoch_Time:24043.0min: [2024-01-06 07:41:30,650][model8_pretrain.py][INFO] Epoch:[0/2](811000/4588595) loss:2.792 lr:0.0000100 epoch_Time:24043.0min: [2024-01-06 07:42:07,581][model8_pretrain.py][INFO] Epoch:[0/2](811100/4588595) loss:2.913 lr:0.0000100 epoch_Time:24042.0min: [2024-01-06 07:42:07,581][model8_pretrain.py][INFO] Epoch:[0/2](811100/4588595) loss:3.047 lr:0.0000100 epoch_Time:24042.0min: [2024-01-06 07:42:07,581][model8_pretrain.py][INFO] Epoch:[0/2](811100/4588595) loss:2.577 lr:0.0000100 epoch_Time:24042.0min: [2024-01-06 07:42:07,581][model8_pretrain.py][INFO] Epoch:[0/2](811100/4588595) loss:3.191 lr:0.0000100 epoch_Time:24042.0min: [2024-01-06 07:42:07,581][model8_pretrain.py][INFO] Epoch:[0/2](811100/4588595) loss:2.748 lr:0.0000100 epoch_Time:24042.0min: [2024-01-06 07:42:07,581][model8_pretrain.py][INFO] Epoch:[0/2](811100/4588595) loss:2.986 lr:0.0000100 epoch_Time:24042.0min: [2024-01-06 07:42:07,581][model8_pretrain.py][INFO] 
Epoch:[0/2](811100/4588595) loss:2.428 lr:0.0000100 epoch_Time:24042.0min: [2024-01-06 07:42:07,581][model8_pretrain.py][INFO] Epoch:[0/2](811100/4588595) loss:3.364 lr:0.0000100 epoch_Time:24042.0min: [2024-01-06 07:42:44,505][model8_pretrain.py][INFO] Epoch:[0/2](811200/4588595) loss:2.351 lr:0.0000100 epoch_Time:24042.0min: [2024-01-06 07:42:44,505][model8_pretrain.py][INFO] Epoch:[0/2](811200/4588595) loss:2.846 lr:0.0000100 epoch_Time:24042.0min: [2024-01-06 07:42:44,505][model8_pretrain.py][INFO] Epoch:[0/2](811200/4588595) loss:2.886 lr:0.0000100 epoch_Time:24042.0min: [2024-01-06 07:42:44,505][model8_pretrain.py][INFO] Epoch:[0/2](811200/4588595) loss:2.972 lr:0.0000100 epoch_Time:24042.0min: [2024-01-06 07:42:44,505][model8_pretrain.py][INFO] Epoch:[0/2](811200/4588595) loss:3.058 lr:0.0000100 epoch_Time:24042.0min: [2024-01-06 07:42:44,505][model8_pretrain.py][INFO] Epoch:[0/2](811200/4588595) loss:2.956 lr:0.0000100 epoch_Time:24042.0min: [2024-01-06 07:42:44,505][model8_pretrain.py][INFO] Epoch:[0/2](811200/4588595) loss:3.049 lr:0.0000100 epoch_Time:24042.0min: [2024-01-06 07:42:44,506][model8_pretrain.py][INFO] Epoch:[0/2](811200/4588595) loss:2.561 lr:0.0000100 epoch_Time:24042.0min: [2024-01-06 07:43:21,433][model8_pretrain.py][INFO] Epoch:[0/2](811300/4588595) loss:2.850 lr:0.0000100 epoch_Time:24041.0min: [2024-01-06 07:43:21,433][model8_pretrain.py][INFO] Epoch:[0/2](811300/4588595) loss:3.050 lr:0.0000100 epoch_Time:24041.0min: [2024-01-06 07:43:21,433][model8_pretrain.py][INFO] Epoch:[0/2](811300/4588595) loss:2.519 lr:0.0000100 epoch_Time:24041.0min: [2024-01-06 07:43:21,433][model8_pretrain.py][INFO] Epoch:[0/2](811300/4588595) loss:2.773 lr:0.0000100 epoch_Time:24041.0min: [2024-01-06 07:43:21,433][model8_pretrain.py][INFO] Epoch:[0/2](811300/4588595) loss:2.777 lr:0.0000100 epoch_Time:24041.0min: [2024-01-06 07:43:21,433][model8_pretrain.py][INFO] Epoch:[0/2](811300/4588595) loss:2.696 lr:0.0000100 epoch_Time:24041.0min: [2024-01-06 
07:43:21,434][model8_pretrain.py][INFO] Epoch:[0/2](811300/4588595) loss:2.661 lr:0.0000100 epoch_Time:24041.0min: [2024-01-06 07:43:21,434][model8_pretrain.py][INFO] Epoch:[0/2](811300/4588595) loss:2.586 lr:0.0000100 epoch_Time:24041.0min: [2024-01-06 07:43:58,394][model8_pretrain.py][INFO] Epoch:[0/2](811400/4588595) loss:3.043 lr:0.0000100 epoch_Time:24040.0min: [2024-01-06 07:43:58,394][model8_pretrain.py][INFO] Epoch:[0/2](811400/4588595) loss:3.325 lr:0.0000100 epoch_Time:24040.0min: [2024-01-06 07:43:58,394][model8_pretrain.py][INFO] Epoch:[0/2](811400/4588595) loss:2.513 lr:0.0000100 epoch_Time:24040.0min: [2024-01-06 07:43:58,394][model8_pretrain.py][INFO] Epoch:[0/2](811400/4588595) loss:2.297 lr:0.0000100 epoch_Time:24040.0min: [2024-01-06 07:43:58,394][model8_pretrain.py][INFO] Epoch:[0/2](811400/4588595) loss:3.032 lr:0.0000100 epoch_Time:24040.0min: [2024-01-06 07:43:58,394][model8_pretrain.py][INFO] Epoch:[0/2](811400/4588595) loss:2.769 lr:0.0000100 epoch_Time:24040.0min: [2024-01-06 07:43:58,394][model8_pretrain.py][INFO] Epoch:[0/2](811400/4588595) loss:2.562 lr:0.0000100 epoch_Time:24040.0min: [2024-01-06 07:43:58,394][model8_pretrain.py][INFO] Epoch:[0/2](811400/4588595) loss:2.619 lr:0.0000100 epoch_Time:24040.0min: [2024-01-06 07:44:35,329][model8_pretrain.py][INFO] Epoch:[0/2](811500/4588595) loss:2.341 lr:0.0000100 epoch_Time:24040.0min: [2024-01-06 07:44:35,329][model8_pretrain.py][INFO] Epoch:[0/2](811500/4588595) loss:3.262 lr:0.0000100 epoch_Time:24040.0min: [2024-01-06 07:44:35,329][model8_pretrain.py][INFO] Epoch:[0/2](811500/4588595) loss:3.255 lr:0.0000100 epoch_Time:24040.0min: [2024-01-06 07:44:35,329][model8_pretrain.py][INFO] Epoch:[0/2](811500/4588595) loss:3.012 lr:0.0000100 epoch_Time:24040.0min: [2024-01-06 07:44:35,329][model8_pretrain.py][INFO] Epoch:[0/2](811500/4588595) loss:3.136 lr:0.0000100 epoch_Time:24040.0min: [2024-01-06 07:44:35,329][model8_pretrain.py][INFO] Epoch:[0/2](811500/4588595) loss:2.474 lr:0.0000100 
epoch_Time:24040.0min: [2024-01-06 07:44:35,329][model8_pretrain.py][INFO] Epoch:[0/2](811500/4588595) loss:2.940 lr:0.0000100 epoch_Time:24040.0min: [2024-01-06 07:44:35,329][model8_pretrain.py][INFO] Epoch:[0/2](811500/4588595) loss:2.490 lr:0.0000100 epoch_Time:24040.0min: [2024-01-06 07:45:20,495][model8_pretrain.py][INFO] Epoch:[0/2](811600/4588595) loss:2.541 lr:0.0000100 epoch_Time:24039.0min: [2024-01-06 07:45:20,495][model8_pretrain.py][INFO] Epoch:[0/2](811600/4588595) loss:2.720 lr:0.0000100 epoch_Time:24039.0min: [2024-01-06 07:45:20,495][model8_pretrain.py][INFO] Epoch:[0/2](811600/4588595) loss:2.610 lr:0.0000100 epoch_Time:24039.0min: [2024-01-06 07:45:20,495][model8_pretrain.py][INFO] Epoch:[0/2](811600/4588595) loss:2.865 lr:0.0000100 epoch_Time:24039.0min: [2024-01-06 07:45:20,495][model8_pretrain.py][INFO] Epoch:[0/2](811600/4588595) loss:3.005 lr:0.0000100 epoch_Time:24039.0min: [2024-01-06 07:45:20,495][model8_pretrain.py][INFO] Epoch:[0/2](811600/4588595) loss:3.040 lr:0.0000100 epoch_Time:24039.0min: [2024-01-06 07:45:20,495][model8_pretrain.py][INFO] Epoch:[0/2](811600/4588595) loss:2.645 lr:0.0000100 epoch_Time:24039.0min: [2024-01-06 07:45:20,495][model8_pretrain.py][INFO] Epoch:[0/2](811600/4588595) loss:2.336 lr:0.0000100 epoch_Time:24039.0min: [2024-01-06 07:45:59,101][model8_pretrain.py][INFO] Epoch:[0/2](811700/4588595) loss:3.061 lr:0.0000100 epoch_Time:24038.0min: [2024-01-06 07:45:59,101][model8_pretrain.py][INFO] Epoch:[0/2](811700/4588595) loss:3.169 lr:0.0000100 epoch_Time:24038.0min: [2024-01-06 07:45:59,101][model8_pretrain.py][INFO] Epoch:[0/2](811700/4588595) loss:2.005 lr:0.0000100 epoch_Time:24038.0min: [2024-01-06 07:45:59,101][model8_pretrain.py][INFO] Epoch:[0/2](811700/4588595) loss:3.003 lr:0.0000100 epoch_Time:24038.0min: [2024-01-06 07:45:59,101][model8_pretrain.py][INFO] Epoch:[0/2](811700/4588595) loss:2.974 lr:0.0000100 epoch_Time:24038.0min: [2024-01-06 07:45:59,101][model8_pretrain.py][INFO] 
Epoch:[0/2](811700/4588595) loss:2.794 lr:0.0000100 epoch_Time:24038.0min: [2024-01-06 07:45:59,102][model8_pretrain.py][INFO] Epoch:[0/2](811700/4588595) loss:2.958 lr:0.0000100 epoch_Time:24038.0min: [2024-01-06 07:45:59,102][model8_pretrain.py][INFO] Epoch:[0/2](811700/4588595) loss:2.524 lr:0.0000100 epoch_Time:24038.0min: [2024-01-06 07:46:36,039][model8_pretrain.py][INFO] Epoch:[0/2](811800/4588595) loss:2.938 lr:0.0000100 epoch_Time:24038.0min: [2024-01-06 07:46:36,039][model8_pretrain.py][INFO] Epoch:[0/2](811800/4588595) loss:3.214 lr:0.0000100 epoch_Time:24038.0min: [2024-01-06 07:46:36,039][model8_pretrain.py][INFO] Epoch:[0/2](811800/4588595) loss:2.585 lr:0.0000100 epoch_Time:24038.0min: [2024-01-06 07:46:36,039][model8_pretrain.py][INFO] Epoch:[0/2](811800/4588595) loss:2.362 lr:0.0000100 epoch_Time:24038.0min: [2024-01-06 07:46:36,039][model8_pretrain.py][INFO] Epoch:[0/2](811800/4588595) loss:2.849 lr:0.0000100 epoch_Time:24038.0min: [2024-01-06 07:46:36,039][model8_pretrain.py][INFO] Epoch:[0/2](811800/4588595) loss:3.118 lr:0.0000100 epoch_Time:24038.0min: [2024-01-06 07:46:36,039][model8_pretrain.py][INFO] Epoch:[0/2](811800/4588595) loss:2.817 lr:0.0000100 epoch_Time:24038.0min: [2024-01-06 07:46:36,039][model8_pretrain.py][INFO] Epoch:[0/2](811800/4588595) loss:3.035 lr:0.0000100 epoch_Time:24038.0min: [2024-01-06 07:47:12,986][model8_pretrain.py][INFO] Epoch:[0/2](811900/4588595) loss:2.269 lr:0.0000100 epoch_Time:24037.0min: [2024-01-06 07:47:12,986][model8_pretrain.py][INFO] Epoch:[0/2](811900/4588595) loss:2.952 lr:0.0000100 epoch_Time:24037.0min: [2024-01-06 07:47:12,986][model8_pretrain.py][INFO] Epoch:[0/2](811900/4588595) loss:3.003 lr:0.0000100 epoch_Time:24037.0min: [2024-01-06 07:47:12,986][model8_pretrain.py][INFO] Epoch:[0/2](811900/4588595) loss:2.459 lr:0.0000100 epoch_Time:24037.0min: [2024-01-06 07:47:12,986][model8_pretrain.py][INFO] Epoch:[0/2](811900/4588595) loss:3.107 lr:0.0000100 epoch_Time:24037.0min: [2024-01-06 
07:47:12,986][model8_pretrain.py][INFO] Epoch:[0/2](811900/4588595) loss:3.108 lr:0.0000100 epoch_Time:24037.0min: [2024-01-06 07:47:12,986][model8_pretrain.py][INFO] Epoch:[0/2](811900/4588595) loss:2.821 lr:0.0000100 epoch_Time:24037.0min: [2024-01-06 07:47:12,986][model8_pretrain.py][INFO] Epoch:[0/2](811900/4588595) loss:2.963 lr:0.0000100 epoch_Time:24037.0min: [2024-01-06 07:47:49,938][model8_pretrain.py][INFO] Epoch:[0/2](812000/4588595) loss:2.788 lr:0.0000100 epoch_Time:24036.0min: [2024-01-06 07:47:49,938][model8_pretrain.py][INFO] Epoch:[0/2](812000/4588595) loss:3.047 lr:0.0000100 epoch_Time:24036.0min: [2024-01-06 07:47:49,939][model8_pretrain.py][INFO] Epoch:[0/2](812000/4588595) loss:3.206 lr:0.0000100 epoch_Time:24036.0min: [2024-01-06 07:47:49,939][model8_pretrain.py][INFO] Epoch:[0/2](812000/4588595) loss:2.496 lr:0.0000100 epoch_Time:24036.0min: [2024-01-06 07:47:49,939][model8_pretrain.py][INFO] Epoch:[0/2](812000/4588595) loss:2.752 lr:0.0000100 epoch_Time:24036.0min: [2024-01-06 07:47:49,939][model8_pretrain.py][INFO] Epoch:[0/2](812000/4588595) loss:2.860 lr:0.0000100 epoch_Time:24036.0min: [2024-01-06 07:47:49,939][model8_pretrain.py][INFO] Epoch:[0/2](812000/4588595) loss:2.418 lr:0.0000100 epoch_Time:24036.0min: [2024-01-06 07:47:49,939][model8_pretrain.py][INFO] Epoch:[0/2](812000/4588595) loss:3.217 lr:0.0000100 epoch_Time:24036.0min: [2024-01-06 07:48:26,881][model8_pretrain.py][INFO] Epoch:[0/2](812100/4588595) loss:2.389 lr:0.0000100 epoch_Time:24036.0min: [2024-01-06 07:48:26,881][model8_pretrain.py][INFO] Epoch:[0/2](812100/4588595) loss:3.061 lr:0.0000100 epoch_Time:24036.0min: [2024-01-06 07:48:26,881][model8_pretrain.py][INFO] Epoch:[0/2](812100/4588595) loss:2.621 lr:0.0000100 epoch_Time:24036.0min: [2024-01-06 07:48:26,881][model8_pretrain.py][INFO] Epoch:[0/2](812100/4588595) loss:1.903 lr:0.0000100 epoch_Time:24036.0min: [2024-01-06 07:48:26,881][model8_pretrain.py][INFO] Epoch:[0/2](812100/4588595) loss:2.926 lr:0.0000100 
epoch_Time:24036.0min: [2024-01-06 07:48:26,881][model8_pretrain.py][INFO] Epoch:[0/2](812100/4588595) loss:2.958 lr:0.0000100 epoch_Time:24036.0min: [2024-01-06 07:48:26,881][model8_pretrain.py][INFO] Epoch:[0/2](812100/4588595) loss:2.440 lr:0.0000100 epoch_Time:24036.0min: [2024-01-06 07:48:26,881][model8_pretrain.py][INFO] Epoch:[0/2](812100/4588595) loss:3.368 lr:0.0000100 epoch_Time:24036.0min: [2024-01-06 07:49:03,810][model8_pretrain.py][INFO] Epoch:[0/2](812200/4588595) loss:2.853 lr:0.0000100 epoch_Time:24035.0min: [2024-01-06 07:49:03,810][model8_pretrain.py][INFO] Epoch:[0/2](812200/4588595) loss:2.775 lr:0.0000100 epoch_Time:24035.0min: [2024-01-06 07:49:03,810][model8_pretrain.py][INFO] Epoch:[0/2](812200/4588595) loss:2.268 lr:0.0000100 epoch_Time:24035.0min: [2024-01-06 07:49:03,810][model8_pretrain.py][INFO] Epoch:[0/2](812200/4588595) loss:2.854 lr:0.0000100 epoch_Time:24035.0min: [2024-01-06 07:49:03,810][model8_pretrain.py][INFO] Epoch:[0/2](812200/4588595) loss:2.649 lr:0.0000100 epoch_Time:24035.0min: [2024-01-06 07:49:03,810][model8_pretrain.py][INFO] Epoch:[0/2](812200/4588595) loss:2.954 lr:0.0000100 epoch_Time:24035.0min: [2024-01-06 07:49:03,810][model8_pretrain.py][INFO] Epoch:[0/2](812200/4588595) loss:2.600 lr:0.0000100 epoch_Time:24035.0min: [2024-01-06 07:49:03,810][model8_pretrain.py][INFO] Epoch:[0/2](812200/4588595) loss:3.013 lr:0.0000100 epoch_Time:24035.0min: [2024-01-06 07:49:40,743][model8_pretrain.py][INFO] Epoch:[0/2](812300/4588595) loss:2.980 lr:0.0000100 epoch_Time:24035.0min: [2024-01-06 07:49:40,743][model8_pretrain.py][INFO] Epoch:[0/2](812300/4588595) loss:2.707 lr:0.0000100 epoch_Time:24035.0min: [2024-01-06 07:49:40,743][model8_pretrain.py][INFO] Epoch:[0/2](812300/4588595) loss:2.386 lr:0.0000100 epoch_Time:24035.0min: [2024-01-06 07:49:40,743][model8_pretrain.py][INFO] Epoch:[0/2](812300/4588595) loss:2.437 lr:0.0000100 epoch_Time:24035.0min: [2024-01-06 07:49:40,743][model8_pretrain.py][INFO] 
Epoch:[0/2](812300/4588595) loss:2.668 lr:0.0000100 epoch_Time:24035.0min: [2024-01-06 07:49:40,743][model8_pretrain.py][INFO] Epoch:[0/2](812300/4588595) loss:2.276 lr:0.0000100 epoch_Time:24035.0min: [2024-01-06 07:49:40,744][model8_pretrain.py][INFO] Epoch:[0/2](812300/4588595) loss:2.500 lr:0.0000100 epoch_Time:24035.0min: [2024-01-06 07:49:40,744][model8_pretrain.py][INFO] Epoch:[0/2](812300/4588595) loss:3.344 lr:0.0000100 epoch_Time:24035.0min: [2024-01-06 07:50:25,854][model8_pretrain.py][INFO] Epoch:[0/2](812400/4588595) loss:2.523 lr:0.0000100 epoch_Time:24034.0min: [2024-01-06 07:50:25,854][model8_pretrain.py][INFO] Epoch:[0/2](812400/4588595) loss:2.797 lr:0.0000100 epoch_Time:24034.0min: [2024-01-06 07:50:25,854][model8_pretrain.py][INFO] Epoch:[0/2](812400/4588595) loss:2.957 lr:0.0000100 epoch_Time:24034.0min: [2024-01-06 07:50:25,854][model8_pretrain.py][INFO] Epoch:[0/2](812400/4588595) loss:2.971 lr:0.0000100 epoch_Time:24034.0min: [2024-01-06 07:50:25,854][model8_pretrain.py][INFO] Epoch:[0/2](812400/4588595) loss:2.234 lr:0.0000100 epoch_Time:24034.0min: [2024-01-06 07:50:25,854][model8_pretrain.py][INFO] Epoch:[0/2](812400/4588595) loss:2.878 lr:0.0000100 epoch_Time:24034.0min: [2024-01-06 07:50:25,854][model8_pretrain.py][INFO] Epoch:[0/2](812400/4588595) loss:2.796 lr:0.0000100 epoch_Time:24034.0min: [2024-01-06 07:50:25,855][model8_pretrain.py][INFO] Epoch:[0/2](812400/4588595) loss:3.089 lr:0.0000100 epoch_Time:24034.0min: [2024-01-06 07:51:04,510][model8_pretrain.py][INFO] Epoch:[0/2](812500/4588595) loss:2.878 lr:0.0000100 epoch_Time:24033.0min: [2024-01-06 07:51:04,510][model8_pretrain.py][INFO] Epoch:[0/2](812500/4588595) loss:2.269 lr:0.0000100 epoch_Time:24033.0min: [2024-01-06 07:51:04,510][model8_pretrain.py][INFO] Epoch:[0/2](812500/4588595) loss:3.106 lr:0.0000100 epoch_Time:24033.0min: [2024-01-06 07:51:04,510][model8_pretrain.py][INFO] Epoch:[0/2](812500/4588595) loss:2.727 lr:0.0000100 epoch_Time:24033.0min: [2024-01-06 
07:51:04,510][model8_pretrain.py][INFO] Epoch:[0/2](812500/4588595) loss:2.592 lr:0.0000100 epoch_Time:24033.0min: [2024-01-06 07:51:04,510][model8_pretrain.py][INFO] Epoch:[0/2](812500/4588595) loss:2.728 lr:0.0000100 epoch_Time:24033.0min: [2024-01-06 07:51:04,510][model8_pretrain.py][INFO] Epoch:[0/2](812500/4588595) loss:2.759 lr:0.0000100 epoch_Time:24033.0min: [2024-01-06 07:51:04,510][model8_pretrain.py][INFO] Epoch:[0/2](812500/4588595) loss:2.579 lr:0.0000100 epoch_Time:24033.0min: [2024-01-06 07:51:41,447][model8_pretrain.py][INFO] Epoch:[0/2](812600/4588595) loss:2.570 lr:0.0000100 epoch_Time:24033.0min: [2024-01-06 07:51:41,447][model8_pretrain.py][INFO] Epoch:[0/2](812600/4588595) loss:3.042 lr:0.0000100 epoch_Time:24033.0min: [2024-01-06 07:51:41,447][model8_pretrain.py][INFO] Epoch:[0/2](812600/4588595) loss:2.330 lr:0.0000100 epoch_Time:24033.0min: [2024-01-06 07:51:41,447][model8_pretrain.py][INFO] Epoch:[0/2](812600/4588595) loss:3.129 lr:0.0000100 epoch_Time:24033.0min: [2024-01-06 07:51:41,447][model8_pretrain.py][INFO] Epoch:[0/2](812600/4588595) loss:2.958 lr:0.0000100 epoch_Time:24033.0min: [2024-01-06 07:51:41,447][model8_pretrain.py][INFO] Epoch:[0/2](812600/4588595) loss:2.622 lr:0.0000100 epoch_Time:24033.0min: [2024-01-06 07:51:41,447][model8_pretrain.py][INFO] Epoch:[0/2](812600/4588595) loss:3.297 lr:0.0000100 epoch_Time:24033.0min: [2024-01-06 07:51:41,448][model8_pretrain.py][INFO] Epoch:[0/2](812600/4588595) loss:3.063 lr:0.0000100 epoch_Time:24033.0min: [2024-01-06 07:52:18,397][model8_pretrain.py][INFO] Epoch:[0/2](812700/4588595) loss:3.444 lr:0.0000100 epoch_Time:24032.0min: [2024-01-06 07:52:18,397][model8_pretrain.py][INFO] Epoch:[0/2](812700/4588595) loss:2.996 lr:0.0000100 epoch_Time:24032.0min: [2024-01-06 07:52:18,397][model8_pretrain.py][INFO] Epoch:[0/2](812700/4588595) loss:3.052 lr:0.0000100 epoch_Time:24032.0min: [2024-01-06 07:52:18,397][model8_pretrain.py][INFO] Epoch:[0/2](812700/4588595) loss:2.880 lr:0.0000100 
epoch_Time:24032.0min: [2024-01-06 07:52:18,397][model8_pretrain.py][INFO] Epoch:[0/2](812700/4588595) loss:2.843 lr:0.0000100 epoch_Time:24032.0min: [2024-01-06 07:52:18,397][model8_pretrain.py][INFO] Epoch:[0/2](812700/4588595) loss:2.413 lr:0.0000100 epoch_Time:24032.0min: [2024-01-06 07:52:18,397][model8_pretrain.py][INFO] Epoch:[0/2](812700/4588595) loss:2.385 lr:0.0000100 epoch_Time:24032.0min: [2024-01-06 07:52:18,398][model8_pretrain.py][INFO] Epoch:[0/2](812700/4588595) loss:2.640 lr:0.0000100 epoch_Time:24032.0min: [2024-01-06 07:52:55,337][model8_pretrain.py][INFO] Epoch:[0/2](812800/4588595) loss:2.472 lr:0.0000100 epoch_Time:24031.0min: [2024-01-06 07:52:55,337][model8_pretrain.py][INFO] Epoch:[0/2](812800/4588595) loss:2.527 lr:0.0000100 epoch_Time:24031.0min: [2024-01-06 07:52:55,337][model8_pretrain.py][INFO] Epoch:[0/2](812800/4588595) loss:2.897 lr:0.0000100 epoch_Time:24031.0min: [2024-01-06 07:52:55,337][model8_pretrain.py][INFO] Epoch:[0/2](812800/4588595) loss:3.059 lr:0.0000100 epoch_Time:24031.0min: [2024-01-06 07:52:55,337][model8_pretrain.py][INFO] Epoch:[0/2](812800/4588595) loss:2.925 lr:0.0000100 epoch_Time:24031.0min: [2024-01-06 07:52:55,337][model8_pretrain.py][INFO] Epoch:[0/2](812800/4588595) loss:2.448 lr:0.0000100 epoch_Time:24031.0min: [2024-01-06 07:52:55,338][model8_pretrain.py][INFO] Epoch:[0/2](812800/4588595) loss:3.098 lr:0.0000100 epoch_Time:24031.0min: [2024-01-06 07:52:55,338][model8_pretrain.py][INFO] Epoch:[0/2](812800/4588595) loss:2.487 lr:0.0000100 epoch_Time:24031.0min: [2024-01-06 07:53:32,295][model8_pretrain.py][INFO] Epoch:[0/2](812900/4588595) loss:2.883 lr:0.0000100 epoch_Time:24031.0min: [2024-01-06 07:53:32,295][model8_pretrain.py][INFO] Epoch:[0/2](812900/4588595) loss:2.830 lr:0.0000100 epoch_Time:24031.0min: [2024-01-06 07:53:32,295][model8_pretrain.py][INFO] Epoch:[0/2](812900/4588595) loss:2.345 lr:0.0000100 epoch_Time:24031.0min: [2024-01-06 07:53:32,295][model8_pretrain.py][INFO] 
Epoch:[0/2](812900/4588595) loss:2.469 lr:0.0000100 epoch_Time:24031.0min: [2024-01-06 07:53:32,295][model8_pretrain.py][INFO] Epoch:[0/2](812900/4588595) loss:2.475 lr:0.0000100 epoch_Time:24031.0min: [2024-01-06 07:53:32,295][model8_pretrain.py][INFO] Epoch:[0/2](812900/4588595) loss:3.310 lr:0.0000100 epoch_Time:24031.0min: [2024-01-06 07:53:32,295][model8_pretrain.py][INFO] Epoch:[0/2](812900/4588595) loss:2.441 lr:0.0000100 epoch_Time:24031.0min: [2024-01-06 07:53:32,295][model8_pretrain.py][INFO] Epoch:[0/2](812900/4588595) loss:2.598 lr:0.0000100 epoch_Time:24031.0min: [2024-01-06 07:54:09,247][model8_pretrain.py][INFO] Epoch:[0/2](813000/4588595) loss:3.306 lr:0.0000100 epoch_Time:24030.0min: [2024-01-06 07:54:09,247][model8_pretrain.py][INFO] Epoch:[0/2](813000/4588595) loss:1.981 lr:0.0000100 epoch_Time:24030.0min: [2024-01-06 07:54:09,247][model8_pretrain.py][INFO] Epoch:[0/2](813000/4588595) loss:2.730 lr:0.0000100 epoch_Time:24030.0min: [2024-01-06 07:54:09,247][model8_pretrain.py][INFO] Epoch:[0/2](813000/4588595) loss:2.901 lr:0.0000100 epoch_Time:24030.0min: [2024-01-06 07:54:09,247][model8_pretrain.py][INFO] Epoch:[0/2](813000/4588595) loss:3.112 lr:0.0000100 epoch_Time:24030.0min: [2024-01-06 07:54:09,247][model8_pretrain.py][INFO] Epoch:[0/2](813000/4588595) loss:3.094 lr:0.0000100 epoch_Time:24030.0min: [2024-01-06 07:54:09,248][model8_pretrain.py][INFO] Epoch:[0/2](813000/4588595) loss:2.943 lr:0.0000100 epoch_Time:24030.0min: [2024-01-06 07:54:09,248][model8_pretrain.py][INFO] Epoch:[0/2](813000/4588595) loss:2.877 lr:0.0000100 epoch_Time:24030.0min: [2024-01-06 07:54:46,171][model8_pretrain.py][INFO] Epoch:[0/2](813100/4588595) loss:2.553 lr:0.0000100 epoch_Time:24030.0min: [2024-01-06 07:54:46,171][model8_pretrain.py][INFO] Epoch:[0/2](813100/4588595) loss:2.561 lr:0.0000100 epoch_Time:24030.0min: [2024-01-06 07:54:46,171][model8_pretrain.py][INFO] Epoch:[0/2](813100/4588595) loss:2.814 lr:0.0000100 epoch_Time:24030.0min: [2024-01-06 
07:54:46,171][model8_pretrain.py][INFO] Epoch:[0/2](813100/4588595) loss:3.032 lr:0.0000100 epoch_Time:24030.0min: [2024-01-06 07:54:46,171][model8_pretrain.py][INFO] Epoch:[0/2](813100/4588595) loss:2.359 lr:0.0000100 epoch_Time:24030.0min: [2024-01-06 07:54:46,171][model8_pretrain.py][INFO] Epoch:[0/2](813100/4588595) loss:2.896 lr:0.0000100 epoch_Time:24030.0min: [2024-01-06 07:54:46,171][model8_pretrain.py][INFO] Epoch:[0/2](813100/4588595) loss:3.280 lr:0.0000100 epoch_Time:24030.0min: [2024-01-06 07:54:46,171][model8_pretrain.py][INFO] Epoch:[0/2](813100/4588595) loss:2.805 lr:0.0000100 epoch_Time:24030.0min: [2024-01-06 07:55:31,338][model8_pretrain.py][INFO] Epoch:[0/2](813200/4588595) loss:3.079 lr:0.0000100 epoch_Time:24029.0min: [2024-01-06 07:55:31,338][model8_pretrain.py][INFO] Epoch:[0/2](813200/4588595) loss:2.635 lr:0.0000100 epoch_Time:24029.0min: [2024-01-06 07:55:31,338][model8_pretrain.py][INFO] Epoch:[0/2](813200/4588595) loss:2.669 lr:0.0000100 epoch_Time:24029.0min: [2024-01-06 07:55:31,338][model8_pretrain.py][INFO] Epoch:[0/2](813200/4588595) loss:2.936 lr:0.0000100 epoch_Time:24029.0min: [2024-01-06 07:55:31,338][model8_pretrain.py][INFO] Epoch:[0/2](813200/4588595) loss:2.561 lr:0.0000100 epoch_Time:24029.0min: [2024-01-06 07:55:31,338][model8_pretrain.py][INFO] Epoch:[0/2](813200/4588595) loss:2.750 lr:0.0000100 epoch_Time:24029.0min: [2024-01-06 07:55:31,338][model8_pretrain.py][INFO] Epoch:[0/2](813200/4588595) loss:2.922 lr:0.0000100 epoch_Time:24029.0min: [2024-01-06 07:55:31,338][model8_pretrain.py][INFO] Epoch:[0/2](813200/4588595) loss:2.347 lr:0.0000100 epoch_Time:24029.0min: [2024-01-06 07:56:09,949][model8_pretrain.py][INFO] Epoch:[0/2](813300/4588595) loss:2.822 lr:0.0000100 epoch_Time:24028.0min: [2024-01-06 07:56:09,949][model8_pretrain.py][INFO] Epoch:[0/2](813300/4588595) loss:3.078 lr:0.0000100 epoch_Time:24028.0min: [2024-01-06 07:56:09,949][model8_pretrain.py][INFO] Epoch:[0/2](813300/4588595) loss:2.698 lr:0.0000100 
epoch_Time:24028.0min: [2024-01-06 07:56:09,949][model8_pretrain.py][INFO] Epoch:[0/2](813300/4588595) loss:2.509 lr:0.0000100 epoch_Time:24028.0min: [2024-01-06 07:56:09,949][model8_pretrain.py][INFO] Epoch:[0/2](813300/4588595) loss:2.764 lr:0.0000100 epoch_Time:24028.0min: [2024-01-06 07:56:09,949][model8_pretrain.py][INFO] Epoch:[0/2](813300/4588595) loss:2.328 lr:0.0000100 epoch_Time:24028.0min: [2024-01-06 07:56:09,949][model8_pretrain.py][INFO] Epoch:[0/2](813300/4588595) loss:2.577 lr:0.0000100 epoch_Time:24028.0min: [2024-01-06 07:56:09,949][model8_pretrain.py][INFO] Epoch:[0/2](813300/4588595) loss:3.345 lr:0.0000100 epoch_Time:24028.0min: [2024-01-06 07:56:46,886][model8_pretrain.py][INFO] Epoch:[0/2](813400/4588595) loss:2.148 lr:0.0000100 epoch_Time:24028.0min: [2024-01-06 07:56:46,886][model8_pretrain.py][INFO] Epoch:[0/2](813400/4588595) loss:3.179 lr:0.0000100 epoch_Time:24028.0min: [2024-01-06 07:56:46,886][model8_pretrain.py][INFO] Epoch:[0/2](813400/4588595) loss:3.191 lr:0.0000100 epoch_Time:24028.0min: [2024-01-06 07:56:46,886][model8_pretrain.py][INFO] Epoch:[0/2](813400/4588595) loss:3.170 lr:0.0000100 epoch_Time:24028.0min: [2024-01-06 07:56:46,886][model8_pretrain.py][INFO] Epoch:[0/2](813400/4588595) loss:3.051 lr:0.0000100 epoch_Time:24028.0min: [2024-01-06 07:56:46,886][model8_pretrain.py][INFO] Epoch:[0/2](813400/4588595) loss:2.752 lr:0.0000100 epoch_Time:24028.0min: [2024-01-06 07:56:46,886][model8_pretrain.py][INFO] Epoch:[0/2](813400/4588595) loss:2.932 lr:0.0000100 epoch_Time:24028.0min: [2024-01-06 07:56:46,886][model8_pretrain.py][INFO] Epoch:[0/2](813400/4588595) loss:3.013 lr:0.0000100 epoch_Time:24028.0min: [2024-01-06 07:57:23,795][model8_pretrain.py][INFO] Epoch:[0/2](813500/4588595) loss:2.582 lr:0.0000100 epoch_Time:24027.0min: [2024-01-06 07:57:23,795][model8_pretrain.py][INFO] Epoch:[0/2](813500/4588595) loss:3.045 lr:0.0000100 epoch_Time:24027.0min: [2024-01-06 07:57:23,795][model8_pretrain.py][INFO] 
Epoch:[0/2](813500/4588595) loss:2.900 lr:0.0000100 epoch_Time:24027.0min: [2024-01-06 07:57:23,795][model8_pretrain.py][INFO] Epoch:[0/2](813500/4588595) loss:3.048 lr:0.0000100 epoch_Time:24027.0min: [2024-01-06 07:57:23,795][model8_pretrain.py][INFO] Epoch:[0/2](813500/4588595) loss:2.569 lr:0.0000100 epoch_Time:24027.0min: [2024-01-06 07:57:23,795][model8_pretrain.py][INFO] Epoch:[0/2](813500/4588595) loss:2.901 lr:0.0000100 epoch_Time:24027.0min: [2024-01-06 07:57:23,795][model8_pretrain.py][INFO] Epoch:[0/2](813500/4588595) loss:2.492 lr:0.0000100 epoch_Time:24027.0min: [2024-01-06 07:57:23,795][model8_pretrain.py][INFO] Epoch:[0/2](813500/4588595) loss:2.891 lr:0.0000100 epoch_Time:24027.0min: [2024-01-06 07:58:00,736][model8_pretrain.py][INFO] Epoch:[0/2](813600/4588595) loss:2.778 lr:0.0000100 epoch_Time:24026.0min: [2024-01-06 07:58:00,736][model8_pretrain.py][INFO] Epoch:[0/2](813600/4588595) loss:3.067 lr:0.0000100 epoch_Time:24026.0min: [2024-01-06 07:58:00,736][model8_pretrain.py][INFO] Epoch:[0/2](813600/4588595) loss:2.608 lr:0.0000100 epoch_Time:24026.0min: [2024-01-06 07:58:00,736][model8_pretrain.py][INFO] Epoch:[0/2](813600/4588595) loss:2.230 lr:0.0000100 epoch_Time:24026.0min: [2024-01-06 07:58:00,736][model8_pretrain.py][INFO] Epoch:[0/2](813600/4588595) loss:2.821 lr:0.0000100 epoch_Time:24026.0min: [2024-01-06 07:58:00,737][model8_pretrain.py][INFO] Epoch:[0/2](813600/4588595) loss:2.168 lr:0.0000100 epoch_Time:24026.0min: [2024-01-06 07:58:00,737][model8_pretrain.py][INFO] Epoch:[0/2](813600/4588595) loss:2.820 lr:0.0000100 epoch_Time:24026.0min: [2024-01-06 07:58:00,737][model8_pretrain.py][INFO] Epoch:[0/2](813600/4588595) loss:2.720 lr:0.0000100 epoch_Time:24026.0min: [2024-01-06 07:58:37,679][model8_pretrain.py][INFO] Epoch:[0/2](813700/4588595) loss:2.963 lr:0.0000100 epoch_Time:24026.0min: [2024-01-06 07:58:37,680][model8_pretrain.py][INFO] Epoch:[0/2](813700/4588595) loss:3.216 lr:0.0000100 epoch_Time:24026.0min: [2024-01-06 
07:58:37,680][model8_pretrain.py][INFO] Epoch:[0/2](813700/4588595) loss:2.570 lr:0.0000100 epoch_Time:24026.0min: [2024-01-06 07:58:37,680][model8_pretrain.py][INFO] Epoch:[0/2](813700/4588595) loss:2.641 lr:0.0000100 epoch_Time:24026.0min: [2024-01-06 07:58:37,680][model8_pretrain.py][INFO] Epoch:[0/2](813700/4588595) loss:2.345 lr:0.0000100 epoch_Time:24026.0min: [2024-01-06 07:58:37,680][model8_pretrain.py][INFO] Epoch:[0/2](813700/4588595) loss:2.745 lr:0.0000100 epoch_Time:24026.0min: [2024-01-06 07:58:37,680][model8_pretrain.py][INFO] Epoch:[0/2](813700/4588595) loss:2.789 lr:0.0000100 epoch_Time:24026.0min: [2024-01-06 07:58:37,680][model8_pretrain.py][INFO] Epoch:[0/2](813700/4588595) loss:2.607 lr:0.0000100 epoch_Time:24026.0min: [2024-01-06 07:59:14,611][model8_pretrain.py][INFO] Epoch:[0/2](813800/4588595) loss:2.745 lr:0.0000100 epoch_Time:24025.0min: [2024-01-06 07:59:14,611][model8_pretrain.py][INFO] Epoch:[0/2](813800/4588595) loss:2.115 lr:0.0000100 epoch_Time:24025.0min: [2024-01-06 07:59:14,611][model8_pretrain.py][INFO] Epoch:[0/2](813800/4588595) loss:2.359 lr:0.0000100 epoch_Time:24025.0min: [2024-01-06 07:59:14,611][model8_pretrain.py][INFO] Epoch:[0/2](813800/4588595) loss:3.154 lr:0.0000100 epoch_Time:24025.0min: [2024-01-06 07:59:14,611][model8_pretrain.py][INFO] Epoch:[0/2](813800/4588595) loss:2.447 lr:0.0000100 epoch_Time:24025.0min: [2024-01-06 07:59:14,611][model8_pretrain.py][INFO] Epoch:[0/2](813800/4588595) loss:2.615 lr:0.0000100 epoch_Time:24025.0min: [2024-01-06 07:59:14,611][model8_pretrain.py][INFO] Epoch:[0/2](813800/4588595) loss:2.878 lr:0.0000100 epoch_Time:24025.0min: [2024-01-06 07:59:14,612][model8_pretrain.py][INFO] Epoch:[0/2](813800/4588595) loss:3.405 lr:0.0000100 epoch_Time:24025.0min: [2024-01-06 07:59:51,556][model8_pretrain.py][INFO] Epoch:[0/2](813900/4588595) loss:3.038 lr:0.0000100 epoch_Time:24024.0min: [2024-01-06 07:59:51,556][model8_pretrain.py][INFO] Epoch:[0/2](813900/4588595) loss:2.463 lr:0.0000100 
epoch_Time:24024.0min: [2024-01-06 07:59:51,556][model8_pretrain.py][INFO] Epoch:[0/2](813900/4588595) loss:2.757 lr:0.0000100 epoch_Time:24024.0min: [2024-01-06 07:59:51,556][model8_pretrain.py][INFO] Epoch:[0/2](813900/4588595) loss:2.604 lr:0.0000100 epoch_Time:24024.0min: [2024-01-06 07:59:51,556][model8_pretrain.py][INFO] Epoch:[0/2](813900/4588595) loss:2.898 lr:0.0000100 epoch_Time:24024.0min: [2024-01-06 07:59:51,556][model8_pretrain.py][INFO] Epoch:[0/2](813900/4588595) loss:2.774 lr:0.0000100 epoch_Time:24024.0min: [2024-01-06 07:59:51,556][model8_pretrain.py][INFO] Epoch:[0/2](813900/4588595) loss:2.445 lr:0.0000100 epoch_Time:24024.0min: [2024-01-06 07:59:51,557][model8_pretrain.py][INFO] Epoch:[0/2](813900/4588595) loss:3.035 lr:0.0000100 epoch_Time:24024.0min: [2024-01-06 08:00:35,136][model8_pretrain.py][INFO] Epoch:[0/2](814000/4588595) loss:3.026 lr:0.0000100 epoch_Time:24024.0min: [2024-01-06 08:00:35,136][model8_pretrain.py][INFO] Epoch:[0/2](814000/4588595) loss:3.079 lr:0.0000100 epoch_Time:24024.0min: [2024-01-06 08:00:35,136][model8_pretrain.py][INFO] Epoch:[0/2](814000/4588595) loss:3.182 lr:0.0000100 epoch_Time:24024.0min: [2024-01-06 08:00:35,136][model8_pretrain.py][INFO] Epoch:[0/2](814000/4588595) loss:3.125 lr:0.0000100 epoch_Time:24024.0min: [2024-01-06 08:00:35,136][model8_pretrain.py][INFO] Epoch:[0/2](814000/4588595) loss:2.754 lr:0.0000100 epoch_Time:24024.0min: [2024-01-06 08:00:35,136][model8_pretrain.py][INFO] Epoch:[0/2](814000/4588595) loss:3.288 lr:0.0000100 epoch_Time:24024.0min: [2024-01-06 08:00:35,136][model8_pretrain.py][INFO] Epoch:[0/2](814000/4588595) loss:2.951 lr:0.0000100 epoch_Time:24024.0min: [2024-01-06 08:00:35,136][model8_pretrain.py][INFO] Epoch:[0/2](814000/4588595) loss:2.736 lr:0.0000100 epoch_Time:24024.0min: [2024-01-06 08:01:15,567][model8_pretrain.py][INFO] Epoch:[0/2](814100/4588595) loss:3.409 lr:0.0000100 epoch_Time:24023.0min: [2024-01-06 08:01:15,567][model8_pretrain.py][INFO] 
Epoch:[0/2](814100/4588595) loss:3.125 lr:0.0000100 epoch_Time:24023.0min: [2024-01-06 08:01:15,567][model8_pretrain.py][INFO] Epoch:[0/2](814100/4588595) loss:2.580 lr:0.0000100 epoch_Time:24023.0min: [2024-01-06 08:01:15,567][model8_pretrain.py][INFO] Epoch:[0/2](814100/4588595) loss:2.211 lr:0.0000100 epoch_Time:24023.0min: [2024-01-06 08:01:15,567][model8_pretrain.py][INFO] Epoch:[0/2](814100/4588595) loss:2.885 lr:0.0000100 epoch_Time:24023.0min: [2024-01-06 08:01:15,567][model8_pretrain.py][INFO] Epoch:[0/2](814100/4588595) loss:2.813 lr:0.0000100 epoch_Time:24023.0min: [2024-01-06 08:01:15,567][model8_pretrain.py][INFO] Epoch:[0/2](814100/4588595) loss:3.133 lr:0.0000100 epoch_Time:24023.0min: [2024-01-06 08:01:15,567][model8_pretrain.py][INFO] Epoch:[0/2](814100/4588595) loss:2.563 lr:0.0000100 epoch_Time:24023.0min: [2024-01-06 08:01:52,510][model8_pretrain.py][INFO] Epoch:[0/2](814200/4588595) loss:3.047 lr:0.0000100 epoch_Time:24022.0min: [2024-01-06 08:01:52,510][model8_pretrain.py][INFO] Epoch:[0/2](814200/4588595) loss:3.253 lr:0.0000100 epoch_Time:24022.0min: [2024-01-06 08:01:52,510][model8_pretrain.py][INFO] Epoch:[0/2](814200/4588595) loss:2.573 lr:0.0000100 epoch_Time:24022.0min: [2024-01-06 08:01:52,510][model8_pretrain.py][INFO] Epoch:[0/2](814200/4588595) loss:2.545 lr:0.0000100 epoch_Time:24022.0min: [2024-01-06 08:01:52,510][model8_pretrain.py][INFO] Epoch:[0/2](814200/4588595) loss:2.924 lr:0.0000100 epoch_Time:24022.0min: [2024-01-06 08:01:52,510][model8_pretrain.py][INFO] Epoch:[0/2](814200/4588595) loss:3.144 lr:0.0000100 epoch_Time:24022.0min: [2024-01-06 08:01:52,510][model8_pretrain.py][INFO] Epoch:[0/2](814200/4588595) loss:2.908 lr:0.0000100 epoch_Time:24022.0min: [2024-01-06 08:01:52,510][model8_pretrain.py][INFO] Epoch:[0/2](814200/4588595) loss:2.785 lr:0.0000100 epoch_Time:24022.0min: [2024-01-06 08:02:29,450][model8_pretrain.py][INFO] Epoch:[0/2](814300/4588595) loss:2.666 lr:0.0000100 epoch_Time:24022.0min: [2024-01-06 
08:02:29,450][model8_pretrain.py][INFO] Epoch:[0/2](814300/4588595) loss:3.287 lr:0.0000100 epoch_Time:24022.0min: [2024-01-06 08:02:29,450][model8_pretrain.py][INFO] Epoch:[0/2](814300/4588595) loss:2.893 lr:0.0000100 epoch_Time:24022.0min: [2024-01-06 08:02:29,450][model8_pretrain.py][INFO] Epoch:[0/2](814300/4588595) loss:2.875 lr:0.0000100 epoch_Time:24022.0min: [2024-01-06 08:02:29,450][model8_pretrain.py][INFO] Epoch:[0/2](814300/4588595) loss:2.415 lr:0.0000100 epoch_Time:24022.0min: [2024-01-06 08:02:29,450][model8_pretrain.py][INFO] Epoch:[0/2](814300/4588595) loss:3.212 lr:0.0000100 epoch_Time:24022.0min: [2024-01-06 08:02:29,450][model8_pretrain.py][INFO] Epoch:[0/2](814300/4588595) loss:3.384 lr:0.0000100 epoch_Time:24022.0min: [2024-01-06 08:02:29,450][model8_pretrain.py][INFO] Epoch:[0/2](814300/4588595) loss:2.957 lr:0.0000100 epoch_Time:24022.0min: [2024-01-06 08:03:06,381][model8_pretrain.py][INFO] Epoch:[0/2](814400/4588595) loss:2.506 lr:0.0000100 epoch_Time:24021.0min: [2024-01-06 08:03:06,381][model8_pretrain.py][INFO] Epoch:[0/2](814400/4588595) loss:3.177 lr:0.0000100 epoch_Time:24021.0min: [2024-01-06 08:03:06,381][model8_pretrain.py][INFO] Epoch:[0/2](814400/4588595) loss:2.903 lr:0.0000100 epoch_Time:24021.0min: [2024-01-06 08:03:06,381][model8_pretrain.py][INFO] Epoch:[0/2](814400/4588595) loss:2.583 lr:0.0000100 epoch_Time:24021.0min: [2024-01-06 08:03:06,381][model8_pretrain.py][INFO] Epoch:[0/2](814400/4588595) loss:2.761 lr:0.0000100 epoch_Time:24021.0min: [2024-01-06 08:03:06,381][model8_pretrain.py][INFO] Epoch:[0/2](814400/4588595) loss:2.404 lr:0.0000100 epoch_Time:24021.0min: [2024-01-06 08:03:06,381][model8_pretrain.py][INFO] Epoch:[0/2](814400/4588595) loss:3.033 lr:0.0000100 epoch_Time:24021.0min: [2024-01-06 08:03:06,381][model8_pretrain.py][INFO] Epoch:[0/2](814400/4588595) loss:2.785 lr:0.0000100 epoch_Time:24021.0min: [2024-01-06 08:03:43,322][model8_pretrain.py][INFO] Epoch:[0/2](814500/4588595) loss:3.032 lr:0.0000100 
epoch_Time:24021.0min: [2024-01-06 08:03:43,322][model8_pretrain.py][INFO] Epoch:[0/2](814500/4588595) loss:2.650 lr:0.0000100 epoch_Time:24021.0min: [2024-01-06 08:03:43,322][model8_pretrain.py][INFO] Epoch:[0/2](814500/4588595) loss:3.041 lr:0.0000100 epoch_Time:24021.0min: [2024-01-06 08:03:43,322][model8_pretrain.py][INFO] Epoch:[0/2](814500/4588595) loss:3.430 lr:0.0000100 epoch_Time:24021.0min: [2024-01-06 08:03:43,322][model8_pretrain.py][INFO] Epoch:[0/2](814500/4588595) loss:2.979 lr:0.0000100 epoch_Time:24021.0min: [2024-01-06 08:03:43,322][model8_pretrain.py][INFO] Epoch:[0/2](814500/4588595) loss:2.943 lr:0.0000100 epoch_Time:24021.0min: [2024-01-06 08:03:43,322][model8_pretrain.py][INFO] Epoch:[0/2](814500/4588595) loss:2.608 lr:0.0000100 epoch_Time:24021.0min: [2024-01-06 08:03:43,323][model8_pretrain.py][INFO] Epoch:[0/2](814500/4588595) loss:2.816 lr:0.0000100 epoch_Time:24021.0min: [2024-01-06 08:04:20,260][model8_pretrain.py][INFO] Epoch:[0/2](814600/4588595) loss:2.618 lr:0.0000100 epoch_Time:24020.0min: [2024-01-06 08:04:20,260][model8_pretrain.py][INFO] Epoch:[0/2](814600/4588595) loss:2.821 lr:0.0000100 epoch_Time:24020.0min: [2024-01-06 08:04:20,260][model8_pretrain.py][INFO] Epoch:[0/2](814600/4588595) loss:2.529 lr:0.0000100 epoch_Time:24020.0min: [2024-01-06 08:04:20,260][model8_pretrain.py][INFO] Epoch:[0/2](814600/4588595) loss:2.776 lr:0.0000100 epoch_Time:24020.0min: [2024-01-06 08:04:20,260][model8_pretrain.py][INFO] Epoch:[0/2](814600/4588595) loss:2.982 lr:0.0000100 epoch_Time:24020.0min: [2024-01-06 08:04:20,260][model8_pretrain.py][INFO] Epoch:[0/2](814600/4588595) loss:3.322 lr:0.0000100 epoch_Time:24020.0min: [2024-01-06 08:04:20,260][model8_pretrain.py][INFO] Epoch:[0/2](814600/4588595) loss:3.175 lr:0.0000100 epoch_Time:24020.0min: [2024-01-06 08:04:20,261][model8_pretrain.py][INFO] Epoch:[0/2](814600/4588595) loss:2.321 lr:0.0000100 epoch_Time:24020.0min: [2024-01-06 08:04:57,192][model8_pretrain.py][INFO] 
Epoch:[0/2](814700/4588595) loss:2.575 lr:0.0000100 epoch_Time:24019.0min: [2024-01-06 08:04:57,192][model8_pretrain.py][INFO] Epoch:[0/2](814700/4588595) loss:2.927 lr:0.0000100 epoch_Time:24019.0min: [2024-01-06 08:04:57,192][model8_pretrain.py][INFO] Epoch:[0/2](814700/4588595) loss:2.948 lr:0.0000100 epoch_Time:24019.0min: [2024-01-06 08:04:57,192][model8_pretrain.py][INFO] Epoch:[0/2](814700/4588595) loss:3.269 lr:0.0000100 epoch_Time:24019.0min: [2024-01-06 08:04:57,192][model8_pretrain.py][INFO] Epoch:[0/2](814700/4588595) loss:2.981 lr:0.0000100 epoch_Time:24019.0min: [2024-01-06 08:04:57,192][model8_pretrain.py][INFO] Epoch:[0/2](814700/4588595) loss:2.920 lr:0.0000100 epoch_Time:24019.0min: [2024-01-06 08:04:57,192][model8_pretrain.py][INFO] Epoch:[0/2](814700/4588595) loss:2.914 lr:0.0000100 epoch_Time:24019.0min: [2024-01-06 08:04:57,192][model8_pretrain.py][INFO] Epoch:[0/2](814700/4588595) loss:2.337 lr:0.0000100 epoch_Time:24019.0min: [2024-01-06 08:05:40,905][model8_pretrain.py][INFO] Epoch:[0/2](814800/4588595) loss:2.432 lr:0.0000100 epoch_Time:24019.0min: [2024-01-06 08:05:40,905][model8_pretrain.py][INFO] Epoch:[0/2](814800/4588595) loss:2.621 lr:0.0000100 epoch_Time:24019.0min: [2024-01-06 08:05:40,905][model8_pretrain.py][INFO] Epoch:[0/2](814800/4588595) loss:2.874 lr:0.0000100 epoch_Time:24019.0min: [2024-01-06 08:05:40,905][model8_pretrain.py][INFO] Epoch:[0/2](814800/4588595) loss:2.794 lr:0.0000100 epoch_Time:24019.0min: [2024-01-06 08:05:40,905][model8_pretrain.py][INFO] Epoch:[0/2](814800/4588595) loss:2.673 lr:0.0000100 epoch_Time:24019.0min: [2024-01-06 08:05:40,905][model8_pretrain.py][INFO] Epoch:[0/2](814800/4588595) loss:3.047 lr:0.0000100 epoch_Time:24019.0min: [2024-01-06 08:05:40,905][model8_pretrain.py][INFO] Epoch:[0/2](814800/4588595) loss:2.823 lr:0.0000100 epoch_Time:24019.0min: [2024-01-06 08:05:40,905][model8_pretrain.py][INFO] Epoch:[0/2](814800/4588595) loss:2.898 lr:0.0000100 epoch_Time:24019.0min: [2024-01-06 
08:06:21,297][model8_pretrain.py][INFO] Epoch:[0/2](814900/4588595) loss:2.847 lr:0.0000100 epoch_Time:24018.0min: [2024-01-06 08:06:21,297][model8_pretrain.py][INFO] Epoch:[0/2](814900/4588595) loss:2.975 lr:0.0000100 epoch_Time:24018.0min: [2024-01-06 08:06:21,297][model8_pretrain.py][INFO] Epoch:[0/2](814900/4588595) loss:2.108 lr:0.0000100 epoch_Time:24018.0min: [2024-01-06 08:06:21,297][model8_pretrain.py][INFO] Epoch:[0/2](814900/4588595) loss:3.157 lr:0.0000100 epoch_Time:24018.0min: [2024-01-06 08:06:21,297][model8_pretrain.py][INFO] Epoch:[0/2](814900/4588595) loss:2.793 lr:0.0000100 epoch_Time:24018.0min: [2024-01-06 08:06:21,297][model8_pretrain.py][INFO] Epoch:[0/2](814900/4588595) loss:3.033 lr:0.0000100 epoch_Time:24018.0min: [2024-01-06 08:06:21,297][model8_pretrain.py][INFO] Epoch:[0/2](814900/4588595) loss:2.648 lr:0.0000100 epoch_Time:24018.0min: [2024-01-06 08:06:21,297][model8_pretrain.py][INFO] Epoch:[0/2](814900/4588595) loss:2.829 lr:0.0000100 epoch_Time:24018.0min: [2024-01-06 08:06:58,238][model8_pretrain.py][INFO] Epoch:[0/2](815000/4588595) loss:3.239 lr:0.0000100 epoch_Time:24017.0min: [2024-01-06 08:06:58,238][model8_pretrain.py][INFO] Epoch:[0/2](815000/4588595) loss:2.699 lr:0.0000100 epoch_Time:24017.0min: [2024-01-06 08:06:58,238][model8_pretrain.py][INFO] Epoch:[0/2](815000/4588595) loss:2.995 lr:0.0000100 epoch_Time:24017.0min: [2024-01-06 08:06:58,239][model8_pretrain.py][INFO] Epoch:[0/2](815000/4588595) loss:2.731 lr:0.0000100 epoch_Time:24017.0min: [2024-01-06 08:06:58,239][model8_pretrain.py][INFO] Epoch:[0/2](815000/4588595) loss:3.055 lr:0.0000100 epoch_Time:24017.0min: [2024-01-06 08:06:58,239][model8_pretrain.py][INFO] Epoch:[0/2](815000/4588595) loss:2.612 lr:0.0000100 epoch_Time:24017.0min: [2024-01-06 08:06:58,239][model8_pretrain.py][INFO] Epoch:[0/2](815000/4588595) loss:2.584 lr:0.0000100 epoch_Time:24017.0min: [2024-01-06 08:06:58,239][model8_pretrain.py][INFO] Epoch:[0/2](815000/4588595) loss:3.206 lr:0.0000100 
epoch_Time:24017.0min: [2024-01-06 08:07:35,176][model8_pretrain.py][INFO] Epoch:[0/2](815100/4588595) loss:3.075 lr:0.0000100 epoch_Time:24017.0min: [2024-01-06 08:07:35,176][model8_pretrain.py][INFO] Epoch:[0/2](815100/4588595) loss:2.705 lr:0.0000100 epoch_Time:24017.0min: [2024-01-06 08:07:35,176][model8_pretrain.py][INFO] Epoch:[0/2](815100/4588595) loss:3.301 lr:0.0000100 epoch_Time:24017.0min: [2024-01-06 08:07:35,176][model8_pretrain.py][INFO] Epoch:[0/2](815100/4588595) loss:3.423 lr:0.0000100 epoch_Time:24017.0min: [2024-01-06 08:07:35,176][model8_pretrain.py][INFO] Epoch:[0/2](815100/4588595) loss:2.926 lr:0.0000100 epoch_Time:24017.0min: [2024-01-06 08:07:35,177][model8_pretrain.py][INFO] Epoch:[0/2](815100/4588595) loss:3.149 lr:0.0000100 epoch_Time:24017.0min: [2024-01-06 08:07:35,177][model8_pretrain.py][INFO] Epoch:[0/2](815100/4588595) loss:3.093 lr:0.0000100 epoch_Time:24017.0min: [2024-01-06 08:07:35,177][model8_pretrain.py][INFO] Epoch:[0/2](815100/4588595) loss:3.137 lr:0.0000100 epoch_Time:24017.0min: [2024-01-06 08:08:12,137][model8_pretrain.py][INFO] Epoch:[0/2](815200/4588595) loss:3.062 lr:0.0000100 epoch_Time:24016.0min: [2024-01-06 08:08:12,137][model8_pretrain.py][INFO] Epoch:[0/2](815200/4588595) loss:2.930 lr:0.0000100 epoch_Time:24016.0min: [2024-01-06 08:08:12,137][model8_pretrain.py][INFO] Epoch:[0/2](815200/4588595) loss:2.985 lr:0.0000100 epoch_Time:24016.0min: [2024-01-06 08:08:12,137][model8_pretrain.py][INFO] Epoch:[0/2](815200/4588595) loss:3.066 lr:0.0000100 epoch_Time:24016.0min: [2024-01-06 08:08:12,137][model8_pretrain.py][INFO] Epoch:[0/2](815200/4588595) loss:2.552 lr:0.0000100 epoch_Time:24016.0min: [2024-01-06 08:08:12,137][model8_pretrain.py][INFO] Epoch:[0/2](815200/4588595) loss:3.118 lr:0.0000100 epoch_Time:24016.0min: [2024-01-06 08:08:12,137][model8_pretrain.py][INFO] Epoch:[0/2](815200/4588595) loss:2.459 lr:0.0000100 epoch_Time:24016.0min: [2024-01-06 08:08:12,137][model8_pretrain.py][INFO] 
Epoch:[0/2](815200/4588595) loss:3.290 lr:0.0000100 epoch_Time:24016.0min: [2024-01-06 08:08:49,075][model8_pretrain.py][INFO] Epoch:[0/2](815300/4588595) loss:2.997 lr:0.0000100 epoch_Time:24015.0min: [2024-01-06 08:08:49,075][model8_pretrain.py][INFO] Epoch:[0/2](815300/4588595) loss:2.575 lr:0.0000100 epoch_Time:24015.0min: [2024-01-06 08:08:49,076][model8_pretrain.py][INFO] Epoch:[0/2](815300/4588595) loss:2.951 lr:0.0000100 epoch_Time:24015.0min: [2024-01-06 08:08:49,076][model8_pretrain.py][INFO] Epoch:[0/2](815300/4588595) loss:2.716 lr:0.0000100 epoch_Time:24015.0min: [2024-01-06 08:08:49,075][model8_pretrain.py][INFO] Epoch:[0/2](815300/4588595) loss:2.574 lr:0.0000100 epoch_Time:24015.0min: [2024-01-06 08:08:49,076][model8_pretrain.py][INFO] Epoch:[0/2](815300/4588595) loss:2.539 lr:0.0000100 epoch_Time:24015.0min: [2024-01-06 08:08:49,076][model8_pretrain.py][INFO] Epoch:[0/2](815300/4588595) loss:2.153 lr:0.0000100 epoch_Time:24015.0min: [2024-01-06 08:08:49,076][model8_pretrain.py][INFO] Epoch:[0/2](815300/4588595) loss:2.729 lr:0.0000100 epoch_Time:24015.0min: [2024-01-06 08:09:26,006][model8_pretrain.py][INFO] Epoch:[0/2](815400/4588595) loss:3.083 lr:0.0000100 epoch_Time:24015.0min: [2024-01-06 08:09:26,006][model8_pretrain.py][INFO] Epoch:[0/2](815400/4588595) loss:3.132 lr:0.0000100 epoch_Time:24015.0min: [2024-01-06 08:09:26,006][model8_pretrain.py][INFO] Epoch:[0/2](815400/4588595) loss:2.814 lr:0.0000100 epoch_Time:24015.0min: [2024-01-06 08:09:26,006][model8_pretrain.py][INFO] Epoch:[0/2](815400/4588595) loss:2.714 lr:0.0000100 epoch_Time:24015.0min: [2024-01-06 08:09:26,006][model8_pretrain.py][INFO] Epoch:[0/2](815400/4588595) loss:2.769 lr:0.0000100 epoch_Time:24015.0min: [2024-01-06 08:09:26,006][model8_pretrain.py][INFO] Epoch:[0/2](815400/4588595) loss:2.922 lr:0.0000100 epoch_Time:24015.0min: [2024-01-06 08:09:26,006][model8_pretrain.py][INFO] Epoch:[0/2](815400/4588595) loss:2.953 lr:0.0000100 epoch_Time:24015.0min: [2024-01-06 
08:09:26,006][model8_pretrain.py][INFO] Epoch:[0/2](815400/4588595) loss:2.477 lr:0.0000100 epoch_Time:24015.0min: [2024-01-06 08:10:02,939][model8_pretrain.py][INFO] Epoch:[0/2](815500/4588595) loss:2.512 lr:0.0000100 epoch_Time:24014.0min: [2024-01-06 08:10:02,939][model8_pretrain.py][INFO] Epoch:[0/2](815500/4588595) loss:2.476 lr:0.0000100 epoch_Time:24014.0min: [2024-01-06 08:10:02,939][model8_pretrain.py][INFO] Epoch:[0/2](815500/4588595) loss:2.648 lr:0.0000100 epoch_Time:24014.0min: [2024-01-06 08:10:02,939][model8_pretrain.py][INFO] Epoch:[0/2](815500/4588595) loss:2.464 lr:0.0000100 epoch_Time:24014.0min: [2024-01-06 08:10:02,939][model8_pretrain.py][INFO] Epoch:[0/2](815500/4588595) loss:2.391 lr:0.0000100 epoch_Time:24014.0min: [2024-01-06 08:10:02,939][model8_pretrain.py][INFO] Epoch:[0/2](815500/4588595) loss:2.798 lr:0.0000100 epoch_Time:24014.0min: [2024-01-06 08:10:02,939][model8_pretrain.py][INFO] Epoch:[0/2](815500/4588595) loss:2.388 lr:0.0000100 epoch_Time:24014.0min: [2024-01-06 08:10:02,939][model8_pretrain.py][INFO] Epoch:[0/2](815500/4588595) loss:2.922 lr:0.0000100 epoch_Time:24014.0min: [2024-01-06 08:10:46,571][model8_pretrain.py][INFO] Epoch:[0/2](815600/4588595) loss:2.720 lr:0.0000100 epoch_Time:24014.0min: [2024-01-06 08:10:46,571][model8_pretrain.py][INFO] Epoch:[0/2](815600/4588595) loss:3.330 lr:0.0000100 epoch_Time:24014.0min: [2024-01-06 08:10:46,576][model8_pretrain.py][INFO] Epoch:[0/2](815600/4588595) loss:2.911 lr:0.0000100 epoch_Time:24014.0min: [2024-01-06 08:10:46,576][model8_pretrain.py][INFO] Epoch:[0/2](815600/4588595) loss:2.815 lr:0.0000100 epoch_Time:24014.0min: [2024-01-06 08:10:46,576][model8_pretrain.py][INFO] Epoch:[0/2](815600/4588595) loss:3.273 lr:0.0000100 epoch_Time:24014.0min: [2024-01-06 08:10:46,576][model8_pretrain.py][INFO] Epoch:[0/2](815600/4588595) loss:3.049 lr:0.0000100 epoch_Time:24014.0min: [2024-01-06 08:10:46,576][model8_pretrain.py][INFO] Epoch:[0/2](815600/4588595) loss:2.487 lr:0.0000100 
epoch_Time:24014.0min: [2024-01-06 08:10:46,576][model8_pretrain.py][INFO] Epoch:[0/2](815600/4588595) loss:2.100 lr:0.0000100 epoch_Time:24014.0min: [2024-01-06 08:11:26,981][model8_pretrain.py][INFO] Epoch:[0/2](815700/4588595) loss:2.511 lr:0.0000100 epoch_Time:24013.0min: [2024-01-06 08:11:26,981][model8_pretrain.py][INFO] Epoch:[0/2](815700/4588595) loss:3.264 lr:0.0000100 epoch_Time:24013.0min: [2024-01-06 08:11:26,981][model8_pretrain.py][INFO] Epoch:[0/2](815700/4588595) loss:2.822 lr:0.0000100 epoch_Time:24013.0min: [2024-01-06 08:11:26,981][model8_pretrain.py][INFO] Epoch:[0/2](815700/4588595) loss:2.676 lr:0.0000100 epoch_Time:24013.0min: [2024-01-06 08:11:26,981][model8_pretrain.py][INFO] Epoch:[0/2](815700/4588595) loss:2.643 lr:0.0000100 epoch_Time:24013.0min: [2024-01-06 08:11:26,981][model8_pretrain.py][INFO] Epoch:[0/2](815700/4588595) loss:2.940 lr:0.0000100 epoch_Time:24013.0min: [2024-01-06 08:11:26,981][model8_pretrain.py][INFO] Epoch:[0/2](815700/4588595) loss:2.659 lr:0.0000100 epoch_Time:24013.0min: [2024-01-06 08:11:26,981][model8_pretrain.py][INFO] Epoch:[0/2](815700/4588595) loss:2.811 lr:0.0000100 epoch_Time:24013.0min: [2024-01-06 08:12:03,922][model8_pretrain.py][INFO] Epoch:[0/2](815800/4588595) loss:2.348 lr:0.0000100 epoch_Time:24012.0min: [2024-01-06 08:12:03,922][model8_pretrain.py][INFO] Epoch:[0/2](815800/4588595) loss:3.339 lr:0.0000100 epoch_Time:24012.0min: [2024-01-06 08:12:03,922][model8_pretrain.py][INFO] Epoch:[0/2](815800/4588595) loss:2.362 lr:0.0000100 epoch_Time:24012.0min: [2024-01-06 08:12:03,922][model8_pretrain.py][INFO] Epoch:[0/2](815800/4588595) loss:2.365 lr:0.0000100 epoch_Time:24012.0min: [2024-01-06 08:12:03,922][model8_pretrain.py][INFO] Epoch:[0/2](815800/4588595) loss:2.984 lr:0.0000100 epoch_Time:24012.0min: [2024-01-06 08:12:03,922][model8_pretrain.py][INFO] Epoch:[0/2](815800/4588595) loss:2.683 lr:0.0000100 epoch_Time:24012.0min: [2024-01-06 08:12:03,922][model8_pretrain.py][INFO] 
Epoch:[0/2](815800/4588595) loss:3.166 lr:0.0000100 epoch_Time:24012.0min: [2024-01-06 08:12:03,922][model8_pretrain.py][INFO] Epoch:[0/2](815800/4588595) loss:3.028 lr:0.0000100 epoch_Time:24012.0min: [2024-01-06 08:12:40,864][model8_pretrain.py][INFO] Epoch:[0/2](815900/4588595) loss:3.214 lr:0.0000100 epoch_Time:24012.0min: [2024-01-06 08:12:40,865][model8_pretrain.py][INFO] Epoch:[0/2](815900/4588595) loss:3.138 lr:0.0000100 epoch_Time:24012.0min: [2024-01-06 08:12:40,865][model8_pretrain.py][INFO] Epoch:[0/2](815900/4588595) loss:2.657 lr:0.0000100 epoch_Time:24012.0min: [2024-01-06 08:12:40,865][model8_pretrain.py][INFO] Epoch:[0/2](815900/4588595) loss:3.148 lr:0.0000100 epoch_Time:24012.0min: [2024-01-06 08:12:40,865][model8_pretrain.py][INFO] Epoch:[0/2](815900/4588595) loss:2.803 lr:0.0000100 epoch_Time:24012.0min: [2024-01-06 08:12:40,865][model8_pretrain.py][INFO] Epoch:[0/2](815900/4588595) loss:2.871 lr:0.0000100 epoch_Time:24012.0min: [2024-01-06 08:12:40,865][model8_pretrain.py][INFO] Epoch:[0/2](815900/4588595) loss:2.441 lr:0.0000100 epoch_Time:24012.0min: [2024-01-06 08:12:40,865][model8_pretrain.py][INFO] Epoch:[0/2](815900/4588595) loss:2.626 lr:0.0000100 epoch_Time:24012.0min: [2024-01-06 08:13:17,807][model8_pretrain.py][INFO] Epoch:[0/2](816000/4588595) loss:2.933 lr:0.0000100 epoch_Time:24011.0min: [2024-01-06 08:13:17,807][model8_pretrain.py][INFO] Epoch:[0/2](816000/4588595) loss:3.305 lr:0.0000100 epoch_Time:24011.0min: [2024-01-06 08:13:17,807][model8_pretrain.py][INFO] Epoch:[0/2](816000/4588595) loss:2.882 lr:0.0000100 epoch_Time:24011.0min: [2024-01-06 08:13:17,807][model8_pretrain.py][INFO] Epoch:[0/2](816000/4588595) loss:2.852 lr:0.0000100 epoch_Time:24011.0min: [2024-01-06 08:13:17,807][model8_pretrain.py][INFO] Epoch:[0/2](816000/4588595) loss:2.723 lr:0.0000100 epoch_Time:24011.0min: [2024-01-06 08:13:17,807][model8_pretrain.py][INFO] Epoch:[0/2](816000/4588595) loss:2.338 lr:0.0000100 epoch_Time:24011.0min: [2024-01-06 
08:13:17,807][model8_pretrain.py][INFO] Epoch:[0/2](816000/4588595) loss:2.308 lr:0.0000100 epoch_Time:24011.0min: [2024-01-06 08:13:17,808][model8_pretrain.py][INFO] Epoch:[0/2](816000/4588595) loss:2.842 lr:0.0000100 epoch_Time:24011.0min: [2024-01-06 08:13:54,748][model8_pretrain.py][INFO] Epoch:[0/2](816100/4588595) loss:2.718 lr:0.0000100 epoch_Time:24010.0min: [2024-01-06 08:13:54,748][model8_pretrain.py][INFO] Epoch:[0/2](816100/4588595) loss:3.346 lr:0.0000100 epoch_Time:24010.0min: [2024-01-06 08:13:54,748][model8_pretrain.py][INFO] Epoch:[0/2](816100/4588595) loss:3.179 lr:0.0000100 epoch_Time:24010.0min: [2024-01-06 08:13:54,748][model8_pretrain.py][INFO] Epoch:[0/2](816100/4588595) loss:3.224 lr:0.0000100 epoch_Time:24010.0min: [2024-01-06 08:13:54,748][model8_pretrain.py][INFO] Epoch:[0/2](816100/4588595) loss:2.915 lr:0.0000100 epoch_Time:24010.0min: [2024-01-06 08:13:54,748][model8_pretrain.py][INFO] Epoch:[0/2](816100/4588595) loss:2.553 lr:0.0000100 epoch_Time:24010.0min: [2024-01-06 08:13:54,748][model8_pretrain.py][INFO] Epoch:[0/2](816100/4588595) loss:3.275 lr:0.0000100 epoch_Time:24010.0min: [2024-01-06 08:13:54,748][model8_pretrain.py][INFO] Epoch:[0/2](816100/4588595) loss:2.989 lr:0.0000100 epoch_Time:24010.0min: [2024-01-06 08:14:31,698][model8_pretrain.py][INFO] Epoch:[0/2](816200/4588595) loss:2.955 lr:0.0000100 epoch_Time:24010.0min: [2024-01-06 08:14:31,699][model8_pretrain.py][INFO] Epoch:[0/2](816200/4588595) loss:2.673 lr:0.0000100 epoch_Time:24010.0min: [2024-01-06 08:14:31,699][model8_pretrain.py][INFO] Epoch:[0/2](816200/4588595) loss:2.962 lr:0.0000100 epoch_Time:24010.0min: [2024-01-06 08:14:31,699][model8_pretrain.py][INFO] Epoch:[0/2](816200/4588595) loss:2.455 lr:0.0000100 epoch_Time:24010.0min: [2024-01-06 08:14:31,699][model8_pretrain.py][INFO] Epoch:[0/2](816200/4588595) loss:2.910 lr:0.0000100 epoch_Time:24010.0min: [2024-01-06 08:14:31,699][model8_pretrain.py][INFO] Epoch:[0/2](816200/4588595) loss:2.418 lr:0.0000100 
epoch_Time:24010.0min: [2024-01-06 08:14:31,699][model8_pretrain.py][INFO] Epoch:[0/2](816200/4588595) loss:2.744 lr:0.0000100 epoch_Time:24010.0min: [2024-01-06 08:14:31,699][model8_pretrain.py][INFO] Epoch:[0/2](816200/4588595) loss:3.117 lr:0.0000100 epoch_Time:24010.0min: [2024-01-06 08:15:08,643][model8_pretrain.py][INFO] Epoch:[0/2](816300/4588595) loss:3.064 lr:0.0000100 epoch_Time:24009.0min: [2024-01-06 08:15:08,643][model8_pretrain.py][INFO] Epoch:[0/2](816300/4588595) loss:2.609 lr:0.0000100 epoch_Time:24009.0min: [2024-01-06 08:15:08,643][model8_pretrain.py][INFO] Epoch:[0/2](816300/4588595) loss:2.771 lr:0.0000100 epoch_Time:24009.0min: [2024-01-06 08:15:08,643][model8_pretrain.py][INFO] Epoch:[0/2](816300/4588595) loss:3.030 lr:0.0000100 epoch_Time:24009.0min: [2024-01-06 08:15:08,643][model8_pretrain.py][INFO] Epoch:[0/2](816300/4588595) loss:2.844 lr:0.0000100 epoch_Time:24009.0min: [2024-01-06 08:15:08,644][model8_pretrain.py][INFO] Epoch:[0/2](816300/4588595) loss:1.913 lr:0.0000100 epoch_Time:24009.0min: [2024-01-06 08:15:08,644][model8_pretrain.py][INFO] Epoch:[0/2](816300/4588595) loss:2.972 lr:0.0000100 epoch_Time:24009.0min: [2024-01-06 08:15:08,644][model8_pretrain.py][INFO] Epoch:[0/2](816300/4588595) loss:2.688 lr:0.0000100 epoch_Time:24009.0min: [2024-01-06 08:15:50,826][model8_pretrain.py][INFO] Epoch:[0/2](816400/4588595) loss:2.820 lr:0.0000100 epoch_Time:24008.0min: [2024-01-06 08:15:50,826][model8_pretrain.py][INFO] Epoch:[0/2](816400/4588595) loss:2.046 lr:0.0000100 epoch_Time:24008.0min: [2024-01-06 08:15:50,826][model8_pretrain.py][INFO] Epoch:[0/2](816400/4588595) loss:3.148 lr:0.0000100 epoch_Time:24008.0min: [2024-01-06 08:15:50,826][model8_pretrain.py][INFO] Epoch:[0/2](816400/4588595) loss:2.630 lr:0.0000100 epoch_Time:24008.0min: [2024-01-06 08:15:50,826][model8_pretrain.py][INFO] Epoch:[0/2](816400/4588595) loss:2.646 lr:0.0000100 epoch_Time:24008.0min: [2024-01-06 08:15:50,826][model8_pretrain.py][INFO] 
Epoch:[0/2](816400/4588595) loss:3.037 lr:0.0000100 epoch_Time:24008.0min: [2024-01-06 08:15:50,826][model8_pretrain.py][INFO] Epoch:[0/2](816400/4588595) loss:2.680 lr:0.0000100 epoch_Time:24008.0min: [2024-01-06 08:15:50,827][model8_pretrain.py][INFO] Epoch:[0/2](816400/4588595) loss:2.681 lr:0.0000100 epoch_Time:24008.0min: [2024-01-06 08:16:32,723][model8_pretrain.py][INFO] Epoch:[0/2](816500/4588595) loss:2.515 lr:0.0000100 epoch_Time:24008.0min: [2024-01-06 08:16:32,723][model8_pretrain.py][INFO] Epoch:[0/2](816500/4588595) loss:3.196 lr:0.0000100 epoch_Time:24008.0min: [2024-01-06 08:16:32,723][model8_pretrain.py][INFO] Epoch:[0/2](816500/4588595) loss:3.210 lr:0.0000100 epoch_Time:24008.0min: [2024-01-06 08:16:32,723][model8_pretrain.py][INFO] Epoch:[0/2](816500/4588595) loss:2.729 lr:0.0000100 epoch_Time:24008.0min: [2024-01-06 08:16:32,723][model8_pretrain.py][INFO] Epoch:[0/2](816500/4588595) loss:2.545 lr:0.0000100 epoch_Time:24008.0min: [2024-01-06 08:16:32,723][model8_pretrain.py][INFO] Epoch:[0/2](816500/4588595) loss:2.498 lr:0.0000100 epoch_Time:24008.0min: [2024-01-06 08:16:32,723][model8_pretrain.py][INFO] Epoch:[0/2](816500/4588595) loss:3.045 lr:0.0000100 epoch_Time:24008.0min: [2024-01-06 08:16:32,723][model8_pretrain.py][INFO] Epoch:[0/2](816500/4588595) loss:2.541 lr:0.0000100 epoch_Time:24008.0min: [2024-01-06 08:17:09,666][model8_pretrain.py][INFO] Epoch:[0/2](816600/4588595) loss:2.923 lr:0.0000100 epoch_Time:24007.0min: [2024-01-06 08:17:09,666][model8_pretrain.py][INFO] Epoch:[0/2](816600/4588595) loss:3.096 lr:0.0000100 epoch_Time:24007.0min: [2024-01-06 08:17:09,666][model8_pretrain.py][INFO] Epoch:[0/2](816600/4588595) loss:2.650 lr:0.0000100 epoch_Time:24007.0min: [2024-01-06 08:17:09,666][model8_pretrain.py][INFO] Epoch:[0/2](816600/4588595) loss:3.456 lr:0.0000100 epoch_Time:24007.0min: [2024-01-06 08:17:09,666][model8_pretrain.py][INFO] Epoch:[0/2](816600/4588595) loss:2.649 lr:0.0000100 epoch_Time:24007.0min: [2024-01-06 
08:17:09,666][model8_pretrain.py][INFO] Epoch:[0/2](816600/4588595) loss:3.100 lr:0.0000100 epoch_Time:24007.0min: [2024-01-06 08:17:09,666][model8_pretrain.py][INFO] Epoch:[0/2](816600/4588595) loss:2.577 lr:0.0000100 epoch_Time:24007.0min: [2024-01-06 08:17:09,666][model8_pretrain.py][INFO] Epoch:[0/2](816600/4588595) loss:2.690 lr:0.0000100 epoch_Time:24007.0min: [2024-01-06 08:17:46,604][model8_pretrain.py][INFO] Epoch:[0/2](816700/4588595) loss:2.878 lr:0.0000100 epoch_Time:24007.0min: [2024-01-06 08:17:46,604][model8_pretrain.py][INFO] Epoch:[0/2](816700/4588595) loss:2.588 lr:0.0000100 epoch_Time:24007.0min: [2024-01-06 08:17:46,604][model8_pretrain.py][INFO] Epoch:[0/2](816700/4588595) loss:2.861 lr:0.0000100 epoch_Time:24007.0min: [2024-01-06 08:17:46,604][model8_pretrain.py][INFO] Epoch:[0/2](816700/4588595) loss:3.377 lr:0.0000100 epoch_Time:24007.0min: [2024-01-06 08:17:46,604][model8_pretrain.py][INFO] Epoch:[0/2](816700/4588595) loss:3.133 lr:0.0000100 epoch_Time:24007.0min: [2024-01-06 08:17:46,604][model8_pretrain.py][INFO] Epoch:[0/2](816700/4588595) loss:2.887 lr:0.0000100 epoch_Time:24007.0min: [2024-01-06 08:17:46,604][model8_pretrain.py][INFO] Epoch:[0/2](816700/4588595) loss:2.720 lr:0.0000100 epoch_Time:24007.0min: [2024-01-06 08:17:46,604][model8_pretrain.py][INFO] Epoch:[0/2](816700/4588595) loss:2.712 lr:0.0000100 epoch_Time:24007.0min: [2024-01-06 08:18:23,553][model8_pretrain.py][INFO] Epoch:[0/2](816800/4588595) loss:2.315 lr:0.0000100 epoch_Time:24006.0min: [2024-01-06 08:18:23,553][model8_pretrain.py][INFO] Epoch:[0/2](816800/4588595) loss:3.073 lr:0.0000100 epoch_Time:24006.0min: [2024-01-06 08:18:23,553][model8_pretrain.py][INFO] Epoch:[0/2](816800/4588595) loss:2.805 lr:0.0000100 epoch_Time:24006.0min: [2024-01-06 08:18:23,553][model8_pretrain.py][INFO] Epoch:[0/2](816800/4588595) loss:2.460 lr:0.0000100 epoch_Time:24006.0min: [2024-01-06 08:18:23,553][model8_pretrain.py][INFO] Epoch:[0/2](816800/4588595) loss:2.986 lr:0.0000100 
epoch_Time:24006.0min: [2024-01-06 08:18:23,553][model8_pretrain.py][INFO] Epoch:[0/2](816800/4588595) loss:3.119 lr:0.0000100 epoch_Time:24006.0min: [2024-01-06 08:18:23,553][model8_pretrain.py][INFO] Epoch:[0/2](816800/4588595) loss:2.970 lr:0.0000100 epoch_Time:24006.0min: [2024-01-06 08:18:23,553][model8_pretrain.py][INFO] Epoch:[0/2](816800/4588595) loss:2.543 lr:0.0000100 epoch_Time:24006.0min: [2024-01-06 08:19:00,497][model8_pretrain.py][INFO] Epoch:[0/2](816900/4588595) loss:2.788 lr:0.0000100 epoch_Time:24005.0min: [2024-01-06 08:19:00,497][model8_pretrain.py][INFO] Epoch:[0/2](816900/4588595) loss:2.381 lr:0.0000100 epoch_Time:24005.0min: [2024-01-06 08:19:00,497][model8_pretrain.py][INFO] Epoch:[0/2](816900/4588595) loss:3.338 lr:0.0000100 epoch_Time:24005.0min: [2024-01-06 08:19:00,497][model8_pretrain.py][INFO] Epoch:[0/2](816900/4588595) loss:3.387 lr:0.0000100 epoch_Time:24005.0min: [2024-01-06 08:19:00,497][model8_pretrain.py][INFO] Epoch:[0/2](816900/4588595) loss:3.108 lr:0.0000100 epoch_Time:24005.0min: [2024-01-06 08:19:00,498][model8_pretrain.py][INFO] Epoch:[0/2](816900/4588595) loss:2.902 lr:0.0000100 epoch_Time:24005.0min: [2024-01-06 08:19:00,498][model8_pretrain.py][INFO] Epoch:[0/2](816900/4588595) loss:2.866 lr:0.0000100 epoch_Time:24005.0min: [2024-01-06 08:19:00,498][model8_pretrain.py][INFO] Epoch:[0/2](816900/4588595) loss:3.106 lr:0.0000100 epoch_Time:24005.0min: [2024-01-06 08:19:37,455][model8_pretrain.py][INFO] Epoch:[0/2](817000/4588595) loss:2.141 lr:0.0000100 epoch_Time:24005.0min: [2024-01-06 08:19:37,455][model8_pretrain.py][INFO] Epoch:[0/2](817000/4588595) loss:2.583 lr:0.0000100 epoch_Time:24005.0min: [2024-01-06 08:19:37,455][model8_pretrain.py][INFO] Epoch:[0/2](817000/4588595) loss:3.209 lr:0.0000100 epoch_Time:24005.0min: [2024-01-06 08:19:37,455][model8_pretrain.py][INFO] Epoch:[0/2](817000/4588595) loss:2.493 lr:0.0000100 epoch_Time:24005.0min: [2024-01-06 08:19:37,455][model8_pretrain.py][INFO] 
Epoch:[0/2](817000/4588595) loss:2.409 lr:0.0000100 epoch_Time:24005.0min: [2024-01-06 08:19:37,455][model8_pretrain.py][INFO] Epoch:[0/2](817000/4588595) loss:2.639 lr:0.0000100 epoch_Time:24005.0min: [2024-01-06 08:19:37,455][model8_pretrain.py][INFO] Epoch:[0/2](817000/4588595) loss:2.539 lr:0.0000100 epoch_Time:24005.0min: [2024-01-06 08:19:37,456][model8_pretrain.py][INFO] Epoch:[0/2](817000/4588595) loss:2.481 lr:0.0000100 epoch_Time:24005.0min: [2024-01-06 08:20:14,422][model8_pretrain.py][INFO] Epoch:[0/2](817100/4588595) loss:2.492 lr:0.0000100 epoch_Time:24004.0min: [2024-01-06 08:20:14,422][model8_pretrain.py][INFO] Epoch:[0/2](817100/4588595) loss:2.665 lr:0.0000100 epoch_Time:24004.0min: [2024-01-06 08:20:14,422][model8_pretrain.py][INFO] Epoch:[0/2](817100/4588595) loss:2.380 lr:0.0000100 epoch_Time:24004.0min: [2024-01-06 08:20:14,422][model8_pretrain.py][INFO] Epoch:[0/2](817100/4588595) loss:3.002 lr:0.0000100 epoch_Time:24004.0min: [2024-01-06 08:20:14,422][model8_pretrain.py][INFO] Epoch:[0/2](817100/4588595) loss:3.227 lr:0.0000100 epoch_Time:24004.0min: [2024-01-06 08:20:14,423][model8_pretrain.py][INFO] Epoch:[0/2](817100/4588595) loss:2.522 lr:0.0000100 epoch_Time:24004.0min: [2024-01-06 08:20:14,423][model8_pretrain.py][INFO] Epoch:[0/2](817100/4588595) loss:2.941 lr:0.0000100 epoch_Time:24004.0min: [2024-01-06 08:20:14,423][model8_pretrain.py][INFO] Epoch:[0/2](817100/4588595) loss:2.891 lr:0.0000100 epoch_Time:24004.0min: [2024-01-06 08:20:56,657][model8_pretrain.py][INFO] Epoch:[0/2](817200/4588595) loss:2.860 lr:0.0000100 epoch_Time:24003.0min: [2024-01-06 08:20:56,657][model8_pretrain.py][INFO] Epoch:[0/2](817200/4588595) loss:2.556 lr:0.0000100 epoch_Time:24003.0min: [2024-01-06 08:20:56,657][model8_pretrain.py][INFO] Epoch:[0/2](817200/4588595) loss:2.672 lr:0.0000100 epoch_Time:24003.0min: [2024-01-06 08:20:56,657][model8_pretrain.py][INFO] Epoch:[0/2](817200/4588595) loss:3.122 lr:0.0000100 epoch_Time:24003.0min: [2024-01-06 
08:20:56,657][model8_pretrain.py][INFO] Epoch:[0/2](817200/4588595) loss:2.659 lr:0.0000100 epoch_Time:24003.0min: [2024-01-06 08:20:56,657][model8_pretrain.py][INFO] Epoch:[0/2](817200/4588595) loss:2.280 lr:0.0000100 epoch_Time:24003.0min: [2024-01-06 08:20:56,657][model8_pretrain.py][INFO] Epoch:[0/2](817200/4588595) loss:2.582 lr:0.0000100 epoch_Time:24003.0min: [2024-01-06 08:20:56,657][model8_pretrain.py][INFO] Epoch:[0/2](817200/4588595) loss:3.129 lr:0.0000100 epoch_Time:24003.0min: [2024-01-06 08:21:38,545][model8_pretrain.py][INFO] Epoch:[0/2](817300/4588595) loss:2.500 lr:0.0000100 epoch_Time:24003.0min: [2024-01-06 08:21:38,545][model8_pretrain.py][INFO] Epoch:[0/2](817300/4588595) loss:2.652 lr:0.0000100 epoch_Time:24003.0min: [2024-01-06 08:21:38,545][model8_pretrain.py][INFO] Epoch:[0/2](817300/4588595) loss:2.744 lr:0.0000100 epoch_Time:24003.0min: [2024-01-06 08:21:38,545][model8_pretrain.py][INFO] Epoch:[0/2](817300/4588595) loss:3.015 lr:0.0000100 epoch_Time:24003.0min: [2024-01-06 08:21:38,545][model8_pretrain.py][INFO] Epoch:[0/2](817300/4588595) loss:2.783 lr:0.0000100 epoch_Time:24003.0min: [2024-01-06 08:21:38,545][model8_pretrain.py][INFO] Epoch:[0/2](817300/4588595) loss:2.585 lr:0.0000100 epoch_Time:24003.0min: [2024-01-06 08:21:38,545][model8_pretrain.py][INFO] Epoch:[0/2](817300/4588595) loss:2.706 lr:0.0000100 epoch_Time:24003.0min: [2024-01-06 08:21:38,546][model8_pretrain.py][INFO] Epoch:[0/2](817300/4588595) loss:2.758 lr:0.0000100 epoch_Time:24003.0min: [2024-01-06 08:22:15,479][model8_pretrain.py][INFO] Epoch:[0/2](817400/4588595) loss:3.232 lr:0.0000100 epoch_Time:24002.0min: [2024-01-06 08:22:15,479][model8_pretrain.py][INFO] Epoch:[0/2](817400/4588595) loss:2.695 lr:0.0000100 epoch_Time:24002.0min: [2024-01-06 08:22:15,479][model8_pretrain.py][INFO] Epoch:[0/2](817400/4588595) loss:2.754 lr:0.0000100 epoch_Time:24002.0min: [2024-01-06 08:22:15,479][model8_pretrain.py][INFO] Epoch:[0/2](817400/4588595) loss:2.707 lr:0.0000100 
epoch_Time:24002.0min: [2024-01-06 08:22:15,479][model8_pretrain.py][INFO] Epoch:[0/2](817400/4588595) loss:2.541 lr:0.0000100 epoch_Time:24002.0min: [2024-01-06 08:22:15,479][model8_pretrain.py][INFO] Epoch:[0/2](817400/4588595) loss:2.821 lr:0.0000100 epoch_Time:24002.0min: [2024-01-06 08:22:15,479][model8_pretrain.py][INFO] Epoch:[0/2](817400/4588595) loss:3.272 lr:0.0000100 epoch_Time:24002.0min: [2024-01-06 08:22:15,479][model8_pretrain.py][INFO] Epoch:[0/2](817400/4588595) loss:3.207 lr:0.0000100 epoch_Time:24002.0min: [2024-01-06 08:22:52,436][model8_pretrain.py][INFO] Epoch:[0/2](817500/4588595) loss:2.681 lr:0.0000100 epoch_Time:24001.0min: [2024-01-06 08:22:52,436][model8_pretrain.py][INFO] Epoch:[0/2](817500/4588595) loss:2.817 lr:0.0000100 epoch_Time:24001.0min: [2024-01-06 08:22:52,436][model8_pretrain.py][INFO] Epoch:[0/2](817500/4588595) loss:2.807 lr:0.0000100 epoch_Time:24001.0min: [2024-01-06 08:22:52,436][model8_pretrain.py][INFO] Epoch:[0/2](817500/4588595) loss:2.901 lr:0.0000100 epoch_Time:24001.0min: [2024-01-06 08:22:52,436][model8_pretrain.py][INFO] Epoch:[0/2](817500/4588595) loss:2.295 lr:0.0000100 epoch_Time:24001.0min: [2024-01-06 08:22:52,436][model8_pretrain.py][INFO] Epoch:[0/2](817500/4588595) loss:3.264 lr:0.0000100 epoch_Time:24001.0min: [2024-01-06 08:22:52,436][model8_pretrain.py][INFO] Epoch:[0/2](817500/4588595) loss:3.110 lr:0.0000100 epoch_Time:24001.0min: [2024-01-06 08:22:52,437][model8_pretrain.py][INFO] Epoch:[0/2](817500/4588595) loss:2.689 lr:0.0000100 epoch_Time:24001.0min: [2024-01-06 08:23:29,391][model8_pretrain.py][INFO] Epoch:[0/2](817600/4588595) loss:3.024 lr:0.0000100 epoch_Time:24001.0min: [2024-01-06 08:23:29,391][model8_pretrain.py][INFO] Epoch:[0/2](817600/4588595) loss:2.965 lr:0.0000100 epoch_Time:24001.0min: [2024-01-06 08:23:29,391][model8_pretrain.py][INFO] Epoch:[0/2](817600/4588595) loss:3.145 lr:0.0000100 epoch_Time:24001.0min: [2024-01-06 08:23:29,391][model8_pretrain.py][INFO] 
Epoch:[0/2](817600/4588595) loss:2.393 lr:0.0000100 epoch_Time:24001.0min: [2024-01-06 08:23:29,391][model8_pretrain.py][INFO] Epoch:[0/2](817600/4588595) loss:3.166 lr:0.0000100 epoch_Time:24001.0min: [2024-01-06 08:23:29,391][model8_pretrain.py][INFO] Epoch:[0/2](817600/4588595) loss:2.723 lr:0.0000100 epoch_Time:24001.0min: [2024-01-06 08:23:29,391][model8_pretrain.py][INFO] Epoch:[0/2](817600/4588595) loss:2.817 lr:0.0000100 epoch_Time:24001.0min: [2024-01-06 08:23:29,391][model8_pretrain.py][INFO] Epoch:[0/2](817600/4588595) loss:2.864 lr:0.0000100 epoch_Time:24001.0min: [2024-01-06 08:24:06,336][model8_pretrain.py][INFO] Epoch:[0/2](817700/4588595) loss:2.983 lr:0.0000100 epoch_Time:24000.0min: [2024-01-06 08:24:06,336][model8_pretrain.py][INFO] Epoch:[0/2](817700/4588595) loss:2.645 lr:0.0000100 epoch_Time:24000.0min: [2024-01-06 08:24:06,336][model8_pretrain.py][INFO] Epoch:[0/2](817700/4588595) loss:2.695 lr:0.0000100 epoch_Time:24000.0min: [2024-01-06 08:24:06,336][model8_pretrain.py][INFO] Epoch:[0/2](817700/4588595) loss:2.853 lr:0.0000100 epoch_Time:24000.0min: [2024-01-06 08:24:06,336][model8_pretrain.py][INFO] Epoch:[0/2](817700/4588595) loss:3.066 lr:0.0000100 epoch_Time:24000.0min: [2024-01-06 08:24:06,336][model8_pretrain.py][INFO] Epoch:[0/2](817700/4588595) loss:2.941 lr:0.0000100 epoch_Time:24000.0min: [2024-01-06 08:24:06,336][model8_pretrain.py][INFO] Epoch:[0/2](817700/4588595) loss:2.479 lr:0.0000100 epoch_Time:24000.0min: [2024-01-06 08:24:06,336][model8_pretrain.py][INFO] Epoch:[0/2](817700/4588595) loss:3.059 lr:0.0000100 epoch_Time:24000.0min: [2024-01-06 08:24:43,285][model8_pretrain.py][INFO] Epoch:[0/2](817800/4588595) loss:3.164 lr:0.0000100 epoch_Time:24000.0min: [2024-01-06 08:24:43,285][model8_pretrain.py][INFO] Epoch:[0/2](817800/4588595) loss:2.643 lr:0.0000100 epoch_Time:24000.0min: [2024-01-06 08:24:43,285][model8_pretrain.py][INFO] Epoch:[0/2](817800/4588595) loss:2.995 lr:0.0000100 epoch_Time:24000.0min: [2024-01-06 
08:24:43,285][model8_pretrain.py][INFO] Epoch:[0/2](817800/4588595) loss:2.750 lr:0.0000100 epoch_Time:24000.0min: [2024-01-06 08:24:43,285][model8_pretrain.py][INFO] Epoch:[0/2](817800/4588595) loss:3.134 lr:0.0000100 epoch_Time:24000.0min: [2024-01-06 08:24:43,285][model8_pretrain.py][INFO] Epoch:[0/2](817800/4588595) loss:2.759 lr:0.0000100 epoch_Time:24000.0min: [2024-01-06 08:24:43,286][model8_pretrain.py][INFO] Epoch:[0/2](817800/4588595) loss:3.255 lr:0.0000100 epoch_Time:24000.0min: [2024-01-06 08:24:43,286][model8_pretrain.py][INFO] Epoch:[0/2](817800/4588595) loss:2.951 lr:0.0000100 epoch_Time:24000.0min: [2024-01-06 08:25:20,232][model8_pretrain.py][INFO] Epoch:[0/2](817900/4588595) loss:2.359 lr:0.0000100 epoch_Time:23999.0min: [2024-01-06 08:25:20,232][model8_pretrain.py][INFO] Epoch:[0/2](817900/4588595) loss:2.552 lr:0.0000100 epoch_Time:23999.0min: [2024-01-06 08:25:20,232][model8_pretrain.py][INFO] Epoch:[0/2](817900/4588595) loss:2.894 lr:0.0000100 epoch_Time:23999.0min: [2024-01-06 08:25:20,232][model8_pretrain.py][INFO] Epoch:[0/2](817900/4588595) loss:2.869 lr:0.0000100 epoch_Time:23999.0min: [2024-01-06 08:25:20,232][model8_pretrain.py][INFO] Epoch:[0/2](817900/4588595) loss:2.727 lr:0.0000100 epoch_Time:23999.0min: [2024-01-06 08:25:20,232][model8_pretrain.py][INFO] Epoch:[0/2](817900/4588595) loss:2.904 lr:0.0000100 epoch_Time:23999.0min: [2024-01-06 08:25:20,232][model8_pretrain.py][INFO] Epoch:[0/2](817900/4588595) loss:2.923 lr:0.0000100 epoch_Time:23999.0min: [2024-01-06 08:25:20,232][model8_pretrain.py][INFO] Epoch:[0/2](817900/4588595) loss:3.324 lr:0.0000100 epoch_Time:23999.0min: [2024-01-06 08:26:02,386][model8_pretrain.py][INFO] Epoch:[0/2](818000/4588595) loss:3.467 lr:0.0000100 epoch_Time:23998.0min: [2024-01-06 08:26:02,386][model8_pretrain.py][INFO] Epoch:[0/2](818000/4588595) loss:2.683 lr:0.0000100 epoch_Time:23998.0min: [2024-01-06 08:26:02,386][model8_pretrain.py][INFO] Epoch:[0/2](818000/4588595) loss:2.996 lr:0.0000100 
epoch_Time:23998.0min: [2024-01-06 08:26:02,386][model8_pretrain.py][INFO] Epoch:[0/2](818000/4588595) loss:2.781 lr:0.0000100 epoch_Time:23998.0min: [2024-01-06 08:26:02,386][model8_pretrain.py][INFO] Epoch:[0/2](818000/4588595) loss:3.384 lr:0.0000100 epoch_Time:23998.0min: [2024-01-06 08:26:02,386][model8_pretrain.py][INFO] Epoch:[0/2](818000/4588595) loss:3.109 lr:0.0000100 epoch_Time:23998.0min: [2024-01-06 08:26:02,386][model8_pretrain.py][INFO] Epoch:[0/2](818000/4588595) loss:3.189 lr:0.0000100 epoch_Time:23998.0min: [2024-01-06 08:26:02,386][model8_pretrain.py][INFO] Epoch:[0/2](818000/4588595) loss:2.563 lr:0.0000100 epoch_Time:23998.0min: [2024-01-06 08:26:44,577][model8_pretrain.py][INFO] Epoch:[0/2](818100/4588595) loss:3.319 lr:0.0000100 epoch_Time:23998.0min: [2024-01-06 08:26:44,577][model8_pretrain.py][INFO] Epoch:[0/2](818100/4588595) loss:2.707 lr:0.0000100 epoch_Time:23998.0min: [2024-01-06 08:26:44,577][model8_pretrain.py][INFO] Epoch:[0/2](818100/4588595) loss:2.923 lr:0.0000100 epoch_Time:23998.0min: [2024-01-06 08:26:44,577][model8_pretrain.py][INFO] Epoch:[0/2](818100/4588595) loss:3.064 lr:0.0000100 epoch_Time:23998.0min: [2024-01-06 08:26:44,577][model8_pretrain.py][INFO] Epoch:[0/2](818100/4588595) loss:2.912 lr:0.0000100 epoch_Time:23998.0min: [2024-01-06 08:26:44,577][model8_pretrain.py][INFO] Epoch:[0/2](818100/4588595) loss:2.459 lr:0.0000100 epoch_Time:23998.0min: [2024-01-06 08:26:44,577][model8_pretrain.py][INFO] Epoch:[0/2](818100/4588595) loss:3.116 lr:0.0000100 epoch_Time:23998.0min: [2024-01-06 08:26:44,577][model8_pretrain.py][INFO] Epoch:[0/2](818100/4588595) loss:2.627 lr:0.0000100 epoch_Time:23998.0min: [2024-01-06 08:27:21,487][model8_pretrain.py][INFO] Epoch:[0/2](818200/4588595) loss:3.163 lr:0.0000100 epoch_Time:23997.0min: [2024-01-06 08:27:21,487][model8_pretrain.py][INFO] Epoch:[0/2](818200/4588595) loss:2.827 lr:0.0000100 epoch_Time:23997.0min: [2024-01-06 08:27:21,487][model8_pretrain.py][INFO] 
Epoch:[0/2](818200/4588595) loss:2.360 lr:0.0000100 epoch_Time:23997.0min: [2024-01-06 08:27:21,487][model8_pretrain.py][INFO] Epoch:[0/2](818200/4588595) loss:3.062 lr:0.0000100 epoch_Time:23997.0min: [2024-01-06 08:27:21,487][model8_pretrain.py][INFO] Epoch:[0/2](818200/4588595) loss:2.855 lr:0.0000100 epoch_Time:23997.0min: [2024-01-06 08:27:21,487][model8_pretrain.py][INFO] Epoch:[0/2](818200/4588595) loss:2.736 lr:0.0000100 epoch_Time:23997.0min: [2024-01-06 08:27:21,487][model8_pretrain.py][INFO] Epoch:[0/2](818200/4588595) loss:3.083 lr:0.0000100 epoch_Time:23997.0min: [2024-01-06 08:27:21,487][model8_pretrain.py][INFO] Epoch:[0/2](818200/4588595) loss:3.054 lr:0.0000100 epoch_Time:23997.0min: [2024-01-06 08:27:58,437][model8_pretrain.py][INFO] Epoch:[0/2](818300/4588595) loss:2.727 lr:0.0000100 epoch_Time:23996.0min: [2024-01-06 08:27:58,437][model8_pretrain.py][INFO] Epoch:[0/2](818300/4588595) loss:2.237 lr:0.0000100 epoch_Time:23996.0min: [2024-01-06 08:27:58,437][model8_pretrain.py][INFO] Epoch:[0/2](818300/4588595) loss:2.726 lr:0.0000100 epoch_Time:23996.0min: [2024-01-06 08:27:58,437][model8_pretrain.py][INFO] Epoch:[0/2](818300/4588595) loss:3.152 lr:0.0000100 epoch_Time:23996.0min: [2024-01-06 08:27:58,437][model8_pretrain.py][INFO] Epoch:[0/2](818300/4588595) loss:2.650 lr:0.0000100 epoch_Time:23996.0min: [2024-01-06 08:27:58,437][model8_pretrain.py][INFO] Epoch:[0/2](818300/4588595) loss:3.062 lr:0.0000100 epoch_Time:23996.0min: [2024-01-06 08:27:58,438][model8_pretrain.py][INFO] Epoch:[0/2](818300/4588595) loss:3.014 lr:0.0000100 epoch_Time:23996.0min: [2024-01-06 08:27:58,438][model8_pretrain.py][INFO] Epoch:[0/2](818300/4588595) loss:2.899 lr:0.0000100 epoch_Time:23996.0min: [2024-01-06 08:28:35,376][model8_pretrain.py][INFO] Epoch:[0/2](818400/4588595) loss:2.767 lr:0.0000100 epoch_Time:23996.0min: [2024-01-06 08:28:35,376][model8_pretrain.py][INFO] Epoch:[0/2](818400/4588595) loss:2.399 lr:0.0000100 epoch_Time:23996.0min: [2024-01-06 
08:28:35,376][model8_pretrain.py][INFO] Epoch:[0/2](818400/4588595) loss:3.157 lr:0.0000100 epoch_Time:23996.0min: [2024-01-06 08:28:35,376][model8_pretrain.py][INFO] Epoch:[0/2](818400/4588595) loss:2.491 lr:0.0000100 epoch_Time:23996.0min: [2024-01-06 08:28:35,376][model8_pretrain.py][INFO] Epoch:[0/2](818400/4588595) loss:2.126 lr:0.0000100 epoch_Time:23996.0min: [2024-01-06 08:28:35,376][model8_pretrain.py][INFO] Epoch:[0/2](818400/4588595) loss:2.583 lr:0.0000100 epoch_Time:23996.0min: [2024-01-06 08:28:35,376][model8_pretrain.py][INFO] Epoch:[0/2](818400/4588595) loss:2.944 lr:0.0000100 epoch_Time:23996.0min: [2024-01-06 08:28:35,376][model8_pretrain.py][INFO] Epoch:[0/2](818400/4588595) loss:2.594 lr:0.0000100 epoch_Time:23996.0min: [2024-01-06 08:29:12,393][model8_pretrain.py][INFO] Epoch:[0/2](818500/4588595) loss:2.487 lr:0.0000100 epoch_Time:23995.0min: [2024-01-06 08:29:12,393][model8_pretrain.py][INFO] Epoch:[0/2](818500/4588595) loss:2.950 lr:0.0000100 epoch_Time:23995.0min: [2024-01-06 08:29:12,393][model8_pretrain.py][INFO] Epoch:[0/2](818500/4588595) loss:2.473 lr:0.0000100 epoch_Time:23995.0min: [2024-01-06 08:29:12,393][model8_pretrain.py][INFO] Epoch:[0/2](818500/4588595) loss:2.595 lr:0.0000100 epoch_Time:23995.0min: [2024-01-06 08:29:12,393][model8_pretrain.py][INFO] Epoch:[0/2](818500/4588595) loss:2.770 lr:0.0000100 epoch_Time:23995.0min: [2024-01-06 08:29:12,393][model8_pretrain.py][INFO] Epoch:[0/2](818500/4588595) loss:3.097 lr:0.0000100 epoch_Time:23995.0min: [2024-01-06 08:29:12,393][model8_pretrain.py][INFO] Epoch:[0/2](818500/4588595) loss:2.914 lr:0.0000100 epoch_Time:23995.0min: [2024-01-06 08:29:12,393][model8_pretrain.py][INFO] Epoch:[0/2](818500/4588595) loss:3.039 lr:0.0000100 epoch_Time:23995.0min: [2024-01-06 08:29:49,329][model8_pretrain.py][INFO] Epoch:[0/2](818600/4588595) loss:3.024 lr:0.0000100 epoch_Time:23994.0min: [2024-01-06 08:29:49,329][model8_pretrain.py][INFO] Epoch:[0/2](818600/4588595) loss:3.181 lr:0.0000100 
epoch_Time:23994.0min: [2024-01-06 08:29:49,329][model8_pretrain.py][INFO] Epoch:[0/2](818600/4588595) loss:2.672 lr:0.0000100 epoch_Time:23994.0min: [2024-01-06 08:29:49,329][model8_pretrain.py][INFO] Epoch:[0/2](818600/4588595) loss:3.402 lr:0.0000100 epoch_Time:23994.0min: [2024-01-06 08:29:49,329][model8_pretrain.py][INFO] Epoch:[0/2](818600/4588595) loss:2.648 lr:0.0000100 epoch_Time:23994.0min: [2024-01-06 08:29:49,329][model8_pretrain.py][INFO] Epoch:[0/2](818600/4588595) loss:2.860 lr:0.0000100 epoch_Time:23994.0min: [2024-01-06 08:29:49,329][model8_pretrain.py][INFO] Epoch:[0/2](818600/4588595) loss:3.317 lr:0.0000100 epoch_Time:23994.0min: [2024-01-06 08:29:49,330][model8_pretrain.py][INFO] Epoch:[0/2](818600/4588595) loss:2.869 lr:0.0000100 epoch_Time:23994.0min: [2024-01-06 08:30:26,265][model8_pretrain.py][INFO] Epoch:[0/2](818700/4588595) loss:2.771 lr:0.0000100 epoch_Time:23994.0min: [2024-01-06 08:30:26,265][model8_pretrain.py][INFO] Epoch:[0/2](818700/4588595) loss:2.809 lr:0.0000100 epoch_Time:23994.0min: [2024-01-06 08:30:26,265][model8_pretrain.py][INFO] Epoch:[0/2](818700/4588595) loss:3.235 lr:0.0000100 epoch_Time:23994.0min: [2024-01-06 08:30:26,265][model8_pretrain.py][INFO] Epoch:[0/2](818700/4588595) loss:2.649 lr:0.0000100 epoch_Time:23994.0min: [2024-01-06 08:30:26,265][model8_pretrain.py][INFO] Epoch:[0/2](818700/4588595) loss:3.226 lr:0.0000100 epoch_Time:23994.0min: [2024-01-06 08:30:26,265][model8_pretrain.py][INFO] Epoch:[0/2](818700/4588595) loss:2.995 lr:0.0000100 epoch_Time:23994.0min: [2024-01-06 08:30:26,266][model8_pretrain.py][INFO] Epoch:[0/2](818700/4588595) loss:3.332 lr:0.0000100 epoch_Time:23994.0min: [2024-01-06 08:30:26,266][model8_pretrain.py][INFO] Epoch:[0/2](818700/4588595) loss:2.732 lr:0.0000100 epoch_Time:23994.0min: [2024-01-06 08:31:06,613][model8_pretrain.py][INFO] Epoch:[0/2](818800/4588595) loss:3.079 lr:0.0000100 epoch_Time:23993.0min: [2024-01-06 08:31:06,613][model8_pretrain.py][INFO] 
Epoch:[0/2](818800/4588595) loss:3.025 lr:0.0000100 epoch_Time:23993.0min: [2024-01-06 08:31:06,613][model8_pretrain.py][INFO] Epoch:[0/2](818800/4588595) loss:3.352 lr:0.0000100 epoch_Time:23993.0min: [2024-01-06 08:31:06,613][model8_pretrain.py][INFO] Epoch:[0/2](818800/4588595) loss:2.631 lr:0.0000100 epoch_Time:23993.0min: [2024-01-06 08:31:06,613][model8_pretrain.py][INFO] Epoch:[0/2](818800/4588595) loss:2.335 lr:0.0000100 epoch_Time:23993.0min: [2024-01-06 08:31:06,613][model8_pretrain.py][INFO] Epoch:[0/2](818800/4588595) loss:3.083 lr:0.0000100 epoch_Time:23993.0min: [2024-01-06 08:31:06,613][model8_pretrain.py][INFO] Epoch:[0/2](818800/4588595) loss:3.337 lr:0.0000100 epoch_Time:23993.0min: [2024-01-06 08:31:06,614][model8_pretrain.py][INFO] Epoch:[0/2](818800/4588595) loss:3.090 lr:0.0000100 epoch_Time:23993.0min: [2024-01-06 08:31:50,386][model8_pretrain.py][INFO] Epoch:[0/2](818900/4588595) loss:2.730 lr:0.0000100 epoch_Time:23992.0min: [2024-01-06 08:31:50,386][model8_pretrain.py][INFO] Epoch:[0/2](818900/4588595) loss:2.513 lr:0.0000100 epoch_Time:23992.0min: [2024-01-06 08:31:50,386][model8_pretrain.py][INFO] Epoch:[0/2](818900/4588595) loss:2.521 lr:0.0000100 epoch_Time:23992.0min: [2024-01-06 08:31:50,386][model8_pretrain.py][INFO] Epoch:[0/2](818900/4588595) loss:2.693 lr:0.0000100 epoch_Time:23992.0min: [2024-01-06 08:31:50,386][model8_pretrain.py][INFO] Epoch:[0/2](818900/4588595) loss:3.326 lr:0.0000100 epoch_Time:23992.0min: [2024-01-06 08:31:50,386][model8_pretrain.py][INFO] Epoch:[0/2](818900/4588595) loss:2.682 lr:0.0000100 epoch_Time:23992.0min: [2024-01-06 08:31:50,386][model8_pretrain.py][INFO] Epoch:[0/2](818900/4588595) loss:2.704 lr:0.0000100 epoch_Time:23992.0min: [2024-01-06 08:31:50,386][model8_pretrain.py][INFO] Epoch:[0/2](818900/4588595) loss:2.389 lr:0.0000100 epoch_Time:23992.0min: [2024-01-06 08:32:27,317][model8_pretrain.py][INFO] Epoch:[0/2](819000/4588595) loss:2.802 lr:0.0000100 epoch_Time:23992.0min: [2024-01-06 
08:32:27,317][model8_pretrain.py][INFO] Epoch:[0/2](819000/4588595) loss:2.915 lr:0.0000100 epoch_Time:23992.0min: [2024-01-06 08:32:27,317][model8_pretrain.py][INFO] Epoch:[0/2](819000/4588595) loss:2.235 lr:0.0000100 epoch_Time:23992.0min: [2024-01-06 08:32:27,317][model8_pretrain.py][INFO] Epoch:[0/2](819000/4588595) loss:3.265 lr:0.0000100 epoch_Time:23992.0min: [2024-01-06 08:32:27,317][model8_pretrain.py][INFO] Epoch:[0/2](819000/4588595) loss:2.949 lr:0.0000100 epoch_Time:23992.0min: [2024-01-06 08:32:27,317][model8_pretrain.py][INFO] Epoch:[0/2](819000/4588595) loss:2.978 lr:0.0000100 epoch_Time:23992.0min: [2024-01-06 08:32:27,317][model8_pretrain.py][INFO] Epoch:[0/2](819000/4588595) loss:2.792 lr:0.0000100 epoch_Time:23992.0min: [2024-01-06 08:32:27,317][model8_pretrain.py][INFO] Epoch:[0/2](819000/4588595) loss:2.818 lr:0.0000100 epoch_Time:23992.0min: [2024-01-06 08:33:04,254][model8_pretrain.py][INFO] Epoch:[0/2](819100/4588595) loss:2.896 lr:0.0000100 epoch_Time:23991.0min: [2024-01-06 08:33:04,254][model8_pretrain.py][INFO] Epoch:[0/2](819100/4588595) loss:2.412 lr:0.0000100 epoch_Time:23991.0min: [2024-01-06 08:33:04,254][model8_pretrain.py][INFO] Epoch:[0/2](819100/4588595) loss:2.568 lr:0.0000100 epoch_Time:23991.0min: [2024-01-06 08:33:04,254][model8_pretrain.py][INFO] Epoch:[0/2](819100/4588595) loss:2.605 lr:0.0000100 epoch_Time:23991.0min: [2024-01-06 08:33:04,254][model8_pretrain.py][INFO] Epoch:[0/2](819100/4588595) loss:2.727 lr:0.0000100 epoch_Time:23991.0min: [2024-01-06 08:33:04,254][model8_pretrain.py][INFO] Epoch:[0/2](819100/4588595) loss:2.315 lr:0.0000100 epoch_Time:23991.0min: [2024-01-06 08:33:04,254][model8_pretrain.py][INFO] Epoch:[0/2](819100/4588595) loss:2.636 lr:0.0000100 epoch_Time:23991.0min: [2024-01-06 08:33:04,254][model8_pretrain.py][INFO] Epoch:[0/2](819100/4588595) loss:2.400 lr:0.0000100 epoch_Time:23991.0min: [2024-01-06 08:33:41,194][model8_pretrain.py][INFO] Epoch:[0/2](819200/4588595) loss:2.661 lr:0.0000100 
epoch_Time:23991.0min: [2024-01-06 08:33:41,194][model8_pretrain.py][INFO] Epoch:[0/2](819200/4588595) loss:2.843 lr:0.0000100 epoch_Time:23991.0min: [2024-01-06 08:33:41,194][model8_pretrain.py][INFO] Epoch:[0/2](819200/4588595) loss:2.585 lr:0.0000100 epoch_Time:23991.0min: [2024-01-06 08:33:41,194][model8_pretrain.py][INFO] Epoch:[0/2](819200/4588595) loss:2.996 lr:0.0000100 epoch_Time:23991.0min: [2024-01-06 08:33:41,194][model8_pretrain.py][INFO] Epoch:[0/2](819200/4588595) loss:3.255 lr:0.0000100 epoch_Time:23991.0min: [2024-01-06 08:33:41,194][model8_pretrain.py][INFO] Epoch:[0/2](819200/4588595) loss:3.054 lr:0.0000100 epoch_Time:23991.0min: [2024-01-06 08:33:41,194][model8_pretrain.py][INFO] Epoch:[0/2](819200/4588595) loss:3.089 lr:0.0000100 epoch_Time:23991.0min: [2024-01-06 08:33:41,194][model8_pretrain.py][INFO] Epoch:[0/2](819200/4588595) loss:2.433 lr:0.0000100 epoch_Time:23991.0min: [2024-01-06 08:34:18,140][model8_pretrain.py][INFO] Epoch:[0/2](819300/4588595) loss:3.406 lr:0.0000100 epoch_Time:23990.0min: [2024-01-06 08:34:18,140][model8_pretrain.py][INFO] Epoch:[0/2](819300/4588595) loss:3.098 lr:0.0000100 epoch_Time:23990.0min: [2024-01-06 08:34:18,140][model8_pretrain.py][INFO] Epoch:[0/2](819300/4588595) loss:2.648 lr:0.0000100 epoch_Time:23990.0min: [2024-01-06 08:34:18,140][model8_pretrain.py][INFO] Epoch:[0/2](819300/4588595) loss:3.045 lr:0.0000100 epoch_Time:23990.0min: [2024-01-06 08:34:18,140][model8_pretrain.py][INFO] Epoch:[0/2](819300/4588595) loss:3.088 lr:0.0000100 epoch_Time:23990.0min: [2024-01-06 08:34:18,140][model8_pretrain.py][INFO] Epoch:[0/2](819300/4588595) loss:2.676 lr:0.0000100 epoch_Time:23990.0min: [2024-01-06 08:34:18,140][model8_pretrain.py][INFO] Epoch:[0/2](819300/4588595) loss:2.445 lr:0.0000100 epoch_Time:23990.0min: [2024-01-06 08:34:18,140][model8_pretrain.py][INFO] Epoch:[0/2](819300/4588595) loss:2.772 lr:0.0000100 epoch_Time:23990.0min: [2024-01-06 08:34:55,078][model8_pretrain.py][INFO] 
Epoch:[0/2](819400/4588595) loss:2.810 lr:0.0000100 epoch_Time:23989.0min: [2024-01-06 08:34:55,078][model8_pretrain.py][INFO] Epoch:[0/2](819400/4588595) loss:2.236 lr:0.0000100 epoch_Time:23989.0min: [2024-01-06 08:34:55,078][model8_pretrain.py][INFO] Epoch:[0/2](819400/4588595) loss:2.456 lr:0.0000100 epoch_Time:23989.0min: [2024-01-06 08:34:55,078][model8_pretrain.py][INFO] Epoch:[0/2](819400/4588595) loss:3.033 lr:0.0000100 epoch_Time:23989.0min: [2024-01-06 08:34:55,078][model8_pretrain.py][INFO] Epoch:[0/2](819400/4588595) loss:2.919 lr:0.0000100 epoch_Time:23989.0min: [2024-01-06 08:34:55,078][model8_pretrain.py][INFO] Epoch:[0/2](819400/4588595) loss:2.411 lr:0.0000100 epoch_Time:23989.0min: [2024-01-06 08:34:55,078][model8_pretrain.py][INFO] Epoch:[0/2](819400/4588595) loss:3.225 lr:0.0000100 epoch_Time:23989.0min: [2024-01-06 08:34:55,078][model8_pretrain.py][INFO] Epoch:[0/2](819400/4588595) loss:2.775 lr:0.0000100 epoch_Time:23989.0min: [2024-01-06 08:35:32,008][model8_pretrain.py][INFO] Epoch:[0/2](819500/4588595) loss:2.942 lr:0.0000100 epoch_Time:23989.0min: [2024-01-06 08:35:32,008][model8_pretrain.py][INFO] Epoch:[0/2](819500/4588595) loss:2.962 lr:0.0000100 epoch_Time:23989.0min: [2024-01-06 08:35:32,008][model8_pretrain.py][INFO] Epoch:[0/2](819500/4588595) loss:2.918 lr:0.0000100 epoch_Time:23989.0min: [2024-01-06 08:35:32,008][model8_pretrain.py][INFO] Epoch:[0/2](819500/4588595) loss:2.938 lr:0.0000100 epoch_Time:23989.0min: [2024-01-06 08:35:32,008][model8_pretrain.py][INFO] Epoch:[0/2](819500/4588595) loss:2.515 lr:0.0000100 epoch_Time:23989.0min: [2024-01-06 08:35:32,008][model8_pretrain.py][INFO] Epoch:[0/2](819500/4588595) loss:2.345 lr:0.0000100 epoch_Time:23989.0min: [2024-01-06 08:35:32,008][model8_pretrain.py][INFO] Epoch:[0/2](819500/4588595) loss:2.552 lr:0.0000100 epoch_Time:23989.0min: [2024-01-06 08:35:32,008][model8_pretrain.py][INFO] Epoch:[0/2](819500/4588595) loss:3.081 lr:0.0000100 epoch_Time:23989.0min: [2024-01-06 
08:36:10,655][model8_pretrain.py][INFO] Epoch:[0/2](819600/4588595) loss:1.696 lr:0.0000100 epoch_Time:23988.0min: [2024-01-06 08:36:10,655][model8_pretrain.py][INFO] Epoch:[0/2](819600/4588595) loss:2.660 lr:0.0000100 epoch_Time:23988.0min: [2024-01-06 08:36:10,655][model8_pretrain.py][INFO] Epoch:[0/2](819600/4588595) loss:2.769 lr:0.0000100 epoch_Time:23988.0min: [2024-01-06 08:36:10,655][model8_pretrain.py][INFO] Epoch:[0/2](819600/4588595) loss:2.916 lr:0.0000100 epoch_Time:23988.0min: [2024-01-06 08:36:10,655][model8_pretrain.py][INFO] Epoch:[0/2](819600/4588595) loss:2.835 lr:0.0000100 epoch_Time:23988.0min: [2024-01-06 08:36:10,655][model8_pretrain.py][INFO] Epoch:[0/2](819600/4588595) loss:2.913 lr:0.0000100 epoch_Time:23988.0min: [2024-01-06 08:36:10,655][model8_pretrain.py][INFO] Epoch:[0/2](819600/4588595) loss:3.046 lr:0.0000100 epoch_Time:23988.0min: [2024-01-06 08:36:10,655][model8_pretrain.py][INFO] Epoch:[0/2](819600/4588595) loss:2.450 lr:0.0000100 epoch_Time:23988.0min: [2024-01-06 08:36:56,093][model8_pretrain.py][INFO] Epoch:[0/2](819700/4588595) loss:2.946 lr:0.0000100 epoch_Time:23987.0min: [2024-01-06 08:36:56,093][model8_pretrain.py][INFO] Epoch:[0/2](819700/4588595) loss:2.515 lr:0.0000100 epoch_Time:23987.0min: [2024-01-06 08:36:56,093][model8_pretrain.py][INFO] Epoch:[0/2](819700/4588595) loss:2.868 lr:0.0000100 epoch_Time:23987.0min: [2024-01-06 08:36:56,093][model8_pretrain.py][INFO] Epoch:[0/2](819700/4588595) loss:2.800 lr:0.0000100 epoch_Time:23987.0min: [2024-01-06 08:36:56,093][model8_pretrain.py][INFO] Epoch:[0/2](819700/4588595) loss:3.237 lr:0.0000100 epoch_Time:23987.0min: [2024-01-06 08:36:56,093][model8_pretrain.py][INFO] Epoch:[0/2](819700/4588595) loss:2.461 lr:0.0000100 epoch_Time:23987.0min: [2024-01-06 08:36:56,093][model8_pretrain.py][INFO] Epoch:[0/2](819700/4588595) loss:2.922 lr:0.0000100 epoch_Time:23987.0min: [2024-01-06 08:36:56,093][model8_pretrain.py][INFO] Epoch:[0/2](819700/4588595) loss:3.065 lr:0.0000100 
epoch_Time:23987.0min: [2024-01-06 08:37:33,032][model8_pretrain.py][INFO] Epoch:[0/2](819800/4588595) loss:3.027 lr:0.0000100 epoch_Time:23987.0min: [2024-01-06 08:37:33,032][model8_pretrain.py][INFO] Epoch:[0/2](819800/4588595) loss:2.410 lr:0.0000100 epoch_Time:23987.0min: [2024-01-06 08:37:33,032][model8_pretrain.py][INFO] Epoch:[0/2](819800/4588595) loss:2.158 lr:0.0000100 epoch_Time:23987.0min: [2024-01-06 08:37:33,032][model8_pretrain.py][INFO] Epoch:[0/2](819800/4588595) loss:2.136 lr:0.0000100 epoch_Time:23987.0min: [2024-01-06 08:37:33,032][model8_pretrain.py][INFO] Epoch:[0/2](819800/4588595) loss:2.664 lr:0.0000100 epoch_Time:23987.0min: [2024-01-06 08:37:33,032][model8_pretrain.py][INFO] Epoch:[0/2](819800/4588595) loss:2.495 lr:0.0000100 epoch_Time:23987.0min: [2024-01-06 08:37:33,032][model8_pretrain.py][INFO] Epoch:[0/2](819800/4588595) loss:1.617 lr:0.0000100 epoch_Time:23987.0min: [2024-01-06 08:37:33,032][model8_pretrain.py][INFO] Epoch:[0/2](819800/4588595) loss:1.976 lr:0.0000100 epoch_Time:23987.0min: [2024-01-06 08:38:09,941][model8_pretrain.py][INFO] Epoch:[0/2](819900/4588595) loss:2.795 lr:0.0000100 epoch_Time:23986.0min: [2024-01-06 08:38:09,941][model8_pretrain.py][INFO] Epoch:[0/2](819900/4588595) loss:2.474 lr:0.0000100 epoch_Time:23986.0min: [2024-01-06 08:38:09,941][model8_pretrain.py][INFO] Epoch:[0/2](819900/4588595) loss:2.792 lr:0.0000100 epoch_Time:23986.0min: [2024-01-06 08:38:09,941][model8_pretrain.py][INFO] Epoch:[0/2](819900/4588595) loss:3.197 lr:0.0000100 epoch_Time:23986.0min: [2024-01-06 08:38:09,941][model8_pretrain.py][INFO] Epoch:[0/2](819900/4588595) loss:2.964 lr:0.0000100 epoch_Time:23986.0min: [2024-01-06 08:38:09,941][model8_pretrain.py][INFO] Epoch:[0/2](819900/4588595) loss:3.071 lr:0.0000100 epoch_Time:23986.0min: [2024-01-06 08:38:09,941][model8_pretrain.py][INFO] Epoch:[0/2](819900/4588595) loss:3.118 lr:0.0000100 epoch_Time:23986.0min: [2024-01-06 08:38:09,941][model8_pretrain.py][INFO] 
Epoch:[0/2](819900/4588595) loss:2.666 lr:0.0000100 epoch_Time:23986.0min: [2024-01-06 08:38:46,884][model8_pretrain.py][INFO] Epoch:[0/2](820000/4588595) loss:2.363 lr:0.0000100 epoch_Time:23986.0min: [2024-01-06 08:38:46,884][model8_pretrain.py][INFO] Epoch:[0/2](820000/4588595) loss:2.761 lr:0.0000100 epoch_Time:23986.0min: [2024-01-06 08:38:46,884][model8_pretrain.py][INFO] Epoch:[0/2](820000/4588595) loss:2.630 lr:0.0000100 epoch_Time:23986.0min: [2024-01-06 08:38:46,884][model8_pretrain.py][INFO] Epoch:[0/2](820000/4588595) loss:2.921 lr:0.0000100 epoch_Time:23986.0min: [2024-01-06 08:38:46,884][model8_pretrain.py][INFO] Epoch:[0/2](820000/4588595) loss:2.622 lr:0.0000100 epoch_Time:23986.0min: [2024-01-06 08:38:46,884][model8_pretrain.py][INFO] Epoch:[0/2](820000/4588595) loss:2.781 lr:0.0000100 epoch_Time:23986.0min: [2024-01-06 08:38:46,884][model8_pretrain.py][INFO] Epoch:[0/2](820000/4588595) loss:2.569 lr:0.0000100 epoch_Time:23986.0min: [2024-01-06 08:38:46,884][model8_pretrain.py][INFO] Epoch:[0/2](820000/4588595) loss:2.682 lr:0.0000100 epoch_Time:23986.0min: [2024-01-06 08:39:23,822][model8_pretrain.py][INFO] Epoch:[0/2](820100/4588595) loss:2.292 lr:0.0000100 epoch_Time:23985.0min: [2024-01-06 08:39:23,822][model8_pretrain.py][INFO] Epoch:[0/2](820100/4588595) loss:2.427 lr:0.0000100 epoch_Time:23985.0min: [2024-01-06 08:39:23,822][model8_pretrain.py][INFO] Epoch:[0/2](820100/4588595) loss:2.944 lr:0.0000100 epoch_Time:23985.0min: [2024-01-06 08:39:23,822][model8_pretrain.py][INFO] Epoch:[0/2](820100/4588595) loss:2.784 lr:0.0000100 epoch_Time:23985.0min: [2024-01-06 08:39:23,822][model8_pretrain.py][INFO] Epoch:[0/2](820100/4588595) loss:2.503 lr:0.0000100 epoch_Time:23985.0min: [2024-01-06 08:39:23,822][model8_pretrain.py][INFO] Epoch:[0/2](820100/4588595) loss:2.852 lr:0.0000100 epoch_Time:23985.0min: [2024-01-06 08:39:23,822][model8_pretrain.py][INFO] Epoch:[0/2](820100/4588595) loss:3.097 lr:0.0000100 epoch_Time:23985.0min: [2024-01-06 
08:39:23,822][model8_pretrain.py][INFO] Epoch:[0/2](820100/4588595) loss:3.188 lr:0.0000100 epoch_Time:23985.0min: [2024-01-06 08:40:00,762][model8_pretrain.py][INFO] Epoch:[0/2](820200/4588595) loss:2.285 lr:0.0000100 epoch_Time:23984.0min: [2024-01-06 08:40:00,762][model8_pretrain.py][INFO] Epoch:[0/2](820200/4588595) loss:3.202 lr:0.0000100 epoch_Time:23984.0min: [2024-01-06 08:40:00,762][model8_pretrain.py][INFO] Epoch:[0/2](820200/4588595) loss:2.900 lr:0.0000100 epoch_Time:23984.0min: [2024-01-06 08:40:00,762][model8_pretrain.py][INFO] Epoch:[0/2](820200/4588595) loss:2.948 lr:0.0000100 epoch_Time:23984.0min: [2024-01-06 08:40:00,762][model8_pretrain.py][INFO] Epoch:[0/2](820200/4588595) loss:3.015 lr:0.0000100 epoch_Time:23984.0min: [2024-01-06 08:40:00,762][model8_pretrain.py][INFO] Epoch:[0/2](820200/4588595) loss:2.775 lr:0.0000100 epoch_Time:23984.0min: [2024-01-06 08:40:00,762][model8_pretrain.py][INFO] Epoch:[0/2](820200/4588595) loss:2.597 lr:0.0000100 epoch_Time:23984.0min: [2024-01-06 08:40:00,762][model8_pretrain.py][INFO] Epoch:[0/2](820200/4588595) loss:2.902 lr:0.0000100 epoch_Time:23984.0min: [2024-01-06 08:40:37,703][model8_pretrain.py][INFO] Epoch:[0/2](820300/4588595) loss:2.925 lr:0.0000100 epoch_Time:23984.0min: [2024-01-06 08:40:37,703][model8_pretrain.py][INFO] Epoch:[0/2](820300/4588595) loss:2.712 lr:0.0000100 epoch_Time:23984.0min: [2024-01-06 08:40:37,703][model8_pretrain.py][INFO] Epoch:[0/2](820300/4588595) loss:3.229 lr:0.0000100 epoch_Time:23984.0min: [2024-01-06 08:40:37,703][model8_pretrain.py][INFO] Epoch:[0/2](820300/4588595) loss:3.382 lr:0.0000100 epoch_Time:23984.0min: [2024-01-06 08:40:37,703][model8_pretrain.py][INFO] Epoch:[0/2](820300/4588595) loss:2.826 lr:0.0000100 epoch_Time:23984.0min: [2024-01-06 08:40:37,703][model8_pretrain.py][INFO] Epoch:[0/2](820300/4588595) loss:3.163 lr:0.0000100 epoch_Time:23984.0min: [2024-01-06 08:40:37,704][model8_pretrain.py][INFO] Epoch:[0/2](820300/4588595) loss:2.618 lr:0.0000100 
epoch_Time:23984.0min: [2024-01-06 08:40:37,704][model8_pretrain.py][INFO] Epoch:[0/2](820300/4588595) loss:2.956 lr:0.0000100 epoch_Time:23984.0min: [2024-01-06 08:41:16,375][model8_pretrain.py][INFO] Epoch:[0/2](820400/4588595) loss:2.881 lr:0.0000100 epoch_Time:23983.0min: [2024-01-06 08:41:16,375][model8_pretrain.py][INFO] Epoch:[0/2](820400/4588595) loss:2.483 lr:0.0000100 epoch_Time:23983.0min: [2024-01-06 08:41:16,375][model8_pretrain.py][INFO] Epoch:[0/2](820400/4588595) loss:3.273 lr:0.0000100 epoch_Time:23983.0min: [2024-01-06 08:41:16,375][model8_pretrain.py][INFO] Epoch:[0/2](820400/4588595) loss:3.033 lr:0.0000100 epoch_Time:23983.0min: [2024-01-06 08:41:16,375][model8_pretrain.py][INFO] Epoch:[0/2](820400/4588595) loss:2.475 lr:0.0000100 epoch_Time:23983.0min: [2024-01-06 08:41:16,375][model8_pretrain.py][INFO] Epoch:[0/2](820400/4588595) loss:3.077 lr:0.0000100 epoch_Time:23983.0min: [2024-01-06 08:41:16,375][model8_pretrain.py][INFO] Epoch:[0/2](820400/4588595) loss:2.022 lr:0.0000100 epoch_Time:23983.0min: [2024-01-06 08:41:16,376][model8_pretrain.py][INFO] Epoch:[0/2](820400/4588595) loss:2.886 lr:0.0000100 epoch_Time:23983.0min: [2024-01-06 08:42:01,624][model8_pretrain.py][INFO] Epoch:[0/2](820500/4588595) loss:3.055 lr:0.0000100 epoch_Time:23982.0min: [2024-01-06 08:42:01,624][model8_pretrain.py][INFO] Epoch:[0/2](820500/4588595) loss:3.179 lr:0.0000100 epoch_Time:23982.0min: [2024-01-06 08:42:01,624][model8_pretrain.py][INFO] Epoch:[0/2](820500/4588595) loss:2.766 lr:0.0000100 epoch_Time:23982.0min: [2024-01-06 08:42:01,624][model8_pretrain.py][INFO] Epoch:[0/2](820500/4588595) loss:2.570 lr:0.0000100 epoch_Time:23982.0min: [2024-01-06 08:42:01,624][model8_pretrain.py][INFO] Epoch:[0/2](820500/4588595) loss:3.204 lr:0.0000100 epoch_Time:23982.0min: [2024-01-06 08:42:01,624][model8_pretrain.py][INFO] Epoch:[0/2](820500/4588595) loss:3.125 lr:0.0000100 epoch_Time:23982.0min: [2024-01-06 08:42:01,624][model8_pretrain.py][INFO] 
Epoch:[0/2](820500/4588595) loss:2.799 lr:0.0000100 epoch_Time:23982.0min: [2024-01-06 08:42:01,624][model8_pretrain.py][INFO] Epoch:[0/2](820500/4588595) loss:2.852 lr:0.0000100 epoch_Time:23982.0min: [2024-01-06 08:42:38,562][model8_pretrain.py][INFO] Epoch:[0/2](820600/4588595) loss:2.853 lr:0.0000100 epoch_Time:23982.0min: [2024-01-06 08:42:38,562][model8_pretrain.py][INFO] Epoch:[0/2](820600/4588595) loss:2.897 lr:0.0000100 epoch_Time:23982.0min: [2024-01-06 08:42:38,562][model8_pretrain.py][INFO] Epoch:[0/2](820600/4588595) loss:2.681 lr:0.0000100 epoch_Time:23982.0min: [2024-01-06 08:42:38,562][model8_pretrain.py][INFO] Epoch:[0/2](820600/4588595) loss:2.310 lr:0.0000100 epoch_Time:23982.0min: [2024-01-06 08:42:38,562][model8_pretrain.py][INFO] Epoch:[0/2](820600/4588595) loss:2.815 lr:0.0000100 epoch_Time:23982.0min: [2024-01-06 08:42:38,562][model8_pretrain.py][INFO] Epoch:[0/2](820600/4588595) loss:2.705 lr:0.0000100 epoch_Time:23982.0min: [2024-01-06 08:42:38,563][model8_pretrain.py][INFO] Epoch:[0/2](820600/4588595) loss:3.105 lr:0.0000100 epoch_Time:23982.0min: [2024-01-06 08:42:38,563][model8_pretrain.py][INFO] Epoch:[0/2](820600/4588595) loss:2.920 lr:0.0000100 epoch_Time:23982.0min: [2024-01-06 08:43:15,516][model8_pretrain.py][INFO] Epoch:[0/2](820700/4588595) loss:2.822 lr:0.0000100 epoch_Time:23981.0min: [2024-01-06 08:43:15,516][model8_pretrain.py][INFO] Epoch:[0/2](820700/4588595) loss:3.040 lr:0.0000100 epoch_Time:23981.0min: [2024-01-06 08:43:15,516][model8_pretrain.py][INFO] Epoch:[0/2](820700/4588595) loss:2.587 lr:0.0000100 epoch_Time:23981.0min: [2024-01-06 08:43:15,516][model8_pretrain.py][INFO] Epoch:[0/2](820700/4588595) loss:2.731 lr:0.0000100 epoch_Time:23981.0min: [2024-01-06 08:43:15,516][model8_pretrain.py][INFO] Epoch:[0/2](820700/4588595) loss:2.933 lr:0.0000100 epoch_Time:23981.0min: [2024-01-06 08:43:15,516][model8_pretrain.py][INFO] Epoch:[0/2](820700/4588595) loss:2.613 lr:0.0000100 epoch_Time:23981.0min: [2024-01-06 
08:43:15,516][model8_pretrain.py][INFO] Epoch:[0/2](820700/4588595) loss:2.955 lr:0.0000100 epoch_Time:23981.0min: [2024-01-06 08:43:15,516][model8_pretrain.py][INFO] Epoch:[0/2](820700/4588595) loss:2.834 lr:0.0000100 epoch_Time:23981.0min: [2024-01-06 08:43:52,452][model8_pretrain.py][INFO] Epoch:[0/2](820800/4588595) loss:2.760 lr:0.0000100 epoch_Time:23980.0min: [2024-01-06 08:43:52,452][model8_pretrain.py][INFO] Epoch:[0/2](820800/4588595) loss:2.736 lr:0.0000100 epoch_Time:23980.0min: [2024-01-06 08:43:52,452][model8_pretrain.py][INFO] Epoch:[0/2](820800/4588595) loss:2.024 lr:0.0000100 epoch_Time:23980.0min: [2024-01-06 08:43:52,452][model8_pretrain.py][INFO] Epoch:[0/2](820800/4588595) loss:3.087 lr:0.0000100 epoch_Time:23980.0min: [2024-01-06 08:43:52,452][model8_pretrain.py][INFO] Epoch:[0/2](820800/4588595) loss:3.312 lr:0.0000100 epoch_Time:23980.0min: [2024-01-06 08:43:52,452][model8_pretrain.py][INFO] Epoch:[0/2](820800/4588595) loss:2.958 lr:0.0000100 epoch_Time:23980.0min: [2024-01-06 08:43:52,452][model8_pretrain.py][INFO] Epoch:[0/2](820800/4588595) loss:3.243 lr:0.0000100 epoch_Time:23980.0min: [2024-01-06 08:43:52,453][model8_pretrain.py][INFO] Epoch:[0/2](820800/4588595) loss:2.806 lr:0.0000100 epoch_Time:23980.0min: [2024-01-06 08:44:29,392][model8_pretrain.py][INFO] Epoch:[0/2](820900/4588595) loss:2.779 lr:0.0000100 epoch_Time:23980.0min: [2024-01-06 08:44:29,393][model8_pretrain.py][INFO] Epoch:[0/2](820900/4588595) loss:2.716 lr:0.0000100 epoch_Time:23980.0min: [2024-01-06 08:44:29,393][model8_pretrain.py][INFO] Epoch:[0/2](820900/4588595) loss:2.634 lr:0.0000100 epoch_Time:23980.0min: [2024-01-06 08:44:29,393][model8_pretrain.py][INFO] Epoch:[0/2](820900/4588595) loss:3.140 lr:0.0000100 epoch_Time:23980.0min: [2024-01-06 08:44:29,393][model8_pretrain.py][INFO] Epoch:[0/2](820900/4588595) loss:3.003 lr:0.0000100 epoch_Time:23980.0min: [2024-01-06 08:44:29,393][model8_pretrain.py][INFO] Epoch:[0/2](820900/4588595) loss:3.184 lr:0.0000100 
epoch_Time:23980.0min: [2024-01-06 08:44:29,393][model8_pretrain.py][INFO] Epoch:[0/2](820900/4588595) loss:2.700 lr:0.0000100 epoch_Time:23980.0min: [2024-01-06 08:44:29,393][model8_pretrain.py][INFO] Epoch:[0/2](820900/4588595) loss:3.023 lr:0.0000100 epoch_Time:23980.0min: [2024-01-06 08:45:06,335][model8_pretrain.py][INFO] Epoch:[0/2](821000/4588595) loss:2.665 lr:0.0000100 epoch_Time:23979.0min: [2024-01-06 08:45:06,335][model8_pretrain.py][INFO] Epoch:[0/2](821000/4588595) loss:2.403 lr:0.0000100 epoch_Time:23979.0min: [2024-01-06 08:45:06,336][model8_pretrain.py][INFO] Epoch:[0/2](821000/4588595) loss:2.827 lr:0.0000100 epoch_Time:23979.0min: [2024-01-06 08:45:06,336][model8_pretrain.py][INFO] Epoch:[0/2](821000/4588595) loss:3.090 lr:0.0000100 epoch_Time:23979.0min: [2024-01-06 08:45:06,336][model8_pretrain.py][INFO] Epoch:[0/2](821000/4588595) loss:2.531 lr:0.0000100 epoch_Time:23979.0min: [2024-01-06 08:45:06,336][model8_pretrain.py][INFO] Epoch:[0/2](821000/4588595) loss:2.925 lr:0.0000100 epoch_Time:23979.0min: [2024-01-06 08:45:06,336][model8_pretrain.py][INFO] Epoch:[0/2](821000/4588595) loss:2.999 lr:0.0000100 epoch_Time:23979.0min: [2024-01-06 08:45:06,336][model8_pretrain.py][INFO] Epoch:[0/2](821000/4588595) loss:3.255 lr:0.0000100 epoch_Time:23979.0min: [2024-01-06 08:45:43,273][model8_pretrain.py][INFO] Epoch:[0/2](821100/4588595) loss:2.971 lr:0.0000100 epoch_Time:23979.0min: [2024-01-06 08:45:43,273][model8_pretrain.py][INFO] Epoch:[0/2](821100/4588595) loss:2.875 lr:0.0000100 epoch_Time:23979.0min: [2024-01-06 08:45:43,273][model8_pretrain.py][INFO] Epoch:[0/2](821100/4588595) loss:3.160 lr:0.0000100 epoch_Time:23979.0min: [2024-01-06 08:45:43,273][model8_pretrain.py][INFO] Epoch:[0/2](821100/4588595) loss:3.005 lr:0.0000100 epoch_Time:23979.0min: [2024-01-06 08:45:43,273][model8_pretrain.py][INFO] Epoch:[0/2](821100/4588595) loss:2.715 lr:0.0000100 epoch_Time:23979.0min: [2024-01-06 08:45:43,273][model8_pretrain.py][INFO] 
Epoch:[0/2](821100/4588595) loss:2.806 lr:0.0000100 epoch_Time:23979.0min: [2024-01-06 08:45:43,273][model8_pretrain.py][INFO] Epoch:[0/2](821100/4588595) loss:2.805 lr:0.0000100 epoch_Time:23979.0min: [2024-01-06 08:45:43,274][model8_pretrain.py][INFO] Epoch:[0/2](821100/4588595) loss:2.560 lr:0.0000100 epoch_Time:23979.0min: [2024-01-06 08:46:21,949][model8_pretrain.py][INFO] Epoch:[0/2](821200/4588595) loss:3.079 lr:0.0000100 epoch_Time:23978.0min: [2024-01-06 08:46:21,949][model8_pretrain.py][INFO] Epoch:[0/2](821200/4588595) loss:2.643 lr:0.0000100 epoch_Time:23978.0min: [2024-01-06 08:46:21,949][model8_pretrain.py][INFO] Epoch:[0/2](821200/4588595) loss:3.026 lr:0.0000100 epoch_Time:23978.0min: [2024-01-06 08:46:21,949][model8_pretrain.py][INFO] Epoch:[0/2](821200/4588595) loss:2.741 lr:0.0000100 epoch_Time:23978.0min: [2024-01-06 08:46:21,949][model8_pretrain.py][INFO] Epoch:[0/2](821200/4588595) loss:2.487 lr:0.0000100 epoch_Time:23978.0min: [2024-01-06 08:46:21,949][model8_pretrain.py][INFO] Epoch:[0/2](821200/4588595) loss:3.166 lr:0.0000100 epoch_Time:23978.0min: [2024-01-06 08:46:21,949][model8_pretrain.py][INFO] Epoch:[0/2](821200/4588595) loss:2.425 lr:0.0000100 epoch_Time:23978.0min: [2024-01-06 08:46:21,949][model8_pretrain.py][INFO] Epoch:[0/2](821200/4588595) loss:3.251 lr:0.0000100 epoch_Time:23978.0min: [2024-01-06 08:47:07,479][model8_pretrain.py][INFO] Epoch:[0/2](821300/4588595) loss:3.070 lr:0.0000100 epoch_Time:23978.0min: [2024-01-06 08:47:07,479][model8_pretrain.py][INFO] Epoch:[0/2](821300/4588595) loss:3.128 lr:0.0000100 epoch_Time:23978.0min: [2024-01-06 08:47:07,479][model8_pretrain.py][INFO] Epoch:[0/2](821300/4588595) loss:2.781 lr:0.0000100 epoch_Time:23978.0min: [2024-01-06 08:47:07,479][model8_pretrain.py][INFO] Epoch:[0/2](821300/4588595) loss:2.625 lr:0.0000100 epoch_Time:23978.0min: [2024-01-06 08:47:07,479][model8_pretrain.py][INFO] Epoch:[0/2](821300/4588595) loss:2.490 lr:0.0000100 epoch_Time:23978.0min: [2024-01-06 
08:47:07,479][model8_pretrain.py][INFO] Epoch:[0/2](821300/4588595) loss:3.351 lr:0.0000100 epoch_Time:23978.0min: [2024-01-06 08:47:07,479][model8_pretrain.py][INFO] Epoch:[0/2](821300/4588595) loss:2.583 lr:0.0000100 epoch_Time:23978.0min: [2024-01-06 08:47:07,479][model8_pretrain.py][INFO] Epoch:[0/2](821300/4588595) loss:2.248 lr:0.0000100 epoch_Time:23978.0min: [2024-01-06 08:47:44,412][model8_pretrain.py][INFO] Epoch:[0/2](821400/4588595) loss:2.344 lr:0.0000100 epoch_Time:23977.0min: [2024-01-06 08:47:44,412][model8_pretrain.py][INFO] Epoch:[0/2](821400/4588595) loss:2.802 lr:0.0000100 epoch_Time:23977.0min: [2024-01-06 08:47:44,412][model8_pretrain.py][INFO] Epoch:[0/2](821400/4588595) loss:3.237 lr:0.0000100 epoch_Time:23977.0min: [2024-01-06 08:47:44,412][model8_pretrain.py][INFO] Epoch:[0/2](821400/4588595) loss:2.499 lr:0.0000100 epoch_Time:23977.0min: [2024-01-06 08:47:44,412][model8_pretrain.py][INFO] Epoch:[0/2](821400/4588595) loss:2.969 lr:0.0000100 epoch_Time:23977.0min: [2024-01-06 08:47:44,412][model8_pretrain.py][INFO] Epoch:[0/2](821400/4588595) loss:2.578 lr:0.0000100 epoch_Time:23977.0min: [2024-01-06 08:47:44,412][model8_pretrain.py][INFO] Epoch:[0/2](821400/4588595) loss:2.451 lr:0.0000100 epoch_Time:23977.0min: [2024-01-06 08:47:44,412][model8_pretrain.py][INFO] Epoch:[0/2](821400/4588595) loss:3.166 lr:0.0000100 epoch_Time:23977.0min: [2024-01-06 08:48:21,354][model8_pretrain.py][INFO] Epoch:[0/2](821500/4588595) loss:2.808 lr:0.0000100 epoch_Time:23976.0min: [2024-01-06 08:48:21,354][model8_pretrain.py][INFO] Epoch:[0/2](821500/4588595) loss:3.167 lr:0.0000100 epoch_Time:23976.0min: [2024-01-06 08:48:21,354][model8_pretrain.py][INFO] Epoch:[0/2](821500/4588595) loss:2.774 lr:0.0000100 epoch_Time:23976.0min: [2024-01-06 08:48:21,354][model8_pretrain.py][INFO] Epoch:[0/2](821500/4588595) loss:2.581 lr:0.0000100 epoch_Time:23976.0min: [2024-01-06 08:48:21,354][model8_pretrain.py][INFO] Epoch:[0/2](821500/4588595) loss:2.172 lr:0.0000100 
epoch_Time:23976.0min: [2024-01-06 08:48:21,354][model8_pretrain.py][INFO] Epoch:[0/2](821500/4588595) loss:2.321 lr:0.0000100 epoch_Time:23976.0min: [2024-01-06 08:48:21,354][model8_pretrain.py][INFO] Epoch:[0/2](821500/4588595) loss:2.464 lr:0.0000100 epoch_Time:23976.0min: [2024-01-06 08:48:21,355][model8_pretrain.py][INFO] Epoch:[0/2](821500/4588595) loss:2.505 lr:0.0000100 epoch_Time:23976.0min: [2024-01-06 08:48:58,297][model8_pretrain.py][INFO] Epoch:[0/2](821600/4588595) loss:3.130 lr:0.0000100 epoch_Time:23975.0min: [2024-01-06 08:48:58,297][model8_pretrain.py][INFO] Epoch:[0/2](821600/4588595) loss:2.393 lr:0.0000100 epoch_Time:23975.0min: [2024-01-06 08:48:58,297][model8_pretrain.py][INFO] Epoch:[0/2](821600/4588595) loss:2.951 lr:0.0000100 epoch_Time:23975.0min: [2024-01-06 08:48:58,297][model8_pretrain.py][INFO] Epoch:[0/2](821600/4588595) loss:1.461 lr:0.0000100 epoch_Time:23975.0min: [2024-01-06 08:48:58,297][model8_pretrain.py][INFO] Epoch:[0/2](821600/4588595) loss:2.731 lr:0.0000100 epoch_Time:23975.0min: [2024-01-06 08:48:58,297][model8_pretrain.py][INFO] Epoch:[0/2](821600/4588595) loss:3.157 lr:0.0000100 epoch_Time:23975.0min: [2024-01-06 08:48:58,297][model8_pretrain.py][INFO] Epoch:[0/2](821600/4588595) loss:3.186 lr:0.0000100 epoch_Time:23975.0min: [2024-01-06 08:48:58,298][model8_pretrain.py][INFO] Epoch:[0/2](821600/4588595) loss:2.623 lr:0.0000100 epoch_Time:23975.0min: [2024-01-06 08:49:35,234][model8_pretrain.py][INFO] Epoch:[0/2](821700/4588595) loss:2.492 lr:0.0000100 epoch_Time:23975.0min: [2024-01-06 08:49:35,234][model8_pretrain.py][INFO] Epoch:[0/2](821700/4588595) loss:2.270 lr:0.0000100 epoch_Time:23975.0min: [2024-01-06 08:49:35,234][model8_pretrain.py][INFO] Epoch:[0/2](821700/4588595) loss:2.839 lr:0.0000100 epoch_Time:23975.0min: [2024-01-06 08:49:35,234][model8_pretrain.py][INFO] Epoch:[0/2](821700/4588595) loss:2.384 lr:0.0000100 epoch_Time:23975.0min: [2024-01-06 08:49:35,234][model8_pretrain.py][INFO] 
Epoch:[0/2](821700/4588595) loss:2.653 lr:0.0000100 epoch_Time:23975.0min: [2024-01-06 08:49:35,234][model8_pretrain.py][INFO] Epoch:[0/2](821700/4588595) loss:3.232 lr:0.0000100 epoch_Time:23975.0min: [2024-01-06 08:49:35,234][model8_pretrain.py][INFO] Epoch:[0/2](821700/4588595) loss:2.359 lr:0.0000100 epoch_Time:23975.0min: [2024-01-06 08:49:35,234][model8_pretrain.py][INFO] Epoch:[0/2](821700/4588595) loss:2.554 lr:0.0000100 epoch_Time:23975.0min: [2024-01-06 08:50:12,182][model8_pretrain.py][INFO] Epoch:[0/2](821800/4588595) loss:2.387 lr:0.0000100 epoch_Time:23974.0min: [2024-01-06 08:50:12,182][model8_pretrain.py][INFO] Epoch:[0/2](821800/4588595) loss:2.775 lr:0.0000100 epoch_Time:23974.0min: [2024-01-06 08:50:12,182][model8_pretrain.py][INFO] Epoch:[0/2](821800/4588595) loss:2.958 lr:0.0000100 epoch_Time:23974.0min: [2024-01-06 08:50:12,182][model8_pretrain.py][INFO] Epoch:[0/2](821800/4588595) loss:3.076 lr:0.0000100 epoch_Time:23974.0min: [2024-01-06 08:50:12,182][model8_pretrain.py][INFO] Epoch:[0/2](821800/4588595) loss:3.307 lr:0.0000100 epoch_Time:23974.0min: [2024-01-06 08:50:12,182][model8_pretrain.py][INFO] Epoch:[0/2](821800/4588595) loss:2.567 lr:0.0000100 epoch_Time:23974.0min: [2024-01-06 08:50:12,182][model8_pretrain.py][INFO] Epoch:[0/2](821800/4588595) loss:2.984 lr:0.0000100 epoch_Time:23974.0min: [2024-01-06 08:50:12,183][model8_pretrain.py][INFO] Epoch:[0/2](821800/4588595) loss:2.825 lr:0.0000100 epoch_Time:23974.0min: [2024-01-06 08:50:49,127][model8_pretrain.py][INFO] Epoch:[0/2](821900/4588595) loss:2.945 lr:0.0000100 epoch_Time:23973.0min: [2024-01-06 08:50:49,127][model8_pretrain.py][INFO] Epoch:[0/2](821900/4588595) loss:2.944 lr:0.0000100 epoch_Time:23973.0min: [2024-01-06 08:50:49,127][model8_pretrain.py][INFO] Epoch:[0/2](821900/4588595) loss:2.948 lr:0.0000100 epoch_Time:23973.0min: [2024-01-06 08:50:49,127][model8_pretrain.py][INFO] Epoch:[0/2](821900/4588595) loss:2.833 lr:0.0000100 epoch_Time:23973.0min: [2024-01-06 
08:50:49,127][model8_pretrain.py][INFO] Epoch:[0/2](821900/4588595) loss:2.365 lr:0.0000100 epoch_Time:23973.0min: [2024-01-06 08:50:49,128][model8_pretrain.py][INFO] Epoch:[0/2](821900/4588595) loss:2.610 lr:0.0000100 epoch_Time:23973.0min: [2024-01-06 08:50:49,128][model8_pretrain.py][INFO] Epoch:[0/2](821900/4588595) loss:2.750 lr:0.0000100 epoch_Time:23973.0min: [2024-01-06 08:50:49,128][model8_pretrain.py][INFO] Epoch:[0/2](821900/4588595) loss:2.852 lr:0.0000100 epoch_Time:23973.0min: [2024-01-06 08:51:27,773][model8_pretrain.py][INFO] Epoch:[0/2](822000/4588595) loss:2.971 lr:0.0000100 epoch_Time:23973.0min: [2024-01-06 08:51:27,773][model8_pretrain.py][INFO] Epoch:[0/2](822000/4588595) loss:2.643 lr:0.0000100 epoch_Time:23973.0min: [2024-01-06 08:51:27,773][model8_pretrain.py][INFO] Epoch:[0/2](822000/4588595) loss:2.466 lr:0.0000100 epoch_Time:23973.0min: [2024-01-06 08:51:27,773][model8_pretrain.py][INFO] Epoch:[0/2](822000/4588595) loss:2.444 lr:0.0000100 epoch_Time:23973.0min: [2024-01-06 08:51:27,773][model8_pretrain.py][INFO] Epoch:[0/2](822000/4588595) loss:2.688 lr:0.0000100 epoch_Time:23973.0min: [2024-01-06 08:51:27,773][model8_pretrain.py][INFO] Epoch:[0/2](822000/4588595) loss:2.688 lr:0.0000100 epoch_Time:23973.0min: [2024-01-06 08:51:27,773][model8_pretrain.py][INFO] Epoch:[0/2](822000/4588595) loss:2.696 lr:0.0000100 epoch_Time:23973.0min: [2024-01-06 08:51:27,773][model8_pretrain.py][INFO] Epoch:[0/2](822000/4588595) loss:3.040 lr:0.0000100 epoch_Time:23973.0min: [2024-01-06 08:52:13,408][model8_pretrain.py][INFO] Epoch:[0/2](822100/4588595) loss:2.878 lr:0.0000100 epoch_Time:23973.0min: [2024-01-06 08:52:13,408][model8_pretrain.py][INFO] Epoch:[0/2](822100/4588595) loss:2.335 lr:0.0000100 epoch_Time:23973.0min: [2024-01-06 08:52:13,408][model8_pretrain.py][INFO] Epoch:[0/2](822100/4588595) loss:2.960 lr:0.0000100 epoch_Time:23973.0min: [2024-01-06 08:52:13,408][model8_pretrain.py][INFO] Epoch:[0/2](822100/4588595) loss:2.925 lr:0.0000100 
epoch_Time:23973.0min: [2024-01-06 08:52:13,408][model8_pretrain.py][INFO] Epoch:[0/2](822100/4588595) loss:2.908 lr:0.0000100 epoch_Time:23973.0min: [2024-01-06 08:52:13,408][model8_pretrain.py][INFO] Epoch:[0/2](822100/4588595) loss:2.929 lr:0.0000100 epoch_Time:23973.0min: [2024-01-06 08:52:13,408][model8_pretrain.py][INFO] Epoch:[0/2](822100/4588595) loss:2.905 lr:0.0000100 epoch_Time:23973.0min: [2024-01-06 08:52:13,408][model8_pretrain.py][INFO] Epoch:[0/2](822100/4588595) loss:2.899 lr:0.0000100 epoch_Time:23973.0min: [2024-01-06 08:52:50,341][model8_pretrain.py][INFO] Epoch:[0/2](822200/4588595) loss:2.675 lr:0.0000100 epoch_Time:23971.0min: [2024-01-06 08:52:50,341][model8_pretrain.py][INFO] Epoch:[0/2](822200/4588595) loss:2.977 lr:0.0000100 epoch_Time:23971.0min: [2024-01-06 08:52:50,341][model8_pretrain.py][INFO] Epoch:[0/2](822200/4588595) loss:2.632 lr:0.0000100 epoch_Time:23971.0min: [2024-01-06 08:52:50,341][model8_pretrain.py][INFO] Epoch:[0/2](822200/4588595) loss:3.164 lr:0.0000100 epoch_Time:23971.0min: [2024-01-06 08:52:50,341][model8_pretrain.py][INFO] Epoch:[0/2](822200/4588595) loss:3.121 lr:0.0000100 epoch_Time:23971.0min: [2024-01-06 08:52:50,341][model8_pretrain.py][INFO] Epoch:[0/2](822200/4588595) loss:2.737 lr:0.0000100 epoch_Time:23971.0min: [2024-01-06 08:52:50,341][model8_pretrain.py][INFO] Epoch:[0/2](822200/4588595) loss:2.872 lr:0.0000100 epoch_Time:23971.0min: [2024-01-06 08:52:50,341][model8_pretrain.py][INFO] Epoch:[0/2](822200/4588595) loss:2.675 lr:0.0000100 epoch_Time:23971.0min: [2024-01-06 08:53:27,282][model8_pretrain.py][INFO] Epoch:[0/2](822300/4588595) loss:2.828 lr:0.0000100 epoch_Time:23971.0min: [2024-01-06 08:53:27,282][model8_pretrain.py][INFO] Epoch:[0/2](822300/4588595) loss:2.446 lr:0.0000100 epoch_Time:23971.0min: [2024-01-06 08:53:27,282][model8_pretrain.py][INFO] Epoch:[0/2](822300/4588595) loss:3.027 lr:0.0000100 epoch_Time:23971.0min: [2024-01-06 08:53:27,282][model8_pretrain.py][INFO] 
Epoch:[0/2](822300/4588595) loss:3.366 lr:0.0000100 epoch_Time:23971.0min: [2024-01-06 08:53:27,282][model8_pretrain.py][INFO] Epoch:[0/2](822300/4588595) loss:2.991 lr:0.0000100 epoch_Time:23971.0min: [2024-01-06 08:53:27,282][model8_pretrain.py][INFO] Epoch:[0/2](822300/4588595) loss:2.725 lr:0.0000100 epoch_Time:23971.0min: [2024-01-06 08:53:27,282][model8_pretrain.py][INFO] Epoch:[0/2](822300/4588595) loss:2.232 lr:0.0000100 epoch_Time:23971.0min: [2024-01-06 08:53:27,283][model8_pretrain.py][INFO] Epoch:[0/2](822300/4588595) loss:2.850 lr:0.0000100 epoch_Time:23971.0min: [2024-01-06 08:54:04,217][model8_pretrain.py][INFO] Epoch:[0/2](822400/4588595) loss:2.992 lr:0.0000100 epoch_Time:23970.0min: [2024-01-06 08:54:04,217][model8_pretrain.py][INFO] Epoch:[0/2](822400/4588595) loss:2.196 lr:0.0000100 epoch_Time:23970.0min: [2024-01-06 08:54:04,217][model8_pretrain.py][INFO] Epoch:[0/2](822400/4588595) loss:3.109 lr:0.0000100 epoch_Time:23970.0min: [2024-01-06 08:54:04,217][model8_pretrain.py][INFO] Epoch:[0/2](822400/4588595) loss:2.763 lr:0.0000100 epoch_Time:23970.0min: [2024-01-06 08:54:04,217][model8_pretrain.py][INFO] Epoch:[0/2](822400/4588595) loss:2.930 lr:0.0000100 epoch_Time:23970.0min: [2024-01-06 08:54:04,217][model8_pretrain.py][INFO] Epoch:[0/2](822400/4588595) loss:3.034 lr:0.0000100 epoch_Time:23970.0min: [2024-01-06 08:54:04,217][model8_pretrain.py][INFO] Epoch:[0/2](822400/4588595) loss:2.904 lr:0.0000100 epoch_Time:23970.0min: [2024-01-06 08:54:04,217][model8_pretrain.py][INFO] Epoch:[0/2](822400/4588595) loss:2.805 lr:0.0000100 epoch_Time:23970.0min: [2024-01-06 08:54:41,162][model8_pretrain.py][INFO] Epoch:[0/2](822500/4588595) loss:3.477 lr:0.0000100 epoch_Time:23970.0min: [2024-01-06 08:54:41,162][model8_pretrain.py][INFO] Epoch:[0/2](822500/4588595) loss:3.367 lr:0.0000100 epoch_Time:23970.0min: [2024-01-06 08:54:41,162][model8_pretrain.py][INFO] Epoch:[0/2](822500/4588595) loss:2.782 lr:0.0000100 epoch_Time:23970.0min: [2024-01-06 
08:54:41,162][model8_pretrain.py][INFO] Epoch:[0/2](822500/4588595) loss:2.857 lr:0.0000100 epoch_Time:23970.0min: [2024-01-06 08:54:41,162][model8_pretrain.py][INFO] Epoch:[0/2](822500/4588595) loss:2.894 lr:0.0000100 epoch_Time:23970.0min: [2024-01-06 08:54:41,162][model8_pretrain.py][INFO] Epoch:[0/2](822500/4588595) loss:2.024 lr:0.0000100 epoch_Time:23970.0min: [2024-01-06 08:54:41,162][model8_pretrain.py][INFO] Epoch:[0/2](822500/4588595) loss:3.048 lr:0.0000100 epoch_Time:23970.0min: [2024-01-06 08:54:41,162][model8_pretrain.py][INFO] Epoch:[0/2](822500/4588595) loss:2.952 lr:0.0000100 epoch_Time:23970.0min: [2024-01-06 08:55:18,133][model8_pretrain.py][INFO] Epoch:[0/2](822600/4588595) loss:3.271 lr:0.0000100 epoch_Time:23969.0min: [2024-01-06 08:55:18,133][model8_pretrain.py][INFO] Epoch:[0/2](822600/4588595) loss:2.221 lr:0.0000100 epoch_Time:23969.0min: [2024-01-06 08:55:18,133][model8_pretrain.py][INFO] Epoch:[0/2](822600/4588595) loss:2.798 lr:0.0000100 epoch_Time:23969.0min: [2024-01-06 08:55:18,133][model8_pretrain.py][INFO] Epoch:[0/2](822600/4588595) loss:2.524 lr:0.0000100 epoch_Time:23969.0min: [2024-01-06 08:55:18,133][model8_pretrain.py][INFO] Epoch:[0/2](822600/4588595) loss:3.213 lr:0.0000100 epoch_Time:23969.0min: [2024-01-06 08:55:18,133][model8_pretrain.py][INFO] Epoch:[0/2](822600/4588595) loss:3.222 lr:0.0000100 epoch_Time:23969.0min: [2024-01-06 08:55:18,133][model8_pretrain.py][INFO] Epoch:[0/2](822600/4588595) loss:2.429 lr:0.0000100 epoch_Time:23969.0min: [2024-01-06 08:55:18,134][model8_pretrain.py][INFO] Epoch:[0/2](822600/4588595) loss:2.941 lr:0.0000100 epoch_Time:23969.0min: [2024-01-06 08:55:55,078][model8_pretrain.py][INFO] Epoch:[0/2](822700/4588595) loss:3.035 lr:0.0000100 epoch_Time:23968.0min: [2024-01-06 08:55:55,078][model8_pretrain.py][INFO] Epoch:[0/2](822700/4588595) loss:2.790 lr:0.0000100 epoch_Time:23968.0min: [2024-01-06 08:55:55,078][model8_pretrain.py][INFO] Epoch:[0/2](822700/4588595) loss:3.153 lr:0.0000100 
epoch_Time:23968.0min: [2024-01-06 08:55:55,078][model8_pretrain.py][INFO] Epoch:[0/2](822700/4588595) loss:2.492 lr:0.0000100 epoch_Time:23968.0min: [2024-01-06 08:55:55,078][model8_pretrain.py][INFO] Epoch:[0/2](822700/4588595) loss:2.821 lr:0.0000100 epoch_Time:23968.0min: [2024-01-06 08:55:55,078][model8_pretrain.py][INFO] Epoch:[0/2](822700/4588595) loss:2.506 lr:0.0000100 epoch_Time:23968.0min: [2024-01-06 08:55:55,078][model8_pretrain.py][INFO] Epoch:[0/2](822700/4588595) loss:2.642 lr:0.0000100 epoch_Time:23968.0min: [2024-01-06 08:55:55,078][model8_pretrain.py][INFO] Epoch:[0/2](822700/4588595) loss:2.269 lr:0.0000100 epoch_Time:23968.0min: [2024-01-06 08:56:33,713][model8_pretrain.py][INFO] Epoch:[0/2](822800/4588595) loss:2.388 lr:0.0000100 epoch_Time:23968.0min: [2024-01-06 08:56:33,714][model8_pretrain.py][INFO] Epoch:[0/2](822800/4588595) loss:2.685 lr:0.0000100 epoch_Time:23968.0min: [2024-01-06 08:56:33,714][model8_pretrain.py][INFO] Epoch:[0/2](822800/4588595) loss:2.558 lr:0.0000100 epoch_Time:23968.0min: [2024-01-06 08:56:33,714][model8_pretrain.py][INFO] Epoch:[0/2](822800/4588595) loss:3.155 lr:0.0000100 epoch_Time:23968.0min: [2024-01-06 08:56:33,714][model8_pretrain.py][INFO] Epoch:[0/2](822800/4588595) loss:2.991 lr:0.0000100 epoch_Time:23968.0min: [2024-01-06 08:56:33,714][model8_pretrain.py][INFO] Epoch:[0/2](822800/4588595) loss:2.690 lr:0.0000100 epoch_Time:23968.0min: [2024-01-06 08:56:33,714][model8_pretrain.py][INFO] Epoch:[0/2](822800/4588595) loss:2.918 lr:0.0000100 epoch_Time:23968.0min: [2024-01-06 08:56:33,714][model8_pretrain.py][INFO] Epoch:[0/2](822800/4588595) loss:2.694 lr:0.0000100 epoch_Time:23968.0min: [2024-01-06 08:57:19,359][model8_pretrain.py][INFO] Epoch:[0/2](822900/4588595) loss:2.904 lr:0.0000100 epoch_Time:23968.0min: [2024-01-06 08:57:19,359][model8_pretrain.py][INFO] Epoch:[0/2](822900/4588595) loss:2.944 lr:0.0000100 epoch_Time:23968.0min: [2024-01-06 08:57:19,359][model8_pretrain.py][INFO] 
Epoch:[0/2](822900/4588595) loss:2.176 lr:0.0000100 epoch_Time:23968.0min: [2024-01-06 08:57:19,359][model8_pretrain.py][INFO] Epoch:[0/2](822900/4588595) loss:3.200 lr:0.0000100 epoch_Time:23968.0min: [2024-01-06 08:57:19,359][model8_pretrain.py][INFO] Epoch:[0/2](822900/4588595) loss:2.946 lr:0.0000100 epoch_Time:23968.0min: [2024-01-06 08:57:19,359][model8_pretrain.py][INFO] Epoch:[0/2](822900/4588595) loss:3.286 lr:0.0000100 epoch_Time:23968.0min: [2024-01-06 08:57:19,359][model8_pretrain.py][INFO] Epoch:[0/2](822900/4588595) loss:2.958 lr:0.0000100 epoch_Time:23968.0min: [2024-01-06 08:57:19,359][model8_pretrain.py][INFO] Epoch:[0/2](822900/4588595) loss:2.833 lr:0.0000100 epoch_Time:23968.0min: [2024-01-06 08:57:56,295][model8_pretrain.py][INFO] Epoch:[0/2](823000/4588595) loss:2.631 lr:0.0000100 epoch_Time:23966.0min: [2024-01-06 08:57:56,295][model8_pretrain.py][INFO] Epoch:[0/2](823000/4588595) loss:2.755 lr:0.0000100 epoch_Time:23966.0min: [2024-01-06 08:57:56,295][model8_pretrain.py][INFO] Epoch:[0/2](823000/4588595) loss:3.150 lr:0.0000100 epoch_Time:23966.0min: [2024-01-06 08:57:56,295][model8_pretrain.py][INFO] Epoch:[0/2](823000/4588595) loss:3.097 lr:0.0000100 epoch_Time:23966.0min: [2024-01-06 08:57:56,295][model8_pretrain.py][INFO] Epoch:[0/2](823000/4588595) loss:2.964 lr:0.0000100 epoch_Time:23966.0min: [2024-01-06 08:57:56,295][model8_pretrain.py][INFO] Epoch:[0/2](823000/4588595) loss:3.131 lr:0.0000100 epoch_Time:23966.0min: [2024-01-06 08:57:56,295][model8_pretrain.py][INFO] Epoch:[0/2](823000/4588595) loss:2.617 lr:0.0000100 epoch_Time:23966.0min: [2024-01-06 08:57:56,295][model8_pretrain.py][INFO] Epoch:[0/2](823000/4588595) loss:3.279 lr:0.0000100 epoch_Time:23966.0min: [2024-01-06 08:58:33,235][model8_pretrain.py][INFO] Epoch:[0/2](823100/4588595) loss:2.607 lr:0.0000100 epoch_Time:23966.0min: [2024-01-06 08:58:33,235][model8_pretrain.py][INFO] Epoch:[0/2](823100/4588595) loss:3.130 lr:0.0000100 epoch_Time:23966.0min: [2024-01-06 
08:58:33,235][model8_pretrain.py][INFO] Epoch:[0/2](823100/4588595) loss:3.140 lr:0.0000100 epoch_Time:23966.0min: [2024-01-06 08:58:33,235][model8_pretrain.py][INFO] Epoch:[0/2](823100/4588595) loss:2.698 lr:0.0000100 epoch_Time:23966.0min: [2024-01-06 08:58:33,235][model8_pretrain.py][INFO] Epoch:[0/2](823100/4588595) loss:2.478 lr:0.0000100 epoch_Time:23966.0min: [2024-01-06 08:58:33,235][model8_pretrain.py][INFO] Epoch:[0/2](823100/4588595) loss:2.389 lr:0.0000100 epoch_Time:23966.0min: [2024-01-06 08:58:33,235][model8_pretrain.py][INFO] Epoch:[0/2](823100/4588595) loss:2.761 lr:0.0000100 epoch_Time:23966.0min: [2024-01-06 08:58:33,235][model8_pretrain.py][INFO] Epoch:[0/2](823100/4588595) loss:2.747 lr:0.0000100 epoch_Time:23966.0min: [2024-01-06 08:59:10,178][model8_pretrain.py][INFO] Epoch:[0/2](823200/4588595) loss:2.864 lr:0.0000100 epoch_Time:23965.0min: [2024-01-06 08:59:10,178][model8_pretrain.py][INFO] Epoch:[0/2](823200/4588595) loss:2.466 lr:0.0000100 epoch_Time:23965.0min: [2024-01-06 08:59:10,178][model8_pretrain.py][INFO] Epoch:[0/2](823200/4588595) loss:3.022 lr:0.0000100 epoch_Time:23965.0min: [2024-01-06 08:59:10,178][model8_pretrain.py][INFO] Epoch:[0/2](823200/4588595) loss:2.960 lr:0.0000100 epoch_Time:23965.0min: [2024-01-06 08:59:10,178][model8_pretrain.py][INFO] Epoch:[0/2](823200/4588595) loss:3.276 lr:0.0000100 epoch_Time:23965.0min: [2024-01-06 08:59:10,178][model8_pretrain.py][INFO] Epoch:[0/2](823200/4588595) loss:2.789 lr:0.0000100 epoch_Time:23965.0min: [2024-01-06 08:59:10,178][model8_pretrain.py][INFO] Epoch:[0/2](823200/4588595) loss:2.626 lr:0.0000100 epoch_Time:23965.0min: [2024-01-06 08:59:10,178][model8_pretrain.py][INFO] Epoch:[0/2](823200/4588595) loss:2.937 lr:0.0000100 epoch_Time:23965.0min: [2024-01-06 08:59:47,109][model8_pretrain.py][INFO] Epoch:[0/2](823300/4588595) loss:2.434 lr:0.0000100 epoch_Time:23965.0min: [2024-01-06 08:59:47,109][model8_pretrain.py][INFO] Epoch:[0/2](823300/4588595) loss:2.974 lr:0.0000100 
epoch_Time:23965.0min: [2024-01-06 08:59:47,109][model8_pretrain.py][INFO] Epoch:[0/2](823300/4588595) loss:3.102 lr:0.0000100 epoch_Time:23965.0min: [2024-01-06 08:59:47,109][model8_pretrain.py][INFO] Epoch:[0/2](823300/4588595) loss:2.918 lr:0.0000100 epoch_Time:23965.0min: [2024-01-06 08:59:47,109][model8_pretrain.py][INFO] Epoch:[0/2](823300/4588595) loss:2.877 lr:0.0000100 epoch_Time:23965.0min: [2024-01-06 08:59:47,109][model8_pretrain.py][INFO] Epoch:[0/2](823300/4588595) loss:2.496 lr:0.0000100 epoch_Time:23965.0min: [2024-01-06 08:59:47,109][model8_pretrain.py][INFO] Epoch:[0/2](823300/4588595) loss:2.774 lr:0.0000100 epoch_Time:23965.0min: [2024-01-06 08:59:47,110][model8_pretrain.py][INFO] Epoch:[0/2](823300/4588595) loss:2.583 lr:0.0000100 epoch_Time:23965.0min: [2024-01-06 09:00:24,049][model8_pretrain.py][INFO] Epoch:[0/2](823400/4588595) loss:3.127 lr:0.0000100 epoch_Time:23964.0min: [2024-01-06 09:00:24,049][model8_pretrain.py][INFO] Epoch:[0/2](823400/4588595) loss:2.597 lr:0.0000100 epoch_Time:23964.0min: [2024-01-06 09:00:24,049][model8_pretrain.py][INFO] Epoch:[0/2](823400/4588595) loss:2.616 lr:0.0000100 epoch_Time:23964.0min: [2024-01-06 09:00:24,049][model8_pretrain.py][INFO] Epoch:[0/2](823400/4588595) loss:3.233 lr:0.0000100 epoch_Time:23964.0min: [2024-01-06 09:00:24,049][model8_pretrain.py][INFO] Epoch:[0/2](823400/4588595) loss:3.093 lr:0.0000100 epoch_Time:23964.0min: [2024-01-06 09:00:24,049][model8_pretrain.py][INFO] Epoch:[0/2](823400/4588595) loss:2.726 lr:0.0000100 epoch_Time:23964.0min: [2024-01-06 09:00:24,049][model8_pretrain.py][INFO] Epoch:[0/2](823400/4588595) loss:3.245 lr:0.0000100 epoch_Time:23964.0min: [2024-01-06 09:00:24,049][model8_pretrain.py][INFO] Epoch:[0/2](823400/4588595) loss:2.833 lr:0.0000100 epoch_Time:23964.0min: [2024-01-06 09:01:00,996][model8_pretrain.py][INFO] Epoch:[0/2](823500/4588595) loss:2.701 lr:0.0000100 epoch_Time:23963.0min: [2024-01-06 09:01:00,996][model8_pretrain.py][INFO] 
Epoch:[0/2](823500/4588595) loss:3.213 lr:0.0000100 epoch_Time:23963.0min: [2024-01-06 09:01:00,996][model8_pretrain.py][INFO] Epoch:[0/2](823500/4588595) loss:2.414 lr:0.0000100 epoch_Time:23963.0min: [2024-01-06 09:01:00,996][model8_pretrain.py][INFO] Epoch:[0/2](823500/4588595) loss:3.120 lr:0.0000100 epoch_Time:23963.0min: [2024-01-06 09:01:00,996][model8_pretrain.py][INFO] Epoch:[0/2](823500/4588595) loss:2.443 lr:0.0000100 epoch_Time:23963.0min: [2024-01-06 09:01:00,996][model8_pretrain.py][INFO] Epoch:[0/2](823500/4588595) loss:3.146 lr:0.0000100 epoch_Time:23963.0min: [2024-01-06 09:01:00,996][model8_pretrain.py][INFO] Epoch:[0/2](823500/4588595) loss:2.751 lr:0.0000100 epoch_Time:23963.0min: [2024-01-06 09:01:00,996][model8_pretrain.py][INFO] Epoch:[0/2](823500/4588595) loss:2.751 lr:0.0000100 epoch_Time:23963.0min: [2024-01-06 09:01:39,631][model8_pretrain.py][INFO] Epoch:[0/2](823600/4588595) loss:3.036 lr:0.0000100 epoch_Time:23963.0min: [2024-01-06 09:01:39,631][model8_pretrain.py][INFO] Epoch:[0/2](823600/4588595) loss:2.991 lr:0.0000100 epoch_Time:23963.0min: [2024-01-06 09:01:39,631][model8_pretrain.py][INFO] Epoch:[0/2](823600/4588595) loss:3.063 lr:0.0000100 epoch_Time:23963.0min: [2024-01-06 09:01:39,631][model8_pretrain.py][INFO] Epoch:[0/2](823600/4588595) loss:3.145 lr:0.0000100 epoch_Time:23963.0min: [2024-01-06 09:01:39,631][model8_pretrain.py][INFO] Epoch:[0/2](823600/4588595) loss:2.761 lr:0.0000100 epoch_Time:23963.0min: [2024-01-06 09:01:39,631][model8_pretrain.py][INFO] Epoch:[0/2](823600/4588595) loss:2.448 lr:0.0000100 epoch_Time:23963.0min: [2024-01-06 09:01:39,631][model8_pretrain.py][INFO] Epoch:[0/2](823600/4588595) loss:2.409 lr:0.0000100 epoch_Time:23963.0min: [2024-01-06 09:01:39,632][model8_pretrain.py][INFO] Epoch:[0/2](823600/4588595) loss:2.849 lr:0.0000100 epoch_Time:23963.0min: [2024-01-06 09:02:25,394][model8_pretrain.py][INFO] Epoch:[0/2](823700/4588595) loss:2.956 lr:0.0000100 epoch_Time:23963.0min: [2024-01-06 
09:02:25,394][model8_pretrain.py][INFO] Epoch:[0/2](823700/4588595) loss:3.473 lr:0.0000100 epoch_Time:23963.0min: [2024-01-06 09:02:25,394][model8_pretrain.py][INFO] Epoch:[0/2](823700/4588595) loss:2.093 lr:0.0000100 epoch_Time:23963.0min: [2024-01-06 09:02:25,394][model8_pretrain.py][INFO] Epoch:[0/2](823700/4588595) loss:2.950 lr:0.0000100 epoch_Time:23963.0min: [2024-01-06 09:02:25,394][model8_pretrain.py][INFO] Epoch:[0/2](823700/4588595) loss:2.914 lr:0.0000100 epoch_Time:23963.0min: [2024-01-06 09:02:25,394][model8_pretrain.py][INFO] Epoch:[0/2](823700/4588595) loss:3.297 lr:0.0000100 epoch_Time:23963.0min: [2024-01-06 09:02:25,394][model8_pretrain.py][INFO] Epoch:[0/2](823700/4588595) loss:3.170 lr:0.0000100 epoch_Time:23963.0min: [2024-01-06 09:02:25,394][model8_pretrain.py][INFO] Epoch:[0/2](823700/4588595) loss:3.237 lr:0.0000100 epoch_Time:23963.0min: [2024-01-06 09:03:02,328][model8_pretrain.py][INFO] Epoch:[0/2](823800/4588595) loss:2.300 lr:0.0000100 epoch_Time:23962.0min: [2024-01-06 09:03:02,328][model8_pretrain.py][INFO] Epoch:[0/2](823800/4588595) loss:2.985 lr:0.0000100 epoch_Time:23962.0min: [2024-01-06 09:03:02,328][model8_pretrain.py][INFO] Epoch:[0/2](823800/4588595) loss:2.785 lr:0.0000100 epoch_Time:23962.0min: [2024-01-06 09:03:02,328][model8_pretrain.py][INFO] Epoch:[0/2](823800/4588595) loss:2.287 lr:0.0000100 epoch_Time:23962.0min: [2024-01-06 09:03:02,328][model8_pretrain.py][INFO] Epoch:[0/2](823800/4588595) loss:3.119 lr:0.0000100 epoch_Time:23962.0min: [2024-01-06 09:03:02,328][model8_pretrain.py][INFO] Epoch:[0/2](823800/4588595) loss:2.981 lr:0.0000100 epoch_Time:23962.0min: [2024-01-06 09:03:02,328][model8_pretrain.py][INFO] Epoch:[0/2](823800/4588595) loss:3.169 lr:0.0000100 epoch_Time:23962.0min: [2024-01-06 09:03:02,328][model8_pretrain.py][INFO] Epoch:[0/2](823800/4588595) loss:3.041 lr:0.0000100 epoch_Time:23962.0min: [2024-01-06 09:03:39,271][model8_pretrain.py][INFO] Epoch:[0/2](823900/4588595) loss:2.919 lr:0.0000100 
epoch_Time:23961.0min: [2024-01-06 09:03:39,271][model8_pretrain.py][INFO] Epoch:[0/2](823900/4588595) loss:3.023 lr:0.0000100 epoch_Time:23961.0min: [2024-01-06 09:03:39,271][model8_pretrain.py][INFO] Epoch:[0/2](823900/4588595) loss:2.783 lr:0.0000100 epoch_Time:23961.0min: [2024-01-06 09:03:39,271][model8_pretrain.py][INFO] Epoch:[0/2](823900/4588595) loss:2.462 lr:0.0000100 epoch_Time:23961.0min: [2024-01-06 09:03:39,271][model8_pretrain.py][INFO] Epoch:[0/2](823900/4588595) loss:2.608 lr:0.0000100 epoch_Time:23961.0min: [2024-01-06 09:03:39,271][model8_pretrain.py][INFO] Epoch:[0/2](823900/4588595) loss:3.258 lr:0.0000100 epoch_Time:23961.0min: [2024-01-06 09:03:39,271][model8_pretrain.py][INFO] Epoch:[0/2](823900/4588595) loss:2.996 lr:0.0000100 epoch_Time:23961.0min: [2024-01-06 09:03:39,271][model8_pretrain.py][INFO] Epoch:[0/2](823900/4588595) loss:2.691 lr:0.0000100 epoch_Time:23961.0min: [2024-01-06 09:04:16,213][model8_pretrain.py][INFO] Epoch:[0/2](824000/4588595) loss:2.704 lr:0.0000100 epoch_Time:23960.0min: [2024-01-06 09:04:16,213][model8_pretrain.py][INFO] Epoch:[0/2](824000/4588595) loss:3.303 lr:0.0000100 epoch_Time:23960.0min: [2024-01-06 09:04:16,213][model8_pretrain.py][INFO] Epoch:[0/2](824000/4588595) loss:2.151 lr:0.0000100 epoch_Time:23960.0min: [2024-01-06 09:04:16,213][model8_pretrain.py][INFO] Epoch:[0/2](824000/4588595) loss:2.355 lr:0.0000100 epoch_Time:23960.0min: [2024-01-06 09:04:16,213][model8_pretrain.py][INFO] Epoch:[0/2](824000/4588595) loss:2.815 lr:0.0000100 epoch_Time:23960.0min: [2024-01-06 09:04:16,213][model8_pretrain.py][INFO] Epoch:[0/2](824000/4588595) loss:3.144 lr:0.0000100 epoch_Time:23960.0min: [2024-01-06 09:04:16,213][model8_pretrain.py][INFO] Epoch:[0/2](824000/4588595) loss:3.088 lr:0.0000100 epoch_Time:23960.0min: [2024-01-06 09:04:16,213][model8_pretrain.py][INFO] Epoch:[0/2](824000/4588595) loss:2.918 lr:0.0000100 epoch_Time:23960.0min: [2024-01-06 09:04:53,170][model8_pretrain.py][INFO] 
Epoch:[0/2](824100/4588595) loss:2.589 lr:0.0000100 epoch_Time:23959.0min: [2024-01-06 09:04:53,170][model8_pretrain.py][INFO] Epoch:[0/2](824100/4588595) loss:2.868 lr:0.0000100 epoch_Time:23959.0min: [2024-01-06 09:04:53,170][model8_pretrain.py][INFO] Epoch:[0/2](824100/4588595) loss:2.296 lr:0.0000100 epoch_Time:23959.0min: [2024-01-06 09:04:53,170][model8_pretrain.py][INFO] Epoch:[0/2](824100/4588595) loss:2.742 lr:0.0000100 epoch_Time:23959.0min: [2024-01-06 09:04:53,170][model8_pretrain.py][INFO] Epoch:[0/2](824100/4588595) loss:2.537 lr:0.0000100 epoch_Time:23959.0min: [2024-01-06 09:04:53,170][model8_pretrain.py][INFO] Epoch:[0/2](824100/4588595) loss:2.541 lr:0.0000100 epoch_Time:23959.0min: [2024-01-06 09:04:53,170][model8_pretrain.py][INFO] Epoch:[0/2](824100/4588595) loss:3.007 lr:0.0000100 epoch_Time:23959.0min: [2024-01-06 09:04:53,170][model8_pretrain.py][INFO] Epoch:[0/2](824100/4588595) loss:2.569 lr:0.0000100 epoch_Time:23959.0min: [2024-01-06 09:05:30,118][model8_pretrain.py][INFO] Epoch:[0/2](824200/4588595) loss:3.329 lr:0.0000100 epoch_Time:23959.0min: [2024-01-06 09:05:30,118][model8_pretrain.py][INFO] Epoch:[0/2](824200/4588595) loss:3.097 lr:0.0000100 epoch_Time:23959.0min: [2024-01-06 09:05:30,118][model8_pretrain.py][INFO] Epoch:[0/2](824200/4588595) loss:2.830 lr:0.0000100 epoch_Time:23959.0min: [2024-01-06 09:05:30,118][model8_pretrain.py][INFO] Epoch:[0/2](824200/4588595) loss:3.341 lr:0.0000100 epoch_Time:23959.0min: [2024-01-06 09:05:30,118][model8_pretrain.py][INFO] Epoch:[0/2](824200/4588595) loss:2.905 lr:0.0000100 epoch_Time:23959.0min: [2024-01-06 09:05:30,118][model8_pretrain.py][INFO] Epoch:[0/2](824200/4588595) loss:2.480 lr:0.0000100 epoch_Time:23959.0min: [2024-01-06 09:05:30,118][model8_pretrain.py][INFO] Epoch:[0/2](824200/4588595) loss:3.186 lr:0.0000100 epoch_Time:23959.0min: [2024-01-06 09:05:30,119][model8_pretrain.py][INFO] Epoch:[0/2](824200/4588595) loss:2.954 lr:0.0000100 epoch_Time:23959.0min: [2024-01-06 
09:06:07,046][model8_pretrain.py][INFO] Epoch:[0/2](824300/4588595) loss:2.997 lr:0.0000100 epoch_Time:23958.0min: [2024-01-06 09:06:07,046][model8_pretrain.py][INFO] Epoch:[0/2](824300/4588595) loss:3.034 lr:0.0000100 epoch_Time:23958.0min: [2024-01-06 09:06:07,046][model8_pretrain.py][INFO] Epoch:[0/2](824300/4588595) loss:3.216 lr:0.0000100 epoch_Time:23958.0min: [2024-01-06 09:06:07,046][model8_pretrain.py][INFO] Epoch:[0/2](824300/4588595) loss:3.224 lr:0.0000100 epoch_Time:23958.0min: [2024-01-06 09:06:07,046][model8_pretrain.py][INFO] Epoch:[0/2](824300/4588595) loss:2.865 lr:0.0000100 epoch_Time:23958.0min: [2024-01-06 09:06:07,046][model8_pretrain.py][INFO] Epoch:[0/2](824300/4588595) loss:2.302 lr:0.0000100 epoch_Time:23958.0min: [2024-01-06 09:06:07,046][model8_pretrain.py][INFO] Epoch:[0/2](824300/4588595) loss:2.882 lr:0.0000100 epoch_Time:23958.0min: [2024-01-06 09:06:07,047][model8_pretrain.py][INFO] Epoch:[0/2](824300/4588595) loss:3.106 lr:0.0000100 epoch_Time:23958.0min: [2024-01-06 09:06:45,687][model8_pretrain.py][INFO] Epoch:[0/2](824400/4588595) loss:3.161 lr:0.0000100 epoch_Time:23958.0min: [2024-01-06 09:06:45,687][model8_pretrain.py][INFO] Epoch:[0/2](824400/4588595) loss:3.100 lr:0.0000100 epoch_Time:23958.0min: [2024-01-06 09:06:45,687][model8_pretrain.py][INFO] Epoch:[0/2](824400/4588595) loss:2.978 lr:0.0000100 epoch_Time:23958.0min: [2024-01-06 09:06:45,687][model8_pretrain.py][INFO] Epoch:[0/2](824400/4588595) loss:2.776 lr:0.0000100 epoch_Time:23958.0min: [2024-01-06 09:06:45,687][model8_pretrain.py][INFO] Epoch:[0/2](824400/4588595) loss:3.395 lr:0.0000100 epoch_Time:23958.0min: [2024-01-06 09:06:45,687][model8_pretrain.py][INFO] Epoch:[0/2](824400/4588595) loss:2.487 lr:0.0000100 epoch_Time:23958.0min: [2024-01-06 09:06:45,687][model8_pretrain.py][INFO] Epoch:[0/2](824400/4588595) loss:2.874 lr:0.0000100 epoch_Time:23958.0min: [2024-01-06 09:06:45,687][model8_pretrain.py][INFO] Epoch:[0/2](824400/4588595) loss:2.586 lr:0.0000100 
epoch_Time:23958.0min: [2024-01-06 09:07:31,504][model8_pretrain.py][INFO] Epoch:[0/2](824500/4588595) loss:2.663 lr:0.0000100 epoch_Time:23958.0min: [2024-01-06 09:07:31,504][model8_pretrain.py][INFO] Epoch:[0/2](824500/4588595) loss:2.845 lr:0.0000100 epoch_Time:23958.0min: [2024-01-06 09:07:31,504][model8_pretrain.py][INFO] Epoch:[0/2](824500/4588595) loss:2.688 lr:0.0000100 epoch_Time:23958.0min: [2024-01-06 09:07:31,504][model8_pretrain.py][INFO] Epoch:[0/2](824500/4588595) loss:2.586 lr:0.0000100 epoch_Time:23958.0min: [2024-01-06 09:07:31,504][model8_pretrain.py][INFO] Epoch:[0/2](824500/4588595) loss:2.990 lr:0.0000100 epoch_Time:23958.0min: [2024-01-06 09:07:31,504][model8_pretrain.py][INFO] Epoch:[0/2](824500/4588595) loss:2.926 lr:0.0000100 epoch_Time:23958.0min: [2024-01-06 09:07:31,504][model8_pretrain.py][INFO] Epoch:[0/2](824500/4588595) loss:2.281 lr:0.0000100 epoch_Time:23958.0min: [2024-01-06 09:07:31,504][model8_pretrain.py][INFO] Epoch:[0/2](824500/4588595) loss:2.374 lr:0.0000100 epoch_Time:23958.0min: [2024-01-06 09:08:08,432][model8_pretrain.py][INFO] Epoch:[0/2](824600/4588595) loss:2.523 lr:0.0000100 epoch_Time:23957.0min: [2024-01-06 09:08:08,432][model8_pretrain.py][INFO] Epoch:[0/2](824600/4588595) loss:2.593 lr:0.0000100 epoch_Time:23957.0min: [2024-01-06 09:08:08,432][model8_pretrain.py][INFO] Epoch:[0/2](824600/4588595) loss:2.394 lr:0.0000100 epoch_Time:23957.0min: [2024-01-06 09:08:08,432][model8_pretrain.py][INFO] Epoch:[0/2](824600/4588595) loss:3.229 lr:0.0000100 epoch_Time:23957.0min: [2024-01-06 09:08:08,432][model8_pretrain.py][INFO] Epoch:[0/2](824600/4588595) loss:2.787 lr:0.0000100 epoch_Time:23957.0min: [2024-01-06 09:08:08,432][model8_pretrain.py][INFO] Epoch:[0/2](824600/4588595) loss:2.400 lr:0.0000100 epoch_Time:23957.0min: [2024-01-06 09:08:08,432][model8_pretrain.py][INFO] Epoch:[0/2](824600/4588595) loss:2.393 lr:0.0000100 epoch_Time:23957.0min: [2024-01-06 09:08:08,432][model8_pretrain.py][INFO] 
Epoch:[0/2](824600/4588595) loss:2.970 lr:0.0000100 epoch_Time:23957.0min: [2024-01-06 09:08:45,370][model8_pretrain.py][INFO] Epoch:[0/2](824700/4588595) loss:3.049 lr:0.0000100 epoch_Time:23956.0min: [2024-01-06 09:08:45,370][model8_pretrain.py][INFO] Epoch:[0/2](824700/4588595) loss:2.719 lr:0.0000100 epoch_Time:23956.0min: [2024-01-06 09:08:45,370][model8_pretrain.py][INFO] Epoch:[0/2](824700/4588595) loss:2.856 lr:0.0000100 epoch_Time:23956.0min: [2024-01-06 09:08:45,370][model8_pretrain.py][INFO] Epoch:[0/2](824700/4588595) loss:2.628 lr:0.0000100 epoch_Time:23956.0min: [2024-01-06 09:08:45,370][model8_pretrain.py][INFO] Epoch:[0/2](824700/4588595) loss:3.001 lr:0.0000100 epoch_Time:23956.0min: [2024-01-06 09:08:45,370][model8_pretrain.py][INFO] Epoch:[0/2](824700/4588595) loss:2.452 lr:0.0000100 epoch_Time:23956.0min: [2024-01-06 09:08:45,370][model8_pretrain.py][INFO] Epoch:[0/2](824700/4588595) loss:2.938 lr:0.0000100 epoch_Time:23956.0min: [2024-01-06 09:08:45,370][model8_pretrain.py][INFO] Epoch:[0/2](824700/4588595) loss:3.008 lr:0.0000100 epoch_Time:23956.0min: [2024-01-06 09:09:22,319][model8_pretrain.py][INFO] Epoch:[0/2](824800/4588595) loss:3.371 lr:0.0000100 epoch_Time:23955.0min: [2024-01-06 09:09:22,319][model8_pretrain.py][INFO] Epoch:[0/2](824800/4588595) loss:2.960 lr:0.0000100 epoch_Time:23955.0min: [2024-01-06 09:09:22,319][model8_pretrain.py][INFO] Epoch:[0/2](824800/4588595) loss:2.888 lr:0.0000100 epoch_Time:23955.0min: [2024-01-06 09:09:22,319][model8_pretrain.py][INFO] Epoch:[0/2](824800/4588595) loss:2.838 lr:0.0000100 epoch_Time:23955.0min: [2024-01-06 09:09:22,319][model8_pretrain.py][INFO] Epoch:[0/2](824800/4588595) loss:3.106 lr:0.0000100 epoch_Time:23955.0min: [2024-01-06 09:09:22,319][model8_pretrain.py][INFO] Epoch:[0/2](824800/4588595) loss:3.314 lr:0.0000100 epoch_Time:23955.0min: [2024-01-06 09:09:22,319][model8_pretrain.py][INFO] Epoch:[0/2](824800/4588595) loss:3.084 lr:0.0000100 epoch_Time:23955.0min: [2024-01-06 
09:09:22,319][model8_pretrain.py][INFO] Epoch:[0/2](824800/4588595) loss:3.135 lr:0.0000100 epoch_Time:23955.0min: [2024-01-06 09:09:59,257][model8_pretrain.py][INFO] Epoch:[0/2](824900/4588595) loss:2.815 lr:0.0000100 epoch_Time:23954.0min: [2024-01-06 09:09:59,257][model8_pretrain.py][INFO] Epoch:[0/2](824900/4588595) loss:3.090 lr:0.0000100 epoch_Time:23954.0min: [2024-01-06 09:09:59,257][model8_pretrain.py][INFO] Epoch:[0/2](824900/4588595) loss:2.956 lr:0.0000100 epoch_Time:23954.0min: [2024-01-06 09:09:59,257][model8_pretrain.py][INFO] Epoch:[0/2](824900/4588595) loss:2.876 lr:0.0000100 epoch_Time:23954.0min: [2024-01-06 09:09:59,257][model8_pretrain.py][INFO] Epoch:[0/2](824900/4588595) loss:2.711 lr:0.0000100 epoch_Time:23954.0min: [2024-01-06 09:09:59,257][model8_pretrain.py][INFO] Epoch:[0/2](824900/4588595) loss:2.753 lr:0.0000100 epoch_Time:23954.0min: [2024-01-06 09:09:59,257][model8_pretrain.py][INFO] Epoch:[0/2](824900/4588595) loss:2.643 lr:0.0000100 epoch_Time:23954.0min: [2024-01-06 09:09:59,257][model8_pretrain.py][INFO] Epoch:[0/2](824900/4588595) loss:2.614 lr:0.0000100 epoch_Time:23954.0min: [2024-01-06 09:10:36,201][model8_pretrain.py][INFO] Epoch:[0/2](825000/4588595) loss:2.933 lr:0.0000100 epoch_Time:23954.0min: [2024-01-06 09:10:36,201][model8_pretrain.py][INFO] Epoch:[0/2](825000/4588595) loss:3.051 lr:0.0000100 epoch_Time:23954.0min: [2024-01-06 09:10:36,201][model8_pretrain.py][INFO] Epoch:[0/2](825000/4588595) loss:3.098 lr:0.0000100 epoch_Time:23954.0min: [2024-01-06 09:10:36,201][model8_pretrain.py][INFO] Epoch:[0/2](825000/4588595) loss:1.533 lr:0.0000100 epoch_Time:23954.0min: [2024-01-06 09:10:36,201][model8_pretrain.py][INFO] Epoch:[0/2](825000/4588595) loss:2.578 lr:0.0000100 epoch_Time:23954.0min: [2024-01-06 09:10:36,201][model8_pretrain.py][INFO] Epoch:[0/2](825000/4588595) loss:2.606 lr:0.0000100 epoch_Time:23954.0min: [2024-01-06 09:10:36,202][model8_pretrain.py][INFO] Epoch:[0/2](825000/4588595) loss:2.549 lr:0.0000100 
epoch_Time:23954.0min: [2024-01-06 09:10:36,202][model8_pretrain.py][INFO] Epoch:[0/2](825000/4588595) loss:2.136 lr:0.0000100 epoch_Time:23954.0min: [2024-01-06 09:11:13,133][model8_pretrain.py][INFO] Epoch:[0/2](825100/4588595) loss:2.613 lr:0.0000100 epoch_Time:23953.0min: [2024-01-06 09:11:13,133][model8_pretrain.py][INFO] Epoch:[0/2](825100/4588595) loss:2.973 lr:0.0000100 epoch_Time:23953.0min: [2024-01-06 09:11:13,133][model8_pretrain.py][INFO] Epoch:[0/2](825100/4588595) loss:2.871 lr:0.0000100 epoch_Time:23953.0min: [2024-01-06 09:11:13,133][model8_pretrain.py][INFO] Epoch:[0/2](825100/4588595) loss:3.064 lr:0.0000100 epoch_Time:23953.0min: [2024-01-06 09:11:13,133][model8_pretrain.py][INFO] Epoch:[0/2](825100/4588595) loss:3.032 lr:0.0000100 epoch_Time:23953.0min: [2024-01-06 09:11:13,133][model8_pretrain.py][INFO] Epoch:[0/2](825100/4588595) loss:2.176 lr:0.0000100 epoch_Time:23953.0min: [2024-01-06 09:11:13,133][model8_pretrain.py][INFO] Epoch:[0/2](825100/4588595) loss:2.195 lr:0.0000100 epoch_Time:23953.0min: [2024-01-06 09:11:13,133][model8_pretrain.py][INFO] Epoch:[0/2](825100/4588595) loss:2.907 lr:0.0000100 epoch_Time:23953.0min: [2024-01-06 09:11:50,054][model8_pretrain.py][INFO] Epoch:[0/2](825200/4588595) loss:2.577 lr:0.0000100 epoch_Time:23952.0min: [2024-01-06 09:11:50,054][model8_pretrain.py][INFO] Epoch:[0/2](825200/4588595) loss:2.209 lr:0.0000100 epoch_Time:23952.0min: [2024-01-06 09:11:50,054][model8_pretrain.py][INFO] Epoch:[0/2](825200/4588595) loss:2.720 lr:0.0000100 epoch_Time:23952.0min: [2024-01-06 09:11:50,054][model8_pretrain.py][INFO] Epoch:[0/2](825200/4588595) loss:3.026 lr:0.0000100 epoch_Time:23952.0min: [2024-01-06 09:11:50,054][model8_pretrain.py][INFO] Epoch:[0/2](825200/4588595) loss:2.709 lr:0.0000100 epoch_Time:23952.0min: [2024-01-06 09:11:50,054][model8_pretrain.py][INFO] Epoch:[0/2](825200/4588595) loss:2.814 lr:0.0000100 epoch_Time:23952.0min: [2024-01-06 09:11:50,054][model8_pretrain.py][INFO] 
Epoch:[0/2](825200/4588595) loss:2.228 lr:0.0000100 epoch_Time:23952.0min: [2024-01-06 09:11:50,054][model8_pretrain.py][INFO] Epoch:[0/2](825200/4588595) loss:3.038 lr:0.0000100 epoch_Time:23952.0min: [2024-01-06 09:12:37,416][model8_pretrain.py][INFO] Epoch:[0/2](825300/4588595) loss:2.727 lr:0.0000100 epoch_Time:23953.0min: [2024-01-06 09:12:37,416][model8_pretrain.py][INFO] Epoch:[0/2](825300/4588595) loss:2.976 lr:0.0000100 epoch_Time:23953.0min: [2024-01-06 09:12:37,416][model8_pretrain.py][INFO] Epoch:[0/2](825300/4588595) loss:2.477 lr:0.0000100 epoch_Time:23953.0min: [2024-01-06 09:12:37,416][model8_pretrain.py][INFO] Epoch:[0/2](825300/4588595) loss:2.669 lr:0.0000100 epoch_Time:23953.0min: [2024-01-06 09:12:37,416][model8_pretrain.py][INFO] Epoch:[0/2](825300/4588595) loss:2.807 lr:0.0000100 epoch_Time:23953.0min: [2024-01-06 09:12:37,416][model8_pretrain.py][INFO] Epoch:[0/2](825300/4588595) loss:3.065 lr:0.0000100 epoch_Time:23953.0min: [2024-01-06 09:12:37,417][model8_pretrain.py][INFO] Epoch:[0/2](825300/4588595) loss:2.826 lr:0.0000100 epoch_Time:23953.0min: [2024-01-06 09:12:37,417][model8_pretrain.py][INFO] Epoch:[0/2](825300/4588595) loss:2.686 lr:0.0000100 epoch_Time:23953.0min: [2024-01-06 09:13:14,364][model8_pretrain.py][INFO] Epoch:[0/2](825400/4588595) loss:2.290 lr:0.0000100 epoch_Time:23952.0min: [2024-01-06 09:13:14,364][model8_pretrain.py][INFO] Epoch:[0/2](825400/4588595) loss:3.249 lr:0.0000100 epoch_Time:23952.0min: [2024-01-06 09:13:14,364][model8_pretrain.py][INFO] Epoch:[0/2](825400/4588595) loss:2.989 lr:0.0000100 epoch_Time:23952.0min: [2024-01-06 09:13:14,364][model8_pretrain.py][INFO] Epoch:[0/2](825400/4588595) loss:2.886 lr:0.0000100 epoch_Time:23952.0min: [2024-01-06 09:13:14,364][model8_pretrain.py][INFO] Epoch:[0/2](825400/4588595) loss:3.112 lr:0.0000100 epoch_Time:23952.0min: [2024-01-06 09:13:14,364][model8_pretrain.py][INFO] Epoch:[0/2](825400/4588595) loss:3.103 lr:0.0000100 epoch_Time:23952.0min: [2024-01-06 
09:13:14,364][model8_pretrain.py][INFO] Epoch:[0/2](825400/4588595) loss:3.180 lr:0.0000100 epoch_Time:23952.0min: [2024-01-06 09:13:14,364][model8_pretrain.py][INFO] Epoch:[0/2](825400/4588595) loss:2.807 lr:0.0000100 epoch_Time:23952.0min: [2024-01-06 09:13:51,312][model8_pretrain.py][INFO] Epoch:[0/2](825500/4588595) loss:3.025 lr:0.0000100 epoch_Time:23951.0min: [2024-01-06 09:13:51,312][model8_pretrain.py][INFO] Epoch:[0/2](825500/4588595) loss:3.068 lr:0.0000100 epoch_Time:23951.0min: [2024-01-06 09:13:51,312][model8_pretrain.py][INFO] Epoch:[0/2](825500/4588595) loss:2.518 lr:0.0000100 epoch_Time:23951.0min: [2024-01-06 09:13:51,312][model8_pretrain.py][INFO] Epoch:[0/2](825500/4588595) loss:3.390 lr:0.0000100 epoch_Time:23951.0min: [2024-01-06 09:13:51,312][model8_pretrain.py][INFO] Epoch:[0/2](825500/4588595) loss:2.753 lr:0.0000100 epoch_Time:23951.0min: [2024-01-06 09:13:51,312][model8_pretrain.py][INFO] Epoch:[0/2](825500/4588595) loss:2.737 lr:0.0000100 epoch_Time:23951.0min: [2024-01-06 09:13:51,312][model8_pretrain.py][INFO] Epoch:[0/2](825500/4588595) loss:2.355 lr:0.0000100 epoch_Time:23951.0min: [2024-01-06 09:13:51,312][model8_pretrain.py][INFO] Epoch:[0/2](825500/4588595) loss:3.413 lr:0.0000100 epoch_Time:23951.0min: [2024-01-06 09:14:28,254][model8_pretrain.py][INFO] Epoch:[0/2](825600/4588595) loss:2.676 lr:0.0000100 epoch_Time:23950.0min: [2024-01-06 09:14:28,254][model8_pretrain.py][INFO] Epoch:[0/2](825600/4588595) loss:2.456 lr:0.0000100 epoch_Time:23950.0min: [2024-01-06 09:14:28,254][model8_pretrain.py][INFO] Epoch:[0/2](825600/4588595) loss:2.782 lr:0.0000100 epoch_Time:23950.0min: [2024-01-06 09:14:28,254][model8_pretrain.py][INFO] Epoch:[0/2](825600/4588595) loss:2.610 lr:0.0000100 epoch_Time:23950.0min: [2024-01-06 09:14:28,254][model8_pretrain.py][INFO] Epoch:[0/2](825600/4588595) loss:2.797 lr:0.0000100 epoch_Time:23950.0min: [2024-01-06 09:14:28,255][model8_pretrain.py][INFO] Epoch:[0/2](825600/4588595) loss:2.429 lr:0.0000100 
epoch_Time:23950.0min: [2024-01-06 09:14:28,255][model8_pretrain.py][INFO] Epoch:[0/2](825600/4588595) loss:3.039 lr:0.0000100 epoch_Time:23950.0min: [2024-01-06 09:14:28,255][model8_pretrain.py][INFO] Epoch:[0/2](825600/4588595) loss:2.841 lr:0.0000100 epoch_Time:23950.0min: [2024-01-06 09:15:05,198][model8_pretrain.py][INFO] Epoch:[0/2](825700/4588595) loss:2.867 lr:0.0000100 epoch_Time:23949.0min: [2024-01-06 09:15:05,198][model8_pretrain.py][INFO] Epoch:[0/2](825700/4588595) loss:2.632 lr:0.0000100 epoch_Time:23949.0min: [2024-01-06 09:15:05,198][model8_pretrain.py][INFO] Epoch:[0/2](825700/4588595) loss:2.804 lr:0.0000100 epoch_Time:23949.0min: [2024-01-06 09:15:05,198][model8_pretrain.py][INFO] Epoch:[0/2](825700/4588595) loss:2.925 lr:0.0000100 epoch_Time:23949.0min: [2024-01-06 09:15:05,198][model8_pretrain.py][INFO] Epoch:[0/2](825700/4588595) loss:2.597 lr:0.0000100 epoch_Time:23949.0min: [2024-01-06 09:15:05,198][model8_pretrain.py][INFO] Epoch:[0/2](825700/4588595) loss:2.796 lr:0.0000100 epoch_Time:23949.0min: [2024-01-06 09:15:05,198][model8_pretrain.py][INFO] Epoch:[0/2](825700/4588595) loss:3.026 lr:0.0000100 epoch_Time:23949.0min: [2024-01-06 09:15:05,198][model8_pretrain.py][INFO] Epoch:[0/2](825700/4588595) loss:2.427 lr:0.0000100 epoch_Time:23949.0min: [2024-01-06 09:15:42,137][model8_pretrain.py][INFO] Epoch:[0/2](825800/4588595) loss:2.930 lr:0.0000100 epoch_Time:23949.0min: [2024-01-06 09:15:42,137][model8_pretrain.py][INFO] Epoch:[0/2](825800/4588595) loss:3.140 lr:0.0000100 epoch_Time:23949.0min: [2024-01-06 09:15:42,137][model8_pretrain.py][INFO] Epoch:[0/2](825800/4588595) loss:2.888 lr:0.0000100 epoch_Time:23949.0min: [2024-01-06 09:15:42,137][model8_pretrain.py][INFO] Epoch:[0/2](825800/4588595) loss:3.112 lr:0.0000100 epoch_Time:23949.0min: [2024-01-06 09:15:42,137][model8_pretrain.py][INFO] Epoch:[0/2](825800/4588595) loss:2.823 lr:0.0000100 epoch_Time:23949.0min: [2024-01-06 09:15:42,137][model8_pretrain.py][INFO] 
Epoch:[0/2](825800/4588595) loss:2.565 lr:0.0000100 epoch_Time:23949.0min: [2024-01-06 09:15:42,137][model8_pretrain.py][INFO] Epoch:[0/2](825800/4588595) loss:3.205 lr:0.0000100 epoch_Time:23949.0min: [2024-01-06 09:15:42,138][model8_pretrain.py][INFO] Epoch:[0/2](825800/4588595) loss:3.075 lr:0.0000100 epoch_Time:23949.0min: [2024-01-06 09:16:19,088][model8_pretrain.py][INFO] Epoch:[0/2](825900/4588595) loss:2.753 lr:0.0000100 epoch_Time:23948.0min: [2024-01-06 09:16:19,088][model8_pretrain.py][INFO] Epoch:[0/2](825900/4588595) loss:2.825 lr:0.0000100 epoch_Time:23948.0min: [2024-01-06 09:16:19,088][model8_pretrain.py][INFO] Epoch:[0/2](825900/4588595) loss:2.474 lr:0.0000100 epoch_Time:23948.0min: [2024-01-06 09:16:19,088][model8_pretrain.py][INFO] Epoch:[0/2](825900/4588595) loss:2.588 lr:0.0000100 epoch_Time:23948.0min: [2024-01-06 09:16:19,088][model8_pretrain.py][INFO] Epoch:[0/2](825900/4588595) loss:2.491 lr:0.0000100 epoch_Time:23948.0min: [2024-01-06 09:16:19,088][model8_pretrain.py][INFO] Epoch:[0/2](825900/4588595) loss:2.454 lr:0.0000100 epoch_Time:23948.0min: [2024-01-06 09:16:19,088][model8_pretrain.py][INFO] Epoch:[0/2](825900/4588595) loss:2.784 lr:0.0000100 epoch_Time:23948.0min: [2024-01-06 09:16:19,088][model8_pretrain.py][INFO] Epoch:[0/2](825900/4588595) loss:2.464 lr:0.0000100 epoch_Time:23948.0min: [2024-01-06 09:16:56,038][model8_pretrain.py][INFO] Epoch:[0/2](826000/4588595) loss:2.722 lr:0.0000100 epoch_Time:23947.0min: [2024-01-06 09:16:56,038][model8_pretrain.py][INFO] Epoch:[0/2](826000/4588595) loss:3.016 lr:0.0000100 epoch_Time:23947.0min: [2024-01-06 09:16:56,038][model8_pretrain.py][INFO] Epoch:[0/2](826000/4588595) loss:2.701 lr:0.0000100 epoch_Time:23947.0min: [2024-01-06 09:16:56,038][model8_pretrain.py][INFO] Epoch:[0/2](826000/4588595) loss:2.646 lr:0.0000100 epoch_Time:23947.0min: [2024-01-06 09:16:56,038][model8_pretrain.py][INFO] Epoch:[0/2](826000/4588595) loss:2.820 lr:0.0000100 epoch_Time:23947.0min: [2024-01-06 
09:16:56,038][model8_pretrain.py][INFO] Epoch:[0/2](826000/4588595) loss:2.699 lr:0.0000100 epoch_Time:23947.0min: [2024-01-06 09:16:56,038][model8_pretrain.py][INFO] Epoch:[0/2](826000/4588595) loss:3.168 lr:0.0000100 epoch_Time:23947.0min: [2024-01-06 09:16:56,038][model8_pretrain.py][INFO] Epoch:[0/2](826000/4588595) loss:2.917 lr:0.0000100 epoch_Time:23947.0min: [2024-01-06 09:17:43,352][model8_pretrain.py][INFO] Epoch:[0/2](826100/4588595) loss:2.590 lr:0.0000100 epoch_Time:23948.0min: [2024-01-06 09:17:43,352][model8_pretrain.py][INFO] Epoch:[0/2](826100/4588595) loss:2.519 lr:0.0000100 epoch_Time:23948.0min: [2024-01-06 09:17:43,352][model8_pretrain.py][INFO] Epoch:[0/2](826100/4588595) loss:2.075 lr:0.0000100 epoch_Time:23948.0min: [2024-01-06 09:17:43,352][model8_pretrain.py][INFO] Epoch:[0/2](826100/4588595) loss:3.354 lr:0.0000100 epoch_Time:23948.0min: [2024-01-06 09:17:43,352][model8_pretrain.py][INFO] Epoch:[0/2](826100/4588595) loss:3.269 lr:0.0000100 epoch_Time:23948.0min: [2024-01-06 09:17:43,352][model8_pretrain.py][INFO] Epoch:[0/2](826100/4588595) loss:3.230 lr:0.0000100 epoch_Time:23948.0min: [2024-01-06 09:17:43,352][model8_pretrain.py][INFO] Epoch:[0/2](826100/4588595) loss:2.832 lr:0.0000100 epoch_Time:23948.0min: [2024-01-06 09:17:43,352][model8_pretrain.py][INFO] Epoch:[0/2](826100/4588595) loss:2.745 lr:0.0000100 epoch_Time:23948.0min: [2024-01-06 09:18:20,263][model8_pretrain.py][INFO] Epoch:[0/2](826200/4588595) loss:2.728 lr:0.0000100 epoch_Time:23947.0min: [2024-01-06 09:18:20,263][model8_pretrain.py][INFO] Epoch:[0/2](826200/4588595) loss:2.934 lr:0.0000100 epoch_Time:23947.0min: [2024-01-06 09:18:20,263][model8_pretrain.py][INFO] Epoch:[0/2](826200/4588595) loss:3.141 lr:0.0000100 epoch_Time:23947.0min: [2024-01-06 09:18:20,263][model8_pretrain.py][INFO] Epoch:[0/2](826200/4588595) loss:2.899 lr:0.0000100 epoch_Time:23947.0min: [2024-01-06 09:18:20,263][model8_pretrain.py][INFO] Epoch:[0/2](826200/4588595) loss:2.813 lr:0.0000100 
epoch_Time:23947.0min: [2024-01-06 09:18:20,263][model8_pretrain.py][INFO] Epoch:[0/2](826200/4588595) loss:3.199 lr:0.0000100 epoch_Time:23947.0min: [2024-01-06 09:18:20,263][model8_pretrain.py][INFO] Epoch:[0/2](826200/4588595) loss:2.517 lr:0.0000100 epoch_Time:23947.0min: [2024-01-06 09:18:20,263][model8_pretrain.py][INFO] Epoch:[0/2](826200/4588595) loss:2.953 lr:0.0000100 epoch_Time:23947.0min: [2024-01-06 09:18:57,194][model8_pretrain.py][INFO] Epoch:[0/2](826300/4588595) loss:2.882 lr:0.0000100 epoch_Time:23946.0min: [2024-01-06 09:18:57,194][model8_pretrain.py][INFO] Epoch:[0/2](826300/4588595) loss:2.694 lr:0.0000100 epoch_Time:23946.0min: [2024-01-06 09:18:57,194][model8_pretrain.py][INFO] Epoch:[0/2](826300/4588595) loss:2.573 lr:0.0000100 epoch_Time:23946.0min: [2024-01-06 09:18:57,194][model8_pretrain.py][INFO] Epoch:[0/2](826300/4588595) loss:2.988 lr:0.0000100 epoch_Time:23946.0min: [2024-01-06 09:18:57,194][model8_pretrain.py][INFO] Epoch:[0/2](826300/4588595) loss:2.925 lr:0.0000100 epoch_Time:23946.0min: [2024-01-06 09:18:57,194][model8_pretrain.py][INFO] Epoch:[0/2](826300/4588595) loss:2.863 lr:0.0000100 epoch_Time:23946.0min: [2024-01-06 09:18:57,194][model8_pretrain.py][INFO] Epoch:[0/2](826300/4588595) loss:3.168 lr:0.0000100 epoch_Time:23946.0min: [2024-01-06 09:18:57,195][model8_pretrain.py][INFO] Epoch:[0/2](826300/4588595) loss:2.554 lr:0.0000100 epoch_Time:23946.0min: [2024-01-06 09:19:34,138][model8_pretrain.py][INFO] Epoch:[0/2](826400/4588595) loss:3.185 lr:0.0000100 epoch_Time:23945.0min: [2024-01-06 09:19:34,138][model8_pretrain.py][INFO] Epoch:[0/2](826400/4588595) loss:3.158 lr:0.0000100 epoch_Time:23945.0min: [2024-01-06 09:19:34,138][model8_pretrain.py][INFO] Epoch:[0/2](826400/4588595) loss:2.715 lr:0.0000100 epoch_Time:23945.0min: [2024-01-06 09:19:34,138][model8_pretrain.py][INFO] Epoch:[0/2](826400/4588595) loss:2.932 lr:0.0000100 epoch_Time:23945.0min: [2024-01-06 09:19:34,138][model8_pretrain.py][INFO] 
Epoch:[0/2](826400/4588595) loss:2.559 lr:0.0000100 epoch_Time:23945.0min: [2024-01-06 09:19:34,138][model8_pretrain.py][INFO] Epoch:[0/2](826400/4588595) loss:2.771 lr:0.0000100 epoch_Time:23945.0min: [2024-01-06 09:19:34,139][model8_pretrain.py][INFO] Epoch:[0/2](826400/4588595) loss:3.229 lr:0.0000100 epoch_Time:23945.0min: [2024-01-06 09:19:34,140][model8_pretrain.py][INFO] Epoch:[0/2](826400/4588595) loss:2.198 lr:0.0000100 epoch_Time:23945.0min: [2024-01-06 09:20:11,080][model8_pretrain.py][INFO] Epoch:[0/2](826500/4588595) loss:2.661 lr:0.0000100 epoch_Time:23944.0min: [2024-01-06 09:20:11,080][model8_pretrain.py][INFO] Epoch:[0/2](826500/4588595) loss:2.614 lr:0.0000100 epoch_Time:23944.0min: [2024-01-06 09:20:11,080][model8_pretrain.py][INFO] Epoch:[0/2](826500/4588595) loss:2.950 lr:0.0000100 epoch_Time:23944.0min: [2024-01-06 09:20:11,080][model8_pretrain.py][INFO] Epoch:[0/2](826500/4588595) loss:2.144 lr:0.0000100 epoch_Time:23944.0min: [2024-01-06 09:20:11,080][model8_pretrain.py][INFO] Epoch:[0/2](826500/4588595) loss:2.921 lr:0.0000100 epoch_Time:23944.0min: [2024-01-06 09:20:11,080][model8_pretrain.py][INFO] Epoch:[0/2](826500/4588595) loss:2.549 lr:0.0000100 epoch_Time:23944.0min: [2024-01-06 09:20:11,080][model8_pretrain.py][INFO] Epoch:[0/2](826500/4588595) loss:2.661 lr:0.0000100 epoch_Time:23944.0min: [2024-01-06 09:20:11,080][model8_pretrain.py][INFO] Epoch:[0/2](826500/4588595) loss:2.931 lr:0.0000100 epoch_Time:23944.0min: [2024-01-06 09:20:48,019][model8_pretrain.py][INFO] Epoch:[0/2](826600/4588595) loss:2.894 lr:0.0000100 epoch_Time:23943.0min: [2024-01-06 09:20:48,019][model8_pretrain.py][INFO] Epoch:[0/2](826600/4588595) loss:2.479 lr:0.0000100 epoch_Time:23943.0min: [2024-01-06 09:20:48,019][model8_pretrain.py][INFO] Epoch:[0/2](826600/4588595) loss:2.747 lr:0.0000100 epoch_Time:23943.0min: [2024-01-06 09:20:48,019][model8_pretrain.py][INFO] Epoch:[0/2](826600/4588595) loss:2.836 lr:0.0000100 epoch_Time:23943.0min: [2024-01-06 
09:20:48,019][model8_pretrain.py][INFO] Epoch:[0/2](826600/4588595) loss:2.428 lr:0.0000100 epoch_Time:23943.0min: [2024-01-06 09:20:48,019][model8_pretrain.py][INFO] Epoch:[0/2](826600/4588595) loss:2.995 lr:0.0000100 epoch_Time:23943.0min: [2024-01-06 09:20:48,019][model8_pretrain.py][INFO] Epoch:[0/2](826600/4588595) loss:2.771 lr:0.0000100 epoch_Time:23943.0min: [2024-01-06 09:20:48,020][model8_pretrain.py][INFO] Epoch:[0/2](826600/4588595) loss:2.227 lr:0.0000100 epoch_Time:23943.0min: [2024-01-06 09:21:24,965][model8_pretrain.py][INFO] Epoch:[0/2](826700/4588595) loss:2.649 lr:0.0000100 epoch_Time:23943.0min: [2024-01-06 09:21:24,965][model8_pretrain.py][INFO] Epoch:[0/2](826700/4588595) loss:2.805 lr:0.0000100 epoch_Time:23943.0min: [2024-01-06 09:21:24,965][model8_pretrain.py][INFO] Epoch:[0/2](826700/4588595) loss:2.683 lr:0.0000100 epoch_Time:23943.0min: [2024-01-06 09:21:24,965][model8_pretrain.py][INFO] Epoch:[0/2](826700/4588595) loss:3.176 lr:0.0000100 epoch_Time:23943.0min: [2024-01-06 09:21:24,965][model8_pretrain.py][INFO] Epoch:[0/2](826700/4588595) loss:2.731 lr:0.0000100 epoch_Time:23943.0min: [2024-01-06 09:21:24,965][model8_pretrain.py][INFO] Epoch:[0/2](826700/4588595) loss:2.791 lr:0.0000100 epoch_Time:23943.0min: [2024-01-06 09:21:24,965][model8_pretrain.py][INFO] Epoch:[0/2](826700/4588595) loss:3.045 lr:0.0000100 epoch_Time:23943.0min: [2024-01-06 09:21:24,965][model8_pretrain.py][INFO] Epoch:[0/2](826700/4588595) loss:2.737 lr:0.0000100 epoch_Time:23943.0min: [2024-01-06 09:22:01,910][model8_pretrain.py][INFO] Epoch:[0/2](826800/4588595) loss:3.154 lr:0.0000100 epoch_Time:23942.0min: [2024-01-06 09:22:01,910][model8_pretrain.py][INFO] Epoch:[0/2](826800/4588595) loss:2.647 lr:0.0000100 epoch_Time:23942.0min: [2024-01-06 09:22:01,910][model8_pretrain.py][INFO] Epoch:[0/2](826800/4588595) loss:2.938 lr:0.0000100 epoch_Time:23942.0min: [2024-01-06 09:22:01,910][model8_pretrain.py][INFO] Epoch:[0/2](826800/4588595) loss:2.700 lr:0.0000100 
epoch_Time:23942.0min: [2024-01-06 09:22:01,910][model8_pretrain.py][INFO] Epoch:[0/2](826800/4588595) loss:2.895 lr:0.0000100 epoch_Time:23942.0min: [2024-01-06 09:22:01,910][model8_pretrain.py][INFO] Epoch:[0/2](826800/4588595) loss:2.506 lr:0.0000100 epoch_Time:23942.0min: [2024-01-06 09:22:01,910][model8_pretrain.py][INFO] Epoch:[0/2](826800/4588595) loss:2.569 lr:0.0000100 epoch_Time:23942.0min: [2024-01-06 09:22:01,910][model8_pretrain.py][INFO] Epoch:[0/2](826800/4588595) loss:2.342 lr:0.0000100 epoch_Time:23942.0min: [2024-01-06 09:22:49,411][model8_pretrain.py][INFO] Epoch:[0/2](826900/4588595) loss:2.565 lr:0.0000100 epoch_Time:23942.0min: [2024-01-06 09:22:49,411][model8_pretrain.py][INFO] Epoch:[0/2](826900/4588595) loss:2.286 lr:0.0000100 epoch_Time:23942.0min: [2024-01-06 09:22:49,411][model8_pretrain.py][INFO] Epoch:[0/2](826900/4588595) loss:2.548 lr:0.0000100 epoch_Time:23942.0min: [2024-01-06 09:22:49,411][model8_pretrain.py][INFO] Epoch:[0/2](826900/4588595) loss:2.577 lr:0.0000100 epoch_Time:23942.0min: [2024-01-06 09:22:49,411][model8_pretrain.py][INFO] Epoch:[0/2](826900/4588595) loss:3.189 lr:0.0000100 epoch_Time:23942.0min: [2024-01-06 09:22:49,411][model8_pretrain.py][INFO] Epoch:[0/2](826900/4588595) loss:3.169 lr:0.0000100 epoch_Time:23942.0min: [2024-01-06 09:22:49,411][model8_pretrain.py][INFO] Epoch:[0/2](826900/4588595) loss:2.752 lr:0.0000100 epoch_Time:23942.0min: [2024-01-06 09:22:49,411][model8_pretrain.py][INFO] Epoch:[0/2](826900/4588595) loss:2.933 lr:0.0000100 epoch_Time:23942.0min: [2024-01-06 09:23:26,333][model8_pretrain.py][INFO] Epoch:[0/2](827000/4588595) loss:2.908 lr:0.0000100 epoch_Time:23942.0min: [2024-01-06 09:23:26,333][model8_pretrain.py][INFO] Epoch:[0/2](827000/4588595) loss:2.458 lr:0.0000100 epoch_Time:23942.0min: [2024-01-06 09:23:26,333][model8_pretrain.py][INFO] Epoch:[0/2](827000/4588595) loss:2.366 lr:0.0000100 epoch_Time:23942.0min: [2024-01-06 09:23:26,333][model8_pretrain.py][INFO] 
Epoch:[0/2](827000/4588595) loss:2.634 lr:0.0000100 epoch_Time:23942.0min: [2024-01-06 09:23:26,333][model8_pretrain.py][INFO] Epoch:[0/2](827000/4588595) loss:2.781 lr:0.0000100 epoch_Time:23942.0min: [2024-01-06 09:23:26,333][model8_pretrain.py][INFO] Epoch:[0/2](827000/4588595) loss:3.009 lr:0.0000100 epoch_Time:23942.0min: [2024-01-06 09:23:26,333][model8_pretrain.py][INFO] Epoch:[0/2](827000/4588595) loss:2.985 lr:0.0000100 epoch_Time:23942.0min: [2024-01-06 09:23:26,333][model8_pretrain.py][INFO] Epoch:[0/2](827000/4588595) loss:2.595 lr:0.0000100 epoch_Time:23942.0min: [2024-01-06 09:24:03,266][model8_pretrain.py][INFO] Epoch:[0/2](827100/4588595) loss:2.627 lr:0.0000100 epoch_Time:23941.0min: [2024-01-06 09:24:03,266][model8_pretrain.py][INFO] Epoch:[0/2](827100/4588595) loss:3.136 lr:0.0000100 epoch_Time:23941.0min: [2024-01-06 09:24:03,266][model8_pretrain.py][INFO] Epoch:[0/2](827100/4588595) loss:2.394 lr:0.0000100 epoch_Time:23941.0min: [2024-01-06 09:24:03,266][model8_pretrain.py][INFO] Epoch:[0/2](827100/4588595) loss:2.680 lr:0.0000100 epoch_Time:23941.0min: [2024-01-06 09:24:03,266][model8_pretrain.py][INFO] Epoch:[0/2](827100/4588595) loss:2.934 lr:0.0000100 epoch_Time:23941.0min: [2024-01-06 09:24:03,266][model8_pretrain.py][INFO] Epoch:[0/2](827100/4588595) loss:2.984 lr:0.0000100 epoch_Time:23941.0min: [2024-01-06 09:24:03,266][model8_pretrain.py][INFO] Epoch:[0/2](827100/4588595) loss:3.129 lr:0.0000100 epoch_Time:23941.0min: [2024-01-06 09:24:03,266][model8_pretrain.py][INFO] Epoch:[0/2](827100/4588595) loss:2.838 lr:0.0000100 epoch_Time:23941.0min: [2024-01-06 09:24:40,207][model8_pretrain.py][INFO] Epoch:[0/2](827200/4588595) loss:3.076 lr:0.0000100 epoch_Time:23940.0min: [2024-01-06 09:24:40,207][model8_pretrain.py][INFO] Epoch:[0/2](827200/4588595) loss:2.744 lr:0.0000100 epoch_Time:23940.0min: [2024-01-06 09:24:40,207][model8_pretrain.py][INFO] Epoch:[0/2](827200/4588595) loss:2.867 lr:0.0000100 epoch_Time:23940.0min: [2024-01-06 
09:24:40,207][model8_pretrain.py][INFO] Epoch:[0/2](827200/4588595) loss:2.787 lr:0.0000100 epoch_Time:23940.0min: [2024-01-06 09:24:40,207][model8_pretrain.py][INFO] Epoch:[0/2](827200/4588595) loss:2.535 lr:0.0000100 epoch_Time:23940.0min: [2024-01-06 09:24:40,207][model8_pretrain.py][INFO] Epoch:[0/2](827200/4588595) loss:2.720 lr:0.0000100 epoch_Time:23940.0min: [2024-01-06 09:24:40,207][model8_pretrain.py][INFO] Epoch:[0/2](827200/4588595) loss:3.202 lr:0.0000100 epoch_Time:23940.0min: [2024-01-06 09:24:40,207][model8_pretrain.py][INFO] Epoch:[0/2](827200/4588595) loss:2.959 lr:0.0000100 epoch_Time:23940.0min: [2024-01-06 09:25:17,136][model8_pretrain.py][INFO] Epoch:[0/2](827300/4588595) loss:3.102 lr:0.0000100 epoch_Time:23939.0min: [2024-01-06 09:25:17,136][model8_pretrain.py][INFO] Epoch:[0/2](827300/4588595) loss:2.860 lr:0.0000100 epoch_Time:23939.0min: [2024-01-06 09:25:17,136][model8_pretrain.py][INFO] Epoch:[0/2](827300/4588595) loss:2.545 lr:0.0000100 epoch_Time:23939.0min: [2024-01-06 09:25:17,136][model8_pretrain.py][INFO] Epoch:[0/2](827300/4588595) loss:3.333 lr:0.0000100 epoch_Time:23939.0min: [2024-01-06 09:25:17,137][model8_pretrain.py][INFO] Epoch:[0/2](827300/4588595) loss:2.986 lr:0.0000100 epoch_Time:23939.0min: [2024-01-06 09:25:17,137][model8_pretrain.py][INFO] Epoch:[0/2](827300/4588595) loss:2.481 lr:0.0000100 epoch_Time:23939.0min: [2024-01-06 09:25:17,137][model8_pretrain.py][INFO] Epoch:[0/2](827300/4588595) loss:2.827 lr:0.0000100 epoch_Time:23939.0min: [2024-01-06 09:25:17,138][model8_pretrain.py][INFO] Epoch:[0/2](827300/4588595) loss:2.851 lr:0.0000100 epoch_Time:23939.0min: [2024-01-06 09:25:54,085][model8_pretrain.py][INFO] Epoch:[0/2](827400/4588595) loss:2.016 lr:0.0000100 epoch_Time:23938.0min: [2024-01-06 09:25:54,085][model8_pretrain.py][INFO] Epoch:[0/2](827400/4588595) loss:2.925 lr:0.0000100 epoch_Time:23938.0min: [2024-01-06 09:25:54,085][model8_pretrain.py][INFO] Epoch:[0/2](827400/4588595) loss:3.110 lr:0.0000100 
epoch_Time:23938.0min: [2024-01-06 09:25:54,085][model8_pretrain.py][INFO] Epoch:[0/2](827400/4588595) loss:2.925 lr:0.0000100 epoch_Time:23938.0min: [2024-01-06 09:25:54,085][model8_pretrain.py][INFO] Epoch:[0/2](827400/4588595) loss:2.450 lr:0.0000100 epoch_Time:23938.0min: [2024-01-06 09:25:54,085][model8_pretrain.py][INFO] Epoch:[0/2](827400/4588595) loss:2.911 lr:0.0000100 epoch_Time:23938.0min: [2024-01-06 09:25:54,085][model8_pretrain.py][INFO] Epoch:[0/2](827400/4588595) loss:3.051 lr:0.0000100 epoch_Time:23938.0min: [2024-01-06 09:25:54,085][model8_pretrain.py][INFO] Epoch:[0/2](827400/4588595) loss:2.666 lr:0.0000100 epoch_Time:23938.0min: [2024-01-06 09:26:31,023][model8_pretrain.py][INFO] Epoch:[0/2](827500/4588595) loss:2.312 lr:0.0000100 epoch_Time:23938.0min: [2024-01-06 09:26:31,023][model8_pretrain.py][INFO] Epoch:[0/2](827500/4588595) loss:2.811 lr:0.0000100 epoch_Time:23938.0min: [2024-01-06 09:26:31,023][model8_pretrain.py][INFO] Epoch:[0/2](827500/4588595) loss:2.919 lr:0.0000100 epoch_Time:23938.0min: [2024-01-06 09:26:31,023][model8_pretrain.py][INFO] Epoch:[0/2](827500/4588595) loss:2.341 lr:0.0000100 epoch_Time:23938.0min: [2024-01-06 09:26:31,023][model8_pretrain.py][INFO] Epoch:[0/2](827500/4588595) loss:2.749 lr:0.0000100 epoch_Time:23938.0min: [2024-01-06 09:26:31,023][model8_pretrain.py][INFO] Epoch:[0/2](827500/4588595) loss:2.544 lr:0.0000100 epoch_Time:23938.0min: [2024-01-06 09:26:31,023][model8_pretrain.py][INFO] Epoch:[0/2](827500/4588595) loss:2.757 lr:0.0000100 epoch_Time:23938.0min: [2024-01-06 09:26:31,023][model8_pretrain.py][INFO] Epoch:[0/2](827500/4588595) loss:3.221 lr:0.0000100 epoch_Time:23938.0min: [2024-01-06 09:27:07,956][model8_pretrain.py][INFO] Epoch:[0/2](827600/4588595) loss:3.114 lr:0.0000100 epoch_Time:23937.0min: [2024-01-06 09:27:07,956][model8_pretrain.py][INFO] Epoch:[0/2](827600/4588595) loss:2.386 lr:0.0000100 epoch_Time:23937.0min: [2024-01-06 09:27:07,956][model8_pretrain.py][INFO] 
Epoch:[0/2](827600/4588595) loss:3.019 lr:0.0000100 epoch_Time:23937.0min: [2024-01-06 09:27:07,956][model8_pretrain.py][INFO] Epoch:[0/2](827600/4588595) loss:2.968 lr:0.0000100 epoch_Time:23937.0min: [2024-01-06 09:27:07,956][model8_pretrain.py][INFO] Epoch:[0/2](827600/4588595) loss:2.192 lr:0.0000100 epoch_Time:23937.0min: [2024-01-06 09:27:07,956][model8_pretrain.py][INFO] Epoch:[0/2](827600/4588595) loss:2.951 lr:0.0000100 epoch_Time:23937.0min: [2024-01-06 09:27:07,956][model8_pretrain.py][INFO] Epoch:[0/2](827600/4588595) loss:3.000 lr:0.0000100 epoch_Time:23937.0min: [2024-01-06 09:27:07,956][model8_pretrain.py][INFO] Epoch:[0/2](827600/4588595) loss:2.732 lr:0.0000100 epoch_Time:23937.0min: [2024-01-06 09:27:55,078][model8_pretrain.py][INFO] Epoch:[0/2](827700/4588595) loss:3.024 lr:0.0000100 epoch_Time:23937.0min: [2024-01-06 09:27:55,078][model8_pretrain.py][INFO] Epoch:[0/2](827700/4588595) loss:2.960 lr:0.0000100 epoch_Time:23937.0min: [2024-01-06 09:27:55,078][model8_pretrain.py][INFO] Epoch:[0/2](827700/4588595) loss:2.864 lr:0.0000100 epoch_Time:23937.0min: [2024-01-06 09:27:55,078][model8_pretrain.py][INFO] Epoch:[0/2](827700/4588595) loss:2.122 lr:0.0000100 epoch_Time:23937.0min: [2024-01-06 09:27:55,078][model8_pretrain.py][INFO] Epoch:[0/2](827700/4588595) loss:3.286 lr:0.0000100 epoch_Time:23937.0min: [2024-01-06 09:27:55,078][model8_pretrain.py][INFO] Epoch:[0/2](827700/4588595) loss:3.221 lr:0.0000100 epoch_Time:23937.0min: [2024-01-06 09:27:55,078][model8_pretrain.py][INFO] Epoch:[0/2](827700/4588595) loss:2.712 lr:0.0000100 epoch_Time:23937.0min: [2024-01-06 09:27:55,078][model8_pretrain.py][INFO] Epoch:[0/2](827700/4588595) loss:3.141 lr:0.0000100 epoch_Time:23937.0min: [2024-01-06 09:28:32,008][model8_pretrain.py][INFO] Epoch:[0/2](827800/4588595) loss:3.036 lr:0.0000100 epoch_Time:23937.0min: [2024-01-06 09:28:32,008][model8_pretrain.py][INFO] Epoch:[0/2](827800/4588595) loss:3.014 lr:0.0000100 epoch_Time:23937.0min: [2024-01-06 
09:28:32,008][model8_pretrain.py][INFO] Epoch:[0/2](827800/4588595) loss:3.630 lr:0.0000100 epoch_Time:23937.0min: [2024-01-06 09:28:32,008][model8_pretrain.py][INFO] Epoch:[0/2](827800/4588595) loss:2.874 lr:0.0000100 epoch_Time:23937.0min: [2024-01-06 09:28:32,008][model8_pretrain.py][INFO] Epoch:[0/2](827800/4588595) loss:2.364 lr:0.0000100 epoch_Time:23937.0min: [2024-01-06 09:28:32,008][model8_pretrain.py][INFO] Epoch:[0/2](827800/4588595) loss:2.873 lr:0.0000100 epoch_Time:23937.0min: [2024-01-06 09:28:32,008][model8_pretrain.py][INFO] Epoch:[0/2](827800/4588595) loss:3.044 lr:0.0000100 epoch_Time:23937.0min: [2024-01-06 09:28:32,008][model8_pretrain.py][INFO] Epoch:[0/2](827800/4588595) loss:2.742 lr:0.0000100 epoch_Time:23937.0min: [2024-01-06 09:29:08,942][model8_pretrain.py][INFO] Epoch:[0/2](827900/4588595) loss:2.603 lr:0.0000100 epoch_Time:23936.0min: [2024-01-06 09:29:08,941][model8_pretrain.py][INFO] Epoch:[0/2](827900/4588595) loss:2.785 lr:0.0000100 epoch_Time:23936.0min: [2024-01-06 09:29:08,941][model8_pretrain.py][INFO] Epoch:[0/2](827900/4588595) loss:2.909 lr:0.0000100 epoch_Time:23936.0min: [2024-01-06 09:29:08,942][model8_pretrain.py][INFO] Epoch:[0/2](827900/4588595) loss:2.807 lr:0.0000100 epoch_Time:23936.0min: [2024-01-06 09:29:08,942][model8_pretrain.py][INFO] Epoch:[0/2](827900/4588595) loss:2.665 lr:0.0000100 epoch_Time:23936.0min: [2024-01-06 09:29:08,942][model8_pretrain.py][INFO] Epoch:[0/2](827900/4588595) loss:2.628 lr:0.0000100 epoch_Time:23936.0min: [2024-01-06 09:29:08,942][model8_pretrain.py][INFO] Epoch:[0/2](827900/4588595) loss:2.858 lr:0.0000100 epoch_Time:23936.0min: [2024-01-06 09:29:08,942][model8_pretrain.py][INFO] Epoch:[0/2](827900/4588595) loss:2.616 lr:0.0000100 epoch_Time:23936.0min: [2024-01-06 09:29:45,874][model8_pretrain.py][INFO] Epoch:[0/2](828000/4588595) loss:2.551 lr:0.0000100 epoch_Time:23935.0min: [2024-01-06 09:29:45,874][model8_pretrain.py][INFO] Epoch:[0/2](828000/4588595) loss:2.774 lr:0.0000100 
epoch_Time:23935.0min: [2024-01-06 09:29:45,874][model8_pretrain.py][INFO] Epoch:[0/2](828000/4588595) loss:3.315 lr:0.0000100 epoch_Time:23935.0min: [2024-01-06 09:29:45,874][model8_pretrain.py][INFO] Epoch:[0/2](828000/4588595) loss:2.372 lr:0.0000100 epoch_Time:23935.0min: [2024-01-06 09:29:45,874][model8_pretrain.py][INFO] Epoch:[0/2](828000/4588595) loss:2.698 lr:0.0000100 epoch_Time:23935.0min: [2024-01-06 09:29:45,874][model8_pretrain.py][INFO] Epoch:[0/2](828000/4588595) loss:3.429 lr:0.0000100 epoch_Time:23935.0min: [2024-01-06 09:29:45,875][model8_pretrain.py][INFO] Epoch:[0/2](828000/4588595) loss:2.864 lr:0.0000100 epoch_Time:23935.0min: [2024-01-06 09:29:45,875][model8_pretrain.py][INFO] Epoch:[0/2](828000/4588595) loss:2.924 lr:0.0000100 epoch_Time:23935.0min: [2024-01-06 09:30:22,815][model8_pretrain.py][INFO] Epoch:[0/2](828100/4588595) loss:2.424 lr:0.0000100 epoch_Time:23934.0min: [2024-01-06 09:30:22,815][model8_pretrain.py][INFO] Epoch:[0/2](828100/4588595) loss:2.769 lr:0.0000100 epoch_Time:23934.0min: [2024-01-06 09:30:22,815][model8_pretrain.py][INFO] Epoch:[0/2](828100/4588595) loss:2.511 lr:0.0000100 epoch_Time:23934.0min: [2024-01-06 09:30:22,815][model8_pretrain.py][INFO] Epoch:[0/2](828100/4588595) loss:2.256 lr:0.0000100 epoch_Time:23934.0min: [2024-01-06 09:30:22,815][model8_pretrain.py][INFO] Epoch:[0/2](828100/4588595) loss:2.942 lr:0.0000100 epoch_Time:23934.0min: [2024-01-06 09:30:22,815][model8_pretrain.py][INFO] Epoch:[0/2](828100/4588595) loss:2.541 lr:0.0000100 epoch_Time:23934.0min: [2024-01-06 09:30:22,815][model8_pretrain.py][INFO] Epoch:[0/2](828100/4588595) loss:2.726 lr:0.0000100 epoch_Time:23934.0min: [2024-01-06 09:30:22,816][model8_pretrain.py][INFO] Epoch:[0/2](828100/4588595) loss:2.729 lr:0.0000100 epoch_Time:23934.0min: [2024-01-06 09:30:59,752][model8_pretrain.py][INFO] Epoch:[0/2](828200/4588595) loss:2.718 lr:0.0000100 epoch_Time:23933.0min: [2024-01-06 09:30:59,752][model8_pretrain.py][INFO] 
Epoch:[0/2](828200/4588595) loss:2.362 lr:0.0000100 epoch_Time:23933.0min: [2024-01-06 09:30:59,752][model8_pretrain.py][INFO] Epoch:[0/2](828200/4588595) loss:2.810 lr:0.0000100 epoch_Time:23933.0min: [2024-01-06 09:30:59,752][model8_pretrain.py][INFO] Epoch:[0/2](828200/4588595) loss:2.667 lr:0.0000100 epoch_Time:23933.0min: [2024-01-06 09:30:59,753][model8_pretrain.py][INFO] Epoch:[0/2](828200/4588595) loss:2.826 lr:0.0000100 epoch_Time:23933.0min: [2024-01-06 09:30:59,753][model8_pretrain.py][INFO] Epoch:[0/2](828200/4588595) loss:2.917 lr:0.0000100 epoch_Time:23933.0min: [2024-01-06 09:30:59,753][model8_pretrain.py][INFO] Epoch:[0/2](828200/4588595) loss:2.721 lr:0.0000100 epoch_Time:23933.0min: [2024-01-06 09:30:59,753][model8_pretrain.py][INFO] Epoch:[0/2](828200/4588595) loss:2.395 lr:0.0000100 epoch_Time:23933.0min: [2024-01-06 09:31:36,694][model8_pretrain.py][INFO] Epoch:[0/2](828300/4588595) loss:2.711 lr:0.0000100 epoch_Time:23933.0min: [2024-01-06 09:31:36,694][model8_pretrain.py][INFO] Epoch:[0/2](828300/4588595) loss:3.225 lr:0.0000100 epoch_Time:23933.0min: [2024-01-06 09:31:36,694][model8_pretrain.py][INFO] Epoch:[0/2](828300/4588595) loss:2.180 lr:0.0000100 epoch_Time:23933.0min: [2024-01-06 09:31:36,694][model8_pretrain.py][INFO] Epoch:[0/2](828300/4588595) loss:2.914 lr:0.0000100 epoch_Time:23933.0min: [2024-01-06 09:31:36,694][model8_pretrain.py][INFO] Epoch:[0/2](828300/4588595) loss:2.730 lr:0.0000100 epoch_Time:23933.0min: [2024-01-06 09:31:36,694][model8_pretrain.py][INFO] Epoch:[0/2](828300/4588595) loss:3.229 lr:0.0000100 epoch_Time:23933.0min: [2024-01-06 09:31:36,694][model8_pretrain.py][INFO] Epoch:[0/2](828300/4588595) loss:3.479 lr:0.0000100 epoch_Time:23933.0min: [2024-01-06 09:31:36,694][model8_pretrain.py][INFO] Epoch:[0/2](828300/4588595) loss:2.755 lr:0.0000100 epoch_Time:23933.0min: [2024-01-06 09:32:13,637][model8_pretrain.py][INFO] Epoch:[0/2](828400/4588595) loss:2.711 lr:0.0000100 epoch_Time:23932.0min: [2024-01-06 
09:32:13,637][model8_pretrain.py][INFO] Epoch:[0/2](828400/4588595) loss:2.368 lr:0.0000100 epoch_Time:23932.0min: [2024-01-06 09:32:13,637][model8_pretrain.py][INFO] Epoch:[0/2](828400/4588595) loss:3.243 lr:0.0000100 epoch_Time:23932.0min: [2024-01-06 09:32:13,637][model8_pretrain.py][INFO] Epoch:[0/2](828400/4588595) loss:2.638 lr:0.0000100 epoch_Time:23932.0min: [2024-01-06 09:32:13,637][model8_pretrain.py][INFO] Epoch:[0/2](828400/4588595) loss:2.747 lr:0.0000100 epoch_Time:23932.0min: [2024-01-06 09:32:13,637][model8_pretrain.py][INFO] Epoch:[0/2](828400/4588595) loss:2.693 lr:0.0000100 epoch_Time:23932.0min: [2024-01-06 09:32:13,637][model8_pretrain.py][INFO] Epoch:[0/2](828400/4588595) loss:2.588 lr:0.0000100 epoch_Time:23932.0min: [2024-01-06 09:32:13,637][model8_pretrain.py][INFO] Epoch:[0/2](828400/4588595) loss:2.849 lr:0.0000100 epoch_Time:23932.0min: [2024-01-06 09:33:00,343][model8_pretrain.py][INFO] Epoch:[0/2](828500/4588595) loss:2.375 lr:0.0000100 epoch_Time:23932.0min: [2024-01-06 09:33:00,343][model8_pretrain.py][INFO] Epoch:[0/2](828500/4588595) loss:2.692 lr:0.0000100 epoch_Time:23932.0min: [2024-01-06 09:33:00,343][model8_pretrain.py][INFO] Epoch:[0/2](828500/4588595) loss:2.637 lr:0.0000100 epoch_Time:23932.0min: [2024-01-06 09:33:00,343][model8_pretrain.py][INFO] Epoch:[0/2](828500/4588595) loss:2.923 lr:0.0000100 epoch_Time:23932.0min: [2024-01-06 09:33:00,343][model8_pretrain.py][INFO] Epoch:[0/2](828500/4588595) loss:3.006 lr:0.0000100 epoch_Time:23932.0min: [2024-01-06 09:33:00,343][model8_pretrain.py][INFO] Epoch:[0/2](828500/4588595) loss:3.482 lr:0.0000100 epoch_Time:23932.0min: [2024-01-06 09:33:00,343][model8_pretrain.py][INFO] Epoch:[0/2](828500/4588595) loss:2.907 lr:0.0000100 epoch_Time:23932.0min: [2024-01-06 09:33:00,343][model8_pretrain.py][INFO] Epoch:[0/2](828500/4588595) loss:2.771 lr:0.0000100 epoch_Time:23932.0min: [2024-01-06 09:33:37,260][model8_pretrain.py][INFO] Epoch:[0/2](828600/4588595) loss:2.201 lr:0.0000100 
epoch_Time:23932.0min: [2024-01-06 09:33:37,260][model8_pretrain.py][INFO] Epoch:[0/2](828600/4588595) loss:2.971 lr:0.0000100 epoch_Time:23932.0min: [2024-01-06 09:33:37,260][model8_pretrain.py][INFO] Epoch:[0/2](828600/4588595) loss:2.930 lr:0.0000100 epoch_Time:23932.0min: [2024-01-06 09:33:37,260][model8_pretrain.py][INFO] Epoch:[0/2](828600/4588595) loss:3.079 lr:0.0000100 epoch_Time:23932.0min: [2024-01-06 09:33:37,260][model8_pretrain.py][INFO] Epoch:[0/2](828600/4588595) loss:2.927 lr:0.0000100 epoch_Time:23932.0min: [2024-01-06 09:33:37,260][model8_pretrain.py][INFO] Epoch:[0/2](828600/4588595) loss:2.680 lr:0.0000100 epoch_Time:23932.0min: [2024-01-06 09:33:37,260][model8_pretrain.py][INFO] Epoch:[0/2](828600/4588595) loss:3.131 lr:0.0000100 epoch_Time:23932.0min: [2024-01-06 09:33:37,260][model8_pretrain.py][INFO] Epoch:[0/2](828600/4588595) loss:3.375 lr:0.0000100 epoch_Time:23932.0min: [2024-01-06 09:34:14,192][model8_pretrain.py][INFO] Epoch:[0/2](828700/4588595) loss:3.016 lr:0.0000100 epoch_Time:23931.0min: [2024-01-06 09:34:14,192][model8_pretrain.py][INFO] Epoch:[0/2](828700/4588595) loss:2.567 lr:0.0000100 epoch_Time:23931.0min: [2024-01-06 09:34:14,192][model8_pretrain.py][INFO] Epoch:[0/2](828700/4588595) loss:2.741 lr:0.0000100 epoch_Time:23931.0min: [2024-01-06 09:34:14,192][model8_pretrain.py][INFO] Epoch:[0/2](828700/4588595) loss:3.220 lr:0.0000100 epoch_Time:23931.0min: [2024-01-06 09:34:14,192][model8_pretrain.py][INFO] Epoch:[0/2](828700/4588595) loss:3.114 lr:0.0000100 epoch_Time:23931.0min: [2024-01-06 09:34:14,192][model8_pretrain.py][INFO] Epoch:[0/2](828700/4588595) loss:2.512 lr:0.0000100 epoch_Time:23931.0min: [2024-01-06 09:34:14,192][model8_pretrain.py][INFO] Epoch:[0/2](828700/4588595) loss:2.913 lr:0.0000100 epoch_Time:23931.0min: [2024-01-06 09:34:14,192][model8_pretrain.py][INFO] Epoch:[0/2](828700/4588595) loss:2.281 lr:0.0000100 epoch_Time:23931.0min: [2024-01-06 09:34:51,135][model8_pretrain.py][INFO] 
Epoch:[0/2](828800/4588595) loss:2.948 lr:0.0000100 epoch_Time:23929.0min: [2024-01-06 09:34:51,135][model8_pretrain.py][INFO] Epoch:[0/2](828800/4588595) loss:2.990 lr:0.0000100 epoch_Time:23929.0min: [2024-01-06 09:34:51,135][model8_pretrain.py][INFO] Epoch:[0/2](828800/4588595) loss:2.043 lr:0.0000100 epoch_Time:23929.0min: [2024-01-06 09:34:51,135][model8_pretrain.py][INFO] Epoch:[0/2](828800/4588595) loss:2.976 lr:0.0000100 epoch_Time:23929.0min: [2024-01-06 09:34:51,135][model8_pretrain.py][INFO] Epoch:[0/2](828800/4588595) loss:3.135 lr:0.0000100 epoch_Time:23929.0min: [2024-01-06 09:34:51,135][model8_pretrain.py][INFO] Epoch:[0/2](828800/4588595) loss:2.444 lr:0.0000100 epoch_Time:23929.0min: [2024-01-06 09:34:51,135][model8_pretrain.py][INFO] Epoch:[0/2](828800/4588595) loss:2.573 lr:0.0000100 epoch_Time:23929.0min: [2024-01-06 09:34:51,135][model8_pretrain.py][INFO] Epoch:[0/2](828800/4588595) loss:3.048 lr:0.0000100 epoch_Time:23929.0min: [2024-01-06 09:35:28,082][model8_pretrain.py][INFO] Epoch:[0/2](828900/4588595) loss:2.814 lr:0.0000100 epoch_Time:23929.0min: [2024-01-06 09:35:28,082][model8_pretrain.py][INFO] Epoch:[0/2](828900/4588595) loss:2.728 lr:0.0000100 epoch_Time:23929.0min: [2024-01-06 09:35:28,082][model8_pretrain.py][INFO] Epoch:[0/2](828900/4588595) loss:2.950 lr:0.0000100 epoch_Time:23929.0min: [2024-01-06 09:35:28,082][model8_pretrain.py][INFO] Epoch:[0/2](828900/4588595) loss:3.107 lr:0.0000100 epoch_Time:23929.0min: [2024-01-06 09:35:28,082][model8_pretrain.py][INFO] Epoch:[0/2](828900/4588595) loss:2.878 lr:0.0000100 epoch_Time:23929.0min: [2024-01-06 09:35:28,082][model8_pretrain.py][INFO] Epoch:[0/2](828900/4588595) loss:2.768 lr:0.0000100 epoch_Time:23929.0min: [2024-01-06 09:35:28,082][model8_pretrain.py][INFO] Epoch:[0/2](828900/4588595) loss:2.841 lr:0.0000100 epoch_Time:23929.0min: [2024-01-06 09:35:28,082][model8_pretrain.py][INFO] Epoch:[0/2](828900/4588595) loss:3.168 lr:0.0000100 epoch_Time:23929.0min: [2024-01-06 
09:36:05,028][model8_pretrain.py][INFO] Epoch:[0/2](829000/4588595) loss:2.315 lr:0.0000100 epoch_Time:23928.0min: [2024-01-06 09:36:05,028][model8_pretrain.py][INFO] Epoch:[0/2](829000/4588595) loss:2.665 lr:0.0000100 epoch_Time:23928.0min: [2024-01-06 09:36:05,028][model8_pretrain.py][INFO] Epoch:[0/2](829000/4588595) loss:2.264 lr:0.0000100 epoch_Time:23928.0min: [2024-01-06 09:36:05,028][model8_pretrain.py][INFO] Epoch:[0/2](829000/4588595) loss:2.866 lr:0.0000100 epoch_Time:23928.0min: [2024-01-06 09:36:05,028][model8_pretrain.py][INFO] Epoch:[0/2](829000/4588595) loss:2.952 lr:0.0000100 epoch_Time:23928.0min: [2024-01-06 09:36:05,028][model8_pretrain.py][INFO] Epoch:[0/2](829000/4588595) loss:3.196 lr:0.0000100 epoch_Time:23928.0min: [2024-01-06 09:36:05,028][model8_pretrain.py][INFO] Epoch:[0/2](829000/4588595) loss:3.226 lr:0.0000100 epoch_Time:23928.0min: [2024-01-06 09:36:05,028][model8_pretrain.py][INFO] Epoch:[0/2](829000/4588595) loss:2.975 lr:0.0000100 epoch_Time:23928.0min: [2024-01-06 09:36:41,966][model8_pretrain.py][INFO] Epoch:[0/2](829100/4588595) loss:2.541 lr:0.0000100 epoch_Time:23928.0min: [2024-01-06 09:36:41,966][model8_pretrain.py][INFO] Epoch:[0/2](829100/4588595) loss:2.049 lr:0.0000100 epoch_Time:23928.0min: [2024-01-06 09:36:41,966][model8_pretrain.py][INFO] Epoch:[0/2](829100/4588595) loss:3.119 lr:0.0000100 epoch_Time:23928.0min: [2024-01-06 09:36:41,967][model8_pretrain.py][INFO] Epoch:[0/2](829100/4588595) loss:2.619 lr:0.0000100 epoch_Time:23928.0min: [2024-01-06 09:36:41,967][model8_pretrain.py][INFO] Epoch:[0/2](829100/4588595) loss:3.132 lr:0.0000100 epoch_Time:23928.0min: [2024-01-06 09:36:41,967][model8_pretrain.py][INFO] Epoch:[0/2](829100/4588595) loss:3.247 lr:0.0000100 epoch_Time:23928.0min: [2024-01-06 09:36:41,967][model8_pretrain.py][INFO] Epoch:[0/2](829100/4588595) loss:3.185 lr:0.0000100 epoch_Time:23928.0min: [2024-01-06 09:36:41,967][model8_pretrain.py][INFO] Epoch:[0/2](829100/4588595) loss:2.622 lr:0.0000100 
epoch_Time:23928.0min: [2024-01-06 09:37:18,922][model8_pretrain.py][INFO] Epoch:[0/2](829200/4588595) loss:3.180 lr:0.0000100 epoch_Time:23927.0min: [2024-01-06 09:37:18,922][model8_pretrain.py][INFO] Epoch:[0/2](829200/4588595) loss:2.833 lr:0.0000100 epoch_Time:23927.0min: [2024-01-06 09:37:18,922][model8_pretrain.py][INFO] Epoch:[0/2](829200/4588595) loss:2.372 lr:0.0000100 epoch_Time:23927.0min: [2024-01-06 09:37:18,922][model8_pretrain.py][INFO] Epoch:[0/2](829200/4588595) loss:2.468 lr:0.0000100 epoch_Time:23927.0min: [2024-01-06 09:37:18,922][model8_pretrain.py][INFO] Epoch:[0/2](829200/4588595) loss:2.651 lr:0.0000100 epoch_Time:23927.0min: [2024-01-06 09:37:18,922][model8_pretrain.py][INFO] Epoch:[0/2](829200/4588595) loss:2.552 lr:0.0000100 epoch_Time:23927.0min: [2024-01-06 09:37:18,922][model8_pretrain.py][INFO] Epoch:[0/2](829200/4588595) loss:2.699 lr:0.0000100 epoch_Time:23927.0min: [2024-01-06 09:37:18,922][model8_pretrain.py][INFO] Epoch:[0/2](829200/4588595) loss:2.747 lr:0.0000100 epoch_Time:23927.0min: [2024-01-06 09:38:05,842][model8_pretrain.py][INFO] Epoch:[0/2](829300/4588595) loss:2.646 lr:0.0000100 epoch_Time:23927.0min: [2024-01-06 09:38:05,842][model8_pretrain.py][INFO] Epoch:[0/2](829300/4588595) loss:2.158 lr:0.0000100 epoch_Time:23927.0min: [2024-01-06 09:38:05,842][model8_pretrain.py][INFO] Epoch:[0/2](829300/4588595) loss:3.063 lr:0.0000100 epoch_Time:23927.0min: [2024-01-06 09:38:05,842][model8_pretrain.py][INFO] Epoch:[0/2](829300/4588595) loss:2.974 lr:0.0000100 epoch_Time:23927.0min: [2024-01-06 09:38:05,842][model8_pretrain.py][INFO] Epoch:[0/2](829300/4588595) loss:3.393 lr:0.0000100 epoch_Time:23927.0min: [2024-01-06 09:38:05,842][model8_pretrain.py][INFO] Epoch:[0/2](829300/4588595) loss:3.036 lr:0.0000100 epoch_Time:23927.0min: [2024-01-06 09:38:05,842][model8_pretrain.py][INFO] Epoch:[0/2](829300/4588595) loss:3.313 lr:0.0000100 epoch_Time:23927.0min: [2024-01-06 09:38:05,842][model8_pretrain.py][INFO] 
Epoch:[0/2](829300/4588595) loss:2.415 lr:0.0000100 epoch_Time:23927.0min: [2024-01-06 09:38:42,770][model8_pretrain.py][INFO] Epoch:[0/2](829400/4588595) loss:3.090 lr:0.0000100 epoch_Time:23927.0min: [2024-01-06 09:38:42,770][model8_pretrain.py][INFO] Epoch:[0/2](829400/4588595) loss:2.866 lr:0.0000100 epoch_Time:23927.0min: [2024-01-06 09:38:42,770][model8_pretrain.py][INFO] Epoch:[0/2](829400/4588595) loss:2.616 lr:0.0000100 epoch_Time:23927.0min: [2024-01-06 09:38:42,770][model8_pretrain.py][INFO] Epoch:[0/2](829400/4588595) loss:2.945 lr:0.0000100 epoch_Time:23927.0min: [2024-01-06 09:38:42,770][model8_pretrain.py][INFO] Epoch:[0/2](829400/4588595) loss:2.687 lr:0.0000100 epoch_Time:23927.0min: [2024-01-06 09:38:42,770][model8_pretrain.py][INFO] Epoch:[0/2](829400/4588595) loss:2.439 lr:0.0000100 epoch_Time:23927.0min: [2024-01-06 09:38:42,770][model8_pretrain.py][INFO] Epoch:[0/2](829400/4588595) loss:3.010 lr:0.0000100 epoch_Time:23927.0min: [2024-01-06 09:38:42,770][model8_pretrain.py][INFO] Epoch:[0/2](829400/4588595) loss:2.942 lr:0.0000100 epoch_Time:23927.0min: [2024-01-06 09:39:19,701][model8_pretrain.py][INFO] Epoch:[0/2](829500/4588595) loss:2.543 lr:0.0000100 epoch_Time:23926.0min: [2024-01-06 09:39:19,701][model8_pretrain.py][INFO] Epoch:[0/2](829500/4588595) loss:2.192 lr:0.0000100 epoch_Time:23926.0min: [2024-01-06 09:39:19,701][model8_pretrain.py][INFO] Epoch:[0/2](829500/4588595) loss:2.808 lr:0.0000100 epoch_Time:23926.0min: [2024-01-06 09:39:19,701][model8_pretrain.py][INFO] Epoch:[0/2](829500/4588595) loss:2.830 lr:0.0000100 epoch_Time:23926.0min: [2024-01-06 09:39:19,701][model8_pretrain.py][INFO] Epoch:[0/2](829500/4588595) loss:3.017 lr:0.0000100 epoch_Time:23926.0min: [2024-01-06 09:39:19,701][model8_pretrain.py][INFO] Epoch:[0/2](829500/4588595) loss:2.295 lr:0.0000100 epoch_Time:23926.0min: [2024-01-06 09:39:19,701][model8_pretrain.py][INFO] Epoch:[0/2](829500/4588595) loss:3.078 lr:0.0000100 epoch_Time:23926.0min: [2024-01-06 
09:39:19,701][model8_pretrain.py][INFO] Epoch:[0/2](829500/4588595) loss:2.719 lr:0.0000100 epoch_Time:23926.0min: [2024-01-06 09:39:56,653][model8_pretrain.py][INFO] Epoch:[0/2](829600/4588595) loss:2.731 lr:0.0000100 epoch_Time:23924.0min: [2024-01-06 09:39:56,653][model8_pretrain.py][INFO] Epoch:[0/2](829600/4588595) loss:2.720 lr:0.0000100 epoch_Time:23924.0min: [2024-01-06 09:39:56,653][model8_pretrain.py][INFO] Epoch:[0/2](829600/4588595) loss:2.808 lr:0.0000100 epoch_Time:23924.0min: [2024-01-06 09:39:56,653][model8_pretrain.py][INFO] Epoch:[0/2](829600/4588595) loss:2.211 lr:0.0000100 epoch_Time:23924.0min: [2024-01-06 09:39:56,653][model8_pretrain.py][INFO] Epoch:[0/2](829600/4588595) loss:2.897 lr:0.0000100 epoch_Time:23924.0min: [2024-01-06 09:39:56,653][model8_pretrain.py][INFO] Epoch:[0/2](829600/4588595) loss:2.611 lr:0.0000100 epoch_Time:23924.0min: [2024-01-06 09:39:56,653][model8_pretrain.py][INFO] Epoch:[0/2](829600/4588595) loss:3.078 lr:0.0000100 epoch_Time:23924.0min: [2024-01-06 09:39:56,654][model8_pretrain.py][INFO] Epoch:[0/2](829600/4588595) loss:2.951 lr:0.0000100 epoch_Time:23924.0min: [2024-01-06 09:40:33,607][model8_pretrain.py][INFO] Epoch:[0/2](829700/4588595) loss:3.140 lr:0.0000100 epoch_Time:23924.0min: [2024-01-06 09:40:33,607][model8_pretrain.py][INFO] Epoch:[0/2](829700/4588595) loss:2.899 lr:0.0000100 epoch_Time:23924.0min: [2024-01-06 09:40:33,607][model8_pretrain.py][INFO] Epoch:[0/2](829700/4588595) loss:2.339 lr:0.0000100 epoch_Time:23924.0min: [2024-01-06 09:40:33,607][model8_pretrain.py][INFO] Epoch:[0/2](829700/4588595) loss:3.142 lr:0.0000100 epoch_Time:23924.0min: [2024-01-06 09:40:33,607][model8_pretrain.py][INFO] Epoch:[0/2](829700/4588595) loss:2.592 lr:0.0000100 epoch_Time:23924.0min: [2024-01-06 09:40:33,607][model8_pretrain.py][INFO] Epoch:[0/2](829700/4588595) loss:3.048 lr:0.0000100 epoch_Time:23924.0min: [2024-01-06 09:40:33,607][model8_pretrain.py][INFO] Epoch:[0/2](829700/4588595) loss:2.855 lr:0.0000100 
epoch_Time:23924.0min: [2024-01-06 09:40:33,607][model8_pretrain.py][INFO] Epoch:[0/2](829700/4588595) loss:3.072 lr:0.0000100 epoch_Time:23924.0min: [2024-01-06 09:41:10,556][model8_pretrain.py][INFO] Epoch:[0/2](829800/4588595) loss:2.775 lr:0.0000100 epoch_Time:23923.0min: [2024-01-06 09:41:10,556][model8_pretrain.py][INFO] Epoch:[0/2](829800/4588595) loss:2.448 lr:0.0000100 epoch_Time:23923.0min: [2024-01-06 09:41:10,556][model8_pretrain.py][INFO] Epoch:[0/2](829800/4588595) loss:3.015 lr:0.0000100 epoch_Time:23923.0min: [2024-01-06 09:41:10,556][model8_pretrain.py][INFO] Epoch:[0/2](829800/4588595) loss:2.724 lr:0.0000100 epoch_Time:23923.0min: [2024-01-06 09:41:10,556][model8_pretrain.py][INFO] Epoch:[0/2](829800/4588595) loss:1.688 lr:0.0000100 epoch_Time:23923.0min: [2024-01-06 09:41:10,556][model8_pretrain.py][INFO] Epoch:[0/2](829800/4588595) loss:3.362 lr:0.0000100 epoch_Time:23923.0min: [2024-01-06 09:41:10,556][model8_pretrain.py][INFO] Epoch:[0/2](829800/4588595) loss:2.996 lr:0.0000100 epoch_Time:23923.0min: [2024-01-06 09:41:10,556][model8_pretrain.py][INFO] Epoch:[0/2](829800/4588595) loss:3.016 lr:0.0000100 epoch_Time:23923.0min: [2024-01-06 09:41:47,500][model8_pretrain.py][INFO] Epoch:[0/2](829900/4588595) loss:2.546 lr:0.0000100 epoch_Time:23923.0min: [2024-01-06 09:41:47,500][model8_pretrain.py][INFO] Epoch:[0/2](829900/4588595) loss:2.404 lr:0.0000100 epoch_Time:23923.0min: [2024-01-06 09:41:47,500][model8_pretrain.py][INFO] Epoch:[0/2](829900/4588595) loss:3.094 lr:0.0000100 epoch_Time:23923.0min: [2024-01-06 09:41:47,500][model8_pretrain.py][INFO] Epoch:[0/2](829900/4588595) loss:2.887 lr:0.0000100 epoch_Time:23923.0min: [2024-01-06 09:41:47,500][model8_pretrain.py][INFO] Epoch:[0/2](829900/4588595) loss:2.306 lr:0.0000100 epoch_Time:23923.0min: [2024-01-06 09:41:47,500][model8_pretrain.py][INFO] Epoch:[0/2](829900/4588595) loss:3.102 lr:0.0000100 epoch_Time:23923.0min: [2024-01-06 09:41:47,500][model8_pretrain.py][INFO] 
Epoch:[0/2](829900/4588595) loss:3.068 lr:0.0000100 epoch_Time:23923.0min: [2024-01-06 09:41:47,500][model8_pretrain.py][INFO] Epoch:[0/2](829900/4588595) loss:2.770 lr:0.0000100 epoch_Time:23923.0min: [2024-01-06 09:42:24,448][model8_pretrain.py][INFO] Epoch:[0/2](830000/4588595) loss:2.896 lr:0.0000100 epoch_Time:23922.0min: [2024-01-06 09:42:24,448][model8_pretrain.py][INFO] Epoch:[0/2](830000/4588595) loss:2.797 lr:0.0000100 epoch_Time:23922.0min: [2024-01-06 09:42:24,448][model8_pretrain.py][INFO] Epoch:[0/2](830000/4588595) loss:2.603 lr:0.0000100 epoch_Time:23922.0min: [2024-01-06 09:42:24,448][model8_pretrain.py][INFO] Epoch:[0/2](830000/4588595) loss:2.183 lr:0.0000100 epoch_Time:23922.0min: [2024-01-06 09:42:24,448][model8_pretrain.py][INFO] Epoch:[0/2](830000/4588595) loss:2.646 lr:0.0000100 epoch_Time:23922.0min: [2024-01-06 09:42:24,448][model8_pretrain.py][INFO] Epoch:[0/2](830000/4588595) loss:2.797 lr:0.0000100 epoch_Time:23922.0min: [2024-01-06 09:42:24,449][model8_pretrain.py][INFO] Epoch:[0/2](830000/4588595) loss:3.272 lr:0.0000100 epoch_Time:23922.0min: [2024-01-06 09:42:24,449][model8_pretrain.py][INFO] Epoch:[0/2](830000/4588595) loss:2.539 lr:0.0000100 epoch_Time:23922.0min: [2024-01-06 09:43:11,498][model8_pretrain.py][INFO] Epoch:[0/2](830100/4588595) loss:2.297 lr:0.0000100 epoch_Time:23922.0min: [2024-01-06 09:43:11,498][model8_pretrain.py][INFO] Epoch:[0/2](830100/4588595) loss:2.780 lr:0.0000100 epoch_Time:23922.0min: [2024-01-06 09:43:11,498][model8_pretrain.py][INFO] Epoch:[0/2](830100/4588595) loss:2.521 lr:0.0000100 epoch_Time:23922.0min: [2024-01-06 09:43:11,498][model8_pretrain.py][INFO] Epoch:[0/2](830100/4588595) loss:2.619 lr:0.0000100 epoch_Time:23922.0min: [2024-01-06 09:43:11,498][model8_pretrain.py][INFO] Epoch:[0/2](830100/4588595) loss:3.062 lr:0.0000100 epoch_Time:23922.0min: [2024-01-06 09:43:11,498][model8_pretrain.py][INFO] Epoch:[0/2](830100/4588595) loss:3.139 lr:0.0000100 epoch_Time:23922.0min: [2024-01-06 
09:43:11,498][model8_pretrain.py][INFO] Epoch:[0/2](830100/4588595) loss:2.954 lr:0.0000100 epoch_Time:23922.0min: [2024-01-06 09:43:11,498][model8_pretrain.py][INFO] Epoch:[0/2](830100/4588595) loss:2.840 lr:0.0000100 epoch_Time:23922.0min: [2024-01-06 09:43:48,431][model8_pretrain.py][INFO] Epoch:[0/2](830200/4588595) loss:3.132 lr:0.0000100 epoch_Time:23921.0min: [2024-01-06 09:43:48,431][model8_pretrain.py][INFO] Epoch:[0/2](830200/4588595) loss:2.880 lr:0.0000100 epoch_Time:23921.0min: [2024-01-06 09:43:48,431][model8_pretrain.py][INFO] Epoch:[0/2](830200/4588595) loss:3.236 lr:0.0000100 epoch_Time:23921.0min: [2024-01-06 09:43:48,431][model8_pretrain.py][INFO] Epoch:[0/2](830200/4588595) loss:3.204 lr:0.0000100 epoch_Time:23921.0min: [2024-01-06 09:43:48,431][model8_pretrain.py][INFO] Epoch:[0/2](830200/4588595) loss:2.338 lr:0.0000100 epoch_Time:23921.0min: [2024-01-06 09:43:48,431][model8_pretrain.py][INFO] Epoch:[0/2](830200/4588595) loss:2.629 lr:0.0000100 epoch_Time:23921.0min: [2024-01-06 09:43:48,431][model8_pretrain.py][INFO] Epoch:[0/2](830200/4588595) loss:2.456 lr:0.0000100 epoch_Time:23921.0min: [2024-01-06 09:43:48,431][model8_pretrain.py][INFO] Epoch:[0/2](830200/4588595) loss:2.661 lr:0.0000100 epoch_Time:23921.0min: [2024-01-06 09:44:25,362][model8_pretrain.py][INFO] Epoch:[0/2](830300/4588595) loss:2.341 lr:0.0000100 epoch_Time:23921.0min: [2024-01-06 09:44:25,362][model8_pretrain.py][INFO] Epoch:[0/2](830300/4588595) loss:2.705 lr:0.0000100 epoch_Time:23921.0min: [2024-01-06 09:44:25,362][model8_pretrain.py][INFO] Epoch:[0/2](830300/4588595) loss:2.776 lr:0.0000100 epoch_Time:23921.0min: [2024-01-06 09:44:25,362][model8_pretrain.py][INFO] Epoch:[0/2](830300/4588595) loss:3.049 lr:0.0000100 epoch_Time:23921.0min: [2024-01-06 09:44:25,362][model8_pretrain.py][INFO] Epoch:[0/2](830300/4588595) loss:2.997 lr:0.0000100 epoch_Time:23921.0min: [2024-01-06 09:44:25,362][model8_pretrain.py][INFO] Epoch:[0/2](830300/4588595) loss:2.936 lr:0.0000100 
epoch_Time:23921.0min: [2024-01-06 09:44:25,363][model8_pretrain.py][INFO] Epoch:[0/2](830300/4588595) loss:2.828 lr:0.0000100 epoch_Time:23921.0min: [2024-01-06 09:44:25,363][model8_pretrain.py][INFO] Epoch:[0/2](830300/4588595) loss:2.787 lr:0.0000100 epoch_Time:23921.0min: [2024-01-06 09:45:02,316][model8_pretrain.py][INFO] Epoch:[0/2](830400/4588595) loss:2.881 lr:0.0000100 epoch_Time:23919.0min: [2024-01-06 09:45:02,317][model8_pretrain.py][INFO] Epoch:[0/2](830400/4588595) loss:2.735 lr:0.0000100 epoch_Time:23919.0min: [2024-01-06 09:45:02,317][model8_pretrain.py][INFO] Epoch:[0/2](830400/4588595) loss:3.065 lr:0.0000100 epoch_Time:23919.0min: [2024-01-06 09:45:02,316][model8_pretrain.py][INFO] Epoch:[0/2](830400/4588595) loss:2.494 lr:0.0000100 epoch_Time:23919.0min: [2024-01-06 09:45:02,317][model8_pretrain.py][INFO] Epoch:[0/2](830400/4588595) loss:3.045 lr:0.0000100 epoch_Time:23919.0min: [2024-01-06 09:45:02,317][model8_pretrain.py][INFO] Epoch:[0/2](830400/4588595) loss:2.266 lr:0.0000100 epoch_Time:23919.0min: [2024-01-06 09:45:02,317][model8_pretrain.py][INFO] Epoch:[0/2](830400/4588595) loss:3.293 lr:0.0000100 epoch_Time:23919.0min: [2024-01-06 09:45:02,317][model8_pretrain.py][INFO] Epoch:[0/2](830400/4588595) loss:2.924 lr:0.0000100 epoch_Time:23919.0min: [2024-01-06 09:45:39,306][model8_pretrain.py][INFO] Epoch:[0/2](830500/4588595) loss:2.660 lr:0.0000100 epoch_Time:23919.0min: [2024-01-06 09:45:39,306][model8_pretrain.py][INFO] Epoch:[0/2](830500/4588595) loss:2.909 lr:0.0000100 epoch_Time:23919.0min: [2024-01-06 09:45:39,306][model8_pretrain.py][INFO] Epoch:[0/2](830500/4588595) loss:3.126 lr:0.0000100 epoch_Time:23919.0min: [2024-01-06 09:45:39,306][model8_pretrain.py][INFO] Epoch:[0/2](830500/4588595) loss:2.992 lr:0.0000100 epoch_Time:23919.0min: [2024-01-06 09:45:39,306][model8_pretrain.py][INFO] Epoch:[0/2](830500/4588595) loss:3.221 lr:0.0000100 epoch_Time:23919.0min: [2024-01-06 09:45:39,306][model8_pretrain.py][INFO] 
Epoch:[0/2](830500/4588595) loss:3.101 lr:0.0000100 epoch_Time:23919.0min: [2024-01-06 09:45:39,306][model8_pretrain.py][INFO] Epoch:[0/2](830500/4588595) loss:2.878 lr:0.0000100 epoch_Time:23919.0min: [2024-01-06 09:45:39,307][model8_pretrain.py][INFO] Epoch:[0/2](830500/4588595) loss:3.105 lr:0.0000100 epoch_Time:23919.0min: [2024-01-06 09:46:16,277][model8_pretrain.py][INFO] Epoch:[0/2](830600/4588595) loss:2.872 lr:0.0000100 epoch_Time:23918.0min: [2024-01-06 09:46:16,277][model8_pretrain.py][INFO] Epoch:[0/2](830600/4588595) loss:2.748 lr:0.0000100 epoch_Time:23918.0min: [2024-01-06 09:46:16,277][model8_pretrain.py][INFO] Epoch:[0/2](830600/4588595) loss:2.526 lr:0.0000100 epoch_Time:23918.0min: [2024-01-06 09:46:16,277][model8_pretrain.py][INFO] Epoch:[0/2](830600/4588595) loss:2.758 lr:0.0000100 epoch_Time:23918.0min: [2024-01-06 09:46:16,277][model8_pretrain.py][INFO] Epoch:[0/2](830600/4588595) loss:2.782 lr:0.0000100 epoch_Time:23918.0min: [2024-01-06 09:46:16,277][model8_pretrain.py][INFO] Epoch:[0/2](830600/4588595) loss:2.292 lr:0.0000100 epoch_Time:23918.0min: [2024-01-06 09:46:16,277][model8_pretrain.py][INFO] Epoch:[0/2](830600/4588595) loss:3.077 lr:0.0000100 epoch_Time:23918.0min: [2024-01-06 09:46:16,278][model8_pretrain.py][INFO] Epoch:[0/2](830600/4588595) loss:2.911 lr:0.0000100 epoch_Time:23918.0min: [2024-01-06 09:46:53,230][model8_pretrain.py][INFO] Epoch:[0/2](830700/4588595) loss:2.573 lr:0.0000100 epoch_Time:23917.0min: [2024-01-06 09:46:53,230][model8_pretrain.py][INFO] Epoch:[0/2](830700/4588595) loss:2.562 lr:0.0000100 epoch_Time:23917.0min: [2024-01-06 09:46:53,230][model8_pretrain.py][INFO] Epoch:[0/2](830700/4588595) loss:2.573 lr:0.0000100 epoch_Time:23917.0min: [2024-01-06 09:46:53,230][model8_pretrain.py][INFO] Epoch:[0/2](830700/4588595) loss:2.490 lr:0.0000100 epoch_Time:23917.0min: [2024-01-06 09:46:53,230][model8_pretrain.py][INFO] Epoch:[0/2](830700/4588595) loss:2.985 lr:0.0000100 epoch_Time:23917.0min: [2024-01-06 
09:46:53,230][model8_pretrain.py][INFO] Epoch:[0/2](830700/4588595) loss:2.682 lr:0.0000100 epoch_Time:23917.0min: [2024-01-06 09:46:53,230][model8_pretrain.py][INFO] Epoch:[0/2](830700/4588595) loss:3.150 lr:0.0000100 epoch_Time:23917.0min: [2024-01-06 09:46:53,230][model8_pretrain.py][INFO] Epoch:[0/2](830700/4588595) loss:2.704 lr:0.0000100 epoch_Time:23917.0min: [2024-01-06 09:47:30,176][model8_pretrain.py][INFO] Epoch:[0/2](830800/4588595) loss:3.113 lr:0.0000100 epoch_Time:23917.0min: [2024-01-06 09:47:30,176][model8_pretrain.py][INFO] Epoch:[0/2](830800/4588595) loss:3.115 lr:0.0000100 epoch_Time:23917.0min: [2024-01-06 09:47:30,176][model8_pretrain.py][INFO] Epoch:[0/2](830800/4588595) loss:2.785 lr:0.0000100 epoch_Time:23917.0min: [2024-01-06 09:47:30,176][model8_pretrain.py][INFO] Epoch:[0/2](830800/4588595) loss:2.097 lr:0.0000100 epoch_Time:23917.0min: [2024-01-06 09:47:30,176][model8_pretrain.py][INFO] Epoch:[0/2](830800/4588595) loss:3.026 lr:0.0000100 epoch_Time:23917.0min: [2024-01-06 09:47:30,176][model8_pretrain.py][INFO] Epoch:[0/2](830800/4588595) loss:2.999 lr:0.0000100 epoch_Time:23917.0min: [2024-01-06 09:47:30,176][model8_pretrain.py][INFO] Epoch:[0/2](830800/4588595) loss:2.445 lr:0.0000100 epoch_Time:23917.0min: [2024-01-06 09:47:30,176][model8_pretrain.py][INFO] Epoch:[0/2](830800/4588595) loss:2.645 lr:0.0000100 epoch_Time:23917.0min: [2024-01-06 09:48:17,653][model8_pretrain.py][INFO] Epoch:[0/2](830900/4588595) loss:2.466 lr:0.0000100 epoch_Time:23917.0min: [2024-01-06 09:48:17,653][model8_pretrain.py][INFO] Epoch:[0/2](830900/4588595) loss:2.408 lr:0.0000100 epoch_Time:23917.0min: [2024-01-06 09:48:17,653][model8_pretrain.py][INFO] Epoch:[0/2](830900/4588595) loss:2.737 lr:0.0000100 epoch_Time:23917.0min: [2024-01-06 09:48:17,653][model8_pretrain.py][INFO] Epoch:[0/2](830900/4588595) loss:2.704 lr:0.0000100 epoch_Time:23917.0min: [2024-01-06 09:48:17,653][model8_pretrain.py][INFO] Epoch:[0/2](830900/4588595) loss:2.999 lr:0.0000100 
epoch_Time:23917.0min: [2024-01-06 09:48:17,653][model8_pretrain.py][INFO] Epoch:[0/2](830900/4588595) loss:2.544 lr:0.0000100 epoch_Time:23917.0min: [2024-01-06 09:48:17,653][model8_pretrain.py][INFO] Epoch:[0/2](830900/4588595) loss:2.621 lr:0.0000100 epoch_Time:23917.0min: [2024-01-06 09:48:17,653][model8_pretrain.py][INFO] Epoch:[0/2](830900/4588595) loss:3.337 lr:0.0000100 epoch_Time:23917.0min: [2024-01-06 09:48:54,583][model8_pretrain.py][INFO] Epoch:[0/2](831000/4588595) loss:2.602 lr:0.0000100 epoch_Time:23916.0min: [2024-01-06 09:48:54,583][model8_pretrain.py][INFO] Epoch:[0/2](831000/4588595) loss:3.030 lr:0.0000100 epoch_Time:23916.0min: [2024-01-06 09:48:54,583][model8_pretrain.py][INFO] Epoch:[0/2](831000/4588595) loss:2.622 lr:0.0000100 epoch_Time:23916.0min: [2024-01-06 09:48:54,583][model8_pretrain.py][INFO] Epoch:[0/2](831000/4588595) loss:2.556 lr:0.0000100 epoch_Time:23916.0min: [2024-01-06 09:48:54,583][model8_pretrain.py][INFO] Epoch:[0/2](831000/4588595) loss:3.012 lr:0.0000100 epoch_Time:23916.0min: [2024-01-06 09:48:54,583][model8_pretrain.py][INFO] Epoch:[0/2](831000/4588595) loss:2.919 lr:0.0000100 epoch_Time:23916.0min: [2024-01-06 09:48:54,583][model8_pretrain.py][INFO] Epoch:[0/2](831000/4588595) loss:2.192 lr:0.0000100 epoch_Time:23916.0min: [2024-01-06 09:48:54,583][model8_pretrain.py][INFO] Epoch:[0/2](831000/4588595) loss:3.015 lr:0.0000100 epoch_Time:23916.0min: [2024-01-06 09:49:31,489][model8_pretrain.py][INFO] Epoch:[0/2](831100/4588595) loss:3.058 lr:0.0000100 epoch_Time:23916.0min: [2024-01-06 09:49:31,489][model8_pretrain.py][INFO] Epoch:[0/2](831100/4588595) loss:3.045 lr:0.0000100 epoch_Time:23916.0min: [2024-01-06 09:49:31,489][model8_pretrain.py][INFO] Epoch:[0/2](831100/4588595) loss:3.487 lr:0.0000100 epoch_Time:23916.0min: [2024-01-06 09:49:31,489][model8_pretrain.py][INFO] Epoch:[0/2](831100/4588595) loss:2.961 lr:0.0000100 epoch_Time:23916.0min: [2024-01-06 09:49:31,489][model8_pretrain.py][INFO] 
Epoch:[0/2](831100/4588595) loss:3.242 lr:0.0000100 epoch_Time:23916.0min: [2024-01-06 09:49:31,489][model8_pretrain.py][INFO] Epoch:[0/2](831100/4588595) loss:2.674 lr:0.0000100 epoch_Time:23916.0min: [2024-01-06 09:49:31,489][model8_pretrain.py][INFO] Epoch:[0/2](831100/4588595) loss:2.505 lr:0.0000100 epoch_Time:23916.0min: [2024-01-06 09:49:31,489][model8_pretrain.py][INFO] Epoch:[0/2](831100/4588595) loss:2.577 lr:0.0000100 epoch_Time:23916.0min: [2024-01-06 09:50:08,448][model8_pretrain.py][INFO] Epoch:[0/2](831200/4588595) loss:3.195 lr:0.0000100 epoch_Time:23915.0min: [2024-01-06 09:50:08,448][model8_pretrain.py][INFO] Epoch:[0/2](831200/4588595) loss:3.160 lr:0.0000100 epoch_Time:23915.0min: [2024-01-06 09:50:08,448][model8_pretrain.py][INFO] Epoch:[0/2](831200/4588595) loss:2.965 lr:0.0000100 epoch_Time:23915.0min: [2024-01-06 09:50:08,448][model8_pretrain.py][INFO] Epoch:[0/2](831200/4588595) loss:2.762 lr:0.0000100 epoch_Time:23915.0min: [2024-01-06 09:50:08,448][model8_pretrain.py][INFO] Epoch:[0/2](831200/4588595) loss:2.883 lr:0.0000100 epoch_Time:23915.0min: [2024-01-06 09:50:08,448][model8_pretrain.py][INFO] Epoch:[0/2](831200/4588595) loss:2.762 lr:0.0000100 epoch_Time:23915.0min: [2024-01-06 09:50:08,448][model8_pretrain.py][INFO] Epoch:[0/2](831200/4588595) loss:3.096 lr:0.0000100 epoch_Time:23915.0min: [2024-01-06 09:50:08,448][model8_pretrain.py][INFO] Epoch:[0/2](831200/4588595) loss:2.957 lr:0.0000100 epoch_Time:23915.0min: [2024-01-06 09:50:45,386][model8_pretrain.py][INFO] Epoch:[0/2](831300/4588595) loss:2.593 lr:0.0000100 epoch_Time:23914.0min: [2024-01-06 09:50:45,386][model8_pretrain.py][INFO] Epoch:[0/2](831300/4588595) loss:2.870 lr:0.0000100 epoch_Time:23914.0min: [2024-01-06 09:50:45,386][model8_pretrain.py][INFO] Epoch:[0/2](831300/4588595) loss:2.644 lr:0.0000100 epoch_Time:23914.0min: [2024-01-06 09:50:45,386][model8_pretrain.py][INFO] Epoch:[0/2](831300/4588595) loss:2.408 lr:0.0000100 epoch_Time:23914.0min: [2024-01-06 
09:50:45,386][model8_pretrain.py][INFO] Epoch:[0/2](831300/4588595) loss:2.947 lr:0.0000100 epoch_Time:23914.0min: [2024-01-06 09:50:45,386][model8_pretrain.py][INFO] Epoch:[0/2](831300/4588595) loss:2.757 lr:0.0000100 epoch_Time:23914.0min: [2024-01-06 09:50:45,386][model8_pretrain.py][INFO] Epoch:[0/2](831300/4588595) loss:2.824 lr:0.0000100 epoch_Time:23914.0min: [2024-01-06 09:50:45,386][model8_pretrain.py][INFO] Epoch:[0/2](831300/4588595) loss:3.288 lr:0.0000100 epoch_Time:23914.0min: [2024-01-06 09:51:22,320][model8_pretrain.py][INFO] Epoch:[0/2](831400/4588595) loss:2.972 lr:0.0000100 epoch_Time:23913.0min: [2024-01-06 09:51:22,320][model8_pretrain.py][INFO] Epoch:[0/2](831400/4588595) loss:2.307 lr:0.0000100 epoch_Time:23913.0min: [2024-01-06 09:51:22,320][model8_pretrain.py][INFO] Epoch:[0/2](831400/4588595) loss:2.912 lr:0.0000100 epoch_Time:23913.0min: [2024-01-06 09:51:22,320][model8_pretrain.py][INFO] Epoch:[0/2](831400/4588595) loss:2.845 lr:0.0000100 epoch_Time:23913.0min: [2024-01-06 09:51:22,320][model8_pretrain.py][INFO] Epoch:[0/2](831400/4588595) loss:2.445 lr:0.0000100 epoch_Time:23913.0min: [2024-01-06 09:51:22,320][model8_pretrain.py][INFO] Epoch:[0/2](831400/4588595) loss:2.612 lr:0.0000100 epoch_Time:23913.0min: [2024-01-06 09:51:22,321][model8_pretrain.py][INFO] Epoch:[0/2](831400/4588595) loss:2.642 lr:0.0000100 epoch_Time:23913.0min: [2024-01-06 09:51:22,321][model8_pretrain.py][INFO] Epoch:[0/2](831400/4588595) loss:2.732 lr:0.0000100 epoch_Time:23913.0min: [2024-01-06 09:51:59,264][model8_pretrain.py][INFO] Epoch:[0/2](831500/4588595) loss:2.149 lr:0.0000100 epoch_Time:23912.0min: [2024-01-06 09:51:59,264][model8_pretrain.py][INFO] Epoch:[0/2](831500/4588595) loss:2.668 lr:0.0000100 epoch_Time:23912.0min: [2024-01-06 09:51:59,264][model8_pretrain.py][INFO] Epoch:[0/2](831500/4588595) loss:3.053 lr:0.0000100 epoch_Time:23912.0min: [2024-01-06 09:51:59,264][model8_pretrain.py][INFO] Epoch:[0/2](831500/4588595) loss:2.810 lr:0.0000100 
epoch_Time:23912.0min: [2024-01-06 09:51:59,264][model8_pretrain.py][INFO] Epoch:[0/2](831500/4588595) loss:2.974 lr:0.0000100 epoch_Time:23912.0min: [2024-01-06 09:51:59,264][model8_pretrain.py][INFO] Epoch:[0/2](831500/4588595) loss:2.236 lr:0.0000100 epoch_Time:23912.0min: [2024-01-06 09:51:59,265][model8_pretrain.py][INFO] Epoch:[0/2](831500/4588595) loss:2.822 lr:0.0000100 epoch_Time:23912.0min: [2024-01-06 09:51:59,265][model8_pretrain.py][INFO] Epoch:[0/2](831500/4588595) loss:3.093 lr:0.0000100 epoch_Time:23912.0min: [2024-01-06 09:52:36,224][model8_pretrain.py][INFO] Epoch:[0/2](831600/4588595) loss:3.262 lr:0.0000100 epoch_Time:23912.0min: [2024-01-06 09:52:36,225][model8_pretrain.py][INFO] Epoch:[0/2](831600/4588595) loss:2.893 lr:0.0000100 epoch_Time:23912.0min: [2024-01-06 09:52:36,225][model8_pretrain.py][INFO] Epoch:[0/2](831600/4588595) loss:3.010 lr:0.0000100 epoch_Time:23912.0min: [2024-01-06 09:52:36,225][model8_pretrain.py][INFO] Epoch:[0/2](831600/4588595) loss:2.900 lr:0.0000100 epoch_Time:23912.0min: [2024-01-06 09:52:36,225][model8_pretrain.py][INFO] Epoch:[0/2](831600/4588595) loss:2.889 lr:0.0000100 epoch_Time:23912.0min: [2024-01-06 09:52:36,225][model8_pretrain.py][INFO] Epoch:[0/2](831600/4588595) loss:3.244 lr:0.0000100 epoch_Time:23912.0min: [2024-01-06 09:52:36,225][model8_pretrain.py][INFO] Epoch:[0/2](831600/4588595) loss:2.833 lr:0.0000100 epoch_Time:23912.0min: [2024-01-06 09:52:36,225][model8_pretrain.py][INFO] Epoch:[0/2](831600/4588595) loss:2.739 lr:0.0000100 epoch_Time:23912.0min: [2024-01-06 09:53:21,666][model8_pretrain.py][INFO] Epoch:[0/2](831700/4588595) loss:2.598 lr:0.0000100 epoch_Time:23912.0min: [2024-01-06 09:53:21,666][model8_pretrain.py][INFO] Epoch:[0/2](831700/4588595) loss:2.515 lr:0.0000100 epoch_Time:23912.0min: [2024-01-06 09:53:21,666][model8_pretrain.py][INFO] Epoch:[0/2](831700/4588595) loss:3.076 lr:0.0000100 epoch_Time:23912.0min: [2024-01-06 09:53:21,666][model8_pretrain.py][INFO] 
Epoch:[0/2](831700/4588595) loss:3.063 lr:0.0000100 epoch_Time:23912.0min: [2024-01-06 09:53:21,666][model8_pretrain.py][INFO] Epoch:[0/2](831700/4588595) loss:2.049 lr:0.0000100 epoch_Time:23912.0min: [2024-01-06 09:53:21,666][model8_pretrain.py][INFO] Epoch:[0/2](831700/4588595) loss:2.698 lr:0.0000100 epoch_Time:23912.0min: [2024-01-06 09:53:21,666][model8_pretrain.py][INFO] Epoch:[0/2](831700/4588595) loss:3.085 lr:0.0000100 epoch_Time:23912.0min: [2024-01-06 09:53:21,666][model8_pretrain.py][INFO] Epoch:[0/2](831700/4588595) loss:2.985 lr:0.0000100 epoch_Time:23912.0min: [2024-01-06 09:54:00,285][model8_pretrain.py][INFO] Epoch:[0/2](831800/4588595) loss:2.055 lr:0.0000100 epoch_Time:23911.0min: [2024-01-06 09:54:00,285][model8_pretrain.py][INFO] Epoch:[0/2](831800/4588595) loss:3.012 lr:0.0000100 epoch_Time:23911.0min: [2024-01-06 09:54:00,285][model8_pretrain.py][INFO] Epoch:[0/2](831800/4588595) loss:2.881 lr:0.0000100 epoch_Time:23911.0min: [2024-01-06 09:54:00,285][model8_pretrain.py][INFO] Epoch:[0/2](831800/4588595) loss:2.550 lr:0.0000100 epoch_Time:23911.0min: [2024-01-06 09:54:00,285][model8_pretrain.py][INFO] Epoch:[0/2](831800/4588595) loss:3.215 lr:0.0000100 epoch_Time:23911.0min: [2024-01-06 09:54:00,285][model8_pretrain.py][INFO] Epoch:[0/2](831800/4588595) loss:2.903 lr:0.0000100 epoch_Time:23911.0min: [2024-01-06 09:54:00,285][model8_pretrain.py][INFO] Epoch:[0/2](831800/4588595) loss:2.891 lr:0.0000100 epoch_Time:23911.0min: [2024-01-06 09:54:00,285][model8_pretrain.py][INFO] Epoch:[0/2](831800/4588595) loss:2.984 lr:0.0000100 epoch_Time:23911.0min: [2024-01-06 09:54:37,272][model8_pretrain.py][INFO] Epoch:[0/2](831900/4588595) loss:3.486 lr:0.0000100 epoch_Time:23911.0min: [2024-01-06 09:54:37,272][model8_pretrain.py][INFO] Epoch:[0/2](831900/4588595) loss:3.172 lr:0.0000100 epoch_Time:23911.0min: [2024-01-06 09:54:37,272][model8_pretrain.py][INFO] Epoch:[0/2](831900/4588595) loss:2.563 lr:0.0000100 epoch_Time:23911.0min: [2024-01-06 
09:54:37,272][model8_pretrain.py][INFO] Epoch:[0/2](831900/4588595) loss:3.048 lr:0.0000100 epoch_Time:23911.0min: [2024-01-06 09:54:37,272][model8_pretrain.py][INFO] Epoch:[0/2](831900/4588595) loss:2.299 lr:0.0000100 epoch_Time:23911.0min: [2024-01-06 09:54:37,272][model8_pretrain.py][INFO] Epoch:[0/2](831900/4588595) loss:2.798 lr:0.0000100 epoch_Time:23911.0min: [2024-01-06 09:54:37,272][model8_pretrain.py][INFO] Epoch:[0/2](831900/4588595) loss:2.883 lr:0.0000100 epoch_Time:23911.0min: [2024-01-06 09:54:37,272][model8_pretrain.py][INFO] Epoch:[0/2](831900/4588595) loss:3.062 lr:0.0000100 epoch_Time:23911.0min: [2024-01-06 09:55:14,208][model8_pretrain.py][INFO] Epoch:[0/2](832000/4588595) loss:3.270 lr:0.0000100 epoch_Time:23910.0min: [2024-01-06 09:55:14,208][model8_pretrain.py][INFO] Epoch:[0/2](832000/4588595) loss:2.799 lr:0.0000100 epoch_Time:23910.0min: [2024-01-06 09:55:14,208][model8_pretrain.py][INFO] Epoch:[0/2](832000/4588595) loss:2.808 lr:0.0000100 epoch_Time:23910.0min: [2024-01-06 09:55:14,208][model8_pretrain.py][INFO] Epoch:[0/2](832000/4588595) loss:3.005 lr:0.0000100 epoch_Time:23910.0min: [2024-01-06 09:55:14,208][model8_pretrain.py][INFO] Epoch:[0/2](832000/4588595) loss:2.714 lr:0.0000100 epoch_Time:23910.0min: [2024-01-06 09:55:14,208][model8_pretrain.py][INFO] Epoch:[0/2](832000/4588595) loss:3.225 lr:0.0000100 epoch_Time:23910.0min: [2024-01-06 09:55:14,208][model8_pretrain.py][INFO] Epoch:[0/2](832000/4588595) loss:2.739 lr:0.0000100 epoch_Time:23910.0min: [2024-01-06 09:55:14,208][model8_pretrain.py][INFO] Epoch:[0/2](832000/4588595) loss:2.442 lr:0.0000100 epoch_Time:23910.0min: [2024-01-06 09:55:51,153][model8_pretrain.py][INFO] Epoch:[0/2](832100/4588595) loss:2.997 lr:0.0000100 epoch_Time:23908.0min: [2024-01-06 09:55:51,153][model8_pretrain.py][INFO] Epoch:[0/2](832100/4588595) loss:2.011 lr:0.0000100 epoch_Time:23908.0min: [2024-01-06 09:55:51,153][model8_pretrain.py][INFO] Epoch:[0/2](832100/4588595) loss:2.729 lr:0.0000100 
epoch_Time:23908.0min: [2024-01-06 09:55:51,153][model8_pretrain.py][INFO] Epoch:[0/2](832100/4588595) loss:3.190 lr:0.0000100 epoch_Time:23908.0min: [2024-01-06 09:55:51,153][model8_pretrain.py][INFO] Epoch:[0/2](832100/4588595) loss:2.960 lr:0.0000100 epoch_Time:23908.0min: [2024-01-06 09:55:51,153][model8_pretrain.py][INFO] Epoch:[0/2](832100/4588595) loss:2.820 lr:0.0000100 epoch_Time:23908.0min: [2024-01-06 09:55:51,153][model8_pretrain.py][INFO] Epoch:[0/2](832100/4588595) loss:3.533 lr:0.0000100 epoch_Time:23908.0min: [2024-01-06 09:55:51,153][model8_pretrain.py][INFO] Epoch:[0/2](832100/4588595) loss:3.290 lr:0.0000100 epoch_Time:23908.0min: [2024-01-06 09:56:28,096][model8_pretrain.py][INFO] Epoch:[0/2](832200/4588595) loss:3.420 lr:0.0000100 epoch_Time:23908.0min: [2024-01-06 09:56:28,096][model8_pretrain.py][INFO] Epoch:[0/2](832200/4588595) loss:2.705 lr:0.0000100 epoch_Time:23908.0min: [2024-01-06 09:56:28,096][model8_pretrain.py][INFO] Epoch:[0/2](832200/4588595) loss:3.270 lr:0.0000100 epoch_Time:23908.0min: [2024-01-06 09:56:28,096][model8_pretrain.py][INFO] Epoch:[0/2](832200/4588595) loss:2.923 lr:0.0000100 epoch_Time:23908.0min: [2024-01-06 09:56:28,097][model8_pretrain.py][INFO] Epoch:[0/2](832200/4588595) loss:2.798 lr:0.0000100 epoch_Time:23908.0min: [2024-01-06 09:56:28,097][model8_pretrain.py][INFO] Epoch:[0/2](832200/4588595) loss:2.507 lr:0.0000100 epoch_Time:23908.0min: [2024-01-06 09:56:28,098][model8_pretrain.py][INFO] Epoch:[0/2](832200/4588595) loss:2.818 lr:0.0000100 epoch_Time:23908.0min: [2024-01-06 09:56:28,099][model8_pretrain.py][INFO] Epoch:[0/2](832200/4588595) loss:2.435 lr:0.0000100 epoch_Time:23908.0min: [2024-01-06 09:57:05,040][model8_pretrain.py][INFO] Epoch:[0/2](832300/4588595) loss:2.825 lr:0.0000100 epoch_Time:23907.0min: [2024-01-06 09:57:05,040][model8_pretrain.py][INFO] Epoch:[0/2](832300/4588595) loss:2.664 lr:0.0000100 epoch_Time:23907.0min: [2024-01-06 09:57:05,040][model8_pretrain.py][INFO] 
Epoch:[0/2](832300/4588595) loss:2.482 lr:0.0000100 epoch_Time:23907.0min: [2024-01-06 09:57:05,040][model8_pretrain.py][INFO] Epoch:[0/2](832300/4588595) loss:3.337 lr:0.0000100 epoch_Time:23907.0min: [2024-01-06 09:57:05,040][model8_pretrain.py][INFO] Epoch:[0/2](832300/4588595) loss:2.897 lr:0.0000100 epoch_Time:23907.0min: [2024-01-06 09:57:05,041][model8_pretrain.py][INFO] Epoch:[0/2](832300/4588595) loss:3.456 lr:0.0000100 epoch_Time:23907.0min: [2024-01-06 09:57:05,040][model8_pretrain.py][INFO] Epoch:[0/2](832300/4588595) loss:3.038 lr:0.0000100 epoch_Time:23907.0min: [2024-01-06 09:57:05,041][model8_pretrain.py][INFO] Epoch:[0/2](832300/4588595) loss:2.427 lr:0.0000100 epoch_Time:23907.0min: [2024-01-06 09:57:41,980][model8_pretrain.py][INFO] Epoch:[0/2](832400/4588595) loss:2.307 lr:0.0000100 epoch_Time:23907.0min: [2024-01-06 09:57:41,980][model8_pretrain.py][INFO] Epoch:[0/2](832400/4588595) loss:2.466 lr:0.0000100 epoch_Time:23907.0min: [2024-01-06 09:57:41,980][model8_pretrain.py][INFO] Epoch:[0/2](832400/4588595) loss:2.741 lr:0.0000100 epoch_Time:23907.0min: [2024-01-06 09:57:41,980][model8_pretrain.py][INFO] Epoch:[0/2](832400/4588595) loss:3.126 lr:0.0000100 epoch_Time:23907.0min: [2024-01-06 09:57:41,980][model8_pretrain.py][INFO] Epoch:[0/2](832400/4588595) loss:3.034 lr:0.0000100 epoch_Time:23907.0min: [2024-01-06 09:57:41,981][model8_pretrain.py][INFO] Epoch:[0/2](832400/4588595) loss:2.640 lr:0.0000100 epoch_Time:23907.0min: [2024-01-06 09:57:41,981][model8_pretrain.py][INFO] Epoch:[0/2](832400/4588595) loss:3.152 lr:0.0000100 epoch_Time:23907.0min: [2024-01-06 09:57:41,981][model8_pretrain.py][INFO] Epoch:[0/2](832400/4588595) loss:3.064 lr:0.0000100 epoch_Time:23907.0min: [2024-01-06 09:58:27,450][model8_pretrain.py][INFO] Epoch:[0/2](832500/4588595) loss:3.177 lr:0.0000100 epoch_Time:23907.0min: [2024-01-06 09:58:27,450][model8_pretrain.py][INFO] Epoch:[0/2](832500/4588595) loss:3.492 lr:0.0000100 epoch_Time:23907.0min: [2024-01-06 
09:58:27,450][model8_pretrain.py][INFO] Epoch:[0/2](832500/4588595) loss:3.159 lr:0.0000100 epoch_Time:23907.0min: [2024-01-06 09:58:27,450][model8_pretrain.py][INFO] Epoch:[0/2](832500/4588595) loss:2.302 lr:0.0000100 epoch_Time:23907.0min: [2024-01-06 09:58:27,450][model8_pretrain.py][INFO] Epoch:[0/2](832500/4588595) loss:2.711 lr:0.0000100 epoch_Time:23907.0min: [2024-01-06 09:58:27,450][model8_pretrain.py][INFO] Epoch:[0/2](832500/4588595) loss:3.282 lr:0.0000100 epoch_Time:23907.0min: [2024-01-06 09:58:27,450][model8_pretrain.py][INFO] Epoch:[0/2](832500/4588595) loss:3.026 lr:0.0000100 epoch_Time:23907.0min: [2024-01-06 09:58:27,450][model8_pretrain.py][INFO] Epoch:[0/2](832500/4588595) loss:3.104 lr:0.0000100 epoch_Time:23907.0min: [2024-01-06 09:59:06,061][model8_pretrain.py][INFO] Epoch:[0/2](832600/4588595) loss:2.469 lr:0.0000100 epoch_Time:23906.0min: [2024-01-06 09:59:06,061][model8_pretrain.py][INFO] Epoch:[0/2](832600/4588595) loss:3.161 lr:0.0000100 epoch_Time:23906.0min: [2024-01-06 09:59:06,061][model8_pretrain.py][INFO] Epoch:[0/2](832600/4588595) loss:3.198 lr:0.0000100 epoch_Time:23906.0min: [2024-01-06 09:59:06,061][model8_pretrain.py][INFO] Epoch:[0/2](832600/4588595) loss:2.919 lr:0.0000100 epoch_Time:23906.0min: [2024-01-06 09:59:06,061][model8_pretrain.py][INFO] Epoch:[0/2](832600/4588595) loss:3.350 lr:0.0000100 epoch_Time:23906.0min: [2024-01-06 09:59:06,061][model8_pretrain.py][INFO] Epoch:[0/2](832600/4588595) loss:2.741 lr:0.0000100 epoch_Time:23906.0min: [2024-01-06 09:59:06,061][model8_pretrain.py][INFO] Epoch:[0/2](832600/4588595) loss:2.401 lr:0.0000100 epoch_Time:23906.0min: [2024-01-06 09:59:06,062][model8_pretrain.py][INFO] Epoch:[0/2](832600/4588595) loss:2.874 lr:0.0000100 epoch_Time:23906.0min: [2024-01-06 09:59:43,012][model8_pretrain.py][INFO] Epoch:[0/2](832700/4588595) loss:2.653 lr:0.0000100 epoch_Time:23906.0min: [2024-01-06 09:59:43,012][model8_pretrain.py][INFO] Epoch:[0/2](832700/4588595) loss:2.905 lr:0.0000100 
epoch_Time:23906.0min: [2024-01-06 09:59:43,012][model8_pretrain.py][INFO] Epoch:[0/2](832700/4588595) loss:2.768 lr:0.0000100 epoch_Time:23906.0min: [2024-01-06 09:59:43,012][model8_pretrain.py][INFO] Epoch:[0/2](832700/4588595) loss:3.373 lr:0.0000100 epoch_Time:23906.0min: [2024-01-06 09:59:43,012][model8_pretrain.py][INFO] Epoch:[0/2](832700/4588595) loss:3.044 lr:0.0000100 epoch_Time:23906.0min: [2024-01-06 09:59:43,012][model8_pretrain.py][INFO] Epoch:[0/2](832700/4588595) loss:2.655 lr:0.0000100 epoch_Time:23906.0min: [2024-01-06 09:59:43,012][model8_pretrain.py][INFO] Epoch:[0/2](832700/4588595) loss:2.432 lr:0.0000100 epoch_Time:23906.0min: [2024-01-06 09:59:43,013][model8_pretrain.py][INFO] Epoch:[0/2](832700/4588595) loss:3.342 lr:0.0000100 epoch_Time:23906.0min: [2024-01-06 10:00:19,968][model8_pretrain.py][INFO] Epoch:[0/2](832800/4588595) loss:2.554 lr:0.0000100 epoch_Time:23905.0min: [2024-01-06 10:00:19,968][model8_pretrain.py][INFO] Epoch:[0/2](832800/4588595) loss:2.616 lr:0.0000100 epoch_Time:23905.0min: [2024-01-06 10:00:19,968][model8_pretrain.py][INFO] Epoch:[0/2](832800/4588595) loss:2.688 lr:0.0000100 epoch_Time:23905.0min: [2024-01-06 10:00:19,968][model8_pretrain.py][INFO] Epoch:[0/2](832800/4588595) loss:2.527 lr:0.0000100 epoch_Time:23905.0min: [2024-01-06 10:00:19,968][model8_pretrain.py][INFO] Epoch:[0/2](832800/4588595) loss:3.023 lr:0.0000100 epoch_Time:23905.0min: [2024-01-06 10:00:19,968][model8_pretrain.py][INFO] Epoch:[0/2](832800/4588595) loss:2.965 lr:0.0000100 epoch_Time:23905.0min: [2024-01-06 10:00:19,968][model8_pretrain.py][INFO] Epoch:[0/2](832800/4588595) loss:3.172 lr:0.0000100 epoch_Time:23905.0min: [2024-01-06 10:00:19,968][model8_pretrain.py][INFO] Epoch:[0/2](832800/4588595) loss:2.402 lr:0.0000100 epoch_Time:23905.0min: [2024-01-06 10:00:56,918][model8_pretrain.py][INFO] Epoch:[0/2](832900/4588595) loss:2.401 lr:0.0000100 epoch_Time:23903.0min: [2024-01-06 10:00:56,918][model8_pretrain.py][INFO] 
Epoch:[0/2](832900/4588595) loss:2.767 lr:0.0000100 epoch_Time:23903.0min: [2024-01-06 10:00:56,918][model8_pretrain.py][INFO] Epoch:[0/2](832900/4588595) loss:3.157 lr:0.0000100 epoch_Time:23903.0min: [2024-01-06 10:00:56,918][model8_pretrain.py][INFO] Epoch:[0/2](832900/4588595) loss:2.967 lr:0.0000100 epoch_Time:23903.0min: [2024-01-06 10:00:56,918][model8_pretrain.py][INFO] Epoch:[0/2](832900/4588595) loss:2.439 lr:0.0000100 epoch_Time:23903.0min: [2024-01-06 10:00:56,918][model8_pretrain.py][INFO] Epoch:[0/2](832900/4588595) loss:2.899 lr:0.0000100 epoch_Time:23903.0min: [2024-01-06 10:00:56,918][model8_pretrain.py][INFO] Epoch:[0/2](832900/4588595) loss:2.573 lr:0.0000100 epoch_Time:23903.0min: [2024-01-06 10:00:56,918][model8_pretrain.py][INFO] Epoch:[0/2](832900/4588595) loss:2.044 lr:0.0000100 epoch_Time:23903.0min: [2024-01-06 10:01:33,883][model8_pretrain.py][INFO] Epoch:[0/2](833000/4588595) loss:2.579 lr:0.0000100 epoch_Time:23903.0min: [2024-01-06 10:01:33,883][model8_pretrain.py][INFO] Epoch:[0/2](833000/4588595) loss:2.153 lr:0.0000100 epoch_Time:23903.0min: [2024-01-06 10:01:33,883][model8_pretrain.py][INFO] Epoch:[0/2](833000/4588595) loss:3.110 lr:0.0000100 epoch_Time:23903.0min: [2024-01-06 10:01:33,883][model8_pretrain.py][INFO] Epoch:[0/2](833000/4588595) loss:3.336 lr:0.0000100 epoch_Time:23903.0min: [2024-01-06 10:01:33,883][model8_pretrain.py][INFO] Epoch:[0/2](833000/4588595) loss:2.440 lr:0.0000100 epoch_Time:23903.0min: [2024-01-06 10:01:33,883][model8_pretrain.py][INFO] Epoch:[0/2](833000/4588595) loss:2.925 lr:0.0000100 epoch_Time:23903.0min: [2024-01-06 10:01:33,883][model8_pretrain.py][INFO] Epoch:[0/2](833000/4588595) loss:2.748 lr:0.0000100 epoch_Time:23903.0min: [2024-01-06 10:01:33,883][model8_pretrain.py][INFO] Epoch:[0/2](833000/4588595) loss:2.515 lr:0.0000100 epoch_Time:23903.0min: [2024-01-06 10:02:10,835][model8_pretrain.py][INFO] Epoch:[0/2](833100/4588595) loss:2.700 lr:0.0000100 epoch_Time:23902.0min: [2024-01-06 
10:02:10,835][model8_pretrain.py][INFO] Epoch:[0/2](833100/4588595) loss:3.106 lr:0.0000100 epoch_Time:23902.0min: [2024-01-06 10:02:10,835][model8_pretrain.py][INFO] Epoch:[0/2](833100/4588595) loss:3.106 lr:0.0000100 epoch_Time:23902.0min: [2024-01-06 10:02:10,835][model8_pretrain.py][INFO] Epoch:[0/2](833100/4588595) loss:2.853 lr:0.0000100 epoch_Time:23902.0min: [2024-01-06 10:02:10,835][model8_pretrain.py][INFO] Epoch:[0/2](833100/4588595) loss:3.010 lr:0.0000100 epoch_Time:23902.0min: [2024-01-06 10:02:10,835][model8_pretrain.py][INFO] Epoch:[0/2](833100/4588595) loss:2.654 lr:0.0000100 epoch_Time:23902.0min: [2024-01-06 10:02:10,835][model8_pretrain.py][INFO] Epoch:[0/2](833100/4588595) loss:2.913 lr:0.0000100 epoch_Time:23902.0min: [2024-01-06 10:02:10,835][model8_pretrain.py][INFO] Epoch:[0/2](833100/4588595) loss:3.366 lr:0.0000100 epoch_Time:23902.0min: [2024-01-06 10:02:47,792][model8_pretrain.py][INFO] Epoch:[0/2](833200/4588595) loss:2.874 lr:0.0000100 epoch_Time:23901.0min: [2024-01-06 10:02:47,792][model8_pretrain.py][INFO] Epoch:[0/2](833200/4588595) loss:3.002 lr:0.0000100 epoch_Time:23901.0min: [2024-01-06 10:02:47,792][model8_pretrain.py][INFO] Epoch:[0/2](833200/4588595) loss:2.981 lr:0.0000100 epoch_Time:23901.0min: [2024-01-06 10:02:47,792][model8_pretrain.py][INFO] Epoch:[0/2](833200/4588595) loss:2.839 lr:0.0000100 epoch_Time:23901.0min: [2024-01-06 10:02:47,792][model8_pretrain.py][INFO] Epoch:[0/2](833200/4588595) loss:2.944 lr:0.0000100 epoch_Time:23901.0min: [2024-01-06 10:02:47,792][model8_pretrain.py][INFO] Epoch:[0/2](833200/4588595) loss:2.607 lr:0.0000100 epoch_Time:23901.0min: [2024-01-06 10:02:47,793][model8_pretrain.py][INFO] Epoch:[0/2](833200/4588595) loss:2.529 lr:0.0000100 epoch_Time:23901.0min: [2024-01-06 10:02:47,793][model8_pretrain.py][INFO] Epoch:[0/2](833200/4588595) loss:2.873 lr:0.0000100 epoch_Time:23901.0min: [2024-01-06 10:03:33,410][model8_pretrain.py][INFO] Epoch:[0/2](833300/4588595) loss:3.173 lr:0.0000100 
epoch_Time:23902.0min: [2024-01-06 10:03:33,410][model8_pretrain.py][INFO] Epoch:[0/2](833300/4588595) loss:2.384 lr:0.0000100 epoch_Time:23902.0min: [2024-01-06 10:03:33,410][model8_pretrain.py][INFO] Epoch:[0/2](833300/4588595) loss:3.038 lr:0.0000100 epoch_Time:23902.0min: [2024-01-06 10:03:33,410][model8_pretrain.py][INFO] Epoch:[0/2](833300/4588595) loss:3.252 lr:0.0000100 epoch_Time:23902.0min: [2024-01-06 10:03:33,410][model8_pretrain.py][INFO] Epoch:[0/2](833300/4588595) loss:2.972 lr:0.0000100 epoch_Time:23902.0min: [2024-01-06 10:03:33,410][model8_pretrain.py][INFO] Epoch:[0/2](833300/4588595) loss:2.576 lr:0.0000100 epoch_Time:23902.0min: [2024-01-06 10:03:33,410][model8_pretrain.py][INFO] Epoch:[0/2](833300/4588595) loss:3.167 lr:0.0000100 epoch_Time:23902.0min: [2024-01-06 10:03:33,410][model8_pretrain.py][INFO] Epoch:[0/2](833300/4588595) loss:2.669 lr:0.0000100 epoch_Time:23902.0min: [2024-01-06 10:04:12,110][model8_pretrain.py][INFO] Epoch:[0/2](833400/4588595) loss:3.230 lr:0.0000100 epoch_Time:23901.0min: [2024-01-06 10:04:12,110][model8_pretrain.py][INFO] Epoch:[0/2](833400/4588595) loss:3.335 lr:0.0000100 epoch_Time:23901.0min: [2024-01-06 10:04:12,110][model8_pretrain.py][INFO] Epoch:[0/2](833400/4588595) loss:3.032 lr:0.0000100 epoch_Time:23901.0min: [2024-01-06 10:04:12,110][model8_pretrain.py][INFO] Epoch:[0/2](833400/4588595) loss:2.844 lr:0.0000100 epoch_Time:23901.0min: [2024-01-06 10:04:12,110][model8_pretrain.py][INFO] Epoch:[0/2](833400/4588595) loss:3.038 lr:0.0000100 epoch_Time:23901.0min: [2024-01-06 10:04:12,110][model8_pretrain.py][INFO] Epoch:[0/2](833400/4588595) loss:2.699 lr:0.0000100 epoch_Time:23901.0min: [2024-01-06 10:04:12,110][model8_pretrain.py][INFO] Epoch:[0/2](833400/4588595) loss:2.180 lr:0.0000100 epoch_Time:23901.0min: [2024-01-06 10:04:12,110][model8_pretrain.py][INFO] Epoch:[0/2](833400/4588595) loss:2.591 lr:0.0000100 epoch_Time:23901.0min: [2024-01-06 10:04:49,061][model8_pretrain.py][INFO] 
Epoch:[0/2](833500/4588595) loss:2.767 lr:0.0000100 epoch_Time:23900.0min: [2024-01-06 10:04:49,061][model8_pretrain.py][INFO] Epoch:[0/2](833500/4588595) loss:3.025 lr:0.0000100 epoch_Time:23900.0min: [2024-01-06 10:04:49,061][model8_pretrain.py][INFO] Epoch:[0/2](833500/4588595) loss:2.330 lr:0.0000100 epoch_Time:23900.0min: [2024-01-06 10:04:49,061][model8_pretrain.py][INFO] Epoch:[0/2](833500/4588595) loss:2.747 lr:0.0000100 epoch_Time:23900.0min: [2024-01-06 10:04:49,061][model8_pretrain.py][INFO] Epoch:[0/2](833500/4588595) loss:2.788 lr:0.0000100 epoch_Time:23900.0min: [2024-01-06 10:04:49,061][model8_pretrain.py][INFO] Epoch:[0/2](833500/4588595) loss:3.030 lr:0.0000100 epoch_Time:23900.0min: [2024-01-06 10:04:49,061][model8_pretrain.py][INFO] Epoch:[0/2](833500/4588595) loss:2.751 lr:0.0000100 epoch_Time:23900.0min: [2024-01-06 10:04:49,061][model8_pretrain.py][INFO] Epoch:[0/2](833500/4588595) loss:3.081 lr:0.0000100 epoch_Time:23900.0min: [2024-01-06 10:05:25,999][model8_pretrain.py][INFO] Epoch:[0/2](833600/4588595) loss:3.271 lr:0.0000100 epoch_Time:23900.0min: [2024-01-06 10:05:25,999][model8_pretrain.py][INFO] Epoch:[0/2](833600/4588595) loss:3.286 lr:0.0000100 epoch_Time:23900.0min: [2024-01-06 10:05:25,999][model8_pretrain.py][INFO] Epoch:[0/2](833600/4588595) loss:2.882 lr:0.0000100 epoch_Time:23900.0min: [2024-01-06 10:05:26,000][model8_pretrain.py][INFO] Epoch:[0/2](833600/4588595) loss:2.967 lr:0.0000100 epoch_Time:23900.0min: [2024-01-06 10:05:26,000][model8_pretrain.py][INFO] Epoch:[0/2](833600/4588595) loss:2.517 lr:0.0000100 epoch_Time:23900.0min: [2024-01-06 10:05:26,000][model8_pretrain.py][INFO] Epoch:[0/2](833600/4588595) loss:3.190 lr:0.0000100 epoch_Time:23900.0min: [2024-01-06 10:05:26,000][model8_pretrain.py][INFO] Epoch:[0/2](833600/4588595) loss:2.807 lr:0.0000100 epoch_Time:23900.0min: [2024-01-06 10:05:26,000][model8_pretrain.py][INFO] Epoch:[0/2](833600/4588595) loss:3.154 lr:0.0000100 epoch_Time:23900.0min: [2024-01-06 
10:06:02,946][model8_pretrain.py][INFO] Epoch:[0/2](833700/4588595) loss:2.300 lr:0.0000100 epoch_Time:23899.0min: [2024-01-06 10:06:02,946][model8_pretrain.py][INFO] Epoch:[0/2](833700/4588595) loss:3.101 lr:0.0000100 epoch_Time:23899.0min: [2024-01-06 10:06:02,946][model8_pretrain.py][INFO] Epoch:[0/2](833700/4588595) loss:2.674 lr:0.0000100 epoch_Time:23899.0min: [2024-01-06 10:06:02,946][model8_pretrain.py][INFO] Epoch:[0/2](833700/4588595) loss:3.044 lr:0.0000100 epoch_Time:23899.0min: [2024-01-06 10:06:02,946][model8_pretrain.py][INFO] Epoch:[0/2](833700/4588595) loss:3.349 lr:0.0000100 epoch_Time:23899.0min: [2024-01-06 10:06:02,946][model8_pretrain.py][INFO] Epoch:[0/2](833700/4588595) loss:3.144 lr:0.0000100 epoch_Time:23899.0min: [2024-01-06 10:06:02,946][model8_pretrain.py][INFO] Epoch:[0/2](833700/4588595) loss:2.781 lr:0.0000100 epoch_Time:23899.0min: [2024-01-06 10:06:02,947][model8_pretrain.py][INFO] Epoch:[0/2](833700/4588595) loss:2.862 lr:0.0000100 epoch_Time:23899.0min: [2024-01-06 10:06:39,883][model8_pretrain.py][INFO] Epoch:[0/2](833800/4588595) loss:3.043 lr:0.0000100 epoch_Time:23898.0min: [2024-01-06 10:06:39,883][model8_pretrain.py][INFO] Epoch:[0/2](833800/4588595) loss:2.607 lr:0.0000100 epoch_Time:23898.0min: [2024-01-06 10:06:39,883][model8_pretrain.py][INFO] Epoch:[0/2](833800/4588595) loss:3.076 lr:0.0000100 epoch_Time:23898.0min: [2024-01-06 10:06:39,883][model8_pretrain.py][INFO] Epoch:[0/2](833800/4588595) loss:2.739 lr:0.0000100 epoch_Time:23898.0min: [2024-01-06 10:06:39,883][model8_pretrain.py][INFO] Epoch:[0/2](833800/4588595) loss:2.588 lr:0.0000100 epoch_Time:23898.0min: [2024-01-06 10:06:39,884][model8_pretrain.py][INFO] Epoch:[0/2](833800/4588595) loss:2.944 lr:0.0000100 epoch_Time:23898.0min: [2024-01-06 10:06:39,884][model8_pretrain.py][INFO] Epoch:[0/2](833800/4588595) loss:2.602 lr:0.0000100 epoch_Time:23898.0min: [2024-01-06 10:06:39,884][model8_pretrain.py][INFO] Epoch:[0/2](833800/4588595) loss:2.380 lr:0.0000100 
epoch_Time:23898.0min: [2024-01-06 10:07:16,810][model8_pretrain.py][INFO] Epoch:[0/2](833900/4588595) loss:2.730 lr:0.0000100 epoch_Time:23897.0min: [2024-01-06 10:07:16,810][model8_pretrain.py][INFO] Epoch:[0/2](833900/4588595) loss:2.531 lr:0.0000100 epoch_Time:23897.0min: [2024-01-06 10:07:16,810][model8_pretrain.py][INFO] Epoch:[0/2](833900/4588595) loss:2.632 lr:0.0000100 epoch_Time:23897.0min: [2024-01-06 10:07:16,810][model8_pretrain.py][INFO] Epoch:[0/2](833900/4588595) loss:3.475 lr:0.0000100 epoch_Time:23897.0min: [2024-01-06 10:07:16,810][model8_pretrain.py][INFO] Epoch:[0/2](833900/4588595) loss:2.959 lr:0.0000100 epoch_Time:23897.0min: [2024-01-06 10:07:16,810][model8_pretrain.py][INFO] Epoch:[0/2](833900/4588595) loss:2.396 lr:0.0000100 epoch_Time:23897.0min: [2024-01-06 10:07:16,810][model8_pretrain.py][INFO] Epoch:[0/2](833900/4588595) loss:3.086 lr:0.0000100 epoch_Time:23897.0min: [2024-01-06 10:07:16,810][model8_pretrain.py][INFO] Epoch:[0/2](833900/4588595) loss:2.178 lr:0.0000100 epoch_Time:23897.0min: [2024-01-06 10:07:53,734][model8_pretrain.py][INFO] Epoch:[0/2](834000/4588595) loss:2.612 lr:0.0000100 epoch_Time:23896.0min: [2024-01-06 10:07:53,734][model8_pretrain.py][INFO] Epoch:[0/2](834000/4588595) loss:2.964 lr:0.0000100 epoch_Time:23896.0min: [2024-01-06 10:07:53,734][model8_pretrain.py][INFO] Epoch:[0/2](834000/4588595) loss:3.281 lr:0.0000100 epoch_Time:23896.0min: [2024-01-06 10:07:53,734][model8_pretrain.py][INFO] Epoch:[0/2](834000/4588595) loss:3.087 lr:0.0000100 epoch_Time:23896.0min: [2024-01-06 10:07:53,734][model8_pretrain.py][INFO] Epoch:[0/2](834000/4588595) loss:2.502 lr:0.0000100 epoch_Time:23896.0min: [2024-01-06 10:07:53,734][model8_pretrain.py][INFO] Epoch:[0/2](834000/4588595) loss:2.903 lr:0.0000100 epoch_Time:23896.0min: [2024-01-06 10:07:53,735][model8_pretrain.py][INFO] Epoch:[0/2](834000/4588595) loss:2.818 lr:0.0000100 epoch_Time:23896.0min: [2024-01-06 10:07:53,735][model8_pretrain.py][INFO] 
Epoch:[0/2](834000/4588595) loss:2.817 lr:0.0000100 epoch_Time:23896.0min: [2024-01-06 10:08:37,625][model8_pretrain.py][INFO] Epoch:[0/2](834100/4588595) loss:2.810 lr:0.0000100 epoch_Time:23897.0min: [2024-01-06 10:08:37,625][model8_pretrain.py][INFO] Epoch:[0/2](834100/4588595) loss:2.816 lr:0.0000100 epoch_Time:23897.0min: [2024-01-06 10:08:37,625][model8_pretrain.py][INFO] Epoch:[0/2](834100/4588595) loss:2.654 lr:0.0000100 epoch_Time:23897.0min: [2024-01-06 10:08:37,625][model8_pretrain.py][INFO] Epoch:[0/2](834100/4588595) loss:2.833 lr:0.0000100 epoch_Time:23897.0min: [2024-01-06 10:08:37,625][model8_pretrain.py][INFO] Epoch:[0/2](834100/4588595) loss:3.006 lr:0.0000100 epoch_Time:23897.0min: [2024-01-06 10:08:37,625][model8_pretrain.py][INFO] Epoch:[0/2](834100/4588595) loss:3.155 lr:0.0000100 epoch_Time:23897.0min: [2024-01-06 10:08:37,625][model8_pretrain.py][INFO] Epoch:[0/2](834100/4588595) loss:3.226 lr:0.0000100 epoch_Time:23897.0min: [2024-01-06 10:08:37,625][model8_pretrain.py][INFO] Epoch:[0/2](834100/4588595) loss:2.780 lr:0.0000100 epoch_Time:23897.0min: [2024-01-06 10:09:18,013][model8_pretrain.py][INFO] Epoch:[0/2](834200/4588595) loss:3.346 lr:0.0000100 epoch_Time:23896.0min: [2024-01-06 10:09:18,013][model8_pretrain.py][INFO] Epoch:[0/2](834200/4588595) loss:2.862 lr:0.0000100 epoch_Time:23896.0min: [2024-01-06 10:09:18,013][model8_pretrain.py][INFO] Epoch:[0/2](834200/4588595) loss:3.056 lr:0.0000100 epoch_Time:23896.0min: [2024-01-06 10:09:18,013][model8_pretrain.py][INFO] Epoch:[0/2](834200/4588595) loss:2.907 lr:0.0000100 epoch_Time:23896.0min: [2024-01-06 10:09:18,013][model8_pretrain.py][INFO] Epoch:[0/2](834200/4588595) loss:2.066 lr:0.0000100 epoch_Time:23896.0min: [2024-01-06 10:09:18,013][model8_pretrain.py][INFO] Epoch:[0/2](834200/4588595) loss:2.784 lr:0.0000100 epoch_Time:23896.0min: [2024-01-06 10:09:18,014][model8_pretrain.py][INFO] Epoch:[0/2](834200/4588595) loss:2.045 lr:0.0000100 epoch_Time:23896.0min: [2024-01-06 
10:09:18,013][model8_pretrain.py][INFO] Epoch:[0/2](834200/4588595) loss:3.099 lr:0.0000100 epoch_Time:23896.0min: [2024-01-06 10:09:54,956][model8_pretrain.py][INFO] Epoch:[0/2](834300/4588595) loss:2.363 lr:0.0000100 epoch_Time:23895.0min: [2024-01-06 10:09:54,956][model8_pretrain.py][INFO] Epoch:[0/2](834300/4588595) loss:2.861 lr:0.0000100 epoch_Time:23895.0min: [2024-01-06 10:09:54,956][model8_pretrain.py][INFO] Epoch:[0/2](834300/4588595) loss:3.019 lr:0.0000100 epoch_Time:23895.0min: [2024-01-06 10:09:54,956][model8_pretrain.py][INFO] Epoch:[0/2](834300/4588595) loss:2.982 lr:0.0000100 epoch_Time:23895.0min: [2024-01-06 10:09:54,956][model8_pretrain.py][INFO] Epoch:[0/2](834300/4588595) loss:3.171 lr:0.0000100 epoch_Time:23895.0min: [2024-01-06 10:09:54,956][model8_pretrain.py][INFO] Epoch:[0/2](834300/4588595) loss:2.446 lr:0.0000100 epoch_Time:23895.0min: [2024-01-06 10:09:54,956][model8_pretrain.py][INFO] Epoch:[0/2](834300/4588595) loss:2.696 lr:0.0000100 epoch_Time:23895.0min: [2024-01-06 10:09:54,956][model8_pretrain.py][INFO] Epoch:[0/2](834300/4588595) loss:2.519 lr:0.0000100 epoch_Time:23895.0min: [2024-01-06 10:10:31,897][model8_pretrain.py][INFO] Epoch:[0/2](834400/4588595) loss:2.853 lr:0.0000100 epoch_Time:23895.0min: [2024-01-06 10:10:31,897][model8_pretrain.py][INFO] Epoch:[0/2](834400/4588595) loss:3.467 lr:0.0000100 epoch_Time:23895.0min: [2024-01-06 10:10:31,897][model8_pretrain.py][INFO] Epoch:[0/2](834400/4588595) loss:3.371 lr:0.0000100 epoch_Time:23895.0min: [2024-01-06 10:10:31,897][model8_pretrain.py][INFO] Epoch:[0/2](834400/4588595) loss:3.203 lr:0.0000100 epoch_Time:23895.0min: [2024-01-06 10:10:31,897][model8_pretrain.py][INFO] Epoch:[0/2](834400/4588595) loss:3.058 lr:0.0000100 epoch_Time:23895.0min: [2024-01-06 10:10:31,897][model8_pretrain.py][INFO] Epoch:[0/2](834400/4588595) loss:2.452 lr:0.0000100 epoch_Time:23895.0min: [2024-01-06 10:10:31,897][model8_pretrain.py][INFO] Epoch:[0/2](834400/4588595) loss:2.403 lr:0.0000100 
epoch_Time:23895.0min: [2024-01-06 10:10:31,897][model8_pretrain.py][INFO] Epoch:[0/2](834400/4588595) loss:2.782 lr:0.0000100 epoch_Time:23895.0min: [2024-01-06 10:11:08,845][model8_pretrain.py][INFO] Epoch:[0/2](834500/4588595) loss:2.614 lr:0.0000100 epoch_Time:23894.0min: [2024-01-06 10:11:08,845][model8_pretrain.py][INFO] Epoch:[0/2](834500/4588595) loss:2.869 lr:0.0000100 epoch_Time:23894.0min: [2024-01-06 10:11:08,845][model8_pretrain.py][INFO] Epoch:[0/2](834500/4588595) loss:2.494 lr:0.0000100 epoch_Time:23894.0min: [2024-01-06 10:11:08,845][model8_pretrain.py][INFO] Epoch:[0/2](834500/4588595) loss:2.927 lr:0.0000100 epoch_Time:23894.0min: [2024-01-06 10:11:08,845][model8_pretrain.py][INFO] Epoch:[0/2](834500/4588595) loss:3.096 lr:0.0000100 epoch_Time:23894.0min: [2024-01-06 10:11:08,845][model8_pretrain.py][INFO] Epoch:[0/2](834500/4588595) loss:2.247 lr:0.0000100 epoch_Time:23894.0min: [2024-01-06 10:11:08,845][model8_pretrain.py][INFO] Epoch:[0/2](834500/4588595) loss:2.388 lr:0.0000100 epoch_Time:23894.0min: [2024-01-06 10:11:08,845][model8_pretrain.py][INFO] Epoch:[0/2](834500/4588595) loss:2.766 lr:0.0000100 epoch_Time:23894.0min: [2024-01-06 10:11:45,786][model8_pretrain.py][INFO] Epoch:[0/2](834600/4588595) loss:2.492 lr:0.0000100 epoch_Time:23893.0min: [2024-01-06 10:11:45,786][model8_pretrain.py][INFO] Epoch:[0/2](834600/4588595) loss:2.740 lr:0.0000100 epoch_Time:23893.0min: [2024-01-06 10:11:45,786][model8_pretrain.py][INFO] Epoch:[0/2](834600/4588595) loss:2.508 lr:0.0000100 epoch_Time:23893.0min: [2024-01-06 10:11:45,786][model8_pretrain.py][INFO] Epoch:[0/2](834600/4588595) loss:3.171 lr:0.0000100 epoch_Time:23893.0min: [2024-01-06 10:11:45,786][model8_pretrain.py][INFO] Epoch:[0/2](834600/4588595) loss:2.743 lr:0.0000100 epoch_Time:23893.0min: [2024-01-06 10:11:45,786][model8_pretrain.py][INFO] Epoch:[0/2](834600/4588595) loss:2.095 lr:0.0000100 epoch_Time:23893.0min: [2024-01-06 10:11:45,786][model8_pretrain.py][INFO] 
Epoch:[0/2](834600/4588595) loss:3.100 lr:0.0000100 epoch_Time:23893.0min: [2024-01-06 10:11:45,786][model8_pretrain.py][INFO] Epoch:[0/2](834600/4588595) loss:2.375 lr:0.0000100 epoch_Time:23893.0min: [2024-01-06 10:12:22,726][model8_pretrain.py][INFO] Epoch:[0/2](834700/4588595) loss:2.543 lr:0.0000100 epoch_Time:23892.0min: [2024-01-06 10:12:22,726][model8_pretrain.py][INFO] Epoch:[0/2](834700/4588595) loss:2.659 lr:0.0000100 epoch_Time:23892.0min: [2024-01-06 10:12:22,726][model8_pretrain.py][INFO] Epoch:[0/2](834700/4588595) loss:2.851 lr:0.0000100 epoch_Time:23892.0min: [2024-01-06 10:12:22,726][model8_pretrain.py][INFO] Epoch:[0/2](834700/4588595) loss:2.679 lr:0.0000100 epoch_Time:23892.0min: [2024-01-06 10:12:22,726][model8_pretrain.py][INFO] Epoch:[0/2](834700/4588595) loss:3.336 lr:0.0000100 epoch_Time:23892.0min: [2024-01-06 10:12:22,726][model8_pretrain.py][INFO] Epoch:[0/2](834700/4588595) loss:3.011 lr:0.0000100 epoch_Time:23892.0min: [2024-01-06 10:12:22,726][model8_pretrain.py][INFO] Epoch:[0/2](834700/4588595) loss:2.988 lr:0.0000100 epoch_Time:23892.0min: [2024-01-06 10:12:22,726][model8_pretrain.py][INFO] Epoch:[0/2](834700/4588595) loss:2.911 lr:0.0000100 epoch_Time:23892.0min: [2024-01-06 10:12:59,668][model8_pretrain.py][INFO] Epoch:[0/2](834800/4588595) loss:2.441 lr:0.0000100 epoch_Time:23891.0min: [2024-01-06 10:12:59,668][model8_pretrain.py][INFO] Epoch:[0/2](834800/4588595) loss:3.086 lr:0.0000100 epoch_Time:23891.0min: [2024-01-06 10:12:59,668][model8_pretrain.py][INFO] Epoch:[0/2](834800/4588595) loss:3.180 lr:0.0000100 epoch_Time:23891.0min: [2024-01-06 10:12:59,668][model8_pretrain.py][INFO] Epoch:[0/2](834800/4588595) loss:2.773 lr:0.0000100 epoch_Time:23891.0min: [2024-01-06 10:12:59,668][model8_pretrain.py][INFO] Epoch:[0/2](834800/4588595) loss:3.085 lr:0.0000100 epoch_Time:23891.0min: [2024-01-06 10:12:59,668][model8_pretrain.py][INFO] Epoch:[0/2](834800/4588595) loss:2.794 lr:0.0000100 epoch_Time:23891.0min: [2024-01-06 
10:12:59,668][model8_pretrain.py][INFO] Epoch:[0/2](834800/4588595) loss:2.805 lr:0.0000100 epoch_Time:23891.0min: [2024-01-06 10:12:59,668][model8_pretrain.py][INFO] Epoch:[0/2](834800/4588595) loss:2.701 lr:0.0000100 epoch_Time:23891.0min: [2024-01-06 10:13:43,558][model8_pretrain.py][INFO] Epoch:[0/2](834900/4588595) loss:2.707 lr:0.0000100 epoch_Time:23892.0min: [2024-01-06 10:13:43,559][model8_pretrain.py][INFO] Epoch:[0/2](834900/4588595) loss:2.939 lr:0.0000100 epoch_Time:23892.0min: [2024-01-06 10:13:43,559][model8_pretrain.py][INFO] Epoch:[0/2](834900/4588595) loss:1.892 lr:0.0000100 epoch_Time:23892.0min: [2024-01-06 10:13:43,559][model8_pretrain.py][INFO] Epoch:[0/2](834900/4588595) loss:2.797 lr:0.0000100 epoch_Time:23892.0min: [2024-01-06 10:13:43,559][model8_pretrain.py][INFO] Epoch:[0/2](834900/4588595) loss:2.534 lr:0.0000100 epoch_Time:23892.0min: [2024-01-06 10:13:43,559][model8_pretrain.py][INFO] Epoch:[0/2](834900/4588595) loss:3.248 lr:0.0000100 epoch_Time:23892.0min: [2024-01-06 10:13:43,559][model8_pretrain.py][INFO] Epoch:[0/2](834900/4588595) loss:2.879 lr:0.0000100 epoch_Time:23892.0min: [2024-01-06 10:13:43,559][model8_pretrain.py][INFO] Epoch:[0/2](834900/4588595) loss:2.775 lr:0.0000100 epoch_Time:23892.0min: [2024-01-06 10:14:23,854][model8_pretrain.py][INFO] Epoch:[0/2](835000/4588595) loss:2.211 lr:0.0000100 epoch_Time:23891.0min: [2024-01-06 10:14:23,854][model8_pretrain.py][INFO] Epoch:[0/2](835000/4588595) loss:2.598 lr:0.0000100 epoch_Time:23891.0min: [2024-01-06 10:14:23,854][model8_pretrain.py][INFO] Epoch:[0/2](835000/4588595) loss:2.499 lr:0.0000100 epoch_Time:23891.0min: [2024-01-06 10:14:23,854][model8_pretrain.py][INFO] Epoch:[0/2](835000/4588595) loss:3.055 lr:0.0000100 epoch_Time:23891.0min: [2024-01-06 10:14:23,854][model8_pretrain.py][INFO] Epoch:[0/2](835000/4588595) loss:2.703 lr:0.0000100 epoch_Time:23891.0min: [2024-01-06 10:14:23,854][model8_pretrain.py][INFO] Epoch:[0/2](835000/4588595) loss:2.610 lr:0.0000100 
epoch_Time:23891.0min: [2024-01-06 10:14:23,854][model8_pretrain.py][INFO] Epoch:[0/2](835000/4588595) loss:2.802 lr:0.0000100 epoch_Time:23891.0min: [2024-01-06 10:14:23,855][model8_pretrain.py][INFO] Epoch:[0/2](835000/4588595) loss:2.876 lr:0.0000100 epoch_Time:23891.0min: [2024-01-06 10:15:00,793][model8_pretrain.py][INFO] Epoch:[0/2](835100/4588595) loss:3.171 lr:0.0000100 epoch_Time:23890.0min: [2024-01-06 10:15:00,793][model8_pretrain.py][INFO] Epoch:[0/2](835100/4588595) loss:2.364 lr:0.0000100 epoch_Time:23890.0min: [2024-01-06 10:15:00,793][model8_pretrain.py][INFO] Epoch:[0/2](835100/4588595) loss:2.977 lr:0.0000100 epoch_Time:23890.0min: [2024-01-06 10:15:00,793][model8_pretrain.py][INFO] Epoch:[0/2](835100/4588595) loss:3.172 lr:0.0000100 epoch_Time:23890.0min: [2024-01-06 10:15:00,793][model8_pretrain.py][INFO] Epoch:[0/2](835100/4588595) loss:2.840 lr:0.0000100 epoch_Time:23890.0min: [2024-01-06 10:15:00,793][model8_pretrain.py][INFO] Epoch:[0/2](835100/4588595) loss:2.919 lr:0.0000100 epoch_Time:23890.0min: [2024-01-06 10:15:00,794][model8_pretrain.py][INFO] Epoch:[0/2](835100/4588595) loss:2.897 lr:0.0000100 epoch_Time:23890.0min: [2024-01-06 10:15:00,794][model8_pretrain.py][INFO] Epoch:[0/2](835100/4588595) loss:2.709 lr:0.0000100 epoch_Time:23890.0min: [2024-01-06 10:15:37,744][model8_pretrain.py][INFO] Epoch:[0/2](835200/4588595) loss:2.445 lr:0.0000100 epoch_Time:23890.0min: [2024-01-06 10:15:37,744][model8_pretrain.py][INFO] Epoch:[0/2](835200/4588595) loss:2.759 lr:0.0000100 epoch_Time:23890.0min: [2024-01-06 10:15:37,744][model8_pretrain.py][INFO] Epoch:[0/2](835200/4588595) loss:2.682 lr:0.0000100 epoch_Time:23890.0min: [2024-01-06 10:15:37,744][model8_pretrain.py][INFO] Epoch:[0/2](835200/4588595) loss:2.808 lr:0.0000100 epoch_Time:23890.0min: [2024-01-06 10:15:37,744][model8_pretrain.py][INFO] Epoch:[0/2](835200/4588595) loss:3.277 lr:0.0000100 epoch_Time:23890.0min: [2024-01-06 10:15:37,744][model8_pretrain.py][INFO] 
Epoch:[0/2](835200/4588595) loss:2.837 lr:0.0000100 epoch_Time:23890.0min: [2024-01-06 10:15:37,745][model8_pretrain.py][INFO] Epoch:[0/2](835200/4588595) loss:2.943 lr:0.0000100 epoch_Time:23890.0min: [2024-01-06 10:15:37,745][model8_pretrain.py][INFO] Epoch:[0/2](835200/4588595) loss:2.502 lr:0.0000100 epoch_Time:23890.0min: [2024-01-06 10:16:14,673][model8_pretrain.py][INFO] Epoch:[0/2](835300/4588595) loss:3.048 lr:0.0000100 epoch_Time:23889.0min: [2024-01-06 10:16:14,673][model8_pretrain.py][INFO] Epoch:[0/2](835300/4588595) loss:2.555 lr:0.0000100 epoch_Time:23889.0min: [2024-01-06 10:16:14,673][model8_pretrain.py][INFO] Epoch:[0/2](835300/4588595) loss:2.754 lr:0.0000100 epoch_Time:23889.0min: [2024-01-06 10:16:14,673][model8_pretrain.py][INFO] Epoch:[0/2](835300/4588595) loss:3.353 lr:0.0000100 epoch_Time:23889.0min: [2024-01-06 10:16:14,673][model8_pretrain.py][INFO] Epoch:[0/2](835300/4588595) loss:2.820 lr:0.0000100 epoch_Time:23889.0min: [2024-01-06 10:16:14,673][model8_pretrain.py][INFO] Epoch:[0/2](835300/4588595) loss:2.767 lr:0.0000100 epoch_Time:23889.0min: [2024-01-06 10:16:14,673][model8_pretrain.py][INFO] Epoch:[0/2](835300/4588595) loss:2.806 lr:0.0000100 epoch_Time:23889.0min: [2024-01-06 10:16:14,674][model8_pretrain.py][INFO] Epoch:[0/2](835300/4588595) loss:2.780 lr:0.0000100 epoch_Time:23889.0min: [2024-01-06 10:16:51,614][model8_pretrain.py][INFO] Epoch:[0/2](835400/4588595) loss:2.188 lr:0.0000100 epoch_Time:23887.0min: [2024-01-06 10:16:51,614][model8_pretrain.py][INFO] Epoch:[0/2](835400/4588595) loss:2.535 lr:0.0000100 epoch_Time:23887.0min: [2024-01-06 10:16:51,614][model8_pretrain.py][INFO] Epoch:[0/2](835400/4588595) loss:2.194 lr:0.0000100 epoch_Time:23887.0min: [2024-01-06 10:16:51,614][model8_pretrain.py][INFO] Epoch:[0/2](835400/4588595) loss:2.671 lr:0.0000100 epoch_Time:23887.0min: [2024-01-06 10:16:51,614][model8_pretrain.py][INFO] Epoch:[0/2](835400/4588595) loss:2.703 lr:0.0000100 epoch_Time:23887.0min: [2024-01-06 
10:16:51,614][model8_pretrain.py][INFO] Epoch:[0/2](835400/4588595) loss:2.573 lr:0.0000100 epoch_Time:23887.0min: [2024-01-06 10:16:51,614][model8_pretrain.py][INFO] Epoch:[0/2](835400/4588595) loss:3.234 lr:0.0000100 epoch_Time:23887.0min: [2024-01-06 10:16:51,614][model8_pretrain.py][INFO] Epoch:[0/2](835400/4588595) loss:3.190 lr:0.0000100 epoch_Time:23887.0min: [2024-01-06 10:17:28,556][model8_pretrain.py][INFO] Epoch:[0/2](835500/4588595) loss:3.043 lr:0.0000100 epoch_Time:23887.0min: [2024-01-06 10:17:28,557][model8_pretrain.py][INFO] Epoch:[0/2](835500/4588595) loss:2.744 lr:0.0000100 epoch_Time:23887.0min: [2024-01-06 10:17:28,557][model8_pretrain.py][INFO] Epoch:[0/2](835500/4588595) loss:2.886 lr:0.0000100 epoch_Time:23887.0min: [2024-01-06 10:17:28,557][model8_pretrain.py][INFO] Epoch:[0/2](835500/4588595) loss:2.839 lr:0.0000100 epoch_Time:23887.0min: [2024-01-06 10:17:28,557][model8_pretrain.py][INFO] Epoch:[0/2](835500/4588595) loss:2.549 lr:0.0000100 epoch_Time:23887.0min: [2024-01-06 10:17:28,557][model8_pretrain.py][INFO] Epoch:[0/2](835500/4588595) loss:2.846 lr:0.0000100 epoch_Time:23887.0min: [2024-01-06 10:17:28,557][model8_pretrain.py][INFO] Epoch:[0/2](835500/4588595) loss:2.844 lr:0.0000100 epoch_Time:23887.0min: [2024-01-06 10:17:28,558][model8_pretrain.py][INFO] Epoch:[0/2](835500/4588595) loss:2.871 lr:0.0000100 epoch_Time:23887.0min: [2024-01-06 10:18:05,514][model8_pretrain.py][INFO] Epoch:[0/2](835600/4588595) loss:2.187 lr:0.0000100 epoch_Time:23886.0min: [2024-01-06 10:18:05,514][model8_pretrain.py][INFO] Epoch:[0/2](835600/4588595) loss:3.310 lr:0.0000100 epoch_Time:23886.0min: [2024-01-06 10:18:05,514][model8_pretrain.py][INFO] Epoch:[0/2](835600/4588595) loss:2.916 lr:0.0000100 epoch_Time:23886.0min: [2024-01-06 10:18:05,514][model8_pretrain.py][INFO] Epoch:[0/2](835600/4588595) loss:2.742 lr:0.0000100 epoch_Time:23886.0min: [2024-01-06 10:18:05,514][model8_pretrain.py][INFO] Epoch:[0/2](835600/4588595) loss:2.769 lr:0.0000100 
epoch_Time:23886.0min: [2024-01-06 10:18:05,514][model8_pretrain.py][INFO] Epoch:[0/2](835600/4588595) loss:2.806 lr:0.0000100 epoch_Time:23886.0min: [2024-01-06 10:18:05,514][model8_pretrain.py][INFO] Epoch:[0/2](835600/4588595) loss:2.912 lr:0.0000100 epoch_Time:23886.0min: [2024-01-06 10:18:05,515][model8_pretrain.py][INFO] Epoch:[0/2](835600/4588595) loss:1.814 lr:0.0000100 epoch_Time:23886.0min: [2024-01-06 10:18:49,446][model8_pretrain.py][INFO] Epoch:[0/2](835700/4588595) loss:3.046 lr:0.0000100 epoch_Time:23886.0min: [2024-01-06 10:18:49,446][model8_pretrain.py][INFO] Epoch:[0/2](835700/4588595) loss:3.116 lr:0.0000100 epoch_Time:23886.0min: [2024-01-06 10:18:49,451][model8_pretrain.py][INFO] Epoch:[0/2](835700/4588595) loss:2.856 lr:0.0000100 epoch_Time:23886.0min: [2024-01-06 10:18:49,451][model8_pretrain.py][INFO] Epoch:[0/2](835700/4588595) loss:2.157 lr:0.0000100 epoch_Time:23886.0min: [2024-01-06 10:18:49,451][model8_pretrain.py][INFO] Epoch:[0/2](835700/4588595) loss:2.791 lr:0.0000100 epoch_Time:23886.0min: [2024-01-06 10:18:49,451][model8_pretrain.py][INFO] Epoch:[0/2](835700/4588595) loss:2.734 lr:0.0000100 epoch_Time:23886.0min: [2024-01-06 10:18:49,451][model8_pretrain.py][INFO] Epoch:[0/2](835700/4588595) loss:2.853 lr:0.0000100 epoch_Time:23886.0min: [2024-01-06 10:18:49,451][model8_pretrain.py][INFO] Epoch:[0/2](835700/4588595) loss:2.956 lr:0.0000100 epoch_Time:23886.0min: [2024-01-06 10:19:29,638][model8_pretrain.py][INFO] Epoch:[0/2](835800/4588595) loss:2.497 lr:0.0000100 epoch_Time:23886.0min: [2024-01-06 10:19:29,638][model8_pretrain.py][INFO] Epoch:[0/2](835800/4588595) loss:2.324 lr:0.0000100 epoch_Time:23886.0min: [2024-01-06 10:19:29,638][model8_pretrain.py][INFO] Epoch:[0/2](835800/4588595) loss:2.534 lr:0.0000100 epoch_Time:23886.0min: [2024-01-06 10:19:29,638][model8_pretrain.py][INFO] Epoch:[0/2](835800/4588595) loss:2.946 lr:0.0000100 epoch_Time:23886.0min: [2024-01-06 10:19:29,638][model8_pretrain.py][INFO] 
Epoch:[0/2](835800/4588595) loss:2.723 lr:0.0000100 epoch_Time:23886.0min: [2024-01-06 10:19:29,638][model8_pretrain.py][INFO] Epoch:[0/2](835800/4588595) loss:3.150 lr:0.0000100 epoch_Time:23886.0min: [2024-01-06 10:19:29,638][model8_pretrain.py][INFO] Epoch:[0/2](835800/4588595) loss:2.534 lr:0.0000100 epoch_Time:23886.0min: [2024-01-06 10:19:29,639][model8_pretrain.py][INFO] Epoch:[0/2](835800/4588595) loss:2.394 lr:0.0000100 epoch_Time:23886.0min: [2024-01-06 10:20:06,551][model8_pretrain.py][INFO] Epoch:[0/2](835900/4588595) loss:2.374 lr:0.0000100 epoch_Time:23885.0min: [2024-01-06 10:20:06,551][model8_pretrain.py][INFO] Epoch:[0/2](835900/4588595) loss:2.624 lr:0.0000100 epoch_Time:23885.0min: [2024-01-06 10:20:06,551][model8_pretrain.py][INFO] Epoch:[0/2](835900/4588595) loss:2.903 lr:0.0000100 epoch_Time:23885.0min: [2024-01-06 10:20:06,551][model8_pretrain.py][INFO] Epoch:[0/2](835900/4588595) loss:3.101 lr:0.0000100 epoch_Time:23885.0min: [2024-01-06 10:20:06,551][model8_pretrain.py][INFO] Epoch:[0/2](835900/4588595) loss:2.760 lr:0.0000100 epoch_Time:23885.0min: [2024-01-06 10:20:06,551][model8_pretrain.py][INFO] Epoch:[0/2](835900/4588595) loss:3.010 lr:0.0000100 epoch_Time:23885.0min: [2024-01-06 10:20:06,551][model8_pretrain.py][INFO] Epoch:[0/2](835900/4588595) loss:3.034 lr:0.0000100 epoch_Time:23885.0min: [2024-01-06 10:20:06,552][model8_pretrain.py][INFO] Epoch:[0/2](835900/4588595) loss:2.588 lr:0.0000100 epoch_Time:23885.0min: [2024-01-06 10:20:43,493][model8_pretrain.py][INFO] Epoch:[0/2](836000/4588595) loss:3.255 lr:0.0000100 epoch_Time:23885.0min: [2024-01-06 10:20:43,493][model8_pretrain.py][INFO] Epoch:[0/2](836000/4588595) loss:2.421 lr:0.0000100 epoch_Time:23885.0min: [2024-01-06 10:20:43,493][model8_pretrain.py][INFO] Epoch:[0/2](836000/4588595) loss:2.768 lr:0.0000100 epoch_Time:23885.0min: [2024-01-06 10:20:43,493][model8_pretrain.py][INFO] Epoch:[0/2](836000/4588595) loss:2.603 lr:0.0000100 epoch_Time:23885.0min: [2024-01-06 
10:20:43,493][model8_pretrain.py][INFO] Epoch:[0/2](836000/4588595) loss:2.917 lr:0.0000100 epoch_Time:23885.0min: [2024-01-06 10:20:43,493][model8_pretrain.py][INFO] Epoch:[0/2](836000/4588595) loss:3.403 lr:0.0000100 epoch_Time:23885.0min: [2024-01-06 10:20:43,493][model8_pretrain.py][INFO] Epoch:[0/2](836000/4588595) loss:2.850 lr:0.0000100 epoch_Time:23885.0min: [2024-01-06 10:20:43,493][model8_pretrain.py][INFO] Epoch:[0/2](836000/4588595) loss:2.757 lr:0.0000100 epoch_Time:23885.0min: [2024-01-06 10:21:20,434][model8_pretrain.py][INFO] Epoch:[0/2](836100/4588595) loss:2.777 lr:0.0000100 epoch_Time:23884.0min: [2024-01-06 10:21:20,434][model8_pretrain.py][INFO] Epoch:[0/2](836100/4588595) loss:2.559 lr:0.0000100 epoch_Time:23884.0min: [2024-01-06 10:21:20,434][model8_pretrain.py][INFO] Epoch:[0/2](836100/4588595) loss:3.285 lr:0.0000100 epoch_Time:23884.0min: [2024-01-06 10:21:20,434][model8_pretrain.py][INFO] Epoch:[0/2](836100/4588595) loss:2.904 lr:0.0000100 epoch_Time:23884.0min: [2024-01-06 10:21:20,434][model8_pretrain.py][INFO] Epoch:[0/2](836100/4588595) loss:2.795 lr:0.0000100 epoch_Time:23884.0min: [2024-01-06 10:21:20,434][model8_pretrain.py][INFO] Epoch:[0/2](836100/4588595) loss:2.936 lr:0.0000100 epoch_Time:23884.0min: [2024-01-06 10:21:20,434][model8_pretrain.py][INFO] Epoch:[0/2](836100/4588595) loss:2.857 lr:0.0000100 epoch_Time:23884.0min: [2024-01-06 10:21:20,434][model8_pretrain.py][INFO] Epoch:[0/2](836100/4588595) loss:2.786 lr:0.0000100 epoch_Time:23884.0min: [2024-01-06 10:21:57,372][model8_pretrain.py][INFO] Epoch:[0/2](836200/4588595) loss:3.037 lr:0.0000100 epoch_Time:23883.0min: [2024-01-06 10:21:57,373][model8_pretrain.py][INFO] Epoch:[0/2](836200/4588595) loss:3.291 lr:0.0000100 epoch_Time:23882.0min: [2024-01-06 10:21:57,373][model8_pretrain.py][INFO] Epoch:[0/2](836200/4588595) loss:2.654 lr:0.0000100 epoch_Time:23883.0min: [2024-01-06 10:21:57,373][model8_pretrain.py][INFO] Epoch:[0/2](836200/4588595) loss:2.950 lr:0.0000100 
epoch_Time:23882.0min: [2024-01-06 10:21:57,373][model8_pretrain.py][INFO] Epoch:[0/2](836200/4588595) loss:2.825 lr:0.0000100 epoch_Time:23882.0min: [2024-01-06 10:21:57,373][model8_pretrain.py][INFO] Epoch:[0/2](836200/4588595) loss:3.004 lr:0.0000100 epoch_Time:23882.0min: [2024-01-06 10:21:57,373][model8_pretrain.py][INFO] Epoch:[0/2](836200/4588595) loss:2.966 lr:0.0000100 epoch_Time:23882.0min: [2024-01-06 10:21:57,373][model8_pretrain.py][INFO] Epoch:[0/2](836200/4588595) loss:2.939 lr:0.0000100 epoch_Time:23882.0min: [2024-01-06 10:22:34,322][model8_pretrain.py][INFO] Epoch:[0/2](836300/4588595) loss:3.267 lr:0.0000100 epoch_Time:23882.0min: [2024-01-06 10:22:34,322][model8_pretrain.py][INFO] Epoch:[0/2](836300/4588595) loss:3.271 lr:0.0000100 epoch_Time:23882.0min: [2024-01-06 10:22:34,322][model8_pretrain.py][INFO] Epoch:[0/2](836300/4588595) loss:2.756 lr:0.0000100 epoch_Time:23882.0min: [2024-01-06 10:22:34,322][model8_pretrain.py][INFO] Epoch:[0/2](836300/4588595) loss:3.058 lr:0.0000100 epoch_Time:23882.0min: [2024-01-06 10:22:34,322][model8_pretrain.py][INFO] Epoch:[0/2](836300/4588595) loss:2.992 lr:0.0000100 epoch_Time:23882.0min: [2024-01-06 10:22:34,322][model8_pretrain.py][INFO] Epoch:[0/2](836300/4588595) loss:2.479 lr:0.0000100 epoch_Time:23882.0min: [2024-01-06 10:22:34,322][model8_pretrain.py][INFO] Epoch:[0/2](836300/4588595) loss:2.653 lr:0.0000100 epoch_Time:23882.0min: [2024-01-06 10:22:34,322][model8_pretrain.py][INFO] Epoch:[0/2](836300/4588595) loss:2.840 lr:0.0000100 epoch_Time:23882.0min: [2024-01-06 10:23:11,268][model8_pretrain.py][INFO] Epoch:[0/2](836400/4588595) loss:2.751 lr:0.0000100 epoch_Time:23881.0min: [2024-01-06 10:23:11,268][model8_pretrain.py][INFO] Epoch:[0/2](836400/4588595) loss:2.464 lr:0.0000100 epoch_Time:23881.0min: [2024-01-06 10:23:11,268][model8_pretrain.py][INFO] Epoch:[0/2](836400/4588595) loss:3.198 lr:0.0000100 epoch_Time:23881.0min: [2024-01-06 10:23:11,268][model8_pretrain.py][INFO] 
Epoch:[0/2](836400/4588595) loss:3.015 lr:0.0000100 epoch_Time:23881.0min: [2024-01-06 10:23:11,268][model8_pretrain.py][INFO] Epoch:[0/2](836400/4588595) loss:2.830 lr:0.0000100 epoch_Time:23881.0min: [2024-01-06 10:23:11,268][model8_pretrain.py][INFO] Epoch:[0/2](836400/4588595) loss:3.159 lr:0.0000100 epoch_Time:23881.0min: [2024-01-06 10:23:11,268][model8_pretrain.py][INFO] Epoch:[0/2](836400/4588595) loss:2.996 lr:0.0000100 epoch_Time:23881.0min: [2024-01-06 10:23:11,269][model8_pretrain.py][INFO] Epoch:[0/2](836400/4588595) loss:2.666 lr:0.0000100 epoch_Time:23881.0min: [2024-01-06 10:23:53,379][model8_pretrain.py][INFO] Epoch:[0/2](836500/4588595) loss:3.362 lr:0.0000100 epoch_Time:23881.0min: [2024-01-06 10:23:53,379][model8_pretrain.py][INFO] Epoch:[0/2](836500/4588595) loss:2.749 lr:0.0000100 epoch_Time:23881.0min: [2024-01-06 10:23:53,379][model8_pretrain.py][INFO] Epoch:[0/2](836500/4588595) loss:3.054 lr:0.0000100 epoch_Time:23881.0min: [2024-01-06 10:23:53,379][model8_pretrain.py][INFO] Epoch:[0/2](836500/4588595) loss:3.031 lr:0.0000100 epoch_Time:23881.0min: [2024-01-06 10:23:53,379][model8_pretrain.py][INFO] Epoch:[0/2](836500/4588595) loss:2.834 lr:0.0000100 epoch_Time:23881.0min: [2024-01-06 10:23:53,379][model8_pretrain.py][INFO] Epoch:[0/2](836500/4588595) loss:2.960 lr:0.0000100 epoch_Time:23881.0min: [2024-01-06 10:23:53,379][model8_pretrain.py][INFO] Epoch:[0/2](836500/4588595) loss:2.299 lr:0.0000100 epoch_Time:23881.0min: [2024-01-06 10:23:53,379][model8_pretrain.py][INFO] Epoch:[0/2](836500/4588595) loss:3.393 lr:0.0000100 epoch_Time:23881.0min: [2024-01-06 10:24:35,225][model8_pretrain.py][INFO] Epoch:[0/2](836600/4588595) loss:2.591 lr:0.0000100 epoch_Time:23881.0min: [2024-01-06 10:24:35,225][model8_pretrain.py][INFO] Epoch:[0/2](836600/4588595) loss:2.667 lr:0.0000100 epoch_Time:23881.0min: [2024-01-06 10:24:35,225][model8_pretrain.py][INFO] Epoch:[0/2](836600/4588595) loss:3.045 lr:0.0000100 epoch_Time:23881.0min: [2024-01-06 
10:24:35,225][model8_pretrain.py][INFO] Epoch:[0/2](836600/4588595) loss:2.503 lr:0.0000100 epoch_Time:23881.0min: [2024-01-06 10:24:35,225][model8_pretrain.py][INFO] Epoch:[0/2](836600/4588595) loss:2.710 lr:0.0000100 epoch_Time:23881.0min: [2024-01-06 10:24:35,225][model8_pretrain.py][INFO] Epoch:[0/2](836600/4588595) loss:2.166 lr:0.0000100 epoch_Time:23881.0min: [2024-01-06 10:24:35,225][model8_pretrain.py][INFO] Epoch:[0/2](836600/4588595) loss:2.839 lr:0.0000100 epoch_Time:23881.0min: [2024-01-06 10:24:35,225][model8_pretrain.py][INFO] Epoch:[0/2](836600/4588595) loss:3.156 lr:0.0000100 epoch_Time:23881.0min: [2024-01-06 10:25:12,171][model8_pretrain.py][INFO] Epoch:[0/2](836700/4588595) loss:2.718 lr:0.0000100 epoch_Time:23880.0min: [2024-01-06 10:25:12,171][model8_pretrain.py][INFO] Epoch:[0/2](836700/4588595) loss:2.640 lr:0.0000100 epoch_Time:23880.0min: [2024-01-06 10:25:12,171][model8_pretrain.py][INFO] Epoch:[0/2](836700/4588595) loss:2.806 lr:0.0000100 epoch_Time:23880.0min: [2024-01-06 10:25:12,171][model8_pretrain.py][INFO] Epoch:[0/2](836700/4588595) loss:2.784 lr:0.0000100 epoch_Time:23880.0min: [2024-01-06 10:25:12,171][model8_pretrain.py][INFO] Epoch:[0/2](836700/4588595) loss:3.238 lr:0.0000100 epoch_Time:23880.0min: [2024-01-06 10:25:12,171][model8_pretrain.py][INFO] Epoch:[0/2](836700/4588595) loss:2.823 lr:0.0000100 epoch_Time:23880.0min: [2024-01-06 10:25:12,171][model8_pretrain.py][INFO] Epoch:[0/2](836700/4588595) loss:3.268 lr:0.0000100 epoch_Time:23880.0min: [2024-01-06 10:25:12,172][model8_pretrain.py][INFO] Epoch:[0/2](836700/4588595) loss:2.643 lr:0.0000100 epoch_Time:23880.0min: [2024-01-06 10:25:49,116][model8_pretrain.py][INFO] Epoch:[0/2](836800/4588595) loss:3.035 lr:0.0000100 epoch_Time:23879.0min: [2024-01-06 10:25:49,116][model8_pretrain.py][INFO] Epoch:[0/2](836800/4588595) loss:2.394 lr:0.0000100 epoch_Time:23879.0min: [2024-01-06 10:25:49,116][model8_pretrain.py][INFO] Epoch:[0/2](836800/4588595) loss:2.919 lr:0.0000100 
epoch_Time:23879.0min: [2024-01-06 10:25:49,116][model8_pretrain.py][INFO] Epoch:[0/2](836800/4588595) loss:2.595 lr:0.0000100 epoch_Time:23879.0min: [2024-01-06 10:25:49,116][model8_pretrain.py][INFO] Epoch:[0/2](836800/4588595) loss:3.166 lr:0.0000100 epoch_Time:23879.0min: [2024-01-06 10:25:49,117][model8_pretrain.py][INFO] Epoch:[0/2](836800/4588595) loss:2.174 lr:0.0000100 epoch_Time:23879.0min: [2024-01-06 10:25:49,117][model8_pretrain.py][INFO] Epoch:[0/2](836800/4588595) loss:2.757 lr:0.0000100 epoch_Time:23879.0min: [2024-01-06 10:25:49,117][model8_pretrain.py][INFO] Epoch:[0/2](836800/4588595) loss:2.488 lr:0.0000100 epoch_Time:23879.0min: [2024-01-06 10:26:26,051][model8_pretrain.py][INFO] Epoch:[0/2](836900/4588595) loss:2.852 lr:0.0000100 epoch_Time:23879.0min: [2024-01-06 10:26:26,051][model8_pretrain.py][INFO] Epoch:[0/2](836900/4588595) loss:2.796 lr:0.0000100 epoch_Time:23879.0min: [2024-01-06 10:26:26,051][model8_pretrain.py][INFO] Epoch:[0/2](836900/4588595) loss:2.574 lr:0.0000100 epoch_Time:23879.0min: [2024-01-06 10:26:26,051][model8_pretrain.py][INFO] Epoch:[0/2](836900/4588595) loss:3.013 lr:0.0000100 epoch_Time:23879.0min: [2024-01-06 10:26:26,051][model8_pretrain.py][INFO] Epoch:[0/2](836900/4588595) loss:3.640 lr:0.0000100 epoch_Time:23879.0min: [2024-01-06 10:26:26,051][model8_pretrain.py][INFO] Epoch:[0/2](836900/4588595) loss:2.864 lr:0.0000100 epoch_Time:23879.0min: [2024-01-06 10:26:26,051][model8_pretrain.py][INFO] Epoch:[0/2](836900/4588595) loss:2.915 lr:0.0000100 epoch_Time:23879.0min: [2024-01-06 10:26:26,051][model8_pretrain.py][INFO] Epoch:[0/2](836900/4588595) loss:2.952 lr:0.0000100 epoch_Time:23879.0min: [2024-01-06 10:27:02,991][model8_pretrain.py][INFO] Epoch:[0/2](837000/4588595) loss:2.780 lr:0.0000100 epoch_Time:23878.0min: [2024-01-06 10:27:02,991][model8_pretrain.py][INFO] Epoch:[0/2](837000/4588595) loss:2.045 lr:0.0000100 epoch_Time:23878.0min: [2024-01-06 10:27:02,991][model8_pretrain.py][INFO] 
Epoch:[0/2](837000/4588595) loss:2.650 lr:0.0000100 epoch_Time:23878.0min: [2024-01-06 10:27:02,991][model8_pretrain.py][INFO] Epoch:[0/2](837000/4588595) loss:3.019 lr:0.0000100 epoch_Time:23878.0min: [2024-01-06 10:27:02,991][model8_pretrain.py][INFO] Epoch:[0/2](837000/4588595) loss:3.282 lr:0.0000100 epoch_Time:23878.0min: [2024-01-06 10:27:02,991][model8_pretrain.py][INFO] Epoch:[0/2](837000/4588595) loss:2.484 lr:0.0000100 epoch_Time:23878.0min: [2024-01-06 10:27:02,991][model8_pretrain.py][INFO] Epoch:[0/2](837000/4588595) loss:2.851 lr:0.0000100 epoch_Time:23878.0min: [2024-01-06 10:27:02,991][model8_pretrain.py][INFO] Epoch:[0/2](837000/4588595) loss:2.779 lr:0.0000100 epoch_Time:23878.0min: [2024-01-06 10:27:39,937][model8_pretrain.py][INFO] Epoch:[0/2](837100/4588595) loss:2.443 lr:0.0000100 epoch_Time:23877.0min: [2024-01-06 10:27:39,937][model8_pretrain.py][INFO] Epoch:[0/2](837100/4588595) loss:2.920 lr:0.0000100 epoch_Time:23877.0min: [2024-01-06 10:27:39,937][model8_pretrain.py][INFO] Epoch:[0/2](837100/4588595) loss:2.537 lr:0.0000100 epoch_Time:23877.0min: [2024-01-06 10:27:39,937][model8_pretrain.py][INFO] Epoch:[0/2](837100/4588595) loss:2.382 lr:0.0000100 epoch_Time:23877.0min: [2024-01-06 10:27:39,937][model8_pretrain.py][INFO] Epoch:[0/2](837100/4588595) loss:2.759 lr:0.0000100 epoch_Time:23877.0min: [2024-01-06 10:27:39,937][model8_pretrain.py][INFO] Epoch:[0/2](837100/4588595) loss:2.894 lr:0.0000100 epoch_Time:23877.0min: [2024-01-06 10:27:39,937][model8_pretrain.py][INFO] Epoch:[0/2](837100/4588595) loss:2.989 lr:0.0000100 epoch_Time:23877.0min: [2024-01-06 10:27:39,937][model8_pretrain.py][INFO] Epoch:[0/2](837100/4588595) loss:2.967 lr:0.0000100 epoch_Time:23877.0min: [2024-01-06 10:28:16,890][model8_pretrain.py][INFO] Epoch:[0/2](837200/4588595) loss:3.053 lr:0.0000100 epoch_Time:23876.0min: [2024-01-06 10:28:16,890][model8_pretrain.py][INFO] Epoch:[0/2](837200/4588595) loss:2.475 lr:0.0000100 epoch_Time:23876.0min: [2024-01-06 
10:28:16,890][model8_pretrain.py][INFO] Epoch:[0/2](837200/4588595) loss:3.151 lr:0.0000100 epoch_Time:23876.0min: [2024-01-06 10:28:16,890][model8_pretrain.py][INFO] Epoch:[0/2](837200/4588595) loss:2.844 lr:0.0000100 epoch_Time:23876.0min: [2024-01-06 10:28:16,890][model8_pretrain.py][INFO] Epoch:[0/2](837200/4588595) loss:2.637 lr:0.0000100 epoch_Time:23876.0min: [2024-01-06 10:28:16,890][model8_pretrain.py][INFO] Epoch:[0/2](837200/4588595) loss:2.558 lr:0.0000100 epoch_Time:23876.0min: [2024-01-06 10:28:16,890][model8_pretrain.py][INFO] Epoch:[0/2](837200/4588595) loss:3.020 lr:0.0000100 epoch_Time:23876.0min: [2024-01-06 10:28:16,890][model8_pretrain.py][INFO] Epoch:[0/2](837200/4588595) loss:2.295 lr:0.0000100 epoch_Time:23876.0min: [2024-01-06 10:28:59,041][model8_pretrain.py][INFO] Epoch:[0/2](837300/4588595) loss:2.309 lr:0.0000100 epoch_Time:23876.0min: [2024-01-06 10:28:59,041][model8_pretrain.py][INFO] Epoch:[0/2](837300/4588595) loss:2.473 lr:0.0000100 epoch_Time:23876.0min: [2024-01-06 10:28:59,041][model8_pretrain.py][INFO] Epoch:[0/2](837300/4588595) loss:1.614 lr:0.0000100 epoch_Time:23876.0min: [2024-01-06 10:28:59,041][model8_pretrain.py][INFO] Epoch:[0/2](837300/4588595) loss:2.731 lr:0.0000100 epoch_Time:23876.0min: [2024-01-06 10:28:59,041][model8_pretrain.py][INFO] Epoch:[0/2](837300/4588595) loss:3.040 lr:0.0000100 epoch_Time:23876.0min: [2024-01-06 10:28:59,041][model8_pretrain.py][INFO] Epoch:[0/2](837300/4588595) loss:3.043 lr:0.0000100 epoch_Time:23876.0min: [2024-01-06 10:28:59,041][model8_pretrain.py][INFO] Epoch:[0/2](837300/4588595) loss:2.719 lr:0.0000100 epoch_Time:23876.0min: [2024-01-06 10:28:59,041][model8_pretrain.py][INFO] Epoch:[0/2](837300/4588595) loss:2.984 lr:0.0000100 epoch_Time:23876.0min: [2024-01-06 10:29:40,991][model8_pretrain.py][INFO] Epoch:[0/2](837400/4588595) loss:2.881 lr:0.0000100 epoch_Time:23876.0min: [2024-01-06 10:29:40,992][model8_pretrain.py][INFO] Epoch:[0/2](837400/4588595) loss:2.732 lr:0.0000100 
epoch_Time:23876.0min: [2024-01-06 10:29:40,992][model8_pretrain.py][INFO] Epoch:[0/2](837400/4588595) loss:2.897 lr:0.0000100 epoch_Time:23876.0min: [2024-01-06 10:29:40,992][model8_pretrain.py][INFO] Epoch:[0/2](837400/4588595) loss:3.289 lr:0.0000100 epoch_Time:23876.0min: [2024-01-06 10:29:40,992][model8_pretrain.py][INFO] Epoch:[0/2](837400/4588595) loss:3.084 lr:0.0000100 epoch_Time:23876.0min: [2024-01-06 10:29:40,992][model8_pretrain.py][INFO] Epoch:[0/2](837400/4588595) loss:2.235 lr:0.0000100 epoch_Time:23876.0min: [2024-01-06 10:29:40,992][model8_pretrain.py][INFO] Epoch:[0/2](837400/4588595) loss:2.796 lr:0.0000100 epoch_Time:23876.0min: [2024-01-06 10:29:40,992][model8_pretrain.py][INFO] Epoch:[0/2](837400/4588595) loss:3.100 lr:0.0000100 epoch_Time:23876.0min: [2024-01-06 10:30:17,914][model8_pretrain.py][INFO] Epoch:[0/2](837500/4588595) loss:2.715 lr:0.0000100 epoch_Time:23875.0min: [2024-01-06 10:30:17,914][model8_pretrain.py][INFO] Epoch:[0/2](837500/4588595) loss:2.725 lr:0.0000100 epoch_Time:23875.0min: [2024-01-06 10:30:17,914][model8_pretrain.py][INFO] Epoch:[0/2](837500/4588595) loss:2.617 lr:0.0000100 epoch_Time:23875.0min: [2024-01-06 10:30:17,914][model8_pretrain.py][INFO] Epoch:[0/2](837500/4588595) loss:2.728 lr:0.0000100 epoch_Time:23875.0min: [2024-01-06 10:30:17,914][model8_pretrain.py][INFO] Epoch:[0/2](837500/4588595) loss:3.048 lr:0.0000100 epoch_Time:23875.0min: [2024-01-06 10:30:17,914][model8_pretrain.py][INFO] Epoch:[0/2](837500/4588595) loss:3.062 lr:0.0000100 epoch_Time:23875.0min: [2024-01-06 10:30:17,914][model8_pretrain.py][INFO] Epoch:[0/2](837500/4588595) loss:2.887 lr:0.0000100 epoch_Time:23875.0min: [2024-01-06 10:30:17,914][model8_pretrain.py][INFO] Epoch:[0/2](837500/4588595) loss:2.873 lr:0.0000100 epoch_Time:23875.0min: [2024-01-06 10:30:54,887][model8_pretrain.py][INFO] Epoch:[0/2](837600/4588595) loss:2.392 lr:0.0000100 epoch_Time:23874.0min: [2024-01-06 10:30:54,887][model8_pretrain.py][INFO] 
Epoch:[0/2](837600/4588595) loss:2.601 lr:0.0000100 epoch_Time:23874.0min: [2024-01-06 10:30:54,888][model8_pretrain.py][INFO] Epoch:[0/2](837600/4588595) loss:2.818 lr:0.0000100 epoch_Time:23874.0min: [2024-01-06 10:30:54,887][model8_pretrain.py][INFO] Epoch:[0/2](837600/4588595) loss:2.716 lr:0.0000100 epoch_Time:23874.0min: [2024-01-06 10:30:54,887][model8_pretrain.py][INFO] Epoch:[0/2](837600/4588595) loss:2.785 lr:0.0000100 epoch_Time:23874.0min: [2024-01-06 10:30:54,887][model8_pretrain.py][INFO] Epoch:[0/2](837600/4588595) loss:3.186 lr:0.0000100 epoch_Time:23874.0min: [2024-01-06 10:30:54,888][model8_pretrain.py][INFO] Epoch:[0/2](837600/4588595) loss:2.792 lr:0.0000100 epoch_Time:23874.0min: [2024-01-06 10:30:54,888][model8_pretrain.py][INFO] Epoch:[0/2](837600/4588595) loss:2.966 lr:0.0000100 epoch_Time:23874.0min: [2024-01-06 10:31:31,854][model8_pretrain.py][INFO] Epoch:[0/2](837700/4588595) loss:2.891 lr:0.0000100 epoch_Time:23874.0min: [2024-01-06 10:31:31,854][model8_pretrain.py][INFO] Epoch:[0/2](837700/4588595) loss:3.278 lr:0.0000100 epoch_Time:23874.0min: [2024-01-06 10:31:31,854][model8_pretrain.py][INFO] Epoch:[0/2](837700/4588595) loss:2.318 lr:0.0000100 epoch_Time:23874.0min: [2024-01-06 10:31:31,854][model8_pretrain.py][INFO] Epoch:[0/2](837700/4588595) loss:2.451 lr:0.0000100 epoch_Time:23874.0min: [2024-01-06 10:31:31,854][model8_pretrain.py][INFO] Epoch:[0/2](837700/4588595) loss:3.270 lr:0.0000100 epoch_Time:23874.0min: [2024-01-06 10:31:31,854][model8_pretrain.py][INFO] Epoch:[0/2](837700/4588595) loss:3.211 lr:0.0000100 epoch_Time:23874.0min: [2024-01-06 10:31:31,855][model8_pretrain.py][INFO] Epoch:[0/2](837700/4588595) loss:2.248 lr:0.0000100 epoch_Time:23874.0min: [2024-01-06 10:31:31,855][model8_pretrain.py][INFO] Epoch:[0/2](837700/4588595) loss:2.311 lr:0.0000100 epoch_Time:23874.0min: [2024-01-06 10:32:08,801][model8_pretrain.py][INFO] Epoch:[0/2](837800/4588595) loss:2.868 lr:0.0000100 epoch_Time:23873.0min: [2024-01-06 
10:32:08,801][model8_pretrain.py][INFO] Epoch:[0/2](837800/4588595) loss:3.218 lr:0.0000100 epoch_Time:23873.0min: [2024-01-06 10:32:08,801][model8_pretrain.py][INFO] Epoch:[0/2](837800/4588595) loss:2.665 lr:0.0000100 epoch_Time:23873.0min: [2024-01-06 10:32:08,801][model8_pretrain.py][INFO] Epoch:[0/2](837800/4588595) loss:2.414 lr:0.0000100 epoch_Time:23873.0min: [2024-01-06 10:32:08,801][model8_pretrain.py][INFO] Epoch:[0/2](837800/4588595) loss:2.641 lr:0.0000100 epoch_Time:23873.0min: [2024-01-06 10:32:08,801][model8_pretrain.py][INFO] Epoch:[0/2](837800/4588595) loss:3.202 lr:0.0000100 epoch_Time:23873.0min: [2024-01-06 10:32:08,801][model8_pretrain.py][INFO] Epoch:[0/2](837800/4588595) loss:2.807 lr:0.0000100 epoch_Time:23873.0min: [2024-01-06 10:32:08,801][model8_pretrain.py][INFO] Epoch:[0/2](837800/4588595) loss:3.165 lr:0.0000100 epoch_Time:23873.0min: [2024-01-06 10:32:45,739][model8_pretrain.py][INFO] Epoch:[0/2](837900/4588595) loss:2.626 lr:0.0000100 epoch_Time:23872.0min: [2024-01-06 10:32:45,739][model8_pretrain.py][INFO] Epoch:[0/2](837900/4588595) loss:2.749 lr:0.0000100 epoch_Time:23872.0min: [2024-01-06 10:32:45,739][model8_pretrain.py][INFO] Epoch:[0/2](837900/4588595) loss:2.446 lr:0.0000100 epoch_Time:23872.0min: [2024-01-06 10:32:45,739][model8_pretrain.py][INFO] Epoch:[0/2](837900/4588595) loss:3.156 lr:0.0000100 epoch_Time:23872.0min: [2024-01-06 10:32:45,739][model8_pretrain.py][INFO] Epoch:[0/2](837900/4588595) loss:3.050 lr:0.0000100 epoch_Time:23872.0min: [2024-01-06 10:32:45,739][model8_pretrain.py][INFO] Epoch:[0/2](837900/4588595) loss:2.882 lr:0.0000100 epoch_Time:23872.0min: [2024-01-06 10:32:45,739][model8_pretrain.py][INFO] Epoch:[0/2](837900/4588595) loss:2.509 lr:0.0000100 epoch_Time:23872.0min: [2024-01-06 10:32:45,740][model8_pretrain.py][INFO] Epoch:[0/2](837900/4588595) loss:2.946 lr:0.0000100 epoch_Time:23872.0min: [2024-01-06 10:33:22,683][model8_pretrain.py][INFO] Epoch:[0/2](838000/4588595) loss:2.726 lr:0.0000100 
epoch_Time:23871.0min: [2024-01-06 10:33:22,683][model8_pretrain.py][INFO] Epoch:[0/2](838000/4588595) loss:2.669 lr:0.0000100 epoch_Time:23871.0min: [2024-01-06 10:33:22,683][model8_pretrain.py][INFO] Epoch:[0/2](838000/4588595) loss:3.020 lr:0.0000100 epoch_Time:23871.0min: [2024-01-06 10:33:22,683][model8_pretrain.py][INFO] Epoch:[0/2](838000/4588595) loss:2.658 lr:0.0000100 epoch_Time:23871.0min: [2024-01-06 10:33:22,683][model8_pretrain.py][INFO] Epoch:[0/2](838000/4588595) loss:2.915 lr:0.0000100 epoch_Time:23871.0min: [2024-01-06 10:33:22,683][model8_pretrain.py][INFO] Epoch:[0/2](838000/4588595) loss:3.200 lr:0.0000100 epoch_Time:23871.0min: [2024-01-06 10:33:22,684][model8_pretrain.py][INFO] Epoch:[0/2](838000/4588595) loss:2.267 lr:0.0000100 epoch_Time:23871.0min: [2024-01-06 10:33:22,684][model8_pretrain.py][INFO] Epoch:[0/2](838000/4588595) loss:2.860 lr:0.0000100 epoch_Time:23871.0min: [2024-01-06 10:34:04,900][model8_pretrain.py][INFO] Epoch:[0/2](838100/4588595) loss:2.632 lr:0.0000100 epoch_Time:23871.0min: [2024-01-06 10:34:04,901][model8_pretrain.py][INFO] Epoch:[0/2](838100/4588595) loss:3.027 lr:0.0000100 epoch_Time:23871.0min: [2024-01-06 10:34:04,901][model8_pretrain.py][INFO] Epoch:[0/2](838100/4588595) loss:2.414 lr:0.0000100 epoch_Time:23871.0min: [2024-01-06 10:34:04,905][model8_pretrain.py][INFO] Epoch:[0/2](838100/4588595) loss:2.932 lr:0.0000100 epoch_Time:23871.0min: [2024-01-06 10:34:04,905][model8_pretrain.py][INFO] Epoch:[0/2](838100/4588595) loss:3.135 lr:0.0000100 epoch_Time:23871.0min: [2024-01-06 10:34:04,905][model8_pretrain.py][INFO] Epoch:[0/2](838100/4588595) loss:2.284 lr:0.0000100 epoch_Time:23871.0min: [2024-01-06 10:34:04,905][model8_pretrain.py][INFO] Epoch:[0/2](838100/4588595) loss:2.652 lr:0.0000100 epoch_Time:23871.0min: [2024-01-06 10:34:04,905][model8_pretrain.py][INFO] Epoch:[0/2](838100/4588595) loss:2.362 lr:0.0000100 epoch_Time:23871.0min: [2024-01-06 10:34:46,996][model8_pretrain.py][INFO] 
Epoch:[0/2](838200/4588595) loss:2.803 lr:0.0000100 epoch_Time:23871.0min: [2024-01-06 10:34:46,996][model8_pretrain.py][INFO] Epoch:[0/2](838200/4588595) loss:2.423 lr:0.0000100 epoch_Time:23871.0min: [2024-01-06 10:34:46,996][model8_pretrain.py][INFO] Epoch:[0/2](838200/4588595) loss:2.576 lr:0.0000100 epoch_Time:23871.0min: [2024-01-06 10:34:46,996][model8_pretrain.py][INFO] Epoch:[0/2](838200/4588595) loss:2.473 lr:0.0000100 epoch_Time:23871.0min: [2024-01-06 10:34:46,996][model8_pretrain.py][INFO] Epoch:[0/2](838200/4588595) loss:2.731 lr:0.0000100 epoch_Time:23871.0min: [2024-01-06 10:34:46,996][model8_pretrain.py][INFO] Epoch:[0/2](838200/4588595) loss:3.007 lr:0.0000100 epoch_Time:23871.0min: [2024-01-06 10:34:46,996][model8_pretrain.py][INFO] Epoch:[0/2](838200/4588595) loss:2.276 lr:0.0000100 epoch_Time:23871.0min: [2024-01-06 10:34:46,996][model8_pretrain.py][INFO] Epoch:[0/2](838200/4588595) loss:2.867 lr:0.0000100 epoch_Time:23871.0min: [2024-01-06 10:35:23,939][model8_pretrain.py][INFO] Epoch:[0/2](838300/4588595) loss:3.012 lr:0.0000100 epoch_Time:23870.0min: [2024-01-06 10:35:23,940][model8_pretrain.py][INFO] Epoch:[0/2](838300/4588595) loss:3.025 lr:0.0000100 epoch_Time:23870.0min: [2024-01-06 10:35:23,940][model8_pretrain.py][INFO] Epoch:[0/2](838300/4588595) loss:3.166 lr:0.0000100 epoch_Time:23870.0min: [2024-01-06 10:35:23,940][model8_pretrain.py][INFO] Epoch:[0/2](838300/4588595) loss:3.012 lr:0.0000100 epoch_Time:23870.0min: [2024-01-06 10:35:23,940][model8_pretrain.py][INFO] Epoch:[0/2](838300/4588595) loss:2.995 lr:0.0000100 epoch_Time:23870.0min: [2024-01-06 10:35:23,940][model8_pretrain.py][INFO] Epoch:[0/2](838300/4588595) loss:3.276 lr:0.0000100 epoch_Time:23870.0min: [2024-01-06 10:35:23,940][model8_pretrain.py][INFO] Epoch:[0/2](838300/4588595) loss:2.929 lr:0.0000100 epoch_Time:23870.0min: [2024-01-06 10:35:23,940][model8_pretrain.py][INFO] Epoch:[0/2](838300/4588595) loss:2.634 lr:0.0000100 epoch_Time:23870.0min: [2024-01-06 
10:36:00,877][model8_pretrain.py][INFO] Epoch:[0/2](838400/4588595) loss:2.631 lr:0.0000100 epoch_Time:23869.0min: [2024-01-06 10:36:00,877][model8_pretrain.py][INFO] Epoch:[0/2](838400/4588595) loss:2.789 lr:0.0000100 epoch_Time:23869.0min: [2024-01-06 10:36:00,877][model8_pretrain.py][INFO] Epoch:[0/2](838400/4588595) loss:2.395 lr:0.0000100 epoch_Time:23869.0min: [2024-01-06 10:36:00,877][model8_pretrain.py][INFO] Epoch:[0/2](838400/4588595) loss:2.501 lr:0.0000100 epoch_Time:23869.0min: [2024-01-06 10:36:00,877][model8_pretrain.py][INFO] Epoch:[0/2](838400/4588595) loss:2.725 lr:0.0000100 epoch_Time:23869.0min: [2024-01-06 10:36:00,877][model8_pretrain.py][INFO] Epoch:[0/2](838400/4588595) loss:2.795 lr:0.0000100 epoch_Time:23869.0min: [2024-01-06 10:36:00,877][model8_pretrain.py][INFO] Epoch:[0/2](838400/4588595) loss:2.778 lr:0.0000100 epoch_Time:23869.0min: [2024-01-06 10:36:00,877][model8_pretrain.py][INFO] Epoch:[0/2](838400/4588595) loss:3.118 lr:0.0000100 epoch_Time:23869.0min: [2024-01-06 10:36:37,821][model8_pretrain.py][INFO] Epoch:[0/2](838500/4588595) loss:2.927 lr:0.0000100 epoch_Time:23869.0min: [2024-01-06 10:36:37,821][model8_pretrain.py][INFO] Epoch:[0/2](838500/4588595) loss:3.043 lr:0.0000100 epoch_Time:23869.0min: [2024-01-06 10:36:37,821][model8_pretrain.py][INFO] Epoch:[0/2](838500/4588595) loss:3.192 lr:0.0000100 epoch_Time:23869.0min: [2024-01-06 10:36:37,821][model8_pretrain.py][INFO] Epoch:[0/2](838500/4588595) loss:2.106 lr:0.0000100 epoch_Time:23869.0min: [2024-01-06 10:36:37,821][model8_pretrain.py][INFO] Epoch:[0/2](838500/4588595) loss:2.757 lr:0.0000100 epoch_Time:23869.0min: [2024-01-06 10:36:37,821][model8_pretrain.py][INFO] Epoch:[0/2](838500/4588595) loss:2.683 lr:0.0000100 epoch_Time:23869.0min: [2024-01-06 10:36:37,821][model8_pretrain.py][INFO] Epoch:[0/2](838500/4588595) loss:2.570 lr:0.0000100 epoch_Time:23869.0min: [2024-01-06 10:36:37,821][model8_pretrain.py][INFO] Epoch:[0/2](838500/4588595) loss:3.353 lr:0.0000100 
epoch_Time:23869.0min: [2024-01-06 10:37:14,790][model8_pretrain.py][INFO] Epoch:[0/2](838600/4588595) loss:2.818 lr:0.0000100 epoch_Time:23868.0min: [2024-01-06 10:37:14,790][model8_pretrain.py][INFO] Epoch:[0/2](838600/4588595) loss:2.728 lr:0.0000100 epoch_Time:23868.0min: [2024-01-06 10:37:14,790][model8_pretrain.py][INFO] Epoch:[0/2](838600/4588595) loss:2.999 lr:0.0000100 epoch_Time:23868.0min: [2024-01-06 10:37:14,790][model8_pretrain.py][INFO] Epoch:[0/2](838600/4588595) loss:2.920 lr:0.0000100 epoch_Time:23868.0min: [2024-01-06 10:37:14,790][model8_pretrain.py][INFO] Epoch:[0/2](838600/4588595) loss:2.697 lr:0.0000100 epoch_Time:23868.0min: [2024-01-06 10:37:14,790][model8_pretrain.py][INFO] Epoch:[0/2](838600/4588595) loss:2.892 lr:0.0000100 epoch_Time:23868.0min: [2024-01-06 10:37:14,790][model8_pretrain.py][INFO] Epoch:[0/2](838600/4588595) loss:3.173 lr:0.0000100 epoch_Time:23868.0min: [2024-01-06 10:37:14,791][model8_pretrain.py][INFO] Epoch:[0/2](838600/4588595) loss:2.522 lr:0.0000100 epoch_Time:23868.0min: [2024-01-06 10:37:51,854][model8_pretrain.py][INFO] Epoch:[0/2](838700/4588595) loss:2.782 lr:0.0000100 epoch_Time:23866.0min: [2024-01-06 10:37:51,854][model8_pretrain.py][INFO] Epoch:[0/2](838700/4588595) loss:2.841 lr:0.0000100 epoch_Time:23866.0min: [2024-01-06 10:37:51,854][model8_pretrain.py][INFO] Epoch:[0/2](838700/4588595) loss:3.147 lr:0.0000100 epoch_Time:23866.0min: [2024-01-06 10:37:51,854][model8_pretrain.py][INFO] Epoch:[0/2](838700/4588595) loss:2.531 lr:0.0000100 epoch_Time:23866.0min: [2024-01-06 10:37:51,854][model8_pretrain.py][INFO] Epoch:[0/2](838700/4588595) loss:2.972 lr:0.0000100 epoch_Time:23866.0min: [2024-01-06 10:37:51,854][model8_pretrain.py][INFO] Epoch:[0/2](838700/4588595) loss:2.350 lr:0.0000100 epoch_Time:23866.0min: [2024-01-06 10:37:51,854][model8_pretrain.py][INFO] Epoch:[0/2](838700/4588595) loss:2.575 lr:0.0000100 epoch_Time:23866.0min: [2024-01-06 10:37:51,854][model8_pretrain.py][INFO] 
Epoch:[0/2](838700/4588595) loss:2.858 lr:0.0000100 epoch_Time:23866.0min: [2024-01-06 10:38:28,797][model8_pretrain.py][INFO] Epoch:[0/2](838800/4588595) loss:2.711 lr:0.0000100 epoch_Time:23866.0min: [2024-01-06 10:38:28,797][model8_pretrain.py][INFO] Epoch:[0/2](838800/4588595) loss:2.503 lr:0.0000100 epoch_Time:23866.0min: [2024-01-06 10:38:28,797][model8_pretrain.py][INFO] Epoch:[0/2](838800/4588595) loss:3.080 lr:0.0000100 epoch_Time:23866.0min: [2024-01-06 10:38:28,797][model8_pretrain.py][INFO] Epoch:[0/2](838800/4588595) loss:3.613 lr:0.0000100 epoch_Time:23866.0min: [2024-01-06 10:38:28,797][model8_pretrain.py][INFO] Epoch:[0/2](838800/4588595) loss:2.955 lr:0.0000100 epoch_Time:23866.0min: [2024-01-06 10:38:28,797][model8_pretrain.py][INFO] Epoch:[0/2](838800/4588595) loss:2.182 lr:0.0000100 epoch_Time:23866.0min: [2024-01-06 10:38:28,798][model8_pretrain.py][INFO] Epoch:[0/2](838800/4588595) loss:2.975 lr:0.0000100 epoch_Time:23866.0min: [2024-01-06 10:38:28,798][model8_pretrain.py][INFO] Epoch:[0/2](838800/4588595) loss:2.849 lr:0.0000100 epoch_Time:23866.0min: [2024-01-06 10:39:09,209][model8_pretrain.py][INFO] Epoch:[0/2](838900/4588595) loss:3.002 lr:0.0000100 epoch_Time:23866.0min: [2024-01-06 10:39:09,209][model8_pretrain.py][INFO] Epoch:[0/2](838900/4588595) loss:2.761 lr:0.0000100 epoch_Time:23866.0min: [2024-01-06 10:39:09,209][model8_pretrain.py][INFO] Epoch:[0/2](838900/4588595) loss:2.533 lr:0.0000100 epoch_Time:23866.0min: [2024-01-06 10:39:09,209][model8_pretrain.py][INFO] Epoch:[0/2](838900/4588595) loss:2.687 lr:0.0000100 epoch_Time:23866.0min: [2024-01-06 10:39:09,209][model8_pretrain.py][INFO] Epoch:[0/2](838900/4588595) loss:2.654 lr:0.0000100 epoch_Time:23866.0min: [2024-01-06 10:39:09,209][model8_pretrain.py][INFO] Epoch:[0/2](838900/4588595) loss:3.080 lr:0.0000100 epoch_Time:23866.0min: [2024-01-06 10:39:09,209][model8_pretrain.py][INFO] Epoch:[0/2](838900/4588595) loss:2.735 lr:0.0000100 epoch_Time:23866.0min: [2024-01-06 
10:39:09,209][model8_pretrain.py][INFO] Epoch:[0/2](838900/4588595) loss:3.304 lr:0.0000100 epoch_Time:23866.0min: [2024-01-06 10:39:53,177][model8_pretrain.py][INFO] Epoch:[0/2](839000/4588595) loss:2.678 lr:0.0000100 epoch_Time:23865.0min: [2024-01-06 10:39:53,177][model8_pretrain.py][INFO] Epoch:[0/2](839000/4588595) loss:3.174 lr:0.0000100 epoch_Time:23865.0min: [2024-01-06 10:39:53,177][model8_pretrain.py][INFO] Epoch:[0/2](839000/4588595) loss:2.975 lr:0.0000100 epoch_Time:23865.0min: [2024-01-06 10:39:53,177][model8_pretrain.py][INFO] Epoch:[0/2](839000/4588595) loss:2.802 lr:0.0000100 epoch_Time:23865.0min: [2024-01-06 10:39:53,177][model8_pretrain.py][INFO] Epoch:[0/2](839000/4588595) loss:2.376 lr:0.0000100 epoch_Time:23865.0min: [2024-01-06 10:39:53,177][model8_pretrain.py][INFO] Epoch:[0/2](839000/4588595) loss:2.592 lr:0.0000100 epoch_Time:23865.0min: [2024-01-06 10:39:53,177][model8_pretrain.py][INFO] Epoch:[0/2](839000/4588595) loss:2.920 lr:0.0000100 epoch_Time:23865.0min: [2024-01-06 10:39:53,178][model8_pretrain.py][INFO] Epoch:[0/2](839000/4588595) loss:2.786 lr:0.0000100 epoch_Time:23865.0min: [2024-01-06 10:40:30,117][model8_pretrain.py][INFO] Epoch:[0/2](839100/4588595) loss:2.805 lr:0.0000100 epoch_Time:23865.0min: [2024-01-06 10:40:30,117][model8_pretrain.py][INFO] Epoch:[0/2](839100/4588595) loss:3.251 lr:0.0000100 epoch_Time:23865.0min: [2024-01-06 10:40:30,117][model8_pretrain.py][INFO] Epoch:[0/2](839100/4588595) loss:2.512 lr:0.0000100 epoch_Time:23865.0min: [2024-01-06 10:40:30,117][model8_pretrain.py][INFO] Epoch:[0/2](839100/4588595) loss:2.611 lr:0.0000100 epoch_Time:23865.0min: [2024-01-06 10:40:30,117][model8_pretrain.py][INFO] Epoch:[0/2](839100/4588595) loss:2.885 lr:0.0000100 epoch_Time:23865.0min: [2024-01-06 10:40:30,117][model8_pretrain.py][INFO] Epoch:[0/2](839100/4588595) loss:2.446 lr:0.0000100 epoch_Time:23865.0min: [2024-01-06 10:40:30,117][model8_pretrain.py][INFO] Epoch:[0/2](839100/4588595) loss:2.334 lr:0.0000100 
epoch_Time:23865.0min: [2024-01-06 10:40:30,117][model8_pretrain.py][INFO] Epoch:[0/2](839100/4588595) loss:3.139 lr:0.0000100 epoch_Time:23865.0min: [2024-01-06 10:41:07,124][model8_pretrain.py][INFO] Epoch:[0/2](839200/4588595) loss:2.421 lr:0.0000100 epoch_Time:23864.0min: [2024-01-06 10:41:07,124][model8_pretrain.py][INFO] Epoch:[0/2](839200/4588595) loss:2.646 lr:0.0000100 epoch_Time:23864.0min: [2024-01-06 10:41:07,124][model8_pretrain.py][INFO] Epoch:[0/2](839200/4588595) loss:2.476 lr:0.0000100 epoch_Time:23864.0min: [2024-01-06 10:41:07,124][model8_pretrain.py][INFO] Epoch:[0/2](839200/4588595) loss:3.119 lr:0.0000100 epoch_Time:23864.0min: [2024-01-06 10:41:07,124][model8_pretrain.py][INFO] Epoch:[0/2](839200/4588595) loss:3.180 lr:0.0000100 epoch_Time:23864.0min: [2024-01-06 10:41:07,124][model8_pretrain.py][INFO] Epoch:[0/2](839200/4588595) loss:2.940 lr:0.0000100 epoch_Time:23864.0min: [2024-01-06 10:41:07,124][model8_pretrain.py][INFO] Epoch:[0/2](839200/4588595) loss:3.243 lr:0.0000100 epoch_Time:23864.0min: [2024-01-06 10:41:07,124][model8_pretrain.py][INFO] Epoch:[0/2](839200/4588595) loss:2.770 lr:0.0000100 epoch_Time:23864.0min: [2024-01-06 10:41:44,069][model8_pretrain.py][INFO] Epoch:[0/2](839300/4588595) loss:2.559 lr:0.0000100 epoch_Time:23864.0min: [2024-01-06 10:41:44,069][model8_pretrain.py][INFO] Epoch:[0/2](839300/4588595) loss:2.926 lr:0.0000100 epoch_Time:23864.0min: [2024-01-06 10:41:44,069][model8_pretrain.py][INFO] Epoch:[0/2](839300/4588595) loss:3.151 lr:0.0000100 epoch_Time:23864.0min: [2024-01-06 10:41:44,069][model8_pretrain.py][INFO] Epoch:[0/2](839300/4588595) loss:2.667 lr:0.0000100 epoch_Time:23864.0min: [2024-01-06 10:41:44,069][model8_pretrain.py][INFO] Epoch:[0/2](839300/4588595) loss:3.036 lr:0.0000100 epoch_Time:23864.0min: [2024-01-06 10:41:44,069][model8_pretrain.py][INFO] Epoch:[0/2](839300/4588595) loss:3.108 lr:0.0000100 epoch_Time:23864.0min: [2024-01-06 10:41:44,070][model8_pretrain.py][INFO] 
Epoch:[0/2](839300/4588595) loss:2.920 lr:0.0000100 epoch_Time:23864.0min: [2024-01-06 10:41:44,070][model8_pretrain.py][INFO] Epoch:[0/2](839300/4588595) loss:2.872 lr:0.0000100 epoch_Time:23864.0min: [2024-01-06 10:42:21,037][model8_pretrain.py][INFO] Epoch:[0/2](839400/4588595) loss:2.696 lr:0.0000100 epoch_Time:23863.0min: [2024-01-06 10:42:21,037][model8_pretrain.py][INFO] Epoch:[0/2](839400/4588595) loss:2.440 lr:0.0000100 epoch_Time:23863.0min: [2024-01-06 10:42:21,037][model8_pretrain.py][INFO] Epoch:[0/2](839400/4588595) loss:3.085 lr:0.0000100 epoch_Time:23863.0min: [2024-01-06 10:42:21,037][model8_pretrain.py][INFO] Epoch:[0/2](839400/4588595) loss:2.761 lr:0.0000100 epoch_Time:23863.0min: [2024-01-06 10:42:21,037][model8_pretrain.py][INFO] Epoch:[0/2](839400/4588595) loss:2.771 lr:0.0000100 epoch_Time:23863.0min: [2024-01-06 10:42:21,037][model8_pretrain.py][INFO] Epoch:[0/2](839400/4588595) loss:2.711 lr:0.0000100 epoch_Time:23863.0min: [2024-01-06 10:42:21,037][model8_pretrain.py][INFO] Epoch:[0/2](839400/4588595) loss:3.445 lr:0.0000100 epoch_Time:23863.0min: [2024-01-06 10:42:21,037][model8_pretrain.py][INFO] Epoch:[0/2](839400/4588595) loss:2.459 lr:0.0000100 epoch_Time:23863.0min: [2024-01-06 10:42:57,979][model8_pretrain.py][INFO] Epoch:[0/2](839500/4588595) loss:3.141 lr:0.0000100 epoch_Time:23862.0min: [2024-01-06 10:42:57,979][model8_pretrain.py][INFO] Epoch:[0/2](839500/4588595) loss:2.851 lr:0.0000100 epoch_Time:23862.0min: [2024-01-06 10:42:57,979][model8_pretrain.py][INFO] Epoch:[0/2](839500/4588595) loss:2.471 lr:0.0000100 epoch_Time:23862.0min: [2024-01-06 10:42:57,979][model8_pretrain.py][INFO] Epoch:[0/2](839500/4588595) loss:2.618 lr:0.0000100 epoch_Time:23862.0min: [2024-01-06 10:42:57,979][model8_pretrain.py][INFO] Epoch:[0/2](839500/4588595) loss:3.195 lr:0.0000100 epoch_Time:23862.0min: [2024-01-06 10:42:57,979][model8_pretrain.py][INFO] Epoch:[0/2](839500/4588595) loss:3.291 lr:0.0000100 epoch_Time:23862.0min: [2024-01-06 
10:42:57,979][model8_pretrain.py][INFO] Epoch:[0/2](839500/4588595) loss:2.854 lr:0.0000100 epoch_Time:23862.0min: [2024-01-06 10:42:57,979][model8_pretrain.py][INFO] Epoch:[0/2](839500/4588595) loss:3.030 lr:0.0000100 epoch_Time:23862.0min: [2024-01-06 10:43:34,925][model8_pretrain.py][INFO] Epoch:[0/2](839600/4588595) loss:2.790 lr:0.0000100 epoch_Time:23861.0min: [2024-01-06 10:43:34,925][model8_pretrain.py][INFO] Epoch:[0/2](839600/4588595) loss:3.370 lr:0.0000100 epoch_Time:23861.0min: [2024-01-06 10:43:34,925][model8_pretrain.py][INFO] Epoch:[0/2](839600/4588595) loss:2.979 lr:0.0000100 epoch_Time:23861.0min: [2024-01-06 10:43:34,925][model8_pretrain.py][INFO] Epoch:[0/2](839600/4588595) loss:3.060 lr:0.0000100 epoch_Time:23861.0min: [2024-01-06 10:43:34,925][model8_pretrain.py][INFO] Epoch:[0/2](839600/4588595) loss:3.341 lr:0.0000100 epoch_Time:23861.0min: [2024-01-06 10:43:34,925][model8_pretrain.py][INFO] Epoch:[0/2](839600/4588595) loss:2.950 lr:0.0000100 epoch_Time:23861.0min: [2024-01-06 10:43:34,925][model8_pretrain.py][INFO] Epoch:[0/2](839600/4588595) loss:2.695 lr:0.0000100 epoch_Time:23861.0min: [2024-01-06 10:43:34,925][model8_pretrain.py][INFO] Epoch:[0/2](839600/4588595) loss:2.936 lr:0.0000100 epoch_Time:23861.0min: [2024-01-06 10:44:13,595][model8_pretrain.py][INFO] Epoch:[0/2](839700/4588595) loss:3.298 lr:0.0000100 epoch_Time:23860.0min: [2024-01-06 10:44:13,595][model8_pretrain.py][INFO] Epoch:[0/2](839700/4588595) loss:3.011 lr:0.0000100 epoch_Time:23860.0min: [2024-01-06 10:44:13,595][model8_pretrain.py][INFO] Epoch:[0/2](839700/4588595) loss:2.919 lr:0.0000100 epoch_Time:23860.0min: [2024-01-06 10:44:13,595][model8_pretrain.py][INFO] Epoch:[0/2](839700/4588595) loss:3.343 lr:0.0000100 epoch_Time:23860.0min: [2024-01-06 10:44:13,595][model8_pretrain.py][INFO] Epoch:[0/2](839700/4588595) loss:2.790 lr:0.0000100 epoch_Time:23860.0min: [2024-01-06 10:44:13,595][model8_pretrain.py][INFO] Epoch:[0/2](839700/4588595) loss:2.266 lr:0.0000100 
epoch_Time:23860.0min: [2024-01-06 10:44:13,595][model8_pretrain.py][INFO] Epoch:[0/2](839700/4588595) loss:2.968 lr:0.0000100 epoch_Time:23860.0min: [2024-01-06 10:44:13,595][model8_pretrain.py][INFO] Epoch:[0/2](839700/4588595) loss:2.310 lr:0.0000100 epoch_Time:23860.0min: [2024-01-06 10:44:59,355][model8_pretrain.py][INFO] Epoch:[0/2](839800/4588595) loss:2.874 lr:0.0000100 epoch_Time:23860.0min: [2024-01-06 10:44:59,355][model8_pretrain.py][INFO] Epoch:[0/2](839800/4588595) loss:2.890 lr:0.0000100 epoch_Time:23860.0min: [2024-01-06 10:44:59,355][model8_pretrain.py][INFO] Epoch:[0/2](839800/4588595) loss:2.318 lr:0.0000100 epoch_Time:23860.0min: [2024-01-06 10:44:59,355][model8_pretrain.py][INFO] Epoch:[0/2](839800/4588595) loss:2.385 lr:0.0000100 epoch_Time:23860.0min: [2024-01-06 10:44:59,355][model8_pretrain.py][INFO] Epoch:[0/2](839800/4588595) loss:2.261 lr:0.0000100 epoch_Time:23860.0min: [2024-01-06 10:44:59,355][model8_pretrain.py][INFO] Epoch:[0/2](839800/4588595) loss:2.522 lr:0.0000100 epoch_Time:23860.0min: [2024-01-06 10:44:59,355][model8_pretrain.py][INFO] Epoch:[0/2](839800/4588595) loss:2.760 lr:0.0000100 epoch_Time:23860.0min: [2024-01-06 10:44:59,355][model8_pretrain.py][INFO] Epoch:[0/2](839800/4588595) loss:2.863 lr:0.0000100 epoch_Time:23860.0min: [2024-01-06 10:45:36,319][model8_pretrain.py][INFO] Epoch:[0/2](839900/4588595) loss:2.917 lr:0.0000100 epoch_Time:23860.0min: [2024-01-06 10:45:36,319][model8_pretrain.py][INFO] Epoch:[0/2](839900/4588595) loss:2.812 lr:0.0000100 epoch_Time:23860.0min: [2024-01-06 10:45:36,319][model8_pretrain.py][INFO] Epoch:[0/2](839900/4588595) loss:2.624 lr:0.0000100 epoch_Time:23860.0min: [2024-01-06 10:45:36,319][model8_pretrain.py][INFO] Epoch:[0/2](839900/4588595) loss:2.909 lr:0.0000100 epoch_Time:23860.0min: [2024-01-06 10:45:36,319][model8_pretrain.py][INFO] Epoch:[0/2](839900/4588595) loss:2.773 lr:0.0000100 epoch_Time:23860.0min: [2024-01-06 10:45:36,319][model8_pretrain.py][INFO] 
Epoch:[0/2](839900/4588595) loss:2.152 lr:0.0000100 epoch_Time:23860.0min: [2024-01-06 10:45:36,319][model8_pretrain.py][INFO] Epoch:[0/2](839900/4588595) loss:2.613 lr:0.0000100 epoch_Time:23860.0min: [2024-01-06 10:45:36,319][model8_pretrain.py][INFO] Epoch:[0/2](839900/4588595) loss:3.406 lr:0.0000100 epoch_Time:23860.0min: [2024-01-06 10:46:13,312][model8_pretrain.py][INFO] Epoch:[0/2](840000/4588595) loss:3.112 lr:0.0000100 epoch_Time:23859.0min: [2024-01-06 10:46:13,312][model8_pretrain.py][INFO] Epoch:[0/2](840000/4588595) loss:2.644 lr:0.0000100 epoch_Time:23859.0min: [2024-01-06 10:46:13,312][model8_pretrain.py][INFO] Epoch:[0/2](840000/4588595) loss:3.183 lr:0.0000100 epoch_Time:23859.0min: [2024-01-06 10:46:13,312][model8_pretrain.py][INFO] Epoch:[0/2](840000/4588595) loss:2.694 lr:0.0000100 epoch_Time:23859.0min: [2024-01-06 10:46:13,312][model8_pretrain.py][INFO] Epoch:[0/2](840000/4588595) loss:3.218 lr:0.0000100 epoch_Time:23859.0min: [2024-01-06 10:46:13,312][model8_pretrain.py][INFO] Epoch:[0/2](840000/4588595) loss:2.806 lr:0.0000100 epoch_Time:23859.0min: [2024-01-06 10:46:13,313][model8_pretrain.py][INFO] Epoch:[0/2](840000/4588595) loss:2.652 lr:0.0000100 epoch_Time:23859.0min: [2024-01-06 10:46:13,312][model8_pretrain.py][INFO] Epoch:[0/2](840000/4588595) loss:2.600 lr:0.0000100 epoch_Time:23859.0min: [2024-01-06 10:46:50,300][model8_pretrain.py][INFO] Epoch:[0/2](840100/4588595) loss:2.875 lr:0.0000100 epoch_Time:23858.0min: [2024-01-06 10:46:50,300][model8_pretrain.py][INFO] Epoch:[0/2](840100/4588595) loss:2.683 lr:0.0000100 epoch_Time:23858.0min: [2024-01-06 10:46:50,300][model8_pretrain.py][INFO] Epoch:[0/2](840100/4588595) loss:3.082 lr:0.0000100 epoch_Time:23858.0min: [2024-01-06 10:46:50,300][model8_pretrain.py][INFO] Epoch:[0/2](840100/4588595) loss:2.852 lr:0.0000100 epoch_Time:23858.0min: [2024-01-06 10:46:50,300][model8_pretrain.py][INFO] Epoch:[0/2](840100/4588595) loss:2.755 lr:0.0000100 epoch_Time:23858.0min: [2024-01-06 
10:46:50,300][model8_pretrain.py][INFO] Epoch:[0/2](840100/4588595) loss:2.283 lr:0.0000100 epoch_Time:23858.0min: [2024-01-06 10:46:50,300][model8_pretrain.py][INFO] Epoch:[0/2](840100/4588595) loss:2.685 lr:0.0000100 epoch_Time:23858.0min: [2024-01-06 10:46:50,300][model8_pretrain.py][INFO] Epoch:[0/2](840100/4588595) loss:2.997 lr:0.0000100 epoch_Time:23858.0min: [2024-01-06 10:47:27,261][model8_pretrain.py][INFO] Epoch:[0/2](840200/4588595) loss:2.407 lr:0.0000100 epoch_Time:23858.0min: [2024-01-06 10:47:27,261][model8_pretrain.py][INFO] Epoch:[0/2](840200/4588595) loss:3.027 lr:0.0000100 epoch_Time:23858.0min: [2024-01-06 10:47:27,261][model8_pretrain.py][INFO] Epoch:[0/2](840200/4588595) loss:3.136 lr:0.0000100 epoch_Time:23858.0min: [2024-01-06 10:47:27,261][model8_pretrain.py][INFO] Epoch:[0/2](840200/4588595) loss:2.544 lr:0.0000100 epoch_Time:23858.0min: [2024-01-06 10:47:27,261][model8_pretrain.py][INFO] Epoch:[0/2](840200/4588595) loss:2.419 lr:0.0000100 epoch_Time:23858.0min: [2024-01-06 10:47:27,261][model8_pretrain.py][INFO] Epoch:[0/2](840200/4588595) loss:2.896 lr:0.0000100 epoch_Time:23858.0min: [2024-01-06 10:47:27,261][model8_pretrain.py][INFO] Epoch:[0/2](840200/4588595) loss:2.785 lr:0.0000100 epoch_Time:23858.0min: [2024-01-06 10:47:27,261][model8_pretrain.py][INFO] Epoch:[0/2](840200/4588595) loss:3.366 lr:0.0000100 epoch_Time:23858.0min: [2024-01-06 10:48:04,209][model8_pretrain.py][INFO] Epoch:[0/2](840300/4588595) loss:3.483 lr:0.0000100 epoch_Time:23857.0min: [2024-01-06 10:48:04,209][model8_pretrain.py][INFO] Epoch:[0/2](840300/4588595) loss:2.755 lr:0.0000100 epoch_Time:23857.0min: [2024-01-06 10:48:04,209][model8_pretrain.py][INFO] Epoch:[0/2](840300/4588595) loss:2.398 lr:0.0000100 epoch_Time:23857.0min: [2024-01-06 10:48:04,209][model8_pretrain.py][INFO] Epoch:[0/2](840300/4588595) loss:2.526 lr:0.0000100 epoch_Time:23857.0min: [2024-01-06 10:48:04,209][model8_pretrain.py][INFO] Epoch:[0/2](840300/4588595) loss:2.301 lr:0.0000100 
epoch_Time:23857.0min: [2024-01-06 10:48:04,209][model8_pretrain.py][INFO] Epoch:[0/2](840300/4588595) loss:2.251 lr:0.0000100 epoch_Time:23857.0min: [2024-01-06 10:48:04,209][model8_pretrain.py][INFO] Epoch:[0/2](840300/4588595) loss:2.607 lr:0.0000100 epoch_Time:23857.0min: [2024-01-06 10:48:04,209][model8_pretrain.py][INFO] Epoch:[0/2](840300/4588595) loss:3.449 lr:0.0000100 epoch_Time:23857.0min: [2024-01-06 10:48:41,158][model8_pretrain.py][INFO] Epoch:[0/2](840400/4588595) loss:3.085 lr:0.0000100 epoch_Time:23856.0min: [2024-01-06 10:48:41,158][model8_pretrain.py][INFO] Epoch:[0/2](840400/4588595) loss:2.600 lr:0.0000100 epoch_Time:23856.0min: [2024-01-06 10:48:41,158][model8_pretrain.py][INFO] Epoch:[0/2](840400/4588595) loss:2.697 lr:0.0000100 epoch_Time:23856.0min: [2024-01-06 10:48:41,158][model8_pretrain.py][INFO] Epoch:[0/2](840400/4588595) loss:2.835 lr:0.0000100 epoch_Time:23856.0min: [2024-01-06 10:48:41,158][model8_pretrain.py][INFO] Epoch:[0/2](840400/4588595) loss:2.557 lr:0.0000100 epoch_Time:23856.0min: [2024-01-06 10:48:41,158][model8_pretrain.py][INFO] Epoch:[0/2](840400/4588595) loss:3.086 lr:0.0000100 epoch_Time:23856.0min: [2024-01-06 10:48:41,158][model8_pretrain.py][INFO] Epoch:[0/2](840400/4588595) loss:3.101 lr:0.0000100 epoch_Time:23856.0min: [2024-01-06 10:48:41,158][model8_pretrain.py][INFO] Epoch:[0/2](840400/4588595) loss:2.580 lr:0.0000100 epoch_Time:23856.0min: [2024-01-06 10:49:19,833][model8_pretrain.py][INFO] Epoch:[0/2](840500/4588595) loss:3.232 lr:0.0000100 epoch_Time:23856.0min: [2024-01-06 10:49:19,833][model8_pretrain.py][INFO] Epoch:[0/2](840500/4588595) loss:2.596 lr:0.0000100 epoch_Time:23856.0min: [2024-01-06 10:49:19,833][model8_pretrain.py][INFO] Epoch:[0/2](840500/4588595) loss:1.511 lr:0.0000100 epoch_Time:23856.0min: [2024-01-06 10:49:19,833][model8_pretrain.py][INFO] Epoch:[0/2](840500/4588595) loss:3.329 lr:0.0000100 epoch_Time:23856.0min: [2024-01-06 10:49:19,833][model8_pretrain.py][INFO] 
Epoch:[0/2](840500/4588595) loss:2.808 lr:0.0000100 epoch_Time:23856.0min: [2024-01-06 10:49:19,833][model8_pretrain.py][INFO] Epoch:[0/2](840500/4588595) loss:2.784 lr:0.0000100 epoch_Time:23856.0min: [2024-01-06 10:49:19,833][model8_pretrain.py][INFO] Epoch:[0/2](840500/4588595) loss:1.898 lr:0.0000100 epoch_Time:23856.0min: [2024-01-06 10:49:19,833][model8_pretrain.py][INFO] Epoch:[0/2](840500/4588595) loss:3.145 lr:0.0000100 epoch_Time:23856.0min: [2024-01-06 10:50:05,556][model8_pretrain.py][INFO] Epoch:[0/2](840600/4588595) loss:3.130 lr:0.0000100 epoch_Time:23855.0min: [2024-01-06 10:50:05,556][model8_pretrain.py][INFO] Epoch:[0/2](840600/4588595) loss:2.337 lr:0.0000100 epoch_Time:23855.0min: [2024-01-06 10:50:05,556][model8_pretrain.py][INFO] Epoch:[0/2](840600/4588595) loss:3.351 lr:0.0000100 epoch_Time:23855.0min: [2024-01-06 10:50:05,556][model8_pretrain.py][INFO] Epoch:[0/2](840600/4588595) loss:2.866 lr:0.0000100 epoch_Time:23855.0min: [2024-01-06 10:50:05,556][model8_pretrain.py][INFO] Epoch:[0/2](840600/4588595) loss:2.153 lr:0.0000100 epoch_Time:23855.0min: [2024-01-06 10:50:05,556][model8_pretrain.py][INFO] Epoch:[0/2](840600/4588595) loss:2.919 lr:0.0000100 epoch_Time:23855.0min: [2024-01-06 10:50:05,556][model8_pretrain.py][INFO] Epoch:[0/2](840600/4588595) loss:2.190 lr:0.0000100 epoch_Time:23855.0min: [2024-01-06 10:50:05,556][model8_pretrain.py][INFO] Epoch:[0/2](840600/4588595) loss:2.857 lr:0.0000100 epoch_Time:23855.0min: [2024-01-06 10:50:42,496][model8_pretrain.py][INFO] Epoch:[0/2](840700/4588595) loss:2.878 lr:0.0000100 epoch_Time:23855.0min: [2024-01-06 10:50:42,497][model8_pretrain.py][INFO] Epoch:[0/2](840700/4588595) loss:3.098 lr:0.0000100 epoch_Time:23855.0min: [2024-01-06 10:50:42,497][model8_pretrain.py][INFO] Epoch:[0/2](840700/4588595) loss:2.840 lr:0.0000100 epoch_Time:23855.0min: [2024-01-06 10:50:42,497][model8_pretrain.py][INFO] Epoch:[0/2](840700/4588595) loss:2.384 lr:0.0000100 epoch_Time:23855.0min: [2024-01-06 
10:50:42,497][model8_pretrain.py][INFO] Epoch:[0/2](840700/4588595) loss:2.843 lr:0.0000100 epoch_Time:23855.0min: [2024-01-06 10:50:42,497][model8_pretrain.py][INFO] Epoch:[0/2](840700/4588595) loss:2.443 lr:0.0000100 epoch_Time:23855.0min: [2024-01-06 10:50:42,497][model8_pretrain.py][INFO] Epoch:[0/2](840700/4588595) loss:3.380 lr:0.0000100 epoch_Time:23855.0min: [2024-01-06 10:50:42,497][model8_pretrain.py][INFO] Epoch:[0/2](840700/4588595) loss:2.748 lr:0.0000100 epoch_Time:23855.0min: [2024-01-06 10:51:19,446][model8_pretrain.py][INFO] Epoch:[0/2](840800/4588595) loss:3.550 lr:0.0000100 epoch_Time:23854.0min: [2024-01-06 10:51:19,446][model8_pretrain.py][INFO] Epoch:[0/2](840800/4588595) loss:2.978 lr:0.0000100 epoch_Time:23854.0min: [2024-01-06 10:51:19,446][model8_pretrain.py][INFO] Epoch:[0/2](840800/4588595) loss:3.110 lr:0.0000100 epoch_Time:23854.0min: [2024-01-06 10:51:19,446][model8_pretrain.py][INFO] Epoch:[0/2](840800/4588595) loss:2.974 lr:0.0000100 epoch_Time:23854.0min: [2024-01-06 10:51:19,446][model8_pretrain.py][INFO] Epoch:[0/2](840800/4588595) loss:2.334 lr:0.0000100 epoch_Time:23854.0min: [2024-01-06 10:51:19,446][model8_pretrain.py][INFO] Epoch:[0/2](840800/4588595) loss:2.758 lr:0.0000100 epoch_Time:23854.0min: [2024-01-06 10:51:19,446][model8_pretrain.py][INFO] Epoch:[0/2](840800/4588595) loss:2.727 lr:0.0000100 epoch_Time:23854.0min: [2024-01-06 10:51:19,447][model8_pretrain.py][INFO] Epoch:[0/2](840800/4588595) loss:3.072 lr:0.0000100 epoch_Time:23854.0min: [2024-01-06 10:51:56,382][model8_pretrain.py][INFO] Epoch:[0/2](840900/4588595) loss:2.888 lr:0.0000100 epoch_Time:23853.0min: [2024-01-06 10:51:56,382][model8_pretrain.py][INFO] Epoch:[0/2](840900/4588595) loss:2.795 lr:0.0000100 epoch_Time:23853.0min: [2024-01-06 10:51:56,382][model8_pretrain.py][INFO] Epoch:[0/2](840900/4588595) loss:2.644 lr:0.0000100 epoch_Time:23853.0min: [2024-01-06 10:51:56,382][model8_pretrain.py][INFO] Epoch:[0/2](840900/4588595) loss:2.793 lr:0.0000100 
epoch_Time:23853.0min: [2024-01-06 10:51:56,382][model8_pretrain.py][INFO] Epoch:[0/2](840900/4588595) loss:3.033 lr:0.0000100 epoch_Time:23853.0min: [2024-01-06 10:51:56,382][model8_pretrain.py][INFO] Epoch:[0/2](840900/4588595) loss:2.759 lr:0.0000100 epoch_Time:23853.0min: [2024-01-06 10:51:56,382][model8_pretrain.py][INFO] Epoch:[0/2](840900/4588595) loss:2.906 lr:0.0000100 epoch_Time:23853.0min: [2024-01-06 10:51:56,382][model8_pretrain.py][INFO] Epoch:[0/2](840900/4588595) loss:2.744 lr:0.0000100 epoch_Time:23853.0min: [2024-01-06 10:52:33,333][model8_pretrain.py][INFO] Epoch:[0/2](841000/4588595) loss:2.829 lr:0.0000100 epoch_Time:23853.0min: [2024-01-06 10:52:33,333][model8_pretrain.py][INFO] Epoch:[0/2](841000/4588595) loss:3.280 lr:0.0000100 epoch_Time:23853.0min: [2024-01-06 10:52:33,333][model8_pretrain.py][INFO] Epoch:[0/2](841000/4588595) loss:3.409 lr:0.0000100 epoch_Time:23853.0min: [2024-01-06 10:52:33,333][model8_pretrain.py][INFO] Epoch:[0/2](841000/4588595) loss:2.578 lr:0.0000100 epoch_Time:23853.0min: [2024-01-06 10:52:33,333][model8_pretrain.py][INFO] Epoch:[0/2](841000/4588595) loss:2.320 lr:0.0000100 epoch_Time:23853.0min: [2024-01-06 10:52:33,333][model8_pretrain.py][INFO] Epoch:[0/2](841000/4588595) loss:3.003 lr:0.0000100 epoch_Time:23853.0min: [2024-01-06 10:52:33,333][model8_pretrain.py][INFO] Epoch:[0/2](841000/4588595) loss:3.008 lr:0.0000100 epoch_Time:23853.0min: [2024-01-06 10:52:33,333][model8_pretrain.py][INFO] Epoch:[0/2](841000/4588595) loss:2.657 lr:0.0000100 epoch_Time:23853.0min: [2024-01-06 10:53:10,288][model8_pretrain.py][INFO] Epoch:[0/2](841100/4588595) loss:2.838 lr:0.0000100 epoch_Time:23852.0min: [2024-01-06 10:53:10,288][model8_pretrain.py][INFO] Epoch:[0/2](841100/4588595) loss:2.374 lr:0.0000100 epoch_Time:23852.0min: [2024-01-06 10:53:10,288][model8_pretrain.py][INFO] Epoch:[0/2](841100/4588595) loss:2.695 lr:0.0000100 epoch_Time:23852.0min: [2024-01-06 10:53:10,288][model8_pretrain.py][INFO] 
Epoch:[0/2](841100/4588595) loss:2.521 lr:0.0000100 epoch_Time:23852.0min: [2024-01-06 10:53:10,288][model8_pretrain.py][INFO] Epoch:[0/2](841100/4588595) loss:2.374 lr:0.0000100 epoch_Time:23852.0min: [2024-01-06 10:53:10,288][model8_pretrain.py][INFO] Epoch:[0/2](841100/4588595) loss:2.630 lr:0.0000100 epoch_Time:23852.0min: [2024-01-06 10:53:10,288][model8_pretrain.py][INFO] Epoch:[0/2](841100/4588595) loss:3.013 lr:0.0000100 epoch_Time:23852.0min: [2024-01-06 10:53:10,288][model8_pretrain.py][INFO] Epoch:[0/2](841100/4588595) loss:3.277 lr:0.0000100 epoch_Time:23852.0min: [2024-01-06 10:53:47,238][model8_pretrain.py][INFO] Epoch:[0/2](841200/4588595) loss:2.487 lr:0.0000100 epoch_Time:23852.0min: [2024-01-06 10:53:47,238][model8_pretrain.py][INFO] Epoch:[0/2](841200/4588595) loss:2.842 lr:0.0000100 epoch_Time:23852.0min: [2024-01-06 10:53:47,238][model8_pretrain.py][INFO] Epoch:[0/2](841200/4588595) loss:2.910 lr:0.0000100 epoch_Time:23852.0min: [2024-01-06 10:53:47,238][model8_pretrain.py][INFO] Epoch:[0/2](841200/4588595) loss:3.123 lr:0.0000100 epoch_Time:23852.0min: [2024-01-06 10:53:47,238][model8_pretrain.py][INFO] Epoch:[0/2](841200/4588595) loss:2.458 lr:0.0000100 epoch_Time:23852.0min: [2024-01-06 10:53:47,238][model8_pretrain.py][INFO] Epoch:[0/2](841200/4588595) loss:2.709 lr:0.0000100 epoch_Time:23852.0min: [2024-01-06 10:53:47,238][model8_pretrain.py][INFO] Epoch:[0/2](841200/4588595) loss:3.201 lr:0.0000100 epoch_Time:23852.0min: [2024-01-06 10:53:47,238][model8_pretrain.py][INFO] Epoch:[0/2](841200/4588595) loss:2.524 lr:0.0000100 epoch_Time:23852.0min: [2024-01-06 10:54:25,934][model8_pretrain.py][INFO] Epoch:[0/2](841300/4588595) loss:2.913 lr:0.0000100 epoch_Time:23851.0min: [2024-01-06 10:54:25,935][model8_pretrain.py][INFO] Epoch:[0/2](841300/4588595) loss:2.283 lr:0.0000100 epoch_Time:23851.0min: [2024-01-06 10:54:25,935][model8_pretrain.py][INFO] Epoch:[0/2](841300/4588595) loss:2.339 lr:0.0000100 epoch_Time:23851.0min: [2024-01-06 
10:54:25,935][model8_pretrain.py][INFO] Epoch:[0/2](841300/4588595) loss:3.094 lr:0.0000100 epoch_Time:23851.0min: [2024-01-06 10:54:25,935][model8_pretrain.py][INFO] Epoch:[0/2](841300/4588595) loss:2.590 lr:0.0000100 epoch_Time:23851.0min: [2024-01-06 10:54:25,935][model8_pretrain.py][INFO] Epoch:[0/2](841300/4588595) loss:2.801 lr:0.0000100 epoch_Time:23851.0min: [2024-01-06 10:54:25,935][model8_pretrain.py][INFO] Epoch:[0/2](841300/4588595) loss:3.137 lr:0.0000100 epoch_Time:23851.0min: [2024-01-06 10:54:25,935][model8_pretrain.py][INFO] Epoch:[0/2](841300/4588595) loss:2.785 lr:0.0000100 epoch_Time:23851.0min: [2024-01-06 10:55:12,342][model8_pretrain.py][INFO] Epoch:[0/2](841400/4588595) loss:2.724 lr:0.0000100 epoch_Time:23850.0min: [2024-01-06 10:55:12,342][model8_pretrain.py][INFO] Epoch:[0/2](841400/4588595) loss:3.140 lr:0.0000100 epoch_Time:23850.0min: [2024-01-06 10:55:12,342][model8_pretrain.py][INFO] Epoch:[0/2](841400/4588595) loss:2.755 lr:0.0000100 epoch_Time:23850.0min: [2024-01-06 10:55:12,342][model8_pretrain.py][INFO] Epoch:[0/2](841400/4588595) loss:3.027 lr:0.0000100 epoch_Time:23850.0min: [2024-01-06 10:55:12,342][model8_pretrain.py][INFO] Epoch:[0/2](841400/4588595) loss:3.043 lr:0.0000100 epoch_Time:23850.0min: [2024-01-06 10:55:12,342][model8_pretrain.py][INFO] Epoch:[0/2](841400/4588595) loss:2.710 lr:0.0000100 epoch_Time:23850.0min: [2024-01-06 10:55:12,342][model8_pretrain.py][INFO] Epoch:[0/2](841400/4588595) loss:2.397 lr:0.0000100 epoch_Time:23850.0min: [2024-01-06 10:55:12,342][model8_pretrain.py][INFO] Epoch:[0/2](841400/4588595) loss:2.620 lr:0.0000100 epoch_Time:23850.0min: [2024-01-06 10:55:49,280][model8_pretrain.py][INFO] Epoch:[0/2](841500/4588595) loss:2.763 lr:0.0000100 epoch_Time:23849.0min: [2024-01-06 10:55:49,280][model8_pretrain.py][INFO] Epoch:[0/2](841500/4588595) loss:3.164 lr:0.0000100 epoch_Time:23849.0min: [2024-01-06 10:55:49,280][model8_pretrain.py][INFO] Epoch:[0/2](841500/4588595) loss:3.008 lr:0.0000100 
epoch_Time:23849.0min: [2024-01-06 10:55:49,280][model8_pretrain.py][INFO] Epoch:[0/2](841500/4588595) loss:2.950 lr:0.0000100 epoch_Time:23849.0min: [2024-01-06 10:55:49,280][model8_pretrain.py][INFO] Epoch:[0/2](841500/4588595) loss:2.995 lr:0.0000100 epoch_Time:23849.0min: [2024-01-06 10:55:49,280][model8_pretrain.py][INFO] Epoch:[0/2](841500/4588595) loss:2.956 lr:0.0000100 epoch_Time:23849.0min: [2024-01-06 10:55:49,281][model8_pretrain.py][INFO] Epoch:[0/2](841500/4588595) loss:2.928 lr:0.0000100 epoch_Time:23849.0min: [2024-01-06 10:55:49,281][model8_pretrain.py][INFO] Epoch:[0/2](841500/4588595) loss:2.896 lr:0.0000100 epoch_Time:23849.0min: [2024-01-06 10:56:26,224][model8_pretrain.py][INFO] Epoch:[0/2](841600/4588595) loss:2.846 lr:0.0000100 epoch_Time:23849.0min: [2024-01-06 10:56:26,224][model8_pretrain.py][INFO] Epoch:[0/2](841600/4588595) loss:2.935 lr:0.0000100 epoch_Time:23849.0min: [2024-01-06 10:56:26,224][model8_pretrain.py][INFO] Epoch:[0/2](841600/4588595) loss:2.398 lr:0.0000100 epoch_Time:23849.0min: [2024-01-06 10:56:26,224][model8_pretrain.py][INFO] Epoch:[0/2](841600/4588595) loss:2.504 lr:0.0000100 epoch_Time:23849.0min: [2024-01-06 10:56:26,224][model8_pretrain.py][INFO] Epoch:[0/2](841600/4588595) loss:1.970 lr:0.0000100 epoch_Time:23849.0min: [2024-01-06 10:56:26,224][model8_pretrain.py][INFO] Epoch:[0/2](841600/4588595) loss:2.796 lr:0.0000100 epoch_Time:23849.0min: [2024-01-06 10:56:26,225][model8_pretrain.py][INFO] Epoch:[0/2](841600/4588595) loss:2.733 lr:0.0000100 epoch_Time:23849.0min: [2024-01-06 10:56:26,224][model8_pretrain.py][INFO] Epoch:[0/2](841600/4588595) loss:3.016 lr:0.0000100 epoch_Time:23849.0min: [2024-01-06 10:57:03,167][model8_pretrain.py][INFO] Epoch:[0/2](841700/4588595) loss:2.755 lr:0.0000100 epoch_Time:23848.0min: [2024-01-06 10:57:03,167][model8_pretrain.py][INFO] Epoch:[0/2](841700/4588595) loss:2.870 lr:0.0000100 epoch_Time:23848.0min: [2024-01-06 10:57:03,167][model8_pretrain.py][INFO] 
Epoch:[0/2](841700/4588595) loss:2.864 lr:0.0000100 epoch_Time:23848.0min: [2024-01-06 10:57:03,167][model8_pretrain.py][INFO] Epoch:[0/2](841700/4588595) loss:2.752 lr:0.0000100 epoch_Time:23848.0min: [2024-01-06 10:57:03,167][model8_pretrain.py][INFO] Epoch:[0/2](841700/4588595) loss:2.981 lr:0.0000100 epoch_Time:23848.0min: [2024-01-06 10:57:03,168][model8_pretrain.py][INFO] Epoch:[0/2](841700/4588595) loss:2.977 lr:0.0000100 epoch_Time:23848.0min: [2024-01-06 10:57:03,168][model8_pretrain.py][INFO] Epoch:[0/2](841700/4588595) loss:2.443 lr:0.0000100 epoch_Time:23848.0min: [2024-01-06 10:57:03,168][model8_pretrain.py][INFO] Epoch:[0/2](841700/4588595) loss:2.715 lr:0.0000100 epoch_Time:23848.0min: [2024-01-06 10:57:40,115][model8_pretrain.py][INFO] Epoch:[0/2](841800/4588595) loss:2.848 lr:0.0000100 epoch_Time:23848.0min: [2024-01-06 10:57:40,115][model8_pretrain.py][INFO] Epoch:[0/2](841800/4588595) loss:2.787 lr:0.0000100 epoch_Time:23848.0min: [2024-01-06 10:57:40,115][model8_pretrain.py][INFO] Epoch:[0/2](841800/4588595) loss:2.970 lr:0.0000100 epoch_Time:23848.0min: [2024-01-06 10:57:40,115][model8_pretrain.py][INFO] Epoch:[0/2](841800/4588595) loss:3.309 lr:0.0000100 epoch_Time:23848.0min: [2024-01-06 10:57:40,115][model8_pretrain.py][INFO] Epoch:[0/2](841800/4588595) loss:3.224 lr:0.0000100 epoch_Time:23848.0min: [2024-01-06 10:57:40,116][model8_pretrain.py][INFO] Epoch:[0/2](841800/4588595) loss:2.881 lr:0.0000100 epoch_Time:23848.0min: [2024-01-06 10:57:40,116][model8_pretrain.py][INFO] Epoch:[0/2](841800/4588595) loss:3.025 lr:0.0000100 epoch_Time:23848.0min: [2024-01-06 10:57:40,116][model8_pretrain.py][INFO] Epoch:[0/2](841800/4588595) loss:2.601 lr:0.0000100 epoch_Time:23848.0min: [2024-01-06 10:58:17,075][model8_pretrain.py][INFO] Epoch:[0/2](841900/4588595) loss:2.756 lr:0.0000100 epoch_Time:23847.0min: [2024-01-06 10:58:17,075][model8_pretrain.py][INFO] Epoch:[0/2](841900/4588595) loss:2.788 lr:0.0000100 epoch_Time:23847.0min: [2024-01-06 
10:58:17,075][model8_pretrain.py][INFO] Epoch:[0/2](841900/4588595) loss:3.355 lr:0.0000100 epoch_Time:23847.0min: [2024-01-06 10:58:17,075][model8_pretrain.py][INFO] Epoch:[0/2](841900/4588595) loss:3.227 lr:0.0000100 epoch_Time:23847.0min: [2024-01-06 10:58:17,075][model8_pretrain.py][INFO] Epoch:[0/2](841900/4588595) loss:2.701 lr:0.0000100 epoch_Time:23847.0min: [2024-01-06 10:58:17,075][model8_pretrain.py][INFO] Epoch:[0/2](841900/4588595) loss:2.753 lr:0.0000100 epoch_Time:23847.0min: [2024-01-06 10:58:17,075][model8_pretrain.py][INFO] Epoch:[0/2](841900/4588595) loss:2.763 lr:0.0000100 epoch_Time:23847.0min: [2024-01-06 10:58:17,075][model8_pretrain.py][INFO] Epoch:[0/2](841900/4588595) loss:2.674 lr:0.0000100 epoch_Time:23847.0min: [2024-01-06 10:58:54,049][model8_pretrain.py][INFO] Epoch:[0/2](842000/4588595) loss:3.163 lr:0.0000100 epoch_Time:23846.0min: [2024-01-06 10:58:54,049][model8_pretrain.py][INFO] Epoch:[0/2](842000/4588595) loss:2.882 lr:0.0000100 epoch_Time:23846.0min: [2024-01-06 10:58:54,049][model8_pretrain.py][INFO] Epoch:[0/2](842000/4588595) loss:2.622 lr:0.0000100 epoch_Time:23846.0min: [2024-01-06 10:58:54,049][model8_pretrain.py][INFO] Epoch:[0/2](842000/4588595) loss:3.056 lr:0.0000100 epoch_Time:23846.0min: [2024-01-06 10:58:54,049][model8_pretrain.py][INFO] Epoch:[0/2](842000/4588595) loss:2.790 lr:0.0000100 epoch_Time:23846.0min: [2024-01-06 10:58:54,049][model8_pretrain.py][INFO] Epoch:[0/2](842000/4588595) loss:2.779 lr:0.0000100 epoch_Time:23846.0min: [2024-01-06 10:58:54,049][model8_pretrain.py][INFO] Epoch:[0/2](842000/4588595) loss:2.978 lr:0.0000100 epoch_Time:23846.0min: [2024-01-06 10:58:54,049][model8_pretrain.py][INFO] Epoch:[0/2](842000/4588595) loss:2.473 lr:0.0000100 epoch_Time:23846.0min: [2024-01-06 10:59:32,695][model8_pretrain.py][INFO] Epoch:[0/2](842100/4588595) loss:2.653 lr:0.0000100 epoch_Time:23846.0min: [2024-01-06 10:59:32,695][model8_pretrain.py][INFO] Epoch:[0/2](842100/4588595) loss:1.985 lr:0.0000100 
epoch_Time:23846.0min: [2024-01-06 10:59:32,695][model8_pretrain.py][INFO] Epoch:[0/2](842100/4588595) loss:2.956 lr:0.0000100 epoch_Time:23846.0min: [2024-01-06 10:59:32,695][model8_pretrain.py][INFO] Epoch:[0/2](842100/4588595) loss:2.585 lr:0.0000100 epoch_Time:23846.0min: [2024-01-06 10:59:32,695][model8_pretrain.py][INFO] Epoch:[0/2](842100/4588595) loss:2.467 lr:0.0000100 epoch_Time:23846.0min: [2024-01-06 10:59:32,695][model8_pretrain.py][INFO] Epoch:[0/2](842100/4588595) loss:3.435 lr:0.0000100 epoch_Time:23846.0min: [2024-01-06 10:59:32,695][model8_pretrain.py][INFO] Epoch:[0/2](842100/4588595) loss:2.539 lr:0.0000100 epoch_Time:23846.0min: [2024-01-06 10:59:32,695][model8_pretrain.py][INFO] Epoch:[0/2](842100/4588595) loss:2.969 lr:0.0000100 epoch_Time:23846.0min: [2024-01-06 11:00:18,402][model8_pretrain.py][INFO] Epoch:[0/2](842200/4588595) loss:2.016 lr:0.0000100 epoch_Time:23845.0min: [2024-01-06 11:00:18,402][model8_pretrain.py][INFO] Epoch:[0/2](842200/4588595) loss:2.158 lr:0.0000100 epoch_Time:23845.0min: [2024-01-06 11:00:18,402][model8_pretrain.py][INFO] Epoch:[0/2](842200/4588595) loss:3.158 lr:0.0000100 epoch_Time:23845.0min: [2024-01-06 11:00:18,402][model8_pretrain.py][INFO] Epoch:[0/2](842200/4588595) loss:3.168 lr:0.0000100 epoch_Time:23845.0min: [2024-01-06 11:00:18,402][model8_pretrain.py][INFO] Epoch:[0/2](842200/4588595) loss:2.839 lr:0.0000100 epoch_Time:23845.0min: [2024-01-06 11:00:18,402][model8_pretrain.py][INFO] Epoch:[0/2](842200/4588595) loss:3.117 lr:0.0000100 epoch_Time:23845.0min: [2024-01-06 11:00:18,402][model8_pretrain.py][INFO] Epoch:[0/2](842200/4588595) loss:2.366 lr:0.0000100 epoch_Time:23845.0min: [2024-01-06 11:00:18,402][model8_pretrain.py][INFO] Epoch:[0/2](842200/4588595) loss:2.965 lr:0.0000100 epoch_Time:23845.0min: [2024-01-06 11:00:55,335][model8_pretrain.py][INFO] Epoch:[0/2](842300/4588595) loss:2.864 lr:0.0000100 epoch_Time:23844.0min: [2024-01-06 11:00:55,335][model8_pretrain.py][INFO] 
Epoch:[0/2](842300/4588595) loss:2.306 lr:0.0000100 epoch_Time:23844.0min: [2024-01-06 11:00:55,335][model8_pretrain.py][INFO] Epoch:[0/2](842300/4588595) loss:3.104 lr:0.0000100 epoch_Time:23844.0min: [2024-01-06 11:00:55,335][model8_pretrain.py][INFO] Epoch:[0/2](842300/4588595) loss:2.484 lr:0.0000100 epoch_Time:23844.0min: [2024-01-06 11:00:55,335][model8_pretrain.py][INFO] Epoch:[0/2](842300/4588595) loss:3.241 lr:0.0000100 epoch_Time:23844.0min: [2024-01-06 11:00:55,335][model8_pretrain.py][INFO] Epoch:[0/2](842300/4588595) loss:2.513 lr:0.0000100 epoch_Time:23844.0min: [2024-01-06 11:00:55,335][model8_pretrain.py][INFO] Epoch:[0/2](842300/4588595) loss:2.421 lr:0.0000100 epoch_Time:23844.0min: [2024-01-06 11:00:55,335][model8_pretrain.py][INFO] Epoch:[0/2](842300/4588595) loss:2.643 lr:0.0000100 epoch_Time:23844.0min: [2024-01-06 11:01:32,266][model8_pretrain.py][INFO] Epoch:[0/2](842400/4588595) loss:2.842 lr:0.0000100 epoch_Time:23844.0min: [2024-01-06 11:01:32,266][model8_pretrain.py][INFO] Epoch:[0/2](842400/4588595) loss:2.730 lr:0.0000100 epoch_Time:23844.0min: [2024-01-06 11:01:32,266][model8_pretrain.py][INFO] Epoch:[0/2](842400/4588595) loss:2.860 lr:0.0000100 epoch_Time:23844.0min: [2024-01-06 11:01:32,266][model8_pretrain.py][INFO] Epoch:[0/2](842400/4588595) loss:2.989 lr:0.0000100 epoch_Time:23844.0min: [2024-01-06 11:01:32,266][model8_pretrain.py][INFO] Epoch:[0/2](842400/4588595) loss:2.254 lr:0.0000100 epoch_Time:23844.0min: [2024-01-06 11:01:32,266][model8_pretrain.py][INFO] Epoch:[0/2](842400/4588595) loss:2.747 lr:0.0000100 epoch_Time:23844.0min: [2024-01-06 11:01:32,266][model8_pretrain.py][INFO] Epoch:[0/2](842400/4588595) loss:2.895 lr:0.0000100 epoch_Time:23844.0min: [2024-01-06 11:01:32,266][model8_pretrain.py][INFO] Epoch:[0/2](842400/4588595) loss:2.814 lr:0.0000100 epoch_Time:23844.0min: [2024-01-06 11:02:09,210][model8_pretrain.py][INFO] Epoch:[0/2](842500/4588595) loss:2.626 lr:0.0000100 epoch_Time:23843.0min: [2024-01-06 
11:02:09,210][model8_pretrain.py][INFO] Epoch:[0/2](842500/4588595) loss:3.104 lr:0.0000100 epoch_Time:23843.0min: [2024-01-06 11:02:09,210][model8_pretrain.py][INFO] Epoch:[0/2](842500/4588595) loss:2.778 lr:0.0000100 epoch_Time:23843.0min: [2024-01-06 11:02:09,210][model8_pretrain.py][INFO] Epoch:[0/2](842500/4588595) loss:3.076 lr:0.0000100 epoch_Time:23843.0min: [2024-01-06 11:02:09,210][model8_pretrain.py][INFO] Epoch:[0/2](842500/4588595) loss:2.339 lr:0.0000100 epoch_Time:23843.0min: [2024-01-06 11:02:09,210][model8_pretrain.py][INFO] Epoch:[0/2](842500/4588595) loss:2.624 lr:0.0000100 epoch_Time:23843.0min: [2024-01-06 11:02:09,210][model8_pretrain.py][INFO] Epoch:[0/2](842500/4588595) loss:2.431 lr:0.0000100 epoch_Time:23843.0min: [2024-01-06 11:02:09,210][model8_pretrain.py][INFO] Epoch:[0/2](842500/4588595) loss:3.351 lr:0.0000100 epoch_Time:23843.0min: [2024-01-06 11:02:46,155][model8_pretrain.py][INFO] Epoch:[0/2](842600/4588595) loss:3.508 lr:0.0000100 epoch_Time:23843.0min: [2024-01-06 11:02:46,155][model8_pretrain.py][INFO] Epoch:[0/2](842600/4588595) loss:3.207 lr:0.0000100 epoch_Time:23843.0min: [2024-01-06 11:02:46,155][model8_pretrain.py][INFO] Epoch:[0/2](842600/4588595) loss:2.662 lr:0.0000100 epoch_Time:23843.0min: [2024-01-06 11:02:46,155][model8_pretrain.py][INFO] Epoch:[0/2](842600/4588595) loss:2.873 lr:0.0000100 epoch_Time:23843.0min: [2024-01-06 11:02:46,155][model8_pretrain.py][INFO] Epoch:[0/2](842600/4588595) loss:3.136 lr:0.0000100 epoch_Time:23843.0min: [2024-01-06 11:02:46,155][model8_pretrain.py][INFO] Epoch:[0/2](842600/4588595) loss:2.068 lr:0.0000100 epoch_Time:23843.0min: [2024-01-06 11:02:46,156][model8_pretrain.py][INFO] Epoch:[0/2](842600/4588595) loss:2.999 lr:0.0000100 epoch_Time:23843.0min: [2024-01-06 11:02:46,156][model8_pretrain.py][INFO] Epoch:[0/2](842600/4588595) loss:2.984 lr:0.0000100 epoch_Time:23843.0min: [2024-01-06 11:03:23,094][model8_pretrain.py][INFO] Epoch:[0/2](842700/4588595) loss:3.016 lr:0.0000100 
epoch_Time:23842.0min: [2024-01-06 11:03:23,094][model8_pretrain.py][INFO] Epoch:[0/2](842700/4588595) loss:2.968 lr:0.0000100 epoch_Time:23842.0min: [2024-01-06 11:03:23,094][model8_pretrain.py][INFO] Epoch:[0/2](842700/4588595) loss:2.126 lr:0.0000100 epoch_Time:23842.0min: [2024-01-06 11:03:23,094][model8_pretrain.py][INFO] Epoch:[0/2](842700/4588595) loss:2.584 lr:0.0000100 epoch_Time:23842.0min: [2024-01-06 11:03:23,094][model8_pretrain.py][INFO] Epoch:[0/2](842700/4588595) loss:3.168 lr:0.0000100 epoch_Time:23842.0min: [2024-01-06 11:03:23,094][model8_pretrain.py][INFO] Epoch:[0/2](842700/4588595) loss:2.609 lr:0.0000100 epoch_Time:23842.0min: [2024-01-06 11:03:23,094][model8_pretrain.py][INFO] Epoch:[0/2](842700/4588595) loss:2.405 lr:0.0000100 epoch_Time:23842.0min: [2024-01-06 11:03:23,094][model8_pretrain.py][INFO] Epoch:[0/2](842700/4588595) loss:2.799 lr:0.0000100 epoch_Time:23842.0min: [2024-01-06 11:04:00,048][model8_pretrain.py][INFO] Epoch:[0/2](842800/4588595) loss:3.022 lr:0.0000100 epoch_Time:23841.0min: [2024-01-06 11:04:00,048][model8_pretrain.py][INFO] Epoch:[0/2](842800/4588595) loss:2.948 lr:0.0000100 epoch_Time:23841.0min: [2024-01-06 11:04:00,049][model8_pretrain.py][INFO] Epoch:[0/2](842800/4588595) loss:2.514 lr:0.0000100 epoch_Time:23841.0min: [2024-01-06 11:04:00,048][model8_pretrain.py][INFO] Epoch:[0/2](842800/4588595) loss:2.417 lr:0.0000100 epoch_Time:23841.0min: [2024-01-06 11:04:00,048][model8_pretrain.py][INFO] Epoch:[0/2](842800/4588595) loss:2.732 lr:0.0000100 epoch_Time:23841.0min: [2024-01-06 11:04:00,049][model8_pretrain.py][INFO] Epoch:[0/2](842800/4588595) loss:3.152 lr:0.0000100 epoch_Time:23841.0min: [2024-01-06 11:04:00,049][model8_pretrain.py][INFO] Epoch:[0/2](842800/4588595) loss:2.587 lr:0.0000100 epoch_Time:23841.0min: [2024-01-06 11:04:00,049][model8_pretrain.py][INFO] Epoch:[0/2](842800/4588595) loss:2.726 lr:0.0000100 epoch_Time:23841.0min: [2024-01-06 11:04:38,612][model8_pretrain.py][INFO] 
Epoch:[0/2](842900/4588595) loss:2.286 lr:0.0000100 epoch_Time:23841.0min: [2024-01-06 11:04:38,612][model8_pretrain.py][INFO] Epoch:[0/2](842900/4588595) loss:3.313 lr:0.0000100 epoch_Time:23841.0min: [2024-01-06 11:04:38,612][model8_pretrain.py][INFO] Epoch:[0/2](842900/4588595) loss:2.945 lr:0.0000100 epoch_Time:23841.0min: [2024-01-06 11:04:38,612][model8_pretrain.py][INFO] Epoch:[0/2](842900/4588595) loss:3.464 lr:0.0000100 epoch_Time:23841.0min: [2024-01-06 11:04:38,612][model8_pretrain.py][INFO] Epoch:[0/2](842900/4588595) loss:3.194 lr:0.0000100 epoch_Time:23841.0min: [2024-01-06 11:04:38,612][model8_pretrain.py][INFO] Epoch:[0/2](842900/4588595) loss:2.828 lr:0.0000100 epoch_Time:23841.0min: [2024-01-06 11:04:38,612][model8_pretrain.py][INFO] Epoch:[0/2](842900/4588595) loss:3.125 lr:0.0000100 epoch_Time:23841.0min: [2024-01-06 11:04:38,612][model8_pretrain.py][INFO] Epoch:[0/2](842900/4588595) loss:2.761 lr:0.0000100 epoch_Time:23841.0min: [2024-01-06 11:05:23,998][model8_pretrain.py][INFO] Epoch:[0/2](843000/4588595) loss:2.825 lr:0.0000100 epoch_Time:23840.0min: [2024-01-06 11:05:23,998][model8_pretrain.py][INFO] Epoch:[0/2](843000/4588595) loss:2.505 lr:0.0000100 epoch_Time:23840.0min: [2024-01-06 11:05:23,998][model8_pretrain.py][INFO] Epoch:[0/2](843000/4588595) loss:2.217 lr:0.0000100 epoch_Time:23840.0min: [2024-01-06 11:05:23,998][model8_pretrain.py][INFO] Epoch:[0/2](843000/4588595) loss:2.934 lr:0.0000100 epoch_Time:23840.0min: [2024-01-06 11:05:23,998][model8_pretrain.py][INFO] Epoch:[0/2](843000/4588595) loss:2.609 lr:0.0000100 epoch_Time:23840.0min: [2024-01-06 11:05:23,998][model8_pretrain.py][INFO] Epoch:[0/2](843000/4588595) loss:3.190 lr:0.0000100 epoch_Time:23840.0min: [2024-01-06 11:05:23,998][model8_pretrain.py][INFO] Epoch:[0/2](843000/4588595) loss:3.007 lr:0.0000100 epoch_Time:23840.0min: [2024-01-06 11:05:23,998][model8_pretrain.py][INFO] Epoch:[0/2](843000/4588595) loss:2.781 lr:0.0000100 epoch_Time:23840.0min: [2024-01-06 
11:06:00,925][model8_pretrain.py][INFO] Epoch:[0/2](843100/4588595) loss:2.983 lr:0.0000100 epoch_Time:23839.0min: [2024-01-06 11:06:00,925][model8_pretrain.py][INFO] Epoch:[0/2](843100/4588595) loss:2.774 lr:0.0000100 epoch_Time:23839.0min: [2024-01-06 11:06:00,925][model8_pretrain.py][INFO] Epoch:[0/2](843100/4588595) loss:2.775 lr:0.0000100 epoch_Time:23839.0min: [2024-01-06 11:06:00,925][model8_pretrain.py][INFO] Epoch:[0/2](843100/4588595) loss:3.074 lr:0.0000100 epoch_Time:23839.0min: [2024-01-06 11:06:00,925][model8_pretrain.py][INFO] Epoch:[0/2](843100/4588595) loss:3.002 lr:0.0000100 epoch_Time:23839.0min: [2024-01-06 11:06:00,925][model8_pretrain.py][INFO] Epoch:[0/2](843100/4588595) loss:2.921 lr:0.0000100 epoch_Time:23839.0min: [2024-01-06 11:06:00,925][model8_pretrain.py][INFO] Epoch:[0/2](843100/4588595) loss:2.536 lr:0.0000100 epoch_Time:23839.0min: [2024-01-06 11:06:00,925][model8_pretrain.py][INFO] Epoch:[0/2](843100/4588595) loss:3.266 lr:0.0000100 epoch_Time:23839.0min: [2024-01-06 11:06:37,861][model8_pretrain.py][INFO] Epoch:[0/2](843200/4588595) loss:2.673 lr:0.0000100 epoch_Time:23839.0min: [2024-01-06 11:06:37,861][model8_pretrain.py][INFO] Epoch:[0/2](843200/4588595) loss:2.957 lr:0.0000100 epoch_Time:23839.0min: [2024-01-06 11:06:37,861][model8_pretrain.py][INFO] Epoch:[0/2](843200/4588595) loss:3.550 lr:0.0000100 epoch_Time:23839.0min: [2024-01-06 11:06:37,861][model8_pretrain.py][INFO] Epoch:[0/2](843200/4588595) loss:2.650 lr:0.0000100 epoch_Time:23839.0min: [2024-01-06 11:06:37,861][model8_pretrain.py][INFO] Epoch:[0/2](843200/4588595) loss:3.001 lr:0.0000100 epoch_Time:23839.0min: [2024-01-06 11:06:37,861][model8_pretrain.py][INFO] Epoch:[0/2](843200/4588595) loss:2.296 lr:0.0000100 epoch_Time:23839.0min: [2024-01-06 11:06:37,861][model8_pretrain.py][INFO] Epoch:[0/2](843200/4588595) loss:2.951 lr:0.0000100 epoch_Time:23839.0min: [2024-01-06 11:06:37,861][model8_pretrain.py][INFO] Epoch:[0/2](843200/4588595) loss:2.556 lr:0.0000100 
epoch_Time:23839.0min: [2024-01-06 11:07:14,796][model8_pretrain.py][INFO] Epoch:[0/2](843300/4588595) loss:2.889 lr:0.0000100 epoch_Time:23838.0min: [2024-01-06 11:07:14,796][model8_pretrain.py][INFO] Epoch:[0/2](843300/4588595) loss:2.080 lr:0.0000100 epoch_Time:23838.0min: [2024-01-06 11:07:14,796][model8_pretrain.py][INFO] Epoch:[0/2](843300/4588595) loss:2.823 lr:0.0000100 epoch_Time:23838.0min: [2024-01-06 11:07:14,796][model8_pretrain.py][INFO] Epoch:[0/2](843300/4588595) loss:2.009 lr:0.0000100 epoch_Time:23838.0min: [2024-01-06 11:07:14,796][model8_pretrain.py][INFO] Epoch:[0/2](843300/4588595) loss:2.506 lr:0.0000100 epoch_Time:23838.0min: [2024-01-06 11:07:14,796][model8_pretrain.py][INFO] Epoch:[0/2](843300/4588595) loss:2.396 lr:0.0000100 epoch_Time:23838.0min: [2024-01-06 11:07:14,796][model8_pretrain.py][INFO] Epoch:[0/2](843300/4588595) loss:3.323 lr:0.0000100 epoch_Time:23838.0min: [2024-01-06 11:07:14,796][model8_pretrain.py][INFO] Epoch:[0/2](843300/4588595) loss:2.448 lr:0.0000100 epoch_Time:23838.0min: [2024-01-06 11:07:51,739][model8_pretrain.py][INFO] Epoch:[0/2](843400/4588595) loss:2.640 lr:0.0000100 epoch_Time:23837.0min: [2024-01-06 11:07:51,739][model8_pretrain.py][INFO] Epoch:[0/2](843400/4588595) loss:2.681 lr:0.0000100 epoch_Time:23837.0min: [2024-01-06 11:07:51,739][model8_pretrain.py][INFO] Epoch:[0/2](843400/4588595) loss:2.630 lr:0.0000100 epoch_Time:23837.0min: [2024-01-06 11:07:51,739][model8_pretrain.py][INFO] Epoch:[0/2](843400/4588595) loss:2.991 lr:0.0000100 epoch_Time:23837.0min: [2024-01-06 11:07:51,739][model8_pretrain.py][INFO] Epoch:[0/2](843400/4588595) loss:2.691 lr:0.0000100 epoch_Time:23837.0min: [2024-01-06 11:07:51,739][model8_pretrain.py][INFO] Epoch:[0/2](843400/4588595) loss:2.728 lr:0.0000100 epoch_Time:23837.0min: [2024-01-06 11:07:51,739][model8_pretrain.py][INFO] Epoch:[0/2](843400/4588595) loss:2.654 lr:0.0000100 epoch_Time:23837.0min: [2024-01-06 11:07:51,739][model8_pretrain.py][INFO] 
Epoch:[0/2](843400/4588595) loss:3.263 lr:0.0000100 epoch_Time:23837.0min: [2024-01-06 11:08:28,675][model8_pretrain.py][INFO] Epoch:[0/2](843500/4588595) loss:2.080 lr:0.0000100 epoch_Time:23837.0min: [2024-01-06 11:08:28,675][model8_pretrain.py][INFO] Epoch:[0/2](843500/4588595) loss:2.850 lr:0.0000100 epoch_Time:23837.0min: [2024-01-06 11:08:28,675][model8_pretrain.py][INFO] Epoch:[0/2](843500/4588595) loss:2.990 lr:0.0000100 epoch_Time:23837.0min: [2024-01-06 11:08:28,675][model8_pretrain.py][INFO] Epoch:[0/2](843500/4588595) loss:3.135 lr:0.0000100 epoch_Time:23837.0min: [2024-01-06 11:08:28,675][model8_pretrain.py][INFO] Epoch:[0/2](843500/4588595) loss:2.055 lr:0.0000100 epoch_Time:23837.0min: [2024-01-06 11:08:28,675][model8_pretrain.py][INFO] Epoch:[0/2](843500/4588595) loss:3.183 lr:0.0000100 epoch_Time:23837.0min: [2024-01-06 11:08:28,675][model8_pretrain.py][INFO] Epoch:[0/2](843500/4588595) loss:2.883 lr:0.0000100 epoch_Time:23837.0min: [2024-01-06 11:08:28,675][model8_pretrain.py][INFO] Epoch:[0/2](843500/4588595) loss:2.919 lr:0.0000100 epoch_Time:23837.0min: [2024-01-06 11:09:05,617][model8_pretrain.py][INFO] Epoch:[0/2](843600/4588595) loss:2.902 lr:0.0000100 epoch_Time:23836.0min: [2024-01-06 11:09:05,617][model8_pretrain.py][INFO] Epoch:[0/2](843600/4588595) loss:2.881 lr:0.0000100 epoch_Time:23836.0min: [2024-01-06 11:09:05,617][model8_pretrain.py][INFO] Epoch:[0/2](843600/4588595) loss:2.910 lr:0.0000100 epoch_Time:23836.0min: [2024-01-06 11:09:05,617][model8_pretrain.py][INFO] Epoch:[0/2](843600/4588595) loss:2.571 lr:0.0000100 epoch_Time:23836.0min: [2024-01-06 11:09:05,617][model8_pretrain.py][INFO] Epoch:[0/2](843600/4588595) loss:2.411 lr:0.0000100 epoch_Time:23836.0min: [2024-01-06 11:09:05,617][model8_pretrain.py][INFO] Epoch:[0/2](843600/4588595) loss:2.794 lr:0.0000100 epoch_Time:23836.0min: [2024-01-06 11:09:05,617][model8_pretrain.py][INFO] Epoch:[0/2](843600/4588595) loss:3.156 lr:0.0000100 epoch_Time:23836.0min: [2024-01-06 
11:09:05,617][model8_pretrain.py][INFO] Epoch:[0/2](843600/4588595) loss:2.334 lr:0.0000100 epoch_Time:23836.0min: [2024-01-06 11:09:44,033][model8_pretrain.py][INFO] Epoch:[0/2](843700/4588595) loss:2.845 lr:0.0000100 epoch_Time:23836.0min: [2024-01-06 11:09:44,033][model8_pretrain.py][INFO] Epoch:[0/2](843700/4588595) loss:2.958 lr:0.0000100 epoch_Time:23836.0min: [2024-01-06 11:09:44,033][model8_pretrain.py][INFO] Epoch:[0/2](843700/4588595) loss:3.340 lr:0.0000100 epoch_Time:23836.0min: [2024-01-06 11:09:44,033][model8_pretrain.py][INFO] Epoch:[0/2](843700/4588595) loss:3.044 lr:0.0000100 epoch_Time:23836.0min: [2024-01-06 11:09:44,033][model8_pretrain.py][INFO] Epoch:[0/2](843700/4588595) loss:2.763 lr:0.0000100 epoch_Time:23836.0min: [2024-01-06 11:09:44,033][model8_pretrain.py][INFO] Epoch:[0/2](843700/4588595) loss:2.737 lr:0.0000100 epoch_Time:23836.0min: [2024-01-06 11:09:44,033][model8_pretrain.py][INFO] Epoch:[0/2](843700/4588595) loss:2.995 lr:0.0000100 epoch_Time:23836.0min: [2024-01-06 11:09:44,033][model8_pretrain.py][INFO] Epoch:[0/2](843700/4588595) loss:2.819 lr:0.0000100 epoch_Time:23836.0min: [2024-01-06 11:10:29,485][model8_pretrain.py][INFO] Epoch:[0/2](843800/4588595) loss:2.205 lr:0.0000100 epoch_Time:23835.0min: [2024-01-06 11:10:29,485][model8_pretrain.py][INFO] Epoch:[0/2](843800/4588595) loss:3.395 lr:0.0000100 epoch_Time:23835.0min: [2024-01-06 11:10:29,485][model8_pretrain.py][INFO] Epoch:[0/2](843800/4588595) loss:2.834 lr:0.0000100 epoch_Time:23835.0min: [2024-01-06 11:10:29,485][model8_pretrain.py][INFO] Epoch:[0/2](843800/4588595) loss:2.384 lr:0.0000100 epoch_Time:23835.0min: [2024-01-06 11:10:29,485][model8_pretrain.py][INFO] Epoch:[0/2](843800/4588595) loss:2.683 lr:0.0000100 epoch_Time:23835.0min: [2024-01-06 11:10:29,485][model8_pretrain.py][INFO] Epoch:[0/2](843800/4588595) loss:3.097 lr:0.0000100 epoch_Time:23835.0min: [2024-01-06 11:10:29,485][model8_pretrain.py][INFO] Epoch:[0/2](843800/4588595) loss:2.144 lr:0.0000100 
epoch_Time:23835.0min: [2024-01-06 11:10:29,485][model8_pretrain.py][INFO] Epoch:[0/2](843800/4588595) loss:3.045 lr:0.0000100 epoch_Time:23835.0min: [2024-01-06 11:11:06,412][model8_pretrain.py][INFO] Epoch:[0/2](843900/4588595) loss:3.034 lr:0.0000100 epoch_Time:23834.0min: [2024-01-06 11:11:06,412][model8_pretrain.py][INFO] Epoch:[0/2](843900/4588595) loss:2.918 lr:0.0000100 epoch_Time:23834.0min: [2024-01-06 11:11:06,412][model8_pretrain.py][INFO] Epoch:[0/2](843900/4588595) loss:2.733 lr:0.0000100 epoch_Time:23834.0min: [2024-01-06 11:11:06,412][model8_pretrain.py][INFO] Epoch:[0/2](843900/4588595) loss:3.388 lr:0.0000100 epoch_Time:23834.0min: [2024-01-06 11:11:06,412][model8_pretrain.py][INFO] Epoch:[0/2](843900/4588595) loss:2.465 lr:0.0000100 epoch_Time:23834.0min: [2024-01-06 11:11:06,412][model8_pretrain.py][INFO] Epoch:[0/2](843900/4588595) loss:2.643 lr:0.0000100 epoch_Time:23834.0min: [2024-01-06 11:11:06,412][model8_pretrain.py][INFO] Epoch:[0/2](843900/4588595) loss:2.693 lr:0.0000100 epoch_Time:23834.0min: [2024-01-06 11:11:06,412][model8_pretrain.py][INFO] Epoch:[0/2](843900/4588595) loss:2.870 lr:0.0000100 epoch_Time:23834.0min: [2024-01-06 11:11:43,343][model8_pretrain.py][INFO] Epoch:[0/2](844000/4588595) loss:2.457 lr:0.0000100 epoch_Time:23834.0min: [2024-01-06 11:11:43,343][model8_pretrain.py][INFO] Epoch:[0/2](844000/4588595) loss:2.378 lr:0.0000100 epoch_Time:23834.0min: [2024-01-06 11:11:43,343][model8_pretrain.py][INFO] Epoch:[0/2](844000/4588595) loss:2.732 lr:0.0000100 epoch_Time:23834.0min: [2024-01-06 11:11:43,343][model8_pretrain.py][INFO] Epoch:[0/2](844000/4588595) loss:2.506 lr:0.0000100 epoch_Time:23834.0min: [2024-01-06 11:11:43,343][model8_pretrain.py][INFO] Epoch:[0/2](844000/4588595) loss:2.911 lr:0.0000100 epoch_Time:23834.0min: [2024-01-06 11:11:43,343][model8_pretrain.py][INFO] Epoch:[0/2](844000/4588595) loss:2.719 lr:0.0000100 epoch_Time:23834.0min: [2024-01-06 11:11:43,343][model8_pretrain.py][INFO] 
Epoch:[0/2](844000/4588595) loss:2.974 lr:0.0000100 epoch_Time:23834.0min: [2024-01-06 11:11:43,343][model8_pretrain.py][INFO] Epoch:[0/2](844000/4588595) loss:2.776 lr:0.0000100 epoch_Time:23834.0min: [2024-01-06 11:12:20,276][model8_pretrain.py][INFO] Epoch:[0/2](844100/4588595) loss:3.050 lr:0.0000100 epoch_Time:23833.0min: [2024-01-06 11:12:20,276][model8_pretrain.py][INFO] Epoch:[0/2](844100/4588595) loss:2.927 lr:0.0000100 epoch_Time:23833.0min: [2024-01-06 11:12:20,276][model8_pretrain.py][INFO] Epoch:[0/2](844100/4588595) loss:2.571 lr:0.0000100 epoch_Time:23833.0min: [2024-01-06 11:12:20,276][model8_pretrain.py][INFO] Epoch:[0/2](844100/4588595) loss:3.053 lr:0.0000100 epoch_Time:23833.0min: [2024-01-06 11:12:20,276][model8_pretrain.py][INFO] Epoch:[0/2](844100/4588595) loss:3.051 lr:0.0000100 epoch_Time:23833.0min: [2024-01-06 11:12:20,276][model8_pretrain.py][INFO] Epoch:[0/2](844100/4588595) loss:2.302 lr:0.0000100 epoch_Time:23833.0min: [2024-01-06 11:12:20,277][model8_pretrain.py][INFO] Epoch:[0/2](844100/4588595) loss:2.972 lr:0.0000100 epoch_Time:23833.0min: [2024-01-06 11:12:20,277][model8_pretrain.py][INFO] Epoch:[0/2](844100/4588595) loss:3.270 lr:0.0000100 epoch_Time:23833.0min: [2024-01-06 11:12:57,207][model8_pretrain.py][INFO] Epoch:[0/2](844200/4588595) loss:2.687 lr:0.0000100 epoch_Time:23832.0min: [2024-01-06 11:12:57,207][model8_pretrain.py][INFO] Epoch:[0/2](844200/4588595) loss:3.206 lr:0.0000100 epoch_Time:23832.0min: [2024-01-06 11:12:57,207][model8_pretrain.py][INFO] Epoch:[0/2](844200/4588595) loss:2.659 lr:0.0000100 epoch_Time:23832.0min: [2024-01-06 11:12:57,207][model8_pretrain.py][INFO] Epoch:[0/2](844200/4588595) loss:3.441 lr:0.0000100 epoch_Time:23832.0min: [2024-01-06 11:12:57,207][model8_pretrain.py][INFO] Epoch:[0/2](844200/4588595) loss:2.841 lr:0.0000100 epoch_Time:23832.0min: [2024-01-06 11:12:57,207][model8_pretrain.py][INFO] Epoch:[0/2](844200/4588595) loss:2.811 lr:0.0000100 epoch_Time:23832.0min: [2024-01-06 
11:12:57,207][model8_pretrain.py][INFO] Epoch:[0/2](844200/4588595) loss:3.081 lr:0.0000100 epoch_Time:23832.0min: [2024-01-06 11:12:57,207][model8_pretrain.py][INFO] Epoch:[0/2](844200/4588595) loss:3.242 lr:0.0000100 epoch_Time:23832.0min: [2024-01-06 11:13:34,139][model8_pretrain.py][INFO] Epoch:[0/2](844300/4588595) loss:2.345 lr:0.0000100 epoch_Time:23832.0min: [2024-01-06 11:13:34,139][model8_pretrain.py][INFO] Epoch:[0/2](844300/4588595) loss:3.267 lr:0.0000100 epoch_Time:23832.0min: [2024-01-06 11:13:34,139][model8_pretrain.py][INFO] Epoch:[0/2](844300/4588595) loss:2.958 lr:0.0000100 epoch_Time:23832.0min: [2024-01-06 11:13:34,140][model8_pretrain.py][INFO] Epoch:[0/2](844300/4588595) loss:2.813 lr:0.0000100 epoch_Time:23832.0min: [2024-01-06 11:13:34,139][model8_pretrain.py][INFO] Epoch:[0/2](844300/4588595) loss:3.175 lr:0.0000100 epoch_Time:23832.0min: [2024-01-06 11:13:34,140][model8_pretrain.py][INFO] Epoch:[0/2](844300/4588595) loss:2.541 lr:0.0000100 epoch_Time:23832.0min: [2024-01-06 11:13:34,140][model8_pretrain.py][INFO] Epoch:[0/2](844300/4588595) loss:2.887 lr:0.0000100 epoch_Time:23832.0min: [2024-01-06 11:13:34,140][model8_pretrain.py][INFO] Epoch:[0/2](844300/4588595) loss:3.584 lr:0.0000100 epoch_Time:23832.0min: [2024-01-06 11:14:11,081][model8_pretrain.py][INFO] Epoch:[0/2](844400/4588595) loss:2.545 lr:0.0000100 epoch_Time:23831.0min: [2024-01-06 11:14:11,081][model8_pretrain.py][INFO] Epoch:[0/2](844400/4588595) loss:2.807 lr:0.0000100 epoch_Time:23831.0min: [2024-01-06 11:14:11,081][model8_pretrain.py][INFO] Epoch:[0/2](844400/4588595) loss:2.444 lr:0.0000100 epoch_Time:23831.0min: [2024-01-06 11:14:11,081][model8_pretrain.py][INFO] Epoch:[0/2](844400/4588595) loss:3.220 lr:0.0000100 epoch_Time:23831.0min: [2024-01-06 11:14:11,081][model8_pretrain.py][INFO] Epoch:[0/2](844400/4588595) loss:3.243 lr:0.0000100 epoch_Time:23831.0min: [2024-01-06 11:14:11,081][model8_pretrain.py][INFO] Epoch:[0/2](844400/4588595) loss:2.317 lr:0.0000100 
epoch_Time:23831.0min: [2024-01-06 11:14:11,081][model8_pretrain.py][INFO] Epoch:[0/2](844400/4588595) loss:2.550 lr:0.0000100 epoch_Time:23831.0min: [2024-01-06 11:14:11,081][model8_pretrain.py][INFO] Epoch:[0/2](844400/4588595) loss:2.841 lr:0.0000100 epoch_Time:23831.0min: [2024-01-06 11:14:49,561][model8_pretrain.py][INFO] Epoch:[0/2](844500/4588595) loss:2.731 lr:0.0000100 epoch_Time:23830.0min: [2024-01-06 11:14:49,561][model8_pretrain.py][INFO] Epoch:[0/2](844500/4588595) loss:3.078 lr:0.0000100 epoch_Time:23830.0min: [2024-01-06 11:14:49,561][model8_pretrain.py][INFO] Epoch:[0/2](844500/4588595) loss:2.542 lr:0.0000100 epoch_Time:23830.0min: [2024-01-06 11:14:49,561][model8_pretrain.py][INFO] Epoch:[0/2](844500/4588595) loss:2.637 lr:0.0000100 epoch_Time:23830.0min: [2024-01-06 11:14:49,561][model8_pretrain.py][INFO] Epoch:[0/2](844500/4588595) loss:3.183 lr:0.0000100 epoch_Time:23830.0min: [2024-01-06 11:14:49,561][model8_pretrain.py][INFO] Epoch:[0/2](844500/4588595) loss:2.947 lr:0.0000100 epoch_Time:23830.0min: [2024-01-06 11:14:49,561][model8_pretrain.py][INFO] Epoch:[0/2](844500/4588595) loss:2.594 lr:0.0000100 epoch_Time:23830.0min: [2024-01-06 11:14:49,561][model8_pretrain.py][INFO] Epoch:[0/2](844500/4588595) loss:2.537 lr:0.0000100 epoch_Time:23830.0min: [2024-01-06 11:15:34,959][model8_pretrain.py][INFO] Epoch:[0/2](844600/4588595) loss:2.875 lr:0.0000100 epoch_Time:23830.0min: [2024-01-06 11:15:34,959][model8_pretrain.py][INFO] Epoch:[0/2](844600/4588595) loss:2.924 lr:0.0000100 epoch_Time:23830.0min: [2024-01-06 11:15:34,959][model8_pretrain.py][INFO] Epoch:[0/2](844600/4588595) loss:2.739 lr:0.0000100 epoch_Time:23830.0min: [2024-01-06 11:15:34,959][model8_pretrain.py][INFO] Epoch:[0/2](844600/4588595) loss:3.051 lr:0.0000100 epoch_Time:23830.0min: [2024-01-06 11:15:34,960][model8_pretrain.py][INFO] Epoch:[0/2](844600/4588595) loss:3.292 lr:0.0000100 epoch_Time:23830.0min: [2024-01-06 11:15:34,960][model8_pretrain.py][INFO] 
Epoch:[0/2](844600/4588595) loss:3.187 lr:0.0000100 epoch_Time:23830.0min: [2024-01-06 11:15:34,960][model8_pretrain.py][INFO] Epoch:[0/2](844600/4588595) loss:2.287 lr:0.0000100 epoch_Time:23830.0min: [2024-01-06 11:15:34,960][model8_pretrain.py][INFO] Epoch:[0/2](844600/4588595) loss:2.811 lr:0.0000100 epoch_Time:23830.0min: [2024-01-06 11:16:11,886][model8_pretrain.py][INFO] Epoch:[0/2](844700/4588595) loss:3.046 lr:0.0000100 epoch_Time:23829.0min: [2024-01-06 11:16:11,886][model8_pretrain.py][INFO] Epoch:[0/2](844700/4588595) loss:2.513 lr:0.0000100 epoch_Time:23829.0min: [2024-01-06 11:16:11,886][model8_pretrain.py][INFO] Epoch:[0/2](844700/4588595) loss:3.377 lr:0.0000100 epoch_Time:23829.0min: [2024-01-06 11:16:11,886][model8_pretrain.py][INFO] Epoch:[0/2](844700/4588595) loss:2.305 lr:0.0000100 epoch_Time:23829.0min: [2024-01-06 11:16:11,886][model8_pretrain.py][INFO] Epoch:[0/2](844700/4588595) loss:2.767 lr:0.0000100 epoch_Time:23829.0min: [2024-01-06 11:16:11,886][model8_pretrain.py][INFO] Epoch:[0/2](844700/4588595) loss:3.103 lr:0.0000100 epoch_Time:23829.0min: [2024-01-06 11:16:11,886][model8_pretrain.py][INFO] Epoch:[0/2](844700/4588595) loss:2.974 lr:0.0000100 epoch_Time:23829.0min: [2024-01-06 11:16:11,886][model8_pretrain.py][INFO] Epoch:[0/2](844700/4588595) loss:2.770 lr:0.0000100 epoch_Time:23829.0min: [2024-01-06 11:16:48,813][model8_pretrain.py][INFO] Epoch:[0/2](844800/4588595) loss:2.675 lr:0.0000100 epoch_Time:23828.0min: [2024-01-06 11:16:48,813][model8_pretrain.py][INFO] Epoch:[0/2](844800/4588595) loss:3.118 lr:0.0000100 epoch_Time:23828.0min: [2024-01-06 11:16:48,813][model8_pretrain.py][INFO] Epoch:[0/2](844800/4588595) loss:2.755 lr:0.0000100 epoch_Time:23828.0min: [2024-01-06 11:16:48,813][model8_pretrain.py][INFO] Epoch:[0/2](844800/4588595) loss:2.969 lr:0.0000100 epoch_Time:23828.0min: [2024-01-06 11:16:48,813][model8_pretrain.py][INFO] Epoch:[0/2](844800/4588595) loss:2.639 lr:0.0000100 epoch_Time:23828.0min: [2024-01-06 
11:16:48,814][model8_pretrain.py][INFO] Epoch:[0/2](844800/4588595) loss:2.717 lr:0.0000100 epoch_Time:23828.0min: [2024-01-06 11:16:48,814][model8_pretrain.py][INFO] Epoch:[0/2](844800/4588595) loss:2.645 lr:0.0000100 epoch_Time:23828.0min: [2024-01-06 11:16:48,814][model8_pretrain.py][INFO] Epoch:[0/2](844800/4588595) loss:2.498 lr:0.0000100 epoch_Time:23828.0min: [2024-01-06 11:17:25,747][model8_pretrain.py][INFO] Epoch:[0/2](844900/4588595) loss:2.746 lr:0.0000100 epoch_Time:23828.0min: [2024-01-06 11:17:25,747][model8_pretrain.py][INFO] Epoch:[0/2](844900/4588595) loss:3.015 lr:0.0000100 epoch_Time:23828.0min: [2024-01-06 11:17:25,747][model8_pretrain.py][INFO] Epoch:[0/2](844900/4588595) loss:3.216 lr:0.0000100 epoch_Time:23828.0min: [2024-01-06 11:17:25,747][model8_pretrain.py][INFO] Epoch:[0/2](844900/4588595) loss:2.645 lr:0.0000100 epoch_Time:23828.0min: [2024-01-06 11:17:25,747][model8_pretrain.py][INFO] Epoch:[0/2](844900/4588595) loss:2.304 lr:0.0000100 epoch_Time:23828.0min: [2024-01-06 11:17:25,747][model8_pretrain.py][INFO] Epoch:[0/2](844900/4588595) loss:2.741 lr:0.0000100 epoch_Time:23828.0min: [2024-01-06 11:17:25,747][model8_pretrain.py][INFO] Epoch:[0/2](844900/4588595) loss:2.824 lr:0.0000100 epoch_Time:23828.0min: [2024-01-06 11:17:25,747][model8_pretrain.py][INFO] Epoch:[0/2](844900/4588595) loss:3.052 lr:0.0000100 epoch_Time:23828.0min: [2024-01-06 11:18:02,700][model8_pretrain.py][INFO] Epoch:[0/2](845000/4588595) loss:2.940 lr:0.0000100 epoch_Time:23827.0min: [2024-01-06 11:18:02,700][model8_pretrain.py][INFO] Epoch:[0/2](845000/4588595) loss:2.254 lr:0.0000100 epoch_Time:23827.0min: [2024-01-06 11:18:02,700][model8_pretrain.py][INFO] Epoch:[0/2](845000/4588595) loss:2.483 lr:0.0000100 epoch_Time:23827.0min: [2024-01-06 11:18:02,700][model8_pretrain.py][INFO] Epoch:[0/2](845000/4588595) loss:2.970 lr:0.0000100 epoch_Time:23827.0min: [2024-01-06 11:18:02,700][model8_pretrain.py][INFO] Epoch:[0/2](845000/4588595) loss:3.081 lr:0.0000100 
epoch_Time:23827.0min: [2024-01-06 11:18:02,701][model8_pretrain.py][INFO] Epoch:[0/2](845000/4588595) loss:2.422 lr:0.0000100 epoch_Time:23827.0min: [2024-01-06 11:18:02,701][model8_pretrain.py][INFO] Epoch:[0/2](845000/4588595) loss:2.733 lr:0.0000100 epoch_Time:23827.0min: [2024-01-06 11:18:02,701][model8_pretrain.py][INFO] Epoch:[0/2](845000/4588595) loss:2.774 lr:0.0000100 epoch_Time:23827.0min: [2024-01-06 11:18:39,673][model8_pretrain.py][INFO] Epoch:[0/2](845100/4588595) loss:2.728 lr:0.0000100 epoch_Time:23827.0min: [2024-01-06 11:18:39,673][model8_pretrain.py][INFO] Epoch:[0/2](845100/4588595) loss:2.280 lr:0.0000100 epoch_Time:23827.0min: [2024-01-06 11:18:39,673][model8_pretrain.py][INFO] Epoch:[0/2](845100/4588595) loss:3.193 lr:0.0000100 epoch_Time:23827.0min: [2024-01-06 11:18:39,673][model8_pretrain.py][INFO] Epoch:[0/2](845100/4588595) loss:3.236 lr:0.0000100 epoch_Time:23827.0min: [2024-01-06 11:18:39,673][model8_pretrain.py][INFO] Epoch:[0/2](845100/4588595) loss:3.026 lr:0.0000100 epoch_Time:23827.0min: [2024-01-06 11:18:39,673][model8_pretrain.py][INFO] Epoch:[0/2](845100/4588595) loss:3.089 lr:0.0000100 epoch_Time:23827.0min: [2024-01-06 11:18:39,673][model8_pretrain.py][INFO] Epoch:[0/2](845100/4588595) loss:2.399 lr:0.0000100 epoch_Time:23827.0min: [2024-01-06 11:18:39,673][model8_pretrain.py][INFO] Epoch:[0/2](845100/4588595) loss:2.977 lr:0.0000100 epoch_Time:23827.0min: [2024-01-06 11:19:16,605][model8_pretrain.py][INFO] Epoch:[0/2](845200/4588595) loss:3.212 lr:0.0000100 epoch_Time:23826.0min: [2024-01-06 11:19:16,605][model8_pretrain.py][INFO] Epoch:[0/2](845200/4588595) loss:2.744 lr:0.0000100 epoch_Time:23826.0min: [2024-01-06 11:19:16,605][model8_pretrain.py][INFO] Epoch:[0/2](845200/4588595) loss:3.055 lr:0.0000100 epoch_Time:23826.0min: [2024-01-06 11:19:16,605][model8_pretrain.py][INFO] Epoch:[0/2](845200/4588595) loss:3.095 lr:0.0000100 epoch_Time:23826.0min: [2024-01-06 11:19:16,605][model8_pretrain.py][INFO] 
Epoch:[0/2](845200/4588595) loss:3.012 lr:0.0000100 epoch_Time:23826.0min: [2024-01-06 11:19:16,605][model8_pretrain.py][INFO] Epoch:[0/2](845200/4588595) loss:2.827 lr:0.0000100 epoch_Time:23826.0min: [2024-01-06 11:19:16,606][model8_pretrain.py][INFO] Epoch:[0/2](845200/4588595) loss:2.274 lr:0.0000100 epoch_Time:23826.0min: [2024-01-06 11:19:16,606][model8_pretrain.py][INFO] Epoch:[0/2](845200/4588595) loss:3.019 lr:0.0000100 epoch_Time:23826.0min: [2024-01-06 11:19:53,540][model8_pretrain.py][INFO] Epoch:[0/2](845300/4588595) loss:2.606 lr:0.0000100 epoch_Time:23825.0min: [2024-01-06 11:19:53,540][model8_pretrain.py][INFO] Epoch:[0/2](845300/4588595) loss:2.682 lr:0.0000100 epoch_Time:23825.0min: [2024-01-06 11:19:53,540][model8_pretrain.py][INFO] Epoch:[0/2](845300/4588595) loss:3.131 lr:0.0000100 epoch_Time:23825.0min: [2024-01-06 11:19:53,540][model8_pretrain.py][INFO] Epoch:[0/2](845300/4588595) loss:2.435 lr:0.0000100 epoch_Time:23825.0min: [2024-01-06 11:19:53,540][model8_pretrain.py][INFO] Epoch:[0/2](845300/4588595) loss:2.490 lr:0.0000100 epoch_Time:23825.0min: [2024-01-06 11:19:53,540][model8_pretrain.py][INFO] Epoch:[0/2](845300/4588595) loss:3.121 lr:0.0000100 epoch_Time:23825.0min: [2024-01-06 11:19:53,540][model8_pretrain.py][INFO] Epoch:[0/2](845300/4588595) loss:3.139 lr:0.0000100 epoch_Time:23825.0min: [2024-01-06 11:19:53,540][model8_pretrain.py][INFO] Epoch:[0/2](845300/4588595) loss:3.020 lr:0.0000100 epoch_Time:23825.0min: [2024-01-06 11:20:40,541][model8_pretrain.py][INFO] Epoch:[0/2](845400/4588595) loss:3.062 lr:0.0000100 epoch_Time:23825.0min: [2024-01-06 11:20:40,541][model8_pretrain.py][INFO] Epoch:[0/2](845400/4588595) loss:3.021 lr:0.0000100 epoch_Time:23825.0min: [2024-01-06 11:20:40,541][model8_pretrain.py][INFO] Epoch:[0/2](845400/4588595) loss:3.108 lr:0.0000100 epoch_Time:23825.0min: [2024-01-06 11:20:40,541][model8_pretrain.py][INFO] Epoch:[0/2](845400/4588595) loss:3.216 lr:0.0000100 epoch_Time:23825.0min: [2024-01-06 
11:20:40,541][model8_pretrain.py][INFO] Epoch:[0/2](845400/4588595) loss:2.638 lr:0.0000100 epoch_Time:23825.0min: [2024-01-06 11:20:40,541][model8_pretrain.py][INFO] Epoch:[0/2](845400/4588595) loss:2.943 lr:0.0000100 epoch_Time:23825.0min: [2024-01-06 11:20:40,541][model8_pretrain.py][INFO] Epoch:[0/2](845400/4588595) loss:2.547 lr:0.0000100 epoch_Time:23825.0min: [2024-01-06 11:20:40,541][model8_pretrain.py][INFO] Epoch:[0/2](845400/4588595) loss:3.008 lr:0.0000100 epoch_Time:23825.0min: [2024-01-06 11:21:17,466][model8_pretrain.py][INFO] Epoch:[0/2](845500/4588595) loss:2.704 lr:0.0000100 epoch_Time:23824.0min: [2024-01-06 11:21:17,466][model8_pretrain.py][INFO] Epoch:[0/2](845500/4588595) loss:2.332 lr:0.0000100 epoch_Time:23824.0min: [2024-01-06 11:21:17,466][model8_pretrain.py][INFO] Epoch:[0/2](845500/4588595) loss:3.134 lr:0.0000100 epoch_Time:23824.0min: [2024-01-06 11:21:17,466][model8_pretrain.py][INFO] Epoch:[0/2](845500/4588595) loss:3.216 lr:0.0000100 epoch_Time:23824.0min: [2024-01-06 11:21:17,466][model8_pretrain.py][INFO] Epoch:[0/2](845500/4588595) loss:2.740 lr:0.0000100 epoch_Time:23824.0min: [2024-01-06 11:21:17,466][model8_pretrain.py][INFO] Epoch:[0/2](845500/4588595) loss:2.972 lr:0.0000100 epoch_Time:23824.0min: [2024-01-06 11:21:17,466][model8_pretrain.py][INFO] Epoch:[0/2](845500/4588595) loss:2.482 lr:0.0000100 epoch_Time:23824.0min: [2024-01-06 11:21:17,466][model8_pretrain.py][INFO] Epoch:[0/2](845500/4588595) loss:2.334 lr:0.0000100 epoch_Time:23824.0min: [2024-01-06 11:21:54,391][model8_pretrain.py][INFO] Epoch:[0/2](845600/4588595) loss:2.726 lr:0.0000100 epoch_Time:23823.0min: [2024-01-06 11:21:54,391][model8_pretrain.py][INFO] Epoch:[0/2](845600/4588595) loss:2.503 lr:0.0000100 epoch_Time:23823.0min: [2024-01-06 11:21:54,391][model8_pretrain.py][INFO] Epoch:[0/2](845600/4588595) loss:2.275 lr:0.0000100 epoch_Time:23823.0min: [2024-01-06 11:21:54,391][model8_pretrain.py][INFO] Epoch:[0/2](845600/4588595) loss:2.680 lr:0.0000100 
epoch_Time:23823.0min: [2024-01-06 11:21:54,391][model8_pretrain.py][INFO] Epoch:[0/2](845600/4588595) loss:2.451 lr:0.0000100 epoch_Time:23823.0min: [2024-01-06 11:21:54,392][model8_pretrain.py][INFO] Epoch:[0/2](845600/4588595) loss:3.104 lr:0.0000100 epoch_Time:23823.0min: [2024-01-06 11:21:54,392][model8_pretrain.py][INFO] Epoch:[0/2](845600/4588595) loss:2.497 lr:0.0000100 epoch_Time:23823.0min: [2024-01-06 11:21:54,392][model8_pretrain.py][INFO] Epoch:[0/2](845600/4588595) loss:3.065 lr:0.0000100 epoch_Time:23823.0min: [2024-01-06 11:22:31,327][model8_pretrain.py][INFO] Epoch:[0/2](845700/4588595) loss:3.254 lr:0.0000100 epoch_Time:23823.0min: [2024-01-06 11:22:31,327][model8_pretrain.py][INFO] Epoch:[0/2](845700/4588595) loss:2.799 lr:0.0000100 epoch_Time:23823.0min: [2024-01-06 11:22:31,327][model8_pretrain.py][INFO] Epoch:[0/2](845700/4588595) loss:3.030 lr:0.0000100 epoch_Time:23823.0min: [2024-01-06 11:22:31,327][model8_pretrain.py][INFO] Epoch:[0/2](845700/4588595) loss:2.794 lr:0.0000100 epoch_Time:23823.0min: [2024-01-06 11:22:31,327][model8_pretrain.py][INFO] Epoch:[0/2](845700/4588595) loss:3.107 lr:0.0000100 epoch_Time:23823.0min: [2024-01-06 11:22:31,327][model8_pretrain.py][INFO] Epoch:[0/2](845700/4588595) loss:2.668 lr:0.0000100 epoch_Time:23823.0min: [2024-01-06 11:22:31,327][model8_pretrain.py][INFO] Epoch:[0/2](845700/4588595) loss:2.759 lr:0.0000100 epoch_Time:23823.0min: [2024-01-06 11:22:31,327][model8_pretrain.py][INFO] Epoch:[0/2](845700/4588595) loss:2.699 lr:0.0000100 epoch_Time:23823.0min: [2024-01-06 11:23:08,286][model8_pretrain.py][INFO] Epoch:[0/2](845800/4588595) loss:2.279 lr:0.0000100 epoch_Time:23822.0min: [2024-01-06 11:23:08,286][model8_pretrain.py][INFO] Epoch:[0/2](845800/4588595) loss:2.979 lr:0.0000100 epoch_Time:23822.0min: [2024-01-06 11:23:08,286][model8_pretrain.py][INFO] Epoch:[0/2](845800/4588595) loss:2.276 lr:0.0000100 epoch_Time:23822.0min: [2024-01-06 11:23:08,286][model8_pretrain.py][INFO] 
Epoch:[0/2](845800/4588595) loss:2.803 lr:0.0000100 epoch_Time:23822.0min: [2024-01-06 11:23:08,286][model8_pretrain.py][INFO] Epoch:[0/2](845800/4588595) loss:2.227 lr:0.0000100 epoch_Time:23822.0min: [2024-01-06 11:23:08,286][model8_pretrain.py][INFO] Epoch:[0/2](845800/4588595) loss:3.119 lr:0.0000100 epoch_Time:23822.0min: [2024-01-06 11:23:08,287][model8_pretrain.py][INFO] Epoch:[0/2](845800/4588595) loss:2.588 lr:0.0000100 epoch_Time:23822.0min: [2024-01-06 11:23:08,287][model8_pretrain.py][INFO] Epoch:[0/2](845800/4588595) loss:3.122 lr:0.0000100 epoch_Time:23822.0min: [2024-01-06 11:23:45,227][model8_pretrain.py][INFO] Epoch:[0/2](845900/4588595) loss:2.473 lr:0.0000100 epoch_Time:23822.0min: [2024-01-06 11:23:45,227][model8_pretrain.py][INFO] Epoch:[0/2](845900/4588595) loss:2.629 lr:0.0000100 epoch_Time:23822.0min: [2024-01-06 11:23:45,227][model8_pretrain.py][INFO] Epoch:[0/2](845900/4588595) loss:2.159 lr:0.0000100 epoch_Time:23822.0min: [2024-01-06 11:23:45,227][model8_pretrain.py][INFO] Epoch:[0/2](845900/4588595) loss:3.158 lr:0.0000100 epoch_Time:23822.0min: [2024-01-06 11:23:45,227][model8_pretrain.py][INFO] Epoch:[0/2](845900/4588595) loss:2.685 lr:0.0000100 epoch_Time:23822.0min: [2024-01-06 11:23:45,227][model8_pretrain.py][INFO] Epoch:[0/2](845900/4588595) loss:2.654 lr:0.0000100 epoch_Time:23822.0min: [2024-01-06 11:23:45,227][model8_pretrain.py][INFO] Epoch:[0/2](845900/4588595) loss:3.187 lr:0.0000100 epoch_Time:23822.0min: [2024-01-06 11:23:45,228][model8_pretrain.py][INFO] Epoch:[0/2](845900/4588595) loss:2.343 lr:0.0000100 epoch_Time:23822.0min: [2024-01-06 11:24:22,166][model8_pretrain.py][INFO] Epoch:[0/2](846000/4588595) loss:2.498 lr:0.0000100 epoch_Time:23821.0min: [2024-01-06 11:24:22,166][model8_pretrain.py][INFO] Epoch:[0/2](846000/4588595) loss:2.890 lr:0.0000100 epoch_Time:23821.0min: [2024-01-06 11:24:22,166][model8_pretrain.py][INFO] Epoch:[0/2](846000/4588595) loss:3.227 lr:0.0000100 epoch_Time:23821.0min: [2024-01-06 
11:24:22,166][model8_pretrain.py][INFO] Epoch:[0/2](846000/4588595) loss:2.829 lr:0.0000100 epoch_Time:23821.0min: [2024-01-06 11:24:22,167][model8_pretrain.py][INFO] Epoch:[0/2](846000/4588595) loss:3.301 lr:0.0000100 epoch_Time:23821.0min: [2024-01-06 11:24:22,167][model8_pretrain.py][INFO] Epoch:[0/2](846000/4588595) loss:2.996 lr:0.0000100 epoch_Time:23821.0min: [2024-01-06 11:24:22,167][model8_pretrain.py][INFO] Epoch:[0/2](846000/4588595) loss:2.860 lr:0.0000100 epoch_Time:23821.0min: [2024-01-06 11:24:22,167][model8_pretrain.py][INFO] Epoch:[0/2](846000/4588595) loss:2.892 lr:0.0000100 epoch_Time:23821.0min: [2024-01-06 11:24:59,097][model8_pretrain.py][INFO] Epoch:[0/2](846100/4588595) loss:2.966 lr:0.0000100 epoch_Time:23820.0min: [2024-01-06 11:24:59,097][model8_pretrain.py][INFO] Epoch:[0/2](846100/4588595) loss:2.702 lr:0.0000100 epoch_Time:23820.0min: [2024-01-06 11:24:59,097][model8_pretrain.py][INFO] Epoch:[0/2](846100/4588595) loss:2.352 lr:0.0000100 epoch_Time:23820.0min: [2024-01-06 11:24:59,097][model8_pretrain.py][INFO] Epoch:[0/2](846100/4588595) loss:2.432 lr:0.0000100 epoch_Time:23820.0min: [2024-01-06 11:24:59,097][model8_pretrain.py][INFO] Epoch:[0/2](846100/4588595) loss:2.781 lr:0.0000100 epoch_Time:23820.0min: [2024-01-06 11:24:59,097][model8_pretrain.py][INFO] Epoch:[0/2](846100/4588595) loss:2.352 lr:0.0000100 epoch_Time:23820.0min: [2024-01-06 11:24:59,097][model8_pretrain.py][INFO] Epoch:[0/2](846100/4588595) loss:3.001 lr:0.0000100 epoch_Time:23820.0min: [2024-01-06 11:24:59,097][model8_pretrain.py][INFO] Epoch:[0/2](846100/4588595) loss:2.673 lr:0.0000100 epoch_Time:23820.0min: [2024-01-06 11:25:46,384][model8_pretrain.py][INFO] Epoch:[0/2](846200/4588595) loss:2.934 lr:0.0000100 epoch_Time:23820.0min: [2024-01-06 11:25:46,384][model8_pretrain.py][INFO] Epoch:[0/2](846200/4588595) loss:2.505 lr:0.0000100 epoch_Time:23820.0min: [2024-01-06 11:25:46,384][model8_pretrain.py][INFO] Epoch:[0/2](846200/4588595) loss:3.313 lr:0.0000100 
epoch_Time:23820.0min: [2024-01-06 11:25:46,384][model8_pretrain.py][INFO] Epoch:[0/2](846200/4588595) loss:2.612 lr:0.0000100 epoch_Time:23820.0min: [2024-01-06 11:25:46,384][model8_pretrain.py][INFO] Epoch:[0/2](846200/4588595) loss:2.189 lr:0.0000100 epoch_Time:23820.0min: [2024-01-06 11:25:46,384][model8_pretrain.py][INFO] Epoch:[0/2](846200/4588595) loss:3.027 lr:0.0000100 epoch_Time:23820.0min: [2024-01-06 11:25:46,384][model8_pretrain.py][INFO] Epoch:[0/2](846200/4588595) loss:3.089 lr:0.0000100 epoch_Time:23820.0min: [2024-01-06 11:25:46,384][model8_pretrain.py][INFO] Epoch:[0/2](846200/4588595) loss:2.039 lr:0.0000100 epoch_Time:23820.0min: [2024-01-06 11:26:23,314][model8_pretrain.py][INFO] Epoch:[0/2](846300/4588595) loss:2.956 lr:0.0000100 epoch_Time:23819.0min: [2024-01-06 11:26:23,314][model8_pretrain.py][INFO] Epoch:[0/2](846300/4588595) loss:3.295 lr:0.0000100 epoch_Time:23819.0min: [2024-01-06 11:26:23,314][model8_pretrain.py][INFO] Epoch:[0/2](846300/4588595) loss:2.981 lr:0.0000100 epoch_Time:23819.0min: [2024-01-06 11:26:23,314][model8_pretrain.py][INFO] Epoch:[0/2](846300/4588595) loss:3.106 lr:0.0000100 epoch_Time:23819.0min: [2024-01-06 11:26:23,314][model8_pretrain.py][INFO] Epoch:[0/2](846300/4588595) loss:2.815 lr:0.0000100 epoch_Time:23819.0min: [2024-01-06 11:26:23,314][model8_pretrain.py][INFO] Epoch:[0/2](846300/4588595) loss:2.951 lr:0.0000100 epoch_Time:23819.0min: [2024-01-06 11:26:23,314][model8_pretrain.py][INFO] Epoch:[0/2](846300/4588595) loss:3.130 lr:0.0000100 epoch_Time:23819.0min: [2024-01-06 11:26:23,314][model8_pretrain.py][INFO] Epoch:[0/2](846300/4588595) loss:3.261 lr:0.0000100 epoch_Time:23819.0min: [2024-01-06 11:27:00,255][model8_pretrain.py][INFO] Epoch:[0/2](846400/4588595) loss:2.983 lr:0.0000100 epoch_Time:23818.0min: [2024-01-06 11:27:00,255][model8_pretrain.py][INFO] Epoch:[0/2](846400/4588595) loss:1.926 lr:0.0000100 epoch_Time:23818.0min: [2024-01-06 11:27:00,255][model8_pretrain.py][INFO] 
Epoch:[0/2](846400/4588595) loss:2.912 lr:0.0000100 epoch_Time:23818.0min: [2024-01-06 11:27:00,255][model8_pretrain.py][INFO] Epoch:[0/2](846400/4588595) loss:2.657 lr:0.0000100 epoch_Time:23818.0min: [2024-01-06 11:27:00,255][model8_pretrain.py][INFO] Epoch:[0/2](846400/4588595) loss:3.401 lr:0.0000100 epoch_Time:23818.0min: [2024-01-06 11:27:00,255][model8_pretrain.py][INFO] Epoch:[0/2](846400/4588595) loss:2.178 lr:0.0000100 epoch_Time:23818.0min: [2024-01-06 11:27:00,255][model8_pretrain.py][INFO] Epoch:[0/2](846400/4588595) loss:2.946 lr:0.0000100 epoch_Time:23818.0min: [2024-01-06 11:27:00,255][model8_pretrain.py][INFO] Epoch:[0/2](846400/4588595) loss:2.446 lr:0.0000100 epoch_Time:23818.0min: [2024-01-06 11:27:37,189][model8_pretrain.py][INFO] Epoch:[0/2](846500/4588595) loss:2.592 lr:0.0000100 epoch_Time:23818.0min: [2024-01-06 11:27:37,189][model8_pretrain.py][INFO] Epoch:[0/2](846500/4588595) loss:3.121 lr:0.0000100 epoch_Time:23818.0min: [2024-01-06 11:27:37,189][model8_pretrain.py][INFO] Epoch:[0/2](846500/4588595) loss:2.390 lr:0.0000100 epoch_Time:23818.0min: [2024-01-06 11:27:37,189][model8_pretrain.py][INFO] Epoch:[0/2](846500/4588595) loss:2.785 lr:0.0000100 epoch_Time:23818.0min: [2024-01-06 11:27:37,189][model8_pretrain.py][INFO] Epoch:[0/2](846500/4588595) loss:2.279 lr:0.0000100 epoch_Time:23818.0min: [2024-01-06 11:27:37,189][model8_pretrain.py][INFO] Epoch:[0/2](846500/4588595) loss:2.999 lr:0.0000100 epoch_Time:23818.0min: [2024-01-06 11:27:37,189][model8_pretrain.py][INFO] Epoch:[0/2](846500/4588595) loss:2.902 lr:0.0000100 epoch_Time:23818.0min: [2024-01-06 11:27:37,189][model8_pretrain.py][INFO] Epoch:[0/2](846500/4588595) loss:2.329 lr:0.0000100 epoch_Time:23818.0min: [2024-01-06 11:28:14,129][model8_pretrain.py][INFO] Epoch:[0/2](846600/4588595) loss:2.262 lr:0.0000100 epoch_Time:23817.0min: [2024-01-06 11:28:14,129][model8_pretrain.py][INFO] Epoch:[0/2](846600/4588595) loss:2.380 lr:0.0000100 epoch_Time:23817.0min: [2024-01-06 
11:28:14,129][model8_pretrain.py][INFO] Epoch:[0/2](846600/4588595) loss:2.368 lr:0.0000100 epoch_Time:23817.0min: [2024-01-06 11:28:14,129][model8_pretrain.py][INFO] Epoch:[0/2](846600/4588595) loss:2.759 lr:0.0000100 epoch_Time:23817.0min: [2024-01-06 11:28:14,129][model8_pretrain.py][INFO] Epoch:[0/2](846600/4588595) loss:2.753 lr:0.0000100 epoch_Time:23817.0min: [2024-01-06 11:28:14,129][model8_pretrain.py][INFO] Epoch:[0/2](846600/4588595) loss:2.750 lr:0.0000100 epoch_Time:23817.0min: [2024-01-06 11:28:14,129][model8_pretrain.py][INFO] Epoch:[0/2](846600/4588595) loss:3.141 lr:0.0000100 epoch_Time:23817.0min: [2024-01-06 11:28:14,129][model8_pretrain.py][INFO] Epoch:[0/2](846600/4588595) loss:2.266 lr:0.0000100 epoch_Time:23817.0min: [2024-01-06 11:28:51,073][model8_pretrain.py][INFO] Epoch:[0/2](846700/4588595) loss:2.950 lr:0.0000100 epoch_Time:23816.0min: [2024-01-06 11:28:51,073][model8_pretrain.py][INFO] Epoch:[0/2](846700/4588595) loss:3.410 lr:0.0000100 epoch_Time:23816.0min: [2024-01-06 11:28:51,073][model8_pretrain.py][INFO] Epoch:[0/2](846700/4588595) loss:2.445 lr:0.0000100 epoch_Time:23816.0min: [2024-01-06 11:28:51,073][model8_pretrain.py][INFO] Epoch:[0/2](846700/4588595) loss:3.415 lr:0.0000100 epoch_Time:23816.0min: [2024-01-06 11:28:51,073][model8_pretrain.py][INFO] Epoch:[0/2](846700/4588595) loss:2.916 lr:0.0000100 epoch_Time:23816.0min: [2024-01-06 11:28:51,073][model8_pretrain.py][INFO] Epoch:[0/2](846700/4588595) loss:2.691 lr:0.0000100 epoch_Time:23816.0min: [2024-01-06 11:28:51,073][model8_pretrain.py][INFO] Epoch:[0/2](846700/4588595) loss:2.673 lr:0.0000100 epoch_Time:23816.0min: [2024-01-06 11:28:51,073][model8_pretrain.py][INFO] Epoch:[0/2](846700/4588595) loss:2.010 lr:0.0000100 epoch_Time:23816.0min: [2024-01-06 11:29:28,014][model8_pretrain.py][INFO] Epoch:[0/2](846800/4588595) loss:3.004 lr:0.0000100 epoch_Time:23816.0min: [2024-01-06 11:29:28,014][model8_pretrain.py][INFO] Epoch:[0/2](846800/4588595) loss:3.442 lr:0.0000100 
epoch_Time:23816.0min: [2024-01-06 11:29:28,014][model8_pretrain.py][INFO] Epoch:[0/2](846800/4588595) loss:3.057 lr:0.0000100 epoch_Time:23816.0min: [2024-01-06 11:29:28,014][model8_pretrain.py][INFO] Epoch:[0/2](846800/4588595) loss:2.802 lr:0.0000100 epoch_Time:23816.0min: [2024-01-06 11:29:28,014][model8_pretrain.py][INFO] Epoch:[0/2](846800/4588595) loss:2.607 lr:0.0000100 epoch_Time:23816.0min: [2024-01-06 11:29:28,014][model8_pretrain.py][INFO] Epoch:[0/2](846800/4588595) loss:2.671 lr:0.0000100 epoch_Time:23816.0min: [2024-01-06 11:29:28,014][model8_pretrain.py][INFO] Epoch:[0/2](846800/4588595) loss:2.519 lr:0.0000100 epoch_Time:23816.0min: [2024-01-06 11:29:28,014][model8_pretrain.py][INFO] Epoch:[0/2](846800/4588595) loss:3.154 lr:0.0000100 epoch_Time:23816.0min: [2024-01-06 11:30:04,990][model8_pretrain.py][INFO] Epoch:[0/2](846900/4588595) loss:2.428 lr:0.0000100 epoch_Time:23815.0min: [2024-01-06 11:30:04,990][model8_pretrain.py][INFO] Epoch:[0/2](846900/4588595) loss:2.831 lr:0.0000100 epoch_Time:23815.0min: [2024-01-06 11:30:04,990][model8_pretrain.py][INFO] Epoch:[0/2](846900/4588595) loss:2.871 lr:0.0000100 epoch_Time:23815.0min: [2024-01-06 11:30:04,990][model8_pretrain.py][INFO] Epoch:[0/2](846900/4588595) loss:2.567 lr:0.0000100 epoch_Time:23815.0min: [2024-01-06 11:30:04,990][model8_pretrain.py][INFO] Epoch:[0/2](846900/4588595) loss:3.462 lr:0.0000100 epoch_Time:23815.0min: [2024-01-06 11:30:04,990][model8_pretrain.py][INFO] Epoch:[0/2](846900/4588595) loss:2.580 lr:0.0000100 epoch_Time:23815.0min: [2024-01-06 11:30:04,990][model8_pretrain.py][INFO] Epoch:[0/2](846900/4588595) loss:3.410 lr:0.0000100 epoch_Time:23815.0min: [2024-01-06 11:30:04,990][model8_pretrain.py][INFO] Epoch:[0/2](846900/4588595) loss:2.945 lr:0.0000100 epoch_Time:23815.0min: [2024-01-06 11:30:52,350][model8_pretrain.py][INFO] Epoch:[0/2](847000/4588595) loss:2.910 lr:0.0000100 epoch_Time:23814.0min: [2024-01-06 11:30:52,350][model8_pretrain.py][INFO] 
Epoch:[0/2](847000/4588595) loss:3.401 lr:0.0000100 epoch_Time:23814.0min: [2024-01-06 11:30:52,350][model8_pretrain.py][INFO] Epoch:[0/2](847000/4588595) loss:2.641 lr:0.0000100 epoch_Time:23814.0min: [2024-01-06 11:30:52,350][model8_pretrain.py][INFO] Epoch:[0/2](847000/4588595) loss:2.886 lr:0.0000100 epoch_Time:23814.0min: [2024-01-06 11:30:52,350][model8_pretrain.py][INFO] Epoch:[0/2](847000/4588595) loss:3.251 lr:0.0000100 epoch_Time:23814.0min: [2024-01-06 11:30:52,350][model8_pretrain.py][INFO] Epoch:[0/2](847000/4588595) loss:2.871 lr:0.0000100 epoch_Time:23814.0min: [2024-01-06 11:30:52,350][model8_pretrain.py][INFO] Epoch:[0/2](847000/4588595) loss:2.140 lr:0.0000100 epoch_Time:23814.0min: [2024-01-06 11:30:52,350][model8_pretrain.py][INFO] Epoch:[0/2](847000/4588595) loss:3.329 lr:0.0000100 epoch_Time:23814.0min: [2024-01-06 11:31:29,300][model8_pretrain.py][INFO] Epoch:[0/2](847100/4588595) loss:2.924 lr:0.0000100 epoch_Time:23814.0min: [2024-01-06 11:31:29,300][model8_pretrain.py][INFO] Epoch:[0/2](847100/4588595) loss:3.567 lr:0.0000100 epoch_Time:23814.0min: [2024-01-06 11:31:29,300][model8_pretrain.py][INFO] Epoch:[0/2](847100/4588595) loss:2.847 lr:0.0000100 epoch_Time:23814.0min: [2024-01-06 11:31:29,300][model8_pretrain.py][INFO] Epoch:[0/2](847100/4588595) loss:2.988 lr:0.0000100 epoch_Time:23814.0min: [2024-01-06 11:31:29,300][model8_pretrain.py][INFO] Epoch:[0/2](847100/4588595) loss:2.168 lr:0.0000100 epoch_Time:23814.0min: [2024-01-06 11:31:29,300][model8_pretrain.py][INFO] Epoch:[0/2](847100/4588595) loss:3.287 lr:0.0000100 epoch_Time:23814.0min: [2024-01-06 11:31:29,300][model8_pretrain.py][INFO] Epoch:[0/2](847100/4588595) loss:2.656 lr:0.0000100 epoch_Time:23814.0min: [2024-01-06 11:31:29,300][model8_pretrain.py][INFO] Epoch:[0/2](847100/4588595) loss:2.682 lr:0.0000100 epoch_Time:23814.0min: [2024-01-06 11:32:06,248][model8_pretrain.py][INFO] Epoch:[0/2](847200/4588595) loss:3.106 lr:0.0000100 epoch_Time:23813.0min: [2024-01-06 
11:32:06,248][model8_pretrain.py][INFO] Epoch:[0/2](847200/4588595) loss:2.665 lr:0.0000100 epoch_Time:23813.0min: [2024-01-06 11:32:06,248][model8_pretrain.py][INFO] Epoch:[0/2](847200/4588595) loss:3.040 lr:0.0000100 epoch_Time:23813.0min: [2024-01-06 11:32:06,248][model8_pretrain.py][INFO] Epoch:[0/2](847200/4588595) loss:2.879 lr:0.0000100 epoch_Time:23813.0min: [2024-01-06 11:32:06,248][model8_pretrain.py][INFO] Epoch:[0/2](847200/4588595) loss:2.777 lr:0.0000100 epoch_Time:23813.0min: [2024-01-06 11:32:06,248][model8_pretrain.py][INFO] Epoch:[0/2](847200/4588595) loss:3.075 lr:0.0000100 epoch_Time:23813.0min: [2024-01-06 11:32:06,249][model8_pretrain.py][INFO] Epoch:[0/2](847200/4588595) loss:2.643 lr:0.0000100 epoch_Time:23813.0min: [2024-01-06 11:32:06,249][model8_pretrain.py][INFO] Epoch:[0/2](847200/4588595) loss:2.784 lr:0.0000100 epoch_Time:23813.0min: [2024-01-06 11:32:43,200][model8_pretrain.py][INFO] Epoch:[0/2](847300/4588595) loss:2.767 lr:0.0000100 epoch_Time:23813.0min: [2024-01-06 11:32:43,200][model8_pretrain.py][INFO] Epoch:[0/2](847300/4588595) loss:2.707 lr:0.0000100 epoch_Time:23813.0min: [2024-01-06 11:32:43,200][model8_pretrain.py][INFO] Epoch:[0/2](847300/4588595) loss:2.717 lr:0.0000100 epoch_Time:23813.0min: [2024-01-06 11:32:43,200][model8_pretrain.py][INFO] Epoch:[0/2](847300/4588595) loss:2.670 lr:0.0000100 epoch_Time:23813.0min: [2024-01-06 11:32:43,200][model8_pretrain.py][INFO] Epoch:[0/2](847300/4588595) loss:3.031 lr:0.0000100 epoch_Time:23813.0min: [2024-01-06 11:32:43,200][model8_pretrain.py][INFO] Epoch:[0/2](847300/4588595) loss:3.095 lr:0.0000100 epoch_Time:23813.0min: [2024-01-06 11:32:43,200][model8_pretrain.py][INFO] Epoch:[0/2](847300/4588595) loss:2.476 lr:0.0000100 epoch_Time:23813.0min: [2024-01-06 11:32:43,200][model8_pretrain.py][INFO] Epoch:[0/2](847300/4588595) loss:3.348 lr:0.0000100 epoch_Time:23813.0min: [2024-01-06 11:33:20,155][model8_pretrain.py][INFO] Epoch:[0/2](847400/4588595) loss:2.924 lr:0.0000100 
epoch_Time:23812.0min: [2024-01-06 11:33:20,155][model8_pretrain.py][INFO] Epoch:[0/2](847400/4588595) loss:2.414 lr:0.0000100 epoch_Time:23812.0min: [2024-01-06 11:33:20,155][model8_pretrain.py][INFO] Epoch:[0/2](847400/4588595) loss:2.758 lr:0.0000100 epoch_Time:23812.0min: [2024-01-06 11:33:20,155][model8_pretrain.py][INFO] Epoch:[0/2](847400/4588595) loss:2.796 lr:0.0000100 epoch_Time:23812.0min: [2024-01-06 11:33:20,155][model8_pretrain.py][INFO] Epoch:[0/2](847400/4588595) loss:2.077 lr:0.0000100 epoch_Time:23812.0min: [2024-01-06 11:33:20,155][model8_pretrain.py][INFO] Epoch:[0/2](847400/4588595) loss:2.876 lr:0.0000100 epoch_Time:23812.0min: [2024-01-06 11:33:20,155][model8_pretrain.py][INFO] Epoch:[0/2](847400/4588595) loss:3.071 lr:0.0000100 epoch_Time:23812.0min: [2024-01-06 11:33:20,156][model8_pretrain.py][INFO] Epoch:[0/2](847400/4588595) loss:2.969 lr:0.0000100 epoch_Time:23812.0min: [2024-01-06 11:33:57,103][model8_pretrain.py][INFO] Epoch:[0/2](847500/4588595) loss:2.901 lr:0.0000100 epoch_Time:23811.0min: [2024-01-06 11:33:57,103][model8_pretrain.py][INFO] Epoch:[0/2](847500/4588595) loss:2.879 lr:0.0000100 epoch_Time:23811.0min: [2024-01-06 11:33:57,103][model8_pretrain.py][INFO] Epoch:[0/2](847500/4588595) loss:3.288 lr:0.0000100 epoch_Time:23811.0min: [2024-01-06 11:33:57,104][model8_pretrain.py][INFO] Epoch:[0/2](847500/4588595) loss:2.818 lr:0.0000100 epoch_Time:23811.0min: [2024-01-06 11:33:57,104][model8_pretrain.py][INFO] Epoch:[0/2](847500/4588595) loss:3.520 lr:0.0000100 epoch_Time:23811.0min: [2024-01-06 11:33:57,104][model8_pretrain.py][INFO] Epoch:[0/2](847500/4588595) loss:2.642 lr:0.0000100 epoch_Time:23811.0min: [2024-01-06 11:33:57,104][model8_pretrain.py][INFO] Epoch:[0/2](847500/4588595) loss:2.323 lr:0.0000100 epoch_Time:23811.0min: [2024-01-06 11:33:57,104][model8_pretrain.py][INFO] Epoch:[0/2](847500/4588595) loss:3.010 lr:0.0000100 epoch_Time:23811.0min: [2024-01-06 11:34:34,067][model8_pretrain.py][INFO] 
Epoch:[0/2](847600/4588595) loss:3.071 lr:0.0000100 epoch_Time:23811.0min: [2024-01-06 11:34:34,067][model8_pretrain.py][INFO] Epoch:[0/2](847600/4588595) loss:2.905 lr:0.0000100 epoch_Time:23811.0min: [2024-01-06 11:34:34,067][model8_pretrain.py][INFO] Epoch:[0/2](847600/4588595) loss:3.146 lr:0.0000100 epoch_Time:23811.0min: [2024-01-06 11:34:34,068][model8_pretrain.py][INFO] Epoch:[0/2](847600/4588595) loss:2.816 lr:0.0000100 epoch_Time:23811.0min: [2024-01-06 11:34:34,068][model8_pretrain.py][INFO] Epoch:[0/2](847600/4588595) loss:2.241 lr:0.0000100 epoch_Time:23811.0min: [2024-01-06 11:34:34,068][model8_pretrain.py][INFO] Epoch:[0/2](847600/4588595) loss:3.083 lr:0.0000100 epoch_Time:23811.0min: [2024-01-06 11:34:34,068][model8_pretrain.py][INFO] Epoch:[0/2](847600/4588595) loss:2.077 lr:0.0000100 epoch_Time:23811.0min: [2024-01-06 11:34:34,068][model8_pretrain.py][INFO] Epoch:[0/2](847600/4588595) loss:2.542 lr:0.0000100 epoch_Time:23811.0min: [2024-01-06 11:35:11,024][model8_pretrain.py][INFO] Epoch:[0/2](847700/4588595) loss:2.712 lr:0.0000100 epoch_Time:23810.0min: [2024-01-06 11:35:11,024][model8_pretrain.py][INFO] Epoch:[0/2](847700/4588595) loss:2.698 lr:0.0000100 epoch_Time:23810.0min: [2024-01-06 11:35:11,024][model8_pretrain.py][INFO] Epoch:[0/2](847700/4588595) loss:3.036 lr:0.0000100 epoch_Time:23810.0min: [2024-01-06 11:35:11,024][model8_pretrain.py][INFO] Epoch:[0/2](847700/4588595) loss:2.824 lr:0.0000100 epoch_Time:23810.0min: [2024-01-06 11:35:11,024][model8_pretrain.py][INFO] Epoch:[0/2](847700/4588595) loss:2.349 lr:0.0000100 epoch_Time:23810.0min: [2024-01-06 11:35:11,024][model8_pretrain.py][INFO] Epoch:[0/2](847700/4588595) loss:2.885 lr:0.0000100 epoch_Time:23810.0min: [2024-01-06 11:35:11,024][model8_pretrain.py][INFO] Epoch:[0/2](847700/4588595) loss:3.023 lr:0.0000100 epoch_Time:23810.0min: [2024-01-06 11:35:11,024][model8_pretrain.py][INFO] Epoch:[0/2](847700/4588595) loss:2.464 lr:0.0000100 epoch_Time:23810.0min: [2024-01-06 
11:35:58,397][model8_pretrain.py][INFO] Epoch:[0/2](847800/4588595) loss:2.747 lr:0.0000100 epoch_Time:23810.0min: [2024-01-06 11:35:58,397][model8_pretrain.py][INFO] Epoch:[0/2](847800/4588595) loss:3.101 lr:0.0000100 epoch_Time:23810.0min: [2024-01-06 11:35:58,397][model8_pretrain.py][INFO] Epoch:[0/2](847800/4588595) loss:2.418 lr:0.0000100 epoch_Time:23810.0min: [2024-01-06 11:35:58,397][model8_pretrain.py][INFO] Epoch:[0/2](847800/4588595) loss:2.645 lr:0.0000100 epoch_Time:23810.0min: [2024-01-06 11:35:58,397][model8_pretrain.py][INFO] Epoch:[0/2](847800/4588595) loss:2.738 lr:0.0000100 epoch_Time:23810.0min: [2024-01-06 11:35:58,397][model8_pretrain.py][INFO] Epoch:[0/2](847800/4588595) loss:2.833 lr:0.0000100 epoch_Time:23810.0min: [2024-01-06 11:35:58,397][model8_pretrain.py][INFO] Epoch:[0/2](847800/4588595) loss:2.041 lr:0.0000100 epoch_Time:23810.0min: [2024-01-06 11:35:58,398][model8_pretrain.py][INFO] Epoch:[0/2](847800/4588595) loss:3.281 lr:0.0000100 epoch_Time:23810.0min: [2024-01-06 11:36:35,332][model8_pretrain.py][INFO] Epoch:[0/2](847900/4588595) loss:2.438 lr:0.0000100 epoch_Time:23809.0min: [2024-01-06 11:36:35,332][model8_pretrain.py][INFO] Epoch:[0/2](847900/4588595) loss:2.682 lr:0.0000100 epoch_Time:23809.0min: [2024-01-06 11:36:35,332][model8_pretrain.py][INFO] Epoch:[0/2](847900/4588595) loss:2.346 lr:0.0000100 epoch_Time:23809.0min: [2024-01-06 11:36:35,332][model8_pretrain.py][INFO] Epoch:[0/2](847900/4588595) loss:2.544 lr:0.0000100 epoch_Time:23809.0min: [2024-01-06 11:36:35,332][model8_pretrain.py][INFO] Epoch:[0/2](847900/4588595) loss:3.072 lr:0.0000100 epoch_Time:23809.0min: [2024-01-06 11:36:35,333][model8_pretrain.py][INFO] Epoch:[0/2](847900/4588595) loss:2.604 lr:0.0000100 epoch_Time:23809.0min: [2024-01-06 11:36:35,333][model8_pretrain.py][INFO] Epoch:[0/2](847900/4588595) loss:2.816 lr:0.0000100 epoch_Time:23809.0min: [2024-01-06 11:36:35,333][model8_pretrain.py][INFO] Epoch:[0/2](847900/4588595) loss:2.625 lr:0.0000100 
epoch_Time:23809.0min: [2024-01-06 11:37:12,276][model8_pretrain.py][INFO] Epoch:[0/2](848000/4588595) loss:3.083 lr:0.0000100 epoch_Time:23808.0min: [2024-01-06 11:37:12,276][model8_pretrain.py][INFO] Epoch:[0/2](848000/4588595) loss:2.702 lr:0.0000100 epoch_Time:23808.0min: [2024-01-06 11:37:12,276][model8_pretrain.py][INFO] Epoch:[0/2](848000/4588595) loss:2.573 lr:0.0000100 epoch_Time:23808.0min: [2024-01-06 11:37:12,276][model8_pretrain.py][INFO] Epoch:[0/2](848000/4588595) loss:2.813 lr:0.0000100 epoch_Time:23808.0min: [2024-01-06 11:37:12,276][model8_pretrain.py][INFO] Epoch:[0/2](848000/4588595) loss:2.769 lr:0.0000100 epoch_Time:23808.0min: [2024-01-06 11:37:12,276][model8_pretrain.py][INFO] Epoch:[0/2](848000/4588595) loss:2.623 lr:0.0000100 epoch_Time:23808.0min: [2024-01-06 11:37:12,276][model8_pretrain.py][INFO] Epoch:[0/2](848000/4588595) loss:2.781 lr:0.0000100 epoch_Time:23808.0min: [2024-01-06 11:37:12,276][model8_pretrain.py][INFO] Epoch:[0/2](848000/4588595) loss:2.505 lr:0.0000100 epoch_Time:23808.0min: [2024-01-06 11:37:49,218][model8_pretrain.py][INFO] Epoch:[0/2](848100/4588595) loss:2.860 lr:0.0000100 epoch_Time:23807.0min: [2024-01-06 11:37:49,218][model8_pretrain.py][INFO] Epoch:[0/2](848100/4588595) loss:3.327 lr:0.0000100 epoch_Time:23807.0min: [2024-01-06 11:37:49,218][model8_pretrain.py][INFO] Epoch:[0/2](848100/4588595) loss:2.079 lr:0.0000100 epoch_Time:23807.0min: [2024-01-06 11:37:49,218][model8_pretrain.py][INFO] Epoch:[0/2](848100/4588595) loss:2.877 lr:0.0000100 epoch_Time:23807.0min: [2024-01-06 11:37:49,218][model8_pretrain.py][INFO] Epoch:[0/2](848100/4588595) loss:3.137 lr:0.0000100 epoch_Time:23807.0min: [2024-01-06 11:37:49,218][model8_pretrain.py][INFO] Epoch:[0/2](848100/4588595) loss:2.918 lr:0.0000100 epoch_Time:23807.0min: [2024-01-06 11:37:49,218][model8_pretrain.py][INFO] Epoch:[0/2](848100/4588595) loss:2.642 lr:0.0000100 epoch_Time:23807.0min: [2024-01-06 11:37:49,218][model8_pretrain.py][INFO] 
Epoch:[0/2](848100/4588595) loss:3.317 lr:0.0000100 epoch_Time:23807.0min: [2024-01-06 11:38:26,170][model8_pretrain.py][INFO] Epoch:[0/2](848200/4588595) loss:2.619 lr:0.0000100 epoch_Time:23807.0min: [2024-01-06 11:38:26,170][model8_pretrain.py][INFO] Epoch:[0/2](848200/4588595) loss:3.081 lr:0.0000100 epoch_Time:23807.0min: [2024-01-06 11:38:26,170][model8_pretrain.py][INFO] Epoch:[0/2](848200/4588595) loss:2.990 lr:0.0000100 epoch_Time:23807.0min: [2024-01-06 11:38:26,170][model8_pretrain.py][INFO] Epoch:[0/2](848200/4588595) loss:3.314 lr:0.0000100 epoch_Time:23807.0min: [2024-01-06 11:38:26,170][model8_pretrain.py][INFO] Epoch:[0/2](848200/4588595) loss:3.030 lr:0.0000100 epoch_Time:23807.0min: [2024-01-06 11:38:26,170][model8_pretrain.py][INFO] Epoch:[0/2](848200/4588595) loss:2.975 lr:0.0000100 epoch_Time:23807.0min: [2024-01-06 11:38:26,170][model8_pretrain.py][INFO] Epoch:[0/2](848200/4588595) loss:2.577 lr:0.0000100 epoch_Time:23807.0min: [2024-01-06 11:38:26,170][model8_pretrain.py][INFO] Epoch:[0/2](848200/4588595) loss:2.034 lr:0.0000100 epoch_Time:23807.0min: [2024-01-06 11:39:03,143][model8_pretrain.py][INFO] Epoch:[0/2](848300/4588595) loss:2.845 lr:0.0000100 epoch_Time:23806.0min: [2024-01-06 11:39:03,143][model8_pretrain.py][INFO] Epoch:[0/2](848300/4588595) loss:3.099 lr:0.0000100 epoch_Time:23806.0min: [2024-01-06 11:39:03,143][model8_pretrain.py][INFO] Epoch:[0/2](848300/4588595) loss:2.352 lr:0.0000100 epoch_Time:23806.0min: [2024-01-06 11:39:03,143][model8_pretrain.py][INFO] Epoch:[0/2](848300/4588595) loss:2.661 lr:0.0000100 epoch_Time:23806.0min: [2024-01-06 11:39:03,143][model8_pretrain.py][INFO] Epoch:[0/2](848300/4588595) loss:2.854 lr:0.0000100 epoch_Time:23806.0min: [2024-01-06 11:39:03,143][model8_pretrain.py][INFO] Epoch:[0/2](848300/4588595) loss:2.813 lr:0.0000100 epoch_Time:23806.0min: [2024-01-06 11:39:03,143][model8_pretrain.py][INFO] Epoch:[0/2](848300/4588595) loss:2.203 lr:0.0000100 epoch_Time:23806.0min: [2024-01-06 
11:39:03,143][model8_pretrain.py][INFO] Epoch:[0/2](848300/4588595) loss:2.166 lr:0.0000100 epoch_Time:23806.0min: [2024-01-06 11:39:40,090][model8_pretrain.py][INFO] Epoch:[0/2](848400/4588595) loss:3.038 lr:0.0000100 epoch_Time:23806.0min: [2024-01-06 11:39:40,090][model8_pretrain.py][INFO] Epoch:[0/2](848400/4588595) loss:2.602 lr:0.0000100 epoch_Time:23806.0min: [2024-01-06 11:39:40,090][model8_pretrain.py][INFO] Epoch:[0/2](848400/4588595) loss:3.071 lr:0.0000100 epoch_Time:23806.0min: [2024-01-06 11:39:40,090][model8_pretrain.py][INFO] Epoch:[0/2](848400/4588595) loss:3.232 lr:0.0000100 epoch_Time:23806.0min: [2024-01-06 11:39:40,090][model8_pretrain.py][INFO] Epoch:[0/2](848400/4588595) loss:2.735 lr:0.0000100 epoch_Time:23806.0min: [2024-01-06 11:39:40,090][model8_pretrain.py][INFO] Epoch:[0/2](848400/4588595) loss:2.672 lr:0.0000100 epoch_Time:23806.0min: [2024-01-06 11:39:40,090][model8_pretrain.py][INFO] Epoch:[0/2](848400/4588595) loss:3.185 lr:0.0000100 epoch_Time:23806.0min: [2024-01-06 11:39:40,091][model8_pretrain.py][INFO] Epoch:[0/2](848400/4588595) loss:3.270 lr:0.0000100 epoch_Time:23806.0min: [2024-01-06 11:40:17,041][model8_pretrain.py][INFO] Epoch:[0/2](848500/4588595) loss:2.682 lr:0.0000100 epoch_Time:23805.0min: [2024-01-06 11:40:17,041][model8_pretrain.py][INFO] Epoch:[0/2](848500/4588595) loss:3.042 lr:0.0000100 epoch_Time:23805.0min: [2024-01-06 11:40:17,041][model8_pretrain.py][INFO] Epoch:[0/2](848500/4588595) loss:2.725 lr:0.0000100 epoch_Time:23805.0min: [2024-01-06 11:40:17,041][model8_pretrain.py][INFO] Epoch:[0/2](848500/4588595) loss:2.410 lr:0.0000100 epoch_Time:23805.0min: [2024-01-06 11:40:17,041][model8_pretrain.py][INFO] Epoch:[0/2](848500/4588595) loss:2.431 lr:0.0000100 epoch_Time:23805.0min: [2024-01-06 11:40:17,041][model8_pretrain.py][INFO] Epoch:[0/2](848500/4588595) loss:2.770 lr:0.0000100 epoch_Time:23805.0min: [2024-01-06 11:40:17,041][model8_pretrain.py][INFO] Epoch:[0/2](848500/4588595) loss:2.825 lr:0.0000100 
epoch_Time:23805.0min: [2024-01-06 11:40:17,041][model8_pretrain.py][INFO] Epoch:[0/2](848500/4588595) loss:2.808 lr:0.0000100 epoch_Time:23805.0min: [2024-01-06 11:41:04,208][model8_pretrain.py][INFO] Epoch:[0/2](848600/4588595) loss:3.178 lr:0.0000100 epoch_Time:23805.0min: [2024-01-06 11:41:04,208][model8_pretrain.py][INFO] Epoch:[0/2](848600/4588595) loss:2.903 lr:0.0000100 epoch_Time:23805.0min: [2024-01-06 11:41:04,208][model8_pretrain.py][INFO] Epoch:[0/2](848600/4588595) loss:2.787 lr:0.0000100 epoch_Time:23805.0min: [2024-01-06 11:41:04,208][model8_pretrain.py][INFO] Epoch:[0/2](848600/4588595) loss:2.969 lr:0.0000100 epoch_Time:23805.0min: [2024-01-06 11:41:04,208][model8_pretrain.py][INFO] Epoch:[0/2](848600/4588595) loss:2.886 lr:0.0000100 epoch_Time:23805.0min: [2024-01-06 11:41:04,208][model8_pretrain.py][INFO] Epoch:[0/2](848600/4588595) loss:2.989 lr:0.0000100 epoch_Time:23805.0min: [2024-01-06 11:41:04,208][model8_pretrain.py][INFO] Epoch:[0/2](848600/4588595) loss:2.567 lr:0.0000100 epoch_Time:23805.0min: [2024-01-06 11:41:04,208][model8_pretrain.py][INFO] Epoch:[0/2](848600/4588595) loss:2.543 lr:0.0000100 epoch_Time:23805.0min: [2024-01-06 11:41:41,140][model8_pretrain.py][INFO] Epoch:[0/2](848700/4588595) loss:3.125 lr:0.0000100 epoch_Time:23804.0min: [2024-01-06 11:41:41,140][model8_pretrain.py][INFO] Epoch:[0/2](848700/4588595) loss:2.861 lr:0.0000100 epoch_Time:23804.0min: [2024-01-06 11:41:41,140][model8_pretrain.py][INFO] Epoch:[0/2](848700/4588595) loss:3.236 lr:0.0000100 epoch_Time:23804.0min: [2024-01-06 11:41:41,140][model8_pretrain.py][INFO] Epoch:[0/2](848700/4588595) loss:2.235 lr:0.0000100 epoch_Time:23804.0min: [2024-01-06 11:41:41,140][model8_pretrain.py][INFO] Epoch:[0/2](848700/4588595) loss:2.862 lr:0.0000100 epoch_Time:23804.0min: [2024-01-06 11:41:41,140][model8_pretrain.py][INFO] Epoch:[0/2](848700/4588595) loss:3.255 lr:0.0000100 epoch_Time:23804.0min: [2024-01-06 11:41:41,140][model8_pretrain.py][INFO] 
Epoch:[0/2](848700/4588595) loss:2.690 lr:0.0000100 epoch_Time:23804.0min: [2024-01-06 11:41:41,140][model8_pretrain.py][INFO] Epoch:[0/2](848700/4588595) loss:2.976 lr:0.0000100 epoch_Time:23804.0min: [2024-01-06 11:42:18,083][model8_pretrain.py][INFO] Epoch:[0/2](848800/4588595) loss:2.205 lr:0.0000100 epoch_Time:23803.0min: [2024-01-06 11:42:18,083][model8_pretrain.py][INFO] Epoch:[0/2](848800/4588595) loss:3.546 lr:0.0000100 epoch_Time:23803.0min: [2024-01-06 11:42:18,083][model8_pretrain.py][INFO] Epoch:[0/2](848800/4588595) loss:2.724 lr:0.0000100 epoch_Time:23803.0min: [2024-01-06 11:42:18,083][model8_pretrain.py][INFO] Epoch:[0/2](848800/4588595) loss:2.735 lr:0.0000100 epoch_Time:23803.0min: [2024-01-06 11:42:18,083][model8_pretrain.py][INFO] Epoch:[0/2](848800/4588595) loss:2.989 lr:0.0000100 epoch_Time:23803.0min: [2024-01-06 11:42:18,083][model8_pretrain.py][INFO] Epoch:[0/2](848800/4588595) loss:2.612 lr:0.0000100 epoch_Time:23803.0min: [2024-01-06 11:42:18,084][model8_pretrain.py][INFO] Epoch:[0/2](848800/4588595) loss:2.276 lr:0.0000100 epoch_Time:23803.0min: [2024-01-06 11:42:18,084][model8_pretrain.py][INFO] Epoch:[0/2](848800/4588595) loss:2.686 lr:0.0000100 epoch_Time:23803.0min: [2024-01-06 11:42:55,018][model8_pretrain.py][INFO] Epoch:[0/2](848900/4588595) loss:2.801 lr:0.0000100 epoch_Time:23802.0min: [2024-01-06 11:42:55,018][model8_pretrain.py][INFO] Epoch:[0/2](848900/4588595) loss:3.027 lr:0.0000100 epoch_Time:23802.0min: [2024-01-06 11:42:55,018][model8_pretrain.py][INFO] Epoch:[0/2](848900/4588595) loss:2.514 lr:0.0000100 epoch_Time:23802.0min: [2024-01-06 11:42:55,018][model8_pretrain.py][INFO] Epoch:[0/2](848900/4588595) loss:2.869 lr:0.0000100 epoch_Time:23802.0min: [2024-01-06 11:42:55,018][model8_pretrain.py][INFO] Epoch:[0/2](848900/4588595) loss:2.562 lr:0.0000100 epoch_Time:23802.0min: [2024-01-06 11:42:55,018][model8_pretrain.py][INFO] Epoch:[0/2](848900/4588595) loss:2.483 lr:0.0000100 epoch_Time:23802.0min: [2024-01-06 
11:42:55,018][model8_pretrain.py][INFO] Epoch:[0/2](848900/4588595) loss:2.830 lr:0.0000100 epoch_Time:23802.0min: [2024-01-06 11:42:55,018][model8_pretrain.py][INFO] Epoch:[0/2](848900/4588595) loss:3.274 lr:0.0000100 epoch_Time:23802.0min: [2024-01-06 11:43:31,954][model8_pretrain.py][INFO] Epoch:[0/2](849000/4588595) loss:2.775 lr:0.0000100 epoch_Time:23802.0min: [2024-01-06 11:43:31,954][model8_pretrain.py][INFO] Epoch:[0/2](849000/4588595) loss:2.308 lr:0.0000100 epoch_Time:23802.0min: [2024-01-06 11:43:31,954][model8_pretrain.py][INFO] Epoch:[0/2](849000/4588595) loss:2.895 lr:0.0000100 epoch_Time:23802.0min: [2024-01-06 11:43:31,954][model8_pretrain.py][INFO] Epoch:[0/2](849000/4588595) loss:2.553 lr:0.0000100 epoch_Time:23802.0min: [2024-01-06 11:43:31,955][model8_pretrain.py][INFO] Epoch:[0/2](849000/4588595) loss:3.098 lr:0.0000100 epoch_Time:23802.0min: [2024-01-06 11:43:31,955][model8_pretrain.py][INFO] Epoch:[0/2](849000/4588595) loss:2.618 lr:0.0000100 epoch_Time:23802.0min: [2024-01-06 11:43:31,955][model8_pretrain.py][INFO] Epoch:[0/2](849000/4588595) loss:3.068 lr:0.0000100 epoch_Time:23802.0min: [2024-01-06 11:43:31,955][model8_pretrain.py][INFO] Epoch:[0/2](849000/4588595) loss:3.175 lr:0.0000100 epoch_Time:23802.0min: [2024-01-06 11:44:08,895][model8_pretrain.py][INFO] Epoch:[0/2](849100/4588595) loss:2.518 lr:0.0000100 epoch_Time:23801.0min: [2024-01-06 11:44:08,895][model8_pretrain.py][INFO] Epoch:[0/2](849100/4588595) loss:2.609 lr:0.0000100 epoch_Time:23801.0min: [2024-01-06 11:44:08,895][model8_pretrain.py][INFO] Epoch:[0/2](849100/4588595) loss:2.152 lr:0.0000100 epoch_Time:23801.0min: [2024-01-06 11:44:08,895][model8_pretrain.py][INFO] Epoch:[0/2](849100/4588595) loss:3.072 lr:0.0000100 epoch_Time:23801.0min: [2024-01-06 11:44:08,895][model8_pretrain.py][INFO] Epoch:[0/2](849100/4588595) loss:2.564 lr:0.0000100 epoch_Time:23801.0min: [2024-01-06 11:44:08,895][model8_pretrain.py][INFO] Epoch:[0/2](849100/4588595) loss:2.736 lr:0.0000100 
epoch_Time:23801.0min: [2024-01-06 11:44:08,895][model8_pretrain.py][INFO] Epoch:[0/2](849100/4588595) loss:3.115 lr:0.0000100 epoch_Time:23801.0min: [2024-01-06 11:44:08,895][model8_pretrain.py][INFO] Epoch:[0/2](849100/4588595) loss:3.316 lr:0.0000100 epoch_Time:23801.0min: [2024-01-06 11:44:45,841][model8_pretrain.py][INFO] Epoch:[0/2](849200/4588595) loss:3.103 lr:0.0000100 epoch_Time:23801.0min: [2024-01-06 11:44:45,841][model8_pretrain.py][INFO] Epoch:[0/2](849200/4588595) loss:2.979 lr:0.0000100 epoch_Time:23801.0min: [2024-01-06 11:44:45,841][model8_pretrain.py][INFO] Epoch:[0/2](849200/4588595) loss:2.834 lr:0.0000100 epoch_Time:23801.0min: [2024-01-06 11:44:45,841][model8_pretrain.py][INFO] Epoch:[0/2](849200/4588595) loss:3.090 lr:0.0000100 epoch_Time:23801.0min: [2024-01-06 11:44:45,841][model8_pretrain.py][INFO] Epoch:[0/2](849200/4588595) loss:2.721 lr:0.0000100 epoch_Time:23801.0min: [2024-01-06 11:44:45,841][model8_pretrain.py][INFO] Epoch:[0/2](849200/4588595) loss:2.393 lr:0.0000100 epoch_Time:23801.0min: [2024-01-06 11:44:45,841][model8_pretrain.py][INFO] Epoch:[0/2](849200/4588595) loss:2.375 lr:0.0000100 epoch_Time:23801.0min: [2024-01-06 11:44:45,841][model8_pretrain.py][INFO] Epoch:[0/2](849200/4588595) loss:3.028 lr:0.0000100 epoch_Time:23801.0min: [2024-01-06 11:45:22,787][model8_pretrain.py][INFO] Epoch:[0/2](849300/4588595) loss:2.902 lr:0.0000100 epoch_Time:23800.0min: [2024-01-06 11:45:22,787][model8_pretrain.py][INFO] Epoch:[0/2](849300/4588595) loss:2.362 lr:0.0000100 epoch_Time:23800.0min: [2024-01-06 11:45:22,787][model8_pretrain.py][INFO] Epoch:[0/2](849300/4588595) loss:2.581 lr:0.0000100 epoch_Time:23800.0min: [2024-01-06 11:45:22,787][model8_pretrain.py][INFO] Epoch:[0/2](849300/4588595) loss:2.776 lr:0.0000100 epoch_Time:23800.0min: [2024-01-06 11:45:22,787][model8_pretrain.py][INFO] Epoch:[0/2](849300/4588595) loss:2.490 lr:0.0000100 epoch_Time:23800.0min: [2024-01-06 11:45:22,787][model8_pretrain.py][INFO] 
Epoch:[0/2](849300/4588595) loss:2.739 lr:0.0000100 epoch_Time:23800.0min: [2024-01-06 11:45:22,787][model8_pretrain.py][INFO] Epoch:[0/2](849300/4588595) loss:2.595 lr:0.0000100 epoch_Time:23800.0min: [2024-01-06 11:45:22,787][model8_pretrain.py][INFO] Epoch:[0/2](849300/4588595) loss:2.620 lr:0.0000100 epoch_Time:23800.0min: [2024-01-06 11:46:10,206][model8_pretrain.py][INFO] Epoch:[0/2](849400/4588595) loss:2.665 lr:0.0000100 epoch_Time:23800.0min: [2024-01-06 11:46:10,206][model8_pretrain.py][INFO] Epoch:[0/2](849400/4588595) loss:2.957 lr:0.0000100 epoch_Time:23800.0min: [2024-01-06 11:46:10,206][model8_pretrain.py][INFO] Epoch:[0/2](849400/4588595) loss:2.877 lr:0.0000100 epoch_Time:23800.0min: [2024-01-06 11:46:10,206][model8_pretrain.py][INFO] Epoch:[0/2](849400/4588595) loss:2.453 lr:0.0000100 epoch_Time:23800.0min: [2024-01-06 11:46:10,206][model8_pretrain.py][INFO] Epoch:[0/2](849400/4588595) loss:2.677 lr:0.0000100 epoch_Time:23800.0min: [2024-01-06 11:46:10,206][model8_pretrain.py][INFO] Epoch:[0/2](849400/4588595) loss:3.126 lr:0.0000100 epoch_Time:23800.0min: [2024-01-06 11:46:10,206][model8_pretrain.py][INFO] Epoch:[0/2](849400/4588595) loss:3.166 lr:0.0000100 epoch_Time:23800.0min: [2024-01-06 11:46:10,206][model8_pretrain.py][INFO] Epoch:[0/2](849400/4588595) loss:2.885 lr:0.0000100 epoch_Time:23800.0min: [2024-01-06 11:46:47,165][model8_pretrain.py][INFO] Epoch:[0/2](849500/4588595) loss:2.672 lr:0.0000100 epoch_Time:23799.0min: [2024-01-06 11:46:47,165][model8_pretrain.py][INFO] Epoch:[0/2](849500/4588595) loss:2.415 lr:0.0000100 epoch_Time:23799.0min: [2024-01-06 11:46:47,165][model8_pretrain.py][INFO] Epoch:[0/2](849500/4588595) loss:2.760 lr:0.0000100 epoch_Time:23799.0min: [2024-01-06 11:46:47,165][model8_pretrain.py][INFO] Epoch:[0/2](849500/4588595) loss:3.121 lr:0.0000100 epoch_Time:23799.0min: [2024-01-06 11:46:47,165][model8_pretrain.py][INFO] Epoch:[0/2](849500/4588595) loss:2.859 lr:0.0000100 epoch_Time:23799.0min: [2024-01-06 
11:46:47,165][model8_pretrain.py][INFO] Epoch:[0/2](849500/4588595) loss:3.028 lr:0.0000100 epoch_Time:23799.0min: [2024-01-06 11:46:47,165][model8_pretrain.py][INFO] Epoch:[0/2](849500/4588595) loss:2.868 lr:0.0000100 epoch_Time:23799.0min: [2024-01-06 11:46:47,165][model8_pretrain.py][INFO] Epoch:[0/2](849500/4588595) loss:3.176 lr:0.0000100 epoch_Time:23799.0min: [2024-01-06 11:47:24,103][model8_pretrain.py][INFO] Epoch:[0/2](849600/4588595) loss:2.875 lr:0.0000100 epoch_Time:23798.0min: [2024-01-06 11:47:24,103][model8_pretrain.py][INFO] Epoch:[0/2](849600/4588595) loss:2.783 lr:0.0000100 epoch_Time:23798.0min: [2024-01-06 11:47:24,103][model8_pretrain.py][INFO] Epoch:[0/2](849600/4588595) loss:2.834 lr:0.0000100 epoch_Time:23798.0min: [2024-01-06 11:47:24,103][model8_pretrain.py][INFO] Epoch:[0/2](849600/4588595) loss:2.248 lr:0.0000100 epoch_Time:23798.0min: [2024-01-06 11:47:24,103][model8_pretrain.py][INFO] Epoch:[0/2](849600/4588595) loss:1.845 lr:0.0000100 epoch_Time:23798.0min: [2024-01-06 11:47:24,103][model8_pretrain.py][INFO] Epoch:[0/2](849600/4588595) loss:2.547 lr:0.0000100 epoch_Time:23798.0min: [2024-01-06 11:47:24,103][model8_pretrain.py][INFO] Epoch:[0/2](849600/4588595) loss:2.808 lr:0.0000100 epoch_Time:23798.0min: [2024-01-06 11:47:24,103][model8_pretrain.py][INFO] Epoch:[0/2](849600/4588595) loss:2.539 lr:0.0000100 epoch_Time:23798.0min: [2024-01-06 11:48:01,047][model8_pretrain.py][INFO] Epoch:[0/2](849700/4588595) loss:2.941 lr:0.0000100 epoch_Time:23797.0min: [2024-01-06 11:48:01,047][model8_pretrain.py][INFO] Epoch:[0/2](849700/4588595) loss:2.272 lr:0.0000100 epoch_Time:23797.0min: [2024-01-06 11:48:01,047][model8_pretrain.py][INFO] Epoch:[0/2](849700/4588595) loss:3.393 lr:0.0000100 epoch_Time:23797.0min: [2024-01-06 11:48:01,047][model8_pretrain.py][INFO] Epoch:[0/2](849700/4588595) loss:3.529 lr:0.0000100 epoch_Time:23797.0min: [2024-01-06 11:48:01,047][model8_pretrain.py][INFO] Epoch:[0/2](849700/4588595) loss:2.898 lr:0.0000100 
epoch_Time:23797.0min: [2024-01-06 11:48:01,047][model8_pretrain.py][INFO] Epoch:[0/2](849700/4588595) loss:2.722 lr:0.0000100 epoch_Time:23797.0min: [2024-01-06 11:48:01,047][model8_pretrain.py][INFO] Epoch:[0/2](849700/4588595) loss:2.694 lr:0.0000100 epoch_Time:23797.0min: [2024-01-06 11:48:01,047][model8_pretrain.py][INFO] Epoch:[0/2](849700/4588595) loss:3.011 lr:0.0000100 epoch_Time:23797.0min: [2024-01-06 11:48:37,989][model8_pretrain.py][INFO] Epoch:[0/2](849800/4588595) loss:2.012 lr:0.0000100 epoch_Time:23797.0min: [2024-01-06 11:48:37,989][model8_pretrain.py][INFO] Epoch:[0/2](849800/4588595) loss:2.810 lr:0.0000100 epoch_Time:23797.0min: [2024-01-06 11:48:37,989][model8_pretrain.py][INFO] Epoch:[0/2](849800/4588595) loss:3.161 lr:0.0000100 epoch_Time:23797.0min: [2024-01-06 11:48:37,990][model8_pretrain.py][INFO] Epoch:[0/2](849800/4588595) loss:2.769 lr:0.0000100 epoch_Time:23797.0min: [2024-01-06 11:48:37,990][model8_pretrain.py][INFO] Epoch:[0/2](849800/4588595) loss:2.890 lr:0.0000100 epoch_Time:23797.0min: [2024-01-06 11:48:37,990][model8_pretrain.py][INFO] Epoch:[0/2](849800/4588595) loss:2.715 lr:0.0000100 epoch_Time:23797.0min: [2024-01-06 11:48:37,990][model8_pretrain.py][INFO] Epoch:[0/2](849800/4588595) loss:3.072 lr:0.0000100 epoch_Time:23797.0min: [2024-01-06 11:48:37,990][model8_pretrain.py][INFO] Epoch:[0/2](849800/4588595) loss:2.968 lr:0.0000100 epoch_Time:23797.0min: [2024-01-06 11:49:14,939][model8_pretrain.py][INFO] Epoch:[0/2](849900/4588595) loss:3.220 lr:0.0000100 epoch_Time:23796.0min: [2024-01-06 11:49:14,939][model8_pretrain.py][INFO] Epoch:[0/2](849900/4588595) loss:3.123 lr:0.0000100 epoch_Time:23796.0min: [2024-01-06 11:49:14,939][model8_pretrain.py][INFO] Epoch:[0/2](849900/4588595) loss:2.536 lr:0.0000100 epoch_Time:23796.0min: [2024-01-06 11:49:14,939][model8_pretrain.py][INFO] Epoch:[0/2](849900/4588595) loss:2.547 lr:0.0000100 epoch_Time:23796.0min: [2024-01-06 11:49:14,939][model8_pretrain.py][INFO] 
Epoch:[0/2](849900/4588595) loss:2.934 lr:0.0000100 epoch_Time:23796.0min: [2024-01-06 11:49:14,939][model8_pretrain.py][INFO] Epoch:[0/2](849900/4588595) loss:2.658 lr:0.0000100 epoch_Time:23796.0min: [2024-01-06 11:49:14,939][model8_pretrain.py][INFO] Epoch:[0/2](849900/4588595) loss:3.285 lr:0.0000100 epoch_Time:23796.0min: [2024-01-06 11:49:14,939][model8_pretrain.py][INFO] Epoch:[0/2](849900/4588595) loss:2.798 lr:0.0000100 epoch_Time:23796.0min: [2024-01-06 11:49:51,896][model8_pretrain.py][INFO] Epoch:[0/2](850000/4588595) loss:3.048 lr:0.0000100 epoch_Time:23795.0min: [2024-01-06 11:49:51,896][model8_pretrain.py][INFO] Epoch:[0/2](850000/4588595) loss:2.959 lr:0.0000100 epoch_Time:23795.0min: [2024-01-06 11:49:51,896][model8_pretrain.py][INFO] Epoch:[0/2](850000/4588595) loss:2.529 lr:0.0000100 epoch_Time:23795.0min: [2024-01-06 11:49:51,896][model8_pretrain.py][INFO] Epoch:[0/2](850000/4588595) loss:2.639 lr:0.0000100 epoch_Time:23795.0min: [2024-01-06 11:49:51,896][model8_pretrain.py][INFO] Epoch:[0/2](850000/4588595) loss:3.017 lr:0.0000100 epoch_Time:23795.0min: [2024-01-06 11:49:51,896][model8_pretrain.py][INFO] Epoch:[0/2](850000/4588595) loss:2.676 lr:0.0000100 epoch_Time:23795.0min: [2024-01-06 11:49:51,896][model8_pretrain.py][INFO] Epoch:[0/2](850000/4588595) loss:2.918 lr:0.0000100 epoch_Time:23795.0min: [2024-01-06 11:49:51,896][model8_pretrain.py][INFO] Epoch:[0/2](850000/4588595) loss:2.582 lr:0.0000100 epoch_Time:23795.0min: [2024-01-06 11:50:28,840][model8_pretrain.py][INFO] Epoch:[0/2](850100/4588595) loss:2.708 lr:0.0000100 epoch_Time:23795.0min: [2024-01-06 11:50:28,840][model8_pretrain.py][INFO] Epoch:[0/2](850100/4588595) loss:2.962 lr:0.0000100 epoch_Time:23795.0min: [2024-01-06 11:50:28,840][model8_pretrain.py][INFO] Epoch:[0/2](850100/4588595) loss:3.214 lr:0.0000100 epoch_Time:23795.0min: [2024-01-06 11:50:28,840][model8_pretrain.py][INFO] Epoch:[0/2](850100/4588595) loss:3.314 lr:0.0000100 epoch_Time:23795.0min: [2024-01-06 
11:50:28,840][model8_pretrain.py][INFO] Epoch:[0/2](850100/4588595) loss:2.569 lr:0.0000100 epoch_Time:23795.0min: [2024-01-06 11:50:28,840][model8_pretrain.py][INFO] Epoch:[0/2](850100/4588595) loss:2.820 lr:0.0000100 epoch_Time:23795.0min: [2024-01-06 11:50:28,840][model8_pretrain.py][INFO] Epoch:[0/2](850100/4588595) loss:2.491 lr:0.0000100 epoch_Time:23795.0min: [2024-01-06 11:50:28,840][model8_pretrain.py][INFO] Epoch:[0/2](850100/4588595) loss:2.802 lr:0.0000100 epoch_Time:23795.0min: [2024-01-06 11:51:16,184][model8_pretrain.py][INFO] Epoch:[0/2](850200/4588595) loss:2.713 lr:0.0000100 epoch_Time:23795.0min: [2024-01-06 11:51:16,184][model8_pretrain.py][INFO] Epoch:[0/2](850200/4588595) loss:2.567 lr:0.0000100 epoch_Time:23795.0min: [2024-01-06 11:51:16,184][model8_pretrain.py][INFO] Epoch:[0/2](850200/4588595) loss:3.105 lr:0.0000100 epoch_Time:23795.0min: [2024-01-06 11:51:16,184][model8_pretrain.py][INFO] Epoch:[0/2](850200/4588595) loss:3.041 lr:0.0000100 epoch_Time:23795.0min: [2024-01-06 11:51:16,184][model8_pretrain.py][INFO] Epoch:[0/2](850200/4588595) loss:3.022 lr:0.0000100 epoch_Time:23795.0min: [2024-01-06 11:51:16,184][model8_pretrain.py][INFO] Epoch:[0/2](850200/4588595) loss:3.082 lr:0.0000100 epoch_Time:23795.0min: [2024-01-06 11:51:16,184][model8_pretrain.py][INFO] Epoch:[0/2](850200/4588595) loss:2.677 lr:0.0000100 epoch_Time:23795.0min: [2024-01-06 11:51:16,184][model8_pretrain.py][INFO] Epoch:[0/2](850200/4588595) loss:2.585 lr:0.0000100 epoch_Time:23795.0min: [2024-01-06 11:51:53,103][model8_pretrain.py][INFO] Epoch:[0/2](850300/4588595) loss:3.053 lr:0.0000100 epoch_Time:23794.0min: [2024-01-06 11:51:53,103][model8_pretrain.py][INFO] Epoch:[0/2](850300/4588595) loss:3.069 lr:0.0000100 epoch_Time:23794.0min: [2024-01-06 11:51:53,104][model8_pretrain.py][INFO] Epoch:[0/2](850300/4588595) loss:2.667 lr:0.0000100 epoch_Time:23794.0min: [2024-01-06 11:51:53,104][model8_pretrain.py][INFO] Epoch:[0/2](850300/4588595) loss:3.255 lr:0.0000100 
epoch_Time:23794.0min: [2024-01-06 11:51:53,103][model8_pretrain.py][INFO] Epoch:[0/2](850300/4588595) loss:2.607 lr:0.0000100 epoch_Time:23794.0min: [2024-01-06 11:51:53,104][model8_pretrain.py][INFO] Epoch:[0/2](850300/4588595) loss:2.922 lr:0.0000100 epoch_Time:23794.0min: [2024-01-06 11:51:53,104][model8_pretrain.py][INFO] Epoch:[0/2](850300/4588595) loss:3.069 lr:0.0000100 epoch_Time:23794.0min: [2024-01-06 11:51:53,104][model8_pretrain.py][INFO] Epoch:[0/2](850300/4588595) loss:2.744 lr:0.0000100 epoch_Time:23794.0min: [2024-01-06 11:52:30,034][model8_pretrain.py][INFO] Epoch:[0/2](850400/4588595) loss:2.929 lr:0.0000100 epoch_Time:23793.0min: [2024-01-06 11:52:30,034][model8_pretrain.py][INFO] Epoch:[0/2](850400/4588595) loss:3.162 lr:0.0000100 epoch_Time:23793.0min: [2024-01-06 11:52:30,034][model8_pretrain.py][INFO] Epoch:[0/2](850400/4588595) loss:2.696 lr:0.0000100 epoch_Time:23793.0min: [2024-01-06 11:52:30,034][model8_pretrain.py][INFO] Epoch:[0/2](850400/4588595) loss:2.912 lr:0.0000100 epoch_Time:23793.0min: [2024-01-06 11:52:30,034][model8_pretrain.py][INFO] Epoch:[0/2](850400/4588595) loss:2.818 lr:0.0000100 epoch_Time:23793.0min: [2024-01-06 11:52:30,034][model8_pretrain.py][INFO] Epoch:[0/2](850400/4588595) loss:2.834 lr:0.0000100 epoch_Time:23793.0min: [2024-01-06 11:52:30,034][model8_pretrain.py][INFO] Epoch:[0/2](850400/4588595) loss:2.384 lr:0.0000100 epoch_Time:23793.0min: [2024-01-06 11:52:30,034][model8_pretrain.py][INFO] Epoch:[0/2](850400/4588595) loss:2.183 lr:0.0000100 epoch_Time:23793.0min: [2024-01-06 11:53:06,966][model8_pretrain.py][INFO] Epoch:[0/2](850500/4588595) loss:2.959 lr:0.0000100 epoch_Time:23792.0min: [2024-01-06 11:53:06,966][model8_pretrain.py][INFO] Epoch:[0/2](850500/4588595) loss:2.620 lr:0.0000100 epoch_Time:23792.0min: [2024-01-06 11:53:06,966][model8_pretrain.py][INFO] Epoch:[0/2](850500/4588595) loss:3.012 lr:0.0000100 epoch_Time:23792.0min: [2024-01-06 11:53:06,966][model8_pretrain.py][INFO] 
Epoch:[0/2](850500/4588595) loss:3.071 lr:0.0000100 epoch_Time:23792.0min: [2024-01-06 11:53:06,966][model8_pretrain.py][INFO] Epoch:[0/2](850500/4588595) loss:2.451 lr:0.0000100 epoch_Time:23792.0min: [2024-01-06 11:53:06,966][model8_pretrain.py][INFO] Epoch:[0/2](850500/4588595) loss:3.327 lr:0.0000100 epoch_Time:23792.0min: [2024-01-06 11:53:06,966][model8_pretrain.py][INFO] Epoch:[0/2](850500/4588595) loss:2.968 lr:0.0000100 epoch_Time:23792.0min: [2024-01-06 11:53:06,966][model8_pretrain.py][INFO] Epoch:[0/2](850500/4588595) loss:2.895 lr:0.0000100 epoch_Time:23792.0min: [2024-01-06 11:53:43,905][model8_pretrain.py][INFO] Epoch:[0/2](850600/4588595) loss:2.343 lr:0.0000100 epoch_Time:23792.0min: [2024-01-06 11:53:43,906][model8_pretrain.py][INFO] Epoch:[0/2](850600/4588595) loss:2.044 lr:0.0000100 epoch_Time:23792.0min: [2024-01-06 11:53:43,906][model8_pretrain.py][INFO] Epoch:[0/2](850600/4588595) loss:3.039 lr:0.0000100 epoch_Time:23792.0min: [2024-01-06 11:53:43,906][model8_pretrain.py][INFO] Epoch:[0/2](850600/4588595) loss:3.062 lr:0.0000100 epoch_Time:23792.0min: [2024-01-06 11:53:43,906][model8_pretrain.py][INFO] Epoch:[0/2](850600/4588595) loss:2.980 lr:0.0000100 epoch_Time:23792.0min: [2024-01-06 11:53:43,906][model8_pretrain.py][INFO] Epoch:[0/2](850600/4588595) loss:2.422 lr:0.0000100 epoch_Time:23792.0min: [2024-01-06 11:53:43,906][model8_pretrain.py][INFO] Epoch:[0/2](850600/4588595) loss:3.108 lr:0.0000100 epoch_Time:23792.0min: [2024-01-06 11:53:43,907][model8_pretrain.py][INFO] Epoch:[0/2](850600/4588595) loss:2.831 lr:0.0000100 epoch_Time:23792.0min: [2024-01-06 11:54:20,874][model8_pretrain.py][INFO] Epoch:[0/2](850700/4588595) loss:2.870 lr:0.0000100 epoch_Time:23791.0min: [2024-01-06 11:54:20,874][model8_pretrain.py][INFO] Epoch:[0/2](850700/4588595) loss:2.859 lr:0.0000100 epoch_Time:23791.0min: [2024-01-06 11:54:20,874][model8_pretrain.py][INFO] Epoch:[0/2](850700/4588595) loss:2.522 lr:0.0000100 epoch_Time:23791.0min: [2024-01-06 
11:54:20,874][model8_pretrain.py][INFO] Epoch:[0/2](850700/4588595) loss:2.897 lr:0.0000100 epoch_Time:23791.0min: [2024-01-06 11:54:20,874][model8_pretrain.py][INFO] Epoch:[0/2](850700/4588595) loss:3.299 lr:0.0000100 epoch_Time:23791.0min: [2024-01-06 11:54:20,874][model8_pretrain.py][INFO] Epoch:[0/2](850700/4588595) loss:3.144 lr:0.0000100 epoch_Time:23791.0min: [2024-01-06 11:54:20,874][model8_pretrain.py][INFO] Epoch:[0/2](850700/4588595) loss:2.780 lr:0.0000100 epoch_Time:23791.0min: [2024-01-06 11:54:20,874][model8_pretrain.py][INFO] Epoch:[0/2](850700/4588595) loss:3.224 lr:0.0000100 epoch_Time:23791.0min: [2024-01-06 11:54:57,817][model8_pretrain.py][INFO] Epoch:[0/2](850800/4588595) loss:3.133 lr:0.0000100 epoch_Time:23790.0min: [2024-01-06 11:54:57,817][model8_pretrain.py][INFO] Epoch:[0/2](850800/4588595) loss:2.934 lr:0.0000100 epoch_Time:23790.0min: [2024-01-06 11:54:57,817][model8_pretrain.py][INFO] Epoch:[0/2](850800/4588595) loss:3.362 lr:0.0000100 epoch_Time:23790.0min: [2024-01-06 11:54:57,817][model8_pretrain.py][INFO] Epoch:[0/2](850800/4588595) loss:3.012 lr:0.0000100 epoch_Time:23790.0min: [2024-01-06 11:54:57,817][model8_pretrain.py][INFO] Epoch:[0/2](850800/4588595) loss:2.625 lr:0.0000100 epoch_Time:23790.0min: [2024-01-06 11:54:57,818][model8_pretrain.py][INFO] Epoch:[0/2](850800/4588595) loss:3.425 lr:0.0000100 epoch_Time:23790.0min: [2024-01-06 11:54:57,818][model8_pretrain.py][INFO] Epoch:[0/2](850800/4588595) loss:2.380 lr:0.0000100 epoch_Time:23790.0min: [2024-01-06 11:54:57,822][model8_pretrain.py][INFO] Epoch:[0/2](850800/4588595) loss:2.307 lr:0.0000100 epoch_Time:23790.0min: [2024-01-06 11:55:34,798][model8_pretrain.py][INFO] Epoch:[0/2](850900/4588595) loss:2.452 lr:0.0000100 epoch_Time:23790.0min: [2024-01-06 11:55:34,798][model8_pretrain.py][INFO] Epoch:[0/2](850900/4588595) loss:2.911 lr:0.0000100 epoch_Time:23790.0min: [2024-01-06 11:55:34,798][model8_pretrain.py][INFO] Epoch:[0/2](850900/4588595) loss:2.732 lr:0.0000100 
epoch_Time:23790.0min: [2024-01-06 11:55:34,798][model8_pretrain.py][INFO] Epoch:[0/2](850900/4588595) loss:3.011 lr:0.0000100 epoch_Time:23790.0min: [2024-01-06 11:55:34,798][model8_pretrain.py][INFO] Epoch:[0/2](850900/4588595) loss:2.659 lr:0.0000100 epoch_Time:23790.0min: [2024-01-06 11:55:34,798][model8_pretrain.py][INFO] Epoch:[0/2](850900/4588595) loss:3.315 lr:0.0000100 epoch_Time:23790.0min: [2024-01-06 11:55:34,798][model8_pretrain.py][INFO] Epoch:[0/2](850900/4588595) loss:3.308 lr:0.0000100 epoch_Time:23790.0min: [2024-01-06 11:55:34,798][model8_pretrain.py][INFO] Epoch:[0/2](850900/4588595) loss:2.804 lr:0.0000100 epoch_Time:23790.0min: [2024-01-06 11:56:22,059][model8_pretrain.py][INFO] Epoch:[0/2](851000/4588595) loss:3.119 lr:0.0000100 epoch_Time:23790.0min: [2024-01-06 11:56:22,059][model8_pretrain.py][INFO] Epoch:[0/2](851000/4588595) loss:3.158 lr:0.0000100 epoch_Time:23790.0min: [2024-01-06 11:56:22,059][model8_pretrain.py][INFO] Epoch:[0/2](851000/4588595) loss:2.514 lr:0.0000100 epoch_Time:23790.0min: [2024-01-06 11:56:22,059][model8_pretrain.py][INFO] Epoch:[0/2](851000/4588595) loss:2.859 lr:0.0000100 epoch_Time:23790.0min: [2024-01-06 11:56:22,059][model8_pretrain.py][INFO] Epoch:[0/2](851000/4588595) loss:2.926 lr:0.0000100 epoch_Time:23790.0min: [2024-01-06 11:56:22,059][model8_pretrain.py][INFO] Epoch:[0/2](851000/4588595) loss:2.768 lr:0.0000100 epoch_Time:23790.0min: [2024-01-06 11:56:22,059][model8_pretrain.py][INFO] Epoch:[0/2](851000/4588595) loss:2.872 lr:0.0000100 epoch_Time:23790.0min: [2024-01-06 11:56:22,059][model8_pretrain.py][INFO] Epoch:[0/2](851000/4588595) loss:3.188 lr:0.0000100 epoch_Time:23790.0min: [2024-01-06 11:56:58,995][model8_pretrain.py][INFO] Epoch:[0/2](851100/4588595) loss:2.707 lr:0.0000100 epoch_Time:23789.0min: [2024-01-06 11:56:58,995][model8_pretrain.py][INFO] Epoch:[0/2](851100/4588595) loss:2.355 lr:0.0000100 epoch_Time:23789.0min: [2024-01-06 11:56:58,995][model8_pretrain.py][INFO] 
Epoch:[0/2](851100/4588595) loss:2.872 lr:0.0000100 epoch_Time:23789.0min: [2024-01-06 11:56:58,995][model8_pretrain.py][INFO] Epoch:[0/2](851100/4588595) loss:2.560 lr:0.0000100 epoch_Time:23789.0min: [2024-01-06 11:56:58,995][model8_pretrain.py][INFO] Epoch:[0/2](851100/4588595) loss:3.137 lr:0.0000100 epoch_Time:23789.0min: [2024-01-06 11:56:58,995][model8_pretrain.py][INFO] Epoch:[0/2](851100/4588595) loss:2.887 lr:0.0000100 epoch_Time:23789.0min: [2024-01-06 11:56:58,995][model8_pretrain.py][INFO] Epoch:[0/2](851100/4588595) loss:3.143 lr:0.0000100 epoch_Time:23789.0min: [2024-01-06 11:56:58,995][model8_pretrain.py][INFO] Epoch:[0/2](851100/4588595) loss:3.592 lr:0.0000100 epoch_Time:23789.0min: [2024-01-06 11:57:35,948][model8_pretrain.py][INFO] Epoch:[0/2](851200/4588595) loss:3.096 lr:0.0000100 epoch_Time:23788.0min: [2024-01-06 11:57:35,948][model8_pretrain.py][INFO] Epoch:[0/2](851200/4588595) loss:2.775 lr:0.0000100 epoch_Time:23788.0min: [2024-01-06 11:57:35,948][model8_pretrain.py][INFO] Epoch:[0/2](851200/4588595) loss:2.786 lr:0.0000100 epoch_Time:23788.0min: [2024-01-06 11:57:35,948][model8_pretrain.py][INFO] Epoch:[0/2](851200/4588595) loss:3.270 lr:0.0000100 epoch_Time:23788.0min: [2024-01-06 11:57:35,948][model8_pretrain.py][INFO] Epoch:[0/2](851200/4588595) loss:2.826 lr:0.0000100 epoch_Time:23788.0min: [2024-01-06 11:57:35,948][model8_pretrain.py][INFO] Epoch:[0/2](851200/4588595) loss:2.641 lr:0.0000100 epoch_Time:23788.0min: [2024-01-06 11:57:35,948][model8_pretrain.py][INFO] Epoch:[0/2](851200/4588595) loss:2.692 lr:0.0000100 epoch_Time:23788.0min: [2024-01-06 11:57:35,948][model8_pretrain.py][INFO] Epoch:[0/2](851200/4588595) loss:2.679 lr:0.0000100 epoch_Time:23788.0min: [2024-01-06 11:58:12,899][model8_pretrain.py][INFO] Epoch:[0/2](851300/4588595) loss:2.910 lr:0.0000100 epoch_Time:23787.0min: [2024-01-06 11:58:12,899][model8_pretrain.py][INFO] Epoch:[0/2](851300/4588595) loss:3.236 lr:0.0000100 epoch_Time:23787.0min: [2024-01-06 
11:58:12,899][model8_pretrain.py][INFO] Epoch:[0/2](851300/4588595) loss:2.981 lr:0.0000100 epoch_Time:23787.0min: [2024-01-06 11:58:12,899][model8_pretrain.py][INFO] Epoch:[0/2](851300/4588595) loss:2.660 lr:0.0000100 epoch_Time:23787.0min: [2024-01-06 11:58:12,899][model8_pretrain.py][INFO] Epoch:[0/2](851300/4588595) loss:3.211 lr:0.0000100 epoch_Time:23787.0min: [2024-01-06 11:58:12,899][model8_pretrain.py][INFO] Epoch:[0/2](851300/4588595) loss:2.964 lr:0.0000100 epoch_Time:23787.0min: [2024-01-06 11:58:12,899][model8_pretrain.py][INFO] Epoch:[0/2](851300/4588595) loss:3.066 lr:0.0000100 epoch_Time:23787.0min: [2024-01-06 11:58:12,899][model8_pretrain.py][INFO] Epoch:[0/2](851300/4588595) loss:2.615 lr:0.0000100 epoch_Time:23787.0min: [2024-01-06 11:58:49,862][model8_pretrain.py][INFO] Epoch:[0/2](851400/4588595) loss:2.742 lr:0.0000100 epoch_Time:23786.0min: [2024-01-06 11:58:49,862][model8_pretrain.py][INFO] Epoch:[0/2](851400/4588595) loss:2.425 lr:0.0000100 epoch_Time:23786.0min: [2024-01-06 11:58:49,862][model8_pretrain.py][INFO] Epoch:[0/2](851400/4588595) loss:2.545 lr:0.0000100 epoch_Time:23786.0min: [2024-01-06 11:58:49,862][model8_pretrain.py][INFO] Epoch:[0/2](851400/4588595) loss:2.885 lr:0.0000100 epoch_Time:23786.0min: [2024-01-06 11:58:49,862][model8_pretrain.py][INFO] Epoch:[0/2](851400/4588595) loss:2.474 lr:0.0000100 epoch_Time:23786.0min: [2024-01-06 11:58:49,862][model8_pretrain.py][INFO] Epoch:[0/2](851400/4588595) loss:3.366 lr:0.0000100 epoch_Time:23786.0min: [2024-01-06 11:58:49,862][model8_pretrain.py][INFO] Epoch:[0/2](851400/4588595) loss:2.821 lr:0.0000100 epoch_Time:23786.0min: [2024-01-06 11:58:49,862][model8_pretrain.py][INFO] Epoch:[0/2](851400/4588595) loss:3.080 lr:0.0000100 epoch_Time:23786.0min: [2024-01-06 11:59:26,831][model8_pretrain.py][INFO] Epoch:[0/2](851500/4588595) loss:2.740 lr:0.0000100 epoch_Time:23786.0min: [2024-01-06 11:59:26,831][model8_pretrain.py][INFO] Epoch:[0/2](851500/4588595) loss:3.057 lr:0.0000100 
epoch_Time:23786.0min: [2024-01-06 11:59:26,831][model8_pretrain.py][INFO] Epoch:[0/2](851500/4588595) loss:2.909 lr:0.0000100 epoch_Time:23786.0min: [2024-01-06 11:59:26,831][model8_pretrain.py][INFO] Epoch:[0/2](851500/4588595) loss:2.540 lr:0.0000100 epoch_Time:23786.0min: [2024-01-06 11:59:26,831][model8_pretrain.py][INFO] Epoch:[0/2](851500/4588595) loss:3.014 lr:0.0000100 epoch_Time:23786.0min: [2024-01-06 11:59:26,831][model8_pretrain.py][INFO] Epoch:[0/2](851500/4588595) loss:2.901 lr:0.0000100 epoch_Time:23786.0min: [2024-01-06 11:59:26,831][model8_pretrain.py][INFO] Epoch:[0/2](851500/4588595) loss:2.855 lr:0.0000100 epoch_Time:23786.0min: [2024-01-06 11:59:26,831][model8_pretrain.py][INFO] Epoch:[0/2](851500/4588595) loss:2.841 lr:0.0000100 epoch_Time:23786.0min: [2024-01-06 12:00:03,782][model8_pretrain.py][INFO] Epoch:[0/2](851600/4588595) loss:2.632 lr:0.0000100 epoch_Time:23785.0min: [2024-01-06 12:00:03,782][model8_pretrain.py][INFO] Epoch:[0/2](851600/4588595) loss:2.603 lr:0.0000100 epoch_Time:23785.0min: [2024-01-06 12:00:03,782][model8_pretrain.py][INFO] Epoch:[0/2](851600/4588595) loss:2.616 lr:0.0000100 epoch_Time:23785.0min: [2024-01-06 12:00:03,782][model8_pretrain.py][INFO] Epoch:[0/2](851600/4588595) loss:2.661 lr:0.0000100 epoch_Time:23785.0min: [2024-01-06 12:00:03,782][model8_pretrain.py][INFO] Epoch:[0/2](851600/4588595) loss:2.116 lr:0.0000100 epoch_Time:23785.0min: [2024-01-06 12:00:03,782][model8_pretrain.py][INFO] Epoch:[0/2](851600/4588595) loss:2.697 lr:0.0000100 epoch_Time:23785.0min: [2024-01-06 12:00:03,782][model8_pretrain.py][INFO] Epoch:[0/2](851600/4588595) loss:2.518 lr:0.0000100 epoch_Time:23785.0min: [2024-01-06 12:00:03,783][model8_pretrain.py][INFO] Epoch:[0/2](851600/4588595) loss:3.458 lr:0.0000100 epoch_Time:23785.0min: [2024-01-06 12:00:40,742][model8_pretrain.py][INFO] Epoch:[0/2](851700/4588595) loss:2.340 lr:0.0000100 epoch_Time:23785.0min: [2024-01-06 12:00:40,742][model8_pretrain.py][INFO] 
Epoch:[0/2](851700/4588595) loss:2.062 lr:0.0000100 epoch_Time:23785.0min: [2024-01-06 12:00:40,742][model8_pretrain.py][INFO] Epoch:[0/2](851700/4588595) loss:2.565 lr:0.0000100 epoch_Time:23785.0min: [2024-01-06 12:00:40,742][model8_pretrain.py][INFO] Epoch:[0/2](851700/4588595) loss:2.485 lr:0.0000100 epoch_Time:23785.0min: [2024-01-06 12:00:40,742][model8_pretrain.py][INFO] Epoch:[0/2](851700/4588595) loss:2.743 lr:0.0000100 epoch_Time:23785.0min: [2024-01-06 12:00:40,742][model8_pretrain.py][INFO] Epoch:[0/2](851700/4588595) loss:2.072 lr:0.0000100 epoch_Time:23785.0min: [2024-01-06 12:00:40,742][model8_pretrain.py][INFO] Epoch:[0/2](851700/4588595) loss:2.918 lr:0.0000100 epoch_Time:23785.0min: [2024-01-06 12:00:40,742][model8_pretrain.py][INFO] Epoch:[0/2](851700/4588595) loss:3.149 lr:0.0000100 epoch_Time:23785.0min: [2024-01-06 12:01:26,337][model8_pretrain.py][INFO] Epoch:[0/2](851800/4588595) loss:3.207 lr:0.0000100 epoch_Time:23785.0min: [2024-01-06 12:01:26,337][model8_pretrain.py][INFO] Epoch:[0/2](851800/4588595) loss:3.029 lr:0.0000100 epoch_Time:23785.0min: [2024-01-06 12:01:26,337][model8_pretrain.py][INFO] Epoch:[0/2](851800/4588595) loss:3.100 lr:0.0000100 epoch_Time:23785.0min: [2024-01-06 12:01:26,337][model8_pretrain.py][INFO] Epoch:[0/2](851800/4588595) loss:2.596 lr:0.0000100 epoch_Time:23785.0min: [2024-01-06 12:01:26,337][model8_pretrain.py][INFO] Epoch:[0/2](851800/4588595) loss:3.110 lr:0.0000100 epoch_Time:23785.0min: [2024-01-06 12:01:26,337][model8_pretrain.py][INFO] Epoch:[0/2](851800/4588595) loss:3.212 lr:0.0000100 epoch_Time:23785.0min: [2024-01-06 12:01:26,337][model8_pretrain.py][INFO] Epoch:[0/2](851800/4588595) loss:2.789 lr:0.0000100 epoch_Time:23785.0min: [2024-01-06 12:01:26,337][model8_pretrain.py][INFO] Epoch:[0/2](851800/4588595) loss:2.393 lr:0.0000100 epoch_Time:23785.0min: [2024-01-06 12:02:04,728][model8_pretrain.py][INFO] Epoch:[0/2](851900/4588595) loss:2.754 lr:0.0000100 epoch_Time:23784.0min: [2024-01-06 
12:02:04,728][model8_pretrain.py][INFO] Epoch:[0/2](851900/4588595) loss:2.780 lr:0.0000100 epoch_Time:23784.0min: [2024-01-06 12:02:04,728][model8_pretrain.py][INFO] Epoch:[0/2](851900/4588595) loss:3.044 lr:0.0000100 epoch_Time:23784.0min: [2024-01-06 12:02:04,728][model8_pretrain.py][INFO] Epoch:[0/2](851900/4588595) loss:2.287 lr:0.0000100 epoch_Time:23784.0min: [2024-01-06 12:02:04,728][model8_pretrain.py][INFO] Epoch:[0/2](851900/4588595) loss:2.908 lr:0.0000100 epoch_Time:23784.0min: [2024-01-06 12:02:04,728][model8_pretrain.py][INFO] Epoch:[0/2](851900/4588595) loss:3.313 lr:0.0000100 epoch_Time:23784.0min: [2024-01-06 12:02:04,729][model8_pretrain.py][INFO] Epoch:[0/2](851900/4588595) loss:2.713 lr:0.0000100 epoch_Time:23784.0min: [2024-01-06 12:02:04,729][model8_pretrain.py][INFO] Epoch:[0/2](851900/4588595) loss:3.086 lr:0.0000100 epoch_Time:23784.0min: [2024-01-06 12:02:41,683][model8_pretrain.py][INFO] Epoch:[0/2](852000/4588595) loss:3.363 lr:0.0000100 epoch_Time:23783.0min: [2024-01-06 12:02:41,683][model8_pretrain.py][INFO] Epoch:[0/2](852000/4588595) loss:2.842 lr:0.0000100 epoch_Time:23783.0min: [2024-01-06 12:02:41,683][model8_pretrain.py][INFO] Epoch:[0/2](852000/4588595) loss:2.821 lr:0.0000100 epoch_Time:23783.0min: [2024-01-06 12:02:41,683][model8_pretrain.py][INFO] Epoch:[0/2](852000/4588595) loss:2.371 lr:0.0000100 epoch_Time:23783.0min: [2024-01-06 12:02:41,683][model8_pretrain.py][INFO] Epoch:[0/2](852000/4588595) loss:2.334 lr:0.0000100 epoch_Time:23783.0min: [2024-01-06 12:02:41,683][model8_pretrain.py][INFO] Epoch:[0/2](852000/4588595) loss:2.674 lr:0.0000100 epoch_Time:23783.0min: [2024-01-06 12:02:41,683][model8_pretrain.py][INFO] Epoch:[0/2](852000/4588595) loss:2.837 lr:0.0000100 epoch_Time:23783.0min: [2024-01-06 12:02:41,684][model8_pretrain.py][INFO] Epoch:[0/2](852000/4588595) loss:2.786 lr:0.0000100 epoch_Time:23783.0min: [2024-01-06 12:03:18,637][model8_pretrain.py][INFO] Epoch:[0/2](852100/4588595) loss:2.627 lr:0.0000100 
epoch_Time:23782.0min: [2024-01-06 12:03:18,637][model8_pretrain.py][INFO] Epoch:[0/2](852100/4588595) loss:2.692 lr:0.0000100 epoch_Time:23782.0min: [2024-01-06 12:03:18,637][model8_pretrain.py][INFO] Epoch:[0/2](852100/4588595) loss:2.011 lr:0.0000100 epoch_Time:23782.0min: [2024-01-06 12:03:18,638][model8_pretrain.py][INFO] Epoch:[0/2](852100/4588595) loss:2.366 lr:0.0000100 epoch_Time:23782.0min: [2024-01-06 12:03:18,637][model8_pretrain.py][INFO] Epoch:[0/2](852100/4588595) loss:2.777 lr:0.0000100 epoch_Time:23782.0min: [2024-01-06 12:03:18,638][model8_pretrain.py][INFO] Epoch:[0/2](852100/4588595) loss:3.366 lr:0.0000100 epoch_Time:23782.0min: [2024-01-06 12:03:18,638][model8_pretrain.py][INFO] Epoch:[0/2](852100/4588595) loss:2.272 lr:0.0000100 epoch_Time:23782.0min: [2024-01-06 12:03:18,638][model8_pretrain.py][INFO] Epoch:[0/2](852100/4588595) loss:2.895 lr:0.0000100 epoch_Time:23782.0min: [2024-01-06 12:03:55,585][model8_pretrain.py][INFO] Epoch:[0/2](852200/4588595) loss:3.265 lr:0.0000100 epoch_Time:23781.0min: [2024-01-06 12:03:55,585][model8_pretrain.py][INFO] Epoch:[0/2](852200/4588595) loss:2.926 lr:0.0000100 epoch_Time:23781.0min: [2024-01-06 12:03:55,585][model8_pretrain.py][INFO] Epoch:[0/2](852200/4588595) loss:3.167 lr:0.0000100 epoch_Time:23781.0min: [2024-01-06 12:03:55,585][model8_pretrain.py][INFO] Epoch:[0/2](852200/4588595) loss:2.685 lr:0.0000100 epoch_Time:23781.0min: [2024-01-06 12:03:55,585][model8_pretrain.py][INFO] Epoch:[0/2](852200/4588595) loss:2.124 lr:0.0000100 epoch_Time:23781.0min: [2024-01-06 12:03:55,585][model8_pretrain.py][INFO] Epoch:[0/2](852200/4588595) loss:2.846 lr:0.0000100 epoch_Time:23781.0min: [2024-01-06 12:03:55,585][model8_pretrain.py][INFO] Epoch:[0/2](852200/4588595) loss:2.701 lr:0.0000100 epoch_Time:23781.0min: [2024-01-06 12:03:55,586][model8_pretrain.py][INFO] Epoch:[0/2](852200/4588595) loss:2.786 lr:0.0000100 epoch_Time:23781.0min: [2024-01-06 12:04:32,558][model8_pretrain.py][INFO] 
Epoch:[0/2](852300/4588595) loss:3.126 lr:0.0000100 epoch_Time:23781.0min: [2024-01-06 12:04:32,558][model8_pretrain.py][INFO] Epoch:[0/2](852300/4588595) loss:2.392 lr:0.0000100 epoch_Time:23781.0min: [2024-01-06 12:04:32,558][model8_pretrain.py][INFO] Epoch:[0/2](852300/4588595) loss:3.127 lr:0.0000100 epoch_Time:23781.0min: [2024-01-06 12:04:32,558][model8_pretrain.py][INFO] Epoch:[0/2](852300/4588595) loss:2.938 lr:0.0000100 epoch_Time:23781.0min: [2024-01-06 12:04:32,558][model8_pretrain.py][INFO] Epoch:[0/2](852300/4588595) loss:3.072 lr:0.0000100 epoch_Time:23781.0min: [2024-01-06 12:04:32,558][model8_pretrain.py][INFO] Epoch:[0/2](852300/4588595) loss:2.645 lr:0.0000100 epoch_Time:23781.0min: [2024-01-06 12:04:32,558][model8_pretrain.py][INFO] Epoch:[0/2](852300/4588595) loss:3.005 lr:0.0000100 epoch_Time:23781.0min: [2024-01-06 12:04:32,559][model8_pretrain.py][INFO] Epoch:[0/2](852300/4588595) loss:2.833 lr:0.0000100 epoch_Time:23781.0min: [2024-01-06 12:05:09,516][model8_pretrain.py][INFO] Epoch:[0/2](852400/4588595) loss:2.082 lr:0.0000100 epoch_Time:23780.0min: [2024-01-06 12:05:09,516][model8_pretrain.py][INFO] Epoch:[0/2](852400/4588595) loss:2.849 lr:0.0000100 epoch_Time:23780.0min: [2024-01-06 12:05:09,516][model8_pretrain.py][INFO] Epoch:[0/2](852400/4588595) loss:2.673 lr:0.0000100 epoch_Time:23780.0min: [2024-01-06 12:05:09,516][model8_pretrain.py][INFO] Epoch:[0/2](852400/4588595) loss:3.197 lr:0.0000100 epoch_Time:23780.0min: [2024-01-06 12:05:09,516][model8_pretrain.py][INFO] Epoch:[0/2](852400/4588595) loss:2.613 lr:0.0000100 epoch_Time:23780.0min: [2024-01-06 12:05:09,516][model8_pretrain.py][INFO] Epoch:[0/2](852400/4588595) loss:2.660 lr:0.0000100 epoch_Time:23780.0min: [2024-01-06 12:05:09,516][model8_pretrain.py][INFO] Epoch:[0/2](852400/4588595) loss:3.085 lr:0.0000100 epoch_Time:23780.0min: [2024-01-06 12:05:09,516][model8_pretrain.py][INFO] Epoch:[0/2](852400/4588595) loss:2.543 lr:0.0000100 epoch_Time:23780.0min: [2024-01-06 
12:05:46,469][model8_pretrain.py][INFO] Epoch:[0/2](852500/4588595) loss:2.863 lr:0.0000100 epoch_Time:23780.0min: [2024-01-06 12:05:46,469][model8_pretrain.py][INFO] Epoch:[0/2](852500/4588595) loss:2.760 lr:0.0000100 epoch_Time:23780.0min: [2024-01-06 12:05:46,469][model8_pretrain.py][INFO] Epoch:[0/2](852500/4588595) loss:2.868 lr:0.0000100 epoch_Time:23780.0min: [2024-01-06 12:05:46,469][model8_pretrain.py][INFO] Epoch:[0/2](852500/4588595) loss:2.862 lr:0.0000100 epoch_Time:23780.0min: [2024-01-06 12:05:46,469][model8_pretrain.py][INFO] Epoch:[0/2](852500/4588595) loss:3.268 lr:0.0000100 epoch_Time:23780.0min: [2024-01-06 12:05:46,469][model8_pretrain.py][INFO] Epoch:[0/2](852500/4588595) loss:3.276 lr:0.0000100 epoch_Time:23780.0min: [2024-01-06 12:05:46,469][model8_pretrain.py][INFO] Epoch:[0/2](852500/4588595) loss:2.408 lr:0.0000100 epoch_Time:23780.0min: [2024-01-06 12:05:46,469][model8_pretrain.py][INFO] Epoch:[0/2](852500/4588595) loss:2.708 lr:0.0000100 epoch_Time:23780.0min: [2024-01-06 12:06:32,151][model8_pretrain.py][INFO] Epoch:[0/2](852600/4588595) loss:3.093 lr:0.0000100 epoch_Time:23780.0min: [2024-01-06 12:06:32,151][model8_pretrain.py][INFO] Epoch:[0/2](852600/4588595) loss:2.149 lr:0.0000100 epoch_Time:23780.0min: [2024-01-06 12:06:32,151][model8_pretrain.py][INFO] Epoch:[0/2](852600/4588595) loss:3.111 lr:0.0000100 epoch_Time:23780.0min: [2024-01-06 12:06:32,151][model8_pretrain.py][INFO] Epoch:[0/2](852600/4588595) loss:3.348 lr:0.0000100 epoch_Time:23780.0min: [2024-01-06 12:06:32,151][model8_pretrain.py][INFO] Epoch:[0/2](852600/4588595) loss:3.057 lr:0.0000100 epoch_Time:23780.0min: [2024-01-06 12:06:32,151][model8_pretrain.py][INFO] Epoch:[0/2](852600/4588595) loss:2.989 lr:0.0000100 epoch_Time:23780.0min: [2024-01-06 12:06:32,151][model8_pretrain.py][INFO] Epoch:[0/2](852600/4588595) loss:3.305 lr:0.0000100 epoch_Time:23780.0min: [2024-01-06 12:06:32,151][model8_pretrain.py][INFO] Epoch:[0/2](852600/4588595) loss:2.826 lr:0.0000100 
epoch_Time:23780.0min: [2024-01-06 12:07:10,764][model8_pretrain.py][INFO] Epoch:[0/2](852700/4588595) loss:2.610 lr:0.0000100 epoch_Time:23779.0min: [2024-01-06 12:07:10,764][model8_pretrain.py][INFO] Epoch:[0/2](852700/4588595) loss:2.312 lr:0.0000100 epoch_Time:23779.0min: [2024-01-06 12:07:10,764][model8_pretrain.py][INFO] Epoch:[0/2](852700/4588595) loss:2.859 lr:0.0000100 epoch_Time:23779.0min: [2024-01-06 12:07:10,764][model8_pretrain.py][INFO] Epoch:[0/2](852700/4588595) loss:2.319 lr:0.0000100 epoch_Time:23779.0min: [2024-01-06 12:07:10,764][model8_pretrain.py][INFO] Epoch:[0/2](852700/4588595) loss:3.010 lr:0.0000100 epoch_Time:23779.0min: [2024-01-06 12:07:10,764][model8_pretrain.py][INFO] Epoch:[0/2](852700/4588595) loss:2.824 lr:0.0000100 epoch_Time:23779.0min: [2024-01-06 12:07:10,764][model8_pretrain.py][INFO] Epoch:[0/2](852700/4588595) loss:2.453 lr:0.0000100 epoch_Time:23779.0min: [2024-01-06 12:07:10,765][model8_pretrain.py][INFO] Epoch:[0/2](852700/4588595) loss:2.255 lr:0.0000100 epoch_Time:23779.0min: [2024-01-06 12:07:47,701][model8_pretrain.py][INFO] Epoch:[0/2](852800/4588595) loss:2.694 lr:0.0000100 epoch_Time:23777.0min: [2024-01-06 12:07:47,701][model8_pretrain.py][INFO] Epoch:[0/2](852800/4588595) loss:2.101 lr:0.0000100 epoch_Time:23777.0min: [2024-01-06 12:07:47,701][model8_pretrain.py][INFO] Epoch:[0/2](852800/4588595) loss:3.189 lr:0.0000100 epoch_Time:23777.0min: [2024-01-06 12:07:47,701][model8_pretrain.py][INFO] Epoch:[0/2](852800/4588595) loss:2.991 lr:0.0000100 epoch_Time:23777.0min: [2024-01-06 12:07:47,701][model8_pretrain.py][INFO] Epoch:[0/2](852800/4588595) loss:2.775 lr:0.0000100 epoch_Time:23777.0min: [2024-01-06 12:07:47,701][model8_pretrain.py][INFO] Epoch:[0/2](852800/4588595) loss:2.885 lr:0.0000100 epoch_Time:23777.0min: [2024-01-06 12:07:47,701][model8_pretrain.py][INFO] Epoch:[0/2](852800/4588595) loss:2.843 lr:0.0000100 epoch_Time:23777.0min: [2024-01-06 12:07:47,701][model8_pretrain.py][INFO] 
Epoch:[0/2](852800/4588595) loss:3.186 lr:0.0000100 epoch_Time:23777.0min: [2024-01-06 12:08:24,650][model8_pretrain.py][INFO] Epoch:[0/2](852900/4588595) loss:2.987 lr:0.0000100 epoch_Time:23777.0min: [2024-01-06 12:08:24,650][model8_pretrain.py][INFO] Epoch:[0/2](852900/4588595) loss:2.589 lr:0.0000100 epoch_Time:23777.0min: [2024-01-06 12:08:24,650][model8_pretrain.py][INFO] Epoch:[0/2](852900/4588595) loss:2.933 lr:0.0000100 epoch_Time:23777.0min: [2024-01-06 12:08:24,650][model8_pretrain.py][INFO] Epoch:[0/2](852900/4588595) loss:3.139 lr:0.0000100 epoch_Time:23777.0min: [2024-01-06 12:08:24,650][model8_pretrain.py][INFO] Epoch:[0/2](852900/4588595) loss:3.296 lr:0.0000100 epoch_Time:23777.0min: [2024-01-06 12:08:24,650][model8_pretrain.py][INFO] Epoch:[0/2](852900/4588595) loss:2.527 lr:0.0000100 epoch_Time:23777.0min: [2024-01-06 12:08:24,650][model8_pretrain.py][INFO] Epoch:[0/2](852900/4588595) loss:2.730 lr:0.0000100 epoch_Time:23777.0min: [2024-01-06 12:08:24,651][model8_pretrain.py][INFO] Epoch:[0/2](852900/4588595) loss:3.199 lr:0.0000100 epoch_Time:23777.0min: [2024-01-06 12:09:01,591][model8_pretrain.py][INFO] Epoch:[0/2](853000/4588595) loss:2.699 lr:0.0000100 epoch_Time:23776.0min: [2024-01-06 12:09:01,591][model8_pretrain.py][INFO] Epoch:[0/2](853000/4588595) loss:2.860 lr:0.0000100 epoch_Time:23776.0min: [2024-01-06 12:09:01,591][model8_pretrain.py][INFO] Epoch:[0/2](853000/4588595) loss:3.060 lr:0.0000100 epoch_Time:23776.0min: [2024-01-06 12:09:01,591][model8_pretrain.py][INFO] Epoch:[0/2](853000/4588595) loss:2.652 lr:0.0000100 epoch_Time:23776.0min: [2024-01-06 12:09:01,591][model8_pretrain.py][INFO] Epoch:[0/2](853000/4588595) loss:2.759 lr:0.0000100 epoch_Time:23776.0min: [2024-01-06 12:09:01,591][model8_pretrain.py][INFO] Epoch:[0/2](853000/4588595) loss:2.772 lr:0.0000100 epoch_Time:23776.0min: [2024-01-06 12:09:01,591][model8_pretrain.py][INFO] Epoch:[0/2](853000/4588595) loss:2.816 lr:0.0000100 epoch_Time:23776.0min: [2024-01-06 
12:09:01,591][model8_pretrain.py][INFO] Epoch:[0/2](853000/4588595) loss:2.784 lr:0.0000100 epoch_Time:23776.0min: [2024-01-06 12:09:38,532][model8_pretrain.py][INFO] Epoch:[0/2](853100/4588595) loss:2.253 lr:0.0000100 epoch_Time:23776.0min: [2024-01-06 12:09:38,532][model8_pretrain.py][INFO] Epoch:[0/2](853100/4588595) loss:3.083 lr:0.0000100 epoch_Time:23776.0min: [2024-01-06 12:09:38,532][model8_pretrain.py][INFO] Epoch:[0/2](853100/4588595) loss:2.518 lr:0.0000100 epoch_Time:23776.0min: [2024-01-06 12:09:38,532][model8_pretrain.py][INFO] Epoch:[0/2](853100/4588595) loss:2.643 lr:0.0000100 epoch_Time:23776.0min: [2024-01-06 12:09:38,532][model8_pretrain.py][INFO] Epoch:[0/2](853100/4588595) loss:2.464 lr:0.0000100 epoch_Time:23776.0min: [2024-01-06 12:09:38,532][model8_pretrain.py][INFO] Epoch:[0/2](853100/4588595) loss:3.050 lr:0.0000100 epoch_Time:23776.0min: [2024-01-06 12:09:38,532][model8_pretrain.py][INFO] Epoch:[0/2](853100/4588595) loss:3.082 lr:0.0000100 epoch_Time:23776.0min: [2024-01-06 12:09:38,532][model8_pretrain.py][INFO] Epoch:[0/2](853100/4588595) loss:2.664 lr:0.0000100 epoch_Time:23776.0min: [2024-01-06 12:10:15,482][model8_pretrain.py][INFO] Epoch:[0/2](853200/4588595) loss:3.201 lr:0.0000100 epoch_Time:23775.0min: [2024-01-06 12:10:15,482][model8_pretrain.py][INFO] Epoch:[0/2](853200/4588595) loss:2.722 lr:0.0000100 epoch_Time:23775.0min: [2024-01-06 12:10:15,482][model8_pretrain.py][INFO] Epoch:[0/2](853200/4588595) loss:2.729 lr:0.0000100 epoch_Time:23775.0min: [2024-01-06 12:10:15,482][model8_pretrain.py][INFO] Epoch:[0/2](853200/4588595) loss:2.810 lr:0.0000100 epoch_Time:23775.0min: [2024-01-06 12:10:15,482][model8_pretrain.py][INFO] Epoch:[0/2](853200/4588595) loss:2.657 lr:0.0000100 epoch_Time:23775.0min: [2024-01-06 12:10:15,482][model8_pretrain.py][INFO] Epoch:[0/2](853200/4588595) loss:2.582 lr:0.0000100 epoch_Time:23775.0min: [2024-01-06 12:10:15,482][model8_pretrain.py][INFO] Epoch:[0/2](853200/4588595) loss:3.133 lr:0.0000100 
epoch_Time:23775.0min: [2024-01-06 12:10:15,482][model8_pretrain.py][INFO] Epoch:[0/2](853200/4588595) loss:2.996 lr:0.0000100 epoch_Time:23775.0min: [2024-01-06 12:10:52,431][model8_pretrain.py][INFO] Epoch:[0/2](853300/4588595) loss:2.876 lr:0.0000100 epoch_Time:23774.0min: [2024-01-06 12:10:52,431][model8_pretrain.py][INFO] Epoch:[0/2](853300/4588595) loss:2.909 lr:0.0000100 epoch_Time:23774.0min: [2024-01-06 12:10:52,431][model8_pretrain.py][INFO] Epoch:[0/2](853300/4588595) loss:3.088 lr:0.0000100 epoch_Time:23774.0min: [2024-01-06 12:10:52,432][model8_pretrain.py][INFO] Epoch:[0/2](853300/4588595) loss:2.700 lr:0.0000100 epoch_Time:23774.0min: [2024-01-06 12:10:52,432][model8_pretrain.py][INFO] Epoch:[0/2](853300/4588595) loss:3.118 lr:0.0000100 epoch_Time:23774.0min: [2024-01-06 12:10:52,432][model8_pretrain.py][INFO] Epoch:[0/2](853300/4588595) loss:3.047 lr:0.0000100 epoch_Time:23774.0min: [2024-01-06 12:10:52,432][model8_pretrain.py][INFO] Epoch:[0/2](853300/4588595) loss:2.438 lr:0.0000100 epoch_Time:23774.0min: [2024-01-06 12:10:52,432][model8_pretrain.py][INFO] Epoch:[0/2](853300/4588595) loss:2.520 lr:0.0000100 epoch_Time:23774.0min: [2024-01-06 12:11:37,878][model8_pretrain.py][INFO] Epoch:[0/2](853400/4588595) loss:3.255 lr:0.0000100 epoch_Time:23775.0min: [2024-01-06 12:11:37,878][model8_pretrain.py][INFO] Epoch:[0/2](853400/4588595) loss:2.680 lr:0.0000100 epoch_Time:23775.0min: [2024-01-06 12:11:37,878][model8_pretrain.py][INFO] Epoch:[0/2](853400/4588595) loss:1.945 lr:0.0000100 epoch_Time:23775.0min: [2024-01-06 12:11:37,878][model8_pretrain.py][INFO] Epoch:[0/2](853400/4588595) loss:2.622 lr:0.0000100 epoch_Time:23775.0min: [2024-01-06 12:11:37,878][model8_pretrain.py][INFO] Epoch:[0/2](853400/4588595) loss:2.358 lr:0.0000100 epoch_Time:23775.0min: [2024-01-06 12:11:37,878][model8_pretrain.py][INFO] Epoch:[0/2](853400/4588595) loss:3.240 lr:0.0000100 epoch_Time:23775.0min: [2024-01-06 12:11:37,878][model8_pretrain.py][INFO] 
Epoch:[0/2](853400/4588595) loss:3.760 lr:0.0000100 epoch_Time:23775.0min: [2024-01-06 12:11:37,878][model8_pretrain.py][INFO] Epoch:[0/2](853400/4588595) loss:2.469 lr:0.0000100 epoch_Time:23775.0min: [2024-01-06 12:12:16,487][model8_pretrain.py][INFO] Epoch:[0/2](853500/4588595) loss:2.769 lr:0.0000100 epoch_Time:23774.0min: [2024-01-06 12:12:16,488][model8_pretrain.py][INFO] Epoch:[0/2](853500/4588595) loss:2.904 lr:0.0000100 epoch_Time:23774.0min: [2024-01-06 12:12:16,487][model8_pretrain.py][INFO] Epoch:[0/2](853500/4588595) loss:2.990 lr:0.0000100 epoch_Time:23774.0min: [2024-01-06 12:12:16,488][model8_pretrain.py][INFO] Epoch:[0/2](853500/4588595) loss:2.598 lr:0.0000100 epoch_Time:23774.0min: [2024-01-06 12:12:16,488][model8_pretrain.py][INFO] Epoch:[0/2](853500/4588595) loss:3.586 lr:0.0000100 epoch_Time:23774.0min: [2024-01-06 12:12:16,488][model8_pretrain.py][INFO] Epoch:[0/2](853500/4588595) loss:3.270 lr:0.0000100 epoch_Time:23774.0min: [2024-01-06 12:12:16,488][model8_pretrain.py][INFO] Epoch:[0/2](853500/4588595) loss:2.890 lr:0.0000100 epoch_Time:23774.0min: [2024-01-06 12:12:16,488][model8_pretrain.py][INFO] Epoch:[0/2](853500/4588595) loss:2.853 lr:0.0000100 epoch_Time:23774.0min: [2024-01-06 12:12:53,425][model8_pretrain.py][INFO] Epoch:[0/2](853600/4588595) loss:3.033 lr:0.0000100 epoch_Time:23773.0min: [2024-01-06 12:12:53,425][model8_pretrain.py][INFO] Epoch:[0/2](853600/4588595) loss:2.831 lr:0.0000100 epoch_Time:23773.0min: [2024-01-06 12:12:53,425][model8_pretrain.py][INFO] Epoch:[0/2](853600/4588595) loss:2.922 lr:0.0000100 epoch_Time:23773.0min: [2024-01-06 12:12:53,425][model8_pretrain.py][INFO] Epoch:[0/2](853600/4588595) loss:3.163 lr:0.0000100 epoch_Time:23773.0min: [2024-01-06 12:12:53,425][model8_pretrain.py][INFO] Epoch:[0/2](853600/4588595) loss:3.380 lr:0.0000100 epoch_Time:23773.0min: [2024-01-06 12:12:53,425][model8_pretrain.py][INFO] Epoch:[0/2](853600/4588595) loss:3.006 lr:0.0000100 epoch_Time:23773.0min: [2024-01-06 
12:12:53,425][model8_pretrain.py][INFO] Epoch:[0/2](853600/4588595) loss:2.523 lr:0.0000100 epoch_Time:23773.0min: [2024-01-06 12:12:53,425][model8_pretrain.py][INFO] Epoch:[0/2](853600/4588595) loss:3.184 lr:0.0000100 epoch_Time:23773.0min: [2024-01-06 12:13:30,379][model8_pretrain.py][INFO] Epoch:[0/2](853700/4588595) loss:2.891 lr:0.0000100 epoch_Time:23772.0min: [2024-01-06 12:13:30,379][model8_pretrain.py][INFO] Epoch:[0/2](853700/4588595) loss:2.940 lr:0.0000100 epoch_Time:23772.0min: [2024-01-06 12:13:30,379][model8_pretrain.py][INFO] Epoch:[0/2](853700/4588595) loss:2.440 lr:0.0000100 epoch_Time:23772.0min: [2024-01-06 12:13:30,379][model8_pretrain.py][INFO] Epoch:[0/2](853700/4588595) loss:3.265 lr:0.0000100 epoch_Time:23772.0min: [2024-01-06 12:13:30,379][model8_pretrain.py][INFO] Epoch:[0/2](853700/4588595) loss:2.450 lr:0.0000100 epoch_Time:23772.0min: [2024-01-06 12:13:30,379][model8_pretrain.py][INFO] Epoch:[0/2](853700/4588595) loss:3.261 lr:0.0000100 epoch_Time:23772.0min: [2024-01-06 12:13:30,379][model8_pretrain.py][INFO] Epoch:[0/2](853700/4588595) loss:2.481 lr:0.0000100 epoch_Time:23772.0min: [2024-01-06 12:13:30,379][model8_pretrain.py][INFO] Epoch:[0/2](853700/4588595) loss:3.585 lr:0.0000100 epoch_Time:23772.0min: [2024-01-06 12:14:07,338][model8_pretrain.py][INFO] Epoch:[0/2](853800/4588595) loss:2.135 lr:0.0000100 epoch_Time:23771.0min: [2024-01-06 12:14:07,338][model8_pretrain.py][INFO] Epoch:[0/2](853800/4588595) loss:2.413 lr:0.0000100 epoch_Time:23771.0min: [2024-01-06 12:14:07,338][model8_pretrain.py][INFO] Epoch:[0/2](853800/4588595) loss:2.930 lr:0.0000100 epoch_Time:23771.0min: [2024-01-06 12:14:07,338][model8_pretrain.py][INFO] Epoch:[0/2](853800/4588595) loss:2.895 lr:0.0000100 epoch_Time:23771.0min: [2024-01-06 12:14:07,338][model8_pretrain.py][INFO] Epoch:[0/2](853800/4588595) loss:2.356 lr:0.0000100 epoch_Time:23771.0min: [2024-01-06 12:14:07,338][model8_pretrain.py][INFO] Epoch:[0/2](853800/4588595) loss:3.342 lr:0.0000100 
epoch_Time:23771.0min: [2024-01-06 12:14:07,338][model8_pretrain.py][INFO] Epoch:[0/2](853800/4588595) loss:3.030 lr:0.0000100 epoch_Time:23771.0min: [2024-01-06 12:14:07,338][model8_pretrain.py][INFO] Epoch:[0/2](853800/4588595) loss:2.152 lr:0.0000100 epoch_Time:23771.0min: [2024-01-06 12:14:44,285][model8_pretrain.py][INFO] Epoch:[0/2](853900/4588595) loss:2.910 lr:0.0000100 epoch_Time:23771.0min: [2024-01-06 12:14:44,285][model8_pretrain.py][INFO] Epoch:[0/2](853900/4588595) loss:2.946 lr:0.0000100 epoch_Time:23771.0min: [2024-01-06 12:14:44,285][model8_pretrain.py][INFO] Epoch:[0/2](853900/4588595) loss:2.643 lr:0.0000100 epoch_Time:23771.0min: [2024-01-06 12:14:44,285][model8_pretrain.py][INFO] Epoch:[0/2](853900/4588595) loss:3.027 lr:0.0000100 epoch_Time:23771.0min: [2024-01-06 12:14:44,285][model8_pretrain.py][INFO] Epoch:[0/2](853900/4588595) loss:3.237 lr:0.0000100 epoch_Time:23771.0min: [2024-01-06 12:14:44,285][model8_pretrain.py][INFO] Epoch:[0/2](853900/4588595) loss:2.792 lr:0.0000100 epoch_Time:23771.0min: [2024-01-06 12:14:44,285][model8_pretrain.py][INFO] Epoch:[0/2](853900/4588595) loss:3.031 lr:0.0000100 epoch_Time:23771.0min: [2024-01-06 12:14:44,285][model8_pretrain.py][INFO] Epoch:[0/2](853900/4588595) loss:2.387 lr:0.0000100 epoch_Time:23771.0min: [2024-01-06 12:15:21,217][model8_pretrain.py][INFO] Epoch:[0/2](854000/4588595) loss:2.547 lr:0.0000100 epoch_Time:23770.0min: [2024-01-06 12:15:21,217][model8_pretrain.py][INFO] Epoch:[0/2](854000/4588595) loss:2.847 lr:0.0000100 epoch_Time:23770.0min: [2024-01-06 12:15:21,217][model8_pretrain.py][INFO] Epoch:[0/2](854000/4588595) loss:3.216 lr:0.0000100 epoch_Time:23770.0min: [2024-01-06 12:15:21,217][model8_pretrain.py][INFO] Epoch:[0/2](854000/4588595) loss:3.109 lr:0.0000100 epoch_Time:23770.0min: [2024-01-06 12:15:21,217][model8_pretrain.py][INFO] Epoch:[0/2](854000/4588595) loss:2.490 lr:0.0000100 epoch_Time:23770.0min: [2024-01-06 12:15:21,217][model8_pretrain.py][INFO] 
Epoch:[0/2](854000/4588595) loss:3.160 lr:0.0000100 epoch_Time:23770.0min: [2024-01-06 12:15:21,217][model8_pretrain.py][INFO] Epoch:[0/2](854000/4588595) loss:1.979 lr:0.0000100 epoch_Time:23770.0min: [2024-01-06 12:15:21,217][model8_pretrain.py][INFO] Epoch:[0/2](854000/4588595) loss:2.890 lr:0.0000100 epoch_Time:23770.0min: [2024-01-06 12:15:58,159][model8_pretrain.py][INFO] Epoch:[0/2](854100/4588595) loss:3.092 lr:0.0000100 epoch_Time:23769.0min: [2024-01-06 12:15:58,159][model8_pretrain.py][INFO] Epoch:[0/2](854100/4588595) loss:3.012 lr:0.0000100 epoch_Time:23769.0min: [2024-01-06 12:15:58,159][model8_pretrain.py][INFO] Epoch:[0/2](854100/4588595) loss:2.851 lr:0.0000100 epoch_Time:23769.0min: [2024-01-06 12:15:58,159][model8_pretrain.py][INFO] Epoch:[0/2](854100/4588595) loss:3.080 lr:0.0000100 epoch_Time:23769.0min: [2024-01-06 12:15:58,159][model8_pretrain.py][INFO] Epoch:[0/2](854100/4588595) loss:3.215 lr:0.0000100 epoch_Time:23769.0min: [2024-01-06 12:15:58,159][model8_pretrain.py][INFO] Epoch:[0/2](854100/4588595) loss:2.710 lr:0.0000100 epoch_Time:23769.0min: [2024-01-06 12:15:58,160][model8_pretrain.py][INFO] Epoch:[0/2](854100/4588595) loss:2.704 lr:0.0000100 epoch_Time:23769.0min: [2024-01-06 12:15:58,160][model8_pretrain.py][INFO] Epoch:[0/2](854100/4588595) loss:2.944 lr:0.0000100 epoch_Time:23769.0min: [2024-01-06 12:16:42,044][model8_pretrain.py][INFO] Epoch:[0/2](854200/4588595) loss:2.851 lr:0.0000100 epoch_Time:23769.0min: [2024-01-06 12:16:42,044][model8_pretrain.py][INFO] Epoch:[0/2](854200/4588595) loss:3.130 lr:0.0000100 epoch_Time:23769.0min: [2024-01-06 12:16:42,044][model8_pretrain.py][INFO] Epoch:[0/2](854200/4588595) loss:3.312 lr:0.0000100 epoch_Time:23769.0min: [2024-01-06 12:16:42,044][model8_pretrain.py][INFO] Epoch:[0/2](854200/4588595) loss:2.437 lr:0.0000100 epoch_Time:23769.0min: [2024-01-06 12:16:42,044][model8_pretrain.py][INFO] Epoch:[0/2](854200/4588595) loss:2.768 lr:0.0000100 epoch_Time:23769.0min: [2024-01-06 
12:16:42,044][model8_pretrain.py][INFO] Epoch:[0/2](854200/4588595) loss:3.089 lr:0.0000100 epoch_Time:23769.0min: [2024-01-06 12:16:42,044][model8_pretrain.py][INFO] Epoch:[0/2](854200/4588595) loss:2.908 lr:0.0000100 epoch_Time:23769.0min: [2024-01-06 12:16:42,045][model8_pretrain.py][INFO] Epoch:[0/2](854200/4588595) loss:2.367 lr:0.0000100 epoch_Time:23769.0min: [2024-01-06 12:17:22,447][model8_pretrain.py][INFO] Epoch:[0/2](854300/4588595) loss:2.539 lr:0.0000100 epoch_Time:23769.0min: [2024-01-06 12:17:22,447][model8_pretrain.py][INFO] Epoch:[0/2](854300/4588595) loss:3.152 lr:0.0000100 epoch_Time:23769.0min: [2024-01-06 12:17:22,447][model8_pretrain.py][INFO] Epoch:[0/2](854300/4588595) loss:2.730 lr:0.0000100 epoch_Time:23769.0min: [2024-01-06 12:17:22,447][model8_pretrain.py][INFO] Epoch:[0/2](854300/4588595) loss:2.230 lr:0.0000100 epoch_Time:23769.0min: [2024-01-06 12:17:22,447][model8_pretrain.py][INFO] Epoch:[0/2](854300/4588595) loss:2.621 lr:0.0000100 epoch_Time:23769.0min: [2024-01-06 12:17:22,447][model8_pretrain.py][INFO] Epoch:[0/2](854300/4588595) loss:2.268 lr:0.0000100 epoch_Time:23769.0min: [2024-01-06 12:17:22,447][model8_pretrain.py][INFO] Epoch:[0/2](854300/4588595) loss:2.645 lr:0.0000100 epoch_Time:23769.0min: [2024-01-06 12:17:22,447][model8_pretrain.py][INFO] Epoch:[0/2](854300/4588595) loss:2.438 lr:0.0000100 epoch_Time:23769.0min: [2024-01-06 12:17:59,387][model8_pretrain.py][INFO] Epoch:[0/2](854400/4588595) loss:2.997 lr:0.0000100 epoch_Time:23768.0min: [2024-01-06 12:17:59,387][model8_pretrain.py][INFO] Epoch:[0/2](854400/4588595) loss:2.731 lr:0.0000100 epoch_Time:23768.0min: [2024-01-06 12:17:59,387][model8_pretrain.py][INFO] Epoch:[0/2](854400/4588595) loss:2.734 lr:0.0000100 epoch_Time:23768.0min: [2024-01-06 12:17:59,387][model8_pretrain.py][INFO] Epoch:[0/2](854400/4588595) loss:2.004 lr:0.0000100 epoch_Time:23768.0min: [2024-01-06 12:17:59,387][model8_pretrain.py][INFO] Epoch:[0/2](854400/4588595) loss:3.043 lr:0.0000100 
epoch_Time:23768.0min: [2024-01-06 12:17:59,387][model8_pretrain.py][INFO] Epoch:[0/2](854400/4588595) loss:2.900 lr:0.0000100 epoch_Time:23768.0min: [2024-01-06 12:17:59,387][model8_pretrain.py][INFO] Epoch:[0/2](854400/4588595) loss:2.399 lr:0.0000100 epoch_Time:23768.0min: [2024-01-06 12:17:59,388][model8_pretrain.py][INFO] Epoch:[0/2](854400/4588595) loss:2.646 lr:0.0000100 epoch_Time:23768.0min: [2024-01-06 12:18:36,331][model8_pretrain.py][INFO] Epoch:[0/2](854500/4588595) loss:2.419 lr:0.0000100 epoch_Time:23767.0min: [2024-01-06 12:18:36,331][model8_pretrain.py][INFO] Epoch:[0/2](854500/4588595) loss:3.127 lr:0.0000100 epoch_Time:23767.0min: [2024-01-06 12:18:36,331][model8_pretrain.py][INFO] Epoch:[0/2](854500/4588595) loss:2.712 lr:0.0000100 epoch_Time:23767.0min: [2024-01-06 12:18:36,331][model8_pretrain.py][INFO] Epoch:[0/2](854500/4588595) loss:2.819 lr:0.0000100 epoch_Time:23767.0min: [2024-01-06 12:18:36,331][model8_pretrain.py][INFO] Epoch:[0/2](854500/4588595) loss:2.855 lr:0.0000100 epoch_Time:23767.0min: [2024-01-06 12:18:36,331][model8_pretrain.py][INFO] Epoch:[0/2](854500/4588595) loss:2.988 lr:0.0000100 epoch_Time:23767.0min: [2024-01-06 12:18:36,331][model8_pretrain.py][INFO] Epoch:[0/2](854500/4588595) loss:3.040 lr:0.0000100 epoch_Time:23767.0min: [2024-01-06 12:18:36,331][model8_pretrain.py][INFO] Epoch:[0/2](854500/4588595) loss:2.870 lr:0.0000100 epoch_Time:23767.0min: [2024-01-06 12:19:13,287][model8_pretrain.py][INFO] Epoch:[0/2](854600/4588595) loss:2.568 lr:0.0000100 epoch_Time:23766.0min: [2024-01-06 12:19:13,287][model8_pretrain.py][INFO] Epoch:[0/2](854600/4588595) loss:2.779 lr:0.0000100 epoch_Time:23766.0min: [2024-01-06 12:19:13,287][model8_pretrain.py][INFO] Epoch:[0/2](854600/4588595) loss:2.592 lr:0.0000100 epoch_Time:23766.0min: [2024-01-06 12:19:13,287][model8_pretrain.py][INFO] Epoch:[0/2](854600/4588595) loss:2.666 lr:0.0000100 epoch_Time:23766.0min: [2024-01-06 12:19:13,287][model8_pretrain.py][INFO] 
Epoch:[0/2](854600/4588595) loss:2.472 lr:0.0000100 epoch_Time:23766.0min: [2024-01-06 12:19:13,287][model8_pretrain.py][INFO] Epoch:[0/2](854600/4588595) loss:2.940 lr:0.0000100 epoch_Time:23766.0min: [2024-01-06 12:19:13,287][model8_pretrain.py][INFO] Epoch:[0/2](854600/4588595) loss:2.961 lr:0.0000100 epoch_Time:23766.0min: [2024-01-06 12:19:13,287][model8_pretrain.py][INFO] Epoch:[0/2](854600/4588595) loss:2.696 lr:0.0000100 epoch_Time:23766.0min: [2024-01-06 12:19:50,234][model8_pretrain.py][INFO] Epoch:[0/2](854700/4588595) loss:2.920 lr:0.0000100 epoch_Time:23765.0min: [2024-01-06 12:19:50,234][model8_pretrain.py][INFO] Epoch:[0/2](854700/4588595) loss:2.580 lr:0.0000100 epoch_Time:23765.0min: [2024-01-06 12:19:50,234][model8_pretrain.py][INFO] Epoch:[0/2](854700/4588595) loss:2.555 lr:0.0000100 epoch_Time:23765.0min: [2024-01-06 12:19:50,234][model8_pretrain.py][INFO] Epoch:[0/2](854700/4588595) loss:2.826 lr:0.0000100 epoch_Time:23765.0min: [2024-01-06 12:19:50,234][model8_pretrain.py][INFO] Epoch:[0/2](854700/4588595) loss:3.091 lr:0.0000100 epoch_Time:23765.0min: [2024-01-06 12:19:50,234][model8_pretrain.py][INFO] Epoch:[0/2](854700/4588595) loss:2.711 lr:0.0000100 epoch_Time:23765.0min: [2024-01-06 12:19:50,234][model8_pretrain.py][INFO] Epoch:[0/2](854700/4588595) loss:2.839 lr:0.0000100 epoch_Time:23765.0min: [2024-01-06 12:19:50,234][model8_pretrain.py][INFO] Epoch:[0/2](854700/4588595) loss:3.014 lr:0.0000100 epoch_Time:23765.0min: [2024-01-06 12:20:27,212][model8_pretrain.py][INFO] Epoch:[0/2](854800/4588595) loss:2.948 lr:0.0000100 epoch_Time:23765.0min: [2024-01-06 12:20:27,212][model8_pretrain.py][INFO] Epoch:[0/2](854800/4588595) loss:2.834 lr:0.0000100 epoch_Time:23765.0min: [2024-01-06 12:20:27,212][model8_pretrain.py][INFO] Epoch:[0/2](854800/4588595) loss:2.598 lr:0.0000100 epoch_Time:23765.0min: [2024-01-06 12:20:27,212][model8_pretrain.py][INFO] Epoch:[0/2](854800/4588595) loss:2.704 lr:0.0000100 epoch_Time:23765.0min: [2024-01-06 
12:20:27,212][model8_pretrain.py][INFO] Epoch:[0/2](854800/4588595) loss:2.590 lr:0.0000100 epoch_Time:23765.0min: [2024-01-06 12:20:27,212][model8_pretrain.py][INFO] Epoch:[0/2](854800/4588595) loss:2.798 lr:0.0000100 epoch_Time:23765.0min: [2024-01-06 12:20:27,212][model8_pretrain.py][INFO] Epoch:[0/2](854800/4588595) loss:2.943 lr:0.0000100 epoch_Time:23765.0min: [2024-01-06 12:20:27,212][model8_pretrain.py][INFO] Epoch:[0/2](854800/4588595) loss:2.293 lr:0.0000100 epoch_Time:23765.0min: [2024-01-06 12:21:04,172][model8_pretrain.py][INFO] Epoch:[0/2](854900/4588595) loss:3.135 lr:0.0000100 epoch_Time:23764.0min: [2024-01-06 12:21:04,172][model8_pretrain.py][INFO] Epoch:[0/2](854900/4588595) loss:3.004 lr:0.0000100 epoch_Time:23764.0min: [2024-01-06 12:21:04,172][model8_pretrain.py][INFO] Epoch:[0/2](854900/4588595) loss:2.757 lr:0.0000100 epoch_Time:23764.0min: [2024-01-06 12:21:04,172][model8_pretrain.py][INFO] Epoch:[0/2](854900/4588595) loss:3.058 lr:0.0000100 epoch_Time:23764.0min: [2024-01-06 12:21:04,172][model8_pretrain.py][INFO] Epoch:[0/2](854900/4588595) loss:2.443 lr:0.0000100 epoch_Time:23764.0min: [2024-01-06 12:21:04,172][model8_pretrain.py][INFO] Epoch:[0/2](854900/4588595) loss:2.491 lr:0.0000100 epoch_Time:23764.0min: [2024-01-06 12:21:04,172][model8_pretrain.py][INFO] Epoch:[0/2](854900/4588595) loss:2.704 lr:0.0000100 epoch_Time:23764.0min: [2024-01-06 12:21:04,172][model8_pretrain.py][INFO] Epoch:[0/2](854900/4588595) loss:2.314 lr:0.0000100 epoch_Time:23764.0min: [2024-01-06 12:21:48,071][model8_pretrain.py][INFO] Epoch:[0/2](855000/4588595) loss:2.089 lr:0.0000100 epoch_Time:23764.0min: [2024-01-06 12:21:48,071][model8_pretrain.py][INFO] Epoch:[0/2](855000/4588595) loss:3.240 lr:0.0000100 epoch_Time:23764.0min: [2024-01-06 12:21:48,071][model8_pretrain.py][INFO] Epoch:[0/2](855000/4588595) loss:3.318 lr:0.0000100 epoch_Time:23764.0min: [2024-01-06 12:21:48,071][model8_pretrain.py][INFO] Epoch:[0/2](855000/4588595) loss:2.904 lr:0.0000100 
epoch_Time:23764.0min: [2024-01-06 12:21:48,071][model8_pretrain.py][INFO] Epoch:[0/2](855000/4588595) loss:2.089 lr:0.0000100 epoch_Time:23764.0min: [2024-01-06 12:21:48,071][model8_pretrain.py][INFO] Epoch:[0/2](855000/4588595) loss:2.788 lr:0.0000100 epoch_Time:23764.0min: [2024-01-06 12:21:48,071][model8_pretrain.py][INFO] Epoch:[0/2](855000/4588595) loss:2.544 lr:0.0000100 epoch_Time:23764.0min: [2024-01-06 12:21:48,071][model8_pretrain.py][INFO] Epoch:[0/2](855000/4588595) loss:2.376 lr:0.0000100 epoch_Time:23764.0min: [2024-01-06 12:22:28,485][model8_pretrain.py][INFO] Epoch:[0/2](855100/4588595) loss:2.403 lr:0.0000100 epoch_Time:23764.0min: [2024-01-06 12:22:28,485][model8_pretrain.py][INFO] Epoch:[0/2](855100/4588595) loss:3.331 lr:0.0000100 epoch_Time:23764.0min: [2024-01-06 12:22:28,485][model8_pretrain.py][INFO] Epoch:[0/2](855100/4588595) loss:2.833 lr:0.0000100 epoch_Time:23764.0min: [2024-01-06 12:22:28,485][model8_pretrain.py][INFO] Epoch:[0/2](855100/4588595) loss:2.699 lr:0.0000100 epoch_Time:23764.0min: [2024-01-06 12:22:28,485][model8_pretrain.py][INFO] Epoch:[0/2](855100/4588595) loss:3.155 lr:0.0000100 epoch_Time:23764.0min: [2024-01-06 12:22:28,485][model8_pretrain.py][INFO] Epoch:[0/2](855100/4588595) loss:3.064 lr:0.0000100 epoch_Time:23764.0min: [2024-01-06 12:22:28,485][model8_pretrain.py][INFO] Epoch:[0/2](855100/4588595) loss:2.897 lr:0.0000100 epoch_Time:23764.0min: [2024-01-06 12:22:28,485][model8_pretrain.py][INFO] Epoch:[0/2](855100/4588595) loss:2.600 lr:0.0000100 epoch_Time:23764.0min: [2024-01-06 12:23:05,425][model8_pretrain.py][INFO] Epoch:[0/2](855200/4588595) loss:2.807 lr:0.0000100 epoch_Time:23763.0min: [2024-01-06 12:23:05,425][model8_pretrain.py][INFO] Epoch:[0/2](855200/4588595) loss:2.923 lr:0.0000100 epoch_Time:23763.0min: [2024-01-06 12:23:05,425][model8_pretrain.py][INFO] Epoch:[0/2](855200/4588595) loss:3.178 lr:0.0000100 epoch_Time:23763.0min: [2024-01-06 12:23:05,425][model8_pretrain.py][INFO] 
Epoch:[0/2](855200/4588595) loss:3.234 lr:0.0000100 epoch_Time:23763.0min: [2024-01-06 12:23:05,425][model8_pretrain.py][INFO] Epoch:[0/2](855200/4588595) loss:2.619 lr:0.0000100 epoch_Time:23763.0min: [2024-01-06 12:23:05,425][model8_pretrain.py][INFO] Epoch:[0/2](855200/4588595) loss:3.174 lr:0.0000100 epoch_Time:23763.0min: [2024-01-06 12:23:05,425][model8_pretrain.py][INFO] Epoch:[0/2](855200/4588595) loss:2.926 lr:0.0000100 epoch_Time:23763.0min: [2024-01-06 12:23:05,425][model8_pretrain.py][INFO] Epoch:[0/2](855200/4588595) loss:2.521 lr:0.0000100 epoch_Time:23763.0min: [2024-01-06 12:23:42,363][model8_pretrain.py][INFO] Epoch:[0/2](855300/4588595) loss:2.415 lr:0.0000100 epoch_Time:23762.0min: [2024-01-06 12:23:42,364][model8_pretrain.py][INFO] Epoch:[0/2](855300/4588595) loss:2.634 lr:0.0000100 epoch_Time:23762.0min: [2024-01-06 12:23:42,364][model8_pretrain.py][INFO] Epoch:[0/2](855300/4588595) loss:2.222 lr:0.0000100 epoch_Time:23762.0min: [2024-01-06 12:23:42,364][model8_pretrain.py][INFO] Epoch:[0/2](855300/4588595) loss:2.815 lr:0.0000100 epoch_Time:23762.0min: [2024-01-06 12:23:42,364][model8_pretrain.py][INFO] Epoch:[0/2](855300/4588595) loss:2.274 lr:0.0000100 epoch_Time:23762.0min: [2024-01-06 12:23:42,364][model8_pretrain.py][INFO] Epoch:[0/2](855300/4588595) loss:2.824 lr:0.0000100 epoch_Time:23762.0min: [2024-01-06 12:23:42,364][model8_pretrain.py][INFO] Epoch:[0/2](855300/4588595) loss:3.159 lr:0.0000100 epoch_Time:23762.0min: [2024-01-06 12:23:42,364][model8_pretrain.py][INFO] Epoch:[0/2](855300/4588595) loss:2.463 lr:0.0000100 epoch_Time:23762.0min: [2024-01-06 12:24:19,305][model8_pretrain.py][INFO] Epoch:[0/2](855400/4588595) loss:2.650 lr:0.0000100 epoch_Time:23761.0min: [2024-01-06 12:24:19,305][model8_pretrain.py][INFO] Epoch:[0/2](855400/4588595) loss:2.903 lr:0.0000100 epoch_Time:23761.0min: [2024-01-06 12:24:19,305][model8_pretrain.py][INFO] Epoch:[0/2](855400/4588595) loss:3.015 lr:0.0000100 epoch_Time:23761.0min: [2024-01-06 
12:24:19,305][model8_pretrain.py][INFO] Epoch:[0/2](855400/4588595) loss:2.810 lr:0.0000100 epoch_Time:23761.0min: [2024-01-06 12:24:19,305][model8_pretrain.py][INFO] Epoch:[0/2](855400/4588595) loss:3.095 lr:0.0000100 epoch_Time:23761.0min: [2024-01-06 12:24:19,305][model8_pretrain.py][INFO] Epoch:[0/2](855400/4588595) loss:3.092 lr:0.0000100 epoch_Time:23761.0min: [2024-01-06 12:24:19,305][model8_pretrain.py][INFO] Epoch:[0/2](855400/4588595) loss:3.172 lr:0.0000100 epoch_Time:23761.0min: [2024-01-06 12:24:19,305][model8_pretrain.py][INFO] Epoch:[0/2](855400/4588595) loss:2.613 lr:0.0000100 epoch_Time:23761.0min: [2024-01-06 12:24:56,245][model8_pretrain.py][INFO] Epoch:[0/2](855500/4588595) loss:2.513 lr:0.0000100 epoch_Time:23760.0min: [2024-01-06 12:24:56,245][model8_pretrain.py][INFO] Epoch:[0/2](855500/4588595) loss:2.694 lr:0.0000100 epoch_Time:23760.0min: [2024-01-06 12:24:56,245][model8_pretrain.py][INFO] Epoch:[0/2](855500/4588595) loss:2.783 lr:0.0000100 epoch_Time:23760.0min: [2024-01-06 12:24:56,245][model8_pretrain.py][INFO] Epoch:[0/2](855500/4588595) loss:3.190 lr:0.0000100 epoch_Time:23760.0min: [2024-01-06 12:24:56,245][model8_pretrain.py][INFO] Epoch:[0/2](855500/4588595) loss:2.796 lr:0.0000100 epoch_Time:23760.0min: [2024-01-06 12:24:56,245][model8_pretrain.py][INFO] Epoch:[0/2](855500/4588595) loss:3.007 lr:0.0000100 epoch_Time:23760.0min: [2024-01-06 12:24:56,245][model8_pretrain.py][INFO] Epoch:[0/2](855500/4588595) loss:3.050 lr:0.0000100 epoch_Time:23760.0min: [2024-01-06 12:24:56,245][model8_pretrain.py][INFO] Epoch:[0/2](855500/4588595) loss:3.084 lr:0.0000100 epoch_Time:23760.0min: [2024-01-06 12:25:33,197][model8_pretrain.py][INFO] Epoch:[0/2](855600/4588595) loss:2.896 lr:0.0000100 epoch_Time:23760.0min: [2024-01-06 12:25:33,197][model8_pretrain.py][INFO] Epoch:[0/2](855600/4588595) loss:2.867 lr:0.0000100 epoch_Time:23760.0min: [2024-01-06 12:25:33,197][model8_pretrain.py][INFO] Epoch:[0/2](855600/4588595) loss:2.695 lr:0.0000100 
epoch_Time:23760.0min: [2024-01-06 12:25:33,197][model8_pretrain.py][INFO] Epoch:[0/2](855600/4588595) loss:2.803 lr:0.0000100 epoch_Time:23760.0min: [2024-01-06 12:25:33,197][model8_pretrain.py][INFO] Epoch:[0/2](855600/4588595) loss:3.006 lr:0.0000100 epoch_Time:23760.0min: [2024-01-06 12:25:33,197][model8_pretrain.py][INFO] Epoch:[0/2](855600/4588595) loss:2.852 lr:0.0000100 epoch_Time:23760.0min: [2024-01-06 12:25:33,197][model8_pretrain.py][INFO] Epoch:[0/2](855600/4588595) loss:2.885 lr:0.0000100 epoch_Time:23760.0min: [2024-01-06 12:25:33,198][model8_pretrain.py][INFO] Epoch:[0/2](855600/4588595) loss:2.628 lr:0.0000100 epoch_Time:23760.0min: [2024-01-06 12:26:10,141][model8_pretrain.py][INFO] Epoch:[0/2](855700/4588595) loss:2.212 lr:0.0000100 epoch_Time:23759.0min: [2024-01-06 12:26:10,141][model8_pretrain.py][INFO] Epoch:[0/2](855700/4588595) loss:2.797 lr:0.0000100 epoch_Time:23759.0min: [2024-01-06 12:26:10,141][model8_pretrain.py][INFO] Epoch:[0/2](855700/4588595) loss:2.921 lr:0.0000100 epoch_Time:23759.0min: [2024-01-06 12:26:10,141][model8_pretrain.py][INFO] Epoch:[0/2](855700/4588595) loss:2.869 lr:0.0000100 epoch_Time:23759.0min: [2024-01-06 12:26:10,141][model8_pretrain.py][INFO] Epoch:[0/2](855700/4588595) loss:3.067 lr:0.0000100 epoch_Time:23759.0min: [2024-01-06 12:26:10,141][model8_pretrain.py][INFO] Epoch:[0/2](855700/4588595) loss:3.158 lr:0.0000100 epoch_Time:23759.0min: [2024-01-06 12:26:10,141][model8_pretrain.py][INFO] Epoch:[0/2](855700/4588595) loss:2.750 lr:0.0000100 epoch_Time:23759.0min: [2024-01-06 12:26:10,142][model8_pretrain.py][INFO] Epoch:[0/2](855700/4588595) loss:2.659 lr:0.0000100 epoch_Time:23759.0min: [2024-01-06 12:26:53,992][model8_pretrain.py][INFO] Epoch:[0/2](855800/4588595) loss:2.426 lr:0.0000100 epoch_Time:23759.0min: [2024-01-06 12:26:53,992][model8_pretrain.py][INFO] Epoch:[0/2](855800/4588595) loss:2.654 lr:0.0000100 epoch_Time:23759.0min: [2024-01-06 12:26:53,997][model8_pretrain.py][INFO] 
Epoch:[0/2](855800/4588595) loss:3.115 lr:0.0000100 epoch_Time:23759.0min: [2024-01-06 12:26:53,997][model8_pretrain.py][INFO] Epoch:[0/2](855800/4588595) loss:2.428 lr:0.0000100 epoch_Time:23759.0min: [2024-01-06 12:26:53,997][model8_pretrain.py][INFO] Epoch:[0/2](855800/4588595) loss:2.209 lr:0.0000100 epoch_Time:23759.0min: [2024-01-06 12:26:53,997][model8_pretrain.py][INFO] Epoch:[0/2](855800/4588595) loss:2.805 lr:0.0000100 epoch_Time:23759.0min: [2024-01-06 12:26:53,997][model8_pretrain.py][INFO] Epoch:[0/2](855800/4588595) loss:2.889 lr:0.0000100 epoch_Time:23759.0min: [2024-01-06 12:26:53,997][model8_pretrain.py][INFO] Epoch:[0/2](855800/4588595) loss:2.654 lr:0.0000100 epoch_Time:23759.0min: [2024-01-06 12:27:34,448][model8_pretrain.py][INFO] Epoch:[0/2](855900/4588595) loss:2.591 lr:0.0000100 epoch_Time:23759.0min: [2024-01-06 12:27:34,448][model8_pretrain.py][INFO] Epoch:[0/2](855900/4588595) loss:2.606 lr:0.0000100 epoch_Time:23759.0min: [2024-01-06 12:27:34,448][model8_pretrain.py][INFO] Epoch:[0/2](855900/4588595) loss:2.788 lr:0.0000100 epoch_Time:23759.0min: [2024-01-06 12:27:34,448][model8_pretrain.py][INFO] Epoch:[0/2](855900/4588595) loss:2.442 lr:0.0000100 epoch_Time:23759.0min: [2024-01-06 12:27:34,449][model8_pretrain.py][INFO] Epoch:[0/2](855900/4588595) loss:2.813 lr:0.0000100 epoch_Time:23759.0min: [2024-01-06 12:27:34,449][model8_pretrain.py][INFO] Epoch:[0/2](855900/4588595) loss:2.806 lr:0.0000100 epoch_Time:23759.0min: [2024-01-06 12:27:34,449][model8_pretrain.py][INFO] Epoch:[0/2](855900/4588595) loss:2.782 lr:0.0000100 epoch_Time:23759.0min: [2024-01-06 12:27:34,449][model8_pretrain.py][INFO] Epoch:[0/2](855900/4588595) loss:2.906 lr:0.0000100 epoch_Time:23759.0min: [2024-01-06 12:28:11,392][model8_pretrain.py][INFO] Epoch:[0/2](856000/4588595) loss:2.415 lr:0.0000100 epoch_Time:23758.0min: [2024-01-06 12:28:11,392][model8_pretrain.py][INFO] Epoch:[0/2](856000/4588595) loss:2.741 lr:0.0000100 epoch_Time:23758.0min: [2024-01-06 
12:28:11,392][model8_pretrain.py][INFO] Epoch:[0/2](856000/4588595) loss:3.205 lr:0.0000100 epoch_Time:23758.0min: [2024-01-06 12:28:11,392][model8_pretrain.py][INFO] Epoch:[0/2](856000/4588595) loss:2.342 lr:0.0000100 epoch_Time:23758.0min: [2024-01-06 12:28:11,392][model8_pretrain.py][INFO] Epoch:[0/2](856000/4588595) loss:2.637 lr:0.0000100 epoch_Time:23758.0min: [2024-01-06 12:28:11,392][model8_pretrain.py][INFO] Epoch:[0/2](856000/4588595) loss:3.054 lr:0.0000100 epoch_Time:23758.0min: [2024-01-06 12:28:11,394][model8_pretrain.py][INFO] Epoch:[0/2](856000/4588595) loss:3.004 lr:0.0000100 epoch_Time:23758.0min: [2024-01-06 12:28:11,394][model8_pretrain.py][INFO] Epoch:[0/2](856000/4588595) loss:2.798 lr:0.0000100 epoch_Time:23758.0min: [2024-01-06 12:28:48,306][model8_pretrain.py][INFO] Epoch:[0/2](856100/4588595) loss:2.657 lr:0.0000100 epoch_Time:23757.0min: [2024-01-06 12:28:48,306][model8_pretrain.py][INFO] Epoch:[0/2](856100/4588595) loss:3.035 lr:0.0000100 epoch_Time:23757.0min: [2024-01-06 12:28:48,306][model8_pretrain.py][INFO] Epoch:[0/2](856100/4588595) loss:2.938 lr:0.0000100 epoch_Time:23757.0min: [2024-01-06 12:28:48,306][model8_pretrain.py][INFO] Epoch:[0/2](856100/4588595) loss:2.579 lr:0.0000100 epoch_Time:23757.0min: [2024-01-06 12:28:48,306][model8_pretrain.py][INFO] Epoch:[0/2](856100/4588595) loss:2.771 lr:0.0000100 epoch_Time:23757.0min: [2024-01-06 12:28:48,306][model8_pretrain.py][INFO] Epoch:[0/2](856100/4588595) loss:2.907 lr:0.0000100 epoch_Time:23757.0min: [2024-01-06 12:28:48,306][model8_pretrain.py][INFO] Epoch:[0/2](856100/4588595) loss:2.813 lr:0.0000100 epoch_Time:23757.0min: [2024-01-06 12:28:48,306][model8_pretrain.py][INFO] Epoch:[0/2](856100/4588595) loss:3.271 lr:0.0000100 epoch_Time:23757.0min: [2024-01-06 12:29:25,237][model8_pretrain.py][INFO] Epoch:[0/2](856200/4588595) loss:3.655 lr:0.0000100 epoch_Time:23756.0min: [2024-01-06 12:29:25,237][model8_pretrain.py][INFO] Epoch:[0/2](856200/4588595) loss:2.265 lr:0.0000100 
epoch_Time:23756.0min: [2024-01-06 12:29:25,237][model8_pretrain.py][INFO] Epoch:[0/2](856200/4588595) loss:2.486 lr:0.0000100 epoch_Time:23756.0min: [2024-01-06 12:29:25,237][model8_pretrain.py][INFO] Epoch:[0/2](856200/4588595) loss:3.038 lr:0.0000100 epoch_Time:23756.0min: [2024-01-06 12:29:25,237][model8_pretrain.py][INFO] Epoch:[0/2](856200/4588595) loss:2.954 lr:0.0000100 epoch_Time:23756.0min: [2024-01-06 12:29:25,237][model8_pretrain.py][INFO] Epoch:[0/2](856200/4588595) loss:2.868 lr:0.0000100 epoch_Time:23756.0min: [2024-01-06 12:29:25,238][model8_pretrain.py][INFO] Epoch:[0/2](856200/4588595) loss:2.260 lr:0.0000100 epoch_Time:23756.0min: [2024-01-06 12:29:25,238][model8_pretrain.py][INFO] Epoch:[0/2](856200/4588595) loss:2.789 lr:0.0000100 epoch_Time:23756.0min: [2024-01-06 12:30:02,178][model8_pretrain.py][INFO] Epoch:[0/2](856300/4588595) loss:2.712 lr:0.0000100 epoch_Time:23755.0min: [2024-01-06 12:30:02,178][model8_pretrain.py][INFO] Epoch:[0/2](856300/4588595) loss:2.768 lr:0.0000100 epoch_Time:23755.0min: [2024-01-06 12:30:02,178][model8_pretrain.py][INFO] Epoch:[0/2](856300/4588595) loss:2.658 lr:0.0000100 epoch_Time:23755.0min: [2024-01-06 12:30:02,178][model8_pretrain.py][INFO] Epoch:[0/2](856300/4588595) loss:3.048 lr:0.0000100 epoch_Time:23755.0min: [2024-01-06 12:30:02,178][model8_pretrain.py][INFO] Epoch:[0/2](856300/4588595) loss:2.855 lr:0.0000100 epoch_Time:23755.0min: [2024-01-06 12:30:02,178][model8_pretrain.py][INFO] Epoch:[0/2](856300/4588595) loss:2.822 lr:0.0000100 epoch_Time:23755.0min: [2024-01-06 12:30:02,178][model8_pretrain.py][INFO] Epoch:[0/2](856300/4588595) loss:2.514 lr:0.0000100 epoch_Time:23755.0min: [2024-01-06 12:30:02,178][model8_pretrain.py][INFO] Epoch:[0/2](856300/4588595) loss:2.655 lr:0.0000100 epoch_Time:23755.0min: [2024-01-06 12:30:39,119][model8_pretrain.py][INFO] Epoch:[0/2](856400/4588595) loss:2.410 lr:0.0000100 epoch_Time:23755.0min: [2024-01-06 12:30:39,119][model8_pretrain.py][INFO] 
Epoch:[0/2](856400/4588595) loss:2.608 lr:0.0000100 epoch_Time:23755.0min: [2024-01-06 12:30:39,119][model8_pretrain.py][INFO] Epoch:[0/2](856400/4588595) loss:2.764 lr:0.0000100 epoch_Time:23755.0min: [2024-01-06 12:30:39,119][model8_pretrain.py][INFO] Epoch:[0/2](856400/4588595) loss:3.021 lr:0.0000100 epoch_Time:23755.0min: [2024-01-06 12:30:39,119][model8_pretrain.py][INFO] Epoch:[0/2](856400/4588595) loss:3.232 lr:0.0000100 epoch_Time:23755.0min: [2024-01-06 12:30:39,119][model8_pretrain.py][INFO] Epoch:[0/2](856400/4588595) loss:2.980 lr:0.0000100 epoch_Time:23755.0min: [2024-01-06 12:30:39,119][model8_pretrain.py][INFO] Epoch:[0/2](856400/4588595) loss:2.936 lr:0.0000100 epoch_Time:23755.0min: [2024-01-06 12:30:39,119][model8_pretrain.py][INFO] Epoch:[0/2](856400/4588595) loss:2.804 lr:0.0000100 epoch_Time:23755.0min: [2024-01-06 12:31:16,047][model8_pretrain.py][INFO] Epoch:[0/2](856500/4588595) loss:2.578 lr:0.0000100 epoch_Time:23754.0min: [2024-01-06 12:31:16,047][model8_pretrain.py][INFO] Epoch:[0/2](856500/4588595) loss:2.852 lr:0.0000100 epoch_Time:23754.0min: [2024-01-06 12:31:16,047][model8_pretrain.py][INFO] Epoch:[0/2](856500/4588595) loss:2.837 lr:0.0000100 epoch_Time:23754.0min: [2024-01-06 12:31:16,047][model8_pretrain.py][INFO] Epoch:[0/2](856500/4588595) loss:2.723 lr:0.0000100 epoch_Time:23754.0min: [2024-01-06 12:31:16,047][model8_pretrain.py][INFO] Epoch:[0/2](856500/4588595) loss:2.705 lr:0.0000100 epoch_Time:23754.0min: [2024-01-06 12:31:16,047][model8_pretrain.py][INFO] Epoch:[0/2](856500/4588595) loss:3.190 lr:0.0000100 epoch_Time:23754.0min: [2024-01-06 12:31:16,047][model8_pretrain.py][INFO] Epoch:[0/2](856500/4588595) loss:2.374 lr:0.0000100 epoch_Time:23754.0min: [2024-01-06 12:31:16,047][model8_pretrain.py][INFO] Epoch:[0/2](856500/4588595) loss:2.805 lr:0.0000100 epoch_Time:23754.0min: [2024-01-06 12:31:58,248][model8_pretrain.py][INFO] Epoch:[0/2](856600/4588595) loss:2.733 lr:0.0000100 epoch_Time:23753.0min: [2024-01-06 
12:31:58,248][model8_pretrain.py][INFO] Epoch:[0/2](856600/4588595) loss:2.238 lr:0.0000100 epoch_Time:23753.0min: [2024-01-06 12:31:58,248][model8_pretrain.py][INFO] Epoch:[0/2](856600/4588595) loss:2.735 lr:0.0000100 epoch_Time:23753.0min: [2024-01-06 12:31:58,248][model8_pretrain.py][INFO] Epoch:[0/2](856600/4588595) loss:2.777 lr:0.0000100 epoch_Time:23753.0min: [2024-01-06 12:31:58,248][model8_pretrain.py][INFO] Epoch:[0/2](856600/4588595) loss:2.504 lr:0.0000100 epoch_Time:23753.0min: [2024-01-06 12:31:58,249][model8_pretrain.py][INFO] Epoch:[0/2](856600/4588595) loss:2.048 lr:0.0000100 epoch_Time:23753.0min: [2024-01-06 12:31:58,249][model8_pretrain.py][INFO] Epoch:[0/2](856600/4588595) loss:3.113 lr:0.0000100 epoch_Time:23753.0min: [2024-01-06 12:31:58,249][model8_pretrain.py][INFO] Epoch:[0/2](856600/4588595) loss:2.804 lr:0.0000100 epoch_Time:23753.0min: [2024-01-06 12:32:40,370][model8_pretrain.py][INFO] Epoch:[0/2](856700/4588595) loss:2.316 lr:0.0000100 epoch_Time:23754.0min: [2024-01-06 12:32:40,370][model8_pretrain.py][INFO] Epoch:[0/2](856700/4588595) loss:3.225 lr:0.0000100 epoch_Time:23754.0min: [2024-01-06 12:32:40,370][model8_pretrain.py][INFO] Epoch:[0/2](856700/4588595) loss:2.726 lr:0.0000100 epoch_Time:23754.0min: [2024-01-06 12:32:40,370][model8_pretrain.py][INFO] Epoch:[0/2](856700/4588595) loss:3.124 lr:0.0000100 epoch_Time:23754.0min: [2024-01-06 12:32:40,370][model8_pretrain.py][INFO] Epoch:[0/2](856700/4588595) loss:2.851 lr:0.0000100 epoch_Time:23754.0min: [2024-01-06 12:32:40,370][model8_pretrain.py][INFO] Epoch:[0/2](856700/4588595) loss:2.567 lr:0.0000100 epoch_Time:23754.0min: [2024-01-06 12:32:40,370][model8_pretrain.py][INFO] Epoch:[0/2](856700/4588595) loss:2.822 lr:0.0000100 epoch_Time:23754.0min: [2024-01-06 12:32:40,370][model8_pretrain.py][INFO] Epoch:[0/2](856700/4588595) loss:2.072 lr:0.0000100 epoch_Time:23754.0min: [2024-01-06 12:33:17,301][model8_pretrain.py][INFO] Epoch:[0/2](856800/4588595) loss:2.740 lr:0.0000100 
epoch_Time:23753.0min: [2024-01-06 12:33:17,301][model8_pretrain.py][INFO] Epoch:[0/2](856800/4588595) loss:2.407 lr:0.0000100 epoch_Time:23753.0min: [2024-01-06 12:33:17,301][model8_pretrain.py][INFO] Epoch:[0/2](856800/4588595) loss:2.311 lr:0.0000100 epoch_Time:23753.0min: [2024-01-06 12:33:17,301][model8_pretrain.py][INFO] Epoch:[0/2](856800/4588595) loss:2.613 lr:0.0000100 epoch_Time:23753.0min: [2024-01-06 12:33:17,301][model8_pretrain.py][INFO] Epoch:[0/2](856800/4588595) loss:2.739 lr:0.0000100 epoch_Time:23753.0min: [2024-01-06 12:33:17,301][model8_pretrain.py][INFO] Epoch:[0/2](856800/4588595) loss:3.301 lr:0.0000100 epoch_Time:23753.0min: [2024-01-06 12:33:17,301][model8_pretrain.py][INFO] Epoch:[0/2](856800/4588595) loss:3.050 lr:0.0000100 epoch_Time:23753.0min: [2024-01-06 12:33:17,301][model8_pretrain.py][INFO] Epoch:[0/2](856800/4588595) loss:2.793 lr:0.0000100 epoch_Time:23753.0min: [2024-01-06 12:33:54,239][model8_pretrain.py][INFO] Epoch:[0/2](856900/4588595) loss:3.066 lr:0.0000100 epoch_Time:23752.0min: [2024-01-06 12:33:54,239][model8_pretrain.py][INFO] Epoch:[0/2](856900/4588595) loss:2.797 lr:0.0000100 epoch_Time:23752.0min: [2024-01-06 12:33:54,239][model8_pretrain.py][INFO] Epoch:[0/2](856900/4588595) loss:2.701 lr:0.0000100 epoch_Time:23752.0min: [2024-01-06 12:33:54,239][model8_pretrain.py][INFO] Epoch:[0/2](856900/4588595) loss:3.019 lr:0.0000100 epoch_Time:23752.0min: [2024-01-06 12:33:54,239][model8_pretrain.py][INFO] Epoch:[0/2](856900/4588595) loss:3.232 lr:0.0000100 epoch_Time:23752.0min: [2024-01-06 12:33:54,239][model8_pretrain.py][INFO] Epoch:[0/2](856900/4588595) loss:2.998 lr:0.0000100 epoch_Time:23752.0min: [2024-01-06 12:33:54,239][model8_pretrain.py][INFO] Epoch:[0/2](856900/4588595) loss:2.881 lr:0.0000100 epoch_Time:23752.0min: [2024-01-06 12:33:54,240][model8_pretrain.py][INFO] Epoch:[0/2](856900/4588595) loss:2.853 lr:0.0000100 epoch_Time:23752.0min: [2024-01-06 12:34:31,171][model8_pretrain.py][INFO] 
Epoch:[0/2](857000/4588595) loss:3.363 lr:0.0000100 epoch_Time:23751.0min: [2024-01-06 12:34:31,171][model8_pretrain.py][INFO] Epoch:[0/2](857000/4588595) loss:3.168 lr:0.0000100 epoch_Time:23751.0min: [2024-01-06 12:34:31,171][model8_pretrain.py][INFO] Epoch:[0/2](857000/4588595) loss:2.895 lr:0.0000100 epoch_Time:23751.0min: [2024-01-06 12:34:31,171][model8_pretrain.py][INFO] Epoch:[0/2](857000/4588595) loss:2.378 lr:0.0000100 epoch_Time:23751.0min: [2024-01-06 12:34:31,171][model8_pretrain.py][INFO] Epoch:[0/2](857000/4588595) loss:3.239 lr:0.0000100 epoch_Time:23751.0min: [2024-01-06 12:34:31,171][model8_pretrain.py][INFO] Epoch:[0/2](857000/4588595) loss:2.587 lr:0.0000100 epoch_Time:23751.0min: [2024-01-06 12:34:31,171][model8_pretrain.py][INFO] Epoch:[0/2](857000/4588595) loss:2.907 lr:0.0000100 epoch_Time:23751.0min: [2024-01-06 12:34:31,172][model8_pretrain.py][INFO] Epoch:[0/2](857000/4588595) loss:2.232 lr:0.0000100 epoch_Time:23751.0min: [2024-01-06 12:35:08,117][model8_pretrain.py][INFO] Epoch:[0/2](857100/4588595) loss:2.122 lr:0.0000100 epoch_Time:23750.0min: [2024-01-06 12:35:08,117][model8_pretrain.py][INFO] Epoch:[0/2](857100/4588595) loss:2.607 lr:0.0000100 epoch_Time:23750.0min: [2024-01-06 12:35:08,117][model8_pretrain.py][INFO] Epoch:[0/2](857100/4588595) loss:3.108 lr:0.0000100 epoch_Time:23750.0min: [2024-01-06 12:35:08,117][model8_pretrain.py][INFO] Epoch:[0/2](857100/4588595) loss:2.821 lr:0.0000100 epoch_Time:23750.0min: [2024-01-06 12:35:08,117][model8_pretrain.py][INFO] Epoch:[0/2](857100/4588595) loss:3.030 lr:0.0000100 epoch_Time:23750.0min: [2024-01-06 12:35:08,117][model8_pretrain.py][INFO] Epoch:[0/2](857100/4588595) loss:2.979 lr:0.0000100 epoch_Time:23750.0min: [2024-01-06 12:35:08,118][model8_pretrain.py][INFO] Epoch:[0/2](857100/4588595) loss:3.307 lr:0.0000100 epoch_Time:23750.0min: [2024-01-06 12:35:08,118][model8_pretrain.py][INFO] Epoch:[0/2](857100/4588595) loss:3.101 lr:0.0000100 epoch_Time:23750.0min: [2024-01-06 
12:35:45,062][model8_pretrain.py][INFO] Epoch:[0/2](857200/4588595) loss:2.419 lr:0.0000100 epoch_Time:23750.0min: [2024-01-06 12:35:45,062][model8_pretrain.py][INFO] Epoch:[0/2](857200/4588595) loss:3.003 lr:0.0000100 epoch_Time:23750.0min: [2024-01-06 12:35:45,062][model8_pretrain.py][INFO] Epoch:[0/2](857200/4588595) loss:2.765 lr:0.0000100 epoch_Time:23750.0min: [2024-01-06 12:35:45,062][model8_pretrain.py][INFO] Epoch:[0/2](857200/4588595) loss:2.558 lr:0.0000100 epoch_Time:23750.0min: [2024-01-06 12:35:45,062][model8_pretrain.py][INFO] Epoch:[0/2](857200/4588595) loss:2.491 lr:0.0000100 epoch_Time:23750.0min: [2024-01-06 12:35:45,062][model8_pretrain.py][INFO] Epoch:[0/2](857200/4588595) loss:2.911 lr:0.0000100 epoch_Time:23750.0min: [2024-01-06 12:35:45,062][model8_pretrain.py][INFO] Epoch:[0/2](857200/4588595) loss:2.521 lr:0.0000100 epoch_Time:23750.0min: [2024-01-06 12:35:45,063][model8_pretrain.py][INFO] Epoch:[0/2](857200/4588595) loss:3.193 lr:0.0000100 epoch_Time:23750.0min: [2024-01-06 12:36:22,012][model8_pretrain.py][INFO] Epoch:[0/2](857300/4588595) loss:3.053 lr:0.0000100 epoch_Time:23749.0min: [2024-01-06 12:36:22,012][model8_pretrain.py][INFO] Epoch:[0/2](857300/4588595) loss:2.784 lr:0.0000100 epoch_Time:23749.0min: [2024-01-06 12:36:22,012][model8_pretrain.py][INFO] Epoch:[0/2](857300/4588595) loss:3.264 lr:0.0000100 epoch_Time:23749.0min: [2024-01-06 12:36:22,012][model8_pretrain.py][INFO] Epoch:[0/2](857300/4588595) loss:2.504 lr:0.0000100 epoch_Time:23749.0min: [2024-01-06 12:36:22,013][model8_pretrain.py][INFO] Epoch:[0/2](857300/4588595) loss:2.181 lr:0.0000100 epoch_Time:23749.0min: [2024-01-06 12:36:22,013][model8_pretrain.py][INFO] Epoch:[0/2](857300/4588595) loss:2.273 lr:0.0000100 epoch_Time:23749.0min: [2024-01-06 12:36:22,013][model8_pretrain.py][INFO] Epoch:[0/2](857300/4588595) loss:2.931 lr:0.0000100 epoch_Time:23749.0min: [2024-01-06 12:36:22,013][model8_pretrain.py][INFO] Epoch:[0/2](857300/4588595) loss:2.541 lr:0.0000100 
epoch_Time:23749.0min: [2024-01-06 12:37:03,937][model8_pretrain.py][INFO] Epoch:[0/2](857400/4588595) loss:2.686 lr:0.0000100 epoch_Time:23748.0min: [2024-01-06 12:37:03,937][model8_pretrain.py][INFO] Epoch:[0/2](857400/4588595) loss:3.107 lr:0.0000100 epoch_Time:23748.0min: [2024-01-06 12:37:03,937][model8_pretrain.py][INFO] Epoch:[0/2](857400/4588595) loss:3.355 lr:0.0000100 epoch_Time:23748.0min: [2024-01-06 12:37:03,937][model8_pretrain.py][INFO] Epoch:[0/2](857400/4588595) loss:2.475 lr:0.0000100 epoch_Time:23748.0min: [2024-01-06 12:37:03,937][model8_pretrain.py][INFO] Epoch:[0/2](857400/4588595) loss:2.468 lr:0.0000100 epoch_Time:23748.0min: [2024-01-06 12:37:03,937][model8_pretrain.py][INFO] Epoch:[0/2](857400/4588595) loss:2.893 lr:0.0000100 epoch_Time:23748.0min: [2024-01-06 12:37:03,937][model8_pretrain.py][INFO] Epoch:[0/2](857400/4588595) loss:2.658 lr:0.0000100 epoch_Time:23748.0min: [2024-01-06 12:37:03,937][model8_pretrain.py][INFO] Epoch:[0/2](857400/4588595) loss:3.150 lr:0.0000100 epoch_Time:23748.0min: [2024-01-06 12:37:46,105][model8_pretrain.py][INFO] Epoch:[0/2](857500/4588595) loss:1.808 lr:0.0000100 epoch_Time:23749.0min: [2024-01-06 12:37:46,105][model8_pretrain.py][INFO] Epoch:[0/2](857500/4588595) loss:2.602 lr:0.0000100 epoch_Time:23749.0min: [2024-01-06 12:37:46,105][model8_pretrain.py][INFO] Epoch:[0/2](857500/4588595) loss:2.526 lr:0.0000100 epoch_Time:23749.0min: [2024-01-06 12:37:46,105][model8_pretrain.py][INFO] Epoch:[0/2](857500/4588595) loss:3.210 lr:0.0000100 epoch_Time:23749.0min: [2024-01-06 12:37:46,105][model8_pretrain.py][INFO] Epoch:[0/2](857500/4588595) loss:3.008 lr:0.0000100 epoch_Time:23749.0min: [2024-01-06 12:37:46,105][model8_pretrain.py][INFO] Epoch:[0/2](857500/4588595) loss:2.163 lr:0.0000100 epoch_Time:23749.0min: [2024-01-06 12:37:46,105][model8_pretrain.py][INFO] Epoch:[0/2](857500/4588595) loss:2.866 lr:0.0000100 epoch_Time:23749.0min: [2024-01-06 12:37:46,105][model8_pretrain.py][INFO] 
Epoch:[0/2](857500/4588595) loss:2.941 lr:0.0000100 epoch_Time:23749.0min: [2024-01-06 12:38:23,043][model8_pretrain.py][INFO] Epoch:[0/2](857600/4588595) loss:3.225 lr:0.0000100 epoch_Time:23748.0min: [2024-01-06 12:38:23,043][model8_pretrain.py][INFO] Epoch:[0/2](857600/4588595) loss:2.715 lr:0.0000100 epoch_Time:23748.0min: [2024-01-06 12:38:23,044][model8_pretrain.py][INFO] Epoch:[0/2](857600/4588595) loss:2.939 lr:0.0000100 epoch_Time:23748.0min: [2024-01-06 12:38:23,043][model8_pretrain.py][INFO] Epoch:[0/2](857600/4588595) loss:2.408 lr:0.0000100 epoch_Time:23748.0min: [2024-01-06 12:38:23,044][model8_pretrain.py][INFO] Epoch:[0/2](857600/4588595) loss:2.249 lr:0.0000100 epoch_Time:23748.0min: [2024-01-06 12:38:23,044][model8_pretrain.py][INFO] Epoch:[0/2](857600/4588595) loss:2.545 lr:0.0000100 epoch_Time:23748.0min: [2024-01-06 12:38:23,044][model8_pretrain.py][INFO] Epoch:[0/2](857600/4588595) loss:2.665 lr:0.0000100 epoch_Time:23748.0min: [2024-01-06 12:38:23,044][model8_pretrain.py][INFO] Epoch:[0/2](857600/4588595) loss:2.717 lr:0.0000100 epoch_Time:23748.0min: [2024-01-06 12:38:59,981][model8_pretrain.py][INFO] Epoch:[0/2](857700/4588595) loss:3.002 lr:0.0000100 epoch_Time:23747.0min: [2024-01-06 12:38:59,981][model8_pretrain.py][INFO] Epoch:[0/2](857700/4588595) loss:2.821 lr:0.0000100 epoch_Time:23747.0min: [2024-01-06 12:38:59,981][model8_pretrain.py][INFO] Epoch:[0/2](857700/4588595) loss:2.840 lr:0.0000100 epoch_Time:23747.0min: [2024-01-06 12:38:59,981][model8_pretrain.py][INFO] Epoch:[0/2](857700/4588595) loss:2.853 lr:0.0000100 epoch_Time:23747.0min: [2024-01-06 12:38:59,981][model8_pretrain.py][INFO] Epoch:[0/2](857700/4588595) loss:2.418 lr:0.0000100 epoch_Time:23747.0min: [2024-01-06 12:38:59,981][model8_pretrain.py][INFO] Epoch:[0/2](857700/4588595) loss:2.987 lr:0.0000100 epoch_Time:23747.0min: [2024-01-06 12:38:59,981][model8_pretrain.py][INFO] Epoch:[0/2](857700/4588595) loss:1.908 lr:0.0000100 epoch_Time:23747.0min: [2024-01-06 
12:38:59,982][model8_pretrain.py][INFO] Epoch:[0/2](857700/4588595) loss:2.782 lr:0.0000100 epoch_Time:23747.0min: [2024-01-06 12:39:36,921][model8_pretrain.py][INFO] Epoch:[0/2](857800/4588595) loss:2.674 lr:0.0000100 epoch_Time:23746.0min: [2024-01-06 12:39:36,921][model8_pretrain.py][INFO] Epoch:[0/2](857800/4588595) loss:2.889 lr:0.0000100 epoch_Time:23746.0min: [2024-01-06 12:39:36,921][model8_pretrain.py][INFO] Epoch:[0/2](857800/4588595) loss:2.496 lr:0.0000100 epoch_Time:23746.0min: [2024-01-06 12:39:36,921][model8_pretrain.py][INFO] Epoch:[0/2](857800/4588595) loss:2.705 lr:0.0000100 epoch_Time:23746.0min: [2024-01-06 12:39:36,921][model8_pretrain.py][INFO] Epoch:[0/2](857800/4588595) loss:2.823 lr:0.0000100 epoch_Time:23746.0min: [2024-01-06 12:39:36,921][model8_pretrain.py][INFO] Epoch:[0/2](857800/4588595) loss:3.051 lr:0.0000100 epoch_Time:23746.0min: [2024-01-06 12:39:36,921][model8_pretrain.py][INFO] Epoch:[0/2](857800/4588595) loss:2.283 lr:0.0000100 epoch_Time:23746.0min: [2024-01-06 12:39:36,921][model8_pretrain.py][INFO] Epoch:[0/2](857800/4588595) loss:3.250 lr:0.0000100 epoch_Time:23746.0min: [2024-01-06 12:40:13,856][model8_pretrain.py][INFO] Epoch:[0/2](857900/4588595) loss:2.685 lr:0.0000100 epoch_Time:23745.0min: [2024-01-06 12:40:13,856][model8_pretrain.py][INFO] Epoch:[0/2](857900/4588595) loss:2.794 lr:0.0000100 epoch_Time:23745.0min: [2024-01-06 12:40:13,856][model8_pretrain.py][INFO] Epoch:[0/2](857900/4588595) loss:3.003 lr:0.0000100 epoch_Time:23745.0min: [2024-01-06 12:40:13,857][model8_pretrain.py][INFO] Epoch:[0/2](857900/4588595) loss:2.653 lr:0.0000100 epoch_Time:23745.0min: [2024-01-06 12:40:13,857][model8_pretrain.py][INFO] Epoch:[0/2](857900/4588595) loss:2.621 lr:0.0000100 epoch_Time:23745.0min: [2024-01-06 12:40:13,856][model8_pretrain.py][INFO] Epoch:[0/2](857900/4588595) loss:3.310 lr:0.0000100 epoch_Time:23745.0min: [2024-01-06 12:40:13,857][model8_pretrain.py][INFO] Epoch:[0/2](857900/4588595) loss:2.781 lr:0.0000100 
epoch_Time:23745.0min: [2024-01-06 12:40:13,857][model8_pretrain.py][INFO] Epoch:[0/2](857900/4588595) loss:2.972 lr:0.0000100 epoch_Time:23745.0min: [2024-01-06 12:40:50,797][model8_pretrain.py][INFO] Epoch:[0/2](858000/4588595) loss:3.186 lr:0.0000100 epoch_Time:23744.0min: [2024-01-06 12:40:50,797][model8_pretrain.py][INFO] Epoch:[0/2](858000/4588595) loss:1.721 lr:0.0000100 epoch_Time:23744.0min: [2024-01-06 12:40:50,797][model8_pretrain.py][INFO] Epoch:[0/2](858000/4588595) loss:2.492 lr:0.0000100 epoch_Time:23744.0min: [2024-01-06 12:40:50,797][model8_pretrain.py][INFO] Epoch:[0/2](858000/4588595) loss:2.969 lr:0.0000100 epoch_Time:23744.0min: [2024-01-06 12:40:50,797][model8_pretrain.py][INFO] Epoch:[0/2](858000/4588595) loss:2.015 lr:0.0000100 epoch_Time:23744.0min: [2024-01-06 12:40:50,797][model8_pretrain.py][INFO] Epoch:[0/2](858000/4588595) loss:2.644 lr:0.0000100 epoch_Time:23744.0min: [2024-01-06 12:40:50,797][model8_pretrain.py][INFO] Epoch:[0/2](858000/4588595) loss:2.301 lr:0.0000100 epoch_Time:23744.0min: [2024-01-06 12:40:50,797][model8_pretrain.py][INFO] Epoch:[0/2](858000/4588595) loss:2.735 lr:0.0000100 epoch_Time:23744.0min: [2024-01-06 12:41:27,742][model8_pretrain.py][INFO] Epoch:[0/2](858100/4588595) loss:2.861 lr:0.0000100 epoch_Time:23744.0min: [2024-01-06 12:41:27,742][model8_pretrain.py][INFO] Epoch:[0/2](858100/4588595) loss:3.077 lr:0.0000100 epoch_Time:23744.0min: [2024-01-06 12:41:27,742][model8_pretrain.py][INFO] Epoch:[0/2](858100/4588595) loss:2.621 lr:0.0000100 epoch_Time:23744.0min: [2024-01-06 12:41:27,742][model8_pretrain.py][INFO] Epoch:[0/2](858100/4588595) loss:3.124 lr:0.0000100 epoch_Time:23744.0min: [2024-01-06 12:41:27,742][model8_pretrain.py][INFO] Epoch:[0/2](858100/4588595) loss:2.181 lr:0.0000100 epoch_Time:23744.0min: [2024-01-06 12:41:27,742][model8_pretrain.py][INFO] Epoch:[0/2](858100/4588595) loss:2.755 lr:0.0000100 epoch_Time:23744.0min: [2024-01-06 12:41:27,742][model8_pretrain.py][INFO] 
Epoch:[0/2](858100/4588595) loss:2.167 lr:0.0000100 epoch_Time:23744.0min: [2024-01-06 12:41:27,742][model8_pretrain.py][INFO] Epoch:[0/2](858100/4588595) loss:2.773 lr:0.0000100 epoch_Time:23744.0min: [2024-01-06 12:42:09,633][model8_pretrain.py][INFO] Epoch:[0/2](858200/4588595) loss:2.615 lr:0.0000100 epoch_Time:23743.0min: [2024-01-06 12:42:09,633][model8_pretrain.py][INFO] Epoch:[0/2](858200/4588595) loss:3.106 lr:0.0000100 epoch_Time:23743.0min: [2024-01-06 12:42:09,633][model8_pretrain.py][INFO] Epoch:[0/2](858200/4588595) loss:2.730 lr:0.0000100 epoch_Time:23743.0min: [2024-01-06 12:42:09,633][model8_pretrain.py][INFO] Epoch:[0/2](858200/4588595) loss:2.697 lr:0.0000100 epoch_Time:23743.0min: [2024-01-06 12:42:09,633][model8_pretrain.py][INFO] Epoch:[0/2](858200/4588595) loss:2.903 lr:0.0000100 epoch_Time:23743.0min: [2024-01-06 12:42:09,634][model8_pretrain.py][INFO] Epoch:[0/2](858200/4588595) loss:2.946 lr:0.0000100 epoch_Time:23743.0min: [2024-01-06 12:42:09,635][model8_pretrain.py][INFO] Epoch:[0/2](858200/4588595) loss:2.663 lr:0.0000100 epoch_Time:23743.0min: [2024-01-06 12:42:09,638][model8_pretrain.py][INFO] Epoch:[0/2](858200/4588595) loss:3.133 lr:0.0000100 epoch_Time:23743.0min: [2024-01-06 12:42:52,244][model8_pretrain.py][INFO] Epoch:[0/2](858300/4588595) loss:3.228 lr:0.0000100 epoch_Time:23743.0min: [2024-01-06 12:42:52,244][model8_pretrain.py][INFO] Epoch:[0/2](858300/4588595) loss:3.009 lr:0.0000100 epoch_Time:23743.0min: [2024-01-06 12:42:52,244][model8_pretrain.py][INFO] Epoch:[0/2](858300/4588595) loss:2.912 lr:0.0000100 epoch_Time:23743.0min: [2024-01-06 12:42:52,244][model8_pretrain.py][INFO] Epoch:[0/2](858300/4588595) loss:3.553 lr:0.0000100 epoch_Time:23743.0min: [2024-01-06 12:42:52,244][model8_pretrain.py][INFO] Epoch:[0/2](858300/4588595) loss:2.741 lr:0.0000100 epoch_Time:23743.0min: [2024-01-06 12:42:52,244][model8_pretrain.py][INFO] Epoch:[0/2](858300/4588595) loss:3.216 lr:0.0000100 epoch_Time:23743.0min: [2024-01-06 
12:42:52,245][model8_pretrain.py][INFO] Epoch:[0/2](858300/4588595) loss:2.786 lr:0.0000100 epoch_Time:23743.0min: [2024-01-06 12:42:52,245][model8_pretrain.py][INFO] Epoch:[0/2](858300/4588595) loss:2.281 lr:0.0000100 epoch_Time:23743.0min: [2024-01-06 12:43:29,183][model8_pretrain.py][INFO] Epoch:[0/2](858400/4588595) loss:2.553 lr:0.0000100 epoch_Time:23743.0min: [2024-01-06 12:43:29,183][model8_pretrain.py][INFO] Epoch:[0/2](858400/4588595) loss:3.103 lr:0.0000100 epoch_Time:23743.0min: [2024-01-06 12:43:29,183][model8_pretrain.py][INFO] Epoch:[0/2](858400/4588595) loss:2.651 lr:0.0000100 epoch_Time:23743.0min: [2024-01-06 12:43:29,183][model8_pretrain.py][INFO] Epoch:[0/2](858400/4588595) loss:2.953 lr:0.0000100 epoch_Time:23743.0min: [2024-01-06 12:43:29,183][model8_pretrain.py][INFO] Epoch:[0/2](858400/4588595) loss:3.182 lr:0.0000100 epoch_Time:23743.0min: [2024-01-06 12:43:29,183][model8_pretrain.py][INFO] Epoch:[0/2](858400/4588595) loss:3.119 lr:0.0000100 epoch_Time:23743.0min: [2024-01-06 12:43:29,183][model8_pretrain.py][INFO] Epoch:[0/2](858400/4588595) loss:2.769 lr:0.0000100 epoch_Time:23743.0min: [2024-01-06 12:43:29,184][model8_pretrain.py][INFO] Epoch:[0/2](858400/4588595) loss:3.047 lr:0.0000100 epoch_Time:23743.0min: [2024-01-06 12:44:06,139][model8_pretrain.py][INFO] Epoch:[0/2](858500/4588595) loss:3.018 lr:0.0000100 epoch_Time:23742.0min: [2024-01-06 12:44:06,139][model8_pretrain.py][INFO] Epoch:[0/2](858500/4588595) loss:2.489 lr:0.0000100 epoch_Time:23742.0min: [2024-01-06 12:44:06,139][model8_pretrain.py][INFO] Epoch:[0/2](858500/4588595) loss:3.045 lr:0.0000100 epoch_Time:23742.0min: [2024-01-06 12:44:06,139][model8_pretrain.py][INFO] Epoch:[0/2](858500/4588595) loss:2.705 lr:0.0000100 epoch_Time:23742.0min: [2024-01-06 12:44:06,139][model8_pretrain.py][INFO] Epoch:[0/2](858500/4588595) loss:2.792 lr:0.0000100 epoch_Time:23742.0min: [2024-01-06 12:44:06,139][model8_pretrain.py][INFO] Epoch:[0/2](858500/4588595) loss:2.371 lr:0.0000100 
epoch_Time:23742.0min: [2024-01-06 12:44:06,139][model8_pretrain.py][INFO] Epoch:[0/2](858500/4588595) loss:3.185 lr:0.0000100 epoch_Time:23742.0min: [2024-01-06 12:44:06,139][model8_pretrain.py][INFO] Epoch:[0/2](858500/4588595) loss:2.377 lr:0.0000100 epoch_Time:23742.0min: [2024-01-06 12:44:43,086][model8_pretrain.py][INFO] Epoch:[0/2](858600/4588595) loss:2.757 lr:0.0000100 epoch_Time:23742.0min: [2024-01-06 12:44:43,086][model8_pretrain.py][INFO] Epoch:[0/2](858600/4588595) loss:2.437 lr:0.0000100 epoch_Time:23742.0min: [2024-01-06 12:44:43,086][model8_pretrain.py][INFO] Epoch:[0/2](858600/4588595) loss:2.896 lr:0.0000100 epoch_Time:23742.0min: [2024-01-06 12:44:43,086][model8_pretrain.py][INFO] Epoch:[0/2](858600/4588595) loss:2.851 lr:0.0000100 epoch_Time:23742.0min: [2024-01-06 12:44:43,086][model8_pretrain.py][INFO] Epoch:[0/2](858600/4588595) loss:3.107 lr:0.0000100 epoch_Time:23742.0min: [2024-01-06 12:44:43,086][model8_pretrain.py][INFO] Epoch:[0/2](858600/4588595) loss:3.017 lr:0.0000100 epoch_Time:23742.0min: [2024-01-06 12:44:43,086][model8_pretrain.py][INFO] Epoch:[0/2](858600/4588595) loss:3.249 lr:0.0000100 epoch_Time:23742.0min: [2024-01-06 12:44:43,086][model8_pretrain.py][INFO] Epoch:[0/2](858600/4588595) loss:2.144 lr:0.0000100 epoch_Time:23742.0min: [2024-01-06 12:45:20,042][model8_pretrain.py][INFO] Epoch:[0/2](858700/4588595) loss:2.120 lr:0.0000100 epoch_Time:23740.0min: [2024-01-06 12:45:20,042][model8_pretrain.py][INFO] Epoch:[0/2](858700/4588595) loss:2.695 lr:0.0000100 epoch_Time:23740.0min: [2024-01-06 12:45:20,042][model8_pretrain.py][INFO] Epoch:[0/2](858700/4588595) loss:2.945 lr:0.0000100 epoch_Time:23740.0min: [2024-01-06 12:45:20,042][model8_pretrain.py][INFO] Epoch:[0/2](858700/4588595) loss:3.053 lr:0.0000100 epoch_Time:23740.0min: [2024-01-06 12:45:20,042][model8_pretrain.py][INFO] Epoch:[0/2](858700/4588595) loss:3.513 lr:0.0000100 epoch_Time:23740.0min: [2024-01-06 12:45:20,042][model8_pretrain.py][INFO] 
Epoch:[0/2](858700/4588595) loss:2.927 lr:0.0000100 epoch_Time:23740.0min: [2024-01-06 12:45:20,042][model8_pretrain.py][INFO] Epoch:[0/2](858700/4588595) loss:3.195 lr:0.0000100 epoch_Time:23740.0min: [2024-01-06 12:45:20,043][model8_pretrain.py][INFO] Epoch:[0/2](858700/4588595) loss:2.179 lr:0.0000100 epoch_Time:23740.0min: [2024-01-06 12:45:56,988][model8_pretrain.py][INFO] Epoch:[0/2](858800/4588595) loss:2.598 lr:0.0000100 epoch_Time:23739.0min: [2024-01-06 12:45:56,988][model8_pretrain.py][INFO] Epoch:[0/2](858800/4588595) loss:2.798 lr:0.0000100 epoch_Time:23739.0min: [2024-01-06 12:45:56,988][model8_pretrain.py][INFO] Epoch:[0/2](858800/4588595) loss:3.185 lr:0.0000100 epoch_Time:23739.0min: [2024-01-06 12:45:56,988][model8_pretrain.py][INFO] Epoch:[0/2](858800/4588595) loss:2.482 lr:0.0000100 epoch_Time:23739.0min: [2024-01-06 12:45:56,988][model8_pretrain.py][INFO] Epoch:[0/2](858800/4588595) loss:3.030 lr:0.0000100 epoch_Time:23739.0min: [2024-01-06 12:45:56,988][model8_pretrain.py][INFO] Epoch:[0/2](858800/4588595) loss:2.772 lr:0.0000100 epoch_Time:23739.0min: [2024-01-06 12:45:56,988][model8_pretrain.py][INFO] Epoch:[0/2](858800/4588595) loss:3.279 lr:0.0000100 epoch_Time:23739.0min: [2024-01-06 12:45:56,988][model8_pretrain.py][INFO] Epoch:[0/2](858800/4588595) loss:3.017 lr:0.0000100 epoch_Time:23739.0min: [2024-01-06 12:46:33,934][model8_pretrain.py][INFO] Epoch:[0/2](858900/4588595) loss:2.958 lr:0.0000100 epoch_Time:23739.0min: [2024-01-06 12:46:33,934][model8_pretrain.py][INFO] Epoch:[0/2](858900/4588595) loss:2.453 lr:0.0000100 epoch_Time:23739.0min: [2024-01-06 12:46:33,934][model8_pretrain.py][INFO] Epoch:[0/2](858900/4588595) loss:2.369 lr:0.0000100 epoch_Time:23739.0min: [2024-01-06 12:46:33,934][model8_pretrain.py][INFO] Epoch:[0/2](858900/4588595) loss:2.067 lr:0.0000100 epoch_Time:23739.0min: [2024-01-06 12:46:33,934][model8_pretrain.py][INFO] Epoch:[0/2](858900/4588595) loss:2.575 lr:0.0000100 epoch_Time:23739.0min: [2024-01-06 
12:46:33,934][model8_pretrain.py][INFO] Epoch:[0/2](858900/4588595) loss:2.829 lr:0.0000100 epoch_Time:23739.0min: [2024-01-06 12:46:33,934][model8_pretrain.py][INFO] Epoch:[0/2](858900/4588595) loss:3.047 lr:0.0000100 epoch_Time:23739.0min: [2024-01-06 12:46:33,935][model8_pretrain.py][INFO] Epoch:[0/2](858900/4588595) loss:2.869 lr:0.0000100 epoch_Time:23739.0min: [2024-01-06 12:47:13,988][model8_pretrain.py][INFO] Epoch:[0/2](859000/4588595) loss:3.015 lr:0.0000100 epoch_Time:23738.0min: [2024-01-06 12:47:13,988][model8_pretrain.py][INFO] Epoch:[0/2](859000/4588595) loss:2.623 lr:0.0000100 epoch_Time:23738.0min: [2024-01-06 12:47:13,988][model8_pretrain.py][INFO] Epoch:[0/2](859000/4588595) loss:2.865 lr:0.0000100 epoch_Time:23738.0min: [2024-01-06 12:47:13,988][model8_pretrain.py][INFO] Epoch:[0/2](859000/4588595) loss:2.970 lr:0.0000100 epoch_Time:23738.0min: [2024-01-06 12:47:13,988][model8_pretrain.py][INFO] Epoch:[0/2](859000/4588595) loss:2.763 lr:0.0000100 epoch_Time:23738.0min: [2024-01-06 12:47:13,988][model8_pretrain.py][INFO] Epoch:[0/2](859000/4588595) loss:2.751 lr:0.0000100 epoch_Time:23738.0min: [2024-01-06 12:47:13,988][model8_pretrain.py][INFO] Epoch:[0/2](859000/4588595) loss:3.139 lr:0.0000100 epoch_Time:23738.0min: [2024-01-06 12:47:13,988][model8_pretrain.py][INFO] Epoch:[0/2](859000/4588595) loss:2.802 lr:0.0000100 epoch_Time:23738.0min: [2024-01-06 12:47:57,881][model8_pretrain.py][INFO] Epoch:[0/2](859100/4588595) loss:3.018 lr:0.0000100 epoch_Time:23738.0min: [2024-01-06 12:47:57,881][model8_pretrain.py][INFO] Epoch:[0/2](859100/4588595) loss:2.538 lr:0.0000100 epoch_Time:23738.0min: [2024-01-06 12:47:57,881][model8_pretrain.py][INFO] Epoch:[0/2](859100/4588595) loss:2.978 lr:0.0000100 epoch_Time:23738.0min: [2024-01-06 12:47:57,881][model8_pretrain.py][INFO] Epoch:[0/2](859100/4588595) loss:2.710 lr:0.0000100 epoch_Time:23738.0min: [2024-01-06 12:47:57,881][model8_pretrain.py][INFO] Epoch:[0/2](859100/4588595) loss:2.649 lr:0.0000100 
epoch_Time:23738.0min: [2024-01-06 12:47:57,881][model8_pretrain.py][INFO] Epoch:[0/2](859100/4588595) loss:2.816 lr:0.0000100 epoch_Time:23738.0min: [2024-01-06 12:47:57,881][model8_pretrain.py][INFO] Epoch:[0/2](859100/4588595) loss:2.845 lr:0.0000100 epoch_Time:23738.0min: [2024-01-06 12:47:57,881][model8_pretrain.py][INFO] Epoch:[0/2](859100/4588595) loss:3.209 lr:0.0000100 epoch_Time:23738.0min: [2024-01-06 12:48:34,811][model8_pretrain.py][INFO] Epoch:[0/2](859200/4588595) loss:2.756 lr:0.0000100 epoch_Time:23738.0min: [2024-01-06 12:48:34,811][model8_pretrain.py][INFO] Epoch:[0/2](859200/4588595) loss:3.084 lr:0.0000100 epoch_Time:23738.0min: [2024-01-06 12:48:34,811][model8_pretrain.py][INFO] Epoch:[0/2](859200/4588595) loss:2.910 lr:0.0000100 epoch_Time:23738.0min: [2024-01-06 12:48:34,811][model8_pretrain.py][INFO] Epoch:[0/2](859200/4588595) loss:2.767 lr:0.0000100 epoch_Time:23738.0min: [2024-01-06 12:48:34,811][model8_pretrain.py][INFO] Epoch:[0/2](859200/4588595) loss:2.670 lr:0.0000100 epoch_Time:23738.0min: [2024-01-06 12:48:34,811][model8_pretrain.py][INFO] Epoch:[0/2](859200/4588595) loss:2.524 lr:0.0000100 epoch_Time:23738.0min: [2024-01-06 12:48:34,811][model8_pretrain.py][INFO] Epoch:[0/2](859200/4588595) loss:2.457 lr:0.0000100 epoch_Time:23738.0min: [2024-01-06 12:48:34,812][model8_pretrain.py][INFO] Epoch:[0/2](859200/4588595) loss:2.995 lr:0.0000100 epoch_Time:23738.0min: [2024-01-06 12:49:11,753][model8_pretrain.py][INFO] Epoch:[0/2](859300/4588595) loss:2.691 lr:0.0000100 epoch_Time:23737.0min: [2024-01-06 12:49:11,753][model8_pretrain.py][INFO] Epoch:[0/2](859300/4588595) loss:3.130 lr:0.0000100 epoch_Time:23737.0min: [2024-01-06 12:49:11,753][model8_pretrain.py][INFO] Epoch:[0/2](859300/4588595) loss:3.263 lr:0.0000100 epoch_Time:23737.0min: [2024-01-06 12:49:11,753][model8_pretrain.py][INFO] Epoch:[0/2](859300/4588595) loss:2.565 lr:0.0000100 epoch_Time:23737.0min: [2024-01-06 12:49:11,754][model8_pretrain.py][INFO] 
Epoch:[0/2](859300/4588595) loss:2.942 lr:0.0000100 epoch_Time:23737.0min: [2024-01-06 12:49:11,754][model8_pretrain.py][INFO] Epoch:[0/2](859300/4588595) loss:2.970 lr:0.0000100 epoch_Time:23737.0min: [2024-01-06 12:49:11,754][model8_pretrain.py][INFO] Epoch:[0/2](859300/4588595) loss:2.890 lr:0.0000100 epoch_Time:23737.0min: [2024-01-06 12:49:11,754][model8_pretrain.py][INFO] Epoch:[0/2](859300/4588595) loss:2.564 lr:0.0000100 epoch_Time:23737.0min: [2024-01-06 12:49:48,699][model8_pretrain.py][INFO] Epoch:[0/2](859400/4588595) loss:3.068 lr:0.0000100 epoch_Time:23736.0min: [2024-01-06 12:49:48,699][model8_pretrain.py][INFO] Epoch:[0/2](859400/4588595) loss:2.697 lr:0.0000100 epoch_Time:23736.0min: [2024-01-06 12:49:48,699][model8_pretrain.py][INFO] Epoch:[0/2](859400/4588595) loss:2.809 lr:0.0000100 epoch_Time:23736.0min: [2024-01-06 12:49:48,699][model8_pretrain.py][INFO] Epoch:[0/2](859400/4588595) loss:2.689 lr:0.0000100 epoch_Time:23736.0min: [2024-01-06 12:49:48,699][model8_pretrain.py][INFO] Epoch:[0/2](859400/4588595) loss:2.814 lr:0.0000100 epoch_Time:23736.0min: [2024-01-06 12:49:48,699][model8_pretrain.py][INFO] Epoch:[0/2](859400/4588595) loss:2.882 lr:0.0000100 epoch_Time:23736.0min: [2024-01-06 12:49:48,699][model8_pretrain.py][INFO] Epoch:[0/2](859400/4588595) loss:2.794 lr:0.0000100 epoch_Time:23736.0min: [2024-01-06 12:49:48,700][model8_pretrain.py][INFO] Epoch:[0/2](859400/4588595) loss:3.210 lr:0.0000100 epoch_Time:23736.0min: [2024-01-06 12:50:25,649][model8_pretrain.py][INFO] Epoch:[0/2](859500/4588595) loss:2.660 lr:0.0000100 epoch_Time:23735.0min: [2024-01-06 12:50:25,649][model8_pretrain.py][INFO] Epoch:[0/2](859500/4588595) loss:2.998 lr:0.0000100 epoch_Time:23735.0min: [2024-01-06 12:50:25,649][model8_pretrain.py][INFO] Epoch:[0/2](859500/4588595) loss:3.065 lr:0.0000100 epoch_Time:23735.0min: [2024-01-06 12:50:25,649][model8_pretrain.py][INFO] Epoch:[0/2](859500/4588595) loss:2.057 lr:0.0000100 epoch_Time:23735.0min: [2024-01-06 
12:50:25,649][model8_pretrain.py][INFO] Epoch:[0/2](859500/4588595) loss:2.789 lr:0.0000100 epoch_Time:23735.0min: [2024-01-06 12:50:25,649][model8_pretrain.py][INFO] Epoch:[0/2](859500/4588595) loss:2.245 lr:0.0000100 epoch_Time:23735.0min: [2024-01-06 12:50:25,649][model8_pretrain.py][INFO] Epoch:[0/2](859500/4588595) loss:2.585 lr:0.0000100 epoch_Time:23735.0min: [2024-01-06 12:50:25,650][model8_pretrain.py][INFO] Epoch:[0/2](859500/4588595) loss:2.166 lr:0.0000100 epoch_Time:23735.0min: [2024-01-06 12:51:02,588][model8_pretrain.py][INFO] Epoch:[0/2](859600/4588595) loss:2.873 lr:0.0000100 epoch_Time:23734.0min: [2024-01-06 12:51:02,588][model8_pretrain.py][INFO] Epoch:[0/2](859600/4588595) loss:2.985 lr:0.0000100 epoch_Time:23734.0min: [2024-01-06 12:51:02,588][model8_pretrain.py][INFO] Epoch:[0/2](859600/4588595) loss:3.196 lr:0.0000100 epoch_Time:23734.0min: [2024-01-06 12:51:02,588][model8_pretrain.py][INFO] Epoch:[0/2](859600/4588595) loss:2.934 lr:0.0000100 epoch_Time:23734.0min: [2024-01-06 12:51:02,588][model8_pretrain.py][INFO] Epoch:[0/2](859600/4588595) loss:2.471 lr:0.0000100 epoch_Time:23734.0min: [2024-01-06 12:51:02,589][model8_pretrain.py][INFO] Epoch:[0/2](859600/4588595) loss:2.736 lr:0.0000100 epoch_Time:23734.0min: [2024-01-06 12:51:02,589][model8_pretrain.py][INFO] Epoch:[0/2](859600/4588595) loss:2.842 lr:0.0000100 epoch_Time:23734.0min: [2024-01-06 12:51:02,589][model8_pretrain.py][INFO] Epoch:[0/2](859600/4588595) loss:3.031 lr:0.0000100 epoch_Time:23734.0min: [2024-01-06 12:51:39,535][model8_pretrain.py][INFO] Epoch:[0/2](859700/4588595) loss:2.859 lr:0.0000100 epoch_Time:23734.0min: [2024-01-06 12:51:39,535][model8_pretrain.py][INFO] Epoch:[0/2](859700/4588595) loss:2.738 lr:0.0000100 epoch_Time:23734.0min: [2024-01-06 12:51:39,535][model8_pretrain.py][INFO] Epoch:[0/2](859700/4588595) loss:2.964 lr:0.0000100 epoch_Time:23734.0min: [2024-01-06 12:51:39,535][model8_pretrain.py][INFO] Epoch:[0/2](859700/4588595) loss:3.001 lr:0.0000100 
epoch_Time:23734.0min: [2024-01-06 12:51:39,535][model8_pretrain.py][INFO] Epoch:[0/2](859700/4588595) loss:2.851 lr:0.0000100 epoch_Time:23734.0min: [2024-01-06 12:51:39,535][model8_pretrain.py][INFO] Epoch:[0/2](859700/4588595) loss:3.357 lr:0.0000100 epoch_Time:23734.0min: [2024-01-06 12:51:39,535][model8_pretrain.py][INFO] Epoch:[0/2](859700/4588595) loss:3.043 lr:0.0000100 epoch_Time:23734.0min: [2024-01-06 12:51:39,535][model8_pretrain.py][INFO] Epoch:[0/2](859700/4588595) loss:3.068 lr:0.0000100 epoch_Time:23734.0min: [2024-01-06 12:52:17,904][model8_pretrain.py][INFO] Epoch:[0/2](859800/4588595) loss:2.814 lr:0.0000100 epoch_Time:23733.0min: [2024-01-06 12:52:17,904][model8_pretrain.py][INFO] Epoch:[0/2](859800/4588595) loss:2.834 lr:0.0000100 epoch_Time:23733.0min: [2024-01-06 12:52:17,904][model8_pretrain.py][INFO] Epoch:[0/2](859800/4588595) loss:2.660 lr:0.0000100 epoch_Time:23733.0min: [2024-01-06 12:52:17,905][model8_pretrain.py][INFO] Epoch:[0/2](859800/4588595) loss:2.717 lr:0.0000100 epoch_Time:23733.0min: [2024-01-06 12:52:17,905][model8_pretrain.py][INFO] Epoch:[0/2](859800/4588595) loss:2.656 lr:0.0000100 epoch_Time:23733.0min: [2024-01-06 12:52:17,905][model8_pretrain.py][INFO] Epoch:[0/2](859800/4588595) loss:3.043 lr:0.0000100 epoch_Time:23733.0min: [2024-01-06 12:52:17,905][model8_pretrain.py][INFO] Epoch:[0/2](859800/4588595) loss:3.036 lr:0.0000100 epoch_Time:23733.0min: [2024-01-06 12:52:17,905][model8_pretrain.py][INFO] Epoch:[0/2](859800/4588595) loss:2.430 lr:0.0000100 epoch_Time:23733.0min: [2024-01-06 12:53:03,411][model8_pretrain.py][INFO] Epoch:[0/2](859900/4588595) loss:2.721 lr:0.0000100 epoch_Time:23733.0min: [2024-01-06 12:53:03,411][model8_pretrain.py][INFO] Epoch:[0/2](859900/4588595) loss:2.901 lr:0.0000100 epoch_Time:23733.0min: [2024-01-06 12:53:03,412][model8_pretrain.py][INFO] Epoch:[0/2](859900/4588595) loss:3.107 lr:0.0000100 epoch_Time:23733.0min: [2024-01-06 12:53:03,412][model8_pretrain.py][INFO] 
Epoch:[0/2](859900/4588595) loss:2.932 lr:0.0000100 epoch_Time:23733.0min: [2024-01-06 12:53:03,412][model8_pretrain.py][INFO] Epoch:[0/2](859900/4588595) loss:3.087 lr:0.0000100 epoch_Time:23733.0min: [2024-01-06 12:53:03,412][model8_pretrain.py][INFO] Epoch:[0/2](859900/4588595) loss:3.260 lr:0.0000100 epoch_Time:23733.0min: [2024-01-06 12:53:03,412][model8_pretrain.py][INFO] Epoch:[0/2](859900/4588595) loss:3.079 lr:0.0000100 epoch_Time:23733.0min: [2024-01-06 12:53:03,412][model8_pretrain.py][INFO] Epoch:[0/2](859900/4588595) loss:2.707 lr:0.0000100 epoch_Time:23733.0min: [2024-01-06 12:53:40,340][model8_pretrain.py][INFO] Epoch:[0/2](860000/4588595) loss:2.786 lr:0.0000100 epoch_Time:23733.0min: [2024-01-06 12:53:40,340][model8_pretrain.py][INFO] Epoch:[0/2](860000/4588595) loss:2.667 lr:0.0000100 epoch_Time:23733.0min: [2024-01-06 12:53:40,340][model8_pretrain.py][INFO] Epoch:[0/2](860000/4588595) loss:2.833 lr:0.0000100 epoch_Time:23733.0min: [2024-01-06 12:53:40,340][model8_pretrain.py][INFO] Epoch:[0/2](860000/4588595) loss:2.999 lr:0.0000100 epoch_Time:23733.0min: [2024-01-06 12:53:40,340][model8_pretrain.py][INFO] Epoch:[0/2](860000/4588595) loss:3.103 lr:0.0000100 epoch_Time:23733.0min: [2024-01-06 12:53:40,340][model8_pretrain.py][INFO] Epoch:[0/2](860000/4588595) loss:3.058 lr:0.0000100 epoch_Time:23733.0min: [2024-01-06 12:53:40,340][model8_pretrain.py][INFO] Epoch:[0/2](860000/4588595) loss:2.221 lr:0.0000100 epoch_Time:23733.0min: [2024-01-06 12:53:40,340][model8_pretrain.py][INFO] Epoch:[0/2](860000/4588595) loss:2.872 lr:0.0000100 epoch_Time:23733.0min: [2024-01-06 12:54:17,280][model8_pretrain.py][INFO] Epoch:[0/2](860100/4588595) loss:2.492 lr:0.0000100 epoch_Time:23732.0min: [2024-01-06 12:54:17,280][model8_pretrain.py][INFO] Epoch:[0/2](860100/4588595) loss:3.047 lr:0.0000100 epoch_Time:23732.0min: [2024-01-06 12:54:17,280][model8_pretrain.py][INFO] Epoch:[0/2](860100/4588595) loss:3.022 lr:0.0000100 epoch_Time:23732.0min: [2024-01-06 
12:54:17,280][model8_pretrain.py][INFO] Epoch:[0/2](860100/4588595) loss:2.970 lr:0.0000100 epoch_Time:23732.0min: [2024-01-06 12:54:17,280][model8_pretrain.py][INFO] Epoch:[0/2](860100/4588595) loss:2.545 lr:0.0000100 epoch_Time:23732.0min: [2024-01-06 12:54:17,280][model8_pretrain.py][INFO] Epoch:[0/2](860100/4588595) loss:2.631 lr:0.0000100 epoch_Time:23732.0min: [2024-01-06 12:54:17,280][model8_pretrain.py][INFO] Epoch:[0/2](860100/4588595) loss:3.224 lr:0.0000100 epoch_Time:23732.0min: [2024-01-06 12:54:17,280][model8_pretrain.py][INFO] Epoch:[0/2](860100/4588595) loss:2.633 lr:0.0000100 epoch_Time:23732.0min: [2024-01-06 12:54:54,229][model8_pretrain.py][INFO] Epoch:[0/2](860200/4588595) loss:2.571 lr:0.0000100 epoch_Time:23731.0min: [2024-01-06 12:54:54,229][model8_pretrain.py][INFO] Epoch:[0/2](860200/4588595) loss:3.079 lr:0.0000100 epoch_Time:23731.0min: [2024-01-06 12:54:54,229][model8_pretrain.py][INFO] Epoch:[0/2](860200/4588595) loss:3.093 lr:0.0000100 epoch_Time:23731.0min: [2024-01-06 12:54:54,229][model8_pretrain.py][INFO] Epoch:[0/2](860200/4588595) loss:2.748 lr:0.0000100 epoch_Time:23731.0min: [2024-01-06 12:54:54,229][model8_pretrain.py][INFO] Epoch:[0/2](860200/4588595) loss:2.319 lr:0.0000100 epoch_Time:23731.0min: [2024-01-06 12:54:54,229][model8_pretrain.py][INFO] Epoch:[0/2](860200/4588595) loss:2.699 lr:0.0000100 epoch_Time:23731.0min: [2024-01-06 12:54:54,229][model8_pretrain.py][INFO] Epoch:[0/2](860200/4588595) loss:3.224 lr:0.0000100 epoch_Time:23731.0min: [2024-01-06 12:54:54,229][model8_pretrain.py][INFO] Epoch:[0/2](860200/4588595) loss:2.662 lr:0.0000100 epoch_Time:23731.0min: [2024-01-06 12:55:31,180][model8_pretrain.py][INFO] Epoch:[0/2](860300/4588595) loss:2.538 lr:0.0000100 epoch_Time:23730.0min: [2024-01-06 12:55:31,180][model8_pretrain.py][INFO] Epoch:[0/2](860300/4588595) loss:3.303 lr:0.0000100 epoch_Time:23730.0min: [2024-01-06 12:55:31,180][model8_pretrain.py][INFO] Epoch:[0/2](860300/4588595) loss:3.043 lr:0.0000100 
epoch_Time:23730.0min: [2024-01-06 12:55:31,180][model8_pretrain.py][INFO] Epoch:[0/2](860300/4588595) loss:2.773 lr:0.0000100 epoch_Time:23730.0min: [2024-01-06 12:55:31,181][model8_pretrain.py][INFO] Epoch:[0/2](860300/4588595) loss:2.195 lr:0.0000100 epoch_Time:23730.0min: [2024-01-06 12:55:31,181][model8_pretrain.py][INFO] Epoch:[0/2](860300/4588595) loss:2.368 lr:0.0000100 epoch_Time:23730.0min: [2024-01-06 12:55:31,181][model8_pretrain.py][INFO] Epoch:[0/2](860300/4588595) loss:3.027 lr:0.0000100 epoch_Time:23730.0min: [2024-01-06 12:55:31,181][model8_pretrain.py][INFO] Epoch:[0/2](860300/4588595) loss:2.609 lr:0.0000100 epoch_Time:23730.0min: [2024-01-06 12:56:08,139][model8_pretrain.py][INFO] Epoch:[0/2](860400/4588595) loss:2.970 lr:0.0000100 epoch_Time:23729.0min: [2024-01-06 12:56:08,139][model8_pretrain.py][INFO] Epoch:[0/2](860400/4588595) loss:3.169 lr:0.0000100 epoch_Time:23729.0min: [2024-01-06 12:56:08,139][model8_pretrain.py][INFO] Epoch:[0/2](860400/4588595) loss:2.779 lr:0.0000100 epoch_Time:23729.0min: [2024-01-06 12:56:08,139][model8_pretrain.py][INFO] Epoch:[0/2](860400/4588595) loss:2.539 lr:0.0000100 epoch_Time:23729.0min: [2024-01-06 12:56:08,139][model8_pretrain.py][INFO] Epoch:[0/2](860400/4588595) loss:2.731 lr:0.0000100 epoch_Time:23729.0min: [2024-01-06 12:56:08,139][model8_pretrain.py][INFO] Epoch:[0/2](860400/4588595) loss:2.773 lr:0.0000100 epoch_Time:23729.0min: [2024-01-06 12:56:08,141][model8_pretrain.py][INFO] Epoch:[0/2](860400/4588595) loss:2.571 lr:0.0000100 epoch_Time:23729.0min: [2024-01-06 12:56:08,141][model8_pretrain.py][INFO] Epoch:[0/2](860400/4588595) loss:2.527 lr:0.0000100 epoch_Time:23729.0min: [2024-01-06 12:56:45,094][model8_pretrain.py][INFO] Epoch:[0/2](860500/4588595) loss:2.540 lr:0.0000100 epoch_Time:23729.0min: [2024-01-06 12:56:45,094][model8_pretrain.py][INFO] Epoch:[0/2](860500/4588595) loss:2.406 lr:0.0000100 epoch_Time:23729.0min: [2024-01-06 12:56:45,094][model8_pretrain.py][INFO] 
Epoch:[0/2](860500/4588595) loss:3.252 lr:0.0000100 epoch_Time:23729.0min: [2024-01-06 12:56:45,094][model8_pretrain.py][INFO] Epoch:[0/2](860500/4588595) loss:3.075 lr:0.0000100 epoch_Time:23729.0min: [2024-01-06 12:56:45,094][model8_pretrain.py][INFO] Epoch:[0/2](860500/4588595) loss:3.078 lr:0.0000100 epoch_Time:23729.0min: [2024-01-06 12:56:45,094][model8_pretrain.py][INFO] Epoch:[0/2](860500/4588595) loss:3.084 lr:0.0000100 epoch_Time:23729.0min: [2024-01-06 12:56:45,094][model8_pretrain.py][INFO] Epoch:[0/2](860500/4588595) loss:2.500 lr:0.0000100 epoch_Time:23729.0min: [2024-01-06 12:56:45,094][model8_pretrain.py][INFO] Epoch:[0/2](860500/4588595) loss:1.656 lr:0.0000100 epoch_Time:23729.0min: [2024-01-06 12:57:23,471][model8_pretrain.py][INFO] Epoch:[0/2](860600/4588595) loss:2.443 lr:0.0000100 epoch_Time:23728.0min: [2024-01-06 12:57:23,471][model8_pretrain.py][INFO] Epoch:[0/2](860600/4588595) loss:2.541 lr:0.0000100 epoch_Time:23728.0min: [2024-01-06 12:57:23,471][model8_pretrain.py][INFO] Epoch:[0/2](860600/4588595) loss:2.811 lr:0.0000100 epoch_Time:23728.0min: [2024-01-06 12:57:23,471][model8_pretrain.py][INFO] Epoch:[0/2](860600/4588595) loss:2.971 lr:0.0000100 epoch_Time:23728.0min: [2024-01-06 12:57:23,471][model8_pretrain.py][INFO] Epoch:[0/2](860600/4588595) loss:2.642 lr:0.0000100 epoch_Time:23728.0min: [2024-01-06 12:57:23,471][model8_pretrain.py][INFO] Epoch:[0/2](860600/4588595) loss:2.670 lr:0.0000100 epoch_Time:23728.0min: [2024-01-06 12:57:23,471][model8_pretrain.py][INFO] Epoch:[0/2](860600/4588595) loss:3.178 lr:0.0000100 epoch_Time:23728.0min: [2024-01-06 12:57:23,471][model8_pretrain.py][INFO] Epoch:[0/2](860600/4588595) loss:2.880 lr:0.0000100 epoch_Time:23728.0min: [2024-01-06 12:58:09,207][model8_pretrain.py][INFO] Epoch:[0/2](860700/4588595) loss:2.637 lr:0.0000100 epoch_Time:23728.0min: [2024-01-06 12:58:09,207][model8_pretrain.py][INFO] Epoch:[0/2](860700/4588595) loss:2.933 lr:0.0000100 epoch_Time:23728.0min: [2024-01-06 
12:58:09,207][model8_pretrain.py][INFO] Epoch:[0/2](860700/4588595) loss:2.843 lr:0.0000100 epoch_Time:23728.0min: [2024-01-06 12:58:09,207][model8_pretrain.py][INFO] Epoch:[0/2](860700/4588595) loss:3.055 lr:0.0000100 epoch_Time:23728.0min: [2024-01-06 12:58:09,207][model8_pretrain.py][INFO] Epoch:[0/2](860700/4588595) loss:3.384 lr:0.0000100 epoch_Time:23728.0min: [2024-01-06 12:58:09,207][model8_pretrain.py][INFO] Epoch:[0/2](860700/4588595) loss:2.813 lr:0.0000100 epoch_Time:23728.0min: [2024-01-06 12:58:09,207][model8_pretrain.py][INFO] Epoch:[0/2](860700/4588595) loss:2.786 lr:0.0000100 epoch_Time:23728.0min: [2024-01-06 12:58:09,207][model8_pretrain.py][INFO] Epoch:[0/2](860700/4588595) loss:2.706 lr:0.0000100 epoch_Time:23728.0min: [2024-01-06 12:58:46,142][model8_pretrain.py][INFO] Epoch:[0/2](860800/4588595) loss:2.663 lr:0.0000100 epoch_Time:23728.0min: [2024-01-06 12:58:46,142][model8_pretrain.py][INFO] Epoch:[0/2](860800/4588595) loss:2.971 lr:0.0000100 epoch_Time:23728.0min: [2024-01-06 12:58:46,142][model8_pretrain.py][INFO] Epoch:[0/2](860800/4588595) loss:3.069 lr:0.0000100 epoch_Time:23728.0min: [2024-01-06 12:58:46,142][model8_pretrain.py][INFO] Epoch:[0/2](860800/4588595) loss:2.292 lr:0.0000100 epoch_Time:23728.0min: [2024-01-06 12:58:46,142][model8_pretrain.py][INFO] Epoch:[0/2](860800/4588595) loss:2.490 lr:0.0000100 epoch_Time:23728.0min: [2024-01-06 12:58:46,142][model8_pretrain.py][INFO] Epoch:[0/2](860800/4588595) loss:3.081 lr:0.0000100 epoch_Time:23728.0min: [2024-01-06 12:58:46,142][model8_pretrain.py][INFO] Epoch:[0/2](860800/4588595) loss:2.738 lr:0.0000100 epoch_Time:23728.0min: [2024-01-06 12:58:46,142][model8_pretrain.py][INFO] Epoch:[0/2](860800/4588595) loss:3.114 lr:0.0000100 epoch_Time:23728.0min: [2024-01-06 12:59:23,077][model8_pretrain.py][INFO] Epoch:[0/2](860900/4588595) loss:2.794 lr:0.0000100 epoch_Time:23727.0min: [2024-01-06 12:59:23,077][model8_pretrain.py][INFO] Epoch:[0/2](860900/4588595) loss:2.908 lr:0.0000100 
epoch_Time:23727.0min: [2024-01-06 12:59:23,077][model8_pretrain.py][INFO] Epoch:[0/2](860900/4588595) loss:2.160 lr:0.0000100 epoch_Time:23727.0min: [2024-01-06 12:59:23,077][model8_pretrain.py][INFO] Epoch:[0/2](860900/4588595) loss:2.685 lr:0.0000100 epoch_Time:23727.0min: [2024-01-06 12:59:23,077][model8_pretrain.py][INFO] Epoch:[0/2](860900/4588595) loss:2.952 lr:0.0000100 epoch_Time:23727.0min: [2024-01-06 12:59:23,077][model8_pretrain.py][INFO] Epoch:[0/2](860900/4588595) loss:2.735 lr:0.0000100 epoch_Time:23727.0min: [2024-01-06 12:59:23,077][model8_pretrain.py][INFO] Epoch:[0/2](860900/4588595) loss:2.449 lr:0.0000100 epoch_Time:23727.0min: [2024-01-06 12:59:23,078][model8_pretrain.py][INFO] Epoch:[0/2](860900/4588595) loss:3.013 lr:0.0000100 epoch_Time:23727.0min: [2024-01-06 13:00:00,024][model8_pretrain.py][INFO] Epoch:[0/2](861000/4588595) loss:2.608 lr:0.0000100 epoch_Time:23726.0min: [2024-01-06 13:00:00,024][model8_pretrain.py][INFO] Epoch:[0/2](861000/4588595) loss:2.635 lr:0.0000100 epoch_Time:23726.0min: [2024-01-06 13:00:00,024][model8_pretrain.py][INFO] Epoch:[0/2](861000/4588595) loss:2.669 lr:0.0000100 epoch_Time:23726.0min: [2024-01-06 13:00:00,024][model8_pretrain.py][INFO] Epoch:[0/2](861000/4588595) loss:2.078 lr:0.0000100 epoch_Time:23726.0min: [2024-01-06 13:00:00,024][model8_pretrain.py][INFO] Epoch:[0/2](861000/4588595) loss:2.631 lr:0.0000100 epoch_Time:23726.0min: [2024-01-06 13:00:00,024][model8_pretrain.py][INFO] Epoch:[0/2](861000/4588595) loss:3.392 lr:0.0000100 epoch_Time:23726.0min: [2024-01-06 13:00:00,024][model8_pretrain.py][INFO] Epoch:[0/2](861000/4588595) loss:3.280 lr:0.0000100 epoch_Time:23726.0min: [2024-01-06 13:00:00,024][model8_pretrain.py][INFO] Epoch:[0/2](861000/4588595) loss:3.037 lr:0.0000100 epoch_Time:23726.0min: [2024-01-06 13:00:36,969][model8_pretrain.py][INFO] Epoch:[0/2](861100/4588595) loss:2.197 lr:0.0000100 epoch_Time:23725.0min: [2024-01-06 13:00:36,969][model8_pretrain.py][INFO] 
Epoch:[0/2](861100/4588595) loss:2.674 lr:0.0000100 epoch_Time:23725.0min: [2024-01-06 13:00:36,969][model8_pretrain.py][INFO] Epoch:[0/2](861100/4588595) loss:3.093 lr:0.0000100 epoch_Time:23725.0min: [2024-01-06 13:00:36,969][model8_pretrain.py][INFO] Epoch:[0/2](861100/4588595) loss:2.751 lr:0.0000100 epoch_Time:23725.0min: [2024-01-06 13:00:36,969][model8_pretrain.py][INFO] Epoch:[0/2](861100/4588595) loss:2.426 lr:0.0000100 epoch_Time:23725.0min: [2024-01-06 13:00:36,969][model8_pretrain.py][INFO] Epoch:[0/2](861100/4588595) loss:2.276 lr:0.0000100 epoch_Time:23725.0min: [2024-01-06 13:00:36,969][model8_pretrain.py][INFO] Epoch:[0/2](861100/4588595) loss:2.684 lr:0.0000100 epoch_Time:23725.0min: [2024-01-06 13:00:36,970][model8_pretrain.py][INFO] Epoch:[0/2](861100/4588595) loss:2.891 lr:0.0000100 epoch_Time:23725.0min: [2024-01-06 13:01:13,920][model8_pretrain.py][INFO] Epoch:[0/2](861200/4588595) loss:2.762 lr:0.0000100 epoch_Time:23724.0min: [2024-01-06 13:01:13,920][model8_pretrain.py][INFO] Epoch:[0/2](861200/4588595) loss:3.030 lr:0.0000100 epoch_Time:23724.0min: [2024-01-06 13:01:13,920][model8_pretrain.py][INFO] Epoch:[0/2](861200/4588595) loss:2.640 lr:0.0000100 epoch_Time:23724.0min: [2024-01-06 13:01:13,920][model8_pretrain.py][INFO] Epoch:[0/2](861200/4588595) loss:2.569 lr:0.0000100 epoch_Time:23724.0min: [2024-01-06 13:01:13,920][model8_pretrain.py][INFO] Epoch:[0/2](861200/4588595) loss:2.373 lr:0.0000100 epoch_Time:23724.0min: [2024-01-06 13:01:13,920][model8_pretrain.py][INFO] Epoch:[0/2](861200/4588595) loss:2.941 lr:0.0000100 epoch_Time:23724.0min: [2024-01-06 13:01:13,920][model8_pretrain.py][INFO] Epoch:[0/2](861200/4588595) loss:2.491 lr:0.0000100 epoch_Time:23724.0min: [2024-01-06 13:01:13,920][model8_pretrain.py][INFO] Epoch:[0/2](861200/4588595) loss:2.720 lr:0.0000100 epoch_Time:23724.0min: [2024-01-06 13:01:50,860][model8_pretrain.py][INFO] Epoch:[0/2](861300/4588595) loss:2.245 lr:0.0000100 epoch_Time:23723.0min: [2024-01-06 
13:01:50,860][model8_pretrain.py][INFO] Epoch:[0/2](861300/4588595) loss:2.907 lr:0.0000100 epoch_Time:23723.0min: [2024-01-06 13:01:50,860][model8_pretrain.py][INFO] Epoch:[0/2](861300/4588595) loss:2.820 lr:0.0000100 epoch_Time:23723.0min: [2024-01-06 13:01:50,860][model8_pretrain.py][INFO] Epoch:[0/2](861300/4588595) loss:2.924 lr:0.0000100 epoch_Time:23723.0min: [2024-01-06 13:01:50,860][model8_pretrain.py][INFO] Epoch:[0/2](861300/4588595) loss:2.670 lr:0.0000100 epoch_Time:23723.0min: [2024-01-06 13:01:50,860][model8_pretrain.py][INFO] Epoch:[0/2](861300/4588595) loss:2.901 lr:0.0000100 epoch_Time:23723.0min: [2024-01-06 13:01:50,860][model8_pretrain.py][INFO] Epoch:[0/2](861300/4588595) loss:2.927 lr:0.0000100 epoch_Time:23723.0min: [2024-01-06 13:01:50,860][model8_pretrain.py][INFO] Epoch:[0/2](861300/4588595) loss:3.159 lr:0.0000100 epoch_Time:23723.0min: [2024-01-06 13:02:29,267][model8_pretrain.py][INFO] Epoch:[0/2](861400/4588595) loss:2.888 lr:0.0000100 epoch_Time:23723.0min: [2024-01-06 13:02:29,267][model8_pretrain.py][INFO] Epoch:[0/2](861400/4588595) loss:2.757 lr:0.0000100 epoch_Time:23723.0min: [2024-01-06 13:02:29,267][model8_pretrain.py][INFO] Epoch:[0/2](861400/4588595) loss:3.009 lr:0.0000100 epoch_Time:23723.0min: [2024-01-06 13:02:29,267][model8_pretrain.py][INFO] Epoch:[0/2](861400/4588595) loss:2.791 lr:0.0000100 epoch_Time:23723.0min: [2024-01-06 13:02:29,267][model8_pretrain.py][INFO] Epoch:[0/2](861400/4588595) loss:2.796 lr:0.0000100 epoch_Time:23723.0min: [2024-01-06 13:02:29,267][model8_pretrain.py][INFO] Epoch:[0/2](861400/4588595) loss:2.822 lr:0.0000100 epoch_Time:23723.0min: [2024-01-06 13:02:29,267][model8_pretrain.py][INFO] Epoch:[0/2](861400/4588595) loss:2.853 lr:0.0000100 epoch_Time:23723.0min: [2024-01-06 13:02:29,267][model8_pretrain.py][INFO] Epoch:[0/2](861400/4588595) loss:3.033 lr:0.0000100 epoch_Time:23723.0min: [2024-01-06 13:03:14,938][model8_pretrain.py][INFO] Epoch:[0/2](861500/4588595) loss:2.636 lr:0.0000100 
epoch_Time:23723.0min: [2024-01-06 13:03:14,938][model8_pretrain.py][INFO] Epoch:[0/2](861500/4588595) loss:2.819 lr:0.0000100 epoch_Time:23723.0min: [2024-01-06 13:03:14,938][model8_pretrain.py][INFO] Epoch:[0/2](861500/4588595) loss:3.042 lr:0.0000100 epoch_Time:23723.0min: [2024-01-06 13:03:14,938][model8_pretrain.py][INFO] Epoch:[0/2](861500/4588595) loss:2.945 lr:0.0000100 epoch_Time:23723.0min: [2024-01-06 13:03:14,938][model8_pretrain.py][INFO] Epoch:[0/2](861500/4588595) loss:2.419 lr:0.0000100 epoch_Time:23723.0min: [2024-01-06 13:03:14,938][model8_pretrain.py][INFO] Epoch:[0/2](861500/4588595) loss:2.943 lr:0.0000100 epoch_Time:23723.0min: [2024-01-06 13:03:14,938][model8_pretrain.py][INFO] Epoch:[0/2](861500/4588595) loss:2.637 lr:0.0000100 epoch_Time:23723.0min: [2024-01-06 13:03:14,938][model8_pretrain.py][INFO] Epoch:[0/2](861500/4588595) loss:3.046 lr:0.0000100 epoch_Time:23723.0min: [2024-01-06 13:03:51,878][model8_pretrain.py][INFO] Epoch:[0/2](861600/4588595) loss:2.740 lr:0.0000100 epoch_Time:23722.0min: [2024-01-06 13:03:51,878][model8_pretrain.py][INFO] Epoch:[0/2](861600/4588595) loss:3.061 lr:0.0000100 epoch_Time:23722.0min: [2024-01-06 13:03:51,878][model8_pretrain.py][INFO] Epoch:[0/2](861600/4588595) loss:3.220 lr:0.0000100 epoch_Time:23722.0min: [2024-01-06 13:03:51,878][model8_pretrain.py][INFO] Epoch:[0/2](861600/4588595) loss:2.534 lr:0.0000100 epoch_Time:23722.0min: [2024-01-06 13:03:51,878][model8_pretrain.py][INFO] Epoch:[0/2](861600/4588595) loss:3.057 lr:0.0000100 epoch_Time:23722.0min: [2024-01-06 13:03:51,878][model8_pretrain.py][INFO] Epoch:[0/2](861600/4588595) loss:2.370 lr:0.0000100 epoch_Time:23722.0min: [2024-01-06 13:03:51,878][model8_pretrain.py][INFO] Epoch:[0/2](861600/4588595) loss:2.623 lr:0.0000100 epoch_Time:23722.0min: [2024-01-06 13:03:51,879][model8_pretrain.py][INFO] Epoch:[0/2](861600/4588595) loss:2.963 lr:0.0000100 epoch_Time:23722.0min: [2024-01-06 13:04:28,818][model8_pretrain.py][INFO] 
Epoch:[0/2](861700/4588595) loss:3.132 lr:0.0000100 epoch_Time:23722.0min: [2024-01-06 13:04:28,818][model8_pretrain.py][INFO] Epoch:[0/2](861700/4588595) loss:2.958 lr:0.0000100 epoch_Time:23722.0min: [2024-01-06 13:04:28,818][model8_pretrain.py][INFO] Epoch:[0/2](861700/4588595) loss:3.335 lr:0.0000100 epoch_Time:23722.0min: [2024-01-06 13:04:28,818][model8_pretrain.py][INFO] Epoch:[0/2](861700/4588595) loss:2.947 lr:0.0000100 epoch_Time:23722.0min: [2024-01-06 13:04:28,818][model8_pretrain.py][INFO] Epoch:[0/2](861700/4588595) loss:2.668 lr:0.0000100 epoch_Time:23722.0min: [2024-01-06 13:04:28,818][model8_pretrain.py][INFO] Epoch:[0/2](861700/4588595) loss:2.933 lr:0.0000100 epoch_Time:23722.0min: [2024-01-06 13:04:28,818][model8_pretrain.py][INFO] Epoch:[0/2](861700/4588595) loss:2.628 lr:0.0000100 epoch_Time:23722.0min: [2024-01-06 13:04:28,818][model8_pretrain.py][INFO] Epoch:[0/2](861700/4588595) loss:2.550 lr:0.0000100 epoch_Time:23722.0min: [2024-01-06 13:05:05,756][model8_pretrain.py][INFO] Epoch:[0/2](861800/4588595) loss:3.124 lr:0.0000100 epoch_Time:23721.0min: [2024-01-06 13:05:05,756][model8_pretrain.py][INFO] Epoch:[0/2](861800/4588595) loss:2.417 lr:0.0000100 epoch_Time:23721.0min: [2024-01-06 13:05:05,756][model8_pretrain.py][INFO] Epoch:[0/2](861800/4588595) loss:2.725 lr:0.0000100 epoch_Time:23721.0min: [2024-01-06 13:05:05,756][model8_pretrain.py][INFO] Epoch:[0/2](861800/4588595) loss:2.746 lr:0.0000100 epoch_Time:23721.0min: [2024-01-06 13:05:05,756][model8_pretrain.py][INFO] Epoch:[0/2](861800/4588595) loss:2.923 lr:0.0000100 epoch_Time:23721.0min: [2024-01-06 13:05:05,756][model8_pretrain.py][INFO] Epoch:[0/2](861800/4588595) loss:2.720 lr:0.0000100 epoch_Time:23721.0min: [2024-01-06 13:05:05,756][model8_pretrain.py][INFO] Epoch:[0/2](861800/4588595) loss:3.042 lr:0.0000100 epoch_Time:23721.0min: [2024-01-06 13:05:05,756][model8_pretrain.py][INFO] Epoch:[0/2](861800/4588595) loss:2.665 lr:0.0000100 epoch_Time:23721.0min: [2024-01-06 
13:05:42,707][model8_pretrain.py][INFO] Epoch:[0/2](861900/4588595) loss:2.540 lr:0.0000100 epoch_Time:23720.0min: [2024-01-06 13:05:42,707][model8_pretrain.py][INFO] Epoch:[0/2](861900/4588595) loss:2.443 lr:0.0000100 epoch_Time:23720.0min: [2024-01-06 13:05:42,707][model8_pretrain.py][INFO] Epoch:[0/2](861900/4588595) loss:3.196 lr:0.0000100 epoch_Time:23720.0min: [2024-01-06 13:05:42,707][model8_pretrain.py][INFO] Epoch:[0/2](861900/4588595) loss:2.616 lr:0.0000100 epoch_Time:23720.0min: [2024-01-06 13:05:42,707][model8_pretrain.py][INFO] Epoch:[0/2](861900/4588595) loss:2.953 lr:0.0000100 epoch_Time:23720.0min: [2024-01-06 13:05:42,707][model8_pretrain.py][INFO] Epoch:[0/2](861900/4588595) loss:2.977 lr:0.0000100 epoch_Time:23720.0min: [2024-01-06 13:05:42,707][model8_pretrain.py][INFO] Epoch:[0/2](861900/4588595) loss:3.226 lr:0.0000100 epoch_Time:23720.0min: [2024-01-06 13:05:42,708][model8_pretrain.py][INFO] Epoch:[0/2](861900/4588595) loss:3.003 lr:0.0000100 epoch_Time:23720.0min: [2024-01-06 13:06:19,667][model8_pretrain.py][INFO] Epoch:[0/2](862000/4588595) loss:2.589 lr:0.0000100 epoch_Time:23719.0min: [2024-01-06 13:06:19,667][model8_pretrain.py][INFO] Epoch:[0/2](862000/4588595) loss:2.549 lr:0.0000100 epoch_Time:23719.0min: [2024-01-06 13:06:19,667][model8_pretrain.py][INFO] Epoch:[0/2](862000/4588595) loss:2.834 lr:0.0000100 epoch_Time:23719.0min: [2024-01-06 13:06:19,667][model8_pretrain.py][INFO] Epoch:[0/2](862000/4588595) loss:3.174 lr:0.0000100 epoch_Time:23719.0min: [2024-01-06 13:06:19,667][model8_pretrain.py][INFO] Epoch:[0/2](862000/4588595) loss:2.409 lr:0.0000100 epoch_Time:23719.0min: [2024-01-06 13:06:19,667][model8_pretrain.py][INFO] Epoch:[0/2](862000/4588595) loss:2.835 lr:0.0000100 epoch_Time:23719.0min: [2024-01-06 13:06:19,667][model8_pretrain.py][INFO] Epoch:[0/2](862000/4588595) loss:2.644 lr:0.0000100 epoch_Time:23719.0min: [2024-01-06 13:06:19,667][model8_pretrain.py][INFO] Epoch:[0/2](862000/4588595) loss:3.097 lr:0.0000100 
epoch_Time:23719.0min: [2024-01-06 13:06:56,606][model8_pretrain.py][INFO] Epoch:[0/2](862100/4588595) loss:3.284 lr:0.0000100 epoch_Time:23718.0min: [2024-01-06 13:06:56,606][model8_pretrain.py][INFO] Epoch:[0/2](862100/4588595) loss:2.904 lr:0.0000100 epoch_Time:23718.0min: [2024-01-06 13:06:56,606][model8_pretrain.py][INFO] Epoch:[0/2](862100/4588595) loss:2.687 lr:0.0000100 epoch_Time:23718.0min: [2024-01-06 13:06:56,606][model8_pretrain.py][INFO] Epoch:[0/2](862100/4588595) loss:2.761 lr:0.0000100 epoch_Time:23718.0min: [2024-01-06 13:06:56,606][model8_pretrain.py][INFO] Epoch:[0/2](862100/4588595) loss:2.886 lr:0.0000100 epoch_Time:23718.0min: [2024-01-06 13:06:56,606][model8_pretrain.py][INFO] Epoch:[0/2](862100/4588595) loss:2.512 lr:0.0000100 epoch_Time:23718.0min: [2024-01-06 13:06:56,606][model8_pretrain.py][INFO] Epoch:[0/2](862100/4588595) loss:3.088 lr:0.0000100 epoch_Time:23718.0min: [2024-01-06 13:06:56,607][model8_pretrain.py][INFO] Epoch:[0/2](862100/4588595) loss:2.696 lr:0.0000100 epoch_Time:23718.0min: [2024-01-06 13:07:35,247][model8_pretrain.py][INFO] Epoch:[0/2](862200/4588595) loss:2.788 lr:0.0000100 epoch_Time:23718.0min: [2024-01-06 13:07:35,247][model8_pretrain.py][INFO] Epoch:[0/2](862200/4588595) loss:2.848 lr:0.0000100 epoch_Time:23718.0min: [2024-01-06 13:07:35,247][model8_pretrain.py][INFO] Epoch:[0/2](862200/4588595) loss:2.758 lr:0.0000100 epoch_Time:23718.0min: [2024-01-06 13:07:35,247][model8_pretrain.py][INFO] Epoch:[0/2](862200/4588595) loss:2.561 lr:0.0000100 epoch_Time:23718.0min: [2024-01-06 13:07:35,247][model8_pretrain.py][INFO] Epoch:[0/2](862200/4588595) loss:3.413 lr:0.0000100 epoch_Time:23718.0min: [2024-01-06 13:07:35,247][model8_pretrain.py][INFO] Epoch:[0/2](862200/4588595) loss:2.618 lr:0.0000100 epoch_Time:23718.0min: [2024-01-06 13:07:35,247][model8_pretrain.py][INFO] Epoch:[0/2](862200/4588595) loss:2.566 lr:0.0000100 epoch_Time:23718.0min: [2024-01-06 13:07:35,247][model8_pretrain.py][INFO] 
Epoch:[0/2](862200/4588595) loss:2.868 lr:0.0000100 epoch_Time:23718.0min: [2024-01-06 13:08:20,950][model8_pretrain.py][INFO] Epoch:[0/2](862300/4588595) loss:2.930 lr:0.0000100 epoch_Time:23718.0min: [2024-01-06 13:08:20,950][model8_pretrain.py][INFO] Epoch:[0/2](862300/4588595) loss:2.684 lr:0.0000100 epoch_Time:23718.0min: [2024-01-06 13:08:20,950][model8_pretrain.py][INFO] Epoch:[0/2](862300/4588595) loss:2.281 lr:0.0000100 epoch_Time:23718.0min: [2024-01-06 13:08:20,950][model8_pretrain.py][INFO] Epoch:[0/2](862300/4588595) loss:2.953 lr:0.0000100 epoch_Time:23718.0min: [2024-01-06 13:08:20,950][model8_pretrain.py][INFO] Epoch:[0/2](862300/4588595) loss:2.717 lr:0.0000100 epoch_Time:23718.0min: [2024-01-06 13:08:20,950][model8_pretrain.py][INFO] Epoch:[0/2](862300/4588595) loss:2.472 lr:0.0000100 epoch_Time:23718.0min: [2024-01-06 13:08:20,950][model8_pretrain.py][INFO] Epoch:[0/2](862300/4588595) loss:2.726 lr:0.0000100 epoch_Time:23718.0min: [2024-01-06 13:08:20,950][model8_pretrain.py][INFO] Epoch:[0/2](862300/4588595) loss:2.432 lr:0.0000100 epoch_Time:23718.0min: [2024-01-06 13:08:57,883][model8_pretrain.py][INFO] Epoch:[0/2](862400/4588595) loss:3.051 lr:0.0000100 epoch_Time:23717.0min: [2024-01-06 13:08:57,883][model8_pretrain.py][INFO] Epoch:[0/2](862400/4588595) loss:2.654 lr:0.0000100 epoch_Time:23717.0min: [2024-01-06 13:08:57,883][model8_pretrain.py][INFO] Epoch:[0/2](862400/4588595) loss:2.331 lr:0.0000100 epoch_Time:23717.0min: [2024-01-06 13:08:57,883][model8_pretrain.py][INFO] Epoch:[0/2](862400/4588595) loss:2.470 lr:0.0000100 epoch_Time:23717.0min: [2024-01-06 13:08:57,883][model8_pretrain.py][INFO] Epoch:[0/2](862400/4588595) loss:2.603 lr:0.0000100 epoch_Time:23717.0min: [2024-01-06 13:08:57,883][model8_pretrain.py][INFO] Epoch:[0/2](862400/4588595) loss:2.731 lr:0.0000100 epoch_Time:23717.0min: [2024-01-06 13:08:57,884][model8_pretrain.py][INFO] Epoch:[0/2](862400/4588595) loss:2.804 lr:0.0000100 epoch_Time:23717.0min: [2024-01-06 
13:08:57,884][model8_pretrain.py][INFO] Epoch:[0/2](862400/4588595) loss:3.230 lr:0.0000100 epoch_Time:23717.0min: [2024-01-06 13:09:34,813][model8_pretrain.py][INFO] Epoch:[0/2](862500/4588595) loss:2.915 lr:0.0000100 epoch_Time:23717.0min: [2024-01-06 13:09:34,813][model8_pretrain.py][INFO] Epoch:[0/2](862500/4588595) loss:2.803 lr:0.0000100 epoch_Time:23717.0min: [2024-01-06 13:09:34,813][model8_pretrain.py][INFO] Epoch:[0/2](862500/4588595) loss:2.933 lr:0.0000100 epoch_Time:23717.0min: [2024-01-06 13:09:34,813][model8_pretrain.py][INFO] Epoch:[0/2](862500/4588595) loss:2.664 lr:0.0000100 epoch_Time:23717.0min: [2024-01-06 13:09:34,813][model8_pretrain.py][INFO] Epoch:[0/2](862500/4588595) loss:3.127 lr:0.0000100 epoch_Time:23717.0min: [2024-01-06 13:09:34,814][model8_pretrain.py][INFO] Epoch:[0/2](862500/4588595) loss:2.766 lr:0.0000100 epoch_Time:23717.0min: [2024-01-06 13:09:34,814][model8_pretrain.py][INFO] Epoch:[0/2](862500/4588595) loss:3.143 lr:0.0000100 epoch_Time:23717.0min: [2024-01-06 13:09:34,814][model8_pretrain.py][INFO] Epoch:[0/2](862500/4588595) loss:3.136 lr:0.0000100 epoch_Time:23717.0min: [2024-01-06 13:10:11,750][model8_pretrain.py][INFO] Epoch:[0/2](862600/4588595) loss:3.002 lr:0.0000100 epoch_Time:23716.0min: [2024-01-06 13:10:11,750][model8_pretrain.py][INFO] Epoch:[0/2](862600/4588595) loss:2.903 lr:0.0000100 epoch_Time:23716.0min: [2024-01-06 13:10:11,750][model8_pretrain.py][INFO] Epoch:[0/2](862600/4588595) loss:3.136 lr:0.0000100 epoch_Time:23716.0min: [2024-01-06 13:10:11,750][model8_pretrain.py][INFO] Epoch:[0/2](862600/4588595) loss:2.915 lr:0.0000100 epoch_Time:23716.0min: [2024-01-06 13:10:11,750][model8_pretrain.py][INFO] Epoch:[0/2](862600/4588595) loss:2.334 lr:0.0000100 epoch_Time:23716.0min: [2024-01-06 13:10:11,750][model8_pretrain.py][INFO] Epoch:[0/2](862600/4588595) loss:2.809 lr:0.0000100 epoch_Time:23716.0min: [2024-01-06 13:10:11,750][model8_pretrain.py][INFO] Epoch:[0/2](862600/4588595) loss:2.453 lr:0.0000100 
epoch_Time:23716.0min: [2024-01-06 13:10:11,750][model8_pretrain.py][INFO] Epoch:[0/2](862600/4588595) loss:2.639 lr:0.0000100 epoch_Time:23716.0min: [2024-01-06 13:10:48,699][model8_pretrain.py][INFO] Epoch:[0/2](862700/4588595) loss:2.467 lr:0.0000100 epoch_Time:23715.0min: [2024-01-06 13:10:48,699][model8_pretrain.py][INFO] Epoch:[0/2](862700/4588595) loss:2.833 lr:0.0000100 epoch_Time:23715.0min: [2024-01-06 13:10:48,699][model8_pretrain.py][INFO] Epoch:[0/2](862700/4588595) loss:2.726 lr:0.0000100 epoch_Time:23715.0min: [2024-01-06 13:10:48,699][model8_pretrain.py][INFO] Epoch:[0/2](862700/4588595) loss:2.946 lr:0.0000100 epoch_Time:23715.0min: [2024-01-06 13:10:48,700][model8_pretrain.py][INFO] Epoch:[0/2](862700/4588595) loss:3.375 lr:0.0000100 epoch_Time:23715.0min: [2024-01-06 13:10:48,700][model8_pretrain.py][INFO] Epoch:[0/2](862700/4588595) loss:2.810 lr:0.0000100 epoch_Time:23715.0min: [2024-01-06 13:10:48,700][model8_pretrain.py][INFO] Epoch:[0/2](862700/4588595) loss:3.071 lr:0.0000100 epoch_Time:23715.0min: [2024-01-06 13:10:48,700][model8_pretrain.py][INFO] Epoch:[0/2](862700/4588595) loss:3.183 lr:0.0000100 epoch_Time:23715.0min: [2024-01-06 13:11:25,654][model8_pretrain.py][INFO] Epoch:[0/2](862800/4588595) loss:2.756 lr:0.0000100 epoch_Time:23714.0min: [2024-01-06 13:11:25,654][model8_pretrain.py][INFO] Epoch:[0/2](862800/4588595) loss:2.530 lr:0.0000100 epoch_Time:23714.0min: [2024-01-06 13:11:25,655][model8_pretrain.py][INFO] Epoch:[0/2](862800/4588595) loss:2.440 lr:0.0000100 epoch_Time:23714.0min: [2024-01-06 13:11:25,655][model8_pretrain.py][INFO] Epoch:[0/2](862800/4588595) loss:2.937 lr:0.0000100 epoch_Time:23714.0min: [2024-01-06 13:11:25,655][model8_pretrain.py][INFO] Epoch:[0/2](862800/4588595) loss:3.287 lr:0.0000100 epoch_Time:23714.0min: [2024-01-06 13:11:25,655][model8_pretrain.py][INFO] Epoch:[0/2](862800/4588595) loss:2.421 lr:0.0000100 epoch_Time:23714.0min: [2024-01-06 13:11:25,655][model8_pretrain.py][INFO] 
Epoch:[0/2](862800/4588595) loss:3.018 lr:0.0000100 epoch_Time:23714.0min: [2024-01-06 13:11:25,655][model8_pretrain.py][INFO] Epoch:[0/2](862800/4588595) loss:2.413 lr:0.0000100 epoch_Time:23714.0min: [2024-01-06 13:12:02,614][model8_pretrain.py][INFO] Epoch:[0/2](862900/4588595) loss:3.051 lr:0.0000100 epoch_Time:23713.0min: [2024-01-06 13:12:02,614][model8_pretrain.py][INFO] Epoch:[0/2](862900/4588595) loss:2.187 lr:0.0000100 epoch_Time:23713.0min: [2024-01-06 13:12:02,614][model8_pretrain.py][INFO] Epoch:[0/2](862900/4588595) loss:2.918 lr:0.0000100 epoch_Time:23713.0min: [2024-01-06 13:12:02,615][model8_pretrain.py][INFO] Epoch:[0/2](862900/4588595) loss:2.251 lr:0.0000100 epoch_Time:23713.0min: [2024-01-06 13:12:02,614][model8_pretrain.py][INFO] Epoch:[0/2](862900/4588595) loss:2.845 lr:0.0000100 epoch_Time:23713.0min: [2024-01-06 13:12:02,615][model8_pretrain.py][INFO] Epoch:[0/2](862900/4588595) loss:2.776 lr:0.0000100 epoch_Time:23713.0min: [2024-01-06 13:12:02,615][model8_pretrain.py][INFO] Epoch:[0/2](862900/4588595) loss:2.910 lr:0.0000100 epoch_Time:23713.0min: [2024-01-06 13:12:02,615][model8_pretrain.py][INFO] Epoch:[0/2](862900/4588595) loss:3.318 lr:0.0000100 epoch_Time:23713.0min: [2024-01-06 13:12:41,304][model8_pretrain.py][INFO] Epoch:[0/2](863000/4588595) loss:2.779 lr:0.0000100 epoch_Time:23713.0min: [2024-01-06 13:12:41,304][model8_pretrain.py][INFO] Epoch:[0/2](863000/4588595) loss:2.599 lr:0.0000100 epoch_Time:23713.0min: [2024-01-06 13:12:41,304][model8_pretrain.py][INFO] Epoch:[0/2](863000/4588595) loss:3.077 lr:0.0000100 epoch_Time:23713.0min: [2024-01-06 13:12:41,304][model8_pretrain.py][INFO] Epoch:[0/2](863000/4588595) loss:2.482 lr:0.0000100 epoch_Time:23713.0min: [2024-01-06 13:12:41,304][model8_pretrain.py][INFO] Epoch:[0/2](863000/4588595) loss:3.023 lr:0.0000100 epoch_Time:23713.0min: [2024-01-06 13:12:41,304][model8_pretrain.py][INFO] Epoch:[0/2](863000/4588595) loss:2.717 lr:0.0000100 epoch_Time:23713.0min: [2024-01-06 
13:12:41,304][model8_pretrain.py][INFO] Epoch:[0/2](863000/4588595) loss:2.431 lr:0.0000100 epoch_Time:23713.0min: [2024-01-06 13:12:41,304][model8_pretrain.py][INFO] Epoch:[0/2](863000/4588595) loss:3.526 lr:0.0000100 epoch_Time:23713.0min: [2024-01-06 13:13:26,991][model8_pretrain.py][INFO] Epoch:[0/2](863100/4588595) loss:2.636 lr:0.0000100 epoch_Time:23713.0min: [2024-01-06 13:13:26,991][model8_pretrain.py][INFO] Epoch:[0/2](863100/4588595) loss:2.576 lr:0.0000100 epoch_Time:23713.0min: [2024-01-06 13:13:26,992][model8_pretrain.py][INFO] Epoch:[0/2](863100/4588595) loss:2.168 lr:0.0000100 epoch_Time:23713.0min: [2024-01-06 13:13:26,991][model8_pretrain.py][INFO] Epoch:[0/2](863100/4588595) loss:3.047 lr:0.0000100 epoch_Time:23713.0min: [2024-01-06 13:13:26,992][model8_pretrain.py][INFO] Epoch:[0/2](863100/4588595) loss:2.900 lr:0.0000100 epoch_Time:23713.0min: [2024-01-06 13:13:26,992][model8_pretrain.py][INFO] Epoch:[0/2](863100/4588595) loss:2.935 lr:0.0000100 epoch_Time:23713.0min: [2024-01-06 13:13:26,992][model8_pretrain.py][INFO] Epoch:[0/2](863100/4588595) loss:2.610 lr:0.0000100 epoch_Time:23713.0min: [2024-01-06 13:13:26,993][model8_pretrain.py][INFO] Epoch:[0/2](863100/4588595) loss:2.799 lr:0.0000100 epoch_Time:23713.0min: [2024-01-06 13:14:03,930][model8_pretrain.py][INFO] Epoch:[0/2](863200/4588595) loss:2.755 lr:0.0000100 epoch_Time:23712.0min: [2024-01-06 13:14:03,930][model8_pretrain.py][INFO] Epoch:[0/2](863200/4588595) loss:2.817 lr:0.0000100 epoch_Time:23712.0min: [2024-01-06 13:14:03,930][model8_pretrain.py][INFO] Epoch:[0/2](863200/4588595) loss:3.021 lr:0.0000100 epoch_Time:23712.0min: [2024-01-06 13:14:03,930][model8_pretrain.py][INFO] Epoch:[0/2](863200/4588595) loss:2.954 lr:0.0000100 epoch_Time:23712.0min: [2024-01-06 13:14:03,930][model8_pretrain.py][INFO] Epoch:[0/2](863200/4588595) loss:2.834 lr:0.0000100 epoch_Time:23712.0min: [2024-01-06 13:14:03,930][model8_pretrain.py][INFO] Epoch:[0/2](863200/4588595) loss:2.746 lr:0.0000100 
epoch_Time:23712.0min: [2024-01-06 13:14:03,930][model8_pretrain.py][INFO] Epoch:[0/2](863200/4588595) loss:2.262 lr:0.0000100 epoch_Time:23712.0min: [2024-01-06 13:14:03,930][model8_pretrain.py][INFO] Epoch:[0/2](863200/4588595) loss:2.968 lr:0.0000100 epoch_Time:23712.0min: [2024-01-06 13:14:40,872][model8_pretrain.py][INFO] Epoch:[0/2](863300/4588595) loss:3.030 lr:0.0000100 epoch_Time:23712.0min: [2024-01-06 13:14:40,872][model8_pretrain.py][INFO] Epoch:[0/2](863300/4588595) loss:2.538 lr:0.0000100 epoch_Time:23712.0min: [2024-01-06 13:14:40,872][model8_pretrain.py][INFO] Epoch:[0/2](863300/4588595) loss:2.567 lr:0.0000100 epoch_Time:23712.0min: [2024-01-06 13:14:40,872][model8_pretrain.py][INFO] Epoch:[0/2](863300/4588595) loss:2.887 lr:0.0000100 epoch_Time:23712.0min: [2024-01-06 13:14:40,872][model8_pretrain.py][INFO] Epoch:[0/2](863300/4588595) loss:2.814 lr:0.0000100 epoch_Time:23712.0min: [2024-01-06 13:14:40,872][model8_pretrain.py][INFO] Epoch:[0/2](863300/4588595) loss:3.060 lr:0.0000100 epoch_Time:23712.0min: [2024-01-06 13:14:40,872][model8_pretrain.py][INFO] Epoch:[0/2](863300/4588595) loss:2.667 lr:0.0000100 epoch_Time:23712.0min: [2024-01-06 13:14:40,872][model8_pretrain.py][INFO] Epoch:[0/2](863300/4588595) loss:3.236 lr:0.0000100 epoch_Time:23712.0min: [2024-01-06 13:15:17,810][model8_pretrain.py][INFO] Epoch:[0/2](863400/4588595) loss:2.555 lr:0.0000100 epoch_Time:23711.0min: [2024-01-06 13:15:17,810][model8_pretrain.py][INFO] Epoch:[0/2](863400/4588595) loss:2.807 lr:0.0000100 epoch_Time:23711.0min: [2024-01-06 13:15:17,810][model8_pretrain.py][INFO] Epoch:[0/2](863400/4588595) loss:2.013 lr:0.0000100 epoch_Time:23711.0min: [2024-01-06 13:15:17,810][model8_pretrain.py][INFO] Epoch:[0/2](863400/4588595) loss:3.002 lr:0.0000100 epoch_Time:23711.0min: [2024-01-06 13:15:17,810][model8_pretrain.py][INFO] Epoch:[0/2](863400/4588595) loss:2.735 lr:0.0000100 epoch_Time:23711.0min: [2024-01-06 13:15:17,810][model8_pretrain.py][INFO] 
Epoch:[0/2](863400/4588595) loss:2.296 lr:0.0000100 epoch_Time:23711.0min: [2024-01-06 13:15:17,811][model8_pretrain.py][INFO] Epoch:[0/2](863400/4588595) loss:2.701 lr:0.0000100 epoch_Time:23711.0min: [2024-01-06 13:15:17,811][model8_pretrain.py][INFO] Epoch:[0/2](863400/4588595) loss:2.763 lr:0.0000100 epoch_Time:23711.0min: [2024-01-06 13:15:54,744][model8_pretrain.py][INFO] Epoch:[0/2](863500/4588595) loss:2.636 lr:0.0000100 epoch_Time:23710.0min: [2024-01-06 13:15:54,744][model8_pretrain.py][INFO] Epoch:[0/2](863500/4588595) loss:3.315 lr:0.0000100 epoch_Time:23710.0min: [2024-01-06 13:15:54,744][model8_pretrain.py][INFO] Epoch:[0/2](863500/4588595) loss:2.973 lr:0.0000100 epoch_Time:23710.0min: [2024-01-06 13:15:54,744][model8_pretrain.py][INFO] Epoch:[0/2](863500/4588595) loss:2.703 lr:0.0000100 epoch_Time:23710.0min: [2024-01-06 13:15:54,744][model8_pretrain.py][INFO] Epoch:[0/2](863500/4588595) loss:2.748 lr:0.0000100 epoch_Time:23710.0min: [2024-01-06 13:15:54,745][model8_pretrain.py][INFO] Epoch:[0/2](863500/4588595) loss:1.982 lr:0.0000100 epoch_Time:23710.0min: [2024-01-06 13:15:54,745][model8_pretrain.py][INFO] Epoch:[0/2](863500/4588595) loss:3.051 lr:0.0000100 epoch_Time:23710.0min: [2024-01-06 13:15:54,745][model8_pretrain.py][INFO] Epoch:[0/2](863500/4588595) loss:2.108 lr:0.0000100 epoch_Time:23710.0min: [2024-01-06 13:16:31,670][model8_pretrain.py][INFO] Epoch:[0/2](863600/4588595) loss:2.526 lr:0.0000100 epoch_Time:23709.0min: [2024-01-06 13:16:31,671][model8_pretrain.py][INFO] Epoch:[0/2](863600/4588595) loss:2.658 lr:0.0000100 epoch_Time:23709.0min: [2024-01-06 13:16:31,671][model8_pretrain.py][INFO] Epoch:[0/2](863600/4588595) loss:2.448 lr:0.0000100 epoch_Time:23709.0min: [2024-01-06 13:16:31,671][model8_pretrain.py][INFO] Epoch:[0/2](863600/4588595) loss:3.051 lr:0.0000100 epoch_Time:23709.0min: [2024-01-06 13:16:31,671][model8_pretrain.py][INFO] Epoch:[0/2](863600/4588595) loss:2.892 lr:0.0000100 epoch_Time:23709.0min: [2024-01-06 
13:16:31,671][model8_pretrain.py][INFO] Epoch:[0/2](863600/4588595) loss:2.703 lr:0.0000100 epoch_Time:23709.0min: [2024-01-06 13:16:31,671][model8_pretrain.py][INFO] Epoch:[0/2](863600/4588595) loss:2.615 lr:0.0000100 epoch_Time:23709.0min: [2024-01-06 13:16:31,671][model8_pretrain.py][INFO] Epoch:[0/2](863600/4588595) loss:3.141 lr:0.0000100 epoch_Time:23709.0min: [2024-01-06 13:17:08,609][model8_pretrain.py][INFO] Epoch:[0/2](863700/4588595) loss:2.680 lr:0.0000100 epoch_Time:23708.0min: [2024-01-06 13:17:08,609][model8_pretrain.py][INFO] Epoch:[0/2](863700/4588595) loss:2.961 lr:0.0000100 epoch_Time:23708.0min: [2024-01-06 13:17:08,609][model8_pretrain.py][INFO] Epoch:[0/2](863700/4588595) loss:2.656 lr:0.0000100 epoch_Time:23708.0min: [2024-01-06 13:17:08,609][model8_pretrain.py][INFO] Epoch:[0/2](863700/4588595) loss:2.804 lr:0.0000100 epoch_Time:23708.0min: [2024-01-06 13:17:08,609][model8_pretrain.py][INFO] Epoch:[0/2](863700/4588595) loss:3.006 lr:0.0000100 epoch_Time:23708.0min: [2024-01-06 13:17:08,609][model8_pretrain.py][INFO] Epoch:[0/2](863700/4588595) loss:2.739 lr:0.0000100 epoch_Time:23708.0min: [2024-01-06 13:17:08,609][model8_pretrain.py][INFO] Epoch:[0/2](863700/4588595) loss:3.290 lr:0.0000100 epoch_Time:23708.0min: [2024-01-06 13:17:08,609][model8_pretrain.py][INFO] Epoch:[0/2](863700/4588595) loss:2.268 lr:0.0000100 epoch_Time:23708.0min: [2024-01-06 13:17:47,249][model8_pretrain.py][INFO] Epoch:[0/2](863800/4588595) loss:2.746 lr:0.0000100 epoch_Time:23708.0min: [2024-01-06 13:17:47,249][model8_pretrain.py][INFO] Epoch:[0/2](863800/4588595) loss:3.016 lr:0.0000100 epoch_Time:23708.0min: [2024-01-06 13:17:47,249][model8_pretrain.py][INFO] Epoch:[0/2](863800/4588595) loss:2.912 lr:0.0000100 epoch_Time:23708.0min: [2024-01-06 13:17:47,249][model8_pretrain.py][INFO] Epoch:[0/2](863800/4588595) loss:2.696 lr:0.0000100 epoch_Time:23708.0min: [2024-01-06 13:17:47,249][model8_pretrain.py][INFO] Epoch:[0/2](863800/4588595) loss:3.152 lr:0.0000100 
epoch_Time:23708.0min: [2024-01-06 13:17:47,249][model8_pretrain.py][INFO] Epoch:[0/2](863800/4588595) loss:2.966 lr:0.0000100 epoch_Time:23708.0min: [2024-01-06 13:17:47,250][model8_pretrain.py][INFO] Epoch:[0/2](863800/4588595) loss:1.981 lr:0.0000100 epoch_Time:23708.0min: [2024-01-06 13:17:47,250][model8_pretrain.py][INFO] Epoch:[0/2](863800/4588595) loss:2.883 lr:0.0000100 epoch_Time:23708.0min: [2024-01-06 13:18:32,907][model8_pretrain.py][INFO] Epoch:[0/2](863900/4588595) loss:3.010 lr:0.0000100 epoch_Time:23708.0min: [2024-01-06 13:18:32,907][model8_pretrain.py][INFO] Epoch:[0/2](863900/4588595) loss:3.111 lr:0.0000100 epoch_Time:23708.0min: [2024-01-06 13:18:32,907][model8_pretrain.py][INFO] Epoch:[0/2](863900/4588595) loss:3.238 lr:0.0000100 epoch_Time:23708.0min: [2024-01-06 13:18:32,907][model8_pretrain.py][INFO] Epoch:[0/2](863900/4588595) loss:2.920 lr:0.0000100 epoch_Time:23708.0min: [2024-01-06 13:18:32,907][model8_pretrain.py][INFO] Epoch:[0/2](863900/4588595) loss:2.998 lr:0.0000100 epoch_Time:23708.0min: [2024-01-06 13:18:32,907][model8_pretrain.py][INFO] Epoch:[0/2](863900/4588595) loss:2.991 lr:0.0000100 epoch_Time:23708.0min: [2024-01-06 13:18:32,907][model8_pretrain.py][INFO] Epoch:[0/2](863900/4588595) loss:2.681 lr:0.0000100 epoch_Time:23708.0min: [2024-01-06 13:18:32,907][model8_pretrain.py][INFO] Epoch:[0/2](863900/4588595) loss:2.596 lr:0.0000100 epoch_Time:23708.0min: [2024-01-06 13:19:09,836][model8_pretrain.py][INFO] Epoch:[0/2](864000/4588595) loss:1.812 lr:0.0000100 epoch_Time:23707.0min: [2024-01-06 13:19:09,836][model8_pretrain.py][INFO] Epoch:[0/2](864000/4588595) loss:2.612 lr:0.0000100 epoch_Time:23707.0min: [2024-01-06 13:19:09,836][model8_pretrain.py][INFO] Epoch:[0/2](864000/4588595) loss:2.674 lr:0.0000100 epoch_Time:23707.0min: [2024-01-06 13:19:09,836][model8_pretrain.py][INFO] Epoch:[0/2](864000/4588595) loss:3.305 lr:0.0000100 epoch_Time:23707.0min: [2024-01-06 13:19:09,836][model8_pretrain.py][INFO] 
Epoch:[0/2](864000/4588595) loss:3.180 lr:0.0000100 epoch_Time:23707.0min: [2024-01-06 13:19:09,836][model8_pretrain.py][INFO] Epoch:[0/2](864000/4588595) loss:3.115 lr:0.0000100 epoch_Time:23707.0min: [2024-01-06 13:19:09,836][model8_pretrain.py][INFO] Epoch:[0/2](864000/4588595) loss:2.689 lr:0.0000100 epoch_Time:23707.0min: [2024-01-06 13:19:09,836][model8_pretrain.py][INFO] Epoch:[0/2](864000/4588595) loss:2.617 lr:0.0000100 epoch_Time:23707.0min: [2024-01-06 13:19:46,768][model8_pretrain.py][INFO] Epoch:[0/2](864100/4588595) loss:2.140 lr:0.0000100 epoch_Time:23707.0min: [2024-01-06 13:19:46,768][model8_pretrain.py][INFO] Epoch:[0/2](864100/4588595) loss:2.812 lr:0.0000100 epoch_Time:23707.0min: [2024-01-06 13:19:46,768][model8_pretrain.py][INFO] Epoch:[0/2](864100/4588595) loss:3.068 lr:0.0000100 epoch_Time:23707.0min: [2024-01-06 13:19:46,768][model8_pretrain.py][INFO] Epoch:[0/2](864100/4588595) loss:2.440 lr:0.0000100 epoch_Time:23707.0min: [2024-01-06 13:19:46,768][model8_pretrain.py][INFO] Epoch:[0/2](864100/4588595) loss:2.693 lr:0.0000100 epoch_Time:23707.0min: [2024-01-06 13:19:46,768][model8_pretrain.py][INFO] Epoch:[0/2](864100/4588595) loss:2.865 lr:0.0000100 epoch_Time:23707.0min: [2024-01-06 13:19:46,768][model8_pretrain.py][INFO] Epoch:[0/2](864100/4588595) loss:2.822 lr:0.0000100 epoch_Time:23707.0min: [2024-01-06 13:19:46,768][model8_pretrain.py][INFO] Epoch:[0/2](864100/4588595) loss:2.996 lr:0.0000100 epoch_Time:23707.0min: [2024-01-06 13:20:23,705][model8_pretrain.py][INFO] Epoch:[0/2](864200/4588595) loss:2.386 lr:0.0000100 epoch_Time:23706.0min: [2024-01-06 13:20:23,705][model8_pretrain.py][INFO] Epoch:[0/2](864200/4588595) loss:2.410 lr:0.0000100 epoch_Time:23706.0min: [2024-01-06 13:20:23,705][model8_pretrain.py][INFO] Epoch:[0/2](864200/4588595) loss:2.372 lr:0.0000100 epoch_Time:23706.0min: [2024-01-06 13:20:23,705][model8_pretrain.py][INFO] Epoch:[0/2](864200/4588595) loss:3.013 lr:0.0000100 epoch_Time:23706.0min: [2024-01-06 
13:20:23,705][model8_pretrain.py][INFO] Epoch:[0/2](864200/4588595) loss:3.222 lr:0.0000100 epoch_Time:23706.0min: [2024-01-06 13:20:23,705][model8_pretrain.py][INFO] Epoch:[0/2](864200/4588595) loss:2.902 lr:0.0000100 epoch_Time:23706.0min: [2024-01-06 13:20:23,705][model8_pretrain.py][INFO] Epoch:[0/2](864200/4588595) loss:2.809 lr:0.0000100 epoch_Time:23706.0min: [2024-01-06 13:20:23,706][model8_pretrain.py][INFO] Epoch:[0/2](864200/4588595) loss:2.580 lr:0.0000100 epoch_Time:23706.0min: [2024-01-06 13:21:00,650][model8_pretrain.py][INFO] Epoch:[0/2](864300/4588595) loss:2.891 lr:0.0000100 epoch_Time:23705.0min: [2024-01-06 13:21:00,650][model8_pretrain.py][INFO] Epoch:[0/2](864300/4588595) loss:3.002 lr:0.0000100 epoch_Time:23705.0min: [2024-01-06 13:21:00,650][model8_pretrain.py][INFO] Epoch:[0/2](864300/4588595) loss:2.930 lr:0.0000100 epoch_Time:23705.0min: [2024-01-06 13:21:00,651][model8_pretrain.py][INFO] Epoch:[0/2](864300/4588595) loss:3.034 lr:0.0000100 epoch_Time:23705.0min: [2024-01-06 13:21:00,651][model8_pretrain.py][INFO] Epoch:[0/2](864300/4588595) loss:2.564 lr:0.0000100 epoch_Time:23705.0min: [2024-01-06 13:21:00,651][model8_pretrain.py][INFO] Epoch:[0/2](864300/4588595) loss:2.127 lr:0.0000100 epoch_Time:23705.0min: [2024-01-06 13:21:00,651][model8_pretrain.py][INFO] Epoch:[0/2](864300/4588595) loss:2.577 lr:0.0000100 epoch_Time:23705.0min: [2024-01-06 13:21:00,652][model8_pretrain.py][INFO] Epoch:[0/2](864300/4588595) loss:2.538 lr:0.0000100 epoch_Time:23705.0min: [2024-01-06 13:21:37,587][model8_pretrain.py][INFO] Epoch:[0/2](864400/4588595) loss:2.691 lr:0.0000100 epoch_Time:23704.0min: [2024-01-06 13:21:37,587][model8_pretrain.py][INFO] Epoch:[0/2](864400/4588595) loss:2.542 lr:0.0000100 epoch_Time:23704.0min: [2024-01-06 13:21:37,587][model8_pretrain.py][INFO] Epoch:[0/2](864400/4588595) loss:2.548 lr:0.0000100 epoch_Time:23704.0min: [2024-01-06 13:21:37,587][model8_pretrain.py][INFO] Epoch:[0/2](864400/4588595) loss:2.831 lr:0.0000100 
epoch_Time:23704.0min: [2024-01-06 13:21:37,587][model8_pretrain.py][INFO] Epoch:[0/2](864400/4588595) loss:2.783 lr:0.0000100 epoch_Time:23704.0min: [2024-01-06 13:21:37,587][model8_pretrain.py][INFO] Epoch:[0/2](864400/4588595) loss:2.613 lr:0.0000100 epoch_Time:23704.0min: [2024-01-06 13:21:37,588][model8_pretrain.py][INFO] Epoch:[0/2](864400/4588595) loss:2.837 lr:0.0000100 epoch_Time:23704.0min: [2024-01-06 13:21:37,588][model8_pretrain.py][INFO] Epoch:[0/2](864400/4588595) loss:3.155 lr:0.0000100 epoch_Time:23704.0min: [2024-01-06 13:22:14,558][model8_pretrain.py][INFO] Epoch:[0/2](864500/4588595) loss:3.009 lr:0.0000100 epoch_Time:23703.0min: [2024-01-06 13:22:14,558][model8_pretrain.py][INFO] Epoch:[0/2](864500/4588595) loss:2.999 lr:0.0000100 epoch_Time:23703.0min: [2024-01-06 13:22:14,558][model8_pretrain.py][INFO] Epoch:[0/2](864500/4588595) loss:2.469 lr:0.0000100 epoch_Time:23703.0min: [2024-01-06 13:22:14,558][model8_pretrain.py][INFO] Epoch:[0/2](864500/4588595) loss:3.143 lr:0.0000100 epoch_Time:23703.0min: [2024-01-06 13:22:14,558][model8_pretrain.py][INFO] Epoch:[0/2](864500/4588595) loss:2.417 lr:0.0000100 epoch_Time:23703.0min: [2024-01-06 13:22:14,558][model8_pretrain.py][INFO] Epoch:[0/2](864500/4588595) loss:2.732 lr:0.0000100 epoch_Time:23703.0min: [2024-01-06 13:22:14,558][model8_pretrain.py][INFO] Epoch:[0/2](864500/4588595) loss:2.364 lr:0.0000100 epoch_Time:23703.0min: [2024-01-06 13:22:14,558][model8_pretrain.py][INFO] Epoch:[0/2](864500/4588595) loss:3.367 lr:0.0000100 epoch_Time:23703.0min: [2024-01-06 13:22:53,212][model8_pretrain.py][INFO] Epoch:[0/2](864600/4588595) loss:2.274 lr:0.0000100 epoch_Time:23702.0min: [2024-01-06 13:22:53,212][model8_pretrain.py][INFO] Epoch:[0/2](864600/4588595) loss:3.103 lr:0.0000100 epoch_Time:23702.0min: [2024-01-06 13:22:53,212][model8_pretrain.py][INFO] Epoch:[0/2](864600/4588595) loss:3.101 lr:0.0000100 epoch_Time:23702.0min: [2024-01-06 13:22:53,212][model8_pretrain.py][INFO] 
Epoch:[0/2](864600/4588595) loss:2.843 lr:0.0000100 epoch_Time:23702.0min: [2024-01-06 13:22:53,212][model8_pretrain.py][INFO] Epoch:[0/2](864600/4588595) loss:2.693 lr:0.0000100 epoch_Time:23702.0min: [2024-01-06 13:22:53,212][model8_pretrain.py][INFO] Epoch:[0/2](864600/4588595) loss:2.905 lr:0.0000100 epoch_Time:23702.0min: [2024-01-06 13:22:53,213][model8_pretrain.py][INFO] Epoch:[0/2](864600/4588595) loss:2.561 lr:0.0000100 epoch_Time:23702.0min: [2024-01-06 13:22:53,213][model8_pretrain.py][INFO] Epoch:[0/2](864600/4588595) loss:2.904 lr:0.0000100 epoch_Time:23702.0min: [2024-01-06 13:23:39,631][model8_pretrain.py][INFO] Epoch:[0/2](864700/4588595) loss:2.716 lr:0.0000100 epoch_Time:23703.0min: [2024-01-06 13:23:39,631][model8_pretrain.py][INFO] Epoch:[0/2](864700/4588595) loss:2.445 lr:0.0000100 epoch_Time:23703.0min: [2024-01-06 13:23:39,631][model8_pretrain.py][INFO] Epoch:[0/2](864700/4588595) loss:2.353 lr:0.0000100 epoch_Time:23703.0min: [2024-01-06 13:23:39,631][model8_pretrain.py][INFO] Epoch:[0/2](864700/4588595) loss:2.557 lr:0.0000100 epoch_Time:23703.0min: [2024-01-06 13:23:39,631][model8_pretrain.py][INFO] Epoch:[0/2](864700/4588595) loss:3.656 lr:0.0000100 epoch_Time:23703.0min: [2024-01-06 13:23:39,631][model8_pretrain.py][INFO] Epoch:[0/2](864700/4588595) loss:2.645 lr:0.0000100 epoch_Time:23703.0min: [2024-01-06 13:23:39,631][model8_pretrain.py][INFO] Epoch:[0/2](864700/4588595) loss:2.441 lr:0.0000100 epoch_Time:23703.0min: [2024-01-06 13:23:39,631][model8_pretrain.py][INFO] Epoch:[0/2](864700/4588595) loss:2.381 lr:0.0000100 epoch_Time:23703.0min: [2024-01-06 13:24:16,562][model8_pretrain.py][INFO] Epoch:[0/2](864800/4588595) loss:3.247 lr:0.0000100 epoch_Time:23702.0min: [2024-01-06 13:24:16,562][model8_pretrain.py][INFO] Epoch:[0/2](864800/4588595) loss:2.581 lr:0.0000100 epoch_Time:23702.0min: [2024-01-06 13:24:16,562][model8_pretrain.py][INFO] Epoch:[0/2](864800/4588595) loss:2.499 lr:0.0000100 epoch_Time:23702.0min: [2024-01-06 
13:24:16,562][model8_pretrain.py][INFO] Epoch:[0/2](864800/4588595) loss:3.308 lr:0.0000100 epoch_Time:23702.0min: [2024-01-06 13:24:16,562][model8_pretrain.py][INFO] Epoch:[0/2](864800/4588595) loss:2.828 lr:0.0000100 epoch_Time:23702.0min: [2024-01-06 13:24:16,562][model8_pretrain.py][INFO] Epoch:[0/2](864800/4588595) loss:2.752 lr:0.0000100 epoch_Time:23702.0min: [2024-01-06 13:24:16,562][model8_pretrain.py][INFO] Epoch:[0/2](864800/4588595) loss:3.242 lr:0.0000100 epoch_Time:23702.0min: [2024-01-06 13:24:16,562][model8_pretrain.py][INFO] Epoch:[0/2](864800/4588595) loss:2.483 lr:0.0000100 epoch_Time:23702.0min: [2024-01-06 13:24:53,463][model8_pretrain.py][INFO] Epoch:[0/2](864900/4588595) loss:3.225 lr:0.0000100 epoch_Time:23701.0min: [2024-01-06 13:24:53,463][model8_pretrain.py][INFO] Epoch:[0/2](864900/4588595) loss:2.605 lr:0.0000100 epoch_Time:23701.0min: [2024-01-06 13:24:53,463][model8_pretrain.py][INFO] Epoch:[0/2](864900/4588595) loss:3.337 lr:0.0000100 epoch_Time:23701.0min: [2024-01-06 13:24:53,463][model8_pretrain.py][INFO] Epoch:[0/2](864900/4588595) loss:3.090 lr:0.0000100 epoch_Time:23701.0min: [2024-01-06 13:24:53,463][model8_pretrain.py][INFO] Epoch:[0/2](864900/4588595) loss:3.092 lr:0.0000100 epoch_Time:23701.0min: [2024-01-06 13:24:53,463][model8_pretrain.py][INFO] Epoch:[0/2](864900/4588595) loss:2.713 lr:0.0000100 epoch_Time:23701.0min: [2024-01-06 13:24:53,463][model8_pretrain.py][INFO] Epoch:[0/2](864900/4588595) loss:2.158 lr:0.0000100 epoch_Time:23701.0min: [2024-01-06 13:24:53,463][model8_pretrain.py][INFO] Epoch:[0/2](864900/4588595) loss:2.849 lr:0.0000100 epoch_Time:23701.0min: [2024-01-06 13:25:30,401][model8_pretrain.py][INFO] Epoch:[0/2](865000/4588595) loss:3.131 lr:0.0000100 epoch_Time:23701.0min: [2024-01-06 13:25:30,401][model8_pretrain.py][INFO] Epoch:[0/2](865000/4588595) loss:3.077 lr:0.0000100 epoch_Time:23701.0min: [2024-01-06 13:25:30,401][model8_pretrain.py][INFO] Epoch:[0/2](865000/4588595) loss:2.111 lr:0.0000100 
epoch_Time:23701.0min: [2024-01-06 13:25:30,401][model8_pretrain.py][INFO] Epoch:[0/2](865000/4588595) loss:2.936 lr:0.0000100 epoch_Time:23701.0min: [2024-01-06 13:25:30,401][model8_pretrain.py][INFO] Epoch:[0/2](865000/4588595) loss:2.847 lr:0.0000100 epoch_Time:23701.0min: [2024-01-06 13:25:30,401][model8_pretrain.py][INFO] Epoch:[0/2](865000/4588595) loss:3.353 lr:0.0000100 epoch_Time:23701.0min: [2024-01-06 13:25:30,401][model8_pretrain.py][INFO] Epoch:[0/2](865000/4588595) loss:2.481 lr:0.0000100 epoch_Time:23701.0min: [2024-01-06 13:25:30,401][model8_pretrain.py][INFO] Epoch:[0/2](865000/4588595) loss:2.712 lr:0.0000100 epoch_Time:23701.0min: [2024-01-06 13:26:07,351][model8_pretrain.py][INFO] Epoch:[0/2](865100/4588595) loss:2.665 lr:0.0000100 epoch_Time:23700.0min: [2024-01-06 13:26:07,351][model8_pretrain.py][INFO] Epoch:[0/2](865100/4588595) loss:2.311 lr:0.0000100 epoch_Time:23700.0min: [2024-01-06 13:26:07,351][model8_pretrain.py][INFO] Epoch:[0/2](865100/4588595) loss:2.287 lr:0.0000100 epoch_Time:23700.0min: [2024-01-06 13:26:07,351][model8_pretrain.py][INFO] Epoch:[0/2](865100/4588595) loss:2.707 lr:0.0000100 epoch_Time:23700.0min: [2024-01-06 13:26:07,351][model8_pretrain.py][INFO] Epoch:[0/2](865100/4588595) loss:3.311 lr:0.0000100 epoch_Time:23700.0min: [2024-01-06 13:26:07,351][model8_pretrain.py][INFO] Epoch:[0/2](865100/4588595) loss:2.419 lr:0.0000100 epoch_Time:23700.0min: [2024-01-06 13:26:07,351][model8_pretrain.py][INFO] Epoch:[0/2](865100/4588595) loss:3.400 lr:0.0000100 epoch_Time:23700.0min: [2024-01-06 13:26:07,351][model8_pretrain.py][INFO] Epoch:[0/2](865100/4588595) loss:2.678 lr:0.0000100 epoch_Time:23700.0min: [2024-01-06 13:26:44,285][model8_pretrain.py][INFO] Epoch:[0/2](865200/4588595) loss:2.837 lr:0.0000100 epoch_Time:23700.0min: [2024-01-06 13:26:44,285][model8_pretrain.py][INFO] Epoch:[0/2](865200/4588595) loss:3.077 lr:0.0000100 epoch_Time:23700.0min: [2024-01-06 13:26:44,285][model8_pretrain.py][INFO] 
Epoch:[0/2](865200/4588595) loss:3.375 lr:0.0000100 epoch_Time:23700.0min: [2024-01-06 13:26:44,285][model8_pretrain.py][INFO] Epoch:[0/2](865200/4588595) loss:2.747 lr:0.0000100 epoch_Time:23700.0min: [2024-01-06 13:26:44,285][model8_pretrain.py][INFO] Epoch:[0/2](865200/4588595) loss:2.717 lr:0.0000100 epoch_Time:23700.0min: [2024-01-06 13:26:44,285][model8_pretrain.py][INFO] Epoch:[0/2](865200/4588595) loss:2.646 lr:0.0000100 epoch_Time:23700.0min: [2024-01-06 13:26:44,285][model8_pretrain.py][INFO] Epoch:[0/2](865200/4588595) loss:2.783 lr:0.0000100 epoch_Time:23700.0min: [2024-01-06 13:26:44,285][model8_pretrain.py][INFO] Epoch:[0/2](865200/4588595) loss:3.234 lr:0.0000100 epoch_Time:23700.0min: [2024-01-06 13:27:21,223][model8_pretrain.py][INFO] Epoch:[0/2](865300/4588595) loss:2.972 lr:0.0000100 epoch_Time:23698.0min: [2024-01-06 13:27:21,223][model8_pretrain.py][INFO] Epoch:[0/2](865300/4588595) loss:2.562 lr:0.0000100 epoch_Time:23698.0min: [2024-01-06 13:27:21,223][model8_pretrain.py][INFO] Epoch:[0/2](865300/4588595) loss:3.185 lr:0.0000100 epoch_Time:23698.0min: [2024-01-06 13:27:21,223][model8_pretrain.py][INFO] Epoch:[0/2](865300/4588595) loss:3.147 lr:0.0000100 epoch_Time:23698.0min: [2024-01-06 13:27:21,223][model8_pretrain.py][INFO] Epoch:[0/2](865300/4588595) loss:2.790 lr:0.0000100 epoch_Time:23698.0min: [2024-01-06 13:27:21,223][model8_pretrain.py][INFO] Epoch:[0/2](865300/4588595) loss:3.031 lr:0.0000100 epoch_Time:23698.0min: [2024-01-06 13:27:21,223][model8_pretrain.py][INFO] Epoch:[0/2](865300/4588595) loss:2.839 lr:0.0000100 epoch_Time:23698.0min: [2024-01-06 13:27:21,223][model8_pretrain.py][INFO] Epoch:[0/2](865300/4588595) loss:2.547 lr:0.0000100 epoch_Time:23698.0min: [2024-01-06 13:27:58,186][model8_pretrain.py][INFO] Epoch:[0/2](865400/4588595) loss:2.562 lr:0.0000100 epoch_Time:23697.0min: [2024-01-06 13:27:58,186][model8_pretrain.py][INFO] Epoch:[0/2](865400/4588595) loss:2.748 lr:0.0000100 epoch_Time:23697.0min: [2024-01-06 
13:27:58,186][model8_pretrain.py][INFO] Epoch:[0/2](865400/4588595) loss:2.329 lr:0.0000100 epoch_Time:23697.0min: [2024-01-06 13:27:58,186][model8_pretrain.py][INFO] Epoch:[0/2](865400/4588595) loss:3.239 lr:0.0000100 epoch_Time:23697.0min: [2024-01-06 13:27:58,186][model8_pretrain.py][INFO] Epoch:[0/2](865400/4588595) loss:3.151 lr:0.0000100 epoch_Time:23697.0min: [2024-01-06 13:27:58,186][model8_pretrain.py][INFO] Epoch:[0/2](865400/4588595) loss:2.876 lr:0.0000100 epoch_Time:23697.0min: [2024-01-06 13:27:58,186][model8_pretrain.py][INFO] Epoch:[0/2](865400/4588595) loss:1.965 lr:0.0000100 epoch_Time:23697.0min: [2024-01-06 13:27:58,186][model8_pretrain.py][INFO] Epoch:[0/2](865400/4588595) loss:2.570 lr:0.0000100 epoch_Time:23697.0min: [2024-01-06 13:28:45,683][model8_pretrain.py][INFO] Epoch:[0/2](865500/4588595) loss:3.118 lr:0.0000100 epoch_Time:23698.0min: [2024-01-06 13:28:45,683][model8_pretrain.py][INFO] Epoch:[0/2](865500/4588595) loss:2.644 lr:0.0000100 epoch_Time:23698.0min: [2024-01-06 13:28:45,683][model8_pretrain.py][INFO] Epoch:[0/2](865500/4588595) loss:2.605 lr:0.0000100 epoch_Time:23698.0min: [2024-01-06 13:28:45,683][model8_pretrain.py][INFO] Epoch:[0/2](865500/4588595) loss:2.581 lr:0.0000100 epoch_Time:23698.0min: [2024-01-06 13:28:45,683][model8_pretrain.py][INFO] Epoch:[0/2](865500/4588595) loss:3.568 lr:0.0000100 epoch_Time:23698.0min: [2024-01-06 13:28:45,683][model8_pretrain.py][INFO] Epoch:[0/2](865500/4588595) loss:3.006 lr:0.0000100 epoch_Time:23698.0min: [2024-01-06 13:28:45,683][model8_pretrain.py][INFO] Epoch:[0/2](865500/4588595) loss:2.935 lr:0.0000100 epoch_Time:23698.0min: [2024-01-06 13:28:45,683][model8_pretrain.py][INFO] Epoch:[0/2](865500/4588595) loss:3.372 lr:0.0000100 epoch_Time:23698.0min: [2024-01-06 13:29:22,617][model8_pretrain.py][INFO] Epoch:[0/2](865600/4588595) loss:2.793 lr:0.0000100 epoch_Time:23697.0min: [2024-01-06 13:29:22,617][model8_pretrain.py][INFO] Epoch:[0/2](865600/4588595) loss:2.953 lr:0.0000100 
epoch_Time:23697.0min: [2024-01-06 13:29:22,617][model8_pretrain.py][INFO] Epoch:[0/2](865600/4588595) loss:3.102 lr:0.0000100 epoch_Time:23697.0min: [2024-01-06 13:29:22,617][model8_pretrain.py][INFO] Epoch:[0/2](865600/4588595) loss:2.831 lr:0.0000100 epoch_Time:23697.0min: [2024-01-06 13:29:22,617][model8_pretrain.py][INFO] Epoch:[0/2](865600/4588595) loss:2.648 lr:0.0000100 epoch_Time:23697.0min: [2024-01-06 13:29:22,617][model8_pretrain.py][INFO] Epoch:[0/2](865600/4588595) loss:2.906 lr:0.0000100 epoch_Time:23697.0min: [2024-01-06 13:29:22,617][model8_pretrain.py][INFO] Epoch:[0/2](865600/4588595) loss:2.098 lr:0.0000100 epoch_Time:23697.0min: [2024-01-06 13:29:22,618][model8_pretrain.py][INFO] Epoch:[0/2](865600/4588595) loss:3.052 lr:0.0000100 epoch_Time:23697.0min: [2024-01-06 13:29:59,551][model8_pretrain.py][INFO] Epoch:[0/2](865700/4588595) loss:2.486 lr:0.0000100 epoch_Time:23696.0min: [2024-01-06 13:29:59,551][model8_pretrain.py][INFO] Epoch:[0/2](865700/4588595) loss:2.523 lr:0.0000100 epoch_Time:23696.0min: [2024-01-06 13:29:59,551][model8_pretrain.py][INFO] Epoch:[0/2](865700/4588595) loss:2.533 lr:0.0000100 epoch_Time:23696.0min: [2024-01-06 13:29:59,551][model8_pretrain.py][INFO] Epoch:[0/2](865700/4588595) loss:2.464 lr:0.0000100 epoch_Time:23696.0min: [2024-01-06 13:29:59,551][model8_pretrain.py][INFO] Epoch:[0/2](865700/4588595) loss:2.821 lr:0.0000100 epoch_Time:23696.0min: [2024-01-06 13:29:59,551][model8_pretrain.py][INFO] Epoch:[0/2](865700/4588595) loss:2.861 lr:0.0000100 epoch_Time:23696.0min: [2024-01-06 13:29:59,551][model8_pretrain.py][INFO] Epoch:[0/2](865700/4588595) loss:3.028 lr:0.0000100 epoch_Time:23696.0min: [2024-01-06 13:29:59,551][model8_pretrain.py][INFO] Epoch:[0/2](865700/4588595) loss:2.847 lr:0.0000100 epoch_Time:23696.0min: [2024-01-06 13:30:36,479][model8_pretrain.py][INFO] Epoch:[0/2](865800/4588595) loss:2.456 lr:0.0000100 epoch_Time:23696.0min: [2024-01-06 13:30:36,479][model8_pretrain.py][INFO] 
Epoch:[0/2](865800/4588595) loss:3.366 lr:0.0000100 epoch_Time:23696.0min: [2024-01-06 13:30:36,480][model8_pretrain.py][INFO] Epoch:[0/2](865800/4588595) loss:3.108 lr:0.0000100 epoch_Time:23696.0min: [2024-01-06 13:30:36,480][model8_pretrain.py][INFO] Epoch:[0/2](865800/4588595) loss:2.758 lr:0.0000100 epoch_Time:23696.0min: [2024-01-06 13:30:36,480][model8_pretrain.py][INFO] Epoch:[0/2](865800/4588595) loss:2.843 lr:0.0000100 epoch_Time:23696.0min: [2024-01-06 13:30:36,480][model8_pretrain.py][INFO] Epoch:[0/2](865800/4588595) loss:3.170 lr:0.0000100 epoch_Time:23696.0min: [2024-01-06 13:30:36,480][model8_pretrain.py][INFO] Epoch:[0/2](865800/4588595) loss:3.137 lr:0.0000100 epoch_Time:23696.0min: [2024-01-06 13:30:36,480][model8_pretrain.py][INFO] Epoch:[0/2](865800/4588595) loss:3.164 lr:0.0000100 epoch_Time:23696.0min: [2024-01-06 13:31:13,414][model8_pretrain.py][INFO] Epoch:[0/2](865900/4588595) loss:3.129 lr:0.0000100 epoch_Time:23695.0min: [2024-01-06 13:31:13,414][model8_pretrain.py][INFO] Epoch:[0/2](865900/4588595) loss:2.717 lr:0.0000100 epoch_Time:23695.0min: [2024-01-06 13:31:13,414][model8_pretrain.py][INFO] Epoch:[0/2](865900/4588595) loss:3.068 lr:0.0000100 epoch_Time:23695.0min: [2024-01-06 13:31:13,414][model8_pretrain.py][INFO] Epoch:[0/2](865900/4588595) loss:3.025 lr:0.0000100 epoch_Time:23695.0min: [2024-01-06 13:31:13,414][model8_pretrain.py][INFO] Epoch:[0/2](865900/4588595) loss:3.065 lr:0.0000100 epoch_Time:23695.0min: [2024-01-06 13:31:13,415][model8_pretrain.py][INFO] Epoch:[0/2](865900/4588595) loss:2.579 lr:0.0000100 epoch_Time:23695.0min: [2024-01-06 13:31:13,415][model8_pretrain.py][INFO] Epoch:[0/2](865900/4588595) loss:3.224 lr:0.0000100 epoch_Time:23695.0min: [2024-01-06 13:31:13,415][model8_pretrain.py][INFO] Epoch:[0/2](865900/4588595) loss:3.410 lr:0.0000100 epoch_Time:23695.0min: [2024-01-06 13:31:50,361][model8_pretrain.py][INFO] Epoch:[0/2](866000/4588595) loss:3.243 lr:0.0000100 epoch_Time:23694.0min: [2024-01-06 
13:31:50,361][model8_pretrain.py][INFO] Epoch:[0/2](866000/4588595) loss:2.900 lr:0.0000100 epoch_Time:23694.0min: [2024-01-06 13:31:50,361][model8_pretrain.py][INFO] Epoch:[0/2](866000/4588595) loss:2.184 lr:0.0000100 epoch_Time:23694.0min: [2024-01-06 13:31:50,361][model8_pretrain.py][INFO] Epoch:[0/2](866000/4588595) loss:2.777 lr:0.0000100 epoch_Time:23694.0min: [2024-01-06 13:31:50,361][model8_pretrain.py][INFO] Epoch:[0/2](866000/4588595) loss:3.034 lr:0.0000100 epoch_Time:23694.0min: [2024-01-06 13:31:50,361][model8_pretrain.py][INFO] Epoch:[0/2](866000/4588595) loss:2.314 lr:0.0000100 epoch_Time:23694.0min: [2024-01-06 13:31:50,361][model8_pretrain.py][INFO] Epoch:[0/2](866000/4588595) loss:3.070 lr:0.0000100 epoch_Time:23694.0min: [2024-01-06 13:31:50,361][model8_pretrain.py][INFO] Epoch:[0/2](866000/4588595) loss:2.622 lr:0.0000100 epoch_Time:23694.0min: [2024-01-06 13:32:27,296][model8_pretrain.py][INFO] Epoch:[0/2](866100/4588595) loss:2.704 lr:0.0000100 epoch_Time:23694.0min: [2024-01-06 13:32:27,296][model8_pretrain.py][INFO] Epoch:[0/2](866100/4588595) loss:3.174 lr:0.0000100 epoch_Time:23694.0min: [2024-01-06 13:32:27,296][model8_pretrain.py][INFO] Epoch:[0/2](866100/4588595) loss:2.796 lr:0.0000100 epoch_Time:23694.0min: [2024-01-06 13:32:27,296][model8_pretrain.py][INFO] Epoch:[0/2](866100/4588595) loss:3.107 lr:0.0000100 epoch_Time:23694.0min: [2024-01-06 13:32:27,296][model8_pretrain.py][INFO] Epoch:[0/2](866100/4588595) loss:3.316 lr:0.0000100 epoch_Time:23694.0min: [2024-01-06 13:32:27,296][model8_pretrain.py][INFO] Epoch:[0/2](866100/4588595) loss:2.645 lr:0.0000100 epoch_Time:23694.0min: [2024-01-06 13:32:27,296][model8_pretrain.py][INFO] Epoch:[0/2](866100/4588595) loss:2.990 lr:0.0000100 epoch_Time:23694.0min: [2024-01-06 13:32:27,296][model8_pretrain.py][INFO] Epoch:[0/2](866100/4588595) loss:2.589 lr:0.0000100 epoch_Time:23694.0min: [2024-01-06 13:33:04,241][model8_pretrain.py][INFO] Epoch:[0/2](866200/4588595) loss:2.280 lr:0.0000100 
epoch_Time:23692.0min: [2024-01-06 13:33:04,241][model8_pretrain.py][INFO] Epoch:[0/2](866200/4588595) loss:3.228 lr:0.0000100 epoch_Time:23692.0min: [2024-01-06 13:33:04,241][model8_pretrain.py][INFO] Epoch:[0/2](866200/4588595) loss:2.715 lr:0.0000100 epoch_Time:23692.0min: [2024-01-06 13:33:04,241][model8_pretrain.py][INFO] Epoch:[0/2](866200/4588595) loss:2.960 lr:0.0000100 epoch_Time:23692.0min: [2024-01-06 13:33:04,241][model8_pretrain.py][INFO] Epoch:[0/2](866200/4588595) loss:2.915 lr:0.0000100 epoch_Time:23692.0min: [2024-01-06 13:33:04,241][model8_pretrain.py][INFO] Epoch:[0/2](866200/4588595) loss:3.047 lr:0.0000100 epoch_Time:23692.0min: [2024-01-06 13:33:04,241][model8_pretrain.py][INFO] Epoch:[0/2](866200/4588595) loss:2.615 lr:0.0000100 epoch_Time:23692.0min: [2024-01-06 13:33:04,242][model8_pretrain.py][INFO] Epoch:[0/2](866200/4588595) loss:3.170 lr:0.0000100 epoch_Time:23692.0min: [2024-01-06 13:33:51,637][model8_pretrain.py][INFO] Epoch:[0/2](866300/4588595) loss:2.661 lr:0.0000100 epoch_Time:23692.0min: [2024-01-06 13:33:51,637][model8_pretrain.py][INFO] Epoch:[0/2](866300/4588595) loss:3.143 lr:0.0000100 epoch_Time:23692.0min: [2024-01-06 13:33:51,637][model8_pretrain.py][INFO] Epoch:[0/2](866300/4588595) loss:2.928 lr:0.0000100 epoch_Time:23692.0min: [2024-01-06 13:33:51,637][model8_pretrain.py][INFO] Epoch:[0/2](866300/4588595) loss:2.997 lr:0.0000100 epoch_Time:23692.0min: [2024-01-06 13:33:51,637][model8_pretrain.py][INFO] Epoch:[0/2](866300/4588595) loss:3.278 lr:0.0000100 epoch_Time:23692.0min: [2024-01-06 13:33:51,637][model8_pretrain.py][INFO] Epoch:[0/2](866300/4588595) loss:3.027 lr:0.0000100 epoch_Time:23692.0min: [2024-01-06 13:33:51,637][model8_pretrain.py][INFO] Epoch:[0/2](866300/4588595) loss:3.470 lr:0.0000100 epoch_Time:23692.0min: [2024-01-06 13:33:51,637][model8_pretrain.py][INFO] Epoch:[0/2](866300/4588595) loss:2.820 lr:0.0000100 epoch_Time:23692.0min: [2024-01-06 13:34:28,593][model8_pretrain.py][INFO] 
Epoch:[0/2](866400/4588595) loss:2.577 lr:0.0000100 epoch_Time:23692.0min: [2024-01-06 13:34:28,593][model8_pretrain.py][INFO] Epoch:[0/2](866400/4588595) loss:2.970 lr:0.0000100 epoch_Time:23692.0min: [2024-01-06 13:34:28,593][model8_pretrain.py][INFO] Epoch:[0/2](866400/4588595) loss:2.790 lr:0.0000100 epoch_Time:23692.0min: [2024-01-06 13:34:28,593][model8_pretrain.py][INFO] Epoch:[0/2](866400/4588595) loss:2.909 lr:0.0000100 epoch_Time:23692.0min: [2024-01-06 13:34:28,593][model8_pretrain.py][INFO] Epoch:[0/2](866400/4588595) loss:3.082 lr:0.0000100 epoch_Time:23692.0min: [2024-01-06 13:34:28,593][model8_pretrain.py][INFO] Epoch:[0/2](866400/4588595) loss:3.022 lr:0.0000100 epoch_Time:23692.0min: [2024-01-06 13:34:28,593][model8_pretrain.py][INFO] Epoch:[0/2](866400/4588595) loss:3.171 lr:0.0000100 epoch_Time:23692.0min: [2024-01-06 13:34:28,593][model8_pretrain.py][INFO] Epoch:[0/2](866400/4588595) loss:2.658 lr:0.0000100 epoch_Time:23692.0min: [2024-01-06 13:35:05,545][model8_pretrain.py][INFO] Epoch:[0/2](866500/4588595) loss:2.993 lr:0.0000100 epoch_Time:23691.0min: [2024-01-06 13:35:05,545][model8_pretrain.py][INFO] Epoch:[0/2](866500/4588595) loss:2.899 lr:0.0000100 epoch_Time:23691.0min: [2024-01-06 13:35:05,545][model8_pretrain.py][INFO] Epoch:[0/2](866500/4588595) loss:2.946 lr:0.0000100 epoch_Time:23691.0min: [2024-01-06 13:35:05,545][model8_pretrain.py][INFO] Epoch:[0/2](866500/4588595) loss:2.886 lr:0.0000100 epoch_Time:23691.0min: [2024-01-06 13:35:05,545][model8_pretrain.py][INFO] Epoch:[0/2](866500/4588595) loss:3.392 lr:0.0000100 epoch_Time:23691.0min: [2024-01-06 13:35:05,545][model8_pretrain.py][INFO] Epoch:[0/2](866500/4588595) loss:2.461 lr:0.0000100 epoch_Time:23691.0min: [2024-01-06 13:35:05,546][model8_pretrain.py][INFO] Epoch:[0/2](866500/4588595) loss:2.555 lr:0.0000100 epoch_Time:23691.0min: [2024-01-06 13:35:05,546][model8_pretrain.py][INFO] Epoch:[0/2](866500/4588595) loss:2.865 lr:0.0000100 epoch_Time:23691.0min: [2024-01-06 
13:35:42,510][model8_pretrain.py][INFO] Epoch:[0/2](866600/4588595) loss:3.031 lr:0.0000100 epoch_Time:23691.0min: [2024-01-06 13:35:42,510][model8_pretrain.py][INFO] Epoch:[0/2](866600/4588595) loss:2.676 lr:0.0000100 epoch_Time:23691.0min: [2024-01-06 13:35:42,510][model8_pretrain.py][INFO] Epoch:[0/2](866600/4588595) loss:2.285 lr:0.0000100 epoch_Time:23691.0min: [2024-01-06 13:35:42,510][model8_pretrain.py][INFO] Epoch:[0/2](866600/4588595) loss:2.956 lr:0.0000100 epoch_Time:23691.0min: [2024-01-06 13:35:42,510][model8_pretrain.py][INFO] Epoch:[0/2](866600/4588595) loss:2.333 lr:0.0000100 epoch_Time:23691.0min: [2024-01-06 13:35:42,510][model8_pretrain.py][INFO] Epoch:[0/2](866600/4588595) loss:3.082 lr:0.0000100 epoch_Time:23691.0min: [2024-01-06 13:35:42,511][model8_pretrain.py][INFO] Epoch:[0/2](866600/4588595) loss:2.437 lr:0.0000100 epoch_Time:23691.0min: [2024-01-06 13:35:42,511][model8_pretrain.py][INFO] Epoch:[0/2](866600/4588595) loss:2.849 lr:0.0000100 epoch_Time:23691.0min: [2024-01-06 13:36:19,465][model8_pretrain.py][INFO] Epoch:[0/2](866700/4588595) loss:3.098 lr:0.0000100 epoch_Time:23690.0min: [2024-01-06 13:36:19,465][model8_pretrain.py][INFO] Epoch:[0/2](866700/4588595) loss:3.026 lr:0.0000100 epoch_Time:23690.0min: [2024-01-06 13:36:19,465][model8_pretrain.py][INFO] Epoch:[0/2](866700/4588595) loss:2.787 lr:0.0000100 epoch_Time:23690.0min: [2024-01-06 13:36:19,465][model8_pretrain.py][INFO] Epoch:[0/2](866700/4588595) loss:2.657 lr:0.0000100 epoch_Time:23690.0min: [2024-01-06 13:36:19,465][model8_pretrain.py][INFO] Epoch:[0/2](866700/4588595) loss:3.058 lr:0.0000100 epoch_Time:23690.0min: [2024-01-06 13:36:19,465][model8_pretrain.py][INFO] Epoch:[0/2](866700/4588595) loss:3.268 lr:0.0000100 epoch_Time:23690.0min: [2024-01-06 13:36:19,465][model8_pretrain.py][INFO] Epoch:[0/2](866700/4588595) loss:2.733 lr:0.0000100 epoch_Time:23690.0min: [2024-01-06 13:36:19,465][model8_pretrain.py][INFO] Epoch:[0/2](866700/4588595) loss:2.601 lr:0.0000100 
epoch_Time:23690.0min: [2024-01-06 13:36:56,415][model8_pretrain.py][INFO] Epoch:[0/2](866800/4588595) loss:2.322 lr:0.0000100 epoch_Time:23689.0min: [2024-01-06 13:36:56,415][model8_pretrain.py][INFO] Epoch:[0/2](866800/4588595) loss:2.957 lr:0.0000100 epoch_Time:23689.0min: [2024-01-06 13:36:56,415][model8_pretrain.py][INFO] Epoch:[0/2](866800/4588595) loss:2.673 lr:0.0000100 epoch_Time:23689.0min: [2024-01-06 13:36:56,415][model8_pretrain.py][INFO] Epoch:[0/2](866800/4588595) loss:2.540 lr:0.0000100 epoch_Time:23689.0min: [2024-01-06 13:36:56,415][model8_pretrain.py][INFO] Epoch:[0/2](866800/4588595) loss:2.601 lr:0.0000100 epoch_Time:23689.0min: [2024-01-06 13:36:56,415][model8_pretrain.py][INFO] Epoch:[0/2](866800/4588595) loss:2.323 lr:0.0000100 epoch_Time:23689.0min: [2024-01-06 13:36:56,415][model8_pretrain.py][INFO] Epoch:[0/2](866800/4588595) loss:3.068 lr:0.0000100 epoch_Time:23689.0min: [2024-01-06 13:36:56,415][model8_pretrain.py][INFO] Epoch:[0/2](866800/4588595) loss:3.303 lr:0.0000100 epoch_Time:23689.0min: [2024-01-06 13:37:33,374][model8_pretrain.py][INFO] Epoch:[0/2](866900/4588595) loss:3.121 lr:0.0000100 epoch_Time:23689.0min: [2024-01-06 13:37:33,374][model8_pretrain.py][INFO] Epoch:[0/2](866900/4588595) loss:2.755 lr:0.0000100 epoch_Time:23689.0min: [2024-01-06 13:37:33,374][model8_pretrain.py][INFO] Epoch:[0/2](866900/4588595) loss:2.927 lr:0.0000100 epoch_Time:23689.0min: [2024-01-06 13:37:33,374][model8_pretrain.py][INFO] Epoch:[0/2](866900/4588595) loss:2.702 lr:0.0000100 epoch_Time:23689.0min: [2024-01-06 13:37:33,374][model8_pretrain.py][INFO] Epoch:[0/2](866900/4588595) loss:2.845 lr:0.0000100 epoch_Time:23689.0min: [2024-01-06 13:37:33,374][model8_pretrain.py][INFO] Epoch:[0/2](866900/4588595) loss:2.305 lr:0.0000100 epoch_Time:23689.0min: [2024-01-06 13:37:33,374][model8_pretrain.py][INFO] Epoch:[0/2](866900/4588595) loss:2.899 lr:0.0000100 epoch_Time:23689.0min: [2024-01-06 13:37:33,374][model8_pretrain.py][INFO] 
Epoch:[0/2](866900/4588595) loss:2.004 lr:0.0000100 epoch_Time:23689.0min: [2024-01-06 13:38:10,339][model8_pretrain.py][INFO] Epoch:[0/2](867000/4588595) loss:2.640 lr:0.0000100 epoch_Time:23687.0min: [2024-01-06 13:38:10,339][model8_pretrain.py][INFO] Epoch:[0/2](867000/4588595) loss:2.947 lr:0.0000100 epoch_Time:23687.0min: [2024-01-06 13:38:10,339][model8_pretrain.py][INFO] Epoch:[0/2](867000/4588595) loss:2.247 lr:0.0000100 epoch_Time:23687.0min: [2024-01-06 13:38:10,339][model8_pretrain.py][INFO] Epoch:[0/2](867000/4588595) loss:2.707 lr:0.0000100 epoch_Time:23687.0min: [2024-01-06 13:38:10,339][model8_pretrain.py][INFO] Epoch:[0/2](867000/4588595) loss:2.580 lr:0.0000100 epoch_Time:23687.0min: [2024-01-06 13:38:10,339][model8_pretrain.py][INFO] Epoch:[0/2](867000/4588595) loss:2.462 lr:0.0000100 epoch_Time:23687.0min: [2024-01-06 13:38:10,339][model8_pretrain.py][INFO] Epoch:[0/2](867000/4588595) loss:2.554 lr:0.0000100 epoch_Time:23687.0min: [2024-01-06 13:38:10,339][model8_pretrain.py][INFO] Epoch:[0/2](867000/4588595) loss:2.471 lr:0.0000100 epoch_Time:23687.0min: [2024-01-06 13:38:57,770][model8_pretrain.py][INFO] Epoch:[0/2](867100/4588595) loss:2.376 lr:0.0000100 epoch_Time:23687.0min: [2024-01-06 13:38:57,770][model8_pretrain.py][INFO] Epoch:[0/2](867100/4588595) loss:2.812 lr:0.0000100 epoch_Time:23687.0min: [2024-01-06 13:38:57,770][model8_pretrain.py][INFO] Epoch:[0/2](867100/4588595) loss:2.838 lr:0.0000100 epoch_Time:23687.0min: [2024-01-06 13:38:57,770][model8_pretrain.py][INFO] Epoch:[0/2](867100/4588595) loss:2.735 lr:0.0000100 epoch_Time:23687.0min: [2024-01-06 13:38:57,770][model8_pretrain.py][INFO] Epoch:[0/2](867100/4588595) loss:3.046 lr:0.0000100 epoch_Time:23687.0min: [2024-01-06 13:38:57,770][model8_pretrain.py][INFO] Epoch:[0/2](867100/4588595) loss:2.926 lr:0.0000100 epoch_Time:23687.0min: [2024-01-06 13:38:57,771][model8_pretrain.py][INFO] Epoch:[0/2](867100/4588595) loss:2.685 lr:0.0000100 epoch_Time:23687.0min: [2024-01-06 
13:38:57,771][model8_pretrain.py][INFO] Epoch:[0/2](867100/4588595) loss:2.485 lr:0.0000100 epoch_Time:23687.0min: [2024-01-06 13:39:34,685][model8_pretrain.py][INFO] Epoch:[0/2](867200/4588595) loss:3.261 lr:0.0000100 epoch_Time:23687.0min: [2024-01-06 13:39:34,685][model8_pretrain.py][INFO] Epoch:[0/2](867200/4588595) loss:2.663 lr:0.0000100 epoch_Time:23687.0min: [2024-01-06 13:39:34,685][model8_pretrain.py][INFO] Epoch:[0/2](867200/4588595) loss:2.093 lr:0.0000100 epoch_Time:23687.0min: [2024-01-06 13:39:34,685][model8_pretrain.py][INFO] Epoch:[0/2](867200/4588595) loss:3.208 lr:0.0000100 epoch_Time:23687.0min: [2024-01-06 13:39:34,685][model8_pretrain.py][INFO] Epoch:[0/2](867200/4588595) loss:2.744 lr:0.0000100 epoch_Time:23687.0min: [2024-01-06 13:39:34,685][model8_pretrain.py][INFO] Epoch:[0/2](867200/4588595) loss:2.889 lr:0.0000100 epoch_Time:23687.0min: [2024-01-06 13:39:34,685][model8_pretrain.py][INFO] Epoch:[0/2](867200/4588595) loss:2.953 lr:0.0000100 epoch_Time:23687.0min: [2024-01-06 13:39:34,685][model8_pretrain.py][INFO] Epoch:[0/2](867200/4588595) loss:2.879 lr:0.0000100 epoch_Time:23687.0min: [2024-01-06 13:40:11,633][model8_pretrain.py][INFO] Epoch:[0/2](867300/4588595) loss:2.695 lr:0.0000100 epoch_Time:23686.0min: [2024-01-06 13:40:11,633][model8_pretrain.py][INFO] Epoch:[0/2](867300/4588595) loss:3.048 lr:0.0000100 epoch_Time:23686.0min: [2024-01-06 13:40:11,633][model8_pretrain.py][INFO] Epoch:[0/2](867300/4588595) loss:2.876 lr:0.0000100 epoch_Time:23686.0min: [2024-01-06 13:40:11,633][model8_pretrain.py][INFO] Epoch:[0/2](867300/4588595) loss:2.719 lr:0.0000100 epoch_Time:23686.0min: [2024-01-06 13:40:11,633][model8_pretrain.py][INFO] Epoch:[0/2](867300/4588595) loss:3.187 lr:0.0000100 epoch_Time:23686.0min: [2024-01-06 13:40:11,633][model8_pretrain.py][INFO] Epoch:[0/2](867300/4588595) loss:2.473 lr:0.0000100 epoch_Time:23686.0min: [2024-01-06 13:40:11,633][model8_pretrain.py][INFO] Epoch:[0/2](867300/4588595) loss:2.746 lr:0.0000100 
epoch_Time:23686.0min: [2024-01-06 13:40:11,633][model8_pretrain.py][INFO] Epoch:[0/2](867300/4588595) loss:3.068 lr:0.0000100 epoch_Time:23686.0min: [2024-01-06 13:40:48,594][model8_pretrain.py][INFO] Epoch:[0/2](867400/4588595) loss:2.915 lr:0.0000100 epoch_Time:23685.0min: [2024-01-06 13:40:48,594][model8_pretrain.py][INFO] Epoch:[0/2](867400/4588595) loss:2.403 lr:0.0000100 epoch_Time:23685.0min: [2024-01-06 13:40:48,594][model8_pretrain.py][INFO] Epoch:[0/2](867400/4588595) loss:2.131 lr:0.0000100 epoch_Time:23685.0min: [2024-01-06 13:40:48,594][model8_pretrain.py][INFO] Epoch:[0/2](867400/4588595) loss:3.089 lr:0.0000100 epoch_Time:23685.0min: [2024-01-06 13:40:48,594][model8_pretrain.py][INFO] Epoch:[0/2](867400/4588595) loss:2.715 lr:0.0000100 epoch_Time:23685.0min: [2024-01-06 13:40:48,595][model8_pretrain.py][INFO] Epoch:[0/2](867400/4588595) loss:2.534 lr:0.0000100 epoch_Time:23685.0min: [2024-01-06 13:40:48,594][model8_pretrain.py][INFO] Epoch:[0/2](867400/4588595) loss:2.808 lr:0.0000100 epoch_Time:23685.0min: [2024-01-06 13:40:48,595][model8_pretrain.py][INFO] Epoch:[0/2](867400/4588595) loss:3.030 lr:0.0000100 epoch_Time:23685.0min: [2024-01-06 13:41:25,553][model8_pretrain.py][INFO] Epoch:[0/2](867500/4588595) loss:2.254 lr:0.0000100 epoch_Time:23685.0min: [2024-01-06 13:41:25,553][model8_pretrain.py][INFO] Epoch:[0/2](867500/4588595) loss:2.575 lr:0.0000100 epoch_Time:23685.0min: [2024-01-06 13:41:25,553][model8_pretrain.py][INFO] Epoch:[0/2](867500/4588595) loss:2.522 lr:0.0000100 epoch_Time:23685.0min: [2024-01-06 13:41:25,553][model8_pretrain.py][INFO] Epoch:[0/2](867500/4588595) loss:3.001 lr:0.0000100 epoch_Time:23685.0min: [2024-01-06 13:41:25,553][model8_pretrain.py][INFO] Epoch:[0/2](867500/4588595) loss:2.474 lr:0.0000100 epoch_Time:23685.0min: [2024-01-06 13:41:25,553][model8_pretrain.py][INFO] Epoch:[0/2](867500/4588595) loss:2.502 lr:0.0000100 epoch_Time:23685.0min: [2024-01-06 13:41:25,553][model8_pretrain.py][INFO] 
Epoch:[0/2](867500/4588595) loss:2.322 lr:0.0000100 epoch_Time:23685.0min: [2024-01-06 13:41:25,554][model8_pretrain.py][INFO] Epoch:[0/2](867500/4588595) loss:2.927 lr:0.0000100 epoch_Time:23685.0min: [2024-01-06 13:42:02,518][model8_pretrain.py][INFO] Epoch:[0/2](867600/4588595) loss:2.817 lr:0.0000100 epoch_Time:23684.0min: [2024-01-06 13:42:02,518][model8_pretrain.py][INFO] Epoch:[0/2](867600/4588595) loss:2.664 lr:0.0000100 epoch_Time:23684.0min: [2024-01-06 13:42:02,518][model8_pretrain.py][INFO] Epoch:[0/2](867600/4588595) loss:2.742 lr:0.0000100 epoch_Time:23684.0min: [2024-01-06 13:42:02,518][model8_pretrain.py][INFO] Epoch:[0/2](867600/4588595) loss:2.987 lr:0.0000100 epoch_Time:23684.0min: [2024-01-06 13:42:02,518][model8_pretrain.py][INFO] Epoch:[0/2](867600/4588595) loss:2.413 lr:0.0000100 epoch_Time:23684.0min: [2024-01-06 13:42:02,518][model8_pretrain.py][INFO] Epoch:[0/2](867600/4588595) loss:2.640 lr:0.0000100 epoch_Time:23684.0min: [2024-01-06 13:42:02,518][model8_pretrain.py][INFO] Epoch:[0/2](867600/4588595) loss:2.574 lr:0.0000100 epoch_Time:23684.0min: [2024-01-06 13:42:02,519][model8_pretrain.py][INFO] Epoch:[0/2](867600/4588595) loss:2.479 lr:0.0000100 epoch_Time:23684.0min: [2024-01-06 13:42:39,483][model8_pretrain.py][INFO] Epoch:[0/2](867700/4588595) loss:2.887 lr:0.0000100 epoch_Time:23684.0min: [2024-01-06 13:42:39,483][model8_pretrain.py][INFO] Epoch:[0/2](867700/4588595) loss:2.645 lr:0.0000100 epoch_Time:23684.0min: [2024-01-06 13:42:39,483][model8_pretrain.py][INFO] Epoch:[0/2](867700/4588595) loss:2.960 lr:0.0000100 epoch_Time:23684.0min: [2024-01-06 13:42:39,483][model8_pretrain.py][INFO] Epoch:[0/2](867700/4588595) loss:3.090 lr:0.0000100 epoch_Time:23684.0min: [2024-01-06 13:42:39,483][model8_pretrain.py][INFO] Epoch:[0/2](867700/4588595) loss:2.193 lr:0.0000100 epoch_Time:23684.0min: [2024-01-06 13:42:39,483][model8_pretrain.py][INFO] Epoch:[0/2](867700/4588595) loss:3.084 lr:0.0000100 epoch_Time:23684.0min: [2024-01-06 
13:42:39,483][model8_pretrain.py][INFO] Epoch:[0/2](867700/4588595) loss:2.958 lr:0.0000100 epoch_Time:23684.0min: [2024-01-06 13:42:39,483][model8_pretrain.py][INFO] Epoch:[0/2](867700/4588595) loss:2.209 lr:0.0000100 epoch_Time:23684.0min: [2024-01-06 13:43:16,433][model8_pretrain.py][INFO] Epoch:[0/2](867800/4588595) loss:2.608 lr:0.0000100 epoch_Time:23683.0min: [2024-01-06 13:43:16,433][model8_pretrain.py][INFO] Epoch:[0/2](867800/4588595) loss:3.037 lr:0.0000100 epoch_Time:23683.0min: [2024-01-06 13:43:16,433][model8_pretrain.py][INFO] Epoch:[0/2](867800/4588595) loss:3.073 lr:0.0000100 epoch_Time:23683.0min: [2024-01-06 13:43:16,433][model8_pretrain.py][INFO] Epoch:[0/2](867800/4588595) loss:3.277 lr:0.0000100 epoch_Time:23683.0min: [2024-01-06 13:43:16,433][model8_pretrain.py][INFO] Epoch:[0/2](867800/4588595) loss:2.473 lr:0.0000100 epoch_Time:23683.0min: [2024-01-06 13:43:16,433][model8_pretrain.py][INFO] Epoch:[0/2](867800/4588595) loss:2.871 lr:0.0000100 epoch_Time:23683.0min: [2024-01-06 13:43:16,433][model8_pretrain.py][INFO] Epoch:[0/2](867800/4588595) loss:2.553 lr:0.0000100 epoch_Time:23683.0min: [2024-01-06 13:43:16,434][model8_pretrain.py][INFO] Epoch:[0/2](867800/4588595) loss:2.604 lr:0.0000100 epoch_Time:23683.0min: [2024-01-06 13:44:03,906][model8_pretrain.py][INFO] Epoch:[0/2](867900/4588595) loss:2.662 lr:0.0000100 epoch_Time:23682.0min: [2024-01-06 13:44:03,906][model8_pretrain.py][INFO] Epoch:[0/2](867900/4588595) loss:2.515 lr:0.0000100 epoch_Time:23682.0min: [2024-01-06 13:44:03,906][model8_pretrain.py][INFO] Epoch:[0/2](867900/4588595) loss:2.494 lr:0.0000100 epoch_Time:23682.0min: [2024-01-06 13:44:03,906][model8_pretrain.py][INFO] Epoch:[0/2](867900/4588595) loss:2.593 lr:0.0000100 epoch_Time:23682.0min: [2024-01-06 13:44:03,906][model8_pretrain.py][INFO] Epoch:[0/2](867900/4588595) loss:2.425 lr:0.0000100 epoch_Time:23682.0min: [2024-01-06 13:44:03,906][model8_pretrain.py][INFO] Epoch:[0/2](867900/4588595) loss:2.706 lr:0.0000100 
epoch_Time:23682.0min: [2024-01-06 13:44:03,907][model8_pretrain.py][INFO] Epoch:[0/2](867900/4588595) loss:2.904 lr:0.0000100 epoch_Time:23682.0min: [2024-01-06 13:44:03,907][model8_pretrain.py][INFO] Epoch:[0/2](867900/4588595) loss:3.054 lr:0.0000100 epoch_Time:23682.0min: [2024-01-06 13:44:40,833][model8_pretrain.py][INFO] Epoch:[0/2](868000/4588595) loss:2.692 lr:0.0000100 epoch_Time:23682.0min: [2024-01-06 13:44:40,833][model8_pretrain.py][INFO] Epoch:[0/2](868000/4588595) loss:3.015 lr:0.0000100 epoch_Time:23682.0min: [2024-01-06 13:44:40,834][model8_pretrain.py][INFO] Epoch:[0/2](868000/4588595) loss:2.994 lr:0.0000100 epoch_Time:23682.0min: [2024-01-06 13:44:40,834][model8_pretrain.py][INFO] Epoch:[0/2](868000/4588595) loss:3.214 lr:0.0000100 epoch_Time:23682.0min: [2024-01-06 13:44:40,834][model8_pretrain.py][INFO] Epoch:[0/2](868000/4588595) loss:2.547 lr:0.0000100 epoch_Time:23682.0min: [2024-01-06 13:44:40,834][model8_pretrain.py][INFO] Epoch:[0/2](868000/4588595) loss:2.882 lr:0.0000100 epoch_Time:23682.0min: [2024-01-06 13:44:40,834][model8_pretrain.py][INFO] Epoch:[0/2](868000/4588595) loss:3.029 lr:0.0000100 epoch_Time:23682.0min: [2024-01-06 13:44:40,834][model8_pretrain.py][INFO] Epoch:[0/2](868000/4588595) loss:3.094 lr:0.0000100 epoch_Time:23682.0min: [2024-01-06 13:45:17,765][model8_pretrain.py][INFO] Epoch:[0/2](868100/4588595) loss:2.961 lr:0.0000100 epoch_Time:23681.0min: [2024-01-06 13:45:17,765][model8_pretrain.py][INFO] Epoch:[0/2](868100/4588595) loss:2.512 lr:0.0000100 epoch_Time:23681.0min: [2024-01-06 13:45:17,765][model8_pretrain.py][INFO] Epoch:[0/2](868100/4588595) loss:2.598 lr:0.0000100 epoch_Time:23681.0min: [2024-01-06 13:45:17,765][model8_pretrain.py][INFO] Epoch:[0/2](868100/4588595) loss:3.252 lr:0.0000100 epoch_Time:23681.0min: [2024-01-06 13:45:17,765][model8_pretrain.py][INFO] Epoch:[0/2](868100/4588595) loss:2.488 lr:0.0000100 epoch_Time:23681.0min: [2024-01-06 13:45:17,765][model8_pretrain.py][INFO] 
Epoch:[0/2](868100/4588595) loss:2.351 lr:0.0000100 epoch_Time:23681.0min: [2024-01-06 13:45:17,765][model8_pretrain.py][INFO] Epoch:[0/2](868100/4588595) loss:2.907 lr:0.0000100 epoch_Time:23681.0min: [2024-01-06 13:45:17,766][model8_pretrain.py][INFO] Epoch:[0/2](868100/4588595) loss:2.934 lr:0.0000100 epoch_Time:23681.0min: [2024-01-06 13:45:54,714][model8_pretrain.py][INFO] Epoch:[0/2](868200/4588595) loss:2.282 lr:0.0000100 epoch_Time:23680.0min: [2024-01-06 13:45:54,714][model8_pretrain.py][INFO] Epoch:[0/2](868200/4588595) loss:2.731 lr:0.0000100 epoch_Time:23680.0min: [2024-01-06 13:45:54,714][model8_pretrain.py][INFO] Epoch:[0/2](868200/4588595) loss:3.062 lr:0.0000100 epoch_Time:23680.0min: [2024-01-06 13:45:54,714][model8_pretrain.py][INFO] Epoch:[0/2](868200/4588595) loss:2.448 lr:0.0000100 epoch_Time:23680.0min: [2024-01-06 13:45:54,714][model8_pretrain.py][INFO] Epoch:[0/2](868200/4588595) loss:2.331 lr:0.0000100 epoch_Time:23680.0min: [2024-01-06 13:45:54,714][model8_pretrain.py][INFO] Epoch:[0/2](868200/4588595) loss:2.895 lr:0.0000100 epoch_Time:23680.0min: [2024-01-06 13:45:54,714][model8_pretrain.py][INFO] Epoch:[0/2](868200/4588595) loss:3.179 lr:0.0000100 epoch_Time:23680.0min: [2024-01-06 13:45:54,714][model8_pretrain.py][INFO] Epoch:[0/2](868200/4588595) loss:3.216 lr:0.0000100 epoch_Time:23680.0min: [2024-01-06 13:46:31,649][model8_pretrain.py][INFO] Epoch:[0/2](868300/4588595) loss:3.106 lr:0.0000100 epoch_Time:23680.0min: [2024-01-06 13:46:31,649][model8_pretrain.py][INFO] Epoch:[0/2](868300/4588595) loss:2.391 lr:0.0000100 epoch_Time:23680.0min: [2024-01-06 13:46:31,649][model8_pretrain.py][INFO] Epoch:[0/2](868300/4588595) loss:2.001 lr:0.0000100 epoch_Time:23680.0min: [2024-01-06 13:46:31,649][model8_pretrain.py][INFO] Epoch:[0/2](868300/4588595) loss:3.180 lr:0.0000100 epoch_Time:23680.0min: [2024-01-06 13:46:31,649][model8_pretrain.py][INFO] Epoch:[0/2](868300/4588595) loss:2.792 lr:0.0000100 epoch_Time:23680.0min: [2024-01-06 
13:46:31,649][model8_pretrain.py][INFO] Epoch:[0/2](868300/4588595) loss:2.861 lr:0.0000100 epoch_Time:23680.0min: [2024-01-06 13:46:31,649][model8_pretrain.py][INFO] Epoch:[0/2](868300/4588595) loss:2.605 lr:0.0000100 epoch_Time:23680.0min: [2024-01-06 13:46:31,650][model8_pretrain.py][INFO] Epoch:[0/2](868300/4588595) loss:2.878 lr:0.0000100 epoch_Time:23680.0min: [2024-01-06 13:47:08,587][model8_pretrain.py][INFO] Epoch:[0/2](868400/4588595) loss:2.858 lr:0.0000100 epoch_Time:23679.0min: [2024-01-06 13:47:08,587][model8_pretrain.py][INFO] Epoch:[0/2](868400/4588595) loss:2.770 lr:0.0000100 epoch_Time:23679.0min: [2024-01-06 13:47:08,587][model8_pretrain.py][INFO] Epoch:[0/2](868400/4588595) loss:2.753 lr:0.0000100 epoch_Time:23679.0min: [2024-01-06 13:47:08,587][model8_pretrain.py][INFO] Epoch:[0/2](868400/4588595) loss:2.964 lr:0.0000100 epoch_Time:23679.0min: [2024-01-06 13:47:08,587][model8_pretrain.py][INFO] Epoch:[0/2](868400/4588595) loss:2.694 lr:0.0000100 epoch_Time:23679.0min: [2024-01-06 13:47:08,587][model8_pretrain.py][INFO] Epoch:[0/2](868400/4588595) loss:2.744 lr:0.0000100 epoch_Time:23679.0min: [2024-01-06 13:47:08,587][model8_pretrain.py][INFO] Epoch:[0/2](868400/4588595) loss:3.093 lr:0.0000100 epoch_Time:23679.0min: [2024-01-06 13:47:08,587][model8_pretrain.py][INFO] Epoch:[0/2](868400/4588595) loss:2.682 lr:0.0000100 epoch_Time:23679.0min: [2024-01-06 13:47:45,531][model8_pretrain.py][INFO] Epoch:[0/2](868500/4588595) loss:2.723 lr:0.0000100 epoch_Time:23679.0min: [2024-01-06 13:47:45,531][model8_pretrain.py][INFO] Epoch:[0/2](868500/4588595) loss:2.889 lr:0.0000100 epoch_Time:23679.0min: [2024-01-06 13:47:45,532][model8_pretrain.py][INFO] Epoch:[0/2](868500/4588595) loss:2.921 lr:0.0000100 epoch_Time:23679.0min: [2024-01-06 13:47:45,532][model8_pretrain.py][INFO] Epoch:[0/2](868500/4588595) loss:3.034 lr:0.0000100 epoch_Time:23679.0min: [2024-01-06 13:47:45,532][model8_pretrain.py][INFO] Epoch:[0/2](868500/4588595) loss:2.912 lr:0.0000100 
epoch_Time:23679.0min: [2024-01-06 13:47:45,532][model8_pretrain.py][INFO] Epoch:[0/2](868500/4588595) loss:3.483 lr:0.0000100 epoch_Time:23679.0min: [2024-01-06 13:47:45,532][model8_pretrain.py][INFO] Epoch:[0/2](868500/4588595) loss:2.983 lr:0.0000100 epoch_Time:23679.0min: [2024-01-06 13:47:45,532][model8_pretrain.py][INFO] Epoch:[0/2](868500/4588595) loss:2.906 lr:0.0000100 epoch_Time:23679.0min: [2024-01-06 13:48:22,473][model8_pretrain.py][INFO] Epoch:[0/2](868600/4588595) loss:2.639 lr:0.0000100 epoch_Time:23678.0min: [2024-01-06 13:48:22,473][model8_pretrain.py][INFO] Epoch:[0/2](868600/4588595) loss:3.068 lr:0.0000100 epoch_Time:23678.0min: [2024-01-06 13:48:22,473][model8_pretrain.py][INFO] Epoch:[0/2](868600/4588595) loss:3.225 lr:0.0000100 epoch_Time:23678.0min: [2024-01-06 13:48:22,473][model8_pretrain.py][INFO] Epoch:[0/2](868600/4588595) loss:2.881 lr:0.0000100 epoch_Time:23678.0min: [2024-01-06 13:48:22,473][model8_pretrain.py][INFO] Epoch:[0/2](868600/4588595) loss:2.738 lr:0.0000100 epoch_Time:23678.0min: [2024-01-06 13:48:22,473][model8_pretrain.py][INFO] Epoch:[0/2](868600/4588595) loss:2.486 lr:0.0000100 epoch_Time:23678.0min: [2024-01-06 13:48:22,473][model8_pretrain.py][INFO] Epoch:[0/2](868600/4588595) loss:3.113 lr:0.0000100 epoch_Time:23678.0min: [2024-01-06 13:48:22,473][model8_pretrain.py][INFO] Epoch:[0/2](868600/4588595) loss:2.560 lr:0.0000100 epoch_Time:23678.0min: [2024-01-06 13:49:09,871][model8_pretrain.py][INFO] Epoch:[0/2](868700/4588595) loss:2.852 lr:0.0000100 epoch_Time:23677.0min: [2024-01-06 13:49:09,871][model8_pretrain.py][INFO] Epoch:[0/2](868700/4588595) loss:2.828 lr:0.0000100 epoch_Time:23677.0min: [2024-01-06 13:49:09,871][model8_pretrain.py][INFO] Epoch:[0/2](868700/4588595) loss:2.681 lr:0.0000100 epoch_Time:23677.0min: [2024-01-06 13:49:09,871][model8_pretrain.py][INFO] Epoch:[0/2](868700/4588595) loss:3.039 lr:0.0000100 epoch_Time:23677.0min: [2024-01-06 13:49:09,871][model8_pretrain.py][INFO] 
Epoch:[0/2](868700/4588595) loss:2.565 lr:0.0000100 epoch_Time:23677.0min: [2024-01-06 13:49:09,871][model8_pretrain.py][INFO] Epoch:[0/2](868700/4588595) loss:2.612 lr:0.0000100 epoch_Time:23677.0min: [2024-01-06 13:49:09,871][model8_pretrain.py][INFO] Epoch:[0/2](868700/4588595) loss:1.808 lr:0.0000100 epoch_Time:23677.0min: [2024-01-06 13:49:09,871][model8_pretrain.py][INFO] Epoch:[0/2](868700/4588595) loss:2.756 lr:0.0000100 epoch_Time:23677.0min: [2024-01-06 13:49:46,788][model8_pretrain.py][INFO] Epoch:[0/2](868800/4588595) loss:3.207 lr:0.0000100 epoch_Time:23677.0min: [2024-01-06 13:49:46,788][model8_pretrain.py][INFO] Epoch:[0/2](868800/4588595) loss:3.302 lr:0.0000100 epoch_Time:23677.0min: [2024-01-06 13:49:46,788][model8_pretrain.py][INFO] Epoch:[0/2](868800/4588595) loss:2.790 lr:0.0000100 epoch_Time:23677.0min: [2024-01-06 13:49:46,788][model8_pretrain.py][INFO] Epoch:[0/2](868800/4588595) loss:2.983 lr:0.0000100 epoch_Time:23677.0min: [2024-01-06 13:49:46,788][model8_pretrain.py][INFO] Epoch:[0/2](868800/4588595) loss:2.771 lr:0.0000100 epoch_Time:23677.0min: [2024-01-06 13:49:46,788][model8_pretrain.py][INFO] Epoch:[0/2](868800/4588595) loss:2.334 lr:0.0000100 epoch_Time:23677.0min: [2024-01-06 13:49:46,788][model8_pretrain.py][INFO] Epoch:[0/2](868800/4588595) loss:2.985 lr:0.0000100 epoch_Time:23677.0min: [2024-01-06 13:49:46,788][model8_pretrain.py][INFO] Epoch:[0/2](868800/4588595) loss:3.027 lr:0.0000100 epoch_Time:23677.0min: [2024-01-06 13:50:23,727][model8_pretrain.py][INFO] Epoch:[0/2](868900/4588595) loss:2.925 lr:0.0000100 epoch_Time:23676.0min: [2024-01-06 13:50:23,727][model8_pretrain.py][INFO] Epoch:[0/2](868900/4588595) loss:2.685 lr:0.0000100 epoch_Time:23676.0min: [2024-01-06 13:50:23,727][model8_pretrain.py][INFO] Epoch:[0/2](868900/4588595) loss:2.407 lr:0.0000100 epoch_Time:23676.0min: [2024-01-06 13:50:23,727][model8_pretrain.py][INFO] Epoch:[0/2](868900/4588595) loss:2.660 lr:0.0000100 epoch_Time:23676.0min: [2024-01-06 
13:50:23,727][model8_pretrain.py][INFO] Epoch:[0/2](868900/4588595) loss:2.233 lr:0.0000100 epoch_Time:23676.0min: [2024-01-06 13:50:23,727][model8_pretrain.py][INFO] Epoch:[0/2](868900/4588595) loss:2.664 lr:0.0000100 epoch_Time:23676.0min: [2024-01-06 13:50:23,727][model8_pretrain.py][INFO] Epoch:[0/2](868900/4588595) loss:2.830 lr:0.0000100 epoch_Time:23676.0min: [2024-01-06 13:50:23,727][model8_pretrain.py][INFO] Epoch:[0/2](868900/4588595) loss:2.515 lr:0.0000100 epoch_Time:23676.0min: [2024-01-06 13:51:00,663][model8_pretrain.py][INFO] Epoch:[0/2](869000/4588595) loss:2.865 lr:0.0000100 epoch_Time:23675.0min: [2024-01-06 13:51:00,663][model8_pretrain.py][INFO] Epoch:[0/2](869000/4588595) loss:2.579 lr:0.0000100 epoch_Time:23675.0min: [2024-01-06 13:51:00,663][model8_pretrain.py][INFO] Epoch:[0/2](869000/4588595) loss:2.502 lr:0.0000100 epoch_Time:23675.0min: [2024-01-06 13:51:00,663][model8_pretrain.py][INFO] Epoch:[0/2](869000/4588595) loss:3.271 lr:0.0000100 epoch_Time:23675.0min: [2024-01-06 13:51:00,663][model8_pretrain.py][INFO] Epoch:[0/2](869000/4588595) loss:2.894 lr:0.0000100 epoch_Time:23675.0min: [2024-01-06 13:51:00,663][model8_pretrain.py][INFO] Epoch:[0/2](869000/4588595) loss:2.983 lr:0.0000100 epoch_Time:23675.0min: [2024-01-06 13:51:00,663][model8_pretrain.py][INFO] Epoch:[0/2](869000/4588595) loss:2.968 lr:0.0000100 epoch_Time:23675.0min: [2024-01-06 13:51:00,663][model8_pretrain.py][INFO] Epoch:[0/2](869000/4588595) loss:2.005 lr:0.0000100 epoch_Time:23675.0min: [2024-01-06 13:51:37,593][model8_pretrain.py][INFO] Epoch:[0/2](869100/4588595) loss:2.363 lr:0.0000100 epoch_Time:23675.0min: [2024-01-06 13:51:37,593][model8_pretrain.py][INFO] Epoch:[0/2](869100/4588595) loss:2.882 lr:0.0000100 epoch_Time:23675.0min: [2024-01-06 13:51:37,593][model8_pretrain.py][INFO] Epoch:[0/2](869100/4588595) loss:2.673 lr:0.0000100 epoch_Time:23675.0min: [2024-01-06 13:51:37,593][model8_pretrain.py][INFO] Epoch:[0/2](869100/4588595) loss:2.572 lr:0.0000100 
epoch_Time:23675.0min: [2024-01-06 13:51:37,593][model8_pretrain.py][INFO] Epoch:[0/2](869100/4588595) loss:2.415 lr:0.0000100 epoch_Time:23675.0min: [2024-01-06 13:51:37,593][model8_pretrain.py][INFO] Epoch:[0/2](869100/4588595) loss:2.894 lr:0.0000100 epoch_Time:23675.0min: [2024-01-06 13:51:37,593][model8_pretrain.py][INFO] Epoch:[0/2](869100/4588595) loss:2.635 lr:0.0000100 epoch_Time:23675.0min: [2024-01-06 13:51:37,594][model8_pretrain.py][INFO] Epoch:[0/2](869100/4588595) loss:2.808 lr:0.0000100 epoch_Time:23675.0min: [2024-01-06 13:52:14,546][model8_pretrain.py][INFO] Epoch:[0/2](869200/4588595) loss:2.174 lr:0.0000100 epoch_Time:23674.0min: [2024-01-06 13:52:14,546][model8_pretrain.py][INFO] Epoch:[0/2](869200/4588595) loss:2.365 lr:0.0000100 epoch_Time:23674.0min: [2024-01-06 13:52:14,546][model8_pretrain.py][INFO] Epoch:[0/2](869200/4588595) loss:3.142 lr:0.0000100 epoch_Time:23674.0min: [2024-01-06 13:52:14,546][model8_pretrain.py][INFO] Epoch:[0/2](869200/4588595) loss:2.155 lr:0.0000100 epoch_Time:23674.0min: [2024-01-06 13:52:14,546][model8_pretrain.py][INFO] Epoch:[0/2](869200/4588595) loss:2.880 lr:0.0000100 epoch_Time:23674.0min: [2024-01-06 13:52:14,546][model8_pretrain.py][INFO] Epoch:[0/2](869200/4588595) loss:2.852 lr:0.0000100 epoch_Time:23674.0min: [2024-01-06 13:52:14,546][model8_pretrain.py][INFO] Epoch:[0/2](869200/4588595) loss:2.859 lr:0.0000100 epoch_Time:23674.0min: [2024-01-06 13:52:14,546][model8_pretrain.py][INFO] Epoch:[0/2](869200/4588595) loss:2.859 lr:0.0000100 epoch_Time:23674.0min: [2024-01-06 13:52:51,503][model8_pretrain.py][INFO] Epoch:[0/2](869300/4588595) loss:3.093 lr:0.0000100 epoch_Time:23673.0min: [2024-01-06 13:52:51,503][model8_pretrain.py][INFO] Epoch:[0/2](869300/4588595) loss:2.904 lr:0.0000100 epoch_Time:23673.0min: [2024-01-06 13:52:51,503][model8_pretrain.py][INFO] Epoch:[0/2](869300/4588595) loss:3.097 lr:0.0000100 epoch_Time:23673.0min: [2024-01-06 13:52:51,503][model8_pretrain.py][INFO] 
Epoch:[0/2](869300/4588595) loss:3.068 lr:0.0000100 epoch_Time:23673.0min: [2024-01-06 13:52:51,503][model8_pretrain.py][INFO] Epoch:[0/2](869300/4588595) loss:3.154 lr:0.0000100 epoch_Time:23673.0min: [2024-01-06 13:52:51,503][model8_pretrain.py][INFO] Epoch:[0/2](869300/4588595) loss:2.923 lr:0.0000100 epoch_Time:23673.0min: [2024-01-06 13:52:51,503][model8_pretrain.py][INFO] Epoch:[0/2](869300/4588595) loss:3.015 lr:0.0000100 epoch_Time:23673.0min: [2024-01-06 13:52:51,503][model8_pretrain.py][INFO] Epoch:[0/2](869300/4588595) loss:2.899 lr:0.0000100 epoch_Time:23673.0min: [2024-01-06 13:53:28,458][model8_pretrain.py][INFO] Epoch:[0/2](869400/4588595) loss:2.929 lr:0.0000100 epoch_Time:23673.0min: [2024-01-06 13:53:28,458][model8_pretrain.py][INFO] Epoch:[0/2](869400/4588595) loss:2.570 lr:0.0000100 epoch_Time:23673.0min: [2024-01-06 13:53:28,458][model8_pretrain.py][INFO] Epoch:[0/2](869400/4588595) loss:3.065 lr:0.0000100 epoch_Time:23673.0min: [2024-01-06 13:53:28,458][model8_pretrain.py][INFO] Epoch:[0/2](869400/4588595) loss:2.586 lr:0.0000100 epoch_Time:23673.0min: [2024-01-06 13:53:28,458][model8_pretrain.py][INFO] Epoch:[0/2](869400/4588595) loss:2.665 lr:0.0000100 epoch_Time:23673.0min: [2024-01-06 13:53:28,458][model8_pretrain.py][INFO] Epoch:[0/2](869400/4588595) loss:3.098 lr:0.0000100 epoch_Time:23673.0min: [2024-01-06 13:53:28,458][model8_pretrain.py][INFO] Epoch:[0/2](869400/4588595) loss:3.110 lr:0.0000100 epoch_Time:23673.0min: [2024-01-06 13:53:28,458][model8_pretrain.py][INFO] Epoch:[0/2](869400/4588595) loss:2.855 lr:0.0000100 epoch_Time:23673.0min: [2024-01-06 13:54:15,906][model8_pretrain.py][INFO] Epoch:[0/2](869500/4588595) loss:2.974 lr:0.0000100 epoch_Time:23672.0min: [2024-01-06 13:54:15,906][model8_pretrain.py][INFO] Epoch:[0/2](869500/4588595) loss:2.676 lr:0.0000100 epoch_Time:23672.0min: [2024-01-06 13:54:15,906][model8_pretrain.py][INFO] Epoch:[0/2](869500/4588595) loss:2.615 lr:0.0000100 epoch_Time:23672.0min: [2024-01-06 
13:54:15,906][model8_pretrain.py][INFO] Epoch:[0/2](869500/4588595) loss:2.835 lr:0.0000100 epoch_Time:23672.0min: [2024-01-06 13:54:15,906][model8_pretrain.py][INFO] Epoch:[0/2](869500/4588595) loss:2.861 lr:0.0000100 epoch_Time:23672.0min: [2024-01-06 13:54:15,906][model8_pretrain.py][INFO] Epoch:[0/2](869500/4588595) loss:2.699 lr:0.0000100 epoch_Time:23672.0min: [2024-01-06 13:54:15,906][model8_pretrain.py][INFO] Epoch:[0/2](869500/4588595) loss:3.001 lr:0.0000100 epoch_Time:23672.0min: [2024-01-06 13:54:15,906][model8_pretrain.py][INFO] Epoch:[0/2](869500/4588595) loss:3.137 lr:0.0000100 epoch_Time:23672.0min: [2024-01-06 13:54:52,835][model8_pretrain.py][INFO] Epoch:[0/2](869600/4588595) loss:2.993 lr:0.0000100 epoch_Time:23671.0min: [2024-01-06 13:54:52,835][model8_pretrain.py][INFO] Epoch:[0/2](869600/4588595) loss:2.448 lr:0.0000100 epoch_Time:23671.0min: [2024-01-06 13:54:52,835][model8_pretrain.py][INFO] Epoch:[0/2](869600/4588595) loss:2.395 lr:0.0000100 epoch_Time:23671.0min: [2024-01-06 13:54:52,835][model8_pretrain.py][INFO] Epoch:[0/2](869600/4588595) loss:2.844 lr:0.0000100 epoch_Time:23671.0min: [2024-01-06 13:54:52,835][model8_pretrain.py][INFO] Epoch:[0/2](869600/4588595) loss:2.887 lr:0.0000100 epoch_Time:23671.0min: [2024-01-06 13:54:52,835][model8_pretrain.py][INFO] Epoch:[0/2](869600/4588595) loss:2.477 lr:0.0000100 epoch_Time:23671.0min: [2024-01-06 13:54:52,835][model8_pretrain.py][INFO] Epoch:[0/2](869600/4588595) loss:2.633 lr:0.0000100 epoch_Time:23671.0min: [2024-01-06 13:54:52,835][model8_pretrain.py][INFO] Epoch:[0/2](869600/4588595) loss:2.215 lr:0.0000100 epoch_Time:23671.0min: [2024-01-06 13:55:29,771][model8_pretrain.py][INFO] Epoch:[0/2](869700/4588595) loss:2.694 lr:0.0000100 epoch_Time:23671.0min: [2024-01-06 13:55:29,771][model8_pretrain.py][INFO] Epoch:[0/2](869700/4588595) loss:2.531 lr:0.0000100 epoch_Time:23671.0min: [2024-01-06 13:55:29,771][model8_pretrain.py][INFO] Epoch:[0/2](869700/4588595) loss:2.383 lr:0.0000100 
epoch_Time:23671.0min: [2024-01-06 13:55:29,771][model8_pretrain.py][INFO] Epoch:[0/2](869700/4588595) loss:2.738 lr:0.0000100 epoch_Time:23671.0min: [2024-01-06 13:55:29,771][model8_pretrain.py][INFO] Epoch:[0/2](869700/4588595) loss:3.000 lr:0.0000100 epoch_Time:23671.0min: [2024-01-06 13:55:29,771][model8_pretrain.py][INFO] Epoch:[0/2](869700/4588595) loss:3.005 lr:0.0000100 epoch_Time:23671.0min: [2024-01-06 13:55:29,771][model8_pretrain.py][INFO] Epoch:[0/2](869700/4588595) loss:2.021 lr:0.0000100 epoch_Time:23671.0min: [2024-01-06 13:55:29,771][model8_pretrain.py][INFO] Epoch:[0/2](869700/4588595) loss:2.998 lr:0.0000100 epoch_Time:23671.0min: [2024-01-06 13:56:06,710][model8_pretrain.py][INFO] Epoch:[0/2](869800/4588595) loss:2.826 lr:0.0000100 epoch_Time:23670.0min: [2024-01-06 13:56:06,710][model8_pretrain.py][INFO] Epoch:[0/2](869800/4588595) loss:3.228 lr:0.0000100 epoch_Time:23670.0min: [2024-01-06 13:56:06,710][model8_pretrain.py][INFO] Epoch:[0/2](869800/4588595) loss:2.534 lr:0.0000100 epoch_Time:23670.0min: [2024-01-06 13:56:06,710][model8_pretrain.py][INFO] Epoch:[0/2](869800/4588595) loss:3.001 lr:0.0000100 epoch_Time:23670.0min: [2024-01-06 13:56:06,710][model8_pretrain.py][INFO] Epoch:[0/2](869800/4588595) loss:2.244 lr:0.0000100 epoch_Time:23670.0min: [2024-01-06 13:56:06,710][model8_pretrain.py][INFO] Epoch:[0/2](869800/4588595) loss:2.702 lr:0.0000100 epoch_Time:23670.0min: [2024-01-06 13:56:06,710][model8_pretrain.py][INFO] Epoch:[0/2](869800/4588595) loss:2.876 lr:0.0000100 epoch_Time:23670.0min: [2024-01-06 13:56:06,710][model8_pretrain.py][INFO] Epoch:[0/2](869800/4588595) loss:2.614 lr:0.0000100 epoch_Time:23670.0min: [2024-01-06 13:56:43,652][model8_pretrain.py][INFO] Epoch:[0/2](869900/4588595) loss:3.263 lr:0.0000100 epoch_Time:23670.0min: [2024-01-06 13:56:43,652][model8_pretrain.py][INFO] Epoch:[0/2](869900/4588595) loss:2.771 lr:0.0000100 epoch_Time:23670.0min: [2024-01-06 13:56:43,652][model8_pretrain.py][INFO] 
Epoch:[0/2](869900/4588595) loss:2.787 lr:0.0000100 epoch_Time:23670.0min: [2024-01-06 13:56:43,652][model8_pretrain.py][INFO] Epoch:[0/2](869900/4588595) loss:2.800 lr:0.0000100 epoch_Time:23670.0min: [2024-01-06 13:56:43,652][model8_pretrain.py][INFO] Epoch:[0/2](869900/4588595) loss:3.337 lr:0.0000100 epoch_Time:23670.0min: [2024-01-06 13:56:43,652][model8_pretrain.py][INFO] Epoch:[0/2](869900/4588595) loss:2.880 lr:0.0000100 epoch_Time:23670.0min: [2024-01-06 13:56:43,652][model8_pretrain.py][INFO] Epoch:[0/2](869900/4588595) loss:3.019 lr:0.0000100 epoch_Time:23670.0min: [2024-01-06 13:56:43,653][model8_pretrain.py][INFO] Epoch:[0/2](869900/4588595) loss:2.755 lr:0.0000100 epoch_Time:23670.0min: [2024-01-06 13:57:20,609][model8_pretrain.py][INFO] Epoch:[0/2](870000/4588595) loss:2.725 lr:0.0000100 epoch_Time:23669.0min: [2024-01-06 13:57:20,609][model8_pretrain.py][INFO] Epoch:[0/2](870000/4588595) loss:2.722 lr:0.0000100 epoch_Time:23669.0min: [2024-01-06 13:57:20,609][model8_pretrain.py][INFO] Epoch:[0/2](870000/4588595) loss:2.783 lr:0.0000100 epoch_Time:23669.0min: [2024-01-06 13:57:20,609][model8_pretrain.py][INFO] Epoch:[0/2](870000/4588595) loss:2.984 lr:0.0000100 epoch_Time:23669.0min: [2024-01-06 13:57:20,609][model8_pretrain.py][INFO] Epoch:[0/2](870000/4588595) loss:2.660 lr:0.0000100 epoch_Time:23669.0min: [2024-01-06 13:57:20,609][model8_pretrain.py][INFO] Epoch:[0/2](870000/4588595) loss:2.795 lr:0.0000100 epoch_Time:23669.0min: [2024-01-06 13:57:20,609][model8_pretrain.py][INFO] Epoch:[0/2](870000/4588595) loss:2.620 lr:0.0000100 epoch_Time:23669.0min: [2024-01-06 13:57:20,609][model8_pretrain.py][INFO] Epoch:[0/2](870000/4588595) loss:2.993 lr:0.0000100 epoch_Time:23669.0min: [2024-01-06 13:57:57,540][model8_pretrain.py][INFO] Epoch:[0/2](870100/4588595) loss:2.510 lr:0.0000100 epoch_Time:23668.0min: [2024-01-06 13:57:57,540][model8_pretrain.py][INFO] Epoch:[0/2](870100/4588595) loss:3.034 lr:0.0000100 epoch_Time:23668.0min: [2024-01-06 
13:57:57,540][model8_pretrain.py][INFO] Epoch:[0/2](870100/4588595) loss:2.459 lr:0.0000100 epoch_Time:23668.0min: [2024-01-06 13:57:57,540][model8_pretrain.py][INFO] Epoch:[0/2](870100/4588595) loss:2.712 lr:0.0000100 epoch_Time:23668.0min: [2024-01-06 13:57:57,540][model8_pretrain.py][INFO] Epoch:[0/2](870100/4588595) loss:3.137 lr:0.0000100 epoch_Time:23668.0min: [2024-01-06 13:57:57,540][model8_pretrain.py][INFO] Epoch:[0/2](870100/4588595) loss:3.005 lr:0.0000100 epoch_Time:23668.0min: [2024-01-06 13:57:57,540][model8_pretrain.py][INFO] Epoch:[0/2](870100/4588595) loss:2.577 lr:0.0000100 epoch_Time:23668.0min: [2024-01-06 13:57:57,540][model8_pretrain.py][INFO] Epoch:[0/2](870100/4588595) loss:3.274 lr:0.0000100 epoch_Time:23668.0min: [2024-01-06 13:58:34,491][model8_pretrain.py][INFO] Epoch:[0/2](870200/4588595) loss:3.110 lr:0.0000100 epoch_Time:23668.0min: [2024-01-06 13:58:34,491][model8_pretrain.py][INFO] Epoch:[0/2](870200/4588595) loss:2.725 lr:0.0000100 epoch_Time:23668.0min: [2024-01-06 13:58:34,491][model8_pretrain.py][INFO] Epoch:[0/2](870200/4588595) loss:2.967 lr:0.0000100 epoch_Time:23668.0min: [2024-01-06 13:58:34,491][model8_pretrain.py][INFO] Epoch:[0/2](870200/4588595) loss:1.902 lr:0.0000100 epoch_Time:23668.0min: [2024-01-06 13:58:34,491][model8_pretrain.py][INFO] Epoch:[0/2](870200/4588595) loss:3.052 lr:0.0000100 epoch_Time:23668.0min: [2024-01-06 13:58:34,492][model8_pretrain.py][INFO] Epoch:[0/2](870200/4588595) loss:3.200 lr:0.0000100 epoch_Time:23668.0min: [2024-01-06 13:58:34,492][model8_pretrain.py][INFO] Epoch:[0/2](870200/4588595) loss:2.873 lr:0.0000100 epoch_Time:23668.0min: [2024-01-06 13:58:34,492][model8_pretrain.py][INFO] Epoch:[0/2](870200/4588595) loss:2.977 lr:0.0000100 epoch_Time:23668.0min: [2024-01-06 13:59:21,931][model8_pretrain.py][INFO] Epoch:[0/2](870300/4588595) loss:2.746 lr:0.0000100 epoch_Time:23667.0min: [2024-01-06 13:59:21,931][model8_pretrain.py][INFO] Epoch:[0/2](870300/4588595) loss:2.670 lr:0.0000100 
epoch_Time:23667.0min: [2024-01-06 13:59:21,931][model8_pretrain.py][INFO] Epoch:[0/2](870300/4588595) loss:2.619 lr:0.0000100 epoch_Time:23667.0min: [2024-01-06 13:59:21,931][model8_pretrain.py][INFO] Epoch:[0/2](870300/4588595) loss:2.526 lr:0.0000100 epoch_Time:23667.0min: [2024-01-06 13:59:21,931][model8_pretrain.py][INFO] Epoch:[0/2](870300/4588595) loss:2.301 lr:0.0000100 epoch_Time:23667.0min: [2024-01-06 13:59:21,931][model8_pretrain.py][INFO] Epoch:[0/2](870300/4588595) loss:2.825 lr:0.0000100 epoch_Time:23667.0min: [2024-01-06 13:59:21,931][model8_pretrain.py][INFO] Epoch:[0/2](870300/4588595) loss:2.556 lr:0.0000100 epoch_Time:23667.0min: [2024-01-06 13:59:21,932][model8_pretrain.py][INFO] Epoch:[0/2](870300/4588595) loss:2.743 lr:0.0000100 epoch_Time:23667.0min: [2024-01-06 13:59:58,862][model8_pretrain.py][INFO] Epoch:[0/2](870400/4588595) loss:2.896 lr:0.0000100 epoch_Time:23666.0min: [2024-01-06 13:59:58,862][model8_pretrain.py][INFO] Epoch:[0/2](870400/4588595) loss:2.805 lr:0.0000100 epoch_Time:23666.0min: [2024-01-06 13:59:58,863][model8_pretrain.py][INFO] Epoch:[0/2](870400/4588595) loss:2.812 lr:0.0000100 epoch_Time:23666.0min: [2024-01-06 13:59:58,863][model8_pretrain.py][INFO] Epoch:[0/2](870400/4588595) loss:2.592 lr:0.0000100 epoch_Time:23666.0min: [2024-01-06 13:59:58,863][model8_pretrain.py][INFO] Epoch:[0/2](870400/4588595) loss:3.065 lr:0.0000100 epoch_Time:23666.0min: [2024-01-06 13:59:58,863][model8_pretrain.py][INFO] Epoch:[0/2](870400/4588595) loss:2.855 lr:0.0000100 epoch_Time:23666.0min: [2024-01-06 13:59:58,863][model8_pretrain.py][INFO] Epoch:[0/2](870400/4588595) loss:2.936 lr:0.0000100 epoch_Time:23666.0min: [2024-01-06 13:59:58,863][model8_pretrain.py][INFO] Epoch:[0/2](870400/4588595) loss:2.977 lr:0.0000100 epoch_Time:23666.0min: [2024-01-06 14:00:35,796][model8_pretrain.py][INFO] Epoch:[0/2](870500/4588595) loss:2.934 lr:0.0000100 epoch_Time:23666.0min: [2024-01-06 14:00:35,796][model8_pretrain.py][INFO] 
Epoch:[0/2](870500/4588595) loss:3.133 lr:0.0000100 epoch_Time:23666.0min: [2024-01-06 14:00:35,796][model8_pretrain.py][INFO] Epoch:[0/2](870500/4588595) loss:2.588 lr:0.0000100 epoch_Time:23666.0min: [2024-01-06 14:00:35,796][model8_pretrain.py][INFO] Epoch:[0/2](870500/4588595) loss:2.986 lr:0.0000100 epoch_Time:23666.0min: [2024-01-06 14:00:35,796][model8_pretrain.py][INFO] Epoch:[0/2](870500/4588595) loss:3.179 lr:0.0000100 epoch_Time:23666.0min: [2024-01-06 14:00:35,796][model8_pretrain.py][INFO] Epoch:[0/2](870500/4588595) loss:2.615 lr:0.0000100 epoch_Time:23666.0min: [2024-01-06 14:00:35,796][model8_pretrain.py][INFO] Epoch:[0/2](870500/4588595) loss:2.627 lr:0.0000100 epoch_Time:23666.0min: [2024-01-06 14:00:35,796][model8_pretrain.py][INFO] Epoch:[0/2](870500/4588595) loss:2.419 lr:0.0000100 epoch_Time:23666.0min: [2024-01-06 14:01:12,732][model8_pretrain.py][INFO] Epoch:[0/2](870600/4588595) loss:3.236 lr:0.0000100 epoch_Time:23665.0min: [2024-01-06 14:01:12,732][model8_pretrain.py][INFO] Epoch:[0/2](870600/4588595) loss:2.808 lr:0.0000100 epoch_Time:23665.0min: [2024-01-06 14:01:12,732][model8_pretrain.py][INFO] Epoch:[0/2](870600/4588595) loss:2.704 lr:0.0000100 epoch_Time:23665.0min: [2024-01-06 14:01:12,732][model8_pretrain.py][INFO] Epoch:[0/2](870600/4588595) loss:2.728 lr:0.0000100 epoch_Time:23665.0min: [2024-01-06 14:01:12,732][model8_pretrain.py][INFO] Epoch:[0/2](870600/4588595) loss:2.651 lr:0.0000100 epoch_Time:23665.0min: [2024-01-06 14:01:12,732][model8_pretrain.py][INFO] Epoch:[0/2](870600/4588595) loss:2.366 lr:0.0000100 epoch_Time:23665.0min: [2024-01-06 14:01:12,732][model8_pretrain.py][INFO] Epoch:[0/2](870600/4588595) loss:2.737 lr:0.0000100 epoch_Time:23665.0min: [2024-01-06 14:01:12,733][model8_pretrain.py][INFO] Epoch:[0/2](870600/4588595) loss:2.760 lr:0.0000100 epoch_Time:23665.0min: [2024-01-06 14:01:49,676][model8_pretrain.py][INFO] Epoch:[0/2](870700/4588595) loss:2.988 lr:0.0000100 epoch_Time:23664.0min: [2024-01-06 
14:01:49,676][model8_pretrain.py][INFO] Epoch:[0/2](870700/4588595) loss:2.746 lr:0.0000100 epoch_Time:23664.0min: [2024-01-06 14:01:49,676][model8_pretrain.py][INFO] Epoch:[0/2](870700/4588595) loss:2.970 lr:0.0000100 epoch_Time:23664.0min: [2024-01-06 14:01:49,676][model8_pretrain.py][INFO] Epoch:[0/2](870700/4588595) loss:2.513 lr:0.0000100 epoch_Time:23664.0min: [2024-01-06 14:01:49,676][model8_pretrain.py][INFO] Epoch:[0/2](870700/4588595) loss:2.940 lr:0.0000100 epoch_Time:23664.0min: [2024-01-06 14:01:49,677][model8_pretrain.py][INFO] Epoch:[0/2](870700/4588595) loss:2.661 lr:0.0000100 epoch_Time:23664.0min: [2024-01-06 14:01:49,676][model8_pretrain.py][INFO] Epoch:[0/2](870700/4588595) loss:2.867 lr:0.0000100 epoch_Time:23664.0min: [2024-01-06 14:01:49,677][model8_pretrain.py][INFO] Epoch:[0/2](870700/4588595) loss:3.009 lr:0.0000100 epoch_Time:23664.0min: [2024-01-06 14:02:26,625][model8_pretrain.py][INFO] Epoch:[0/2](870800/4588595) loss:3.136 lr:0.0000100 epoch_Time:23664.0min: [2024-01-06 14:02:26,625][model8_pretrain.py][INFO] Epoch:[0/2](870800/4588595) loss:2.810 lr:0.0000100 epoch_Time:23664.0min: [2024-01-06 14:02:26,625][model8_pretrain.py][INFO] Epoch:[0/2](870800/4588595) loss:2.777 lr:0.0000100 epoch_Time:23664.0min: [2024-01-06 14:02:26,625][model8_pretrain.py][INFO] Epoch:[0/2](870800/4588595) loss:2.530 lr:0.0000100 epoch_Time:23664.0min: [2024-01-06 14:02:26,625][model8_pretrain.py][INFO] Epoch:[0/2](870800/4588595) loss:2.180 lr:0.0000100 epoch_Time:23664.0min: [2024-01-06 14:02:26,625][model8_pretrain.py][INFO] Epoch:[0/2](870800/4588595) loss:2.381 lr:0.0000100 epoch_Time:23664.0min: [2024-01-06 14:02:26,625][model8_pretrain.py][INFO] Epoch:[0/2](870800/4588595) loss:3.095 lr:0.0000100 epoch_Time:23664.0min: [2024-01-06 14:02:26,626][model8_pretrain.py][INFO] Epoch:[0/2](870800/4588595) loss:2.793 lr:0.0000100 epoch_Time:23664.0min: [2024-01-06 14:03:03,573][model8_pretrain.py][INFO] Epoch:[0/2](870900/4588595) loss:2.645 lr:0.0000100 
epoch_Time:23663.0min: [2024-01-06 14:03:03,573][model8_pretrain.py][INFO] Epoch:[0/2](870900/4588595) loss:3.111 lr:0.0000100 epoch_Time:23663.0min: [2024-01-06 14:03:03,573][model8_pretrain.py][INFO] Epoch:[0/2](870900/4588595) loss:2.625 lr:0.0000100 epoch_Time:23663.0min: [2024-01-06 14:03:03,573][model8_pretrain.py][INFO] Epoch:[0/2](870900/4588595) loss:2.234 lr:0.0000100 epoch_Time:23663.0min: [2024-01-06 14:03:03,573][model8_pretrain.py][INFO] Epoch:[0/2](870900/4588595) loss:2.463 lr:0.0000100 epoch_Time:23663.0min: [2024-01-06 14:03:03,573][model8_pretrain.py][INFO] Epoch:[0/2](870900/4588595) loss:2.592 lr:0.0000100 epoch_Time:23663.0min: [2024-01-06 14:03:03,573][model8_pretrain.py][INFO] Epoch:[0/2](870900/4588595) loss:2.913 lr:0.0000100 epoch_Time:23663.0min: [2024-01-06 14:03:03,573][model8_pretrain.py][INFO] Epoch:[0/2](870900/4588595) loss:2.842 lr:0.0000100 epoch_Time:23663.0min: [2024-01-06 14:03:40,521][model8_pretrain.py][INFO] Epoch:[0/2](871000/4588595) loss:2.545 lr:0.0000100 epoch_Time:23663.0min: [2024-01-06 14:03:40,521][model8_pretrain.py][INFO] Epoch:[0/2](871000/4588595) loss:2.874 lr:0.0000100 epoch_Time:23663.0min: [2024-01-06 14:03:40,521][model8_pretrain.py][INFO] Epoch:[0/2](871000/4588595) loss:2.753 lr:0.0000100 epoch_Time:23663.0min: [2024-01-06 14:03:40,521][model8_pretrain.py][INFO] Epoch:[0/2](871000/4588595) loss:2.449 lr:0.0000100 epoch_Time:23663.0min: [2024-01-06 14:03:40,521][model8_pretrain.py][INFO] Epoch:[0/2](871000/4588595) loss:2.439 lr:0.0000100 epoch_Time:23663.0min: [2024-01-06 14:03:40,521][model8_pretrain.py][INFO] Epoch:[0/2](871000/4588595) loss:2.735 lr:0.0000100 epoch_Time:23663.0min: [2024-01-06 14:03:40,521][model8_pretrain.py][INFO] Epoch:[0/2](871000/4588595) loss:3.170 lr:0.0000100 epoch_Time:23663.0min: [2024-01-06 14:03:40,522][model8_pretrain.py][INFO] Epoch:[0/2](871000/4588595) loss:2.599 lr:0.0000100 epoch_Time:23663.0min: [2024-01-06 14:04:27,931][model8_pretrain.py][INFO] 
Epoch:[0/2](871100/4588595) loss:3.197 lr:0.0000100 epoch_Time:23663.0min: [2024-01-06 14:04:27,931][model8_pretrain.py][INFO] Epoch:[0/2](871100/4588595) loss:2.864 lr:0.0000100 epoch_Time:23663.0min: [2024-01-06 14:04:27,931][model8_pretrain.py][INFO] Epoch:[0/2](871100/4588595) loss:2.892 lr:0.0000100 epoch_Time:23663.0min: [2024-01-06 14:04:27,931][model8_pretrain.py][INFO] Epoch:[0/2](871100/4588595) loss:2.764 lr:0.0000100 epoch_Time:23663.0min: [2024-01-06 14:04:27,931][model8_pretrain.py][INFO] Epoch:[0/2](871100/4588595) loss:3.052 lr:0.0000100 epoch_Time:23663.0min: [2024-01-06 14:04:27,931][model8_pretrain.py][INFO] Epoch:[0/2](871100/4588595) loss:3.115 lr:0.0000100 epoch_Time:23663.0min: [2024-01-06 14:04:27,931][model8_pretrain.py][INFO] Epoch:[0/2](871100/4588595) loss:2.885 lr:0.0000100 epoch_Time:23663.0min: [2024-01-06 14:04:27,931][model8_pretrain.py][INFO] Epoch:[0/2](871100/4588595) loss:2.390 lr:0.0000100 epoch_Time:23663.0min: [2024-01-06 14:05:04,866][model8_pretrain.py][INFO] Epoch:[0/2](871200/4588595) loss:2.586 lr:0.0000100 epoch_Time:23661.0min: [2024-01-06 14:05:04,866][model8_pretrain.py][INFO] Epoch:[0/2](871200/4588595) loss:2.777 lr:0.0000100 epoch_Time:23661.0min: [2024-01-06 14:05:04,866][model8_pretrain.py][INFO] Epoch:[0/2](871200/4588595) loss:2.456 lr:0.0000100 epoch_Time:23661.0min: [2024-01-06 14:05:04,866][model8_pretrain.py][INFO] Epoch:[0/2](871200/4588595) loss:3.052 lr:0.0000100 epoch_Time:23661.0min: [2024-01-06 14:05:04,866][model8_pretrain.py][INFO] Epoch:[0/2](871200/4588595) loss:2.802 lr:0.0000100 epoch_Time:23661.0min: [2024-01-06 14:05:04,866][model8_pretrain.py][INFO] Epoch:[0/2](871200/4588595) loss:2.597 lr:0.0000100 epoch_Time:23661.0min: [2024-01-06 14:05:04,866][model8_pretrain.py][INFO] Epoch:[0/2](871200/4588595) loss:2.472 lr:0.0000100 epoch_Time:23661.0min: [2024-01-06 14:05:04,866][model8_pretrain.py][INFO] Epoch:[0/2](871200/4588595) loss:2.810 lr:0.0000100 epoch_Time:23661.0min: [2024-01-06 
14:05:41,776][model8_pretrain.py][INFO] Epoch:[0/2](871300/4588595) loss:3.153 lr:0.0000100 epoch_Time:23661.0min: [2024-01-06 14:05:41,776][model8_pretrain.py][INFO] Epoch:[0/2](871300/4588595) loss:2.935 lr:0.0000100 epoch_Time:23661.0min: [2024-01-06 14:05:41,776][model8_pretrain.py][INFO] Epoch:[0/2](871300/4588595) loss:2.319 lr:0.0000100 epoch_Time:23661.0min: [2024-01-06 14:05:41,776][model8_pretrain.py][INFO] Epoch:[0/2](871300/4588595) loss:2.329 lr:0.0000100 epoch_Time:23661.0min: [2024-01-06 14:05:41,776][model8_pretrain.py][INFO] Epoch:[0/2](871300/4588595) loss:2.884 lr:0.0000100 epoch_Time:23661.0min: [2024-01-06 14:05:41,776][model8_pretrain.py][INFO] Epoch:[0/2](871300/4588595) loss:2.735 lr:0.0000100 epoch_Time:23661.0min: [2024-01-06 14:05:41,776][model8_pretrain.py][INFO] Epoch:[0/2](871300/4588595) loss:3.073 lr:0.0000100 epoch_Time:23661.0min: [2024-01-06 14:05:41,776][model8_pretrain.py][INFO] Epoch:[0/2](871300/4588595) loss:2.555 lr:0.0000100 epoch_Time:23661.0min: [2024-01-06 14:06:18,724][model8_pretrain.py][INFO] Epoch:[0/2](871400/4588595) loss:2.646 lr:0.0000100 epoch_Time:23660.0min: [2024-01-06 14:06:18,724][model8_pretrain.py][INFO] Epoch:[0/2](871400/4588595) loss:2.274 lr:0.0000100 epoch_Time:23660.0min: [2024-01-06 14:06:18,724][model8_pretrain.py][INFO] Epoch:[0/2](871400/4588595) loss:2.847 lr:0.0000100 epoch_Time:23660.0min: [2024-01-06 14:06:18,724][model8_pretrain.py][INFO] Epoch:[0/2](871400/4588595) loss:2.768 lr:0.0000100 epoch_Time:23660.0min: [2024-01-06 14:06:18,724][model8_pretrain.py][INFO] Epoch:[0/2](871400/4588595) loss:2.913 lr:0.0000100 epoch_Time:23660.0min: [2024-01-06 14:06:18,724][model8_pretrain.py][INFO] Epoch:[0/2](871400/4588595) loss:3.017 lr:0.0000100 epoch_Time:23660.0min: [2024-01-06 14:06:18,724][model8_pretrain.py][INFO] Epoch:[0/2](871400/4588595) loss:2.646 lr:0.0000100 epoch_Time:23660.0min: [2024-01-06 14:06:18,725][model8_pretrain.py][INFO] Epoch:[0/2](871400/4588595) loss:2.676 lr:0.0000100 
epoch_Time:23660.0min: [2024-01-06 14:06:55,669][model8_pretrain.py][INFO] Epoch:[0/2](871500/4588595) loss:2.402 lr:0.0000100 epoch_Time:23659.0min: [2024-01-06 14:06:55,669][model8_pretrain.py][INFO] Epoch:[0/2](871500/4588595) loss:2.460 lr:0.0000100 epoch_Time:23659.0min: [2024-01-06 14:06:55,669][model8_pretrain.py][INFO] Epoch:[0/2](871500/4588595) loss:2.568 lr:0.0000100 epoch_Time:23659.0min: [2024-01-06 14:06:55,669][model8_pretrain.py][INFO] Epoch:[0/2](871500/4588595) loss:2.468 lr:0.0000100 epoch_Time:23659.0min: [2024-01-06 14:06:55,669][model8_pretrain.py][INFO] Epoch:[0/2](871500/4588595) loss:3.003 lr:0.0000100 epoch_Time:23659.0min: [2024-01-06 14:06:55,669][model8_pretrain.py][INFO] Epoch:[0/2](871500/4588595) loss:2.484 lr:0.0000100 epoch_Time:23659.0min: [2024-01-06 14:06:55,669][model8_pretrain.py][INFO] Epoch:[0/2](871500/4588595) loss:3.125 lr:0.0000100 epoch_Time:23659.0min: [2024-01-06 14:06:55,684][model8_pretrain.py][INFO] Epoch:[0/2](871500/4588595) loss:2.953 lr:0.0000100 epoch_Time:23659.0min: [2024-01-06 14:07:32,626][model8_pretrain.py][INFO] Epoch:[0/2](871600/4588595) loss:2.697 lr:0.0000100 epoch_Time:23659.0min: [2024-01-06 14:07:32,626][model8_pretrain.py][INFO] Epoch:[0/2](871600/4588595) loss:2.550 lr:0.0000100 epoch_Time:23659.0min: [2024-01-06 14:07:32,626][model8_pretrain.py][INFO] Epoch:[0/2](871600/4588595) loss:2.742 lr:0.0000100 epoch_Time:23659.0min: [2024-01-06 14:07:32,626][model8_pretrain.py][INFO] Epoch:[0/2](871600/4588595) loss:2.081 lr:0.0000100 epoch_Time:23659.0min: [2024-01-06 14:07:32,626][model8_pretrain.py][INFO] Epoch:[0/2](871600/4588595) loss:2.644 lr:0.0000100 epoch_Time:23659.0min: [2024-01-06 14:07:32,626][model8_pretrain.py][INFO] Epoch:[0/2](871600/4588595) loss:2.327 lr:0.0000100 epoch_Time:23659.0min: [2024-01-06 14:07:32,626][model8_pretrain.py][INFO] Epoch:[0/2](871600/4588595) loss:2.893 lr:0.0000100 epoch_Time:23659.0min: [2024-01-06 14:07:32,626][model8_pretrain.py][INFO] 
Epoch:[0/2](871600/4588595) loss:2.885 lr:0.0000100 epoch_Time:23659.0min: [2024-01-06 14:08:09,573][model8_pretrain.py][INFO] Epoch:[0/2](871700/4588595) loss:2.419 lr:0.0000100 epoch_Time:23658.0min: [2024-01-06 14:08:09,574][model8_pretrain.py][INFO] Epoch:[0/2](871700/4588595) loss:2.972 lr:0.0000100 epoch_Time:23658.0min: [2024-01-06 14:08:09,574][model8_pretrain.py][INFO] Epoch:[0/2](871700/4588595) loss:2.924 lr:0.0000100 epoch_Time:23658.0min: [2024-01-06 14:08:09,574][model8_pretrain.py][INFO] Epoch:[0/2](871700/4588595) loss:3.009 lr:0.0000100 epoch_Time:23658.0min: [2024-01-06 14:08:09,574][model8_pretrain.py][INFO] Epoch:[0/2](871700/4588595) loss:3.288 lr:0.0000100 epoch_Time:23658.0min: [2024-01-06 14:08:09,574][model8_pretrain.py][INFO] Epoch:[0/2](871700/4588595) loss:2.949 lr:0.0000100 epoch_Time:23658.0min: [2024-01-06 14:08:09,574][model8_pretrain.py][INFO] Epoch:[0/2](871700/4588595) loss:2.767 lr:0.0000100 epoch_Time:23658.0min: [2024-01-06 14:08:09,574][model8_pretrain.py][INFO] Epoch:[0/2](871700/4588595) loss:2.734 lr:0.0000100 epoch_Time:23658.0min: [2024-01-06 14:08:46,513][model8_pretrain.py][INFO] Epoch:[0/2](871800/4588595) loss:2.803 lr:0.0000100 epoch_Time:23658.0min: [2024-01-06 14:08:46,513][model8_pretrain.py][INFO] Epoch:[0/2](871800/4588595) loss:3.025 lr:0.0000100 epoch_Time:23658.0min: [2024-01-06 14:08:46,513][model8_pretrain.py][INFO] Epoch:[0/2](871800/4588595) loss:3.001 lr:0.0000100 epoch_Time:23658.0min: [2024-01-06 14:08:46,513][model8_pretrain.py][INFO] Epoch:[0/2](871800/4588595) loss:2.981 lr:0.0000100 epoch_Time:23658.0min: [2024-01-06 14:08:46,513][model8_pretrain.py][INFO] Epoch:[0/2](871800/4588595) loss:2.555 lr:0.0000100 epoch_Time:23658.0min: [2024-01-06 14:08:46,513][model8_pretrain.py][INFO] Epoch:[0/2](871800/4588595) loss:2.656 lr:0.0000100 epoch_Time:23658.0min: [2024-01-06 14:08:46,513][model8_pretrain.py][INFO] Epoch:[0/2](871800/4588595) loss:2.262 lr:0.0000100 epoch_Time:23658.0min: [2024-01-06 
14:08:46,514][model8_pretrain.py][INFO] Epoch:[0/2](871800/4588595) loss:2.743 lr:0.0000100 epoch_Time:23658.0min: [2024-01-06 14:09:32,012][model8_pretrain.py][INFO] Epoch:[0/2](871900/4588595) loss:2.887 lr:0.0000100 epoch_Time:23657.0min: [2024-01-06 14:09:32,012][model8_pretrain.py][INFO] Epoch:[0/2](871900/4588595) loss:2.694 lr:0.0000100 epoch_Time:23657.0min: [2024-01-06 14:09:32,012][model8_pretrain.py][INFO] Epoch:[0/2](871900/4588595) loss:2.736 lr:0.0000100 epoch_Time:23657.0min: [2024-01-06 14:09:32,012][model8_pretrain.py][INFO] Epoch:[0/2](871900/4588595) loss:2.647 lr:0.0000100 epoch_Time:23657.0min: [2024-01-06 14:09:32,012][model8_pretrain.py][INFO] Epoch:[0/2](871900/4588595) loss:2.786 lr:0.0000100 epoch_Time:23657.0min: [2024-01-06 14:09:32,012][model8_pretrain.py][INFO] Epoch:[0/2](871900/4588595) loss:2.429 lr:0.0000100 epoch_Time:23657.0min: [2024-01-06 14:09:32,012][model8_pretrain.py][INFO] Epoch:[0/2](871900/4588595) loss:2.608 lr:0.0000100 epoch_Time:23657.0min: [2024-01-06 14:09:32,013][model8_pretrain.py][INFO] Epoch:[0/2](871900/4588595) loss:2.612 lr:0.0000100 epoch_Time:23657.0min: [2024-01-06 14:10:10,385][model8_pretrain.py][INFO] Epoch:[0/2](872000/4588595) loss:2.325 lr:0.0000100 epoch_Time:23656.0min: [2024-01-06 14:10:10,385][model8_pretrain.py][INFO] Epoch:[0/2](872000/4588595) loss:3.034 lr:0.0000100 epoch_Time:23656.0min: [2024-01-06 14:10:10,385][model8_pretrain.py][INFO] Epoch:[0/2](872000/4588595) loss:3.031 lr:0.0000100 epoch_Time:23656.0min: [2024-01-06 14:10:10,385][model8_pretrain.py][INFO] Epoch:[0/2](872000/4588595) loss:3.436 lr:0.0000100 epoch_Time:23656.0min: [2024-01-06 14:10:10,385][model8_pretrain.py][INFO] Epoch:[0/2](872000/4588595) loss:2.886 lr:0.0000100 epoch_Time:23656.0min: [2024-01-06 14:10:10,385][model8_pretrain.py][INFO] Epoch:[0/2](872000/4588595) loss:2.867 lr:0.0000100 epoch_Time:23656.0min: [2024-01-06 14:10:10,385][model8_pretrain.py][INFO] Epoch:[0/2](872000/4588595) loss:2.365 lr:0.0000100 
epoch_Time:23656.0min: [2024-01-06 14:10:10,385][model8_pretrain.py][INFO] Epoch:[0/2](872000/4588595) loss:2.546 lr:0.0000100 epoch_Time:23656.0min: [2024-01-06 14:10:47,325][model8_pretrain.py][INFO] Epoch:[0/2](872100/4588595) loss:2.961 lr:0.0000100 epoch_Time:23656.0min: [2024-01-06 14:10:47,325][model8_pretrain.py][INFO] Epoch:[0/2](872100/4588595) loss:2.901 lr:0.0000100 epoch_Time:23656.0min: [2024-01-06 14:10:47,325][model8_pretrain.py][INFO] Epoch:[0/2](872100/4588595) loss:2.451 lr:0.0000100 epoch_Time:23656.0min: [2024-01-06 14:10:47,325][model8_pretrain.py][INFO] Epoch:[0/2](872100/4588595) loss:2.549 lr:0.0000100 epoch_Time:23656.0min: [2024-01-06 14:10:47,326][model8_pretrain.py][INFO] Epoch:[0/2](872100/4588595) loss:2.800 lr:0.0000100 epoch_Time:23656.0min: [2024-01-06 14:10:47,325][model8_pretrain.py][INFO] Epoch:[0/2](872100/4588595) loss:3.009 lr:0.0000100 epoch_Time:23656.0min: [2024-01-06 14:10:47,326][model8_pretrain.py][INFO] Epoch:[0/2](872100/4588595) loss:2.732 lr:0.0000100 epoch_Time:23656.0min: [2024-01-06 14:10:47,326][model8_pretrain.py][INFO] Epoch:[0/2](872100/4588595) loss:3.443 lr:0.0000100 epoch_Time:23656.0min: [2024-01-06 14:11:24,274][model8_pretrain.py][INFO] Epoch:[0/2](872200/4588595) loss:2.751 lr:0.0000100 epoch_Time:23655.0min: [2024-01-06 14:11:24,274][model8_pretrain.py][INFO] Epoch:[0/2](872200/4588595) loss:2.832 lr:0.0000100 epoch_Time:23655.0min: [2024-01-06 14:11:24,274][model8_pretrain.py][INFO] Epoch:[0/2](872200/4588595) loss:2.641 lr:0.0000100 epoch_Time:23655.0min: [2024-01-06 14:11:24,274][model8_pretrain.py][INFO] Epoch:[0/2](872200/4588595) loss:2.398 lr:0.0000100 epoch_Time:23655.0min: [2024-01-06 14:11:24,274][model8_pretrain.py][INFO] Epoch:[0/2](872200/4588595) loss:3.178 lr:0.0000100 epoch_Time:23655.0min: [2024-01-06 14:11:24,274][model8_pretrain.py][INFO] Epoch:[0/2](872200/4588595) loss:2.952 lr:0.0000100 epoch_Time:23655.0min: [2024-01-06 14:11:24,274][model8_pretrain.py][INFO] 
Epoch:[0/2](872200/4588595) loss:2.470 lr:0.0000100 epoch_Time:23655.0min: [2024-01-06 14:11:24,275][model8_pretrain.py][INFO] Epoch:[0/2](872200/4588595) loss:3.166 lr:0.0000100 epoch_Time:23655.0min: [2024-01-06 14:12:01,218][model8_pretrain.py][INFO] Epoch:[0/2](872300/4588595) loss:2.493 lr:0.0000100 epoch_Time:23654.0min: [2024-01-06 14:12:01,218][model8_pretrain.py][INFO] Epoch:[0/2](872300/4588595) loss:2.573 lr:0.0000100 epoch_Time:23654.0min: [2024-01-06 14:12:01,218][model8_pretrain.py][INFO] Epoch:[0/2](872300/4588595) loss:2.746 lr:0.0000100 epoch_Time:23654.0min: [2024-01-06 14:12:01,218][model8_pretrain.py][INFO] Epoch:[0/2](872300/4588595) loss:2.807 lr:0.0000100 epoch_Time:23654.0min: [2024-01-06 14:12:01,218][model8_pretrain.py][INFO] Epoch:[0/2](872300/4588595) loss:2.855 lr:0.0000100 epoch_Time:23654.0min: [2024-01-06 14:12:01,218][model8_pretrain.py][INFO] Epoch:[0/2](872300/4588595) loss:3.197 lr:0.0000100 epoch_Time:23654.0min: [2024-01-06 14:12:01,218][model8_pretrain.py][INFO] Epoch:[0/2](872300/4588595) loss:2.378 lr:0.0000100 epoch_Time:23654.0min: [2024-01-06 14:12:01,218][model8_pretrain.py][INFO] Epoch:[0/2](872300/4588595) loss:2.887 lr:0.0000100 epoch_Time:23654.0min: [2024-01-06 14:12:38,166][model8_pretrain.py][INFO] Epoch:[0/2](872400/4588595) loss:2.788 lr:0.0000100 epoch_Time:23654.0min: [2024-01-06 14:12:38,166][model8_pretrain.py][INFO] Epoch:[0/2](872400/4588595) loss:2.294 lr:0.0000100 epoch_Time:23654.0min: [2024-01-06 14:12:38,167][model8_pretrain.py][INFO] Epoch:[0/2](872400/4588595) loss:2.931 lr:0.0000100 epoch_Time:23654.0min: [2024-01-06 14:12:38,168][model8_pretrain.py][INFO] Epoch:[0/2](872400/4588595) loss:2.780 lr:0.0000100 epoch_Time:23654.0min: [2024-01-06 14:12:38,168][model8_pretrain.py][INFO] Epoch:[0/2](872400/4588595) loss:2.661 lr:0.0000100 epoch_Time:23654.0min: [2024-01-06 14:12:38,168][model8_pretrain.py][INFO] Epoch:[0/2](872400/4588595) loss:2.577 lr:0.0000100 epoch_Time:23654.0min: [2024-01-06 
14:12:38,168][model8_pretrain.py][INFO] Epoch:[0/2](872400/4588595) loss:2.997 lr:0.0000100 epoch_Time:23654.0min: [2024-01-06 14:12:38,168][model8_pretrain.py][INFO] Epoch:[0/2](872400/4588595) loss:2.380 lr:0.0000100 epoch_Time:23654.0min: [2024-01-06 14:13:15,122][model8_pretrain.py][INFO] Epoch:[0/2](872500/4588595) loss:3.143 lr:0.0000100 epoch_Time:23653.0min: [2024-01-06 14:13:15,122][model8_pretrain.py][INFO] Epoch:[0/2](872500/4588595) loss:3.155 lr:0.0000100 epoch_Time:23653.0min: [2024-01-06 14:13:15,122][model8_pretrain.py][INFO] Epoch:[0/2](872500/4588595) loss:3.044 lr:0.0000100 epoch_Time:23653.0min: [2024-01-06 14:13:15,122][model8_pretrain.py][INFO] Epoch:[0/2](872500/4588595) loss:2.663 lr:0.0000100 epoch_Time:23653.0min: [2024-01-06 14:13:15,122][model8_pretrain.py][INFO] Epoch:[0/2](872500/4588595) loss:2.724 lr:0.0000100 epoch_Time:23653.0min: [2024-01-06 14:13:15,122][model8_pretrain.py][INFO] Epoch:[0/2](872500/4588595) loss:2.208 lr:0.0000100 epoch_Time:23653.0min: [2024-01-06 14:13:15,122][model8_pretrain.py][INFO] Epoch:[0/2](872500/4588595) loss:3.068 lr:0.0000100 epoch_Time:23653.0min: [2024-01-06 14:13:15,123][model8_pretrain.py][INFO] Epoch:[0/2](872500/4588595) loss:2.894 lr:0.0000100 epoch_Time:23653.0min: [2024-01-06 14:13:52,061][model8_pretrain.py][INFO] Epoch:[0/2](872600/4588595) loss:2.393 lr:0.0000100 epoch_Time:23652.0min: [2024-01-06 14:13:52,061][model8_pretrain.py][INFO] Epoch:[0/2](872600/4588595) loss:3.597 lr:0.0000100 epoch_Time:23652.0min: [2024-01-06 14:13:52,061][model8_pretrain.py][INFO] Epoch:[0/2](872600/4588595) loss:2.548 lr:0.0000100 epoch_Time:23652.0min: [2024-01-06 14:13:52,061][model8_pretrain.py][INFO] Epoch:[0/2](872600/4588595) loss:3.103 lr:0.0000100 epoch_Time:23652.0min: [2024-01-06 14:13:52,061][model8_pretrain.py][INFO] Epoch:[0/2](872600/4588595) loss:2.943 lr:0.0000100 epoch_Time:23652.0min: [2024-01-06 14:13:52,061][model8_pretrain.py][INFO] Epoch:[0/2](872600/4588595) loss:2.938 lr:0.0000100 
epoch_Time:23652.0min: [2024-01-06 14:13:52,061][model8_pretrain.py][INFO] Epoch:[0/2](872600/4588595) loss:2.692 lr:0.0000100 epoch_Time:23652.0min: [2024-01-06 14:13:52,061][model8_pretrain.py][INFO] Epoch:[0/2](872600/4588595) loss:3.433 lr:0.0000100 epoch_Time:23652.0min: [2024-01-06 14:14:37,631][model8_pretrain.py][INFO] Epoch:[0/2](872700/4588595) loss:3.095 lr:0.0000100 epoch_Time:23652.0min: [2024-01-06 14:14:37,631][model8_pretrain.py][INFO] Epoch:[0/2](872700/4588595) loss:2.493 lr:0.0000100 epoch_Time:23652.0min: [2024-01-06 14:14:37,631][model8_pretrain.py][INFO] Epoch:[0/2](872700/4588595) loss:2.489 lr:0.0000100 epoch_Time:23652.0min: [2024-01-06 14:14:37,631][model8_pretrain.py][INFO] Epoch:[0/2](872700/4588595) loss:2.358 lr:0.0000100 epoch_Time:23652.0min: [2024-01-06 14:14:37,631][model8_pretrain.py][INFO] Epoch:[0/2](872700/4588595) loss:3.117 lr:0.0000100 epoch_Time:23652.0min: [2024-01-06 14:14:37,631][model8_pretrain.py][INFO] Epoch:[0/2](872700/4588595) loss:2.945 lr:0.0000100 epoch_Time:23652.0min: [2024-01-06 14:14:37,631][model8_pretrain.py][INFO] Epoch:[0/2](872700/4588595) loss:2.351 lr:0.0000100 epoch_Time:23652.0min: [2024-01-06 14:14:37,631][model8_pretrain.py][INFO] Epoch:[0/2](872700/4588595) loss:2.519 lr:0.0000100 epoch_Time:23652.0min: [2024-01-06 14:15:16,015][model8_pretrain.py][INFO] Epoch:[0/2](872800/4588595) loss:2.558 lr:0.0000100 epoch_Time:23651.0min: [2024-01-06 14:15:16,015][model8_pretrain.py][INFO] Epoch:[0/2](872800/4588595) loss:2.834 lr:0.0000100 epoch_Time:23651.0min: [2024-01-06 14:15:16,015][model8_pretrain.py][INFO] Epoch:[0/2](872800/4588595) loss:3.306 lr:0.0000100 epoch_Time:23651.0min: [2024-01-06 14:15:16,015][model8_pretrain.py][INFO] Epoch:[0/2](872800/4588595) loss:2.940 lr:0.0000100 epoch_Time:23651.0min: [2024-01-06 14:15:16,015][model8_pretrain.py][INFO] Epoch:[0/2](872800/4588595) loss:2.979 lr:0.0000100 epoch_Time:23651.0min: [2024-01-06 14:15:16,015][model8_pretrain.py][INFO] 
Epoch:[0/2](872800/4588595) loss:2.579 lr:0.0000100 epoch_Time:23651.0min: [2024-01-06 14:15:16,015][model8_pretrain.py][INFO] Epoch:[0/2](872800/4588595) loss:2.472 lr:0.0000100 epoch_Time:23651.0min: [2024-01-06 14:15:16,015][model8_pretrain.py][INFO] Epoch:[0/2](872800/4588595) loss:3.028 lr:0.0000100 epoch_Time:23651.0min: [2024-01-06 14:15:52,969][model8_pretrain.py][INFO] Epoch:[0/2](872900/4588595) loss:3.023 lr:0.0000100 epoch_Time:23650.0min: [2024-01-06 14:15:52,969][model8_pretrain.py][INFO] Epoch:[0/2](872900/4588595) loss:2.846 lr:0.0000100 epoch_Time:23650.0min: [2024-01-06 14:15:52,969][model8_pretrain.py][INFO] Epoch:[0/2](872900/4588595) loss:2.807 lr:0.0000100 epoch_Time:23650.0min: [2024-01-06 14:15:52,969][model8_pretrain.py][INFO] Epoch:[0/2](872900/4588595) loss:3.168 lr:0.0000100 epoch_Time:23650.0min: [2024-01-06 14:15:52,969][model8_pretrain.py][INFO] Epoch:[0/2](872900/4588595) loss:2.646 lr:0.0000100 epoch_Time:23650.0min: [2024-01-06 14:15:52,969][model8_pretrain.py][INFO] Epoch:[0/2](872900/4588595) loss:2.698 lr:0.0000100 epoch_Time:23650.0min: [2024-01-06 14:15:52,969][model8_pretrain.py][INFO] Epoch:[0/2](872900/4588595) loss:2.791 lr:0.0000100 epoch_Time:23650.0min: [2024-01-06 14:15:52,969][model8_pretrain.py][INFO] Epoch:[0/2](872900/4588595) loss:2.727 lr:0.0000100 epoch_Time:23650.0min: [2024-01-06 14:16:29,926][model8_pretrain.py][INFO] Epoch:[0/2](873000/4588595) loss:3.024 lr:0.0000100 epoch_Time:23650.0min: [2024-01-06 14:16:29,926][model8_pretrain.py][INFO] Epoch:[0/2](873000/4588595) loss:2.463 lr:0.0000100 epoch_Time:23650.0min: [2024-01-06 14:16:29,926][model8_pretrain.py][INFO] Epoch:[0/2](873000/4588595) loss:2.503 lr:0.0000100 epoch_Time:23650.0min: [2024-01-06 14:16:29,926][model8_pretrain.py][INFO] Epoch:[0/2](873000/4588595) loss:2.971 lr:0.0000100 epoch_Time:23650.0min: [2024-01-06 14:16:29,926][model8_pretrain.py][INFO] Epoch:[0/2](873000/4588595) loss:2.425 lr:0.0000100 epoch_Time:23650.0min: [2024-01-06 
14:16:29,926][model8_pretrain.py][INFO] Epoch:[0/2](873000/4588595) loss:3.222 lr:0.0000100 epoch_Time:23650.0min: [2024-01-06 14:16:29,927][model8_pretrain.py][INFO] Epoch:[0/2](873000/4588595) loss:2.661 lr:0.0000100 epoch_Time:23650.0min: [2024-01-06 14:16:29,927][model8_pretrain.py][INFO] Epoch:[0/2](873000/4588595) loss:2.541 lr:0.0000100 epoch_Time:23650.0min: [2024-01-06 14:17:06,879][model8_pretrain.py][INFO] Epoch:[0/2](873100/4588595) loss:2.709 lr:0.0000100 epoch_Time:23649.0min: [2024-01-06 14:17:06,879][model8_pretrain.py][INFO] Epoch:[0/2](873100/4588595) loss:2.788 lr:0.0000100 epoch_Time:23649.0min: [2024-01-06 14:17:06,879][model8_pretrain.py][INFO] Epoch:[0/2](873100/4588595) loss:2.589 lr:0.0000100 epoch_Time:23649.0min: [2024-01-06 14:17:06,879][model8_pretrain.py][INFO] Epoch:[0/2](873100/4588595) loss:2.936 lr:0.0000100 epoch_Time:23649.0min: [2024-01-06 14:17:06,879][model8_pretrain.py][INFO] Epoch:[0/2](873100/4588595) loss:3.144 lr:0.0000100 epoch_Time:23649.0min: [2024-01-06 14:17:06,879][model8_pretrain.py][INFO] Epoch:[0/2](873100/4588595) loss:2.746 lr:0.0000100 epoch_Time:23649.0min: [2024-01-06 14:17:06,879][model8_pretrain.py][INFO] Epoch:[0/2](873100/4588595) loss:2.803 lr:0.0000100 epoch_Time:23649.0min: [2024-01-06 14:17:06,879][model8_pretrain.py][INFO] Epoch:[0/2](873100/4588595) loss:3.058 lr:0.0000100 epoch_Time:23649.0min: [2024-01-06 14:17:43,832][model8_pretrain.py][INFO] Epoch:[0/2](873200/4588595) loss:3.317 lr:0.0000100 epoch_Time:23649.0min: [2024-01-06 14:17:43,833][model8_pretrain.py][INFO] Epoch:[0/2](873200/4588595) loss:2.157 lr:0.0000100 epoch_Time:23649.0min: [2024-01-06 14:17:43,833][model8_pretrain.py][INFO] Epoch:[0/2](873200/4588595) loss:2.208 lr:0.0000100 epoch_Time:23649.0min: [2024-01-06 14:17:43,833][model8_pretrain.py][INFO] Epoch:[0/2](873200/4588595) loss:2.355 lr:0.0000100 epoch_Time:23649.0min: [2024-01-06 14:17:43,833][model8_pretrain.py][INFO] Epoch:[0/2](873200/4588595) loss:2.540 lr:0.0000100 
epoch_Time:23649.0min: [2024-01-06 14:17:43,833][model8_pretrain.py][INFO] Epoch:[0/2](873200/4588595) loss:2.551 lr:0.0000100 epoch_Time:23649.0min: [2024-01-06 14:17:43,833][model8_pretrain.py][INFO] Epoch:[0/2](873200/4588595) loss:2.375 lr:0.0000100 epoch_Time:23649.0min: [2024-01-06 14:17:43,833][model8_pretrain.py][INFO] Epoch:[0/2](873200/4588595) loss:2.872 lr:0.0000100 epoch_Time:23649.0min: [2024-01-06 14:18:20,790][model8_pretrain.py][INFO] Epoch:[0/2](873300/4588595) loss:2.969 lr:0.0000100 epoch_Time:23648.0min: [2024-01-06 14:18:20,790][model8_pretrain.py][INFO] Epoch:[0/2](873300/4588595) loss:3.101 lr:0.0000100 epoch_Time:23648.0min: [2024-01-06 14:18:20,790][model8_pretrain.py][INFO] Epoch:[0/2](873300/4588595) loss:3.006 lr:0.0000100 epoch_Time:23648.0min: [2024-01-06 14:18:20,790][model8_pretrain.py][INFO] Epoch:[0/2](873300/4588595) loss:3.069 lr:0.0000100 epoch_Time:23648.0min: [2024-01-06 14:18:20,790][model8_pretrain.py][INFO] Epoch:[0/2](873300/4588595) loss:2.838 lr:0.0000100 epoch_Time:23648.0min: [2024-01-06 14:18:20,790][model8_pretrain.py][INFO] Epoch:[0/2](873300/4588595) loss:3.287 lr:0.0000100 epoch_Time:23648.0min: [2024-01-06 14:18:20,790][model8_pretrain.py][INFO] Epoch:[0/2](873300/4588595) loss:2.441 lr:0.0000100 epoch_Time:23648.0min: [2024-01-06 14:18:20,790][model8_pretrain.py][INFO] Epoch:[0/2](873300/4588595) loss:2.152 lr:0.0000100 epoch_Time:23648.0min: [2024-01-06 14:18:57,766][model8_pretrain.py][INFO] Epoch:[0/2](873400/4588595) loss:3.064 lr:0.0000100 epoch_Time:23647.0min: [2024-01-06 14:18:57,767][model8_pretrain.py][INFO] Epoch:[0/2](873400/4588595) loss:3.305 lr:0.0000100 epoch_Time:23647.0min: [2024-01-06 14:18:57,767][model8_pretrain.py][INFO] Epoch:[0/2](873400/4588595) loss:2.955 lr:0.0000100 epoch_Time:23647.0min: [2024-01-06 14:18:57,767][model8_pretrain.py][INFO] Epoch:[0/2](873400/4588595) loss:2.637 lr:0.0000100 epoch_Time:23647.0min: [2024-01-06 14:18:57,767][model8_pretrain.py][INFO] 
Epoch:[0/2](873400/4588595) loss:2.826 lr:0.0000100 epoch_Time:23647.0min: [2024-01-06 14:18:57,767][model8_pretrain.py][INFO] Epoch:[0/2](873400/4588595) loss:2.883 lr:0.0000100 epoch_Time:23647.0min: [2024-01-06 14:18:57,767][model8_pretrain.py][INFO] Epoch:[0/2](873400/4588595) loss:2.333 lr:0.0000100 epoch_Time:23647.0min: [2024-01-06 14:18:57,767][model8_pretrain.py][INFO] Epoch:[0/2](873400/4588595) loss:2.720 lr:0.0000100 epoch_Time:23647.0min: [2024-01-06 14:19:43,226][model8_pretrain.py][INFO] Epoch:[0/2](873500/4588595) loss:2.893 lr:0.0000100 epoch_Time:23647.0min: [2024-01-06 14:19:43,226][model8_pretrain.py][INFO] Epoch:[0/2](873500/4588595) loss:3.231 lr:0.0000100 epoch_Time:23647.0min: [2024-01-06 14:19:43,226][model8_pretrain.py][INFO] Epoch:[0/2](873500/4588595) loss:2.529 lr:0.0000100 epoch_Time:23647.0min: [2024-01-06 14:19:43,226][model8_pretrain.py][INFO] Epoch:[0/2](873500/4588595) loss:3.067 lr:0.0000100 epoch_Time:23647.0min: [2024-01-06 14:19:43,226][model8_pretrain.py][INFO] Epoch:[0/2](873500/4588595) loss:3.242 lr:0.0000100 epoch_Time:23647.0min: [2024-01-06 14:19:43,226][model8_pretrain.py][INFO] Epoch:[0/2](873500/4588595) loss:3.254 lr:0.0000100 epoch_Time:23647.0min: [2024-01-06 14:19:43,226][model8_pretrain.py][INFO] Epoch:[0/2](873500/4588595) loss:3.032 lr:0.0000100 epoch_Time:23647.0min: [2024-01-06 14:19:43,226][model8_pretrain.py][INFO] Epoch:[0/2](873500/4588595) loss:3.013 lr:0.0000100 epoch_Time:23647.0min: [2024-01-06 14:20:21,635][model8_pretrain.py][INFO] Epoch:[0/2](873600/4588595) loss:2.535 lr:0.0000100 epoch_Time:23646.0min: [2024-01-06 14:20:21,635][model8_pretrain.py][INFO] Epoch:[0/2](873600/4588595) loss:2.479 lr:0.0000100 epoch_Time:23646.0min: [2024-01-06 14:20:21,635][model8_pretrain.py][INFO] Epoch:[0/2](873600/4588595) loss:3.168 lr:0.0000100 epoch_Time:23646.0min: [2024-01-06 14:20:21,634][model8_pretrain.py][INFO] Epoch:[0/2](873600/4588595) loss:3.050 lr:0.0000100 epoch_Time:23646.0min: [2024-01-06 
14:20:21,635][model8_pretrain.py][INFO] Epoch:[0/2](873600/4588595) loss:2.364 lr:0.0000100 epoch_Time:23646.0min: [2024-01-06 14:20:21,635][model8_pretrain.py][INFO] Epoch:[0/2](873600/4588595) loss:2.770 lr:0.0000100 epoch_Time:23646.0min: [2024-01-06 14:20:21,635][model8_pretrain.py][INFO] Epoch:[0/2](873600/4588595) loss:2.874 lr:0.0000100 epoch_Time:23646.0min: [2024-01-06 14:20:21,635][model8_pretrain.py][INFO] Epoch:[0/2](873600/4588595) loss:2.435 lr:0.0000100 epoch_Time:23646.0min: [2024-01-06 14:20:58,574][model8_pretrain.py][INFO] Epoch:[0/2](873700/4588595) loss:3.022 lr:0.0000100 epoch_Time:23645.0min: [2024-01-06 14:20:58,574][model8_pretrain.py][INFO] Epoch:[0/2](873700/4588595) loss:2.241 lr:0.0000100 epoch_Time:23645.0min: [2024-01-06 14:20:58,574][model8_pretrain.py][INFO] Epoch:[0/2](873700/4588595) loss:2.999 lr:0.0000100 epoch_Time:23645.0min: [2024-01-06 14:20:58,574][model8_pretrain.py][INFO] Epoch:[0/2](873700/4588595) loss:2.641 lr:0.0000100 epoch_Time:23645.0min: [2024-01-06 14:20:58,574][model8_pretrain.py][INFO] Epoch:[0/2](873700/4588595) loss:2.708 lr:0.0000100 epoch_Time:23645.0min: [2024-01-06 14:20:58,575][model8_pretrain.py][INFO] Epoch:[0/2](873700/4588595) loss:3.044 lr:0.0000100 epoch_Time:23645.0min: [2024-01-06 14:20:58,575][model8_pretrain.py][INFO] Epoch:[0/2](873700/4588595) loss:2.649 lr:0.0000100 epoch_Time:23645.0min: [2024-01-06 14:20:58,574][model8_pretrain.py][INFO] Epoch:[0/2](873700/4588595) loss:2.713 lr:0.0000100 epoch_Time:23645.0min: [2024-01-06 14:21:35,515][model8_pretrain.py][INFO] Epoch:[0/2](873800/4588595) loss:2.395 lr:0.0000100 epoch_Time:23645.0min: [2024-01-06 14:21:35,515][model8_pretrain.py][INFO] Epoch:[0/2](873800/4588595) loss:2.492 lr:0.0000100 epoch_Time:23645.0min: [2024-01-06 14:21:35,515][model8_pretrain.py][INFO] Epoch:[0/2](873800/4588595) loss:2.856 lr:0.0000100 epoch_Time:23645.0min: [2024-01-06 14:21:35,515][model8_pretrain.py][INFO] Epoch:[0/2](873800/4588595) loss:2.867 lr:0.0000100 
epoch_Time:23645.0min: [2024-01-06 14:21:35,515][model8_pretrain.py][INFO] Epoch:[0/2](873800/4588595) loss:2.932 lr:0.0000100 epoch_Time:23645.0min: [2024-01-06 14:21:35,515][model8_pretrain.py][INFO] Epoch:[0/2](873800/4588595) loss:2.518 lr:0.0000100 epoch_Time:23645.0min: [2024-01-06 14:21:35,515][model8_pretrain.py][INFO] Epoch:[0/2](873800/4588595) loss:3.409 lr:0.0000100 epoch_Time:23645.0min: [2024-01-06 14:21:35,516][model8_pretrain.py][INFO] Epoch:[0/2](873800/4588595) loss:3.075 lr:0.0000100 epoch_Time:23645.0min: [2024-01-06 14:22:12,466][model8_pretrain.py][INFO] Epoch:[0/2](873900/4588595) loss:3.064 lr:0.0000100 epoch_Time:23644.0min: [2024-01-06 14:22:12,466][model8_pretrain.py][INFO] Epoch:[0/2](873900/4588595) loss:2.445 lr:0.0000100 epoch_Time:23644.0min: [2024-01-06 14:22:12,466][model8_pretrain.py][INFO] Epoch:[0/2](873900/4588595) loss:2.929 lr:0.0000100 epoch_Time:23644.0min: [2024-01-06 14:22:12,466][model8_pretrain.py][INFO] Epoch:[0/2](873900/4588595) loss:2.890 lr:0.0000100 epoch_Time:23644.0min: [2024-01-06 14:22:12,466][model8_pretrain.py][INFO] Epoch:[0/2](873900/4588595) loss:2.998 lr:0.0000100 epoch_Time:23644.0min: [2024-01-06 14:22:12,466][model8_pretrain.py][INFO] Epoch:[0/2](873900/4588595) loss:2.337 lr:0.0000100 epoch_Time:23644.0min: [2024-01-06 14:22:12,466][model8_pretrain.py][INFO] Epoch:[0/2](873900/4588595) loss:2.324 lr:0.0000100 epoch_Time:23644.0min: [2024-01-06 14:22:12,466][model8_pretrain.py][INFO] Epoch:[0/2](873900/4588595) loss:3.023 lr:0.0000100 epoch_Time:23644.0min: [2024-01-06 14:22:49,413][model8_pretrain.py][INFO] Epoch:[0/2](874000/4588595) loss:2.841 lr:0.0000100 epoch_Time:23643.0min: [2024-01-06 14:22:49,413][model8_pretrain.py][INFO] Epoch:[0/2](874000/4588595) loss:2.747 lr:0.0000100 epoch_Time:23643.0min: [2024-01-06 14:22:49,413][model8_pretrain.py][INFO] Epoch:[0/2](874000/4588595) loss:2.655 lr:0.0000100 epoch_Time:23643.0min: [2024-01-06 14:22:49,413][model8_pretrain.py][INFO] 
Epoch:[0/2](874000/4588595) loss:3.235 lr:0.0000100 epoch_Time:23643.0min: [2024-01-06 14:22:49,413][model8_pretrain.py][INFO] Epoch:[0/2](874000/4588595) loss:2.881 lr:0.0000100 epoch_Time:23643.0min: [2024-01-06 14:22:49,413][model8_pretrain.py][INFO] Epoch:[0/2](874000/4588595) loss:2.877 lr:0.0000100 epoch_Time:23643.0min: [2024-01-06 14:22:49,414][model8_pretrain.py][INFO] Epoch:[0/2](874000/4588595) loss:2.473 lr:0.0000100 epoch_Time:23643.0min: [2024-01-06 14:22:49,414][model8_pretrain.py][INFO] Epoch:[0/2](874000/4588595) loss:2.770 lr:0.0000100 epoch_Time:23643.0min: [2024-01-06 14:23:26,356][model8_pretrain.py][INFO] Epoch:[0/2](874100/4588595) loss:2.245 lr:0.0000100 epoch_Time:23643.0min: [2024-01-06 14:23:26,356][model8_pretrain.py][INFO] Epoch:[0/2](874100/4588595) loss:3.227 lr:0.0000100 epoch_Time:23643.0min: [2024-01-06 14:23:26,356][model8_pretrain.py][INFO] Epoch:[0/2](874100/4588595) loss:2.361 lr:0.0000100 epoch_Time:23643.0min: [2024-01-06 14:23:26,356][model8_pretrain.py][INFO] Epoch:[0/2](874100/4588595) loss:2.656 lr:0.0000100 epoch_Time:23643.0min: [2024-01-06 14:23:26,356][model8_pretrain.py][INFO] Epoch:[0/2](874100/4588595) loss:3.158 lr:0.0000100 epoch_Time:23643.0min: [2024-01-06 14:23:26,356][model8_pretrain.py][INFO] Epoch:[0/2](874100/4588595) loss:2.995 lr:0.0000100 epoch_Time:23643.0min: [2024-01-06 14:23:26,357][model8_pretrain.py][INFO] Epoch:[0/2](874100/4588595) loss:3.188 lr:0.0000100 epoch_Time:23643.0min: [2024-01-06 14:23:26,357][model8_pretrain.py][INFO] Epoch:[0/2](874100/4588595) loss:2.461 lr:0.0000100 epoch_Time:23643.0min: [2024-01-06 14:24:03,303][model8_pretrain.py][INFO] Epoch:[0/2](874200/4588595) loss:2.888 lr:0.0000100 epoch_Time:23642.0min: [2024-01-06 14:24:03,303][model8_pretrain.py][INFO] Epoch:[0/2](874200/4588595) loss:2.542 lr:0.0000100 epoch_Time:23642.0min: [2024-01-06 14:24:03,303][model8_pretrain.py][INFO] Epoch:[0/2](874200/4588595) loss:3.326 lr:0.0000100 epoch_Time:23642.0min: [2024-01-06 
14:24:03,303][model8_pretrain.py][INFO] Epoch:[0/2](874200/4588595) loss:3.071 lr:0.0000100 epoch_Time:23642.0min: [2024-01-06 14:24:03,303][model8_pretrain.py][INFO] Epoch:[0/2](874200/4588595) loss:2.651 lr:0.0000100 epoch_Time:23642.0min: [2024-01-06 14:24:03,303][model8_pretrain.py][INFO] Epoch:[0/2](874200/4588595) loss:2.582 lr:0.0000100 epoch_Time:23642.0min: [2024-01-06 14:24:03,303][model8_pretrain.py][INFO] Epoch:[0/2](874200/4588595) loss:2.637 lr:0.0000100 epoch_Time:23642.0min: [2024-01-06 14:24:03,303][model8_pretrain.py][INFO] Epoch:[0/2](874200/4588595) loss:2.799 lr:0.0000100 epoch_Time:23642.0min: [2024-01-06 14:24:47,012][model8_pretrain.py][INFO] Epoch:[0/2](874300/4588595) loss:2.539 lr:0.0000100 epoch_Time:23642.0min: [2024-01-06 14:24:47,012][model8_pretrain.py][INFO] Epoch:[0/2](874300/4588595) loss:2.899 lr:0.0000100 epoch_Time:23642.0min: [2024-01-06 14:24:47,012][model8_pretrain.py][INFO] Epoch:[0/2](874300/4588595) loss:2.762 lr:0.0000100 epoch_Time:23642.0min: [2024-01-06 14:24:47,012][model8_pretrain.py][INFO] Epoch:[0/2](874300/4588595) loss:2.675 lr:0.0000100 epoch_Time:23642.0min: [2024-01-06 14:24:47,012][model8_pretrain.py][INFO] Epoch:[0/2](874300/4588595) loss:3.372 lr:0.0000100 epoch_Time:23642.0min: [2024-01-06 14:24:47,012][model8_pretrain.py][INFO] Epoch:[0/2](874300/4588595) loss:2.137 lr:0.0000100 epoch_Time:23642.0min: [2024-01-06 14:24:47,012][model8_pretrain.py][INFO] Epoch:[0/2](874300/4588595) loss:2.210 lr:0.0000100 epoch_Time:23642.0min: [2024-01-06 14:24:47,013][model8_pretrain.py][INFO] Epoch:[0/2](874300/4588595) loss:2.786 lr:0.0000100 epoch_Time:23642.0min: [2024-01-06 14:25:27,298][model8_pretrain.py][INFO] Epoch:[0/2](874400/4588595) loss:2.481 lr:0.0000100 epoch_Time:23641.0min: [2024-01-06 14:25:27,298][model8_pretrain.py][INFO] Epoch:[0/2](874400/4588595) loss:2.737 lr:0.0000100 epoch_Time:23641.0min: [2024-01-06 14:25:27,298][model8_pretrain.py][INFO] Epoch:[0/2](874400/4588595) loss:1.875 lr:0.0000100 
epoch_Time:23641.0min: [2024-01-06 14:25:27,298][model8_pretrain.py][INFO] Epoch:[0/2](874400/4588595) loss:2.604 lr:0.0000100 epoch_Time:23641.0min: [2024-01-06 14:25:27,298][model8_pretrain.py][INFO] Epoch:[0/2](874400/4588595) loss:3.107 lr:0.0000100 epoch_Time:23641.0min: [2024-01-06 14:25:27,299][model8_pretrain.py][INFO] Epoch:[0/2](874400/4588595) loss:2.865 lr:0.0000100 epoch_Time:23641.0min: [2024-01-06 14:25:27,298][model8_pretrain.py][INFO] Epoch:[0/2](874400/4588595) loss:2.251 lr:0.0000100 epoch_Time:23641.0min: [2024-01-06 14:25:27,299][model8_pretrain.py][INFO] Epoch:[0/2](874400/4588595) loss:2.943 lr:0.0000100 epoch_Time:23641.0min: [2024-01-06 14:26:04,253][model8_pretrain.py][INFO] Epoch:[0/2](874500/4588595) loss:2.969 lr:0.0000100 epoch_Time:23640.0min: [2024-01-06 14:26:04,253][model8_pretrain.py][INFO] Epoch:[0/2](874500/4588595) loss:3.241 lr:0.0000100 epoch_Time:23640.0min: [2024-01-06 14:26:04,253][model8_pretrain.py][INFO] Epoch:[0/2](874500/4588595) loss:2.899 lr:0.0000100 epoch_Time:23640.0min: [2024-01-06 14:26:04,253][model8_pretrain.py][INFO] Epoch:[0/2](874500/4588595) loss:3.010 lr:0.0000100 epoch_Time:23640.0min: [2024-01-06 14:26:04,253][model8_pretrain.py][INFO] Epoch:[0/2](874500/4588595) loss:3.458 lr:0.0000100 epoch_Time:23640.0min: [2024-01-06 14:26:04,253][model8_pretrain.py][INFO] Epoch:[0/2](874500/4588595) loss:3.254 lr:0.0000100 epoch_Time:23640.0min: [2024-01-06 14:26:04,253][model8_pretrain.py][INFO] Epoch:[0/2](874500/4588595) loss:3.209 lr:0.0000100 epoch_Time:23640.0min: [2024-01-06 14:26:04,253][model8_pretrain.py][INFO] Epoch:[0/2](874500/4588595) loss:2.490 lr:0.0000100 epoch_Time:23640.0min: [2024-01-06 14:26:41,203][model8_pretrain.py][INFO] Epoch:[0/2](874600/4588595) loss:2.978 lr:0.0000100 epoch_Time:23640.0min: [2024-01-06 14:26:41,203][model8_pretrain.py][INFO] Epoch:[0/2](874600/4588595) loss:3.099 lr:0.0000100 epoch_Time:23640.0min: [2024-01-06 14:26:41,203][model8_pretrain.py][INFO] 
Epoch:[0/2](874600/4588595) loss:2.514 lr:0.0000100 epoch_Time:23640.0min: [2024-01-06 14:26:41,203][model8_pretrain.py][INFO] Epoch:[0/2](874600/4588595) loss:3.088 lr:0.0000100 epoch_Time:23640.0min: [2024-01-06 14:26:41,203][model8_pretrain.py][INFO] Epoch:[0/2](874600/4588595) loss:2.780 lr:0.0000100 epoch_Time:23640.0min: [2024-01-06 14:26:41,203][model8_pretrain.py][INFO] Epoch:[0/2](874600/4588595) loss:2.479 lr:0.0000100 epoch_Time:23640.0min: [2024-01-06 14:26:41,203][model8_pretrain.py][INFO] Epoch:[0/2](874600/4588595) loss:3.230 lr:0.0000100 epoch_Time:23640.0min: [2024-01-06 14:26:41,203][model8_pretrain.py][INFO] Epoch:[0/2](874600/4588595) loss:2.347 lr:0.0000100 epoch_Time:23640.0min: [2024-01-06 14:27:18,160][model8_pretrain.py][INFO] Epoch:[0/2](874700/4588595) loss:2.420 lr:0.0000100 epoch_Time:23639.0min: [2024-01-06 14:27:18,160][model8_pretrain.py][INFO] Epoch:[0/2](874700/4588595) loss:3.131 lr:0.0000100 epoch_Time:23639.0min: [2024-01-06 14:27:18,160][model8_pretrain.py][INFO] Epoch:[0/2](874700/4588595) loss:3.095 lr:0.0000100 epoch_Time:23639.0min: [2024-01-06 14:27:18,160][model8_pretrain.py][INFO] Epoch:[0/2](874700/4588595) loss:2.424 lr:0.0000100 epoch_Time:23639.0min: [2024-01-06 14:27:18,160][model8_pretrain.py][INFO] Epoch:[0/2](874700/4588595) loss:3.335 lr:0.0000100 epoch_Time:23639.0min: [2024-01-06 14:27:18,160][model8_pretrain.py][INFO] Epoch:[0/2](874700/4588595) loss:2.987 lr:0.0000100 epoch_Time:23639.0min: [2024-01-06 14:27:18,160][model8_pretrain.py][INFO] Epoch:[0/2](874700/4588595) loss:2.159 lr:0.0000100 epoch_Time:23639.0min: [2024-01-06 14:27:18,161][model8_pretrain.py][INFO] Epoch:[0/2](874700/4588595) loss:2.981 lr:0.0000100 epoch_Time:23639.0min: [2024-01-06 14:27:55,112][model8_pretrain.py][INFO] Epoch:[0/2](874800/4588595) loss:2.962 lr:0.0000100 epoch_Time:23638.0min: [2024-01-06 14:27:55,112][model8_pretrain.py][INFO] Epoch:[0/2](874800/4588595) loss:3.191 lr:0.0000100 epoch_Time:23638.0min: [2024-01-06 
14:27:55,112][model8_pretrain.py][INFO] Epoch:[0/2](874800/4588595) loss:2.975 lr:0.0000100 epoch_Time:23638.0min: [2024-01-06 14:27:55,112][model8_pretrain.py][INFO] Epoch:[0/2](874800/4588595) loss:3.447 lr:0.0000100 epoch_Time:23638.0min: [2024-01-06 14:27:55,112][model8_pretrain.py][INFO] Epoch:[0/2](874800/4588595) loss:2.736 lr:0.0000100 epoch_Time:23638.0min: [2024-01-06 14:27:55,112][model8_pretrain.py][INFO] Epoch:[0/2](874800/4588595) loss:2.811 lr:0.0000100 epoch_Time:23638.0min: [2024-01-06 14:27:55,112][model8_pretrain.py][INFO] Epoch:[0/2](874800/4588595) loss:2.202 lr:0.0000100 epoch_Time:23638.0min: [2024-01-06 14:27:55,112][model8_pretrain.py][INFO] Epoch:[0/2](874800/4588595) loss:3.068 lr:0.0000100 epoch_Time:23638.0min: [2024-01-06 14:28:32,100][model8_pretrain.py][INFO] Epoch:[0/2](874900/4588595) loss:2.953 lr:0.0000100 epoch_Time:23638.0min: [2024-01-06 14:28:32,100][model8_pretrain.py][INFO] Epoch:[0/2](874900/4588595) loss:2.642 lr:0.0000100 epoch_Time:23638.0min: [2024-01-06 14:28:32,100][model8_pretrain.py][INFO] Epoch:[0/2](874900/4588595) loss:2.349 lr:0.0000100 epoch_Time:23638.0min: [2024-01-06 14:28:32,100][model8_pretrain.py][INFO] Epoch:[0/2](874900/4588595) loss:2.994 lr:0.0000100 epoch_Time:23638.0min: [2024-01-06 14:28:32,100][model8_pretrain.py][INFO] Epoch:[0/2](874900/4588595) loss:2.349 lr:0.0000100 epoch_Time:23638.0min: [2024-01-06 14:28:32,100][model8_pretrain.py][INFO] Epoch:[0/2](874900/4588595) loss:2.639 lr:0.0000100 epoch_Time:23638.0min: [2024-01-06 14:28:32,100][model8_pretrain.py][INFO] Epoch:[0/2](874900/4588595) loss:3.069 lr:0.0000100 epoch_Time:23638.0min: [2024-01-06 14:28:32,100][model8_pretrain.py][INFO] Epoch:[0/2](874900/4588595) loss:2.810 lr:0.0000100 epoch_Time:23638.0min: [2024-01-06 14:29:09,066][model8_pretrain.py][INFO] Epoch:[0/2](875000/4588595) loss:2.941 lr:0.0000100 epoch_Time:23637.0min: [2024-01-06 14:29:09,066][model8_pretrain.py][INFO] Epoch:[0/2](875000/4588595) loss:1.936 lr:0.0000100 
epoch_Time:23637.0min: [2024-01-06 14:29:09,067][model8_pretrain.py][INFO] Epoch:[0/2](875000/4588595) loss:2.795 lr:0.0000100 epoch_Time:23637.0min: [2024-01-06 14:29:09,067][model8_pretrain.py][INFO] Epoch:[0/2](875000/4588595) loss:3.179 lr:0.0000100 epoch_Time:23637.0min: [2024-01-06 14:29:09,067][model8_pretrain.py][INFO] Epoch:[0/2](875000/4588595) loss:2.930 lr:0.0000100 epoch_Time:23637.0min: [2024-01-06 14:29:09,067][model8_pretrain.py][INFO] Epoch:[0/2](875000/4588595) loss:2.702 lr:0.0000100 epoch_Time:23637.0min: [2024-01-06 14:29:09,067][model8_pretrain.py][INFO] Epoch:[0/2](875000/4588595) loss:2.120 lr:0.0000100 epoch_Time:23637.0min: [2024-01-06 14:29:09,067][model8_pretrain.py][INFO] Epoch:[0/2](875000/4588595) loss:2.757 lr:0.0000100 epoch_Time:23637.0min: [2024-01-06 14:29:52,897][model8_pretrain.py][INFO] Epoch:[0/2](875100/4588595) loss:2.768 lr:0.0000100 epoch_Time:23636.0min: [2024-01-06 14:29:52,897][model8_pretrain.py][INFO] Epoch:[0/2](875100/4588595) loss:3.034 lr:0.0000100 epoch_Time:23636.0min: [2024-01-06 14:29:52,897][model8_pretrain.py][INFO] Epoch:[0/2](875100/4588595) loss:3.292 lr:0.0000100 epoch_Time:23636.0min: [2024-01-06 14:29:52,897][model8_pretrain.py][INFO] Epoch:[0/2](875100/4588595) loss:2.239 lr:0.0000100 epoch_Time:23636.0min: [2024-01-06 14:29:52,897][model8_pretrain.py][INFO] Epoch:[0/2](875100/4588595) loss:2.793 lr:0.0000100 epoch_Time:23636.0min: [2024-01-06 14:29:52,897][model8_pretrain.py][INFO] Epoch:[0/2](875100/4588595) loss:2.711 lr:0.0000100 epoch_Time:23636.0min: [2024-01-06 14:29:52,897][model8_pretrain.py][INFO] Epoch:[0/2](875100/4588595) loss:2.355 lr:0.0000100 epoch_Time:23636.0min: [2024-01-06 14:29:52,897][model8_pretrain.py][INFO] Epoch:[0/2](875100/4588595) loss:3.416 lr:0.0000100 epoch_Time:23636.0min: [2024-01-06 14:30:33,297][model8_pretrain.py][INFO] Epoch:[0/2](875200/4588595) loss:2.761 lr:0.0000100 epoch_Time:23636.0min: [2024-01-06 14:30:33,297][model8_pretrain.py][INFO] 
Epoch:[0/2](875200/4588595) loss:2.551 lr:0.0000100 epoch_Time:23636.0min: [2024-01-06 14:30:33,297][model8_pretrain.py][INFO] Epoch:[0/2](875200/4588595) loss:2.562 lr:0.0000100 epoch_Time:23636.0min: [2024-01-06 14:30:33,297][model8_pretrain.py][INFO] Epoch:[0/2](875200/4588595) loss:2.386 lr:0.0000100 epoch_Time:23636.0min: [2024-01-06 14:30:33,297][model8_pretrain.py][INFO] Epoch:[0/2](875200/4588595) loss:2.936 lr:0.0000100 epoch_Time:23636.0min: [2024-01-06 14:30:33,297][model8_pretrain.py][INFO] Epoch:[0/2](875200/4588595) loss:2.298 lr:0.0000100 epoch_Time:23636.0min: [2024-01-06 14:30:33,297][model8_pretrain.py][INFO] Epoch:[0/2](875200/4588595) loss:3.022 lr:0.0000100 epoch_Time:23636.0min: [2024-01-06 14:30:33,297][model8_pretrain.py][INFO] Epoch:[0/2](875200/4588595) loss:2.674 lr:0.0000100 epoch_Time:23636.0min: [2024-01-06 14:31:10,246][model8_pretrain.py][INFO] Epoch:[0/2](875300/4588595) loss:2.774 lr:0.0000100 epoch_Time:23635.0min: [2024-01-06 14:31:10,246][model8_pretrain.py][INFO] Epoch:[0/2](875300/4588595) loss:2.665 lr:0.0000100 epoch_Time:23635.0min: [2024-01-06 14:31:10,246][model8_pretrain.py][INFO] Epoch:[0/2](875300/4588595) loss:2.923 lr:0.0000100 epoch_Time:23635.0min: [2024-01-06 14:31:10,246][model8_pretrain.py][INFO] Epoch:[0/2](875300/4588595) loss:2.925 lr:0.0000100 epoch_Time:23635.0min: [2024-01-06 14:31:10,246][model8_pretrain.py][INFO] Epoch:[0/2](875300/4588595) loss:2.603 lr:0.0000100 epoch_Time:23635.0min: [2024-01-06 14:31:10,246][model8_pretrain.py][INFO] Epoch:[0/2](875300/4588595) loss:3.159 lr:0.0000100 epoch_Time:23635.0min: [2024-01-06 14:31:10,246][model8_pretrain.py][INFO] Epoch:[0/2](875300/4588595) loss:2.788 lr:0.0000100 epoch_Time:23635.0min: [2024-01-06 14:31:10,246][model8_pretrain.py][INFO] Epoch:[0/2](875300/4588595) loss:2.967 lr:0.0000100 epoch_Time:23635.0min: [2024-01-06 14:31:47,180][model8_pretrain.py][INFO] Epoch:[0/2](875400/4588595) loss:2.128 lr:0.0000100 epoch_Time:23635.0min: [2024-01-06 
14:31:47,180][model8_pretrain.py][INFO] Epoch:[0/2](875400/4588595) loss:3.159 lr:0.0000100 epoch_Time:23635.0min: [2024-01-06 14:31:47,180][model8_pretrain.py][INFO] Epoch:[0/2](875400/4588595) loss:2.548 lr:0.0000100 epoch_Time:23635.0min: [2024-01-06 14:31:47,180][model8_pretrain.py][INFO] Epoch:[0/2](875400/4588595) loss:3.117 lr:0.0000100 epoch_Time:23635.0min: [2024-01-06 14:31:47,180][model8_pretrain.py][INFO] Epoch:[0/2](875400/4588595) loss:2.897 lr:0.0000100 epoch_Time:23635.0min: [2024-01-06 14:31:47,180][model8_pretrain.py][INFO] Epoch:[0/2](875400/4588595) loss:2.955 lr:0.0000100 epoch_Time:23635.0min: [2024-01-06 14:31:47,180][model8_pretrain.py][INFO] Epoch:[0/2](875400/4588595) loss:3.230 lr:0.0000100 epoch_Time:23635.0min: [2024-01-06 14:31:47,181][model8_pretrain.py][INFO] Epoch:[0/2](875400/4588595) loss:2.932 lr:0.0000100 epoch_Time:23635.0min: [2024-01-06 14:32:24,122][model8_pretrain.py][INFO] Epoch:[0/2](875500/4588595) loss:2.280 lr:0.0000100 epoch_Time:23634.0min: [2024-01-06 14:32:24,123][model8_pretrain.py][INFO] Epoch:[0/2](875500/4588595) loss:2.903 lr:0.0000100 epoch_Time:23634.0min: [2024-01-06 14:32:24,123][model8_pretrain.py][INFO] Epoch:[0/2](875500/4588595) loss:2.800 lr:0.0000100 epoch_Time:23634.0min: [2024-01-06 14:32:24,123][model8_pretrain.py][INFO] Epoch:[0/2](875500/4588595) loss:2.819 lr:0.0000100 epoch_Time:23634.0min: [2024-01-06 14:32:24,123][model8_pretrain.py][INFO] Epoch:[0/2](875500/4588595) loss:2.896 lr:0.0000100 epoch_Time:23634.0min: [2024-01-06 14:32:24,123][model8_pretrain.py][INFO] Epoch:[0/2](875500/4588595) loss:3.193 lr:0.0000100 epoch_Time:23634.0min: [2024-01-06 14:32:24,123][model8_pretrain.py][INFO] Epoch:[0/2](875500/4588595) loss:2.474 lr:0.0000100 epoch_Time:23634.0min: [2024-01-06 14:32:24,123][model8_pretrain.py][INFO] Epoch:[0/2](875500/4588595) loss:2.607 lr:0.0000100 epoch_Time:23634.0min: [2024-01-06 14:33:01,071][model8_pretrain.py][INFO] Epoch:[0/2](875600/4588595) loss:2.707 lr:0.0000100 
epoch_Time:23633.0min: [2024-01-06 14:33:01,071][model8_pretrain.py][INFO] Epoch:[0/2](875600/4588595) loss:3.213 lr:0.0000100 epoch_Time:23633.0min: [2024-01-06 14:33:01,071][model8_pretrain.py][INFO] Epoch:[0/2](875600/4588595) loss:2.907 lr:0.0000100 epoch_Time:23633.0min: [2024-01-06 14:33:01,071][model8_pretrain.py][INFO] Epoch:[0/2](875600/4588595) loss:2.762 lr:0.0000100 epoch_Time:23633.0min: [2024-01-06 14:33:01,071][model8_pretrain.py][INFO] Epoch:[0/2](875600/4588595) loss:3.018 lr:0.0000100 epoch_Time:23633.0min: [2024-01-06 14:33:01,071][model8_pretrain.py][INFO] Epoch:[0/2](875600/4588595) loss:2.925 lr:0.0000100 epoch_Time:23633.0min: [2024-01-06 14:33:01,071][model8_pretrain.py][INFO] Epoch:[0/2](875600/4588595) loss:2.790 lr:0.0000100 epoch_Time:23633.0min: [2024-01-06 14:33:01,071][model8_pretrain.py][INFO] Epoch:[0/2](875600/4588595) loss:2.867 lr:0.0000100 epoch_Time:23633.0min: [2024-01-06 14:33:38,019][model8_pretrain.py][INFO] Epoch:[0/2](875700/4588595) loss:3.025 lr:0.0000100 epoch_Time:23633.0min: [2024-01-06 14:33:38,019][model8_pretrain.py][INFO] Epoch:[0/2](875700/4588595) loss:3.015 lr:0.0000100 epoch_Time:23633.0min: [2024-01-06 14:33:38,019][model8_pretrain.py][INFO] Epoch:[0/2](875700/4588595) loss:2.945 lr:0.0000100 epoch_Time:23633.0min: [2024-01-06 14:33:38,019][model8_pretrain.py][INFO] Epoch:[0/2](875700/4588595) loss:3.065 lr:0.0000100 epoch_Time:23633.0min: [2024-01-06 14:33:38,019][model8_pretrain.py][INFO] Epoch:[0/2](875700/4588595) loss:3.388 lr:0.0000100 epoch_Time:23633.0min: [2024-01-06 14:33:38,019][model8_pretrain.py][INFO] Epoch:[0/2](875700/4588595) loss:2.626 lr:0.0000100 epoch_Time:23633.0min: [2024-01-06 14:33:38,019][model8_pretrain.py][INFO] Epoch:[0/2](875700/4588595) loss:2.961 lr:0.0000100 epoch_Time:23633.0min: [2024-01-06 14:33:38,019][model8_pretrain.py][INFO] Epoch:[0/2](875700/4588595) loss:2.881 lr:0.0000100 epoch_Time:23633.0min: [2024-01-06 14:34:14,963][model8_pretrain.py][INFO] 
Epoch:[0/2](875800/4588595) loss:3.364 lr:0.0000100 epoch_Time:23632.0min: [2024-01-06 14:34:14,963][model8_pretrain.py][INFO] Epoch:[0/2](875800/4588595) loss:3.012 lr:0.0000100 epoch_Time:23632.0min: [2024-01-06 14:34:14,963][model8_pretrain.py][INFO] Epoch:[0/2](875800/4588595) loss:2.735 lr:0.0000100 epoch_Time:23632.0min: [2024-01-06 14:34:14,963][model8_pretrain.py][INFO] Epoch:[0/2](875800/4588595) loss:3.310 lr:0.0000100 epoch_Time:23632.0min: [2024-01-06 14:34:14,963][model8_pretrain.py][INFO] Epoch:[0/2](875800/4588595) loss:2.916 lr:0.0000100 epoch_Time:23632.0min: [2024-01-06 14:34:14,963][model8_pretrain.py][INFO] Epoch:[0/2](875800/4588595) loss:3.080 lr:0.0000100 epoch_Time:23632.0min: [2024-01-06 14:34:14,963][model8_pretrain.py][INFO] Epoch:[0/2](875800/4588595) loss:2.797 lr:0.0000100 epoch_Time:23632.0min: [2024-01-06 14:34:14,963][model8_pretrain.py][INFO] Epoch:[0/2](875800/4588595) loss:2.879 lr:0.0000100 epoch_Time:23632.0min: [2024-01-06 14:34:58,572][model8_pretrain.py][INFO] Epoch:[0/2](875900/4588595) loss:2.804 lr:0.0000100 epoch_Time:23631.0min: [2024-01-06 14:34:58,572][model8_pretrain.py][INFO] Epoch:[0/2](875900/4588595) loss:3.102 lr:0.0000100 epoch_Time:23631.0min: [2024-01-06 14:34:58,577][model8_pretrain.py][INFO] Epoch:[0/2](875900/4588595) loss:2.920 lr:0.0000100 epoch_Time:23631.0min: [2024-01-06 14:34:58,577][model8_pretrain.py][INFO] Epoch:[0/2](875900/4588595) loss:2.581 lr:0.0000100 epoch_Time:23631.0min: [2024-01-06 14:34:58,577][model8_pretrain.py][INFO] Epoch:[0/2](875900/4588595) loss:2.917 lr:0.0000100 epoch_Time:23631.0min: [2024-01-06 14:34:58,577][model8_pretrain.py][INFO] Epoch:[0/2](875900/4588595) loss:2.508 lr:0.0000100 epoch_Time:23631.0min: [2024-01-06 14:34:58,577][model8_pretrain.py][INFO] Epoch:[0/2](875900/4588595) loss:3.032 lr:0.0000100 epoch_Time:23631.0min: [2024-01-06 14:34:58,577][model8_pretrain.py][INFO] Epoch:[0/2](875900/4588595) loss:2.786 lr:0.0000100 epoch_Time:23631.0min: [2024-01-06 
14:35:38,989][model8_pretrain.py][INFO] Epoch:[0/2](876000/4588595) loss:2.718 lr:0.0000100 epoch_Time:23631.0min: [2024-01-06 14:35:38,989][model8_pretrain.py][INFO] Epoch:[0/2](876000/4588595) loss:2.448 lr:0.0000100 epoch_Time:23631.0min: [2024-01-06 14:35:38,989][model8_pretrain.py][INFO] Epoch:[0/2](876000/4588595) loss:2.864 lr:0.0000100 epoch_Time:23631.0min: [2024-01-06 14:35:38,989][model8_pretrain.py][INFO] Epoch:[0/2](876000/4588595) loss:3.153 lr:0.0000100 epoch_Time:23631.0min: [2024-01-06 14:35:38,989][model8_pretrain.py][INFO] Epoch:[0/2](876000/4588595) loss:2.543 lr:0.0000100 epoch_Time:23631.0min: [2024-01-06 14:35:38,989][model8_pretrain.py][INFO] Epoch:[0/2](876000/4588595) loss:2.806 lr:0.0000100 epoch_Time:23631.0min: [2024-01-06 14:35:38,989][model8_pretrain.py][INFO] Epoch:[0/2](876000/4588595) loss:2.733 lr:0.0000100 epoch_Time:23631.0min: [2024-01-06 14:35:38,989][model8_pretrain.py][INFO] Epoch:[0/2](876000/4588595) loss:3.457 lr:0.0000100 epoch_Time:23631.0min: [2024-01-06 14:36:15,924][model8_pretrain.py][INFO] Epoch:[0/2](876100/4588595) loss:2.856 lr:0.0000100 epoch_Time:23630.0min: [2024-01-06 14:36:15,924][model8_pretrain.py][INFO] Epoch:[0/2](876100/4588595) loss:3.044 lr:0.0000100 epoch_Time:23630.0min: [2024-01-06 14:36:15,924][model8_pretrain.py][INFO] Epoch:[0/2](876100/4588595) loss:2.615 lr:0.0000100 epoch_Time:23630.0min: [2024-01-06 14:36:15,924][model8_pretrain.py][INFO] Epoch:[0/2](876100/4588595) loss:3.368 lr:0.0000100 epoch_Time:23630.0min: [2024-01-06 14:36:15,924][model8_pretrain.py][INFO] Epoch:[0/2](876100/4588595) loss:2.853 lr:0.0000100 epoch_Time:23630.0min: [2024-01-06 14:36:15,924][model8_pretrain.py][INFO] Epoch:[0/2](876100/4588595) loss:2.744 lr:0.0000100 epoch_Time:23630.0min: [2024-01-06 14:36:15,924][model8_pretrain.py][INFO] Epoch:[0/2](876100/4588595) loss:3.223 lr:0.0000100 epoch_Time:23630.0min: [2024-01-06 14:36:15,924][model8_pretrain.py][INFO] Epoch:[0/2](876100/4588595) loss:2.895 lr:0.0000100 
epoch_Time:23630.0min: [2024-01-06 14:36:52,863][model8_pretrain.py][INFO] Epoch:[0/2](876200/4588595) loss:2.770 lr:0.0000100 epoch_Time:23629.0min: [2024-01-06 14:36:52,863][model8_pretrain.py][INFO] Epoch:[0/2](876200/4588595) loss:3.298 lr:0.0000100 epoch_Time:23629.0min: [2024-01-06 14:36:52,863][model8_pretrain.py][INFO] Epoch:[0/2](876200/4588595) loss:2.087 lr:0.0000100 epoch_Time:23629.0min: [2024-01-06 14:36:52,863][model8_pretrain.py][INFO] Epoch:[0/2](876200/4588595) loss:3.385 lr:0.0000100 epoch_Time:23629.0min: [2024-01-06 14:36:52,863][model8_pretrain.py][INFO] Epoch:[0/2](876200/4588595) loss:3.073 lr:0.0000100 epoch_Time:23629.0min: [2024-01-06 14:36:52,864][model8_pretrain.py][INFO] Epoch:[0/2](876200/4588595) loss:2.949 lr:0.0000100 epoch_Time:23629.0min: [2024-01-06 14:36:52,864][model8_pretrain.py][INFO] Epoch:[0/2](876200/4588595) loss:3.128 lr:0.0000100 epoch_Time:23629.0min: [2024-01-06 14:36:52,864][model8_pretrain.py][INFO] Epoch:[0/2](876200/4588595) loss:3.321 lr:0.0000100 epoch_Time:23629.0min: [2024-01-06 14:37:29,808][model8_pretrain.py][INFO] Epoch:[0/2](876300/4588595) loss:3.019 lr:0.0000100 epoch_Time:23629.0min: [2024-01-06 14:37:29,808][model8_pretrain.py][INFO] Epoch:[0/2](876300/4588595) loss:2.384 lr:0.0000100 epoch_Time:23629.0min: [2024-01-06 14:37:29,808][model8_pretrain.py][INFO] Epoch:[0/2](876300/4588595) loss:2.812 lr:0.0000100 epoch_Time:23629.0min: [2024-01-06 14:37:29,808][model8_pretrain.py][INFO] Epoch:[0/2](876300/4588595) loss:3.073 lr:0.0000100 epoch_Time:23629.0min: [2024-01-06 14:37:29,808][model8_pretrain.py][INFO] Epoch:[0/2](876300/4588595) loss:2.482 lr:0.0000100 epoch_Time:23629.0min: [2024-01-06 14:37:29,808][model8_pretrain.py][INFO] Epoch:[0/2](876300/4588595) loss:2.227 lr:0.0000100 epoch_Time:23629.0min: [2024-01-06 14:37:29,808][model8_pretrain.py][INFO] Epoch:[0/2](876300/4588595) loss:2.885 lr:0.0000100 epoch_Time:23629.0min: [2024-01-06 14:37:29,808][model8_pretrain.py][INFO] 
Epoch:[0/2](876300/4588595) loss:2.594 lr:0.0000100 epoch_Time:23629.0min: [2024-01-06 14:38:06,740][model8_pretrain.py][INFO] Epoch:[0/2](876400/4588595) loss:2.819 lr:0.0000100 epoch_Time:23628.0min: [2024-01-06 14:38:06,740][model8_pretrain.py][INFO] Epoch:[0/2](876400/4588595) loss:2.766 lr:0.0000100 epoch_Time:23628.0min: [2024-01-06 14:38:06,740][model8_pretrain.py][INFO] Epoch:[0/2](876400/4588595) loss:2.859 lr:0.0000100 epoch_Time:23628.0min: [2024-01-06 14:38:06,740][model8_pretrain.py][INFO] Epoch:[0/2](876400/4588595) loss:2.824 lr:0.0000100 epoch_Time:23628.0min: [2024-01-06 14:38:06,740][model8_pretrain.py][INFO] Epoch:[0/2](876400/4588595) loss:2.684 lr:0.0000100 epoch_Time:23628.0min: [2024-01-06 14:38:06,740][model8_pretrain.py][INFO] Epoch:[0/2](876400/4588595) loss:2.833 lr:0.0000100 epoch_Time:23628.0min: [2024-01-06 14:38:06,740][model8_pretrain.py][INFO] Epoch:[0/2](876400/4588595) loss:3.405 lr:0.0000100 epoch_Time:23628.0min: [2024-01-06 14:38:06,740][model8_pretrain.py][INFO] Epoch:[0/2](876400/4588595) loss:2.612 lr:0.0000100 epoch_Time:23628.0min: [2024-01-06 14:38:43,668][model8_pretrain.py][INFO] Epoch:[0/2](876500/4588595) loss:3.316 lr:0.0000100 epoch_Time:23628.0min: [2024-01-06 14:38:43,668][model8_pretrain.py][INFO] Epoch:[0/2](876500/4588595) loss:3.394 lr:0.0000100 epoch_Time:23628.0min: [2024-01-06 14:38:43,668][model8_pretrain.py][INFO] Epoch:[0/2](876500/4588595) loss:3.146 lr:0.0000100 epoch_Time:23628.0min: [2024-01-06 14:38:43,668][model8_pretrain.py][INFO] Epoch:[0/2](876500/4588595) loss:2.241 lr:0.0000100 epoch_Time:23628.0min: [2024-01-06 14:38:43,668][model8_pretrain.py][INFO] Epoch:[0/2](876500/4588595) loss:2.995 lr:0.0000100 epoch_Time:23628.0min: [2024-01-06 14:38:43,668][model8_pretrain.py][INFO] Epoch:[0/2](876500/4588595) loss:3.349 lr:0.0000100 epoch_Time:23628.0min: [2024-01-06 14:38:43,668][model8_pretrain.py][INFO] Epoch:[0/2](876500/4588595) loss:3.005 lr:0.0000100 epoch_Time:23628.0min: [2024-01-06 
14:38:43,668][model8_pretrain.py][INFO] Epoch:[0/2](876500/4588595) loss:2.400 lr:0.0000100 epoch_Time:23628.0min: [2024-01-06 14:39:20,594][model8_pretrain.py][INFO] Epoch:[0/2](876600/4588595) loss:2.609 lr:0.0000100 epoch_Time:23627.0min: [2024-01-06 14:39:20,594][model8_pretrain.py][INFO] Epoch:[0/2](876600/4588595) loss:2.124 lr:0.0000100 epoch_Time:23627.0min: [2024-01-06 14:39:20,594][model8_pretrain.py][INFO] Epoch:[0/2](876600/4588595) loss:2.960 lr:0.0000100 epoch_Time:23627.0min: [2024-01-06 14:39:20,594][model8_pretrain.py][INFO] Epoch:[0/2](876600/4588595) loss:3.293 lr:0.0000100 epoch_Time:23627.0min: [2024-01-06 14:39:20,594][model8_pretrain.py][INFO] Epoch:[0/2](876600/4588595) loss:3.269 lr:0.0000100 epoch_Time:23627.0min: [2024-01-06 14:39:20,594][model8_pretrain.py][INFO] Epoch:[0/2](876600/4588595) loss:2.827 lr:0.0000100 epoch_Time:23627.0min: [2024-01-06 14:39:20,594][model8_pretrain.py][INFO] Epoch:[0/2](876600/4588595) loss:3.163 lr:0.0000100 epoch_Time:23627.0min: [2024-01-06 14:39:20,594][model8_pretrain.py][INFO] Epoch:[0/2](876600/4588595) loss:2.410 lr:0.0000100 epoch_Time:23627.0min: [2024-01-06 14:40:02,534][model8_pretrain.py][INFO] Epoch:[0/2](876700/4588595) loss:2.652 lr:0.0000100 epoch_Time:23626.0min: [2024-01-06 14:40:02,534][model8_pretrain.py][INFO] Epoch:[0/2](876700/4588595) loss:2.787 lr:0.0000100 epoch_Time:23626.0min: [2024-01-06 14:40:02,534][model8_pretrain.py][INFO] Epoch:[0/2](876700/4588595) loss:2.546 lr:0.0000100 epoch_Time:23626.0min: [2024-01-06 14:40:02,534][model8_pretrain.py][INFO] Epoch:[0/2](876700/4588595) loss:2.634 lr:0.0000100 epoch_Time:23626.0min: [2024-01-06 14:40:02,534][model8_pretrain.py][INFO] Epoch:[0/2](876700/4588595) loss:3.115 lr:0.0000100 epoch_Time:23626.0min: [2024-01-06 14:40:02,534][model8_pretrain.py][INFO] Epoch:[0/2](876700/4588595) loss:2.988 lr:0.0000100 epoch_Time:23626.0min: [2024-01-06 14:40:02,534][model8_pretrain.py][INFO] Epoch:[0/2](876700/4588595) loss:2.734 lr:0.0000100 
epoch_Time:23626.0min: [2024-01-06 14:40:02,534][model8_pretrain.py][INFO] Epoch:[0/2](876700/4588595) loss:2.431 lr:0.0000100 epoch_Time:23626.0min: [2024-01-06 14:40:44,695][model8_pretrain.py][INFO] Epoch:[0/2](876800/4588595) loss:2.589 lr:0.0000100 epoch_Time:23626.0min: [2024-01-06 14:40:44,695][model8_pretrain.py][INFO] Epoch:[0/2](876800/4588595) loss:3.162 lr:0.0000100 epoch_Time:23627.0min: [2024-01-06 14:40:44,695][model8_pretrain.py][INFO] Epoch:[0/2](876800/4588595) loss:3.027 lr:0.0000100 epoch_Time:23626.0min: [2024-01-06 14:40:44,695][model8_pretrain.py][INFO] Epoch:[0/2](876800/4588595) loss:2.192 lr:0.0000100 epoch_Time:23626.0min: [2024-01-06 14:40:44,695][model8_pretrain.py][INFO] Epoch:[0/2](876800/4588595) loss:2.641 lr:0.0000100 epoch_Time:23627.0min: [2024-01-06 14:40:44,695][model8_pretrain.py][INFO] Epoch:[0/2](876800/4588595) loss:2.935 lr:0.0000100 epoch_Time:23626.0min: [2024-01-06 14:40:44,695][model8_pretrain.py][INFO] Epoch:[0/2](876800/4588595) loss:2.828 lr:0.0000100 epoch_Time:23626.0min: [2024-01-06 14:40:44,695][model8_pretrain.py][INFO] Epoch:[0/2](876800/4588595) loss:2.330 lr:0.0000100 epoch_Time:23626.0min: [2024-01-06 14:41:21,627][model8_pretrain.py][INFO] Epoch:[0/2](876900/4588595) loss:2.849 lr:0.0000100 epoch_Time:23625.0min: [2024-01-06 14:41:21,627][model8_pretrain.py][INFO] Epoch:[0/2](876900/4588595) loss:2.561 lr:0.0000100 epoch_Time:23625.0min: [2024-01-06 14:41:21,627][model8_pretrain.py][INFO] Epoch:[0/2](876900/4588595) loss:2.718 lr:0.0000100 epoch_Time:23625.0min: [2024-01-06 14:41:21,627][model8_pretrain.py][INFO] Epoch:[0/2](876900/4588595) loss:1.821 lr:0.0000100 epoch_Time:23625.0min: [2024-01-06 14:41:21,627][model8_pretrain.py][INFO] Epoch:[0/2](876900/4588595) loss:2.774 lr:0.0000100 epoch_Time:23625.0min: [2024-01-06 14:41:21,627][model8_pretrain.py][INFO] Epoch:[0/2](876900/4588595) loss:3.202 lr:0.0000100 epoch_Time:23625.0min: [2024-01-06 14:41:21,627][model8_pretrain.py][INFO] 
Epoch:[0/2](876900/4588595) loss:2.828 lr:0.0000100 epoch_Time:23625.0min: [2024-01-06 14:41:21,627][model8_pretrain.py][INFO] Epoch:[0/2](876900/4588595) loss:3.028 lr:0.0000100 epoch_Time:23625.0min: [2024-01-06 14:41:58,564][model8_pretrain.py][INFO] Epoch:[0/2](877000/4588595) loss:2.531 lr:0.0000100 epoch_Time:23624.0min: [2024-01-06 14:41:58,564][model8_pretrain.py][INFO] Epoch:[0/2](877000/4588595) loss:2.768 lr:0.0000100 epoch_Time:23624.0min: [2024-01-06 14:41:58,564][model8_pretrain.py][INFO] Epoch:[0/2](877000/4588595) loss:2.024 lr:0.0000100 epoch_Time:23624.0min: [2024-01-06 14:41:58,564][model8_pretrain.py][INFO] Epoch:[0/2](877000/4588595) loss:2.575 lr:0.0000100 epoch_Time:23624.0min: [2024-01-06 14:41:58,564][model8_pretrain.py][INFO] Epoch:[0/2](877000/4588595) loss:2.835 lr:0.0000100 epoch_Time:23624.0min: [2024-01-06 14:41:58,564][model8_pretrain.py][INFO] Epoch:[0/2](877000/4588595) loss:2.857 lr:0.0000100 epoch_Time:23624.0min: [2024-01-06 14:41:58,564][model8_pretrain.py][INFO] Epoch:[0/2](877000/4588595) loss:3.317 lr:0.0000100 epoch_Time:23624.0min: [2024-01-06 14:41:58,564][model8_pretrain.py][INFO] Epoch:[0/2](877000/4588595) loss:3.016 lr:0.0000100 epoch_Time:23624.0min: [2024-01-06 14:42:35,488][model8_pretrain.py][INFO] Epoch:[0/2](877100/4588595) loss:2.713 lr:0.0000100 epoch_Time:23624.0min: [2024-01-06 14:42:35,488][model8_pretrain.py][INFO] Epoch:[0/2](877100/4588595) loss:3.032 lr:0.0000100 epoch_Time:23624.0min: [2024-01-06 14:42:35,488][model8_pretrain.py][INFO] Epoch:[0/2](877100/4588595) loss:3.499 lr:0.0000100 epoch_Time:23624.0min: [2024-01-06 14:42:35,488][model8_pretrain.py][INFO] Epoch:[0/2](877100/4588595) loss:2.930 lr:0.0000100 epoch_Time:23624.0min: [2024-01-06 14:42:35,488][model8_pretrain.py][INFO] Epoch:[0/2](877100/4588595) loss:2.656 lr:0.0000100 epoch_Time:23624.0min: [2024-01-06 14:42:35,488][model8_pretrain.py][INFO] Epoch:[0/2](877100/4588595) loss:2.880 lr:0.0000100 epoch_Time:23624.0min: [2024-01-06 
14:42:35,488][model8_pretrain.py][INFO] Epoch:[0/2](877100/4588595) loss:2.926 lr:0.0000100 epoch_Time:23624.0min: [2024-01-06 14:42:35,488][model8_pretrain.py][INFO] Epoch:[0/2](877100/4588595) loss:2.993 lr:0.0000100 epoch_Time:23624.0min: [2024-01-06 14:43:12,433][model8_pretrain.py][INFO] Epoch:[0/2](877200/4588595) loss:2.218 lr:0.0000100 epoch_Time:23623.0min: [2024-01-06 14:43:12,433][model8_pretrain.py][INFO] Epoch:[0/2](877200/4588595) loss:2.593 lr:0.0000100 epoch_Time:23623.0min: [2024-01-06 14:43:12,433][model8_pretrain.py][INFO] Epoch:[0/2](877200/4588595) loss:2.963 lr:0.0000100 epoch_Time:23623.0min: [2024-01-06 14:43:12,433][model8_pretrain.py][INFO] Epoch:[0/2](877200/4588595) loss:2.822 lr:0.0000100 epoch_Time:23623.0min: [2024-01-06 14:43:12,433][model8_pretrain.py][INFO] Epoch:[0/2](877200/4588595) loss:2.837 lr:0.0000100 epoch_Time:23623.0min: [2024-01-06 14:43:12,433][model8_pretrain.py][INFO] Epoch:[0/2](877200/4588595) loss:2.123 lr:0.0000100 epoch_Time:23623.0min: [2024-01-06 14:43:12,433][model8_pretrain.py][INFO] Epoch:[0/2](877200/4588595) loss:3.026 lr:0.0000100 epoch_Time:23623.0min: [2024-01-06 14:43:12,433][model8_pretrain.py][INFO] Epoch:[0/2](877200/4588595) loss:2.863 lr:0.0000100 epoch_Time:23623.0min: [2024-01-06 14:43:49,370][model8_pretrain.py][INFO] Epoch:[0/2](877300/4588595) loss:2.354 lr:0.0000100 epoch_Time:23622.0min: [2024-01-06 14:43:49,370][model8_pretrain.py][INFO] Epoch:[0/2](877300/4588595) loss:2.967 lr:0.0000100 epoch_Time:23622.0min: [2024-01-06 14:43:49,370][model8_pretrain.py][INFO] Epoch:[0/2](877300/4588595) loss:2.356 lr:0.0000100 epoch_Time:23622.0min: [2024-01-06 14:43:49,370][model8_pretrain.py][INFO] Epoch:[0/2](877300/4588595) loss:2.473 lr:0.0000100 epoch_Time:23622.0min: [2024-01-06 14:43:49,370][model8_pretrain.py][INFO] Epoch:[0/2](877300/4588595) loss:2.818 lr:0.0000100 epoch_Time:23622.0min: [2024-01-06 14:43:49,370][model8_pretrain.py][INFO] Epoch:[0/2](877300/4588595) loss:3.233 lr:0.0000100 
epoch_Time:23622.0min: [2024-01-06 14:43:49,370][model8_pretrain.py][INFO] Epoch:[0/2](877300/4588595) loss:2.568 lr:0.0000100 epoch_Time:23622.0min: [2024-01-06 14:43:49,371][model8_pretrain.py][INFO] Epoch:[0/2](877300/4588595) loss:2.775 lr:0.0000100 epoch_Time:23622.0min: [2024-01-06 14:44:26,280][model8_pretrain.py][INFO] Epoch:[0/2](877400/4588595) loss:2.825 lr:0.0000100 epoch_Time:23622.0min: [2024-01-06 14:44:26,280][model8_pretrain.py][INFO] Epoch:[0/2](877400/4588595) loss:2.946 lr:0.0000100 epoch_Time:23622.0min: [2024-01-06 14:44:26,280][model8_pretrain.py][INFO] Epoch:[0/2](877400/4588595) loss:3.031 lr:0.0000100 epoch_Time:23622.0min: [2024-01-06 14:44:26,280][model8_pretrain.py][INFO] Epoch:[0/2](877400/4588595) loss:2.439 lr:0.0000100 epoch_Time:23622.0min: [2024-01-06 14:44:26,280][model8_pretrain.py][INFO] Epoch:[0/2](877400/4588595) loss:1.964 lr:0.0000100 epoch_Time:23622.0min: [2024-01-06 14:44:26,280][model8_pretrain.py][INFO] Epoch:[0/2](877400/4588595) loss:2.551 lr:0.0000100 epoch_Time:23622.0min: [2024-01-06 14:44:26,280][model8_pretrain.py][INFO] Epoch:[0/2](877400/4588595) loss:3.283 lr:0.0000100 epoch_Time:23622.0min: [2024-01-06 14:44:26,280][model8_pretrain.py][INFO] Epoch:[0/2](877400/4588595) loss:2.521 lr:0.0000100 epoch_Time:23622.0min: [2024-01-06 14:45:08,286][model8_pretrain.py][INFO] Epoch:[0/2](877500/4588595) loss:2.945 lr:0.0000100 epoch_Time:23621.0min: [2024-01-06 14:45:08,286][model8_pretrain.py][INFO] Epoch:[0/2](877500/4588595) loss:2.828 lr:0.0000100 epoch_Time:23621.0min: [2024-01-06 14:45:08,286][model8_pretrain.py][INFO] Epoch:[0/2](877500/4588595) loss:2.862 lr:0.0000100 epoch_Time:23621.0min: [2024-01-06 14:45:08,286][model8_pretrain.py][INFO] Epoch:[0/2](877500/4588595) loss:3.123 lr:0.0000100 epoch_Time:23621.0min: [2024-01-06 14:45:08,286][model8_pretrain.py][INFO] Epoch:[0/2](877500/4588595) loss:2.488 lr:0.0000100 epoch_Time:23621.0min: [2024-01-06 14:45:08,286][model8_pretrain.py][INFO] 
Epoch:[0/2](877500/4588595) loss:2.964 lr:0.0000100 epoch_Time:23621.0min: [2024-01-06 14:45:08,286][model8_pretrain.py][INFO] Epoch:[0/2](877500/4588595) loss:3.099 lr:0.0000100 epoch_Time:23621.0min: [2024-01-06 14:45:08,286][model8_pretrain.py][INFO] Epoch:[0/2](877500/4588595) loss:2.853 lr:0.0000100 epoch_Time:23621.0min: [2024-01-06 14:45:50,407][model8_pretrain.py][INFO] Epoch:[0/2](877600/4588595) loss:2.573 lr:0.0000100 epoch_Time:23621.0min: [2024-01-06 14:45:50,407][model8_pretrain.py][INFO] Epoch:[0/2](877600/4588595) loss:2.473 lr:0.0000100 epoch_Time:23621.0min: [2024-01-06 14:45:50,407][model8_pretrain.py][INFO] Epoch:[0/2](877600/4588595) loss:2.617 lr:0.0000100 epoch_Time:23621.0min: [2024-01-06 14:45:50,407][model8_pretrain.py][INFO] Epoch:[0/2](877600/4588595) loss:3.142 lr:0.0000100 epoch_Time:23621.0min: [2024-01-06 14:45:50,407][model8_pretrain.py][INFO] Epoch:[0/2](877600/4588595) loss:2.874 lr:0.0000100 epoch_Time:23621.0min: [2024-01-06 14:45:50,407][model8_pretrain.py][INFO] Epoch:[0/2](877600/4588595) loss:3.245 lr:0.0000100 epoch_Time:23621.0min: [2024-01-06 14:45:50,407][model8_pretrain.py][INFO] Epoch:[0/2](877600/4588595) loss:2.664 lr:0.0000100 epoch_Time:23621.0min: [2024-01-06 14:45:50,407][model8_pretrain.py][INFO] Epoch:[0/2](877600/4588595) loss:2.378 lr:0.0000100 epoch_Time:23621.0min: [2024-01-06 14:46:27,356][model8_pretrain.py][INFO] Epoch:[0/2](877700/4588595) loss:2.718 lr:0.0000100 epoch_Time:23620.0min: [2024-01-06 14:46:27,356][model8_pretrain.py][INFO] Epoch:[0/2](877700/4588595) loss:2.976 lr:0.0000100 epoch_Time:23620.0min: [2024-01-06 14:46:27,356][model8_pretrain.py][INFO] Epoch:[0/2](877700/4588595) loss:2.756 lr:0.0000100 epoch_Time:23620.0min: [2024-01-06 14:46:27,356][model8_pretrain.py][INFO] Epoch:[0/2](877700/4588595) loss:3.357 lr:0.0000100 epoch_Time:23620.0min: [2024-01-06 14:46:27,356][model8_pretrain.py][INFO] Epoch:[0/2](877700/4588595) loss:2.622 lr:0.0000100 epoch_Time:23620.0min: [2024-01-06 
14:46:27,356][model8_pretrain.py][INFO] Epoch:[0/2](877700/4588595) loss:2.338 lr:0.0000100 epoch_Time:23620.0min: [2024-01-06 14:46:27,356][model8_pretrain.py][INFO] Epoch:[0/2](877700/4588595) loss:2.840 lr:0.0000100 epoch_Time:23620.0min: [2024-01-06 14:46:27,356][model8_pretrain.py][INFO] Epoch:[0/2](877700/4588595) loss:2.707 lr:0.0000100 epoch_Time:23620.0min: [2024-01-06 14:47:04,316][model8_pretrain.py][INFO] Epoch:[0/2](877800/4588595) loss:2.827 lr:0.0000100 epoch_Time:23619.0min: [2024-01-06 14:47:04,316][model8_pretrain.py][INFO] Epoch:[0/2](877800/4588595) loss:2.451 lr:0.0000100 epoch_Time:23619.0min: [2024-01-06 14:47:04,316][model8_pretrain.py][INFO] Epoch:[0/2](877800/4588595) loss:2.827 lr:0.0000100 epoch_Time:23619.0min: [2024-01-06 14:47:04,316][model8_pretrain.py][INFO] Epoch:[0/2](877800/4588595) loss:2.574 lr:0.0000100 epoch_Time:23619.0min: [2024-01-06 14:47:04,316][model8_pretrain.py][INFO] Epoch:[0/2](877800/4588595) loss:2.915 lr:0.0000100 epoch_Time:23619.0min: [2024-01-06 14:47:04,316][model8_pretrain.py][INFO] Epoch:[0/2](877800/4588595) loss:2.351 lr:0.0000100 epoch_Time:23619.0min: [2024-01-06 14:47:04,316][model8_pretrain.py][INFO] Epoch:[0/2](877800/4588595) loss:3.062 lr:0.0000100 epoch_Time:23619.0min: [2024-01-06 14:47:04,316][model8_pretrain.py][INFO] Epoch:[0/2](877800/4588595) loss:3.197 lr:0.0000100 epoch_Time:23619.0min: [2024-01-06 14:47:41,250][model8_pretrain.py][INFO] Epoch:[0/2](877900/4588595) loss:2.709 lr:0.0000100 epoch_Time:23619.0min: [2024-01-06 14:47:41,250][model8_pretrain.py][INFO] Epoch:[0/2](877900/4588595) loss:2.673 lr:0.0000100 epoch_Time:23619.0min: [2024-01-06 14:47:41,250][model8_pretrain.py][INFO] Epoch:[0/2](877900/4588595) loss:2.721 lr:0.0000100 epoch_Time:23619.0min: [2024-01-06 14:47:41,250][model8_pretrain.py][INFO] Epoch:[0/2](877900/4588595) loss:2.860 lr:0.0000100 epoch_Time:23619.0min: [2024-01-06 14:47:41,250][model8_pretrain.py][INFO] Epoch:[0/2](877900/4588595) loss:2.855 lr:0.0000100 
epoch_Time:23619.0min: [2024-01-06 14:47:41,250][model8_pretrain.py][INFO] Epoch:[0/2](877900/4588595) loss:2.578 lr:0.0000100 epoch_Time:23619.0min: [2024-01-06 14:47:41,250][model8_pretrain.py][INFO] Epoch:[0/2](877900/4588595) loss:2.851 lr:0.0000100 epoch_Time:23619.0min: [2024-01-06 14:47:41,250][model8_pretrain.py][INFO] Epoch:[0/2](877900/4588595) loss:2.476 lr:0.0000100 epoch_Time:23619.0min: [2024-01-06 14:48:18,194][model8_pretrain.py][INFO] Epoch:[0/2](878000/4588595) loss:3.322 lr:0.0000100 epoch_Time:23618.0min: [2024-01-06 14:48:18,194][model8_pretrain.py][INFO] Epoch:[0/2](878000/4588595) loss:2.398 lr:0.0000100 epoch_Time:23618.0min: [2024-01-06 14:48:18,194][model8_pretrain.py][INFO] Epoch:[0/2](878000/4588595) loss:2.541 lr:0.0000100 epoch_Time:23618.0min: [2024-01-06 14:48:18,194][model8_pretrain.py][INFO] Epoch:[0/2](878000/4588595) loss:2.117 lr:0.0000100 epoch_Time:23618.0min: [2024-01-06 14:48:18,194][model8_pretrain.py][INFO] Epoch:[0/2](878000/4588595) loss:2.731 lr:0.0000100 epoch_Time:23618.0min: [2024-01-06 14:48:18,194][model8_pretrain.py][INFO] Epoch:[0/2](878000/4588595) loss:2.665 lr:0.0000100 epoch_Time:23618.0min: [2024-01-06 14:48:18,194][model8_pretrain.py][INFO] Epoch:[0/2](878000/4588595) loss:3.399 lr:0.0000100 epoch_Time:23618.0min: [2024-01-06 14:48:18,194][model8_pretrain.py][INFO] Epoch:[0/2](878000/4588595) loss:2.720 lr:0.0000100 epoch_Time:23618.0min: [2024-01-06 14:48:55,132][model8_pretrain.py][INFO] Epoch:[0/2](878100/4588595) loss:3.246 lr:0.0000100 epoch_Time:23617.0min: [2024-01-06 14:48:55,133][model8_pretrain.py][INFO] Epoch:[0/2](878100/4588595) loss:2.669 lr:0.0000100 epoch_Time:23617.0min: [2024-01-06 14:48:55,133][model8_pretrain.py][INFO] Epoch:[0/2](878100/4588595) loss:2.791 lr:0.0000100 epoch_Time:23617.0min: [2024-01-06 14:48:55,133][model8_pretrain.py][INFO] Epoch:[0/2](878100/4588595) loss:2.959 lr:0.0000100 epoch_Time:23617.0min: [2024-01-06 14:48:55,133][model8_pretrain.py][INFO] 
Epoch:[0/2](878100/4588595) loss:3.041 lr:0.0000100 epoch_Time:23617.0min: [2024-01-06 14:48:55,133][model8_pretrain.py][INFO] Epoch:[0/2](878100/4588595) loss:2.715 lr:0.0000100 epoch_Time:23617.0min: [2024-01-06 14:48:55,133][model8_pretrain.py][INFO] Epoch:[0/2](878100/4588595) loss:2.920 lr:0.0000100 epoch_Time:23617.0min: [2024-01-06 14:48:55,133][model8_pretrain.py][INFO] Epoch:[0/2](878100/4588595) loss:2.640 lr:0.0000100 epoch_Time:23617.0min: [2024-01-06 14:49:32,083][model8_pretrain.py][INFO] Epoch:[0/2](878200/4588595) loss:3.155 lr:0.0000100 epoch_Time:23617.0min: [2024-01-06 14:49:32,083][model8_pretrain.py][INFO] Epoch:[0/2](878200/4588595) loss:2.854 lr:0.0000100 epoch_Time:23617.0min: [2024-01-06 14:49:32,083][model8_pretrain.py][INFO] Epoch:[0/2](878200/4588595) loss:2.233 lr:0.0000100 epoch_Time:23617.0min: [2024-01-06 14:49:32,083][model8_pretrain.py][INFO] Epoch:[0/2](878200/4588595) loss:3.013 lr:0.0000100 epoch_Time:23617.0min: [2024-01-06 14:49:32,083][model8_pretrain.py][INFO] Epoch:[0/2](878200/4588595) loss:3.185 lr:0.0000100 epoch_Time:23617.0min: [2024-01-06 14:49:32,083][model8_pretrain.py][INFO] Epoch:[0/2](878200/4588595) loss:3.094 lr:0.0000100 epoch_Time:23617.0min: [2024-01-06 14:49:32,083][model8_pretrain.py][INFO] Epoch:[0/2](878200/4588595) loss:2.723 lr:0.0000100 epoch_Time:23617.0min: [2024-01-06 14:49:32,084][model8_pretrain.py][INFO] Epoch:[0/2](878200/4588595) loss:3.126 lr:0.0000100 epoch_Time:23617.0min: [2024-01-06 14:50:14,255][model8_pretrain.py][INFO] Epoch:[0/2](878300/4588595) loss:2.528 lr:0.0000100 epoch_Time:23616.0min: [2024-01-06 14:50:14,255][model8_pretrain.py][INFO] Epoch:[0/2](878300/4588595) loss:2.530 lr:0.0000100 epoch_Time:23616.0min: [2024-01-06 14:50:14,255][model8_pretrain.py][INFO] Epoch:[0/2](878300/4588595) loss:3.025 lr:0.0000100 epoch_Time:23616.0min: [2024-01-06 14:50:14,256][model8_pretrain.py][INFO] Epoch:[0/2](878300/4588595) loss:2.875 lr:0.0000100 epoch_Time:23616.0min: [2024-01-06 
14:50:14,257][model8_pretrain.py][INFO] Epoch:[0/2](878300/4588595) loss:2.067 lr:0.0000100 epoch_Time:23616.0min: [2024-01-06 14:50:14,259][model8_pretrain.py][INFO] Epoch:[0/2](878300/4588595) loss:3.052 lr:0.0000100 epoch_Time:23616.0min: [2024-01-06 14:50:14,259][model8_pretrain.py][INFO] Epoch:[0/2](878300/4588595) loss:2.570 lr:0.0000100 epoch_Time:23616.0min: [2024-01-06 14:50:14,260][model8_pretrain.py][INFO] Epoch:[0/2](878300/4588595) loss:2.448 lr:0.0000100 epoch_Time:23616.0min: [2024-01-06 14:50:56,350][model8_pretrain.py][INFO] Epoch:[0/2](878400/4588595) loss:2.483 lr:0.0000100 epoch_Time:23616.0min: [2024-01-06 14:50:56,350][model8_pretrain.py][INFO] Epoch:[0/2](878400/4588595) loss:2.977 lr:0.0000100 epoch_Time:23616.0min: [2024-01-06 14:50:56,350][model8_pretrain.py][INFO] Epoch:[0/2](878400/4588595) loss:3.332 lr:0.0000100 epoch_Time:23616.0min: [2024-01-06 14:50:56,350][model8_pretrain.py][INFO] Epoch:[0/2](878400/4588595) loss:3.011 lr:0.0000100 epoch_Time:23616.0min: [2024-01-06 14:50:56,350][model8_pretrain.py][INFO] Epoch:[0/2](878400/4588595) loss:2.866 lr:0.0000100 epoch_Time:23616.0min: [2024-01-06 14:50:56,350][model8_pretrain.py][INFO] Epoch:[0/2](878400/4588595) loss:2.087 lr:0.0000100 epoch_Time:23616.0min: [2024-01-06 14:50:56,350][model8_pretrain.py][INFO] Epoch:[0/2](878400/4588595) loss:2.929 lr:0.0000100 epoch_Time:23616.0min: [2024-01-06 14:50:56,350][model8_pretrain.py][INFO] Epoch:[0/2](878400/4588595) loss:2.255 lr:0.0000100 epoch_Time:23616.0min: [2024-01-06 14:51:33,295][model8_pretrain.py][INFO] Epoch:[0/2](878500/4588595) loss:2.105 lr:0.0000100 epoch_Time:23615.0min: [2024-01-06 14:51:33,295][model8_pretrain.py][INFO] Epoch:[0/2](878500/4588595) loss:2.607 lr:0.0000100 epoch_Time:23615.0min: [2024-01-06 14:51:33,295][model8_pretrain.py][INFO] Epoch:[0/2](878500/4588595) loss:2.654 lr:0.0000100 epoch_Time:23615.0min: [2024-01-06 14:51:33,295][model8_pretrain.py][INFO] Epoch:[0/2](878500/4588595) loss:2.933 lr:0.0000100 
epoch_Time:23615.0min: [2024-01-06 14:51:33,295][model8_pretrain.py][INFO] Epoch:[0/2](878500/4588595) loss:2.946 lr:0.0000100 epoch_Time:23615.0min: [2024-01-06 14:51:33,295][model8_pretrain.py][INFO] Epoch:[0/2](878500/4588595) loss:2.265 lr:0.0000100 epoch_Time:23615.0min: [2024-01-06 14:51:33,295][model8_pretrain.py][INFO] Epoch:[0/2](878500/4588595) loss:2.714 lr:0.0000100 epoch_Time:23615.0min: [2024-01-06 14:51:33,295][model8_pretrain.py][INFO] Epoch:[0/2](878500/4588595) loss:2.657 lr:0.0000100 epoch_Time:23615.0min: [2024-01-06 14:52:10,233][model8_pretrain.py][INFO] Epoch:[0/2](878600/4588595) loss:2.208 lr:0.0000100 epoch_Time:23614.0min: [2024-01-06 14:52:10,233][model8_pretrain.py][INFO] Epoch:[0/2](878600/4588595) loss:3.156 lr:0.0000100 epoch_Time:23614.0min: [2024-01-06 14:52:10,233][model8_pretrain.py][INFO] Epoch:[0/2](878600/4588595) loss:3.206 lr:0.0000100 epoch_Time:23614.0min: [2024-01-06 14:52:10,233][model8_pretrain.py][INFO] Epoch:[0/2](878600/4588595) loss:2.780 lr:0.0000100 epoch_Time:23614.0min: [2024-01-06 14:52:10,233][model8_pretrain.py][INFO] Epoch:[0/2](878600/4588595) loss:3.185 lr:0.0000100 epoch_Time:23614.0min: [2024-01-06 14:52:10,233][model8_pretrain.py][INFO] Epoch:[0/2](878600/4588595) loss:2.712 lr:0.0000100 epoch_Time:23614.0min: [2024-01-06 14:52:10,233][model8_pretrain.py][INFO] Epoch:[0/2](878600/4588595) loss:2.355 lr:0.0000100 epoch_Time:23614.0min: [2024-01-06 14:52:10,233][model8_pretrain.py][INFO] Epoch:[0/2](878600/4588595) loss:3.008 lr:0.0000100 epoch_Time:23614.0min: [2024-01-06 14:52:47,168][model8_pretrain.py][INFO] Epoch:[0/2](878700/4588595) loss:3.114 lr:0.0000100 epoch_Time:23614.0min: [2024-01-06 14:52:47,168][model8_pretrain.py][INFO] Epoch:[0/2](878700/4588595) loss:3.199 lr:0.0000100 epoch_Time:23614.0min: [2024-01-06 14:52:47,168][model8_pretrain.py][INFO] Epoch:[0/2](878700/4588595) loss:3.447 lr:0.0000100 epoch_Time:23614.0min: [2024-01-06 14:52:47,168][model8_pretrain.py][INFO] 
Epoch:[0/2](878700/4588595) loss:3.124 lr:0.0000100 epoch_Time:23614.0min: [2024-01-06 14:52:47,168][model8_pretrain.py][INFO] Epoch:[0/2](878700/4588595) loss:2.749 lr:0.0000100 epoch_Time:23614.0min: [2024-01-06 14:52:47,168][model8_pretrain.py][INFO] Epoch:[0/2](878700/4588595) loss:2.743 lr:0.0000100 epoch_Time:23614.0min: [2024-01-06 14:52:47,168][model8_pretrain.py][INFO] Epoch:[0/2](878700/4588595) loss:2.873 lr:0.0000100 epoch_Time:23614.0min: [2024-01-06 14:52:47,169][model8_pretrain.py][INFO] Epoch:[0/2](878700/4588595) loss:2.845 lr:0.0000100 epoch_Time:23614.0min: [2024-01-06 14:53:24,118][model8_pretrain.py][INFO] Epoch:[0/2](878800/4588595) loss:3.072 lr:0.0000100 epoch_Time:23613.0min: [2024-01-06 14:53:24,118][model8_pretrain.py][INFO] Epoch:[0/2](878800/4588595) loss:1.548 lr:0.0000100 epoch_Time:23613.0min: [2024-01-06 14:53:24,118][model8_pretrain.py][INFO] Epoch:[0/2](878800/4588595) loss:2.806 lr:0.0000100 epoch_Time:23613.0min: [2024-01-06 14:53:24,118][model8_pretrain.py][INFO] Epoch:[0/2](878800/4588595) loss:3.069 lr:0.0000100 epoch_Time:23613.0min: [2024-01-06 14:53:24,118][model8_pretrain.py][INFO] Epoch:[0/2](878800/4588595) loss:3.012 lr:0.0000100 epoch_Time:23613.0min: [2024-01-06 14:53:24,118][model8_pretrain.py][INFO] Epoch:[0/2](878800/4588595) loss:2.508 lr:0.0000100 epoch_Time:23613.0min: [2024-01-06 14:53:24,118][model8_pretrain.py][INFO] Epoch:[0/2](878800/4588595) loss:2.808 lr:0.0000100 epoch_Time:23613.0min: [2024-01-06 14:53:24,118][model8_pretrain.py][INFO] Epoch:[0/2](878800/4588595) loss:2.664 lr:0.0000100 epoch_Time:23613.0min: [2024-01-06 14:54:01,069][model8_pretrain.py][INFO] Epoch:[0/2](878900/4588595) loss:2.344 lr:0.0000100 epoch_Time:23612.0min: [2024-01-06 14:54:01,069][model8_pretrain.py][INFO] Epoch:[0/2](878900/4588595) loss:2.913 lr:0.0000100 epoch_Time:23612.0min: [2024-01-06 14:54:01,069][model8_pretrain.py][INFO] Epoch:[0/2](878900/4588595) loss:3.097 lr:0.0000100 epoch_Time:23612.0min: [2024-01-06 
14:54:01,069][model8_pretrain.py][INFO] Epoch:[0/2](878900/4588595) loss:2.758 lr:0.0000100 epoch_Time:23612.0min: [2024-01-06 14:54:01,069][model8_pretrain.py][INFO] Epoch:[0/2](878900/4588595) loss:3.062 lr:0.0000100 epoch_Time:23612.0min: [2024-01-06 14:54:01,069][model8_pretrain.py][INFO] Epoch:[0/2](878900/4588595) loss:2.892 lr:0.0000100 epoch_Time:23612.0min: [2024-01-06 14:54:01,069][model8_pretrain.py][INFO] Epoch:[0/2](878900/4588595) loss:3.085 lr:0.0000100 epoch_Time:23612.0min: [2024-01-06 14:54:01,070][model8_pretrain.py][INFO] Epoch:[0/2](878900/4588595) loss:3.107 lr:0.0000100 epoch_Time:23612.0min: [2024-01-06 14:54:38,020][model8_pretrain.py][INFO] Epoch:[0/2](879000/4588595) loss:3.393 lr:0.0000100 epoch_Time:23612.0min: [2024-01-06 14:54:38,020][model8_pretrain.py][INFO] Epoch:[0/2](879000/4588595) loss:2.941 lr:0.0000100 epoch_Time:23612.0min: [2024-01-06 14:54:38,020][model8_pretrain.py][INFO] Epoch:[0/2](879000/4588595) loss:2.763 lr:0.0000100 epoch_Time:23612.0min: [2024-01-06 14:54:38,020][model8_pretrain.py][INFO] Epoch:[0/2](879000/4588595) loss:3.394 lr:0.0000100 epoch_Time:23612.0min: [2024-01-06 14:54:38,020][model8_pretrain.py][INFO] Epoch:[0/2](879000/4588595) loss:2.224 lr:0.0000100 epoch_Time:23612.0min: [2024-01-06 14:54:38,020][model8_pretrain.py][INFO] Epoch:[0/2](879000/4588595) loss:2.443 lr:0.0000100 epoch_Time:23612.0min: [2024-01-06 14:54:38,020][model8_pretrain.py][INFO] Epoch:[0/2](879000/4588595) loss:3.179 lr:0.0000100 epoch_Time:23612.0min: [2024-01-06 14:54:38,020][model8_pretrain.py][INFO] Epoch:[0/2](879000/4588595) loss:3.041 lr:0.0000100 epoch_Time:23612.0min: [2024-01-06 14:55:18,385][model8_pretrain.py][INFO] Epoch:[0/2](879100/4588595) loss:2.572 lr:0.0000100 epoch_Time:23611.0min: [2024-01-06 14:55:18,385][model8_pretrain.py][INFO] Epoch:[0/2](879100/4588595) loss:2.608 lr:0.0000100 epoch_Time:23611.0min: [2024-01-06 14:55:18,385][model8_pretrain.py][INFO] Epoch:[0/2](879100/4588595) loss:2.151 lr:0.0000100 
epoch_Time:23611.0min: [2024-01-06 14:55:18,385][model8_pretrain.py][INFO] Epoch:[0/2](879100/4588595) loss:2.817 lr:0.0000100 epoch_Time:23611.0min: [2024-01-06 14:55:18,385][model8_pretrain.py][INFO] Epoch:[0/2](879100/4588595) loss:2.816 lr:0.0000100 epoch_Time:23611.0min: [2024-01-06 14:55:18,385][model8_pretrain.py][INFO] Epoch:[0/2](879100/4588595) loss:2.517 lr:0.0000100 epoch_Time:23611.0min: [2024-01-06 14:55:18,385][model8_pretrain.py][INFO] Epoch:[0/2](879100/4588595) loss:2.755 lr:0.0000100 epoch_Time:23611.0min: [2024-01-06 14:55:18,385][model8_pretrain.py][INFO] Epoch:[0/2](879100/4588595) loss:2.918 lr:0.0000100 epoch_Time:23611.0min: [2024-01-06 14:56:02,385][model8_pretrain.py][INFO] Epoch:[0/2](879200/4588595) loss:3.029 lr:0.0000100 epoch_Time:23611.0min: [2024-01-06 14:56:02,386][model8_pretrain.py][INFO] Epoch:[0/2](879200/4588595) loss:2.804 lr:0.0000100 epoch_Time:23611.0min: [2024-01-06 14:56:02,386][model8_pretrain.py][INFO] Epoch:[0/2](879200/4588595) loss:2.846 lr:0.0000100 epoch_Time:23611.0min: [2024-01-06 14:56:02,386][model8_pretrain.py][INFO] Epoch:[0/2](879200/4588595) loss:2.353 lr:0.0000100 epoch_Time:23611.0min: [2024-01-06 14:56:02,386][model8_pretrain.py][INFO] Epoch:[0/2](879200/4588595) loss:3.035 lr:0.0000100 epoch_Time:23611.0min: [2024-01-06 14:56:02,386][model8_pretrain.py][INFO] Epoch:[0/2](879200/4588595) loss:2.839 lr:0.0000100 epoch_Time:23611.0min: [2024-01-06 14:56:02,386][model8_pretrain.py][INFO] Epoch:[0/2](879200/4588595) loss:2.973 lr:0.0000100 epoch_Time:23611.0min: [2024-01-06 14:56:02,386][model8_pretrain.py][INFO] Epoch:[0/2](879200/4588595) loss:3.199 lr:0.0000100 epoch_Time:23611.0min: [2024-01-06 14:56:39,323][model8_pretrain.py][INFO] Epoch:[0/2](879300/4588595) loss:2.849 lr:0.0000100 epoch_Time:23610.0min: [2024-01-06 14:56:39,323][model8_pretrain.py][INFO] Epoch:[0/2](879300/4588595) loss:2.931 lr:0.0000100 epoch_Time:23610.0min: [2024-01-06 14:56:39,323][model8_pretrain.py][INFO] 
Epoch:[0/2](879300/4588595) loss:2.809 lr:0.0000100 epoch_Time:23610.0min: [2024-01-06 14:56:39,323][model8_pretrain.py][INFO] Epoch:[0/2](879300/4588595) loss:3.091 lr:0.0000100 epoch_Time:23610.0min: [2024-01-06 14:56:39,323][model8_pretrain.py][INFO] Epoch:[0/2](879300/4588595) loss:3.016 lr:0.0000100 epoch_Time:23610.0min: [2024-01-06 14:56:39,323][model8_pretrain.py][INFO] Epoch:[0/2](879300/4588595) loss:2.732 lr:0.0000100 epoch_Time:23610.0min: [2024-01-06 14:56:39,323][model8_pretrain.py][INFO] Epoch:[0/2](879300/4588595) loss:2.821 lr:0.0000100 epoch_Time:23610.0min: [2024-01-06 14:56:39,323][model8_pretrain.py][INFO] Epoch:[0/2](879300/4588595) loss:2.150 lr:0.0000100 epoch_Time:23610.0min: [2024-01-06 14:57:16,268][model8_pretrain.py][INFO] Epoch:[0/2](879400/4588595) loss:3.200 lr:0.0000100 epoch_Time:23609.0min: [2024-01-06 14:57:16,268][model8_pretrain.py][INFO] Epoch:[0/2](879400/4588595) loss:2.600 lr:0.0000100 epoch_Time:23609.0min: [2024-01-06 14:57:16,268][model8_pretrain.py][INFO] Epoch:[0/2](879400/4588595) loss:2.728 lr:0.0000100 epoch_Time:23609.0min: [2024-01-06 14:57:16,268][model8_pretrain.py][INFO] Epoch:[0/2](879400/4588595) loss:3.124 lr:0.0000100 epoch_Time:23609.0min: [2024-01-06 14:57:16,268][model8_pretrain.py][INFO] Epoch:[0/2](879400/4588595) loss:3.104 lr:0.0000100 epoch_Time:23609.0min: [2024-01-06 14:57:16,268][model8_pretrain.py][INFO] Epoch:[0/2](879400/4588595) loss:3.119 lr:0.0000100 epoch_Time:23609.0min: [2024-01-06 14:57:16,269][model8_pretrain.py][INFO] Epoch:[0/2](879400/4588595) loss:2.958 lr:0.0000100 epoch_Time:23609.0min: [2024-01-06 14:57:16,269][model8_pretrain.py][INFO] Epoch:[0/2](879400/4588595) loss:2.770 lr:0.0000100 epoch_Time:23609.0min: [2024-01-06 14:57:53,226][model8_pretrain.py][INFO] Epoch:[0/2](879500/4588595) loss:2.558 lr:0.0000100 epoch_Time:23608.0min: [2024-01-06 14:57:53,226][model8_pretrain.py][INFO] Epoch:[0/2](879500/4588595) loss:2.500 lr:0.0000100 epoch_Time:23608.0min: [2024-01-06 
14:57:53,226][model8_pretrain.py][INFO] Epoch:[0/2](879500/4588595) loss:2.938 lr:0.0000100 epoch_Time:23608.0min: [2024-01-06 14:57:53,226][model8_pretrain.py][INFO] Epoch:[0/2](879500/4588595) loss:2.898 lr:0.0000100 epoch_Time:23608.0min: [2024-01-06 14:57:53,226][model8_pretrain.py][INFO] Epoch:[0/2](879500/4588595) loss:3.222 lr:0.0000100 epoch_Time:23608.0min: [2024-01-06 14:57:53,226][model8_pretrain.py][INFO] Epoch:[0/2](879500/4588595) loss:2.350 lr:0.0000100 epoch_Time:23608.0min: [2024-01-06 14:57:53,226][model8_pretrain.py][INFO] Epoch:[0/2](879500/4588595) loss:2.840 lr:0.0000100 epoch_Time:23608.0min: [2024-01-06 14:57:53,227][model8_pretrain.py][INFO] Epoch:[0/2](879500/4588595) loss:2.661 lr:0.0000100 epoch_Time:23608.0min: [2024-01-06 14:58:30,192][model8_pretrain.py][INFO] Epoch:[0/2](879600/4588595) loss:2.780 lr:0.0000100 epoch_Time:23608.0min: [2024-01-06 14:58:30,192][model8_pretrain.py][INFO] Epoch:[0/2](879600/4588595) loss:2.999 lr:0.0000100 epoch_Time:23608.0min: [2024-01-06 14:58:30,192][model8_pretrain.py][INFO] Epoch:[0/2](879600/4588595) loss:2.187 lr:0.0000100 epoch_Time:23608.0min: [2024-01-06 14:58:30,192][model8_pretrain.py][INFO] Epoch:[0/2](879600/4588595) loss:2.866 lr:0.0000100 epoch_Time:23608.0min: [2024-01-06 14:58:30,192][model8_pretrain.py][INFO] Epoch:[0/2](879600/4588595) loss:3.378 lr:0.0000100 epoch_Time:23608.0min: [2024-01-06 14:58:30,192][model8_pretrain.py][INFO] Epoch:[0/2](879600/4588595) loss:2.697 lr:0.0000100 epoch_Time:23608.0min: [2024-01-06 14:58:30,192][model8_pretrain.py][INFO] Epoch:[0/2](879600/4588595) loss:2.351 lr:0.0000100 epoch_Time:23608.0min: [2024-01-06 14:58:30,192][model8_pretrain.py][INFO] Epoch:[0/2](879600/4588595) loss:2.514 lr:0.0000100 epoch_Time:23608.0min: [2024-01-06 14:59:07,253][model8_pretrain.py][INFO] Epoch:[0/2](879700/4588595) loss:2.574 lr:0.0000100 epoch_Time:23607.0min: [2024-01-06 14:59:07,254][model8_pretrain.py][INFO] Epoch:[0/2](879700/4588595) loss:2.369 lr:0.0000100 
epoch_Time:23607.0min: [2024-01-06 14:59:07,254][model8_pretrain.py][INFO] Epoch:[0/2](879700/4588595) loss:2.311 lr:0.0000100 epoch_Time:23607.0min: [2024-01-06 14:59:07,254][model8_pretrain.py][INFO] Epoch:[0/2](879700/4588595) loss:3.122 lr:0.0000100 epoch_Time:23607.0min: [2024-01-06 14:59:07,254][model8_pretrain.py][INFO] Epoch:[0/2](879700/4588595) loss:2.818 lr:0.0000100 epoch_Time:23607.0min: [2024-01-06 14:59:07,254][model8_pretrain.py][INFO] Epoch:[0/2](879700/4588595) loss:2.379 lr:0.0000100 epoch_Time:23607.0min: [2024-01-06 14:59:07,254][model8_pretrain.py][INFO] Epoch:[0/2](879700/4588595) loss:2.534 lr:0.0000100 epoch_Time:23607.0min: [2024-01-06 14:59:07,254][model8_pretrain.py][INFO] Epoch:[0/2](879700/4588595) loss:3.125 lr:0.0000100 epoch_Time:23607.0min: [2024-01-06 14:59:44,198][model8_pretrain.py][INFO] Epoch:[0/2](879800/4588595) loss:2.944 lr:0.0000100 epoch_Time:23607.0min: [2024-01-06 14:59:44,198][model8_pretrain.py][INFO] Epoch:[0/2](879800/4588595) loss:2.943 lr:0.0000100 epoch_Time:23607.0min: [2024-01-06 14:59:44,198][model8_pretrain.py][INFO] Epoch:[0/2](879800/4588595) loss:3.035 lr:0.0000100 epoch_Time:23607.0min: [2024-01-06 14:59:44,198][model8_pretrain.py][INFO] Epoch:[0/2](879800/4588595) loss:2.719 lr:0.0000100 epoch_Time:23607.0min: [2024-01-06 14:59:44,198][model8_pretrain.py][INFO] Epoch:[0/2](879800/4588595) loss:2.763 lr:0.0000100 epoch_Time:23607.0min: [2024-01-06 14:59:44,198][model8_pretrain.py][INFO] Epoch:[0/2](879800/4588595) loss:3.029 lr:0.0000100 epoch_Time:23607.0min: [2024-01-06 14:59:44,199][model8_pretrain.py][INFO] Epoch:[0/2](879800/4588595) loss:2.796 lr:0.0000100 epoch_Time:23607.0min: [2024-01-06 14:59:44,199][model8_pretrain.py][INFO] Epoch:[0/2](879800/4588595) loss:2.848 lr:0.0000100 epoch_Time:23607.0min: [2024-01-06 15:00:22,852][model8_pretrain.py][INFO] Epoch:[0/2](879900/4588595) loss:2.935 lr:0.0000100 epoch_Time:23606.0min: [2024-01-06 15:00:22,852][model8_pretrain.py][INFO] 
Epoch:[0/2](879900/4588595) loss:2.406 lr:0.0000100 epoch_Time:23606.0min: [2024-01-06 15:00:22,852][model8_pretrain.py][INFO] Epoch:[0/2](879900/4588595) loss:2.820 lr:0.0000100 epoch_Time:23606.0min: [2024-01-06 15:00:22,852][model8_pretrain.py][INFO] Epoch:[0/2](879900/4588595) loss:2.868 lr:0.0000100 epoch_Time:23606.0min: [2024-01-06 15:00:22,852][model8_pretrain.py][INFO] Epoch:[0/2](879900/4588595) loss:2.886 lr:0.0000100 epoch_Time:23606.0min: [2024-01-06 15:00:22,852][model8_pretrain.py][INFO] Epoch:[0/2](879900/4588595) loss:2.371 lr:0.0000100 epoch_Time:23606.0min: [2024-01-06 15:00:22,852][model8_pretrain.py][INFO] Epoch:[0/2](879900/4588595) loss:2.548 lr:0.0000100 epoch_Time:23606.0min: [2024-01-06 15:00:22,852][model8_pretrain.py][INFO] Epoch:[0/2](879900/4588595) loss:3.174 lr:0.0000100 epoch_Time:23606.0min: [2024-01-06 15:01:08,576][model8_pretrain.py][INFO] Epoch:[0/2](880000/4588595) loss:3.046 lr:0.0000100 epoch_Time:23606.0min: [2024-01-06 15:01:08,576][model8_pretrain.py][INFO] Epoch:[0/2](880000/4588595) loss:2.983 lr:0.0000100 epoch_Time:23606.0min: [2024-01-06 15:01:08,576][model8_pretrain.py][INFO] Epoch:[0/2](880000/4588595) loss:2.709 lr:0.0000100 epoch_Time:23606.0min: [2024-01-06 15:01:08,576][model8_pretrain.py][INFO] Epoch:[0/2](880000/4588595) loss:2.767 lr:0.0000100 epoch_Time:23606.0min: [2024-01-06 15:01:08,576][model8_pretrain.py][INFO] Epoch:[0/2](880000/4588595) loss:2.322 lr:0.0000100 epoch_Time:23606.0min: [2024-01-06 15:01:08,577][model8_pretrain.py][INFO] Epoch:[0/2](880000/4588595) loss:2.667 lr:0.0000100 epoch_Time:23606.0min: [2024-01-06 15:01:08,577][model8_pretrain.py][INFO] Epoch:[0/2](880000/4588595) loss:2.780 lr:0.0000100 epoch_Time:23606.0min: [2024-01-06 15:01:08,577][model8_pretrain.py][INFO] Epoch:[0/2](880000/4588595) loss:2.882 lr:0.0000100 epoch_Time:23606.0min: [2024-01-06 15:01:45,514][model8_pretrain.py][INFO] Epoch:[0/2](880100/4588595) loss:2.741 lr:0.0000100 epoch_Time:23606.0min: [2024-01-06 
15:01:45,514][model8_pretrain.py][INFO] Epoch:[0/2](880100/4588595) loss:3.073 lr:0.0000100 epoch_Time:23606.0min: [2024-01-06 15:01:45,514][model8_pretrain.py][INFO] Epoch:[0/2](880100/4588595) loss:3.039 lr:0.0000100 epoch_Time:23606.0min: [2024-01-06 15:01:45,514][model8_pretrain.py][INFO] Epoch:[0/2](880100/4588595) loss:2.695 lr:0.0000100 epoch_Time:23606.0min: [2024-01-06 15:01:45,514][model8_pretrain.py][INFO] Epoch:[0/2](880100/4588595) loss:2.891 lr:0.0000100 epoch_Time:23606.0min: [2024-01-06 15:01:45,514][model8_pretrain.py][INFO] Epoch:[0/2](880100/4588595) loss:2.713 lr:0.0000100 epoch_Time:23606.0min: [2024-01-06 15:01:45,514][model8_pretrain.py][INFO] Epoch:[0/2](880100/4588595) loss:2.698 lr:0.0000100 epoch_Time:23606.0min: [2024-01-06 15:01:45,514][model8_pretrain.py][INFO] Epoch:[0/2](880100/4588595) loss:3.461 lr:0.0000100 epoch_Time:23606.0min: [2024-01-06 15:02:22,465][model8_pretrain.py][INFO] Epoch:[0/2](880200/4588595) loss:3.265 lr:0.0000100 epoch_Time:23604.0min: [2024-01-06 15:02:22,465][model8_pretrain.py][INFO] Epoch:[0/2](880200/4588595) loss:3.107 lr:0.0000100 epoch_Time:23604.0min: [2024-01-06 15:02:22,465][model8_pretrain.py][INFO] Epoch:[0/2](880200/4588595) loss:3.021 lr:0.0000100 epoch_Time:23604.0min: [2024-01-06 15:02:22,465][model8_pretrain.py][INFO] Epoch:[0/2](880200/4588595) loss:3.075 lr:0.0000100 epoch_Time:23604.0min: [2024-01-06 15:02:22,465][model8_pretrain.py][INFO] Epoch:[0/2](880200/4588595) loss:2.661 lr:0.0000100 epoch_Time:23604.0min: [2024-01-06 15:02:22,465][model8_pretrain.py][INFO] Epoch:[0/2](880200/4588595) loss:3.335 lr:0.0000100 epoch_Time:23604.0min: [2024-01-06 15:02:22,465][model8_pretrain.py][INFO] Epoch:[0/2](880200/4588595) loss:2.913 lr:0.0000100 epoch_Time:23604.0min: [2024-01-06 15:02:22,465][model8_pretrain.py][INFO] Epoch:[0/2](880200/4588595) loss:3.258 lr:0.0000100 epoch_Time:23604.0min: [2024-01-06 15:02:59,405][model8_pretrain.py][INFO] Epoch:[0/2](880300/4588595) loss:2.805 lr:0.0000100 
epoch_Time:23603.0min: [2024-01-06 15:02:59,405][model8_pretrain.py][INFO] Epoch:[0/2](880300/4588595) loss:2.744 lr:0.0000100 epoch_Time:23603.0min: [2024-01-06 15:02:59,405][model8_pretrain.py][INFO] Epoch:[0/2](880300/4588595) loss:2.696 lr:0.0000100 epoch_Time:23603.0min: [2024-01-06 15:02:59,405][model8_pretrain.py][INFO] Epoch:[0/2](880300/4588595) loss:2.967 lr:0.0000100 epoch_Time:23603.0min: [2024-01-06 15:02:59,405][model8_pretrain.py][INFO] Epoch:[0/2](880300/4588595) loss:3.253 lr:0.0000100 epoch_Time:23603.0min: [2024-01-06 15:02:59,405][model8_pretrain.py][INFO] Epoch:[0/2](880300/4588595) loss:3.211 lr:0.0000100 epoch_Time:23603.0min: [2024-01-06 15:02:59,406][model8_pretrain.py][INFO] Epoch:[0/2](880300/4588595) loss:1.911 lr:0.0000100 epoch_Time:23603.0min: [2024-01-06 15:02:59,406][model8_pretrain.py][INFO] Epoch:[0/2](880300/4588595) loss:3.179 lr:0.0000100 epoch_Time:23603.0min: [2024-01-06 15:03:36,349][model8_pretrain.py][INFO] Epoch:[0/2](880400/4588595) loss:2.983 lr:0.0000100 epoch_Time:23603.0min: [2024-01-06 15:03:36,349][model8_pretrain.py][INFO] Epoch:[0/2](880400/4588595) loss:3.063 lr:0.0000100 epoch_Time:23603.0min: [2024-01-06 15:03:36,349][model8_pretrain.py][INFO] Epoch:[0/2](880400/4588595) loss:2.892 lr:0.0000100 epoch_Time:23603.0min: [2024-01-06 15:03:36,349][model8_pretrain.py][INFO] Epoch:[0/2](880400/4588595) loss:2.585 lr:0.0000100 epoch_Time:23603.0min: [2024-01-06 15:03:36,349][model8_pretrain.py][INFO] Epoch:[0/2](880400/4588595) loss:2.980 lr:0.0000100 epoch_Time:23603.0min: [2024-01-06 15:03:36,349][model8_pretrain.py][INFO] Epoch:[0/2](880400/4588595) loss:2.649 lr:0.0000100 epoch_Time:23603.0min: [2024-01-06 15:03:36,349][model8_pretrain.py][INFO] Epoch:[0/2](880400/4588595) loss:2.895 lr:0.0000100 epoch_Time:23603.0min: [2024-01-06 15:03:36,349][model8_pretrain.py][INFO] Epoch:[0/2](880400/4588595) loss:2.927 lr:0.0000100 epoch_Time:23603.0min: [2024-01-06 15:04:13,286][model8_pretrain.py][INFO] 
Epoch:[0/2](880500/4588595) loss:3.269 lr:0.0000100 epoch_Time:23602.0min: [2024-01-06 15:04:13,286][model8_pretrain.py][INFO] Epoch:[0/2](880500/4588595) loss:3.226 lr:0.0000100 epoch_Time:23602.0min: [2024-01-06 15:04:13,286][model8_pretrain.py][INFO] Epoch:[0/2](880500/4588595) loss:3.076 lr:0.0000100 epoch_Time:23602.0min: [2024-01-06 15:04:13,286][model8_pretrain.py][INFO] Epoch:[0/2](880500/4588595) loss:2.453 lr:0.0000100 epoch_Time:23602.0min: [2024-01-06 15:04:13,286][model8_pretrain.py][INFO] Epoch:[0/2](880500/4588595) loss:2.634 lr:0.0000100 epoch_Time:23602.0min: [2024-01-06 15:04:13,287][model8_pretrain.py][INFO] Epoch:[0/2](880500/4588595) loss:2.660 lr:0.0000100 epoch_Time:23602.0min: [2024-01-06 15:04:13,287][model8_pretrain.py][INFO] Epoch:[0/2](880500/4588595) loss:2.872 lr:0.0000100 epoch_Time:23602.0min: [2024-01-06 15:04:13,287][model8_pretrain.py][INFO] Epoch:[0/2](880500/4588595) loss:2.828 lr:0.0000100 epoch_Time:23602.0min: [2024-01-06 15:04:50,231][model8_pretrain.py][INFO] Epoch:[0/2](880600/4588595) loss:3.327 lr:0.0000100 epoch_Time:23601.0min: [2024-01-06 15:04:50,231][model8_pretrain.py][INFO] Epoch:[0/2](880600/4588595) loss:2.568 lr:0.0000100 epoch_Time:23601.0min: [2024-01-06 15:04:50,231][model8_pretrain.py][INFO] Epoch:[0/2](880600/4588595) loss:2.815 lr:0.0000100 epoch_Time:23601.0min: [2024-01-06 15:04:50,231][model8_pretrain.py][INFO] Epoch:[0/2](880600/4588595) loss:2.642 lr:0.0000100 epoch_Time:23601.0min: [2024-01-06 15:04:50,231][model8_pretrain.py][INFO] Epoch:[0/2](880600/4588595) loss:2.756 lr:0.0000100 epoch_Time:23601.0min: [2024-01-06 15:04:50,231][model8_pretrain.py][INFO] Epoch:[0/2](880600/4588595) loss:2.142 lr:0.0000100 epoch_Time:23601.0min: [2024-01-06 15:04:50,231][model8_pretrain.py][INFO] Epoch:[0/2](880600/4588595) loss:2.398 lr:0.0000100 epoch_Time:23601.0min: [2024-01-06 15:04:50,231][model8_pretrain.py][INFO] Epoch:[0/2](880600/4588595) loss:3.437 lr:0.0000100 epoch_Time:23601.0min: [2024-01-06 
15:05:28,894][model8_pretrain.py][INFO] Epoch:[0/2](880700/4588595) loss:2.262 lr:0.0000100 epoch_Time:23601.0min: [2024-01-06 15:05:28,894][model8_pretrain.py][INFO] Epoch:[0/2](880700/4588595) loss:2.669 lr:0.0000100 epoch_Time:23601.0min: [2024-01-06 15:05:28,894][model8_pretrain.py][INFO] Epoch:[0/2](880700/4588595) loss:2.732 lr:0.0000100 epoch_Time:23601.0min: [2024-01-06 15:05:28,894][model8_pretrain.py][INFO] Epoch:[0/2](880700/4588595) loss:2.531 lr:0.0000100 epoch_Time:23601.0min: [2024-01-06 15:05:28,894][model8_pretrain.py][INFO] Epoch:[0/2](880700/4588595) loss:3.511 lr:0.0000100 epoch_Time:23601.0min: [2024-01-06 15:05:28,894][model8_pretrain.py][INFO] Epoch:[0/2](880700/4588595) loss:2.988 lr:0.0000100 epoch_Time:23601.0min: [2024-01-06 15:05:28,894][model8_pretrain.py][INFO] Epoch:[0/2](880700/4588595) loss:2.681 lr:0.0000100 epoch_Time:23601.0min: [2024-01-06 15:05:28,894][model8_pretrain.py][INFO] Epoch:[0/2](880700/4588595) loss:2.804 lr:0.0000100 epoch_Time:23601.0min: [2024-01-06 15:06:14,540][model8_pretrain.py][INFO] Epoch:[0/2](880800/4588595) loss:2.424 lr:0.0000100 epoch_Time:23601.0min: [2024-01-06 15:06:14,540][model8_pretrain.py][INFO] Epoch:[0/2](880800/4588595) loss:3.236 lr:0.0000100 epoch_Time:23601.0min: [2024-01-06 15:06:14,540][model8_pretrain.py][INFO] Epoch:[0/2](880800/4588595) loss:2.471 lr:0.0000100 epoch_Time:23601.0min: [2024-01-06 15:06:14,540][model8_pretrain.py][INFO] Epoch:[0/2](880800/4588595) loss:2.295 lr:0.0000100 epoch_Time:23601.0min: [2024-01-06 15:06:14,540][model8_pretrain.py][INFO] Epoch:[0/2](880800/4588595) loss:2.944 lr:0.0000100 epoch_Time:23601.0min: [2024-01-06 15:06:14,540][model8_pretrain.py][INFO] Epoch:[0/2](880800/4588595) loss:2.712 lr:0.0000100 epoch_Time:23601.0min: [2024-01-06 15:06:14,540][model8_pretrain.py][INFO] Epoch:[0/2](880800/4588595) loss:2.391 lr:0.0000100 epoch_Time:23601.0min: [2024-01-06 15:06:14,540][model8_pretrain.py][INFO] Epoch:[0/2](880800/4588595) loss:2.841 lr:0.0000100 
epoch_Time:23601.0min: [2024-01-06 15:06:51,483][model8_pretrain.py][INFO] Epoch:[0/2](880900/4588595) loss:2.997 lr:0.0000100 epoch_Time:23600.0min: [2024-01-06 15:06:51,483][model8_pretrain.py][INFO] Epoch:[0/2](880900/4588595) loss:3.472 lr:0.0000100 epoch_Time:23600.0min: [2024-01-06 15:06:51,483][model8_pretrain.py][INFO] Epoch:[0/2](880900/4588595) loss:2.635 lr:0.0000100 epoch_Time:23600.0min: [2024-01-06 15:06:51,483][model8_pretrain.py][INFO] Epoch:[0/2](880900/4588595) loss:2.622 lr:0.0000100 epoch_Time:23600.0min: [2024-01-06 15:06:51,483][model8_pretrain.py][INFO] Epoch:[0/2](880900/4588595) loss:2.518 lr:0.0000100 epoch_Time:23600.0min: [2024-01-06 15:06:51,483][model8_pretrain.py][INFO] Epoch:[0/2](880900/4588595) loss:2.581 lr:0.0000100 epoch_Time:23600.0min: [2024-01-06 15:06:51,483][model8_pretrain.py][INFO] Epoch:[0/2](880900/4588595) loss:2.879 lr:0.0000100 epoch_Time:23600.0min: [2024-01-06 15:06:51,483][model8_pretrain.py][INFO] Epoch:[0/2](880900/4588595) loss:1.878 lr:0.0000100 epoch_Time:23600.0min: [2024-01-06 15:07:28,435][model8_pretrain.py][INFO] Epoch:[0/2](881000/4588595) loss:3.017 lr:0.0000100 epoch_Time:23599.0min: [2024-01-06 15:07:28,435][model8_pretrain.py][INFO] Epoch:[0/2](881000/4588595) loss:2.758 lr:0.0000100 epoch_Time:23599.0min: [2024-01-06 15:07:28,435][model8_pretrain.py][INFO] Epoch:[0/2](881000/4588595) loss:3.053 lr:0.0000100 epoch_Time:23599.0min: [2024-01-06 15:07:28,435][model8_pretrain.py][INFO] Epoch:[0/2](881000/4588595) loss:3.341 lr:0.0000100 epoch_Time:23599.0min: [2024-01-06 15:07:28,435][model8_pretrain.py][INFO] Epoch:[0/2](881000/4588595) loss:3.296 lr:0.0000100 epoch_Time:23599.0min: [2024-01-06 15:07:28,435][model8_pretrain.py][INFO] Epoch:[0/2](881000/4588595) loss:3.149 lr:0.0000100 epoch_Time:23599.0min: [2024-01-06 15:07:28,435][model8_pretrain.py][INFO] Epoch:[0/2](881000/4588595) loss:2.855 lr:0.0000100 epoch_Time:23599.0min: [2024-01-06 15:07:28,435][model8_pretrain.py][INFO] 
Epoch:[0/2](881000/4588595) loss:3.246 lr:0.0000100 epoch_Time:23599.0min: [2024-01-06 15:08:05,371][model8_pretrain.py][INFO] Epoch:[0/2](881100/4588595) loss:2.819 lr:0.0000100 epoch_Time:23598.0min: [2024-01-06 15:08:05,371][model8_pretrain.py][INFO] Epoch:[0/2](881100/4588595) loss:2.007 lr:0.0000100 epoch_Time:23598.0min: [2024-01-06 15:08:05,371][model8_pretrain.py][INFO] Epoch:[0/2](881100/4588595) loss:2.919 lr:0.0000100 epoch_Time:23598.0min: [2024-01-06 15:08:05,371][model8_pretrain.py][INFO] Epoch:[0/2](881100/4588595) loss:2.927 lr:0.0000100 epoch_Time:23598.0min: [2024-01-06 15:08:05,371][model8_pretrain.py][INFO] Epoch:[0/2](881100/4588595) loss:2.408 lr:0.0000100 epoch_Time:23598.0min: [2024-01-06 15:08:05,371][model8_pretrain.py][INFO] Epoch:[0/2](881100/4588595) loss:3.265 lr:0.0000100 epoch_Time:23598.0min: [2024-01-06 15:08:05,371][model8_pretrain.py][INFO] Epoch:[0/2](881100/4588595) loss:2.812 lr:0.0000100 epoch_Time:23598.0min: [2024-01-06 15:08:05,372][model8_pretrain.py][INFO] Epoch:[0/2](881100/4588595) loss:2.936 lr:0.0000100 epoch_Time:23598.0min: [2024-01-06 15:08:42,301][model8_pretrain.py][INFO] Epoch:[0/2](881200/4588595) loss:3.074 lr:0.0000100 epoch_Time:23598.0min: [2024-01-06 15:08:42,301][model8_pretrain.py][INFO] Epoch:[0/2](881200/4588595) loss:2.724 lr:0.0000100 epoch_Time:23598.0min: [2024-01-06 15:08:42,301][model8_pretrain.py][INFO] Epoch:[0/2](881200/4588595) loss:3.152 lr:0.0000100 epoch_Time:23598.0min: [2024-01-06 15:08:42,301][model8_pretrain.py][INFO] Epoch:[0/2](881200/4588595) loss:2.811 lr:0.0000100 epoch_Time:23598.0min: [2024-01-06 15:08:42,301][model8_pretrain.py][INFO] Epoch:[0/2](881200/4588595) loss:3.092 lr:0.0000100 epoch_Time:23598.0min: [2024-01-06 15:08:42,301][model8_pretrain.py][INFO] Epoch:[0/2](881200/4588595) loss:2.714 lr:0.0000100 epoch_Time:23598.0min: [2024-01-06 15:08:42,301][model8_pretrain.py][INFO] Epoch:[0/2](881200/4588595) loss:2.585 lr:0.0000100 epoch_Time:23598.0min: [2024-01-06 
15:08:42,301][model8_pretrain.py][INFO] Epoch:[0/2](881200/4588595) loss:2.683 lr:0.0000100 epoch_Time:23598.0min: [2024-01-06 15:09:19,235][model8_pretrain.py][INFO] Epoch:[0/2](881300/4588595) loss:3.054 lr:0.0000100 epoch_Time:23597.0min: [2024-01-06 15:09:19,235][model8_pretrain.py][INFO] Epoch:[0/2](881300/4588595) loss:3.128 lr:0.0000100 epoch_Time:23597.0min: [2024-01-06 15:09:19,235][model8_pretrain.py][INFO] Epoch:[0/2](881300/4588595) loss:2.386 lr:0.0000100 epoch_Time:23597.0min: [2024-01-06 15:09:19,235][model8_pretrain.py][INFO] Epoch:[0/2](881300/4588595) loss:2.033 lr:0.0000100 epoch_Time:23597.0min: [2024-01-06 15:09:19,235][model8_pretrain.py][INFO] Epoch:[0/2](881300/4588595) loss:3.167 lr:0.0000100 epoch_Time:23597.0min: [2024-01-06 15:09:19,235][model8_pretrain.py][INFO] Epoch:[0/2](881300/4588595) loss:2.742 lr:0.0000100 epoch_Time:23597.0min: [2024-01-06 15:09:19,235][model8_pretrain.py][INFO] Epoch:[0/2](881300/4588595) loss:3.040 lr:0.0000100 epoch_Time:23597.0min: [2024-01-06 15:09:19,235][model8_pretrain.py][INFO] Epoch:[0/2](881300/4588595) loss:2.716 lr:0.0000100 epoch_Time:23597.0min: [2024-01-06 15:09:56,134][model8_pretrain.py][INFO] Epoch:[0/2](881400/4588595) loss:2.587 lr:0.0000100 epoch_Time:23596.0min: [2024-01-06 15:09:56,134][model8_pretrain.py][INFO] Epoch:[0/2](881400/4588595) loss:2.651 lr:0.0000100 epoch_Time:23596.0min: [2024-01-06 15:09:56,134][model8_pretrain.py][INFO] Epoch:[0/2](881400/4588595) loss:3.092 lr:0.0000100 epoch_Time:23596.0min: [2024-01-06 15:09:56,134][model8_pretrain.py][INFO] Epoch:[0/2](881400/4588595) loss:2.369 lr:0.0000100 epoch_Time:23596.0min: [2024-01-06 15:09:56,134][model8_pretrain.py][INFO] Epoch:[0/2](881400/4588595) loss:2.628 lr:0.0000100 epoch_Time:23596.0min: [2024-01-06 15:09:56,134][model8_pretrain.py][INFO] Epoch:[0/2](881400/4588595) loss:2.996 lr:0.0000100 epoch_Time:23596.0min: [2024-01-06 15:09:56,134][model8_pretrain.py][INFO] Epoch:[0/2](881400/4588595) loss:2.313 lr:0.0000100 
epoch_Time:23596.0min: [2024-01-06 15:09:56,134][model8_pretrain.py][INFO] Epoch:[0/2](881400/4588595) loss:2.934 lr:0.0000100 epoch_Time:23596.0min: [2024-01-06 15:10:34,803][model8_pretrain.py][INFO] Epoch:[0/2](881500/4588595) loss:2.465 lr:0.0000100 epoch_Time:23596.0min: [2024-01-06 15:10:34,803][model8_pretrain.py][INFO] Epoch:[0/2](881500/4588595) loss:2.186 lr:0.0000100 epoch_Time:23596.0min: [2024-01-06 15:10:34,803][model8_pretrain.py][INFO] Epoch:[0/2](881500/4588595) loss:2.970 lr:0.0000100 epoch_Time:23596.0min: [2024-01-06 15:10:34,803][model8_pretrain.py][INFO] Epoch:[0/2](881500/4588595) loss:2.349 lr:0.0000100 epoch_Time:23596.0min: [2024-01-06 15:10:34,803][model8_pretrain.py][INFO] Epoch:[0/2](881500/4588595) loss:2.275 lr:0.0000100 epoch_Time:23596.0min: [2024-01-06 15:10:34,803][model8_pretrain.py][INFO] Epoch:[0/2](881500/4588595) loss:2.781 lr:0.0000100 epoch_Time:23596.0min: [2024-01-06 15:10:34,803][model8_pretrain.py][INFO] Epoch:[0/2](881500/4588595) loss:2.989 lr:0.0000100 epoch_Time:23596.0min: [2024-01-06 15:10:34,803][model8_pretrain.py][INFO] Epoch:[0/2](881500/4588595) loss:2.638 lr:0.0000100 epoch_Time:23596.0min: [2024-01-06 15:11:20,362][model8_pretrain.py][INFO] Epoch:[0/2](881600/4588595) loss:2.177 lr:0.0000100 epoch_Time:23596.0min: [2024-01-06 15:11:20,362][model8_pretrain.py][INFO] Epoch:[0/2](881600/4588595) loss:2.797 lr:0.0000100 epoch_Time:23596.0min: [2024-01-06 15:11:20,362][model8_pretrain.py][INFO] Epoch:[0/2](881600/4588595) loss:2.804 lr:0.0000100 epoch_Time:23596.0min: [2024-01-06 15:11:20,362][model8_pretrain.py][INFO] Epoch:[0/2](881600/4588595) loss:3.009 lr:0.0000100 epoch_Time:23596.0min: [2024-01-06 15:11:20,362][model8_pretrain.py][INFO] Epoch:[0/2](881600/4588595) loss:3.067 lr:0.0000100 epoch_Time:23596.0min: [2024-01-06 15:11:20,362][model8_pretrain.py][INFO] Epoch:[0/2](881600/4588595) loss:2.189 lr:0.0000100 epoch_Time:23596.0min: [2024-01-06 15:11:20,362][model8_pretrain.py][INFO] 
Epoch:[0/2](881600/4588595) loss:3.000 lr:0.0000100 epoch_Time:23596.0min: [2024-01-06 15:11:20,363][model8_pretrain.py][INFO] Epoch:[0/2](881600/4588595) loss:2.652 lr:0.0000100 epoch_Time:23596.0min: [2024-01-06 15:11:57,294][model8_pretrain.py][INFO] Epoch:[0/2](881700/4588595) loss:2.401 lr:0.0000100 epoch_Time:23595.0min: [2024-01-06 15:11:57,295][model8_pretrain.py][INFO] Epoch:[0/2](881700/4588595) loss:3.139 lr:0.0000100 epoch_Time:23595.0min: [2024-01-06 15:11:57,294][model8_pretrain.py][INFO] Epoch:[0/2](881700/4588595) loss:3.025 lr:0.0000100 epoch_Time:23595.0min: [2024-01-06 15:11:57,295][model8_pretrain.py][INFO] Epoch:[0/2](881700/4588595) loss:2.556 lr:0.0000100 epoch_Time:23595.0min: [2024-01-06 15:11:57,295][model8_pretrain.py][INFO] Epoch:[0/2](881700/4588595) loss:3.174 lr:0.0000100 epoch_Time:23595.0min: [2024-01-06 15:11:57,294][model8_pretrain.py][INFO] Epoch:[0/2](881700/4588595) loss:2.386 lr:0.0000100 epoch_Time:23595.0min: [2024-01-06 15:11:57,295][model8_pretrain.py][INFO] Epoch:[0/2](881700/4588595) loss:2.716 lr:0.0000100 epoch_Time:23595.0min: [2024-01-06 15:11:57,295][model8_pretrain.py][INFO] Epoch:[0/2](881700/4588595) loss:3.131 lr:0.0000100 epoch_Time:23595.0min: [2024-01-06 15:12:34,232][model8_pretrain.py][INFO] Epoch:[0/2](881800/4588595) loss:2.501 lr:0.0000100 epoch_Time:23595.0min: [2024-01-06 15:12:34,232][model8_pretrain.py][INFO] Epoch:[0/2](881800/4588595) loss:2.572 lr:0.0000100 epoch_Time:23594.0min: [2024-01-06 15:12:34,232][model8_pretrain.py][INFO] Epoch:[0/2](881800/4588595) loss:2.551 lr:0.0000100 epoch_Time:23594.0min: [2024-01-06 15:12:34,232][model8_pretrain.py][INFO] Epoch:[0/2](881800/4588595) loss:2.576 lr:0.0000100 epoch_Time:23594.0min: [2024-01-06 15:12:34,233][model8_pretrain.py][INFO] Epoch:[0/2](881800/4588595) loss:2.327 lr:0.0000100 epoch_Time:23594.0min: [2024-01-06 15:12:34,233][model8_pretrain.py][INFO] Epoch:[0/2](881800/4588595) loss:2.766 lr:0.0000100 epoch_Time:23594.0min: [2024-01-06 
15:12:34,233][model8_pretrain.py][INFO] Epoch:[0/2](881800/4588595) loss:2.725 lr:0.0000100 epoch_Time:23594.0min: [2024-01-06 15:12:34,233][model8_pretrain.py][INFO] Epoch:[0/2](881800/4588595) loss:3.288 lr:0.0000100 epoch_Time:23595.0min: [2024-01-06 15:13:11,166][model8_pretrain.py][INFO] Epoch:[0/2](881900/4588595) loss:2.695 lr:0.0000100 epoch_Time:23593.0min: [2024-01-06 15:13:11,166][model8_pretrain.py][INFO] Epoch:[0/2](881900/4588595) loss:1.977 lr:0.0000100 epoch_Time:23593.0min: [2024-01-06 15:13:11,167][model8_pretrain.py][INFO] Epoch:[0/2](881900/4588595) loss:3.078 lr:0.0000100 epoch_Time:23593.0min: [2024-01-06 15:13:11,167][model8_pretrain.py][INFO] Epoch:[0/2](881900/4588595) loss:2.639 lr:0.0000100 epoch_Time:23593.0min: [2024-01-06 15:13:11,167][model8_pretrain.py][INFO] Epoch:[0/2](881900/4588595) loss:3.013 lr:0.0000100 epoch_Time:23593.0min: [2024-01-06 15:13:11,167][model8_pretrain.py][INFO] Epoch:[0/2](881900/4588595) loss:3.205 lr:0.0000100 epoch_Time:23593.0min: [2024-01-06 15:13:11,167][model8_pretrain.py][INFO] Epoch:[0/2](881900/4588595) loss:3.096 lr:0.0000100 epoch_Time:23593.0min: [2024-01-06 15:13:11,167][model8_pretrain.py][INFO] Epoch:[0/2](881900/4588595) loss:3.388 lr:0.0000100 epoch_Time:23593.0min: [2024-01-06 15:13:48,105][model8_pretrain.py][INFO] Epoch:[0/2](882000/4588595) loss:2.614 lr:0.0000100 epoch_Time:23592.0min: [2024-01-06 15:13:48,105][model8_pretrain.py][INFO] Epoch:[0/2](882000/4588595) loss:3.027 lr:0.0000100 epoch_Time:23592.0min: [2024-01-06 15:13:48,105][model8_pretrain.py][INFO] Epoch:[0/2](882000/4588595) loss:2.300 lr:0.0000100 epoch_Time:23592.0min: [2024-01-06 15:13:48,105][model8_pretrain.py][INFO] Epoch:[0/2](882000/4588595) loss:3.511 lr:0.0000100 epoch_Time:23592.0min: [2024-01-06 15:13:48,105][model8_pretrain.py][INFO] Epoch:[0/2](882000/4588595) loss:2.530 lr:0.0000100 epoch_Time:23592.0min: [2024-01-06 15:13:48,105][model8_pretrain.py][INFO] Epoch:[0/2](882000/4588595) loss:3.351 lr:0.0000100 
epoch_Time:23592.0min: [2024-01-06 15:13:48,105][model8_pretrain.py][INFO] Epoch:[0/2](882000/4588595) loss:2.658 lr:0.0000100 epoch_Time:23592.0min: [2024-01-06 15:13:48,106][model8_pretrain.py][INFO] Epoch:[0/2](882000/4588595) loss:3.347 lr:0.0000100 epoch_Time:23592.0min: [2024-01-06 15:14:25,052][model8_pretrain.py][INFO] Epoch:[0/2](882100/4588595) loss:2.691 lr:0.0000100 epoch_Time:23592.0min: [2024-01-06 15:14:25,052][model8_pretrain.py][INFO] Epoch:[0/2](882100/4588595) loss:3.228 lr:0.0000100 epoch_Time:23592.0min: [2024-01-06 15:14:25,052][model8_pretrain.py][INFO] Epoch:[0/2](882100/4588595) loss:2.433 lr:0.0000100 epoch_Time:23592.0min: [2024-01-06 15:14:25,052][model8_pretrain.py][INFO] Epoch:[0/2](882100/4588595) loss:2.589 lr:0.0000100 epoch_Time:23592.0min: [2024-01-06 15:14:25,052][model8_pretrain.py][INFO] Epoch:[0/2](882100/4588595) loss:2.384 lr:0.0000100 epoch_Time:23592.0min: [2024-01-06 15:14:25,052][model8_pretrain.py][INFO] Epoch:[0/2](882100/4588595) loss:2.510 lr:0.0000100 epoch_Time:23592.0min: [2024-01-06 15:14:25,052][model8_pretrain.py][INFO] Epoch:[0/2](882100/4588595) loss:2.655 lr:0.0000100 epoch_Time:23592.0min: [2024-01-06 15:14:25,052][model8_pretrain.py][INFO] Epoch:[0/2](882100/4588595) loss:2.865 lr:0.0000100 epoch_Time:23592.0min: [2024-01-06 15:15:01,990][model8_pretrain.py][INFO] Epoch:[0/2](882200/4588595) loss:2.892 lr:0.0000100 epoch_Time:23591.0min: [2024-01-06 15:15:01,990][model8_pretrain.py][INFO] Epoch:[0/2](882200/4588595) loss:2.990 lr:0.0000100 epoch_Time:23591.0min: [2024-01-06 15:15:01,990][model8_pretrain.py][INFO] Epoch:[0/2](882200/4588595) loss:2.467 lr:0.0000100 epoch_Time:23591.0min: [2024-01-06 15:15:01,990][model8_pretrain.py][INFO] Epoch:[0/2](882200/4588595) loss:3.442 lr:0.0000100 epoch_Time:23591.0min: [2024-01-06 15:15:01,990][model8_pretrain.py][INFO] Epoch:[0/2](882200/4588595) loss:2.119 lr:0.0000100 epoch_Time:23591.0min: [2024-01-06 15:15:01,990][model8_pretrain.py][INFO] 
Epoch:[0/2](882200/4588595) loss:2.851 lr:0.0000100 epoch_Time:23591.0min: [2024-01-06 15:15:01,991][model8_pretrain.py][INFO] Epoch:[0/2](882200/4588595) loss:2.829 lr:0.0000100 epoch_Time:23591.0min: [2024-01-06 15:15:01,991][model8_pretrain.py][INFO] Epoch:[0/2](882200/4588595) loss:3.380 lr:0.0000100 epoch_Time:23591.0min: [2024-01-06 15:15:40,622][model8_pretrain.py][INFO] Epoch:[0/2](882300/4588595) loss:2.735 lr:0.0000100 epoch_Time:23591.0min: [2024-01-06 15:15:40,622][model8_pretrain.py][INFO] Epoch:[0/2](882300/4588595) loss:2.957 lr:0.0000100 epoch_Time:23591.0min: [2024-01-06 15:15:40,622][model8_pretrain.py][INFO] Epoch:[0/2](882300/4588595) loss:2.990 lr:0.0000100 epoch_Time:23591.0min: [2024-01-06 15:15:40,622][model8_pretrain.py][INFO] Epoch:[0/2](882300/4588595) loss:2.523 lr:0.0000100 epoch_Time:23591.0min: [2024-01-06 15:15:40,622][model8_pretrain.py][INFO] Epoch:[0/2](882300/4588595) loss:2.791 lr:0.0000100 epoch_Time:23591.0min: [2024-01-06 15:15:40,622][model8_pretrain.py][INFO] Epoch:[0/2](882300/4588595) loss:3.193 lr:0.0000100 epoch_Time:23591.0min: [2024-01-06 15:15:40,622][model8_pretrain.py][INFO] Epoch:[0/2](882300/4588595) loss:3.188 lr:0.0000100 epoch_Time:23591.0min: [2024-01-06 15:15:40,622][model8_pretrain.py][INFO] Epoch:[0/2](882300/4588595) loss:2.818 lr:0.0000100 epoch_Time:23591.0min: [2024-01-06 15:16:26,130][model8_pretrain.py][INFO] Epoch:[0/2](882400/4588595) loss:3.336 lr:0.0000100 epoch_Time:23591.0min: [2024-01-06 15:16:26,130][model8_pretrain.py][INFO] Epoch:[0/2](882400/4588595) loss:3.587 lr:0.0000100 epoch_Time:23591.0min: [2024-01-06 15:16:26,130][model8_pretrain.py][INFO] Epoch:[0/2](882400/4588595) loss:2.793 lr:0.0000100 epoch_Time:23591.0min: [2024-01-06 15:16:26,130][model8_pretrain.py][INFO] Epoch:[0/2](882400/4588595) loss:2.426 lr:0.0000100 epoch_Time:23591.0min: [2024-01-06 15:16:26,131][model8_pretrain.py][INFO] Epoch:[0/2](882400/4588595) loss:2.489 lr:0.0000100 epoch_Time:23591.0min: [2024-01-06 
15:16:26,131][model8_pretrain.py][INFO] Epoch:[0/2](882400/4588595) loss:2.944 lr:0.0000100 epoch_Time:23591.0min: [2024-01-06 15:16:26,131][model8_pretrain.py][INFO] Epoch:[0/2](882400/4588595) loss:2.987 lr:0.0000100 epoch_Time:23591.0min: [2024-01-06 15:16:26,131][model8_pretrain.py][INFO] Epoch:[0/2](882400/4588595) loss:2.490 lr:0.0000100 epoch_Time:23591.0min: [2024-01-06 15:17:03,058][model8_pretrain.py][INFO] Epoch:[0/2](882500/4588595) loss:2.909 lr:0.0000100 epoch_Time:23590.0min: [2024-01-06 15:17:03,058][model8_pretrain.py][INFO] Epoch:[0/2](882500/4588595) loss:2.988 lr:0.0000100 epoch_Time:23590.0min: [2024-01-06 15:17:03,058][model8_pretrain.py][INFO] Epoch:[0/2](882500/4588595) loss:2.373 lr:0.0000100 epoch_Time:23590.0min: [2024-01-06 15:17:03,058][model8_pretrain.py][INFO] Epoch:[0/2](882500/4588595) loss:2.645 lr:0.0000100 epoch_Time:23590.0min: [2024-01-06 15:17:03,058][model8_pretrain.py][INFO] Epoch:[0/2](882500/4588595) loss:2.376 lr:0.0000100 epoch_Time:23590.0min: [2024-01-06 15:17:03,058][model8_pretrain.py][INFO] Epoch:[0/2](882500/4588595) loss:1.611 lr:0.0000100 epoch_Time:23590.0min: [2024-01-06 15:17:03,058][model8_pretrain.py][INFO] Epoch:[0/2](882500/4588595) loss:2.872 lr:0.0000100 epoch_Time:23590.0min: [2024-01-06 15:17:03,058][model8_pretrain.py][INFO] Epoch:[0/2](882500/4588595) loss:3.161 lr:0.0000100 epoch_Time:23590.0min: [2024-01-06 15:17:39,993][model8_pretrain.py][INFO] Epoch:[0/2](882600/4588595) loss:2.311 lr:0.0000100 epoch_Time:23590.0min: [2024-01-06 15:17:39,993][model8_pretrain.py][INFO] Epoch:[0/2](882600/4588595) loss:2.652 lr:0.0000100 epoch_Time:23590.0min: [2024-01-06 15:17:39,993][model8_pretrain.py][INFO] Epoch:[0/2](882600/4588595) loss:3.277 lr:0.0000100 epoch_Time:23590.0min: [2024-01-06 15:17:39,993][model8_pretrain.py][INFO] Epoch:[0/2](882600/4588595) loss:2.709 lr:0.0000100 epoch_Time:23590.0min: [2024-01-06 15:17:39,993][model8_pretrain.py][INFO] Epoch:[0/2](882600/4588595) loss:3.020 lr:0.0000100 
epoch_Time:23590.0min: [2024-01-06 15:17:39,993][model8_pretrain.py][INFO] Epoch:[0/2](882600/4588595) loss:3.125 lr:0.0000100 epoch_Time:23590.0min: [2024-01-06 15:17:39,993][model8_pretrain.py][INFO] Epoch:[0/2](882600/4588595) loss:3.108 lr:0.0000100 epoch_Time:23590.0min: [2024-01-06 15:17:39,993][model8_pretrain.py][INFO] Epoch:[0/2](882600/4588595) loss:2.828 lr:0.0000100 epoch_Time:23590.0min: [2024-01-06 15:18:16,930][model8_pretrain.py][INFO] Epoch:[0/2](882700/4588595) loss:2.083 lr:0.0000100 epoch_Time:23588.0min: [2024-01-06 15:18:16,930][model8_pretrain.py][INFO] Epoch:[0/2](882700/4588595) loss:2.558 lr:0.0000100 epoch_Time:23588.0min: [2024-01-06 15:18:16,930][model8_pretrain.py][INFO] Epoch:[0/2](882700/4588595) loss:2.956 lr:0.0000100 epoch_Time:23588.0min: [2024-01-06 15:18:16,930][model8_pretrain.py][INFO] Epoch:[0/2](882700/4588595) loss:2.938 lr:0.0000100 epoch_Time:23588.0min: [2024-01-06 15:18:16,930][model8_pretrain.py][INFO] Epoch:[0/2](882700/4588595) loss:2.627 lr:0.0000100 epoch_Time:23588.0min: [2024-01-06 15:18:16,930][model8_pretrain.py][INFO] Epoch:[0/2](882700/4588595) loss:2.224 lr:0.0000100 epoch_Time:23588.0min: [2024-01-06 15:18:16,930][model8_pretrain.py][INFO] Epoch:[0/2](882700/4588595) loss:2.685 lr:0.0000100 epoch_Time:23588.0min: [2024-01-06 15:18:16,930][model8_pretrain.py][INFO] Epoch:[0/2](882700/4588595) loss:2.615 lr:0.0000100 epoch_Time:23588.0min: [2024-01-06 15:18:53,865][model8_pretrain.py][INFO] Epoch:[0/2](882800/4588595) loss:3.043 lr:0.0000100 epoch_Time:23587.0min: [2024-01-06 15:18:53,865][model8_pretrain.py][INFO] Epoch:[0/2](882800/4588595) loss:2.793 lr:0.0000100 epoch_Time:23587.0min: [2024-01-06 15:18:53,865][model8_pretrain.py][INFO] Epoch:[0/2](882800/4588595) loss:3.229 lr:0.0000100 epoch_Time:23587.0min: [2024-01-06 15:18:53,865][model8_pretrain.py][INFO] Epoch:[0/2](882800/4588595) loss:2.793 lr:0.0000100 epoch_Time:23587.0min: [2024-01-06 15:18:53,865][model8_pretrain.py][INFO] 
Epoch:[0/2](882800/4588595) loss:2.421 lr:0.0000100 epoch_Time:23587.0min: [2024-01-06 15:18:53,865][model8_pretrain.py][INFO] Epoch:[0/2](882800/4588595) loss:2.706 lr:0.0000100 epoch_Time:23587.0min: [2024-01-06 15:18:53,865][model8_pretrain.py][INFO] Epoch:[0/2](882800/4588595) loss:3.089 lr:0.0000100 epoch_Time:23587.0min: [2024-01-06 15:18:53,865][model8_pretrain.py][INFO] Epoch:[0/2](882800/4588595) loss:3.120 lr:0.0000100 epoch_Time:23587.0min: [2024-01-06 15:19:30,803][model8_pretrain.py][INFO] Epoch:[0/2](882900/4588595) loss:2.259 lr:0.0000100 epoch_Time:23587.0min: [2024-01-06 15:19:30,803][model8_pretrain.py][INFO] Epoch:[0/2](882900/4588595) loss:2.548 lr:0.0000100 epoch_Time:23587.0min: [2024-01-06 15:19:30,803][model8_pretrain.py][INFO] Epoch:[0/2](882900/4588595) loss:2.871 lr:0.0000100 epoch_Time:23587.0min: [2024-01-06 15:19:30,803][model8_pretrain.py][INFO] Epoch:[0/2](882900/4588595) loss:2.231 lr:0.0000100 epoch_Time:23587.0min: [2024-01-06 15:19:30,803][model8_pretrain.py][INFO] Epoch:[0/2](882900/4588595) loss:2.759 lr:0.0000100 epoch_Time:23587.0min: [2024-01-06 15:19:30,803][model8_pretrain.py][INFO] Epoch:[0/2](882900/4588595) loss:3.088 lr:0.0000100 epoch_Time:23587.0min: [2024-01-06 15:19:30,803][model8_pretrain.py][INFO] Epoch:[0/2](882900/4588595) loss:2.792 lr:0.0000100 epoch_Time:23587.0min: [2024-01-06 15:19:30,803][model8_pretrain.py][INFO] Epoch:[0/2](882900/4588595) loss:2.177 lr:0.0000100 epoch_Time:23587.0min: [2024-01-06 15:20:07,735][model8_pretrain.py][INFO] Epoch:[0/2](883000/4588595) loss:3.003 lr:0.0000100 epoch_Time:23586.0min: [2024-01-06 15:20:07,735][model8_pretrain.py][INFO] Epoch:[0/2](883000/4588595) loss:2.982 lr:0.0000100 epoch_Time:23586.0min: [2024-01-06 15:20:07,735][model8_pretrain.py][INFO] Epoch:[0/2](883000/4588595) loss:3.099 lr:0.0000100 epoch_Time:23586.0min: [2024-01-06 15:20:07,735][model8_pretrain.py][INFO] Epoch:[0/2](883000/4588595) loss:3.093 lr:0.0000100 epoch_Time:23586.0min: [2024-01-06 
15:20:07,736][model8_pretrain.py][INFO] Epoch:[0/2](883000/4588595) loss:2.960 lr:0.0000100 epoch_Time:23586.0min: [2024-01-06 15:20:07,736][model8_pretrain.py][INFO] Epoch:[0/2](883000/4588595) loss:2.559 lr:0.0000100 epoch_Time:23586.0min: [2024-01-06 15:20:07,736][model8_pretrain.py][INFO] Epoch:[0/2](883000/4588595) loss:3.032 lr:0.0000100 epoch_Time:23586.0min: [2024-01-06 15:20:07,736][model8_pretrain.py][INFO] Epoch:[0/2](883000/4588595) loss:2.833 lr:0.0000100 epoch_Time:23586.0min: [2024-01-06 15:20:46,365][model8_pretrain.py][INFO] Epoch:[0/2](883100/4588595) loss:3.062 lr:0.0000100 epoch_Time:23586.0min: [2024-01-06 15:20:46,365][model8_pretrain.py][INFO] Epoch:[0/2](883100/4588595) loss:2.595 lr:0.0000100 epoch_Time:23586.0min: [2024-01-06 15:20:46,365][model8_pretrain.py][INFO] Epoch:[0/2](883100/4588595) loss:3.054 lr:0.0000100 epoch_Time:23586.0min: [2024-01-06 15:20:46,365][model8_pretrain.py][INFO] Epoch:[0/2](883100/4588595) loss:2.991 lr:0.0000100 epoch_Time:23586.0min: [2024-01-06 15:20:46,365][model8_pretrain.py][INFO] Epoch:[0/2](883100/4588595) loss:2.334 lr:0.0000100 epoch_Time:23586.0min: [2024-01-06 15:20:46,365][model8_pretrain.py][INFO] Epoch:[0/2](883100/4588595) loss:2.636 lr:0.0000100 epoch_Time:23586.0min: [2024-01-06 15:20:46,365][model8_pretrain.py][INFO] Epoch:[0/2](883100/4588595) loss:1.832 lr:0.0000100 epoch_Time:23586.0min: [2024-01-06 15:20:46,365][model8_pretrain.py][INFO] Epoch:[0/2](883100/4588595) loss:3.204 lr:0.0000100 epoch_Time:23586.0min: [2024-01-06 15:21:31,971][model8_pretrain.py][INFO] Epoch:[0/2](883200/4588595) loss:2.924 lr:0.0000100 epoch_Time:23586.0min: [2024-01-06 15:21:31,971][model8_pretrain.py][INFO] Epoch:[0/2](883200/4588595) loss:3.061 lr:0.0000100 epoch_Time:23586.0min: [2024-01-06 15:21:31,971][model8_pretrain.py][INFO] Epoch:[0/2](883200/4588595) loss:2.597 lr:0.0000100 epoch_Time:23586.0min: [2024-01-06 15:21:31,971][model8_pretrain.py][INFO] Epoch:[0/2](883200/4588595) loss:2.423 lr:0.0000100 
epoch_Time:23586.0min: [2024-01-06 15:21:31,971][model8_pretrain.py][INFO] Epoch:[0/2](883200/4588595) loss:2.350 lr:0.0000100 epoch_Time:23586.0min: [2024-01-06 15:21:31,971][model8_pretrain.py][INFO] Epoch:[0/2](883200/4588595) loss:2.645 lr:0.0000100 epoch_Time:23586.0min: [2024-01-06 15:21:31,972][model8_pretrain.py][INFO] Epoch:[0/2](883200/4588595) loss:3.352 lr:0.0000100 epoch_Time:23586.0min: [2024-01-06 15:21:31,972][model8_pretrain.py][INFO] Epoch:[0/2](883200/4588595) loss:2.105 lr:0.0000100 epoch_Time:23586.0min: [2024-01-06 15:22:08,899][model8_pretrain.py][INFO] Epoch:[0/2](883300/4588595) loss:2.762 lr:0.0000100 epoch_Time:23585.0min: [2024-01-06 15:22:08,899][model8_pretrain.py][INFO] Epoch:[0/2](883300/4588595) loss:3.010 lr:0.0000100 epoch_Time:23585.0min: [2024-01-06 15:22:08,899][model8_pretrain.py][INFO] Epoch:[0/2](883300/4588595) loss:2.640 lr:0.0000100 epoch_Time:23585.0min: [2024-01-06 15:22:08,899][model8_pretrain.py][INFO] Epoch:[0/2](883300/4588595) loss:3.039 lr:0.0000100 epoch_Time:23585.0min: [2024-01-06 15:22:08,899][model8_pretrain.py][INFO] Epoch:[0/2](883300/4588595) loss:3.183 lr:0.0000100 epoch_Time:23585.0min: [2024-01-06 15:22:08,899][model8_pretrain.py][INFO] Epoch:[0/2](883300/4588595) loss:3.130 lr:0.0000100 epoch_Time:23585.0min: [2024-01-06 15:22:08,900][model8_pretrain.py][INFO] Epoch:[0/2](883300/4588595) loss:2.719 lr:0.0000100 epoch_Time:23585.0min: [2024-01-06 15:22:08,900][model8_pretrain.py][INFO] Epoch:[0/2](883300/4588595) loss:2.782 lr:0.0000100 epoch_Time:23585.0min: [2024-01-06 15:22:45,829][model8_pretrain.py][INFO] Epoch:[0/2](883400/4588595) loss:2.567 lr:0.0000100 epoch_Time:23585.0min: [2024-01-06 15:22:45,829][model8_pretrain.py][INFO] Epoch:[0/2](883400/4588595) loss:2.696 lr:0.0000100 epoch_Time:23585.0min: [2024-01-06 15:22:45,829][model8_pretrain.py][INFO] Epoch:[0/2](883400/4588595) loss:2.564 lr:0.0000100 epoch_Time:23585.0min: [2024-01-06 15:22:45,829][model8_pretrain.py][INFO] 
Epoch:[0/2](883400/4588595) loss:3.223 lr:0.0000100 epoch_Time:23585.0min: [2024-01-06 15:22:45,829][model8_pretrain.py][INFO] Epoch:[0/2](883400/4588595) loss:3.289 lr:0.0000100 epoch_Time:23585.0min: [2024-01-06 15:22:45,829][model8_pretrain.py][INFO] Epoch:[0/2](883400/4588595) loss:2.897 lr:0.0000100 epoch_Time:23585.0min: [2024-01-06 15:22:45,829][model8_pretrain.py][INFO] Epoch:[0/2](883400/4588595) loss:2.898 lr:0.0000100 epoch_Time:23585.0min: [2024-01-06 15:22:45,829][model8_pretrain.py][INFO] Epoch:[0/2](883400/4588595) loss:2.819 lr:0.0000100 epoch_Time:23585.0min: [2024-01-06 15:23:22,777][model8_pretrain.py][INFO] Epoch:[0/2](883500/4588595) loss:3.062 lr:0.0000100 epoch_Time:23583.0min: [2024-01-06 15:23:22,777][model8_pretrain.py][INFO] Epoch:[0/2](883500/4588595) loss:3.168 lr:0.0000100 epoch_Time:23583.0min: [2024-01-06 15:23:22,777][model8_pretrain.py][INFO] Epoch:[0/2](883500/4588595) loss:3.103 lr:0.0000100 epoch_Time:23583.0min: [2024-01-06 15:23:22,777][model8_pretrain.py][INFO] Epoch:[0/2](883500/4588595) loss:3.044 lr:0.0000100 epoch_Time:23583.0min: [2024-01-06 15:23:22,777][model8_pretrain.py][INFO] Epoch:[0/2](883500/4588595) loss:2.728 lr:0.0000100 epoch_Time:23583.0min: [2024-01-06 15:23:22,777][model8_pretrain.py][INFO] Epoch:[0/2](883500/4588595) loss:3.023 lr:0.0000100 epoch_Time:23583.0min: [2024-01-06 15:23:22,777][model8_pretrain.py][INFO] Epoch:[0/2](883500/4588595) loss:3.076 lr:0.0000100 epoch_Time:23583.0min: [2024-01-06 15:23:22,777][model8_pretrain.py][INFO] Epoch:[0/2](883500/4588595) loss:2.928 lr:0.0000100 epoch_Time:23583.0min: [2024-01-06 15:23:59,725][model8_pretrain.py][INFO] Epoch:[0/2](883600/4588595) loss:2.701 lr:0.0000100 epoch_Time:23582.0min: [2024-01-06 15:23:59,725][model8_pretrain.py][INFO] Epoch:[0/2](883600/4588595) loss:2.786 lr:0.0000100 epoch_Time:23582.0min: [2024-01-06 15:23:59,725][model8_pretrain.py][INFO] Epoch:[0/2](883600/4588595) loss:2.520 lr:0.0000100 epoch_Time:23582.0min: [2024-01-06 
15:23:59,725][model8_pretrain.py][INFO] Epoch:[0/2](883600/4588595) loss:2.309 lr:0.0000100 epoch_Time:23582.0min: [2024-01-06 15:23:59,725][model8_pretrain.py][INFO] Epoch:[0/2](883600/4588595) loss:2.649 lr:0.0000100 epoch_Time:23582.0min: [2024-01-06 15:23:59,725][model8_pretrain.py][INFO] Epoch:[0/2](883600/4588595) loss:2.746 lr:0.0000100 epoch_Time:23582.0min: [2024-01-06 15:23:59,725][model8_pretrain.py][INFO] Epoch:[0/2](883600/4588595) loss:2.544 lr:0.0000100 epoch_Time:23582.0min: [2024-01-06 15:23:59,725][model8_pretrain.py][INFO] Epoch:[0/2](883600/4588595) loss:2.855 lr:0.0000100 epoch_Time:23582.0min: [2024-01-06 15:24:36,668][model8_pretrain.py][INFO] Epoch:[0/2](883700/4588595) loss:2.928 lr:0.0000100 epoch_Time:23582.0min: [2024-01-06 15:24:36,668][model8_pretrain.py][INFO] Epoch:[0/2](883700/4588595) loss:2.546 lr:0.0000100 epoch_Time:23582.0min: [2024-01-06 15:24:36,668][model8_pretrain.py][INFO] Epoch:[0/2](883700/4588595) loss:2.796 lr:0.0000100 epoch_Time:23582.0min: [2024-01-06 15:24:36,668][model8_pretrain.py][INFO] Epoch:[0/2](883700/4588595) loss:2.291 lr:0.0000100 epoch_Time:23582.0min: [2024-01-06 15:24:36,668][model8_pretrain.py][INFO] Epoch:[0/2](883700/4588595) loss:3.000 lr:0.0000100 epoch_Time:23582.0min: [2024-01-06 15:24:36,668][model8_pretrain.py][INFO] Epoch:[0/2](883700/4588595) loss:2.620 lr:0.0000100 epoch_Time:23582.0min: [2024-01-06 15:24:36,668][model8_pretrain.py][INFO] Epoch:[0/2](883700/4588595) loss:2.787 lr:0.0000100 epoch_Time:23582.0min: [2024-01-06 15:24:36,669][model8_pretrain.py][INFO] Epoch:[0/2](883700/4588595) loss:2.513 lr:0.0000100 epoch_Time:23582.0min: [2024-01-06 15:25:13,620][model8_pretrain.py][INFO] Epoch:[0/2](883800/4588595) loss:2.850 lr:0.0000100 epoch_Time:23581.0min: [2024-01-06 15:25:13,620][model8_pretrain.py][INFO] Epoch:[0/2](883800/4588595) loss:3.012 lr:0.0000100 epoch_Time:23581.0min: [2024-01-06 15:25:13,620][model8_pretrain.py][INFO] Epoch:[0/2](883800/4588595) loss:2.528 lr:0.0000100 
epoch_Time:23581.0min: [2024-01-06 15:25:13,620][model8_pretrain.py][INFO] Epoch:[0/2](883800/4588595) loss:2.559 lr:0.0000100 epoch_Time:23581.0min: [2024-01-06 15:25:13,620][model8_pretrain.py][INFO] Epoch:[0/2](883800/4588595) loss:2.962 lr:0.0000100 epoch_Time:23581.0min: [2024-01-06 15:25:13,620][model8_pretrain.py][INFO] Epoch:[0/2](883800/4588595) loss:2.725 lr:0.0000100 epoch_Time:23581.0min: [2024-01-06 15:25:13,620][model8_pretrain.py][INFO] Epoch:[0/2](883800/4588595) loss:2.568 lr:0.0000100 epoch_Time:23581.0min: [2024-01-06 15:25:13,620][model8_pretrain.py][INFO] Epoch:[0/2](883800/4588595) loss:2.879 lr:0.0000100 epoch_Time:23581.0min: [2024-01-06 15:25:52,256][model8_pretrain.py][INFO] Epoch:[0/2](883900/4588595) loss:2.371 lr:0.0000100 epoch_Time:23580.0min: [2024-01-06 15:25:52,256][model8_pretrain.py][INFO] Epoch:[0/2](883900/4588595) loss:2.915 lr:0.0000100 epoch_Time:23580.0min: [2024-01-06 15:25:52,256][model8_pretrain.py][INFO] Epoch:[0/2](883900/4588595) loss:3.100 lr:0.0000100 epoch_Time:23580.0min: [2024-01-06 15:25:52,256][model8_pretrain.py][INFO] Epoch:[0/2](883900/4588595) loss:2.953 lr:0.0000100 epoch_Time:23580.0min: [2024-01-06 15:25:52,256][model8_pretrain.py][INFO] Epoch:[0/2](883900/4588595) loss:2.973 lr:0.0000100 epoch_Time:23580.0min: [2024-01-06 15:25:52,256][model8_pretrain.py][INFO] Epoch:[0/2](883900/4588595) loss:3.103 lr:0.0000100 epoch_Time:23580.0min: [2024-01-06 15:25:52,256][model8_pretrain.py][INFO] Epoch:[0/2](883900/4588595) loss:2.646 lr:0.0000100 epoch_Time:23580.0min: [2024-01-06 15:25:52,256][model8_pretrain.py][INFO] Epoch:[0/2](883900/4588595) loss:1.988 lr:0.0000100 epoch_Time:23580.0min: [2024-01-06 15:26:37,869][model8_pretrain.py][INFO] Epoch:[0/2](884000/4588595) loss:2.778 lr:0.0000100 epoch_Time:23581.0min: [2024-01-06 15:26:37,870][model8_pretrain.py][INFO] Epoch:[0/2](884000/4588595) loss:2.887 lr:0.0000100 epoch_Time:23581.0min: [2024-01-06 15:26:37,870][model8_pretrain.py][INFO] 
Epoch:[0/2](884000/4588595) loss:2.943 lr:0.0000100 epoch_Time:23581.0min: [2024-01-06 15:26:37,870][model8_pretrain.py][INFO] Epoch:[0/2](884000/4588595) loss:3.095 lr:0.0000100 epoch_Time:23581.0min: [2024-01-06 15:26:37,870][model8_pretrain.py][INFO] Epoch:[0/2](884000/4588595) loss:2.459 lr:0.0000100 epoch_Time:23581.0min: [2024-01-06 15:26:37,870][model8_pretrain.py][INFO] Epoch:[0/2](884000/4588595) loss:2.119 lr:0.0000100 epoch_Time:23581.0min: [2024-01-06 15:26:37,870][model8_pretrain.py][INFO] Epoch:[0/2](884000/4588595) loss:2.245 lr:0.0000100 epoch_Time:23581.0min: [2024-01-06 15:26:37,870][model8_pretrain.py][INFO] Epoch:[0/2](884000/4588595) loss:2.443 lr:0.0000100 epoch_Time:23581.0min: [2024-01-06 15:27:14,800][model8_pretrain.py][INFO] Epoch:[0/2](884100/4588595) loss:2.790 lr:0.0000100 epoch_Time:23580.0min: [2024-01-06 15:27:14,800][model8_pretrain.py][INFO] Epoch:[0/2](884100/4588595) loss:2.889 lr:0.0000100 epoch_Time:23580.0min: [2024-01-06 15:27:14,800][model8_pretrain.py][INFO] Epoch:[0/2](884100/4588595) loss:2.408 lr:0.0000100 epoch_Time:23580.0min: [2024-01-06 15:27:14,800][model8_pretrain.py][INFO] Epoch:[0/2](884100/4588595) loss:3.233 lr:0.0000100 epoch_Time:23580.0min: [2024-01-06 15:27:14,800][model8_pretrain.py][INFO] Epoch:[0/2](884100/4588595) loss:3.180 lr:0.0000100 epoch_Time:23580.0min: [2024-01-06 15:27:14,800][model8_pretrain.py][INFO] Epoch:[0/2](884100/4588595) loss:2.999 lr:0.0000100 epoch_Time:23580.0min: [2024-01-06 15:27:14,800][model8_pretrain.py][INFO] Epoch:[0/2](884100/4588595) loss:2.813 lr:0.0000100 epoch_Time:23580.0min: [2024-01-06 15:27:14,800][model8_pretrain.py][INFO] Epoch:[0/2](884100/4588595) loss:3.011 lr:0.0000100 epoch_Time:23580.0min: [2024-01-06 15:27:51,761][model8_pretrain.py][INFO] Epoch:[0/2](884200/4588595) loss:3.218 lr:0.0000100 epoch_Time:23579.0min: [2024-01-06 15:27:51,761][model8_pretrain.py][INFO] Epoch:[0/2](884200/4588595) loss:2.880 lr:0.0000100 epoch_Time:23579.0min: [2024-01-06 
15:27:51,761][model8_pretrain.py][INFO] Epoch:[0/2](884200/4588595) loss:2.887 lr:0.0000100 epoch_Time:23579.0min: [2024-01-06 15:27:51,761][model8_pretrain.py][INFO] Epoch:[0/2](884200/4588595) loss:2.302 lr:0.0000100 epoch_Time:23579.0min: [2024-01-06 15:27:51,761][model8_pretrain.py][INFO] Epoch:[0/2](884200/4588595) loss:2.664 lr:0.0000100 epoch_Time:23579.0min: [2024-01-06 15:27:51,761][model8_pretrain.py][INFO] Epoch:[0/2](884200/4588595) loss:3.032 lr:0.0000100 epoch_Time:23579.0min: [2024-01-06 15:27:51,761][model8_pretrain.py][INFO] Epoch:[0/2](884200/4588595) loss:2.214 lr:0.0000100 epoch_Time:23579.0min: [2024-01-06 15:27:51,761][model8_pretrain.py][INFO] Epoch:[0/2](884200/4588595) loss:2.649 lr:0.0000100 epoch_Time:23579.0min: [2024-01-06 15:28:28,699][model8_pretrain.py][INFO] Epoch:[0/2](884300/4588595) loss:2.539 lr:0.0000100 epoch_Time:23578.0min: [2024-01-06 15:28:28,699][model8_pretrain.py][INFO] Epoch:[0/2](884300/4588595) loss:3.041 lr:0.0000100 epoch_Time:23578.0min: [2024-01-06 15:28:28,699][model8_pretrain.py][INFO] Epoch:[0/2](884300/4588595) loss:2.881 lr:0.0000100 epoch_Time:23578.0min: [2024-01-06 15:28:28,699][model8_pretrain.py][INFO] Epoch:[0/2](884300/4588595) loss:3.118 lr:0.0000100 epoch_Time:23578.0min: [2024-01-06 15:28:28,699][model8_pretrain.py][INFO] Epoch:[0/2](884300/4588595) loss:2.764 lr:0.0000100 epoch_Time:23578.0min: [2024-01-06 15:28:28,699][model8_pretrain.py][INFO] Epoch:[0/2](884300/4588595) loss:2.186 lr:0.0000100 epoch_Time:23578.0min: [2024-01-06 15:28:28,700][model8_pretrain.py][INFO] Epoch:[0/2](884300/4588595) loss:3.199 lr:0.0000100 epoch_Time:23578.0min: [2024-01-06 15:28:28,700][model8_pretrain.py][INFO] Epoch:[0/2](884300/4588595) loss:2.260 lr:0.0000100 epoch_Time:23578.0min: [2024-01-06 15:29:05,635][model8_pretrain.py][INFO] Epoch:[0/2](884400/4588595) loss:2.521 lr:0.0000100 epoch_Time:23577.0min: [2024-01-06 15:29:05,635][model8_pretrain.py][INFO] Epoch:[0/2](884400/4588595) loss:2.324 lr:0.0000100 
epoch_Time:23577.0min: [2024-01-06 15:29:05,635][model8_pretrain.py][INFO] Epoch:[0/2](884400/4588595) loss:2.584 lr:0.0000100 epoch_Time:23577.0min: [2024-01-06 15:29:05,635][model8_pretrain.py][INFO] Epoch:[0/2](884400/4588595) loss:2.812 lr:0.0000100 epoch_Time:23577.0min: [2024-01-06 15:29:05,635][model8_pretrain.py][INFO] Epoch:[0/2](884400/4588595) loss:2.884 lr:0.0000100 epoch_Time:23577.0min: [2024-01-06 15:29:05,635][model8_pretrain.py][INFO] Epoch:[0/2](884400/4588595) loss:2.730 lr:0.0000100 epoch_Time:23577.0min: [2024-01-06 15:29:05,635][model8_pretrain.py][INFO] Epoch:[0/2](884400/4588595) loss:2.519 lr:0.0000100 epoch_Time:23577.0min: [2024-01-06 15:29:05,636][model8_pretrain.py][INFO] Epoch:[0/2](884400/4588595) loss:2.824 lr:0.0000100 epoch_Time:23577.0min: [2024-01-06 15:29:42,539][model8_pretrain.py][INFO] Epoch:[0/2](884500/4588595) loss:2.837 lr:0.0000100 epoch_Time:23577.0min: [2024-01-06 15:29:42,539][model8_pretrain.py][INFO] Epoch:[0/2](884500/4588595) loss:2.645 lr:0.0000100 epoch_Time:23577.0min: [2024-01-06 15:29:42,539][model8_pretrain.py][INFO] Epoch:[0/2](884500/4588595) loss:2.586 lr:0.0000100 epoch_Time:23577.0min: [2024-01-06 15:29:42,539][model8_pretrain.py][INFO] Epoch:[0/2](884500/4588595) loss:2.920 lr:0.0000100 epoch_Time:23577.0min: [2024-01-06 15:29:42,540][model8_pretrain.py][INFO] Epoch:[0/2](884500/4588595) loss:2.796 lr:0.0000100 epoch_Time:23577.0min: [2024-01-06 15:29:42,540][model8_pretrain.py][INFO] Epoch:[0/2](884500/4588595) loss:2.587 lr:0.0000100 epoch_Time:23577.0min: [2024-01-06 15:29:42,540][model8_pretrain.py][INFO] Epoch:[0/2](884500/4588595) loss:2.619 lr:0.0000100 epoch_Time:23577.0min: [2024-01-06 15:29:42,540][model8_pretrain.py][INFO] Epoch:[0/2](884500/4588595) loss:2.916 lr:0.0000100 epoch_Time:23577.0min: [2024-01-06 15:30:19,465][model8_pretrain.py][INFO] Epoch:[0/2](884600/4588595) loss:2.299 lr:0.0000100 epoch_Time:23576.0min: [2024-01-06 15:30:19,465][model8_pretrain.py][INFO] 
Epoch:[0/2](884600/4588595) loss:2.954 lr:0.0000100 epoch_Time:23576.0min: [2024-01-06 15:30:19,465][model8_pretrain.py][INFO] Epoch:[0/2](884600/4588595) loss:2.587 lr:0.0000100 epoch_Time:23576.0min: [2024-01-06 15:30:19,465][model8_pretrain.py][INFO] Epoch:[0/2](884600/4588595) loss:3.146 lr:0.0000100 epoch_Time:23576.0min: [2024-01-06 15:30:19,465][model8_pretrain.py][INFO] Epoch:[0/2](884600/4588595) loss:2.213 lr:0.0000100 epoch_Time:23576.0min: [2024-01-06 15:30:19,465][model8_pretrain.py][INFO] Epoch:[0/2](884600/4588595) loss:2.978 lr:0.0000100 epoch_Time:23576.0min: [2024-01-06 15:30:19,465][model8_pretrain.py][INFO] Epoch:[0/2](884600/4588595) loss:2.844 lr:0.0000100 epoch_Time:23576.0min: [2024-01-06 15:30:19,466][model8_pretrain.py][INFO] Epoch:[0/2](884600/4588595) loss:2.744 lr:0.0000100 epoch_Time:23576.0min: [2024-01-06 15:30:56,390][model8_pretrain.py][INFO] Epoch:[0/2](884700/4588595) loss:2.747 lr:0.0000100 epoch_Time:23575.0min: [2024-01-06 15:30:56,390][model8_pretrain.py][INFO] Epoch:[0/2](884700/4588595) loss:2.347 lr:0.0000100 epoch_Time:23575.0min: [2024-01-06 15:30:56,390][model8_pretrain.py][INFO] Epoch:[0/2](884700/4588595) loss:2.454 lr:0.0000100 epoch_Time:23575.0min: [2024-01-06 15:30:56,390][model8_pretrain.py][INFO] Epoch:[0/2](884700/4588595) loss:3.084 lr:0.0000100 epoch_Time:23575.0min: [2024-01-06 15:30:56,390][model8_pretrain.py][INFO] Epoch:[0/2](884700/4588595) loss:3.116 lr:0.0000100 epoch_Time:23575.0min: [2024-01-06 15:30:56,391][model8_pretrain.py][INFO] Epoch:[0/2](884700/4588595) loss:2.627 lr:0.0000100 epoch_Time:23575.0min: [2024-01-06 15:30:56,392][model8_pretrain.py][INFO] Epoch:[0/2](884700/4588595) loss:2.951 lr:0.0000100 epoch_Time:23575.0min: [2024-01-06 15:30:58,120][model8_pretrain.py][INFO] Epoch:[0/2](884700/4588595) loss:2.612 lr:0.0000100 epoch_Time:23575.0min: [2024-01-06 15:31:43,831][model8_pretrain.py][INFO] Epoch:[0/2](884800/4588595) loss:2.816 lr:0.0000100 epoch_Time:23576.0min: [2024-01-06 
15:31:43,831][model8_pretrain.py][INFO] Epoch:[0/2](884800/4588595) loss:2.689 lr:0.0000100 epoch_Time:23576.0min: [2024-01-06 15:31:43,831][model8_pretrain.py][INFO] Epoch:[0/2](884800/4588595) loss:3.208 lr:0.0000100 epoch_Time:23576.0min: [2024-01-06 15:31:43,831][model8_pretrain.py][INFO] Epoch:[0/2](884800/4588595) loss:2.874 lr:0.0000100 epoch_Time:23576.0min: [2024-01-06 15:31:43,831][model8_pretrain.py][INFO] Epoch:[0/2](884800/4588595) loss:3.352 lr:0.0000100 epoch_Time:23576.0min: [2024-01-06 15:31:43,831][model8_pretrain.py][INFO] Epoch:[0/2](884800/4588595) loss:3.152 lr:0.0000100 epoch_Time:23576.0min: [2024-01-06 15:31:43,831][model8_pretrain.py][INFO] Epoch:[0/2](884800/4588595) loss:2.868 lr:0.0000100 epoch_Time:23576.0min: [2024-01-06 15:31:43,831][model8_pretrain.py][INFO] Epoch:[0/2](884800/4588595) loss:2.472 lr:0.0000100 epoch_Time:23576.0min: [2024-01-06 15:32:20,754][model8_pretrain.py][INFO] Epoch:[0/2](884900/4588595) loss:2.658 lr:0.0000100 epoch_Time:23575.0min: [2024-01-06 15:32:20,754][model8_pretrain.py][INFO] Epoch:[0/2](884900/4588595) loss:2.214 lr:0.0000100 epoch_Time:23575.0min: [2024-01-06 15:32:20,754][model8_pretrain.py][INFO] Epoch:[0/2](884900/4588595) loss:2.602 lr:0.0000100 epoch_Time:23575.0min: [2024-01-06 15:32:20,754][model8_pretrain.py][INFO] Epoch:[0/2](884900/4588595) loss:3.162 lr:0.0000100 epoch_Time:23575.0min: [2024-01-06 15:32:20,754][model8_pretrain.py][INFO] Epoch:[0/2](884900/4588595) loss:2.780 lr:0.0000100 epoch_Time:23575.0min: [2024-01-06 15:32:20,754][model8_pretrain.py][INFO] Epoch:[0/2](884900/4588595) loss:2.738 lr:0.0000100 epoch_Time:23575.0min: [2024-01-06 15:32:20,754][model8_pretrain.py][INFO] Epoch:[0/2](884900/4588595) loss:3.007 lr:0.0000100 epoch_Time:23575.0min: [2024-01-06 15:32:20,754][model8_pretrain.py][INFO] Epoch:[0/2](884900/4588595) loss:2.825 lr:0.0000100 epoch_Time:23575.0min: [2024-01-06 15:32:57,700][model8_pretrain.py][INFO] Epoch:[0/2](885000/4588595) loss:2.722 lr:0.0000100 
epoch_Time:23574.0min: [2024-01-06 15:32:57,700][model8_pretrain.py][INFO] Epoch:[0/2](885000/4588595) loss:2.551 lr:0.0000100 epoch_Time:23574.0min: [2024-01-06 15:32:57,700][model8_pretrain.py][INFO] Epoch:[0/2](885000/4588595) loss:3.006 lr:0.0000100 epoch_Time:23574.0min: [2024-01-06 15:32:57,700][model8_pretrain.py][INFO] Epoch:[0/2](885000/4588595) loss:3.172 lr:0.0000100 epoch_Time:23574.0min: [2024-01-06 15:32:57,700][model8_pretrain.py][INFO] Epoch:[0/2](885000/4588595) loss:2.957 lr:0.0000100 epoch_Time:23574.0min: [2024-01-06 15:32:57,700][model8_pretrain.py][INFO] Epoch:[0/2](885000/4588595) loss:2.594 lr:0.0000100 epoch_Time:23574.0min: [2024-01-06 15:32:57,700][model8_pretrain.py][INFO] Epoch:[0/2](885000/4588595) loss:3.080 lr:0.0000100 epoch_Time:23574.0min: [2024-01-06 15:32:57,700][model8_pretrain.py][INFO] Epoch:[0/2](885000/4588595) loss:2.860 lr:0.0000100 epoch_Time:23574.0min: [2024-01-06 15:33:34,641][model8_pretrain.py][INFO] Epoch:[0/2](885100/4588595) loss:2.471 lr:0.0000100 epoch_Time:23574.0min: [2024-01-06 15:33:34,641][model8_pretrain.py][INFO] Epoch:[0/2](885100/4588595) loss:2.427 lr:0.0000100 epoch_Time:23574.0min: [2024-01-06 15:33:34,641][model8_pretrain.py][INFO] Epoch:[0/2](885100/4588595) loss:2.701 lr:0.0000100 epoch_Time:23574.0min: [2024-01-06 15:33:34,641][model8_pretrain.py][INFO] Epoch:[0/2](885100/4588595) loss:2.514 lr:0.0000100 epoch_Time:23574.0min: [2024-01-06 15:33:34,641][model8_pretrain.py][INFO] Epoch:[0/2](885100/4588595) loss:2.774 lr:0.0000100 epoch_Time:23574.0min: [2024-01-06 15:33:34,641][model8_pretrain.py][INFO] Epoch:[0/2](885100/4588595) loss:1.957 lr:0.0000100 epoch_Time:23574.0min: [2024-01-06 15:33:34,641][model8_pretrain.py][INFO] Epoch:[0/2](885100/4588595) loss:3.260 lr:0.0000100 epoch_Time:23574.0min: [2024-01-06 15:33:34,642][model8_pretrain.py][INFO] Epoch:[0/2](885100/4588595) loss:2.933 lr:0.0000100 epoch_Time:23574.0min: [2024-01-06 15:34:11,582][model8_pretrain.py][INFO] 
Epoch:[0/2](885200/4588595) loss:2.286 lr:0.0000100 epoch_Time:23572.0min: [2024-01-06 15:34:11,582][model8_pretrain.py][INFO] Epoch:[0/2](885200/4588595) loss:3.143 lr:0.0000100 epoch_Time:23572.0min: [2024-01-06 15:34:11,582][model8_pretrain.py][INFO] Epoch:[0/2](885200/4588595) loss:3.108 lr:0.0000100 epoch_Time:23572.0min: [2024-01-06 15:34:11,582][model8_pretrain.py][INFO] Epoch:[0/2](885200/4588595) loss:3.079 lr:0.0000100 epoch_Time:23572.0min: [2024-01-06 15:34:11,582][model8_pretrain.py][INFO] Epoch:[0/2](885200/4588595) loss:3.012 lr:0.0000100 epoch_Time:23572.0min: [2024-01-06 15:34:11,582][model8_pretrain.py][INFO] Epoch:[0/2](885200/4588595) loss:3.008 lr:0.0000100 epoch_Time:23572.0min: [2024-01-06 15:34:11,582][model8_pretrain.py][INFO] Epoch:[0/2](885200/4588595) loss:2.711 lr:0.0000100 epoch_Time:23572.0min: [2024-01-06 15:34:11,582][model8_pretrain.py][INFO] Epoch:[0/2](885200/4588595) loss:2.610 lr:0.0000100 epoch_Time:23572.0min: [2024-01-06 15:34:48,521][model8_pretrain.py][INFO] Epoch:[0/2](885300/4588595) loss:2.168 lr:0.0000100 epoch_Time:23571.0min: [2024-01-06 15:34:48,521][model8_pretrain.py][INFO] Epoch:[0/2](885300/4588595) loss:3.232 lr:0.0000100 epoch_Time:23571.0min: [2024-01-06 15:34:48,521][model8_pretrain.py][INFO] Epoch:[0/2](885300/4588595) loss:2.421 lr:0.0000100 epoch_Time:23571.0min: [2024-01-06 15:34:48,521][model8_pretrain.py][INFO] Epoch:[0/2](885300/4588595) loss:2.498 lr:0.0000100 epoch_Time:23571.0min: [2024-01-06 15:34:48,521][model8_pretrain.py][INFO] Epoch:[0/2](885300/4588595) loss:3.285 lr:0.0000100 epoch_Time:23571.0min: [2024-01-06 15:34:48,521][model8_pretrain.py][INFO] Epoch:[0/2](885300/4588595) loss:2.338 lr:0.0000100 epoch_Time:23571.0min: [2024-01-06 15:34:48,521][model8_pretrain.py][INFO] Epoch:[0/2](885300/4588595) loss:3.223 lr:0.0000100 epoch_Time:23571.0min: [2024-01-06 15:34:48,521][model8_pretrain.py][INFO] Epoch:[0/2](885300/4588595) loss:3.002 lr:0.0000100 epoch_Time:23571.0min: [2024-01-06 
15:35:25,445][model8_pretrain.py][INFO] Epoch:[0/2](885400/4588595) loss:2.131 lr:0.0000100 epoch_Time:23571.0min: [2024-01-06 15:35:25,445][model8_pretrain.py][INFO] Epoch:[0/2](885400/4588595) loss:2.929 lr:0.0000100 epoch_Time:23571.0min: [2024-01-06 15:35:25,445][model8_pretrain.py][INFO] Epoch:[0/2](885400/4588595) loss:2.874 lr:0.0000100 epoch_Time:23571.0min: [2024-01-06 15:35:25,445][model8_pretrain.py][INFO] Epoch:[0/2](885400/4588595) loss:2.723 lr:0.0000100 epoch_Time:23571.0min: [2024-01-06 15:35:25,445][model8_pretrain.py][INFO] Epoch:[0/2](885400/4588595) loss:2.798 lr:0.0000100 epoch_Time:23571.0min: [2024-01-06 15:35:25,445][model8_pretrain.py][INFO] Epoch:[0/2](885400/4588595) loss:3.200 lr:0.0000100 epoch_Time:23571.0min: [2024-01-06 15:35:25,446][model8_pretrain.py][INFO] Epoch:[0/2](885400/4588595) loss:2.916 lr:0.0000100 epoch_Time:23571.0min: [2024-01-06 15:35:25,446][model8_pretrain.py][INFO] Epoch:[0/2](885400/4588595) loss:2.671 lr:0.0000100 epoch_Time:23571.0min: [2024-01-06 15:36:02,374][model8_pretrain.py][INFO] Epoch:[0/2](885500/4588595) loss:2.583 lr:0.0000100 epoch_Time:23570.0min: [2024-01-06 15:36:02,374][model8_pretrain.py][INFO] Epoch:[0/2](885500/4588595) loss:2.452 lr:0.0000100 epoch_Time:23570.0min: [2024-01-06 15:36:02,374][model8_pretrain.py][INFO] Epoch:[0/2](885500/4588595) loss:2.570 lr:0.0000100 epoch_Time:23570.0min: [2024-01-06 15:36:02,374][model8_pretrain.py][INFO] Epoch:[0/2](885500/4588595) loss:2.375 lr:0.0000100 epoch_Time:23570.0min: [2024-01-06 15:36:02,374][model8_pretrain.py][INFO] Epoch:[0/2](885500/4588595) loss:2.889 lr:0.0000100 epoch_Time:23570.0min: [2024-01-06 15:36:02,375][model8_pretrain.py][INFO] Epoch:[0/2](885500/4588595) loss:2.594 lr:0.0000100 epoch_Time:23570.0min: [2024-01-06 15:36:02,375][model8_pretrain.py][INFO] Epoch:[0/2](885500/4588595) loss:2.905 lr:0.0000100 epoch_Time:23570.0min: [2024-01-06 15:36:02,375][model8_pretrain.py][INFO] Epoch:[0/2](885500/4588595) loss:2.313 lr:0.0000100 
epoch_Time:23570.0min: [2024-01-06 15:36:49,767][model8_pretrain.py][INFO] Epoch:[0/2](885600/4588595) loss:2.586 lr:0.0000100 epoch_Time:23570.0min: [2024-01-06 15:36:49,767][model8_pretrain.py][INFO] Epoch:[0/2](885600/4588595) loss:3.014 lr:0.0000100 epoch_Time:23570.0min: [2024-01-06 15:36:49,767][model8_pretrain.py][INFO] Epoch:[0/2](885600/4588595) loss:2.788 lr:0.0000100 epoch_Time:23570.0min: [2024-01-06 15:36:49,767][model8_pretrain.py][INFO] Epoch:[0/2](885600/4588595) loss:2.663 lr:0.0000100 epoch_Time:23570.0min: [2024-01-06 15:36:49,767][model8_pretrain.py][INFO] Epoch:[0/2](885600/4588595) loss:2.741 lr:0.0000100 epoch_Time:23570.0min: [2024-01-06 15:36:49,767][model8_pretrain.py][INFO] Epoch:[0/2](885600/4588595) loss:3.293 lr:0.0000100 epoch_Time:23570.0min: [2024-01-06 15:36:49,768][model8_pretrain.py][INFO] Epoch:[0/2](885600/4588595) loss:2.723 lr:0.0000100 epoch_Time:23570.0min: [2024-01-06 15:36:49,768][model8_pretrain.py][INFO] Epoch:[0/2](885600/4588595) loss:2.521 lr:0.0000100 epoch_Time:23570.0min: [2024-01-06 15:37:26,694][model8_pretrain.py][INFO] Epoch:[0/2](885700/4588595) loss:3.049 lr:0.0000100 epoch_Time:23570.0min: [2024-01-06 15:37:26,695][model8_pretrain.py][INFO] Epoch:[0/2](885700/4588595) loss:2.746 lr:0.0000100 epoch_Time:23570.0min: [2024-01-06 15:37:26,695][model8_pretrain.py][INFO] Epoch:[0/2](885700/4588595) loss:2.944 lr:0.0000100 epoch_Time:23570.0min: [2024-01-06 15:37:26,695][model8_pretrain.py][INFO] Epoch:[0/2](885700/4588595) loss:2.877 lr:0.0000100 epoch_Time:23570.0min: [2024-01-06 15:37:26,695][model8_pretrain.py][INFO] Epoch:[0/2](885700/4588595) loss:2.440 lr:0.0000100 epoch_Time:23570.0min: [2024-01-06 15:37:26,695][model8_pretrain.py][INFO] Epoch:[0/2](885700/4588595) loss:2.992 lr:0.0000100 epoch_Time:23570.0min: [2024-01-06 15:37:26,695][model8_pretrain.py][INFO] Epoch:[0/2](885700/4588595) loss:3.314 lr:0.0000100 epoch_Time:23570.0min: [2024-01-06 15:37:26,695][model8_pretrain.py][INFO] 
Epoch:[0/2](885700/4588595) loss:2.236 lr:0.0000100 epoch_Time:23570.0min: [2024-01-06 15:38:03,624][model8_pretrain.py][INFO] Epoch:[0/2](885800/4588595) loss:3.065 lr:0.0000100 epoch_Time:23569.0min: [2024-01-06 15:38:03,624][model8_pretrain.py][INFO] Epoch:[0/2](885800/4588595) loss:2.804 lr:0.0000100 epoch_Time:23569.0min: [2024-01-06 15:38:03,624][model8_pretrain.py][INFO] Epoch:[0/2](885800/4588595) loss:2.708 lr:0.0000100 epoch_Time:23569.0min: [2024-01-06 15:38:03,624][model8_pretrain.py][INFO] Epoch:[0/2](885800/4588595) loss:2.473 lr:0.0000100 epoch_Time:23569.0min: [2024-01-06 15:38:03,624][model8_pretrain.py][INFO] Epoch:[0/2](885800/4588595) loss:2.707 lr:0.0000100 epoch_Time:23569.0min: [2024-01-06 15:38:03,624][model8_pretrain.py][INFO] Epoch:[0/2](885800/4588595) loss:2.651 lr:0.0000100 epoch_Time:23569.0min: [2024-01-06 15:38:03,624][model8_pretrain.py][INFO] Epoch:[0/2](885800/4588595) loss:3.327 lr:0.0000100 epoch_Time:23569.0min: [2024-01-06 15:38:03,624][model8_pretrain.py][INFO] Epoch:[0/2](885800/4588595) loss:2.705 lr:0.0000100 epoch_Time:23569.0min: [2024-01-06 15:38:40,571][model8_pretrain.py][INFO] Epoch:[0/2](885900/4588595) loss:2.835 lr:0.0000100 epoch_Time:23569.0min: [2024-01-06 15:38:40,571][model8_pretrain.py][INFO] Epoch:[0/2](885900/4588595) loss:2.976 lr:0.0000100 epoch_Time:23569.0min: [2024-01-06 15:38:40,571][model8_pretrain.py][INFO] Epoch:[0/2](885900/4588595) loss:2.486 lr:0.0000100 epoch_Time:23569.0min: [2024-01-06 15:38:40,571][model8_pretrain.py][INFO] Epoch:[0/2](885900/4588595) loss:2.965 lr:0.0000100 epoch_Time:23569.0min: [2024-01-06 15:38:40,571][model8_pretrain.py][INFO] Epoch:[0/2](885900/4588595) loss:2.579 lr:0.0000100 epoch_Time:23569.0min: [2024-01-06 15:38:40,571][model8_pretrain.py][INFO] Epoch:[0/2](885900/4588595) loss:3.128 lr:0.0000100 epoch_Time:23569.0min: [2024-01-06 15:38:40,571][model8_pretrain.py][INFO] Epoch:[0/2](885900/4588595) loss:2.926 lr:0.0000100 epoch_Time:23569.0min: [2024-01-06 
15:38:40,571][model8_pretrain.py][INFO] Epoch:[0/2](885900/4588595) loss:3.053 lr:0.0000100 epoch_Time:23569.0min: [2024-01-06 15:39:17,499][model8_pretrain.py][INFO] Epoch:[0/2](886000/4588595) loss:2.858 lr:0.0000100 epoch_Time:23567.0min: [2024-01-06 15:39:17,499][model8_pretrain.py][INFO] Epoch:[0/2](886000/4588595) loss:2.918 lr:0.0000100 epoch_Time:23567.0min: [2024-01-06 15:39:17,499][model8_pretrain.py][INFO] Epoch:[0/2](886000/4588595) loss:2.114 lr:0.0000100 epoch_Time:23567.0min: [2024-01-06 15:39:17,499][model8_pretrain.py][INFO] Epoch:[0/2](886000/4588595) loss:3.221 lr:0.0000100 epoch_Time:23567.0min: [2024-01-06 15:39:17,499][model8_pretrain.py][INFO] Epoch:[0/2](886000/4588595) loss:3.170 lr:0.0000100 epoch_Time:23567.0min: [2024-01-06 15:39:17,499][model8_pretrain.py][INFO] Epoch:[0/2](886000/4588595) loss:2.886 lr:0.0000100 epoch_Time:23567.0min: [2024-01-06 15:39:17,499][model8_pretrain.py][INFO] Epoch:[0/2](886000/4588595) loss:2.874 lr:0.0000100 epoch_Time:23567.0min: [2024-01-06 15:39:17,500][model8_pretrain.py][INFO] Epoch:[0/2](886000/4588595) loss:2.725 lr:0.0000100 epoch_Time:23567.0min: [2024-01-06 15:39:54,433][model8_pretrain.py][INFO] Epoch:[0/2](886100/4588595) loss:3.076 lr:0.0000100 epoch_Time:23566.0min: [2024-01-06 15:39:54,434][model8_pretrain.py][INFO] Epoch:[0/2](886100/4588595) loss:2.938 lr:0.0000100 epoch_Time:23566.0min: [2024-01-06 15:39:54,434][model8_pretrain.py][INFO] Epoch:[0/2](886100/4588595) loss:3.080 lr:0.0000100 epoch_Time:23566.0min: [2024-01-06 15:39:54,434][model8_pretrain.py][INFO] Epoch:[0/2](886100/4588595) loss:3.043 lr:0.0000100 epoch_Time:23566.0min: [2024-01-06 15:39:54,434][model8_pretrain.py][INFO] Epoch:[0/2](886100/4588595) loss:3.044 lr:0.0000100 epoch_Time:23566.0min: [2024-01-06 15:39:54,434][model8_pretrain.py][INFO] Epoch:[0/2](886100/4588595) loss:3.163 lr:0.0000100 epoch_Time:23566.0min: [2024-01-06 15:39:54,434][model8_pretrain.py][INFO] Epoch:[0/2](886100/4588595) loss:2.911 lr:0.0000100 
epoch_Time:23566.0min: [2024-01-06 15:39:54,434][model8_pretrain.py][INFO] Epoch:[0/2](886100/4588595) loss:2.784 lr:0.0000100 epoch_Time:23566.0min: [2024-01-06 15:40:31,376][model8_pretrain.py][INFO] Epoch:[0/2](886200/4588595) loss:2.570 lr:0.0000100 epoch_Time:23566.0min: [2024-01-06 15:40:31,376][model8_pretrain.py][INFO] Epoch:[0/2](886200/4588595) loss:2.070 lr:0.0000100 epoch_Time:23566.0min: [2024-01-06 15:40:31,376][model8_pretrain.py][INFO] Epoch:[0/2](886200/4588595) loss:2.594 lr:0.0000100 epoch_Time:23566.0min: [2024-01-06 15:40:31,376][model8_pretrain.py][INFO] Epoch:[0/2](886200/4588595) loss:3.159 lr:0.0000100 epoch_Time:23566.0min: [2024-01-06 15:40:31,376][model8_pretrain.py][INFO] Epoch:[0/2](886200/4588595) loss:2.970 lr:0.0000100 epoch_Time:23566.0min: [2024-01-06 15:40:31,376][model8_pretrain.py][INFO] Epoch:[0/2](886200/4588595) loss:3.036 lr:0.0000100 epoch_Time:23566.0min: [2024-01-06 15:40:31,376][model8_pretrain.py][INFO] Epoch:[0/2](886200/4588595) loss:2.490 lr:0.0000100 epoch_Time:23566.0min: [2024-01-06 15:40:31,376][model8_pretrain.py][INFO] Epoch:[0/2](886200/4588595) loss:2.661 lr:0.0000100 epoch_Time:23566.0min: [2024-01-06 15:41:08,309][model8_pretrain.py][INFO] Epoch:[0/2](886300/4588595) loss:3.177 lr:0.0000100 epoch_Time:23565.0min: [2024-01-06 15:41:08,309][model8_pretrain.py][INFO] Epoch:[0/2](886300/4588595) loss:2.612 lr:0.0000100 epoch_Time:23565.0min: [2024-01-06 15:41:08,309][model8_pretrain.py][INFO] Epoch:[0/2](886300/4588595) loss:3.006 lr:0.0000100 epoch_Time:23565.0min: [2024-01-06 15:41:08,309][model8_pretrain.py][INFO] Epoch:[0/2](886300/4588595) loss:3.127 lr:0.0000100 epoch_Time:23565.0min: [2024-01-06 15:41:08,309][model8_pretrain.py][INFO] Epoch:[0/2](886300/4588595) loss:2.676 lr:0.0000100 epoch_Time:23565.0min: [2024-01-06 15:41:08,309][model8_pretrain.py][INFO] Epoch:[0/2](886300/4588595) loss:3.157 lr:0.0000100 epoch_Time:23565.0min: [2024-01-06 15:41:08,309][model8_pretrain.py][INFO] 
Epoch:[0/2](886300/4588595) loss:2.744 lr:0.0000100 epoch_Time:23565.0min: [2024-01-06 15:41:08,309][model8_pretrain.py][INFO] Epoch:[0/2](886300/4588595) loss:2.350 lr:0.0000100 epoch_Time:23565.0min: [2024-01-06 15:41:55,613][model8_pretrain.py][INFO] Epoch:[0/2](886400/4588595) loss:2.570 lr:0.0000100 epoch_Time:23565.0min: [2024-01-06 15:41:55,613][model8_pretrain.py][INFO] Epoch:[0/2](886400/4588595) loss:2.887 lr:0.0000100 epoch_Time:23565.0min: [2024-01-06 15:41:55,613][model8_pretrain.py][INFO] Epoch:[0/2](886400/4588595) loss:2.490 lr:0.0000100 epoch_Time:23565.0min: [2024-01-06 15:41:55,613][model8_pretrain.py][INFO] Epoch:[0/2](886400/4588595) loss:2.523 lr:0.0000100 epoch_Time:23565.0min: [2024-01-06 15:41:55,614][model8_pretrain.py][INFO] Epoch:[0/2](886400/4588595) loss:2.758 lr:0.0000100 epoch_Time:23565.0min: [2024-01-06 15:41:55,614][model8_pretrain.py][INFO] Epoch:[0/2](886400/4588595) loss:2.375 lr:0.0000100 epoch_Time:23565.0min: [2024-01-06 15:41:55,613][model8_pretrain.py][INFO] Epoch:[0/2](886400/4588595) loss:3.230 lr:0.0000100 epoch_Time:23565.0min: [2024-01-06 15:41:55,614][model8_pretrain.py][INFO] Epoch:[0/2](886400/4588595) loss:3.201 lr:0.0000100 epoch_Time:23565.0min: [2024-01-06 15:42:32,549][model8_pretrain.py][INFO] Epoch:[0/2](886500/4588595) loss:3.457 lr:0.0000100 epoch_Time:23565.0min: [2024-01-06 15:42:32,549][model8_pretrain.py][INFO] Epoch:[0/2](886500/4588595) loss:2.874 lr:0.0000100 epoch_Time:23565.0min: [2024-01-06 15:42:32,549][model8_pretrain.py][INFO] Epoch:[0/2](886500/4588595) loss:2.551 lr:0.0000100 epoch_Time:23565.0min: [2024-01-06 15:42:32,549][model8_pretrain.py][INFO] Epoch:[0/2](886500/4588595) loss:2.685 lr:0.0000100 epoch_Time:23565.0min: [2024-01-06 15:42:32,549][model8_pretrain.py][INFO] Epoch:[0/2](886500/4588595) loss:2.972 lr:0.0000100 epoch_Time:23565.0min: [2024-01-06 15:42:32,549][model8_pretrain.py][INFO] Epoch:[0/2](886500/4588595) loss:2.936 lr:0.0000100 epoch_Time:23565.0min: [2024-01-06 
15:42:32,549][model8_pretrain.py][INFO] Epoch:[0/2](886500/4588595) loss:3.032 lr:0.0000100 epoch_Time:23565.0min: [2024-01-06 15:42:32,550][model8_pretrain.py][INFO] Epoch:[0/2](886500/4588595) loss:2.957 lr:0.0000100 epoch_Time:23565.0min: [2024-01-06 15:43:09,480][model8_pretrain.py][INFO] Epoch:[0/2](886600/4588595) loss:2.561 lr:0.0000100 epoch_Time:23564.0min: [2024-01-06 15:43:09,480][model8_pretrain.py][INFO] Epoch:[0/2](886600/4588595) loss:2.559 lr:0.0000100 epoch_Time:23564.0min: [2024-01-06 15:43:09,480][model8_pretrain.py][INFO] Epoch:[0/2](886600/4588595) loss:2.482 lr:0.0000100 epoch_Time:23564.0min: [2024-01-06 15:43:09,480][model8_pretrain.py][INFO] Epoch:[0/2](886600/4588595) loss:3.017 lr:0.0000100 epoch_Time:23564.0min: [2024-01-06 15:43:09,480][model8_pretrain.py][INFO] Epoch:[0/2](886600/4588595) loss:3.105 lr:0.0000100 epoch_Time:23564.0min: [2024-01-06 15:43:09,480][model8_pretrain.py][INFO] Epoch:[0/2](886600/4588595) loss:2.598 lr:0.0000100 epoch_Time:23564.0min: [2024-01-06 15:43:09,481][model8_pretrain.py][INFO] Epoch:[0/2](886600/4588595) loss:2.905 lr:0.0000100 epoch_Time:23564.0min: [2024-01-06 15:43:09,481][model8_pretrain.py][INFO] Epoch:[0/2](886600/4588595) loss:3.444 lr:0.0000100 epoch_Time:23564.0min: [2024-01-06 15:43:46,414][model8_pretrain.py][INFO] Epoch:[0/2](886700/4588595) loss:2.736 lr:0.0000100 epoch_Time:23564.0min: [2024-01-06 15:43:46,414][model8_pretrain.py][INFO] Epoch:[0/2](886700/4588595) loss:2.772 lr:0.0000100 epoch_Time:23564.0min: [2024-01-06 15:43:46,414][model8_pretrain.py][INFO] Epoch:[0/2](886700/4588595) loss:2.713 lr:0.0000100 epoch_Time:23564.0min: [2024-01-06 15:43:46,414][model8_pretrain.py][INFO] Epoch:[0/2](886700/4588595) loss:3.145 lr:0.0000100 epoch_Time:23564.0min: [2024-01-06 15:43:46,414][model8_pretrain.py][INFO] Epoch:[0/2](886700/4588595) loss:2.998 lr:0.0000100 epoch_Time:23564.0min: [2024-01-06 15:43:46,414][model8_pretrain.py][INFO] Epoch:[0/2](886700/4588595) loss:3.236 lr:0.0000100 
epoch_Time:23564.0min: [2024-01-06 15:43:46,414][model8_pretrain.py][INFO] Epoch:[0/2](886700/4588595) loss:3.069 lr:0.0000100 epoch_Time:23564.0min: [2024-01-06 15:43:46,415][model8_pretrain.py][INFO] Epoch:[0/2](886700/4588595) loss:2.768 lr:0.0000100 epoch_Time:23564.0min: [2024-01-06 15:44:23,355][model8_pretrain.py][INFO] Epoch:[0/2](886800/4588595) loss:3.064 lr:0.0000100 epoch_Time:23562.0min: [2024-01-06 15:44:23,355][model8_pretrain.py][INFO] Epoch:[0/2](886800/4588595) loss:2.484 lr:0.0000100 epoch_Time:23562.0min: [2024-01-06 15:44:23,355][model8_pretrain.py][INFO] Epoch:[0/2](886800/4588595) loss:2.761 lr:0.0000100 epoch_Time:23562.0min: [2024-01-06 15:44:23,355][model8_pretrain.py][INFO] Epoch:[0/2](886800/4588595) loss:3.140 lr:0.0000100 epoch_Time:23562.0min: [2024-01-06 15:44:23,355][model8_pretrain.py][INFO] Epoch:[0/2](886800/4588595) loss:2.452 lr:0.0000100 epoch_Time:23562.0min: [2024-01-06 15:44:23,355][model8_pretrain.py][INFO] Epoch:[0/2](886800/4588595) loss:2.747 lr:0.0000100 epoch_Time:23562.0min: [2024-01-06 15:44:23,355][model8_pretrain.py][INFO] Epoch:[0/2](886800/4588595) loss:2.570 lr:0.0000100 epoch_Time:23562.0min: [2024-01-06 15:44:23,355][model8_pretrain.py][INFO] Epoch:[0/2](886800/4588595) loss:2.804 lr:0.0000100 epoch_Time:23562.0min: [2024-01-06 15:45:00,295][model8_pretrain.py][INFO] Epoch:[0/2](886900/4588595) loss:2.530 lr:0.0000100 epoch_Time:23561.0min: [2024-01-06 15:45:00,295][model8_pretrain.py][INFO] Epoch:[0/2](886900/4588595) loss:2.606 lr:0.0000100 epoch_Time:23561.0min: [2024-01-06 15:45:00,295][model8_pretrain.py][INFO] Epoch:[0/2](886900/4588595) loss:3.060 lr:0.0000100 epoch_Time:23561.0min: [2024-01-06 15:45:00,296][model8_pretrain.py][INFO] Epoch:[0/2](886900/4588595) loss:3.316 lr:0.0000100 epoch_Time:23561.0min: [2024-01-06 15:45:00,296][model8_pretrain.py][INFO] Epoch:[0/2](886900/4588595) loss:2.974 lr:0.0000100 epoch_Time:23561.0min: [2024-01-06 15:45:00,296][model8_pretrain.py][INFO] 
Epoch:[0/2](886900/4588595) loss:2.667 lr:0.0000100 epoch_Time:23561.0min: [2024-01-06 15:45:00,296][model8_pretrain.py][INFO] Epoch:[0/2](886900/4588595) loss:2.926 lr:0.0000100 epoch_Time:23561.0min: [2024-01-06 15:45:00,296][model8_pretrain.py][INFO] Epoch:[0/2](886900/4588595) loss:2.308 lr:0.0000100 epoch_Time:23561.0min: [2024-01-06 15:45:37,223][model8_pretrain.py][INFO] Epoch:[0/2](887000/4588595) loss:3.123 lr:0.0000100 epoch_Time:23561.0min: [2024-01-06 15:45:37,223][model8_pretrain.py][INFO] Epoch:[0/2](887000/4588595) loss:2.710 lr:0.0000100 epoch_Time:23561.0min: [2024-01-06 15:45:37,223][model8_pretrain.py][INFO] Epoch:[0/2](887000/4588595) loss:3.210 lr:0.0000100 epoch_Time:23561.0min: [2024-01-06 15:45:37,223][model8_pretrain.py][INFO] Epoch:[0/2](887000/4588595) loss:3.070 lr:0.0000100 epoch_Time:23561.0min: [2024-01-06 15:45:37,223][model8_pretrain.py][INFO] Epoch:[0/2](887000/4588595) loss:3.033 lr:0.0000100 epoch_Time:23561.0min: [2024-01-06 15:45:37,223][model8_pretrain.py][INFO] Epoch:[0/2](887000/4588595) loss:3.060 lr:0.0000100 epoch_Time:23561.0min: [2024-01-06 15:45:37,223][model8_pretrain.py][INFO] Epoch:[0/2](887000/4588595) loss:2.277 lr:0.0000100 epoch_Time:23561.0min: [2024-01-06 15:45:37,223][model8_pretrain.py][INFO] Epoch:[0/2](887000/4588595) loss:2.831 lr:0.0000100 epoch_Time:23561.0min: [2024-01-06 15:46:14,134][model8_pretrain.py][INFO] Epoch:[0/2](887100/4588595) loss:2.788 lr:0.0000100 epoch_Time:23560.0min: [2024-01-06 15:46:14,134][model8_pretrain.py][INFO] Epoch:[0/2](887100/4588595) loss:2.390 lr:0.0000100 epoch_Time:23560.0min: [2024-01-06 15:46:14,134][model8_pretrain.py][INFO] Epoch:[0/2](887100/4588595) loss:3.061 lr:0.0000100 epoch_Time:23560.0min: [2024-01-06 15:46:14,134][model8_pretrain.py][INFO] Epoch:[0/2](887100/4588595) loss:2.757 lr:0.0000100 epoch_Time:23560.0min: [2024-01-06 15:46:14,134][model8_pretrain.py][INFO] Epoch:[0/2](887100/4588595) loss:3.170 lr:0.0000100 epoch_Time:23560.0min: [2024-01-06 
15:46:14,134][model8_pretrain.py][INFO] Epoch:[0/2](887100/4588595) loss:2.924 lr:0.0000100 epoch_Time:23560.0min: [2024-01-06 15:46:14,134][model8_pretrain.py][INFO] Epoch:[0/2](887100/4588595) loss:2.686 lr:0.0000100 epoch_Time:23560.0min: [2024-01-06 15:46:14,134][model8_pretrain.py][INFO] Epoch:[0/2](887100/4588595) loss:2.844 lr:0.0000100 epoch_Time:23560.0min: [2024-01-06 15:47:01,524][model8_pretrain.py][INFO] Epoch:[0/2](887200/4588595) loss:2.537 lr:0.0000100 epoch_Time:23560.0min: [2024-01-06 15:47:01,524][model8_pretrain.py][INFO] Epoch:[0/2](887200/4588595) loss:2.613 lr:0.0000100 epoch_Time:23560.0min: [2024-01-06 15:47:01,524][model8_pretrain.py][INFO] Epoch:[0/2](887200/4588595) loss:3.271 lr:0.0000100 epoch_Time:23560.0min: [2024-01-06 15:47:01,524][model8_pretrain.py][INFO] Epoch:[0/2](887200/4588595) loss:2.776 lr:0.0000100 epoch_Time:23560.0min: [2024-01-06 15:47:01,524][model8_pretrain.py][INFO] Epoch:[0/2](887200/4588595) loss:3.165 lr:0.0000100 epoch_Time:23560.0min: [2024-01-06 15:47:01,524][model8_pretrain.py][INFO] Epoch:[0/2](887200/4588595) loss:2.890 lr:0.0000100 epoch_Time:23560.0min: [2024-01-06 15:47:01,524][model8_pretrain.py][INFO] Epoch:[0/2](887200/4588595) loss:2.534 lr:0.0000100 epoch_Time:23560.0min: [2024-01-06 15:47:01,524][model8_pretrain.py][INFO] Epoch:[0/2](887200/4588595) loss:2.860 lr:0.0000100 epoch_Time:23560.0min: [2024-01-06 15:47:38,453][model8_pretrain.py][INFO] Epoch:[0/2](887300/4588595) loss:2.437 lr:0.0000100 epoch_Time:23560.0min: [2024-01-06 15:47:38,453][model8_pretrain.py][INFO] Epoch:[0/2](887300/4588595) loss:3.061 lr:0.0000100 epoch_Time:23560.0min: [2024-01-06 15:47:38,453][model8_pretrain.py][INFO] Epoch:[0/2](887300/4588595) loss:3.065 lr:0.0000100 epoch_Time:23560.0min: [2024-01-06 15:47:38,454][model8_pretrain.py][INFO] Epoch:[0/2](887300/4588595) loss:2.979 lr:0.0000100 epoch_Time:23560.0min: [2024-01-06 15:47:38,453][model8_pretrain.py][INFO] Epoch:[0/2](887300/4588595) loss:3.330 lr:0.0000100 
epoch_Time:23560.0min: [2024-01-06 15:47:38,454][model8_pretrain.py][INFO] Epoch:[0/2](887300/4588595) loss:2.655 lr:0.0000100 epoch_Time:23560.0min: [2024-01-06 15:47:38,454][model8_pretrain.py][INFO] Epoch:[0/2](887300/4588595) loss:2.762 lr:0.0000100 epoch_Time:23560.0min: [2024-01-06 15:47:38,454][model8_pretrain.py][INFO] Epoch:[0/2](887300/4588595) loss:2.123 lr:0.0000100 epoch_Time:23560.0min: [2024-01-06 15:48:15,388][model8_pretrain.py][INFO] Epoch:[0/2](887400/4588595) loss:2.559 lr:0.0000100 epoch_Time:23559.0min: [2024-01-06 15:48:15,388][model8_pretrain.py][INFO] Epoch:[0/2](887400/4588595) loss:2.347 lr:0.0000100 epoch_Time:23559.0min: [2024-01-06 15:48:15,389][model8_pretrain.py][INFO] Epoch:[0/2](887400/4588595) loss:3.214 lr:0.0000100 epoch_Time:23559.0min: [2024-01-06 15:48:15,389][model8_pretrain.py][INFO] Epoch:[0/2](887400/4588595) loss:2.962 lr:0.0000100 epoch_Time:23559.0min: [2024-01-06 15:48:15,389][model8_pretrain.py][INFO] Epoch:[0/2](887400/4588595) loss:2.620 lr:0.0000100 epoch_Time:23559.0min: [2024-01-06 15:48:15,389][model8_pretrain.py][INFO] Epoch:[0/2](887400/4588595) loss:3.246 lr:0.0000100 epoch_Time:23559.0min: [2024-01-06 15:48:15,389][model8_pretrain.py][INFO] Epoch:[0/2](887400/4588595) loss:2.738 lr:0.0000100 epoch_Time:23559.0min: [2024-01-06 15:48:15,389][model8_pretrain.py][INFO] Epoch:[0/2](887400/4588595) loss:3.212 lr:0.0000100 epoch_Time:23559.0min: [2024-01-06 15:48:52,326][model8_pretrain.py][INFO] Epoch:[0/2](887500/4588595) loss:2.917 lr:0.0000100 epoch_Time:23558.0min: [2024-01-06 15:48:52,326][model8_pretrain.py][INFO] Epoch:[0/2](887500/4588595) loss:2.362 lr:0.0000100 epoch_Time:23558.0min: [2024-01-06 15:48:52,326][model8_pretrain.py][INFO] Epoch:[0/2](887500/4588595) loss:2.945 lr:0.0000100 epoch_Time:23558.0min: [2024-01-06 15:48:52,326][model8_pretrain.py][INFO] Epoch:[0/2](887500/4588595) loss:2.699 lr:0.0000100 epoch_Time:23558.0min: [2024-01-06 15:48:52,326][model8_pretrain.py][INFO] 
Epoch:[0/2](887500/4588595) loss:3.346 lr:0.0000100 epoch_Time:23558.0min: [2024-01-06 15:48:52,326][model8_pretrain.py][INFO] Epoch:[0/2](887500/4588595) loss:2.788 lr:0.0000100 epoch_Time:23558.0min: [2024-01-06 15:48:52,326][model8_pretrain.py][INFO] Epoch:[0/2](887500/4588595) loss:2.965 lr:0.0000100 epoch_Time:23558.0min: [2024-01-06 15:48:52,326][model8_pretrain.py][INFO] Epoch:[0/2](887500/4588595) loss:2.692 lr:0.0000100 epoch_Time:23558.0min: [2024-01-06 15:49:29,266][model8_pretrain.py][INFO] Epoch:[0/2](887600/4588595) loss:2.824 lr:0.0000100 epoch_Time:23557.0min: [2024-01-06 15:49:29,266][model8_pretrain.py][INFO] Epoch:[0/2](887600/4588595) loss:2.513 lr:0.0000100 epoch_Time:23557.0min: [2024-01-06 15:49:29,266][model8_pretrain.py][INFO] Epoch:[0/2](887600/4588595) loss:2.486 lr:0.0000100 epoch_Time:23557.0min: [2024-01-06 15:49:29,266][model8_pretrain.py][INFO] Epoch:[0/2](887600/4588595) loss:2.656 lr:0.0000100 epoch_Time:23557.0min: [2024-01-06 15:49:29,266][model8_pretrain.py][INFO] Epoch:[0/2](887600/4588595) loss:2.139 lr:0.0000100 epoch_Time:23557.0min: [2024-01-06 15:49:29,266][model8_pretrain.py][INFO] Epoch:[0/2](887600/4588595) loss:2.932 lr:0.0000100 epoch_Time:23557.0min: [2024-01-06 15:49:29,266][model8_pretrain.py][INFO] Epoch:[0/2](887600/4588595) loss:2.652 lr:0.0000100 epoch_Time:23557.0min: [2024-01-06 15:49:29,266][model8_pretrain.py][INFO] Epoch:[0/2](887600/4588595) loss:3.143 lr:0.0000100 epoch_Time:23557.0min: [2024-01-06 15:50:06,212][model8_pretrain.py][INFO] Epoch:[0/2](887700/4588595) loss:2.998 lr:0.0000100 epoch_Time:23556.0min: [2024-01-06 15:50:06,212][model8_pretrain.py][INFO] Epoch:[0/2](887700/4588595) loss:2.786 lr:0.0000100 epoch_Time:23556.0min: [2024-01-06 15:50:06,212][model8_pretrain.py][INFO] Epoch:[0/2](887700/4588595) loss:2.554 lr:0.0000100 epoch_Time:23556.0min: [2024-01-06 15:50:06,212][model8_pretrain.py][INFO] Epoch:[0/2](887700/4588595) loss:2.716 lr:0.0000100 epoch_Time:23556.0min: [2024-01-06 
15:50:06,212][model8_pretrain.py][INFO] Epoch:[0/2](887700/4588595) loss:2.583 lr:0.0000100 epoch_Time:23556.0min: [2024-01-06 15:50:06,212][model8_pretrain.py][INFO] Epoch:[0/2](887700/4588595) loss:2.674 lr:0.0000100 epoch_Time:23556.0min: [2024-01-06 15:50:06,212][model8_pretrain.py][INFO] Epoch:[0/2](887700/4588595) loss:2.727 lr:0.0000100 epoch_Time:23556.0min: [2024-01-06 15:50:06,213][model8_pretrain.py][INFO] Epoch:[0/2](887700/4588595) loss:2.742 lr:0.0000100 epoch_Time:23556.0min: [2024-01-06 15:50:43,146][model8_pretrain.py][INFO] Epoch:[0/2](887800/4588595) loss:2.644 lr:0.0000100 epoch_Time:23556.0min: [2024-01-06 15:50:43,146][model8_pretrain.py][INFO] Epoch:[0/2](887800/4588595) loss:2.961 lr:0.0000100 epoch_Time:23556.0min: [2024-01-06 15:50:43,146][model8_pretrain.py][INFO] Epoch:[0/2](887800/4588595) loss:2.874 lr:0.0000100 epoch_Time:23556.0min: [2024-01-06 15:50:43,146][model8_pretrain.py][INFO] Epoch:[0/2](887800/4588595) loss:2.552 lr:0.0000100 epoch_Time:23556.0min: [2024-01-06 15:50:43,147][model8_pretrain.py][INFO] Epoch:[0/2](887800/4588595) loss:2.574 lr:0.0000100 epoch_Time:23556.0min: [2024-01-06 15:50:43,147][model8_pretrain.py][INFO] Epoch:[0/2](887800/4588595) loss:2.935 lr:0.0000100 epoch_Time:23556.0min: [2024-01-06 15:50:43,147][model8_pretrain.py][INFO] Epoch:[0/2](887800/4588595) loss:3.110 lr:0.0000100 epoch_Time:23556.0min: [2024-01-06 15:50:43,147][model8_pretrain.py][INFO] Epoch:[0/2](887800/4588595) loss:3.223 lr:0.0000100 epoch_Time:23556.0min: [2024-01-06 15:51:20,113][model8_pretrain.py][INFO] Epoch:[0/2](887900/4588595) loss:2.154 lr:0.0000100 epoch_Time:23555.0min: [2024-01-06 15:51:20,113][model8_pretrain.py][INFO] Epoch:[0/2](887900/4588595) loss:3.144 lr:0.0000100 epoch_Time:23555.0min: [2024-01-06 15:51:20,113][model8_pretrain.py][INFO] Epoch:[0/2](887900/4588595) loss:3.305 lr:0.0000100 epoch_Time:23555.0min: [2024-01-06 15:51:20,113][model8_pretrain.py][INFO] Epoch:[0/2](887900/4588595) loss:2.162 lr:0.0000100 
epoch_Time:23555.0min: [2024-01-06 15:51:20,113][model8_pretrain.py][INFO] Epoch:[0/2](887900/4588595) loss:2.690 lr:0.0000100 epoch_Time:23555.0min: [2024-01-06 15:51:20,113][model8_pretrain.py][INFO] Epoch:[0/2](887900/4588595) loss:2.752 lr:0.0000100 epoch_Time:23555.0min: [2024-01-06 15:51:20,113][model8_pretrain.py][INFO] Epoch:[0/2](887900/4588595) loss:2.676 lr:0.0000100 epoch_Time:23555.0min: [2024-01-06 15:51:20,113][model8_pretrain.py][INFO] Epoch:[0/2](887900/4588595) loss:2.514 lr:0.0000100 epoch_Time:23555.0min: [2024-01-06 15:52:07,275][model8_pretrain.py][INFO] Epoch:[0/2](888000/4588595) loss:3.470 lr:0.0000100 epoch_Time:23555.0min: [2024-01-06 15:52:07,275][model8_pretrain.py][INFO] Epoch:[0/2](888000/4588595) loss:3.068 lr:0.0000100 epoch_Time:23555.0min: [2024-01-06 15:52:07,275][model8_pretrain.py][INFO] Epoch:[0/2](888000/4588595) loss:2.778 lr:0.0000100 epoch_Time:23555.0min: [2024-01-06 15:52:07,275][model8_pretrain.py][INFO] Epoch:[0/2](888000/4588595) loss:2.932 lr:0.0000100 epoch_Time:23555.0min: [2024-01-06 15:52:07,275][model8_pretrain.py][INFO] Epoch:[0/2](888000/4588595) loss:2.717 lr:0.0000100 epoch_Time:23555.0min: [2024-01-06 15:52:07,275][model8_pretrain.py][INFO] Epoch:[0/2](888000/4588595) loss:2.503 lr:0.0000100 epoch_Time:23555.0min: [2024-01-06 15:52:07,275][model8_pretrain.py][INFO] Epoch:[0/2](888000/4588595) loss:2.151 lr:0.0000100 epoch_Time:23555.0min: [2024-01-06 15:52:07,275][model8_pretrain.py][INFO] Epoch:[0/2](888000/4588595) loss:3.213 lr:0.0000100 epoch_Time:23555.0min: [2024-01-06 15:52:44,197][model8_pretrain.py][INFO] Epoch:[0/2](888100/4588595) loss:3.185 lr:0.0000100 epoch_Time:23555.0min: [2024-01-06 15:52:44,197][model8_pretrain.py][INFO] Epoch:[0/2](888100/4588595) loss:2.725 lr:0.0000100 epoch_Time:23555.0min: [2024-01-06 15:52:44,197][model8_pretrain.py][INFO] Epoch:[0/2](888100/4588595) loss:2.869 lr:0.0000100 epoch_Time:23555.0min: [2024-01-06 15:52:44,197][model8_pretrain.py][INFO] 
Epoch:[0/2](888100/4588595) loss:3.075 lr:0.0000100 epoch_Time:23555.0min: [2024-01-06 15:52:44,197][model8_pretrain.py][INFO] Epoch:[0/2](888100/4588595) loss:3.288 lr:0.0000100 epoch_Time:23555.0min: [2024-01-06 15:52:44,197][model8_pretrain.py][INFO] Epoch:[0/2](888100/4588595) loss:2.430 lr:0.0000100 epoch_Time:23555.0min: [2024-01-06 15:52:44,197][model8_pretrain.py][INFO] Epoch:[0/2](888100/4588595) loss:3.121 lr:0.0000100 epoch_Time:23555.0min: [2024-01-06 15:52:44,197][model8_pretrain.py][INFO] Epoch:[0/2](888100/4588595) loss:3.182 lr:0.0000100 epoch_Time:23555.0min: [2024-01-06 15:53:21,140][model8_pretrain.py][INFO] Epoch:[0/2](888200/4588595) loss:2.875 lr:0.0000100 epoch_Time:23554.0min: [2024-01-06 15:53:21,140][model8_pretrain.py][INFO] Epoch:[0/2](888200/4588595) loss:2.941 lr:0.0000100 epoch_Time:23554.0min: [2024-01-06 15:53:21,140][model8_pretrain.py][INFO] Epoch:[0/2](888200/4588595) loss:2.756 lr:0.0000100 epoch_Time:23554.0min: [2024-01-06 15:53:21,140][model8_pretrain.py][INFO] Epoch:[0/2](888200/4588595) loss:2.599 lr:0.0000100 epoch_Time:23554.0min: [2024-01-06 15:53:21,140][model8_pretrain.py][INFO] Epoch:[0/2](888200/4588595) loss:2.518 lr:0.0000100 epoch_Time:23554.0min: [2024-01-06 15:53:21,140][model8_pretrain.py][INFO] Epoch:[0/2](888200/4588595) loss:3.482 lr:0.0000100 epoch_Time:23554.0min: [2024-01-06 15:53:21,140][model8_pretrain.py][INFO] Epoch:[0/2](888200/4588595) loss:2.809 lr:0.0000100 epoch_Time:23554.0min: [2024-01-06 15:53:21,140][model8_pretrain.py][INFO] Epoch:[0/2](888200/4588595) loss:2.686 lr:0.0000100 epoch_Time:23554.0min: [2024-01-06 15:53:58,077][model8_pretrain.py][INFO] Epoch:[0/2](888300/4588595) loss:2.568 lr:0.0000100 epoch_Time:23553.0min: [2024-01-06 15:53:58,077][model8_pretrain.py][INFO] Epoch:[0/2](888300/4588595) loss:2.359 lr:0.0000100 epoch_Time:23553.0min: [2024-01-06 15:53:58,077][model8_pretrain.py][INFO] Epoch:[0/2](888300/4588595) loss:3.100 lr:0.0000100 epoch_Time:23553.0min: [2024-01-06 
15:53:58,077][model8_pretrain.py][INFO] Epoch:[0/2](888300/4588595) loss:2.708 lr:0.0000100 epoch_Time:23553.0min: [2024-01-06 15:53:58,077][model8_pretrain.py][INFO] Epoch:[0/2](888300/4588595) loss:3.074 lr:0.0000100 epoch_Time:23553.0min: [2024-01-06 15:53:58,077][model8_pretrain.py][INFO] Epoch:[0/2](888300/4588595) loss:2.314 lr:0.0000100 epoch_Time:23553.0min: [2024-01-06 15:53:58,077][model8_pretrain.py][INFO] Epoch:[0/2](888300/4588595) loss:2.241 lr:0.0000100 epoch_Time:23553.0min: [2024-01-06 15:53:58,077][model8_pretrain.py][INFO] Epoch:[0/2](888300/4588595) loss:3.196 lr:0.0000100 epoch_Time:23553.0min: [2024-01-06 15:54:35,024][model8_pretrain.py][INFO] Epoch:[0/2](888400/4588595) loss:2.867 lr:0.0000100 epoch_Time:23553.0min: [2024-01-06 15:54:35,024][model8_pretrain.py][INFO] Epoch:[0/2](888400/4588595) loss:2.923 lr:0.0000100 epoch_Time:23553.0min: [2024-01-06 15:54:35,024][model8_pretrain.py][INFO] Epoch:[0/2](888400/4588595) loss:3.046 lr:0.0000100 epoch_Time:23553.0min: [2024-01-06 15:54:35,024][model8_pretrain.py][INFO] Epoch:[0/2](888400/4588595) loss:2.712 lr:0.0000100 epoch_Time:23553.0min: [2024-01-06 15:54:35,024][model8_pretrain.py][INFO] Epoch:[0/2](888400/4588595) loss:2.987 lr:0.0000100 epoch_Time:23553.0min: [2024-01-06 15:54:35,024][model8_pretrain.py][INFO] Epoch:[0/2](888400/4588595) loss:3.093 lr:0.0000100 epoch_Time:23553.0min: [2024-01-06 15:54:35,025][model8_pretrain.py][INFO] Epoch:[0/2](888400/4588595) loss:2.077 lr:0.0000100 epoch_Time:23553.0min: [2024-01-06 15:54:35,025][model8_pretrain.py][INFO] Epoch:[0/2](888400/4588595) loss:2.889 lr:0.0000100 epoch_Time:23553.0min: [2024-01-06 15:55:11,952][model8_pretrain.py][INFO] Epoch:[0/2](888500/4588595) loss:2.852 lr:0.0000100 epoch_Time:23551.0min: [2024-01-06 15:55:11,952][model8_pretrain.py][INFO] Epoch:[0/2](888500/4588595) loss:2.787 lr:0.0000100 epoch_Time:23551.0min: [2024-01-06 15:55:11,952][model8_pretrain.py][INFO] Epoch:[0/2](888500/4588595) loss:1.997 lr:0.0000100 
epoch_Time:23551.0min: [2024-01-06 15:55:11,952][model8_pretrain.py][INFO] Epoch:[0/2](888500/4588595) loss:2.616 lr:0.0000100 epoch_Time:23551.0min: [2024-01-06 15:55:11,952][model8_pretrain.py][INFO] Epoch:[0/2](888500/4588595) loss:3.152 lr:0.0000100 epoch_Time:23551.0min: [2024-01-06 15:55:11,952][model8_pretrain.py][INFO] Epoch:[0/2](888500/4588595) loss:2.245 lr:0.0000100 epoch_Time:23551.0min: [2024-01-06 15:55:11,952][model8_pretrain.py][INFO] Epoch:[0/2](888500/4588595) loss:2.896 lr:0.0000100 epoch_Time:23551.0min: [2024-01-06 15:55:11,952][model8_pretrain.py][INFO] Epoch:[0/2](888500/4588595) loss:2.853 lr:0.0000100 epoch_Time:23551.0min: [2024-01-06 15:55:48,885][model8_pretrain.py][INFO] Epoch:[0/2](888600/4588595) loss:3.050 lr:0.0000100 epoch_Time:23550.0min: [2024-01-06 15:55:48,885][model8_pretrain.py][INFO] Epoch:[0/2](888600/4588595) loss:3.047 lr:0.0000100 epoch_Time:23550.0min: [2024-01-06 15:55:48,885][model8_pretrain.py][INFO] Epoch:[0/2](888600/4588595) loss:2.576 lr:0.0000100 epoch_Time:23550.0min: [2024-01-06 15:55:48,885][model8_pretrain.py][INFO] Epoch:[0/2](888600/4588595) loss:2.770 lr:0.0000100 epoch_Time:23550.0min: [2024-01-06 15:55:48,885][model8_pretrain.py][INFO] Epoch:[0/2](888600/4588595) loss:3.141 lr:0.0000100 epoch_Time:23550.0min: [2024-01-06 15:55:48,885][model8_pretrain.py][INFO] Epoch:[0/2](888600/4588595) loss:2.881 lr:0.0000100 epoch_Time:23550.0min: [2024-01-06 15:55:48,886][model8_pretrain.py][INFO] Epoch:[0/2](888600/4588595) loss:2.677 lr:0.0000100 epoch_Time:23550.0min: [2024-01-06 15:55:48,886][model8_pretrain.py][INFO] Epoch:[0/2](888600/4588595) loss:2.585 lr:0.0000100 epoch_Time:23550.0min: [2024-01-06 15:56:25,816][model8_pretrain.py][INFO] Epoch:[0/2](888700/4588595) loss:2.562 lr:0.0000100 epoch_Time:23550.0min: [2024-01-06 15:56:25,816][model8_pretrain.py][INFO] Epoch:[0/2](888700/4588595) loss:2.948 lr:0.0000100 epoch_Time:23550.0min: [2024-01-06 15:56:25,816][model8_pretrain.py][INFO] 
Epoch:[0/2](888700/4588595) loss:2.553 lr:0.0000100 epoch_Time:23550.0min: [2024-01-06 15:56:25,816][model8_pretrain.py][INFO] Epoch:[0/2](888700/4588595) loss:2.749 lr:0.0000100 epoch_Time:23550.0min: [2024-01-06 15:56:25,816][model8_pretrain.py][INFO] Epoch:[0/2](888700/4588595) loss:3.068 lr:0.0000100 epoch_Time:23550.0min: [2024-01-06 15:56:25,816][model8_pretrain.py][INFO] Epoch:[0/2](888700/4588595) loss:2.490 lr:0.0000100 epoch_Time:23550.0min: [2024-01-06 15:56:25,816][model8_pretrain.py][INFO] Epoch:[0/2](888700/4588595) loss:2.340 lr:0.0000100 epoch_Time:23550.0min: [2024-01-06 15:56:25,817][model8_pretrain.py][INFO] Epoch:[0/2](888700/4588595) loss:2.999 lr:0.0000100 epoch_Time:23550.0min: [2024-01-06 15:57:12,845][model8_pretrain.py][INFO] Epoch:[0/2](888800/4588595) loss:2.841 lr:0.0000100 epoch_Time:23550.0min: [2024-01-06 15:57:12,845][model8_pretrain.py][INFO] Epoch:[0/2](888800/4588595) loss:3.099 lr:0.0000100 epoch_Time:23550.0min: [2024-01-06 15:57:12,845][model8_pretrain.py][INFO] Epoch:[0/2](888800/4588595) loss:2.502 lr:0.0000100 epoch_Time:23550.0min: [2024-01-06 15:57:12,845][model8_pretrain.py][INFO] Epoch:[0/2](888800/4588595) loss:2.231 lr:0.0000100 epoch_Time:23550.0min: [2024-01-06 15:57:12,845][model8_pretrain.py][INFO] Epoch:[0/2](888800/4588595) loss:2.761 lr:0.0000100 epoch_Time:23550.0min: [2024-01-06 15:57:12,845][model8_pretrain.py][INFO] Epoch:[0/2](888800/4588595) loss:3.005 lr:0.0000100 epoch_Time:23550.0min: [2024-01-06 15:57:12,845][model8_pretrain.py][INFO] Epoch:[0/2](888800/4588595) loss:3.210 lr:0.0000100 epoch_Time:23550.0min: [2024-01-06 15:57:12,845][model8_pretrain.py][INFO] Epoch:[0/2](888800/4588595) loss:2.642 lr:0.0000100 epoch_Time:23550.0min: [2024-01-06 15:57:49,765][model8_pretrain.py][INFO] Epoch:[0/2](888900/4588595) loss:2.960 lr:0.0000100 epoch_Time:23549.0min: [2024-01-06 15:57:49,765][model8_pretrain.py][INFO] Epoch:[0/2](888900/4588595) loss:2.482 lr:0.0000100 epoch_Time:23549.0min: [2024-01-06 
15:57:49,765][model8_pretrain.py][INFO] Epoch:[0/2](888900/4588595) loss:3.121 lr:0.0000100 epoch_Time:23549.0min: [2024-01-06 15:57:49,765][model8_pretrain.py][INFO] Epoch:[0/2](888900/4588595) loss:2.746 lr:0.0000100 epoch_Time:23549.0min: [2024-01-06 15:57:49,765][model8_pretrain.py][INFO] Epoch:[0/2](888900/4588595) loss:2.398 lr:0.0000100 epoch_Time:23549.0min: [2024-01-06 15:57:49,765][model8_pretrain.py][INFO] Epoch:[0/2](888900/4588595) loss:2.933 lr:0.0000100 epoch_Time:23549.0min: [2024-01-06 15:57:49,766][model8_pretrain.py][INFO] Epoch:[0/2](888900/4588595) loss:2.651 lr:0.0000100 epoch_Time:23549.0min: [2024-01-06 15:57:49,766][model8_pretrain.py][INFO] Epoch:[0/2](888900/4588595) loss:3.152 lr:0.0000100 epoch_Time:23549.0min: [2024-01-06 15:58:26,707][model8_pretrain.py][INFO] Epoch:[0/2](889000/4588595) loss:3.041 lr:0.0000100 epoch_Time:23549.0min: [2024-01-06 15:58:26,707][model8_pretrain.py][INFO] Epoch:[0/2](889000/4588595) loss:3.068 lr:0.0000100 epoch_Time:23549.0min: [2024-01-06 15:58:26,707][model8_pretrain.py][INFO] Epoch:[0/2](889000/4588595) loss:2.728 lr:0.0000100 epoch_Time:23549.0min: [2024-01-06 15:58:26,707][model8_pretrain.py][INFO] Epoch:[0/2](889000/4588595) loss:3.200 lr:0.0000100 epoch_Time:23549.0min: [2024-01-06 15:58:26,707][model8_pretrain.py][INFO] Epoch:[0/2](889000/4588595) loss:3.086 lr:0.0000100 epoch_Time:23549.0min: [2024-01-06 15:58:26,707][model8_pretrain.py][INFO] Epoch:[0/2](889000/4588595) loss:3.215 lr:0.0000100 epoch_Time:23549.0min: [2024-01-06 15:58:26,707][model8_pretrain.py][INFO] Epoch:[0/2](889000/4588595) loss:2.341 lr:0.0000100 epoch_Time:23549.0min: [2024-01-06 15:58:26,708][model8_pretrain.py][INFO] Epoch:[0/2](889000/4588595) loss:2.802 lr:0.0000100 epoch_Time:23549.0min: [2024-01-06 15:59:03,652][model8_pretrain.py][INFO] Epoch:[0/2](889100/4588595) loss:3.049 lr:0.0000100 epoch_Time:23548.0min: [2024-01-06 15:59:03,652][model8_pretrain.py][INFO] Epoch:[0/2](889100/4588595) loss:2.826 lr:0.0000100 
epoch_Time:23548.0min: [2024-01-06 15:59:03,652][model8_pretrain.py][INFO] Epoch:[0/2](889100/4588595) loss:3.021 lr:0.0000100 epoch_Time:23548.0min: [2024-01-06 15:59:03,652][model8_pretrain.py][INFO] Epoch:[0/2](889100/4588595) loss:2.628 lr:0.0000100 epoch_Time:23548.0min: [2024-01-06 15:59:03,652][model8_pretrain.py][INFO] Epoch:[0/2](889100/4588595) loss:2.825 lr:0.0000100 epoch_Time:23548.0min: [2024-01-06 15:59:03,652][model8_pretrain.py][INFO] Epoch:[0/2](889100/4588595) loss:2.679 lr:0.0000100 epoch_Time:23548.0min: [2024-01-06 15:59:03,652][model8_pretrain.py][INFO] Epoch:[0/2](889100/4588595) loss:3.091 lr:0.0000100 epoch_Time:23548.0min: [2024-01-06 15:59:03,652][model8_pretrain.py][INFO] Epoch:[0/2](889100/4588595) loss:2.600 lr:0.0000100 epoch_Time:23548.0min: [2024-01-06 15:59:40,596][model8_pretrain.py][INFO] Epoch:[0/2](889200/4588595) loss:3.009 lr:0.0000100 epoch_Time:23548.0min: [2024-01-06 15:59:40,596][model8_pretrain.py][INFO] Epoch:[0/2](889200/4588595) loss:3.239 lr:0.0000100 epoch_Time:23548.0min: [2024-01-06 15:59:40,596][model8_pretrain.py][INFO] Epoch:[0/2](889200/4588595) loss:2.730 lr:0.0000100 epoch_Time:23548.0min: [2024-01-06 15:59:40,596][model8_pretrain.py][INFO] Epoch:[0/2](889200/4588595) loss:3.162 lr:0.0000100 epoch_Time:23548.0min: [2024-01-06 15:59:40,596][model8_pretrain.py][INFO] Epoch:[0/2](889200/4588595) loss:2.830 lr:0.0000100 epoch_Time:23548.0min: [2024-01-06 15:59:40,596][model8_pretrain.py][INFO] Epoch:[0/2](889200/4588595) loss:2.578 lr:0.0000100 epoch_Time:23548.0min: [2024-01-06 15:59:40,596][model8_pretrain.py][INFO] Epoch:[0/2](889200/4588595) loss:2.400 lr:0.0000100 epoch_Time:23548.0min: [2024-01-06 15:59:40,596][model8_pretrain.py][INFO] Epoch:[0/2](889200/4588595) loss:2.815 lr:0.0000100 epoch_Time:23548.0min: [2024-01-06 16:00:17,523][model8_pretrain.py][INFO] Epoch:[0/2](889300/4588595) loss:2.020 lr:0.0000100 epoch_Time:23546.0min: [2024-01-06 16:00:17,523][model8_pretrain.py][INFO] 
Epoch:[0/2](889300/4588595) loss:2.965 lr:0.0000100 epoch_Time:23546.0min: [2024-01-06 16:00:17,523][model8_pretrain.py][INFO] Epoch:[0/2](889300/4588595) loss:3.144 lr:0.0000100 epoch_Time:23546.0min: [2024-01-06 16:00:17,523][model8_pretrain.py][INFO] Epoch:[0/2](889300/4588595) loss:2.510 lr:0.0000100 epoch_Time:23546.0min: [2024-01-06 16:00:17,523][model8_pretrain.py][INFO] Epoch:[0/2](889300/4588595) loss:2.801 lr:0.0000100 epoch_Time:23546.0min: [2024-01-06 16:00:17,523][model8_pretrain.py][INFO] Epoch:[0/2](889300/4588595) loss:2.832 lr:0.0000100 epoch_Time:23546.0min: [2024-01-06 16:00:17,523][model8_pretrain.py][INFO] Epoch:[0/2](889300/4588595) loss:2.748 lr:0.0000100 epoch_Time:23546.0min: [2024-01-06 16:00:17,523][model8_pretrain.py][INFO] Epoch:[0/2](889300/4588595) loss:2.712 lr:0.0000100 epoch_Time:23546.0min: [2024-01-06 16:00:54,455][model8_pretrain.py][INFO] Epoch:[0/2](889400/4588595) loss:3.210 lr:0.0000100 epoch_Time:23545.0min: [2024-01-06 16:00:54,455][model8_pretrain.py][INFO] Epoch:[0/2](889400/4588595) loss:3.395 lr:0.0000100 epoch_Time:23545.0min: [2024-01-06 16:00:54,455][model8_pretrain.py][INFO] Epoch:[0/2](889400/4588595) loss:2.741 lr:0.0000100 epoch_Time:23545.0min: [2024-01-06 16:00:54,455][model8_pretrain.py][INFO] Epoch:[0/2](889400/4588595) loss:2.868 lr:0.0000100 epoch_Time:23545.0min: [2024-01-06 16:00:54,455][model8_pretrain.py][INFO] Epoch:[0/2](889400/4588595) loss:2.948 lr:0.0000100 epoch_Time:23545.0min: [2024-01-06 16:00:54,455][model8_pretrain.py][INFO] Epoch:[0/2](889400/4588595) loss:3.119 lr:0.0000100 epoch_Time:23545.0min: [2024-01-06 16:00:54,455][model8_pretrain.py][INFO] Epoch:[0/2](889400/4588595) loss:2.373 lr:0.0000100 epoch_Time:23545.0min: [2024-01-06 16:00:54,456][model8_pretrain.py][INFO] Epoch:[0/2](889400/4588595) loss:2.883 lr:0.0000100 epoch_Time:23545.0min: [2024-01-06 16:01:31,395][model8_pretrain.py][INFO] Epoch:[0/2](889500/4588595) loss:2.321 lr:0.0000100 epoch_Time:23545.0min: [2024-01-06 
16:01:31,395][model8_pretrain.py][INFO] Epoch:[0/2](889500/4588595) loss:2.835 lr:0.0000100 epoch_Time:23545.0min: [2024-01-06 16:01:31,395][model8_pretrain.py][INFO] Epoch:[0/2](889500/4588595) loss:2.834 lr:0.0000100 epoch_Time:23545.0min: [2024-01-06 16:01:31,395][model8_pretrain.py][INFO] Epoch:[0/2](889500/4588595) loss:2.763 lr:0.0000100 epoch_Time:23545.0min: [2024-01-06 16:01:31,395][model8_pretrain.py][INFO] Epoch:[0/2](889500/4588595) loss:3.152 lr:0.0000100 epoch_Time:23545.0min: [2024-01-06 16:01:31,395][model8_pretrain.py][INFO] Epoch:[0/2](889500/4588595) loss:3.099 lr:0.0000100 epoch_Time:23545.0min: [2024-01-06 16:01:31,395][model8_pretrain.py][INFO] Epoch:[0/2](889500/4588595) loss:2.572 lr:0.0000100 epoch_Time:23545.0min: [2024-01-06 16:01:31,395][model8_pretrain.py][INFO] Epoch:[0/2](889500/4588595) loss:2.913 lr:0.0000100 epoch_Time:23545.0min: [2024-01-06 16:02:18,471][model8_pretrain.py][INFO] Epoch:[0/2](889600/4588595) loss:2.389 lr:0.0000100 epoch_Time:23545.0min: [2024-01-06 16:02:18,471][model8_pretrain.py][INFO] Epoch:[0/2](889600/4588595) loss:2.889 lr:0.0000100 epoch_Time:23545.0min: [2024-01-06 16:02:18,471][model8_pretrain.py][INFO] Epoch:[0/2](889600/4588595) loss:2.433 lr:0.0000100 epoch_Time:23545.0min: [2024-01-06 16:02:18,471][model8_pretrain.py][INFO] Epoch:[0/2](889600/4588595) loss:2.532 lr:0.0000100 epoch_Time:23545.0min: [2024-01-06 16:02:18,471][model8_pretrain.py][INFO] Epoch:[0/2](889600/4588595) loss:2.985 lr:0.0000100 epoch_Time:23545.0min: [2024-01-06 16:02:18,471][model8_pretrain.py][INFO] Epoch:[0/2](889600/4588595) loss:3.363 lr:0.0000100 epoch_Time:23545.0min: [2024-01-06 16:02:18,471][model8_pretrain.py][INFO] Epoch:[0/2](889600/4588595) loss:3.031 lr:0.0000100 epoch_Time:23545.0min: [2024-01-06 16:02:18,471][model8_pretrain.py][INFO] Epoch:[0/2](889600/4588595) loss:2.817 lr:0.0000100 epoch_Time:23545.0min: [2024-01-06 16:02:55,417][model8_pretrain.py][INFO] Epoch:[0/2](889700/4588595) loss:2.816 lr:0.0000100 
epoch_Time:23544.0min: [2024-01-06 16:02:55,417][model8_pretrain.py][INFO] Epoch:[0/2](889700/4588595) loss:2.801 lr:0.0000100 epoch_Time:23544.0min: [2024-01-06 16:02:55,417][model8_pretrain.py][INFO] Epoch:[0/2](889700/4588595) loss:2.712 lr:0.0000100 epoch_Time:23544.0min: [2024-01-06 16:02:55,417][model8_pretrain.py][INFO] Epoch:[0/2](889700/4588595) loss:2.503 lr:0.0000100 epoch_Time:23544.0min: [2024-01-06 16:02:55,417][model8_pretrain.py][INFO] Epoch:[0/2](889700/4588595) loss:3.009 lr:0.0000100 epoch_Time:23544.0min: [2024-01-06 16:02:55,417][model8_pretrain.py][INFO] Epoch:[0/2](889700/4588595) loss:3.310 lr:0.0000100 epoch_Time:23544.0min: [2024-01-06 16:02:55,417][model8_pretrain.py][INFO] Epoch:[0/2](889700/4588595) loss:2.171 lr:0.0000100 epoch_Time:23544.0min: [2024-01-06 16:02:55,417][model8_pretrain.py][INFO] Epoch:[0/2](889700/4588595) loss:2.851 lr:0.0000100 epoch_Time:23544.0min: [2024-01-06 16:03:32,371][model8_pretrain.py][INFO] Epoch:[0/2](889800/4588595) loss:3.012 lr:0.0000100 epoch_Time:23544.0min: [2024-01-06 16:03:32,371][model8_pretrain.py][INFO] Epoch:[0/2](889800/4588595) loss:2.523 lr:0.0000100 epoch_Time:23544.0min: [2024-01-06 16:03:32,371][model8_pretrain.py][INFO] Epoch:[0/2](889800/4588595) loss:2.941 lr:0.0000100 epoch_Time:23544.0min: [2024-01-06 16:03:32,371][model8_pretrain.py][INFO] Epoch:[0/2](889800/4588595) loss:3.268 lr:0.0000100 epoch_Time:23544.0min: [2024-01-06 16:03:32,371][model8_pretrain.py][INFO] Epoch:[0/2](889800/4588595) loss:2.777 lr:0.0000100 epoch_Time:23544.0min: [2024-01-06 16:03:32,371][model8_pretrain.py][INFO] Epoch:[0/2](889800/4588595) loss:3.084 lr:0.0000100 epoch_Time:23544.0min: [2024-01-06 16:03:32,371][model8_pretrain.py][INFO] Epoch:[0/2](889800/4588595) loss:2.865 lr:0.0000100 epoch_Time:23544.0min: [2024-01-06 16:03:32,371][model8_pretrain.py][INFO] Epoch:[0/2](889800/4588595) loss:2.480 lr:0.0000100 epoch_Time:23544.0min: [2024-01-06 16:04:09,320][model8_pretrain.py][INFO] 
Epoch:[0/2](889900/4588595) loss:2.593 lr:0.0000100 epoch_Time:23543.0min: [2024-01-06 16:04:09,320][model8_pretrain.py][INFO] Epoch:[0/2](889900/4588595) loss:2.729 lr:0.0000100 epoch_Time:23543.0min: [2024-01-06 16:04:09,320][model8_pretrain.py][INFO] Epoch:[0/2](889900/4588595) loss:3.010 lr:0.0000100 epoch_Time:23543.0min: [2024-01-06 16:04:09,320][model8_pretrain.py][INFO] Epoch:[0/2](889900/4588595) loss:3.164 lr:0.0000100 epoch_Time:23543.0min: [2024-01-06 16:04:09,320][model8_pretrain.py][INFO] Epoch:[0/2](889900/4588595) loss:2.442 lr:0.0000100 epoch_Time:23543.0min: [2024-01-06 16:04:09,320][model8_pretrain.py][INFO] Epoch:[0/2](889900/4588595) loss:2.448 lr:0.0000100 epoch_Time:23543.0min: [2024-01-06 16:04:09,320][model8_pretrain.py][INFO] Epoch:[0/2](889900/4588595) loss:2.557 lr:0.0000100 epoch_Time:23543.0min: [2024-01-06 16:04:09,320][model8_pretrain.py][INFO] Epoch:[0/2](889900/4588595) loss:2.760 lr:0.0000100 epoch_Time:23543.0min: [2024-01-06 16:04:46,264][model8_pretrain.py][INFO] Epoch:[0/2](890000/4588595) loss:2.683 lr:0.0000100 epoch_Time:23543.0min: [2024-01-06 16:04:46,264][model8_pretrain.py][INFO] Epoch:[0/2](890000/4588595) loss:3.009 lr:0.0000100 epoch_Time:23543.0min: [2024-01-06 16:04:46,264][model8_pretrain.py][INFO] Epoch:[0/2](890000/4588595) loss:2.856 lr:0.0000100 epoch_Time:23543.0min: [2024-01-06 16:04:46,264][model8_pretrain.py][INFO] Epoch:[0/2](890000/4588595) loss:2.598 lr:0.0000100 epoch_Time:23543.0min: [2024-01-06 16:04:46,264][model8_pretrain.py][INFO] Epoch:[0/2](890000/4588595) loss:2.501 lr:0.0000100 epoch_Time:23543.0min: [2024-01-06 16:04:46,264][model8_pretrain.py][INFO] Epoch:[0/2](890000/4588595) loss:3.284 lr:0.0000100 epoch_Time:23543.0min: [2024-01-06 16:04:46,264][model8_pretrain.py][INFO] Epoch:[0/2](890000/4588595) loss:3.082 lr:0.0000100 epoch_Time:23543.0min: [2024-01-06 16:04:46,264][model8_pretrain.py][INFO] Epoch:[0/2](890000/4588595) loss:2.754 lr:0.0000100 epoch_Time:23543.0min: [2024-01-06 
16:05:23,205][model8_pretrain.py][INFO] Epoch:[0/2](890100/4588595) loss:2.607 lr:0.0000100 epoch_Time:23541.0min: [2024-01-06 16:05:23,205][model8_pretrain.py][INFO] Epoch:[0/2](890100/4588595) loss:3.128 lr:0.0000100 epoch_Time:23541.0min: [2024-01-06 16:05:23,205][model8_pretrain.py][INFO] Epoch:[0/2](890100/4588595) loss:2.325 lr:0.0000100 epoch_Time:23541.0min: [2024-01-06 16:05:23,205][model8_pretrain.py][INFO] Epoch:[0/2](890100/4588595) loss:2.616 lr:0.0000100 epoch_Time:23541.0min: [2024-01-06 16:05:23,205][model8_pretrain.py][INFO] Epoch:[0/2](890100/4588595) loss:3.326 lr:0.0000100 epoch_Time:23541.0min: [2024-01-06 16:05:23,205][model8_pretrain.py][INFO] Epoch:[0/2](890100/4588595) loss:3.108 lr:0.0000100 epoch_Time:23541.0min: [2024-01-06 16:05:23,205][model8_pretrain.py][INFO] Epoch:[0/2](890100/4588595) loss:3.166 lr:0.0000100 epoch_Time:23541.0min: [2024-01-06 16:05:23,205][model8_pretrain.py][INFO] Epoch:[0/2](890100/4588595) loss:1.912 lr:0.0000100 epoch_Time:23541.0min: [2024-01-06 16:06:00,133][model8_pretrain.py][INFO] Epoch:[0/2](890200/4588595) loss:2.869 lr:0.0000100 epoch_Time:23540.0min: [2024-01-06 16:06:00,133][model8_pretrain.py][INFO] Epoch:[0/2](890200/4588595) loss:2.423 lr:0.0000100 epoch_Time:23540.0min: [2024-01-06 16:06:00,133][model8_pretrain.py][INFO] Epoch:[0/2](890200/4588595) loss:2.606 lr:0.0000100 epoch_Time:23540.0min: [2024-01-06 16:06:00,133][model8_pretrain.py][INFO] Epoch:[0/2](890200/4588595) loss:2.465 lr:0.0000100 epoch_Time:23540.0min: [2024-01-06 16:06:00,133][model8_pretrain.py][INFO] Epoch:[0/2](890200/4588595) loss:2.834 lr:0.0000100 epoch_Time:23540.0min: [2024-01-06 16:06:00,133][model8_pretrain.py][INFO] Epoch:[0/2](890200/4588595) loss:2.939 lr:0.0000100 epoch_Time:23540.0min: [2024-01-06 16:06:00,133][model8_pretrain.py][INFO] Epoch:[0/2](890200/4588595) loss:2.658 lr:0.0000100 epoch_Time:23540.0min: [2024-01-06 16:06:00,133][model8_pretrain.py][INFO] Epoch:[0/2](890200/4588595) loss:3.240 lr:0.0000100 
epoch_Time:23540.0min: [2024-01-06 16:06:37,072][model8_pretrain.py][INFO] Epoch:[0/2](890300/4588595) loss:3.157 lr:0.0000100 epoch_Time:23540.0min: [2024-01-06 16:06:37,072][model8_pretrain.py][INFO] Epoch:[0/2](890300/4588595) loss:3.188 lr:0.0000100 epoch_Time:23540.0min: [2024-01-06 16:06:37,072][model8_pretrain.py][INFO] Epoch:[0/2](890300/4588595) loss:3.191 lr:0.0000100 epoch_Time:23540.0min: [2024-01-06 16:06:37,072][model8_pretrain.py][INFO] Epoch:[0/2](890300/4588595) loss:2.281 lr:0.0000100 epoch_Time:23540.0min: [2024-01-06 16:06:37,072][model8_pretrain.py][INFO] Epoch:[0/2](890300/4588595) loss:3.036 lr:0.0000100 epoch_Time:23540.0min: [2024-01-06 16:06:37,072][model8_pretrain.py][INFO] Epoch:[0/2](890300/4588595) loss:2.804 lr:0.0000100 epoch_Time:23540.0min: [2024-01-06 16:06:37,072][model8_pretrain.py][INFO] Epoch:[0/2](890300/4588595) loss:3.261 lr:0.0000100 epoch_Time:23540.0min: [2024-01-06 16:06:37,072][model8_pretrain.py][INFO] Epoch:[0/2](890300/4588595) loss:2.611 lr:0.0000100 epoch_Time:23540.0min: [2024-01-06 16:07:26,145][model8_pretrain.py][INFO] Epoch:[0/2](890400/4588595) loss:2.589 lr:0.0000100 epoch_Time:23540.0min: [2024-01-06 16:07:26,145][model8_pretrain.py][INFO] Epoch:[0/2](890400/4588595) loss:3.037 lr:0.0000100 epoch_Time:23540.0min: [2024-01-06 16:07:26,145][model8_pretrain.py][INFO] Epoch:[0/2](890400/4588595) loss:2.683 lr:0.0000100 epoch_Time:23540.0min: [2024-01-06 16:07:26,145][model8_pretrain.py][INFO] Epoch:[0/2](890400/4588595) loss:3.273 lr:0.0000100 epoch_Time:23540.0min: [2024-01-06 16:07:26,145][model8_pretrain.py][INFO] Epoch:[0/2](890400/4588595) loss:3.194 lr:0.0000100 epoch_Time:23540.0min: [2024-01-06 16:07:26,145][model8_pretrain.py][INFO] Epoch:[0/2](890400/4588595) loss:3.106 lr:0.0000100 epoch_Time:23540.0min: [2024-01-06 16:07:26,145][model8_pretrain.py][INFO] Epoch:[0/2](890400/4588595) loss:2.546 lr:0.0000100 epoch_Time:23540.0min: [2024-01-06 16:07:26,145][model8_pretrain.py][INFO] 
Epoch:[0/2](890400/4588595) loss:2.824 lr:0.0000100 epoch_Time:23540.0min: [2024-01-06 16:08:03,078][model8_pretrain.py][INFO] Epoch:[0/2](890500/4588595) loss:2.442 lr:0.0000100 epoch_Time:23539.0min: [2024-01-06 16:08:03,078][model8_pretrain.py][INFO] Epoch:[0/2](890500/4588595) loss:2.900 lr:0.0000100 epoch_Time:23539.0min: [2024-01-06 16:08:03,078][model8_pretrain.py][INFO] Epoch:[0/2](890500/4588595) loss:3.038 lr:0.0000100 epoch_Time:23539.0min: [2024-01-06 16:08:03,078][model8_pretrain.py][INFO] Epoch:[0/2](890500/4588595) loss:2.253 lr:0.0000100 epoch_Time:23539.0min: [2024-01-06 16:08:03,078][model8_pretrain.py][INFO] Epoch:[0/2](890500/4588595) loss:2.957 lr:0.0000100 epoch_Time:23539.0min: [2024-01-06 16:08:03,078][model8_pretrain.py][INFO] Epoch:[0/2](890500/4588595) loss:3.145 lr:0.0000100 epoch_Time:23539.0min: [2024-01-06 16:08:03,078][model8_pretrain.py][INFO] Epoch:[0/2](890500/4588595) loss:2.585 lr:0.0000100 epoch_Time:23539.0min: [2024-01-06 16:08:03,078][model8_pretrain.py][INFO] Epoch:[0/2](890500/4588595) loss:3.472 lr:0.0000100 epoch_Time:23539.0min: [2024-01-06 16:08:40,021][model8_pretrain.py][INFO] Epoch:[0/2](890600/4588595) loss:3.160 lr:0.0000100 epoch_Time:23539.0min: [2024-01-06 16:08:40,021][model8_pretrain.py][INFO] Epoch:[0/2](890600/4588595) loss:3.031 lr:0.0000100 epoch_Time:23539.0min: [2024-01-06 16:08:40,021][model8_pretrain.py][INFO] Epoch:[0/2](890600/4588595) loss:2.928 lr:0.0000100 epoch_Time:23539.0min: [2024-01-06 16:08:40,021][model8_pretrain.py][INFO] Epoch:[0/2](890600/4588595) loss:2.773 lr:0.0000100 epoch_Time:23539.0min: [2024-01-06 16:08:40,021][model8_pretrain.py][INFO] Epoch:[0/2](890600/4588595) loss:3.052 lr:0.0000100 epoch_Time:23539.0min: [2024-01-06 16:08:40,021][model8_pretrain.py][INFO] Epoch:[0/2](890600/4588595) loss:2.479 lr:0.0000100 epoch_Time:23539.0min: [2024-01-06 16:08:40,021][model8_pretrain.py][INFO] Epoch:[0/2](890600/4588595) loss:3.172 lr:0.0000100 epoch_Time:23539.0min: [2024-01-06 
16:08:40,021][model8_pretrain.py][INFO] Epoch:[0/2](890600/4588595) loss:2.700 lr:0.0000100 epoch_Time:23539.0min: [2024-01-06 16:09:16,975][model8_pretrain.py][INFO] Epoch:[0/2](890700/4588595) loss:2.622 lr:0.0000100 epoch_Time:23538.0min: [2024-01-06 16:09:16,975][model8_pretrain.py][INFO] Epoch:[0/2](890700/4588595) loss:3.142 lr:0.0000100 epoch_Time:23538.0min: [2024-01-06 16:09:16,975][model8_pretrain.py][INFO] Epoch:[0/2](890700/4588595) loss:2.822 lr:0.0000100 epoch_Time:23538.0min: [2024-01-06 16:09:16,975][model8_pretrain.py][INFO] Epoch:[0/2](890700/4588595) loss:2.880 lr:0.0000100 epoch_Time:23538.0min: [2024-01-06 16:09:16,975][model8_pretrain.py][INFO] Epoch:[0/2](890700/4588595) loss:3.521 lr:0.0000100 epoch_Time:23538.0min: [2024-01-06 16:09:16,975][model8_pretrain.py][INFO] Epoch:[0/2](890700/4588595) loss:2.821 lr:0.0000100 epoch_Time:23538.0min: [2024-01-06 16:09:16,975][model8_pretrain.py][INFO] Epoch:[0/2](890700/4588595) loss:2.653 lr:0.0000100 epoch_Time:23538.0min: [2024-01-06 16:09:16,975][model8_pretrain.py][INFO] Epoch:[0/2](890700/4588595) loss:2.634 lr:0.0000100 epoch_Time:23538.0min: [2024-01-06 16:09:53,927][model8_pretrain.py][INFO] Epoch:[0/2](890800/4588595) loss:3.225 lr:0.0000100 epoch_Time:23537.0min: [2024-01-06 16:09:53,927][model8_pretrain.py][INFO] Epoch:[0/2](890800/4588595) loss:2.804 lr:0.0000100 epoch_Time:23537.0min: [2024-01-06 16:09:53,927][model8_pretrain.py][INFO] Epoch:[0/2](890800/4588595) loss:3.129 lr:0.0000100 epoch_Time:23537.0min: [2024-01-06 16:09:53,927][model8_pretrain.py][INFO] Epoch:[0/2](890800/4588595) loss:3.117 lr:0.0000100 epoch_Time:23537.0min: [2024-01-06 16:09:53,927][model8_pretrain.py][INFO] Epoch:[0/2](890800/4588595) loss:2.771 lr:0.0000100 epoch_Time:23537.0min: [2024-01-06 16:09:53,927][model8_pretrain.py][INFO] Epoch:[0/2](890800/4588595) loss:3.228 lr:0.0000100 epoch_Time:23537.0min: [2024-01-06 16:09:53,927][model8_pretrain.py][INFO] Epoch:[0/2](890800/4588595) loss:2.235 lr:0.0000100 
epoch_Time:23537.0min: [2024-01-06 16:09:53,927][model8_pretrain.py][INFO] Epoch:[0/2](890800/4588595) loss:2.426 lr:0.0000100 epoch_Time:23537.0min: [2024-01-06 16:10:30,897][model8_pretrain.py][INFO] Epoch:[0/2](890900/4588595) loss:2.669 lr:0.0000100 epoch_Time:23537.0min: [2024-01-06 16:10:30,897][model8_pretrain.py][INFO] Epoch:[0/2](890900/4588595) loss:2.094 lr:0.0000100 epoch_Time:23537.0min: [2024-01-06 16:10:30,897][model8_pretrain.py][INFO] Epoch:[0/2](890900/4588595) loss:3.113 lr:0.0000100 epoch_Time:23537.0min: [2024-01-06 16:10:30,897][model8_pretrain.py][INFO] Epoch:[0/2](890900/4588595) loss:2.478 lr:0.0000100 epoch_Time:23537.0min: [2024-01-06 16:10:30,898][model8_pretrain.py][INFO] Epoch:[0/2](890900/4588595) loss:2.862 lr:0.0000100 epoch_Time:23537.0min: [2024-01-06 16:10:30,898][model8_pretrain.py][INFO] Epoch:[0/2](890900/4588595) loss:2.812 lr:0.0000100 epoch_Time:23537.0min: [2024-01-06 16:10:30,898][model8_pretrain.py][INFO] Epoch:[0/2](890900/4588595) loss:2.731 lr:0.0000100 epoch_Time:23537.0min: [2024-01-06 16:10:30,898][model8_pretrain.py][INFO] Epoch:[0/2](890900/4588595) loss:4.001 lr:0.0000100 epoch_Time:23537.0min: [2024-01-06 16:11:07,845][model8_pretrain.py][INFO] Epoch:[0/2](891000/4588595) loss:2.575 lr:0.0000100 epoch_Time:23536.0min: [2024-01-06 16:11:07,845][model8_pretrain.py][INFO] Epoch:[0/2](891000/4588595) loss:3.254 lr:0.0000100 epoch_Time:23536.0min: [2024-01-06 16:11:07,846][model8_pretrain.py][INFO] Epoch:[0/2](891000/4588595) loss:2.832 lr:0.0000100 epoch_Time:23536.0min: [2024-01-06 16:11:07,846][model8_pretrain.py][INFO] Epoch:[0/2](891000/4588595) loss:2.449 lr:0.0000100 epoch_Time:23536.0min: [2024-01-06 16:11:07,846][model8_pretrain.py][INFO] Epoch:[0/2](891000/4588595) loss:3.001 lr:0.0000100 epoch_Time:23536.0min: [2024-01-06 16:11:07,846][model8_pretrain.py][INFO] Epoch:[0/2](891000/4588595) loss:3.047 lr:0.0000100 epoch_Time:23536.0min: [2024-01-06 16:11:07,846][model8_pretrain.py][INFO] 
Epoch:[0/2](891000/4588595) loss:2.917 lr:0.0000100 epoch_Time:23536.0min: [2024-01-06 16:11:07,846][model8_pretrain.py][INFO] Epoch:[0/2](891000/4588595) loss:2.279 lr:0.0000100 epoch_Time:23536.0min: [2024-01-06 16:11:44,796][model8_pretrain.py][INFO] Epoch:[0/2](891100/4588595) loss:2.903 lr:0.0000100 epoch_Time:23535.0min: [2024-01-06 16:11:44,796][model8_pretrain.py][INFO] Epoch:[0/2](891100/4588595) loss:3.196 lr:0.0000100 epoch_Time:23535.0min: [2024-01-06 16:11:44,796][model8_pretrain.py][INFO] Epoch:[0/2](891100/4588595) loss:2.568 lr:0.0000100 epoch_Time:23535.0min: [2024-01-06 16:11:44,796][model8_pretrain.py][INFO] Epoch:[0/2](891100/4588595) loss:2.674 lr:0.0000100 epoch_Time:23535.0min: [2024-01-06 16:11:44,796][model8_pretrain.py][INFO] Epoch:[0/2](891100/4588595) loss:2.571 lr:0.0000100 epoch_Time:23535.0min: [2024-01-06 16:11:44,796][model8_pretrain.py][INFO] Epoch:[0/2](891100/4588595) loss:3.170 lr:0.0000100 epoch_Time:23535.0min: [2024-01-06 16:11:44,796][model8_pretrain.py][INFO] Epoch:[0/2](891100/4588595) loss:2.906 lr:0.0000100 epoch_Time:23535.0min: [2024-01-06 16:11:44,796][model8_pretrain.py][INFO] Epoch:[0/2](891100/4588595) loss:2.942 lr:0.0000100 epoch_Time:23535.0min: [2024-01-06 16:12:33,914][model8_pretrain.py][INFO] Epoch:[0/2](891200/4588595) loss:2.951 lr:0.0000100 epoch_Time:23535.0min: [2024-01-06 16:12:33,914][model8_pretrain.py][INFO] Epoch:[0/2](891200/4588595) loss:2.598 lr:0.0000100 epoch_Time:23535.0min: [2024-01-06 16:12:33,914][model8_pretrain.py][INFO] Epoch:[0/2](891200/4588595) loss:3.326 lr:0.0000100 epoch_Time:23535.0min: [2024-01-06 16:12:33,914][model8_pretrain.py][INFO] Epoch:[0/2](891200/4588595) loss:2.453 lr:0.0000100 epoch_Time:23535.0min: [2024-01-06 16:12:33,914][model8_pretrain.py][INFO] Epoch:[0/2](891200/4588595) loss:2.088 lr:0.0000100 epoch_Time:23535.0min: [2024-01-06 16:12:33,915][model8_pretrain.py][INFO] Epoch:[0/2](891200/4588595) loss:2.730 lr:0.0000100 epoch_Time:23535.0min: [2024-01-06 
16:12:33,915][model8_pretrain.py][INFO] Epoch:[0/2](891200/4588595) loss:2.960 lr:0.0000100 epoch_Time:23535.0min: [2024-01-06 16:12:33,916][model8_pretrain.py][INFO] Epoch:[0/2](891200/4588595) loss:2.504 lr:0.0000100 epoch_Time:23535.0min: [2024-01-06 16:13:10,831][model8_pretrain.py][INFO] Epoch:[0/2](891300/4588595) loss:2.708 lr:0.0000100 epoch_Time:23534.0min: [2024-01-06 16:13:10,831][model8_pretrain.py][INFO] Epoch:[0/2](891300/4588595) loss:2.946 lr:0.0000100 epoch_Time:23534.0min: [2024-01-06 16:13:10,831][model8_pretrain.py][INFO] Epoch:[0/2](891300/4588595) loss:2.625 lr:0.0000100 epoch_Time:23534.0min: [2024-01-06 16:13:10,831][model8_pretrain.py][INFO] Epoch:[0/2](891300/4588595) loss:3.216 lr:0.0000100 epoch_Time:23534.0min: [2024-01-06 16:13:10,831][model8_pretrain.py][INFO] Epoch:[0/2](891300/4588595) loss:2.700 lr:0.0000100 epoch_Time:23534.0min: [2024-01-06 16:13:10,831][model8_pretrain.py][INFO] Epoch:[0/2](891300/4588595) loss:2.571 lr:0.0000100 epoch_Time:23534.0min: [2024-01-06 16:13:10,831][model8_pretrain.py][INFO] Epoch:[0/2](891300/4588595) loss:2.992 lr:0.0000100 epoch_Time:23534.0min: [2024-01-06 16:13:10,832][model8_pretrain.py][INFO] Epoch:[0/2](891300/4588595) loss:2.940 lr:0.0000100 epoch_Time:23534.0min: [2024-01-06 16:13:47,765][model8_pretrain.py][INFO] Epoch:[0/2](891400/4588595) loss:2.276 lr:0.0000100 epoch_Time:23533.0min: [2024-01-06 16:13:47,765][model8_pretrain.py][INFO] Epoch:[0/2](891400/4588595) loss:2.650 lr:0.0000100 epoch_Time:23533.0min: [2024-01-06 16:13:47,765][model8_pretrain.py][INFO] Epoch:[0/2](891400/4588595) loss:2.974 lr:0.0000100 epoch_Time:23533.0min: [2024-01-06 16:13:47,765][model8_pretrain.py][INFO] Epoch:[0/2](891400/4588595) loss:2.753 lr:0.0000100 epoch_Time:23533.0min: [2024-01-06 16:13:47,766][model8_pretrain.py][INFO] Epoch:[0/2](891400/4588595) loss:2.870 lr:0.0000100 epoch_Time:23533.0min: [2024-01-06 16:13:47,766][model8_pretrain.py][INFO] Epoch:[0/2](891400/4588595) loss:2.384 lr:0.0000100 
epoch_Time:23533.0min: [2024-01-06 16:13:47,766][model8_pretrain.py][INFO] Epoch:[0/2](891400/4588595) loss:2.515 lr:0.0000100 epoch_Time:23533.0min: [2024-01-06 16:13:47,766][model8_pretrain.py][INFO] Epoch:[0/2](891400/4588595) loss:2.630 lr:0.0000100 epoch_Time:23533.0min: [2024-01-06 16:14:24,741][model8_pretrain.py][INFO] Epoch:[0/2](891500/4588595) loss:2.454 lr:0.0000100 epoch_Time:23533.0min: [2024-01-06 16:14:24,741][model8_pretrain.py][INFO] Epoch:[0/2](891500/4588595) loss:2.583 lr:0.0000100 epoch_Time:23533.0min: [2024-01-06 16:14:24,741][model8_pretrain.py][INFO] Epoch:[0/2](891500/4588595) loss:2.862 lr:0.0000100 epoch_Time:23533.0min: [2024-01-06 16:14:24,741][model8_pretrain.py][INFO] Epoch:[0/2](891500/4588595) loss:2.672 lr:0.0000100 epoch_Time:23533.0min: [2024-01-06 16:14:24,741][model8_pretrain.py][INFO] Epoch:[0/2](891500/4588595) loss:3.029 lr:0.0000100 epoch_Time:23533.0min: [2024-01-06 16:14:24,741][model8_pretrain.py][INFO] Epoch:[0/2](891500/4588595) loss:3.168 lr:0.0000100 epoch_Time:23533.0min: [2024-01-06 16:14:24,741][model8_pretrain.py][INFO] Epoch:[0/2](891500/4588595) loss:2.964 lr:0.0000100 epoch_Time:23533.0min: [2024-01-06 16:14:24,741][model8_pretrain.py][INFO] Epoch:[0/2](891500/4588595) loss:2.130 lr:0.0000100 epoch_Time:23533.0min: [2024-01-06 16:15:01,719][model8_pretrain.py][INFO] Epoch:[0/2](891600/4588595) loss:3.029 lr:0.0000100 epoch_Time:23532.0min: [2024-01-06 16:15:01,719][model8_pretrain.py][INFO] Epoch:[0/2](891600/4588595) loss:2.923 lr:0.0000100 epoch_Time:23532.0min: [2024-01-06 16:15:01,719][model8_pretrain.py][INFO] Epoch:[0/2](891600/4588595) loss:3.105 lr:0.0000100 epoch_Time:23532.0min: [2024-01-06 16:15:01,719][model8_pretrain.py][INFO] Epoch:[0/2](891600/4588595) loss:2.873 lr:0.0000100 epoch_Time:23532.0min: [2024-01-06 16:15:01,719][model8_pretrain.py][INFO] Epoch:[0/2](891600/4588595) loss:2.821 lr:0.0000100 epoch_Time:23532.0min: [2024-01-06 16:15:01,719][model8_pretrain.py][INFO] 
Epoch:[0/2](891600/4588595) loss:2.600 lr:0.0000100 epoch_Time:23532.0min: [2024-01-06 16:15:01,719][model8_pretrain.py][INFO] Epoch:[0/2](891600/4588595) loss:2.854 lr:0.0000100 epoch_Time:23532.0min: [2024-01-06 16:15:01,719][model8_pretrain.py][INFO] Epoch:[0/2](891600/4588595) loss:3.128 lr:0.0000100 epoch_Time:23532.0min: [2024-01-06 16:15:38,667][model8_pretrain.py][INFO] Epoch:[0/2](891700/4588595) loss:2.985 lr:0.0000100 epoch_Time:23532.0min: [2024-01-06 16:15:38,667][model8_pretrain.py][INFO] Epoch:[0/2](891700/4588595) loss:3.067 lr:0.0000100 epoch_Time:23532.0min: [2024-01-06 16:15:38,667][model8_pretrain.py][INFO] Epoch:[0/2](891700/4588595) loss:2.399 lr:0.0000100 epoch_Time:23532.0min: [2024-01-06 16:15:38,667][model8_pretrain.py][INFO] Epoch:[0/2](891700/4588595) loss:2.548 lr:0.0000100 epoch_Time:23532.0min: [2024-01-06 16:15:38,667][model8_pretrain.py][INFO] Epoch:[0/2](891700/4588595) loss:2.747 lr:0.0000100 epoch_Time:23532.0min: [2024-01-06 16:15:38,667][model8_pretrain.py][INFO] Epoch:[0/2](891700/4588595) loss:2.165 lr:0.0000100 epoch_Time:23532.0min: [2024-01-06 16:15:38,667][model8_pretrain.py][INFO] Epoch:[0/2](891700/4588595) loss:2.037 lr:0.0000100 epoch_Time:23532.0min: [2024-01-06 16:15:38,668][model8_pretrain.py][INFO] Epoch:[0/2](891700/4588595) loss:2.826 lr:0.0000100 epoch_Time:23532.0min: [2024-01-06 16:16:15,609][model8_pretrain.py][INFO] Epoch:[0/2](891800/4588595) loss:2.620 lr:0.0000100 epoch_Time:23531.0min: [2024-01-06 16:16:15,609][model8_pretrain.py][INFO] Epoch:[0/2](891800/4588595) loss:2.868 lr:0.0000100 epoch_Time:23531.0min: [2024-01-06 16:16:15,609][model8_pretrain.py][INFO] Epoch:[0/2](891800/4588595) loss:2.499 lr:0.0000100 epoch_Time:23531.0min: [2024-01-06 16:16:15,609][model8_pretrain.py][INFO] Epoch:[0/2](891800/4588595) loss:2.861 lr:0.0000100 epoch_Time:23531.0min: [2024-01-06 16:16:15,609][model8_pretrain.py][INFO] Epoch:[0/2](891800/4588595) loss:2.848 lr:0.0000100 epoch_Time:23531.0min: [2024-01-06 
16:16:15,609][model8_pretrain.py][INFO] Epoch:[0/2](891800/4588595) loss:1.822 lr:0.0000100 epoch_Time:23531.0min: [2024-01-06 16:16:15,609][model8_pretrain.py][INFO] Epoch:[0/2](891800/4588595) loss:2.810 lr:0.0000100 epoch_Time:23531.0min: [2024-01-06 16:16:15,610][model8_pretrain.py][INFO] Epoch:[0/2](891800/4588595) loss:3.060 lr:0.0000100 epoch_Time:23531.0min: [2024-01-06 16:16:52,543][model8_pretrain.py][INFO] Epoch:[0/2](891900/4588595) loss:3.282 lr:0.0000100 epoch_Time:23530.0min: [2024-01-06 16:16:52,543][model8_pretrain.py][INFO] Epoch:[0/2](891900/4588595) loss:3.308 lr:0.0000100 epoch_Time:23530.0min: [2024-01-06 16:16:52,543][model8_pretrain.py][INFO] Epoch:[0/2](891900/4588595) loss:2.652 lr:0.0000100 epoch_Time:23530.0min: [2024-01-06 16:16:52,543][model8_pretrain.py][INFO] Epoch:[0/2](891900/4588595) loss:3.219 lr:0.0000100 epoch_Time:23530.0min: [2024-01-06 16:16:52,543][model8_pretrain.py][INFO] Epoch:[0/2](891900/4588595) loss:2.821 lr:0.0000100 epoch_Time:23530.0min: [2024-01-06 16:16:52,543][model8_pretrain.py][INFO] Epoch:[0/2](891900/4588595) loss:2.673 lr:0.0000100 epoch_Time:23530.0min: [2024-01-06 16:16:52,543][model8_pretrain.py][INFO] Epoch:[0/2](891900/4588595) loss:2.215 lr:0.0000100 epoch_Time:23530.0min: [2024-01-06 16:16:52,543][model8_pretrain.py][INFO] Epoch:[0/2](891900/4588595) loss:2.179 lr:0.0000100 epoch_Time:23530.0min: [2024-01-06 16:17:39,586][model8_pretrain.py][INFO] Epoch:[0/2](892000/4588595) loss:3.145 lr:0.0000100 epoch_Time:23530.0min: [2024-01-06 16:17:39,586][model8_pretrain.py][INFO] Epoch:[0/2](892000/4588595) loss:2.420 lr:0.0000100 epoch_Time:23530.0min: [2024-01-06 16:17:39,586][model8_pretrain.py][INFO] Epoch:[0/2](892000/4588595) loss:2.452 lr:0.0000100 epoch_Time:23530.0min: [2024-01-06 16:17:39,586][model8_pretrain.py][INFO] Epoch:[0/2](892000/4588595) loss:2.881 lr:0.0000100 epoch_Time:23530.0min: [2024-01-06 16:17:39,586][model8_pretrain.py][INFO] Epoch:[0/2](892000/4588595) loss:3.153 lr:0.0000100 
epoch_Time:23530.0min: [2024-01-06 16:17:39,586][model8_pretrain.py][INFO] Epoch:[0/2](892000/4588595) loss:2.606 lr:0.0000100 epoch_Time:23530.0min: [2024-01-06 16:17:39,586][model8_pretrain.py][INFO] Epoch:[0/2](892000/4588595) loss:2.877 lr:0.0000100 epoch_Time:23530.0min: [2024-01-06 16:17:39,586][model8_pretrain.py][INFO] Epoch:[0/2](892000/4588595) loss:3.068 lr:0.0000100 epoch_Time:23530.0min: [2024-01-06 16:18:18,182][model8_pretrain.py][INFO] Epoch:[0/2](892100/4588595) loss:2.503 lr:0.0000100 epoch_Time:23529.0min: [2024-01-06 16:18:18,182][model8_pretrain.py][INFO] Epoch:[0/2](892100/4588595) loss:2.878 lr:0.0000100 epoch_Time:23529.0min: [2024-01-06 16:18:18,182][model8_pretrain.py][INFO] Epoch:[0/2](892100/4588595) loss:2.624 lr:0.0000100 epoch_Time:23529.0min: [2024-01-06 16:18:18,182][model8_pretrain.py][INFO] Epoch:[0/2](892100/4588595) loss:2.482 lr:0.0000100 epoch_Time:23529.0min: [2024-01-06 16:18:18,182][model8_pretrain.py][INFO] Epoch:[0/2](892100/4588595) loss:2.500 lr:0.0000100 epoch_Time:23529.0min: [2024-01-06 16:18:18,182][model8_pretrain.py][INFO] Epoch:[0/2](892100/4588595) loss:2.770 lr:0.0000100 epoch_Time:23529.0min: [2024-01-06 16:18:18,182][model8_pretrain.py][INFO] Epoch:[0/2](892100/4588595) loss:2.799 lr:0.0000100 epoch_Time:23529.0min: [2024-01-06 16:18:18,182][model8_pretrain.py][INFO] Epoch:[0/2](892100/4588595) loss:1.988 lr:0.0000100 epoch_Time:23529.0min: [2024-01-06 16:18:55,130][model8_pretrain.py][INFO] Epoch:[0/2](892200/4588595) loss:2.935 lr:0.0000100 epoch_Time:23528.0min: [2024-01-06 16:18:55,130][model8_pretrain.py][INFO] Epoch:[0/2](892200/4588595) loss:2.832 lr:0.0000100 epoch_Time:23528.0min: [2024-01-06 16:18:55,130][model8_pretrain.py][INFO] Epoch:[0/2](892200/4588595) loss:2.467 lr:0.0000100 epoch_Time:23528.0min: [2024-01-06 16:18:55,130][model8_pretrain.py][INFO] Epoch:[0/2](892200/4588595) loss:2.644 lr:0.0000100 epoch_Time:23528.0min: [2024-01-06 16:18:55,130][model8_pretrain.py][INFO] 
Epoch:[0/2](892200/4588595) loss:2.472 lr:0.0000100 epoch_Time:23528.0min: [2024-01-06 16:18:55,130][model8_pretrain.py][INFO] Epoch:[0/2](892200/4588595) loss:2.691 lr:0.0000100 epoch_Time:23528.0min: [2024-01-06 16:18:55,130][model8_pretrain.py][INFO] Epoch:[0/2](892200/4588595) loss:3.193 lr:0.0000100 epoch_Time:23528.0min: [2024-01-06 16:18:55,130][model8_pretrain.py][INFO] Epoch:[0/2](892200/4588595) loss:3.157 lr:0.0000100 epoch_Time:23528.0min: [2024-01-06 16:19:32,136][model8_pretrain.py][INFO] Epoch:[0/2](892300/4588595) loss:3.038 lr:0.0000100 epoch_Time:23528.0min: [2024-01-06 16:19:32,136][model8_pretrain.py][INFO] Epoch:[0/2](892300/4588595) loss:2.808 lr:0.0000100 epoch_Time:23528.0min: [2024-01-06 16:19:32,136][model8_pretrain.py][INFO] Epoch:[0/2](892300/4588595) loss:2.652 lr:0.0000100 epoch_Time:23528.0min: [2024-01-06 16:19:32,136][model8_pretrain.py][INFO] Epoch:[0/2](892300/4588595) loss:3.001 lr:0.0000100 epoch_Time:23528.0min: [2024-01-06 16:19:32,136][model8_pretrain.py][INFO] Epoch:[0/2](892300/4588595) loss:2.795 lr:0.0000100 epoch_Time:23528.0min: [2024-01-06 16:19:32,136][model8_pretrain.py][INFO] Epoch:[0/2](892300/4588595) loss:2.479 lr:0.0000100 epoch_Time:23528.0min: [2024-01-06 16:19:32,136][model8_pretrain.py][INFO] Epoch:[0/2](892300/4588595) loss:2.935 lr:0.0000100 epoch_Time:23528.0min: [2024-01-06 16:19:32,136][model8_pretrain.py][INFO] Epoch:[0/2](892300/4588595) loss:3.109 lr:0.0000100 epoch_Time:23528.0min: [2024-01-06 16:20:09,090][model8_pretrain.py][INFO] Epoch:[0/2](892400/4588595) loss:2.785 lr:0.0000100 epoch_Time:23527.0min: [2024-01-06 16:20:09,090][model8_pretrain.py][INFO] Epoch:[0/2](892400/4588595) loss:2.687 lr:0.0000100 epoch_Time:23527.0min: [2024-01-06 16:20:09,090][model8_pretrain.py][INFO] Epoch:[0/2](892400/4588595) loss:3.211 lr:0.0000100 epoch_Time:23527.0min: [2024-01-06 16:20:09,090][model8_pretrain.py][INFO] Epoch:[0/2](892400/4588595) loss:2.974 lr:0.0000100 epoch_Time:23527.0min: [2024-01-06 
16:20:09,090][model8_pretrain.py][INFO] Epoch:[0/2](892400/4588595) loss:2.890 lr:0.0000100 epoch_Time:23527.0min: [2024-01-06 16:20:09,090][model8_pretrain.py][INFO] Epoch:[0/2](892400/4588595) loss:2.253 lr:0.0000100 epoch_Time:23527.0min: [2024-01-06 16:20:09,090][model8_pretrain.py][INFO] Epoch:[0/2](892400/4588595) loss:2.798 lr:0.0000100 epoch_Time:23527.0min: [2024-01-06 16:20:09,090][model8_pretrain.py][INFO] Epoch:[0/2](892400/4588595) loss:2.160 lr:0.0000100 epoch_Time:23527.0min: [2024-01-06 16:20:46,036][model8_pretrain.py][INFO] Epoch:[0/2](892500/4588595) loss:2.707 lr:0.0000100 epoch_Time:23527.0min: [2024-01-06 16:20:46,036][model8_pretrain.py][INFO] Epoch:[0/2](892500/4588595) loss:2.998 lr:0.0000100 epoch_Time:23527.0min: [2024-01-06 16:20:46,036][model8_pretrain.py][INFO] Epoch:[0/2](892500/4588595) loss:2.639 lr:0.0000100 epoch_Time:23527.0min: [2024-01-06 16:20:46,036][model8_pretrain.py][INFO] Epoch:[0/2](892500/4588595) loss:3.041 lr:0.0000100 epoch_Time:23527.0min: [2024-01-06 16:20:46,036][model8_pretrain.py][INFO] Epoch:[0/2](892500/4588595) loss:2.981 lr:0.0000100 epoch_Time:23527.0min: [2024-01-06 16:20:46,036][model8_pretrain.py][INFO] Epoch:[0/2](892500/4588595) loss:2.318 lr:0.0000100 epoch_Time:23527.0min: [2024-01-06 16:20:46,036][model8_pretrain.py][INFO] Epoch:[0/2](892500/4588595) loss:2.436 lr:0.0000100 epoch_Time:23527.0min: [2024-01-06 16:20:46,037][model8_pretrain.py][INFO] Epoch:[0/2](892500/4588595) loss:2.698 lr:0.0000100 epoch_Time:23527.0min: [2024-01-06 16:21:22,971][model8_pretrain.py][INFO] Epoch:[0/2](892600/4588595) loss:3.169 lr:0.0000100 epoch_Time:23526.0min: [2024-01-06 16:21:22,971][model8_pretrain.py][INFO] Epoch:[0/2](892600/4588595) loss:2.330 lr:0.0000100 epoch_Time:23526.0min: [2024-01-06 16:21:22,971][model8_pretrain.py][INFO] Epoch:[0/2](892600/4588595) loss:2.582 lr:0.0000100 epoch_Time:23526.0min: [2024-01-06 16:21:22,971][model8_pretrain.py][INFO] Epoch:[0/2](892600/4588595) loss:2.874 lr:0.0000100 
epoch_Time:23526.0min: [2024-01-06 16:21:22,971][model8_pretrain.py][INFO] Epoch:[0/2](892600/4588595) loss:3.125 lr:0.0000100 epoch_Time:23526.0min: [2024-01-06 16:21:22,971][model8_pretrain.py][INFO] Epoch:[0/2](892600/4588595) loss:3.018 lr:0.0000100 epoch_Time:23526.0min: [2024-01-06 16:21:22,971][model8_pretrain.py][INFO] Epoch:[0/2](892600/4588595) loss:3.105 lr:0.0000100 epoch_Time:23526.0min: [2024-01-06 16:21:22,971][model8_pretrain.py][INFO] Epoch:[0/2](892600/4588595) loss:3.068 lr:0.0000100 epoch_Time:23526.0min: [2024-01-06 16:21:59,901][model8_pretrain.py][INFO] Epoch:[0/2](892700/4588595) loss:2.797 lr:0.0000100 epoch_Time:23525.0min: [2024-01-06 16:21:59,901][model8_pretrain.py][INFO] Epoch:[0/2](892700/4588595) loss:2.459 lr:0.0000100 epoch_Time:23525.0min: [2024-01-06 16:21:59,901][model8_pretrain.py][INFO] Epoch:[0/2](892700/4588595) loss:2.632 lr:0.0000100 epoch_Time:23525.0min: [2024-01-06 16:21:59,901][model8_pretrain.py][INFO] Epoch:[0/2](892700/4588595) loss:3.047 lr:0.0000100 epoch_Time:23525.0min: [2024-01-06 16:21:59,901][model8_pretrain.py][INFO] Epoch:[0/2](892700/4588595) loss:2.688 lr:0.0000100 epoch_Time:23525.0min: [2024-01-06 16:21:59,901][model8_pretrain.py][INFO] Epoch:[0/2](892700/4588595) loss:2.535 lr:0.0000100 epoch_Time:23525.0min: [2024-01-06 16:21:59,901][model8_pretrain.py][INFO] Epoch:[0/2](892700/4588595) loss:3.408 lr:0.0000100 epoch_Time:23525.0min: [2024-01-06 16:21:59,901][model8_pretrain.py][INFO] Epoch:[0/2](892700/4588595) loss:2.738 lr:0.0000100 epoch_Time:23525.0min: [2024-01-06 16:22:46,703][model8_pretrain.py][INFO] Epoch:[0/2](892800/4588595) loss:3.110 lr:0.0000100 epoch_Time:23525.0min: [2024-01-06 16:22:46,703][model8_pretrain.py][INFO] Epoch:[0/2](892800/4588595) loss:2.866 lr:0.0000100 epoch_Time:23525.0min: [2024-01-06 16:22:46,703][model8_pretrain.py][INFO] Epoch:[0/2](892800/4588595) loss:2.969 lr:0.0000100 epoch_Time:23525.0min: [2024-01-06 16:22:46,703][model8_pretrain.py][INFO] 
Epoch:[0/2](892800/4588595) loss:3.111 lr:0.0000100 epoch_Time:23525.0min: [2024-01-06 16:22:46,703][model8_pretrain.py][INFO] Epoch:[0/2](892800/4588595) loss:2.104 lr:0.0000100 epoch_Time:23525.0min: [2024-01-06 16:22:46,703][model8_pretrain.py][INFO] Epoch:[0/2](892800/4588595) loss:3.373 lr:0.0000100 epoch_Time:23525.0min: [2024-01-06 16:22:46,703][model8_pretrain.py][INFO] Epoch:[0/2](892800/4588595) loss:2.974 lr:0.0000100 epoch_Time:23525.0min: [2024-01-06 16:22:46,703][model8_pretrain.py][INFO] Epoch:[0/2](892800/4588595) loss:2.897 lr:0.0000100 epoch_Time:23525.0min: [2024-01-06 16:23:25,307][model8_pretrain.py][INFO] Epoch:[0/2](892900/4588595) loss:2.681 lr:0.0000100 epoch_Time:23525.0min: [2024-01-06 16:23:25,307][model8_pretrain.py][INFO] Epoch:[0/2](892900/4588595) loss:2.756 lr:0.0000100 epoch_Time:23525.0min: [2024-01-06 16:23:25,307][model8_pretrain.py][INFO] Epoch:[0/2](892900/4588595) loss:3.279 lr:0.0000100 epoch_Time:23525.0min: [2024-01-06 16:23:25,307][model8_pretrain.py][INFO] Epoch:[0/2](892900/4588595) loss:2.512 lr:0.0000100 epoch_Time:23525.0min: [2024-01-06 16:23:25,307][model8_pretrain.py][INFO] Epoch:[0/2](892900/4588595) loss:2.502 lr:0.0000100 epoch_Time:23525.0min: [2024-01-06 16:23:25,307][model8_pretrain.py][INFO] Epoch:[0/2](892900/4588595) loss:2.364 lr:0.0000100 epoch_Time:23525.0min: [2024-01-06 16:23:25,307][model8_pretrain.py][INFO] Epoch:[0/2](892900/4588595) loss:2.626 lr:0.0000100 epoch_Time:23525.0min: [2024-01-06 16:23:25,308][model8_pretrain.py][INFO] Epoch:[0/2](892900/4588595) loss:2.458 lr:0.0000100 epoch_Time:23525.0min: [2024-01-06 16:24:02,246][model8_pretrain.py][INFO] Epoch:[0/2](893000/4588595) loss:2.849 lr:0.0000100 epoch_Time:23523.0min: [2024-01-06 16:24:02,246][model8_pretrain.py][INFO] Epoch:[0/2](893000/4588595) loss:1.988 lr:0.0000100 epoch_Time:23523.0min: [2024-01-06 16:24:02,246][model8_pretrain.py][INFO] Epoch:[0/2](893000/4588595) loss:2.743 lr:0.0000100 epoch_Time:23523.0min: [2024-01-06 
16:24:02,246][model8_pretrain.py][INFO] Epoch:[0/2](893000/4588595) loss:2.353 lr:0.0000100 epoch_Time:23523.0min: [2024-01-06 16:24:02,246][model8_pretrain.py][INFO] Epoch:[0/2](893000/4588595) loss:2.447 lr:0.0000100 epoch_Time:23523.0min: [2024-01-06 16:24:02,246][model8_pretrain.py][INFO] Epoch:[0/2](893000/4588595) loss:2.551 lr:0.0000100 epoch_Time:23523.0min: [2024-01-06 16:24:02,246][model8_pretrain.py][INFO] Epoch:[0/2](893000/4588595) loss:2.991 lr:0.0000100 epoch_Time:23523.0min: [2024-01-06 16:24:02,246][model8_pretrain.py][INFO] Epoch:[0/2](893000/4588595) loss:3.012 lr:0.0000100 epoch_Time:23523.0min: [2024-01-06 16:24:39,190][model8_pretrain.py][INFO] Epoch:[0/2](893100/4588595) loss:2.833 lr:0.0000100 epoch_Time:23523.0min: [2024-01-06 16:24:39,190][model8_pretrain.py][INFO] Epoch:[0/2](893100/4588595) loss:2.966 lr:0.0000100 epoch_Time:23523.0min: [2024-01-06 16:24:39,190][model8_pretrain.py][INFO] Epoch:[0/2](893100/4588595) loss:3.229 lr:0.0000100 epoch_Time:23523.0min: [2024-01-06 16:24:39,190][model8_pretrain.py][INFO] Epoch:[0/2](893100/4588595) loss:2.692 lr:0.0000100 epoch_Time:23523.0min: [2024-01-06 16:24:39,190][model8_pretrain.py][INFO] Epoch:[0/2](893100/4588595) loss:3.094 lr:0.0000100 epoch_Time:23523.0min: [2024-01-06 16:24:39,190][model8_pretrain.py][INFO] Epoch:[0/2](893100/4588595) loss:2.852 lr:0.0000100 epoch_Time:23523.0min: [2024-01-06 16:24:39,190][model8_pretrain.py][INFO] Epoch:[0/2](893100/4588595) loss:3.093 lr:0.0000100 epoch_Time:23523.0min: [2024-01-06 16:24:39,190][model8_pretrain.py][INFO] Epoch:[0/2](893100/4588595) loss:3.258 lr:0.0000100 epoch_Time:23523.0min: [2024-01-06 16:25:16,130][model8_pretrain.py][INFO] Epoch:[0/2](893200/4588595) loss:3.141 lr:0.0000100 epoch_Time:23522.0min: [2024-01-06 16:25:16,130][model8_pretrain.py][INFO] Epoch:[0/2](893200/4588595) loss:2.431 lr:0.0000100 epoch_Time:23522.0min: [2024-01-06 16:25:16,130][model8_pretrain.py][INFO] Epoch:[0/2](893200/4588595) loss:2.487 lr:0.0000100 
epoch_Time:23522.0min: [2024-01-06 16:25:16,130][model8_pretrain.py][INFO] Epoch:[0/2](893200/4588595) loss:2.764 lr:0.0000100 epoch_Time:23522.0min: [2024-01-06 16:25:16,130][model8_pretrain.py][INFO] Epoch:[0/2](893200/4588595) loss:2.910 lr:0.0000100 epoch_Time:23522.0min: [2024-01-06 16:25:16,130][model8_pretrain.py][INFO] Epoch:[0/2](893200/4588595) loss:2.678 lr:0.0000100 epoch_Time:23522.0min: [2024-01-06 16:25:16,130][model8_pretrain.py][INFO] Epoch:[0/2](893200/4588595) loss:2.463 lr:0.0000100 epoch_Time:23522.0min: [2024-01-06 16:25:16,130][model8_pretrain.py][INFO] Epoch:[0/2](893200/4588595) loss:2.505 lr:0.0000100 epoch_Time:23522.0min: [2024-01-06 16:25:53,037][model8_pretrain.py][INFO] Epoch:[0/2](893300/4588595) loss:2.363 lr:0.0000100 epoch_Time:23521.0min: [2024-01-06 16:25:53,037][model8_pretrain.py][INFO] Epoch:[0/2](893300/4588595) loss:2.814 lr:0.0000100 epoch_Time:23521.0min: [2024-01-06 16:25:53,037][model8_pretrain.py][INFO] Epoch:[0/2](893300/4588595) loss:2.829 lr:0.0000100 epoch_Time:23521.0min: [2024-01-06 16:25:53,037][model8_pretrain.py][INFO] Epoch:[0/2](893300/4588595) loss:2.943 lr:0.0000100 epoch_Time:23521.0min: [2024-01-06 16:25:53,037][model8_pretrain.py][INFO] Epoch:[0/2](893300/4588595) loss:2.488 lr:0.0000100 epoch_Time:23521.0min: [2024-01-06 16:25:53,037][model8_pretrain.py][INFO] Epoch:[0/2](893300/4588595) loss:3.135 lr:0.0000100 epoch_Time:23521.0min: [2024-01-06 16:25:53,037][model8_pretrain.py][INFO] Epoch:[0/2](893300/4588595) loss:3.201 lr:0.0000100 epoch_Time:23521.0min: [2024-01-06 16:25:53,037][model8_pretrain.py][INFO] Epoch:[0/2](893300/4588595) loss:2.585 lr:0.0000100 epoch_Time:23521.0min: [2024-01-06 16:26:29,975][model8_pretrain.py][INFO] Epoch:[0/2](893400/4588595) loss:3.001 lr:0.0000100 epoch_Time:23521.0min: [2024-01-06 16:26:29,975][model8_pretrain.py][INFO] Epoch:[0/2](893400/4588595) loss:2.690 lr:0.0000100 epoch_Time:23521.0min: [2024-01-06 16:26:29,975][model8_pretrain.py][INFO] 
Epoch:[0/2](893400/4588595) loss:2.939 lr:0.0000100 epoch_Time:23521.0min: [2024-01-06 16:26:29,975][model8_pretrain.py][INFO] Epoch:[0/2](893400/4588595) loss:3.230 lr:0.0000100 epoch_Time:23521.0min: [2024-01-06 16:26:29,975][model8_pretrain.py][INFO] Epoch:[0/2](893400/4588595) loss:3.209 lr:0.0000100 epoch_Time:23521.0min: [2024-01-06 16:26:29,975][model8_pretrain.py][INFO] Epoch:[0/2](893400/4588595) loss:3.238 lr:0.0000100 epoch_Time:23521.0min: [2024-01-06 16:26:29,975][model8_pretrain.py][INFO] Epoch:[0/2](893400/4588595) loss:3.345 lr:0.0000100 epoch_Time:23521.0min: [2024-01-06 16:26:29,976][model8_pretrain.py][INFO] Epoch:[0/2](893400/4588595) loss:2.531 lr:0.0000100 epoch_Time:23521.0min: [2024-01-06 16:27:06,905][model8_pretrain.py][INFO] Epoch:[0/2](893500/4588595) loss:2.141 lr:0.0000100 epoch_Time:23520.0min: [2024-01-06 16:27:06,905][model8_pretrain.py][INFO] Epoch:[0/2](893500/4588595) loss:2.918 lr:0.0000100 epoch_Time:23520.0min: [2024-01-06 16:27:06,905][model8_pretrain.py][INFO] Epoch:[0/2](893500/4588595) loss:3.137 lr:0.0000100 epoch_Time:23520.0min: [2024-01-06 16:27:06,905][model8_pretrain.py][INFO] Epoch:[0/2](893500/4588595) loss:2.959 lr:0.0000100 epoch_Time:23520.0min: [2024-01-06 16:27:06,905][model8_pretrain.py][INFO] Epoch:[0/2](893500/4588595) loss:2.480 lr:0.0000100 epoch_Time:23520.0min: [2024-01-06 16:27:06,905][model8_pretrain.py][INFO] Epoch:[0/2](893500/4588595) loss:2.832 lr:0.0000100 epoch_Time:23520.0min: [2024-01-06 16:27:06,905][model8_pretrain.py][INFO] Epoch:[0/2](893500/4588595) loss:3.400 lr:0.0000100 epoch_Time:23520.0min: [2024-01-06 16:27:06,905][model8_pretrain.py][INFO] Epoch:[0/2](893500/4588595) loss:2.917 lr:0.0000100 epoch_Time:23520.0min: [2024-01-06 16:27:53,878][model8_pretrain.py][INFO] Epoch:[0/2](893600/4588595) loss:2.889 lr:0.0000100 epoch_Time:23520.0min: [2024-01-06 16:27:53,878][model8_pretrain.py][INFO] Epoch:[0/2](893600/4588595) loss:2.847 lr:0.0000100 epoch_Time:23520.0min: [2024-01-06 
16:27:53,878][model8_pretrain.py][INFO] Epoch:[0/2](893600/4588595) loss:3.068 lr:0.0000100 epoch_Time:23520.0min: [2024-01-06 16:27:53,878][model8_pretrain.py][INFO] Epoch:[0/2](893600/4588595) loss:3.178 lr:0.0000100 epoch_Time:23520.0min: [2024-01-06 16:27:53,878][model8_pretrain.py][INFO] Epoch:[0/2](893600/4588595) loss:3.040 lr:0.0000100 epoch_Time:23520.0min: [2024-01-06 16:27:53,879][model8_pretrain.py][INFO] Epoch:[0/2](893600/4588595) loss:2.885 lr:0.0000100 epoch_Time:23520.0min: [2024-01-06 16:27:53,879][model8_pretrain.py][INFO] Epoch:[0/2](893600/4588595) loss:3.304 lr:0.0000100 epoch_Time:23520.0min: [2024-01-06 16:27:53,879][model8_pretrain.py][INFO] Epoch:[0/2](893600/4588595) loss:2.598 lr:0.0000100 epoch_Time:23520.0min: [2024-01-06 16:28:32,529][model8_pretrain.py][INFO] Epoch:[0/2](893700/4588595) loss:2.498 lr:0.0000100 epoch_Time:23520.0min: [2024-01-06 16:28:32,529][model8_pretrain.py][INFO] Epoch:[0/2](893700/4588595) loss:2.685 lr:0.0000100 epoch_Time:23520.0min: [2024-01-06 16:28:32,529][model8_pretrain.py][INFO] Epoch:[0/2](893700/4588595) loss:2.549 lr:0.0000100 epoch_Time:23520.0min: [2024-01-06 16:28:32,529][model8_pretrain.py][INFO] Epoch:[0/2](893700/4588595) loss:2.800 lr:0.0000100 epoch_Time:23520.0min: [2024-01-06 16:28:32,529][model8_pretrain.py][INFO] Epoch:[0/2](893700/4588595) loss:2.755 lr:0.0000100 epoch_Time:23520.0min: [2024-01-06 16:28:32,529][model8_pretrain.py][INFO] Epoch:[0/2](893700/4588595) loss:3.306 lr:0.0000100 epoch_Time:23520.0min: [2024-01-06 16:28:32,529][model8_pretrain.py][INFO] Epoch:[0/2](893700/4588595) loss:2.879 lr:0.0000100 epoch_Time:23520.0min: [2024-01-06 16:28:32,530][model8_pretrain.py][INFO] Epoch:[0/2](893700/4588595) loss:2.787 lr:0.0000100 epoch_Time:23520.0min: [2024-01-06 16:29:09,471][model8_pretrain.py][INFO] Epoch:[0/2](893800/4588595) loss:3.213 lr:0.0000100 epoch_Time:23519.0min: [2024-01-06 16:29:09,472][model8_pretrain.py][INFO] Epoch:[0/2](893800/4588595) loss:2.697 lr:0.0000100 
epoch_Time:23519.0min: [2024-01-06 16:29:09,472][model8_pretrain.py][INFO] Epoch:[0/2](893800/4588595) loss:2.415 lr:0.0000100 epoch_Time:23519.0min: [2024-01-06 16:29:09,472][model8_pretrain.py][INFO] Epoch:[0/2](893800/4588595) loss:3.047 lr:0.0000100 epoch_Time:23519.0min: [2024-01-06 16:29:09,472][model8_pretrain.py][INFO] Epoch:[0/2](893800/4588595) loss:2.565 lr:0.0000100 epoch_Time:23519.0min: [2024-01-06 16:29:09,472][model8_pretrain.py][INFO] Epoch:[0/2](893800/4588595) loss:2.722 lr:0.0000100 epoch_Time:23519.0min: [2024-01-06 16:29:09,472][model8_pretrain.py][INFO] Epoch:[0/2](893800/4588595) loss:2.583 lr:0.0000100 epoch_Time:23519.0min: [2024-01-06 16:29:09,472][model8_pretrain.py][INFO] Epoch:[0/2](893800/4588595) loss:3.120 lr:0.0000100 epoch_Time:23519.0min: [2024-01-06 16:29:46,417][model8_pretrain.py][INFO] Epoch:[0/2](893900/4588595) loss:2.312 lr:0.0000100 epoch_Time:23518.0min: [2024-01-06 16:29:46,417][model8_pretrain.py][INFO] Epoch:[0/2](893900/4588595) loss:2.803 lr:0.0000100 epoch_Time:23518.0min: [2024-01-06 16:29:46,417][model8_pretrain.py][INFO] Epoch:[0/2](893900/4588595) loss:1.949 lr:0.0000100 epoch_Time:23518.0min: [2024-01-06 16:29:46,417][model8_pretrain.py][INFO] Epoch:[0/2](893900/4588595) loss:2.684 lr:0.0000100 epoch_Time:23518.0min: [2024-01-06 16:29:46,417][model8_pretrain.py][INFO] Epoch:[0/2](893900/4588595) loss:2.956 lr:0.0000100 epoch_Time:23518.0min: [2024-01-06 16:29:46,417][model8_pretrain.py][INFO] Epoch:[0/2](893900/4588595) loss:3.525 lr:0.0000100 epoch_Time:23518.0min: [2024-01-06 16:29:46,417][model8_pretrain.py][INFO] Epoch:[0/2](893900/4588595) loss:3.063 lr:0.0000100 epoch_Time:23518.0min: [2024-01-06 16:29:46,417][model8_pretrain.py][INFO] Epoch:[0/2](893900/4588595) loss:2.696 lr:0.0000100 epoch_Time:23518.0min: [2024-01-06 16:30:23,370][model8_pretrain.py][INFO] Epoch:[0/2](894000/4588595) loss:2.955 lr:0.0000100 epoch_Time:23517.0min: [2024-01-06 16:30:23,370][model8_pretrain.py][INFO] 
Epoch:[0/2](894000/4588595) loss:2.684 lr:0.0000100 epoch_Time:23517.0min: [2024-01-06 16:30:23,370][model8_pretrain.py][INFO] Epoch:[0/2](894000/4588595) loss:3.232 lr:0.0000100 epoch_Time:23517.0min: [2024-01-06 16:30:23,370][model8_pretrain.py][INFO] Epoch:[0/2](894000/4588595) loss:2.552 lr:0.0000100 epoch_Time:23517.0min: [2024-01-06 16:30:23,370][model8_pretrain.py][INFO] Epoch:[0/2](894000/4588595) loss:3.178 lr:0.0000100 epoch_Time:23517.0min: [2024-01-06 16:30:23,370][model8_pretrain.py][INFO] Epoch:[0/2](894000/4588595) loss:3.188 lr:0.0000100 epoch_Time:23517.0min: [2024-01-06 16:30:23,370][model8_pretrain.py][INFO] Epoch:[0/2](894000/4588595) loss:3.006 lr:0.0000100 epoch_Time:23517.0min: [2024-01-06 16:30:23,370][model8_pretrain.py][INFO] Epoch:[0/2](894000/4588595) loss:2.255 lr:0.0000100 epoch_Time:23517.0min: [2024-01-06 16:31:00,313][model8_pretrain.py][INFO] Epoch:[0/2](894100/4588595) loss:1.767 lr:0.0000100 epoch_Time:23516.0min: [2024-01-06 16:31:00,313][model8_pretrain.py][INFO] Epoch:[0/2](894100/4588595) loss:3.059 lr:0.0000100 epoch_Time:23516.0min: [2024-01-06 16:31:00,313][model8_pretrain.py][INFO] Epoch:[0/2](894100/4588595) loss:2.312 lr:0.0000100 epoch_Time:23516.0min: [2024-01-06 16:31:00,313][model8_pretrain.py][INFO] Epoch:[0/2](894100/4588595) loss:2.930 lr:0.0000100 epoch_Time:23516.0min: [2024-01-06 16:31:00,313][model8_pretrain.py][INFO] Epoch:[0/2](894100/4588595) loss:2.837 lr:0.0000100 epoch_Time:23516.0min: [2024-01-06 16:31:00,313][model8_pretrain.py][INFO] Epoch:[0/2](894100/4588595) loss:2.678 lr:0.0000100 epoch_Time:23516.0min: [2024-01-06 16:31:00,313][model8_pretrain.py][INFO] Epoch:[0/2](894100/4588595) loss:3.349 lr:0.0000100 epoch_Time:23516.0min: [2024-01-06 16:31:00,314][model8_pretrain.py][INFO] Epoch:[0/2](894100/4588595) loss:2.486 lr:0.0000100 epoch_Time:23516.0min: [2024-01-06 16:31:37,243][model8_pretrain.py][INFO] Epoch:[0/2](894200/4588595) loss:2.592 lr:0.0000100 epoch_Time:23516.0min: [2024-01-06 
16:31:37,243][model8_pretrain.py][INFO] Epoch:[0/2](894200/4588595) loss:2.903 lr:0.0000100 epoch_Time:23516.0min: [2024-01-06 16:31:37,243][model8_pretrain.py][INFO] Epoch:[0/2](894200/4588595) loss:2.444 lr:0.0000100 epoch_Time:23516.0min: [2024-01-06 16:31:37,243][model8_pretrain.py][INFO] Epoch:[0/2](894200/4588595) loss:2.616 lr:0.0000100 epoch_Time:23516.0min: [2024-01-06 16:31:37,243][model8_pretrain.py][INFO] Epoch:[0/2](894200/4588595) loss:2.891 lr:0.0000100 epoch_Time:23516.0min: [2024-01-06 16:31:37,243][model8_pretrain.py][INFO] Epoch:[0/2](894200/4588595) loss:2.879 lr:0.0000100 epoch_Time:23516.0min: [2024-01-06 16:31:37,243][model8_pretrain.py][INFO] Epoch:[0/2](894200/4588595) loss:2.211 lr:0.0000100 epoch_Time:23516.0min: [2024-01-06 16:31:37,243][model8_pretrain.py][INFO] Epoch:[0/2](894200/4588595) loss:3.129 lr:0.0000100 epoch_Time:23516.0min: [2024-01-06 16:32:14,173][model8_pretrain.py][INFO] Epoch:[0/2](894300/4588595) loss:3.235 lr:0.0000100 epoch_Time:23515.0min: [2024-01-06 16:32:14,173][model8_pretrain.py][INFO] Epoch:[0/2](894300/4588595) loss:2.254 lr:0.0000100 epoch_Time:23515.0min: [2024-01-06 16:32:14,173][model8_pretrain.py][INFO] Epoch:[0/2](894300/4588595) loss:2.862 lr:0.0000100 epoch_Time:23515.0min: [2024-01-06 16:32:14,173][model8_pretrain.py][INFO] Epoch:[0/2](894300/4588595) loss:2.397 lr:0.0000100 epoch_Time:23515.0min: [2024-01-06 16:32:14,173][model8_pretrain.py][INFO] Epoch:[0/2](894300/4588595) loss:2.758 lr:0.0000100 epoch_Time:23515.0min: [2024-01-06 16:32:14,173][model8_pretrain.py][INFO] Epoch:[0/2](894300/4588595) loss:2.920 lr:0.0000100 epoch_Time:23515.0min: [2024-01-06 16:32:14,173][model8_pretrain.py][INFO] Epoch:[0/2](894300/4588595) loss:3.327 lr:0.0000100 epoch_Time:23515.0min: [2024-01-06 16:32:14,173][model8_pretrain.py][INFO] Epoch:[0/2](894300/4588595) loss:2.394 lr:0.0000100 epoch_Time:23515.0min: [2024-01-06 16:32:59,248][model8_pretrain.py][INFO] Epoch:[0/2](894400/4588595) loss:2.230 lr:0.0000100 
epoch_Time:23515.0min: [2024-01-06 16:32:59,248][model8_pretrain.py][INFO] Epoch:[0/2](894400/4588595) loss:3.293 lr:0.0000100 epoch_Time:23515.0min: [2024-01-06 16:32:59,248][model8_pretrain.py][INFO] Epoch:[0/2](894400/4588595) loss:2.449 lr:0.0000100 epoch_Time:23515.0min: [2024-01-06 16:32:59,248][model8_pretrain.py][INFO] Epoch:[0/2](894400/4588595) loss:2.699 lr:0.0000100 epoch_Time:23515.0min: [2024-01-06 16:32:59,248][model8_pretrain.py][INFO] Epoch:[0/2](894400/4588595) loss:2.898 lr:0.0000100 epoch_Time:23515.0min: [2024-01-06 16:32:59,248][model8_pretrain.py][INFO] Epoch:[0/2](894400/4588595) loss:2.824 lr:0.0000100 epoch_Time:23515.0min: [2024-01-06 16:32:59,248][model8_pretrain.py][INFO] Epoch:[0/2](894400/4588595) loss:3.095 lr:0.0000100 epoch_Time:23515.0min: [2024-01-06 16:32:59,248][model8_pretrain.py][INFO] Epoch:[0/2](894400/4588595) loss:2.952 lr:0.0000100 epoch_Time:23515.0min: [2024-01-06 16:33:39,660][model8_pretrain.py][INFO] Epoch:[0/2](894500/4588595) loss:2.628 lr:0.0000100 epoch_Time:23515.0min: [2024-01-06 16:33:39,660][model8_pretrain.py][INFO] Epoch:[0/2](894500/4588595) loss:3.270 lr:0.0000100 epoch_Time:23515.0min: [2024-01-06 16:33:39,660][model8_pretrain.py][INFO] Epoch:[0/2](894500/4588595) loss:2.494 lr:0.0000100 epoch_Time:23515.0min: [2024-01-06 16:33:39,660][model8_pretrain.py][INFO] Epoch:[0/2](894500/4588595) loss:2.646 lr:0.0000100 epoch_Time:23515.0min: [2024-01-06 16:33:39,660][model8_pretrain.py][INFO] Epoch:[0/2](894500/4588595) loss:2.176 lr:0.0000100 epoch_Time:23515.0min: [2024-01-06 16:33:39,660][model8_pretrain.py][INFO] Epoch:[0/2](894500/4588595) loss:3.171 lr:0.0000100 epoch_Time:23515.0min: [2024-01-06 16:33:39,660][model8_pretrain.py][INFO] Epoch:[0/2](894500/4588595) loss:2.230 lr:0.0000100 epoch_Time:23515.0min: [2024-01-06 16:33:39,660][model8_pretrain.py][INFO] Epoch:[0/2](894500/4588595) loss:2.371 lr:0.0000100 epoch_Time:23515.0min: [2024-01-06 16:34:16,617][model8_pretrain.py][INFO] 
Epoch:[0/2](894600/4588595) loss:2.884 lr:0.0000100 epoch_Time:23514.0min: [2024-01-06 16:34:16,617][model8_pretrain.py][INFO] Epoch:[0/2](894600/4588595) loss:2.937 lr:0.0000100 epoch_Time:23514.0min: [2024-01-06 16:34:16,617][model8_pretrain.py][INFO] Epoch:[0/2](894600/4588595) loss:3.031 lr:0.0000100 epoch_Time:23514.0min: [2024-01-06 16:34:16,617][model8_pretrain.py][INFO] Epoch:[0/2](894600/4588595) loss:3.142 lr:0.0000100 epoch_Time:23514.0min: [2024-01-06 16:34:16,617][model8_pretrain.py][INFO] Epoch:[0/2](894600/4588595) loss:2.978 lr:0.0000100 epoch_Time:23514.0min: [2024-01-06 16:34:16,617][model8_pretrain.py][INFO] Epoch:[0/2](894600/4588595) loss:2.417 lr:0.0000100 epoch_Time:23514.0min: [2024-01-06 16:34:16,617][model8_pretrain.py][INFO] Epoch:[0/2](894600/4588595) loss:2.693 lr:0.0000100 epoch_Time:23514.0min: [2024-01-06 16:34:16,617][model8_pretrain.py][INFO] Epoch:[0/2](894600/4588595) loss:2.528 lr:0.0000100 epoch_Time:23514.0min: [2024-01-06 16:34:53,552][model8_pretrain.py][INFO] Epoch:[0/2](894700/4588595) loss:2.969 lr:0.0000100 epoch_Time:23513.0min: [2024-01-06 16:34:53,552][model8_pretrain.py][INFO] Epoch:[0/2](894700/4588595) loss:2.729 lr:0.0000100 epoch_Time:23513.0min: [2024-01-06 16:34:53,553][model8_pretrain.py][INFO] Epoch:[0/2](894700/4588595) loss:3.131 lr:0.0000100 epoch_Time:23513.0min: [2024-01-06 16:34:53,553][model8_pretrain.py][INFO] Epoch:[0/2](894700/4588595) loss:2.873 lr:0.0000100 epoch_Time:23513.0min: [2024-01-06 16:34:53,553][model8_pretrain.py][INFO] Epoch:[0/2](894700/4588595) loss:3.052 lr:0.0000100 epoch_Time:23513.0min: [2024-01-06 16:34:53,553][model8_pretrain.py][INFO] Epoch:[0/2](894700/4588595) loss:2.667 lr:0.0000100 epoch_Time:23513.0min: [2024-01-06 16:34:53,553][model8_pretrain.py][INFO] Epoch:[0/2](894700/4588595) loss:2.754 lr:0.0000100 epoch_Time:23513.0min: [2024-01-06 16:34:53,553][model8_pretrain.py][INFO] Epoch:[0/2](894700/4588595) loss:3.052 lr:0.0000100 epoch_Time:23513.0min: [2024-01-06 
16:35:30,505][model8_pretrain.py][INFO] Epoch:[0/2](894800/4588595) loss:2.374 lr:0.0000100 epoch_Time:23512.0min: [2024-01-06 16:35:30,505][model8_pretrain.py][INFO] Epoch:[0/2](894800/4588595) loss:2.339 lr:0.0000100 epoch_Time:23512.0min: [2024-01-06 16:35:30,505][model8_pretrain.py][INFO] Epoch:[0/2](894800/4588595) loss:2.942 lr:0.0000100 epoch_Time:23512.0min: [2024-01-06 16:35:30,505][model8_pretrain.py][INFO] Epoch:[0/2](894800/4588595) loss:2.492 lr:0.0000100 epoch_Time:23512.0min: [2024-01-06 16:35:30,505][model8_pretrain.py][INFO] Epoch:[0/2](894800/4588595) loss:2.523 lr:0.0000100 epoch_Time:23512.0min: [2024-01-06 16:35:30,505][model8_pretrain.py][INFO] Epoch:[0/2](894800/4588595) loss:2.484 lr:0.0000100 epoch_Time:23512.0min: [2024-01-06 16:35:30,505][model8_pretrain.py][INFO] Epoch:[0/2](894800/4588595) loss:3.385 lr:0.0000100 epoch_Time:23512.0min: [2024-01-06 16:35:30,505][model8_pretrain.py][INFO] Epoch:[0/2](894800/4588595) loss:3.377 lr:0.0000100 epoch_Time:23512.0min: [2024-01-06 16:36:07,457][model8_pretrain.py][INFO] Epoch:[0/2](894900/4588595) loss:2.602 lr:0.0000100 epoch_Time:23511.0min: [2024-01-06 16:36:07,457][model8_pretrain.py][INFO] Epoch:[0/2](894900/4588595) loss:3.251 lr:0.0000100 epoch_Time:23511.0min: [2024-01-06 16:36:07,457][model8_pretrain.py][INFO] Epoch:[0/2](894900/4588595) loss:3.171 lr:0.0000100 epoch_Time:23511.0min: [2024-01-06 16:36:07,457][model8_pretrain.py][INFO] Epoch:[0/2](894900/4588595) loss:2.803 lr:0.0000100 epoch_Time:23511.0min: [2024-01-06 16:36:07,457][model8_pretrain.py][INFO] Epoch:[0/2](894900/4588595) loss:3.133 lr:0.0000100 epoch_Time:23511.0min: [2024-01-06 16:36:07,457][model8_pretrain.py][INFO] Epoch:[0/2](894900/4588595) loss:2.541 lr:0.0000100 epoch_Time:23511.0min: [2024-01-06 16:36:07,457][model8_pretrain.py][INFO] Epoch:[0/2](894900/4588595) loss:2.431 lr:0.0000100 epoch_Time:23511.0min: [2024-01-06 16:36:07,457][model8_pretrain.py][INFO] Epoch:[0/2](894900/4588595) loss:2.671 lr:0.0000100 
epoch_Time:23511.0min: [2024-01-06 16:36:44,418][model8_pretrain.py][INFO] Epoch:[0/2](895000/4588595) loss:2.671 lr:0.0000100 epoch_Time:23511.0min: [2024-01-06 16:36:44,418][model8_pretrain.py][INFO] Epoch:[0/2](895000/4588595) loss:2.389 lr:0.0000100 epoch_Time:23511.0min: [2024-01-06 16:36:44,418][model8_pretrain.py][INFO] Epoch:[0/2](895000/4588595) loss:2.829 lr:0.0000100 epoch_Time:23511.0min: [2024-01-06 16:36:44,418][model8_pretrain.py][INFO] Epoch:[0/2](895000/4588595) loss:2.939 lr:0.0000100 epoch_Time:23511.0min: [2024-01-06 16:36:44,418][model8_pretrain.py][INFO] Epoch:[0/2](895000/4588595) loss:2.102 lr:0.0000100 epoch_Time:23511.0min: [2024-01-06 16:36:44,418][model8_pretrain.py][INFO] Epoch:[0/2](895000/4588595) loss:3.335 lr:0.0000100 epoch_Time:23511.0min: [2024-01-06 16:36:44,418][model8_pretrain.py][INFO] Epoch:[0/2](895000/4588595) loss:2.760 lr:0.0000100 epoch_Time:23511.0min: [2024-01-06 16:36:44,418][model8_pretrain.py][INFO] Epoch:[0/2](895000/4588595) loss:2.406 lr:0.0000100 epoch_Time:23511.0min: [2024-01-06 16:37:21,365][model8_pretrain.py][INFO] Epoch:[0/2](895100/4588595) loss:2.837 lr:0.0000100 epoch_Time:23510.0min: [2024-01-06 16:37:21,365][model8_pretrain.py][INFO] Epoch:[0/2](895100/4588595) loss:2.603 lr:0.0000100 epoch_Time:23510.0min: [2024-01-06 16:37:21,365][model8_pretrain.py][INFO] Epoch:[0/2](895100/4588595) loss:2.952 lr:0.0000100 epoch_Time:23510.0min: [2024-01-06 16:37:21,365][model8_pretrain.py][INFO] Epoch:[0/2](895100/4588595) loss:2.779 lr:0.0000100 epoch_Time:23510.0min: [2024-01-06 16:37:21,365][model8_pretrain.py][INFO] Epoch:[0/2](895100/4588595) loss:2.700 lr:0.0000100 epoch_Time:23510.0min: [2024-01-06 16:37:21,365][model8_pretrain.py][INFO] Epoch:[0/2](895100/4588595) loss:3.099 lr:0.0000100 epoch_Time:23510.0min: [2024-01-06 16:37:21,365][model8_pretrain.py][INFO] Epoch:[0/2](895100/4588595) loss:3.107 lr:0.0000100 epoch_Time:23510.0min: [2024-01-06 16:37:21,365][model8_pretrain.py][INFO] 
Epoch:[0/2](895100/4588595) loss:3.085 lr:0.0000100 epoch_Time:23510.0min: [2024-01-06 16:38:06,864][model8_pretrain.py][INFO] Epoch:[0/2](895200/4588595) loss:2.627 lr:0.0000100 epoch_Time:23510.0min: [2024-01-06 16:38:06,864][model8_pretrain.py][INFO] Epoch:[0/2](895200/4588595) loss:2.774 lr:0.0000100 epoch_Time:23510.0min: [2024-01-06 16:38:06,864][model8_pretrain.py][INFO] Epoch:[0/2](895200/4588595) loss:3.037 lr:0.0000100 epoch_Time:23510.0min: [2024-01-06 16:38:06,864][model8_pretrain.py][INFO] Epoch:[0/2](895200/4588595) loss:3.223 lr:0.0000100 epoch_Time:23510.0min: [2024-01-06 16:38:06,864][model8_pretrain.py][INFO] Epoch:[0/2](895200/4588595) loss:2.732 lr:0.0000100 epoch_Time:23510.0min: [2024-01-06 16:38:06,864][model8_pretrain.py][INFO] Epoch:[0/2](895200/4588595) loss:2.690 lr:0.0000100 epoch_Time:23510.0min: [2024-01-06 16:38:06,864][model8_pretrain.py][INFO] Epoch:[0/2](895200/4588595) loss:2.939 lr:0.0000100 epoch_Time:23510.0min: [2024-01-06 16:38:06,864][model8_pretrain.py][INFO] Epoch:[0/2](895200/4588595) loss:2.641 lr:0.0000100 epoch_Time:23510.0min: [2024-01-06 16:38:47,252][model8_pretrain.py][INFO] Epoch:[0/2](895300/4588595) loss:3.410 lr:0.0000100 epoch_Time:23510.0min: [2024-01-06 16:38:47,252][model8_pretrain.py][INFO] Epoch:[0/2](895300/4588595) loss:2.437 lr:0.0000100 epoch_Time:23510.0min: [2024-01-06 16:38:47,252][model8_pretrain.py][INFO] Epoch:[0/2](895300/4588595) loss:3.160 lr:0.0000100 epoch_Time:23510.0min: [2024-01-06 16:38:47,252][model8_pretrain.py][INFO] Epoch:[0/2](895300/4588595) loss:2.609 lr:0.0000100 epoch_Time:23510.0min: [2024-01-06 16:38:47,252][model8_pretrain.py][INFO] Epoch:[0/2](895300/4588595) loss:3.297 lr:0.0000100 epoch_Time:23510.0min: [2024-01-06 16:38:47,253][model8_pretrain.py][INFO] Epoch:[0/2](895300/4588595) loss:2.876 lr:0.0000100 epoch_Time:23510.0min: [2024-01-06 16:38:47,253][model8_pretrain.py][INFO] Epoch:[0/2](895300/4588595) loss:3.201 lr:0.0000100 epoch_Time:23510.0min: [2024-01-06 
16:38:47,256][model8_pretrain.py][INFO] Epoch:[0/2](895300/4588595) loss:2.994 lr:0.0000100 epoch_Time:23510.0min: [2024-01-06 16:39:24,182][model8_pretrain.py][INFO] Epoch:[0/2](895400/4588595) loss:3.435 lr:0.0000100 epoch_Time:23509.0min: [2024-01-06 16:39:24,182][model8_pretrain.py][INFO] Epoch:[0/2](895400/4588595) loss:2.734 lr:0.0000100 epoch_Time:23509.0min: [2024-01-06 16:39:24,182][model8_pretrain.py][INFO] Epoch:[0/2](895400/4588595) loss:2.302 lr:0.0000100 epoch_Time:23509.0min: [2024-01-06 16:39:24,182][model8_pretrain.py][INFO] Epoch:[0/2](895400/4588595) loss:2.366 lr:0.0000100 epoch_Time:23509.0min: [2024-01-06 16:39:24,182][model8_pretrain.py][INFO] Epoch:[0/2](895400/4588595) loss:2.643 lr:0.0000100 epoch_Time:23509.0min: [2024-01-06 16:39:24,182][model8_pretrain.py][INFO] Epoch:[0/2](895400/4588595) loss:2.586 lr:0.0000100 epoch_Time:23509.0min: [2024-01-06 16:39:24,183][model8_pretrain.py][INFO] Epoch:[0/2](895400/4588595) loss:3.223 lr:0.0000100 epoch_Time:23509.0min: [2024-01-06 16:39:24,183][model8_pretrain.py][INFO] Epoch:[0/2](895400/4588595) loss:2.979 lr:0.0000100 epoch_Time:23509.0min: [2024-01-06 16:40:01,118][model8_pretrain.py][INFO] Epoch:[0/2](895500/4588595) loss:3.313 lr:0.0000100 epoch_Time:23508.0min: [2024-01-06 16:40:01,118][model8_pretrain.py][INFO] Epoch:[0/2](895500/4588595) loss:3.001 lr:0.0000100 epoch_Time:23508.0min: [2024-01-06 16:40:01,118][model8_pretrain.py][INFO] Epoch:[0/2](895500/4588595) loss:2.614 lr:0.0000100 epoch_Time:23508.0min: [2024-01-06 16:40:01,118][model8_pretrain.py][INFO] Epoch:[0/2](895500/4588595) loss:3.188 lr:0.0000100 epoch_Time:23508.0min: [2024-01-06 16:40:01,118][model8_pretrain.py][INFO] Epoch:[0/2](895500/4588595) loss:2.933 lr:0.0000100 epoch_Time:23508.0min: [2024-01-06 16:40:01,118][model8_pretrain.py][INFO] Epoch:[0/2](895500/4588595) loss:3.194 lr:0.0000100 epoch_Time:23508.0min: [2024-01-06 16:40:01,118][model8_pretrain.py][INFO] Epoch:[0/2](895500/4588595) loss:2.748 lr:0.0000100 
epoch_Time:23508.0min: [2024-01-06 16:40:01,118][model8_pretrain.py][INFO] Epoch:[0/2](895500/4588595) loss:2.868 lr:0.0000100 epoch_Time:23508.0min: [2024-01-06 16:40:38,051][model8_pretrain.py][INFO] Epoch:[0/2](895600/4588595) loss:2.894 lr:0.0000100 epoch_Time:23508.0min: [2024-01-06 16:40:38,051][model8_pretrain.py][INFO] Epoch:[0/2](895600/4588595) loss:2.791 lr:0.0000100 epoch_Time:23508.0min: [2024-01-06 16:40:38,051][model8_pretrain.py][INFO] Epoch:[0/2](895600/4588595) loss:2.566 lr:0.0000100 epoch_Time:23508.0min: [2024-01-06 16:40:38,051][model8_pretrain.py][INFO] Epoch:[0/2](895600/4588595) loss:2.447 lr:0.0000100 epoch_Time:23508.0min: [2024-01-06 16:40:38,051][model8_pretrain.py][INFO] Epoch:[0/2](895600/4588595) loss:2.512 lr:0.0000100 epoch_Time:23508.0min: [2024-01-06 16:40:38,051][model8_pretrain.py][INFO] Epoch:[0/2](895600/4588595) loss:2.859 lr:0.0000100 epoch_Time:23508.0min: [2024-01-06 16:40:38,051][model8_pretrain.py][INFO] Epoch:[0/2](895600/4588595) loss:2.844 lr:0.0000100 epoch_Time:23508.0min: [2024-01-06 16:40:38,051][model8_pretrain.py][INFO] Epoch:[0/2](895600/4588595) loss:2.839 lr:0.0000100 epoch_Time:23508.0min: [2024-01-06 16:41:14,981][model8_pretrain.py][INFO] Epoch:[0/2](895700/4588595) loss:2.766 lr:0.0000100 epoch_Time:23507.0min: [2024-01-06 16:41:14,981][model8_pretrain.py][INFO] Epoch:[0/2](895700/4588595) loss:2.898 lr:0.0000100 epoch_Time:23507.0min: [2024-01-06 16:41:14,981][model8_pretrain.py][INFO] Epoch:[0/2](895700/4588595) loss:2.838 lr:0.0000100 epoch_Time:23507.0min: [2024-01-06 16:41:14,981][model8_pretrain.py][INFO] Epoch:[0/2](895700/4588595) loss:3.199 lr:0.0000100 epoch_Time:23507.0min: [2024-01-06 16:41:14,981][model8_pretrain.py][INFO] Epoch:[0/2](895700/4588595) loss:2.851 lr:0.0000100 epoch_Time:23507.0min: [2024-01-06 16:41:14,981][model8_pretrain.py][INFO] Epoch:[0/2](895700/4588595) loss:3.150 lr:0.0000100 epoch_Time:23507.0min: [2024-01-06 16:41:14,981][model8_pretrain.py][INFO] 
Epoch:[0/2](895700/4588595) loss:2.819 lr:0.0000100 epoch_Time:23507.0min: [2024-01-06 16:41:14,981][model8_pretrain.py][INFO] Epoch:[0/2](895700/4588595) loss:2.826 lr:0.0000100 epoch_Time:23507.0min: [2024-01-06 16:41:51,906][model8_pretrain.py][INFO] Epoch:[0/2](895800/4588595) loss:2.483 lr:0.0000100 epoch_Time:23505.0min: [2024-01-06 16:41:51,906][model8_pretrain.py][INFO] Epoch:[0/2](895800/4588595) loss:2.109 lr:0.0000100 epoch_Time:23505.0min: [2024-01-06 16:41:51,906][model8_pretrain.py][INFO] Epoch:[0/2](895800/4588595) loss:2.403 lr:0.0000100 epoch_Time:23505.0min: [2024-01-06 16:41:51,906][model8_pretrain.py][INFO] Epoch:[0/2](895800/4588595) loss:2.108 lr:0.0000100 epoch_Time:23505.0min: [2024-01-06 16:41:51,906][model8_pretrain.py][INFO] Epoch:[0/2](895800/4588595) loss:2.794 lr:0.0000100 epoch_Time:23505.0min: [2024-01-06 16:41:51,906][model8_pretrain.py][INFO] Epoch:[0/2](895800/4588595) loss:3.397 lr:0.0000100 epoch_Time:23505.0min: [2024-01-06 16:41:51,906][model8_pretrain.py][INFO] Epoch:[0/2](895800/4588595) loss:2.933 lr:0.0000100 epoch_Time:23505.0min: [2024-01-06 16:41:51,906][model8_pretrain.py][INFO] Epoch:[0/2](895800/4588595) loss:2.942 lr:0.0000100 epoch_Time:23505.0min: [2024-01-06 16:42:28,848][model8_pretrain.py][INFO] Epoch:[0/2](895900/4588595) loss:2.741 lr:0.0000100 epoch_Time:23505.0min: [2024-01-06 16:42:28,848][model8_pretrain.py][INFO] Epoch:[0/2](895900/4588595) loss:2.873 lr:0.0000100 epoch_Time:23505.0min: [2024-01-06 16:42:28,848][model8_pretrain.py][INFO] Epoch:[0/2](895900/4588595) loss:2.484 lr:0.0000100 epoch_Time:23505.0min: [2024-01-06 16:42:28,848][model8_pretrain.py][INFO] Epoch:[0/2](895900/4588595) loss:2.640 lr:0.0000100 epoch_Time:23505.0min: [2024-01-06 16:42:28,848][model8_pretrain.py][INFO] Epoch:[0/2](895900/4588595) loss:2.221 lr:0.0000100 epoch_Time:23505.0min: [2024-01-06 16:42:28,848][model8_pretrain.py][INFO] Epoch:[0/2](895900/4588595) loss:3.372 lr:0.0000100 epoch_Time:23505.0min: [2024-01-06 
16:42:28,848][model8_pretrain.py][INFO] Epoch:[0/2](895900/4588595) loss:3.175 lr:0.0000100 epoch_Time:23505.0min: [2024-01-06 16:42:28,848][model8_pretrain.py][INFO] Epoch:[0/2](895900/4588595) loss:2.445 lr:0.0000100 epoch_Time:23505.0min: [2024-01-06 16:43:14,217][model8_pretrain.py][INFO] Epoch:[0/2](896000/4588595) loss:2.738 lr:0.0000100 epoch_Time:23505.0min: [2024-01-06 16:43:14,217][model8_pretrain.py][INFO] Epoch:[0/2](896000/4588595) loss:2.561 lr:0.0000100 epoch_Time:23505.0min: [2024-01-06 16:43:14,217][model8_pretrain.py][INFO] Epoch:[0/2](896000/4588595) loss:2.662 lr:0.0000100 epoch_Time:23505.0min: [2024-01-06 16:43:14,217][model8_pretrain.py][INFO] Epoch:[0/2](896000/4588595) loss:2.935 lr:0.0000100 epoch_Time:23505.0min: [2024-01-06 16:43:14,217][model8_pretrain.py][INFO] Epoch:[0/2](896000/4588595) loss:2.619 lr:0.0000100 epoch_Time:23505.0min: [2024-01-06 16:43:14,217][model8_pretrain.py][INFO] Epoch:[0/2](896000/4588595) loss:2.520 lr:0.0000100 epoch_Time:23505.0min: [2024-01-06 16:43:14,222][model8_pretrain.py][INFO] Epoch:[0/2](896000/4588595) loss:2.396 lr:0.0000100 epoch_Time:23505.0min: [2024-01-06 16:43:14,256][model8_pretrain.py][INFO] Epoch:[0/2](896000/4588595) loss:2.808 lr:0.0000100 epoch_Time:23505.0min: [2024-01-06 16:43:54,675][model8_pretrain.py][INFO] Epoch:[0/2](896100/4588595) loss:3.307 lr:0.0000100 epoch_Time:23504.0min: [2024-01-06 16:43:54,674][model8_pretrain.py][INFO] Epoch:[0/2](896100/4588595) loss:2.787 lr:0.0000100 epoch_Time:23504.0min: [2024-01-06 16:43:54,674][model8_pretrain.py][INFO] Epoch:[0/2](896100/4588595) loss:2.871 lr:0.0000100 epoch_Time:23504.0min: [2024-01-06 16:43:54,675][model8_pretrain.py][INFO] Epoch:[0/2](896100/4588595) loss:2.915 lr:0.0000100 epoch_Time:23504.0min: [2024-01-06 16:43:54,675][model8_pretrain.py][INFO] Epoch:[0/2](896100/4588595) loss:2.766 lr:0.0000100 epoch_Time:23504.0min: [2024-01-06 16:43:54,674][model8_pretrain.py][INFO] Epoch:[0/2](896100/4588595) loss:3.158 lr:0.0000100 
epoch_Time:23504.0min: [2024-01-06 16:43:54,675][model8_pretrain.py][INFO] Epoch:[0/2](896100/4588595) loss:2.560 lr:0.0000100 epoch_Time:23504.0min: [2024-01-06 16:43:54,675][model8_pretrain.py][INFO] Epoch:[0/2](896100/4588595) loss:2.561 lr:0.0000100 epoch_Time:23504.0min: [2024-01-06 16:44:31,613][model8_pretrain.py][INFO] Epoch:[0/2](896200/4588595) loss:2.754 lr:0.0000100 epoch_Time:23504.0min: [2024-01-06 16:44:31,613][model8_pretrain.py][INFO] Epoch:[0/2](896200/4588595) loss:2.981 lr:0.0000100 epoch_Time:23504.0min: [2024-01-06 16:44:31,613][model8_pretrain.py][INFO] Epoch:[0/2](896200/4588595) loss:2.704 lr:0.0000100 epoch_Time:23504.0min: [2024-01-06 16:44:31,613][model8_pretrain.py][INFO] Epoch:[0/2](896200/4588595) loss:2.507 lr:0.0000100 epoch_Time:23504.0min: [2024-01-06 16:44:31,613][model8_pretrain.py][INFO] Epoch:[0/2](896200/4588595) loss:2.425 lr:0.0000100 epoch_Time:23504.0min: [2024-01-06 16:44:31,613][model8_pretrain.py][INFO] Epoch:[0/2](896200/4588595) loss:2.751 lr:0.0000100 epoch_Time:23504.0min: [2024-01-06 16:44:31,613][model8_pretrain.py][INFO] Epoch:[0/2](896200/4588595) loss:2.703 lr:0.0000100 epoch_Time:23504.0min: [2024-01-06 16:44:31,613][model8_pretrain.py][INFO] Epoch:[0/2](896200/4588595) loss:2.615 lr:0.0000100 epoch_Time:23504.0min: [2024-01-06 16:45:08,568][model8_pretrain.py][INFO] Epoch:[0/2](896300/4588595) loss:2.394 lr:0.0000100 epoch_Time:23503.0min: [2024-01-06 16:45:08,568][model8_pretrain.py][INFO] Epoch:[0/2](896300/4588595) loss:2.777 lr:0.0000100 epoch_Time:23503.0min: [2024-01-06 16:45:08,568][model8_pretrain.py][INFO] Epoch:[0/2](896300/4588595) loss:2.417 lr:0.0000100 epoch_Time:23503.0min: [2024-01-06 16:45:08,568][model8_pretrain.py][INFO] Epoch:[0/2](896300/4588595) loss:2.317 lr:0.0000100 epoch_Time:23503.0min: [2024-01-06 16:45:08,568][model8_pretrain.py][INFO] Epoch:[0/2](896300/4588595) loss:2.312 lr:0.0000100 epoch_Time:23503.0min: [2024-01-06 16:45:08,568][model8_pretrain.py][INFO] 
Epoch:[0/2](896300/4588595) loss:3.159 lr:0.0000100 epoch_Time:23503.0min: [2024-01-06 16:45:08,568][model8_pretrain.py][INFO] Epoch:[0/2](896300/4588595) loss:2.644 lr:0.0000100 epoch_Time:23503.0min: [2024-01-06 16:45:08,568][model8_pretrain.py][INFO] Epoch:[0/2](896300/4588595) loss:2.264 lr:0.0000100 epoch_Time:23503.0min: [2024-01-06 16:45:45,517][model8_pretrain.py][INFO] Epoch:[0/2](896400/4588595) loss:2.300 lr:0.0000100 epoch_Time:23503.0min: [2024-01-06 16:45:45,517][model8_pretrain.py][INFO] Epoch:[0/2](896400/4588595) loss:2.334 lr:0.0000100 epoch_Time:23503.0min: [2024-01-06 16:45:45,517][model8_pretrain.py][INFO] Epoch:[0/2](896400/4588595) loss:2.808 lr:0.0000100 epoch_Time:23503.0min: [2024-01-06 16:45:45,517][model8_pretrain.py][INFO] Epoch:[0/2](896400/4588595) loss:2.877 lr:0.0000100 epoch_Time:23503.0min: [2024-01-06 16:45:45,518][model8_pretrain.py][INFO] Epoch:[0/2](896400/4588595) loss:2.946 lr:0.0000100 epoch_Time:23503.0min: [2024-01-06 16:45:45,518][model8_pretrain.py][INFO] Epoch:[0/2](896400/4588595) loss:1.992 lr:0.0000100 epoch_Time:23503.0min: [2024-01-06 16:45:45,518][model8_pretrain.py][INFO] Epoch:[0/2](896400/4588595) loss:3.062 lr:0.0000100 epoch_Time:23503.0min: [2024-01-06 16:45:45,518][model8_pretrain.py][INFO] Epoch:[0/2](896400/4588595) loss:3.431 lr:0.0000100 epoch_Time:23503.0min: [2024-01-06 16:46:22,480][model8_pretrain.py][INFO] Epoch:[0/2](896500/4588595) loss:2.121 lr:0.0000100 epoch_Time:23502.0min: [2024-01-06 16:46:22,480][model8_pretrain.py][INFO] Epoch:[0/2](896500/4588595) loss:3.327 lr:0.0000100 epoch_Time:23502.0min: [2024-01-06 16:46:22,480][model8_pretrain.py][INFO] Epoch:[0/2](896500/4588595) loss:3.049 lr:0.0000100 epoch_Time:23502.0min: [2024-01-06 16:46:22,480][model8_pretrain.py][INFO] Epoch:[0/2](896500/4588595) loss:2.981 lr:0.0000100 epoch_Time:23502.0min: [2024-01-06 16:46:22,480][model8_pretrain.py][INFO] Epoch:[0/2](896500/4588595) loss:2.780 lr:0.0000100 epoch_Time:23502.0min: [2024-01-06 
16:46:22,480][model8_pretrain.py][INFO] Epoch:[0/2](896500/4588595) loss:2.846 lr:0.0000100 epoch_Time:23502.0min: [2024-01-06 16:46:22,481][model8_pretrain.py][INFO] Epoch:[0/2](896500/4588595) loss:3.016 lr:0.0000100 epoch_Time:23502.0min: [2024-01-06 16:46:22,481][model8_pretrain.py][INFO] Epoch:[0/2](896500/4588595) loss:3.167 lr:0.0000100 epoch_Time:23502.0min: [2024-01-06 16:46:59,420][model8_pretrain.py][INFO] Epoch:[0/2](896600/4588595) loss:2.365 lr:0.0000100 epoch_Time:23501.0min: [2024-01-06 16:46:59,420][model8_pretrain.py][INFO] Epoch:[0/2](896600/4588595) loss:2.305 lr:0.0000100 epoch_Time:23501.0min: [2024-01-06 16:46:59,420][model8_pretrain.py][INFO] Epoch:[0/2](896600/4588595) loss:3.117 lr:0.0000100 epoch_Time:23501.0min: [2024-01-06 16:46:59,420][model8_pretrain.py][INFO] Epoch:[0/2](896600/4588595) loss:2.820 lr:0.0000100 epoch_Time:23501.0min: [2024-01-06 16:46:59,420][model8_pretrain.py][INFO] Epoch:[0/2](896600/4588595) loss:2.551 lr:0.0000100 epoch_Time:23501.0min: [2024-01-06 16:46:59,420][model8_pretrain.py][INFO] Epoch:[0/2](896600/4588595) loss:2.845 lr:0.0000100 epoch_Time:23501.0min: [2024-01-06 16:46:59,420][model8_pretrain.py][INFO] Epoch:[0/2](896600/4588595) loss:3.251 lr:0.0000100 epoch_Time:23501.0min: [2024-01-06 16:46:59,420][model8_pretrain.py][INFO] Epoch:[0/2](896600/4588595) loss:3.072 lr:0.0000100 epoch_Time:23501.0min: [2024-01-06 16:47:36,352][model8_pretrain.py][INFO] Epoch:[0/2](896700/4588595) loss:2.708 lr:0.0000100 epoch_Time:23500.0min: [2024-01-06 16:47:36,352][model8_pretrain.py][INFO] Epoch:[0/2](896700/4588595) loss:2.493 lr:0.0000100 epoch_Time:23500.0min: [2024-01-06 16:47:36,352][model8_pretrain.py][INFO] Epoch:[0/2](896700/4588595) loss:2.316 lr:0.0000100 epoch_Time:23500.0min: [2024-01-06 16:47:36,352][model8_pretrain.py][INFO] Epoch:[0/2](896700/4588595) loss:2.286 lr:0.0000100 epoch_Time:23500.0min: [2024-01-06 16:47:36,352][model8_pretrain.py][INFO] Epoch:[0/2](896700/4588595) loss:2.880 lr:0.0000100 
epoch_Time:23500.0min: [2024-01-06 16:47:36,352][model8_pretrain.py][INFO] Epoch:[0/2](896700/4588595) loss:2.940 lr:0.0000100 epoch_Time:23500.0min: [2024-01-06 16:47:36,353][model8_pretrain.py][INFO] Epoch:[0/2](896700/4588595) loss:2.655 lr:0.0000100 epoch_Time:23500.0min: [2024-01-06 16:47:36,353][model8_pretrain.py][INFO] Epoch:[0/2](896700/4588595) loss:2.809 lr:0.0000100 epoch_Time:23500.0min: [2024-01-06 16:48:19,918][model8_pretrain.py][INFO] Epoch:[0/2](896800/4588595) loss:2.727 lr:0.0000100 epoch_Time:23500.0min: [2024-01-06 16:48:19,918][model8_pretrain.py][INFO] Epoch:[0/2](896800/4588595) loss:2.597 lr:0.0000100 epoch_Time:23500.0min: [2024-01-06 16:48:19,918][model8_pretrain.py][INFO] Epoch:[0/2](896800/4588595) loss:2.802 lr:0.0000100 epoch_Time:23500.0min: [2024-01-06 16:48:19,918][model8_pretrain.py][INFO] Epoch:[0/2](896800/4588595) loss:2.438 lr:0.0000100 epoch_Time:23500.0min: [2024-01-06 16:48:19,918][model8_pretrain.py][INFO] Epoch:[0/2](896800/4588595) loss:2.844 lr:0.0000100 epoch_Time:23500.0min: [2024-01-06 16:48:19,918][model8_pretrain.py][INFO] Epoch:[0/2](896800/4588595) loss:2.682 lr:0.0000100 epoch_Time:23500.0min: [2024-01-06 16:48:19,918][model8_pretrain.py][INFO] Epoch:[0/2](896800/4588595) loss:2.698 lr:0.0000100 epoch_Time:23500.0min: [2024-01-06 16:48:19,918][model8_pretrain.py][INFO] Epoch:[0/2](896800/4588595) loss:2.786 lr:0.0000100 epoch_Time:23500.0min: [2024-01-06 16:49:02,085][model8_pretrain.py][INFO] Epoch:[0/2](896900/4588595) loss:3.374 lr:0.0000100 epoch_Time:23499.0min: [2024-01-06 16:49:02,085][model8_pretrain.py][INFO] Epoch:[0/2](896900/4588595) loss:2.678 lr:0.0000100 epoch_Time:23499.0min: [2024-01-06 16:49:02,085][model8_pretrain.py][INFO] Epoch:[0/2](896900/4588595) loss:3.238 lr:0.0000100 epoch_Time:23499.0min: [2024-01-06 16:49:02,085][model8_pretrain.py][INFO] Epoch:[0/2](896900/4588595) loss:2.871 lr:0.0000100 epoch_Time:23499.0min: [2024-01-06 16:49:02,085][model8_pretrain.py][INFO] 
Epoch:[0/2](896900/4588595) loss:2.531 lr:0.0000100 epoch_Time:23499.0min: [2024-01-06 16:49:02,085][model8_pretrain.py][INFO] Epoch:[0/2](896900/4588595) loss:2.538 lr:0.0000100 epoch_Time:23499.0min: [2024-01-06 16:49:02,085][model8_pretrain.py][INFO] Epoch:[0/2](896900/4588595) loss:2.812 lr:0.0000100 epoch_Time:23499.0min: [2024-01-06 16:49:02,085][model8_pretrain.py][INFO] Epoch:[0/2](896900/4588595) loss:2.811 lr:0.0000100 epoch_Time:23499.0min: [2024-01-06 16:49:39,018][model8_pretrain.py][INFO] Epoch:[0/2](897000/4588595) loss:2.900 lr:0.0000100 epoch_Time:23499.0min: [2024-01-06 16:49:39,018][model8_pretrain.py][INFO] Epoch:[0/2](897000/4588595) loss:2.855 lr:0.0000100 epoch_Time:23499.0min: [2024-01-06 16:49:39,018][model8_pretrain.py][INFO] Epoch:[0/2](897000/4588595) loss:2.677 lr:0.0000100 epoch_Time:23499.0min: [2024-01-06 16:49:39,019][model8_pretrain.py][INFO] Epoch:[0/2](897000/4588595) loss:2.641 lr:0.0000100 epoch_Time:23499.0min: [2024-01-06 16:49:39,019][model8_pretrain.py][INFO] Epoch:[0/2](897000/4588595) loss:2.697 lr:0.0000100 epoch_Time:23499.0min: [2024-01-06 16:49:39,019][model8_pretrain.py][INFO] Epoch:[0/2](897000/4588595) loss:3.076 lr:0.0000100 epoch_Time:23499.0min: [2024-01-06 16:49:39,019][model8_pretrain.py][INFO] Epoch:[0/2](897000/4588595) loss:3.124 lr:0.0000100 epoch_Time:23499.0min: [2024-01-06 16:49:39,019][model8_pretrain.py][INFO] Epoch:[0/2](897000/4588595) loss:2.568 lr:0.0000100 epoch_Time:23499.0min: [2024-01-06 16:50:15,960][model8_pretrain.py][INFO] Epoch:[0/2](897100/4588595) loss:2.698 lr:0.0000100 epoch_Time:23498.0min: [2024-01-06 16:50:15,960][model8_pretrain.py][INFO] Epoch:[0/2](897100/4588595) loss:2.496 lr:0.0000100 epoch_Time:23498.0min: [2024-01-06 16:50:15,960][model8_pretrain.py][INFO] Epoch:[0/2](897100/4588595) loss:2.408 lr:0.0000100 epoch_Time:23498.0min: [2024-01-06 16:50:15,960][model8_pretrain.py][INFO] Epoch:[0/2](897100/4588595) loss:2.426 lr:0.0000100 epoch_Time:23498.0min: [2024-01-06 
16:50:15,960][model8_pretrain.py][INFO] Epoch:[0/2](897100/4588595) loss:2.570 lr:0.0000100 epoch_Time:23498.0min: [2024-01-06 16:50:15,960][model8_pretrain.py][INFO] Epoch:[0/2](897100/4588595) loss:2.751 lr:0.0000100 epoch_Time:23498.0min: [2024-01-06 16:50:15,960][model8_pretrain.py][INFO] Epoch:[0/2](897100/4588595) loss:2.719 lr:0.0000100 epoch_Time:23498.0min: [2024-01-06 16:50:15,960][model8_pretrain.py][INFO] Epoch:[0/2](897100/4588595) loss:2.907 lr:0.0000100 epoch_Time:23498.0min: [2024-01-06 16:50:52,900][model8_pretrain.py][INFO] Epoch:[0/2](897200/4588595) loss:2.791 lr:0.0000100 epoch_Time:23497.0min: [2024-01-06 16:50:52,900][model8_pretrain.py][INFO] Epoch:[0/2](897200/4588595) loss:2.678 lr:0.0000100 epoch_Time:23497.0min: [2024-01-06 16:50:52,900][model8_pretrain.py][INFO] Epoch:[0/2](897200/4588595) loss:2.428 lr:0.0000100 epoch_Time:23497.0min: [2024-01-06 16:50:52,900][model8_pretrain.py][INFO] Epoch:[0/2](897200/4588595) loss:2.847 lr:0.0000100 epoch_Time:23497.0min: [2024-01-06 16:50:52,900][model8_pretrain.py][INFO] Epoch:[0/2](897200/4588595) loss:2.369 lr:0.0000100 epoch_Time:23497.0min: [2024-01-06 16:50:52,900][model8_pretrain.py][INFO] Epoch:[0/2](897200/4588595) loss:2.800 lr:0.0000100 epoch_Time:23497.0min: [2024-01-06 16:50:52,900][model8_pretrain.py][INFO] Epoch:[0/2](897200/4588595) loss:3.015 lr:0.0000100 epoch_Time:23497.0min: [2024-01-06 16:50:52,900][model8_pretrain.py][INFO] Epoch:[0/2](897200/4588595) loss:2.930 lr:0.0000100 epoch_Time:23497.0min: [2024-01-06 16:51:29,868][model8_pretrain.py][INFO] Epoch:[0/2](897300/4588595) loss:2.846 lr:0.0000100 epoch_Time:23497.0min: [2024-01-06 16:51:29,868][model8_pretrain.py][INFO] Epoch:[0/2](897300/4588595) loss:2.829 lr:0.0000100 epoch_Time:23497.0min: [2024-01-06 16:51:29,868][model8_pretrain.py][INFO] Epoch:[0/2](897300/4588595) loss:2.358 lr:0.0000100 epoch_Time:23497.0min: [2024-01-06 16:51:29,868][model8_pretrain.py][INFO] Epoch:[0/2](897300/4588595) loss:2.839 lr:0.0000100 
epoch_Time:23497.0min: [2024-01-06 16:51:29,868][model8_pretrain.py][INFO] Epoch:[0/2](897300/4588595) loss:3.142 lr:0.0000100 epoch_Time:23497.0min: [2024-01-06 16:51:29,868][model8_pretrain.py][INFO] Epoch:[0/2](897300/4588595) loss:3.055 lr:0.0000100 epoch_Time:23497.0min: [2024-01-06 16:51:29,868][model8_pretrain.py][INFO] Epoch:[0/2](897300/4588595) loss:3.313 lr:0.0000100 epoch_Time:23497.0min: [2024-01-06 16:51:29,868][model8_pretrain.py][INFO] Epoch:[0/2](897300/4588595) loss:3.013 lr:0.0000100 epoch_Time:23497.0min: [2024-01-06 16:52:06,801][model8_pretrain.py][INFO] Epoch:[0/2](897400/4588595) loss:2.254 lr:0.0000100 epoch_Time:23496.0min: [2024-01-06 16:52:06,801][model8_pretrain.py][INFO] Epoch:[0/2](897400/4588595) loss:2.660 lr:0.0000100 epoch_Time:23496.0min: [2024-01-06 16:52:06,801][model8_pretrain.py][INFO] Epoch:[0/2](897400/4588595) loss:3.252 lr:0.0000100 epoch_Time:23496.0min: [2024-01-06 16:52:06,801][model8_pretrain.py][INFO] Epoch:[0/2](897400/4588595) loss:3.020 lr:0.0000100 epoch_Time:23496.0min: [2024-01-06 16:52:06,801][model8_pretrain.py][INFO] Epoch:[0/2](897400/4588595) loss:2.476 lr:0.0000100 epoch_Time:23496.0min: [2024-01-06 16:52:06,801][model8_pretrain.py][INFO] Epoch:[0/2](897400/4588595) loss:3.023 lr:0.0000100 epoch_Time:23496.0min: [2024-01-06 16:52:06,801][model8_pretrain.py][INFO] Epoch:[0/2](897400/4588595) loss:2.277 lr:0.0000100 epoch_Time:23496.0min: [2024-01-06 16:52:06,801][model8_pretrain.py][INFO] Epoch:[0/2](897400/4588595) loss:3.348 lr:0.0000100 epoch_Time:23496.0min: [2024-01-06 16:52:43,732][model8_pretrain.py][INFO] Epoch:[0/2](897500/4588595) loss:2.364 lr:0.0000100 epoch_Time:23496.0min: [2024-01-06 16:52:43,732][model8_pretrain.py][INFO] Epoch:[0/2](897500/4588595) loss:2.819 lr:0.0000100 epoch_Time:23496.0min: [2024-01-06 16:52:43,733][model8_pretrain.py][INFO] Epoch:[0/2](897500/4588595) loss:2.317 lr:0.0000100 epoch_Time:23496.0min: [2024-01-06 16:52:43,733][model8_pretrain.py][INFO] 
Epoch:[0/2](897500/4588595) loss:2.687 lr:0.0000100 epoch_Time:23496.0min: [2024-01-06 16:52:43,733][model8_pretrain.py][INFO] Epoch:[0/2](897500/4588595) loss:2.889 lr:0.0000100 epoch_Time:23496.0min: [2024-01-06 16:52:43,733][model8_pretrain.py][INFO] Epoch:[0/2](897500/4588595) loss:3.254 lr:0.0000100 epoch_Time:23496.0min: [2024-01-06 16:52:43,733][model8_pretrain.py][INFO] Epoch:[0/2](897500/4588595) loss:2.720 lr:0.0000100 epoch_Time:23496.0min: [2024-01-06 16:52:43,733][model8_pretrain.py][INFO] Epoch:[0/2](897500/4588595) loss:3.125 lr:0.0000100 epoch_Time:23496.0min: [2024-01-06 16:53:27,460][model8_pretrain.py][INFO] Epoch:[0/2](897600/4588595) loss:3.227 lr:0.0000100 epoch_Time:23495.0min: [2024-01-06 16:53:27,460][model8_pretrain.py][INFO] Epoch:[0/2](897600/4588595) loss:3.112 lr:0.0000100 epoch_Time:23495.0min: [2024-01-06 16:53:27,460][model8_pretrain.py][INFO] Epoch:[0/2](897600/4588595) loss:2.079 lr:0.0000100 epoch_Time:23495.0min: [2024-01-06 16:53:27,460][model8_pretrain.py][INFO] Epoch:[0/2](897600/4588595) loss:2.204 lr:0.0000100 epoch_Time:23495.0min: [2024-01-06 16:53:27,460][model8_pretrain.py][INFO] Epoch:[0/2](897600/4588595) loss:3.306 lr:0.0000100 epoch_Time:23495.0min: [2024-01-06 16:53:27,460][model8_pretrain.py][INFO] Epoch:[0/2](897600/4588595) loss:2.144 lr:0.0000100 epoch_Time:23495.0min: [2024-01-06 16:53:27,461][model8_pretrain.py][INFO] Epoch:[0/2](897600/4588595) loss:2.787 lr:0.0000100 epoch_Time:23495.0min: [2024-01-06 16:53:27,461][model8_pretrain.py][INFO] Epoch:[0/2](897600/4588595) loss:2.428 lr:0.0000100 epoch_Time:23495.0min: [2024-01-06 16:54:09,582][model8_pretrain.py][INFO] Epoch:[0/2](897700/4588595) loss:2.910 lr:0.0000100 epoch_Time:23494.0min: [2024-01-06 16:54:09,582][model8_pretrain.py][INFO] Epoch:[0/2](897700/4588595) loss:2.381 lr:0.0000100 epoch_Time:23494.0min: [2024-01-06 16:54:09,582][model8_pretrain.py][INFO] Epoch:[0/2](897700/4588595) loss:3.140 lr:0.0000100 epoch_Time:23494.0min: [2024-01-06 
16:54:09,582][model8_pretrain.py][INFO] Epoch:[0/2](897700/4588595) loss:2.846 lr:0.0000100 epoch_Time:23494.0min: [2024-01-06 16:54:09,582][model8_pretrain.py][INFO] Epoch:[0/2](897700/4588595) loss:3.083 lr:0.0000100 epoch_Time:23494.0min: [2024-01-06 16:54:09,582][model8_pretrain.py][INFO] Epoch:[0/2](897700/4588595) loss:3.440 lr:0.0000100 epoch_Time:23494.0min: [2024-01-06 16:54:09,582][model8_pretrain.py][INFO] Epoch:[0/2](897700/4588595) loss:2.780 lr:0.0000100 epoch_Time:23494.0min: [2024-01-06 16:54:09,582][model8_pretrain.py][INFO] Epoch:[0/2](897700/4588595) loss:3.064 lr:0.0000100 epoch_Time:23494.0min: [2024-01-06 16:54:46,513][model8_pretrain.py][INFO] Epoch:[0/2](897800/4588595) loss:2.735 lr:0.0000100 epoch_Time:23494.0min: [2024-01-06 16:54:46,513][model8_pretrain.py][INFO] Epoch:[0/2](897800/4588595) loss:2.463 lr:0.0000100 epoch_Time:23494.0min: [2024-01-06 16:54:46,513][model8_pretrain.py][INFO] Epoch:[0/2](897800/4588595) loss:2.602 lr:0.0000100 epoch_Time:23494.0min: [2024-01-06 16:54:46,513][model8_pretrain.py][INFO] Epoch:[0/2](897800/4588595) loss:2.853 lr:0.0000100 epoch_Time:23494.0min: [2024-01-06 16:54:46,514][model8_pretrain.py][INFO] Epoch:[0/2](897800/4588595) loss:2.830 lr:0.0000100 epoch_Time:23494.0min: [2024-01-06 16:54:46,514][model8_pretrain.py][INFO] Epoch:[0/2](897800/4588595) loss:2.587 lr:0.0000100 epoch_Time:23494.0min: [2024-01-06 16:54:46,514][model8_pretrain.py][INFO] Epoch:[0/2](897800/4588595) loss:2.940 lr:0.0000100 epoch_Time:23494.0min: [2024-01-06 16:54:46,514][model8_pretrain.py][INFO] Epoch:[0/2](897800/4588595) loss:3.296 lr:0.0000100 epoch_Time:23494.0min: [2024-01-06 16:55:23,448][model8_pretrain.py][INFO] Epoch:[0/2](897900/4588595) loss:1.942 lr:0.0000100 epoch_Time:23493.0min: [2024-01-06 16:55:23,448][model8_pretrain.py][INFO] Epoch:[0/2](897900/4588595) loss:2.424 lr:0.0000100 epoch_Time:23493.0min: [2024-01-06 16:55:23,448][model8_pretrain.py][INFO] Epoch:[0/2](897900/4588595) loss:2.631 lr:0.0000100 
epoch_Time:23493.0min: [2024-01-06 16:55:23,448][model8_pretrain.py][INFO] Epoch:[0/2](897900/4588595) loss:3.107 lr:0.0000100 epoch_Time:23493.0min: [2024-01-06 16:55:23,448][model8_pretrain.py][INFO] Epoch:[0/2](897900/4588595) loss:2.756 lr:0.0000100 epoch_Time:23493.0min: [2024-01-06 16:55:23,448][model8_pretrain.py][INFO] Epoch:[0/2](897900/4588595) loss:3.300 lr:0.0000100 epoch_Time:23493.0min: [2024-01-06 16:55:23,448][model8_pretrain.py][INFO] Epoch:[0/2](897900/4588595) loss:2.727 lr:0.0000100 epoch_Time:23493.0min: [2024-01-06 16:55:23,448][model8_pretrain.py][INFO] Epoch:[0/2](897900/4588595) loss:2.922 lr:0.0000100 epoch_Time:23493.0min: [2024-01-06 16:56:00,402][model8_pretrain.py][INFO] Epoch:[0/2](898000/4588595) loss:3.187 lr:0.0000100 epoch_Time:23492.0min: [2024-01-06 16:56:00,402][model8_pretrain.py][INFO] Epoch:[0/2](898000/4588595) loss:2.676 lr:0.0000100 epoch_Time:23492.0min: [2024-01-06 16:56:00,402][model8_pretrain.py][INFO] Epoch:[0/2](898000/4588595) loss:2.813 lr:0.0000100 epoch_Time:23492.0min: [2024-01-06 16:56:00,402][model8_pretrain.py][INFO] Epoch:[0/2](898000/4588595) loss:3.235 lr:0.0000100 epoch_Time:23492.0min: [2024-01-06 16:56:00,402][model8_pretrain.py][INFO] Epoch:[0/2](898000/4588595) loss:2.172 lr:0.0000100 epoch_Time:23492.0min: [2024-01-06 16:56:00,402][model8_pretrain.py][INFO] Epoch:[0/2](898000/4588595) loss:2.988 lr:0.0000100 epoch_Time:23492.0min: [2024-01-06 16:56:00,402][model8_pretrain.py][INFO] Epoch:[0/2](898000/4588595) loss:3.078 lr:0.0000100 epoch_Time:23492.0min: [2024-01-06 16:56:00,402][model8_pretrain.py][INFO] Epoch:[0/2](898000/4588595) loss:2.778 lr:0.0000100 epoch_Time:23492.0min: [2024-01-06 16:56:37,332][model8_pretrain.py][INFO] Epoch:[0/2](898100/4588595) loss:3.214 lr:0.0000100 epoch_Time:23492.0min: [2024-01-06 16:56:37,332][model8_pretrain.py][INFO] Epoch:[0/2](898100/4588595) loss:2.755 lr:0.0000100 epoch_Time:23492.0min: [2024-01-06 16:56:37,332][model8_pretrain.py][INFO] 
Epoch:[0/2](898100/4588595) loss:2.888 lr:0.0000100 epoch_Time:23492.0min: [2024-01-06 16:56:37,332][model8_pretrain.py][INFO] Epoch:[0/2](898100/4588595) loss:2.068 lr:0.0000100 epoch_Time:23492.0min: [2024-01-06 16:56:37,332][model8_pretrain.py][INFO] Epoch:[0/2](898100/4588595) loss:3.146 lr:0.0000100 epoch_Time:23492.0min: [2024-01-06 16:56:37,332][model8_pretrain.py][INFO] Epoch:[0/2](898100/4588595) loss:2.762 lr:0.0000100 epoch_Time:23492.0min: [2024-01-06 16:56:37,332][model8_pretrain.py][INFO] Epoch:[0/2](898100/4588595) loss:2.739 lr:0.0000100 epoch_Time:23492.0min: [2024-01-06 16:56:37,332][model8_pretrain.py][INFO] Epoch:[0/2](898100/4588595) loss:2.320 lr:0.0000100 epoch_Time:23492.0min: [2024-01-06 16:57:14,263][model8_pretrain.py][INFO] Epoch:[0/2](898200/4588595) loss:3.143 lr:0.0000100 epoch_Time:23491.0min: [2024-01-06 16:57:14,263][model8_pretrain.py][INFO] Epoch:[0/2](898200/4588595) loss:2.918 lr:0.0000100 epoch_Time:23491.0min: [2024-01-06 16:57:14,263][model8_pretrain.py][INFO] Epoch:[0/2](898200/4588595) loss:2.642 lr:0.0000100 epoch_Time:23491.0min: [2024-01-06 16:57:14,263][model8_pretrain.py][INFO] Epoch:[0/2](898200/4588595) loss:2.527 lr:0.0000100 epoch_Time:23491.0min: [2024-01-06 16:57:14,263][model8_pretrain.py][INFO] Epoch:[0/2](898200/4588595) loss:2.834 lr:0.0000100 epoch_Time:23491.0min: [2024-01-06 16:57:14,263][model8_pretrain.py][INFO] Epoch:[0/2](898200/4588595) loss:2.682 lr:0.0000100 epoch_Time:23491.0min: [2024-01-06 16:57:14,263][model8_pretrain.py][INFO] Epoch:[0/2](898200/4588595) loss:3.130 lr:0.0000100 epoch_Time:23491.0min: [2024-01-06 16:57:14,263][model8_pretrain.py][INFO] Epoch:[0/2](898200/4588595) loss:2.700 lr:0.0000100 epoch_Time:23491.0min: [2024-01-06 16:57:51,191][model8_pretrain.py][INFO] Epoch:[0/2](898300/4588595) loss:2.547 lr:0.0000100 epoch_Time:23490.0min: [2024-01-06 16:57:51,191][model8_pretrain.py][INFO] Epoch:[0/2](898300/4588595) loss:2.405 lr:0.0000100 epoch_Time:23490.0min: [2024-01-06 
16:57:51,191][model8_pretrain.py][INFO] Epoch:[0/2](898300/4588595) loss:2.667 lr:0.0000100 epoch_Time:23490.0min: [2024-01-06 16:57:51,191][model8_pretrain.py][INFO] Epoch:[0/2](898300/4588595) loss:2.565 lr:0.0000100 epoch_Time:23490.0min: [2024-01-06 16:57:51,191][model8_pretrain.py][INFO] Epoch:[0/2](898300/4588595) loss:3.123 lr:0.0000100 epoch_Time:23490.0min: [2024-01-06 16:57:51,191][model8_pretrain.py][INFO] Epoch:[0/2](898300/4588595) loss:2.346 lr:0.0000100 epoch_Time:23490.0min: [2024-01-06 16:57:51,191][model8_pretrain.py][INFO] Epoch:[0/2](898300/4588595) loss:2.533 lr:0.0000100 epoch_Time:23490.0min: [2024-01-06 16:57:51,191][model8_pretrain.py][INFO] Epoch:[0/2](898300/4588595) loss:2.378 lr:0.0000100 epoch_Time:23490.0min: [2024-01-06 16:58:33,165][model8_pretrain.py][INFO] Epoch:[0/2](898400/4588595) loss:2.549 lr:0.0000100 epoch_Time:23490.0min: [2024-01-06 16:58:33,165][model8_pretrain.py][INFO] Epoch:[0/2](898400/4588595) loss:3.024 lr:0.0000100 epoch_Time:23490.0min: [2024-01-06 16:58:33,165][model8_pretrain.py][INFO] Epoch:[0/2](898400/4588595) loss:3.018 lr:0.0000100 epoch_Time:23490.0min: [2024-01-06 16:58:33,170][model8_pretrain.py][INFO] Epoch:[0/2](898400/4588595) loss:2.115 lr:0.0000100 epoch_Time:23490.0min: [2024-01-06 16:58:33,170][model8_pretrain.py][INFO] Epoch:[0/2](898400/4588595) loss:2.673 lr:0.0000100 epoch_Time:23490.0min: [2024-01-06 16:58:33,170][model8_pretrain.py][INFO] Epoch:[0/2](898400/4588595) loss:2.899 lr:0.0000100 epoch_Time:23490.0min: [2024-01-06 16:58:33,170][model8_pretrain.py][INFO] Epoch:[0/2](898400/4588595) loss:2.480 lr:0.0000100 epoch_Time:23490.0min: [2024-01-06 16:58:34,904][model8_pretrain.py][INFO] Epoch:[0/2](898400/4588595) loss:2.620 lr:0.0000100 epoch_Time:23490.0min: [2024-01-06 16:59:18,400][model8_pretrain.py][INFO] Epoch:[0/2](898500/4588595) loss:3.111 lr:0.0000100 epoch_Time:23490.0min: [2024-01-06 16:59:18,400][model8_pretrain.py][INFO] Epoch:[0/2](898500/4588595) loss:2.236 lr:0.0000100 
epoch_Time:23490.0min: [2024-01-06 16:59:18,401][model8_pretrain.py][INFO] Epoch:[0/2](898500/4588595) loss:2.763 lr:0.0000100 epoch_Time:23490.0min: [2024-01-06 16:59:18,401][model8_pretrain.py][INFO] Epoch:[0/2](898500/4588595) loss:2.821 lr:0.0000100 epoch_Time:23490.0min: [2024-01-06 16:59:18,401][model8_pretrain.py][INFO] Epoch:[0/2](898500/4588595) loss:3.181 lr:0.0000100 epoch_Time:23490.0min: [2024-01-06 16:59:18,401][model8_pretrain.py][INFO] Epoch:[0/2](898500/4588595) loss:2.658 lr:0.0000100 epoch_Time:23490.0min: [2024-01-06 16:59:18,401][model8_pretrain.py][INFO] Epoch:[0/2](898500/4588595) loss:2.529 lr:0.0000100 epoch_Time:23490.0min: [2024-01-06 16:59:18,401][model8_pretrain.py][INFO] Epoch:[0/2](898500/4588595) loss:2.399 lr:0.0000100 epoch_Time:23490.0min: [2024-01-06 16:59:55,334][model8_pretrain.py][INFO] Epoch:[0/2](898600/4588595) loss:2.674 lr:0.0000100 epoch_Time:23489.0min: [2024-01-06 16:59:55,334][model8_pretrain.py][INFO] Epoch:[0/2](898600/4588595) loss:3.340 lr:0.0000100 epoch_Time:23489.0min: [2024-01-06 16:59:55,334][model8_pretrain.py][INFO] Epoch:[0/2](898600/4588595) loss:2.614 lr:0.0000100 epoch_Time:23489.0min: [2024-01-06 16:59:55,334][model8_pretrain.py][INFO] Epoch:[0/2](898600/4588595) loss:2.965 lr:0.0000100 epoch_Time:23489.0min: [2024-01-06 16:59:55,334][model8_pretrain.py][INFO] Epoch:[0/2](898600/4588595) loss:2.556 lr:0.0000100 epoch_Time:23489.0min: [2024-01-06 16:59:55,334][model8_pretrain.py][INFO] Epoch:[0/2](898600/4588595) loss:2.831 lr:0.0000100 epoch_Time:23489.0min: [2024-01-06 16:59:55,334][model8_pretrain.py][INFO] Epoch:[0/2](898600/4588595) loss:2.156 lr:0.0000100 epoch_Time:23489.0min: [2024-01-06 16:59:55,334][model8_pretrain.py][INFO] Epoch:[0/2](898600/4588595) loss:2.776 lr:0.0000100 epoch_Time:23489.0min: [2024-01-06 17:00:32,278][model8_pretrain.py][INFO] Epoch:[0/2](898700/4588595) loss:3.264 lr:0.0000100 epoch_Time:23489.0min: [2024-01-06 17:00:32,278][model8_pretrain.py][INFO] 
Epoch:[0/2](898700/4588595) loss:2.774 lr:0.0000100 epoch_Time:23489.0min: [2024-01-06 17:00:32,278][model8_pretrain.py][INFO] Epoch:[0/2](898700/4588595) loss:2.619 lr:0.0000100 epoch_Time:23489.0min: [2024-01-06 17:00:32,278][model8_pretrain.py][INFO] Epoch:[0/2](898700/4588595) loss:2.643 lr:0.0000100 epoch_Time:23489.0min: [2024-01-06 17:00:32,278][model8_pretrain.py][INFO] Epoch:[0/2](898700/4588595) loss:3.100 lr:0.0000100 epoch_Time:23489.0min: [2024-01-06 17:00:32,278][model8_pretrain.py][INFO] Epoch:[0/2](898700/4588595) loss:2.920 lr:0.0000100 epoch_Time:23489.0min: [2024-01-06 17:00:32,278][model8_pretrain.py][INFO] Epoch:[0/2](898700/4588595) loss:3.021 lr:0.0000100 epoch_Time:23489.0min: [2024-01-06 17:00:32,278][model8_pretrain.py][INFO] Epoch:[0/2](898700/4588595) loss:2.481 lr:0.0000100 epoch_Time:23489.0min: [2024-01-06 17:01:09,222][model8_pretrain.py][INFO] Epoch:[0/2](898800/4588595) loss:2.538 lr:0.0000100 epoch_Time:23487.0min: [2024-01-06 17:01:09,222][model8_pretrain.py][INFO] Epoch:[0/2](898800/4588595) loss:2.707 lr:0.0000100 epoch_Time:23487.0min: [2024-01-06 17:01:09,222][model8_pretrain.py][INFO] Epoch:[0/2](898800/4588595) loss:2.834 lr:0.0000100 epoch_Time:23487.0min: [2024-01-06 17:01:09,223][model8_pretrain.py][INFO] Epoch:[0/2](898800/4588595) loss:2.848 lr:0.0000100 epoch_Time:23487.0min: [2024-01-06 17:01:09,223][model8_pretrain.py][INFO] Epoch:[0/2](898800/4588595) loss:2.133 lr:0.0000100 epoch_Time:23487.0min: [2024-01-06 17:01:09,223][model8_pretrain.py][INFO] Epoch:[0/2](898800/4588595) loss:2.457 lr:0.0000100 epoch_Time:23487.0min: [2024-01-06 17:01:09,223][model8_pretrain.py][INFO] Epoch:[0/2](898800/4588595) loss:2.884 lr:0.0000100 epoch_Time:23487.0min: [2024-01-06 17:01:09,223][model8_pretrain.py][INFO] Epoch:[0/2](898800/4588595) loss:2.890 lr:0.0000100 epoch_Time:23487.0min: [2024-01-06 17:01:46,147][model8_pretrain.py][INFO] Epoch:[0/2](898900/4588595) loss:2.641 lr:0.0000100 epoch_Time:23487.0min: [2024-01-06 
17:01:46,147][model8_pretrain.py][INFO] Epoch:[0/2](898900/4588595) loss:2.835 lr:0.0000100 epoch_Time:23487.0min: [2024-01-06 17:01:46,147][model8_pretrain.py][INFO] Epoch:[0/2](898900/4588595) loss:2.722 lr:0.0000100 epoch_Time:23487.0min: [2024-01-06 17:01:46,147][model8_pretrain.py][INFO] Epoch:[0/2](898900/4588595) loss:2.882 lr:0.0000100 epoch_Time:23487.0min: [2024-01-06 17:01:46,147][model8_pretrain.py][INFO] Epoch:[0/2](898900/4588595) loss:2.674 lr:0.0000100 epoch_Time:23487.0min: [2024-01-06 17:01:46,147][model8_pretrain.py][INFO] Epoch:[0/2](898900/4588595) loss:1.894 lr:0.0000100 epoch_Time:23487.0min: [2024-01-06 17:01:46,147][model8_pretrain.py][INFO] Epoch:[0/2](898900/4588595) loss:2.928 lr:0.0000100 epoch_Time:23487.0min: [2024-01-06 17:01:46,147][model8_pretrain.py][INFO] Epoch:[0/2](898900/4588595) loss:2.540 lr:0.0000100 epoch_Time:23487.0min: [2024-01-06 17:02:23,081][model8_pretrain.py][INFO] Epoch:[0/2](899000/4588595) loss:2.897 lr:0.0000100 epoch_Time:23486.0min: [2024-01-06 17:02:23,081][model8_pretrain.py][INFO] Epoch:[0/2](899000/4588595) loss:2.898 lr:0.0000100 epoch_Time:23486.0min: [2024-01-06 17:02:23,081][model8_pretrain.py][INFO] Epoch:[0/2](899000/4588595) loss:2.517 lr:0.0000100 epoch_Time:23486.0min: [2024-01-06 17:02:23,081][model8_pretrain.py][INFO] Epoch:[0/2](899000/4588595) loss:2.714 lr:0.0000100 epoch_Time:23486.0min: [2024-01-06 17:02:23,081][model8_pretrain.py][INFO] Epoch:[0/2](899000/4588595) loss:2.470 lr:0.0000100 epoch_Time:23486.0min: [2024-01-06 17:02:23,081][model8_pretrain.py][INFO] Epoch:[0/2](899000/4588595) loss:2.585 lr:0.0000100 epoch_Time:23486.0min: [2024-01-06 17:02:23,081][model8_pretrain.py][INFO] Epoch:[0/2](899000/4588595) loss:2.693 lr:0.0000100 epoch_Time:23486.0min: [2024-01-06 17:02:23,081][model8_pretrain.py][INFO] Epoch:[0/2](899000/4588595) loss:2.757 lr:0.0000100 epoch_Time:23486.0min: [2024-01-06 17:03:00,010][model8_pretrain.py][INFO] Epoch:[0/2](899100/4588595) loss:2.170 lr:0.0000100 
epoch_Time:23485.0min: [2024-01-06 17:03:00,010][model8_pretrain.py][INFO] Epoch:[0/2](899100/4588595) loss:2.195 lr:0.0000100 epoch_Time:23485.0min: [2024-01-06 17:03:00,010][model8_pretrain.py][INFO] Epoch:[0/2](899100/4588595) loss:2.626 lr:0.0000100 epoch_Time:23485.0min: [2024-01-06 17:03:00,010][model8_pretrain.py][INFO] Epoch:[0/2](899100/4588595) loss:2.276 lr:0.0000100 epoch_Time:23485.0min: [2024-01-06 17:03:00,010][model8_pretrain.py][INFO] Epoch:[0/2](899100/4588595) loss:3.017 lr:0.0000100 epoch_Time:23485.0min: [2024-01-06 17:03:00,010][model8_pretrain.py][INFO] Epoch:[0/2](899100/4588595) loss:3.191 lr:0.0000100 epoch_Time:23485.0min: [2024-01-06 17:03:00,010][model8_pretrain.py][INFO] Epoch:[0/2](899100/4588595) loss:3.460 lr:0.0000100 epoch_Time:23485.0min: [2024-01-06 17:03:00,010][model8_pretrain.py][INFO] Epoch:[0/2](899100/4588595) loss:2.777 lr:0.0000100 epoch_Time:23485.0min: [2024-01-06 17:03:40,471][model8_pretrain.py][INFO] Epoch:[0/2](899200/4588595) loss:3.051 lr:0.0000100 epoch_Time:23485.0min: [2024-01-06 17:03:40,471][model8_pretrain.py][INFO] Epoch:[0/2](899200/4588595) loss:3.090 lr:0.0000100 epoch_Time:23485.0min: [2024-01-06 17:03:40,471][model8_pretrain.py][INFO] Epoch:[0/2](899200/4588595) loss:2.147 lr:0.0000100 epoch_Time:23485.0min: [2024-01-06 17:03:40,471][model8_pretrain.py][INFO] Epoch:[0/2](899200/4588595) loss:2.495 lr:0.0000100 epoch_Time:23485.0min: [2024-01-06 17:03:40,471][model8_pretrain.py][INFO] Epoch:[0/2](899200/4588595) loss:3.284 lr:0.0000100 epoch_Time:23485.0min: [2024-01-06 17:03:40,472][model8_pretrain.py][INFO] Epoch:[0/2](899200/4588595) loss:2.665 lr:0.0000100 epoch_Time:23485.0min: [2024-01-06 17:03:40,471][model8_pretrain.py][INFO] Epoch:[0/2](899200/4588595) loss:2.926 lr:0.0000100 epoch_Time:23485.0min: [2024-01-06 17:03:40,472][model8_pretrain.py][INFO] Epoch:[0/2](899200/4588595) loss:3.116 lr:0.0000100 epoch_Time:23485.0min: [2024-01-06 17:04:27,426][model8_pretrain.py][INFO] 
Epoch:[0/2](899300/4588595) loss:3.149 lr:0.0000100 epoch_Time:23485.0min: [2024-01-06 17:04:27,426][model8_pretrain.py][INFO] Epoch:[0/2](899300/4588595) loss:2.749 lr:0.0000100 epoch_Time:23485.0min: [2024-01-06 17:04:27,427][model8_pretrain.py][INFO] Epoch:[0/2](899300/4588595) loss:2.848 lr:0.0000100 epoch_Time:23485.0min: [2024-01-06 17:04:27,427][model8_pretrain.py][INFO] Epoch:[0/2](899300/4588595) loss:2.614 lr:0.0000100 epoch_Time:23485.0min: [2024-01-06 17:04:27,427][model8_pretrain.py][INFO] Epoch:[0/2](899300/4588595) loss:2.991 lr:0.0000100 epoch_Time:23485.0min: [2024-01-06 17:04:27,427][model8_pretrain.py][INFO] Epoch:[0/2](899300/4588595) loss:2.855 lr:0.0000100 epoch_Time:23485.0min: [2024-01-06 17:04:27,427][model8_pretrain.py][INFO] Epoch:[0/2](899300/4588595) loss:2.515 lr:0.0000100 epoch_Time:23485.0min: [2024-01-06 17:04:27,427][model8_pretrain.py][INFO] Epoch:[0/2](899300/4588595) loss:2.491 lr:0.0000100 epoch_Time:23485.0min: [2024-01-06 17:05:04,382][model8_pretrain.py][INFO] Epoch:[0/2](899400/4588595) loss:2.478 lr:0.0000100 epoch_Time:23484.0min: [2024-01-06 17:05:04,382][model8_pretrain.py][INFO] Epoch:[0/2](899400/4588595) loss:2.604 lr:0.0000100 epoch_Time:23484.0min: [2024-01-06 17:05:04,382][model8_pretrain.py][INFO] Epoch:[0/2](899400/4588595) loss:3.310 lr:0.0000100 epoch_Time:23484.0min: [2024-01-06 17:05:04,382][model8_pretrain.py][INFO] Epoch:[0/2](899400/4588595) loss:3.055 lr:0.0000100 epoch_Time:23484.0min: [2024-01-06 17:05:04,382][model8_pretrain.py][INFO] Epoch:[0/2](899400/4588595) loss:3.352 lr:0.0000100 epoch_Time:23484.0min: [2024-01-06 17:05:04,382][model8_pretrain.py][INFO] Epoch:[0/2](899400/4588595) loss:2.953 lr:0.0000100 epoch_Time:23484.0min: [2024-01-06 17:05:04,382][model8_pretrain.py][INFO] Epoch:[0/2](899400/4588595) loss:2.413 lr:0.0000100 epoch_Time:23484.0min: [2024-01-06 17:05:04,382][model8_pretrain.py][INFO] Epoch:[0/2](899400/4588595) loss:2.520 lr:0.0000100 epoch_Time:23484.0min: [2024-01-06 
17:05:41,342][model8_pretrain.py][INFO] Epoch:[0/2](899500/4588595) loss:2.650 lr:0.0000100 epoch_Time:23484.0min: [2024-01-06 17:05:41,342][model8_pretrain.py][INFO] Epoch:[0/2](899500/4588595) loss:2.668 lr:0.0000100 epoch_Time:23484.0min: [2024-01-06 17:05:41,342][model8_pretrain.py][INFO] Epoch:[0/2](899500/4588595) loss:2.212 lr:0.0000100 epoch_Time:23484.0min: [2024-01-06 17:05:41,342][model8_pretrain.py][INFO] Epoch:[0/2](899500/4588595) loss:2.827 lr:0.0000100 epoch_Time:23484.0min: [2024-01-06 17:05:41,342][model8_pretrain.py][INFO] Epoch:[0/2](899500/4588595) loss:3.171 lr:0.0000100 epoch_Time:23484.0min: [2024-01-06 17:05:41,342][model8_pretrain.py][INFO] Epoch:[0/2](899500/4588595) loss:3.153 lr:0.0000100 epoch_Time:23484.0min: [2024-01-06 17:05:41,342][model8_pretrain.py][INFO] Epoch:[0/2](899500/4588595) loss:2.855 lr:0.0000100 epoch_Time:23484.0min: [2024-01-06 17:05:41,342][model8_pretrain.py][INFO] Epoch:[0/2](899500/4588595) loss:2.833 lr:0.0000100 epoch_Time:23484.0min: [2024-01-06 17:06:18,288][model8_pretrain.py][INFO] Epoch:[0/2](899600/4588595) loss:3.250 lr:0.0000100 epoch_Time:23483.0min: [2024-01-06 17:06:18,288][model8_pretrain.py][INFO] Epoch:[0/2](899600/4588595) loss:2.736 lr:0.0000100 epoch_Time:23483.0min: [2024-01-06 17:06:18,288][model8_pretrain.py][INFO] Epoch:[0/2](899600/4588595) loss:3.081 lr:0.0000100 epoch_Time:23483.0min: [2024-01-06 17:06:18,289][model8_pretrain.py][INFO] Epoch:[0/2](899600/4588595) loss:3.194 lr:0.0000100 epoch_Time:23483.0min: [2024-01-06 17:06:18,288][model8_pretrain.py][INFO] Epoch:[0/2](899600/4588595) loss:2.951 lr:0.0000100 epoch_Time:23483.0min: [2024-01-06 17:06:18,289][model8_pretrain.py][INFO] Epoch:[0/2](899600/4588595) loss:2.934 lr:0.0000100 epoch_Time:23483.0min: [2024-01-06 17:06:18,289][model8_pretrain.py][INFO] Epoch:[0/2](899600/4588595) loss:2.634 lr:0.0000100 epoch_Time:23483.0min: [2024-01-06 17:06:18,289][model8_pretrain.py][INFO] Epoch:[0/2](899600/4588595) loss:2.869 lr:0.0000100 
epoch_Time:23483.0min: [2024-01-06 17:06:55,227][model8_pretrain.py][INFO] Epoch:[0/2](899700/4588595) loss:2.734 lr:0.0000100 epoch_Time:23482.0min: [2024-01-06 17:06:55,227][model8_pretrain.py][INFO] Epoch:[0/2](899700/4588595) loss:2.917 lr:0.0000100 epoch_Time:23482.0min: [2024-01-06 17:06:55,227][model8_pretrain.py][INFO] Epoch:[0/2](899700/4588595) loss:2.974 lr:0.0000100 epoch_Time:23482.0min: [2024-01-06 17:06:55,227][model8_pretrain.py][INFO] Epoch:[0/2](899700/4588595) loss:2.771 lr:0.0000100 epoch_Time:23482.0min: [2024-01-06 17:06:55,227][model8_pretrain.py][INFO] Epoch:[0/2](899700/4588595) loss:2.569 lr:0.0000100 epoch_Time:23482.0min: [2024-01-06 17:06:55,227][model8_pretrain.py][INFO] Epoch:[0/2](899700/4588595) loss:2.629 lr:0.0000100 epoch_Time:23482.0min: [2024-01-06 17:06:55,227][model8_pretrain.py][INFO] Epoch:[0/2](899700/4588595) loss:2.866 lr:0.0000100 epoch_Time:23482.0min: [2024-01-06 17:06:55,228][model8_pretrain.py][INFO] Epoch:[0/2](899700/4588595) loss:2.500 lr:0.0000100 epoch_Time:23482.0min: [2024-01-06 17:07:32,188][model8_pretrain.py][INFO] Epoch:[0/2](899800/4588595) loss:3.092 lr:0.0000100 epoch_Time:23482.0min: [2024-01-06 17:07:32,188][model8_pretrain.py][INFO] Epoch:[0/2](899800/4588595) loss:3.156 lr:0.0000100 epoch_Time:23482.0min: [2024-01-06 17:07:32,188][model8_pretrain.py][INFO] Epoch:[0/2](899800/4588595) loss:2.872 lr:0.0000100 epoch_Time:23482.0min: [2024-01-06 17:07:32,188][model8_pretrain.py][INFO] Epoch:[0/2](899800/4588595) loss:3.052 lr:0.0000100 epoch_Time:23482.0min: [2024-01-06 17:07:32,188][model8_pretrain.py][INFO] Epoch:[0/2](899800/4588595) loss:2.595 lr:0.0000100 epoch_Time:23482.0min: [2024-01-06 17:07:32,188][model8_pretrain.py][INFO] Epoch:[0/2](899800/4588595) loss:2.763 lr:0.0000100 epoch_Time:23482.0min: [2024-01-06 17:07:32,188][model8_pretrain.py][INFO] Epoch:[0/2](899800/4588595) loss:2.207 lr:0.0000100 epoch_Time:23482.0min: [2024-01-06 17:07:32,188][model8_pretrain.py][INFO] 
Epoch:[0/2](899800/4588595) loss:2.439 lr:0.0000100 epoch_Time:23482.0min: [2024-01-06 17:08:09,142][model8_pretrain.py][INFO] Epoch:[0/2](899900/4588595) loss:2.508 lr:0.0000100 epoch_Time:23480.0min: [2024-01-06 17:08:09,142][model8_pretrain.py][INFO] Epoch:[0/2](899900/4588595) loss:2.969 lr:0.0000100 epoch_Time:23480.0min: [2024-01-06 17:08:09,142][model8_pretrain.py][INFO] Epoch:[0/2](899900/4588595) loss:2.274 lr:0.0000100 epoch_Time:23480.0min: [2024-01-06 17:08:09,142][model8_pretrain.py][INFO] Epoch:[0/2](899900/4588595) loss:3.123 lr:0.0000100 epoch_Time:23480.0min: [2024-01-06 17:08:09,142][model8_pretrain.py][INFO] Epoch:[0/2](899900/4588595) loss:2.779 lr:0.0000100 epoch_Time:23480.0min: [2024-01-06 17:08:09,142][model8_pretrain.py][INFO] Epoch:[0/2](899900/4588595) loss:3.193 lr:0.0000100 epoch_Time:23480.0min: [2024-01-06 17:08:09,142][model8_pretrain.py][INFO] Epoch:[0/2](899900/4588595) loss:2.758 lr:0.0000100 epoch_Time:23480.0min: [2024-01-06 17:08:09,142][model8_pretrain.py][INFO] Epoch:[0/2](899900/4588595) loss:2.947 lr:0.0000100 epoch_Time:23480.0min: [2024-01-06 17:08:47,857][model8_pretrain.py][INFO] Epoch:[0/2](900000/4588595) loss:3.253 lr:0.0000100 epoch_Time:23479.0min: [2024-01-06 17:08:47,857][model8_pretrain.py][INFO] Epoch:[0/2](900000/4588595) loss:2.468 lr:0.0000100 epoch_Time:23479.0min: [2024-01-06 17:08:47,857][model8_pretrain.py][INFO] Epoch:[0/2](900000/4588595) loss:2.944 lr:0.0000100 epoch_Time:23479.0min: [2024-01-06 17:08:47,857][model8_pretrain.py][INFO] Epoch:[0/2](900000/4588595) loss:2.840 lr:0.0000100 epoch_Time:23479.0min: [2024-01-06 17:08:47,857][model8_pretrain.py][INFO] Epoch:[0/2](900000/4588595) loss:2.753 lr:0.0000100 epoch_Time:23479.0min: [2024-01-06 17:08:47,857][model8_pretrain.py][INFO] Epoch:[0/2](900000/4588595) loss:2.806 lr:0.0000100 epoch_Time:23479.0min: [2024-01-06 17:08:47,857][model8_pretrain.py][INFO] Epoch:[0/2](900000/4588595) loss:2.683 lr:0.0000100 epoch_Time:23479.0min: [2024-01-06 
17:08:47,857][model8_pretrain.py][INFO] Epoch:[0/2](900000/4588595) loss:2.993 lr:0.0000100 epoch_Time:23479.0min: [2024-01-06 17:09:36,824][model8_pretrain.py][INFO] Epoch:[0/2](900100/4588595) loss:3.148 lr:0.0000100 epoch_Time:23480.0min: [2024-01-06 17:09:36,824][model8_pretrain.py][INFO] Epoch:[0/2](900100/4588595) loss:2.999 lr:0.0000100 epoch_Time:23480.0min: [2024-01-06 17:09:36,824][model8_pretrain.py][INFO] Epoch:[0/2](900100/4588595) loss:2.776 lr:0.0000100 epoch_Time:23480.0min: [2024-01-06 17:09:36,824][model8_pretrain.py][INFO] Epoch:[0/2](900100/4588595) loss:2.746 lr:0.0000100 epoch_Time:23480.0min: [2024-01-06 17:09:36,824][model8_pretrain.py][INFO] Epoch:[0/2](900100/4588595) loss:3.083 lr:0.0000100 epoch_Time:23480.0min: [2024-01-06 17:09:36,824][model8_pretrain.py][INFO] Epoch:[0/2](900100/4588595) loss:3.282 lr:0.0000100 epoch_Time:23480.0min: [2024-01-06 17:09:36,824][model8_pretrain.py][INFO] Epoch:[0/2](900100/4588595) loss:2.236 lr:0.0000100 epoch_Time:23480.0min: [2024-01-06 17:09:36,824][model8_pretrain.py][INFO] Epoch:[0/2](900100/4588595) loss:2.632 lr:0.0000100 epoch_Time:23480.0min: [2024-01-06 17:10:13,760][model8_pretrain.py][INFO] Epoch:[0/2](900200/4588595) loss:3.650 lr:0.0000100 epoch_Time:23479.0min: [2024-01-06 17:10:13,760][model8_pretrain.py][INFO] Epoch:[0/2](900200/4588595) loss:3.009 lr:0.0000100 epoch_Time:23479.0min: [2024-01-06 17:10:13,760][model8_pretrain.py][INFO] Epoch:[0/2](900200/4588595) loss:2.472 lr:0.0000100 epoch_Time:23479.0min: [2024-01-06 17:10:13,760][model8_pretrain.py][INFO] Epoch:[0/2](900200/4588595) loss:2.965 lr:0.0000100 epoch_Time:23479.0min: [2024-01-06 17:10:13,760][model8_pretrain.py][INFO] Epoch:[0/2](900200/4588595) loss:3.420 lr:0.0000100 epoch_Time:23479.0min: [2024-01-06 17:10:13,760][model8_pretrain.py][INFO] Epoch:[0/2](900200/4588595) loss:2.831 lr:0.0000100 epoch_Time:23479.0min: [2024-01-06 17:10:13,760][model8_pretrain.py][INFO] Epoch:[0/2](900200/4588595) loss:2.921 lr:0.0000100 
epoch_Time:23479.0min: [2024-01-06 17:10:13,760][model8_pretrain.py][INFO] Epoch:[0/2](900200/4588595) loss:2.504 lr:0.0000100 epoch_Time:23479.0min: [2024-01-06 17:10:50,702][model8_pretrain.py][INFO] Epoch:[0/2](900300/4588595) loss:3.026 lr:0.0000100 epoch_Time:23478.0min: [2024-01-06 17:10:50,702][model8_pretrain.py][INFO] Epoch:[0/2](900300/4588595) loss:2.887 lr:0.0000100 epoch_Time:23478.0min: [2024-01-06 17:10:50,702][model8_pretrain.py][INFO] Epoch:[0/2](900300/4588595) loss:2.994 lr:0.0000100 epoch_Time:23478.0min: [2024-01-06 17:10:50,702][model8_pretrain.py][INFO] Epoch:[0/2](900300/4588595) loss:2.714 lr:0.0000100 epoch_Time:23478.0min: [2024-01-06 17:10:50,702][model8_pretrain.py][INFO] Epoch:[0/2](900300/4588595) loss:3.009 lr:0.0000100 epoch_Time:23478.0min: [2024-01-06 17:10:50,702][model8_pretrain.py][INFO] Epoch:[0/2](900300/4588595) loss:2.322 lr:0.0000100 epoch_Time:23478.0min: [2024-01-06 17:10:50,702][model8_pretrain.py][INFO] Epoch:[0/2](900300/4588595) loss:2.704 lr:0.0000100 epoch_Time:23478.0min: [2024-01-06 17:10:50,703][model8_pretrain.py][INFO] Epoch:[0/2](900300/4588595) loss:2.548 lr:0.0000100 epoch_Time:23478.0min: [2024-01-06 17:11:27,609][model8_pretrain.py][INFO] Epoch:[0/2](900400/4588595) loss:3.138 lr:0.0000100 epoch_Time:23478.0min: [2024-01-06 17:11:27,609][model8_pretrain.py][INFO] Epoch:[0/2](900400/4588595) loss:3.037 lr:0.0000100 epoch_Time:23478.0min: [2024-01-06 17:11:27,610][model8_pretrain.py][INFO] Epoch:[0/2](900400/4588595) loss:2.762 lr:0.0000100 epoch_Time:23478.0min: [2024-01-06 17:11:27,610][model8_pretrain.py][INFO] Epoch:[0/2](900400/4588595) loss:2.505 lr:0.0000100 epoch_Time:23478.0min: [2024-01-06 17:11:27,610][model8_pretrain.py][INFO] Epoch:[0/2](900400/4588595) loss:3.122 lr:0.0000100 epoch_Time:23478.0min: [2024-01-06 17:11:27,610][model8_pretrain.py][INFO] Epoch:[0/2](900400/4588595) loss:2.968 lr:0.0000100 epoch_Time:23478.0min: [2024-01-06 17:11:27,610][model8_pretrain.py][INFO] 
Epoch:[0/2](900400/4588595) loss:2.259 lr:0.0000100 epoch_Time:23478.0min: [2024-01-06 17:11:27,611][model8_pretrain.py][INFO] Epoch:[0/2](900400/4588595) loss:2.730 lr:0.0000100 epoch_Time:23478.0min: [2024-01-06 17:12:04,542][model8_pretrain.py][INFO] Epoch:[0/2](900500/4588595) loss:2.750 lr:0.0000100 epoch_Time:23477.0min: [2024-01-06 17:12:04,542][model8_pretrain.py][INFO] Epoch:[0/2](900500/4588595) loss:2.810 lr:0.0000100 epoch_Time:23477.0min: [2024-01-06 17:12:04,542][model8_pretrain.py][INFO] Epoch:[0/2](900500/4588595) loss:2.869 lr:0.0000100 epoch_Time:23477.0min: [2024-01-06 17:12:04,542][model8_pretrain.py][INFO] Epoch:[0/2](900500/4588595) loss:2.688 lr:0.0000100 epoch_Time:23477.0min: [2024-01-06 17:12:04,542][model8_pretrain.py][INFO] Epoch:[0/2](900500/4588595) loss:3.076 lr:0.0000100 epoch_Time:23477.0min: [2024-01-06 17:12:04,542][model8_pretrain.py][INFO] Epoch:[0/2](900500/4588595) loss:1.999 lr:0.0000100 epoch_Time:23477.0min: [2024-01-06 17:12:04,542][model8_pretrain.py][INFO] Epoch:[0/2](900500/4588595) loss:3.226 lr:0.0000100 epoch_Time:23477.0min: [2024-01-06 17:12:04,542][model8_pretrain.py][INFO] Epoch:[0/2](900500/4588595) loss:3.207 lr:0.0000100 epoch_Time:23477.0min: [2024-01-06 17:12:41,473][model8_pretrain.py][INFO] Epoch:[0/2](900600/4588595) loss:2.317 lr:0.0000100 epoch_Time:23477.0min: [2024-01-06 17:12:41,473][model8_pretrain.py][INFO] Epoch:[0/2](900600/4588595) loss:3.037 lr:0.0000100 epoch_Time:23477.0min: [2024-01-06 17:12:41,473][model8_pretrain.py][INFO] Epoch:[0/2](900600/4588595) loss:2.599 lr:0.0000100 epoch_Time:23477.0min: [2024-01-06 17:12:41,473][model8_pretrain.py][INFO] Epoch:[0/2](900600/4588595) loss:2.361 lr:0.0000100 epoch_Time:23477.0min: [2024-01-06 17:12:41,473][model8_pretrain.py][INFO] Epoch:[0/2](900600/4588595) loss:2.512 lr:0.0000100 epoch_Time:23477.0min: [2024-01-06 17:12:41,473][model8_pretrain.py][INFO] Epoch:[0/2](900600/4588595) loss:2.802 lr:0.0000100 epoch_Time:23477.0min: [2024-01-06 
17:12:41,473][model8_pretrain.py][INFO] Epoch:[0/2](900600/4588595) loss:2.246 lr:0.0000100 epoch_Time:23477.0min: [2024-01-06 17:12:41,473][model8_pretrain.py][INFO] Epoch:[0/2](900600/4588595) loss:2.945 lr:0.0000100 epoch_Time:23477.0min: [2024-01-06 17:13:18,416][model8_pretrain.py][INFO] Epoch:[0/2](900700/4588595) loss:2.232 lr:0.0000100 epoch_Time:23476.0min: [2024-01-06 17:13:18,416][model8_pretrain.py][INFO] Epoch:[0/2](900700/4588595) loss:2.865 lr:0.0000100 epoch_Time:23476.0min: [2024-01-06 17:13:18,416][model8_pretrain.py][INFO] Epoch:[0/2](900700/4588595) loss:1.896 lr:0.0000100 epoch_Time:23476.0min: [2024-01-06 17:13:18,416][model8_pretrain.py][INFO] Epoch:[0/2](900700/4588595) loss:2.594 lr:0.0000100 epoch_Time:23476.0min: [2024-01-06 17:13:18,416][model8_pretrain.py][INFO] Epoch:[0/2](900700/4588595) loss:2.450 lr:0.0000100 epoch_Time:23476.0min: [2024-01-06 17:13:18,416][model8_pretrain.py][INFO] Epoch:[0/2](900700/4588595) loss:2.780 lr:0.0000100 epoch_Time:23476.0min: [2024-01-06 17:13:18,416][model8_pretrain.py][INFO] Epoch:[0/2](900700/4588595) loss:2.655 lr:0.0000100 epoch_Time:23476.0min: [2024-01-06 17:13:18,416][model8_pretrain.py][INFO] Epoch:[0/2](900700/4588595) loss:2.975 lr:0.0000100 epoch_Time:23476.0min: [2024-01-06 17:13:57,082][model8_pretrain.py][INFO] Epoch:[0/2](900800/4588595) loss:3.507 lr:0.0000100 epoch_Time:23475.0min: [2024-01-06 17:13:57,082][model8_pretrain.py][INFO] Epoch:[0/2](900800/4588595) loss:2.764 lr:0.0000100 epoch_Time:23475.0min: [2024-01-06 17:13:57,082][model8_pretrain.py][INFO] Epoch:[0/2](900800/4588595) loss:2.379 lr:0.0000100 epoch_Time:23475.0min: [2024-01-06 17:13:57,082][model8_pretrain.py][INFO] Epoch:[0/2](900800/4588595) loss:2.812 lr:0.0000100 epoch_Time:23475.0min: [2024-01-06 17:13:57,082][model8_pretrain.py][INFO] Epoch:[0/2](900800/4588595) loss:2.849 lr:0.0000100 epoch_Time:23475.0min: [2024-01-06 17:13:57,082][model8_pretrain.py][INFO] Epoch:[0/2](900800/4588595) loss:3.080 lr:0.0000100 
epoch_Time:23475.0min: [2024-01-06 17:13:57,082][model8_pretrain.py][INFO] Epoch:[0/2](900800/4588595) loss:2.567 lr:0.0000100 epoch_Time:23475.0min: [2024-01-06 17:13:57,083][model8_pretrain.py][INFO] Epoch:[0/2](900800/4588595) loss:2.671 lr:0.0000100 epoch_Time:23475.0min: [2024-01-06 17:14:45,818][model8_pretrain.py][INFO] Epoch:[0/2](900900/4588595) loss:2.931 lr:0.0000100 epoch_Time:23476.0min: [2024-01-06 17:14:45,818][model8_pretrain.py][INFO] Epoch:[0/2](900900/4588595) loss:3.092 lr:0.0000100 epoch_Time:23476.0min: [2024-01-06 17:14:45,818][model8_pretrain.py][INFO] Epoch:[0/2](900900/4588595) loss:3.328 lr:0.0000100 epoch_Time:23476.0min: [2024-01-06 17:14:45,818][model8_pretrain.py][INFO] Epoch:[0/2](900900/4588595) loss:3.054 lr:0.0000100 epoch_Time:23476.0min: [2024-01-06 17:14:45,818][model8_pretrain.py][INFO] Epoch:[0/2](900900/4588595) loss:2.536 lr:0.0000100 epoch_Time:23476.0min: [2024-01-06 17:14:45,818][model8_pretrain.py][INFO] Epoch:[0/2](900900/4588595) loss:3.008 lr:0.0000100 epoch_Time:23476.0min: [2024-01-06 17:14:45,818][model8_pretrain.py][INFO] Epoch:[0/2](900900/4588595) loss:2.874 lr:0.0000100 epoch_Time:23476.0min: [2024-01-06 17:14:45,818][model8_pretrain.py][INFO] Epoch:[0/2](900900/4588595) loss:3.285 lr:0.0000100 epoch_Time:23476.0min: [2024-01-06 17:15:22,758][model8_pretrain.py][INFO] Epoch:[0/2](901000/4588595) loss:2.737 lr:0.0000100 epoch_Time:23475.0min: [2024-01-06 17:15:22,758][model8_pretrain.py][INFO] Epoch:[0/2](901000/4588595) loss:2.949 lr:0.0000100 epoch_Time:23475.0min: [2024-01-06 17:15:22,758][model8_pretrain.py][INFO] Epoch:[0/2](901000/4588595) loss:2.071 lr:0.0000100 epoch_Time:23475.0min: [2024-01-06 17:15:22,758][model8_pretrain.py][INFO] Epoch:[0/2](901000/4588595) loss:3.410 lr:0.0000100 epoch_Time:23475.0min: [2024-01-06 17:15:22,758][model8_pretrain.py][INFO] Epoch:[0/2](901000/4588595) loss:2.919 lr:0.0000100 epoch_Time:23475.0min: [2024-01-06 17:15:22,758][model8_pretrain.py][INFO] 
Epoch:[0/2](901000/4588595) loss:2.859 lr:0.0000100 epoch_Time:23475.0min: [2024-01-06 17:15:22,758][model8_pretrain.py][INFO] Epoch:[0/2](901000/4588595) loss:3.192 lr:0.0000100 epoch_Time:23475.0min: [2024-01-06 17:15:22,758][model8_pretrain.py][INFO] Epoch:[0/2](901000/4588595) loss:2.751 lr:0.0000100 epoch_Time:23475.0min: [2024-01-06 17:15:59,698][model8_pretrain.py][INFO] Epoch:[0/2](901100/4588595) loss:2.955 lr:0.0000100 epoch_Time:23473.0min: [2024-01-06 17:15:59,698][model8_pretrain.py][INFO] Epoch:[0/2](901100/4588595) loss:2.376 lr:0.0000100 epoch_Time:23473.0min: [2024-01-06 17:15:59,698][model8_pretrain.py][INFO] Epoch:[0/2](901100/4588595) loss:3.107 lr:0.0000100 epoch_Time:23473.0min: [2024-01-06 17:15:59,698][model8_pretrain.py][INFO] Epoch:[0/2](901100/4588595) loss:2.552 lr:0.0000100 epoch_Time:23473.0min: [2024-01-06 17:15:59,698][model8_pretrain.py][INFO] Epoch:[0/2](901100/4588595) loss:2.643 lr:0.0000100 epoch_Time:23473.0min: [2024-01-06 17:15:59,698][model8_pretrain.py][INFO] Epoch:[0/2](901100/4588595) loss:2.609 lr:0.0000100 epoch_Time:23473.0min: [2024-01-06 17:15:59,698][model8_pretrain.py][INFO] Epoch:[0/2](901100/4588595) loss:2.349 lr:0.0000100 epoch_Time:23473.0min: [2024-01-06 17:15:59,698][model8_pretrain.py][INFO] Epoch:[0/2](901100/4588595) loss:2.771 lr:0.0000100 epoch_Time:23473.0min: [2024-01-06 17:16:36,644][model8_pretrain.py][INFO] Epoch:[0/2](901200/4588595) loss:2.535 lr:0.0000100 epoch_Time:23473.0min: [2024-01-06 17:16:36,644][model8_pretrain.py][INFO] Epoch:[0/2](901200/4588595) loss:3.376 lr:0.0000100 epoch_Time:23473.0min: [2024-01-06 17:16:36,644][model8_pretrain.py][INFO] Epoch:[0/2](901200/4588595) loss:2.258 lr:0.0000100 epoch_Time:23473.0min: [2024-01-06 17:16:36,645][model8_pretrain.py][INFO] Epoch:[0/2](901200/4588595) loss:2.943 lr:0.0000100 epoch_Time:23473.0min: [2024-01-06 17:16:36,645][model8_pretrain.py][INFO] Epoch:[0/2](901200/4588595) loss:2.911 lr:0.0000100 epoch_Time:23473.0min: [2024-01-06 
17:16:36,645][model8_pretrain.py][INFO] Epoch:[0/2](901200/4588595) loss:2.660 lr:0.0000100 epoch_Time:23473.0min: [2024-01-06 17:16:36,645][model8_pretrain.py][INFO] Epoch:[0/2](901200/4588595) loss:2.993 lr:0.0000100 epoch_Time:23473.0min: [2024-01-06 17:16:36,645][model8_pretrain.py][INFO] Epoch:[0/2](901200/4588595) loss:2.641 lr:0.0000100 epoch_Time:23473.0min: [2024-01-06 17:17:13,581][model8_pretrain.py][INFO] Epoch:[0/2](901300/4588595) loss:3.163 lr:0.0000100 epoch_Time:23472.0min: [2024-01-06 17:17:13,581][model8_pretrain.py][INFO] Epoch:[0/2](901300/4588595) loss:1.844 lr:0.0000100 epoch_Time:23472.0min: [2024-01-06 17:17:13,581][model8_pretrain.py][INFO] Epoch:[0/2](901300/4588595) loss:2.850 lr:0.0000100 epoch_Time:23472.0min: [2024-01-06 17:17:13,581][model8_pretrain.py][INFO] Epoch:[0/2](901300/4588595) loss:2.882 lr:0.0000100 epoch_Time:23472.0min: [2024-01-06 17:17:13,581][model8_pretrain.py][INFO] Epoch:[0/2](901300/4588595) loss:2.725 lr:0.0000100 epoch_Time:23472.0min: [2024-01-06 17:17:13,581][model8_pretrain.py][INFO] Epoch:[0/2](901300/4588595) loss:2.597 lr:0.0000100 epoch_Time:23472.0min: [2024-01-06 17:17:13,581][model8_pretrain.py][INFO] Epoch:[0/2](901300/4588595) loss:3.206 lr:0.0000100 epoch_Time:23472.0min: [2024-01-06 17:17:13,581][model8_pretrain.py][INFO] Epoch:[0/2](901300/4588595) loss:2.664 lr:0.0000100 epoch_Time:23472.0min: [2024-01-06 17:17:50,514][model8_pretrain.py][INFO] Epoch:[0/2](901400/4588595) loss:2.741 lr:0.0000100 epoch_Time:23471.0min: [2024-01-06 17:17:50,514][model8_pretrain.py][INFO] Epoch:[0/2](901400/4588595) loss:2.824 lr:0.0000100 epoch_Time:23471.0min: [2024-01-06 17:17:50,514][model8_pretrain.py][INFO] Epoch:[0/2](901400/4588595) loss:2.781 lr:0.0000100 epoch_Time:23471.0min: [2024-01-06 17:17:50,514][model8_pretrain.py][INFO] Epoch:[0/2](901400/4588595) loss:2.574 lr:0.0000100 epoch_Time:23471.0min: [2024-01-06 17:17:50,514][model8_pretrain.py][INFO] Epoch:[0/2](901400/4588595) loss:3.207 lr:0.0000100 
epoch_Time:23471.0min: [2024-01-06 17:17:50,514][model8_pretrain.py][INFO] Epoch:[0/2](901400/4588595) loss:3.024 lr:0.0000100 epoch_Time:23471.0min: [2024-01-06 17:17:50,514][model8_pretrain.py][INFO] Epoch:[0/2](901400/4588595) loss:2.331 lr:0.0000100 epoch_Time:23471.0min: [2024-01-06 17:17:50,514][model8_pretrain.py][INFO] Epoch:[0/2](901400/4588595) loss:2.751 lr:0.0000100 epoch_Time:23471.0min: [2024-01-06 17:18:27,443][model8_pretrain.py][INFO] Epoch:[0/2](901500/4588595) loss:2.688 lr:0.0000100 epoch_Time:23471.0min: [2024-01-06 17:18:27,443][model8_pretrain.py][INFO] Epoch:[0/2](901500/4588595) loss:3.295 lr:0.0000100 epoch_Time:23471.0min: [2024-01-06 17:18:27,443][model8_pretrain.py][INFO] Epoch:[0/2](901500/4588595) loss:2.673 lr:0.0000100 epoch_Time:23471.0min: [2024-01-06 17:18:27,443][model8_pretrain.py][INFO] Epoch:[0/2](901500/4588595) loss:2.439 lr:0.0000100 epoch_Time:23471.0min: [2024-01-06 17:18:27,443][model8_pretrain.py][INFO] Epoch:[0/2](901500/4588595) loss:2.492 lr:0.0000100 epoch_Time:23471.0min: [2024-01-06 17:18:27,443][model8_pretrain.py][INFO] Epoch:[0/2](901500/4588595) loss:3.081 lr:0.0000100 epoch_Time:23471.0min: [2024-01-06 17:18:27,443][model8_pretrain.py][INFO] Epoch:[0/2](901500/4588595) loss:3.136 lr:0.0000100 epoch_Time:23471.0min: [2024-01-06 17:18:27,443][model8_pretrain.py][INFO] Epoch:[0/2](901500/4588595) loss:3.080 lr:0.0000100 epoch_Time:23471.0min: [2024-01-06 17:19:06,053][model8_pretrain.py][INFO] Epoch:[0/2](901600/4588595) loss:3.058 lr:0.0000100 epoch_Time:23470.0min: [2024-01-06 17:19:06,053][model8_pretrain.py][INFO] Epoch:[0/2](901600/4588595) loss:3.069 lr:0.0000100 epoch_Time:23470.0min: [2024-01-06 17:19:06,053][model8_pretrain.py][INFO] Epoch:[0/2](901600/4588595) loss:3.091 lr:0.0000100 epoch_Time:23470.0min: [2024-01-06 17:19:06,053][model8_pretrain.py][INFO] Epoch:[0/2](901600/4588595) loss:2.834 lr:0.0000100 epoch_Time:23470.0min: [2024-01-06 17:19:06,053][model8_pretrain.py][INFO] 
Epoch:[0/2](901600/4588595) loss:2.679 lr:0.0000100 epoch_Time:23470.0min: [2024-01-06 17:19:06,053][model8_pretrain.py][INFO] Epoch:[0/2](901600/4588595) loss:2.614 lr:0.0000100 epoch_Time:23470.0min: [2024-01-06 17:19:06,054][model8_pretrain.py][INFO] Epoch:[0/2](901600/4588595) loss:2.869 lr:0.0000100 epoch_Time:23470.0min: [2024-01-06 17:19:06,054][model8_pretrain.py][INFO] Epoch:[0/2](901600/4588595) loss:2.504 lr:0.0000100 epoch_Time:23470.0min: [2024-01-06 17:19:54,933][model8_pretrain.py][INFO] Epoch:[0/2](901700/4588595) loss:2.996 lr:0.0000100 epoch_Time:23470.0min: [2024-01-06 17:19:54,933][model8_pretrain.py][INFO] Epoch:[0/2](901700/4588595) loss:2.846 lr:0.0000100 epoch_Time:23470.0min: [2024-01-06 17:19:54,933][model8_pretrain.py][INFO] Epoch:[0/2](901700/4588595) loss:3.116 lr:0.0000100 epoch_Time:23470.0min: [2024-01-06 17:19:54,933][model8_pretrain.py][INFO] Epoch:[0/2](901700/4588595) loss:2.583 lr:0.0000100 epoch_Time:23470.0min: [2024-01-06 17:19:54,933][model8_pretrain.py][INFO] Epoch:[0/2](901700/4588595) loss:2.703 lr:0.0000100 epoch_Time:23470.0min: [2024-01-06 17:19:54,933][model8_pretrain.py][INFO] Epoch:[0/2](901700/4588595) loss:3.541 lr:0.0000100 epoch_Time:23470.0min: [2024-01-06 17:19:54,933][model8_pretrain.py][INFO] Epoch:[0/2](901700/4588595) loss:2.916 lr:0.0000100 epoch_Time:23470.0min: [2024-01-06 17:19:54,933][model8_pretrain.py][INFO] Epoch:[0/2](901700/4588595) loss:2.469 lr:0.0000100 epoch_Time:23470.0min: [2024-01-06 17:20:31,871][model8_pretrain.py][INFO] Epoch:[0/2](901800/4588595) loss:3.305 lr:0.0000100 epoch_Time:23470.0min: [2024-01-06 17:20:31,871][model8_pretrain.py][INFO] Epoch:[0/2](901800/4588595) loss:2.465 lr:0.0000100 epoch_Time:23470.0min: [2024-01-06 17:20:31,871][model8_pretrain.py][INFO] Epoch:[0/2](901800/4588595) loss:3.018 lr:0.0000100 epoch_Time:23470.0min: [2024-01-06 17:20:31,871][model8_pretrain.py][INFO] Epoch:[0/2](901800/4588595) loss:2.796 lr:0.0000100 epoch_Time:23470.0min: [2024-01-06 
17:20:31,871][model8_pretrain.py][INFO] Epoch:[0/2](901800/4588595) loss:2.589 lr:0.0000100 epoch_Time:23470.0min: [2024-01-06 17:20:31,871][model8_pretrain.py][INFO] Epoch:[0/2](901800/4588595) loss:3.135 lr:0.0000100 epoch_Time:23470.0min: [2024-01-06 17:20:31,871][model8_pretrain.py][INFO] Epoch:[0/2](901800/4588595) loss:2.954 lr:0.0000100 epoch_Time:23470.0min: [2024-01-06 17:20:31,871][model8_pretrain.py][INFO] Epoch:[0/2](901800/4588595) loss:3.427 lr:0.0000100 epoch_Time:23470.0min: [2024-01-06 17:21:08,816][model8_pretrain.py][INFO] Epoch:[0/2](901900/4588595) loss:3.178 lr:0.0000100 epoch_Time:23469.0min: [2024-01-06 17:21:08,816][model8_pretrain.py][INFO] Epoch:[0/2](901900/4588595) loss:3.200 lr:0.0000100 epoch_Time:23469.0min: [2024-01-06 17:21:08,816][model8_pretrain.py][INFO] Epoch:[0/2](901900/4588595) loss:3.332 lr:0.0000100 epoch_Time:23469.0min: [2024-01-06 17:21:08,816][model8_pretrain.py][INFO] Epoch:[0/2](901900/4588595) loss:2.850 lr:0.0000100 epoch_Time:23469.0min: [2024-01-06 17:21:08,816][model8_pretrain.py][INFO] Epoch:[0/2](901900/4588595) loss:3.224 lr:0.0000100 epoch_Time:23469.0min: [2024-01-06 17:21:08,816][model8_pretrain.py][INFO] Epoch:[0/2](901900/4588595) loss:3.335 lr:0.0000100 epoch_Time:23469.0min: [2024-01-06 17:21:08,816][model8_pretrain.py][INFO] Epoch:[0/2](901900/4588595) loss:2.436 lr:0.0000100 epoch_Time:23469.0min: [2024-01-06 17:21:08,816][model8_pretrain.py][INFO] Epoch:[0/2](901900/4588595) loss:2.418 lr:0.0000100 epoch_Time:23469.0min: [2024-01-06 17:21:45,753][model8_pretrain.py][INFO] Epoch:[0/2](902000/4588595) loss:2.836 lr:0.0000100 epoch_Time:23469.0min: [2024-01-06 17:21:45,753][model8_pretrain.py][INFO] Epoch:[0/2](902000/4588595) loss:2.543 lr:0.0000100 epoch_Time:23469.0min: [2024-01-06 17:21:45,753][model8_pretrain.py][INFO] Epoch:[0/2](902000/4588595) loss:2.967 lr:0.0000100 epoch_Time:23469.0min: [2024-01-06 17:21:45,753][model8_pretrain.py][INFO] Epoch:[0/2](902000/4588595) loss:2.835 lr:0.0000100 
epoch_Time:23469.0min: [2024-01-06 17:21:45,753][model8_pretrain.py][INFO] Epoch:[0/2](902000/4588595) loss:2.675 lr:0.0000100 epoch_Time:23469.0min: [2024-01-06 17:21:45,753][model8_pretrain.py][INFO] Epoch:[0/2](902000/4588595) loss:2.685 lr:0.0000100 epoch_Time:23469.0min: [2024-01-06 17:21:45,753][model8_pretrain.py][INFO] Epoch:[0/2](902000/4588595) loss:2.626 lr:0.0000100 epoch_Time:23469.0min: [2024-01-06 17:21:45,753][model8_pretrain.py][INFO] Epoch:[0/2](902000/4588595) loss:2.656 lr:0.0000100 epoch_Time:23469.0min: [2024-01-06 17:22:22,695][model8_pretrain.py][INFO] Epoch:[0/2](902100/4588595) loss:2.595 lr:0.0000100 epoch_Time:23468.0min: [2024-01-06 17:22:22,695][model8_pretrain.py][INFO] Epoch:[0/2](902100/4588595) loss:3.121 lr:0.0000100 epoch_Time:23468.0min: [2024-01-06 17:22:22,695][model8_pretrain.py][INFO] Epoch:[0/2](902100/4588595) loss:2.891 lr:0.0000100 epoch_Time:23468.0min: [2024-01-06 17:22:22,695][model8_pretrain.py][INFO] Epoch:[0/2](902100/4588595) loss:2.963 lr:0.0000100 epoch_Time:23468.0min: [2024-01-06 17:22:22,696][model8_pretrain.py][INFO] Epoch:[0/2](902100/4588595) loss:2.884 lr:0.0000100 epoch_Time:23468.0min: [2024-01-06 17:22:22,695][model8_pretrain.py][INFO] Epoch:[0/2](902100/4588595) loss:2.821 lr:0.0000100 epoch_Time:23468.0min: [2024-01-06 17:22:22,696][model8_pretrain.py][INFO] Epoch:[0/2](902100/4588595) loss:2.462 lr:0.0000100 epoch_Time:23468.0min: [2024-01-06 17:22:22,696][model8_pretrain.py][INFO] Epoch:[0/2](902100/4588595) loss:2.733 lr:0.0000100 epoch_Time:23468.0min: [2024-01-06 17:22:59,632][model8_pretrain.py][INFO] Epoch:[0/2](902200/4588595) loss:2.438 lr:0.0000100 epoch_Time:23466.0min: [2024-01-06 17:22:59,632][model8_pretrain.py][INFO] Epoch:[0/2](902200/4588595) loss:2.691 lr:0.0000100 epoch_Time:23466.0min: [2024-01-06 17:22:59,632][model8_pretrain.py][INFO] Epoch:[0/2](902200/4588595) loss:2.903 lr:0.0000100 epoch_Time:23466.0min: [2024-01-06 17:22:59,632][model8_pretrain.py][INFO] 
Epoch:[0/2](902200/4588595) loss:2.945 lr:0.0000100 epoch_Time:23466.0min: [2024-01-06 17:22:59,632][model8_pretrain.py][INFO] Epoch:[0/2](902200/4588595) loss:2.291 lr:0.0000100 epoch_Time:23466.0min: [2024-01-06 17:22:59,632][model8_pretrain.py][INFO] Epoch:[0/2](902200/4588595) loss:1.994 lr:0.0000100 epoch_Time:23466.0min: [2024-01-06 17:22:59,632][model8_pretrain.py][INFO] Epoch:[0/2](902200/4588595) loss:1.745 lr:0.0000100 epoch_Time:23466.0min: [2024-01-06 17:22:59,632][model8_pretrain.py][INFO] Epoch:[0/2](902200/4588595) loss:2.798 lr:0.0000100 epoch_Time:23466.0min: [2024-01-06 17:23:36,564][model8_pretrain.py][INFO] Epoch:[0/2](902300/4588595) loss:2.964 lr:0.0000100 epoch_Time:23466.0min: [2024-01-06 17:23:36,564][model8_pretrain.py][INFO] Epoch:[0/2](902300/4588595) loss:2.630 lr:0.0000100 epoch_Time:23466.0min: [2024-01-06 17:23:36,564][model8_pretrain.py][INFO] Epoch:[0/2](902300/4588595) loss:2.911 lr:0.0000100 epoch_Time:23466.0min: [2024-01-06 17:23:36,564][model8_pretrain.py][INFO] Epoch:[0/2](902300/4588595) loss:2.283 lr:0.0000100 epoch_Time:23466.0min: [2024-01-06 17:23:36,564][model8_pretrain.py][INFO] Epoch:[0/2](902300/4588595) loss:2.881 lr:0.0000100 epoch_Time:23466.0min: [2024-01-06 17:23:36,564][model8_pretrain.py][INFO] Epoch:[0/2](902300/4588595) loss:2.877 lr:0.0000100 epoch_Time:23466.0min: [2024-01-06 17:23:36,564][model8_pretrain.py][INFO] Epoch:[0/2](902300/4588595) loss:2.627 lr:0.0000100 epoch_Time:23466.0min: [2024-01-06 17:23:36,564][model8_pretrain.py][INFO] Epoch:[0/2](902300/4588595) loss:2.743 lr:0.0000100 epoch_Time:23466.0min: [2024-01-06 17:24:15,192][model8_pretrain.py][INFO] Epoch:[0/2](902400/4588595) loss:2.838 lr:0.0000100 epoch_Time:23465.0min: [2024-01-06 17:24:15,192][model8_pretrain.py][INFO] Epoch:[0/2](902400/4588595) loss:2.060 lr:0.0000100 epoch_Time:23465.0min: [2024-01-06 17:24:15,192][model8_pretrain.py][INFO] Epoch:[0/2](902400/4588595) loss:2.729 lr:0.0000100 epoch_Time:23465.0min: [2024-01-06 
17:24:15,192][model8_pretrain.py][INFO] Epoch:[0/2](902400/4588595) loss:2.927 lr:0.0000100 epoch_Time:23465.0min: [2024-01-06 17:24:15,192][model8_pretrain.py][INFO] Epoch:[0/2](902400/4588595) loss:3.226 lr:0.0000100 epoch_Time:23465.0min: [2024-01-06 17:24:15,192][model8_pretrain.py][INFO] Epoch:[0/2](902400/4588595) loss:2.526 lr:0.0000100 epoch_Time:23465.0min: [2024-01-06 17:24:15,192][model8_pretrain.py][INFO] Epoch:[0/2](902400/4588595) loss:2.294 lr:0.0000100 epoch_Time:23465.0min: [2024-01-06 17:24:15,193][model8_pretrain.py][INFO] Epoch:[0/2](902400/4588595) loss:2.796 lr:0.0000100 epoch_Time:23465.0min: [2024-01-06 17:25:04,021][model8_pretrain.py][INFO] Epoch:[0/2](902500/4588595) loss:2.994 lr:0.0000100 epoch_Time:23465.0min: [2024-01-06 17:25:04,021][model8_pretrain.py][INFO] Epoch:[0/2](902500/4588595) loss:2.738 lr:0.0000100 epoch_Time:23465.0min: [2024-01-06 17:25:04,021][model8_pretrain.py][INFO] Epoch:[0/2](902500/4588595) loss:2.799 lr:0.0000100 epoch_Time:23465.0min: [2024-01-06 17:25:04,021][model8_pretrain.py][INFO] Epoch:[0/2](902500/4588595) loss:3.048 lr:0.0000100 epoch_Time:23465.0min: [2024-01-06 17:25:04,021][model8_pretrain.py][INFO] Epoch:[0/2](902500/4588595) loss:3.063 lr:0.0000100 epoch_Time:23465.0min: [2024-01-06 17:25:04,021][model8_pretrain.py][INFO] Epoch:[0/2](902500/4588595) loss:3.353 lr:0.0000100 epoch_Time:23465.0min: [2024-01-06 17:25:04,021][model8_pretrain.py][INFO] Epoch:[0/2](902500/4588595) loss:2.456 lr:0.0000100 epoch_Time:23465.0min: [2024-01-06 17:25:04,021][model8_pretrain.py][INFO] Epoch:[0/2](902500/4588595) loss:2.557 lr:0.0000100 epoch_Time:23465.0min: [2024-01-06 17:25:40,980][model8_pretrain.py][INFO] Epoch:[0/2](902600/4588595) loss:2.344 lr:0.0000100 epoch_Time:23465.0min: [2024-01-06 17:25:40,980][model8_pretrain.py][INFO] Epoch:[0/2](902600/4588595) loss:2.771 lr:0.0000100 epoch_Time:23465.0min: [2024-01-06 17:25:40,980][model8_pretrain.py][INFO] Epoch:[0/2](902600/4588595) loss:2.102 lr:0.0000100 
epoch_Time:23465.0min: [2024-01-06 17:25:40,980][model8_pretrain.py][INFO] Epoch:[0/2](902600/4588595) loss:2.587 lr:0.0000100 epoch_Time:23465.0min: [2024-01-06 17:25:40,980][model8_pretrain.py][INFO] Epoch:[0/2](902600/4588595) loss:2.973 lr:0.0000100 epoch_Time:23465.0min: [2024-01-06 17:25:40,980][model8_pretrain.py][INFO] Epoch:[0/2](902600/4588595) loss:2.849 lr:0.0000100 epoch_Time:23465.0min: [2024-01-06 17:25:40,980][model8_pretrain.py][INFO] Epoch:[0/2](902600/4588595) loss:2.727 lr:0.0000100 epoch_Time:23465.0min: [2024-01-06 17:25:40,980][model8_pretrain.py][INFO] Epoch:[0/2](902600/4588595) loss:2.980 lr:0.0000100 epoch_Time:23465.0min: [2024-01-06 17:26:17,938][model8_pretrain.py][INFO] Epoch:[0/2](902700/4588595) loss:2.741 lr:0.0000100 epoch_Time:23464.0min: [2024-01-06 17:26:17,938][model8_pretrain.py][INFO] Epoch:[0/2](902700/4588595) loss:2.319 lr:0.0000100 epoch_Time:23464.0min: [2024-01-06 17:26:17,938][model8_pretrain.py][INFO] Epoch:[0/2](902700/4588595) loss:2.980 lr:0.0000100 epoch_Time:23464.0min: [2024-01-06 17:26:17,938][model8_pretrain.py][INFO] Epoch:[0/2](902700/4588595) loss:3.024 lr:0.0000100 epoch_Time:23464.0min: [2024-01-06 17:26:17,938][model8_pretrain.py][INFO] Epoch:[0/2](902700/4588595) loss:2.704 lr:0.0000100 epoch_Time:23464.0min: [2024-01-06 17:26:17,938][model8_pretrain.py][INFO] Epoch:[0/2](902700/4588595) loss:2.743 lr:0.0000100 epoch_Time:23464.0min: [2024-01-06 17:26:17,938][model8_pretrain.py][INFO] Epoch:[0/2](902700/4588595) loss:2.872 lr:0.0000100 epoch_Time:23464.0min: [2024-01-06 17:26:17,939][model8_pretrain.py][INFO] Epoch:[0/2](902700/4588595) loss:2.780 lr:0.0000100 epoch_Time:23464.0min: [2024-01-06 17:26:54,895][model8_pretrain.py][INFO] Epoch:[0/2](902800/4588595) loss:2.939 lr:0.0000100 epoch_Time:23463.0min: [2024-01-06 17:26:54,895][model8_pretrain.py][INFO] Epoch:[0/2](902800/4588595) loss:2.936 lr:0.0000100 epoch_Time:23463.0min: [2024-01-06 17:26:54,895][model8_pretrain.py][INFO] 
Epoch:[0/2](902800/4588595) loss:2.927 lr:0.0000100 epoch_Time:23463.0min: [2024-01-06 17:26:54,895][model8_pretrain.py][INFO] Epoch:[0/2](902800/4588595) loss:2.383 lr:0.0000100 epoch_Time:23463.0min: [2024-01-06 17:26:54,895][model8_pretrain.py][INFO] Epoch:[0/2](902800/4588595) loss:2.520 lr:0.0000100 epoch_Time:23463.0min: [2024-01-06 17:26:54,895][model8_pretrain.py][INFO] Epoch:[0/2](902800/4588595) loss:2.228 lr:0.0000100 epoch_Time:23463.0min: [2024-01-06 17:26:54,895][model8_pretrain.py][INFO] Epoch:[0/2](902800/4588595) loss:2.713 lr:0.0000100 epoch_Time:23463.0min: [2024-01-06 17:26:54,895][model8_pretrain.py][INFO] Epoch:[0/2](902800/4588595) loss:2.569 lr:0.0000100 epoch_Time:23463.0min: [2024-01-06 17:27:31,857][model8_pretrain.py][INFO] Epoch:[0/2](902900/4588595) loss:3.159 lr:0.0000100 epoch_Time:23463.0min: [2024-01-06 17:27:31,857][model8_pretrain.py][INFO] Epoch:[0/2](902900/4588595) loss:2.977 lr:0.0000100 epoch_Time:23463.0min: [2024-01-06 17:27:31,857][model8_pretrain.py][INFO] Epoch:[0/2](902900/4588595) loss:2.931 lr:0.0000100 epoch_Time:23463.0min: [2024-01-06 17:27:31,857][model8_pretrain.py][INFO] Epoch:[0/2](902900/4588595) loss:2.141 lr:0.0000100 epoch_Time:23463.0min: [2024-01-06 17:27:31,857][model8_pretrain.py][INFO] Epoch:[0/2](902900/4588595) loss:2.856 lr:0.0000100 epoch_Time:23463.0min: [2024-01-06 17:27:31,857][model8_pretrain.py][INFO] Epoch:[0/2](902900/4588595) loss:2.632 lr:0.0000100 epoch_Time:23463.0min: [2024-01-06 17:27:31,857][model8_pretrain.py][INFO] Epoch:[0/2](902900/4588595) loss:3.495 lr:0.0000100 epoch_Time:23463.0min: [2024-01-06 17:27:31,858][model8_pretrain.py][INFO] Epoch:[0/2](902900/4588595) loss:3.111 lr:0.0000100 epoch_Time:23463.0min: [2024-01-06 17:28:08,823][model8_pretrain.py][INFO] Epoch:[0/2](903000/4588595) loss:1.954 lr:0.0000100 epoch_Time:23462.0min: [2024-01-06 17:28:08,823][model8_pretrain.py][INFO] Epoch:[0/2](903000/4588595) loss:2.983 lr:0.0000100 epoch_Time:23462.0min: [2024-01-06 
17:28:08,823][model8_pretrain.py][INFO] Epoch:[0/2](903000/4588595) loss:3.036 lr:0.0000100 epoch_Time:23462.0min: [2024-01-06 17:28:08,823][model8_pretrain.py][INFO] Epoch:[0/2](903000/4588595) loss:2.373 lr:0.0000100 epoch_Time:23462.0min: [2024-01-06 17:28:08,823][model8_pretrain.py][INFO] Epoch:[0/2](903000/4588595) loss:2.893 lr:0.0000100 epoch_Time:23462.0min: [2024-01-06 17:28:08,823][model8_pretrain.py][INFO] Epoch:[0/2](903000/4588595) loss:2.618 lr:0.0000100 epoch_Time:23462.0min: [2024-01-06 17:28:08,823][model8_pretrain.py][INFO] Epoch:[0/2](903000/4588595) loss:2.858 lr:0.0000100 epoch_Time:23462.0min: [2024-01-06 17:28:08,824][model8_pretrain.py][INFO] Epoch:[0/2](903000/4588595) loss:3.050 lr:0.0000100 epoch_Time:23462.0min: [2024-01-06 17:28:45,783][model8_pretrain.py][INFO] Epoch:[0/2](903100/4588595) loss:2.754 lr:0.0000100 epoch_Time:23462.0min: [2024-01-06 17:28:45,783][model8_pretrain.py][INFO] Epoch:[0/2](903100/4588595) loss:2.672 lr:0.0000100 epoch_Time:23462.0min: [2024-01-06 17:28:45,783][model8_pretrain.py][INFO] Epoch:[0/2](903100/4588595) loss:3.123 lr:0.0000100 epoch_Time:23462.0min: [2024-01-06 17:28:45,783][model8_pretrain.py][INFO] Epoch:[0/2](903100/4588595) loss:2.861 lr:0.0000100 epoch_Time:23462.0min: [2024-01-06 17:28:45,783][model8_pretrain.py][INFO] Epoch:[0/2](903100/4588595) loss:3.280 lr:0.0000100 epoch_Time:23462.0min: [2024-01-06 17:28:45,783][model8_pretrain.py][INFO] Epoch:[0/2](903100/4588595) loss:2.771 lr:0.0000100 epoch_Time:23462.0min: [2024-01-06 17:28:45,783][model8_pretrain.py][INFO] Epoch:[0/2](903100/4588595) loss:2.815 lr:0.0000100 epoch_Time:23462.0min: [2024-01-06 17:28:45,783][model8_pretrain.py][INFO] Epoch:[0/2](903100/4588595) loss:2.634 lr:0.0000100 epoch_Time:23462.0min: [2024-01-06 17:29:24,436][model8_pretrain.py][INFO] Epoch:[0/2](903200/4588595) loss:2.620 lr:0.0000100 epoch_Time:23461.0min: [2024-01-06 17:29:24,436][model8_pretrain.py][INFO] Epoch:[0/2](903200/4588595) loss:2.428 lr:0.0000100 
epoch_Time:23461.0min: [2024-01-06 17:29:24,436][model8_pretrain.py][INFO] Epoch:[0/2](903200/4588595) loss:2.545 lr:0.0000100 epoch_Time:23461.0min: [2024-01-06 17:29:24,436][model8_pretrain.py][INFO] Epoch:[0/2](903200/4588595) loss:2.463 lr:0.0000100 epoch_Time:23461.0min: [2024-01-06 17:29:24,436][model8_pretrain.py][INFO] Epoch:[0/2](903200/4588595) loss:2.560 lr:0.0000100 epoch_Time:23461.0min: [2024-01-06 17:29:24,436][model8_pretrain.py][INFO] Epoch:[0/2](903200/4588595) loss:2.574 lr:0.0000100 epoch_Time:23461.0min: [2024-01-06 17:29:24,437][model8_pretrain.py][INFO] Epoch:[0/2](903200/4588595) loss:2.547 lr:0.0000100 epoch_Time:23461.0min: [2024-01-06 17:29:24,437][model8_pretrain.py][INFO] Epoch:[0/2](903200/4588595) loss:2.837 lr:0.0000100 epoch_Time:23461.0min: [2024-01-06 17:30:13,381][model8_pretrain.py][INFO] Epoch:[0/2](903300/4588595) loss:2.730 lr:0.0000100 epoch_Time:23461.0min: [2024-01-06 17:30:13,381][model8_pretrain.py][INFO] Epoch:[0/2](903300/4588595) loss:2.833 lr:0.0000100 epoch_Time:23461.0min: [2024-01-06 17:30:13,381][model8_pretrain.py][INFO] Epoch:[0/2](903300/4588595) loss:2.856 lr:0.0000100 epoch_Time:23461.0min: [2024-01-06 17:30:13,381][model8_pretrain.py][INFO] Epoch:[0/2](903300/4588595) loss:2.905 lr:0.0000100 epoch_Time:23461.0min: [2024-01-06 17:30:13,381][model8_pretrain.py][INFO] Epoch:[0/2](903300/4588595) loss:2.587 lr:0.0000100 epoch_Time:23461.0min: [2024-01-06 17:30:13,381][model8_pretrain.py][INFO] Epoch:[0/2](903300/4588595) loss:2.962 lr:0.0000100 epoch_Time:23461.0min: [2024-01-06 17:30:13,381][model8_pretrain.py][INFO] Epoch:[0/2](903300/4588595) loss:2.417 lr:0.0000100 epoch_Time:23461.0min: [2024-01-06 17:30:13,381][model8_pretrain.py][INFO] Epoch:[0/2](903300/4588595) loss:2.769 lr:0.0000100 epoch_Time:23461.0min: [2024-01-06 17:30:50,323][model8_pretrain.py][INFO] Epoch:[0/2](903400/4588595) loss:2.718 lr:0.0000100 epoch_Time:23459.0min: [2024-01-06 17:30:50,323][model8_pretrain.py][INFO] 
Epoch:[0/2](903400/4588595) loss:2.137 lr:0.0000100 epoch_Time:23459.0min: [2024-01-06 17:30:50,323][model8_pretrain.py][INFO] Epoch:[0/2](903400/4588595) loss:2.832 lr:0.0000100 epoch_Time:23459.0min: [2024-01-06 17:30:50,323][model8_pretrain.py][INFO] Epoch:[0/2](903400/4588595) loss:2.590 lr:0.0000100 epoch_Time:23459.0min: [2024-01-06 17:30:50,323][model8_pretrain.py][INFO] Epoch:[0/2](903400/4588595) loss:3.103 lr:0.0000100 epoch_Time:23459.0min: [2024-01-06 17:30:50,323][model8_pretrain.py][INFO] Epoch:[0/2](903400/4588595) loss:3.012 lr:0.0000100 epoch_Time:23459.0min: [2024-01-06 17:30:50,323][model8_pretrain.py][INFO] Epoch:[0/2](903400/4588595) loss:2.855 lr:0.0000100 epoch_Time:23459.0min: [2024-01-06 17:30:50,323][model8_pretrain.py][INFO] Epoch:[0/2](903400/4588595) loss:2.634 lr:0.0000100 epoch_Time:23459.0min: [2024-01-06 17:31:27,266][model8_pretrain.py][INFO] Epoch:[0/2](903500/4588595) loss:2.525 lr:0.0000100 epoch_Time:23459.0min: [2024-01-06 17:31:27,266][model8_pretrain.py][INFO] Epoch:[0/2](903500/4588595) loss:2.616 lr:0.0000100 epoch_Time:23459.0min: [2024-01-06 17:31:27,266][model8_pretrain.py][INFO] Epoch:[0/2](903500/4588595) loss:2.610 lr:0.0000100 epoch_Time:23459.0min: [2024-01-06 17:31:27,266][model8_pretrain.py][INFO] Epoch:[0/2](903500/4588595) loss:2.851 lr:0.0000100 epoch_Time:23459.0min: [2024-01-06 17:31:27,266][model8_pretrain.py][INFO] Epoch:[0/2](903500/4588595) loss:3.184 lr:0.0000100 epoch_Time:23459.0min: [2024-01-06 17:31:27,266][model8_pretrain.py][INFO] Epoch:[0/2](903500/4588595) loss:2.656 lr:0.0000100 epoch_Time:23459.0min: [2024-01-06 17:31:27,266][model8_pretrain.py][INFO] Epoch:[0/2](903500/4588595) loss:2.719 lr:0.0000100 epoch_Time:23459.0min: [2024-01-06 17:31:27,266][model8_pretrain.py][INFO] Epoch:[0/2](903500/4588595) loss:2.398 lr:0.0000100 epoch_Time:23459.0min: [2024-01-06 17:32:04,207][model8_pretrain.py][INFO] Epoch:[0/2](903600/4588595) loss:2.763 lr:0.0000100 epoch_Time:23458.0min: [2024-01-06 
17:32:04,207][model8_pretrain.py][INFO] Epoch:[0/2](903600/4588595) loss:2.505 lr:0.0000100 epoch_Time:23458.0min: [2024-01-06 17:32:04,207][model8_pretrain.py][INFO] Epoch:[0/2](903600/4588595) loss:3.110 lr:0.0000100 epoch_Time:23458.0min: [2024-01-06 17:32:04,207][model8_pretrain.py][INFO] Epoch:[0/2](903600/4588595) loss:2.788 lr:0.0000100 epoch_Time:23458.0min: [2024-01-06 17:32:04,207][model8_pretrain.py][INFO] Epoch:[0/2](903600/4588595) loss:3.081 lr:0.0000100 epoch_Time:23458.0min: [2024-01-06 17:32:04,207][model8_pretrain.py][INFO] Epoch:[0/2](903600/4588595) loss:2.147 lr:0.0000100 epoch_Time:23458.0min: [2024-01-06 17:32:04,207][model8_pretrain.py][INFO] Epoch:[0/2](903600/4588595) loss:3.241 lr:0.0000100 epoch_Time:23458.0min: [2024-01-06 17:32:04,207][model8_pretrain.py][INFO] Epoch:[0/2](903600/4588595) loss:3.043 lr:0.0000100 epoch_Time:23458.0min: [2024-01-06 17:32:41,165][model8_pretrain.py][INFO] Epoch:[0/2](903700/4588595) loss:1.921 lr:0.0000100 epoch_Time:23458.0min: [2024-01-06 17:32:41,165][model8_pretrain.py][INFO] Epoch:[0/2](903700/4588595) loss:2.968 lr:0.0000100 epoch_Time:23458.0min: [2024-01-06 17:32:41,165][model8_pretrain.py][INFO] Epoch:[0/2](903700/4588595) loss:2.758 lr:0.0000100 epoch_Time:23458.0min: [2024-01-06 17:32:41,165][model8_pretrain.py][INFO] Epoch:[0/2](903700/4588595) loss:3.075 lr:0.0000100 epoch_Time:23458.0min: [2024-01-06 17:32:41,165][model8_pretrain.py][INFO] Epoch:[0/2](903700/4588595) loss:2.942 lr:0.0000100 epoch_Time:23458.0min: [2024-01-06 17:32:41,165][model8_pretrain.py][INFO] Epoch:[0/2](903700/4588595) loss:2.422 lr:0.0000100 epoch_Time:23458.0min: [2024-01-06 17:32:41,165][model8_pretrain.py][INFO] Epoch:[0/2](903700/4588595) loss:2.724 lr:0.0000100 epoch_Time:23458.0min: [2024-01-06 17:32:41,165][model8_pretrain.py][INFO] Epoch:[0/2](903700/4588595) loss:2.517 lr:0.0000100 epoch_Time:23458.0min: [2024-01-06 17:33:18,115][model8_pretrain.py][INFO] Epoch:[0/2](903800/4588595) loss:2.946 lr:0.0000100 
epoch_Time:23457.0min: [2024-01-06 17:33:18,115][model8_pretrain.py][INFO] Epoch:[0/2](903800/4588595) loss:3.071 lr:0.0000100 epoch_Time:23457.0min: [2024-01-06 17:33:18,115][model8_pretrain.py][INFO] Epoch:[0/2](903800/4588595) loss:2.801 lr:0.0000100 epoch_Time:23457.0min: [2024-01-06 17:33:18,116][model8_pretrain.py][INFO] Epoch:[0/2](903800/4588595) loss:2.040 lr:0.0000100 epoch_Time:23457.0min: [2024-01-06 17:33:18,116][model8_pretrain.py][INFO] Epoch:[0/2](903800/4588595) loss:2.773 lr:0.0000100 epoch_Time:23457.0min: [2024-01-06 17:33:18,117][model8_pretrain.py][INFO] Epoch:[0/2](903800/4588595) loss:3.150 lr:0.0000100 epoch_Time:23457.0min: [2024-01-06 17:33:18,115][model8_pretrain.py][INFO] Epoch:[0/2](903800/4588595) loss:2.446 lr:0.0000100 epoch_Time:23457.0min: [2024-01-06 17:33:18,116][model8_pretrain.py][INFO] Epoch:[0/2](903800/4588595) loss:2.894 lr:0.0000100 epoch_Time:23457.0min: [2024-01-06 17:33:55,065][model8_pretrain.py][INFO] Epoch:[0/2](903900/4588595) loss:2.932 lr:0.0000100 epoch_Time:23456.0min: [2024-01-06 17:33:55,065][model8_pretrain.py][INFO] Epoch:[0/2](903900/4588595) loss:2.975 lr:0.0000100 epoch_Time:23456.0min: [2024-01-06 17:33:55,065][model8_pretrain.py][INFO] Epoch:[0/2](903900/4588595) loss:2.737 lr:0.0000100 epoch_Time:23456.0min: [2024-01-06 17:33:55,065][model8_pretrain.py][INFO] Epoch:[0/2](903900/4588595) loss:2.874 lr:0.0000100 epoch_Time:23456.0min: [2024-01-06 17:33:55,065][model8_pretrain.py][INFO] Epoch:[0/2](903900/4588595) loss:2.211 lr:0.0000100 epoch_Time:23456.0min: [2024-01-06 17:33:55,066][model8_pretrain.py][INFO] Epoch:[0/2](903900/4588595) loss:2.815 lr:0.0000100 epoch_Time:23456.0min: [2024-01-06 17:33:55,066][model8_pretrain.py][INFO] Epoch:[0/2](903900/4588595) loss:2.978 lr:0.0000100 epoch_Time:23456.0min: [2024-01-06 17:33:55,066][model8_pretrain.py][INFO] Epoch:[0/2](903900/4588595) loss:3.048 lr:0.0000100 epoch_Time:23456.0min: [2024-01-06 17:34:33,721][model8_pretrain.py][INFO] 
Epoch:[0/2](904000/4588595) loss:2.856 lr:0.0000100 epoch_Time:23456.0min: [2024-01-06 17:34:33,721][model8_pretrain.py][INFO] Epoch:[0/2](904000/4588595) loss:2.621 lr:0.0000100 epoch_Time:23456.0min: [2024-01-06 17:34:33,721][model8_pretrain.py][INFO] Epoch:[0/2](904000/4588595) loss:2.874 lr:0.0000100 epoch_Time:23456.0min: [2024-01-06 17:34:33,721][model8_pretrain.py][INFO] Epoch:[0/2](904000/4588595) loss:2.383 lr:0.0000100 epoch_Time:23456.0min: [2024-01-06 17:34:33,721][model8_pretrain.py][INFO] Epoch:[0/2](904000/4588595) loss:2.861 lr:0.0000100 epoch_Time:23456.0min: [2024-01-06 17:34:33,721][model8_pretrain.py][INFO] Epoch:[0/2](904000/4588595) loss:2.947 lr:0.0000100 epoch_Time:23456.0min: [2024-01-06 17:34:33,721][model8_pretrain.py][INFO] Epoch:[0/2](904000/4588595) loss:2.067 lr:0.0000100 epoch_Time:23456.0min: [2024-01-06 17:34:33,721][model8_pretrain.py][INFO] Epoch:[0/2](904000/4588595) loss:2.759 lr:0.0000100 epoch_Time:23456.0min: [2024-01-06 17:35:22,660][model8_pretrain.py][INFO] Epoch:[0/2](904100/4588595) loss:2.614 lr:0.0000100 epoch_Time:23456.0min: [2024-01-06 17:35:22,660][model8_pretrain.py][INFO] Epoch:[0/2](904100/4588595) loss:3.063 lr:0.0000100 epoch_Time:23456.0min: [2024-01-06 17:35:22,660][model8_pretrain.py][INFO] Epoch:[0/2](904100/4588595) loss:2.696 lr:0.0000100 epoch_Time:23456.0min: [2024-01-06 17:35:22,660][model8_pretrain.py][INFO] Epoch:[0/2](904100/4588595) loss:2.672 lr:0.0000100 epoch_Time:23456.0min: [2024-01-06 17:35:22,660][model8_pretrain.py][INFO] Epoch:[0/2](904100/4588595) loss:3.126 lr:0.0000100 epoch_Time:23456.0min: [2024-01-06 17:35:22,660][model8_pretrain.py][INFO] Epoch:[0/2](904100/4588595) loss:2.650 lr:0.0000100 epoch_Time:23456.0min: [2024-01-06 17:35:22,660][model8_pretrain.py][INFO] Epoch:[0/2](904100/4588595) loss:2.808 lr:0.0000100 epoch_Time:23456.0min: [2024-01-06 17:35:22,660][model8_pretrain.py][INFO] Epoch:[0/2](904100/4588595) loss:3.207 lr:0.0000100 epoch_Time:23456.0min: [2024-01-06 
17:35:59,598][model8_pretrain.py][INFO] Epoch:[0/2](904200/4588595) loss:2.317 lr:0.0000100 epoch_Time:23455.0min: [2024-01-06 17:35:59,598][model8_pretrain.py][INFO] Epoch:[0/2](904200/4588595) loss:2.577 lr:0.0000100 epoch_Time:23455.0min: [2024-01-06 17:35:59,598][model8_pretrain.py][INFO] Epoch:[0/2](904200/4588595) loss:2.514 lr:0.0000100 epoch_Time:23455.0min: [2024-01-06 17:35:59,598][model8_pretrain.py][INFO] Epoch:[0/2](904200/4588595) loss:3.163 lr:0.0000100 epoch_Time:23455.0min: [2024-01-06 17:35:59,598][model8_pretrain.py][INFO] Epoch:[0/2](904200/4588595) loss:3.025 lr:0.0000100 epoch_Time:23455.0min: [2024-01-06 17:35:59,598][model8_pretrain.py][INFO] Epoch:[0/2](904200/4588595) loss:1.981 lr:0.0000100 epoch_Time:23455.0min: [2024-01-06 17:35:59,598][model8_pretrain.py][INFO] Epoch:[0/2](904200/4588595) loss:2.806 lr:0.0000100 epoch_Time:23455.0min: [2024-01-06 17:35:59,599][model8_pretrain.py][INFO] Epoch:[0/2](904200/4588595) loss:2.397 lr:0.0000100 epoch_Time:23455.0min: [2024-01-06 17:36:36,546][model8_pretrain.py][INFO] Epoch:[0/2](904300/4588595) loss:3.118 lr:0.0000100 epoch_Time:23455.0min: [2024-01-06 17:36:36,546][model8_pretrain.py][INFO] Epoch:[0/2](904300/4588595) loss:2.984 lr:0.0000100 epoch_Time:23455.0min: [2024-01-06 17:36:36,546][model8_pretrain.py][INFO] Epoch:[0/2](904300/4588595) loss:2.437 lr:0.0000100 epoch_Time:23455.0min: [2024-01-06 17:36:36,547][model8_pretrain.py][INFO] Epoch:[0/2](904300/4588595) loss:3.389 lr:0.0000100 epoch_Time:23455.0min: [2024-01-06 17:36:36,547][model8_pretrain.py][INFO] Epoch:[0/2](904300/4588595) loss:2.472 lr:0.0000100 epoch_Time:23455.0min: [2024-01-06 17:36:36,547][model8_pretrain.py][INFO] Epoch:[0/2](904300/4588595) loss:2.838 lr:0.0000100 epoch_Time:23455.0min: [2024-01-06 17:36:36,547][model8_pretrain.py][INFO] Epoch:[0/2](904300/4588595) loss:3.129 lr:0.0000100 epoch_Time:23455.0min: [2024-01-06 17:36:36,547][model8_pretrain.py][INFO] Epoch:[0/2](904300/4588595) loss:2.487 lr:0.0000100 
epoch_Time:23455.0min: [2024-01-06 17:37:13,491][model8_pretrain.py][INFO] Epoch:[0/2](904400/4588595) loss:3.028 lr:0.0000100 epoch_Time:23454.0min: [2024-01-06 17:37:13,491][model8_pretrain.py][INFO] Epoch:[0/2](904400/4588595) loss:2.988 lr:0.0000100 epoch_Time:23454.0min: [2024-01-06 17:37:13,491][model8_pretrain.py][INFO] Epoch:[0/2](904400/4588595) loss:1.383 lr:0.0000100 epoch_Time:23454.0min: [2024-01-06 17:37:13,491][model8_pretrain.py][INFO] Epoch:[0/2](904400/4588595) loss:2.716 lr:0.0000100 epoch_Time:23454.0min: [2024-01-06 17:37:13,491][model8_pretrain.py][INFO] Epoch:[0/2](904400/4588595) loss:2.524 lr:0.0000100 epoch_Time:23454.0min: [2024-01-06 17:37:13,491][model8_pretrain.py][INFO] Epoch:[0/2](904400/4588595) loss:2.677 lr:0.0000100 epoch_Time:23454.0min: [2024-01-06 17:37:13,491][model8_pretrain.py][INFO] Epoch:[0/2](904400/4588595) loss:2.424 lr:0.0000100 epoch_Time:23454.0min: [2024-01-06 17:37:13,491][model8_pretrain.py][INFO] Epoch:[0/2](904400/4588595) loss:2.565 lr:0.0000100 epoch_Time:23454.0min: [2024-01-06 17:37:50,436][model8_pretrain.py][INFO] Epoch:[0/2](904500/4588595) loss:3.182 lr:0.0000100 epoch_Time:23452.0min: [2024-01-06 17:37:50,436][model8_pretrain.py][INFO] Epoch:[0/2](904500/4588595) loss:2.714 lr:0.0000100 epoch_Time:23452.0min: [2024-01-06 17:37:50,436][model8_pretrain.py][INFO] Epoch:[0/2](904500/4588595) loss:2.996 lr:0.0000100 epoch_Time:23452.0min: [2024-01-06 17:37:50,436][model8_pretrain.py][INFO] Epoch:[0/2](904500/4588595) loss:2.264 lr:0.0000100 epoch_Time:23452.0min: [2024-01-06 17:37:50,436][model8_pretrain.py][INFO] Epoch:[0/2](904500/4588595) loss:3.125 lr:0.0000100 epoch_Time:23452.0min: [2024-01-06 17:37:50,437][model8_pretrain.py][INFO] Epoch:[0/2](904500/4588595) loss:3.015 lr:0.0000100 epoch_Time:23452.0min: [2024-01-06 17:37:50,437][model8_pretrain.py][INFO] Epoch:[0/2](904500/4588595) loss:3.045 lr:0.0000100 epoch_Time:23452.0min: [2024-01-06 17:37:50,437][model8_pretrain.py][INFO] 
Epoch:[0/2](904500/4588595) loss:2.683 lr:0.0000100 epoch_Time:23452.0min: [2024-01-06 17:38:27,379][model8_pretrain.py][INFO] Epoch:[0/2](904600/4588595) loss:2.156 lr:0.0000100 epoch_Time:23452.0min: [2024-01-06 17:38:27,379][model8_pretrain.py][INFO] Epoch:[0/2](904600/4588595) loss:2.396 lr:0.0000100 epoch_Time:23452.0min: [2024-01-06 17:38:27,379][model8_pretrain.py][INFO] Epoch:[0/2](904600/4588595) loss:2.601 lr:0.0000100 epoch_Time:23452.0min: [2024-01-06 17:38:27,379][model8_pretrain.py][INFO] Epoch:[0/2](904600/4588595) loss:2.906 lr:0.0000100 epoch_Time:23452.0min: [2024-01-06 17:38:27,379][model8_pretrain.py][INFO] Epoch:[0/2](904600/4588595) loss:2.527 lr:0.0000100 epoch_Time:23452.0min: [2024-01-06 17:38:27,379][model8_pretrain.py][INFO] Epoch:[0/2](904600/4588595) loss:3.223 lr:0.0000100 epoch_Time:23452.0min: [2024-01-06 17:38:27,379][model8_pretrain.py][INFO] Epoch:[0/2](904600/4588595) loss:2.941 lr:0.0000100 epoch_Time:23452.0min: [2024-01-06 17:38:27,380][model8_pretrain.py][INFO] Epoch:[0/2](904600/4588595) loss:3.081 lr:0.0000100 epoch_Time:23452.0min: [2024-01-06 17:39:04,344][model8_pretrain.py][INFO] Epoch:[0/2](904700/4588595) loss:2.631 lr:0.0000100 epoch_Time:23451.0min: [2024-01-06 17:39:04,344][model8_pretrain.py][INFO] Epoch:[0/2](904700/4588595) loss:3.003 lr:0.0000100 epoch_Time:23451.0min: [2024-01-06 17:39:04,344][model8_pretrain.py][INFO] Epoch:[0/2](904700/4588595) loss:3.090 lr:0.0000100 epoch_Time:23451.0min: [2024-01-06 17:39:04,344][model8_pretrain.py][INFO] Epoch:[0/2](904700/4588595) loss:3.150 lr:0.0000100 epoch_Time:23451.0min: [2024-01-06 17:39:04,344][model8_pretrain.py][INFO] Epoch:[0/2](904700/4588595) loss:3.224 lr:0.0000100 epoch_Time:23451.0min: [2024-01-06 17:39:04,344][model8_pretrain.py][INFO] Epoch:[0/2](904700/4588595) loss:2.968 lr:0.0000100 epoch_Time:23451.0min: [2024-01-06 17:39:04,344][model8_pretrain.py][INFO] Epoch:[0/2](904700/4588595) loss:2.927 lr:0.0000100 epoch_Time:23451.0min: [2024-01-06 
17:39:04,344][model8_pretrain.py][INFO] Epoch:[0/2](904700/4588595) loss:2.589 lr:0.0000100 epoch_Time:23451.0min: [2024-01-06 17:39:41,298][model8_pretrain.py][INFO] Epoch:[0/2](904800/4588595) loss:2.976 lr:0.0000100 epoch_Time:23451.0min: [2024-01-06 17:39:41,298][model8_pretrain.py][INFO] Epoch:[0/2](904800/4588595) loss:2.714 lr:0.0000100 epoch_Time:23451.0min: [2024-01-06 17:39:41,298][model8_pretrain.py][INFO] Epoch:[0/2](904800/4588595) loss:3.021 lr:0.0000100 epoch_Time:23451.0min: [2024-01-06 17:39:41,298][model8_pretrain.py][INFO] Epoch:[0/2](904800/4588595) loss:2.720 lr:0.0000100 epoch_Time:23451.0min: [2024-01-06 17:39:41,298][model8_pretrain.py][INFO] Epoch:[0/2](904800/4588595) loss:2.738 lr:0.0000100 epoch_Time:23451.0min: [2024-01-06 17:39:41,299][model8_pretrain.py][INFO] Epoch:[0/2](904800/4588595) loss:3.028 lr:0.0000100 epoch_Time:23451.0min: [2024-01-06 17:39:41,299][model8_pretrain.py][INFO] Epoch:[0/2](904800/4588595) loss:3.148 lr:0.0000100 epoch_Time:23451.0min: [2024-01-06 17:39:42,994][model8_pretrain.py][INFO] Epoch:[0/2](904800/4588595) loss:3.191 lr:0.0000100 epoch_Time:23451.0min: [2024-01-06 17:40:32,056][model8_pretrain.py][INFO] Epoch:[0/2](904900/4588595) loss:2.629 lr:0.0000100 epoch_Time:23451.0min: [2024-01-06 17:40:32,056][model8_pretrain.py][INFO] Epoch:[0/2](904900/4588595) loss:2.598 lr:0.0000100 epoch_Time:23451.0min: [2024-01-06 17:40:32,056][model8_pretrain.py][INFO] Epoch:[0/2](904900/4588595) loss:2.618 lr:0.0000100 epoch_Time:23451.0min: [2024-01-06 17:40:32,056][model8_pretrain.py][INFO] Epoch:[0/2](904900/4588595) loss:2.697 lr:0.0000100 epoch_Time:23451.0min: [2024-01-06 17:40:32,056][model8_pretrain.py][INFO] Epoch:[0/2](904900/4588595) loss:2.897 lr:0.0000100 epoch_Time:23451.0min: [2024-01-06 17:40:32,056][model8_pretrain.py][INFO] Epoch:[0/2](904900/4588595) loss:3.085 lr:0.0000100 epoch_Time:23451.0min: [2024-01-06 17:40:32,056][model8_pretrain.py][INFO] Epoch:[0/2](904900/4588595) loss:2.151 lr:0.0000100 
epoch_Time:23451.0min: [2024-01-06 17:40:32,056][model8_pretrain.py][INFO] Epoch:[0/2](904900/4588595) loss:3.002 lr:0.0000100 epoch_Time:23451.0min: [2024-01-06 17:41:08,978][model8_pretrain.py][INFO] Epoch:[0/2](905000/4588595) loss:3.229 lr:0.0000100 epoch_Time:23450.0min: [2024-01-06 17:41:08,978][model8_pretrain.py][INFO] Epoch:[0/2](905000/4588595) loss:2.206 lr:0.0000100 epoch_Time:23450.0min: [2024-01-06 17:41:08,978][model8_pretrain.py][INFO] Epoch:[0/2](905000/4588595) loss:2.905 lr:0.0000100 epoch_Time:23450.0min: [2024-01-06 17:41:08,978][model8_pretrain.py][INFO] Epoch:[0/2](905000/4588595) loss:3.349 lr:0.0000100 epoch_Time:23450.0min: [2024-01-06 17:41:08,978][model8_pretrain.py][INFO] Epoch:[0/2](905000/4588595) loss:3.127 lr:0.0000100 epoch_Time:23450.0min: [2024-01-06 17:41:08,978][model8_pretrain.py][INFO] Epoch:[0/2](905000/4588595) loss:2.810 lr:0.0000100 epoch_Time:23450.0min: [2024-01-06 17:41:08,978][model8_pretrain.py][INFO] Epoch:[0/2](905000/4588595) loss:3.464 lr:0.0000100 epoch_Time:23450.0min: [2024-01-06 17:41:08,979][model8_pretrain.py][INFO] Epoch:[0/2](905000/4588595) loss:2.803 lr:0.0000100 epoch_Time:23450.0min: [2024-01-06 17:41:45,922][model8_pretrain.py][INFO] Epoch:[0/2](905100/4588595) loss:2.669 lr:0.0000100 epoch_Time:23450.0min: [2024-01-06 17:41:45,922][model8_pretrain.py][INFO] Epoch:[0/2](905100/4588595) loss:2.541 lr:0.0000100 epoch_Time:23450.0min: [2024-01-06 17:41:45,923][model8_pretrain.py][INFO] Epoch:[0/2](905100/4588595) loss:3.284 lr:0.0000100 epoch_Time:23450.0min: [2024-01-06 17:41:45,923][model8_pretrain.py][INFO] Epoch:[0/2](905100/4588595) loss:2.496 lr:0.0000100 epoch_Time:23450.0min: [2024-01-06 17:41:45,923][model8_pretrain.py][INFO] Epoch:[0/2](905100/4588595) loss:2.608 lr:0.0000100 epoch_Time:23450.0min: [2024-01-06 17:41:45,923][model8_pretrain.py][INFO] Epoch:[0/2](905100/4588595) loss:2.594 lr:0.0000100 epoch_Time:23450.0min: [2024-01-06 17:41:45,923][model8_pretrain.py][INFO] 
Epoch:[0/2](905100/4588595) loss:2.768 lr:0.0000100 epoch_Time:23450.0min: [2024-01-06 17:41:45,923][model8_pretrain.py][INFO] Epoch:[0/2](905100/4588595) loss:2.940 lr:0.0000100 epoch_Time:23450.0min: [2024-01-06 17:42:22,875][model8_pretrain.py][INFO] Epoch:[0/2](905200/4588595) loss:2.744 lr:0.0000100 epoch_Time:23449.0min: [2024-01-06 17:42:22,875][model8_pretrain.py][INFO] Epoch:[0/2](905200/4588595) loss:2.939 lr:0.0000100 epoch_Time:23449.0min: [2024-01-06 17:42:22,875][model8_pretrain.py][INFO] Epoch:[0/2](905200/4588595) loss:2.809 lr:0.0000100 epoch_Time:23449.0min: [2024-01-06 17:42:22,875][model8_pretrain.py][INFO] Epoch:[0/2](905200/4588595) loss:2.948 lr:0.0000100 epoch_Time:23449.0min: [2024-01-06 17:42:22,875][model8_pretrain.py][INFO] Epoch:[0/2](905200/4588595) loss:3.017 lr:0.0000100 epoch_Time:23449.0min: [2024-01-06 17:42:22,875][model8_pretrain.py][INFO] Epoch:[0/2](905200/4588595) loss:2.534 lr:0.0000100 epoch_Time:23449.0min: [2024-01-06 17:42:22,875][model8_pretrain.py][INFO] Epoch:[0/2](905200/4588595) loss:2.414 lr:0.0000100 epoch_Time:23449.0min: [2024-01-06 17:42:22,875][model8_pretrain.py][INFO] Epoch:[0/2](905200/4588595) loss:2.655 lr:0.0000100 epoch_Time:23449.0min: [2024-01-06 17:42:59,810][model8_pretrain.py][INFO] Epoch:[0/2](905300/4588595) loss:2.580 lr:0.0000100 epoch_Time:23448.0min: [2024-01-06 17:42:59,810][model8_pretrain.py][INFO] Epoch:[0/2](905300/4588595) loss:2.853 lr:0.0000100 epoch_Time:23448.0min: [2024-01-06 17:42:59,810][model8_pretrain.py][INFO] Epoch:[0/2](905300/4588595) loss:2.794 lr:0.0000100 epoch_Time:23448.0min: [2024-01-06 17:42:59,810][model8_pretrain.py][INFO] Epoch:[0/2](905300/4588595) loss:2.436 lr:0.0000100 epoch_Time:23448.0min: [2024-01-06 17:42:59,810][model8_pretrain.py][INFO] Epoch:[0/2](905300/4588595) loss:2.669 lr:0.0000100 epoch_Time:23448.0min: [2024-01-06 17:42:59,810][model8_pretrain.py][INFO] Epoch:[0/2](905300/4588595) loss:2.248 lr:0.0000100 epoch_Time:23448.0min: [2024-01-06 
17:42:59,810][model8_pretrain.py][INFO] Epoch:[0/2](905300/4588595) loss:2.860 lr:0.0000100 epoch_Time:23448.0min: [2024-01-06 17:42:59,810][model8_pretrain.py][INFO] Epoch:[0/2](905300/4588595) loss:2.536 lr:0.0000100 epoch_Time:23448.0min: [2024-01-06 17:43:36,735][model8_pretrain.py][INFO] Epoch:[0/2](905400/4588595) loss:2.672 lr:0.0000100 epoch_Time:23448.0min: [2024-01-06 17:43:36,735][model8_pretrain.py][INFO] Epoch:[0/2](905400/4588595) loss:2.893 lr:0.0000100 epoch_Time:23448.0min: [2024-01-06 17:43:36,735][model8_pretrain.py][INFO] Epoch:[0/2](905400/4588595) loss:3.060 lr:0.0000100 epoch_Time:23448.0min: [2024-01-06 17:43:36,735][model8_pretrain.py][INFO] Epoch:[0/2](905400/4588595) loss:2.928 lr:0.0000100 epoch_Time:23448.0min: [2024-01-06 17:43:36,735][model8_pretrain.py][INFO] Epoch:[0/2](905400/4588595) loss:2.717 lr:0.0000100 epoch_Time:23448.0min: [2024-01-06 17:43:36,735][model8_pretrain.py][INFO] Epoch:[0/2](905400/4588595) loss:2.373 lr:0.0000100 epoch_Time:23448.0min: [2024-01-06 17:43:36,735][model8_pretrain.py][INFO] Epoch:[0/2](905400/4588595) loss:2.952 lr:0.0000100 epoch_Time:23448.0min: [2024-01-06 17:43:36,735][model8_pretrain.py][INFO] Epoch:[0/2](905400/4588595) loss:3.126 lr:0.0000100 epoch_Time:23448.0min: [2024-01-06 17:44:13,681][model8_pretrain.py][INFO] Epoch:[0/2](905500/4588595) loss:2.811 lr:0.0000100 epoch_Time:23447.0min: [2024-01-06 17:44:13,681][model8_pretrain.py][INFO] Epoch:[0/2](905500/4588595) loss:3.366 lr:0.0000100 epoch_Time:23447.0min: [2024-01-06 17:44:13,681][model8_pretrain.py][INFO] Epoch:[0/2](905500/4588595) loss:2.587 lr:0.0000100 epoch_Time:23447.0min: [2024-01-06 17:44:13,681][model8_pretrain.py][INFO] Epoch:[0/2](905500/4588595) loss:3.059 lr:0.0000100 epoch_Time:23447.0min: [2024-01-06 17:44:13,681][model8_pretrain.py][INFO] Epoch:[0/2](905500/4588595) loss:2.819 lr:0.0000100 epoch_Time:23447.0min: [2024-01-06 17:44:13,682][model8_pretrain.py][INFO] Epoch:[0/2](905500/4588595) loss:3.058 lr:0.0000100 
epoch_Time:23447.0min: [2024-01-06 17:44:13,682][model8_pretrain.py][INFO] Epoch:[0/2](905500/4588595) loss:2.942 lr:0.0000100 epoch_Time:23447.0min: [2024-01-06 17:44:13,682][model8_pretrain.py][INFO] Epoch:[0/2](905500/4588595) loss:2.552 lr:0.0000100 epoch_Time:23447.0min: [2024-01-06 17:44:50,645][model8_pretrain.py][INFO] Epoch:[0/2](905600/4588595) loss:1.823 lr:0.0000100 epoch_Time:23445.0min: [2024-01-06 17:44:50,645][model8_pretrain.py][INFO] Epoch:[0/2](905600/4588595) loss:2.914 lr:0.0000100 epoch_Time:23445.0min: [2024-01-06 17:44:50,645][model8_pretrain.py][INFO] Epoch:[0/2](905600/4588595) loss:2.854 lr:0.0000100 epoch_Time:23445.0min: [2024-01-06 17:44:50,646][model8_pretrain.py][INFO] Epoch:[0/2](905600/4588595) loss:2.993 lr:0.0000100 epoch_Time:23445.0min: [2024-01-06 17:44:50,646][model8_pretrain.py][INFO] Epoch:[0/2](905600/4588595) loss:2.522 lr:0.0000100 epoch_Time:23445.0min: [2024-01-06 17:44:50,646][model8_pretrain.py][INFO] Epoch:[0/2](905600/4588595) loss:2.579 lr:0.0000100 epoch_Time:23445.0min: [2024-01-06 17:44:50,646][model8_pretrain.py][INFO] Epoch:[0/2](905600/4588595) loss:2.407 lr:0.0000100 epoch_Time:23445.0min: [2024-01-06 17:44:50,646][model8_pretrain.py][INFO] Epoch:[0/2](905600/4588595) loss:3.217 lr:0.0000100 epoch_Time:23445.0min: [2024-01-06 17:45:41,391][model8_pretrain.py][INFO] Epoch:[0/2](905700/4588595) loss:2.715 lr:0.0000100 epoch_Time:23447.0min: [2024-01-06 17:45:41,391][model8_pretrain.py][INFO] Epoch:[0/2](905700/4588595) loss:3.238 lr:0.0000100 epoch_Time:23447.0min: [2024-01-06 17:45:41,391][model8_pretrain.py][INFO] Epoch:[0/2](905700/4588595) loss:2.840 lr:0.0000100 epoch_Time:23447.0min: [2024-01-06 17:45:41,391][model8_pretrain.py][INFO] Epoch:[0/2](905700/4588595) loss:1.398 lr:0.0000100 epoch_Time:23447.0min: [2024-01-06 17:45:41,391][model8_pretrain.py][INFO] Epoch:[0/2](905700/4588595) loss:2.671 lr:0.0000100 epoch_Time:23447.0min: [2024-01-06 17:45:41,391][model8_pretrain.py][INFO] 
Epoch:[0/2](905700/4588595) loss:2.555 lr:0.0000100 epoch_Time:23447.0min: [2024-01-06 17:45:41,391][model8_pretrain.py][INFO] Epoch:[0/2](905700/4588595) loss:2.623 lr:0.0000100 epoch_Time:23447.0min: [2024-01-06 17:45:41,391][model8_pretrain.py][INFO] Epoch:[0/2](905700/4588595) loss:3.305 lr:0.0000100 epoch_Time:23447.0min: [2024-01-06 17:46:18,323][model8_pretrain.py][INFO] Epoch:[0/2](905800/4588595) loss:2.402 lr:0.0000100 epoch_Time:23445.0min: [2024-01-06 17:46:18,323][model8_pretrain.py][INFO] Epoch:[0/2](905800/4588595) loss:2.862 lr:0.0000100 epoch_Time:23445.0min: [2024-01-06 17:46:18,323][model8_pretrain.py][INFO] Epoch:[0/2](905800/4588595) loss:3.224 lr:0.0000100 epoch_Time:23445.0min: [2024-01-06 17:46:18,323][model8_pretrain.py][INFO] Epoch:[0/2](905800/4588595) loss:2.638 lr:0.0000100 epoch_Time:23445.0min: [2024-01-06 17:46:18,323][model8_pretrain.py][INFO] Epoch:[0/2](905800/4588595) loss:2.452 lr:0.0000100 epoch_Time:23445.0min: [2024-01-06 17:46:18,323][model8_pretrain.py][INFO] Epoch:[0/2](905800/4588595) loss:2.808 lr:0.0000100 epoch_Time:23445.0min: [2024-01-06 17:46:18,323][model8_pretrain.py][INFO] Epoch:[0/2](905800/4588595) loss:2.876 lr:0.0000100 epoch_Time:23445.0min: [2024-01-06 17:46:18,323][model8_pretrain.py][INFO] Epoch:[0/2](905800/4588595) loss:3.117 lr:0.0000100 epoch_Time:23445.0min: [2024-01-06 17:46:55,262][model8_pretrain.py][INFO] Epoch:[0/2](905900/4588595) loss:2.571 lr:0.0000100 epoch_Time:23444.0min: [2024-01-06 17:46:55,262][model8_pretrain.py][INFO] Epoch:[0/2](905900/4588595) loss:2.308 lr:0.0000100 epoch_Time:23444.0min: [2024-01-06 17:46:55,262][model8_pretrain.py][INFO] Epoch:[0/2](905900/4588595) loss:2.707 lr:0.0000100 epoch_Time:23444.0min: [2024-01-06 17:46:55,262][model8_pretrain.py][INFO] Epoch:[0/2](905900/4588595) loss:2.889 lr:0.0000100 epoch_Time:23444.0min: [2024-01-06 17:46:55,262][model8_pretrain.py][INFO] Epoch:[0/2](905900/4588595) loss:2.250 lr:0.0000100 epoch_Time:23444.0min: [2024-01-06 
17:46:55,262][model8_pretrain.py][INFO] Epoch:[0/2](905900/4588595) loss:2.613 lr:0.0000100 epoch_Time:23444.0min: [2024-01-06 17:46:55,262][model8_pretrain.py][INFO] Epoch:[0/2](905900/4588595) loss:3.039 lr:0.0000100 epoch_Time:23444.0min: [2024-01-06 17:46:55,262][model8_pretrain.py][INFO] Epoch:[0/2](905900/4588595) loss:3.061 lr:0.0000100 epoch_Time:23444.0min: [2024-01-06 17:47:32,208][model8_pretrain.py][INFO] Epoch:[0/2](906000/4588595) loss:2.881 lr:0.0000100 epoch_Time:23444.0min: [2024-01-06 17:47:32,208][model8_pretrain.py][INFO] Epoch:[0/2](906000/4588595) loss:3.025 lr:0.0000100 epoch_Time:23444.0min: [2024-01-06 17:47:32,208][model8_pretrain.py][INFO] Epoch:[0/2](906000/4588595) loss:2.554 lr:0.0000100 epoch_Time:23444.0min: [2024-01-06 17:47:32,208][model8_pretrain.py][INFO] Epoch:[0/2](906000/4588595) loss:3.178 lr:0.0000100 epoch_Time:23444.0min: [2024-01-06 17:47:32,208][model8_pretrain.py][INFO] Epoch:[0/2](906000/4588595) loss:2.889 lr:0.0000100 epoch_Time:23444.0min: [2024-01-06 17:47:32,208][model8_pretrain.py][INFO] Epoch:[0/2](906000/4588595) loss:3.081 lr:0.0000100 epoch_Time:23444.0min: [2024-01-06 17:47:32,208][model8_pretrain.py][INFO] Epoch:[0/2](906000/4588595) loss:2.591 lr:0.0000100 epoch_Time:23444.0min: [2024-01-06 17:47:32,208][model8_pretrain.py][INFO] Epoch:[0/2](906000/4588595) loss:2.872 lr:0.0000100 epoch_Time:23444.0min: [2024-01-06 17:48:09,152][model8_pretrain.py][INFO] Epoch:[0/2](906100/4588595) loss:3.003 lr:0.0000100 epoch_Time:23443.0min: [2024-01-06 17:48:09,152][model8_pretrain.py][INFO] Epoch:[0/2](906100/4588595) loss:2.987 lr:0.0000100 epoch_Time:23443.0min: [2024-01-06 17:48:09,152][model8_pretrain.py][INFO] Epoch:[0/2](906100/4588595) loss:2.338 lr:0.0000100 epoch_Time:23443.0min: [2024-01-06 17:48:09,152][model8_pretrain.py][INFO] Epoch:[0/2](906100/4588595) loss:2.958 lr:0.0000100 epoch_Time:23443.0min: [2024-01-06 17:48:09,152][model8_pretrain.py][INFO] Epoch:[0/2](906100/4588595) loss:2.441 lr:0.0000100 
epoch_Time:23443.0min: [2024-01-06 17:48:09,152][model8_pretrain.py][INFO] Epoch:[0/2](906100/4588595) loss:2.467 lr:0.0000100 epoch_Time:23443.0min: [2024-01-06 17:48:09,152][model8_pretrain.py][INFO] Epoch:[0/2](906100/4588595) loss:3.429 lr:0.0000100 epoch_Time:23443.0min: [2024-01-06 17:48:09,153][model8_pretrain.py][INFO] Epoch:[0/2](906100/4588595) loss:2.079 lr:0.0000100 epoch_Time:23443.0min: [2024-01-06 17:48:46,084][model8_pretrain.py][INFO] Epoch:[0/2](906200/4588595) loss:2.506 lr:0.0000100 epoch_Time:23443.0min: [2024-01-06 17:48:46,084][model8_pretrain.py][INFO] Epoch:[0/2](906200/4588595) loss:2.653 lr:0.0000100 epoch_Time:23443.0min: [2024-01-06 17:48:46,084][model8_pretrain.py][INFO] Epoch:[0/2](906200/4588595) loss:2.597 lr:0.0000100 epoch_Time:23443.0min: [2024-01-06 17:48:46,084][model8_pretrain.py][INFO] Epoch:[0/2](906200/4588595) loss:3.034 lr:0.0000100 epoch_Time:23443.0min: [2024-01-06 17:48:46,084][model8_pretrain.py][INFO] Epoch:[0/2](906200/4588595) loss:2.028 lr:0.0000100 epoch_Time:23443.0min: [2024-01-06 17:48:46,084][model8_pretrain.py][INFO] Epoch:[0/2](906200/4588595) loss:2.634 lr:0.0000100 epoch_Time:23443.0min: [2024-01-06 17:48:46,085][model8_pretrain.py][INFO] Epoch:[0/2](906200/4588595) loss:3.234 lr:0.0000100 epoch_Time:23443.0min: [2024-01-06 17:48:46,085][model8_pretrain.py][INFO] Epoch:[0/2](906200/4588595) loss:2.815 lr:0.0000100 epoch_Time:23443.0min: [2024-01-06 17:49:23,027][model8_pretrain.py][INFO] Epoch:[0/2](906300/4588595) loss:2.711 lr:0.0000100 epoch_Time:23442.0min: [2024-01-06 17:49:23,027][model8_pretrain.py][INFO] Epoch:[0/2](906300/4588595) loss:2.280 lr:0.0000100 epoch_Time:23442.0min: [2024-01-06 17:49:23,027][model8_pretrain.py][INFO] Epoch:[0/2](906300/4588595) loss:2.836 lr:0.0000100 epoch_Time:23442.0min: [2024-01-06 17:49:23,027][model8_pretrain.py][INFO] Epoch:[0/2](906300/4588595) loss:2.394 lr:0.0000100 epoch_Time:23442.0min: [2024-01-06 17:49:23,027][model8_pretrain.py][INFO] 
Epoch:[0/2](906300/4588595) loss:3.018 lr:0.0000100 epoch_Time:23442.0min: [2024-01-06 17:49:23,027][model8_pretrain.py][INFO] Epoch:[0/2](906300/4588595) loss:2.554 lr:0.0000100 epoch_Time:23442.0min: [2024-01-06 17:49:23,027][model8_pretrain.py][INFO] Epoch:[0/2](906300/4588595) loss:2.976 lr:0.0000100 epoch_Time:23442.0min: [2024-01-06 17:49:23,027][model8_pretrain.py][INFO] Epoch:[0/2](906300/4588595) loss:2.920 lr:0.0000100 epoch_Time:23442.0min: [2024-01-06 17:49:59,973][model8_pretrain.py][INFO] Epoch:[0/2](906400/4588595) loss:2.595 lr:0.0000100 epoch_Time:23441.0min: [2024-01-06 17:49:59,973][model8_pretrain.py][INFO] Epoch:[0/2](906400/4588595) loss:2.515 lr:0.0000100 epoch_Time:23441.0min: [2024-01-06 17:49:59,973][model8_pretrain.py][INFO] Epoch:[0/2](906400/4588595) loss:3.231 lr:0.0000100 epoch_Time:23441.0min: [2024-01-06 17:49:59,973][model8_pretrain.py][INFO] Epoch:[0/2](906400/4588595) loss:3.042 lr:0.0000100 epoch_Time:23441.0min: [2024-01-06 17:49:59,973][model8_pretrain.py][INFO] Epoch:[0/2](906400/4588595) loss:2.876 lr:0.0000100 epoch_Time:23441.0min: [2024-01-06 17:49:59,973][model8_pretrain.py][INFO] Epoch:[0/2](906400/4588595) loss:2.526 lr:0.0000100 epoch_Time:23441.0min: [2024-01-06 17:49:59,973][model8_pretrain.py][INFO] Epoch:[0/2](906400/4588595) loss:2.472 lr:0.0000100 epoch_Time:23441.0min: [2024-01-06 17:49:59,973][model8_pretrain.py][INFO] Epoch:[0/2](906400/4588595) loss:2.889 lr:0.0000100 epoch_Time:23441.0min: [2024-01-06 17:50:50,793][model8_pretrain.py][INFO] Epoch:[0/2](906500/4588595) loss:2.636 lr:0.0000100 epoch_Time:23441.0min: [2024-01-06 17:50:50,793][model8_pretrain.py][INFO] Epoch:[0/2](906500/4588595) loss:2.480 lr:0.0000100 epoch_Time:23441.0min: [2024-01-06 17:50:50,793][model8_pretrain.py][INFO] Epoch:[0/2](906500/4588595) loss:2.327 lr:0.0000100 epoch_Time:23441.0min: [2024-01-06 17:50:50,793][model8_pretrain.py][INFO] Epoch:[0/2](906500/4588595) loss:3.119 lr:0.0000100 epoch_Time:23441.0min: [2024-01-06 
17:50:50,793][model8_pretrain.py][INFO] Epoch:[0/2](906500/4588595) loss:2.359 lr:0.0000100 epoch_Time:23441.0min: [2024-01-06 17:50:50,793][model8_pretrain.py][INFO] Epoch:[0/2](906500/4588595) loss:2.583 lr:0.0000100 epoch_Time:23441.0min: [2024-01-06 17:50:50,793][model8_pretrain.py][INFO] Epoch:[0/2](906500/4588595) loss:2.354 lr:0.0000100 epoch_Time:23441.0min: [2024-01-06 17:50:50,793][model8_pretrain.py][INFO] Epoch:[0/2](906500/4588595) loss:3.072 lr:0.0000100 epoch_Time:23441.0min: [2024-01-06 17:51:27,719][model8_pretrain.py][INFO] Epoch:[0/2](906600/4588595) loss:2.936 lr:0.0000100 epoch_Time:23441.0min: [2024-01-06 17:51:27,719][model8_pretrain.py][INFO] Epoch:[0/2](906600/4588595) loss:3.091 lr:0.0000100 epoch_Time:23441.0min: [2024-01-06 17:51:27,719][model8_pretrain.py][INFO] Epoch:[0/2](906600/4588595) loss:2.411 lr:0.0000100 epoch_Time:23441.0min: [2024-01-06 17:51:27,719][model8_pretrain.py][INFO] Epoch:[0/2](906600/4588595) loss:2.921 lr:0.0000100 epoch_Time:23441.0min: [2024-01-06 17:51:27,719][model8_pretrain.py][INFO] Epoch:[0/2](906600/4588595) loss:3.125 lr:0.0000100 epoch_Time:23441.0min: [2024-01-06 17:51:27,720][model8_pretrain.py][INFO] Epoch:[0/2](906600/4588595) loss:2.300 lr:0.0000100 epoch_Time:23441.0min: [2024-01-06 17:51:27,720][model8_pretrain.py][INFO] Epoch:[0/2](906600/4588595) loss:2.708 lr:0.0000100 epoch_Time:23441.0min: [2024-01-06 17:51:27,720][model8_pretrain.py][INFO] Epoch:[0/2](906600/4588595) loss:3.208 lr:0.0000100 epoch_Time:23441.0min: [2024-01-06 17:52:04,656][model8_pretrain.py][INFO] Epoch:[0/2](906700/4588595) loss:2.864 lr:0.0000100 epoch_Time:23440.0min: [2024-01-06 17:52:04,656][model8_pretrain.py][INFO] Epoch:[0/2](906700/4588595) loss:2.105 lr:0.0000100 epoch_Time:23440.0min: [2024-01-06 17:52:04,656][model8_pretrain.py][INFO] Epoch:[0/2](906700/4588595) loss:2.665 lr:0.0000100 epoch_Time:23440.0min: [2024-01-06 17:52:04,656][model8_pretrain.py][INFO] Epoch:[0/2](906700/4588595) loss:3.046 lr:0.0000100 
epoch_Time:23440.0min: [2024-01-06 17:52:04,656][model8_pretrain.py][INFO] Epoch:[0/2](906700/4588595) loss:2.618 lr:0.0000100 epoch_Time:23440.0min: [2024-01-06 17:52:04,656][model8_pretrain.py][INFO] Epoch:[0/2](906700/4588595) loss:2.859 lr:0.0000100 epoch_Time:23440.0min: [2024-01-06 17:52:04,656][model8_pretrain.py][INFO] Epoch:[0/2](906700/4588595) loss:2.473 lr:0.0000100 epoch_Time:23440.0min: [2024-01-06 17:52:04,656][model8_pretrain.py][INFO] Epoch:[0/2](906700/4588595) loss:3.089 lr:0.0000100 epoch_Time:23440.0min: [2024-01-06 17:52:41,597][model8_pretrain.py][INFO] Epoch:[0/2](906800/4588595) loss:2.542 lr:0.0000100 epoch_Time:23440.0min: [2024-01-06 17:52:41,597][model8_pretrain.py][INFO] Epoch:[0/2](906800/4588595) loss:2.634 lr:0.0000100 epoch_Time:23440.0min: [2024-01-06 17:52:41,597][model8_pretrain.py][INFO] Epoch:[0/2](906800/4588595) loss:2.598 lr:0.0000100 epoch_Time:23440.0min: [2024-01-06 17:52:41,597][model8_pretrain.py][INFO] Epoch:[0/2](906800/4588595) loss:2.699 lr:0.0000100 epoch_Time:23440.0min: [2024-01-06 17:52:41,597][model8_pretrain.py][INFO] Epoch:[0/2](906800/4588595) loss:2.814 lr:0.0000100 epoch_Time:23440.0min: [2024-01-06 17:52:41,597][model8_pretrain.py][INFO] Epoch:[0/2](906800/4588595) loss:2.305 lr:0.0000100 epoch_Time:23440.0min: [2024-01-06 17:52:41,597][model8_pretrain.py][INFO] Epoch:[0/2](906800/4588595) loss:2.394 lr:0.0000100 epoch_Time:23440.0min: [2024-01-06 17:52:41,597][model8_pretrain.py][INFO] Epoch:[0/2](906800/4588595) loss:2.987 lr:0.0000100 epoch_Time:23440.0min: [2024-01-06 17:53:18,544][model8_pretrain.py][INFO] Epoch:[0/2](906900/4588595) loss:2.471 lr:0.0000100 epoch_Time:23438.0min: [2024-01-06 17:53:18,545][model8_pretrain.py][INFO] Epoch:[0/2](906900/4588595) loss:2.894 lr:0.0000100 epoch_Time:23438.0min: [2024-01-06 17:53:18,545][model8_pretrain.py][INFO] Epoch:[0/2](906900/4588595) loss:2.417 lr:0.0000100 epoch_Time:23438.0min: [2024-01-06 17:53:18,545][model8_pretrain.py][INFO] 
Epoch:[0/2](906900/4588595) loss:2.762 lr:0.0000100 epoch_Time:23438.0min: [2024-01-06 17:53:18,545][model8_pretrain.py][INFO] Epoch:[0/2](906900/4588595) loss:2.844 lr:0.0000100 epoch_Time:23438.0min: [2024-01-06 17:53:18,545][model8_pretrain.py][INFO] Epoch:[0/2](906900/4588595) loss:2.315 lr:0.0000100 epoch_Time:23438.0min: [2024-01-06 17:53:18,545][model8_pretrain.py][INFO] Epoch:[0/2](906900/4588595) loss:3.293 lr:0.0000100 epoch_Time:23438.0min: [2024-01-06 17:53:18,545][model8_pretrain.py][INFO] Epoch:[0/2](906900/4588595) loss:2.733 lr:0.0000100 epoch_Time:23438.0min: [2024-01-06 17:53:55,466][model8_pretrain.py][INFO] Epoch:[0/2](907000/4588595) loss:2.696 lr:0.0000100 epoch_Time:23437.0min: [2024-01-06 17:53:55,466][model8_pretrain.py][INFO] Epoch:[0/2](907000/4588595) loss:2.374 lr:0.0000100 epoch_Time:23437.0min: [2024-01-06 17:53:55,466][model8_pretrain.py][INFO] Epoch:[0/2](907000/4588595) loss:2.576 lr:0.0000100 epoch_Time:23437.0min: [2024-01-06 17:53:55,466][model8_pretrain.py][INFO] Epoch:[0/2](907000/4588595) loss:2.622 lr:0.0000100 epoch_Time:23437.0min: [2024-01-06 17:53:55,466][model8_pretrain.py][INFO] Epoch:[0/2](907000/4588595) loss:3.070 lr:0.0000100 epoch_Time:23437.0min: [2024-01-06 17:53:55,466][model8_pretrain.py][INFO] Epoch:[0/2](907000/4588595) loss:2.775 lr:0.0000100 epoch_Time:23437.0min: [2024-01-06 17:53:55,467][model8_pretrain.py][INFO] Epoch:[0/2](907000/4588595) loss:2.617 lr:0.0000100 epoch_Time:23437.0min: [2024-01-06 17:53:55,467][model8_pretrain.py][INFO] Epoch:[0/2](907000/4588595) loss:2.748 lr:0.0000100 epoch_Time:23437.0min: [2024-01-06 17:54:32,409][model8_pretrain.py][INFO] Epoch:[0/2](907100/4588595) loss:3.384 lr:0.0000100 epoch_Time:23437.0min: [2024-01-06 17:54:32,409][model8_pretrain.py][INFO] Epoch:[0/2](907100/4588595) loss:2.870 lr:0.0000100 epoch_Time:23437.0min: [2024-01-06 17:54:32,409][model8_pretrain.py][INFO] Epoch:[0/2](907100/4588595) loss:2.513 lr:0.0000100 epoch_Time:23437.0min: [2024-01-06 
17:54:32,409][model8_pretrain.py][INFO] Epoch:[0/2](907100/4588595) loss:2.477 lr:0.0000100 epoch_Time:23437.0min: [2024-01-06 17:54:32,409][model8_pretrain.py][INFO] Epoch:[0/2](907100/4588595) loss:3.242 lr:0.0000100 epoch_Time:23437.0min: [2024-01-06 17:54:32,409][model8_pretrain.py][INFO] Epoch:[0/2](907100/4588595) loss:2.294 lr:0.0000100 epoch_Time:23437.0min: [2024-01-06 17:54:32,409][model8_pretrain.py][INFO] Epoch:[0/2](907100/4588595) loss:2.717 lr:0.0000100 epoch_Time:23437.0min: [2024-01-06 17:54:32,409][model8_pretrain.py][INFO] Epoch:[0/2](907100/4588595) loss:3.261 lr:0.0000100 epoch_Time:23437.0min: [2024-01-06 17:55:09,364][model8_pretrain.py][INFO] Epoch:[0/2](907200/4588595) loss:2.863 lr:0.0000100 epoch_Time:23436.0min: [2024-01-06 17:55:09,364][model8_pretrain.py][INFO] Epoch:[0/2](907200/4588595) loss:2.992 lr:0.0000100 epoch_Time:23436.0min: [2024-01-06 17:55:09,364][model8_pretrain.py][INFO] Epoch:[0/2](907200/4588595) loss:3.253 lr:0.0000100 epoch_Time:23436.0min: [2024-01-06 17:55:09,364][model8_pretrain.py][INFO] Epoch:[0/2](907200/4588595) loss:2.713 lr:0.0000100 epoch_Time:23436.0min: [2024-01-06 17:55:09,364][model8_pretrain.py][INFO] Epoch:[0/2](907200/4588595) loss:2.773 lr:0.0000100 epoch_Time:23436.0min: [2024-01-06 17:55:09,364][model8_pretrain.py][INFO] Epoch:[0/2](907200/4588595) loss:2.742 lr:0.0000100 epoch_Time:23436.0min: [2024-01-06 17:55:09,364][model8_pretrain.py][INFO] Epoch:[0/2](907200/4588595) loss:2.848 lr:0.0000100 epoch_Time:23436.0min: [2024-01-06 17:55:09,364][model8_pretrain.py][INFO] Epoch:[0/2](907200/4588595) loss:3.076 lr:0.0000100 epoch_Time:23436.0min: [2024-01-06 17:56:00,200][model8_pretrain.py][INFO] Epoch:[0/2](907300/4588595) loss:2.798 lr:0.0000100 epoch_Time:23436.0min: [2024-01-06 17:56:00,200][model8_pretrain.py][INFO] Epoch:[0/2](907300/4588595) loss:2.589 lr:0.0000100 epoch_Time:23436.0min: [2024-01-06 17:56:00,200][model8_pretrain.py][INFO] Epoch:[0/2](907300/4588595) loss:2.828 lr:0.0000100 
epoch_Time:23436.0min: [2024-01-06 17:56:00,200][model8_pretrain.py][INFO] Epoch:[0/2](907300/4588595) loss:2.818 lr:0.0000100 epoch_Time:23436.0min: [2024-01-06 17:56:00,200][model8_pretrain.py][INFO] Epoch:[0/2](907300/4588595) loss:3.234 lr:0.0000100 epoch_Time:23436.0min: [2024-01-06 17:56:00,200][model8_pretrain.py][INFO] Epoch:[0/2](907300/4588595) loss:2.316 lr:0.0000100 epoch_Time:23436.0min: [2024-01-06 17:56:00,200][model8_pretrain.py][INFO] Epoch:[0/2](907300/4588595) loss:2.704 lr:0.0000100 epoch_Time:23436.0min: [2024-01-06 17:56:00,200][model8_pretrain.py][INFO] Epoch:[0/2](907300/4588595) loss:2.752 lr:0.0000100 epoch_Time:23436.0min: [2024-01-06 17:56:37,121][model8_pretrain.py][INFO] Epoch:[0/2](907400/4588595) loss:3.295 lr:0.0000100 epoch_Time:23436.0min: [2024-01-06 17:56:37,121][model8_pretrain.py][INFO] Epoch:[0/2](907400/4588595) loss:2.665 lr:0.0000100 epoch_Time:23436.0min: [2024-01-06 17:56:37,121][model8_pretrain.py][INFO] Epoch:[0/2](907400/4588595) loss:3.340 lr:0.0000100 epoch_Time:23436.0min: [2024-01-06 17:56:37,121][model8_pretrain.py][INFO] Epoch:[0/2](907400/4588595) loss:2.794 lr:0.0000100 epoch_Time:23436.0min: [2024-01-06 17:56:37,121][model8_pretrain.py][INFO] Epoch:[0/2](907400/4588595) loss:3.172 lr:0.0000100 epoch_Time:23436.0min: [2024-01-06 17:56:37,121][model8_pretrain.py][INFO] Epoch:[0/2](907400/4588595) loss:2.345 lr:0.0000100 epoch_Time:23436.0min: [2024-01-06 17:56:37,121][model8_pretrain.py][INFO] Epoch:[0/2](907400/4588595) loss:3.082 lr:0.0000100 epoch_Time:23436.0min: [2024-01-06 17:56:37,121][model8_pretrain.py][INFO] Epoch:[0/2](907400/4588595) loss:2.967 lr:0.0000100 epoch_Time:23436.0min: [2024-01-06 17:57:14,052][model8_pretrain.py][INFO] Epoch:[0/2](907500/4588595) loss:3.375 lr:0.0000100 epoch_Time:23435.0min: [2024-01-06 17:57:14,052][model8_pretrain.py][INFO] Epoch:[0/2](907500/4588595) loss:2.506 lr:0.0000100 epoch_Time:23435.0min: [2024-01-06 17:57:14,052][model8_pretrain.py][INFO] 
Epoch:[0/2](907500/4588595) loss:3.149 lr:0.0000100 epoch_Time:23435.0min: [2024-01-06 17:57:14,052][model8_pretrain.py][INFO] Epoch:[0/2](907500/4588595) loss:2.565 lr:0.0000100 epoch_Time:23435.0min: [2024-01-06 17:57:14,052][model8_pretrain.py][INFO] Epoch:[0/2](907500/4588595) loss:3.222 lr:0.0000100 epoch_Time:23435.0min: [2024-01-06 17:57:14,052][model8_pretrain.py][INFO] Epoch:[0/2](907500/4588595) loss:2.977 lr:0.0000100 epoch_Time:23435.0min: [2024-01-06 17:57:14,052][model8_pretrain.py][INFO] Epoch:[0/2](907500/4588595) loss:2.568 lr:0.0000100 epoch_Time:23435.0min: [2024-01-06 17:57:14,052][model8_pretrain.py][INFO] Epoch:[0/2](907500/4588595) loss:2.800 lr:0.0000100 epoch_Time:23435.0min: [2024-01-06 17:57:50,989][model8_pretrain.py][INFO] Epoch:[0/2](907600/4588595) loss:2.610 lr:0.0000100 epoch_Time:23434.0min: [2024-01-06 17:57:50,989][model8_pretrain.py][INFO] Epoch:[0/2](907600/4588595) loss:2.440 lr:0.0000100 epoch_Time:23434.0min: [2024-01-06 17:57:50,989][model8_pretrain.py][INFO] Epoch:[0/2](907600/4588595) loss:2.828 lr:0.0000100 epoch_Time:23434.0min: [2024-01-06 17:57:50,989][model8_pretrain.py][INFO] Epoch:[0/2](907600/4588595) loss:2.618 lr:0.0000100 epoch_Time:23434.0min: [2024-01-06 17:57:50,989][model8_pretrain.py][INFO] Epoch:[0/2](907600/4588595) loss:2.782 lr:0.0000100 epoch_Time:23434.0min: [2024-01-06 17:57:50,989][model8_pretrain.py][INFO] Epoch:[0/2](907600/4588595) loss:2.485 lr:0.0000100 epoch_Time:23434.0min: [2024-01-06 17:57:50,989][model8_pretrain.py][INFO] Epoch:[0/2](907600/4588595) loss:2.379 lr:0.0000100 epoch_Time:23434.0min: [2024-01-06 17:57:50,989][model8_pretrain.py][INFO] Epoch:[0/2](907600/4588595) loss:3.171 lr:0.0000100 epoch_Time:23434.0min: [2024-01-06 17:58:27,927][model8_pretrain.py][INFO] Epoch:[0/2](907700/4588595) loss:2.837 lr:0.0000100 epoch_Time:23434.0min: [2024-01-06 17:58:27,927][model8_pretrain.py][INFO] Epoch:[0/2](907700/4588595) loss:3.268 lr:0.0000100 epoch_Time:23434.0min: [2024-01-06 
17:58:27,927][model8_pretrain.py][INFO] Epoch:[0/2](907700/4588595) loss:3.159 lr:0.0000100 epoch_Time:23434.0min: [2024-01-06 17:58:27,927][model8_pretrain.py][INFO] Epoch:[0/2](907700/4588595) loss:2.222 lr:0.0000100 epoch_Time:23434.0min: [2024-01-06 17:58:27,927][model8_pretrain.py][INFO] Epoch:[0/2](907700/4588595) loss:2.348 lr:0.0000100 epoch_Time:23434.0min: [2024-01-06 17:58:27,927][model8_pretrain.py][INFO] Epoch:[0/2](907700/4588595) loss:2.617 lr:0.0000100 epoch_Time:23434.0min: [2024-01-06 17:58:27,927][model8_pretrain.py][INFO] Epoch:[0/2](907700/4588595) loss:2.591 lr:0.0000100 epoch_Time:23434.0min: [2024-01-06 17:58:27,927][model8_pretrain.py][INFO] Epoch:[0/2](907700/4588595) loss:2.875 lr:0.0000100 epoch_Time:23434.0min: [2024-01-06 17:59:04,868][model8_pretrain.py][INFO] Epoch:[0/2](907800/4588595) loss:3.033 lr:0.0000100 epoch_Time:23433.0min: [2024-01-06 17:59:04,868][model8_pretrain.py][INFO] Epoch:[0/2](907800/4588595) loss:3.026 lr:0.0000100 epoch_Time:23433.0min: [2024-01-06 17:59:04,868][model8_pretrain.py][INFO] Epoch:[0/2](907800/4588595) loss:2.951 lr:0.0000100 epoch_Time:23433.0min: [2024-01-06 17:59:04,868][model8_pretrain.py][INFO] Epoch:[0/2](907800/4588595) loss:2.670 lr:0.0000100 epoch_Time:23433.0min: [2024-01-06 17:59:04,868][model8_pretrain.py][INFO] Epoch:[0/2](907800/4588595) loss:2.827 lr:0.0000100 epoch_Time:23433.0min: [2024-01-06 17:59:04,868][model8_pretrain.py][INFO] Epoch:[0/2](907800/4588595) loss:1.999 lr:0.0000100 epoch_Time:23433.0min: [2024-01-06 17:59:04,868][model8_pretrain.py][INFO] Epoch:[0/2](907800/4588595) loss:2.576 lr:0.0000100 epoch_Time:23433.0min: [2024-01-06 17:59:04,868][model8_pretrain.py][INFO] Epoch:[0/2](907800/4588595) loss:3.053 lr:0.0000100 epoch_Time:23433.0min: [2024-01-06 17:59:41,809][model8_pretrain.py][INFO] Epoch:[0/2](907900/4588595) loss:2.607 lr:0.0000100 epoch_Time:23433.0min: [2024-01-06 17:59:41,809][model8_pretrain.py][INFO] Epoch:[0/2](907900/4588595) loss:2.672 lr:0.0000100 
epoch_Time:23433.0min: [2024-01-06 17:59:41,809][model8_pretrain.py][INFO] Epoch:[0/2](907900/4588595) loss:3.148 lr:0.0000100 epoch_Time:23433.0min: [2024-01-06 17:59:41,809][model8_pretrain.py][INFO] Epoch:[0/2](907900/4588595) loss:2.948 lr:0.0000100 epoch_Time:23433.0min: [2024-01-06 17:59:41,809][model8_pretrain.py][INFO] Epoch:[0/2](907900/4588595) loss:2.662 lr:0.0000100 epoch_Time:23433.0min: [2024-01-06 17:59:41,809][model8_pretrain.py][INFO] Epoch:[0/2](907900/4588595) loss:3.105 lr:0.0000100 epoch_Time:23433.0min: [2024-01-06 17:59:41,809][model8_pretrain.py][INFO] Epoch:[0/2](907900/4588595) loss:3.001 lr:0.0000100 epoch_Time:23433.0min: [2024-01-06 17:59:41,809][model8_pretrain.py][INFO] Epoch:[0/2](907900/4588595) loss:3.339 lr:0.0000100 epoch_Time:23433.0min: [2024-01-06 18:00:18,745][model8_pretrain.py][INFO] Epoch:[0/2](908000/4588595) loss:3.431 lr:0.0000100 epoch_Time:23431.0min: [2024-01-06 18:00:18,745][model8_pretrain.py][INFO] Epoch:[0/2](908000/4588595) loss:2.498 lr:0.0000100 epoch_Time:23431.0min: [2024-01-06 18:00:18,745][model8_pretrain.py][INFO] Epoch:[0/2](908000/4588595) loss:2.754 lr:0.0000100 epoch_Time:23431.0min: [2024-01-06 18:00:18,745][model8_pretrain.py][INFO] Epoch:[0/2](908000/4588595) loss:2.876 lr:0.0000100 epoch_Time:23431.0min: [2024-01-06 18:00:18,745][model8_pretrain.py][INFO] Epoch:[0/2](908000/4588595) loss:2.941 lr:0.0000100 epoch_Time:23431.0min: [2024-01-06 18:00:18,745][model8_pretrain.py][INFO] Epoch:[0/2](908000/4588595) loss:3.080 lr:0.0000100 epoch_Time:23431.0min: [2024-01-06 18:00:18,745][model8_pretrain.py][INFO] Epoch:[0/2](908000/4588595) loss:3.192 lr:0.0000100 epoch_Time:23431.0min: [2024-01-06 18:00:18,745][model8_pretrain.py][INFO] Epoch:[0/2](908000/4588595) loss:2.120 lr:0.0000100 epoch_Time:23431.0min: [2024-01-06 18:01:09,768][model8_pretrain.py][INFO] Epoch:[0/2](908100/4588595) loss:3.187 lr:0.0000100 epoch_Time:23432.0min: [2024-01-06 18:01:09,768][model8_pretrain.py][INFO] 
Epoch:[0/2](908100/4588595) loss:2.344 lr:0.0000100 epoch_Time:23432.0min: [2024-01-06 18:01:09,768][model8_pretrain.py][INFO] Epoch:[0/2](908100/4588595) loss:3.199 lr:0.0000100 epoch_Time:23432.0min: [2024-01-06 18:01:09,768][model8_pretrain.py][INFO] Epoch:[0/2](908100/4588595) loss:2.674 lr:0.0000100 epoch_Time:23432.0min: [2024-01-06 18:01:09,768][model8_pretrain.py][INFO] Epoch:[0/2](908100/4588595) loss:2.671 lr:0.0000100 epoch_Time:23432.0min: [2024-01-06 18:01:09,768][model8_pretrain.py][INFO] Epoch:[0/2](908100/4588595) loss:2.824 lr:0.0000100 epoch_Time:23432.0min: [2024-01-06 18:01:09,768][model8_pretrain.py][INFO] Epoch:[0/2](908100/4588595) loss:2.786 lr:0.0000100 epoch_Time:23432.0min: [2024-01-06 18:01:09,768][model8_pretrain.py][INFO] Epoch:[0/2](908100/4588595) loss:2.584 lr:0.0000100 epoch_Time:23432.0min: [2024-01-06 18:01:46,695][model8_pretrain.py][INFO] Epoch:[0/2](908200/4588595) loss:2.376 lr:0.0000100 epoch_Time:23431.0min: [2024-01-06 18:01:46,695][model8_pretrain.py][INFO] Epoch:[0/2](908200/4588595) loss:2.516 lr:0.0000100 epoch_Time:23431.0min: [2024-01-06 18:01:46,695][model8_pretrain.py][INFO] Epoch:[0/2](908200/4588595) loss:3.577 lr:0.0000100 epoch_Time:23431.0min: [2024-01-06 18:01:46,695][model8_pretrain.py][INFO] Epoch:[0/2](908200/4588595) loss:2.974 lr:0.0000100 epoch_Time:23431.0min: [2024-01-06 18:01:46,695][model8_pretrain.py][INFO] Epoch:[0/2](908200/4588595) loss:2.746 lr:0.0000100 epoch_Time:23431.0min: [2024-01-06 18:01:46,695][model8_pretrain.py][INFO] Epoch:[0/2](908200/4588595) loss:2.766 lr:0.0000100 epoch_Time:23431.0min: [2024-01-06 18:01:46,695][model8_pretrain.py][INFO] Epoch:[0/2](908200/4588595) loss:2.906 lr:0.0000100 epoch_Time:23431.0min: [2024-01-06 18:01:46,695][model8_pretrain.py][INFO] Epoch:[0/2](908200/4588595) loss:3.143 lr:0.0000100 epoch_Time:23431.0min: [2024-01-06 18:02:23,634][model8_pretrain.py][INFO] Epoch:[0/2](908300/4588595) loss:2.730 lr:0.0000100 epoch_Time:23430.0min: [2024-01-06 
18:02:23,634][model8_pretrain.py][INFO] Epoch:[0/2](908300/4588595) loss:2.350 lr:0.0000100 epoch_Time:23430.0min: [2024-01-06 18:02:23,634][model8_pretrain.py][INFO] Epoch:[0/2](908300/4588595) loss:2.738 lr:0.0000100 epoch_Time:23430.0min: [2024-01-06 18:02:23,634][model8_pretrain.py][INFO] Epoch:[0/2](908300/4588595) loss:3.171 lr:0.0000100 epoch_Time:23430.0min: [2024-01-06 18:02:23,635][model8_pretrain.py][INFO] Epoch:[0/2](908300/4588595) loss:3.319 lr:0.0000100 epoch_Time:23430.0min: [2024-01-06 18:02:23,635][model8_pretrain.py][INFO] Epoch:[0/2](908300/4588595) loss:3.029 lr:0.0000100 epoch_Time:23430.0min: [2024-01-06 18:02:23,635][model8_pretrain.py][INFO] Epoch:[0/2](908300/4588595) loss:2.737 lr:0.0000100 epoch_Time:23430.0min: [2024-01-06 18:02:23,635][model8_pretrain.py][INFO] Epoch:[0/2](908300/4588595) loss:2.553 lr:0.0000100 epoch_Time:23430.0min: [2024-01-06 18:03:00,580][model8_pretrain.py][INFO] Epoch:[0/2](908400/4588595) loss:2.906 lr:0.0000100 epoch_Time:23429.0min: [2024-01-06 18:03:00,580][model8_pretrain.py][INFO] Epoch:[0/2](908400/4588595) loss:2.512 lr:0.0000100 epoch_Time:23429.0min: [2024-01-06 18:03:00,580][model8_pretrain.py][INFO] Epoch:[0/2](908400/4588595) loss:2.041 lr:0.0000100 epoch_Time:23429.0min: [2024-01-06 18:03:00,580][model8_pretrain.py][INFO] Epoch:[0/2](908400/4588595) loss:2.816 lr:0.0000100 epoch_Time:23429.0min: [2024-01-06 18:03:00,580][model8_pretrain.py][INFO] Epoch:[0/2](908400/4588595) loss:2.640 lr:0.0000100 epoch_Time:23429.0min: [2024-01-06 18:03:00,580][model8_pretrain.py][INFO] Epoch:[0/2](908400/4588595) loss:2.747 lr:0.0000100 epoch_Time:23429.0min: [2024-01-06 18:03:00,580][model8_pretrain.py][INFO] Epoch:[0/2](908400/4588595) loss:2.596 lr:0.0000100 epoch_Time:23429.0min: [2024-01-06 18:03:00,580][model8_pretrain.py][INFO] Epoch:[0/2](908400/4588595) loss:3.056 lr:0.0000100 epoch_Time:23429.0min: [2024-01-06 18:03:37,516][model8_pretrain.py][INFO] Epoch:[0/2](908500/4588595) loss:3.112 lr:0.0000100 
epoch_Time:23429.0min: [2024-01-06 18:03:37,517][model8_pretrain.py][INFO] Epoch:[0/2](908500/4588595) loss:3.188 lr:0.0000100 epoch_Time:23429.0min: [2024-01-06 18:03:37,517][model8_pretrain.py][INFO] Epoch:[0/2](908500/4588595) loss:2.317 lr:0.0000100 epoch_Time:23429.0min: [2024-01-06 18:03:37,517][model8_pretrain.py][INFO] Epoch:[0/2](908500/4588595) loss:1.815 lr:0.0000100 epoch_Time:23429.0min: [2024-01-06 18:03:37,517][model8_pretrain.py][INFO] Epoch:[0/2](908500/4588595) loss:2.576 lr:0.0000100 epoch_Time:23429.0min: [2024-01-06 18:03:37,517][model8_pretrain.py][INFO] Epoch:[0/2](908500/4588595) loss:2.669 lr:0.0000100 epoch_Time:23429.0min: [2024-01-06 18:03:37,517][model8_pretrain.py][INFO] Epoch:[0/2](908500/4588595) loss:3.044 lr:0.0000100 epoch_Time:23429.0min: [2024-01-06 18:03:37,517][model8_pretrain.py][INFO] Epoch:[0/2](908500/4588595) loss:2.790 lr:0.0000100 epoch_Time:23429.0min: [2024-01-06 18:04:14,457][model8_pretrain.py][INFO] Epoch:[0/2](908600/4588595) loss:2.610 lr:0.0000100 epoch_Time:23428.0min: [2024-01-06 18:04:14,457][model8_pretrain.py][INFO] Epoch:[0/2](908600/4588595) loss:2.003 lr:0.0000100 epoch_Time:23428.0min: [2024-01-06 18:04:14,457][model8_pretrain.py][INFO] Epoch:[0/2](908600/4588595) loss:2.840 lr:0.0000100 epoch_Time:23428.0min: [2024-01-06 18:04:14,457][model8_pretrain.py][INFO] Epoch:[0/2](908600/4588595) loss:2.713 lr:0.0000100 epoch_Time:23428.0min: [2024-01-06 18:04:14,457][model8_pretrain.py][INFO] Epoch:[0/2](908600/4588595) loss:3.118 lr:0.0000100 epoch_Time:23428.0min: [2024-01-06 18:04:14,457][model8_pretrain.py][INFO] Epoch:[0/2](908600/4588595) loss:2.721 lr:0.0000100 epoch_Time:23428.0min: [2024-01-06 18:04:14,457][model8_pretrain.py][INFO] Epoch:[0/2](908600/4588595) loss:2.549 lr:0.0000100 epoch_Time:23428.0min: [2024-01-06 18:04:14,457][model8_pretrain.py][INFO] Epoch:[0/2](908600/4588595) loss:2.714 lr:0.0000100 epoch_Time:23428.0min: [2024-01-06 18:04:51,392][model8_pretrain.py][INFO] 
Epoch:[0/2](908700/4588595) loss:3.134 lr:0.0000100 epoch_Time:23427.0min: [2024-01-06 18:04:51,392][model8_pretrain.py][INFO] Epoch:[0/2](908700/4588595) loss:2.594 lr:0.0000100 epoch_Time:23427.0min: [2024-01-06 18:04:51,392][model8_pretrain.py][INFO] Epoch:[0/2](908700/4588595) loss:2.593 lr:0.0000100 epoch_Time:23427.0min: [2024-01-06 18:04:51,392][model8_pretrain.py][INFO] Epoch:[0/2](908700/4588595) loss:2.324 lr:0.0000100 epoch_Time:23427.0min: [2024-01-06 18:04:51,392][model8_pretrain.py][INFO] Epoch:[0/2](908700/4588595) loss:2.807 lr:0.0000100 epoch_Time:23427.0min: [2024-01-06 18:04:51,392][model8_pretrain.py][INFO] Epoch:[0/2](908700/4588595) loss:3.198 lr:0.0000100 epoch_Time:23427.0min: [2024-01-06 18:04:51,392][model8_pretrain.py][INFO] Epoch:[0/2](908700/4588595) loss:3.422 lr:0.0000100 epoch_Time:23427.0min: [2024-01-06 18:04:51,392][model8_pretrain.py][INFO] Epoch:[0/2](908700/4588595) loss:2.874 lr:0.0000100 epoch_Time:23427.0min: [2024-01-06 18:05:28,313][model8_pretrain.py][INFO] Epoch:[0/2](908800/4588595) loss:2.908 lr:0.0000100 epoch_Time:23427.0min: [2024-01-06 18:05:28,313][model8_pretrain.py][INFO] Epoch:[0/2](908800/4588595) loss:2.502 lr:0.0000100 epoch_Time:23427.0min: [2024-01-06 18:05:28,313][model8_pretrain.py][INFO] Epoch:[0/2](908800/4588595) loss:3.010 lr:0.0000100 epoch_Time:23427.0min: [2024-01-06 18:05:28,313][model8_pretrain.py][INFO] Epoch:[0/2](908800/4588595) loss:2.801 lr:0.0000100 epoch_Time:23427.0min: [2024-01-06 18:05:28,313][model8_pretrain.py][INFO] Epoch:[0/2](908800/4588595) loss:2.763 lr:0.0000100 epoch_Time:23427.0min: [2024-01-06 18:05:28,313][model8_pretrain.py][INFO] Epoch:[0/2](908800/4588595) loss:3.044 lr:0.0000100 epoch_Time:23427.0min: [2024-01-06 18:05:28,313][model8_pretrain.py][INFO] Epoch:[0/2](908800/4588595) loss:3.248 lr:0.0000100 epoch_Time:23427.0min: [2024-01-06 18:05:28,313][model8_pretrain.py][INFO] Epoch:[0/2](908800/4588595) loss:3.022 lr:0.0000100 epoch_Time:23427.0min: [2024-01-06 
18:06:18,892][model8_pretrain.py][INFO] Epoch:[0/2](908900/4588595) loss:2.885 lr:0.0000100 epoch_Time:23427.0min: [2024-01-06 18:06:18,892][model8_pretrain.py][INFO] Epoch:[0/2](908900/4588595) loss:2.467 lr:0.0000100 epoch_Time:23427.0min: [2024-01-06 18:06:18,892][model8_pretrain.py][INFO] Epoch:[0/2](908900/4588595) loss:3.053 lr:0.0000100 epoch_Time:23427.0min: [2024-01-06 18:06:18,892][model8_pretrain.py][INFO] Epoch:[0/2](908900/4588595) loss:2.793 lr:0.0000100 epoch_Time:23427.0min: [2024-01-06 18:06:18,892][model8_pretrain.py][INFO] Epoch:[0/2](908900/4588595) loss:2.934 lr:0.0000100 epoch_Time:23427.0min: [2024-01-06 18:06:18,892][model8_pretrain.py][INFO] Epoch:[0/2](908900/4588595) loss:2.098 lr:0.0000100 epoch_Time:23427.0min: [2024-01-06 18:06:18,892][model8_pretrain.py][INFO] Epoch:[0/2](908900/4588595) loss:2.478 lr:0.0000100 epoch_Time:23427.0min: [2024-01-06 18:06:18,892][model8_pretrain.py][INFO] Epoch:[0/2](908900/4588595) loss:2.746 lr:0.0000100 epoch_Time:23427.0min: [2024-01-06 18:06:55,776][model8_pretrain.py][INFO] Epoch:[0/2](909000/4588595) loss:2.808 lr:0.0000100 epoch_Time:23426.0min: [2024-01-06 18:06:55,776][model8_pretrain.py][INFO] Epoch:[0/2](909000/4588595) loss:2.814 lr:0.0000100 epoch_Time:23426.0min: [2024-01-06 18:06:55,776][model8_pretrain.py][INFO] Epoch:[0/2](909000/4588595) loss:3.227 lr:0.0000100 epoch_Time:23426.0min: [2024-01-06 18:06:55,776][model8_pretrain.py][INFO] Epoch:[0/2](909000/4588595) loss:2.992 lr:0.0000100 epoch_Time:23426.0min: [2024-01-06 18:06:55,776][model8_pretrain.py][INFO] Epoch:[0/2](909000/4588595) loss:2.644 lr:0.0000100 epoch_Time:23426.0min: [2024-01-06 18:06:55,776][model8_pretrain.py][INFO] Epoch:[0/2](909000/4588595) loss:2.471 lr:0.0000100 epoch_Time:23426.0min: [2024-01-06 18:06:55,776][model8_pretrain.py][INFO] Epoch:[0/2](909000/4588595) loss:2.911 lr:0.0000100 epoch_Time:23426.0min: [2024-01-06 18:06:55,777][model8_pretrain.py][INFO] Epoch:[0/2](909000/4588595) loss:2.862 lr:0.0000100 
epoch_Time:23426.0min: [2024-01-06 18:07:32,711][model8_pretrain.py][INFO] Epoch:[0/2](909100/4588595) loss:2.806 lr:0.0000100 epoch_Time:23426.0min: [2024-01-06 18:07:32,711][model8_pretrain.py][INFO] Epoch:[0/2](909100/4588595) loss:2.871 lr:0.0000100 epoch_Time:23426.0min: [2024-01-06 18:07:32,712][model8_pretrain.py][INFO] Epoch:[0/2](909100/4588595) loss:2.763 lr:0.0000100 epoch_Time:23426.0min: [2024-01-06 18:07:32,712][model8_pretrain.py][INFO] Epoch:[0/2](909100/4588595) loss:2.639 lr:0.0000100 epoch_Time:23426.0min: [2024-01-06 18:07:32,712][model8_pretrain.py][INFO] Epoch:[0/2](909100/4588595) loss:2.826 lr:0.0000100 epoch_Time:23426.0min: [2024-01-06 18:07:32,712][model8_pretrain.py][INFO] Epoch:[0/2](909100/4588595) loss:2.740 lr:0.0000100 epoch_Time:23426.0min: [2024-01-06 18:07:32,712][model8_pretrain.py][INFO] Epoch:[0/2](909100/4588595) loss:2.877 lr:0.0000100 epoch_Time:23426.0min: [2024-01-06 18:07:32,712][model8_pretrain.py][INFO] Epoch:[0/2](909100/4588595) loss:2.537 lr:0.0000100 epoch_Time:23426.0min: [2024-01-06 18:08:09,662][model8_pretrain.py][INFO] Epoch:[0/2](909200/4588595) loss:2.712 lr:0.0000100 epoch_Time:23424.0min: [2024-01-06 18:08:09,662][model8_pretrain.py][INFO] Epoch:[0/2](909200/4588595) loss:2.813 lr:0.0000100 epoch_Time:23424.0min: [2024-01-06 18:08:09,662][model8_pretrain.py][INFO] Epoch:[0/2](909200/4588595) loss:2.610 lr:0.0000100 epoch_Time:23424.0min: [2024-01-06 18:08:09,662][model8_pretrain.py][INFO] Epoch:[0/2](909200/4588595) loss:3.088 lr:0.0000100 epoch_Time:23424.0min: [2024-01-06 18:08:09,662][model8_pretrain.py][INFO] Epoch:[0/2](909200/4588595) loss:3.424 lr:0.0000100 epoch_Time:23424.0min: [2024-01-06 18:08:09,662][model8_pretrain.py][INFO] Epoch:[0/2](909200/4588595) loss:2.212 lr:0.0000100 epoch_Time:23424.0min: [2024-01-06 18:08:09,662][model8_pretrain.py][INFO] Epoch:[0/2](909200/4588595) loss:2.980 lr:0.0000100 epoch_Time:23424.0min: [2024-01-06 18:08:09,662][model8_pretrain.py][INFO] 
Epoch:[0/2](909200/4588595) loss:2.460 lr:0.0000100 epoch_Time:23424.0min: [2024-01-06 18:08:46,597][model8_pretrain.py][INFO] Epoch:[0/2](909300/4588595) loss:2.893 lr:0.0000100 epoch_Time:23424.0min: [2024-01-06 18:08:46,597][model8_pretrain.py][INFO] Epoch:[0/2](909300/4588595) loss:2.886 lr:0.0000100 epoch_Time:23424.0min: [2024-01-06 18:08:46,597][model8_pretrain.py][INFO] Epoch:[0/2](909300/4588595) loss:2.707 lr:0.0000100 epoch_Time:23424.0min: [2024-01-06 18:08:46,597][model8_pretrain.py][INFO] Epoch:[0/2](909300/4588595) loss:2.882 lr:0.0000100 epoch_Time:23424.0min: [2024-01-06 18:08:46,597][model8_pretrain.py][INFO] Epoch:[0/2](909300/4588595) loss:3.442 lr:0.0000100 epoch_Time:23424.0min: [2024-01-06 18:08:46,597][model8_pretrain.py][INFO] Epoch:[0/2](909300/4588595) loss:3.000 lr:0.0000100 epoch_Time:23424.0min: [2024-01-06 18:08:46,597][model8_pretrain.py][INFO] Epoch:[0/2](909300/4588595) loss:2.351 lr:0.0000100 epoch_Time:23424.0min: [2024-01-06 18:08:46,597][model8_pretrain.py][INFO] Epoch:[0/2](909300/4588595) loss:2.919 lr:0.0000100 epoch_Time:23424.0min: [2024-01-06 18:09:23,502][model8_pretrain.py][INFO] Epoch:[0/2](909400/4588595) loss:2.660 lr:0.0000100 epoch_Time:23423.0min: [2024-01-06 18:09:23,502][model8_pretrain.py][INFO] Epoch:[0/2](909400/4588595) loss:3.038 lr:0.0000100 epoch_Time:23423.0min: [2024-01-06 18:09:23,502][model8_pretrain.py][INFO] Epoch:[0/2](909400/4588595) loss:2.808 lr:0.0000100 epoch_Time:23423.0min: [2024-01-06 18:09:23,502][model8_pretrain.py][INFO] Epoch:[0/2](909400/4588595) loss:2.598 lr:0.0000100 epoch_Time:23423.0min: [2024-01-06 18:09:23,502][model8_pretrain.py][INFO] Epoch:[0/2](909400/4588595) loss:2.585 lr:0.0000100 epoch_Time:23423.0min: [2024-01-06 18:09:23,502][model8_pretrain.py][INFO] Epoch:[0/2](909400/4588595) loss:2.606 lr:0.0000100 epoch_Time:23423.0min: [2024-01-06 18:09:23,502][model8_pretrain.py][INFO] Epoch:[0/2](909400/4588595) loss:3.059 lr:0.0000100 epoch_Time:23423.0min: [2024-01-06 
18:09:23,502][model8_pretrain.py][INFO] Epoch:[0/2](909400/4588595) loss:2.705 lr:0.0000100 epoch_Time:23423.0min: [2024-01-06 18:10:00,432][model8_pretrain.py][INFO] Epoch:[0/2](909500/4588595) loss:2.319 lr:0.0000100 epoch_Time:23422.0min: [2024-01-06 18:10:00,432][model8_pretrain.py][INFO] Epoch:[0/2](909500/4588595) loss:2.673 lr:0.0000100 epoch_Time:23422.0min: [2024-01-06 18:10:00,432][model8_pretrain.py][INFO] Epoch:[0/2](909500/4588595) loss:2.295 lr:0.0000100 epoch_Time:23422.0min: [2024-01-06 18:10:00,433][model8_pretrain.py][INFO] Epoch:[0/2](909500/4588595) loss:2.154 lr:0.0000100 epoch_Time:23422.0min: [2024-01-06 18:10:00,433][model8_pretrain.py][INFO] Epoch:[0/2](909500/4588595) loss:2.514 lr:0.0000100 epoch_Time:23422.0min: [2024-01-06 18:10:00,433][model8_pretrain.py][INFO] Epoch:[0/2](909500/4588595) loss:2.623 lr:0.0000100 epoch_Time:23422.0min: [2024-01-06 18:10:00,433][model8_pretrain.py][INFO] Epoch:[0/2](909500/4588595) loss:2.714 lr:0.0000100 epoch_Time:23422.0min: [2024-01-06 18:10:00,433][model8_pretrain.py][INFO] Epoch:[0/2](909500/4588595) loss:2.084 lr:0.0000100 epoch_Time:23422.0min: [2024-01-06 18:10:37,367][model8_pretrain.py][INFO] Epoch:[0/2](909600/4588595) loss:2.638 lr:0.0000100 epoch_Time:23422.0min: [2024-01-06 18:10:37,367][model8_pretrain.py][INFO] Epoch:[0/2](909600/4588595) loss:2.540 lr:0.0000100 epoch_Time:23422.0min: [2024-01-06 18:10:37,367][model8_pretrain.py][INFO] Epoch:[0/2](909600/4588595) loss:3.005 lr:0.0000100 epoch_Time:23422.0min: [2024-01-06 18:10:37,367][model8_pretrain.py][INFO] Epoch:[0/2](909600/4588595) loss:2.450 lr:0.0000100 epoch_Time:23422.0min: [2024-01-06 18:10:37,367][model8_pretrain.py][INFO] Epoch:[0/2](909600/4588595) loss:3.075 lr:0.0000100 epoch_Time:23422.0min: [2024-01-06 18:10:37,368][model8_pretrain.py][INFO] Epoch:[0/2](909600/4588595) loss:3.097 lr:0.0000100 epoch_Time:23422.0min: [2024-01-06 18:10:37,367][model8_pretrain.py][INFO] Epoch:[0/2](909600/4588595) loss:3.115 lr:0.0000100 
epoch_Time:23422.0min: [2024-01-06 18:10:37,368][model8_pretrain.py][INFO] Epoch:[0/2](909600/4588595) loss:2.965 lr:0.0000100 epoch_Time:23422.0min: [2024-01-06 18:11:27,938][model8_pretrain.py][INFO] Epoch:[0/2](909700/4588595) loss:2.462 lr:0.0000100 epoch_Time:23422.0min: [2024-01-06 18:11:27,938][model8_pretrain.py][INFO] Epoch:[0/2](909700/4588595) loss:2.865 lr:0.0000100 epoch_Time:23422.0min: [2024-01-06 18:11:27,938][model8_pretrain.py][INFO] Epoch:[0/2](909700/4588595) loss:2.574 lr:0.0000100 epoch_Time:23422.0min: [2024-01-06 18:11:27,938][model8_pretrain.py][INFO] Epoch:[0/2](909700/4588595) loss:2.636 lr:0.0000100 epoch_Time:23422.0min: [2024-01-06 18:11:27,938][model8_pretrain.py][INFO] Epoch:[0/2](909700/4588595) loss:2.581 lr:0.0000100 epoch_Time:23422.0min: [2024-01-06 18:11:27,938][model8_pretrain.py][INFO] Epoch:[0/2](909700/4588595) loss:2.863 lr:0.0000100 epoch_Time:23422.0min: [2024-01-06 18:11:27,938][model8_pretrain.py][INFO] Epoch:[0/2](909700/4588595) loss:2.748 lr:0.0000100 epoch_Time:23422.0min: [2024-01-06 18:11:27,938][model8_pretrain.py][INFO] Epoch:[0/2](909700/4588595) loss:3.092 lr:0.0000100 epoch_Time:23422.0min: [2024-01-06 18:12:04,858][model8_pretrain.py][INFO] Epoch:[0/2](909800/4588595) loss:2.734 lr:0.0000100 epoch_Time:23421.0min: [2024-01-06 18:12:04,858][model8_pretrain.py][INFO] Epoch:[0/2](909800/4588595) loss:3.152 lr:0.0000100 epoch_Time:23421.0min: [2024-01-06 18:12:04,858][model8_pretrain.py][INFO] Epoch:[0/2](909800/4588595) loss:2.872 lr:0.0000100 epoch_Time:23421.0min: [2024-01-06 18:12:04,858][model8_pretrain.py][INFO] Epoch:[0/2](909800/4588595) loss:2.104 lr:0.0000100 epoch_Time:23421.0min: [2024-01-06 18:12:04,859][model8_pretrain.py][INFO] Epoch:[0/2](909800/4588595) loss:2.761 lr:0.0000100 epoch_Time:23421.0min: [2024-01-06 18:12:04,859][model8_pretrain.py][INFO] Epoch:[0/2](909800/4588595) loss:3.044 lr:0.0000100 epoch_Time:23421.0min: [2024-01-06 18:12:04,859][model8_pretrain.py][INFO] 
Epoch:[0/2](909800/4588595) loss:2.385 lr:0.0000100 epoch_Time:23421.0min: [2024-01-06 18:12:04,860][model8_pretrain.py][INFO] Epoch:[0/2](909800/4588595) loss:2.658 lr:0.0000100 epoch_Time:23421.0min: [2024-01-06 18:12:41,793][model8_pretrain.py][INFO] Epoch:[0/2](909900/4588595) loss:2.952 lr:0.0000100 epoch_Time:23421.0min: [2024-01-06 18:12:41,793][model8_pretrain.py][INFO] Epoch:[0/2](909900/4588595) loss:3.256 lr:0.0000100 epoch_Time:23421.0min: [2024-01-06 18:12:41,793][model8_pretrain.py][INFO] Epoch:[0/2](909900/4588595) loss:3.036 lr:0.0000100 epoch_Time:23421.0min: [2024-01-06 18:12:41,793][model8_pretrain.py][INFO] Epoch:[0/2](909900/4588595) loss:2.762 lr:0.0000100 epoch_Time:23421.0min: [2024-01-06 18:12:41,793][model8_pretrain.py][INFO] Epoch:[0/2](909900/4588595) loss:3.015 lr:0.0000100 epoch_Time:23421.0min: [2024-01-06 18:12:41,793][model8_pretrain.py][INFO] Epoch:[0/2](909900/4588595) loss:3.103 lr:0.0000100 epoch_Time:23421.0min: [2024-01-06 18:12:41,794][model8_pretrain.py][INFO] Epoch:[0/2](909900/4588595) loss:2.731 lr:0.0000100 epoch_Time:23421.0min: [2024-01-06 18:12:41,793][model8_pretrain.py][INFO] Epoch:[0/2](909900/4588595) loss:2.656 lr:0.0000100 epoch_Time:23421.0min: [2024-01-06 18:13:18,730][model8_pretrain.py][INFO] Epoch:[0/2](910000/4588595) loss:2.968 lr:0.0000100 epoch_Time:23420.0min: [2024-01-06 18:13:18,730][model8_pretrain.py][INFO] Epoch:[0/2](910000/4588595) loss:2.571 lr:0.0000100 epoch_Time:23420.0min: [2024-01-06 18:13:18,730][model8_pretrain.py][INFO] Epoch:[0/2](910000/4588595) loss:2.816 lr:0.0000100 epoch_Time:23420.0min: [2024-01-06 18:13:18,730][model8_pretrain.py][INFO] Epoch:[0/2](910000/4588595) loss:2.926 lr:0.0000100 epoch_Time:23420.0min: [2024-01-06 18:13:18,730][model8_pretrain.py][INFO] Epoch:[0/2](910000/4588595) loss:3.060 lr:0.0000100 epoch_Time:23420.0min: [2024-01-06 18:13:18,730][model8_pretrain.py][INFO] Epoch:[0/2](910000/4588595) loss:3.036 lr:0.0000100 epoch_Time:23420.0min: [2024-01-06 
18:13:18,731][model8_pretrain.py][INFO] Epoch:[0/2](910000/4588595) loss:3.239 lr:0.0000100 epoch_Time:23420.0min: [2024-01-06 18:13:18,732][model8_pretrain.py][INFO] Epoch:[0/2](910000/4588595) loss:1.829 lr:0.0000100 epoch_Time:23420.0min: [2024-01-06 18:13:55,665][model8_pretrain.py][INFO] Epoch:[0/2](910100/4588595) loss:3.025 lr:0.0000100 epoch_Time:23419.0min: [2024-01-06 18:13:55,666][model8_pretrain.py][INFO] Epoch:[0/2](910100/4588595) loss:2.730 lr:0.0000100 epoch_Time:23419.0min: [2024-01-06 18:13:55,666][model8_pretrain.py][INFO] Epoch:[0/2](910100/4588595) loss:3.029 lr:0.0000100 epoch_Time:23419.0min: [2024-01-06 18:13:55,666][model8_pretrain.py][INFO] Epoch:[0/2](910100/4588595) loss:2.669 lr:0.0000100 epoch_Time:23419.0min: [2024-01-06 18:13:55,666][model8_pretrain.py][INFO] Epoch:[0/2](910100/4588595) loss:3.207 lr:0.0000100 epoch_Time:23419.0min: [2024-01-06 18:13:55,666][model8_pretrain.py][INFO] Epoch:[0/2](910100/4588595) loss:2.966 lr:0.0000100 epoch_Time:23419.0min: [2024-01-06 18:13:55,666][model8_pretrain.py][INFO] Epoch:[0/2](910100/4588595) loss:2.807 lr:0.0000100 epoch_Time:23419.0min: [2024-01-06 18:13:55,666][model8_pretrain.py][INFO] Epoch:[0/2](910100/4588595) loss:2.596 lr:0.0000100 epoch_Time:23419.0min: [2024-01-06 18:14:32,617][model8_pretrain.py][INFO] Epoch:[0/2](910200/4588595) loss:3.012 lr:0.0000100 epoch_Time:23419.0min: [2024-01-06 18:14:32,617][model8_pretrain.py][INFO] Epoch:[0/2](910200/4588595) loss:2.840 lr:0.0000100 epoch_Time:23419.0min: [2024-01-06 18:14:32,617][model8_pretrain.py][INFO] Epoch:[0/2](910200/4588595) loss:2.859 lr:0.0000100 epoch_Time:23419.0min: [2024-01-06 18:14:32,617][model8_pretrain.py][INFO] Epoch:[0/2](910200/4588595) loss:2.880 lr:0.0000100 epoch_Time:23419.0min: [2024-01-06 18:14:32,617][model8_pretrain.py][INFO] Epoch:[0/2](910200/4588595) loss:2.810 lr:0.0000100 epoch_Time:23419.0min: [2024-01-06 18:14:32,617][model8_pretrain.py][INFO] Epoch:[0/2](910200/4588595) loss:2.906 lr:0.0000100 
epoch_Time:23419.0min: [2024-01-06 18:14:32,617][model8_pretrain.py][INFO] Epoch:[0/2](910200/4588595) loss:2.703 lr:0.0000100 epoch_Time:23419.0min: [2024-01-06 18:14:32,617][model8_pretrain.py][INFO] Epoch:[0/2](910200/4588595) loss:3.359 lr:0.0000100 epoch_Time:23419.0min: [2024-01-06 18:15:09,545][model8_pretrain.py][INFO] Epoch:[0/2](910300/4588595) loss:2.999 lr:0.0000100 epoch_Time:23417.0min: [2024-01-06 18:15:09,545][model8_pretrain.py][INFO] Epoch:[0/2](910300/4588595) loss:3.123 lr:0.0000100 epoch_Time:23417.0min: [2024-01-06 18:15:09,545][model8_pretrain.py][INFO] Epoch:[0/2](910300/4588595) loss:2.645 lr:0.0000100 epoch_Time:23417.0min: [2024-01-06 18:15:09,545][model8_pretrain.py][INFO] Epoch:[0/2](910300/4588595) loss:3.319 lr:0.0000100 epoch_Time:23417.0min: [2024-01-06 18:15:09,545][model8_pretrain.py][INFO] Epoch:[0/2](910300/4588595) loss:2.550 lr:0.0000100 epoch_Time:23417.0min: [2024-01-06 18:15:09,545][model8_pretrain.py][INFO] Epoch:[0/2](910300/4588595) loss:2.320 lr:0.0000100 epoch_Time:23417.0min: [2024-01-06 18:15:09,545][model8_pretrain.py][INFO] Epoch:[0/2](910300/4588595) loss:2.900 lr:0.0000100 epoch_Time:23417.0min: [2024-01-06 18:15:09,545][model8_pretrain.py][INFO] Epoch:[0/2](910300/4588595) loss:2.950 lr:0.0000100 epoch_Time:23417.0min: [2024-01-06 18:15:46,461][model8_pretrain.py][INFO] Epoch:[0/2](910400/4588595) loss:2.666 lr:0.0000100 epoch_Time:23417.0min: [2024-01-06 18:15:46,461][model8_pretrain.py][INFO] Epoch:[0/2](910400/4588595) loss:3.248 lr:0.0000100 epoch_Time:23417.0min: [2024-01-06 18:15:46,461][model8_pretrain.py][INFO] Epoch:[0/2](910400/4588595) loss:3.098 lr:0.0000100 epoch_Time:23417.0min: [2024-01-06 18:15:46,461][model8_pretrain.py][INFO] Epoch:[0/2](910400/4588595) loss:3.028 lr:0.0000100 epoch_Time:23417.0min: [2024-01-06 18:15:46,461][model8_pretrain.py][INFO] Epoch:[0/2](910400/4588595) loss:2.454 lr:0.0000100 epoch_Time:23417.0min: [2024-01-06 18:15:46,461][model8_pretrain.py][INFO] 
Epoch:[0/2](910400/4588595) loss:3.175 lr:0.0000100 epoch_Time:23417.0min: [2024-01-06 18:15:46,461][model8_pretrain.py][INFO] Epoch:[0/2](910400/4588595) loss:3.265 lr:0.0000100 epoch_Time:23417.0min: [2024-01-06 18:15:46,461][model8_pretrain.py][INFO] Epoch:[0/2](910400/4588595) loss:2.399 lr:0.0000100 epoch_Time:23417.0min: [2024-01-06 18:16:37,089][model8_pretrain.py][INFO] Epoch:[0/2](910500/4588595) loss:3.013 lr:0.0000100 epoch_Time:23417.0min: [2024-01-06 18:16:37,089][model8_pretrain.py][INFO] Epoch:[0/2](910500/4588595) loss:2.660 lr:0.0000100 epoch_Time:23417.0min: [2024-01-06 18:16:37,089][model8_pretrain.py][INFO] Epoch:[0/2](910500/4588595) loss:2.528 lr:0.0000100 epoch_Time:23417.0min: [2024-01-06 18:16:37,089][model8_pretrain.py][INFO] Epoch:[0/2](910500/4588595) loss:2.835 lr:0.0000100 epoch_Time:23417.0min: [2024-01-06 18:16:37,089][model8_pretrain.py][INFO] Epoch:[0/2](910500/4588595) loss:2.876 lr:0.0000100 epoch_Time:23417.0min: [2024-01-06 18:16:37,089][model8_pretrain.py][INFO] Epoch:[0/2](910500/4588595) loss:3.255 lr:0.0000100 epoch_Time:23417.0min: [2024-01-06 18:16:37,089][model8_pretrain.py][INFO] Epoch:[0/2](910500/4588595) loss:2.572 lr:0.0000100 epoch_Time:23417.0min: [2024-01-06 18:16:37,089][model8_pretrain.py][INFO] Epoch:[0/2](910500/4588595) loss:2.924 lr:0.0000100 epoch_Time:23417.0min: [2024-01-06 18:17:14,002][model8_pretrain.py][INFO] Epoch:[0/2](910600/4588595) loss:2.729 lr:0.0000100 epoch_Time:23416.0min: [2024-01-06 18:17:14,002][model8_pretrain.py][INFO] Epoch:[0/2](910600/4588595) loss:2.589 lr:0.0000100 epoch_Time:23416.0min: [2024-01-06 18:17:14,002][model8_pretrain.py][INFO] Epoch:[0/2](910600/4588595) loss:2.859 lr:0.0000100 epoch_Time:23416.0min: [2024-01-06 18:17:14,002][model8_pretrain.py][INFO] Epoch:[0/2](910600/4588595) loss:1.689 lr:0.0000100 epoch_Time:23416.0min: [2024-01-06 18:17:14,002][model8_pretrain.py][INFO] Epoch:[0/2](910600/4588595) loss:2.530 lr:0.0000100 epoch_Time:23416.0min: [2024-01-06 
18:17:14,002][model8_pretrain.py][INFO] Epoch:[0/2](910600/4588595) loss:2.937 lr:0.0000100 epoch_Time:23416.0min: [2024-01-06 18:17:14,002][model8_pretrain.py][INFO] Epoch:[0/2](910600/4588595) loss:2.882 lr:0.0000100 epoch_Time:23416.0min: [2024-01-06 18:17:14,002][model8_pretrain.py][INFO] Epoch:[0/2](910600/4588595) loss:2.746 lr:0.0000100 epoch_Time:23416.0min: [2024-01-06 18:17:50,928][model8_pretrain.py][INFO] Epoch:[0/2](910700/4588595) loss:3.197 lr:0.0000100 epoch_Time:23415.0min: [2024-01-06 18:17:50,928][model8_pretrain.py][INFO] Epoch:[0/2](910700/4588595) loss:2.699 lr:0.0000100 epoch_Time:23415.0min: [2024-01-06 18:17:50,928][model8_pretrain.py][INFO] Epoch:[0/2](910700/4588595) loss:2.493 lr:0.0000100 epoch_Time:23415.0min: [2024-01-06 18:17:50,928][model8_pretrain.py][INFO] Epoch:[0/2](910700/4588595) loss:2.963 lr:0.0000100 epoch_Time:23415.0min: [2024-01-06 18:17:50,928][model8_pretrain.py][INFO] Epoch:[0/2](910700/4588595) loss:3.074 lr:0.0000100 epoch_Time:23415.0min: [2024-01-06 18:17:50,928][model8_pretrain.py][INFO] Epoch:[0/2](910700/4588595) loss:2.996 lr:0.0000100 epoch_Time:23415.0min: [2024-01-06 18:17:50,928][model8_pretrain.py][INFO] Epoch:[0/2](910700/4588595) loss:2.413 lr:0.0000100 epoch_Time:23415.0min: [2024-01-06 18:17:50,928][model8_pretrain.py][INFO] Epoch:[0/2](910700/4588595) loss:2.706 lr:0.0000100 epoch_Time:23415.0min: [2024-01-06 18:18:27,865][model8_pretrain.py][INFO] Epoch:[0/2](910800/4588595) loss:3.397 lr:0.0000100 epoch_Time:23415.0min: [2024-01-06 18:18:27,865][model8_pretrain.py][INFO] Epoch:[0/2](910800/4588595) loss:2.329 lr:0.0000100 epoch_Time:23415.0min: [2024-01-06 18:18:27,865][model8_pretrain.py][INFO] Epoch:[0/2](910800/4588595) loss:2.118 lr:0.0000100 epoch_Time:23415.0min: [2024-01-06 18:18:27,865][model8_pretrain.py][INFO] Epoch:[0/2](910800/4588595) loss:3.003 lr:0.0000100 epoch_Time:23415.0min: [2024-01-06 18:18:27,865][model8_pretrain.py][INFO] Epoch:[0/2](910800/4588595) loss:3.203 lr:0.0000100 
epoch_Time:23415.0min: [2024-01-06 18:18:27,865][model8_pretrain.py][INFO] Epoch:[0/2](910800/4588595) loss:2.891 lr:0.0000100 epoch_Time:23415.0min: [2024-01-06 18:18:27,865][model8_pretrain.py][INFO] Epoch:[0/2](910800/4588595) loss:3.492 lr:0.0000100 epoch_Time:23415.0min: [2024-01-06 18:18:27,865][model8_pretrain.py][INFO] Epoch:[0/2](910800/4588595) loss:2.231 lr:0.0000100 epoch_Time:23415.0min: [2024-01-06 18:19:04,800][model8_pretrain.py][INFO] Epoch:[0/2](910900/4588595) loss:3.338 lr:0.0000100 epoch_Time:23414.0min: [2024-01-06 18:19:04,800][model8_pretrain.py][INFO] Epoch:[0/2](910900/4588595) loss:2.863 lr:0.0000100 epoch_Time:23414.0min: [2024-01-06 18:19:04,800][model8_pretrain.py][INFO] Epoch:[0/2](910900/4588595) loss:2.758 lr:0.0000100 epoch_Time:23414.0min: [2024-01-06 18:19:04,800][model8_pretrain.py][INFO] Epoch:[0/2](910900/4588595) loss:3.211 lr:0.0000100 epoch_Time:23414.0min: [2024-01-06 18:19:04,800][model8_pretrain.py][INFO] Epoch:[0/2](910900/4588595) loss:2.997 lr:0.0000100 epoch_Time:23414.0min: [2024-01-06 18:19:04,800][model8_pretrain.py][INFO] Epoch:[0/2](910900/4588595) loss:1.928 lr:0.0000100 epoch_Time:23414.0min: [2024-01-06 18:19:04,800][model8_pretrain.py][INFO] Epoch:[0/2](910900/4588595) loss:2.750 lr:0.0000100 epoch_Time:23414.0min: [2024-01-06 18:19:04,800][model8_pretrain.py][INFO] Epoch:[0/2](910900/4588595) loss:3.332 lr:0.0000100 epoch_Time:23414.0min: [2024-01-06 18:19:41,732][model8_pretrain.py][INFO] Epoch:[0/2](911000/4588595) loss:3.049 lr:0.0000100 epoch_Time:23414.0min: [2024-01-06 18:19:41,732][model8_pretrain.py][INFO] Epoch:[0/2](911000/4588595) loss:2.047 lr:0.0000100 epoch_Time:23414.0min: [2024-01-06 18:19:41,732][model8_pretrain.py][INFO] Epoch:[0/2](911000/4588595) loss:2.614 lr:0.0000100 epoch_Time:23414.0min: [2024-01-06 18:19:41,732][model8_pretrain.py][INFO] Epoch:[0/2](911000/4588595) loss:2.952 lr:0.0000100 epoch_Time:23414.0min: [2024-01-06 18:19:41,732][model8_pretrain.py][INFO] 
Epoch:[0/2](911000/4588595) loss:2.249 lr:0.0000100 epoch_Time:23414.0min: [2024-01-06 18:19:41,732][model8_pretrain.py][INFO] Epoch:[0/2](911000/4588595) loss:2.509 lr:0.0000100 epoch_Time:23414.0min: [2024-01-06 18:19:41,732][model8_pretrain.py][INFO] Epoch:[0/2](911000/4588595) loss:3.091 lr:0.0000100 epoch_Time:23414.0min: [2024-01-06 18:19:41,732][model8_pretrain.py][INFO] Epoch:[0/2](911000/4588595) loss:2.918 lr:0.0000100 epoch_Time:23414.0min: [2024-01-06 18:20:18,674][model8_pretrain.py][INFO] Epoch:[0/2](911100/4588595) loss:3.138 lr:0.0000100 epoch_Time:23413.0min: [2024-01-06 18:20:18,674][model8_pretrain.py][INFO] Epoch:[0/2](911100/4588595) loss:2.529 lr:0.0000100 epoch_Time:23413.0min: [2024-01-06 18:20:18,674][model8_pretrain.py][INFO] Epoch:[0/2](911100/4588595) loss:3.251 lr:0.0000100 epoch_Time:23413.0min: [2024-01-06 18:20:18,674][model8_pretrain.py][INFO] Epoch:[0/2](911100/4588595) loss:2.361 lr:0.0000100 epoch_Time:23413.0min: [2024-01-06 18:20:18,674][model8_pretrain.py][INFO] Epoch:[0/2](911100/4588595) loss:3.055 lr:0.0000100 epoch_Time:23413.0min: [2024-01-06 18:20:18,674][model8_pretrain.py][INFO] Epoch:[0/2](911100/4588595) loss:2.717 lr:0.0000100 epoch_Time:23413.0min: [2024-01-06 18:20:18,674][model8_pretrain.py][INFO] Epoch:[0/2](911100/4588595) loss:2.445 lr:0.0000100 epoch_Time:23413.0min: [2024-01-06 18:20:18,674][model8_pretrain.py][INFO] Epoch:[0/2](911100/4588595) loss:2.459 lr:0.0000100 epoch_Time:23413.0min: [2024-01-06 18:20:55,610][model8_pretrain.py][INFO] Epoch:[0/2](911200/4588595) loss:3.049 lr:0.0000100 epoch_Time:23412.0min: [2024-01-06 18:20:55,610][model8_pretrain.py][INFO] Epoch:[0/2](911200/4588595) loss:2.809 lr:0.0000100 epoch_Time:23412.0min: [2024-01-06 18:20:55,610][model8_pretrain.py][INFO] Epoch:[0/2](911200/4588595) loss:2.779 lr:0.0000100 epoch_Time:23412.0min: [2024-01-06 18:20:55,610][model8_pretrain.py][INFO] Epoch:[0/2](911200/4588595) loss:2.956 lr:0.0000100 epoch_Time:23412.0min: [2024-01-06 
18:20:55,610][model8_pretrain.py][INFO] Epoch:[0/2](911200/4588595) loss:2.507 lr:0.0000100 epoch_Time:23412.0min: [2024-01-06 18:20:55,610][model8_pretrain.py][INFO] Epoch:[0/2](911200/4588595) loss:3.221 lr:0.0000100 epoch_Time:23412.0min: [2024-01-06 18:20:55,610][model8_pretrain.py][INFO] Epoch:[0/2](911200/4588595) loss:2.179 lr:0.0000100 epoch_Time:23412.0min: [2024-01-06 18:20:55,610][model8_pretrain.py][INFO] Epoch:[0/2](911200/4588595) loss:2.773 lr:0.0000100 epoch_Time:23412.0min: [2024-01-06 18:21:46,229][model8_pretrain.py][INFO] Epoch:[0/2](911300/4588595) loss:3.117 lr:0.0000100 epoch_Time:23413.0min: [2024-01-06 18:21:46,229][model8_pretrain.py][INFO] Epoch:[0/2](911300/4588595) loss:3.098 lr:0.0000100 epoch_Time:23413.0min: [2024-01-06 18:21:46,229][model8_pretrain.py][INFO] Epoch:[0/2](911300/4588595) loss:2.543 lr:0.0000100 epoch_Time:23413.0min: [2024-01-06 18:21:46,229][model8_pretrain.py][INFO] Epoch:[0/2](911300/4588595) loss:3.235 lr:0.0000100 epoch_Time:23413.0min: [2024-01-06 18:21:46,229][model8_pretrain.py][INFO] Epoch:[0/2](911300/4588595) loss:3.028 lr:0.0000100 epoch_Time:23413.0min: [2024-01-06 18:21:46,229][model8_pretrain.py][INFO] Epoch:[0/2](911300/4588595) loss:2.914 lr:0.0000100 epoch_Time:23413.0min: [2024-01-06 18:21:46,229][model8_pretrain.py][INFO] Epoch:[0/2](911300/4588595) loss:2.813 lr:0.0000100 epoch_Time:23413.0min: [2024-01-06 18:21:46,229][model8_pretrain.py][INFO] Epoch:[0/2](911300/4588595) loss:3.098 lr:0.0000100 epoch_Time:23413.0min: [2024-01-06 18:22:23,167][model8_pretrain.py][INFO] Epoch:[0/2](911400/4588595) loss:2.338 lr:0.0000100 epoch_Time:23412.0min: [2024-01-06 18:22:23,167][model8_pretrain.py][INFO] Epoch:[0/2](911400/4588595) loss:2.699 lr:0.0000100 epoch_Time:23412.0min: [2024-01-06 18:22:23,167][model8_pretrain.py][INFO] Epoch:[0/2](911400/4588595) loss:2.139 lr:0.0000100 epoch_Time:23412.0min: [2024-01-06 18:22:23,167][model8_pretrain.py][INFO] Epoch:[0/2](911400/4588595) loss:2.740 lr:0.0000100 
epoch_Time:23412.0min: [2024-01-06 18:22:23,167][model8_pretrain.py][INFO] Epoch:[0/2](911400/4588595) loss:2.251 lr:0.0000100 epoch_Time:23412.0min: [2024-01-06 18:22:23,167][model8_pretrain.py][INFO] Epoch:[0/2](911400/4588595) loss:3.049 lr:0.0000100 epoch_Time:23412.0min: [2024-01-06 18:22:23,167][model8_pretrain.py][INFO] Epoch:[0/2](911400/4588595) loss:2.804 lr:0.0000100 epoch_Time:23412.0min: [2024-01-06 18:22:23,167][model8_pretrain.py][INFO] Epoch:[0/2](911400/4588595) loss:2.780 lr:0.0000100 epoch_Time:23412.0min: [2024-01-06 18:23:00,115][model8_pretrain.py][INFO] Epoch:[0/2](911500/4588595) loss:2.912 lr:0.0000100 epoch_Time:23410.0min: [2024-01-06 18:23:00,115][model8_pretrain.py][INFO] Epoch:[0/2](911500/4588595) loss:3.014 lr:0.0000100 epoch_Time:23410.0min: [2024-01-06 18:23:00,115][model8_pretrain.py][INFO] Epoch:[0/2](911500/4588595) loss:3.232 lr:0.0000100 epoch_Time:23410.0min: [2024-01-06 18:23:00,115][model8_pretrain.py][INFO] Epoch:[0/2](911500/4588595) loss:2.356 lr:0.0000100 epoch_Time:23410.0min: [2024-01-06 18:23:00,115][model8_pretrain.py][INFO] Epoch:[0/2](911500/4588595) loss:2.867 lr:0.0000100 epoch_Time:23410.0min: [2024-01-06 18:23:00,115][model8_pretrain.py][INFO] Epoch:[0/2](911500/4588595) loss:2.780 lr:0.0000100 epoch_Time:23410.0min: [2024-01-06 18:23:00,115][model8_pretrain.py][INFO] Epoch:[0/2](911500/4588595) loss:2.457 lr:0.0000100 epoch_Time:23410.0min: [2024-01-06 18:23:00,115][model8_pretrain.py][INFO] Epoch:[0/2](911500/4588595) loss:2.883 lr:0.0000100 epoch_Time:23410.0min: [2024-01-06 18:23:37,067][model8_pretrain.py][INFO] Epoch:[0/2](911600/4588595) loss:3.234 lr:0.0000100 epoch_Time:23410.0min: [2024-01-06 18:23:37,067][model8_pretrain.py][INFO] Epoch:[0/2](911600/4588595) loss:2.755 lr:0.0000100 epoch_Time:23410.0min: [2024-01-06 18:23:37,067][model8_pretrain.py][INFO] Epoch:[0/2](911600/4588595) loss:2.237 lr:0.0000100 epoch_Time:23410.0min: [2024-01-06 18:23:37,068][model8_pretrain.py][INFO] 
Epoch:[0/2](911600/4588595) loss:2.279 lr:0.0000100 epoch_Time:23410.0min: [2024-01-06 18:23:37,068][model8_pretrain.py][INFO] Epoch:[0/2](911600/4588595) loss:2.952 lr:0.0000100 epoch_Time:23410.0min: [2024-01-06 18:23:37,068][model8_pretrain.py][INFO] Epoch:[0/2](911600/4588595) loss:2.758 lr:0.0000100 epoch_Time:23410.0min: [2024-01-06 18:23:37,068][model8_pretrain.py][INFO] Epoch:[0/2](911600/4588595) loss:2.747 lr:0.0000100 epoch_Time:23410.0min: [2024-01-06 18:23:37,068][model8_pretrain.py][INFO] Epoch:[0/2](911600/4588595) loss:2.369 lr:0.0000100 epoch_Time:23410.0min: [2024-01-06 18:24:14,021][model8_pretrain.py][INFO] Epoch:[0/2](911700/4588595) loss:2.767 lr:0.0000100 epoch_Time:23409.0min: [2024-01-06 18:24:14,021][model8_pretrain.py][INFO] Epoch:[0/2](911700/4588595) loss:3.189 lr:0.0000100 epoch_Time:23409.0min: [2024-01-06 18:24:14,021][model8_pretrain.py][INFO] Epoch:[0/2](911700/4588595) loss:2.638 lr:0.0000100 epoch_Time:23409.0min: [2024-01-06 18:24:14,021][model8_pretrain.py][INFO] Epoch:[0/2](911700/4588595) loss:2.745 lr:0.0000100 epoch_Time:23409.0min: [2024-01-06 18:24:14,021][model8_pretrain.py][INFO] Epoch:[0/2](911700/4588595) loss:3.109 lr:0.0000100 epoch_Time:23409.0min: [2024-01-06 18:24:14,021][model8_pretrain.py][INFO] Epoch:[0/2](911700/4588595) loss:2.922 lr:0.0000100 epoch_Time:23409.0min: [2024-01-06 18:24:14,021][model8_pretrain.py][INFO] Epoch:[0/2](911700/4588595) loss:1.517 lr:0.0000100 epoch_Time:23409.0min: [2024-01-06 18:24:14,021][model8_pretrain.py][INFO] Epoch:[0/2](911700/4588595) loss:2.595 lr:0.0000100 epoch_Time:23409.0min: [2024-01-06 18:24:50,979][model8_pretrain.py][INFO] Epoch:[0/2](911800/4588595) loss:2.951 lr:0.0000100 epoch_Time:23408.0min: [2024-01-06 18:24:50,979][model8_pretrain.py][INFO] Epoch:[0/2](911800/4588595) loss:3.001 lr:0.0000100 epoch_Time:23408.0min: [2024-01-06 18:24:50,979][model8_pretrain.py][INFO] Epoch:[0/2](911800/4588595) loss:2.656 lr:0.0000100 epoch_Time:23408.0min: [2024-01-06 
18:24:50,979][model8_pretrain.py][INFO] Epoch:[0/2](911800/4588595) loss:2.820 lr:0.0000100 epoch_Time:23408.0min: [2024-01-06 18:24:50,979][model8_pretrain.py][INFO] Epoch:[0/2](911800/4588595) loss:2.951 lr:0.0000100 epoch_Time:23408.0min: [2024-01-06 18:24:50,980][model8_pretrain.py][INFO] Epoch:[0/2](911800/4588595) loss:2.984 lr:0.0000100 epoch_Time:23408.0min: [2024-01-06 18:24:50,980][model8_pretrain.py][INFO] Epoch:[0/2](911800/4588595) loss:2.792 lr:0.0000100 epoch_Time:23408.0min: [2024-01-06 18:24:50,980][model8_pretrain.py][INFO] Epoch:[0/2](911800/4588595) loss:2.689 lr:0.0000100 epoch_Time:23408.0min: [2024-01-06 18:25:27,923][model8_pretrain.py][INFO] Epoch:[0/2](911900/4588595) loss:2.731 lr:0.0000100 epoch_Time:23408.0min: [2024-01-06 18:25:27,923][model8_pretrain.py][INFO] Epoch:[0/2](911900/4588595) loss:2.932 lr:0.0000100 epoch_Time:23408.0min: [2024-01-06 18:25:27,923][model8_pretrain.py][INFO] Epoch:[0/2](911900/4588595) loss:2.330 lr:0.0000100 epoch_Time:23408.0min: [2024-01-06 18:25:27,923][model8_pretrain.py][INFO] Epoch:[0/2](911900/4588595) loss:2.930 lr:0.0000100 epoch_Time:23408.0min: [2024-01-06 18:25:27,923][model8_pretrain.py][INFO] Epoch:[0/2](911900/4588595) loss:2.885 lr:0.0000100 epoch_Time:23408.0min: [2024-01-06 18:25:27,923][model8_pretrain.py][INFO] Epoch:[0/2](911900/4588595) loss:3.206 lr:0.0000100 epoch_Time:23408.0min: [2024-01-06 18:25:27,924][model8_pretrain.py][INFO] Epoch:[0/2](911900/4588595) loss:2.910 lr:0.0000100 epoch_Time:23408.0min: [2024-01-06 18:25:27,924][model8_pretrain.py][INFO] Epoch:[0/2](911900/4588595) loss:2.238 lr:0.0000100 epoch_Time:23408.0min: [2024-01-06 18:26:04,871][model8_pretrain.py][INFO] Epoch:[0/2](912000/4588595) loss:3.099 lr:0.0000100 epoch_Time:23407.0min: [2024-01-06 18:26:04,871][model8_pretrain.py][INFO] Epoch:[0/2](912000/4588595) loss:2.613 lr:0.0000100 epoch_Time:23407.0min: [2024-01-06 18:26:04,871][model8_pretrain.py][INFO] Epoch:[0/2](912000/4588595) loss:2.554 lr:0.0000100 
epoch_Time:23407.0min: [2024-01-06 18:26:04,871][model8_pretrain.py][INFO] Epoch:[0/2](912000/4588595) loss:2.517 lr:0.0000100 epoch_Time:23407.0min: [2024-01-06 18:26:04,871][model8_pretrain.py][INFO] Epoch:[0/2](912000/4588595) loss:2.875 lr:0.0000100 epoch_Time:23407.0min: [2024-01-06 18:26:04,871][model8_pretrain.py][INFO] Epoch:[0/2](912000/4588595) loss:2.906 lr:0.0000100 epoch_Time:23407.0min: [2024-01-06 18:26:04,871][model8_pretrain.py][INFO] Epoch:[0/2](912000/4588595) loss:2.783 lr:0.0000100 epoch_Time:23407.0min: [2024-01-06 18:26:04,871][model8_pretrain.py][INFO] Epoch:[0/2](912000/4588595) loss:2.444 lr:0.0000100 epoch_Time:23407.0min: [2024-01-06 18:26:53,840][model8_pretrain.py][INFO] Epoch:[0/2](912100/4588595) loss:2.909 lr:0.0000100 epoch_Time:23407.0min: [2024-01-06 18:26:53,840][model8_pretrain.py][INFO] Epoch:[0/2](912100/4588595) loss:2.903 lr:0.0000100 epoch_Time:23407.0min: [2024-01-06 18:26:53,840][model8_pretrain.py][INFO] Epoch:[0/2](912100/4588595) loss:3.278 lr:0.0000100 epoch_Time:23407.0min: [2024-01-06 18:26:53,840][model8_pretrain.py][INFO] Epoch:[0/2](912100/4588595) loss:2.987 lr:0.0000100 epoch_Time:23407.0min: [2024-01-06 18:26:53,840][model8_pretrain.py][INFO] Epoch:[0/2](912100/4588595) loss:3.087 lr:0.0000100 epoch_Time:23407.0min: [2024-01-06 18:26:53,840][model8_pretrain.py][INFO] Epoch:[0/2](912100/4588595) loss:2.511 lr:0.0000100 epoch_Time:23407.0min: [2024-01-06 18:26:53,840][model8_pretrain.py][INFO] Epoch:[0/2](912100/4588595) loss:2.055 lr:0.0000100 epoch_Time:23407.0min: [2024-01-06 18:26:53,840][model8_pretrain.py][INFO] Epoch:[0/2](912100/4588595) loss:3.074 lr:0.0000100 epoch_Time:23407.0min: [2024-01-06 18:27:32,447][model8_pretrain.py][INFO] Epoch:[0/2](912200/4588595) loss:2.990 lr:0.0000100 epoch_Time:23407.0min: [2024-01-06 18:27:32,447][model8_pretrain.py][INFO] Epoch:[0/2](912200/4588595) loss:2.993 lr:0.0000100 epoch_Time:23407.0min: [2024-01-06 18:27:32,447][model8_pretrain.py][INFO] 
Epoch:[0/2](912200/4588595) loss:2.709 lr:0.0000100 epoch_Time:23407.0min: [2024-01-06 18:27:32,447][model8_pretrain.py][INFO] Epoch:[0/2](912200/4588595) loss:3.126 lr:0.0000100 epoch_Time:23407.0min: [2024-01-06 18:27:32,447][model8_pretrain.py][INFO] Epoch:[0/2](912200/4588595) loss:3.719 lr:0.0000100 epoch_Time:23407.0min: [2024-01-06 18:27:32,447][model8_pretrain.py][INFO] Epoch:[0/2](912200/4588595) loss:3.393 lr:0.0000100 epoch_Time:23407.0min: [2024-01-06 18:27:32,447][model8_pretrain.py][INFO] Epoch:[0/2](912200/4588595) loss:3.095 lr:0.0000100 epoch_Time:23407.0min: [2024-01-06 18:27:32,447][model8_pretrain.py][INFO] Epoch:[0/2](912200/4588595) loss:2.755 lr:0.0000100 epoch_Time:23407.0min: [2024-01-06 18:28:09,385][model8_pretrain.py][INFO] Epoch:[0/2](912300/4588595) loss:2.346 lr:0.0000100 epoch_Time:23406.0min: [2024-01-06 18:28:09,385][model8_pretrain.py][INFO] Epoch:[0/2](912300/4588595) loss:2.665 lr:0.0000100 epoch_Time:23406.0min: [2024-01-06 18:28:09,385][model8_pretrain.py][INFO] Epoch:[0/2](912300/4588595) loss:2.556 lr:0.0000100 epoch_Time:23406.0min: [2024-01-06 18:28:09,385][model8_pretrain.py][INFO] Epoch:[0/2](912300/4588595) loss:3.221 lr:0.0000100 epoch_Time:23406.0min: [2024-01-06 18:28:09,385][model8_pretrain.py][INFO] Epoch:[0/2](912300/4588595) loss:2.557 lr:0.0000100 epoch_Time:23406.0min: [2024-01-06 18:28:09,385][model8_pretrain.py][INFO] Epoch:[0/2](912300/4588595) loss:2.615 lr:0.0000100 epoch_Time:23406.0min: [2024-01-06 18:28:09,385][model8_pretrain.py][INFO] Epoch:[0/2](912300/4588595) loss:2.781 lr:0.0000100 epoch_Time:23406.0min: [2024-01-06 18:28:09,385][model8_pretrain.py][INFO] Epoch:[0/2](912300/4588595) loss:2.728 lr:0.0000100 epoch_Time:23406.0min: [2024-01-06 18:28:46,316][model8_pretrain.py][INFO] Epoch:[0/2](912400/4588595) loss:2.732 lr:0.0000100 epoch_Time:23406.0min: [2024-01-06 18:28:46,316][model8_pretrain.py][INFO] Epoch:[0/2](912400/4588595) loss:2.239 lr:0.0000100 epoch_Time:23406.0min: [2024-01-06 
18:28:46,317][model8_pretrain.py][INFO] Epoch:[0/2](912400/4588595) loss:3.308 lr:0.0000100 epoch_Time:23406.0min: [2024-01-06 18:28:46,316][model8_pretrain.py][INFO] Epoch:[0/2](912400/4588595) loss:2.768 lr:0.0000100 epoch_Time:23406.0min: [2024-01-06 18:28:46,317][model8_pretrain.py][INFO] Epoch:[0/2](912400/4588595) loss:2.465 lr:0.0000100 epoch_Time:23406.0min: [2024-01-06 18:28:46,317][model8_pretrain.py][INFO] Epoch:[0/2](912400/4588595) loss:2.857 lr:0.0000100 epoch_Time:23406.0min: [2024-01-06 18:28:46,317][model8_pretrain.py][INFO] Epoch:[0/2](912400/4588595) loss:2.773 lr:0.0000100 epoch_Time:23406.0min: [2024-01-06 18:28:46,317][model8_pretrain.py][INFO] Epoch:[0/2](912400/4588595) loss:2.961 lr:0.0000100 epoch_Time:23406.0min: [2024-01-06 18:29:23,259][model8_pretrain.py][INFO] Epoch:[0/2](912500/4588595) loss:2.764 lr:0.0000100 epoch_Time:23405.0min: [2024-01-06 18:29:23,259][model8_pretrain.py][INFO] Epoch:[0/2](912500/4588595) loss:2.476 lr:0.0000100 epoch_Time:23405.0min: [2024-01-06 18:29:23,259][model8_pretrain.py][INFO] Epoch:[0/2](912500/4588595) loss:2.536 lr:0.0000100 epoch_Time:23405.0min: [2024-01-06 18:29:23,259][model8_pretrain.py][INFO] Epoch:[0/2](912500/4588595) loss:2.852 lr:0.0000100 epoch_Time:23405.0min: [2024-01-06 18:29:23,259][model8_pretrain.py][INFO] Epoch:[0/2](912500/4588595) loss:3.056 lr:0.0000100 epoch_Time:23405.0min: [2024-01-06 18:29:23,259][model8_pretrain.py][INFO] Epoch:[0/2](912500/4588595) loss:3.471 lr:0.0000100 epoch_Time:23405.0min: [2024-01-06 18:29:23,259][model8_pretrain.py][INFO] Epoch:[0/2](912500/4588595) loss:2.738 lr:0.0000100 epoch_Time:23405.0min: [2024-01-06 18:29:23,259][model8_pretrain.py][INFO] Epoch:[0/2](912500/4588595) loss:3.080 lr:0.0000100 epoch_Time:23405.0min: [2024-01-06 18:30:00,210][model8_pretrain.py][INFO] Epoch:[0/2](912600/4588595) loss:3.075 lr:0.0000100 epoch_Time:23403.0min: [2024-01-06 18:30:00,210][model8_pretrain.py][INFO] Epoch:[0/2](912600/4588595) loss:2.846 lr:0.0000100 
epoch_Time:23403.0min: [2024-01-06 18:30:00,210][model8_pretrain.py][INFO] Epoch:[0/2](912600/4588595) loss:3.031 lr:0.0000100 epoch_Time:23403.0min: [2024-01-06 18:30:00,210][model8_pretrain.py][INFO] Epoch:[0/2](912600/4588595) loss:2.819 lr:0.0000100 epoch_Time:23403.0min: [2024-01-06 18:30:00,210][model8_pretrain.py][INFO] Epoch:[0/2](912600/4588595) loss:2.811 lr:0.0000100 epoch_Time:23403.0min: [2024-01-06 18:30:00,210][model8_pretrain.py][INFO] Epoch:[0/2](912600/4588595) loss:3.149 lr:0.0000100 epoch_Time:23403.0min: [2024-01-06 18:30:00,210][model8_pretrain.py][INFO] Epoch:[0/2](912600/4588595) loss:2.692 lr:0.0000100 epoch_Time:23403.0min: [2024-01-06 18:30:00,210][model8_pretrain.py][INFO] Epoch:[0/2](912600/4588595) loss:2.893 lr:0.0000100 epoch_Time:23403.0min: [2024-01-06 18:30:37,156][model8_pretrain.py][INFO] Epoch:[0/2](912700/4588595) loss:2.573 lr:0.0000100 epoch_Time:23403.0min: [2024-01-06 18:30:37,156][model8_pretrain.py][INFO] Epoch:[0/2](912700/4588595) loss:2.922 lr:0.0000100 epoch_Time:23403.0min: [2024-01-06 18:30:37,156][model8_pretrain.py][INFO] Epoch:[0/2](912700/4588595) loss:2.895 lr:0.0000100 epoch_Time:23403.0min: [2024-01-06 18:30:37,156][model8_pretrain.py][INFO] Epoch:[0/2](912700/4588595) loss:2.935 lr:0.0000100 epoch_Time:23403.0min: [2024-01-06 18:30:37,156][model8_pretrain.py][INFO] Epoch:[0/2](912700/4588595) loss:2.943 lr:0.0000100 epoch_Time:23403.0min: [2024-01-06 18:30:37,156][model8_pretrain.py][INFO] Epoch:[0/2](912700/4588595) loss:3.038 lr:0.0000100 epoch_Time:23403.0min: [2024-01-06 18:30:37,156][model8_pretrain.py][INFO] Epoch:[0/2](912700/4588595) loss:3.144 lr:0.0000100 epoch_Time:23403.0min: [2024-01-06 18:30:37,156][model8_pretrain.py][INFO] Epoch:[0/2](912700/4588595) loss:2.874 lr:0.0000100 epoch_Time:23403.0min: [2024-01-06 18:31:14,103][model8_pretrain.py][INFO] Epoch:[0/2](912800/4588595) loss:2.546 lr:0.0000100 epoch_Time:23402.0min: [2024-01-06 18:31:14,103][model8_pretrain.py][INFO] 
Epoch:[0/2](912800/4588595) loss:1.770 lr:0.0000100 epoch_Time:23402.0min: [2024-01-06 18:31:14,103][model8_pretrain.py][INFO] Epoch:[0/2](912800/4588595) loss:3.243 lr:0.0000100 epoch_Time:23402.0min: [2024-01-06 18:31:14,103][model8_pretrain.py][INFO] Epoch:[0/2](912800/4588595) loss:2.968 lr:0.0000100 epoch_Time:23402.0min: [2024-01-06 18:31:14,103][model8_pretrain.py][INFO] Epoch:[0/2](912800/4588595) loss:2.771 lr:0.0000100 epoch_Time:23402.0min: [2024-01-06 18:31:14,103][model8_pretrain.py][INFO] Epoch:[0/2](912800/4588595) loss:2.795 lr:0.0000100 epoch_Time:23402.0min: [2024-01-06 18:31:14,104][model8_pretrain.py][INFO] Epoch:[0/2](912800/4588595) loss:2.815 lr:0.0000100 epoch_Time:23402.0min: [2024-01-06 18:31:14,104][model8_pretrain.py][INFO] Epoch:[0/2](912800/4588595) loss:3.116 lr:0.0000100 epoch_Time:23402.0min: [2024-01-06 18:32:03,098][model8_pretrain.py][INFO] Epoch:[0/2](912900/4588595) loss:2.557 lr:0.0000100 epoch_Time:23402.0min: [2024-01-06 18:32:03,098][model8_pretrain.py][INFO] Epoch:[0/2](912900/4588595) loss:3.095 lr:0.0000100 epoch_Time:23402.0min: [2024-01-06 18:32:03,098][model8_pretrain.py][INFO] Epoch:[0/2](912900/4588595) loss:2.922 lr:0.0000100 epoch_Time:23402.0min: [2024-01-06 18:32:03,098][model8_pretrain.py][INFO] Epoch:[0/2](912900/4588595) loss:2.700 lr:0.0000100 epoch_Time:23402.0min: [2024-01-06 18:32:03,099][model8_pretrain.py][INFO] Epoch:[0/2](912900/4588595) loss:3.017 lr:0.0000100 epoch_Time:23402.0min: [2024-01-06 18:32:03,099][model8_pretrain.py][INFO] Epoch:[0/2](912900/4588595) loss:2.816 lr:0.0000100 epoch_Time:23402.0min: [2024-01-06 18:32:03,099][model8_pretrain.py][INFO] Epoch:[0/2](912900/4588595) loss:3.115 lr:0.0000100 epoch_Time:23402.0min: [2024-01-06 18:32:03,099][model8_pretrain.py][INFO] Epoch:[0/2](912900/4588595) loss:3.219 lr:0.0000100 epoch_Time:23402.0min: [2024-01-06 18:32:41,703][model8_pretrain.py][INFO] Epoch:[0/2](913000/4588595) loss:3.186 lr:0.0000100 epoch_Time:23402.0min: [2024-01-06 
18:32:41,703][model8_pretrain.py][INFO] Epoch:[0/2](913000/4588595) loss:3.082 lr:0.0000100 epoch_Time:23402.0min: [2024-01-06 18:32:41,703][model8_pretrain.py][INFO] Epoch:[0/2](913000/4588595) loss:2.859 lr:0.0000100 epoch_Time:23402.0min: [2024-01-06 18:32:41,703][model8_pretrain.py][INFO] Epoch:[0/2](913000/4588595) loss:2.674 lr:0.0000100 epoch_Time:23402.0min: [2024-01-06 18:32:41,703][model8_pretrain.py][INFO] Epoch:[0/2](913000/4588595) loss:2.920 lr:0.0000100 epoch_Time:23402.0min: [2024-01-06 18:32:41,703][model8_pretrain.py][INFO] Epoch:[0/2](913000/4588595) loss:2.416 lr:0.0000100 epoch_Time:23402.0min: [2024-01-06 18:32:41,703][model8_pretrain.py][INFO] Epoch:[0/2](913000/4588595) loss:2.611 lr:0.0000100 epoch_Time:23402.0min: [2024-01-06 18:32:41,703][model8_pretrain.py][INFO] Epoch:[0/2](913000/4588595) loss:2.693 lr:0.0000100 epoch_Time:23402.0min: [2024-01-06 18:33:18,642][model8_pretrain.py][INFO] Epoch:[0/2](913100/4588595) loss:3.165 lr:0.0000100 epoch_Time:23401.0min: [2024-01-06 18:33:18,642][model8_pretrain.py][INFO] Epoch:[0/2](913100/4588595) loss:2.294 lr:0.0000100 epoch_Time:23401.0min: [2024-01-06 18:33:18,642][model8_pretrain.py][INFO] Epoch:[0/2](913100/4588595) loss:2.853 lr:0.0000100 epoch_Time:23401.0min: [2024-01-06 18:33:18,642][model8_pretrain.py][INFO] Epoch:[0/2](913100/4588595) loss:2.417 lr:0.0000100 epoch_Time:23401.0min: [2024-01-06 18:33:18,642][model8_pretrain.py][INFO] Epoch:[0/2](913100/4588595) loss:2.713 lr:0.0000100 epoch_Time:23401.0min: [2024-01-06 18:33:18,642][model8_pretrain.py][INFO] Epoch:[0/2](913100/4588595) loss:2.974 lr:0.0000100 epoch_Time:23401.0min: [2024-01-06 18:33:18,642][model8_pretrain.py][INFO] Epoch:[0/2](913100/4588595) loss:3.052 lr:0.0000100 epoch_Time:23401.0min: [2024-01-06 18:33:18,642][model8_pretrain.py][INFO] Epoch:[0/2](913100/4588595) loss:2.456 lr:0.0000100 epoch_Time:23401.0min: [2024-01-06 18:33:55,584][model8_pretrain.py][INFO] Epoch:[0/2](913200/4588595) loss:3.107 lr:0.0000100 
epoch_Time:23400.0min: [2024-01-06 18:33:55,584][model8_pretrain.py][INFO] Epoch:[0/2](913200/4588595) loss:2.579 lr:0.0000100 epoch_Time:23400.0min: [2024-01-06 18:33:55,584][model8_pretrain.py][INFO] Epoch:[0/2](913200/4588595) loss:2.459 lr:0.0000100 epoch_Time:23400.0min: [2024-01-06 18:33:55,584][model8_pretrain.py][INFO] Epoch:[0/2](913200/4588595) loss:2.763 lr:0.0000100 epoch_Time:23400.0min: [2024-01-06 18:33:55,584][model8_pretrain.py][INFO] Epoch:[0/2](913200/4588595) loss:3.078 lr:0.0000100 epoch_Time:23400.0min: [2024-01-06 18:33:55,584][model8_pretrain.py][INFO] Epoch:[0/2](913200/4588595) loss:1.818 lr:0.0000100 epoch_Time:23400.0min: [2024-01-06 18:33:55,584][model8_pretrain.py][INFO] Epoch:[0/2](913200/4588595) loss:2.353 lr:0.0000100 epoch_Time:23400.0min: [2024-01-06 18:33:55,584][model8_pretrain.py][INFO] Epoch:[0/2](913200/4588595) loss:2.582 lr:0.0000100 epoch_Time:23400.0min: [2024-01-06 18:34:32,536][model8_pretrain.py][INFO] Epoch:[0/2](913300/4588595) loss:2.358 lr:0.0000100 epoch_Time:23400.0min: [2024-01-06 18:34:32,536][model8_pretrain.py][INFO] Epoch:[0/2](913300/4588595) loss:2.751 lr:0.0000100 epoch_Time:23400.0min: [2024-01-06 18:34:32,536][model8_pretrain.py][INFO] Epoch:[0/2](913300/4588595) loss:3.051 lr:0.0000100 epoch_Time:23400.0min: [2024-01-06 18:34:32,536][model8_pretrain.py][INFO] Epoch:[0/2](913300/4588595) loss:2.866 lr:0.0000100 epoch_Time:23400.0min: [2024-01-06 18:34:32,536][model8_pretrain.py][INFO] Epoch:[0/2](913300/4588595) loss:2.655 lr:0.0000100 epoch_Time:23400.0min: [2024-01-06 18:34:32,536][model8_pretrain.py][INFO] Epoch:[0/2](913300/4588595) loss:1.884 lr:0.0000100 epoch_Time:23400.0min: [2024-01-06 18:34:32,536][model8_pretrain.py][INFO] Epoch:[0/2](913300/4588595) loss:2.531 lr:0.0000100 epoch_Time:23400.0min: [2024-01-06 18:34:32,536][model8_pretrain.py][INFO] Epoch:[0/2](913300/4588595) loss:3.146 lr:0.0000100 epoch_Time:23400.0min: [2024-01-06 18:35:09,446][model8_pretrain.py][INFO] 
Epoch:[0/2](913400/4588595) loss:3.123 lr:0.0000100 epoch_Time:23399.0min: [2024-01-06 18:35:09,446][model8_pretrain.py][INFO] Epoch:[0/2](913400/4588595) loss:2.999 lr:0.0000100 epoch_Time:23399.0min: [2024-01-06 18:35:09,446][model8_pretrain.py][INFO] Epoch:[0/2](913400/4588595) loss:2.239 lr:0.0000100 epoch_Time:23399.0min: [2024-01-06 18:35:09,446][model8_pretrain.py][INFO] Epoch:[0/2](913400/4588595) loss:2.486 lr:0.0000100 epoch_Time:23399.0min: [2024-01-06 18:35:09,446][model8_pretrain.py][INFO] Epoch:[0/2](913400/4588595) loss:2.811 lr:0.0000100 epoch_Time:23399.0min: [2024-01-06 18:35:09,446][model8_pretrain.py][INFO] Epoch:[0/2](913400/4588595) loss:2.930 lr:0.0000100 epoch_Time:23399.0min: [2024-01-06 18:35:09,446][model8_pretrain.py][INFO] Epoch:[0/2](913400/4588595) loss:3.147 lr:0.0000100 epoch_Time:23399.0min: [2024-01-06 18:35:09,447][model8_pretrain.py][INFO] Epoch:[0/2](913400/4588595) loss:3.058 lr:0.0000100 epoch_Time:23399.0min: [2024-01-06 18:35:46,373][model8_pretrain.py][INFO] Epoch:[0/2](913500/4588595) loss:2.240 lr:0.0000100 epoch_Time:23399.0min: [2024-01-06 18:35:46,373][model8_pretrain.py][INFO] Epoch:[0/2](913500/4588595) loss:2.893 lr:0.0000100 epoch_Time:23399.0min: [2024-01-06 18:35:46,373][model8_pretrain.py][INFO] Epoch:[0/2](913500/4588595) loss:2.916 lr:0.0000100 epoch_Time:23399.0min: [2024-01-06 18:35:46,373][model8_pretrain.py][INFO] Epoch:[0/2](913500/4588595) loss:2.658 lr:0.0000100 epoch_Time:23399.0min: [2024-01-06 18:35:46,373][model8_pretrain.py][INFO] Epoch:[0/2](913500/4588595) loss:2.922 lr:0.0000100 epoch_Time:23399.0min: [2024-01-06 18:35:46,373][model8_pretrain.py][INFO] Epoch:[0/2](913500/4588595) loss:2.643 lr:0.0000100 epoch_Time:23399.0min: [2024-01-06 18:35:46,373][model8_pretrain.py][INFO] Epoch:[0/2](913500/4588595) loss:2.867 lr:0.0000100 epoch_Time:23399.0min: [2024-01-06 18:35:46,373][model8_pretrain.py][INFO] Epoch:[0/2](913500/4588595) loss:2.639 lr:0.0000100 epoch_Time:23399.0min: [2024-01-06 
18:36:23,313][model8_pretrain.py][INFO] Epoch:[0/2](913600/4588595) loss:2.872 lr:0.0000100 epoch_Time:23398.0min: [2024-01-06 18:36:23,313][model8_pretrain.py][INFO] Epoch:[0/2](913600/4588595) loss:2.407 lr:0.0000100 epoch_Time:23398.0min: [2024-01-06 18:36:23,313][model8_pretrain.py][INFO] Epoch:[0/2](913600/4588595) loss:2.853 lr:0.0000100 epoch_Time:23398.0min: [2024-01-06 18:36:23,313][model8_pretrain.py][INFO] Epoch:[0/2](913600/4588595) loss:2.749 lr:0.0000100 epoch_Time:23398.0min: [2024-01-06 18:36:23,313][model8_pretrain.py][INFO] Epoch:[0/2](913600/4588595) loss:3.320 lr:0.0000100 epoch_Time:23398.0min: [2024-01-06 18:36:23,313][model8_pretrain.py][INFO] Epoch:[0/2](913600/4588595) loss:2.468 lr:0.0000100 epoch_Time:23398.0min: [2024-01-06 18:36:23,313][model8_pretrain.py][INFO] Epoch:[0/2](913600/4588595) loss:1.904 lr:0.0000100 epoch_Time:23398.0min: [2024-01-06 18:36:23,313][model8_pretrain.py][INFO] Epoch:[0/2](913600/4588595) loss:2.657 lr:0.0000100 epoch_Time:23398.0min: [2024-01-06 18:37:12,309][model8_pretrain.py][INFO] Epoch:[0/2](913700/4588595) loss:3.045 lr:0.0000100 epoch_Time:23397.0min: [2024-01-06 18:37:12,309][model8_pretrain.py][INFO] Epoch:[0/2](913700/4588595) loss:2.599 lr:0.0000100 epoch_Time:23397.0min: [2024-01-06 18:37:12,309][model8_pretrain.py][INFO] Epoch:[0/2](913700/4588595) loss:2.573 lr:0.0000100 epoch_Time:23397.0min: [2024-01-06 18:37:12,309][model8_pretrain.py][INFO] Epoch:[0/2](913700/4588595) loss:2.158 lr:0.0000100 epoch_Time:23397.0min: [2024-01-06 18:37:12,309][model8_pretrain.py][INFO] Epoch:[0/2](913700/4588595) loss:2.849 lr:0.0000100 epoch_Time:23397.0min: [2024-01-06 18:37:12,309][model8_pretrain.py][INFO] Epoch:[0/2](913700/4588595) loss:3.381 lr:0.0000100 epoch_Time:23397.0min: [2024-01-06 18:37:12,309][model8_pretrain.py][INFO] Epoch:[0/2](913700/4588595) loss:2.891 lr:0.0000100 epoch_Time:23397.0min: [2024-01-06 18:37:12,309][model8_pretrain.py][INFO] Epoch:[0/2](913700/4588595) loss:3.092 lr:0.0000100 
epoch_Time:23397.0min: [2024-01-06 18:37:50,910][model8_pretrain.py][INFO] Epoch:[0/2](913800/4588595) loss:2.828 lr:0.0000100 epoch_Time:23397.0min: [2024-01-06 18:37:50,910][model8_pretrain.py][INFO] Epoch:[0/2](913800/4588595) loss:2.773 lr:0.0000100 epoch_Time:23397.0min: [2024-01-06 18:37:50,910][model8_pretrain.py][INFO] Epoch:[0/2](913800/4588595) loss:2.836 lr:0.0000100 epoch_Time:23397.0min: [2024-01-06 18:37:50,910][model8_pretrain.py][INFO] Epoch:[0/2](913800/4588595) loss:2.548 lr:0.0000100 epoch_Time:23397.0min: [2024-01-06 18:37:50,910][model8_pretrain.py][INFO] Epoch:[0/2](913800/4588595) loss:2.559 lr:0.0000100 epoch_Time:23397.0min: [2024-01-06 18:37:50,910][model8_pretrain.py][INFO] Epoch:[0/2](913800/4588595) loss:2.695 lr:0.0000100 epoch_Time:23397.0min: [2024-01-06 18:37:50,910][model8_pretrain.py][INFO] Epoch:[0/2](913800/4588595) loss:2.769 lr:0.0000100 epoch_Time:23397.0min: [2024-01-06 18:37:50,910][model8_pretrain.py][INFO] Epoch:[0/2](913800/4588595) loss:2.769 lr:0.0000100 epoch_Time:23397.0min: [2024-01-06 18:38:27,839][model8_pretrain.py][INFO] Epoch:[0/2](913900/4588595) loss:2.810 lr:0.0000100 epoch_Time:23396.0min: [2024-01-06 18:38:27,839][model8_pretrain.py][INFO] Epoch:[0/2](913900/4588595) loss:2.737 lr:0.0000100 epoch_Time:23396.0min: [2024-01-06 18:38:27,839][model8_pretrain.py][INFO] Epoch:[0/2](913900/4588595) loss:2.846 lr:0.0000100 epoch_Time:23396.0min: [2024-01-06 18:38:27,839][model8_pretrain.py][INFO] Epoch:[0/2](913900/4588595) loss:2.905 lr:0.0000100 epoch_Time:23396.0min: [2024-01-06 18:38:27,839][model8_pretrain.py][INFO] Epoch:[0/2](913900/4588595) loss:2.896 lr:0.0000100 epoch_Time:23396.0min: [2024-01-06 18:38:27,839][model8_pretrain.py][INFO] Epoch:[0/2](913900/4588595) loss:2.974 lr:0.0000100 epoch_Time:23396.0min: [2024-01-06 18:38:27,839][model8_pretrain.py][INFO] Epoch:[0/2](913900/4588595) loss:2.266 lr:0.0000100 epoch_Time:23396.0min: [2024-01-06 18:38:27,839][model8_pretrain.py][INFO] 
Epoch:[0/2](913900/4588595) loss:2.759 lr:0.0000100 epoch_Time:23396.0min: [2024-01-06 18:39:04,780][model8_pretrain.py][INFO] Epoch:[0/2](914000/4588595) loss:2.990 lr:0.0000100 epoch_Time:23395.0min: [2024-01-06 18:39:04,780][model8_pretrain.py][INFO] Epoch:[0/2](914000/4588595) loss:2.740 lr:0.0000100 epoch_Time:23395.0min: [2024-01-06 18:39:04,780][model8_pretrain.py][INFO] Epoch:[0/2](914000/4588595) loss:2.897 lr:0.0000100 epoch_Time:23395.0min: [2024-01-06 18:39:04,780][model8_pretrain.py][INFO] Epoch:[0/2](914000/4588595) loss:2.577 lr:0.0000100 epoch_Time:23395.0min: [2024-01-06 18:39:04,780][model8_pretrain.py][INFO] Epoch:[0/2](914000/4588595) loss:2.589 lr:0.0000100 epoch_Time:23395.0min: [2024-01-06 18:39:04,780][model8_pretrain.py][INFO] Epoch:[0/2](914000/4588595) loss:3.203 lr:0.0000100 epoch_Time:23395.0min: [2024-01-06 18:39:04,780][model8_pretrain.py][INFO] Epoch:[0/2](914000/4588595) loss:3.032 lr:0.0000100 epoch_Time:23395.0min: [2024-01-06 18:39:04,782][model8_pretrain.py][INFO] Epoch:[0/2](914000/4588595) loss:2.933 lr:0.0000100 epoch_Time:23395.0min: [2024-01-06 18:39:41,721][model8_pretrain.py][INFO] Epoch:[0/2](914100/4588595) loss:2.841 lr:0.0000100 epoch_Time:23395.0min: [2024-01-06 18:39:41,721][model8_pretrain.py][INFO] Epoch:[0/2](914100/4588595) loss:2.890 lr:0.0000100 epoch_Time:23395.0min: [2024-01-06 18:39:41,721][model8_pretrain.py][INFO] Epoch:[0/2](914100/4588595) loss:2.956 lr:0.0000100 epoch_Time:23395.0min: [2024-01-06 18:39:41,721][model8_pretrain.py][INFO] Epoch:[0/2](914100/4588595) loss:2.020 lr:0.0000100 epoch_Time:23395.0min: [2024-01-06 18:39:41,721][model8_pretrain.py][INFO] Epoch:[0/2](914100/4588595) loss:3.174 lr:0.0000100 epoch_Time:23395.0min: [2024-01-06 18:39:41,721][model8_pretrain.py][INFO] Epoch:[0/2](914100/4588595) loss:3.338 lr:0.0000100 epoch_Time:23395.0min: [2024-01-06 18:39:41,721][model8_pretrain.py][INFO] Epoch:[0/2](914100/4588595) loss:3.044 lr:0.0000100 epoch_Time:23395.0min: [2024-01-06 
18:39:41,721][model8_pretrain.py][INFO] Epoch:[0/2](914100/4588595) loss:2.879 lr:0.0000100 epoch_Time:23395.0min: [2024-01-06 18:40:18,657][model8_pretrain.py][INFO] Epoch:[0/2](914200/4588595) loss:2.730 lr:0.0000100 epoch_Time:23394.0min: [2024-01-06 18:40:18,657][model8_pretrain.py][INFO] Epoch:[0/2](914200/4588595) loss:2.898 lr:0.0000100 epoch_Time:23394.0min: [2024-01-06 18:40:18,657][model8_pretrain.py][INFO] Epoch:[0/2](914200/4588595) loss:2.538 lr:0.0000100 epoch_Time:23394.0min: [2024-01-06 18:40:18,657][model8_pretrain.py][INFO] Epoch:[0/2](914200/4588595) loss:2.515 lr:0.0000100 epoch_Time:23394.0min: [2024-01-06 18:40:18,657][model8_pretrain.py][INFO] Epoch:[0/2](914200/4588595) loss:2.701 lr:0.0000100 epoch_Time:23394.0min: [2024-01-06 18:40:18,657][model8_pretrain.py][INFO] Epoch:[0/2](914200/4588595) loss:2.806 lr:0.0000100 epoch_Time:23394.0min: [2024-01-06 18:40:18,657][model8_pretrain.py][INFO] Epoch:[0/2](914200/4588595) loss:2.584 lr:0.0000100 epoch_Time:23394.0min: [2024-01-06 18:40:18,657][model8_pretrain.py][INFO] Epoch:[0/2](914200/4588595) loss:2.719 lr:0.0000100 epoch_Time:23394.0min: [2024-01-06 18:40:55,598][model8_pretrain.py][INFO] Epoch:[0/2](914300/4588595) loss:3.203 lr:0.0000100 epoch_Time:23393.0min: [2024-01-06 18:40:55,598][model8_pretrain.py][INFO] Epoch:[0/2](914300/4588595) loss:2.897 lr:0.0000100 epoch_Time:23393.0min: [2024-01-06 18:40:55,598][model8_pretrain.py][INFO] Epoch:[0/2](914300/4588595) loss:3.142 lr:0.0000100 epoch_Time:23393.0min: [2024-01-06 18:40:55,598][model8_pretrain.py][INFO] Epoch:[0/2](914300/4588595) loss:1.930 lr:0.0000100 epoch_Time:23393.0min: [2024-01-06 18:40:55,598][model8_pretrain.py][INFO] Epoch:[0/2](914300/4588595) loss:3.002 lr:0.0000100 epoch_Time:23393.0min: [2024-01-06 18:40:55,598][model8_pretrain.py][INFO] Epoch:[0/2](914300/4588595) loss:2.503 lr:0.0000100 epoch_Time:23393.0min: [2024-01-06 18:40:55,598][model8_pretrain.py][INFO] Epoch:[0/2](914300/4588595) loss:2.884 lr:0.0000100 
epoch_Time:23393.0min: [2024-01-06 18:40:55,598][model8_pretrain.py][INFO] Epoch:[0/2](914300/4588595) loss:3.147 lr:0.0000100 epoch_Time:23393.0min: [2024-01-06 18:41:32,534][model8_pretrain.py][INFO] Epoch:[0/2](914400/4588595) loss:2.453 lr:0.0000100 epoch_Time:23393.0min: [2024-01-06 18:41:32,534][model8_pretrain.py][INFO] Epoch:[0/2](914400/4588595) loss:3.246 lr:0.0000100 epoch_Time:23393.0min: [2024-01-06 18:41:32,534][model8_pretrain.py][INFO] Epoch:[0/2](914400/4588595) loss:2.610 lr:0.0000100 epoch_Time:23393.0min: [2024-01-06 18:41:32,534][model8_pretrain.py][INFO] Epoch:[0/2](914400/4588595) loss:2.583 lr:0.0000100 epoch_Time:23393.0min: [2024-01-06 18:41:32,534][model8_pretrain.py][INFO] Epoch:[0/2](914400/4588595) loss:2.761 lr:0.0000100 epoch_Time:23393.0min: [2024-01-06 18:41:32,534][model8_pretrain.py][INFO] Epoch:[0/2](914400/4588595) loss:3.058 lr:0.0000100 epoch_Time:23393.0min: [2024-01-06 18:41:32,534][model8_pretrain.py][INFO] Epoch:[0/2](914400/4588595) loss:3.314 lr:0.0000100 epoch_Time:23393.0min: [2024-01-06 18:41:32,534][model8_pretrain.py][INFO] Epoch:[0/2](914400/4588595) loss:2.905 lr:0.0000100 epoch_Time:23393.0min: [2024-01-06 18:42:19,860][model8_pretrain.py][INFO] Epoch:[0/2](914500/4588595) loss:2.709 lr:0.0000100 epoch_Time:23393.0min: [2024-01-06 18:42:19,860][model8_pretrain.py][INFO] Epoch:[0/2](914500/4588595) loss:2.286 lr:0.0000100 epoch_Time:23393.0min: [2024-01-06 18:42:19,860][model8_pretrain.py][INFO] Epoch:[0/2](914500/4588595) loss:2.438 lr:0.0000100 epoch_Time:23393.0min: [2024-01-06 18:42:19,860][model8_pretrain.py][INFO] Epoch:[0/2](914500/4588595) loss:3.191 lr:0.0000100 epoch_Time:23393.0min: [2024-01-06 18:42:19,860][model8_pretrain.py][INFO] Epoch:[0/2](914500/4588595) loss:2.723 lr:0.0000100 epoch_Time:23393.0min: [2024-01-06 18:42:19,860][model8_pretrain.py][INFO] Epoch:[0/2](914500/4588595) loss:2.966 lr:0.0000100 epoch_Time:23393.0min: [2024-01-06 18:42:19,860][model8_pretrain.py][INFO] 
Epoch:[0/2](914500/4588595) loss:2.332 lr:0.0000100 epoch_Time:23393.0min: [2024-01-06 18:42:19,860][model8_pretrain.py][INFO] Epoch:[0/2](914500/4588595) loss:2.657 lr:0.0000100 epoch_Time:23393.0min: [2024-01-06 18:43:00,431][model8_pretrain.py][INFO] Epoch:[0/2](914600/4588595) loss:2.864 lr:0.0000100 epoch_Time:23392.0min: [2024-01-06 18:43:00,431][model8_pretrain.py][INFO] Epoch:[0/2](914600/4588595) loss:2.573 lr:0.0000100 epoch_Time:23392.0min: [2024-01-06 18:43:00,431][model8_pretrain.py][INFO] Epoch:[0/2](914600/4588595) loss:2.863 lr:0.0000100 epoch_Time:23392.0min: [2024-01-06 18:43:00,431][model8_pretrain.py][INFO] Epoch:[0/2](914600/4588595) loss:2.462 lr:0.0000100 epoch_Time:23392.0min: [2024-01-06 18:43:00,431][model8_pretrain.py][INFO] Epoch:[0/2](914600/4588595) loss:3.430 lr:0.0000100 epoch_Time:23392.0min: [2024-01-06 18:43:00,431][model8_pretrain.py][INFO] Epoch:[0/2](914600/4588595) loss:2.428 lr:0.0000100 epoch_Time:23392.0min: [2024-01-06 18:43:00,431][model8_pretrain.py][INFO] Epoch:[0/2](914600/4588595) loss:2.883 lr:0.0000100 epoch_Time:23392.0min: [2024-01-06 18:43:00,431][model8_pretrain.py][INFO] Epoch:[0/2](914600/4588595) loss:2.987 lr:0.0000100 epoch_Time:23392.0min: [2024-01-06 18:43:37,378][model8_pretrain.py][INFO] Epoch:[0/2](914700/4588595) loss:2.629 lr:0.0000100 epoch_Time:23392.0min: [2024-01-06 18:43:37,379][model8_pretrain.py][INFO] Epoch:[0/2](914700/4588595) loss:2.523 lr:0.0000100 epoch_Time:23392.0min: [2024-01-06 18:43:37,379][model8_pretrain.py][INFO] Epoch:[0/2](914700/4588595) loss:3.015 lr:0.0000100 epoch_Time:23392.0min: [2024-01-06 18:43:37,379][model8_pretrain.py][INFO] Epoch:[0/2](914700/4588595) loss:2.746 lr:0.0000100 epoch_Time:23392.0min: [2024-01-06 18:43:37,379][model8_pretrain.py][INFO] Epoch:[0/2](914700/4588595) loss:2.882 lr:0.0000100 epoch_Time:23392.0min: [2024-01-06 18:43:37,379][model8_pretrain.py][INFO] Epoch:[0/2](914700/4588595) loss:2.582 lr:0.0000100 epoch_Time:23392.0min: [2024-01-06 
18:43:37,379][model8_pretrain.py][INFO] Epoch:[0/2](914700/4588595) loss:2.832 lr:0.0000100 epoch_Time:23392.0min: [2024-01-06 18:43:37,379][model8_pretrain.py][INFO] Epoch:[0/2](914700/4588595) loss:3.410 lr:0.0000100 epoch_Time:23392.0min: [2024-01-06 18:44:14,327][model8_pretrain.py][INFO] Epoch:[0/2](914800/4588595) loss:2.814 lr:0.0000100 epoch_Time:23391.0min: [2024-01-06 18:44:14,327][model8_pretrain.py][INFO] Epoch:[0/2](914800/4588595) loss:2.486 lr:0.0000100 epoch_Time:23391.0min: [2024-01-06 18:44:14,327][model8_pretrain.py][INFO] Epoch:[0/2](914800/4588595) loss:2.713 lr:0.0000100 epoch_Time:23391.0min: [2024-01-06 18:44:14,327][model8_pretrain.py][INFO] Epoch:[0/2](914800/4588595) loss:3.466 lr:0.0000100 epoch_Time:23391.0min: [2024-01-06 18:44:14,327][model8_pretrain.py][INFO] Epoch:[0/2](914800/4588595) loss:2.696 lr:0.0000100 epoch_Time:23391.0min: [2024-01-06 18:44:14,327][model8_pretrain.py][INFO] Epoch:[0/2](914800/4588595) loss:2.645 lr:0.0000100 epoch_Time:23391.0min: [2024-01-06 18:44:14,327][model8_pretrain.py][INFO] Epoch:[0/2](914800/4588595) loss:2.610 lr:0.0000100 epoch_Time:23391.0min: [2024-01-06 18:44:14,327][model8_pretrain.py][INFO] Epoch:[0/2](914800/4588595) loss:2.846 lr:0.0000100 epoch_Time:23391.0min: [2024-01-06 18:44:51,269][model8_pretrain.py][INFO] Epoch:[0/2](914900/4588595) loss:3.209 lr:0.0000100 epoch_Time:23390.0min: [2024-01-06 18:44:51,270][model8_pretrain.py][INFO] Epoch:[0/2](914900/4588595) loss:2.508 lr:0.0000100 epoch_Time:23390.0min: [2024-01-06 18:44:51,269][model8_pretrain.py][INFO] Epoch:[0/2](914900/4588595) loss:3.253 lr:0.0000100 epoch_Time:23390.0min: [2024-01-06 18:44:51,270][model8_pretrain.py][INFO] Epoch:[0/2](914900/4588595) loss:2.591 lr:0.0000100 epoch_Time:23390.0min: [2024-01-06 18:44:51,270][model8_pretrain.py][INFO] Epoch:[0/2](914900/4588595) loss:2.107 lr:0.0000100 epoch_Time:23390.0min: [2024-01-06 18:44:51,270][model8_pretrain.py][INFO] Epoch:[0/2](914900/4588595) loss:3.214 lr:0.0000100 
epoch_Time:23390.0min: [2024-01-06 18:44:51,270][model8_pretrain.py][INFO] Epoch:[0/2](914900/4588595) loss:2.576 lr:0.0000100 epoch_Time:23390.0min: [2024-01-06 18:44:51,270][model8_pretrain.py][INFO] Epoch:[0/2](914900/4588595) loss:3.017 lr:0.0000100 epoch_Time:23390.0min: [2024-01-06 18:45:28,229][model8_pretrain.py][INFO] Epoch:[0/2](915000/4588595) loss:2.578 lr:0.0000100 epoch_Time:23389.0min: [2024-01-06 18:45:28,229][model8_pretrain.py][INFO] Epoch:[0/2](915000/4588595) loss:3.096 lr:0.0000100 epoch_Time:23389.0min: [2024-01-06 18:45:28,229][model8_pretrain.py][INFO] Epoch:[0/2](915000/4588595) loss:3.205 lr:0.0000100 epoch_Time:23389.0min: [2024-01-06 18:45:28,229][model8_pretrain.py][INFO] Epoch:[0/2](915000/4588595) loss:2.547 lr:0.0000100 epoch_Time:23389.0min: [2024-01-06 18:45:28,229][model8_pretrain.py][INFO] Epoch:[0/2](915000/4588595) loss:2.607 lr:0.0000100 epoch_Time:23389.0min: [2024-01-06 18:45:28,229][model8_pretrain.py][INFO] Epoch:[0/2](915000/4588595) loss:2.983 lr:0.0000100 epoch_Time:23389.0min: [2024-01-06 18:45:28,229][model8_pretrain.py][INFO] Epoch:[0/2](915000/4588595) loss:2.805 lr:0.0000100 epoch_Time:23389.0min: [2024-01-06 18:45:28,229][model8_pretrain.py][INFO] Epoch:[0/2](915000/4588595) loss:2.746 lr:0.0000100 epoch_Time:23389.0min: [2024-01-06 18:46:05,179][model8_pretrain.py][INFO] Epoch:[0/2](915100/4588595) loss:3.036 lr:0.0000100 epoch_Time:23388.0min: [2024-01-06 18:46:05,179][model8_pretrain.py][INFO] Epoch:[0/2](915100/4588595) loss:2.324 lr:0.0000100 epoch_Time:23388.0min: [2024-01-06 18:46:05,179][model8_pretrain.py][INFO] Epoch:[0/2](915100/4588595) loss:2.932 lr:0.0000100 epoch_Time:23388.0min: [2024-01-06 18:46:05,179][model8_pretrain.py][INFO] Epoch:[0/2](915100/4588595) loss:3.314 lr:0.0000100 epoch_Time:23388.0min: [2024-01-06 18:46:05,180][model8_pretrain.py][INFO] Epoch:[0/2](915100/4588595) loss:3.071 lr:0.0000100 epoch_Time:23388.0min: [2024-01-06 18:46:05,180][model8_pretrain.py][INFO] 
Epoch:[0/2](915100/4588595) loss:2.762 lr:0.0000100 epoch_Time:23388.0min: [2024-01-06 18:46:05,180][model8_pretrain.py][INFO] Epoch:[0/2](915100/4588595) loss:3.081 lr:0.0000100 epoch_Time:23388.0min: [2024-01-06 18:46:05,180][model8_pretrain.py][INFO] Epoch:[0/2](915100/4588595) loss:2.836 lr:0.0000100 epoch_Time:23388.0min: [2024-01-06 18:46:42,132][model8_pretrain.py][INFO] Epoch:[0/2](915200/4588595) loss:1.937 lr:0.0000100 epoch_Time:23388.0min: [2024-01-06 18:46:42,132][model8_pretrain.py][INFO] Epoch:[0/2](915200/4588595) loss:2.781 lr:0.0000100 epoch_Time:23388.0min: [2024-01-06 18:46:42,132][model8_pretrain.py][INFO] Epoch:[0/2](915200/4588595) loss:2.317 lr:0.0000100 epoch_Time:23388.0min: [2024-01-06 18:46:42,132][model8_pretrain.py][INFO] Epoch:[0/2](915200/4588595) loss:3.112 lr:0.0000100 epoch_Time:23388.0min: [2024-01-06 18:46:42,132][model8_pretrain.py][INFO] Epoch:[0/2](915200/4588595) loss:3.058 lr:0.0000100 epoch_Time:23388.0min: [2024-01-06 18:46:42,132][model8_pretrain.py][INFO] Epoch:[0/2](915200/4588595) loss:2.736 lr:0.0000100 epoch_Time:23388.0min: [2024-01-06 18:46:42,132][model8_pretrain.py][INFO] Epoch:[0/2](915200/4588595) loss:2.386 lr:0.0000100 epoch_Time:23388.0min: [2024-01-06 18:46:42,132][model8_pretrain.py][INFO] Epoch:[0/2](915200/4588595) loss:2.424 lr:0.0000100 epoch_Time:23388.0min: [2024-01-06 18:47:29,463][model8_pretrain.py][INFO] Epoch:[0/2](915300/4588595) loss:2.462 lr:0.0000100 epoch_Time:23388.0min: [2024-01-06 18:47:29,463][model8_pretrain.py][INFO] Epoch:[0/2](915300/4588595) loss:3.170 lr:0.0000100 epoch_Time:23388.0min: [2024-01-06 18:47:29,463][model8_pretrain.py][INFO] Epoch:[0/2](915300/4588595) loss:2.336 lr:0.0000100 epoch_Time:23388.0min: [2024-01-06 18:47:29,463][model8_pretrain.py][INFO] Epoch:[0/2](915300/4588595) loss:2.488 lr:0.0000100 epoch_Time:23388.0min: [2024-01-06 18:47:29,463][model8_pretrain.py][INFO] Epoch:[0/2](915300/4588595) loss:2.651 lr:0.0000100 epoch_Time:23388.0min: [2024-01-06 
18:47:29,463][model8_pretrain.py][INFO] Epoch:[0/2](915300/4588595) loss:2.870 lr:0.0000100 epoch_Time:23388.0min: [2024-01-06 18:47:29,463][model8_pretrain.py][INFO] Epoch:[0/2](915300/4588595) loss:2.813 lr:0.0000100 epoch_Time:23388.0min: [2024-01-06 18:47:29,463][model8_pretrain.py][INFO] Epoch:[0/2](915300/4588595) loss:3.346 lr:0.0000100 epoch_Time:23388.0min: [2024-01-06 18:48:09,981][model8_pretrain.py][INFO] Epoch:[0/2](915400/4588595) loss:2.631 lr:0.0000100 epoch_Time:23387.0min: [2024-01-06 18:48:09,981][model8_pretrain.py][INFO] Epoch:[0/2](915400/4588595) loss:2.583 lr:0.0000100 epoch_Time:23387.0min: [2024-01-06 18:48:09,981][model8_pretrain.py][INFO] Epoch:[0/2](915400/4588595) loss:2.623 lr:0.0000100 epoch_Time:23387.0min: [2024-01-06 18:48:09,981][model8_pretrain.py][INFO] Epoch:[0/2](915400/4588595) loss:3.410 lr:0.0000100 epoch_Time:23387.0min: [2024-01-06 18:48:09,981][model8_pretrain.py][INFO] Epoch:[0/2](915400/4588595) loss:3.489 lr:0.0000100 epoch_Time:23387.0min: [2024-01-06 18:48:09,981][model8_pretrain.py][INFO] Epoch:[0/2](915400/4588595) loss:3.269 lr:0.0000100 epoch_Time:23387.0min: [2024-01-06 18:48:09,981][model8_pretrain.py][INFO] Epoch:[0/2](915400/4588595) loss:3.123 lr:0.0000100 epoch_Time:23387.0min: [2024-01-06 18:48:09,981][model8_pretrain.py][INFO] Epoch:[0/2](915400/4588595) loss:2.181 lr:0.0000100 epoch_Time:23387.0min: [2024-01-06 18:48:46,926][model8_pretrain.py][INFO] Epoch:[0/2](915500/4588595) loss:2.653 lr:0.0000100 epoch_Time:23387.0min: [2024-01-06 18:48:46,926][model8_pretrain.py][INFO] Epoch:[0/2](915500/4588595) loss:2.802 lr:0.0000100 epoch_Time:23387.0min: [2024-01-06 18:48:46,926][model8_pretrain.py][INFO] Epoch:[0/2](915500/4588595) loss:3.093 lr:0.0000100 epoch_Time:23387.0min: [2024-01-06 18:48:46,926][model8_pretrain.py][INFO] Epoch:[0/2](915500/4588595) loss:2.690 lr:0.0000100 epoch_Time:23387.0min: [2024-01-06 18:48:46,926][model8_pretrain.py][INFO] Epoch:[0/2](915500/4588595) loss:3.194 lr:0.0000100 
epoch_Time:23387.0min: [2024-01-06 18:48:46,926][model8_pretrain.py][INFO] Epoch:[0/2](915500/4588595) loss:2.949 lr:0.0000100 epoch_Time:23387.0min: [2024-01-06 18:48:46,926][model8_pretrain.py][INFO] Epoch:[0/2](915500/4588595) loss:3.115 lr:0.0000100 epoch_Time:23387.0min: [2024-01-06 18:48:46,926][model8_pretrain.py][INFO] Epoch:[0/2](915500/4588595) loss:2.359 lr:0.0000100 epoch_Time:23387.0min: [2024-01-06 18:49:23,868][model8_pretrain.py][INFO] Epoch:[0/2](915600/4588595) loss:2.878 lr:0.0000100 epoch_Time:23386.0min: [2024-01-06 18:49:23,868][model8_pretrain.py][INFO] Epoch:[0/2](915600/4588595) loss:3.151 lr:0.0000100 epoch_Time:23386.0min: [2024-01-06 18:49:23,868][model8_pretrain.py][INFO] Epoch:[0/2](915600/4588595) loss:2.748 lr:0.0000100 epoch_Time:23386.0min: [2024-01-06 18:49:23,868][model8_pretrain.py][INFO] Epoch:[0/2](915600/4588595) loss:2.746 lr:0.0000100 epoch_Time:23386.0min: [2024-01-06 18:49:23,868][model8_pretrain.py][INFO] Epoch:[0/2](915600/4588595) loss:2.825 lr:0.0000100 epoch_Time:23386.0min: [2024-01-06 18:49:23,868][model8_pretrain.py][INFO] Epoch:[0/2](915600/4588595) loss:2.745 lr:0.0000100 epoch_Time:23386.0min: [2024-01-06 18:49:23,868][model8_pretrain.py][INFO] Epoch:[0/2](915600/4588595) loss:2.431 lr:0.0000100 epoch_Time:23386.0min: [2024-01-06 18:49:23,869][model8_pretrain.py][INFO] Epoch:[0/2](915600/4588595) loss:2.764 lr:0.0000100 epoch_Time:23386.0min: [2024-01-06 18:50:00,819][model8_pretrain.py][INFO] Epoch:[0/2](915700/4588595) loss:2.449 lr:0.0000100 epoch_Time:23385.0min: [2024-01-06 18:50:00,819][model8_pretrain.py][INFO] Epoch:[0/2](915700/4588595) loss:3.056 lr:0.0000100 epoch_Time:23385.0min: [2024-01-06 18:50:00,819][model8_pretrain.py][INFO] Epoch:[0/2](915700/4588595) loss:2.543 lr:0.0000100 epoch_Time:23385.0min: [2024-01-06 18:50:00,819][model8_pretrain.py][INFO] Epoch:[0/2](915700/4588595) loss:2.715 lr:0.0000100 epoch_Time:23385.0min: [2024-01-06 18:50:00,819][model8_pretrain.py][INFO] 
Epoch:[0/2](915700/4588595) loss:2.963 lr:0.0000100 epoch_Time:23385.0min: [2024-01-06 18:50:00,819][model8_pretrain.py][INFO] Epoch:[0/2](915700/4588595) loss:3.072 lr:0.0000100 epoch_Time:23385.0min: [2024-01-06 18:50:00,819][model8_pretrain.py][INFO] Epoch:[0/2](915700/4588595) loss:2.857 lr:0.0000100 epoch_Time:23385.0min: [2024-01-06 18:50:00,819][model8_pretrain.py][INFO] Epoch:[0/2](915700/4588595) loss:2.953 lr:0.0000100 epoch_Time:23385.0min: [2024-01-06 18:50:37,771][model8_pretrain.py][INFO] Epoch:[0/2](915800/4588595) loss:2.419 lr:0.0000100 epoch_Time:23385.0min: [2024-01-06 18:50:37,771][model8_pretrain.py][INFO] Epoch:[0/2](915800/4588595) loss:2.793 lr:0.0000100 epoch_Time:23385.0min: [2024-01-06 18:50:37,771][model8_pretrain.py][INFO] Epoch:[0/2](915800/4588595) loss:2.348 lr:0.0000100 epoch_Time:23385.0min: [2024-01-06 18:50:37,771][model8_pretrain.py][INFO] Epoch:[0/2](915800/4588595) loss:2.397 lr:0.0000100 epoch_Time:23385.0min: [2024-01-06 18:50:37,771][model8_pretrain.py][INFO] Epoch:[0/2](915800/4588595) loss:3.014 lr:0.0000100 epoch_Time:23385.0min: [2024-01-06 18:50:37,771][model8_pretrain.py][INFO] Epoch:[0/2](915800/4588595) loss:2.515 lr:0.0000100 epoch_Time:23385.0min: [2024-01-06 18:50:37,772][model8_pretrain.py][INFO] Epoch:[0/2](915800/4588595) loss:3.266 lr:0.0000100 epoch_Time:23385.0min: [2024-01-06 18:50:37,772][model8_pretrain.py][INFO] Epoch:[0/2](915800/4588595) loss:3.226 lr:0.0000100 epoch_Time:23385.0min: [2024-01-06 18:51:14,724][model8_pretrain.py][INFO] Epoch:[0/2](915900/4588595) loss:2.865 lr:0.0000100 epoch_Time:23384.0min: [2024-01-06 18:51:14,724][model8_pretrain.py][INFO] Epoch:[0/2](915900/4588595) loss:2.778 lr:0.0000100 epoch_Time:23384.0min: [2024-01-06 18:51:14,724][model8_pretrain.py][INFO] Epoch:[0/2](915900/4588595) loss:2.814 lr:0.0000100 epoch_Time:23384.0min: [2024-01-06 18:51:14,724][model8_pretrain.py][INFO] Epoch:[0/2](915900/4588595) loss:3.006 lr:0.0000100 epoch_Time:23384.0min: [2024-01-06 
18:51:14,724][model8_pretrain.py][INFO] Epoch:[0/2](915900/4588595) loss:2.818 lr:0.0000100 epoch_Time:23384.0min: [2024-01-06 18:51:14,725][model8_pretrain.py][INFO] Epoch:[0/2](915900/4588595) loss:2.757 lr:0.0000100 epoch_Time:23384.0min: [2024-01-06 18:51:14,725][model8_pretrain.py][INFO] Epoch:[0/2](915900/4588595) loss:2.974 lr:0.0000100 epoch_Time:23384.0min: [2024-01-06 18:51:14,725][model8_pretrain.py][INFO] Epoch:[0/2](915900/4588595) loss:2.414 lr:0.0000100 epoch_Time:23384.0min: [2024-01-06 18:51:51,669][model8_pretrain.py][INFO] Epoch:[0/2](916000/4588595) loss:2.825 lr:0.0000100 epoch_Time:23383.0min: [2024-01-06 18:51:51,669][model8_pretrain.py][INFO] Epoch:[0/2](916000/4588595) loss:2.957 lr:0.0000100 epoch_Time:23383.0min: [2024-01-06 18:51:51,669][model8_pretrain.py][INFO] Epoch:[0/2](916000/4588595) loss:2.618 lr:0.0000100 epoch_Time:23383.0min: [2024-01-06 18:51:51,669][model8_pretrain.py][INFO] Epoch:[0/2](916000/4588595) loss:3.033 lr:0.0000100 epoch_Time:23383.0min: [2024-01-06 18:51:51,669][model8_pretrain.py][INFO] Epoch:[0/2](916000/4588595) loss:2.936 lr:0.0000100 epoch_Time:23383.0min: [2024-01-06 18:51:51,669][model8_pretrain.py][INFO] Epoch:[0/2](916000/4588595) loss:2.486 lr:0.0000100 epoch_Time:23383.0min: [2024-01-06 18:51:51,669][model8_pretrain.py][INFO] Epoch:[0/2](916000/4588595) loss:2.766 lr:0.0000100 epoch_Time:23383.0min: [2024-01-06 18:51:51,670][model8_pretrain.py][INFO] Epoch:[0/2](916000/4588595) loss:3.061 lr:0.0000100 epoch_Time:23383.0min: [2024-01-06 18:52:37,192][model8_pretrain.py][INFO] Epoch:[0/2](916100/4588595) loss:3.148 lr:0.0000100 epoch_Time:23383.0min: [2024-01-06 18:52:37,192][model8_pretrain.py][INFO] Epoch:[0/2](916100/4588595) loss:3.014 lr:0.0000100 epoch_Time:23383.0min: [2024-01-06 18:52:37,192][model8_pretrain.py][INFO] Epoch:[0/2](916100/4588595) loss:3.024 lr:0.0000100 epoch_Time:23383.0min: [2024-01-06 18:52:37,192][model8_pretrain.py][INFO] Epoch:[0/2](916100/4588595) loss:3.247 lr:0.0000100 
epoch_Time:23383.0min: [2024-01-06 18:52:37,192][model8_pretrain.py][INFO] Epoch:[0/2](916100/4588595) loss:2.989 lr:0.0000100 epoch_Time:23383.0min: [2024-01-06 18:52:37,192][model8_pretrain.py][INFO] Epoch:[0/2](916100/4588595) loss:2.774 lr:0.0000100 epoch_Time:23383.0min: [2024-01-06 18:52:37,193][model8_pretrain.py][INFO] Epoch:[0/2](916100/4588595) loss:2.773 lr:0.0000100 epoch_Time:23383.0min: [2024-01-06 18:52:37,197][model8_pretrain.py][INFO] Epoch:[0/2](916100/4588595) loss:2.921 lr:0.0000100 epoch_Time:23383.0min: [2024-01-06 18:53:19,316][model8_pretrain.py][INFO] Epoch:[0/2](916200/4588595) loss:2.996 lr:0.0000100 epoch_Time:23382.0min: [2024-01-06 18:53:19,316][model8_pretrain.py][INFO] Epoch:[0/2](916200/4588595) loss:2.811 lr:0.0000100 epoch_Time:23382.0min: [2024-01-06 18:53:19,316][model8_pretrain.py][INFO] Epoch:[0/2](916200/4588595) loss:2.861 lr:0.0000100 epoch_Time:23382.0min: [2024-01-06 18:53:19,316][model8_pretrain.py][INFO] Epoch:[0/2](916200/4588595) loss:2.186 lr:0.0000100 epoch_Time:23382.0min: [2024-01-06 18:53:19,316][model8_pretrain.py][INFO] Epoch:[0/2](916200/4588595) loss:2.953 lr:0.0000100 epoch_Time:23382.0min: [2024-01-06 18:53:19,316][model8_pretrain.py][INFO] Epoch:[0/2](916200/4588595) loss:2.915 lr:0.0000100 epoch_Time:23382.0min: [2024-01-06 18:53:19,317][model8_pretrain.py][INFO] Epoch:[0/2](916200/4588595) loss:2.791 lr:0.0000100 epoch_Time:23382.0min: [2024-01-06 18:53:19,317][model8_pretrain.py][INFO] Epoch:[0/2](916200/4588595) loss:2.742 lr:0.0000100 epoch_Time:23382.0min: [2024-01-06 18:53:56,268][model8_pretrain.py][INFO] Epoch:[0/2](916300/4588595) loss:2.468 lr:0.0000100 epoch_Time:23381.0min: [2024-01-06 18:53:56,268][model8_pretrain.py][INFO] Epoch:[0/2](916300/4588595) loss:2.367 lr:0.0000100 epoch_Time:23381.0min: [2024-01-06 18:53:56,268][model8_pretrain.py][INFO] Epoch:[0/2](916300/4588595) loss:2.875 lr:0.0000100 epoch_Time:23381.0min: [2024-01-06 18:53:56,268][model8_pretrain.py][INFO] 
Epoch:[0/2](916300/4588595) loss:2.303 lr:0.0000100 epoch_Time:23381.0min: [2024-01-06 18:53:56,268][model8_pretrain.py][INFO] Epoch:[0/2](916300/4588595) loss:2.817 lr:0.0000100 epoch_Time:23381.0min: [2024-01-06 18:53:56,268][model8_pretrain.py][INFO] Epoch:[0/2](916300/4588595) loss:2.827 lr:0.0000100 epoch_Time:23381.0min: [2024-01-06 18:53:56,268][model8_pretrain.py][INFO] Epoch:[0/2](916300/4588595) loss:3.019 lr:0.0000100 epoch_Time:23381.0min: [2024-01-06 18:53:56,269][model8_pretrain.py][INFO] Epoch:[0/2](916300/4588595) loss:2.758 lr:0.0000100 epoch_Time:23381.0min: [2024-01-06 18:54:33,219][model8_pretrain.py][INFO] Epoch:[0/2](916400/4588595) loss:2.876 lr:0.0000100 epoch_Time:23381.0min: [2024-01-06 18:54:33,219][model8_pretrain.py][INFO] Epoch:[0/2](916400/4588595) loss:2.836 lr:0.0000100 epoch_Time:23381.0min: [2024-01-06 18:54:33,219][model8_pretrain.py][INFO] Epoch:[0/2](916400/4588595) loss:2.439 lr:0.0000100 epoch_Time:23381.0min: [2024-01-06 18:54:33,219][model8_pretrain.py][INFO] Epoch:[0/2](916400/4588595) loss:2.883 lr:0.0000100 epoch_Time:23381.0min: [2024-01-06 18:54:33,219][model8_pretrain.py][INFO] Epoch:[0/2](916400/4588595) loss:2.738 lr:0.0000100 epoch_Time:23381.0min: [2024-01-06 18:54:33,219][model8_pretrain.py][INFO] Epoch:[0/2](916400/4588595) loss:2.600 lr:0.0000100 epoch_Time:23381.0min: [2024-01-06 18:54:33,219][model8_pretrain.py][INFO] Epoch:[0/2](916400/4588595) loss:2.393 lr:0.0000100 epoch_Time:23381.0min: [2024-01-06 18:54:33,219][model8_pretrain.py][INFO] Epoch:[0/2](916400/4588595) loss:2.925 lr:0.0000100 epoch_Time:23381.0min: [2024-01-06 18:55:10,169][model8_pretrain.py][INFO] Epoch:[0/2](916500/4588595) loss:2.654 lr:0.0000100 epoch_Time:23380.0min: [2024-01-06 18:55:10,169][model8_pretrain.py][INFO] Epoch:[0/2](916500/4588595) loss:2.804 lr:0.0000100 epoch_Time:23380.0min: [2024-01-06 18:55:10,169][model8_pretrain.py][INFO] Epoch:[0/2](916500/4588595) loss:2.876 lr:0.0000100 epoch_Time:23380.0min: [2024-01-06 
18:55:10,169][model8_pretrain.py][INFO] Epoch:[0/2](916500/4588595) loss:2.837 lr:0.0000100 epoch_Time:23380.0min: [2024-01-06 18:55:10,169][model8_pretrain.py][INFO] Epoch:[0/2](916500/4588595) loss:2.881 lr:0.0000100 epoch_Time:23380.0min: [2024-01-06 18:55:10,169][model8_pretrain.py][INFO] Epoch:[0/2](916500/4588595) loss:3.179 lr:0.0000100 epoch_Time:23380.0min: [2024-01-06 18:55:10,169][model8_pretrain.py][INFO] Epoch:[0/2](916500/4588595) loss:2.889 lr:0.0000100 epoch_Time:23380.0min: [2024-01-06 18:55:10,169][model8_pretrain.py][INFO] Epoch:[0/2](916500/4588595) loss:2.743 lr:0.0000100 epoch_Time:23380.0min: [2024-01-06 18:55:47,113][model8_pretrain.py][INFO] Epoch:[0/2](916600/4588595) loss:2.675 lr:0.0000100 epoch_Time:23380.0min: [2024-01-06 18:55:47,114][model8_pretrain.py][INFO] Epoch:[0/2](916600/4588595) loss:2.756 lr:0.0000100 epoch_Time:23380.0min: [2024-01-06 18:55:47,114][model8_pretrain.py][INFO] Epoch:[0/2](916600/4588595) loss:2.923 lr:0.0000100 epoch_Time:23380.0min: [2024-01-06 18:55:47,114][model8_pretrain.py][INFO] Epoch:[0/2](916600/4588595) loss:2.830 lr:0.0000100 epoch_Time:23380.0min: [2024-01-06 18:55:47,114][model8_pretrain.py][INFO] Epoch:[0/2](916600/4588595) loss:2.788 lr:0.0000100 epoch_Time:23380.0min: [2024-01-06 18:55:47,114][model8_pretrain.py][INFO] Epoch:[0/2](916600/4588595) loss:2.510 lr:0.0000100 epoch_Time:23380.0min: [2024-01-06 18:55:47,114][model8_pretrain.py][INFO] Epoch:[0/2](916600/4588595) loss:3.024 lr:0.0000100 epoch_Time:23380.0min: [2024-01-06 18:55:47,114][model8_pretrain.py][INFO] Epoch:[0/2](916600/4588595) loss:2.416 lr:0.0000100 epoch_Time:23380.0min: [2024-01-06 18:56:24,067][model8_pretrain.py][INFO] Epoch:[0/2](916700/4588595) loss:2.927 lr:0.0000100 epoch_Time:23379.0min: [2024-01-06 18:56:24,067][model8_pretrain.py][INFO] Epoch:[0/2](916700/4588595) loss:3.196 lr:0.0000100 epoch_Time:23379.0min: [2024-01-06 18:56:24,067][model8_pretrain.py][INFO] Epoch:[0/2](916700/4588595) loss:2.801 lr:0.0000100 
epoch_Time:23379.0min: [2024-01-06 18:56:24,067][model8_pretrain.py][INFO] Epoch:[0/2](916700/4588595) loss:3.097 lr:0.0000100 epoch_Time:23379.0min: [2024-01-06 18:56:24,067][model8_pretrain.py][INFO] Epoch:[0/2](916700/4588595) loss:2.207 lr:0.0000100 epoch_Time:23379.0min: [2024-01-06 18:56:24,067][model8_pretrain.py][INFO] Epoch:[0/2](916700/4588595) loss:3.289 lr:0.0000100 epoch_Time:23379.0min: [2024-01-06 18:56:24,067][model8_pretrain.py][INFO] Epoch:[0/2](916700/4588595) loss:2.995 lr:0.0000100 epoch_Time:23379.0min: [2024-01-06 18:56:24,067][model8_pretrain.py][INFO] Epoch:[0/2](916700/4588595) loss:3.017 lr:0.0000100 epoch_Time:23379.0min: [2024-01-06 18:57:01,019][model8_pretrain.py][INFO] Epoch:[0/2](916800/4588595) loss:2.492 lr:0.0000100 epoch_Time:23378.0min: [2024-01-06 18:57:01,019][model8_pretrain.py][INFO] Epoch:[0/2](916800/4588595) loss:2.538 lr:0.0000100 epoch_Time:23378.0min: [2024-01-06 18:57:01,019][model8_pretrain.py][INFO] Epoch:[0/2](916800/4588595) loss:2.677 lr:0.0000100 epoch_Time:23378.0min: [2024-01-06 18:57:01,019][model8_pretrain.py][INFO] Epoch:[0/2](916800/4588595) loss:3.418 lr:0.0000100 epoch_Time:23378.0min: [2024-01-06 18:57:01,019][model8_pretrain.py][INFO] Epoch:[0/2](916800/4588595) loss:3.414 lr:0.0000100 epoch_Time:23378.0min: [2024-01-06 18:57:01,019][model8_pretrain.py][INFO] Epoch:[0/2](916800/4588595) loss:2.751 lr:0.0000100 epoch_Time:23378.0min: [2024-01-06 18:57:01,019][model8_pretrain.py][INFO] Epoch:[0/2](916800/4588595) loss:2.964 lr:0.0000100 epoch_Time:23378.0min: [2024-01-06 18:57:01,019][model8_pretrain.py][INFO] Epoch:[0/2](916800/4588595) loss:2.946 lr:0.0000100 epoch_Time:23378.0min: [2024-01-06 18:57:44,784][model8_pretrain.py][INFO] Epoch:[0/2](916900/4588595) loss:3.316 lr:0.0000100 epoch_Time:23378.0min: [2024-01-06 18:57:44,784][model8_pretrain.py][INFO] Epoch:[0/2](916900/4588595) loss:2.861 lr:0.0000100 epoch_Time:23378.0min: [2024-01-06 18:57:44,784][model8_pretrain.py][INFO] 
Epoch:[0/2](916900/4588595) loss:2.419 lr:0.0000100 epoch_Time:23378.0min: [2024-01-06 18:57:44,784][model8_pretrain.py][INFO] Epoch:[0/2](916900/4588595) loss:2.716 lr:0.0000100 epoch_Time:23378.0min: [2024-01-06 18:57:44,784][model8_pretrain.py][INFO] Epoch:[0/2](916900/4588595) loss:2.028 lr:0.0000100 epoch_Time:23378.0min: [2024-01-06 18:57:44,784][model8_pretrain.py][INFO] Epoch:[0/2](916900/4588595) loss:2.835 lr:0.0000100 epoch_Time:23378.0min: [2024-01-06 18:57:44,784][model8_pretrain.py][INFO] Epoch:[0/2](916900/4588595) loss:3.035 lr:0.0000100 epoch_Time:23378.0min: [2024-01-06 18:57:44,784][model8_pretrain.py][INFO] Epoch:[0/2](916900/4588595) loss:2.875 lr:0.0000100 epoch_Time:23378.0min: [2024-01-06 18:58:28,513][model8_pretrain.py][INFO] Epoch:[0/2](917000/4588595) loss:3.164 lr:0.0000100 epoch_Time:23378.0min: [2024-01-06 18:58:28,513][model8_pretrain.py][INFO] Epoch:[0/2](917000/4588595) loss:2.957 lr:0.0000100 epoch_Time:23378.0min: [2024-01-06 18:58:28,513][model8_pretrain.py][INFO] Epoch:[0/2](917000/4588595) loss:2.734 lr:0.0000100 epoch_Time:23378.0min: [2024-01-06 18:58:28,513][model8_pretrain.py][INFO] Epoch:[0/2](917000/4588595) loss:3.250 lr:0.0000100 epoch_Time:23378.0min: [2024-01-06 18:58:28,513][model8_pretrain.py][INFO] Epoch:[0/2](917000/4588595) loss:2.594 lr:0.0000100 epoch_Time:23378.0min: [2024-01-06 18:58:28,513][model8_pretrain.py][INFO] Epoch:[0/2](917000/4588595) loss:2.845 lr:0.0000100 epoch_Time:23378.0min: [2024-01-06 18:58:28,513][model8_pretrain.py][INFO] Epoch:[0/2](917000/4588595) loss:2.481 lr:0.0000100 epoch_Time:23378.0min: [2024-01-06 18:58:28,513][model8_pretrain.py][INFO] Epoch:[0/2](917000/4588595) loss:2.761 lr:0.0000100 epoch_Time:23378.0min: [2024-01-06 18:59:05,452][model8_pretrain.py][INFO] Epoch:[0/2](917100/4588595) loss:2.830 lr:0.0000100 epoch_Time:23377.0min: [2024-01-06 18:59:05,452][model8_pretrain.py][INFO] Epoch:[0/2](917100/4588595) loss:3.072 lr:0.0000100 epoch_Time:23377.0min: [2024-01-06 
18:59:05,452][model8_pretrain.py][INFO] Epoch:[0/2](917100/4588595) loss:2.920 lr:0.0000100 epoch_Time:23377.0min: [2024-01-06 18:59:05,452][model8_pretrain.py][INFO] Epoch:[0/2](917100/4588595) loss:2.323 lr:0.0000100 epoch_Time:23377.0min: [2024-01-06 18:59:05,452][model8_pretrain.py][INFO] Epoch:[0/2](917100/4588595) loss:3.147 lr:0.0000100 epoch_Time:23377.0min: [2024-01-06 18:59:05,452][model8_pretrain.py][INFO] Epoch:[0/2](917100/4588595) loss:3.371 lr:0.0000100 epoch_Time:23377.0min: [2024-01-06 18:59:05,452][model8_pretrain.py][INFO] Epoch:[0/2](917100/4588595) loss:2.243 lr:0.0000100 epoch_Time:23377.0min: [2024-01-06 18:59:05,453][model8_pretrain.py][INFO] Epoch:[0/2](917100/4588595) loss:3.065 lr:0.0000100 epoch_Time:23377.0min: [2024-01-06 18:59:42,374][model8_pretrain.py][INFO] Epoch:[0/2](917200/4588595) loss:3.034 lr:0.0000100 epoch_Time:23377.0min: [2024-01-06 18:59:42,374][model8_pretrain.py][INFO] Epoch:[0/2](917200/4588595) loss:2.664 lr:0.0000100 epoch_Time:23377.0min: [2024-01-06 18:59:42,374][model8_pretrain.py][INFO] Epoch:[0/2](917200/4588595) loss:2.924 lr:0.0000100 epoch_Time:23377.0min: [2024-01-06 18:59:42,374][model8_pretrain.py][INFO] Epoch:[0/2](917200/4588595) loss:2.869 lr:0.0000100 epoch_Time:23377.0min: [2024-01-06 18:59:42,374][model8_pretrain.py][INFO] Epoch:[0/2](917200/4588595) loss:3.289 lr:0.0000100 epoch_Time:23377.0min: [2024-01-06 18:59:42,374][model8_pretrain.py][INFO] Epoch:[0/2](917200/4588595) loss:2.807 lr:0.0000100 epoch_Time:23377.0min: [2024-01-06 18:59:42,374][model8_pretrain.py][INFO] Epoch:[0/2](917200/4588595) loss:2.153 lr:0.0000100 epoch_Time:23377.0min: [2024-01-06 18:59:42,374][model8_pretrain.py][INFO] Epoch:[0/2](917200/4588595) loss:2.713 lr:0.0000100 epoch_Time:23377.0min: [2024-01-06 19:00:19,309][model8_pretrain.py][INFO] Epoch:[0/2](917300/4588595) loss:3.232 lr:0.0000100 epoch_Time:23375.0min: [2024-01-06 19:00:19,309][model8_pretrain.py][INFO] Epoch:[0/2](917300/4588595) loss:3.058 lr:0.0000100 
epoch_Time:23375.0min: [2024-01-06 19:00:19,309][model8_pretrain.py][INFO] Epoch:[0/2](917300/4588595) loss:2.636 lr:0.0000100 epoch_Time:23375.0min: [2024-01-06 19:00:19,309][model8_pretrain.py][INFO] Epoch:[0/2](917300/4588595) loss:2.970 lr:0.0000100 epoch_Time:23375.0min: [2024-01-06 19:00:19,309][model8_pretrain.py][INFO] Epoch:[0/2](917300/4588595) loss:3.199 lr:0.0000100 epoch_Time:23375.0min: [2024-01-06 19:00:19,309][model8_pretrain.py][INFO] Epoch:[0/2](917300/4588595) loss:2.714 lr:0.0000100 epoch_Time:23375.0min: [2024-01-06 19:00:19,309][model8_pretrain.py][INFO] Epoch:[0/2](917300/4588595) loss:2.986 lr:0.0000100 epoch_Time:23375.0min: [2024-01-06 19:00:19,309][model8_pretrain.py][INFO] Epoch:[0/2](917300/4588595) loss:2.588 lr:0.0000100 epoch_Time:23375.0min: [2024-01-06 19:00:56,246][model8_pretrain.py][INFO] Epoch:[0/2](917400/4588595) loss:3.119 lr:0.0000100 epoch_Time:23374.0min: [2024-01-06 19:00:56,246][model8_pretrain.py][INFO] Epoch:[0/2](917400/4588595) loss:2.623 lr:0.0000100 epoch_Time:23374.0min: [2024-01-06 19:00:56,246][model8_pretrain.py][INFO] Epoch:[0/2](917400/4588595) loss:2.839 lr:0.0000100 epoch_Time:23374.0min: [2024-01-06 19:00:56,246][model8_pretrain.py][INFO] Epoch:[0/2](917400/4588595) loss:2.231 lr:0.0000100 epoch_Time:23374.0min: [2024-01-06 19:00:56,246][model8_pretrain.py][INFO] Epoch:[0/2](917400/4588595) loss:2.719 lr:0.0000100 epoch_Time:23374.0min: [2024-01-06 19:00:56,246][model8_pretrain.py][INFO] Epoch:[0/2](917400/4588595) loss:2.654 lr:0.0000100 epoch_Time:23374.0min: [2024-01-06 19:00:56,246][model8_pretrain.py][INFO] Epoch:[0/2](917400/4588595) loss:2.718 lr:0.0000100 epoch_Time:23374.0min: [2024-01-06 19:00:56,246][model8_pretrain.py][INFO] Epoch:[0/2](917400/4588595) loss:3.545 lr:0.0000100 epoch_Time:23374.0min: [2024-01-06 19:01:33,194][model8_pretrain.py][INFO] Epoch:[0/2](917500/4588595) loss:2.495 lr:0.0000100 epoch_Time:23374.0min: [2024-01-06 19:01:33,194][model8_pretrain.py][INFO] 
Epoch:[0/2](917500/4588595) loss:2.803 lr:0.0000100 epoch_Time:23374.0min: [2024-01-06 19:01:33,194][model8_pretrain.py][INFO] Epoch:[0/2](917500/4588595) loss:3.580 lr:0.0000100 epoch_Time:23374.0min: [2024-01-06 19:01:33,194][model8_pretrain.py][INFO] Epoch:[0/2](917500/4588595) loss:2.731 lr:0.0000100 epoch_Time:23374.0min: [2024-01-06 19:01:33,194][model8_pretrain.py][INFO] Epoch:[0/2](917500/4588595) loss:2.971 lr:0.0000100 epoch_Time:23374.0min: [2024-01-06 19:01:33,194][model8_pretrain.py][INFO] Epoch:[0/2](917500/4588595) loss:3.005 lr:0.0000100 epoch_Time:23374.0min: [2024-01-06 19:01:33,194][model8_pretrain.py][INFO] Epoch:[0/2](917500/4588595) loss:2.390 lr:0.0000100 epoch_Time:23374.0min: [2024-01-06 19:01:33,194][model8_pretrain.py][INFO] Epoch:[0/2](917500/4588595) loss:3.141 lr:0.0000100 epoch_Time:23374.0min: [2024-01-06 19:02:10,156][model8_pretrain.py][INFO] Epoch:[0/2](917600/4588595) loss:2.697 lr:0.0000100 epoch_Time:23373.0min: [2024-01-06 19:02:10,156][model8_pretrain.py][INFO] Epoch:[0/2](917600/4588595) loss:2.575 lr:0.0000100 epoch_Time:23373.0min: [2024-01-06 19:02:10,156][model8_pretrain.py][INFO] Epoch:[0/2](917600/4588595) loss:2.766 lr:0.0000100 epoch_Time:23373.0min: [2024-01-06 19:02:10,156][model8_pretrain.py][INFO] Epoch:[0/2](917600/4588595) loss:2.912 lr:0.0000100 epoch_Time:23373.0min: [2024-01-06 19:02:10,156][model8_pretrain.py][INFO] Epoch:[0/2](917600/4588595) loss:2.860 lr:0.0000100 epoch_Time:23373.0min: [2024-01-06 19:02:10,156][model8_pretrain.py][INFO] Epoch:[0/2](917600/4588595) loss:2.584 lr:0.0000100 epoch_Time:23373.0min: [2024-01-06 19:02:10,156][model8_pretrain.py][INFO] Epoch:[0/2](917600/4588595) loss:2.768 lr:0.0000100 epoch_Time:23373.0min: [2024-01-06 19:02:10,156][model8_pretrain.py][INFO] Epoch:[0/2](917600/4588595) loss:2.564 lr:0.0000100 epoch_Time:23373.0min: [2024-01-06 19:02:53,718][model8_pretrain.py][INFO] Epoch:[0/2](917700/4588595) loss:3.160 lr:0.0000100 epoch_Time:23373.0min: [2024-01-06 
19:02:53,718][model8_pretrain.py][INFO] Epoch:[0/2](917700/4588595) loss:3.043 lr:0.0000100 epoch_Time:23373.0min: [2024-01-06 19:02:53,718][model8_pretrain.py][INFO] Epoch:[0/2](917700/4588595) loss:3.213 lr:0.0000100 epoch_Time:23373.0min: [2024-01-06 19:02:53,718][model8_pretrain.py][INFO] Epoch:[0/2](917700/4588595) loss:2.831 lr:0.0000100 epoch_Time:23373.0min: [2024-01-06 19:02:53,718][model8_pretrain.py][INFO] Epoch:[0/2](917700/4588595) loss:2.915 lr:0.0000100 epoch_Time:23373.0min: [2024-01-06 19:02:53,718][model8_pretrain.py][INFO] Epoch:[0/2](917700/4588595) loss:2.813 lr:0.0000100 epoch_Time:23373.0min: [2024-01-06 19:02:53,718][model8_pretrain.py][INFO] Epoch:[0/2](917700/4588595) loss:3.358 lr:0.0000100 epoch_Time:23373.0min: [2024-01-06 19:02:53,718][model8_pretrain.py][INFO] Epoch:[0/2](917700/4588595) loss:2.338 lr:0.0000100 epoch_Time:23373.0min: [2024-01-06 19:03:37,264][model8_pretrain.py][INFO] Epoch:[0/2](917800/4588595) loss:2.658 lr:0.0000100 epoch_Time:23373.0min: [2024-01-06 19:03:37,264][model8_pretrain.py][INFO] Epoch:[0/2](917800/4588595) loss:2.891 lr:0.0000100 epoch_Time:23373.0min: [2024-01-06 19:03:37,264][model8_pretrain.py][INFO] Epoch:[0/2](917800/4588595) loss:2.845 lr:0.0000100 epoch_Time:23373.0min: [2024-01-06 19:03:37,264][model8_pretrain.py][INFO] Epoch:[0/2](917800/4588595) loss:2.654 lr:0.0000100 epoch_Time:23373.0min: [2024-01-06 19:03:37,264][model8_pretrain.py][INFO] Epoch:[0/2](917800/4588595) loss:2.582 lr:0.0000100 epoch_Time:23373.0min: [2024-01-06 19:03:37,264][model8_pretrain.py][INFO] Epoch:[0/2](917800/4588595) loss:3.166 lr:0.0000100 epoch_Time:23373.0min: [2024-01-06 19:03:37,265][model8_pretrain.py][INFO] Epoch:[0/2](917800/4588595) loss:2.635 lr:0.0000100 epoch_Time:23373.0min: [2024-01-06 19:03:37,265][model8_pretrain.py][INFO] Epoch:[0/2](917800/4588595) loss:2.108 lr:0.0000100 epoch_Time:23373.0min: [2024-01-06 19:04:14,196][model8_pretrain.py][INFO] Epoch:[0/2](917900/4588595) loss:3.021 lr:0.0000100 
epoch_Time:23372.0min: [2024-01-06 19:04:14,196][model8_pretrain.py][INFO] Epoch:[0/2](917900/4588595) loss:3.010 lr:0.0000100 epoch_Time:23372.0min: [2024-01-06 19:04:14,196][model8_pretrain.py][INFO] Epoch:[0/2](917900/4588595) loss:2.472 lr:0.0000100 epoch_Time:23372.0min: [2024-01-06 19:04:14,196][model8_pretrain.py][INFO] Epoch:[0/2](917900/4588595) loss:2.523 lr:0.0000100 epoch_Time:23372.0min: [2024-01-06 19:04:14,196][model8_pretrain.py][INFO] Epoch:[0/2](917900/4588595) loss:2.976 lr:0.0000100 epoch_Time:23372.0min: [2024-01-06 19:04:14,196][model8_pretrain.py][INFO] Epoch:[0/2](917900/4588595) loss:2.821 lr:0.0000100 epoch_Time:23372.0min: [2024-01-06 19:04:14,197][model8_pretrain.py][INFO] Epoch:[0/2](917900/4588595) loss:2.847 lr:0.0000100 epoch_Time:23372.0min: [2024-01-06 19:04:14,198][model8_pretrain.py][INFO] Epoch:[0/2](917900/4588595) loss:2.646 lr:0.0000100 epoch_Time:23372.0min: [2024-01-06 19:04:51,136][model8_pretrain.py][INFO] Epoch:[0/2](918000/4588595) loss:2.701 lr:0.0000100 epoch_Time:23371.0min: [2024-01-06 19:04:51,136][model8_pretrain.py][INFO] Epoch:[0/2](918000/4588595) loss:3.118 lr:0.0000100 epoch_Time:23371.0min: [2024-01-06 19:04:51,136][model8_pretrain.py][INFO] Epoch:[0/2](918000/4588595) loss:2.892 lr:0.0000100 epoch_Time:23371.0min: [2024-01-06 19:04:51,136][model8_pretrain.py][INFO] Epoch:[0/2](918000/4588595) loss:3.123 lr:0.0000100 epoch_Time:23371.0min: [2024-01-06 19:04:51,136][model8_pretrain.py][INFO] Epoch:[0/2](918000/4588595) loss:3.392 lr:0.0000100 epoch_Time:23371.0min: [2024-01-06 19:04:51,136][model8_pretrain.py][INFO] Epoch:[0/2](918000/4588595) loss:3.196 lr:0.0000100 epoch_Time:23371.0min: [2024-01-06 19:04:51,136][model8_pretrain.py][INFO] Epoch:[0/2](918000/4588595) loss:2.874 lr:0.0000100 epoch_Time:23371.0min: [2024-01-06 19:04:51,136][model8_pretrain.py][INFO] Epoch:[0/2](918000/4588595) loss:2.605 lr:0.0000100 epoch_Time:23371.0min: [2024-01-06 19:05:28,072][model8_pretrain.py][INFO] 
Epoch:[0/2](918100/4588595) loss:3.039 lr:0.0000100 epoch_Time:23371.0min: [2024-01-06 19:05:28,072][model8_pretrain.py][INFO] Epoch:[0/2](918100/4588595) loss:3.300 lr:0.0000100 epoch_Time:23371.0min: [2024-01-06 19:05:28,072][model8_pretrain.py][INFO] Epoch:[0/2](918100/4588595) loss:2.822 lr:0.0000100 epoch_Time:23371.0min: [2024-01-06 19:05:28,072][model8_pretrain.py][INFO] Epoch:[0/2](918100/4588595) loss:2.844 lr:0.0000100 epoch_Time:23371.0min: [2024-01-06 19:05:28,072][model8_pretrain.py][INFO] Epoch:[0/2](918100/4588595) loss:2.492 lr:0.0000100 epoch_Time:23371.0min: [2024-01-06 19:05:28,072][model8_pretrain.py][INFO] Epoch:[0/2](918100/4588595) loss:2.471 lr:0.0000100 epoch_Time:23371.0min: [2024-01-06 19:05:28,072][model8_pretrain.py][INFO] Epoch:[0/2](918100/4588595) loss:3.018 lr:0.0000100 epoch_Time:23371.0min: [2024-01-06 19:05:28,072][model8_pretrain.py][INFO] Epoch:[0/2](918100/4588595) loss:2.794 lr:0.0000100 epoch_Time:23371.0min: [2024-01-06 19:06:05,013][model8_pretrain.py][INFO] Epoch:[0/2](918200/4588595) loss:2.293 lr:0.0000100 epoch_Time:23370.0min: [2024-01-06 19:06:05,013][model8_pretrain.py][INFO] Epoch:[0/2](918200/4588595) loss:2.888 lr:0.0000100 epoch_Time:23370.0min: [2024-01-06 19:06:05,013][model8_pretrain.py][INFO] Epoch:[0/2](918200/4588595) loss:2.582 lr:0.0000100 epoch_Time:23370.0min: [2024-01-06 19:06:05,013][model8_pretrain.py][INFO] Epoch:[0/2](918200/4588595) loss:3.058 lr:0.0000100 epoch_Time:23370.0min: [2024-01-06 19:06:05,013][model8_pretrain.py][INFO] Epoch:[0/2](918200/4588595) loss:2.716 lr:0.0000100 epoch_Time:23370.0min: [2024-01-06 19:06:05,013][model8_pretrain.py][INFO] Epoch:[0/2](918200/4588595) loss:3.024 lr:0.0000100 epoch_Time:23370.0min: [2024-01-06 19:06:05,013][model8_pretrain.py][INFO] Epoch:[0/2](918200/4588595) loss:2.933 lr:0.0000100 epoch_Time:23370.0min: [2024-01-06 19:06:05,013][model8_pretrain.py][INFO] Epoch:[0/2](918200/4588595) loss:2.567 lr:0.0000100 epoch_Time:23370.0min: [2024-01-06 
19:06:41,976][model8_pretrain.py][INFO] Epoch:[0/2](918300/4588595) loss:2.334 lr:0.0000100 epoch_Time:23370.0min: [2024-01-06 19:06:41,976][model8_pretrain.py][INFO] Epoch:[0/2](918300/4588595) loss:2.946 lr:0.0000100 epoch_Time:23370.0min: [2024-01-06 19:06:41,976][model8_pretrain.py][INFO] Epoch:[0/2](918300/4588595) loss:2.850 lr:0.0000100 epoch_Time:23370.0min: [2024-01-06 19:06:41,976][model8_pretrain.py][INFO] Epoch:[0/2](918300/4588595) loss:3.196 lr:0.0000100 epoch_Time:23370.0min: [2024-01-06 19:06:41,976][model8_pretrain.py][INFO] Epoch:[0/2](918300/4588595) loss:2.743 lr:0.0000100 epoch_Time:23370.0min: [2024-01-06 19:06:41,976][model8_pretrain.py][INFO] Epoch:[0/2](918300/4588595) loss:2.486 lr:0.0000100 epoch_Time:23370.0min: [2024-01-06 19:06:41,976][model8_pretrain.py][INFO] Epoch:[0/2](918300/4588595) loss:2.478 lr:0.0000100 epoch_Time:23370.0min: [2024-01-06 19:06:41,976][model8_pretrain.py][INFO] Epoch:[0/2](918300/4588595) loss:3.244 lr:0.0000100 epoch_Time:23370.0min: [2024-01-06 19:07:18,932][model8_pretrain.py][INFO] Epoch:[0/2](918400/4588595) loss:3.374 lr:0.0000100 epoch_Time:23368.0min: [2024-01-06 19:07:18,932][model8_pretrain.py][INFO] Epoch:[0/2](918400/4588595) loss:2.498 lr:0.0000100 epoch_Time:23368.0min: [2024-01-06 19:07:18,932][model8_pretrain.py][INFO] Epoch:[0/2](918400/4588595) loss:2.936 lr:0.0000100 epoch_Time:23368.0min: [2024-01-06 19:07:18,932][model8_pretrain.py][INFO] Epoch:[0/2](918400/4588595) loss:2.085 lr:0.0000100 epoch_Time:23368.0min: [2024-01-06 19:07:18,932][model8_pretrain.py][INFO] Epoch:[0/2](918400/4588595) loss:3.138 lr:0.0000100 epoch_Time:23368.0min: [2024-01-06 19:07:18,932][model8_pretrain.py][INFO] Epoch:[0/2](918400/4588595) loss:2.831 lr:0.0000100 epoch_Time:23368.0min: [2024-01-06 19:07:18,932][model8_pretrain.py][INFO] Epoch:[0/2](918400/4588595) loss:2.869 lr:0.0000100 epoch_Time:23368.0min: [2024-01-06 19:07:18,932][model8_pretrain.py][INFO] Epoch:[0/2](918400/4588595) loss:2.972 lr:0.0000100 
epoch_Time:23368.0min: [2024-01-06 19:08:00,580][model8_pretrain.py][INFO] Epoch:[0/2](918500/4588595) loss:2.907 lr:0.0000100 epoch_Time:23368.0min: [2024-01-06 19:08:00,580][model8_pretrain.py][INFO] Epoch:[0/2](918500/4588595) loss:2.567 lr:0.0000100 epoch_Time:23368.0min: [2024-01-06 19:08:00,580][model8_pretrain.py][INFO] Epoch:[0/2](918500/4588595) loss:2.251 lr:0.0000100 epoch_Time:23368.0min: [2024-01-06 19:08:00,585][model8_pretrain.py][INFO] Epoch:[0/2](918500/4588595) loss:2.149 lr:0.0000100 epoch_Time:23368.0min: [2024-01-06 19:08:00,585][model8_pretrain.py][INFO] Epoch:[0/2](918500/4588595) loss:2.654 lr:0.0000100 epoch_Time:23368.0min: [2024-01-06 19:08:00,585][model8_pretrain.py][INFO] Epoch:[0/2](918500/4588595) loss:3.074 lr:0.0000100 epoch_Time:23368.0min: [2024-01-06 19:08:00,585][model8_pretrain.py][INFO] Epoch:[0/2](918500/4588595) loss:2.664 lr:0.0000100 epoch_Time:23368.0min: [2024-01-06 19:08:02,327][model8_pretrain.py][INFO] Epoch:[0/2](918500/4588595) loss:2.712 lr:0.0000100 epoch_Time:23368.0min: [2024-01-06 19:08:45,879][model8_pretrain.py][INFO] Epoch:[0/2](918600/4588595) loss:3.092 lr:0.0000100 epoch_Time:23368.0min: [2024-01-06 19:08:45,880][model8_pretrain.py][INFO] Epoch:[0/2](918600/4588595) loss:2.695 lr:0.0000100 epoch_Time:23368.0min: [2024-01-06 19:08:45,880][model8_pretrain.py][INFO] Epoch:[0/2](918600/4588595) loss:2.966 lr:0.0000100 epoch_Time:23368.0min: [2024-01-06 19:08:45,880][model8_pretrain.py][INFO] Epoch:[0/2](918600/4588595) loss:3.061 lr:0.0000100 epoch_Time:23368.0min: [2024-01-06 19:08:45,880][model8_pretrain.py][INFO] Epoch:[0/2](918600/4588595) loss:2.966 lr:0.0000100 epoch_Time:23368.0min: [2024-01-06 19:08:45,880][model8_pretrain.py][INFO] Epoch:[0/2](918600/4588595) loss:2.981 lr:0.0000100 epoch_Time:23368.0min: [2024-01-06 19:08:45,880][model8_pretrain.py][INFO] Epoch:[0/2](918600/4588595) loss:2.749 lr:0.0000100 epoch_Time:23368.0min: [2024-01-06 19:08:45,880][model8_pretrain.py][INFO] 
Epoch:[0/2](918600/4588595) loss:2.944 lr:0.0000100 epoch_Time:23368.0min: [2024-01-06 19:09:22,804][model8_pretrain.py][INFO] Epoch:[0/2](918700/4588595) loss:2.228 lr:0.0000100 epoch_Time:23367.0min: [2024-01-06 19:09:22,804][model8_pretrain.py][INFO] Epoch:[0/2](918700/4588595) loss:2.951 lr:0.0000100 epoch_Time:23367.0min: [2024-01-06 19:09:22,805][model8_pretrain.py][INFO] Epoch:[0/2](918700/4588595) loss:3.137 lr:0.0000100 epoch_Time:23367.0min: [2024-01-06 19:09:22,805][model8_pretrain.py][INFO] Epoch:[0/2](918700/4588595) loss:2.680 lr:0.0000100 epoch_Time:23367.0min: [2024-01-06 19:09:22,805][model8_pretrain.py][INFO] Epoch:[0/2](918700/4588595) loss:3.124 lr:0.0000100 epoch_Time:23367.0min: [2024-01-06 19:09:22,804][model8_pretrain.py][INFO] Epoch:[0/2](918700/4588595) loss:3.194 lr:0.0000100 epoch_Time:23367.0min: [2024-01-06 19:09:22,805][model8_pretrain.py][INFO] Epoch:[0/2](918700/4588595) loss:2.666 lr:0.0000100 epoch_Time:23367.0min: [2024-01-06 19:09:22,805][model8_pretrain.py][INFO] Epoch:[0/2](918700/4588595) loss:3.060 lr:0.0000100 epoch_Time:23367.0min: [2024-01-06 19:09:59,738][model8_pretrain.py][INFO] Epoch:[0/2](918800/4588595) loss:2.499 lr:0.0000100 epoch_Time:23366.0min: [2024-01-06 19:09:59,738][model8_pretrain.py][INFO] Epoch:[0/2](918800/4588595) loss:2.798 lr:0.0000100 epoch_Time:23366.0min: [2024-01-06 19:09:59,738][model8_pretrain.py][INFO] Epoch:[0/2](918800/4588595) loss:3.094 lr:0.0000100 epoch_Time:23366.0min: [2024-01-06 19:09:59,738][model8_pretrain.py][INFO] Epoch:[0/2](918800/4588595) loss:3.039 lr:0.0000100 epoch_Time:23366.0min: [2024-01-06 19:09:59,738][model8_pretrain.py][INFO] Epoch:[0/2](918800/4588595) loss:2.649 lr:0.0000100 epoch_Time:23366.0min: [2024-01-06 19:09:59,738][model8_pretrain.py][INFO] Epoch:[0/2](918800/4588595) loss:2.878 lr:0.0000100 epoch_Time:23366.0min: [2024-01-06 19:09:59,738][model8_pretrain.py][INFO] Epoch:[0/2](918800/4588595) loss:2.975 lr:0.0000100 epoch_Time:23366.0min: [2024-01-06 
19:09:59,738][model8_pretrain.py][INFO] Epoch:[0/2](918800/4588595) loss:2.923 lr:0.0000100 epoch_Time:23366.0min: [2024-01-06 19:10:36,670][model8_pretrain.py][INFO] Epoch:[0/2](918900/4588595) loss:2.856 lr:0.0000100 epoch_Time:23366.0min: [2024-01-06 19:10:36,670][model8_pretrain.py][INFO] Epoch:[0/2](918900/4588595) loss:2.947 lr:0.0000100 epoch_Time:23366.0min: [2024-01-06 19:10:36,670][model8_pretrain.py][INFO] Epoch:[0/2](918900/4588595) loss:2.811 lr:0.0000100 epoch_Time:23366.0min: [2024-01-06 19:10:36,670][model8_pretrain.py][INFO] Epoch:[0/2](918900/4588595) loss:3.176 lr:0.0000100 epoch_Time:23366.0min: [2024-01-06 19:10:36,670][model8_pretrain.py][INFO] Epoch:[0/2](918900/4588595) loss:2.084 lr:0.0000100 epoch_Time:23366.0min: [2024-01-06 19:10:36,670][model8_pretrain.py][INFO] Epoch:[0/2](918900/4588595) loss:3.339 lr:0.0000100 epoch_Time:23366.0min: [2024-01-06 19:10:36,670][model8_pretrain.py][INFO] Epoch:[0/2](918900/4588595) loss:3.093 lr:0.0000100 epoch_Time:23366.0min: [2024-01-06 19:10:36,670][model8_pretrain.py][INFO] Epoch:[0/2](918900/4588595) loss:3.145 lr:0.0000100 epoch_Time:23366.0min: [2024-01-06 19:11:13,609][model8_pretrain.py][INFO] Epoch:[0/2](919000/4588595) loss:2.507 lr:0.0000100 epoch_Time:23365.0min: [2024-01-06 19:11:13,609][model8_pretrain.py][INFO] Epoch:[0/2](919000/4588595) loss:2.660 lr:0.0000100 epoch_Time:23365.0min: [2024-01-06 19:11:13,609][model8_pretrain.py][INFO] Epoch:[0/2](919000/4588595) loss:2.805 lr:0.0000100 epoch_Time:23365.0min: [2024-01-06 19:11:13,609][model8_pretrain.py][INFO] Epoch:[0/2](919000/4588595) loss:2.888 lr:0.0000100 epoch_Time:23365.0min: [2024-01-06 19:11:13,609][model8_pretrain.py][INFO] Epoch:[0/2](919000/4588595) loss:2.511 lr:0.0000100 epoch_Time:23365.0min: [2024-01-06 19:11:13,609][model8_pretrain.py][INFO] Epoch:[0/2](919000/4588595) loss:2.609 lr:0.0000100 epoch_Time:23365.0min: [2024-01-06 19:11:13,609][model8_pretrain.py][INFO] Epoch:[0/2](919000/4588595) loss:2.686 lr:0.0000100 
epoch_Time:23365.0min: [2024-01-06 19:11:13,609][model8_pretrain.py][INFO] Epoch:[0/2](919000/4588595) loss:2.642 lr:0.0000100 epoch_Time:23365.0min: [2024-01-06 19:11:50,546][model8_pretrain.py][INFO] Epoch:[0/2](919100/4588595) loss:2.779 lr:0.0000100 epoch_Time:23364.0min: [2024-01-06 19:11:50,546][model8_pretrain.py][INFO] Epoch:[0/2](919100/4588595) loss:2.501 lr:0.0000100 epoch_Time:23364.0min: [2024-01-06 19:11:50,546][model8_pretrain.py][INFO] Epoch:[0/2](919100/4588595) loss:2.505 lr:0.0000100 epoch_Time:23364.0min: [2024-01-06 19:11:50,546][model8_pretrain.py][INFO] Epoch:[0/2](919100/4588595) loss:2.612 lr:0.0000100 epoch_Time:23364.0min: [2024-01-06 19:11:50,546][model8_pretrain.py][INFO] Epoch:[0/2](919100/4588595) loss:2.983 lr:0.0000100 epoch_Time:23364.0min: [2024-01-06 19:11:50,546][model8_pretrain.py][INFO] Epoch:[0/2](919100/4588595) loss:3.267 lr:0.0000100 epoch_Time:23364.0min: [2024-01-06 19:11:50,546][model8_pretrain.py][INFO] Epoch:[0/2](919100/4588595) loss:3.293 lr:0.0000100 epoch_Time:23364.0min: [2024-01-06 19:11:50,546][model8_pretrain.py][INFO] Epoch:[0/2](919100/4588595) loss:2.957 lr:0.0000100 epoch_Time:23364.0min: [2024-01-06 19:12:27,496][model8_pretrain.py][INFO] Epoch:[0/2](919200/4588595) loss:2.902 lr:0.0000100 epoch_Time:23364.0min: [2024-01-06 19:12:27,496][model8_pretrain.py][INFO] Epoch:[0/2](919200/4588595) loss:2.229 lr:0.0000100 epoch_Time:23364.0min: [2024-01-06 19:12:27,496][model8_pretrain.py][INFO] Epoch:[0/2](919200/4588595) loss:2.906 lr:0.0000100 epoch_Time:23364.0min: [2024-01-06 19:12:27,496][model8_pretrain.py][INFO] Epoch:[0/2](919200/4588595) loss:3.011 lr:0.0000100 epoch_Time:23364.0min: [2024-01-06 19:12:27,496][model8_pretrain.py][INFO] Epoch:[0/2](919200/4588595) loss:2.438 lr:0.0000100 epoch_Time:23364.0min: [2024-01-06 19:12:27,496][model8_pretrain.py][INFO] Epoch:[0/2](919200/4588595) loss:2.068 lr:0.0000100 epoch_Time:23364.0min: [2024-01-06 19:12:27,496][model8_pretrain.py][INFO] 
Epoch:[0/2](919200/4588595) loss:2.497 lr:0.0000100 epoch_Time:23364.0min: [2024-01-06 19:12:27,496][model8_pretrain.py][INFO] Epoch:[0/2](919200/4588595) loss:2.939 lr:0.0000100 epoch_Time:23364.0min: [2024-01-06 19:13:07,666][model8_pretrain.py][INFO] Epoch:[0/2](919300/4588595) loss:2.407 lr:0.0000100 epoch_Time:23363.0min: [2024-01-06 19:13:07,666][model8_pretrain.py][INFO] Epoch:[0/2](919300/4588595) loss:2.501 lr:0.0000100 epoch_Time:23363.0min: [2024-01-06 19:13:07,666][model8_pretrain.py][INFO] Epoch:[0/2](919300/4588595) loss:2.661 lr:0.0000100 epoch_Time:23363.0min: [2024-01-06 19:13:07,666][model8_pretrain.py][INFO] Epoch:[0/2](919300/4588595) loss:2.754 lr:0.0000100 epoch_Time:23363.0min: [2024-01-06 19:13:07,666][model8_pretrain.py][INFO] Epoch:[0/2](919300/4588595) loss:3.280 lr:0.0000100 epoch_Time:23363.0min: [2024-01-06 19:13:07,666][model8_pretrain.py][INFO] Epoch:[0/2](919300/4588595) loss:3.073 lr:0.0000100 epoch_Time:23363.0min: [2024-01-06 19:13:07,666][model8_pretrain.py][INFO] Epoch:[0/2](919300/4588595) loss:2.931 lr:0.0000100 epoch_Time:23363.0min: [2024-01-06 19:13:07,666][model8_pretrain.py][INFO] Epoch:[0/2](919300/4588595) loss:2.725 lr:0.0000100 epoch_Time:23363.0min: [2024-01-06 19:13:54,452][model8_pretrain.py][INFO] Epoch:[0/2](919400/4588595) loss:2.920 lr:0.0000100 epoch_Time:23363.0min: [2024-01-06 19:13:54,452][model8_pretrain.py][INFO] Epoch:[0/2](919400/4588595) loss:2.995 lr:0.0000100 epoch_Time:23363.0min: [2024-01-06 19:13:54,452][model8_pretrain.py][INFO] Epoch:[0/2](919400/4588595) loss:2.696 lr:0.0000100 epoch_Time:23363.0min: [2024-01-06 19:13:54,452][model8_pretrain.py][INFO] Epoch:[0/2](919400/4588595) loss:2.739 lr:0.0000100 epoch_Time:23363.0min: [2024-01-06 19:13:54,452][model8_pretrain.py][INFO] Epoch:[0/2](919400/4588595) loss:2.487 lr:0.0000100 epoch_Time:23363.0min: [2024-01-06 19:13:54,452][model8_pretrain.py][INFO] Epoch:[0/2](919400/4588595) loss:2.642 lr:0.0000100 epoch_Time:23363.0min: [2024-01-06 
19:13:54,452][model8_pretrain.py][INFO] Epoch:[0/2](919400/4588595) loss:2.740 lr:0.0000100 epoch_Time:23363.0min: [2024-01-06 19:13:54,452][model8_pretrain.py][INFO] Epoch:[0/2](919400/4588595) loss:2.727 lr:0.0000100 epoch_Time:23363.0min: [2024-01-06 19:14:31,408][model8_pretrain.py][INFO] Epoch:[0/2](919500/4588595) loss:2.047 lr:0.0000100 epoch_Time:23362.0min: [2024-01-06 19:14:31,408][model8_pretrain.py][INFO] Epoch:[0/2](919500/4588595) loss:3.058 lr:0.0000100 epoch_Time:23362.0min: [2024-01-06 19:14:31,408][model8_pretrain.py][INFO] Epoch:[0/2](919500/4588595) loss:2.244 lr:0.0000100 epoch_Time:23362.0min: [2024-01-06 19:14:31,408][model8_pretrain.py][INFO] Epoch:[0/2](919500/4588595) loss:2.656 lr:0.0000100 epoch_Time:23362.0min: [2024-01-06 19:14:31,408][model8_pretrain.py][INFO] Epoch:[0/2](919500/4588595) loss:3.241 lr:0.0000100 epoch_Time:23362.0min: [2024-01-06 19:14:31,408][model8_pretrain.py][INFO] Epoch:[0/2](919500/4588595) loss:3.064 lr:0.0000100 epoch_Time:23362.0min: [2024-01-06 19:14:31,408][model8_pretrain.py][INFO] Epoch:[0/2](919500/4588595) loss:2.944 lr:0.0000100 epoch_Time:23362.0min: [2024-01-06 19:14:31,408][model8_pretrain.py][INFO] Epoch:[0/2](919500/4588595) loss:2.405 lr:0.0000100 epoch_Time:23362.0min: [2024-01-06 19:15:08,320][model8_pretrain.py][INFO] Epoch:[0/2](919600/4588595) loss:3.208 lr:0.0000100 epoch_Time:23361.0min: [2024-01-06 19:15:08,320][model8_pretrain.py][INFO] Epoch:[0/2](919600/4588595) loss:3.283 lr:0.0000100 epoch_Time:23361.0min: [2024-01-06 19:15:08,320][model8_pretrain.py][INFO] Epoch:[0/2](919600/4588595) loss:2.369 lr:0.0000100 epoch_Time:23361.0min: [2024-01-06 19:15:08,320][model8_pretrain.py][INFO] Epoch:[0/2](919600/4588595) loss:3.326 lr:0.0000100 epoch_Time:23361.0min: [2024-01-06 19:15:08,320][model8_pretrain.py][INFO] Epoch:[0/2](919600/4588595) loss:3.125 lr:0.0000100 epoch_Time:23361.0min: [2024-01-06 19:15:08,320][model8_pretrain.py][INFO] Epoch:[0/2](919600/4588595) loss:3.182 lr:0.0000100 
epoch_Time:23361.0min: [2024-01-06 19:15:08,320][model8_pretrain.py][INFO] Epoch:[0/2](919600/4588595) loss:3.335 lr:0.0000100 epoch_Time:23361.0min: [2024-01-06 19:15:08,320][model8_pretrain.py][INFO] Epoch:[0/2](919600/4588595) loss:2.600 lr:0.0000100 epoch_Time:23361.0min: [2024-01-06 19:15:45,227][model8_pretrain.py][INFO] Epoch:[0/2](919700/4588595) loss:2.941 lr:0.0000100 epoch_Time:23361.0min: [2024-01-06 19:15:45,227][model8_pretrain.py][INFO] Epoch:[0/2](919700/4588595) loss:2.719 lr:0.0000100 epoch_Time:23361.0min: [2024-01-06 19:15:45,228][model8_pretrain.py][INFO] Epoch:[0/2](919700/4588595) loss:2.979 lr:0.0000100 epoch_Time:23361.0min: [2024-01-06 19:15:45,228][model8_pretrain.py][INFO] Epoch:[0/2](919700/4588595) loss:2.422 lr:0.0000100 epoch_Time:23361.0min: [2024-01-06 19:15:45,228][model8_pretrain.py][INFO] Epoch:[0/2](919700/4588595) loss:2.878 lr:0.0000100 epoch_Time:23361.0min: [2024-01-06 19:15:45,228][model8_pretrain.py][INFO] Epoch:[0/2](919700/4588595) loss:2.789 lr:0.0000100 epoch_Time:23361.0min: [2024-01-06 19:15:45,228][model8_pretrain.py][INFO] Epoch:[0/2](919700/4588595) loss:3.144 lr:0.0000100 epoch_Time:23361.0min: [2024-01-06 19:15:45,228][model8_pretrain.py][INFO] Epoch:[0/2](919700/4588595) loss:2.943 lr:0.0000100 epoch_Time:23361.0min: [2024-01-06 19:16:22,172][model8_pretrain.py][INFO] Epoch:[0/2](919800/4588595) loss:3.217 lr:0.0000100 epoch_Time:23360.0min: [2024-01-06 19:16:22,172][model8_pretrain.py][INFO] Epoch:[0/2](919800/4588595) loss:2.549 lr:0.0000100 epoch_Time:23360.0min: [2024-01-06 19:16:22,172][model8_pretrain.py][INFO] Epoch:[0/2](919800/4588595) loss:2.745 lr:0.0000100 epoch_Time:23360.0min: [2024-01-06 19:16:22,172][model8_pretrain.py][INFO] Epoch:[0/2](919800/4588595) loss:2.674 lr:0.0000100 epoch_Time:23360.0min: [2024-01-06 19:16:22,172][model8_pretrain.py][INFO] Epoch:[0/2](919800/4588595) loss:2.584 lr:0.0000100 epoch_Time:23360.0min: [2024-01-06 19:16:22,172][model8_pretrain.py][INFO] 
Epoch:[0/2](919800/4588595) loss:2.768 lr:0.0000100 epoch_Time:23360.0min: [2024-01-06 19:16:22,172][model8_pretrain.py][INFO] Epoch:[0/2](919800/4588595) loss:3.113 lr:0.0000100 epoch_Time:23360.0min: [2024-01-06 19:16:22,172][model8_pretrain.py][INFO] Epoch:[0/2](919800/4588595) loss:2.495 lr:0.0000100 epoch_Time:23360.0min: [2024-01-06 19:16:59,117][model8_pretrain.py][INFO] Epoch:[0/2](919900/4588595) loss:2.370 lr:0.0000100 epoch_Time:23359.0min: [2024-01-06 19:16:59,117][model8_pretrain.py][INFO] Epoch:[0/2](919900/4588595) loss:2.758 lr:0.0000100 epoch_Time:23359.0min: [2024-01-06 19:16:59,117][model8_pretrain.py][INFO] Epoch:[0/2](919900/4588595) loss:3.219 lr:0.0000100 epoch_Time:23359.0min: [2024-01-06 19:16:59,117][model8_pretrain.py][INFO] Epoch:[0/2](919900/4588595) loss:2.200 lr:0.0000100 epoch_Time:23359.0min: [2024-01-06 19:16:59,117][model8_pretrain.py][INFO] Epoch:[0/2](919900/4588595) loss:2.565 lr:0.0000100 epoch_Time:23359.0min: [2024-01-06 19:16:59,117][model8_pretrain.py][INFO] Epoch:[0/2](919900/4588595) loss:3.093 lr:0.0000100 epoch_Time:23359.0min: [2024-01-06 19:16:59,117][model8_pretrain.py][INFO] Epoch:[0/2](919900/4588595) loss:2.466 lr:0.0000100 epoch_Time:23359.0min: [2024-01-06 19:16:59,118][model8_pretrain.py][INFO] Epoch:[0/2](919900/4588595) loss:2.498 lr:0.0000100 epoch_Time:23359.0min: [2024-01-06 19:17:36,055][model8_pretrain.py][INFO] Epoch:[0/2](920000/4588595) loss:2.818 lr:0.0000100 epoch_Time:23359.0min: [2024-01-06 19:17:36,055][model8_pretrain.py][INFO] Epoch:[0/2](920000/4588595) loss:2.485 lr:0.0000100 epoch_Time:23359.0min: [2024-01-06 19:17:36,055][model8_pretrain.py][INFO] Epoch:[0/2](920000/4588595) loss:2.742 lr:0.0000100 epoch_Time:23359.0min: [2024-01-06 19:17:36,055][model8_pretrain.py][INFO] Epoch:[0/2](920000/4588595) loss:2.913 lr:0.0000100 epoch_Time:23359.0min: [2024-01-06 19:17:36,056][model8_pretrain.py][INFO] Epoch:[0/2](920000/4588595) loss:3.154 lr:0.0000100 epoch_Time:23359.0min: [2024-01-06 
19:17:36,056][model8_pretrain.py][INFO] Epoch:[0/2](920000/4588595) loss:3.101 lr:0.0000100 epoch_Time:23359.0min: [2024-01-06 19:17:36,056][model8_pretrain.py][INFO] Epoch:[0/2](920000/4588595) loss:3.196 lr:0.0000100 epoch_Time:23359.0min: [2024-01-06 19:17:36,056][model8_pretrain.py][INFO] Epoch:[0/2](920000/4588595) loss:2.583 lr:0.0000100 epoch_Time:23359.0min: [2024-01-06 19:18:14,499][model8_pretrain.py][INFO] Epoch:[0/2](920100/4588595) loss:2.674 lr:0.0000100 epoch_Time:23358.0min: [2024-01-06 19:18:14,499][model8_pretrain.py][INFO] Epoch:[0/2](920100/4588595) loss:3.196 lr:0.0000100 epoch_Time:23358.0min: [2024-01-06 19:18:14,499][model8_pretrain.py][INFO] Epoch:[0/2](920100/4588595) loss:2.706 lr:0.0000100 epoch_Time:23358.0min: [2024-01-06 19:18:14,499][model8_pretrain.py][INFO] Epoch:[0/2](920100/4588595) loss:3.110 lr:0.0000100 epoch_Time:23358.0min: [2024-01-06 19:18:14,499][model8_pretrain.py][INFO] Epoch:[0/2](920100/4588595) loss:2.980 lr:0.0000100 epoch_Time:23358.0min: [2024-01-06 19:18:14,499][model8_pretrain.py][INFO] Epoch:[0/2](920100/4588595) loss:3.015 lr:0.0000100 epoch_Time:23358.0min: [2024-01-06 19:18:14,500][model8_pretrain.py][INFO] Epoch:[0/2](920100/4588595) loss:3.121 lr:0.0000100 epoch_Time:23358.0min: [2024-01-06 19:18:14,500][model8_pretrain.py][INFO] Epoch:[0/2](920100/4588595) loss:2.882 lr:0.0000100 epoch_Time:23358.0min: [2024-01-06 19:19:03,072][model8_pretrain.py][INFO] Epoch:[0/2](920200/4588595) loss:2.960 lr:0.0000100 epoch_Time:23358.0min: [2024-01-06 19:19:03,072][model8_pretrain.py][INFO] Epoch:[0/2](920200/4588595) loss:2.970 lr:0.0000100 epoch_Time:23358.0min: [2024-01-06 19:19:03,072][model8_pretrain.py][INFO] Epoch:[0/2](920200/4588595) loss:2.896 lr:0.0000100 epoch_Time:23358.0min: [2024-01-06 19:19:03,072][model8_pretrain.py][INFO] Epoch:[0/2](920200/4588595) loss:3.455 lr:0.0000100 epoch_Time:23358.0min: [2024-01-06 19:19:03,072][model8_pretrain.py][INFO] Epoch:[0/2](920200/4588595) loss:3.065 lr:0.0000100 
epoch_Time:23358.0min: [2024-01-06 19:19:03,072][model8_pretrain.py][INFO] Epoch:[0/2](920200/4588595) loss:2.713 lr:0.0000100 epoch_Time:23358.0min: [2024-01-06 19:19:03,072][model8_pretrain.py][INFO] Epoch:[0/2](920200/4588595) loss:2.835 lr:0.0000100 epoch_Time:23358.0min: [2024-01-06 19:19:03,072][model8_pretrain.py][INFO] Epoch:[0/2](920200/4588595) loss:3.360 lr:0.0000100 epoch_Time:23358.0min: [2024-01-06 19:19:40,013][model8_pretrain.py][INFO] Epoch:[0/2](920300/4588595) loss:2.902 lr:0.0000100 epoch_Time:23358.0min: [2024-01-06 19:19:40,013][model8_pretrain.py][INFO] Epoch:[0/2](920300/4588595) loss:2.618 lr:0.0000100 epoch_Time:23358.0min: [2024-01-06 19:19:40,013][model8_pretrain.py][INFO] Epoch:[0/2](920300/4588595) loss:2.421 lr:0.0000100 epoch_Time:23358.0min: [2024-01-06 19:19:40,013][model8_pretrain.py][INFO] Epoch:[0/2](920300/4588595) loss:2.996 lr:0.0000100 epoch_Time:23358.0min: [2024-01-06 19:19:40,013][model8_pretrain.py][INFO] Epoch:[0/2](920300/4588595) loss:2.464 lr:0.0000100 epoch_Time:23358.0min: [2024-01-06 19:19:40,013][model8_pretrain.py][INFO] Epoch:[0/2](920300/4588595) loss:2.340 lr:0.0000100 epoch_Time:23358.0min: [2024-01-06 19:19:40,013][model8_pretrain.py][INFO] Epoch:[0/2](920300/4588595) loss:3.164 lr:0.0000100 epoch_Time:23358.0min: [2024-01-06 19:19:40,013][model8_pretrain.py][INFO] Epoch:[0/2](920300/4588595) loss:2.418 lr:0.0000100 epoch_Time:23358.0min: [2024-01-06 19:20:16,978][model8_pretrain.py][INFO] Epoch:[0/2](920400/4588595) loss:2.628 lr:0.0000100 epoch_Time:23357.0min: [2024-01-06 19:20:16,978][model8_pretrain.py][INFO] Epoch:[0/2](920400/4588595) loss:2.896 lr:0.0000100 epoch_Time:23357.0min: [2024-01-06 19:20:16,978][model8_pretrain.py][INFO] Epoch:[0/2](920400/4588595) loss:2.439 lr:0.0000100 epoch_Time:23357.0min: [2024-01-06 19:20:16,978][model8_pretrain.py][INFO] Epoch:[0/2](920400/4588595) loss:2.564 lr:0.0000100 epoch_Time:23357.0min: [2024-01-06 19:20:16,978][model8_pretrain.py][INFO] 
Epoch:[0/2](920400/4588595) loss:3.034 lr:0.0000100 epoch_Time:23357.0min: [2024-01-06 19:20:16,978][model8_pretrain.py][INFO] Epoch:[0/2](920400/4588595) loss:2.561 lr:0.0000100 epoch_Time:23357.0min: [2024-01-06 19:20:16,978][model8_pretrain.py][INFO] Epoch:[0/2](920400/4588595) loss:3.155 lr:0.0000100 epoch_Time:23357.0min: [2024-01-06 19:20:16,978][model8_pretrain.py][INFO] Epoch:[0/2](920400/4588595) loss:2.378 lr:0.0000100 epoch_Time:23357.0min: [2024-01-06 19:20:53,912][model8_pretrain.py][INFO] Epoch:[0/2](920500/4588595) loss:2.427 lr:0.0000100 epoch_Time:23355.0min: [2024-01-06 19:20:53,912][model8_pretrain.py][INFO] Epoch:[0/2](920500/4588595) loss:2.658 lr:0.0000100 epoch_Time:23355.0min: [2024-01-06 19:20:53,912][model8_pretrain.py][INFO] Epoch:[0/2](920500/4588595) loss:2.469 lr:0.0000100 epoch_Time:23355.0min: [2024-01-06 19:20:53,912][model8_pretrain.py][INFO] Epoch:[0/2](920500/4588595) loss:2.725 lr:0.0000100 epoch_Time:23355.0min: [2024-01-06 19:20:53,912][model8_pretrain.py][INFO] Epoch:[0/2](920500/4588595) loss:2.961 lr:0.0000100 epoch_Time:23355.0min: [2024-01-06 19:20:53,912][model8_pretrain.py][INFO] Epoch:[0/2](920500/4588595) loss:3.020 lr:0.0000100 epoch_Time:23355.0min: [2024-01-06 19:20:53,912][model8_pretrain.py][INFO] Epoch:[0/2](920500/4588595) loss:2.912 lr:0.0000100 epoch_Time:23355.0min: [2024-01-06 19:20:53,912][model8_pretrain.py][INFO] Epoch:[0/2](920500/4588595) loss:3.064 lr:0.0000100 epoch_Time:23355.0min: [2024-01-06 19:21:30,849][model8_pretrain.py][INFO] Epoch:[0/2](920600/4588595) loss:2.663 lr:0.0000100 epoch_Time:23355.0min: [2024-01-06 19:21:30,849][model8_pretrain.py][INFO] Epoch:[0/2](920600/4588595) loss:3.057 lr:0.0000100 epoch_Time:23355.0min: [2024-01-06 19:21:30,849][model8_pretrain.py][INFO] Epoch:[0/2](920600/4588595) loss:2.825 lr:0.0000100 epoch_Time:23355.0min: [2024-01-06 19:21:30,849][model8_pretrain.py][INFO] Epoch:[0/2](920600/4588595) loss:3.097 lr:0.0000100 epoch_Time:23355.0min: [2024-01-06 
19:21:30,849][model8_pretrain.py][INFO] Epoch:[0/2](920600/4588595) loss:2.313 lr:0.0000100 epoch_Time:23355.0min: [2024-01-06 19:21:30,849][model8_pretrain.py][INFO] Epoch:[0/2](920600/4588595) loss:2.565 lr:0.0000100 epoch_Time:23355.0min: [2024-01-06 19:21:30,850][model8_pretrain.py][INFO] Epoch:[0/2](920600/4588595) loss:2.932 lr:0.0000100 epoch_Time:23355.0min: [2024-01-06 19:21:30,850][model8_pretrain.py][INFO] Epoch:[0/2](920600/4588595) loss:3.215 lr:0.0000100 epoch_Time:23355.0min: [2024-01-06 19:22:07,788][model8_pretrain.py][INFO] Epoch:[0/2](920700/4588595) loss:2.936 lr:0.0000100 epoch_Time:23354.0min: [2024-01-06 19:22:07,788][model8_pretrain.py][INFO] Epoch:[0/2](920700/4588595) loss:3.092 lr:0.0000100 epoch_Time:23354.0min: [2024-01-06 19:22:07,788][model8_pretrain.py][INFO] Epoch:[0/2](920700/4588595) loss:2.663 lr:0.0000100 epoch_Time:23354.0min: [2024-01-06 19:22:07,788][model8_pretrain.py][INFO] Epoch:[0/2](920700/4588595) loss:3.127 lr:0.0000100 epoch_Time:23354.0min: [2024-01-06 19:22:07,788][model8_pretrain.py][INFO] Epoch:[0/2](920700/4588595) loss:3.280 lr:0.0000100 epoch_Time:23354.0min: [2024-01-06 19:22:07,788][model8_pretrain.py][INFO] Epoch:[0/2](920700/4588595) loss:3.123 lr:0.0000100 epoch_Time:23354.0min: [2024-01-06 19:22:07,788][model8_pretrain.py][INFO] Epoch:[0/2](920700/4588595) loss:2.667 lr:0.0000100 epoch_Time:23354.0min: [2024-01-06 19:22:07,789][model8_pretrain.py][INFO] Epoch:[0/2](920700/4588595) loss:2.648 lr:0.0000100 epoch_Time:23354.0min: [2024-01-06 19:22:44,729][model8_pretrain.py][INFO] Epoch:[0/2](920800/4588595) loss:2.695 lr:0.0000100 epoch_Time:23354.0min: [2024-01-06 19:22:44,729][model8_pretrain.py][INFO] Epoch:[0/2](920800/4588595) loss:2.905 lr:0.0000100 epoch_Time:23354.0min: [2024-01-06 19:22:44,729][model8_pretrain.py][INFO] Epoch:[0/2](920800/4588595) loss:2.647 lr:0.0000100 epoch_Time:23354.0min: [2024-01-06 19:22:44,729][model8_pretrain.py][INFO] Epoch:[0/2](920800/4588595) loss:3.235 lr:0.0000100 
epoch_Time:23354.0min: [2024-01-06 19:22:44,729][model8_pretrain.py][INFO] Epoch:[0/2](920800/4588595) loss:2.523 lr:0.0000100 epoch_Time:23354.0min: [2024-01-06 19:22:44,729][model8_pretrain.py][INFO] Epoch:[0/2](920800/4588595) loss:2.796 lr:0.0000100 epoch_Time:23354.0min: [2024-01-06 19:22:44,729][model8_pretrain.py][INFO] Epoch:[0/2](920800/4588595) loss:2.633 lr:0.0000100 epoch_Time:23354.0min: [2024-01-06 19:22:44,729][model8_pretrain.py][INFO] Epoch:[0/2](920800/4588595) loss:2.817 lr:0.0000100 epoch_Time:23354.0min: [2024-01-06 19:23:23,211][model8_pretrain.py][INFO] Epoch:[0/2](920900/4588595) loss:3.009 lr:0.0000100 epoch_Time:23353.0min: [2024-01-06 19:23:23,211][model8_pretrain.py][INFO] Epoch:[0/2](920900/4588595) loss:2.865 lr:0.0000100 epoch_Time:23353.0min: [2024-01-06 19:23:23,211][model8_pretrain.py][INFO] Epoch:[0/2](920900/4588595) loss:2.743 lr:0.0000100 epoch_Time:23353.0min: [2024-01-06 19:23:23,211][model8_pretrain.py][INFO] Epoch:[0/2](920900/4588595) loss:2.755 lr:0.0000100 epoch_Time:23353.0min: [2024-01-06 19:23:23,211][model8_pretrain.py][INFO] Epoch:[0/2](920900/4588595) loss:2.457 lr:0.0000100 epoch_Time:23353.0min: [2024-01-06 19:23:23,211][model8_pretrain.py][INFO] Epoch:[0/2](920900/4588595) loss:2.512 lr:0.0000100 epoch_Time:23353.0min: [2024-01-06 19:23:23,212][model8_pretrain.py][INFO] Epoch:[0/2](920900/4588595) loss:2.960 lr:0.0000100 epoch_Time:23353.0min: [2024-01-06 19:23:23,212][model8_pretrain.py][INFO] Epoch:[0/2](920900/4588595) loss:2.751 lr:0.0000100 epoch_Time:23353.0min: [2024-01-06 19:24:12,357][model8_pretrain.py][INFO] Epoch:[0/2](921000/4588595) loss:2.845 lr:0.0000100 epoch_Time:23353.0min: [2024-01-06 19:24:12,357][model8_pretrain.py][INFO] Epoch:[0/2](921000/4588595) loss:3.059 lr:0.0000100 epoch_Time:23353.0min: [2024-01-06 19:24:12,357][model8_pretrain.py][INFO] Epoch:[0/2](921000/4588595) loss:3.137 lr:0.0000100 epoch_Time:23353.0min: [2024-01-06 19:24:12,357][model8_pretrain.py][INFO] 
Epoch:[0/2](921000/4588595) loss:2.951 lr:0.0000100 epoch_Time:23353.0min: [2024-01-06 19:24:12,357][model8_pretrain.py][INFO] Epoch:[0/2](921000/4588595) loss:3.119 lr:0.0000100 epoch_Time:23353.0min: [2024-01-06 19:24:12,357][model8_pretrain.py][INFO] Epoch:[0/2](921000/4588595) loss:3.291 lr:0.0000100 epoch_Time:23353.0min: [2024-01-06 19:24:12,357][model8_pretrain.py][INFO] Epoch:[0/2](921000/4588595) loss:2.274 lr:0.0000100 epoch_Time:23353.0min: [2024-01-06 19:24:12,357][model8_pretrain.py][INFO] Epoch:[0/2](921000/4588595) loss:2.714 lr:0.0000100 epoch_Time:23353.0min: [2024-01-06 19:24:49,300][model8_pretrain.py][INFO] Epoch:[0/2](921100/4588595) loss:2.807 lr:0.0000100 epoch_Time:23352.0min: [2024-01-06 19:24:49,300][model8_pretrain.py][INFO] Epoch:[0/2](921100/4588595) loss:3.122 lr:0.0000100 epoch_Time:23352.0min: [2024-01-06 19:24:49,300][model8_pretrain.py][INFO] Epoch:[0/2](921100/4588595) loss:2.897 lr:0.0000100 epoch_Time:23352.0min: [2024-01-06 19:24:49,300][model8_pretrain.py][INFO] Epoch:[0/2](921100/4588595) loss:2.378 lr:0.0000100 epoch_Time:23352.0min: [2024-01-06 19:24:49,300][model8_pretrain.py][INFO] Epoch:[0/2](921100/4588595) loss:2.523 lr:0.0000100 epoch_Time:23352.0min: [2024-01-06 19:24:49,300][model8_pretrain.py][INFO] Epoch:[0/2](921100/4588595) loss:2.675 lr:0.0000100 epoch_Time:23352.0min: [2024-01-06 19:24:49,300][model8_pretrain.py][INFO] Epoch:[0/2](921100/4588595) loss:2.913 lr:0.0000100 epoch_Time:23352.0min: [2024-01-06 19:24:49,300][model8_pretrain.py][INFO] Epoch:[0/2](921100/4588595) loss:2.632 lr:0.0000100 epoch_Time:23352.0min: [2024-01-06 19:25:26,248][model8_pretrain.py][INFO] Epoch:[0/2](921200/4588595) loss:2.497 lr:0.0000100 epoch_Time:23352.0min: [2024-01-06 19:25:26,248][model8_pretrain.py][INFO] Epoch:[0/2](921200/4588595) loss:2.936 lr:0.0000100 epoch_Time:23352.0min: [2024-01-06 19:25:26,248][model8_pretrain.py][INFO] Epoch:[0/2](921200/4588595) loss:2.092 lr:0.0000100 epoch_Time:23352.0min: [2024-01-06 
19:25:26,248][model8_pretrain.py][INFO] Epoch:[0/2](921200/4588595) loss:2.812 lr:0.0000100 epoch_Time:23352.0min: [2024-01-06 19:25:26,248][model8_pretrain.py][INFO] Epoch:[0/2](921200/4588595) loss:2.883 lr:0.0000100 epoch_Time:23352.0min: [2024-01-06 19:25:26,248][model8_pretrain.py][INFO] Epoch:[0/2](921200/4588595) loss:2.824 lr:0.0000100 epoch_Time:23352.0min: [2024-01-06 19:25:26,248][model8_pretrain.py][INFO] Epoch:[0/2](921200/4588595) loss:2.718 lr:0.0000100 epoch_Time:23352.0min: [2024-01-06 19:25:26,248][model8_pretrain.py][INFO] Epoch:[0/2](921200/4588595) loss:3.058 lr:0.0000100 epoch_Time:23352.0min: [2024-01-06 19:26:03,207][model8_pretrain.py][INFO] Epoch:[0/2](921300/4588595) loss:3.058 lr:0.0000100 epoch_Time:23351.0min: [2024-01-06 19:26:03,207][model8_pretrain.py][INFO] Epoch:[0/2](921300/4588595) loss:2.499 lr:0.0000100 epoch_Time:23351.0min: [2024-01-06 19:26:03,207][model8_pretrain.py][INFO] Epoch:[0/2](921300/4588595) loss:2.819 lr:0.0000100 epoch_Time:23351.0min: [2024-01-06 19:26:03,207][model8_pretrain.py][INFO] Epoch:[0/2](921300/4588595) loss:2.293 lr:0.0000100 epoch_Time:23351.0min: [2024-01-06 19:26:03,207][model8_pretrain.py][INFO] Epoch:[0/2](921300/4588595) loss:2.502 lr:0.0000100 epoch_Time:23351.0min: [2024-01-06 19:26:03,207][model8_pretrain.py][INFO] Epoch:[0/2](921300/4588595) loss:2.555 lr:0.0000100 epoch_Time:23351.0min: [2024-01-06 19:26:03,208][model8_pretrain.py][INFO] Epoch:[0/2](921300/4588595) loss:2.591 lr:0.0000100 epoch_Time:23351.0min: [2024-01-06 19:26:03,208][model8_pretrain.py][INFO] Epoch:[0/2](921300/4588595) loss:2.648 lr:0.0000100 epoch_Time:23351.0min: [2024-01-06 19:26:40,162][model8_pretrain.py][INFO] Epoch:[0/2](921400/4588595) loss:3.050 lr:0.0000100 epoch_Time:23351.0min: [2024-01-06 19:26:40,162][model8_pretrain.py][INFO] Epoch:[0/2](921400/4588595) loss:1.905 lr:0.0000100 epoch_Time:23351.0min: [2024-01-06 19:26:40,162][model8_pretrain.py][INFO] Epoch:[0/2](921400/4588595) loss:2.962 lr:0.0000100 
epoch_Time:23351.0min: [2024-01-06 19:26:40,162][model8_pretrain.py][INFO] Epoch:[0/2](921400/4588595) loss:1.925 lr:0.0000100 epoch_Time:23351.0min: [2024-01-06 19:26:40,162][model8_pretrain.py][INFO] Epoch:[0/2](921400/4588595) loss:3.083 lr:0.0000100 epoch_Time:23351.0min: [2024-01-06 19:26:40,162][model8_pretrain.py][INFO] Epoch:[0/2](921400/4588595) loss:2.325 lr:0.0000100 epoch_Time:23351.0min: [2024-01-06 19:26:40,162][model8_pretrain.py][INFO] Epoch:[0/2](921400/4588595) loss:3.591 lr:0.0000100 epoch_Time:23351.0min: [2024-01-06 19:26:40,162][model8_pretrain.py][INFO] Epoch:[0/2](921400/4588595) loss:2.581 lr:0.0000100 epoch_Time:23351.0min: [2024-01-06 19:27:17,099][model8_pretrain.py][INFO] Epoch:[0/2](921500/4588595) loss:3.008 lr:0.0000100 epoch_Time:23350.0min: [2024-01-06 19:27:17,099][model8_pretrain.py][INFO] Epoch:[0/2](921500/4588595) loss:3.291 lr:0.0000100 epoch_Time:23350.0min: [2024-01-06 19:27:17,099][model8_pretrain.py][INFO] Epoch:[0/2](921500/4588595) loss:2.611 lr:0.0000100 epoch_Time:23350.0min: [2024-01-06 19:27:17,099][model8_pretrain.py][INFO] Epoch:[0/2](921500/4588595) loss:3.032 lr:0.0000100 epoch_Time:23350.0min: [2024-01-06 19:27:17,099][model8_pretrain.py][INFO] Epoch:[0/2](921500/4588595) loss:2.674 lr:0.0000100 epoch_Time:23350.0min: [2024-01-06 19:27:17,099][model8_pretrain.py][INFO] Epoch:[0/2](921500/4588595) loss:2.958 lr:0.0000100 epoch_Time:23350.0min: [2024-01-06 19:27:17,099][model8_pretrain.py][INFO] Epoch:[0/2](921500/4588595) loss:3.135 lr:0.0000100 epoch_Time:23350.0min: [2024-01-06 19:27:17,099][model8_pretrain.py][INFO] Epoch:[0/2](921500/4588595) loss:3.130 lr:0.0000100 epoch_Time:23350.0min: [2024-01-06 19:27:54,046][model8_pretrain.py][INFO] Epoch:[0/2](921600/4588595) loss:2.809 lr:0.0000100 epoch_Time:23348.0min: [2024-01-06 19:27:54,046][model8_pretrain.py][INFO] Epoch:[0/2](921600/4588595) loss:3.047 lr:0.0000100 epoch_Time:23348.0min: [2024-01-06 19:27:54,046][model8_pretrain.py][INFO] 
Epoch:[0/2](921600/4588595) loss:2.915 lr:0.0000100 epoch_Time:23348.0min: [2024-01-06 19:27:54,046][model8_pretrain.py][INFO] Epoch:[0/2](921600/4588595) loss:2.980 lr:0.0000100 epoch_Time:23348.0min: [2024-01-06 19:27:54,046][model8_pretrain.py][INFO] Epoch:[0/2](921600/4588595) loss:2.513 lr:0.0000100 epoch_Time:23348.0min: [2024-01-06 19:27:54,046][model8_pretrain.py][INFO] Epoch:[0/2](921600/4588595) loss:2.900 lr:0.0000100 epoch_Time:23348.0min: [2024-01-06 19:27:54,046][model8_pretrain.py][INFO] Epoch:[0/2](921600/4588595) loss:2.570 lr:0.0000100 epoch_Time:23348.0min: [2024-01-06 19:27:54,046][model8_pretrain.py][INFO] Epoch:[0/2](921600/4588595) loss:2.870 lr:0.0000100 epoch_Time:23348.0min: [2024-01-06 19:28:32,717][model8_pretrain.py][INFO] Epoch:[0/2](921700/4588595) loss:2.721 lr:0.0000100 epoch_Time:23348.0min: [2024-01-06 19:28:32,717][model8_pretrain.py][INFO] Epoch:[0/2](921700/4588595) loss:2.795 lr:0.0000100 epoch_Time:23348.0min: [2024-01-06 19:28:32,717][model8_pretrain.py][INFO] Epoch:[0/2](921700/4588595) loss:2.432 lr:0.0000100 epoch_Time:23348.0min: [2024-01-06 19:28:32,717][model8_pretrain.py][INFO] Epoch:[0/2](921700/4588595) loss:3.144 lr:0.0000100 epoch_Time:23348.0min: [2024-01-06 19:28:32,718][model8_pretrain.py][INFO] Epoch:[0/2](921700/4588595) loss:2.421 lr:0.0000100 epoch_Time:23348.0min: [2024-01-06 19:28:32,718][model8_pretrain.py][INFO] Epoch:[0/2](921700/4588595) loss:2.517 lr:0.0000100 epoch_Time:23348.0min: [2024-01-06 19:28:32,718][model8_pretrain.py][INFO] Epoch:[0/2](921700/4588595) loss:2.702 lr:0.0000100 epoch_Time:23348.0min: [2024-01-06 19:28:32,718][model8_pretrain.py][INFO] Epoch:[0/2](921700/4588595) loss:2.505 lr:0.0000100 epoch_Time:23348.0min: [2024-01-06 19:29:21,644][model8_pretrain.py][INFO] Epoch:[0/2](921800/4588595) loss:2.986 lr:0.0000100 epoch_Time:23348.0min: [2024-01-06 19:29:21,644][model8_pretrain.py][INFO] Epoch:[0/2](921800/4588595) loss:2.932 lr:0.0000100 epoch_Time:23348.0min: [2024-01-06 
19:29:21,644][model8_pretrain.py][INFO] Epoch:[0/2](921800/4588595) loss:3.002 lr:0.0000100 epoch_Time:23348.0min: [2024-01-06 19:29:21,644][model8_pretrain.py][INFO] Epoch:[0/2](921800/4588595) loss:3.090 lr:0.0000100 epoch_Time:23348.0min: [2024-01-06 19:29:21,645][model8_pretrain.py][INFO] Epoch:[0/2](921800/4588595) loss:2.765 lr:0.0000100 epoch_Time:23348.0min: [2024-01-06 19:29:21,645][model8_pretrain.py][INFO] Epoch:[0/2](921800/4588595) loss:2.577 lr:0.0000100 epoch_Time:23348.0min: [2024-01-06 19:29:21,645][model8_pretrain.py][INFO] Epoch:[0/2](921800/4588595) loss:3.273 lr:0.0000100 epoch_Time:23348.0min: [2024-01-06 19:29:21,645][model8_pretrain.py][INFO] Epoch:[0/2](921800/4588595) loss:2.890 lr:0.0000100 epoch_Time:23348.0min: [2024-01-06 19:29:58,578][model8_pretrain.py][INFO] Epoch:[0/2](921900/4588595) loss:2.722 lr:0.0000100 epoch_Time:23347.0min: [2024-01-06 19:29:58,578][model8_pretrain.py][INFO] Epoch:[0/2](921900/4588595) loss:2.997 lr:0.0000100 epoch_Time:23347.0min: [2024-01-06 19:29:58,578][model8_pretrain.py][INFO] Epoch:[0/2](921900/4588595) loss:2.210 lr:0.0000100 epoch_Time:23347.0min: [2024-01-06 19:29:58,578][model8_pretrain.py][INFO] Epoch:[0/2](921900/4588595) loss:2.719 lr:0.0000100 epoch_Time:23347.0min: [2024-01-06 19:29:58,578][model8_pretrain.py][INFO] Epoch:[0/2](921900/4588595) loss:3.062 lr:0.0000100 epoch_Time:23347.0min: [2024-01-06 19:29:58,578][model8_pretrain.py][INFO] Epoch:[0/2](921900/4588595) loss:2.692 lr:0.0000100 epoch_Time:23347.0min: [2024-01-06 19:29:58,578][model8_pretrain.py][INFO] Epoch:[0/2](921900/4588595) loss:3.245 lr:0.0000100 epoch_Time:23347.0min: [2024-01-06 19:29:58,578][model8_pretrain.py][INFO] Epoch:[0/2](921900/4588595) loss:2.681 lr:0.0000100 epoch_Time:23347.0min: [2024-01-06 19:30:35,589][model8_pretrain.py][INFO] Epoch:[0/2](922000/4588595) loss:3.023 lr:0.0000100 epoch_Time:23347.0min: [2024-01-06 19:30:35,589][model8_pretrain.py][INFO] Epoch:[0/2](922000/4588595) loss:2.415 lr:0.0000100 
epoch_Time:23347.0min: [2024-01-06 19:30:35,589][model8_pretrain.py][INFO] Epoch:[0/2](922000/4588595) loss:2.819 lr:0.0000100 epoch_Time:23347.0min: [2024-01-06 19:30:35,589][model8_pretrain.py][INFO] Epoch:[0/2](922000/4588595) loss:2.965 lr:0.0000100 epoch_Time:23347.0min: [2024-01-06 19:30:35,589][model8_pretrain.py][INFO] Epoch:[0/2](922000/4588595) loss:2.578 lr:0.0000100 epoch_Time:23347.0min: [2024-01-06 19:30:35,589][model8_pretrain.py][INFO] Epoch:[0/2](922000/4588595) loss:3.114 lr:0.0000100 epoch_Time:23347.0min: [2024-01-06 19:30:35,589][model8_pretrain.py][INFO] Epoch:[0/2](922000/4588595) loss:2.854 lr:0.0000100 epoch_Time:23347.0min: [2024-01-06 19:30:35,589][model8_pretrain.py][INFO] Epoch:[0/2](922000/4588595) loss:2.781 lr:0.0000100 epoch_Time:23347.0min: [2024-01-06 19:31:12,554][model8_pretrain.py][INFO] Epoch:[0/2](922100/4588595) loss:2.681 lr:0.0000100 epoch_Time:23346.0min: [2024-01-06 19:31:12,554][model8_pretrain.py][INFO] Epoch:[0/2](922100/4588595) loss:3.104 lr:0.0000100 epoch_Time:23346.0min: [2024-01-06 19:31:12,554][model8_pretrain.py][INFO] Epoch:[0/2](922100/4588595) loss:2.318 lr:0.0000100 epoch_Time:23346.0min: [2024-01-06 19:31:12,554][model8_pretrain.py][INFO] Epoch:[0/2](922100/4588595) loss:3.210 lr:0.0000100 epoch_Time:23346.0min: [2024-01-06 19:31:12,554][model8_pretrain.py][INFO] Epoch:[0/2](922100/4588595) loss:2.493 lr:0.0000100 epoch_Time:23346.0min: [2024-01-06 19:31:12,554][model8_pretrain.py][INFO] Epoch:[0/2](922100/4588595) loss:2.472 lr:0.0000100 epoch_Time:23346.0min: [2024-01-06 19:31:12,554][model8_pretrain.py][INFO] Epoch:[0/2](922100/4588595) loss:3.150 lr:0.0000100 epoch_Time:23346.0min: [2024-01-06 19:31:12,554][model8_pretrain.py][INFO] Epoch:[0/2](922100/4588595) loss:2.307 lr:0.0000100 epoch_Time:23346.0min: [2024-01-06 19:31:49,505][model8_pretrain.py][INFO] Epoch:[0/2](922200/4588595) loss:2.269 lr:0.0000100 epoch_Time:23345.0min: [2024-01-06 19:31:49,506][model8_pretrain.py][INFO] 
Epoch:[0/2](922200/4588595) loss:3.347 lr:0.0000100 epoch_Time:23345.0min: [2024-01-06 19:31:49,506][model8_pretrain.py][INFO] Epoch:[0/2](922200/4588595) loss:2.979 lr:0.0000100 epoch_Time:23345.0min: [2024-01-06 19:31:49,506][model8_pretrain.py][INFO] Epoch:[0/2](922200/4588595) loss:3.339 lr:0.0000100 epoch_Time:23345.0min: [2024-01-06 19:31:49,506][model8_pretrain.py][INFO] Epoch:[0/2](922200/4588595) loss:2.689 lr:0.0000100 epoch_Time:23345.0min: [2024-01-06 19:31:49,506][model8_pretrain.py][INFO] Epoch:[0/2](922200/4588595) loss:2.435 lr:0.0000100 epoch_Time:23345.0min: [2024-01-06 19:31:49,506][model8_pretrain.py][INFO] Epoch:[0/2](922200/4588595) loss:2.860 lr:0.0000100 epoch_Time:23345.0min: [2024-01-06 19:31:49,506][model8_pretrain.py][INFO] Epoch:[0/2](922200/4588595) loss:2.893 lr:0.0000100 epoch_Time:23345.0min: [2024-01-06 19:32:26,446][model8_pretrain.py][INFO] Epoch:[0/2](922300/4588595) loss:3.468 lr:0.0000100 epoch_Time:23345.0min: [2024-01-06 19:32:26,446][model8_pretrain.py][INFO] Epoch:[0/2](922300/4588595) loss:2.749 lr:0.0000100 epoch_Time:23345.0min: [2024-01-06 19:32:26,446][model8_pretrain.py][INFO] Epoch:[0/2](922300/4588595) loss:2.776 lr:0.0000100 epoch_Time:23345.0min: [2024-01-06 19:32:26,446][model8_pretrain.py][INFO] Epoch:[0/2](922300/4588595) loss:2.777 lr:0.0000100 epoch_Time:23345.0min: [2024-01-06 19:32:26,446][model8_pretrain.py][INFO] Epoch:[0/2](922300/4588595) loss:2.931 lr:0.0000100 epoch_Time:23345.0min: [2024-01-06 19:32:26,446][model8_pretrain.py][INFO] Epoch:[0/2](922300/4588595) loss:2.980 lr:0.0000100 epoch_Time:23345.0min: [2024-01-06 19:32:26,446][model8_pretrain.py][INFO] Epoch:[0/2](922300/4588595) loss:3.004 lr:0.0000100 epoch_Time:23345.0min: [2024-01-06 19:32:26,447][model8_pretrain.py][INFO] Epoch:[0/2](922300/4588595) loss:2.365 lr:0.0000100 epoch_Time:23345.0min: [2024-01-06 19:33:03,387][model8_pretrain.py][INFO] Epoch:[0/2](922400/4588595) loss:3.165 lr:0.0000100 epoch_Time:23344.0min: [2024-01-06 
19:33:03,387][model8_pretrain.py][INFO] Epoch:[0/2](922400/4588595) loss:2.839 lr:0.0000100 epoch_Time:23344.0min: [2024-01-06 19:33:03,388][model8_pretrain.py][INFO] Epoch:[0/2](922400/4588595) loss:3.079 lr:0.0000100 epoch_Time:23344.0min: [2024-01-06 19:33:03,387][model8_pretrain.py][INFO] Epoch:[0/2](922400/4588595) loss:3.079 lr:0.0000100 epoch_Time:23344.0min: [2024-01-06 19:33:03,388][model8_pretrain.py][INFO] Epoch:[0/2](922400/4588595) loss:2.286 lr:0.0000100 epoch_Time:23344.0min: [2024-01-06 19:33:03,388][model8_pretrain.py][INFO] Epoch:[0/2](922400/4588595) loss:2.665 lr:0.0000100 epoch_Time:23344.0min: [2024-01-06 19:33:03,388][model8_pretrain.py][INFO] Epoch:[0/2](922400/4588595) loss:2.561 lr:0.0000100 epoch_Time:23344.0min: [2024-01-06 19:33:03,388][model8_pretrain.py][INFO] Epoch:[0/2](922400/4588595) loss:2.571 lr:0.0000100 epoch_Time:23344.0min: [2024-01-06 19:33:42,015][model8_pretrain.py][INFO] Epoch:[0/2](922500/4588595) loss:3.230 lr:0.0000100 epoch_Time:23344.0min: [2024-01-06 19:33:42,015][model8_pretrain.py][INFO] Epoch:[0/2](922500/4588595) loss:2.811 lr:0.0000100 epoch_Time:23344.0min: [2024-01-06 19:33:42,015][model8_pretrain.py][INFO] Epoch:[0/2](922500/4588595) loss:2.928 lr:0.0000100 epoch_Time:23344.0min: [2024-01-06 19:33:42,015][model8_pretrain.py][INFO] Epoch:[0/2](922500/4588595) loss:3.035 lr:0.0000100 epoch_Time:23344.0min: [2024-01-06 19:33:42,015][model8_pretrain.py][INFO] Epoch:[0/2](922500/4588595) loss:2.667 lr:0.0000100 epoch_Time:23344.0min: [2024-01-06 19:33:42,015][model8_pretrain.py][INFO] Epoch:[0/2](922500/4588595) loss:3.553 lr:0.0000100 epoch_Time:23344.0min: [2024-01-06 19:33:42,015][model8_pretrain.py][INFO] Epoch:[0/2](922500/4588595) loss:2.671 lr:0.0000100 epoch_Time:23344.0min: [2024-01-06 19:33:42,015][model8_pretrain.py][INFO] Epoch:[0/2](922500/4588595) loss:2.492 lr:0.0000100 epoch_Time:23344.0min: [2024-01-06 19:34:30,865][model8_pretrain.py][INFO] Epoch:[0/2](922600/4588595) loss:2.504 lr:0.0000100 
epoch_Time:23344.0min: [2024-01-06 19:34:30,865][model8_pretrain.py][INFO] Epoch:[0/2](922600/4588595) loss:2.730 lr:0.0000100 epoch_Time:23344.0min: [2024-01-06 19:34:30,865][model8_pretrain.py][INFO] Epoch:[0/2](922600/4588595) loss:3.000 lr:0.0000100 epoch_Time:23344.0min: [2024-01-06 19:34:30,865][model8_pretrain.py][INFO] Epoch:[0/2](922600/4588595) loss:2.988 lr:0.0000100 epoch_Time:23344.0min: [2024-01-06 19:34:30,865][model8_pretrain.py][INFO] Epoch:[0/2](922600/4588595) loss:2.347 lr:0.0000100 epoch_Time:23344.0min: [2024-01-06 19:34:30,866][model8_pretrain.py][INFO] Epoch:[0/2](922600/4588595) loss:2.587 lr:0.0000100 epoch_Time:23344.0min: [2024-01-06 19:34:30,865][model8_pretrain.py][INFO] Epoch:[0/2](922600/4588595) loss:2.829 lr:0.0000100 epoch_Time:23344.0min: [2024-01-06 19:34:30,866][model8_pretrain.py][INFO] Epoch:[0/2](922600/4588595) loss:3.026 lr:0.0000100 epoch_Time:23344.0min: [2024-01-06 19:35:07,827][model8_pretrain.py][INFO] Epoch:[0/2](922700/4588595) loss:3.284 lr:0.0000100 epoch_Time:23343.0min: [2024-01-06 19:35:07,827][model8_pretrain.py][INFO] Epoch:[0/2](922700/4588595) loss:2.952 lr:0.0000100 epoch_Time:23343.0min: [2024-01-06 19:35:07,827][model8_pretrain.py][INFO] Epoch:[0/2](922700/4588595) loss:2.594 lr:0.0000100 epoch_Time:23343.0min: [2024-01-06 19:35:07,827][model8_pretrain.py][INFO] Epoch:[0/2](922700/4588595) loss:2.565 lr:0.0000100 epoch_Time:23343.0min: [2024-01-06 19:35:07,827][model8_pretrain.py][INFO] Epoch:[0/2](922700/4588595) loss:2.597 lr:0.0000100 epoch_Time:23343.0min: [2024-01-06 19:35:07,827][model8_pretrain.py][INFO] Epoch:[0/2](922700/4588595) loss:3.119 lr:0.0000100 epoch_Time:23343.0min: [2024-01-06 19:35:07,827][model8_pretrain.py][INFO] Epoch:[0/2](922700/4588595) loss:2.752 lr:0.0000100 epoch_Time:23343.0min: [2024-01-06 19:35:07,827][model8_pretrain.py][INFO] Epoch:[0/2](922700/4588595) loss:2.695 lr:0.0000100 epoch_Time:23343.0min: [2024-01-06 19:35:44,790][model8_pretrain.py][INFO] 
Epoch:[0/2](922800/4588595) loss:3.177 lr:0.0000100 epoch_Time:23342.0min: [2024-01-06 19:35:44,790][model8_pretrain.py][INFO] Epoch:[0/2](922800/4588595) loss:3.103 lr:0.0000100 epoch_Time:23342.0min: [2024-01-06 19:35:44,790][model8_pretrain.py][INFO] Epoch:[0/2](922800/4588595) loss:2.871 lr:0.0000100 epoch_Time:23342.0min: [2024-01-06 19:35:44,790][model8_pretrain.py][INFO] Epoch:[0/2](922800/4588595) loss:2.820 lr:0.0000100 epoch_Time:23342.0min: [2024-01-06 19:35:44,790][model8_pretrain.py][INFO] Epoch:[0/2](922800/4588595) loss:3.478 lr:0.0000100 epoch_Time:23342.0min: [2024-01-06 19:35:44,790][model8_pretrain.py][INFO] Epoch:[0/2](922800/4588595) loss:3.207 lr:0.0000100 epoch_Time:23342.0min: [2024-01-06 19:35:44,790][model8_pretrain.py][INFO] Epoch:[0/2](922800/4588595) loss:3.066 lr:0.0000100 epoch_Time:23342.0min: [2024-01-06 19:35:44,791][model8_pretrain.py][INFO] Epoch:[0/2](922800/4588595) loss:3.048 lr:0.0000100 epoch_Time:23342.0min: [2024-01-06 19:36:21,739][model8_pretrain.py][INFO] Epoch:[0/2](922900/4588595) loss:2.822 lr:0.0000100 epoch_Time:23341.0min: [2024-01-06 19:36:21,739][model8_pretrain.py][INFO] Epoch:[0/2](922900/4588595) loss:2.966 lr:0.0000100 epoch_Time:23341.0min: [2024-01-06 19:36:21,739][model8_pretrain.py][INFO] Epoch:[0/2](922900/4588595) loss:2.790 lr:0.0000100 epoch_Time:23341.0min: [2024-01-06 19:36:21,739][model8_pretrain.py][INFO] Epoch:[0/2](922900/4588595) loss:3.015 lr:0.0000100 epoch_Time:23341.0min: [2024-01-06 19:36:21,739][model8_pretrain.py][INFO] Epoch:[0/2](922900/4588595) loss:3.028 lr:0.0000100 epoch_Time:23341.0min: [2024-01-06 19:36:21,739][model8_pretrain.py][INFO] Epoch:[0/2](922900/4588595) loss:2.638 lr:0.0000100 epoch_Time:23341.0min: [2024-01-06 19:36:21,739][model8_pretrain.py][INFO] Epoch:[0/2](922900/4588595) loss:2.771 lr:0.0000100 epoch_Time:23341.0min: [2024-01-06 19:36:21,739][model8_pretrain.py][INFO] Epoch:[0/2](922900/4588595) loss:2.254 lr:0.0000100 epoch_Time:23341.0min: [2024-01-06 
19:36:58,712][model8_pretrain.py][INFO] Epoch:[0/2](923000/4588595) loss:2.205 lr:0.0000100 epoch_Time:23340.0min: [2024-01-06 19:36:58,712][model8_pretrain.py][INFO] Epoch:[0/2](923000/4588595) loss:3.112 lr:0.0000100 epoch_Time:23340.0min: [2024-01-06 19:36:58,712][model8_pretrain.py][INFO] Epoch:[0/2](923000/4588595) loss:3.366 lr:0.0000100 epoch_Time:23340.0min: [2024-01-06 19:36:58,712][model8_pretrain.py][INFO] Epoch:[0/2](923000/4588595) loss:2.369 lr:0.0000100 epoch_Time:23340.0min: [2024-01-06 19:36:58,712][model8_pretrain.py][INFO] Epoch:[0/2](923000/4588595) loss:2.833 lr:0.0000100 epoch_Time:23340.0min: [2024-01-06 19:36:58,712][model8_pretrain.py][INFO] Epoch:[0/2](923000/4588595) loss:1.838 lr:0.0000100 epoch_Time:23340.0min: [2024-01-06 19:36:58,712][model8_pretrain.py][INFO] Epoch:[0/2](923000/4588595) loss:2.707 lr:0.0000100 epoch_Time:23340.0min: [2024-01-06 19:36:58,712][model8_pretrain.py][INFO] Epoch:[0/2](923000/4588595) loss:3.089 lr:0.0000100 epoch_Time:23340.0min: [2024-01-06 19:37:35,670][model8_pretrain.py][INFO] Epoch:[0/2](923100/4588595) loss:2.933 lr:0.0000100 epoch_Time:23340.0min: [2024-01-06 19:37:35,670][model8_pretrain.py][INFO] Epoch:[0/2](923100/4588595) loss:2.752 lr:0.0000100 epoch_Time:23340.0min: [2024-01-06 19:37:35,670][model8_pretrain.py][INFO] Epoch:[0/2](923100/4588595) loss:2.459 lr:0.0000100 epoch_Time:23340.0min: [2024-01-06 19:37:35,670][model8_pretrain.py][INFO] Epoch:[0/2](923100/4588595) loss:3.014 lr:0.0000100 epoch_Time:23340.0min: [2024-01-06 19:37:35,670][model8_pretrain.py][INFO] Epoch:[0/2](923100/4588595) loss:2.219 lr:0.0000100 epoch_Time:23340.0min: [2024-01-06 19:37:35,670][model8_pretrain.py][INFO] Epoch:[0/2](923100/4588595) loss:2.471 lr:0.0000100 epoch_Time:23340.0min: [2024-01-06 19:37:35,670][model8_pretrain.py][INFO] Epoch:[0/2](923100/4588595) loss:3.159 lr:0.0000100 epoch_Time:23340.0min: [2024-01-06 19:37:35,670][model8_pretrain.py][INFO] Epoch:[0/2](923100/4588595) loss:2.730 lr:0.0000100 
epoch_Time:23340.0min: [2024-01-06 19:38:12,618][model8_pretrain.py][INFO] Epoch:[0/2](923200/4588595) loss:3.093 lr:0.0000100 epoch_Time:23339.0min: [2024-01-06 19:38:12,618][model8_pretrain.py][INFO] Epoch:[0/2](923200/4588595) loss:2.592 lr:0.0000100 epoch_Time:23339.0min: [2024-01-06 19:38:12,618][model8_pretrain.py][INFO] Epoch:[0/2](923200/4588595) loss:2.404 lr:0.0000100 epoch_Time:23339.0min: [2024-01-06 19:38:12,618][model8_pretrain.py][INFO] Epoch:[0/2](923200/4588595) loss:3.093 lr:0.0000100 epoch_Time:23339.0min: [2024-01-06 19:38:12,618][model8_pretrain.py][INFO] Epoch:[0/2](923200/4588595) loss:3.358 lr:0.0000100 epoch_Time:23339.0min: [2024-01-06 19:38:12,618][model8_pretrain.py][INFO] Epoch:[0/2](923200/4588595) loss:3.148 lr:0.0000100 epoch_Time:23339.0min: [2024-01-06 19:38:12,619][model8_pretrain.py][INFO] Epoch:[0/2](923200/4588595) loss:2.897 lr:0.0000100 epoch_Time:23339.0min: [2024-01-06 19:38:12,619][model8_pretrain.py][INFO] Epoch:[0/2](923200/4588595) loss:2.440 lr:0.0000100 epoch_Time:23339.0min: [2024-01-06 19:38:51,321][model8_pretrain.py][INFO] Epoch:[0/2](923300/4588595) loss:2.685 lr:0.0000100 epoch_Time:23338.0min: [2024-01-06 19:38:51,321][model8_pretrain.py][INFO] Epoch:[0/2](923300/4588595) loss:2.442 lr:0.0000100 epoch_Time:23338.0min: [2024-01-06 19:38:51,321][model8_pretrain.py][INFO] Epoch:[0/2](923300/4588595) loss:3.005 lr:0.0000100 epoch_Time:23338.0min: [2024-01-06 19:38:51,321][model8_pretrain.py][INFO] Epoch:[0/2](923300/4588595) loss:2.746 lr:0.0000100 epoch_Time:23338.0min: [2024-01-06 19:38:51,321][model8_pretrain.py][INFO] Epoch:[0/2](923300/4588595) loss:2.913 lr:0.0000100 epoch_Time:23338.0min: [2024-01-06 19:38:51,321][model8_pretrain.py][INFO] Epoch:[0/2](923300/4588595) loss:3.286 lr:0.0000100 epoch_Time:23338.0min: [2024-01-06 19:38:51,321][model8_pretrain.py][INFO] Epoch:[0/2](923300/4588595) loss:2.746 lr:0.0000100 epoch_Time:23338.0min: [2024-01-06 19:38:51,322][model8_pretrain.py][INFO] 
Epoch:[0/2](923300/4588595) loss:2.083 lr:0.0000100 epoch_Time:23338.0min: [2024-01-06 19:39:40,097][model8_pretrain.py][INFO] Epoch:[0/2](923400/4588595) loss:3.183 lr:0.0000100 epoch_Time:23339.0min: [2024-01-06 19:39:40,097][model8_pretrain.py][INFO] Epoch:[0/2](923400/4588595) loss:2.806 lr:0.0000100 epoch_Time:23339.0min: [2024-01-06 19:39:40,097][model8_pretrain.py][INFO] Epoch:[0/2](923400/4588595) loss:2.956 lr:0.0000100 epoch_Time:23339.0min: [2024-01-06 19:39:40,097][model8_pretrain.py][INFO] Epoch:[0/2](923400/4588595) loss:2.615 lr:0.0000100 epoch_Time:23339.0min: [2024-01-06 19:39:40,097][model8_pretrain.py][INFO] Epoch:[0/2](923400/4588595) loss:2.714 lr:0.0000100 epoch_Time:23339.0min: [2024-01-06 19:39:40,097][model8_pretrain.py][INFO] Epoch:[0/2](923400/4588595) loss:2.381 lr:0.0000100 epoch_Time:23339.0min: [2024-01-06 19:39:40,097][model8_pretrain.py][INFO] Epoch:[0/2](923400/4588595) loss:2.026 lr:0.0000100 epoch_Time:23339.0min: [2024-01-06 19:39:40,097][model8_pretrain.py][INFO] Epoch:[0/2](923400/4588595) loss:3.209 lr:0.0000100 epoch_Time:23339.0min: [2024-01-06 19:40:17,019][model8_pretrain.py][INFO] Epoch:[0/2](923500/4588595) loss:2.992 lr:0.0000100 epoch_Time:23338.0min: [2024-01-06 19:40:17,019][model8_pretrain.py][INFO] Epoch:[0/2](923500/4588595) loss:2.798 lr:0.0000100 epoch_Time:23338.0min: [2024-01-06 19:40:17,019][model8_pretrain.py][INFO] Epoch:[0/2](923500/4588595) loss:2.990 lr:0.0000100 epoch_Time:23338.0min: [2024-01-06 19:40:17,019][model8_pretrain.py][INFO] Epoch:[0/2](923500/4588595) loss:2.881 lr:0.0000100 epoch_Time:23338.0min: [2024-01-06 19:40:17,019][model8_pretrain.py][INFO] Epoch:[0/2](923500/4588595) loss:2.572 lr:0.0000100 epoch_Time:23338.0min: [2024-01-06 19:40:17,019][model8_pretrain.py][INFO] Epoch:[0/2](923500/4588595) loss:2.801 lr:0.0000100 epoch_Time:23338.0min: [2024-01-06 19:40:17,020][model8_pretrain.py][INFO] Epoch:[0/2](923500/4588595) loss:2.760 lr:0.0000100 epoch_Time:23338.0min: [2024-01-06 
19:40:17,020][model8_pretrain.py][INFO] Epoch:[0/2](923500/4588595) loss:3.272 lr:0.0000100 epoch_Time:23338.0min: [2024-01-06 19:40:53,948][model8_pretrain.py][INFO] Epoch:[0/2](923600/4588595) loss:3.133 lr:0.0000100 epoch_Time:23337.0min: [2024-01-06 19:40:53,948][model8_pretrain.py][INFO] Epoch:[0/2](923600/4588595) loss:2.636 lr:0.0000100 epoch_Time:23337.0min: [2024-01-06 19:40:53,948][model8_pretrain.py][INFO] Epoch:[0/2](923600/4588595) loss:2.563 lr:0.0000100 epoch_Time:23337.0min: [2024-01-06 19:40:53,948][model8_pretrain.py][INFO] Epoch:[0/2](923600/4588595) loss:2.530 lr:0.0000100 epoch_Time:23337.0min: [2024-01-06 19:40:53,948][model8_pretrain.py][INFO] Epoch:[0/2](923600/4588595) loss:3.086 lr:0.0000100 epoch_Time:23337.0min: [2024-01-06 19:40:53,948][model8_pretrain.py][INFO] Epoch:[0/2](923600/4588595) loss:2.607 lr:0.0000100 epoch_Time:23337.0min: [2024-01-06 19:40:53,948][model8_pretrain.py][INFO] Epoch:[0/2](923600/4588595) loss:2.726 lr:0.0000100 epoch_Time:23337.0min: [2024-01-06 19:40:53,948][model8_pretrain.py][INFO] Epoch:[0/2](923600/4588595) loss:3.023 lr:0.0000100 epoch_Time:23337.0min: [2024-01-06 19:41:30,878][model8_pretrain.py][INFO] Epoch:[0/2](923700/4588595) loss:2.271 lr:0.0000100 epoch_Time:23337.0min: [2024-01-06 19:41:30,878][model8_pretrain.py][INFO] Epoch:[0/2](923700/4588595) loss:2.792 lr:0.0000100 epoch_Time:23337.0min: [2024-01-06 19:41:30,878][model8_pretrain.py][INFO] Epoch:[0/2](923700/4588595) loss:2.590 lr:0.0000100 epoch_Time:23337.0min: [2024-01-06 19:41:30,878][model8_pretrain.py][INFO] Epoch:[0/2](923700/4588595) loss:3.093 lr:0.0000100 epoch_Time:23337.0min: [2024-01-06 19:41:30,878][model8_pretrain.py][INFO] Epoch:[0/2](923700/4588595) loss:3.273 lr:0.0000100 epoch_Time:23337.0min: [2024-01-06 19:41:30,878][model8_pretrain.py][INFO] Epoch:[0/2](923700/4588595) loss:2.477 lr:0.0000100 epoch_Time:23337.0min: [2024-01-06 19:41:30,878][model8_pretrain.py][INFO] Epoch:[0/2](923700/4588595) loss:3.407 lr:0.0000100 
epoch_Time:23337.0min: [2024-01-06 19:41:30,879][model8_pretrain.py][INFO] Epoch:[0/2](923700/4588595) loss:2.962 lr:0.0000100 epoch_Time:23337.0min: [2024-01-06 19:42:07,813][model8_pretrain.py][INFO] Epoch:[0/2](923800/4588595) loss:2.813 lr:0.0000100 epoch_Time:23336.0min: [2024-01-06 19:42:07,813][model8_pretrain.py][INFO] Epoch:[0/2](923800/4588595) loss:2.707 lr:0.0000100 epoch_Time:23336.0min: [2024-01-06 19:42:07,813][model8_pretrain.py][INFO] Epoch:[0/2](923800/4588595) loss:2.891 lr:0.0000100 epoch_Time:23336.0min: [2024-01-06 19:42:07,813][model8_pretrain.py][INFO] Epoch:[0/2](923800/4588595) loss:2.347 lr:0.0000100 epoch_Time:23336.0min: [2024-01-06 19:42:07,813][model8_pretrain.py][INFO] Epoch:[0/2](923800/4588595) loss:2.719 lr:0.0000100 epoch_Time:23336.0min: [2024-01-06 19:42:07,813][model8_pretrain.py][INFO] Epoch:[0/2](923800/4588595) loss:3.132 lr:0.0000100 epoch_Time:23336.0min: [2024-01-06 19:42:07,813][model8_pretrain.py][INFO] Epoch:[0/2](923800/4588595) loss:2.903 lr:0.0000100 epoch_Time:23336.0min: [2024-01-06 19:42:07,814][model8_pretrain.py][INFO] Epoch:[0/2](923800/4588595) loss:2.507 lr:0.0000100 epoch_Time:23336.0min: [2024-01-06 19:42:44,745][model8_pretrain.py][INFO] Epoch:[0/2](923900/4588595) loss:2.733 lr:0.0000100 epoch_Time:23335.0min: [2024-01-06 19:42:44,745][model8_pretrain.py][INFO] Epoch:[0/2](923900/4588595) loss:2.495 lr:0.0000100 epoch_Time:23335.0min: [2024-01-06 19:42:44,745][model8_pretrain.py][INFO] Epoch:[0/2](923900/4588595) loss:2.023 lr:0.0000100 epoch_Time:23335.0min: [2024-01-06 19:42:44,745][model8_pretrain.py][INFO] Epoch:[0/2](923900/4588595) loss:2.395 lr:0.0000100 epoch_Time:23335.0min: [2024-01-06 19:42:44,745][model8_pretrain.py][INFO] Epoch:[0/2](923900/4588595) loss:2.515 lr:0.0000100 epoch_Time:23335.0min: [2024-01-06 19:42:44,745][model8_pretrain.py][INFO] Epoch:[0/2](923900/4588595) loss:2.649 lr:0.0000100 epoch_Time:23335.0min: [2024-01-06 19:42:44,745][model8_pretrain.py][INFO] 
Epoch:[0/2](923900/4588595) loss:2.980 lr:0.0000100 epoch_Time:23335.0min: [2024-01-06 19:42:44,745][model8_pretrain.py][INFO] Epoch:[0/2](923900/4588595) loss:2.866 lr:0.0000100 epoch_Time:23335.0min: [2024-01-06 19:43:21,682][model8_pretrain.py][INFO] Epoch:[0/2](924000/4588595) loss:2.767 lr:0.0000100 epoch_Time:23334.0min: [2024-01-06 19:43:21,682][model8_pretrain.py][INFO] Epoch:[0/2](924000/4588595) loss:2.296 lr:0.0000100 epoch_Time:23334.0min: [2024-01-06 19:43:21,682][model8_pretrain.py][INFO] Epoch:[0/2](924000/4588595) loss:2.971 lr:0.0000100 epoch_Time:23334.0min: [2024-01-06 19:43:21,682][model8_pretrain.py][INFO] Epoch:[0/2](924000/4588595) loss:2.421 lr:0.0000100 epoch_Time:23334.0min: [2024-01-06 19:43:21,682][model8_pretrain.py][INFO] Epoch:[0/2](924000/4588595) loss:3.189 lr:0.0000100 epoch_Time:23334.0min: [2024-01-06 19:43:21,682][model8_pretrain.py][INFO] Epoch:[0/2](924000/4588595) loss:3.020 lr:0.0000100 epoch_Time:23334.0min: [2024-01-06 19:43:21,682][model8_pretrain.py][INFO] Epoch:[0/2](924000/4588595) loss:2.591 lr:0.0000100 epoch_Time:23334.0min: [2024-01-06 19:43:21,682][model8_pretrain.py][INFO] Epoch:[0/2](924000/4588595) loss:2.838 lr:0.0000100 epoch_Time:23334.0min: [2024-01-06 19:44:00,368][model8_pretrain.py][INFO] Epoch:[0/2](924100/4588595) loss:2.936 lr:0.0000100 epoch_Time:23333.0min: [2024-01-06 19:44:00,368][model8_pretrain.py][INFO] Epoch:[0/2](924100/4588595) loss:3.230 lr:0.0000100 epoch_Time:23333.0min: [2024-01-06 19:44:00,368][model8_pretrain.py][INFO] Epoch:[0/2](924100/4588595) loss:2.601 lr:0.0000100 epoch_Time:23333.0min: [2024-01-06 19:44:00,368][model8_pretrain.py][INFO] Epoch:[0/2](924100/4588595) loss:3.146 lr:0.0000100 epoch_Time:23333.0min: [2024-01-06 19:44:00,368][model8_pretrain.py][INFO] Epoch:[0/2](924100/4588595) loss:3.106 lr:0.0000100 epoch_Time:23333.0min: [2024-01-06 19:44:00,368][model8_pretrain.py][INFO] Epoch:[0/2](924100/4588595) loss:2.611 lr:0.0000100 epoch_Time:23333.0min: [2024-01-06 
19:44:00,368][model8_pretrain.py][INFO] Epoch:[0/2](924100/4588595) loss:3.101 lr:0.0000100 epoch_Time:23333.0min: [2024-01-06 19:44:00,368][model8_pretrain.py][INFO] Epoch:[0/2](924100/4588595) loss:2.608 lr:0.0000100 epoch_Time:23333.0min: [2024-01-06 19:44:48,971][model8_pretrain.py][INFO] Epoch:[0/2](924200/4588595) loss:2.260 lr:0.0000100 epoch_Time:23333.0min: [2024-01-06 19:44:48,971][model8_pretrain.py][INFO] Epoch:[0/2](924200/4588595) loss:2.505 lr:0.0000100 epoch_Time:23333.0min: [2024-01-06 19:44:48,971][model8_pretrain.py][INFO] Epoch:[0/2](924200/4588595) loss:3.040 lr:0.0000100 epoch_Time:23333.0min: [2024-01-06 19:44:48,971][model8_pretrain.py][INFO] Epoch:[0/2](924200/4588595) loss:3.140 lr:0.0000100 epoch_Time:23333.0min: [2024-01-06 19:44:48,971][model8_pretrain.py][INFO] Epoch:[0/2](924200/4588595) loss:2.798 lr:0.0000100 epoch_Time:23333.0min: [2024-01-06 19:44:48,971][model8_pretrain.py][INFO] Epoch:[0/2](924200/4588595) loss:3.088 lr:0.0000100 epoch_Time:23333.0min: [2024-01-06 19:44:48,971][model8_pretrain.py][INFO] Epoch:[0/2](924200/4588595) loss:2.501 lr:0.0000100 epoch_Time:23333.0min: [2024-01-06 19:44:48,971][model8_pretrain.py][INFO] Epoch:[0/2](924200/4588595) loss:2.537 lr:0.0000100 epoch_Time:23333.0min: [2024-01-06 19:45:25,903][model8_pretrain.py][INFO] Epoch:[0/2](924300/4588595) loss:2.946 lr:0.0000100 epoch_Time:23333.0min: [2024-01-06 19:45:25,903][model8_pretrain.py][INFO] Epoch:[0/2](924300/4588595) loss:3.113 lr:0.0000100 epoch_Time:23333.0min: [2024-01-06 19:45:25,903][model8_pretrain.py][INFO] Epoch:[0/2](924300/4588595) loss:2.336 lr:0.0000100 epoch_Time:23333.0min: [2024-01-06 19:45:25,903][model8_pretrain.py][INFO] Epoch:[0/2](924300/4588595) loss:2.725 lr:0.0000100 epoch_Time:23333.0min: [2024-01-06 19:45:25,903][model8_pretrain.py][INFO] Epoch:[0/2](924300/4588595) loss:2.993 lr:0.0000100 epoch_Time:23333.0min: [2024-01-06 19:45:25,903][model8_pretrain.py][INFO] Epoch:[0/2](924300/4588595) loss:2.568 lr:0.0000100 
epoch_Time:23333.0min: [2024-01-06 19:45:25,903][model8_pretrain.py][INFO] Epoch:[0/2](924300/4588595) loss:2.769 lr:0.0000100 epoch_Time:23333.0min: [2024-01-06 19:45:25,903][model8_pretrain.py][INFO] Epoch:[0/2](924300/4588595) loss:2.733 lr:0.0000100 epoch_Time:23333.0min: [2024-01-06 19:46:02,858][model8_pretrain.py][INFO] Epoch:[0/2](924400/4588595) loss:2.461 lr:0.0000100 epoch_Time:23332.0min: [2024-01-06 19:46:02,858][model8_pretrain.py][INFO] Epoch:[0/2](924400/4588595) loss:2.655 lr:0.0000100 epoch_Time:23332.0min: [2024-01-06 19:46:02,858][model8_pretrain.py][INFO] Epoch:[0/2](924400/4588595) loss:3.432 lr:0.0000100 epoch_Time:23332.0min: [2024-01-06 19:46:02,858][model8_pretrain.py][INFO] Epoch:[0/2](924400/4588595) loss:3.086 lr:0.0000100 epoch_Time:23332.0min: [2024-01-06 19:46:02,858][model8_pretrain.py][INFO] Epoch:[0/2](924400/4588595) loss:2.686 lr:0.0000100 epoch_Time:23332.0min: [2024-01-06 19:46:02,858][model8_pretrain.py][INFO] Epoch:[0/2](924400/4588595) loss:2.879 lr:0.0000100 epoch_Time:23332.0min: [2024-01-06 19:46:02,858][model8_pretrain.py][INFO] Epoch:[0/2](924400/4588595) loss:3.116 lr:0.0000100 epoch_Time:23332.0min: [2024-01-06 19:46:02,858][model8_pretrain.py][INFO] Epoch:[0/2](924400/4588595) loss:2.575 lr:0.0000100 epoch_Time:23332.0min: [2024-01-06 19:46:39,809][model8_pretrain.py][INFO] Epoch:[0/2](924500/4588595) loss:2.886 lr:0.0000100 epoch_Time:23332.0min: [2024-01-06 19:46:39,809][model8_pretrain.py][INFO] Epoch:[0/2](924500/4588595) loss:2.368 lr:0.0000100 epoch_Time:23332.0min: [2024-01-06 19:46:39,809][model8_pretrain.py][INFO] Epoch:[0/2](924500/4588595) loss:3.291 lr:0.0000100 epoch_Time:23332.0min: [2024-01-06 19:46:39,809][model8_pretrain.py][INFO] Epoch:[0/2](924500/4588595) loss:2.703 lr:0.0000100 epoch_Time:23332.0min: [2024-01-06 19:46:39,809][model8_pretrain.py][INFO] Epoch:[0/2](924500/4588595) loss:3.171 lr:0.0000100 epoch_Time:23332.0min: [2024-01-06 19:46:39,809][model8_pretrain.py][INFO] 
Epoch:[0/2](924500/4588595) loss:2.623 lr:0.0000100 epoch_Time:23332.0min: [2024-01-06 19:46:39,810][model8_pretrain.py][INFO] Epoch:[0/2](924500/4588595) loss:2.924 lr:0.0000100 epoch_Time:23332.0min: [2024-01-06 19:46:39,810][model8_pretrain.py][INFO] Epoch:[0/2](924500/4588595) loss:3.342 lr:0.0000100 epoch_Time:23332.0min: [2024-01-06 19:47:16,751][model8_pretrain.py][INFO] Epoch:[0/2](924600/4588595) loss:2.602 lr:0.0000100 epoch_Time:23331.0min: [2024-01-06 19:47:16,751][model8_pretrain.py][INFO] Epoch:[0/2](924600/4588595) loss:2.504 lr:0.0000100 epoch_Time:23331.0min: [2024-01-06 19:47:16,751][model8_pretrain.py][INFO] Epoch:[0/2](924600/4588595) loss:2.900 lr:0.0000100 epoch_Time:23331.0min: [2024-01-06 19:47:16,751][model8_pretrain.py][INFO] Epoch:[0/2](924600/4588595) loss:2.355 lr:0.0000100 epoch_Time:23331.0min: [2024-01-06 19:47:16,751][model8_pretrain.py][INFO] Epoch:[0/2](924600/4588595) loss:2.412 lr:0.0000100 epoch_Time:23331.0min: [2024-01-06 19:47:16,751][model8_pretrain.py][INFO] Epoch:[0/2](924600/4588595) loss:2.770 lr:0.0000100 epoch_Time:23331.0min: [2024-01-06 19:47:16,751][model8_pretrain.py][INFO] Epoch:[0/2](924600/4588595) loss:2.659 lr:0.0000100 epoch_Time:23331.0min: [2024-01-06 19:47:16,751][model8_pretrain.py][INFO] Epoch:[0/2](924600/4588595) loss:3.200 lr:0.0000100 epoch_Time:23331.0min: [2024-01-06 19:47:53,701][model8_pretrain.py][INFO] Epoch:[0/2](924700/4588595) loss:2.759 lr:0.0000100 epoch_Time:23330.0min: [2024-01-06 19:47:53,701][model8_pretrain.py][INFO] Epoch:[0/2](924700/4588595) loss:3.018 lr:0.0000100 epoch_Time:23330.0min: [2024-01-06 19:47:53,701][model8_pretrain.py][INFO] Epoch:[0/2](924700/4588595) loss:2.898 lr:0.0000100 epoch_Time:23330.0min: [2024-01-06 19:47:53,701][model8_pretrain.py][INFO] Epoch:[0/2](924700/4588595) loss:2.824 lr:0.0000100 epoch_Time:23330.0min: [2024-01-06 19:47:53,701][model8_pretrain.py][INFO] Epoch:[0/2](924700/4588595) loss:2.941 lr:0.0000100 epoch_Time:23330.0min: [2024-01-06 
19:47:53,701][model8_pretrain.py][INFO] Epoch:[0/2](924700/4588595) loss:2.996 lr:0.0000100 epoch_Time:23330.0min: [2024-01-06 19:47:53,701][model8_pretrain.py][INFO] Epoch:[0/2](924700/4588595) loss:2.954 lr:0.0000100 epoch_Time:23330.0min: [2024-01-06 19:47:53,701][model8_pretrain.py][INFO] Epoch:[0/2](924700/4588595) loss:2.924 lr:0.0000100 epoch_Time:23330.0min: [2024-01-06 19:48:30,654][model8_pretrain.py][INFO] Epoch:[0/2](924800/4588595) loss:3.083 lr:0.0000100 epoch_Time:23330.0min: [2024-01-06 19:48:30,654][model8_pretrain.py][INFO] Epoch:[0/2](924800/4588595) loss:3.081 lr:0.0000100 epoch_Time:23330.0min: [2024-01-06 19:48:30,654][model8_pretrain.py][INFO] Epoch:[0/2](924800/4588595) loss:2.932 lr:0.0000100 epoch_Time:23330.0min: [2024-01-06 19:48:30,654][model8_pretrain.py][INFO] Epoch:[0/2](924800/4588595) loss:2.574 lr:0.0000100 epoch_Time:23330.0min: [2024-01-06 19:48:30,654][model8_pretrain.py][INFO] Epoch:[0/2](924800/4588595) loss:2.762 lr:0.0000100 epoch_Time:23330.0min: [2024-01-06 19:48:30,654][model8_pretrain.py][INFO] Epoch:[0/2](924800/4588595) loss:3.176 lr:0.0000100 epoch_Time:23330.0min: [2024-01-06 19:48:30,654][model8_pretrain.py][INFO] Epoch:[0/2](924800/4588595) loss:3.091 lr:0.0000100 epoch_Time:23330.0min: [2024-01-06 19:48:30,654][model8_pretrain.py][INFO] Epoch:[0/2](924800/4588595) loss:2.829 lr:0.0000100 epoch_Time:23330.0min: [2024-01-06 19:49:07,603][model8_pretrain.py][INFO] Epoch:[0/2](924900/4588595) loss:2.830 lr:0.0000100 epoch_Time:23329.0min: [2024-01-06 19:49:07,603][model8_pretrain.py][INFO] Epoch:[0/2](924900/4588595) loss:2.924 lr:0.0000100 epoch_Time:23329.0min: [2024-01-06 19:49:07,603][model8_pretrain.py][INFO] Epoch:[0/2](924900/4588595) loss:2.137 lr:0.0000100 epoch_Time:23329.0min: [2024-01-06 19:49:07,603][model8_pretrain.py][INFO] Epoch:[0/2](924900/4588595) loss:2.962 lr:0.0000100 epoch_Time:23329.0min: [2024-01-06 19:49:07,603][model8_pretrain.py][INFO] Epoch:[0/2](924900/4588595) loss:2.815 lr:0.0000100 
epoch_Time:23329.0min: [2024-01-06 19:49:07,604][model8_pretrain.py][INFO] Epoch:[0/2](924900/4588595) loss:3.343 lr:0.0000100 epoch_Time:23329.0min: [2024-01-06 19:49:07,604][model8_pretrain.py][INFO] Epoch:[0/2](924900/4588595) loss:2.942 lr:0.0000100 epoch_Time:23329.0min: [2024-01-06 19:49:09,339][model8_pretrain.py][INFO] Epoch:[0/2](924900/4588595) loss:3.097 lr:0.0000100 epoch_Time:23329.0min: [2024-01-06 19:49:58,031][model8_pretrain.py][INFO] Epoch:[0/2](925000/4588595) loss:2.604 lr:0.0000100 epoch_Time:23329.0min: [2024-01-06 19:49:58,031][model8_pretrain.py][INFO] Epoch:[0/2](925000/4588595) loss:2.553 lr:0.0000100 epoch_Time:23329.0min: [2024-01-06 19:49:58,031][model8_pretrain.py][INFO] Epoch:[0/2](925000/4588595) loss:2.624 lr:0.0000100 epoch_Time:23329.0min: [2024-01-06 19:49:58,031][model8_pretrain.py][INFO] Epoch:[0/2](925000/4588595) loss:2.954 lr:0.0000100 epoch_Time:23329.0min: [2024-01-06 19:49:58,031][model8_pretrain.py][INFO] Epoch:[0/2](925000/4588595) loss:2.744 lr:0.0000100 epoch_Time:23329.0min: [2024-01-06 19:49:58,031][model8_pretrain.py][INFO] Epoch:[0/2](925000/4588595) loss:2.398 lr:0.0000100 epoch_Time:23329.0min: [2024-01-06 19:49:58,031][model8_pretrain.py][INFO] Epoch:[0/2](925000/4588595) loss:2.993 lr:0.0000100 epoch_Time:23329.0min: [2024-01-06 19:49:58,031][model8_pretrain.py][INFO] Epoch:[0/2](925000/4588595) loss:3.362 lr:0.0000100 epoch_Time:23329.0min: [2024-01-06 19:50:34,981][model8_pretrain.py][INFO] Epoch:[0/2](925100/4588595) loss:2.746 lr:0.0000100 epoch_Time:23328.0min: [2024-01-06 19:50:34,981][model8_pretrain.py][INFO] Epoch:[0/2](925100/4588595) loss:3.084 lr:0.0000100 epoch_Time:23328.0min: [2024-01-06 19:50:34,981][model8_pretrain.py][INFO] Epoch:[0/2](925100/4588595) loss:2.626 lr:0.0000100 epoch_Time:23328.0min: [2024-01-06 19:50:34,981][model8_pretrain.py][INFO] Epoch:[0/2](925100/4588595) loss:3.102 lr:0.0000100 epoch_Time:23328.0min: [2024-01-06 19:50:34,981][model8_pretrain.py][INFO] 
Epoch:[0/2](925100/4588595) loss:2.590 lr:0.0000100 epoch_Time:23328.0min: [2024-01-06 19:50:34,981][model8_pretrain.py][INFO] Epoch:[0/2](925100/4588595) loss:2.374 lr:0.0000100 epoch_Time:23328.0min: [2024-01-06 19:50:34,981][model8_pretrain.py][INFO] Epoch:[0/2](925100/4588595) loss:2.349 lr:0.0000100 epoch_Time:23328.0min: [2024-01-06 19:50:34,981][model8_pretrain.py][INFO] Epoch:[0/2](925100/4588595) loss:2.913 lr:0.0000100 epoch_Time:23328.0min: [2024-01-06 19:51:11,922][model8_pretrain.py][INFO] Epoch:[0/2](925200/4588595) loss:2.403 lr:0.0000100 epoch_Time:23327.0min: [2024-01-06 19:51:11,922][model8_pretrain.py][INFO] Epoch:[0/2](925200/4588595) loss:3.038 lr:0.0000100 epoch_Time:23327.0min: [2024-01-06 19:51:11,922][model8_pretrain.py][INFO] Epoch:[0/2](925200/4588595) loss:3.237 lr:0.0000100 epoch_Time:23327.0min: [2024-01-06 19:51:11,922][model8_pretrain.py][INFO] Epoch:[0/2](925200/4588595) loss:2.373 lr:0.0000100 epoch_Time:23327.0min: [2024-01-06 19:51:11,923][model8_pretrain.py][INFO] Epoch:[0/2](925200/4588595) loss:2.427 lr:0.0000100 epoch_Time:23327.0min: [2024-01-06 19:51:11,923][model8_pretrain.py][INFO] Epoch:[0/2](925200/4588595) loss:2.727 lr:0.0000100 epoch_Time:23327.0min: [2024-01-06 19:51:11,923][model8_pretrain.py][INFO] Epoch:[0/2](925200/4588595) loss:2.671 lr:0.0000100 epoch_Time:23327.0min: [2024-01-06 19:51:11,923][model8_pretrain.py][INFO] Epoch:[0/2](925200/4588595) loss:2.744 lr:0.0000100 epoch_Time:23327.0min: [2024-01-06 19:51:48,875][model8_pretrain.py][INFO] Epoch:[0/2](925300/4588595) loss:2.760 lr:0.0000100 epoch_Time:23326.0min: [2024-01-06 19:51:48,875][model8_pretrain.py][INFO] Epoch:[0/2](925300/4588595) loss:2.581 lr:0.0000100 epoch_Time:23326.0min: [2024-01-06 19:51:48,875][model8_pretrain.py][INFO] Epoch:[0/2](925300/4588595) loss:2.898 lr:0.0000100 epoch_Time:23326.0min: [2024-01-06 19:51:48,875][model8_pretrain.py][INFO] Epoch:[0/2](925300/4588595) loss:3.260 lr:0.0000100 epoch_Time:23326.0min: [2024-01-06 
19:51:48,875][model8_pretrain.py][INFO] Epoch:[0/2](925300/4588595) loss:2.870 lr:0.0000100 epoch_Time:23326.0min: [2024-01-06 19:51:48,875][model8_pretrain.py][INFO] Epoch:[0/2](925300/4588595) loss:2.813 lr:0.0000100 epoch_Time:23326.0min: [2024-01-06 19:51:48,875][model8_pretrain.py][INFO] Epoch:[0/2](925300/4588595) loss:3.224 lr:0.0000100 epoch_Time:23326.0min: [2024-01-06 19:51:48,876][model8_pretrain.py][INFO] Epoch:[0/2](925300/4588595) loss:3.403 lr:0.0000100 epoch_Time:23326.0min: [2024-01-06 19:52:25,836][model8_pretrain.py][INFO] Epoch:[0/2](925400/4588595) loss:2.908 lr:0.0000100 epoch_Time:23326.0min: [2024-01-06 19:52:25,836][model8_pretrain.py][INFO] Epoch:[0/2](925400/4588595) loss:3.082 lr:0.0000100 epoch_Time:23326.0min: [2024-01-06 19:52:25,836][model8_pretrain.py][INFO] Epoch:[0/2](925400/4588595) loss:2.154 lr:0.0000100 epoch_Time:23326.0min: [2024-01-06 19:52:25,836][model8_pretrain.py][INFO] Epoch:[0/2](925400/4588595) loss:2.431 lr:0.0000100 epoch_Time:23326.0min: [2024-01-06 19:52:25,836][model8_pretrain.py][INFO] Epoch:[0/2](925400/4588595) loss:2.807 lr:0.0000100 epoch_Time:23326.0min: [2024-01-06 19:52:25,836][model8_pretrain.py][INFO] Epoch:[0/2](925400/4588595) loss:2.340 lr:0.0000100 epoch_Time:23326.0min: [2024-01-06 19:52:25,836][model8_pretrain.py][INFO] Epoch:[0/2](925400/4588595) loss:2.508 lr:0.0000100 epoch_Time:23326.0min: [2024-01-06 19:52:25,836][model8_pretrain.py][INFO] Epoch:[0/2](925400/4588595) loss:3.424 lr:0.0000100 epoch_Time:23326.0min: [2024-01-06 19:53:02,783][model8_pretrain.py][INFO] Epoch:[0/2](925500/4588595) loss:2.975 lr:0.0000100 epoch_Time:23325.0min: [2024-01-06 19:53:02,783][model8_pretrain.py][INFO] Epoch:[0/2](925500/4588595) loss:2.862 lr:0.0000100 epoch_Time:23325.0min: [2024-01-06 19:53:02,783][model8_pretrain.py][INFO] Epoch:[0/2](925500/4588595) loss:2.752 lr:0.0000100 epoch_Time:23325.0min: [2024-01-06 19:53:02,783][model8_pretrain.py][INFO] Epoch:[0/2](925500/4588595) loss:3.282 lr:0.0000100 
epoch_Time:23325.0min: [2024-01-06 19:53:02,783][model8_pretrain.py][INFO] Epoch:[0/2](925500/4588595) loss:3.027 lr:0.0000100 epoch_Time:23325.0min: [2024-01-06 19:53:02,783][model8_pretrain.py][INFO] Epoch:[0/2](925500/4588595) loss:3.172 lr:0.0000100 epoch_Time:23325.0min: [2024-01-06 19:53:02,783][model8_pretrain.py][INFO] Epoch:[0/2](925500/4588595) loss:3.059 lr:0.0000100 epoch_Time:23325.0min: [2024-01-06 19:53:02,783][model8_pretrain.py][INFO] Epoch:[0/2](925500/4588595) loss:3.181 lr:0.0000100 epoch_Time:23325.0min: [2024-01-06 19:53:39,721][model8_pretrain.py][INFO] Epoch:[0/2](925600/4588595) loss:2.938 lr:0.0000100 epoch_Time:23325.0min: [2024-01-06 19:53:39,721][model8_pretrain.py][INFO] Epoch:[0/2](925600/4588595) loss:3.040 lr:0.0000100 epoch_Time:23325.0min: [2024-01-06 19:53:39,721][model8_pretrain.py][INFO] Epoch:[0/2](925600/4588595) loss:2.524 lr:0.0000100 epoch_Time:23325.0min: [2024-01-06 19:53:39,721][model8_pretrain.py][INFO] Epoch:[0/2](925600/4588595) loss:2.998 lr:0.0000100 epoch_Time:23325.0min: [2024-01-06 19:53:39,721][model8_pretrain.py][INFO] Epoch:[0/2](925600/4588595) loss:2.809 lr:0.0000100 epoch_Time:23325.0min: [2024-01-06 19:53:39,721][model8_pretrain.py][INFO] Epoch:[0/2](925600/4588595) loss:3.003 lr:0.0000100 epoch_Time:23325.0min: [2024-01-06 19:53:39,721][model8_pretrain.py][INFO] Epoch:[0/2](925600/4588595) loss:2.700 lr:0.0000100 epoch_Time:23325.0min: [2024-01-06 19:53:39,721][model8_pretrain.py][INFO] Epoch:[0/2](925600/4588595) loss:2.700 lr:0.0000100 epoch_Time:23325.0min: [2024-01-06 19:54:16,669][model8_pretrain.py][INFO] Epoch:[0/2](925700/4588595) loss:2.675 lr:0.0000100 epoch_Time:23324.0min: [2024-01-06 19:54:16,669][model8_pretrain.py][INFO] Epoch:[0/2](925700/4588595) loss:2.808 lr:0.0000100 epoch_Time:23324.0min: [2024-01-06 19:54:16,669][model8_pretrain.py][INFO] Epoch:[0/2](925700/4588595) loss:2.400 lr:0.0000100 epoch_Time:23324.0min: [2024-01-06 19:54:16,670][model8_pretrain.py][INFO] 
Epoch:[0/2](925700/4588595) loss:3.304 lr:0.0000100 epoch_Time:23324.0min: [2024-01-06 19:54:16,670][model8_pretrain.py][INFO] Epoch:[0/2](925700/4588595) loss:2.263 lr:0.0000100 epoch_Time:23324.0min: [2024-01-06 19:54:16,670][model8_pretrain.py][INFO] Epoch:[0/2](925700/4588595) loss:2.572 lr:0.0000100 epoch_Time:23324.0min: [2024-01-06 19:54:16,670][model8_pretrain.py][INFO] Epoch:[0/2](925700/4588595) loss:3.005 lr:0.0000100 epoch_Time:23324.0min: [2024-01-06 19:54:16,671][model8_pretrain.py][INFO] Epoch:[0/2](925700/4588595) loss:3.069 lr:0.0000100 epoch_Time:23324.0min: [2024-01-06 19:55:07,148][model8_pretrain.py][INFO] Epoch:[0/2](925800/4588595) loss:2.602 lr:0.0000100 epoch_Time:23324.0min: [2024-01-06 19:55:07,148][model8_pretrain.py][INFO] Epoch:[0/2](925800/4588595) loss:2.874 lr:0.0000100 epoch_Time:23324.0min: [2024-01-06 19:55:07,148][model8_pretrain.py][INFO] Epoch:[0/2](925800/4588595) loss:2.919 lr:0.0000100 epoch_Time:23324.0min: [2024-01-06 19:55:07,148][model8_pretrain.py][INFO] Epoch:[0/2](925800/4588595) loss:3.335 lr:0.0000100 epoch_Time:23324.0min: [2024-01-06 19:55:07,148][model8_pretrain.py][INFO] Epoch:[0/2](925800/4588595) loss:1.561 lr:0.0000100 epoch_Time:23324.0min: [2024-01-06 19:55:07,148][model8_pretrain.py][INFO] Epoch:[0/2](925800/4588595) loss:3.023 lr:0.0000100 epoch_Time:23324.0min: [2024-01-06 19:55:07,148][model8_pretrain.py][INFO] Epoch:[0/2](925800/4588595) loss:2.826 lr:0.0000100 epoch_Time:23324.0min: [2024-01-06 19:55:07,148][model8_pretrain.py][INFO] Epoch:[0/2](925800/4588595) loss:2.365 lr:0.0000100 epoch_Time:23324.0min: [2024-01-06 19:55:44,096][model8_pretrain.py][INFO] Epoch:[0/2](925900/4588595) loss:2.166 lr:0.0000100 epoch_Time:23324.0min: [2024-01-06 19:55:44,096][model8_pretrain.py][INFO] Epoch:[0/2](925900/4588595) loss:2.899 lr:0.0000100 epoch_Time:23324.0min: [2024-01-06 19:55:44,096][model8_pretrain.py][INFO] Epoch:[0/2](925900/4588595) loss:2.635 lr:0.0000100 epoch_Time:23324.0min: [2024-01-06 
19:55:44,096][model8_pretrain.py][INFO] Epoch:[0/2](925900/4588595) loss:2.478 lr:0.0000100 epoch_Time:23324.0min: [2024-01-06 19:55:44,096][model8_pretrain.py][INFO] Epoch:[0/2](925900/4588595) loss:2.869 lr:0.0000100 epoch_Time:23324.0min: [2024-01-06 19:55:44,096][model8_pretrain.py][INFO] Epoch:[0/2](925900/4588595) loss:3.279 lr:0.0000100 epoch_Time:23324.0min: [2024-01-06 19:55:44,096][model8_pretrain.py][INFO] Epoch:[0/2](925900/4588595) loss:2.995 lr:0.0000100 epoch_Time:23324.0min: [2024-01-06 19:55:44,096][model8_pretrain.py][INFO] Epoch:[0/2](925900/4588595) loss:2.341 lr:0.0000100 epoch_Time:23324.0min: [2024-01-06 19:56:21,044][model8_pretrain.py][INFO] Epoch:[0/2](926000/4588595) loss:2.708 lr:0.0000100 epoch_Time:23323.0min: [2024-01-06 19:56:21,044][model8_pretrain.py][INFO] Epoch:[0/2](926000/4588595) loss:3.037 lr:0.0000100 epoch_Time:23323.0min: [2024-01-06 19:56:21,044][model8_pretrain.py][INFO] Epoch:[0/2](926000/4588595) loss:3.149 lr:0.0000100 epoch_Time:23323.0min: [2024-01-06 19:56:21,044][model8_pretrain.py][INFO] Epoch:[0/2](926000/4588595) loss:2.408 lr:0.0000100 epoch_Time:23323.0min: [2024-01-06 19:56:21,044][model8_pretrain.py][INFO] Epoch:[0/2](926000/4588595) loss:2.765 lr:0.0000100 epoch_Time:23323.0min: [2024-01-06 19:56:21,044][model8_pretrain.py][INFO] Epoch:[0/2](926000/4588595) loss:2.940 lr:0.0000100 epoch_Time:23323.0min: [2024-01-06 19:56:21,045][model8_pretrain.py][INFO] Epoch:[0/2](926000/4588595) loss:2.491 lr:0.0000100 epoch_Time:23323.0min: [2024-01-06 19:56:21,045][model8_pretrain.py][INFO] Epoch:[0/2](926000/4588595) loss:2.413 lr:0.0000100 epoch_Time:23323.0min: [2024-01-06 19:56:57,995][model8_pretrain.py][INFO] Epoch:[0/2](926100/4588595) loss:2.826 lr:0.0000100 epoch_Time:23321.0min: [2024-01-06 19:56:57,995][model8_pretrain.py][INFO] Epoch:[0/2](926100/4588595) loss:2.237 lr:0.0000100 epoch_Time:23321.0min: [2024-01-06 19:56:57,996][model8_pretrain.py][INFO] Epoch:[0/2](926100/4588595) loss:2.890 lr:0.0000100 
epoch_Time:23321.0min: [2024-01-06 19:56:57,996][model8_pretrain.py][INFO] Epoch:[0/2](926100/4588595) loss:3.045 lr:0.0000100 epoch_Time:23321.0min: [2024-01-06 19:56:57,996][model8_pretrain.py][INFO] Epoch:[0/2](926100/4588595) loss:2.508 lr:0.0000100 epoch_Time:23321.0min: [2024-01-06 19:56:57,996][model8_pretrain.py][INFO] Epoch:[0/2](926100/4588595) loss:2.872 lr:0.0000100 epoch_Time:23321.0min: [2024-01-06 19:56:57,996][model8_pretrain.py][INFO] Epoch:[0/2](926100/4588595) loss:3.194 lr:0.0000100 epoch_Time:23321.0min: [2024-01-06 19:56:57,996][model8_pretrain.py][INFO] Epoch:[0/2](926100/4588595) loss:2.753 lr:0.0000100 epoch_Time:23321.0min: [2024-01-06 19:57:34,949][model8_pretrain.py][INFO] Epoch:[0/2](926200/4588595) loss:2.738 lr:0.0000100 epoch_Time:23321.0min: [2024-01-06 19:57:34,949][model8_pretrain.py][INFO] Epoch:[0/2](926200/4588595) loss:3.096 lr:0.0000100 epoch_Time:23321.0min: [2024-01-06 19:57:34,949][model8_pretrain.py][INFO] Epoch:[0/2](926200/4588595) loss:2.656 lr:0.0000100 epoch_Time:23321.0min: [2024-01-06 19:57:34,949][model8_pretrain.py][INFO] Epoch:[0/2](926200/4588595) loss:2.383 lr:0.0000100 epoch_Time:23321.0min: [2024-01-06 19:57:34,949][model8_pretrain.py][INFO] Epoch:[0/2](926200/4588595) loss:2.123 lr:0.0000100 epoch_Time:23321.0min: [2024-01-06 19:57:34,949][model8_pretrain.py][INFO] Epoch:[0/2](926200/4588595) loss:2.981 lr:0.0000100 epoch_Time:23321.0min: [2024-01-06 19:57:34,950][model8_pretrain.py][INFO] Epoch:[0/2](926200/4588595) loss:2.978 lr:0.0000100 epoch_Time:23321.0min: [2024-01-06 19:57:34,950][model8_pretrain.py][INFO] Epoch:[0/2](926200/4588595) loss:2.801 lr:0.0000100 epoch_Time:23321.0min: [2024-01-06 19:58:11,899][model8_pretrain.py][INFO] Epoch:[0/2](926300/4588595) loss:2.974 lr:0.0000100 epoch_Time:23320.0min: [2024-01-06 19:58:11,899][model8_pretrain.py][INFO] Epoch:[0/2](926300/4588595) loss:2.923 lr:0.0000100 epoch_Time:23320.0min: [2024-01-06 19:58:11,899][model8_pretrain.py][INFO] 
Epoch:[0/2](926300/4588595) loss:2.595 lr:0.0000100 epoch_Time:23320.0min: [2024-01-06 19:58:11,899][model8_pretrain.py][INFO] Epoch:[0/2](926300/4588595) loss:2.190 lr:0.0000100 epoch_Time:23320.0min: [2024-01-06 19:58:11,899][model8_pretrain.py][INFO] Epoch:[0/2](926300/4588595) loss:2.917 lr:0.0000100 epoch_Time:23320.0min: [2024-01-06 19:58:11,899][model8_pretrain.py][INFO] Epoch:[0/2](926300/4588595) loss:2.991 lr:0.0000100 epoch_Time:23320.0min: [2024-01-06 19:58:11,899][model8_pretrain.py][INFO] Epoch:[0/2](926300/4588595) loss:2.655 lr:0.0000100 epoch_Time:23320.0min: [2024-01-06 19:58:11,899][model8_pretrain.py][INFO] Epoch:[0/2](926300/4588595) loss:2.755 lr:0.0000100 epoch_Time:23320.0min: [2024-01-06 19:58:48,855][model8_pretrain.py][INFO] Epoch:[0/2](926400/4588595) loss:2.717 lr:0.0000100 epoch_Time:23319.0min: [2024-01-06 19:58:48,855][model8_pretrain.py][INFO] Epoch:[0/2](926400/4588595) loss:2.808 lr:0.0000100 epoch_Time:23319.0min: [2024-01-06 19:58:48,855][model8_pretrain.py][INFO] Epoch:[0/2](926400/4588595) loss:2.467 lr:0.0000100 epoch_Time:23319.0min: [2024-01-06 19:58:48,855][model8_pretrain.py][INFO] Epoch:[0/2](926400/4588595) loss:2.355 lr:0.0000100 epoch_Time:23319.0min: [2024-01-06 19:58:48,855][model8_pretrain.py][INFO] Epoch:[0/2](926400/4588595) loss:3.191 lr:0.0000100 epoch_Time:23319.0min: [2024-01-06 19:58:48,855][model8_pretrain.py][INFO] Epoch:[0/2](926400/4588595) loss:3.430 lr:0.0000100 epoch_Time:23319.0min: [2024-01-06 19:58:48,855][model8_pretrain.py][INFO] Epoch:[0/2](926400/4588595) loss:2.766 lr:0.0000100 epoch_Time:23319.0min: [2024-01-06 19:58:48,855][model8_pretrain.py][INFO] Epoch:[0/2](926400/4588595) loss:3.077 lr:0.0000100 epoch_Time:23319.0min: [2024-01-06 19:59:25,810][model8_pretrain.py][INFO] Epoch:[0/2](926500/4588595) loss:3.195 lr:0.0000100 epoch_Time:23319.0min: [2024-01-06 19:59:25,810][model8_pretrain.py][INFO] Epoch:[0/2](926500/4588595) loss:2.546 lr:0.0000100 epoch_Time:23319.0min: [2024-01-06 
19:59:25,810][model8_pretrain.py][INFO] Epoch:[0/2](926500/4588595) loss:2.155 lr:0.0000100 epoch_Time:23319.0min: [2024-01-06 19:59:25,810][model8_pretrain.py][INFO] Epoch:[0/2](926500/4588595) loss:3.304 lr:0.0000100 epoch_Time:23319.0min: [2024-01-06 19:59:25,811][model8_pretrain.py][INFO] Epoch:[0/2](926500/4588595) loss:2.817 lr:0.0000100 epoch_Time:23319.0min: [2024-01-06 19:59:25,811][model8_pretrain.py][INFO] Epoch:[0/2](926500/4588595) loss:3.253 lr:0.0000100 epoch_Time:23319.0min: [2024-01-06 19:59:25,811][model8_pretrain.py][INFO] Epoch:[0/2](926500/4588595) loss:2.604 lr:0.0000100 epoch_Time:23319.0min: [2024-01-06 19:59:25,811][model8_pretrain.py][INFO] Epoch:[0/2](926500/4588595) loss:2.633 lr:0.0000100 epoch_Time:23319.0min: [2024-01-06 20:00:16,318][model8_pretrain.py][INFO] Epoch:[0/2](926600/4588595) loss:2.938 lr:0.0000100 epoch_Time:23319.0min: [2024-01-06 20:00:16,318][model8_pretrain.py][INFO] Epoch:[0/2](926600/4588595) loss:2.803 lr:0.0000100 epoch_Time:23319.0min: [2024-01-06 20:00:16,318][model8_pretrain.py][INFO] Epoch:[0/2](926600/4588595) loss:2.757 lr:0.0000100 epoch_Time:23319.0min: [2024-01-06 20:00:16,318][model8_pretrain.py][INFO] Epoch:[0/2](926600/4588595) loss:2.919 lr:0.0000100 epoch_Time:23319.0min: [2024-01-06 20:00:16,318][model8_pretrain.py][INFO] Epoch:[0/2](926600/4588595) loss:2.980 lr:0.0000100 epoch_Time:23319.0min: [2024-01-06 20:00:16,318][model8_pretrain.py][INFO] Epoch:[0/2](926600/4588595) loss:2.841 lr:0.0000100 epoch_Time:23319.0min: [2024-01-06 20:00:16,318][model8_pretrain.py][INFO] Epoch:[0/2](926600/4588595) loss:2.820 lr:0.0000100 epoch_Time:23319.0min: [2024-01-06 20:00:16,318][model8_pretrain.py][INFO] Epoch:[0/2](926600/4588595) loss:2.639 lr:0.0000100 epoch_Time:23319.0min: [2024-01-06 20:00:53,262][model8_pretrain.py][INFO] Epoch:[0/2](926700/4588595) loss:3.202 lr:0.0000100 epoch_Time:23318.0min: [2024-01-06 20:00:53,262][model8_pretrain.py][INFO] Epoch:[0/2](926700/4588595) loss:2.722 lr:0.0000100 
epoch_Time:23318.0min: [2024-01-06 20:00:53,262][model8_pretrain.py][INFO] Epoch:[0/2](926700/4588595) loss:2.828 lr:0.0000100 epoch_Time:23318.0min: [2024-01-06 20:00:53,262][model8_pretrain.py][INFO] Epoch:[0/2](926700/4588595) loss:2.114 lr:0.0000100 epoch_Time:23318.0min: [2024-01-06 20:00:53,263][model8_pretrain.py][INFO] Epoch:[0/2](926700/4588595) loss:2.845 lr:0.0000100 epoch_Time:23318.0min: [2024-01-06 20:00:53,262][model8_pretrain.py][INFO] Epoch:[0/2](926700/4588595) loss:2.972 lr:0.0000100 epoch_Time:23318.0min: [2024-01-06 20:00:53,263][model8_pretrain.py][INFO] Epoch:[0/2](926700/4588595) loss:1.989 lr:0.0000100 epoch_Time:23318.0min: [2024-01-06 20:00:53,263][model8_pretrain.py][INFO] Epoch:[0/2](926700/4588595) loss:3.212 lr:0.0000100 epoch_Time:23318.0min: [2024-01-06 20:01:30,229][model8_pretrain.py][INFO] Epoch:[0/2](926800/4588595) loss:3.028 lr:0.0000100 epoch_Time:23318.0min: [2024-01-06 20:01:30,229][model8_pretrain.py][INFO] Epoch:[0/2](926800/4588595) loss:3.161 lr:0.0000100 epoch_Time:23318.0min: [2024-01-06 20:01:30,229][model8_pretrain.py][INFO] Epoch:[0/2](926800/4588595) loss:3.201 lr:0.0000100 epoch_Time:23318.0min: [2024-01-06 20:01:30,229][model8_pretrain.py][INFO] Epoch:[0/2](926800/4588595) loss:2.722 lr:0.0000100 epoch_Time:23318.0min: [2024-01-06 20:01:30,230][model8_pretrain.py][INFO] Epoch:[0/2](926800/4588595) loss:2.568 lr:0.0000100 epoch_Time:23318.0min: [2024-01-06 20:01:30,229][model8_pretrain.py][INFO] Epoch:[0/2](926800/4588595) loss:3.103 lr:0.0000100 epoch_Time:23318.0min: [2024-01-06 20:01:30,230][model8_pretrain.py][INFO] Epoch:[0/2](926800/4588595) loss:2.816 lr:0.0000100 epoch_Time:23318.0min: [2024-01-06 20:01:30,230][model8_pretrain.py][INFO] Epoch:[0/2](926800/4588595) loss:2.798 lr:0.0000100 epoch_Time:23318.0min: [2024-01-06 20:02:07,185][model8_pretrain.py][INFO] Epoch:[0/2](926900/4588595) loss:2.728 lr:0.0000100 epoch_Time:23317.0min: [2024-01-06 20:02:07,185][model8_pretrain.py][INFO] 
Epoch:[0/2](926900/4588595) loss:3.035 lr:0.0000100 epoch_Time:23317.0min: [2024-01-06 20:02:07,185][model8_pretrain.py][INFO] Epoch:[0/2](926900/4588595) loss:2.566 lr:0.0000100 epoch_Time:23317.0min: [2024-01-06 20:02:07,186][model8_pretrain.py][INFO] Epoch:[0/2](926900/4588595) loss:2.094 lr:0.0000100 epoch_Time:23317.0min: [2024-01-06 20:02:07,186][model8_pretrain.py][INFO] Epoch:[0/2](926900/4588595) loss:2.794 lr:0.0000100 epoch_Time:23317.0min: [2024-01-06 20:02:07,186][model8_pretrain.py][INFO] Epoch:[0/2](926900/4588595) loss:2.481 lr:0.0000100 epoch_Time:23317.0min: [2024-01-06 20:02:07,186][model8_pretrain.py][INFO] Epoch:[0/2](926900/4588595) loss:2.734 lr:0.0000100 epoch_Time:23317.0min: [2024-01-06 20:02:07,186][model8_pretrain.py][INFO] Epoch:[0/2](926900/4588595) loss:2.996 lr:0.0000100 epoch_Time:23317.0min: [2024-01-06 20:02:44,109][model8_pretrain.py][INFO] Epoch:[0/2](927000/4588595) loss:2.845 lr:0.0000100 epoch_Time:23317.0min: [2024-01-06 20:02:44,109][model8_pretrain.py][INFO] Epoch:[0/2](927000/4588595) loss:2.843 lr:0.0000100 epoch_Time:23317.0min: [2024-01-06 20:02:44,109][model8_pretrain.py][INFO] Epoch:[0/2](927000/4588595) loss:3.105 lr:0.0000100 epoch_Time:23317.0min: [2024-01-06 20:02:44,109][model8_pretrain.py][INFO] Epoch:[0/2](927000/4588595) loss:2.788 lr:0.0000100 epoch_Time:23317.0min: [2024-01-06 20:02:44,109][model8_pretrain.py][INFO] Epoch:[0/2](927000/4588595) loss:2.743 lr:0.0000100 epoch_Time:23317.0min: [2024-01-06 20:02:44,109][model8_pretrain.py][INFO] Epoch:[0/2](927000/4588595) loss:2.255 lr:0.0000100 epoch_Time:23317.0min: [2024-01-06 20:02:44,109][model8_pretrain.py][INFO] Epoch:[0/2](927000/4588595) loss:2.786 lr:0.0000100 epoch_Time:23317.0min: [2024-01-06 20:02:44,109][model8_pretrain.py][INFO] Epoch:[0/2](927000/4588595) loss:2.158 lr:0.0000100 epoch_Time:23317.0min: [2024-01-06 20:03:21,068][model8_pretrain.py][INFO] Epoch:[0/2](927100/4588595) loss:2.527 lr:0.0000100 epoch_Time:23316.0min: [2024-01-06 
20:03:21,068][model8_pretrain.py][INFO] Epoch:[0/2](927100/4588595) loss:2.834 lr:0.0000100 epoch_Time:23316.0min: [2024-01-06 20:03:21,068][model8_pretrain.py][INFO] Epoch:[0/2](927100/4588595) loss:2.918 lr:0.0000100 epoch_Time:23316.0min: [2024-01-06 20:03:21,068][model8_pretrain.py][INFO] Epoch:[0/2](927100/4588595) loss:2.298 lr:0.0000100 epoch_Time:23316.0min: [2024-01-06 20:03:21,068][model8_pretrain.py][INFO] Epoch:[0/2](927100/4588595) loss:2.035 lr:0.0000100 epoch_Time:23316.0min: [2024-01-06 20:03:21,068][model8_pretrain.py][INFO] Epoch:[0/2](927100/4588595) loss:2.839 lr:0.0000100 epoch_Time:23316.0min: [2024-01-06 20:03:21,069][model8_pretrain.py][INFO] Epoch:[0/2](927100/4588595) loss:3.246 lr:0.0000100 epoch_Time:23316.0min: [2024-01-06 20:03:21,068][model8_pretrain.py][INFO] Epoch:[0/2](927100/4588595) loss:2.817 lr:0.0000100 epoch_Time:23316.0min: [2024-01-06 20:03:58,020][model8_pretrain.py][INFO] Epoch:[0/2](927200/4588595) loss:2.521 lr:0.0000100 epoch_Time:23314.0min: [2024-01-06 20:03:58,020][model8_pretrain.py][INFO] Epoch:[0/2](927200/4588595) loss:2.558 lr:0.0000100 epoch_Time:23314.0min: [2024-01-06 20:03:58,020][model8_pretrain.py][INFO] Epoch:[0/2](927200/4588595) loss:2.463 lr:0.0000100 epoch_Time:23314.0min: [2024-01-06 20:03:58,020][model8_pretrain.py][INFO] Epoch:[0/2](927200/4588595) loss:2.978 lr:0.0000100 epoch_Time:23314.0min: [2024-01-06 20:03:58,020][model8_pretrain.py][INFO] Epoch:[0/2](927200/4588595) loss:2.748 lr:0.0000100 epoch_Time:23314.0min: [2024-01-06 20:03:58,020][model8_pretrain.py][INFO] Epoch:[0/2](927200/4588595) loss:2.700 lr:0.0000100 epoch_Time:23314.0min: [2024-01-06 20:03:58,020][model8_pretrain.py][INFO] Epoch:[0/2](927200/4588595) loss:2.765 lr:0.0000100 epoch_Time:23314.0min: [2024-01-06 20:03:58,020][model8_pretrain.py][INFO] Epoch:[0/2](927200/4588595) loss:2.657 lr:0.0000100 epoch_Time:23314.0min: [2024-01-06 20:04:34,968][model8_pretrain.py][INFO] Epoch:[0/2](927300/4588595) loss:2.413 lr:0.0000100 
epoch_Time:23314.0min: [2024-01-06 20:04:34,968][model8_pretrain.py][INFO] Epoch:[0/2](927300/4588595) loss:2.343 lr:0.0000100 epoch_Time:23314.0min: [2024-01-06 20:04:34,968][model8_pretrain.py][INFO] Epoch:[0/2](927300/4588595) loss:2.572 lr:0.0000100 epoch_Time:23314.0min: [2024-01-06 20:04:34,968][model8_pretrain.py][INFO] Epoch:[0/2](927300/4588595) loss:2.264 lr:0.0000100 epoch_Time:23314.0min: [2024-01-06 20:04:34,968][model8_pretrain.py][INFO] Epoch:[0/2](927300/4588595) loss:2.804 lr:0.0000100 epoch_Time:23314.0min: [2024-01-06 20:04:34,968][model8_pretrain.py][INFO] Epoch:[0/2](927300/4588595) loss:2.792 lr:0.0000100 epoch_Time:23314.0min: [2024-01-06 20:04:34,968][model8_pretrain.py][INFO] Epoch:[0/2](927300/4588595) loss:3.183 lr:0.0000100 epoch_Time:23314.0min: [2024-01-06 20:04:34,968][model8_pretrain.py][INFO] Epoch:[0/2](927300/4588595) loss:2.710 lr:0.0000100 epoch_Time:23314.0min: [2024-01-06 20:05:25,732][model8_pretrain.py][INFO] Epoch:[0/2](927400/4588595) loss:2.770 lr:0.0000100 epoch_Time:23314.0min: [2024-01-06 20:05:25,732][model8_pretrain.py][INFO] Epoch:[0/2](927400/4588595) loss:2.650 lr:0.0000100 epoch_Time:23314.0min: [2024-01-06 20:05:25,732][model8_pretrain.py][INFO] Epoch:[0/2](927400/4588595) loss:2.313 lr:0.0000100 epoch_Time:23314.0min: [2024-01-06 20:05:25,732][model8_pretrain.py][INFO] Epoch:[0/2](927400/4588595) loss:2.318 lr:0.0000100 epoch_Time:23314.0min: [2024-01-06 20:05:25,732][model8_pretrain.py][INFO] Epoch:[0/2](927400/4588595) loss:2.984 lr:0.0000100 epoch_Time:23314.0min: [2024-01-06 20:05:25,732][model8_pretrain.py][INFO] Epoch:[0/2](927400/4588595) loss:2.529 lr:0.0000100 epoch_Time:23314.0min: [2024-01-06 20:05:25,732][model8_pretrain.py][INFO] Epoch:[0/2](927400/4588595) loss:3.173 lr:0.0000100 epoch_Time:23314.0min: [2024-01-06 20:05:25,732][model8_pretrain.py][INFO] Epoch:[0/2](927400/4588595) loss:3.450 lr:0.0000100 epoch_Time:23314.0min: [2024-01-06 20:06:02,705][model8_pretrain.py][INFO] 
Epoch:[0/2](927500/4588595) loss:2.595 lr:0.0000100 epoch_Time:23313.0min: [2024-01-06 20:06:02,705][model8_pretrain.py][INFO] Epoch:[0/2](927500/4588595) loss:2.442 lr:0.0000100 epoch_Time:23313.0min: [2024-01-06 20:06:02,705][model8_pretrain.py][INFO] Epoch:[0/2](927500/4588595) loss:3.067 lr:0.0000100 epoch_Time:23313.0min: [2024-01-06 20:06:02,705][model8_pretrain.py][INFO] Epoch:[0/2](927500/4588595) loss:2.664 lr:0.0000100 epoch_Time:23313.0min: [2024-01-06 20:06:02,705][model8_pretrain.py][INFO] Epoch:[0/2](927500/4588595) loss:2.326 lr:0.0000100 epoch_Time:23313.0min: [2024-01-06 20:06:02,705][model8_pretrain.py][INFO] Epoch:[0/2](927500/4588595) loss:2.906 lr:0.0000100 epoch_Time:23313.0min: [2024-01-06 20:06:02,705][model8_pretrain.py][INFO] Epoch:[0/2](927500/4588595) loss:2.663 lr:0.0000100 epoch_Time:23313.0min: [2024-01-06 20:06:02,705][model8_pretrain.py][INFO] Epoch:[0/2](927500/4588595) loss:2.692 lr:0.0000100 epoch_Time:23313.0min: [2024-01-06 20:06:39,676][model8_pretrain.py][INFO] Epoch:[0/2](927600/4588595) loss:2.867 lr:0.0000100 epoch_Time:23313.0min: [2024-01-06 20:06:39,676][model8_pretrain.py][INFO] Epoch:[0/2](927600/4588595) loss:2.942 lr:0.0000100 epoch_Time:23313.0min: [2024-01-06 20:06:39,676][model8_pretrain.py][INFO] Epoch:[0/2](927600/4588595) loss:3.133 lr:0.0000100 epoch_Time:23313.0min: [2024-01-06 20:06:39,676][model8_pretrain.py][INFO] Epoch:[0/2](927600/4588595) loss:2.740 lr:0.0000100 epoch_Time:23313.0min: [2024-01-06 20:06:39,676][model8_pretrain.py][INFO] Epoch:[0/2](927600/4588595) loss:2.450 lr:0.0000100 epoch_Time:23313.0min: [2024-01-06 20:06:39,676][model8_pretrain.py][INFO] Epoch:[0/2](927600/4588595) loss:2.637 lr:0.0000100 epoch_Time:23313.0min: [2024-01-06 20:06:39,676][model8_pretrain.py][INFO] Epoch:[0/2](927600/4588595) loss:2.851 lr:0.0000100 epoch_Time:23313.0min: [2024-01-06 20:06:39,676][model8_pretrain.py][INFO] Epoch:[0/2](927600/4588595) loss:2.564 lr:0.0000100 epoch_Time:23313.0min: [2024-01-06 
20:07:16,641][model8_pretrain.py][INFO] Epoch:[0/2](927700/4588595) loss:2.943 lr:0.0000100 epoch_Time:23312.0min: [2024-01-06 20:07:16,641][model8_pretrain.py][INFO] Epoch:[0/2](927700/4588595) loss:2.923 lr:0.0000100 epoch_Time:23312.0min: [2024-01-06 20:07:16,641][model8_pretrain.py][INFO] Epoch:[0/2](927700/4588595) loss:2.460 lr:0.0000100 epoch_Time:23312.0min: [2024-01-06 20:07:16,641][model8_pretrain.py][INFO] Epoch:[0/2](927700/4588595) loss:3.489 lr:0.0000100 epoch_Time:23312.0min: [2024-01-06 20:07:16,641][model8_pretrain.py][INFO] Epoch:[0/2](927700/4588595) loss:2.619 lr:0.0000100 epoch_Time:23312.0min: [2024-01-06 20:07:16,642][model8_pretrain.py][INFO] Epoch:[0/2](927700/4588595) loss:3.521 lr:0.0000100 epoch_Time:23312.0min: [2024-01-06 20:07:16,642][model8_pretrain.py][INFO] Epoch:[0/2](927700/4588595) loss:2.760 lr:0.0000100 epoch_Time:23312.0min: [2024-01-06 20:07:16,642][model8_pretrain.py][INFO] Epoch:[0/2](927700/4588595) loss:2.702 lr:0.0000100 epoch_Time:23312.0min: [2024-01-06 20:07:53,602][model8_pretrain.py][INFO] Epoch:[0/2](927800/4588595) loss:3.186 lr:0.0000100 epoch_Time:23311.0min: [2024-01-06 20:07:53,602][model8_pretrain.py][INFO] Epoch:[0/2](927800/4588595) loss:2.679 lr:0.0000100 epoch_Time:23311.0min: [2024-01-06 20:07:53,602][model8_pretrain.py][INFO] Epoch:[0/2](927800/4588595) loss:2.919 lr:0.0000100 epoch_Time:23311.0min: [2024-01-06 20:07:53,602][model8_pretrain.py][INFO] Epoch:[0/2](927800/4588595) loss:2.994 lr:0.0000100 epoch_Time:23311.0min: [2024-01-06 20:07:53,602][model8_pretrain.py][INFO] Epoch:[0/2](927800/4588595) loss:3.009 lr:0.0000100 epoch_Time:23311.0min: [2024-01-06 20:07:53,603][model8_pretrain.py][INFO] Epoch:[0/2](927800/4588595) loss:2.780 lr:0.0000100 epoch_Time:23311.0min: [2024-01-06 20:07:53,603][model8_pretrain.py][INFO] Epoch:[0/2](927800/4588595) loss:3.059 lr:0.0000100 epoch_Time:23311.0min: [2024-01-06 20:07:53,603][model8_pretrain.py][INFO] Epoch:[0/2](927800/4588595) loss:3.025 lr:0.0000100 
epoch_Time:23311.0min: [2024-01-06 20:08:30,601][model8_pretrain.py][INFO] Epoch:[0/2](927900/4588595) loss:1.567 lr:0.0000100 epoch_Time:23311.0min: [2024-01-06 20:08:30,601][model8_pretrain.py][INFO] Epoch:[0/2](927900/4588595) loss:3.212 lr:0.0000100 epoch_Time:23311.0min: [2024-01-06 20:08:30,601][model8_pretrain.py][INFO] Epoch:[0/2](927900/4588595) loss:3.033 lr:0.0000100 epoch_Time:23311.0min: [2024-01-06 20:08:30,601][model8_pretrain.py][INFO] Epoch:[0/2](927900/4588595) loss:3.091 lr:0.0000100 epoch_Time:23311.0min: [2024-01-06 20:08:30,601][model8_pretrain.py][INFO] Epoch:[0/2](927900/4588595) loss:2.030 lr:0.0000100 epoch_Time:23311.0min: [2024-01-06 20:08:30,601][model8_pretrain.py][INFO] Epoch:[0/2](927900/4588595) loss:2.786 lr:0.0000100 epoch_Time:23311.0min: [2024-01-06 20:08:30,601][model8_pretrain.py][INFO] Epoch:[0/2](927900/4588595) loss:2.234 lr:0.0000100 epoch_Time:23311.0min: [2024-01-06 20:08:30,601][model8_pretrain.py][INFO] Epoch:[0/2](927900/4588595) loss:2.247 lr:0.0000100 epoch_Time:23311.0min: [2024-01-06 20:09:07,617][model8_pretrain.py][INFO] Epoch:[0/2](928000/4588595) loss:2.875 lr:0.0000100 epoch_Time:23310.0min: [2024-01-06 20:09:07,617][model8_pretrain.py][INFO] Epoch:[0/2](928000/4588595) loss:3.101 lr:0.0000100 epoch_Time:23310.0min: [2024-01-06 20:09:07,617][model8_pretrain.py][INFO] Epoch:[0/2](928000/4588595) loss:3.051 lr:0.0000100 epoch_Time:23310.0min: [2024-01-06 20:09:07,618][model8_pretrain.py][INFO] Epoch:[0/2](928000/4588595) loss:2.522 lr:0.0000100 epoch_Time:23310.0min: [2024-01-06 20:09:07,618][model8_pretrain.py][INFO] Epoch:[0/2](928000/4588595) loss:2.834 lr:0.0000100 epoch_Time:23310.0min: [2024-01-06 20:09:07,618][model8_pretrain.py][INFO] Epoch:[0/2](928000/4588595) loss:2.739 lr:0.0000100 epoch_Time:23310.0min: [2024-01-06 20:09:07,618][model8_pretrain.py][INFO] Epoch:[0/2](928000/4588595) loss:3.087 lr:0.0000100 epoch_Time:23310.0min: [2024-01-06 20:09:07,618][model8_pretrain.py][INFO] 
Epoch:[0/2](928000/4588595) loss:3.021 lr:0.0000100 epoch_Time:23310.0min: [2024-01-06 20:09:44,595][model8_pretrain.py][INFO] Epoch:[0/2](928100/4588595) loss:2.780 lr:0.0000100 epoch_Time:23310.0min: [2024-01-06 20:09:44,595][model8_pretrain.py][INFO] Epoch:[0/2](928100/4588595) loss:2.118 lr:0.0000100 epoch_Time:23310.0min: [2024-01-06 20:09:44,595][model8_pretrain.py][INFO] Epoch:[0/2](928100/4588595) loss:2.672 lr:0.0000100 epoch_Time:23310.0min: [2024-01-06 20:09:44,595][model8_pretrain.py][INFO] Epoch:[0/2](928100/4588595) loss:2.749 lr:0.0000100 epoch_Time:23310.0min: [2024-01-06 20:09:44,595][model8_pretrain.py][INFO] Epoch:[0/2](928100/4588595) loss:2.729 lr:0.0000100 epoch_Time:23310.0min: [2024-01-06 20:09:44,595][model8_pretrain.py][INFO] Epoch:[0/2](928100/4588595) loss:2.673 lr:0.0000100 epoch_Time:23310.0min: [2024-01-06 20:09:44,595][model8_pretrain.py][INFO] Epoch:[0/2](928100/4588595) loss:2.844 lr:0.0000100 epoch_Time:23310.0min: [2024-01-06 20:09:44,595][model8_pretrain.py][INFO] Epoch:[0/2](928100/4588595) loss:2.936 lr:0.0000100 epoch_Time:23310.0min: [2024-01-06 20:10:35,370][model8_pretrain.py][INFO] Epoch:[0/2](928200/4588595) loss:3.474 lr:0.0000100 epoch_Time:23310.0min: [2024-01-06 20:10:35,370][model8_pretrain.py][INFO] Epoch:[0/2](928200/4588595) loss:3.098 lr:0.0000100 epoch_Time:23310.0min: [2024-01-06 20:10:35,370][model8_pretrain.py][INFO] Epoch:[0/2](928200/4588595) loss:3.050 lr:0.0000100 epoch_Time:23310.0min: [2024-01-06 20:10:35,370][model8_pretrain.py][INFO] Epoch:[0/2](928200/4588595) loss:2.639 lr:0.0000100 epoch_Time:23310.0min: [2024-01-06 20:10:35,370][model8_pretrain.py][INFO] Epoch:[0/2](928200/4588595) loss:3.218 lr:0.0000100 epoch_Time:23310.0min: [2024-01-06 20:10:35,370][model8_pretrain.py][INFO] Epoch:[0/2](928200/4588595) loss:2.835 lr:0.0000100 epoch_Time:23310.0min: [2024-01-06 20:10:35,370][model8_pretrain.py][INFO] Epoch:[0/2](928200/4588595) loss:3.485 lr:0.0000100 epoch_Time:23310.0min: [2024-01-06 
20:10:35,370][model8_pretrain.py][INFO] Epoch:[0/2](928200/4588595) loss:2.758 lr:0.0000100 epoch_Time:23310.0min: [2024-01-06 20:11:12,310][model8_pretrain.py][INFO] Epoch:[0/2](928300/4588595) loss:3.406 lr:0.0000100 epoch_Time:23309.0min: [2024-01-06 20:11:12,310][model8_pretrain.py][INFO] Epoch:[0/2](928300/4588595) loss:2.971 lr:0.0000100 epoch_Time:23309.0min: [2024-01-06 20:11:12,310][model8_pretrain.py][INFO] Epoch:[0/2](928300/4588595) loss:2.717 lr:0.0000100 epoch_Time:23309.0min: [2024-01-06 20:11:12,310][model8_pretrain.py][INFO] Epoch:[0/2](928300/4588595) loss:2.989 lr:0.0000100 epoch_Time:23309.0min: [2024-01-06 20:11:12,310][model8_pretrain.py][INFO] Epoch:[0/2](928300/4588595) loss:3.056 lr:0.0000100 epoch_Time:23309.0min: [2024-01-06 20:11:12,310][model8_pretrain.py][INFO] Epoch:[0/2](928300/4588595) loss:2.816 lr:0.0000100 epoch_Time:23309.0min: [2024-01-06 20:11:12,310][model8_pretrain.py][INFO] Epoch:[0/2](928300/4588595) loss:3.393 lr:0.0000100 epoch_Time:23309.0min: [2024-01-06 20:11:12,310][model8_pretrain.py][INFO] Epoch:[0/2](928300/4588595) loss:2.973 lr:0.0000100 epoch_Time:23309.0min: [2024-01-06 20:11:49,255][model8_pretrain.py][INFO] Epoch:[0/2](928400/4588595) loss:2.734 lr:0.0000100 epoch_Time:23308.0min: [2024-01-06 20:11:49,256][model8_pretrain.py][INFO] Epoch:[0/2](928400/4588595) loss:2.919 lr:0.0000100 epoch_Time:23308.0min: [2024-01-06 20:11:49,256][model8_pretrain.py][INFO] Epoch:[0/2](928400/4588595) loss:2.449 lr:0.0000100 epoch_Time:23308.0min: [2024-01-06 20:11:49,256][model8_pretrain.py][INFO] Epoch:[0/2](928400/4588595) loss:2.656 lr:0.0000100 epoch_Time:23308.0min: [2024-01-06 20:11:49,256][model8_pretrain.py][INFO] Epoch:[0/2](928400/4588595) loss:3.245 lr:0.0000100 epoch_Time:23308.0min: [2024-01-06 20:11:49,256][model8_pretrain.py][INFO] Epoch:[0/2](928400/4588595) loss:2.609 lr:0.0000100 epoch_Time:23308.0min: [2024-01-06 20:11:49,256][model8_pretrain.py][INFO] Epoch:[0/2](928400/4588595) loss:2.770 lr:0.0000100 
epoch_Time:23308.0min: [2024-01-06 20:11:49,256][model8_pretrain.py][INFO] Epoch:[0/2](928400/4588595) loss:2.920 lr:0.0000100 epoch_Time:23308.0min: [2024-01-06 20:12:26,213][model8_pretrain.py][INFO] Epoch:[0/2](928500/4588595) loss:2.565 lr:0.0000100 epoch_Time:23307.0min: [2024-01-06 20:12:26,213][model8_pretrain.py][INFO] Epoch:[0/2](928500/4588595) loss:2.864 lr:0.0000100 epoch_Time:23307.0min: [2024-01-06 20:12:26,213][model8_pretrain.py][INFO] Epoch:[0/2](928500/4588595) loss:2.872 lr:0.0000100 epoch_Time:23307.0min: [2024-01-06 20:12:26,213][model8_pretrain.py][INFO] Epoch:[0/2](928500/4588595) loss:2.894 lr:0.0000100 epoch_Time:23307.0min: [2024-01-06 20:12:26,213][model8_pretrain.py][INFO] Epoch:[0/2](928500/4588595) loss:2.747 lr:0.0000100 epoch_Time:23307.0min: [2024-01-06 20:12:26,213][model8_pretrain.py][INFO] Epoch:[0/2](928500/4588595) loss:2.819 lr:0.0000100 epoch_Time:23307.0min: [2024-01-06 20:12:26,213][model8_pretrain.py][INFO] Epoch:[0/2](928500/4588595) loss:2.757 lr:0.0000100 epoch_Time:23307.0min: [2024-01-06 20:12:26,213][model8_pretrain.py][INFO] Epoch:[0/2](928500/4588595) loss:2.157 lr:0.0000100 epoch_Time:23307.0min: [2024-01-06 20:13:03,186][model8_pretrain.py][INFO] Epoch:[0/2](928600/4588595) loss:3.200 lr:0.0000100 epoch_Time:23306.0min: [2024-01-06 20:13:03,186][model8_pretrain.py][INFO] Epoch:[0/2](928600/4588595) loss:2.958 lr:0.0000100 epoch_Time:23306.0min: [2024-01-06 20:13:03,186][model8_pretrain.py][INFO] Epoch:[0/2](928600/4588595) loss:2.617 lr:0.0000100 epoch_Time:23306.0min: [2024-01-06 20:13:03,186][model8_pretrain.py][INFO] Epoch:[0/2](928600/4588595) loss:2.285 lr:0.0000100 epoch_Time:23306.0min: [2024-01-06 20:13:03,186][model8_pretrain.py][INFO] Epoch:[0/2](928600/4588595) loss:2.728 lr:0.0000100 epoch_Time:23306.0min: [2024-01-06 20:13:03,186][model8_pretrain.py][INFO] Epoch:[0/2](928600/4588595) loss:2.709 lr:0.0000100 epoch_Time:23306.0min: [2024-01-06 20:13:03,186][model8_pretrain.py][INFO] 
Epoch:[0/2](928600/4588595) loss:2.412 lr:0.0000100 epoch_Time:23306.0min: [2024-01-06 20:13:03,186][model8_pretrain.py][INFO] Epoch:[0/2](928600/4588595) loss:2.809 lr:0.0000100 epoch_Time:23306.0min: [2024-01-06 20:13:40,174][model8_pretrain.py][INFO] Epoch:[0/2](928700/4588595) loss:2.856 lr:0.0000100 epoch_Time:23306.0min: [2024-01-06 20:13:40,174][model8_pretrain.py][INFO] Epoch:[0/2](928700/4588595) loss:2.431 lr:0.0000100 epoch_Time:23306.0min: [2024-01-06 20:13:40,174][model8_pretrain.py][INFO] Epoch:[0/2](928700/4588595) loss:2.948 lr:0.0000100 epoch_Time:23306.0min: [2024-01-06 20:13:40,174][model8_pretrain.py][INFO] Epoch:[0/2](928700/4588595) loss:3.132 lr:0.0000100 epoch_Time:23306.0min: [2024-01-06 20:13:40,174][model8_pretrain.py][INFO] Epoch:[0/2](928700/4588595) loss:2.536 lr:0.0000100 epoch_Time:23306.0min: [2024-01-06 20:13:40,174][model8_pretrain.py][INFO] Epoch:[0/2](928700/4588595) loss:2.764 lr:0.0000100 epoch_Time:23306.0min: [2024-01-06 20:13:40,174][model8_pretrain.py][INFO] Epoch:[0/2](928700/4588595) loss:2.533 lr:0.0000100 epoch_Time:23306.0min: [2024-01-06 20:13:40,175][model8_pretrain.py][INFO] Epoch:[0/2](928700/4588595) loss:2.910 lr:0.0000100 epoch_Time:23306.0min: [2024-01-06 20:14:17,155][model8_pretrain.py][INFO] Epoch:[0/2](928800/4588595) loss:2.405 lr:0.0000100 epoch_Time:23305.0min: [2024-01-06 20:14:17,155][model8_pretrain.py][INFO] Epoch:[0/2](928800/4588595) loss:2.872 lr:0.0000100 epoch_Time:23305.0min: [2024-01-06 20:14:17,155][model8_pretrain.py][INFO] Epoch:[0/2](928800/4588595) loss:2.622 lr:0.0000100 epoch_Time:23305.0min: [2024-01-06 20:14:17,155][model8_pretrain.py][INFO] Epoch:[0/2](928800/4588595) loss:3.039 lr:0.0000100 epoch_Time:23305.0min: [2024-01-06 20:14:17,155][model8_pretrain.py][INFO] Epoch:[0/2](928800/4588595) loss:3.075 lr:0.0000100 epoch_Time:23305.0min: [2024-01-06 20:14:17,155][model8_pretrain.py][INFO] Epoch:[0/2](928800/4588595) loss:3.013 lr:0.0000100 epoch_Time:23305.0min: [2024-01-06 
20:14:17,155][model8_pretrain.py][INFO] Epoch:[0/2](928800/4588595) loss:3.161 lr:0.0000100 epoch_Time:23305.0min: [2024-01-06 20:14:17,156][model8_pretrain.py][INFO] Epoch:[0/2](928800/4588595) loss:2.975 lr:0.0000100 epoch_Time:23305.0min: [2024-01-06 20:14:54,125][model8_pretrain.py][INFO] Epoch:[0/2](928900/4588595) loss:3.556 lr:0.0000100 epoch_Time:23304.0min: [2024-01-06 20:14:54,125][model8_pretrain.py][INFO] Epoch:[0/2](928900/4588595) loss:2.842 lr:0.0000100 epoch_Time:23304.0min: [2024-01-06 20:14:54,125][model8_pretrain.py][INFO] Epoch:[0/2](928900/4588595) loss:2.692 lr:0.0000100 epoch_Time:23304.0min: [2024-01-06 20:14:54,125][model8_pretrain.py][INFO] Epoch:[0/2](928900/4588595) loss:2.668 lr:0.0000100 epoch_Time:23304.0min: [2024-01-06 20:14:54,125][model8_pretrain.py][INFO] Epoch:[0/2](928900/4588595) loss:2.741 lr:0.0000100 epoch_Time:23304.0min: [2024-01-06 20:14:54,125][model8_pretrain.py][INFO] Epoch:[0/2](928900/4588595) loss:2.995 lr:0.0000100 epoch_Time:23304.0min: [2024-01-06 20:14:54,125][model8_pretrain.py][INFO] Epoch:[0/2](928900/4588595) loss:2.461 lr:0.0000100 epoch_Time:23304.0min: [2024-01-06 20:14:54,125][model8_pretrain.py][INFO] Epoch:[0/2](928900/4588595) loss:2.463 lr:0.0000100 epoch_Time:23304.0min: [2024-01-06 20:15:44,957][model8_pretrain.py][INFO] Epoch:[0/2](929000/4588595) loss:3.364 lr:0.0000100 epoch_Time:23305.0min: [2024-01-06 20:15:44,957][model8_pretrain.py][INFO] Epoch:[0/2](929000/4588595) loss:2.671 lr:0.0000100 epoch_Time:23305.0min: [2024-01-06 20:15:44,957][model8_pretrain.py][INFO] Epoch:[0/2](929000/4588595) loss:2.991 lr:0.0000100 epoch_Time:23305.0min: [2024-01-06 20:15:44,957][model8_pretrain.py][INFO] Epoch:[0/2](929000/4588595) loss:2.729 lr:0.0000100 epoch_Time:23305.0min: [2024-01-06 20:15:44,957][model8_pretrain.py][INFO] Epoch:[0/2](929000/4588595) loss:2.933 lr:0.0000100 epoch_Time:23305.0min: [2024-01-06 20:15:44,957][model8_pretrain.py][INFO] Epoch:[0/2](929000/4588595) loss:3.275 lr:0.0000100 
epoch_Time:23305.0min: [2024-01-06 20:15:44,957][model8_pretrain.py][INFO] Epoch:[0/2](929000/4588595) loss:2.603 lr:0.0000100 epoch_Time:23305.0min: [2024-01-06 20:15:44,957][model8_pretrain.py][INFO] Epoch:[0/2](929000/4588595) loss:2.765 lr:0.0000100 epoch_Time:23305.0min: [2024-01-06 20:16:21,889][model8_pretrain.py][INFO] Epoch:[0/2](929100/4588595) loss:3.167 lr:0.0000100 epoch_Time:23304.0min: [2024-01-06 20:16:21,889][model8_pretrain.py][INFO] Epoch:[0/2](929100/4588595) loss:2.836 lr:0.0000100 epoch_Time:23304.0min: [2024-01-06 20:16:21,889][model8_pretrain.py][INFO] Epoch:[0/2](929100/4588595) loss:2.440 lr:0.0000100 epoch_Time:23304.0min: [2024-01-06 20:16:21,889][model8_pretrain.py][INFO] Epoch:[0/2](929100/4588595) loss:2.533 lr:0.0000100 epoch_Time:23304.0min: [2024-01-06 20:16:21,889][model8_pretrain.py][INFO] Epoch:[0/2](929100/4588595) loss:3.236 lr:0.0000100 epoch_Time:23304.0min: [2024-01-06 20:16:21,889][model8_pretrain.py][INFO] Epoch:[0/2](929100/4588595) loss:2.837 lr:0.0000100 epoch_Time:23304.0min: [2024-01-06 20:16:21,889][model8_pretrain.py][INFO] Epoch:[0/2](929100/4588595) loss:3.031 lr:0.0000100 epoch_Time:23304.0min: [2024-01-06 20:16:21,889][model8_pretrain.py][INFO] Epoch:[0/2](929100/4588595) loss:2.769 lr:0.0000100 epoch_Time:23304.0min: [2024-01-06 20:16:58,841][model8_pretrain.py][INFO] Epoch:[0/2](929200/4588595) loss:2.451 lr:0.0000100 epoch_Time:23303.0min: [2024-01-06 20:16:58,841][model8_pretrain.py][INFO] Epoch:[0/2](929200/4588595) loss:3.017 lr:0.0000100 epoch_Time:23303.0min: [2024-01-06 20:16:58,841][model8_pretrain.py][INFO] Epoch:[0/2](929200/4588595) loss:2.757 lr:0.0000100 epoch_Time:23303.0min: [2024-01-06 20:16:58,841][model8_pretrain.py][INFO] Epoch:[0/2](929200/4588595) loss:2.812 lr:0.0000100 epoch_Time:23303.0min: [2024-01-06 20:16:58,841][model8_pretrain.py][INFO] Epoch:[0/2](929200/4588595) loss:2.893 lr:0.0000100 epoch_Time:23303.0min: [2024-01-06 20:16:58,841][model8_pretrain.py][INFO] 
Epoch:[0/2](929200/4588595) loss:3.107 lr:0.0000100 epoch_Time:23303.0min: [2024-01-06 20:16:58,841][model8_pretrain.py][INFO] Epoch:[0/2](929200/4588595) loss:3.140 lr:0.0000100 epoch_Time:23303.0min: [2024-01-06 20:16:58,841][model8_pretrain.py][INFO] Epoch:[0/2](929200/4588595) loss:2.493 lr:0.0000100 epoch_Time:23303.0min: [2024-01-06 20:17:35,801][model8_pretrain.py][INFO] Epoch:[0/2](929300/4588595) loss:2.792 lr:0.0000100 epoch_Time:23303.0min: [2024-01-06 20:17:35,801][model8_pretrain.py][INFO] Epoch:[0/2](929300/4588595) loss:2.855 lr:0.0000100 epoch_Time:23303.0min: [2024-01-06 20:17:35,801][model8_pretrain.py][INFO] Epoch:[0/2](929300/4588595) loss:2.439 lr:0.0000100 epoch_Time:23303.0min: [2024-01-06 20:17:35,801][model8_pretrain.py][INFO] Epoch:[0/2](929300/4588595) loss:2.938 lr:0.0000100 epoch_Time:23303.0min: [2024-01-06 20:17:35,801][model8_pretrain.py][INFO] Epoch:[0/2](929300/4588595) loss:2.471 lr:0.0000100 epoch_Time:23303.0min: [2024-01-06 20:17:35,801][model8_pretrain.py][INFO] Epoch:[0/2](929300/4588595) loss:2.845 lr:0.0000100 epoch_Time:23303.0min: [2024-01-06 20:17:35,801][model8_pretrain.py][INFO] Epoch:[0/2](929300/4588595) loss:2.845 lr:0.0000100 epoch_Time:23303.0min: [2024-01-06 20:17:35,802][model8_pretrain.py][INFO] Epoch:[0/2](929300/4588595) loss:3.007 lr:0.0000100 epoch_Time:23303.0min: [2024-01-06 20:18:12,760][model8_pretrain.py][INFO] Epoch:[0/2](929400/4588595) loss:2.704 lr:0.0000100 epoch_Time:23302.0min: [2024-01-06 20:18:12,760][model8_pretrain.py][INFO] Epoch:[0/2](929400/4588595) loss:3.260 lr:0.0000100 epoch_Time:23302.0min: [2024-01-06 20:18:12,760][model8_pretrain.py][INFO] Epoch:[0/2](929400/4588595) loss:2.802 lr:0.0000100 epoch_Time:23302.0min: [2024-01-06 20:18:12,760][model8_pretrain.py][INFO] Epoch:[0/2](929400/4588595) loss:2.432 lr:0.0000100 epoch_Time:23302.0min: [2024-01-06 20:18:12,760][model8_pretrain.py][INFO] Epoch:[0/2](929400/4588595) loss:2.782 lr:0.0000100 epoch_Time:23302.0min: [2024-01-06 
20:18:12,760][model8_pretrain.py][INFO] Epoch:[0/2](929400/4588595) loss:2.678 lr:0.0000100 epoch_Time:23302.0min: [2024-01-06 20:18:12,760][model8_pretrain.py][INFO] Epoch:[0/2](929400/4588595) loss:1.995 lr:0.0000100 epoch_Time:23302.0min: [2024-01-06 20:18:12,760][model8_pretrain.py][INFO] Epoch:[0/2](929400/4588595) loss:2.479 lr:0.0000100 epoch_Time:23302.0min: [2024-01-06 20:18:49,717][model8_pretrain.py][INFO] Epoch:[0/2](929500/4588595) loss:2.228 lr:0.0000100 epoch_Time:23301.0min: [2024-01-06 20:18:49,717][model8_pretrain.py][INFO] Epoch:[0/2](929500/4588595) loss:2.417 lr:0.0000100 epoch_Time:23301.0min: [2024-01-06 20:18:49,717][model8_pretrain.py][INFO] Epoch:[0/2](929500/4588595) loss:2.106 lr:0.0000100 epoch_Time:23301.0min: [2024-01-06 20:18:49,717][model8_pretrain.py][INFO] Epoch:[0/2](929500/4588595) loss:2.751 lr:0.0000100 epoch_Time:23301.0min: [2024-01-06 20:18:49,717][model8_pretrain.py][INFO] Epoch:[0/2](929500/4588595) loss:2.778 lr:0.0000100 epoch_Time:23301.0min: [2024-01-06 20:18:49,717][model8_pretrain.py][INFO] Epoch:[0/2](929500/4588595) loss:2.253 lr:0.0000100 epoch_Time:23301.0min: [2024-01-06 20:18:49,717][model8_pretrain.py][INFO] Epoch:[0/2](929500/4588595) loss:2.593 lr:0.0000100 epoch_Time:23301.0min: [2024-01-06 20:18:49,717][model8_pretrain.py][INFO] Epoch:[0/2](929500/4588595) loss:2.925 lr:0.0000100 epoch_Time:23301.0min: [2024-01-06 20:19:26,668][model8_pretrain.py][INFO] Epoch:[0/2](929600/4588595) loss:3.012 lr:0.0000100 epoch_Time:23300.0min: [2024-01-06 20:19:26,668][model8_pretrain.py][INFO] Epoch:[0/2](929600/4588595) loss:3.105 lr:0.0000100 epoch_Time:23300.0min: [2024-01-06 20:19:26,668][model8_pretrain.py][INFO] Epoch:[0/2](929600/4588595) loss:3.065 lr:0.0000100 epoch_Time:23300.0min: [2024-01-06 20:19:26,668][model8_pretrain.py][INFO] Epoch:[0/2](929600/4588595) loss:2.425 lr:0.0000100 epoch_Time:23300.0min: [2024-01-06 20:19:26,668][model8_pretrain.py][INFO] Epoch:[0/2](929600/4588595) loss:2.807 lr:0.0000100 
epoch_Time:23300.0min: [2024-01-06 20:19:26,668][model8_pretrain.py][INFO] Epoch:[0/2](929600/4588595) loss:2.958 lr:0.0000100 epoch_Time:23300.0min: [2024-01-06 20:19:26,668][model8_pretrain.py][INFO] Epoch:[0/2](929600/4588595) loss:2.705 lr:0.0000100 epoch_Time:23300.0min: [2024-01-06 20:19:26,668][model8_pretrain.py][INFO] Epoch:[0/2](929600/4588595) loss:2.974 lr:0.0000100 epoch_Time:23300.0min: [2024-01-06 20:20:03,616][model8_pretrain.py][INFO] Epoch:[0/2](929700/4588595) loss:2.931 lr:0.0000100 epoch_Time:23299.0min: [2024-01-06 20:20:03,616][model8_pretrain.py][INFO] Epoch:[0/2](929700/4588595) loss:2.340 lr:0.0000100 epoch_Time:23299.0min: [2024-01-06 20:20:03,616][model8_pretrain.py][INFO] Epoch:[0/2](929700/4588595) loss:2.214 lr:0.0000100 epoch_Time:23299.0min: [2024-01-06 20:20:03,616][model8_pretrain.py][INFO] Epoch:[0/2](929700/4588595) loss:2.937 lr:0.0000100 epoch_Time:23299.0min: [2024-01-06 20:20:03,616][model8_pretrain.py][INFO] Epoch:[0/2](929700/4588595) loss:3.092 lr:0.0000100 epoch_Time:23299.0min: [2024-01-06 20:20:03,616][model8_pretrain.py][INFO] Epoch:[0/2](929700/4588595) loss:3.435 lr:0.0000100 epoch_Time:23299.0min: [2024-01-06 20:20:03,616][model8_pretrain.py][INFO] Epoch:[0/2](929700/4588595) loss:2.648 lr:0.0000100 epoch_Time:23299.0min: [2024-01-06 20:20:03,616][model8_pretrain.py][INFO] Epoch:[0/2](929700/4588595) loss:2.697 lr:0.0000100 epoch_Time:23299.0min: [2024-01-06 20:20:54,411][model8_pretrain.py][INFO] Epoch:[0/2](929800/4588595) loss:3.043 lr:0.0000100 epoch_Time:23299.0min: [2024-01-06 20:20:54,411][model8_pretrain.py][INFO] Epoch:[0/2](929800/4588595) loss:2.585 lr:0.0000100 epoch_Time:23299.0min: [2024-01-06 20:20:54,411][model8_pretrain.py][INFO] Epoch:[0/2](929800/4588595) loss:2.626 lr:0.0000100 epoch_Time:23299.0min: [2024-01-06 20:20:54,411][model8_pretrain.py][INFO] Epoch:[0/2](929800/4588595) loss:3.123 lr:0.0000100 epoch_Time:23299.0min: [2024-01-06 20:20:54,411][model8_pretrain.py][INFO] 
Epoch:[0/2](929800/4588595) loss:2.440 lr:0.0000100 epoch_Time:23299.0min: [2024-01-06 20:20:54,411][model8_pretrain.py][INFO] Epoch:[0/2](929800/4588595) loss:2.712 lr:0.0000100 epoch_Time:23299.0min: [2024-01-06 20:20:54,411][model8_pretrain.py][INFO] Epoch:[0/2](929800/4588595) loss:2.901 lr:0.0000100 epoch_Time:23299.0min: [2024-01-06 20:20:54,411][model8_pretrain.py][INFO] Epoch:[0/2](929800/4588595) loss:2.674 lr:0.0000100 epoch_Time:23299.0min: [2024-01-06 20:21:31,322][model8_pretrain.py][INFO] Epoch:[0/2](929900/4588595) loss:2.578 lr:0.0000100 epoch_Time:23299.0min: [2024-01-06 20:21:31,322][model8_pretrain.py][INFO] Epoch:[0/2](929900/4588595) loss:2.427 lr:0.0000100 epoch_Time:23299.0min: [2024-01-06 20:21:31,322][model8_pretrain.py][INFO] Epoch:[0/2](929900/4588595) loss:2.686 lr:0.0000100 epoch_Time:23299.0min: [2024-01-06 20:21:31,322][model8_pretrain.py][INFO] Epoch:[0/2](929900/4588595) loss:2.655 lr:0.0000100 epoch_Time:23299.0min: [2024-01-06 20:21:31,322][model8_pretrain.py][INFO] Epoch:[0/2](929900/4588595) loss:2.726 lr:0.0000100 epoch_Time:23299.0min: [2024-01-06 20:21:31,322][model8_pretrain.py][INFO] Epoch:[0/2](929900/4588595) loss:2.617 lr:0.0000100 epoch_Time:23299.0min: [2024-01-06 20:21:31,322][model8_pretrain.py][INFO] Epoch:[0/2](929900/4588595) loss:2.953 lr:0.0000100 epoch_Time:23299.0min: [2024-01-06 20:21:31,322][model8_pretrain.py][INFO] Epoch:[0/2](929900/4588595) loss:2.807 lr:0.0000100 epoch_Time:23299.0min: [2024-01-06 20:22:08,264][model8_pretrain.py][INFO] Epoch:[0/2](930000/4588595) loss:3.277 lr:0.0000100 epoch_Time:23298.0min: [2024-01-06 20:22:08,264][model8_pretrain.py][INFO] Epoch:[0/2](930000/4588595) loss:2.197 lr:0.0000100 epoch_Time:23298.0min: [2024-01-06 20:22:08,264][model8_pretrain.py][INFO] Epoch:[0/2](930000/4588595) loss:2.508 lr:0.0000100 epoch_Time:23298.0min: [2024-01-06 20:22:08,264][model8_pretrain.py][INFO] Epoch:[0/2](930000/4588595) loss:2.916 lr:0.0000100 epoch_Time:23298.0min: [2024-01-06 
20:22:08,264][model8_pretrain.py][INFO] Epoch:[0/2](930000/4588595) loss:2.853 lr:0.0000100 epoch_Time:23298.0min: [2024-01-06 20:22:08,264][model8_pretrain.py][INFO] Epoch:[0/2](930000/4588595) loss:2.255 lr:0.0000100 epoch_Time:23298.0min: [2024-01-06 20:22:08,264][model8_pretrain.py][INFO] Epoch:[0/2](930000/4588595) loss:2.753 lr:0.0000100 epoch_Time:23298.0min: [2024-01-06 20:22:08,264][model8_pretrain.py][INFO] Epoch:[0/2](930000/4588595) loss:2.977 lr:0.0000100 epoch_Time:23298.0min: [2024-01-06 20:22:45,218][model8_pretrain.py][INFO] Epoch:[0/2](930100/4588595) loss:3.073 lr:0.0000100 epoch_Time:23298.0min: [2024-01-06 20:22:45,218][model8_pretrain.py][INFO] Epoch:[0/2](930100/4588595) loss:2.197 lr:0.0000100 epoch_Time:23298.0min: [2024-01-06 20:22:45,218][model8_pretrain.py][INFO] Epoch:[0/2](930100/4588595) loss:3.061 lr:0.0000100 epoch_Time:23298.0min: [2024-01-06 20:22:45,218][model8_pretrain.py][INFO] Epoch:[0/2](930100/4588595) loss:3.049 lr:0.0000100 epoch_Time:23298.0min: [2024-01-06 20:22:45,218][model8_pretrain.py][INFO] Epoch:[0/2](930100/4588595) loss:2.816 lr:0.0000100 epoch_Time:23298.0min: [2024-01-06 20:22:45,218][model8_pretrain.py][INFO] Epoch:[0/2](930100/4588595) loss:3.349 lr:0.0000100 epoch_Time:23298.0min: [2024-01-06 20:22:45,218][model8_pretrain.py][INFO] Epoch:[0/2](930100/4588595) loss:3.165 lr:0.0000100 epoch_Time:23298.0min: [2024-01-06 20:22:45,218][model8_pretrain.py][INFO] Epoch:[0/2](930100/4588595) loss:2.837 lr:0.0000100 epoch_Time:23298.0min: [2024-01-06 20:23:22,172][model8_pretrain.py][INFO] Epoch:[0/2](930200/4588595) loss:3.226 lr:0.0000100 epoch_Time:23297.0min: [2024-01-06 20:23:22,172][model8_pretrain.py][INFO] Epoch:[0/2](930200/4588595) loss:3.030 lr:0.0000100 epoch_Time:23297.0min: [2024-01-06 20:23:22,172][model8_pretrain.py][INFO] Epoch:[0/2](930200/4588595) loss:2.907 lr:0.0000100 epoch_Time:23297.0min: [2024-01-06 20:23:22,172][model8_pretrain.py][INFO] Epoch:[0/2](930200/4588595) loss:2.664 lr:0.0000100 
epoch_Time:23297.0min: [2024-01-06 20:23:22,172][model8_pretrain.py][INFO] Epoch:[0/2](930200/4588595) loss:3.109 lr:0.0000100 epoch_Time:23297.0min: [2024-01-06 20:23:22,172][model8_pretrain.py][INFO] Epoch:[0/2](930200/4588595) loss:2.658 lr:0.0000100 epoch_Time:23297.0min: [2024-01-06 20:23:22,172][model8_pretrain.py][INFO] Epoch:[0/2](930200/4588595) loss:2.835 lr:0.0000100 epoch_Time:23297.0min: [2024-01-06 20:23:22,172][model8_pretrain.py][INFO] Epoch:[0/2](930200/4588595) loss:2.837 lr:0.0000100 epoch_Time:23297.0min: [2024-01-06 20:23:59,119][model8_pretrain.py][INFO] Epoch:[0/2](930300/4588595) loss:2.858 lr:0.0000100 epoch_Time:23296.0min: [2024-01-06 20:23:59,119][model8_pretrain.py][INFO] Epoch:[0/2](930300/4588595) loss:2.612 lr:0.0000100 epoch_Time:23296.0min: [2024-01-06 20:23:59,119][model8_pretrain.py][INFO] Epoch:[0/2](930300/4588595) loss:2.953 lr:0.0000100 epoch_Time:23296.0min: [2024-01-06 20:23:59,119][model8_pretrain.py][INFO] Epoch:[0/2](930300/4588595) loss:2.833 lr:0.0000100 epoch_Time:23296.0min: [2024-01-06 20:23:59,119][model8_pretrain.py][INFO] Epoch:[0/2](930300/4588595) loss:2.946 lr:0.0000100 epoch_Time:23296.0min: [2024-01-06 20:23:59,119][model8_pretrain.py][INFO] Epoch:[0/2](930300/4588595) loss:3.245 lr:0.0000100 epoch_Time:23296.0min: [2024-01-06 20:23:59,119][model8_pretrain.py][INFO] Epoch:[0/2](930300/4588595) loss:2.083 lr:0.0000100 epoch_Time:23296.0min: [2024-01-06 20:23:59,119][model8_pretrain.py][INFO] Epoch:[0/2](930300/4588595) loss:3.213 lr:0.0000100 epoch_Time:23296.0min: [2024-01-06 20:24:36,063][model8_pretrain.py][INFO] Epoch:[0/2](930400/4588595) loss:2.501 lr:0.0000100 epoch_Time:23296.0min: [2024-01-06 20:24:36,063][model8_pretrain.py][INFO] Epoch:[0/2](930400/4588595) loss:2.714 lr:0.0000100 epoch_Time:23296.0min: [2024-01-06 20:24:36,063][model8_pretrain.py][INFO] Epoch:[0/2](930400/4588595) loss:2.823 lr:0.0000100 epoch_Time:23296.0min: [2024-01-06 20:24:36,063][model8_pretrain.py][INFO] 
Epoch:[0/2](930400/4588595) loss:2.894 lr:0.0000100 epoch_Time:23296.0min: [2024-01-06 20:24:36,063][model8_pretrain.py][INFO] Epoch:[0/2](930400/4588595) loss:2.992 lr:0.0000100 epoch_Time:23296.0min: [2024-01-06 20:24:36,063][model8_pretrain.py][INFO] Epoch:[0/2](930400/4588595) loss:2.665 lr:0.0000100 epoch_Time:23296.0min: [2024-01-06 20:24:36,063][model8_pretrain.py][INFO] Epoch:[0/2](930400/4588595) loss:2.961 lr:0.0000100 epoch_Time:23296.0min: [2024-01-06 20:24:36,064][model8_pretrain.py][INFO] Epoch:[0/2](930400/4588595) loss:3.324 lr:0.0000100 epoch_Time:23296.0min: [2024-01-06 20:25:13,018][model8_pretrain.py][INFO] Epoch:[0/2](930500/4588595) loss:3.084 lr:0.0000100 epoch_Time:23295.0min: [2024-01-06 20:25:13,018][model8_pretrain.py][INFO] Epoch:[0/2](930500/4588595) loss:2.507 lr:0.0000100 epoch_Time:23295.0min: [2024-01-06 20:25:13,018][model8_pretrain.py][INFO] Epoch:[0/2](930500/4588595) loss:3.043 lr:0.0000100 epoch_Time:23295.0min: [2024-01-06 20:25:13,018][model8_pretrain.py][INFO] Epoch:[0/2](930500/4588595) loss:3.238 lr:0.0000100 epoch_Time:23295.0min: [2024-01-06 20:25:13,018][model8_pretrain.py][INFO] Epoch:[0/2](930500/4588595) loss:2.713 lr:0.0000100 epoch_Time:23295.0min: [2024-01-06 20:25:13,018][model8_pretrain.py][INFO] Epoch:[0/2](930500/4588595) loss:2.914 lr:0.0000100 epoch_Time:23295.0min: [2024-01-06 20:25:13,018][model8_pretrain.py][INFO] Epoch:[0/2](930500/4588595) loss:3.005 lr:0.0000100 epoch_Time:23295.0min: [2024-01-06 20:25:13,019][model8_pretrain.py][INFO] Epoch:[0/2](930500/4588595) loss:2.623 lr:0.0000100 epoch_Time:23295.0min: [2024-01-06 20:26:03,941][model8_pretrain.py][INFO] Epoch:[0/2](930600/4588595) loss:2.835 lr:0.0000100 epoch_Time:23295.0min: [2024-01-06 20:26:03,941][model8_pretrain.py][INFO] Epoch:[0/2](930600/4588595) loss:2.383 lr:0.0000100 epoch_Time:23295.0min: [2024-01-06 20:26:03,941][model8_pretrain.py][INFO] Epoch:[0/2](930600/4588595) loss:2.970 lr:0.0000100 epoch_Time:23295.0min: [2024-01-06 
20:26:03,941][model8_pretrain.py][INFO] Epoch:[0/2](930600/4588595) loss:3.040 lr:0.0000100 epoch_Time:23295.0min: [2024-01-06 20:26:03,941][model8_pretrain.py][INFO] Epoch:[0/2](930600/4588595) loss:2.840 lr:0.0000100 epoch_Time:23295.0min: [2024-01-06 20:26:03,941][model8_pretrain.py][INFO] Epoch:[0/2](930600/4588595) loss:2.843 lr:0.0000100 epoch_Time:23295.0min: [2024-01-06 20:26:03,941][model8_pretrain.py][INFO] Epoch:[0/2](930600/4588595) loss:3.125 lr:0.0000100 epoch_Time:23295.0min: [2024-01-06 20:26:03,942][model8_pretrain.py][INFO] Epoch:[0/2](930600/4588595) loss:2.616 lr:0.0000100 epoch_Time:23295.0min: [2024-01-06 20:26:40,874][model8_pretrain.py][INFO] Epoch:[0/2](930700/4588595) loss:2.931 lr:0.0000100 epoch_Time:23295.0min: [2024-01-06 20:26:40,874][model8_pretrain.py][INFO] Epoch:[0/2](930700/4588595) loss:2.812 lr:0.0000100 epoch_Time:23295.0min: [2024-01-06 20:26:40,874][model8_pretrain.py][INFO] Epoch:[0/2](930700/4588595) loss:2.668 lr:0.0000100 epoch_Time:23295.0min: [2024-01-06 20:26:40,874][model8_pretrain.py][INFO] Epoch:[0/2](930700/4588595) loss:2.601 lr:0.0000100 epoch_Time:23295.0min: [2024-01-06 20:26:40,874][model8_pretrain.py][INFO] Epoch:[0/2](930700/4588595) loss:2.715 lr:0.0000100 epoch_Time:23295.0min: [2024-01-06 20:26:40,874][model8_pretrain.py][INFO] Epoch:[0/2](930700/4588595) loss:3.085 lr:0.0000100 epoch_Time:23295.0min: [2024-01-06 20:26:40,874][model8_pretrain.py][INFO] Epoch:[0/2](930700/4588595) loss:3.252 lr:0.0000100 epoch_Time:23295.0min: [2024-01-06 20:26:40,874][model8_pretrain.py][INFO] Epoch:[0/2](930700/4588595) loss:2.868 lr:0.0000100 epoch_Time:23295.0min: [2024-01-06 20:27:17,790][model8_pretrain.py][INFO] Epoch:[0/2](930800/4588595) loss:2.360 lr:0.0000100 epoch_Time:23293.0min: [2024-01-06 20:27:17,790][model8_pretrain.py][INFO] Epoch:[0/2](930800/4588595) loss:2.993 lr:0.0000100 epoch_Time:23293.0min: [2024-01-06 20:27:17,790][model8_pretrain.py][INFO] Epoch:[0/2](930800/4588595) loss:2.006 lr:0.0000100 
epoch_Time:23293.0min: [2024-01-06 20:27:17,790][model8_pretrain.py][INFO] Epoch:[0/2](930800/4588595) loss:2.498 lr:0.0000100 epoch_Time:23293.0min: [2024-01-06 20:27:17,790][model8_pretrain.py][INFO] Epoch:[0/2](930800/4588595) loss:2.725 lr:0.0000100 epoch_Time:23293.0min: [2024-01-06 20:27:17,790][model8_pretrain.py][INFO] Epoch:[0/2](930800/4588595) loss:2.398 lr:0.0000100 epoch_Time:23293.0min: [2024-01-06 20:27:17,790][model8_pretrain.py][INFO] Epoch:[0/2](930800/4588595) loss:2.569 lr:0.0000100 epoch_Time:23293.0min: [2024-01-06 20:27:17,791][model8_pretrain.py][INFO] Epoch:[0/2](930800/4588595) loss:2.863 lr:0.0000100 epoch_Time:23293.0min: [2024-01-06 20:27:54,730][model8_pretrain.py][INFO] Epoch:[0/2](930900/4588595) loss:2.805 lr:0.0000100 epoch_Time:23292.0min: [2024-01-06 20:27:54,730][model8_pretrain.py][INFO] Epoch:[0/2](930900/4588595) loss:3.047 lr:0.0000100 epoch_Time:23292.0min: [2024-01-06 20:27:54,730][model8_pretrain.py][INFO] Epoch:[0/2](930900/4588595) loss:2.476 lr:0.0000100 epoch_Time:23292.0min: [2024-01-06 20:27:54,730][model8_pretrain.py][INFO] Epoch:[0/2](930900/4588595) loss:2.521 lr:0.0000100 epoch_Time:23292.0min: [2024-01-06 20:27:54,730][model8_pretrain.py][INFO] Epoch:[0/2](930900/4588595) loss:2.923 lr:0.0000100 epoch_Time:23292.0min: [2024-01-06 20:27:54,730][model8_pretrain.py][INFO] Epoch:[0/2](930900/4588595) loss:2.200 lr:0.0000100 epoch_Time:23292.0min: [2024-01-06 20:27:54,731][model8_pretrain.py][INFO] Epoch:[0/2](930900/4588595) loss:2.312 lr:0.0000100 epoch_Time:23292.0min: [2024-01-06 20:27:54,731][model8_pretrain.py][INFO] Epoch:[0/2](930900/4588595) loss:2.102 lr:0.0000100 epoch_Time:23292.0min: [2024-01-06 20:28:31,676][model8_pretrain.py][INFO] Epoch:[0/2](931000/4588595) loss:2.407 lr:0.0000100 epoch_Time:23292.0min: [2024-01-06 20:28:31,676][model8_pretrain.py][INFO] Epoch:[0/2](931000/4588595) loss:1.868 lr:0.0000100 epoch_Time:23292.0min: [2024-01-06 20:28:31,676][model8_pretrain.py][INFO] 
Epoch:[0/2](931000/4588595) loss:3.025 lr:0.0000100 epoch_Time:23292.0min: [2024-01-06 20:28:31,676][model8_pretrain.py][INFO] Epoch:[0/2](931000/4588595) loss:2.684 lr:0.0000100 epoch_Time:23292.0min: [2024-01-06 20:28:31,676][model8_pretrain.py][INFO] Epoch:[0/2](931000/4588595) loss:2.734 lr:0.0000100 epoch_Time:23292.0min: [2024-01-06 20:28:31,676][model8_pretrain.py][INFO] Epoch:[0/2](931000/4588595) loss:3.193 lr:0.0000100 epoch_Time:23292.0min: [2024-01-06 20:28:31,676][model8_pretrain.py][INFO] Epoch:[0/2](931000/4588595) loss:2.502 lr:0.0000100 epoch_Time:23292.0min: [2024-01-06 20:28:31,676][model8_pretrain.py][INFO] Epoch:[0/2](931000/4588595) loss:3.364 lr:0.0000100 epoch_Time:23292.0min: [2024-01-06 20:29:08,622][model8_pretrain.py][INFO] Epoch:[0/2](931100/4588595) loss:2.914 lr:0.0000100 epoch_Time:23291.0min: [2024-01-06 20:29:08,622][model8_pretrain.py][INFO] Epoch:[0/2](931100/4588595) loss:2.599 lr:0.0000100 epoch_Time:23291.0min: [2024-01-06 20:29:08,622][model8_pretrain.py][INFO] Epoch:[0/2](931100/4588595) loss:3.043 lr:0.0000100 epoch_Time:23291.0min: [2024-01-06 20:29:08,622][model8_pretrain.py][INFO] Epoch:[0/2](931100/4588595) loss:3.336 lr:0.0000100 epoch_Time:23291.0min: [2024-01-06 20:29:08,622][model8_pretrain.py][INFO] Epoch:[0/2](931100/4588595) loss:2.730 lr:0.0000100 epoch_Time:23291.0min: [2024-01-06 20:29:08,622][model8_pretrain.py][INFO] Epoch:[0/2](931100/4588595) loss:2.790 lr:0.0000100 epoch_Time:23291.0min: [2024-01-06 20:29:08,623][model8_pretrain.py][INFO] Epoch:[0/2](931100/4588595) loss:2.652 lr:0.0000100 epoch_Time:23291.0min: [2024-01-06 20:29:08,623][model8_pretrain.py][INFO] Epoch:[0/2](931100/4588595) loss:2.990 lr:0.0000100 epoch_Time:23291.0min: [2024-01-06 20:29:45,563][model8_pretrain.py][INFO] Epoch:[0/2](931200/4588595) loss:2.670 lr:0.0000100 epoch_Time:23291.0min: [2024-01-06 20:29:45,563][model8_pretrain.py][INFO] Epoch:[0/2](931200/4588595) loss:2.841 lr:0.0000100 epoch_Time:23291.0min: [2024-01-06 
20:29:45,563][model8_pretrain.py][INFO] Epoch:[0/2](931200/4588595) loss:3.080 lr:0.0000100 epoch_Time:23291.0min: [2024-01-06 20:29:45,563][model8_pretrain.py][INFO] Epoch:[0/2](931200/4588595) loss:2.699 lr:0.0000100 epoch_Time:23291.0min: [2024-01-06 20:29:45,563][model8_pretrain.py][INFO] Epoch:[0/2](931200/4588595) loss:2.004 lr:0.0000100 epoch_Time:23291.0min: [2024-01-06 20:29:45,563][model8_pretrain.py][INFO] Epoch:[0/2](931200/4588595) loss:2.305 lr:0.0000100 epoch_Time:23291.0min: [2024-01-06 20:29:45,563][model8_pretrain.py][INFO] Epoch:[0/2](931200/4588595) loss:2.372 lr:0.0000100 epoch_Time:23291.0min: [2024-01-06 20:29:45,563][model8_pretrain.py][INFO] Epoch:[0/2](931200/4588595) loss:3.239 lr:0.0000100 epoch_Time:23291.0min: [2024-01-06 20:30:22,510][model8_pretrain.py][INFO] Epoch:[0/2](931300/4588595) loss:2.439 lr:0.0000100 epoch_Time:23290.0min: [2024-01-06 20:30:22,510][model8_pretrain.py][INFO] Epoch:[0/2](931300/4588595) loss:2.722 lr:0.0000100 epoch_Time:23290.0min: [2024-01-06 20:30:22,510][model8_pretrain.py][INFO] Epoch:[0/2](931300/4588595) loss:2.698 lr:0.0000100 epoch_Time:23290.0min: [2024-01-06 20:30:22,510][model8_pretrain.py][INFO] Epoch:[0/2](931300/4588595) loss:3.151 lr:0.0000100 epoch_Time:23290.0min: [2024-01-06 20:30:22,510][model8_pretrain.py][INFO] Epoch:[0/2](931300/4588595) loss:2.219 lr:0.0000100 epoch_Time:23290.0min: [2024-01-06 20:30:22,510][model8_pretrain.py][INFO] Epoch:[0/2](931300/4588595) loss:2.651 lr:0.0000100 epoch_Time:23290.0min: [2024-01-06 20:30:22,510][model8_pretrain.py][INFO] Epoch:[0/2](931300/4588595) loss:1.843 lr:0.0000100 epoch_Time:23290.0min: [2024-01-06 20:30:22,510][model8_pretrain.py][INFO] Epoch:[0/2](931300/4588595) loss:2.811 lr:0.0000100 epoch_Time:23290.0min: [2024-01-06 20:31:13,240][model8_pretrain.py][INFO] Epoch:[0/2](931400/4588595) loss:2.359 lr:0.0000100 epoch_Time:23290.0min: [2024-01-06 20:31:13,240][model8_pretrain.py][INFO] Epoch:[0/2](931400/4588595) loss:2.947 lr:0.0000100 
epoch_Time:23290.0min: [2024-01-06 20:31:13,241][model8_pretrain.py][INFO] Epoch:[0/2](931400/4588595) loss:2.884 lr:0.0000100 epoch_Time:23290.0min: [2024-01-06 20:31:13,241][model8_pretrain.py][INFO] Epoch:[0/2](931400/4588595) loss:2.528 lr:0.0000100 epoch_Time:23290.0min: [2024-01-06 20:31:13,241][model8_pretrain.py][INFO] Epoch:[0/2](931400/4588595) loss:2.949 lr:0.0000100 epoch_Time:23290.0min: [2024-01-06 20:31:13,241][model8_pretrain.py][INFO] Epoch:[0/2](931400/4588595) loss:2.891 lr:0.0000100 epoch_Time:23290.0min: [2024-01-06 20:31:13,241][model8_pretrain.py][INFO] Epoch:[0/2](931400/4588595) loss:2.366 lr:0.0000100 epoch_Time:23290.0min: [2024-01-06 20:31:13,241][model8_pretrain.py][INFO] Epoch:[0/2](931400/4588595) loss:2.908 lr:0.0000100 epoch_Time:23290.0min: [2024-01-06 20:31:50,165][model8_pretrain.py][INFO] Epoch:[0/2](931500/4588595) loss:3.146 lr:0.0000100 epoch_Time:23289.0min: [2024-01-06 20:31:50,165][model8_pretrain.py][INFO] Epoch:[0/2](931500/4588595) loss:2.232 lr:0.0000100 epoch_Time:23289.0min: [2024-01-06 20:31:50,165][model8_pretrain.py][INFO] Epoch:[0/2](931500/4588595) loss:2.492 lr:0.0000100 epoch_Time:23289.0min: [2024-01-06 20:31:50,165][model8_pretrain.py][INFO] Epoch:[0/2](931500/4588595) loss:3.076 lr:0.0000100 epoch_Time:23289.0min: [2024-01-06 20:31:50,165][model8_pretrain.py][INFO] Epoch:[0/2](931500/4588595) loss:3.004 lr:0.0000100 epoch_Time:23289.0min: [2024-01-06 20:31:50,165][model8_pretrain.py][INFO] Epoch:[0/2](931500/4588595) loss:2.932 lr:0.0000100 epoch_Time:23289.0min: [2024-01-06 20:31:50,166][model8_pretrain.py][INFO] Epoch:[0/2](931500/4588595) loss:2.925 lr:0.0000100 epoch_Time:23289.0min: [2024-01-06 20:31:50,165][model8_pretrain.py][INFO] Epoch:[0/2](931500/4588595) loss:2.663 lr:0.0000100 epoch_Time:23289.0min: [2024-01-06 20:32:27,104][model8_pretrain.py][INFO] Epoch:[0/2](931600/4588595) loss:2.718 lr:0.0000100 epoch_Time:23289.0min: [2024-01-06 20:32:27,104][model8_pretrain.py][INFO] 
Epoch:[0/2](931600/4588595) loss:2.994 lr:0.0000100 epoch_Time:23289.0min: [2024-01-06 20:32:27,105][model8_pretrain.py][INFO] Epoch:[0/2](931600/4588595) loss:3.228 lr:0.0000100 epoch_Time:23289.0min: [2024-01-06 20:32:27,105][model8_pretrain.py][INFO] Epoch:[0/2](931600/4588595) loss:2.918 lr:0.0000100 epoch_Time:23289.0min: [2024-01-06 20:32:27,105][model8_pretrain.py][INFO] Epoch:[0/2](931600/4588595) loss:2.538 lr:0.0000100 epoch_Time:23289.0min: [2024-01-06 20:32:27,105][model8_pretrain.py][INFO] Epoch:[0/2](931600/4588595) loss:3.030 lr:0.0000100 epoch_Time:23289.0min: [2024-01-06 20:32:27,105][model8_pretrain.py][INFO] Epoch:[0/2](931600/4588595) loss:2.749 lr:0.0000100 epoch_Time:23289.0min: [2024-01-06 20:32:27,105][model8_pretrain.py][INFO] Epoch:[0/2](931600/4588595) loss:2.742 lr:0.0000100 epoch_Time:23289.0min: [2024-01-06 20:33:04,049][model8_pretrain.py][INFO] Epoch:[0/2](931700/4588595) loss:3.083 lr:0.0000100 epoch_Time:23288.0min: [2024-01-06 20:33:04,049][model8_pretrain.py][INFO] Epoch:[0/2](931700/4588595) loss:2.455 lr:0.0000100 epoch_Time:23288.0min: [2024-01-06 20:33:04,049][model8_pretrain.py][INFO] Epoch:[0/2](931700/4588595) loss:3.055 lr:0.0000100 epoch_Time:23288.0min: [2024-01-06 20:33:04,049][model8_pretrain.py][INFO] Epoch:[0/2](931700/4588595) loss:3.137 lr:0.0000100 epoch_Time:23288.0min: [2024-01-06 20:33:04,049][model8_pretrain.py][INFO] Epoch:[0/2](931700/4588595) loss:2.971 lr:0.0000100 epoch_Time:23288.0min: [2024-01-06 20:33:04,049][model8_pretrain.py][INFO] Epoch:[0/2](931700/4588595) loss:2.966 lr:0.0000100 epoch_Time:23288.0min: [2024-01-06 20:33:04,049][model8_pretrain.py][INFO] Epoch:[0/2](931700/4588595) loss:3.176 lr:0.0000100 epoch_Time:23288.0min: [2024-01-06 20:33:04,049][model8_pretrain.py][INFO] Epoch:[0/2](931700/4588595) loss:2.817 lr:0.0000100 epoch_Time:23288.0min: [2024-01-06 20:33:41,000][model8_pretrain.py][INFO] Epoch:[0/2](931800/4588595) loss:3.165 lr:0.0000100 epoch_Time:23288.0min: [2024-01-06 
20:33:41,000][model8_pretrain.py][INFO] Epoch:[0/2](931800/4588595) loss:2.534 lr:0.0000100 epoch_Time:23288.0min: [2024-01-06 20:33:41,000][model8_pretrain.py][INFO] Epoch:[0/2](931800/4588595) loss:2.297 lr:0.0000100 epoch_Time:23288.0min: [2024-01-06 20:33:41,000][model8_pretrain.py][INFO] Epoch:[0/2](931800/4588595) loss:3.084 lr:0.0000100 epoch_Time:23288.0min: [2024-01-06 20:33:41,000][model8_pretrain.py][INFO] Epoch:[0/2](931800/4588595) loss:2.656 lr:0.0000100 epoch_Time:23288.0min: [2024-01-06 20:33:41,000][model8_pretrain.py][INFO] Epoch:[0/2](931800/4588595) loss:3.200 lr:0.0000100 epoch_Time:23288.0min: [2024-01-06 20:33:41,001][model8_pretrain.py][INFO] Epoch:[0/2](931800/4588595) loss:2.995 lr:0.0000100 epoch_Time:23288.0min: [2024-01-06 20:33:41,001][model8_pretrain.py][INFO] Epoch:[0/2](931800/4588595) loss:2.546 lr:0.0000100 epoch_Time:23288.0min: [2024-01-06 20:34:17,949][model8_pretrain.py][INFO] Epoch:[0/2](931900/4588595) loss:2.824 lr:0.0000100 epoch_Time:23286.0min: [2024-01-06 20:34:17,950][model8_pretrain.py][INFO] Epoch:[0/2](931900/4588595) loss:2.676 lr:0.0000100 epoch_Time:23286.0min: [2024-01-06 20:34:17,949][model8_pretrain.py][INFO] Epoch:[0/2](931900/4588595) loss:3.022 lr:0.0000100 epoch_Time:23286.0min: [2024-01-06 20:34:17,950][model8_pretrain.py][INFO] Epoch:[0/2](931900/4588595) loss:3.187 lr:0.0000100 epoch_Time:23286.0min: [2024-01-06 20:34:17,950][model8_pretrain.py][INFO] Epoch:[0/2](931900/4588595) loss:2.718 lr:0.0000100 epoch_Time:23286.0min: [2024-01-06 20:34:17,950][model8_pretrain.py][INFO] Epoch:[0/2](931900/4588595) loss:2.707 lr:0.0000100 epoch_Time:23286.0min: [2024-01-06 20:34:17,950][model8_pretrain.py][INFO] Epoch:[0/2](931900/4588595) loss:2.982 lr:0.0000100 epoch_Time:23286.0min: [2024-01-06 20:34:17,950][model8_pretrain.py][INFO] Epoch:[0/2](931900/4588595) loss:2.462 lr:0.0000100 epoch_Time:23286.0min: [2024-01-06 20:34:54,950][model8_pretrain.py][INFO] Epoch:[0/2](932000/4588595) loss:3.118 lr:0.0000100 
epoch_Time:23285.0min: [2024-01-06 20:34:54,950][model8_pretrain.py][INFO] Epoch:[0/2](932000/4588595) loss:2.649 lr:0.0000100 epoch_Time:23285.0min: [2024-01-06 20:34:54,950][model8_pretrain.py][INFO] Epoch:[0/2](932000/4588595) loss:2.650 lr:0.0000100 epoch_Time:23285.0min: [2024-01-06 20:34:54,950][model8_pretrain.py][INFO] Epoch:[0/2](932000/4588595) loss:2.100 lr:0.0000100 epoch_Time:23285.0min: [2024-01-06 20:34:54,950][model8_pretrain.py][INFO] Epoch:[0/2](932000/4588595) loss:3.036 lr:0.0000100 epoch_Time:23285.0min: [2024-01-06 20:34:54,950][model8_pretrain.py][INFO] Epoch:[0/2](932000/4588595) loss:3.035 lr:0.0000100 epoch_Time:23285.0min: [2024-01-06 20:34:54,950][model8_pretrain.py][INFO] Epoch:[0/2](932000/4588595) loss:1.957 lr:0.0000100 epoch_Time:23285.0min: [2024-01-06 20:34:54,950][model8_pretrain.py][INFO] Epoch:[0/2](932000/4588595) loss:3.019 lr:0.0000100 epoch_Time:23285.0min: [2024-01-06 20:35:31,946][model8_pretrain.py][INFO] Epoch:[0/2](932100/4588595) loss:2.406 lr:0.0000100 epoch_Time:23285.0min: [2024-01-06 20:35:31,946][model8_pretrain.py][INFO] Epoch:[0/2](932100/4588595) loss:2.971 lr:0.0000100 epoch_Time:23285.0min: [2024-01-06 20:35:31,946][model8_pretrain.py][INFO] Epoch:[0/2](932100/4588595) loss:3.047 lr:0.0000100 epoch_Time:23285.0min: [2024-01-06 20:35:31,946][model8_pretrain.py][INFO] Epoch:[0/2](932100/4588595) loss:2.228 lr:0.0000100 epoch_Time:23285.0min: [2024-01-06 20:35:31,946][model8_pretrain.py][INFO] Epoch:[0/2](932100/4588595) loss:3.275 lr:0.0000100 epoch_Time:23285.0min: [2024-01-06 20:35:31,946][model8_pretrain.py][INFO] Epoch:[0/2](932100/4588595) loss:2.726 lr:0.0000100 epoch_Time:23285.0min: [2024-01-06 20:35:31,946][model8_pretrain.py][INFO] Epoch:[0/2](932100/4588595) loss:2.836 lr:0.0000100 epoch_Time:23285.0min: [2024-01-06 20:35:31,946][model8_pretrain.py][INFO] Epoch:[0/2](932100/4588595) loss:2.554 lr:0.0000100 epoch_Time:23285.0min: [2024-01-06 20:36:21,045][model8_pretrain.py][INFO] 
Epoch:[0/2](932200/4588595) loss:2.326 lr:0.0000100 epoch_Time:23285.0min: [2024-01-06 20:36:21,045][model8_pretrain.py][INFO] Epoch:[0/2](932200/4588595) loss:2.711 lr:0.0000100 epoch_Time:23285.0min: [2024-01-06 20:36:21,045][model8_pretrain.py][INFO] Epoch:[0/2](932200/4588595) loss:2.745 lr:0.0000100 epoch_Time:23285.0min: [2024-01-06 20:36:21,045][model8_pretrain.py][INFO] Epoch:[0/2](932200/4588595) loss:2.755 lr:0.0000100 epoch_Time:23285.0min: [2024-01-06 20:36:21,045][model8_pretrain.py][INFO] Epoch:[0/2](932200/4588595) loss:2.347 lr:0.0000100 epoch_Time:23285.0min: [2024-01-06 20:36:21,045][model8_pretrain.py][INFO] Epoch:[0/2](932200/4588595) loss:2.853 lr:0.0000100 epoch_Time:23285.0min: [2024-01-06 20:36:21,045][model8_pretrain.py][INFO] Epoch:[0/2](932200/4588595) loss:2.977 lr:0.0000100 epoch_Time:23285.0min: [2024-01-06 20:36:21,045][model8_pretrain.py][INFO] Epoch:[0/2](932200/4588595) loss:2.984 lr:0.0000100 epoch_Time:23285.0min: [2024-01-06 20:36:59,660][model8_pretrain.py][INFO] Epoch:[0/2](932300/4588595) loss:3.169 lr:0.0000100 epoch_Time:23284.0min: [2024-01-06 20:36:59,660][model8_pretrain.py][INFO] Epoch:[0/2](932300/4588595) loss:2.643 lr:0.0000100 epoch_Time:23284.0min: [2024-01-06 20:36:59,660][model8_pretrain.py][INFO] Epoch:[0/2](932300/4588595) loss:2.718 lr:0.0000100 epoch_Time:23284.0min: [2024-01-06 20:36:59,660][model8_pretrain.py][INFO] Epoch:[0/2](932300/4588595) loss:2.949 lr:0.0000100 epoch_Time:23284.0min: [2024-01-06 20:36:59,660][model8_pretrain.py][INFO] Epoch:[0/2](932300/4588595) loss:2.816 lr:0.0000100 epoch_Time:23284.0min: [2024-01-06 20:36:59,660][model8_pretrain.py][INFO] Epoch:[0/2](932300/4588595) loss:2.263 lr:0.0000100 epoch_Time:23284.0min: [2024-01-06 20:36:59,660][model8_pretrain.py][INFO] Epoch:[0/2](932300/4588595) loss:2.794 lr:0.0000100 epoch_Time:23284.0min: [2024-01-06 20:36:59,660][model8_pretrain.py][INFO] Epoch:[0/2](932300/4588595) loss:2.952 lr:0.0000100 epoch_Time:23284.0min: [2024-01-06 
20:37:36,596][model8_pretrain.py][INFO] Epoch:[0/2](932400/4588595) loss:2.455 lr:0.0000100 epoch_Time:23284.0min: [2024-01-06 20:37:36,596][model8_pretrain.py][INFO] Epoch:[0/2](932400/4588595) loss:2.794 lr:0.0000100 epoch_Time:23284.0min: [2024-01-06 20:37:36,596][model8_pretrain.py][INFO] Epoch:[0/2](932400/4588595) loss:3.190 lr:0.0000100 epoch_Time:23284.0min: [2024-01-06 20:37:36,596][model8_pretrain.py][INFO] Epoch:[0/2](932400/4588595) loss:2.453 lr:0.0000100 epoch_Time:23284.0min: [2024-01-06 20:37:36,596][model8_pretrain.py][INFO] Epoch:[0/2](932400/4588595) loss:2.589 lr:0.0000100 epoch_Time:23284.0min: [2024-01-06 20:37:36,596][model8_pretrain.py][INFO] Epoch:[0/2](932400/4588595) loss:2.113 lr:0.0000100 epoch_Time:23284.0min: [2024-01-06 20:37:36,596][model8_pretrain.py][INFO] Epoch:[0/2](932400/4588595) loss:3.337 lr:0.0000100 epoch_Time:23284.0min: [2024-01-06 20:37:36,596][model8_pretrain.py][INFO] Epoch:[0/2](932400/4588595) loss:2.537 lr:0.0000100 epoch_Time:23284.0min: [2024-01-06 20:38:13,542][model8_pretrain.py][INFO] Epoch:[0/2](932500/4588595) loss:2.625 lr:0.0000100 epoch_Time:23283.0min: [2024-01-06 20:38:13,542][model8_pretrain.py][INFO] Epoch:[0/2](932500/4588595) loss:2.947 lr:0.0000100 epoch_Time:23283.0min: [2024-01-06 20:38:13,542][model8_pretrain.py][INFO] Epoch:[0/2](932500/4588595) loss:2.841 lr:0.0000100 epoch_Time:23283.0min: [2024-01-06 20:38:13,542][model8_pretrain.py][INFO] Epoch:[0/2](932500/4588595) loss:2.954 lr:0.0000100 epoch_Time:23283.0min: [2024-01-06 20:38:13,542][model8_pretrain.py][INFO] Epoch:[0/2](932500/4588595) loss:2.724 lr:0.0000100 epoch_Time:23283.0min: [2024-01-06 20:38:13,542][model8_pretrain.py][INFO] Epoch:[0/2](932500/4588595) loss:2.418 lr:0.0000100 epoch_Time:23283.0min: [2024-01-06 20:38:13,542][model8_pretrain.py][INFO] Epoch:[0/2](932500/4588595) loss:2.905 lr:0.0000100 epoch_Time:23283.0min: [2024-01-06 20:38:13,542][model8_pretrain.py][INFO] Epoch:[0/2](932500/4588595) loss:3.091 lr:0.0000100 
epoch_Time:23283.0min: [2024-01-06 20:38:50,477][model8_pretrain.py][INFO] Epoch:[0/2](932600/4588595) loss:3.255 lr:0.0000100 epoch_Time:23282.0min: [2024-01-06 20:38:50,477][model8_pretrain.py][INFO] Epoch:[0/2](932600/4588595) loss:2.945 lr:0.0000100 epoch_Time:23282.0min: [2024-01-06 20:38:50,477][model8_pretrain.py][INFO] Epoch:[0/2](932600/4588595) loss:2.890 lr:0.0000100 epoch_Time:23282.0min: [2024-01-06 20:38:50,477][model8_pretrain.py][INFO] Epoch:[0/2](932600/4588595) loss:2.851 lr:0.0000100 epoch_Time:23282.0min: [2024-01-06 20:38:50,477][model8_pretrain.py][INFO] Epoch:[0/2](932600/4588595) loss:2.740 lr:0.0000100 epoch_Time:23282.0min: [2024-01-06 20:38:50,477][model8_pretrain.py][INFO] Epoch:[0/2](932600/4588595) loss:2.940 lr:0.0000100 epoch_Time:23282.0min: [2024-01-06 20:38:50,477][model8_pretrain.py][INFO] Epoch:[0/2](932600/4588595) loss:2.651 lr:0.0000100 epoch_Time:23282.0min: [2024-01-06 20:38:50,477][model8_pretrain.py][INFO] Epoch:[0/2](932600/4588595) loss:2.683 lr:0.0000100 epoch_Time:23282.0min: [2024-01-06 20:39:27,423][model8_pretrain.py][INFO] Epoch:[0/2](932700/4588595) loss:2.210 lr:0.0000100 epoch_Time:23282.0min: [2024-01-06 20:39:27,423][model8_pretrain.py][INFO] Epoch:[0/2](932700/4588595) loss:3.206 lr:0.0000100 epoch_Time:23282.0min: [2024-01-06 20:39:27,423][model8_pretrain.py][INFO] Epoch:[0/2](932700/4588595) loss:2.917 lr:0.0000100 epoch_Time:23282.0min: [2024-01-06 20:39:27,423][model8_pretrain.py][INFO] Epoch:[0/2](932700/4588595) loss:3.078 lr:0.0000100 epoch_Time:23282.0min: [2024-01-06 20:39:27,423][model8_pretrain.py][INFO] Epoch:[0/2](932700/4588595) loss:2.903 lr:0.0000100 epoch_Time:23282.0min: [2024-01-06 20:39:27,423][model8_pretrain.py][INFO] Epoch:[0/2](932700/4588595) loss:2.635 lr:0.0000100 epoch_Time:23282.0min: [2024-01-06 20:39:27,423][model8_pretrain.py][INFO] Epoch:[0/2](932700/4588595) loss:2.782 lr:0.0000100 epoch_Time:23282.0min: [2024-01-06 20:39:27,423][model8_pretrain.py][INFO] 
Epoch:[0/2](932700/4588595) loss:2.545 lr:0.0000100 epoch_Time:23282.0min: [2024-01-06 20:40:04,370][model8_pretrain.py][INFO] Epoch:[0/2](932800/4588595) loss:2.834 lr:0.0000100 epoch_Time:23281.0min: [2024-01-06 20:40:04,370][model8_pretrain.py][INFO] Epoch:[0/2](932800/4588595) loss:2.966 lr:0.0000100 epoch_Time:23281.0min: [2024-01-06 20:40:04,370][model8_pretrain.py][INFO] Epoch:[0/2](932800/4588595) loss:3.126 lr:0.0000100 epoch_Time:23281.0min: [2024-01-06 20:40:04,370][model8_pretrain.py][INFO] Epoch:[0/2](932800/4588595) loss:2.576 lr:0.0000100 epoch_Time:23281.0min: [2024-01-06 20:40:04,370][model8_pretrain.py][INFO] Epoch:[0/2](932800/4588595) loss:2.835 lr:0.0000100 epoch_Time:23281.0min: [2024-01-06 20:40:04,370][model8_pretrain.py][INFO] Epoch:[0/2](932800/4588595) loss:2.507 lr:0.0000100 epoch_Time:23281.0min: [2024-01-06 20:40:04,370][model8_pretrain.py][INFO] Epoch:[0/2](932800/4588595) loss:2.826 lr:0.0000100 epoch_Time:23281.0min: [2024-01-06 20:40:04,370][model8_pretrain.py][INFO] Epoch:[0/2](932800/4588595) loss:3.145 lr:0.0000100 epoch_Time:23281.0min: [2024-01-06 20:40:41,334][model8_pretrain.py][INFO] Epoch:[0/2](932900/4588595) loss:2.898 lr:0.0000100 epoch_Time:23281.0min: [2024-01-06 20:40:41,334][model8_pretrain.py][INFO] Epoch:[0/2](932900/4588595) loss:2.886 lr:0.0000100 epoch_Time:23281.0min: [2024-01-06 20:40:41,334][model8_pretrain.py][INFO] Epoch:[0/2](932900/4588595) loss:2.428 lr:0.0000100 epoch_Time:23281.0min: [2024-01-06 20:40:41,334][model8_pretrain.py][INFO] Epoch:[0/2](932900/4588595) loss:2.514 lr:0.0000100 epoch_Time:23281.0min: [2024-01-06 20:40:41,334][model8_pretrain.py][INFO] Epoch:[0/2](932900/4588595) loss:2.276 lr:0.0000100 epoch_Time:23281.0min: [2024-01-06 20:40:41,334][model8_pretrain.py][INFO] Epoch:[0/2](932900/4588595) loss:2.425 lr:0.0000100 epoch_Time:23281.0min: [2024-01-06 20:40:41,334][model8_pretrain.py][INFO] Epoch:[0/2](932900/4588595) loss:2.933 lr:0.0000100 epoch_Time:23281.0min: [2024-01-06 
20:40:41,334][model8_pretrain.py][INFO] Epoch:[0/2](932900/4588595) loss:2.885 lr:0.0000100 epoch_Time:23281.0min: [2024-01-06 20:41:30,267][model8_pretrain.py][INFO] Epoch:[0/2](933000/4588595) loss:3.108 lr:0.0000100 epoch_Time:23280.0min: [2024-01-06 20:41:30,267][model8_pretrain.py][INFO] Epoch:[0/2](933000/4588595) loss:2.746 lr:0.0000100 epoch_Time:23280.0min: [2024-01-06 20:41:30,267][model8_pretrain.py][INFO] Epoch:[0/2](933000/4588595) loss:3.364 lr:0.0000100 epoch_Time:23280.0min: [2024-01-06 20:41:30,267][model8_pretrain.py][INFO] Epoch:[0/2](933000/4588595) loss:3.039 lr:0.0000100 epoch_Time:23280.0min: [2024-01-06 20:41:30,267][model8_pretrain.py][INFO] Epoch:[0/2](933000/4588595) loss:2.998 lr:0.0000100 epoch_Time:23280.0min: [2024-01-06 20:41:30,267][model8_pretrain.py][INFO] Epoch:[0/2](933000/4588595) loss:3.080 lr:0.0000100 epoch_Time:23280.0min: [2024-01-06 20:41:30,267][model8_pretrain.py][INFO] Epoch:[0/2](933000/4588595) loss:3.060 lr:0.0000100 epoch_Time:23280.0min: [2024-01-06 20:41:30,267][model8_pretrain.py][INFO] Epoch:[0/2](933000/4588595) loss:2.735 lr:0.0000100 epoch_Time:23280.0min: [2024-01-06 20:42:08,886][model8_pretrain.py][INFO] Epoch:[0/2](933100/4588595) loss:2.852 lr:0.0000100 epoch_Time:23279.0min: [2024-01-06 20:42:08,886][model8_pretrain.py][INFO] Epoch:[0/2](933100/4588595) loss:2.837 lr:0.0000100 epoch_Time:23279.0min: [2024-01-06 20:42:08,886][model8_pretrain.py][INFO] Epoch:[0/2](933100/4588595) loss:2.735 lr:0.0000100 epoch_Time:23279.0min: [2024-01-06 20:42:08,886][model8_pretrain.py][INFO] Epoch:[0/2](933100/4588595) loss:2.390 lr:0.0000100 epoch_Time:23279.0min: [2024-01-06 20:42:08,886][model8_pretrain.py][INFO] Epoch:[0/2](933100/4588595) loss:2.634 lr:0.0000100 epoch_Time:23279.0min: [2024-01-06 20:42:08,886][model8_pretrain.py][INFO] Epoch:[0/2](933100/4588595) loss:2.501 lr:0.0000100 epoch_Time:23279.0min: [2024-01-06 20:42:08,886][model8_pretrain.py][INFO] Epoch:[0/2](933100/4588595) loss:2.992 lr:0.0000100 
epoch_Time:23279.0min: [2024-01-06 20:42:08,887][model8_pretrain.py][INFO] Epoch:[0/2](933100/4588595) loss:2.706 lr:0.0000100 epoch_Time:23279.0min: [2024-01-06 20:42:45,833][model8_pretrain.py][INFO] Epoch:[0/2](933200/4588595) loss:2.701 lr:0.0000100 epoch_Time:23279.0min: [2024-01-06 20:42:45,833][model8_pretrain.py][INFO] Epoch:[0/2](933200/4588595) loss:3.319 lr:0.0000100 epoch_Time:23279.0min: [2024-01-06 20:42:45,833][model8_pretrain.py][INFO] Epoch:[0/2](933200/4588595) loss:2.487 lr:0.0000100 epoch_Time:23279.0min: [2024-01-06 20:42:45,833][model8_pretrain.py][INFO] Epoch:[0/2](933200/4588595) loss:2.762 lr:0.0000100 epoch_Time:23279.0min: [2024-01-06 20:42:45,833][model8_pretrain.py][INFO] Epoch:[0/2](933200/4588595) loss:3.094 lr:0.0000100 epoch_Time:23279.0min: [2024-01-06 20:42:45,833][model8_pretrain.py][INFO] Epoch:[0/2](933200/4588595) loss:2.660 lr:0.0000100 epoch_Time:23279.0min: [2024-01-06 20:42:45,833][model8_pretrain.py][INFO] Epoch:[0/2](933200/4588595) loss:2.507 lr:0.0000100 epoch_Time:23279.0min: [2024-01-06 20:42:45,834][model8_pretrain.py][INFO] Epoch:[0/2](933200/4588595) loss:2.839 lr:0.0000100 epoch_Time:23279.0min: [2024-01-06 20:43:22,771][model8_pretrain.py][INFO] Epoch:[0/2](933300/4588595) loss:2.665 lr:0.0000100 epoch_Time:23278.0min: [2024-01-06 20:43:22,771][model8_pretrain.py][INFO] Epoch:[0/2](933300/4588595) loss:2.691 lr:0.0000100 epoch_Time:23278.0min: [2024-01-06 20:43:22,771][model8_pretrain.py][INFO] Epoch:[0/2](933300/4588595) loss:2.739 lr:0.0000100 epoch_Time:23278.0min: [2024-01-06 20:43:22,771][model8_pretrain.py][INFO] Epoch:[0/2](933300/4588595) loss:2.949 lr:0.0000100 epoch_Time:23278.0min: [2024-01-06 20:43:22,771][model8_pretrain.py][INFO] Epoch:[0/2](933300/4588595) loss:3.022 lr:0.0000100 epoch_Time:23278.0min: [2024-01-06 20:43:22,771][model8_pretrain.py][INFO] Epoch:[0/2](933300/4588595) loss:2.545 lr:0.0000100 epoch_Time:23278.0min: [2024-01-06 20:43:22,771][model8_pretrain.py][INFO] 
Epoch:[0/2](933300/4588595) loss:3.532 lr:0.0000100 epoch_Time:23278.0min: [2024-01-06 20:43:22,771][model8_pretrain.py][INFO] Epoch:[0/2](933300/4588595) loss:3.195 lr:0.0000100 epoch_Time:23278.0min: [2024-01-06 20:43:59,711][model8_pretrain.py][INFO] Epoch:[0/2](933400/4588595) loss:1.973 lr:0.0000100 epoch_Time:23277.0min: [2024-01-06 20:43:59,711][model8_pretrain.py][INFO] Epoch:[0/2](933400/4588595) loss:3.139 lr:0.0000100 epoch_Time:23277.0min: [2024-01-06 20:43:59,711][model8_pretrain.py][INFO] Epoch:[0/2](933400/4588595) loss:3.111 lr:0.0000100 epoch_Time:23277.0min: [2024-01-06 20:43:59,711][model8_pretrain.py][INFO] Epoch:[0/2](933400/4588595) loss:2.306 lr:0.0000100 epoch_Time:23277.0min: [2024-01-06 20:43:59,711][model8_pretrain.py][INFO] Epoch:[0/2](933400/4588595) loss:3.012 lr:0.0000100 epoch_Time:23277.0min: [2024-01-06 20:43:59,711][model8_pretrain.py][INFO] Epoch:[0/2](933400/4588595) loss:2.669 lr:0.0000100 epoch_Time:23277.0min: [2024-01-06 20:43:59,711][model8_pretrain.py][INFO] Epoch:[0/2](933400/4588595) loss:2.826 lr:0.0000100 epoch_Time:23277.0min: [2024-01-06 20:43:59,711][model8_pretrain.py][INFO] Epoch:[0/2](933400/4588595) loss:2.947 lr:0.0000100 epoch_Time:23277.0min: [2024-01-06 20:44:36,657][model8_pretrain.py][INFO] Epoch:[0/2](933500/4588595) loss:2.813 lr:0.0000100 epoch_Time:23277.0min: [2024-01-06 20:44:36,657][model8_pretrain.py][INFO] Epoch:[0/2](933500/4588595) loss:2.632 lr:0.0000100 epoch_Time:23277.0min: [2024-01-06 20:44:36,657][model8_pretrain.py][INFO] Epoch:[0/2](933500/4588595) loss:2.425 lr:0.0000100 epoch_Time:23277.0min: [2024-01-06 20:44:36,657][model8_pretrain.py][INFO] Epoch:[0/2](933500/4588595) loss:2.891 lr:0.0000100 epoch_Time:23277.0min: [2024-01-06 20:44:36,657][model8_pretrain.py][INFO] Epoch:[0/2](933500/4588595) loss:3.079 lr:0.0000100 epoch_Time:23277.0min: [2024-01-06 20:44:36,657][model8_pretrain.py][INFO] Epoch:[0/2](933500/4588595) loss:2.968 lr:0.0000100 epoch_Time:23277.0min: [2024-01-06 
20:44:36,657][model8_pretrain.py][INFO] Epoch:[0/2](933500/4588595) loss:2.667 lr:0.0000100 epoch_Time:23277.0min: [2024-01-06 20:44:36,658][model8_pretrain.py][INFO] Epoch:[0/2](933500/4588595) loss:3.069 lr:0.0000100 epoch_Time:23277.0min: [2024-01-06 20:45:13,601][model8_pretrain.py][INFO] Epoch:[0/2](933600/4588595) loss:2.960 lr:0.0000100 epoch_Time:23276.0min: [2024-01-06 20:45:13,601][model8_pretrain.py][INFO] Epoch:[0/2](933600/4588595) loss:2.600 lr:0.0000100 epoch_Time:23276.0min: [2024-01-06 20:45:13,601][model8_pretrain.py][INFO] Epoch:[0/2](933600/4588595) loss:2.937 lr:0.0000100 epoch_Time:23276.0min: [2024-01-06 20:45:13,601][model8_pretrain.py][INFO] Epoch:[0/2](933600/4588595) loss:2.631 lr:0.0000100 epoch_Time:23276.0min: [2024-01-06 20:45:13,601][model8_pretrain.py][INFO] Epoch:[0/2](933600/4588595) loss:2.959 lr:0.0000100 epoch_Time:23276.0min: [2024-01-06 20:45:13,601][model8_pretrain.py][INFO] Epoch:[0/2](933600/4588595) loss:2.560 lr:0.0000100 epoch_Time:23276.0min: [2024-01-06 20:45:13,601][model8_pretrain.py][INFO] Epoch:[0/2](933600/4588595) loss:3.041 lr:0.0000100 epoch_Time:23276.0min: [2024-01-06 20:45:13,601][model8_pretrain.py][INFO] Epoch:[0/2](933600/4588595) loss:3.027 lr:0.0000100 epoch_Time:23276.0min: [2024-01-06 20:45:50,546][model8_pretrain.py][INFO] Epoch:[0/2](933700/4588595) loss:2.651 lr:0.0000100 epoch_Time:23275.0min: [2024-01-06 20:45:50,546][model8_pretrain.py][INFO] Epoch:[0/2](933700/4588595) loss:3.348 lr:0.0000100 epoch_Time:23275.0min: [2024-01-06 20:45:50,546][model8_pretrain.py][INFO] Epoch:[0/2](933700/4588595) loss:2.798 lr:0.0000100 epoch_Time:23275.0min: [2024-01-06 20:45:50,546][model8_pretrain.py][INFO] Epoch:[0/2](933700/4588595) loss:2.416 lr:0.0000100 epoch_Time:23275.0min: [2024-01-06 20:45:50,546][model8_pretrain.py][INFO] Epoch:[0/2](933700/4588595) loss:2.971 lr:0.0000100 epoch_Time:23275.0min: [2024-01-06 20:45:50,546][model8_pretrain.py][INFO] Epoch:[0/2](933700/4588595) loss:2.858 lr:0.0000100 
epoch_Time:23275.0min: [2024-01-06 20:45:50,547][model8_pretrain.py][INFO] Epoch:[0/2](933700/4588595) loss:2.602 lr:0.0000100 epoch_Time:23275.0min: [2024-01-06 20:45:50,547][model8_pretrain.py][INFO] Epoch:[0/2](933700/4588595) loss:2.664 lr:0.0000100 epoch_Time:23275.0min: [2024-01-06 20:46:39,467][model8_pretrain.py][INFO] Epoch:[0/2](933800/4588595) loss:3.043 lr:0.0000100 epoch_Time:23276.0min: [2024-01-06 20:46:39,467][model8_pretrain.py][INFO] Epoch:[0/2](933800/4588595) loss:2.261 lr:0.0000100 epoch_Time:23276.0min: [2024-01-06 20:46:39,467][model8_pretrain.py][INFO] Epoch:[0/2](933800/4588595) loss:2.686 lr:0.0000100 epoch_Time:23276.0min: [2024-01-06 20:46:39,467][model8_pretrain.py][INFO] Epoch:[0/2](933800/4588595) loss:2.643 lr:0.0000100 epoch_Time:23276.0min: [2024-01-06 20:46:39,467][model8_pretrain.py][INFO] Epoch:[0/2](933800/4588595) loss:2.844 lr:0.0000100 epoch_Time:23276.0min: [2024-01-06 20:46:39,467][model8_pretrain.py][INFO] Epoch:[0/2](933800/4588595) loss:2.717 lr:0.0000100 epoch_Time:23276.0min: [2024-01-06 20:46:39,467][model8_pretrain.py][INFO] Epoch:[0/2](933800/4588595) loss:3.097 lr:0.0000100 epoch_Time:23276.0min: [2024-01-06 20:46:39,467][model8_pretrain.py][INFO] Epoch:[0/2](933800/4588595) loss:2.809 lr:0.0000100 epoch_Time:23276.0min: [2024-01-06 20:47:18,085][model8_pretrain.py][INFO] Epoch:[0/2](933900/4588595) loss:3.077 lr:0.0000100 epoch_Time:23275.0min: [2024-01-06 20:47:18,085][model8_pretrain.py][INFO] Epoch:[0/2](933900/4588595) loss:2.907 lr:0.0000100 epoch_Time:23275.0min: [2024-01-06 20:47:18,086][model8_pretrain.py][INFO] Epoch:[0/2](933900/4588595) loss:2.483 lr:0.0000100 epoch_Time:23275.0min: [2024-01-06 20:47:18,085][model8_pretrain.py][INFO] Epoch:[0/2](933900/4588595) loss:2.690 lr:0.0000100 epoch_Time:23275.0min: [2024-01-06 20:47:18,086][model8_pretrain.py][INFO] Epoch:[0/2](933900/4588595) loss:2.981 lr:0.0000100 epoch_Time:23275.0min: [2024-01-06 20:47:18,086][model8_pretrain.py][INFO] 
Epoch:[0/2](933900/4588595) loss:3.098 lr:0.0000100 epoch_Time:23275.0min: [2024-01-06 20:47:18,086][model8_pretrain.py][INFO] Epoch:[0/2](933900/4588595) loss:2.398 lr:0.0000100 epoch_Time:23275.0min: [2024-01-06 20:47:18,087][model8_pretrain.py][INFO] Epoch:[0/2](933900/4588595) loss:2.845 lr:0.0000100 epoch_Time:23275.0min: [2024-01-06 20:47:55,026][model8_pretrain.py][INFO] Epoch:[0/2](934000/4588595) loss:2.737 lr:0.0000100 epoch_Time:23274.0min: [2024-01-06 20:47:55,026][model8_pretrain.py][INFO] Epoch:[0/2](934000/4588595) loss:3.058 lr:0.0000100 epoch_Time:23274.0min: [2024-01-06 20:47:55,026][model8_pretrain.py][INFO] Epoch:[0/2](934000/4588595) loss:2.607 lr:0.0000100 epoch_Time:23274.0min: [2024-01-06 20:47:55,026][model8_pretrain.py][INFO] Epoch:[0/2](934000/4588595) loss:2.439 lr:0.0000100 epoch_Time:23274.0min: [2024-01-06 20:47:55,026][model8_pretrain.py][INFO] Epoch:[0/2](934000/4588595) loss:2.305 lr:0.0000100 epoch_Time:23274.0min: [2024-01-06 20:47:55,026][model8_pretrain.py][INFO] Epoch:[0/2](934000/4588595) loss:2.957 lr:0.0000100 epoch_Time:23274.0min: [2024-01-06 20:47:55,026][model8_pretrain.py][INFO] Epoch:[0/2](934000/4588595) loss:2.325 lr:0.0000100 epoch_Time:23274.0min: [2024-01-06 20:47:55,026][model8_pretrain.py][INFO] Epoch:[0/2](934000/4588595) loss:2.800 lr:0.0000100 epoch_Time:23274.0min: [2024-01-06 20:48:31,962][model8_pretrain.py][INFO] Epoch:[0/2](934100/4588595) loss:2.486 lr:0.0000100 epoch_Time:23274.0min: [2024-01-06 20:48:31,962][model8_pretrain.py][INFO] Epoch:[0/2](934100/4588595) loss:2.442 lr:0.0000100 epoch_Time:23274.0min: [2024-01-06 20:48:31,962][model8_pretrain.py][INFO] Epoch:[0/2](934100/4588595) loss:3.030 lr:0.0000100 epoch_Time:23274.0min: [2024-01-06 20:48:31,962][model8_pretrain.py][INFO] Epoch:[0/2](934100/4588595) loss:2.237 lr:0.0000100 epoch_Time:23274.0min: [2024-01-06 20:48:31,962][model8_pretrain.py][INFO] Epoch:[0/2](934100/4588595) loss:2.799 lr:0.0000100 epoch_Time:23274.0min: [2024-01-06 
20:48:31,962][model8_pretrain.py][INFO] Epoch:[0/2](934100/4588595) loss:2.888 lr:0.0000100 epoch_Time:23274.0min: [2024-01-06 20:48:31,963][model8_pretrain.py][INFO] Epoch:[0/2](934100/4588595) loss:3.127 lr:0.0000100 epoch_Time:23274.0min: [2024-01-06 20:48:31,963][model8_pretrain.py][INFO] Epoch:[0/2](934100/4588595) loss:3.167 lr:0.0000100 epoch_Time:23274.0min: [2024-01-06 20:49:08,913][model8_pretrain.py][INFO] Epoch:[0/2](934200/4588595) loss:2.320 lr:0.0000100 epoch_Time:23272.0min: [2024-01-06 20:49:08,913][model8_pretrain.py][INFO] Epoch:[0/2](934200/4588595) loss:3.250 lr:0.0000100 epoch_Time:23272.0min: [2024-01-06 20:49:08,913][model8_pretrain.py][INFO] Epoch:[0/2](934200/4588595) loss:2.825 lr:0.0000100 epoch_Time:23272.0min: [2024-01-06 20:49:08,913][model8_pretrain.py][INFO] Epoch:[0/2](934200/4588595) loss:3.424 lr:0.0000100 epoch_Time:23272.0min: [2024-01-06 20:49:08,913][model8_pretrain.py][INFO] Epoch:[0/2](934200/4588595) loss:2.695 lr:0.0000100 epoch_Time:23272.0min: [2024-01-06 20:49:08,913][model8_pretrain.py][INFO] Epoch:[0/2](934200/4588595) loss:2.147 lr:0.0000100 epoch_Time:23272.0min: [2024-01-06 20:49:08,914][model8_pretrain.py][INFO] Epoch:[0/2](934200/4588595) loss:1.692 lr:0.0000100 epoch_Time:23272.0min: [2024-01-06 20:49:08,914][model8_pretrain.py][INFO] Epoch:[0/2](934200/4588595) loss:2.567 lr:0.0000100 epoch_Time:23272.0min: [2024-01-06 20:49:45,900][model8_pretrain.py][INFO] Epoch:[0/2](934300/4588595) loss:2.795 lr:0.0000100 epoch_Time:23272.0min: [2024-01-06 20:49:45,900][model8_pretrain.py][INFO] Epoch:[0/2](934300/4588595) loss:3.111 lr:0.0000100 epoch_Time:23272.0min: [2024-01-06 20:49:45,900][model8_pretrain.py][INFO] Epoch:[0/2](934300/4588595) loss:3.333 lr:0.0000100 epoch_Time:23272.0min: [2024-01-06 20:49:45,900][model8_pretrain.py][INFO] Epoch:[0/2](934300/4588595) loss:2.739 lr:0.0000100 epoch_Time:23272.0min: [2024-01-06 20:49:45,900][model8_pretrain.py][INFO] Epoch:[0/2](934300/4588595) loss:2.466 lr:0.0000100 
epoch_Time:23272.0min: [2024-01-06 20:49:45,900][model8_pretrain.py][INFO] Epoch:[0/2](934300/4588595) loss:2.807 lr:0.0000100 epoch_Time:23272.0min: [2024-01-06 20:49:45,900][model8_pretrain.py][INFO] Epoch:[0/2](934300/4588595) loss:2.696 lr:0.0000100 epoch_Time:23272.0min: [2024-01-06 20:49:45,900][model8_pretrain.py][INFO] Epoch:[0/2](934300/4588595) loss:2.932 lr:0.0000100 epoch_Time:23272.0min: [2024-01-06 20:50:22,868][model8_pretrain.py][INFO] Epoch:[0/2](934400/4588595) loss:2.529 lr:0.0000100 epoch_Time:23271.0min: [2024-01-06 20:50:22,868][model8_pretrain.py][INFO] Epoch:[0/2](934400/4588595) loss:3.182 lr:0.0000100 epoch_Time:23271.0min: [2024-01-06 20:50:22,868][model8_pretrain.py][INFO] Epoch:[0/2](934400/4588595) loss:2.739 lr:0.0000100 epoch_Time:23271.0min: [2024-01-06 20:50:22,868][model8_pretrain.py][INFO] Epoch:[0/2](934400/4588595) loss:2.428 lr:0.0000100 epoch_Time:23271.0min: [2024-01-06 20:50:22,868][model8_pretrain.py][INFO] Epoch:[0/2](934400/4588595) loss:2.158 lr:0.0000100 epoch_Time:23271.0min: [2024-01-06 20:50:22,868][model8_pretrain.py][INFO] Epoch:[0/2](934400/4588595) loss:3.076 lr:0.0000100 epoch_Time:23271.0min: [2024-01-06 20:50:22,868][model8_pretrain.py][INFO] Epoch:[0/2](934400/4588595) loss:2.744 lr:0.0000100 epoch_Time:23271.0min: [2024-01-06 20:50:22,868][model8_pretrain.py][INFO] Epoch:[0/2](934400/4588595) loss:2.708 lr:0.0000100 epoch_Time:23271.0min: [2024-01-06 20:50:59,827][model8_pretrain.py][INFO] Epoch:[0/2](934500/4588595) loss:3.195 lr:0.0000100 epoch_Time:23270.0min: [2024-01-06 20:50:59,827][model8_pretrain.py][INFO] Epoch:[0/2](934500/4588595) loss:2.720 lr:0.0000100 epoch_Time:23270.0min: [2024-01-06 20:50:59,827][model8_pretrain.py][INFO] Epoch:[0/2](934500/4588595) loss:2.800 lr:0.0000100 epoch_Time:23270.0min: [2024-01-06 20:50:59,827][model8_pretrain.py][INFO] Epoch:[0/2](934500/4588595) loss:2.682 lr:0.0000100 epoch_Time:23270.0min: [2024-01-06 20:50:59,827][model8_pretrain.py][INFO] 
Epoch:[0/2](934500/4588595) loss:3.279 lr:0.0000100 epoch_Time:23270.0min: [2024-01-06 20:50:59,827][model8_pretrain.py][INFO] Epoch:[0/2](934500/4588595) loss:2.434 lr:0.0000100 epoch_Time:23270.0min: [2024-01-06 20:50:59,827][model8_pretrain.py][INFO] Epoch:[0/2](934500/4588595) loss:2.885 lr:0.0000100 epoch_Time:23270.0min: [2024-01-06 20:50:59,827][model8_pretrain.py][INFO] Epoch:[0/2](934500/4588595) loss:2.687 lr:0.0000100 epoch_Time:23270.0min: [2024-01-06 20:51:47,056][model8_pretrain.py][INFO] Epoch:[0/2](934600/4588595) loss:3.059 lr:0.0000100 epoch_Time:23271.0min: [2024-01-06 20:51:47,056][model8_pretrain.py][INFO] Epoch:[0/2](934600/4588595) loss:2.792 lr:0.0000100 epoch_Time:23271.0min: [2024-01-06 20:51:47,056][model8_pretrain.py][INFO] Epoch:[0/2](934600/4588595) loss:3.064 lr:0.0000100 epoch_Time:23271.0min: [2024-01-06 20:51:47,056][model8_pretrain.py][INFO] Epoch:[0/2](934600/4588595) loss:3.005 lr:0.0000100 epoch_Time:23271.0min: [2024-01-06 20:51:47,056][model8_pretrain.py][INFO] Epoch:[0/2](934600/4588595) loss:3.283 lr:0.0000100 epoch_Time:23271.0min: [2024-01-06 20:51:47,056][model8_pretrain.py][INFO] Epoch:[0/2](934600/4588595) loss:2.649 lr:0.0000100 epoch_Time:23271.0min: [2024-01-06 20:51:47,056][model8_pretrain.py][INFO] Epoch:[0/2](934600/4588595) loss:2.805 lr:0.0000100 epoch_Time:23271.0min: [2024-01-06 20:51:47,056][model8_pretrain.py][INFO] Epoch:[0/2](934600/4588595) loss:2.462 lr:0.0000100 epoch_Time:23271.0min: [2024-01-06 20:52:27,481][model8_pretrain.py][INFO] Epoch:[0/2](934700/4588595) loss:2.908 lr:0.0000100 epoch_Time:23270.0min: [2024-01-06 20:52:27,481][model8_pretrain.py][INFO] Epoch:[0/2](934700/4588595) loss:2.940 lr:0.0000100 epoch_Time:23270.0min: [2024-01-06 20:52:27,481][model8_pretrain.py][INFO] Epoch:[0/2](934700/4588595) loss:2.886 lr:0.0000100 epoch_Time:23270.0min: [2024-01-06 20:52:27,481][model8_pretrain.py][INFO] Epoch:[0/2](934700/4588595) loss:2.613 lr:0.0000100 epoch_Time:23270.0min: [2024-01-06 
20:52:27,481][model8_pretrain.py][INFO] Epoch:[0/2](934700/4588595) loss:3.142 lr:0.0000100 epoch_Time:23270.0min: [2024-01-06 20:52:27,481][model8_pretrain.py][INFO] Epoch:[0/2](934700/4588595) loss:3.088 lr:0.0000100 epoch_Time:23270.0min: [2024-01-06 20:52:27,481][model8_pretrain.py][INFO] Epoch:[0/2](934700/4588595) loss:2.436 lr:0.0000100 epoch_Time:23270.0min: [2024-01-06 20:52:27,481][model8_pretrain.py][INFO] Epoch:[0/2](934700/4588595) loss:3.181 lr:0.0000100 epoch_Time:23270.0min: [2024-01-06 20:53:04,422][model8_pretrain.py][INFO] Epoch:[0/2](934800/4588595) loss:2.361 lr:0.0000100 epoch_Time:23269.0min: [2024-01-06 20:53:04,422][model8_pretrain.py][INFO] Epoch:[0/2](934800/4588595) loss:2.400 lr:0.0000100 epoch_Time:23269.0min: [2024-01-06 20:53:04,422][model8_pretrain.py][INFO] Epoch:[0/2](934800/4588595) loss:3.107 lr:0.0000100 epoch_Time:23269.0min: [2024-01-06 20:53:04,422][model8_pretrain.py][INFO] Epoch:[0/2](934800/4588595) loss:2.956 lr:0.0000100 epoch_Time:23269.0min: [2024-01-06 20:53:04,422][model8_pretrain.py][INFO] Epoch:[0/2](934800/4588595) loss:2.752 lr:0.0000100 epoch_Time:23269.0min: [2024-01-06 20:53:04,422][model8_pretrain.py][INFO] Epoch:[0/2](934800/4588595) loss:2.813 lr:0.0000100 epoch_Time:23269.0min: [2024-01-06 20:53:04,422][model8_pretrain.py][INFO] Epoch:[0/2](934800/4588595) loss:3.259 lr:0.0000100 epoch_Time:23269.0min: [2024-01-06 20:53:04,423][model8_pretrain.py][INFO] Epoch:[0/2](934800/4588595) loss:3.002 lr:0.0000100 epoch_Time:23269.0min: [2024-01-06 20:53:41,370][model8_pretrain.py][INFO] Epoch:[0/2](934900/4588595) loss:2.617 lr:0.0000100 epoch_Time:23269.0min: [2024-01-06 20:53:41,370][model8_pretrain.py][INFO] Epoch:[0/2](934900/4588595) loss:2.800 lr:0.0000100 epoch_Time:23269.0min: [2024-01-06 20:53:41,370][model8_pretrain.py][INFO] Epoch:[0/2](934900/4588595) loss:2.825 lr:0.0000100 epoch_Time:23269.0min: [2024-01-06 20:53:41,370][model8_pretrain.py][INFO] Epoch:[0/2](934900/4588595) loss:3.131 lr:0.0000100 
epoch_Time:23269.0min: [2024-01-06 20:53:41,370][model8_pretrain.py][INFO] Epoch:[0/2](934900/4588595) loss:3.002 lr:0.0000100 epoch_Time:23269.0min: [2024-01-06 20:53:41,370][model8_pretrain.py][INFO] Epoch:[0/2](934900/4588595) loss:2.269 lr:0.0000100 epoch_Time:23269.0min: [2024-01-06 20:53:41,370][model8_pretrain.py][INFO] Epoch:[0/2](934900/4588595) loss:2.972 lr:0.0000100 epoch_Time:23269.0min: [2024-01-06 20:53:41,371][model8_pretrain.py][INFO] Epoch:[0/2](934900/4588595) loss:2.536 lr:0.0000100 epoch_Time:23269.0min: [2024-01-06 20:54:18,320][model8_pretrain.py][INFO] Epoch:[0/2](935000/4588595) loss:2.299 lr:0.0000100 epoch_Time:23268.0min: [2024-01-06 20:54:18,320][model8_pretrain.py][INFO] Epoch:[0/2](935000/4588595) loss:2.760 lr:0.0000100 epoch_Time:23268.0min: [2024-01-06 20:54:18,320][model8_pretrain.py][INFO] Epoch:[0/2](935000/4588595) loss:2.592 lr:0.0000100 epoch_Time:23268.0min: [2024-01-06 20:54:18,320][model8_pretrain.py][INFO] Epoch:[0/2](935000/4588595) loss:2.878 lr:0.0000100 epoch_Time:23268.0min: [2024-01-06 20:54:18,320][model8_pretrain.py][INFO] Epoch:[0/2](935000/4588595) loss:3.677 lr:0.0000100 epoch_Time:23268.0min: [2024-01-06 20:54:18,320][model8_pretrain.py][INFO] Epoch:[0/2](935000/4588595) loss:2.800 lr:0.0000100 epoch_Time:23268.0min: [2024-01-06 20:54:18,320][model8_pretrain.py][INFO] Epoch:[0/2](935000/4588595) loss:1.913 lr:0.0000100 epoch_Time:23268.0min: [2024-01-06 20:54:18,320][model8_pretrain.py][INFO] Epoch:[0/2](935000/4588595) loss:2.787 lr:0.0000100 epoch_Time:23268.0min: [2024-01-06 20:54:55,272][model8_pretrain.py][INFO] Epoch:[0/2](935100/4588595) loss:2.878 lr:0.0000100 epoch_Time:23267.0min: [2024-01-06 20:54:55,272][model8_pretrain.py][INFO] Epoch:[0/2](935100/4588595) loss:1.965 lr:0.0000100 epoch_Time:23267.0min: [2024-01-06 20:54:55,272][model8_pretrain.py][INFO] Epoch:[0/2](935100/4588595) loss:2.517 lr:0.0000100 epoch_Time:23267.0min: [2024-01-06 20:54:55,272][model8_pretrain.py][INFO] 
Epoch:[0/2](935100/4588595) loss:2.428 lr:0.0000100 epoch_Time:23267.0min: [2024-01-06 20:54:55,272][model8_pretrain.py][INFO] Epoch:[0/2](935100/4588595) loss:2.833 lr:0.0000100 epoch_Time:23267.0min: [2024-01-06 20:54:55,272][model8_pretrain.py][INFO] Epoch:[0/2](935100/4588595) loss:3.173 lr:0.0000100 epoch_Time:23267.0min: [2024-01-06 20:54:55,272][model8_pretrain.py][INFO] Epoch:[0/2](935100/4588595) loss:2.557 lr:0.0000100 epoch_Time:23267.0min: [2024-01-06 20:54:55,272][model8_pretrain.py][INFO] Epoch:[0/2](935100/4588595) loss:2.703 lr:0.0000100 epoch_Time:23267.0min: [2024-01-06 20:55:32,229][model8_pretrain.py][INFO] Epoch:[0/2](935200/4588595) loss:3.157 lr:0.0000100 epoch_Time:23267.0min: [2024-01-06 20:55:32,229][model8_pretrain.py][INFO] Epoch:[0/2](935200/4588595) loss:2.881 lr:0.0000100 epoch_Time:23267.0min: [2024-01-06 20:55:32,229][model8_pretrain.py][INFO] Epoch:[0/2](935200/4588595) loss:3.065 lr:0.0000100 epoch_Time:23267.0min: [2024-01-06 20:55:32,229][model8_pretrain.py][INFO] Epoch:[0/2](935200/4588595) loss:3.395 lr:0.0000100 epoch_Time:23267.0min: [2024-01-06 20:55:32,229][model8_pretrain.py][INFO] Epoch:[0/2](935200/4588595) loss:2.479 lr:0.0000100 epoch_Time:23267.0min: [2024-01-06 20:55:32,229][model8_pretrain.py][INFO] Epoch:[0/2](935200/4588595) loss:2.646 lr:0.0000100 epoch_Time:23267.0min: [2024-01-06 20:55:32,229][model8_pretrain.py][INFO] Epoch:[0/2](935200/4588595) loss:2.586 lr:0.0000100 epoch_Time:23267.0min: [2024-01-06 20:55:32,229][model8_pretrain.py][INFO] Epoch:[0/2](935200/4588595) loss:2.579 lr:0.0000100 epoch_Time:23267.0min: [2024-01-06 20:56:09,189][model8_pretrain.py][INFO] Epoch:[0/2](935300/4588595) loss:1.958 lr:0.0000100 epoch_Time:23265.0min: [2024-01-06 20:56:09,189][model8_pretrain.py][INFO] Epoch:[0/2](935300/4588595) loss:2.612 lr:0.0000100 epoch_Time:23265.0min: [2024-01-06 20:56:09,189][model8_pretrain.py][INFO] Epoch:[0/2](935300/4588595) loss:2.731 lr:0.0000100 epoch_Time:23265.0min: [2024-01-06 
20:56:09,189][model8_pretrain.py][INFO] Epoch:[0/2](935300/4588595) loss:2.684 lr:0.0000100 epoch_Time:23265.0min: [2024-01-06 20:56:09,189][model8_pretrain.py][INFO] Epoch:[0/2](935300/4588595) loss:2.679 lr:0.0000100 epoch_Time:23265.0min: [2024-01-06 20:56:09,189][model8_pretrain.py][INFO] Epoch:[0/2](935300/4588595) loss:2.772 lr:0.0000100 epoch_Time:23265.0min: [2024-01-06 20:56:09,189][model8_pretrain.py][INFO] Epoch:[0/2](935300/4588595) loss:2.350 lr:0.0000100 epoch_Time:23265.0min: [2024-01-06 20:56:09,189][model8_pretrain.py][INFO] Epoch:[0/2](935300/4588595) loss:2.438 lr:0.0000100 epoch_Time:23265.0min: [2024-01-06 20:56:56,092][model8_pretrain.py][INFO] Epoch:[0/2](935400/4588595) loss:2.480 lr:0.0000100 epoch_Time:23265.0min: [2024-01-06 20:56:56,092][model8_pretrain.py][INFO] Epoch:[0/2](935400/4588595) loss:2.583 lr:0.0000100 epoch_Time:23265.0min: [2024-01-06 20:56:56,092][model8_pretrain.py][INFO] Epoch:[0/2](935400/4588595) loss:3.106 lr:0.0000100 epoch_Time:23265.0min: [2024-01-06 20:56:56,092][model8_pretrain.py][INFO] Epoch:[0/2](935400/4588595) loss:3.295 lr:0.0000100 epoch_Time:23265.0min: [2024-01-06 20:56:56,092][model8_pretrain.py][INFO] Epoch:[0/2](935400/4588595) loss:2.878 lr:0.0000100 epoch_Time:23265.0min: [2024-01-06 20:56:56,092][model8_pretrain.py][INFO] Epoch:[0/2](935400/4588595) loss:2.925 lr:0.0000100 epoch_Time:23265.0min: [2024-01-06 20:56:56,092][model8_pretrain.py][INFO] Epoch:[0/2](935400/4588595) loss:2.360 lr:0.0000100 epoch_Time:23265.0min: [2024-01-06 20:56:56,092][model8_pretrain.py][INFO] Epoch:[0/2](935400/4588595) loss:2.855 lr:0.0000100 epoch_Time:23265.0min: [2024-01-06 20:57:36,470][model8_pretrain.py][INFO] Epoch:[0/2](935500/4588595) loss:2.956 lr:0.0000100 epoch_Time:23265.0min: [2024-01-06 20:57:36,470][model8_pretrain.py][INFO] Epoch:[0/2](935500/4588595) loss:3.273 lr:0.0000100 epoch_Time:23265.0min: [2024-01-06 20:57:36,470][model8_pretrain.py][INFO] Epoch:[0/2](935500/4588595) loss:2.308 lr:0.0000100 
epoch_Time:23265.0min: [2024-01-06 20:57:36,470][model8_pretrain.py][INFO] Epoch:[0/2](935500/4588595) loss:2.671 lr:0.0000100 epoch_Time:23265.0min: [2024-01-06 20:57:36,470][model8_pretrain.py][INFO] Epoch:[0/2](935500/4588595) loss:3.406 lr:0.0000100 epoch_Time:23265.0min: [2024-01-06 20:57:36,470][model8_pretrain.py][INFO] Epoch:[0/2](935500/4588595) loss:2.283 lr:0.0000100 epoch_Time:23265.0min: [2024-01-06 20:57:36,471][model8_pretrain.py][INFO] Epoch:[0/2](935500/4588595) loss:2.903 lr:0.0000100 epoch_Time:23265.0min: [2024-01-06 20:57:36,471][model8_pretrain.py][INFO] Epoch:[0/2](935500/4588595) loss:2.984 lr:0.0000100 epoch_Time:23265.0min: [2024-01-06 20:58:13,411][model8_pretrain.py][INFO] Epoch:[0/2](935600/4588595) loss:2.744 lr:0.0000100 epoch_Time:23264.0min: [2024-01-06 20:58:13,411][model8_pretrain.py][INFO] Epoch:[0/2](935600/4588595) loss:3.162 lr:0.0000100 epoch_Time:23264.0min: [2024-01-06 20:58:13,411][model8_pretrain.py][INFO] Epoch:[0/2](935600/4588595) loss:2.768 lr:0.0000100 epoch_Time:23264.0min: [2024-01-06 20:58:13,411][model8_pretrain.py][INFO] Epoch:[0/2](935600/4588595) loss:3.063 lr:0.0000100 epoch_Time:23264.0min: [2024-01-06 20:58:13,411][model8_pretrain.py][INFO] Epoch:[0/2](935600/4588595) loss:2.784 lr:0.0000100 epoch_Time:23264.0min: [2024-01-06 20:58:13,411][model8_pretrain.py][INFO] Epoch:[0/2](935600/4588595) loss:2.655 lr:0.0000100 epoch_Time:23264.0min: [2024-01-06 20:58:13,411][model8_pretrain.py][INFO] Epoch:[0/2](935600/4588595) loss:2.827 lr:0.0000100 epoch_Time:23264.0min: [2024-01-06 20:58:13,411][model8_pretrain.py][INFO] Epoch:[0/2](935600/4588595) loss:2.678 lr:0.0000100 epoch_Time:23264.0min: [2024-01-06 20:58:50,360][model8_pretrain.py][INFO] Epoch:[0/2](935700/4588595) loss:2.444 lr:0.0000100 epoch_Time:23263.0min: [2024-01-06 20:58:50,360][model8_pretrain.py][INFO] Epoch:[0/2](935700/4588595) loss:2.549 lr:0.0000100 epoch_Time:23263.0min: [2024-01-06 20:58:50,360][model8_pretrain.py][INFO] 
Epoch:[0/2](935700/4588595) loss:2.812 lr:0.0000100 epoch_Time:23263.0min: [2024-01-06 20:58:50,360][model8_pretrain.py][INFO] Epoch:[0/2](935700/4588595) loss:2.745 lr:0.0000100 epoch_Time:23263.0min: [2024-01-06 20:58:50,360][model8_pretrain.py][INFO] Epoch:[0/2](935700/4588595) loss:2.556 lr:0.0000100 epoch_Time:23263.0min: [2024-01-06 20:58:50,360][model8_pretrain.py][INFO] Epoch:[0/2](935700/4588595) loss:2.961 lr:0.0000100 epoch_Time:23263.0min: [2024-01-06 20:58:50,360][model8_pretrain.py][INFO] Epoch:[0/2](935700/4588595) loss:2.812 lr:0.0000100 epoch_Time:23263.0min: [2024-01-06 20:58:50,360][model8_pretrain.py][INFO] Epoch:[0/2](935700/4588595) loss:2.157 lr:0.0000100 epoch_Time:23263.0min: [2024-01-06 20:59:27,309][model8_pretrain.py][INFO] Epoch:[0/2](935800/4588595) loss:3.299 lr:0.0000100 epoch_Time:23263.0min: [2024-01-06 20:59:27,309][model8_pretrain.py][INFO] Epoch:[0/2](935800/4588595) loss:3.114 lr:0.0000100 epoch_Time:23263.0min: [2024-01-06 20:59:27,309][model8_pretrain.py][INFO] Epoch:[0/2](935800/4588595) loss:3.138 lr:0.0000100 epoch_Time:23263.0min: [2024-01-06 20:59:27,309][model8_pretrain.py][INFO] Epoch:[0/2](935800/4588595) loss:2.820 lr:0.0000100 epoch_Time:23263.0min: [2024-01-06 20:59:27,309][model8_pretrain.py][INFO] Epoch:[0/2](935800/4588595) loss:2.877 lr:0.0000100 epoch_Time:23263.0min: [2024-01-06 20:59:27,309][model8_pretrain.py][INFO] Epoch:[0/2](935800/4588595) loss:2.654 lr:0.0000100 epoch_Time:23263.0min: [2024-01-06 20:59:27,309][model8_pretrain.py][INFO] Epoch:[0/2](935800/4588595) loss:2.845 lr:0.0000100 epoch_Time:23263.0min: [2024-01-06 20:59:27,309][model8_pretrain.py][INFO] Epoch:[0/2](935800/4588595) loss:2.712 lr:0.0000100 epoch_Time:23263.0min: [2024-01-06 21:00:04,261][model8_pretrain.py][INFO] Epoch:[0/2](935900/4588595) loss:2.241 lr:0.0000100 epoch_Time:23262.0min: [2024-01-06 21:00:04,261][model8_pretrain.py][INFO] Epoch:[0/2](935900/4588595) loss:3.085 lr:0.0000100 epoch_Time:23262.0min: [2024-01-06 
21:00:04,261][model8_pretrain.py][INFO] Epoch:[0/2](935900/4588595) loss:3.078 lr:0.0000100 epoch_Time:23262.0min: [2024-01-06 21:00:04,262][model8_pretrain.py][INFO] Epoch:[0/2](935900/4588595) loss:3.072 lr:0.0000100 epoch_Time:23262.0min: [2024-01-06 21:00:04,262][model8_pretrain.py][INFO] Epoch:[0/2](935900/4588595) loss:2.681 lr:0.0000100 epoch_Time:23262.0min: [2024-01-06 21:00:04,262][model8_pretrain.py][INFO] Epoch:[0/2](935900/4588595) loss:2.544 lr:0.0000100 epoch_Time:23262.0min: [2024-01-06 21:00:04,262][model8_pretrain.py][INFO] Epoch:[0/2](935900/4588595) loss:2.924 lr:0.0000100 epoch_Time:23262.0min: [2024-01-06 21:00:04,262][model8_pretrain.py][INFO] Epoch:[0/2](935900/4588595) loss:2.893 lr:0.0000100 epoch_Time:23262.0min: [2024-01-06 21:00:41,205][model8_pretrain.py][INFO] Epoch:[0/2](936000/4588595) loss:2.855 lr:0.0000100 epoch_Time:23262.0min: [2024-01-06 21:00:41,205][model8_pretrain.py][INFO] Epoch:[0/2](936000/4588595) loss:2.677 lr:0.0000100 epoch_Time:23262.0min: [2024-01-06 21:00:41,205][model8_pretrain.py][INFO] Epoch:[0/2](936000/4588595) loss:2.663 lr:0.0000100 epoch_Time:23262.0min: [2024-01-06 21:00:41,205][model8_pretrain.py][INFO] Epoch:[0/2](936000/4588595) loss:2.260 lr:0.0000100 epoch_Time:23262.0min: [2024-01-06 21:00:41,205][model8_pretrain.py][INFO] Epoch:[0/2](936000/4588595) loss:2.506 lr:0.0000100 epoch_Time:23262.0min: [2024-01-06 21:00:41,205][model8_pretrain.py][INFO] Epoch:[0/2](936000/4588595) loss:2.630 lr:0.0000100 epoch_Time:23262.0min: [2024-01-06 21:00:41,205][model8_pretrain.py][INFO] Epoch:[0/2](936000/4588595) loss:2.393 lr:0.0000100 epoch_Time:23262.0min: [2024-01-06 21:00:41,206][model8_pretrain.py][INFO] Epoch:[0/2](936000/4588595) loss:3.044 lr:0.0000100 epoch_Time:23262.0min: [2024-01-06 21:01:18,154][model8_pretrain.py][INFO] Epoch:[0/2](936100/4588595) loss:2.832 lr:0.0000100 epoch_Time:23261.0min: [2024-01-06 21:01:18,154][model8_pretrain.py][INFO] Epoch:[0/2](936100/4588595) loss:2.839 lr:0.0000100 
epoch_Time:23261.0min: [2024-01-06 21:01:18,155][model8_pretrain.py][INFO] Epoch:[0/2](936100/4588595) loss:3.055 lr:0.0000100 epoch_Time:23261.0min: [2024-01-06 21:01:18,155][model8_pretrain.py][INFO] Epoch:[0/2](936100/4588595) loss:3.283 lr:0.0000100 epoch_Time:23261.0min: [2024-01-06 21:01:18,155][model8_pretrain.py][INFO] Epoch:[0/2](936100/4588595) loss:3.330 lr:0.0000100 epoch_Time:23261.0min: [2024-01-06 21:01:18,155][model8_pretrain.py][INFO] Epoch:[0/2](936100/4588595) loss:2.859 lr:0.0000100 epoch_Time:23261.0min: [2024-01-06 21:01:18,156][model8_pretrain.py][INFO] Epoch:[0/2](936100/4588595) loss:3.210 lr:0.0000100 epoch_Time:23261.0min: [2024-01-06 21:01:18,156][model8_pretrain.py][INFO] Epoch:[0/2](936100/4588595) loss:3.235 lr:0.0000100 epoch_Time:23261.0min: [2024-01-06 21:02:03,256][model8_pretrain.py][INFO] Epoch:[0/2](936200/4588595) loss:2.845 lr:0.0000100 epoch_Time:23260.0min: [2024-01-06 21:02:03,256][model8_pretrain.py][INFO] Epoch:[0/2](936200/4588595) loss:2.676 lr:0.0000100 epoch_Time:23260.0min: [2024-01-06 21:02:03,256][model8_pretrain.py][INFO] Epoch:[0/2](936200/4588595) loss:2.832 lr:0.0000100 epoch_Time:23260.0min: [2024-01-06 21:02:03,256][model8_pretrain.py][INFO] Epoch:[0/2](936200/4588595) loss:2.957 lr:0.0000100 epoch_Time:23260.0min: [2024-01-06 21:02:03,256][model8_pretrain.py][INFO] Epoch:[0/2](936200/4588595) loss:3.173 lr:0.0000100 epoch_Time:23260.0min: [2024-01-06 21:02:03,256][model8_pretrain.py][INFO] Epoch:[0/2](936200/4588595) loss:2.713 lr:0.0000100 epoch_Time:23260.0min: [2024-01-06 21:02:03,256][model8_pretrain.py][INFO] Epoch:[0/2](936200/4588595) loss:3.311 lr:0.0000100 epoch_Time:23260.0min: [2024-01-06 21:02:03,261][model8_pretrain.py][INFO] Epoch:[0/2](936200/4588595) loss:3.081 lr:0.0000100 epoch_Time:23260.0min: [2024-01-06 21:02:45,319][model8_pretrain.py][INFO] Epoch:[0/2](936300/4588595) loss:2.546 lr:0.0000100 epoch_Time:23261.0min: [2024-01-06 21:02:45,319][model8_pretrain.py][INFO] 
Epoch:[0/2](936300/4588595) loss:3.099 lr:0.0000100 epoch_Time:23261.0min: [2024-01-06 21:02:45,319][model8_pretrain.py][INFO] Epoch:[0/2](936300/4588595) loss:3.020 lr:0.0000100 epoch_Time:23261.0min: [2024-01-06 21:02:45,319][model8_pretrain.py][INFO] Epoch:[0/2](936300/4588595) loss:2.707 lr:0.0000100 epoch_Time:23261.0min: [2024-01-06 21:02:45,319][model8_pretrain.py][INFO] Epoch:[0/2](936300/4588595) loss:3.194 lr:0.0000100 epoch_Time:23261.0min: [2024-01-06 21:02:45,319][model8_pretrain.py][INFO] Epoch:[0/2](936300/4588595) loss:2.551 lr:0.0000100 epoch_Time:23261.0min: [2024-01-06 21:02:45,319][model8_pretrain.py][INFO] Epoch:[0/2](936300/4588595) loss:2.964 lr:0.0000100 epoch_Time:23261.0min: [2024-01-06 21:02:45,319][model8_pretrain.py][INFO] Epoch:[0/2](936300/4588595) loss:2.738 lr:0.0000100 epoch_Time:23261.0min: [2024-01-06 21:03:22,259][model8_pretrain.py][INFO] Epoch:[0/2](936400/4588595) loss:2.829 lr:0.0000100 epoch_Time:23259.0min: [2024-01-06 21:03:22,259][model8_pretrain.py][INFO] Epoch:[0/2](936400/4588595) loss:2.549 lr:0.0000100 epoch_Time:23259.0min: [2024-01-06 21:03:22,259][model8_pretrain.py][INFO] Epoch:[0/2](936400/4588595) loss:3.200 lr:0.0000100 epoch_Time:23259.0min: [2024-01-06 21:03:22,259][model8_pretrain.py][INFO] Epoch:[0/2](936400/4588595) loss:3.425 lr:0.0000100 epoch_Time:23259.0min: [2024-01-06 21:03:22,259][model8_pretrain.py][INFO] Epoch:[0/2](936400/4588595) loss:3.059 lr:0.0000100 epoch_Time:23259.0min: [2024-01-06 21:03:22,259][model8_pretrain.py][INFO] Epoch:[0/2](936400/4588595) loss:2.703 lr:0.0000100 epoch_Time:23259.0min: [2024-01-06 21:03:22,259][model8_pretrain.py][INFO] Epoch:[0/2](936400/4588595) loss:3.219 lr:0.0000100 epoch_Time:23259.0min: [2024-01-06 21:03:22,260][model8_pretrain.py][INFO] Epoch:[0/2](936400/4588595) loss:2.900 lr:0.0000100 epoch_Time:23259.0min: [2024-01-06 21:03:59,206][model8_pretrain.py][INFO] Epoch:[0/2](936500/4588595) loss:3.054 lr:0.0000100 epoch_Time:23258.0min: [2024-01-06 
21:03:59,206][model8_pretrain.py][INFO] Epoch:[0/2](936500/4588595) loss:2.464 lr:0.0000100 epoch_Time:23258.0min: [2024-01-06 21:03:59,206][model8_pretrain.py][INFO] Epoch:[0/2](936500/4588595) loss:3.050 lr:0.0000100 epoch_Time:23258.0min: [2024-01-06 21:03:59,206][model8_pretrain.py][INFO] Epoch:[0/2](936500/4588595) loss:2.361 lr:0.0000100 epoch_Time:23258.0min: [2024-01-06 21:03:59,206][model8_pretrain.py][INFO] Epoch:[0/2](936500/4588595) loss:3.063 lr:0.0000100 epoch_Time:23258.0min: [2024-01-06 21:03:59,206][model8_pretrain.py][INFO] Epoch:[0/2](936500/4588595) loss:2.114 lr:0.0000100 epoch_Time:23258.0min: [2024-01-06 21:03:59,207][model8_pretrain.py][INFO] Epoch:[0/2](936500/4588595) loss:2.708 lr:0.0000100 epoch_Time:23258.0min: [2024-01-06 21:03:59,207][model8_pretrain.py][INFO] Epoch:[0/2](936500/4588595) loss:2.531 lr:0.0000100 epoch_Time:23258.0min: [2024-01-06 21:04:36,153][model8_pretrain.py][INFO] Epoch:[0/2](936600/4588595) loss:2.833 lr:0.0000100 epoch_Time:23258.0min: [2024-01-06 21:04:36,153][model8_pretrain.py][INFO] Epoch:[0/2](936600/4588595) loss:2.386 lr:0.0000100 epoch_Time:23258.0min: [2024-01-06 21:04:36,153][model8_pretrain.py][INFO] Epoch:[0/2](936600/4588595) loss:2.718 lr:0.0000100 epoch_Time:23258.0min: [2024-01-06 21:04:36,153][model8_pretrain.py][INFO] Epoch:[0/2](936600/4588595) loss:2.950 lr:0.0000100 epoch_Time:23258.0min: [2024-01-06 21:04:36,153][model8_pretrain.py][INFO] Epoch:[0/2](936600/4588595) loss:2.509 lr:0.0000100 epoch_Time:23258.0min: [2024-01-06 21:04:36,153][model8_pretrain.py][INFO] Epoch:[0/2](936600/4588595) loss:3.086 lr:0.0000100 epoch_Time:23258.0min: [2024-01-06 21:04:36,153][model8_pretrain.py][INFO] Epoch:[0/2](936600/4588595) loss:2.839 lr:0.0000100 epoch_Time:23258.0min: [2024-01-06 21:04:36,153][model8_pretrain.py][INFO] Epoch:[0/2](936600/4588595) loss:3.098 lr:0.0000100 epoch_Time:23258.0min: [2024-01-06 21:05:13,100][model8_pretrain.py][INFO] Epoch:[0/2](936700/4588595) loss:2.239 lr:0.0000100 
epoch_Time:23257.0min: [2024-01-06 21:05:13,100][model8_pretrain.py][INFO] Epoch:[0/2](936700/4588595) loss:2.424 lr:0.0000100 epoch_Time:23257.0min: [2024-01-06 21:05:13,101][model8_pretrain.py][INFO] Epoch:[0/2](936700/4588595) loss:2.791 lr:0.0000100 epoch_Time:23257.0min: [2024-01-06 21:05:13,101][model8_pretrain.py][INFO] Epoch:[0/2](936700/4588595) loss:2.834 lr:0.0000100 epoch_Time:23257.0min: [2024-01-06 21:05:13,101][model8_pretrain.py][INFO] Epoch:[0/2](936700/4588595) loss:2.011 lr:0.0000100 epoch_Time:23257.0min: [2024-01-06 21:05:13,101][model8_pretrain.py][INFO] Epoch:[0/2](936700/4588595) loss:3.075 lr:0.0000100 epoch_Time:23257.0min: [2024-01-06 21:05:13,101][model8_pretrain.py][INFO] Epoch:[0/2](936700/4588595) loss:3.153 lr:0.0000100 epoch_Time:23257.0min: [2024-01-06 21:05:13,101][model8_pretrain.py][INFO] Epoch:[0/2](936700/4588595) loss:2.537 lr:0.0000100 epoch_Time:23257.0min: [2024-01-06 21:05:50,052][model8_pretrain.py][INFO] Epoch:[0/2](936800/4588595) loss:2.109 lr:0.0000100 epoch_Time:23256.0min: [2024-01-06 21:05:50,052][model8_pretrain.py][INFO] Epoch:[0/2](936800/4588595) loss:2.854 lr:0.0000100 epoch_Time:23256.0min: [2024-01-06 21:05:50,052][model8_pretrain.py][INFO] Epoch:[0/2](936800/4588595) loss:2.492 lr:0.0000100 epoch_Time:23256.0min: [2024-01-06 21:05:50,052][model8_pretrain.py][INFO] Epoch:[0/2](936800/4588595) loss:2.999 lr:0.0000100 epoch_Time:23256.0min: [2024-01-06 21:05:50,052][model8_pretrain.py][INFO] Epoch:[0/2](936800/4588595) loss:2.372 lr:0.0000100 epoch_Time:23256.0min: [2024-01-06 21:05:50,053][model8_pretrain.py][INFO] Epoch:[0/2](936800/4588595) loss:2.811 lr:0.0000100 epoch_Time:23256.0min: [2024-01-06 21:05:50,053][model8_pretrain.py][INFO] Epoch:[0/2](936800/4588595) loss:2.265 lr:0.0000100 epoch_Time:23256.0min: [2024-01-06 21:05:50,053][model8_pretrain.py][INFO] Epoch:[0/2](936800/4588595) loss:2.760 lr:0.0000100 epoch_Time:23256.0min: [2024-01-06 21:06:27,011][model8_pretrain.py][INFO] 
Epoch:[0/2](936900/4588595) loss:2.702 lr:0.0000100 epoch_Time:23256.0min: [2024-01-06 21:06:27,011][model8_pretrain.py][INFO] Epoch:[0/2](936900/4588595) loss:2.803 lr:0.0000100 epoch_Time:23256.0min: [2024-01-06 21:06:27,011][model8_pretrain.py][INFO] Epoch:[0/2](936900/4588595) loss:2.869 lr:0.0000100 epoch_Time:23256.0min: [2024-01-06 21:06:27,012][model8_pretrain.py][INFO] Epoch:[0/2](936900/4588595) loss:3.144 lr:0.0000100 epoch_Time:23256.0min: [2024-01-06 21:06:27,012][model8_pretrain.py][INFO] Epoch:[0/2](936900/4588595) loss:2.867 lr:0.0000100 epoch_Time:23256.0min: [2024-01-06 21:06:27,012][model8_pretrain.py][INFO] Epoch:[0/2](936900/4588595) loss:3.009 lr:0.0000100 epoch_Time:23256.0min: [2024-01-06 21:06:27,012][model8_pretrain.py][INFO] Epoch:[0/2](936900/4588595) loss:3.015 lr:0.0000100 epoch_Time:23256.0min: [2024-01-06 21:06:27,012][model8_pretrain.py][INFO] Epoch:[0/2](936900/4588595) loss:3.119 lr:0.0000100 epoch_Time:23256.0min: [2024-01-06 21:07:10,565][model8_pretrain.py][INFO] Epoch:[0/2](937000/4588595) loss:2.967 lr:0.0000100 epoch_Time:23255.0min: [2024-01-06 21:07:10,565][model8_pretrain.py][INFO] Epoch:[0/2](937000/4588595) loss:1.682 lr:0.0000100 epoch_Time:23255.0min: [2024-01-06 21:07:10,565][model8_pretrain.py][INFO] Epoch:[0/2](937000/4588595) loss:2.387 lr:0.0000100 epoch_Time:23255.0min: [2024-01-06 21:07:10,565][model8_pretrain.py][INFO] Epoch:[0/2](937000/4588595) loss:2.507 lr:0.0000100 epoch_Time:23255.0min: [2024-01-06 21:07:10,565][model8_pretrain.py][INFO] Epoch:[0/2](937000/4588595) loss:3.040 lr:0.0000100 epoch_Time:23255.0min: [2024-01-06 21:07:10,565][model8_pretrain.py][INFO] Epoch:[0/2](937000/4588595) loss:3.108 lr:0.0000100 epoch_Time:23255.0min: [2024-01-06 21:07:10,565][model8_pretrain.py][INFO] Epoch:[0/2](937000/4588595) loss:3.069 lr:0.0000100 epoch_Time:23255.0min: [2024-01-06 21:07:10,566][model8_pretrain.py][INFO] Epoch:[0/2](937000/4588595) loss:2.812 lr:0.0000100 epoch_Time:23255.0min: [2024-01-06 
21:07:54,261][model8_pretrain.py][INFO] Epoch:[0/2](937100/4588595) loss:2.223 lr:0.0000100 epoch_Time:23255.0min: [2024-01-06 21:07:54,261][model8_pretrain.py][INFO] Epoch:[0/2](937100/4588595) loss:2.928 lr:0.0000100 epoch_Time:23255.0min: [2024-01-06 21:07:54,261][model8_pretrain.py][INFO] Epoch:[0/2](937100/4588595) loss:2.788 lr:0.0000100 epoch_Time:23255.0min: [2024-01-06 21:07:54,261][model8_pretrain.py][INFO] Epoch:[0/2](937100/4588595) loss:2.435 lr:0.0000100 epoch_Time:23255.0min: [2024-01-06 21:07:54,261][model8_pretrain.py][INFO] Epoch:[0/2](937100/4588595) loss:2.050 lr:0.0000100 epoch_Time:23255.0min: [2024-01-06 21:07:54,261][model8_pretrain.py][INFO] Epoch:[0/2](937100/4588595) loss:3.049 lr:0.0000100 epoch_Time:23255.0min: [2024-01-06 21:07:54,261][model8_pretrain.py][INFO] Epoch:[0/2](937100/4588595) loss:2.621 lr:0.0000100 epoch_Time:23255.0min: [2024-01-06 21:07:54,261][model8_pretrain.py][INFO] Epoch:[0/2](937100/4588595) loss:2.058 lr:0.0000100 epoch_Time:23255.0min: [2024-01-06 21:08:31,195][model8_pretrain.py][INFO] Epoch:[0/2](937200/4588595) loss:2.710 lr:0.0000100 epoch_Time:23255.0min: [2024-01-06 21:08:31,195][model8_pretrain.py][INFO] Epoch:[0/2](937200/4588595) loss:3.148 lr:0.0000100 epoch_Time:23255.0min: [2024-01-06 21:08:31,195][model8_pretrain.py][INFO] Epoch:[0/2](937200/4588595) loss:3.617 lr:0.0000100 epoch_Time:23255.0min: [2024-01-06 21:08:31,195][model8_pretrain.py][INFO] Epoch:[0/2](937200/4588595) loss:2.959 lr:0.0000100 epoch_Time:23255.0min: [2024-01-06 21:08:31,195][model8_pretrain.py][INFO] Epoch:[0/2](937200/4588595) loss:3.067 lr:0.0000100 epoch_Time:23255.0min: [2024-01-06 21:08:31,195][model8_pretrain.py][INFO] Epoch:[0/2](937200/4588595) loss:3.585 lr:0.0000100 epoch_Time:23255.0min: [2024-01-06 21:08:31,195][model8_pretrain.py][INFO] Epoch:[0/2](937200/4588595) loss:2.967 lr:0.0000100 epoch_Time:23255.0min: [2024-01-06 21:08:31,195][model8_pretrain.py][INFO] Epoch:[0/2](937200/4588595) loss:2.722 lr:0.0000100 
epoch_Time:23255.0min: [2024-01-06 21:09:08,142][model8_pretrain.py][INFO] Epoch:[0/2](937300/4588595) loss:2.941 lr:0.0000100 epoch_Time:23254.0min: [2024-01-06 21:09:08,142][model8_pretrain.py][INFO] Epoch:[0/2](937300/4588595) loss:2.896 lr:0.0000100 epoch_Time:23254.0min: [2024-01-06 21:09:08,142][model8_pretrain.py][INFO] Epoch:[0/2](937300/4588595) loss:2.557 lr:0.0000100 epoch_Time:23254.0min: [2024-01-06 21:09:08,142][model8_pretrain.py][INFO] Epoch:[0/2](937300/4588595) loss:3.078 lr:0.0000100 epoch_Time:23254.0min: [2024-01-06 21:09:08,142][model8_pretrain.py][INFO] Epoch:[0/2](937300/4588595) loss:1.788 lr:0.0000100 epoch_Time:23254.0min: [2024-01-06 21:09:08,142][model8_pretrain.py][INFO] Epoch:[0/2](937300/4588595) loss:2.904 lr:0.0000100 epoch_Time:23254.0min: [2024-01-06 21:09:08,142][model8_pretrain.py][INFO] Epoch:[0/2](937300/4588595) loss:3.080 lr:0.0000100 epoch_Time:23254.0min: [2024-01-06 21:09:08,142][model8_pretrain.py][INFO] Epoch:[0/2](937300/4588595) loss:2.576 lr:0.0000100 epoch_Time:23254.0min: [2024-01-06 21:09:45,096][model8_pretrain.py][INFO] Epoch:[0/2](937400/4588595) loss:3.081 lr:0.0000100 epoch_Time:23254.0min: [2024-01-06 21:09:45,096][model8_pretrain.py][INFO] Epoch:[0/2](937400/4588595) loss:2.658 lr:0.0000100 epoch_Time:23254.0min: [2024-01-06 21:09:45,096][model8_pretrain.py][INFO] Epoch:[0/2](937400/4588595) loss:2.994 lr:0.0000100 epoch_Time:23254.0min: [2024-01-06 21:09:45,096][model8_pretrain.py][INFO] Epoch:[0/2](937400/4588595) loss:2.926 lr:0.0000100 epoch_Time:23254.0min: [2024-01-06 21:09:45,096][model8_pretrain.py][INFO] Epoch:[0/2](937400/4588595) loss:2.803 lr:0.0000100 epoch_Time:23254.0min: [2024-01-06 21:09:45,096][model8_pretrain.py][INFO] Epoch:[0/2](937400/4588595) loss:3.059 lr:0.0000100 epoch_Time:23254.0min: [2024-01-06 21:09:45,096][model8_pretrain.py][INFO] Epoch:[0/2](937400/4588595) loss:2.980 lr:0.0000100 epoch_Time:23254.0min: [2024-01-06 21:09:45,096][model8_pretrain.py][INFO] 
Epoch:[0/2](937400/4588595) loss:2.797 lr:0.0000100 epoch_Time:23254.0min: [2024-01-06 21:10:22,049][model8_pretrain.py][INFO] Epoch:[0/2](937500/4588595) loss:2.147 lr:0.0000100 epoch_Time:23252.0min: [2024-01-06 21:10:22,049][model8_pretrain.py][INFO] Epoch:[0/2](937500/4588595) loss:2.361 lr:0.0000100 epoch_Time:23252.0min: [2024-01-06 21:10:22,049][model8_pretrain.py][INFO] Epoch:[0/2](937500/4588595) loss:2.828 lr:0.0000100 epoch_Time:23252.0min: [2024-01-06 21:10:22,049][model8_pretrain.py][INFO] Epoch:[0/2](937500/4588595) loss:2.384 lr:0.0000100 epoch_Time:23252.0min: [2024-01-06 21:10:22,049][model8_pretrain.py][INFO] Epoch:[0/2](937500/4588595) loss:3.083 lr:0.0000100 epoch_Time:23252.0min: [2024-01-06 21:10:22,049][model8_pretrain.py][INFO] Epoch:[0/2](937500/4588595) loss:3.142 lr:0.0000100 epoch_Time:23252.0min: [2024-01-06 21:10:22,049][model8_pretrain.py][INFO] Epoch:[0/2](937500/4588595) loss:2.819 lr:0.0000100 epoch_Time:23252.0min: [2024-01-06 21:10:22,049][model8_pretrain.py][INFO] Epoch:[0/2](937500/4588595) loss:2.772 lr:0.0000100 epoch_Time:23252.0min: [2024-01-06 21:10:58,995][model8_pretrain.py][INFO] Epoch:[0/2](937600/4588595) loss:2.411 lr:0.0000100 epoch_Time:23251.0min: [2024-01-06 21:10:58,995][model8_pretrain.py][INFO] Epoch:[0/2](937600/4588595) loss:3.132 lr:0.0000100 epoch_Time:23251.0min: [2024-01-06 21:10:58,995][model8_pretrain.py][INFO] Epoch:[0/2](937600/4588595) loss:2.695 lr:0.0000100 epoch_Time:23251.0min: [2024-01-06 21:10:58,995][model8_pretrain.py][INFO] Epoch:[0/2](937600/4588595) loss:2.129 lr:0.0000100 epoch_Time:23251.0min: [2024-01-06 21:10:58,995][model8_pretrain.py][INFO] Epoch:[0/2](937600/4588595) loss:2.650 lr:0.0000100 epoch_Time:23251.0min: [2024-01-06 21:10:58,995][model8_pretrain.py][INFO] Epoch:[0/2](937600/4588595) loss:2.917 lr:0.0000100 epoch_Time:23251.0min: [2024-01-06 21:10:58,995][model8_pretrain.py][INFO] Epoch:[0/2](937600/4588595) loss:2.811 lr:0.0000100 epoch_Time:23251.0min: [2024-01-06 
21:10:58,995][model8_pretrain.py][INFO] Epoch:[0/2](937600/4588595) loss:2.491 lr:0.0000100 epoch_Time:23251.0min: [2024-01-06 21:11:35,953][model8_pretrain.py][INFO] Epoch:[0/2](937700/4588595) loss:3.072 lr:0.0000100 epoch_Time:23251.0min: [2024-01-06 21:11:35,953][model8_pretrain.py][INFO] Epoch:[0/2](937700/4588595) loss:2.776 lr:0.0000100 epoch_Time:23251.0min: [2024-01-06 21:11:35,953][model8_pretrain.py][INFO] Epoch:[0/2](937700/4588595) loss:2.630 lr:0.0000100 epoch_Time:23251.0min: [2024-01-06 21:11:35,953][model8_pretrain.py][INFO] Epoch:[0/2](937700/4588595) loss:3.055 lr:0.0000100 epoch_Time:23251.0min: [2024-01-06 21:11:35,953][model8_pretrain.py][INFO] Epoch:[0/2](937700/4588595) loss:2.851 lr:0.0000100 epoch_Time:23251.0min: [2024-01-06 21:11:35,953][model8_pretrain.py][INFO] Epoch:[0/2](937700/4588595) loss:3.132 lr:0.0000100 epoch_Time:23251.0min: [2024-01-06 21:11:35,953][model8_pretrain.py][INFO] Epoch:[0/2](937700/4588595) loss:2.297 lr:0.0000100 epoch_Time:23251.0min: [2024-01-06 21:11:35,953][model8_pretrain.py][INFO] Epoch:[0/2](937700/4588595) loss:2.333 lr:0.0000100 epoch_Time:23251.0min: [2024-01-06 21:12:19,636][model8_pretrain.py][INFO] Epoch:[0/2](937800/4588595) loss:1.860 lr:0.0000100 epoch_Time:23251.0min: [2024-01-06 21:12:19,636][model8_pretrain.py][INFO] Epoch:[0/2](937800/4588595) loss:3.094 lr:0.0000100 epoch_Time:23251.0min: [2024-01-06 21:12:19,636][model8_pretrain.py][INFO] Epoch:[0/2](937800/4588595) loss:2.940 lr:0.0000100 epoch_Time:23251.0min: [2024-01-06 21:12:19,636][model8_pretrain.py][INFO] Epoch:[0/2](937800/4588595) loss:2.661 lr:0.0000100 epoch_Time:23251.0min: [2024-01-06 21:12:19,636][model8_pretrain.py][INFO] Epoch:[0/2](937800/4588595) loss:2.834 lr:0.0000100 epoch_Time:23251.0min: [2024-01-06 21:12:19,636][model8_pretrain.py][INFO] Epoch:[0/2](937800/4588595) loss:2.886 lr:0.0000100 epoch_Time:23251.0min: [2024-01-06 21:12:19,636][model8_pretrain.py][INFO] Epoch:[0/2](937800/4588595) loss:2.787 lr:0.0000100 
epoch_Time:23251.0min: [2024-01-06 21:12:19,636][model8_pretrain.py][INFO] Epoch:[0/2](937800/4588595) loss:2.963 lr:0.0000100 epoch_Time:23251.0min: [2024-01-06 21:13:03,340][model8_pretrain.py][INFO] Epoch:[0/2](937900/4588595) loss:2.759 lr:0.0000100 epoch_Time:23250.0min: [2024-01-06 21:13:03,340][model8_pretrain.py][INFO] Epoch:[0/2](937900/4588595) loss:2.797 lr:0.0000100 epoch_Time:23250.0min: [2024-01-06 21:13:03,340][model8_pretrain.py][INFO] Epoch:[0/2](937900/4588595) loss:3.182 lr:0.0000100 epoch_Time:23250.0min: [2024-01-06 21:13:03,340][model8_pretrain.py][INFO] Epoch:[0/2](937900/4588595) loss:2.431 lr:0.0000100 epoch_Time:23250.0min: [2024-01-06 21:13:03,340][model8_pretrain.py][INFO] Epoch:[0/2](937900/4588595) loss:3.106 lr:0.0000100 epoch_Time:23250.0min: [2024-01-06 21:13:03,340][model8_pretrain.py][INFO] Epoch:[0/2](937900/4588595) loss:3.148 lr:0.0000100 epoch_Time:23250.0min: [2024-01-06 21:13:03,340][model8_pretrain.py][INFO] Epoch:[0/2](937900/4588595) loss:2.558 lr:0.0000100 epoch_Time:23250.0min: [2024-01-06 21:13:03,340][model8_pretrain.py][INFO] Epoch:[0/2](937900/4588595) loss:2.412 lr:0.0000100 epoch_Time:23250.0min: [2024-01-06 21:13:40,301][model8_pretrain.py][INFO] Epoch:[0/2](938000/4588595) loss:2.790 lr:0.0000100 epoch_Time:23250.0min: [2024-01-06 21:13:40,301][model8_pretrain.py][INFO] Epoch:[0/2](938000/4588595) loss:3.233 lr:0.0000100 epoch_Time:23250.0min: [2024-01-06 21:13:40,301][model8_pretrain.py][INFO] Epoch:[0/2](938000/4588595) loss:2.753 lr:0.0000100 epoch_Time:23250.0min: [2024-01-06 21:13:40,301][model8_pretrain.py][INFO] Epoch:[0/2](938000/4588595) loss:2.042 lr:0.0000100 epoch_Time:23250.0min: [2024-01-06 21:13:40,301][model8_pretrain.py][INFO] Epoch:[0/2](938000/4588595) loss:3.113 lr:0.0000100 epoch_Time:23250.0min: [2024-01-06 21:13:40,301][model8_pretrain.py][INFO] Epoch:[0/2](938000/4588595) loss:2.793 lr:0.0000100 epoch_Time:23250.0min: [2024-01-06 21:13:40,301][model8_pretrain.py][INFO] 
Epoch:[0/2](938000/4588595) loss:2.512 lr:0.0000100 epoch_Time:23250.0min: [2024-01-06 21:13:40,301][model8_pretrain.py][INFO] Epoch:[0/2](938000/4588595) loss:2.821 lr:0.0000100 epoch_Time:23250.0min: [2024-01-06 21:14:17,277][model8_pretrain.py][INFO] Epoch:[0/2](938100/4588595) loss:2.920 lr:0.0000100 epoch_Time:23249.0min: [2024-01-06 21:14:17,277][model8_pretrain.py][INFO] Epoch:[0/2](938100/4588595) loss:3.133 lr:0.0000100 epoch_Time:23249.0min: [2024-01-06 21:14:17,277][model8_pretrain.py][INFO] Epoch:[0/2](938100/4588595) loss:2.785 lr:0.0000100 epoch_Time:23249.0min: [2024-01-06 21:14:17,277][model8_pretrain.py][INFO] Epoch:[0/2](938100/4588595) loss:2.723 lr:0.0000100 epoch_Time:23249.0min: [2024-01-06 21:14:17,277][model8_pretrain.py][INFO] Epoch:[0/2](938100/4588595) loss:2.370 lr:0.0000100 epoch_Time:23249.0min: [2024-01-06 21:14:17,277][model8_pretrain.py][INFO] Epoch:[0/2](938100/4588595) loss:3.169 lr:0.0000100 epoch_Time:23249.0min: [2024-01-06 21:14:17,277][model8_pretrain.py][INFO] Epoch:[0/2](938100/4588595) loss:2.603 lr:0.0000100 epoch_Time:23249.0min: [2024-01-06 21:14:17,277][model8_pretrain.py][INFO] Epoch:[0/2](938100/4588595) loss:3.129 lr:0.0000100 epoch_Time:23249.0min: [2024-01-06 21:14:54,243][model8_pretrain.py][INFO] Epoch:[0/2](938200/4588595) loss:2.364 lr:0.0000100 epoch_Time:23248.0min: [2024-01-06 21:14:54,243][model8_pretrain.py][INFO] Epoch:[0/2](938200/4588595) loss:2.555 lr:0.0000100 epoch_Time:23248.0min: [2024-01-06 21:14:54,243][model8_pretrain.py][INFO] Epoch:[0/2](938200/4588595) loss:2.643 lr:0.0000100 epoch_Time:23248.0min: [2024-01-06 21:14:54,243][model8_pretrain.py][INFO] Epoch:[0/2](938200/4588595) loss:3.251 lr:0.0000100 epoch_Time:23248.0min: [2024-01-06 21:14:54,243][model8_pretrain.py][INFO] Epoch:[0/2](938200/4588595) loss:2.658 lr:0.0000100 epoch_Time:23248.0min: [2024-01-06 21:14:54,243][model8_pretrain.py][INFO] Epoch:[0/2](938200/4588595) loss:2.792 lr:0.0000100 epoch_Time:23248.0min: [2024-01-06 
21:14:54,243][model8_pretrain.py][INFO] Epoch:[0/2](938200/4588595) loss:2.472 lr:0.0000100 epoch_Time:23248.0min: [2024-01-06 21:14:54,243][model8_pretrain.py][INFO] Epoch:[0/2](938200/4588595) loss:2.729 lr:0.0000100 epoch_Time:23248.0min: [2024-01-06 21:15:31,205][model8_pretrain.py][INFO] Epoch:[0/2](938300/4588595) loss:2.856 lr:0.0000100 epoch_Time:23248.0min: [2024-01-06 21:15:31,205][model8_pretrain.py][INFO] Epoch:[0/2](938300/4588595) loss:3.058 lr:0.0000100 epoch_Time:23248.0min: [2024-01-06 21:15:31,205][model8_pretrain.py][INFO] Epoch:[0/2](938300/4588595) loss:2.795 lr:0.0000100 epoch_Time:23248.0min: [2024-01-06 21:15:31,205][model8_pretrain.py][INFO] Epoch:[0/2](938300/4588595) loss:2.659 lr:0.0000100 epoch_Time:23248.0min: [2024-01-06 21:15:31,205][model8_pretrain.py][INFO] Epoch:[0/2](938300/4588595) loss:3.246 lr:0.0000100 epoch_Time:23248.0min: [2024-01-06 21:15:31,205][model8_pretrain.py][INFO] Epoch:[0/2](938300/4588595) loss:2.798 lr:0.0000100 epoch_Time:23248.0min: [2024-01-06 21:15:31,205][model8_pretrain.py][INFO] Epoch:[0/2](938300/4588595) loss:2.828 lr:0.0000100 epoch_Time:23248.0min: [2024-01-06 21:15:31,206][model8_pretrain.py][INFO] Epoch:[0/2](938300/4588595) loss:2.643 lr:0.0000100 epoch_Time:23248.0min: [2024-01-06 21:16:08,169][model8_pretrain.py][INFO] Epoch:[0/2](938400/4588595) loss:3.073 lr:0.0000100 epoch_Time:23247.0min: [2024-01-06 21:16:08,169][model8_pretrain.py][INFO] Epoch:[0/2](938400/4588595) loss:2.809 lr:0.0000100 epoch_Time:23247.0min: [2024-01-06 21:16:08,169][model8_pretrain.py][INFO] Epoch:[0/2](938400/4588595) loss:2.774 lr:0.0000100 epoch_Time:23247.0min: [2024-01-06 21:16:08,169][model8_pretrain.py][INFO] Epoch:[0/2](938400/4588595) loss:2.076 lr:0.0000100 epoch_Time:23247.0min: [2024-01-06 21:16:08,169][model8_pretrain.py][INFO] Epoch:[0/2](938400/4588595) loss:2.759 lr:0.0000100 epoch_Time:23247.0min: [2024-01-06 21:16:08,169][model8_pretrain.py][INFO] Epoch:[0/2](938400/4588595) loss:2.290 lr:0.0000100 
epoch_Time:23247.0min: [2024-01-06 21:16:08,169][model8_pretrain.py][INFO] Epoch:[0/2](938400/4588595) loss:2.689 lr:0.0000100 epoch_Time:23247.0min: [2024-01-06 21:16:08,169][model8_pretrain.py][INFO] Epoch:[0/2](938400/4588595) loss:3.326 lr:0.0000100 epoch_Time:23247.0min: [2024-01-06 21:16:45,135][model8_pretrain.py][INFO] Epoch:[0/2](938500/4588595) loss:2.859 lr:0.0000100 epoch_Time:23247.0min: [2024-01-06 21:16:45,135][model8_pretrain.py][INFO] Epoch:[0/2](938500/4588595) loss:2.796 lr:0.0000100 epoch_Time:23247.0min: [2024-01-06 21:16:45,135][model8_pretrain.py][INFO] Epoch:[0/2](938500/4588595) loss:2.896 lr:0.0000100 epoch_Time:23247.0min: [2024-01-06 21:16:45,135][model8_pretrain.py][INFO] Epoch:[0/2](938500/4588595) loss:2.996 lr:0.0000100 epoch_Time:23247.0min: [2024-01-06 21:16:45,135][model8_pretrain.py][INFO] Epoch:[0/2](938500/4588595) loss:2.521 lr:0.0000100 epoch_Time:23247.0min: [2024-01-06 21:16:45,135][model8_pretrain.py][INFO] Epoch:[0/2](938500/4588595) loss:2.860 lr:0.0000100 epoch_Time:23247.0min: [2024-01-06 21:16:45,135][model8_pretrain.py][INFO] Epoch:[0/2](938500/4588595) loss:2.783 lr:0.0000100 epoch_Time:23247.0min: [2024-01-06 21:16:45,135][model8_pretrain.py][INFO] Epoch:[0/2](938500/4588595) loss:3.013 lr:0.0000100 epoch_Time:23247.0min: [2024-01-06 21:17:26,922][model8_pretrain.py][INFO] Epoch:[0/2](938600/4588595) loss:3.028 lr:0.0000100 epoch_Time:23246.0min: [2024-01-06 21:17:26,922][model8_pretrain.py][INFO] Epoch:[0/2](938600/4588595) loss:3.241 lr:0.0000100 epoch_Time:23246.0min: [2024-01-06 21:17:26,922][model8_pretrain.py][INFO] Epoch:[0/2](938600/4588595) loss:2.505 lr:0.0000100 epoch_Time:23246.0min: [2024-01-06 21:17:26,926][model8_pretrain.py][INFO] Epoch:[0/2](938600/4588595) loss:2.736 lr:0.0000100 epoch_Time:23246.0min: [2024-01-06 21:17:26,927][model8_pretrain.py][INFO] Epoch:[0/2](938600/4588595) loss:2.487 lr:0.0000100 epoch_Time:23246.0min: [2024-01-06 21:17:26,927][model8_pretrain.py][INFO] 
Epoch:[0/2](938600/4588595) loss:2.975 lr:0.0000100 epoch_Time:23246.0min: [2024-01-06 21:17:26,927][model8_pretrain.py][INFO] Epoch:[0/2](938600/4588595) loss:2.878 lr:0.0000100 epoch_Time:23246.0min: [2024-01-06 21:17:28,725][model8_pretrain.py][INFO] Epoch:[0/2](938600/4588595) loss:2.368 lr:0.0000100 epoch_Time:23246.0min: [2024-01-06 21:18:12,496][model8_pretrain.py][INFO] Epoch:[0/2](938700/4588595) loss:3.056 lr:0.0000100 epoch_Time:23245.0min: [2024-01-06 21:18:12,496][model8_pretrain.py][INFO] Epoch:[0/2](938700/4588595) loss:2.397 lr:0.0000100 epoch_Time:23245.0min: [2024-01-06 21:18:12,496][model8_pretrain.py][INFO] Epoch:[0/2](938700/4588595) loss:3.126 lr:0.0000100 epoch_Time:23245.0min: [2024-01-06 21:18:12,496][model8_pretrain.py][INFO] Epoch:[0/2](938700/4588595) loss:2.730 lr:0.0000100 epoch_Time:23245.0min: [2024-01-06 21:18:12,497][model8_pretrain.py][INFO] Epoch:[0/2](938700/4588595) loss:3.404 lr:0.0000100 epoch_Time:23245.0min: [2024-01-06 21:18:12,497][model8_pretrain.py][INFO] Epoch:[0/2](938700/4588595) loss:3.266 lr:0.0000100 epoch_Time:23245.0min: [2024-01-06 21:18:12,496][model8_pretrain.py][INFO] Epoch:[0/2](938700/4588595) loss:3.202 lr:0.0000100 epoch_Time:23245.0min: [2024-01-06 21:18:12,497][model8_pretrain.py][INFO] Epoch:[0/2](938700/4588595) loss:3.172 lr:0.0000100 epoch_Time:23245.0min: [2024-01-06 21:18:49,443][model8_pretrain.py][INFO] Epoch:[0/2](938800/4588595) loss:3.091 lr:0.0000100 epoch_Time:23244.0min: [2024-01-06 21:18:49,444][model8_pretrain.py][INFO] Epoch:[0/2](938800/4588595) loss:2.252 lr:0.0000100 epoch_Time:23244.0min: [2024-01-06 21:18:49,444][model8_pretrain.py][INFO] Epoch:[0/2](938800/4588595) loss:2.193 lr:0.0000100 epoch_Time:23244.0min: [2024-01-06 21:18:49,444][model8_pretrain.py][INFO] Epoch:[0/2](938800/4588595) loss:2.080 lr:0.0000100 epoch_Time:23244.0min: [2024-01-06 21:18:49,444][model8_pretrain.py][INFO] Epoch:[0/2](938800/4588595) loss:2.355 lr:0.0000100 epoch_Time:23244.0min: [2024-01-06 
21:18:49,444][model8_pretrain.py][INFO] Epoch:[0/2](938800/4588595) loss:2.310 lr:0.0000100 epoch_Time:23244.0min: [2024-01-06 21:18:49,444][model8_pretrain.py][INFO] Epoch:[0/2](938800/4588595) loss:2.782 lr:0.0000100 epoch_Time:23244.0min: [2024-01-06 21:18:49,444][model8_pretrain.py][INFO] Epoch:[0/2](938800/4588595) loss:2.690 lr:0.0000100 epoch_Time:23244.0min: [2024-01-06 21:19:26,390][model8_pretrain.py][INFO] Epoch:[0/2](938900/4588595) loss:2.607 lr:0.0000100 epoch_Time:23244.0min: [2024-01-06 21:19:26,390][model8_pretrain.py][INFO] Epoch:[0/2](938900/4588595) loss:3.100 lr:0.0000100 epoch_Time:23244.0min: [2024-01-06 21:19:26,390][model8_pretrain.py][INFO] Epoch:[0/2](938900/4588595) loss:2.491 lr:0.0000100 epoch_Time:23244.0min: [2024-01-06 21:19:26,390][model8_pretrain.py][INFO] Epoch:[0/2](938900/4588595) loss:2.820 lr:0.0000100 epoch_Time:23244.0min: [2024-01-06 21:19:26,390][model8_pretrain.py][INFO] Epoch:[0/2](938900/4588595) loss:3.016 lr:0.0000100 epoch_Time:23244.0min: [2024-01-06 21:19:26,390][model8_pretrain.py][INFO] Epoch:[0/2](938900/4588595) loss:2.587 lr:0.0000100 epoch_Time:23244.0min: [2024-01-06 21:19:26,390][model8_pretrain.py][INFO] Epoch:[0/2](938900/4588595) loss:2.814 lr:0.0000100 epoch_Time:23244.0min: [2024-01-06 21:19:26,390][model8_pretrain.py][INFO] Epoch:[0/2](938900/4588595) loss:2.693 lr:0.0000100 epoch_Time:23244.0min: [2024-01-06 21:20:03,350][model8_pretrain.py][INFO] Epoch:[0/2](939000/4588595) loss:2.791 lr:0.0000100 epoch_Time:23243.0min: [2024-01-06 21:20:03,350][model8_pretrain.py][INFO] Epoch:[0/2](939000/4588595) loss:2.785 lr:0.0000100 epoch_Time:23243.0min: [2024-01-06 21:20:03,351][model8_pretrain.py][INFO] Epoch:[0/2](939000/4588595) loss:2.312 lr:0.0000100 epoch_Time:23243.0min: [2024-01-06 21:20:03,351][model8_pretrain.py][INFO] Epoch:[0/2](939000/4588595) loss:2.981 lr:0.0000100 epoch_Time:23243.0min: [2024-01-06 21:20:03,351][model8_pretrain.py][INFO] Epoch:[0/2](939000/4588595) loss:2.575 lr:0.0000100 
epoch_Time:23243.0min: [2024-01-06 21:20:03,351][model8_pretrain.py][INFO] Epoch:[0/2](939000/4588595) loss:2.511 lr:0.0000100 epoch_Time:23243.0min: [2024-01-06 21:20:03,351][model8_pretrain.py][INFO] Epoch:[0/2](939000/4588595) loss:2.311 lr:0.0000100 epoch_Time:23243.0min: [2024-01-06 21:20:03,351][model8_pretrain.py][INFO] Epoch:[0/2](939000/4588595) loss:2.436 lr:0.0000100 epoch_Time:23243.0min: [2024-01-06 21:20:40,315][model8_pretrain.py][INFO] Epoch:[0/2](939100/4588595) loss:2.693 lr:0.0000100 epoch_Time:23243.0min: [2024-01-06 21:20:40,315][model8_pretrain.py][INFO] Epoch:[0/2](939100/4588595) loss:2.752 lr:0.0000100 epoch_Time:23243.0min: [2024-01-06 21:20:40,315][model8_pretrain.py][INFO] Epoch:[0/2](939100/4588595) loss:2.812 lr:0.0000100 epoch_Time:23243.0min: [2024-01-06 21:20:40,315][model8_pretrain.py][INFO] Epoch:[0/2](939100/4588595) loss:2.382 lr:0.0000100 epoch_Time:23243.0min: [2024-01-06 21:20:40,315][model8_pretrain.py][INFO] Epoch:[0/2](939100/4588595) loss:2.331 lr:0.0000100 epoch_Time:23243.0min: [2024-01-06 21:20:40,315][model8_pretrain.py][INFO] Epoch:[0/2](939100/4588595) loss:2.581 lr:0.0000100 epoch_Time:23243.0min: [2024-01-06 21:20:40,315][model8_pretrain.py][INFO] Epoch:[0/2](939100/4588595) loss:2.125 lr:0.0000100 epoch_Time:23243.0min: [2024-01-06 21:20:40,315][model8_pretrain.py][INFO] Epoch:[0/2](939100/4588595) loss:2.954 lr:0.0000100 epoch_Time:23243.0min: [2024-01-06 21:21:17,278][model8_pretrain.py][INFO] Epoch:[0/2](939200/4588595) loss:1.998 lr:0.0000100 epoch_Time:23242.0min: [2024-01-06 21:21:17,278][model8_pretrain.py][INFO] Epoch:[0/2](939200/4588595) loss:1.754 lr:0.0000100 epoch_Time:23242.0min: [2024-01-06 21:21:17,278][model8_pretrain.py][INFO] Epoch:[0/2](939200/4588595) loss:2.960 lr:0.0000100 epoch_Time:23242.0min: [2024-01-06 21:21:17,278][model8_pretrain.py][INFO] Epoch:[0/2](939200/4588595) loss:2.790 lr:0.0000100 epoch_Time:23242.0min: [2024-01-06 21:21:17,278][model8_pretrain.py][INFO] 
Epoch:[0/2](939200/4588595) loss:2.266 lr:0.0000100 epoch_Time:23242.0min: [2024-01-06 21:21:17,278][model8_pretrain.py][INFO] Epoch:[0/2](939200/4588595) loss:3.343 lr:0.0000100 epoch_Time:23242.0min: [2024-01-06 21:21:17,278][model8_pretrain.py][INFO] Epoch:[0/2](939200/4588595) loss:2.656 lr:0.0000100 epoch_Time:23242.0min: [2024-01-06 21:21:17,278][model8_pretrain.py][INFO] Epoch:[0/2](939200/4588595) loss:3.055 lr:0.0000100 epoch_Time:23242.0min: [2024-01-06 21:21:54,225][model8_pretrain.py][INFO] Epoch:[0/2](939300/4588595) loss:2.801 lr:0.0000100 epoch_Time:23241.0min: [2024-01-06 21:21:54,225][model8_pretrain.py][INFO] Epoch:[0/2](939300/4588595) loss:2.825 lr:0.0000100 epoch_Time:23241.0min: [2024-01-06 21:21:54,225][model8_pretrain.py][INFO] Epoch:[0/2](939300/4588595) loss:3.161 lr:0.0000100 epoch_Time:23241.0min: [2024-01-06 21:21:54,225][model8_pretrain.py][INFO] Epoch:[0/2](939300/4588595) loss:2.746 lr:0.0000100 epoch_Time:23241.0min: [2024-01-06 21:21:54,225][model8_pretrain.py][INFO] Epoch:[0/2](939300/4588595) loss:2.811 lr:0.0000100 epoch_Time:23241.0min: [2024-01-06 21:21:54,225][model8_pretrain.py][INFO] Epoch:[0/2](939300/4588595) loss:3.070 lr:0.0000100 epoch_Time:23241.0min: [2024-01-06 21:21:54,225][model8_pretrain.py][INFO] Epoch:[0/2](939300/4588595) loss:2.645 lr:0.0000100 epoch_Time:23241.0min: [2024-01-06 21:21:54,225][model8_pretrain.py][INFO] Epoch:[0/2](939300/4588595) loss:3.110 lr:0.0000100 epoch_Time:23241.0min: [2024-01-06 21:22:34,470][model8_pretrain.py][INFO] Epoch:[0/2](939400/4588595) loss:2.846 lr:0.0000100 epoch_Time:23241.0min: [2024-01-06 21:22:34,470][model8_pretrain.py][INFO] Epoch:[0/2](939400/4588595) loss:2.932 lr:0.0000100 epoch_Time:23241.0min: [2024-01-06 21:22:34,470][model8_pretrain.py][INFO] Epoch:[0/2](939400/4588595) loss:3.091 lr:0.0000100 epoch_Time:23241.0min: [2024-01-06 21:22:34,470][model8_pretrain.py][INFO] Epoch:[0/2](939400/4588595) loss:2.799 lr:0.0000100 epoch_Time:23241.0min: [2024-01-06 
21:22:34,470][model8_pretrain.py][INFO] Epoch:[0/2](939400/4588595) loss:2.379 lr:0.0000100 epoch_Time:23241.0min: [2024-01-06 21:22:34,470][model8_pretrain.py][INFO] Epoch:[0/2](939400/4588595) loss:3.191 lr:0.0000100 epoch_Time:23241.0min: [2024-01-06 21:22:34,470][model8_pretrain.py][INFO] Epoch:[0/2](939400/4588595) loss:2.920 lr:0.0000100 epoch_Time:23241.0min: [2024-01-06 21:22:34,470][model8_pretrain.py][INFO] Epoch:[0/2](939400/4588595) loss:2.661 lr:0.0000100 epoch_Time:23241.0min: [2024-01-06 21:23:21,520][model8_pretrain.py][INFO] Epoch:[0/2](939500/4588595) loss:2.366 lr:0.0000100 epoch_Time:23241.0min: [2024-01-06 21:23:21,520][model8_pretrain.py][INFO] Epoch:[0/2](939500/4588595) loss:2.871 lr:0.0000100 epoch_Time:23241.0min: [2024-01-06 21:23:21,521][model8_pretrain.py][INFO] Epoch:[0/2](939500/4588595) loss:2.822 lr:0.0000100 epoch_Time:23241.0min: [2024-01-06 21:23:21,521][model8_pretrain.py][INFO] Epoch:[0/2](939500/4588595) loss:3.149 lr:0.0000100 epoch_Time:23241.0min: [2024-01-06 21:23:21,521][model8_pretrain.py][INFO] Epoch:[0/2](939500/4588595) loss:2.761 lr:0.0000100 epoch_Time:23241.0min: [2024-01-06 21:23:21,521][model8_pretrain.py][INFO] Epoch:[0/2](939500/4588595) loss:2.680 lr:0.0000100 epoch_Time:23241.0min: [2024-01-06 21:23:21,521][model8_pretrain.py][INFO] Epoch:[0/2](939500/4588595) loss:3.197 lr:0.0000100 epoch_Time:23241.0min: [2024-01-06 21:23:21,521][model8_pretrain.py][INFO] Epoch:[0/2](939500/4588595) loss:2.109 lr:0.0000100 epoch_Time:23241.0min: [2024-01-06 21:23:58,458][model8_pretrain.py][INFO] Epoch:[0/2](939600/4588595) loss:3.316 lr:0.0000100 epoch_Time:23240.0min: [2024-01-06 21:23:58,458][model8_pretrain.py][INFO] Epoch:[0/2](939600/4588595) loss:2.718 lr:0.0000100 epoch_Time:23240.0min: [2024-01-06 21:23:58,458][model8_pretrain.py][INFO] Epoch:[0/2](939600/4588595) loss:2.972 lr:0.0000100 epoch_Time:23240.0min: [2024-01-06 21:23:58,458][model8_pretrain.py][INFO] Epoch:[0/2](939600/4588595) loss:2.867 lr:0.0000100 
epoch_Time:23240.0min: [2024-01-06 21:23:58,458][model8_pretrain.py][INFO] Epoch:[0/2](939600/4588595) loss:2.367 lr:0.0000100 epoch_Time:23240.0min: [2024-01-06 21:23:58,458][model8_pretrain.py][INFO] Epoch:[0/2](939600/4588595) loss:2.853 lr:0.0000100 epoch_Time:23240.0min: [2024-01-06 21:23:58,458][model8_pretrain.py][INFO] Epoch:[0/2](939600/4588595) loss:2.767 lr:0.0000100 epoch_Time:23240.0min: [2024-01-06 21:23:58,458][model8_pretrain.py][INFO] Epoch:[0/2](939600/4588595) loss:2.791 lr:0.0000100 epoch_Time:23240.0min: [2024-01-06 21:24:35,398][model8_pretrain.py][INFO] Epoch:[0/2](939700/4588595) loss:2.619 lr:0.0000100 epoch_Time:23239.0min: [2024-01-06 21:24:35,398][model8_pretrain.py][INFO] Epoch:[0/2](939700/4588595) loss:2.682 lr:0.0000100 epoch_Time:23239.0min: [2024-01-06 21:24:35,398][model8_pretrain.py][INFO] Epoch:[0/2](939700/4588595) loss:2.657 lr:0.0000100 epoch_Time:23239.0min: [2024-01-06 21:24:35,398][model8_pretrain.py][INFO] Epoch:[0/2](939700/4588595) loss:2.770 lr:0.0000100 epoch_Time:23239.0min: [2024-01-06 21:24:35,398][model8_pretrain.py][INFO] Epoch:[0/2](939700/4588595) loss:3.187 lr:0.0000100 epoch_Time:23239.0min: [2024-01-06 21:24:35,398][model8_pretrain.py][INFO] Epoch:[0/2](939700/4588595) loss:2.734 lr:0.0000100 epoch_Time:23239.0min: [2024-01-06 21:24:35,398][model8_pretrain.py][INFO] Epoch:[0/2](939700/4588595) loss:2.818 lr:0.0000100 epoch_Time:23239.0min: [2024-01-06 21:24:35,398][model8_pretrain.py][INFO] Epoch:[0/2](939700/4588595) loss:3.039 lr:0.0000100 epoch_Time:23239.0min: [2024-01-06 21:25:12,337][model8_pretrain.py][INFO] Epoch:[0/2](939800/4588595) loss:2.811 lr:0.0000100 epoch_Time:23238.0min: [2024-01-06 21:25:12,337][model8_pretrain.py][INFO] Epoch:[0/2](939800/4588595) loss:2.946 lr:0.0000100 epoch_Time:23238.0min: [2024-01-06 21:25:12,337][model8_pretrain.py][INFO] Epoch:[0/2](939800/4588595) loss:2.350 lr:0.0000100 epoch_Time:23238.0min: [2024-01-06 21:25:12,337][model8_pretrain.py][INFO] 
Epoch:[0/2](939800/4588595) loss:2.936 lr:0.0000100 epoch_Time:23238.0min: [2024-01-06 21:25:12,337][model8_pretrain.py][INFO] Epoch:[0/2](939800/4588595) loss:2.827 lr:0.0000100 epoch_Time:23238.0min: [2024-01-06 21:25:12,337][model8_pretrain.py][INFO] Epoch:[0/2](939800/4588595) loss:2.881 lr:0.0000100 epoch_Time:23238.0min: [2024-01-06 21:25:12,337][model8_pretrain.py][INFO] Epoch:[0/2](939800/4588595) loss:2.610 lr:0.0000100 epoch_Time:23238.0min: [2024-01-06 21:25:12,337][model8_pretrain.py][INFO] Epoch:[0/2](939800/4588595) loss:3.014 lr:0.0000100 epoch_Time:23238.0min: [2024-01-06 21:25:49,285][model8_pretrain.py][INFO] Epoch:[0/2](939900/4588595) loss:1.771 lr:0.0000100 epoch_Time:23237.0min: [2024-01-06 21:25:49,285][model8_pretrain.py][INFO] Epoch:[0/2](939900/4588595) loss:2.805 lr:0.0000100 epoch_Time:23237.0min: [2024-01-06 21:25:49,285][model8_pretrain.py][INFO] Epoch:[0/2](939900/4588595) loss:2.959 lr:0.0000100 epoch_Time:23237.0min: [2024-01-06 21:25:49,285][model8_pretrain.py][INFO] Epoch:[0/2](939900/4588595) loss:2.861 lr:0.0000100 epoch_Time:23237.0min: [2024-01-06 21:25:49,285][model8_pretrain.py][INFO] Epoch:[0/2](939900/4588595) loss:2.564 lr:0.0000100 epoch_Time:23237.0min: [2024-01-06 21:25:49,285][model8_pretrain.py][INFO] Epoch:[0/2](939900/4588595) loss:2.864 lr:0.0000100 epoch_Time:23237.0min: [2024-01-06 21:25:49,286][model8_pretrain.py][INFO] Epoch:[0/2](939900/4588595) loss:2.721 lr:0.0000100 epoch_Time:23237.0min: [2024-01-06 21:25:49,286][model8_pretrain.py][INFO] Epoch:[0/2](939900/4588595) loss:2.962 lr:0.0000100 epoch_Time:23237.0min: [2024-01-06 21:26:26,227][model8_pretrain.py][INFO] Epoch:[0/2](940000/4588595) loss:2.579 lr:0.0000100 epoch_Time:23237.0min: [2024-01-06 21:26:26,227][model8_pretrain.py][INFO] Epoch:[0/2](940000/4588595) loss:3.059 lr:0.0000100 epoch_Time:23237.0min: [2024-01-06 21:26:26,227][model8_pretrain.py][INFO] Epoch:[0/2](940000/4588595) loss:2.740 lr:0.0000100 epoch_Time:23237.0min: [2024-01-06 
21:26:26,227][model8_pretrain.py][INFO] Epoch:[0/2](940000/4588595) loss:2.990 lr:0.0000100 epoch_Time:23237.0min: [2024-01-06 21:26:26,227][model8_pretrain.py][INFO] Epoch:[0/2](940000/4588595) loss:3.019 lr:0.0000100 epoch_Time:23237.0min: [2024-01-06 21:26:26,228][model8_pretrain.py][INFO] Epoch:[0/2](940000/4588595) loss:2.688 lr:0.0000100 epoch_Time:23237.0min: [2024-01-06 21:26:26,228][model8_pretrain.py][INFO] Epoch:[0/2](940000/4588595) loss:3.156 lr:0.0000100 epoch_Time:23237.0min: [2024-01-06 21:26:26,228][model8_pretrain.py][INFO] Epoch:[0/2](940000/4588595) loss:2.703 lr:0.0000100 epoch_Time:23237.0min: [2024-01-06 21:27:03,174][model8_pretrain.py][INFO] Epoch:[0/2](940100/4588595) loss:2.875 lr:0.0000100 epoch_Time:23236.0min: [2024-01-06 21:27:03,174][model8_pretrain.py][INFO] Epoch:[0/2](940100/4588595) loss:2.308 lr:0.0000100 epoch_Time:23236.0min: [2024-01-06 21:27:03,174][model8_pretrain.py][INFO] Epoch:[0/2](940100/4588595) loss:2.636 lr:0.0000100 epoch_Time:23236.0min: [2024-01-06 21:27:03,174][model8_pretrain.py][INFO] Epoch:[0/2](940100/4588595) loss:2.699 lr:0.0000100 epoch_Time:23236.0min: [2024-01-06 21:27:03,174][model8_pretrain.py][INFO] Epoch:[0/2](940100/4588595) loss:2.169 lr:0.0000100 epoch_Time:23236.0min: [2024-01-06 21:27:03,174][model8_pretrain.py][INFO] Epoch:[0/2](940100/4588595) loss:2.920 lr:0.0000100 epoch_Time:23236.0min: [2024-01-06 21:27:03,175][model8_pretrain.py][INFO] Epoch:[0/2](940100/4588595) loss:2.376 lr:0.0000100 epoch_Time:23236.0min: [2024-01-06 21:27:03,175][model8_pretrain.py][INFO] Epoch:[0/2](940100/4588595) loss:3.422 lr:0.0000100 epoch_Time:23236.0min: [2024-01-06 21:27:41,597][model8_pretrain.py][INFO] Epoch:[0/2](940200/4588595) loss:2.521 lr:0.0000100 epoch_Time:23236.0min: [2024-01-06 21:27:41,597][model8_pretrain.py][INFO] Epoch:[0/2](940200/4588595) loss:2.939 lr:0.0000100 epoch_Time:23236.0min: [2024-01-06 21:27:41,597][model8_pretrain.py][INFO] Epoch:[0/2](940200/4588595) loss:3.249 lr:0.0000100 
epoch_Time:23236.0min: [2024-01-06 21:27:41,597][model8_pretrain.py][INFO] Epoch:[0/2](940200/4588595) loss:2.903 lr:0.0000100 epoch_Time:23236.0min: [2024-01-06 21:27:41,597][model8_pretrain.py][INFO] Epoch:[0/2](940200/4588595) loss:2.750 lr:0.0000100 epoch_Time:23236.0min: [2024-01-06 21:27:41,597][model8_pretrain.py][INFO] Epoch:[0/2](940200/4588595) loss:2.813 lr:0.0000100 epoch_Time:23236.0min: [2024-01-06 21:27:41,597][model8_pretrain.py][INFO] Epoch:[0/2](940200/4588595) loss:2.416 lr:0.0000100 epoch_Time:23236.0min: [2024-01-06 21:27:41,597][model8_pretrain.py][INFO] Epoch:[0/2](940200/4588595) loss:2.746 lr:0.0000100 epoch_Time:23236.0min: [2024-01-06 21:28:30,474][model8_pretrain.py][INFO] Epoch:[0/2](940300/4588595) loss:3.055 lr:0.0000100 epoch_Time:23236.0min: [2024-01-06 21:28:30,474][model8_pretrain.py][INFO] Epoch:[0/2](940300/4588595) loss:2.720 lr:0.0000100 epoch_Time:23236.0min: [2024-01-06 21:28:30,474][model8_pretrain.py][INFO] Epoch:[0/2](940300/4588595) loss:2.805 lr:0.0000100 epoch_Time:23236.0min: [2024-01-06 21:28:30,474][model8_pretrain.py][INFO] Epoch:[0/2](940300/4588595) loss:2.214 lr:0.0000100 epoch_Time:23236.0min: [2024-01-06 21:28:30,474][model8_pretrain.py][INFO] Epoch:[0/2](940300/4588595) loss:3.383 lr:0.0000100 epoch_Time:23236.0min: [2024-01-06 21:28:30,474][model8_pretrain.py][INFO] Epoch:[0/2](940300/4588595) loss:2.718 lr:0.0000100 epoch_Time:23236.0min: [2024-01-06 21:28:30,474][model8_pretrain.py][INFO] Epoch:[0/2](940300/4588595) loss:2.535 lr:0.0000100 epoch_Time:23236.0min: [2024-01-06 21:28:30,474][model8_pretrain.py][INFO] Epoch:[0/2](940300/4588595) loss:2.563 lr:0.0000100 epoch_Time:23236.0min: [2024-01-06 21:29:07,414][model8_pretrain.py][INFO] Epoch:[0/2](940400/4588595) loss:2.962 lr:0.0000100 epoch_Time:23235.0min: [2024-01-06 21:29:07,414][model8_pretrain.py][INFO] Epoch:[0/2](940400/4588595) loss:3.268 lr:0.0000100 epoch_Time:23235.0min: [2024-01-06 21:29:07,414][model8_pretrain.py][INFO] 
Epoch:[0/2](940400/4588595) loss:3.161 lr:0.0000100 epoch_Time:23235.0min: [2024-01-06 21:29:07,414][model8_pretrain.py][INFO] Epoch:[0/2](940400/4588595) loss:2.974 lr:0.0000100 epoch_Time:23235.0min: [2024-01-06 21:29:07,415][model8_pretrain.py][INFO] Epoch:[0/2](940400/4588595) loss:3.252 lr:0.0000100 epoch_Time:23235.0min: [2024-01-06 21:29:07,415][model8_pretrain.py][INFO] Epoch:[0/2](940400/4588595) loss:2.994 lr:0.0000100 epoch_Time:23235.0min: [2024-01-06 21:29:07,415][model8_pretrain.py][INFO] Epoch:[0/2](940400/4588595) loss:3.099 lr:0.0000100 epoch_Time:23235.0min: [2024-01-06 21:29:07,415][model8_pretrain.py][INFO] Epoch:[0/2](940400/4588595) loss:2.849 lr:0.0000100 epoch_Time:23235.0min: [2024-01-06 21:29:44,367][model8_pretrain.py][INFO] Epoch:[0/2](940500/4588595) loss:3.129 lr:0.0000100 epoch_Time:23235.0min: [2024-01-06 21:29:44,367][model8_pretrain.py][INFO] Epoch:[0/2](940500/4588595) loss:2.451 lr:0.0000100 epoch_Time:23235.0min: [2024-01-06 21:29:44,367][model8_pretrain.py][INFO] Epoch:[0/2](940500/4588595) loss:2.478 lr:0.0000100 epoch_Time:23235.0min: [2024-01-06 21:29:44,367][model8_pretrain.py][INFO] Epoch:[0/2](940500/4588595) loss:2.826 lr:0.0000100 epoch_Time:23235.0min: [2024-01-06 21:29:44,367][model8_pretrain.py][INFO] Epoch:[0/2](940500/4588595) loss:2.713 lr:0.0000100 epoch_Time:23235.0min: [2024-01-06 21:29:44,367][model8_pretrain.py][INFO] Epoch:[0/2](940500/4588595) loss:2.171 lr:0.0000100 epoch_Time:23235.0min: [2024-01-06 21:29:44,367][model8_pretrain.py][INFO] Epoch:[0/2](940500/4588595) loss:2.864 lr:0.0000100 epoch_Time:23235.0min: [2024-01-06 21:29:44,367][model8_pretrain.py][INFO] Epoch:[0/2](940500/4588595) loss:3.005 lr:0.0000100 epoch_Time:23235.0min: [2024-01-06 21:30:21,315][model8_pretrain.py][INFO] Epoch:[0/2](940600/4588595) loss:2.669 lr:0.0000100 epoch_Time:23234.0min: [2024-01-06 21:30:21,315][model8_pretrain.py][INFO] Epoch:[0/2](940600/4588595) loss:2.670 lr:0.0000100 epoch_Time:23234.0min: [2024-01-06 
21:30:21,315][model8_pretrain.py][INFO] Epoch:[0/2](940600/4588595) loss:2.989 lr:0.0000100 epoch_Time:23234.0min: [2024-01-06 21:30:21,315][model8_pretrain.py][INFO] Epoch:[0/2](940600/4588595) loss:2.361 lr:0.0000100 epoch_Time:23234.0min: [2024-01-06 21:30:21,315][model8_pretrain.py][INFO] Epoch:[0/2](940600/4588595) loss:2.828 lr:0.0000100 epoch_Time:23234.0min: [2024-01-06 21:30:21,315][model8_pretrain.py][INFO] Epoch:[0/2](940600/4588595) loss:2.332 lr:0.0000100 epoch_Time:23234.0min: [2024-01-06 21:30:21,315][model8_pretrain.py][INFO] Epoch:[0/2](940600/4588595) loss:3.167 lr:0.0000100 epoch_Time:23234.0min: [2024-01-06 21:30:21,315][model8_pretrain.py][INFO] Epoch:[0/2](940600/4588595) loss:2.911 lr:0.0000100 epoch_Time:23234.0min: [2024-01-06 21:30:58,267][model8_pretrain.py][INFO] Epoch:[0/2](940700/4588595) loss:3.095 lr:0.0000100 epoch_Time:23233.0min: [2024-01-06 21:30:58,267][model8_pretrain.py][INFO] Epoch:[0/2](940700/4588595) loss:2.559 lr:0.0000100 epoch_Time:23233.0min: [2024-01-06 21:30:58,268][model8_pretrain.py][INFO] Epoch:[0/2](940700/4588595) loss:3.166 lr:0.0000100 epoch_Time:23233.0min: [2024-01-06 21:30:58,268][model8_pretrain.py][INFO] Epoch:[0/2](940700/4588595) loss:1.789 lr:0.0000100 epoch_Time:23233.0min: [2024-01-06 21:30:58,268][model8_pretrain.py][INFO] Epoch:[0/2](940700/4588595) loss:2.433 lr:0.0000100 epoch_Time:23233.0min: [2024-01-06 21:30:58,268][model8_pretrain.py][INFO] Epoch:[0/2](940700/4588595) loss:2.822 lr:0.0000100 epoch_Time:23233.0min: [2024-01-06 21:30:58,267][model8_pretrain.py][INFO] Epoch:[0/2](940700/4588595) loss:2.271 lr:0.0000100 epoch_Time:23233.0min: [2024-01-06 21:30:58,268][model8_pretrain.py][INFO] Epoch:[0/2](940700/4588595) loss:2.687 lr:0.0000100 epoch_Time:23233.0min: [2024-01-06 21:31:35,214][model8_pretrain.py][INFO] Epoch:[0/2](940800/4588595) loss:2.195 lr:0.0000100 epoch_Time:23232.0min: [2024-01-06 21:31:35,214][model8_pretrain.py][INFO] Epoch:[0/2](940800/4588595) loss:3.156 lr:0.0000100 
epoch_Time:23232.0min: [2024-01-06 21:31:35,214][model8_pretrain.py][INFO] Epoch:[0/2](940800/4588595) loss:3.368 lr:0.0000100 epoch_Time:23232.0min: [2024-01-06 21:31:35,214][model8_pretrain.py][INFO] Epoch:[0/2](940800/4588595) loss:2.536 lr:0.0000100 epoch_Time:23232.0min: [2024-01-06 21:31:35,214][model8_pretrain.py][INFO] Epoch:[0/2](940800/4588595) loss:2.454 lr:0.0000100 epoch_Time:23232.0min: [2024-01-06 21:31:35,214][model8_pretrain.py][INFO] Epoch:[0/2](940800/4588595) loss:2.849 lr:0.0000100 epoch_Time:23232.0min: [2024-01-06 21:31:35,214][model8_pretrain.py][INFO] Epoch:[0/2](940800/4588595) loss:2.986 lr:0.0000100 epoch_Time:23232.0min: [2024-01-06 21:31:35,214][model8_pretrain.py][INFO] Epoch:[0/2](940800/4588595) loss:2.509 lr:0.0000100 epoch_Time:23232.0min: [2024-01-06 21:32:12,175][model8_pretrain.py][INFO] Epoch:[0/2](940900/4588595) loss:2.329 lr:0.0000100 epoch_Time:23231.0min: [2024-01-06 21:32:12,175][model8_pretrain.py][INFO] Epoch:[0/2](940900/4588595) loss:2.744 lr:0.0000100 epoch_Time:23231.0min: [2024-01-06 21:32:12,175][model8_pretrain.py][INFO] Epoch:[0/2](940900/4588595) loss:2.451 lr:0.0000100 epoch_Time:23231.0min: [2024-01-06 21:32:12,175][model8_pretrain.py][INFO] Epoch:[0/2](940900/4588595) loss:2.553 lr:0.0000100 epoch_Time:23231.0min: [2024-01-06 21:32:12,175][model8_pretrain.py][INFO] Epoch:[0/2](940900/4588595) loss:2.633 lr:0.0000100 epoch_Time:23231.0min: [2024-01-06 21:32:12,175][model8_pretrain.py][INFO] Epoch:[0/2](940900/4588595) loss:3.167 lr:0.0000100 epoch_Time:23231.0min: [2024-01-06 21:32:12,175][model8_pretrain.py][INFO] Epoch:[0/2](940900/4588595) loss:3.352 lr:0.0000100 epoch_Time:23231.0min: [2024-01-06 21:32:12,175][model8_pretrain.py][INFO] Epoch:[0/2](940900/4588595) loss:2.781 lr:0.0000100 epoch_Time:23231.0min: [2024-01-06 21:32:50,671][model8_pretrain.py][INFO] Epoch:[0/2](941000/4588595) loss:3.148 lr:0.0000100 epoch_Time:23230.0min: [2024-01-06 21:32:50,672][model8_pretrain.py][INFO] 
Epoch:[0/2](941000/4588595) loss:2.916 lr:0.0000100 epoch_Time:23230.0min: [2024-01-06 21:32:50,672][model8_pretrain.py][INFO] Epoch:[0/2](941000/4588595) loss:3.100 lr:0.0000100 epoch_Time:23230.0min: [2024-01-06 21:32:50,672][model8_pretrain.py][INFO] Epoch:[0/2](941000/4588595) loss:2.701 lr:0.0000100 epoch_Time:23230.0min: [2024-01-06 21:32:50,672][model8_pretrain.py][INFO] Epoch:[0/2](941000/4588595) loss:2.248 lr:0.0000100 epoch_Time:23230.0min: [2024-01-06 21:32:50,672][model8_pretrain.py][INFO] Epoch:[0/2](941000/4588595) loss:2.924 lr:0.0000100 epoch_Time:23230.0min: [2024-01-06 21:32:50,672][model8_pretrain.py][INFO] Epoch:[0/2](941000/4588595) loss:2.872 lr:0.0000100 epoch_Time:23230.0min: [2024-01-06 21:32:50,672][model8_pretrain.py][INFO] Epoch:[0/2](941000/4588595) loss:2.853 lr:0.0000100 epoch_Time:23230.0min: [2024-01-06 21:33:39,524][model8_pretrain.py][INFO] Epoch:[0/2](941100/4588595) loss:3.020 lr:0.0000100 epoch_Time:23231.0min: [2024-01-06 21:33:39,524][model8_pretrain.py][INFO] Epoch:[0/2](941100/4588595) loss:2.675 lr:0.0000100 epoch_Time:23231.0min: [2024-01-06 21:33:39,524][model8_pretrain.py][INFO] Epoch:[0/2](941100/4588595) loss:2.676 lr:0.0000100 epoch_Time:23231.0min: [2024-01-06 21:33:39,524][model8_pretrain.py][INFO] Epoch:[0/2](941100/4588595) loss:2.982 lr:0.0000100 epoch_Time:23231.0min: [2024-01-06 21:33:39,524][model8_pretrain.py][INFO] Epoch:[0/2](941100/4588595) loss:2.786 lr:0.0000100 epoch_Time:23231.0min: [2024-01-06 21:33:39,524][model8_pretrain.py][INFO] Epoch:[0/2](941100/4588595) loss:3.013 lr:0.0000100 epoch_Time:23231.0min: [2024-01-06 21:33:39,524][model8_pretrain.py][INFO] Epoch:[0/2](941100/4588595) loss:2.657 lr:0.0000100 epoch_Time:23231.0min: [2024-01-06 21:33:39,524][model8_pretrain.py][INFO] Epoch:[0/2](941100/4588595) loss:3.359 lr:0.0000100 epoch_Time:23231.0min: [2024-01-06 21:34:16,467][model8_pretrain.py][INFO] Epoch:[0/2](941200/4588595) loss:2.638 lr:0.0000100 epoch_Time:23230.0min: [2024-01-06 
21:34:16,467][model8_pretrain.py][INFO] Epoch:[0/2](941200/4588595) loss:2.814 lr:0.0000100 epoch_Time:23230.0min: [2024-01-06 21:34:16,467][model8_pretrain.py][INFO] Epoch:[0/2](941200/4588595) loss:3.035 lr:0.0000100 epoch_Time:23230.0min: [2024-01-06 21:34:16,467][model8_pretrain.py][INFO] Epoch:[0/2](941200/4588595) loss:2.831 lr:0.0000100 epoch_Time:23230.0min: [2024-01-06 21:34:16,467][model8_pretrain.py][INFO] Epoch:[0/2](941200/4588595) loss:2.618 lr:0.0000100 epoch_Time:23230.0min: [2024-01-06 21:34:16,467][model8_pretrain.py][INFO] Epoch:[0/2](941200/4588595) loss:3.199 lr:0.0000100 epoch_Time:23230.0min: [2024-01-06 21:34:16,467][model8_pretrain.py][INFO] Epoch:[0/2](941200/4588595) loss:2.864 lr:0.0000100 epoch_Time:23230.0min: [2024-01-06 21:34:16,467][model8_pretrain.py][INFO] Epoch:[0/2](941200/4588595) loss:2.797 lr:0.0000100 epoch_Time:23230.0min: [2024-01-06 21:34:53,414][model8_pretrain.py][INFO] Epoch:[0/2](941300/4588595) loss:2.697 lr:0.0000100 epoch_Time:23229.0min: [2024-01-06 21:34:53,414][model8_pretrain.py][INFO] Epoch:[0/2](941300/4588595) loss:2.461 lr:0.0000100 epoch_Time:23229.0min: [2024-01-06 21:34:53,414][model8_pretrain.py][INFO] Epoch:[0/2](941300/4588595) loss:2.764 lr:0.0000100 epoch_Time:23229.0min: [2024-01-06 21:34:53,414][model8_pretrain.py][INFO] Epoch:[0/2](941300/4588595) loss:2.193 lr:0.0000100 epoch_Time:23229.0min: [2024-01-06 21:34:53,414][model8_pretrain.py][INFO] Epoch:[0/2](941300/4588595) loss:2.864 lr:0.0000100 epoch_Time:23229.0min: [2024-01-06 21:34:53,414][model8_pretrain.py][INFO] Epoch:[0/2](941300/4588595) loss:2.607 lr:0.0000100 epoch_Time:23229.0min: [2024-01-06 21:34:53,414][model8_pretrain.py][INFO] Epoch:[0/2](941300/4588595) loss:2.905 lr:0.0000100 epoch_Time:23229.0min: [2024-01-06 21:34:53,414][model8_pretrain.py][INFO] Epoch:[0/2](941300/4588595) loss:2.981 lr:0.0000100 epoch_Time:23229.0min: [2024-01-06 21:35:30,364][model8_pretrain.py][INFO] Epoch:[0/2](941400/4588595) loss:2.934 lr:0.0000100 
epoch_Time:23229.0min: [2024-01-06 21:35:30,364][model8_pretrain.py][INFO] Epoch:[0/2](941400/4588595) loss:3.098 lr:0.0000100 epoch_Time:23229.0min: [2024-01-06 21:35:30,364][model8_pretrain.py][INFO] Epoch:[0/2](941400/4588595) loss:2.004 lr:0.0000100 epoch_Time:23229.0min: [2024-01-06 21:35:30,364][model8_pretrain.py][INFO] Epoch:[0/2](941400/4588595) loss:2.586 lr:0.0000100 epoch_Time:23229.0min: [2024-01-06 21:35:30,364][model8_pretrain.py][INFO] Epoch:[0/2](941400/4588595) loss:2.566 lr:0.0000100 epoch_Time:23229.0min: [2024-01-06 21:35:30,364][model8_pretrain.py][INFO] Epoch:[0/2](941400/4588595) loss:3.358 lr:0.0000100 epoch_Time:23229.0min: [2024-01-06 21:35:30,364][model8_pretrain.py][INFO] Epoch:[0/2](941400/4588595) loss:2.293 lr:0.0000100 epoch_Time:23229.0min: [2024-01-06 21:35:30,365][model8_pretrain.py][INFO] Epoch:[0/2](941400/4588595) loss:3.459 lr:0.0000100 epoch_Time:23229.0min: [2024-01-06 21:36:07,309][model8_pretrain.py][INFO] Epoch:[0/2](941500/4588595) loss:2.165 lr:0.0000100 epoch_Time:23228.0min: [2024-01-06 21:36:07,309][model8_pretrain.py][INFO] Epoch:[0/2](941500/4588595) loss:2.498 lr:0.0000100 epoch_Time:23228.0min: [2024-01-06 21:36:07,309][model8_pretrain.py][INFO] Epoch:[0/2](941500/4588595) loss:2.610 lr:0.0000100 epoch_Time:23228.0min: [2024-01-06 21:36:07,309][model8_pretrain.py][INFO] Epoch:[0/2](941500/4588595) loss:2.336 lr:0.0000100 epoch_Time:23228.0min: [2024-01-06 21:36:07,309][model8_pretrain.py][INFO] Epoch:[0/2](941500/4588595) loss:2.917 lr:0.0000100 epoch_Time:23228.0min: [2024-01-06 21:36:07,309][model8_pretrain.py][INFO] Epoch:[0/2](941500/4588595) loss:2.147 lr:0.0000100 epoch_Time:23228.0min: [2024-01-06 21:36:07,309][model8_pretrain.py][INFO] Epoch:[0/2](941500/4588595) loss:2.554 lr:0.0000100 epoch_Time:23228.0min: [2024-01-06 21:36:07,310][model8_pretrain.py][INFO] Epoch:[0/2](941500/4588595) loss:2.713 lr:0.0000100 epoch_Time:23228.0min: [2024-01-06 21:36:44,290][model8_pretrain.py][INFO] 
Epoch:[0/2](941600/4588595) loss:2.430 lr:0.0000100 epoch_Time:23228.0min: [2024-01-06 21:36:44,290][model8_pretrain.py][INFO] Epoch:[0/2](941600/4588595) loss:3.104 lr:0.0000100 epoch_Time:23228.0min: [2024-01-06 21:36:44,290][model8_pretrain.py][INFO] Epoch:[0/2](941600/4588595) loss:2.579 lr:0.0000100 epoch_Time:23228.0min: [2024-01-06 21:36:44,290][model8_pretrain.py][INFO] Epoch:[0/2](941600/4588595) loss:2.849 lr:0.0000100 epoch_Time:23228.0min: [2024-01-06 21:36:44,290][model8_pretrain.py][INFO] Epoch:[0/2](941600/4588595) loss:2.468 lr:0.0000100 epoch_Time:23228.0min: [2024-01-06 21:36:44,290][model8_pretrain.py][INFO] Epoch:[0/2](941600/4588595) loss:1.815 lr:0.0000100 epoch_Time:23228.0min: [2024-01-06 21:36:44,290][model8_pretrain.py][INFO] Epoch:[0/2](941600/4588595) loss:2.335 lr:0.0000100 epoch_Time:23228.0min: [2024-01-06 21:36:44,290][model8_pretrain.py][INFO] Epoch:[0/2](941600/4588595) loss:2.743 lr:0.0000100 epoch_Time:23228.0min: [2024-01-06 21:37:21,256][model8_pretrain.py][INFO] Epoch:[0/2](941700/4588595) loss:2.096 lr:0.0000100 epoch_Time:23227.0min: [2024-01-06 21:37:21,256][model8_pretrain.py][INFO] Epoch:[0/2](941700/4588595) loss:2.641 lr:0.0000100 epoch_Time:23227.0min: [2024-01-06 21:37:21,256][model8_pretrain.py][INFO] Epoch:[0/2](941700/4588595) loss:3.011 lr:0.0000100 epoch_Time:23227.0min: [2024-01-06 21:37:21,256][model8_pretrain.py][INFO] Epoch:[0/2](941700/4588595) loss:2.517 lr:0.0000100 epoch_Time:23227.0min: [2024-01-06 21:37:21,256][model8_pretrain.py][INFO] Epoch:[0/2](941700/4588595) loss:2.265 lr:0.0000100 epoch_Time:23227.0min: [2024-01-06 21:37:21,256][model8_pretrain.py][INFO] Epoch:[0/2](941700/4588595) loss:2.662 lr:0.0000100 epoch_Time:23227.0min: [2024-01-06 21:37:21,256][model8_pretrain.py][INFO] Epoch:[0/2](941700/4588595) loss:2.525 lr:0.0000100 epoch_Time:23227.0min: [2024-01-06 21:37:21,256][model8_pretrain.py][INFO] Epoch:[0/2](941700/4588595) loss:2.660 lr:0.0000100 epoch_Time:23227.0min: [2024-01-06 
21:37:59,735][model8_pretrain.py][INFO] Epoch:[0/2](941800/4588595) loss:3.255 lr:0.0000100 epoch_Time:23226.0min: [2024-01-06 21:37:59,735][model8_pretrain.py][INFO] Epoch:[0/2](941800/4588595) loss:3.021 lr:0.0000100 epoch_Time:23226.0min: [2024-01-06 21:37:59,735][model8_pretrain.py][INFO] Epoch:[0/2](941800/4588595) loss:3.198 lr:0.0000100 epoch_Time:23226.0min: [2024-01-06 21:37:59,735][model8_pretrain.py][INFO] Epoch:[0/2](941800/4588595) loss:2.744 lr:0.0000100 epoch_Time:23226.0min: [2024-01-06 21:37:59,735][model8_pretrain.py][INFO] Epoch:[0/2](941800/4588595) loss:2.562 lr:0.0000100 epoch_Time:23226.0min: [2024-01-06 21:37:59,735][model8_pretrain.py][INFO] Epoch:[0/2](941800/4588595) loss:2.968 lr:0.0000100 epoch_Time:23226.0min: [2024-01-06 21:37:59,735][model8_pretrain.py][INFO] Epoch:[0/2](941800/4588595) loss:2.587 lr:0.0000100 epoch_Time:23226.0min: [2024-01-06 21:37:59,735][model8_pretrain.py][INFO] Epoch:[0/2](941800/4588595) loss:2.929 lr:0.0000100 epoch_Time:23226.0min: [2024-01-06 21:38:48,663][model8_pretrain.py][INFO] Epoch:[0/2](941900/4588595) loss:2.610 lr:0.0000100 epoch_Time:23226.0min: [2024-01-06 21:38:48,663][model8_pretrain.py][INFO] Epoch:[0/2](941900/4588595) loss:2.328 lr:0.0000100 epoch_Time:23226.0min: [2024-01-06 21:38:48,663][model8_pretrain.py][INFO] Epoch:[0/2](941900/4588595) loss:2.852 lr:0.0000100 epoch_Time:23226.0min: [2024-01-06 21:38:48,663][model8_pretrain.py][INFO] Epoch:[0/2](941900/4588595) loss:2.561 lr:0.0000100 epoch_Time:23226.0min: [2024-01-06 21:38:48,663][model8_pretrain.py][INFO] Epoch:[0/2](941900/4588595) loss:3.215 lr:0.0000100 epoch_Time:23226.0min: [2024-01-06 21:38:48,663][model8_pretrain.py][INFO] Epoch:[0/2](941900/4588595) loss:2.638 lr:0.0000100 epoch_Time:23226.0min: [2024-01-06 21:38:48,663][model8_pretrain.py][INFO] Epoch:[0/2](941900/4588595) loss:3.131 lr:0.0000100 epoch_Time:23226.0min: [2024-01-06 21:38:48,663][model8_pretrain.py][INFO] Epoch:[0/2](941900/4588595) loss:3.081 lr:0.0000100 
epoch_Time:23226.0min: [2024-01-06 21:39:25,592][model8_pretrain.py][INFO] Epoch:[0/2](942000/4588595) loss:2.208 lr:0.0000100 epoch_Time:23225.0min: [2024-01-06 21:39:25,592][model8_pretrain.py][INFO] Epoch:[0/2](942000/4588595) loss:3.195 lr:0.0000100 epoch_Time:23225.0min: [2024-01-06 21:39:25,593][model8_pretrain.py][INFO] Epoch:[0/2](942000/4588595) loss:2.902 lr:0.0000100 epoch_Time:23225.0min: [2024-01-06 21:39:25,593][model8_pretrain.py][INFO] Epoch:[0/2](942000/4588595) loss:2.776 lr:0.0000100 epoch_Time:23225.0min: [2024-01-06 21:39:25,592][model8_pretrain.py][INFO] Epoch:[0/2](942000/4588595) loss:2.483 lr:0.0000100 epoch_Time:23225.0min: [2024-01-06 21:39:25,592][model8_pretrain.py][INFO] Epoch:[0/2](942000/4588595) loss:3.058 lr:0.0000100 epoch_Time:23225.0min: [2024-01-06 21:39:25,592][model8_pretrain.py][INFO] Epoch:[0/2](942000/4588595) loss:2.711 lr:0.0000100 epoch_Time:23225.0min: [2024-01-06 21:39:25,592][model8_pretrain.py][INFO] Epoch:[0/2](942000/4588595) loss:3.190 lr:0.0000100 epoch_Time:23225.0min: [2024-01-06 21:40:02,530][model8_pretrain.py][INFO] Epoch:[0/2](942100/4588595) loss:2.540 lr:0.0000100 epoch_Time:23224.0min: [2024-01-06 21:40:02,530][model8_pretrain.py][INFO] Epoch:[0/2](942100/4588595) loss:2.546 lr:0.0000100 epoch_Time:23224.0min: [2024-01-06 21:40:02,530][model8_pretrain.py][INFO] Epoch:[0/2](942100/4588595) loss:2.740 lr:0.0000100 epoch_Time:23224.0min: [2024-01-06 21:40:02,530][model8_pretrain.py][INFO] Epoch:[0/2](942100/4588595) loss:3.092 lr:0.0000100 epoch_Time:23224.0min: [2024-01-06 21:40:02,530][model8_pretrain.py][INFO] Epoch:[0/2](942100/4588595) loss:2.265 lr:0.0000100 epoch_Time:23224.0min: [2024-01-06 21:40:02,530][model8_pretrain.py][INFO] Epoch:[0/2](942100/4588595) loss:2.933 lr:0.0000100 epoch_Time:23224.0min: [2024-01-06 21:40:02,530][model8_pretrain.py][INFO] Epoch:[0/2](942100/4588595) loss:2.620 lr:0.0000100 epoch_Time:23224.0min: [2024-01-06 21:40:02,530][model8_pretrain.py][INFO] 
Epoch:[0/2](942100/4588595) loss:3.081 lr:0.0000100 epoch_Time:23224.0min: [2024-01-06 21:40:39,471][model8_pretrain.py][INFO] Epoch:[0/2](942200/4588595) loss:2.377 lr:0.0000100 epoch_Time:23224.0min: [2024-01-06 21:40:39,471][model8_pretrain.py][INFO] Epoch:[0/2](942200/4588595) loss:2.731 lr:0.0000100 epoch_Time:23224.0min: [2024-01-06 21:40:39,471][model8_pretrain.py][INFO] Epoch:[0/2](942200/4588595) loss:2.990 lr:0.0000100 epoch_Time:23224.0min: [2024-01-06 21:40:39,471][model8_pretrain.py][INFO] Epoch:[0/2](942200/4588595) loss:2.788 lr:0.0000100 epoch_Time:23224.0min: [2024-01-06 21:40:39,471][model8_pretrain.py][INFO] Epoch:[0/2](942200/4588595) loss:2.265 lr:0.0000100 epoch_Time:23224.0min: [2024-01-06 21:40:39,471][model8_pretrain.py][INFO] Epoch:[0/2](942200/4588595) loss:3.220 lr:0.0000100 epoch_Time:23224.0min: [2024-01-06 21:40:39,471][model8_pretrain.py][INFO] Epoch:[0/2](942200/4588595) loss:2.746 lr:0.0000100 epoch_Time:23224.0min: [2024-01-06 21:40:39,471][model8_pretrain.py][INFO] Epoch:[0/2](942200/4588595) loss:3.142 lr:0.0000100 epoch_Time:23224.0min: [2024-01-06 21:41:16,400][model8_pretrain.py][INFO] Epoch:[0/2](942300/4588595) loss:2.490 lr:0.0000100 epoch_Time:23223.0min: [2024-01-06 21:41:16,400][model8_pretrain.py][INFO] Epoch:[0/2](942300/4588595) loss:2.683 lr:0.0000100 epoch_Time:23223.0min: [2024-01-06 21:41:16,400][model8_pretrain.py][INFO] Epoch:[0/2](942300/4588595) loss:3.199 lr:0.0000100 epoch_Time:23223.0min: [2024-01-06 21:41:16,400][model8_pretrain.py][INFO] Epoch:[0/2](942300/4588595) loss:3.235 lr:0.0000100 epoch_Time:23223.0min: [2024-01-06 21:41:16,400][model8_pretrain.py][INFO] Epoch:[0/2](942300/4588595) loss:1.809 lr:0.0000100 epoch_Time:23223.0min: [2024-01-06 21:41:16,400][model8_pretrain.py][INFO] Epoch:[0/2](942300/4588595) loss:2.899 lr:0.0000100 epoch_Time:23223.0min: [2024-01-06 21:41:16,400][model8_pretrain.py][INFO] Epoch:[0/2](942300/4588595) loss:2.718 lr:0.0000100 epoch_Time:23223.0min: [2024-01-06 
21:41:16,401][model8_pretrain.py][INFO] Epoch:[0/2](942300/4588595) loss:2.505 lr:0.0000100 epoch_Time:23223.0min: [2024-01-06 21:41:53,341][model8_pretrain.py][INFO] Epoch:[0/2](942400/4588595) loss:2.131 lr:0.0000100 epoch_Time:23222.0min: [2024-01-06 21:41:53,341][model8_pretrain.py][INFO] Epoch:[0/2](942400/4588595) loss:3.012 lr:0.0000100 epoch_Time:23222.0min: [2024-01-06 21:41:53,341][model8_pretrain.py][INFO] Epoch:[0/2](942400/4588595) loss:2.817 lr:0.0000100 epoch_Time:23222.0min: [2024-01-06 21:41:53,341][model8_pretrain.py][INFO] Epoch:[0/2](942400/4588595) loss:2.995 lr:0.0000100 epoch_Time:23222.0min: [2024-01-06 21:41:53,341][model8_pretrain.py][INFO] Epoch:[0/2](942400/4588595) loss:2.802 lr:0.0000100 epoch_Time:23222.0min: [2024-01-06 21:41:53,341][model8_pretrain.py][INFO] Epoch:[0/2](942400/4588595) loss:3.107 lr:0.0000100 epoch_Time:23222.0min: [2024-01-06 21:41:53,341][model8_pretrain.py][INFO] Epoch:[0/2](942400/4588595) loss:3.016 lr:0.0000100 epoch_Time:23222.0min: [2024-01-06 21:41:53,341][model8_pretrain.py][INFO] Epoch:[0/2](942400/4588595) loss:2.841 lr:0.0000100 epoch_Time:23222.0min: [2024-01-06 21:42:30,288][model8_pretrain.py][INFO] Epoch:[0/2](942500/4588595) loss:3.341 lr:0.0000100 epoch_Time:23222.0min: [2024-01-06 21:42:30,288][model8_pretrain.py][INFO] Epoch:[0/2](942500/4588595) loss:2.779 lr:0.0000100 epoch_Time:23222.0min: [2024-01-06 21:42:30,288][model8_pretrain.py][INFO] Epoch:[0/2](942500/4588595) loss:3.174 lr:0.0000100 epoch_Time:23222.0min: [2024-01-06 21:42:30,289][model8_pretrain.py][INFO] Epoch:[0/2](942500/4588595) loss:2.359 lr:0.0000100 epoch_Time:23222.0min: [2024-01-06 21:42:30,289][model8_pretrain.py][INFO] Epoch:[0/2](942500/4588595) loss:3.081 lr:0.0000100 epoch_Time:23222.0min: [2024-01-06 21:42:30,289][model8_pretrain.py][INFO] Epoch:[0/2](942500/4588595) loss:3.057 lr:0.0000100 epoch_Time:23222.0min: [2024-01-06 21:42:30,289][model8_pretrain.py][INFO] Epoch:[0/2](942500/4588595) loss:3.014 lr:0.0000100 
epoch_Time:23222.0min: [2024-01-06 21:42:30,289][model8_pretrain.py][INFO] Epoch:[0/2](942500/4588595) loss:2.276 lr:0.0000100 epoch_Time:23222.0min: [2024-01-06 21:43:08,858][model8_pretrain.py][INFO] Epoch:[0/2](942600/4588595) loss:3.348 lr:0.0000100 epoch_Time:23221.0min: [2024-01-06 21:43:08,858][model8_pretrain.py][INFO] Epoch:[0/2](942600/4588595) loss:2.949 lr:0.0000100 epoch_Time:23221.0min: [2024-01-06 21:43:08,858][model8_pretrain.py][INFO] Epoch:[0/2](942600/4588595) loss:2.557 lr:0.0000100 epoch_Time:23221.0min: [2024-01-06 21:43:08,858][model8_pretrain.py][INFO] Epoch:[0/2](942600/4588595) loss:2.429 lr:0.0000100 epoch_Time:23221.0min: [2024-01-06 21:43:08,858][model8_pretrain.py][INFO] Epoch:[0/2](942600/4588595) loss:3.088 lr:0.0000100 epoch_Time:23221.0min: [2024-01-06 21:43:08,858][model8_pretrain.py][INFO] Epoch:[0/2](942600/4588595) loss:3.374 lr:0.0000100 epoch_Time:23221.0min: [2024-01-06 21:43:08,858][model8_pretrain.py][INFO] Epoch:[0/2](942600/4588595) loss:3.427 lr:0.0000100 epoch_Time:23221.0min: [2024-01-06 21:43:08,859][model8_pretrain.py][INFO] Epoch:[0/2](942600/4588595) loss:2.819 lr:0.0000100 epoch_Time:23221.0min: [2024-01-06 21:43:57,694][model8_pretrain.py][INFO] Epoch:[0/2](942700/4588595) loss:3.149 lr:0.0000100 epoch_Time:23221.0min: [2024-01-06 21:43:57,694][model8_pretrain.py][INFO] Epoch:[0/2](942700/4588595) loss:2.911 lr:0.0000100 epoch_Time:23221.0min: [2024-01-06 21:43:57,694][model8_pretrain.py][INFO] Epoch:[0/2](942700/4588595) loss:3.113 lr:0.0000100 epoch_Time:23221.0min: [2024-01-06 21:43:57,694][model8_pretrain.py][INFO] Epoch:[0/2](942700/4588595) loss:2.947 lr:0.0000100 epoch_Time:23221.0min: [2024-01-06 21:43:57,694][model8_pretrain.py][INFO] Epoch:[0/2](942700/4588595) loss:2.340 lr:0.0000100 epoch_Time:23221.0min: [2024-01-06 21:43:57,694][model8_pretrain.py][INFO] Epoch:[0/2](942700/4588595) loss:2.560 lr:0.0000100 epoch_Time:23221.0min: [2024-01-06 21:43:57,694][model8_pretrain.py][INFO] 
Epoch:[0/2](942700/4588595) loss:2.313 lr:0.0000100 epoch_Time:23221.0min: [2024-01-06 21:43:57,694][model8_pretrain.py][INFO] Epoch:[0/2](942700/4588595) loss:2.577 lr:0.0000100 epoch_Time:23221.0min: [2024-01-06 21:44:34,619][model8_pretrain.py][INFO] Epoch:[0/2](942800/4588595) loss:2.669 lr:0.0000100 epoch_Time:23221.0min: [2024-01-06 21:44:34,619][model8_pretrain.py][INFO] Epoch:[0/2](942800/4588595) loss:3.259 lr:0.0000100 epoch_Time:23221.0min: [2024-01-06 21:44:34,620][model8_pretrain.py][INFO] Epoch:[0/2](942800/4588595) loss:2.869 lr:0.0000100 epoch_Time:23221.0min: [2024-01-06 21:44:34,619][model8_pretrain.py][INFO] Epoch:[0/2](942800/4588595) loss:2.788 lr:0.0000100 epoch_Time:23221.0min: [2024-01-06 21:44:34,620][model8_pretrain.py][INFO] Epoch:[0/2](942800/4588595) loss:2.753 lr:0.0000100 epoch_Time:23221.0min: [2024-01-06 21:44:34,620][model8_pretrain.py][INFO] Epoch:[0/2](942800/4588595) loss:2.494 lr:0.0000100 epoch_Time:23221.0min: [2024-01-06 21:44:34,620][model8_pretrain.py][INFO] Epoch:[0/2](942800/4588595) loss:2.644 lr:0.0000100 epoch_Time:23221.0min: [2024-01-06 21:44:34,620][model8_pretrain.py][INFO] Epoch:[0/2](942800/4588595) loss:2.779 lr:0.0000100 epoch_Time:23221.0min: [2024-01-06 21:45:11,552][model8_pretrain.py][INFO] Epoch:[0/2](942900/4588595) loss:2.863 lr:0.0000100 epoch_Time:23220.0min: [2024-01-06 21:45:11,552][model8_pretrain.py][INFO] Epoch:[0/2](942900/4588595) loss:2.802 lr:0.0000100 epoch_Time:23220.0min: [2024-01-06 21:45:11,552][model8_pretrain.py][INFO] Epoch:[0/2](942900/4588595) loss:2.526 lr:0.0000100 epoch_Time:23220.0min: [2024-01-06 21:45:11,552][model8_pretrain.py][INFO] Epoch:[0/2](942900/4588595) loss:2.640 lr:0.0000100 epoch_Time:23220.0min: [2024-01-06 21:45:11,552][model8_pretrain.py][INFO] Epoch:[0/2](942900/4588595) loss:2.708 lr:0.0000100 epoch_Time:23220.0min: [2024-01-06 21:45:11,552][model8_pretrain.py][INFO] Epoch:[0/2](942900/4588595) loss:2.617 lr:0.0000100 epoch_Time:23220.0min: [2024-01-06 
21:45:11,552][model8_pretrain.py][INFO] Epoch:[0/2](942900/4588595) loss:2.584 lr:0.0000100 epoch_Time:23220.0min: [2024-01-06 21:45:11,552][model8_pretrain.py][INFO] Epoch:[0/2](942900/4588595) loss:3.023 lr:0.0000100 epoch_Time:23220.0min: [2024-01-06 21:45:48,480][model8_pretrain.py][INFO] Epoch:[0/2](943000/4588595) loss:2.688 lr:0.0000100 epoch_Time:23218.0min: [2024-01-06 21:45:48,480][model8_pretrain.py][INFO] Epoch:[0/2](943000/4588595) loss:2.829 lr:0.0000100 epoch_Time:23218.0min: [2024-01-06 21:45:48,480][model8_pretrain.py][INFO] Epoch:[0/2](943000/4588595) loss:2.844 lr:0.0000100 epoch_Time:23218.0min: [2024-01-06 21:45:48,480][model8_pretrain.py][INFO] Epoch:[0/2](943000/4588595) loss:3.133 lr:0.0000100 epoch_Time:23218.0min: [2024-01-06 21:45:48,480][model8_pretrain.py][INFO] Epoch:[0/2](943000/4588595) loss:2.863 lr:0.0000100 epoch_Time:23218.0min: [2024-01-06 21:45:48,480][model8_pretrain.py][INFO] Epoch:[0/2](943000/4588595) loss:3.047 lr:0.0000100 epoch_Time:23218.0min: [2024-01-06 21:45:48,480][model8_pretrain.py][INFO] Epoch:[0/2](943000/4588595) loss:2.592 lr:0.0000100 epoch_Time:23218.0min: [2024-01-06 21:45:48,480][model8_pretrain.py][INFO] Epoch:[0/2](943000/4588595) loss:2.916 lr:0.0000100 epoch_Time:23218.0min: [2024-01-06 21:46:25,405][model8_pretrain.py][INFO] Epoch:[0/2](943100/4588595) loss:2.662 lr:0.0000100 epoch_Time:23218.0min: [2024-01-06 21:46:25,405][model8_pretrain.py][INFO] Epoch:[0/2](943100/4588595) loss:2.361 lr:0.0000100 epoch_Time:23218.0min: [2024-01-06 21:46:25,405][model8_pretrain.py][INFO] Epoch:[0/2](943100/4588595) loss:3.233 lr:0.0000100 epoch_Time:23218.0min: [2024-01-06 21:46:25,405][model8_pretrain.py][INFO] Epoch:[0/2](943100/4588595) loss:3.044 lr:0.0000100 epoch_Time:23218.0min: [2024-01-06 21:46:25,405][model8_pretrain.py][INFO] Epoch:[0/2](943100/4588595) loss:3.276 lr:0.0000100 epoch_Time:23218.0min: [2024-01-06 21:46:25,405][model8_pretrain.py][INFO] Epoch:[0/2](943100/4588595) loss:2.886 lr:0.0000100 
epoch_Time:23218.0min: [2024-01-06 21:46:25,406][model8_pretrain.py][INFO] Epoch:[0/2](943100/4588595) loss:2.834 lr:0.0000100 epoch_Time:23218.0min: [2024-01-06 21:46:25,407][model8_pretrain.py][INFO] Epoch:[0/2](943100/4588595) loss:2.729 lr:0.0000100 epoch_Time:23218.0min: [2024-01-06 21:47:02,357][model8_pretrain.py][INFO] Epoch:[0/2](943200/4588595) loss:2.671 lr:0.0000100 epoch_Time:23217.0min: [2024-01-06 21:47:02,357][model8_pretrain.py][INFO] Epoch:[0/2](943200/4588595) loss:2.428 lr:0.0000100 epoch_Time:23217.0min: [2024-01-06 21:47:02,357][model8_pretrain.py][INFO] Epoch:[0/2](943200/4588595) loss:2.445 lr:0.0000100 epoch_Time:23217.0min: [2024-01-06 21:47:02,357][model8_pretrain.py][INFO] Epoch:[0/2](943200/4588595) loss:2.509 lr:0.0000100 epoch_Time:23217.0min: [2024-01-06 21:47:02,357][model8_pretrain.py][INFO] Epoch:[0/2](943200/4588595) loss:2.246 lr:0.0000100 epoch_Time:23217.0min: [2024-01-06 21:47:02,357][model8_pretrain.py][INFO] Epoch:[0/2](943200/4588595) loss:2.842 lr:0.0000100 epoch_Time:23217.0min: [2024-01-06 21:47:02,357][model8_pretrain.py][INFO] Epoch:[0/2](943200/4588595) loss:2.519 lr:0.0000100 epoch_Time:23217.0min: [2024-01-06 21:47:02,357][model8_pretrain.py][INFO] Epoch:[0/2](943200/4588595) loss:2.565 lr:0.0000100 epoch_Time:23217.0min: [2024-01-06 21:47:39,382][model8_pretrain.py][INFO] Epoch:[0/2](943300/4588595) loss:2.400 lr:0.0000100 epoch_Time:23217.0min: [2024-01-06 21:47:39,382][model8_pretrain.py][INFO] Epoch:[0/2](943300/4588595) loss:2.678 lr:0.0000100 epoch_Time:23217.0min: [2024-01-06 21:47:39,382][model8_pretrain.py][INFO] Epoch:[0/2](943300/4588595) loss:2.732 lr:0.0000100 epoch_Time:23217.0min: [2024-01-06 21:47:39,382][model8_pretrain.py][INFO] Epoch:[0/2](943300/4588595) loss:2.801 lr:0.0000100 epoch_Time:23217.0min: [2024-01-06 21:47:39,382][model8_pretrain.py][INFO] Epoch:[0/2](943300/4588595) loss:3.291 lr:0.0000100 epoch_Time:23217.0min: [2024-01-06 21:47:39,382][model8_pretrain.py][INFO] 
Epoch:[0/2](943300/4588595) loss:2.452 lr:0.0000100 epoch_Time:23217.0min: [2024-01-06 21:47:39,382][model8_pretrain.py][INFO] Epoch:[0/2](943300/4588595) loss:2.822 lr:0.0000100 epoch_Time:23217.0min: [2024-01-06 21:47:39,400][model8_pretrain.py][INFO] Epoch:[0/2](943300/4588595) loss:2.734 lr:0.0000100 epoch_Time:23217.0min: [2024-01-06 21:48:18,167][model8_pretrain.py][INFO] Epoch:[0/2](943400/4588595) loss:2.843 lr:0.0000100 epoch_Time:23216.0min: [2024-01-06 21:48:18,167][model8_pretrain.py][INFO] Epoch:[0/2](943400/4588595) loss:2.698 lr:0.0000100 epoch_Time:23216.0min: [2024-01-06 21:48:18,167][model8_pretrain.py][INFO] Epoch:[0/2](943400/4588595) loss:2.721 lr:0.0000100 epoch_Time:23216.0min: [2024-01-06 21:48:18,167][model8_pretrain.py][INFO] Epoch:[0/2](943400/4588595) loss:3.241 lr:0.0000100 epoch_Time:23216.0min: [2024-01-06 21:48:18,168][model8_pretrain.py][INFO] Epoch:[0/2](943400/4588595) loss:2.509 lr:0.0000100 epoch_Time:23216.0min: [2024-01-06 21:48:18,168][model8_pretrain.py][INFO] Epoch:[0/2](943400/4588595) loss:2.769 lr:0.0000100 epoch_Time:23216.0min: [2024-01-06 21:48:18,168][model8_pretrain.py][INFO] Epoch:[0/2](943400/4588595) loss:3.374 lr:0.0000100 epoch_Time:23216.0min: [2024-01-06 21:48:18,176][model8_pretrain.py][INFO] Epoch:[0/2](943400/4588595) loss:3.396 lr:0.0000100 epoch_Time:23216.0min: [2024-01-06 21:49:07,114][model8_pretrain.py][INFO] Epoch:[0/2](943500/4588595) loss:2.699 lr:0.0000100 epoch_Time:23216.0min: [2024-01-06 21:49:07,114][model8_pretrain.py][INFO] Epoch:[0/2](943500/4588595) loss:2.525 lr:0.0000100 epoch_Time:23216.0min: [2024-01-06 21:49:07,114][model8_pretrain.py][INFO] Epoch:[0/2](943500/4588595) loss:2.986 lr:0.0000100 epoch_Time:23216.0min: [2024-01-06 21:49:07,114][model8_pretrain.py][INFO] Epoch:[0/2](943500/4588595) loss:3.075 lr:0.0000100 epoch_Time:23216.0min: [2024-01-06 21:49:07,114][model8_pretrain.py][INFO] Epoch:[0/2](943500/4588595) loss:2.741 lr:0.0000100 epoch_Time:23216.0min: [2024-01-06 
21:49:07,114][model8_pretrain.py][INFO] Epoch:[0/2](943500/4588595) loss:3.057 lr:0.0000100 epoch_Time:23216.0min: [2024-01-06 21:49:07,114][model8_pretrain.py][INFO] Epoch:[0/2](943500/4588595) loss:2.280 lr:0.0000100 epoch_Time:23216.0min: [2024-01-06 21:49:07,114][model8_pretrain.py][INFO] Epoch:[0/2](943500/4588595) loss:2.741 lr:0.0000100 epoch_Time:23216.0min: [2024-01-06 21:49:44,057][model8_pretrain.py][INFO] Epoch:[0/2](943600/4588595) loss:3.100 lr:0.0000100 epoch_Time:23216.0min: [2024-01-06 21:49:44,057][model8_pretrain.py][INFO] Epoch:[0/2](943600/4588595) loss:2.518 lr:0.0000100 epoch_Time:23216.0min: [2024-01-06 21:49:44,057][model8_pretrain.py][INFO] Epoch:[0/2](943600/4588595) loss:2.856 lr:0.0000100 epoch_Time:23216.0min: [2024-01-06 21:49:44,057][model8_pretrain.py][INFO] Epoch:[0/2](943600/4588595) loss:2.914 lr:0.0000100 epoch_Time:23216.0min: [2024-01-06 21:49:44,057][model8_pretrain.py][INFO] Epoch:[0/2](943600/4588595) loss:2.793 lr:0.0000100 epoch_Time:23216.0min: [2024-01-06 21:49:44,057][model8_pretrain.py][INFO] Epoch:[0/2](943600/4588595) loss:2.800 lr:0.0000100 epoch_Time:23216.0min: [2024-01-06 21:49:44,057][model8_pretrain.py][INFO] Epoch:[0/2](943600/4588595) loss:3.018 lr:0.0000100 epoch_Time:23216.0min: [2024-01-06 21:49:44,057][model8_pretrain.py][INFO] Epoch:[0/2](943600/4588595) loss:2.775 lr:0.0000100 epoch_Time:23216.0min: [2024-01-06 21:50:21,000][model8_pretrain.py][INFO] Epoch:[0/2](943700/4588595) loss:3.081 lr:0.0000100 epoch_Time:23215.0min: [2024-01-06 21:50:21,000][model8_pretrain.py][INFO] Epoch:[0/2](943700/4588595) loss:2.457 lr:0.0000100 epoch_Time:23215.0min: [2024-01-06 21:50:21,000][model8_pretrain.py][INFO] Epoch:[0/2](943700/4588595) loss:2.886 lr:0.0000100 epoch_Time:23215.0min: [2024-01-06 21:50:21,000][model8_pretrain.py][INFO] Epoch:[0/2](943700/4588595) loss:3.001 lr:0.0000100 epoch_Time:23215.0min: [2024-01-06 21:50:21,000][model8_pretrain.py][INFO] Epoch:[0/2](943700/4588595) loss:3.216 lr:0.0000100 
epoch_Time:23215.0min: [2024-01-06 21:50:21,000][model8_pretrain.py][INFO] Epoch:[0/2](943700/4588595) loss:3.116 lr:0.0000100 epoch_Time:23215.0min: [2024-01-06 21:50:21,000][model8_pretrain.py][INFO] Epoch:[0/2](943700/4588595) loss:3.069 lr:0.0000100 epoch_Time:23215.0min: [2024-01-06 21:50:21,000][model8_pretrain.py][INFO] Epoch:[0/2](943700/4588595) loss:2.397 lr:0.0000100 epoch_Time:23215.0min: [2024-01-06 21:50:57,942][model8_pretrain.py][INFO] Epoch:[0/2](943800/4588595) loss:3.035 lr:0.0000100 epoch_Time:23214.0min: [2024-01-06 21:50:57,942][model8_pretrain.py][INFO] Epoch:[0/2](943800/4588595) loss:2.839 lr:0.0000100 epoch_Time:23214.0min: [2024-01-06 21:50:57,942][model8_pretrain.py][INFO] Epoch:[0/2](943800/4588595) loss:2.470 lr:0.0000100 epoch_Time:23214.0min: [2024-01-06 21:50:57,942][model8_pretrain.py][INFO] Epoch:[0/2](943800/4588595) loss:3.233 lr:0.0000100 epoch_Time:23214.0min: [2024-01-06 21:50:57,942][model8_pretrain.py][INFO] Epoch:[0/2](943800/4588595) loss:2.962 lr:0.0000100 epoch_Time:23214.0min: [2024-01-06 21:50:57,942][model8_pretrain.py][INFO] Epoch:[0/2](943800/4588595) loss:1.984 lr:0.0000100 epoch_Time:23214.0min: [2024-01-06 21:50:57,942][model8_pretrain.py][INFO] Epoch:[0/2](943800/4588595) loss:2.383 lr:0.0000100 epoch_Time:23214.0min: [2024-01-06 21:50:57,942][model8_pretrain.py][INFO] Epoch:[0/2](943800/4588595) loss:2.059 lr:0.0000100 epoch_Time:23214.0min: [2024-01-06 21:51:34,887][model8_pretrain.py][INFO] Epoch:[0/2](943900/4588595) loss:3.148 lr:0.0000100 epoch_Time:23214.0min: [2024-01-06 21:51:34,887][model8_pretrain.py][INFO] Epoch:[0/2](943900/4588595) loss:2.749 lr:0.0000100 epoch_Time:23214.0min: [2024-01-06 21:51:34,887][model8_pretrain.py][INFO] Epoch:[0/2](943900/4588595) loss:2.814 lr:0.0000100 epoch_Time:23214.0min: [2024-01-06 21:51:34,887][model8_pretrain.py][INFO] Epoch:[0/2](943900/4588595) loss:3.447 lr:0.0000100 epoch_Time:23214.0min: [2024-01-06 21:51:34,887][model8_pretrain.py][INFO] 
Epoch:[0/2](943900/4588595) loss:2.959 lr:0.0000100 epoch_Time:23214.0min: [2024-01-06 21:51:34,887][model8_pretrain.py][INFO] Epoch:[0/2](943900/4588595) loss:3.188 lr:0.0000100 epoch_Time:23214.0min: [2024-01-06 21:51:34,887][model8_pretrain.py][INFO] Epoch:[0/2](943900/4588595) loss:2.807 lr:0.0000100 epoch_Time:23214.0min: [2024-01-06 21:51:34,888][model8_pretrain.py][INFO] Epoch:[0/2](943900/4588595) loss:2.630 lr:0.0000100 epoch_Time:23214.0min: [2024-01-06 21:52:11,844][model8_pretrain.py][INFO] Epoch:[0/2](944000/4588595) loss:3.093 lr:0.0000100 epoch_Time:23213.0min: [2024-01-06 21:52:11,844][model8_pretrain.py][INFO] Epoch:[0/2](944000/4588595) loss:2.795 lr:0.0000100 epoch_Time:23213.0min: [2024-01-06 21:52:11,844][model8_pretrain.py][INFO] Epoch:[0/2](944000/4588595) loss:2.609 lr:0.0000100 epoch_Time:23213.0min: [2024-01-06 21:52:11,844][model8_pretrain.py][INFO] Epoch:[0/2](944000/4588595) loss:2.795 lr:0.0000100 epoch_Time:23213.0min: [2024-01-06 21:52:11,844][model8_pretrain.py][INFO] Epoch:[0/2](944000/4588595) loss:2.717 lr:0.0000100 epoch_Time:23213.0min: [2024-01-06 21:52:11,845][model8_pretrain.py][INFO] Epoch:[0/2](944000/4588595) loss:2.905 lr:0.0000100 epoch_Time:23213.0min: [2024-01-06 21:52:11,845][model8_pretrain.py][INFO] Epoch:[0/2](944000/4588595) loss:2.374 lr:0.0000100 epoch_Time:23213.0min: [2024-01-06 21:52:11,845][model8_pretrain.py][INFO] Epoch:[0/2](944000/4588595) loss:2.863 lr:0.0000100 epoch_Time:23213.0min: [2024-01-06 21:52:48,794][model8_pretrain.py][INFO] Epoch:[0/2](944100/4588595) loss:2.017 lr:0.0000100 epoch_Time:23211.0min: [2024-01-06 21:52:48,794][model8_pretrain.py][INFO] Epoch:[0/2](944100/4588595) loss:2.184 lr:0.0000100 epoch_Time:23211.0min: [2024-01-06 21:52:48,794][model8_pretrain.py][INFO] Epoch:[0/2](944100/4588595) loss:3.090 lr:0.0000100 epoch_Time:23211.0min: [2024-01-06 21:52:48,794][model8_pretrain.py][INFO] Epoch:[0/2](944100/4588595) loss:3.187 lr:0.0000100 epoch_Time:23211.0min: [2024-01-06 
21:52:48,794][model8_pretrain.py][INFO] Epoch:[0/2](944100/4588595) loss:2.903 lr:0.0000100 epoch_Time:23211.0min: [2024-01-06 21:52:48,794][model8_pretrain.py][INFO] Epoch:[0/2](944100/4588595) loss:2.666 lr:0.0000100 epoch_Time:23211.0min: [2024-01-06 21:52:48,794][model8_pretrain.py][INFO] Epoch:[0/2](944100/4588595) loss:2.672 lr:0.0000100 epoch_Time:23211.0min: [2024-01-06 21:52:48,795][model8_pretrain.py][INFO] Epoch:[0/2](944100/4588595) loss:2.826 lr:0.0000100 epoch_Time:23211.0min: [2024-01-06 21:53:27,431][model8_pretrain.py][INFO] Epoch:[0/2](944200/4588595) loss:3.092 lr:0.0000100 epoch_Time:23211.0min: [2024-01-06 21:53:27,431][model8_pretrain.py][INFO] Epoch:[0/2](944200/4588595) loss:2.437 lr:0.0000100 epoch_Time:23211.0min: [2024-01-06 21:53:27,431][model8_pretrain.py][INFO] Epoch:[0/2](944200/4588595) loss:2.548 lr:0.0000100 epoch_Time:23211.0min: [2024-01-06 21:53:27,431][model8_pretrain.py][INFO] Epoch:[0/2](944200/4588595) loss:2.936 lr:0.0000100 epoch_Time:23211.0min: [2024-01-06 21:53:27,431][model8_pretrain.py][INFO] Epoch:[0/2](944200/4588595) loss:2.741 lr:0.0000100 epoch_Time:23211.0min: [2024-01-06 21:53:27,431][model8_pretrain.py][INFO] Epoch:[0/2](944200/4588595) loss:2.707 lr:0.0000100 epoch_Time:23211.0min: [2024-01-06 21:53:27,431][model8_pretrain.py][INFO] Epoch:[0/2](944200/4588595) loss:2.702 lr:0.0000100 epoch_Time:23211.0min: [2024-01-06 21:53:27,431][model8_pretrain.py][INFO] Epoch:[0/2](944200/4588595) loss:3.085 lr:0.0000100 epoch_Time:23211.0min: [2024-01-06 21:54:16,285][model8_pretrain.py][INFO] Epoch:[0/2](944300/4588595) loss:3.027 lr:0.0000100 epoch_Time:23211.0min: [2024-01-06 21:54:16,285][model8_pretrain.py][INFO] Epoch:[0/2](944300/4588595) loss:2.767 lr:0.0000100 epoch_Time:23211.0min: [2024-01-06 21:54:16,285][model8_pretrain.py][INFO] Epoch:[0/2](944300/4588595) loss:2.408 lr:0.0000100 epoch_Time:23211.0min: [2024-01-06 21:54:16,285][model8_pretrain.py][INFO] Epoch:[0/2](944300/4588595) loss:3.127 lr:0.0000100 
epoch_Time:23211.0min: [2024-01-06 21:54:16,285][model8_pretrain.py][INFO] Epoch:[0/2](944300/4588595) loss:2.894 lr:0.0000100 epoch_Time:23211.0min: [2024-01-06 21:54:16,285][model8_pretrain.py][INFO] Epoch:[0/2](944300/4588595) loss:3.208 lr:0.0000100 epoch_Time:23211.0min: [2024-01-06 21:54:16,285][model8_pretrain.py][INFO] Epoch:[0/2](944300/4588595) loss:3.102 lr:0.0000100 epoch_Time:23211.0min: [2024-01-06 21:54:16,285][model8_pretrain.py][INFO] Epoch:[0/2](944300/4588595) loss:2.977 lr:0.0000100 epoch_Time:23211.0min: [2024-01-06 21:54:53,226][model8_pretrain.py][INFO] Epoch:[0/2](944400/4588595) loss:2.680 lr:0.0000100 epoch_Time:23210.0min: [2024-01-06 21:54:53,226][model8_pretrain.py][INFO] Epoch:[0/2](944400/4588595) loss:2.889 lr:0.0000100 epoch_Time:23210.0min: [2024-01-06 21:54:53,226][model8_pretrain.py][INFO] Epoch:[0/2](944400/4588595) loss:3.161 lr:0.0000100 epoch_Time:23210.0min: [2024-01-06 21:54:53,226][model8_pretrain.py][INFO] Epoch:[0/2](944400/4588595) loss:2.642 lr:0.0000100 epoch_Time:23210.0min: [2024-01-06 21:54:53,226][model8_pretrain.py][INFO] Epoch:[0/2](944400/4588595) loss:3.080 lr:0.0000100 epoch_Time:23210.0min: [2024-01-06 21:54:53,226][model8_pretrain.py][INFO] Epoch:[0/2](944400/4588595) loss:3.481 lr:0.0000100 epoch_Time:23210.0min: [2024-01-06 21:54:53,226][model8_pretrain.py][INFO] Epoch:[0/2](944400/4588595) loss:2.553 lr:0.0000100 epoch_Time:23210.0min: [2024-01-06 21:54:53,226][model8_pretrain.py][INFO] Epoch:[0/2](944400/4588595) loss:3.018 lr:0.0000100 epoch_Time:23210.0min: [2024-01-06 21:55:30,170][model8_pretrain.py][INFO] Epoch:[0/2](944500/4588595) loss:2.789 lr:0.0000100 epoch_Time:23210.0min: [2024-01-06 21:55:30,170][model8_pretrain.py][INFO] Epoch:[0/2](944500/4588595) loss:2.814 lr:0.0000100 epoch_Time:23210.0min: [2024-01-06 21:55:30,170][model8_pretrain.py][INFO] Epoch:[0/2](944500/4588595) loss:3.054 lr:0.0000100 epoch_Time:23210.0min: [2024-01-06 21:55:30,170][model8_pretrain.py][INFO] 
Epoch:[0/2](944500/4588595) loss:3.290 lr:0.0000100 epoch_Time:23210.0min: [2024-01-06 21:55:30,170][model8_pretrain.py][INFO] Epoch:[0/2](944500/4588595) loss:2.841 lr:0.0000100 epoch_Time:23210.0min: [2024-01-06 21:55:30,170][model8_pretrain.py][INFO] Epoch:[0/2](944500/4588595) loss:3.189 lr:0.0000100 epoch_Time:23210.0min: [2024-01-06 21:55:30,170][model8_pretrain.py][INFO] Epoch:[0/2](944500/4588595) loss:2.804 lr:0.0000100 epoch_Time:23210.0min: [2024-01-06 21:55:30,170][model8_pretrain.py][INFO] Epoch:[0/2](944500/4588595) loss:2.822 lr:0.0000100 epoch_Time:23210.0min: [2024-01-06 21:56:07,113][model8_pretrain.py][INFO] Epoch:[0/2](944600/4588595) loss:2.804 lr:0.0000100 epoch_Time:23209.0min: [2024-01-06 21:56:07,113][model8_pretrain.py][INFO] Epoch:[0/2](944600/4588595) loss:3.086 lr:0.0000100 epoch_Time:23209.0min: [2024-01-06 21:56:07,113][model8_pretrain.py][INFO] Epoch:[0/2](944600/4588595) loss:2.830 lr:0.0000100 epoch_Time:23209.0min: [2024-01-06 21:56:07,113][model8_pretrain.py][INFO] Epoch:[0/2](944600/4588595) loss:2.330 lr:0.0000100 epoch_Time:23209.0min: [2024-01-06 21:56:07,113][model8_pretrain.py][INFO] Epoch:[0/2](944600/4588595) loss:3.272 lr:0.0000100 epoch_Time:23209.0min: [2024-01-06 21:56:07,113][model8_pretrain.py][INFO] Epoch:[0/2](944600/4588595) loss:3.098 lr:0.0000100 epoch_Time:23209.0min: [2024-01-06 21:56:07,113][model8_pretrain.py][INFO] Epoch:[0/2](944600/4588595) loss:2.956 lr:0.0000100 epoch_Time:23209.0min: [2024-01-06 21:56:07,113][model8_pretrain.py][INFO] Epoch:[0/2](944600/4588595) loss:2.990 lr:0.0000100 epoch_Time:23209.0min: [2024-01-06 21:56:44,055][model8_pretrain.py][INFO] Epoch:[0/2](944700/4588595) loss:2.460 lr:0.0000100 epoch_Time:23209.0min: [2024-01-06 21:56:44,055][model8_pretrain.py][INFO] Epoch:[0/2](944700/4588595) loss:2.679 lr:0.0000100 epoch_Time:23209.0min: [2024-01-06 21:56:44,055][model8_pretrain.py][INFO] Epoch:[0/2](944700/4588595) loss:3.012 lr:0.0000100 epoch_Time:23209.0min: [2024-01-06 
21:56:44,055][model8_pretrain.py][INFO] Epoch:[0/2](944700/4588595) loss:2.488 lr:0.0000100 epoch_Time:23209.0min: [2024-01-06 21:56:44,055][model8_pretrain.py][INFO] Epoch:[0/2](944700/4588595) loss:2.609 lr:0.0000100 epoch_Time:23209.0min: [2024-01-06 21:56:44,055][model8_pretrain.py][INFO] Epoch:[0/2](944700/4588595) loss:2.708 lr:0.0000100 epoch_Time:23209.0min: [2024-01-06 21:56:44,055][model8_pretrain.py][INFO] Epoch:[0/2](944700/4588595) loss:2.289 lr:0.0000100 epoch_Time:23209.0min: [2024-01-06 21:56:44,055][model8_pretrain.py][INFO] Epoch:[0/2](944700/4588595) loss:2.492 lr:0.0000100 epoch_Time:23209.0min: [2024-01-06 21:57:21,007][model8_pretrain.py][INFO] Epoch:[0/2](944800/4588595) loss:3.052 lr:0.0000100 epoch_Time:23208.0min: [2024-01-06 21:57:21,007][model8_pretrain.py][INFO] Epoch:[0/2](944800/4588595) loss:3.049 lr:0.0000100 epoch_Time:23208.0min: [2024-01-06 21:57:21,007][model8_pretrain.py][INFO] Epoch:[0/2](944800/4588595) loss:3.094 lr:0.0000100 epoch_Time:23208.0min: [2024-01-06 21:57:21,007][model8_pretrain.py][INFO] Epoch:[0/2](944800/4588595) loss:2.957 lr:0.0000100 epoch_Time:23208.0min: [2024-01-06 21:57:21,007][model8_pretrain.py][INFO] Epoch:[0/2](944800/4588595) loss:2.395 lr:0.0000100 epoch_Time:23208.0min: [2024-01-06 21:57:21,007][model8_pretrain.py][INFO] Epoch:[0/2](944800/4588595) loss:2.800 lr:0.0000100 epoch_Time:23208.0min: [2024-01-06 21:57:21,007][model8_pretrain.py][INFO] Epoch:[0/2](944800/4588595) loss:2.708 lr:0.0000100 epoch_Time:23208.0min: [2024-01-06 21:57:21,007][model8_pretrain.py][INFO] Epoch:[0/2](944800/4588595) loss:2.985 lr:0.0000100 epoch_Time:23208.0min: [2024-01-06 21:57:57,947][model8_pretrain.py][INFO] Epoch:[0/2](944900/4588595) loss:3.152 lr:0.0000100 epoch_Time:23207.0min: [2024-01-06 21:57:57,947][model8_pretrain.py][INFO] Epoch:[0/2](944900/4588595) loss:3.191 lr:0.0000100 epoch_Time:23207.0min: [2024-01-06 21:57:57,947][model8_pretrain.py][INFO] Epoch:[0/2](944900/4588595) loss:3.049 lr:0.0000100 
epoch_Time:23207.0min: [2024-01-06 21:57:57,947][model8_pretrain.py][INFO] Epoch:[0/2](944900/4588595) loss:2.739 lr:0.0000100 epoch_Time:23207.0min: [2024-01-06 21:57:57,947][model8_pretrain.py][INFO] Epoch:[0/2](944900/4588595) loss:2.845 lr:0.0000100 epoch_Time:23207.0min: [2024-01-06 21:57:57,947][model8_pretrain.py][INFO] Epoch:[0/2](944900/4588595) loss:3.061 lr:0.0000100 epoch_Time:23207.0min: [2024-01-06 21:57:57,947][model8_pretrain.py][INFO] Epoch:[0/2](944900/4588595) loss:2.592 lr:0.0000100 epoch_Time:23207.0min: [2024-01-06 21:57:57,947][model8_pretrain.py][INFO] Epoch:[0/2](944900/4588595) loss:2.753 lr:0.0000100 epoch_Time:23207.0min: [2024-01-06 21:58:34,900][model8_pretrain.py][INFO] Epoch:[0/2](945000/4588595) loss:3.026 lr:0.0000100 epoch_Time:23207.0min: [2024-01-06 21:58:34,900][model8_pretrain.py][INFO] Epoch:[0/2](945000/4588595) loss:2.578 lr:0.0000100 epoch_Time:23207.0min: [2024-01-06 21:58:34,900][model8_pretrain.py][INFO] Epoch:[0/2](945000/4588595) loss:2.513 lr:0.0000100 epoch_Time:23207.0min: [2024-01-06 21:58:34,900][model8_pretrain.py][INFO] Epoch:[0/2](945000/4588595) loss:3.316 lr:0.0000100 epoch_Time:23207.0min: [2024-01-06 21:58:34,900][model8_pretrain.py][INFO] Epoch:[0/2](945000/4588595) loss:3.134 lr:0.0000100 epoch_Time:23207.0min: [2024-01-06 21:58:34,900][model8_pretrain.py][INFO] Epoch:[0/2](945000/4588595) loss:2.658 lr:0.0000100 epoch_Time:23207.0min: [2024-01-06 21:58:34,900][model8_pretrain.py][INFO] Epoch:[0/2](945000/4588595) loss:2.745 lr:0.0000100 epoch_Time:23207.0min: [2024-01-06 21:58:36,598][model8_pretrain.py][INFO] Epoch:[0/2](945000/4588595) loss:3.298 lr:0.0000100 epoch_Time:23207.0min: [2024-01-06 21:59:25,504][model8_pretrain.py][INFO] Epoch:[0/2](945100/4588595) loss:2.654 lr:0.0000100 epoch_Time:23207.0min: [2024-01-06 21:59:25,504][model8_pretrain.py][INFO] Epoch:[0/2](945100/4588595) loss:2.746 lr:0.0000100 epoch_Time:23207.0min: [2024-01-06 21:59:25,504][model8_pretrain.py][INFO] 
Epoch:[0/2](945100/4588595) loss:2.252 lr:0.0000100 epoch_Time:23207.0min: [2024-01-06 21:59:25,504][model8_pretrain.py][INFO] Epoch:[0/2](945100/4588595) loss:2.930 lr:0.0000100 epoch_Time:23207.0min: [2024-01-06 21:59:25,504][model8_pretrain.py][INFO] Epoch:[0/2](945100/4588595) loss:3.022 lr:0.0000100 epoch_Time:23207.0min: [2024-01-06 21:59:25,504][model8_pretrain.py][INFO] Epoch:[0/2](945100/4588595) loss:2.772 lr:0.0000100 epoch_Time:23207.0min: [2024-01-06 21:59:25,504][model8_pretrain.py][INFO] Epoch:[0/2](945100/4588595) loss:3.191 lr:0.0000100 epoch_Time:23207.0min: [2024-01-06 21:59:25,504][model8_pretrain.py][INFO] Epoch:[0/2](945100/4588595) loss:2.926 lr:0.0000100 epoch_Time:23207.0min: [2024-01-06 22:00:02,449][model8_pretrain.py][INFO] Epoch:[0/2](945200/4588595) loss:2.991 lr:0.0000100 epoch_Time:23206.0min: [2024-01-06 22:00:02,449][model8_pretrain.py][INFO] Epoch:[0/2](945200/4588595) loss:3.104 lr:0.0000100 epoch_Time:23206.0min: [2024-01-06 22:00:02,449][model8_pretrain.py][INFO] Epoch:[0/2](945200/4588595) loss:1.849 lr:0.0000100 epoch_Time:23206.0min: [2024-01-06 22:00:02,449][model8_pretrain.py][INFO] Epoch:[0/2](945200/4588595) loss:2.274 lr:0.0000100 epoch_Time:23206.0min: [2024-01-06 22:00:02,449][model8_pretrain.py][INFO] Epoch:[0/2](945200/4588595) loss:2.369 lr:0.0000100 epoch_Time:23206.0min: [2024-01-06 22:00:02,449][model8_pretrain.py][INFO] Epoch:[0/2](945200/4588595) loss:2.533 lr:0.0000100 epoch_Time:23206.0min: [2024-01-06 22:00:02,449][model8_pretrain.py][INFO] Epoch:[0/2](945200/4588595) loss:3.025 lr:0.0000100 epoch_Time:23206.0min: [2024-01-06 22:00:02,449][model8_pretrain.py][INFO] Epoch:[0/2](945200/4588595) loss:2.659 lr:0.0000100 epoch_Time:23206.0min: [2024-01-06 22:00:39,397][model8_pretrain.py][INFO] Epoch:[0/2](945300/4588595) loss:2.646 lr:0.0000100 epoch_Time:23205.0min: [2024-01-06 22:00:39,397][model8_pretrain.py][INFO] Epoch:[0/2](945300/4588595) loss:2.331 lr:0.0000100 epoch_Time:23205.0min: [2024-01-06 
22:00:39,397][model8_pretrain.py][INFO] Epoch:[0/2](945300/4588595) loss:3.180 lr:0.0000100 epoch_Time:23205.0min: [2024-01-06 22:00:39,397][model8_pretrain.py][INFO] Epoch:[0/2](945300/4588595) loss:2.629 lr:0.0000100 epoch_Time:23205.0min: [2024-01-06 22:00:39,397][model8_pretrain.py][INFO] Epoch:[0/2](945300/4588595) loss:3.155 lr:0.0000100 epoch_Time:23205.0min: [2024-01-06 22:00:39,397][model8_pretrain.py][INFO] Epoch:[0/2](945300/4588595) loss:2.483 lr:0.0000100 epoch_Time:23205.0min: [2024-01-06 22:00:39,398][model8_pretrain.py][INFO] Epoch:[0/2](945300/4588595) loss:2.790 lr:0.0000100 epoch_Time:23205.0min: [2024-01-06 22:00:39,398][model8_pretrain.py][INFO] Epoch:[0/2](945300/4588595) loss:2.573 lr:0.0000100 epoch_Time:23205.0min: [2024-01-06 22:01:16,342][model8_pretrain.py][INFO] Epoch:[0/2](945400/4588595) loss:3.148 lr:0.0000100 epoch_Time:23204.0min: [2024-01-06 22:01:16,342][model8_pretrain.py][INFO] Epoch:[0/2](945400/4588595) loss:2.855 lr:0.0000100 epoch_Time:23204.0min: [2024-01-06 22:01:16,342][model8_pretrain.py][INFO] Epoch:[0/2](945400/4588595) loss:2.916 lr:0.0000100 epoch_Time:23204.0min: [2024-01-06 22:01:16,342][model8_pretrain.py][INFO] Epoch:[0/2](945400/4588595) loss:2.612 lr:0.0000100 epoch_Time:23204.0min: [2024-01-06 22:01:16,342][model8_pretrain.py][INFO] Epoch:[0/2](945400/4588595) loss:2.292 lr:0.0000100 epoch_Time:23204.0min: [2024-01-06 22:01:16,342][model8_pretrain.py][INFO] Epoch:[0/2](945400/4588595) loss:2.884 lr:0.0000100 epoch_Time:23204.0min: [2024-01-06 22:01:16,342][model8_pretrain.py][INFO] Epoch:[0/2](945400/4588595) loss:3.137 lr:0.0000100 epoch_Time:23204.0min: [2024-01-06 22:01:16,342][model8_pretrain.py][INFO] Epoch:[0/2](945400/4588595) loss:2.766 lr:0.0000100 epoch_Time:23204.0min: [2024-01-06 22:01:53,288][model8_pretrain.py][INFO] Epoch:[0/2](945500/4588595) loss:2.540 lr:0.0000100 epoch_Time:23203.0min: [2024-01-06 22:01:53,288][model8_pretrain.py][INFO] Epoch:[0/2](945500/4588595) loss:2.623 lr:0.0000100 
epoch_Time:23203.0min: [2024-01-06 22:01:53,288][model8_pretrain.py][INFO] Epoch:[0/2](945500/4588595) loss:3.078 lr:0.0000100 epoch_Time:23203.0min: [2024-01-06 22:01:53,288][model8_pretrain.py][INFO] Epoch:[0/2](945500/4588595) loss:3.079 lr:0.0000100 epoch_Time:23203.0min: [2024-01-06 22:01:53,288][model8_pretrain.py][INFO] Epoch:[0/2](945500/4588595) loss:3.129 lr:0.0000100 epoch_Time:23203.0min: [2024-01-06 22:01:53,288][model8_pretrain.py][INFO] Epoch:[0/2](945500/4588595) loss:3.251 lr:0.0000100 epoch_Time:23203.0min: [2024-01-06 22:01:53,288][model8_pretrain.py][INFO] Epoch:[0/2](945500/4588595) loss:2.873 lr:0.0000100 epoch_Time:23203.0min: [2024-01-06 22:01:53,288][model8_pretrain.py][INFO] Epoch:[0/2](945500/4588595) loss:3.390 lr:0.0000100 epoch_Time:23203.0min: [2024-01-06 22:02:30,250][model8_pretrain.py][INFO] Epoch:[0/2](945600/4588595) loss:2.535 lr:0.0000100 epoch_Time:23203.0min: [2024-01-06 22:02:30,250][model8_pretrain.py][INFO] Epoch:[0/2](945600/4588595) loss:2.819 lr:0.0000100 epoch_Time:23203.0min: [2024-01-06 22:02:30,250][model8_pretrain.py][INFO] Epoch:[0/2](945600/4588595) loss:2.688 lr:0.0000100 epoch_Time:23203.0min: [2024-01-06 22:02:30,250][model8_pretrain.py][INFO] Epoch:[0/2](945600/4588595) loss:2.739 lr:0.0000100 epoch_Time:23203.0min: [2024-01-06 22:02:30,250][model8_pretrain.py][INFO] Epoch:[0/2](945600/4588595) loss:2.517 lr:0.0000100 epoch_Time:23203.0min: [2024-01-06 22:02:30,250][model8_pretrain.py][INFO] Epoch:[0/2](945600/4588595) loss:2.910 lr:0.0000100 epoch_Time:23203.0min: [2024-01-06 22:02:30,250][model8_pretrain.py][INFO] Epoch:[0/2](945600/4588595) loss:2.009 lr:0.0000100 epoch_Time:23203.0min: [2024-01-06 22:02:30,250][model8_pretrain.py][INFO] Epoch:[0/2](945600/4588595) loss:2.687 lr:0.0000100 epoch_Time:23203.0min: [2024-01-06 22:03:07,195][model8_pretrain.py][INFO] Epoch:[0/2](945700/4588595) loss:2.581 lr:0.0000100 epoch_Time:23202.0min: [2024-01-06 22:03:07,195][model8_pretrain.py][INFO] 
Epoch:[0/2](945700/4588595) loss:2.376 lr:0.0000100 epoch_Time:23202.0min: [2024-01-06 22:03:07,195][model8_pretrain.py][INFO] Epoch:[0/2](945700/4588595) loss:2.769 lr:0.0000100 epoch_Time:23202.0min: [2024-01-06 22:03:07,195][model8_pretrain.py][INFO] Epoch:[0/2](945700/4588595) loss:3.032 lr:0.0000100 epoch_Time:23202.0min: [2024-01-06 22:03:07,195][model8_pretrain.py][INFO] Epoch:[0/2](945700/4588595) loss:2.390 lr:0.0000100 epoch_Time:23202.0min: [2024-01-06 22:03:07,195][model8_pretrain.py][INFO] Epoch:[0/2](945700/4588595) loss:2.445 lr:0.0000100 epoch_Time:23202.0min: [2024-01-06 22:03:07,195][model8_pretrain.py][INFO] Epoch:[0/2](945700/4588595) loss:2.579 lr:0.0000100 epoch_Time:23202.0min: [2024-01-06 22:03:07,196][model8_pretrain.py][INFO] Epoch:[0/2](945700/4588595) loss:2.593 lr:0.0000100 epoch_Time:23202.0min: [2024-01-06 22:03:44,149][model8_pretrain.py][INFO] Epoch:[0/2](945800/4588595) loss:2.548 lr:0.0000100 epoch_Time:23202.0min: [2024-01-06 22:03:44,149][model8_pretrain.py][INFO] Epoch:[0/2](945800/4588595) loss:3.191 lr:0.0000100 epoch_Time:23202.0min: [2024-01-06 22:03:44,149][model8_pretrain.py][INFO] Epoch:[0/2](945800/4588595) loss:3.031 lr:0.0000100 epoch_Time:23202.0min: [2024-01-06 22:03:44,149][model8_pretrain.py][INFO] Epoch:[0/2](945800/4588595) loss:3.067 lr:0.0000100 epoch_Time:23202.0min: [2024-01-06 22:03:44,149][model8_pretrain.py][INFO] Epoch:[0/2](945800/4588595) loss:2.596 lr:0.0000100 epoch_Time:23202.0min: [2024-01-06 22:03:44,149][model8_pretrain.py][INFO] Epoch:[0/2](945800/4588595) loss:3.021 lr:0.0000100 epoch_Time:23202.0min: [2024-01-06 22:03:44,149][model8_pretrain.py][INFO] Epoch:[0/2](945800/4588595) loss:2.904 lr:0.0000100 epoch_Time:23202.0min: [2024-01-06 22:03:44,150][model8_pretrain.py][INFO] Epoch:[0/2](945800/4588595) loss:3.154 lr:0.0000100 epoch_Time:23202.0min: [2024-01-06 22:04:34,793][model8_pretrain.py][INFO] Epoch:[0/2](945900/4588595) loss:2.654 lr:0.0000100 epoch_Time:23202.0min: [2024-01-06 
22:04:34,793][model8_pretrain.py][INFO] Epoch:[0/2](945900/4588595) loss:2.289 lr:0.0000100 epoch_Time:23202.0min: [2024-01-06 22:04:34,793][model8_pretrain.py][INFO] Epoch:[0/2](945900/4588595) loss:2.551 lr:0.0000100 epoch_Time:23202.0min: [2024-01-06 22:04:34,793][model8_pretrain.py][INFO] Epoch:[0/2](945900/4588595) loss:2.820 lr:0.0000100 epoch_Time:23202.0min: [2024-01-06 22:04:34,793][model8_pretrain.py][INFO] Epoch:[0/2](945900/4588595) loss:2.901 lr:0.0000100 epoch_Time:23202.0min: [2024-01-06 22:04:34,793][model8_pretrain.py][INFO] Epoch:[0/2](945900/4588595) loss:2.222 lr:0.0000100 epoch_Time:23202.0min: [2024-01-06 22:04:34,793][model8_pretrain.py][INFO] Epoch:[0/2](945900/4588595) loss:2.706 lr:0.0000100 epoch_Time:23202.0min: [2024-01-06 22:04:34,793][model8_pretrain.py][INFO] Epoch:[0/2](945900/4588595) loss:2.569 lr:0.0000100 epoch_Time:23202.0min: [2024-01-06 22:05:11,745][model8_pretrain.py][INFO] Epoch:[0/2](946000/4588595) loss:2.743 lr:0.0000100 epoch_Time:23201.0min: [2024-01-06 22:05:11,745][model8_pretrain.py][INFO] Epoch:[0/2](946000/4588595) loss:2.300 lr:0.0000100 epoch_Time:23201.0min: [2024-01-06 22:05:11,745][model8_pretrain.py][INFO] Epoch:[0/2](946000/4588595) loss:2.212 lr:0.0000100 epoch_Time:23201.0min: [2024-01-06 22:05:11,745][model8_pretrain.py][INFO] Epoch:[0/2](946000/4588595) loss:3.181 lr:0.0000100 epoch_Time:23201.0min: [2024-01-06 22:05:11,745][model8_pretrain.py][INFO] Epoch:[0/2](946000/4588595) loss:3.064 lr:0.0000100 epoch_Time:23201.0min: [2024-01-06 22:05:11,745][model8_pretrain.py][INFO] Epoch:[0/2](946000/4588595) loss:2.740 lr:0.0000100 epoch_Time:23201.0min: [2024-01-06 22:05:11,745][model8_pretrain.py][INFO] Epoch:[0/2](946000/4588595) loss:3.200 lr:0.0000100 epoch_Time:23201.0min: [2024-01-06 22:05:11,745][model8_pretrain.py][INFO] Epoch:[0/2](946000/4588595) loss:2.650 lr:0.0000100 epoch_Time:23201.0min: [2024-01-06 22:05:48,691][model8_pretrain.py][INFO] Epoch:[0/2](946100/4588595) loss:2.510 lr:0.0000100 
epoch_Time:23200.0min: [2024-01-06 22:05:48,691][model8_pretrain.py][INFO] Epoch:[0/2](946100/4588595) loss:2.533 lr:0.0000100 epoch_Time:23200.0min: [2024-01-06 22:05:48,691][model8_pretrain.py][INFO] Epoch:[0/2](946100/4588595) loss:2.741 lr:0.0000100 epoch_Time:23200.0min: [2024-01-06 22:05:48,691][model8_pretrain.py][INFO] Epoch:[0/2](946100/4588595) loss:2.954 lr:0.0000100 epoch_Time:23200.0min: [2024-01-06 22:05:48,691][model8_pretrain.py][INFO] Epoch:[0/2](946100/4588595) loss:3.052 lr:0.0000100 epoch_Time:23200.0min: [2024-01-06 22:05:48,691][model8_pretrain.py][INFO] Epoch:[0/2](946100/4588595) loss:2.768 lr:0.0000100 epoch_Time:23200.0min: [2024-01-06 22:05:48,691][model8_pretrain.py][INFO] Epoch:[0/2](946100/4588595) loss:2.287 lr:0.0000100 epoch_Time:23200.0min: [2024-01-06 22:05:48,691][model8_pretrain.py][INFO] Epoch:[0/2](946100/4588595) loss:2.718 lr:0.0000100 epoch_Time:23200.0min: [2024-01-06 22:06:25,639][model8_pretrain.py][INFO] Epoch:[0/2](946200/4588595) loss:2.817 lr:0.0000100 epoch_Time:23200.0min: [2024-01-06 22:06:25,639][model8_pretrain.py][INFO] Epoch:[0/2](946200/4588595) loss:3.273 lr:0.0000100 epoch_Time:23200.0min: [2024-01-06 22:06:25,639][model8_pretrain.py][INFO] Epoch:[0/2](946200/4588595) loss:2.650 lr:0.0000100 epoch_Time:23200.0min: [2024-01-06 22:06:25,639][model8_pretrain.py][INFO] Epoch:[0/2](946200/4588595) loss:3.019 lr:0.0000100 epoch_Time:23200.0min: [2024-01-06 22:06:25,639][model8_pretrain.py][INFO] Epoch:[0/2](946200/4588595) loss:2.540 lr:0.0000100 epoch_Time:23200.0min: [2024-01-06 22:06:25,639][model8_pretrain.py][INFO] Epoch:[0/2](946200/4588595) loss:2.594 lr:0.0000100 epoch_Time:23200.0min: [2024-01-06 22:06:25,639][model8_pretrain.py][INFO] Epoch:[0/2](946200/4588595) loss:3.137 lr:0.0000100 epoch_Time:23200.0min: [2024-01-06 22:06:25,639][model8_pretrain.py][INFO] Epoch:[0/2](946200/4588595) loss:2.355 lr:0.0000100 epoch_Time:23200.0min: [2024-01-06 22:07:02,580][model8_pretrain.py][INFO] 
Epoch:[0/2](946300/4588595) loss:2.426 lr:0.0000100 epoch_Time:23199.0min: [2024-01-06 22:07:02,580][model8_pretrain.py][INFO] Epoch:[0/2](946300/4588595) loss:3.010 lr:0.0000100 epoch_Time:23199.0min: [2024-01-06 22:07:02,580][model8_pretrain.py][INFO] Epoch:[0/2](946300/4588595) loss:2.281 lr:0.0000100 epoch_Time:23199.0min: [2024-01-06 22:07:02,580][model8_pretrain.py][INFO] Epoch:[0/2](946300/4588595) loss:2.890 lr:0.0000100 epoch_Time:23199.0min: [2024-01-06 22:07:02,580][model8_pretrain.py][INFO] Epoch:[0/2](946300/4588595) loss:2.989 lr:0.0000100 epoch_Time:23199.0min: [2024-01-06 22:07:02,580][model8_pretrain.py][INFO] Epoch:[0/2](946300/4588595) loss:2.297 lr:0.0000100 epoch_Time:23199.0min: [2024-01-06 22:07:02,580][model8_pretrain.py][INFO] Epoch:[0/2](946300/4588595) loss:3.309 lr:0.0000100 epoch_Time:23199.0min: [2024-01-06 22:07:02,580][model8_pretrain.py][INFO] Epoch:[0/2](946300/4588595) loss:2.675 lr:0.0000100 epoch_Time:23199.0min: [2024-01-06 22:07:39,558][model8_pretrain.py][INFO] Epoch:[0/2](946400/4588595) loss:2.882 lr:0.0000100 epoch_Time:23198.0min: [2024-01-06 22:07:39,558][model8_pretrain.py][INFO] Epoch:[0/2](946400/4588595) loss:3.005 lr:0.0000100 epoch_Time:23198.0min: [2024-01-06 22:07:39,558][model8_pretrain.py][INFO] Epoch:[0/2](946400/4588595) loss:2.961 lr:0.0000100 epoch_Time:23198.0min: [2024-01-06 22:07:39,558][model8_pretrain.py][INFO] Epoch:[0/2](946400/4588595) loss:2.780 lr:0.0000100 epoch_Time:23198.0min: [2024-01-06 22:07:39,558][model8_pretrain.py][INFO] Epoch:[0/2](946400/4588595) loss:3.037 lr:0.0000100 epoch_Time:23198.0min: [2024-01-06 22:07:39,558][model8_pretrain.py][INFO] Epoch:[0/2](946400/4588595) loss:2.478 lr:0.0000100 epoch_Time:23198.0min: [2024-01-06 22:07:39,558][model8_pretrain.py][INFO] Epoch:[0/2](946400/4588595) loss:2.773 lr:0.0000100 epoch_Time:23198.0min: [2024-01-06 22:07:39,558][model8_pretrain.py][INFO] Epoch:[0/2](946400/4588595) loss:2.803 lr:0.0000100 epoch_Time:23198.0min: [2024-01-06 
22:08:16,489][model8_pretrain.py][INFO] Epoch:[0/2](946500/4588595) loss:2.364 lr:0.0000100 epoch_Time:23197.0min: [2024-01-06 22:08:16,489][model8_pretrain.py][INFO] Epoch:[0/2](946500/4588595) loss:2.948 lr:0.0000100 epoch_Time:23197.0min: [2024-01-06 22:08:16,489][model8_pretrain.py][INFO] Epoch:[0/2](946500/4588595) loss:2.892 lr:0.0000100 epoch_Time:23197.0min: [2024-01-06 22:08:16,489][model8_pretrain.py][INFO] Epoch:[0/2](946500/4588595) loss:3.154 lr:0.0000100 epoch_Time:23197.0min: [2024-01-06 22:08:16,489][model8_pretrain.py][INFO] Epoch:[0/2](946500/4588595) loss:2.695 lr:0.0000100 epoch_Time:23197.0min: [2024-01-06 22:08:16,489][model8_pretrain.py][INFO] Epoch:[0/2](946500/4588595) loss:2.485 lr:0.0000100 epoch_Time:23197.0min: [2024-01-06 22:08:16,489][model8_pretrain.py][INFO] Epoch:[0/2](946500/4588595) loss:2.517 lr:0.0000100 epoch_Time:23197.0min: [2024-01-06 22:08:16,489][model8_pretrain.py][INFO] Epoch:[0/2](946500/4588595) loss:2.385 lr:0.0000100 epoch_Time:23197.0min: [2024-01-06 22:08:53,433][model8_pretrain.py][INFO] Epoch:[0/2](946600/4588595) loss:2.767 lr:0.0000100 epoch_Time:23196.0min: [2024-01-06 22:08:53,433][model8_pretrain.py][INFO] Epoch:[0/2](946600/4588595) loss:2.808 lr:0.0000100 epoch_Time:23196.0min: [2024-01-06 22:08:53,433][model8_pretrain.py][INFO] Epoch:[0/2](946600/4588595) loss:2.927 lr:0.0000100 epoch_Time:23196.0min: [2024-01-06 22:08:53,433][model8_pretrain.py][INFO] Epoch:[0/2](946600/4588595) loss:2.385 lr:0.0000100 epoch_Time:23196.0min: [2024-01-06 22:08:53,433][model8_pretrain.py][INFO] Epoch:[0/2](946600/4588595) loss:2.880 lr:0.0000100 epoch_Time:23196.0min: [2024-01-06 22:08:53,433][model8_pretrain.py][INFO] Epoch:[0/2](946600/4588595) loss:2.998 lr:0.0000100 epoch_Time:23196.0min: [2024-01-06 22:08:53,434][model8_pretrain.py][INFO] Epoch:[0/2](946600/4588595) loss:2.896 lr:0.0000100 epoch_Time:23196.0min: [2024-01-06 22:08:53,434][model8_pretrain.py][INFO] Epoch:[0/2](946600/4588595) loss:2.078 lr:0.0000100 
epoch_Time:23196.0min: [2024-01-06 22:09:44,005][model8_pretrain.py][INFO] Epoch:[0/2](946700/4588595) loss:2.682 lr:0.0000100 epoch_Time:23197.0min: [2024-01-06 22:09:44,005][model8_pretrain.py][INFO] Epoch:[0/2](946700/4588595) loss:2.415 lr:0.0000100 epoch_Time:23197.0min: [2024-01-06 22:09:44,005][model8_pretrain.py][INFO] Epoch:[0/2](946700/4588595) loss:2.869 lr:0.0000100 epoch_Time:23197.0min: [2024-01-06 22:09:44,005][model8_pretrain.py][INFO] Epoch:[0/2](946700/4588595) loss:2.771 lr:0.0000100 epoch_Time:23197.0min: [2024-01-06 22:09:44,005][model8_pretrain.py][INFO] Epoch:[0/2](946700/4588595) loss:2.770 lr:0.0000100 epoch_Time:23197.0min: [2024-01-06 22:09:44,005][model8_pretrain.py][INFO] Epoch:[0/2](946700/4588595) loss:2.607 lr:0.0000100 epoch_Time:23197.0min: [2024-01-06 22:09:44,005][model8_pretrain.py][INFO] Epoch:[0/2](946700/4588595) loss:3.129 lr:0.0000100 epoch_Time:23197.0min: [2024-01-06 22:09:44,005][model8_pretrain.py][INFO] Epoch:[0/2](946700/4588595) loss:3.090 lr:0.0000100 epoch_Time:23197.0min: [2024-01-06 22:10:20,951][model8_pretrain.py][INFO] Epoch:[0/2](946800/4588595) loss:2.925 lr:0.0000100 epoch_Time:23196.0min: [2024-01-06 22:10:20,951][model8_pretrain.py][INFO] Epoch:[0/2](946800/4588595) loss:2.371 lr:0.0000100 epoch_Time:23196.0min: [2024-01-06 22:10:20,951][model8_pretrain.py][INFO] Epoch:[0/2](946800/4588595) loss:1.865 lr:0.0000100 epoch_Time:23196.0min: [2024-01-06 22:10:20,951][model8_pretrain.py][INFO] Epoch:[0/2](946800/4588595) loss:2.269 lr:0.0000100 epoch_Time:23196.0min: [2024-01-06 22:10:20,951][model8_pretrain.py][INFO] Epoch:[0/2](946800/4588595) loss:2.925 lr:0.0000100 epoch_Time:23196.0min: [2024-01-06 22:10:20,951][model8_pretrain.py][INFO] Epoch:[0/2](946800/4588595) loss:2.682 lr:0.0000100 epoch_Time:23196.0min: [2024-01-06 22:10:20,952][model8_pretrain.py][INFO] Epoch:[0/2](946800/4588595) loss:2.755 lr:0.0000100 epoch_Time:23196.0min: [2024-01-06 22:10:20,952][model8_pretrain.py][INFO] 
Epoch:[0/2](946800/4588595) loss:2.743 lr:0.0000100 epoch_Time:23196.0min: [2024-01-06 22:10:57,891][model8_pretrain.py][INFO] Epoch:[0/2](946900/4588595) loss:2.604 lr:0.0000100 epoch_Time:23195.0min: [2024-01-06 22:10:57,891][model8_pretrain.py][INFO] Epoch:[0/2](946900/4588595) loss:3.170 lr:0.0000100 epoch_Time:23195.0min: [2024-01-06 22:10:57,891][model8_pretrain.py][INFO] Epoch:[0/2](946900/4588595) loss:3.205 lr:0.0000100 epoch_Time:23195.0min: [2024-01-06 22:10:57,891][model8_pretrain.py][INFO] Epoch:[0/2](946900/4588595) loss:2.930 lr:0.0000100 epoch_Time:23195.0min: [2024-01-06 22:10:57,891][model8_pretrain.py][INFO] Epoch:[0/2](946900/4588595) loss:2.962 lr:0.0000100 epoch_Time:23195.0min: [2024-01-06 22:10:57,891][model8_pretrain.py][INFO] Epoch:[0/2](946900/4588595) loss:3.040 lr:0.0000100 epoch_Time:23195.0min: [2024-01-06 22:10:57,891][model8_pretrain.py][INFO] Epoch:[0/2](946900/4588595) loss:2.604 lr:0.0000100 epoch_Time:23195.0min: [2024-01-06 22:10:57,891][model8_pretrain.py][INFO] Epoch:[0/2](946900/4588595) loss:2.717 lr:0.0000100 epoch_Time:23195.0min: [2024-01-06 22:11:34,840][model8_pretrain.py][INFO] Epoch:[0/2](947000/4588595) loss:2.859 lr:0.0000100 epoch_Time:23195.0min: [2024-01-06 22:11:34,840][model8_pretrain.py][INFO] Epoch:[0/2](947000/4588595) loss:2.384 lr:0.0000100 epoch_Time:23195.0min: [2024-01-06 22:11:34,840][model8_pretrain.py][INFO] Epoch:[0/2](947000/4588595) loss:3.099 lr:0.0000100 epoch_Time:23195.0min: [2024-01-06 22:11:34,840][model8_pretrain.py][INFO] Epoch:[0/2](947000/4588595) loss:2.186 lr:0.0000100 epoch_Time:23195.0min: [2024-01-06 22:11:34,840][model8_pretrain.py][INFO] Epoch:[0/2](947000/4588595) loss:3.292 lr:0.0000100 epoch_Time:23195.0min: [2024-01-06 22:11:34,840][model8_pretrain.py][INFO] Epoch:[0/2](947000/4588595) loss:2.386 lr:0.0000100 epoch_Time:23195.0min: [2024-01-06 22:11:34,840][model8_pretrain.py][INFO] Epoch:[0/2](947000/4588595) loss:2.008 lr:0.0000100 epoch_Time:23195.0min: [2024-01-06 
22:11:34,840][model8_pretrain.py][INFO] Epoch:[0/2](947000/4588595) loss:3.241 lr:0.0000100 epoch_Time:23195.0min: [2024-01-06 22:12:11,788][model8_pretrain.py][INFO] Epoch:[0/2](947100/4588595) loss:3.144 lr:0.0000100 epoch_Time:23194.0min: [2024-01-06 22:12:11,788][model8_pretrain.py][INFO] Epoch:[0/2](947100/4588595) loss:3.649 lr:0.0000100 epoch_Time:23194.0min: [2024-01-06 22:12:11,788][model8_pretrain.py][INFO] Epoch:[0/2](947100/4588595) loss:2.721 lr:0.0000100 epoch_Time:23194.0min: [2024-01-06 22:12:11,788][model8_pretrain.py][INFO] Epoch:[0/2](947100/4588595) loss:3.189 lr:0.0000100 epoch_Time:23194.0min: [2024-01-06 22:12:11,788][model8_pretrain.py][INFO] Epoch:[0/2](947100/4588595) loss:2.264 lr:0.0000100 epoch_Time:23194.0min: [2024-01-06 22:12:11,788][model8_pretrain.py][INFO] Epoch:[0/2](947100/4588595) loss:2.655 lr:0.0000100 epoch_Time:23194.0min: [2024-01-06 22:12:11,788][model8_pretrain.py][INFO] Epoch:[0/2](947100/4588595) loss:2.681 lr:0.0000100 epoch_Time:23194.0min: [2024-01-06 22:12:11,788][model8_pretrain.py][INFO] Epoch:[0/2](947100/4588595) loss:2.937 lr:0.0000100 epoch_Time:23194.0min: [2024-01-06 22:12:48,733][model8_pretrain.py][INFO] Epoch:[0/2](947200/4588595) loss:3.056 lr:0.0000100 epoch_Time:23193.0min: [2024-01-06 22:12:48,733][model8_pretrain.py][INFO] Epoch:[0/2](947200/4588595) loss:3.122 lr:0.0000100 epoch_Time:23193.0min: [2024-01-06 22:12:48,733][model8_pretrain.py][INFO] Epoch:[0/2](947200/4588595) loss:2.785 lr:0.0000100 epoch_Time:23193.0min: [2024-01-06 22:12:48,733][model8_pretrain.py][INFO] Epoch:[0/2](947200/4588595) loss:2.572 lr:0.0000100 epoch_Time:23193.0min: [2024-01-06 22:12:48,733][model8_pretrain.py][INFO] Epoch:[0/2](947200/4588595) loss:2.452 lr:0.0000100 epoch_Time:23193.0min: [2024-01-06 22:12:48,733][model8_pretrain.py][INFO] Epoch:[0/2](947200/4588595) loss:2.294 lr:0.0000100 epoch_Time:23193.0min: [2024-01-06 22:12:48,733][model8_pretrain.py][INFO] Epoch:[0/2](947200/4588595) loss:2.910 lr:0.0000100 
epoch_Time:23193.0min: [2024-01-06 22:12:48,733][model8_pretrain.py][INFO] Epoch:[0/2](947200/4588595) loss:2.312 lr:0.0000100 epoch_Time:23193.0min: [2024-01-06 22:13:25,688][model8_pretrain.py][INFO] Epoch:[0/2](947300/4588595) loss:2.511 lr:0.0000100 epoch_Time:23193.0min: [2024-01-06 22:13:25,688][model8_pretrain.py][INFO] Epoch:[0/2](947300/4588595) loss:2.973 lr:0.0000100 epoch_Time:23193.0min: [2024-01-06 22:13:25,688][model8_pretrain.py][INFO] Epoch:[0/2](947300/4588595) loss:2.549 lr:0.0000100 epoch_Time:23193.0min: [2024-01-06 22:13:25,688][model8_pretrain.py][INFO] Epoch:[0/2](947300/4588595) loss:2.707 lr:0.0000100 epoch_Time:23193.0min: [2024-01-06 22:13:25,688][model8_pretrain.py][INFO] Epoch:[0/2](947300/4588595) loss:2.592 lr:0.0000100 epoch_Time:23193.0min: [2024-01-06 22:13:25,688][model8_pretrain.py][INFO] Epoch:[0/2](947300/4588595) loss:2.383 lr:0.0000100 epoch_Time:23193.0min: [2024-01-06 22:13:25,688][model8_pretrain.py][INFO] Epoch:[0/2](947300/4588595) loss:2.603 lr:0.0000100 epoch_Time:23193.0min: [2024-01-06 22:13:25,688][model8_pretrain.py][INFO] Epoch:[0/2](947300/4588595) loss:3.179 lr:0.0000100 epoch_Time:23193.0min: [2024-01-06 22:14:02,632][model8_pretrain.py][INFO] Epoch:[0/2](947400/4588595) loss:2.988 lr:0.0000100 epoch_Time:23191.0min: [2024-01-06 22:14:02,632][model8_pretrain.py][INFO] Epoch:[0/2](947400/4588595) loss:2.282 lr:0.0000100 epoch_Time:23191.0min: [2024-01-06 22:14:02,632][model8_pretrain.py][INFO] Epoch:[0/2](947400/4588595) loss:2.938 lr:0.0000100 epoch_Time:23191.0min: [2024-01-06 22:14:02,633][model8_pretrain.py][INFO] Epoch:[0/2](947400/4588595) loss:2.872 lr:0.0000100 epoch_Time:23191.0min: [2024-01-06 22:14:02,632][model8_pretrain.py][INFO] Epoch:[0/2](947400/4588595) loss:2.787 lr:0.0000100 epoch_Time:23191.0min: [2024-01-06 22:14:02,633][model8_pretrain.py][INFO] Epoch:[0/2](947400/4588595) loss:2.493 lr:0.0000100 epoch_Time:23191.0min: [2024-01-06 22:14:02,633][model8_pretrain.py][INFO] 
Epoch:[0/2](947400/4588595) loss:2.222 lr:0.0000100 epoch_Time:23191.0min: [2024-01-06 22:14:02,633][model8_pretrain.py][INFO] Epoch:[0/2](947400/4588595) loss:2.385 lr:0.0000100 epoch_Time:23191.0min: [2024-01-06 22:14:53,368][model8_pretrain.py][INFO] Epoch:[0/2](947500/4588595) loss:2.513 lr:0.0000100 epoch_Time:23192.0min: [2024-01-06 22:14:53,368][model8_pretrain.py][INFO] Epoch:[0/2](947500/4588595) loss:2.766 lr:0.0000100 epoch_Time:23191.0min: [2024-01-06 22:14:53,368][model8_pretrain.py][INFO] Epoch:[0/2](947500/4588595) loss:2.272 lr:0.0000100 epoch_Time:23192.0min: [2024-01-06 22:14:53,368][model8_pretrain.py][INFO] Epoch:[0/2](947500/4588595) loss:2.931 lr:0.0000100 epoch_Time:23192.0min: [2024-01-06 22:14:53,368][model8_pretrain.py][INFO] Epoch:[0/2](947500/4588595) loss:2.353 lr:0.0000100 epoch_Time:23192.0min: [2024-01-06 22:14:53,368][model8_pretrain.py][INFO] Epoch:[0/2](947500/4588595) loss:2.655 lr:0.0000100 epoch_Time:23192.0min: [2024-01-06 22:14:53,368][model8_pretrain.py][INFO] Epoch:[0/2](947500/4588595) loss:3.044 lr:0.0000100 epoch_Time:23191.0min: [2024-01-06 22:14:53,368][model8_pretrain.py][INFO] Epoch:[0/2](947500/4588595) loss:2.787 lr:0.0000100 epoch_Time:23192.0min: [2024-01-06 22:15:30,303][model8_pretrain.py][INFO] Epoch:[0/2](947600/4588595) loss:3.154 lr:0.0000100 epoch_Time:23191.0min: [2024-01-06 22:15:30,303][model8_pretrain.py][INFO] Epoch:[0/2](947600/4588595) loss:3.102 lr:0.0000100 epoch_Time:23191.0min: [2024-01-06 22:15:30,303][model8_pretrain.py][INFO] Epoch:[0/2](947600/4588595) loss:2.243 lr:0.0000100 epoch_Time:23191.0min: [2024-01-06 22:15:30,303][model8_pretrain.py][INFO] Epoch:[0/2](947600/4588595) loss:3.300 lr:0.0000100 epoch_Time:23191.0min: [2024-01-06 22:15:30,303][model8_pretrain.py][INFO] Epoch:[0/2](947600/4588595) loss:2.847 lr:0.0000100 epoch_Time:23191.0min: [2024-01-06 22:15:30,303][model8_pretrain.py][INFO] Epoch:[0/2](947600/4588595) loss:2.811 lr:0.0000100 epoch_Time:23191.0min: [2024-01-06 
22:15:30,303][model8_pretrain.py][INFO] Epoch:[0/2](947600/4588595) loss:2.695 lr:0.0000100 epoch_Time:23191.0min: [2024-01-06 22:15:30,303][model8_pretrain.py][INFO] Epoch:[0/2](947600/4588595) loss:2.809 lr:0.0000100 epoch_Time:23191.0min: [2024-01-06 22:16:07,247][model8_pretrain.py][INFO] Epoch:[0/2](947700/4588595) loss:2.595 lr:0.0000100 epoch_Time:23190.0min: [2024-01-06 22:16:07,247][model8_pretrain.py][INFO] Epoch:[0/2](947700/4588595) loss:3.491 lr:0.0000100 epoch_Time:23190.0min: [2024-01-06 22:16:07,247][model8_pretrain.py][INFO] Epoch:[0/2](947700/4588595) loss:2.949 lr:0.0000100 epoch_Time:23190.0min: [2024-01-06 22:16:07,247][model8_pretrain.py][INFO] Epoch:[0/2](947700/4588595) loss:3.035 lr:0.0000100 epoch_Time:23190.0min: [2024-01-06 22:16:07,247][model8_pretrain.py][INFO] Epoch:[0/2](947700/4588595) loss:3.058 lr:0.0000100 epoch_Time:23190.0min: [2024-01-06 22:16:07,247][model8_pretrain.py][INFO] Epoch:[0/2](947700/4588595) loss:2.989 lr:0.0000100 epoch_Time:23190.0min: [2024-01-06 22:16:07,247][model8_pretrain.py][INFO] Epoch:[0/2](947700/4588595) loss:3.312 lr:0.0000100 epoch_Time:23190.0min: [2024-01-06 22:16:07,247][model8_pretrain.py][INFO] Epoch:[0/2](947700/4588595) loss:2.776 lr:0.0000100 epoch_Time:23190.0min: [2024-01-06 22:16:44,185][model8_pretrain.py][INFO] Epoch:[0/2](947800/4588595) loss:2.416 lr:0.0000100 epoch_Time:23190.0min: [2024-01-06 22:16:44,185][model8_pretrain.py][INFO] Epoch:[0/2](947800/4588595) loss:2.926 lr:0.0000100 epoch_Time:23190.0min: [2024-01-06 22:16:44,185][model8_pretrain.py][INFO] Epoch:[0/2](947800/4588595) loss:2.832 lr:0.0000100 epoch_Time:23190.0min: [2024-01-06 22:16:44,185][model8_pretrain.py][INFO] Epoch:[0/2](947800/4588595) loss:2.813 lr:0.0000100 epoch_Time:23190.0min: [2024-01-06 22:16:44,185][model8_pretrain.py][INFO] Epoch:[0/2](947800/4588595) loss:3.096 lr:0.0000100 epoch_Time:23190.0min: [2024-01-06 22:16:44,185][model8_pretrain.py][INFO] Epoch:[0/2](947800/4588595) loss:2.509 lr:0.0000100 
epoch_Time:23190.0min: [2024-01-06 22:16:44,185][model8_pretrain.py][INFO] Epoch:[0/2](947800/4588595) loss:2.885 lr:0.0000100 epoch_Time:23190.0min: [2024-01-06 22:16:44,185][model8_pretrain.py][INFO] Epoch:[0/2](947800/4588595) loss:2.758 lr:0.0000100 epoch_Time:23190.0min: [2024-01-06 22:17:21,138][model8_pretrain.py][INFO] Epoch:[0/2](947900/4588595) loss:2.749 lr:0.0000100 epoch_Time:23189.0min: [2024-01-06 22:17:21,138][model8_pretrain.py][INFO] Epoch:[0/2](947900/4588595) loss:3.326 lr:0.0000100 epoch_Time:23189.0min: [2024-01-06 22:17:21,138][model8_pretrain.py][INFO] Epoch:[0/2](947900/4588595) loss:2.632 lr:0.0000100 epoch_Time:23189.0min: [2024-01-06 22:17:21,138][model8_pretrain.py][INFO] Epoch:[0/2](947900/4588595) loss:3.023 lr:0.0000100 epoch_Time:23189.0min: [2024-01-06 22:17:21,138][model8_pretrain.py][INFO] Epoch:[0/2](947900/4588595) loss:2.573 lr:0.0000100 epoch_Time:23189.0min: [2024-01-06 22:17:21,138][model8_pretrain.py][INFO] Epoch:[0/2](947900/4588595) loss:2.528 lr:0.0000100 epoch_Time:23189.0min: [2024-01-06 22:17:21,138][model8_pretrain.py][INFO] Epoch:[0/2](947900/4588595) loss:3.051 lr:0.0000100 epoch_Time:23189.0min: [2024-01-06 22:17:21,138][model8_pretrain.py][INFO] Epoch:[0/2](947900/4588595) loss:2.928 lr:0.0000100 epoch_Time:23189.0min: [2024-01-06 22:17:58,078][model8_pretrain.py][INFO] Epoch:[0/2](948000/4588595) loss:2.442 lr:0.0000100 epoch_Time:23188.0min: [2024-01-06 22:17:58,078][model8_pretrain.py][INFO] Epoch:[0/2](948000/4588595) loss:2.757 lr:0.0000100 epoch_Time:23188.0min: [2024-01-06 22:17:58,078][model8_pretrain.py][INFO] Epoch:[0/2](948000/4588595) loss:3.126 lr:0.0000100 epoch_Time:23188.0min: [2024-01-06 22:17:58,078][model8_pretrain.py][INFO] Epoch:[0/2](948000/4588595) loss:2.261 lr:0.0000100 epoch_Time:23188.0min: [2024-01-06 22:17:58,078][model8_pretrain.py][INFO] Epoch:[0/2](948000/4588595) loss:2.503 lr:0.0000100 epoch_Time:23188.0min: [2024-01-06 22:17:58,078][model8_pretrain.py][INFO] 
Epoch:[0/2](948000/4588595) loss:2.880 lr:0.0000100 epoch_Time:23188.0min: [2024-01-06 22:17:58,078][model8_pretrain.py][INFO] Epoch:[0/2](948000/4588595) loss:2.855 lr:0.0000100 epoch_Time:23188.0min: [2024-01-06 22:17:58,079][model8_pretrain.py][INFO] Epoch:[0/2](948000/4588595) loss:2.765 lr:0.0000100 epoch_Time:23188.0min: [2024-01-06 22:18:35,024][model8_pretrain.py][INFO] Epoch:[0/2](948100/4588595) loss:2.566 lr:0.0000100 epoch_Time:23188.0min: [2024-01-06 22:18:35,024][model8_pretrain.py][INFO] Epoch:[0/2](948100/4588595) loss:2.691 lr:0.0000100 epoch_Time:23188.0min: [2024-01-06 22:18:35,024][model8_pretrain.py][INFO] Epoch:[0/2](948100/4588595) loss:2.977 lr:0.0000100 epoch_Time:23188.0min: [2024-01-06 22:18:35,024][model8_pretrain.py][INFO] Epoch:[0/2](948100/4588595) loss:2.757 lr:0.0000100 epoch_Time:23188.0min: [2024-01-06 22:18:35,024][model8_pretrain.py][INFO] Epoch:[0/2](948100/4588595) loss:3.226 lr:0.0000100 epoch_Time:23188.0min: [2024-01-06 22:18:35,024][model8_pretrain.py][INFO] Epoch:[0/2](948100/4588595) loss:3.242 lr:0.0000100 epoch_Time:23188.0min: [2024-01-06 22:18:35,024][model8_pretrain.py][INFO] Epoch:[0/2](948100/4588595) loss:2.578 lr:0.0000100 epoch_Time:23188.0min: [2024-01-06 22:18:35,024][model8_pretrain.py][INFO] Epoch:[0/2](948100/4588595) loss:2.446 lr:0.0000100 epoch_Time:23188.0min: [2024-01-06 22:19:11,966][model8_pretrain.py][INFO] Epoch:[0/2](948200/4588595) loss:3.089 lr:0.0000100 epoch_Time:23187.0min: [2024-01-06 22:19:11,966][model8_pretrain.py][INFO] Epoch:[0/2](948200/4588595) loss:2.948 lr:0.0000100 epoch_Time:23187.0min: [2024-01-06 22:19:11,966][model8_pretrain.py][INFO] Epoch:[0/2](948200/4588595) loss:3.075 lr:0.0000100 epoch_Time:23187.0min: [2024-01-06 22:19:11,966][model8_pretrain.py][INFO] Epoch:[0/2](948200/4588595) loss:2.662 lr:0.0000100 epoch_Time:23187.0min: [2024-01-06 22:19:11,966][model8_pretrain.py][INFO] Epoch:[0/2](948200/4588595) loss:2.881 lr:0.0000100 epoch_Time:23187.0min: [2024-01-06 
22:19:11,966][model8_pretrain.py][INFO] Epoch:[0/2](948200/4588595) loss:2.816 lr:0.0000100 epoch_Time:23187.0min: [2024-01-06 22:19:11,966][model8_pretrain.py][INFO] Epoch:[0/2](948200/4588595) loss:3.044 lr:0.0000100 epoch_Time:23187.0min: [2024-01-06 22:19:11,967][model8_pretrain.py][INFO] Epoch:[0/2](948200/4588595) loss:2.935 lr:0.0000100 epoch_Time:23187.0min: [2024-01-06 22:20:02,733][model8_pretrain.py][INFO] Epoch:[0/2](948300/4588595) loss:2.715 lr:0.0000100 epoch_Time:23187.0min: [2024-01-06 22:20:02,733][model8_pretrain.py][INFO] Epoch:[0/2](948300/4588595) loss:3.342 lr:0.0000100 epoch_Time:23187.0min: [2024-01-06 22:20:02,733][model8_pretrain.py][INFO] Epoch:[0/2](948300/4588595) loss:2.777 lr:0.0000100 epoch_Time:23187.0min: [2024-01-06 22:20:02,733][model8_pretrain.py][INFO] Epoch:[0/2](948300/4588595) loss:2.957 lr:0.0000100 epoch_Time:23187.0min: [2024-01-06 22:20:02,733][model8_pretrain.py][INFO] Epoch:[0/2](948300/4588595) loss:2.219 lr:0.0000100 epoch_Time:23187.0min: [2024-01-06 22:20:02,733][model8_pretrain.py][INFO] Epoch:[0/2](948300/4588595) loss:2.770 lr:0.0000100 epoch_Time:23187.0min: [2024-01-06 22:20:02,733][model8_pretrain.py][INFO] Epoch:[0/2](948300/4588595) loss:2.732 lr:0.0000100 epoch_Time:23187.0min: [2024-01-06 22:20:02,733][model8_pretrain.py][INFO] Epoch:[0/2](948300/4588595) loss:2.856 lr:0.0000100 epoch_Time:23187.0min: [2024-01-06 22:20:39,667][model8_pretrain.py][INFO] Epoch:[0/2](948400/4588595) loss:2.569 lr:0.0000100 epoch_Time:23187.0min: [2024-01-06 22:20:39,667][model8_pretrain.py][INFO] Epoch:[0/2](948400/4588595) loss:3.129 lr:0.0000100 epoch_Time:23187.0min: [2024-01-06 22:20:39,667][model8_pretrain.py][INFO] Epoch:[0/2](948400/4588595) loss:3.160 lr:0.0000100 epoch_Time:23187.0min: [2024-01-06 22:20:39,667][model8_pretrain.py][INFO] Epoch:[0/2](948400/4588595) loss:2.993 lr:0.0000100 epoch_Time:23187.0min: [2024-01-06 22:20:39,667][model8_pretrain.py][INFO] Epoch:[0/2](948400/4588595) loss:3.197 lr:0.0000100 
epoch_Time:23187.0min: [2024-01-06 22:20:39,667][model8_pretrain.py][INFO] Epoch:[0/2](948400/4588595) loss:3.186 lr:0.0000100 epoch_Time:23187.0min: [2024-01-06 22:20:39,667][model8_pretrain.py][INFO] Epoch:[0/2](948400/4588595) loss:2.720 lr:0.0000100 epoch_Time:23187.0min: [2024-01-06 22:20:39,667][model8_pretrain.py][INFO] Epoch:[0/2](948400/4588595) loss:2.868 lr:0.0000100 epoch_Time:23187.0min: [2024-01-06 22:21:16,616][model8_pretrain.py][INFO] Epoch:[0/2](948500/4588595) loss:2.965 lr:0.0000100 epoch_Time:23186.0min: [2024-01-06 22:21:16,616][model8_pretrain.py][INFO] Epoch:[0/2](948500/4588595) loss:2.792 lr:0.0000100 epoch_Time:23186.0min: [2024-01-06 22:21:16,616][model8_pretrain.py][INFO] Epoch:[0/2](948500/4588595) loss:2.919 lr:0.0000100 epoch_Time:23186.0min: [2024-01-06 22:21:16,616][model8_pretrain.py][INFO] Epoch:[0/2](948500/4588595) loss:2.505 lr:0.0000100 epoch_Time:23186.0min: [2024-01-06 22:21:16,616][model8_pretrain.py][INFO] Epoch:[0/2](948500/4588595) loss:2.628 lr:0.0000100 epoch_Time:23186.0min: [2024-01-06 22:21:16,616][model8_pretrain.py][INFO] Epoch:[0/2](948500/4588595) loss:2.764 lr:0.0000100 epoch_Time:23186.0min: [2024-01-06 22:21:16,616][model8_pretrain.py][INFO] Epoch:[0/2](948500/4588595) loss:2.632 lr:0.0000100 epoch_Time:23186.0min: [2024-01-06 22:21:16,616][model8_pretrain.py][INFO] Epoch:[0/2](948500/4588595) loss:2.170 lr:0.0000100 epoch_Time:23186.0min: [2024-01-06 22:21:53,570][model8_pretrain.py][INFO] Epoch:[0/2](948600/4588595) loss:2.583 lr:0.0000100 epoch_Time:23184.0min: [2024-01-06 22:21:53,570][model8_pretrain.py][INFO] Epoch:[0/2](948600/4588595) loss:3.290 lr:0.0000100 epoch_Time:23184.0min: [2024-01-06 22:21:53,570][model8_pretrain.py][INFO] Epoch:[0/2](948600/4588595) loss:2.819 lr:0.0000100 epoch_Time:23184.0min: [2024-01-06 22:21:53,570][model8_pretrain.py][INFO] Epoch:[0/2](948600/4588595) loss:2.902 lr:0.0000100 epoch_Time:23184.0min: [2024-01-06 22:21:53,570][model8_pretrain.py][INFO] 
Epoch:[0/2](948600/4588595) loss:3.033 lr:0.0000100 epoch_Time:23184.0min: [2024-01-06 22:21:53,570][model8_pretrain.py][INFO] Epoch:[0/2](948600/4588595) loss:1.975 lr:0.0000100 epoch_Time:23184.0min: [2024-01-06 22:21:53,570][model8_pretrain.py][INFO] Epoch:[0/2](948600/4588595) loss:2.590 lr:0.0000100 epoch_Time:23184.0min: [2024-01-06 22:21:53,571][model8_pretrain.py][INFO] Epoch:[0/2](948600/4588595) loss:3.104 lr:0.0000100 epoch_Time:23184.0min: [2024-01-06 22:22:30,547][model8_pretrain.py][INFO] Epoch:[0/2](948700/4588595) loss:2.786 lr:0.0000100 epoch_Time:23184.0min: [2024-01-06 22:22:30,547][model8_pretrain.py][INFO] Epoch:[0/2](948700/4588595) loss:3.063 lr:0.0000100 epoch_Time:23184.0min: [2024-01-06 22:22:30,547][model8_pretrain.py][INFO] Epoch:[0/2](948700/4588595) loss:3.122 lr:0.0000100 epoch_Time:23184.0min: [2024-01-06 22:22:30,547][model8_pretrain.py][INFO] Epoch:[0/2](948700/4588595) loss:3.280 lr:0.0000100 epoch_Time:23184.0min: [2024-01-06 22:22:30,547][model8_pretrain.py][INFO] Epoch:[0/2](948700/4588595) loss:2.477 lr:0.0000100 epoch_Time:23184.0min: [2024-01-06 22:22:30,547][model8_pretrain.py][INFO] Epoch:[0/2](948700/4588595) loss:2.714 lr:0.0000100 epoch_Time:23184.0min: [2024-01-06 22:22:30,547][model8_pretrain.py][INFO] Epoch:[0/2](948700/4588595) loss:3.499 lr:0.0000100 epoch_Time:23184.0min: [2024-01-06 22:22:30,547][model8_pretrain.py][INFO] Epoch:[0/2](948700/4588595) loss:2.841 lr:0.0000100 epoch_Time:23184.0min: [2024-01-06 22:23:07,513][model8_pretrain.py][INFO] Epoch:[0/2](948800/4588595) loss:2.855 lr:0.0000100 epoch_Time:23183.0min: [2024-01-06 22:23:07,513][model8_pretrain.py][INFO] Epoch:[0/2](948800/4588595) loss:2.421 lr:0.0000100 epoch_Time:23183.0min: [2024-01-06 22:23:07,513][model8_pretrain.py][INFO] Epoch:[0/2](948800/4588595) loss:3.173 lr:0.0000100 epoch_Time:23183.0min: [2024-01-06 22:23:07,513][model8_pretrain.py][INFO] Epoch:[0/2](948800/4588595) loss:2.510 lr:0.0000100 epoch_Time:23183.0min: [2024-01-06 
22:23:07,513][model8_pretrain.py][INFO] Epoch:[0/2](948800/4588595) loss:2.257 lr:0.0000100 epoch_Time:23183.0min: [2024-01-06 22:23:07,513][model8_pretrain.py][INFO] Epoch:[0/2](948800/4588595) loss:2.065 lr:0.0000100 epoch_Time:23183.0min: [2024-01-06 22:23:07,513][model8_pretrain.py][INFO] Epoch:[0/2](948800/4588595) loss:2.765 lr:0.0000100 epoch_Time:23183.0min: [2024-01-06 22:23:07,513][model8_pretrain.py][INFO] Epoch:[0/2](948800/4588595) loss:2.852 lr:0.0000100 epoch_Time:23183.0min: [2024-01-06 22:23:44,490][model8_pretrain.py][INFO] Epoch:[0/2](948900/4588595) loss:3.298 lr:0.0000100 epoch_Time:23183.0min: [2024-01-06 22:23:44,490][model8_pretrain.py][INFO] Epoch:[0/2](948900/4588595) loss:2.219 lr:0.0000100 epoch_Time:23183.0min: [2024-01-06 22:23:44,490][model8_pretrain.py][INFO] Epoch:[0/2](948900/4588595) loss:3.135 lr:0.0000100 epoch_Time:23183.0min: [2024-01-06 22:23:44,490][model8_pretrain.py][INFO] Epoch:[0/2](948900/4588595) loss:2.391 lr:0.0000100 epoch_Time:23183.0min: [2024-01-06 22:23:44,490][model8_pretrain.py][INFO] Epoch:[0/2](948900/4588595) loss:3.049 lr:0.0000100 epoch_Time:23183.0min: [2024-01-06 22:23:44,490][model8_pretrain.py][INFO] Epoch:[0/2](948900/4588595) loss:2.552 lr:0.0000100 epoch_Time:23183.0min: [2024-01-06 22:23:44,490][model8_pretrain.py][INFO] Epoch:[0/2](948900/4588595) loss:3.079 lr:0.0000100 epoch_Time:23183.0min: [2024-01-06 22:23:44,490][model8_pretrain.py][INFO] Epoch:[0/2](948900/4588595) loss:2.725 lr:0.0000100 epoch_Time:23183.0min: [2024-01-06 22:24:21,449][model8_pretrain.py][INFO] Epoch:[0/2](949000/4588595) loss:2.285 lr:0.0000100 epoch_Time:23182.0min: [2024-01-06 22:24:21,449][model8_pretrain.py][INFO] Epoch:[0/2](949000/4588595) loss:3.019 lr:0.0000100 epoch_Time:23182.0min: [2024-01-06 22:24:21,449][model8_pretrain.py][INFO] Epoch:[0/2](949000/4588595) loss:2.815 lr:0.0000100 epoch_Time:23182.0min: [2024-01-06 22:24:21,449][model8_pretrain.py][INFO] Epoch:[0/2](949000/4588595) loss:2.068 lr:0.0000100 
epoch_Time:23182.0min: [2024-01-06 22:24:21,449][model8_pretrain.py][INFO] Epoch:[0/2](949000/4588595) loss:3.372 lr:0.0000100 epoch_Time:23182.0min: [2024-01-06 22:24:21,449][model8_pretrain.py][INFO] Epoch:[0/2](949000/4588595) loss:2.928 lr:0.0000100 epoch_Time:23182.0min: [2024-01-06 22:24:21,449][model8_pretrain.py][INFO] Epoch:[0/2](949000/4588595) loss:2.736 lr:0.0000100 epoch_Time:23182.0min: [2024-01-06 22:24:21,449][model8_pretrain.py][INFO] Epoch:[0/2](949000/4588595) loss:2.440 lr:0.0000100 epoch_Time:23182.0min: [2024-01-06 22:25:12,112][model8_pretrain.py][INFO] Epoch:[0/2](949100/4588595) loss:3.192 lr:0.0000100 epoch_Time:23182.0min: [2024-01-06 22:25:12,112][model8_pretrain.py][INFO] Epoch:[0/2](949100/4588595) loss:2.836 lr:0.0000100 epoch_Time:23182.0min: [2024-01-06 22:25:12,112][model8_pretrain.py][INFO] Epoch:[0/2](949100/4588595) loss:3.344 lr:0.0000100 epoch_Time:23182.0min: [2024-01-06 22:25:12,112][model8_pretrain.py][INFO] Epoch:[0/2](949100/4588595) loss:2.937 lr:0.0000100 epoch_Time:23182.0min: [2024-01-06 22:25:12,112][model8_pretrain.py][INFO] Epoch:[0/2](949100/4588595) loss:2.977 lr:0.0000100 epoch_Time:23182.0min: [2024-01-06 22:25:12,112][model8_pretrain.py][INFO] Epoch:[0/2](949100/4588595) loss:2.567 lr:0.0000100 epoch_Time:23182.0min: [2024-01-06 22:25:12,112][model8_pretrain.py][INFO] Epoch:[0/2](949100/4588595) loss:2.929 lr:0.0000100 epoch_Time:23182.0min: [2024-01-06 22:25:12,112][model8_pretrain.py][INFO] Epoch:[0/2](949100/4588595) loss:3.052 lr:0.0000100 epoch_Time:23182.0min: [2024-01-06 22:25:49,056][model8_pretrain.py][INFO] Epoch:[0/2](949200/4588595) loss:2.671 lr:0.0000100 epoch_Time:23181.0min: [2024-01-06 22:25:49,057][model8_pretrain.py][INFO] Epoch:[0/2](949200/4588595) loss:3.125 lr:0.0000100 epoch_Time:23181.0min: [2024-01-06 22:25:49,057][model8_pretrain.py][INFO] Epoch:[0/2](949200/4588595) loss:2.740 lr:0.0000100 epoch_Time:23181.0min: [2024-01-06 22:25:49,057][model8_pretrain.py][INFO] 
Epoch:[0/2](949200/4588595) loss:2.151 lr:0.0000100 epoch_Time:23181.0min: [2024-01-06 22:25:49,057][model8_pretrain.py][INFO] Epoch:[0/2](949200/4588595) loss:2.923 lr:0.0000100 epoch_Time:23181.0min: [2024-01-06 22:25:49,057][model8_pretrain.py][INFO] Epoch:[0/2](949200/4588595) loss:3.081 lr:0.0000100 epoch_Time:23181.0min: [2024-01-06 22:25:49,057][model8_pretrain.py][INFO] Epoch:[0/2](949200/4588595) loss:2.744 lr:0.0000100 epoch_Time:23181.0min: [2024-01-06 22:25:49,057][model8_pretrain.py][INFO] Epoch:[0/2](949200/4588595) loss:2.840 lr:0.0000100 epoch_Time:23181.0min: [2024-01-06 22:26:26,014][model8_pretrain.py][INFO] Epoch:[0/2](949300/4588595) loss:2.848 lr:0.0000100 epoch_Time:23181.0min: [2024-01-06 22:26:26,014][model8_pretrain.py][INFO] Epoch:[0/2](949300/4588595) loss:2.626 lr:0.0000100 epoch_Time:23181.0min: [2024-01-06 22:26:26,014][model8_pretrain.py][INFO] Epoch:[0/2](949300/4588595) loss:3.034 lr:0.0000100 epoch_Time:23181.0min: [2024-01-06 22:26:26,014][model8_pretrain.py][INFO] Epoch:[0/2](949300/4588595) loss:2.620 lr:0.0000100 epoch_Time:23181.0min: [2024-01-06 22:26:26,014][model8_pretrain.py][INFO] Epoch:[0/2](949300/4588595) loss:2.755 lr:0.0000100 epoch_Time:23181.0min: [2024-01-06 22:26:26,014][model8_pretrain.py][INFO] Epoch:[0/2](949300/4588595) loss:2.174 lr:0.0000100 epoch_Time:23181.0min: [2024-01-06 22:26:26,014][model8_pretrain.py][INFO] Epoch:[0/2](949300/4588595) loss:2.518 lr:0.0000100 epoch_Time:23181.0min: [2024-01-06 22:26:26,014][model8_pretrain.py][INFO] Epoch:[0/2](949300/4588595) loss:2.297 lr:0.0000100 epoch_Time:23181.0min: [2024-01-06 22:27:02,980][model8_pretrain.py][INFO] Epoch:[0/2](949400/4588595) loss:3.260 lr:0.0000100 epoch_Time:23180.0min: [2024-01-06 22:27:02,980][model8_pretrain.py][INFO] Epoch:[0/2](949400/4588595) loss:2.957 lr:0.0000100 epoch_Time:23180.0min: [2024-01-06 22:27:02,980][model8_pretrain.py][INFO] Epoch:[0/2](949400/4588595) loss:2.998 lr:0.0000100 epoch_Time:23180.0min: [2024-01-06 
22:27:02,980][model8_pretrain.py][INFO] Epoch:[0/2](949400/4588595) loss:2.324 lr:0.0000100 epoch_Time:23180.0min: [2024-01-06 22:27:02,980][model8_pretrain.py][INFO] Epoch:[0/2](949400/4588595) loss:3.665 lr:0.0000100 epoch_Time:23180.0min: [2024-01-06 22:27:02,980][model8_pretrain.py][INFO] Epoch:[0/2](949400/4588595) loss:1.982 lr:0.0000100 epoch_Time:23180.0min: [2024-01-06 22:27:02,980][model8_pretrain.py][INFO] Epoch:[0/2](949400/4588595) loss:2.778 lr:0.0000100 epoch_Time:23180.0min: [2024-01-06 22:27:02,980][model8_pretrain.py][INFO] Epoch:[0/2](949400/4588595) loss:2.716 lr:0.0000100 epoch_Time:23180.0min: [2024-01-06 22:27:39,944][model8_pretrain.py][INFO] Epoch:[0/2](949500/4588595) loss:3.265 lr:0.0000100 epoch_Time:23180.0min: [2024-01-06 22:27:39,944][model8_pretrain.py][INFO] Epoch:[0/2](949500/4588595) loss:2.908 lr:0.0000100 epoch_Time:23180.0min: [2024-01-06 22:27:39,944][model8_pretrain.py][INFO] Epoch:[0/2](949500/4588595) loss:2.531 lr:0.0000100 epoch_Time:23180.0min: [2024-01-06 22:27:39,944][model8_pretrain.py][INFO] Epoch:[0/2](949500/4588595) loss:3.053 lr:0.0000100 epoch_Time:23180.0min: [2024-01-06 22:27:39,944][model8_pretrain.py][INFO] Epoch:[0/2](949500/4588595) loss:3.132 lr:0.0000100 epoch_Time:23180.0min: [2024-01-06 22:27:39,944][model8_pretrain.py][INFO] Epoch:[0/2](949500/4588595) loss:1.854 lr:0.0000100 epoch_Time:23180.0min: [2024-01-06 22:27:39,945][model8_pretrain.py][INFO] Epoch:[0/2](949500/4588595) loss:2.151 lr:0.0000100 epoch_Time:23180.0min: [2024-01-06 22:27:39,945][model8_pretrain.py][INFO] Epoch:[0/2](949500/4588595) loss:2.803 lr:0.0000100 epoch_Time:23180.0min: [2024-01-06 22:28:16,904][model8_pretrain.py][INFO] Epoch:[0/2](949600/4588595) loss:2.608 lr:0.0000100 epoch_Time:23179.0min: [2024-01-06 22:28:16,904][model8_pretrain.py][INFO] Epoch:[0/2](949600/4588595) loss:3.044 lr:0.0000100 epoch_Time:23179.0min: [2024-01-06 22:28:16,904][model8_pretrain.py][INFO] Epoch:[0/2](949600/4588595) loss:2.643 lr:0.0000100 
epoch_Time:23179.0min: [2024-01-06 22:28:16,904][model8_pretrain.py][INFO] Epoch:[0/2](949600/4588595) loss:3.385 lr:0.0000100 epoch_Time:23179.0min: [2024-01-06 22:28:16,904][model8_pretrain.py][INFO] Epoch:[0/2](949600/4588595) loss:2.362 lr:0.0000100 epoch_Time:23179.0min: [2024-01-06 22:28:16,904][model8_pretrain.py][INFO] Epoch:[0/2](949600/4588595) loss:2.779 lr:0.0000100 epoch_Time:23179.0min: [2024-01-06 22:28:16,904][model8_pretrain.py][INFO] Epoch:[0/2](949600/4588595) loss:3.378 lr:0.0000100 epoch_Time:23179.0min: [2024-01-06 22:28:16,904][model8_pretrain.py][INFO] Epoch:[0/2](949600/4588595) loss:2.952 lr:0.0000100 epoch_Time:23179.0min: [2024-01-06 22:28:53,862][model8_pretrain.py][INFO] Epoch:[0/2](949700/4588595) loss:2.001 lr:0.0000100 epoch_Time:23177.0min: [2024-01-06 22:28:53,862][model8_pretrain.py][INFO] Epoch:[0/2](949700/4588595) loss:2.763 lr:0.0000100 epoch_Time:23177.0min: [2024-01-06 22:28:53,862][model8_pretrain.py][INFO] Epoch:[0/2](949700/4588595) loss:2.676 lr:0.0000100 epoch_Time:23177.0min: [2024-01-06 22:28:53,862][model8_pretrain.py][INFO] Epoch:[0/2](949700/4588595) loss:2.748 lr:0.0000100 epoch_Time:23177.0min: [2024-01-06 22:28:53,862][model8_pretrain.py][INFO] Epoch:[0/2](949700/4588595) loss:2.814 lr:0.0000100 epoch_Time:23177.0min: [2024-01-06 22:28:53,862][model8_pretrain.py][INFO] Epoch:[0/2](949700/4588595) loss:3.350 lr:0.0000100 epoch_Time:23177.0min: [2024-01-06 22:28:53,862][model8_pretrain.py][INFO] Epoch:[0/2](949700/4588595) loss:2.682 lr:0.0000100 epoch_Time:23177.0min: [2024-01-06 22:28:53,862][model8_pretrain.py][INFO] Epoch:[0/2](949700/4588595) loss:3.005 lr:0.0000100 epoch_Time:23177.0min: [2024-01-06 22:29:30,830][model8_pretrain.py][INFO] Epoch:[0/2](949800/4588595) loss:2.810 lr:0.0000100 epoch_Time:23177.0min: [2024-01-06 22:29:30,830][model8_pretrain.py][INFO] Epoch:[0/2](949800/4588595) loss:2.943 lr:0.0000100 epoch_Time:23177.0min: [2024-01-06 22:29:30,830][model8_pretrain.py][INFO] 
Epoch:[0/2](949800/4588595) loss:2.302 lr:0.0000100 epoch_Time:23177.0min: [2024-01-06 22:29:30,830][model8_pretrain.py][INFO] Epoch:[0/2](949800/4588595) loss:2.467 lr:0.0000100 epoch_Time:23177.0min: [2024-01-06 22:29:30,830][model8_pretrain.py][INFO] Epoch:[0/2](949800/4588595) loss:3.188 lr:0.0000100 epoch_Time:23177.0min: [2024-01-06 22:29:30,830][model8_pretrain.py][INFO] Epoch:[0/2](949800/4588595) loss:3.033 lr:0.0000100 epoch_Time:23177.0min: [2024-01-06 22:29:30,830][model8_pretrain.py][INFO] Epoch:[0/2](949800/4588595) loss:2.966 lr:0.0000100 epoch_Time:23177.0min: [2024-01-06 22:29:30,830][model8_pretrain.py][INFO] Epoch:[0/2](949800/4588595) loss:3.181 lr:0.0000100 epoch_Time:23177.0min: [2024-01-06 22:30:21,595][model8_pretrain.py][INFO] Epoch:[0/2](949900/4588595) loss:2.865 lr:0.0000100 epoch_Time:23177.0min: [2024-01-06 22:30:21,595][model8_pretrain.py][INFO] Epoch:[0/2](949900/4588595) loss:3.116 lr:0.0000100 epoch_Time:23177.0min: [2024-01-06 22:30:21,595][model8_pretrain.py][INFO] Epoch:[0/2](949900/4588595) loss:2.580 lr:0.0000100 epoch_Time:23177.0min: [2024-01-06 22:30:21,595][model8_pretrain.py][INFO] Epoch:[0/2](949900/4588595) loss:2.716 lr:0.0000100 epoch_Time:23177.0min: [2024-01-06 22:30:21,595][model8_pretrain.py][INFO] Epoch:[0/2](949900/4588595) loss:2.597 lr:0.0000100 epoch_Time:23177.0min: [2024-01-06 22:30:21,595][model8_pretrain.py][INFO] Epoch:[0/2](949900/4588595) loss:3.032 lr:0.0000100 epoch_Time:23177.0min: [2024-01-06 22:30:21,595][model8_pretrain.py][INFO] Epoch:[0/2](949900/4588595) loss:3.128 lr:0.0000100 epoch_Time:23177.0min: [2024-01-06 22:30:21,595][model8_pretrain.py][INFO] Epoch:[0/2](949900/4588595) loss:2.401 lr:0.0000100 epoch_Time:23177.0min: [2024-01-06 22:30:58,525][model8_pretrain.py][INFO] Epoch:[0/2](950000/4588595) loss:3.071 lr:0.0000100 epoch_Time:23176.0min: [2024-01-06 22:30:58,525][model8_pretrain.py][INFO] Epoch:[0/2](950000/4588595) loss:2.423 lr:0.0000100 epoch_Time:23176.0min: [2024-01-06 
22:30:58,525][model8_pretrain.py][INFO] Epoch:[0/2](950000/4588595) loss:2.771 lr:0.0000100 epoch_Time:23176.0min: [2024-01-06 22:30:58,525][model8_pretrain.py][INFO] Epoch:[0/2](950000/4588595) loss:1.926 lr:0.0000100 epoch_Time:23176.0min: [2024-01-06 22:30:58,525][model8_pretrain.py][INFO] Epoch:[0/2](950000/4588595) loss:2.813 lr:0.0000100 epoch_Time:23176.0min: [2024-01-06 22:30:58,525][model8_pretrain.py][INFO] Epoch:[0/2](950000/4588595) loss:2.845 lr:0.0000100 epoch_Time:23176.0min: [2024-01-06 22:30:58,525][model8_pretrain.py][INFO] Epoch:[0/2](950000/4588595) loss:2.617 lr:0.0000100 epoch_Time:23176.0min: [2024-01-06 22:30:58,525][model8_pretrain.py][INFO] Epoch:[0/2](950000/4588595) loss:2.215 lr:0.0000100 epoch_Time:23176.0min: [2024-01-06 22:31:35,470][model8_pretrain.py][INFO] Epoch:[0/2](950100/4588595) loss:2.203 lr:0.0000100 epoch_Time:23176.0min: [2024-01-06 22:31:35,470][model8_pretrain.py][INFO] Epoch:[0/2](950100/4588595) loss:2.051 lr:0.0000100 epoch_Time:23176.0min: [2024-01-06 22:31:35,470][model8_pretrain.py][INFO] Epoch:[0/2](950100/4588595) loss:2.902 lr:0.0000100 epoch_Time:23176.0min: [2024-01-06 22:31:35,470][model8_pretrain.py][INFO] Epoch:[0/2](950100/4588595) loss:2.635 lr:0.0000100 epoch_Time:23176.0min: [2024-01-06 22:31:35,470][model8_pretrain.py][INFO] Epoch:[0/2](950100/4588595) loss:2.685 lr:0.0000100 epoch_Time:23176.0min: [2024-01-06 22:31:35,470][model8_pretrain.py][INFO] Epoch:[0/2](950100/4588595) loss:2.210 lr:0.0000100 epoch_Time:23176.0min: [2024-01-06 22:31:35,470][model8_pretrain.py][INFO] Epoch:[0/2](950100/4588595) loss:2.943 lr:0.0000100 epoch_Time:23176.0min: [2024-01-06 22:31:35,470][model8_pretrain.py][INFO] Epoch:[0/2](950100/4588595) loss:2.573 lr:0.0000100 epoch_Time:23176.0min: [2024-01-06 22:32:12,410][model8_pretrain.py][INFO] Epoch:[0/2](950200/4588595) loss:3.136 lr:0.0000100 epoch_Time:23175.0min: [2024-01-06 22:32:12,410][model8_pretrain.py][INFO] Epoch:[0/2](950200/4588595) loss:3.019 lr:0.0000100 
epoch_Time:23175.0min: [2024-01-06 22:32:12,410][model8_pretrain.py][INFO] Epoch:[0/2](950200/4588595) loss:3.196 lr:0.0000100 epoch_Time:23175.0min: [2024-01-06 22:32:12,410][model8_pretrain.py][INFO] Epoch:[0/2](950200/4588595) loss:2.504 lr:0.0000100 epoch_Time:23175.0min: [2024-01-06 22:32:12,410][model8_pretrain.py][INFO] Epoch:[0/2](950200/4588595) loss:3.227 lr:0.0000100 epoch_Time:23175.0min: [2024-01-06 22:32:12,410][model8_pretrain.py][INFO] Epoch:[0/2](950200/4588595) loss:2.875 lr:0.0000100 epoch_Time:23175.0min: [2024-01-06 22:32:12,410][model8_pretrain.py][INFO] Epoch:[0/2](950200/4588595) loss:2.317 lr:0.0000100 epoch_Time:23175.0min: [2024-01-06 22:32:12,410][model8_pretrain.py][INFO] Epoch:[0/2](950200/4588595) loss:2.455 lr:0.0000100 epoch_Time:23175.0min: [2024-01-06 22:32:49,355][model8_pretrain.py][INFO] Epoch:[0/2](950300/4588595) loss:2.272 lr:0.0000100 epoch_Time:23174.0min: [2024-01-06 22:32:49,355][model8_pretrain.py][INFO] Epoch:[0/2](950300/4588595) loss:2.990 lr:0.0000100 epoch_Time:23174.0min: [2024-01-06 22:32:49,355][model8_pretrain.py][INFO] Epoch:[0/2](950300/4588595) loss:2.941 lr:0.0000100 epoch_Time:23174.0min: [2024-01-06 22:32:49,355][model8_pretrain.py][INFO] Epoch:[0/2](950300/4588595) loss:2.897 lr:0.0000100 epoch_Time:23174.0min: [2024-01-06 22:32:49,355][model8_pretrain.py][INFO] Epoch:[0/2](950300/4588595) loss:2.695 lr:0.0000100 epoch_Time:23174.0min: [2024-01-06 22:32:49,355][model8_pretrain.py][INFO] Epoch:[0/2](950300/4588595) loss:3.117 lr:0.0000100 epoch_Time:23174.0min: [2024-01-06 22:32:49,355][model8_pretrain.py][INFO] Epoch:[0/2](950300/4588595) loss:2.888 lr:0.0000100 epoch_Time:23174.0min: [2024-01-06 22:32:49,356][model8_pretrain.py][INFO] Epoch:[0/2](950300/4588595) loss:2.741 lr:0.0000100 epoch_Time:23174.0min: [2024-01-06 22:33:26,300][model8_pretrain.py][INFO] Epoch:[0/2](950400/4588595) loss:2.839 lr:0.0000100 epoch_Time:23174.0min: [2024-01-06 22:33:26,300][model8_pretrain.py][INFO] 
Epoch:[0/2](950400/4588595) loss:3.179 lr:0.0000100 epoch_Time:23174.0min: [2024-01-06 22:33:26,300][model8_pretrain.py][INFO] Epoch:[0/2](950400/4588595) loss:2.915 lr:0.0000100 epoch_Time:23174.0min: [2024-01-06 22:33:26,300][model8_pretrain.py][INFO] Epoch:[0/2](950400/4588595) loss:2.732 lr:0.0000100 epoch_Time:23174.0min: [2024-01-06 22:33:26,300][model8_pretrain.py][INFO] Epoch:[0/2](950400/4588595) loss:2.474 lr:0.0000100 epoch_Time:23174.0min: [2024-01-06 22:33:26,300][model8_pretrain.py][INFO] Epoch:[0/2](950400/4588595) loss:3.240 lr:0.0000100 epoch_Time:23174.0min: [2024-01-06 22:33:26,300][model8_pretrain.py][INFO] Epoch:[0/2](950400/4588595) loss:3.145 lr:0.0000100 epoch_Time:23174.0min: [2024-01-06 22:33:26,300][model8_pretrain.py][INFO] Epoch:[0/2](950400/4588595) loss:2.839 lr:0.0000100 epoch_Time:23174.0min: [2024-01-06 22:34:03,252][model8_pretrain.py][INFO] Epoch:[0/2](950500/4588595) loss:2.872 lr:0.0000100 epoch_Time:23173.0min: [2024-01-06 22:34:03,252][model8_pretrain.py][INFO] Epoch:[0/2](950500/4588595) loss:2.752 lr:0.0000100 epoch_Time:23173.0min: [2024-01-06 22:34:03,252][model8_pretrain.py][INFO] Epoch:[0/2](950500/4588595) loss:2.289 lr:0.0000100 epoch_Time:23173.0min: [2024-01-06 22:34:03,252][model8_pretrain.py][INFO] Epoch:[0/2](950500/4588595) loss:2.834 lr:0.0000100 epoch_Time:23173.0min: [2024-01-06 22:34:03,252][model8_pretrain.py][INFO] Epoch:[0/2](950500/4588595) loss:2.629 lr:0.0000100 epoch_Time:23173.0min: [2024-01-06 22:34:03,252][model8_pretrain.py][INFO] Epoch:[0/2](950500/4588595) loss:2.156 lr:0.0000100 epoch_Time:23173.0min: [2024-01-06 22:34:03,253][model8_pretrain.py][INFO] Epoch:[0/2](950500/4588595) loss:2.529 lr:0.0000100 epoch_Time:23173.0min: [2024-01-06 22:34:03,253][model8_pretrain.py][INFO] Epoch:[0/2](950500/4588595) loss:2.562 lr:0.0000100 epoch_Time:23173.0min: [2024-01-06 22:34:40,195][model8_pretrain.py][INFO] Epoch:[0/2](950600/4588595) loss:3.072 lr:0.0000100 epoch_Time:23173.0min: [2024-01-06 
22:34:40,195][model8_pretrain.py][INFO] Epoch:[0/2](950600/4588595) loss:2.974 lr:0.0000100 epoch_Time:23173.0min: [2024-01-06 22:34:40,195][model8_pretrain.py][INFO] Epoch:[0/2](950600/4588595) loss:3.218 lr:0.0000100 epoch_Time:23173.0min: [2024-01-06 22:34:40,195][model8_pretrain.py][INFO] Epoch:[0/2](950600/4588595) loss:2.752 lr:0.0000100 epoch_Time:23173.0min: [2024-01-06 22:34:40,195][model8_pretrain.py][INFO] Epoch:[0/2](950600/4588595) loss:2.666 lr:0.0000100 epoch_Time:23173.0min: [2024-01-06 22:34:40,195][model8_pretrain.py][INFO] Epoch:[0/2](950600/4588595) loss:2.369 lr:0.0000100 epoch_Time:23173.0min: [2024-01-06 22:34:40,195][model8_pretrain.py][INFO] Epoch:[0/2](950600/4588595) loss:2.952 lr:0.0000100 epoch_Time:23173.0min: [2024-01-06 22:34:40,195][model8_pretrain.py][INFO] Epoch:[0/2](950600/4588595) loss:3.130 lr:0.0000100 epoch_Time:23173.0min: [2024-01-06 22:35:30,914][model8_pretrain.py][INFO] Epoch:[0/2](950700/4588595) loss:2.773 lr:0.0000100 epoch_Time:23173.0min: [2024-01-06 22:35:30,914][model8_pretrain.py][INFO] Epoch:[0/2](950700/4588595) loss:2.975 lr:0.0000100 epoch_Time:23173.0min: [2024-01-06 22:35:30,914][model8_pretrain.py][INFO] Epoch:[0/2](950700/4588595) loss:2.958 lr:0.0000100 epoch_Time:23173.0min: [2024-01-06 22:35:30,914][model8_pretrain.py][INFO] Epoch:[0/2](950700/4588595) loss:2.224 lr:0.0000100 epoch_Time:23173.0min: [2024-01-06 22:35:30,914][model8_pretrain.py][INFO] Epoch:[0/2](950700/4588595) loss:2.959 lr:0.0000100 epoch_Time:23173.0min: [2024-01-06 22:35:30,914][model8_pretrain.py][INFO] Epoch:[0/2](950700/4588595) loss:3.022 lr:0.0000100 epoch_Time:23173.0min: [2024-01-06 22:35:30,914][model8_pretrain.py][INFO] Epoch:[0/2](950700/4588595) loss:3.308 lr:0.0000100 epoch_Time:23173.0min: [2024-01-06 22:35:30,914][model8_pretrain.py][INFO] Epoch:[0/2](950700/4588595) loss:2.737 lr:0.0000100 epoch_Time:23173.0min: [2024-01-06 22:36:07,818][model8_pretrain.py][INFO] Epoch:[0/2](950800/4588595) loss:3.278 lr:0.0000100 
epoch_Time:23172.0min: [2024-01-06 22:36:07,818][model8_pretrain.py][INFO] Epoch:[0/2](950800/4588595) loss:3.161 lr:0.0000100 epoch_Time:23172.0min: [2024-01-06 22:36:07,818][model8_pretrain.py][INFO] Epoch:[0/2](950800/4588595) loss:3.227 lr:0.0000100 epoch_Time:23172.0min: [2024-01-06 22:36:07,818][model8_pretrain.py][INFO] Epoch:[0/2](950800/4588595) loss:3.023 lr:0.0000100 epoch_Time:23172.0min: [2024-01-06 22:36:07,818][model8_pretrain.py][INFO] Epoch:[0/2](950800/4588595) loss:3.038 lr:0.0000100 epoch_Time:23172.0min: [2024-01-06 22:36:07,818][model8_pretrain.py][INFO] Epoch:[0/2](950800/4588595) loss:3.070 lr:0.0000100 epoch_Time:23172.0min: [2024-01-06 22:36:07,818][model8_pretrain.py][INFO] Epoch:[0/2](950800/4588595) loss:2.815 lr:0.0000100 epoch_Time:23172.0min: [2024-01-06 22:36:07,818][model8_pretrain.py][INFO] Epoch:[0/2](950800/4588595) loss:2.865 lr:0.0000100 epoch_Time:23172.0min: [2024-01-06 22:36:44,781][model8_pretrain.py][INFO] Epoch:[0/2](950900/4588595) loss:2.334 lr:0.0000100 epoch_Time:23171.0min: [2024-01-06 22:36:44,781][model8_pretrain.py][INFO] Epoch:[0/2](950900/4588595) loss:2.421 lr:0.0000100 epoch_Time:23171.0min: [2024-01-06 22:36:44,781][model8_pretrain.py][INFO] Epoch:[0/2](950900/4588595) loss:2.676 lr:0.0000100 epoch_Time:23171.0min: [2024-01-06 22:36:44,781][model8_pretrain.py][INFO] Epoch:[0/2](950900/4588595) loss:2.992 lr:0.0000100 epoch_Time:23171.0min: [2024-01-06 22:36:44,781][model8_pretrain.py][INFO] Epoch:[0/2](950900/4588595) loss:3.167 lr:0.0000100 epoch_Time:23171.0min: [2024-01-06 22:36:44,781][model8_pretrain.py][INFO] Epoch:[0/2](950900/4588595) loss:3.099 lr:0.0000100 epoch_Time:23171.0min: [2024-01-06 22:36:44,781][model8_pretrain.py][INFO] Epoch:[0/2](950900/4588595) loss:2.167 lr:0.0000100 epoch_Time:23171.0min: [2024-01-06 22:36:44,781][model8_pretrain.py][INFO] Epoch:[0/2](950900/4588595) loss:2.711 lr:0.0000100 epoch_Time:23171.0min: [2024-01-06 22:37:21,741][model8_pretrain.py][INFO] 
Epoch:[0/2](951000/4588595) loss:3.177 lr:0.0000100 epoch_Time:23170.0min: [2024-01-06 22:37:21,741][model8_pretrain.py][INFO] Epoch:[0/2](951000/4588595) loss:2.969 lr:0.0000100 epoch_Time:23170.0min: [2024-01-06 22:37:21,741][model8_pretrain.py][INFO] Epoch:[0/2](951000/4588595) loss:3.144 lr:0.0000100 epoch_Time:23170.0min: [2024-01-06 22:37:21,741][model8_pretrain.py][INFO] Epoch:[0/2](951000/4588595) loss:2.734 lr:0.0000100 epoch_Time:23170.0min: [2024-01-06 22:37:21,741][model8_pretrain.py][INFO] Epoch:[0/2](951000/4588595) loss:2.671 lr:0.0000100 epoch_Time:23170.0min: [2024-01-06 22:37:21,741][model8_pretrain.py][INFO] Epoch:[0/2](951000/4588595) loss:2.640 lr:0.0000100 epoch_Time:23170.0min: [2024-01-06 22:37:21,741][model8_pretrain.py][INFO] Epoch:[0/2](951000/4588595) loss:2.942 lr:0.0000100 epoch_Time:23170.0min: [2024-01-06 22:37:21,742][model8_pretrain.py][INFO] Epoch:[0/2](951000/4588595) loss:2.158 lr:0.0000100 epoch_Time:23170.0min: [2024-01-06 22:37:58,695][model8_pretrain.py][INFO] Epoch:[0/2](951100/4588595) loss:2.769 lr:0.0000100 epoch_Time:23169.0min: [2024-01-06 22:37:58,695][model8_pretrain.py][INFO] Epoch:[0/2](951100/4588595) loss:2.939 lr:0.0000100 epoch_Time:23169.0min: [2024-01-06 22:37:58,695][model8_pretrain.py][INFO] Epoch:[0/2](951100/4588595) loss:2.687 lr:0.0000100 epoch_Time:23169.0min: [2024-01-06 22:37:58,695][model8_pretrain.py][INFO] Epoch:[0/2](951100/4588595) loss:2.767 lr:0.0000100 epoch_Time:23169.0min: [2024-01-06 22:37:58,695][model8_pretrain.py][INFO] Epoch:[0/2](951100/4588595) loss:2.480 lr:0.0000100 epoch_Time:23169.0min: [2024-01-06 22:37:58,695][model8_pretrain.py][INFO] Epoch:[0/2](951100/4588595) loss:2.176 lr:0.0000100 epoch_Time:23169.0min: [2024-01-06 22:37:58,695][model8_pretrain.py][INFO] Epoch:[0/2](951100/4588595) loss:2.377 lr:0.0000100 epoch_Time:23169.0min: [2024-01-06 22:37:58,695][model8_pretrain.py][INFO] Epoch:[0/2](951100/4588595) loss:2.908 lr:0.0000100 epoch_Time:23169.0min: [2024-01-06 
22:38:35,651][model8_pretrain.py][INFO] Epoch:[0/2](951200/4588595) loss:2.908 lr:0.0000100 epoch_Time:23169.0min: [2024-01-06 22:38:35,651][model8_pretrain.py][INFO] Epoch:[0/2](951200/4588595) loss:2.586 lr:0.0000100 epoch_Time:23169.0min: [2024-01-06 22:38:35,651][model8_pretrain.py][INFO] Epoch:[0/2](951200/4588595) loss:2.675 lr:0.0000100 epoch_Time:23169.0min: [2024-01-06 22:38:35,651][model8_pretrain.py][INFO] Epoch:[0/2](951200/4588595) loss:2.683 lr:0.0000100 epoch_Time:23169.0min: [2024-01-06 22:38:35,652][model8_pretrain.py][INFO] Epoch:[0/2](951200/4588595) loss:2.336 lr:0.0000100 epoch_Time:23169.0min: [2024-01-06 22:38:35,652][model8_pretrain.py][INFO] Epoch:[0/2](951200/4588595) loss:3.127 lr:0.0000100 epoch_Time:23169.0min: [2024-01-06 22:38:35,653][model8_pretrain.py][INFO] Epoch:[0/2](951200/4588595) loss:2.752 lr:0.0000100 epoch_Time:23169.0min: [2024-01-06 22:38:35,653][model8_pretrain.py][INFO] Epoch:[0/2](951200/4588595) loss:2.565 lr:0.0000100 epoch_Time:23169.0min: [2024-01-06 22:39:12,622][model8_pretrain.py][INFO] Epoch:[0/2](951300/4588595) loss:2.313 lr:0.0000100 epoch_Time:23168.0min: [2024-01-06 22:39:12,622][model8_pretrain.py][INFO] Epoch:[0/2](951300/4588595) loss:2.720 lr:0.0000100 epoch_Time:23168.0min: [2024-01-06 22:39:12,622][model8_pretrain.py][INFO] Epoch:[0/2](951300/4588595) loss:2.286 lr:0.0000100 epoch_Time:23168.0min: [2024-01-06 22:39:12,622][model8_pretrain.py][INFO] Epoch:[0/2](951300/4588595) loss:2.998 lr:0.0000100 epoch_Time:23168.0min: [2024-01-06 22:39:12,622][model8_pretrain.py][INFO] Epoch:[0/2](951300/4588595) loss:3.011 lr:0.0000100 epoch_Time:23168.0min: [2024-01-06 22:39:12,622][model8_pretrain.py][INFO] Epoch:[0/2](951300/4588595) loss:2.889 lr:0.0000100 epoch_Time:23168.0min: [2024-01-06 22:39:12,622][model8_pretrain.py][INFO] Epoch:[0/2](951300/4588595) loss:2.500 lr:0.0000100 epoch_Time:23168.0min: [2024-01-06 22:39:12,622][model8_pretrain.py][INFO] Epoch:[0/2](951300/4588595) loss:2.885 lr:0.0000100 
epoch_Time:23168.0min: [2024-01-06 22:39:49,586][model8_pretrain.py][INFO] Epoch:[0/2](951400/4588595) loss:2.208 lr:0.0000100 epoch_Time:23167.0min: [2024-01-06 22:39:49,586][model8_pretrain.py][INFO] Epoch:[0/2](951400/4588595) loss:3.134 lr:0.0000100 epoch_Time:23167.0min: [2024-01-06 22:39:49,586][model8_pretrain.py][INFO] Epoch:[0/2](951400/4588595) loss:3.176 lr:0.0000100 epoch_Time:23167.0min: [2024-01-06 22:39:49,586][model8_pretrain.py][INFO] Epoch:[0/2](951400/4588595) loss:2.453 lr:0.0000100 epoch_Time:23167.0min: [2024-01-06 22:39:49,586][model8_pretrain.py][INFO] Epoch:[0/2](951400/4588595) loss:2.565 lr:0.0000100 epoch_Time:23167.0min: [2024-01-06 22:39:49,586][model8_pretrain.py][INFO] Epoch:[0/2](951400/4588595) loss:2.999 lr:0.0000100 epoch_Time:23167.0min: [2024-01-06 22:39:49,586][model8_pretrain.py][INFO] Epoch:[0/2](951400/4588595) loss:2.660 lr:0.0000100 epoch_Time:23167.0min: [2024-01-06 22:39:49,586][model8_pretrain.py][INFO] Epoch:[0/2](951400/4588595) loss:2.458 lr:0.0000100 epoch_Time:23167.0min: [2024-01-06 22:40:40,347][model8_pretrain.py][INFO] Epoch:[0/2](951500/4588595) loss:2.757 lr:0.0000100 epoch_Time:23168.0min: [2024-01-06 22:40:40,347][model8_pretrain.py][INFO] Epoch:[0/2](951500/4588595) loss:2.739 lr:0.0000100 epoch_Time:23168.0min: [2024-01-06 22:40:40,348][model8_pretrain.py][INFO] Epoch:[0/2](951500/4588595) loss:2.689 lr:0.0000100 epoch_Time:23168.0min: [2024-01-06 22:40:40,347][model8_pretrain.py][INFO] Epoch:[0/2](951500/4588595) loss:2.978 lr:0.0000100 epoch_Time:23168.0min: [2024-01-06 22:40:40,348][model8_pretrain.py][INFO] Epoch:[0/2](951500/4588595) loss:2.596 lr:0.0000100 epoch_Time:23168.0min: [2024-01-06 22:40:40,348][model8_pretrain.py][INFO] Epoch:[0/2](951500/4588595) loss:3.265 lr:0.0000100 epoch_Time:23168.0min: [2024-01-06 22:40:40,348][model8_pretrain.py][INFO] Epoch:[0/2](951500/4588595) loss:2.607 lr:0.0000100 epoch_Time:23168.0min: [2024-01-06 22:40:40,348][model8_pretrain.py][INFO] 
Epoch:[0/2](951500/4588595) loss:2.395 lr:0.0000100 epoch_Time:23168.0min: [2024-01-06 22:41:17,264][model8_pretrain.py][INFO] Epoch:[0/2](951600/4588595) loss:2.932 lr:0.0000100 epoch_Time:23167.0min: [2024-01-06 22:41:17,264][model8_pretrain.py][INFO] Epoch:[0/2](951600/4588595) loss:2.819 lr:0.0000100 epoch_Time:23167.0min: [2024-01-06 22:41:17,264][model8_pretrain.py][INFO] Epoch:[0/2](951600/4588595) loss:3.277 lr:0.0000100 epoch_Time:23167.0min: [2024-01-06 22:41:17,264][model8_pretrain.py][INFO] Epoch:[0/2](951600/4588595) loss:2.941 lr:0.0000100 epoch_Time:23167.0min: [2024-01-06 22:41:17,264][model8_pretrain.py][INFO] Epoch:[0/2](951600/4588595) loss:2.369 lr:0.0000100 epoch_Time:23167.0min: [2024-01-06 22:41:17,264][model8_pretrain.py][INFO] Epoch:[0/2](951600/4588595) loss:3.277 lr:0.0000100 epoch_Time:23167.0min: [2024-01-06 22:41:17,264][model8_pretrain.py][INFO] Epoch:[0/2](951600/4588595) loss:3.228 lr:0.0000100 epoch_Time:23167.0min: [2024-01-06 22:41:17,264][model8_pretrain.py][INFO] Epoch:[0/2](951600/4588595) loss:2.778 lr:0.0000100 epoch_Time:23167.0min: [2024-01-06 22:41:54,192][model8_pretrain.py][INFO] Epoch:[0/2](951700/4588595) loss:2.983 lr:0.0000100 epoch_Time:23166.0min: [2024-01-06 22:41:54,192][model8_pretrain.py][INFO] Epoch:[0/2](951700/4588595) loss:2.715 lr:0.0000100 epoch_Time:23166.0min: [2024-01-06 22:41:54,192][model8_pretrain.py][INFO] Epoch:[0/2](951700/4588595) loss:2.452 lr:0.0000100 epoch_Time:23166.0min: [2024-01-06 22:41:54,192][model8_pretrain.py][INFO] Epoch:[0/2](951700/4588595) loss:2.966 lr:0.0000100 epoch_Time:23166.0min: [2024-01-06 22:41:54,192][model8_pretrain.py][INFO] Epoch:[0/2](951700/4588595) loss:2.918 lr:0.0000100 epoch_Time:23166.0min: [2024-01-06 22:41:54,192][model8_pretrain.py][INFO] Epoch:[0/2](951700/4588595) loss:2.284 lr:0.0000100 epoch_Time:23166.0min: [2024-01-06 22:41:54,192][model8_pretrain.py][INFO] Epoch:[0/2](951700/4588595) loss:2.723 lr:0.0000100 epoch_Time:23166.0min: [2024-01-06 
22:41:54,193][model8_pretrain.py][INFO] Epoch:[0/2](951700/4588595) loss:2.854 lr:0.0000100 epoch_Time:23166.0min: [2024-01-06 22:42:31,135][model8_pretrain.py][INFO] Epoch:[0/2](951800/4588595) loss:3.108 lr:0.0000100 epoch_Time:23166.0min: [2024-01-06 22:42:31,135][model8_pretrain.py][INFO] Epoch:[0/2](951800/4588595) loss:2.973 lr:0.0000100 epoch_Time:23166.0min: [2024-01-06 22:42:31,136][model8_pretrain.py][INFO] Epoch:[0/2](951800/4588595) loss:2.491 lr:0.0000100 epoch_Time:23166.0min: [2024-01-06 22:42:31,136][model8_pretrain.py][INFO] Epoch:[0/2](951800/4588595) loss:2.501 lr:0.0000100 epoch_Time:23166.0min: [2024-01-06 22:42:31,136][model8_pretrain.py][INFO] Epoch:[0/2](951800/4588595) loss:3.132 lr:0.0000100 epoch_Time:23166.0min: [2024-01-06 22:42:31,136][model8_pretrain.py][INFO] Epoch:[0/2](951800/4588595) loss:3.402 lr:0.0000100 epoch_Time:23166.0min: [2024-01-06 22:42:31,136][model8_pretrain.py][INFO] Epoch:[0/2](951800/4588595) loss:2.887 lr:0.0000100 epoch_Time:23166.0min: [2024-01-06 22:42:31,136][model8_pretrain.py][INFO] Epoch:[0/2](951800/4588595) loss:2.949 lr:0.0000100 epoch_Time:23166.0min: [2024-01-06 22:43:08,075][model8_pretrain.py][INFO] Epoch:[0/2](951900/4588595) loss:2.671 lr:0.0000100 epoch_Time:23165.0min: [2024-01-06 22:43:08,075][model8_pretrain.py][INFO] Epoch:[0/2](951900/4588595) loss:2.648 lr:0.0000100 epoch_Time:23165.0min: [2024-01-06 22:43:08,075][model8_pretrain.py][INFO] Epoch:[0/2](951900/4588595) loss:2.712 lr:0.0000100 epoch_Time:23165.0min: [2024-01-06 22:43:08,075][model8_pretrain.py][INFO] Epoch:[0/2](951900/4588595) loss:1.994 lr:0.0000100 epoch_Time:23165.0min: [2024-01-06 22:43:08,075][model8_pretrain.py][INFO] Epoch:[0/2](951900/4588595) loss:2.449 lr:0.0000100 epoch_Time:23165.0min: [2024-01-06 22:43:08,075][model8_pretrain.py][INFO] Epoch:[0/2](951900/4588595) loss:2.848 lr:0.0000100 epoch_Time:23165.0min: [2024-01-06 22:43:08,075][model8_pretrain.py][INFO] Epoch:[0/2](951900/4588595) loss:3.148 lr:0.0000100 
epoch_Time:23165.0min: [2024-01-06 22:43:08,075][model8_pretrain.py][INFO] Epoch:[0/2](951900/4588595) loss:2.615 lr:0.0000100 epoch_Time:23165.0min: [2024-01-06 22:43:45,003][model8_pretrain.py][INFO] Epoch:[0/2](952000/4588595) loss:3.256 lr:0.0000100 epoch_Time:23164.0min: [2024-01-06 22:43:45,003][model8_pretrain.py][INFO] Epoch:[0/2](952000/4588595) loss:2.112 lr:0.0000100 epoch_Time:23164.0min: [2024-01-06 22:43:45,003][model8_pretrain.py][INFO] Epoch:[0/2](952000/4588595) loss:3.315 lr:0.0000100 epoch_Time:23164.0min: [2024-01-06 22:43:45,003][model8_pretrain.py][INFO] Epoch:[0/2](952000/4588595) loss:2.455 lr:0.0000100 epoch_Time:23164.0min: [2024-01-06 22:43:45,003][model8_pretrain.py][INFO] Epoch:[0/2](952000/4588595) loss:2.806 lr:0.0000100 epoch_Time:23164.0min: [2024-01-06 22:43:45,003][model8_pretrain.py][INFO] Epoch:[0/2](952000/4588595) loss:2.742 lr:0.0000100 epoch_Time:23164.0min: [2024-01-06 22:43:45,003][model8_pretrain.py][INFO] Epoch:[0/2](952000/4588595) loss:3.084 lr:0.0000100 epoch_Time:23164.0min: [2024-01-06 22:43:45,003][model8_pretrain.py][INFO] Epoch:[0/2](952000/4588595) loss:2.059 lr:0.0000100 epoch_Time:23164.0min: [2024-01-06 22:44:21,939][model8_pretrain.py][INFO] Epoch:[0/2](952100/4588595) loss:2.553 lr:0.0000100 epoch_Time:23163.0min: [2024-01-06 22:44:21,939][model8_pretrain.py][INFO] Epoch:[0/2](952100/4588595) loss:2.314 lr:0.0000100 epoch_Time:23163.0min: [2024-01-06 22:44:21,939][model8_pretrain.py][INFO] Epoch:[0/2](952100/4588595) loss:2.893 lr:0.0000100 epoch_Time:23163.0min: [2024-01-06 22:44:21,939][model8_pretrain.py][INFO] Epoch:[0/2](952100/4588595) loss:2.703 lr:0.0000100 epoch_Time:23163.0min: [2024-01-06 22:44:21,939][model8_pretrain.py][INFO] Epoch:[0/2](952100/4588595) loss:2.711 lr:0.0000100 epoch_Time:23163.0min: [2024-01-06 22:44:21,940][model8_pretrain.py][INFO] Epoch:[0/2](952100/4588595) loss:2.452 lr:0.0000100 epoch_Time:23163.0min: [2024-01-06 22:44:21,940][model8_pretrain.py][INFO] 
Epoch:[0/2](952100/4588595) loss:3.048 lr:0.0000100 epoch_Time:23163.0min: [2024-01-06 22:44:21,940][model8_pretrain.py][INFO] Epoch:[0/2](952100/4588595) loss:3.163 lr:0.0000100 epoch_Time:23163.0min: [2024-01-06 22:44:58,862][model8_pretrain.py][INFO] Epoch:[0/2](952200/4588595) loss:2.743 lr:0.0000100 epoch_Time:23162.0min: [2024-01-06 22:44:58,862][model8_pretrain.py][INFO] Epoch:[0/2](952200/4588595) loss:2.691 lr:0.0000100 epoch_Time:23162.0min: [2024-01-06 22:44:58,862][model8_pretrain.py][INFO] Epoch:[0/2](952200/4588595) loss:2.807 lr:0.0000100 epoch_Time:23162.0min: [2024-01-06 22:44:58,862][model8_pretrain.py][INFO] Epoch:[0/2](952200/4588595) loss:2.600 lr:0.0000100 epoch_Time:23162.0min: [2024-01-06 22:44:58,862][model8_pretrain.py][INFO] Epoch:[0/2](952200/4588595) loss:3.431 lr:0.0000100 epoch_Time:23162.0min: [2024-01-06 22:44:58,862][model8_pretrain.py][INFO] Epoch:[0/2](952200/4588595) loss:2.029 lr:0.0000100 epoch_Time:23162.0min: [2024-01-06 22:44:58,862][model8_pretrain.py][INFO] Epoch:[0/2](952200/4588595) loss:2.579 lr:0.0000100 epoch_Time:23162.0min: [2024-01-06 22:44:58,863][model8_pretrain.py][INFO] Epoch:[0/2](952200/4588595) loss:2.734 lr:0.0000100 epoch_Time:23162.0min: [2024-01-06 22:45:49,428][model8_pretrain.py][INFO] Epoch:[0/2](952300/4588595) loss:2.724 lr:0.0000100 epoch_Time:23162.0min: [2024-01-06 22:45:49,428][model8_pretrain.py][INFO] Epoch:[0/2](952300/4588595) loss:2.185 lr:0.0000100 epoch_Time:23162.0min: [2024-01-06 22:45:49,428][model8_pretrain.py][INFO] Epoch:[0/2](952300/4588595) loss:3.200 lr:0.0000100 epoch_Time:23162.0min: [2024-01-06 22:45:49,428][model8_pretrain.py][INFO] Epoch:[0/2](952300/4588595) loss:2.285 lr:0.0000100 epoch_Time:23162.0min: [2024-01-06 22:45:49,428][model8_pretrain.py][INFO] Epoch:[0/2](952300/4588595) loss:3.327 lr:0.0000100 epoch_Time:23162.0min: [2024-01-06 22:45:49,428][model8_pretrain.py][INFO] Epoch:[0/2](952300/4588595) loss:3.513 lr:0.0000100 epoch_Time:23162.0min: [2024-01-06 
22:45:49,428][model8_pretrain.py][INFO] Epoch:[0/2](952300/4588595) loss:2.165 lr:0.0000100 epoch_Time:23162.0min: [2024-01-06 22:45:49,428][model8_pretrain.py][INFO] Epoch:[0/2](952300/4588595) loss:2.857 lr:0.0000100 epoch_Time:23162.0min: [2024-01-06 22:46:26,350][model8_pretrain.py][INFO] Epoch:[0/2](952400/4588595) loss:3.336 lr:0.0000100 epoch_Time:23162.0min: [2024-01-06 22:46:26,350][model8_pretrain.py][INFO] Epoch:[0/2](952400/4588595) loss:2.999 lr:0.0000100 epoch_Time:23162.0min: [2024-01-06 22:46:26,350][model8_pretrain.py][INFO] Epoch:[0/2](952400/4588595) loss:2.539 lr:0.0000100 epoch_Time:23162.0min: [2024-01-06 22:46:26,350][model8_pretrain.py][INFO] Epoch:[0/2](952400/4588595) loss:2.753 lr:0.0000100 epoch_Time:23162.0min: [2024-01-06 22:46:26,350][model8_pretrain.py][INFO] Epoch:[0/2](952400/4588595) loss:3.250 lr:0.0000100 epoch_Time:23162.0min: [2024-01-06 22:46:26,350][model8_pretrain.py][INFO] Epoch:[0/2](952400/4588595) loss:2.984 lr:0.0000100 epoch_Time:23162.0min: [2024-01-06 22:46:26,350][model8_pretrain.py][INFO] Epoch:[0/2](952400/4588595) loss:2.225 lr:0.0000100 epoch_Time:23162.0min: [2024-01-06 22:46:26,351][model8_pretrain.py][INFO] Epoch:[0/2](952400/4588595) loss:2.634 lr:0.0000100 epoch_Time:23162.0min: [2024-01-06 22:47:03,286][model8_pretrain.py][INFO] Epoch:[0/2](952500/4588595) loss:2.727 lr:0.0000100 epoch_Time:23161.0min: [2024-01-06 22:47:03,286][model8_pretrain.py][INFO] Epoch:[0/2](952500/4588595) loss:3.310 lr:0.0000100 epoch_Time:23161.0min: [2024-01-06 22:47:03,286][model8_pretrain.py][INFO] Epoch:[0/2](952500/4588595) loss:2.957 lr:0.0000100 epoch_Time:23161.0min: [2024-01-06 22:47:03,286][model8_pretrain.py][INFO] Epoch:[0/2](952500/4588595) loss:2.818 lr:0.0000100 epoch_Time:23161.0min: [2024-01-06 22:47:03,286][model8_pretrain.py][INFO] Epoch:[0/2](952500/4588595) loss:2.401 lr:0.0000100 epoch_Time:23161.0min: [2024-01-06 22:47:03,286][model8_pretrain.py][INFO] Epoch:[0/2](952500/4588595) loss:2.561 lr:0.0000100 
epoch_Time:23161.0min: [2024-01-06 22:47:03,286][model8_pretrain.py][INFO] Epoch:[0/2](952500/4588595) loss:2.648 lr:0.0000100 epoch_Time:23161.0min: [2024-01-06 22:47:03,286][model8_pretrain.py][INFO] Epoch:[0/2](952500/4588595) loss:3.061 lr:0.0000100 epoch_Time:23161.0min: [2024-01-06 22:47:40,227][model8_pretrain.py][INFO] Epoch:[0/2](952600/4588595) loss:2.896 lr:0.0000100 epoch_Time:23161.0min: [2024-01-06 22:47:40,227][model8_pretrain.py][INFO] Epoch:[0/2](952600/4588595) loss:3.213 lr:0.0000100 epoch_Time:23161.0min: [2024-01-06 22:47:40,227][model8_pretrain.py][INFO] Epoch:[0/2](952600/4588595) loss:3.142 lr:0.0000100 epoch_Time:23161.0min: [2024-01-06 22:47:40,227][model8_pretrain.py][INFO] Epoch:[0/2](952600/4588595) loss:2.475 lr:0.0000100 epoch_Time:23161.0min: [2024-01-06 22:47:40,227][model8_pretrain.py][INFO] Epoch:[0/2](952600/4588595) loss:2.770 lr:0.0000100 epoch_Time:23161.0min: [2024-01-06 22:47:40,227][model8_pretrain.py][INFO] Epoch:[0/2](952600/4588595) loss:2.798 lr:0.0000100 epoch_Time:23161.0min: [2024-01-06 22:47:40,227][model8_pretrain.py][INFO] Epoch:[0/2](952600/4588595) loss:2.796 lr:0.0000100 epoch_Time:23161.0min: [2024-01-06 22:47:40,227][model8_pretrain.py][INFO] Epoch:[0/2](952600/4588595) loss:2.778 lr:0.0000100 epoch_Time:23161.0min: [2024-01-06 22:48:17,161][model8_pretrain.py][INFO] Epoch:[0/2](952700/4588595) loss:3.022 lr:0.0000100 epoch_Time:23160.0min: [2024-01-06 22:48:17,161][model8_pretrain.py][INFO] Epoch:[0/2](952700/4588595) loss:2.706 lr:0.0000100 epoch_Time:23160.0min: [2024-01-06 22:48:17,161][model8_pretrain.py][INFO] Epoch:[0/2](952700/4588595) loss:2.723 lr:0.0000100 epoch_Time:23160.0min: [2024-01-06 22:48:17,161][model8_pretrain.py][INFO] Epoch:[0/2](952700/4588595) loss:2.942 lr:0.0000100 epoch_Time:23160.0min: [2024-01-06 22:48:17,161][model8_pretrain.py][INFO] Epoch:[0/2](952700/4588595) loss:3.346 lr:0.0000100 epoch_Time:23160.0min: [2024-01-06 22:48:17,161][model8_pretrain.py][INFO] 
Epoch:[0/2](952700/4588595) loss:2.097 lr:0.0000100 epoch_Time:23160.0min: [2024-01-06 22:48:17,161][model8_pretrain.py][INFO] Epoch:[0/2](952700/4588595) loss:3.162 lr:0.0000100 epoch_Time:23160.0min: [2024-01-06 22:48:17,161][model8_pretrain.py][INFO] Epoch:[0/2](952700/4588595) loss:2.990 lr:0.0000100 epoch_Time:23160.0min: [2024-01-06 22:48:54,097][model8_pretrain.py][INFO] Epoch:[0/2](952800/4588595) loss:2.691 lr:0.0000100 epoch_Time:23159.0min: [2024-01-06 22:48:54,097][model8_pretrain.py][INFO] Epoch:[0/2](952800/4588595) loss:2.303 lr:0.0000100 epoch_Time:23159.0min: [2024-01-06 22:48:54,097][model8_pretrain.py][INFO] Epoch:[0/2](952800/4588595) loss:2.560 lr:0.0000100 epoch_Time:23159.0min: [2024-01-06 22:48:54,097][model8_pretrain.py][INFO] Epoch:[0/2](952800/4588595) loss:2.557 lr:0.0000100 epoch_Time:23159.0min: [2024-01-06 22:48:54,097][model8_pretrain.py][INFO] Epoch:[0/2](952800/4588595) loss:2.995 lr:0.0000100 epoch_Time:23159.0min: [2024-01-06 22:48:54,097][model8_pretrain.py][INFO] Epoch:[0/2](952800/4588595) loss:2.433 lr:0.0000100 epoch_Time:23159.0min: [2024-01-06 22:48:54,097][model8_pretrain.py][INFO] Epoch:[0/2](952800/4588595) loss:2.798 lr:0.0000100 epoch_Time:23159.0min: [2024-01-06 22:48:54,097][model8_pretrain.py][INFO] Epoch:[0/2](952800/4588595) loss:2.766 lr:0.0000100 epoch_Time:23159.0min: [2024-01-06 22:49:31,031][model8_pretrain.py][INFO] Epoch:[0/2](952900/4588595) loss:2.586 lr:0.0000100 epoch_Time:23159.0min: [2024-01-06 22:49:31,031][model8_pretrain.py][INFO] Epoch:[0/2](952900/4588595) loss:2.494 lr:0.0000100 epoch_Time:23159.0min: [2024-01-06 22:49:31,031][model8_pretrain.py][INFO] Epoch:[0/2](952900/4588595) loss:2.961 lr:0.0000100 epoch_Time:23159.0min: [2024-01-06 22:49:31,031][model8_pretrain.py][INFO] Epoch:[0/2](952900/4588595) loss:2.686 lr:0.0000100 epoch_Time:23159.0min: [2024-01-06 22:49:31,031][model8_pretrain.py][INFO] Epoch:[0/2](952900/4588595) loss:2.447 lr:0.0000100 epoch_Time:23159.0min: [2024-01-06 
22:49:31,031][model8_pretrain.py][INFO] Epoch:[0/2](952900/4588595) loss:2.879 lr:0.0000100 epoch_Time:23159.0min: [2024-01-06 22:49:31,031][model8_pretrain.py][INFO] Epoch:[0/2](952900/4588595) loss:2.633 lr:0.0000100 epoch_Time:23159.0min: [2024-01-06 22:49:31,031][model8_pretrain.py][INFO] Epoch:[0/2](952900/4588595) loss:2.864 lr:0.0000100 epoch_Time:23159.0min: [2024-01-06 22:50:07,965][model8_pretrain.py][INFO] Epoch:[0/2](953000/4588595) loss:2.734 lr:0.0000100 epoch_Time:23158.0min: [2024-01-06 22:50:07,965][model8_pretrain.py][INFO] Epoch:[0/2](953000/4588595) loss:2.786 lr:0.0000100 epoch_Time:23158.0min: [2024-01-06 22:50:07,965][model8_pretrain.py][INFO] Epoch:[0/2](953000/4588595) loss:3.201 lr:0.0000100 epoch_Time:23158.0min: [2024-01-06 22:50:07,965][model8_pretrain.py][INFO] Epoch:[0/2](953000/4588595) loss:2.479 lr:0.0000100 epoch_Time:23158.0min: [2024-01-06 22:50:07,965][model8_pretrain.py][INFO] Epoch:[0/2](953000/4588595) loss:2.854 lr:0.0000100 epoch_Time:23158.0min: [2024-01-06 22:50:07,965][model8_pretrain.py][INFO] Epoch:[0/2](953000/4588595) loss:3.173 lr:0.0000100 epoch_Time:23158.0min: [2024-01-06 22:50:07,965][model8_pretrain.py][INFO] Epoch:[0/2](953000/4588595) loss:2.592 lr:0.0000100 epoch_Time:23158.0min: [2024-01-06 22:50:07,965][model8_pretrain.py][INFO] Epoch:[0/2](953000/4588595) loss:2.730 lr:0.0000100 epoch_Time:23158.0min: [2024-01-06 22:50:58,584][model8_pretrain.py][INFO] Epoch:[0/2](953100/4588595) loss:3.039 lr:0.0000100 epoch_Time:23158.0min: [2024-01-06 22:50:58,584][model8_pretrain.py][INFO] Epoch:[0/2](953100/4588595) loss:2.237 lr:0.0000100 epoch_Time:23158.0min: [2024-01-06 22:50:58,584][model8_pretrain.py][INFO] Epoch:[0/2](953100/4588595) loss:2.845 lr:0.0000100 epoch_Time:23158.0min: [2024-01-06 22:50:58,584][model8_pretrain.py][INFO] Epoch:[0/2](953100/4588595) loss:1.541 lr:0.0000100 epoch_Time:23158.0min: [2024-01-06 22:50:58,584][model8_pretrain.py][INFO] Epoch:[0/2](953100/4588595) loss:3.214 lr:0.0000100 
epoch_Time:23158.0min: [2024-01-06 22:50:58,584][model8_pretrain.py][INFO] Epoch:[0/2](953100/4588595) loss:3.236 lr:0.0000100 epoch_Time:23158.0min: [2024-01-06 22:50:58,584][model8_pretrain.py][INFO] Epoch:[0/2](953100/4588595) loss:2.715 lr:0.0000100 epoch_Time:23158.0min: [2024-01-06 22:50:58,584][model8_pretrain.py][INFO] Epoch:[0/2](953100/4588595) loss:3.428 lr:0.0000100 epoch_Time:23158.0min: [2024-01-06 22:51:35,494][model8_pretrain.py][INFO] Epoch:[0/2](953200/4588595) loss:2.473 lr:0.0000100 epoch_Time:23157.0min: [2024-01-06 22:51:35,494][model8_pretrain.py][INFO] Epoch:[0/2](953200/4588595) loss:2.739 lr:0.0000100 epoch_Time:23157.0min: [2024-01-06 22:51:35,494][model8_pretrain.py][INFO] Epoch:[0/2](953200/4588595) loss:2.979 lr:0.0000100 epoch_Time:23157.0min: [2024-01-06 22:51:35,494][model8_pretrain.py][INFO] Epoch:[0/2](953200/4588595) loss:3.001 lr:0.0000100 epoch_Time:23157.0min: [2024-01-06 22:51:35,494][model8_pretrain.py][INFO] Epoch:[0/2](953200/4588595) loss:2.882 lr:0.0000100 epoch_Time:23157.0min: [2024-01-06 22:51:35,494][model8_pretrain.py][INFO] Epoch:[0/2](953200/4588595) loss:2.998 lr:0.0000100 epoch_Time:23157.0min: [2024-01-06 22:51:35,494][model8_pretrain.py][INFO] Epoch:[0/2](953200/4588595) loss:3.062 lr:0.0000100 epoch_Time:23157.0min: [2024-01-06 22:51:35,494][model8_pretrain.py][INFO] Epoch:[0/2](953200/4588595) loss:2.423 lr:0.0000100 epoch_Time:23157.0min: [2024-01-06 22:52:12,428][model8_pretrain.py][INFO] Epoch:[0/2](953300/4588595) loss:2.265 lr:0.0000100 epoch_Time:23156.0min: [2024-01-06 22:52:12,428][model8_pretrain.py][INFO] Epoch:[0/2](953300/4588595) loss:2.614 lr:0.0000100 epoch_Time:23156.0min: [2024-01-06 22:52:12,428][model8_pretrain.py][INFO] Epoch:[0/2](953300/4588595) loss:3.139 lr:0.0000100 epoch_Time:23156.0min: [2024-01-06 22:52:12,428][model8_pretrain.py][INFO] Epoch:[0/2](953300/4588595) loss:3.089 lr:0.0000100 epoch_Time:23156.0min: [2024-01-06 22:52:12,428][model8_pretrain.py][INFO] 
Epoch:[0/2](953300/4588595) loss:3.208 lr:0.0000100 epoch_Time:23156.0min: [2024-01-06 22:52:12,428][model8_pretrain.py][INFO] Epoch:[0/2](953300/4588595) loss:2.885 lr:0.0000100 epoch_Time:23156.0min: [2024-01-06 22:52:12,428][model8_pretrain.py][INFO] Epoch:[0/2](953300/4588595) loss:3.014 lr:0.0000100 epoch_Time:23156.0min: [2024-01-06 22:52:12,428][model8_pretrain.py][INFO] Epoch:[0/2](953300/4588595) loss:1.927 lr:0.0000100 epoch_Time:23156.0min: [2024-01-06 22:52:49,373][model8_pretrain.py][INFO] Epoch:[0/2](953400/4588595) loss:2.433 lr:0.0000100 epoch_Time:23155.0min: [2024-01-06 22:52:49,373][model8_pretrain.py][INFO] Epoch:[0/2](953400/4588595) loss:3.315 lr:0.0000100 epoch_Time:23155.0min: [2024-01-06 22:52:49,373][model8_pretrain.py][INFO] Epoch:[0/2](953400/4588595) loss:2.655 lr:0.0000100 epoch_Time:23155.0min: [2024-01-06 22:52:49,373][model8_pretrain.py][INFO] Epoch:[0/2](953400/4588595) loss:2.344 lr:0.0000100 epoch_Time:23155.0min: [2024-01-06 22:52:49,373][model8_pretrain.py][INFO] Epoch:[0/2](953400/4588595) loss:2.800 lr:0.0000100 epoch_Time:23155.0min: [2024-01-06 22:52:49,373][model8_pretrain.py][INFO] Epoch:[0/2](953400/4588595) loss:2.625 lr:0.0000100 epoch_Time:23155.0min: [2024-01-06 22:52:49,373][model8_pretrain.py][INFO] Epoch:[0/2](953400/4588595) loss:2.944 lr:0.0000100 epoch_Time:23155.0min: [2024-01-06 22:52:49,373][model8_pretrain.py][INFO] Epoch:[0/2](953400/4588595) loss:2.859 lr:0.0000100 epoch_Time:23155.0min: [2024-01-06 22:53:26,309][model8_pretrain.py][INFO] Epoch:[0/2](953500/4588595) loss:1.944 lr:0.0000100 epoch_Time:23155.0min: [2024-01-06 22:53:26,309][model8_pretrain.py][INFO] Epoch:[0/2](953500/4588595) loss:3.283 lr:0.0000100 epoch_Time:23155.0min: [2024-01-06 22:53:26,309][model8_pretrain.py][INFO] Epoch:[0/2](953500/4588595) loss:3.050 lr:0.0000100 epoch_Time:23155.0min: [2024-01-06 22:53:26,309][model8_pretrain.py][INFO] Epoch:[0/2](953500/4588595) loss:2.984 lr:0.0000100 epoch_Time:23155.0min: [2024-01-06 
22:53:26,309][model8_pretrain.py][INFO] Epoch:[0/2](953500/4588595) loss:2.809 lr:0.0000100 epoch_Time:23155.0min: [2024-01-06 22:53:26,309][model8_pretrain.py][INFO] Epoch:[0/2](953500/4588595) loss:2.653 lr:0.0000100 epoch_Time:23155.0min: [2024-01-06 22:53:26,309][model8_pretrain.py][INFO] Epoch:[0/2](953500/4588595) loss:2.909 lr:0.0000100 epoch_Time:23155.0min: [2024-01-06 22:53:26,309][model8_pretrain.py][INFO] Epoch:[0/2](953500/4588595) loss:2.968 lr:0.0000100 epoch_Time:23155.0min: [2024-01-06 22:54:03,251][model8_pretrain.py][INFO] Epoch:[0/2](953600/4588595) loss:2.470 lr:0.0000100 epoch_Time:23154.0min: [2024-01-06 22:54:03,251][model8_pretrain.py][INFO] Epoch:[0/2](953600/4588595) loss:2.826 lr:0.0000100 epoch_Time:23154.0min: [2024-01-06 22:54:03,251][model8_pretrain.py][INFO] Epoch:[0/2](953600/4588595) loss:2.413 lr:0.0000100 epoch_Time:23154.0min: [2024-01-06 22:54:03,251][model8_pretrain.py][INFO] Epoch:[0/2](953600/4588595) loss:2.818 lr:0.0000100 epoch_Time:23154.0min: [2024-01-06 22:54:03,251][model8_pretrain.py][INFO] Epoch:[0/2](953600/4588595) loss:3.249 lr:0.0000100 epoch_Time:23154.0min: [2024-01-06 22:54:03,251][model8_pretrain.py][INFO] Epoch:[0/2](953600/4588595) loss:2.892 lr:0.0000100 epoch_Time:23154.0min: [2024-01-06 22:54:03,251][model8_pretrain.py][INFO] Epoch:[0/2](953600/4588595) loss:3.097 lr:0.0000100 epoch_Time:23154.0min: [2024-01-06 22:54:03,251][model8_pretrain.py][INFO] Epoch:[0/2](953600/4588595) loss:2.485 lr:0.0000100 epoch_Time:23154.0min: [2024-01-06 22:54:40,191][model8_pretrain.py][INFO] Epoch:[0/2](953700/4588595) loss:2.963 lr:0.0000100 epoch_Time:23154.0min: [2024-01-06 22:54:40,191][model8_pretrain.py][INFO] Epoch:[0/2](953700/4588595) loss:3.009 lr:0.0000100 epoch_Time:23154.0min: [2024-01-06 22:54:40,192][model8_pretrain.py][INFO] Epoch:[0/2](953700/4588595) loss:3.196 lr:0.0000100 epoch_Time:23154.0min: [2024-01-06 22:54:40,192][model8_pretrain.py][INFO] Epoch:[0/2](953700/4588595) loss:3.203 lr:0.0000100 
epoch_Time:23154.0min: [2024-01-06 22:54:40,192][model8_pretrain.py][INFO] Epoch:[0/2](953700/4588595) loss:3.175 lr:0.0000100 epoch_Time:23154.0min: [2024-01-06 22:54:40,192][model8_pretrain.py][INFO] Epoch:[0/2](953700/4588595) loss:2.927 lr:0.0000100 epoch_Time:23154.0min: [2024-01-06 22:54:40,192][model8_pretrain.py][INFO] Epoch:[0/2](953700/4588595) loss:2.685 lr:0.0000100 epoch_Time:23154.0min: [2024-01-06 22:54:40,192][model8_pretrain.py][INFO] Epoch:[0/2](953700/4588595) loss:2.998 lr:0.0000100 epoch_Time:23154.0min: [2024-01-06 22:55:17,119][model8_pretrain.py][INFO] Epoch:[0/2](953800/4588595) loss:2.997 lr:0.0000100 epoch_Time:23153.0min: [2024-01-06 22:55:17,119][model8_pretrain.py][INFO] Epoch:[0/2](953800/4588595) loss:3.254 lr:0.0000100 epoch_Time:23153.0min: [2024-01-06 22:55:17,119][model8_pretrain.py][INFO] Epoch:[0/2](953800/4588595) loss:2.593 lr:0.0000100 epoch_Time:23153.0min: [2024-01-06 22:55:17,119][model8_pretrain.py][INFO] Epoch:[0/2](953800/4588595) loss:2.880 lr:0.0000100 epoch_Time:23153.0min: [2024-01-06 22:55:17,119][model8_pretrain.py][INFO] Epoch:[0/2](953800/4588595) loss:2.027 lr:0.0000100 epoch_Time:23153.0min: [2024-01-06 22:55:17,119][model8_pretrain.py][INFO] Epoch:[0/2](953800/4588595) loss:2.509 lr:0.0000100 epoch_Time:23153.0min: [2024-01-06 22:55:17,119][model8_pretrain.py][INFO] Epoch:[0/2](953800/4588595) loss:3.175 lr:0.0000100 epoch_Time:23153.0min: [2024-01-06 22:55:17,119][model8_pretrain.py][INFO] Epoch:[0/2](953800/4588595) loss:2.345 lr:0.0000100 epoch_Time:23153.0min: [2024-01-06 22:56:07,594][model8_pretrain.py][INFO] Epoch:[0/2](953900/4588595) loss:2.695 lr:0.0000100 epoch_Time:23153.0min: [2024-01-06 22:56:07,594][model8_pretrain.py][INFO] Epoch:[0/2](953900/4588595) loss:2.709 lr:0.0000100 epoch_Time:23153.0min: [2024-01-06 22:56:07,594][model8_pretrain.py][INFO] Epoch:[0/2](953900/4588595) loss:2.898 lr:0.0000100 epoch_Time:23153.0min: [2024-01-06 22:56:07,594][model8_pretrain.py][INFO] 
Epoch:[0/2](953900/4588595) loss:3.210 lr:0.0000100 epoch_Time:23153.0min: [2024-01-06 22:56:07,594][model8_pretrain.py][INFO] Epoch:[0/2](953900/4588595) loss:2.795 lr:0.0000100 epoch_Time:23153.0min: [2024-01-06 22:56:07,594][model8_pretrain.py][INFO] Epoch:[0/2](953900/4588595) loss:3.072 lr:0.0000100 epoch_Time:23153.0min: [2024-01-06 22:56:07,594][model8_pretrain.py][INFO] Epoch:[0/2](953900/4588595) loss:2.648 lr:0.0000100 epoch_Time:23153.0min: [2024-01-06 22:56:07,595][model8_pretrain.py][INFO] Epoch:[0/2](953900/4588595) loss:3.125 lr:0.0000100 epoch_Time:23153.0min: [2024-01-06 22:56:44,512][model8_pretrain.py][INFO] Epoch:[0/2](954000/4588595) loss:2.421 lr:0.0000100 epoch_Time:23153.0min: [2024-01-06 22:56:44,512][model8_pretrain.py][INFO] Epoch:[0/2](954000/4588595) loss:2.873 lr:0.0000100 epoch_Time:23153.0min: [2024-01-06 22:56:44,512][model8_pretrain.py][INFO] Epoch:[0/2](954000/4588595) loss:2.220 lr:0.0000100 epoch_Time:23153.0min: [2024-01-06 22:56:44,512][model8_pretrain.py][INFO] Epoch:[0/2](954000/4588595) loss:2.630 lr:0.0000100 epoch_Time:23153.0min: [2024-01-06 22:56:44,512][model8_pretrain.py][INFO] Epoch:[0/2](954000/4588595) loss:2.652 lr:0.0000100 epoch_Time:23153.0min: [2024-01-06 22:56:44,512][model8_pretrain.py][INFO] Epoch:[0/2](954000/4588595) loss:2.800 lr:0.0000100 epoch_Time:23153.0min: [2024-01-06 22:56:44,512][model8_pretrain.py][INFO] Epoch:[0/2](954000/4588595) loss:2.779 lr:0.0000100 epoch_Time:23153.0min: [2024-01-06 22:56:44,513][model8_pretrain.py][INFO] Epoch:[0/2](954000/4588595) loss:2.560 lr:0.0000100 epoch_Time:23153.0min: [2024-01-06 22:57:21,445][model8_pretrain.py][INFO] Epoch:[0/2](954100/4588595) loss:2.574 lr:0.0000100 epoch_Time:23152.0min: [2024-01-06 22:57:21,445][model8_pretrain.py][INFO] Epoch:[0/2](954100/4588595) loss:2.722 lr:0.0000100 epoch_Time:23152.0min: [2024-01-06 22:57:21,445][model8_pretrain.py][INFO] Epoch:[0/2](954100/4588595) loss:2.720 lr:0.0000100 epoch_Time:23152.0min: [2024-01-06 
22:57:21,445][model8_pretrain.py][INFO] Epoch:[0/2](954100/4588595) loss:2.599 lr:0.0000100 epoch_Time:23152.0min: [2024-01-06 22:57:21,445][model8_pretrain.py][INFO] Epoch:[0/2](954100/4588595) loss:2.867 lr:0.0000100 epoch_Time:23152.0min: [2024-01-06 22:57:21,445][model8_pretrain.py][INFO] Epoch:[0/2](954100/4588595) loss:2.797 lr:0.0000100 epoch_Time:23152.0min: [2024-01-06 22:57:21,445][model8_pretrain.py][INFO] Epoch:[0/2](954100/4588595) loss:2.706 lr:0.0000100 epoch_Time:23152.0min: [2024-01-06 22:57:21,445][model8_pretrain.py][INFO] Epoch:[0/2](954100/4588595) loss:1.865 lr:0.0000100 epoch_Time:23152.0min: [2024-01-06 22:57:58,388][model8_pretrain.py][INFO] Epoch:[0/2](954200/4588595) loss:2.843 lr:0.0000100 epoch_Time:23150.0min: [2024-01-06 22:57:58,389][model8_pretrain.py][INFO] Epoch:[0/2](954200/4588595) loss:2.608 lr:0.0000100 epoch_Time:23150.0min: [2024-01-06 22:57:58,389][model8_pretrain.py][INFO] Epoch:[0/2](954200/4588595) loss:2.704 lr:0.0000100 epoch_Time:23150.0min: [2024-01-06 22:57:58,389][model8_pretrain.py][INFO] Epoch:[0/2](954200/4588595) loss:2.686 lr:0.0000100 epoch_Time:23150.0min: [2024-01-06 22:57:58,389][model8_pretrain.py][INFO] Epoch:[0/2](954200/4588595) loss:2.751 lr:0.0000100 epoch_Time:23150.0min: [2024-01-06 22:57:58,389][model8_pretrain.py][INFO] Epoch:[0/2](954200/4588595) loss:2.532 lr:0.0000100 epoch_Time:23150.0min: [2024-01-06 22:57:58,389][model8_pretrain.py][INFO] Epoch:[0/2](954200/4588595) loss:3.132 lr:0.0000100 epoch_Time:23150.0min: [2024-01-06 22:57:58,389][model8_pretrain.py][INFO] Epoch:[0/2](954200/4588595) loss:2.771 lr:0.0000100 epoch_Time:23150.0min: [2024-01-06 22:58:35,331][model8_pretrain.py][INFO] Epoch:[0/2](954300/4588595) loss:2.824 lr:0.0000100 epoch_Time:23150.0min: [2024-01-06 22:58:35,331][model8_pretrain.py][INFO] Epoch:[0/2](954300/4588595) loss:2.955 lr:0.0000100 epoch_Time:23150.0min: [2024-01-06 22:58:35,331][model8_pretrain.py][INFO] Epoch:[0/2](954300/4588595) loss:3.185 lr:0.0000100 
epoch_Time:23150.0min: [2024-01-06 22:58:35,331][model8_pretrain.py][INFO] Epoch:[0/2](954300/4588595) loss:3.102 lr:0.0000100 epoch_Time:23150.0min: [2024-01-06 22:58:35,331][model8_pretrain.py][INFO] Epoch:[0/2](954300/4588595) loss:2.611 lr:0.0000100 epoch_Time:23150.0min: [2024-01-06 22:58:35,331][model8_pretrain.py][INFO] Epoch:[0/2](954300/4588595) loss:2.459 lr:0.0000100 epoch_Time:23150.0min: [2024-01-06 22:58:35,331][model8_pretrain.py][INFO] Epoch:[0/2](954300/4588595) loss:2.380 lr:0.0000100 epoch_Time:23150.0min: [2024-01-06 22:58:35,331][model8_pretrain.py][INFO] Epoch:[0/2](954300/4588595) loss:3.009 lr:0.0000100 epoch_Time:23150.0min: [2024-01-06 22:59:12,275][model8_pretrain.py][INFO] Epoch:[0/2](954400/4588595) loss:2.759 lr:0.0000100 epoch_Time:23149.0min: [2024-01-06 22:59:12,275][model8_pretrain.py][INFO] Epoch:[0/2](954400/4588595) loss:3.427 lr:0.0000100 epoch_Time:23149.0min: [2024-01-06 22:59:12,275][model8_pretrain.py][INFO] Epoch:[0/2](954400/4588595) loss:2.376 lr:0.0000100 epoch_Time:23149.0min: [2024-01-06 22:59:12,275][model8_pretrain.py][INFO] Epoch:[0/2](954400/4588595) loss:2.547 lr:0.0000100 epoch_Time:23149.0min: [2024-01-06 22:59:12,275][model8_pretrain.py][INFO] Epoch:[0/2](954400/4588595) loss:2.769 lr:0.0000100 epoch_Time:23149.0min: [2024-01-06 22:59:12,275][model8_pretrain.py][INFO] Epoch:[0/2](954400/4588595) loss:2.683 lr:0.0000100 epoch_Time:23149.0min: [2024-01-06 22:59:12,275][model8_pretrain.py][INFO] Epoch:[0/2](954400/4588595) loss:3.336 lr:0.0000100 epoch_Time:23149.0min: [2024-01-06 22:59:12,275][model8_pretrain.py][INFO] Epoch:[0/2](954400/4588595) loss:2.697 lr:0.0000100 epoch_Time:23149.0min: [2024-01-06 22:59:49,218][model8_pretrain.py][INFO] Epoch:[0/2](954500/4588595) loss:3.008 lr:0.0000100 epoch_Time:23148.0min: [2024-01-06 22:59:49,218][model8_pretrain.py][INFO] Epoch:[0/2](954500/4588595) loss:2.944 lr:0.0000100 epoch_Time:23148.0min: [2024-01-06 22:59:49,218][model8_pretrain.py][INFO] 
Epoch:[0/2](954500/4588595) loss:2.832 lr:0.0000100 epoch_Time:23148.0min: [2024-01-06 22:59:49,218][model8_pretrain.py][INFO] Epoch:[0/2](954500/4588595) loss:3.024 lr:0.0000100 epoch_Time:23148.0min: [2024-01-06 22:59:49,218][model8_pretrain.py][INFO] Epoch:[0/2](954500/4588595) loss:2.409 lr:0.0000100 epoch_Time:23148.0min: [2024-01-06 22:59:49,218][model8_pretrain.py][INFO] Epoch:[0/2](954500/4588595) loss:2.561 lr:0.0000100 epoch_Time:23148.0min: [2024-01-06 22:59:49,218][model8_pretrain.py][INFO] Epoch:[0/2](954500/4588595) loss:2.471 lr:0.0000100 epoch_Time:23148.0min: [2024-01-06 22:59:49,219][model8_pretrain.py][INFO] Epoch:[0/2](954500/4588595) loss:3.087 lr:0.0000100 epoch_Time:23148.0min: [2024-01-06 23:00:26,186][model8_pretrain.py][INFO] Epoch:[0/2](954600/4588595) loss:2.821 lr:0.0000100 epoch_Time:23148.0min: [2024-01-06 23:00:26,186][model8_pretrain.py][INFO] Epoch:[0/2](954600/4588595) loss:2.847 lr:0.0000100 epoch_Time:23148.0min: [2024-01-06 23:00:26,186][model8_pretrain.py][INFO] Epoch:[0/2](954600/4588595) loss:3.145 lr:0.0000100 epoch_Time:23148.0min: [2024-01-06 23:00:26,186][model8_pretrain.py][INFO] Epoch:[0/2](954600/4588595) loss:3.006 lr:0.0000100 epoch_Time:23148.0min: [2024-01-06 23:00:26,186][model8_pretrain.py][INFO] Epoch:[0/2](954600/4588595) loss:3.100 lr:0.0000100 epoch_Time:23148.0min: [2024-01-06 23:00:26,186][model8_pretrain.py][INFO] Epoch:[0/2](954600/4588595) loss:3.281 lr:0.0000100 epoch_Time:23148.0min: [2024-01-06 23:00:26,186][model8_pretrain.py][INFO] Epoch:[0/2](954600/4588595) loss:2.752 lr:0.0000100 epoch_Time:23148.0min: [2024-01-06 23:00:26,186][model8_pretrain.py][INFO] Epoch:[0/2](954600/4588595) loss:3.345 lr:0.0000100 epoch_Time:23148.0min: [2024-01-06 23:01:13,356][model8_pretrain.py][INFO] Epoch:[0/2](954700/4588595) loss:2.749 lr:0.0000100 epoch_Time:23148.0min: [2024-01-06 23:01:13,356][model8_pretrain.py][INFO] Epoch:[0/2](954700/4588595) loss:2.756 lr:0.0000100 epoch_Time:23148.0min: [2024-01-06 
23:01:13,356][model8_pretrain.py][INFO] Epoch:[0/2](954700/4588595) loss:3.161 lr:0.0000100 epoch_Time:23148.0min: [2024-01-06 23:01:13,356][model8_pretrain.py][INFO] Epoch:[0/2](954700/4588595) loss:3.049 lr:0.0000100 epoch_Time:23148.0min: [2024-01-06 23:01:13,356][model8_pretrain.py][INFO] Epoch:[0/2](954700/4588595) loss:2.474 lr:0.0000100 epoch_Time:23148.0min: [2024-01-06 23:01:13,356][model8_pretrain.py][INFO] Epoch:[0/2](954700/4588595) loss:2.566 lr:0.0000100 epoch_Time:23148.0min: [2024-01-06 23:01:13,356][model8_pretrain.py][INFO] Epoch:[0/2](954700/4588595) loss:2.468 lr:0.0000100 epoch_Time:23148.0min: [2024-01-06 23:01:13,356][model8_pretrain.py][INFO] Epoch:[0/2](954700/4588595) loss:2.862 lr:0.0000100 epoch_Time:23148.0min: [2024-01-06 23:01:53,742][model8_pretrain.py][INFO] Epoch:[0/2](954800/4588595) loss:2.762 lr:0.0000100 epoch_Time:23147.0min: [2024-01-06 23:01:53,742][model8_pretrain.py][INFO] Epoch:[0/2](954800/4588595) loss:3.221 lr:0.0000100 epoch_Time:23147.0min: [2024-01-06 23:01:53,742][model8_pretrain.py][INFO] Epoch:[0/2](954800/4588595) loss:2.541 lr:0.0000100 epoch_Time:23147.0min: [2024-01-06 23:01:53,742][model8_pretrain.py][INFO] Epoch:[0/2](954800/4588595) loss:2.436 lr:0.0000100 epoch_Time:23147.0min: [2024-01-06 23:01:53,742][model8_pretrain.py][INFO] Epoch:[0/2](954800/4588595) loss:3.582 lr:0.0000100 epoch_Time:23147.0min: [2024-01-06 23:01:53,742][model8_pretrain.py][INFO] Epoch:[0/2](954800/4588595) loss:2.203 lr:0.0000100 epoch_Time:23147.0min: [2024-01-06 23:01:53,742][model8_pretrain.py][INFO] Epoch:[0/2](954800/4588595) loss:2.314 lr:0.0000100 epoch_Time:23147.0min: [2024-01-06 23:01:53,743][model8_pretrain.py][INFO] Epoch:[0/2](954800/4588595) loss:2.601 lr:0.0000100 epoch_Time:23147.0min: [2024-01-06 23:02:30,687][model8_pretrain.py][INFO] Epoch:[0/2](954900/4588595) loss:3.022 lr:0.0000100 epoch_Time:23147.0min: [2024-01-06 23:02:30,687][model8_pretrain.py][INFO] Epoch:[0/2](954900/4588595) loss:2.890 lr:0.0000100 
epoch_Time:23147.0min: [2024-01-06 23:02:30,687][model8_pretrain.py][INFO] Epoch:[0/2](954900/4588595) loss:2.816 lr:0.0000100 epoch_Time:23147.0min: [2024-01-06 23:02:30,687][model8_pretrain.py][INFO] Epoch:[0/2](954900/4588595) loss:3.382 lr:0.0000100 epoch_Time:23147.0min: [2024-01-06 23:02:30,687][model8_pretrain.py][INFO] Epoch:[0/2](954900/4588595) loss:2.886 lr:0.0000100 epoch_Time:23147.0min: [2024-01-06 23:02:30,687][model8_pretrain.py][INFO] Epoch:[0/2](954900/4588595) loss:2.683 lr:0.0000100 epoch_Time:23147.0min: [2024-01-06 23:02:30,687][model8_pretrain.py][INFO] Epoch:[0/2](954900/4588595) loss:2.707 lr:0.0000100 epoch_Time:23147.0min: [2024-01-06 23:02:30,687][model8_pretrain.py][INFO] Epoch:[0/2](954900/4588595) loss:3.485 lr:0.0000100 epoch_Time:23147.0min: [2024-01-06 23:03:07,620][model8_pretrain.py][INFO] Epoch:[0/2](955000/4588595) loss:2.699 lr:0.0000100 epoch_Time:23146.0min: [2024-01-06 23:03:07,620][model8_pretrain.py][INFO] Epoch:[0/2](955000/4588595) loss:2.856 lr:0.0000100 epoch_Time:23146.0min: [2024-01-06 23:03:07,620][model8_pretrain.py][INFO] Epoch:[0/2](955000/4588595) loss:2.988 lr:0.0000100 epoch_Time:23146.0min: [2024-01-06 23:03:07,620][model8_pretrain.py][INFO] Epoch:[0/2](955000/4588595) loss:3.225 lr:0.0000100 epoch_Time:23146.0min: [2024-01-06 23:03:07,620][model8_pretrain.py][INFO] Epoch:[0/2](955000/4588595) loss:2.468 lr:0.0000100 epoch_Time:23146.0min: [2024-01-06 23:03:07,620][model8_pretrain.py][INFO] Epoch:[0/2](955000/4588595) loss:2.622 lr:0.0000100 epoch_Time:23146.0min: [2024-01-06 23:03:07,621][model8_pretrain.py][INFO] Epoch:[0/2](955000/4588595) loss:2.738 lr:0.0000100 epoch_Time:23146.0min: [2024-01-06 23:03:07,621][model8_pretrain.py][INFO] Epoch:[0/2](955000/4588595) loss:2.791 lr:0.0000100 epoch_Time:23146.0min: [2024-01-06 23:03:44,564][model8_pretrain.py][INFO] Epoch:[0/2](955100/4588595) loss:2.243 lr:0.0000100 epoch_Time:23146.0min: [2024-01-06 23:03:44,564][model8_pretrain.py][INFO] 
Epoch:[0/2](955100/4588595) loss:3.397 lr:0.0000100 epoch_Time:23146.0min: [2024-01-06 23:03:44,564][model8_pretrain.py][INFO] Epoch:[0/2](955100/4588595) loss:3.155 lr:0.0000100 epoch_Time:23146.0min: [2024-01-06 23:03:44,564][model8_pretrain.py][INFO] Epoch:[0/2](955100/4588595) loss:2.865 lr:0.0000100 epoch_Time:23146.0min: [2024-01-06 23:03:44,564][model8_pretrain.py][INFO] Epoch:[0/2](955100/4588595) loss:2.419 lr:0.0000100 epoch_Time:23146.0min: [2024-01-06 23:03:44,564][model8_pretrain.py][INFO] Epoch:[0/2](955100/4588595) loss:2.973 lr:0.0000100 epoch_Time:23146.0min: [2024-01-06 23:03:44,564][model8_pretrain.py][INFO] Epoch:[0/2](955100/4588595) loss:2.218 lr:0.0000100 epoch_Time:23146.0min: [2024-01-06 23:03:44,564][model8_pretrain.py][INFO] Epoch:[0/2](955100/4588595) loss:2.430 lr:0.0000100 epoch_Time:23146.0min: [2024-01-06 23:04:21,515][model8_pretrain.py][INFO] Epoch:[0/2](955200/4588595) loss:2.793 lr:0.0000100 epoch_Time:23145.0min: [2024-01-06 23:04:21,516][model8_pretrain.py][INFO] Epoch:[0/2](955200/4588595) loss:2.986 lr:0.0000100 epoch_Time:23145.0min: [2024-01-06 23:04:21,515][model8_pretrain.py][INFO] Epoch:[0/2](955200/4588595) loss:2.703 lr:0.0000100 epoch_Time:23145.0min: [2024-01-06 23:04:21,516][model8_pretrain.py][INFO] Epoch:[0/2](955200/4588595) loss:3.082 lr:0.0000100 epoch_Time:23145.0min: [2024-01-06 23:04:21,516][model8_pretrain.py][INFO] Epoch:[0/2](955200/4588595) loss:3.018 lr:0.0000100 epoch_Time:23145.0min: [2024-01-06 23:04:21,516][model8_pretrain.py][INFO] Epoch:[0/2](955200/4588595) loss:3.373 lr:0.0000100 epoch_Time:23145.0min: [2024-01-06 23:04:21,516][model8_pretrain.py][INFO] Epoch:[0/2](955200/4588595) loss:2.667 lr:0.0000100 epoch_Time:23145.0min: [2024-01-06 23:04:21,516][model8_pretrain.py][INFO] Epoch:[0/2](955200/4588595) loss:3.062 lr:0.0000100 epoch_Time:23145.0min: [2024-01-06 23:04:58,458][model8_pretrain.py][INFO] Epoch:[0/2](955300/4588595) loss:3.020 lr:0.0000100 epoch_Time:23143.0min: [2024-01-06 
23:04:58,458][model8_pretrain.py][INFO] Epoch:[0/2](955300/4588595) loss:2.854 lr:0.0000100 epoch_Time:23143.0min: [2024-01-06 23:04:58,458][model8_pretrain.py][INFO] Epoch:[0/2](955300/4588595) loss:3.285 lr:0.0000100 epoch_Time:23143.0min: [2024-01-06 23:04:58,458][model8_pretrain.py][INFO] Epoch:[0/2](955300/4588595) loss:2.450 lr:0.0000100 epoch_Time:23143.0min: [2024-01-06 23:04:58,458][model8_pretrain.py][INFO] Epoch:[0/2](955300/4588595) loss:2.529 lr:0.0000100 epoch_Time:23143.0min: [2024-01-06 23:04:58,458][model8_pretrain.py][INFO] Epoch:[0/2](955300/4588595) loss:2.813 lr:0.0000100 epoch_Time:23143.0min: [2024-01-06 23:04:58,458][model8_pretrain.py][INFO] Epoch:[0/2](955300/4588595) loss:2.625 lr:0.0000100 epoch_Time:23143.0min: [2024-01-06 23:04:58,458][model8_pretrain.py][INFO] Epoch:[0/2](955300/4588595) loss:2.532 lr:0.0000100 epoch_Time:23143.0min: [2024-01-06 23:05:35,387][model8_pretrain.py][INFO] Epoch:[0/2](955400/4588595) loss:3.211 lr:0.0000100 epoch_Time:23143.0min: [2024-01-06 23:05:35,387][model8_pretrain.py][INFO] Epoch:[0/2](955400/4588595) loss:2.871 lr:0.0000100 epoch_Time:23143.0min: [2024-01-06 23:05:35,387][model8_pretrain.py][INFO] Epoch:[0/2](955400/4588595) loss:2.655 lr:0.0000100 epoch_Time:23143.0min: [2024-01-06 23:05:35,387][model8_pretrain.py][INFO] Epoch:[0/2](955400/4588595) loss:2.880 lr:0.0000100 epoch_Time:23143.0min: [2024-01-06 23:05:35,387][model8_pretrain.py][INFO] Epoch:[0/2](955400/4588595) loss:2.653 lr:0.0000100 epoch_Time:23143.0min: [2024-01-06 23:05:35,387][model8_pretrain.py][INFO] Epoch:[0/2](955400/4588595) loss:3.277 lr:0.0000100 epoch_Time:23143.0min: [2024-01-06 23:05:35,387][model8_pretrain.py][INFO] Epoch:[0/2](955400/4588595) loss:2.499 lr:0.0000100 epoch_Time:23143.0min: [2024-01-06 23:05:35,387][model8_pretrain.py][INFO] Epoch:[0/2](955400/4588595) loss:2.618 lr:0.0000100 epoch_Time:23143.0min: [2024-01-06 23:06:22,491][model8_pretrain.py][INFO] Epoch:[0/2](955500/4588595) loss:2.821 lr:0.0000100 
epoch_Time:23143.0min: [2024-01-06 23:06:22,491][model8_pretrain.py][INFO] Epoch:[0/2](955500/4588595) loss:2.657 lr:0.0000100 epoch_Time:23143.0min: [2024-01-06 23:06:22,491][model8_pretrain.py][INFO] Epoch:[0/2](955500/4588595) loss:3.087 lr:0.0000100 epoch_Time:23143.0min: [2024-01-06 23:06:22,491][model8_pretrain.py][INFO] Epoch:[0/2](955500/4588595) loss:3.189 lr:0.0000100 epoch_Time:23143.0min: [2024-01-06 23:06:22,491][model8_pretrain.py][INFO] Epoch:[0/2](955500/4588595) loss:2.708 lr:0.0000100 epoch_Time:23143.0min: [2024-01-06 23:06:22,491][model8_pretrain.py][INFO] Epoch:[0/2](955500/4588595) loss:2.995 lr:0.0000100 epoch_Time:23143.0min: [2024-01-06 23:06:22,491][model8_pretrain.py][INFO] Epoch:[0/2](955500/4588595) loss:2.683 lr:0.0000100 epoch_Time:23143.0min: [2024-01-06 23:06:22,491][model8_pretrain.py][INFO] Epoch:[0/2](955500/4588595) loss:3.049 lr:0.0000100 epoch_Time:23143.0min: [2024-01-06 23:07:02,940][model8_pretrain.py][INFO] Epoch:[0/2](955600/4588595) loss:2.753 lr:0.0000100 epoch_Time:23142.0min: [2024-01-06 23:07:02,941][model8_pretrain.py][INFO] Epoch:[0/2](955600/4588595) loss:2.617 lr:0.0000100 epoch_Time:23142.0min: [2024-01-06 23:07:02,941][model8_pretrain.py][INFO] Epoch:[0/2](955600/4588595) loss:2.539 lr:0.0000100 epoch_Time:23142.0min: [2024-01-06 23:07:02,941][model8_pretrain.py][INFO] Epoch:[0/2](955600/4588595) loss:3.270 lr:0.0000100 epoch_Time:23142.0min: [2024-01-06 23:07:02,941][model8_pretrain.py][INFO] Epoch:[0/2](955600/4588595) loss:3.205 lr:0.0000100 epoch_Time:23142.0min: [2024-01-06 23:07:02,941][model8_pretrain.py][INFO] Epoch:[0/2](955600/4588595) loss:2.645 lr:0.0000100 epoch_Time:23142.0min: [2024-01-06 23:07:02,941][model8_pretrain.py][INFO] Epoch:[0/2](955600/4588595) loss:2.838 lr:0.0000100 epoch_Time:23142.0min: [2024-01-06 23:07:02,941][model8_pretrain.py][INFO] Epoch:[0/2](955600/4588595) loss:2.265 lr:0.0000100 epoch_Time:23142.0min: [2024-01-06 23:07:39,875][model8_pretrain.py][INFO] 
Epoch:[0/2](955700/4588595) loss:2.724 lr:0.0000100 epoch_Time:23142.0min: [2024-01-06 23:07:39,875][model8_pretrain.py][INFO] Epoch:[0/2](955700/4588595) loss:2.492 lr:0.0000100 epoch_Time:23142.0min: [2024-01-06 23:07:39,875][model8_pretrain.py][INFO] Epoch:[0/2](955700/4588595) loss:2.612 lr:0.0000100 epoch_Time:23142.0min: [2024-01-06 23:07:39,875][model8_pretrain.py][INFO] Epoch:[0/2](955700/4588595) loss:2.542 lr:0.0000100 epoch_Time:23142.0min: [2024-01-06 23:07:39,876][model8_pretrain.py][INFO] Epoch:[0/2](955700/4588595) loss:3.267 lr:0.0000100 epoch_Time:23142.0min: [2024-01-06 23:07:39,876][model8_pretrain.py][INFO] Epoch:[0/2](955700/4588595) loss:2.979 lr:0.0000100 epoch_Time:23142.0min: [2024-01-06 23:07:39,876][model8_pretrain.py][INFO] Epoch:[0/2](955700/4588595) loss:3.170 lr:0.0000100 epoch_Time:23142.0min: [2024-01-06 23:07:39,876][model8_pretrain.py][INFO] Epoch:[0/2](955700/4588595) loss:2.366 lr:0.0000100 epoch_Time:23142.0min: [2024-01-06 23:08:16,817][model8_pretrain.py][INFO] Epoch:[0/2](955800/4588595) loss:2.700 lr:0.0000100 epoch_Time:23141.0min: [2024-01-06 23:08:16,817][model8_pretrain.py][INFO] Epoch:[0/2](955800/4588595) loss:2.633 lr:0.0000100 epoch_Time:23141.0min: [2024-01-06 23:08:16,817][model8_pretrain.py][INFO] Epoch:[0/2](955800/4588595) loss:2.179 lr:0.0000100 epoch_Time:23141.0min: [2024-01-06 23:08:16,817][model8_pretrain.py][INFO] Epoch:[0/2](955800/4588595) loss:2.234 lr:0.0000100 epoch_Time:23141.0min: [2024-01-06 23:08:16,817][model8_pretrain.py][INFO] Epoch:[0/2](955800/4588595) loss:3.419 lr:0.0000100 epoch_Time:23141.0min: [2024-01-06 23:08:16,817][model8_pretrain.py][INFO] Epoch:[0/2](955800/4588595) loss:2.688 lr:0.0000100 epoch_Time:23141.0min: [2024-01-06 23:08:16,817][model8_pretrain.py][INFO] Epoch:[0/2](955800/4588595) loss:2.951 lr:0.0000100 epoch_Time:23141.0min: [2024-01-06 23:08:16,817][model8_pretrain.py][INFO] Epoch:[0/2](955800/4588595) loss:2.899 lr:0.0000100 epoch_Time:23141.0min: [2024-01-06 
23:08:53,760][model8_pretrain.py][INFO] Epoch:[0/2](955900/4588595) loss:2.566 lr:0.0000100 epoch_Time:23140.0min: [2024-01-06 23:08:53,760][model8_pretrain.py][INFO] Epoch:[0/2](955900/4588595) loss:2.690 lr:0.0000100 epoch_Time:23140.0min: [2024-01-06 23:08:53,760][model8_pretrain.py][INFO] Epoch:[0/2](955900/4588595) loss:2.772 lr:0.0000100 epoch_Time:23140.0min: [2024-01-06 23:08:53,760][model8_pretrain.py][INFO] Epoch:[0/2](955900/4588595) loss:2.632 lr:0.0000100 epoch_Time:23140.0min: [2024-01-06 23:08:53,760][model8_pretrain.py][INFO] Epoch:[0/2](955900/4588595) loss:2.745 lr:0.0000100 epoch_Time:23140.0min: [2024-01-06 23:08:53,760][model8_pretrain.py][INFO] Epoch:[0/2](955900/4588595) loss:3.097 lr:0.0000100 epoch_Time:23140.0min: [2024-01-06 23:08:53,760][model8_pretrain.py][INFO] Epoch:[0/2](955900/4588595) loss:2.833 lr:0.0000100 epoch_Time:23140.0min: [2024-01-06 23:08:53,761][model8_pretrain.py][INFO] Epoch:[0/2](955900/4588595) loss:2.919 lr:0.0000100 epoch_Time:23140.0min: [2024-01-06 23:09:30,712][model8_pretrain.py][INFO] Epoch:[0/2](956000/4588595) loss:2.981 lr:0.0000100 epoch_Time:23140.0min: [2024-01-06 23:09:30,712][model8_pretrain.py][INFO] Epoch:[0/2](956000/4588595) loss:2.811 lr:0.0000100 epoch_Time:23140.0min: [2024-01-06 23:09:30,712][model8_pretrain.py][INFO] Epoch:[0/2](956000/4588595) loss:2.746 lr:0.0000100 epoch_Time:23140.0min: [2024-01-06 23:09:30,712][model8_pretrain.py][INFO] Epoch:[0/2](956000/4588595) loss:3.030 lr:0.0000100 epoch_Time:23140.0min: [2024-01-06 23:09:30,712][model8_pretrain.py][INFO] Epoch:[0/2](956000/4588595) loss:2.924 lr:0.0000100 epoch_Time:23140.0min: [2024-01-06 23:09:30,712][model8_pretrain.py][INFO] Epoch:[0/2](956000/4588595) loss:2.722 lr:0.0000100 epoch_Time:23140.0min: [2024-01-06 23:09:30,712][model8_pretrain.py][INFO] Epoch:[0/2](956000/4588595) loss:2.159 lr:0.0000100 epoch_Time:23140.0min: [2024-01-06 23:09:30,713][model8_pretrain.py][INFO] Epoch:[0/2](956000/4588595) loss:2.619 lr:0.0000100 
epoch_Time:23140.0min: [2024-01-06 23:10:07,665][model8_pretrain.py][INFO] Epoch:[0/2](956100/4588595) loss:2.606 lr:0.0000100 epoch_Time:23139.0min: [2024-01-06 23:10:07,665][model8_pretrain.py][INFO] Epoch:[0/2](956100/4588595) loss:2.943 lr:0.0000100 epoch_Time:23139.0min: [2024-01-06 23:10:07,665][model8_pretrain.py][INFO] Epoch:[0/2](956100/4588595) loss:2.810 lr:0.0000100 epoch_Time:23139.0min: [2024-01-06 23:10:07,665][model8_pretrain.py][INFO] Epoch:[0/2](956100/4588595) loss:2.871 lr:0.0000100 epoch_Time:23139.0min: [2024-01-06 23:10:07,665][model8_pretrain.py][INFO] Epoch:[0/2](956100/4588595) loss:2.993 lr:0.0000100 epoch_Time:23139.0min: [2024-01-06 23:10:07,665][model8_pretrain.py][INFO] Epoch:[0/2](956100/4588595) loss:3.416 lr:0.0000100 epoch_Time:23139.0min: [2024-01-06 23:10:07,665][model8_pretrain.py][INFO] Epoch:[0/2](956100/4588595) loss:2.884 lr:0.0000100 epoch_Time:23139.0min: [2024-01-06 23:10:07,665][model8_pretrain.py][INFO] Epoch:[0/2](956100/4588595) loss:2.878 lr:0.0000100 epoch_Time:23139.0min: [2024-01-06 23:10:44,606][model8_pretrain.py][INFO] Epoch:[0/2](956200/4588595) loss:2.573 lr:0.0000100 epoch_Time:23139.0min: [2024-01-06 23:10:44,606][model8_pretrain.py][INFO] Epoch:[0/2](956200/4588595) loss:2.921 lr:0.0000100 epoch_Time:23139.0min: [2024-01-06 23:10:44,606][model8_pretrain.py][INFO] Epoch:[0/2](956200/4588595) loss:2.345 lr:0.0000100 epoch_Time:23139.0min: [2024-01-06 23:10:44,606][model8_pretrain.py][INFO] Epoch:[0/2](956200/4588595) loss:3.092 lr:0.0000100 epoch_Time:23139.0min: [2024-01-06 23:10:44,606][model8_pretrain.py][INFO] Epoch:[0/2](956200/4588595) loss:2.741 lr:0.0000100 epoch_Time:23139.0min: [2024-01-06 23:10:44,606][model8_pretrain.py][INFO] Epoch:[0/2](956200/4588595) loss:3.290 lr:0.0000100 epoch_Time:23139.0min: [2024-01-06 23:10:44,607][model8_pretrain.py][INFO] Epoch:[0/2](956200/4588595) loss:2.420 lr:0.0000100 epoch_Time:23139.0min: [2024-01-06 23:10:44,607][model8_pretrain.py][INFO] 
Epoch:[0/2](956200/4588595) loss:2.065 lr:0.0000100 epoch_Time:23139.0min: [2024-01-06 23:11:30,180][model8_pretrain.py][INFO] Epoch:[0/2](956300/4588595) loss:3.240 lr:0.0000100 epoch_Time:23138.0min: [2024-01-06 23:11:30,180][model8_pretrain.py][INFO] Epoch:[0/2](956300/4588595) loss:3.306 lr:0.0000100 epoch_Time:23138.0min: [2024-01-06 23:11:30,180][model8_pretrain.py][INFO] Epoch:[0/2](956300/4588595) loss:3.008 lr:0.0000100 epoch_Time:23138.0min: [2024-01-06 23:11:30,180][model8_pretrain.py][INFO] Epoch:[0/2](956300/4588595) loss:2.878 lr:0.0000100 epoch_Time:23138.0min: [2024-01-06 23:11:30,180][model8_pretrain.py][INFO] Epoch:[0/2](956300/4588595) loss:2.588 lr:0.0000100 epoch_Time:23138.0min: [2024-01-06 23:11:30,180][model8_pretrain.py][INFO] Epoch:[0/2](956300/4588595) loss:1.793 lr:0.0000100 epoch_Time:23138.0min: [2024-01-06 23:11:30,180][model8_pretrain.py][INFO] Epoch:[0/2](956300/4588595) loss:3.264 lr:0.0000100 epoch_Time:23138.0min: [2024-01-06 23:11:30,185][model8_pretrain.py][INFO] Epoch:[0/2](956300/4588595) loss:2.728 lr:0.0000100 epoch_Time:23138.0min: [2024-01-06 23:12:12,319][model8_pretrain.py][INFO] Epoch:[0/2](956400/4588595) loss:2.735 lr:0.0000100 epoch_Time:23138.0min: [2024-01-06 23:12:12,319][model8_pretrain.py][INFO] Epoch:[0/2](956400/4588595) loss:3.132 lr:0.0000100 epoch_Time:23138.0min: [2024-01-06 23:12:12,319][model8_pretrain.py][INFO] Epoch:[0/2](956400/4588595) loss:2.871 lr:0.0000100 epoch_Time:23138.0min: [2024-01-06 23:12:12,319][model8_pretrain.py][INFO] Epoch:[0/2](956400/4588595) loss:3.490 lr:0.0000100 epoch_Time:23138.0min: [2024-01-06 23:12:12,319][model8_pretrain.py][INFO] Epoch:[0/2](956400/4588595) loss:2.847 lr:0.0000100 epoch_Time:23138.0min: [2024-01-06 23:12:12,319][model8_pretrain.py][INFO] Epoch:[0/2](956400/4588595) loss:2.777 lr:0.0000100 epoch_Time:23138.0min: [2024-01-06 23:12:12,319][model8_pretrain.py][INFO] Epoch:[0/2](956400/4588595) loss:2.641 lr:0.0000100 epoch_Time:23138.0min: [2024-01-06 
23:12:12,319][model8_pretrain.py][INFO] Epoch:[0/2](956400/4588595) loss:2.076 lr:0.0000100 epoch_Time:23138.0min: [2024-01-06 23:12:49,260][model8_pretrain.py][INFO] Epoch:[0/2](956500/4588595) loss:3.045 lr:0.0000100 epoch_Time:23136.0min: [2024-01-06 23:12:49,260][model8_pretrain.py][INFO] Epoch:[0/2](956500/4588595) loss:3.294 lr:0.0000100 epoch_Time:23136.0min: [2024-01-06 23:12:49,260][model8_pretrain.py][INFO] Epoch:[0/2](956500/4588595) loss:2.890 lr:0.0000100 epoch_Time:23136.0min: [2024-01-06 23:12:49,260][model8_pretrain.py][INFO] Epoch:[0/2](956500/4588595) loss:2.879 lr:0.0000100 epoch_Time:23136.0min: [2024-01-06 23:12:49,260][model8_pretrain.py][INFO] Epoch:[0/2](956500/4588595) loss:2.849 lr:0.0000100 epoch_Time:23136.0min: [2024-01-06 23:12:49,260][model8_pretrain.py][INFO] Epoch:[0/2](956500/4588595) loss:2.701 lr:0.0000100 epoch_Time:23136.0min: [2024-01-06 23:12:49,260][model8_pretrain.py][INFO] Epoch:[0/2](956500/4588595) loss:2.651 lr:0.0000100 epoch_Time:23136.0min: [2024-01-06 23:12:49,260][model8_pretrain.py][INFO] Epoch:[0/2](956500/4588595) loss:2.428 lr:0.0000100 epoch_Time:23136.0min: [2024-01-06 23:13:26,179][model8_pretrain.py][INFO] Epoch:[0/2](956600/4588595) loss:2.537 lr:0.0000100 epoch_Time:23136.0min: [2024-01-06 23:13:26,179][model8_pretrain.py][INFO] Epoch:[0/2](956600/4588595) loss:3.391 lr:0.0000100 epoch_Time:23136.0min: [2024-01-06 23:13:26,179][model8_pretrain.py][INFO] Epoch:[0/2](956600/4588595) loss:2.256 lr:0.0000100 epoch_Time:23136.0min: [2024-01-06 23:13:26,179][model8_pretrain.py][INFO] Epoch:[0/2](956600/4588595) loss:2.555 lr:0.0000100 epoch_Time:23136.0min: [2024-01-06 23:13:26,179][model8_pretrain.py][INFO] Epoch:[0/2](956600/4588595) loss:2.701 lr:0.0000100 epoch_Time:23136.0min: [2024-01-06 23:13:26,179][model8_pretrain.py][INFO] Epoch:[0/2](956600/4588595) loss:2.963 lr:0.0000100 epoch_Time:23136.0min: [2024-01-06 23:13:26,179][model8_pretrain.py][INFO] Epoch:[0/2](956600/4588595) loss:2.509 lr:0.0000100 
epoch_Time:23136.0min: [2024-01-06 23:13:26,180][model8_pretrain.py][INFO] Epoch:[0/2](956600/4588595) loss:2.531 lr:0.0000100 epoch_Time:23136.0min: [2024-01-06 23:14:03,124][model8_pretrain.py][INFO] Epoch:[0/2](956700/4588595) loss:2.333 lr:0.0000100 epoch_Time:23135.0min: [2024-01-06 23:14:03,124][model8_pretrain.py][INFO] Epoch:[0/2](956700/4588595) loss:2.566 lr:0.0000100 epoch_Time:23135.0min: [2024-01-06 23:14:03,124][model8_pretrain.py][INFO] Epoch:[0/2](956700/4588595) loss:2.312 lr:0.0000100 epoch_Time:23135.0min: [2024-01-06 23:14:03,124][model8_pretrain.py][INFO] Epoch:[0/2](956700/4588595) loss:3.192 lr:0.0000100 epoch_Time:23135.0min: [2024-01-06 23:14:03,124][model8_pretrain.py][INFO] Epoch:[0/2](956700/4588595) loss:2.895 lr:0.0000100 epoch_Time:23135.0min: [2024-01-06 23:14:03,124][model8_pretrain.py][INFO] Epoch:[0/2](956700/4588595) loss:2.474 lr:0.0000100 epoch_Time:23135.0min: [2024-01-06 23:14:03,124][model8_pretrain.py][INFO] Epoch:[0/2](956700/4588595) loss:2.702 lr:0.0000100 epoch_Time:23135.0min: [2024-01-06 23:14:03,124][model8_pretrain.py][INFO] Epoch:[0/2](956700/4588595) loss:2.337 lr:0.0000100 epoch_Time:23135.0min: [2024-01-06 23:14:40,076][model8_pretrain.py][INFO] Epoch:[0/2](956800/4588595) loss:2.504 lr:0.0000100 epoch_Time:23135.0min: [2024-01-06 23:14:40,076][model8_pretrain.py][INFO] Epoch:[0/2](956800/4588595) loss:3.055 lr:0.0000100 epoch_Time:23135.0min: [2024-01-06 23:14:40,076][model8_pretrain.py][INFO] Epoch:[0/2](956800/4588595) loss:2.653 lr:0.0000100 epoch_Time:23135.0min: [2024-01-06 23:14:40,076][model8_pretrain.py][INFO] Epoch:[0/2](956800/4588595) loss:3.022 lr:0.0000100 epoch_Time:23135.0min: [2024-01-06 23:14:40,076][model8_pretrain.py][INFO] Epoch:[0/2](956800/4588595) loss:2.250 lr:0.0000100 epoch_Time:23135.0min: [2024-01-06 23:14:40,077][model8_pretrain.py][INFO] Epoch:[0/2](956800/4588595) loss:3.298 lr:0.0000100 epoch_Time:23135.0min: [2024-01-06 23:14:40,077][model8_pretrain.py][INFO] 
Epoch:[0/2](956800/4588595) loss:2.934 lr:0.0000100 epoch_Time:23135.0min: [2024-01-06 23:14:40,077][model8_pretrain.py][INFO] Epoch:[0/2](956800/4588595) loss:2.706 lr:0.0000100 epoch_Time:23135.0min: [2024-01-06 23:15:17,018][model8_pretrain.py][INFO] Epoch:[0/2](956900/4588595) loss:2.908 lr:0.0000100 epoch_Time:23134.0min: [2024-01-06 23:15:17,018][model8_pretrain.py][INFO] Epoch:[0/2](956900/4588595) loss:3.091 lr:0.0000100 epoch_Time:23134.0min: [2024-01-06 23:15:17,018][model8_pretrain.py][INFO] Epoch:[0/2](956900/4588595) loss:2.821 lr:0.0000100 epoch_Time:23134.0min: [2024-01-06 23:15:17,018][model8_pretrain.py][INFO] Epoch:[0/2](956900/4588595) loss:2.669 lr:0.0000100 epoch_Time:23134.0min: [2024-01-06 23:15:17,018][model8_pretrain.py][INFO] Epoch:[0/2](956900/4588595) loss:3.271 lr:0.0000100 epoch_Time:23134.0min: [2024-01-06 23:15:17,018][model8_pretrain.py][INFO] Epoch:[0/2](956900/4588595) loss:2.903 lr:0.0000100 epoch_Time:23134.0min: [2024-01-06 23:15:17,018][model8_pretrain.py][INFO] Epoch:[0/2](956900/4588595) loss:2.416 lr:0.0000100 epoch_Time:23134.0min: [2024-01-06 23:15:17,018][model8_pretrain.py][INFO] Epoch:[0/2](956900/4588595) loss:2.424 lr:0.0000100 epoch_Time:23134.0min: [2024-01-06 23:15:53,969][model8_pretrain.py][INFO] Epoch:[0/2](957000/4588595) loss:2.853 lr:0.0000100 epoch_Time:23133.0min: [2024-01-06 23:15:53,969][model8_pretrain.py][INFO] Epoch:[0/2](957000/4588595) loss:2.938 lr:0.0000100 epoch_Time:23133.0min: [2024-01-06 23:15:53,969][model8_pretrain.py][INFO] Epoch:[0/2](957000/4588595) loss:2.694 lr:0.0000100 epoch_Time:23133.0min: [2024-01-06 23:15:53,969][model8_pretrain.py][INFO] Epoch:[0/2](957000/4588595) loss:2.723 lr:0.0000100 epoch_Time:23133.0min: [2024-01-06 23:15:53,970][model8_pretrain.py][INFO] Epoch:[0/2](957000/4588595) loss:3.048 lr:0.0000100 epoch_Time:23133.0min: [2024-01-06 23:15:53,970][model8_pretrain.py][INFO] Epoch:[0/2](957000/4588595) loss:2.861 lr:0.0000100 epoch_Time:23133.0min: [2024-01-06 
23:15:53,970][model8_pretrain.py][INFO] Epoch:[0/2](957000/4588595) loss:3.496 lr:0.0000100 epoch_Time:23133.0min: [2024-01-06 23:15:53,970][model8_pretrain.py][INFO] Epoch:[0/2](957000/4588595) loss:2.820 lr:0.0000100 epoch_Time:23133.0min: [2024-01-06 23:16:37,826][model8_pretrain.py][INFO] Epoch:[0/2](957100/4588595) loss:3.038 lr:0.0000100 epoch_Time:23133.0min: [2024-01-06 23:16:37,826][model8_pretrain.py][INFO] Epoch:[0/2](957100/4588595) loss:2.535 lr:0.0000100 epoch_Time:23133.0min: [2024-01-06 23:16:37,826][model8_pretrain.py][INFO] Epoch:[0/2](957100/4588595) loss:2.926 lr:0.0000100 epoch_Time:23133.0min: [2024-01-06 23:16:37,826][model8_pretrain.py][INFO] Epoch:[0/2](957100/4588595) loss:2.512 lr:0.0000100 epoch_Time:23133.0min: [2024-01-06 23:16:37,826][model8_pretrain.py][INFO] Epoch:[0/2](957100/4588595) loss:2.986 lr:0.0000100 epoch_Time:23133.0min: [2024-01-06 23:16:37,826][model8_pretrain.py][INFO] Epoch:[0/2](957100/4588595) loss:2.087 lr:0.0000100 epoch_Time:23133.0min: [2024-01-06 23:16:37,826][model8_pretrain.py][INFO] Epoch:[0/2](957100/4588595) loss:2.520 lr:0.0000100 epoch_Time:23133.0min: [2024-01-06 23:16:37,826][model8_pretrain.py][INFO] Epoch:[0/2](957100/4588595) loss:2.762 lr:0.0000100 epoch_Time:23133.0min: [2024-01-06 23:17:21,667][model8_pretrain.py][INFO] Epoch:[0/2](957200/4588595) loss:2.733 lr:0.0000100 epoch_Time:23133.0min: [2024-01-06 23:17:21,668][model8_pretrain.py][INFO] Epoch:[0/2](957200/4588595) loss:3.163 lr:0.0000100 epoch_Time:23133.0min: [2024-01-06 23:17:21,668][model8_pretrain.py][INFO] Epoch:[0/2](957200/4588595) loss:2.912 lr:0.0000100 epoch_Time:23133.0min: [2024-01-06 23:17:21,668][model8_pretrain.py][INFO] Epoch:[0/2](957200/4588595) loss:3.194 lr:0.0000100 epoch_Time:23133.0min: [2024-01-06 23:17:21,668][model8_pretrain.py][INFO] Epoch:[0/2](957200/4588595) loss:2.867 lr:0.0000100 epoch_Time:23133.0min: [2024-01-06 23:17:21,668][model8_pretrain.py][INFO] Epoch:[0/2](957200/4588595) loss:3.157 lr:0.0000100 
epoch_Time:23133.0min: [2024-01-06 23:17:21,668][model8_pretrain.py][INFO] Epoch:[0/2](957200/4588595) loss:2.258 lr:0.0000100 epoch_Time:23133.0min: [2024-01-06 23:17:21,668][model8_pretrain.py][INFO] Epoch:[0/2](957200/4588595) loss:2.971 lr:0.0000100 epoch_Time:23133.0min: [2024-01-06 23:17:58,606][model8_pretrain.py][INFO] Epoch:[0/2](957300/4588595) loss:3.246 lr:0.0000100 epoch_Time:23132.0min: [2024-01-06 23:17:58,606][model8_pretrain.py][INFO] Epoch:[0/2](957300/4588595) loss:2.404 lr:0.0000100 epoch_Time:23132.0min: [2024-01-06 23:17:58,606][model8_pretrain.py][INFO] Epoch:[0/2](957300/4588595) loss:2.717 lr:0.0000100 epoch_Time:23132.0min: [2024-01-06 23:17:58,606][model8_pretrain.py][INFO] Epoch:[0/2](957300/4588595) loss:3.121 lr:0.0000100 epoch_Time:23132.0min: [2024-01-06 23:17:58,606][model8_pretrain.py][INFO] Epoch:[0/2](957300/4588595) loss:2.862 lr:0.0000100 epoch_Time:23132.0min: [2024-01-06 23:17:58,606][model8_pretrain.py][INFO] Epoch:[0/2](957300/4588595) loss:2.848 lr:0.0000100 epoch_Time:23132.0min: [2024-01-06 23:17:58,606][model8_pretrain.py][INFO] Epoch:[0/2](957300/4588595) loss:2.844 lr:0.0000100 epoch_Time:23132.0min: [2024-01-06 23:17:58,606][model8_pretrain.py][INFO] Epoch:[0/2](957300/4588595) loss:2.519 lr:0.0000100 epoch_Time:23132.0min: [2024-01-06 23:18:35,555][model8_pretrain.py][INFO] Epoch:[0/2](957400/4588595) loss:2.779 lr:0.0000100 epoch_Time:23132.0min: [2024-01-06 23:18:35,555][model8_pretrain.py][INFO] Epoch:[0/2](957400/4588595) loss:2.736 lr:0.0000100 epoch_Time:23132.0min: [2024-01-06 23:18:35,555][model8_pretrain.py][INFO] Epoch:[0/2](957400/4588595) loss:2.985 lr:0.0000100 epoch_Time:23132.0min: [2024-01-06 23:18:35,555][model8_pretrain.py][INFO] Epoch:[0/2](957400/4588595) loss:3.471 lr:0.0000100 epoch_Time:23132.0min: [2024-01-06 23:18:35,555][model8_pretrain.py][INFO] Epoch:[0/2](957400/4588595) loss:2.650 lr:0.0000100 epoch_Time:23132.0min: [2024-01-06 23:18:35,555][model8_pretrain.py][INFO] 
Epoch:[0/2](957400/4588595) loss:2.823 lr:0.0000100 epoch_Time:23132.0min: [2024-01-06 23:18:35,555][model8_pretrain.py][INFO] Epoch:[0/2](957400/4588595) loss:2.691 lr:0.0000100 epoch_Time:23132.0min: [2024-01-06 23:18:35,556][model8_pretrain.py][INFO] Epoch:[0/2](957400/4588595) loss:2.923 lr:0.0000100 epoch_Time:23132.0min: [2024-01-06 23:19:12,509][model8_pretrain.py][INFO] Epoch:[0/2](957500/4588595) loss:2.407 lr:0.0000100 epoch_Time:23130.0min: [2024-01-06 23:19:12,509][model8_pretrain.py][INFO] Epoch:[0/2](957500/4588595) loss:2.236 lr:0.0000100 epoch_Time:23130.0min: [2024-01-06 23:19:12,509][model8_pretrain.py][INFO] Epoch:[0/2](957500/4588595) loss:2.505 lr:0.0000100 epoch_Time:23130.0min: [2024-01-06 23:19:12,509][model8_pretrain.py][INFO] Epoch:[0/2](957500/4588595) loss:2.878 lr:0.0000100 epoch_Time:23130.0min: [2024-01-06 23:19:12,509][model8_pretrain.py][INFO] Epoch:[0/2](957500/4588595) loss:3.025 lr:0.0000100 epoch_Time:23130.0min: [2024-01-06 23:19:12,509][model8_pretrain.py][INFO] Epoch:[0/2](957500/4588595) loss:2.870 lr:0.0000100 epoch_Time:23130.0min: [2024-01-06 23:19:12,509][model8_pretrain.py][INFO] Epoch:[0/2](957500/4588595) loss:2.834 lr:0.0000100 epoch_Time:23130.0min: [2024-01-06 23:19:12,509][model8_pretrain.py][INFO] Epoch:[0/2](957500/4588595) loss:2.563 lr:0.0000100 epoch_Time:23130.0min: [2024-01-06 23:19:49,450][model8_pretrain.py][INFO] Epoch:[0/2](957600/4588595) loss:2.719 lr:0.0000100 epoch_Time:23129.0min: [2024-01-06 23:19:49,450][model8_pretrain.py][INFO] Epoch:[0/2](957600/4588595) loss:2.798 lr:0.0000100 epoch_Time:23129.0min: [2024-01-06 23:19:49,450][model8_pretrain.py][INFO] Epoch:[0/2](957600/4588595) loss:2.441 lr:0.0000100 epoch_Time:23129.0min: [2024-01-06 23:19:49,450][model8_pretrain.py][INFO] Epoch:[0/2](957600/4588595) loss:2.962 lr:0.0000100 epoch_Time:23129.0min: [2024-01-06 23:19:49,450][model8_pretrain.py][INFO] Epoch:[0/2](957600/4588595) loss:2.512 lr:0.0000100 epoch_Time:23129.0min: [2024-01-06 
23:19:49,450][model8_pretrain.py][INFO] Epoch:[0/2](957600/4588595) loss:2.720 lr:0.0000100 epoch_Time:23129.0min: [2024-01-06 23:19:49,450][model8_pretrain.py][INFO] Epoch:[0/2](957600/4588595) loss:2.423 lr:0.0000100 epoch_Time:23129.0min: [2024-01-06 23:19:49,450][model8_pretrain.py][INFO] Epoch:[0/2](957600/4588595) loss:2.762 lr:0.0000100 epoch_Time:23129.0min: [2024-01-06 23:20:26,394][model8_pretrain.py][INFO] Epoch:[0/2](957700/4588595) loss:3.051 lr:0.0000100 epoch_Time:23129.0min: [2024-01-06 23:20:26,394][model8_pretrain.py][INFO] Epoch:[0/2](957700/4588595) loss:2.217 lr:0.0000100 epoch_Time:23129.0min: [2024-01-06 23:20:26,394][model8_pretrain.py][INFO] Epoch:[0/2](957700/4588595) loss:2.856 lr:0.0000100 epoch_Time:23129.0min: [2024-01-06 23:20:26,394][model8_pretrain.py][INFO] Epoch:[0/2](957700/4588595) loss:2.923 lr:0.0000100 epoch_Time:23129.0min: [2024-01-06 23:20:26,394][model8_pretrain.py][INFO] Epoch:[0/2](957700/4588595) loss:3.196 lr:0.0000100 epoch_Time:23129.0min: [2024-01-06 23:20:26,394][model8_pretrain.py][INFO] Epoch:[0/2](957700/4588595) loss:2.975 lr:0.0000100 epoch_Time:23129.0min: [2024-01-06 23:20:26,394][model8_pretrain.py][INFO] Epoch:[0/2](957700/4588595) loss:2.637 lr:0.0000100 epoch_Time:23129.0min: [2024-01-06 23:20:26,394][model8_pretrain.py][INFO] Epoch:[0/2](957700/4588595) loss:2.881 lr:0.0000100 epoch_Time:23129.0min: [2024-01-06 23:21:03,341][model8_pretrain.py][INFO] Epoch:[0/2](957800/4588595) loss:1.916 lr:0.0000100 epoch_Time:23128.0min: [2024-01-06 23:21:03,341][model8_pretrain.py][INFO] Epoch:[0/2](957800/4588595) loss:2.704 lr:0.0000100 epoch_Time:23128.0min: [2024-01-06 23:21:03,341][model8_pretrain.py][INFO] Epoch:[0/2](957800/4588595) loss:2.702 lr:0.0000100 epoch_Time:23128.0min: [2024-01-06 23:21:03,341][model8_pretrain.py][INFO] Epoch:[0/2](957800/4588595) loss:2.922 lr:0.0000100 epoch_Time:23128.0min: [2024-01-06 23:21:03,341][model8_pretrain.py][INFO] Epoch:[0/2](957800/4588595) loss:2.657 lr:0.0000100 
epoch_Time:23128.0min: [2024-01-06 23:21:03,341][model8_pretrain.py][INFO] Epoch:[0/2](957800/4588595) loss:2.894 lr:0.0000100 epoch_Time:23128.0min: [2024-01-06 23:21:03,341][model8_pretrain.py][INFO] Epoch:[0/2](957800/4588595) loss:2.508 lr:0.0000100 epoch_Time:23128.0min: [2024-01-06 23:21:03,341][model8_pretrain.py][INFO] Epoch:[0/2](957800/4588595) loss:2.407 lr:0.0000100 epoch_Time:23128.0min: [2024-01-06 23:21:47,225][model8_pretrain.py][INFO] Epoch:[0/2](957900/4588595) loss:2.577 lr:0.0000100 epoch_Time:23129.0min: [2024-01-06 23:21:47,225][model8_pretrain.py][INFO] Epoch:[0/2](957900/4588595) loss:2.754 lr:0.0000100 epoch_Time:23129.0min: [2024-01-06 23:21:47,225][model8_pretrain.py][INFO] Epoch:[0/2](957900/4588595) loss:2.796 lr:0.0000100 epoch_Time:23129.0min: [2024-01-06 23:21:47,225][model8_pretrain.py][INFO] Epoch:[0/2](957900/4588595) loss:2.883 lr:0.0000100 epoch_Time:23129.0min: [2024-01-06 23:21:47,225][model8_pretrain.py][INFO] Epoch:[0/2](957900/4588595) loss:2.372 lr:0.0000100 epoch_Time:23129.0min: [2024-01-06 23:21:47,225][model8_pretrain.py][INFO] Epoch:[0/2](957900/4588595) loss:2.576 lr:0.0000100 epoch_Time:23129.0min: [2024-01-06 23:21:47,225][model8_pretrain.py][INFO] Epoch:[0/2](957900/4588595) loss:2.665 lr:0.0000100 epoch_Time:23129.0min: [2024-01-06 23:21:47,225][model8_pretrain.py][INFO] Epoch:[0/2](957900/4588595) loss:2.997 lr:0.0000100 epoch_Time:23129.0min: [2024-01-06 23:22:30,833][model8_pretrain.py][INFO] Epoch:[0/2](958000/4588595) loss:3.013 lr:0.0000100 epoch_Time:23128.0min: [2024-01-06 23:22:30,833][model8_pretrain.py][INFO] Epoch:[0/2](958000/4588595) loss:2.602 lr:0.0000100 epoch_Time:23128.0min: [2024-01-06 23:22:30,833][model8_pretrain.py][INFO] Epoch:[0/2](958000/4588595) loss:2.165 lr:0.0000100 epoch_Time:23128.0min: [2024-01-06 23:22:30,833][model8_pretrain.py][INFO] Epoch:[0/2](958000/4588595) loss:1.708 lr:0.0000100 epoch_Time:23128.0min: [2024-01-06 23:22:30,833][model8_pretrain.py][INFO] 
Epoch:[0/2](958000/4588595) loss:2.759 lr:0.0000100 epoch_Time:23128.0min: [2024-01-06 23:22:30,833][model8_pretrain.py][INFO] Epoch:[0/2](958000/4588595) loss:2.535 lr:0.0000100 epoch_Time:23128.0min: [2024-01-06 23:22:30,833][model8_pretrain.py][INFO] Epoch:[0/2](958000/4588595) loss:2.808 lr:0.0000100 epoch_Time:23128.0min: [2024-01-06 23:22:30,833][model8_pretrain.py][INFO] Epoch:[0/2](958000/4588595) loss:2.927 lr:0.0000100 epoch_Time:23128.0min: [2024-01-06 23:23:07,774][model8_pretrain.py][INFO] Epoch:[0/2](958100/4588595) loss:2.364 lr:0.0000100 epoch_Time:23127.0min: [2024-01-06 23:23:07,774][model8_pretrain.py][INFO] Epoch:[0/2](958100/4588595) loss:3.019 lr:0.0000100 epoch_Time:23127.0min: [2024-01-06 23:23:07,774][model8_pretrain.py][INFO] Epoch:[0/2](958100/4588595) loss:2.591 lr:0.0000100 epoch_Time:23127.0min: [2024-01-06 23:23:07,774][model8_pretrain.py][INFO] Epoch:[0/2](958100/4588595) loss:2.477 lr:0.0000100 epoch_Time:23127.0min: [2024-01-06 23:23:07,774][model8_pretrain.py][INFO] Epoch:[0/2](958100/4588595) loss:2.428 lr:0.0000100 epoch_Time:23127.0min: [2024-01-06 23:23:07,774][model8_pretrain.py][INFO] Epoch:[0/2](958100/4588595) loss:2.724 lr:0.0000100 epoch_Time:23127.0min: [2024-01-06 23:23:07,774][model8_pretrain.py][INFO] Epoch:[0/2](958100/4588595) loss:2.445 lr:0.0000100 epoch_Time:23127.0min: [2024-01-06 23:23:07,774][model8_pretrain.py][INFO] Epoch:[0/2](958100/4588595) loss:3.166 lr:0.0000100 epoch_Time:23127.0min: [2024-01-06 23:23:44,721][model8_pretrain.py][INFO] Epoch:[0/2](958200/4588595) loss:3.027 lr:0.0000100 epoch_Time:23127.0min: [2024-01-06 23:23:44,721][model8_pretrain.py][INFO] Epoch:[0/2](958200/4588595) loss:2.664 lr:0.0000100 epoch_Time:23127.0min: [2024-01-06 23:23:44,721][model8_pretrain.py][INFO] Epoch:[0/2](958200/4588595) loss:2.688 lr:0.0000100 epoch_Time:23127.0min: [2024-01-06 23:23:44,721][model8_pretrain.py][INFO] Epoch:[0/2](958200/4588595) loss:3.130 lr:0.0000100 epoch_Time:23127.0min: [2024-01-06 
23:23:44,721][model8_pretrain.py][INFO] Epoch:[0/2](958200/4588595) loss:2.704 lr:0.0000100 epoch_Time:23127.0min: [2024-01-06 23:23:44,721][model8_pretrain.py][INFO] Epoch:[0/2](958200/4588595) loss:2.971 lr:0.0000100 epoch_Time:23127.0min: [2024-01-06 23:23:44,721][model8_pretrain.py][INFO] Epoch:[0/2](958200/4588595) loss:2.950 lr:0.0000100 epoch_Time:23127.0min: [2024-01-06 23:23:44,721][model8_pretrain.py][INFO] Epoch:[0/2](958200/4588595) loss:2.644 lr:0.0000100 epoch_Time:23127.0min: [2024-01-06 23:24:21,667][model8_pretrain.py][INFO] Epoch:[0/2](958300/4588595) loss:2.824 lr:0.0000100 epoch_Time:23126.0min: [2024-01-06 23:24:21,667][model8_pretrain.py][INFO] Epoch:[0/2](958300/4588595) loss:2.625 lr:0.0000100 epoch_Time:23126.0min: [2024-01-06 23:24:21,667][model8_pretrain.py][INFO] Epoch:[0/2](958300/4588595) loss:2.608 lr:0.0000100 epoch_Time:23126.0min: [2024-01-06 23:24:21,667][model8_pretrain.py][INFO] Epoch:[0/2](958300/4588595) loss:3.266 lr:0.0000100 epoch_Time:23126.0min: [2024-01-06 23:24:21,667][model8_pretrain.py][INFO] Epoch:[0/2](958300/4588595) loss:2.559 lr:0.0000100 epoch_Time:23126.0min: [2024-01-06 23:24:21,667][model8_pretrain.py][INFO] Epoch:[0/2](958300/4588595) loss:2.554 lr:0.0000100 epoch_Time:23126.0min: [2024-01-06 23:24:21,667][model8_pretrain.py][INFO] Epoch:[0/2](958300/4588595) loss:2.550 lr:0.0000100 epoch_Time:23126.0min: [2024-01-06 23:24:21,667][model8_pretrain.py][INFO] Epoch:[0/2](958300/4588595) loss:2.771 lr:0.0000100 epoch_Time:23126.0min: [2024-01-06 23:24:58,618][model8_pretrain.py][INFO] Epoch:[0/2](958400/4588595) loss:2.377 lr:0.0000100 epoch_Time:23125.0min: [2024-01-06 23:24:58,618][model8_pretrain.py][INFO] Epoch:[0/2](958400/4588595) loss:3.153 lr:0.0000100 epoch_Time:23125.0min: [2024-01-06 23:24:58,618][model8_pretrain.py][INFO] Epoch:[0/2](958400/4588595) loss:3.022 lr:0.0000100 epoch_Time:23125.0min: [2024-01-06 23:24:58,618][model8_pretrain.py][INFO] Epoch:[0/2](958400/4588595) loss:2.989 lr:0.0000100 
epoch_Time:23125.0min: [2024-01-06 23:24:58,618][model8_pretrain.py][INFO] Epoch:[0/2](958400/4588595) loss:1.886 lr:0.0000100 epoch_Time:23125.0min: [2024-01-06 23:24:58,618][model8_pretrain.py][INFO] Epoch:[0/2](958400/4588595) loss:3.145 lr:0.0000100 epoch_Time:23125.0min: [2024-01-06 23:24:58,618][model8_pretrain.py][INFO] Epoch:[0/2](958400/4588595) loss:2.549 lr:0.0000100 epoch_Time:23125.0min: [2024-01-06 23:24:58,618][model8_pretrain.py][INFO] Epoch:[0/2](958400/4588595) loss:2.772 lr:0.0000100 epoch_Time:23125.0min: [2024-01-06 23:25:35,569][model8_pretrain.py][INFO] Epoch:[0/2](958500/4588595) loss:2.790 lr:0.0000100 epoch_Time:23125.0min: [2024-01-06 23:25:35,569][model8_pretrain.py][INFO] Epoch:[0/2](958500/4588595) loss:2.913 lr:0.0000100 epoch_Time:23125.0min: [2024-01-06 23:25:35,569][model8_pretrain.py][INFO] Epoch:[0/2](958500/4588595) loss:2.781 lr:0.0000100 epoch_Time:23125.0min: [2024-01-06 23:25:35,569][model8_pretrain.py][INFO] Epoch:[0/2](958500/4588595) loss:3.131 lr:0.0000100 epoch_Time:23125.0min: [2024-01-06 23:25:35,569][model8_pretrain.py][INFO] Epoch:[0/2](958500/4588595) loss:2.937 lr:0.0000100 epoch_Time:23125.0min: [2024-01-06 23:25:35,569][model8_pretrain.py][INFO] Epoch:[0/2](958500/4588595) loss:2.538 lr:0.0000100 epoch_Time:23125.0min: [2024-01-06 23:25:35,569][model8_pretrain.py][INFO] Epoch:[0/2](958500/4588595) loss:2.863 lr:0.0000100 epoch_Time:23125.0min: [2024-01-06 23:25:35,569][model8_pretrain.py][INFO] Epoch:[0/2](958500/4588595) loss:2.590 lr:0.0000100 epoch_Time:23125.0min: [2024-01-06 23:26:12,526][model8_pretrain.py][INFO] Epoch:[0/2](958600/4588595) loss:3.134 lr:0.0000100 epoch_Time:23123.0min: [2024-01-06 23:26:12,526][model8_pretrain.py][INFO] Epoch:[0/2](958600/4588595) loss:2.773 lr:0.0000100 epoch_Time:23123.0min: [2024-01-06 23:26:12,526][model8_pretrain.py][INFO] Epoch:[0/2](958600/4588595) loss:3.141 lr:0.0000100 epoch_Time:23123.0min: [2024-01-06 23:26:12,526][model8_pretrain.py][INFO] 
Epoch:[0/2](958600/4588595) loss:3.102 lr:0.0000100 epoch_Time:23123.0min: [2024-01-06 23:26:12,526][model8_pretrain.py][INFO] Epoch:[0/2](958600/4588595) loss:3.032 lr:0.0000100 epoch_Time:23123.0min: [2024-01-06 23:26:12,526][model8_pretrain.py][INFO] Epoch:[0/2](958600/4588595) loss:3.556 lr:0.0000100 epoch_Time:23123.0min: [2024-01-06 23:26:12,526][model8_pretrain.py][INFO] Epoch:[0/2](958600/4588595) loss:2.829 lr:0.0000100 epoch_Time:23123.0min: [2024-01-06 23:26:12,526][model8_pretrain.py][INFO] Epoch:[0/2](958600/4588595) loss:3.194 lr:0.0000100 epoch_Time:23123.0min: [2024-01-06 23:26:54,563][model8_pretrain.py][INFO] Epoch:[0/2](958700/4588595) loss:2.879 lr:0.0000100 epoch_Time:23123.0min: [2024-01-06 23:26:54,564][model8_pretrain.py][INFO] Epoch:[0/2](958700/4588595) loss:2.838 lr:0.0000100 epoch_Time:23123.0min: [2024-01-06 23:26:54,564][model8_pretrain.py][INFO] Epoch:[0/2](958700/4588595) loss:2.964 lr:0.0000100 epoch_Time:23123.0min: [2024-01-06 23:26:54,568][model8_pretrain.py][INFO] Epoch:[0/2](958700/4588595) loss:3.212 lr:0.0000100 epoch_Time:23123.0min: [2024-01-06 23:26:54,568][model8_pretrain.py][INFO] Epoch:[0/2](958700/4588595) loss:2.641 lr:0.0000100 epoch_Time:23123.0min: [2024-01-06 23:26:54,568][model8_pretrain.py][INFO] Epoch:[0/2](958700/4588595) loss:2.526 lr:0.0000100 epoch_Time:23123.0min: [2024-01-06 23:26:54,568][model8_pretrain.py][INFO] Epoch:[0/2](958700/4588595) loss:2.789 lr:0.0000100 epoch_Time:23123.0min: [2024-01-06 23:26:56,309][model8_pretrain.py][INFO] Epoch:[0/2](958700/4588595) loss:3.000 lr:0.0000100 epoch_Time:23123.0min: [2024-01-06 23:27:40,152][model8_pretrain.py][INFO] Epoch:[0/2](958800/4588595) loss:3.276 lr:0.0000100 epoch_Time:23123.0min: [2024-01-06 23:27:40,152][model8_pretrain.py][INFO] Epoch:[0/2](958800/4588595) loss:2.645 lr:0.0000100 epoch_Time:23123.0min: [2024-01-06 23:27:40,152][model8_pretrain.py][INFO] Epoch:[0/2](958800/4588595) loss:2.926 lr:0.0000100 epoch_Time:23123.0min: [2024-01-06 
23:27:40,152][model8_pretrain.py][INFO] Epoch:[0/2](958800/4588595) loss:2.560 lr:0.0000100 epoch_Time:23123.0min: [2024-01-06 23:27:40,152][model8_pretrain.py][INFO] Epoch:[0/2](958800/4588595) loss:3.060 lr:0.0000100 epoch_Time:23123.0min: [2024-01-06 23:27:40,152][model8_pretrain.py][INFO] Epoch:[0/2](958800/4588595) loss:2.183 lr:0.0000100 epoch_Time:23123.0min: [2024-01-06 23:27:40,152][model8_pretrain.py][INFO] Epoch:[0/2](958800/4588595) loss:2.600 lr:0.0000100 epoch_Time:23123.0min: [2024-01-06 23:27:40,152][model8_pretrain.py][INFO] Epoch:[0/2](958800/4588595) loss:3.014 lr:0.0000100 epoch_Time:23123.0min: [2024-01-06 23:28:17,109][model8_pretrain.py][INFO] Epoch:[0/2](958900/4588595) loss:2.790 lr:0.0000100 epoch_Time:23122.0min: [2024-01-06 23:28:17,109][model8_pretrain.py][INFO] Epoch:[0/2](958900/4588595) loss:2.910 lr:0.0000100 epoch_Time:23122.0min: [2024-01-06 23:28:17,109][model8_pretrain.py][INFO] Epoch:[0/2](958900/4588595) loss:2.925 lr:0.0000100 epoch_Time:23122.0min: [2024-01-06 23:28:17,109][model8_pretrain.py][INFO] Epoch:[0/2](958900/4588595) loss:2.997 lr:0.0000100 epoch_Time:23122.0min: [2024-01-06 23:28:17,110][model8_pretrain.py][INFO] Epoch:[0/2](958900/4588595) loss:2.086 lr:0.0000100 epoch_Time:23122.0min: [2024-01-06 23:28:17,110][model8_pretrain.py][INFO] Epoch:[0/2](958900/4588595) loss:2.664 lr:0.0000100 epoch_Time:23122.0min: [2024-01-06 23:28:17,110][model8_pretrain.py][INFO] Epoch:[0/2](958900/4588595) loss:2.220 lr:0.0000100 epoch_Time:23122.0min: [2024-01-06 23:28:17,110][model8_pretrain.py][INFO] Epoch:[0/2](958900/4588595) loss:2.654 lr:0.0000100 epoch_Time:23122.0min: [2024-01-06 23:28:54,072][model8_pretrain.py][INFO] Epoch:[0/2](959000/4588595) loss:3.102 lr:0.0000100 epoch_Time:23121.0min: [2024-01-06 23:28:54,072][model8_pretrain.py][INFO] Epoch:[0/2](959000/4588595) loss:3.507 lr:0.0000100 epoch_Time:23121.0min: [2024-01-06 23:28:54,072][model8_pretrain.py][INFO] Epoch:[0/2](959000/4588595) loss:3.023 lr:0.0000100 
epoch_Time:23121.0min: [2024-01-06 23:28:54,072][model8_pretrain.py][INFO] Epoch:[0/2](959000/4588595) loss:2.883 lr:0.0000100 epoch_Time:23121.0min: [2024-01-06 23:28:54,072][model8_pretrain.py][INFO] Epoch:[0/2](959000/4588595) loss:2.738 lr:0.0000100 epoch_Time:23121.0min: [2024-01-06 23:28:54,072][model8_pretrain.py][INFO] Epoch:[0/2](959000/4588595) loss:3.180 lr:0.0000100 epoch_Time:23121.0min: [2024-01-06 23:28:54,072][model8_pretrain.py][INFO] Epoch:[0/2](959000/4588595) loss:2.718 lr:0.0000100 epoch_Time:23121.0min: [2024-01-06 23:28:54,072][model8_pretrain.py][INFO] Epoch:[0/2](959000/4588595) loss:2.878 lr:0.0000100 epoch_Time:23121.0min: [2024-01-06 23:29:31,033][model8_pretrain.py][INFO] Epoch:[0/2](959100/4588595) loss:2.900 lr:0.0000100 epoch_Time:23121.0min: [2024-01-06 23:29:31,033][model8_pretrain.py][INFO] Epoch:[0/2](959100/4588595) loss:2.635 lr:0.0000100 epoch_Time:23121.0min: [2024-01-06 23:29:31,033][model8_pretrain.py][INFO] Epoch:[0/2](959100/4588595) loss:2.864 lr:0.0000100 epoch_Time:23121.0min: [2024-01-06 23:29:31,033][model8_pretrain.py][INFO] Epoch:[0/2](959100/4588595) loss:2.949 lr:0.0000100 epoch_Time:23121.0min: [2024-01-06 23:29:31,033][model8_pretrain.py][INFO] Epoch:[0/2](959100/4588595) loss:2.864 lr:0.0000100 epoch_Time:23121.0min: [2024-01-06 23:29:31,034][model8_pretrain.py][INFO] Epoch:[0/2](959100/4588595) loss:2.512 lr:0.0000100 epoch_Time:23121.0min: [2024-01-06 23:29:31,034][model8_pretrain.py][INFO] Epoch:[0/2](959100/4588595) loss:2.351 lr:0.0000100 epoch_Time:23121.0min: [2024-01-06 23:29:31,034][model8_pretrain.py][INFO] Epoch:[0/2](959100/4588595) loss:2.541 lr:0.0000100 epoch_Time:23121.0min: [2024-01-06 23:30:08,000][model8_pretrain.py][INFO] Epoch:[0/2](959200/4588595) loss:2.945 lr:0.0000100 epoch_Time:23120.0min: [2024-01-06 23:30:08,000][model8_pretrain.py][INFO] Epoch:[0/2](959200/4588595) loss:2.918 lr:0.0000100 epoch_Time:23120.0min: [2024-01-06 23:30:08,000][model8_pretrain.py][INFO] 
Epoch:[0/2](959200/4588595) loss:2.795 lr:0.0000100 epoch_Time:23120.0min: [2024-01-06 23:30:08,000][model8_pretrain.py][INFO] Epoch:[0/2](959200/4588595) loss:2.084 lr:0.0000100 epoch_Time:23120.0min: [2024-01-06 23:30:08,000][model8_pretrain.py][INFO] Epoch:[0/2](959200/4588595) loss:3.064 lr:0.0000100 epoch_Time:23120.0min: [2024-01-06 23:30:08,000][model8_pretrain.py][INFO] Epoch:[0/2](959200/4588595) loss:2.500 lr:0.0000100 epoch_Time:23120.0min: [2024-01-06 23:30:08,000][model8_pretrain.py][INFO] Epoch:[0/2](959200/4588595) loss:3.004 lr:0.0000100 epoch_Time:23120.0min: [2024-01-06 23:30:08,000][model8_pretrain.py][INFO] Epoch:[0/2](959200/4588595) loss:3.062 lr:0.0000100 epoch_Time:23120.0min: [2024-01-06 23:30:44,949][model8_pretrain.py][INFO] Epoch:[0/2](959300/4588595) loss:2.158 lr:0.0000100 epoch_Time:23120.0min: [2024-01-06 23:30:44,949][model8_pretrain.py][INFO] Epoch:[0/2](959300/4588595) loss:3.371 lr:0.0000100 epoch_Time:23120.0min: [2024-01-06 23:30:44,949][model8_pretrain.py][INFO] Epoch:[0/2](959300/4588595) loss:3.112 lr:0.0000100 epoch_Time:23120.0min: [2024-01-06 23:30:44,949][model8_pretrain.py][INFO] Epoch:[0/2](959300/4588595) loss:2.815 lr:0.0000100 epoch_Time:23120.0min: [2024-01-06 23:30:44,949][model8_pretrain.py][INFO] Epoch:[0/2](959300/4588595) loss:3.071 lr:0.0000100 epoch_Time:23120.0min: [2024-01-06 23:30:44,949][model8_pretrain.py][INFO] Epoch:[0/2](959300/4588595) loss:3.264 lr:0.0000100 epoch_Time:23120.0min: [2024-01-06 23:30:44,949][model8_pretrain.py][INFO] Epoch:[0/2](959300/4588595) loss:2.415 lr:0.0000100 epoch_Time:23120.0min: [2024-01-06 23:30:44,950][model8_pretrain.py][INFO] Epoch:[0/2](959300/4588595) loss:2.915 lr:0.0000100 epoch_Time:23120.0min: [2024-01-06 23:31:21,906][model8_pretrain.py][INFO] Epoch:[0/2](959400/4588595) loss:2.764 lr:0.0000100 epoch_Time:23119.0min: [2024-01-06 23:31:21,906][model8_pretrain.py][INFO] Epoch:[0/2](959400/4588595) loss:2.394 lr:0.0000100 epoch_Time:23119.0min: [2024-01-06 
23:31:21,906][model8_pretrain.py][INFO] Epoch:[0/2](959400/4588595) loss:2.860 lr:0.0000100 epoch_Time:23119.0min: [2024-01-06 23:31:21,906][model8_pretrain.py][INFO] Epoch:[0/2](959400/4588595) loss:2.709 lr:0.0000100 epoch_Time:23119.0min: [2024-01-06 23:31:21,906][model8_pretrain.py][INFO] Epoch:[0/2](959400/4588595) loss:2.851 lr:0.0000100 epoch_Time:23119.0min: [2024-01-06 23:31:21,906][model8_pretrain.py][INFO] Epoch:[0/2](959400/4588595) loss:2.865 lr:0.0000100 epoch_Time:23119.0min: [2024-01-06 23:31:21,906][model8_pretrain.py][INFO] Epoch:[0/2](959400/4588595) loss:2.877 lr:0.0000100 epoch_Time:23119.0min: [2024-01-06 23:31:21,906][model8_pretrain.py][INFO] Epoch:[0/2](959400/4588595) loss:2.660 lr:0.0000100 epoch_Time:23119.0min: [2024-01-06 23:32:02,317][model8_pretrain.py][INFO] Epoch:[0/2](959500/4588595) loss:3.201 lr:0.0000100 epoch_Time:23118.0min: [2024-01-06 23:32:02,317][model8_pretrain.py][INFO] Epoch:[0/2](959500/4588595) loss:3.018 lr:0.0000100 epoch_Time:23118.0min: [2024-01-06 23:32:02,317][model8_pretrain.py][INFO] Epoch:[0/2](959500/4588595) loss:3.354 lr:0.0000100 epoch_Time:23118.0min: [2024-01-06 23:32:02,317][model8_pretrain.py][INFO] Epoch:[0/2](959500/4588595) loss:2.428 lr:0.0000100 epoch_Time:23118.0min: [2024-01-06 23:32:02,317][model8_pretrain.py][INFO] Epoch:[0/2](959500/4588595) loss:2.840 lr:0.0000100 epoch_Time:23118.0min: [2024-01-06 23:32:02,317][model8_pretrain.py][INFO] Epoch:[0/2](959500/4588595) loss:2.980 lr:0.0000100 epoch_Time:23118.0min: [2024-01-06 23:32:02,317][model8_pretrain.py][INFO] Epoch:[0/2](959500/4588595) loss:2.934 lr:0.0000100 epoch_Time:23118.0min: [2024-01-06 23:32:02,317][model8_pretrain.py][INFO] Epoch:[0/2](959500/4588595) loss:2.696 lr:0.0000100 epoch_Time:23118.0min: [2024-01-06 23:32:49,298][model8_pretrain.py][INFO] Epoch:[0/2](959600/4588595) loss:3.159 lr:0.0000100 epoch_Time:23118.0min: [2024-01-06 23:32:49,298][model8_pretrain.py][INFO] Epoch:[0/2](959600/4588595) loss:3.278 lr:0.0000100 
epoch_Time:23118.0min: [2024-01-06 23:32:49,298][model8_pretrain.py][INFO] Epoch:[0/2](959600/4588595) loss:2.326 lr:0.0000100 epoch_Time:23118.0min: [2024-01-06 23:32:49,298][model8_pretrain.py][INFO] Epoch:[0/2](959600/4588595) loss:2.755 lr:0.0000100 epoch_Time:23118.0min: [2024-01-06 23:32:49,298][model8_pretrain.py][INFO] Epoch:[0/2](959600/4588595) loss:2.880 lr:0.0000100 epoch_Time:23118.0min: [2024-01-06 23:32:49,298][model8_pretrain.py][INFO] Epoch:[0/2](959600/4588595) loss:2.473 lr:0.0000100 epoch_Time:23118.0min: [2024-01-06 23:32:49,298][model8_pretrain.py][INFO] Epoch:[0/2](959600/4588595) loss:2.319 lr:0.0000100 epoch_Time:23118.0min: [2024-01-06 23:32:49,298][model8_pretrain.py][INFO] Epoch:[0/2](959600/4588595) loss:2.696 lr:0.0000100 epoch_Time:23118.0min: [2024-01-06 23:33:26,213][model8_pretrain.py][INFO] Epoch:[0/2](959700/4588595) loss:2.618 lr:0.0000100 epoch_Time:23118.0min: [2024-01-06 23:33:26,213][model8_pretrain.py][INFO] Epoch:[0/2](959700/4588595) loss:2.851 lr:0.0000100 epoch_Time:23118.0min: [2024-01-06 23:33:26,213][model8_pretrain.py][INFO] Epoch:[0/2](959700/4588595) loss:2.793 lr:0.0000100 epoch_Time:23118.0min: [2024-01-06 23:33:26,213][model8_pretrain.py][INFO] Epoch:[0/2](959700/4588595) loss:2.791 lr:0.0000100 epoch_Time:23118.0min: [2024-01-06 23:33:26,213][model8_pretrain.py][INFO] Epoch:[0/2](959700/4588595) loss:2.962 lr:0.0000100 epoch_Time:23118.0min: [2024-01-06 23:33:26,213][model8_pretrain.py][INFO] Epoch:[0/2](959700/4588595) loss:2.668 lr:0.0000100 epoch_Time:23118.0min: [2024-01-06 23:33:26,213][model8_pretrain.py][INFO] Epoch:[0/2](959700/4588595) loss:3.207 lr:0.0000100 epoch_Time:23118.0min: [2024-01-06 23:33:26,213][model8_pretrain.py][INFO] Epoch:[0/2](959700/4588595) loss:2.663 lr:0.0000100 epoch_Time:23118.0min: [2024-01-06 23:34:03,143][model8_pretrain.py][INFO] Epoch:[0/2](959800/4588595) loss:3.027 lr:0.0000100 epoch_Time:23116.0min: [2024-01-06 23:34:03,143][model8_pretrain.py][INFO] 
Epoch:[0/2](959800/4588595) loss:2.206 lr:0.0000100 epoch_Time:23116.0min: [2024-01-06 23:34:03,143][model8_pretrain.py][INFO] Epoch:[0/2](959800/4588595) loss:2.992 lr:0.0000100 epoch_Time:23116.0min: [2024-01-06 23:34:03,143][model8_pretrain.py][INFO] Epoch:[0/2](959800/4588595) loss:2.710 lr:0.0000100 epoch_Time:23116.0min: [2024-01-06 23:34:03,143][model8_pretrain.py][INFO] Epoch:[0/2](959800/4588595) loss:2.781 lr:0.0000100 epoch_Time:23116.0min: [2024-01-06 23:34:03,143][model8_pretrain.py][INFO] Epoch:[0/2](959800/4588595) loss:3.015 lr:0.0000100 epoch_Time:23116.0min: [2024-01-06 23:34:03,143][model8_pretrain.py][INFO] Epoch:[0/2](959800/4588595) loss:2.303 lr:0.0000100 epoch_Time:23116.0min: [2024-01-06 23:34:03,143][model8_pretrain.py][INFO] Epoch:[0/2](959800/4588595) loss:2.924 lr:0.0000100 epoch_Time:23116.0min: [2024-01-06 23:34:40,074][model8_pretrain.py][INFO] Epoch:[0/2](959900/4588595) loss:3.424 lr:0.0000100 epoch_Time:23116.0min: [2024-01-06 23:34:40,074][model8_pretrain.py][INFO] Epoch:[0/2](959900/4588595) loss:2.641 lr:0.0000100 epoch_Time:23116.0min: [2024-01-06 23:34:40,074][model8_pretrain.py][INFO] Epoch:[0/2](959900/4588595) loss:2.931 lr:0.0000100 epoch_Time:23116.0min: [2024-01-06 23:34:40,074][model8_pretrain.py][INFO] Epoch:[0/2](959900/4588595) loss:2.971 lr:0.0000100 epoch_Time:23116.0min: [2024-01-06 23:34:40,074][model8_pretrain.py][INFO] Epoch:[0/2](959900/4588595) loss:2.795 lr:0.0000100 epoch_Time:23116.0min: [2024-01-06 23:34:40,074][model8_pretrain.py][INFO] Epoch:[0/2](959900/4588595) loss:2.119 lr:0.0000100 epoch_Time:23116.0min: [2024-01-06 23:34:40,074][model8_pretrain.py][INFO] Epoch:[0/2](959900/4588595) loss:3.008 lr:0.0000100 epoch_Time:23116.0min: [2024-01-06 23:34:40,074][model8_pretrain.py][INFO] Epoch:[0/2](959900/4588595) loss:2.833 lr:0.0000100 epoch_Time:23116.0min: [2024-01-06 23:35:17,007][model8_pretrain.py][INFO] Epoch:[0/2](960000/4588595) loss:3.093 lr:0.0000100 epoch_Time:23115.0min: [2024-01-06 
23:35:17,007][model8_pretrain.py][INFO] Epoch:[0/2](960000/4588595) loss:2.856 lr:0.0000100 epoch_Time:23115.0min: [2024-01-06 23:35:17,007][model8_pretrain.py][INFO] Epoch:[0/2](960000/4588595) loss:2.898 lr:0.0000100 epoch_Time:23115.0min: [2024-01-06 23:35:17,007][model8_pretrain.py][INFO] Epoch:[0/2](960000/4588595) loss:2.974 lr:0.0000100 epoch_Time:23115.0min: [2024-01-06 23:35:17,007][model8_pretrain.py][INFO] Epoch:[0/2](960000/4588595) loss:2.335 lr:0.0000100 epoch_Time:23115.0min: [2024-01-06 23:35:17,007][model8_pretrain.py][INFO] Epoch:[0/2](960000/4588595) loss:2.741 lr:0.0000100 epoch_Time:23115.0min: [2024-01-06 23:35:17,007][model8_pretrain.py][INFO] Epoch:[0/2](960000/4588595) loss:2.767 lr:0.0000100 epoch_Time:23115.0min: [2024-01-06 23:35:17,008][model8_pretrain.py][INFO] Epoch:[0/2](960000/4588595) loss:3.294 lr:0.0000100 epoch_Time:23115.0min: [2024-01-06 23:35:53,944][model8_pretrain.py][INFO] Epoch:[0/2](960100/4588595) loss:2.916 lr:0.0000100 epoch_Time:23114.0min: [2024-01-06 23:35:53,944][model8_pretrain.py][INFO] Epoch:[0/2](960100/4588595) loss:2.681 lr:0.0000100 epoch_Time:23114.0min: [2024-01-06 23:35:53,944][model8_pretrain.py][INFO] Epoch:[0/2](960100/4588595) loss:3.138 lr:0.0000100 epoch_Time:23114.0min: [2024-01-06 23:35:53,944][model8_pretrain.py][INFO] Epoch:[0/2](960100/4588595) loss:2.488 lr:0.0000100 epoch_Time:23114.0min: [2024-01-06 23:35:53,944][model8_pretrain.py][INFO] Epoch:[0/2](960100/4588595) loss:2.842 lr:0.0000100 epoch_Time:23114.0min: [2024-01-06 23:35:53,944][model8_pretrain.py][INFO] Epoch:[0/2](960100/4588595) loss:2.527 lr:0.0000100 epoch_Time:23114.0min: [2024-01-06 23:35:53,944][model8_pretrain.py][INFO] Epoch:[0/2](960100/4588595) loss:2.454 lr:0.0000100 epoch_Time:23114.0min: [2024-01-06 23:35:53,944][model8_pretrain.py][INFO] Epoch:[0/2](960100/4588595) loss:2.848 lr:0.0000100 epoch_Time:23114.0min: [2024-01-06 23:36:30,877][model8_pretrain.py][INFO] Epoch:[0/2](960200/4588595) loss:2.930 lr:0.0000100 
epoch_Time:23114.0min: [2024-01-06 23:36:30,877][model8_pretrain.py][INFO] Epoch:[0/2](960200/4588595) loss:3.087 lr:0.0000100 epoch_Time:23114.0min: [2024-01-06 23:36:30,877][model8_pretrain.py][INFO] Epoch:[0/2](960200/4588595) loss:3.074 lr:0.0000100 epoch_Time:23114.0min: [2024-01-06 23:36:30,877][model8_pretrain.py][INFO] Epoch:[0/2](960200/4588595) loss:3.277 lr:0.0000100 epoch_Time:23114.0min: [2024-01-06 23:36:30,877][model8_pretrain.py][INFO] Epoch:[0/2](960200/4588595) loss:2.525 lr:0.0000100 epoch_Time:23114.0min: [2024-01-06 23:36:30,877][model8_pretrain.py][INFO] Epoch:[0/2](960200/4588595) loss:2.526 lr:0.0000100 epoch_Time:23114.0min: [2024-01-06 23:36:30,877][model8_pretrain.py][INFO] Epoch:[0/2](960200/4588595) loss:3.334 lr:0.0000100 epoch_Time:23114.0min: [2024-01-06 23:36:30,877][model8_pretrain.py][INFO] Epoch:[0/2](960200/4588595) loss:2.626 lr:0.0000100 epoch_Time:23114.0min: [2024-01-06 23:37:09,458][model8_pretrain.py][INFO] Epoch:[0/2](960300/4588595) loss:2.451 lr:0.0000100 epoch_Time:23113.0min: [2024-01-06 23:37:09,458][model8_pretrain.py][INFO] Epoch:[0/2](960300/4588595) loss:3.423 lr:0.0000100 epoch_Time:23113.0min: [2024-01-06 23:37:09,458][model8_pretrain.py][INFO] Epoch:[0/2](960300/4588595) loss:2.849 lr:0.0000100 epoch_Time:23113.0min: [2024-01-06 23:37:09,458][model8_pretrain.py][INFO] Epoch:[0/2](960300/4588595) loss:2.878 lr:0.0000100 epoch_Time:23113.0min: [2024-01-06 23:37:09,458][model8_pretrain.py][INFO] Epoch:[0/2](960300/4588595) loss:2.882 lr:0.0000100 epoch_Time:23113.0min: [2024-01-06 23:37:09,458][model8_pretrain.py][INFO] Epoch:[0/2](960300/4588595) loss:2.728 lr:0.0000100 epoch_Time:23113.0min: [2024-01-06 23:37:09,458][model8_pretrain.py][INFO] Epoch:[0/2](960300/4588595) loss:2.825 lr:0.0000100 epoch_Time:23113.0min: [2024-01-06 23:37:09,459][model8_pretrain.py][INFO] Epoch:[0/2](960300/4588595) loss:3.040 lr:0.0000100 epoch_Time:23113.0min: [2024-01-06 23:37:58,059][model8_pretrain.py][INFO] 
Epoch:[0/2](960400/4588595) loss:2.951 lr:0.0000100 epoch_Time:23113.0min: [2024-01-06 23:37:58,059][model8_pretrain.py][INFO] Epoch:[0/2](960400/4588595) loss:2.331 lr:0.0000100 epoch_Time:23113.0min: [2024-01-06 23:37:58,059][model8_pretrain.py][INFO] Epoch:[0/2](960400/4588595) loss:2.823 lr:0.0000100 epoch_Time:23113.0min: [2024-01-06 23:37:58,059][model8_pretrain.py][INFO] Epoch:[0/2](960400/4588595) loss:3.244 lr:0.0000100 epoch_Time:23113.0min: [2024-01-06 23:37:58,059][model8_pretrain.py][INFO] Epoch:[0/2](960400/4588595) loss:2.980 lr:0.0000100 epoch_Time:23113.0min: [2024-01-06 23:37:58,059][model8_pretrain.py][INFO] Epoch:[0/2](960400/4588595) loss:2.929 lr:0.0000100 epoch_Time:23113.0min: [2024-01-06 23:37:58,059][model8_pretrain.py][INFO] Epoch:[0/2](960400/4588595) loss:3.054 lr:0.0000100 epoch_Time:23113.0min: [2024-01-06 23:37:58,060][model8_pretrain.py][INFO] Epoch:[0/2](960400/4588595) loss:3.137 lr:0.0000100 epoch_Time:23113.0min: [2024-01-06 23:38:34,992][model8_pretrain.py][INFO] Epoch:[0/2](960500/4588595) loss:2.853 lr:0.0000100 epoch_Time:23113.0min: [2024-01-06 23:38:34,992][model8_pretrain.py][INFO] Epoch:[0/2](960500/4588595) loss:2.785 lr:0.0000100 epoch_Time:23113.0min: [2024-01-06 23:38:34,992][model8_pretrain.py][INFO] Epoch:[0/2](960500/4588595) loss:2.882 lr:0.0000100 epoch_Time:23113.0min: [2024-01-06 23:38:34,992][model8_pretrain.py][INFO] Epoch:[0/2](960500/4588595) loss:2.763 lr:0.0000100 epoch_Time:23113.0min: [2024-01-06 23:38:34,992][model8_pretrain.py][INFO] Epoch:[0/2](960500/4588595) loss:2.721 lr:0.0000100 epoch_Time:23113.0min: [2024-01-06 23:38:34,992][model8_pretrain.py][INFO] Epoch:[0/2](960500/4588595) loss:3.214 lr:0.0000100 epoch_Time:23113.0min: [2024-01-06 23:38:34,992][model8_pretrain.py][INFO] Epoch:[0/2](960500/4588595) loss:2.788 lr:0.0000100 epoch_Time:23113.0min: [2024-01-06 23:38:34,992][model8_pretrain.py][INFO] Epoch:[0/2](960500/4588595) loss:2.244 lr:0.0000100 epoch_Time:23113.0min: [2024-01-06 
23:39:11,932][model8_pretrain.py][INFO] Epoch:[0/2](960600/4588595) loss:3.106 lr:0.0000100 epoch_Time:23112.0min: [2024-01-06 23:39:11,932][model8_pretrain.py][INFO] Epoch:[0/2](960600/4588595) loss:2.697 lr:0.0000100 epoch_Time:23112.0min: [2024-01-06 23:39:11,932][model8_pretrain.py][INFO] Epoch:[0/2](960600/4588595) loss:2.502 lr:0.0000100 epoch_Time:23112.0min: [2024-01-06 23:39:11,932][model8_pretrain.py][INFO] Epoch:[0/2](960600/4588595) loss:2.988 lr:0.0000100 epoch_Time:23112.0min: [2024-01-06 23:39:11,932][model8_pretrain.py][INFO] Epoch:[0/2](960600/4588595) loss:3.017 lr:0.0000100 epoch_Time:23112.0min: [2024-01-06 23:39:11,932][model8_pretrain.py][INFO] Epoch:[0/2](960600/4588595) loss:2.278 lr:0.0000100 epoch_Time:23112.0min: [2024-01-06 23:39:11,932][model8_pretrain.py][INFO] Epoch:[0/2](960600/4588595) loss:3.338 lr:0.0000100 epoch_Time:23112.0min: [2024-01-06 23:39:11,933][model8_pretrain.py][INFO] Epoch:[0/2](960600/4588595) loss:1.905 lr:0.0000100 epoch_Time:23112.0min: [2024-01-06 23:39:48,877][model8_pretrain.py][INFO] Epoch:[0/2](960700/4588595) loss:3.059 lr:0.0000100 epoch_Time:23111.0min: [2024-01-06 23:39:48,877][model8_pretrain.py][INFO] Epoch:[0/2](960700/4588595) loss:2.823 lr:0.0000100 epoch_Time:23111.0min: [2024-01-06 23:39:48,877][model8_pretrain.py][INFO] Epoch:[0/2](960700/4588595) loss:3.331 lr:0.0000100 epoch_Time:23111.0min: [2024-01-06 23:39:48,877][model8_pretrain.py][INFO] Epoch:[0/2](960700/4588595) loss:2.016 lr:0.0000100 epoch_Time:23111.0min: [2024-01-06 23:39:48,877][model8_pretrain.py][INFO] Epoch:[0/2](960700/4588595) loss:2.617 lr:0.0000100 epoch_Time:23111.0min: [2024-01-06 23:39:48,877][model8_pretrain.py][INFO] Epoch:[0/2](960700/4588595) loss:2.948 lr:0.0000100 epoch_Time:23111.0min: [2024-01-06 23:39:48,877][model8_pretrain.py][INFO] Epoch:[0/2](960700/4588595) loss:2.852 lr:0.0000100 epoch_Time:23111.0min: [2024-01-06 23:39:48,877][model8_pretrain.py][INFO] Epoch:[0/2](960700/4588595) loss:2.750 lr:0.0000100 
epoch_Time:23111.0min: [2024-01-06 23:40:25,817][model8_pretrain.py][INFO] Epoch:[0/2](960800/4588595) loss:2.401 lr:0.0000100 epoch_Time:23110.0min: [2024-01-06 23:40:25,817][model8_pretrain.py][INFO] Epoch:[0/2](960800/4588595) loss:2.747 lr:0.0000100 epoch_Time:23110.0min: [2024-01-06 23:40:25,817][model8_pretrain.py][INFO] Epoch:[0/2](960800/4588595) loss:3.235 lr:0.0000100 epoch_Time:23110.0min: [2024-01-06 23:40:25,817][model8_pretrain.py][INFO] Epoch:[0/2](960800/4588595) loss:2.856 lr:0.0000100 epoch_Time:23110.0min: [2024-01-06 23:40:25,817][model8_pretrain.py][INFO] Epoch:[0/2](960800/4588595) loss:2.513 lr:0.0000100 epoch_Time:23110.0min: [2024-01-06 23:40:25,817][model8_pretrain.py][INFO] Epoch:[0/2](960800/4588595) loss:2.414 lr:0.0000100 epoch_Time:23110.0min: [2024-01-06 23:40:25,817][model8_pretrain.py][INFO] Epoch:[0/2](960800/4588595) loss:2.910 lr:0.0000100 epoch_Time:23110.0min: [2024-01-06 23:40:25,817][model8_pretrain.py][INFO] Epoch:[0/2](960800/4588595) loss:2.924 lr:0.0000100 epoch_Time:23110.0min: [2024-01-06 23:41:02,764][model8_pretrain.py][INFO] Epoch:[0/2](960900/4588595) loss:3.046 lr:0.0000100 epoch_Time:23109.0min: [2024-01-06 23:41:02,764][model8_pretrain.py][INFO] Epoch:[0/2](960900/4588595) loss:2.986 lr:0.0000100 epoch_Time:23109.0min: [2024-01-06 23:41:02,764][model8_pretrain.py][INFO] Epoch:[0/2](960900/4588595) loss:2.475 lr:0.0000100 epoch_Time:23109.0min: [2024-01-06 23:41:02,764][model8_pretrain.py][INFO] Epoch:[0/2](960900/4588595) loss:2.947 lr:0.0000100 epoch_Time:23109.0min: [2024-01-06 23:41:02,764][model8_pretrain.py][INFO] Epoch:[0/2](960900/4588595) loss:2.874 lr:0.0000100 epoch_Time:23109.0min: [2024-01-06 23:41:02,764][model8_pretrain.py][INFO] Epoch:[0/2](960900/4588595) loss:2.887 lr:0.0000100 epoch_Time:23109.0min: [2024-01-06 23:41:02,764][model8_pretrain.py][INFO] Epoch:[0/2](960900/4588595) loss:1.967 lr:0.0000100 epoch_Time:23109.0min: [2024-01-06 23:41:02,764][model8_pretrain.py][INFO] 
Epoch:[0/2](960900/4588595) loss:2.977 lr:0.0000100 epoch_Time:23109.0min: [2024-01-06 23:41:39,702][model8_pretrain.py][INFO] Epoch:[0/2](961000/4588595) loss:2.387 lr:0.0000100 epoch_Time:23109.0min: [2024-01-06 23:41:39,702][model8_pretrain.py][INFO] Epoch:[0/2](961000/4588595) loss:2.060 lr:0.0000100 epoch_Time:23109.0min: [2024-01-06 23:41:39,703][model8_pretrain.py][INFO] Epoch:[0/2](961000/4588595) loss:2.908 lr:0.0000100 epoch_Time:23109.0min: [2024-01-06 23:41:39,703][model8_pretrain.py][INFO] Epoch:[0/2](961000/4588595) loss:3.029 lr:0.0000100 epoch_Time:23109.0min: [2024-01-06 23:41:39,703][model8_pretrain.py][INFO] Epoch:[0/2](961000/4588595) loss:2.851 lr:0.0000100 epoch_Time:23109.0min: [2024-01-06 23:41:39,703][model8_pretrain.py][INFO] Epoch:[0/2](961000/4588595) loss:2.967 lr:0.0000100 epoch_Time:23109.0min: [2024-01-06 23:41:39,703][model8_pretrain.py][INFO] Epoch:[0/2](961000/4588595) loss:3.371 lr:0.0000100 epoch_Time:23109.0min: [2024-01-06 23:41:39,703][model8_pretrain.py][INFO] Epoch:[0/2](961000/4588595) loss:2.276 lr:0.0000100 epoch_Time:23109.0min: [2024-01-06 23:42:18,176][model8_pretrain.py][INFO] Epoch:[0/2](961100/4588595) loss:2.942 lr:0.0000100 epoch_Time:23108.0min: [2024-01-06 23:42:18,176][model8_pretrain.py][INFO] Epoch:[0/2](961100/4588595) loss:2.959 lr:0.0000100 epoch_Time:23108.0min: [2024-01-06 23:42:18,176][model8_pretrain.py][INFO] Epoch:[0/2](961100/4588595) loss:3.298 lr:0.0000100 epoch_Time:23108.0min: [2024-01-06 23:42:18,176][model8_pretrain.py][INFO] Epoch:[0/2](961100/4588595) loss:2.898 lr:0.0000100 epoch_Time:23108.0min: [2024-01-06 23:42:18,176][model8_pretrain.py][INFO] Epoch:[0/2](961100/4588595) loss:2.610 lr:0.0000100 epoch_Time:23108.0min: [2024-01-06 23:42:18,176][model8_pretrain.py][INFO] Epoch:[0/2](961100/4588595) loss:2.779 lr:0.0000100 epoch_Time:23108.0min: [2024-01-06 23:42:18,176][model8_pretrain.py][INFO] Epoch:[0/2](961100/4588595) loss:2.936 lr:0.0000100 epoch_Time:23108.0min: [2024-01-06 
23:42:18,176][model8_pretrain.py][INFO] Epoch:[0/2](961100/4588595) loss:3.021 lr:0.0000100 epoch_Time:23108.0min: [2024-01-06 23:43:06,791][model8_pretrain.py][INFO] Epoch:[0/2](961200/4588595) loss:2.893 lr:0.0000100 epoch_Time:23108.0min: [2024-01-06 23:43:06,791][model8_pretrain.py][INFO] Epoch:[0/2](961200/4588595) loss:2.659 lr:0.0000100 epoch_Time:23108.0min: [2024-01-06 23:43:06,791][model8_pretrain.py][INFO] Epoch:[0/2](961200/4588595) loss:2.762 lr:0.0000100 epoch_Time:23108.0min: [2024-01-06 23:43:06,791][model8_pretrain.py][INFO] Epoch:[0/2](961200/4588595) loss:2.237 lr:0.0000100 epoch_Time:23108.0min: [2024-01-06 23:43:06,792][model8_pretrain.py][INFO] Epoch:[0/2](961200/4588595) loss:2.717 lr:0.0000100 epoch_Time:23108.0min: [2024-01-06 23:43:06,791][model8_pretrain.py][INFO] Epoch:[0/2](961200/4588595) loss:3.188 lr:0.0000100 epoch_Time:23108.0min: [2024-01-06 23:43:06,791][model8_pretrain.py][INFO] Epoch:[0/2](961200/4588595) loss:3.356 lr:0.0000100 epoch_Time:23108.0min: [2024-01-06 23:43:06,792][model8_pretrain.py][INFO] Epoch:[0/2](961200/4588595) loss:2.683 lr:0.0000100 epoch_Time:23108.0min: [2024-01-06 23:43:43,729][model8_pretrain.py][INFO] Epoch:[0/2](961300/4588595) loss:3.058 lr:0.0000100 epoch_Time:23108.0min: [2024-01-06 23:43:43,729][model8_pretrain.py][INFO] Epoch:[0/2](961300/4588595) loss:2.585 lr:0.0000100 epoch_Time:23108.0min: [2024-01-06 23:43:43,729][model8_pretrain.py][INFO] Epoch:[0/2](961300/4588595) loss:2.559 lr:0.0000100 epoch_Time:23108.0min: [2024-01-06 23:43:43,729][model8_pretrain.py][INFO] Epoch:[0/2](961300/4588595) loss:2.865 lr:0.0000100 epoch_Time:23108.0min: [2024-01-06 23:43:43,729][model8_pretrain.py][INFO] Epoch:[0/2](961300/4588595) loss:2.953 lr:0.0000100 epoch_Time:23108.0min: [2024-01-06 23:43:43,729][model8_pretrain.py][INFO] Epoch:[0/2](961300/4588595) loss:3.114 lr:0.0000100 epoch_Time:23108.0min: [2024-01-06 23:43:43,729][model8_pretrain.py][INFO] Epoch:[0/2](961300/4588595) loss:2.586 lr:0.0000100 
epoch_Time:23108.0min: [2024-01-06 23:43:43,729][model8_pretrain.py][INFO] Epoch:[0/2](961300/4588595) loss:3.091 lr:0.0000100 epoch_Time:23108.0min: [2024-01-06 23:44:20,676][model8_pretrain.py][INFO] Epoch:[0/2](961400/4588595) loss:3.045 lr:0.0000100 epoch_Time:23107.0min: [2024-01-06 23:44:20,676][model8_pretrain.py][INFO] Epoch:[0/2](961400/4588595) loss:2.706 lr:0.0000100 epoch_Time:23107.0min: [2024-01-06 23:44:20,676][model8_pretrain.py][INFO] Epoch:[0/2](961400/4588595) loss:2.185 lr:0.0000100 epoch_Time:23107.0min: [2024-01-06 23:44:20,676][model8_pretrain.py][INFO] Epoch:[0/2](961400/4588595) loss:2.787 lr:0.0000100 epoch_Time:23107.0min: [2024-01-06 23:44:20,676][model8_pretrain.py][INFO] Epoch:[0/2](961400/4588595) loss:2.821 lr:0.0000100 epoch_Time:23107.0min: [2024-01-06 23:44:20,676][model8_pretrain.py][INFO] Epoch:[0/2](961400/4588595) loss:2.728 lr:0.0000100 epoch_Time:23107.0min: [2024-01-06 23:44:20,676][model8_pretrain.py][INFO] Epoch:[0/2](961400/4588595) loss:2.839 lr:0.0000100 epoch_Time:23107.0min: [2024-01-06 23:44:20,676][model8_pretrain.py][INFO] Epoch:[0/2](961400/4588595) loss:2.538 lr:0.0000100 epoch_Time:23107.0min: [2024-01-06 23:44:57,621][model8_pretrain.py][INFO] Epoch:[0/2](961500/4588595) loss:2.827 lr:0.0000100 epoch_Time:23106.0min: [2024-01-06 23:44:57,621][model8_pretrain.py][INFO] Epoch:[0/2](961500/4588595) loss:2.284 lr:0.0000100 epoch_Time:23106.0min: [2024-01-06 23:44:57,621][model8_pretrain.py][INFO] Epoch:[0/2](961500/4588595) loss:2.611 lr:0.0000100 epoch_Time:23106.0min: [2024-01-06 23:44:57,621][model8_pretrain.py][INFO] Epoch:[0/2](961500/4588595) loss:2.285 lr:0.0000100 epoch_Time:23106.0min: [2024-01-06 23:44:57,621][model8_pretrain.py][INFO] Epoch:[0/2](961500/4588595) loss:2.782 lr:0.0000100 epoch_Time:23106.0min: [2024-01-06 23:44:57,621][model8_pretrain.py][INFO] Epoch:[0/2](961500/4588595) loss:3.178 lr:0.0000100 epoch_Time:23106.0min: [2024-01-06 23:44:57,621][model8_pretrain.py][INFO] 
Epoch:[0/2](961500/4588595) loss:2.932 lr:0.0000100 epoch_Time:23106.0min: [2024-01-06 23:44:57,621][model8_pretrain.py][INFO] Epoch:[0/2](961500/4588595) loss:2.525 lr:0.0000100 epoch_Time:23106.0min: [2024-01-06 23:45:34,558][model8_pretrain.py][INFO] Epoch:[0/2](961600/4588595) loss:3.245 lr:0.0000100 epoch_Time:23106.0min: [2024-01-06 23:45:34,558][model8_pretrain.py][INFO] Epoch:[0/2](961600/4588595) loss:2.284 lr:0.0000100 epoch_Time:23106.0min: [2024-01-06 23:45:34,558][model8_pretrain.py][INFO] Epoch:[0/2](961600/4588595) loss:3.378 lr:0.0000100 epoch_Time:23106.0min: [2024-01-06 23:45:34,558][model8_pretrain.py][INFO] Epoch:[0/2](961600/4588595) loss:2.988 lr:0.0000100 epoch_Time:23106.0min: [2024-01-06 23:45:34,558][model8_pretrain.py][INFO] Epoch:[0/2](961600/4588595) loss:2.835 lr:0.0000100 epoch_Time:23106.0min: [2024-01-06 23:45:34,558][model8_pretrain.py][INFO] Epoch:[0/2](961600/4588595) loss:3.036 lr:0.0000100 epoch_Time:23106.0min: [2024-01-06 23:45:34,558][model8_pretrain.py][INFO] Epoch:[0/2](961600/4588595) loss:3.084 lr:0.0000100 epoch_Time:23106.0min: [2024-01-06 23:45:34,558][model8_pretrain.py][INFO] Epoch:[0/2](961600/4588595) loss:2.635 lr:0.0000100 epoch_Time:23106.0min: [2024-01-06 23:46:11,502][model8_pretrain.py][INFO] Epoch:[0/2](961700/4588595) loss:2.707 lr:0.0000100 epoch_Time:23105.0min: [2024-01-06 23:46:11,502][model8_pretrain.py][INFO] Epoch:[0/2](961700/4588595) loss:2.333 lr:0.0000100 epoch_Time:23105.0min: [2024-01-06 23:46:11,502][model8_pretrain.py][INFO] Epoch:[0/2](961700/4588595) loss:2.823 lr:0.0000100 epoch_Time:23105.0min: [2024-01-06 23:46:11,502][model8_pretrain.py][INFO] Epoch:[0/2](961700/4588595) loss:2.695 lr:0.0000100 epoch_Time:23105.0min: [2024-01-06 23:46:11,502][model8_pretrain.py][INFO] Epoch:[0/2](961700/4588595) loss:2.757 lr:0.0000100 epoch_Time:23105.0min: [2024-01-06 23:46:11,502][model8_pretrain.py][INFO] Epoch:[0/2](961700/4588595) loss:2.600 lr:0.0000100 epoch_Time:23105.0min: [2024-01-06 
23:46:11,502][model8_pretrain.py][INFO] Epoch:[0/2](961700/4588595) loss:2.571 lr:0.0000100 epoch_Time:23105.0min: [2024-01-06 23:46:11,502][model8_pretrain.py][INFO] Epoch:[0/2](961700/4588595) loss:2.831 lr:0.0000100 epoch_Time:23105.0min: [2024-01-06 23:46:48,421][model8_pretrain.py][INFO] Epoch:[0/2](961800/4588595) loss:3.035 lr:0.0000100 epoch_Time:23103.0min: [2024-01-06 23:46:48,421][model8_pretrain.py][INFO] Epoch:[0/2](961800/4588595) loss:2.609 lr:0.0000100 epoch_Time:23103.0min: [2024-01-06 23:46:48,421][model8_pretrain.py][INFO] Epoch:[0/2](961800/4588595) loss:2.933 lr:0.0000100 epoch_Time:23103.0min: [2024-01-06 23:46:48,421][model8_pretrain.py][INFO] Epoch:[0/2](961800/4588595) loss:3.198 lr:0.0000100 epoch_Time:23103.0min: [2024-01-06 23:46:48,421][model8_pretrain.py][INFO] Epoch:[0/2](961800/4588595) loss:2.934 lr:0.0000100 epoch_Time:23103.0min: [2024-01-06 23:46:48,421][model8_pretrain.py][INFO] Epoch:[0/2](961800/4588595) loss:2.761 lr:0.0000100 epoch_Time:23103.0min: [2024-01-06 23:46:48,421][model8_pretrain.py][INFO] Epoch:[0/2](961800/4588595) loss:2.999 lr:0.0000100 epoch_Time:23103.0min: [2024-01-06 23:46:48,421][model8_pretrain.py][INFO] Epoch:[0/2](961800/4588595) loss:3.197 lr:0.0000100 epoch_Time:23103.0min: [2024-01-06 23:47:26,883][model8_pretrain.py][INFO] Epoch:[0/2](961900/4588595) loss:3.106 lr:0.0000100 epoch_Time:23103.0min: [2024-01-06 23:47:26,883][model8_pretrain.py][INFO] Epoch:[0/2](961900/4588595) loss:3.165 lr:0.0000100 epoch_Time:23103.0min: [2024-01-06 23:47:26,883][model8_pretrain.py][INFO] Epoch:[0/2](961900/4588595) loss:2.006 lr:0.0000100 epoch_Time:23103.0min: [2024-01-06 23:47:26,883][model8_pretrain.py][INFO] Epoch:[0/2](961900/4588595) loss:2.854 lr:0.0000100 epoch_Time:23103.0min: [2024-01-06 23:47:26,883][model8_pretrain.py][INFO] Epoch:[0/2](961900/4588595) loss:2.877 lr:0.0000100 epoch_Time:23104.0min: [2024-01-06 23:47:26,883][model8_pretrain.py][INFO] Epoch:[0/2](961900/4588595) loss:3.300 lr:0.0000100 
epoch_Time:23103.0min: [2024-01-06 23:47:26,883][model8_pretrain.py][INFO] Epoch:[0/2](961900/4588595) loss:3.281 lr:0.0000100 epoch_Time:23103.0min: [2024-01-06 23:47:26,883][model8_pretrain.py][INFO] Epoch:[0/2](961900/4588595) loss:2.647 lr:0.0000100 epoch_Time:23103.0min: [2024-01-06 23:48:15,597][model8_pretrain.py][INFO] Epoch:[0/2](962000/4588595) loss:2.372 lr:0.0000100 epoch_Time:23103.0min: [2024-01-06 23:48:15,597][model8_pretrain.py][INFO] Epoch:[0/2](962000/4588595) loss:2.730 lr:0.0000100 epoch_Time:23103.0min: [2024-01-06 23:48:15,597][model8_pretrain.py][INFO] Epoch:[0/2](962000/4588595) loss:3.074 lr:0.0000100 epoch_Time:23103.0min: [2024-01-06 23:48:15,597][model8_pretrain.py][INFO] Epoch:[0/2](962000/4588595) loss:2.482 lr:0.0000100 epoch_Time:23103.0min: [2024-01-06 23:48:15,597][model8_pretrain.py][INFO] Epoch:[0/2](962000/4588595) loss:3.092 lr:0.0000100 epoch_Time:23103.0min: [2024-01-06 23:48:15,597][model8_pretrain.py][INFO] Epoch:[0/2](962000/4588595) loss:3.134 lr:0.0000100 epoch_Time:23103.0min: [2024-01-06 23:48:15,597][model8_pretrain.py][INFO] Epoch:[0/2](962000/4588595) loss:2.557 lr:0.0000100 epoch_Time:23103.0min: [2024-01-06 23:48:15,597][model8_pretrain.py][INFO] Epoch:[0/2](962000/4588595) loss:2.607 lr:0.0000100 epoch_Time:23103.0min: [2024-01-06 23:48:52,543][model8_pretrain.py][INFO] Epoch:[0/2](962100/4588595) loss:2.310 lr:0.0000100 epoch_Time:23102.0min: [2024-01-06 23:48:52,543][model8_pretrain.py][INFO] Epoch:[0/2](962100/4588595) loss:2.850 lr:0.0000100 epoch_Time:23102.0min: [2024-01-06 23:48:52,543][model8_pretrain.py][INFO] Epoch:[0/2](962100/4588595) loss:2.599 lr:0.0000100 epoch_Time:23102.0min: [2024-01-06 23:48:52,543][model8_pretrain.py][INFO] Epoch:[0/2](962100/4588595) loss:2.942 lr:0.0000100 epoch_Time:23102.0min: [2024-01-06 23:48:52,543][model8_pretrain.py][INFO] Epoch:[0/2](962100/4588595) loss:3.253 lr:0.0000100 epoch_Time:23102.0min: [2024-01-06 23:48:52,543][model8_pretrain.py][INFO] 
Epoch:[0/2](962100/4588595) loss:3.046 lr:0.0000100 epoch_Time:23102.0min: [2024-01-06 23:48:52,543][model8_pretrain.py][INFO] Epoch:[0/2](962100/4588595) loss:2.845 lr:0.0000100 epoch_Time:23102.0min: [2024-01-06 23:48:52,543][model8_pretrain.py][INFO] Epoch:[0/2](962100/4588595) loss:2.795 lr:0.0000100 epoch_Time:23102.0min: [2024-01-06 23:49:29,501][model8_pretrain.py][INFO] Epoch:[0/2](962200/4588595) loss:3.211 lr:0.0000100 epoch_Time:23102.0min: [2024-01-06 23:49:29,501][model8_pretrain.py][INFO] Epoch:[0/2](962200/4588595) loss:2.895 lr:0.0000100 epoch_Time:23102.0min: [2024-01-06 23:49:29,501][model8_pretrain.py][INFO] Epoch:[0/2](962200/4588595) loss:2.969 lr:0.0000100 epoch_Time:23102.0min: [2024-01-06 23:49:29,501][model8_pretrain.py][INFO] Epoch:[0/2](962200/4588595) loss:2.768 lr:0.0000100 epoch_Time:23102.0min: [2024-01-06 23:49:29,501][model8_pretrain.py][INFO] Epoch:[0/2](962200/4588595) loss:2.413 lr:0.0000100 epoch_Time:23102.0min: [2024-01-06 23:49:29,501][model8_pretrain.py][INFO] Epoch:[0/2](962200/4588595) loss:2.300 lr:0.0000100 epoch_Time:23102.0min: [2024-01-06 23:49:29,501][model8_pretrain.py][INFO] Epoch:[0/2](962200/4588595) loss:2.335 lr:0.0000100 epoch_Time:23102.0min: [2024-01-06 23:49:29,502][model8_pretrain.py][INFO] Epoch:[0/2](962200/4588595) loss:2.635 lr:0.0000100 epoch_Time:23102.0min: [2024-01-06 23:50:06,445][model8_pretrain.py][INFO] Epoch:[0/2](962300/4588595) loss:2.886 lr:0.0000100 epoch_Time:23101.0min: [2024-01-06 23:50:06,445][model8_pretrain.py][INFO] Epoch:[0/2](962300/4588595) loss:2.543 lr:0.0000100 epoch_Time:23101.0min: [2024-01-06 23:50:06,445][model8_pretrain.py][INFO] Epoch:[0/2](962300/4588595) loss:2.635 lr:0.0000100 epoch_Time:23101.0min: [2024-01-06 23:50:06,445][model8_pretrain.py][INFO] Epoch:[0/2](962300/4588595) loss:3.107 lr:0.0000100 epoch_Time:23101.0min: [2024-01-06 23:50:06,445][model8_pretrain.py][INFO] Epoch:[0/2](962300/4588595) loss:2.540 lr:0.0000100 epoch_Time:23101.0min: [2024-01-06 
23:50:06,445][model8_pretrain.py][INFO] Epoch:[0/2](962300/4588595) loss:2.468 lr:0.0000100 epoch_Time:23101.0min: [2024-01-06 23:50:06,445][model8_pretrain.py][INFO] Epoch:[0/2](962300/4588595) loss:3.127 lr:0.0000100 epoch_Time:23101.0min: [2024-01-06 23:50:06,445][model8_pretrain.py][INFO] Epoch:[0/2](962300/4588595) loss:2.668 lr:0.0000100 epoch_Time:23101.0min: [2024-01-06 23:50:43,398][model8_pretrain.py][INFO] Epoch:[0/2](962400/4588595) loss:3.271 lr:0.0000100 epoch_Time:23101.0min: [2024-01-06 23:50:43,398][model8_pretrain.py][INFO] Epoch:[0/2](962400/4588595) loss:2.662 lr:0.0000100 epoch_Time:23101.0min: [2024-01-06 23:50:43,398][model8_pretrain.py][INFO] Epoch:[0/2](962400/4588595) loss:2.630 lr:0.0000100 epoch_Time:23101.0min: [2024-01-06 23:50:43,398][model8_pretrain.py][INFO] Epoch:[0/2](962400/4588595) loss:2.767 lr:0.0000100 epoch_Time:23101.0min: [2024-01-06 23:50:43,398][model8_pretrain.py][INFO] Epoch:[0/2](962400/4588595) loss:2.711 lr:0.0000100 epoch_Time:23101.0min: [2024-01-06 23:50:43,398][model8_pretrain.py][INFO] Epoch:[0/2](962400/4588595) loss:3.053 lr:0.0000100 epoch_Time:23101.0min: [2024-01-06 23:50:43,398][model8_pretrain.py][INFO] Epoch:[0/2](962400/4588595) loss:2.926 lr:0.0000100 epoch_Time:23101.0min: [2024-01-06 23:50:43,398][model8_pretrain.py][INFO] Epoch:[0/2](962400/4588595) loss:2.944 lr:0.0000100 epoch_Time:23101.0min: [2024-01-06 23:51:20,356][model8_pretrain.py][INFO] Epoch:[0/2](962500/4588595) loss:2.552 lr:0.0000100 epoch_Time:23100.0min: [2024-01-06 23:51:20,356][model8_pretrain.py][INFO] Epoch:[0/2](962500/4588595) loss:2.968 lr:0.0000100 epoch_Time:23100.0min: [2024-01-06 23:51:20,356][model8_pretrain.py][INFO] Epoch:[0/2](962500/4588595) loss:3.143 lr:0.0000100 epoch_Time:23100.0min: [2024-01-06 23:51:20,356][model8_pretrain.py][INFO] Epoch:[0/2](962500/4588595) loss:2.148 lr:0.0000100 epoch_Time:23100.0min: [2024-01-06 23:51:20,356][model8_pretrain.py][INFO] Epoch:[0/2](962500/4588595) loss:2.690 lr:0.0000100 
epoch_Time:23100.0min: [2024-01-06 23:51:20,356][model8_pretrain.py][INFO] Epoch:[0/2](962500/4588595) loss:2.326 lr:0.0000100 epoch_Time:23100.0min: [2024-01-06 23:51:20,357][model8_pretrain.py][INFO] Epoch:[0/2](962500/4588595) loss:2.763 lr:0.0000100 epoch_Time:23100.0min: [2024-01-06 23:51:20,357][model8_pretrain.py][INFO] Epoch:[0/2](962500/4588595) loss:2.330 lr:0.0000100 epoch_Time:23100.0min: [2024-01-06 23:51:57,311][model8_pretrain.py][INFO] Epoch:[0/2](962600/4588595) loss:2.917 lr:0.0000100 epoch_Time:23099.0min: [2024-01-06 23:51:57,311][model8_pretrain.py][INFO] Epoch:[0/2](962600/4588595) loss:2.891 lr:0.0000100 epoch_Time:23099.0min: [2024-01-06 23:51:57,311][model8_pretrain.py][INFO] Epoch:[0/2](962600/4588595) loss:2.924 lr:0.0000100 epoch_Time:23099.0min: [2024-01-06 23:51:57,311][model8_pretrain.py][INFO] Epoch:[0/2](962600/4588595) loss:2.977 lr:0.0000100 epoch_Time:23099.0min: [2024-01-06 23:51:57,312][model8_pretrain.py][INFO] Epoch:[0/2](962600/4588595) loss:3.013 lr:0.0000100 epoch_Time:23099.0min: [2024-01-06 23:51:57,312][model8_pretrain.py][INFO] Epoch:[0/2](962600/4588595) loss:2.110 lr:0.0000100 epoch_Time:23099.0min: [2024-01-06 23:51:57,312][model8_pretrain.py][INFO] Epoch:[0/2](962600/4588595) loss:2.495 lr:0.0000100 epoch_Time:23099.0min: [2024-01-06 23:51:57,312][model8_pretrain.py][INFO] Epoch:[0/2](962600/4588595) loss:2.843 lr:0.0000100 epoch_Time:23099.0min: [2024-01-06 23:52:35,772][model8_pretrain.py][INFO] Epoch:[0/2](962700/4588595) loss:2.490 lr:0.0000100 epoch_Time:23099.0min: [2024-01-06 23:52:35,772][model8_pretrain.py][INFO] Epoch:[0/2](962700/4588595) loss:2.404 lr:0.0000100 epoch_Time:23099.0min: [2024-01-06 23:52:35,772][model8_pretrain.py][INFO] Epoch:[0/2](962700/4588595) loss:2.829 lr:0.0000100 epoch_Time:23099.0min: [2024-01-06 23:52:35,772][model8_pretrain.py][INFO] Epoch:[0/2](962700/4588595) loss:3.156 lr:0.0000100 epoch_Time:23099.0min: [2024-01-06 23:52:35,772][model8_pretrain.py][INFO] 
Epoch:[0/2](962700/4588595) loss:3.278 lr:0.0000100 epoch_Time:23099.0min: [2024-01-06 23:52:35,773][model8_pretrain.py][INFO] Epoch:[0/2](962700/4588595) loss:2.892 lr:0.0000100 epoch_Time:23099.0min: [2024-01-06 23:52:35,773][model8_pretrain.py][INFO] Epoch:[0/2](962700/4588595) loss:2.346 lr:0.0000100 epoch_Time:23099.0min: [2024-01-06 23:52:35,774][model8_pretrain.py][INFO] Epoch:[0/2](962700/4588595) loss:2.917 lr:0.0000100 epoch_Time:23099.0min: [2024-01-06 23:53:24,580][model8_pretrain.py][INFO] Epoch:[0/2](962800/4588595) loss:3.097 lr:0.0000100 epoch_Time:23099.0min: [2024-01-06 23:53:24,581][model8_pretrain.py][INFO] Epoch:[0/2](962800/4588595) loss:2.747 lr:0.0000100 epoch_Time:23099.0min: [2024-01-06 23:53:24,581][model8_pretrain.py][INFO] Epoch:[0/2](962800/4588595) loss:2.870 lr:0.0000100 epoch_Time:23099.0min: [2024-01-06 23:53:24,581][model8_pretrain.py][INFO] Epoch:[0/2](962800/4588595) loss:2.367 lr:0.0000100 epoch_Time:23099.0min: [2024-01-06 23:53:24,581][model8_pretrain.py][INFO] Epoch:[0/2](962800/4588595) loss:2.647 lr:0.0000100 epoch_Time:23099.0min: [2024-01-06 23:53:24,581][model8_pretrain.py][INFO] Epoch:[0/2](962800/4588595) loss:2.471 lr:0.0000100 epoch_Time:23099.0min: [2024-01-06 23:53:24,581][model8_pretrain.py][INFO] Epoch:[0/2](962800/4588595) loss:2.229 lr:0.0000100 epoch_Time:23099.0min: [2024-01-06 23:53:24,581][model8_pretrain.py][INFO] Epoch:[0/2](962800/4588595) loss:3.094 lr:0.0000100 epoch_Time:23099.0min: [2024-01-06 23:54:01,518][model8_pretrain.py][INFO] Epoch:[0/2](962900/4588595) loss:2.708 lr:0.0000100 epoch_Time:23097.0min: [2024-01-06 23:54:01,518][model8_pretrain.py][INFO] Epoch:[0/2](962900/4588595) loss:3.160 lr:0.0000100 epoch_Time:23097.0min: [2024-01-06 23:54:01,518][model8_pretrain.py][INFO] Epoch:[0/2](962900/4588595) loss:3.064 lr:0.0000100 epoch_Time:23097.0min: [2024-01-06 23:54:01,518][model8_pretrain.py][INFO] Epoch:[0/2](962900/4588595) loss:2.979 lr:0.0000100 epoch_Time:23097.0min: [2024-01-06 
23:54:01,518][model8_pretrain.py][INFO] Epoch:[0/2](962900/4588595) loss:3.088 lr:0.0000100 epoch_Time:23097.0min: [2024-01-06 23:54:01,518][model8_pretrain.py][INFO] Epoch:[0/2](962900/4588595) loss:3.107 lr:0.0000100 epoch_Time:23097.0min: [2024-01-06 23:54:01,518][model8_pretrain.py][INFO] Epoch:[0/2](962900/4588595) loss:2.874 lr:0.0000100 epoch_Time:23097.0min: [2024-01-06 23:54:01,518][model8_pretrain.py][INFO] Epoch:[0/2](962900/4588595) loss:2.912 lr:0.0000100 epoch_Time:23097.0min: [2024-01-06 23:54:38,456][model8_pretrain.py][INFO] Epoch:[0/2](963000/4588595) loss:2.760 lr:0.0000100 epoch_Time:23097.0min: [2024-01-06 23:54:38,456][model8_pretrain.py][INFO] Epoch:[0/2](963000/4588595) loss:3.234 lr:0.0000100 epoch_Time:23097.0min: [2024-01-06 23:54:38,456][model8_pretrain.py][INFO] Epoch:[0/2](963000/4588595) loss:3.014 lr:0.0000100 epoch_Time:23097.0min: [2024-01-06 23:54:38,456][model8_pretrain.py][INFO] Epoch:[0/2](963000/4588595) loss:2.817 lr:0.0000100 epoch_Time:23097.0min: [2024-01-06 23:54:38,456][model8_pretrain.py][INFO] Epoch:[0/2](963000/4588595) loss:2.947 lr:0.0000100 epoch_Time:23097.0min: [2024-01-06 23:54:38,456][model8_pretrain.py][INFO] Epoch:[0/2](963000/4588595) loss:3.028 lr:0.0000100 epoch_Time:23097.0min: [2024-01-06 23:54:38,456][model8_pretrain.py][INFO] Epoch:[0/2](963000/4588595) loss:3.561 lr:0.0000100 epoch_Time:23097.0min: [2024-01-06 23:54:38,456][model8_pretrain.py][INFO] Epoch:[0/2](963000/4588595) loss:3.108 lr:0.0000100 epoch_Time:23097.0min: [2024-01-06 23:55:15,395][model8_pretrain.py][INFO] Epoch:[0/2](963100/4588595) loss:2.440 lr:0.0000100 epoch_Time:23096.0min: [2024-01-06 23:55:15,395][model8_pretrain.py][INFO] Epoch:[0/2](963100/4588595) loss:2.688 lr:0.0000100 epoch_Time:23096.0min: [2024-01-06 23:55:15,395][model8_pretrain.py][INFO] Epoch:[0/2](963100/4588595) loss:2.195 lr:0.0000100 epoch_Time:23096.0min: [2024-01-06 23:55:15,395][model8_pretrain.py][INFO] Epoch:[0/2](963100/4588595) loss:3.080 lr:0.0000100 
epoch_Time:23096.0min: [2024-01-06 23:55:15,395][model8_pretrain.py][INFO] Epoch:[0/2](963100/4588595) loss:3.024 lr:0.0000100 epoch_Time:23096.0min: [2024-01-06 23:55:15,395][model8_pretrain.py][INFO] Epoch:[0/2](963100/4588595) loss:2.704 lr:0.0000100 epoch_Time:23096.0min: [2024-01-06 23:55:15,396][model8_pretrain.py][INFO] Epoch:[0/2](963100/4588595) loss:2.456 lr:0.0000100 epoch_Time:23096.0min: [2024-01-06 23:55:15,396][model8_pretrain.py][INFO] Epoch:[0/2](963100/4588595) loss:2.603 lr:0.0000100 epoch_Time:23096.0min: [2024-01-06 23:55:52,345][model8_pretrain.py][INFO] Epoch:[0/2](963200/4588595) loss:2.845 lr:0.0000100 epoch_Time:23095.0min: [2024-01-06 23:55:52,345][model8_pretrain.py][INFO] Epoch:[0/2](963200/4588595) loss:2.719 lr:0.0000100 epoch_Time:23095.0min: [2024-01-06 23:55:52,345][model8_pretrain.py][INFO] Epoch:[0/2](963200/4588595) loss:2.625 lr:0.0000100 epoch_Time:23095.0min: [2024-01-06 23:55:52,345][model8_pretrain.py][INFO] Epoch:[0/2](963200/4588595) loss:2.817 lr:0.0000100 epoch_Time:23095.0min: [2024-01-06 23:55:52,345][model8_pretrain.py][INFO] Epoch:[0/2](963200/4588595) loss:3.013 lr:0.0000100 epoch_Time:23095.0min: [2024-01-06 23:55:52,345][model8_pretrain.py][INFO] Epoch:[0/2](963200/4588595) loss:2.950 lr:0.0000100 epoch_Time:23095.0min: [2024-01-06 23:55:52,345][model8_pretrain.py][INFO] Epoch:[0/2](963200/4588595) loss:3.160 lr:0.0000100 epoch_Time:23095.0min: [2024-01-06 23:55:52,345][model8_pretrain.py][INFO] Epoch:[0/2](963200/4588595) loss:3.288 lr:0.0000100 epoch_Time:23095.0min: [2024-01-06 23:56:29,291][model8_pretrain.py][INFO] Epoch:[0/2](963300/4588595) loss:3.032 lr:0.0000100 epoch_Time:23095.0min: [2024-01-06 23:56:29,291][model8_pretrain.py][INFO] Epoch:[0/2](963300/4588595) loss:2.452 lr:0.0000100 epoch_Time:23095.0min: [2024-01-06 23:56:29,291][model8_pretrain.py][INFO] Epoch:[0/2](963300/4588595) loss:3.326 lr:0.0000100 epoch_Time:23095.0min: [2024-01-06 23:56:29,291][model8_pretrain.py][INFO] 
Epoch:[0/2](963300/4588595) loss:3.117 lr:0.0000100 epoch_Time:23095.0min: [2024-01-06 23:56:29,291][model8_pretrain.py][INFO] Epoch:[0/2](963300/4588595) loss:2.382 lr:0.0000100 epoch_Time:23095.0min: [2024-01-06 23:56:29,291][model8_pretrain.py][INFO] Epoch:[0/2](963300/4588595) loss:2.123 lr:0.0000100 epoch_Time:23095.0min: [2024-01-06 23:56:29,291][model8_pretrain.py][INFO] Epoch:[0/2](963300/4588595) loss:2.439 lr:0.0000100 epoch_Time:23095.0min: [2024-01-06 23:56:29,292][model8_pretrain.py][INFO] Epoch:[0/2](963300/4588595) loss:2.587 lr:0.0000100 epoch_Time:23095.0min: [2024-01-06 23:57:06,231][model8_pretrain.py][INFO] Epoch:[0/2](963400/4588595) loss:2.105 lr:0.0000100 epoch_Time:23094.0min: [2024-01-06 23:57:06,231][model8_pretrain.py][INFO] Epoch:[0/2](963400/4588595) loss:3.460 lr:0.0000100 epoch_Time:23094.0min: [2024-01-06 23:57:06,231][model8_pretrain.py][INFO] Epoch:[0/2](963400/4588595) loss:2.934 lr:0.0000100 epoch_Time:23094.0min: [2024-01-06 23:57:06,231][model8_pretrain.py][INFO] Epoch:[0/2](963400/4588595) loss:2.600 lr:0.0000100 epoch_Time:23094.0min: [2024-01-06 23:57:06,231][model8_pretrain.py][INFO] Epoch:[0/2](963400/4588595) loss:3.393 lr:0.0000100 epoch_Time:23094.0min: [2024-01-06 23:57:06,231][model8_pretrain.py][INFO] Epoch:[0/2](963400/4588595) loss:2.792 lr:0.0000100 epoch_Time:23094.0min: [2024-01-06 23:57:06,231][model8_pretrain.py][INFO] Epoch:[0/2](963400/4588595) loss:2.382 lr:0.0000100 epoch_Time:23094.0min: [2024-01-06 23:57:06,231][model8_pretrain.py][INFO] Epoch:[0/2](963400/4588595) loss:2.637 lr:0.0000100 epoch_Time:23094.0min: [2024-01-06 23:57:44,730][model8_pretrain.py][INFO] Epoch:[0/2](963500/4588595) loss:2.786 lr:0.0000100 epoch_Time:23094.0min: [2024-01-06 23:57:44,730][model8_pretrain.py][INFO] Epoch:[0/2](963500/4588595) loss:2.811 lr:0.0000100 epoch_Time:23094.0min: [2024-01-06 23:57:44,730][model8_pretrain.py][INFO] Epoch:[0/2](963500/4588595) loss:2.758 lr:0.0000100 epoch_Time:23094.0min: [2024-01-06 
23:57:44,730][model8_pretrain.py][INFO] Epoch:[0/2](963500/4588595) loss:2.696 lr:0.0000100 epoch_Time:23094.0min: [2024-01-06 23:57:44,730][model8_pretrain.py][INFO] Epoch:[0/2](963500/4588595) loss:2.584 lr:0.0000100 epoch_Time:23094.0min: [2024-01-06 23:57:44,730][model8_pretrain.py][INFO] Epoch:[0/2](963500/4588595) loss:2.924 lr:0.0000100 epoch_Time:23094.0min: [2024-01-06 23:57:44,730][model8_pretrain.py][INFO] Epoch:[0/2](963500/4588595) loss:2.716 lr:0.0000100 epoch_Time:23094.0min: [2024-01-06 23:57:44,730][model8_pretrain.py][INFO] Epoch:[0/2](963500/4588595) loss:2.987 lr:0.0000100 epoch_Time:23094.0min: [2024-01-06 23:58:33,622][model8_pretrain.py][INFO] Epoch:[0/2](963600/4588595) loss:2.716 lr:0.0000100 epoch_Time:23094.0min: [2024-01-06 23:58:33,622][model8_pretrain.py][INFO] Epoch:[0/2](963600/4588595) loss:2.718 lr:0.0000100 epoch_Time:23094.0min: [2024-01-06 23:58:33,622][model8_pretrain.py][INFO] Epoch:[0/2](963600/4588595) loss:2.262 lr:0.0000100 epoch_Time:23094.0min: [2024-01-06 23:58:33,622][model8_pretrain.py][INFO] Epoch:[0/2](963600/4588595) loss:2.796 lr:0.0000100 epoch_Time:23094.0min: [2024-01-06 23:58:33,622][model8_pretrain.py][INFO] Epoch:[0/2](963600/4588595) loss:3.189 lr:0.0000100 epoch_Time:23094.0min: [2024-01-06 23:58:33,622][model8_pretrain.py][INFO] Epoch:[0/2](963600/4588595) loss:3.052 lr:0.0000100 epoch_Time:23094.0min: [2024-01-06 23:58:33,622][model8_pretrain.py][INFO] Epoch:[0/2](963600/4588595) loss:3.232 lr:0.0000100 epoch_Time:23094.0min: [2024-01-06 23:58:33,622][model8_pretrain.py][INFO] Epoch:[0/2](963600/4588595) loss:3.075 lr:0.0000100 epoch_Time:23094.0min: [2024-01-06 23:59:10,555][model8_pretrain.py][INFO] Epoch:[0/2](963700/4588595) loss:2.748 lr:0.0000100 epoch_Time:23093.0min: [2024-01-06 23:59:10,555][model8_pretrain.py][INFO] Epoch:[0/2](963700/4588595) loss:3.296 lr:0.0000100 epoch_Time:23093.0min: [2024-01-06 23:59:10,555][model8_pretrain.py][INFO] Epoch:[0/2](963700/4588595) loss:2.387 lr:0.0000100 
epoch_Time:23093.0min: [2024-01-06 23:59:10,555][model8_pretrain.py][INFO] Epoch:[0/2](963700/4588595) loss:2.377 lr:0.0000100 epoch_Time:23093.0min: [2024-01-06 23:59:10,555][model8_pretrain.py][INFO] Epoch:[0/2](963700/4588595) loss:2.668 lr:0.0000100 epoch_Time:23093.0min: [2024-01-06 23:59:10,555][model8_pretrain.py][INFO] Epoch:[0/2](963700/4588595) loss:2.778 lr:0.0000100 epoch_Time:23093.0min: [2024-01-06 23:59:10,556][model8_pretrain.py][INFO] Epoch:[0/2](963700/4588595) loss:3.041 lr:0.0000100 epoch_Time:23093.0min: [2024-01-06 23:59:10,556][model8_pretrain.py][INFO] Epoch:[0/2](963700/4588595) loss:3.040 lr:0.0000100 epoch_Time:23093.0min: [2024-01-06 23:59:47,492][model8_pretrain.py][INFO] Epoch:[0/2](963800/4588595) loss:2.761 lr:0.0000100 epoch_Time:23093.0min: [2024-01-06 23:59:47,493][model8_pretrain.py][INFO] Epoch:[0/2](963800/4588595) loss:2.315 lr:0.0000100 epoch_Time:23093.0min: [2024-01-06 23:59:47,493][model8_pretrain.py][INFO] Epoch:[0/2](963800/4588595) loss:2.751 lr:0.0000100 epoch_Time:23093.0min: [2024-01-06 23:59:47,493][model8_pretrain.py][INFO] Epoch:[0/2](963800/4588595) loss:2.520 lr:0.0000100 epoch_Time:23093.0min: [2024-01-06 23:59:47,493][model8_pretrain.py][INFO] Epoch:[0/2](963800/4588595) loss:2.943 lr:0.0000100 epoch_Time:23093.0min: [2024-01-06 23:59:47,493][model8_pretrain.py][INFO] Epoch:[0/2](963800/4588595) loss:2.803 lr:0.0000100 epoch_Time:23093.0min: [2024-01-06 23:59:47,493][model8_pretrain.py][INFO] Epoch:[0/2](963800/4588595) loss:2.723 lr:0.0000100 epoch_Time:23093.0min: [2024-01-06 23:59:47,493][model8_pretrain.py][INFO] Epoch:[0/2](963800/4588595) loss:2.648 lr:0.0000100 epoch_Time:23093.0min: [2024-01-07 00:00:24,433][model8_pretrain.py][INFO] Epoch:[0/2](963900/4588595) loss:2.876 lr:0.0000100 epoch_Time:23092.0min: [2024-01-07 00:00:24,433][model8_pretrain.py][INFO] Epoch:[0/2](963900/4588595) loss:2.968 lr:0.0000100 epoch_Time:23092.0min: [2024-01-07 00:00:24,433][model8_pretrain.py][INFO] 
Epoch:[0/2](963900/4588595) loss:2.750 lr:0.0000100 epoch_Time:23092.0min: [2024-01-07 00:00:24,433][model8_pretrain.py][INFO] Epoch:[0/2](963900/4588595) loss:2.463 lr:0.0000100 epoch_Time:23092.0min: [2024-01-07 00:00:24,433][model8_pretrain.py][INFO] Epoch:[0/2](963900/4588595) loss:2.638 lr:0.0000100 epoch_Time:23092.0min: [2024-01-07 00:00:24,433][model8_pretrain.py][INFO] Epoch:[0/2](963900/4588595) loss:2.769 lr:0.0000100 epoch_Time:23092.0min: [2024-01-07 00:00:24,433][model8_pretrain.py][INFO] Epoch:[0/2](963900/4588595) loss:3.191 lr:0.0000100 epoch_Time:23092.0min: [2024-01-07 00:00:24,433][model8_pretrain.py][INFO] Epoch:[0/2](963900/4588595) loss:1.903 lr:0.0000100 epoch_Time:23092.0min: [2024-01-07 00:01:01,380][model8_pretrain.py][INFO] Epoch:[0/2](964000/4588595) loss:3.377 lr:0.0000100 epoch_Time:23090.0min: [2024-01-07 00:01:01,380][model8_pretrain.py][INFO] Epoch:[0/2](964000/4588595) loss:3.228 lr:0.0000100 epoch_Time:23090.0min: [2024-01-07 00:01:01,380][model8_pretrain.py][INFO] Epoch:[0/2](964000/4588595) loss:2.586 lr:0.0000100 epoch_Time:23090.0min: [2024-01-07 00:01:01,380][model8_pretrain.py][INFO] Epoch:[0/2](964000/4588595) loss:2.392 lr:0.0000100 epoch_Time:23090.0min: [2024-01-07 00:01:01,380][model8_pretrain.py][INFO] Epoch:[0/2](964000/4588595) loss:2.692 lr:0.0000100 epoch_Time:23090.0min: [2024-01-07 00:01:01,380][model8_pretrain.py][INFO] Epoch:[0/2](964000/4588595) loss:2.680 lr:0.0000100 epoch_Time:23090.0min: [2024-01-07 00:01:01,380][model8_pretrain.py][INFO] Epoch:[0/2](964000/4588595) loss:2.750 lr:0.0000100 epoch_Time:23090.0min: [2024-01-07 00:01:01,381][model8_pretrain.py][INFO] Epoch:[0/2](964000/4588595) loss:2.943 lr:0.0000100 epoch_Time:23090.0min: [2024-01-07 00:01:38,321][model8_pretrain.py][INFO] Epoch:[0/2](964100/4588595) loss:2.772 lr:0.0000100 epoch_Time:23090.0min: [2024-01-07 00:01:38,321][model8_pretrain.py][INFO] Epoch:[0/2](964100/4588595) loss:2.518 lr:0.0000100 epoch_Time:23090.0min: [2024-01-07 
00:01:38,321][model8_pretrain.py][INFO] Epoch:[0/2](964100/4588595) loss:2.791 lr:0.0000100 epoch_Time:23090.0min: [2024-01-07 00:01:38,322][model8_pretrain.py][INFO] Epoch:[0/2](964100/4588595) loss:2.867 lr:0.0000100 epoch_Time:23090.0min: [2024-01-07 00:01:38,322][model8_pretrain.py][INFO] Epoch:[0/2](964100/4588595) loss:2.910 lr:0.0000100 epoch_Time:23090.0min: [2024-01-07 00:01:38,322][model8_pretrain.py][INFO] Epoch:[0/2](964100/4588595) loss:2.707 lr:0.0000100 epoch_Time:23090.0min: [2024-01-07 00:01:38,322][model8_pretrain.py][INFO] Epoch:[0/2](964100/4588595) loss:2.752 lr:0.0000100 epoch_Time:23090.0min: [2024-01-07 00:01:38,322][model8_pretrain.py][INFO] Epoch:[0/2](964100/4588595) loss:2.871 lr:0.0000100 epoch_Time:23090.0min: [2024-01-07 00:02:15,268][model8_pretrain.py][INFO] Epoch:[0/2](964200/4588595) loss:2.613 lr:0.0000100 epoch_Time:23089.0min: [2024-01-07 00:02:15,269][model8_pretrain.py][INFO] Epoch:[0/2](964200/4588595) loss:2.906 lr:0.0000100 epoch_Time:23089.0min: [2024-01-07 00:02:15,269][model8_pretrain.py][INFO] Epoch:[0/2](964200/4588595) loss:2.879 lr:0.0000100 epoch_Time:23089.0min: [2024-01-07 00:02:15,269][model8_pretrain.py][INFO] Epoch:[0/2](964200/4588595) loss:2.830 lr:0.0000100 epoch_Time:23089.0min: [2024-01-07 00:02:15,269][model8_pretrain.py][INFO] Epoch:[0/2](964200/4588595) loss:2.520 lr:0.0000100 epoch_Time:23089.0min: [2024-01-07 00:02:15,269][model8_pretrain.py][INFO] Epoch:[0/2](964200/4588595) loss:2.634 lr:0.0000100 epoch_Time:23089.0min: [2024-01-07 00:02:15,269][model8_pretrain.py][INFO] Epoch:[0/2](964200/4588595) loss:2.176 lr:0.0000100 epoch_Time:23089.0min: [2024-01-07 00:02:15,269][model8_pretrain.py][INFO] Epoch:[0/2](964200/4588595) loss:2.965 lr:0.0000100 epoch_Time:23089.0min: [2024-01-07 00:02:53,758][model8_pretrain.py][INFO] Epoch:[0/2](964300/4588595) loss:2.369 lr:0.0000100 epoch_Time:23088.0min: [2024-01-07 00:02:53,758][model8_pretrain.py][INFO] Epoch:[0/2](964300/4588595) loss:2.785 lr:0.0000100 
epoch_Time:23088.0min: [2024-01-07 00:02:53,758][model8_pretrain.py][INFO] Epoch:[0/2](964300/4588595) loss:2.887 lr:0.0000100 epoch_Time:23088.0min: [2024-01-07 00:02:53,758][model8_pretrain.py][INFO] Epoch:[0/2](964300/4588595) loss:2.573 lr:0.0000100 epoch_Time:23088.0min: [2024-01-07 00:02:53,758][model8_pretrain.py][INFO] Epoch:[0/2](964300/4588595) loss:3.388 lr:0.0000100 epoch_Time:23088.0min: [2024-01-07 00:02:53,758][model8_pretrain.py][INFO] Epoch:[0/2](964300/4588595) loss:2.751 lr:0.0000100 epoch_Time:23088.0min: [2024-01-07 00:02:53,759][model8_pretrain.py][INFO] Epoch:[0/2](964300/4588595) loss:2.302 lr:0.0000100 epoch_Time:23088.0min: [2024-01-07 00:02:53,759][model8_pretrain.py][INFO] Epoch:[0/2](964300/4588595) loss:2.714 lr:0.0000100 epoch_Time:23088.0min: [2024-01-07 00:03:42,471][model8_pretrain.py][INFO] Epoch:[0/2](964400/4588595) loss:2.592 lr:0.0000100 epoch_Time:23089.0min: [2024-01-07 00:03:42,471][model8_pretrain.py][INFO] Epoch:[0/2](964400/4588595) loss:3.320 lr:0.0000100 epoch_Time:23089.0min: [2024-01-07 00:03:42,471][model8_pretrain.py][INFO] Epoch:[0/2](964400/4588595) loss:2.370 lr:0.0000100 epoch_Time:23089.0min: [2024-01-07 00:03:42,471][model8_pretrain.py][INFO] Epoch:[0/2](964400/4588595) loss:2.394 lr:0.0000100 epoch_Time:23089.0min: [2024-01-07 00:03:42,471][model8_pretrain.py][INFO] Epoch:[0/2](964400/4588595) loss:2.882 lr:0.0000100 epoch_Time:23089.0min: [2024-01-07 00:03:42,471][model8_pretrain.py][INFO] Epoch:[0/2](964400/4588595) loss:2.428 lr:0.0000100 epoch_Time:23089.0min: [2024-01-07 00:03:42,471][model8_pretrain.py][INFO] Epoch:[0/2](964400/4588595) loss:2.368 lr:0.0000100 epoch_Time:23089.0min: [2024-01-07 00:03:42,471][model8_pretrain.py][INFO] Epoch:[0/2](964400/4588595) loss:2.755 lr:0.0000100 epoch_Time:23089.0min: [2024-01-07 00:04:19,398][model8_pretrain.py][INFO] Epoch:[0/2](964500/4588595) loss:3.515 lr:0.0000100 epoch_Time:23088.0min: [2024-01-07 00:04:19,398][model8_pretrain.py][INFO] 
Epoch:[0/2](964500/4588595) loss:2.232 lr:0.0000100 epoch_Time:23088.0min: [2024-01-07 00:04:19,398][model8_pretrain.py][INFO] Epoch:[0/2](964500/4588595) loss:2.944 lr:0.0000100 epoch_Time:23088.0min: [2024-01-07 00:04:19,398][model8_pretrain.py][INFO] Epoch:[0/2](964500/4588595) loss:3.315 lr:0.0000100 epoch_Time:23088.0min: [2024-01-07 00:04:19,398][model8_pretrain.py][INFO] Epoch:[0/2](964500/4588595) loss:2.511 lr:0.0000100 epoch_Time:23088.0min: [2024-01-07 00:04:19,398][model8_pretrain.py][INFO] Epoch:[0/2](964500/4588595) loss:2.724 lr:0.0000100 epoch_Time:23088.0min: [2024-01-07 00:04:19,398][model8_pretrain.py][INFO] Epoch:[0/2](964500/4588595) loss:2.687 lr:0.0000100 epoch_Time:23088.0min: [2024-01-07 00:04:19,398][model8_pretrain.py][INFO] Epoch:[0/2](964500/4588595) loss:2.296 lr:0.0000100 epoch_Time:23088.0min: [2024-01-07 00:04:56,327][model8_pretrain.py][INFO] Epoch:[0/2](964600/4588595) loss:2.738 lr:0.0000100 epoch_Time:23087.0min: [2024-01-07 00:04:56,327][model8_pretrain.py][INFO] Epoch:[0/2](964600/4588595) loss:2.327 lr:0.0000100 epoch_Time:23087.0min: [2024-01-07 00:04:56,327][model8_pretrain.py][INFO] Epoch:[0/2](964600/4588595) loss:2.960 lr:0.0000100 epoch_Time:23087.0min: [2024-01-07 00:04:56,327][model8_pretrain.py][INFO] Epoch:[0/2](964600/4588595) loss:2.707 lr:0.0000100 epoch_Time:23087.0min: [2024-01-07 00:04:56,327][model8_pretrain.py][INFO] Epoch:[0/2](964600/4588595) loss:2.945 lr:0.0000100 epoch_Time:23087.0min: [2024-01-07 00:04:56,327][model8_pretrain.py][INFO] Epoch:[0/2](964600/4588595) loss:2.903 lr:0.0000100 epoch_Time:23087.0min: [2024-01-07 00:04:56,327][model8_pretrain.py][INFO] Epoch:[0/2](964600/4588595) loss:2.298 lr:0.0000100 epoch_Time:23087.0min: [2024-01-07 00:04:56,327][model8_pretrain.py][INFO] Epoch:[0/2](964600/4588595) loss:2.234 lr:0.0000100 epoch_Time:23087.0min: [2024-01-07 00:05:33,259][model8_pretrain.py][INFO] Epoch:[0/2](964700/4588595) loss:3.187 lr:0.0000100 epoch_Time:23087.0min: [2024-01-07 
00:05:33,259][model8_pretrain.py][INFO] Epoch:[0/2](964700/4588595) loss:2.823 lr:0.0000100 epoch_Time:23087.0min: [2024-01-07 00:05:33,259][model8_pretrain.py][INFO] Epoch:[0/2](964700/4588595) loss:2.730 lr:0.0000100 epoch_Time:23087.0min: [2024-01-07 00:05:33,259][model8_pretrain.py][INFO] Epoch:[0/2](964700/4588595) loss:2.268 lr:0.0000100 epoch_Time:23087.0min: [2024-01-07 00:05:33,259][model8_pretrain.py][INFO] Epoch:[0/2](964700/4588595) loss:3.176 lr:0.0000100 epoch_Time:23087.0min: [2024-01-07 00:05:33,259][model8_pretrain.py][INFO] Epoch:[0/2](964700/4588595) loss:2.251 lr:0.0000100 epoch_Time:23087.0min: [2024-01-07 00:05:33,259][model8_pretrain.py][INFO] Epoch:[0/2](964700/4588595) loss:2.824 lr:0.0000100 epoch_Time:23087.0min: [2024-01-07 00:05:33,260][model8_pretrain.py][INFO] Epoch:[0/2](964700/4588595) loss:2.010 lr:0.0000100 epoch_Time:23087.0min: [2024-01-07 00:06:10,209][model8_pretrain.py][INFO] Epoch:[0/2](964800/4588595) loss:2.817 lr:0.0000100 epoch_Time:23086.0min: [2024-01-07 00:06:10,209][model8_pretrain.py][INFO] Epoch:[0/2](964800/4588595) loss:3.149 lr:0.0000100 epoch_Time:23086.0min: [2024-01-07 00:06:10,209][model8_pretrain.py][INFO] Epoch:[0/2](964800/4588595) loss:2.704 lr:0.0000100 epoch_Time:23086.0min: [2024-01-07 00:06:10,209][model8_pretrain.py][INFO] Epoch:[0/2](964800/4588595) loss:3.110 lr:0.0000100 epoch_Time:23086.0min: [2024-01-07 00:06:10,209][model8_pretrain.py][INFO] Epoch:[0/2](964800/4588595) loss:3.067 lr:0.0000100 epoch_Time:23086.0min: [2024-01-07 00:06:10,209][model8_pretrain.py][INFO] Epoch:[0/2](964800/4588595) loss:2.657 lr:0.0000100 epoch_Time:23086.0min: [2024-01-07 00:06:10,209][model8_pretrain.py][INFO] Epoch:[0/2](964800/4588595) loss:2.964 lr:0.0000100 epoch_Time:23086.0min: [2024-01-07 00:06:10,209][model8_pretrain.py][INFO] Epoch:[0/2](964800/4588595) loss:2.740 lr:0.0000100 epoch_Time:23086.0min: [2024-01-07 00:06:47,145][model8_pretrain.py][INFO] Epoch:[0/2](964900/4588595) loss:2.229 lr:0.0000100 
epoch_Time:23086.0min: [2024-01-07 00:06:47,145][model8_pretrain.py][INFO] Epoch:[0/2](964900/4588595) loss:3.095 lr:0.0000100 epoch_Time:23086.0min: [2024-01-07 00:06:47,145][model8_pretrain.py][INFO] Epoch:[0/2](964900/4588595) loss:3.093 lr:0.0000100 epoch_Time:23086.0min: [2024-01-07 00:06:47,145][model8_pretrain.py][INFO] Epoch:[0/2](964900/4588595) loss:3.294 lr:0.0000100 epoch_Time:23086.0min: [2024-01-07 00:06:47,145][model8_pretrain.py][INFO] Epoch:[0/2](964900/4588595) loss:2.825 lr:0.0000100 epoch_Time:23086.0min: [2024-01-07 00:06:47,146][model8_pretrain.py][INFO] Epoch:[0/2](964900/4588595) loss:2.633 lr:0.0000100 epoch_Time:23086.0min: [2024-01-07 00:06:47,146][model8_pretrain.py][INFO] Epoch:[0/2](964900/4588595) loss:2.440 lr:0.0000100 epoch_Time:23086.0min: [2024-01-07 00:06:47,146][model8_pretrain.py][INFO] Epoch:[0/2](964900/4588595) loss:2.787 lr:0.0000100 epoch_Time:23086.0min: [2024-01-07 00:07:24,096][model8_pretrain.py][INFO] Epoch:[0/2](965000/4588595) loss:2.599 lr:0.0000100 epoch_Time:23084.0min: [2024-01-07 00:07:24,096][model8_pretrain.py][INFO] Epoch:[0/2](965000/4588595) loss:2.916 lr:0.0000100 epoch_Time:23084.0min: [2024-01-07 00:07:24,096][model8_pretrain.py][INFO] Epoch:[0/2](965000/4588595) loss:2.588 lr:0.0000100 epoch_Time:23084.0min: [2024-01-07 00:07:24,096][model8_pretrain.py][INFO] Epoch:[0/2](965000/4588595) loss:2.434 lr:0.0000100 epoch_Time:23084.0min: [2024-01-07 00:07:24,096][model8_pretrain.py][INFO] Epoch:[0/2](965000/4588595) loss:2.104 lr:0.0000100 epoch_Time:23084.0min: [2024-01-07 00:07:24,096][model8_pretrain.py][INFO] Epoch:[0/2](965000/4588595) loss:2.744 lr:0.0000100 epoch_Time:23084.0min: [2024-01-07 00:07:24,096][model8_pretrain.py][INFO] Epoch:[0/2](965000/4588595) loss:2.620 lr:0.0000100 epoch_Time:23084.0min: [2024-01-07 00:07:24,096][model8_pretrain.py][INFO] Epoch:[0/2](965000/4588595) loss:2.558 lr:0.0000100 epoch_Time:23084.0min: [2024-01-07 00:08:01,039][model8_pretrain.py][INFO] 
Epoch:[0/2](965100/4588595) loss:2.067 lr:0.0000100 epoch_Time:23083.0min: [2024-01-07 00:08:01,039][model8_pretrain.py][INFO] Epoch:[0/2](965100/4588595) loss:3.269 lr:0.0000100 epoch_Time:23083.0min: [2024-01-07 00:08:01,039][model8_pretrain.py][INFO] Epoch:[0/2](965100/4588595) loss:3.029 lr:0.0000100 epoch_Time:23083.0min: [2024-01-07 00:08:01,039][model8_pretrain.py][INFO] Epoch:[0/2](965100/4588595) loss:2.619 lr:0.0000100 epoch_Time:23083.0min: [2024-01-07 00:08:01,039][model8_pretrain.py][INFO] Epoch:[0/2](965100/4588595) loss:2.743 lr:0.0000100 epoch_Time:23083.0min: [2024-01-07 00:08:01,039][model8_pretrain.py][INFO] Epoch:[0/2](965100/4588595) loss:2.741 lr:0.0000100 epoch_Time:23083.0min: [2024-01-07 00:08:01,039][model8_pretrain.py][INFO] Epoch:[0/2](965100/4588595) loss:2.882 lr:0.0000100 epoch_Time:23083.0min: [2024-01-07 00:08:02,720][model8_pretrain.py][INFO] Epoch:[0/2](965100/4588595) loss:2.784 lr:0.0000100 epoch_Time:23084.0min: [2024-01-07 00:08:51,569][model8_pretrain.py][INFO] Epoch:[0/2](965200/4588595) loss:2.418 lr:0.0000100 epoch_Time:23083.0min: [2024-01-07 00:08:51,570][model8_pretrain.py][INFO] Epoch:[0/2](965200/4588595) loss:2.777 lr:0.0000100 epoch_Time:23083.0min: [2024-01-07 00:08:51,570][model8_pretrain.py][INFO] Epoch:[0/2](965200/4588595) loss:3.146 lr:0.0000100 epoch_Time:23083.0min: [2024-01-07 00:08:51,570][model8_pretrain.py][INFO] Epoch:[0/2](965200/4588595) loss:2.206 lr:0.0000100 epoch_Time:23083.0min: [2024-01-07 00:08:51,570][model8_pretrain.py][INFO] Epoch:[0/2](965200/4588595) loss:2.971 lr:0.0000100 epoch_Time:23083.0min: [2024-01-07 00:08:51,570][model8_pretrain.py][INFO] Epoch:[0/2](965200/4588595) loss:2.134 lr:0.0000100 epoch_Time:23083.0min: [2024-01-07 00:08:51,570][model8_pretrain.py][INFO] Epoch:[0/2](965200/4588595) loss:3.005 lr:0.0000100 epoch_Time:23083.0min: [2024-01-07 00:08:51,570][model8_pretrain.py][INFO] Epoch:[0/2](965200/4588595) loss:3.065 lr:0.0000100 epoch_Time:23083.0min: [2024-01-07 
00:09:28,507][model8_pretrain.py][INFO] Epoch:[0/2](965300/4588595) loss:3.362 lr:0.0000100 epoch_Time:23083.0min: [2024-01-07 00:09:28,507][model8_pretrain.py][INFO] Epoch:[0/2](965300/4588595) loss:3.317 lr:0.0000100 epoch_Time:23083.0min: [2024-01-07 00:09:28,507][model8_pretrain.py][INFO] Epoch:[0/2](965300/4588595) loss:2.929 lr:0.0000100 epoch_Time:23083.0min: [2024-01-07 00:09:28,507][model8_pretrain.py][INFO] Epoch:[0/2](965300/4588595) loss:3.047 lr:0.0000100 epoch_Time:23083.0min: [2024-01-07 00:09:28,507][model8_pretrain.py][INFO] Epoch:[0/2](965300/4588595) loss:2.556 lr:0.0000100 epoch_Time:23083.0min: [2024-01-07 00:09:28,507][model8_pretrain.py][INFO] Epoch:[0/2](965300/4588595) loss:2.872 lr:0.0000100 epoch_Time:23083.0min: [2024-01-07 00:09:28,507][model8_pretrain.py][INFO] Epoch:[0/2](965300/4588595) loss:2.806 lr:0.0000100 epoch_Time:23083.0min: [2024-01-07 00:09:28,508][model8_pretrain.py][INFO] Epoch:[0/2](965300/4588595) loss:2.692 lr:0.0000100 epoch_Time:23083.0min: [2024-01-07 00:10:05,449][model8_pretrain.py][INFO] Epoch:[0/2](965400/4588595) loss:3.476 lr:0.0000100 epoch_Time:23082.0min: [2024-01-07 00:10:05,449][model8_pretrain.py][INFO] Epoch:[0/2](965400/4588595) loss:3.047 lr:0.0000100 epoch_Time:23082.0min: [2024-01-07 00:10:05,449][model8_pretrain.py][INFO] Epoch:[0/2](965400/4588595) loss:3.286 lr:0.0000100 epoch_Time:23082.0min: [2024-01-07 00:10:05,449][model8_pretrain.py][INFO] Epoch:[0/2](965400/4588595) loss:2.382 lr:0.0000100 epoch_Time:23082.0min: [2024-01-07 00:10:05,449][model8_pretrain.py][INFO] Epoch:[0/2](965400/4588595) loss:2.832 lr:0.0000100 epoch_Time:23082.0min: [2024-01-07 00:10:05,449][model8_pretrain.py][INFO] Epoch:[0/2](965400/4588595) loss:2.510 lr:0.0000100 epoch_Time:23082.0min: [2024-01-07 00:10:05,449][model8_pretrain.py][INFO] Epoch:[0/2](965400/4588595) loss:2.744 lr:0.0000100 epoch_Time:23082.0min: [2024-01-07 00:10:05,449][model8_pretrain.py][INFO] Epoch:[0/2](965400/4588595) loss:2.924 lr:0.0000100 
epoch_Time:23082.0min: [2024-01-07 00:10:42,386][model8_pretrain.py][INFO] Epoch:[0/2](965500/4588595) loss:3.117 lr:0.0000100 epoch_Time:23082.0min: [2024-01-07 00:10:42,386][model8_pretrain.py][INFO] Epoch:[0/2](965500/4588595) loss:2.802 lr:0.0000100 epoch_Time:23082.0min: [2024-01-07 00:10:42,386][model8_pretrain.py][INFO] Epoch:[0/2](965500/4588595) loss:2.082 lr:0.0000100 epoch_Time:23082.0min: [2024-01-07 00:10:42,386][model8_pretrain.py][INFO] Epoch:[0/2](965500/4588595) loss:2.807 lr:0.0000100 epoch_Time:23082.0min: [2024-01-07 00:10:42,386][model8_pretrain.py][INFO] Epoch:[0/2](965500/4588595) loss:2.589 lr:0.0000100 epoch_Time:23082.0min: [2024-01-07 00:10:42,386][model8_pretrain.py][INFO] Epoch:[0/2](965500/4588595) loss:2.310 lr:0.0000100 epoch_Time:23082.0min: [2024-01-07 00:10:42,386][model8_pretrain.py][INFO] Epoch:[0/2](965500/4588595) loss:2.759 lr:0.0000100 epoch_Time:23082.0min: [2024-01-07 00:10:42,386][model8_pretrain.py][INFO] Epoch:[0/2](965500/4588595) loss:2.390 lr:0.0000100 epoch_Time:23082.0min: [2024-01-07 00:11:19,331][model8_pretrain.py][INFO] Epoch:[0/2](965600/4588595) loss:2.765 lr:0.0000100 epoch_Time:23081.0min: [2024-01-07 00:11:19,331][model8_pretrain.py][INFO] Epoch:[0/2](965600/4588595) loss:2.216 lr:0.0000100 epoch_Time:23081.0min: [2024-01-07 00:11:19,331][model8_pretrain.py][INFO] Epoch:[0/2](965600/4588595) loss:2.408 lr:0.0000100 epoch_Time:23081.0min: [2024-01-07 00:11:19,331][model8_pretrain.py][INFO] Epoch:[0/2](965600/4588595) loss:2.213 lr:0.0000100 epoch_Time:23081.0min: [2024-01-07 00:11:19,331][model8_pretrain.py][INFO] Epoch:[0/2](965600/4588595) loss:2.235 lr:0.0000100 epoch_Time:23081.0min: [2024-01-07 00:11:19,332][model8_pretrain.py][INFO] Epoch:[0/2](965600/4588595) loss:2.589 lr:0.0000100 epoch_Time:23081.0min: [2024-01-07 00:11:19,332][model8_pretrain.py][INFO] Epoch:[0/2](965600/4588595) loss:2.578 lr:0.0000100 epoch_Time:23081.0min: [2024-01-07 00:11:19,332][model8_pretrain.py][INFO] 
Epoch:[0/2](965600/4588595) loss:2.802 lr:0.0000100 epoch_Time:23081.0min: [2024-01-07 00:11:56,278][model8_pretrain.py][INFO] Epoch:[0/2](965700/4588595) loss:2.835 lr:0.0000100 epoch_Time:23080.0min: [2024-01-07 00:11:56,278][model8_pretrain.py][INFO] Epoch:[0/2](965700/4588595) loss:2.725 lr:0.0000100 epoch_Time:23080.0min: [2024-01-07 00:11:56,278][model8_pretrain.py][INFO] Epoch:[0/2](965700/4588595) loss:2.742 lr:0.0000100 epoch_Time:23080.0min: [2024-01-07 00:11:56,278][model8_pretrain.py][INFO] Epoch:[0/2](965700/4588595) loss:2.985 lr:0.0000100 epoch_Time:23080.0min: [2024-01-07 00:11:56,278][model8_pretrain.py][INFO] Epoch:[0/2](965700/4588595) loss:3.054 lr:0.0000100 epoch_Time:23080.0min: [2024-01-07 00:11:56,278][model8_pretrain.py][INFO] Epoch:[0/2](965700/4588595) loss:2.732 lr:0.0000100 epoch_Time:23080.0min: [2024-01-07 00:11:56,278][model8_pretrain.py][INFO] Epoch:[0/2](965700/4588595) loss:2.728 lr:0.0000100 epoch_Time:23080.0min: [2024-01-07 00:11:56,278][model8_pretrain.py][INFO] Epoch:[0/2](965700/4588595) loss:2.310 lr:0.0000100 epoch_Time:23080.0min: [2024-01-07 00:12:33,226][model8_pretrain.py][INFO] Epoch:[0/2](965800/4588595) loss:2.592 lr:0.0000100 epoch_Time:23080.0min: [2024-01-07 00:12:33,226][model8_pretrain.py][INFO] Epoch:[0/2](965800/4588595) loss:2.697 lr:0.0000100 epoch_Time:23080.0min: [2024-01-07 00:12:33,226][model8_pretrain.py][INFO] Epoch:[0/2](965800/4588595) loss:2.665 lr:0.0000100 epoch_Time:23080.0min: [2024-01-07 00:12:33,226][model8_pretrain.py][INFO] Epoch:[0/2](965800/4588595) loss:3.077 lr:0.0000100 epoch_Time:23080.0min: [2024-01-07 00:12:33,226][model8_pretrain.py][INFO] Epoch:[0/2](965800/4588595) loss:2.550 lr:0.0000100 epoch_Time:23080.0min: [2024-01-07 00:12:33,226][model8_pretrain.py][INFO] Epoch:[0/2](965800/4588595) loss:2.608 lr:0.0000100 epoch_Time:23080.0min: [2024-01-07 00:12:33,226][model8_pretrain.py][INFO] Epoch:[0/2](965800/4588595) loss:2.411 lr:0.0000100 epoch_Time:23080.0min: [2024-01-07 
00:12:33,226][model8_pretrain.py][INFO] Epoch:[0/2](965800/4588595) loss:3.358 lr:0.0000100 epoch_Time:23080.0min: [2024-01-07 00:13:10,170][model8_pretrain.py][INFO] Epoch:[0/2](965900/4588595) loss:2.470 lr:0.0000100 epoch_Time:23079.0min: [2024-01-07 00:13:10,170][model8_pretrain.py][INFO] Epoch:[0/2](965900/4588595) loss:2.537 lr:0.0000100 epoch_Time:23079.0min: [2024-01-07 00:13:10,170][model8_pretrain.py][INFO] Epoch:[0/2](965900/4588595) loss:2.881 lr:0.0000100 epoch_Time:23079.0min: [2024-01-07 00:13:10,170][model8_pretrain.py][INFO] Epoch:[0/2](965900/4588595) loss:3.089 lr:0.0000100 epoch_Time:23079.0min: [2024-01-07 00:13:10,170][model8_pretrain.py][INFO] Epoch:[0/2](965900/4588595) loss:2.836 lr:0.0000100 epoch_Time:23079.0min: [2024-01-07 00:13:10,170][model8_pretrain.py][INFO] Epoch:[0/2](965900/4588595) loss:2.586 lr:0.0000100 epoch_Time:23079.0min: [2024-01-07 00:13:10,170][model8_pretrain.py][INFO] Epoch:[0/2](965900/4588595) loss:3.034 lr:0.0000100 epoch_Time:23079.0min: [2024-01-07 00:13:10,170][model8_pretrain.py][INFO] Epoch:[0/2](965900/4588595) loss:2.750 lr:0.0000100 epoch_Time:23079.0min: [2024-01-07 00:14:00,732][model8_pretrain.py][INFO] Epoch:[0/2](966000/4588595) loss:3.205 lr:0.0000100 epoch_Time:23079.0min: [2024-01-07 00:14:00,732][model8_pretrain.py][INFO] Epoch:[0/2](966000/4588595) loss:2.582 lr:0.0000100 epoch_Time:23079.0min: [2024-01-07 00:14:00,732][model8_pretrain.py][INFO] Epoch:[0/2](966000/4588595) loss:2.990 lr:0.0000100 epoch_Time:23079.0min: [2024-01-07 00:14:00,732][model8_pretrain.py][INFO] Epoch:[0/2](966000/4588595) loss:2.702 lr:0.0000100 epoch_Time:23079.0min: [2024-01-07 00:14:00,732][model8_pretrain.py][INFO] Epoch:[0/2](966000/4588595) loss:3.245 lr:0.0000100 epoch_Time:23079.0min: [2024-01-07 00:14:00,732][model8_pretrain.py][INFO] Epoch:[0/2](966000/4588595) loss:2.842 lr:0.0000100 epoch_Time:23079.0min: [2024-01-07 00:14:00,732][model8_pretrain.py][INFO] Epoch:[0/2](966000/4588595) loss:2.961 lr:0.0000100 
epoch_Time:23079.0min: [2024-01-07 00:14:00,732][model8_pretrain.py][INFO] Epoch:[0/2](966000/4588595) loss:2.542 lr:0.0000100 epoch_Time:23079.0min: [2024-01-07 00:14:37,668][model8_pretrain.py][INFO] Epoch:[0/2](966100/4588595) loss:2.566 lr:0.0000100 epoch_Time:23079.0min: [2024-01-07 00:14:37,668][model8_pretrain.py][INFO] Epoch:[0/2](966100/4588595) loss:2.325 lr:0.0000100 epoch_Time:23079.0min: [2024-01-07 00:14:37,668][model8_pretrain.py][INFO] Epoch:[0/2](966100/4588595) loss:2.759 lr:0.0000100 epoch_Time:23079.0min: [2024-01-07 00:14:37,668][model8_pretrain.py][INFO] Epoch:[0/2](966100/4588595) loss:2.951 lr:0.0000100 epoch_Time:23079.0min: [2024-01-07 00:14:37,668][model8_pretrain.py][INFO] Epoch:[0/2](966100/4588595) loss:2.609 lr:0.0000100 epoch_Time:23079.0min: [2024-01-07 00:14:37,668][model8_pretrain.py][INFO] Epoch:[0/2](966100/4588595) loss:2.987 lr:0.0000100 epoch_Time:23079.0min: [2024-01-07 00:14:37,668][model8_pretrain.py][INFO] Epoch:[0/2](966100/4588595) loss:3.192 lr:0.0000100 epoch_Time:23079.0min: [2024-01-07 00:14:37,668][model8_pretrain.py][INFO] Epoch:[0/2](966100/4588595) loss:2.833 lr:0.0000100 epoch_Time:23079.0min: [2024-01-07 00:15:14,609][model8_pretrain.py][INFO] Epoch:[0/2](966200/4588595) loss:3.117 lr:0.0000100 epoch_Time:23077.0min: [2024-01-07 00:15:14,609][model8_pretrain.py][INFO] Epoch:[0/2](966200/4588595) loss:2.682 lr:0.0000100 epoch_Time:23077.0min: [2024-01-07 00:15:14,609][model8_pretrain.py][INFO] Epoch:[0/2](966200/4588595) loss:2.857 lr:0.0000100 epoch_Time:23077.0min: [2024-01-07 00:15:14,609][model8_pretrain.py][INFO] Epoch:[0/2](966200/4588595) loss:2.693 lr:0.0000100 epoch_Time:23077.0min: [2024-01-07 00:15:14,609][model8_pretrain.py][INFO] Epoch:[0/2](966200/4588595) loss:2.816 lr:0.0000100 epoch_Time:23077.0min: [2024-01-07 00:15:14,609][model8_pretrain.py][INFO] Epoch:[0/2](966200/4588595) loss:2.753 lr:0.0000100 epoch_Time:23077.0min: [2024-01-07 00:15:14,609][model8_pretrain.py][INFO] 
Epoch:[0/2](966200/4588595) loss:2.157 lr:0.0000100 epoch_Time:23077.0min: [2024-01-07 00:15:14,610][model8_pretrain.py][INFO] Epoch:[0/2](966200/4588595) loss:2.654 lr:0.0000100 epoch_Time:23077.0min: [2024-01-07 00:15:51,545][model8_pretrain.py][INFO] Epoch:[0/2](966300/4588595) loss:2.855 lr:0.0000100 epoch_Time:23076.0min: [2024-01-07 00:15:51,545][model8_pretrain.py][INFO] Epoch:[0/2](966300/4588595) loss:3.114 lr:0.0000100 epoch_Time:23076.0min: [2024-01-07 00:15:51,545][model8_pretrain.py][INFO] Epoch:[0/2](966300/4588595) loss:2.526 lr:0.0000100 epoch_Time:23076.0min: [2024-01-07 00:15:51,545][model8_pretrain.py][INFO] Epoch:[0/2](966300/4588595) loss:2.739 lr:0.0000100 epoch_Time:23076.0min: [2024-01-07 00:15:51,545][model8_pretrain.py][INFO] Epoch:[0/2](966300/4588595) loss:2.652 lr:0.0000100 epoch_Time:23076.0min: [2024-01-07 00:15:51,545][model8_pretrain.py][INFO] Epoch:[0/2](966300/4588595) loss:1.973 lr:0.0000100 epoch_Time:23076.0min: [2024-01-07 00:15:51,545][model8_pretrain.py][INFO] Epoch:[0/2](966300/4588595) loss:3.157 lr:0.0000100 epoch_Time:23076.0min: [2024-01-07 00:15:51,546][model8_pretrain.py][INFO] Epoch:[0/2](966300/4588595) loss:2.717 lr:0.0000100 epoch_Time:23076.0min: [2024-01-07 00:16:28,486][model8_pretrain.py][INFO] Epoch:[0/2](966400/4588595) loss:2.808 lr:0.0000100 epoch_Time:23076.0min: [2024-01-07 00:16:28,486][model8_pretrain.py][INFO] Epoch:[0/2](966400/4588595) loss:2.907 lr:0.0000100 epoch_Time:23076.0min: [2024-01-07 00:16:28,486][model8_pretrain.py][INFO] Epoch:[0/2](966400/4588595) loss:2.974 lr:0.0000100 epoch_Time:23076.0min: [2024-01-07 00:16:28,486][model8_pretrain.py][INFO] Epoch:[0/2](966400/4588595) loss:2.963 lr:0.0000100 epoch_Time:23076.0min: [2024-01-07 00:16:28,486][model8_pretrain.py][INFO] Epoch:[0/2](966400/4588595) loss:2.693 lr:0.0000100 epoch_Time:23076.0min: [2024-01-07 00:16:28,486][model8_pretrain.py][INFO] Epoch:[0/2](966400/4588595) loss:2.499 lr:0.0000100 epoch_Time:23076.0min: [2024-01-07 
00:16:28,486][model8_pretrain.py][INFO] Epoch:[0/2](966400/4588595) loss:2.529 lr:0.0000100 epoch_Time:23076.0min: [2024-01-07 00:16:28,487][model8_pretrain.py][INFO] Epoch:[0/2](966400/4588595) loss:2.609 lr:0.0000100 epoch_Time:23076.0min: [2024-01-07 00:17:05,399][model8_pretrain.py][INFO] Epoch:[0/2](966500/4588595) loss:2.648 lr:0.0000100 epoch_Time:23075.0min: [2024-01-07 00:17:05,399][model8_pretrain.py][INFO] Epoch:[0/2](966500/4588595) loss:2.772 lr:0.0000100 epoch_Time:23075.0min: [2024-01-07 00:17:05,399][model8_pretrain.py][INFO] Epoch:[0/2](966500/4588595) loss:3.148 lr:0.0000100 epoch_Time:23075.0min: [2024-01-07 00:17:05,399][model8_pretrain.py][INFO] Epoch:[0/2](966500/4588595) loss:2.359 lr:0.0000100 epoch_Time:23075.0min: [2024-01-07 00:17:05,399][model8_pretrain.py][INFO] Epoch:[0/2](966500/4588595) loss:2.883 lr:0.0000100 epoch_Time:23075.0min: [2024-01-07 00:17:05,399][model8_pretrain.py][INFO] Epoch:[0/2](966500/4588595) loss:3.514 lr:0.0000100 epoch_Time:23075.0min: [2024-01-07 00:17:05,400][model8_pretrain.py][INFO] Epoch:[0/2](966500/4588595) loss:2.808 lr:0.0000100 epoch_Time:23075.0min: [2024-01-07 00:17:05,400][model8_pretrain.py][INFO] Epoch:[0/2](966500/4588595) loss:2.868 lr:0.0000100 epoch_Time:23075.0min: [2024-01-07 00:17:42,335][model8_pretrain.py][INFO] Epoch:[0/2](966600/4588595) loss:2.987 lr:0.0000100 epoch_Time:23075.0min: [2024-01-07 00:17:42,335][model8_pretrain.py][INFO] Epoch:[0/2](966600/4588595) loss:2.302 lr:0.0000100 epoch_Time:23075.0min: [2024-01-07 00:17:42,335][model8_pretrain.py][INFO] Epoch:[0/2](966600/4588595) loss:2.119 lr:0.0000100 epoch_Time:23075.0min: [2024-01-07 00:17:42,335][model8_pretrain.py][INFO] Epoch:[0/2](966600/4588595) loss:3.158 lr:0.0000100 epoch_Time:23075.0min: [2024-01-07 00:17:42,335][model8_pretrain.py][INFO] Epoch:[0/2](966600/4588595) loss:2.789 lr:0.0000100 epoch_Time:23075.0min: [2024-01-07 00:17:42,336][model8_pretrain.py][INFO] Epoch:[0/2](966600/4588595) loss:2.562 lr:0.0000100 
epoch_Time:23075.0min: [2024-01-07 00:17:42,336][model8_pretrain.py][INFO] Epoch:[0/2](966600/4588595) loss:2.841 lr:0.0000100 epoch_Time:23075.0min: [2024-01-07 00:17:42,336][model8_pretrain.py][INFO] Epoch:[0/2](966600/4588595) loss:2.850 lr:0.0000100 epoch_Time:23075.0min: [2024-01-07 00:18:19,275][model8_pretrain.py][INFO] Epoch:[0/2](966700/4588595) loss:2.876 lr:0.0000100 epoch_Time:23074.0min: [2024-01-07 00:18:19,275][model8_pretrain.py][INFO] Epoch:[0/2](966700/4588595) loss:3.006 lr:0.0000100 epoch_Time:23074.0min: [2024-01-07 00:18:19,275][model8_pretrain.py][INFO] Epoch:[0/2](966700/4588595) loss:2.830 lr:0.0000100 epoch_Time:23074.0min: [2024-01-07 00:18:19,275][model8_pretrain.py][INFO] Epoch:[0/2](966700/4588595) loss:3.173 lr:0.0000100 epoch_Time:23074.0min: [2024-01-07 00:18:19,275][model8_pretrain.py][INFO] Epoch:[0/2](966700/4588595) loss:2.875 lr:0.0000100 epoch_Time:23074.0min: [2024-01-07 00:18:19,276][model8_pretrain.py][INFO] Epoch:[0/2](966700/4588595) loss:3.043 lr:0.0000100 epoch_Time:23074.0min: [2024-01-07 00:18:19,276][model8_pretrain.py][INFO] Epoch:[0/2](966700/4588595) loss:3.024 lr:0.0000100 epoch_Time:23074.0min: [2024-01-07 00:18:19,276][model8_pretrain.py][INFO] Epoch:[0/2](966700/4588595) loss:2.626 lr:0.0000100 epoch_Time:23074.0min: [2024-01-07 00:19:09,808][model8_pretrain.py][INFO] Epoch:[0/2](966800/4588595) loss:2.998 lr:0.0000100 epoch_Time:23074.0min: [2024-01-07 00:19:09,808][model8_pretrain.py][INFO] Epoch:[0/2](966800/4588595) loss:2.823 lr:0.0000100 epoch_Time:23074.0min: [2024-01-07 00:19:09,808][model8_pretrain.py][INFO] Epoch:[0/2](966800/4588595) loss:2.896 lr:0.0000100 epoch_Time:23074.0min: [2024-01-07 00:19:09,808][model8_pretrain.py][INFO] Epoch:[0/2](966800/4588595) loss:2.858 lr:0.0000100 epoch_Time:23074.0min: [2024-01-07 00:19:09,808][model8_pretrain.py][INFO] Epoch:[0/2](966800/4588595) loss:2.831 lr:0.0000100 epoch_Time:23074.0min: [2024-01-07 00:19:09,808][model8_pretrain.py][INFO] 
Epoch:[0/2](966800/4588595) loss:2.934 lr:0.0000100 epoch_Time:23074.0min: [2024-01-07 00:19:09,808][model8_pretrain.py][INFO] Epoch:[0/2](966800/4588595) loss:2.650 lr:0.0000100 epoch_Time:23074.0min: [2024-01-07 00:19:09,808][model8_pretrain.py][INFO] Epoch:[0/2](966800/4588595) loss:2.555 lr:0.0000100 epoch_Time:23074.0min: [2024-01-07 00:19:46,745][model8_pretrain.py][INFO] Epoch:[0/2](966900/4588595) loss:2.312 lr:0.0000100 epoch_Time:23074.0min: [2024-01-07 00:19:46,745][model8_pretrain.py][INFO] Epoch:[0/2](966900/4588595) loss:3.061 lr:0.0000100 epoch_Time:23074.0min: [2024-01-07 00:19:46,745][model8_pretrain.py][INFO] Epoch:[0/2](966900/4588595) loss:2.776 lr:0.0000100 epoch_Time:23074.0min: [2024-01-07 00:19:46,745][model8_pretrain.py][INFO] Epoch:[0/2](966900/4588595) loss:2.706 lr:0.0000100 epoch_Time:23074.0min: [2024-01-07 00:19:46,745][model8_pretrain.py][INFO] Epoch:[0/2](966900/4588595) loss:2.896 lr:0.0000100 epoch_Time:23074.0min: [2024-01-07 00:19:46,745][model8_pretrain.py][INFO] Epoch:[0/2](966900/4588595) loss:2.221 lr:0.0000100 epoch_Time:23074.0min: [2024-01-07 00:19:46,745][model8_pretrain.py][INFO] Epoch:[0/2](966900/4588595) loss:2.917 lr:0.0000100 epoch_Time:23074.0min: [2024-01-07 00:19:46,745][model8_pretrain.py][INFO] Epoch:[0/2](966900/4588595) loss:2.806 lr:0.0000100 epoch_Time:23074.0min: [2024-01-07 00:20:23,697][model8_pretrain.py][INFO] Epoch:[0/2](967000/4588595) loss:2.959 lr:0.0000100 epoch_Time:23073.0min: [2024-01-07 00:20:23,697][model8_pretrain.py][INFO] Epoch:[0/2](967000/4588595) loss:2.923 lr:0.0000100 epoch_Time:23073.0min: [2024-01-07 00:20:23,697][model8_pretrain.py][INFO] Epoch:[0/2](967000/4588595) loss:2.703 lr:0.0000100 epoch_Time:23073.0min: [2024-01-07 00:20:23,697][model8_pretrain.py][INFO] Epoch:[0/2](967000/4588595) loss:2.920 lr:0.0000100 epoch_Time:23073.0min: [2024-01-07 00:20:23,697][model8_pretrain.py][INFO] Epoch:[0/2](967000/4588595) loss:3.168 lr:0.0000100 epoch_Time:23073.0min: [2024-01-07 
00:20:23,697][model8_pretrain.py][INFO] Epoch:[0/2](967000/4588595) loss:2.662 lr:0.0000100 epoch_Time:23073.0min: [2024-01-07 00:20:23,697][model8_pretrain.py][INFO] Epoch:[0/2](967000/4588595) loss:2.076 lr:0.0000100 epoch_Time:23073.0min: [2024-01-07 00:20:23,697][model8_pretrain.py][INFO] Epoch:[0/2](967000/4588595) loss:2.650 lr:0.0000100 epoch_Time:23073.0min: [2024-01-07 00:21:00,644][model8_pretrain.py][INFO] Epoch:[0/2](967100/4588595) loss:2.521 lr:0.0000100 epoch_Time:23072.0min: [2024-01-07 00:21:00,644][model8_pretrain.py][INFO] Epoch:[0/2](967100/4588595) loss:2.807 lr:0.0000100 epoch_Time:23072.0min: [2024-01-07 00:21:00,644][model8_pretrain.py][INFO] Epoch:[0/2](967100/4588595) loss:3.082 lr:0.0000100 epoch_Time:23072.0min: [2024-01-07 00:21:00,644][model8_pretrain.py][INFO] Epoch:[0/2](967100/4588595) loss:2.554 lr:0.0000100 epoch_Time:23072.0min: [2024-01-07 00:21:00,644][model8_pretrain.py][INFO] Epoch:[0/2](967100/4588595) loss:2.840 lr:0.0000100 epoch_Time:23072.0min: [2024-01-07 00:21:00,644][model8_pretrain.py][INFO] Epoch:[0/2](967100/4588595) loss:3.109 lr:0.0000100 epoch_Time:23072.0min: [2024-01-07 00:21:00,644][model8_pretrain.py][INFO] Epoch:[0/2](967100/4588595) loss:2.754 lr:0.0000100 epoch_Time:23072.0min: [2024-01-07 00:21:00,644][model8_pretrain.py][INFO] Epoch:[0/2](967100/4588595) loss:2.433 lr:0.0000100 epoch_Time:23072.0min: [2024-01-07 00:21:37,592][model8_pretrain.py][INFO] Epoch:[0/2](967200/4588595) loss:2.789 lr:0.0000100 epoch_Time:23071.0min: [2024-01-07 00:21:37,592][model8_pretrain.py][INFO] Epoch:[0/2](967200/4588595) loss:2.928 lr:0.0000100 epoch_Time:23071.0min: [2024-01-07 00:21:37,592][model8_pretrain.py][INFO] Epoch:[0/2](967200/4588595) loss:2.485 lr:0.0000100 epoch_Time:23071.0min: [2024-01-07 00:21:37,592][model8_pretrain.py][INFO] Epoch:[0/2](967200/4588595) loss:2.979 lr:0.0000100 epoch_Time:23071.0min: [2024-01-07 00:21:37,592][model8_pretrain.py][INFO] Epoch:[0/2](967200/4588595) loss:2.964 lr:0.0000100 
epoch_Time:23071.0min: [2024-01-07 00:21:37,592][model8_pretrain.py][INFO] Epoch:[0/2](967200/4588595) loss:2.320 lr:0.0000100 epoch_Time:23071.0min: [2024-01-07 00:21:37,592][model8_pretrain.py][INFO] Epoch:[0/2](967200/4588595) loss:3.083 lr:0.0000100 epoch_Time:23071.0min: [2024-01-07 00:21:37,593][model8_pretrain.py][INFO] Epoch:[0/2](967200/4588595) loss:2.605 lr:0.0000100 epoch_Time:23071.0min: [2024-01-07 00:22:14,571][model8_pretrain.py][INFO] Epoch:[0/2](967300/4588595) loss:2.555 lr:0.0000100 epoch_Time:23070.0min: [2024-01-07 00:22:14,571][model8_pretrain.py][INFO] Epoch:[0/2](967300/4588595) loss:2.849 lr:0.0000100 epoch_Time:23070.0min: [2024-01-07 00:22:14,571][model8_pretrain.py][INFO] Epoch:[0/2](967300/4588595) loss:2.776 lr:0.0000100 epoch_Time:23070.0min: [2024-01-07 00:22:14,572][model8_pretrain.py][INFO] Epoch:[0/2](967300/4588595) loss:3.069 lr:0.0000100 epoch_Time:23070.0min: [2024-01-07 00:22:14,572][model8_pretrain.py][INFO] Epoch:[0/2](967300/4588595) loss:2.719 lr:0.0000100 epoch_Time:23070.0min: [2024-01-07 00:22:14,572][model8_pretrain.py][INFO] Epoch:[0/2](967300/4588595) loss:2.509 lr:0.0000100 epoch_Time:23070.0min: [2024-01-07 00:22:14,572][model8_pretrain.py][INFO] Epoch:[0/2](967300/4588595) loss:3.070 lr:0.0000100 epoch_Time:23070.0min: [2024-01-07 00:22:14,572][model8_pretrain.py][INFO] Epoch:[0/2](967300/4588595) loss:2.406 lr:0.0000100 epoch_Time:23070.0min: [2024-01-07 00:22:51,522][model8_pretrain.py][INFO] Epoch:[0/2](967400/4588595) loss:3.146 lr:0.0000100 epoch_Time:23069.0min: [2024-01-07 00:22:51,522][model8_pretrain.py][INFO] Epoch:[0/2](967400/4588595) loss:2.663 lr:0.0000100 epoch_Time:23069.0min: [2024-01-07 00:22:51,522][model8_pretrain.py][INFO] Epoch:[0/2](967400/4588595) loss:2.604 lr:0.0000100 epoch_Time:23069.0min: [2024-01-07 00:22:51,522][model8_pretrain.py][INFO] Epoch:[0/2](967400/4588595) loss:2.484 lr:0.0000100 epoch_Time:23069.0min: [2024-01-07 00:22:51,522][model8_pretrain.py][INFO] 
Epoch:[0/2](967400/4588595) loss:2.751 lr:0.0000100 epoch_Time:23069.0min: [2024-01-07 00:22:51,522][model8_pretrain.py][INFO] Epoch:[0/2](967400/4588595) loss:3.277 lr:0.0000100 epoch_Time:23069.0min: [2024-01-07 00:22:51,522][model8_pretrain.py][INFO] Epoch:[0/2](967400/4588595) loss:3.190 lr:0.0000100 epoch_Time:23069.0min: [2024-01-07 00:22:51,522][model8_pretrain.py][INFO] Epoch:[0/2](967400/4588595) loss:3.114 lr:0.0000100 epoch_Time:23069.0min: [2024-01-07 00:23:28,454][model8_pretrain.py][INFO] Epoch:[0/2](967500/4588595) loss:2.426 lr:0.0000100 epoch_Time:23069.0min: [2024-01-07 00:23:28,454][model8_pretrain.py][INFO] Epoch:[0/2](967500/4588595) loss:2.640 lr:0.0000100 epoch_Time:23069.0min: [2024-01-07 00:23:28,454][model8_pretrain.py][INFO] Epoch:[0/2](967500/4588595) loss:2.842 lr:0.0000100 epoch_Time:23069.0min: [2024-01-07 00:23:28,454][model8_pretrain.py][INFO] Epoch:[0/2](967500/4588595) loss:3.379 lr:0.0000100 epoch_Time:23069.0min: [2024-01-07 00:23:28,454][model8_pretrain.py][INFO] Epoch:[0/2](967500/4588595) loss:3.117 lr:0.0000100 epoch_Time:23069.0min: [2024-01-07 00:23:28,454][model8_pretrain.py][INFO] Epoch:[0/2](967500/4588595) loss:2.345 lr:0.0000100 epoch_Time:23069.0min: [2024-01-07 00:23:28,454][model8_pretrain.py][INFO] Epoch:[0/2](967500/4588595) loss:3.251 lr:0.0000100 epoch_Time:23069.0min: [2024-01-07 00:23:28,454][model8_pretrain.py][INFO] Epoch:[0/2](967500/4588595) loss:3.017 lr:0.0000100 epoch_Time:23069.0min: [2024-01-07 00:24:19,179][model8_pretrain.py][INFO] Epoch:[0/2](967600/4588595) loss:2.868 lr:0.0000100 epoch_Time:23069.0min: [2024-01-07 00:24:19,179][model8_pretrain.py][INFO] Epoch:[0/2](967600/4588595) loss:2.659 lr:0.0000100 epoch_Time:23069.0min: [2024-01-07 00:24:19,179][model8_pretrain.py][INFO] Epoch:[0/2](967600/4588595) loss:2.924 lr:0.0000100 epoch_Time:23069.0min: [2024-01-07 00:24:19,179][model8_pretrain.py][INFO] Epoch:[0/2](967600/4588595) loss:2.801 lr:0.0000100 epoch_Time:23069.0min: [2024-01-07 
00:24:19,179][model8_pretrain.py][INFO] Epoch:[0/2](967600/4588595) loss:2.679 lr:0.0000100 epoch_Time:23069.0min: [2024-01-07 00:24:19,179][model8_pretrain.py][INFO] Epoch:[0/2](967600/4588595) loss:3.023 lr:0.0000100 epoch_Time:23069.0min: [2024-01-07 00:24:19,179][model8_pretrain.py][INFO] Epoch:[0/2](967600/4588595) loss:3.019 lr:0.0000100 epoch_Time:23069.0min: [2024-01-07 00:24:19,179][model8_pretrain.py][INFO] Epoch:[0/2](967600/4588595) loss:3.029 lr:0.0000100 epoch_Time:23069.0min: [2024-01-07 00:24:56,109][model8_pretrain.py][INFO] Epoch:[0/2](967700/4588595) loss:3.559 lr:0.0000100 epoch_Time:23068.0min: [2024-01-07 00:24:56,109][model8_pretrain.py][INFO] Epoch:[0/2](967700/4588595) loss:2.824 lr:0.0000100 epoch_Time:23068.0min: [2024-01-07 00:24:56,109][model8_pretrain.py][INFO] Epoch:[0/2](967700/4588595) loss:2.570 lr:0.0000100 epoch_Time:23068.0min: [2024-01-07 00:24:56,109][model8_pretrain.py][INFO] Epoch:[0/2](967700/4588595) loss:2.853 lr:0.0000100 epoch_Time:23068.0min: [2024-01-07 00:24:56,109][model8_pretrain.py][INFO] Epoch:[0/2](967700/4588595) loss:2.595 lr:0.0000100 epoch_Time:23068.0min: [2024-01-07 00:24:56,109][model8_pretrain.py][INFO] Epoch:[0/2](967700/4588595) loss:2.916 lr:0.0000100 epoch_Time:23068.0min: [2024-01-07 00:24:56,109][model8_pretrain.py][INFO] Epoch:[0/2](967700/4588595) loss:2.330 lr:0.0000100 epoch_Time:23068.0min: [2024-01-07 00:24:56,109][model8_pretrain.py][INFO] Epoch:[0/2](967700/4588595) loss:2.841 lr:0.0000100 epoch_Time:23068.0min: [2024-01-07 00:25:33,052][model8_pretrain.py][INFO] Epoch:[0/2](967800/4588595) loss:2.816 lr:0.0000100 epoch_Time:23068.0min: [2024-01-07 00:25:33,052][model8_pretrain.py][INFO] Epoch:[0/2](967800/4588595) loss:2.743 lr:0.0000100 epoch_Time:23068.0min: [2024-01-07 00:25:33,052][model8_pretrain.py][INFO] Epoch:[0/2](967800/4588595) loss:3.218 lr:0.0000100 epoch_Time:23068.0min: [2024-01-07 00:25:33,052][model8_pretrain.py][INFO] Epoch:[0/2](967800/4588595) loss:3.295 lr:0.0000100 
epoch_Time:23068.0min: [2024-01-07 00:25:33,052][model8_pretrain.py][INFO] Epoch:[0/2](967800/4588595) loss:2.085 lr:0.0000100 epoch_Time:23068.0min: [2024-01-07 00:25:33,052][model8_pretrain.py][INFO] Epoch:[0/2](967800/4588595) loss:2.877 lr:0.0000100 epoch_Time:23068.0min: [2024-01-07 00:25:33,052][model8_pretrain.py][INFO] Epoch:[0/2](967800/4588595) loss:2.961 lr:0.0000100 epoch_Time:23068.0min: [2024-01-07 00:25:33,053][model8_pretrain.py][INFO] Epoch:[0/2](967800/4588595) loss:2.556 lr:0.0000100 epoch_Time:23068.0min: [2024-01-07 00:26:10,013][model8_pretrain.py][INFO] Epoch:[0/2](967900/4588595) loss:2.976 lr:0.0000100 epoch_Time:23067.0min: [2024-01-07 00:26:10,013][model8_pretrain.py][INFO] Epoch:[0/2](967900/4588595) loss:2.391 lr:0.0000100 epoch_Time:23067.0min: [2024-01-07 00:26:10,013][model8_pretrain.py][INFO] Epoch:[0/2](967900/4588595) loss:3.039 lr:0.0000100 epoch_Time:23067.0min: [2024-01-07 00:26:10,013][model8_pretrain.py][INFO] Epoch:[0/2](967900/4588595) loss:2.892 lr:0.0000100 epoch_Time:23067.0min: [2024-01-07 00:26:10,013][model8_pretrain.py][INFO] Epoch:[0/2](967900/4588595) loss:3.329 lr:0.0000100 epoch_Time:23067.0min: [2024-01-07 00:26:10,013][model8_pretrain.py][INFO] Epoch:[0/2](967900/4588595) loss:2.908 lr:0.0000100 epoch_Time:23067.0min: [2024-01-07 00:26:10,013][model8_pretrain.py][INFO] Epoch:[0/2](967900/4588595) loss:2.706 lr:0.0000100 epoch_Time:23067.0min: [2024-01-07 00:26:10,013][model8_pretrain.py][INFO] Epoch:[0/2](967900/4588595) loss:2.545 lr:0.0000100 epoch_Time:23067.0min: [2024-01-07 00:26:46,967][model8_pretrain.py][INFO] Epoch:[0/2](968000/4588595) loss:3.098 lr:0.0000100 epoch_Time:23067.0min: [2024-01-07 00:26:46,967][model8_pretrain.py][INFO] Epoch:[0/2](968000/4588595) loss:2.532 lr:0.0000100 epoch_Time:23067.0min: [2024-01-07 00:26:46,967][model8_pretrain.py][INFO] Epoch:[0/2](968000/4588595) loss:2.901 lr:0.0000100 epoch_Time:23067.0min: [2024-01-07 00:26:46,967][model8_pretrain.py][INFO] 
Epoch:[0/2](968000/4588595) loss:2.373 lr:0.0000100 epoch_Time:23067.0min: [2024-01-07 00:26:46,967][model8_pretrain.py][INFO] Epoch:[0/2](968000/4588595) loss:2.766 lr:0.0000100 epoch_Time:23067.0min: [2024-01-07 00:26:46,967][model8_pretrain.py][INFO] Epoch:[0/2](968000/4588595) loss:2.817 lr:0.0000100 epoch_Time:23067.0min: [2024-01-07 00:26:46,967][model8_pretrain.py][INFO] Epoch:[0/2](968000/4588595) loss:3.598 lr:0.0000100 epoch_Time:23067.0min: [2024-01-07 00:26:46,967][model8_pretrain.py][INFO] Epoch:[0/2](968000/4588595) loss:2.362 lr:0.0000100 epoch_Time:23067.0min: [2024-01-07 00:27:23,932][model8_pretrain.py][INFO] Epoch:[0/2](968100/4588595) loss:2.812 lr:0.0000100 epoch_Time:23066.0min: [2024-01-07 00:27:23,932][model8_pretrain.py][INFO] Epoch:[0/2](968100/4588595) loss:2.697 lr:0.0000100 epoch_Time:23066.0min: [2024-01-07 00:27:23,932][model8_pretrain.py][INFO] Epoch:[0/2](968100/4588595) loss:2.940 lr:0.0000100 epoch_Time:23066.0min: [2024-01-07 00:27:23,932][model8_pretrain.py][INFO] Epoch:[0/2](968100/4588595) loss:2.543 lr:0.0000100 epoch_Time:23066.0min: [2024-01-07 00:27:23,932][model8_pretrain.py][INFO] Epoch:[0/2](968100/4588595) loss:2.389 lr:0.0000100 epoch_Time:23066.0min: [2024-01-07 00:27:23,932][model8_pretrain.py][INFO] Epoch:[0/2](968100/4588595) loss:2.887 lr:0.0000100 epoch_Time:23066.0min: [2024-01-07 00:27:23,932][model8_pretrain.py][INFO] Epoch:[0/2](968100/4588595) loss:3.017 lr:0.0000100 epoch_Time:23066.0min: [2024-01-07 00:27:23,932][model8_pretrain.py][INFO] Epoch:[0/2](968100/4588595) loss:3.146 lr:0.0000100 epoch_Time:23066.0min: [2024-01-07 00:28:00,878][model8_pretrain.py][INFO] Epoch:[0/2](968200/4588595) loss:2.356 lr:0.0000100 epoch_Time:23065.0min: [2024-01-07 00:28:00,878][model8_pretrain.py][INFO] Epoch:[0/2](968200/4588595) loss:2.639 lr:0.0000100 epoch_Time:23065.0min: [2024-01-07 00:28:00,878][model8_pretrain.py][INFO] Epoch:[0/2](968200/4588595) loss:2.862 lr:0.0000100 epoch_Time:23065.0min: [2024-01-07 
00:28:00,878][model8_pretrain.py][INFO] Epoch:[0/2](968200/4588595) loss:3.252 lr:0.0000100 epoch_Time:23065.0min: [2024-01-07 00:28:00,879][model8_pretrain.py][INFO] Epoch:[0/2](968200/4588595) loss:3.182 lr:0.0000100 epoch_Time:23065.0min: [2024-01-07 00:28:00,879][model8_pretrain.py][INFO] Epoch:[0/2](968200/4588595) loss:2.565 lr:0.0000100 epoch_Time:23065.0min: [2024-01-07 00:28:00,879][model8_pretrain.py][INFO] Epoch:[0/2](968200/4588595) loss:3.112 lr:0.0000100 epoch_Time:23065.0min: [2024-01-07 00:28:00,879][model8_pretrain.py][INFO] Epoch:[0/2](968200/4588595) loss:1.951 lr:0.0000100 epoch_Time:23065.0min: [2024-01-07 00:28:37,831][model8_pretrain.py][INFO] Epoch:[0/2](968300/4588595) loss:2.293 lr:0.0000100 epoch_Time:23064.0min: [2024-01-07 00:28:37,831][model8_pretrain.py][INFO] Epoch:[0/2](968300/4588595) loss:2.842 lr:0.0000100 epoch_Time:23064.0min: [2024-01-07 00:28:37,831][model8_pretrain.py][INFO] Epoch:[0/2](968300/4588595) loss:2.551 lr:0.0000100 epoch_Time:23064.0min: [2024-01-07 00:28:37,831][model8_pretrain.py][INFO] Epoch:[0/2](968300/4588595) loss:2.531 lr:0.0000100 epoch_Time:23064.0min: [2024-01-07 00:28:37,831][model8_pretrain.py][INFO] Epoch:[0/2](968300/4588595) loss:2.823 lr:0.0000100 epoch_Time:23064.0min: [2024-01-07 00:28:37,831][model8_pretrain.py][INFO] Epoch:[0/2](968300/4588595) loss:3.426 lr:0.0000100 epoch_Time:23064.0min: [2024-01-07 00:28:37,831][model8_pretrain.py][INFO] Epoch:[0/2](968300/4588595) loss:2.630 lr:0.0000100 epoch_Time:23064.0min: [2024-01-07 00:28:37,831][model8_pretrain.py][INFO] Epoch:[0/2](968300/4588595) loss:3.073 lr:0.0000100 epoch_Time:23064.0min: [2024-01-07 00:29:28,912][model8_pretrain.py][INFO] Epoch:[0/2](968400/4588595) loss:2.833 lr:0.0000100 epoch_Time:23064.0min: [2024-01-07 00:29:28,912][model8_pretrain.py][INFO] Epoch:[0/2](968400/4588595) loss:2.778 lr:0.0000100 epoch_Time:23064.0min: [2024-01-07 00:29:28,913][model8_pretrain.py][INFO] Epoch:[0/2](968400/4588595) loss:2.354 lr:0.0000100 
epoch_Time:23064.0min: [2024-01-07 00:29:28,913][model8_pretrain.py][INFO] Epoch:[0/2](968400/4588595) loss:2.004 lr:0.0000100 epoch_Time:23064.0min: [2024-01-07 00:29:28,913][model8_pretrain.py][INFO] Epoch:[0/2](968400/4588595) loss:3.085 lr:0.0000100 epoch_Time:23064.0min: [2024-01-07 00:29:28,913][model8_pretrain.py][INFO] Epoch:[0/2](968400/4588595) loss:2.733 lr:0.0000100 epoch_Time:23064.0min: [2024-01-07 00:29:28,913][model8_pretrain.py][INFO] Epoch:[0/2](968400/4588595) loss:2.485 lr:0.0000100 epoch_Time:23064.0min: [2024-01-07 00:29:28,913][model8_pretrain.py][INFO] Epoch:[0/2](968400/4588595) loss:2.792 lr:0.0000100 epoch_Time:23064.0min: [2024-01-07 00:30:05,852][model8_pretrain.py][INFO] Epoch:[0/2](968500/4588595) loss:2.994 lr:0.0000100 epoch_Time:23063.0min: [2024-01-07 00:30:05,852][model8_pretrain.py][INFO] Epoch:[0/2](968500/4588595) loss:2.629 lr:0.0000100 epoch_Time:23063.0min: [2024-01-07 00:30:05,852][model8_pretrain.py][INFO] Epoch:[0/2](968500/4588595) loss:2.603 lr:0.0000100 epoch_Time:23063.0min: [2024-01-07 00:30:05,852][model8_pretrain.py][INFO] Epoch:[0/2](968500/4588595) loss:3.026 lr:0.0000100 epoch_Time:23063.0min: [2024-01-07 00:30:05,852][model8_pretrain.py][INFO] Epoch:[0/2](968500/4588595) loss:2.404 lr:0.0000100 epoch_Time:23063.0min: [2024-01-07 00:30:05,852][model8_pretrain.py][INFO] Epoch:[0/2](968500/4588595) loss:2.535 lr:0.0000100 epoch_Time:23063.0min: [2024-01-07 00:30:05,852][model8_pretrain.py][INFO] Epoch:[0/2](968500/4588595) loss:2.761 lr:0.0000100 epoch_Time:23063.0min: [2024-01-07 00:30:05,852][model8_pretrain.py][INFO] Epoch:[0/2](968500/4588595) loss:3.245 lr:0.0000100 epoch_Time:23063.0min: [2024-01-07 00:30:42,802][model8_pretrain.py][INFO] Epoch:[0/2](968600/4588595) loss:3.086 lr:0.0000100 epoch_Time:23063.0min: [2024-01-07 00:30:42,802][model8_pretrain.py][INFO] Epoch:[0/2](968600/4588595) loss:2.950 lr:0.0000100 epoch_Time:23063.0min: [2024-01-07 00:30:42,802][model8_pretrain.py][INFO] 
Epoch:[0/2](968600/4588595) loss:2.456 lr:0.0000100 epoch_Time:23063.0min: [2024-01-07 00:30:42,802][model8_pretrain.py][INFO] Epoch:[0/2](968600/4588595) loss:2.244 lr:0.0000100 epoch_Time:23063.0min: [2024-01-07 00:30:42,802][model8_pretrain.py][INFO] Epoch:[0/2](968600/4588595) loss:3.209 lr:0.0000100 epoch_Time:23063.0min: [2024-01-07 00:30:42,802][model8_pretrain.py][INFO] Epoch:[0/2](968600/4588595) loss:3.201 lr:0.0000100 epoch_Time:23063.0min: [2024-01-07 00:30:42,802][model8_pretrain.py][INFO] Epoch:[0/2](968600/4588595) loss:2.245 lr:0.0000100 epoch_Time:23063.0min: [2024-01-07 00:30:42,802][model8_pretrain.py][INFO] Epoch:[0/2](968600/4588595) loss:2.661 lr:0.0000100 epoch_Time:23063.0min: [2024-01-07 00:31:19,754][model8_pretrain.py][INFO] Epoch:[0/2](968700/4588595) loss:2.697 lr:0.0000100 epoch_Time:23062.0min: [2024-01-07 00:31:19,754][model8_pretrain.py][INFO] Epoch:[0/2](968700/4588595) loss:2.877 lr:0.0000100 epoch_Time:23062.0min: [2024-01-07 00:31:19,754][model8_pretrain.py][INFO] Epoch:[0/2](968700/4588595) loss:2.726 lr:0.0000100 epoch_Time:23062.0min: [2024-01-07 00:31:19,754][model8_pretrain.py][INFO] Epoch:[0/2](968700/4588595) loss:2.285 lr:0.0000100 epoch_Time:23062.0min: [2024-01-07 00:31:19,754][model8_pretrain.py][INFO] Epoch:[0/2](968700/4588595) loss:2.654 lr:0.0000100 epoch_Time:23062.0min: [2024-01-07 00:31:19,754][model8_pretrain.py][INFO] Epoch:[0/2](968700/4588595) loss:2.402 lr:0.0000100 epoch_Time:23062.0min: [2024-01-07 00:31:19,754][model8_pretrain.py][INFO] Epoch:[0/2](968700/4588595) loss:2.522 lr:0.0000100 epoch_Time:23062.0min: [2024-01-07 00:31:19,754][model8_pretrain.py][INFO] Epoch:[0/2](968700/4588595) loss:3.227 lr:0.0000100 epoch_Time:23062.0min: [2024-01-07 00:31:56,705][model8_pretrain.py][INFO] Epoch:[0/2](968800/4588595) loss:2.881 lr:0.0000100 epoch_Time:23061.0min: [2024-01-07 00:31:56,705][model8_pretrain.py][INFO] Epoch:[0/2](968800/4588595) loss:3.119 lr:0.0000100 epoch_Time:23061.0min: [2024-01-07 
00:31:56,705][model8_pretrain.py][INFO] Epoch:[0/2](968800/4588595) loss:3.083 lr:0.0000100 epoch_Time:23061.0min: [2024-01-07 00:31:56,705][model8_pretrain.py][INFO] Epoch:[0/2](968800/4588595) loss:2.477 lr:0.0000100 epoch_Time:23061.0min: [2024-01-07 00:31:56,705][model8_pretrain.py][INFO] Epoch:[0/2](968800/4588595) loss:2.595 lr:0.0000100 epoch_Time:23061.0min: [2024-01-07 00:31:56,705][model8_pretrain.py][INFO] Epoch:[0/2](968800/4588595) loss:2.670 lr:0.0000100 epoch_Time:23061.0min: [2024-01-07 00:31:56,705][model8_pretrain.py][INFO] Epoch:[0/2](968800/4588595) loss:2.555 lr:0.0000100 epoch_Time:23061.0min: [2024-01-07 00:31:56,705][model8_pretrain.py][INFO] Epoch:[0/2](968800/4588595) loss:3.054 lr:0.0000100 epoch_Time:23061.0min: [2024-01-07 00:32:33,690][model8_pretrain.py][INFO] Epoch:[0/2](968900/4588595) loss:2.931 lr:0.0000100 epoch_Time:23061.0min: [2024-01-07 00:32:33,690][model8_pretrain.py][INFO] Epoch:[0/2](968900/4588595) loss:3.142 lr:0.0000100 epoch_Time:23061.0min: [2024-01-07 00:32:33,690][model8_pretrain.py][INFO] Epoch:[0/2](968900/4588595) loss:2.486 lr:0.0000100 epoch_Time:23061.0min: [2024-01-07 00:32:33,690][model8_pretrain.py][INFO] Epoch:[0/2](968900/4588595) loss:2.516 lr:0.0000100 epoch_Time:23061.0min: [2024-01-07 00:32:33,690][model8_pretrain.py][INFO] Epoch:[0/2](968900/4588595) loss:2.749 lr:0.0000100 epoch_Time:23061.0min: [2024-01-07 00:32:33,690][model8_pretrain.py][INFO] Epoch:[0/2](968900/4588595) loss:2.955 lr:0.0000100 epoch_Time:23061.0min: [2024-01-07 00:32:33,690][model8_pretrain.py][INFO] Epoch:[0/2](968900/4588595) loss:3.069 lr:0.0000100 epoch_Time:23061.0min: [2024-01-07 00:32:33,690][model8_pretrain.py][INFO] Epoch:[0/2](968900/4588595) loss:2.575 lr:0.0000100 epoch_Time:23061.0min: [2024-01-07 00:33:10,672][model8_pretrain.py][INFO] Epoch:[0/2](969000/4588595) loss:3.224 lr:0.0000100 epoch_Time:23060.0min: [2024-01-07 00:33:10,672][model8_pretrain.py][INFO] Epoch:[0/2](969000/4588595) loss:2.883 lr:0.0000100 
epoch_Time:23060.0min: [2024-01-07 00:33:10,672][model8_pretrain.py][INFO] Epoch:[0/2](969000/4588595) loss:2.456 lr:0.0000100 epoch_Time:23060.0min: [2024-01-07 00:33:10,672][model8_pretrain.py][INFO] Epoch:[0/2](969000/4588595) loss:3.102 lr:0.0000100 epoch_Time:23060.0min: [2024-01-07 00:33:10,672][model8_pretrain.py][INFO] Epoch:[0/2](969000/4588595) loss:2.338 lr:0.0000100 epoch_Time:23060.0min: [2024-01-07 00:33:10,672][model8_pretrain.py][INFO] Epoch:[0/2](969000/4588595) loss:2.980 lr:0.0000100 epoch_Time:23060.0min: [2024-01-07 00:33:10,672][model8_pretrain.py][INFO] Epoch:[0/2](969000/4588595) loss:3.234 lr:0.0000100 epoch_Time:23060.0min: [2024-01-07 00:33:10,673][model8_pretrain.py][INFO] Epoch:[0/2](969000/4588595) loss:2.910 lr:0.0000100 epoch_Time:23060.0min: [2024-01-07 00:33:47,628][model8_pretrain.py][INFO] Epoch:[0/2](969100/4588595) loss:2.869 lr:0.0000100 epoch_Time:23059.0min: [2024-01-07 00:33:47,628][model8_pretrain.py][INFO] Epoch:[0/2](969100/4588595) loss:2.599 lr:0.0000100 epoch_Time:23059.0min: [2024-01-07 00:33:47,628][model8_pretrain.py][INFO] Epoch:[0/2](969100/4588595) loss:2.448 lr:0.0000100 epoch_Time:23059.0min: [2024-01-07 00:33:47,628][model8_pretrain.py][INFO] Epoch:[0/2](969100/4588595) loss:2.543 lr:0.0000100 epoch_Time:23059.0min: [2024-01-07 00:33:47,628][model8_pretrain.py][INFO] Epoch:[0/2](969100/4588595) loss:2.531 lr:0.0000100 epoch_Time:23059.0min: [2024-01-07 00:33:47,628][model8_pretrain.py][INFO] Epoch:[0/2](969100/4588595) loss:2.682 lr:0.0000100 epoch_Time:23059.0min: [2024-01-07 00:33:47,628][model8_pretrain.py][INFO] Epoch:[0/2](969100/4588595) loss:2.860 lr:0.0000100 epoch_Time:23059.0min: [2024-01-07 00:33:47,628][model8_pretrain.py][INFO] Epoch:[0/2](969100/4588595) loss:2.695 lr:0.0000100 epoch_Time:23059.0min: [2024-01-07 00:34:38,187][model8_pretrain.py][INFO] Epoch:[0/2](969200/4588595) loss:3.194 lr:0.0000100 epoch_Time:23060.0min: [2024-01-07 00:34:38,187][model8_pretrain.py][INFO] 
Epoch:[0/2](969200/4588595) loss:2.855 lr:0.0000100 epoch_Time:23060.0min: [2024-01-07 00:34:38,187][model8_pretrain.py][INFO] Epoch:[0/2](969200/4588595) loss:2.727 lr:0.0000100 epoch_Time:23060.0min: [2024-01-07 00:34:38,187][model8_pretrain.py][INFO] Epoch:[0/2](969200/4588595) loss:2.526 lr:0.0000100 epoch_Time:23060.0min: [2024-01-07 00:34:38,187][model8_pretrain.py][INFO] Epoch:[0/2](969200/4588595) loss:3.144 lr:0.0000100 epoch_Time:23060.0min: [2024-01-07 00:34:38,187][model8_pretrain.py][INFO] Epoch:[0/2](969200/4588595) loss:3.070 lr:0.0000100 epoch_Time:23060.0min: [2024-01-07 00:34:38,187][model8_pretrain.py][INFO] Epoch:[0/2](969200/4588595) loss:2.971 lr:0.0000100 epoch_Time:23060.0min: [2024-01-07 00:34:38,187][model8_pretrain.py][INFO] Epoch:[0/2](969200/4588595) loss:2.538 lr:0.0000100 epoch_Time:23060.0min: [2024-01-07 00:35:15,134][model8_pretrain.py][INFO] Epoch:[0/2](969300/4588595) loss:2.942 lr:0.0000100 epoch_Time:23059.0min: [2024-01-07 00:35:15,134][model8_pretrain.py][INFO] Epoch:[0/2](969300/4588595) loss:2.337 lr:0.0000100 epoch_Time:23059.0min: [2024-01-07 00:35:15,134][model8_pretrain.py][INFO] Epoch:[0/2](969300/4588595) loss:2.415 lr:0.0000100 epoch_Time:23059.0min: [2024-01-07 00:35:15,134][model8_pretrain.py][INFO] Epoch:[0/2](969300/4588595) loss:2.457 lr:0.0000100 epoch_Time:23059.0min: [2024-01-07 00:35:15,134][model8_pretrain.py][INFO] Epoch:[0/2](969300/4588595) loss:3.023 lr:0.0000100 epoch_Time:23059.0min: [2024-01-07 00:35:15,134][model8_pretrain.py][INFO] Epoch:[0/2](969300/4588595) loss:2.668 lr:0.0000100 epoch_Time:23059.0min: [2024-01-07 00:35:15,134][model8_pretrain.py][INFO] Epoch:[0/2](969300/4588595) loss:2.677 lr:0.0000100 epoch_Time:23059.0min: [2024-01-07 00:35:15,134][model8_pretrain.py][INFO] Epoch:[0/2](969300/4588595) loss:2.812 lr:0.0000100 epoch_Time:23059.0min: [2024-01-07 00:35:52,089][model8_pretrain.py][INFO] Epoch:[0/2](969400/4588595) loss:3.081 lr:0.0000100 epoch_Time:23058.0min: [2024-01-07 
00:35:52,089][model8_pretrain.py][INFO] Epoch:[0/2](969400/4588595) loss:2.920 lr:0.0000100 epoch_Time:23058.0min: [2024-01-07 00:35:52,089][model8_pretrain.py][INFO] Epoch:[0/2](969400/4588595) loss:2.991 lr:0.0000100 epoch_Time:23058.0min: [2024-01-07 00:35:52,089][model8_pretrain.py][INFO] Epoch:[0/2](969400/4588595) loss:2.799 lr:0.0000100 epoch_Time:23058.0min: [2024-01-07 00:35:52,089][model8_pretrain.py][INFO] Epoch:[0/2](969400/4588595) loss:2.491 lr:0.0000100 epoch_Time:23058.0min: [2024-01-07 00:35:52,089][model8_pretrain.py][INFO] Epoch:[0/2](969400/4588595) loss:2.857 lr:0.0000100 epoch_Time:23058.0min: [2024-01-07 00:35:52,089][model8_pretrain.py][INFO] Epoch:[0/2](969400/4588595) loss:2.958 lr:0.0000100 epoch_Time:23058.0min: [2024-01-07 00:35:52,089][model8_pretrain.py][INFO] Epoch:[0/2](969400/4588595) loss:3.003 lr:0.0000100 epoch_Time:23058.0min: [2024-01-07 00:36:29,039][model8_pretrain.py][INFO] Epoch:[0/2](969500/4588595) loss:2.978 lr:0.0000100 epoch_Time:23057.0min: [2024-01-07 00:36:29,039][model8_pretrain.py][INFO] Epoch:[0/2](969500/4588595) loss:2.847 lr:0.0000100 epoch_Time:23057.0min: [2024-01-07 00:36:29,039][model8_pretrain.py][INFO] Epoch:[0/2](969500/4588595) loss:2.476 lr:0.0000100 epoch_Time:23057.0min: [2024-01-07 00:36:29,039][model8_pretrain.py][INFO] Epoch:[0/2](969500/4588595) loss:2.411 lr:0.0000100 epoch_Time:23057.0min: [2024-01-07 00:36:29,039][model8_pretrain.py][INFO] Epoch:[0/2](969500/4588595) loss:2.536 lr:0.0000100 epoch_Time:23057.0min: [2024-01-07 00:36:29,039][model8_pretrain.py][INFO] Epoch:[0/2](969500/4588595) loss:2.709 lr:0.0000100 epoch_Time:23057.0min: [2024-01-07 00:36:29,039][model8_pretrain.py][INFO] Epoch:[0/2](969500/4588595) loss:3.048 lr:0.0000100 epoch_Time:23057.0min: [2024-01-07 00:36:29,039][model8_pretrain.py][INFO] Epoch:[0/2](969500/4588595) loss:3.198 lr:0.0000100 epoch_Time:23057.0min: [2024-01-07 00:37:05,986][model8_pretrain.py][INFO] Epoch:[0/2](969600/4588595) loss:3.172 lr:0.0000100 
epoch_Time:23056.0min: [2024-01-07 00:37:05,987][model8_pretrain.py][INFO] Epoch:[0/2](969600/4588595) loss:3.023 lr:0.0000100 epoch_Time:23056.0min: [2024-01-07 00:37:05,987][model8_pretrain.py][INFO] Epoch:[0/2](969600/4588595) loss:3.189 lr:0.0000100 epoch_Time:23056.0min: [2024-01-07 00:37:05,987][model8_pretrain.py][INFO] Epoch:[0/2](969600/4588595) loss:2.820 lr:0.0000100 epoch_Time:23056.0min: [2024-01-07 00:37:05,987][model8_pretrain.py][INFO] Epoch:[0/2](969600/4588595) loss:2.445 lr:0.0000100 epoch_Time:23056.0min: [2024-01-07 00:37:05,987][model8_pretrain.py][INFO] Epoch:[0/2](969600/4588595) loss:3.038 lr:0.0000100 epoch_Time:23056.0min: [2024-01-07 00:37:05,987][model8_pretrain.py][INFO] Epoch:[0/2](969600/4588595) loss:3.045 lr:0.0000100 epoch_Time:23056.0min: [2024-01-07 00:37:05,987][model8_pretrain.py][INFO] Epoch:[0/2](969600/4588595) loss:3.036 lr:0.0000100 epoch_Time:23056.0min: [2024-01-07 00:37:42,946][model8_pretrain.py][INFO] Epoch:[0/2](969700/4588595) loss:3.028 lr:0.0000100 epoch_Time:23056.0min: [2024-01-07 00:37:42,946][model8_pretrain.py][INFO] Epoch:[0/2](969700/4588595) loss:2.957 lr:0.0000100 epoch_Time:23056.0min: [2024-01-07 00:37:42,946][model8_pretrain.py][INFO] Epoch:[0/2](969700/4588595) loss:2.808 lr:0.0000100 epoch_Time:23056.0min: [2024-01-07 00:37:42,946][model8_pretrain.py][INFO] Epoch:[0/2](969700/4588595) loss:2.604 lr:0.0000100 epoch_Time:23056.0min: [2024-01-07 00:37:42,946][model8_pretrain.py][INFO] Epoch:[0/2](969700/4588595) loss:2.669 lr:0.0000100 epoch_Time:23056.0min: [2024-01-07 00:37:42,946][model8_pretrain.py][INFO] Epoch:[0/2](969700/4588595) loss:3.045 lr:0.0000100 epoch_Time:23056.0min: [2024-01-07 00:37:42,946][model8_pretrain.py][INFO] Epoch:[0/2](969700/4588595) loss:2.762 lr:0.0000100 epoch_Time:23056.0min: [2024-01-07 00:37:42,947][model8_pretrain.py][INFO] Epoch:[0/2](969700/4588595) loss:2.963 lr:0.0000100 epoch_Time:23056.0min: [2024-01-07 00:38:20,038][model8_pretrain.py][INFO] 
Epoch:[0/2](969800/4588595) loss:2.786 lr:0.0000100 epoch_Time:23055.0min: [2024-01-07 00:38:20,038][model8_pretrain.py][INFO] Epoch:[0/2](969800/4588595) loss:2.978 lr:0.0000100 epoch_Time:23055.0min: [2024-01-07 00:38:20,038][model8_pretrain.py][INFO] Epoch:[0/2](969800/4588595) loss:2.977 lr:0.0000100 epoch_Time:23055.0min: [2024-01-07 00:38:20,038][model8_pretrain.py][INFO] Epoch:[0/2](969800/4588595) loss:3.207 lr:0.0000100 epoch_Time:23055.0min: [2024-01-07 00:38:20,038][model8_pretrain.py][INFO] Epoch:[0/2](969800/4588595) loss:3.234 lr:0.0000100 epoch_Time:23055.0min: [2024-01-07 00:38:20,038][model8_pretrain.py][INFO] Epoch:[0/2](969800/4588595) loss:3.073 lr:0.0000100 epoch_Time:23055.0min: [2024-01-07 00:38:20,039][model8_pretrain.py][INFO] Epoch:[0/2](969800/4588595) loss:2.410 lr:0.0000100 epoch_Time:23055.0min: [2024-01-07 00:38:20,039][model8_pretrain.py][INFO] Epoch:[0/2](969800/4588595) loss:2.942 lr:0.0000100 epoch_Time:23055.0min: [2024-01-07 00:38:57,042][model8_pretrain.py][INFO] Epoch:[0/2](969900/4588595) loss:2.447 lr:0.0000100 epoch_Time:23054.0min: [2024-01-07 00:38:57,042][model8_pretrain.py][INFO] Epoch:[0/2](969900/4588595) loss:2.514 lr:0.0000100 epoch_Time:23054.0min: [2024-01-07 00:38:57,042][model8_pretrain.py][INFO] Epoch:[0/2](969900/4588595) loss:3.290 lr:0.0000100 epoch_Time:23054.0min: [2024-01-07 00:38:57,042][model8_pretrain.py][INFO] Epoch:[0/2](969900/4588595) loss:2.616 lr:0.0000100 epoch_Time:23054.0min: [2024-01-07 00:38:57,042][model8_pretrain.py][INFO] Epoch:[0/2](969900/4588595) loss:3.075 lr:0.0000100 epoch_Time:23054.0min: [2024-01-07 00:38:57,042][model8_pretrain.py][INFO] Epoch:[0/2](969900/4588595) loss:2.493 lr:0.0000100 epoch_Time:23054.0min: [2024-01-07 00:38:57,042][model8_pretrain.py][INFO] Epoch:[0/2](969900/4588595) loss:2.648 lr:0.0000100 epoch_Time:23054.0min: [2024-01-07 00:38:57,042][model8_pretrain.py][INFO] Epoch:[0/2](969900/4588595) loss:2.806 lr:0.0000100 epoch_Time:23054.0min: [2024-01-07 
00:39:47,634][model8_pretrain.py][INFO] Epoch:[0/2](970000/4588595) loss:3.318 lr:0.0000100 epoch_Time:23054.0min: [2024-01-07 00:39:47,634][model8_pretrain.py][INFO] Epoch:[0/2](970000/4588595) loss:2.751 lr:0.0000100 epoch_Time:23054.0min: [2024-01-07 00:39:47,634][model8_pretrain.py][INFO] Epoch:[0/2](970000/4588595) loss:2.742 lr:0.0000100 epoch_Time:23054.0min: [2024-01-07 00:39:47,634][model8_pretrain.py][INFO] Epoch:[0/2](970000/4588595) loss:2.875 lr:0.0000100 epoch_Time:23054.0min: [2024-01-07 00:39:47,634][model8_pretrain.py][INFO] Epoch:[0/2](970000/4588595) loss:2.650 lr:0.0000100 epoch_Time:23054.0min: [2024-01-07 00:39:47,634][model8_pretrain.py][INFO] Epoch:[0/2](970000/4588595) loss:2.555 lr:0.0000100 epoch_Time:23054.0min: [2024-01-07 00:39:47,634][model8_pretrain.py][INFO] Epoch:[0/2](970000/4588595) loss:2.842 lr:0.0000100 epoch_Time:23054.0min: [2024-01-07 00:39:47,635][model8_pretrain.py][INFO] Epoch:[0/2](970000/4588595) loss:2.653 lr:0.0000100 epoch_Time:23054.0min: [2024-01-07 00:40:24,565][model8_pretrain.py][INFO] Epoch:[0/2](970100/4588595) loss:3.061 lr:0.0000100 epoch_Time:23054.0min: [2024-01-07 00:40:24,565][model8_pretrain.py][INFO] Epoch:[0/2](970100/4588595) loss:2.538 lr:0.0000100 epoch_Time:23054.0min: [2024-01-07 00:40:24,565][model8_pretrain.py][INFO] Epoch:[0/2](970100/4588595) loss:2.763 lr:0.0000100 epoch_Time:23054.0min: [2024-01-07 00:40:24,565][model8_pretrain.py][INFO] Epoch:[0/2](970100/4588595) loss:2.792 lr:0.0000100 epoch_Time:23054.0min: [2024-01-07 00:40:24,566][model8_pretrain.py][INFO] Epoch:[0/2](970100/4588595) loss:2.177 lr:0.0000100 epoch_Time:23054.0min: [2024-01-07 00:40:24,566][model8_pretrain.py][INFO] Epoch:[0/2](970100/4588595) loss:2.074 lr:0.0000100 epoch_Time:23054.0min: [2024-01-07 00:40:24,566][model8_pretrain.py][INFO] Epoch:[0/2](970100/4588595) loss:2.796 lr:0.0000100 epoch_Time:23054.0min: [2024-01-07 00:40:24,566][model8_pretrain.py][INFO] Epoch:[0/2](970100/4588595) loss:2.493 lr:0.0000100 
epoch_Time:23054.0min: [2024-01-07 00:41:01,508][model8_pretrain.py][INFO] Epoch:[0/2](970200/4588595) loss:2.862 lr:0.0000100 epoch_Time:23053.0min: [2024-01-07 00:41:01,508][model8_pretrain.py][INFO] Epoch:[0/2](970200/4588595) loss:2.040 lr:0.0000100 epoch_Time:23053.0min: [2024-01-07 00:41:01,508][model8_pretrain.py][INFO] Epoch:[0/2](970200/4588595) loss:2.847 lr:0.0000100 epoch_Time:23053.0min: [2024-01-07 00:41:01,508][model8_pretrain.py][INFO] Epoch:[0/2](970200/4588595) loss:2.436 lr:0.0000100 epoch_Time:23053.0min: [2024-01-07 00:41:01,508][model8_pretrain.py][INFO] Epoch:[0/2](970200/4588595) loss:2.786 lr:0.0000100 epoch_Time:23053.0min: [2024-01-07 00:41:01,508][model8_pretrain.py][INFO] Epoch:[0/2](970200/4588595) loss:2.644 lr:0.0000100 epoch_Time:23053.0min: [2024-01-07 00:41:01,508][model8_pretrain.py][INFO] Epoch:[0/2](970200/4588595) loss:2.986 lr:0.0000100 epoch_Time:23053.0min: [2024-01-07 00:41:01,508][model8_pretrain.py][INFO] Epoch:[0/2](970200/4588595) loss:2.749 lr:0.0000100 epoch_Time:23053.0min: [2024-01-07 00:41:38,458][model8_pretrain.py][INFO] Epoch:[0/2](970300/4588595) loss:2.650 lr:0.0000100 epoch_Time:23053.0min: [2024-01-07 00:41:38,458][model8_pretrain.py][INFO] Epoch:[0/2](970300/4588595) loss:3.197 lr:0.0000100 epoch_Time:23053.0min: [2024-01-07 00:41:38,458][model8_pretrain.py][INFO] Epoch:[0/2](970300/4588595) loss:2.387 lr:0.0000100 epoch_Time:23053.0min: [2024-01-07 00:41:38,458][model8_pretrain.py][INFO] Epoch:[0/2](970300/4588595) loss:3.346 lr:0.0000100 epoch_Time:23053.0min: [2024-01-07 00:41:38,458][model8_pretrain.py][INFO] Epoch:[0/2](970300/4588595) loss:3.219 lr:0.0000100 epoch_Time:23053.0min: [2024-01-07 00:41:38,458][model8_pretrain.py][INFO] Epoch:[0/2](970300/4588595) loss:1.971 lr:0.0000100 epoch_Time:23053.0min: [2024-01-07 00:41:38,458][model8_pretrain.py][INFO] Epoch:[0/2](970300/4588595) loss:2.317 lr:0.0000100 epoch_Time:23053.0min: [2024-01-07 00:41:38,458][model8_pretrain.py][INFO] 
Epoch:[0/2](970300/4588595) loss:2.881 lr:0.0000100 epoch_Time:23053.0min: